HERV4_I

Basic information Differential Expression Stage analysis Survival analysis Correlation analysis

DF ID DF0000172
TE superfamily ERV1
TE class LTR
Species Haplorrhini
Length 6539
Kimura value 11.78
Tau index 0.9815
Description Internal region of an ERV1 endogenous retrovirus, HERV4 subfamily
Comment Associated long terminal repeat includes MER51A.
Sequence
TATTTTGGCGAGCCAGCCAGGAGGTAAGCCCAAAGTTTGGGATTTATTTTTCTCTTTTTCCTCTTTCTCTCTCTCTTTTCCTTTCCAACTCGGGACCCTCGGTGGACAGCGCCTAAGCACGGAGGCAACTGCAGGTTTCTGGCCGGGGCCACTCTCCGGTGAAACTGAAAGGTTTCCGTGTGGAAGCGCCTGACCGCCACCGCCCGGTTCGGGTGAGGGACCTGAGTCCTTTTCTTTTTCAGTCTTTCAGCGGCCGTTTCCTAGTAGCTCCTTGGTAATTGAGGGCAACTGGCCGGGGCCACTCTCCGGTGTTACCTGAAGGCCAAGGAGTGAACGGGGATAGCTGCCCTGCCCGGAAGGGGGAAGGACTCTTTTCTATCTTTTCCGGTTATAGTCCCTGATCCCTACGTGTGACGCAATTGGCAGCGGCAGCTCGTCCAGGGCGAACTCACACACGTTTCAGGCGACTTAAACCTTCTTTTCTTATGCTAAATTCTTCCCTTCCCCTACTCGACTGGCTAAGGACAAGTCAGAGGGTCCGGGCATGTCGTAGATGGTCTGTGTGAGTCATGGGGAGGGGATTCATGAAAGGGAATTTATGTACAATTTAATCTTGCCTAAATTTAGAGAGTTAAAGGATTGTTTTAAGTGGGATAGGAAAAAAAATCCAAAGGTTTGACTGAAAGTTAATTCTAGAAGTCGAGGCCTTCATCCAGGGACAAGAGGGAAAGCTCATAGTAGGTCATCAGTGGTGGAGGGAACCATTCCAAAGCGGTGCCGGCACCCATCTAAGGTCAGAGACGTCTGACAGACTAAGACGGGGCCCTAAAGGGGGGACGCCCCCGGGGACCCCAGTCNGGGCCCAGAATTTTTCCAGGGGGATGCCCCGGGTAAAATTTGGGTCACCTAATGAGCCCTCCACTTTTCAAAGTCCTCTTCTCTTTTCCAGACCACTATGGGCAACTCTCCATCTATTCCACCTGATTCCACTATGGGCAACTCTCCATCTATTCCACCTGATTCCCCGCTTGGCTGCATCCTCAACCATTGGAATCAATTTGACCCTGACAATCTAAGGAGAAAACGTNTGATTTTTTTCTGCAATACTGTTTGGCCCCANTATNAGCTGNNCAGCCAGGAACAATGGGCGGTCAATGGTAGCCTTAATTATGACACCATCCTGCAATTAGACCTATTTTGCAAGAGGCAGGGCAAATGGTCAGAAATCCCATATGTACAGGCCTTCATGGCCCTATACCAAAACCCAACAATCTGCAAAACTCCCAGAACCCGCCCCCCAAAGGAAAGTCCTAAGGCAGAACTAGATATTGTAGATGACCCCCTTTTACAAGGGCCACCTGTCTCTCAGGGNGAACAGCAACCGCCCCCATATAGCCCCTTGCCAAGTGCTCCTGAGGCTAAAACCCAGGAGCAAACACCGGGGACCCTACTAAGTCCCCCTCACACTCGGAGGGGAACACCNTATTCAACTCTCCCTCCAGCCCTGCTACCCCTTAGGGAAGTAGCAGGAGCCGAGGGGCCAGTCCGAGTGCAGGCCCCCTTCTCTATAACTGATATACAACAATGTAAGGAAAAGCTAGGAAGCTATTCTGAGAACCCCGGGAAATTTGCAGATGGGTTCCAAACTTTGACCTTAGCCTTTGATCTCTCATGGAGAGATGTTCAATTCATTCTAGCAACCTGTTGCACCCCCTCGGAAAAGGAACGAATCTTTGAGGCCGCCCGCCGGGAAGCGGACGANTTATTCGCCCGAAACCCTCAGGGCAATCACCCGGGCCCAGACACAGTCCCCACTACTGATCCTAATTGGGACTATAACACCCCCGTGGGAATGAACAACCGGGCTAAATTTCTTGAGGCTCTCCTTGGAGGAATGAGAAAGGGAATAACTAAGGCAGTAAATTATGATAAAGTAAGGGAGGTTACACAAGGCAAGGAGGAAAATCCAGCCATGTTTTATGGCAGGCTGGAGGAAGCCTTTAAAAAATATACNAATCTGGACCCTTCCTCTCCCGAAGGCAAAATATTAATGGCACAGCATTTCATTAGCCAATCCGCCCCGGACATTAGACGTAAGCTCCAAAAGCTACAGATGGGGCCACAAACTAATCAAAATCAGCTTCTTGATACCGCCTTTATGGTGTATAACAATCGTGACCTGGAGGAAGGAAAAAGGGAACAGAGTAAAGAAAAACGGCAAGCCAAAATTATGGCAGCCATCATTGGCGATGCCCTGAATGCCCAAAGAGCGTCCAAGGGAAACCCGAAGGGCCATAAGGATAATGCCAGCAAAGGCTCTTGCTTCAAGTGCAAGAAAAATGGGCATTGGGCAAAGGACTGTACTAAGCCCCCGCCAGGCCCCTGCCGTCAATGCGAAGGCACCAGTCACGACCCCTGGCACTGGAGAATTGACTGCCCCCGCTCCCACCGAGGGGCTCAGTCAGTCAAAACTCTAGCAGTGCAAAAGGAGGAATTAGATGAAGACTGAAGGGGCCCGGGGCCTTCCTCACCGCCCCTGTCCAGGAACATCGTNATTACTACTGAGGAGCCCCGGGTAACTCTGGACGTCATGGGCACCCAAATTCAGTTTCTTTTTGATACAGGGGCAAATTACTCTGTCCTTACTGCTTATGCAGGAAAACTTTCCTCCCGGTCCACGAGTGTTATGGGAATGGAAGGAAAGCCACAAACAAGATTCTTTACTCCTCCTTTGACTTGTCAATTTGAGAAACAAATCTTCCAACAGGAATTTCTAGTAGTACCAAGCTGCCCAGTCCCCCTGTTGGGAAGAGATATTATGGTTAAAATAGGGGCACTGCTACAATTTAAGCACCGCCCGGCGAAATTGCTAATAGTCAGNAATGCAGACAATGTCCCAGACCACGTTAATAAACAGGTCAACCCGCTGGCATGGTATACTGGGAAACCGGGGAAGGCTAAAACGGCAGTGCCAGTCAAAATACAGCTTAAAGACCCCAGCTATTTTCCCAATCGAAAACAATACCCAATTAAGCTGGAAGCAAGAAAAGGCCTAGCACCCATAGTTGAGGTATTACTTACCCATGGACTCTTAAAACCCTGCAATTCTCCCTGCAATACCCCCATCTTACCCGTTCTAAAGCCTTCGGGGGAATACCGGNTAGTACAGGACCTCAGAATAATTAATGAGGCTGTTATCCCCGTCCACCCATTGGTGGCGGATCCATATACCCTCCTGGCTCAGGTGCCAGGGGATGCAAAATGGTTCTCAGTCCTAGACCTAAAAGATGCTTTCTTCTCCATTCCTCTGGCCCCAGAGTCCCAATACCTTTTTGCCTTTGAATGGGAAAATCCTAATACCAGAGAAAAACAACAATACACTTGGACAGTGCTCCCTCAGGGCTTTCGGGATAGCCCCCATTTCTTTGCCCGAGCCTTAGAGAGGGATCTGAGGGATCTGCAATTGGAGAATGGGAGTATACTCCAGTATGTGGATGACCTTCTTGTGTGTAGCCCAACCCAGGAGGCTTCTGACCAAAATACTATAAAAACTTTGAATTTCCTGGCAGACAGGGGATACAAAGTGTCCAAAAAGAAGGCTCAGATTACCCTCCAACGGGTCCAATATTTAGGGTATGTCTTAACACCCGGAGCCCGGCAAATATCCCCAGAACGAGTGCAAGCCATATGTGGTTTGGGGCCCCCCCACACCAAGCAGCAGCTTCGTTCTTTTTNGGGAATGGCCGGGTTTTGCAGAATATGGGTACCAAATTTTGGGCTCATAGCAAAGCCCCTNTATGAAGCAACAAGGGGGCCTGAAAATGAGCTAATGGAATGGACCCCGGAAATGAGGGAAGCCTTCGCCAAGTTAAAACAGGCTCTCACCCAGGCTCCCGCTCTTGGCATCCCAGACCTNACTAAGCCCTTCTCCTTGTATGTAGCAGAGAAGAAGGGCATAGCTGTGGGAGTGCTAGCCCAGAAATTAGGATCAGAACCCAGACCAACCGCCTACTTTTCAAAGAAGTTGGACGGAGTGGCCTCGGGGTGGCCAAGCTGCCTGCGGGCAATAGCAGCCACTGCTATTTTAGTGGAGGAAGCCACTAAAATCACCCTGGGTCAACCACTGGAAGTTCTAACCCCNCATCAGGTAAAGTCAGTCTTAGAGATAAAAGGACACATCTGGATGACGGGGGAAAGGTTAACCAAATACCAGGCCATGCTCCTAGACAATCCAGATGTAACCCTTAAAACCTGTAACACCTTGAATCCAGCTTCATTGCTGCCCACAGGCCCAATAACTGATCATTCCTGCGAGCAGGTCATCGCACACACATATGTTAGCCGGCCTGATTTAAAAGATCAGCCTCTCCCAGATTCTGAGGATGACTGGTTCACAGACGGCAGTAGTTTTGTGTCAAATGGGGAGCGCCGAGCTGGATATGCAATAGTAAATCACAACACCATTATTGAAGCCCAGCCACTGCCCCCTGGCACATCAGCACAAAAGGCTGAAATCATTGCTCTTACCCGAGCATTAATGTTGGGACAAGGGAAAAAGCTTAACATCTATACAGATTCTAAATATGCATTCCTTGTGGTTCATGCTCATGCTGCAATCTGGAAAGAAAGGGGACTACTAACTAGCAAACACTCCCCCATAAAGCATGGGCCTGAAATTCTTCAGCTATTGGAAGCAATACACCTGCCAAAGGCCGTAGCTATAATCCATTGTAGGGGGCATCAAAGGGACTTAACCCCTATAGCACAAGGGAACAGAAAGGCTGATAGAGAAGCCAAAGCCGCAGCCCTCAGGGTGCAATCCCAACAGATCCTAGCACTGCTTCCTTTCTATGATTCCCCAATAGAACCTGAATACACACCACAGGAAGAACAGTTAATAAAGGAGCAAGGGGGACAAAAACAAGGATCCTGGTGGTATATGGGATCAAAAGTATATCTCCCTCAAACAGCCCAATGGAGAGTTATAAAAACCCTGCATGACTCTTTCCATATGGGGAGAGATGCCACCCTGGCCATGGTAAACAGGCTCTTCATTGGGCCTAACTTAGCTTCGGTGGTTAAGCAGGTCTGTCAAGCCTGCTCACTGTGTGCACTTAACAACCCAGGAAACAAAATGCCTCCTCTAATAGAACCAGTCCAGAGGAGAGGAACTTACCCAGGGGAAGACTGGCAATTAGACTTCACCCATATGCCAGCTTGCAGAGGATACAAGTTTTTGCTAGTACTAATAGACACCTTTACTGGCTGGGTCGAAGCTTACCCTACCAGAACAGAGAAGGCTAATGAGGTTATAAAGGTTCTCTTAAAGGAAATAATCCCCCGGTTTGGGTTACCCCAGAGCCTCCAAAGTGATAACGGCCCGTCCTTTATCTCCCAAATAACTCAAGGGGTTGCTAAGGCTCTCGGAATCAAATACTATTTACATTCAGCATGGAGGCCTCAATCCTCCGGGAAAGTAGAAAGGGCTAATCAAACTCTAAAACGGGCGTTAGCTAAGCTATGTCAGGAAACATCAGAAACTTGGGTCAGCTTACTGCCCATAGCCCTCTTAAGGATCCGTAATNCCCCTAGAGCAAAAATTAATATGAGCCCATATGAAATGTTATACGGAAGGCCATTTTTAACTAATGATCTAATTACTGATCCAGAAACAGCCGGTTTAGTAAAATACCTAGTTAACCTGGGACAATTTCAGCAGGCTTTACAAAAGTTTGGAACTCAAAGGCTCCCCACACCGGGAACTAACCAGCAACCCAAAATCAGGCCAGGAGATAAGGTACTTGTTAAAACATGGAAGGAGGGATCACCTGCTCAACAATTACAACCCAAATGGAAGGGACCGTTTTCAGTGGTACTGGCCACGCCTTCTGCGGTCAAAGTACTAGGATTAGATAGTTGGATACATCTTTCCAGGGTCAAGCCTGCGATACCTGAAGCCCCGGACCTGGAACCTGAAGCTCCCATCAGCCACTACACCTGTGAACCTGTGGAAGACCTGAAGTACCTGTTTAAAAGACAGCCAAAAGATAAGTAAATGCCTACCAACTTTCCTTGGTGTCTTTGTTGCATAGTTACTGTAGGCTGGATAATAGTAGCCATTTTTANNNNTATTTTTGCAGTTTAATTGCCTTCTTCCAAACGGATGGAATCACTTCCTTTGTAATAATTAAGCAGAATGTTTTAATTCATCTCTATAACAAACATTCCTGACAGCATAGGTATCCACCCCCTGAAGTTCCCATTAAATCTTTTAACCAAATTCATTTCCTCTCGCCTAGAGACCATCAAGCTTCAGATGATCATGCGACAAGGNTTCCAGCCAGTTCCAGGTGAAGACACCACCCCTGGCCATCAAGAAGCTACCCTGTCTCCACTAGACAGAGCAGGGCGAGAGTTCCGTGATCCCCAATAGGTAGGGACTACGCCCCAAGTCAGCATGAAGCAGTTACAGAAGAAAGACCATCGGTCCCTCTGCCTCCCATAAAGATTTATGGGGATCACGTCTCTCAGGGGGGAGA



TF motifs of the concenus sequence

Use FIMO to detect transcription factor motifs in the concenus sequence of the TE family.

TE_family TFBS Start End Strand Score Matched sequence
HERV4_I BPC5 65 94 - -29.73 CCCGAGTTGGAAAGGAAAAGAGAGAGAGAA
HERV4_I BPC5 59 88 - -31.15 TTGGAAAGGAAAAGAGAGAGAGAAAGAGGA
HERV4_I BPC5 47 76 - -32.26 AGAGAGAGAGAAAGAGGAAAAAGAGAAAAA
HERV4_I BPC5 51 80 - -33.99 GAAAAGAGAGAGAGAAAGAGGAAAAAGAGA
HERV4_I BPC5 220 249 - -45.76 TGAAAGACTGAAAAAGAAAAGGACTCAGGT
HERV4_I BPC5 1890 1919 + -46.34 GAGGAATGAGAAAGGGAATAACTAAGGCAG
HERV4_I BPC5 45 74 - -46.35 AGAGAGAGAAAGAGGAAAAAGAGAAAAATA
HERV4_I BPC5 67 96 - -47.91 GTCCCGAGTTGGAAAGGAAAAGAGAGAGAG
HERV4_I BPC5 43 72 - -49.23 AGAGAGAAAGAGGAAAAAGAGAAAAATAAA


TFBS enrichment in GRCh38

Use Fisher's exact test to perform enrichment analysis of transcription factor binding sites in the TE family of GRCh38.




GTEx

The promoter activity across 46 body sites from The Genotype-Tissue Expression (GTEx) project.




TCGA

The promoter activity across 33 cancer types from The Cancer Genome Atlas (TCGA).