HERV4_I
DF ID | DF0000172 |
---|---|
TE superfamily | ERV1 |
TE class | LTR |
Species | Haplorrhini |
Length | 6539 |
Kimura value | 11.78 |
Tau index | 0.9815 |
Description | Internal region of an ERV1 endogenous retrovirus, HERV4 subfamily |
Comment | Associated long terminal repeat includes MER51A. |
Sequence |
TATTTTGGCGAGCCAGCCAGGAGGTAAGCCCAAAGTTTGGGATTTATTTTTCTCTTTTTCCTCTTTCTCTCTCTCTTTTCCTTTCCAACTCGGGACCCTCGGTGGACAGCGCCTAAGCACGGAGGCAACTGCAGGTTTCTGGCCGGGGCCACTCTCCGGTGAAACTGAAAGGTTTCCGTGTGGAAGCGCCTGACCGCCACCGCCCGGTTCGGGTGAGGGACCTGAGTCCTTTTCTTTTTCAGTCTTTCAGCGGCCGTTTCCTAGTAGCTCCTTGGTAATTGAGGGCAACTGGCCGGGGCCACTCTCCGGTGTTACCTGAAGGCCAAGGAGTGAACGGGGATAGCTGCCCTGCCCGGAAGGGGGAAGGACTCTTTTCTATCTTTTCCGGTTATAGTCCCTGATCCCTACGTGTGACGCAATTGGCAGCGGCAGCTCGTCCAGGGCGAACTCACACACGTTTCAGGCGACTTAAACCTTCTTTTCTTATGCTAAATTCTTCCCTTCCCCTACTCGACTGGCTAAGGACAAGTCAGAGGGTCCGGGCATGTCGTAGATGGTCTGTGTGAGTCATGGGGAGGGGATTCATGAAAGGGAATTTATGTACAATTTAATCTTGCCTAAATTTAGAGAGTTAAAGGATTGTTTTAAGTGGGATAGGAAAAAAAATCCAAAGGTTTGACTGAAAGTTAATTCTAGAAGTCGAGGCCTTCATCCAGGGACAAGAGGGAAAGCTCATAGTAGGTCATCAGTGGTGGAGGGAACCATTCCAAAGCGGTGCCGGCACCCATCTAAGGTCAGAGACGTCTGACAGACTAAGACGGGGCCCTAAAGGGGGGACGCCCCCGGGGACCCCAGTCNGGGCCCAGAATTTTTCCAGGGGGATGCCCCGGGTAAAATTTGGGTCACCTAATGAGCCCTCCACTTTTCAAAGTCCTCTTCTCTTTTCCAGACCACTATGGGCAACTCTCCATCTATTCCACCTGATTCCACTATGGGCAACTCTCCATCTATTCCACCTGATTCCCCGCTTGGCTGCATCCTCAACCATTGGAATCAATTTGACCCTGACAATCTAAGGAGAAAACGTNTGATTTTTTTCTGCAATACTGTTTGGCCCCANTATNAGCTGNNCAGCCAGGAACAATGGGCGGTCAATGGTAGCCTTAATTATGACACCATCCTGCAATTAGACCTATTTTGCAAGAGGCAGGGCAAATGGTCAGAAATCCCATATGTACAGGCCTTCATGGCCCTATACCAAAACCCAACAATCTGCAAAACTCCCAGAACCCGCCCCCCAAAGGAAAGTCCTAAGGCAGAACTAGATATTGTAGATGACCCCCTTTTACAAGGGCCACCTGTCTCTCAGGGNGAACAGCAACCGCCCCCATATAGCCCCTTGCCAAGTGCTCCTGAGGCTAAAACCCAGGAGCAAACACCGGGGACCCTACTAAGTCCCCCTCACACTCGGAGGGGAACACCNTATTCAACTCTCCCTCCAGCCCTGCTACCCCTTAGGGAAGTAGCAGGAGCCGAGGGGCCAGTCCGAGTGCAGGCCCCCTTCTCTATAACTGATATACAACAATGTAAGGAAAAGCTAGGAAGCTATTCTGAGAACCCCGGGAAATTTGCAGATGGGTTCCAAACTTTGACCTTAGCCTTTGATCTCTCATGGAGAGATGTTCAATTCATTCTAGCAACCTGTTGCACCCCCTCGGAAAAGGAACGAATCTTTGAGGCCGCCCGCCGGGAAGCGGACGANTTATTCGCCCGAAACCCTCAGGGCAATCACCCGGGCCCAGACACAGTCCCCACTACTGATCCTAATTGGGACTATAACACCCCCGTGGGAATGAACAACCGGGCTAAATTTCTTGAGGCTCTCCTTGGAGGAATGAGAAAGGGAATAACTAAGGCAGTAAATTATGATAAAGTAAGGGAGGTTACACAAGGCAAGGAGGAAAATCCAGCCATGTTTTATGGCAGGCTGGAGGAAGCCT
TTAAAAAATATACNAATCTGGACCCTTCCTCTCCCGAAGGCAAAATATTAATGGCACAGCATTTCATTAGCCAATCCGCCCCGGACATTAGACGTAAGCTCCAAAAGCTACAGATGGGGCCACAAACTAATCAAAATCAGCTTCTTGATACCGCCTTTATGGTGTATAACAATCGTGACCTGGAGGAAGGAAAAAGGGAACAGAGTAAAGAAAAACGGCAAGCCAAAATTATGGCAGCCATCATTGGCGATGCCCTGAATGCCCAAAGAGCGTCCAAGGGAAACCCGAAGGGCCATAAGGATAATGCCAGCAAAGGCTCTTGCTTCAAGTGCAAGAAAAATGGGCATTGGGCAAAGGACTGTACTAAGCCCCCGCCAGGCCCCTGCCGTCAATGCGAAGGCACCAGTCACGACCCCTGGCACTGGAGAATTGACTGCCCCCGCTCCCACCGAGGGGCTCAGTCAGTCAAAACTCTAGCAGTGCAAAAGGAGGAATTAGATGAAGACTGAAGGGGCCCGGGGCCTTCCTCACCGCCCCTGTCCAGGAACATCGTNATTACTACTGAGGAGCCCCGGGTAACTCTGGACGTCATGGGCACCCAAATTCAGTTTCTTTTTGATACAGGGGCAAATTACTCTGTCCTTACTGCTTATGCAGGAAAACTTTCCTCCCGGTCCACGAGTGTTATGGGAATGGAAGGAAAGCCACAAACAAGATTCTTTACTCCTCCTTTGACTTGTCAATTTGAGAAACAAATCTTCCAACAGGAATTTCTAGTAGTACCAAGCTGCCCAGTCCCCCTGTTGGGAAGAGATATTATGGTTAAAATAGGGGCACTGCTACAATTTAAGCACCGCCCGGCGAAATTGCTAATAGTCAGNAATGCAGACAATGTCCCAGACCACGTTAATAAACAGGTCAACCCGCTGGCATGGTATACTGGGAAACCGGGGAAGGCTAAAACGGCAGTGCCAGTCAAAATACAGCTTAAAGACCCCAGCTATTTTCCCAATCGAAAACAATACCCAATTAAGCTGGAAGCAAGAAAAGGCCTAGCACCCATAGTTGAGGTATTACTTACCCATGGACTCTTAAAACCCTGCAATTCTCCCTGCAATACCCCCATCTTACCCGTTCTAAAGCCTTCGGGGGAATACCGGNTAGTACAGGACCTCAGAATAATTAATGAGGCTGTTATCCCCGTCCACCCATTGGTGGCGGATCCATATACCCTCCTGGCTCAGGTGCCAGGGGATGCAAAATGGTTCTCAGTCCTAGACCTAAAAGATGCTTTCTTCTCCATTCCTCTGGCCCCAGAGTCCCAATACCTTTTTGCCTTTGAATGGGAAAATCCTAATACCAGAGAAAAACAACAATACACTTGGACAGTGCTCCCTCAGGGCTTTCGGGATAGCCCCCATTTCTTTGCCCGAGCCTTAGAGAGGGATCTGAGGGATCTGCAATTGGAGAATGGGAGTATACTCCAGTATGTGGATGACCTTCTTGTGTGTAGCCCAACCCAGGAGGCTTCTGACCAAAATACTATAAAAACTTTGAATTTCCTGGCAGACAGGGGATACAAAGTGTCCAAAAAGAAGGCTCAGATTACCCTCCAACGGGTCCAATATTTAGGGTATGTCTTAACACCCGGAGCCCGGCAAATATCCCCAGAACGAGTGCAAGCCATATGTGGTTTGGGGCCCCCCCACACCAAGCAGCAGCTTCGTTCTTTTTNGGGAATGGCCGGGTTTTGCAGAATATGGGTACCAAATTTTGGGCTCATAGCAAAGCCCCTNTATGAAGCAACAAGGGGGCCTGAAAATGAGCTAATGGAATGGACCCCGGAAATGAGGGAAGCCTTCGCCAAGTTAAAACAGGCTCTCACCCAGGCTCCCGCTCTTGGCATCCCAGACCTNACTAAGCCCTTCTCCTTGTATGTAGCAGAGAAGAAGGGCATAGCTGTGGGAGTGCTAGCCCAGAAATTAGGATCAGAACCCAGA
CCAACCGCCTACTTTTCAAAGAAGTTGGACGGAGTGGCCTCGGGGTGGCCAAGCTGCCTGCGGGCAATAGCAGCCACTGCTATTTTAGTGGAGGAAGCCACTAAAATCACCCTGGGTCAACCACTGGAAGTTCTAACCCCNCATCAGGTAAAGTCAGTCTTAGAGATAAAAGGACACATCTGGATGACGGGGGAAAGGTTAACCAAATACCAGGCCATGCTCCTAGACAATCCAGATGTAACCCTTAAAACCTGTAACACCTTGAATCCAGCTTCATTGCTGCCCACAGGCCCAATAACTGATCATTCCTGCGAGCAGGTCATCGCACACACATATGTTAGCCGGCCTGATTTAAAAGATCAGCCTCTCCCAGATTCTGAGGATGACTGGTTCACAGACGGCAGTAGTTTTGTGTCAAATGGGGAGCGCCGAGCTGGATATGCAATAGTAAATCACAACACCATTATTGAAGCCCAGCCACTGCCCCCTGGCACATCAGCACAAAAGGCTGAAATCATTGCTCTTACCCGAGCATTAATGTTGGGACAAGGGAAAAAGCTTAACATCTATACAGATTCTAAATATGCATTCCTTGTGGTTCATGCTCATGCTGCAATCTGGAAAGAAAGGGGACTACTAACTAGCAAACACTCCCCCATAAAGCATGGGCCTGAAATTCTTCAGCTATTGGAAGCAATACACCTGCCAAAGGCCGTAGCTATAATCCATTGTAGGGGGCATCAAAGGGACTTAACCCCTATAGCACAAGGGAACAGAAAGGCTGATAGAGAAGCCAAAGCCGCAGCCCTCAGGGTGCAATCCCAACAGATCCTAGCACTGCTTCCTTTCTATGATTCCCCAATAGAACCTGAATACACACCACAGGAAGAACAGTTAATAAAGGAGCAAGGGGGACAAAAACAAGGATCCTGGTGGTATATGGGATCAAAAGTATATCTCCCTCAAACAGCCCAATGGAGAGTTATAAAAACCCTGCATGACTCTTTCCATATGGGGAGAGATGCCACCCTGGCCATGGTAAACAGGCTCTTCATTGGGCCTAACTTAGCTTCGGTGGTTAAGCAGGTCTGTCAAGCCTGCTCACTGTGTGCACTTAACAACCCAGGAAACAAAATGCCTCCTCTAATAGAACCAGTCCAGAGGAGAGGAACTTACCCAGGGGAAGACTGGCAATTAGACTTCACCCATATGCCAGCTTGCAGAGGATACAAGTTTTTGCTAGTACTAATAGACACCTTTACTGGCTGGGTCGAAGCTTACCCTACCAGAACAGAGAAGGCTAATGAGGTTATAAAGGTTCTCTTAAAGGAAATAATCCCCCGGTTTGGGTTACCCCAGAGCCTCCAAAGTGATAACGGCCCGTCCTTTATCTCCCAAATAACTCAAGGGGTTGCTAAGGCTCTCGGAATCAAATACTATTTACATTCAGCATGGAGGCCTCAATCCTCCGGGAAAGTAGAAAGGGCTAATCAAACTCTAAAACGGGCGTTAGCTAAGCTATGTCAGGAAACATCAGAAACTTGGGTCAGCTTACTGCCCATAGCCCTCTTAAGGATCCGTAATNCCCCTAGAGCAAAAATTAATATGAGCCCATATGAAATGTTATACGGAAGGCCATTTTTAACTAATGATCTAATTACTGATCCAGAAACAGCCGGTTTAGTAAAATACCTAGTTAACCTGGGACAATTTCAGCAGGCTTTACAAAAGTTTGGAACTCAAAGGCTCCCCACACCGGGAACTAACCAGCAACCCAAAATCAGGCCAGGAGATAAGGTACTTGTTAAAACATGGAAGGAGGGATCACCTGCTCAACAATTACAACCCAAATGGAAGGGACCGTTTTCAGTGGTACTGGCCACGCCTTCTGCGGTCAAAGTACTAGGATTAGATAGTTGGATACATCTTTCCAGGGTCAAGCCTGCGATACCTGAAGCCCCGGACCTGGAACCTGAAGCTCCCATCAGCCACTACACCTG
TGAACCTGTGGAAGACCTGAAGTACCTGTTTAAAAGACAGCCAAAAGATAAGTAAATGCCTACCAACTTTCCTTGGTGTCTTTGTTGCATAGTTACTGTAGGCTGGATAATAGTAGCCATTTTTANNNNTATTTTTGCAGTTTAATTGCCTTCTTCCAAACGGATGGAATCACTTCCTTTGTAATAATTAAGCAGAATGTTTTAATTCATCTCTATAACAAACATTCCTGACAGCATAGGTATCCACCCCCTGAAGTTCCCATTAAATCTTTTAACCAAATTCATTTCCTCTCGCCTAGAGACCATCAAGCTTCAGATGATCATGCGACAAGGNTTCCAGCCAGTTCCAGGTGAAGACACCACCCCTGGCCATCAAGAAGCTACCCTGTCTCCACTAGACAGAGCAGGGCGAGAGTTCCGTGATCCCCAATAGGTAGGGACTACGCCCCAAGTCAGCATGAAGCAGTTACAGAAGAAAGACCATCGGTCCCTCTGCCTCCCATAAAGATTTATGGGGATCACGTCTCTCAGGGGGGAGA
|
TF motifs of the consensus sequence
FIMO is used to detect transcription factor motifs in the consensus sequence of the TE family.
TE_family | TFBS | Start | End | Strand | Score | Matched sequence |
---|---|---|---|---|---|---|
HERV4_I | BPC1 | 60 | 83 | - | 23.59 | AAGGAAAAGAGAGAGAGAAAGAGG |
HERV4_I | BPC1 | 62 | 85 | - | 23.42 | GAAAGGAAAAGAGAGAGAGAAAGA |
HERV4_I | BPC6 | 59 | 79 | + | 22.92 | TCCTCTTTCTCTCTCTCTTTT |
HERV4_I | BPC1 | 56 | 79 | - | 21.47 | AAAAGAGAGAGAGAAAGAGGAAAA |
HERV4_I | ZNF282 | 2681 | 2695 | - | 20.67 | CATTCCCATAACACT |
HERV4_I | BPC1 | 58 | 81 | - | 20.03 | GGAAAAGAGAGAGAGAAAGAGGAA |
HERV4_I | DREB2F | 194 | 204 | + | 19.94 | CCGCCACCGCC |
HERV4_I | BPC1 | 54 | 77 | - | 19.80 | AAGAGAGAGAGAAAGAGGAAAAAG |
HERV4_I | RAMOSA1 | 64 | 77 | - | 19.61 | AAGAGAGAGAGAAA |
HERV4_I | POU4F2 | 3176 | 3190 | + | 19.59 | AGAATAATTAATGAG |
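
The coordinates in the table above can be checked against the consensus directly. The sketch below assumes that Start and End are 1-based, inclusive positions on the forward strand and that minus-strand hits are reported as the reverse complement of that window; this reading is consistent with the rows shown but is not stated on the page. `CONSENSUS_PREFIX` holds only the first 90 bases of the Sequence field, and the helper names are hypothetical.

```python
# First 90 bases of the HERV4_I consensus, copied from the Sequence field.
CONSENSUS_PREFIX = (
    "TATTTTGGCGAGCCAGCCAGGAGGTAAGCCCAAAGTTTGGGATTTATTTT"
    "TCTCTTTTTCCTCTTTCTCTCTCTCTTTTCCTTTCCAACT"
)

# Translation table for complementing DNA; N stays N.
_COMPLEMENT = str.maketrans("ACGTN", "TGCAN")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def matched_sequence(consensus: str, start: int, end: int, strand: str) -> str:
    """Extract a motif hit from the consensus.

    Assumes start/end are 1-based inclusive forward-strand coordinates
    (an assumption inferred from the table, not documented on the page).
    """
    window = consensus[start - 1:end]
    return window if strand == "+" else reverse_complement(window)


# Three rows from the table: (TFBS, Start, End, Strand, Matched sequence).
ROWS = [
    ("BPC6", 59, 79, "+", "TCCTCTTTCTCTCTCTCTTTT"),
    ("BPC1", 62, 85, "-", "GAAAGGAAAAGAGAGAGAGAAAGA"),
    ("RAMOSA1", 64, 77, "-", "AAGAGAGAGAGAAA"),
]

for name, start, end, strand, expected in ROWS:
    observed = matched_sequence(CONSENSUS_PREFIX, start, end, strand)
    print(name, start, end, strand, observed == expected)
```

All three rows reproduce the listed matched sequence under this coordinate convention. Hits further along the consensus, such as ZNF282 at 2681 to 2695, would need the full 6539-base sequence rather than the prefix used here.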
TFBS enrichment in GRCh38
Fisher's exact test is used to test whether transcription factor binding sites are enriched in copies of the TE family in GRCh38.
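
As an illustration of the kind of test named here, the sketch below computes a one-sided Fisher's exact p-value for a 2x2 table using only the Python standard library. The page does not say how the contingency table is built, so the layout (TE copies versus background regions, with and without a binding site) is an assumption, the function name is hypothetical, and the counts in the usage line are a textbook toy table rather than data from this database.

```python
from math import comb


def fisher_exact_greater(a: int, b: int, c: int, d: int) -> float:
    """One-sided Fisher's exact test (enrichment) for the table [[a, b], [c, d]].

    Assumed layout (hypothetical, not taken from the page):
        a = TE copies overlapping a binding site
        b = TE copies without a binding site
        c = background regions overlapping a binding site
        d = background regions without a binding site
    Returns P(X >= a) under the hypergeometric null with fixed margins.
    """
    row1 = a + b            # number of TE copies
    col1 = a + c            # number of regions with a binding site
    total = a + b + c + d
    k_max = min(row1, col1)  # largest possible value of the top-left cell
    # Sum hypergeometric terms for every table at least as extreme as observed.
    numerator = sum(
        comb(col1, k) * comb(total - col1, row1 - k)
        for k in range(a, k_max + 1)
    )
    return numerator / comb(total, row1)


# Toy 2x2 table; the exact p-value is 17/70.
print(fisher_exact_greater(3, 1, 1, 3))
```

A small p-value would indicate that binding sites overlap TE copies more often than expected from the margins alone. When many transcription factors are tested for one family, a multiple-testing correction would normally follow, though the page does not say which one is applied.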