HERV4_I
Basic information Differential Expression Stage analysis Survival analysis Correlation analysisDF ID | DF0000172 |
---|---|
TE superfamily | ERV1 |
TE class | LTR |
Species | Haplorrhini |
Length | 6539 |
Kimura value | 11.78 |
Tau index | 0.9815 |
Description | Internal region of an ERV1 endogenous retrovirus, HERV4 subfamily |
Comment | Associated long terminal repeat includes MER51A. |
Sequence |
TATTTTGGCGAGCCAGCCAGGAGGTAAGCCCAAAGTTTGGGATTTATTTTTCTCTTTTTCCTCTTTCTCTCTCTCTTTTCCTTTCCAACTCGGGACCCTCGGTGGACAGCGCCTAAGCACGGAGGCAACTGCAGGTTTCTGGCCGGGGCCACTCTCCGGTGAAACTGAAAGGTTTCCGTGTGGAAGCGCCTGACCGCCACCGCCCGGTTCGGGTGAGGGACCTGAGTCCTTTTCTTTTTCAGTCTTTCAGCGGCCGTTTCCTAGTAGCTCCTTGGTAATTGAGGGCAACTGGCCGGGGCCACTCTCCGGTGTTACCTGAAGGCCAAGGAGTGAACGGGGATAGCTGCCCTGCCCGGAAGGGGGAAGGACTCTTTTCTATCTTTTCCGGTTATAGTCCCTGATCCCTACGTGTGACGCAATTGGCAGCGGCAGCTCGTCCAGGGCGAACTCACACACGTTTCAGGCGACTTAAACCTTCTTTTCTTATGCTAAATTCTTCCCTTCCCCTACTCGACTGGCTAAGGACAAGTCAGAGGGTCCGGGCATGTCGTAGATGGTCTGTGTGAGTCATGGGGAGGGGATTCATGAAAGGGAATTTATGTACAATTTAATCTTGCCTAAATTTAGAGAGTTAAAGGATTGTTTTAAGTGGGATAGGAAAAAAAATCCAAAGGTTTGACTGAAAGTTAATTCTAGAAGTCGAGGCCTTCATCCAGGGACAAGAGGGAAAGCTCATAGTAGGTCATCAGTGGTGGAGGGAACCATTCCAAAGCGGTGCCGGCACCCATCTAAGGTCAGAGACGTCTGACAGACTAAGACGGGGCCCTAAAGGGGGGACGCCCCCGGGGACCCCAGTCNGGGCCCAGAATTTTTCCAGGGGGATGCCCCGGGTAAAATTTGGGTCACCTAATGAGCCCTCCACTTTTCAAAGTCCTCTTCTCTTTTCCAGACCACTATGGGCAACTCTCCATCTATTCCACCTGATTCCACTATGGGCAACTCTCCATCTATTCCACCTGATTCCCCGCTTGGCTGCATCCTCAACCATTGGAATCAATTTGACCCTGACAATCTAAGGAGAAAACGTNTGATTTTTTTCTGCAATACTGTTTGGCCCCANTATNAGCTGNNCAGCCAGGAACAATGGGCGGTCAATGGTAGCCTTAATTATGACACCATCCTGCAATTAGACCTATTTTGCAAGAGGCAGGGCAAATGGTCAGAAATCCCATATGTACAGGCCTTCATGGCCCTATACCAAAACCCAACAATCTGCAAAACTCCCAGAACCCGCCCCCCAAAGGAAAGTCCTAAGGCAGAACTAGATATTGTAGATGACCCCCTTTTACAAGGGCCACCTGTCTCTCAGGGNGAACAGCAACCGCCCCCATATAGCCCCTTGCCAAGTGCTCCTGAGGCTAAAACCCAGGAGCAAACACCGGGGACCCTACTAAGTCCCCCTCACACTCGGAGGGGAACACCNTATTCAACTCTCCCTCCAGCCCTGCTACCCCTTAGGGAAGTAGCAGGAGCCGAGGGGCCAGTCCGAGTGCAGGCCCCCTTCTCTATAACTGATATACAACAATGTAAGGAAAAGCTAGGAAGCTATTCTGAGAACCCCGGGAAATTTGCAGATGGGTTCCAAACTTTGACCTTAGCCTTTGATCTCTCATGGAGAGATGTTCAATTCATTCTAGCAACCTGTTGCACCCCCTCGGAAAAGGAACGAATCTTTGAGGCCGCCCGCCGGGAAGCGGACGANTTATTCGCCCGAAACCCTCAGGGCAATCACCCGGGCCCAGACACAGTCCCCACTACTGATCCTAATTGGGACTATAACACCCCCGTGGGAATGAACAACCGGGCTAAATTTCTTGAGGCTCTCCTTGGAGGAATGAGAAAGGGAATAACTAAGGCAGTAAATTATGATAAAGTAAGGGAGGTTACACAAGGCAAGGAGGAAAATCCAGCCATGTTTTATGGCAGGCTGGAGGAAGCCTTTAAAAAATATACNAATCTGGACCCTTCCTCTCCCGAAGGCAAAATATTAATGGCACAGCATTTCATTAGCCAATCCGCCCCGGACATTAGACGTAAGCTCCAAAAGCTACAGATGGGGCCACAAACTAATCAAAATCAGCTTCTTGATACCGCCTTTATGGTGTATAACAATCGTGACCTGGAGGAAGGAAAAAGGGAACAGAGTAAAGAAAAACGGCAAGCCAAAATTATGGCAGCCATCATTGGCGATGCCCTGAATGCCCAAAGAGCGTCCAAGGGAAACCCGAAGGGCCATAAGGATAATGCCAGCAAAGGCTCTTGCTTCAAGTGCAAGAAAAATGGGCATTGGGCAAAGGACTGTACTAAGCCCCCGCCAGGCCCCTGCCGTCAATGCGAAGGCACCAGTCACGACCCCTGGCACTGGAGAATTGACTGCCCCCGCTCCCACCGAGGGGCTCAGTCAGTCAAAACTCTAGCAGTGCAAAAGGAGGAATTAGATGAAGACTGAAGGGGCCCGGGGCCTTCCTCACCGCCCCTGTCCAGGAACATCGTNATTACTACTGAGGAGCCCCGGGTAACTCTGGACGTCATGGGCACCCAAATTCAGTTTCTTTTTGATACAGGGGCAAATTACTCTGTCCTTACTGCTTATGCAGGAAAACTTTCCTCCCGGTCCACGAGTGTTATGGGAATGGAAGGAAAGCCACAAACAAGATTCTTTACTCCTCCTTTGACTTGTCAATTTGAGAAACAAATCTTCCAACAGGAATTTCTAGTAGTACCAAGCTGCCCAGTCCCCCTGTTGGGAAGAGATATTATGGTTAAAATAGGGGCACTGCTACAATTTAAGCACCGCCCGGCGAAATTGCTAATAGTCAGNAATGCAGACAATGTCCCAGACCACGTTAATAAACAGGTCAACCCGCTGGCATGGTATACTGGGAAACCGGGGAAGGCTAAAACGGCAGTGCCAGTCAAAATACAGCTTAAAGACCCCAGCTATTTTCCCAATCGAAAACAATACCCAATTAAGCTGGAAGCAAGAAAAGGCCTAGCACCCATAGTTGAGGTATTACTTACCCATGGACTCTTAAAACCCTGCAATTCTCCCTGCAATACCCCCATCTTACCCGTTCTAAAGCCTTCGGGGGAATACCGGNTAGTACAGGACCTCAGAATAATTAATGAGGCTGTTATCCCCGTCCACCCATTGGTGGCGGATCCATATACCCTCCTGGCTCAGGTGCCAGGGGATGCAAAATGGTTCTCAGTCCTAGACCTAAAAGATGCTTTCTTCTCCATTCCTCTGGCCCCAGAGTCCCAATACCTTTTTGCCTTTGAATGGGAAAATCCTAATACCAGAGAAAAACAACAATACACTTGGACAGTGCTCCCTCAGGGCTTTCGGGATAGCCCCCATTTCTTTGCCCGAGCCTTAGAGAGGGATCTGAGGGATCTGCAATTGGAGAATGGGAGTATACTCCAGTATGTGGATGACCTTCTTGTGTGTAGCCCAACCCAGGAGGCTTCTGACCAAAATACTATAAAAACTTTGAATTTCCTGGCAGACAGGGGATACAAAGTGTCCAAAAAGAAGGCTCAGATTACCCTCCAACGGGTCCAATATTTAGGGTATGTCTTAACACCCGGAGCCCGGCAAATATCCCCAGAACGAGTGCAAGCCATATGTGGTTTGGGGCCCCCCCACACCAAGCAGCAGCTTCGTTCTTTTTNGGGAATGGCCGGGTTTTGCAGAATATGGGTACCAAATTTTGGGCTCATAGCAAAGCCCCTNTATGAAGCAACAAGGGGGCCTGAAAATGAGCTAATGGAATGGACCCCGGAAATGAGGGAAGCCTTCGCCAAGTTAAAACAGGCTCTCACCCAGGCTCCCGCTCTTGGCATCCCAGACCTNACTAAGCCCTTCTCCTTGTATGTAGCAGAGAAGAAGGGCATAGCTGTGGGAGTGCTAGCCCAGAAATTAGGATCAGAACCCAGACCAACCGCCTACTTTTCAAAGAAGTTGGACGGAGTGGCCTCGGGGTGGCCAAGCTGCCTGCGGGCAATAGCAGCCACTGCTATTTTAGTGGAGGAAGCCACTAAAATCACCCTGGGTCAACCACTGGAAGTTCTAACCCCNCATCAGGTAAAGTCAGTCTTAGAGATAAAAGGACACATCTGGATGACGGGGGAAAGGTTAACCAAATACCAGGCCATGCTCCTAGACAATCCAGATGTAACCCTTAAAACCTGTAACACCTTGAATCCAGCTTCATTGCTGCCCACAGGCCCAATAACTGATCATTCCTGCGAGCAGGTCATCGCACACACATATGTTAGCCGGCCTGATTTAAAAGATCAGCCTCTCCCAGATTCTGAGGATGACTGGTTCACAGACGGCAGTAGTTTTGTGTCAAATGGGGAGCGCCGAGCTGGATATGCAATAGTAAATCACAACACCATTATTGAAGCCCAGCCACTGCCCCCTGGCACATCAGCACAAAAGGCTGAAATCATTGCTCTTACCCGAGCATTAATGTTGGGACAAGGGAAAAAGCTTAACATCTATACAGATTCTAAATATGCATTCCTTGTGGTTCATGCTCATGCTGCAATCTGGAAAGAAAGGGGACTACTAACTAGCAAACACTCCCCCATAAAGCATGGGCCTGAAATTCTTCAGCTATTGGAAGCAATACACCTGCCAAAGGCCGTAGCTATAATCCATTGTAGGGGGCATCAAAGGGACTTAACCCCTATAGCACAAGGGAACAGAAAGGCTGATAGAGAAGCCAAAGCCGCAGCCCTCAGGGTGCAATCCCAACAGATCCTAGCACTGCTTCCTTTCTATGATTCCCCAATAGAACCTGAATACACACCACAGGAAGAACAGTTAATAAAGGAGCAAGGGGGACAAAAACAAGGATCCTGGTGGTATATGGGATCAAAAGTATATCTCCCTCAAACAGCCCAATGGAGAGTTATAAAAACCCTGCATGACTCTTTCCATATGGGGAGAGATGCCACCCTGGCCATGGTAAACAGGCTCTTCATTGGGCCTAACTTAGCTTCGGTGGTTAAGCAGGTCTGTCAAGCCTGCTCACTGTGTGCACTTAACAACCCAGGAAACAAAATGCCTCCTCTAATAGAACCAGTCCAGAGGAGAGGAACTTACCCAGGGGAAGACTGGCAATTAGACTTCACCCATATGCCAGCTTGCAGAGGATACAAGTTTTTGCTAGTACTAATAGACACCTTTACTGGCTGGGTCGAAGCTTACCCTACCAGAACAGAGAAGGCTAATGAGGTTATAAAGGTTCTCTTAAAGGAAATAATCCCCCGGTTTGGGTTACCCCAGAGCCTCCAAAGTGATAACGGCCCGTCCTTTATCTCCCAAATAACTCAAGGGGTTGCTAAGGCTCTCGGAATCAAATACTATTTACATTCAGCATGGAGGCCTCAATCCTCCGGGAAAGTAGAAAGGGCTAATCAAACTCTAAAACGGGCGTTAGCTAAGCTATGTCAGGAAACATCAGAAACTTGGGTCAGCTTACTGCCCATAGCCCTCTTAAGGATCCGTAATNCCCCTAGAGCAAAAATTAATATGAGCCCATATGAAATGTTATACGGAAGGCCATTTTTAACTAATGATCTAATTACTGATCCAGAAACAGCCGGTTTAGTAAAATACCTAGTTAACCTGGGACAATTTCAGCAGGCTTTACAAAAGTTTGGAACTCAAAGGCTCCCCACACCGGGAACTAACCAGCAACCCAAAATCAGGCCAGGAGATAAGGTACTTGTTAAAACATGGAAGGAGGGATCACCTGCTCAACAATTACAACCCAAATGGAAGGGACCGTTTTCAGTGGTACTGGCCACGCCTTCTGCGGTCAAAGTACTAGGATTAGATAGTTGGATACATCTTTCCAGGGTCAAGCCTGCGATACCTGAAGCCCCGGACCTGGAACCTGAAGCTCCCATCAGCCACTACACCTGTGAACCTGTGGAAGACCTGAAGTACCTGTTTAAAAGACAGCCAAAAGATAAGTAAATGCCTACCAACTTTCCTTGGTGTCTTTGTTGCATAGTTACTGTAGGCTGGATAATAGTAGCCATTTTTANNNNTATTTTTGCAGTTTAATTGCCTTCTTCCAAACGGATGGAATCACTTCCTTTGTAATAATTAAGCAGAATGTTTTAATTCATCTCTATAACAAACATTCCTGACAGCATAGGTATCCACCCCCTGAAGTTCCCATTAAATCTTTTAACCAAATTCATTTCCTCTCGCCTAGAGACCATCAAGCTTCAGATGATCATGCGACAAGGNTTCCAGCCAGTTCCAGGTGAAGACACCACCCCTGGCCATCAAGAAGCTACCCTGTCTCCACTAGACAGAGCAGGGCGAGAGTTCCGTGATCCCCAATAGGTAGGGACTACGCCCCAAGTCAGCATGAAGCAGTTACAGAAGAAAGACCATCGGTCCCTCTGCCTCCCATAAAGATTTATGGGGATCACGTCTCTCAGGGGGGAGA
|
TF motifs of the concenus sequence
Use FIMO to detect transcription factor motifs in the concenus sequence of the TE family.
TE_family | TFBS | Start | End | Strand | Score | Matched sequence |
---|---|---|---|---|---|---|
HERV4_I | BPC6 | 61 | 81 | + | 19.58 | CTCTTTCTCTCTCTCTTTTCC |
HERV4_I | RAMOSA1 | 66 | 79 | - | 19.57 | AAAAGAGAGAGAGA |
HERV4_I | INSM1 | 4482 | 4493 | - | 18.94 | TGCCAGGGGGCA |
HERV4_I | TCF7L1 | 1659 | 1670 | - | 18.91 | AGAGATCAAAGG |
HERV4_I | CDF5 | 50 | 70 | + | 18.88 | TTCTCTTTTTCCTCTTTCTCT |
HERV4_I | ZNF343 | 1742 | 1757 | - | 18.81 | CCGCTTCCCGGCGGGC |
HERV4_I | DOF3.6 | 50 | 70 | + | 18.79 | TTCTCTTTTTCCTCTTTCTCT |
HERV4_I | Irf1 | 160 | 170 | + | 18.30 | TGAAACTGAAA |
HERV4_I | BPC6 | 55 | 75 | + | 18.25 | TTTTTCCTCTTTCTCTCTCTC |
HERV4_I | ZBTB18 | 4175 | 4185 | - | 18.20 | ATCCAGATGTG |
TFBS enrichment in GRCh38
Use Fisher's exact test to perform enrichment analysis of transcription factor binding sites in the TE family of GRCh38.