HERV4_I
Basic information Differential Expression Stage analysis Survival analysis Correlation analysisDF ID | DF0000172 |
---|---|
TE superfamily | ERV1 |
TE class | LTR |
Species | Haplorrhini |
Length | 6539 |
Kimura value | 11.78 |
Tau index | 0.9815 |
Description | Internal region of an ERV1 endogenous retrovirus, HERV4 subfamily |
Comment | Associated long terminal repeat includes MER51A. |
Sequence |
TATTTTGGCGAGCCAGCCAGGAGGTAAGCCCAAAGTTTGGGATTTATTTTTCTCTTTTTCCTCTTTCTCTCTCTCTTTTCCTTTCCAACTCGGGACCCTCGGTGGACAGCGCCTAAGCACGGAGGCAACTGCAGGTTTCTGGCCGGGGCCACTCTCCGGTGAAACTGAAAGGTTTCCGTGTGGAAGCGCCTGACCGCCACCGCCCGGTTCGGGTGAGGGACCTGAGTCCTTTTCTTTTTCAGTCTTTCAGCGGCCGTTTCCTAGTAGCTCCTTGGTAATTGAGGGCAACTGGCCGGGGCCACTCTCCGGTGTTACCTGAAGGCCAAGGAGTGAACGGGGATAGCTGCCCTGCCCGGAAGGGGGAAGGACTCTTTTCTATCTTTTCCGGTTATAGTCCCTGATCCCTACGTGTGACGCAATTGGCAGCGGCAGCTCGTCCAGGGCGAACTCACACACGTTTCAGGCGACTTAAACCTTCTTTTCTTATGCTAAATTCTTCCCTTCCCCTACTCGACTGGCTAAGGACAAGTCAGAGGGTCCGGGCATGTCGTAGATGGTCTGTGTGAGTCATGGGGAGGGGATTCATGAAAGGGAATTTATGTACAATTTAATCTTGCCTAAATTTAGAGAGTTAAAGGATTGTTTTAAGTGGGATAGGAAAAAAAATCCAAAGGTTTGACTGAAAGTTAATTCTAGAAGTCGAGGCCTTCATCCAGGGACAAGAGGGAAAGCTCATAGTAGGTCATCAGTGGTGGAGGGAACCATTCCAAAGCGGTGCCGGCACCCATCTAAGGTCAGAGACGTCTGACAGACTAAGACGGGGCCCTAAAGGGGGGACGCCCCCGGGGACCCCAGTCNGGGCCCAGAATTTTTCCAGGGGGATGCCCCGGGTAAAATTTGGGTCACCTAATGAGCCCTCCACTTTTCAAAGTCCTCTTCTCTTTTCCAGACCACTATGGGCAACTCTCCATCTATTCCACCTGATTCCACTATGGGCAACTCTCCATCTATTCCACCTGATTCCCCGCTTGGCTGCATCCTCAACCATTGGAATCAATTTGACCCTGACAATCTAAGGAGAAAACGTNTGATTTTTTTCTGCAATACTGTTTGGCCCCANTATNAGCTGNNCAGCCAGGAACAATGGGCGGTCAATGGTAGCCTTAATTATGACACCATCCTGCAATTAGACCTATTTTGCAAGAGGCAGGGCAAATGGTCAGAAATCCCATATGTACAGGCCTTCATGGCCCTATACCAAAACCCAACAATCTGCAAAACTCCCAGAACCCGCCCCCCAAAGGAAAGTCCTAAGGCAGAACTAGATATTGTAGATGACCCCCTTTTACAAGGGCCACCTGTCTCTCAGGGNGAACAGCAACCGCCCCCATATAGCCCCTTGCCAAGTGCTCCTGAGGCTAAAACCCAGGAGCAAACACCGGGGACCCTACTAAGTCCCCCTCACACTCGGAGGGGAACACCNTATTCAACTCTCCCTCCAGCCCTGCTACCCCTTAGGGAAGTAGCAGGAGCCGAGGGGCCAGTCCGAGTGCAGGCCCCCTTCTCTATAACTGATATACAACAATGTAAGGAAAAGCTAGGAAGCTATTCTGAGAACCCCGGGAAATTTGCAGATGGGTTCCAAACTTTGACCTTAGCCTTTGATCTCTCATGGAGAGATGTTCAATTCATTCTAGCAACCTGTTGCACCCCCTCGGAAAAGGAACGAATCTTTGAGGCCGCCCGCCGGGAAGCGGACGANTTATTCGCCCGAAACCCTCAGGGCAATCACCCGGGCCCAGACACAGTCCCCACTACTGATCCTAATTGGGACTATAACACCCCCGTGGGAATGAACAACCGGGCTAAATTTCTTGAGGCTCTCCTTGGAGGAATGAGAAAGGGAATAACTAAGGCAGTAAATTATGATAAAGTAAGGGAGGTTACACAAGGCAAGGAGGAAAATCCAGCCATGTTTTATGGCAGGCTGGAGGAAGCCTTTAAAAAATATACNAATCTGGACCCTTCCTCTCCCGAAGGCAAAATATTAATGGCACAGCATTTCATTAGCCAATCCGCCCCGGACATTAGACGTAAGCTCCAAAAGCTACAGATGGGGCCACAAACTAATCAAAATCAGCTTCTTGATACCGCCTTTATGGTGTATAACAATCGTGACCTGGAGGAAGGAAAAAGGGAACAGAGTAAAGAAAAACGGCAAGCCAAAATTATGGCAGCCATCATTGGCGATGCCCTGAATGCCCAAAGAGCGTCCAAGGGAAACCCGAAGGGCCATAAGGATAATGCCAGCAAAGGCTCTTGCTTCAAGTGCAAGAAAAATGGGCATTGGGCAAAGGACTGTACTAAGCCCCCGCCAGGCCCCTGCCGTCAATGCGAAGGCACCAGTCACGACCCCTGGCACTGGAGAATTGACTGCCCCCGCTCCCACCGAGGGGCTCAGTCAGTCAAAACTCTAGCAGTGCAAAAGGAGGAATTAGATGAAGACTGAAGGGGCCCGGGGCCTTCCTCACCGCCCCTGTCCAGGAACATCGTNATTACTACTGAGGAGCCCCGGGTAACTCTGGACGTCATGGGCACCCAAATTCAGTTTCTTTTTGATACAGGGGCAAATTACTCTGTCCTTACTGCTTATGCAGGAAAACTTTCCTCCCGGTCCACGAGTGTTATGGGAATGGAAGGAAAGCCACAAACAAGATTCTTTACTCCTCCTTTGACTTGTCAATTTGAGAAACAAATCTTCCAACAGGAATTTCTAGTAGTACCAAGCTGCCCAGTCCCCCTGTTGGGAAGAGATATTATGGTTAAAATAGGGGCACTGCTACAATTTAAGCACCGCCCGGCGAAATTGCTAATAGTCAGNAATGCAGACAATGTCCCAGACCACGTTAATAAACAGGTCAACCCGCTGGCATGGTATACTGGGAAACCGGGGAAGGCTAAAACGGCAGTGCCAGTCAAAATACAGCTTAAAGACCCCAGCTATTTTCCCAATCGAAAACAATACCCAATTAAGCTGGAAGCAAGAAAAGGCCTAGCACCCATAGTTGAGGTATTACTTACCCATGGACTCTTAAAACCCTGCAATTCTCCCTGCAATACCCCCATCTTACCCGTTCTAAAGCCTTCGGGGGAATACCGGNTAGTACAGGACCTCAGAATAATTAATGAGGCTGTTATCCCCGTCCACCCATTGGTGGCGGATCCATATACCCTCCTGGCTCAGGTGCCAGGGGATGCAAAATGGTTCTCAGTCCTAGACCTAAAAGATGCTTTCTTCTCCATTCCTCTGGCCCCAGAGTCCCAATACCTTTTTGCCTTTGAATGGGAAAATCCTAATACCAGAGAAAAACAACAATACACTTGGACAGTGCTCCCTCAGGGCTTTCGGGATAGCCCCCATTTCTTTGCCCGAGCCTTAGAGAGGGATCTGAGGGATCTGCAATTGGAGAATGGGAGTATACTCCAGTATGTGGATGACCTTCTTGTGTGTAGCCCAACCCAGGAGGCTTCTGACCAAAATACTATAAAAACTTTGAATTTCCTGGCAGACAGGGGATACAAAGTGTCCAAAAAGAAGGCTCAGATTACCCTCCAACGGGTCCAATATTTAGGGTATGTCTTAACACCCGGAGCCCGGCAAATATCCCCAGAACGAGTGCAAGCCATATGTGGTTTGGGGCCCCCCCACACCAAGCAGCAGCTTCGTTCTTTTTNGGGAATGGCCGGGTTTTGCAGAATATGGGTACCAAATTTTGGGCTCATAGCAAAGCCCCTNTATGAAGCAACAAGGGGGCCTGAAAATGAGCTAATGGAATGGACCCCGGAAATGAGGGAAGCCTTCGCCAAGTTAAAACAGGCTCTCACCCAGGCTCCCGCTCTTGGCATCCCAGACCTNACTAAGCCCTTCTCCTTGTATGTAGCAGAGAAGAAGGGCATAGCTGTGGGAGTGCTAGCCCAGAAATTAGGATCAGAACCCAGACCAACCGCCTACTTTTCAAAGAAGTTGGACGGAGTGGCCTCGGGGTGGCCAAGCTGCCTGCGGGCAATAGCAGCCACTGCTATTTTAGTGGAGGAAGCCACTAAAATCACCCTGGGTCAACCACTGGAAGTTCTAACCCCNCATCAGGTAAAGTCAGTCTTAGAGATAAAAGGACACATCTGGATGACGGGGGAAAGGTTAACCAAATACCAGGCCATGCTCCTAGACAATCCAGATGTAACCCTTAAAACCTGTAACACCTTGAATCCAGCTTCATTGCTGCCCACAGGCCCAATAACTGATCATTCCTGCGAGCAGGTCATCGCACACACATATGTTAGCCGGCCTGATTTAAAAGATCAGCCTCTCCCAGATTCTGAGGATGACTGGTTCACAGACGGCAGTAGTTTTGTGTCAAATGGGGAGCGCCGAGCTGGATATGCAATAGTAAATCACAACACCATTATTGAAGCCCAGCCACTGCCCCCTGGCACATCAGCACAAAAGGCTGAAATCATTGCTCTTACCCGAGCATTAATGTTGGGACAAGGGAAAAAGCTTAACATCTATACAGATTCTAAATATGCATTCCTTGTGGTTCATGCTCATGCTGCAATCTGGAAAGAAAGGGGACTACTAACTAGCAAACACTCCCCCATAAAGCATGGGCCTGAAATTCTTCAGCTATTGGAAGCAATACACCTGCCAAAGGCCGTAGCTATAATCCATTGTAGGGGGCATCAAAGGGACTTAACCCCTATAGCACAAGGGAACAGAAAGGCTGATAGAGAAGCCAAAGCCGCAGCCCTCAGGGTGCAATCCCAACAGATCCTAGCACTGCTTCCTTTCTATGATTCCCCAATAGAACCTGAATACACACCACAGGAAGAACAGTTAATAAAGGAGCAAGGGGGACAAAAACAAGGATCCTGGTGGTATATGGGATCAAAAGTATATCTCCCTCAAACAGCCCAATGGAGAGTTATAAAAACCCTGCATGACTCTTTCCATATGGGGAGAGATGCCACCCTGGCCATGGTAAACAGGCTCTTCATTGGGCCTAACTTAGCTTCGGTGGTTAAGCAGGTCTGTCAAGCCTGCTCACTGTGTGCACTTAACAACCCAGGAAACAAAATGCCTCCTCTAATAGAACCAGTCCAGAGGAGAGGAACTTACCCAGGGGAAGACTGGCAATTAGACTTCACCCATATGCCAGCTTGCAGAGGATACAAGTTTTTGCTAGTACTAATAGACACCTTTACTGGCTGGGTCGAAGCTTACCCTACCAGAACAGAGAAGGCTAATGAGGTTATAAAGGTTCTCTTAAAGGAAATAATCCCCCGGTTTGGGTTACCCCAGAGCCTCCAAAGTGATAACGGCCCGTCCTTTATCTCCCAAATAACTCAAGGGGTTGCTAAGGCTCTCGGAATCAAATACTATTTACATTCAGCATGGAGGCCTCAATCCTCCGGGAAAGTAGAAAGGGCTAATCAAACTCTAAAACGGGCGTTAGCTAAGCTATGTCAGGAAACATCAGAAACTTGGGTCAGCTTACTGCCCATAGCCCTCTTAAGGATCCGTAATNCCCCTAGAGCAAAAATTAATATGAGCCCATATGAAATGTTATACGGAAGGCCATTTTTAACTAATGATCTAATTACTGATCCAGAAACAGCCGGTTTAGTAAAATACCTAGTTAACCTGGGACAATTTCAGCAGGCTTTACAAAAGTTTGGAACTCAAAGGCTCCCCACACCGGGAACTAACCAGCAACCCAAAATCAGGCCAGGAGATAAGGTACTTGTTAAAACATGGAAGGAGGGATCACCTGCTCAACAATTACAACCCAAATGGAAGGGACCGTTTTCAGTGGTACTGGCCACGCCTTCTGCGGTCAAAGTACTAGGATTAGATAGTTGGATACATCTTTCCAGGGTCAAGCCTGCGATACCTGAAGCCCCGGACCTGGAACCTGAAGCTCCCATCAGCCACTACACCTGTGAACCTGTGGAAGACCTGAAGTACCTGTTTAAAAGACAGCCAAAAGATAAGTAAATGCCTACCAACTTTCCTTGGTGTCTTTGTTGCATAGTTACTGTAGGCTGGATAATAGTAGCCATTTTTANNNNTATTTTTGCAGTTTAATTGCCTTCTTCCAAACGGATGGAATCACTTCCTTTGTAATAATTAAGCAGAATGTTTTAATTCATCTCTATAACAAACATTCCTGACAGCATAGGTATCCACCCCCTGAAGTTCCCATTAAATCTTTTAACCAAATTCATTTCCTCTCGCCTAGAGACCATCAAGCTTCAGATGATCATGCGACAAGGNTTCCAGCCAGTTCCAGGTGAAGACACCACCCCTGGCCATCAAGAAGCTACCCTGTCTCCACTAGACAGAGCAGGGCGAGAGTTCCGTGATCCCCAATAGGTAGGGACTACGCCCCAAGTCAGCATGAAGCAGTTACAGAAGAAAGACCATCGGTCCCTCTGCCTCCCATAAAGATTTATGGGGATCACGTCTCTCAGGGGGGAGA
|
TF motifs of the concenus sequence
Use FIMO to detect transcription factor motifs in the concenus sequence of the TE family.
TE_family | TFBS | Start | End | Strand | Score | Matched sequence |
---|---|---|---|---|---|---|
HERV4_I | BPC6 | 57 | 77 | + | 18.11 | TTTCCTCTTTCTCTCTCTCTT |
HERV4_I | NAC011 | 2319 | 2339 | + | 17.95 | CTTGCTTCAAGTGCAAGAAAA |
HERV4_I | INSM1 | 2410 | 2421 | - | 17.79 | TGCCAGGGGTCG |
HERV4_I | scrt | 1698 | 1708 | - | 17.64 | GCAACAGGTTG |
HERV4_I | DOF5.1 | 41 | 59 | - | 17.59 | AAAAAGAGAAAAATAAATC |
HERV4_I | TB1 | 3698 | 3706 | + | 17.38 | GGGCCCCCC |
HERV4_I | NFKB2 | 878 | 888 | - | 17.30 | GGGGCATCCCC |
HERV4_I | NFKB2 | 878 | 888 | + | 17.24 | GGGGATGCCCC |
HERV4_I | ZNF320 | 3306 | 3325 | - | 16.92 | TTGGGACTCTGGGGCCAGAG |
HERV4_I | BPC1 | 64 | 87 | - | 16.85 | TGGAAAGGAAAAGAGAGAGAGAAA |
TFBS enrichment in GRCh38
Use Fisher's exact test to perform enrichment analysis of transcription factor binding sites in the TE family of GRCh38.