HERV1_I
Basic information Differential Expression Stage analysis Survival analysis Correlation analysisDF ID | DF0000163 |
---|---|
TE superfamily | ERV1 |
TE class | LTR |
Species | Catarrhini |
Length | 8801 |
Kimura value | 13.19 |
Tau index | 0.0000 |
Description | Internal region of an ERV1 endogenous retrovirus, HERV1_I subfamily |
Comment | The associated LTRs are the HERV1_LTR variants. Target site duplications are 4bp. The HERV1_I consensus encodes gag, protease, reverse transcriptase, RNase H, integrase and envelope proteins. HERV1 elements in the genome are ~6% diverged from the HERV1_I consensus. |
Sequence |
TTTCTGGGGGCTCGTCCGGGATTGGAGACGGCAGGTTTCTGTCTCCTTTGCCTGTGGGCTGGAGCCCCGGGNCGCGGGAGACCCGGGACCCNAGGCGCCACCGGGNAAGACTTAGCCCGGAAGGAGANCGGCTCTCCCGCGTCCCGGNGCCCTCCCCCGGCAGCGCAAACGGAACCGANNGAGGGGCTGCAGGACGATCNCAGGAGCAGCGCGCAGNCAGNCCGCTGAACCGCGGTAAGGTTGGGCCCNAGGAAGGCCCGTCCCATAAGGACGGAAGGGGAGCCTGATCACCTCCCGGGGCGCGACNACTAGTCCGACCCAGAGGGGCTGGGGGCGGCGGGAGTGGCCCGCCGATTCGGATGAANCTCACGCCCCCACTACAAGCGAGAGTGGTTCACTGGGTCTGGAGACGGGAACTGGAGGTGTGTGGGTGCGTGCGAACCNACCCGGGACACGAGGGAGGCTCGTTTCATCCGATGAGGTGGGGNAGGAGTGGTGTGTGTATGTGTGTGAATGTGGGAGCCTAACTAGGCTCACCCGGGACACGAGAGAGGCTCGTTTCATCCGATGAGGAGTCCTGGGGCGGGGGAGGTGTGTGAAAGTGTGTGAAAGAGACGGTCTCGGGAGAGGCCAACGCGGGGAGTGACGTGGGGAGGCACAGATCTCTTAGCGCGGACTGTGTGCTCCGAGGCGAGTGTGGGANAAACCAGACCTAGGNCACTGCATACGGCCGATAGGACCAGCTCCACAGCTNCACAGCAGCAGTTGGCTGTGACCTGGCTAAGCAGCGTCCGAACCTCCCGTAATAGGACCCGGTCTGGTGGATCCGAGAGTGAAAGTGAGAGTGAAAGCGCGCCGCGAGGGAGGAAATGGGAGGAAAAGCATCGAAGCCNACTCCATTGGAGTGCATGCTGAAGAACTTTAAGAAAGGTTTTAATGGTGATTATGGGGTNAAGCTAACTCCNCAGAAGCTGAGAACNCTTTGTGAGATAGACTGGCCNTCTTTTAATGTAGGGTGGCCGGCCGAGGGNACNATAGACAGGGAAATAATTGGCCGNGTGTTTCGGGTGGTCACCGGGGTCGGAGAACAGCCNGGGCACCCGGATCAGTTTCCGTATATNGACTCCTGGCTAAGCGTAATTCAGACCCGCCCNAAGTGGCTGCAGGCCTGCTTTGAGGNNTACTGTAAGACTCTAGTGGCCCGGACAAAACNAGGAACCATAGAAAAGACCCGCAAGGCGCAGNCNCAAGAGAAGGAGTCGCAGGGAAAGCAGAAAAAACCTGTCCTACAGGCCCCGCCNGAAGAGTTAGAAANTCCACCCCCCTATGCNCCAATTTATCCATCTCTGGCAAGGCTTAGGCAGGAGGCCGCCCCGGCAGCTGCCTCCGGAGGNTCAGACTCAGAGGAGAGCACCCCTCAGGCNNCACCACGCAGGGAGGAGCCAGAGCCCCTGCCTGANAAGCCAAGGGAGGAACTCCAGGATGAGGTCGGCCGCCTCAGGTCAGGCCGCGCCCGAGCNATGCAGATGCCCCTCCGAGAAACNNGGGGACAAATTTATTTGGATGCACAGAATGAAGTCCAAGGGGGAGAACGGCTCTTCGTTTATCAGCCCTTCTCTACTACTGATCTCTTAAATTGGAGACAGCATACTCCCTCCTATACGGAGAAGCCCCAGGCTCTTATAGACCTAATGCAGTCCATCTTCCTAACTCACAACCCTACCTGGGCTGATTGCAAACAACTTCTTCTGTCATTGTTTAATACGGAAGAGCGCCGNAGAGTTATACAAGCGGCTCNCCAGTGGCTGGAGAGCAATGCGCCTGCAGGCACAGGAGATGTCAGGCAGTATGCACAACAGGCNCTCCCGATAGAGGCTGACCCAGGCTGGGACCCNAACCAGGCTCAAGGGCTACAAAGCTTGCAGNGGTATCGAGAGGCACTCCTAAATGGAATAAAGGCTGGAGGGAAAAAGGCAACGAATATCGGAAAGGTCTCAGAGGTCCGCCAGAAGCCAGATGAAAGTCCCAGTGAATTTTATGAGAGGCTCTGCGAGGCTTACCGGCTTTACACGCCATTTGACCCAGAGGCTGCAGGNAATCAGTGCATGGTTAATGCGGCATTTGTAAGCCAGGCGCAAGGNGACATNAAGCGAAAGCTTCAGAAGTTGGAAGGNTTTGAAGGTATGAATATTACCCAGCTTATCCAGGTGGCTACTAAGGTGTTTGTAAATCGGGATGAGGAGGCCAAGAGAGAAGCCAAGCGCAGAGCNAAGGAAAAGGCAGANTTGCTGGCNGCAGCCCTGGTTGGAAGAGAAACTGGNTTTGCGAGAGGACGTGGACGTGGTCGTGGATGCGGTCACGGTAGAGGACAAGCTAGGCCAGGCCAGGAGGCCAGGNCAGGNCAAGAGGGCCGGCCTAGGCTNGAGAGAGATCAATGTGCGAGATGCAAGCAGANAGGGCACTGGAAGGATGAATGTCCAGAGAGAGAAAAGGATAAAGGCAACAACCAGGGACAGAATGGCTGGCCAGGGCCCCCTNCNGCCGCCGGNCANGGCGTAGTAGGATCNGACGCGGATCTAATCGGGCTGGCAGGAGTCGATGATTATTNTGAGGACTGAGACAGACCGGGCTCCATCTCATTAGGCCCCGAGGAGCCTATGGTCTCAATGGAGGTAGGGGGCCGAAAAATGGACTTTATGGTNGATACTGGTGCTGAGCACTCGGTNGTGACTCAAGCAATTGGGCCGCTGTCTAAAAACTATGCCAATATNATTGGGGCTACAGGNGTCACAGAAAAGACGCCTTNCTTCAAATCNAAGAGATGTGTGATTGGAGGNCAAGAAGTCCAACACGAGTTTTTATATTTGCCAAATTGTCCGGTGCCCTTGTTAGGAAGAGACTTGCTCCAGAAACTGCAGGCNCAAATCTCCTTTACACCGAGAGGGGACATGACCCTAAACCTAGGTCAAAGAAAGGCCATGGTANTGACCCTTACCGTCCCNANAACAGAGGAATGGAGACTCTATGAGAGNAGTTGCNAGGAATNTGNAAAGANGCACANCGCAGCTGAGAAAGAGGNANTGTNTACGGANTTACTTCTCAAGCTGCCAGGGGTCTGGGCGGAGGACAATCCCCCGGGGCTAGCCGTAAATCAGGCACCCGTAGTGGTGGAGCTGCTGCGAGGNACCTACCCAGTGCGGATCCGTCAGTATCCCATTCCCGTAGAGGCCACCCANGGGATTACAAAACACTTAAANCGGCTCCTTGAATTTGGGATAATAGAAAGATGTGCCTCCTCNTGGAACACTCCNCTGCTGCCGGTGTTAAAGCCCTCTGGNGACTACCGGCCNGTACAGGATTTGCGGGCNGTAAACAAGGTCGCGGCTACACTGCATGCCATTGTGCCCAACCCGTACACNATGCTTGGGCGAATNCCTGCTGATGCTGCTTGGTTTACATGCTTGGACATNAAGGATGCGTTCTTCTGCATCCGACTAGCCCCTGNAAGCCAGGGCATCTTTGCCTTTGAGTGGGGCCCATCNCAGTATACCTGGACCAGACTCCCCCAAGGATTTAAAAACTCCCCAACCATCTTTGAGGAAGCACTAGCCTCAGACCTGAAGGCTTTCACGCCACCAAGTGACCGCTGTGTCCTGTTGCAGTACATAGATGATCTATTGTTGGCCGCACCCACAAGAGAAGAGTGCNTCCAAGGNACAGAGAGCCTCCTTCGNGTNCTGTGGGAGGCTGGCTATAAGGTGTCTAAGGAAAAGGCACAAATCTGTGGCCAAGGAGCNCGGTATCTTGGCTTTNACGTCTCCCAAGGGCAGCGTGAGCTTGGACGNGAGCGAAAAGAGACTGTNTGTAGCATTCCTCGGCCNGACACNAGGCGGCAAGTGCGGGAGTTCCTAGGGGCAGCTGGTTTCTGCCGCATTTGGATTCCAAACTACTCGCTCNTGGCAAAGCCNTTGTATGAGGCTACCAAAGNGGGGGAAAAGGAACCCCTCCTGTGGGGAAAAGAGCAGGACATGGCCTTCAAGGAAATCAAGAAGGCTTTGATCCAGGCCCCGGCATTAGGACTGCCAGACATGACAAAGCCTTTTTACCTGTATGTCCATGAAAGAAAAGGAATAGCTACAGGAGTCTTGGTACAAACGCTAGGGTCATGGTATCGGCCCGTGGCATATTTGTCCAAGCGACTAGACTTGGTGGCTATGGGATGGCCACCCTGTTTCAAGGCACTGGCNGCCACTGCCCTGTTAGCNGAAGATGCTAACAAGCTCACATTTGGACAGAGGTTGATAATTCGGGTGCCCCACACGGTCGTCACCCTGATGGAGCAGAGGGGGCATCGCTGGCTCTCTAACCCTAGGATGTTAAGATATCAAGGGCTCTTGTGTGAAAACCCNTACATAACCTTGGAGACTGTGAATACCCTAAATCCGGCCACACTGCTGCCAATAGAATGGGCGGAGCATGGAAAGCCCCCGTTGTGTGGCCCAGGGTATCACTGTTGTGTGGAAACAGTGGATGAAGTTTTCTCAAGCCGGAAAGACTTAAAGGACCAGCCCTTAAAAGACCCAGATGTTGAATACTTTACTGATGGAAGCAGCTTCATATCTGAGGGTGTCAGAAAGGCCGGATATGCAGTGGTNACACTGAACTCAGTAGCCGAAGCCCGCCCTCTGCCGGTCGGAACCTCGGCCCAAAGGGCNGAGCTAATAGCTCTCACNAGAGCACTGCTCCTGGCGAAAGGAAAGTCAGTAAACATCTATACTGACTCAAGGTATGCTTTTGCCACTTTGCATGCCCATGGAGCCATATATAAGGAAAGAGGATTATTAACTACTGAAGGAAAGGAAATCAAAAATAAAAAGGAAATAGAGCAGCTCTTAGAAGCCGTATGGGCTCCAAAAGAAGTAGCAGTCATCCATTGCAAAGGGCATCAAACAGGAGGAGGTGATGAGGCTAGAGGAAACAGAAAGGCGGACAGAGAAGCCAAAAGAGCTGCAATGACAGAGGTAACTAAGAAGGAAGAGACCCNTACCATGCCCTTACTGGAGCTTCCCCTTACAGAACCCCCTAACTACTCCTCTAATGAAAAGGCNTGGTTCGAGCAGGAGAGCGGAAGTTACCAGAAAGGAGGTTGGTGGAAGTTCTCAGATGGGAGGCTTGCCATCCCAGAAGCAATNGCCCCCCGGTTCATAAAGCAGTTTCATCAAGGAACGCATATGGGGAAAACNGCATTAGAGACTCTCGTAGGACGGCATTTCTATGTGCCGCGCCTAACTGCCATCACTCGAGCCGTTTGTGAGCAATGTTTNACTTGTGCCCAAAACAATCCANGGCAGGGGCCAACACGGCCCCCAGGGATTCAAGAAACTGGAGCNACGCCNTGTGAAAACCTGCTTGTGGACTTTACCGAGCTGCCTCGAGCCGGAGGCTACCGGTACATGCTAGTGTTTGTCTGCACTTTCTCAGGGTGGGTCGAGGCATTTCCCACCAGGACAGAGAAGGCTCGGGAAGTAACCAGGATCTTACTAAAGGACATTATTCCTAGATTTGGACTGCCTCTAACTTTAGGNTCAGACAACGGCCCAGCATTTGTGGCAGAAGTAGTACAGCAGCTAACGCAGATGTTAAAAATCAAATGGAAACTGCATACAGCCTATCGCCCACAGAGTTCTGGAAAAGTTGAAAGAATGAACCGGACACTNAAACAGCTGTTAAAGAAGTTTTGCCAAGAAACTCATCTAAGGTGGGATCAGGTGCTGCCCATGGTCCTTCTCCGAGTCAGGTGCACCCCTACTAAATTAACTGGGTATTCACCCTATGAGATAGTGTTCGGCCGACCACCCCCAATCATAACTCAGATAAAAGGGGATTTAAAAGAAATTGGGGAATTAACCTTAAGAAGGCAAATGCAAGCCTTAGGTGAGGCCATGCAGGAAATACAAGGGTGGGTAAGAGAAAGAATACCTGTTAGCCTCACAGATGCAGTACATCCCTTCCAACCTGGAGACTCTGTCTGGGTCAAACGATGGAACCCAACCACCTTCGGGCCTTTATGGGATGGCCCCCATATTGTGATCNTGTCTACCCCCACTGCTGTTAAAGTTGCAGGTATCACACCTTGGGTTCATCATAGCCGGCTGAAACCNGCAGCCNCAGCTCAGGACCAGTGGACCAGTCAACAAGACCCAGACCACCCGACNCGGCTGATCCTGCGGNGAAACCAAGCCGCNGCNGANAAGGACGACTGCCCTGCTCCGACCACACCGGAGGCTGGTCGGTCCACGCACGGCTGAAGCTTGAGGAAACATCAAGCCCTGCTCTAGTCACACAACTGGAAGCTGACTAGTCTACGCATGGCCGAAGCTTGAGGAAACGTCAAGCCCTGCTCTAGTCACACAACCGGAAGCTGACTAGTCTACGCACGGCCGAAGCCTGAGGAAGTCAACGNTAGATAAGTAAATGTGGATTGAATTTACAAGCGTAGTTATACTCTTACTTGTACTGATTGTTTTGCTGTCATGTTATCTTTGCAAATGCTGCCAAGCTTGTTGCCCAGAAGGGTGCCCGTGCATAGTATAAGCTTAATCATACTAGTAATACTGANGCTAACAGGCATGAAAGGGGACCAAGATGACTGTCATCACTGTATGATAGAAGCCTGGTCCGGAAAAGGTATGACTAAAACTCTGTTATACCAGACCTACTATGAGTGTACAGGGACTCATACGGGAACTTGTGTCTATAACCAGACTAGTTACTCGGTCTGTGATCCNGGAAACGGGCAGCCCCAAGTATGTTATGACCCAGAGTTCTTGCCCTATGACTTCTGGTTTGAAGTCCAAATTGGCGAACCCCTAATGCCATCATATACAAACCCCACAGAAACCGGGGTCGGTAAACTCGTAAACAAAACAGAGGTATTCCCTTACTCGCATAAAGGGCCTGTCTCCATATATTTTGATGCCTGCCAAGCTGCACATCTCAGCAAACTAAACAATATTGGGGCCGTCTGTAAAAATCTAGGACAAGAAAGAGTCAGCAGCAGAGCCGCCAAGGCCGTAACAGGAGAACCCGAAAAGGANTGCCCTGATTGTGACANTCAGTGGACCACACATGAGTTCAGCCAGCGCCTNTACGCAGGAAGAGTNGCTCTGCTTGCCAGCCAAGAGGCGAAGATNGGGTGCGCGACTGGAACATGCAACCCNCTCAATCTGACNATACTAAAGCCAAATATGCCTTTCTGGACTAAAGGGCATAAAGGAGNGCTAANCTTTGATCGGGAAGGAGCAAACCTAGGTATTCCNCTAGTCATTACNAAGAAGACCCAANGGGCCNAAGTTCAAGTTAGCCCAATGCAACAGTTCAGGTTTTNTAAATCCTTCAATGAACACTTTAACCCCGAGGNACCAAAAGTTCAAATTCCNCCNATATCAGCTGAAAACCTGTTCGCTCAGCTAGCCGAAAGTATTGCTANTAATCTNGGAGTCACCTCATGTTATGTATGTGGAGGTACCAATATGGGAGACCAATGGCCCTGGGAGGCTAGAGAATTGATGCCACAAGANAATTTTACCNTACCTGAATTTGTTACAAAGTTCAATGCAAACCCAAGTGTTTGGCTACTAAGGACCCCTATCATTGGAAGATACTGCATAGCACGNTGGGGAAAGGNCTTTCAAACCCAGGTAGGGGANACAACTTGCCTAGGTCAACAATATTTCGAAGAATCCGAGAACAAGACACAGTGGAGAAGCTTTATAGACAATTCCTCTGTGCCAGATTTTAATCCCCTCTTNCAGTTTCCAGCGCTAAATCAGTCATGGTATCAACTAGATGCTCCAAATGTTTGGAGAGCACCNGCAGGACTATATTGGATCTGTGGGACAAAGGCCTATCAACTATTGCCNGANAAGTGGACNGGAGCCTGTGTGTTAGGAACAATAAGGCCATCCTTCTTCCTACTCCCACTGNAGCAAGGGGAAGATCTAAGTTACCCGGTCTATGACGAAGANAGAAAAAGGGCCAGAAGAAACGTNTTTACNCAGATAAGTACCGTGGAAAAGATAAACACAAACATNAAGAAGGACATTGAAATAGGGAGCTGGAAAGACAATGAATGGCCTCCTGAAAGAATTATCAAATACTATGGGCCAGCTACNTGGGCNCAAGATGGGTCATGGGGNTACCGTACTCCTATTTACATGTTAAACCGAATCATAAGATTGCAAGCAGTACTAGAAATCATAGTCAATGAAACAGCCCGAGCCTTGGATTTGCTAGCCATACAGGCNACCCAGATGAGAGATGCCATATATCAAAATAGGCTAGCATTAGACTATCTCCTAGCCTCAGAAGGAGGAGTTTGTGGNAAACTTAATTTGACNAACTGCTGCTTACAAATCGATGACAATGGAAGAGCTGTCATGGAAATCACTGCCAGGATGCGGAAGTTAGCCCATGTCCCGGTCCAGACCTGGTCCGGATGGAGCCCAAATTCACTTTTTGGAGGATGGTTCTCATGGTTTGGAGGCTTTAAAACTTTGATAATCGGTTTTATAGCCATAATAGGNGGATGCCTAATNCTNCCTTGTCTCCTGCCTCTTCTCATCAGAAGCATCCAGTCCACCATAGAAGCAATAGTGGACCGGACAACTACCACCCGAATAATGGCNCTGCAAAAGTACCAACCGGTNCCCCAAGAAGAGTATGTACCCACNCAAGAAGAAATAGATAACTGTGGTGCTCTTTATTAATCTACATTTATGNCGAGCACCAAAGGGGGGGAA
|
TF motifs of the concenus sequence
Use FIMO to detect transcription factor motifs in the concenus sequence of the TE family.
TE_family | TFBS | Start | End | Strand | Score | Matched sequence |
---|---|---|---|---|---|---|
HERV1_I | MYF6 | 5719 | 5728 | + | 17.58 | AACAGCTGTT |
HERV1_I | MYF6 | 5719 | 5728 | - | 17.58 | AACAGCTGTT |
HERV1_I | SP2 | 580 | 588 | + | 17.56 | GGGGCGGGG |
HERV1_I | SP4 | 580 | 588 | + | 17.54 | GGGGCGGGG |
HERV1_I | M1BP | 4444 | 4454 | + | 17.40 | CGGCCACACTG |
HERV1_I | RARA::RXRG | 2963 | 2979 | + | 17.39 | AGGTCAAAGAAAGGCCA |
HERV1_I | KLF14 | 580 | 588 | + | 17.26 | GGGGCGGGG |
HERV1_I | TFAP4::ETV1 | 1378 | 1390 | - | 17.23 | CCGGAGGCAGCTG |
HERV1_I | Stat2 | 4975 | 4984 | + | 17.16 | GAAACAGAAA |
HERV1_I | Prdm4 | 4229 | 4239 | - | 17.08 | CCTTGAAACAG |
TFBS enrichment in GRCh38
Use Fisher's exact test to perform enrichment analysis of transcription factor binding sites in the TE family of GRCh38.