HERV1_I
Basic information Differential Expression Stage analysis Survival analysis Correlation analysisDF ID | DF0000163 |
---|---|
TE superfamily | ERV1 |
TE class | LTR |
Species | Catarrhini |
Length | 8801 |
Kimura value | 13.19 |
Tau index | 0.0000 |
Description | Internal region of an ERV1 endogenous retrovirus, HERV1_I subfamily |
Comment | The associated LTRs are the HERV1_LTR variants. Target site duplications are 4bp. The HERV1_I consensus encodes gag, protease, reverse transcriptase, RNase H, integrase and envelope proteins. HERV1 elements in the genome are ~6% diverged from the HERV1_I consensus. |
Sequence |
TTTCTGGGGGCTCGTCCGGGATTGGAGACGGCAGGTTTCTGTCTCCTTTGCCTGTGGGCTGGAGCCCCGGGNCGCGGGAGACCCGGGACCCNAGGCGCCACCGGGNAAGACTTAGCCCGGAAGGAGANCGGCTCTCCCGCGTCCCGGNGCCCTCCCCCGGCAGCGCAAACGGAACCGANNGAGGGGCTGCAGGACGATCNCAGGAGCAGCGCGCAGNCAGNCCGCTGAACCGCGGTAAGGTTGGGCCCNAGGAAGGCCCGTCCCATAAGGACGGAAGGGGAGCCTGATCACCTCCCGGGGCGCGACNACTAGTCCGACCCAGAGGGGCTGGGGGCGGCGGGAGTGGCCCGCCGATTCGGATGAANCTCACGCCCCCACTACAAGCGAGAGTGGTTCACTGGGTCTGGAGACGGGAACTGGAGGTGTGTGGGTGCGTGCGAACCNACCCGGGACACGAGGGAGGCTCGTTTCATCCGATGAGGTGGGGNAGGAGTGGTGTGTGTATGTGTGTGAATGTGGGAGCCTAACTAGGCTCACCCGGGACACGAGAGAGGCTCGTTTCATCCGATGAGGAGTCCTGGGGCGGGGGAGGTGTGTGAAAGTGTGTGAAAGAGACGGTCTCGGGAGAGGCCAACGCGGGGAGTGACGTGGGGAGGCACAGATCTCTTAGCGCGGACTGTGTGCTCCGAGGCGAGTGTGGGANAAACCAGACCTAGGNCACTGCATACGGCCGATAGGACCAGCTCCACAGCTNCACAGCAGCAGTTGGCTGTGACCTGGCTAAGCAGCGTCCGAACCTCCCGTAATAGGACCCGGTCTGGTGGATCCGAGAGTGAAAGTGAGAGTGAAAGCGCGCCGCGAGGGAGGAAATGGGAGGAAAAGCATCGAAGCCNACTCCATTGGAGTGCATGCTGAAGAACTTTAAGAAAGGTTTTAATGGTGATTATGGGGTNAAGCTAACTCCNCAGAAGCTGAGAACNCTTTGTGAGATAGACTGGCCNTCTTTTAATGTAGGGTGGCCGGCCGAGGGNACNATAGACAGGGAAATAATTGGCCGNGTGTTTCGGGTGGTCACCGGGGTCGGAGAACAGCCNGGGCACCCGGATCAGTTTCCGTATATNGACTCCTGGCTAAGCGTAATTCAGACCCGCCCNAAGTGGCTGCAGGCCTGCTTTGAGGNNTACTGTAAGACTCTAGTGGCCCGGACAAAACNAGGAACCATAGAAAAGACCCGCAAGGCGCAGNCNCAAGAGAAGGAGTCGCAGGGAAAGCAGAAAAAACCTGTCCTACAGGCCCCGCCNGAAGAGTTAGAAANTCCACCCCCCTATGCNCCAATTTATCCATCTCTGGCAAGGCTTAGGCAGGAGGCCGCCCCGGCAGCTGCCTCCGGAGGNTCAGACTCAGAGGAGAGCACCCCTCAGGCNNCACCACGCAGGGAGGAGCCAGAGCCCCTGCCTGANAAGCCAAGGGAGGAACTCCAGGATGAGGTCGGCCGCCTCAGGTCAGGCCGCGCCCGAGCNATGCAGATGCCCCTCCGAGAAACNNGGGGACAAATTTATTTGGATGCACAGAATGAAGTCCAAGGGGGAGAACGGCTCTTCGTTTATCAGCCCTTCTCTACTACTGATCTCTTAAATTGGAGACAGCATACTCCCTCCTATACGGAGAAGCCCCAGGCTCTTATAGACCTAATGCAGTCCATCTTCCTAACTCACAACCCTACCTGGGCTGATTGCAAACAACTTCTTCTGTCATTGTTTAATACGGAAGAGCGCCGNAGAGTTATACAAGCGGCTCNCCAGTGGCTGGAGAGCAATGCGCCTGCAGGCACAGGAGATGTCAGGCAGTATGCACAACAGGCNCTCCCGATAGAGGCTGACCCAGGCTGGGACCCNAACCAGGCTCAAGGGCTACAAAGCTTGCAGNGGTATCGAGAGGCACTCCTAAATGGAATAAAGGCTGGAGGGAAAAAGGCAACGAATATCGGAAAGGTCTCAGAGGTCCGCCAGAAGCCAGATGAAAGTCCCAGTGAATTTTATGAGAGGCTCTGCGAGGCTTACCGGCTTTACACGCCATTTGACCCAGAGGCTGCAGGNAATCAGTGCATGGTTAATGCGGCATTTGTAAGCCAGGCGCAAGGNGACATNAAGCGAAAGCTTCAGAAGTTGGAAGGNTTTGAAGGTATGAATATTACCCAGCTTATCCAGGTGGCTACTAAGGTGTTTGTAAATCGGGATGAGGAGGCCAAGAGAGAAGCCAAGCGCAGAGCNAAGGAAAAGGCAGANTTGCTGGCNGCAGCCCTGGTTGGAAGAGAAACTGGNTTTGCGAGAGGACGTGGACGTGGTCGTGGATGCGGTCACGGTAGAGGACAAGCTAGGCCAGGCCAGGAGGCCAGGNCAGGNCAAGAGGGCCGGCCTAGGCTNGAGAGAGATCAATGTGCGAGATGCAAGCAGANAGGGCACTGGAAGGATGAATGTCCAGAGAGAGAAAAGGATAAAGGCAACAACCAGGGACAGAATGGCTGGCCAGGGCCCCCTNCNGCCGCCGGNCANGGCGTAGTAGGATCNGACGCGGATCTAATCGGGCTGGCAGGAGTCGATGATTATTNTGAGGACTGAGACAGACCGGGCTCCATCTCATTAGGCCCCGAGGAGCCTATGGTCTCAATGGAGGTAGGGGGCCGAAAAATGGACTTTATGGTNGATACTGGTGCTGAGCACTCGGTNGTGACTCAAGCAATTGGGCCGCTGTCTAAAAACTATGCCAATATNATTGGGGCTACAGGNGTCACAGAAAAGACGCCTTNCTTCAAATCNAAGAGATGTGTGATTGGAGGNCAAGAAGTCCAACACGAGTTTTTATATTTGCCAAATTGTCCGGTGCCCTTGTTAGGAAGAGACTTGCTCCAGAAACTGCAGGCNCAAATCTCCTTTACACCGAGAGGGGACATGACCCTAAACCTAGGTCAAAGAAAGGCCATGGTANTGACCCTTACCGTCCCNANAACAGAGGAATGGAGACTCTATGAGAGNAGTTGCNAGGAATNTGNAAAGANGCACANCGCAGCTGAGAAAGAGGNANTGTNTACGGANTTACTTCTCAAGCTGCCAGGGGTCTGGGCGGAGGACAATCCCCCGGGGCTAGCCGTAAATCAGGCACCCGTAGTGGTGGAGCTGCTGCGAGGNACCTACCCAGTGCGGATCCGTCAGTATCCCATTCCCGTAGAGGCCACCCANGGGATTACAAAACACTTAAANCGGCTCCTTGAATTTGGGATAATAGAAAGATGTGCCTCCTCNTGGAACACTCCNCTGCTGCCGGTGTTAAAGCCCTCTGGNGACTACCGGCCNGTACAGGATTTGCGGGCNGTAAACAAGGTCGCGGCTACACTGCATGCCATTGTGCCCAACCCGTACACNATGCTTGGGCGAATNCCTGCTGATGCTGCTTGGTTTACATGCTTGGACATNAAGGATGCGTTCTTCTGCATCCGACTAGCCCCTGNAAGCCAGGGCATCTTTGCCTTTGAGTGGGGCCCATCNCAGTATACCTGGACCAGACTCCCCCAAGGATTTAAAAACTCCCCAACCATCTTTGAGGAAGCACTAGCCTCAGACCTGAAGGCTTTCACGCCACCAAGTGACCGCTGTGTCCTGTTGCAGTACATAGATGATCTATTGTTGGCCGCACCCACAAGAGAAGAGTGCNTCCAAGGNACAGAGAGCCTCCTTCGNGTNCTGTGGGAGGCTGGCTATAAGGTGTCTAAGGAAAAGGCACAAATCTGTGGCCAAGGAGCNCGGTATCTTGGCTTTNACGTCTCCCAAGGGCAGCGTGAGCTTGGACGNGAGCGAAAAGAGACTGTNTGTAGCATTCCTCGGCCNGACACNAGGCGGCAAGTGCGGGAGTTCCTAGGGGCAGCTGGTTTCTGCCGCATTTGGATTCCAAACTACTCGCTCNTGGCAAAGCCNTTGTATGAGGCTACCAAAGNGGGGGAAAAGGAACCCCTCCTGTGGGGAAAAGAGCAGGACATGGCCTTCAAGGAAATCAAGAAGGCTTTGATCCAGGCCCCGGCATTAGGACTGCCAGACATGACAAAGCCTTTTTACCTGTATGTCCATGAAAGAAAAGGAATAGCTACAGGAGTCTTGGTACAAACGCTAGGGTCATGGTATCGGCCCGTGGCATATTTGTCCAAGCGACTAGACTTGGTGGCTATGGGATGGCCACCCTGTTTCAAGGCACTGGCNGCCACTGCCCTGTTAGCNGAAGATGCTAACAAGCTCACATTTGGACAGAGGTTGATAATTCGGGTGCCCCACACGGTCGTCACCCTGATGGAGCAGAGGGGGCATCGCTGGCTCTCTAACCCTAGGATGTTAAGATATCAAGGGCTCTTGTGTGAAAACCCNTACATAACCTTGGAGACTGTGAATACCCTAAATCCGGCCACACTGCTGCCAATAGAATGGGCGGAGCATGGAAAGCCCCCGTTGTGTGGCCCAGGGTATCACTGTTGTGTGGAAACAGTGGATGAAGTTTTCTCAAGCCGGAAAGACTTAAAGGACCAGCCCTTAAAAGACCCAGATGTTGAATACTTTACTGATGGAAGCAGCTTCATATCTGAGGGTGTCAGAAAGGCCGGATATGCAGTGGTNACACTGAACTCAGTAGCCGAAGCCCGCCCTCTGCCGGTCGGAACCTCGGCCCAAAGGGCNGAGCTAATAGCTCTCACNAGAGCACTGCTCCTGGCGAAAGGAAAGTCAGTAAACATCTATACTGACTCAAGGTATGCTTTTGCCACTTTGCATGCCCATGGAGCCATATATAAGGAAAGAGGATTATTAACTACTGAAGGAAAGGAAATCAAAAATAAAAAGGAAATAGAGCAGCTCTTAGAAGCCGTATGGGCTCCAAAAGAAGTAGCAGTCATCCATTGCAAAGGGCATCAAACAGGAGGAGGTGATGAGGCTAGAGGAAACAGAAAGGCGGACAGAGAAGCCAAAAGAGCTGCAATGACAGAGGTAACTAAGAAGGAAGAGACCCNTACCATGCCCTTACTGGAGCTTCCCCTTACAGAACCCCCTAACTACTCCTCTAATGAAAAGGCNTGGTTCGAGCAGGAGAGCGGAAGTTACCAGAAAGGAGGTTGGTGGAAGTTCTCAGATGGGAGGCTTGCCATCCCAGAAGCAATNGCCCCCCGGTTCATAAAGCAGTTTCATCAAGGAACGCATATGGGGAAAACNGCATTAGAGACTCTCGTAGGACGGCATTTCTATGTGCCGCGCCTAACTGCCATCACTCGAGCCGTTTGTGAGCAATGTTTNACTTGTGCCCAAAACAATCCANGGCAGGGGCCAACACGGCCCCCAGGGATTCAAGAAACTGGAGCNACGCCNTGTGAAAACCTGCTTGTGGACTTTACCGAGCTGCCTCGAGCCGGAGGCTACCGGTACATGCTAGTGTTTGTCTGCACTTTCTCAGGGTGGGTCGAGGCATTTCCCACCAGGACAGAGAAGGCTCGGGAAGTAACCAGGATCTTACTAAAGGACATTATTCCTAGATTTGGACTGCCTCTAACTTTAGGNTCAGACAACGGCCCAGCATTTGTGGCAGAAGTAGTACAGCAGCTAACGCAGATGTTAAAAATCAAATGGAAACTGCATACAGCCTATCGCCCACAGAGTTCTGGAAAAGTTGAAAGAATGAACCGGACACTNAAACAGCTGTTAAAGAAGTTTTGCCAAGAAACTCATCTAAGGTGGGATCAGGTGCTGCCCATGGTCCTTCTCCGAGTCAGGTGCACCCCTACTAAATTAACTGGGTATTCACCCTATGAGATAGTGTTCGGCCGACCACCCCCAATCATAACTCAGATAAAAGGGGATTTAAAAGAAATTGGGGAATTAACCTTAAGAAGGCAAATGCAAGCCTTAGGTGAGGCCATGCAGGAAATACAAGGGTGGGTAAGAGAAAGAATACCTGTTAGCCTCACAGATGCAGTACATCCCTTCCAACCTGGAGACTCTGTCTGGGTCAAACGATGGAACCCAACCACCTTCGGGCCTTTATGGGATGGCCCCCATATTGTGATCNTGTCTACCCCCACTGCTGTTAAAGTTGCAGGTATCACACCTTGGGTTCATCATAGCCGGCTGAAACCNGCAGCCNCAGCTCAGGACCAGTGGACCAGTCAACAAGACCCAGACCACCCGACNCGGCTGATCCTGCGGNGAAACCAAGCCGCNGCNGANAAGGACGACTGCCCTGCTCCGACCACACCGGAGGCTGGTCGGTCCACGCACGGCTGAAGCTTGAGGAAACATCAAGCCCTGCTCTAGTCACACAACTGGAAGCTGACTAGTCTACGCATGGCCGAAGCTTGAGGAAACGTCAAGCCCTGCTCTAGTCACACAACCGGAAGCTGACTAGTCTACGCACGGCCGAAGCCTGAGGAAGTCAACGNTAGATAAGTAAATGTGGATTGAATTTACAAGCGTAGTTATACTCTTACTTGTACTGATTGTTTTGCTGTCATGTTATCTTTGCAAATGCTGCCAAGCTTGTTGCCCAGAAGGGTGCCCGTGCATAGTATAAGCTTAATCATACTAGTAATACTGANGCTAACAGGCATGAAAGGGGACCAAGATGACTGTCATCACTGTATGATAGAAGCCTGGTCCGGAAAAGGTATGACTAAAACTCTGTTATACCAGACCTACTATGAGTGTACAGGGACTCATACGGGAACTTGTGTCTATAACCAGACTAGTTACTCGGTCTGTGATCCNGGAAACGGGCAGCCCCAAGTATGTTATGACCCAGAGTTCTTGCCCTATGACTTCTGGTTTGAAGTCCAAATTGGCGAACCCCTAATGCCATCATATACAAACCCCACAGAAACCGGGGTCGGTAAACTCGTAAACAAAACAGAGGTATTCCCTTACTCGCATAAAGGGCCTGTCTCCATATATTTTGATGCCTGCCAAGCTGCACATCTCAGCAAACTAAACAATATTGGGGCCGTCTGTAAAAATCTAGGACAAGAAAGAGTCAGCAGCAGAGCCGCCAAGGCCGTAACAGGAGAACCCGAAAAGGANTGCCCTGATTGTGACANTCAGTGGACCACACATGAGTTCAGCCAGCGCCTNTACGCAGGAAGAGTNGCTCTGCTTGCCAGCCAAGAGGCGAAGATNGGGTGCGCGACTGGAACATGCAACCCNCTCAATCTGACNATACTAAAGCCAAATATGCCTTTCTGGACTAAAGGGCATAAAGGAGNGCTAANCTTTGATCGGGAAGGAGCAAACCTAGGTATTCCNCTAGTCATTACNAAGAAGACCCAANGGGCCNAAGTTCAAGTTAGCCCAATGCAACAGTTCAGGTTTTNTAAATCCTTCAATGAACACTTTAACCCCGAGGNACCAAAAGTTCAAATTCCNCCNATATCAGCTGAAAACCTGTTCGCTCAGCTAGCCGAAAGTATTGCTANTAATCTNGGAGTCACCTCATGTTATGTATGTGGAGGTACCAATATGGGAGACCAATGGCCCTGGGAGGCTAGAGAATTGATGCCACAAGANAATTTTACCNTACCTGAATTTGTTACAAAGTTCAATGCAAACCCAAGTGTTTGGCTACTAAGGACCCCTATCATTGGAAGATACTGCATAGCACGNTGGGGAAAGGNCTTTCAAACCCAGGTAGGGGANACAACTTGCCTAGGTCAACAATATTTCGAAGAATCCGAGAACAAGACACAGTGGAGAAGCTTTATAGACAATTCCTCTGTGCCAGATTTTAATCCCCTCTTNCAGTTTCCAGCGCTAAATCAGTCATGGTATCAACTAGATGCTCCAAATGTTTGGAGAGCACCNGCAGGACTATATTGGATCTGTGGGACAAAGGCCTATCAACTATTGCCNGANAAGTGGACNGGAGCCTGTGTGTTAGGAACAATAAGGCCATCCTTCTTCCTACTCCCACTGNAGCAAGGGGAAGATCTAAGTTACCCGGTCTATGACGAAGANAGAAAAAGGGCCAGAAGAAACGTNTTTACNCAGATAAGTACCGTGGAAAAGATAAACACAAACATNAAGAAGGACATTGAAATAGGGAGCTGGAAAGACAATGAATGGCCTCCTGAAAGAATTATCAAATACTATGGGCCAGCTACNTGGGCNCAAGATGGGTCATGGGGNTACCGTACTCCTATTTACATGTTAAACCGAATCATAAGATTGCAAGCAGTACTAGAAATCATAGTCAATGAAACAGCCCGAGCCTTGGATTTGCTAGCCATACAGGCNACCCAGATGAGAGATGCCATATATCAAAATAGGCTAGCATTAGACTATCTCCTAGCCTCAGAAGGAGGAGTTTGTGGNAAACTTAATTTGACNAACTGCTGCTTACAAATCGATGACAATGGAAGAGCTGTCATGGAAATCACTGCCAGGATGCGGAAGTTAGCCCATGTCCCGGTCCAGACCTGGTCCGGATGGAGCCCAAATTCACTTTTTGGAGGATGGTTCTCATGGTTTGGAGGCTTTAAAACTTTGATAATCGGTTTTATAGCCATAATAGGNGGATGCCTAATNCTNCCTTGTCTCCTGCCTCTTCTCATCAGAAGCATCCAGTCCACCATAGAAGCAATAGTGGACCGGACAACTACCACCCGAATAATGGCNCTGCAAAAGTACCAACCGGTNCCCCAAGAAGAGTATGTACCCACNCAAGAAGAAATAGATAACTGTGGTGCTCTTTATTAATCTACATTTATGNCGAGCACCAAAGGGGGGGAA
|
TF motifs of the concenus sequence
Use FIMO to detect transcription factor motifs in the concenus sequence of the TE family.
TE_family | TFBS | Start | End | Strand | Score | Matched sequence |
---|---|---|---|---|---|---|
HERV1_I | dar1 | 3684 | 3694 | + | 17.08 | GCCGCACCCAC |
HERV1_I | Wt1 | 583 | 592 | - | 17.08 | CCTCCCCCGC |
HERV1_I | RARA | 2963 | 2979 | + | 16.98 | AGGTCAAAGAAAGGCCA |
HERV1_I | GFI1 | 8443 | 8453 | + | 16.98 | AAATCACTGCC |
HERV1_I | TCP9 | 3530 | 3540 | - | 16.96 | ATGGGCCCCAC |
HERV1_I | ZNF213 | 1879 | 1890 | + | 16.94 | ACCCAGGCTGGG |
HERV1_I | hkb | 367 | 375 | - | 16.90 | GGGGCGTGA |
HERV1_I | Ets98B | 1099 | 1107 | + | 16.79 | ACCCGGATC |
HERV1_I | TFAP4 | 5719 | 5728 | + | 16.65 | AACAGCTGTT |
HERV1_I | TFAP4 | 5719 | 5728 | - | 16.65 | AACAGCTGTT |
TFBS enrichment in GRCh38
Use Fisher's exact test to perform enrichment analysis of transcription factor binding sites in the TE family of GRCh38.