HERV16
Basic information Differential Expression Stage analysis Survival analysis Correlation analysisDF ID | DF0000162 |
---|---|
TE superfamily | ERV3 |
TE class | LTR |
Species | Theria_mammals |
Length | 4996 |
Kimura value | 32.10 |
Tau index | 0.9660 |
Description | ERVL endogenous retrovirus, HERV16 subfamily (internal) |
Comment | LTRs of HERV16 are listed in Repbase as LTR16 sequences. HERV16 is related to HERVL and is relatively old (>20% divergence from consensus). Bases 300 to 1300 encoded a GAG protein closely similar to the murine "retrovirus restriction polypeptide" (PIDs e24674 and e242676). |
Sequence |
GTTGGTACCAGGAGTGGTCCGAGAAAGCAGACGNTNCTAAGATGGGATTTTGGAGCTGGATCACCCGCCGNCCGGCTGGCAATGAGGACCCCATCACTGGTGGTAGGTGGAGCACGGATAGCCCCTGGCACGAGGTAGCGGTGCAATTGTTAAAACTTTCACCGGTGGTGAACTGGGATGGNATACCGGTGGAAAGAGAATGCACTGGCNGGTGCAATGTNTCAGGCGTTTGAGAAATATGGGANGAATTAANTACATGNAAGGACAATGGAATTGGATGGCTGTTGCTAAGCNCGACTGACGCTCTGGAAAAAGACAATGAAAGGCTGAGAGCGATTAATCGNCAATTNAAAGCTAAGTGTGAAAGCCAGAGGGCCTCCTTGGCAGCATATAAAGAGACTCTCATCTCCTGCAGCCGGAGGGCAGAGAAAGCTGAGGATCAGGCCCAGGACTTAATCAGAGTAGCAGAGCTCCAGAGAAGGTTGAATTCTCAACCNAGGCAGGTCTGCTATGCCAAGGTCAGGGCCCTGGTTGGGAAAGAATGGGACCCTGANACNTGGGATGGGGACATCTGGGTCGATGCNCCTGAAAATCTTGAATCCCCAGATTCCCCTGAACCCTCTGGGCCTGCAGAAGTGGCCCACTCCTCCCTGTTAAGGGCTAGCACTCNCTCCTTGCTTCGCGNGAAGACGATGCAGAGGCCTCTNCCTTNGCAAGACAACATGCGCCCCCCTCAGGATCTGCCCCCACCTCCCCTCCTGGCCACCAGACCAATAACTAGGGTTAAGTCACAGCATAACCCGGCCGGGGAAGTGCTGGGCCTGNTAAGGGAGGAAAGGGACTATACCCCAAAGGAGCTGCAGGACNCGNCTAGCCAGCATGTACCGGCAGGACCGGGAGAGTACGCATGGGACTGGATTCTGAGGGTGCTGGATCAAGGGGGNCGGAACATAAAGTTGGATAAGGGAGAGTTTATCGATNTGGGAGCACTCTCCCGNGATACAGGATTTAACACCCTGGCAAGGACCCCGGGAGATGGTGCNAACACGCTGCTAGGATGGCTCCTGGAAGCNTGGAGAAAGCGATGGCCCACACTAAGTGAAGTAGAAATGCCAGAACTGCCGTGGCAGACGGTGGAAGAAGGGATCAAAAGGCTCAGAGAAGTGGGCATGCTAGAGTGGATATACTATGTAAGGCCGGAAAACCCACCAGATGACTATGTTCCGCGGGAGGGCCCAGAGGACACNCCATTTACCAAAGCNATAAGGAATGCGCTGGTGAGAGGGGCACCAGCATCACTGAGAAGCTCAGTGGTGGCTNTCCTCTGCAGGCCAGGGCTGACGGTAGGAGANGCCGTTACAGAACTGGGCTCNCTGATAGCAATGGGGATGATAGGACCCCGAAATAATAGAGGCCAGGTGGCGGCGCTTAACCGTCAGAAGCAAGGTGGGCGCAATTATCGTAATGNACGGCAAGGTCGGAGTGGCAGCCGGGGGGGCCTGACCCGCAGAGAGCTATGGAGATGGTTAATAGAACACGGCGTCCCTAGGGGCAAGATAGATGGGCAGCCAACAGGAAGNCATGGACAGTTAATAGAACATGGCGTCCTAGGGGCAAGATAGATGGGCAGCCAACAAGGGTGTATTGCTCAACTNAAACAANCAAAAGAAATCAAGGATGGATGANCAGGAGGCTGAGGGCAGTCGCCCCAATAAAAAGTCACGATCCCTTGCCCAGTTTCCGGACCTGAGCCAGTTTTCAGACCCGGAACCCATTGACTGAAGGAGAGGCCGGGTCCCCAGGAGGAAGGACCCTGCAACACCACGGCAAGTGTACACGGTAATGATTCCCCCAGTCCTTCCCCAAAGGGACCTACGGCCATTTACTCTGGGTAACCGTACACTGGGGAAAGGGGAATACCCAGACATTTCGAGGACTGTTGGACACAGGGTCCGAGTTGACATTGATACCCGGAGACCCGAAGCGTCATCATGGCCCTCCCGTTAGAGTGGGGGCATATGGGGGCCAGGTAATAAATGGAGTCCTGGCCCAGGTCCGGCTCACAGTGGGTCCACTGGGTCCACGGACCCACAGTGGTCATTTCCCCGGTCCCCGAATGTATAATTGGAATGGACATACTTGGTAGTTGGCANAACCCCCACATTGGTTCCTTGGCCTGTGGGGTAAGAGCTATCATAGTGGGGAAGGCCAAGTGGAAGCCTCTGAAACTGCCCTCACCTTCTCCGGNCAAGATAGTAAATCAAAAACAATATCGCATCCTCGGGGTGGAGAATGGCAGAGATTAGTGCCACCCTTAAAGACCTAAAGGATGCAGGAGNTGTGGTCCCCATCATATCTCCATTTAATTCACCAGTNCNGGCCCCTGCAAAAACCAGACGGATCCTGGANGACGACAGTGGACTACCGCAAACTCAACCAAGTAGTANCCCAATTGCAGCTGCTGTGCCAGATGTGGTATCTTTGCTAGAGCAGATTAACACGGCCTCAGGTACGTGGTATGCGGCCATTGATCTGGCGAATGCGTTCTTTTCCATCCCTATCAGAAAGGAGGATCAGAAGCAGTTCGCATTCACNTGGAACGGACAACAGTATACATTTACAGTCTTGCCCCAGGGCTNTGTTAACTCTCCTGCCCTCTGTCATAATATAGTCCGAAGGGACCTGGACCATCTGGACATTCCGCAGAACATCACATTGGTCCACTATATCGATGACATCATGCTAATCGGACCGGAATGAGCAAGAAGTGGCAAGTACGTTGGAGGCCTTGGTAAGACACATGCGCTCCAGAGGGTGGGAGATAAACCCTACGAAGATTCAGGGGCCTGCCACATCAGTGAAGTTTTTAGGGGTCCAGTGGTCTGGGGCATGCCGGGACATCCCCCCAAAGTAAAGGACAAATTATTGCATCTTGCACCTCCCACCACNAAGAAGGAAGCACAACGCCTGGTAGGCCTCTTCGGGTTCTGGAGGCAGCATATTCCACACTTGGGAATACTGCTCCGACCCATNTACCGGGTGACGCGAAAGGCTGCCAGCTTTGAGTGGGGCCCAAGCAGGAAAGGGCTCTGTAGCTNACCGCATATCGTGTNCGAANCGCANANNCGGNCCTCCAAGTCATAAAGTNGGGCGGGCCCAGCAGCAGTCCATCATAAGGTGGAAATAGNATATCCGAGATTAAGCCCGAGNAGGACCAGAGGGCGCAGGTNAGNTNCATGAGCAGGTAGCCCAGACCCCCATGNCACCCACCACNGTTGCACCAGCGCCTCTCCTTCGGCTCGCACCTATGGCCATATNGAGGNNTGTGGTCCCGTATGACCAGCTGAAGGAAAGCTCGAGCTTGGTTTACAGGATCAGCTCGGTATGTGGGCGCAAGCCAAAATANGTGGTGGCTGCACTACAGCCCCATTCAGGGGTGGCCCTGAAAGGCAGCAGGGAGTGAAAATATCTTCCCAGTGGGCAGAGCTGCGAGCAGTGCACCTGGTCATCCACTTTNCGTGGAAGGAGAAGTGGCCCGAGATGGTAATATATACGGATTNCTGGACAGTGGCCAATGGCCTGGTCGGCTGGTCAGGAGCTTGGAAGGAAAAGATTAGAGANAGGGAAGTCTGGGGTAGAGGCATGTGGATAGACATATNGGAAGTGGCACGAAATACGAAGATTTTTGTATCGNTNNTATGTTAACGCTCACANGAAAGCATCNACCATGAAAGAGNCACTGAACAACCAAGTAGACAAAGTGACTTCGACCACTGACGTTAGCCAGCCTTCGTCCCGGCCTCCTCAGANCTGGTNCAACGGACATCTCAANAAAGTAGCCATGGTGGCAGAGATGGAGGCTACGCATGGGCCCAACAGCATGGACTCCTATNCACCAAGNCGATCTAGCTGCTGCCCCGTNTGAATGTCCAACCTGCAGCAACAGAGACCAATGCTCTACCTCNAATACGGCACTATTTATGGAGACCACTTAGCGGCGAATTGACTATATTGGACGCCTTCCATCCTGGAANGGNCAGNGATTCATTCTCACAGGAATANATACNTATTCCGGGTATGGGTTTGCCTTTCCTGCCTGCANAGCCTCATCNAGCATCTGAGGGNCTTAGGAGTGCCTGATCCACAGGCATGGAATCCCACACAACATAGCATCTGANCAGGGCGCTCACTTCACAGCAAAGAAGNGGCACAAGTGGGTCCANAATAACGGGATCCACTTATCNTATCACGTACCGCACCATCCAGAAGNAGCTGACCTNATAGAACGCTTCTGAAGGCACAGNTGAAGCACCAGTNGAGGCGAACTCTGAAAGAATTGGGTGCCATCCTCCAGGATGCAGTATATACATTGAATCANAGGCCTATNCGTNGGGCNTTGNNTCTAATNGGGAGAATGGGATTAGGAATAAGAGGTGAAAGCAGGAGTGTGCCTACTCCCCATCGTAGTGGCCCACAAGGATCACAGCAGTGGTTTGAACACGGATTGGACTACTGATATCCAGGAATCGCCGCACAAGAAGGNGAATCCACATTTGGCAGGAGTAATTGACCNTGATCANCAGGAGGGGTAGGGCTGCTNTCGCANAATGAGGAAGAAAGGAATATGTNTGGAACCAGGTGATCCNCTTGGGTGCCTCCTGGTACTCCCTTGCCCCATTTAACTGTAAATGGACAATNCCACCGAAAAGAGCTCGGACCNCTCCGGAATGAAGGTTTGGNTCACATCGCCNGGTAAGCCACCAAGACCTGTCGAAGTGATAGCTGNGTGAGGGANATCTAGAATGGATAGTAGAAGAGGGAGACGATGAGTACCAGTTNCGGCCCCGAGACCAACTGCAGCAANNGGGGCTGTAGTTCGTCCCACTAACCTCCCTCTTCTAAGTTTCGCTTCAGAAAGAGAAGCTTAAAGGAATAATGGAGGAACTGCTCCNNGAACCTGNGTGGAGAAGTAGATCCGTCGAGTACAAAGGTGGAC
|
TF motifs of the concenus sequence
Use FIMO to detect transcription factor motifs in the concenus sequence of the TE family.
TE_family | TFBS | Start | End | Strand | Score | Matched sequence |
---|---|---|---|---|---|---|
HERV16 | RVE6 | 3463 | 3471 | - | 16.23 | AGATATTTT |
HERV16 | TCP3 | 2341 | 2348 | - | 16.23 | GGGACCAC |
HERV16 | TCP3 | 3326 | 3333 | - | 16.23 | GGGACCAC |
HERV16 | RVE5 | 3463 | 3471 | - | 16.21 | AGATATTTT |
HERV16 | TCP24 | 2341 | 2348 | - | 16.21 | GGGACCAC |
HERV16 | TCP24 | 3326 | 3333 | - | 16.21 | GGGACCAC |
HERV16 | ERF105 | 1419 | 1428 | + | 16.17 | GTGGCGGCGC |
HERV16 | TCP5 | 2341 | 2348 | - | 16.15 | GGGACCAC |
HERV16 | TCP5 | 3326 | 3333 | - | 16.15 | GGGACCAC |
HERV16 | NFKB2 | 1912 | 1922 | + | 16.12 | GGGGAATACCC |
TFBS enrichment in GRCh38
Use Fisher's exact test to perform enrichment analysis of transcription factor binding sites in the TE family of GRCh38.