HERV9
Basic information

DF ID | DF0000173 |
---|---|
TE superfamily | ERV1 |
TE class | LTR |
Species | Catarrhini |
Length | 8436 |
Kimura value | 5.24 |
Tau index | 0.9324 |
Description | Internal region of an ERV1 endogenous retrovirus, HERV9 subfamily |
Comment | gag ~1287-2747, pol ~2748-6323, env ~6740-8380. |
Sequence |
TTTTGGCGACCACGAAGGGACCATCGCCTATCGCCAAGCGGTGAGACTATCGCCTATCGCCAAGCGGTGAGTACCATCGGACCCCTTTCGCTTGCTATTCTGTCCTATTTTTCCTTAGAATTCGGGGGCTAAATACCGGGCACCTGTCGGCCAGTTAAAAGCGACTAGCGCGGCCGCCGGACTAAAGACACGGGTGTCAGGCTTTCTGGGAAAGGGCTCTCTAACAACCCCCGACTCTTCGGAGTTGGGAGCGTTGGTTTGCCTGGAACCAGCTTCCGCTTTTCCTGTACTTCTGGGCTGAGCCGAGGGTCGACAGAGAGGAAAGCCATTCAGCTCCAGGGGTCCCGACAACAAGTTGGTTGACCCTGCGGCCATGAGCGGAACTCTCAAAGTCATGTCGCCCAAGCGAGACTCGCCCATCTATCCTATCTATCCTGACCCTTGCCTCCTGGGTCCTAATGCCTGTCAGACAAACTTCCTCTCGCCTCTCTTCTCCGAGGCTAGTCCCGCTTCTAAAAACCACTCCCTGTCTCTGGTGCTTTTCTAGTTTCTCCTATAAGAATGATTTCTAGTATAAACTCCAGGACTCTGTTACCTTCTTTAGGCACCCGGGCTCACCAATCAGAAAGACATAATTTTTGCCCAAAGCCCCATCGTAGGGGGGACTATCTGGAATTTTAGGATCCCTCCTCAGACNAGCAGGCCTAACAAAAGCTATTCCTGAAGCTAGGATATGGGGAGCCTCAGAAATTGTATCCTTCCTATTCATATAAGTGAGGACAAAAGGCGTCACTCTTCCAACTCTGGAGATCCCTTCCCTCCCTCAGGGTATGGCCCTCCACTTCATTTTTGGGGCATAACATCTTTATAGGACACGGGTAAGGTCCCAATACTAACAGGAGAATGCTTAGGACTCTAACAGGTTTTCGAGAATGCGTCGGTAAGGGCCACTAAATCCGATTTTTCTCGGTCCTCTTTGTGGTCTAGGAGGACAGGCAAGGGTGCAGGTTTTCGAGAATGCGTCGGTAAGGGCCACTAAATCCGACCTTCCTCGGTCCTCCTTGTGGTCTAGGAGGAAAACTAGTGTTTCTGCTGCTGCGTCGGTGAGCGCAACTATTCCGATCAGCAGGGTCCAGGGACCGTTGCGGGTTCTTGGGCAAGAGGTGTTTCTGCTGCTGCGTCGGTGAGCGCAACTATTCCGATCAGCAGGGTCCAGGGACCGTTGCGGGTTCTTGGGCAGGGGGAGAAACAAACAAACCAAAACCGCGGGCGGTTTTGTCTTTCAGATGGGAAACACTCAGGCATCAACAGGCTCACCCTTGAAATGCATCCTAAGCCATTGGGACCAATTTGACCCGCAAACCCTGAAAAAGAGGCGGCTCATTTTTTTCTGCACTATGGCCTGGCCCCAATATTCTCTCTCTGATGGGGAAAAATGGCCACCTGAGGGAAGTATAAATTACAATACTATCCTGCAGCTTGACCTTTTCTGTAAGAGGGAAGGCAAATGGAGTGAAATACCTTATGTCCAAGCTTTCTTTTCATTGAAGGAGAATNCACAACTATGCAAAGCTTGCAATTTACATCCCACAGGAGGACCTCTCAGCTTACCCCCATATCCTAGCCTCCCTATAGCTCCCCTTCCTATTAATGATAAGCCTCCTCTAATCTCCCCCGCCCAGAAGGAAACAAGCAAAGAAATCTCCAAAGGACCACAAAAACCCCCGGGCTATCGGTTATGTCCCCTTCAAGCTGTAGGGGGAGGGGAATTTGGCCCAACCCGGGTACATGTCCCCTTCTCCCTCTCTGATTTAAAGCAGATCAAGGCAGACCTGGGGAAGTTTTCAGATGATCCTGATAGGTACATAGATGTCCTACAGGGTCTAGGGCAAACCTTCGACCTCACTTGGAGAGATGTCATGCTATTGTTAGATCAAACCCTGGCCTTTAATGAAAAGAATGCGGCTTTAGCTGCAGCCCGAGAGTTTGGAGATACCTGG
TATCTTAGTCAAGTAAATGATAGAATGACAGCCGAAGAAAGGGACAAATTCCCTACCGGTCAGCAAGCCGTCCCCAGTATGGATCCCCACTGGGACCTCGACTCAGATCATGGGGACTGGAGTCGCAAACATCTGTTGACCTGTGTTCTAGAAGGACTAAGGAGAATTAGGAAAAAGCCCATGAATTATTCAATGATGTCCACCATAACTCAGGGAAAGGAAGAAAATCCTTCTGCCTTCCTCGAGCGGCTACGGGAGGCCTTAAGAAAATATACTCCCCTGTCACCCGACTCACTCGAGGGTCAATTGATCCTAAAAGATAAGTTTATTACCCAATCAGCCGCAGATATCAGGAGAAAGCTCCAAAAGCGAGCCCTGGGCCCTGAACAAAATCTGGAGGCATTATTAAACCTGGCAACCTCGGTGTTCTATAATAGGGACCAAGAGGAACAGGCCCAAAAGGAAAAGCGAGATCAGAGAAAGGCCGCAGCCTTAGTCATGGCCCTCAGACAAACAAACCTTGGTGGTTCAGAGAGGACAGAAAATGGAGCAGGCCAATCACCCGGTAGGGCTTGTTATCAGTGTGGTTTGCAAGGACACTTTAAAAAAGATTGTCCAACGAGAAACAAGCCGCCCCCTCGCCCATGTCCACTATGCCGAGGCAATCACTGGAAGGCGCACTGCCCCAGAGGACAAAGGTTCTCTGGGCCAGAAGCCCCCAACCAGATGATCCAACAACAGGACTGAGGGTGCCCGGGGCAAGCGCCAGCTCATGTCATCACCCTCACTGAGCCCCGGGTACGTTTAACCATTGAGGGCCAGGAAATTGACTTCCTCCTGGACACTGGCGCGGCCTTCTCAGTGTTAATCTCCTGTCCCGGACGGCTGTCCTCAAGGTCCGTTACCATCCGAGGAATCCTGGGACAGCCTGTAACCAGGTATTTCTCCCACCTCCTCAGTTGTAATTGGGAGACTTTGCTCTTTTCACATGCCTTTCTTGTTATGCCTGAAAGTCCCACACCCTTATTAGGGAGGGACATATTAGCCAAAGCTGGAGCTATTATCTACATGAATATGGGGAACAAGTTACCCATTTGTTGTCCCCTACTTGAGGAGGGAATCAACCCTGAAGTCTGGGCATTGGAAGGACAATTCGGAAGGGCAAAAAATGCCCGCCCAGTCCAAATCAGGCTAAAAGACCCCACCACTTTTCCTTATCAAAGGCAATATCCCTTAAGGCCTGAAGCTCATAAAGGATTACAGGATATTGTTAGACATTTAAAAGCTCAAGGCTTAGTAAGGAAATGCAGCAGTCCCTGCAACACCCCAATTCTAGGAGTACAAAAACCGAACGGTCAGTGGAGACTAGTGCAAGATCTTAGACTCATCAATGAGGCAGTAATTCCTCTATATCCAGTTGTACCCAACCCCTATACCCTGCTCTCTCAAATACCAGAGGAAGCAGAATGGTTCACTGTTCTGGACCTCAAGGATGCCTTCTTCTGTATTCCCCTGCACTCTGACTCCCAGTTTCTCTTTGCCTTTGAGGATCCCACAGACCACACGTCCCAACTTACGTGGACGGTCTTGCCCCAAGGGTTTAGGGATAGCCCTCATCTGTTTGGTCAGGCACTGGCCCAAGATCTAGGCCACTTCTCAAGTCCAGGCACTCTGGTCCTTCAGTATGTGGATGATTTACTTTTGGCTACCAGTTCGGAAGCCTCATGCCAGCAGGCTACTCTAGATCTCTTGAACTTTCTAGCTAATCAAGGGTACAAGGCGTCTAGGTCGAAGGCCCAGCTCTGCCTACAGCAGGTCAAATATCTAGGCCTAATCTTAGCCAGAGGGACCAGGGCCCTCAGCAAGGAACGAATACAGCCTATACTGGCTTATCCTCGCCCTAAGACATTAAAACAGTTGCGGGGGTTCCTTGGAATCACCGGCTTTTGCCGACTATGGATCCCCGGATACAGCGAGATGGCCAGGCCNCTCTATACTCT
AATCAAGGAGACCCAGAGGGCAAATACTCATCTAGTAGAATGGGAACCAGAGGCAGAAACAGCCTTCAAAACCTTAAAGCAGGCCCTAGTACAAGCTCCAGCCTTAAGCCTTCCCACAGGACAAAACTTCTCTTTATACGTCACAGAGAGAGCGGGGATAGCTCTTGGAGTCCTTACTCAGACTCGTGGGACAACCCCACAACCAGTGGCATACCTAAGTAAGGAAATTGATGTAGTAGCAAAAGGCTGGCCTCACTGTTTACGGGTAGTTGCGGCGGTGGCCGTCTTAGTGTCAGAGGCTATCAAAATAATACAAGGAAAGGATCTCACTGTCTGGACTACTCATGATGTAAATGGCATACTAGGTGCCAAAGGAAGTTTATGGCTATCAGACAACCGCCTGCTTAGATACCAGGCGCTACTCCTTGAGGGACCGGTGCTTCAAATACGCACGTGCGCGGCCCTCAACCCTGCCACTTTTCTCCCAGAGGATGGGGAACCAATCGAGCATGACTGCCAACAAATTATAGTCCAGACTTATGCCGCCCGAGATGATCTCTTAGAAGTCCCCTTAGCTAATCCTGACCTTAACCTATATACCGATGGAAGTTCATTTGTGGAGAATGGGATACGAAGGGCAGGTTATGCCATAGTTAGTGATGTAACNGTACTTGAAAGTAAGCCTCTTCCCCCAGGGACCAGCGCCCAGTTAGCAGAACTAGTGGCACTTACCCGAGCCTTAGAACTGGGAAAGGGAAAAAGAATAAATGTGTATACAGATAGCAAGTATGCTTATCTAATCCTACATGCCCATGCTGCAATATGGAAAGAAAGGGAGTTCCTAACCTCTGGGGGAACCCCCATTAAATACCACAAGGAAATTATGGAGTTATTGCACGCAGTGCAAAAACCCAAGGAGGTGGCAGTCTTACACTGCCAAAGCCATCAGAAAGGTGAAGGAGAAAAGGCAGAAGGAAACCGTCGGGCAGATGCTGAGGCCAAAATTGCTGCCAGGCGGAACCTCCCATTAGAAATACCTACGGAAGGACCCTTGGTATGGAACAACCCCCTCCAAGAGATTAAGCCCCAGTATTCCCCGACTGAAACAGAATGGGGACTTTCACGGGGGCATAGTTTTCTCCCCTCGGGGTGGTTAACGACAGAAGAAGGAAAGGTACTTATACCCGAAGCCAGCCAGTGGAAAATACTTAAAACCCTCCACCAAACTTTTCATATGGGTATTGAAAACACTCATCAAATGGCCAAATCCCTATTTACAGGGCCAAATCTCCTCCGGACCATCCGACAGGTAGTCAAAGCCTGTGAGGTGTGCCAAAGGAATAATCCCTTGGTCCATCGTAAGGCCCCTTTGGGGGAACAAAGAATAGGTCACTATCCCGGAGAGGACTGGCAGTTAGACTTCACCCATATGCCTAAGTCAAAGGGATTTCAATACTTGTTGGTCTGTGTTGATACCTTTACAAATTGGATAGAAGCTTTCCCCTGCAAGACAGAGAAGGCTCAGGAAGTGATTAAAGTCCTAATTCATGAAATAATTCCTAGATTTGGGCTTCCCCAAAGCTTACAGAGTGACAATGGTCCGGCTTTTAAAGCCACGATAACTCAGGGAATTTCCAGGGCGCTAGGGATACAATATCACCTTCACTGCGCCTGGAGGCCACAATCCTCAGGGAAGGTCGAGAAGGCAAATGAAACACTCAAGAGGCACTTAAGGAAACTAACACAAGAAACTCATCTCCCATGGCCTACTCTTTTGCCCATGGCCTTGTTGAGAATCCGAAATTCTCCTCACAAAATGGGGCTCAGTCCATATGAAATGCTGTATGGACGACCTTTTCTCACAAATGACCTCCTACTTGATCAGGAAACGGCCAACTTGGTCAAAGATATAACTTCTTTGGCAAAATATCAACAAAACCTTAAAAACCTACCTGAAGGATGTCACAGAGAAAAGGGAACAGAGTTGTTTCAACCAGGAG
ATCTAGTGTTGGTCAAATCTCTCCCCTCTACCTCCCCATCTATGGACTCTTTGTGGGAAGGACCATACTCGGTAATCCTCTCTACCCCCACTGCAGTTAAGGTGGCAGGAGTGGAATCTTGGATTCACCACACCCGAGTTAAACTTTGGACACCCCCTGAGGAACCTGCGGGACCGTCAGCTCAGGAGTCCCAAGATCAGCCAGACCAGCCTCGATACACCTGCGAACCGTTGGAGGACTTGCATCTCCTATTTCGGAAGGAAACATCCCAGACTAAAAAGGCTCCTACCACTGATCCTGAGGAAAAACCCCTTCCTCCTTAAAAAAGATAAGTGAAAACCTACATAATCTTTATCTTTAACACCTCTCCTTGCCCCTTTAATGGAATCCTTTTACTATTTCATCATATTATTAAGCAGCATACTAACCATACTCTTTGCGATAGGACTATATACTGTAGCTCCTGCCGGGACGAAAATCCTAATCACATCAACCTTCTTTCTATCTTCCTTCCTTCTGACAGCAATTTACTCCTACCTTTAACTCAGACTGGATAAAATGATCTCGTCTTCCAGAGCACCCTCTTTACCTTCCTATTTACTCTTTGCCTATCTATCCCTCCTGCTTCCTTGGATACCTCATACAATCACCCCTCCCCTTCCACTAGCTCCTAATTACCTCTACAAGACTCTCAACTTAACCCACTCTCTGTTAAACCAGTCCAATCCTTCCCTGGCAAATGACTGTTGGCTTTGTATCTCTCTATCAACCTCTGCTTACGTTGCCACTCCCATTCCCGCAAAAAANCTGGGTCTTTACCAACTTAACCTACCACCCTCGTTATGAAGGAAAAGACCCTTTCCGACTTCTAAATATGCAATCATTAGCCGACTTCCCCATCTCTGATAGGACCAAGAATACCCTAACAGGACGTGCAATCCAACTTTTACGTTCTTACATTTCCAACCTCACCTATTACACAAGCAATGAAAAGCCCATACACGGCCCTGTAACTACGAATACCATCTTAACTTTCCAAGCCCCTTTATGCATCCAACGCAACCTGTTATCAGGCCTGCCCCTGGGGCACCTACTACCCCATCAGTGTAATTACACCCTACAACTTCAAGCCCCAACTGATCATAGTAACTTCCGAGTCACCCAAACAGCTCCATTCAGATGGCTTGTCCGCTTCTCAGGGCCCCCAAAAATCATCACCTCCTCCCTGCTTAACAAACAGTCCAGGTTTTGTAATGGCAAACATACTCCCTGCATGACCATTCACCCCTGGACCCCCTGCAGCAGCGCCCCCACCACTAGTGAATGCCTTCTCATCCCCTCTTTCAATCACTCTCTCGAATGGTTCCTAGTAGATACAAAACGGTTTTTTCTCCAATGGGAAAATAGAACACAGGGAGCCACTCAGTTTGCTCCCAACACCCCTTTCCAGCCGCTCACCGGAGCTACCTTGGCAAGTACTCTAGGAGTATGGGAAAATGAAAACAACAAACTCACACACCTTTTTAACATACACAACCAGTTCTGTCTACCCAGCCAAGGCATATTCTTCTTATGTGGAACGTCGACCTATATCTGCCTCCCCACTAACTGGACAGGCACCTGCACCTTAGTCTTCCTAAGTCCCAACATTAACATTGCCCCAGGAAATCAGACCCTATCAGTGCCCCTCAAAGCTCAAGTCCGTCAGCGCAGGGCCATACAACTAATACCCCTACTTATAGGGTTAGGAATGGCTACTGCTACAGGAACCGGAATAGCCGGTTTATCTACTTCATTATCCTACTACCACACACTCTCAAAGGATTTCTCAGACAGTTTGCAAGAAATAACGAAATCTATCCTTACTCTACAATCCCAAATAGACTCTTTGGCAGCAGTGACTCTCCAAAACCGCCGAGGCCTAGACCTCCTCACTGCTGAGAAAGGAGGACTCTGCACCTTCTTAGGGGAAGAGTGTTGTTTTTACACTAACCAGTCAG
GGATAGTACGAGATGCCGCCCGGCGTTTACAGGAAAAGGCTTCTGAAATCAGACAACGCCTTTCAAACTCTTATACCAACCTCTGGAGTTGGGCGACATGGCTTCTCCCCTTTCTAGGTCCCGTGGCAGCCATCTTGCTATTACTCGCCTTCGGGCCCTGTATTTTTAACCTCCTTGTCAAATTTGTTTCCTCTAGGATCGAGGCCATCAAGCTACAGATGGTCTTACAAATGGAACCCCAAATGAGCTCAACTAACAACTTCTACCGAGGACCCCTGGACCGACCCGCTGGCCCTTTCACTGGCCTAGAGAGTTCCCCTCTGGAGGACACTACAACTGCAGGGCCCCTTCTTCGCCCCTATCCAGCAGGAAGTAGCTAGAGCGGTCATCGCCCAATTCCCAACAGCAGTTGGGGTGTCCTGTTTAGAGGGGGGAT
|
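The Comment field gives approximate gag, pol and env coordinates on the 8436 bp consensus. The sketch below shows one way to turn such coordinates into Python slices and region lengths. It assumes the coordinates are 1-based and inclusive, which the entry does not state. The names `REGIONS`, `region_length`, `slice_region` and `region_summary` are hypothetical helpers, not part of the database.

```python
# Approximate coding-region coordinates copied from the Comment field.
# ASSUMPTION: coordinates are 1-based and inclusive.
REGIONS = {
    "gag": (1287, 2747),
    "pol": (2748, 6323),
    "env": (6740, 8380),
}
CONSENSUS_LENGTH = 8436  # value of the Length field


def region_length(start: int, end: int) -> int:
    """Length in bp of a 1-based inclusive interval."""
    return end - start + 1


def slice_region(sequence: str, start: int, end: int) -> str:
    """Extract a 1-based inclusive interval from a 0-based Python string."""
    return sequence[start - 1:end]


def region_summary() -> dict:
    """Map each region name to its length in bp."""
    return {name: region_length(s, e) for name, (s, e) in REGIONS.items()}


if __name__ == "__main__":
    for name, length in region_summary().items():
        share = 100 * length / CONSENSUS_LENGTH
        print(f"{name}: {length} bp ({share:.1f}% of the consensus)")
```

To extract a region, pass the consensus string loaded from the Sequence field, for example `slice_region(consensus, *REGIONS["pol"])`. Because the coordinates are marked as approximate, the slices should be treated as rough boundaries rather than exact open reading frames.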
TF motifs of the consensus sequence
FIMO is used to detect transcription factor motifs in the consensus sequence of the TE family.
TE_family | TFBS | Start | End | Strand | Score | Matched sequence |
---|---|---|---|---|---|---|
HERV9 | BZR1 | 4449 | 4458 | + | 16.98 | CGCACGTGCG |
HERV9 | BZR1 | 4449 | 4458 | - | 16.98 | CGCACGTGCG |
HERV9 | RAP2-6 | 4270 | 4284 | + | 16.88 | TTGCGGCGGTGGCCG |
HERV9 | Znf423 | 5363 | 5377 | + | 16.85 | GGCCCCTTTGGGGGA |
HERV9 | Prdm15 | 7241 | 7251 | - | 16.85 | CAAAACCTGGA |
HERV9 | BEH4 | 4449 | 4458 | + | 16.71 | CGCACGTGCG |
HERV9 | BEH4 | 4449 | 4458 | - | 16.71 | CGCACGTGCG |
HERV9 | sug | 7291 | 7302 | + | 16.70 | GACCCCCTGCAG |
HERV9 | BEH2 | 4449 | 4458 | + | 16.64 | CGCACGTGCG |
HERV9 | BEH2 | 4449 | 4458 | - | 16.64 | CGCACGTGCG |
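Several hits in the table above report the same matched sequence on both strands with identical scores. A short sketch can check whether a matched sequence equals its own reverse complement, in which case the window reads the same on either strand, and can assign each hit to the approximate gag, pol or env interval from the Comment field. The helper names and the `HITS` list are illustrative; the coordinates are assumed to be 1-based and inclusive.

```python
# Translation table for DNA complement (N stays N).
COMPLEMENT = str.maketrans("ACGTN", "TGCAN")

# Approximate regions from the Comment field (ASSUMED 1-based, inclusive).
REGIONS = {
    "gag": (1287, 2747),
    "pol": (2748, 6323),
    "env": (6740, 8380),
}

# One row per distinct hit position from the table above:
# (motif name, start, end, matched sequence).
HITS = [
    ("BZR1", 4449, 4458, "CGCACGTGCG"),
    ("RAP2-6", 4270, 4284, "TTGCGGCGGTGGCCG"),
    ("Znf423", 5363, 5377, "GGCCCCTTTGGGGGA"),
    ("Prdm15", 7241, 7251, "CAAAACCTGGA"),
    ("sug", 7291, 7302, "GACCCCCTGCAG"),
]


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def is_palindromic(seq: str) -> bool:
    """True if the sequence equals its own reverse complement."""
    return seq == reverse_complement(seq)


def region_of(start: int, end: int):
    """Name of the region fully containing [start, end], or None."""
    for name, (r_start, r_end) in REGIONS.items():
        if r_start <= start and end <= r_end:
            return name
    return None


if __name__ == "__main__":
    for motif, start, end, matched in HITS:
        print(motif, region_of(start, end), is_palindromic(matched))
```

Only the window at 4449-4458 is reverse-complement palindromic among these hits, which is consistent with BZR1, BEH4 and BEH2 each appearing once per strand with the same score.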
TFBS enrichment in GRCh38
Fisher's exact test is used to test for enrichment of transcription factor binding sites in copies of the TE family in GRCh38.
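As a rough illustration of this kind of test, the sketch below computes a one-sided Fisher's exact p-value from a 2x2 table using only the standard library. The table layout (TE family copies versus background elements, with or without an overlapping binding site) is an assumption, since the page does not describe how the contingency table is built, and the counts in the usage example are made up for illustration.

```python
from math import comb


def fisher_exact_greater(a: int, b: int, c: int, d: int) -> float:
    """One-sided Fisher's exact test for the 2x2 table [[a, b], [c, d]].

    ASSUMED layout (not specified by the source):
        a = family copies overlapping a TFBS
        b = family copies without a TFBS
        c = background elements overlapping a TFBS
        d = background elements without a TFBS
    Returns P(X >= a) under the hypergeometric null, i.e. the probability
    of seeing at least this many overlaps in the family by chance.
    """
    row1 = a + b          # total family copies
    col1 = a + c          # total elements overlapping a TFBS
    n = a + b + c + d     # all elements
    upper = min(row1, col1)
    # Sum hypergeometric probabilities for k = a .. upper.
    numerator = sum(
        comb(row1, k) * comb(n - row1, col1 - k) for k in range(a, upper + 1)
    )
    return numerator / comb(n, col1)


if __name__ == "__main__":
    # Hypothetical counts, for illustration only.
    p = fisher_exact_greater(30, 70, 100, 900)
    print(f"one-sided p-value: {p:.3g}")
```

A one-sided test is used here because enrichment asks whether overlaps are more frequent than expected; a two-sided variant or a multiple-testing correction across transcription factors would be a separate design choice.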