HERV9
Basic information Differential Expression Stage analysis Survival analysis Correlation analysisDF ID | DF0000173 |
---|---|
TE superfamily | ERV1 |
TE class | LTR |
Species | Catarrhini |
Length | 8436 |
Kimura value | 5.24 |
Tau index | 0.9324 |
Description | Internal region of an ERV1 endogenous retrovirus, HERV9 subfamily |
Comment | gag ~1287-2747, pol ~2748-6323, env ~6740-8380. |
Sequence |
TTTTGGCGACCACGAAGGGACCATCGCCTATCGCCAAGCGGTGAGACTATCGCCTATCGCCAAGCGGTGAGTACCATCGGACCCCTTTCGCTTGCTATTCTGTCCTATTTTTCCTTAGAATTCGGGGGCTAAATACCGGGCACCTGTCGGCCAGTTAAAAGCGACTAGCGCGGCCGCCGGACTAAAGACACGGGTGTCAGGCTTTCTGGGAAAGGGCTCTCTAACAACCCCCGACTCTTCGGAGTTGGGAGCGTTGGTTTGCCTGGAACCAGCTTCCGCTTTTCCTGTACTTCTGGGCTGAGCCGAGGGTCGACAGAGAGGAAAGCCATTCAGCTCCAGGGGTCCCGACAACAAGTTGGTTGACCCTGCGGCCATGAGCGGAACTCTCAAAGTCATGTCGCCCAAGCGAGACTCGCCCATCTATCCTATCTATCCTGACCCTTGCCTCCTGGGTCCTAATGCCTGTCAGACAAACTTCCTCTCGCCTCTCTTCTCCGAGGCTAGTCCCGCTTCTAAAAACCACTCCCTGTCTCTGGTGCTTTTCTAGTTTCTCCTATAAGAATGATTTCTAGTATAAACTCCAGGACTCTGTTACCTTCTTTAGGCACCCGGGCTCACCAATCAGAAAGACATAATTTTTGCCCAAAGCCCCATCGTAGGGGGGACTATCTGGAATTTTAGGATCCCTCCTCAGACNAGCAGGCCTAACAAAAGCTATTCCTGAAGCTAGGATATGGGGAGCCTCAGAAATTGTATCCTTCCTATTCATATAAGTGAGGACAAAAGGCGTCACTCTTCCAACTCTGGAGATCCCTTCCCTCCCTCAGGGTATGGCCCTCCACTTCATTTTTGGGGCATAACATCTTTATAGGACACGGGTAAGGTCCCAATACTAACAGGAGAATGCTTAGGACTCTAACAGGTTTTCGAGAATGCGTCGGTAAGGGCCACTAAATCCGATTTTTCTCGGTCCTCTTTGTGGTCTAGGAGGACAGGCAAGGGTGCAGGTTTTCGAGAATGCGTCGGTAAGGGCCACTAAATCCGACCTTCCTCGGTCCTCCTTGTGGTCTAGGAGGAAAACTAGTGTTTCTGCTGCTGCGTCGGTGAGCGCAACTATTCCGATCAGCAGGGTCCAGGGACCGTTGCGGGTTCTTGGGCAAGAGGTGTTTCTGCTGCTGCGTCGGTGAGCGCAACTATTCCGATCAGCAGGGTCCAGGGACCGTTGCGGGTTCTTGGGCAGGGGGAGAAACAAACAAACCAAAACCGCGGGCGGTTTTGTCTTTCAGATGGGAAACACTCAGGCATCAACAGGCTCACCCTTGAAATGCATCCTAAGCCATTGGGACCAATTTGACCCGCAAACCCTGAAAAAGAGGCGGCTCATTTTTTTCTGCACTATGGCCTGGCCCCAATATTCTCTCTCTGATGGGGAAAAATGGCCACCTGAGGGAAGTATAAATTACAATACTATCCTGCAGCTTGACCTTTTCTGTAAGAGGGAAGGCAAATGGAGTGAAATACCTTATGTCCAAGCTTTCTTTTCATTGAAGGAGAATNCACAACTATGCAAAGCTTGCAATTTACATCCCACAGGAGGACCTCTCAGCTTACCCCCATATCCTAGCCTCCCTATAGCTCCCCTTCCTATTAATGATAAGCCTCCTCTAATCTCCCCCGCCCAGAAGGAAACAAGCAAAGAAATCTCCAAAGGACCACAAAAACCCCCGGGCTATCGGTTATGTCCCCTTCAAGCTGTAGGGGGAGGGGAATTTGGCCCAACCCGGGTACATGTCCCCTTCTCCCTCTCTGATTTAAAGCAGATCAAGGCAGACCTGGGGAAGTTTTCAGATGATCCTGATAGGTACATAGATGTCCTACAGGGTCTAGGGCAAACCTTCGACCTCACTTGGAGAGATGTCATGCTATTGTTAGATCAAACCCTGGCCTTTAATGAAAAGAATGCGGCTTTAGCTGCAGCCCGAGAGTTTGGAGATACCTGGTATCTTAGTCAAGTAAATGATAGAATGACAGCCGAAGAAAGGGACAAATTCCCTACCGGTCAGCAAGCCGTCCCCAGTATGGATCCCCACTGGGACCTCGACTCAGATCATGGGGACTGGAGTCGCAAACATCTGTTGACCTGTGTTCTAGAAGGACTAAGGAGAATTAGGAAAAAGCCCATGAATTATTCAATGATGTCCACCATAACTCAGGGAAAGGAAGAAAATCCTTCTGCCTTCCTCGAGCGGCTACGGGAGGCCTTAAGAAAATATACTCCCCTGTCACCCGACTCACTCGAGGGTCAATTGATCCTAAAAGATAAGTTTATTACCCAATCAGCCGCAGATATCAGGAGAAAGCTCCAAAAGCGAGCCCTGGGCCCTGAACAAAATCTGGAGGCATTATTAAACCTGGCAACCTCGGTGTTCTATAATAGGGACCAAGAGGAACAGGCCCAAAAGGAAAAGCGAGATCAGAGAAAGGCCGCAGCCTTAGTCATGGCCCTCAGACAAACAAACCTTGGTGGTTCAGAGAGGACAGAAAATGGAGCAGGCCAATCACCCGGTAGGGCTTGTTATCAGTGTGGTTTGCAAGGACACTTTAAAAAAGATTGTCCAACGAGAAACAAGCCGCCCCCTCGCCCATGTCCACTATGCCGAGGCAATCACTGGAAGGCGCACTGCCCCAGAGGACAAAGGTTCTCTGGGCCAGAAGCCCCCAACCAGATGATCCAACAACAGGACTGAGGGTGCCCGGGGCAAGCGCCAGCTCATGTCATCACCCTCACTGAGCCCCGGGTACGTTTAACCATTGAGGGCCAGGAAATTGACTTCCTCCTGGACACTGGCGCGGCCTTCTCAGTGTTAATCTCCTGTCCCGGACGGCTGTCCTCAAGGTCCGTTACCATCCGAGGAATCCTGGGACAGCCTGTAACCAGGTATTTCTCCCACCTCCTCAGTTGTAATTGGGAGACTTTGCTCTTTTCACATGCCTTTCTTGTTATGCCTGAAAGTCCCACACCCTTATTAGGGAGGGACATATTAGCCAAAGCTGGAGCTATTATCTACATGAATATGGGGAACAAGTTACCCATTTGTTGTCCCCTACTTGAGGAGGGAATCAACCCTGAAGTCTGGGCATTGGAAGGACAATTCGGAAGGGCAAAAAATGCCCGCCCAGTCCAAATCAGGCTAAAAGACCCCACCACTTTTCCTTATCAAAGGCAATATCCCTTAAGGCCTGAAGCTCATAAAGGATTACAGGATATTGTTAGACATTTAAAAGCTCAAGGCTTAGTAAGGAAATGCAGCAGTCCCTGCAACACCCCAATTCTAGGAGTACAAAAACCGAACGGTCAGTGGAGACTAGTGCAAGATCTTAGACTCATCAATGAGGCAGTAATTCCTCTATATCCAGTTGTACCCAACCCCTATACCCTGCTCTCTCAAATACCAGAGGAAGCAGAATGGTTCACTGTTCTGGACCTCAAGGATGCCTTCTTCTGTATTCCCCTGCACTCTGACTCCCAGTTTCTCTTTGCCTTTGAGGATCCCACAGACCACACGTCCCAACTTACGTGGACGGTCTTGCCCCAAGGGTTTAGGGATAGCCCTCATCTGTTTGGTCAGGCACTGGCCCAAGATCTAGGCCACTTCTCAAGTCCAGGCACTCTGGTCCTTCAGTATGTGGATGATTTACTTTTGGCTACCAGTTCGGAAGCCTCATGCCAGCAGGCTACTCTAGATCTCTTGAACTTTCTAGCTAATCAAGGGTACAAGGCGTCTAGGTCGAAGGCCCAGCTCTGCCTACAGCAGGTCAAATATCTAGGCCTAATCTTAGCCAGAGGGACCAGGGCCCTCAGCAAGGAACGAATACAGCCTATACTGGCTTATCCTCGCCCTAAGACATTAAAACAGTTGCGGGGGTTCCTTGGAATCACCGGCTTTTGCCGACTATGGATCCCCGGATACAGCGAGATGGCCAGGCCNCTCTATACTCTAATCAAGGAGACCCAGAGGGCAAATACTCATCTAGTAGAATGGGAACCAGAGGCAGAAACAGCCTTCAAAACCTTAAAGCAGGCCCTAGTACAAGCTCCAGCCTTAAGCCTTCCCACAGGACAAAACTTCTCTTTATACGTCACAGAGAGAGCGGGGATAGCTCTTGGAGTCCTTACTCAGACTCGTGGGACAACCCCACAACCAGTGGCATACCTAAGTAAGGAAATTGATGTAGTAGCAAAAGGCTGGCCTCACTGTTTACGGGTAGTTGCGGCGGTGGCCGTCTTAGTGTCAGAGGCTATCAAAATAATACAAGGAAAGGATCTCACTGTCTGGACTACTCATGATGTAAATGGCATACTAGGTGCCAAAGGAAGTTTATGGCTATCAGACAACCGCCTGCTTAGATACCAGGCGCTACTCCTTGAGGGACCGGTGCTTCAAATACGCACGTGCGCGGCCCTCAACCCTGCCACTTTTCTCCCAGAGGATGGGGAACCAATCGAGCATGACTGCCAACAAATTATAGTCCAGACTTATGCCGCCCGAGATGATCTCTTAGAAGTCCCCTTAGCTAATCCTGACCTTAACCTATATACCGATGGAAGTTCATTTGTGGAGAATGGGATACGAAGGGCAGGTTATGCCATAGTTAGTGATGTAACNGTACTTGAAAGTAAGCCTCTTCCCCCAGGGACCAGCGCCCAGTTAGCAGAACTAGTGGCACTTACCCGAGCCTTAGAACTGGGAAAGGGAAAAAGAATAAATGTGTATACAGATAGCAAGTATGCTTATCTAATCCTACATGCCCATGCTGCAATATGGAAAGAAAGGGAGTTCCTAACCTCTGGGGGAACCCCCATTAAATACCACAAGGAAATTATGGAGTTATTGCACGCAGTGCAAAAACCCAAGGAGGTGGCAGTCTTACACTGCCAAAGCCATCAGAAAGGTGAAGGAGAAAAGGCAGAAGGAAACCGTCGGGCAGATGCTGAGGCCAAAATTGCTGCCAGGCGGAACCTCCCATTAGAAATACCTACGGAAGGACCCTTGGTATGGAACAACCCCCTCCAAGAGATTAAGCCCCAGTATTCCCCGACTGAAACAGAATGGGGACTTTCACGGGGGCATAGTTTTCTCCCCTCGGGGTGGTTAACGACAGAAGAAGGAAAGGTACTTATACCCGAAGCCAGCCAGTGGAAAATACTTAAAACCCTCCACCAAACTTTTCATATGGGTATTGAAAACACTCATCAAATGGCCAAATCCCTATTTACAGGGCCAAATCTCCTCCGGACCATCCGACAGGTAGTCAAAGCCTGTGAGGTGTGCCAAAGGAATAATCCCTTGGTCCATCGTAAGGCCCCTTTGGGGGAACAAAGAATAGGTCACTATCCCGGAGAGGACTGGCAGTTAGACTTCACCCATATGCCTAAGTCAAAGGGATTTCAATACTTGTTGGTCTGTGTTGATACCTTTACAAATTGGATAGAAGCTTTCCCCTGCAAGACAGAGAAGGCTCAGGAAGTGATTAAAGTCCTAATTCATGAAATAATTCCTAGATTTGGGCTTCCCCAAAGCTTACAGAGTGACAATGGTCCGGCTTTTAAAGCCACGATAACTCAGGGAATTTCCAGGGCGCTAGGGATACAATATCACCTTCACTGCGCCTGGAGGCCACAATCCTCAGGGAAGGTCGAGAAGGCAAATGAAACACTCAAGAGGCACTTAAGGAAACTAACACAAGAAACTCATCTCCCATGGCCTACTCTTTTGCCCATGGCCTTGTTGAGAATCCGAAATTCTCCTCACAAAATGGGGCTCAGTCCATATGAAATGCTGTATGGACGACCTTTTCTCACAAATGACCTCCTACTTGATCAGGAAACGGCCAACTTGGTCAAAGATATAACTTCTTTGGCAAAATATCAACAAAACCTTAAAAACCTACCTGAAGGATGTCACAGAGAAAAGGGAACAGAGTTGTTTCAACCAGGAGATCTAGTGTTGGTCAAATCTCTCCCCTCTACCTCCCCATCTATGGACTCTTTGTGGGAAGGACCATACTCGGTAATCCTCTCTACCCCCACTGCAGTTAAGGTGGCAGGAGTGGAATCTTGGATTCACCACACCCGAGTTAAACTTTGGACACCCCCTGAGGAACCTGCGGGACCGTCAGCTCAGGAGTCCCAAGATCAGCCAGACCAGCCTCGATACACCTGCGAACCGTTGGAGGACTTGCATCTCCTATTTCGGAAGGAAACATCCCAGACTAAAAAGGCTCCTACCACTGATCCTGAGGAAAAACCCCTTCCTCCTTAAAAAAGATAAGTGAAAACCTACATAATCTTTATCTTTAACACCTCTCCTTGCCCCTTTAATGGAATCCTTTTACTATTTCATCATATTATTAAGCAGCATACTAACCATACTCTTTGCGATAGGACTATATACTGTAGCTCCTGCCGGGACGAAAATCCTAATCACATCAACCTTCTTTCTATCTTCCTTCCTTCTGACAGCAATTTACTCCTACCTTTAACTCAGACTGGATAAAATGATCTCGTCTTCCAGAGCACCCTCTTTACCTTCCTATTTACTCTTTGCCTATCTATCCCTCCTGCTTCCTTGGATACCTCATACAATCACCCCTCCCCTTCCACTAGCTCCTAATTACCTCTACAAGACTCTCAACTTAACCCACTCTCTGTTAAACCAGTCCAATCCTTCCCTGGCAAATGACTGTTGGCTTTGTATCTCTCTATCAACCTCTGCTTACGTTGCCACTCCCATTCCCGCAAAAAANCTGGGTCTTTACCAACTTAACCTACCACCCTCGTTATGAAGGAAAAGACCCTTTCCGACTTCTAAATATGCAATCATTAGCCGACTTCCCCATCTCTGATAGGACCAAGAATACCCTAACAGGACGTGCAATCCAACTTTTACGTTCTTACATTTCCAACCTCACCTATTACACAAGCAATGAAAAGCCCATACACGGCCCTGTAACTACGAATACCATCTTAACTTTCCAAGCCCCTTTATGCATCCAACGCAACCTGTTATCAGGCCTGCCCCTGGGGCACCTACTACCCCATCAGTGTAATTACACCCTACAACTTCAAGCCCCAACTGATCATAGTAACTTCCGAGTCACCCAAACAGCTCCATTCAGATGGCTTGTCCGCTTCTCAGGGCCCCCAAAAATCATCACCTCCTCCCTGCTTAACAAACAGTCCAGGTTTTGTAATGGCAAACATACTCCCTGCATGACCATTCACCCCTGGACCCCCTGCAGCAGCGCCCCCACCACTAGTGAATGCCTTCTCATCCCCTCTTTCAATCACTCTCTCGAATGGTTCCTAGTAGATACAAAACGGTTTTTTCTCCAATGGGAAAATAGAACACAGGGAGCCACTCAGTTTGCTCCCAACACCCCTTTCCAGCCGCTCACCGGAGCTACCTTGGCAAGTACTCTAGGAGTATGGGAAAATGAAAACAACAAACTCACACACCTTTTTAACATACACAACCAGTTCTGTCTACCCAGCCAAGGCATATTCTTCTTATGTGGAACGTCGACCTATATCTGCCTCCCCACTAACTGGACAGGCACCTGCACCTTAGTCTTCCTAAGTCCCAACATTAACATTGCCCCAGGAAATCAGACCCTATCAGTGCCCCTCAAAGCTCAAGTCCGTCAGCGCAGGGCCATACAACTAATACCCCTACTTATAGGGTTAGGAATGGCTACTGCTACAGGAACCGGAATAGCCGGTTTATCTACTTCATTATCCTACTACCACACACTCTCAAAGGATTTCTCAGACAGTTTGCAAGAAATAACGAAATCTATCCTTACTCTACAATCCCAAATAGACTCTTTGGCAGCAGTGACTCTCCAAAACCGCCGAGGCCTAGACCTCCTCACTGCTGAGAAAGGAGGACTCTGCACCTTCTTAGGGGAAGAGTGTTGTTTTTACACTAACCAGTCAGGGATAGTACGAGATGCCGCCCGGCGTTTACAGGAAAAGGCTTCTGAAATCAGACAACGCCTTTCAAACTCTTATACCAACCTCTGGAGTTGGGCGACATGGCTTCTCCCCTTTCTAGGTCCCGTGGCAGCCATCTTGCTATTACTCGCCTTCGGGCCCTGTATTTTTAACCTCCTTGTCAAATTTGTTTCCTCTAGGATCGAGGCCATCAAGCTACAGATGGTCTTACAAATGGAACCCCAAATGAGCTCAACTAACAACTTCTACCGAGGACCCCTGGACCGACCCGCTGGCCCTTTCACTGGCCTAGAGAGTTCCCCTCTGGAGGACACTACAACTGCAGGGCCCCTTCTTCGCCCCTATCCAGCAGGAAGTAGCTAGAGCGGTCATCGCCCAATTCCCAACAGCAGTTGGGGTGTCCTGTTTAGAGGGGGGAT
|
TF motifs of the concenus sequence
Use FIMO to detect transcription factor motifs in the concenus sequence of the TE family.
TE_family | TFBS | Start | End | Strand | Score | Matched sequence |
---|---|---|---|---|---|---|
HERV9 | ELF1 | 5523 | 5531 | + | 16.58 | CAGGAAGTG |
HERV9 | HRS1 | 6113 | 6124 | - | 16.48 | ATCCAAGATTCC |
HERV9 | ABR1 | 4269 | 4280 | + | 16.47 | GTTGCGGCGGTG |
HERV9 | DOF5.1 | 4750 | 4768 | + | 16.45 | GAAAGGGAAAAAGAATAAA |
HERV9 | ETV5::DRGX | 5524 | 5535 | + | 16.39 | AGGAAGTGATTA |
HERV9 | ETV5::FOXI1 | 6590 | 6601 | - | 16.38 | GTAAATAGGAAG |
HERV9 | ZNF257 | 479 | 488 | - | 16.37 | GAGGCGAGAG |
HERV9 | Zfx | 7914 | 7923 | + | 16.36 | GCCGAGGCCT |
HERV9 | Stat5b | 204 | 212 | - | 16.36 | TTCCCAGAA |
HERV9 | bHLH78 | 4450 | 4457 | + | 16.29 | GCACGTGC |
TFBS enrichment in GRCh38
Use Fisher's exact test to perform enrichment analysis of transcription factor binding sites in the TE family of GRCh38.