HERV9
Basic information
DF ID | DF0000173 |
---|---|
TE superfamily | ERV1 |
TE class | LTR |
Species | Catarrhini |
Length | 8436 |
Kimura value | 5.24 |
Tau index | 0.9324 |
Description | Internal region of an ERV1 endogenous retrovirus, HERV9 subfamily |
Comment | gag ~1287-2747, pol ~2748-6323, env ~6740-8380. |
Sequence |
TTTTGGCGACCACGAAGGGACCATCGCCTATCGCCAAGCGGTGAGACTATCGCCTATCGCCAAGCGGTGAGTACCATCGGACCCCTTTCGCTTGCTATTCTGTCCTATTTTTCCTTAGAATTCGGGGGCTAAATACCGGGCACCTGTCGGCCAGTTAAAAGCGACTAGCGCGGCCGCCGGACTAAAGACACGGGTGTCAGGCTTTCTGGGAAAGGGCTCTCTAACAACCCCCGACTCTTCGGAGTTGGGAGCGTTGGTTTGCCTGGAACCAGCTTCCGCTTTTCCTGTACTTCTGGGCTGAGCCGAGGGTCGACAGAGAGGAAAGCCATTCAGCTCCAGGGGTCCCGACAACAAGTTGGTTGACCCTGCGGCCATGAGCGGAACTCTCAAAGTCATGTCGCCCAAGCGAGACTCGCCCATCTATCCTATCTATCCTGACCCTTGCCTCCTGGGTCCTAATGCCTGTCAGACAAACTTCCTCTCGCCTCTCTTCTCCGAGGCTAGTCCCGCTTCTAAAAACCACTCCCTGTCTCTGGTGCTTTTCTAGTTTCTCCTATAAGAATGATTTCTAGTATAAACTCCAGGACTCTGTTACCTTCTTTAGGCACCCGGGCTCACCAATCAGAAAGACATAATTTTTGCCCAAAGCCCCATCGTAGGGGGGACTATCTGGAATTTTAGGATCCCTCCTCAGACNAGCAGGCCTAACAAAAGCTATTCCTGAAGCTAGGATATGGGGAGCCTCAGAAATTGTATCCTTCCTATTCATATAAGTGAGGACAAAAGGCGTCACTCTTCCAACTCTGGAGATCCCTTCCCTCCCTCAGGGTATGGCCCTCCACTTCATTTTTGGGGCATAACATCTTTATAGGACACGGGTAAGGTCCCAATACTAACAGGAGAATGCTTAGGACTCTAACAGGTTTTCGAGAATGCGTCGGTAAGGGCCACTAAATCCGATTTTTCTCGGTCCTCTTTGTGGTCTAGGAGGACAGGCAAGGGTGCAGGTTTTCGAGAATGCGTCGGTAAGGGCCACTAAATCCGACCTTCCTCGGTCCTCCTTGTGGTCTAGGAGGAAAACTAGTGTTTCTGCTGCTGCGTCGGTGAGCGCAACTATTCCGATCAGCAGGGTCCAGGGACCGTTGCGGGTTCTTGGGCAAGAGGTGTTTCTGCTGCTGCGTCGGTGAGCGCAACTATTCCGATCAGCAGGGTCCAGGGACCGTTGCGGGTTCTTGGGCAGGGGGAGAAACAAACAAACCAAAACCGCGGGCGGTTTTGTCTTTCAGATGGGAAACACTCAGGCATCAACAGGCTCACCCTTGAAATGCATCCTAAGCCATTGGGACCAATTTGACCCGCAAACCCTGAAAAAGAGGCGGCTCATTTTTTTCTGCACTATGGCCTGGCCCCAATATTCTCTCTCTGATGGGGAAAAATGGCCACCTGAGGGAAGTATAAATTACAATACTATCCTGCAGCTTGACCTTTTCTGTAAGAGGGAAGGCAAATGGAGTGAAATACCTTATGTCCAAGCTTTCTTTTCATTGAAGGAGAATNCACAACTATGCAAAGCTTGCAATTTACATCCCACAGGAGGACCTCTCAGCTTACCCCCATATCCTAGCCTCCCTATAGCTCCCCTTCCTATTAATGATAAGCCTCCTCTAATCTCCCCCGCCCAGAAGGAAACAAGCAAAGAAATCTCCAAAGGACCACAAAAACCCCCGGGCTATCGGTTATGTCCCCTTCAAGCTGTAGGGGGAGGGGAATTTGGCCCAACCCGGGTACATGTCCCCTTCTCCCTCTCTGATTTAAAGCAGATCAAGGCAGACCTGGGGAAGTTTTCAGATGATCCTGATAGGTACATAGATGTCCTACAGGGTCTAGGGCAAACCTTCGACCTCACTTGGAGAGATGTCATGCTATTGTTAGATCAAACCCTGGCCTTTAATGAAAAGAATGCGGCTTTAGCTGCAGCCCGAGAGTTTGGAGATACCTGG
TATCTTAGTCAAGTAAATGATAGAATGACAGCCGAAGAAAGGGACAAATTCCCTACCGGTCAGCAAGCCGTCCCCAGTATGGATCCCCACTGGGACCTCGACTCAGATCATGGGGACTGGAGTCGCAAACATCTGTTGACCTGTGTTCTAGAAGGACTAAGGAGAATTAGGAAAAAGCCCATGAATTATTCAATGATGTCCACCATAACTCAGGGAAAGGAAGAAAATCCTTCTGCCTTCCTCGAGCGGCTACGGGAGGCCTTAAGAAAATATACTCCCCTGTCACCCGACTCACTCGAGGGTCAATTGATCCTAAAAGATAAGTTTATTACCCAATCAGCCGCAGATATCAGGAGAAAGCTCCAAAAGCGAGCCCTGGGCCCTGAACAAAATCTGGAGGCATTATTAAACCTGGCAACCTCGGTGTTCTATAATAGGGACCAAGAGGAACAGGCCCAAAAGGAAAAGCGAGATCAGAGAAAGGCCGCAGCCTTAGTCATGGCCCTCAGACAAACAAACCTTGGTGGTTCAGAGAGGACAGAAAATGGAGCAGGCCAATCACCCGGTAGGGCTTGTTATCAGTGTGGTTTGCAAGGACACTTTAAAAAAGATTGTCCAACGAGAAACAAGCCGCCCCCTCGCCCATGTCCACTATGCCGAGGCAATCACTGGAAGGCGCACTGCCCCAGAGGACAAAGGTTCTCTGGGCCAGAAGCCCCCAACCAGATGATCCAACAACAGGACTGAGGGTGCCCGGGGCAAGCGCCAGCTCATGTCATCACCCTCACTGAGCCCCGGGTACGTTTAACCATTGAGGGCCAGGAAATTGACTTCCTCCTGGACACTGGCGCGGCCTTCTCAGTGTTAATCTCCTGTCCCGGACGGCTGTCCTCAAGGTCCGTTACCATCCGAGGAATCCTGGGACAGCCTGTAACCAGGTATTTCTCCCACCTCCTCAGTTGTAATTGGGAGACTTTGCTCTTTTCACATGCCTTTCTTGTTATGCCTGAAAGTCCCACACCCTTATTAGGGAGGGACATATTAGCCAAAGCTGGAGCTATTATCTACATGAATATGGGGAACAAGTTACCCATTTGTTGTCCCCTACTTGAGGAGGGAATCAACCCTGAAGTCTGGGCATTGGAAGGACAATTCGGAAGGGCAAAAAATGCCCGCCCAGTCCAAATCAGGCTAAAAGACCCCACCACTTTTCCTTATCAAAGGCAATATCCCTTAAGGCCTGAAGCTCATAAAGGATTACAGGATATTGTTAGACATTTAAAAGCTCAAGGCTTAGTAAGGAAATGCAGCAGTCCCTGCAACACCCCAATTCTAGGAGTACAAAAACCGAACGGTCAGTGGAGACTAGTGCAAGATCTTAGACTCATCAATGAGGCAGTAATTCCTCTATATCCAGTTGTACCCAACCCCTATACCCTGCTCTCTCAAATACCAGAGGAAGCAGAATGGTTCACTGTTCTGGACCTCAAGGATGCCTTCTTCTGTATTCCCCTGCACTCTGACTCCCAGTTTCTCTTTGCCTTTGAGGATCCCACAGACCACACGTCCCAACTTACGTGGACGGTCTTGCCCCAAGGGTTTAGGGATAGCCCTCATCTGTTTGGTCAGGCACTGGCCCAAGATCTAGGCCACTTCTCAAGTCCAGGCACTCTGGTCCTTCAGTATGTGGATGATTTACTTTTGGCTACCAGTTCGGAAGCCTCATGCCAGCAGGCTACTCTAGATCTCTTGAACTTTCTAGCTAATCAAGGGTACAAGGCGTCTAGGTCGAAGGCCCAGCTCTGCCTACAGCAGGTCAAATATCTAGGCCTAATCTTAGCCAGAGGGACCAGGGCCCTCAGCAAGGAACGAATACAGCCTATACTGGCTTATCCTCGCCCTAAGACATTAAAACAGTTGCGGGGGTTCCTTGGAATCACCGGCTTTTGCCGACTATGGATCCCCGGATACAGCGAGATGGCCAGGCCNCTCTATACTCT
AATCAAGGAGACCCAGAGGGCAAATACTCATCTAGTAGAATGGGAACCAGAGGCAGAAACAGCCTTCAAAACCTTAAAGCAGGCCCTAGTACAAGCTCCAGCCTTAAGCCTTCCCACAGGACAAAACTTCTCTTTATACGTCACAGAGAGAGCGGGGATAGCTCTTGGAGTCCTTACTCAGACTCGTGGGACAACCCCACAACCAGTGGCATACCTAAGTAAGGAAATTGATGTAGTAGCAAAAGGCTGGCCTCACTGTTTACGGGTAGTTGCGGCGGTGGCCGTCTTAGTGTCAGAGGCTATCAAAATAATACAAGGAAAGGATCTCACTGTCTGGACTACTCATGATGTAAATGGCATACTAGGTGCCAAAGGAAGTTTATGGCTATCAGACAACCGCCTGCTTAGATACCAGGCGCTACTCCTTGAGGGACCGGTGCTTCAAATACGCACGTGCGCGGCCCTCAACCCTGCCACTTTTCTCCCAGAGGATGGGGAACCAATCGAGCATGACTGCCAACAAATTATAGTCCAGACTTATGCCGCCCGAGATGATCTCTTAGAAGTCCCCTTAGCTAATCCTGACCTTAACCTATATACCGATGGAAGTTCATTTGTGGAGAATGGGATACGAAGGGCAGGTTATGCCATAGTTAGTGATGTAACNGTACTTGAAAGTAAGCCTCTTCCCCCAGGGACCAGCGCCCAGTTAGCAGAACTAGTGGCACTTACCCGAGCCTTAGAACTGGGAAAGGGAAAAAGAATAAATGTGTATACAGATAGCAAGTATGCTTATCTAATCCTACATGCCCATGCTGCAATATGGAAAGAAAGGGAGTTCCTAACCTCTGGGGGAACCCCCATTAAATACCACAAGGAAATTATGGAGTTATTGCACGCAGTGCAAAAACCCAAGGAGGTGGCAGTCTTACACTGCCAAAGCCATCAGAAAGGTGAAGGAGAAAAGGCAGAAGGAAACCGTCGGGCAGATGCTGAGGCCAAAATTGCTGCCAGGCGGAACCTCCCATTAGAAATACCTACGGAAGGACCCTTGGTATGGAACAACCCCCTCCAAGAGATTAAGCCCCAGTATTCCCCGACTGAAACAGAATGGGGACTTTCACGGGGGCATAGTTTTCTCCCCTCGGGGTGGTTAACGACAGAAGAAGGAAAGGTACTTATACCCGAAGCCAGCCAGTGGAAAATACTTAAAACCCTCCACCAAACTTTTCATATGGGTATTGAAAACACTCATCAAATGGCCAAATCCCTATTTACAGGGCCAAATCTCCTCCGGACCATCCGACAGGTAGTCAAAGCCTGTGAGGTGTGCCAAAGGAATAATCCCTTGGTCCATCGTAAGGCCCCTTTGGGGGAACAAAGAATAGGTCACTATCCCGGAGAGGACTGGCAGTTAGACTTCACCCATATGCCTAAGTCAAAGGGATTTCAATACTTGTTGGTCTGTGTTGATACCTTTACAAATTGGATAGAAGCTTTCCCCTGCAAGACAGAGAAGGCTCAGGAAGTGATTAAAGTCCTAATTCATGAAATAATTCCTAGATTTGGGCTTCCCCAAAGCTTACAGAGTGACAATGGTCCGGCTTTTAAAGCCACGATAACTCAGGGAATTTCCAGGGCGCTAGGGATACAATATCACCTTCACTGCGCCTGGAGGCCACAATCCTCAGGGAAGGTCGAGAAGGCAAATGAAACACTCAAGAGGCACTTAAGGAAACTAACACAAGAAACTCATCTCCCATGGCCTACTCTTTTGCCCATGGCCTTGTTGAGAATCCGAAATTCTCCTCACAAAATGGGGCTCAGTCCATATGAAATGCTGTATGGACGACCTTTTCTCACAAATGACCTCCTACTTGATCAGGAAACGGCCAACTTGGTCAAAGATATAACTTCTTTGGCAAAATATCAACAAAACCTTAAAAACCTACCTGAAGGATGTCACAGAGAAAAGGGAACAGAGTTGTTTCAACCAGGAG
ATCTAGTGTTGGTCAAATCTCTCCCCTCTACCTCCCCATCTATGGACTCTTTGTGGGAAGGACCATACTCGGTAATCCTCTCTACCCCCACTGCAGTTAAGGTGGCAGGAGTGGAATCTTGGATTCACCACACCCGAGTTAAACTTTGGACACCCCCTGAGGAACCTGCGGGACCGTCAGCTCAGGAGTCCCAAGATCAGCCAGACCAGCCTCGATACACCTGCGAACCGTTGGAGGACTTGCATCTCCTATTTCGGAAGGAAACATCCCAGACTAAAAAGGCTCCTACCACTGATCCTGAGGAAAAACCCCTTCCTCCTTAAAAAAGATAAGTGAAAACCTACATAATCTTTATCTTTAACACCTCTCCTTGCCCCTTTAATGGAATCCTTTTACTATTTCATCATATTATTAAGCAGCATACTAACCATACTCTTTGCGATAGGACTATATACTGTAGCTCCTGCCGGGACGAAAATCCTAATCACATCAACCTTCTTTCTATCTTCCTTCCTTCTGACAGCAATTTACTCCTACCTTTAACTCAGACTGGATAAAATGATCTCGTCTTCCAGAGCACCCTCTTTACCTTCCTATTTACTCTTTGCCTATCTATCCCTCCTGCTTCCTTGGATACCTCATACAATCACCCCTCCCCTTCCACTAGCTCCTAATTACCTCTACAAGACTCTCAACTTAACCCACTCTCTGTTAAACCAGTCCAATCCTTCCCTGGCAAATGACTGTTGGCTTTGTATCTCTCTATCAACCTCTGCTTACGTTGCCACTCCCATTCCCGCAAAAAANCTGGGTCTTTACCAACTTAACCTACCACCCTCGTTATGAAGGAAAAGACCCTTTCCGACTTCTAAATATGCAATCATTAGCCGACTTCCCCATCTCTGATAGGACCAAGAATACCCTAACAGGACGTGCAATCCAACTTTTACGTTCTTACATTTCCAACCTCACCTATTACACAAGCAATGAAAAGCCCATACACGGCCCTGTAACTACGAATACCATCTTAACTTTCCAAGCCCCTTTATGCATCCAACGCAACCTGTTATCAGGCCTGCCCCTGGGGCACCTACTACCCCATCAGTGTAATTACACCCTACAACTTCAAGCCCCAACTGATCATAGTAACTTCCGAGTCACCCAAACAGCTCCATTCAGATGGCTTGTCCGCTTCTCAGGGCCCCCAAAAATCATCACCTCCTCCCTGCTTAACAAACAGTCCAGGTTTTGTAATGGCAAACATACTCCCTGCATGACCATTCACCCCTGGACCCCCTGCAGCAGCGCCCCCACCACTAGTGAATGCCTTCTCATCCCCTCTTTCAATCACTCTCTCGAATGGTTCCTAGTAGATACAAAACGGTTTTTTCTCCAATGGGAAAATAGAACACAGGGAGCCACTCAGTTTGCTCCCAACACCCCTTTCCAGCCGCTCACCGGAGCTACCTTGGCAAGTACTCTAGGAGTATGGGAAAATGAAAACAACAAACTCACACACCTTTTTAACATACACAACCAGTTCTGTCTACCCAGCCAAGGCATATTCTTCTTATGTGGAACGTCGACCTATATCTGCCTCCCCACTAACTGGACAGGCACCTGCACCTTAGTCTTCCTAAGTCCCAACATTAACATTGCCCCAGGAAATCAGACCCTATCAGTGCCCCTCAAAGCTCAAGTCCGTCAGCGCAGGGCCATACAACTAATACCCCTACTTATAGGGTTAGGAATGGCTACTGCTACAGGAACCGGAATAGCCGGTTTATCTACTTCATTATCCTACTACCACACACTCTCAAAGGATTTCTCAGACAGTTTGCAAGAAATAACGAAATCTATCCTTACTCTACAATCCCAAATAGACTCTTTGGCAGCAGTGACTCTCCAAAACCGCCGAGGCCTAGACCTCCTCACTGCTGAGAAAGGAGGACTCTGCACCTTCTTAGGGGAAGAGTGTTGTTTTTACACTAACCAGTCAG
GGATAGTACGAGATGCCGCCCGGCGTTTACAGGAAAAGGCTTCTGAAATCAGACAACGCCTTTCAAACTCTTATACCAACCTCTGGAGTTGGGCGACATGGCTTCTCCCCTTTCTAGGTCCCGTGGCAGCCATCTTGCTATTACTCGCCTTCGGGCCCTGTATTTTTAACCTCCTTGTCAAATTTGTTTCCTCTAGGATCGAGGCCATCAAGCTACAGATGGTCTTACAAATGGAACCCCAAATGAGCTCAACTAACAACTTCTACCGAGGACCCCTGGACCGACCCGCTGGCCCTTTCACTGGCCTAGAGAGTTCCCCTCTGGAGGACACTACAACTGCAGGGCCCCTTCTTCGCCCCTATCCAGCAGGAAGTAGCTAGAGCGGTCATCGCCCAATTCCCAACAGCAGTTGGGGTGTCCTGTTTAGAGGGGGGAT
|
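The Comment field gives approximate coordinates for the gag, pol and env regions. A minimal sketch of how those regions could be sliced out of the consensus is shown below. It assumes the coordinates are 1-based and inclusive, which the record does not state, and the helper names `slice_region` and `region_lengths` are hypothetical.

```python
# Approximate coding-region coordinates copied from the Comment field.
# Assumption: coordinates are 1-based and inclusive on the 8436 bp consensus.
REGIONS = {
    "gag": (1287, 2747),
    "pol": (2748, 6323),
    "env": (6740, 8380),
}
CONSENSUS_LENGTH = 8436


def slice_region(consensus: str, start: int, end: int) -> str:
    """Return consensus[start..end] using 1-based inclusive coordinates."""
    if not (1 <= start <= end <= len(consensus)):
        raise ValueError(f"region {start}-{end} is outside the sequence")
    return consensus[start - 1:end]


def region_lengths(regions: dict) -> dict:
    """Length in bp of each region under the 1-based inclusive convention."""
    return {name: end - start + 1 for name, (start, end) in regions.items()}


if __name__ == "__main__":
    # Placeholder sequence of the right length; paste the real consensus here.
    placeholder = "ACGT" * (CONSENSUS_LENGTH // 4)
    for name, (start, end) in REGIONS.items():
        region = slice_region(placeholder, start, end)
        print(name, start, end, len(region))
```

Under this convention the three regions are 1461, 3576 and 1641 bp long. Because the coordinates are marked as approximate, any translation of the slices should be treated as exploratory.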
TF motifs of the consensus sequence
Use FIMO to detect transcription factor motifs in the consensus sequence of the TE family.
TE_family | TFBS | Start | End | Strand | Score | Matched sequence |
---|---|---|---|---|---|---|
HERV9 | BPC5 | 6745 | 6774 | - | -47.02 | AGAGGTTGATAGAGAGATACAAAGCCAACA |
HERV9 | BPC5 | 3514 | 3543 | - | -47.13 | AGGCAAAGAGAAACTGGGAGTCAGAGTGCA |
HERV9 | BPC5 | 6599 | 6628 | - | -47.27 | GAAGCAGGAGGGATAGATAGGCAAAGAGTA |
HERV9 | BPC5 | 7323 | 7352 | - | -47.79 | AGTGATTGAAAGAGGGGATGAGAAGGCATT |
HERV9 | BPC5 | 6585 | 6614 | - | -47.85 | AGATAGGCAAAGAGTAAATAGGAAGGTAAA |
HERV9 | BPC5 | 3508 | 3537 | - | -48.48 | AGAGAAACTGGGAGTCAGAGTGCAGGGGAA |
HERV9 | BPC5 | 6593 | 6622 | - | -48.59 | GGAGGGATAGATAGGCAAAGAGTAAATAGG |
HERV9 | BPC5 | 6689 | 6718 | - | -48.82 | GGTTTAACAGAGAGTGGGTTAAGTTGAGAG |
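The rows above can be read programmatically. The sketch below parses one pipe-delimited row and, for a hit on the minus strand, recovers the forward-strand sequence by reverse complementing the matched sequence. It assumes that Start and End are 1-based inclusive (each row spans End - Start + 1 = 30 bases, matching the 30-base matched sequence) and that minus-strand matches are reported as the reverse complement of the consensus. The names `MotifHit`, `parse_hit` and `forward_sequence` are hypothetical.

```python
from typing import NamedTuple

_COMPLEMENT = str.maketrans("ACGTNacgtn", "TGCANtgcan")


def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA string (N is kept as N)."""
    return seq.translate(_COMPLEMENT)[::-1]


class MotifHit(NamedTuple):
    family: str
    tfbs: str
    start: int
    end: int
    strand: str
    score: float
    matched: str


def parse_hit(row: str) -> MotifHit:
    """Parse one row of the motif table, e.g. 'HERV9 | BPC5 | 6745 | ... |'."""
    fields = [f.strip() for f in row.strip().strip("|").split("|")]
    family, tfbs, start, end, strand, score, matched = fields
    return MotifHit(family, tfbs, int(start), int(end), strand,
                    float(score), matched)


def forward_sequence(hit: MotifHit) -> str:
    """Sequence as it would appear on the forward strand of the consensus."""
    if hit.strand == "-":
        return reverse_complement(hit.matched)
    return hit.matched


if __name__ == "__main__":
    row = "HERV9 | BPC5 | 6745 | 6774 | - | -47.02 | AGAGGTTGATAGAGAGATACAAAGCCAACA |"
    hit = parse_hit(row)
    print(hit.end - hit.start + 1, forward_sequence(hit))
```

For the first row, the recovered forward-strand string is `TGTTGGCTTTGTATCTCTCTATCAACCTCT`, which does occur in the consensus sequence listed above.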
TFBS enrichment in GRCh38
Use Fisher's exact test to assess enrichment of transcription factor binding sites within copies of this TE family in the GRCh38 genome assembly.
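As an illustration of the test named above, the sketch below computes a one-sided Fisher's exact p-value for a 2x2 contingency table using only the standard library. The layout of the table (binding sites inside versus outside the TE family, against positions without a binding site) and the counts are assumptions made for the example. The record does not say how the table is built or whether a one-sided or two-sided test is used.

```python
from math import comb


def fisher_enrichment_p(a: int, b: int, c: int, d: int) -> float:
    """One-sided Fisher's exact p-value for enrichment in a 2x2 table.

    Hypothetical layout (not specified by the source):
        a = TFBS inside the TE family    b = non-TFBS inside the TE family
        c = TFBS outside the TE family   d = non-TFBS outside the TE family

    Returns P(X >= a) under the hypergeometric null with fixed margins.
    """
    row1 = a + b          # everything inside the TE family
    col1 = a + c          # all TFBS
    n = a + b + c + d     # grand total
    total = comb(n, col1)
    # Sum the probabilities of tables at least as extreme as the observed one.
    tail = sum(
        comb(row1, k) * comb(n - row1, col1 - k)
        for k in range(a, min(row1, col1) + 1)
    )
    return tail / total


if __name__ == "__main__":
    # Made-up counts, for illustration only.
    print(fisher_enrichment_p(3, 1, 1, 3))
```

A small p-value would indicate that binding sites overlap the family more often than expected from the margins alone. In practice a multiple-testing correction across transcription factors would normally follow.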