HERV9
Basic information Differential Expression Stage analysis Survival analysis Correlation analysisDF ID | DF0000173 |
---|---|
TE superfamily | ERV1 |
TE class | LTR |
Species | Catarrhini |
Length | 8436 |
Kimura value | 5.24 |
Tau index | 0.9324 |
Description | Internal region of an ERV1 endogenous retrovirus, HERV9 subfamily |
Comment | gag ~1287-2747, pol ~2748-6323, env ~6740-8380. |
Sequence |
TTTTGGCGACCACGAAGGGACCATCGCCTATCGCCAAGCGGTGAGACTATCGCCTATCGCCAAGCGGTGAGTACCATCGGACCCCTTTCGCTTGCTATTCTGTCCTATTTTTCCTTAGAATTCGGGGGCTAAATACCGGGCACCTGTCGGCCAGTTAAAAGCGACTAGCGCGGCCGCCGGACTAAAGACACGGGTGTCAGGCTTTCTGGGAAAGGGCTCTCTAACAACCCCCGACTCTTCGGAGTTGGGAGCGTTGGTTTGCCTGGAACCAGCTTCCGCTTTTCCTGTACTTCTGGGCTGAGCCGAGGGTCGACAGAGAGGAAAGCCATTCAGCTCCAGGGGTCCCGACAACAAGTTGGTTGACCCTGCGGCCATGAGCGGAACTCTCAAAGTCATGTCGCCCAAGCGAGACTCGCCCATCTATCCTATCTATCCTGACCCTTGCCTCCTGGGTCCTAATGCCTGTCAGACAAACTTCCTCTCGCCTCTCTTCTCCGAGGCTAGTCCCGCTTCTAAAAACCACTCCCTGTCTCTGGTGCTTTTCTAGTTTCTCCTATAAGAATGATTTCTAGTATAAACTCCAGGACTCTGTTACCTTCTTTAGGCACCCGGGCTCACCAATCAGAAAGACATAATTTTTGCCCAAAGCCCCATCGTAGGGGGGACTATCTGGAATTTTAGGATCCCTCCTCAGACNAGCAGGCCTAACAAAAGCTATTCCTGAAGCTAGGATATGGGGAGCCTCAGAAATTGTATCCTTCCTATTCATATAAGTGAGGACAAAAGGCGTCACTCTTCCAACTCTGGAGATCCCTTCCCTCCCTCAGGGTATGGCCCTCCACTTCATTTTTGGGGCATAACATCTTTATAGGACACGGGTAAGGTCCCAATACTAACAGGAGAATGCTTAGGACTCTAACAGGTTTTCGAGAATGCGTCGGTAAGGGCCACTAAATCCGATTTTTCTCGGTCCTCTTTGTGGTCTAGGAGGACAGGCAAGGGTGCAGGTTTTCGAGAATGCGTCGGTAAGGGCCACTAAATCCGACCTTCCTCGGTCCTCCTTGTGGTCTAGGAGGAAAACTAGTGTTTCTGCTGCTGCGTCGGTGAGCGCAACTATTCCGATCAGCAGGGTCCAGGGACCGTTGCGGGTTCTTGGGCAAGAGGTGTTTCTGCTGCTGCGTCGGTGAGCGCAACTATTCCGATCAGCAGGGTCCAGGGACCGTTGCGGGTTCTTGGGCAGGGGGAGAAACAAACAAACCAAAACCGCGGGCGGTTTTGTCTTTCAGATGGGAAACACTCAGGCATCAACAGGCTCACCCTTGAAATGCATCCTAAGCCATTGGGACCAATTTGACCCGCAAACCCTGAAAAAGAGGCGGCTCATTTTTTTCTGCACTATGGCCTGGCCCCAATATTCTCTCTCTGATGGGGAAAAATGGCCACCTGAGGGAAGTATAAATTACAATACTATCCTGCAGCTTGACCTTTTCTGTAAGAGGGAAGGCAAATGGAGTGAAATACCTTATGTCCAAGCTTTCTTTTCATTGAAGGAGAATNCACAACTATGCAAAGCTTGCAATTTACATCCCACAGGAGGACCTCTCAGCTTACCCCCATATCCTAGCCTCCCTATAGCTCCCCTTCCTATTAATGATAAGCCTCCTCTAATCTCCCCCGCCCAGAAGGAAACAAGCAAAGAAATCTCCAAAGGACCACAAAAACCCCCGGGCTATCGGTTATGTCCCCTTCAAGCTGTAGGGGGAGGGGAATTTGGCCCAACCCGGGTACATGTCCCCTTCTCCCTCTCTGATTTAAAGCAGATCAAGGCAGACCTGGGGAAGTTTTCAGATGATCCTGATAGGTACATAGATGTCCTACAGGGTCTAGGGCAAACCTTCGACCTCACTTGGAGAGATGTCATGCTATTGTTAGATCAAACCCTGGCCTTTAATGAAAAGAATGCGGCTTTAGCTGCAGCCCGAGAGTTTGGAGATACCTGGTATCTTAGTCAAGTAAATGATAGAATGACAGCCGAAGAAAGGGACAAATTCCCTACCGGTCAGCAAGCCGTCCCCAGTATGGATCCCCACTGGGACCTCGACTCAGATCATGGGGACTGGAGTCGCAAACATCTGTTGACCTGTGTTCTAGAAGGACTAAGGAGAATTAGGAAAAAGCCCATGAATTATTCAATGATGTCCACCATAACTCAGGGAAAGGAAGAAAATCCTTCTGCCTTCCTCGAGCGGCTACGGGAGGCCTTAAGAAAATATACTCCCCTGTCACCCGACTCACTCGAGGGTCAATTGATCCTAAAAGATAAGTTTATTACCCAATCAGCCGCAGATATCAGGAGAAAGCTCCAAAAGCGAGCCCTGGGCCCTGAACAAAATCTGGAGGCATTATTAAACCTGGCAACCTCGGTGTTCTATAATAGGGACCAAGAGGAACAGGCCCAAAAGGAAAAGCGAGATCAGAGAAAGGCCGCAGCCTTAGTCATGGCCCTCAGACAAACAAACCTTGGTGGTTCAGAGAGGACAGAAAATGGAGCAGGCCAATCACCCGGTAGGGCTTGTTATCAGTGTGGTTTGCAAGGACACTTTAAAAAAGATTGTCCAACGAGAAACAAGCCGCCCCCTCGCCCATGTCCACTATGCCGAGGCAATCACTGGAAGGCGCACTGCCCCAGAGGACAAAGGTTCTCTGGGCCAGAAGCCCCCAACCAGATGATCCAACAACAGGACTGAGGGTGCCCGGGGCAAGCGCCAGCTCATGTCATCACCCTCACTGAGCCCCGGGTACGTTTAACCATTGAGGGCCAGGAAATTGACTTCCTCCTGGACACTGGCGCGGCCTTCTCAGTGTTAATCTCCTGTCCCGGACGGCTGTCCTCAAGGTCCGTTACCATCCGAGGAATCCTGGGACAGCCTGTAACCAGGTATTTCTCCCACCTCCTCAGTTGTAATTGGGAGACTTTGCTCTTTTCACATGCCTTTCTTGTTATGCCTGAAAGTCCCACACCCTTATTAGGGAGGGACATATTAGCCAAAGCTGGAGCTATTATCTACATGAATATGGGGAACAAGTTACCCATTTGTTGTCCCCTACTTGAGGAGGGAATCAACCCTGAAGTCTGGGCATTGGAAGGACAATTCGGAAGGGCAAAAAATGCCCGCCCAGTCCAAATCAGGCTAAAAGACCCCACCACTTTTCCTTATCAAAGGCAATATCCCTTAAGGCCTGAAGCTCATAAAGGATTACAGGATATTGTTAGACATTTAAAAGCTCAAGGCTTAGTAAGGAAATGCAGCAGTCCCTGCAACACCCCAATTCTAGGAGTACAAAAACCGAACGGTCAGTGGAGACTAGTGCAAGATCTTAGACTCATCAATGAGGCAGTAATTCCTCTATATCCAGTTGTACCCAACCCCTATACCCTGCTCTCTCAAATACCAGAGGAAGCAGAATGGTTCACTGTTCTGGACCTCAAGGATGCCTTCTTCTGTATTCCCCTGCACTCTGACTCCCAGTTTCTCTTTGCCTTTGAGGATCCCACAGACCACACGTCCCAACTTACGTGGACGGTCTTGCCCCAAGGGTTTAGGGATAGCCCTCATCTGTTTGGTCAGGCACTGGCCCAAGATCTAGGCCACTTCTCAAGTCCAGGCACTCTGGTCCTTCAGTATGTGGATGATTTACTTTTGGCTACCAGTTCGGAAGCCTCATGCCAGCAGGCTACTCTAGATCTCTTGAACTTTCTAGCTAATCAAGGGTACAAGGCGTCTAGGTCGAAGGCCCAGCTCTGCCTACAGCAGGTCAAATATCTAGGCCTAATCTTAGCCAGAGGGACCAGGGCCCTCAGCAAGGAACGAATACAGCCTATACTGGCTTATCCTCGCCCTAAGACATTAAAACAGTTGCGGGGGTTCCTTGGAATCACCGGCTTTTGCCGACTATGGATCCCCGGATACAGCGAGATGGCCAGGCCNCTCTATACTCTAATCAAGGAGACCCAGAGGGCAAATACTCATCTAGTAGAATGGGAACCAGAGGCAGAAACAGCCTTCAAAACCTTAAAGCAGGCCCTAGTACAAGCTCCAGCCTTAAGCCTTCCCACAGGACAAAACTTCTCTTTATACGTCACAGAGAGAGCGGGGATAGCTCTTGGAGTCCTTACTCAGACTCGTGGGACAACCCCACAACCAGTGGCATACCTAAGTAAGGAAATTGATGTAGTAGCAAAAGGCTGGCCTCACTGTTTACGGGTAGTTGCGGCGGTGGCCGTCTTAGTGTCAGAGGCTATCAAAATAATACAAGGAAAGGATCTCACTGTCTGGACTACTCATGATGTAAATGGCATACTAGGTGCCAAAGGAAGTTTATGGCTATCAGACAACCGCCTGCTTAGATACCAGGCGCTACTCCTTGAGGGACCGGTGCTTCAAATACGCACGTGCGCGGCCCTCAACCCTGCCACTTTTCTCCCAGAGGATGGGGAACCAATCGAGCATGACTGCCAACAAATTATAGTCCAGACTTATGCCGCCCGAGATGATCTCTTAGAAGTCCCCTTAGCTAATCCTGACCTTAACCTATATACCGATGGAAGTTCATTTGTGGAGAATGGGATACGAAGGGCAGGTTATGCCATAGTTAGTGATGTAACNGTACTTGAAAGTAAGCCTCTTCCCCCAGGGACCAGCGCCCAGTTAGCAGAACTAGTGGCACTTACCCGAGCCTTAGAACTGGGAAAGGGAAAAAGAATAAATGTGTATACAGATAGCAAGTATGCTTATCTAATCCTACATGCCCATGCTGCAATATGGAAAGAAAGGGAGTTCCTAACCTCTGGGGGAACCCCCATTAAATACCACAAGGAAATTATGGAGTTATTGCACGCAGTGCAAAAACCCAAGGAGGTGGCAGTCTTACACTGCCAAAGCCATCAGAAAGGTGAAGGAGAAAAGGCAGAAGGAAACCGTCGGGCAGATGCTGAGGCCAAAATTGCTGCCAGGCGGAACCTCCCATTAGAAATACCTACGGAAGGACCCTTGGTATGGAACAACCCCCTCCAAGAGATTAAGCCCCAGTATTCCCCGACTGAAACAGAATGGGGACTTTCACGGGGGCATAGTTTTCTCCCCTCGGGGTGGTTAACGACAGAAGAAGGAAAGGTACTTATACCCGAAGCCAGCCAGTGGAAAATACTTAAAACCCTCCACCAAACTTTTCATATGGGTATTGAAAACACTCATCAAATGGCCAAATCCCTATTTACAGGGCCAAATCTCCTCCGGACCATCCGACAGGTAGTCAAAGCCTGTGAGGTGTGCCAAAGGAATAATCCCTTGGTCCATCGTAAGGCCCCTTTGGGGGAACAAAGAATAGGTCACTATCCCGGAGAGGACTGGCAGTTAGACTTCACCCATATGCCTAAGTCAAAGGGATTTCAATACTTGTTGGTCTGTGTTGATACCTTTACAAATTGGATAGAAGCTTTCCCCTGCAAGACAGAGAAGGCTCAGGAAGTGATTAAAGTCCTAATTCATGAAATAATTCCTAGATTTGGGCTTCCCCAAAGCTTACAGAGTGACAATGGTCCGGCTTTTAAAGCCACGATAACTCAGGGAATTTCCAGGGCGCTAGGGATACAATATCACCTTCACTGCGCCTGGAGGCCACAATCCTCAGGGAAGGTCGAGAAGGCAAATGAAACACTCAAGAGGCACTTAAGGAAACTAACACAAGAAACTCATCTCCCATGGCCTACTCTTTTGCCCATGGCCTTGTTGAGAATCCGAAATTCTCCTCACAAAATGGGGCTCAGTCCATATGAAATGCTGTATGGACGACCTTTTCTCACAAATGACCTCCTACTTGATCAGGAAACGGCCAACTTGGTCAAAGATATAACTTCTTTGGCAAAATATCAACAAAACCTTAAAAACCTACCTGAAGGATGTCACAGAGAAAAGGGAACAGAGTTGTTTCAACCAGGAGATCTAGTGTTGGTCAAATCTCTCCCCTCTACCTCCCCATCTATGGACTCTTTGTGGGAAGGACCATACTCGGTAATCCTCTCTACCCCCACTGCAGTTAAGGTGGCAGGAGTGGAATCTTGGATTCACCACACCCGAGTTAAACTTTGGACACCCCCTGAGGAACCTGCGGGACCGTCAGCTCAGGAGTCCCAAGATCAGCCAGACCAGCCTCGATACACCTGCGAACCGTTGGAGGACTTGCATCTCCTATTTCGGAAGGAAACATCCCAGACTAAAAAGGCTCCTACCACTGATCCTGAGGAAAAACCCCTTCCTCCTTAAAAAAGATAAGTGAAAACCTACATAATCTTTATCTTTAACACCTCTCCTTGCCCCTTTAATGGAATCCTTTTACTATTTCATCATATTATTAAGCAGCATACTAACCATACTCTTTGCGATAGGACTATATACTGTAGCTCCTGCCGGGACGAAAATCCTAATCACATCAACCTTCTTTCTATCTTCCTTCCTTCTGACAGCAATTTACTCCTACCTTTAACTCAGACTGGATAAAATGATCTCGTCTTCCAGAGCACCCTCTTTACCTTCCTATTTACTCTTTGCCTATCTATCCCTCCTGCTTCCTTGGATACCTCATACAATCACCCCTCCCCTTCCACTAGCTCCTAATTACCTCTACAAGACTCTCAACTTAACCCACTCTCTGTTAAACCAGTCCAATCCTTCCCTGGCAAATGACTGTTGGCTTTGTATCTCTCTATCAACCTCTGCTTACGTTGCCACTCCCATTCCCGCAAAAAANCTGGGTCTTTACCAACTTAACCTACCACCCTCGTTATGAAGGAAAAGACCCTTTCCGACTTCTAAATATGCAATCATTAGCCGACTTCCCCATCTCTGATAGGACCAAGAATACCCTAACAGGACGTGCAATCCAACTTTTACGTTCTTACATTTCCAACCTCACCTATTACACAAGCAATGAAAAGCCCATACACGGCCCTGTAACTACGAATACCATCTTAACTTTCCAAGCCCCTTTATGCATCCAACGCAACCTGTTATCAGGCCTGCCCCTGGGGCACCTACTACCCCATCAGTGTAATTACACCCTACAACTTCAAGCCCCAACTGATCATAGTAACTTCCGAGTCACCCAAACAGCTCCATTCAGATGGCTTGTCCGCTTCTCAGGGCCCCCAAAAATCATCACCTCCTCCCTGCTTAACAAACAGTCCAGGTTTTGTAATGGCAAACATACTCCCTGCATGACCATTCACCCCTGGACCCCCTGCAGCAGCGCCCCCACCACTAGTGAATGCCTTCTCATCCCCTCTTTCAATCACTCTCTCGAATGGTTCCTAGTAGATACAAAACGGTTTTTTCTCCAATGGGAAAATAGAACACAGGGAGCCACTCAGTTTGCTCCCAACACCCCTTTCCAGCCGCTCACCGGAGCTACCTTGGCAAGTACTCTAGGAGTATGGGAAAATGAAAACAACAAACTCACACACCTTTTTAACATACACAACCAGTTCTGTCTACCCAGCCAAGGCATATTCTTCTTATGTGGAACGTCGACCTATATCTGCCTCCCCACTAACTGGACAGGCACCTGCACCTTAGTCTTCCTAAGTCCCAACATTAACATTGCCCCAGGAAATCAGACCCTATCAGTGCCCCTCAAAGCTCAAGTCCGTCAGCGCAGGGCCATACAACTAATACCCCTACTTATAGGGTTAGGAATGGCTACTGCTACAGGAACCGGAATAGCCGGTTTATCTACTTCATTATCCTACTACCACACACTCTCAAAGGATTTCTCAGACAGTTTGCAAGAAATAACGAAATCTATCCTTACTCTACAATCCCAAATAGACTCTTTGGCAGCAGTGACTCTCCAAAACCGCCGAGGCCTAGACCTCCTCACTGCTGAGAAAGGAGGACTCTGCACCTTCTTAGGGGAAGAGTGTTGTTTTTACACTAACCAGTCAGGGATAGTACGAGATGCCGCCCGGCGTTTACAGGAAAAGGCTTCTGAAATCAGACAACGCCTTTCAAACTCTTATACCAACCTCTGGAGTTGGGCGACATGGCTTCTCCCCTTTCTAGGTCCCGTGGCAGCCATCTTGCTATTACTCGCCTTCGGGCCCTGTATTTTTAACCTCCTTGTCAAATTTGTTTCCTCTAGGATCGAGGCCATCAAGCTACAGATGGTCTTACAAATGGAACCCCAAATGAGCTCAACTAACAACTTCTACCGAGGACCCCTGGACCGACCCGCTGGCCCTTTCACTGGCCTAGAGAGTTCCCCTCTGGAGGACACTACAACTGCAGGGCCCCTTCTTCGCCCCTATCCAGCAGGAAGTAGCTAGAGCGGTCATCGCCCAATTCCCAACAGCAGTTGGGGTGTCCTGTTTAGAGGGGGGAT
|
TF motifs of the concenus sequence
Use FIMO to detect transcription factor motifs in the concenus sequence of the TE family.
TE_family | TFBS | Start | End | Strand | Score | Matched sequence |
---|---|---|---|---|---|---|
HERV9 | TFAP2B | 7077 | 7089 | - | 17.84 | TGCCCCAGGGGCA |
HERV9 | lmd | 7291 | 7302 | + | 17.69 | GACCCCCTGCAG |
HERV9 | DREB2G | 4268 | 4281 | + | 17.68 | AGTTGCGGCGGTGG |
HERV9 | Zm00001d015407 | 7557 | 7572 | + | 17.42 | AAGGCATATTCTTCTT |
HERV9 | NFKB2 | 4852 | 4862 | + | 17.41 | GGGGAACCCCC |
HERV9 | opa | 7291 | 7302 | + | 17.40 | GACCCCCTGCAG |
HERV9 | RELA | 5627 | 5636 | + | 17.33 | GGGAATTTCC |
HERV9 | NFKB2 | 4852 | 4862 | - | 17.32 | GGGGGTTCCCC |
HERV9 | LOB | 4269 | 4281 | - | 17.30 | CCACCGCCGCAAC |
HERV9 | ZNF257 | 439 | 448 | - | 16.99 | GAGGCAAGGG |
TFBS enrichment in GRCh38
Use Fisher's exact test to perform enrichment analysis of transcription factor binding sites in the TE family of GRCh38.