HERV17
Basic information Differential Expression Stage analysis Survival analysis Correlation analysisDF ID | DF0000628 |
---|---|
TE superfamily | ERV1 |
TE class | LTR |
Species | Catarrhini |
Length | 8839 |
Kimura value | 4.20 |
Tau index | 0.9778 |
Description | Internal sequence of the class I endogenous retrovirus |
Comment | Internal sequence of HERV17 (HERV-W) flanked by LTR17 long terminal repeats. Update on existing entry. Most copies in the seed alignment are processed pseudogenes of ERV transcripts |
Sequence |
TTTGGTGGCCCACGAAGGGACTCTCCAAAGCGGTGAGTAATATTGGACCACTTTCGCTTGCTATTCTGTCCTATCCTTCCTTAGAATTGGAGGAAAATACCGGGCACCTGTCGGCCGGTTAAAAACGATTAGCGTGGCCGCCGGACTTAAGACTCAGGTGTGAGGCTNTCTGGGGAAGGGCTTTCTAACAACCCCCAACCCTTCTGGGTTGGGAGCGTTGGTCTGCCTGGAACCAGCTTCCGCTTTCAATTTTCCTGGGGGAAGCCGAGGGCCGACTAGAGGCAGAAAGCTGTCGTCCCGAACTCCCGGCATTAGCCGGTTGAGATCATGGCGCAGCCAGAAGTCTCTACTCAACAGTCGCCCATGCGTGCGCCCCTACCTTTCCTTCTGACCCATACCTCCTGGGTCCCGACCACGACTTTCTTGAAAGTGTAGCCCCAAAATTCTCCTTACCTCTGAATCTACTTCCTCCGATCCCTGCCTCCTAGGTACTAATGGTTCAGACTTTCATTTCCTCTCCCAAGTATTAGAGCAAGTTGTATCTCCAAAGGGATCTAAGGAAGCTCTACGCTGCGTCCTTAGGCACCTAGGCTATGAACCCAGGGAGTCTTGTCCCTGGTGTCCCTCCCGATTTAGGTATACAGCTCTCGACATGGGCAGTTATGTGGGACCCGTTCCCCACCACCCTTGCCAGGGCCCCAAGTTTGTAAATGGCTAAGAGGATTGCTCTCCCATTGTGTAAGATGCTCTCCTCCCCCAATTTCTACCCAGCTTACCCCTCTGCAATACAATCTCCAAGCCTTGGCTCCTTGGCCAGGGCCTTAGAACTGATGACCCAGTACTTTAACAACTGGAACTGGGTCTACGACAACATAATAGATCAGGATGAAAGCGAATTGAGTAAATTAAAGGGAGGCGCATATTCCTATAGTGGCAAATGGGGGCAACGAGCGAACGTCCTTCCGCTGTGTTTCCAAAATCCATCTACAGAGACAGAGAGGAGAGAGAGAGAGAGAGAGAAGAGAGAGAGAGAAAGAGAGAAAGAGAGAGATAGAAGTAGTAAAGAAAAAACAGTGTGCCCTATTCCTTTAAAAGCCAGGGTAAATTTAAAACCTATAATTGATAATTGAAGGTCTTCTCCGTGACCCTATAACACTCCAATACTACCTTGTTGTCAGTGTAAACAAGGGCGTAGCCCGAAAGCACTGAGACCACTGACAACCCGTAGCCTTCCTATCAAAAATCCTTAACCCAGTAACCCGCGGATGGCCCAAATGCATTCAATCTGTAGCGGCAACTGCTTTGCTAACAGAAGAAAGTAGAAAAGTAACTTTTAGAGGAAACCTCATTGTGAGCACACCTCACCAGTTCAGAACTATCCTAAGTCAAAAAAGCAAAAAGGTAGCTTACTAACTCAAAAATCTTAAAGTATGGGGCTATTCTGTTAGAAAAAGGTGATTTAACACTAACCACTGAAAATTCCCTTAACCCAGCAGATTTCCTAACAGGGGATTTAAATCTTAATTACCATACAAAGGTCCGACCAGACCTAGGAGGAACTCCCTTCAGGACAGGACGATAGATGGTTCCTCCCAGGTGATTGAGGAAAAAACCACAATGGGTATTCAGTAATTGATAGGGAGACTCTTGTGGAAGCAGAGTTAGGAAAATTGCCTAATAATTGGTCTGCTCAAACGTGCGAGCTGTTTGCACTCAGCCAAGCCTTAAAGTACTTACAGAATCAAAAAACTCTATCTCAATCCTGACTCAAAAGGTTACCTACACCCTCTCTGAAACGAATTTGCATAAGAACTGTTGTTTATGGGAATGCATCTTGATGGGGCAGCTGGGTTGTTATGAAATACTCAGGAACCCAGCCCAGCTCTAGGACTCACCCCTGAGCGCAAAGGCAATGTTGGGCACGCTGGTAAAGGACCACTAGAATCCAGCAGCCCGGACCCCTTTCTTTGTGGTCAAGAAAGGCGGGAAAACGGGTGCAGGACTGCTACATCGGTGAGCGTAACTAATCCGATAAGCAGAGGTCCATGGGTGGTTACGCACCCTGGAAAGGAATAAGCATTAGGACCATAGAGGACGCTCTAGGACTAATGCTCATCGGAAAATGACTAGGGGTGCTGGCATCCCTATGTTCTTTTTTCAGATGGGAAACGTTCCCCCCAAGGCAAAAACGCCCCTAAGATGTATTCTGGAGAATTNGGNCCAGTTTGACCCTCAGACGCTAAGAAAGAAACGACTTATATTCTTCTGCAGTACCGCCTGGCCACGATATCCTCTTCAAGGGGGAGAAACCTGGCCTCCTGAGGGAAGTATAAATTATAACACCATCTTACAGCTAGACCTCTTTTGTAGAAAAGAAGGCAAATGGAGTGAAGTGCCATATGTACAAACTTTCTTTTCATTAAGAGACAACTCGCAATTATGTAAAAAGTGTGATTTATGCCCTACAGGAAGCCCTCAGAGTCTACCTCCCTACCCCGGCGTCCCCCCGACTCCTTCCCCAACTAATAAGGACCCCCCTTCAACCCAAACGGTCCAAAAGGAGATAGACAAAGGGGTAAACAATGAACCAAAGAGTGCCAATATTCCCCGATTATGCCCCCTCCAAGCGGTGGGAGGAGGAGAATTCGGCCCAGCCAGAGTGCATGTACCTTTTTCCCTCTCAGACTTGAAGCAAATTAAAATAGACCTAGGTAAATTCTCAGATAACCCTGATGGCTATATTGATGTTTTACAAGGGTTAGGACAATCCTTTGATCTGACATGGAGAGATATAATGTTACTGCTAGATCAGACACTAACCCCAAATGAGAGAAGTGCCGCCATAACTGCAGCCCGAGAGTTTGGCGATCTCTGGTATCTCAGTCAGGTCAATGATAGGATGACAACAGAGGAAAGAGAACGATTCCCCACAGGCCAGCAGGCAGTTCCCAGTGTAGACCCTCACTGGGACGCAGAATCAGAACATGGAGATTGGTGCCGCAGACATTTGCTAACTTGCGTGCTAGAAGGACTAAGGAAAACTAGGAAGAAGCCTATGAATTATTCAATGATGTCCACTATAACACAGGGAAAGGAAGAAAATCCTACTGCCTTTCTGGAGAGACTAAGGGAGGCATTGAGGAAGCATACCTCTCTGTCACCTGACTCTATTGAAGGCCAACTAATCTTAAAGGATAAGTTTATCACTCAGTCAGCTGCAGACATTAGAAAAAAACTTCAAAAGTCCGCCTTAGGCCCGGAGCAAAACTTAGAAACCCTATTGAACTTGGCAACCTCGGTTTTTTATAATAGAGATCAGGAGGAGCAGGCGGAACGGGACAAACGGGATAAGAAAAAAAAAGGCCACCGCTTTAGTCATGGCCCTCAGGCAAGCGGACTTTGGAGGCTCTGGAAAAGGGAAAGGCTGGGCAAATCGAATGCCTAATAGGGCTTGCTTCCAGTGCGGTCTACAAGGACACTTTAAAAAAGATTGTCCGAATAGAAATAAGCCGCCCCCTCGTCCATGCCCCTTATGTCAAGGGAATCACTGGAAGGCCCACTGCCCCAGGGGACGAAGGTCCTCTGAGTCAGAAGCCACTAACCAGATGATCCAGCAGCAGGACTGAGGGTGCCCGGGGCAAGCGCCAGCCCATGCCATCACCCTCACAGAGCCCCGGGTATGCTTGACCATTGAGGGCCAGGAGGTTAACTGTCTCCTGGACACTGGCGCGGCCTTCTCAGTCTTACTCTCCTGTCCCGGACAACTGTCCTCCAGATCTGTCACTATCCGAGGGGTCCTAGGACAGCCAGTCACTAGATACTTCTCCCAGCCACTAAGTTGTGACTGGGGAACTTTACTCTTTTCACATGCCTTTCTAATTATGCCTGAAAGCCCCACTCCCTTGTTAGGGAGAGACATTCTAGCAAAAGCAGGGGCCATTATACACCTGAACATAGGAGAAGGAACACCCGTTTGTTGTCCCCTGCTTGAGGAAGGAATTAATCCTGAAGTCTGGGCAACAGAAGGACAATATGGACGAGCAAAGAATGCCCGTCCTGTTCAAGTTAAACTAAAGGATTCCGCCTCCTTTCCCTACCAAAGGCAGTACCCCCTTAGACCCGAGGCCCAACAAGGACTCCAAAAGATTGTTAAGGACCTAAAAGCCCAAGGCCTAGTAAAACCATGCAATAGCCCCTGCAATACTCCAATTTTAGGAGTACAGAAACCCAACGGACAGTGGAGGTTAGTGCAAGATCTCAGGATTATCAATGAGGCCGTTGTCCCTCTATACCCAGCTGTACCTAACCCTTATACTCTGCTTTCCCAAATACCAGAGGAAGCAGAGTGGTTTACAGTCCTGGACCTTAAGGATGCCTTTTTCTGCATCCCTGTACATCCTGACTCTCAATTCTTGTTTGCCTTTGAAGATCCTTCGAACCCAACGTCTCAACTCACCTGGACTGTTTTACCCCAAGGGTTCAGGGATAGCCCCCATCTATTTGGCCAGGCATTAGCCCAAGACTTGAGCCAGTTCTCATACCTGGACACTCTTGTCCTTCGGTACGTGGATGATTTACTTTTAGCCGCCCGTTCAGAAACCTTGTGCCATCAAGCCACCCAAGCGCTCTTAAATTTCCTCGCCACCTGTGGCTACAAGGTTTCCAAACCAAAGGCTCAGCTCTGCTCACAGCAGGTTAAATACTTAGGGCTAAAATTATCCAAAGGCACCAGGGCCCTCAGTGAGGAACGTATCCAGCCTATACTGGCTTATCCTCATCCCAAAACCCTAAAGCAACTAAGAGGGTTCCTTGGCATAACAGGCTTCTGCCGAATATGGATTCCCAGGTACGGCGAAATAGCCAGGCCATTATATACACTAATTAAGGAAACTCAGAAAGCCAATACCCATTTAGTAAGATGGACACCTGAAGCAGAAGCGGCTTTCCAGGCCCTAAAGAAGGCCCTAACCCAAGCCCCAGTGTTAAGCTTGCCAACGGGGCAAGACTTTTCTTTATATGTCACAGAAAAAACAGGAATAGCTCTAGGAGTCCTTACACAGGTCCGAGGGACGAGCTTGCAACCCGTGGCATACCTGAGTAAGGAAATTGATGTAGTGGCAAAGGGTTGGCCTCATTGTTTACGGGTAGTGGCGGCAGTAGCAGTCTTAGTATCTGAAGCAGTTAAAATAATACAGGGAAGAGATCTTACTGTGTGGACATCTCATGATGTGAACGGCATACTCACTGCTAAAGGAGACTTGTGGCTGTCAGACAACCGTTTACTTAAATATCAGGCTCTATTACTTGAAGGGCCAGTGCTGCGACTGCGCACTTGTGCAACTCTTAACCCAGCCACATTTCTTCCAGACAATGAAGAAAAGATAGAACATAACTGTCAACAAGTAATTGCTCAAACCTACGCCGCTCGAGGGGACCTTCTAGAGGTTCCCTTGACTGATCCCGACCTCAACTTGTATACTGATGGAAGTTCCTTTGTAGAAAAAGGACTTCGAAAAGCGGGGTATGCAGTGGTCAGTGATAATGGAATACTTGAAAGTAATCCCCTCACTCCAGGAACTAGCGCTCAGCTGGCAGAACTAATAGCCCTCACTCGGGCACTAGAATTAGGAGAAGGAAAAAGGGTAAATATATATACAGACTCTAAGTATGCTTACCTAGTCCTCCATGCCCACGCAGCAATATGGAGAGAAAGGGAATTCCTAACTTCCGAGGGAACACCTATCAAACATCAGGAAGCCATTAGGAGATTATTATTGGCTGTACAGAAACCTAAAGAGGTGGCAGTCTTACACTGCCGGGGTCATCAGAAAGGAAAGGAAAGGGAAATAGAAGGGAACCGCCAAGCGGATATTGAAGCCAAAAGAGCCGCAAGGCGGGACCCTCCATTAGAAATGCTTATAGAAGGACCCCTAGTATGGGGTAATCCCCTCCGGGAAACCAAGCCCCAGTACTCAGCAGAAGAAATAGAATGGGGAACCTCACGAGGACATAGTTTCCTCCCCTCAGGATGGCTAGCCACCGAAGAAGGAAAAATACTTTTGCCTGCAGCTAACCAATGGAAATTACTTAAAACCCTTCACCAAACCTTTCACTTAGGCATTGATAGCACCCATCAGATGGCCAAATCATTATTTACTGGACCAGGCCTTTTCAAAACTATCAAGCAGATAGTCAGGGCCTGTGAAGTGTGCCAAAGAAATAATCCCCTGCACTTATCGCCAAGCTCCTTCAGGAGAACAAAGAACAGGCCATTACCCAGGAGAAGACTGGCAACTAGATTTTACCCACATGCCCAAATCTCAGGGATTTCAGTATCTACTAGTCTGGGTAGATACTTTCACTGGTTGGGCGGAGGCCTTCCCTTGTAGGACAGAAAAGGCCCAAGAGGTAATAAAGGCACTAGTTCATGAAATAATTCCCAGATTCGGACTTCCCCGAGGCTTACAGAGTGACAATGGCCCCGCTTTCAAGGCTGCAGTAACCCAGGGAGTATCCCAGGCGTTAGGCATACAATATCACTTACACTGCGCCTGGAGGCCACAATCCTCAGGAAAAGTCGAGAAAATGAACGAAACACTCAAACGACATCTAAAAAAGCTAACCCAGGAAACCCACCTCGCATGGCCTGCTCTGTTGCCTATAGCCTTACTAAGAATCCGAAACTCTCCCCAAAAAGCGGGACTTAGCCCATACGAAATGCTGTATGGACGGCCCTTCCTAACCAATGACCTTGTGCTTGACCGAGAGACGGCCAACTTAGTTGCAGACATCACCTCCTTAGCCAAATATCAACAAGTTCTTAAAACATTACAGGGAACCTGTCCCCGAGAGGAGGGAAAGGAATTATTCCACCCTGGTGACATGGTATTAGTCAAGTCCCTTCCCTCTAATTCCCCATCCCTAGATACATCCTGGGAAGGACCCTACCCAGTCATTTTATCTACCCCAACCGCGGTTAAAGTGGCTGGAGTGGAGTCTTGGATACATCACACTCGAGTCAAACCCTGGATACTGCCAAAGGAACCCGAAAATCCAGGAGACAACGCTAGCTATTCCTGTGAACCTCTAGAGGATCTGCGCCTGCTCTTCAAGCGACAACCGTGAGGAAAGTAACTAGAATCGTAGATCCCCATGGCCCTCCCTTGTCATATTTTTCTCTTTACTGTTCTCTTACCCCCTTTCACTCTCACTGCACCCCCTCCATGCCGCTGTACTACCAGTAGCTCCCCTTACCAAGAGCTTCTATGGAGAATGCGGCTTCCCGGAAATATTGATGCCCCATCGTATAGGAGTTTTTCTAAAGGAAACCCCACTTTCACCGCCCACACCCATATGCCCCGCAACTGCTATAACTCTGCCACTCTTTGCATGCATGCAAATACTCATTATTGGACAGGGAAAATGATTAATCCTAGTTGTCCTGGAGGACTTGGAGCCACTGTCTGTTGGACTTACTTCACCCATACCGGTATGTCTGATGGGGGTGGAGTTCAAGATCAGGCAAGAGAAAAACACGTAAAGGAAGTAATCTCCCAACTGACCCGGGTACATAGCACCCCTAGCCCCTACAAAGGACTAGATCTCTCAAAACTACATGAAACCCTCCGTACCCATACTCGCCTGGTAAGCCTATTTAATACCACCCTCACTGGGCTCCATGAGGTCTCGGCCCAAAACCCTACTAACTGTTGGATGTGCCTCCCCCTGCACTNCAGGCCATACATTTCAATCCCTGTATCTGAACAATGGAACAACTTCAGCACAGAAATAAACACCACTTCCGTTTTAGTAGGACCTCTTGTTTCCAATCTGGAAATAACCCATACCTCAAACCTCACCTGTGTAAAATTTAGCAATACTATAGACACAACCAACTCCCAATGCATCAGGTGGGTAACTCCTCCCACACGAATAGTCTGCCTACCCTCAGGAATATTTTTTGTCTGTGGTACCTCAGCCTATCGTTGTTTGAATGGCTCTTCAGAATCTATGTGCTTCCTCTCATTCTTAGTGCCCCCTATGACCATCTACACTGAACAAGATTTATACAATCATGTCGTACCTAAGCCCCGCAACAAAAGAGTACCCATTCTTCCTTTTGTTATCGGAGCAGGAGTGCTAGGCGGACTAGGTACTGGCATTGGCGGTATCACAACCTCTACTCAGTTCTACTACAAACTATCTCAAGAACTAAATGGTGACATGGAACGGGTCGCCGACTCCCTGGTCACCTTGCAAGATCAACTTAACTCCCTAGCAGCAGTAGTCCTTCAAAATCGAAGAGCTTTAGACTTGCTAACCGCCGAAAGAGGGGGAACCTGTTTATTTTTAGGGGAAGAATGCTGTTATTATGTTAATCAATCCGGAATCGTCACCGAGAAAGTTAAAGAAATTCGAGATCGAATACAACGTAGAGCAGAGGAGCTTCAAAACACCGGACCCTGGGGCCTCCTCAGCCAATGGATGCCCTGGATTCTCCCCTTCTTAGGACCTCTAGCAGCTATAATATTGTTACTCCTCTTTGGACCCTGTATCTTTAACCTCCTTGTTAAGTTTGTCTCTTCCAGAATCGAAGCTGTAAAACTACAAATCGTTCTTCAAATGGAGCCCCAGATGCAGTCCATGACTAAGATCTACCGCGGACCCCTGGACCGGCCTGCTAGCCCATGCTCCGATGTTAATGACATCGAAGGCACCCCTCCCGAGGAAATCTCAACTGCACGACCCCTACTACGCCCCAATTCAGCAGGAAGCAGTTAGAGCGGTCGTCGGCCAACCTCCCCAACAGCACTTGGGTTTTCCTGTTGAGAGGGGGATC
|
TF motifs of the concenus sequence
Use FIMO to detect transcription factor motifs in the concenus sequence of the TE family.
TE_family | TFBS | Start | End | Strand | Score | Matched sequence |
---|---|---|---|---|---|---|
HERV17 | BPC5 | 1025 | 1054 | + | 33.13 | AGAGAGAGAAAGAGAGAAAGAGAGAGATAG |
HERV17 | BPC6 | 1020 | 1040 | - | 32.28 | CTCTCTTTCTCTCTCTCTCTT |
HERV17 | BPC6 | 1022 | 1042 | - | 31.37 | TTCTCTCTTTCTCTCTCTCTC |
HERV17 | BPC1 | 1001 | 1024 | + | 30.86 | GAGAGAGAGAGAGAGAGAGAAGAG |
HERV17 | BPC6 | 1028 | 1048 | - | 30.82 | CTCTCTTTCTCTCTTTCTCTC |
HERV17 | BPC1 | 999 | 1022 | + | 30.58 | AGGAGAGAGAGAGAGAGAGAGAAG |
HERV17 | BPC6 | 1024 | 1044 | - | 29.63 | CTTTCTCTCTTTCTCTCTCTC |
HERV17 | BPC5 | 1027 | 1056 | + | 29.62 | AGAGAGAAAGAGAGAAAGAGAGAGATAGAA |
HERV17 | BPC6 | 999 | 1019 | - | 29.03 | CTCTCTCTCTCTCTCTCTCCT |
HERV17 | BPC6 | 1032 | 1052 | - | 28.37 | ATCTCTCTCTTTCTCTCTTTC |
TFBS enrichment in GRCh38
Use Fisher's exact test to perform enrichment analysis of transcription factor binding sites in the TE family of GRCh38.