HERV17
DF ID | DF0000628 |
---|---|
TE superfamily | ERV1 |
TE class | LTR |
Species | Catarrhini |
Length | 8839 |
Kimura value | 4.20 |
Tau index | 0.9778 |
Description | Internal sequence of the class I endogenous retrovirus |
Comment | Internal sequence of HERV17 (HERV-W) flanked by LTR17 long terminal repeats. Update on existing entry. Most copies in the seed alignment are processed pseudogenes of ERV transcripts |
Sequence |
TTTGGTGGCCCACGAAGGGACTCTCCAAAGCGGTGAGTAATATTGGACCACTTTCGCTTGCTATTCTGTCCTATCCTTCCTTAGAATTGGAGGAAAATACCGGGCACCTGTCGGCCGGTTAAAAACGATTAGCGTGGCCGCCGGACTTAAGACTCAGGTGTGAGGCTNTCTGGGGAAGGGCTTTCTAACAACCCCCAACCCTTCTGGGTTGGGAGCGTTGGTCTGCCTGGAACCAGCTTCCGCTTTCAATTTTCCTGGGGGAAGCCGAGGGCCGACTAGAGGCAGAAAGCTGTCGTCCCGAACTCCCGGCATTAGCCGGTTGAGATCATGGCGCAGCCAGAAGTCTCTACTCAACAGTCGCCCATGCGTGCGCCCCTACCTTTCCTTCTGACCCATACCTCCTGGGTCCCGACCACGACTTTCTTGAAAGTGTAGCCCCAAAATTCTCCTTACCTCTGAATCTACTTCCTCCGATCCCTGCCTCCTAGGTACTAATGGTTCAGACTTTCATTTCCTCTCCCAAGTATTAGAGCAAGTTGTATCTCCAAAGGGATCTAAGGAAGCTCTACGCTGCGTCCTTAGGCACCTAGGCTATGAACCCAGGGAGTCTTGTCCCTGGTGTCCCTCCCGATTTAGGTATACAGCTCTCGACATGGGCAGTTATGTGGGACCCGTTCCCCACCACCCTTGCCAGGGCCCCAAGTTTGTAAATGGCTAAGAGGATTGCTCTCCCATTGTGTAAGATGCTCTCCTCCCCCAATTTCTACCCAGCTTACCCCTCTGCAATACAATCTCCAAGCCTTGGCTCCTTGGCCAGGGCCTTAGAACTGATGACCCAGTACTTTAACAACTGGAACTGGGTCTACGACAACATAATAGATCAGGATGAAAGCGAATTGAGTAAATTAAAGGGAGGCGCATATTCCTATAGTGGCAAATGGGGGCAACGAGCGAACGTCCTTCCGCTGTGTTTCCAAAATCCATCTACAGAGACAGAGAGGAGAGAGAGAGAGAGAGAGAAGAGAGAGAGAGAAAGAGAGAAAGAGAGAGATAGAAGTAGTAAAGAAAAAACAGTGTGCCCTATTCCTTTAAAAGCCAGGGTAAATTTAAAACCTATAATTGATAATTGAAGGTCTTCTCCGTGACCCTATAACACTCCAATACTACCTTGTTGTCAGTGTAAACAAGGGCGTAGCCCGAAAGCACTGAGACCACTGACAACCCGTAGCCTTCCTATCAAAAATCCTTAACCCAGTAACCCGCGGATGGCCCAAATGCATTCAATCTGTAGCGGCAACTGCTTTGCTAACAGAAGAAAGTAGAAAAGTAACTTTTAGAGGAAACCTCATTGTGAGCACACCTCACCAGTTCAGAACTATCCTAAGTCAAAAAAGCAAAAAGGTAGCTTACTAACTCAAAAATCTTAAAGTATGGGGCTATTCTGTTAGAAAAAGGTGATTTAACACTAACCACTGAAAATTCCCTTAACCCAGCAGATTTCCTAACAGGGGATTTAAATCTTAATTACCATACAAAGGTCCGACCAGACCTAGGAGGAACTCCCTTCAGGACAGGACGATAGATGGTTCCTCCCAGGTGATTGAGGAAAAAACCACAATGGGTATTCAGTAATTGATAGGGAGACTCTTGTGGAAGCAGAGTTAGGAAAATTGCCTAATAATTGGTCTGCTCAAACGTGCGAGCTGTTTGCACTCAGCCAAGCCTTAAAGTACTTACAGAATCAAAAAACTCTATCTCAATCCTGACTCAAAAGGTTACCTACACCCTCTCTGAAACGAATTTGCATAAGAACTGTTGTTTATGGGAATGCATCTTGATGGGGCAGCTGGGTTGTTATGAAATACTCAGGAACCCAGCCCAGCTCTAGGACTCACCCCTGAGCGCAAAGGCAATGTTGGGCACGCTGGTAAAGGACCACTAGAATCCAGCAGCCCGGACCCCTTTCTTTGTGGTCAAGAAAGGCGGGAAAACGGGTGCAG
GACTGCTACATCGGTGAGCGTAACTAATCCGATAAGCAGAGGTCCATGGGTGGTTACGCACCCTGGAAAGGAATAAGCATTAGGACCATAGAGGACGCTCTAGGACTAATGCTCATCGGAAAATGACTAGGGGTGCTGGCATCCCTATGTTCTTTTTTCAGATGGGAAACGTTCCCCCCAAGGCAAAAACGCCCCTAAGATGTATTCTGGAGAATTNGGNCCAGTTTGACCCTCAGACGCTAAGAAAGAAACGACTTATATTCTTCTGCAGTACCGCCTGGCCACGATATCCTCTTCAAGGGGGAGAAACCTGGCCTCCTGAGGGAAGTATAAATTATAACACCATCTTACAGCTAGACCTCTTTTGTAGAAAAGAAGGCAAATGGAGTGAAGTGCCATATGTACAAACTTTCTTTTCATTAAGAGACAACTCGCAATTATGTAAAAAGTGTGATTTATGCCCTACAGGAAGCCCTCAGAGTCTACCTCCCTACCCCGGCGTCCCCCCGACTCCTTCCCCAACTAATAAGGACCCCCCTTCAACCCAAACGGTCCAAAAGGAGATAGACAAAGGGGTAAACAATGAACCAAAGAGTGCCAATATTCCCCGATTATGCCCCCTCCAAGCGGTGGGAGGAGGAGAATTCGGCCCAGCCAGAGTGCATGTACCTTTTTCCCTCTCAGACTTGAAGCAAATTAAAATAGACCTAGGTAAATTCTCAGATAACCCTGATGGCTATATTGATGTTTTACAAGGGTTAGGACAATCCTTTGATCTGACATGGAGAGATATAATGTTACTGCTAGATCAGACACTAACCCCAAATGAGAGAAGTGCCGCCATAACTGCAGCCCGAGAGTTTGGCGATCTCTGGTATCTCAGTCAGGTCAATGATAGGATGACAACAGAGGAAAGAGAACGATTCCCCACAGGCCAGCAGGCAGTTCCCAGTGTAGACCCTCACTGGGACGCAGAATCAGAACATGGAGATTGGTGCCGCAGACATTTGCTAACTTGCGTGCTAGAAGGACTAAGGAAAACTAGGAAGAAGCCTATGAATTATTCAATGATGTCCACTATAACACAGGGAAAGGAAGAAAATCCTACTGCCTTTCTGGAGAGACTAAGGGAGGCATTGAGGAAGCATACCTCTCTGTCACCTGACTCTATTGAAGGCCAACTAATCTTAAAGGATAAGTTTATCACTCAGTCAGCTGCAGACATTAGAAAAAAACTTCAAAAGTCCGCCTTAGGCCCGGAGCAAAACTTAGAAACCCTATTGAACTTGGCAACCTCGGTTTTTTATAATAGAGATCAGGAGGAGCAGGCGGAACGGGACAAACGGGATAAGAAAAAAAAAGGCCACCGCTTTAGTCATGGCCCTCAGGCAAGCGGACTTTGGAGGCTCTGGAAAAGGGAAAGGCTGGGCAAATCGAATGCCTAATAGGGCTTGCTTCCAGTGCGGTCTACAAGGACACTTTAAAAAAGATTGTCCGAATAGAAATAAGCCGCCCCCTCGTCCATGCCCCTTATGTCAAGGGAATCACTGGAAGGCCCACTGCCCCAGGGGACGAAGGTCCTCTGAGTCAGAAGCCACTAACCAGATGATCCAGCAGCAGGACTGAGGGTGCCCGGGGCAAGCGCCAGCCCATGCCATCACCCTCACAGAGCCCCGGGTATGCTTGACCATTGAGGGCCAGGAGGTTAACTGTCTCCTGGACACTGGCGCGGCCTTCTCAGTCTTACTCTCCTGTCCCGGACAACTGTCCTCCAGATCTGTCACTATCCGAGGGGTCCTAGGACAGCCAGTCACTAGATACTTCTCCCAGCCACTAAGTTGTGACTGGGGAACTTTACTCTTTTCACATGCCTTTCTAATTATGCCTGAAAGCCCCACTCCCTTGTTAGGGAGAGACATTCTAGCAAAAGCAGGGGCCATTATACACCTGAACATAGGAGAAGGAACACCCGTTTGTTGTCCCCTGCTTGAGGAAGGAAT
TAATCCTGAAGTCTGGGCAACAGAAGGACAATATGGACGAGCAAAGAATGCCCGTCCTGTTCAAGTTAAACTAAAGGATTCCGCCTCCTTTCCCTACCAAAGGCAGTACCCCCTTAGACCCGAGGCCCAACAAGGACTCCAAAAGATTGTTAAGGACCTAAAAGCCCAAGGCCTAGTAAAACCATGCAATAGCCCCTGCAATACTCCAATTTTAGGAGTACAGAAACCCAACGGACAGTGGAGGTTAGTGCAAGATCTCAGGATTATCAATGAGGCCGTTGTCCCTCTATACCCAGCTGTACCTAACCCTTATACTCTGCTTTCCCAAATACCAGAGGAAGCAGAGTGGTTTACAGTCCTGGACCTTAAGGATGCCTTTTTCTGCATCCCTGTACATCCTGACTCTCAATTCTTGTTTGCCTTTGAAGATCCTTCGAACCCAACGTCTCAACTCACCTGGACTGTTTTACCCCAAGGGTTCAGGGATAGCCCCCATCTATTTGGCCAGGCATTAGCCCAAGACTTGAGCCAGTTCTCATACCTGGACACTCTTGTCCTTCGGTACGTGGATGATTTACTTTTAGCCGCCCGTTCAGAAACCTTGTGCCATCAAGCCACCCAAGCGCTCTTAAATTTCCTCGCCACCTGTGGCTACAAGGTTTCCAAACCAAAGGCTCAGCTCTGCTCACAGCAGGTTAAATACTTAGGGCTAAAATTATCCAAAGGCACCAGGGCCCTCAGTGAGGAACGTATCCAGCCTATACTGGCTTATCCTCATCCCAAAACCCTAAAGCAACTAAGAGGGTTCCTTGGCATAACAGGCTTCTGCCGAATATGGATTCCCAGGTACGGCGAAATAGCCAGGCCATTATATACACTAATTAAGGAAACTCAGAAAGCCAATACCCATTTAGTAAGATGGACACCTGAAGCAGAAGCGGCTTTCCAGGCCCTAAAGAAGGCCCTAACCCAAGCCCCAGTGTTAAGCTTGCCAACGGGGCAAGACTTTTCTTTATATGTCACAGAAAAAACAGGAATAGCTCTAGGAGTCCTTACACAGGTCCGAGGGACGAGCTTGCAACCCGTGGCATACCTGAGTAAGGAAATTGATGTAGTGGCAAAGGGTTGGCCTCATTGTTTACGGGTAGTGGCGGCAGTAGCAGTCTTAGTATCTGAAGCAGTTAAAATAATACAGGGAAGAGATCTTACTGTGTGGACATCTCATGATGTGAACGGCATACTCACTGCTAAAGGAGACTTGTGGCTGTCAGACAACCGTTTACTTAAATATCAGGCTCTATTACTTGAAGGGCCAGTGCTGCGACTGCGCACTTGTGCAACTCTTAACCCAGCCACATTTCTTCCAGACAATGAAGAAAAGATAGAACATAACTGTCAACAAGTAATTGCTCAAACCTACGCCGCTCGAGGGGACCTTCTAGAGGTTCCCTTGACTGATCCCGACCTCAACTTGTATACTGATGGAAGTTCCTTTGTAGAAAAAGGACTTCGAAAAGCGGGGTATGCAGTGGTCAGTGATAATGGAATACTTGAAAGTAATCCCCTCACTCCAGGAACTAGCGCTCAGCTGGCAGAACTAATAGCCCTCACTCGGGCACTAGAATTAGGAGAAGGAAAAAGGGTAAATATATATACAGACTCTAAGTATGCTTACCTAGTCCTCCATGCCCACGCAGCAATATGGAGAGAAAGGGAATTCCTAACTTCCGAGGGAACACCTATCAAACATCAGGAAGCCATTAGGAGATTATTATTGGCTGTACAGAAACCTAAAGAGGTGGCAGTCTTACACTGCCGGGGTCATCAGAAAGGAAAGGAAAGGGAAATAGAAGGGAACCGCCAAGCGGATATTGAAGCCAAAAGAGCCGCAAGGCGGGACCCTCCATTAGAAATGCTTATAGAAGGACCCCTAGTATGGGGTAATCCCCTCCGGGAAACCAAGCCCCAGTACTCAGCAGAAGAAATAGAATGGGGAACCT
CACGAGGACATAGTTTCCTCCCCTCAGGATGGCTAGCCACCGAAGAAGGAAAAATACTTTTGCCTGCAGCTAACCAATGGAAATTACTTAAAACCCTTCACCAAACCTTTCACTTAGGCATTGATAGCACCCATCAGATGGCCAAATCATTATTTACTGGACCAGGCCTTTTCAAAACTATCAAGCAGATAGTCAGGGCCTGTGAAGTGTGCCAAAGAAATAATCCCCTGCACTTATCGCCAAGCTCCTTCAGGAGAACAAAGAACAGGCCATTACCCAGGAGAAGACTGGCAACTAGATTTTACCCACATGCCCAAATCTCAGGGATTTCAGTATCTACTAGTCTGGGTAGATACTTTCACTGGTTGGGCGGAGGCCTTCCCTTGTAGGACAGAAAAGGCCCAAGAGGTAATAAAGGCACTAGTTCATGAAATAATTCCCAGATTCGGACTTCCCCGAGGCTTACAGAGTGACAATGGCCCCGCTTTCAAGGCTGCAGTAACCCAGGGAGTATCCCAGGCGTTAGGCATACAATATCACTTACACTGCGCCTGGAGGCCACAATCCTCAGGAAAAGTCGAGAAAATGAACGAAACACTCAAACGACATCTAAAAAAGCTAACCCAGGAAACCCACCTCGCATGGCCTGCTCTGTTGCCTATAGCCTTACTAAGAATCCGAAACTCTCCCCAAAAAGCGGGACTTAGCCCATACGAAATGCTGTATGGACGGCCCTTCCTAACCAATGACCTTGTGCTTGACCGAGAGACGGCCAACTTAGTTGCAGACATCACCTCCTTAGCCAAATATCAACAAGTTCTTAAAACATTACAGGGAACCTGTCCCCGAGAGGAGGGAAAGGAATTATTCCACCCTGGTGACATGGTATTAGTCAAGTCCCTTCCCTCTAATTCCCCATCCCTAGATACATCCTGGGAAGGACCCTACCCAGTCATTTTATCTACCCCAACCGCGGTTAAAGTGGCTGGAGTGGAGTCTTGGATACATCACACTCGAGTCAAACCCTGGATACTGCCAAAGGAACCCGAAAATCCAGGAGACAACGCTAGCTATTCCTGTGAACCTCTAGAGGATCTGCGCCTGCTCTTCAAGCGACAACCGTGAGGAAAGTAACTAGAATCGTAGATCCCCATGGCCCTCCCTTGTCATATTTTTCTCTTTACTGTTCTCTTACCCCCTTTCACTCTCACTGCACCCCCTCCATGCCGCTGTACTACCAGTAGCTCCCCTTACCAAGAGCTTCTATGGAGAATGCGGCTTCCCGGAAATATTGATGCCCCATCGTATAGGAGTTTTTCTAAAGGAAACCCCACTTTCACCGCCCACACCCATATGCCCCGCAACTGCTATAACTCTGCCACTCTTTGCATGCATGCAAATACTCATTATTGGACAGGGAAAATGATTAATCCTAGTTGTCCTGGAGGACTTGGAGCCACTGTCTGTTGGACTTACTTCACCCATACCGGTATGTCTGATGGGGGTGGAGTTCAAGATCAGGCAAGAGAAAAACACGTAAAGGAAGTAATCTCCCAACTGACCCGGGTACATAGCACCCCTAGCCCCTACAAAGGACTAGATCTCTCAAAACTACATGAAACCCTCCGTACCCATACTCGCCTGGTAAGCCTATTTAATACCACCCTCACTGGGCTCCATGAGGTCTCGGCCCAAAACCCTACTAACTGTTGGATGTGCCTCCCCCTGCACTNCAGGCCATACATTTCAATCCCTGTATCTGAACAATGGAACAACTTCAGCACAGAAATAAACACCACTTCCGTTTTAGTAGGACCTCTTGTTTCCAATCTGGAAATAACCCATACCTCAAACCTCACCTGTGTAAAATTTAGCAATACTATAGACACAACCAACTCCCAATGCATCAGGTGGGTAACTCCTCCCACACGAATAGTCTGCCTACCCTCAGGAATATTTTTTGTCTGTGGTACCTCAGCCTATCGTTGTTTGAATGGCTC
TTCAGAATCTATGTGCTTCCTCTCATTCTTAGTGCCCCCTATGACCATCTACACTGAACAAGATTTATACAATCATGTCGTACCTAAGCCCCGCAACAAAAGAGTACCCATTCTTCCTTTTGTTATCGGAGCAGGAGTGCTAGGCGGACTAGGTACTGGCATTGGCGGTATCACAACCTCTACTCAGTTCTACTACAAACTATCTCAAGAACTAAATGGTGACATGGAACGGGTCGCCGACTCCCTGGTCACCTTGCAAGATCAACTTAACTCCCTAGCAGCAGTAGTCCTTCAAAATCGAAGAGCTTTAGACTTGCTAACCGCCGAAAGAGGGGGAACCTGTTTATTTTTAGGGGAAGAATGCTGTTATTATGTTAATCAATCCGGAATCGTCACCGAGAAAGTTAAAGAAATTCGAGATCGAATACAACGTAGAGCAGAGGAGCTTCAAAACACCGGACCCTGGGGCCTCCTCAGCCAATGGATGCCCTGGATTCTCCCCTTCTTAGGACCTCTAGCAGCTATAATATTGTTACTCCTCTTTGGACCCTGTATCTTTAACCTCCTTGTTAAGTTTGTCTCTTCCAGAATCGAAGCTGTAAAACTACAAATCGTTCTTCAAATGGAGCCCCAGATGCAGTCCATGACTAAGATCTACCGCGGACCCCTGGACCGGCCTGCTAGCCCATGCTCCGATGTTAATGACATCGAAGGCACCCCTCCCGAGGAAATCTCAACTGCACGACCCCTACTACGCCCCAATTCAGCAGGAAGCAGTTAGAGCGGTCGTCGGCCAACCTCCCCAACAGCACTTGGGTTTTCCTGTTGAGAGGGGGATC
|
TF motifs of the consensus sequence
Use FIMO to detect transcription factor motifs in the consensus sequence of the TE family.
TE_family | TFBS | Start | End | Strand | Score | Matched sequence |
---|---|---|---|---|---|---|
HERV17 | BPC5 | 1021 | 1050 | + | 41.99 | AGAGAGAGAGAGAAAGAGAGAAAGAGAGAG |
HERV17 | BPC1 | 1022 | 1045 | + | 37.48 | GAGAGAGAGAGAAAGAGAGAAAGA |
HERV17 | BPC5 | 1019 | 1048 | + | 37.43 | GAAGAGAGAGAGAGAAAGAGAGAAAGAGAG |
HERV17 | BPC1 | 1028 | 1051 | + | 36.42 | GAGAGAAAGAGAGAAAGAGAGAGA |
HERV17 | BPC6 | 1001 | 1021 | - | 36.12 | TTCTCTCTCTCTCTCTCTCTC |
HERV17 | BPC1 | 1024 | 1047 | + | 36.02 | GAGAGAGAGAAAGAGAGAAAGAGA |
HERV17 | BPC1 | 1026 | 1049 | + | 35.82 | GAGAGAGAAAGAGAGAAAGAGAGA |
HERV17 | BPC5 | 1023 | 1052 | + | 34.78 | AGAGAGAGAGAAAGAGAGAAAGAGAGAGAT |
HERV17 | BPC1 | 1020 | 1043 | + | 34.03 | AAGAGAGAGAGAGAAAGAGAGAAA |
HERV17 | BPC1 | 1030 | 1053 | + | 33.82 | GAGAAAGAGAGAAAGAGAGAGATA |
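The hits listed above overlap heavily, since they all fall on one GA-rich repeat in the consensus. As an illustration only, the sketch below collapses the ten rows of the table into merged regions. It assumes the Start and End columns are 1-based and inclusive (consistent with the matched sequence lengths shown), and it ignores strand when merging. The `Hit` type and `merge_hits` function are hypothetical names introduced for this example; they are not part of FIMO or of this database.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    """One row of the motif table (hypothetical container for this sketch)."""
    tfbs: str
    start: int   # assumed 1-based, inclusive
    end: int     # assumed 1-based, inclusive
    strand: str
    score: float


# Rows copied from the table above.
HITS = [
    Hit("BPC5", 1021, 1050, "+", 41.99),
    Hit("BPC1", 1022, 1045, "+", 37.48),
    Hit("BPC5", 1019, 1048, "+", 37.43),
    Hit("BPC1", 1028, 1051, "+", 36.42),
    Hit("BPC6", 1001, 1021, "-", 36.12),
    Hit("BPC1", 1024, 1047, "+", 36.02),
    Hit("BPC1", 1026, 1049, "+", 35.82),
    Hit("BPC5", 1023, 1052, "+", 34.78),
    Hit("BPC1", 1020, 1043, "+", 34.03),
    Hit("BPC1", 1030, 1053, "+", 33.82),
]


def merge_hits(hits):
    """Merge hits whose coordinate ranges overlap, ignoring strand.

    Returns a list of (start, end, sorted motif names) tuples.
    """
    merged = []
    for hit in sorted(hits, key=lambda h: h.start):
        if merged and hit.start <= merged[-1][1]:
            # Overlaps the current region: extend it and record the motif.
            merged[-1][1] = max(merged[-1][1], hit.end)
            merged[-1][2].add(hit.tfbs)
        else:
            merged.append([hit.start, hit.end, {hit.tfbs}])
    return [(start, end, sorted(names)) for start, end, names in merged]


regions = merge_hits(HITS)
for start, end, names in regions:
    print(f"{start}-{end}: {', '.join(names)}")
# prints "1001-1053: BPC1, BPC5, BPC6"
```

All ten hits collapse into a single region, which suggests they are best read as one low-complexity GA/TC repeat matched by several related motifs rather than as ten independent binding sites.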
TFBS enrichment in GRCh38
Use Fisher's exact test to test whether transcription factor binding sites are enriched in copies of the TE family in GRCh38.
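A minimal sketch of how such an enrichment test can be computed on a 2x2 contingency table, using only the Python standard library. The page does not say how the table is constructed or which alternative hypothesis is used, so the layout below (TE family copies vs. background, with vs. without a binding site) and the one-sided "greater" alternative are assumptions. The function name `fisher_exact_greater` is hypothetical, and the counts in the usage line are a toy example, not values from this database.

```python
from math import comb


def fisher_exact_greater(a: int, b: int, c: int, d: int) -> float:
    """One-sided Fisher's exact test p-value (enrichment).

    Assumed 2x2 layout:
                        with TFBS   without TFBS
        TE family           a            b
        background          c            d

    Returns P(X >= a), where X follows the hypergeometric
    distribution with the table margins held fixed.
    """
    row1 = a + b            # number of TE family elements
    col1 = a + c            # number of elements with a TFBS
    n = a + b + c + d       # total elements
    k_max = min(row1, col1)
    # Sum hypergeometric probabilities for tables at least as extreme.
    numerator = sum(
        comb(col1, k) * comb(n - col1, row1 - k)
        for k in range(a, k_max + 1)
    )
    return numerator / comb(n, row1)


# Toy counts for illustration only (not real data).
p_example = fisher_exact_greater(3, 1, 1, 3)
print(f"{p_example:.4f}")  # prints "0.2429" (exactly 17/70)
```

Exact integer arithmetic via `math.comb` avoids floating-point underflow for large genome-scale counts; the single division at the end converts the exact ratio to a float.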