HERV17
Basic information Differential Expression Stage analysis Survival analysis Correlation analysisDF ID | DF0000628 |
---|---|
TE superfamily | ERV1 |
TE class | LTR |
Species | Catarrhini |
Length | 8839 |
Kimura value | 4.20 |
Tau index | 0.9778 |
Description | Internal sequence of the class I endogenous retrovirus |
Comment | Internal sequence of HERV17 (HERV-W) flanked by LTR17 long terminal repeats. Update on existing entry. Most copies in the seed alignment are processed pseudogenes of ERV transcripts |
Sequence |
TTTGGTGGCCCACGAAGGGACTCTCCAAAGCGGTGAGTAATATTGGACCACTTTCGCTTGCTATTCTGTCCTATCCTTCCTTAGAATTGGAGGAAAATACCGGGCACCTGTCGGCCGGTTAAAAACGATTAGCGTGGCCGCCGGACTTAAGACTCAGGTGTGAGGCTNTCTGGGGAAGGGCTTTCTAACAACCCCCAACCCTTCTGGGTTGGGAGCGTTGGTCTGCCTGGAACCAGCTTCCGCTTTCAATTTTCCTGGGGGAAGCCGAGGGCCGACTAGAGGCAGAAAGCTGTCGTCCCGAACTCCCGGCATTAGCCGGTTGAGATCATGGCGCAGCCAGAAGTCTCTACTCAACAGTCGCCCATGCGTGCGCCCCTACCTTTCCTTCTGACCCATACCTCCTGGGTCCCGACCACGACTTTCTTGAAAGTGTAGCCCCAAAATTCTCCTTACCTCTGAATCTACTTCCTCCGATCCCTGCCTCCTAGGTACTAATGGTTCAGACTTTCATTTCCTCTCCCAAGTATTAGAGCAAGTTGTATCTCCAAAGGGATCTAAGGAAGCTCTACGCTGCGTCCTTAGGCACCTAGGCTATGAACCCAGGGAGTCTTGTCCCTGGTGTCCCTCCCGATTTAGGTATACAGCTCTCGACATGGGCAGTTATGTGGGACCCGTTCCCCACCACCCTTGCCAGGGCCCCAAGTTTGTAAATGGCTAAGAGGATTGCTCTCCCATTGTGTAAGATGCTCTCCTCCCCCAATTTCTACCCAGCTTACCCCTCTGCAATACAATCTCCAAGCCTTGGCTCCTTGGCCAGGGCCTTAGAACTGATGACCCAGTACTTTAACAACTGGAACTGGGTCTACGACAACATAATAGATCAGGATGAAAGCGAATTGAGTAAATTAAAGGGAGGCGCATATTCCTATAGTGGCAAATGGGGGCAACGAGCGAACGTCCTTCCGCTGTGTTTCCAAAATCCATCTACAGAGACAGAGAGGAGAGAGAGAGAGAGAGAGAAGAGAGAGAGAGAAAGAGAGAAAGAGAGAGATAGAAGTAGTAAAGAAAAAACAGTGTGCCCTATTCCTTTAAAAGCCAGGGTAAATTTAAAACCTATAATTGATAATTGAAGGTCTTCTCCGTGACCCTATAACACTCCAATACTACCTTGTTGTCAGTGTAAACAAGGGCGTAGCCCGAAAGCACTGAGACCACTGACAACCCGTAGCCTTCCTATCAAAAATCCTTAACCCAGTAACCCGCGGATGGCCCAAATGCATTCAATCTGTAGCGGCAACTGCTTTGCTAACAGAAGAAAGTAGAAAAGTAACTTTTAGAGGAAACCTCATTGTGAGCACACCTCACCAGTTCAGAACTATCCTAAGTCAAAAAAGCAAAAAGGTAGCTTACTAACTCAAAAATCTTAAAGTATGGGGCTATTCTGTTAGAAAAAGGTGATTTAACACTAACCACTGAAAATTCCCTTAACCCAGCAGATTTCCTAACAGGGGATTTAAATCTTAATTACCATACAAAGGTCCGACCAGACCTAGGAGGAACTCCCTTCAGGACAGGACGATAGATGGTTCCTCCCAGGTGATTGAGGAAAAAACCACAATGGGTATTCAGTAATTGATAGGGAGACTCTTGTGGAAGCAGAGTTAGGAAAATTGCCTAATAATTGGTCTGCTCAAACGTGCGAGCTGTTTGCACTCAGCCAAGCCTTAAAGTACTTACAGAATCAAAAAACTCTATCTCAATCCTGACTCAAAAGGTTACCTACACCCTCTCTGAAACGAATTTGCATAAGAACTGTTGTTTATGGGAATGCATCTTGATGGGGCAGCTGGGTTGTTATGAAATACTCAGGAACCCAGCCCAGCTCTAGGACTCACCCCTGAGCGCAAAGGCAATGTTGGGCACGCTGGTAAAGGACCACTAGAATCCAGCAGCCCGGACCCCTTTCTTTGTGGTCAAGAAAGGCGGGAAAACGGGTGCAGGACTGCTACATCGGTGAGCGTAACTAATCCGATAAGCAGAGGTCCATGGGTGGTTACGCACCCTGGAAAGGAATAAGCATTAGGACCATAGAGGACGCTCTAGGACTAATGCTCATCGGAAAATGACTAGGGGTGCTGGCATCCCTATGTTCTTTTTTCAGATGGGAAACGTTCCCCCCAAGGCAAAAACGCCCCTAAGATGTATTCTGGAGAATTNGGNCCAGTTTGACCCTCAGACGCTAAGAAAGAAACGACTTATATTCTTCTGCAGTACCGCCTGGCCACGATATCCTCTTCAAGGGGGAGAAACCTGGCCTCCTGAGGGAAGTATAAATTATAACACCATCTTACAGCTAGACCTCTTTTGTAGAAAAGAAGGCAAATGGAGTGAAGTGCCATATGTACAAACTTTCTTTTCATTAAGAGACAACTCGCAATTATGTAAAAAGTGTGATTTATGCCCTACAGGAAGCCCTCAGAGTCTACCTCCCTACCCCGGCGTCCCCCCGACTCCTTCCCCAACTAATAAGGACCCCCCTTCAACCCAAACGGTCCAAAAGGAGATAGACAAAGGGGTAAACAATGAACCAAAGAGTGCCAATATTCCCCGATTATGCCCCCTCCAAGCGGTGGGAGGAGGAGAATTCGGCCCAGCCAGAGTGCATGTACCTTTTTCCCTCTCAGACTTGAAGCAAATTAAAATAGACCTAGGTAAATTCTCAGATAACCCTGATGGCTATATTGATGTTTTACAAGGGTTAGGACAATCCTTTGATCTGACATGGAGAGATATAATGTTACTGCTAGATCAGACACTAACCCCAAATGAGAGAAGTGCCGCCATAACTGCAGCCCGAGAGTTTGGCGATCTCTGGTATCTCAGTCAGGTCAATGATAGGATGACAACAGAGGAAAGAGAACGATTCCCCACAGGCCAGCAGGCAGTTCCCAGTGTAGACCCTCACTGGGACGCAGAATCAGAACATGGAGATTGGTGCCGCAGACATTTGCTAACTTGCGTGCTAGAAGGACTAAGGAAAACTAGGAAGAAGCCTATGAATTATTCAATGATGTCCACTATAACACAGGGAAAGGAAGAAAATCCTACTGCCTTTCTGGAGAGACTAAGGGAGGCATTGAGGAAGCATACCTCTCTGTCACCTGACTCTATTGAAGGCCAACTAATCTTAAAGGATAAGTTTATCACTCAGTCAGCTGCAGACATTAGAAAAAAACTTCAAAAGTCCGCCTTAGGCCCGGAGCAAAACTTAGAAACCCTATTGAACTTGGCAACCTCGGTTTTTTATAATAGAGATCAGGAGGAGCAGGCGGAACGGGACAAACGGGATAAGAAAAAAAAAGGCCACCGCTTTAGTCATGGCCCTCAGGCAAGCGGACTTTGGAGGCTCTGGAAAAGGGAAAGGCTGGGCAAATCGAATGCCTAATAGGGCTTGCTTCCAGTGCGGTCTACAAGGACACTTTAAAAAAGATTGTCCGAATAGAAATAAGCCGCCCCCTCGTCCATGCCCCTTATGTCAAGGGAATCACTGGAAGGCCCACTGCCCCAGGGGACGAAGGTCCTCTGAGTCAGAAGCCACTAACCAGATGATCCAGCAGCAGGACTGAGGGTGCCCGGGGCAAGCGCCAGCCCATGCCATCACCCTCACAGAGCCCCGGGTATGCTTGACCATTGAGGGCCAGGAGGTTAACTGTCTCCTGGACACTGGCGCGGCCTTCTCAGTCTTACTCTCCTGTCCCGGACAACTGTCCTCCAGATCTGTCACTATCCGAGGGGTCCTAGGACAGCCAGTCACTAGATACTTCTCCCAGCCACTAAGTTGTGACTGGGGAACTTTACTCTTTTCACATGCCTTTCTAATTATGCCTGAAAGCCCCACTCCCTTGTTAGGGAGAGACATTCTAGCAAAAGCAGGGGCCATTATACACCTGAACATAGGAGAAGGAACACCCGTTTGTTGTCCCCTGCTTGAGGAAGGAATTAATCCTGAAGTCTGGGCAACAGAAGGACAATATGGACGAGCAAAGAATGCCCGTCCTGTTCAAGTTAAACTAAAGGATTCCGCCTCCTTTCCCTACCAAAGGCAGTACCCCCTTAGACCCGAGGCCCAACAAGGACTCCAAAAGATTGTTAAGGACCTAAAAGCCCAAGGCCTAGTAAAACCATGCAATAGCCCCTGCAATACTCCAATTTTAGGAGTACAGAAACCCAACGGACAGTGGAGGTTAGTGCAAGATCTCAGGATTATCAATGAGGCCGTTGTCCCTCTATACCCAGCTGTACCTAACCCTTATACTCTGCTTTCCCAAATACCAGAGGAAGCAGAGTGGTTTACAGTCCTGGACCTTAAGGATGCCTTTTTCTGCATCCCTGTACATCCTGACTCTCAATTCTTGTTTGCCTTTGAAGATCCTTCGAACCCAACGTCTCAACTCACCTGGACTGTTTTACCCCAAGGGTTCAGGGATAGCCCCCATCTATTTGGCCAGGCATTAGCCCAAGACTTGAGCCAGTTCTCATACCTGGACACTCTTGTCCTTCGGTACGTGGATGATTTACTTTTAGCCGCCCGTTCAGAAACCTTGTGCCATCAAGCCACCCAAGCGCTCTTAAATTTCCTCGCCACCTGTGGCTACAAGGTTTCCAAACCAAAGGCTCAGCTCTGCTCACAGCAGGTTAAATACTTAGGGCTAAAATTATCCAAAGGCACCAGGGCCCTCAGTGAGGAACGTATCCAGCCTATACTGGCTTATCCTCATCCCAAAACCCTAAAGCAACTAAGAGGGTTCCTTGGCATAACAGGCTTCTGCCGAATATGGATTCCCAGGTACGGCGAAATAGCCAGGCCATTATATACACTAATTAAGGAAACTCAGAAAGCCAATACCCATTTAGTAAGATGGACACCTGAAGCAGAAGCGGCTTTCCAGGCCCTAAAGAAGGCCCTAACCCAAGCCCCAGTGTTAAGCTTGCCAACGGGGCAAGACTTTTCTTTATATGTCACAGAAAAAACAGGAATAGCTCTAGGAGTCCTTACACAGGTCCGAGGGACGAGCTTGCAACCCGTGGCATACCTGAGTAAGGAAATTGATGTAGTGGCAAAGGGTTGGCCTCATTGTTTACGGGTAGTGGCGGCAGTAGCAGTCTTAGTATCTGAAGCAGTTAAAATAATACAGGGAAGAGATCTTACTGTGTGGACATCTCATGATGTGAACGGCATACTCACTGCTAAAGGAGACTTGTGGCTGTCAGACAACCGTTTACTTAAATATCAGGCTCTATTACTTGAAGGGCCAGTGCTGCGACTGCGCACTTGTGCAACTCTTAACCCAGCCACATTTCTTCCAGACAATGAAGAAAAGATAGAACATAACTGTCAACAAGTAATTGCTCAAACCTACGCCGCTCGAGGGGACCTTCTAGAGGTTCCCTTGACTGATCCCGACCTCAACTTGTATACTGATGGAAGTTCCTTTGTAGAAAAAGGACTTCGAAAAGCGGGGTATGCAGTGGTCAGTGATAATGGAATACTTGAAAGTAATCCCCTCACTCCAGGAACTAGCGCTCAGCTGGCAGAACTAATAGCCCTCACTCGGGCACTAGAATTAGGAGAAGGAAAAAGGGTAAATATATATACAGACTCTAAGTATGCTTACCTAGTCCTCCATGCCCACGCAGCAATATGGAGAGAAAGGGAATTCCTAACTTCCGAGGGAACACCTATCAAACATCAGGAAGCCATTAGGAGATTATTATTGGCTGTACAGAAACCTAAAGAGGTGGCAGTCTTACACTGCCGGGGTCATCAGAAAGGAAAGGAAAGGGAAATAGAAGGGAACCGCCAAGCGGATATTGAAGCCAAAAGAGCCGCAAGGCGGGACCCTCCATTAGAAATGCTTATAGAAGGACCCCTAGTATGGGGTAATCCCCTCCGGGAAACCAAGCCCCAGTACTCAGCAGAAGAAATAGAATGGGGAACCTCACGAGGACATAGTTTCCTCCCCTCAGGATGGCTAGCCACCGAAGAAGGAAAAATACTTTTGCCTGCAGCTAACCAATGGAAATTACTTAAAACCCTTCACCAAACCTTTCACTTAGGCATTGATAGCACCCATCAGATGGCCAAATCATTATTTACTGGACCAGGCCTTTTCAAAACTATCAAGCAGATAGTCAGGGCCTGTGAAGTGTGCCAAAGAAATAATCCCCTGCACTTATCGCCAAGCTCCTTCAGGAGAACAAAGAACAGGCCATTACCCAGGAGAAGACTGGCAACTAGATTTTACCCACATGCCCAAATCTCAGGGATTTCAGTATCTACTAGTCTGGGTAGATACTTTCACTGGTTGGGCGGAGGCCTTCCCTTGTAGGACAGAAAAGGCCCAAGAGGTAATAAAGGCACTAGTTCATGAAATAATTCCCAGATTCGGACTTCCCCGAGGCTTACAGAGTGACAATGGCCCCGCTTTCAAGGCTGCAGTAACCCAGGGAGTATCCCAGGCGTTAGGCATACAATATCACTTACACTGCGCCTGGAGGCCACAATCCTCAGGAAAAGTCGAGAAAATGAACGAAACACTCAAACGACATCTAAAAAAGCTAACCCAGGAAACCCACCTCGCATGGCCTGCTCTGTTGCCTATAGCCTTACTAAGAATCCGAAACTCTCCCCAAAAAGCGGGACTTAGCCCATACGAAATGCTGTATGGACGGCCCTTCCTAACCAATGACCTTGTGCTTGACCGAGAGACGGCCAACTTAGTTGCAGACATCACCTCCTTAGCCAAATATCAACAAGTTCTTAAAACATTACAGGGAACCTGTCCCCGAGAGGAGGGAAAGGAATTATTCCACCCTGGTGACATGGTATTAGTCAAGTCCCTTCCCTCTAATTCCCCATCCCTAGATACATCCTGGGAAGGACCCTACCCAGTCATTTTATCTACCCCAACCGCGGTTAAAGTGGCTGGAGTGGAGTCTTGGATACATCACACTCGAGTCAAACCCTGGATACTGCCAAAGGAACCCGAAAATCCAGGAGACAACGCTAGCTATTCCTGTGAACCTCTAGAGGATCTGCGCCTGCTCTTCAAGCGACAACCGTGAGGAAAGTAACTAGAATCGTAGATCCCCATGGCCCTCCCTTGTCATATTTTTCTCTTTACTGTTCTCTTACCCCCTTTCACTCTCACTGCACCCCCTCCATGCCGCTGTACTACCAGTAGCTCCCCTTACCAAGAGCTTCTATGGAGAATGCGGCTTCCCGGAAATATTGATGCCCCATCGTATAGGAGTTTTTCTAAAGGAAACCCCACTTTCACCGCCCACACCCATATGCCCCGCAACTGCTATAACTCTGCCACTCTTTGCATGCATGCAAATACTCATTATTGGACAGGGAAAATGATTAATCCTAGTTGTCCTGGAGGACTTGGAGCCACTGTCTGTTGGACTTACTTCACCCATACCGGTATGTCTGATGGGGGTGGAGTTCAAGATCAGGCAAGAGAAAAACACGTAAAGGAAGTAATCTCCCAACTGACCCGGGTACATAGCACCCCTAGCCCCTACAAAGGACTAGATCTCTCAAAACTACATGAAACCCTCCGTACCCATACTCGCCTGGTAAGCCTATTTAATACCACCCTCACTGGGCTCCATGAGGTCTCGGCCCAAAACCCTACTAACTGTTGGATGTGCCTCCCCCTGCACTNCAGGCCATACATTTCAATCCCTGTATCTGAACAATGGAACAACTTCAGCACAGAAATAAACACCACTTCCGTTTTAGTAGGACCTCTTGTTTCCAATCTGGAAATAACCCATACCTCAAACCTCACCTGTGTAAAATTTAGCAATACTATAGACACAACCAACTCCCAATGCATCAGGTGGGTAACTCCTCCCACACGAATAGTCTGCCTACCCTCAGGAATATTTTTTGTCTGTGGTACCTCAGCCTATCGTTGTTTGAATGGCTCTTCAGAATCTATGTGCTTCCTCTCATTCTTAGTGCCCCCTATGACCATCTACACTGAACAAGATTTATACAATCATGTCGTACCTAAGCCCCGCAACAAAAGAGTACCCATTCTTCCTTTTGTTATCGGAGCAGGAGTGCTAGGCGGACTAGGTACTGGCATTGGCGGTATCACAACCTCTACTCAGTTCTACTACAAACTATCTCAAGAACTAAATGGTGACATGGAACGGGTCGCCGACTCCCTGGTCACCTTGCAAGATCAACTTAACTCCCTAGCAGCAGTAGTCCTTCAAAATCGAAGAGCTTTAGACTTGCTAACCGCCGAAAGAGGGGGAACCTGTTTATTTTTAGGGGAAGAATGCTGTTATTATGTTAATCAATCCGGAATCGTCACCGAGAAAGTTAAAGAAATTCGAGATCGAATACAACGTAGAGCAGAGGAGCTTCAAAACACCGGACCCTGGGGCCTCCTCAGCCAATGGATGCCCTGGATTCTCCCCTTCTTAGGACCTCTAGCAGCTATAATATTGTTACTCCTCTTTGGACCCTGTATCTTTAACCTCCTTGTTAAGTTTGTCTCTTCCAGAATCGAAGCTGTAAAACTACAAATCGTTCTTCAAATGGAGCCCCAGATGCAGTCCATGACTAAGATCTACCGCGGACCCCTGGACCGGCCTGCTAGCCCATGCTCCGATGTTAATGACATCGAAGGCACCCCTCCCGAGGAAATCTCAACTGCACGACCCCTACTACGCCCCAATTCAGCAGGAAGCAGTTAGAGCGGTCGTCGGCCAACCTCCCCAACAGCACTTGGGTTTTCCTGTTGAGAGGGGGATC
|
TF motifs of the concenus sequence
Use FIMO to detect transcription factor motifs in the concenus sequence of the TE family.
TE_family | TFBS | Start | End | Strand | Score | Matched sequence |
---|---|---|---|---|---|---|
HERV17 | BPC6 | 1030 | 1050 | - | 27.83 | CTCTCTCTTTCTCTCTTTCTC |
HERV17 | BPC6 | 1003 | 1023 | - | 27.80 | TCTTCTCTCTCTCTCTCTCTC |
HERV17 | BPC6 | 1036 | 1056 | - | 27.38 | TTCTATCTCTCTCTTTCTCTC |
HERV17 | BPC1 | 1018 | 1041 | + | 27.12 | AGAAGAGAGAGAGAGAAAGAGAGA |
HERV17 | BPC1 | 1032 | 1055 | + | 26.55 | GAAAGAGAGAAAGAGAGAGATAGA |
HERV17 | BPC1 | 997 | 1020 | + | 26.39 | AGAGGAGAGAGAGAGAGAGAGAGA |
HERV17 | RAMOSA1 | 1001 | 1014 | + | 25.64 | GAGAGAGAGAGAGA |
HERV17 | RAMOSA1 | 1003 | 1016 | + | 25.64 | GAGAGAGAGAGAGA |
HERV17 | RAMOSA1 | 1005 | 1018 | + | 25.64 | GAGAGAGAGAGAGA |
HERV17 | RAMOSA1 | 1007 | 1020 | + | 25.64 | GAGAGAGAGAGAGA |
TFBS enrichment in GRCh38
Use Fisher's exact test to perform enrichment analysis of transcription factor binding sites in the TE family of GRCh38.