HERV17
Basic information Differential Expression Stage analysis Survival analysis Correlation analysisDF ID | DF0000628 |
---|---|
TE superfamily | ERV1 |
TE class | LTR |
Species | Catarrhini |
Length | 8839 |
Kimura value | 4.20 |
Tau index | 0.9778 |
Description | Internal sequence of the class I endogenous retrovirus |
Comment | Internal sequence of HERV17 (HERV-W) flanked by LTR17 long terminal repeats. Update on existing entry. Most copies in the seed alignment are processed pseudogenes of ERV transcripts |
Sequence |
TTTGGTGGCCCACGAAGGGACTCTCCAAAGCGGTGAGTAATATTGGACCACTTTCGCTTGCTATTCTGTCCTATCCTTCCTTAGAATTGGAGGAAAATACCGGGCACCTGTCGGCCGGTTAAAAACGATTAGCGTGGCCGCCGGACTTAAGACTCAGGTGTGAGGCTNTCTGGGGAAGGGCTTTCTAACAACCCCCAACCCTTCTGGGTTGGGAGCGTTGGTCTGCCTGGAACCAGCTTCCGCTTTCAATTTTCCTGGGGGAAGCCGAGGGCCGACTAGAGGCAGAAAGCTGTCGTCCCGAACTCCCGGCATTAGCCGGTTGAGATCATGGCGCAGCCAGAAGTCTCTACTCAACAGTCGCCCATGCGTGCGCCCCTACCTTTCCTTCTGACCCATACCTCCTGGGTCCCGACCACGACTTTCTTGAAAGTGTAGCCCCAAAATTCTCCTTACCTCTGAATCTACTTCCTCCGATCCCTGCCTCCTAGGTACTAATGGTTCAGACTTTCATTTCCTCTCCCAAGTATTAGAGCAAGTTGTATCTCCAAAGGGATCTAAGGAAGCTCTACGCTGCGTCCTTAGGCACCTAGGCTATGAACCCAGGGAGTCTTGTCCCTGGTGTCCCTCCCGATTTAGGTATACAGCTCTCGACATGGGCAGTTATGTGGGACCCGTTCCCCACCACCCTTGCCAGGGCCCCAAGTTTGTAAATGGCTAAGAGGATTGCTCTCCCATTGTGTAAGATGCTCTCCTCCCCCAATTTCTACCCAGCTTACCCCTCTGCAATACAATCTCCAAGCCTTGGCTCCTTGGCCAGGGCCTTAGAACTGATGACCCAGTACTTTAACAACTGGAACTGGGTCTACGACAACATAATAGATCAGGATGAAAGCGAATTGAGTAAATTAAAGGGAGGCGCATATTCCTATAGTGGCAAATGGGGGCAACGAGCGAACGTCCTTCCGCTGTGTTTCCAAAATCCATCTACAGAGACAGAGAGGAGAGAGAGAGAGAGAGAGAAGAGAGAGAGAGAAAGAGAGAAAGAGAGAGATAGAAGTAGTAAAGAAAAAACAGTGTGCCCTATTCCTTTAAAAGCCAGGGTAAATTTAAAACCTATAATTGATAATTGAAGGTCTTCTCCGTGACCCTATAACACTCCAATACTACCTTGTTGTCAGTGTAAACAAGGGCGTAGCCCGAAAGCACTGAGACCACTGACAACCCGTAGCCTTCCTATCAAAAATCCTTAACCCAGTAACCCGCGGATGGCCCAAATGCATTCAATCTGTAGCGGCAACTGCTTTGCTAACAGAAGAAAGTAGAAAAGTAACTTTTAGAGGAAACCTCATTGTGAGCACACCTCACCAGTTCAGAACTATCCTAAGTCAAAAAAGCAAAAAGGTAGCTTACTAACTCAAAAATCTTAAAGTATGGGGCTATTCTGTTAGAAAAAGGTGATTTAACACTAACCACTGAAAATTCCCTTAACCCAGCAGATTTCCTAACAGGGGATTTAAATCTTAATTACCATACAAAGGTCCGACCAGACCTAGGAGGAACTCCCTTCAGGACAGGACGATAGATGGTTCCTCCCAGGTGATTGAGGAAAAAACCACAATGGGTATTCAGTAATTGATAGGGAGACTCTTGTGGAAGCAGAGTTAGGAAAATTGCCTAATAATTGGTCTGCTCAAACGTGCGAGCTGTTTGCACTCAGCCAAGCCTTAAAGTACTTACAGAATCAAAAAACTCTATCTCAATCCTGACTCAAAAGGTTACCTACACCCTCTCTGAAACGAATTTGCATAAGAACTGTTGTTTATGGGAATGCATCTTGATGGGGCAGCTGGGTTGTTATGAAATACTCAGGAACCCAGCCCAGCTCTAGGACTCACCCCTGAGCGCAAAGGCAATGTTGGGCACGCTGGTAAAGGACCACTAGAATCCAGCAGCCCGGACCCCTTTCTTTGTGGTCAAGAAAGGCGGGAAAACGGGTGCAGGACTGCTACATCGGTGAGCGTAACTAATCCGATAAGCAGAGGTCCATGGGTGGTTACGCACCCTGGAAAGGAATAAGCATTAGGACCATAGAGGACGCTCTAGGACTAATGCTCATCGGAAAATGACTAGGGGTGCTGGCATCCCTATGTTCTTTTTTCAGATGGGAAACGTTCCCCCCAAGGCAAAAACGCCCCTAAGATGTATTCTGGAGAATTNGGNCCAGTTTGACCCTCAGACGCTAAGAAAGAAACGACTTATATTCTTCTGCAGTACCGCCTGGCCACGATATCCTCTTCAAGGGGGAGAAACCTGGCCTCCTGAGGGAAGTATAAATTATAACACCATCTTACAGCTAGACCTCTTTTGTAGAAAAGAAGGCAAATGGAGTGAAGTGCCATATGTACAAACTTTCTTTTCATTAAGAGACAACTCGCAATTATGTAAAAAGTGTGATTTATGCCCTACAGGAAGCCCTCAGAGTCTACCTCCCTACCCCGGCGTCCCCCCGACTCCTTCCCCAACTAATAAGGACCCCCCTTCAACCCAAACGGTCCAAAAGGAGATAGACAAAGGGGTAAACAATGAACCAAAGAGTGCCAATATTCCCCGATTATGCCCCCTCCAAGCGGTGGGAGGAGGAGAATTCGGCCCAGCCAGAGTGCATGTACCTTTTTCCCTCTCAGACTTGAAGCAAATTAAAATAGACCTAGGTAAATTCTCAGATAACCCTGATGGCTATATTGATGTTTTACAAGGGTTAGGACAATCCTTTGATCTGACATGGAGAGATATAATGTTACTGCTAGATCAGACACTAACCCCAAATGAGAGAAGTGCCGCCATAACTGCAGCCCGAGAGTTTGGCGATCTCTGGTATCTCAGTCAGGTCAATGATAGGATGACAACAGAGGAAAGAGAACGATTCCCCACAGGCCAGCAGGCAGTTCCCAGTGTAGACCCTCACTGGGACGCAGAATCAGAACATGGAGATTGGTGCCGCAGACATTTGCTAACTTGCGTGCTAGAAGGACTAAGGAAAACTAGGAAGAAGCCTATGAATTATTCAATGATGTCCACTATAACACAGGGAAAGGAAGAAAATCCTACTGCCTTTCTGGAGAGACTAAGGGAGGCATTGAGGAAGCATACCTCTCTGTCACCTGACTCTATTGAAGGCCAACTAATCTTAAAGGATAAGTTTATCACTCAGTCAGCTGCAGACATTAGAAAAAAACTTCAAAAGTCCGCCTTAGGCCCGGAGCAAAACTTAGAAACCCTATTGAACTTGGCAACCTCGGTTTTTTATAATAGAGATCAGGAGGAGCAGGCGGAACGGGACAAACGGGATAAGAAAAAAAAAGGCCACCGCTTTAGTCATGGCCCTCAGGCAAGCGGACTTTGGAGGCTCTGGAAAAGGGAAAGGCTGGGCAAATCGAATGCCTAATAGGGCTTGCTTCCAGTGCGGTCTACAAGGACACTTTAAAAAAGATTGTCCGAATAGAAATAAGCCGCCCCCTCGTCCATGCCCCTTATGTCAAGGGAATCACTGGAAGGCCCACTGCCCCAGGGGACGAAGGTCCTCTGAGTCAGAAGCCACTAACCAGATGATCCAGCAGCAGGACTGAGGGTGCCCGGGGCAAGCGCCAGCCCATGCCATCACCCTCACAGAGCCCCGGGTATGCTTGACCATTGAGGGCCAGGAGGTTAACTGTCTCCTGGACACTGGCGCGGCCTTCTCAGTCTTACTCTCCTGTCCCGGACAACTGTCCTCCAGATCTGTCACTATCCGAGGGGTCCTAGGACAGCCAGTCACTAGATACTTCTCCCAGCCACTAAGTTGTGACTGGGGAACTTTACTCTTTTCACATGCCTTTCTAATTATGCCTGAAAGCCCCACTCCCTTGTTAGGGAGAGACATTCTAGCAAAAGCAGGGGCCATTATACACCTGAACATAGGAGAAGGAACACCCGTTTGTTGTCCCCTGCTTGAGGAAGGAATTAATCCTGAAGTCTGGGCAACAGAAGGACAATATGGACGAGCAAAGAATGCCCGTCCTGTTCAAGTTAAACTAAAGGATTCCGCCTCCTTTCCCTACCAAAGGCAGTACCCCCTTAGACCCGAGGCCCAACAAGGACTCCAAAAGATTGTTAAGGACCTAAAAGCCCAAGGCCTAGTAAAACCATGCAATAGCCCCTGCAATACTCCAATTTTAGGAGTACAGAAACCCAACGGACAGTGGAGGTTAGTGCAAGATCTCAGGATTATCAATGAGGCCGTTGTCCCTCTATACCCAGCTGTACCTAACCCTTATACTCTGCTTTCCCAAATACCAGAGGAAGCAGAGTGGTTTACAGTCCTGGACCTTAAGGATGCCTTTTTCTGCATCCCTGTACATCCTGACTCTCAATTCTTGTTTGCCTTTGAAGATCCTTCGAACCCAACGTCTCAACTCACCTGGACTGTTTTACCCCAAGGGTTCAGGGATAGCCCCCATCTATTTGGCCAGGCATTAGCCCAAGACTTGAGCCAGTTCTCATACCTGGACACTCTTGTCCTTCGGTACGTGGATGATTTACTTTTAGCCGCCCGTTCAGAAACCTTGTGCCATCAAGCCACCCAAGCGCTCTTAAATTTCCTCGCCACCTGTGGCTACAAGGTTTCCAAACCAAAGGCTCAGCTCTGCTCACAGCAGGTTAAATACTTAGGGCTAAAATTATCCAAAGGCACCAGGGCCCTCAGTGAGGAACGTATCCAGCCTATACTGGCTTATCCTCATCCCAAAACCCTAAAGCAACTAAGAGGGTTCCTTGGCATAACAGGCTTCTGCCGAATATGGATTCCCAGGTACGGCGAAATAGCCAGGCCATTATATACACTAATTAAGGAAACTCAGAAAGCCAATACCCATTTAGTAAGATGGACACCTGAAGCAGAAGCGGCTTTCCAGGCCCTAAAGAAGGCCCTAACCCAAGCCCCAGTGTTAAGCTTGCCAACGGGGCAAGACTTTTCTTTATATGTCACAGAAAAAACAGGAATAGCTCTAGGAGTCCTTACACAGGTCCGAGGGACGAGCTTGCAACCCGTGGCATACCTGAGTAAGGAAATTGATGTAGTGGCAAAGGGTTGGCCTCATTGTTTACGGGTAGTGGCGGCAGTAGCAGTCTTAGTATCTGAAGCAGTTAAAATAATACAGGGAAGAGATCTTACTGTGTGGACATCTCATGATGTGAACGGCATACTCACTGCTAAAGGAGACTTGTGGCTGTCAGACAACCGTTTACTTAAATATCAGGCTCTATTACTTGAAGGGCCAGTGCTGCGACTGCGCACTTGTGCAACTCTTAACCCAGCCACATTTCTTCCAGACAATGAAGAAAAGATAGAACATAACTGTCAACAAGTAATTGCTCAAACCTACGCCGCTCGAGGGGACCTTCTAGAGGTTCCCTTGACTGATCCCGACCTCAACTTGTATACTGATGGAAGTTCCTTTGTAGAAAAAGGACTTCGAAAAGCGGGGTATGCAGTGGTCAGTGATAATGGAATACTTGAAAGTAATCCCCTCACTCCAGGAACTAGCGCTCAGCTGGCAGAACTAATAGCCCTCACTCGGGCACTAGAATTAGGAGAAGGAAAAAGGGTAAATATATATACAGACTCTAAGTATGCTTACCTAGTCCTCCATGCCCACGCAGCAATATGGAGAGAAAGGGAATTCCTAACTTCCGAGGGAACACCTATCAAACATCAGGAAGCCATTAGGAGATTATTATTGGCTGTACAGAAACCTAAAGAGGTGGCAGTCTTACACTGCCGGGGTCATCAGAAAGGAAAGGAAAGGGAAATAGAAGGGAACCGCCAAGCGGATATTGAAGCCAAAAGAGCCGCAAGGCGGGACCCTCCATTAGAAATGCTTATAGAAGGACCCCTAGTATGGGGTAATCCCCTCCGGGAAACCAAGCCCCAGTACTCAGCAGAAGAAATAGAATGGGGAACCTCACGAGGACATAGTTTCCTCCCCTCAGGATGGCTAGCCACCGAAGAAGGAAAAATACTTTTGCCTGCAGCTAACCAATGGAAATTACTTAAAACCCTTCACCAAACCTTTCACTTAGGCATTGATAGCACCCATCAGATGGCCAAATCATTATTTACTGGACCAGGCCTTTTCAAAACTATCAAGCAGATAGTCAGGGCCTGTGAAGTGTGCCAAAGAAATAATCCCCTGCACTTATCGCCAAGCTCCTTCAGGAGAACAAAGAACAGGCCATTACCCAGGAGAAGACTGGCAACTAGATTTTACCCACATGCCCAAATCTCAGGGATTTCAGTATCTACTAGTCTGGGTAGATACTTTCACTGGTTGGGCGGAGGCCTTCCCTTGTAGGACAGAAAAGGCCCAAGAGGTAATAAAGGCACTAGTTCATGAAATAATTCCCAGATTCGGACTTCCCCGAGGCTTACAGAGTGACAATGGCCCCGCTTTCAAGGCTGCAGTAACCCAGGGAGTATCCCAGGCGTTAGGCATACAATATCACTTACACTGCGCCTGGAGGCCACAATCCTCAGGAAAAGTCGAGAAAATGAACGAAACACTCAAACGACATCTAAAAAAGCTAACCCAGGAAACCCACCTCGCATGGCCTGCTCTGTTGCCTATAGCCTTACTAAGAATCCGAAACTCTCCCCAAAAAGCGGGACTTAGCCCATACGAAATGCTGTATGGACGGCCCTTCCTAACCAATGACCTTGTGCTTGACCGAGAGACGGCCAACTTAGTTGCAGACATCACCTCCTTAGCCAAATATCAACAAGTTCTTAAAACATTACAGGGAACCTGTCCCCGAGAGGAGGGAAAGGAATTATTCCACCCTGGTGACATGGTATTAGTCAAGTCCCTTCCCTCTAATTCCCCATCCCTAGATACATCCTGGGAAGGACCCTACCCAGTCATTTTATCTACCCCAACCGCGGTTAAAGTGGCTGGAGTGGAGTCTTGGATACATCACACTCGAGTCAAACCCTGGATACTGCCAAAGGAACCCGAAAATCCAGGAGACAACGCTAGCTATTCCTGTGAACCTCTAGAGGATCTGCGCCTGCTCTTCAAGCGACAACCGTGAGGAAAGTAACTAGAATCGTAGATCCCCATGGCCCTCCCTTGTCATATTTTTCTCTTTACTGTTCTCTTACCCCCTTTCACTCTCACTGCACCCCCTCCATGCCGCTGTACTACCAGTAGCTCCCCTTACCAAGAGCTTCTATGGAGAATGCGGCTTCCCGGAAATATTGATGCCCCATCGTATAGGAGTTTTTCTAAAGGAAACCCCACTTTCACCGCCCACACCCATATGCCCCGCAACTGCTATAACTCTGCCACTCTTTGCATGCATGCAAATACTCATTATTGGACAGGGAAAATGATTAATCCTAGTTGTCCTGGAGGACTTGGAGCCACTGTCTGTTGGACTTACTTCACCCATACCGGTATGTCTGATGGGGGTGGAGTTCAAGATCAGGCAAGAGAAAAACACGTAAAGGAAGTAATCTCCCAACTGACCCGGGTACATAGCACCCCTAGCCCCTACAAAGGACTAGATCTCTCAAAACTACATGAAACCCTCCGTACCCATACTCGCCTGGTAAGCCTATTTAATACCACCCTCACTGGGCTCCATGAGGTCTCGGCCCAAAACCCTACTAACTGTTGGATGTGCCTCCCCCTGCACTNCAGGCCATACATTTCAATCCCTGTATCTGAACAATGGAACAACTTCAGCACAGAAATAAACACCACTTCCGTTTTAGTAGGACCTCTTGTTTCCAATCTGGAAATAACCCATACCTCAAACCTCACCTGTGTAAAATTTAGCAATACTATAGACACAACCAACTCCCAATGCATCAGGTGGGTAACTCCTCCCACACGAATAGTCTGCCTACCCTCAGGAATATTTTTTGTCTGTGGTACCTCAGCCTATCGTTGTTTGAATGGCTCTTCAGAATCTATGTGCTTCCTCTCATTCTTAGTGCCCCCTATGACCATCTACACTGAACAAGATTTATACAATCATGTCGTACCTAAGCCCCGCAACAAAAGAGTACCCATTCTTCCTTTTGTTATCGGAGCAGGAGTGCTAGGCGGACTAGGTACTGGCATTGGCGGTATCACAACCTCTACTCAGTTCTACTACAAACTATCTCAAGAACTAAATGGTGACATGGAACGGGTCGCCGACTCCCTGGTCACCTTGCAAGATCAACTTAACTCCCTAGCAGCAGTAGTCCTTCAAAATCGAAGAGCTTTAGACTTGCTAACCGCCGAAAGAGGGGGAACCTGTTTATTTTTAGGGGAAGAATGCTGTTATTATGTTAATCAATCCGGAATCGTCACCGAGAAAGTTAAAGAAATTCGAGATCGAATACAACGTAGAGCAGAGGAGCTTCAAAACACCGGACCCTGGGGCCTCCTCAGCCAATGGATGCCCTGGATTCTCCCCTTCTTAGGACCTCTAGCAGCTATAATATTGTTACTCCTCTTTGGACCCTGTATCTTTAACCTCCTTGTTAAGTTTGTCTCTTCCAGAATCGAAGCTGTAAAACTACAAATCGTTCTTCAAATGGAGCCCCAGATGCAGTCCATGACTAAGATCTACCGCGGACCCCTGGACCGGCCTGCTAGCCCATGCTCCGATGTTAATGACATCGAAGGCACCCCTCCCGAGGAAATCTCAACTGCACGACCCCTACTACGCCCCAATTCAGCAGGAAGCAGTTAGAGCGGTCGTCGGCCAACCTCCCCAACAGCACTTGGGTTTTCCTGTTGAGAGGGGGATC
|
TF motifs of the concenus sequence
Use FIMO to detect transcription factor motifs in the concenus sequence of the TE family.
TE_family | TFBS | Start | End | Strand | Score | Matched sequence |
---|---|---|---|---|---|---|
HERV17 | DAL81 | 5457 | 5475 | - | -39.42 | ACAAGTTGAGGTCGGGATC |
HERV17 | BPC5 | 1045 | 1074 | + | -41.20 | AGAGAGATAGAAGTAGTAAAGAAAAAACAG |
HERV17 | BPC5 | 1041 | 1070 | + | -41.87 | AAAGAGAGAGATAGAAGTAGTAAAGAAAAA |
HERV17 | BPC5 | 1043 | 1072 | + | -43.07 | AGAGAGAGATAGAAGTAGTAAAGAAAAAAC |
HERV17 | BPC5 | 7189 | 7218 | - | -43.10 | GGGTGCAGTGAGAGTGAAAGGGGGTAAGAG |
HERV17 | BPC5 | 2560 | 2589 | + | -43.22 | GGAGATAGACAAAGGGGTAAACAATGAACC |
HERV17 | BPC5 | 7183 | 7212 | - | -43.96 | AGTGAGAGTGAAAGGGGGTAAGAGAACAGT |
HERV17 | BPC5 | 7153 | 7182 | - | -47.99 | AAAGAGAAAAATATGACAAGGGAGGGCCAT |
HERV17 | DAL81 | 260 | 278 | + | -52.00 | GGAAGCCGAGGGCCGACTA |
TFBS enrichment in GRCh38
Use Fisher's exact test to perform enrichment analysis of transcription factor binding sites in the TE family of GRCh38.