HERV17


DF ID: DF0000628
TE superfamily: ERV1
TE class: LTR
Species: Catarrhini
Length: 8839 bp
Kimura value: 4.20
Tau index: 0.9778
Description: Internal sequence of the class I endogenous retrovirus
Comment: Internal sequence of HERV17 (HERV-W) flanked by LTR17 long terminal repeats. Update on existing entry. Most copies in the seed alignment are processed pseudogenes of ERV transcripts.
Sequence:
TTTGGTGGCCCACGAAGGGACTCTCCAAAGCGGTGAGTAATATTGGACCACTTTCGCTTGCTATTCTGTCCTATCCTTCCTTAGAATTGGAGGAAAATACCGGGCACCTGTCGGCCGGTTAAAAACGATTAGCGTGGCCGCCGGACTTAAGACTCAGGTGTGAGGCTNTCTGGGGAAGGGCTTTCTAACAACCCCCAACCCTTCTGGGTTGGGAGCGTTGGTCTGCCTGGAACCAGCTTCCGCTTTCAATTTTCCTGGGGGAAGCCGAGGGCCGACTAGAGGCAGAAAGCTGTCGTCCCGAACTCCCGGCATTAGCCGGTTGAGATCATGGCGCAGCCAGAAGTCTCTACTCAACAGTCGCCCATGCGTGCGCCCCTACCTTTCCTTCTGACCCATACCTCCTGGGTCCCGACCACGACTTTCTTGAAAGTGTAGCCCCAAAATTCTCCTTACCTCTGAATCTACTTCCTCCGATCCCTGCCTCCTAGGTACTAATGGTTCAGACTTTCATTTCCTCTCCCAAGTATTAGAGCAAGTTGTATCTCCAAAGGGATCTAAGGAAGCTCTACGCTGCGTCCTTAGGCACCTAGGCTATGAACCCAGGGAGTCTTGTCCCTGGTGTCCCTCCCGATTTAGGTATACAGCTCTCGACATGGGCAGTTATGTGGGACCCGTTCCCCACCACCCTTGCCAGGGCCCCAAGTTTGTAAATGGCTAAGAGGATTGCTCTCCCATTGTGTAAGATGCTCTCCTCCCCCAATTTCTACCCAGCTTACCCCTCTGCAATACAATCTCCAAGCCTTGGCTCCTTGGCCAGGGCCTTAGAACTGATGACCCAGTACTTTAACAACTGGAACTGGGTCTACGACAACATAATAGATCAGGATGAAAGCGAATTGAGTAAATTAAAGGGAGGCGCATATTCCTATAGTGGCAAATGGGGGCAACGAGCGAACGTCCTTCCGCTGTGTTTCCAAAATCCATCTACAGAGACAGAGAGGAGAGAGAGAGAGAGAGAGAAGAGAGAGAGAGAAAGAGAGAAAGAGAGAGATAGAAGTAGTAAAGAAAAAACAGTGTGCCCTATTCCTTTAAAAGCCAGGGTAAATTTAAAACCTATAATTGATAATTGAAGGTCTTCTCCGTGACCCTATAACACTCCAATACTACCTTGTTGTCAGTGTAAACAAGGGCGTAGCCCGAAAGCACTGAGACCACTGACAACCCGTAGCCTTCCTATCAAAAATCCTTAACCCAGTAACCCGCGGATGGCCCAAATGCATTCAATCTGTAGCGGCAACTGCTTTGCTAACAGAAGAAAGTAGAAAAGTAACTTTTAGAGGAAACCTCATTGTGAGCACACCTCACCAGTTCAGAACTATCCTAAGTCAAAAAAGCAAAAAGGTAGCTTACTAACTCAAAAATCTTAAAGTATGGGGCTATTCTGTTAGAAAAAGGTGATTTAACACTAACCACTGAAAATTCCCTTAACCCAGCAGATTTCCTAACAGGGGATTTAAATCTTAATTACCATACAAAGGTCCGACCAGACCTAGGAGGAACTCCCTTCAGGACAGGACGATAGATGGTTCCTCCCAGGTGATTGAGGAAAAAACCACAATGGGTATTCAGTAATTGATAGGGAGACTCTTGTGGAAGCAGAGTTAGGAAAATTGCCTAATAATTGGTCTGCTCAAACGTGCGAGCTGTTTGCACTCAGCCAAGCCTTAAAGTACTTACAGAATCAAAAAACTCTATCTCAATCCTGACTCAAAAGGTTACCTACACCCTCTCTGAAACGAATTTGCATAAGAACTGTTGTTTATGGGAATGCATCTTGATGGGGCAGCTGGGTTGTTATGAAATACTCAGGAACCCAGCCCAGCTCTAGGACTCACCCCTGAGCGCAAAGGCAATGTTGGGCACGCTGGTAAAGGACCACTAGAATCCAGCAGCCCGGACCCCTTTCTTTGTGGTCAAGAAAGGCGGGAAAACGGGTGCAG
GACTGCTACATCGGTGAGCGTAACTAATCCGATAAGCAGAGGTCCATGGGTGGTTACGCACCCTGGAAAGGAATAAGCATTAGGACCATAGAGGACGCTCTAGGACTAATGCTCATCGGAAAATGACTAGGGGTGCTGGCATCCCTATGTTCTTTTTTCAGATGGGAAACGTTCCCCCCAAGGCAAAAACGCCCCTAAGATGTATTCTGGAGAATTNGGNCCAGTTTGACCCTCAGACGCTAAGAAAGAAACGACTTATATTCTTCTGCAGTACCGCCTGGCCACGATATCCTCTTCAAGGGGGAGAAACCTGGCCTCCTGAGGGAAGTATAAATTATAACACCATCTTACAGCTAGACCTCTTTTGTAGAAAAGAAGGCAAATGGAGTGAAGTGCCATATGTACAAACTTTCTTTTCATTAAGAGACAACTCGCAATTATGTAAAAAGTGTGATTTATGCCCTACAGGAAGCCCTCAGAGTCTACCTCCCTACCCCGGCGTCCCCCCGACTCCTTCCCCAACTAATAAGGACCCCCCTTCAACCCAAACGGTCCAAAAGGAGATAGACAAAGGGGTAAACAATGAACCAAAGAGTGCCAATATTCCCCGATTATGCCCCCTCCAAGCGGTGGGAGGAGGAGAATTCGGCCCAGCCAGAGTGCATGTACCTTTTTCCCTCTCAGACTTGAAGCAAATTAAAATAGACCTAGGTAAATTCTCAGATAACCCTGATGGCTATATTGATGTTTTACAAGGGTTAGGACAATCCTTTGATCTGACATGGAGAGATATAATGTTACTGCTAGATCAGACACTAACCCCAAATGAGAGAAGTGCCGCCATAACTGCAGCCCGAGAGTTTGGCGATCTCTGGTATCTCAGTCAGGTCAATGATAGGATGACAACAGAGGAAAGAGAACGATTCCCCACAGGCCAGCAGGCAGTTCCCAGTGTAGACCCTCACTGGGACGCAGAATCAGAACATGGAGATTGGTGCCGCAGACATTTGCTAACTTGCGTGCTAGAAGGACTAAGGAAAACTAGGAAGAAGCCTATGAATTATTCAATGATGTCCACTATAACACAGGGAAAGGAAGAAAATCCTACTGCCTTTCTGGAGAGACTAAGGGAGGCATTGAGGAAGCATACCTCTCTGTCACCTGACTCTATTGAAGGCCAACTAATCTTAAAGGATAAGTTTATCACTCAGTCAGCTGCAGACATTAGAAAAAAACTTCAAAAGTCCGCCTTAGGCCCGGAGCAAAACTTAGAAACCCTATTGAACTTGGCAACCTCGGTTTTTTATAATAGAGATCAGGAGGAGCAGGCGGAACGGGACAAACGGGATAAGAAAAAAAAAGGCCACCGCTTTAGTCATGGCCCTCAGGCAAGCGGACTTTGGAGGCTCTGGAAAAGGGAAAGGCTGGGCAAATCGAATGCCTAATAGGGCTTGCTTCCAGTGCGGTCTACAAGGACACTTTAAAAAAGATTGTCCGAATAGAAATAAGCCGCCCCCTCGTCCATGCCCCTTATGTCAAGGGAATCACTGGAAGGCCCACTGCCCCAGGGGACGAAGGTCCTCTGAGTCAGAAGCCACTAACCAGATGATCCAGCAGCAGGACTGAGGGTGCCCGGGGCAAGCGCCAGCCCATGCCATCACCCTCACAGAGCCCCGGGTATGCTTGACCATTGAGGGCCAGGAGGTTAACTGTCTCCTGGACACTGGCGCGGCCTTCTCAGTCTTACTCTCCTGTCCCGGACAACTGTCCTCCAGATCTGTCACTATCCGAGGGGTCCTAGGACAGCCAGTCACTAGATACTTCTCCCAGCCACTAAGTTGTGACTGGGGAACTTTACTCTTTTCACATGCCTTTCTAATTATGCCTGAAAGCCCCACTCCCTTGTTAGGGAGAGACATTCTAGCAAAAGCAGGGGCCATTATACACCTGAACATAGGAGAAGGAACACCCGTTTGTTGTCCCCTGCTTGAGGAAGGAAT
TAATCCTGAAGTCTGGGCAACAGAAGGACAATATGGACGAGCAAAGAATGCCCGTCCTGTTCAAGTTAAACTAAAGGATTCCGCCTCCTTTCCCTACCAAAGGCAGTACCCCCTTAGACCCGAGGCCCAACAAGGACTCCAAAAGATTGTTAAGGACCTAAAAGCCCAAGGCCTAGTAAAACCATGCAATAGCCCCTGCAATACTCCAATTTTAGGAGTACAGAAACCCAACGGACAGTGGAGGTTAGTGCAAGATCTCAGGATTATCAATGAGGCCGTTGTCCCTCTATACCCAGCTGTACCTAACCCTTATACTCTGCTTTCCCAAATACCAGAGGAAGCAGAGTGGTTTACAGTCCTGGACCTTAAGGATGCCTTTTTCTGCATCCCTGTACATCCTGACTCTCAATTCTTGTTTGCCTTTGAAGATCCTTCGAACCCAACGTCTCAACTCACCTGGACTGTTTTACCCCAAGGGTTCAGGGATAGCCCCCATCTATTTGGCCAGGCATTAGCCCAAGACTTGAGCCAGTTCTCATACCTGGACACTCTTGTCCTTCGGTACGTGGATGATTTACTTTTAGCCGCCCGTTCAGAAACCTTGTGCCATCAAGCCACCCAAGCGCTCTTAAATTTCCTCGCCACCTGTGGCTACAAGGTTTCCAAACCAAAGGCTCAGCTCTGCTCACAGCAGGTTAAATACTTAGGGCTAAAATTATCCAAAGGCACCAGGGCCCTCAGTGAGGAACGTATCCAGCCTATACTGGCTTATCCTCATCCCAAAACCCTAAAGCAACTAAGAGGGTTCCTTGGCATAACAGGCTTCTGCCGAATATGGATTCCCAGGTACGGCGAAATAGCCAGGCCATTATATACACTAATTAAGGAAACTCAGAAAGCCAATACCCATTTAGTAAGATGGACACCTGAAGCAGAAGCGGCTTTCCAGGCCCTAAAGAAGGCCCTAACCCAAGCCCCAGTGTTAAGCTTGCCAACGGGGCAAGACTTTTCTTTATATGTCACAGAAAAAACAGGAATAGCTCTAGGAGTCCTTACACAGGTCCGAGGGACGAGCTTGCAACCCGTGGCATACCTGAGTAAGGAAATTGATGTAGTGGCAAAGGGTTGGCCTCATTGTTTACGGGTAGTGGCGGCAGTAGCAGTCTTAGTATCTGAAGCAGTTAAAATAATACAGGGAAGAGATCTTACTGTGTGGACATCTCATGATGTGAACGGCATACTCACTGCTAAAGGAGACTTGTGGCTGTCAGACAACCGTTTACTTAAATATCAGGCTCTATTACTTGAAGGGCCAGTGCTGCGACTGCGCACTTGTGCAACTCTTAACCCAGCCACATTTCTTCCAGACAATGAAGAAAAGATAGAACATAACTGTCAACAAGTAATTGCTCAAACCTACGCCGCTCGAGGGGACCTTCTAGAGGTTCCCTTGACTGATCCCGACCTCAACTTGTATACTGATGGAAGTTCCTTTGTAGAAAAAGGACTTCGAAAAGCGGGGTATGCAGTGGTCAGTGATAATGGAATACTTGAAAGTAATCCCCTCACTCCAGGAACTAGCGCTCAGCTGGCAGAACTAATAGCCCTCACTCGGGCACTAGAATTAGGAGAAGGAAAAAGGGTAAATATATATACAGACTCTAAGTATGCTTACCTAGTCCTCCATGCCCACGCAGCAATATGGAGAGAAAGGGAATTCCTAACTTCCGAGGGAACACCTATCAAACATCAGGAAGCCATTAGGAGATTATTATTGGCTGTACAGAAACCTAAAGAGGTGGCAGTCTTACACTGCCGGGGTCATCAGAAAGGAAAGGAAAGGGAAATAGAAGGGAACCGCCAAGCGGATATTGAAGCCAAAAGAGCCGCAAGGCGGGACCCTCCATTAGAAATGCTTATAGAAGGACCCCTAGTATGGGGTAATCCCCTCCGGGAAACCAAGCCCCAGTACTCAGCAGAAGAAATAGAATGGGGAACCT
CACGAGGACATAGTTTCCTCCCCTCAGGATGGCTAGCCACCGAAGAAGGAAAAATACTTTTGCCTGCAGCTAACCAATGGAAATTACTTAAAACCCTTCACCAAACCTTTCACTTAGGCATTGATAGCACCCATCAGATGGCCAAATCATTATTTACTGGACCAGGCCTTTTCAAAACTATCAAGCAGATAGTCAGGGCCTGTGAAGTGTGCCAAAGAAATAATCCCCTGCACTTATCGCCAAGCTCCTTCAGGAGAACAAAGAACAGGCCATTACCCAGGAGAAGACTGGCAACTAGATTTTACCCACATGCCCAAATCTCAGGGATTTCAGTATCTACTAGTCTGGGTAGATACTTTCACTGGTTGGGCGGAGGCCTTCCCTTGTAGGACAGAAAAGGCCCAAGAGGTAATAAAGGCACTAGTTCATGAAATAATTCCCAGATTCGGACTTCCCCGAGGCTTACAGAGTGACAATGGCCCCGCTTTCAAGGCTGCAGTAACCCAGGGAGTATCCCAGGCGTTAGGCATACAATATCACTTACACTGCGCCTGGAGGCCACAATCCTCAGGAAAAGTCGAGAAAATGAACGAAACACTCAAACGACATCTAAAAAAGCTAACCCAGGAAACCCACCTCGCATGGCCTGCTCTGTTGCCTATAGCCTTACTAAGAATCCGAAACTCTCCCCAAAAAGCGGGACTTAGCCCATACGAAATGCTGTATGGACGGCCCTTCCTAACCAATGACCTTGTGCTTGACCGAGAGACGGCCAACTTAGTTGCAGACATCACCTCCTTAGCCAAATATCAACAAGTTCTTAAAACATTACAGGGAACCTGTCCCCGAGAGGAGGGAAAGGAATTATTCCACCCTGGTGACATGGTATTAGTCAAGTCCCTTCCCTCTAATTCCCCATCCCTAGATACATCCTGGGAAGGACCCTACCCAGTCATTTTATCTACCCCAACCGCGGTTAAAGTGGCTGGAGTGGAGTCTTGGATACATCACACTCGAGTCAAACCCTGGATACTGCCAAAGGAACCCGAAAATCCAGGAGACAACGCTAGCTATTCCTGTGAACCTCTAGAGGATCTGCGCCTGCTCTTCAAGCGACAACCGTGAGGAAAGTAACTAGAATCGTAGATCCCCATGGCCCTCCCTTGTCATATTTTTCTCTTTACTGTTCTCTTACCCCCTTTCACTCTCACTGCACCCCCTCCATGCCGCTGTACTACCAGTAGCTCCCCTTACCAAGAGCTTCTATGGAGAATGCGGCTTCCCGGAAATATTGATGCCCCATCGTATAGGAGTTTTTCTAAAGGAAACCCCACTTTCACCGCCCACACCCATATGCCCCGCAACTGCTATAACTCTGCCACTCTTTGCATGCATGCAAATACTCATTATTGGACAGGGAAAATGATTAATCCTAGTTGTCCTGGAGGACTTGGAGCCACTGTCTGTTGGACTTACTTCACCCATACCGGTATGTCTGATGGGGGTGGAGTTCAAGATCAGGCAAGAGAAAAACACGTAAAGGAAGTAATCTCCCAACTGACCCGGGTACATAGCACCCCTAGCCCCTACAAAGGACTAGATCTCTCAAAACTACATGAAACCCTCCGTACCCATACTCGCCTGGTAAGCCTATTTAATACCACCCTCACTGGGCTCCATGAGGTCTCGGCCCAAAACCCTACTAACTGTTGGATGTGCCTCCCCCTGCACTNCAGGCCATACATTTCAATCCCTGTATCTGAACAATGGAACAACTTCAGCACAGAAATAAACACCACTTCCGTTTTAGTAGGACCTCTTGTTTCCAATCTGGAAATAACCCATACCTCAAACCTCACCTGTGTAAAATTTAGCAATACTATAGACACAACCAACTCCCAATGCATCAGGTGGGTAACTCCTCCCACACGAATAGTCTGCCTACCCTCAGGAATATTTTTTGTCTGTGGTACCTCAGCCTATCGTTGTTTGAATGGCTC
TTCAGAATCTATGTGCTTCCTCTCATTCTTAGTGCCCCCTATGACCATCTACACTGAACAAGATTTATACAATCATGTCGTACCTAAGCCCCGCAACAAAAGAGTACCCATTCTTCCTTTTGTTATCGGAGCAGGAGTGCTAGGCGGACTAGGTACTGGCATTGGCGGTATCACAACCTCTACTCAGTTCTACTACAAACTATCTCAAGAACTAAATGGTGACATGGAACGGGTCGCCGACTCCCTGGTCACCTTGCAAGATCAACTTAACTCCCTAGCAGCAGTAGTCCTTCAAAATCGAAGAGCTTTAGACTTGCTAACCGCCGAAAGAGGGGGAACCTGTTTATTTTTAGGGGAAGAATGCTGTTATTATGTTAATCAATCCGGAATCGTCACCGAGAAAGTTAAAGAAATTCGAGATCGAATACAACGTAGAGCAGAGGAGCTTCAAAACACCGGACCCTGGGGCCTCCTCAGCCAATGGATGCCCTGGATTCTCCCCTTCTTAGGACCTCTAGCAGCTATAATATTGTTACTCCTCTTTGGACCCTGTATCTTTAACCTCCTTGTTAAGTTTGTCTCTTCCAGAATCGAAGCTGTAAAACTACAAATCGTTCTTCAAATGGAGCCCCAGATGCAGTCCATGACTAAGATCTACCGCGGACCCCTGGACCGGCCTGCTAGCCCATGCTCCGATGTTAATGACATCGAAGGCACCCCTCCCGAGGAAATCTCAACTGCACGACCCCTACTACGCCCCAATTCAGCAGGAAGCAGTTAGAGCGGTCGTCGGCCAACCTCCCCAACAGCACTTGGGTTTTCCTGTTGAGAGGGGGATC
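
The Tau index listed in the basic information above is a tissue-specificity score that ranges from 0 (uniform expression across tissues) to 1 (expression restricted to a single tissue). The sketch below shows the commonly used definition, Tau = sum(1 - x_i / x_max) / (n - 1). This page does not state how expression values are normalized before Tau is computed, so the function name `tau_index` and the example inputs are illustrative assumptions, not the database's pipeline.

```python
def tau_index(expression):
    """Tissue-specificity index Tau for one feature across n tissues.

    `expression` holds one non-negative expression value per tissue.
    Any normalization (e.g. log-transform) is assumed to happen upstream.
    """
    values = [max(float(v), 0.0) for v in expression]
    if len(values) < 2:
        raise ValueError("Tau needs at least two tissues")
    peak = max(values)
    if peak == 0:
        return 0.0  # not expressed anywhere; returning 0 is a convention assumed here
    return sum(1 - v / peak for v in values) / (len(values) - 1)


# Hypothetical profiles: one tissue-restricted, one uniform.
print(tau_index([10, 0, 0, 0]))  # prints 1.0
print(tau_index([5, 5, 5]))      # prints 0.0
```

A value close to 1, such as the 0.9778 reported for HERV17, therefore indicates expression concentrated in very few tissues.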



TF motifs of the consensus sequence

Transcription factor motifs detected with FIMO in the consensus sequence of the TE family.

TE_family TFBS Start End Strand Score Matched_sequence
HERV17 BPC5 1025 1054 + 33.13 AGAGAGAGAAAGAGAGAAAGAGAGAGATAG
HERV17 BPC6 1020 1040 - 32.28 CTCTCTTTCTCTCTCTCTCTT
HERV17 BPC6 1022 1042 - 31.37 TTCTCTCTTTCTCTCTCTCTC
HERV17 BPC1 1001 1024 + 30.86 GAGAGAGAGAGAGAGAGAGAAGAG
HERV17 BPC6 1028 1048 - 30.82 CTCTCTTTCTCTCTTTCTCTC
HERV17 BPC1 999 1022 + 30.58 AGGAGAGAGAGAGAGAGAGAGAAG
HERV17 BPC6 1024 1044 - 29.63 CTTTCTCTCTTTCTCTCTCTC
HERV17 BPC5 1027 1056 + 29.62 AGAGAGAAAGAGAGAAAGAGAGAGATAGAA
HERV17 BPC6 999 1019 - 29.03 CTCTCTCTCTCTCTCTCTCCT
HERV17 BPC6 1032 1052 - 28.37 ATCTCTCTCTTTCTCTCTTTC
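
The hits in the table above overlap heavily. As a minimal sketch, the rows can be parsed and overlapping hits collapsed into regions. It assumes whitespace-separated columns in the order shown and 1-based inclusive coordinates; the names `Hit`, `parse_hit` and `merge_overlaps` are illustrative and not part of FIMO.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    family: str
    motif: str
    start: int
    end: int
    strand: str
    score: float
    matched: str


def parse_hit(line):
    """Parse one whitespace-separated row of the hit table."""
    family, motif, start, end, strand, score, matched = line.split()
    return Hit(family, motif, int(start), int(end), strand, float(score), matched)


def merge_overlaps(hits):
    """Collapse overlapping hits (both strands) into (start, end) regions."""
    regions = []
    for hit in sorted(hits, key=lambda h: h.start):
        if regions and hit.start <= regions[-1][1]:
            regions[-1][1] = max(regions[-1][1], hit.end)
        else:
            regions.append([hit.start, hit.end])
    return [tuple(r) for r in regions]


rows = """\
HERV17 BPC5 1025 1054 + 33.13 AGAGAGAGAAAGAGAGAAAGAGAGAGATAG
HERV17 BPC6 1020 1040 - 32.28 CTCTCTTTCTCTCTCTCTCTT
HERV17 BPC6 1022 1042 - 31.37 TTCTCTCTTTCTCTCTCTCTC
HERV17 BPC1 1001 1024 + 30.86 GAGAGAGAGAGAGAGAGAGAAGAG
HERV17 BPC6 1028 1048 - 30.82 CTCTCTTTCTCTCTTTCTCTC
HERV17 BPC1 999 1022 + 30.58 AGGAGAGAGAGAGAGAGAGAGAAG
HERV17 BPC6 1024 1044 - 29.63 CTTTCTCTCTTTCTCTCTCTC
HERV17 BPC5 1027 1056 + 29.62 AGAGAGAAAGAGAGAAAGAGAGAGATAGAA
HERV17 BPC6 999 1019 - 29.03 CTCTCTCTCTCTCTCTCTCCT
HERV17 BPC6 1032 1052 - 28.37 ATCTCTCTCTTTCTCTCTTTC
"""

hits = [parse_hit(row) for row in rows.splitlines()]
print(merge_overlaps(hits))  # prints [(999, 1056)]
```

All ten listed hits collapse into a single region, positions 999 to 1056, which corresponds to the GA-rich repeat in the consensus sequence.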


TFBS enrichment in GRCh38

Enrichment of transcription factor binding sites within copies of this TE family in GRCh38, assessed with Fisher's exact test.
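
For illustration, a one-sided Fisher's exact test on a 2x2 table can be sketched with the standard library alone. The table layout (binding sites inside or outside the TE family versus background sites) and the function name `fisher_exact_greater` are assumptions made for this example; the page does not specify how the contingency table or the background is defined.

```python
from math import comb


def fisher_exact_greater(a, b, c, d):
    """One-sided Fisher's exact test P-value for enrichment (odds ratio > 1).

    Assumed 2x2 layout (hypothetical, for illustration):
                          in TE family   outside TE family
        TF binding sites       a                b
        background sites       c                d
    """
    row1 = a + b   # all TF binding sites
    col1 = a + c   # all sites inside the TE family
    n = a + b + c + d
    upper = min(row1, col1)
    # Hypergeometric upper tail: probability of observing >= a sites in the family.
    tail = sum(comb(row1, k) * comb(n - row1, col1 - k) for k in range(a, upper + 1))
    return tail / comb(n, col1)


# Toy counts, not taken from this database.
print(round(fisher_exact_greater(3, 1, 1, 3), 4))  # prints 0.2429
```

Exact binomial coefficients keep the sketch dependency-free but become slow for genome-scale counts, where a statistics library would be the practical choice.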




GTEx

Promoter activity across 46 body sites from the Genotype-Tissue Expression (GTEx) project.




TCGA

Promoter activity across 33 cancer types from The Cancer Genome Atlas (TCGA).