HERVS71


DF ID: DF0000205
TE superfamily: ERV1
TE class: LTR
Species: Catarrhini
Length: 8978
Kimura value: 7.99
Tau index: 0.9113
Description: Internal region of ERV1 endogenous retrovirus, HERVS71 subfamily
Comment: The associated long terminal repeats are LTR6A and LTR6B.
Sequence:
TAATGGAGGCCCCAGCGAGANATTAACGCCACCGGGCGAGAGCCGGGCTCGCTCCGGGCTCCCCCGGAAGGACGGCCGGCTTGTAGGGGGGGCGCCACCTGAAAAAANAATTTTCAGGNTCCCCGAAAGGTGACCGTCTTCCGGAGGAGAGCGGATCGACTACCGTGTGGGTGCCCATAAAATTCCACCTCTGAGTCCTCAGCTTCTGACCCCGGGGTCAGGTAGGTCAGATTTGACTTCGGTTCTGGTAAGAGGGAAGCGGCCCTGACGAGGGCGTCCCTCTTTTGACTCTGCCCGTTTCTCTAGGACGCTAGAGGGTNGAGCCCTGGTTTTCTGNTAGGCGCCTCTGTGTCTCTGTCTAGGAGGGAAGTGGCCCTGACAGGGGCCCTCCCTTGACTCAGTCCACGTCCCAGGATGCTGGAGGACTGAGTCCTGGTTTCTGGCAGACCGGNNNNTCNNTCTCTCTCTCTCTCTCTCTTTTTCTATCTCTCATCTTTCTCTTGTTCAAGTTTCTTGGAAATCTCCGGGAAAGAAAANNNNNNNNNNAAAAAAAAACTGTTATAAACTCTGTGTGAATGGTGAGTGAATGAGGGAGGACAAGGGCTTGCGCTTGTCCTCCAGTTTGTAGCTCCACGGCGAAAGCTACGGAGTTCGAGTGGGCCCTCACCTGCGGTTCCGTGGCGACCTCATAAGGCTTAAGGCAGCATCGGGCATAGCTCGATCCGAGCCGGGGGTTTATACCGGCCTGCCAATGCTAAGAGGAGCCCAAGTCCCCTCAGGGGGAGCGGCCAGGCGGGCATCTGACTGATCCCATCACGGGANCCCCTCCCCTTGTCTGTCTAAAAAAAAAAANAAAAAAGGAAAAACTGTCATAACTGTTTACATGCCCTAAAGTCAATTGTTTGTTTTATGTTGATTGTTCTGTTCAGTGTCTATTGTCTTGTTTAGTAGTTGTCAAAGTTTTGCATGTCAAGACGTCGATATTGCCCAAGACGTCTAGGTAAAAACTTCTTCAAGGTCCTTAGTGCTGATTTTTTGTCACAGGAGGTTAAATTTCTCATCAATCATTTAGGCTGGCCACCACAGTCCTGTCTTTTCTGCCAGAAGCAAGTCAAGTGTTGTTACGAGAACGAGTGTGAAAAACATTCGCCTGATTAAGATTTCTGGCACCATGAAAGTTGTAAGTATTTAGATCGTCATACCCCACGTCCAAGTGATTAGACCTCCTCTAAACTAAACCGGTAGTGGGTTCAAAACAGCCACCCTGCAGATTTCCTTGCTCACCTCTTTTGTCATTCTGTAACTTTTCCTGTGCCCTTAAATAGAACACTGTGTAAGGAAACGTACGCCCGTACTGCTTTACTTCGTTTAGATTCTTACTCTGTTCCTCTGTGGCTACTCTCCCATCTTAAAAATGATCCGAGTAGTCCTTTTCCNCCTCGTCCCTGCCCCCTACCCCGCACATCTCGTTTTCCGGTGCGACAGCAAGTTCAGCGTCTCCAGGACTTGGCTCTGCTCTCACTCCTTAAACCCTTAAAAGAAAAAGCTAAGTTTAAGCTATTTGCCTTTAAGTCATAGAGACACCAAAAGTATTTAAGGTGCAGATCTAGAAGAAGAAGAAGANNGAGAACGCCTAGATCAAACTGACCCAGAAGATCTCAGGCTGGCCCCTAGTCCTCCTCCCTCAATCTTAAAGCTACAGCAATGTGGCAAGTAGTATTAGCTGTTGTAGTTTTTCTGCTNCTTTCTGGTCATGTTGATTCTGTTCTTTCGATACTCCAGCCCCCCAAGGAATGAGTTTCTCTGTCCGTGCTAGGTTTAATATCTATGCTCAANATCTTATTAAATTGCCTTCAAANAANAAAAANAANNNNAAAACGGGAAACACTTCCTCCCAGCCTTGTAAAGGTTAGAGCCCTCTCCAATGTATGCTGCAGAATTTTTCTCTCGGTTTCTCAGAGGATTATAAAGTCCGCCTTAAAAAAGGCAAGCTCCGGACA
CTCTGCGAAATAGAATGGCCAAAGTTTAGAGTCGAGTGGCCCCCTGAAGGGTCATTGAACCTCACAATTGTTCAAGCTGTGTGGCGGGTTGTTACTGAAACTCCCAGCCACCCTGATCAGTTTCCCTACATTGATCAATGGCTAAGTTTGGTCAGGAGCCCCCCTCCATGGCTCCGTTCATGCGCCATTCATAATTCTACCTCCAAGGTCCTCCTGAGCCAGACCGCGTTTTCGCCTCGACCCTCAGCCGGTTCGGCTCCCCCTGTACTGCCTCCCTCTGAAGAAGAGGAGAGTCTCCCTCACCCAGTCCCACCGCCTTACAACCAGCCTGCTCCCTTAAAGTTATCCCATGTCTCCTCGACGACGTCCCCTGTAGGCTCGCCACCCATTGCCTCTCGATCGCGACCGCGGCGGGAGGAAGTAGCCCCTCTACTACCACTGAGAGAGGCACAAGTCCCTCCGGGTGACGAGCGCTCAGCCCCCTTCTTAGTTTATGTCCCTTTTTCTACTTCTGACTTGTATAATTGGAAAACCCATAATCCTCCCTTCTCTGAAAAGCCCCAGGCTTTGACCTCTCTGACGGAGTCCGTACTCCGGACTCACCCGCCCACCTAGGATGATTGCCAACAGCTCCTTTTAACCCTTTTCACCTCTGAAGAGAAGGAACGTATCCGAAGAGAGGCCAAAAAGTACTTCCTCGCATCAGCCAATGGACCGGAGGAGGAAGCTAGAGACCTCCTTGAGGAGGTCTTTCCCTCTACCCGGCCTAACCGGGACCCAAATTCCTCAAGTGGAAGGAGAGCTTTAGACGATTTTCACCGGTATCTCCTCGCGGGTATTAAAGGAGCCGCTCGGAAACCCATAAACTTGTCTAAGACGACCGAAGTTGTCCAGGGGCCCGATGAGTCACCAGGAGCGTTTTTAGAGCGCCTCCAGGAGGCTTATCGGATTTACACCCCTTTTGACCCGGCGGCTCCCGAAAATAGCCGTGCTCTTAATTTGGCATTTGTGGCTCAGGCAGCCCCGGATATTAAAAGGAAACTCCAAAAACTGGAAGGATTTGCTAGAATGAATATCAGTCAGCTTTTAGAAATAGCCCAAAAAGTTTTTGACAATCGAGAGTTTGAAAAACAAAAACAAGCAACACAGGCAGCTGAAAAGGCCGCTGATAAAGCATTCAAAAGACAAACAAAAATCTTAGTGGCGGCTATCCAAGAGGACAGAATGAAATGGCCCCCATTCCAGAAGAATGGCCAAGGAACCTCGGGTTCCCACCAGAAAAGTAAAAGAGGTGAACAGGCCCCTCTAGGAAAAACCAATGTGCCTATTGCAAGCAGACTGGGCACTGGAAAAAGGAGTGCCCACTACTGCCANAAGAAAAGTCAGAAAACAAAAAGGTCCTCACCCTGCCCGCAACGGAGGAGCCTGATGATTGACGGGGCCAGGGCTCCCTCGCTCTTGGCCCCCAGGANCCCATGGTAACTGCTACAGTGGGGGGCCAGCCTGTACGTTTCCTAGTAGACACCGGGGCGGAGCACTCGGTACTGCAGACTCCCTTGGGCAGTGTCTCAAATAAAAAAATGACTGTACAAAGGGCAACTGGAGCTATTCAAGAATATCCTGTCACACGCTCCCGAGAAGTAAACTTGGGACAGAAAAGAGTGACACACTCTTTTCTNGTGGTTCCAGAGTGTCCTTTTCCTCTCCTTGGACGAGACCTGCTCCATAAGTTACAGGCCTCAATCTCCTTTTCAGCTCAGCAGGCTCATCTCACACTAGGAAATGCAACTTCCCCCACTGCCCAACTCTTGCTAACTACCCCTCTGTCAGAAGAATACCTTCTGGTTTCACCATCACAATCACCGGAGGAGAATACTAATACTCTTTTGTTGGACNTACAGACACTTTTTCCCCGAGTTTGGGCCGAGTCAAACCCTCCCGGACTGGCTAAACACCATCCGCCAGTGGTTGTAGAACTCTTGGCCACTGCCATACCGGTC
CAGGTAAAGCAATACCCCATGAGTCAGCAGGCTAGAGAGGNGATTAATCCCCACATTCAATGACTGTTACAAGCTGGCATACTTACACCATGTCAGTCGGCCTGGAACACNCCATTTTTGCCGGTCCAGAAACCTGGAACAAATGATTACCGGCCGGTACAAGACTTAAGGGAAGTTAATAAATGGACTGTTACTGTCCATCCAACCGTCCCTAATCCTTATACTCTACTCAGCCTGCTCCCACCAGAACATACAGTATACACTGTCCTTGACCTGAAAGATGCTTTCTTTGCTATTCCTCTGGCCCCCAAAAGCCAGCCGATTTTTGCATTTGAATGGACAGATCCAAGATCAGGAGACACTACCCAACTGACTTGGACTCAGTTACCTCAGGGTTTTAAAAATTCCCCCACCCTTTTTGGGGAGGCTCTTCGGCAAGATCTTATACCTTCCGAGCTAGTCACCCTAACTGTACTCTTCTTCAGTATGTAGATGATATTTTAATAGCTACTGAAACTATGGACAGTTGTCTACAACACACGAGGGACCTGCTCTACCTCCTTCAGGAGCTCGGGTATGGAGTCTCAGCCAAAAAGGCCCAGCTTTGTCTTCCCAGAGTGTCCTACCTGGGGTACGAGATAAACCAAGGAAAAAGGGCACTCACCAGTGCCCGGAAAGAAGCCATCCTGCGAATCCCCACTCCCGCCACCAAGAGACGGGTACGCGAATTNCTGGGGGCCGTGGGATACTGTCGCCTCTGGATATCGGGGTTCGCGGAGATTGCAAAGCCCTTGTATACTGCTACAGGANGNAATGGCCCGCTAATTTGGACAGACACNGAAGAACAGGCTTTTCAAAACCTGAAAAAGGCATTAACTGAAGCCCCTGCTTTAGCCCTCCCTAATATCTCAAAGCCGTTTCACCTGTTTGTCCATGAAAGCCAGGGAGTTGCTAAAGAGGTGCTTACTCAGACTTTAAGACCCTGGAGACGCCCAGTGGCCTATTTATCTAAGAGGCTGGATCCTGTGGCCTCTGGATGGCCAAGTTGTCTGCGAGCCGTAGCGGCTACAGCAAGCCTAGTCCAAGAAGNTGATAAGTTAACTCTAGGCCAAAATTTAACCCTTACAGCTCCTCATGCCGTAGAGACCTTACTACGAAGTGCTTCTGGCAAATGGATGTCAAATGCTCGCATCTTGCAGTATCAGAGTTTACTGTTAGATCAGCCTCGTTTGACTTTCTCTCCCACAAGGTGTTTNAATCCAGCTACACTACTTCCTGACCCAGACTCCACTATTCCTGCTCATGACTGTCAAGAACTGTTAGAAACTACCGAAACTGGCCGACCTGATCTTCAAGATGTGCCCCTAGAAAAGGCGGATGCCGCCGTGTTCACAGACGGTAGCAGCTTCCTCGAGCAGGGAGTACGAAAAGCCGGTGCAGCTGTTACCACGGAGACAGATGTGTTGTAGGCTCAGGCTTTACCAGCGAACACCTCAGCGCAAAAGGCTGAATTGATCGCCCTCACTCAGGCTCTCCGATGGGGTAAGGATAAACGTATTAACATTTACACTGACAGCAGGTACGCCTTTGCTACTGTGCATGTACATGGAGCCATCTACCAGGAANGCGGGCTACTCACCTCAGCAGGAAAGGCTATCAAAAACAAAGAAGAAATTCTAGCCCTGCTTGAAGCCGTGTGGCTCCCTCAGCAGGTAGCTGTGATCCACTGCAAAGGACATCAAAAAGAAAACACGGCCGTTGCCCGTAGTAACCAGAAAGCTGATTCAGCAGCTCAGGTCGCAGCGNGACTTTCAGTCACGCCTCTAAACTTGCTGCCCACAGTCTCCTTTCCACAGCCAGATCTGCCTGACAATCCCGTATACTCAACAAAANAAAAAAAACTGGCTTCAGATCTCAGAGCCAATAAAAATCAGGAAAGTTAGTAGATTCTTCCTGACTCTAGAATCTTCATACCCCGAACTCTTAAAGAAACTTTAATC
AGTCACCTACAGTCTACCACCCATTTAAGAAGAGCAAAGCTACCTCAGCTCCTCCGGAGCCATTTTAAGATCCCCCGTCTTCAAAGCCTAACAGATCAAGCAGCTCTCCGGTGCACAACCTGCGCCCAGGTAAATGCCAAGCAAGGTCCTAAACCCAGCCCAGGCCACCGTCTCCGAAAAAACTCGCCAGGAGAAAAGTGGGAAATTGACTTTACAGAAGTAAAACCACACCGGGCTAAGTACAAATACCTTCTAGTACTAGTAGACACCTTCTCCGGATGGACTGAGGCATTTGCTACCGAAAACGAAACCGCCAACACGGTAGTTAAGTTTTTACTCAATGAAATCATCCCTCGATATAGGCTGCCTGCTGCCATAGGGTCTGATAATGGACCGGCCTTCACCTCGCCCATAGCTCAGTCAGTCAGTAAGGCGTTAAACATTCAACGGAAGCTCCATTGTGCCTATCGACCCCAGAGCTCCGGGCAGGTAGAACGCATGAACCGCACCCTAAAAAACACTCTTACAAAATTAATCTTAAAAACCGGTGNAAATTAGGTAAGTCTCCTTCCTTTAGCCCTACTTAGAGTAAGGTGCACCCCTTACCAGGCTAGGTTCTCACCTTTTGAAATCATGTATAGGAAGGCGCCGCCTATCTTGCCTAAGCTAAGAGATGCCNAATTAGCAGAAATATCACAAGCTAATTTATTACAGTACCTACAGTCTCTCCAACAGGTACAAGATATCATCCTGCCACTTGTTCGAGGAGCCCATCCCAATCCAATTCCTGACCAGACGGGGTCCTGCCATTCGTTCCAGCCAGGAGACCTAGTGTTTGTTAAAAAGTTCCAGAAAGAAGGACTCACTCCTGCTTAGAAAAGACCTCACACCGTCATCCTCACGACGCCAACGGCTCTGAAGGTGGACGGCATTCCTGCTTAGATTCATCACTCCCGCATCAAAAAGGCCAACAGAGCCCAACTAAAAACATAGGTCCCCAGGCCTAGGTCAGGCCCCTTAAAACTGCGCCTAAGTCAGGTGAAGCCATTAGATTNATTCTTTTTATCTACCTCACTTGTTTGTTTTTGCCCGTTACGTCCTCTGTGCCTTCCTACTCCTTTCTCCTCACCTCTTTCACAACAGGACGTGTATTTGCAAACACCACTTGGAAGGCCGGTACCTCCAAGGAAGTCTCCTTTGCAGTTGATTTATGTGTACTGTTCCCAAAGCCAGCCCGTACCCACGAAGAGCAACACAATCTGCCAGTCCCAGGAGCAGGAAGTGTCGACCTTGCAGCAAGATTCGGACACTCCGGGAGCCAAACTAGATGTGGAAGCTCCAAAGGTGCAGAAAAAGGACTCCAAAATGTTGACTTTTACCTCTGTCCTAGAAATCACCCTGACGCTAGCTGTCGAGATACTTATCAGTTTTTCTGCCCTGATTAGACATGTGTAACTTTAGCCACCTACTCTAAGAGATCAACCAGATCTTCAACTCTTTCCATAAGTCGTGCTTCTCATCCTAAATTATGTACTAGAAAAAATTGTAATCCTCTTACTATAACTGTCCATGACCCTAATTCAACTCAATAGTATCATGGCATGTCATGAAGATTAAGATTTTATATCCCAGGATTTGATGTTAGGACTATGTTCACCATCCAAAANAAAACCCTGGTCTCATGGAGCCCACCCAAGCCAATCGGGCCTTTAACTGATCTAGGTGACCCTATGTTCCAGAAACACCCTGACAAAGTTGATTTAACTGTTCCTCCACCATTCTTAGTTCCTAAGCCCCAGCTACAANGACANCATCTTCAACCCAGCCTGATGTCTATACTAGGTGGAGTACATCATCTCCTTAACCTCACCCAGCCTAAACTAGCCCAAGATTGTTGGCTATGTTTAAAAGCAAAACCCCCTTATTATGTAGGATTAGGAGTAGAAGCCACACTTAAANGTGGCCCTCTATCCTGTCATACACGACCCCGTGCTCTCACA
CTAGGAGATGTGTCTGGAAACGCTTCCTGTCTGATTAGTACCGGGTATAACTTATCTGCTTCTCCTTTTCAGGCTATTTGTAATCAGTCCCTGCTTACTTCCATAAGCACCTCAGTCTCTTACCAAGCGCCTAACAATACCTGGTTGGCCTGCACCTCAGGTCTCACTCGCTGCATTAATGGAACTGAACCAGGACCTCTCTTGTGCGTGTTAGTTCATGTNCTTCCCCAGGTATACGTGTACAGTGGACCAGAAGGACAACTCCTCATCGCTCCCCCGGAATTACATCCCAGGTTGCGCCGAGCTGCCCCACTNCTGGTTCCCCTCTTGGCCGGTCTTAGCATAGCTGGATCAGCAGCCATTGGTACGGCTGCCCTGGTTCAAGGAGAAACTGGACTAATGTCCCTGTCTCAACAGGTGGATGCTGATTTAAGTAACCTCCAGTCTGCCATAGATATACTACATTCCCAGGTAGAGTCTCTGGCTGAAGTAGTNCTTCAAAACCGCCGAGGCTTAGATCTGCTATTCCTCTCTCAAGGAGGATTATGCGCAGCTCTAGGAGAAAGCTGTTGCTTCTACGCCAATCAATCTGGAGTCATAAAAGATACACTCCAAAAAGTGCGAGAAAATCTAGATAGGCGCCAACAAGAACGAGAAAATAACATCCCCTGGTATCAAAGCATGTTCAACTGGAACCCATGGCTAACTACTCTAATCACTAAGTTAGCCGGACCCCTCCCCATCCTACTATTAAGTCTAATTTTTGGGCCTTGTATATTAAATTAGTTTCTTAATTTTGTAAAACAACGCATAGCTTCTGTCAAACTTATGTATCTTAAGACTCAATATAACCCCCTTGTTATAACTGAGGAATCAACGATTTGATTCCCCAAAAACACAAGTGGGGAAATGAAATGCCTAACGTTGTTTTTACTCTAACTNGTTACTTTGAATTTTGTCCTGCTTGTCTCTTTAATC



TF motifs of the consensus sequence

FIMO is used to detect transcription factor motifs in the consensus sequence of the TE family.

TE_family  TFBS    Start  End   Strand  Score  Matched sequence
HERVS71    Clamp   462    475   -       18.41  GAGAGAGAGAGAGA
HERVS71    Clamp   464    477   -       18.41  GAGAGAGAGAGAGA
HERVS71    eor-1   460    472   -       18.35  AGAGAGAGAGAGA
HERVS71    eor-1   462    474   -       18.35  AGAGAGAGAGAGA
HERVS71    eor-1   464    476   -       18.35  AGAGAGAGAGAGA
HERVS71    eor-1   466    478   -       18.35  AGAGAGAGAGAGA
HERVS71    HSFA6A  5958   5970  -       18.31  GAAGATTCTAGAG
HERVS71    scrt    8411   8421  +       18.31  TCAACAGGTGG
HERVS71    IRF9    6298   6312  +       18.18  ACCGAAAACGAAACC
HERVS71    BPC6    469    489   +       18.13  CTCTCTCTCTTTTTCTATCTC
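
Several rows in the table describe the same dinucleotide-repeat region: the Clamp and eor-1 hits are copies of one another shifted by two bases. The following is a minimal Python sketch of how such redundant hits could be collapsed into one interval per motif and strand. It assumes the Start and End columns are 1-based and inclusive, which is consistent with the lengths of the matched sequences, and that minus-strand matches are reported as the reverse complement of the consensus. The names `HITS`, `merge_hits`, and `reverse_complement` are illustrative and not part of FIMO or of this database.

```python
from collections import defaultdict

# Hits copied from the table above: (TFBS, start, end, strand).
HITS = [
    ("Clamp", 462, 475, "-"),
    ("Clamp", 464, 477, "-"),
    ("eor-1", 460, 472, "-"),
    ("eor-1", 462, 474, "-"),
    ("eor-1", 464, 476, "-"),
    ("eor-1", 466, 478, "-"),
    ("HSFA6A", 5958, 5970, "-"),
    ("scrt", 8411, 8421, "+"),
    ("IRF9", 6298, 6312, "+"),
    ("BPC6", 469, 489, "+"),
]


def merge_hits(hits):
    """Merge overlapping hits of the same motif on the same strand.

    Coordinates are assumed to be 1-based and inclusive.
    Returns {(tfbs, strand): [(start, end), ...]}.
    """
    grouped = defaultdict(list)
    for tfbs, start, end, strand in hits:
        grouped[(tfbs, strand)].append((start, end))

    merged = {}
    for key, spans in grouped.items():
        spans.sort()
        out = [list(spans[0])]
        for start, end in spans[1:]:
            if start <= out[-1][1]:
                # Overlaps the previous interval: extend it.
                out[-1][1] = max(out[-1][1], end)
            else:
                out.append([start, end])
        merged[key] = [tuple(span) for span in out]
    return merged


def reverse_complement(seq):
    """Reverse-complement a DNA string; N stays N."""
    return seq.translate(str.maketrans("ACGTN", "TGCAN"))[::-1]


if __name__ == "__main__":
    for (tfbs, strand), spans in merge_hits(HITS).items():
        print(tfbs, strand, spans)
```

Under these assumptions the two Clamp rows collapse to a single interval, as do the four eor-1 rows, while the remaining motifs keep their original coordinates. `reverse_complement` converts a minus-strand match such as `GAGAGAGAGAGAGA` back to the forward-strand text for comparison with the consensus.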


TFBS enrichment in GRCh38

Fisher's exact test is used to test for enrichment of transcription factor binding sites within copies of this TE family in the GRCh38 genome assembly.
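
For readers unfamiliar with the test, here is a minimal Python sketch of a one-sided Fisher's exact test on a 2x2 table, using only the standard library. The page does not state how the contingency table is built or whether the test is one- or two-sided, so the layout below (TF-bound versus unbound regions, inside versus outside the TE family) and the choice of the "greater" alternative are assumptions. The function name `fisher_exact_greater` is illustrative.

```python
from math import comb


def fisher_exact_greater(a, b, c, d):
    """One-sided Fisher's exact test p-value for enrichment.

    Assumed 2x2 layout (not specified by the source page):

                      in TE family   outside TE family
        TF-bound           a                 b
        not TF-bound       c                 d

    Returns P(X >= a) under the hypergeometric null with fixed margins.
    """
    row1 = a + b           # all TF-bound regions
    col1 = a + c           # all regions in the TE family
    total = a + b + c + d
    denom = comb(total, col1)
    upper = min(row1, col1)
    numer = sum(
        comb(row1, k) * comb(total - row1, col1 - k)
        for k in range(a, upper + 1)
    )
    return numer / denom


if __name__ == "__main__":
    # Hypothetical counts, for illustration only (not from the database).
    print(fisher_exact_greater(30, 70, 200, 9700))
```

A small p-value from this sketch would indicate that binding sites for a factor overlap the family more often than expected from the margins alone. A real analysis over many factors would also need a multiple-testing correction, which is not shown here.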




GTEx

The promoter activity across 46 body sites from The Genotype-Tissue Expression (GTEx) project.
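
The Tau index of 0.9113 listed near the top of this entry is presumably a tissue-specificity score computed over a per-tissue profile of this kind, although the page does not say which data or normalization were used. As a hedged illustration, the sketch below implements the commonly used definition, tau = sum(1 - x_i / x_max) / (n - 1), where x_i is the activity in tissue i and n is the number of tissues. The function name `tau_index` and the example values are illustrative only.

```python
def tau_index(expression):
    """Tissue-specificity index tau for one expression profile.

    tau = sum(1 - x_i / x_max) / (n - 1)
    0 means uniform activity across tissues; 1 means activity
    confined to a single tissue. Values are assumed non-negative.
    """
    values = [float(x) for x in expression]
    if len(values) < 2:
        raise ValueError("need at least two tissues")
    peak = max(values)
    if peak <= 0:
        # No activity anywhere: specificity is undefined; report 0 here.
        return 0.0
    return sum(1.0 - x / peak for x in values) / (len(values) - 1)


if __name__ == "__main__":
    # Made-up profiles, not GTEx data.
    print(tau_index([5, 5, 5]))    # uniform profile
    print(tau_index([0, 0, 10]))   # single-tissue profile
```

On this scale, a value above 0.9 would correspond to activity concentrated in a small number of tissues.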




TCGA

The promoter activity across 33 cancer types from The Cancer Genome Atlas (TCGA).