HERVS71


DF ID: DF0000205
TE superfamily: ERV1
TE class: LTR
Species: Catarrhini
Length: 8978 bp
Kimura value: 7.99
Tau index: 0.9113
Description: Internal region of ERV1 endogenous retrovirus, HERVS71 subfamily
Comment: The associated long terminal repeats are LTR6A and LTR6B.
Sequence
TAATGGAGGCCCCAGCGAGANATTAACGCCACCGGGCGAGAGCCGGGCTCGCTCCGGGCTCCCCCGGAAGGACGGCCGGCTTGTAGGGGGGGCGCCACCTGAAAAAANAATTTTCAGGNTCCCCGAAAGGTGACCGTCTTCCGGAGGAGAGCGGATCGACTACCGTGTGGGTGCCCATAAAATTCCACCTCTGAGTCCTCAGCTTCTGACCCCGGGGTCAGGTAGGTCAGATTTGACTTCGGTTCTGGTAAGAGGGAAGCGGCCCTGACGAGGGCGTCCCTCTTTTGACTCTGCCCGTTTCTCTAGGACGCTAGAGGGTNGAGCCCTGGTTTTCTGNTAGGCGCCTCTGTGTCTCTGTCTAGGAGGGAAGTGGCCCTGACAGGGGCCCTCCCTTGACTCAGTCCACGTCCCAGGATGCTGGAGGACTGAGTCCTGGTTTCTGGCAGACCGGNNNNTCNNTCTCTCTCTCTCTCTCTCTTTTTCTATCTCTCATCTTTCTCTTGTTCAAGTTTCTTGGAAATCTCCGGGAAAGAAAANNNNNNNNNNAAAAAAAAACTGTTATAAACTCTGTGTGAATGGTGAGTGAATGAGGGAGGACAAGGGCTTGCGCTTGTCCTCCAGTTTGTAGCTCCACGGCGAAAGCTACGGAGTTCGAGTGGGCCCTCACCTGCGGTTCCGTGGCGACCTCATAAGGCTTAAGGCAGCATCGGGCATAGCTCGATCCGAGCCGGGGGTTTATACCGGCCTGCCAATGCTAAGAGGAGCCCAAGTCCCCTCAGGGGGAGCGGCCAGGCGGGCATCTGACTGATCCCATCACGGGANCCCCTCCCCTTGTCTGTCTAAAAAAAAAAANAAAAAAGGAAAAACTGTCATAACTGTTTACATGCCCTAAAGTCAATTGTTTGTTTTATGTTGATTGTTCTGTTCAGTGTCTATTGTCTTGTTTAGTAGTTGTCAAAGTTTTGCATGTCAAGACGTCGATATTGCCCAAGACGTCTAGGTAAAAACTTCTTCAAGGTCCTTAGTGCTGATTTTTTGTCACAGGAGGTTAAATTTCTCATCAATCATTTAGGCTGGCCACCACAGTCCTGTCTTTTCTGCCAGAAGCAAGTCAAGTGTTGTTACGAGAACGAGTGTGAAAAACATTCGCCTGATTAAGATTTCTGGCACCATGAAAGTTGTAAGTATTTAGATCGTCATACCCCACGTCCAAGTGATTAGACCTCCTCTAAACTAAACCGGTAGTGGGTTCAAAACAGCCACCCTGCAGATTTCCTTGCTCACCTCTTTTGTCATTCTGTAACTTTTCCTGTGCCCTTAAATAGAACACTGTGTAAGGAAACGTACGCCCGTACTGCTTTACTTCGTTTAGATTCTTACTCTGTTCCTCTGTGGCTACTCTCCCATCTTAAAAATGATCCGAGTAGTCCTTTTCCNCCTCGTCCCTGCCCCCTACCCCGCACATCTCGTTTTCCGGTGCGACAGCAAGTTCAGCGTCTCCAGGACTTGGCTCTGCTCTCACTCCTTAAACCCTTAAAAGAAAAAGCTAAGTTTAAGCTATTTGCCTTTAAGTCATAGAGACACCAAAAGTATTTAAGGTGCAGATCTAGAAGAAGAAGAAGANNGAGAACGCCTAGATCAAACTGACCCAGAAGATCTCAGGCTGGCCCCTAGTCCTCCTCCCTCAATCTTAAAGCTACAGCAATGTGGCAAGTAGTATTAGCTGTTGTAGTTTTTCTGCTNCTTTCTGGTCATGTTGATTCTGTTCTTTCGATACTCCAGCCCCCCAAGGAATGAGTTTCTCTGTCCGTGCTAGGTTTAATATCTATGCTCAANATCTTATTAAATTGCCTTCAAANAANAAAAANAANNNNAAAACGGGAAACACTTCCTCCCAGCCTTGTAAAGGTTAGAGCCCTCTCCAATGTATGCTGCAGAATTTTTCTCTCGGTTTCTCAGAGGATTATAAAGTCCGCCTTAAAAAAGGCAAGCTCCGGACA
CTCTGCGAAATAGAATGGCCAAAGTTTAGAGTCGAGTGGCCCCCTGAAGGGTCATTGAACCTCACAATTGTTCAAGCTGTGTGGCGGGTTGTTACTGAAACTCCCAGCCACCCTGATCAGTTTCCCTACATTGATCAATGGCTAAGTTTGGTCAGGAGCCCCCCTCCATGGCTCCGTTCATGCGCCATTCATAATTCTACCTCCAAGGTCCTCCTGAGCCAGACCGCGTTTTCGCCTCGACCCTCAGCCGGTTCGGCTCCCCCTGTACTGCCTCCCTCTGAAGAAGAGGAGAGTCTCCCTCACCCAGTCCCACCGCCTTACAACCAGCCTGCTCCCTTAAAGTTATCCCATGTCTCCTCGACGACGTCCCCTGTAGGCTCGCCACCCATTGCCTCTCGATCGCGACCGCGGCGGGAGGAAGTAGCCCCTCTACTACCACTGAGAGAGGCACAAGTCCCTCCGGGTGACGAGCGCTCAGCCCCCTTCTTAGTTTATGTCCCTTTTTCTACTTCTGACTTGTATAATTGGAAAACCCATAATCCTCCCTTCTCTGAAAAGCCCCAGGCTTTGACCTCTCTGACGGAGTCCGTACTCCGGACTCACCCGCCCACCTAGGATGATTGCCAACAGCTCCTTTTAACCCTTTTCACCTCTGAAGAGAAGGAACGTATCCGAAGAGAGGCCAAAAAGTACTTCCTCGCATCAGCCAATGGACCGGAGGAGGAAGCTAGAGACCTCCTTGAGGAGGTCTTTCCCTCTACCCGGCCTAACCGGGACCCAAATTCCTCAAGTGGAAGGAGAGCTTTAGACGATTTTCACCGGTATCTCCTCGCGGGTATTAAAGGAGCCGCTCGGAAACCCATAAACTTGTCTAAGACGACCGAAGTTGTCCAGGGGCCCGATGAGTCACCAGGAGCGTTTTTAGAGCGCCTCCAGGAGGCTTATCGGATTTACACCCCTTTTGACCCGGCGGCTCCCGAAAATAGCCGTGCTCTTAATTTGGCATTTGTGGCTCAGGCAGCCCCGGATATTAAAAGGAAACTCCAAAAACTGGAAGGATTTGCTAGAATGAATATCAGTCAGCTTTTAGAAATAGCCCAAAAAGTTTTTGACAATCGAGAGTTTGAAAAACAAAAACAAGCAACACAGGCAGCTGAAAAGGCCGCTGATAAAGCATTCAAAAGACAAACAAAAATCTTAGTGGCGGCTATCCAAGAGGACAGAATGAAATGGCCCCCATTCCAGAAGAATGGCCAAGGAACCTCGGGTTCCCACCAGAAAAGTAAAAGAGGTGAACAGGCCCCTCTAGGAAAAACCAATGTGCCTATTGCAAGCAGACTGGGCACTGGAAAAAGGAGTGCCCACTACTGCCANAAGAAAAGTCAGAAAACAAAAAGGTCCTCACCCTGCCCGCAACGGAGGAGCCTGATGATTGACGGGGCCAGGGCTCCCTCGCTCTTGGCCCCCAGGANCCCATGGTAACTGCTACAGTGGGGGGCCAGCCTGTACGTTTCCTAGTAGACACCGGGGCGGAGCACTCGGTACTGCAGACTCCCTTGGGCAGTGTCTCAAATAAAAAAATGACTGTACAAAGGGCAACTGGAGCTATTCAAGAATATCCTGTCACACGCTCCCGAGAAGTAAACTTGGGACAGAAAAGAGTGACACACTCTTTTCTNGTGGTTCCAGAGTGTCCTTTTCCTCTCCTTGGACGAGACCTGCTCCATAAGTTACAGGCCTCAATCTCCTTTTCAGCTCAGCAGGCTCATCTCACACTAGGAAATGCAACTTCCCCCACTGCCCAACTCTTGCTAACTACCCCTCTGTCAGAAGAATACCTTCTGGTTTCACCATCACAATCACCGGAGGAGAATACTAATACTCTTTTGTTGGACNTACAGACACTTTTTCCCCGAGTTTGGGCCGAGTCAAACCCTCCCGGACTGGCTAAACACCATCCGCCAGTGGTTGTAGAACTCTTGGCCACTGCCATACCGGTC
CAGGTAAAGCAATACCCCATGAGTCAGCAGGCTAGAGAGGNGATTAATCCCCACATTCAATGACTGTTACAAGCTGGCATACTTACACCATGTCAGTCGGCCTGGAACACNCCATTTTTGCCGGTCCAGAAACCTGGAACAAATGATTACCGGCCGGTACAAGACTTAAGGGAAGTTAATAAATGGACTGTTACTGTCCATCCAACCGTCCCTAATCCTTATACTCTACTCAGCCTGCTCCCACCAGAACATACAGTATACACTGTCCTTGACCTGAAAGATGCTTTCTTTGCTATTCCTCTGGCCCCCAAAAGCCAGCCGATTTTTGCATTTGAATGGACAGATCCAAGATCAGGAGACACTACCCAACTGACTTGGACTCAGTTACCTCAGGGTTTTAAAAATTCCCCCACCCTTTTTGGGGAGGCTCTTCGGCAAGATCTTATACCTTCCGAGCTAGTCACCCTAACTGTACTCTTCTTCAGTATGTAGATGATATTTTAATAGCTACTGAAACTATGGACAGTTGTCTACAACACACGAGGGACCTGCTCTACCTCCTTCAGGAGCTCGGGTATGGAGTCTCAGCCAAAAAGGCCCAGCTTTGTCTTCCCAGAGTGTCCTACCTGGGGTACGAGATAAACCAAGGAAAAAGGGCACTCACCAGTGCCCGGAAAGAAGCCATCCTGCGAATCCCCACTCCCGCCACCAAGAGACGGGTACGCGAATTNCTGGGGGCCGTGGGATACTGTCGCCTCTGGATATCGGGGTTCGCGGAGATTGCAAAGCCCTTGTATACTGCTACAGGANGNAATGGCCCGCTAATTTGGACAGACACNGAAGAACAGGCTTTTCAAAACCTGAAAAAGGCATTAACTGAAGCCCCTGCTTTAGCCCTCCCTAATATCTCAAAGCCGTTTCACCTGTTTGTCCATGAAAGCCAGGGAGTTGCTAAAGAGGTGCTTACTCAGACTTTAAGACCCTGGAGACGCCCAGTGGCCTATTTATCTAAGAGGCTGGATCCTGTGGCCTCTGGATGGCCAAGTTGTCTGCGAGCCGTAGCGGCTACAGCAAGCCTAGTCCAAGAAGNTGATAAGTTAACTCTAGGCCAAAATTTAACCCTTACAGCTCCTCATGCCGTAGAGACCTTACTACGAAGTGCTTCTGGCAAATGGATGTCAAATGCTCGCATCTTGCAGTATCAGAGTTTACTGTTAGATCAGCCTCGTTTGACTTTCTCTCCCACAAGGTGTTTNAATCCAGCTACACTACTTCCTGACCCAGACTCCACTATTCCTGCTCATGACTGTCAAGAACTGTTAGAAACTACCGAAACTGGCCGACCTGATCTTCAAGATGTGCCCCTAGAAAAGGCGGATGCCGCCGTGTTCACAGACGGTAGCAGCTTCCTCGAGCAGGGAGTACGAAAAGCCGGTGCAGCTGTTACCACGGAGACAGATGTGTTGTAGGCTCAGGCTTTACCAGCGAACACCTCAGCGCAAAAGGCTGAATTGATCGCCCTCACTCAGGCTCTCCGATGGGGTAAGGATAAACGTATTAACATTTACACTGACAGCAGGTACGCCTTTGCTACTGTGCATGTACATGGAGCCATCTACCAGGAANGCGGGCTACTCACCTCAGCAGGAAAGGCTATCAAAAACAAAGAAGAAATTCTAGCCCTGCTTGAAGCCGTGTGGCTCCCTCAGCAGGTAGCTGTGATCCACTGCAAAGGACATCAAAAAGAAAACACGGCCGTTGCCCGTAGTAACCAGAAAGCTGATTCAGCAGCTCAGGTCGCAGCGNGACTTTCAGTCACGCCTCTAAACTTGCTGCCCACAGTCTCCTTTCCACAGCCAGATCTGCCTGACAATCCCGTATACTCAACAAAANAAAAAAAACTGGCTTCAGATCTCAGAGCCAATAAAAATCAGGAAAGTTAGTAGATTCTTCCTGACTCTAGAATCTTCATACCCCGAACTCTTAAAGAAACTTTAATC
AGTCACCTACAGTCTACCACCCATTTAAGAAGAGCAAAGCTACCTCAGCTCCTCCGGAGCCATTTTAAGATCCCCCGTCTTCAAAGCCTAACAGATCAAGCAGCTCTCCGGTGCACAACCTGCGCCCAGGTAAATGCCAAGCAAGGTCCTAAACCCAGCCCAGGCCACCGTCTCCGAAAAAACTCGCCAGGAGAAAAGTGGGAAATTGACTTTACAGAAGTAAAACCACACCGGGCTAAGTACAAATACCTTCTAGTACTAGTAGACACCTTCTCCGGATGGACTGAGGCATTTGCTACCGAAAACGAAACCGCCAACACGGTAGTTAAGTTTTTACTCAATGAAATCATCCCTCGATATAGGCTGCCTGCTGCCATAGGGTCTGATAATGGACCGGCCTTCACCTCGCCCATAGCTCAGTCAGTCAGTAAGGCGTTAAACATTCAACGGAAGCTCCATTGTGCCTATCGACCCCAGAGCTCCGGGCAGGTAGAACGCATGAACCGCACCCTAAAAAACACTCTTACAAAATTAATCTTAAAAACCGGTGNAAATTAGGTAAGTCTCCTTCCTTTAGCCCTACTTAGAGTAAGGTGCACCCCTTACCAGGCTAGGTTCTCACCTTTTGAAATCATGTATAGGAAGGCGCCGCCTATCTTGCCTAAGCTAAGAGATGCCNAATTAGCAGAAATATCACAAGCTAATTTATTACAGTACCTACAGTCTCTCCAACAGGTACAAGATATCATCCTGCCACTTGTTCGAGGAGCCCATCCCAATCCAATTCCTGACCAGACGGGGTCCTGCCATTCGTTCCAGCCAGGAGACCTAGTGTTTGTTAAAAAGTTCCAGAAAGAAGGACTCACTCCTGCTTAGAAAAGACCTCACACCGTCATCCTCACGACGCCAACGGCTCTGAAGGTGGACGGCATTCCTGCTTAGATTCATCACTCCCGCATCAAAAAGGCCAACAGAGCCCAACTAAAAACATAGGTCCCCAGGCCTAGGTCAGGCCCCTTAAAACTGCGCCTAAGTCAGGTGAAGCCATTAGATTNATTCTTTTTATCTACCTCACTTGTTTGTTTTTGCCCGTTACGTCCTCTGTGCCTTCCTACTCCTTTCTCCTCACCTCTTTCACAACAGGACGTGTATTTGCAAACACCACTTGGAAGGCCGGTACCTCCAAGGAAGTCTCCTTTGCAGTTGATTTATGTGTACTGTTCCCAAAGCCAGCCCGTACCCACGAAGAGCAACACAATCTGCCAGTCCCAGGAGCAGGAAGTGTCGACCTTGCAGCAAGATTCGGACACTCCGGGAGCCAAACTAGATGTGGAAGCTCCAAAGGTGCAGAAAAAGGACTCCAAAATGTTGACTTTTACCTCTGTCCTAGAAATCACCCTGACGCTAGCTGTCGAGATACTTATCAGTTTTTCTGCCCTGATTAGACATGTGTAACTTTAGCCACCTACTCTAAGAGATCAACCAGATCTTCAACTCTTTCCATAAGTCGTGCTTCTCATCCTAAATTATGTACTAGAAAAAATTGTAATCCTCTTACTATAACTGTCCATGACCCTAATTCAACTCAATAGTATCATGGCATGTCATGAAGATTAAGATTTTATATCCCAGGATTTGATGTTAGGACTATGTTCACCATCCAAAANAAAACCCTGGTCTCATGGAGCCCACCCAAGCCAATCGGGCCTTTAACTGATCTAGGTGACCCTATGTTCCAGAAACACCCTGACAAAGTTGATTTAACTGTTCCTCCACCATTCTTAGTTCCTAAGCCCCAGCTACAANGACANCATCTTCAACCCAGCCTGATGTCTATACTAGGTGGAGTACATCATCTCCTTAACCTCACCCAGCCTAAACTAGCCCAAGATTGTTGGCTATGTTTAAAAGCAAAACCCCCTTATTATGTAGGATTAGGAGTAGAAGCCACACTTAAANGTGGCCCTCTATCCTGTCATACACGACCCCGTGCTCTCACA
CTAGGAGATGTGTCTGGAAACGCTTCCTGTCTGATTAGTACCGGGTATAACTTATCTGCTTCTCCTTTTCAGGCTATTTGTAATCAGTCCCTGCTTACTTCCATAAGCACCTCAGTCTCTTACCAAGCGCCTAACAATACCTGGTTGGCCTGCACCTCAGGTCTCACTCGCTGCATTAATGGAACTGAACCAGGACCTCTCTTGTGCGTGTTAGTTCATGTNCTTCCCCAGGTATACGTGTACAGTGGACCAGAAGGACAACTCCTCATCGCTCCCCCGGAATTACATCCCAGGTTGCGCCGAGCTGCCCCACTNCTGGTTCCCCTCTTGGCCGGTCTTAGCATAGCTGGATCAGCAGCCATTGGTACGGCTGCCCTGGTTCAAGGAGAAACTGGACTAATGTCCCTGTCTCAACAGGTGGATGCTGATTTAAGTAACCTCCAGTCTGCCATAGATATACTACATTCCCAGGTAGAGTCTCTGGCTGAAGTAGTNCTTCAAAACCGCCGAGGCTTAGATCTGCTATTCCTCTCTCAAGGAGGATTATGCGCAGCTCTAGGAGAAAGCTGTTGCTTCTACGCCAATCAATCTGGAGTCATAAAAGATACACTCCAAAAAGTGCGAGAAAATCTAGATAGGCGCCAACAAGAACGAGAAAATAACATCCCCTGGTATCAAAGCATGTTCAACTGGAACCCATGGCTAACTACTCTAATCACTAAGTTAGCCGGACCCCTCCCCATCCTACTATTAAGTCTAATTTTTGGGCCTTGTATATTAAATTAGTTTCTTAATTTTGTAAAACAACGCATAGCTTCTGTCAAACTTATGTATCTTAAGACTCAATATAACCCCCTTGTTATAACTGAGGAATCAACGATTTGATTCCCCAAAAACACAAGTGGGGAAATGAAATGCCTAACGTTGTTTTTACTCTAACTNGTTACTTTGAATTTTGTCCTGCTTGTCTCTTTAATC



TF motifs of the consensus sequence

Use FIMO to detect transcription factor motifs in the consensus sequence of the TE family.

TE_family  TFBS         Start  End   Strand  Score  Matched sequence
HERVS71    RAMOSA1      460    473   -       25.64  GAGAGAGAGAGAGA
HERVS71    RAMOSA1      462    475   -       25.64  GAGAGAGAGAGAGA
HERVS71    RAMOSA1      464    477   -       25.64  GAGAGAGAGAGAGA
HERVS71    BPC1         470    493   -       24.94  ATGAGAGATAGAAAAAGAGAGAGA
HERVS71    BPC5         463    492   -       23.33  TGAGAGATAGAAAAAGAGAGAGAGAGAGAG
HERVS71    RAMOSA1      466    479   -       22.89  AAGAGAGAGAGAGA
HERVS71    RAMOSA1      468    481   -       19.57  AAAAGAGAGAGAGA
HERVS71    ZNF282       2750   2764  +       18.79  CTTTCCCTCTACCCG
HERVS71    Bach1::Mafk  4018   4029  +       18.70  CATGAGTCAGCA
HERVS71    Clamp        460    473   -       18.41  GAGAGAGAGAGAGA
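
Hits on the "-" strand in the table above are listed as the reverse complement of the consensus at the given coordinates, which appear to be 1-based and inclusive. The short Python sketch below illustrates that reading. The helper names (reverse_complement, site_sequence) are hypothetical and are not part of FIMO. The region string is consensus positions 460 to 493 as implied by the minus-strand hits themselves.

```python
# Complement table for DNA, including the ambiguity code N.
_COMPLEMENT = str.maketrans("ACGTNacgtn", "TGCANtgcan")


def reverse_complement(seq):
    """Return the reverse complement of a DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def site_sequence(region, start, end, strand, region_start=1):
    """Return the sequence reported for a hit.

    start and end are 1-based inclusive consensus coordinates (an
    assumption about the table); region_start is the consensus
    position of region[0].
    """
    sub = region[start - region_start:end - region_start + 1]
    return reverse_complement(sub) if strand == "-" else sub


# Consensus positions 460..493, reconstructed from the hits in the table.
REGION_START = 460
region = "TC" * 9 + "TTTTT" + "CTATCTCTCAT"

# (TFBS, start, end, strand, matched sequence) copied from the table.
hits = [
    ("RAMOSA1", 460, 473, "-", "GAGAGAGAGAGAGA"),
    ("RAMOSA1", 468, 481, "-", "AAAAGAGAGAGAGA"),
    ("BPC1", 470, 493, "-", "ATGAGAGATAGAAAAAGAGAGAGA"),
]

for name, start, end, strand, matched in hits:
    found = site_sequence(region, start, end, strand, REGION_START)
    print(name, start, end, strand, found == matched)
```

The overlapping RAMOSA1 hits at 460, 462, 464, 466 and 468 all fall inside the same TC dinucleotide repeat, so they likely reflect one low-complexity stretch rather than several independent sites.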


TFBS enrichment in GRCh38

Use Fisher's exact test to assess enrichment of transcription factor binding sites in GRCh38 copies of the TE family.
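
As a rough illustration of this kind of test, the Python sketch below computes a one-sided Fisher's exact p-value from a 2x2 table using only the standard library. The table layout, the function name fisher_enrichment_p and the example counts are assumptions made for illustration; the page does not state how the contingency table or the background set is defined.

```python
from math import comb


def fisher_enrichment_p(a, b, c, d):
    """One-sided Fisher's exact test p-value for enrichment.

    Assumed 2x2 layout (hypothetical):
                        with TFBS   without TFBS
        TE family           a            b
        background          c            d

    Returns P(X >= a) under the hypergeometric null with fixed margins.
    """
    row1 = a + b          # regions in the TE family
    col1 = a + c          # regions with a binding site
    n = a + b + c + d     # all regions
    upper = min(row1, col1)
    numerator = sum(
        comb(col1, k) * comb(n - col1, row1 - k)
        for k in range(a, upper + 1)
    )
    return numerator / comb(n, row1)


# Hypothetical counts: 40 of 200 family copies overlap a binding site,
# against 1000 of 20000 background regions.
p = fisher_enrichment_p(40, 160, 1000, 19000)
print(f"one-sided p-value: {p:.3g}")
```

A one-sided test is used here because the question is enrichment rather than any difference; a two-sided variant would also count tables that are extreme in the depletion direction.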




GTEx

The promoter activity across 46 body sites from the Genotype-Tissue Expression (GTEx) project.




TCGA

The promoter activity across 33 cancer types from The Cancer Genome Atlas (TCGA).