HERVS71


DF ID: DF0000205
TE superfamily: ERV1
TE class: LTR
Species: Catarrhini
Length: 8978
Kimura value: 7.99
Tau index: 0.9113
Description: Internal region of ERV1 endogenous retrovirus, HERVS71 subfamily
Comment: The associated long terminal repeats are LTR6A and LTR6B.
Sequence:
TAATGGAGGCCCCAGCGAGANATTAACGCCACCGGGCGAGAGCCGGGCTCGCTCCGGGCTCCCCCGGAAGGACGGCCGGCTTGTAGGGGGGGCGCCACCTGAAAAAANAATTTTCAGGNTCCCCGAAAGGTGACCGTCTTCCGGAGGAGAGCGGATCGACTACCGTGTGGGTGCCCATAAAATTCCACCTCTGAGTCCTCAGCTTCTGACCCCGGGGTCAGGTAGGTCAGATTTGACTTCGGTTCTGGTAAGAGGGAAGCGGCCCTGACGAGGGCGTCCCTCTTTTGACTCTGCCCGTTTCTCTAGGACGCTAGAGGGTNGAGCCCTGGTTTTCTGNTAGGCGCCTCTGTGTCTCTGTCTAGGAGGGAAGTGGCCCTGACAGGGGCCCTCCCTTGACTCAGTCCACGTCCCAGGATGCTGGAGGACTGAGTCCTGGTTTCTGGCAGACCGGNNNNTCNNTCTCTCTCTCTCTCTCTCTTTTTCTATCTCTCATCTTTCTCTTGTTCAAGTTTCTTGGAAATCTCCGGGAAAGAAAANNNNNNNNNNAAAAAAAAACTGTTATAAACTCTGTGTGAATGGTGAGTGAATGAGGGAGGACAAGGGCTTGCGCTTGTCCTCCAGTTTGTAGCTCCACGGCGAAAGCTACGGAGTTCGAGTGGGCCCTCACCTGCGGTTCCGTGGCGACCTCATAAGGCTTAAGGCAGCATCGGGCATAGCTCGATCCGAGCCGGGGGTTTATACCGGCCTGCCAATGCTAAGAGGAGCCCAAGTCCCCTCAGGGGGAGCGGCCAGGCGGGCATCTGACTGATCCCATCACGGGANCCCCTCCCCTTGTCTGTCTAAAAAAAAAAANAAAAAAGGAAAAACTGTCATAACTGTTTACATGCCCTAAAGTCAATTGTTTGTTTTATGTTGATTGTTCTGTTCAGTGTCTATTGTCTTGTTTAGTAGTTGTCAAAGTTTTGCATGTCAAGACGTCGATATTGCCCAAGACGTCTAGGTAAAAACTTCTTCAAGGTCCTTAGTGCTGATTTTTTGTCACAGGAGGTTAAATTTCTCATCAATCATTTAGGCTGGCCACCACAGTCCTGTCTTTTCTGCCAGAAGCAAGTCAAGTGTTGTTACGAGAACGAGTGTGAAAAACATTCGCCTGATTAAGATTTCTGGCACCATGAAAGTTGTAAGTATTTAGATCGTCATACCCCACGTCCAAGTGATTAGACCTCCTCTAAACTAAACCGGTAGTGGGTTCAAAACAGCCACCCTGCAGATTTCCTTGCTCACCTCTTTTGTCATTCTGTAACTTTTCCTGTGCCCTTAAATAGAACACTGTGTAAGGAAACGTACGCCCGTACTGCTTTACTTCGTTTAGATTCTTACTCTGTTCCTCTGTGGCTACTCTCCCATCTTAAAAATGATCCGAGTAGTCCTTTTCCNCCTCGTCCCTGCCCCCTACCCCGCACATCTCGTTTTCCGGTGCGACAGCAAGTTCAGCGTCTCCAGGACTTGGCTCTGCTCTCACTCCTTAAACCCTTAAAAGAAAAAGCTAAGTTTAAGCTATTTGCCTTTAAGTCATAGAGACACCAAAAGTATTTAAGGTGCAGATCTAGAAGAAGAAGAAGANNGAGAACGCCTAGATCAAACTGACCCAGAAGATCTCAGGCTGGCCCCTAGTCCTCCTCCCTCAATCTTAAAGCTACAGCAATGTGGCAAGTAGTATTAGCTGTTGTAGTTTTTCTGCTNCTTTCTGGTCATGTTGATTCTGTTCTTTCGATACTCCAGCCCCCCAAGGAATGAGTTTCTCTGTCCGTGCTAGGTTTAATATCTATGCTCAANATCTTATTAAATTGCCTTCAAANAANAAAAANAANNNNAAAACGGGAAACACTTCCTCCCAGCCTTGTAAAGGTTAGAGCCCTCTCCAATGTATGCTGCAGAATTTTTCTCTCGGTTTCTCAGAGGATTATAAAGTCCGCCTTAAAAAAGGCAAGCTCCGGACA
CTCTGCGAAATAGAATGGCCAAAGTTTAGAGTCGAGTGGCCCCCTGAAGGGTCATTGAACCTCACAATTGTTCAAGCTGTGTGGCGGGTTGTTACTGAAACTCCCAGCCACCCTGATCAGTTTCCCTACATTGATCAATGGCTAAGTTTGGTCAGGAGCCCCCCTCCATGGCTCCGTTCATGCGCCATTCATAATTCTACCTCCAAGGTCCTCCTGAGCCAGACCGCGTTTTCGCCTCGACCCTCAGCCGGTTCGGCTCCCCCTGTACTGCCTCCCTCTGAAGAAGAGGAGAGTCTCCCTCACCCAGTCCCACCGCCTTACAACCAGCCTGCTCCCTTAAAGTTATCCCATGTCTCCTCGACGACGTCCCCTGTAGGCTCGCCACCCATTGCCTCTCGATCGCGACCGCGGCGGGAGGAAGTAGCCCCTCTACTACCACTGAGAGAGGCACAAGTCCCTCCGGGTGACGAGCGCTCAGCCCCCTTCTTAGTTTATGTCCCTTTTTCTACTTCTGACTTGTATAATTGGAAAACCCATAATCCTCCCTTCTCTGAAAAGCCCCAGGCTTTGACCTCTCTGACGGAGTCCGTACTCCGGACTCACCCGCCCACCTAGGATGATTGCCAACAGCTCCTTTTAACCCTTTTCACCTCTGAAGAGAAGGAACGTATCCGAAGAGAGGCCAAAAAGTACTTCCTCGCATCAGCCAATGGACCGGAGGAGGAAGCTAGAGACCTCCTTGAGGAGGTCTTTCCCTCTACCCGGCCTAACCGGGACCCAAATTCCTCAAGTGGAAGGAGAGCTTTAGACGATTTTCACCGGTATCTCCTCGCGGGTATTAAAGGAGCCGCTCGGAAACCCATAAACTTGTCTAAGACGACCGAAGTTGTCCAGGGGCCCGATGAGTCACCAGGAGCGTTTTTAGAGCGCCTCCAGGAGGCTTATCGGATTTACACCCCTTTTGACCCGGCGGCTCCCGAAAATAGCCGTGCTCTTAATTTGGCATTTGTGGCTCAGGCAGCCCCGGATATTAAAAGGAAACTCCAAAAACTGGAAGGATTTGCTAGAATGAATATCAGTCAGCTTTTAGAAATAGCCCAAAAAGTTTTTGACAATCGAGAGTTTGAAAAACAAAAACAAGCAACACAGGCAGCTGAAAAGGCCGCTGATAAAGCATTCAAAAGACAAACAAAAATCTTAGTGGCGGCTATCCAAGAGGACAGAATGAAATGGCCCCCATTCCAGAAGAATGGCCAAGGAACCTCGGGTTCCCACCAGAAAAGTAAAAGAGGTGAACAGGCCCCTCTAGGAAAAACCAATGTGCCTATTGCAAGCAGACTGGGCACTGGAAAAAGGAGTGCCCACTACTGCCANAAGAAAAGTCAGAAAACAAAAAGGTCCTCACCCTGCCCGCAACGGAGGAGCCTGATGATTGACGGGGCCAGGGCTCCCTCGCTCTTGGCCCCCAGGANCCCATGGTAACTGCTACAGTGGGGGGCCAGCCTGTACGTTTCCTAGTAGACACCGGGGCGGAGCACTCGGTACTGCAGACTCCCTTGGGCAGTGTCTCAAATAAAAAAATGACTGTACAAAGGGCAACTGGAGCTATTCAAGAATATCCTGTCACACGCTCCCGAGAAGTAAACTTGGGACAGAAAAGAGTGACACACTCTTTTCTNGTGGTTCCAGAGTGTCCTTTTCCTCTCCTTGGACGAGACCTGCTCCATAAGTTACAGGCCTCAATCTCCTTTTCAGCTCAGCAGGCTCATCTCACACTAGGAAATGCAACTTCCCCCACTGCCCAACTCTTGCTAACTACCCCTCTGTCAGAAGAATACCTTCTGGTTTCACCATCACAATCACCGGAGGAGAATACTAATACTCTTTTGTTGGACNTACAGACACTTTTTCCCCGAGTTTGGGCCGAGTCAAACCCTCCCGGACTGGCTAAACACCATCCGCCAGTGGTTGTAGAACTCTTGGCCACTGCCATACCGGTC
CAGGTAAAGCAATACCCCATGAGTCAGCAGGCTAGAGAGGNGATTAATCCCCACATTCAATGACTGTTACAAGCTGGCATACTTACACCATGTCAGTCGGCCTGGAACACNCCATTTTTGCCGGTCCAGAAACCTGGAACAAATGATTACCGGCCGGTACAAGACTTAAGGGAAGTTAATAAATGGACTGTTACTGTCCATCCAACCGTCCCTAATCCTTATACTCTACTCAGCCTGCTCCCACCAGAACATACAGTATACACTGTCCTTGACCTGAAAGATGCTTTCTTTGCTATTCCTCTGGCCCCCAAAAGCCAGCCGATTTTTGCATTTGAATGGACAGATCCAAGATCAGGAGACACTACCCAACTGACTTGGACTCAGTTACCTCAGGGTTTTAAAAATTCCCCCACCCTTTTTGGGGAGGCTCTTCGGCAAGATCTTATACCTTCCGAGCTAGTCACCCTAACTGTACTCTTCTTCAGTATGTAGATGATATTTTAATAGCTACTGAAACTATGGACAGTTGTCTACAACACACGAGGGACCTGCTCTACCTCCTTCAGGAGCTCGGGTATGGAGTCTCAGCCAAAAAGGCCCAGCTTTGTCTTCCCAGAGTGTCCTACCTGGGGTACGAGATAAACCAAGGAAAAAGGGCACTCACCAGTGCCCGGAAAGAAGCCATCCTGCGAATCCCCACTCCCGCCACCAAGAGACGGGTACGCGAATTNCTGGGGGCCGTGGGATACTGTCGCCTCTGGATATCGGGGTTCGCGGAGATTGCAAAGCCCTTGTATACTGCTACAGGANGNAATGGCCCGCTAATTTGGACAGACACNGAAGAACAGGCTTTTCAAAACCTGAAAAAGGCATTAACTGAAGCCCCTGCTTTAGCCCTCCCTAATATCTCAAAGCCGTTTCACCTGTTTGTCCATGAAAGCCAGGGAGTTGCTAAAGAGGTGCTTACTCAGACTTTAAGACCCTGGAGACGCCCAGTGGCCTATTTATCTAAGAGGCTGGATCCTGTGGCCTCTGGATGGCCAAGTTGTCTGCGAGCCGTAGCGGCTACAGCAAGCCTAGTCCAAGAAGNTGATAAGTTAACTCTAGGCCAAAATTTAACCCTTACAGCTCCTCATGCCGTAGAGACCTTACTACGAAGTGCTTCTGGCAAATGGATGTCAAATGCTCGCATCTTGCAGTATCAGAGTTTACTGTTAGATCAGCCTCGTTTGACTTTCTCTCCCACAAGGTGTTTNAATCCAGCTACACTACTTCCTGACCCAGACTCCACTATTCCTGCTCATGACTGTCAAGAACTGTTAGAAACTACCGAAACTGGCCGACCTGATCTTCAAGATGTGCCCCTAGAAAAGGCGGATGCCGCCGTGTTCACAGACGGTAGCAGCTTCCTCGAGCAGGGAGTACGAAAAGCCGGTGCAGCTGTTACCACGGAGACAGATGTGTTGTAGGCTCAGGCTTTACCAGCGAACACCTCAGCGCAAAAGGCTGAATTGATCGCCCTCACTCAGGCTCTCCGATGGGGTAAGGATAAACGTATTAACATTTACACTGACAGCAGGTACGCCTTTGCTACTGTGCATGTACATGGAGCCATCTACCAGGAANGCGGGCTACTCACCTCAGCAGGAAAGGCTATCAAAAACAAAGAAGAAATTCTAGCCCTGCTTGAAGCCGTGTGGCTCCCTCAGCAGGTAGCTGTGATCCACTGCAAAGGACATCAAAAAGAAAACACGGCCGTTGCCCGTAGTAACCAGAAAGCTGATTCAGCAGCTCAGGTCGCAGCGNGACTTTCAGTCACGCCTCTAAACTTGCTGCCCACAGTCTCCTTTCCACAGCCAGATCTGCCTGACAATCCCGTATACTCAACAAAANAAAAAAAACTGGCTTCAGATCTCAGAGCCAATAAAAATCAGGAAAGTTAGTAGATTCTTCCTGACTCTAGAATCTTCATACCCCGAACTCTTAAAGAAACTTTAATC
AGTCACCTACAGTCTACCACCCATTTAAGAAGAGCAAAGCTACCTCAGCTCCTCCGGAGCCATTTTAAGATCCCCCGTCTTCAAAGCCTAACAGATCAAGCAGCTCTCCGGTGCACAACCTGCGCCCAGGTAAATGCCAAGCAAGGTCCTAAACCCAGCCCAGGCCACCGTCTCCGAAAAAACTCGCCAGGAGAAAAGTGGGAAATTGACTTTACAGAAGTAAAACCACACCGGGCTAAGTACAAATACCTTCTAGTACTAGTAGACACCTTCTCCGGATGGACTGAGGCATTTGCTACCGAAAACGAAACCGCCAACACGGTAGTTAAGTTTTTACTCAATGAAATCATCCCTCGATATAGGCTGCCTGCTGCCATAGGGTCTGATAATGGACCGGCCTTCACCTCGCCCATAGCTCAGTCAGTCAGTAAGGCGTTAAACATTCAACGGAAGCTCCATTGTGCCTATCGACCCCAGAGCTCCGGGCAGGTAGAACGCATGAACCGCACCCTAAAAAACACTCTTACAAAATTAATCTTAAAAACCGGTGNAAATTAGGTAAGTCTCCTTCCTTTAGCCCTACTTAGAGTAAGGTGCACCCCTTACCAGGCTAGGTTCTCACCTTTTGAAATCATGTATAGGAAGGCGCCGCCTATCTTGCCTAAGCTAAGAGATGCCNAATTAGCAGAAATATCACAAGCTAATTTATTACAGTACCTACAGTCTCTCCAACAGGTACAAGATATCATCCTGCCACTTGTTCGAGGAGCCCATCCCAATCCAATTCCTGACCAGACGGGGTCCTGCCATTCGTTCCAGCCAGGAGACCTAGTGTTTGTTAAAAAGTTCCAGAAAGAAGGACTCACTCCTGCTTAGAAAAGACCTCACACCGTCATCCTCACGACGCCAACGGCTCTGAAGGTGGACGGCATTCCTGCTTAGATTCATCACTCCCGCATCAAAAAGGCCAACAGAGCCCAACTAAAAACATAGGTCCCCAGGCCTAGGTCAGGCCCCTTAAAACTGCGCCTAAGTCAGGTGAAGCCATTAGATTNATTCTTTTTATCTACCTCACTTGTTTGTTTTTGCCCGTTACGTCCTCTGTGCCTTCCTACTCCTTTCTCCTCACCTCTTTCACAACAGGACGTGTATTTGCAAACACCACTTGGAAGGCCGGTACCTCCAAGGAAGTCTCCTTTGCAGTTGATTTATGTGTACTGTTCCCAAAGCCAGCCCGTACCCACGAAGAGCAACACAATCTGCCAGTCCCAGGAGCAGGAAGTGTCGACCTTGCAGCAAGATTCGGACACTCCGGGAGCCAAACTAGATGTGGAAGCTCCAAAGGTGCAGAAAAAGGACTCCAAAATGTTGACTTTTACCTCTGTCCTAGAAATCACCCTGACGCTAGCTGTCGAGATACTTATCAGTTTTTCTGCCCTGATTAGACATGTGTAACTTTAGCCACCTACTCTAAGAGATCAACCAGATCTTCAACTCTTTCCATAAGTCGTGCTTCTCATCCTAAATTATGTACTAGAAAAAATTGTAATCCTCTTACTATAACTGTCCATGACCCTAATTCAACTCAATAGTATCATGGCATGTCATGAAGATTAAGATTTTATATCCCAGGATTTGATGTTAGGACTATGTTCACCATCCAAAANAAAACCCTGGTCTCATGGAGCCCACCCAAGCCAATCGGGCCTTTAACTGATCTAGGTGACCCTATGTTCCAGAAACACCCTGACAAAGTTGATTTAACTGTTCCTCCACCATTCTTAGTTCCTAAGCCCCAGCTACAANGACANCATCTTCAACCCAGCCTGATGTCTATACTAGGTGGAGTACATCATCTCCTTAACCTCACCCAGCCTAAACTAGCCCAAGATTGTTGGCTATGTTTAAAAGCAAAACCCCCTTATTATGTAGGATTAGGAGTAGAAGCCACACTTAAANGTGGCCCTCTATCCTGTCATACACGACCCCGTGCTCTCACA
CTAGGAGATGTGTCTGGAAACGCTTCCTGTCTGATTAGTACCGGGTATAACTTATCTGCTTCTCCTTTTCAGGCTATTTGTAATCAGTCCCTGCTTACTTCCATAAGCACCTCAGTCTCTTACCAAGCGCCTAACAATACCTGGTTGGCCTGCACCTCAGGTCTCACTCGCTGCATTAATGGAACTGAACCAGGACCTCTCTTGTGCGTGTTAGTTCATGTNCTTCCCCAGGTATACGTGTACAGTGGACCAGAAGGACAACTCCTCATCGCTCCCCCGGAATTACATCCCAGGTTGCGCCGAGCTGCCCCACTNCTGGTTCCCCTCTTGGCCGGTCTTAGCATAGCTGGATCAGCAGCCATTGGTACGGCTGCCCTGGTTCAAGGAGAAACTGGACTAATGTCCCTGTCTCAACAGGTGGATGCTGATTTAAGTAACCTCCAGTCTGCCATAGATATACTACATTCCCAGGTAGAGTCTCTGGCTGAAGTAGTNCTTCAAAACCGCCGAGGCTTAGATCTGCTATTCCTCTCTCAAGGAGGATTATGCGCAGCTCTAGGAGAAAGCTGTTGCTTCTACGCCAATCAATCTGGAGTCATAAAAGATACACTCCAAAAAGTGCGAGAAAATCTAGATAGGCGCCAACAAGAACGAGAAAATAACATCCCCTGGTATCAAAGCATGTTCAACTGGAACCCATGGCTAACTACTCTAATCACTAAGTTAGCCGGACCCCTCCCCATCCTACTATTAAGTCTAATTTTTGGGCCTTGTATATTAAATTAGTTTCTTAATTTTGTAAAACAACGCATAGCTTCTGTCAAACTTATGTATCTTAAGACTCAATATAACCCCCTTGTTATAACTGAGGAATCAACGATTTGATTCCCCAAAAACACAAGTGGGGAAATGAAATGCCTAACGTTGTTTTTACTCTAACTNGTTACTTTGAATTTTGTCCTGCTTGTCTCTTTAATC
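The Tau index listed in the basic information is not defined on this page. Assuming it is the commonly used tissue-specificity index (0 for uniform expression across tissues, 1 for expression confined to a single tissue), a minimal sketch of the calculation could look like the following. The function name and the expression values are hypothetical and are not taken from this entry.

```python
def tau_index(expression):
    """Tissue-specificity index over per-tissue expression values.

    Assumed definition: tau = sum(1 - x_i / x_max) / (n - 1),
    where x_i are non-negative expression levels in n tissues.
    """
    n = len(expression)
    if n < 2:
        raise ValueError("need at least two tissues")
    x_max = max(expression)
    if x_max <= 0:
        raise ValueError("at least one tissue must have positive expression")
    return sum(1 - x / x_max for x in expression) / (n - 1)


# Made-up profiles, for illustration only.
single_tissue = tau_index([10.0, 0.0, 0.0, 0.0])   # expressed in one tissue
uniform = tau_index([5.0, 5.0, 5.0])               # same level everywhere
intermediate = tau_index([10.0, 5.0, 0.0])
print(single_tissue, uniform, intermediate)
```

Under this assumed definition, a value such as 0.9113 would indicate expression concentrated in a small number of tissues.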



TF motifs of the consensus sequence

FIMO is used to detect transcription factor motifs in the consensus sequence of the TE family.

TE_family TFBS Start End Strand Score Matched sequence
HERVS71 BPC1 460 483 - 34.30 GAAAAAGAGAGAGAGAGAGAGAGA
HERVS71 BPC5 461 490 - 34.06 AGAGATAGAAAAAGAGAGAGAGAGAGAGAG
HERVS71 BPC1 462 485 - 31.97 TAGAAAAAGAGAGAGAGAGAGAGA
HERVS71 BPC1 464 487 - 31.74 GATAGAAAAAGAGAGAGAGAGAGA
HERVS71 BPC6 461 481 + 31.13 CTCTCTCTCTCTCTCTCTTTT
HERVS71 BPC1 468 491 - 30.89 GAGAGATAGAAAAAGAGAGAGAGA
HERVS71 BPC1 466 489 - 29.91 GAGATAGAAAAAGAGAGAGAGAGA
HERVS71 BPC6 463 483 + 28.47 CTCTCTCTCTCTCTCTTTTTC
HERVS71 BPC6 471 491 + 26.38 CTCTCTCTTTTTCTATCTCTC
HERVS71 BPC6 465 485 + 26.12 CTCTCTCTCTCTCTTTTTCTA
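The hits in this table overlap heavily and fall on both strands, so one simple sanity check is to project every matched sequence back onto the plus strand and confirm that overlapping hits agree base by base. The sketch below does this for four rows copied from the table. It assumes that coordinates are 1-based and inclusive and that a minus-strand match is reported as the reverse complement of the consensus; the helper names are hypothetical.

```python
# Four rows copied from the table: (TFBS, start, end, strand, matched sequence).
HITS = [
    ("BPC1", 460, 483, "-", "GAAAAAGAGAGAGAGAGAGAGAGA"),
    ("BPC5", 461, 490, "-", "AGAGATAGAAAAAGAGAGAGAGAGAGAGAG"),
    ("BPC6", 461, 481, "+", "CTCTCTCTCTCTCTCTCTTTT"),
    ("BPC6", 471, 491, "+", "CTCTCTCTTTTTCTATCTCTC"),
]

COMPLEMENT = str.maketrans("ACGTN", "TGCAN")


def reverse_complement(seq):
    """Return the reverse complement of a DNA string (N stays N)."""
    return seq.translate(COMPLEMENT)[::-1]


def plus_strand_bases(start, strand, matched):
    """Map 1-based consensus positions to plus-strand bases for one hit."""
    seq = matched if strand == "+" else reverse_complement(matched)
    return {start + i: base for i, base in enumerate(seq)}


def find_conflicts(hits):
    """Collect disagreements between hits; also return the merged bases."""
    seen = {}
    conflicts = []
    for tfbs, start, end, strand, matched in hits:
        if len(matched) != end - start + 1:
            conflicts.append((tfbs, start, end, "length mismatch"))
            continue
        for pos, base in plus_strand_bases(start, strand, matched).items():
            # setdefault stores the first base seen at a position
            if seen.setdefault(pos, base) != base:
                conflicts.append((tfbs, pos, seen[pos], base))
    return conflicts, seen


conflicts, bases = find_conflicts(HITS)
# Plus-strand sequence of the region covered by the selected hits.
region = "".join(bases[pos] for pos in sorted(bases))
print(conflicts)
print(region)
```

An empty conflict list means the selected hits all describe the same stretch of the consensus, here the CT-rich repeat around positions 460 to 491.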


TFBS enrichment in GRCh38

Fisher's exact test is used to test for enrichment of transcription factor binding sites in copies of this TE family in GRCh38.
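As an illustration of how such an enrichment test works, the sketch below computes a one-sided Fisher's exact p-value from a 2x2 contingency table using only the standard library. The layout of the table (family copies versus background regions, overlapping versus not overlapping a binding site) is an assumption about how the counts might be arranged, and the counts themselves are made up; neither comes from this page.

```python
from math import comb


def fisher_exact_greater(a, b, c, d):
    """One-sided Fisher's exact test for the 2x2 table [[a, b], [c, d]].

    Assumed layout (hypothetical):
      a = family copies overlapping a binding site
      b = family copies not overlapping
      c = background regions overlapping
      d = background regions not overlapping
    Returns P(X >= a) under the hypergeometric null, i.e. the probability
    of seeing at least this many overlapping family copies by chance.
    """
    row1 = a + b          # total family copies
    col1 = a + c          # total overlapping regions
    n = a + b + c + d     # all regions
    upper = min(row1, col1)
    favourable = sum(
        comb(row1, k) * comb(n - row1, col1 - k) for k in range(a, upper + 1)
    )
    return favourable / comb(n, col1)


# Made-up counts, for illustration only.
p_value = fisher_exact_greater(3, 1, 1, 3)
print(p_value)
```

A small p-value would suggest that binding sites overlap copies of the family more often than expected from the background; in practice the p-values for many transcription factors would also need multiple-testing correction.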




GTEx

Promoter activity across 46 body sites from the Genotype-Tissue Expression (GTEx) project.




TCGA

Promoter activity across 33 cancer types from The Cancer Genome Atlas (TCGA).