HERVH

Basic information Differential Expression Stage analysis Survival analysis Correlation analysis

DF ID DF0000183
TE superfamily ERV1
TE class LTR
Species Catarrhini
Length 7713
Kimura value 5.61
Tau index 0.8930
Description Internal region of ERV1 endogenous retrovirus, HERVH subfamily
Comment The long terminal repeat associated with HERVH is LTR7. The putative env gene, consisting of about 1800 base pairs, has two open reading frames interrupted by a termination codon. The amino acid sequence of this region shows significant homology to those of other retroviral envelope proteins and contains eight potential glycosylation sites. It is estimated that there are about 100 copies of RTVL-H elements containing the env gene per haploid human genome. Note that this estimate is distinct from simply tallying all hits (long and short) from this model.
Sequence
TTTGGTGCCGTGACTCGGATCGGGGGACCTCCCTTGGGAGATCAATCCCCTGTCCTCCTGCTCTTTGCTCCGTGAGAAAGATCCACCTACGACCTCAGGTCCTCAGACCGACCAGCCCAAGGAACATCTCACCAATTTCAAATCCGGTAAGCGGCCTCTTTTTACTCTCTTCTCCAACCTCCCTCACTATCCCTCAACCTCTTTCTCCTTTCAATCTTGGCGCCACACTTCAATCTCTCCCTTCTCTTAATTTCAATTCCTTTCATTTTCTGGTAGAGACAAAGGAGACACGTTTTATCCGTGGACCCAAAACTCCGGCGCCGGTCACGGACTGGGAAGGCAGCCTTCCCTTGGTGTTTAATCATTGCAGGGACGCCTCTCTGATTATTCACCCACGTTTCAGAGGTGTCAGACCACGCAGGGACGCCTGCCTTGGTCCTTCACCCTTAGCGGCAAGTCCCGCTTTTCTGGGGGAGGGGCAAGTACCCCAACCCCTTCTCTCCGTGTCTCTACCCCTTCTCCGCTTTTCTGGGGNAGGGGCAAGNACCCCTCAACCCCTTCTCCTTCACCCTTAGCGGCAAGTCCCGCTTTTCTAGGGGGGCAAGAACCCCCAACCCCTTATTTCCGCGCCCCGACCTCTTATCTCTGCGCCCCAATCCCTTATTTCCGCGCCCCGACCTCTTATCTCTGCGCCCCGATCCCTTATTTCCGCGCCCCGACCTCTTATCTCTGCGCCCCAACCCCTTATTTCCGTGCCCCGACCCCTTTCCCGCTTTTCTGGAGGGCAAGAACCCCCGAACCCCTTCCCTCCGTGTCTCTACTCTCTCTTTTCTCTGGGCTTGCCTCCTTCACTATGGGCAACCTTCCACCCTCCATTCCTCCTTCTCCCTTAGCCTGTGTTCTCAAGAACTTAAAACCTCTTCAACTCACACCTGACCTAAAACCTAAACGCCTTATTTTCTTCTGCAATGCCGCTTGACCCCAATACAAACTCGACAGTAGTTCCAAATAGCCAGAAAACGGCACTTTCAATTTTTCCATCCTACAAGATCTAAATAATTCTTGTCGTAAAATGGGCAAACGGTCTGAGGTGCCTGACGTCCAGGCATTCTTTTACACATCGGTCCCTCCCTAGTCTCTGTNCCCAGTGCAACTCGTCCCAAATCTTCCTTCTTTCCCTCCCGCCTGTCCCCTCAGTCCCAACCCCAAGCGTCGCTGAGTCTTTCTAATCTTCCTTTTCTACAGACCCATCTGACCTCTCCCCTCCTCGCCAGGCCGAGCTAGGTCCCAATTCTTCCTCAGCCTCCGCTCCTCCACCCTATAATCCTTTTATCACCTCCCCTCCTCACACCCGGTCCGGCTTACAGTTTCGTTCCGTGACTAGCCCTCCCCCACCTGCCCAGCAATTTACTCTTAAAAAGGTGGCTGGAGCTAAAGGCATAGTCAAGGTTAATGCTCCTTTTTCTTTATCCGNNNNNTCCCAAATCAGATAGCGTTTAGGCTCTTTTTCATCAAATATAAAAANCCCAGCCCAGTTCATGGCTCGTTTGGCAGCAACCCTGAGACGCTTTACAGCCCTAGACCCTAAAAGGTCAAAAGGCCGTCTTATTCTCAATATACATTTTATTACCCAATCTGCTCCCGACATTAAATAAAACTCCAAAAATTAAATTCCGGCCCTCAAACCCCACAACAGGACTTAATTAACCTCGCCTTCAAGGTGTACAATAATAGAAAAAAGTTGCAATTCCTTGCCTCCACTGTGAGACAAACCCCAGCCACATCTCCAGCACACAAGAACTTCCAAACGCCTGAACCGCAGCGGCCAGGCGTTCCTCCAGAACCTCCTCCCCCAGGAGCTTGCTACAAGTGCCGGAAATCTGGCCACCGGGCCAAGGAATGCCCGCAGCCCGGGATTCCTCCTAAGCCGCGTCCCATCTGTGCGGGACCCCACTGGAAATCGGACTGTTCAACTCACCTGGCAGCCACTCCCAGAGCCCCTGGAACTCTGGCCCAAGGCTCTCTGACTGACTCCTTCCCAGATCTTCTCGGCTTAGCGGCTGAAGACTGACGCTGCCCGATCGCCTCGGAAGCCCCCTGGACCATCACGGACGCTTCGGGTAACTCTCACAGTGGAGGGTAAGTCCGTCCCCTTCTTAATCAATACGGAGGCTACCCACTCCACATTACCTTCTTTTCAAGGGCCTGTTTCCCTTGCCTCCATAACTGTTGTGGGTATTGACGGCCAGGCTTCTAAACCTCTTAAAACTCCCCAACTCTGGTGCCAACTTAGACAATACTCTTTTAAGCACTCCTTTTTAGTTATCCCCACCTGCCCAGTTCCCTTATTAGGCCGAGACACTTTAACTAAATTATCTGCTTCCCTGACTATTCCTGGACTACAGCCACATCTCATTGCCGCCCTTCTCCCCAATCCAAAGCCTCCTTCGCGTCCTCTTGTATCCCCCCACCTTAACCCACAAGTATAAGATACCTCTACTCCCTCCTTGGCGACCGATCATGCACCCCTTACCATCTCATTAAAACCTAATCACCCTTACCCCGCTCAACGCCAATATCCCATCCCACAGCACGCTTTAAAAGGATTAAAGCCTGTTATCACTCGCCTGCTACAGCATGGCCTTTTAAAGCCTATAAACTCTCCTTACAATTCCCCCATTTTACCTGTCCTAAAACCAGACAAGCCTTACAGGTTCAGGATCTGCGCCTTATCAACCAAATTGTTTTGCCTATCCACCCCGTGGTGCCAAACCCATATACTCTCCTATCCTCAATACCTCCCTCCACAACCCATTATTCTGTTCTGGATCTCAAACATGCTTTCTTTACTATTCCTTTGCACCCTTCATCCCAGCCTCTCTTCGCTTTCACTTGGACTGACCCTGACACCCATCAGGCTCAGCAAATTACCTGGGCTGTACTGCCGCAAGGCTTCACAGACAGCCCCCATTACTTCAGTCAAGCCCTTCCTCATGATTTACTTTCTTTCCACCCCTCCGCTTCTCACCTTATTCAATATATTGATGACCTTCTNCTTTGTAGCCCCTCCTTTGAATCTTCTCAACAAGACACNCTNCTGCTCCTTCANCATTTATTCTCCAAAGGATATCCCCCTCCAAAGCCCAAATTTCTTCCTCATCTGTTACCTATCTCGGCATAATTCTTCATAAAAACACACGTGCTCTCCCTGCCGATCGTGTCCGACTGATCTCTCAAACCCCAACCCCTTCTACAAAACAACAACTCCTTTCCTTCCTAGGCATGGTTGGATACTTTCGCCTTTGGATACCTGGTTTTGCCATCCTAACAAAACCATTATATAAACTCACAAAAGGAAACCTAGCTGACCCCATAGATCCTAAATCCTTTCCCCACTCCTCTTTCCGTTCCTTGAAGACAGCTTTAGAGACTGCCCCCACCCTAGCTCTCCCTGACTCATCCCAACCCTTTTCATTACACACAGCCGAAGTGCAGGGCTGTGCAGTCGGAATTCTTACACAAGGACCGGGACCGCGCCCTGTAGCCTTTTTGTCCAAACAACTTGACCTTACTGTTTTAGGCTGGCCATCATGTCTCCGTGCGGCGGCTGCCGCCGCCCTAATACTTTTAGAGGCCCTCAAAATCACAAACTATGCTCAACTCACTCTCTACAGTTCTCATAACTTCCAAAATCTATTTTCTTCCTCACACCTGACGCATATACTTTCTGCTCCCCGGCTCCTTCAGCTGTACTCACTCTGTTGAGTCTCCCACAATTACCATTGTTCCTGGCCCGGACTTCAATCCGGCCTCCCACATTATTCCTGATACCACACCTGACCCCCATGACTGTATCTCTCTGATCCACCTGACATTCACCCCATTTCCCCATATTTCCTTCTTTCCTGTTCCTCACCCTGATCACACTTGGTTTATTGATGGCAGTTCCACCAGGCCTAATCGCCACACACCAGCAAAGGCAGGCTATGCTATGAACTCGTTGCCTTAACTCGAGCCCTCACTCTTGCAAAGGGACTACGCGTCAATATTTATACTGACTCTAAATATGCCTTCCATATCCTGCACCACCATGCTGTTATATGGGCTGAAAGAGGTTTCCTCACTACGCAAGGGTCCTCCATCATTAATGCCTCTTTAAAAAAACTCTTCTCAAGGCCGCTTTACTTCCAAAGGAAGCTGGAGTCATTCACTGCAAGGGCCATCAAAAGGCATCAGATCCCATCGCTCAGGGCAACGCTTATGCTGATAAGGTAGCTAAAGAAGCAGCTAGCGTTCCAACTTCTGTCCCTCACGGCCAGTTTTTCTCCTTCTCATCGGTCACTCCCACCTACTCCCCCGCTGAAACTTCCACCTATCAATCTCTTCCCACACAAGGCAAATGGTTCTTGGACCAAGGAAAATATCTCCTTCCAGCCTCACAGGCCCATTCTATTCTGTCGTCATTTCATAACCTCTTCCATGTAGGTTACAAGCCGCTAGCCCGCCTCTTAGAACCTCTCATTTCCTTTCCATCGTGGAAATCTATCCTCAAGGAAATCACTTCTCAGTGTTCCATCTGCTATTCTACTACTCCTCAGGGATTGTTCAGGCCCCCTCCCTTCCCTACACATCAAGCTCGGGGATTTGCCCCCGCCCAGGACTGGCAAATTGACTTTACTCACATGCCCCGAGTCAGGAAACTAAAATACCTCTTGGTCTGGGTAGACACTTTCACTGGATGGGTAGAGGCCTTTCCCACAGGGTCTGAGAAGGCCACCGCGGTCATTTCTTCCCTTCTGTCAGACATAATTCCTCGGTTTGGCCTTCCCACCTCTATACAGTCCGATAACGGACCGGCCTTTATTAGTCAAATCAGCCAAGCAGTTTCTCAGGCTCTTGGTATTCAGTGAAACCTTTATATCCCTTACGGTCCTCAGTCTTCAGGAAAGGTAGAACGGACTAATGGTCTTTTAAAAACACACCTCACCAAGCTCAGCCACCAACTTAAAAAGGACTGGACAATACTTTTACCACTTTCCCTTCTCAGAATTCGGGCCTGTCCTCGGAATGCTACAGGGTACAGCCCATTTGAGCTCCTGTATGGACGCTCCTTTTTATTAGGCCCCAGTCTCATTCCAGACACCAGACCTCTAGGCGACTATCTTCCAGTCCTCCAGCAGGCTAGACAGGAAATTCGCCAGGCTGCTAATCTTCTCTTGCCTACTCCAGATCCCCAGCCATATGAAGACACCCTAGCTGGACGATCAGTTCTTGTTAAGAATCTGACCCCTCAAACTCTACAACCTCGATGGACCGGACCCTACTTAGTCATCTATAGTACCCCGACTGCCGTCCGCCTGCAGGATCCTCCCCACTGGGTTCACCGTTCCAGAATAAAGCTGTGTCCGTCGGACAGCCAGCCTAATCCCTCCTCTTCCTCCTGGAAGTCGCAAGTACTCTCCCCTACTTCCCTTAAACTCACTCGCATTTCTGAAGAACAGTAATAACCCTTATGAGCCTAATACATCCCTTCATTCTATTAGGTCTGTTCGTCCTTACCCTACTTTTTGCAACAGGGCTTTACGNAGTCACCCCCACCACTTGGACCGAGCCCCAAAAAACTTGTCATCCCTACTATCTTCTGTCTAGTCATACTCCTATTCNCCGTTCTCAACTACTCATAAATGCCCTACTCTTGTTTACACTGCCGGTTTACACTGTTTCTCCAAGCCATCACAGCTGATATCTCCTGGTGCTATCCCCAAACCGCCACTCTTAACTCCCTCTTAGAGTGGATAGATGATCTTTGCTGGCAGGGCACCCTCCAATACTTTCACCCTGATGAAGTTCTATTCTTTACTTTTATACTCACTCTTATTCTCATTCCCATTCTTATGCCACCCTCTACCTCTCCCCAGCTATCTCCACCACACTATCAACCTTACCCATTCTCTCCTAGCCGTTTCTAATCCCTCCTTAGCGAACAACTGCTGGCTTTGCATTTCCCTTTCTTCCAGCGCCTACACAGCTGTCCCCGCCTTACANACAGACTGGGCAACATCTCCTGTCTCCCTACACCTCCGAACTTCCTTTAACAGCCCTCACCTTTACCCTCCTGAAGAACTCATTTACTTTCTAGACAGGTCCAGCAAGACCTCCCCAGACATTTCACATCAGCAAGCTGCCGCCCTCCTCCGCACTTACTTAAAAAACCTTTCTCCTTATATCAACTCTACTCCCCCCATATTTGGACCTCTCACAACACAAACTACTATTCCTGTGGCCGCTCCTTTATGTATCTCTCGGCAAAGACCCACTGGAATTCCCCTAGGTAACCTTTCACCTTCTCGATGTTCCTTTACTCTTCATCTCCGAAGCCCAACTACACACATCACTGAAACAATTGGAGCCTTCCAGCTCCATATTACAGACAAGCCCTCTATCAATACTGGCAAACTTAAAAACATTAGCAGTAATTATTGCTTAGGAAGACACTTACCCTGTATTTCACTCCATCCTTGGCTACCTTCCCCTTGCTCGTCAGACTCTCCTCCCAGGCCCTCTTCTTGTTTACTTATACCCAGCCCCGAAAATAACAGTGAAAGGTTGCTCGTAGATACTCAACGTTTTCTCATACACCATGAAAATCGAACCTCCCCCTCTACGCAGTTACCCCATCAGTCCCCATTACAACCTCTGACGGCTGCCGCCCTAGCTGGATCCCTAGGAGTCTGGGTACAAGACACCCCTTTCAGCACTCCTTCTCATCTTTTTACTTTGCATCTCCAGTTTTGCCTCGCACAAGGTCTCTTCTTCCTCTGTGGATCCTCTACCTACATGTGTCTACCTGCTAATTGGACAGGCACATGCACACTAGTTTTCCTTACCCCCAAAATTCAATTTGCAAATGGGACCGAAGAGCTCCCTGTTCCCCTCATGACACCGACACGACAAAAAAGAGTTATTCCACTAATTCCCTTGCTNGTCGGTTTAGGACTTTCTGCCTCCACTATTGCTCTCGGTACTGGAATAGCAGGCATTTCAACCTCTGTCACGACCTTCCGTAGCCTCTCTAATGACTTCTCTGCTAGCATCACAGACATATCACAAACTTTATCAGTCCTCCAGGCCCAAGTTGACTCTTTAGCTGCAGTTGTCCTCCAAAACCGCCGAGGCCTCGACTTACTCACTGCTGAAAAAGGAGGACTCTGTATATTCTTAAATGAAGAGTGTTGTTTTTACCTAAATCAATCTGGCCTGGTGTATGACAACATAAAAAAACTCAAGGATAGAGCCCAAAAACTCGCCAACCAAGCAAGTAATTACGCTGAACCCCCTTGGGCACTCTCTAATTGGATGTCCTGGGTCCTCCCAATTCTTAGTCCTTTAATACCCGTTTTTCTCCTTCTCTTATTCGGACCTTGTGTCTTCCGTTTAGTTTCTCAATTCATNCAAAACCGTATCCAGGCCATCACCAATCATTCTATACGACAAATGCTCCTTCTAACAACCCCACAATATCACCCCTTACCACAAAATCTTCCTTCAGCTTAATCTCTCCCACTCTAGGTTCCCACGCCGCCCCTAATCCCGCTCGAAGCAGCCCTGAGAAACATCGCCCATTATCTCTCNNCATACCACCCCCCAAAAATTTTCGCCGCCCCAACACTTCANCACTATTTTATTTTTCTTATTAATATAAGAAGACAGGAA



TF motifs of the concenus sequence

Use FIMO to detect transcription factor motifs in the concenus sequence of the TE family.

TE_family TFBS Start End Strand Score Matched sequence
HERVH Ebf2 30 38 - 16.79 CCCAAGGGA
HERVH eor-1 7399 7411 - 16.71 AGAGAAGGAGAAA
HERVH PPARD 6333 6346 - 16.61 AAGGTGAAAGGTTA
HERVH EBF3 30 38 - 16.59 CCCAAGGGA
HERVH RVE4 4418 4426 - 16.53 AGATATTTT
HERVH BAM8 3195 3203 + 16.46 CACACGTGC
HERVH Zm00001d020595 3609 3618 - 16.42 GGCGGCGGCA
HERVH RVE7 4418 4426 + 16.38 AAAATATCT
HERVH Zfx 7169 7178 + 16.36 GCCGAGGCCT
HERVH ERF104 3601 3616 - 16.29 CGGCGGCAGCCGCCGC


TFBS enrichment in GRCh38

Use Fisher's exact test to perform enrichment analysis of transcription factor binding sites in the TE family of GRCh38.




GTEx

The promoter activity across 46 body sites from The Genotype-Tissue Expression (GTEx) project.




TCGA

The promoter activity across 33 cancer types from The Cancer Genome Atlas (TCGA).