HERVH

Basic information Differential Expression Stage analysis Survival analysis Correlation analysis

DF ID DF0000183
TE superfamily ERV1
TE class LTR
Species Catarrhini
Length 7713
Kimura value 5.61
Tau index 0.8930
Description Internal region of ERV1 endogenous retrovirus, HERVH subfamily
Comment The long terminal repeat associated with HERVH is LTR7. The putative env gene, consisting of about 1800 base pairs, has two open reading frames interrupted by a termination codon. The amino acid sequence of this region shows significant homology to those of other retroviral envelope proteins and contains eight potential glycosylation sites. It is estimated that there are about 100 copies of RTVL-H elements containing the env gene per haploid human genome. Note that this estimate is distinct from simply tallying all hits (long and short) from this model.
Sequence
TTTGGTGCCGTGACTCGGATCGGGGGACCTCCCTTGGGAGATCAATCCCCTGTCCTCCTGCTCTTTGCTCCGTGAGAAAGATCCACCTACGACCTCAGGTCCTCAGACCGACCAGCCCAAGGAACATCTCACCAATTTCAAATCCGGTAAGCGGCCTCTTTTTACTCTCTTCTCCAACCTCCCTCACTATCCCTCAACCTCTTTCTCCTTTCAATCTTGGCGCCACACTTCAATCTCTCCCTTCTCTTAATTTCAATTCCTTTCATTTTCTGGTAGAGACAAAGGAGACACGTTTTATCCGTGGACCCAAAACTCCGGCGCCGGTCACGGACTGGGAAGGCAGCCTTCCCTTGGTGTTTAATCATTGCAGGGACGCCTCTCTGATTATTCACCCACGTTTCAGAGGTGTCAGACCACGCAGGGACGCCTGCCTTGGTCCTTCACCCTTAGCGGCAAGTCCCGCTTTTCTGGGGGAGGGGCAAGTACCCCAACCCCTTCTCTCCGTGTCTCTACCCCTTCTCCGCTTTTCTGGGGNAGGGGCAAGNACCCCTCAACCCCTTCTCCTTCACCCTTAGCGGCAAGTCCCGCTTTTCTAGGGGGGCAAGAACCCCCAACCCCTTATTTCCGCGCCCCGACCTCTTATCTCTGCGCCCCAATCCCTTATTTCCGCGCCCCGACCTCTTATCTCTGCGCCCCGATCCCTTATTTCCGCGCCCCGACCTCTTATCTCTGCGCCCCAACCCCTTATTTCCGTGCCCCGACCCCTTTCCCGCTTTTCTGGAGGGCAAGAACCCCCGAACCCCTTCCCTCCGTGTCTCTACTCTCTCTTTTCTCTGGGCTTGCCTCCTTCACTATGGGCAACCTTCCACCCTCCATTCCTCCTTCTCCCTTAGCCTGTGTTCTCAAGAACTTAAAACCTCTTCAACTCACACCTGACCTAAAACCTAAACGCCTTATTTTCTTCTGCAATGCCGCTTGACCCCAATACAAACTCGACAGTAGTTCCAAATAGCCAGAAAACGGCACTTTCAATTTTTCCATCCTACAAGATCTAAATAATTCTTGTCGTAAAATGGGCAAACGGTCTGAGGTGCCTGACGTCCAGGCATTCTTTTACACATCGGTCCCTCCCTAGTCTCTGTNCCCAGTGCAACTCGTCCCAAATCTTCCTTCTTTCCCTCCCGCCTGTCCCCTCAGTCCCAACCCCAAGCGTCGCTGAGTCTTTCTAATCTTCCTTTTCTACAGACCCATCTGACCTCTCCCCTCCTCGCCAGGCCGAGCTAGGTCCCAATTCTTCCTCAGCCTCCGCTCCTCCACCCTATAATCCTTTTATCACCTCCCCTCCTCACACCCGGTCCGGCTTACAGTTTCGTTCCGTGACTAGCCCTCCCCCACCTGCCCAGCAATTTACTCTTAAAAAGGTGGCTGGAGCTAAAGGCATAGTCAAGGTTAATGCTCCTTTTTCTTTATCCGNNNNNTCCCAAATCAGATAGCGTTTAGGCTCTTTTTCATCAAATATAAAAANCCCAGCCCAGTTCATGGCTCGTTTGGCAGCAACCCTGAGACGCTTTACAGCCCTAGACCCTAAAAGGTCAAAAGGCCGTCTTATTCTCAATATACATTTTATTACCCAATCTGCTCCCGACATTAAATAAAACTCCAAAAATTAAATTCCGGCCCTCAAACCCCACAACAGGACTTAATTAACCTCGCCTTCAAGGTGTACAATAATAGAAAAAAGTTGCAATTCCTTGCCTCCACTGTGAGACAAACCCCAGCCACATCTCCAGCACACAAGAACTTCCAAACGCCTGAACCGCAGCGGCCAGGCGTTCCTCCAGAACCTCCTCCCCCAGGAGCTTGCTACAAGTGCCGGAAATCTGGCCACCGGGCCAAGGAATGCCCGCAGCCCGGGATTCCTCCTAAGCCGCGTCCCATCTGTGCGGGACCCCACTGGAAATCGGACTGTTCAACTCACCTGGCAGCCACTCCCAGAGCCCCTGGAACTCTGGCCCAAGGCTCTCTGACTGACTCCTTCCCAGATCTTCTCGGCTTAGCGGCTGAAGACTGACGCTGCCCGATCGCCTCGGAAGCCCCCTGGACCATCACGGACGCTTCGGGTAACTCTCACAGTGGAGGGTAAGTCCGTCCCCTTCTTAATCAATACGGAGGCTACCCACTCCACATTACCTTCTTTTCAAGGGCCTGTTTCCCTTGCCTCCATAACTGTTGTGGGTATTGACGGCCAGGCTTCTAAACCTCTTAAAACTCCCCAACTCTGGTGCCAACTTAGACAATACTCTTTTAAGCACTCCTTTTTAGTTATCCCCACCTGCCCAGTTCCCTTATTAGGCCGAGACACTTTAACTAAATTATCTGCTTCCCTGACTATTCCTGGACTACAGCCACATCTCATTGCCGCCCTTCTCCCCAATCCAAAGCCTCCTTCGCGTCCTCTTGTATCCCCCCACCTTAACCCACAAGTATAAGATACCTCTACTCCCTCCTTGGCGACCGATCATGCACCCCTTACCATCTCATTAAAACCTAATCACCCTTACCCCGCTCAACGCCAATATCCCATCCCACAGCACGCTTTAAAAGGATTAAAGCCTGTTATCACTCGCCTGCTACAGCATGGCCTTTTAAAGCCTATAAACTCTCCTTACAATTCCCCCATTTTACCTGTCCTAAAACCAGACAAGCCTTACAGGTTCAGGATCTGCGCCTTATCAACCAAATTGTTTTGCCTATCCACCCCGTGGTGCCAAACCCATATACTCTCCTATCCTCAATACCTCCCTCCACAACCCATTATTCTGTTCTGGATCTCAAACATGCTTTCTTTACTATTCCTTTGCACCCTTCATCCCAGCCTCTCTTCGCTTTCACTTGGACTGACCCTGACACCCATCAGGCTCAGCAAATTACCTGGGCTGTACTGCCGCAAGGCTTCACAGACAGCCCCCATTACTTCAGTCAAGCCCTTCCTCATGATTTACTTTCTTTCCACCCCTCCGCTTCTCACCTTATTCAATATATTGATGACCTTCTNCTTTGTAGCCCCTCCTTTGAATCTTCTCAACAAGACACNCTNCTGCTCCTTCANCATTTATTCTCCAAAGGATATCCCCCTCCAAAGCCCAAATTTCTTCCTCATCTGTTACCTATCTCGGCATAATTCTTCATAAAAACACACGTGCTCTCCCTGCCGATCGTGTCCGACTGATCTCTCAAACCCCAACCCCTTCTACAAAACAACAACTCCTTTCCTTCCTAGGCATGGTTGGATACTTTCGCCTTTGGATACCTGGTTTTGCCATCCTAACAAAACCATTATATAAACTCACAAAAGGAAACCTAGCTGACCCCATAGATCCTAAATCCTTTCCCCACTCCTCTTTCCGTTCCTTGAAGACAGCTTTAGAGACTGCCCCCACCCTAGCTCTCCCTGACTCATCCCAACCCTTTTCATTACACACAGCCGAAGTGCAGGGCTGTGCAGTCGGAATTCTTACACAAGGACCGGGACCGCGCCCTGTAGCCTTTTTGTCCAAACAACTTGACCTTACTGTTTTAGGCTGGCCATCATGTCTCCGTGCGGCGGCTGCCGCCGCCCTAATACTTTTAGAGGCCCTCAAAATCACAAACTATGCTCAACTCACTCTCTACAGTTCTCATAACTTCCAAAATCTATTTTCTTCCTCACACCTGACGCATATACTTTCTGCTCCCCGGCTCCTTCAGCTGTACTCACTCTGTTGAGTCTCCCACAATTACCATTGTTCCTGGCCCGGACTTCAATCCGGCCTCCCACATTATTCCTGATACCACACCTGACCCCCATGACTGTATCTCTCTGATCCACCTGACATTCACCCCATTTCCCCATATTTCCTTCTTTCCTGTTCCTCACCCTGATCACACTTGGTTTATTGATGGCAGTTCCACCAGGCCTAATCGCCACACACCAGCAAAGGCAGGCTATGCTATGAACTCGTTGCCTTAACTCGAGCCCTCACTCTTGCAAAGGGACTACGCGTCAATATTTATACTGACTCTAAATATGCCTTCCATATCCTGCACCACCATGCTGTTATATGGGCTGAAAGAGGTTTCCTCACTACGCAAGGGTCCTCCATCATTAATGCCTCTTTAAAAAAACTCTTCTCAAGGCCGCTTTACTTCCAAAGGAAGCTGGAGTCATTCACTGCAAGGGCCATCAAAAGGCATCAGATCCCATCGCTCAGGGCAACGCTTATGCTGATAAGGTAGCTAAAGAAGCAGCTAGCGTTCCAACTTCTGTCCCTCACGGCCAGTTTTTCTCCTTCTCATCGGTCACTCCCACCTACTCCCCCGCTGAAACTTCCACCTATCAATCTCTTCCCACACAAGGCAAATGGTTCTTGGACCAAGGAAAATATCTCCTTCCAGCCTCACAGGCCCATTCTATTCTGTCGTCATTTCATAACCTCTTCCATGTAGGTTACAAGCCGCTAGCCCGCCTCTTAGAACCTCTCATTTCCTTTCCATCGTGGAAATCTATCCTCAAGGAAATCACTTCTCAGTGTTCCATCTGCTATTCTACTACTCCTCAGGGATTGTTCAGGCCCCCTCCCTTCCCTACACATCAAGCTCGGGGATTTGCCCCCGCCCAGGACTGGCAAATTGACTTTACTCACATGCCCCGAGTCAGGAAACTAAAATACCTCTTGGTCTGGGTAGACACTTTCACTGGATGGGTAGAGGCCTTTCCCACAGGGTCTGAGAAGGCCACCGCGGTCATTTCTTCCCTTCTGTCAGACATAATTCCTCGGTTTGGCCTTCCCACCTCTATACAGTCCGATAACGGACCGGCCTTTATTAGTCAAATCAGCCAAGCAGTTTCTCAGGCTCTTGGTATTCAGTGAAACCTTTATATCCCTTACGGTCCTCAGTCTTCAGGAAAGGTAGAACGGACTAATGGTCTTTTAAAAACACACCTCACCAAGCTCAGCCACCAACTTAAAAAGGACTGGACAATACTTTTACCACTTTCCCTTCTCAGAATTCGGGCCTGTCCTCGGAATGCTACAGGGTACAGCCCATTTGAGCTCCTGTATGGACGCTCCTTTTTATTAGGCCCCAGTCTCATTCCAGACACCAGACCTCTAGGCGACTATCTTCCAGTCCTCCAGCAGGCTAGACAGGAAATTCGCCAGGCTGCTAATCTTCTCTTGCCTACTCCAGATCCCCAGCCATATGAAGACACCCTAGCTGGACGATCAGTTCTTGTTAAGAATCTGACCCCTCAAACTCTACAACCTCGATGGACCGGACCCTACTTAGTCATCTATAGTACCCCGACTGCCGTCCGCCTGCAGGATCCTCCCCACTGGGTTCACCGTTCCAGAATAAAGCTGTGTCCGTCGGACAGCCAGCCTAATCCCTCCTCTTCCTCCTGGAAGTCGCAAGTACTCTCCCCTACTTCCCTTAAACTCACTCGCATTTCTGAAGAACAGTAATAACCCTTATGAGCCTAATACATCCCTTCATTCTATTAGGTCTGTTCGTCCTTACCCTACTTTTTGCAACAGGGCTTTACGNAGTCACCCCCACCACTTGGACCGAGCCCCAAAAAACTTGTCATCCCTACTATCTTCTGTCTAGTCATACTCCTATTCNCCGTTCTCAACTACTCATAAATGCCCTACTCTTGTTTACACTGCCGGTTTACACTGTTTCTCCAAGCCATCACAGCTGATATCTCCTGGTGCTATCCCCAAACCGCCACTCTTAACTCCCTCTTAGAGTGGATAGATGATCTTTGCTGGCAGGGCACCCTCCAATACTTTCACCCTGATGAAGTTCTATTCTTTACTTTTATACTCACTCTTATTCTCATTCCCATTCTTATGCCACCCTCTACCTCTCCCCAGCTATCTCCACCACACTATCAACCTTACCCATTCTCTCCTAGCCGTTTCTAATCCCTCCTTAGCGAACAACTGCTGGCTTTGCATTTCCCTTTCTTCCAGCGCCTACACAGCTGTCCCCGCCTTACANACAGACTGGGCAACATCTCCTGTCTCCCTACACCTCCGAACTTCCTTTAACAGCCCTCACCTTTACCCTCCTGAAGAACTCATTTACTTTCTAGACAGGTCCAGCAAGACCTCCCCAGACATTTCACATCAGCAAGCTGCCGCCCTCCTCCGCACTTACTTAAAAAACCTTTCTCCTTATATCAACTCTACTCCCCCCATATTTGGACCTCTCACAACACAAACTACTATTCCTGTGGCCGCTCCTTTATGTATCTCTCGGCAAAGACCCACTGGAATTCCCCTAGGTAACCTTTCACCTTCTCGATGTTCCTTTACTCTTCATCTCCGAAGCCCAACTACACACATCACTGAAACAATTGGAGCCTTCCAGCTCCATATTACAGACAAGCCCTCTATCAATACTGGCAAACTTAAAAACATTAGCAGTAATTATTGCTTAGGAAGACACTTACCCTGTATTTCACTCCATCCTTGGCTACCTTCCCCTTGCTCGTCAGACTCTCCTCCCAGGCCCTCTTCTTGTTTACTTATACCCAGCCCCGAAAATAACAGTGAAAGGTTGCTCGTAGATACTCAACGTTTTCTCATACACCATGAAAATCGAACCTCCCCCTCTACGCAGTTACCCCATCAGTCCCCATTACAACCTCTGACGGCTGCCGCCCTAGCTGGATCCCTAGGAGTCTGGGTACAAGACACCCCTTTCAGCACTCCTTCTCATCTTTTTACTTTGCATCTCCAGTTTTGCCTCGCACAAGGTCTCTTCTTCCTCTGTGGATCCTCTACCTACATGTGTCTACCTGCTAATTGGACAGGCACATGCACACTAGTTTTCCTTACCCCCAAAATTCAATTTGCAAATGGGACCGAAGAGCTCCCTGTTCCCCTCATGACACCGACACGACAAAAAAGAGTTATTCCACTAATTCCCTTGCTNGTCGGTTTAGGACTTTCTGCCTCCACTATTGCTCTCGGTACTGGAATAGCAGGCATTTCAACCTCTGTCACGACCTTCCGTAGCCTCTCTAATGACTTCTCTGCTAGCATCACAGACATATCACAAACTTTATCAGTCCTCCAGGCCCAAGTTGACTCTTTAGCTGCAGTTGTCCTCCAAAACCGCCGAGGCCTCGACTTACTCACTGCTGAAAAAGGAGGACTCTGTATATTCTTAAATGAAGAGTGTTGTTTTTACCTAAATCAATCTGGCCTGGTGTATGACAACATAAAAAAACTCAAGGATAGAGCCCAAAAACTCGCCAACCAAGCAAGTAATTACGCTGAACCCCCTTGGGCACTCTCTAATTGGATGTCCTGGGTCCTCCCAATTCTTAGTCCTTTAATACCCGTTTTTCTCCTTCTCTTATTCGGACCTTGTGTCTTCCGTTTAGTTTCTCAATTCATNCAAAACCGTATCCAGGCCATCACCAATCATTCTATACGACAAATGCTCCTTCTAACAACCCCACAATATCACCCCTTACCACAAAATCTTCCTTCAGCTTAATCTCTCCCACTCTAGGTTCCCACGCCGCCCCTAATCCCGCTCGAAGCAGCCCTGAGAAACATCGCCCATTATCTCTCNNCATACCACCCCCCAAAAATTTTCGCCGCCCCAACACTTCANCACTATTTTATTTTTCTTATTAATATAAGAAGACAGGAA



TF motifs of the concenus sequence

Use FIMO to detect transcription factor motifs in the concenus sequence of the TE family.

TE_family TFBS Start End Strand Score Matched sequence
HERVH ZNF530 4609 4622 - 17.55 GAAGGGAGGGGGCC
HERVH E2FC 215 229 - 17.48 AGTGTGGCGCCAAGA
HERVH Su(H) 7569 7578 - 17.17 CGTGGGAACC
HERVH ABF1 7478 7491 - 17.11 TCGTATAGAATGAT
HERVH Zm00001d020267 3609 3618 + 17.04 TGCCGCCGCC
HERVH ZNF257 2212 2221 - 16.99 GAGGCAAGGG
HERVH BAD1 625 636 + 16.98 CCGCGCCCCGAC
HERVH BAD1 667 678 + 16.98 CCGCGCCCCGAC
HERVH BAD1 709 720 + 16.98 CCGCGCCCCGAC
HERVH Pparg::Rxra 1253 1265 - 16.92 AGGGGAGAGGTCA


TFBS enrichment in GRCh38

Use Fisher's exact test to perform enrichment analysis of transcription factor binding sites in the TE family of GRCh38.




GTEx

The promoter activity across 46 body sites from The Genotype-Tissue Expression (GTEx) project.




TCGA

The promoter activity across 33 cancer types from The Cancer Genome Atlas (TCGA).