HERVE

Basic information Differential Expression Stage analysis Survival analysis Correlation analysis

DF ID DF0000174
TE superfamily ERV1
TE class LTR
Species Catarrhini
Length 7813
Kimura value 6.13
Tau index 0.9707
Description Internal region of an ERV1 endogenous retrovirus, HERVE subfamily
Comment Associated long terminal repeat is LTR2. The gag and pol genes are ~40% similar to Moloney murine leukemia virus (MoMuLV). Some unique characteristics of the endogenous human retroviral DNA included a tRNA Glu primer binding site separated from the 5' LTR by a pentanucleotide and a putative env sequence which does not appear to overlap the C terminus of pol and has virtually no homology with the env gene of known infectious retroviruses. The reconstructed (though still pseudogenic) gene boundaries are: Gag: 571-2094, Pol: 2095-5655, Env: 5678-7750.
Sequence
TTTCTTGGTTCCCTGACCGGGAAGCGAGGTAATTGACGGACGGTCGAGGCAGCCCCTTAGGCGGCTTAGGCCTGCCCTGTGGAGCATCCCTGCGGGGGACTCCGGCCAGCTTGAGCGACGCGGATCCTGAGAGCGCTCCCGGGTAGGCAATTGCCCCGGTGGAACGCCTCGTCAGAGAGTGCGTGGCAGGCCCCCGTGGAGGATCAACGCAGTGGCTGAACACCGGGAAGGAACGGGCACTTGGAGTCCGGACATTTGAAACTTGGTAAGACTGGTCTTTGGAACTTGCCCACTCCATTTGAGTGGAAGCGTGGCCTGATCACCCACGGCGTGCCTGTACCGGCACTTTGGTTTTTGTTTTTGACTTGACTTGAATTGCTTGATACTTTGGTTTTGGTTTGACCTGGCTTGGATTTCTGGATACTCTGATTTTGGTTTTGATTCTGGTTTGGTGAAAACTGAAAAAGTGTGTGTGTGCCCTTTTTACCCATTCTTTGTTCTGTGGTGTGCGTGTGGTGTGAGCTTGGTGTTTTGTCTCGAGGAAACGTGGGTCAGACACAAAGTAAGCCTACTCCGCTAGGAACTATGTTGAAAAATTTTAAGAAAGGATTTAATGGAGACTATGGGGTTACTATGACACCAGGGAAACTTAGAACTTTGTGTGAAATAGATTGGCCAACATTAGAAGTGGGTTGGCCATCAGAAGGAAGCCTGGACAGGTCCCTTGTTTCTAAGGTATGGCACAAGGTAACTGGTAAGTCAGGACACCCAGACCAGTTCCCATACATAGACACTTGGTTACAGCTGGTGCTAGACCCCCCACAGTGGCTAAGAGGGCAGGCAGCAGCAGTGCTAGTAGCAAAGGGACAGATAGCCAAGGAAGGATCCCGCTCCACCCGCCGAGGGAAATCAACTCCTGAAGTTCTGTTCGACCCAACATCAGAAGATCCATTGCAGGAGATGGCACCAGTGATCCCAGTGGTGCCCTCCCCTTACCAGGGAGAGAGGCTCCCCACTCTTGAGCCCACAGTGCTTGCGCCTCCGCAAGACAAACATATCCCTAGGCCACCCAGAGTAGACAAGAGAGGAGGTGAAGACTCGGGAGAAACCCCTCCCTCGGCAGCTCGTTTACGACCCAAAACGGGGATACAAATGCCCCTGAGAGAGCAGCGGTATACTGGGATAGATGAGGATGGTCACGTGGTGGAGAGGCGTGTTTTTGGGTACCAGCCCTTCACCTCCGCCGACCTTCTCAACTGGAAAAACAATACCCCGTCCTATACCGAAAAGCCACAAGCCCTAATTGATTTGCTCCAAACTGTTATCCAGACCCACAACCCCACCTGGGCTGATTGCCACCAGTTGCTCATGTTCCTCTTTAACAGAGATGAAAGGCGGAGAGTGCTCCAAGCAGCAACTAAGTGGCTAGAGGAACATGCACCAGCTGATTATCAAAACCCCCAAGAGTATGGAAGGACCCAGTTACCAGGAACCGACCCCCAGTTGGACCCACATGAAAGAGAGGATATGCAAAGGCTAAACCGAGACAGGGAAGCTCTCTTGGAAGGATTAANGAGGGGAGCTCAGAAGGCCACAAACGTTAACAAGGTCTCTGAGGTCATTCAGGGAAAAGAAGAAAGTCCAGCACAATTCTACGAGAGACTGTGTGAGGCCTATCGTATGTATACTCCCTTTGATCCCGATAGCCCTGAAAATCAGCGCATGATTCACATGGCTTTAGTCCGTCAAAGCGCAGAAGACATGAGAAGAAAACTGCAGAAACAGGCTGGGCTTGCAGGGATGAATACATCACAATTACTAGAAATAGCTAGCCAGGTGTTTGTAAACAGGGATGCAGTAAGCCGTAAGGAAANCGCAAAGAGAATGGAGGTCAGGCCCGGCGAAACGCGCCTGTTAGCTGCAGCAATCAGAGGGGCCCCCCCAAANGAGGCAAGGNNGAAGGGGGGCCCTGGGAAAGAAACTCAGCTTGGCTGTCAGAGTTTGCAGCGTAACCAGTGTGCTTATTGTAAAGAAATAGGACAGTGGAAGAACAAATGCCCTCAGCTCAAAAGAAAACAAGGTGACTCAGAGCAGGAGGCCCCGGACAAGGAGGAAGGGGCCCTGCTCAACCTGGCAGAAGGGTTATTGGACTGAGGGAGACCGGGCTCAAGCGTCCCCAAAGAGCCTCTGGTCAGAATGACAGTCGGGGGTAGAGACATTGATTTTCTTGTAGATAGCGGTGCTGAACATTCGCTAGTAACCGCCCCGGTCGCCCCCTTATCCAAAAAGACTATTGACGTCATCGGAGCCACGGGGGTTTCAGCAAAGCAAGCTTTCTGCTTGCCTCGGACTTGTACTGTAGGAGGACATAAAGTCATTCATCAGTTTTNGTACATGCCTGACTGTCCCTTGCCCTTNTTGGGAAGGGACTTGCTCAGCAAGCTGAGAGCCACTATCTCTTTGACAGAGCACGGCTCTTTGCTGCTAAAGTTACCCGGAACGGGAGTCATTATGACCCTTACGGTCCCCCGAGAGGAGGAATGGAGACTTTTCTTAACTGAGCCGGGCCAAGAGAGAAGACCAGCTCTGGCTAAGCGGTGGCCAAGAGTACGGGCGGAAGACAACCCTCCGGGGTTGGCAGTCAACCGAGCCCNCGTACTCGTNGAAGTTAAGACTGGGGCCCAGCCGGTTAGGCAAAAACAGGACCCGGTCCCCAGAGAAGCTCTTCAAGGTATCCAGGTCCGTCTCAAGCACCTAAGAACTTTTGGAATTATNGTTCCTTGTCAGTCTCCATGGAACACTCCCCTCCTGCCTGTTCCCAAGCCACGGACCAAGGACTACNGGCCGGTACAGGATTTGCGCTTGCTTNATCAAGCTACACTGACTTTACATCCAACAGTACCTAACCCGTCCACATTGTTGGGGTTGCTGCCAGCTGAGGACAGCTGGTTCACCTGCTTGGACCTGAAAGACGCTTTCTTTCCTATCAGATTAGCCCCTGAGAGGCAGAAGCTGTTTGCCTTTCAGTGGGAAGATCCGGAGTCAGGTGTCACTACTCAGTACACTTGGACCGGGCTTCCCCAAGGGTTCAAGAACTCCCCCACCATCTTCGGGGAGGCGTTGGCTCGAGACCTCCAGAAGTTTCCCACCAGAGACCTAGGCTGCGTGTTGCTCCAGTAGGTTGATGACCTTCTGCTGGGACACCCCACGGCAGTCGGGTGGCCAAGGGAACGGATGCCCTACNCCGGCACCTGGAGGACTGTGGGTATAAGGTGTCCAAGAAGAAANGCTCAGATCTGCCGACGGCAGGTACGTTACTTGGGATTTACTATCCGACAGGGGGAACGCAGCCCGGGATCAGAAAGAAAGCAGGTCATTTGCAATCTACCGGAGCCTAAGAGCAGAAGGCAGGTGAGAGAATTCTTAGGAGCTGTGGGGTTTTGTAGACTGTGGATCCCAAACTTTGCAGTATTAGCCAAGCCTTTGTATGAGGTCACAAAGGGGGGGGACCGGGAACCTTTGGAATGGGGATCCCAACAACAGCAAGTCTTTCATGAGTTAAAGGAAAAACTTCTGGCAGCCCCAGCCCTGGGGCTACCCGATCTGACAAAGCCTTTTCCATTGTATGCGTCAGAGAGAGAAAAGATGGCAGCTGGACTTTTAACCCAAACTGTGGGGCCCTGGCCGAGGCCGGTGGCCTACCTCTCTAAACAACTAGACGGGGTTTCTAAAGGATGGCCCCCCTGTTTGAGGGCCTTGGCAGCAACTGCCCTGCTAGTACAAGAAGCAAATAAGCTGACTCTTGGGCAAAACCTGAACATAAAGGCCCCCCATGCTGTGGTGACTTTAATGAATACTAAAGGACATCATTGGCTAACGAATGCCAGACTCACCAAGTACCAAACTTTGCTCTGTGAAAATCCCCGTATAACCATTGAAGTTTGTAACACCCTACACCCCGCCACCTTGCTCCCGGTATCAGAGAGCCCTGTCGAGCNTGATTGTGTAGAAGTGTTGGACTCAGTTGACTCTGGGCATCAGTAGACTGGGAACTATACGTGGATGGGAGCAGCTTCNTCAACCCCCAAGGAGAGAGAGGTGCAGGGTATGCGGTGGTAACCCTGGACACTGTTGTTGAAGCCAGATCGTTGCCCCAGGCCACTTCAGCCCAGAAAGCTGAACTCATTGCTTTCATTCGGGCCTTAGAACTCAGTGAGGGTGAGACTGTCAACATTTACACTGATTCTCGGTATGCCTTTTTAACCCTTCAAGTGCATGGAGCGTGATAGAAAGAAAAGGGCCTATTGAACTCTGGGGGAAAAGACAGAAAATATCAACAAGAAATCTTGCAATTATTAGAAGCAGTATGGAAACCCCACAAGGTGGCAGTTATGCATTGCAGAGGACACCAGCGAGCTTCCACCTTGCTGGGTTTGGGGAATTCCCGCGCTGACTCAGAGGCTCGAAAAGCAGCATCTGCCCTTCCGGGCATCAGTGACAGCCCCCCTGCTCCCTCAAGCACCTGATCTTGGACCTACTTNTTCTAAAGAAGAAAAGGACTTTCTCCAGGTAGAGGGAGGACAAGTGATGGAGGAAGGATGGATTCGGTTACCAGATGGGAGAGTAGCTGTGCCACAGCTGCTAGGAGCTGCAGTTGTACTGGCTGTGCANGAAACCACCCATCGAGGTCAGGAGTCACTGGAAAAGTTGTTAGGCCGGTATTTCTACATCTCGCNTTTGTCAGCCCTTGCCAAAACGGTGAGGCAGCGGTGTGTTACCTGCCGACAGCATGATGCGAGGCAAGGTCCAGCCGTTCCGCCCGGCATACGAGCTTATGGAGCAGCCCCCTTTGAAGATCTCCAGGTGGACTTCACAGAGATGCCAAAGTGTGGAGGTAACAAGTATTTACTAGTTCTTGGGCGTACCTACTCTGGGTGGGTGGAGGCCTATCCAACACGAACTGAGAAAGCTCGTGAAGTAACCCGTGTGCTTCTTCGAGATCTGATTCCTAGATTTGGACTGCCCTTACGGATCGGCTCAGATAACGGGCCTGCGTTTNTGGCTGNCTTGGTACAGAAGACGGCAAAGGTATTGGGGATCACACGGAAACTGCATGCCGCCTCCCGGCCTCAGAGTTCCGGAAAGGTGGAGCGGATGAATCGGACTATCAAAAATAGTTTAGGGAAAGTATGTCAGGAAACAGGATTAAAATGGATACAGGCTCTCCCTATGGTATTATTTAAAATTAGATGTACCCCTTCTAAAAGAACAGGATATTCCCCTTATGAAATATTATATCATAGGCCCCCTCCCATATTGCGGGGACTTCCAGGCACTCCCCGAGAGTTAGGTGAAATTGAGTTACAGCGACAGCTACAGGCTTTAGGAAAAATTACACAAACAATCTCAGCCTGGGTAAATGAGAGATGCCCTGTTAGCTTATTCTCCCCAGTTCACCCTTTCTCCCCAGGTGATCGAGTGTGGATCAAGGACTGGAACGTAGCCTCTTTGTGCCCACGGTGGAAAGGACCCCAGACTGTCGTCCTGANCACTCCCACCGCTGTGAAGGTAGAAGGAATCCCAGCCTGGATCCACCACAGCCGTGTAAAACCTGCAGCGCCTGAAACCTGGGAGGCAAGACCAAGCCCGGACAACCCTTGCAGAGTGACCCTGAAGAAGACGACAAGCCCTGCTCCAGTCACACCCGGAAGCTGACTGGTCCACGCACGGCCGAAGCATGAGGAAGCTCATCGTGGGATTCATTTTTCTTAAATTTTGGACTTATACAGTAAGGGCTTCAACTGACCTTACTCAAACTGGGGACTGTTCCCAGTGTATTCATCAGGTCACCGAGGTAGGACAGCAAATTAAAACAATCTTTCTGTTCTATAGTTATTATGAATGTATGGGAACATTAAAAGAAACTTGTTTGTATAATGCCACTCAGTACAAGGTATGTAGCCCGGGAAATGACCGACCTGATGTGTGTTATAACCCATCTGAGCCCCCTGCAACCACCGTTTTTGAAATAAGAATAAGAACTGGCCTTTTCCTAGGTGATACAAGTAAAATAATAACTAGAACAGAAGAAAAAGAAATCCCCAAGCAAATAACTTTAAGATTTGATGCTTGTGCAGCCATTAATAGTAAAAAGCTAGAAATAGGATGTGGTTCTCTTAACTGAGAAAGGAGCTANAGAGTAGAAAATAAATATGTTTGTCATGAGTCAGGGGTTTGTGAAAATTGTGCCTATTGGCCATGTGTTATTTAGGCTACTTAGAAAAAGAACAAAAAGGACCCGGTTCATCTTCAGAAGGGGGAAGCCAACCCCTCCTGTGCTGCCGGTCACTGTAACCCACTAGAACTAATAATTACCAATCCCCTAGATCCCCGTTGGAAAAAGGGAGAACGTGTAACCCTGGGGATCGATGGGACAGGGTTAAACCCCCAAGTTGCCATTTTAATTAGAGGGGAGGTCCACAAGCGCTCTCCCAAACCAGTATTTCAAACCTTTTATGAGGAGCTGAATCTGCCAGCACCAGAACTTCCGAAAAAGACAAAAAATTTGTTTCTCCAATTAGCAGAAAATGTAGCTCATTCCCTTAATGTTACTTCTTGTTATGTACGCGGGGGAACCACTATCGGAGACNGATGGCCTTGGGAAGCCCGAGAGTTGGTGCCTACTGATCCAGCTCCTGATATAATTCCAGTTCAGAAGGCCCAAGCTAGCAACTTCTAGGTCCTAAAAACCTCAATTATTAGACAATACTGTATAGCTAGAGAAGGGAAAGACTTTATCATCCCTGTAGGAAAGCTTAATTGTATAGGACAGAAGTTGTATAACAGCACAACAAAGACAATTACTTGGTGGGGCCTAAACCACACTGAAAAGAATCCATTTAGTAAATTTTCTAAATTAAAAACTGCTTGGGCTCATCCAGAATCTCATCAGGACTGGACGGCTCCCGCTGGACTATACTAGATATGTAGGCACAGAGCCTACATTCGGTTACCTAATAAATGGGCAGGCAGTTGTGTTATTGGCACTATTAAGCCGTCCTTTTTCTTATTACCCATAAAAACGGGTGAGCTCCTAGGTTTCCCTGTCTACGCCTCCCGAGAAAAGAAAGGCATAGTTATAGGAAACTGGAAAGATAATGAGTGGCCCCCTGAAAGGATCATNCAGTATTATGGGCCTGCCACATGGGCACAAGACGGCTCATGGGGATACCGAACCCCCATCTACATGCTCAATCGGATCATACGGTTGCAGGCCGTCTTAGAAATAATTACTAATGAAACTGGCAGAGCTTTGACTGTTTTAGCTTGGCAAGAAACCCAAATGAGGAATGCTATCTATCAGAATAGACTGGCCTTAGACTACTTGCTAGTAGCTGAAGGAGGAGTTTGTGGAAAATTTAACTTAACCAATTGCTGCCTACAAATAAATGATCAAGGACAGGTGGTTAAAAACATAGTCAGGGACATGACAAAGGTGGCACATGTGCCTGTACAGGTTTGGCACGAGTTTAATCCTGAGTCTTTATTTGGAAAATGGTTTCCAGCTATAGGAGGATTTAAAACCCTCATTGTAGGTGTATTGCTAGTGATAGGAACTTGCTTGCTGCTCCCCTGTGTATTACCCTTGCTTTTTCAAATGATAAAAGGTTTTGTTGCTACTTTGGTTCATCAGAAAACTTCAGCACACGTGTATTATATAAATCACTATCGCTCTATCTCACAAAGAGACTCAAAAAGTAAAGATGAGAGTGAGAACTCCCACTAAAAGTGAAAATNCTCAAAGGGGGGAAAA



TF motifs of the concenus sequence

Use FIMO to detect transcription factor motifs in the concenus sequence of the TE family.

TE_family TFBS Start End Strand Score Matched sequence
HERVE opa 814 825 + 17.51 GACCCCCCACAG
HERVE ZNF460 7106 7121 + 17.47 GTCTACGCCTCCCGAG
HERVE TB1 1934 1942 + 17.38 GGGCCCCCC
HERVE TB1 1962 1970 - 17.38 GGGCCCCCC
HERVE NFKB2 4436 4446 + 17.26 GGGGAATTCCC
HERVE TGA4 2295 2305 + 17.11 TGACGTCATCG
HERVE Usf 1196 1205 - 17.05 ACCACGTGAC
HERVE cg 468 478 - 16.93 GCACACACACA
HERVE GT-3a 7706 7719 + 16.81 CACGTGTATTATAT
HERVE NFkb 4436 4446 + 16.79 GGGGAATTCCC


TFBS enrichment in GRCh38

Use Fisher's exact test to perform enrichment analysis of transcription factor binding sites in the TE family of GRCh38.




GTEx

The promoter activity across 46 body sites from The Genotype-Tissue Expression (GTEx) project.




TCGA

The promoter activity across 33 cancer types from The Cancer Genome Atlas (TCGA).