HERV30

Basic information Differential Expression Stage analysis Survival analysis Correlation analysis

DF ID DF0000170
TE superfamily ERV1
TE class LTR
Species Catarrhini
Length 8308
Kimura value 6.06
Tau index 0.0000
Description Internal region of class I HERV30 endogenous retrovirus
Comment Associated long terminal repeats are LTR30 and LTR30N2. Several deletion products exist and are included in the seed alignment, except for ERV30N1 which has a separate entry. Coding regions are 350-1231 (MC132-like), 1370-2827 (gag), 2831-6403 (pol; 1 stopcodon at 4039 should be TGG), and 7521-8231 (env). Closely related to HERV9 and HERV17 (HERVW)
Sequence
TTTTGGCGAGCCAGCCAGGAGACTCCAGGAAAGGCATCTAGATCGTCACGCGGTGAGTACGATCGGACCTCTTTCGCTTGCTATTCTGTCCTGTCCTTCCTTAGAATTCGGAGGCTAAACACCGGGCACCTGTCGGCCACTTAAAGGCGATTAGCGCGGCCGCCGGACTAAAGACACGGGTGTCAGGCTGTCTGGAAAAGGGCTCTCTAACAACCCCCGACCCTTCGGGGTTGGGAGCATTGGTTNGCCTGGAACCAGTTCTAACTCTTTCGCTTTCCGTGGTGGTCCCGAAGTACACCCGGGAGTGCTCAGCGGACGTCTTAGTCTCCCAGATATCCTGGTTGAGACCATGGCCCCGCCAGAGGCTCCCCCTGCATGGGTTACTGAGCGTGAGACAGCCACATCTTCTGACTCCTGCCTCCTGGGTCCTAATGTCCGCCGGTTAGACTTCTTTCCTCATCTCGCAAGCAAGGTTATTCCCGCTAGGCAGGATCAAGATTCCCTATTTAGAAGTCTTAAATTCTTGGGGTGGCGCCCAGAAGATCCCTGTTCATGGTGCCCTCCAGGGTTTAGGCAGGTGTCGCCATTTGATGGCTATTTTGAAGGGCCAGTTCCCCACCATAGTGTATGGTCCCCCACATCAGGACAATTTAAAGACAGGTCTGTAATTTTCATGTGGATAGTAGAAGCCTTAGGGCATTTCCTCCATTGCTCCCCAGATAGACTTTCCCCTTCCTTGGGGCCTCTCAAGTACAATCTGTGGTGCATAGGCACGGGTCTTAGAGCCGTTGAATTGTTGTTTCAACCATTCAATAATTGGTATTGGANGGAAGAAAATATAGTCAGTTGGGACACAGGATACTGGTACCGCCTTGAGAGGGGGGCTTACTCCTTTGATGGCAAGTGGGGACAGAAGGCTAAAGTACAGCAGCTGTTCTCTCGGCCCTGGCCTAGAGGACATCCACCACCCCCTTTAAGCTTACTAAGCCTCCTGTCGCTAATTCAGAGATTTCTCCTTGAAGGACAGTTTTATGGCCAGGCCCACGTAAATTGGGCCTTAGCATGCAAGCATCAGTGGTGCCCCCGACCCAGGCCTTGCCACCCTGGAACAGGTAGGACGCGTTGGCAGAAGGACCACAATAAATCCAACAGTCCTTGTGCCCCATTTAGTGGTCAATGGGCGCACGGCAGGGGCAAGGGAAGTTTCCATCCCGCCGGTAAGCATGGTTAAATCCGGTAGATGGAGAGCTCAGGAAAAGCGGCCATGAGCTTTGAGCACAATTGGACCTGACCCTTAGGGGACGCCCTAAGGGAAGACGAGTCCCAGGACTAACCAGGGGTGCGGGCATCCCTGTGTTTAAAATTCCAGATGGGCACCACACCTTCAAAACCGGACACTCCCTTAAGATGTATCCTGAATAACTGGGACAAATTCGACCCTGAAACCTTAAAAAAGAAGCGGCTGATTTTCTTCTGTACCACTGCCTGGCCACAGTATTCCTTACAAAATGGAGAAACTTGGCCCCCTGAGGGAAGTATTAATTATAACACCCTTCTACAACTAGATCTTTTCTGTAAACAGGAAGGTAAATGGAGTGAAGTCCCTTATGTACAGGCTTTCTTTGCCCTTCGTGACAATACTGCCCTGTGCCAAGCCTGCAAGCTTTGCCCAAATGACAGAGGCCCACAATTGCCTCCATACTCAGGGCCTCTTCCCTCAGCCCCACTCTCCTCCCCCACTGACTCTCCTCCATCCGGCCCCACCGAAGTGTTAAAGGCACACCGGAAAGAGAACGTAAACTCCGCGAGCCAGGCACCCAAACTATGTCCCTTACAAGCAGTAGGAGGAGAATTTGGGCCCACCCGCGTGCATGCCCCCTTCTCACTCTCAGATTTAAAACAAATAAAGGCAGATTTAGGGAAATTCTCGGATGATCCTGATAACTATATAGATGTCCTGCAAGGATTAGGGCAGTCCTTTGATCTAACATGGAGAGATATCATGTTACTTCTTGATCAGACCTTAAGTCCTACTGAAAAGGAAGCAGCTTTAGCAGCAGCCCGGCAATTTGGGGATCTGTGGTACCTTAGCCAGGTAAACGATCGAATGGCCCTGGAGGAGAGGGAAAAATTCCCCACAGGGCAACAGGCAGTCCCCACTGTAGACCCTCATTGGGATACTGACTCAGATCATGGAGATTGGAGCCGCAGGCATTTGCTAACTTGCATTTTAGAAGGGTTGAGGAAGACTAGGAAAAAGCCTATGAACTACTCAATGCTATCCACAATTACGCAGGGAAAAGAGGAAAACCCCTCCGCTTTTCTAGAAAGGCTAAGGGAGGCCCTAAGAAAGCACACCTCCCTAACTCCGGATTCCNTGGAAGGCCAACTTATTCTAAAGGATAAATTTATCACCCAATCAGCGGCCGACATTAGGAGAAAACTCCAAAAGTCTGCCTTAGGCCCAGAACAAAATTTGGAGGCATTATTAAACCTGGCAACCTCGGTGTTCTATAACAGGGACCAAGAGGAACAGGCCAAAAGGGAAAAGCGAGATAAGAGAAAGGCTGCAGCCTTAGTCATGGCCCTCAGACAGGCAGACCTTGGTGGCTCAGAGGGAACCAAAAGAGGAGCAGGCCAATTGCCTAGTAGGGCTTGTTATCAGTGCGGTTTGCAAGGACACTTTAAGAAAGATTGTCCAACCAGAAACAAACCGCCCCCTCGCCCATGTCCAATATGCCAAGGCAATCACTGGAAGGCGCACTGCCCCAGAGGACGAAGGCCCTCTGGGCCAGAAGCACCCAACCAGATGATTCAGCAACAGGACTGAGGGTGCCCGGGGCAAGCGCCAGCTCATGCCATCACCCTCACAGAGCCCCGGGTAAGTTTGACCATTGAGGGCCAGGAAGTGGACTTCCTCCTGGACACTGGCGCGGCCTTCTCAGTTTTAATCTCCTGCCCCGGACGACTGTCCTCAAAGTCCGTTACTATCCGAGGAATCTTAGGACAGCCTGTAACCAGGTATTTCTCTCGCCTCCTCAGCTGCAATTGGGAGACTTTGCTCTTTTCACATGCCTTTCTTGTTATGCCCGAAAGTCCCACACCCTTATTAGGGAGGGACATATTAGCCAAAGCTGGGGCTATTATCTACATGAATATGGGGAACAAATTACCCATTTGTTGTCCCCTACTTGAAGAAGGAATCAACTCTGAAGTCTGGGCCTTGGAAGGACAATTCGGAAGGGCAAAGAATGCCCATCCAGTTCAAATCAGGCTAAAAGACCCCACCACTTTTCCTTATCAAAGGCAATATCCCTTAAGGCCTGAAGCTCACAAAGGATTACAGGATATTGTTAGACATTTAAAAGCTCAAGGCTTAGTAAGAAAATGTAGCAGTCCTTGCAACACCCCAATCCTAGGAATACAAAAACCAAATGGTCAGTGGAGACTAGTGCAAGACCTCAGAATCATCAATGAGGCAGTAATTCCTTTATATCCTGCTGTACCCAACCCCTATACACTGCTCTCTCAGATACCAGAGGAAGCAGAATGGTTCACTGTTCTGGACCTCAAAGATGCCTTCTTCTGCATTCCCCTGCACTCTGACTCCCAGTTCCTCTTTGCCTTTGAGGATCCTACAGACCACACGTCCCAGCTTACGTGGACGGTCTTGCCCCAAGGGTTTAGAGATAGCCCTCATCTGTTTGGTCAGGCACTGGCCCAAGACCTAGGCCAATTCTCAAGTCCAGGCACTCTGGTCCTCCAATACGCGGATGACGTACTTCTGGCTATCAGTTTGGAAGCCTCACGTCAGCAGGCTACTCTAGATCTCTTAAACTTTCTAGCTAATCGAGGGTACAAAGTGTCTAGGACAAAGGCCCAGCTCTGTCTACAACAAGTTAAATATCTAGGCCTAGTCCTAGCCAAAGGAACTAGGGCCCTCAGCAAAGAGCGTATTCAGCCTATACTGGCCTATCCTCACCCTAAGACATTGAAACAGTTGCGGGGGTTCCTTGGAATCACTGGCTTTTGCCGACTGTGAATTCCTGGATACAGTGAAATGGCCAGGCCACTCTATACCCTGATAAAGGAGACTCAGAAGGCGAATACCCATCTAGTAGAATGGGAACCGGAGGTGGAAACAGCCTTCAAAACTTTAAAGCAGGCCCTGGTACAAGCTCCAGCCCTGAGCCTCCCCACAGGACAAAATTTATCTTTATATGTCACCGAGAGAGCAGGAATAGCTCTTGGAGTTCTTACTCAGACTCGTGGGACAGCCCCACAACCAGTGGCATACCTAAGTAAGGAAATTGATGTAGTAGCCAAAGGCTGGCCTCACTGTTTACGGGTGGTTGCAGCAGTAGCCATCTTAGTGTCAGAGGCTATTAAAATAATACAAGGAAAGGATCTCACTGTCTGGACTACTCATGATGTAAGCGGCATATTAAATGCTAAAGGAAGTTTGTGGCTCTCAGATAACNGCCTACTCAAATACCAGGCACTACTCCTTGAGGGACCAGTATTTCAAATACGCACGTGTGCGGCCCTCAACCCTGCCACTTTTCTCCCAGAGGATGAGGAACCAATTGAGCATGACTGCCAACAAATTATNGCCCAGACTTATGCCACCCGAGAAGATCTCTTAGAAGTCCCCTTAACTAACCCTGACCTTAACCTGTACTCTGATGGAAGTTCATTTGTAGAAAATNGGGTACGAAAGGCAGGCTATGCCATAGTTAGCGATGCAGCAGTACTTGAAAGTAAGCCTCTTCCCCCAGGGACCAGCGCTCAGTTAGCAGAACTCGTGGCGCTTACCCGAGCCTTAGAACTGGGAGAAGGGAAAAGAATAAATGTGTACACAGATAGCAAGTATGCTTATCTAGTCCTACANGCACATGCTGCAATATGGAAAGAAAGGGAGTTCCTAACCTCTGGAGGAACACCCATTAAGTACCACAGAGAAATCATGGAGTTATTGCACGCAGTGCAAAAACCTAAGGAGGTGGCAGTCTTACACTGCCGGGGCCATCAGAAAGGTGAAGGAGAAGAAGCAGAAGGAAACCGCCGAGCAGACGCTGAGGCCAAAATTGCTGCCAGGCAGGACTTTCCTTCAGAAATGCCCATGGAAGGACCCCTGGTATGGAGCAACCCCCTCCAGGAGGTTAAGCCCCAGTATTCCCCAACTGAAACAGAATGGGGACTTTCACGAGGACATAGTTTTCTCCCCTCGGGGTGGCTAACAACAGAGGAAGGAAAGGTGCTCATACCTGAAGCCAGCCAGTGGAAAATACTTAAAACCCTCCACCAAACTTTTCATACGGGTATTGAAAGTACCCATAAGATGGCCACATCCCTATTTACAGGGCCAAACCTCCTCAAAACCATCCGGCAAGTAGTCAAAGCCTGTGAAGTGTGCCAAAAGAATAACCCCTTGGCCCACCGTAAGGCCTCTCCAGGAGGACAAAGAACAGGACATTATCCTGGAGAGGACTGGCAGTTAGATTTTACCCATATGCCAAAGTCAAGAGGATTTCAATACTTATTGGTCTGTGTTGATACCTTTACAAATTGGGTGGAAGCCTTCCCTTGTAGAACAGAGAAGGCCCAAGAAGTGGTTAAAGTCTTAGTTCATGAAATAATTCCTAGATTTGGACTTCCCCAAAGCTTACAGAGCGACAATGGTCCAGCTTTTAAAGCTACAATAACTCAAGGAATTTCCAAGGCACTAGGAATACAATATCACCTTCACTGTGCCTGGAGGCCACAATCCTCAGGGAAAGTCGAAAAGGCAAATGAAACACTCAAGAGGCATTTGAGAAAGCTAGCGCAAGAAACTCATCTCCCATGGCCCACTCTCTTGCCCATGGCCTTATTAAGAATTCGAAANTCCCCTCACAGAATGGGGCTCAGTCCATATGAAATGCTGTATGGATGGCCTTTTCTCACAAATGACCTCCTGCTCAATCAGGAAACGGCCAATTTAGTCAAAGATATAACTTCTCTGGCAAAATATCAACAAAACCTTAAAACTTTACCCGAAAGGTGTGACAGGGAAAAAGGAATAGAGTTGTTTCAACCAGGAGATCTAGTATTGGTCAAGTCTCTCCCCTCTACCTCTCCATCTATGGATCCCTTATGGGAGGGACCATACTCGGTAATCCTCTCTACCCCCACTGCAGTTAAAGTGGCAGGAGTGGAATCCTGGATTCACCACACCCGAGTTAAACCTTGGACACCTCCTGAGGAACTTACAGGATCATCANCTCAGGAGTCACAAGGTCAGCCAGACCAGCCTCGATACACCTGTCAGCCACTAGAGGACCTGCATCTCCTATTTCGGAAGGAAACATCTCAGACCAGAAAAACTCCTGCAGTTAATCCTGAAGAGGAACTTCTCTCTACCTAAAGGAGGATAAGTAAAAAAACCTACATGATCTTTGACATCTCTCCTTGCTCTCTTTAATGGAATCCTTCTACTGTTTCGTTACATTATTAAGCAGTATACTAACTATACTCTTTGCAGTAGGATTATATACTGTAGCTCCAGCCGGGACAAAAATCTTAACCACATCAACCTTTCTTCTATCGTCCTTCCTTCTAACAGCAATTTACTCCTTTTTCCCTCCTCTTTCCTACGACCGTTCCACCTACAACACGTCATGACTCCTCTTAGGCTTCCTGCCATCCTCTTCATACTCATGTCCCTTTCTCCAACTACNACACACNCCCCATGTCAGTGTGCCTCCCCTGGAGGAGTCAACCGGCATTCTCTCAGAAACTCTTGGGGATTAGGTAGCCCCTTCCAAGCACCCGCATCTTTTGCCGCGTATACTTACATGAGAAAAGAATGTTATAAAACTGCTTCTCTCTGCTCTCACAATGGCCGTACATATCACCAAGGAAAAATGATCCGAGCTGACTGCCCTGAGAAATGGGGGGCCAACGCTTGTTGGACATATTATACCCATATAGGTATGTCTGACGGAGGAGGCGTCCAAGATGAGGCTAAAGAACGGCATATCCAACAAGTAATTAAAAACTTAGTCCAGCTCTCCAGTACTCCCAGTCCATACAAGAAATTAGACCTTTCCAGGCTACAAGAAACCCTTAACTCTCATTCTCGTCTCTGGAGCCTGTTTAACACCACCCTTACAGGAATACAAGAGGCCTCTCCTAGTAATCCAACCAACTGTTGGATGTGTCTCCCCTTGCGTTTTCAACCATATGTCCCAGTCCCTGTCCCCGGACAGTGGAACTTATCCACCCCAGTCCTAAACACCACCAAATTAATCGGTCCCATAGTCACCAATTTACCAGCCACACAGGCCTCAAATCTCACATGCATAAACTTCAGCATGACTCTCAATAAGAACACCTCCCGATGTCAGTCCTGGATATCAGTAACCTCAGGTTTCACCTGTCTAACTTCAGGCATCTTTTTCATCTGTGATAACACAGCCTATTGATGCCTAAACGGCACTCCAAAAGAATTATGCTTTCTCTCCTTTCTAGCACCTCCCATGTCCATATATACTGAACAAGAGTTACAAAGTCTCCTTATACCCCAATCCCGCCACACACGAGCCCTTATTGTCCCTTTTATTGTAGGAGCCGGAATACTGGGCGGGCTTGGGACTGGAATTGGAGGCATAACCTCCTCCACCCAATTCTATTATAAATTATCACGAGAATTAAATGATGACATGGAACGAGTTGCCAACTCCCTAGTGACCCTACAAAGCCAGCTTAATTCTCTAGCTGCGGTAGTCCTCCAAAACCGAAGAGCCCTAGACCTATTAACAGCTGAAAGAGGAGGAACCTGCCTCTTCTTAGGAGAAGAATGTTGCTATTTCGTTAACCAGTCAGGAATCATTACTGAAAAGGTCAAAGAAATAAGAGAACGGATAGAAAGTAGGAAAAAGGAGCTTGAACACTCAGGACCCTGGAATATGTTTAACCAATGGATACCTTGGCTCCTCCCCTTTCTAGGCCCTGTGACAGCCATCCTACTATTACTCGCCTTTGGGCCTTGCATTTTTAACCTCCTTGTCAAATTTGTTTCCTCCAGGATCGAGGCCATCAAGCTACAAATGGTCTTACAAATGGAACCTCAAATGAGCTCAACTCACGGCTTCTACCGAGGACCCCTGGATCGACCCGCTGGTCCCTCGACTAGCCTAGAAAGTTCCCCTCTGGAGGACACCACAACTGCAGGGCCCCTTCTTCGCCCCTAACCAGCAGGAAGTAGCCAGAACGACCGCCGCCCAGTTCCCAACAGCAGTTGGGGTGTCCTGTTTAGAGGGGGGAC



TF motifs of the concenus sequence

Use FIMO to detect transcription factor motifs in the concenus sequence of the TE family.

TE_family TFBS Start End Strand Score Matched sequence
HERV30 CG7928 1333 1344 - 17.95 CGCACCCCTGGT
HERV30 BEH3 4532 4541 + 17.79 CGCACGTGTG
HERV30 FLI1::FOXI1 1576 1586 + 17.74 TAAACAGGAAG
HERV30 CTCF 1161 1191 - 17.71 CTGCCGTGCGCCCATTGACCACTAAATGGGG
HERV30 FOXO1::ELF1 1575 1587 + 17.61 GTAAACAGGAAGG
HERV30 BZR1 4532 4541 - 17.48 CACACGTGCG
HERV30 dl 2305 2314 - 17.26 GGGGTTTTCC
HERV30 SIX2 7406 7416 - 17.13 TGAAACCTGAG
HERV30 ATHB-40 811 821 - 17.02 ACCAATTATTG
HERV30 ARF25 7199 7210 - 16.84 AAGGGGAGACAC


TFBS enrichment in GRCh38

Use Fisher's exact test to perform enrichment analysis of transcription factor binding sites in the TE family of GRCh38.




GTEx

The promoter activity across 46 body sites from The Genotype-Tissue Expression (GTEx) project.




TCGA

The promoter activity across 33 cancer types from The Cancer Genome Atlas (TCGA).