HERV30
Basic information Differential Expression Stage analysis Survival analysis Correlation analysisDF ID | DF0000170 |
---|---|
TE superfamily | ERV1 |
TE class | LTR |
Species | Catarrhini |
Length | 8308 |
Kimura value | 6.06 |
Tau index | 0.0000 |
Description | Internal region of class I HERV30 endogenous retrovirus |
Comment | Associated long terminal repeats are LTR30 and LTR30N2. Several deletion products exist and are included in the seed alignment, except for ERV30N1 which has a separate entry. Coding regions are 350-1231 (MC132-like), 1370-2827 (gag), 2831-6403 (pol; 1 stopcodon at 4039 should be TGG), and 7521-8231 (env). Closely related to HERV9 and HERV17 (HERVW) |
Sequence |
TTTTGGCGAGCCAGCCAGGAGACTCCAGGAAAGGCATCTAGATCGTCACGCGGTGAGTACGATCGGACCTCTTTCGCTTGCTATTCTGTCCTGTCCTTCCTTAGAATTCGGAGGCTAAACACCGGGCACCTGTCGGCCACTTAAAGGCGATTAGCGCGGCCGCCGGACTAAAGACACGGGTGTCAGGCTGTCTGGAAAAGGGCTCTCTAACAACCCCCGACCCTTCGGGGTTGGGAGCATTGGTTNGCCTGGAACCAGTTCTAACTCTTTCGCTTTCCGTGGTGGTCCCGAAGTACACCCGGGAGTGCTCAGCGGACGTCTTAGTCTCCCAGATATCCTGGTTGAGACCATGGCCCCGCCAGAGGCTCCCCCTGCATGGGTTACTGAGCGTGAGACAGCCACATCTTCTGACTCCTGCCTCCTGGGTCCTAATGTCCGCCGGTTAGACTTCTTTCCTCATCTCGCAAGCAAGGTTATTCCCGCTAGGCAGGATCAAGATTCCCTATTTAGAAGTCTTAAATTCTTGGGGTGGCGCCCAGAAGATCCCTGTTCATGGTGCCCTCCAGGGTTTAGGCAGGTGTCGCCATTTGATGGCTATTTTGAAGGGCCAGTTCCCCACCATAGTGTATGGTCCCCCACATCAGGACAATTTAAAGACAGGTCTGTAATTTTCATGTGGATAGTAGAAGCCTTAGGGCATTTCCTCCATTGCTCCCCAGATAGACTTTCCCCTTCCTTGGGGCCTCTCAAGTACAATCTGTGGTGCATAGGCACGGGTCTTAGAGCCGTTGAATTGTTGTTTCAACCATTCAATAATTGGTATTGGANGGAAGAAAATATAGTCAGTTGGGACACAGGATACTGGTACCGCCTTGAGAGGGGGGCTTACTCCTTTGATGGCAAGTGGGGACAGAAGGCTAAAGTACAGCAGCTGTTCTCTCGGCCCTGGCCTAGAGGACATCCACCACCCCCTTTAAGCTTACTAAGCCTCCTGTCGCTAATTCAGAGATTTCTCCTTGAAGGACAGTTTTATGGCCAGGCCCACGTAAATTGGGCCTTAGCATGCAAGCATCAGTGGTGCCCCCGACCCAGGCCTTGCCACCCTGGAACAGGTAGGACGCGTTGGCAGAAGGACCACAATAAATCCAACAGTCCTTGTGCCCCATTTAGTGGTCAATGGGCGCACGGCAGGGGCAAGGGAAGTTTCCATCCCGCCGGTAAGCATGGTTAAATCCGGTAGATGGAGAGCTCAGGAAAAGCGGCCATGAGCTTTGAGCACAATTGGACCTGACCCTTAGGGGACGCCCTAAGGGAAGACGAGTCCCAGGACTAACCAGGGGTGCGGGCATCCCTGTGTTTAAAATTCCAGATGGGCACCACACCTTCAAAACCGGACACTCCCTTAAGATGTATCCTGAATAACTGGGACAAATTCGACCCTGAAACCTTAAAAAAGAAGCGGCTGATTTTCTTCTGTACCACTGCCTGGCCACAGTATTCCTTACAAAATGGAGAAACTTGGCCCCCTGAGGGAAGTATTAATTATAACACCCTTCTACAACTAGATCTTTTCTGTAAACAGGAAGGTAAATGGAGTGAAGTCCCTTATGTACAGGCTTTCTTTGCCCTTCGTGACAATACTGCCCTGTGCCAAGCCTGCAAGCTTTGCCCAAATGACAGAGGCCCACAATTGCCTCCATACTCAGGGCCTCTTCCCTCAGCCCCACTCTCCTCCCCCACTGACTCTCCTCCATCCGGCCCCACCGAAGTGTTAAAGGCACACCGGAAAGAGAACGTAAACTCCGCGAGCCAGGCACCCAAACTATGTCCCTTACAAGCAGTAGGAGGAGAATTTGGGCCCACCCGCGTGCATGCCCCCTTCTCACTCTCAGATTTAAAACAAATAAAGGCAGATTTAGGGAAATTCTCGGATGATCCTGATAACTATATAGATGTCCTGCAAGGATTAGGGCAGTCCTTTGATCTAACATGGAGAGATATCATGTTACTTCTTGATCAGACCTTAAGTCCTACTGAAAAGGAAGCAGCTTTAGCAGCAGCCCGGCAATTTGGGGATCTGTGGTACCTTAGCCAGGTAAACGATCGAATGGCCCTGGAGGAGAGGGAAAAATTCCCCACAGGGCAACAGGCAGTCCCCACTGTAGACCCTCATTGGGATACTGACTCAGATCATGGAGATTGGAGCCGCAGGCATTTGCTAACTTGCATTTTAGAAGGGTTGAGGAAGACTAGGAAAAAGCCTATGAACTACTCAATGCTATCCACAATTACGCAGGGAAAAGAGGAAAACCCCTCCGCTTTTCTAGAAAGGCTAAGGGAGGCCCTAAGAAAGCACACCTCCCTAACTCCGGATTCCNTGGAAGGCCAACTTATTCTAAAGGATAAATTTATCACCCAATCAGCGGCCGACATTAGGAGAAAACTCCAAAAGTCTGCCTTAGGCCCAGAACAAAATTTGGAGGCATTATTAAACCTGGCAACCTCGGTGTTCTATAACAGGGACCAAGAGGAACAGGCCAAAAGGGAAAAGCGAGATAAGAGAAAGGCTGCAGCCTTAGTCATGGCCCTCAGACAGGCAGACCTTGGTGGCTCAGAGGGAACCAAAAGAGGAGCAGGCCAATTGCCTAGTAGGGCTTGTTATCAGTGCGGTTTGCAAGGACACTTTAAGAAAGATTGTCCAACCAGAAACAAACCGCCCCCTCGCCCATGTCCAATATGCCAAGGCAATCACTGGAAGGCGCACTGCCCCAGAGGACGAAGGCCCTCTGGGCCAGAAGCACCCAACCAGATGATTCAGCAACAGGACTGAGGGTGCCCGGGGCAAGCGCCAGCTCATGCCATCACCCTCACAGAGCCCCGGGTAAGTTTGACCATTGAGGGCCAGGAAGTGGACTTCCTCCTGGACACTGGCGCGGCCTTCTCAGTTTTAATCTCCTGCCCCGGACGACTGTCCTCAAAGTCCGTTACTATCCGAGGAATCTTAGGACAGCCTGTAACCAGGTATTTCTCTCGCCTCCTCAGCTGCAATTGGGAGACTTTGCTCTTTTCACATGCCTTTCTTGTTATGCCCGAAAGTCCCACACCCTTATTAGGGAGGGACATATTAGCCAAAGCTGGGGCTATTATCTACATGAATATGGGGAACAAATTACCCATTTGTTGTCCCCTACTTGAAGAAGGAATCAACTCTGAAGTCTGGGCCTTGGAAGGACAATTCGGAAGGGCAAAGAATGCCCATCCAGTTCAAATCAGGCTAAAAGACCCCACCACTTTTCCTTATCAAAGGCAATATCCCTTAAGGCCTGAAGCTCACAAAGGATTACAGGATATTGTTAGACATTTAAAAGCTCAAGGCTTAGTAAGAAAATGTAGCAGTCCTTGCAACACCCCAATCCTAGGAATACAAAAACCAAATGGTCAGTGGAGACTAGTGCAAGACCTCAGAATCATCAATGAGGCAGTAATTCCTTTATATCCTGCTGTACCCAACCCCTATACACTGCTCTCTCAGATACCAGAGGAAGCAGAATGGTTCACTGTTCTGGACCTCAAAGATGCCTTCTTCTGCATTCCCCTGCACTCTGACTCCCAGTTCCTCTTTGCCTTTGAGGATCCTACAGACCACACGTCCCAGCTTACGTGGACGGTCTTGCCCCAAGGGTTTAGAGATAGCCCTCATCTGTTTGGTCAGGCACTGGCCCAAGACCTAGGCCAATTCTCAAGTCCAGGCACTCTGGTCCTCCAATACGCGGATGACGTACTTCTGGCTATCAGTTTGGAAGCCTCACGTCAGCAGGCTACTCTAGATCTCTTAAACTTTCTAGCTAATCGAGGGTACAAAGTGTCTAGGACAAAGGCCCAGCTCTGTCTACAACAAGTTAAATATCTAGGCCTAGTCCTAGCCAAAGGAACTAGGGCCCTCAGCAAAGAGCGTATTCAGCCTATACTGGCCTATCCTCACCCTAAGACATTGAAACAGTTGCGGGGGTTCCTTGGAATCACTGGCTTTTGCCGACTGTGAATTCCTGGATACAGTGAAATGGCCAGGCCACTCTATACCCTGATAAAGGAGACTCAGAAGGCGAATACCCATCTAGTAGAATGGGAACCGGAGGTGGAAACAGCCTTCAAAACTTTAAAGCAGGCCCTGGTACAAGCTCCAGCCCTGAGCCTCCCCACAGGACAAAATTTATCTTTATATGTCACCGAGAGAGCAGGAATAGCTCTTGGAGTTCTTACTCAGACTCGTGGGACAGCCCCACAACCAGTGGCATACCTAAGTAAGGAAATTGATGTAGTAGCCAAAGGCTGGCCTCACTGTTTACGGGTGGTTGCAGCAGTAGCCATCTTAGTGTCAGAGGCTATTAAAATAATACAAGGAAAGGATCTCACTGTCTGGACTACTCATGATGTAAGCGGCATATTAAATGCTAAAGGAAGTTTGTGGCTCTCAGATAACNGCCTACTCAAATACCAGGCACTACTCCTTGAGGGACCAGTATTTCAAATACGCACGTGTGCGGCCCTCAACCCTGCCACTTTTCTCCCAGAGGATGAGGAACCAATTGAGCATGACTGCCAACAAATTATNGCCCAGACTTATGCCACCCGAGAAGATCTCTTAGAAGTCCCCTTAACTAACCCTGACCTTAACCTGTACTCTGATGGAAGTTCATTTGTAGAAAATNGGGTACGAAAGGCAGGCTATGCCATAGTTAGCGATGCAGCAGTACTTGAAAGTAAGCCTCTTCCCCCAGGGACCAGCGCTCAGTTAGCAGAACTCGTGGCGCTTACCCGAGCCTTAGAACTGGGAGAAGGGAAAAGAATAAATGTGTACACAGATAGCAAGTATGCTTATCTAGTCCTACANGCACATGCTGCAATATGGAAAGAAAGGGAGTTCCTAACCTCTGGAGGAACACCCATTAAGTACCACAGAGAAATCATGGAGTTATTGCACGCAGTGCAAAAACCTAAGGAGGTGGCAGTCTTACACTGCCGGGGCCATCAGAAAGGTGAAGGAGAAGAAGCAGAAGGAAACCGCCGAGCAGACGCTGAGGCCAAAATTGCTGCCAGGCAGGACTTTCCTTCAGAAATGCCCATGGAAGGACCCCTGGTATGGAGCAACCCCCTCCAGGAGGTTAAGCCCCAGTATTCCCCAACTGAAACAGAATGGGGACTTTCACGAGGACATAGTTTTCTCCCCTCGGGGTGGCTAACAACAGAGGAAGGAAAGGTGCTCATACCTGAAGCCAGCCAGTGGAAAATACTTAAAACCCTCCACCAAACTTTTCATACGGGTATTGAAAGTACCCATAAGATGGCCACATCCCTATTTACAGGGCCAAACCTCCTCAAAACCATCCGGCAAGTAGTCAAAGCCTGTGAAGTGTGCCAAAAGAATAACCCCTTGGCCCACCGTAAGGCCTCTCCAGGAGGACAAAGAACAGGACATTATCCTGGAGAGGACTGGCAGTTAGATTTTACCCATATGCCAAAGTCAAGAGGATTTCAATACTTATTGGTCTGTGTTGATACCTTTACAAATTGGGTGGAAGCCTTCCCTTGTAGAACAGAGAAGGCCCAAGAAGTGGTTAAAGTCTTAGTTCATGAAATAATTCCTAGATTTGGACTTCCCCAAAGCTTACAGAGCGACAATGGTCCAGCTTTTAAAGCTACAATAACTCAAGGAATTTCCAAGGCACTAGGAATACAATATCACCTTCACTGTGCCTGGAGGCCACAATCCTCAGGGAAAGTCGAAAAGGCAAATGAAACACTCAAGAGGCATTTGAGAAAGCTAGCGCAAGAAACTCATCTCCCATGGCCCACTCTCTTGCCCATGGCCTTATTAAGAATTCGAAANTCCCCTCACAGAATGGGGCTCAGTCCATATGAAATGCTGTATGGATGGCCTTTTCTCACAAATGACCTCCTGCTCAATCAGGAAACGGCCAATTTAGTCAAAGATATAACTTCTCTGGCAAAATATCAACAAAACCTTAAAACTTTACCCGAAAGGTGTGACAGGGAAAAAGGAATAGAGTTGTTTCAACCAGGAGATCTAGTATTGGTCAAGTCTCTCCCCTCTACCTCTCCATCTATGGATCCCTTATGGGAGGGACCATACTCGGTAATCCTCTCTACCCCCACTGCAGTTAAAGTGGCAGGAGTGGAATCCTGGATTCACCACACCCGAGTTAAACCTTGGACACCTCCTGAGGAACTTACAGGATCATCANCTCAGGAGTCACAAGGTCAGCCAGACCAGCCTCGATACACCTGTCAGCCACTAGAGGACCTGCATCTCCTATTTCGGAAGGAAACATCTCAGACCAGAAAAACTCCTGCAGTTAATCCTGAAGAGGAACTTCTCTCTACCTAAAGGAGGATAAGTAAAAAAACCTACATGATCTTTGACATCTCTCCTTGCTCTCTTTAATGGAATCCTTCTACTGTTTCGTTACATTATTAAGCAGTATACTAACTATACTCTTTGCAGTAGGATTATATACTGTAGCTCCAGCCGGGACAAAAATCTTAACCACATCAACCTTTCTTCTATCGTCCTTCCTTCTAACAGCAATTTACTCCTTTTTCCCTCCTCTTTCCTACGACCGTTCCACCTACAACACGTCATGACTCCTCTTAGGCTTCCTGCCATCCTCTTCATACTCATGTCCCTTTCTCCAACTACNACACACNCCCCATGTCAGTGTGCCTCCCCTGGAGGAGTCAACCGGCATTCTCTCAGAAACTCTTGGGGATTAGGTAGCCCCTTCCAAGCACCCGCATCTTTTGCCGCGTATACTTACATGAGAAAAGAATGTTATAAAACTGCTTCTCTCTGCTCTCACAATGGCCGTACATATCACCAAGGAAAAATGATCCGAGCTGACTGCCCTGAGAAATGGGGGGCCAACGCTTGTTGGACATATTATACCCATATAGGTATGTCTGACGGAGGAGGCGTCCAAGATGAGGCTAAAGAACGGCATATCCAACAAGTAATTAAAAACTTAGTCCAGCTCTCCAGTACTCCCAGTCCATACAAGAAATTAGACCTTTCCAGGCTACAAGAAACCCTTAACTCTCATTCTCGTCTCTGGAGCCTGTTTAACACCACCCTTACAGGAATACAAGAGGCCTCTCCTAGTAATCCAACCAACTGTTGGATGTGTCTCCCCTTGCGTTTTCAACCATATGTCCCAGTCCCTGTCCCCGGACAGTGGAACTTATCCACCCCAGTCCTAAACACCACCAAATTAATCGGTCCCATAGTCACCAATTTACCAGCCACACAGGCCTCAAATCTCACATGCATAAACTTCAGCATGACTCTCAATAAGAACACCTCCCGATGTCAGTCCTGGATATCAGTAACCTCAGGTTTCACCTGTCTAACTTCAGGCATCTTTTTCATCTGTGATAACACAGCCTATTGATGCCTAAACGGCACTCCAAAAGAATTATGCTTTCTCTCCTTTCTAGCACCTCCCATGTCCATATATACTGAACAAGAGTTACAAAGTCTCCTTATACCCCAATCCCGCCACACACGAGCCCTTATTGTCCCTTTTATTGTAGGAGCCGGAATACTGGGCGGGCTTGGGACTGGAATTGGAGGCATAACCTCCTCCACCCAATTCTATTATAAATTATCACGAGAATTAAATGATGACATGGAACGAGTTGCCAACTCCCTAGTGACCCTACAAAGCCAGCTTAATTCTCTAGCTGCGGTAGTCCTCCAAAACCGAAGAGCCCTAGACCTATTAACAGCTGAAAGAGGAGGAACCTGCCTCTTCTTAGGAGAAGAATGTTGCTATTTCGTTAACCAGTCAGGAATCATTACTGAAAAGGTCAAAGAAATAAGAGAACGGATAGAAAGTAGGAAAAAGGAGCTTGAACACTCAGGACCCTGGAATATGTTTAACCAATGGATACCTTGGCTCCTCCCCTTTCTAGGCCCTGTGACAGCCATCCTACTATTACTCGCCTTTGGGCCTTGCATTTTTAACCTCCTTGTCAAATTTGTTTCCTCCAGGATCGAGGCCATCAAGCTACAAATGGTCTTACAAATGGAACCTCAAATGAGCTCAACTCACGGCTTCTACCGAGGACCCCTGGATCGACCCGCTGGTCCCTCGACTAGCCTAGAAAGTTCCCCTCTGGAGGACACCACAACTGCAGGGCCCCTTCTTCGCCCCTAACCAGCAGGAAGTAGCCAGAACGACCGCCGCCCAGTTCCCAACAGCAGTTGGGGTGTCCTGTTTAGAGGGGGGAC
|
TF motifs of the concenus sequence
Use FIMO to detect transcription factor motifs in the concenus sequence of the TE family.
TE_family | TFBS | Start | End | Strand | Score | Matched sequence |
---|---|---|---|---|---|---|
HERV30 | CG7928 | 1333 | 1344 | - | 17.95 | CGCACCCCTGGT |
HERV30 | BEH3 | 4532 | 4541 | + | 17.79 | CGCACGTGTG |
HERV30 | FLI1::FOXI1 | 1576 | 1586 | + | 17.74 | TAAACAGGAAG |
HERV30 | CTCF | 1161 | 1191 | - | 17.71 | CTGCCGTGCGCCCATTGACCACTAAATGGGG |
HERV30 | FOXO1::ELF1 | 1575 | 1587 | + | 17.61 | GTAAACAGGAAGG |
HERV30 | BZR1 | 4532 | 4541 | - | 17.48 | CACACGTGCG |
HERV30 | dl | 2305 | 2314 | - | 17.26 | GGGGTTTTCC |
HERV30 | SIX2 | 7406 | 7416 | - | 17.13 | TGAAACCTGAG |
HERV30 | ATHB-40 | 811 | 821 | - | 17.02 | ACCAATTATTG |
HERV30 | ARF25 | 7199 | 7210 | - | 16.84 | AAGGGGAGACAC |
TFBS enrichment in GRCh38
Use Fisher's exact test to perform enrichment analysis of transcription factor binding sites in the TE family of GRCh38.