HERVE
DF ID | DF0000174 |
---|---|
TE superfamily | ERV1 |
TE class | LTR |
Species | Catarrhini |
Length | 7813 |
Kimura value | 6.13 |
Tau index | 0.9707 |
Description | Internal region of an ERV1 endogenous retrovirus, HERVE subfamily |
Comment | The associated long terminal repeat is LTR2. The gag and pol genes are ~40% similar to those of Moloney murine leukemia virus (MoMuLV). Distinctive features of this endogenous human retroviral DNA include a tRNA-Glu primer binding site separated from the 5' LTR by a pentanucleotide, and a putative env sequence that does not appear to overlap the C terminus of pol and has virtually no homology with the env genes of known infectious retroviruses. The reconstructed (though still pseudogenic) gene boundaries are: Gag: 571-2094, Pol: 2095-5655, Env: 5678-7750. |
Sequence |
TTTCTTGGTTCCCTGACCGGGAAGCGAGGTAATTGACGGACGGTCGAGGCAGCCCCTTAGGCGGCTTAGGCCTGCCCTGTGGAGCATCCCTGCGGGGGACTCCGGCCAGCTTGAGCGACGCGGATCCTGAGAGCGCTCCCGGGTAGGCAATTGCCCCGGTGGAACGCCTCGTCAGAGAGTGCGTGGCAGGCCCCCGTGGAGGATCAACGCAGTGGCTGAACACCGGGAAGGAACGGGCACTTGGAGTCCGGACATTTGAAACTTGGTAAGACTGGTCTTTGGAACTTGCCCACTCCATTTGAGTGGAAGCGTGGCCTGATCACCCACGGCGTGCCTGTACCGGCACTTTGGTTTTTGTTTTTGACTTGACTTGAATTGCTTGATACTTTGGTTTTGGTTTGACCTGGCTTGGATTTCTGGATACTCTGATTTTGGTTTTGATTCTGGTTTGGTGAAAACTGAAAAAGTGTGTGTGTGCCCTTTTTACCCATTCTTTGTTCTGTGGTGTGCGTGTGGTGTGAGCTTGGTGTTTTGTCTCGAGGAAACGTGGGTCAGACACAAAGTAAGCCTACTCCGCTAGGAACTATGTTGAAAAATTTTAAGAAAGGATTTAATGGAGACTATGGGGTTACTATGACACCAGGGAAACTTAGAACTTTGTGTGAAATAGATTGGCCAACATTAGAAGTGGGTTGGCCATCAGAAGGAAGCCTGGACAGGTCCCTTGTTTCTAAGGTATGGCACAAGGTAACTGGTAAGTCAGGACACCCAGACCAGTTCCCATACATAGACACTTGGTTACAGCTGGTGCTAGACCCCCCACAGTGGCTAAGAGGGCAGGCAGCAGCAGTGCTAGTAGCAAAGGGACAGATAGCCAAGGAAGGATCCCGCTCCACCCGCCGAGGGAAATCAACTCCTGAAGTTCTGTTCGACCCAACATCAGAAGATCCATTGCAGGAGATGGCACCAGTGATCCCAGTGGTGCCCTCCCCTTACCAGGGAGAGAGGCTCCCCACTCTTGAGCCCACAGTGCTTGCGCCTCCGCAAGACAAACATATCCCTAGGCCACCCAGAGTAGACAAGAGAGGAGGTGAAGACTCGGGAGAAACCCCTCCCTCGGCAGCTCGTTTACGACCCAAAACGGGGATACAAATGCCCCTGAGAGAGCAGCGGTATACTGGGATAGATGAGGATGGTCACGTGGTGGAGAGGCGTGTTTTTGGGTACCAGCCCTTCACCTCCGCCGACCTTCTCAACTGGAAAAACAATACCCCGTCCTATACCGAAAAGCCACAAGCCCTAATTGATTTGCTCCAAACTGTTATCCAGACCCACAACCCCACCTGGGCTGATTGCCACCAGTTGCTCATGTTCCTCTTTAACAGAGATGAAAGGCGGAGAGTGCTCCAAGCAGCAACTAAGTGGCTAGAGGAACATGCACCAGCTGATTATCAAAACCCCCAAGAGTATGGAAGGACCCAGTTACCAGGAACCGACCCCCAGTTGGACCCACATGAAAGAGAGGATATGCAAAGGCTAAACCGAGACAGGGAAGCTCTCTTGGAAGGATTAANGAGGGGAGCTCAGAAGGCCACAAACGTTAACAAGGTCTCTGAGGTCATTCAGGGAAAAGAAGAAAGTCCAGCACAATTCTACGAGAGACTGTGTGAGGCCTATCGTATGTATACTCCCTTTGATCCCGATAGCCCTGAAAATCAGCGCATGATTCACATGGCTTTAGTCCGTCAAAGCGCAGAAGACATGAGAAGAAAACTGCAGAAACAGGCTGGGCTTGCAGGGATGAATACATCACAATTACTAGAAATAGCTAGCCAGGTGTTTGTAAACAGGGATGCAGTAAGCCGTAAGGAAANCGCAAAGAGAATGGAGGTCAGGCCCGGCGAAACGCGCCTGTTAGCTGCAGCAATCAGAGGGGCCCCCCCAAANGAGGCAAGGNNGAAGGGGGGCCCTGGGAAAGAAACTCAGCTTGGCTGTCAGAG
TTTGCAGCGTAACCAGTGTGCTTATTGTAAAGAAATAGGACAGTGGAAGAACAAATGCCCTCAGCTCAAAAGAAAACAAGGTGACTCAGAGCAGGAGGCCCCGGACAAGGAGGAAGGGGCCCTGCTCAACCTGGCAGAAGGGTTATTGGACTGAGGGAGACCGGGCTCAAGCGTCCCCAAAGAGCCTCTGGTCAGAATGACAGTCGGGGGTAGAGACATTGATTTTCTTGTAGATAGCGGTGCTGAACATTCGCTAGTAACCGCCCCGGTCGCCCCCTTATCCAAAAAGACTATTGACGTCATCGGAGCCACGGGGGTTTCAGCAAAGCAAGCTTTCTGCTTGCCTCGGACTTGTACTGTAGGAGGACATAAAGTCATTCATCAGTTTTNGTACATGCCTGACTGTCCCTTGCCCTTNTTGGGAAGGGACTTGCTCAGCAAGCTGAGAGCCACTATCTCTTTGACAGAGCACGGCTCTTTGCTGCTAAAGTTACCCGGAACGGGAGTCATTATGACCCTTACGGTCCCCCGAGAGGAGGAATGGAGACTTTTCTTAACTGAGCCGGGCCAAGAGAGAAGACCAGCTCTGGCTAAGCGGTGGCCAAGAGTACGGGCGGAAGACAACCCTCCGGGGTTGGCAGTCAACCGAGCCCNCGTACTCGTNGAAGTTAAGACTGGGGCCCAGCCGGTTAGGCAAAAACAGGACCCGGTCCCCAGAGAAGCTCTTCAAGGTATCCAGGTCCGTCTCAAGCACCTAAGAACTTTTGGAATTATNGTTCCTTGTCAGTCTCCATGGAACACTCCCCTCCTGCCTGTTCCCAAGCCACGGACCAAGGACTACNGGCCGGTACAGGATTTGCGCTTGCTTNATCAAGCTACACTGACTTTACATCCAACAGTACCTAACCCGTCCACATTGTTGGGGTTGCTGCCAGCTGAGGACAGCTGGTTCACCTGCTTGGACCTGAAAGACGCTTTCTTTCCTATCAGATTAGCCCCTGAGAGGCAGAAGCTGTTTGCCTTTCAGTGGGAAGATCCGGAGTCAGGTGTCACTACTCAGTACACTTGGACCGGGCTTCCCCAAGGGTTCAAGAACTCCCCCACCATCTTCGGGGAGGCGTTGGCTCGAGACCTCCAGAAGTTTCCCACCAGAGACCTAGGCTGCGTGTTGCTCCAGTAGGTTGATGACCTTCTGCTGGGACACCCCACGGCAGTCGGGTGGCCAAGGGAACGGATGCCCTACNCCGGCACCTGGAGGACTGTGGGTATAAGGTGTCCAAGAAGAAANGCTCAGATCTGCCGACGGCAGGTACGTTACTTGGGATTTACTATCCGACAGGGGGAACGCAGCCCGGGATCAGAAAGAAAGCAGGTCATTTGCAATCTACCGGAGCCTAAGAGCAGAAGGCAGGTGAGAGAATTCTTAGGAGCTGTGGGGTTTTGTAGACTGTGGATCCCAAACTTTGCAGTATTAGCCAAGCCTTTGTATGAGGTCACAAAGGGGGGGGACCGGGAACCTTTGGAATGGGGATCCCAACAACAGCAAGTCTTTCATGAGTTAAAGGAAAAACTTCTGGCAGCCCCAGCCCTGGGGCTACCCGATCTGACAAAGCCTTTTCCATTGTATGCGTCAGAGAGAGAAAAGATGGCAGCTGGACTTTTAACCCAAACTGTGGGGCCCTGGCCGAGGCCGGTGGCCTACCTCTCTAAACAACTAGACGGGGTTTCTAAAGGATGGCCCCCCTGTTTGAGGGCCTTGGCAGCAACTGCCCTGCTAGTACAAGAAGCAAATAAGCTGACTCTTGGGCAAAACCTGAACATAAAGGCCCCCCATGCTGTGGTGACTTTAATGAATACTAAAGGACATCATTGGCTAACGAATGCCAGACTCACCAAGTACCAAACTTTGCTCTGTGAAAATCCCCGTATAACCATTGAAGTTTGTAACACCCTACACCCCGCCACCTTGCTCCCGGTATCAGAGAGCCCTGTCGAGCNTG
ATTGTGTAGAAGTGTTGGACTCAGTTGACTCTGGGCATCAGTAGACTGGGAACTATACGTGGATGGGAGCAGCTTCNTCAACCCCCAAGGAGAGAGAGGTGCAGGGTATGCGGTGGTAACCCTGGACACTGTTGTTGAAGCCAGATCGTTGCCCCAGGCCACTTCAGCCCAGAAAGCTGAACTCATTGCTTTCATTCGGGCCTTAGAACTCAGTGAGGGTGAGACTGTCAACATTTACACTGATTCTCGGTATGCCTTTTTAACCCTTCAAGTGCATGGAGCGTGATAGAAAGAAAAGGGCCTATTGAACTCTGGGGGAAAAGACAGAAAATATCAACAAGAAATCTTGCAATTATTAGAAGCAGTATGGAAACCCCACAAGGTGGCAGTTATGCATTGCAGAGGACACCAGCGAGCTTCCACCTTGCTGGGTTTGGGGAATTCCCGCGCTGACTCAGAGGCTCGAAAAGCAGCATCTGCCCTTCCGGGCATCAGTGACAGCCCCCCTGCTCCCTCAAGCACCTGATCTTGGACCTACTTNTTCTAAAGAAGAAAAGGACTTTCTCCAGGTAGAGGGAGGACAAGTGATGGAGGAAGGATGGATTCGGTTACCAGATGGGAGAGTAGCTGTGCCACAGCTGCTAGGAGCTGCAGTTGTACTGGCTGTGCANGAAACCACCCATCGAGGTCAGGAGTCACTGGAAAAGTTGTTAGGCCGGTATTTCTACATCTCGCNTTTGTCAGCCCTTGCCAAAACGGTGAGGCAGCGGTGTGTTACCTGCCGACAGCATGATGCGAGGCAAGGTCCAGCCGTTCCGCCCGGCATACGAGCTTATGGAGCAGCCCCCTTTGAAGATCTCCAGGTGGACTTCACAGAGATGCCAAAGTGTGGAGGTAACAAGTATTTACTAGTTCTTGGGCGTACCTACTCTGGGTGGGTGGAGGCCTATCCAACACGAACTGAGAAAGCTCGTGAAGTAACCCGTGTGCTTCTTCGAGATCTGATTCCTAGATTTGGACTGCCCTTACGGATCGGCTCAGATAACGGGCCTGCGTTTNTGGCTGNCTTGGTACAGAAGACGGCAAAGGTATTGGGGATCACACGGAAACTGCATGCCGCCTCCCGGCCTCAGAGTTCCGGAAAGGTGGAGCGGATGAATCGGACTATCAAAAATAGTTTAGGGAAAGTATGTCAGGAAACAGGATTAAAATGGATACAGGCTCTCCCTATGGTATTATTTAAAATTAGATGTACCCCTTCTAAAAGAACAGGATATTCCCCTTATGAAATATTATATCATAGGCCCCCTCCCATATTGCGGGGACTTCCAGGCACTCCCCGAGAGTTAGGTGAAATTGAGTTACAGCGACAGCTACAGGCTTTAGGAAAAATTACACAAACAATCTCAGCCTGGGTAAATGAGAGATGCCCTGTTAGCTTATTCTCCCCAGTTCACCCTTTCTCCCCAGGTGATCGAGTGTGGATCAAGGACTGGAACGTAGCCTCTTTGTGCCCACGGTGGAAAGGACCCCAGACTGTCGTCCTGANCACTCCCACCGCTGTGAAGGTAGAAGGAATCCCAGCCTGGATCCACCACAGCCGTGTAAAACCTGCAGCGCCTGAAACCTGGGAGGCAAGACCAAGCCCGGACAACCCTTGCAGAGTGACCCTGAAGAAGACGACAAGCCCTGCTCCAGTCACACCCGGAAGCTGACTGGTCCACGCACGGCCGAAGCATGAGGAAGCTCATCGTGGGATTCATTTTTCTTAAATTTTGGACTTATACAGTAAGGGCTTCAACTGACCTTACTCAAACTGGGGACTGTTCCCAGTGTATTCATCAGGTCACCGAGGTAGGACAGCAAATTAAAACAATCTTTCTGTTCTATAGTTATTATGAATGTATGGGAACATTAAAAGAAACTTGTTTGTATAATGCCACTCAGTACAAGGTATGTAGCCCGGGAAATGACCGACCTGATGTGTGTTATAACCCATC
TGAGCCCCCTGCAACCACCGTTTTTGAAATAAGAATAAGAACTGGCCTTTTCCTAGGTGATACAAGTAAAATAATAACTAGAACAGAAGAAAAAGAAATCCCCAAGCAAATAACTTTAAGATTTGATGCTTGTGCAGCCATTAATAGTAAAAAGCTAGAAATAGGATGTGGTTCTCTTAACTGAGAAAGGAGCTANAGAGTAGAAAATAAATATGTTTGTCATGAGTCAGGGGTTTGTGAAAATTGTGCCTATTGGCCATGTGTTATTTAGGCTACTTAGAAAAAGAACAAAAAGGACCCGGTTCATCTTCAGAAGGGGGAAGCCAACCCCTCCTGTGCTGCCGGTCACTGTAACCCACTAGAACTAATAATTACCAATCCCCTAGATCCCCGTTGGAAAAAGGGAGAACGTGTAACCCTGGGGATCGATGGGACAGGGTTAAACCCCCAAGTTGCCATTTTAATTAGAGGGGAGGTCCACAAGCGCTCTCCCAAACCAGTATTTCAAACCTTTTATGAGGAGCTGAATCTGCCAGCACCAGAACTTCCGAAAAAGACAAAAAATTTGTTTCTCCAATTAGCAGAAAATGTAGCTCATTCCCTTAATGTTACTTCTTGTTATGTACGCGGGGGAACCACTATCGGAGACNGATGGCCTTGGGAAGCCCGAGAGTTGGTGCCTACTGATCCAGCTCCTGATATAATTCCAGTTCAGAAGGCCCAAGCTAGCAACTTCTAGGTCCTAAAAACCTCAATTATTAGACAATACTGTATAGCTAGAGAAGGGAAAGACTTTATCATCCCTGTAGGAAAGCTTAATTGTATAGGACAGAAGTTGTATAACAGCACAACAAAGACAATTACTTGGTGGGGCCTAAACCACACTGAAAAGAATCCATTTAGTAAATTTTCTAAATTAAAAACTGCTTGGGCTCATCCAGAATCTCATCAGGACTGGACGGCTCCCGCTGGACTATACTAGATATGTAGGCACAGAGCCTACATTCGGTTACCTAATAAATGGGCAGGCAGTTGTGTTATTGGCACTATTAAGCCGTCCTTTTTCTTATTACCCATAAAAACGGGTGAGCTCCTAGGTTTCCCTGTCTACGCCTCCCGAGAAAAGAAAGGCATAGTTATAGGAAACTGGAAAGATAATGAGTGGCCCCCTGAAAGGATCATNCAGTATTATGGGCCTGCCACATGGGCACAAGACGGCTCATGGGGATACCGAACCCCCATCTACATGCTCAATCGGATCATACGGTTGCAGGCCGTCTTAGAAATAATTACTAATGAAACTGGCAGAGCTTTGACTGTTTTAGCTTGGCAAGAAACCCAAATGAGGAATGCTATCTATCAGAATAGACTGGCCTTAGACTACTTGCTAGTAGCTGAAGGAGGAGTTTGTGGAAAATTTAACTTAACCAATTGCTGCCTACAAATAAATGATCAAGGACAGGTGGTTAAAAACATAGTCAGGGACATGACAAAGGTGGCACATGTGCCTGTACAGGTTTGGCACGAGTTTAATCCTGAGTCTTTATTTGGAAAATGGTTTCCAGCTATAGGAGGATTTAAAACCCTCATTGTAGGTGTATTGCTAGTGATAGGAACTTGCTTGCTGCTCCCCTGTGTATTACCCTTGCTTTTTCAAATGATAAAAGGTTTTGTTGCTACTTTGGTTCATCAGAAAACTTCAGCACACGTGTATTATATAAATCACTATCGCTCTATCTCACAAAGAGACTCAAAAAGTAAAGATGAGAGTGAGAACTCCCACTAAAAGTGAAAATNCTCAAAGGGGGGAAAA
|
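The gene boundaries quoted in the Comment field can be sanity-checked with a few lines of Python. This is a minimal sketch that assumes the coordinates are 1-based and inclusive (the page does not say so explicitly); the helper names `span_length` and `to_slice` are illustrative and not part of any database tooling.

```python
# Values copied from the table above.
CONSENSUS_LENGTH = 7813
GENES = {"Gag": (571, 2094), "Pol": (2095, 5655), "Env": (5678, 7750)}


def span_length(start, end):
    """Length of an interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


def to_slice(start, end):
    """Convert a 1-based inclusive interval to a 0-based Python slice."""
    return slice(start - 1, end)


gene_lengths = {name: span_length(*coords) for name, coords in GENES.items()}

for name, (start, end) in GENES.items():
    # Every quoted boundary should fall inside the consensus.
    assert 1 <= start <= end <= CONSENSUS_LENGTH
    codons, remainder = divmod(gene_lengths[name], 3)
    print(f"{name}: {gene_lengths[name]} bp, {codons} codons, remainder {remainder}")
```

Under that coordinate assumption, each of the three spans is a whole number of codons. Once the consensus is loaded as a single string, `to_slice` gives the indices needed to cut out a region, for example `consensus[to_slice(*GENES["Gag"])]`.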
TF motifs of the consensus sequence
FIMO was used to detect transcription factor motifs in the consensus sequence of the TE family.
TE_family | TFBS | Start | End | Strand | Score | Matched sequence |
---|---|---|---|---|---|---|
HERVE | MGP | 6551 | 6563 | + | 20.15 | AAAAAGACAAAAA |
HERVE | SGR5 | 6550 | 6562 | + | 19.76 | GAAAAAGACAAAA |
HERVE | lmd | 814 | 825 | + | 19.51 | GACCCCCCACAG |
HERVE | GLIS2 | 814 | 827 | + | 19.20 | GACCCCCCACAGTG |
HERVE | JKD | 6551 | 6562 | - | 19.06 | TTTTGTCTTTTT |
HERVE | sug | 814 | 825 | + | 19.03 | GACCCCCCACAG |
HERVE | IDD1 | 6551 | 6562 | - | 18.82 | TTTTGTCTTTTT |
HERVE | NUC | 6551 | 6562 | - | 18.38 | TTTTGTCTTTTT |
HERVE | IDD11 | 6551 | 6562 | + | 18.31 | AAAAAGACAAAA |
HERVE | GLIS3 | 814 | 827 | + | 18.20 | GACCCCCCACAGTG |
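The ten hits listed above fall on only two stretches of the consensus, and the minus-strand hits are reverse complements of the plus-strand sequence at the same coordinates. The sketch below makes both observations explicit. The tuples are copied from the table; the function names are illustrative, and the code assumes the Start and End columns are 1-based inclusive positions on the plus strand.

```python
COMPLEMENT = str.maketrans("ACGTN", "TGCAN")


def reverse_complement(seq):
    """Return the reverse complement of a DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


# (TFBS, start, end, strand, score, matched sequence), copied from the table.
HITS = [
    ("MGP", 6551, 6563, "+", 20.15, "AAAAAGACAAAAA"),
    ("SGR5", 6550, 6562, "+", 19.76, "GAAAAAGACAAAA"),
    ("lmd", 814, 825, "+", 19.51, "GACCCCCCACAG"),
    ("GLIS2", 814, 827, "+", 19.20, "GACCCCCCACAGTG"),
    ("JKD", 6551, 6562, "-", 19.06, "TTTTGTCTTTTT"),
    ("sug", 814, 825, "+", 19.03, "GACCCCCCACAG"),
    ("IDD1", 6551, 6562, "-", 18.82, "TTTTGTCTTTTT"),
    ("NUC", 6551, 6562, "-", 18.38, "TTTTGTCTTTTT"),
    ("IDD11", 6551, 6562, "+", 18.31, "AAAAAGACAAAA"),
    ("GLIS3", 814, 827, "+", 18.20, "GACCCCCCACAGTG"),
]


def plus_strand_sequence(hit):
    """Express a hit's matched sequence on the plus strand."""
    _, _, _, strand, _, matched = hit
    return matched if strand == "+" else reverse_complement(matched)


def merge_intervals(intervals):
    """Merge overlapping (start, end) intervals into sorted clusters."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


clusters = merge_intervals((h[1], h[2]) for h in HITS)
print(clusters)

# All hits spanning 6551-6562 describe the same plus-strand bases.
same_site = {plus_strand_sequence(h) for h in HITS if (h[1], h[2]) == (6551, 6562)}
print(same_site)
```

Collapsing overlapping hits this way is a quick reminder that several high-scoring motifs at one position usually reflect similar motif models matching the same bases, not independent binding sites.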
TFBS enrichment in GRCh38
Fisher's exact test was used to test for enrichment of transcription factor binding sites in the TE family in GRCh38.
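For readers unfamiliar with the test, here is a minimal sketch of a one-sided Fisher's exact test on a 2x2 table, written with the standard library only. The page does not state how the contingency table is built or whether the test is one- or two-sided, so the table layout, the function name `fisher_exact_greater`, and the counts in the usage example are all assumptions made for illustration.

```python
from math import comb


def fisher_exact_greater(a, b, c, d):
    """One-sided Fisher's exact p-value for the 2x2 table

                      with TFBS   without TFBS
        TE family         a            b
        background        c            d

    It is the hypergeometric probability of seeing at least `a`
    TE-family elements with a binding site, with all margins fixed.
    """
    row1, row2 = a + b, c + d
    col1 = a + c
    total_tables = comb(row1 + row2, col1)
    k_max = min(row1, col1)
    favourable = sum(
        comb(row1, k) * comb(row2, col1 - k) for k in range(a, k_max + 1)
    )
    return favourable / total_tables


# HYPOTHETICAL counts, not taken from the database:
# 30 of 100 TE copies carry the site versus 200 of 2000 background regions.
p_value = fisher_exact_greater(30, 70, 200, 1800)
print(f"one-sided p-value: {p_value:.3g}")
```

Exact integer arithmetic with `math.comb` avoids floating-point underflow in the individual terms; for genome-scale counts a library routine such as `scipy.stats.fisher_exact` would be the more practical choice.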