HERVE
Basic information Differential Expression Stage analysis Survival analysis Correlation analysisDF ID | DF0000174 |
---|---|
TE superfamily | ERV1 |
TE class | LTR |
Species | Catarrhini |
Length | 7813 |
Kimura value | 6.13 |
Tau index | 0.9707 |
Description | Internal region of an ERV1 endogenous retrovirus, HERVE subfamily |
Comment | Associated long terminal repeat is LTR2. The gag and pol genes are ~40% similar to Moloney murine leukemia virus (MoMuLV). Some unique characteristics of the endogenous human retroviral DNA included a tRNA Glu primer binding site separated from the 5' LTR by a pentanucleotide and a putative env sequence which does not appear to overlap the C terminus of pol and has virtually no homology with the env gene of known infectious retroviruses. The reconstructed (though still pseudogenic) gene boundaries are: Gag: 571-2094, Pol: 2095-5655, Env: 5678-7750. |
Sequence |
TTTCTTGGTTCCCTGACCGGGAAGCGAGGTAATTGACGGACGGTCGAGGCAGCCCCTTAGGCGGCTTAGGCCTGCCCTGTGGAGCATCCCTGCGGGGGACTCCGGCCAGCTTGAGCGACGCGGATCCTGAGAGCGCTCCCGGGTAGGCAATTGCCCCGGTGGAACGCCTCGTCAGAGAGTGCGTGGCAGGCCCCCGTGGAGGATCAACGCAGTGGCTGAACACCGGGAAGGAACGGGCACTTGGAGTCCGGACATTTGAAACTTGGTAAGACTGGTCTTTGGAACTTGCCCACTCCATTTGAGTGGAAGCGTGGCCTGATCACCCACGGCGTGCCTGTACCGGCACTTTGGTTTTTGTTTTTGACTTGACTTGAATTGCTTGATACTTTGGTTTTGGTTTGACCTGGCTTGGATTTCTGGATACTCTGATTTTGGTTTTGATTCTGGTTTGGTGAAAACTGAAAAAGTGTGTGTGTGCCCTTTTTACCCATTCTTTGTTCTGTGGTGTGCGTGTGGTGTGAGCTTGGTGTTTTGTCTCGAGGAAACGTGGGTCAGACACAAAGTAAGCCTACTCCGCTAGGAACTATGTTGAAAAATTTTAAGAAAGGATTTAATGGAGACTATGGGGTTACTATGACACCAGGGAAACTTAGAACTTTGTGTGAAATAGATTGGCCAACATTAGAAGTGGGTTGGCCATCAGAAGGAAGCCTGGACAGGTCCCTTGTTTCTAAGGTATGGCACAAGGTAACTGGTAAGTCAGGACACCCAGACCAGTTCCCATACATAGACACTTGGTTACAGCTGGTGCTAGACCCCCCACAGTGGCTAAGAGGGCAGGCAGCAGCAGTGCTAGTAGCAAAGGGACAGATAGCCAAGGAAGGATCCCGCTCCACCCGCCGAGGGAAATCAACTCCTGAAGTTCTGTTCGACCCAACATCAGAAGATCCATTGCAGGAGATGGCACCAGTGATCCCAGTGGTGCCCTCCCCTTACCAGGGAGAGAGGCTCCCCACTCTTGAGCCCACAGTGCTTGCGCCTCCGCAAGACAAACATATCCCTAGGCCACCCAGAGTAGACAAGAGAGGAGGTGAAGACTCGGGAGAAACCCCTCCCTCGGCAGCTCGTTTACGACCCAAAACGGGGATACAAATGCCCCTGAGAGAGCAGCGGTATACTGGGATAGATGAGGATGGTCACGTGGTGGAGAGGCGTGTTTTTGGGTACCAGCCCTTCACCTCCGCCGACCTTCTCAACTGGAAAAACAATACCCCGTCCTATACCGAAAAGCCACAAGCCCTAATTGATTTGCTCCAAACTGTTATCCAGACCCACAACCCCACCTGGGCTGATTGCCACCAGTTGCTCATGTTCCTCTTTAACAGAGATGAAAGGCGGAGAGTGCTCCAAGCAGCAACTAAGTGGCTAGAGGAACATGCACCAGCTGATTATCAAAACCCCCAAGAGTATGGAAGGACCCAGTTACCAGGAACCGACCCCCAGTTGGACCCACATGAAAGAGAGGATATGCAAAGGCTAAACCGAGACAGGGAAGCTCTCTTGGAAGGATTAANGAGGGGAGCTCAGAAGGCCACAAACGTTAACAAGGTCTCTGAGGTCATTCAGGGAAAAGAAGAAAGTCCAGCACAATTCTACGAGAGACTGTGTGAGGCCTATCGTATGTATACTCCCTTTGATCCCGATAGCCCTGAAAATCAGCGCATGATTCACATGGCTTTAGTCCGTCAAAGCGCAGAAGACATGAGAAGAAAACTGCAGAAACAGGCTGGGCTTGCAGGGATGAATACATCACAATTACTAGAAATAGCTAGCCAGGTGTTTGTAAACAGGGATGCAGTAAGCCGTAAGGAAANCGCAAAGAGAATGGAGGTCAGGCCCGGCGAAACGCGCCTGTTAGCTGCAGCAATCAGAGGGGCCCCCCCAAANGAGGCAAGGNNGAAGGGGGGCCCTGGGAAAGAAACTCAGCTTGGCTGTCAGAGTTTGCAGCGTAACCAGTGTGCTTATTGTAAAGAAATAGGACAGTGGAAGAACAAATGCCCTCAGCTCAAAAGAAAACAAGGTGACTCAGAGCAGGAGGCCCCGGACAAGGAGGAAGGGGCCCTGCTCAACCTGGCAGAAGGGTTATTGGACTGAGGGAGACCGGGCTCAAGCGTCCCCAAAGAGCCTCTGGTCAGAATGACAGTCGGGGGTAGAGACATTGATTTTCTTGTAGATAGCGGTGCTGAACATTCGCTAGTAACCGCCCCGGTCGCCCCCTTATCCAAAAAGACTATTGACGTCATCGGAGCCACGGGGGTTTCAGCAAAGCAAGCTTTCTGCTTGCCTCGGACTTGTACTGTAGGAGGACATAAAGTCATTCATCAGTTTTNGTACATGCCTGACTGTCCCTTGCCCTTNTTGGGAAGGGACTTGCTCAGCAAGCTGAGAGCCACTATCTCTTTGACAGAGCACGGCTCTTTGCTGCTAAAGTTACCCGGAACGGGAGTCATTATGACCCTTACGGTCCCCCGAGAGGAGGAATGGAGACTTTTCTTAACTGAGCCGGGCCAAGAGAGAAGACCAGCTCTGGCTAAGCGGTGGCCAAGAGTACGGGCGGAAGACAACCCTCCGGGGTTGGCAGTCAACCGAGCCCNCGTACTCGTNGAAGTTAAGACTGGGGCCCAGCCGGTTAGGCAAAAACAGGACCCGGTCCCCAGAGAAGCTCTTCAAGGTATCCAGGTCCGTCTCAAGCACCTAAGAACTTTTGGAATTATNGTTCCTTGTCAGTCTCCATGGAACACTCCCCTCCTGCCTGTTCCCAAGCCACGGACCAAGGACTACNGGCCGGTACAGGATTTGCGCTTGCTTNATCAAGCTACACTGACTTTACATCCAACAGTACCTAACCCGTCCACATTGTTGGGGTTGCTGCCAGCTGAGGACAGCTGGTTCACCTGCTTGGACCTGAAAGACGCTTTCTTTCCTATCAGATTAGCCCCTGAGAGGCAGAAGCTGTTTGCCTTTCAGTGGGAAGATCCGGAGTCAGGTGTCACTACTCAGTACACTTGGACCGGGCTTCCCCAAGGGTTCAAGAACTCCCCCACCATCTTCGGGGAGGCGTTGGCTCGAGACCTCCAGAAGTTTCCCACCAGAGACCTAGGCTGCGTGTTGCTCCAGTAGGTTGATGACCTTCTGCTGGGACACCCCACGGCAGTCGGGTGGCCAAGGGAACGGATGCCCTACNCCGGCACCTGGAGGACTGTGGGTATAAGGTGTCCAAGAAGAAANGCTCAGATCTGCCGACGGCAGGTACGTTACTTGGGATTTACTATCCGACAGGGGGAACGCAGCCCGGGATCAGAAAGAAAGCAGGTCATTTGCAATCTACCGGAGCCTAAGAGCAGAAGGCAGGTGAGAGAATTCTTAGGAGCTGTGGGGTTTTGTAGACTGTGGATCCCAAACTTTGCAGTATTAGCCAAGCCTTTGTATGAGGTCACAAAGGGGGGGGACCGGGAACCTTTGGAATGGGGATCCCAACAACAGCAAGTCTTTCATGAGTTAAAGGAAAAACTTCTGGCAGCCCCAGCCCTGGGGCTACCCGATCTGACAAAGCCTTTTCCATTGTATGCGTCAGAGAGAGAAAAGATGGCAGCTGGACTTTTAACCCAAACTGTGGGGCCCTGGCCGAGGCCGGTGGCCTACCTCTCTAAACAACTAGACGGGGTTTCTAAAGGATGGCCCCCCTGTTTGAGGGCCTTGGCAGCAACTGCCCTGCTAGTACAAGAAGCAAATAAGCTGACTCTTGGGCAAAACCTGAACATAAAGGCCCCCCATGCTGTGGTGACTTTAATGAATACTAAAGGACATCATTGGCTAACGAATGCCAGACTCACCAAGTACCAAACTTTGCTCTGTGAAAATCCCCGTATAACCATTGAAGTTTGTAACACCCTACACCCCGCCACCTTGCTCCCGGTATCAGAGAGCCCTGTCGAGCNTGATTGTGTAGAAGTGTTGGACTCAGTTGACTCTGGGCATCAGTAGACTGGGAACTATACGTGGATGGGAGCAGCTTCNTCAACCCCCAAGGAGAGAGAGGTGCAGGGTATGCGGTGGTAACCCTGGACACTGTTGTTGAAGCCAGATCGTTGCCCCAGGCCACTTCAGCCCAGAAAGCTGAACTCATTGCTTTCATTCGGGCCTTAGAACTCAGTGAGGGTGAGACTGTCAACATTTACACTGATTCTCGGTATGCCTTTTTAACCCTTCAAGTGCATGGAGCGTGATAGAAAGAAAAGGGCCTATTGAACTCTGGGGGAAAAGACAGAAAATATCAACAAGAAATCTTGCAATTATTAGAAGCAGTATGGAAACCCCACAAGGTGGCAGTTATGCATTGCAGAGGACACCAGCGAGCTTCCACCTTGCTGGGTTTGGGGAATTCCCGCGCTGACTCAGAGGCTCGAAAAGCAGCATCTGCCCTTCCGGGCATCAGTGACAGCCCCCCTGCTCCCTCAAGCACCTGATCTTGGACCTACTTNTTCTAAAGAAGAAAAGGACTTTCTCCAGGTAGAGGGAGGACAAGTGATGGAGGAAGGATGGATTCGGTTACCAGATGGGAGAGTAGCTGTGCCACAGCTGCTAGGAGCTGCAGTTGTACTGGCTGTGCANGAAACCACCCATCGAGGTCAGGAGTCACTGGAAAAGTTGTTAGGCCGGTATTTCTACATCTCGCNTTTGTCAGCCCTTGCCAAAACGGTGAGGCAGCGGTGTGTTACCTGCCGACAGCATGATGCGAGGCAAGGTCCAGCCGTTCCGCCCGGCATACGAGCTTATGGAGCAGCCCCCTTTGAAGATCTCCAGGTGGACTTCACAGAGATGCCAAAGTGTGGAGGTAACAAGTATTTACTAGTTCTTGGGCGTACCTACTCTGGGTGGGTGGAGGCCTATCCAACACGAACTGAGAAAGCTCGTGAAGTAACCCGTGTGCTTCTTCGAGATCTGATTCCTAGATTTGGACTGCCCTTACGGATCGGCTCAGATAACGGGCCTGCGTTTNTGGCTGNCTTGGTACAGAAGACGGCAAAGGTATTGGGGATCACACGGAAACTGCATGCCGCCTCCCGGCCTCAGAGTTCCGGAAAGGTGGAGCGGATGAATCGGACTATCAAAAATAGTTTAGGGAAAGTATGTCAGGAAACAGGATTAAAATGGATACAGGCTCTCCCTATGGTATTATTTAAAATTAGATGTACCCCTTCTAAAAGAACAGGATATTCCCCTTATGAAATATTATATCATAGGCCCCCTCCCATATTGCGGGGACTTCCAGGCACTCCCCGAGAGTTAGGTGAAATTGAGTTACAGCGACAGCTACAGGCTTTAGGAAAAATTACACAAACAATCTCAGCCTGGGTAAATGAGAGATGCCCTGTTAGCTTATTCTCCCCAGTTCACCCTTTCTCCCCAGGTGATCGAGTGTGGATCAAGGACTGGAACGTAGCCTCTTTGTGCCCACGGTGGAAAGGACCCCAGACTGTCGTCCTGANCACTCCCACCGCTGTGAAGGTAGAAGGAATCCCAGCCTGGATCCACCACAGCCGTGTAAAACCTGCAGCGCCTGAAACCTGGGAGGCAAGACCAAGCCCGGACAACCCTTGCAGAGTGACCCTGAAGAAGACGACAAGCCCTGCTCCAGTCACACCCGGAAGCTGACTGGTCCACGCACGGCCGAAGCATGAGGAAGCTCATCGTGGGATTCATTTTTCTTAAATTTTGGACTTATACAGTAAGGGCTTCAACTGACCTTACTCAAACTGGGGACTGTTCCCAGTGTATTCATCAGGTCACCGAGGTAGGACAGCAAATTAAAACAATCTTTCTGTTCTATAGTTATTATGAATGTATGGGAACATTAAAAGAAACTTGTTTGTATAATGCCACTCAGTACAAGGTATGTAGCCCGGGAAATGACCGACCTGATGTGTGTTATAACCCATCTGAGCCCCCTGCAACCACCGTTTTTGAAATAAGAATAAGAACTGGCCTTTTCCTAGGTGATACAAGTAAAATAATAACTAGAACAGAAGAAAAAGAAATCCCCAAGCAAATAACTTTAAGATTTGATGCTTGTGCAGCCATTAATAGTAAAAAGCTAGAAATAGGATGTGGTTCTCTTAACTGAGAAAGGAGCTANAGAGTAGAAAATAAATATGTTTGTCATGAGTCAGGGGTTTGTGAAAATTGTGCCTATTGGCCATGTGTTATTTAGGCTACTTAGAAAAAGAACAAAAAGGACCCGGTTCATCTTCAGAAGGGGGAAGCCAACCCCTCCTGTGCTGCCGGTCACTGTAACCCACTAGAACTAATAATTACCAATCCCCTAGATCCCCGTTGGAAAAAGGGAGAACGTGTAACCCTGGGGATCGATGGGACAGGGTTAAACCCCCAAGTTGCCATTTTAATTAGAGGGGAGGTCCACAAGCGCTCTCCCAAACCAGTATTTCAAACCTTTTATGAGGAGCTGAATCTGCCAGCACCAGAACTTCCGAAAAAGACAAAAAATTTGTTTCTCCAATTAGCAGAAAATGTAGCTCATTCCCTTAATGTTACTTCTTGTTATGTACGCGGGGGAACCACTATCGGAGACNGATGGCCTTGGGAAGCCCGAGAGTTGGTGCCTACTGATCCAGCTCCTGATATAATTCCAGTTCAGAAGGCCCAAGCTAGCAACTTCTAGGTCCTAAAAACCTCAATTATTAGACAATACTGTATAGCTAGAGAAGGGAAAGACTTTATCATCCCTGTAGGAAAGCTTAATTGTATAGGACAGAAGTTGTATAACAGCACAACAAAGACAATTACTTGGTGGGGCCTAAACCACACTGAAAAGAATCCATTTAGTAAATTTTCTAAATTAAAAACTGCTTGGGCTCATCCAGAATCTCATCAGGACTGGACGGCTCCCGCTGGACTATACTAGATATGTAGGCACAGAGCCTACATTCGGTTACCTAATAAATGGGCAGGCAGTTGTGTTATTGGCACTATTAAGCCGTCCTTTTTCTTATTACCCATAAAAACGGGTGAGCTCCTAGGTTTCCCTGTCTACGCCTCCCGAGAAAAGAAAGGCATAGTTATAGGAAACTGGAAAGATAATGAGTGGCCCCCTGAAAGGATCATNCAGTATTATGGGCCTGCCACATGGGCACAAGACGGCTCATGGGGATACCGAACCCCCATCTACATGCTCAATCGGATCATACGGTTGCAGGCCGTCTTAGAAATAATTACTAATGAAACTGGCAGAGCTTTGACTGTTTTAGCTTGGCAAGAAACCCAAATGAGGAATGCTATCTATCAGAATAGACTGGCCTTAGACTACTTGCTAGTAGCTGAAGGAGGAGTTTGTGGAAAATTTAACTTAACCAATTGCTGCCTACAAATAAATGATCAAGGACAGGTGGTTAAAAACATAGTCAGGGACATGACAAAGGTGGCACATGTGCCTGTACAGGTTTGGCACGAGTTTAATCCTGAGTCTTTATTTGGAAAATGGTTTCCAGCTATAGGAGGATTTAAAACCCTCATTGTAGGTGTATTGCTAGTGATAGGAACTTGCTTGCTGCTCCCCTGTGTATTACCCTTGCTTTTTCAAATGATAAAAGGTTTTGTTGCTACTTTGGTTCATCAGAAAACTTCAGCACACGTGTATTATATAAATCACTATCGCTCTATCTCACAAAGAGACTCAAAAAGTAAAGATGAGAGTGAGAACTCCCACTAAAAGTGAAAATNCTCAAAGGGGGGAAAA
|
TF motifs of the concenus sequence
Use FIMO to detect transcription factor motifs in the concenus sequence of the TE family.
TE_family | TFBS | Start | End | Strand | Score | Matched sequence |
---|---|---|---|---|---|---|
HERVE | BPC5 | 7747 | 7776 | + | -41.77 | AGACTCAAAAAGTAAAGATGAGAGTGAGAA |
HERVE | BPC5 | 2194 | 2223 | + | -43.07 | AGAATGACAGTCGGGGGTAGAGACATTGAT |
HERVE | BPC5 | 4280 | 4309 | + | -44.24 | AGCGTGATAGAAAGAAAAGGGCCTATTGAA |
HERVE | BPC5 | 7743 | 7772 | + | -44.70 | AAAGAGACTCAAAAAGTAAAGATGAGAGTG |
HERVE | BPC5 | 467 | 496 | - | -46.91 | AAAGAATGGGTAAAAAGGGCACACACACAC |
HERVE | BPC5 | 7714 | 7743 | - | -48.57 | TGTGAGATAGAGCGATAGTGATTTATATAA |
HERVE | BPC5 | 5350 | 5379 | + | -49.22 | GGTGAAATTGAGTTACAGCGACAGCTACAG |
TFBS enrichment in GRCh38
Use Fisher's exact test to perform enrichment analysis of transcription factor binding sites in the TE family of GRCh38.