HERVE
DF ID | DF0000174 |
---|---|
TE superfamily | ERV1 |
TE class | LTR |
Species | Catarrhini |
Length | 7813 |
Kimura value | 6.13 |
Tau index | 0.9707 |
Description | Internal region of an ERV1 endogenous retrovirus, HERVE subfamily |
Comment | The associated long terminal repeat is LTR2. The gag and pol genes are ~40% similar to those of Moloney murine leukemia virus (MoMuLV). Distinctive features of this human endogenous retroviral DNA include a tRNA-Glu primer binding site separated from the 5' LTR by a pentanucleotide, and a putative env sequence that does not appear to overlap the C terminus of pol and has virtually no homology with the env gene of known infectious retroviruses. The reconstructed (though still pseudogenic) gene boundaries are: Gag: 571-2094, Pol: 2095-5655, Env: 5678-7750. |
Sequence |
TTTCTTGGTTCCCTGACCGGGAAGCGAGGTAATTGACGGACGGTCGAGGCAGCCCCTTAGGCGGCTTAGGCCTGCCCTGTGGAGCATCCCTGCGGGGGACTCCGGCCAGCTTGAGCGACGCGGATCCTGAGAGCGCTCCCGGGTAGGCAATTGCCCCGGTGGAACGCCTCGTCAGAGAGTGCGTGGCAGGCCCCCGTGGAGGATCAACGCAGTGGCTGAACACCGGGAAGGAACGGGCACTTGGAGTCCGGACATTTGAAACTTGGTAAGACTGGTCTTTGGAACTTGCCCACTCCATTTGAGTGGAAGCGTGGCCTGATCACCCACGGCGTGCCTGTACCGGCACTTTGGTTTTTGTTTTTGACTTGACTTGAATTGCTTGATACTTTGGTTTTGGTTTGACCTGGCTTGGATTTCTGGATACTCTGATTTTGGTTTTGATTCTGGTTTGGTGAAAACTGAAAAAGTGTGTGTGTGCCCTTTTTACCCATTCTTTGTTCTGTGGTGTGCGTGTGGTGTGAGCTTGGTGTTTTGTCTCGAGGAAACGTGGGTCAGACACAAAGTAAGCCTACTCCGCTAGGAACTATGTTGAAAAATTTTAAGAAAGGATTTAATGGAGACTATGGGGTTACTATGACACCAGGGAAACTTAGAACTTTGTGTGAAATAGATTGGCCAACATTAGAAGTGGGTTGGCCATCAGAAGGAAGCCTGGACAGGTCCCTTGTTTCTAAGGTATGGCACAAGGTAACTGGTAAGTCAGGACACCCAGACCAGTTCCCATACATAGACACTTGGTTACAGCTGGTGCTAGACCCCCCACAGTGGCTAAGAGGGCAGGCAGCAGCAGTGCTAGTAGCAAAGGGACAGATAGCCAAGGAAGGATCCCGCTCCACCCGCCGAGGGAAATCAACTCCTGAAGTTCTGTTCGACCCAACATCAGAAGATCCATTGCAGGAGATGGCACCAGTGATCCCAGTGGTGCCCTCCCCTTACCAGGGAGAGAGGCTCCCCACTCTTGAGCCCACAGTGCTTGCGCCTCCGCAAGACAAACATATCCCTAGGCCACCCAGAGTAGACAAGAGAGGAGGTGAAGACTCGGGAGAAACCCCTCCCTCGGCAGCTCGTTTACGACCCAAAACGGGGATACAAATGCCCCTGAGAGAGCAGCGGTATACTGGGATAGATGAGGATGGTCACGTGGTGGAGAGGCGTGTTTTTGGGTACCAGCCCTTCACCTCCGCCGACCTTCTCAACTGGAAAAACAATACCCCGTCCTATACCGAAAAGCCACAAGCCCTAATTGATTTGCTCCAAACTGTTATCCAGACCCACAACCCCACCTGGGCTGATTGCCACCAGTTGCTCATGTTCCTCTTTAACAGAGATGAAAGGCGGAGAGTGCTCCAAGCAGCAACTAAGTGGCTAGAGGAACATGCACCAGCTGATTATCAAAACCCCCAAGAGTATGGAAGGACCCAGTTACCAGGAACCGACCCCCAGTTGGACCCACATGAAAGAGAGGATATGCAAAGGCTAAACCGAGACAGGGAAGCTCTCTTGGAAGGATTAANGAGGGGAGCTCAGAAGGCCACAAACGTTAACAAGGTCTCTGAGGTCATTCAGGGAAAAGAAGAAAGTCCAGCACAATTCTACGAGAGACTGTGTGAGGCCTATCGTATGTATACTCCCTTTGATCCCGATAGCCCTGAAAATCAGCGCATGATTCACATGGCTTTAGTCCGTCAAAGCGCAGAAGACATGAGAAGAAAACTGCAGAAACAGGCTGGGCTTGCAGGGATGAATACATCACAATTACTAGAAATAGCTAGCCAGGTGTTTGTAAACAGGGATGCAGTAAGCCGTAAGGAAANCGCAAAGAGAATGGAGGTCAGGCCCGGCGAAACGCGCCTGTTAGCTGCAGCAATCAGAGGGGCCCCCCCAAANGAGGCAAGGNNGAAGGGGGGCCCTGGGAAAGAAACTCAGCTTGGCTGTCAGAG
TTTGCAGCGTAACCAGTGTGCTTATTGTAAAGAAATAGGACAGTGGAAGAACAAATGCCCTCAGCTCAAAAGAAAACAAGGTGACTCAGAGCAGGAGGCCCCGGACAAGGAGGAAGGGGCCCTGCTCAACCTGGCAGAAGGGTTATTGGACTGAGGGAGACCGGGCTCAAGCGTCCCCAAAGAGCCTCTGGTCAGAATGACAGTCGGGGGTAGAGACATTGATTTTCTTGTAGATAGCGGTGCTGAACATTCGCTAGTAACCGCCCCGGTCGCCCCCTTATCCAAAAAGACTATTGACGTCATCGGAGCCACGGGGGTTTCAGCAAAGCAAGCTTTCTGCTTGCCTCGGACTTGTACTGTAGGAGGACATAAAGTCATTCATCAGTTTTNGTACATGCCTGACTGTCCCTTGCCCTTNTTGGGAAGGGACTTGCTCAGCAAGCTGAGAGCCACTATCTCTTTGACAGAGCACGGCTCTTTGCTGCTAAAGTTACCCGGAACGGGAGTCATTATGACCCTTACGGTCCCCCGAGAGGAGGAATGGAGACTTTTCTTAACTGAGCCGGGCCAAGAGAGAAGACCAGCTCTGGCTAAGCGGTGGCCAAGAGTACGGGCGGAAGACAACCCTCCGGGGTTGGCAGTCAACCGAGCCCNCGTACTCGTNGAAGTTAAGACTGGGGCCCAGCCGGTTAGGCAAAAACAGGACCCGGTCCCCAGAGAAGCTCTTCAAGGTATCCAGGTCCGTCTCAAGCACCTAAGAACTTTTGGAATTATNGTTCCTTGTCAGTCTCCATGGAACACTCCCCTCCTGCCTGTTCCCAAGCCACGGACCAAGGACTACNGGCCGGTACAGGATTTGCGCTTGCTTNATCAAGCTACACTGACTTTACATCCAACAGTACCTAACCCGTCCACATTGTTGGGGTTGCTGCCAGCTGAGGACAGCTGGTTCACCTGCTTGGACCTGAAAGACGCTTTCTTTCCTATCAGATTAGCCCCTGAGAGGCAGAAGCTGTTTGCCTTTCAGTGGGAAGATCCGGAGTCAGGTGTCACTACTCAGTACACTTGGACCGGGCTTCCCCAAGGGTTCAAGAACTCCCCCACCATCTTCGGGGAGGCGTTGGCTCGAGACCTCCAGAAGTTTCCCACCAGAGACCTAGGCTGCGTGTTGCTCCAGTAGGTTGATGACCTTCTGCTGGGACACCCCACGGCAGTCGGGTGGCCAAGGGAACGGATGCCCTACNCCGGCACCTGGAGGACTGTGGGTATAAGGTGTCCAAGAAGAAANGCTCAGATCTGCCGACGGCAGGTACGTTACTTGGGATTTACTATCCGACAGGGGGAACGCAGCCCGGGATCAGAAAGAAAGCAGGTCATTTGCAATCTACCGGAGCCTAAGAGCAGAAGGCAGGTGAGAGAATTCTTAGGAGCTGTGGGGTTTTGTAGACTGTGGATCCCAAACTTTGCAGTATTAGCCAAGCCTTTGTATGAGGTCACAAAGGGGGGGGACCGGGAACCTTTGGAATGGGGATCCCAACAACAGCAAGTCTTTCATGAGTTAAAGGAAAAACTTCTGGCAGCCCCAGCCCTGGGGCTACCCGATCTGACAAAGCCTTTTCCATTGTATGCGTCAGAGAGAGAAAAGATGGCAGCTGGACTTTTAACCCAAACTGTGGGGCCCTGGCCGAGGCCGGTGGCCTACCTCTCTAAACAACTAGACGGGGTTTCTAAAGGATGGCCCCCCTGTTTGAGGGCCTTGGCAGCAACTGCCCTGCTAGTACAAGAAGCAAATAAGCTGACTCTTGGGCAAAACCTGAACATAAAGGCCCCCCATGCTGTGGTGACTTTAATGAATACTAAAGGACATCATTGGCTAACGAATGCCAGACTCACCAAGTACCAAACTTTGCTCTGTGAAAATCCCCGTATAACCATTGAAGTTTGTAACACCCTACACCCCGCCACCTTGCTCCCGGTATCAGAGAGCCCTGTCGAGCNTG
ATTGTGTAGAAGTGTTGGACTCAGTTGACTCTGGGCATCAGTAGACTGGGAACTATACGTGGATGGGAGCAGCTTCNTCAACCCCCAAGGAGAGAGAGGTGCAGGGTATGCGGTGGTAACCCTGGACACTGTTGTTGAAGCCAGATCGTTGCCCCAGGCCACTTCAGCCCAGAAAGCTGAACTCATTGCTTTCATTCGGGCCTTAGAACTCAGTGAGGGTGAGACTGTCAACATTTACACTGATTCTCGGTATGCCTTTTTAACCCTTCAAGTGCATGGAGCGTGATAGAAAGAAAAGGGCCTATTGAACTCTGGGGGAAAAGACAGAAAATATCAACAAGAAATCTTGCAATTATTAGAAGCAGTATGGAAACCCCACAAGGTGGCAGTTATGCATTGCAGAGGACACCAGCGAGCTTCCACCTTGCTGGGTTTGGGGAATTCCCGCGCTGACTCAGAGGCTCGAAAAGCAGCATCTGCCCTTCCGGGCATCAGTGACAGCCCCCCTGCTCCCTCAAGCACCTGATCTTGGACCTACTTNTTCTAAAGAAGAAAAGGACTTTCTCCAGGTAGAGGGAGGACAAGTGATGGAGGAAGGATGGATTCGGTTACCAGATGGGAGAGTAGCTGTGCCACAGCTGCTAGGAGCTGCAGTTGTACTGGCTGTGCANGAAACCACCCATCGAGGTCAGGAGTCACTGGAAAAGTTGTTAGGCCGGTATTTCTACATCTCGCNTTTGTCAGCCCTTGCCAAAACGGTGAGGCAGCGGTGTGTTACCTGCCGACAGCATGATGCGAGGCAAGGTCCAGCCGTTCCGCCCGGCATACGAGCTTATGGAGCAGCCCCCTTTGAAGATCTCCAGGTGGACTTCACAGAGATGCCAAAGTGTGGAGGTAACAAGTATTTACTAGTTCTTGGGCGTACCTACTCTGGGTGGGTGGAGGCCTATCCAACACGAACTGAGAAAGCTCGTGAAGTAACCCGTGTGCTTCTTCGAGATCTGATTCCTAGATTTGGACTGCCCTTACGGATCGGCTCAGATAACGGGCCTGCGTTTNTGGCTGNCTTGGTACAGAAGACGGCAAAGGTATTGGGGATCACACGGAAACTGCATGCCGCCTCCCGGCCTCAGAGTTCCGGAAAGGTGGAGCGGATGAATCGGACTATCAAAAATAGTTTAGGGAAAGTATGTCAGGAAACAGGATTAAAATGGATACAGGCTCTCCCTATGGTATTATTTAAAATTAGATGTACCCCTTCTAAAAGAACAGGATATTCCCCTTATGAAATATTATATCATAGGCCCCCTCCCATATTGCGGGGACTTCCAGGCACTCCCCGAGAGTTAGGTGAAATTGAGTTACAGCGACAGCTACAGGCTTTAGGAAAAATTACACAAACAATCTCAGCCTGGGTAAATGAGAGATGCCCTGTTAGCTTATTCTCCCCAGTTCACCCTTTCTCCCCAGGTGATCGAGTGTGGATCAAGGACTGGAACGTAGCCTCTTTGTGCCCACGGTGGAAAGGACCCCAGACTGTCGTCCTGANCACTCCCACCGCTGTGAAGGTAGAAGGAATCCCAGCCTGGATCCACCACAGCCGTGTAAAACCTGCAGCGCCTGAAACCTGGGAGGCAAGACCAAGCCCGGACAACCCTTGCAGAGTGACCCTGAAGAAGACGACAAGCCCTGCTCCAGTCACACCCGGAAGCTGACTGGTCCACGCACGGCCGAAGCATGAGGAAGCTCATCGTGGGATTCATTTTTCTTAAATTTTGGACTTATACAGTAAGGGCTTCAACTGACCTTACTCAAACTGGGGACTGTTCCCAGTGTATTCATCAGGTCACCGAGGTAGGACAGCAAATTAAAACAATCTTTCTGTTCTATAGTTATTATGAATGTATGGGAACATTAAAAGAAACTTGTTTGTATAATGCCACTCAGTACAAGGTATGTAGCCCGGGAAATGACCGACCTGATGTGTGTTATAACCCATC
TGAGCCCCCTGCAACCACCGTTTTTGAAATAAGAATAAGAACTGGCCTTTTCCTAGGTGATACAAGTAAAATAATAACTAGAACAGAAGAAAAAGAAATCCCCAAGCAAATAACTTTAAGATTTGATGCTTGTGCAGCCATTAATAGTAAAAAGCTAGAAATAGGATGTGGTTCTCTTAACTGAGAAAGGAGCTANAGAGTAGAAAATAAATATGTTTGTCATGAGTCAGGGGTTTGTGAAAATTGTGCCTATTGGCCATGTGTTATTTAGGCTACTTAGAAAAAGAACAAAAAGGACCCGGTTCATCTTCAGAAGGGGGAAGCCAACCCCTCCTGTGCTGCCGGTCACTGTAACCCACTAGAACTAATAATTACCAATCCCCTAGATCCCCGTTGGAAAAAGGGAGAACGTGTAACCCTGGGGATCGATGGGACAGGGTTAAACCCCCAAGTTGCCATTTTAATTAGAGGGGAGGTCCACAAGCGCTCTCCCAAACCAGTATTTCAAACCTTTTATGAGGAGCTGAATCTGCCAGCACCAGAACTTCCGAAAAAGACAAAAAATTTGTTTCTCCAATTAGCAGAAAATGTAGCTCATTCCCTTAATGTTACTTCTTGTTATGTACGCGGGGGAACCACTATCGGAGACNGATGGCCTTGGGAAGCCCGAGAGTTGGTGCCTACTGATCCAGCTCCTGATATAATTCCAGTTCAGAAGGCCCAAGCTAGCAACTTCTAGGTCCTAAAAACCTCAATTATTAGACAATACTGTATAGCTAGAGAAGGGAAAGACTTTATCATCCCTGTAGGAAAGCTTAATTGTATAGGACAGAAGTTGTATAACAGCACAACAAAGACAATTACTTGGTGGGGCCTAAACCACACTGAAAAGAATCCATTTAGTAAATTTTCTAAATTAAAAACTGCTTGGGCTCATCCAGAATCTCATCAGGACTGGACGGCTCCCGCTGGACTATACTAGATATGTAGGCACAGAGCCTACATTCGGTTACCTAATAAATGGGCAGGCAGTTGTGTTATTGGCACTATTAAGCCGTCCTTTTTCTTATTACCCATAAAAACGGGTGAGCTCCTAGGTTTCCCTGTCTACGCCTCCCGAGAAAAGAAAGGCATAGTTATAGGAAACTGGAAAGATAATGAGTGGCCCCCTGAAAGGATCATNCAGTATTATGGGCCTGCCACATGGGCACAAGACGGCTCATGGGGATACCGAACCCCCATCTACATGCTCAATCGGATCATACGGTTGCAGGCCGTCTTAGAAATAATTACTAATGAAACTGGCAGAGCTTTGACTGTTTTAGCTTGGCAAGAAACCCAAATGAGGAATGCTATCTATCAGAATAGACTGGCCTTAGACTACTTGCTAGTAGCTGAAGGAGGAGTTTGTGGAAAATTTAACTTAACCAATTGCTGCCTACAAATAAATGATCAAGGACAGGTGGTTAAAAACATAGTCAGGGACATGACAAAGGTGGCACATGTGCCTGTACAGGTTTGGCACGAGTTTAATCCTGAGTCTTTATTTGGAAAATGGTTTCCAGCTATAGGAGGATTTAAAACCCTCATTGTAGGTGTATTGCTAGTGATAGGAACTTGCTTGCTGCTCCCCTGTGTATTACCCTTGCTTTTTCAAATGATAAAAGGTTTTGTTGCTACTTTGGTTCATCAGAAAACTTCAGCACACGTGTATTATATAAATCACTATCGCTCTATCTCACAAAGAGACTCAAAAAGTAAAGATGAGAGTGAGAACTCCCACTAAAAGTGAAAATNCTCAAAGGGGGGAAAA
|
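The gene boundaries quoted in the Comment field can be used to slice the consensus sequence. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state the convention). The names `GENE_BOUNDS`, `region_length`, `extract_region` and `summarize` are illustrative and not part of the database.

```python
# Gene boundaries copied from the Comment field.
# ASSUMPTION: coordinates are 1-based and inclusive.
GENE_BOUNDS = {
    "Gag": (571, 2094),
    "Pol": (2095, 5655),
    "Env": (5678, 7750),
}


def region_length(start, end):
    """Length in bp of a 1-based, inclusive interval."""
    return end - start + 1


def extract_region(consensus, start, end):
    """Slice a 1-based, inclusive interval out of a Python string."""
    return consensus[start - 1:end]


def summarize(bounds):
    """Return {gene: (length_bp, whole_codons, length % 3)}."""
    out = {}
    for gene, (start, end) in bounds.items():
        n = region_length(start, end)
        out[gene] = (n, n // 3, n % 3)
    return out


if __name__ == "__main__":
    for gene, (n, codons, rem) in summarize(GENE_BOUNDS).items():
        print(f"{gene}: {n} bp, {codons} codons, remainder {rem}")
```

Under that assumption the three regions are 1524, 3561 and 2073 bp long, each a multiple of three. To obtain the nucleotide sequence of a region, pass the consensus as a single string with line breaks removed to `extract_region`.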
TF motifs of the consensus sequence
Use FIMO to detect transcription factor motifs in the consensus sequence of the TE family.
TE_family | TFBS | Start | End | Strand | Score | Matched sequence |
---|---|---|---|---|---|---|
HERVE | TRB2 | 1295 | 1305 | + | 16.79 | AAGCCCTAATT |
HERVE | ZIC1 | 814 | 827 | + | 16.68 | GACCCCCCACAGTG |
HERVE | IDD7 | 6549 | 6562 | + | 16.65 | CGAAAAAGACAAAA |
HERVE | TB1 | 1935 | 1943 | + | 16.64 | GGCCCCCCC |
HERVE | IDD4 | 6549 | 6563 | + | 16.56 | CGAAAAAGACAAAAA |
HERVE | CTCF | 4375 | 4389 | + | 16.56 | CCCACAAGGTGGCAG |
HERVE | ceh-10::ttx-3 | 7419 | 7432 | - | 16.52 | ATTGGTTAAGTTAA |
HERVE | USV1 | 7165 | 7174 | + | 16.51 | GCCCCCTGAA |
HERVE | zip-8 | 2295 | 2303 | - | 16.50 | ATGACGTCA |
HERVE | cg | 466 | 476 | - | 16.45 | ACACACACACT |
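As a rough illustration of how the hits above might be post-processed, the sketch below checks that each coordinate span matches the length of its matched sequence and assigns each hit to the Gag, Pol or Env region listed in the Comment field. It assumes 1-based, inclusive coordinates for both the hits and the gene boundaries. `HITS`, `GENES`, `locate` and `check_widths` are hypothetical names introduced here, not FIMO or database output fields.

```python
# Rows copied from the table above:
# (TFBS, start, end, strand, score, matched sequence)
HITS = [
    ("TRB2", 1295, 1305, "+", 16.79, "AAGCCCTAATT"),
    ("ZIC1", 814, 827, "+", 16.68, "GACCCCCCACAGTG"),
    ("IDD7", 6549, 6562, "+", 16.65, "CGAAAAAGACAAAA"),
    ("TB1", 1935, 1943, "+", 16.64, "GGCCCCCCC"),
    ("IDD4", 6549, 6563, "+", 16.56, "CGAAAAAGACAAAAA"),
    ("CTCF", 4375, 4389, "+", 16.56, "CCCACAAGGTGGCAG"),
    ("ceh-10::ttx-3", 7419, 7432, "-", 16.52, "ATTGGTTAAGTTAA"),
    ("USV1", 7165, 7174, "+", 16.51, "GCCCCCTGAA"),
    ("zip-8", 2295, 2303, "-", 16.50, "ATGACGTCA"),
    ("cg", 466, 476, "-", 16.45, "ACACACACACT"),
]

# Gene boundaries from the Comment field (assumed 1-based, inclusive).
GENES = {"Gag": (571, 2094), "Pol": (2095, 5655), "Env": (5678, 7750)}


def locate(start, end, genes=GENES):
    """Name of the gene region that fully contains [start, end]."""
    for name, (g_start, g_end) in genes.items():
        if g_start <= start and end <= g_end:
            return name
    return "outside annotated genes"


def check_widths(hits):
    """Hits whose coordinate span differs from the matched-sequence length."""
    return [h for h in hits if h[2] - h[1] + 1 != len(h[5])]


if __name__ == "__main__":
    assert not check_widths(HITS)
    for tf, start, end, strand, score, _ in HITS:
        print(f"{tf}\t{start}-{end}\t{strand}\t{score}\t{locate(start, end)}")
```

With these assumptions, every hit in the table has a span equal to its matched-sequence length, and the `cg` hit at 466-476 falls upstream of the annotated Gag start.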
TFBS enrichment in GRCh38
Use Fisher's exact test to test for enrichment of transcription factor binding sites within copies of the TE family in GRCh38.
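For readers unfamiliar with the test, here is a minimal pure-Python sketch of a one-sided Fisher's exact test on a 2x2 table. The page does not specify how the contingency table is built or whether the test is one- or two-sided, so the table layout described in the docstring and the counts in the usage example are assumptions made purely for illustration. `fisher_exact_greater` is a hypothetical helper, not the database's own implementation.

```python
from math import comb


def fisher_exact_greater(a, b, c, d):
    """One-sided p-value P(X >= a) for the 2x2 table [[a, b], [c, d]].

    ASSUMED layout (not stated in the source):
        a = TFBS sites overlapping copies of the TE family
        b = TFBS sites elsewhere
        c = background sites overlapping copies of the TE family
        d = background sites elsewhere
    With all margins fixed, X follows a hypergeometric distribution.
    """
    row1 = a + b          # total in the first row
    col1 = a + c          # total in the first column
    total = a + b + c + d
    denom = comb(total, row1)
    tail = 0
    # Sum the probabilities of tables at least as extreme as the observed one.
    for k in range(a, min(row1, col1) + 1):
        tail += comb(col1, k) * comb(total - col1, row1 - k)
    return tail / denom


if __name__ == "__main__":
    # Made-up counts, for illustration only.
    print(fisher_exact_greater(12, 88, 30, 870))
```

Exact binomial coefficients keep the sketch dependency-free but become slow at genome-scale counts. In practice a library routine such as `scipy.stats.fisher_exact` would usually be the better choice.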