HERVE
Basic information Differential Expression Stage analysis Survival analysis Correlation analysisDF ID | DF0000174 |
---|---|
TE superfamily | ERV1 |
TE class | LTR |
Species | Catarrhini |
Length | 7813 |
Kimura value | 6.13 |
Tau index | 0.9707 |
Description | Internal region of an ERV1 endogenous retrovirus, HERVE subfamily |
Comment | Associated long terminal repeat is LTR2. The gag and pol genes are ~40% similar to Moloney murine leukemia virus (MoMuLV). Some unique characteristics of the endogenous human retroviral DNA included a tRNA Glu primer binding site separated from the 5' LTR by a pentanucleotide and a putative env sequence which does not appear to overlap the C terminus of pol and has virtually no homology with the env gene of known infectious retroviruses. The reconstructed (though still pseudogenic) gene boundaries are: Gag: 571-2094, Pol: 2095-5655, Env: 5678-7750. |
Sequence |
TTTCTTGGTTCCCTGACCGGGAAGCGAGGTAATTGACGGACGGTCGAGGCAGCCCCTTAGGCGGCTTAGGCCTGCCCTGTGGAGCATCCCTGCGGGGGACTCCGGCCAGCTTGAGCGACGCGGATCCTGAGAGCGCTCCCGGGTAGGCAATTGCCCCGGTGGAACGCCTCGTCAGAGAGTGCGTGGCAGGCCCCCGTGGAGGATCAACGCAGTGGCTGAACACCGGGAAGGAACGGGCACTTGGAGTCCGGACATTTGAAACTTGGTAAGACTGGTCTTTGGAACTTGCCCACTCCATTTGAGTGGAAGCGTGGCCTGATCACCCACGGCGTGCCTGTACCGGCACTTTGGTTTTTGTTTTTGACTTGACTTGAATTGCTTGATACTTTGGTTTTGGTTTGACCTGGCTTGGATTTCTGGATACTCTGATTTTGGTTTTGATTCTGGTTTGGTGAAAACTGAAAAAGTGTGTGTGTGCCCTTTTTACCCATTCTTTGTTCTGTGGTGTGCGTGTGGTGTGAGCTTGGTGTTTTGTCTCGAGGAAACGTGGGTCAGACACAAAGTAAGCCTACTCCGCTAGGAACTATGTTGAAAAATTTTAAGAAAGGATTTAATGGAGACTATGGGGTTACTATGACACCAGGGAAACTTAGAACTTTGTGTGAAATAGATTGGCCAACATTAGAAGTGGGTTGGCCATCAGAAGGAAGCCTGGACAGGTCCCTTGTTTCTAAGGTATGGCACAAGGTAACTGGTAAGTCAGGACACCCAGACCAGTTCCCATACATAGACACTTGGTTACAGCTGGTGCTAGACCCCCCACAGTGGCTAAGAGGGCAGGCAGCAGCAGTGCTAGTAGCAAAGGGACAGATAGCCAAGGAAGGATCCCGCTCCACCCGCCGAGGGAAATCAACTCCTGAAGTTCTGTTCGACCCAACATCAGAAGATCCATTGCAGGAGATGGCACCAGTGATCCCAGTGGTGCCCTCCCCTTACCAGGGAGAGAGGCTCCCCACTCTTGAGCCCACAGTGCTTGCGCCTCCGCAAGACAAACATATCCCTAGGCCACCCAGAGTAGACAAGAGAGGAGGTGAAGACTCGGGAGAAACCCCTCCCTCGGCAGCTCGTTTACGACCCAAAACGGGGATACAAATGCCCCTGAGAGAGCAGCGGTATACTGGGATAGATGAGGATGGTCACGTGGTGGAGAGGCGTGTTTTTGGGTACCAGCCCTTCACCTCCGCCGACCTTCTCAACTGGAAAAACAATACCCCGTCCTATACCGAAAAGCCACAAGCCCTAATTGATTTGCTCCAAACTGTTATCCAGACCCACAACCCCACCTGGGCTGATTGCCACCAGTTGCTCATGTTCCTCTTTAACAGAGATGAAAGGCGGAGAGTGCTCCAAGCAGCAACTAAGTGGCTAGAGGAACATGCACCAGCTGATTATCAAAACCCCCAAGAGTATGGAAGGACCCAGTTACCAGGAACCGACCCCCAGTTGGACCCACATGAAAGAGAGGATATGCAAAGGCTAAACCGAGACAGGGAAGCTCTCTTGGAAGGATTAANGAGGGGAGCTCAGAAGGCCACAAACGTTAACAAGGTCTCTGAGGTCATTCAGGGAAAAGAAGAAAGTCCAGCACAATTCTACGAGAGACTGTGTGAGGCCTATCGTATGTATACTCCCTTTGATCCCGATAGCCCTGAAAATCAGCGCATGATTCACATGGCTTTAGTCCGTCAAAGCGCAGAAGACATGAGAAGAAAACTGCAGAAACAGGCTGGGCTTGCAGGGATGAATACATCACAATTACTAGAAATAGCTAGCCAGGTGTTTGTAAACAGGGATGCAGTAAGCCGTAAGGAAANCGCAAAGAGAATGGAGGTCAGGCCCGGCGAAACGCGCCTGTTAGCTGCAGCAATCAGAGGGGCCCCCCCAAANGAGGCAAGGNNGAAGGGGGGCCCTGGGAAAGAAACTCAGCTTGGCTGTCAGAGTTTGCAGCGTAACCAGTGTGCTTATTGTAAAGAAATAGGACAGTGGAAGAACAAATGCCCTCAGCTCAAAAGAAAACAAGGTGACTCAGAGCAGGAGGCCCCGGACAAGGAGGAAGGGGCCCTGCTCAACCTGGCAGAAGGGTTATTGGACTGAGGGAGACCGGGCTCAAGCGTCCCCAAAGAGCCTCTGGTCAGAATGACAGTCGGGGGTAGAGACATTGATTTTCTTGTAGATAGCGGTGCTGAACATTCGCTAGTAACCGCCCCGGTCGCCCCCTTATCCAAAAAGACTATTGACGTCATCGGAGCCACGGGGGTTTCAGCAAAGCAAGCTTTCTGCTTGCCTCGGACTTGTACTGTAGGAGGACATAAAGTCATTCATCAGTTTTNGTACATGCCTGACTGTCCCTTGCCCTTNTTGGGAAGGGACTTGCTCAGCAAGCTGAGAGCCACTATCTCTTTGACAGAGCACGGCTCTTTGCTGCTAAAGTTACCCGGAACGGGAGTCATTATGACCCTTACGGTCCCCCGAGAGGAGGAATGGAGACTTTTCTTAACTGAGCCGGGCCAAGAGAGAAGACCAGCTCTGGCTAAGCGGTGGCCAAGAGTACGGGCGGAAGACAACCCTCCGGGGTTGGCAGTCAACCGAGCCCNCGTACTCGTNGAAGTTAAGACTGGGGCCCAGCCGGTTAGGCAAAAACAGGACCCGGTCCCCAGAGAAGCTCTTCAAGGTATCCAGGTCCGTCTCAAGCACCTAAGAACTTTTGGAATTATNGTTCCTTGTCAGTCTCCATGGAACACTCCCCTCCTGCCTGTTCCCAAGCCACGGACCAAGGACTACNGGCCGGTACAGGATTTGCGCTTGCTTNATCAAGCTACACTGACTTTACATCCAACAGTACCTAACCCGTCCACATTGTTGGGGTTGCTGCCAGCTGAGGACAGCTGGTTCACCTGCTTGGACCTGAAAGACGCTTTCTTTCCTATCAGATTAGCCCCTGAGAGGCAGAAGCTGTTTGCCTTTCAGTGGGAAGATCCGGAGTCAGGTGTCACTACTCAGTACACTTGGACCGGGCTTCCCCAAGGGTTCAAGAACTCCCCCACCATCTTCGGGGAGGCGTTGGCTCGAGACCTCCAGAAGTTTCCCACCAGAGACCTAGGCTGCGTGTTGCTCCAGTAGGTTGATGACCTTCTGCTGGGACACCCCACGGCAGTCGGGTGGCCAAGGGAACGGATGCCCTACNCCGGCACCTGGAGGACTGTGGGTATAAGGTGTCCAAGAAGAAANGCTCAGATCTGCCGACGGCAGGTACGTTACTTGGGATTTACTATCCGACAGGGGGAACGCAGCCCGGGATCAGAAAGAAAGCAGGTCATTTGCAATCTACCGGAGCCTAAGAGCAGAAGGCAGGTGAGAGAATTCTTAGGAGCTGTGGGGTTTTGTAGACTGTGGATCCCAAACTTTGCAGTATTAGCCAAGCCTTTGTATGAGGTCACAAAGGGGGGGGACCGGGAACCTTTGGAATGGGGATCCCAACAACAGCAAGTCTTTCATGAGTTAAAGGAAAAACTTCTGGCAGCCCCAGCCCTGGGGCTACCCGATCTGACAAAGCCTTTTCCATTGTATGCGTCAGAGAGAGAAAAGATGGCAGCTGGACTTTTAACCCAAACTGTGGGGCCCTGGCCGAGGCCGGTGGCCTACCTCTCTAAACAACTAGACGGGGTTTCTAAAGGATGGCCCCCCTGTTTGAGGGCCTTGGCAGCAACTGCCCTGCTAGTACAAGAAGCAAATAAGCTGACTCTTGGGCAAAACCTGAACATAAAGGCCCCCCATGCTGTGGTGACTTTAATGAATACTAAAGGACATCATTGGCTAACGAATGCCAGACTCACCAAGTACCAAACTTTGCTCTGTGAAAATCCCCGTATAACCATTGAAGTTTGTAACACCCTACACCCCGCCACCTTGCTCCCGGTATCAGAGAGCCCTGTCGAGCNTGATTGTGTAGAAGTGTTGGACTCAGTTGACTCTGGGCATCAGTAGACTGGGAACTATACGTGGATGGGAGCAGCTTCNTCAACCCCCAAGGAGAGAGAGGTGCAGGGTATGCGGTGGTAACCCTGGACACTGTTGTTGAAGCCAGATCGTTGCCCCAGGCCACTTCAGCCCAGAAAGCTGAACTCATTGCTTTCATTCGGGCCTTAGAACTCAGTGAGGGTGAGACTGTCAACATTTACACTGATTCTCGGTATGCCTTTTTAACCCTTCAAGTGCATGGAGCGTGATAGAAAGAAAAGGGCCTATTGAACTCTGGGGGAAAAGACAGAAAATATCAACAAGAAATCTTGCAATTATTAGAAGCAGTATGGAAACCCCACAAGGTGGCAGTTATGCATTGCAGAGGACACCAGCGAGCTTCCACCTTGCTGGGTTTGGGGAATTCCCGCGCTGACTCAGAGGCTCGAAAAGCAGCATCTGCCCTTCCGGGCATCAGTGACAGCCCCCCTGCTCCCTCAAGCACCTGATCTTGGACCTACTTNTTCTAAAGAAGAAAAGGACTTTCTCCAGGTAGAGGGAGGACAAGTGATGGAGGAAGGATGGATTCGGTTACCAGATGGGAGAGTAGCTGTGCCACAGCTGCTAGGAGCTGCAGTTGTACTGGCTGTGCANGAAACCACCCATCGAGGTCAGGAGTCACTGGAAAAGTTGTTAGGCCGGTATTTCTACATCTCGCNTTTGTCAGCCCTTGCCAAAACGGTGAGGCAGCGGTGTGTTACCTGCCGACAGCATGATGCGAGGCAAGGTCCAGCCGTTCCGCCCGGCATACGAGCTTATGGAGCAGCCCCCTTTGAAGATCTCCAGGTGGACTTCACAGAGATGCCAAAGTGTGGAGGTAACAAGTATTTACTAGTTCTTGGGCGTACCTACTCTGGGTGGGTGGAGGCCTATCCAACACGAACTGAGAAAGCTCGTGAAGTAACCCGTGTGCTTCTTCGAGATCTGATTCCTAGATTTGGACTGCCCTTACGGATCGGCTCAGATAACGGGCCTGCGTTTNTGGCTGNCTTGGTACAGAAGACGGCAAAGGTATTGGGGATCACACGGAAACTGCATGCCGCCTCCCGGCCTCAGAGTTCCGGAAAGGTGGAGCGGATGAATCGGACTATCAAAAATAGTTTAGGGAAAGTATGTCAGGAAACAGGATTAAAATGGATACAGGCTCTCCCTATGGTATTATTTAAAATTAGATGTACCCCTTCTAAAAGAACAGGATATTCCCCTTATGAAATATTATATCATAGGCCCCCTCCCATATTGCGGGGACTTCCAGGCACTCCCCGAGAGTTAGGTGAAATTGAGTTACAGCGACAGCTACAGGCTTTAGGAAAAATTACACAAACAATCTCAGCCTGGGTAAATGAGAGATGCCCTGTTAGCTTATTCTCCCCAGTTCACCCTTTCTCCCCAGGTGATCGAGTGTGGATCAAGGACTGGAACGTAGCCTCTTTGTGCCCACGGTGGAAAGGACCCCAGACTGTCGTCCTGANCACTCCCACCGCTGTGAAGGTAGAAGGAATCCCAGCCTGGATCCACCACAGCCGTGTAAAACCTGCAGCGCCTGAAACCTGGGAGGCAAGACCAAGCCCGGACAACCCTTGCAGAGTGACCCTGAAGAAGACGACAAGCCCTGCTCCAGTCACACCCGGAAGCTGACTGGTCCACGCACGGCCGAAGCATGAGGAAGCTCATCGTGGGATTCATTTTTCTTAAATTTTGGACTTATACAGTAAGGGCTTCAACTGACCTTACTCAAACTGGGGACTGTTCCCAGTGTATTCATCAGGTCACCGAGGTAGGACAGCAAATTAAAACAATCTTTCTGTTCTATAGTTATTATGAATGTATGGGAACATTAAAAGAAACTTGTTTGTATAATGCCACTCAGTACAAGGTATGTAGCCCGGGAAATGACCGACCTGATGTGTGTTATAACCCATCTGAGCCCCCTGCAACCACCGTTTTTGAAATAAGAATAAGAACTGGCCTTTTCCTAGGTGATACAAGTAAAATAATAACTAGAACAGAAGAAAAAGAAATCCCCAAGCAAATAACTTTAAGATTTGATGCTTGTGCAGCCATTAATAGTAAAAAGCTAGAAATAGGATGTGGTTCTCTTAACTGAGAAAGGAGCTANAGAGTAGAAAATAAATATGTTTGTCATGAGTCAGGGGTTTGTGAAAATTGTGCCTATTGGCCATGTGTTATTTAGGCTACTTAGAAAAAGAACAAAAAGGACCCGGTTCATCTTCAGAAGGGGGAAGCCAACCCCTCCTGTGCTGCCGGTCACTGTAACCCACTAGAACTAATAATTACCAATCCCCTAGATCCCCGTTGGAAAAAGGGAGAACGTGTAACCCTGGGGATCGATGGGACAGGGTTAAACCCCCAAGTTGCCATTTTAATTAGAGGGGAGGTCCACAAGCGCTCTCCCAAACCAGTATTTCAAACCTTTTATGAGGAGCTGAATCTGCCAGCACCAGAACTTCCGAAAAAGACAAAAAATTTGTTTCTCCAATTAGCAGAAAATGTAGCTCATTCCCTTAATGTTACTTCTTGTTATGTACGCGGGGGAACCACTATCGGAGACNGATGGCCTTGGGAAGCCCGAGAGTTGGTGCCTACTGATCCAGCTCCTGATATAATTCCAGTTCAGAAGGCCCAAGCTAGCAACTTCTAGGTCCTAAAAACCTCAATTATTAGACAATACTGTATAGCTAGAGAAGGGAAAGACTTTATCATCCCTGTAGGAAAGCTTAATTGTATAGGACAGAAGTTGTATAACAGCACAACAAAGACAATTACTTGGTGGGGCCTAAACCACACTGAAAAGAATCCATTTAGTAAATTTTCTAAATTAAAAACTGCTTGGGCTCATCCAGAATCTCATCAGGACTGGACGGCTCCCGCTGGACTATACTAGATATGTAGGCACAGAGCCTACATTCGGTTACCTAATAAATGGGCAGGCAGTTGTGTTATTGGCACTATTAAGCCGTCCTTTTTCTTATTACCCATAAAAACGGGTGAGCTCCTAGGTTTCCCTGTCTACGCCTCCCGAGAAAAGAAAGGCATAGTTATAGGAAACTGGAAAGATAATGAGTGGCCCCCTGAAAGGATCATNCAGTATTATGGGCCTGCCACATGGGCACAAGACGGCTCATGGGGATACCGAACCCCCATCTACATGCTCAATCGGATCATACGGTTGCAGGCCGTCTTAGAAATAATTACTAATGAAACTGGCAGAGCTTTGACTGTTTTAGCTTGGCAAGAAACCCAAATGAGGAATGCTATCTATCAGAATAGACTGGCCTTAGACTACTTGCTAGTAGCTGAAGGAGGAGTTTGTGGAAAATTTAACTTAACCAATTGCTGCCTACAAATAAATGATCAAGGACAGGTGGTTAAAAACATAGTCAGGGACATGACAAAGGTGGCACATGTGCCTGTACAGGTTTGGCACGAGTTTAATCCTGAGTCTTTATTTGGAAAATGGTTTCCAGCTATAGGAGGATTTAAAACCCTCATTGTAGGTGTATTGCTAGTGATAGGAACTTGCTTGCTGCTCCCCTGTGTATTACCCTTGCTTTTTCAAATGATAAAAGGTTTTGTTGCTACTTTGGTTCATCAGAAAACTTCAGCACACGTGTATTATATAAATCACTATCGCTCTATCTCACAAAGAGACTCAAAAAGTAAAGATGAGAGTGAGAACTCCCACTAAAAGTGAAAATNCTCAAAGGGGGGAAAA
|
TF motifs of the concenus sequence
Use FIMO to detect transcription factor motifs in the concenus sequence of the TE family.
TE_family | TFBS | Start | End | Strand | Score | Matched sequence |
---|---|---|---|---|---|---|
HERVE | TGA10 | 2295 | 2304 | - | 18.17 | GATGACGTCA |
HERVE | Dif | 4437 | 4446 | + | 18.13 | GGGAATTCCC |
HERVE | Dif | 4437 | 4446 | - | 18.13 | GGGAATTCCC |
HERVE | GAF1 | 6548 | 6562 | + | 18.00 | CCGAAAAAGACAAAA |
HERVE | IDD6 | 6548 | 6563 | - | 17.94 | TTTTTGTCTTTTTCGG |
HERVE | IDD6 | 6548 | 6563 | - | 17.89 | TTTTTGTCTTTTTCGG |
HERVE | ZIC4 | 814 | 827 | + | 17.83 | GACCCCCCACAGTG |
HERVE | NFKB2 | 4436 | 4446 | - | 17.74 | GGGAATTCCCC |
HERVE | ERF057 | 1235 | 1248 | + | 17.54 | TCACCTCCGCCGAC |
HERVE | ci | 813 | 823 | + | 17.52 | AGACCCCCCAC |
TFBS enrichment in GRCh38
Use Fisher's exact test to perform enrichment analysis of transcription factor binding sites in the TE family of GRCh38.