HERVS71
Basic information Differential Expression Stage analysis Survival analysis Correlation analysisDF ID | DF0000205 |
---|---|
TE superfamily | ERV1 |
TE class | LTR |
Species | Catarrhini |
Length | 8978 |
Kimura value | 7.99 |
Tau index | 0.9113 |
Description | Internal region of ERV1 endogenous retrovirus, HERVS71 subfamily |
Comment | The associated long terminal repeats are LTR6A and LTR6B. |
Sequence |
TAATGGAGGCCCCAGCGAGANATTAACGCCACCGGGCGAGAGCCGGGCTCGCTCCGGGCTCCCCCGGAAGGACGGCCGGCTTGTAGGGGGGGCGCCACCTGAAAAAANAATTTTCAGGNTCCCCGAAAGGTGACCGTCTTCCGGAGGAGAGCGGATCGACTACCGTGTGGGTGCCCATAAAATTCCACCTCTGAGTCCTCAGCTTCTGACCCCGGGGTCAGGTAGGTCAGATTTGACTTCGGTTCTGGTAAGAGGGAAGCGGCCCTGACGAGGGCGTCCCTCTTTTGACTCTGCCCGTTTCTCTAGGACGCTAGAGGGTNGAGCCCTGGTTTTCTGNTAGGCGCCTCTGTGTCTCTGTCTAGGAGGGAAGTGGCCCTGACAGGGGCCCTCCCTTGACTCAGTCCACGTCCCAGGATGCTGGAGGACTGAGTCCTGGTTTCTGGCAGACCGGNNNNTCNNTCTCTCTCTCTCTCTCTCTTTTTCTATCTCTCATCTTTCTCTTGTTCAAGTTTCTTGGAAATCTCCGGGAAAGAAAANNNNNNNNNNAAAAAAAAACTGTTATAAACTCTGTGTGAATGGTGAGTGAATGAGGGAGGACAAGGGCTTGCGCTTGTCCTCCAGTTTGTAGCTCCACGGCGAAAGCTACGGAGTTCGAGTGGGCCCTCACCTGCGGTTCCGTGGCGACCTCATAAGGCTTAAGGCAGCATCGGGCATAGCTCGATCCGAGCCGGGGGTTTATACCGGCCTGCCAATGCTAAGAGGAGCCCAAGTCCCCTCAGGGGGAGCGGCCAGGCGGGCATCTGACTGATCCCATCACGGGANCCCCTCCCCTTGTCTGTCTAAAAAAAAAAANAAAAAAGGAAAAACTGTCATAACTGTTTACATGCCCTAAAGTCAATTGTTTGTTTTATGTTGATTGTTCTGTTCAGTGTCTATTGTCTTGTTTAGTAGTTGTCAAAGTTTTGCATGTCAAGACGTCGATATTGCCCAAGACGTCTAGGTAAAAACTTCTTCAAGGTCCTTAGTGCTGATTTTTTGTCACAGGAGGTTAAATTTCTCATCAATCATTTAGGCTGGCCACCACAGTCCTGTCTTTTCTGCCAGAAGCAAGTCAAGTGTTGTTACGAGAACGAGTGTGAAAAACATTCGCCTGATTAAGATTTCTGGCACCATGAAAGTTGTAAGTATTTAGATCGTCATACCCCACGTCCAAGTGATTAGACCTCCTCTAAACTAAACCGGTAGTGGGTTCAAAACAGCCACCCTGCAGATTTCCTTGCTCACCTCTTTTGTCATTCTGTAACTTTTCCTGTGCCCTTAAATAGAACACTGTGTAAGGAAACGTACGCCCGTACTGCTTTACTTCGTTTAGATTCTTACTCTGTTCCTCTGTGGCTACTCTCCCATCTTAAAAATGATCCGAGTAGTCCTTTTCCNCCTCGTCCCTGCCCCCTACCCCGCACATCTCGTTTTCCGGTGCGACAGCAAGTTCAGCGTCTCCAGGACTTGGCTCTGCTCTCACTCCTTAAACCCTTAAAAGAAAAAGCTAAGTTTAAGCTATTTGCCTTTAAGTCATAGAGACACCAAAAGTATTTAAGGTGCAGATCTAGAAGAAGAAGAAGANNGAGAACGCCTAGATCAAACTGACCCAGAAGATCTCAGGCTGGCCCCTAGTCCTCCTCCCTCAATCTTAAAGCTACAGCAATGTGGCAAGTAGTATTAGCTGTTGTAGTTTTTCTGCTNCTTTCTGGTCATGTTGATTCTGTTCTTTCGATACTCCAGCCCCCCAAGGAATGAGTTTCTCTGTCCGTGCTAGGTTTAATATCTATGCTCAANATCTTATTAAATTGCCTTCAAANAANAAAAANAANNNNAAAACGGGAAACACTTCCTCCCAGCCTTGTAAAGGTTAGAGCCCTCTCCAATGTATGCTGCAGAATTTTTCTCTCGGTTTCTCAGAGGATTATAAAGTCCGCCTTAAAAAAGGCAAGCTCCGGACACTCTGCGAAATAGAATGGCCAAAGTTTAGAGTCGAGTGGCCCCCTGAAGGGTCATTGAACCTCACAATTGTTCAAGCTGTGTGGCGGGTTGTTACTGAAACTCCCAGCCACCCTGATCAGTTTCCCTACATTGATCAATGGCTAAGTTTGGTCAGGAGCCCCCCTCCATGGCTCCGTTCATGCGCCATTCATAATTCTACCTCCAAGGTCCTCCTGAGCCAGACCGCGTTTTCGCCTCGACCCTCAGCCGGTTCGGCTCCCCCTGTACTGCCTCCCTCTGAAGAAGAGGAGAGTCTCCCTCACCCAGTCCCACCGCCTTACAACCAGCCTGCTCCCTTAAAGTTATCCCATGTCTCCTCGACGACGTCCCCTGTAGGCTCGCCACCCATTGCCTCTCGATCGCGACCGCGGCGGGAGGAAGTAGCCCCTCTACTACCACTGAGAGAGGCACAAGTCCCTCCGGGTGACGAGCGCTCAGCCCCCTTCTTAGTTTATGTCCCTTTTTCTACTTCTGACTTGTATAATTGGAAAACCCATAATCCTCCCTTCTCTGAAAAGCCCCAGGCTTTGACCTCTCTGACGGAGTCCGTACTCCGGACTCACCCGCCCACCTAGGATGATTGCCAACAGCTCCTTTTAACCCTTTTCACCTCTGAAGAGAAGGAACGTATCCGAAGAGAGGCCAAAAAGTACTTCCTCGCATCAGCCAATGGACCGGAGGAGGAAGCTAGAGACCTCCTTGAGGAGGTCTTTCCCTCTACCCGGCCTAACCGGGACCCAAATTCCTCAAGTGGAAGGAGAGCTTTAGACGATTTTCACCGGTATCTCCTCGCGGGTATTAAAGGAGCCGCTCGGAAACCCATAAACTTGTCTAAGACGACCGAAGTTGTCCAGGGGCCCGATGAGTCACCAGGAGCGTTTTTAGAGCGCCTCCAGGAGGCTTATCGGATTTACACCCCTTTTGACCCGGCGGCTCCCGAAAATAGCCGTGCTCTTAATTTGGCATTTGTGGCTCAGGCAGCCCCGGATATTAAAAGGAAACTCCAAAAACTGGAAGGATTTGCTAGAATGAATATCAGTCAGCTTTTAGAAATAGCCCAAAAAGTTTTTGACAATCGAGAGTTTGAAAAACAAAAACAAGCAACACAGGCAGCTGAAAAGGCCGCTGATAAAGCATTCAAAAGACAAACAAAAATCTTAGTGGCGGCTATCCAAGAGGACAGAATGAAATGGCCCCCATTCCAGAAGAATGGCCAAGGAACCTCGGGTTCCCACCAGAAAAGTAAAAGAGGTGAACAGGCCCCTCTAGGAAAAACCAATGTGCCTATTGCAAGCAGACTGGGCACTGGAAAAAGGAGTGCCCACTACTGCCANAAGAAAAGTCAGAAAACAAAAAGGTCCTCACCCTGCCCGCAACGGAGGAGCCTGATGATTGACGGGGCCAGGGCTCCCTCGCTCTTGGCCCCCAGGANCCCATGGTAACTGCTACAGTGGGGGGCCAGCCTGTACGTTTCCTAGTAGACACCGGGGCGGAGCACTCGGTACTGCAGACTCCCTTGGGCAGTGTCTCAAATAAAAAAATGACTGTACAAAGGGCAACTGGAGCTATTCAAGAATATCCTGTCACACGCTCCCGAGAAGTAAACTTGGGACAGAAAAGAGTGACACACTCTTTTCTNGTGGTTCCAGAGTGTCCTTTTCCTCTCCTTGGACGAGACCTGCTCCATAAGTTACAGGCCTCAATCTCCTTTTCAGCTCAGCAGGCTCATCTCACACTAGGAAATGCAACTTCCCCCACTGCCCAACTCTTGCTAACTACCCCTCTGTCAGAAGAATACCTTCTGGTTTCACCATCACAATCACCGGAGGAGAATACTAATACTCTTTTGTTGGACNTACAGACACTTTTTCCCCGAGTTTGGGCCGAGTCAAACCCTCCCGGACTGGCTAAACACCATCCGCCAGTGGTTGTAGAACTCTTGGCCACTGCCATACCGGTCCAGGTAAAGCAATACCCCATGAGTCAGCAGGCTAGAGAGGNGATTAATCCCCACATTCAATGACTGTTACAAGCTGGCATACTTACACCATGTCAGTCGGCCTGGAACACNCCATTTTTGCCGGTCCAGAAACCTGGAACAAATGATTACCGGCCGGTACAAGACTTAAGGGAAGTTAATAAATGGACTGTTACTGTCCATCCAACCGTCCCTAATCCTTATACTCTACTCAGCCTGCTCCCACCAGAACATACAGTATACACTGTCCTTGACCTGAAAGATGCTTTCTTTGCTATTCCTCTGGCCCCCAAAAGCCAGCCGATTTTTGCATTTGAATGGACAGATCCAAGATCAGGAGACACTACCCAACTGACTTGGACTCAGTTACCTCAGGGTTTTAAAAATTCCCCCACCCTTTTTGGGGAGGCTCTTCGGCAAGATCTTATACCTTCCGAGCTAGTCACCCTAACTGTACTCTTCTTCAGTATGTAGATGATATTTTAATAGCTACTGAAACTATGGACAGTTGTCTACAACACACGAGGGACCTGCTCTACCTCCTTCAGGAGCTCGGGTATGGAGTCTCAGCCAAAAAGGCCCAGCTTTGTCTTCCCAGAGTGTCCTACCTGGGGTACGAGATAAACCAAGGAAAAAGGGCACTCACCAGTGCCCGGAAAGAAGCCATCCTGCGAATCCCCACTCCCGCCACCAAGAGACGGGTACGCGAATTNCTGGGGGCCGTGGGATACTGTCGCCTCTGGATATCGGGGTTCGCGGAGATTGCAAAGCCCTTGTATACTGCTACAGGANGNAATGGCCCGCTAATTTGGACAGACACNGAAGAACAGGCTTTTCAAAACCTGAAAAAGGCATTAACTGAAGCCCCTGCTTTAGCCCTCCCTAATATCTCAAAGCCGTTTCACCTGTTTGTCCATGAAAGCCAGGGAGTTGCTAAAGAGGTGCTTACTCAGACTTTAAGACCCTGGAGACGCCCAGTGGCCTATTTATCTAAGAGGCTGGATCCTGTGGCCTCTGGATGGCCAAGTTGTCTGCGAGCCGTAGCGGCTACAGCAAGCCTAGTCCAAGAAGNTGATAAGTTAACTCTAGGCCAAAATTTAACCCTTACAGCTCCTCATGCCGTAGAGACCTTACTACGAAGTGCTTCTGGCAAATGGATGTCAAATGCTCGCATCTTGCAGTATCAGAGTTTACTGTTAGATCAGCCTCGTTTGACTTTCTCTCCCACAAGGTGTTTNAATCCAGCTACACTACTTCCTGACCCAGACTCCACTATTCCTGCTCATGACTGTCAAGAACTGTTAGAAACTACCGAAACTGGCCGACCTGATCTTCAAGATGTGCCCCTAGAAAAGGCGGATGCCGCCGTGTTCACAGACGGTAGCAGCTTCCTCGAGCAGGGAGTACGAAAAGCCGGTGCAGCTGTTACCACGGAGACAGATGTGTTGTAGGCTCAGGCTTTACCAGCGAACACCTCAGCGCAAAAGGCTGAATTGATCGCCCTCACTCAGGCTCTCCGATGGGGTAAGGATAAACGTATTAACATTTACACTGACAGCAGGTACGCCTTTGCTACTGTGCATGTACATGGAGCCATCTACCAGGAANGCGGGCTACTCACCTCAGCAGGAAAGGCTATCAAAAACAAAGAAGAAATTCTAGCCCTGCTTGAAGCCGTGTGGCTCCCTCAGCAGGTAGCTGTGATCCACTGCAAAGGACATCAAAAAGAAAACACGGCCGTTGCCCGTAGTAACCAGAAAGCTGATTCAGCAGCTCAGGTCGCAGCGNGACTTTCAGTCACGCCTCTAAACTTGCTGCCCACAGTCTCCTTTCCACAGCCAGATCTGCCTGACAATCCCGTATACTCAACAAAANAAAAAAAACTGGCTTCAGATCTCAGAGCCAATAAAAATCAGGAAAGTTAGTAGATTCTTCCTGACTCTAGAATCTTCATACCCCGAACTCTTAAAGAAACTTTAATCAGTCACCTACAGTCTACCACCCATTTAAGAAGAGCAAAGCTACCTCAGCTCCTCCGGAGCCATTTTAAGATCCCCCGTCTTCAAAGCCTAACAGATCAAGCAGCTCTCCGGTGCACAACCTGCGCCCAGGTAAATGCCAAGCAAGGTCCTAAACCCAGCCCAGGCCACCGTCTCCGAAAAAACTCGCCAGGAGAAAAGTGGGAAATTGACTTTACAGAAGTAAAACCACACCGGGCTAAGTACAAATACCTTCTAGTACTAGTAGACACCTTCTCCGGATGGACTGAGGCATTTGCTACCGAAAACGAAACCGCCAACACGGTAGTTAAGTTTTTACTCAATGAAATCATCCCTCGATATAGGCTGCCTGCTGCCATAGGGTCTGATAATGGACCGGCCTTCACCTCGCCCATAGCTCAGTCAGTCAGTAAGGCGTTAAACATTCAACGGAAGCTCCATTGTGCCTATCGACCCCAGAGCTCCGGGCAGGTAGAACGCATGAACCGCACCCTAAAAAACACTCTTACAAAATTAATCTTAAAAACCGGTGNAAATTAGGTAAGTCTCCTTCCTTTAGCCCTACTTAGAGTAAGGTGCACCCCTTACCAGGCTAGGTTCTCACCTTTTGAAATCATGTATAGGAAGGCGCCGCCTATCTTGCCTAAGCTAAGAGATGCCNAATTAGCAGAAATATCACAAGCTAATTTATTACAGTACCTACAGTCTCTCCAACAGGTACAAGATATCATCCTGCCACTTGTTCGAGGAGCCCATCCCAATCCAATTCCTGACCAGACGGGGTCCTGCCATTCGTTCCAGCCAGGAGACCTAGTGTTTGTTAAAAAGTTCCAGAAAGAAGGACTCACTCCTGCTTAGAAAAGACCTCACACCGTCATCCTCACGACGCCAACGGCTCTGAAGGTGGACGGCATTCCTGCTTAGATTCATCACTCCCGCATCAAAAAGGCCAACAGAGCCCAACTAAAAACATAGGTCCCCAGGCCTAGGTCAGGCCCCTTAAAACTGCGCCTAAGTCAGGTGAAGCCATTAGATTNATTCTTTTTATCTACCTCACTTGTTTGTTTTTGCCCGTTACGTCCTCTGTGCCTTCCTACTCCTTTCTCCTCACCTCTTTCACAACAGGACGTGTATTTGCAAACACCACTTGGAAGGCCGGTACCTCCAAGGAAGTCTCCTTTGCAGTTGATTTATGTGTACTGTTCCCAAAGCCAGCCCGTACCCACGAAGAGCAACACAATCTGCCAGTCCCAGGAGCAGGAAGTGTCGACCTTGCAGCAAGATTCGGACACTCCGGGAGCCAAACTAGATGTGGAAGCTCCAAAGGTGCAGAAAAAGGACTCCAAAATGTTGACTTTTACCTCTGTCCTAGAAATCACCCTGACGCTAGCTGTCGAGATACTTATCAGTTTTTCTGCCCTGATTAGACATGTGTAACTTTAGCCACCTACTCTAAGAGATCAACCAGATCTTCAACTCTTTCCATAAGTCGTGCTTCTCATCCTAAATTATGTACTAGAAAAAATTGTAATCCTCTTACTATAACTGTCCATGACCCTAATTCAACTCAATAGTATCATGGCATGTCATGAAGATTAAGATTTTATATCCCAGGATTTGATGTTAGGACTATGTTCACCATCCAAAANAAAACCCTGGTCTCATGGAGCCCACCCAAGCCAATCGGGCCTTTAACTGATCTAGGTGACCCTATGTTCCAGAAACACCCTGACAAAGTTGATTTAACTGTTCCTCCACCATTCTTAGTTCCTAAGCCCCAGCTACAANGACANCATCTTCAACCCAGCCTGATGTCTATACTAGGTGGAGTACATCATCTCCTTAACCTCACCCAGCCTAAACTAGCCCAAGATTGTTGGCTATGTTTAAAAGCAAAACCCCCTTATTATGTAGGATTAGGAGTAGAAGCCACACTTAAANGTGGCCCTCTATCCTGTCATACACGACCCCGTGCTCTCACACTAGGAGATGTGTCTGGAAACGCTTCCTGTCTGATTAGTACCGGGTATAACTTATCTGCTTCTCCTTTTCAGGCTATTTGTAATCAGTCCCTGCTTACTTCCATAAGCACCTCAGTCTCTTACCAAGCGCCTAACAATACCTGGTTGGCCTGCACCTCAGGTCTCACTCGCTGCATTAATGGAACTGAACCAGGACCTCTCTTGTGCGTGTTAGTTCATGTNCTTCCCCAGGTATACGTGTACAGTGGACCAGAAGGACAACTCCTCATCGCTCCCCCGGAATTACATCCCAGGTTGCGCCGAGCTGCCCCACTNCTGGTTCCCCTCTTGGCCGGTCTTAGCATAGCTGGATCAGCAGCCATTGGTACGGCTGCCCTGGTTCAAGGAGAAACTGGACTAATGTCCCTGTCTCAACAGGTGGATGCTGATTTAAGTAACCTCCAGTCTGCCATAGATATACTACATTCCCAGGTAGAGTCTCTGGCTGAAGTAGTNCTTCAAAACCGCCGAGGCTTAGATCTGCTATTCCTCTCTCAAGGAGGATTATGCGCAGCTCTAGGAGAAAGCTGTTGCTTCTACGCCAATCAATCTGGAGTCATAAAAGATACACTCCAAAAAGTGCGAGAAAATCTAGATAGGCGCCAACAAGAACGAGAAAATAACATCCCCTGGTATCAAAGCATGTTCAACTGGAACCCATGGCTAACTACTCTAATCACTAAGTTAGCCGGACCCCTCCCCATCCTACTATTAAGTCTAATTTTTGGGCCTTGTATATTAAATTAGTTTCTTAATTTTGTAAAACAACGCATAGCTTCTGTCAAACTTATGTATCTTAAGACTCAATATAACCCCCTTGTTATAACTGAGGAATCAACGATTTGATTCCCCAAAAACACAAGTGGGGAAATGAAATGCCTAACGTTGTTTTTACTCTAACTNGTTACTTTGAATTTTGTCCTGCTTGTCTCTTTAATC
|
TF motifs of the concenus sequence
Use FIMO to detect transcription factor motifs in the concenus sequence of the TE family.
TE_family | TFBS | Start | End | Strand | Score | Matched sequence |
---|---|---|---|---|---|---|
HERVS71 | IRF7 | 6300 | 6312 | + | 17.85 | CGAAAACGAAACC |
HERVS71 | NR4A2::RXRA | 2049 | 2061 | + | 17.82 | GGGTCATTGAACC |
HERVS71 | MAF::NFE2 | 4019 | 4029 | + | 17.74 | ATGAGTCAGCA |
HERVS71 | pan | 2838 | 2851 | - | 17.67 | GCGGCTCCTTTAAT |
HERVS71 | eor-1 | 344 | 356 | - | 17.58 | AGAGACACAGAGG |
HERVS71 | SCRT1 | 8411 | 8420 | + | 17.58 | TCAACAGGTG |
HERVS71 | MAFG::NFE2L1 | 4019 | 4029 | + | 17.52 | ATGAGTCAGCA |
HERVS71 | WRKY71 | 7368 | 7377 | - | 17.48 | AAAAGTCAAC |
HERVS71 | CTCF | 5700 | 5732 | - | 17.47 | TTGCAGTGGATCACAGCTACCTGCTGAGGGAGC |
HERVS71 | WRKY8 | 7367 | 7377 | - | 17.43 | AAAAGTCAACA |
TFBS enrichment in GRCh38
Use Fisher's exact test to perform enrichment analysis of transcription factor binding sites in the TE family of GRCh38.