HERVS71
Basic information Differential Expression Stage analysis Survival analysis Correlation analysisDF ID | DF0000205 |
---|---|
TE superfamily | ERV1 |
TE class | LTR |
Species | Catarrhini |
Length | 8978 |
Kimura value | 7.99 |
Tau index | 0.9113 |
Description | Internal region of ERV1 endogenous retrovirus, HERVS71 subfamily |
Comment | The associated long terminal repeats are LTR6A and LTR6B. |
Sequence |
TAATGGAGGCCCCAGCGAGANATTAACGCCACCGGGCGAGAGCCGGGCTCGCTCCGGGCTCCCCCGGAAGGACGGCCGGCTTGTAGGGGGGGCGCCACCTGAAAAAANAATTTTCAGGNTCCCCGAAAGGTGACCGTCTTCCGGAGGAGAGCGGATCGACTACCGTGTGGGTGCCCATAAAATTCCACCTCTGAGTCCTCAGCTTCTGACCCCGGGGTCAGGTAGGTCAGATTTGACTTCGGTTCTGGTAAGAGGGAAGCGGCCCTGACGAGGGCGTCCCTCTTTTGACTCTGCCCGTTTCTCTAGGACGCTAGAGGGTNGAGCCCTGGTTTTCTGNTAGGCGCCTCTGTGTCTCTGTCTAGGAGGGAAGTGGCCCTGACAGGGGCCCTCCCTTGACTCAGTCCACGTCCCAGGATGCTGGAGGACTGAGTCCTGGTTTCTGGCAGACCGGNNNNTCNNTCTCTCTCTCTCTCTCTCTTTTTCTATCTCTCATCTTTCTCTTGTTCAAGTTTCTTGGAAATCTCCGGGAAAGAAAANNNNNNNNNNAAAAAAAAACTGTTATAAACTCTGTGTGAATGGTGAGTGAATGAGGGAGGACAAGGGCTTGCGCTTGTCCTCCAGTTTGTAGCTCCACGGCGAAAGCTACGGAGTTCGAGTGGGCCCTCACCTGCGGTTCCGTGGCGACCTCATAAGGCTTAAGGCAGCATCGGGCATAGCTCGATCCGAGCCGGGGGTTTATACCGGCCTGCCAATGCTAAGAGGAGCCCAAGTCCCCTCAGGGGGAGCGGCCAGGCGGGCATCTGACTGATCCCATCACGGGANCCCCTCCCCTTGTCTGTCTAAAAAAAAAAANAAAAAAGGAAAAACTGTCATAACTGTTTACATGCCCTAAAGTCAATTGTTTGTTTTATGTTGATTGTTCTGTTCAGTGTCTATTGTCTTGTTTAGTAGTTGTCAAAGTTTTGCATGTCAAGACGTCGATATTGCCCAAGACGTCTAGGTAAAAACTTCTTCAAGGTCCTTAGTGCTGATTTTTTGTCACAGGAGGTTAAATTTCTCATCAATCATTTAGGCTGGCCACCACAGTCCTGTCTTTTCTGCCAGAAGCAAGTCAAGTGTTGTTACGAGAACGAGTGTGAAAAACATTCGCCTGATTAAGATTTCTGGCACCATGAAAGTTGTAAGTATTTAGATCGTCATACCCCACGTCCAAGTGATTAGACCTCCTCTAAACTAAACCGGTAGTGGGTTCAAAACAGCCACCCTGCAGATTTCCTTGCTCACCTCTTTTGTCATTCTGTAACTTTTCCTGTGCCCTTAAATAGAACACTGTGTAAGGAAACGTACGCCCGTACTGCTTTACTTCGTTTAGATTCTTACTCTGTTCCTCTGTGGCTACTCTCCCATCTTAAAAATGATCCGAGTAGTCCTTTTCCNCCTCGTCCCTGCCCCCTACCCCGCACATCTCGTTTTCCGGTGCGACAGCAAGTTCAGCGTCTCCAGGACTTGGCTCTGCTCTCACTCCTTAAACCCTTAAAAGAAAAAGCTAAGTTTAAGCTATTTGCCTTTAAGTCATAGAGACACCAAAAGTATTTAAGGTGCAGATCTAGAAGAAGAAGAAGANNGAGAACGCCTAGATCAAACTGACCCAGAAGATCTCAGGCTGGCCCCTAGTCCTCCTCCCTCAATCTTAAAGCTACAGCAATGTGGCAAGTAGTATTAGCTGTTGTAGTTTTTCTGCTNCTTTCTGGTCATGTTGATTCTGTTCTTTCGATACTCCAGCCCCCCAAGGAATGAGTTTCTCTGTCCGTGCTAGGTTTAATATCTATGCTCAANATCTTATTAAATTGCCTTCAAANAANAAAAANAANNNNAAAACGGGAAACACTTCCTCCCAGCCTTGTAAAGGTTAGAGCCCTCTCCAATGTATGCTGCAGAATTTTTCTCTCGGTTTCTCAGAGGATTATAAAGTCCGCCTTAAAAAAGGCAAGCTCCGGACACTCTGCGAAATAGAATGGCCAAAGTTTAGAGTCGAGTGGCCCCCTGAAGGGTCATTGAACCTCACAATTGTTCAAGCTGTGTGGCGGGTTGTTACTGAAACTCCCAGCCACCCTGATCAGTTTCCCTACATTGATCAATGGCTAAGTTTGGTCAGGAGCCCCCCTCCATGGCTCCGTTCATGCGCCATTCATAATTCTACCTCCAAGGTCCTCCTGAGCCAGACCGCGTTTTCGCCTCGACCCTCAGCCGGTTCGGCTCCCCCTGTACTGCCTCCCTCTGAAGAAGAGGAGAGTCTCCCTCACCCAGTCCCACCGCCTTACAACCAGCCTGCTCCCTTAAAGTTATCCCATGTCTCCTCGACGACGTCCCCTGTAGGCTCGCCACCCATTGCCTCTCGATCGCGACCGCGGCGGGAGGAAGTAGCCCCTCTACTACCACTGAGAGAGGCACAAGTCCCTCCGGGTGACGAGCGCTCAGCCCCCTTCTTAGTTTATGTCCCTTTTTCTACTTCTGACTTGTATAATTGGAAAACCCATAATCCTCCCTTCTCTGAAAAGCCCCAGGCTTTGACCTCTCTGACGGAGTCCGTACTCCGGACTCACCCGCCCACCTAGGATGATTGCCAACAGCTCCTTTTAACCCTTTTCACCTCTGAAGAGAAGGAACGTATCCGAAGAGAGGCCAAAAAGTACTTCCTCGCATCAGCCAATGGACCGGAGGAGGAAGCTAGAGACCTCCTTGAGGAGGTCTTTCCCTCTACCCGGCCTAACCGGGACCCAAATTCCTCAAGTGGAAGGAGAGCTTTAGACGATTTTCACCGGTATCTCCTCGCGGGTATTAAAGGAGCCGCTCGGAAACCCATAAACTTGTCTAAGACGACCGAAGTTGTCCAGGGGCCCGATGAGTCACCAGGAGCGTTTTTAGAGCGCCTCCAGGAGGCTTATCGGATTTACACCCCTTTTGACCCGGCGGCTCCCGAAAATAGCCGTGCTCTTAATTTGGCATTTGTGGCTCAGGCAGCCCCGGATATTAAAAGGAAACTCCAAAAACTGGAAGGATTTGCTAGAATGAATATCAGTCAGCTTTTAGAAATAGCCCAAAAAGTTTTTGACAATCGAGAGTTTGAAAAACAAAAACAAGCAACACAGGCAGCTGAAAAGGCCGCTGATAAAGCATTCAAAAGACAAACAAAAATCTTAGTGGCGGCTATCCAAGAGGACAGAATGAAATGGCCCCCATTCCAGAAGAATGGCCAAGGAACCTCGGGTTCCCACCAGAAAAGTAAAAGAGGTGAACAGGCCCCTCTAGGAAAAACCAATGTGCCTATTGCAAGCAGACTGGGCACTGGAAAAAGGAGTGCCCACTACTGCCANAAGAAAAGTCAGAAAACAAAAAGGTCCTCACCCTGCCCGCAACGGAGGAGCCTGATGATTGACGGGGCCAGGGCTCCCTCGCTCTTGGCCCCCAGGANCCCATGGTAACTGCTACAGTGGGGGGCCAGCCTGTACGTTTCCTAGTAGACACCGGGGCGGAGCACTCGGTACTGCAGACTCCCTTGGGCAGTGTCTCAAATAAAAAAATGACTGTACAAAGGGCAACTGGAGCTATTCAAGAATATCCTGTCACACGCTCCCGAGAAGTAAACTTGGGACAGAAAAGAGTGACACACTCTTTTCTNGTGGTTCCAGAGTGTCCTTTTCCTCTCCTTGGACGAGACCTGCTCCATAAGTTACAGGCCTCAATCTCCTTTTCAGCTCAGCAGGCTCATCTCACACTAGGAAATGCAACTTCCCCCACTGCCCAACTCTTGCTAACTACCCCTCTGTCAGAAGAATACCTTCTGGTTTCACCATCACAATCACCGGAGGAGAATACTAATACTCTTTTGTTGGACNTACAGACACTTTTTCCCCGAGTTTGGGCCGAGTCAAACCCTCCCGGACTGGCTAAACACCATCCGCCAGTGGTTGTAGAACTCTTGGCCACTGCCATACCGGTCCAGGTAAAGCAATACCCCATGAGTCAGCAGGCTAGAGAGGNGATTAATCCCCACATTCAATGACTGTTACAAGCTGGCATACTTACACCATGTCAGTCGGCCTGGAACACNCCATTTTTGCCGGTCCAGAAACCTGGAACAAATGATTACCGGCCGGTACAAGACTTAAGGGAAGTTAATAAATGGACTGTTACTGTCCATCCAACCGTCCCTAATCCTTATACTCTACTCAGCCTGCTCCCACCAGAACATACAGTATACACTGTCCTTGACCTGAAAGATGCTTTCTTTGCTATTCCTCTGGCCCCCAAAAGCCAGCCGATTTTTGCATTTGAATGGACAGATCCAAGATCAGGAGACACTACCCAACTGACTTGGACTCAGTTACCTCAGGGTTTTAAAAATTCCCCCACCCTTTTTGGGGAGGCTCTTCGGCAAGATCTTATACCTTCCGAGCTAGTCACCCTAACTGTACTCTTCTTCAGTATGTAGATGATATTTTAATAGCTACTGAAACTATGGACAGTTGTCTACAACACACGAGGGACCTGCTCTACCTCCTTCAGGAGCTCGGGTATGGAGTCTCAGCCAAAAAGGCCCAGCTTTGTCTTCCCAGAGTGTCCTACCTGGGGTACGAGATAAACCAAGGAAAAAGGGCACTCACCAGTGCCCGGAAAGAAGCCATCCTGCGAATCCCCACTCCCGCCACCAAGAGACGGGTACGCGAATTNCTGGGGGCCGTGGGATACTGTCGCCTCTGGATATCGGGGTTCGCGGAGATTGCAAAGCCCTTGTATACTGCTACAGGANGNAATGGCCCGCTAATTTGGACAGACACNGAAGAACAGGCTTTTCAAAACCTGAAAAAGGCATTAACTGAAGCCCCTGCTTTAGCCCTCCCTAATATCTCAAAGCCGTTTCACCTGTTTGTCCATGAAAGCCAGGGAGTTGCTAAAGAGGTGCTTACTCAGACTTTAAGACCCTGGAGACGCCCAGTGGCCTATTTATCTAAGAGGCTGGATCCTGTGGCCTCTGGATGGCCAAGTTGTCTGCGAGCCGTAGCGGCTACAGCAAGCCTAGTCCAAGAAGNTGATAAGTTAACTCTAGGCCAAAATTTAACCCTTACAGCTCCTCATGCCGTAGAGACCTTACTACGAAGTGCTTCTGGCAAATGGATGTCAAATGCTCGCATCTTGCAGTATCAGAGTTTACTGTTAGATCAGCCTCGTTTGACTTTCTCTCCCACAAGGTGTTTNAATCCAGCTACACTACTTCCTGACCCAGACTCCACTATTCCTGCTCATGACTGTCAAGAACTGTTAGAAACTACCGAAACTGGCCGACCTGATCTTCAAGATGTGCCCCTAGAAAAGGCGGATGCCGCCGTGTTCACAGACGGTAGCAGCTTCCTCGAGCAGGGAGTACGAAAAGCCGGTGCAGCTGTTACCACGGAGACAGATGTGTTGTAGGCTCAGGCTTTACCAGCGAACACCTCAGCGCAAAAGGCTGAATTGATCGCCCTCACTCAGGCTCTCCGATGGGGTAAGGATAAACGTATTAACATTTACACTGACAGCAGGTACGCCTTTGCTACTGTGCATGTACATGGAGCCATCTACCAGGAANGCGGGCTACTCACCTCAGCAGGAAAGGCTATCAAAAACAAAGAAGAAATTCTAGCCCTGCTTGAAGCCGTGTGGCTCCCTCAGCAGGTAGCTGTGATCCACTGCAAAGGACATCAAAAAGAAAACACGGCCGTTGCCCGTAGTAACCAGAAAGCTGATTCAGCAGCTCAGGTCGCAGCGNGACTTTCAGTCACGCCTCTAAACTTGCTGCCCACAGTCTCCTTTCCACAGCCAGATCTGCCTGACAATCCCGTATACTCAACAAAANAAAAAAAACTGGCTTCAGATCTCAGAGCCAATAAAAATCAGGAAAGTTAGTAGATTCTTCCTGACTCTAGAATCTTCATACCCCGAACTCTTAAAGAAACTTTAATCAGTCACCTACAGTCTACCACCCATTTAAGAAGAGCAAAGCTACCTCAGCTCCTCCGGAGCCATTTTAAGATCCCCCGTCTTCAAAGCCTAACAGATCAAGCAGCTCTCCGGTGCACAACCTGCGCCCAGGTAAATGCCAAGCAAGGTCCTAAACCCAGCCCAGGCCACCGTCTCCGAAAAAACTCGCCAGGAGAAAAGTGGGAAATTGACTTTACAGAAGTAAAACCACACCGGGCTAAGTACAAATACCTTCTAGTACTAGTAGACACCTTCTCCGGATGGACTGAGGCATTTGCTACCGAAAACGAAACCGCCAACACGGTAGTTAAGTTTTTACTCAATGAAATCATCCCTCGATATAGGCTGCCTGCTGCCATAGGGTCTGATAATGGACCGGCCTTCACCTCGCCCATAGCTCAGTCAGTCAGTAAGGCGTTAAACATTCAACGGAAGCTCCATTGTGCCTATCGACCCCAGAGCTCCGGGCAGGTAGAACGCATGAACCGCACCCTAAAAAACACTCTTACAAAATTAATCTTAAAAACCGGTGNAAATTAGGTAAGTCTCCTTCCTTTAGCCCTACTTAGAGTAAGGTGCACCCCTTACCAGGCTAGGTTCTCACCTTTTGAAATCATGTATAGGAAGGCGCCGCCTATCTTGCCTAAGCTAAGAGATGCCNAATTAGCAGAAATATCACAAGCTAATTTATTACAGTACCTACAGTCTCTCCAACAGGTACAAGATATCATCCTGCCACTTGTTCGAGGAGCCCATCCCAATCCAATTCCTGACCAGACGGGGTCCTGCCATTCGTTCCAGCCAGGAGACCTAGTGTTTGTTAAAAAGTTCCAGAAAGAAGGACTCACTCCTGCTTAGAAAAGACCTCACACCGTCATCCTCACGACGCCAACGGCTCTGAAGGTGGACGGCATTCCTGCTTAGATTCATCACTCCCGCATCAAAAAGGCCAACAGAGCCCAACTAAAAACATAGGTCCCCAGGCCTAGGTCAGGCCCCTTAAAACTGCGCCTAAGTCAGGTGAAGCCATTAGATTNATTCTTTTTATCTACCTCACTTGTTTGTTTTTGCCCGTTACGTCCTCTGTGCCTTCCTACTCCTTTCTCCTCACCTCTTTCACAACAGGACGTGTATTTGCAAACACCACTTGGAAGGCCGGTACCTCCAAGGAAGTCTCCTTTGCAGTTGATTTATGTGTACTGTTCCCAAAGCCAGCCCGTACCCACGAAGAGCAACACAATCTGCCAGTCCCAGGAGCAGGAAGTGTCGACCTTGCAGCAAGATTCGGACACTCCGGGAGCCAAACTAGATGTGGAAGCTCCAAAGGTGCAGAAAAAGGACTCCAAAATGTTGACTTTTACCTCTGTCCTAGAAATCACCCTGACGCTAGCTGTCGAGATACTTATCAGTTTTTCTGCCCTGATTAGACATGTGTAACTTTAGCCACCTACTCTAAGAGATCAACCAGATCTTCAACTCTTTCCATAAGTCGTGCTTCTCATCCTAAATTATGTACTAGAAAAAATTGTAATCCTCTTACTATAACTGTCCATGACCCTAATTCAACTCAATAGTATCATGGCATGTCATGAAGATTAAGATTTTATATCCCAGGATTTGATGTTAGGACTATGTTCACCATCCAAAANAAAACCCTGGTCTCATGGAGCCCACCCAAGCCAATCGGGCCTTTAACTGATCTAGGTGACCCTATGTTCCAGAAACACCCTGACAAAGTTGATTTAACTGTTCCTCCACCATTCTTAGTTCCTAAGCCCCAGCTACAANGACANCATCTTCAACCCAGCCTGATGTCTATACTAGGTGGAGTACATCATCTCCTTAACCTCACCCAGCCTAAACTAGCCCAAGATTGTTGGCTATGTTTAAAAGCAAAACCCCCTTATTATGTAGGATTAGGAGTAGAAGCCACACTTAAANGTGGCCCTCTATCCTGTCATACACGACCCCGTGCTCTCACACTAGGAGATGTGTCTGGAAACGCTTCCTGTCTGATTAGTACCGGGTATAACTTATCTGCTTCTCCTTTTCAGGCTATTTGTAATCAGTCCCTGCTTACTTCCATAAGCACCTCAGTCTCTTACCAAGCGCCTAACAATACCTGGTTGGCCTGCACCTCAGGTCTCACTCGCTGCATTAATGGAACTGAACCAGGACCTCTCTTGTGCGTGTTAGTTCATGTNCTTCCCCAGGTATACGTGTACAGTGGACCAGAAGGACAACTCCTCATCGCTCCCCCGGAATTACATCCCAGGTTGCGCCGAGCTGCCCCACTNCTGGTTCCCCTCTTGGCCGGTCTTAGCATAGCTGGATCAGCAGCCATTGGTACGGCTGCCCTGGTTCAAGGAGAAACTGGACTAATGTCCCTGTCTCAACAGGTGGATGCTGATTTAAGTAACCTCCAGTCTGCCATAGATATACTACATTCCCAGGTAGAGTCTCTGGCTGAAGTAGTNCTTCAAAACCGCCGAGGCTTAGATCTGCTATTCCTCTCTCAAGGAGGATTATGCGCAGCTCTAGGAGAAAGCTGTTGCTTCTACGCCAATCAATCTGGAGTCATAAAAGATACACTCCAAAAAGTGCGAGAAAATCTAGATAGGCGCCAACAAGAACGAGAAAATAACATCCCCTGGTATCAAAGCATGTTCAACTGGAACCCATGGCTAACTACTCTAATCACTAAGTTAGCCGGACCCCTCCCCATCCTACTATTAAGTCTAATTTTTGGGCCTTGTATATTAAATTAGTTTCTTAATTTTGTAAAACAACGCATAGCTTCTGTCAAACTTATGTATCTTAAGACTCAATATAACCCCCTTGTTATAACTGAGGAATCAACGATTTGATTCCCCAAAAACACAAGTGGGGAAATGAAATGCCTAACGTTGTTTTTACTCTAACTNGTTACTTTGAATTTTGTCCTGCTTGTCTCTTTAATC
|
TF motifs of the concenus sequence
Use FIMO to detect transcription factor motifs in the concenus sequence of the TE family.
TE_family | TFBS | Start | End | Strand | Score | Matched sequence |
---|---|---|---|---|---|---|
HERVS71 | BPC1 | 460 | 483 | - | 34.30 | GAAAAAGAGAGAGAGAGAGAGAGA |
HERVS71 | BPC5 | 461 | 490 | - | 34.06 | AGAGATAGAAAAAGAGAGAGAGAGAGAGAG |
HERVS71 | BPC1 | 462 | 485 | - | 31.97 | TAGAAAAAGAGAGAGAGAGAGAGA |
HERVS71 | BPC1 | 464 | 487 | - | 31.74 | GATAGAAAAAGAGAGAGAGAGAGA |
HERVS71 | BPC6 | 461 | 481 | + | 31.13 | CTCTCTCTCTCTCTCTCTTTT |
HERVS71 | BPC1 | 468 | 491 | - | 30.89 | GAGAGATAGAAAAAGAGAGAGAGA |
HERVS71 | BPC1 | 466 | 489 | - | 29.91 | GAGATAGAAAAAGAGAGAGAGAGA |
HERVS71 | BPC6 | 463 | 483 | + | 28.47 | CTCTCTCTCTCTCTCTTTTTC |
HERVS71 | BPC6 | 471 | 491 | + | 26.38 | CTCTCTCTTTTTCTATCTCTC |
HERVS71 | BPC6 | 465 | 485 | + | 26.12 | CTCTCTCTCTCTCTTTTTCTA |
TFBS enrichment in GRCh38
Use Fisher's exact test to perform enrichment analysis of transcription factor binding sites in the TE family of GRCh38.