ERVL
Basic information Differential Expression Stage analysis Survival analysis Correlation analysisDF ID | DF0000118 |
---|---|
TE superfamily | ERV3 |
TE class | LTR |
Species | Boreoeutheria |
Length | 5757 |
Kimura value | 20.00 |
Tau index | 0.8293 |
Description | Endogenous retrovirus, ERVL subfamily (internal region) |
Comment | Internal sequence of endogenous retrovirus ancestral to HERVL. ERVL represents a subfamily active in a common ancestor of most or all primates, and possibly outside of primates. |
Sequence |
AATTTTGGTACCAAGAGTGTCTAGAGGAACAGAATTTTAAGGATGGGTTTTCTGAATTGGTTTTGGGGNTTCNGGAATTGGCTNCCTAATNTGATTAGATTTAAAGACGCTAATGACTCTATTTCCAGTAGTAAAGAGAGCACTGATAGTCCATGGCATANCGNNNNATNNNACGNCGTGAACTGTTTATAGAGATATGCAAAATATCTGCATTGGATACTCCTAATCAACCACTTATAAGAGGCAAGGAGCTNAGTGACTCTATATATANTCGATACCTGAACATTTTTGGAAAACTAAGGAATATAATGANGTTGGTTGGTTGCTCCTAATGTCGCTGGACAAAGTGGCGAAAGAAAAGGATGAGCTCAGGGATTCGAATTCCCAGCTCAAGTAGCTNCATAAATGACCTGAAAGCTTCTATGTNTGCCCTGAANGAGANCCTTATCTCCTGTAGCCGCAGGGCTGAAATTGCTGAAAATCAAACGCAGAATCTCATCCTGCGANTGGCTGAATTACAACGCAAGTTGAACTCCCAGCCTCGCAAGGTGTCTACTGTTAAAGTGAGGGCATTGATTGGGAAAGAATGGGATCCTGNAAGTTGGAATGGGGACGTGTGGGAAGACCCTGATGAAGCTGGGGACATTGAGCCCCTAAATTCTGATGAGTCTTCTTTTGCCAGTGGAAGCGGCCTCCCCACCCCCAGCAGAAGTGGCCTCCCCACCCCCAGTGGCAGCGGCTTCCCCACCCACGGTGGTATCGGCCTTTCCACCTCGTCTGAGGGGATTAACCCTGCATTGCCTGAGGAAACGGTAATGGCCTCCCCTGAGGCAGTTGCCATGCAAGACAATGCTGATTCTCCTCAGGACCCACCCCCACCACCCCTCTTTGCTTCTAGACCTATAACTAGACTCAAGTCCCAGCAGGCCCCTAAAGGTGAGGTACAAAGTGTGACCCATGAGGAGGTGCGCTACACTCCAAAAGAACTACTTGAGTTTTCTAATTTATACAGGCAGAAATCTGGGGAACATGTATGGGAATGGATATTAAGGGTGTGGGATAATGGCGGAAGGAACATAAAGTTGGATCAGGCCGAATTTATTGATATGGGCTCACTAAGCAGAGATTCTGCATTTAATGTTGCAGCTCGGGGAGTTAGAAAGGGCTCTAACAGTTTGNTTGNTTGGTTGGCTGAAACATGGATCAAAAGATGGCCCACCGTGAGCGAGCTGGAAATGCCCGANCTCCCTTGGTTTAATGTAGAGGAAGGGATCCAAAGGCTTAGGGAGATTGGAATGTTGGAGTGGATTTGTCATTTAAGACCCACTCACCCACACTGGGAGGGTCCAGAAGACATNCCTTTCACCAANACTTTGAGAAATANATTTGTGAGGGGAGCNCCAGCATCCTTGAAGAGCTCTGTGATNGCTCTTCTCTGTAGGCCAGACCTTACAGTGGGAACCGCAGTCACTNAACTGGAAAACTTAAATGCAATGGGAATAATNGGATCCCGGGGTGGCAGGGGCCAAGTGGCGGCACTCAACCGCCAAAGGCAAGGTGGGCGTAGTTACCGTAATGGACAGCAGAGNCGAAGCGGCAATCAGAATAGTCTGACTCATGCAGACCTNTGGCANTGGCTAATTAATCATGGTGTTCCTAGAAGTGAAATAGATAGGAAGCCTACTAAATTCTTACTTGATCTGTATAAGCAGAAAACTTCTAGGTCNAGTGAACAAAAGTCTAACTCGAATCATAAAAACAGAGAGTCACGGCCCCTCAATCAATTCCCAGACTTGAGCCAGTTTACAGACCCAGAACCCCTTGAATGAAGGGGAGGCCGGGTCCCCTCGAGGAAGGACCCTGNTACACTACCGAAAATNTATACTGTNAATCTTTCTCCCAGCCTTCCCCAAAGGGACCTGCGGCCNTTTACCAGGGTAACTGTGCACTGGGGAAAGGGAAATANTCAGACCTTTCGGGGACTACTGGACACTGGCTCTGAACTGACACTGATTCCAGGAGACCCAAAACGTCACCGTGGCCCTCCAGTCAGAGTAGGGGCTTACGGAGGTCAGGTGATNAATGGAGTTTTAGCTCAGGTCCATCTCACAGTGGGTCCAGTGGGTCCCCGAACCCATCCTGCGGTTATTTCCCCAGTTCCAGAATGCATAATTGGAATAGACATACTTAGCAGCTGGCAGAATCCCCACATTGGTTCCCTGACCTGTGGAGTGAGGGCTATTATGGTGGGAAAGGCCAAGTGGAAGCCNCTAGAACTGCCTCTACCTAGGAAAATAGTAAATCAAAAGCAATACCGCATCCCTGGAGGAATTGCAGAGATTAGTGCCACCATCAAGGACTTGAAAGATGCAGGGGTGGTGATTCCCACCACATCCCCATTCAACTCNCCTATTTGGCCTGTGCAGAAGACAGATGGATCTTGGAGAATGACAGTGGATTATCGTAAGCTTAACCAAGTGGTGACTCCAATTGCAGCTGCTGTNCCAGATGTGGTTTCATTGCTTGAGCAAATTAACACATCCCCTGGTACCTGGTATGCAGCTATTGATCTGGCAAATGCCTTTTTCTCCATNCCTGTCCATAAGGCCCACCAGAAGCAGTTTGCTTTCAGCTGGCAAGGCCAGCAATATACCTTCACTGTCCTACCTCAGGGNTATATCAACTCTCCAGCCCTATGTCATAATTTAGTTCGCAGGGATCTTGATCGCCTTTCCCTTCCACAAGATATCACACTGGTCCATTACATTGATGACATTATGCTGATTGGACCTAGTGAGCAAGAAGTAGCAANTACTCTAGACTTATTGGTAAGACATTTGCGTGTCAGAGGGTGGGAAATAAATCCGACTAAAATTCAGGGGCCTTCTACCTCAGTGAAATTTCTAGGGGTCCAGTGGTGTGGGGCATGTCGAGATATCCCTTCTAAGGTGAAGGATAAGTTGTTGCATCTGGCCCCTCCTACAACCAAGAAAGAGGCACAACGCCTAGTGGGCCTNTTTGGATTTTGGAGGCAACATATTCCTCATTTGGGTGTGTTACTCCGGCCCATTTACCGAGTGACCCGAAAAGCTGCTAGTTTTGAGTGGGGCCCAGAACAGGAGAAGGCTCTGCAACAGGTCCAGGCTGCTGTGCAAGCTGCTCTGCCACTTGGGCCATATGATCCAGCAGATCCAATGGTGCTTGAAGTGTCAGTGGCAGATAGGGATGCTGTTTGGAGCCTTTGGCAGGCCCCTATAGGTGAATCGCAGCGCAGGCCTTTAGGATTTTGGAGCAAGGCCCTGCCATCNTCCGCAGATAACTACTCTCCTTTTGAGAGACAGCTCTTGGCCTGCTACTGGGCCTTAGTAGAGACTGAACGCTTGACCATGGGCCACCAAGTTACCATGCGACCTGAGCTGCCCATCATGAACTGGGTGTTATCTGACCCACCAAGCCATAAAGTTGGGCATGCACAGCAGCACTCCATCATCAAATGGAAGTGGTATATACGTGATCGGGCCCGAGCAGGCCCTGAAGGCACAAGTAAGTTACATGAAGAAGTGGCCCAAATGCCCATGGTCCCCACTCCTGCTACACTGCCTTCTCTCTCCCAGCCTGCACCTATGGCCTCATGGGGAGTTTCCCTACGATCAGTTGACAGAGGAAGAGAAGACTCGGGCCTGGTTTACAGATGGTTCTGCACGATATGCAGGCACCACCCGAAAGTGGACAGCTGCAGCACTACAGCCCCTTTCTGGGACATCCCTGAAGGACAGTGGTGAAGGGAAATCCTCCCAGTGGGCAGAACTTCGAGCAGTGCACCTGGTTGTGCACTTTGCTTGGAAGGAGAAATGGCCAGACGTGCGATTATATACCGATTCATGGGCTGTAGCCAATGGTTTGGCTGGATGGTCAGGGACTTGGAAGGAACATGATTGGAAAATTGGTGACAAAGAAATTTGGGGAAGAGGTATGTGGATAGACCTCTCTGAATGGGCAAAAAACGTGAAGATATTTGTGTCCCATGTGAATGCTCACCAAAGGGTGACCTCAGCAGAGGAGGATTTTAATAATCAAGTGGATAGGATGACCCGTTCTGTGGATACCAGTCAGCCTCTTTCCCCAGCCACCCTGTCATTGCCCAATGGGCTCATGAACAAAGTGGCCATGGTGGCAGGGATGGAGGTTATGCATGGGCTCAGCAACATGGACTTCCACTCACCAAGGCCGACCTGGCTACGGCCACCGCTGAGTGCCCAATCTGCCAGCAGCAGAGACCAACACTGAGCCCCCGATATGGCACCATTCCCCGGGGTGATCAGCCAGCTACCTGGTGGCAGGTTGATTACATTGGACCNCTTCCATCATGGAAGGGGCAGCGTTTTGTCCTTACTGGAATAGACACTTACTCTGGATACGGATTTGCCTTCCCTGCACGCAATGCTTCTGCCAAAACTACCATCCGTGGACTTACAGAATGCCTTATCCACCATCATGGTATTCCACACAGCATTGCTTCTGACCAAGGAACTCACTTCACAGCAAAAGAAGTGCGGCAATGGGCTCATGCTCATGGAATTCACTGGTCTTACCATGTTCCCCACCATCCTGAAGCAGCTGGCTTGATAGAACGGTGGAATGGCCTTTTGAAGACTCAGTTACAGCGCCAGCTAGGTGGCAATACCTTGCAGGGCTGGGGCAAGGTTCTCCAGAAGGCTGTATATGCTCTGAATCAGCGTCCAATATATGGTGCTGTTTCTCCCATAGCCAGGATTCACGGGTCCAGGAATCAAGGGGTGGAAATGGGAGTGGCACCACTCACTATTACCCCTAGTGACCCACTAGCAAAATTTTTGCTTCCTGTTCCCGCGACTTTATGCTCTGCTGGCCTAGAGGTCTTAGTTCCAGAGGGAGGAATGCTTCCACCAGGAGACACAACAATGATTCCATTGAACTGGAAGTTAAGACTGCCACCCGGCCACTTTGGGCTCCTCATGCCTCTGAGTCAACAGGCAAAGAAGGGAGTTACGGTGTTGGCTGGGGTGATTGATCCNGACTACCAAGGGGAAATTGGACTACTACTCCACAATGGAGGTAAGGAAGAGTATGTCTGGAATACAGGAGATCCCTTAGGGCGTCTCTTAGTATTACCATGCCCTGTGATTAAGGTCAATGGAAAACTACAACAACCCAATCCAGGCAGGACTACNAATGGCCCAGACCCTTCAGGAATGAAGGTTTGGGTCACCCCGCCAGGTAAAGAACCATGACCAGCTGAGGTGCTTGCTGAAGGCAAAGGGAATACAGAATGGGTAGTAGAAGAAGGTAGTTATAAATACCAGCTACGACCACGTGACCAGTTACAGAAACGAGGACTGTAATTGTCATGAGTATTTCCTCCTTATTTTGTTATGAATATGTTTGTGTGTATATATACATATATTAAGCAAATATCTTTGTTTTCTTTCCTCTCTTATTCCCTTATCATGTAACATAAGATGTATTGACTTTATATCAGTATTTAAGTATTGTTAATTTTACATCATAGTATTTAAGTTACGGGATATCAAGGAGAAGAGTAAACATCACTCAAGGACTTTACCTCCTCTTCTGGGGAAGGGGTTAGTGCGTTTTCGGTTGTACGCAGGATAGTTGTATCATGTTAGGTGGAATTATGACCTTGTTATTGTCTTTATTTGGAGATTAAGTATGGTTTAAGGAGATGCGTATGGGTGCCAAGTTGACAAGGGGTGGACT
|
TF motifs of the concenus sequence
Use FIMO to detect transcription factor motifs in the concenus sequence of the TE family.
TE_family | TFBS | Start | End | Strand | Score | Matched sequence |
---|---|---|---|---|---|---|
ERVL | ZNF157 | 2639 | 2659 | + | 33.88 | AGGCCAGCAATATACCTTCAC |
ERVL | ZNF707 | 3581 | 3595 | + | 21.75 | CCCCACTCCTGCTAC |
ERVL | ZNF320 | 3567 | 3586 | - | 19.81 | GTGGGGACCATGGGCATTTG |
ERVL | Klf15 | 864 | 885 | + | 19.08 | CAGGACCCACCCCCACCACCCC |
ERVL | Klf15 | 862 | 883 | + | 18.58 | CTCAGGACCCACCCCCACCACC |
ERVL | TCP7 | 3104 | 3114 | + | 17.20 | GTGGGGCCCAG |
ERVL | ZNF281 | 695 | 704 | - | 17.16 | GGGGGTGGGG |
ERVL | ZNF281 | 719 | 728 | - | 17.16 | GGGGGTGGGG |
ERVL | odd | 3350 | 3360 | - | 17.08 | CCCAGTAGCAG |
ERVL | Usf | 5348 | 5357 | + | 17.05 | ACCACGTGAC |
TFBS enrichment in GRCh38
Use Fisher's exact test to perform enrichment analysis of transcription factor binding sites in the TE family of GRCh38.