ERVL
DF ID | DF0000118 |
---|---|
TE superfamily | ERV3 |
TE class | LTR |
Species | Boreoeutheria |
Length | 5757 |
Kimura value | 20.00 |
Tau index | 0.8293 |
Description | Endogenous retrovirus, ERVL subfamily (internal region) |
Comment | Internal sequence of endogenous retrovirus ancestral to HERVL. ERVL represents a subfamily active in a common ancestor of most or all primates, and possibly outside of primates. |
Sequence |
AATTTTGGTACCAAGAGTGTCTAGAGGAACAGAATTTTAAGGATGGGTTTTCTGAATTGGTTTTGGGGNTTCNGGAATTGGCTNCCTAATNTGATTAGATTTAAAGACGCTAATGACTCTATTTCCAGTAGTAAAGAGAGCACTGATAGTCCATGGCATANCGNNNNATNNNACGNCGTGAACTGTTTATAGAGATATGCAAAATATCTGCATTGGATACTCCTAATCAACCACTTATAAGAGGCAAGGAGCTNAGTGACTCTATATATANTCGATACCTGAACATTTTTGGAAAACTAAGGAATATAATGANGTTGGTTGGTTGCTCCTAATGTCGCTGGACAAAGTGGCGAAAGAAAAGGATGAGCTCAGGGATTCGAATTCCCAGCTCAAGTAGCTNCATAAATGACCTGAAAGCTTCTATGTNTGCCCTGAANGAGANCCTTATCTCCTGTAGCCGCAGGGCTGAAATTGCTGAAAATCAAACGCAGAATCTCATCCTGCGANTGGCTGAATTACAACGCAAGTTGAACTCCCAGCCTCGCAAGGTGTCTACTGTTAAAGTGAGGGCATTGATTGGGAAAGAATGGGATCCTGNAAGTTGGAATGGGGACGTGTGGGAAGACCCTGATGAAGCTGGGGACATTGAGCCCCTAAATTCTGATGAGTCTTCTTTTGCCAGTGGAAGCGGCCTCCCCACCCCCAGCAGAAGTGGCCTCCCCACCCCCAGTGGCAGCGGCTTCCCCACCCACGGTGGTATCGGCCTTTCCACCTCGTCTGAGGGGATTAACCCTGCATTGCCTGAGGAAACGGTAATGGCCTCCCCTGAGGCAGTTGCCATGCAAGACAATGCTGATTCTCCTCAGGACCCACCCCCACCACCCCTCTTTGCTTCTAGACCTATAACTAGACTCAAGTCCCAGCAGGCCCCTAAAGGTGAGGTACAAAGTGTGACCCATGAGGAGGTGCGCTACACTCCAAAAGAACTACTTGAGTTTTCTAATTTATACAGGCAGAAATCTGGGGAACATGTATGGGAATGGATATTAAGGGTGTGGGATAATGGCGGAAGGAACATAAAGTTGGATCAGGCCGAATTTATTGATATGGGCTCACTAAGCAGAGATTCTGCATTTAATGTTGCAGCTCGGGGAGTTAGAAAGGGCTCTAACAGTTTGNTTGNTTGGTTGGCTGAAACATGGATCAAAAGATGGCCCACCGTGAGCGAGCTGGAAATGCCCGANCTCCCTTGGTTTAATGTAGAGGAAGGGATCCAAAGGCTTAGGGAGATTGGAATGTTGGAGTGGATTTGTCATTTAAGACCCACTCACCCACACTGGGAGGGTCCAGAAGACATNCCTTTCACCAANACTTTGAGAAATANATTTGTGAGGGGAGCNCCAGCATCCTTGAAGAGCTCTGTGATNGCTCTTCTCTGTAGGCCAGACCTTACAGTGGGAACCGCAGTCACTNAACTGGAAAACTTAAATGCAATGGGAATAATNGGATCCCGGGGTGGCAGGGGCCAAGTGGCGGCACTCAACCGCCAAAGGCAAGGTGGGCGTAGTTACCGTAATGGACAGCAGAGNCGAAGCGGCAATCAGAATAGTCTGACTCATGCAGACCTNTGGCANTGGCTAATTAATCATGGTGTTCCTAGAAGTGAAATAGATAGGAAGCCTACTAAATTCTTACTTGATCTGTATAAGCAGAAAACTTCTAGGTCNAGTGAACAAAAGTCTAACTCGAATCATAAAAACAGAGAGTCACGGCCCCTCAATCAATTCCCAGACTTGAGCCAGTTTACAGACCCAGAACCCCTTGAATGAAGGGGAGGCCGGGTCCCCTCGAGGAAGGACCCTGNTACACTACCGAAAATNTATACTGTNAATCTTTCTCCCAGCCTTCCCCAAAGGGACCTGCGGCCNTTTACCAGGGTAACTGTGCACTGGGGAAAGGGAAATANTCAGACCTTTCGGGGACTACTGGACACTGGCTCT
GAACTGACACTGATTCCAGGAGACCCAAAACGTCACCGTGGCCCTCCAGTCAGAGTAGGGGCTTACGGAGGTCAGGTGATNAATGGAGTTTTAGCTCAGGTCCATCTCACAGTGGGTCCAGTGGGTCCCCGAACCCATCCTGCGGTTATTTCCCCAGTTCCAGAATGCATAATTGGAATAGACATACTTAGCAGCTGGCAGAATCCCCACATTGGTTCCCTGACCTGTGGAGTGAGGGCTATTATGGTGGGAAAGGCCAAGTGGAAGCCNCTAGAACTGCCTCTACCTAGGAAAATAGTAAATCAAAAGCAATACCGCATCCCTGGAGGAATTGCAGAGATTAGTGCCACCATCAAGGACTTGAAAGATGCAGGGGTGGTGATTCCCACCACATCCCCATTCAACTCNCCTATTTGGCCTGTGCAGAAGACAGATGGATCTTGGAGAATGACAGTGGATTATCGTAAGCTTAACCAAGTGGTGACTCCAATTGCAGCTGCTGTNCCAGATGTGGTTTCATTGCTTGAGCAAATTAACACATCCCCTGGTACCTGGTATGCAGCTATTGATCTGGCAAATGCCTTTTTCTCCATNCCTGTCCATAAGGCCCACCAGAAGCAGTTTGCTTTCAGCTGGCAAGGCCAGCAATATACCTTCACTGTCCTACCTCAGGGNTATATCAACTCTCCAGCCCTATGTCATAATTTAGTTCGCAGGGATCTTGATCGCCTTTCCCTTCCACAAGATATCACACTGGTCCATTACATTGATGACATTATGCTGATTGGACCTAGTGAGCAAGAAGTAGCAANTACTCTAGACTTATTGGTAAGACATTTGCGTGTCAGAGGGTGGGAAATAAATCCGACTAAAATTCAGGGGCCTTCTACCTCAGTGAAATTTCTAGGGGTCCAGTGGTGTGGGGCATGTCGAGATATCCCTTCTAAGGTGAAGGATAAGTTGTTGCATCTGGCCCCTCCTACAACCAAGAAAGAGGCACAACGCCTAGTGGGCCTNTTTGGATTTTGGAGGCAACATATTCCTCATTTGGGTGTGTTACTCCGGCCCATTTACCGAGTGACCCGAAAAGCTGCTAGTTTTGAGTGGGGCCCAGAACAGGAGAAGGCTCTGCAACAGGTCCAGGCTGCTGTGCAAGCTGCTCTGCCACTTGGGCCATATGATCCAGCAGATCCAATGGTGCTTGAAGTGTCAGTGGCAGATAGGGATGCTGTTTGGAGCCTTTGGCAGGCCCCTATAGGTGAATCGCAGCGCAGGCCTTTAGGATTTTGGAGCAAGGCCCTGCCATCNTCCGCAGATAACTACTCTCCTTTTGAGAGACAGCTCTTGGCCTGCTACTGGGCCTTAGTAGAGACTGAACGCTTGACCATGGGCCACCAAGTTACCATGCGACCTGAGCTGCCCATCATGAACTGGGTGTTATCTGACCCACCAAGCCATAAAGTTGGGCATGCACAGCAGCACTCCATCATCAAATGGAAGTGGTATATACGTGATCGGGCCCGAGCAGGCCCTGAAGGCACAAGTAAGTTACATGAAGAAGTGGCCCAAATGCCCATGGTCCCCACTCCTGCTACACTGCCTTCTCTCTCCCAGCCTGCACCTATGGCCTCATGGGGAGTTTCCCTACGATCAGTTGACAGAGGAAGAGAAGACTCGGGCCTGGTTTACAGATGGTTCTGCACGATATGCAGGCACCACCCGAAAGTGGACAGCTGCAGCACTACAGCCCCTTTCTGGGACATCCCTGAAGGACAGTGGTGAAGGGAAATCCTCCCAGTGGGCAGAACTTCGAGCAGTGCACCTGGTTGTGCACTTTGCTTGGAAGGAGAAATGGCCAGACGTGCGATTATATACCGATTCATGGGCTGTAGCCAATGGTTTGGCTGGATGGTCAGGGACTTGGAAGGAACATGATTGGAAAATTGGTGACAAAGAAATTTGGGGAAGAGGTATGTGGATAGACCTCTCTGAATGGGCAA
AAAACGTGAAGATATTTGTGTCCCATGTGAATGCTCACCAAAGGGTGACCTCAGCAGAGGAGGATTTTAATAATCAAGTGGATAGGATGACCCGTTCTGTGGATACCAGTCAGCCTCTTTCCCCAGCCACCCTGTCATTGCCCAATGGGCTCATGAACAAAGTGGCCATGGTGGCAGGGATGGAGGTTATGCATGGGCTCAGCAACATGGACTTCCACTCACCAAGGCCGACCTGGCTACGGCCACCGCTGAGTGCCCAATCTGCCAGCAGCAGAGACCAACACTGAGCCCCCGATATGGCACCATTCCCCGGGGTGATCAGCCAGCTACCTGGTGGCAGGTTGATTACATTGGACCNCTTCCATCATGGAAGGGGCAGCGTTTTGTCCTTACTGGAATAGACACTTACTCTGGATACGGATTTGCCTTCCCTGCACGCAATGCTTCTGCCAAAACTACCATCCGTGGACTTACAGAATGCCTTATCCACCATCATGGTATTCCACACAGCATTGCTTCTGACCAAGGAACTCACTTCACAGCAAAAGAAGTGCGGCAATGGGCTCATGCTCATGGAATTCACTGGTCTTACCATGTTCCCCACCATCCTGAAGCAGCTGGCTTGATAGAACGGTGGAATGGCCTTTTGAAGACTCAGTTACAGCGCCAGCTAGGTGGCAATACCTTGCAGGGCTGGGGCAAGGTTCTCCAGAAGGCTGTATATGCTCTGAATCAGCGTCCAATATATGGTGCTGTTTCTCCCATAGCCAGGATTCACGGGTCCAGGAATCAAGGGGTGGAAATGGGAGTGGCACCACTCACTATTACCCCTAGTGACCCACTAGCAAAATTTTTGCTTCCTGTTCCCGCGACTTTATGCTCTGCTGGCCTAGAGGTCTTAGTTCCAGAGGGAGGAATGCTTCCACCAGGAGACACAACAATGATTCCATTGAACTGGAAGTTAAGACTGCCACCCGGCCACTTTGGGCTCCTCATGCCTCTGAGTCAACAGGCAAAGAAGGGAGTTACGGTGTTGGCTGGGGTGATTGATCCNGACTACCAAGGGGAAATTGGACTACTACTCCACAATGGAGGTAAGGAAGAGTATGTCTGGAATACAGGAGATCCCTTAGGGCGTCTCTTAGTATTACCATGCCCTGTGATTAAGGTCAATGGAAAACTACAACAACCCAATCCAGGCAGGACTACNAATGGCCCAGACCCTTCAGGAATGAAGGTTTGGGTCACCCCGCCAGGTAAAGAACCATGACCAGCTGAGGTGCTTGCTGAAGGCAAAGGGAATACAGAATGGGTAGTAGAAGAAGGTAGTTATAAATACCAGCTACGACCACGTGACCAGTTACAGAAACGAGGACTGTAATTGTCATGAGTATTTCCTCCTTATTTTGTTATGAATATGTTTGTGTGTATATATACATATATTAAGCAAATATCTTTGTTTTCTTTCCTCTCTTATTCCCTTATCATGTAACATAAGATGTATTGACTTTATATCAGTATTTAAGTATTGTTAATTTTACATCATAGTATTTAAGTTACGGGATATCAAGGAGAAGAGTAAACATCACTCAAGGACTTTACCTCCTCTTCTGGGGAAGGGGTTAGTGCGTTTTCGGTTGTACGCAGGATAGTTGTATCATGTTAGGTGGAATTATGACCTTGTTATTGTCTTTATTTGGAGATTAAGTATGGTTTAAGGAGATGCGTATGGGTGCCAAGTTGACAAGGGGTGGACT
|
TF motifs of the consensus sequence
Use FIMO to detect transcription factor motifs in the consensus sequence of the TE family.
TE_family | TFBS | Start | End | Strand | Score | Matched sequence |
---|---|---|---|---|---|---|
ERVL | TFEC | 5350 | 5357 | + | 15.88 | CACGTGAC |
ERVL | LHY | 201 | 209 | - | 15.86 | AGATATTTT |
ERVL | PK19717.1 | 3104 | 3112 | - | 15.77 | GGGCCCCAC |
ERVL | SREBF1 | 2071 | 2080 | - | 15.63 | ATCACCTGAC |
ERVL | cassava45561.m1 | 3104 | 3112 | - | 15.61 | GGGCCCCAC |
ERVL | RVE1 | 201 | 209 | + | 15.59 | AAAATATCT |
ERVL | SREBF2 | 5038 | 5047 | + | 15.54 | CTGGGGTGAT |
ERVL | VIP1 | 2629 | 2638 | - | 15.51 | TGCCAGCTGA |
ERVL | KLF4 | 695 | 702 | + | 15.48 | CCCCACCC |
ERVL | KLF4 | 719 | 726 | + | 15.48 | CCCCACCC |
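
The relationship between the Start, End, Strand and Matched sequence columns can be illustrated with a minimal Python sketch. It assumes that coordinates are 1-based and inclusive and that a hit on the minus strand is reported as the reverse complement of the forward-strand consensus; neither convention is stated on this page, although both are consistent with the rows above (the LHY and RVE1 hits at 201-209 are reverse complements of each other). The helper names are hypothetical, and only a subset of rows is copied in.

```python
# Rows copied from the table above: (TFBS, start, end, strand, matched sequence).
HITS = [
    ("TFEC", 5350, 5357, "+", "CACGTGAC"),
    ("LHY", 201, 209, "-", "AGATATTTT"),
    ("RVE1", 201, 209, "+", "AAAATATCT"),
    ("SREBF1", 2071, 2080, "-", "ATCACCTGAC"),
    ("KLF4", 695, 702, "+", "CCCCACCC"),
]

# Translation table for complementing DNA; N stays N.
COMPLEMENT = str.maketrans("ACGTN", "TGCAN")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def span_matches_length(start: int, end: int, matched: str) -> bool:
    """Check the span against the motif length.

    Assumption: coordinates are 1-based and inclusive.
    """
    return end - start + 1 == len(matched)


def forward_strand_sequence(strand: str, matched: str) -> str:
    """Return the hit as it would read on the forward strand of the consensus.

    Assumption: minus-strand hits are reported as reverse complements.
    """
    return matched if strand == "+" else reverse_complement(matched)


for name, start, end, strand, matched in HITS:
    ok = span_matches_length(start, end, matched)
    print(name, start, end, forward_strand_sequence(strand, matched), ok)
```

Under these assumptions, two motifs reported on opposite strands at the same coordinates (such as LHY and RVE1) describe the same stretch of the consensus sequence.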
TFBS enrichment in GRCh38
Use Fisher's exact test to test for enrichment of transcription factor binding sites in copies of this TE family in GRCh38.
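
As a rough illustration of this kind of enrichment test, the sketch below computes a one-sided Fisher's exact p-value from a 2x2 contingency table using only the Python standard library. The layout of the table (binding sites inside versus outside copies of the family, for one transcription factor versus all others) and the counts are assumptions made for illustration; the page does not state how the table is built or whether a one-sided or two-sided test is used.

```python
from math import comb


def fisher_exact_greater(a: int, b: int, c: int, d: int) -> float:
    """One-sided (enrichment) Fisher's exact test for the table [[a, b], [c, d]].

    Assumed, hypothetical layout:
        a = sites of the TF inside copies of the TE family
        b = sites of the TF outside the family
        c = sites of other TFs inside the family
        d = sites of other TFs outside the family
    Returns P(X >= a) under the hypergeometric null with fixed margins.
    """
    row1 = a + b
    row2 = c + d
    col1 = a + c
    total = row1 + row2
    denominator = comb(total, col1)
    upper = min(row1, col1)
    # Sum the probabilities of all tables at least as extreme as the observed one.
    tail = sum(comb(row1, x) * comb(row2, col1 - x) for x in range(a, upper + 1))
    return tail / denominator


# Toy counts for illustration only; they are not taken from the database.
p_value = fisher_exact_greater(3, 1, 1, 3)
print(f"one-sided p = {p_value:.4f}")
```

For the toy table the tail sum is (16 + 1) / 70. A small p-value from real counts would suggest that the factor's binding sites occur in copies of the family more often than expected by chance.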