ERVL-E
Basic information Differential Expression Stage analysis Survival analysis Correlation analysisDF ID | DF0000120 |
---|---|
TE superfamily | ERV3 |
TE class | LTR |
Species | Eutheria |
Length | 5667 |
Kimura value | 35.44 |
Tau index | 0.8922 |
Description | ERVL endogenous retrovirus, ERVL-E subfamily |
Comment | MLT2E LTRs. ORFs at roughly pos 131-1720, 1725-5267. |
Sequence |
ATTATTGGTACCGGGAGTGGTTCCAGGGGAACAGAACCTTAAGGATGGGAATCTGGAATTGGTTCTCTGATCTGATTAGATTTAAAGGCGCTAATGACCCTGTTTCCAGTGGTAAAGGGGACACTGGTAGTCCATGGCATGCAGTGGCAAAAAGTTACTCAAATTATCACCTGTGGNCACCTGTAATCAAGTGCCTATAGAAGGCAAGGCTTTGGGTGACCANGTANTTGCTGCCNTAGAACATTTTAGTGGAAATAAGGAGTATAATGANGTTGGTTGGTTGCTTCTAANTGCGCTGGAGAACTTGGAGAAAGAAAATGATGAGCTCAGGGCTTTAAATTCCCAGCTCAAGNTCCGGGTAAGGGACCNGAAAGCTTCTATGACTGCCCTGAAAGAAACCCTTATCTCCTGTAGCCGCAGGGCTGAGATTTCTGAAAACCAAACCCAAAGTCTNATCCTGCGGGTGGCTGAATTACAACGCAAATTGAATTCNCAACCTCGCAGGGTNTCTTNTGTTAAAGTTAGGGCATTGATTGGGAAGGAATGGGACCCTGAAAATTGGAATGGGGACATNTGGGCGGATTCCGATGAAGCTGGGGACCTTGAACCCCTAAATTCTGCCGAGCCTTCTTTGCCAGTAGAAGCAGCCCTTTCNCCCCTGTCTGAGGAGGTTAGTCTCCCCTTGCCTGAAGAANCTGTAATGGCCTCCCCTGAGGTAGTTGCCTTGCAAGGNANTGCTGATTCTCCTCAGGACCTACCCCCACCACCTCTCNTTGCTTCTAGACCTATAACTAGACTCAAGTCCCAGCAGGCCCCNAGGGGTNAGGTACAAAGTGTGACCCATGAGGAGGTANNNTACACACCAAAAGAATTGCAAGATTTTNCCAATTTATATCGACAGAAACCTGGGGAATATGTGTGGGAATGGATNCTAAGGGTGTTGGATCANGGTGGAAGGAATATAACGTTGGATCGGGCCGAATTTATTGATATGGGTNCACTAAGCAGAGATTCTGGATTCAATGTGTTAGCTCGAGNAGCTGGAAGTGGCTCTAACAGTTTGCTTGGTTGGTTGACTGAAACNTGGACTCAAAGGTGGCCTACANTNAATGAAGTTGAGATGCCAGAACTTCCTTGGTATANTGTAGAGGAAGGNATCCAAAGGCTTAGGGAGATNGGAATGTTGGAGTGGATTTATCATGTAAGACCTGCTCACCTACACCCTCTAACTATGTCCCCGGGAGGGTCCAGGGACACTCCCTTCACCAAGGCNTTGAGAAATACATTNGTGAGGGGAGCACCAGCATCCTTGAAGAGCTCTGTGGTGGCTNTTCTCTGTAGGCCAGGNATGACGGTGGGAGATGCTGCCATTGAAATGGGCTCCCTGANTTCAATGGGGATGATGGGATCCCGGGGTGGCAGAGGCCAAGTGGCGGCACTTAACCGCCAGAGACAAGGTGGGCGCGGTTACCGTAATGGGCAGCAGAGCCAAAGCGGTAATCAGAATGGTTTGACCCGCAGAGATCTTTGGCGNTGGCTAATTGATCATGGTGTCCCTAGGANTGAAATAGATGGGCAGCCTACTAAAGTCTTACTTGATTTGTATAAGCAGAAAAGCTCTAGGTCTGGTGAACAGAAGTCTGACTTGAGTCACCAACGGAGAGTCACGGCCCCTCANCCAGTTCCCAGACTTGAGCCAGTTCACAGCCAGACCCCTTGAATGAAGGGGAGGCCGGGTCCCCTTGAGGAAGGACCCTGCNACACTGCCAAAAATNTATACTGTAAATCTTCCTCCNAGCCTTCCCCAAAGGGACCTGCGGCCATTTACCAGGGTGACTGTGCACTGGGGAAAGGGAAATANCCAGACTTTTNAGGGATTACTGGACACTGGCTCTGAACTGACGCTAATTCCTGGAGACCCAAAACGCCACTGTGGTCCACCAGTCAGAGTAGGGGCTTATGGAGGTCAGGTGATNAATGGAGTTTTGGCTCGAGTCCGTCTCACAGTGGGCCCAGTGGGTCCNCGAACCCACCCTGTGGTTATTTCCCCAGTTCCNGAATGCATAGTTGGAATAGACATACTCAGCAACTGGCAGAATCCCCACATTGGTTCCCTGACCCGTGGAGTGAGGGCTATTATGGTAGGAAAGGCCAAGTGGAAGCCNCTGGAACTGCCTCTNCCTACCAAAATAGTAAACCAAAAGCAATACCGCATCCCTGGAGGAATTGCAGAGATTAGTGCCACCATCAAAGACTTGAAAGATGCAGGGGTGGTGATTCCTACCACATCCCCATTTAACTCGCCTGTTTGGCCTGTGCAGAAGACAGATGGATCTTGGAGAATGACAGTGGATTATCGTAAACTTAATCAGGTGGTGACTCCAATTGCAGCTGCTGTTCCAGATGTGGTNTCTTTACTGGAGCAAATCAACACANCCCCTGGCACCTGGTATGCAGCTATTGATCTGGCAAATGCTTTTTTCTCTATACCTGTTAGTAAAGACCACCAGAAGCAGTTTGCTTTCACCTGGCAGGGNCAGCAGTACACCTTCACTGTCTTGCCTCAGGGCTATGTCAACTCTCCNGCTCTCTGTCATAATNTAGTCCGCAGGGACCTTGATCGTCTTNNCATTCCACAGGACATCACGCTGGTCCACTACATTGATGACATCATGCTGATTGGACCTGGTGAGCAGGAAGTAGCAAGTACTCTAGACGCCTTGGTAAGACACATGCGTGCCAGAGGGTGGGAGATAAATCCCACGAAAATTCAGGGGCCTGCCACCTCGGTGAAGTTTCTAGGGGTCCAGTGGTCTGGGGCATGTCGAGATATCCCTTCCAAGGTGAAGGACAAGTTGCTGCATCTNGCNCCTCCTACCACTAAGAAAGAGGCACAATGCTTGGTGGGCCTCTTTGGATTTTGGAGGCAACATATACCNCATTTGGGCGTGCTGCTCCGACCCATTTACCGAGTAACCCGNAAGGCTGCCAGTTTTGAGTGGGGCCCAGAGCAAGAGAAGGCTCTGCAGCAGGTCCAGGCTGCNGTGCAAGCTGCTCTGCCACTTGGGCCNTATGACCCAGCAGATCCAATGGTGCTCGAAGTGTCTGTGGCAGATAGGGATGCTGTATGGAGCCTCTGGCAAGCCCCNATAGGNGAATCACAGCGCAGACCCCTAGGATTTTGGAGCAAAGCCATGCCATCTTCTGCAGATAACTATTCTCCTTTTGAGAAACAGCTCCTGGCTTGCTACTGGGCCCTGGTAGAGACTGAACGCCTGACCATGGGNCACCAAGTNACCATGCGACCTGAGCTGCCCATCATGAACTGGGTGTTATCTGACCCACCAAGCCATAAAGTTGGGCGTGCACAGCAGCANTCCATCATCAAGTGGAAGTGGTATATACGAGATCGGGCTCGAGCAGGTCCNGAAGGCACAAGTAAGTTGCATGAGCAGGTGGCTCAGACTCCCATGGCNCCTACTCCTGCTGCATTGCCTCCTCTCCCTCAACCCGCACCTATGGCCTCATGGGGAGTTCCCTATGACCAGTTGACTGAGGAAGAAAAAACTCGGGCCTGGTTTACAGATGGTTCTGCACGATATGCTGGCACCANCCGAAAGTGGACGGCTGCAGCACTACAGCCCCACTCAGGGGTGGCCCTGAAGGACAGTGGTGAAGGGAAATCCTCCCAGTGGGCAGAACTTCGAGCAGTGCACCTGGTTGTCCACTTTGCCTGGAAGGAGAGATGGCCAGAGGTACGGATCTACACTGATTCATGGGCAGTGGCTAACGGTTTGGCTGGATGGTCAGGGACTTGGAAGGAACANGATTGGAAGATTGGTGACAAGGAGGTCTGGGGAAGAGGTATGTGGATGGACCTCTCGGAATGGGCACAGAGTGTGAAGATATTTGTGTCCCATGTGAATGCTCACCAAAGGGCANCCNCNGCAGAGGAGGNTCTCAATAATCAGGTGGACAAGATGACCCGTTCTGTGGATGTCAGTCAGCCTCTTTCCCCAGCCACCCCNGTGCTTGCTCAATGGGCTCATGAACAAAGTGGCCATGGTGGCAGGGATGGAGGCTATGCATGGGCTCAGCAACATGGACTTCCNCTCACCAAGGCTGATCTGGCTACNGCCACTGCTGAGTGCCCAACCTGCCAACAGCAGAGACCAACGCTGAGCCCCCGATATGGCACCATTCCCCGGGGGGACCAGCCAGCCACCTGGTGGCAGGTTGATTACATTGGACCNCTTCCATCATGGAAGGGGCAGCGATTTGTCCTCACTGGAATAGACACTTATTCTGGATATGGATTTGCCTTCCCTGCCCGCAATGCTTCTGCCAGNACCACCATCCGTGGACTTACAGAATGCCTTATTCACCGTCATGGTATTCCACACAGCATTGCTTCTGACCAAGGAACTCATTTTACAGCAAANGAAGTGCGGCAATGGGCTCATGCCCATGGAATTCACTGGTCTTACCACGTNCCCCATCACCCNGAAGCAGCTGGCCTGATAGAACGGTGGAATGGCCTNTTGAAGACTCAGTTACGGCGCCAGCTGGGNGACAACACCTTGNAGGGCTGGGGTNNTGTCCTNCAGGATGCGGTATATGCTCTGAATCAGCGACCAATATATGGTGCTGTTTCTCCCATAGCCAGAATNCACGGGTCCGGGAATCAAGGGGTGGAAGTGGGAGTGGCTCCTCTCACTATTACNCCTAATGACCCACTNGCAAAATTTTTGCTTCCCGTCCCCGCAACTTTGGGCTCTGCTGGTTTAGAGGTCTTAGTTCCCAAGGGAGGAATGCTTCCACCAGGGGACACAACAATGGTTCCATTGAACTGGAAGCTGAGACTGCCACCTGGCCACTTTGGGCTCCTCATGCCACTGAACCAACAGGCAAAGAAGGGAGTTACTGTACTGGCTGGGGTGATTGATCCTGATTATCAAGGGGAAATTGGGTTGCTGCTACACAATGGGGGCAAGGAGGANTATGTCTGGAATNCAGGAGATCCTCTGGGGCGCCTCTTAGTACTCCCATGTCCNGTGATAAAAGTTAATGGAAAACTACAGCAACCCAATANAGGCAGGACCGCTAANGGCTCAGACCCTTCAGGAATGAAGGTTTGGGTCACCCCACCAGGCAAAGAACCACGACCAGCTGAGGTGCTNGCTGAGGGCAAAGGGAATATGGAATGGGTAGTGGAAGAAGGAAGTTATAAATACCAGCTACGACCNCGTGACCAGTTGCAGAAACGAGGACTGTAGTAGTTATGNGTATTTCTTCCTTGCTTTGATATGAATATATTTGTGATATATATATTAACNAATATCTTTNTTTTCTTTCCTCTCTCATTCCCCTACTATCTAACATAAGATGTGTTAATAGTAGTTAACCTTATATCTCAGTATTTAAGTTACAGGATATCAAAGGGGGANTGTGACTCAGCTAGAAGAGNAATGAACATCACCCAGAGATGGATAAAGTGACNTNTGGGACTTTGTATCCTCTTTTGGGGAGAGGGTTAGCGTGTTTTCGGTTGTACGAGGGATAGTTGCATCATGTTAGGCGGAAGCATGATTTTGCTATTGTCTTTATTTGGAAGTTAAATATGGTTNAAAGAGGTGTGTATGGATGCCGAGTTGACAAGGGGTGGAC
|
TF motifs of the concenus sequence
Use FIMO to detect transcription factor motifs in the concenus sequence of the TE family.
TE_family | TFBS | Start | End | Strand | Score | Matched sequence |
---|---|---|---|---|---|---|
ERVL-E | PK19717.1 | 2999 | 3007 | - | 15.77 | GGGCCCCAC |
ERVL-E | FOS | 2663 | 2675 | - | 15.76 | CATGATGTCATCA |
ERVL-E | MAFK | 5449 | 5458 | + | 15.74 | GTGACTCAGC |
ERVL-E | SP3 | 4698 | 4708 | - | 15.73 | GCCACTCCCAC |
ERVL-E | dar1 | 4698 | 4708 | - | 15.72 | GCCACTCCCAC |
ERVL-E | SREBF2 | 5139 | 5148 | - | 15.71 | GTGGGGTGAC |
ERVL-E | OsI_08196 | 2007 | 2014 | - | 15.69 | GGGCCCAC |
ERVL-E | KLF3 | 1460 | 1469 | - | 15.68 | AACCGCGCCC |
ERVL-E | ZNF331 | 4772 | 4781 | - | 15.68 | AGCAGAGCCC |
ERVL-E | kn | 4799 | 4808 | + | 15.66 | TCCCAAGGGA |
TFBS enrichment in GRCh38
Use Fisher's exact test to perform enrichment analysis of transcription factor binding sites in the TE family of GRCh38.