ERVL-B4
Basic information Differential Expression Stage analysis Survival analysis Correlation analysisDF ID | DF0000775 |
---|---|
TE superfamily | ERV3 |
TE class | LTR |
Species | Eutheria |
Length | 5714 |
Kimura value | 19.25 |
Tau index | 0.8724 |
Description | ERVL Endogenous retrovirus, ERVL-B4 subfamily |
Comment | MLT2B4 LTRs. ORFs at roughly pos. 53-1792, 1793-5344. |
Sequence |
GATTTTGGTACCGAGAGTGGTTCTAGAGGAACAGAATTTTAAGGATGAGTTTTCTGAATTGGTTCTGGGGTTTCTGGAATTGGCTCTCTAATCTGATTAGATTTAAAGACGCTAATGACTCTATTTCCAGTAGTAAAGAGAGCACTGATAGTCCATGGCGTGATCTGGCAATAGAGATACGCAAAATATCNCCATTGGATACTCCTAATCAACCACTTATAAGAAGCAAGGANCTGGGTGACTNTGTATATGATACTTTCGAACATTTTTGGNAAACTAACGAATATAATGAGATTGGCTGGTTGCTCCTAATGTCGCTGGACAAAGTGGNGAAAGAAAAGGATGAGCTCAGGGATTCGAATTCCCAGCTCAAGCGCCGCATAAATGACCTGAAAGCTTCTATGTGTGCCCTGAAGGAGACCCTTATCTCCTGTAGCCGCAGGGCTGAGATTGCTGAAAATCAAACGCAGAATCTCATCCTGCGACTGGCTGAATTACAACGCAAGTTGAACTCCCAGCCTCGCAGGGTGTCTACTGTTAAAGTGAGGGCATTGATTGGGAAAGAATGGGATCCTGNAAGTTGGAATGGGGACGTGTGGGAAGACCCTGATGAAGCTGGGGACATTGAGCCCCTAAATTCTGATGAGTCTTCTTTGCCAGTGGAAGNGGCCTCCCCACCCCCAGTGGAAGCGGCCTCCCCACCCCCAGTGGTAGCGGCCTCTCCACCCCCGTCTGAGGGGATTAACCCTGCATTGCCTGAGGAAACTGTAATGGCCTCCCCTGAGGCAGTTGCCATGCAAGACAATGCTGATTCTCCTCAGGACCCACCCCCACCACCCCTCTTTGCTTCTAGACCTATAACTAGACTCAAGTCCCAGCAGGCCCCTAAAGGTGAGGTACAAAGTGTGACCCATGAGGAGGTGCGCTACACTCCAAAAGAACTACTTGAGTTTTCTAATTTATACAGACAGAAATCCGGGGAACATGTGTGGGAATGGATATTAAGGGTGTGGGATAATGGTGGAAGGAACATAAAGTTGGATCAGGCCGAATTTATTGATATGGGCCCACTAAGCAGAGATTCTGCATTTAATGTTGCAGCTCGGGGAGTTAGAAAGGGCTCTAACAGTTTGTTTGGTTGGTTGGCTGAAACATGGACCAAAAGGTGGCCCACAGTGAGCGAATTGGAAATGCCGGACCTGCCTTGGTTTAATGTAGAGGAAGGGATTCAAAGGCTTAGGGAGATTGGAATGTTAGAGTGGATTTGTCATTTAAGACCTACTCACCCACACTGGGAGGGTCCAGAAGACATACCTTTCACCANNACTGTGAGAAATAAATTTGTGAGGGGAGCCCCAGCATCCTTGAAGAGCTCTGTGATCGCTCTTCTCTGTAGGCCAGACCTTACAGTGGGAACTGCAGCCACTGAATTGGGAAACCTAAATGCAATGGGAGTAATTGGATCCCGGGGTGGCAGGGGCCAAGTGGCGGCACTCAACCGCCAAAGGCAAGGTGGGCGTGGTTACCGTAATGGACAGCAGAGTCAAAGCAGCAATCAGAATAGTCTGACTCGCGCAGACCTATGGCGTTGGCTAGTTGATCATGGTGTTCCTAGAAGTGAAATAGATAGGAAGCCTACTAAATTCTTACTTGATCTGTATAAGCAGAAAAGTTCTAGGTCAAGTGAACAAAAGTCTAACTTGAATCATAAAAACAGAGAGTCACGGCCCCTCAATCAATTCCCAGACTTGAGCCAGTTTACAGACCCAGAACCCCTTGAATGAAGGGGAGGCCGGGTCCCCTTGAGGAAGGACCCCGGTACACTGCCAAAAATTTATACTGTTAATCTTTCTCCCAGCCTTCCCCAAAGGGACCTACGGCCTTTTACCAGGGTAACTGTGCATTGGGGAAAAGGAAATAATCAGACCTTTCGGGGACTACTGGACACTGGCTCTGAACTGACACTAATTCCAGGAGACCCAAAACGTCACTGTGGTCCACCAGTCAGAGTAGGGGCTTATGGAGGTCAGGTGATCAATGGAGTTTTAGCTCAGGTCCGTCTCACAGTGGGCCCAGTGGGTCCCCGAACCCATCCTGTGGTTATTTCCCCAGTTCCGGAATGCATAATTGGAATAGACATACTCAGCAGCTGGCAGAATCCCCACATTGGTTCCCTGACCTGTGGAGTGAGGGCTATTATGGTGGGAAAGGCCAAGTGGAAGCCACTAGAACTGCCTCTACCTAGGAAAATAGTAAACCAAAAGCAATACCGCATTCCTGGAGGGATTGCAGAGATTAGTGCCACCATCAAGGACTTGAAAGATGCAGGGGTGGTGATTCCCACCACATCCCCATTCAACTCGCCTATTTGGCCTGTGCAGAAGACAGATGGATCTTGGAGAATGACAGTGGATTATCGTAAGCTTAACCAGGTGGTGACTCCAATTGCAGCTGCTGTACCAGATGTGGTTTCATTGCTTGAGCAAATTAACACATCCCCTGGTACCTGGTATGCAGCTATTGATCTGGCAAATGCCTTTTTCTCCATACCTGTCAATAAGGNCCACCAGAAGCAGTTTGCTTTCAGCTGGCAAGGCCAGCAATACACCTTCACTGTCCTACCTCAGGGGTATATCAACTCTCCAGCCCTATGTCATAATTTAGTTCGCAGGGATCTTGATCGCCTTTCCCTTCCACAAGATATCACACTGGTCCATTACATTGATGACATTATGCTGATTGGACCTAGTGAGCAAGAAGTAGCAACTACTCTAGACTTATTGGTAAGACATTTGCGTGTCAGAGGGTGGGAAATAAATCCGACAAAAATTCAGGGGCCTTCTACCTCAGTGAAATTTCTAGGGGTCCAGTGGTGTGGGGCATGTCGAGATATCCCTTCTAAGGTGAAGGATAAGTTGTTGCATCTGGCCCCTCCTACAACCAAAAAAGAGGCACAATGCCTAGTGGGCCTCTTTGGATTTTGGAGGCAACATATTCCTCATTTGGGTGTGTTACTCCGGCCCATTTACCGAGTGACCCGAAAAGCTGCTAGTTTTGAGTGGGGCCCAGAACAAGAGAAGGCTCTGCAACAGGTCCAGGCTGCTGTGCAAGCTGCTCTGCCACTTGGGCCATATGATCCAGCAGATCCAATGGTGCTTGAAGTGTCAGTGGCAGATAGGGATGCTGTTTGGAGCCTTTGGCAGGCCCCTATAGGTGAATCGCAGCGCAGGCCCTTAGGATTTTGGAGCAAAGCCCTGCCATCCTCTGCAGATAACTACTCTCCTTTTGAGAAACAGCTCTTGGCCTGCTACTGGGCCTTAGTAGAGACTGAACGCTTAACCATGGGCCACCAAGTTACCATGCGACCTGAGCTGCCCATCATGAACTGGGTGTTATCTGACCCACCAAGCCATAAAGTTGGGCGTGCACAGCAGCACTCCATCATCAAATGGAAGTGGTATATACGTGATCGGGCCCGAGCAGGCCCTGAAGGCACAAGTAAGTTACATGAAGAAGTGGCCCAAATGCCCATGGTCCCCACTCCTGCTACACTGCCTTCTCTCTCCCAGCCTGCACCTATGGCCTCATGGGGAGTTCCCTACGATCAGTTGACAGAGGAAGAGAAGACTCGGGCCTGGTTTACAGATGGTTCTGCACGATATGCAGGCACCACCCGAAAGTGGACAGCTGCAGCACTACAGCCCCTTTCTGGGACATCCCTGAAGGACAGTGGTGAAGGGAAATCCTCCCAGTGGGCAGAACTTCGAGCAGTGCACCTGGTTGTTCACTTTGCTTGGAAGGAGAAATGGCCAGACGTGCGATTATATACCGATTCATGGGCTGTGGCCAATGGTTTGGCTGGATGGTCAGGGACTTGGAAGGAACATGATTGGAAAATTGGTGACAAGGAAATTTGGGGAAGAGGTATGTGGATAGACCTCTCTGAATGGGCAAAAAACGTGAAGATATTTGTGTCCCATGTGAATGCTCACCAAAGGGTGACCTCAGCAGAGGAGGATTTTAATAATCAAGTGGATAGGATGACCCGTTCTGTGGATACCAGTCAGCCTCTTTCCCCAGCCACCCCTGTCATCGCCCAATGGGCTCATGAACAAAGTGGCCATGGTGGCAGGGATGGAGGTTATGCATGGGCTCAGCAACATGGACTTCCACTCACCAAGGCCGACCTGGCTACGGCCACCGCTGAGTGCCCAATCTGCCAGCAGCAGAGACCAACACTGAGTCCCCGATATGGCACCATTCCCCGGGGTGATCAGCCAGCTACCTGGTGGCAGGTTGATTACATTGGACCGCTTCCATCATGGAAGGGGCAGCGTTTTGTTCTTACTGGAATAGACACTTACTCTGGATACGGATTTGCCTTCCCTGCACGCAATGCTTCTGCCAAAACTACCATCCGTGGACTTACAGAATGCCTTATCCACCGTCATGGTATTCCACACAGCATTGCTTCTGATCAAGGAACTCACTTCACAGCAAANGAAGTGCGGCAATGGGCCCATGCTCATGGAATTCACTGGTCTTACCATGTTCCCCACCATCCTGAAGCAGCTGGCTTGATAGAACGGTGGAATGGCCTTTTGAAGACTCAGTTACAGCGCCAGCTAGGTGGCAATACCTTGCAGGGCTGGGGCAAGGTTCTCCAGAAGGCTGTATATGCTCTGAATCAGCGTCCAATATATGGTGCTGTTTCTCCCATAGCCAGGATTCACGGGTCCAGGAATCAAGGGGTGGAAATGGGAGTGGCACCACTCACTATTACCCCTAGTGACCCACTAGCAAAATTTTTGCTTCCTGTTCCCGCGACCTTATGCTCTGCTGGCCTAGAGGTCTTAGTTCCAAAGGGAGGAATGCTTCCACCAGGAGACACAACAATGATTCCATTGAACTGGAAGTTAAGACTGCCACCCGGCCACTTTGGGCTCCTCATGCCTCTGAATCAACAGGCAAAGAAGGGAGTTACTGTGCTGGCTGGGGTGATTGATCCTGACTACCAAGGGGAAATTGGACTGCTACTCCACAATGGAGGTAAGGAAGAGTATGTCTGGAATACAGGAGATCCCTTAGGGCGTCTCTTAGTATTACCATGCCCTGTGATTAAGGTCAATGGAAAACTACAACAACCCAATCCAGGCAGGACTACTAATGGCCCAGACCCTTCAGGAATGAAGGTTTGGGTCACCCCACCAGGTAAAGAACCACGACCAGCTGAGGTGCTTGCTGAAGGCAAAGGGAATACGGAATGGGTAGTGGAAGAAGGTAGTTATAAATACCAGCTACGACCACGTGACCAGTTACAGAAACGAGGACTGTAATTGTCATGAGTATTTCCTCCTTATTTTGTTATGAATATGTTTGTGTGTATATATACATATATTAAGCAAATATCTTTGTTTTCTTTCCTCTCTTATTCCCTTATCATGTAACATAAGATGTATTGACTTTATATCATAGTATTTAAGTATTGTTAATTTTACATCATAGTATTTAAGTTACGGGATATCAAGGAGAAGAGTAAACATCACTCAAGGACTTTACCTCCTCTTCTGGGGAAGGGGTTAGTGCGTTTTCGGTTGTACGCAGGATAGTTGTATCATGTTAGGCGGAATTATGACCTTGTTATTGTCTTTATTTGGAGATTAAGTATGGTTTAAGGAGATGCGTATGGGTGCCAAGTTGACAAGGGGTGGACT
|
TF motifs of the concenus sequence
Use FIMO to detect transcription factor motifs in the concenus sequence of the TE family.
TE_family | TFBS | Start | End | Strand | Score | Matched sequence |
---|---|---|---|---|---|---|
ERVL-B4 | ZNF157 | 2594 | 2614 | + | 33.91 | AGGCCAGCAATACACCTTCAC |
ERVL-B4 | ZNF707 | 3536 | 3550 | + | 21.75 | CCCCACTCCTGCTAC |
ERVL-B4 | ZNF320 | 3522 | 3541 | - | 19.81 | GTGGGGACCATGGGCATTTG |
ERVL-B4 | Klf15 | 819 | 840 | + | 19.08 | CAGGACCCACCCCCACCACCCC |
ERVL-B4 | Klf15 | 817 | 838 | + | 18.58 | CTCAGGACCCACCCCCACCACC |
ERVL-B4 | SP3 | 1513 | 1523 | - | 18.38 | ACCACGCCCAC |
ERVL-B4 | KLF11 | 1513 | 1522 | - | 17.79 | CCACGCCCAC |
ERVL-B4 | SP8 | 1512 | 1522 | - | 17.55 | CCACGCCCACC |
ERVL-B4 | TCP7 | 3059 | 3069 | + | 17.20 | GTGGGGCCCAG |
ERVL-B4 | ZNF281 | 673 | 682 | - | 17.16 | GGGGGTGGGG |
TFBS enrichment in GRCh38
Use Fisher's exact test to perform enrichment analysis of transcription factor binding sites in the TE family of GRCh38.