ERVL-B4
Basic information Differential Expression Stage analysis Survival analysis Correlation analysisDF ID | DF0000775 |
---|---|
TE superfamily | ERV3 |
TE class | LTR |
Species | Eutheria |
Length | 5714 |
Kimura value | 19.25 |
Tau index | 0.8724 |
Description | ERVL Endogenous retrovirus, ERVL-B4 subfamily |
Comment | MLT2B4 LTRs. ORFs at roughly pos. 53-1792, 1793-5344. |
Sequence |
GATTTTGGTACCGAGAGTGGTTCTAGAGGAACAGAATTTTAAGGATGAGTTTTCTGAATTGGTTCTGGGGTTTCTGGAATTGGCTCTCTAATCTGATTAGATTTAAAGACGCTAATGACTCTATTTCCAGTAGTAAAGAGAGCACTGATAGTCCATGGCGTGATCTGGCAATAGAGATACGCAAAATATCNCCATTGGATACTCCTAATCAACCACTTATAAGAAGCAAGGANCTGGGTGACTNTGTATATGATACTTTCGAACATTTTTGGNAAACTAACGAATATAATGAGATTGGCTGGTTGCTCCTAATGTCGCTGGACAAAGTGGNGAAAGAAAAGGATGAGCTCAGGGATTCGAATTCCCAGCTCAAGCGCCGCATAAATGACCTGAAAGCTTCTATGTGTGCCCTGAAGGAGACCCTTATCTCCTGTAGCCGCAGGGCTGAGATTGCTGAAAATCAAACGCAGAATCTCATCCTGCGACTGGCTGAATTACAACGCAAGTTGAACTCCCAGCCTCGCAGGGTGTCTACTGTTAAAGTGAGGGCATTGATTGGGAAAGAATGGGATCCTGNAAGTTGGAATGGGGACGTGTGGGAAGACCCTGATGAAGCTGGGGACATTGAGCCCCTAAATTCTGATGAGTCTTCTTTGCCAGTGGAAGNGGCCTCCCCACCCCCAGTGGAAGCGGCCTCCCCACCCCCAGTGGTAGCGGCCTCTCCACCCCCGTCTGAGGGGATTAACCCTGCATTGCCTGAGGAAACTGTAATGGCCTCCCCTGAGGCAGTTGCCATGCAAGACAATGCTGATTCTCCTCAGGACCCACCCCCACCACCCCTCTTTGCTTCTAGACCTATAACTAGACTCAAGTCCCAGCAGGCCCCTAAAGGTGAGGTACAAAGTGTGACCCATGAGGAGGTGCGCTACACTCCAAAAGAACTACTTGAGTTTTCTAATTTATACAGACAGAAATCCGGGGAACATGTGTGGGAATGGATATTAAGGGTGTGGGATAATGGTGGAAGGAACATAAAGTTGGATCAGGCCGAATTTATTGATATGGGCCCACTAAGCAGAGATTCTGCATTTAATGTTGCAGCTCGGGGAGTTAGAAAGGGCTCTAACAGTTTGTTTGGTTGGTTGGCTGAAACATGGACCAAAAGGTGGCCCACAGTGAGCGAATTGGAAATGCCGGACCTGCCTTGGTTTAATGTAGAGGAAGGGATTCAAAGGCTTAGGGAGATTGGAATGTTAGAGTGGATTTGTCATTTAAGACCTACTCACCCACACTGGGAGGGTCCAGAAGACATACCTTTCACCANNACTGTGAGAAATAAATTTGTGAGGGGAGCCCCAGCATCCTTGAAGAGCTCTGTGATCGCTCTTCTCTGTAGGCCAGACCTTACAGTGGGAACTGCAGCCACTGAATTGGGAAACCTAAATGCAATGGGAGTAATTGGATCCCGGGGTGGCAGGGGCCAAGTGGCGGCACTCAACCGCCAAAGGCAAGGTGGGCGTGGTTACCGTAATGGACAGCAGAGTCAAAGCAGCAATCAGAATAGTCTGACTCGCGCAGACCTATGGCGTTGGCTAGTTGATCATGGTGTTCCTAGAAGTGAAATAGATAGGAAGCCTACTAAATTCTTACTTGATCTGTATAAGCAGAAAAGTTCTAGGTCAAGTGAACAAAAGTCTAACTTGAATCATAAAAACAGAGAGTCACGGCCCCTCAATCAATTCCCAGACTTGAGCCAGTTTACAGACCCAGAACCCCTTGAATGAAGGGGAGGCCGGGTCCCCTTGAGGAAGGACCCCGGTACACTGCCAAAAATTTATACTGTTAATCTTTCTCCCAGCCTTCCCCAAAGGGACCTACGGCCTTTTACCAGGGTAACTGTGCATTGGGGAAAAGGAAATAATCAGACCTTTCGGGGACTACTGGACACTGGCTCTGAACTGACACTAATTCCAGGAGACCCAAAACGTCACTGTGGTCCACCAGTCAGAGTAGGGGCTTATGGAGGTCAGGTGATCAATGGAGTTTTAGCTCAGGTCCGTCTCACAGTGGGCCCAGTGGGTCCCCGAACCCATCCTGTGGTTATTTCCCCAGTTCCGGAATGCATAATTGGAATAGACATACTCAGCAGCTGGCAGAATCCCCACATTGGTTCCCTGACCTGTGGAGTGAGGGCTATTATGGTGGGAAAGGCCAAGTGGAAGCCACTAGAACTGCCTCTACCTAGGAAAATAGTAAACCAAAAGCAATACCGCATTCCTGGAGGGATTGCAGAGATTAGTGCCACCATCAAGGACTTGAAAGATGCAGGGGTGGTGATTCCCACCACATCCCCATTCAACTCGCCTATTTGGCCTGTGCAGAAGACAGATGGATCTTGGAGAATGACAGTGGATTATCGTAAGCTTAACCAGGTGGTGACTCCAATTGCAGCTGCTGTACCAGATGTGGTTTCATTGCTTGAGCAAATTAACACATCCCCTGGTACCTGGTATGCAGCTATTGATCTGGCAAATGCCTTTTTCTCCATACCTGTCAATAAGGNCCACCAGAAGCAGTTTGCTTTCAGCTGGCAAGGCCAGCAATACACCTTCACTGTCCTACCTCAGGGGTATATCAACTCTCCAGCCCTATGTCATAATTTAGTTCGCAGGGATCTTGATCGCCTTTCCCTTCCACAAGATATCACACTGGTCCATTACATTGATGACATTATGCTGATTGGACCTAGTGAGCAAGAAGTAGCAACTACTCTAGACTTATTGGTAAGACATTTGCGTGTCAGAGGGTGGGAAATAAATCCGACAAAAATTCAGGGGCCTTCTACCTCAGTGAAATTTCTAGGGGTCCAGTGGTGTGGGGCATGTCGAGATATCCCTTCTAAGGTGAAGGATAAGTTGTTGCATCTGGCCCCTCCTACAACCAAAAAAGAGGCACAATGCCTAGTGGGCCTCTTTGGATTTTGGAGGCAACATATTCCTCATTTGGGTGTGTTACTCCGGCCCATTTACCGAGTGACCCGAAAAGCTGCTAGTTTTGAGTGGGGCCCAGAACAAGAGAAGGCTCTGCAACAGGTCCAGGCTGCTGTGCAAGCTGCTCTGCCACTTGGGCCATATGATCCAGCAGATCCAATGGTGCTTGAAGTGTCAGTGGCAGATAGGGATGCTGTTTGGAGCCTTTGGCAGGCCCCTATAGGTGAATCGCAGCGCAGGCCCTTAGGATTTTGGAGCAAAGCCCTGCCATCCTCTGCAGATAACTACTCTCCTTTTGAGAAACAGCTCTTGGCCTGCTACTGGGCCTTAGTAGAGACTGAACGCTTAACCATGGGCCACCAAGTTACCATGCGACCTGAGCTGCCCATCATGAACTGGGTGTTATCTGACCCACCAAGCCATAAAGTTGGGCGTGCACAGCAGCACTCCATCATCAAATGGAAGTGGTATATACGTGATCGGGCCCGAGCAGGCCCTGAAGGCACAAGTAAGTTACATGAAGAAGTGGCCCAAATGCCCATGGTCCCCACTCCTGCTACACTGCCTTCTCTCTCCCAGCCTGCACCTATGGCCTCATGGGGAGTTCCCTACGATCAGTTGACAGAGGAAGAGAAGACTCGGGCCTGGTTTACAGATGGTTCTGCACGATATGCAGGCACCACCCGAAAGTGGACAGCTGCAGCACTACAGCCCCTTTCTGGGACATCCCTGAAGGACAGTGGTGAAGGGAAATCCTCCCAGTGGGCAGAACTTCGAGCAGTGCACCTGGTTGTTCACTTTGCTTGGAAGGAGAAATGGCCAGACGTGCGATTATATACCGATTCATGGGCTGTGGCCAATGGTTTGGCTGGATGGTCAGGGACTTGGAAGGAACATGATTGGAAAATTGGTGACAAGGAAATTTGGGGAAGAGGTATGTGGATAGACCTCTCTGAATGGGCAAAAAACGTGAAGATATTTGTGTCCCATGTGAATGCTCACCAAAGGGTGACCTCAGCAGAGGAGGATTTTAATAATCAAGTGGATAGGATGACCCGTTCTGTGGATACCAGTCAGCCTCTTTCCCCAGCCACCCCTGTCATCGCCCAATGGGCTCATGAACAAAGTGGCCATGGTGGCAGGGATGGAGGTTATGCATGGGCTCAGCAACATGGACTTCCACTCACCAAGGCCGACCTGGCTACGGCCACCGCTGAGTGCCCAATCTGCCAGCAGCAGAGACCAACACTGAGTCCCCGATATGGCACCATTCCCCGGGGTGATCAGCCAGCTACCTGGTGGCAGGTTGATTACATTGGACCGCTTCCATCATGGAAGGGGCAGCGTTTTGTTCTTACTGGAATAGACACTTACTCTGGATACGGATTTGCCTTCCCTGCACGCAATGCTTCTGCCAAAACTACCATCCGTGGACTTACAGAATGCCTTATCCACCGTCATGGTATTCCACACAGCATTGCTTCTGATCAAGGAACTCACTTCACAGCAAANGAAGTGCGGCAATGGGCCCATGCTCATGGAATTCACTGGTCTTACCATGTTCCCCACCATCCTGAAGCAGCTGGCTTGATAGAACGGTGGAATGGCCTTTTGAAGACTCAGTTACAGCGCCAGCTAGGTGGCAATACCTTGCAGGGCTGGGGCAAGGTTCTCCAGAAGGCTGTATATGCTCTGAATCAGCGTCCAATATATGGTGCTGTTTCTCCCATAGCCAGGATTCACGGGTCCAGGAATCAAGGGGTGGAAATGGGAGTGGCACCACTCACTATTACCCCTAGTGACCCACTAGCAAAATTTTTGCTTCCTGTTCCCGCGACCTTATGCTCTGCTGGCCTAGAGGTCTTAGTTCCAAAGGGAGGAATGCTTCCACCAGGAGACACAACAATGATTCCATTGAACTGGAAGTTAAGACTGCCACCCGGCCACTTTGGGCTCCTCATGCCTCTGAATCAACAGGCAAAGAAGGGAGTTACTGTGCTGGCTGGGGTGATTGATCCTGACTACCAAGGGGAAATTGGACTGCTACTCCACAATGGAGGTAAGGAAGAGTATGTCTGGAATACAGGAGATCCCTTAGGGCGTCTCTTAGTATTACCATGCCCTGTGATTAAGGTCAATGGAAAACTACAACAACCCAATCCAGGCAGGACTACTAATGGCCCAGACCCTTCAGGAATGAAGGTTTGGGTCACCCCACCAGGTAAAGAACCACGACCAGCTGAGGTGCTTGCTGAAGGCAAAGGGAATACGGAATGGGTAGTGGAAGAAGGTAGTTATAAATACCAGCTACGACCACGTGACCAGTTACAGAAACGAGGACTGTAATTGTCATGAGTATTTCCTCCTTATTTTGTTATGAATATGTTTGTGTGTATATATACATATATTAAGCAAATATCTTTGTTTTCTTTCCTCTCTTATTCCCTTATCATGTAACATAAGATGTATTGACTTTATATCATAGTATTTAAGTATTGTTAATTTTACATCATAGTATTTAAGTTACGGGATATCAAGGAGAAGAGTAAACATCACTCAAGGACTTTACCTCCTCTTCTGGGGAAGGGGTTAGTGCGTTTTCGGTTGTACGCAGGATAGTTGTATCATGTTAGGCGGAATTATGACCTTGTTATTGTCTTTATTTGGAGATTAAGTATGGTTTAAGGAGATGCGTATGGGTGCCAAGTTGACAAGGGGTGGACT
|
TF motifs of the concenus sequence
Use FIMO to detect transcription factor motifs in the concenus sequence of the TE family.
TE_family | TFBS | Start | End | Strand | Score | Matched sequence |
---|---|---|---|---|---|---|
ERVL-B4 | ZNF331 | 3079 | 3088 | - | 16.05 | TGCAGAGCCT |
ERVL-B4 | ZNF75D | 2202 | 2213 | + | 16.03 | GTGGGAAAGGCC |
ERVL-B4 | KLF16 | 1513 | 1523 | - | 16.02 | ACCACGCCCAC |
ERVL-B4 | KLF13 | 1508 | 1524 | - | 15.93 | AACCACGCCCACCTTGC |
ERVL-B4 | REF6 | 1711 | 1719 | + | 15.93 | AAAACAGAG |
ERVL-B4 | ARF39 | 978 | 988 | + | 15.89 | GGGGAACATGT |
ERVL-B4 | klu | 825 | 835 | + | 15.89 | CCACCCCCACC |
ERVL-B4 | TFEC | 5305 | 5312 | + | 15.88 | CACGTGAC |
ERVL-B4 | Spps | 1513 | 1523 | - | 15.82 | ACCACGCCCAC |
ERVL-B4 | PK19717.1 | 3059 | 3067 | - | 15.77 | GGGCCCCAC |
TFBS enrichment in GRCh38
Use Fisher's exact test to perform enrichment analysis of transcription factor binding sites in the TE family of GRCh38.