ERVL-B4

Basic information Differential Expression Stage analysis Survival analysis Correlation analysis

DF ID DF0000775
TE superfamily ERV3
TE class LTR
Species Eutheria
Length 5714
Kimura value 19.25
Tau index 0.8724
Description ERVL Endogenous retrovirus, ERVL-B4 subfamily
Comment MLT2B4 LTRs. ORFs at roughly pos. 53-1792, 1793-5344.
Sequence
GATTTTGGTACCGAGAGTGGTTCTAGAGGAACAGAATTTTAAGGATGAGTTTTCTGAATTGGTTCTGGGGTTTCTGGAATTGGCTCTCTAATCTGATTAGATTTAAAGACGCTAATGACTCTATTTCCAGTAGTAAAGAGAGCACTGATAGTCCATGGCGTGATCTGGCAATAGAGATACGCAAAATATCNCCATTGGATACTCCTAATCAACCACTTATAAGAAGCAAGGANCTGGGTGACTNTGTATATGATACTTTCGAACATTTTTGGNAAACTAACGAATATAATGAGATTGGCTGGTTGCTCCTAATGTCGCTGGACAAAGTGGNGAAAGAAAAGGATGAGCTCAGGGATTCGAATTCCCAGCTCAAGCGCCGCATAAATGACCTGAAAGCTTCTATGTGTGCCCTGAAGGAGACCCTTATCTCCTGTAGCCGCAGGGCTGAGATTGCTGAAAATCAAACGCAGAATCTCATCCTGCGACTGGCTGAATTACAACGCAAGTTGAACTCCCAGCCTCGCAGGGTGTCTACTGTTAAAGTGAGGGCATTGATTGGGAAAGAATGGGATCCTGNAAGTTGGAATGGGGACGTGTGGGAAGACCCTGATGAAGCTGGGGACATTGAGCCCCTAAATTCTGATGAGTCTTCTTTGCCAGTGGAAGNGGCCTCCCCACCCCCAGTGGAAGCGGCCTCCCCACCCCCAGTGGTAGCGGCCTCTCCACCCCCGTCTGAGGGGATTAACCCTGCATTGCCTGAGGAAACTGTAATGGCCTCCCCTGAGGCAGTTGCCATGCAAGACAATGCTGATTCTCCTCAGGACCCACCCCCACCACCCCTCTTTGCTTCTAGACCTATAACTAGACTCAAGTCCCAGCAGGCCCCTAAAGGTGAGGTACAAAGTGTGACCCATGAGGAGGTGCGCTACACTCCAAAAGAACTACTTGAGTTTTCTAATTTATACAGACAGAAATCCGGGGAACATGTGTGGGAATGGATATTAAGGGTGTGGGATAATGGTGGAAGGAACATAAAGTTGGATCAGGCCGAATTTATTGATATGGGCCCACTAAGCAGAGATTCTGCATTTAATGTTGCAGCTCGGGGAGTTAGAAAGGGCTCTAACAGTTTGTTTGGTTGGTTGGCTGAAACATGGACCAAAAGGTGGCCCACAGTGAGCGAATTGGAAATGCCGGACCTGCCTTGGTTTAATGTAGAGGAAGGGATTCAAAGGCTTAGGGAGATTGGAATGTTAGAGTGGATTTGTCATTTAAGACCTACTCACCCACACTGGGAGGGTCCAGAAGACATACCTTTCACCANNACTGTGAGAAATAAATTTGTGAGGGGAGCCCCAGCATCCTTGAAGAGCTCTGTGATCGCTCTTCTCTGTAGGCCAGACCTTACAGTGGGAACTGCAGCCACTGAATTGGGAAACCTAAATGCAATGGGAGTAATTGGATCCCGGGGTGGCAGGGGCCAAGTGGCGGCACTCAACCGCCAAAGGCAAGGTGGGCGTGGTTACCGTAATGGACAGCAGAGTCAAAGCAGCAATCAGAATAGTCTGACTCGCGCAGACCTATGGCGTTGGCTAGTTGATCATGGTGTTCCTAGAAGTGAAATAGATAGGAAGCCTACTAAATTCTTACTTGATCTGTATAAGCAGAAAAGTTCTAGGTCAAGTGAACAAAAGTCTAACTTGAATCATAAAAACAGAGAGTCACGGCCCCTCAATCAATTCCCAGACTTGAGCCAGTTTACAGACCCAGAACCCCTTGAATGAAGGGGAGGCCGGGTCCCCTTGAGGAAGGACCCCGGTACACTGCCAAAAATTTATACTGTTAATCTTTCTCCCAGCCTTCCCCAAAGGGACCTACGGCCTTTTACCAGGGTAACTGTGCATTGGGGAAAAGGAAATAATCAGACCTTTCGGGGACTACTGGACACTGGCTCTGAACTGACACTAATTCCAGGAGACCCAAAACGTCACTGTGGTCCACCAGTCAGAGTAGGGGCTTATGGAGGTCAGGTGATCAATGGAGTTTTAGCTCAGGTCCGTCTCACAGTGGGCCCAGTGGGTCCCCGAACCCATCCTGTGGTTATTTCCCCAGTTCCGGAATGCATAATTGGAATAGACATACTCAGCAGCTGGCAGAATCCCCACATTGGTTCCCTGACCTGTGGAGTGAGGGCTATTATGGTGGGAAAGGCCAAGTGGAAGCCACTAGAACTGCCTCTACCTAGGAAAATAGTAAACCAAAAGCAATACCGCATTCCTGGAGGGATTGCAGAGATTAGTGCCACCATCAAGGACTTGAAAGATGCAGGGGTGGTGATTCCCACCACATCCCCATTCAACTCGCCTATTTGGCCTGTGCAGAAGACAGATGGATCTTGGAGAATGACAGTGGATTATCGTAAGCTTAACCAGGTGGTGACTCCAATTGCAGCTGCTGTACCAGATGTGGTTTCATTGCTTGAGCAAATTAACACATCCCCTGGTACCTGGTATGCAGCTATTGATCTGGCAAATGCCTTTTTCTCCATACCTGTCAATAAGGNCCACCAGAAGCAGTTTGCTTTCAGCTGGCAAGGCCAGCAATACACCTTCACTGTCCTACCTCAGGGGTATATCAACTCTCCAGCCCTATGTCATAATTTAGTTCGCAGGGATCTTGATCGCCTTTCCCTTCCACAAGATATCACACTGGTCCATTACATTGATGACATTATGCTGATTGGACCTAGTGAGCAAGAAGTAGCAACTACTCTAGACTTATTGGTAAGACATTTGCGTGTCAGAGGGTGGGAAATAAATCCGACAAAAATTCAGGGGCCTTCTACCTCAGTGAAATTTCTAGGGGTCCAGTGGTGTGGGGCATGTCGAGATATCCCTTCTAAGGTGAAGGATAAGTTGTTGCATCTGGCCCCTCCTACAACCAAAAAAGAGGCACAATGCCTAGTGGGCCTCTTTGGATTTTGGAGGCAACATATTCCTCATTTGGGTGTGTTACTCCGGCCCATTTACCGAGTGACCCGAAAAGCTGCTAGTTTTGAGTGGGGCCCAGAACAAGAGAAGGCTCTGCAACAGGTCCAGGCTGCTGTGCAAGCTGCTCTGCCACTTGGGCCATATGATCCAGCAGATCCAATGGTGCTTGAAGTGTCAGTGGCAGATAGGGATGCTGTTTGGAGCCTTTGGCAGGCCCCTATAGGTGAATCGCAGCGCAGGCCCTTAGGATTTTGGAGCAAAGCCCTGCCATCCTCTGCAGATAACTACTCTCCTTTTGAGAAACAGCTCTTGGCCTGCTACTGGGCCTTAGTAGAGACTGAACGCTTAACCATGGGCCACCAAGTTACCATGCGACCTGAGCTGCCCATCATGAACTGGGTGTTATCTGACCCACCAAGCCATAAAGTTGGGCGTGCACAGCAGCACTCCATCATCAAATGGAAGTGGTATATACGTGATCGGGCCCGAGCAGGCCCTGAAGGCACAAGTAAGTTACATGAAGAAGTGGCCCAAATGCCCATGGTCCCCACTCCTGCTACACTGCCTTCTCTCTCCCAGCCTGCACCTATGGCCTCATGGGGAGTTCCCTACGATCAGTTGACAGAGGAAGAGAAGACTCGGGCCTGGTTTACAGATGGTTCTGCACGATATGCAGGCACCACCCGAAAGTGGACAGCTGCAGCACTACAGCCCCTTTCTGGGACATCCCTGAAGGACAGTGGTGAAGGGAAATCCTCCCAGTGGGCAGAACTTCGAGCAGTGCACCTGGTTGTTCACTTTGCTTGGAAGGAGAAATGGCCAGACGTGCGATTATATACCGATTCATGGGCTGTGGCCAATGGTTTGGCTGGATGGTCAGGGACTTGGAAGGAACATGATTGGAAAATTGGTGACAAGGAAATTTGGGGAAGAGGTATGTGGATAGACCTCTCTGAATGGGCAAAAAACGTGAAGATATTTGTGTCCCATGTGAATGCTCACCAAAGGGTGACCTCAGCAGAGGAGGATTTTAATAATCAAGTGGATAGGATGACCCGTTCTGTGGATACCAGTCAGCCTCTTTCCCCAGCCACCCCTGTCATCGCCCAATGGGCTCATGAACAAAGTGGCCATGGTGGCAGGGATGGAGGTTATGCATGGGCTCAGCAACATGGACTTCCACTCACCAAGGCCGACCTGGCTACGGCCACCGCTGAGTGCCCAATCTGCCAGCAGCAGAGACCAACACTGAGTCCCCGATATGGCACCATTCCCCGGGGTGATCAGCCAGCTACCTGGTGGCAGGTTGATTACATTGGACCGCTTCCATCATGGAAGGGGCAGCGTTTTGTTCTTACTGGAATAGACACTTACTCTGGATACGGATTTGCCTTCCCTGCACGCAATGCTTCTGCCAAAACTACCATCCGTGGACTTACAGAATGCCTTATCCACCGTCATGGTATTCCACACAGCATTGCTTCTGATCAAGGAACTCACTTCACAGCAAANGAAGTGCGGCAATGGGCCCATGCTCATGGAATTCACTGGTCTTACCATGTTCCCCACCATCCTGAAGCAGCTGGCTTGATAGAACGGTGGAATGGCCTTTTGAAGACTCAGTTACAGCGCCAGCTAGGTGGCAATACCTTGCAGGGCTGGGGCAAGGTTCTCCAGAAGGCTGTATATGCTCTGAATCAGCGTCCAATATATGGTGCTGTTTCTCCCATAGCCAGGATTCACGGGTCCAGGAATCAAGGGGTGGAAATGGGAGTGGCACCACTCACTATTACCCCTAGTGACCCACTAGCAAAATTTTTGCTTCCTGTTCCCGCGACCTTATGCTCTGCTGGCCTAGAGGTCTTAGTTCCAAAGGGAGGAATGCTTCCACCAGGAGACACAACAATGATTCCATTGAACTGGAAGTTAAGACTGCCACCCGGCCACTTTGGGCTCCTCATGCCTCTGAATCAACAGGCAAAGAAGGGAGTTACTGTGCTGGCTGGGGTGATTGATCCTGACTACCAAGGGGAAATTGGACTGCTACTCCACAATGGAGGTAAGGAAGAGTATGTCTGGAATACAGGAGATCCCTTAGGGCGTCTCTTAGTATTACCATGCCCTGTGATTAAGGTCAATGGAAAACTACAACAACCCAATCCAGGCAGGACTACTAATGGCCCAGACCCTTCAGGAATGAAGGTTTGGGTCACCCCACCAGGTAAAGAACCACGACCAGCTGAGGTGCTTGCTGAAGGCAAAGGGAATACGGAATGGGTAGTGGAAGAAGGTAGTTATAAATACCAGCTACGACCACGTGACCAGTTACAGAAACGAGGACTGTAATTGTCATGAGTATTTCCTCCTTATTTTGTTATGAATATGTTTGTGTGTATATATACATATATTAAGCAAATATCTTTGTTTTCTTTCCTCTCTTATTCCCTTATCATGTAACATAAGATGTATTGACTTTATATCATAGTATTTAAGTATTGTTAATTTTACATCATAGTATTTAAGTTACGGGATATCAAGGAGAAGAGTAAACATCACTCAAGGACTTTACCTCCTCTTCTGGGGAAGGGGTTAGTGCGTTTTCGGTTGTACGCAGGATAGTTGTATCATGTTAGGCGGAATTATGACCTTGTTATTGTCTTTATTTGGAGATTAAGTATGGTTTAAGGAGATGCGTATGGGTGCCAAGTTGACAAGGGGTGGACT



TF motifs of the concenus sequence

Use FIMO to detect transcription factor motifs in the concenus sequence of the TE family.

TE_family TFBS Start End Strand Score Matched sequence
ERVL-B4 ZNF211 3455 3464 - 16.39 TATATACCAC
ERVL-B4 TCP23 1064 1071 + 16.36 GGGCCCAC
ERVL-B4 TCP23 2067 2074 - 16.36 GGGCCCAC
ERVL-B4 ERF014 4436 4450 + 16.33 CCTTATCCACCGTCA
ERVL-B4 ZNF816 587 601 + 16.25 TGGGGACGTGTGGGA
ERVL-B4 ZNF331 4150 4159 - 16.18 TGCTGAGCCC
ERVL-B4 KLF3 1515 1524 - 16.09 AACCACGCCC
ERVL-B4 LjSGA_053525.1 1064 1071 + 16.05 GGGCCCAC
ERVL-B4 LjSGA_053525.1 2067 2074 - 16.05 GGGCCCAC
ERVL-B4 SREBF1 5199 5208 + 16.05 GTCACCCCAC


TFBS enrichment in GRCh38

Use Fisher's exact test to perform enrichment analysis of transcription factor binding sites in the TE family of GRCh38.




GTEx

The promoter activity across 46 body sites from The Genotype-Tissue Expression (GTEx) project.




TCGA

The promoter activity across 33 cancer types from The Cancer Genome Atlas (TCGA).