ERVL47

Basic information Differential Expression Stage analysis Survival analysis Correlation analysis

DF ID DF0000119
TE superfamily ERV3
TE class LTR
Species Primates
Length 5246
Kimura value 18.41
Tau index 1.0000
Description ERVL endogenous retrovirus, ERVL47 subfamily
Comment Associated with LTR47B2 LTRs. ORF coordinates are roughly: Gag (234-1559), Pol (1560-4955).
Sequence
AATGGCGAGCCAGCCAGGAGGAGCCNGGGAACCAAAGTATGGGCAAGGGGAAGGAAGCATCCGTGGGGGAAATCCCGGGGCGGCCCTCCTCATCTATGTGGGGTGGTGTTGCCCATTTGCTAGATGCCTGTGGACCACCGCGTGAGTACGGGGACGCCCCGAAAACGCCGGAGGGTCTAGAANGCTTGTTAGTAGAAAGCCGTATGACTCACTGTGGTAAGGAAGGTCGCNAGGCGGCGGCAACAGTGGGTTNGCCGCTGTTATGGNCTGTGCGAGTGGCCACAGAAGCGCAGGTAGCAGCTGAGGCGAGGGTAGGAAAGCTAGAGAAAGAATTACAGTTAGAAAGGGATATGAGAACTTCCACGTCTGTGTTAGCGTCTGGTTTGGTGGATAAGCTCGAAACTCAGGAGGCACAGTTGGAAACCGTAGCCTGCCGCTTTGTGCGGCTGGGAGGGAGGAAGCTGCGCCGGCCGAAAGTGCGGGCTGTCCTTGCCAAACCGGACTGGGATGCNAANACCTGGAATTCTTGGGAACCAACTAGTGAGTCAGATGANGACATAGAGGTTANCTCAGAGGAGGAGGATAATTATCCTCCCCTTTTGAGAGCGAGACCGCTCATGCAGCGGAAAGCAAAAGCCCAACACATGCAGCCGGGCAATAACGGACAGCCTTTTCAGGAAACCTTAACTGTCCGAGAGTACACCTCGGCAGGATTATTGGACATAGCAAAAACTTTTAAACAGCTGCCGAGAGAAAGCTTAGCCACATGGATGGTGCGGCTATGGGACACCGGTGGGGATGGCATCTCCCTAGCAGGGAGTGAAGCTGAAAAAATGAGTAATATAACCACCCATCCNGCTTTGAGACAGCATTTGCATAATGCTAGAACTGTCGAGGGNAATCATAGCCTTATGGACTGGATTATCCTGGCCATGAGAGAGGCTTGGCCNAATGAAGGGGACTTCCCGGGTCAGACCCCGGCATGGCGATCTCTGGAGGAAGCGCAGAGCGATTTANGGGAGTTGGGCATGCGNCAGGCCATCTATGCCCAGCAATTTGNAGGACCAGATAAGGCTGTGTTTACCGTGGGCATGAAAAATAAGTTGTTACAAAGTGCCCCCCGGGAGTGGCATGGCCCTCTCATATCCCTCTTGAGCCCCCTTGTGGGACAAGATGTATATGACGTGGGGGAAGCCATNNCTGATCTTGGAGAAACTGAAAAGGGAAGAGATAAGGTCCGACTGGTTACTAAAGGAAAAGGGAAAAAGGGAGAAACCAAGTCTGCTGGACTGAAAAAGGGAGGTCAAAAAGGCCCAGTTAGGATTACCAGGAAACAAATGTGGTATGATCTGATCTCAGCTGGGGTAGACAAAGAGAAAATAGACCGGCAACCCAATGCCATATTAGTGGGCCTTTGGAAAGACCTGACTCCCGATCAACAGTTTAGACCCCTTCCTAGTGCCCCACCAGAAGAGGGAGAAGGAGAAGAAGAAACTCCGCGTAAAAGAACCCCAATCTCTGCGTTCCAGGGGTGGACTCCTCCCCAGCCTAGAGACTAGGGGTGGGGCCAAGGTTGCCTCCGTGTANGAGCAATAGGGGGCGACCAGAGGCCCCATGTGGAGCTCACAATCTATTGGAGTCCAAAAAACAAGCAGAGAACCTTAGCTTTAGTGGGCACAGGTGCAGAATGCACCTTAATTCATGGAAATCCAGAGAGACACCCTGGTAAGTGGGCAGCTATAGATGGTTACGGGGGGCGAACAATCCGAGTGAAACAAACTCCTCTCCTCCTTGGTNTTGGGCGGAGTCCCCCCGCTTACTNTACTGTCTTTATCTCACCTATTCCAGAAAACATCTTGGGTATGGATGTCCTTTTAGGATGCACTTTACAAACATCTGTGGGGGAATTCCACCTATGAGTTTGGGCGGTAAAGGCTATTTTAAGAGGGGAAGCAAAATGGGAACCTGTACACCTCCCTCCCCCACGGCGTATTGTTAATGTGAAACAATACCATCTTCCTGGTGGGATAGAAGAAATCACAGCCACCATACAGGAACTGGCCAAAGTTAATATTATNCGGCCAGCCCAGAGTCCCTTCAACAGTCCTGTATGGCCGGTAAGAAAACCTGATGGCACCTGGCACATGACAGTAGACTACCGGGAACTAAATAAGGTGGTTCCCGAGATACACGCTGCTGTGCCTAATATAACTCAAGTGATAGAGCAAATAATACAAAATATAGGCACTTATCATGCTGTGTTAGATTTGGCTAATGCCTTCTTCAGCATCCCTTTACACCCTGACTCGCAGGACCAATTTGCTTTTACTTGGAATGGCCAACAATGGACATTCCAAGTGTTGCCCCAGGGNTATCTACANAGCCCCACTATTTGTCATGGAATGATTGCTAGAGATTTAACTTTATGTCCACTGCCACCTGCTGTTAAACAGTTTCATTACATTGATGATATTATGCTAACCTCTGAAGACTTGTCATTGCTACAGCAACACCTTGATGCATTGTGCACCCTTCTCCAATCCAGAGGATGGGCCATCAACCCGCAAAAGATACAAGGCCCAGGACCGGCTGTAAAGTTCCTAGGGGTCACTTGGTCGGGTAAGACACGCCTTATCCCAGGCATAGTCATTGACAAAATACAACAATTTTCCATGCCTAAAACAGTTAAACAGTTACAAAGTTTCCTAGGTCTTTTGGGATATTGGCGGGCTTTTATTCCACATTTAGCTCAATGTTTGCGTCCCCTATACCGACTAGTAAAGAAGGGATCTAGTTGGTGCTGGGATAAAGAACAAGAGGAAGCATTTGAGAAGGCTAAANTATTAGTGGCTCGGGCACAAGCCTTAGGTTCCCCCCTTCCCGGGNTACCGNTNTCTTTGGATGTGACCATAAGCCCTGAGGGGACCAGCTGGGCCCTCGGGCAAGTCCAGCATGGGAAAGCGGTTCCCCTAGGATTCTGGTCACANCTATGGAAGGGCGCTGAAACCCGCTATTCCCCGATTGAACAACGGGTCCTGGGGGTATANAAGGCCTTGCGGCAAGTTGAACCCGTAACTGCCGCTTTNCCGGTAACAGNGAAAACGGGTCTCCCTATNAAGGGCTGGANAGAAGGGTTGTTTNCCAGGCCTGNNTCAGCTATTGCCCAGGCCTCCACTTTACAAAANTGGCATGCANACCTGCAACAACGTAGCGCCCTCTCCACGAGTCCCTTGGGAGATGAACTGCATGCTNTCTTAGGGCCAGTACACTATGAGACCAGTGCTGCCCCTATTGTGGAGCCCCCACGGGGGATGCCTCCGATGNTACACGAAGGCACGGCCCCCATTCCTGAAAACGCTTGGTACTCGGATGGGTNNAGCCGAGGTAACCCTTGTGTATGGACGGCAGTAGCTGTACAACCGCAGACAGATACTATCTGGTTTGAGATGGGAATGCAGCAAAGCAGTCAATGGGCAGAACTCCGAGCTGCATGGTTGGTTTGTACCCATGAGCCATGGCCTATAGTTCTCTGTACAGATAGTTGGGCAGTATTTAAGGGTCTTACAACTTGGCTTGCCCAGTGGGCCCGGGATGACTGGCATGTACTTTAAAAACCCTTATGGGGAGCTGCCATGTGGAAAGACATTTGGGAAAGGCTACAAGAACCCACTGCGAGCCTAATTGTGTATCATGTTTCAGCACACTGGTCAGATTCACCTCCCGGTAACATGGAGGCTGATACCCTAGCAAAAATTAGAACACTGGCTCCCTCGCAATCNTCTGAGCTAGCTGATTGGGTACATAAACACAGTGGGCATCGCAGTGCACGAGTGGGCTGGCAAATAGCAAAGGGAGCAGGATTGCCCCTCCGCTATGCAGATTTAGCGGCGGCAGTAACAAACTGCTTAGTTTGCTCCCGCCTGCGCCCCCGCCGCATCCCACATACACCTGGACACATACATAAGACAGCCGCCCCTGTGAGAGACTGGCAGATAGACTACATCGGACCCCTGCCAGTAAGCTTGGGACAAAAGTATGCACTAACATGTGTAGACACTGCCACGGGATTGTTGCAGGCCTTCCCTTGCAAAAGGGCAAACCAAACAGCCACCATTAAGGGCTTGGAGCAACTCAGTGTCATGTATGGATACCCTCGATGCATTGATAGCGACCGAGGCACGCATTTCACTGGACATGATGTCCAAGATTGGGCACATGAAAAGGATATAGACCGGAGATTTCACTTGCCATATAATCCCCAAGCGGCAGGGTTGATTGAAAGGAAAAATGGCATTTTGAAGGCACAACTGCGAGCACTTTCACAATCCAATACCTTGCATGGGTGGGCGAAGGTTTTGCCTCAAGCCATTAGAAACCTTAATTCGGTTGAGACAAATACGGGGCTGGCACCATACCAATGACTCGGGACCACCGCAGAGGAGGGTCCATTAACCATAGTTGTAAAGAAAGTCCGACCAGACGCATTTCTACCGGAGCTGATAAAAGGCCAATGGCAAATGTTATTTGGGACTCCCCAAGACCTTGAGCCAGGGGAGGGGACACTTGAATGGGGGTTGGACTGGCAACTTCCCCCAGGTTGGATAGGGTATTTCTTGCCAGAGAGCGAGGAATTCCCCGGCCAACTAAAGTGGTCTCCGTTGATCCTGTTGGAGTCTGGGCCAAAACGCTCCACATACCAATACACTGGAACACGGCCCCTTTTAAAGGGCACTCTGGTCGGCCGTTTGACATGGTCCTTTGCTGCCCCTGTAACCTTNACAGATAATACCGGCACCTTCGCCCTTAGGCAACATGTTTGGTATGCACCCCCAGCCCATAATCCTTNGGCTGCCTATGTCCTAACTAACAGAGATGAAGCTACCACAGTCATTTTGCTTGATGGGGAAGAACTGCCCCGCCAAGTACCTACTAAACACTTGTATTTCCGCCCATAGTCTTCTGTTCCTGTTGCTGCTCTGCCCTACAAAAGCCTCTTGGTTTGGATCCTGGGGAACATGGTGGCAAAAGGCATTATTAATTCTGTGTCTTATACTTGGCATAGGTATCATTGCCTGTTGTTGTCTGTATTGTTGCTGCGGCCTCTGCTTACAAGTAGAAAACAAATTGATGCAACGTGTCACCCACGCCATGAAATGACTGCAGCACCCCTCNCCTCTAAAGGCTCAGGGACCATCGCGGAAGAGGNGGGCGCGTGAGATTGTAAGAGCCGGATTAGAGGGGTGGAG



TF motifs of the concenus sequence

Use FIMO to detect transcription factor motifs in the concenus sequence of the TE family.

TE_family TFBS Start End Strand Score Matched sequence
ERVL47 TFAP2B 2913 2925 - 16.54 TCCCCTCAGGGCT
ERVL47 Gfi1B 2035 2044 + 16.50 AAATCACAGC
ERVL47 TFAP2A 2913 2925 - 16.46 TCCCCTCAGGGCT
ERVL47 Zm00001d020595 233 242 + 16.42 GGCGGCGGCA
ERVL47 TCP23 3581 3588 - 16.36 GGGCCCAC
ERVL47 CTCF 2433 2447 - 16.33 AACAGCAGGTGGCAG
ERVL47 ZBTB6 4541 4549 + 16.28 CCTTGAGCC
ERVL47 TCP3 4425 4432 + 16.23 GGGACCAC
ERVL47 TCP24 4425 4432 + 16.21 GGGACCAC
ERVL47 TFAP2B 2913 2925 + 16.18 AGCCCTGAGGGGA


TFBS enrichment in GRCh38

Use Fisher's exact test to perform enrichment analysis of transcription factor binding sites in the TE family of GRCh38.




GTEx

The promoter activity across 46 body sites from The Genotype-Tissue Expression (GTEx) project.




TCGA

The promoter activity across 33 cancer types from The Cancer Genome Atlas (TCGA).