ERVL47


DF ID: DF0000119
TE superfamily: ERV3
TE class: LTR
Species: Primates
Length: 5246 bp
Kimura value: 18.41
Tau index: 1.0000
Description: ERVL endogenous retrovirus, ERVL47 subfamily
Comment: Associated with LTR47B2 LTRs. Approximate ORF coordinates: Gag (234-1559), Pol (1560-4955).
Sequence:
AATGGCGAGCCAGCCAGGAGGAGCCNGGGAACCAAAGTATGGGCAAGGGGAAGGAAGCATCCGTGGGGGAAATCCCGGGGCGGCCCTCCTCATCTATGTGGGGTGGTGTTGCCCATTTGCTAGATGCCTGTGGACCACCGCGTGAGTACGGGGACGCCCCGAAAACGCCGGAGGGTCTAGAANGCTTGTTAGTAGAAAGCCGTATGACTCACTGTGGTAAGGAAGGTCGCNAGGCGGCGGCAACAGTGGGTTNGCCGCTGTTATGGNCTGTGCGAGTGGCCACAGAAGCGCAGGTAGCAGCTGAGGCGAGGGTAGGAAAGCTAGAGAAAGAATTACAGTTAGAAAGGGATATGAGAACTTCCACGTCTGTGTTAGCGTCTGGTTTGGTGGATAAGCTCGAAACTCAGGAGGCACAGTTGGAAACCGTAGCCTGCCGCTTTGTGCGGCTGGGAGGGAGGAAGCTGCGCCGGCCGAAAGTGCGGGCTGTCCTTGCCAAACCGGACTGGGATGCNAANACCTGGAATTCTTGGGAACCAACTAGTGAGTCAGATGANGACATAGAGGTTANCTCAGAGGAGGAGGATAATTATCCTCCCCTTTTGAGAGCGAGACCGCTCATGCAGCGGAAAGCAAAAGCCCAACACATGCAGCCGGGCAATAACGGACAGCCTTTTCAGGAAACCTTAACTGTCCGAGAGTACACCTCGGCAGGATTATTGGACATAGCAAAAACTTTTAAACAGCTGCCGAGAGAAAGCTTAGCCACATGGATGGTGCGGCTATGGGACACCGGTGGGGATGGCATCTCCCTAGCAGGGAGTGAAGCTGAAAAAATGAGTAATATAACCACCCATCCNGCTTTGAGACAGCATTTGCATAATGCTAGAACTGTCGAGGGNAATCATAGCCTTATGGACTGGATTATCCTGGCCATGAGAGAGGCTTGGCCNAATGAAGGGGACTTCCCGGGTCAGACCCCGGCATGGCGATCTCTGGAGGAAGCGCAGAGCGATTTANGGGAGTTGGGCATGCGNCAGGCCATCTATGCCCAGCAATTTGNAGGACCAGATAAGGCTGTGTTTACCGTGGGCATGAAAAATAAGTTGTTACAAAGTGCCCCCCGGGAGTGGCATGGCCCTCTCATATCCCTCTTGAGCCCCCTTGTGGGACAAGATGTATATGACGTGGGGGAAGCCATNNCTGATCTTGGAGAAACTGAAAAGGGAAGAGATAAGGTCCGACTGGTTACTAAAGGAAAAGGGAAAAAGGGAGAAACCAAGTCTGCTGGACTGAAAAAGGGAGGTCAAAAAGGCCCAGTTAGGATTACCAGGAAACAAATGTGGTATGATCTGATCTCAGCTGGGGTAGACAAAGAGAAAATAGACCGGCAACCCAATGCCATATTAGTGGGCCTTTGGAAAGACCTGACTCCCGATCAACAGTTTAGACCCCTTCCTAGTGCCCCACCAGAAGAGGGAGAAGGAGAAGAAGAAACTCCGCGTAAAAGAACCCCAATCTCTGCGTTCCAGGGGTGGACTCCTCCCCAGCCTAGAGACTAGGGGTGGGGCCAAGGTTGCCTCCGTGTANGAGCAATAGGGGGCGACCAGAGGCCCCATGTGGAGCTCACAATCTATTGGAGTCCAAAAAACAAGCAGAGAACCTTAGCTTTAGTGGGCACAGGTGCAGAATGCACCTTAATTCATGGAAATCCAGAGAGACACCCTGGTAAGTGGGCAGCTATAGATGGTTACGGGGGGCGAACAATCCGAGTGAAACAAACTCCTCTCCTCCTTGGTNTTGGGCGGAGTCCCCCCGCTTACTNTACTGTCTTTATCTCACCTATTCCAGAAAACATCTTGGGTATGGATGTCCTTTTAGGATGCACTTTACAAACATCTGTGGGGGAATTCCACCTATGAGTTTGGGCGGTAAAGGCTATTTTAAGAGGGGAAGCAAAATGGGAACCTGTACACCTCCCTCCCCCACGGCGTATTGTTAAT
GTGAAACAATACCATCTTCCTGGTGGGATAGAAGAAATCACAGCCACCATACAGGAACTGGCCAAAGTTAATATTATNCGGCCAGCCCAGAGTCCCTTCAACAGTCCTGTATGGCCGGTAAGAAAACCTGATGGCACCTGGCACATGACAGTAGACTACCGGGAACTAAATAAGGTGGTTCCCGAGATACACGCTGCTGTGCCTAATATAACTCAAGTGATAGAGCAAATAATACAAAATATAGGCACTTATCATGCTGTGTTAGATTTGGCTAATGCCTTCTTCAGCATCCCTTTACACCCTGACTCGCAGGACCAATTTGCTTTTACTTGGAATGGCCAACAATGGACATTCCAAGTGTTGCCCCAGGGNTATCTACANAGCCCCACTATTTGTCATGGAATGATTGCTAGAGATTTAACTTTATGTCCACTGCCACCTGCTGTTAAACAGTTTCATTACATTGATGATATTATGCTAACCTCTGAAGACTTGTCATTGCTACAGCAACACCTTGATGCATTGTGCACCCTTCTCCAATCCAGAGGATGGGCCATCAACCCGCAAAAGATACAAGGCCCAGGACCGGCTGTAAAGTTCCTAGGGGTCACTTGGTCGGGTAAGACACGCCTTATCCCAGGCATAGTCATTGACAAAATACAACAATTTTCCATGCCTAAAACAGTTAAACAGTTACAAAGTTTCCTAGGTCTTTTGGGATATTGGCGGGCTTTTATTCCACATTTAGCTCAATGTTTGCGTCCCCTATACCGACTAGTAAAGAAGGGATCTAGTTGGTGCTGGGATAAAGAACAAGAGGAAGCATTTGAGAAGGCTAAANTATTAGTGGCTCGGGCACAAGCCTTAGGTTCCCCCCTTCCCGGGNTACCGNTNTCTTTGGATGTGACCATAAGCCCTGAGGGGACCAGCTGGGCCCTCGGGCAAGTCCAGCATGGGAAAGCGGTTCCCCTAGGATTCTGGTCACANCTATGGAAGGGCGCTGAAACCCGCTATTCCCCGATTGAACAACGGGTCCTGGGGGTATANAAGGCCTTGCGGCAAGTTGAACCCGTAACTGCCGCTTTNCCGGTAACAGNGAAAACGGGTCTCCCTATNAAGGGCTGGANAGAAGGGTTGTTTNCCAGGCCTGNNTCAGCTATTGCCCAGGCCTCCACTTTACAAAANTGGCATGCANACCTGCAACAACGTAGCGCCCTCTCCACGAGTCCCTTGGGAGATGAACTGCATGCTNTCTTAGGGCCAGTACACTATGAGACCAGTGCTGCCCCTATTGTGGAGCCCCCACGGGGGATGCCTCCGATGNTACACGAAGGCACGGCCCCCATTCCTGAAAACGCTTGGTACTCGGATGGGTNNAGCCGAGGTAACCCTTGTGTATGGACGGCAGTAGCTGTACAACCGCAGACAGATACTATCTGGTTTGAGATGGGAATGCAGCAAAGCAGTCAATGGGCAGAACTCCGAGCTGCATGGTTGGTTTGTACCCATGAGCCATGGCCTATAGTTCTCTGTACAGATAGTTGGGCAGTATTTAAGGGTCTTACAACTTGGCTTGCCCAGTGGGCCCGGGATGACTGGCATGTACTTTAAAAACCCTTATGGGGAGCTGCCATGTGGAAAGACATTTGGGAAAGGCTACAAGAACCCACTGCGAGCCTAATTGTGTATCATGTTTCAGCACACTGGTCAGATTCACCTCCCGGTAACATGGAGGCTGATACCCTAGCAAAAATTAGAACACTGGCTCCCTCGCAATCNTCTGAGCTAGCTGATTGGGTACATAAACACAGTGGGCATCGCAGTGCACGAGTGGGCTGGCAAATAGCAAAGGGAGCAGGATTGCCCCTCCGCTATGCAGATTTAGCGGCGGCAGTAACAAACTGCTTAGTTTGCTCCCGCCTGCGCCCCCGCCGCATCCCACATACACCTGGACACATACATAAGACAGCCGCCCCTGTGAGAGACTGGCAGATAGACTA
CATCGGACCCCTGCCAGTAAGCTTGGGACAAAAGTATGCACTAACATGTGTAGACACTGCCACGGGATTGTTGCAGGCCTTCCCTTGCAAAAGGGCAAACCAAACAGCCACCATTAAGGGCTTGGAGCAACTCAGTGTCATGTATGGATACCCTCGATGCATTGATAGCGACCGAGGCACGCATTTCACTGGACATGATGTCCAAGATTGGGCACATGAAAAGGATATAGACCGGAGATTTCACTTGCCATATAATCCCCAAGCGGCAGGGTTGATTGAAAGGAAAAATGGCATTTTGAAGGCACAACTGCGAGCACTTTCACAATCCAATACCTTGCATGGGTGGGCGAAGGTTTTGCCTCAAGCCATTAGAAACCTTAATTCGGTTGAGACAAATACGGGGCTGGCACCATACCAATGACTCGGGACCACCGCAGAGGAGGGTCCATTAACCATAGTTGTAAAGAAAGTCCGACCAGACGCATTTCTACCGGAGCTGATAAAAGGCCAATGGCAAATGTTATTTGGGACTCCCCAAGACCTTGAGCCAGGGGAGGGGACACTTGAATGGGGGTTGGACTGGCAACTTCCCCCAGGTTGGATAGGGTATTTCTTGCCAGAGAGCGAGGAATTCCCCGGCCAACTAAAGTGGTCTCCGTTGATCCTGTTGGAGTCTGGGCCAAAACGCTCCACATACCAATACACTGGAACACGGCCCCTTTTAAAGGGCACTCTGGTCGGCCGTTTGACATGGTCCTTTGCTGCCCCTGTAACCTTNACAGATAATACCGGCACCTTCGCCCTTAGGCAACATGTTTGGTATGCACCCCCAGCCCATAATCCTTNGGCTGCCTATGTCCTAACTAACAGAGATGAAGCTACCACAGTCATTTTGCTTGATGGGGAAGAACTGCCCCGCCAAGTACCTACTAAACACTTGTATTTCCGCCCATAGTCTTCTGTTCCTGTTGCTGCTCTGCCCTACAAAAGCCTCTTGGTTTGGATCCTGGGGAACATGGTGGCAAAAGGCATTATTAATTCTGTGTCTTATACTTGGCATAGGTATCATTGCCTGTTGTTGTCTGTATTGTTGCTGCGGCCTCTGCTTACAAGTAGAAAACAAATTGATGCAACGTGTCACCCACGCCATGAAATGACTGCAGCACCCCTCNCCTCTAAAGGCTCAGGGACCATCGCGGAAGAGGNGGGCGCGTGAGATTGTAAGAGCCGGATTAGAGGGGTGGAG
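
The ORF coordinates in the Comment field can be turned into subsequences of the consensus. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not say so explicitly); the names `ORFS`, `slice_orf` and `orf_summary` are hypothetical helpers, not part of any database tool. Because the coordinates are described as rough and the consensus contains ambiguous N bases, any translation of these slices should be treated as approximate.

```python
# Approximate ORF coordinates quoted in the Comment field.
# Assumption: 1-based, inclusive positions on the consensus sequence.
ORFS = {"Gag": (234, 1559), "Pol": (1560, 4955)}


def slice_orf(sequence: str, start: int, end: int) -> str:
    """Return the subsequence for 1-based, inclusive coordinates."""
    if not 1 <= start <= end <= len(sequence):
        raise ValueError("coordinates fall outside the sequence")
    # Python slices are 0-based and end-exclusive, hence start - 1.
    return sequence[start - 1:end]


def orf_summary(orfs: dict) -> dict:
    """Map each ORF name to (length in nt, length is a multiple of 3)."""
    summary = {}
    for name, (start, end) in orfs.items():
        length = end - start + 1
        summary[name] = (length, length % 3 == 0)
    return summary


if __name__ == "__main__":
    for name, (length, whole_codons) in orf_summary(ORFS).items():
        note = "whole codons" if whole_codons else "partial codon"
        print(f"{name}: {length} nt ({note})")
```

To extract the actual regions, join the sequence lines above into one string and pass it to `slice_orf` together with a coordinate pair from `ORFS`.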



TF motifs of the consensus sequence

Transcription factor motifs detected with FIMO in the consensus sequence of the TE family.

TE_family  TFBS               Start  End   Strand  Score  Matched sequence
ERVL47     Ebf2               3227   3235  -       16.79  CCCAAGGGA
ERVL47     TFAP2A             2913   2925  +       16.72  AGCCCTGAGGGGA
ERVL47     KLF5               1559   1568  -       16.67  GCCCCACCCC
ERVL47     HLH4C              294    307   +       16.61  GTAGCAGCTGAGGC
ERVL47     DOF5.8             1249   1267  -       16.61  TTTTTCCCTTTTCCTTTAG
ERVL47     TFAP2C             2913   2925  -       16.60  TCCCCTCAGGGCT
ERVL47     EBF3               3227   3235  -       16.59  CCCAAGGGA
ERVL47     FAR1               5208   5216  -       16.59  TCACGCGCC
ERVL47     TFAP2C             2913   2925  +       16.55  AGCCCTGAGGGGA
ERVL47     ARALYDRAFT_496250  4425   4432  +       16.55  GGGACCAC
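
In the table above, the two TFAP2C hits at positions 2913-2925 carry different matched sequences on the + and - strands. A minimal sketch of how such rows can be parsed and compared, assuming that a minus-strand match is reported as the reverse complement of the plus-strand sequence (consistent with these two rows, but not stated on the page); `Hit`, `parse_hit` and `plus_strand_sequence` are hypothetical names introduced for illustration.

```python
from typing import NamedTuple

# Complement table for DNA, including the ambiguity code N.
_COMPLEMENT = str.maketrans("ACGTNacgtn", "TGCANtgcan")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


class Hit(NamedTuple):
    family: str
    motif: str
    start: int
    end: int
    strand: str
    score: float
    matched: str


def parse_hit(line: str) -> Hit:
    """Parse one whitespace-separated row of the motif table."""
    family, motif, start, end, strand, score, matched = line.split()
    return Hit(family, motif, int(start), int(end), strand, float(score), matched)


def plus_strand_sequence(hit: Hit) -> str:
    """Return the hit as it reads on the plus strand of the consensus."""
    if hit.strand == "+":
        return hit.matched
    return reverse_complement(hit.matched)


if __name__ == "__main__":
    rows = [
        "ERVL47 TFAP2C 2913 2925 - 16.60 TCCCCTCAGGGCT",
        "ERVL47 TFAP2C 2913 2925 + 16.55 AGCCCTGAGGGGA",
    ]
    for hit in map(parse_hit, rows):
        # The span is inclusive, so its length is end - start + 1.
        print(hit.motif, hit.strand, hit.end - hit.start + 1, plus_strand_sequence(hit))
```

Both rows resolve to the same plus-strand sequence, which is why palindrome-like motifs often appear twice in the table with slightly different scores.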


TFBS enrichment in GRCh38

Enrichment of transcription factor binding sites within this TE family in GRCh38, assessed with Fisher's exact test.
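
The page does not describe how the contingency table is built or whether the test is one- or two-sided. As an illustration only, the sketch below assumes a 2x2 table of regions (inside versus outside the TE family, with versus without a binding site for one factor) and a one-sided test for over-representation; the function name and the example counts are hypothetical.

```python
from math import comb


def fisher_exact_greater(a: int, b: int, c: int, d: int) -> float:
    """One-sided Fisher's exact test (enrichment) on a 2x2 table.

                      with TFBS   without TFBS
    in TE family          a            b
    outside family        c            d

    Returns P(X >= a) under the hypergeometric null with fixed margins.
    """
    if min(a, b, c, d) < 0:
        raise ValueError("counts must be non-negative")
    row1 = a + b            # regions in the TE family
    col1 = a + c            # regions with a binding site
    total = a + b + c + d
    denominator = comb(total, col1)
    upper = min(row1, col1)
    # Sum the probabilities of all tables at least as extreme as the observed one.
    numerator = sum(
        comb(row1, k) * comb(total - row1, col1 - k)
        for k in range(a, upper + 1)
    )
    return numerator / denominator


if __name__ == "__main__":
    # Hypothetical counts, not taken from the database.
    print(fisher_exact_greater(3, 1, 1, 3))
```

Using exact integer binomials from the standard library keeps the example dependency-free; for genome-scale counts a statistics library would normally be used instead.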




GTEx

Promoter activity across 46 body sites from the Genotype-Tissue Expression (GTEx) project.
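
The basic information lists a Tau index of 1.0000, but the page does not say how it was computed or from which values. A minimal sketch, assuming the commonly used tissue-specificity definition tau = sum(1 - x_i / x_max) / (n - 1) over n tissues, and assuming (this is a guess, not stated here) that per-tissue activity values such as these are the input; `tau_index` is a hypothetical helper.

```python
def tau_index(expression):
    """Tissue-specificity index: 0 for uniform activity, 1 for a single tissue.

    Assumes non-negative values, one per tissue, on a comparable scale.
    """
    values = [float(x) for x in expression]
    if len(values) < 2:
        raise ValueError("at least two tissues are required")
    if any(v < 0 for v in values):
        raise ValueError("expression values must be non-negative")
    peak = max(values)
    if peak == 0:
        # No activity anywhere; specificity is undefined, reported here as 0.
        return 0.0
    return sum(1 - v / peak for v in values) / (len(values) - 1)


if __name__ == "__main__":
    print(tau_index([0, 0, 7, 0]))   # activity confined to one tissue
    print(tau_index([2, 2, 2, 2]))   # identical activity in every tissue
```

Under this definition, a value of 1.0000 would correspond to activity confined to a single tissue.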




TCGA

Promoter activity across 33 cancer types from The Cancer Genome Atlas (TCGA).