ERVL47


DF ID DF0000119
TE superfamily ERV3
TE class LTR
Species Primates
Length 5246
Kimura value 18.41
Tau index 1.0000
Description ERVL endogenous retrovirus, ERVL47 subfamily
Comment Associated with LTR47B2 LTRs. ORF coordinates are roughly: Gag (234-1559), Pol (1560-4955).
Sequence
AATGGCGAGCCAGCCAGGAGGAGCCNGGGAACCAAAGTATGGGCAAGGGGAAGGAAGCATCCGTGGGGGAAATCCCGGGGCGGCCCTCCTCATCTATGTGGGGTGGTGTTGCCCATTTGCTAGATGCCTGTGGACCACCGCGTGAGTACGGGGACGCCCCGAAAACGCCGGAGGGTCTAGAANGCTTGTTAGTAGAAAGCCGTATGACTCACTGTGGTAAGGAAGGTCGCNAGGCGGCGGCAACAGTGGGTTNGCCGCTGTTATGGNCTGTGCGAGTGGCCACAGAAGCGCAGGTAGCAGCTGAGGCGAGGGTAGGAAAGCTAGAGAAAGAATTACAGTTAGAAAGGGATATGAGAACTTCCACGTCTGTGTTAGCGTCTGGTTTGGTGGATAAGCTCGAAACTCAGGAGGCACAGTTGGAAACCGTAGCCTGCCGCTTTGTGCGGCTGGGAGGGAGGAAGCTGCGCCGGCCGAAAGTGCGGGCTGTCCTTGCCAAACCGGACTGGGATGCNAANACCTGGAATTCTTGGGAACCAACTAGTGAGTCAGATGANGACATAGAGGTTANCTCAGAGGAGGAGGATAATTATCCTCCCCTTTTGAGAGCGAGACCGCTCATGCAGCGGAAAGCAAAAGCCCAACACATGCAGCCGGGCAATAACGGACAGCCTTTTCAGGAAACCTTAACTGTCCGAGAGTACACCTCGGCAGGATTATTGGACATAGCAAAAACTTTTAAACAGCTGCCGAGAGAAAGCTTAGCCACATGGATGGTGCGGCTATGGGACACCGGTGGGGATGGCATCTCCCTAGCAGGGAGTGAAGCTGAAAAAATGAGTAATATAACCACCCATCCNGCTTTGAGACAGCATTTGCATAATGCTAGAACTGTCGAGGGNAATCATAGCCTTATGGACTGGATTATCCTGGCCATGAGAGAGGCTTGGCCNAATGAAGGGGACTTCCCGGGTCAGACCCCGGCATGGCGATCTCTGGAGGAAGCGCAGAGCGATTTANGGGAGTTGGGCATGCGNCAGGCCATCTATGCCCAGCAATTTGNAGGACCAGATAAGGCTGTGTTTACCGTGGGCATGAAAAATAAGTTGTTACAAAGTGCCCCCCGGGAGTGGCATGGCCCTCTCATATCCCTCTTGAGCCCCCTTGTGGGACAAGATGTATATGACGTGGGGGAAGCCATNNCTGATCTTGGAGAAACTGAAAAGGGAAGAGATAAGGTCCGACTGGTTACTAAAGGAAAAGGGAAAAAGGGAGAAACCAAGTCTGCTGGACTGAAAAAGGGAGGTCAAAAAGGCCCAGTTAGGATTACCAGGAAACAAATGTGGTATGATCTGATCTCAGCTGGGGTAGACAAAGAGAAAATAGACCGGCAACCCAATGCCATATTAGTGGGCCTTTGGAAAGACCTGACTCCCGATCAACAGTTTAGACCCCTTCCTAGTGCCCCACCAGAAGAGGGAGAAGGAGAAGAAGAAACTCCGCGTAAAAGAACCCCAATCTCTGCGTTCCAGGGGTGGACTCCTCCCCAGCCTAGAGACTAGGGGTGGGGCCAAGGTTGCCTCCGTGTANGAGCAATAGGGGGCGACCAGAGGCCCCATGTGGAGCTCACAATCTATTGGAGTCCAAAAAACAAGCAGAGAACCTTAGCTTTAGTGGGCACAGGTGCAGAATGCACCTTAATTCATGGAAATCCAGAGAGACACCCTGGTAAGTGGGCAGCTATAGATGGTTACGGGGGGCGAACAATCCGAGTGAAACAAACTCCTCTCCTCCTTGGTNTTGGGCGGAGTCCCCCCGCTTACTNTACTGTCTTTATCTCACCTATTCCAGAAAACATCTTGGGTATGGATGTCCTTTTAGGATGCACTTTACAAACATCTGTGGGGGAATTCCACCTATGAGTTTGGGCGGTAAAGGCTATTTTAAGAGGGGAAGCAAAATGGGAACCTGTACACCTCCCTCCCCCACGGCGTATTGTTAAT
GTGAAACAATACCATCTTCCTGGTGGGATAGAAGAAATCACAGCCACCATACAGGAACTGGCCAAAGTTAATATTATNCGGCCAGCCCAGAGTCCCTTCAACAGTCCTGTATGGCCGGTAAGAAAACCTGATGGCACCTGGCACATGACAGTAGACTACCGGGAACTAAATAAGGTGGTTCCCGAGATACACGCTGCTGTGCCTAATATAACTCAAGTGATAGAGCAAATAATACAAAATATAGGCACTTATCATGCTGTGTTAGATTTGGCTAATGCCTTCTTCAGCATCCCTTTACACCCTGACTCGCAGGACCAATTTGCTTTTACTTGGAATGGCCAACAATGGACATTCCAAGTGTTGCCCCAGGGNTATCTACANAGCCCCACTATTTGTCATGGAATGATTGCTAGAGATTTAACTTTATGTCCACTGCCACCTGCTGTTAAACAGTTTCATTACATTGATGATATTATGCTAACCTCTGAAGACTTGTCATTGCTACAGCAACACCTTGATGCATTGTGCACCCTTCTCCAATCCAGAGGATGGGCCATCAACCCGCAAAAGATACAAGGCCCAGGACCGGCTGTAAAGTTCCTAGGGGTCACTTGGTCGGGTAAGACACGCCTTATCCCAGGCATAGTCATTGACAAAATACAACAATTTTCCATGCCTAAAACAGTTAAACAGTTACAAAGTTTCCTAGGTCTTTTGGGATATTGGCGGGCTTTTATTCCACATTTAGCTCAATGTTTGCGTCCCCTATACCGACTAGTAAAGAAGGGATCTAGTTGGTGCTGGGATAAAGAACAAGAGGAAGCATTTGAGAAGGCTAAANTATTAGTGGCTCGGGCACAAGCCTTAGGTTCCCCCCTTCCCGGGNTACCGNTNTCTTTGGATGTGACCATAAGCCCTGAGGGGACCAGCTGGGCCCTCGGGCAAGTCCAGCATGGGAAAGCGGTTCCCCTAGGATTCTGGTCACANCTATGGAAGGGCGCTGAAACCCGCTATTCCCCGATTGAACAACGGGTCCTGGGGGTATANAAGGCCTTGCGGCAAGTTGAACCCGTAACTGCCGCTTTNCCGGTAACAGNGAAAACGGGTCTCCCTATNAAGGGCTGGANAGAAGGGTTGTTTNCCAGGCCTGNNTCAGCTATTGCCCAGGCCTCCACTTTACAAAANTGGCATGCANACCTGCAACAACGTAGCGCCCTCTCCACGAGTCCCTTGGGAGATGAACTGCATGCTNTCTTAGGGCCAGTACACTATGAGACCAGTGCTGCCCCTATTGTGGAGCCCCCACGGGGGATGCCTCCGATGNTACACGAAGGCACGGCCCCCATTCCTGAAAACGCTTGGTACTCGGATGGGTNNAGCCGAGGTAACCCTTGTGTATGGACGGCAGTAGCTGTACAACCGCAGACAGATACTATCTGGTTTGAGATGGGAATGCAGCAAAGCAGTCAATGGGCAGAACTCCGAGCTGCATGGTTGGTTTGTACCCATGAGCCATGGCCTATAGTTCTCTGTACAGATAGTTGGGCAGTATTTAAGGGTCTTACAACTTGGCTTGCCCAGTGGGCCCGGGATGACTGGCATGTACTTTAAAAACCCTTATGGGGAGCTGCCATGTGGAAAGACATTTGGGAAAGGCTACAAGAACCCACTGCGAGCCTAATTGTGTATCATGTTTCAGCACACTGGTCAGATTCACCTCCCGGTAACATGGAGGCTGATACCCTAGCAAAAATTAGAACACTGGCTCCCTCGCAATCNTCTGAGCTAGCTGATTGGGTACATAAACACAGTGGGCATCGCAGTGCACGAGTGGGCTGGCAAATAGCAAAGGGAGCAGGATTGCCCCTCCGCTATGCAGATTTAGCGGCGGCAGTAACAAACTGCTTAGTTTGCTCCCGCCTGCGCCCCCGCCGCATCCCACATACACCTGGACACATACATAAGACAGCCGCCCCTGTGAGAGACTGGCAGATAGACTA
CATCGGACCCCTGCCAGTAAGCTTGGGACAAAAGTATGCACTAACATGTGTAGACACTGCCACGGGATTGTTGCAGGCCTTCCCTTGCAAAAGGGCAAACCAAACAGCCACCATTAAGGGCTTGGAGCAACTCAGTGTCATGTATGGATACCCTCGATGCATTGATAGCGACCGAGGCACGCATTTCACTGGACATGATGTCCAAGATTGGGCACATGAAAAGGATATAGACCGGAGATTTCACTTGCCATATAATCCCCAAGCGGCAGGGTTGATTGAAAGGAAAAATGGCATTTTGAAGGCACAACTGCGAGCACTTTCACAATCCAATACCTTGCATGGGTGGGCGAAGGTTTTGCCTCAAGCCATTAGAAACCTTAATTCGGTTGAGACAAATACGGGGCTGGCACCATACCAATGACTCGGGACCACCGCAGAGGAGGGTCCATTAACCATAGTTGTAAAGAAAGTCCGACCAGACGCATTTCTACCGGAGCTGATAAAAGGCCAATGGCAAATGTTATTTGGGACTCCCCAAGACCTTGAGCCAGGGGAGGGGACACTTGAATGGGGGTTGGACTGGCAACTTCCCCCAGGTTGGATAGGGTATTTCTTGCCAGAGAGCGAGGAATTCCCCGGCCAACTAAAGTGGTCTCCGTTGATCCTGTTGGAGTCTGGGCCAAAACGCTCCACATACCAATACACTGGAACACGGCCCCTTTTAAAGGGCACTCTGGTCGGCCGTTTGACATGGTCCTTTGCTGCCCCTGTAACCTTNACAGATAATACCGGCACCTTCGCCCTTAGGCAACATGTTTGGTATGCACCCCCAGCCCATAATCCTTNGGCTGCCTATGTCCTAACTAACAGAGATGAAGCTACCACAGTCATTTTGCTTGATGGGGAAGAACTGCCCCGCCAAGTACCTACTAAACACTTGTATTTCCGCCCATAGTCTTCTGTTCCTGTTGCTGCTCTGCCCTACAAAAGCCTCTTGGTTTGGATCCTGGGGAACATGGTGGCAAAAGGCATTATTAATTCTGTGTCTTATACTTGGCATAGGTATCATTGCCTGTTGTTGTCTGTATTGTTGCTGCGGCCTCTGCTTACAAGTAGAAAACAAATTGATGCAACGTGTCACCCACGCCATGAAATGACTGCAGCACCCCTCNCCTCTAAAGGCTCAGGGACCATCGCGGAAGAGGNGGGCGCGTGAGATTGTAAGAGCCGGATTAGAGGGGTGGAG
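
The approximate ORF coordinates in the Comment field can be turned into subsequences with a small helper. The sketch below assumes the coordinates are 1-based and inclusive, which the page does not state explicitly; `extract_region`, `ORFS`, and `orf_lengths` are illustrative names, not part of any database API.

```python
def extract_region(sequence: str, start: int, end: int) -> str:
    """Return the 1-based, inclusive region [start, end] of a sequence."""
    if not 1 <= start <= end <= len(sequence):
        raise ValueError("coordinates fall outside the sequence")
    return sequence[start - 1:end]


# Approximate ORF coordinates from the Comment field
# (assumed here to be 1-based and inclusive).
ORFS = {"Gag": (234, 1559), "Pol": (1560, 4955)}


def orf_lengths(orfs: dict) -> dict:
    """Length in bases of each ORF, counting both end positions."""
    return {name: end - start + 1 for name, (start, end) in orfs.items()}


if __name__ == "__main__":
    for name, length in orf_lengths(ORFS).items():
        # Both lengths are multiples of 3 under the inclusive reading.
        print(name, length, "bases,", length // 3, "codons")
```

With the full consensus sequence held in a string `consensus` (a hypothetical variable), `extract_region(consensus, *ORFS["Gag"])` would return the Gag region.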



TF motifs of the consensus sequence

Use FIMO to detect transcription factor motifs in the consensus sequence of the TE family.

TE_family  TFBS    Start  End   Strand  Score  Matched sequence
ERVL47     ZNF16   1729   1749  +       19.47  AGTGGGCAGCTATAGATGGTT
ERVL47     AS2     3917   3934  -       19.00  CGGCGGGGGCGCAGGCGG
ERVL47     Wt1     1977   1986  +       18.54  CCTCCCCCAC
ERVL47     HLH4C   2923   2936  +       18.16  GGACCAGCTGGGCC
ERVL47     HLH4C   2923   2936  -       18.08  GGCCCAGCTGGTCC
ERVL47     ZBTB24  3031   3040  -       17.49  CCCAGGACCC
ERVL47     DOF3.2  1256   1271  -       17.44  TCCCTTTTTCCCTTTT
ERVL47     sens    2035   2044  +       17.41  AAATCACAGC
ERVL47     Dif     67     76    -       17.39  GGGATTTCCC
ERVL47     GT-4    4450   4461  -       17.33  CAACTATGGTTA
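
The hits above can be checked against the consensus by hand. The sketch below assumes that Start and End are 1-based and inclusive and that a "-" strand hit is reported as the reverse complement of the forward sequence; the Dif hit at 67-76, which lies within the first 80 bases of the consensus, is consistent with that reading. The helper names are illustrative and are not FIMO functions.

```python
# Complement table; N (ambiguous base) maps to itself.
_COMPLEMENT = str.maketrans("ACGTN", "TGCAN")


def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA string over the alphabet A, C, G, T, N."""
    return seq.translate(_COMPLEMENT)[::-1]


def hit_sequence(consensus: str, start: int, end: int, strand: str) -> str:
    """Sequence a hit would report, given 1-based inclusive coordinates."""
    site = consensus[start - 1:end]
    return reverse_complement(site) if strand == "-" else site


# First 80 bases of the ERVL47 consensus sequence listed on this page.
CONSENSUS_PREFIX = (
    "AATGGCGAGCCAGCCAGGAGGAGCCNGGGAACCAAAGTAT"
    "GGGCAAGGGGAAGGAAGCATCCGTGGGGGAAATCCCGGGG"
)

if __name__ == "__main__":
    # Dif hit: positions 67-76 on the minus strand.
    print(hit_sequence(CONSENSUS_PREFIX, 67, 76, "-"))
    # The two HLH4C rows are reverse complements of each other.
    print(reverse_complement("GGACCAGCTGGGCC"))
```

The two HLH4C rows share the same coordinates because the matched site is close to palindromic, so the motif scores well on both strands.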


TFBS enrichment in GRCh38

Use Fisher's exact test to test for enrichment of transcription factor binding sites in the GRCh38 copies of the TE family.
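
As a rough illustration of the test itself, the sketch below computes a one-sided Fisher's exact p-value from a 2x2 contingency table using only the standard library. How the page builds its table is not stated; the layout in the docstring (binding sites inside or outside the TE family versus a background) is an assumption, and the counts in the example are toy numbers, not results for ERVL47.

```python
from math import comb


def fisher_exact_greater(a: int, b: int, c: int, d: int) -> float:
    """One-sided Fisher's exact test for the 2x2 table [[a, b], [c, d]].

    Assumed layout (hypothetical):
        a = TF sites inside the TE family     b = other sites inside the TE family
        c = TF sites outside the TE family    d = other sites outside the TE family
    Returns P(X >= a) under the hypergeometric null with fixed margins.
    """
    row1 = a + b          # size of the first row
    col1 = a + c          # size of the first column
    total = a + b + c + d
    upper = min(row1, col1)
    favourable = sum(
        comb(col1, k) * comb(total - col1, row1 - k)
        for k in range(a, upper + 1)
    )
    return favourable / comb(total, row1)


if __name__ == "__main__":
    # Toy counts only: the classic 3/1/1/3 table gives 17/70.
    print(fisher_exact_greater(3, 1, 1, 3))
```

A small p-value would indicate more binding sites in the TE family than expected from the margins; in practice a library routine such as SciPy's `fisher_exact` would normally be used instead of this hand-rolled version.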




GTEx

Promoter activity across 46 body sites from the Genotype-Tissue Expression (GTEx) project.
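
The basic information lists a Tau index of 1.0000 without defining it. Assuming it is the commonly used tissue-specificity index computed over per-tissue activity values (an assumption, since the page does not say which data it is derived from), it can be sketched as follows. The function name and the example values are illustrative only.

```python
def tau_index(values: list) -> float:
    """Tissue-specificity index tau = sum(1 - x_i / max(x)) / (n - 1).

    Returns 0 for perfectly uniform activity and 1 when activity is
    confined to a single tissue. Values are assumed to be non-negative.
    """
    n = len(values)
    if n < 2:
        raise ValueError("need at least two tissues")
    if min(values) < 0:
        raise ValueError("values must be non-negative")
    peak = max(values)
    if peak == 0:
        raise ValueError("tau is undefined when all values are zero")
    return sum(1 - x / peak for x in values) / (n - 1)


if __name__ == "__main__":
    # Illustrative values, not GTEx data.
    print(tau_index([0.0, 0.0, 5.0, 0.0]))   # single-tissue activity
    print(tau_index([2.0, 2.0, 2.0, 2.0]))   # uniform activity
```

Under this definition, a value of 1.0000 would correspond to activity detected in only one tissue.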




TCGA

Promoter activity across 33 cancer types from The Cancer Genome Atlas (TCGA).