ERVL47
Basic information Differential Expression Stage analysis Survival analysis Correlation analysisDF ID | DF0000119 |
---|---|
TE superfamily | ERV3 |
TE class | LTR |
Species | Primates |
Length | 5246 |
Kimura value | 18.41 |
Tau index | 1.0000 |
Description | ERVL endogenous retrovirus, ERVL47 subfamily |
Comment | Associated with LTR47B2 LTRs. ORF coordinates are roughly: Gag (234-1559), Pol (1560-4955). |
Sequence |
AATGGCGAGCCAGCCAGGAGGAGCCNGGGAACCAAAGTATGGGCAAGGGGAAGGAAGCATCCGTGGGGGAAATCCCGGGGCGGCCCTCCTCATCTATGTGGGGTGGTGTTGCCCATTTGCTAGATGCCTGTGGACCACCGCGTGAGTACGGGGACGCCCCGAAAACGCCGGAGGGTCTAGAANGCTTGTTAGTAGAAAGCCGTATGACTCACTGTGGTAAGGAAGGTCGCNAGGCGGCGGCAACAGTGGGTTNGCCGCTGTTATGGNCTGTGCGAGTGGCCACAGAAGCGCAGGTAGCAGCTGAGGCGAGGGTAGGAAAGCTAGAGAAAGAATTACAGTTAGAAAGGGATATGAGAACTTCCACGTCTGTGTTAGCGTCTGGTTTGGTGGATAAGCTCGAAACTCAGGAGGCACAGTTGGAAACCGTAGCCTGCCGCTTTGTGCGGCTGGGAGGGAGGAAGCTGCGCCGGCCGAAAGTGCGGGCTGTCCTTGCCAAACCGGACTGGGATGCNAANACCTGGAATTCTTGGGAACCAACTAGTGAGTCAGATGANGACATAGAGGTTANCTCAGAGGAGGAGGATAATTATCCTCCCCTTTTGAGAGCGAGACCGCTCATGCAGCGGAAAGCAAAAGCCCAACACATGCAGCCGGGCAATAACGGACAGCCTTTTCAGGAAACCTTAACTGTCCGAGAGTACACCTCGGCAGGATTATTGGACATAGCAAAAACTTTTAAACAGCTGCCGAGAGAAAGCTTAGCCACATGGATGGTGCGGCTATGGGACACCGGTGGGGATGGCATCTCCCTAGCAGGGAGTGAAGCTGAAAAAATGAGTAATATAACCACCCATCCNGCTTTGAGACAGCATTTGCATAATGCTAGAACTGTCGAGGGNAATCATAGCCTTATGGACTGGATTATCCTGGCCATGAGAGAGGCTTGGCCNAATGAAGGGGACTTCCCGGGTCAGACCCCGGCATGGCGATCTCTGGAGGAAGCGCAGAGCGATTTANGGGAGTTGGGCATGCGNCAGGCCATCTATGCCCAGCAATTTGNAGGACCAGATAAGGCTGTGTTTACCGTGGGCATGAAAAATAAGTTGTTACAAAGTGCCCCCCGGGAGTGGCATGGCCCTCTCATATCCCTCTTGAGCCCCCTTGTGGGACAAGATGTATATGACGTGGGGGAAGCCATNNCTGATCTTGGAGAAACTGAAAAGGGAAGAGATAAGGTCCGACTGGTTACTAAAGGAAAAGGGAAAAAGGGAGAAACCAAGTCTGCTGGACTGAAAAAGGGAGGTCAAAAAGGCCCAGTTAGGATTACCAGGAAACAAATGTGGTATGATCTGATCTCAGCTGGGGTAGACAAAGAGAAAATAGACCGGCAACCCAATGCCATATTAGTGGGCCTTTGGAAAGACCTGACTCCCGATCAACAGTTTAGACCCCTTCCTAGTGCCCCACCAGAAGAGGGAGAAGGAGAAGAAGAAACTCCGCGTAAAAGAACCCCAATCTCTGCGTTCCAGGGGTGGACTCCTCCCCAGCCTAGAGACTAGGGGTGGGGCCAAGGTTGCCTCCGTGTANGAGCAATAGGGGGCGACCAGAGGCCCCATGTGGAGCTCACAATCTATTGGAGTCCAAAAAACAAGCAGAGAACCTTAGCTTTAGTGGGCACAGGTGCAGAATGCACCTTAATTCATGGAAATCCAGAGAGACACCCTGGTAAGTGGGCAGCTATAGATGGTTACGGGGGGCGAACAATCCGAGTGAAACAAACTCCTCTCCTCCTTGGTNTTGGGCGGAGTCCCCCCGCTTACTNTACTGTCTTTATCTCACCTATTCCAGAAAACATCTTGGGTATGGATGTCCTTTTAGGATGCACTTTACAAACATCTGTGGGGGAATTCCACCTATGAGTTTGGGCGGTAAAGGCTATTTTAAGAGGGGAAGCAAAATGGGAACCTGTACACCTCCCTCCCCCACGGCGTATTGTTAATGTGAAACAATACCATCTTCCTGGTGGGATAGAAGAAATCACAGCCACCATACAGGAACTGGCCAAAGTTAATATTATNCGGCCAGCCCAGAGTCCCTTCAACAGTCCTGTATGGCCGGTAAGAAAACCTGATGGCACCTGGCACATGACAGTAGACTACCGGGAACTAAATAAGGTGGTTCCCGAGATACACGCTGCTGTGCCTAATATAACTCAAGTGATAGAGCAAATAATACAAAATATAGGCACTTATCATGCTGTGTTAGATTTGGCTAATGCCTTCTTCAGCATCCCTTTACACCCTGACTCGCAGGACCAATTTGCTTTTACTTGGAATGGCCAACAATGGACATTCCAAGTGTTGCCCCAGGGNTATCTACANAGCCCCACTATTTGTCATGGAATGATTGCTAGAGATTTAACTTTATGTCCACTGCCACCTGCTGTTAAACAGTTTCATTACATTGATGATATTATGCTAACCTCTGAAGACTTGTCATTGCTACAGCAACACCTTGATGCATTGTGCACCCTTCTCCAATCCAGAGGATGGGCCATCAACCCGCAAAAGATACAAGGCCCAGGACCGGCTGTAAAGTTCCTAGGGGTCACTTGGTCGGGTAAGACACGCCTTATCCCAGGCATAGTCATTGACAAAATACAACAATTTTCCATGCCTAAAACAGTTAAACAGTTACAAAGTTTCCTAGGTCTTTTGGGATATTGGCGGGCTTTTATTCCACATTTAGCTCAATGTTTGCGTCCCCTATACCGACTAGTAAAGAAGGGATCTAGTTGGTGCTGGGATAAAGAACAAGAGGAAGCATTTGAGAAGGCTAAANTATTAGTGGCTCGGGCACAAGCCTTAGGTTCCCCCCTTCCCGGGNTACCGNTNTCTTTGGATGTGACCATAAGCCCTGAGGGGACCAGCTGGGCCCTCGGGCAAGTCCAGCATGGGAAAGCGGTTCCCCTAGGATTCTGGTCACANCTATGGAAGGGCGCTGAAACCCGCTATTCCCCGATTGAACAACGGGTCCTGGGGGTATANAAGGCCTTGCGGCAAGTTGAACCCGTAACTGCCGCTTTNCCGGTAACAGNGAAAACGGGTCTCCCTATNAAGGGCTGGANAGAAGGGTTGTTTNCCAGGCCTGNNTCAGCTATTGCCCAGGCCTCCACTTTACAAAANTGGCATGCANACCTGCAACAACGTAGCGCCCTCTCCACGAGTCCCTTGGGAGATGAACTGCATGCTNTCTTAGGGCCAGTACACTATGAGACCAGTGCTGCCCCTATTGTGGAGCCCCCACGGGGGATGCCTCCGATGNTACACGAAGGCACGGCCCCCATTCCTGAAAACGCTTGGTACTCGGATGGGTNNAGCCGAGGTAACCCTTGTGTATGGACGGCAGTAGCTGTACAACCGCAGACAGATACTATCTGGTTTGAGATGGGAATGCAGCAAAGCAGTCAATGGGCAGAACTCCGAGCTGCATGGTTGGTTTGTACCCATGAGCCATGGCCTATAGTTCTCTGTACAGATAGTTGGGCAGTATTTAAGGGTCTTACAACTTGGCTTGCCCAGTGGGCCCGGGATGACTGGCATGTACTTTAAAAACCCTTATGGGGAGCTGCCATGTGGAAAGACATTTGGGAAAGGCTACAAGAACCCACTGCGAGCCTAATTGTGTATCATGTTTCAGCACACTGGTCAGATTCACCTCCCGGTAACATGGAGGCTGATACCCTAGCAAAAATTAGAACACTGGCTCCCTCGCAATCNTCTGAGCTAGCTGATTGGGTACATAAACACAGTGGGCATCGCAGTGCACGAGTGGGCTGGCAAATAGCAAAGGGAGCAGGATTGCCCCTCCGCTATGCAGATTTAGCGGCGGCAGTAACAAACTGCTTAGTTTGCTCCCGCCTGCGCCCCCGCCGCATCCCACATACACCTGGACACATACATAAGACAGCCGCCCCTGTGAGAGACTGGCAGATAGACTACATCGGACCCCTGCCAGTAAGCTTGGGACAAAAGTATGCACTAACATGTGTAGACACTGCCACGGGATTGTTGCAGGCCTTCCCTTGCAAAAGGGCAAACCAAACAGCCACCATTAAGGGCTTGGAGCAACTCAGTGTCATGTATGGATACCCTCGATGCATTGATAGCGACCGAGGCACGCATTTCACTGGACATGATGTCCAAGATTGGGCACATGAAAAGGATATAGACCGGAGATTTCACTTGCCATATAATCCCCAAGCGGCAGGGTTGATTGAAAGGAAAAATGGCATTTTGAAGGCACAACTGCGAGCACTTTCACAATCCAATACCTTGCATGGGTGGGCGAAGGTTTTGCCTCAAGCCATTAGAAACCTTAATTCGGTTGAGACAAATACGGGGCTGGCACCATACCAATGACTCGGGACCACCGCAGAGGAGGGTCCATTAACCATAGTTGTAAAGAAAGTCCGACCAGACGCATTTCTACCGGAGCTGATAAAAGGCCAATGGCAAATGTTATTTGGGACTCCCCAAGACCTTGAGCCAGGGGAGGGGACACTTGAATGGGGGTTGGACTGGCAACTTCCCCCAGGTTGGATAGGGTATTTCTTGCCAGAGAGCGAGGAATTCCCCGGCCAACTAAAGTGGTCTCCGTTGATCCTGTTGGAGTCTGGGCCAAAACGCTCCACATACCAATACACTGGAACACGGCCCCTTTTAAAGGGCACTCTGGTCGGCCGTTTGACATGGTCCTTTGCTGCCCCTGTAACCTTNACAGATAATACCGGCACCTTCGCCCTTAGGCAACATGTTTGGTATGCACCCCCAGCCCATAATCCTTNGGCTGCCTATGTCCTAACTAACAGAGATGAAGCTACCACAGTCATTTTGCTTGATGGGGAAGAACTGCCCCGCCAAGTACCTACTAAACACTTGTATTTCCGCCCATAGTCTTCTGTTCCTGTTGCTGCTCTGCCCTACAAAAGCCTCTTGGTTTGGATCCTGGGGAACATGGTGGCAAAAGGCATTATTAATTCTGTGTCTTATACTTGGCATAGGTATCATTGCCTGTTGTTGTCTGTATTGTTGCTGCGGCCTCTGCTTACAAGTAGAAAACAAATTGATGCAACGTGTCACCCACGCCATGAAATGACTGCAGCACCCCTCNCCTCTAAAGGCTCAGGGACCATCGCGGAAGAGGNGGGCGCGTGAGATTGTAAGAGCCGGATTAGAGGGGTGGAG
|
TF motifs of the concenus sequence
Use FIMO to detect transcription factor motifs in the concenus sequence of the TE family.
TE_family | TFBS | Start | End | Strand | Score | Matched sequence |
---|---|---|---|---|---|---|
ERVL47 | ZNF16 | 1729 | 1749 | + | 19.47 | AGTGGGCAGCTATAGATGGTT |
ERVL47 | AS2 | 3917 | 3934 | - | 19.00 | CGGCGGGGGCGCAGGCGG |
ERVL47 | Wt1 | 1977 | 1986 | + | 18.54 | CCTCCCCCAC |
ERVL47 | HLH4C | 2923 | 2936 | + | 18.16 | GGACCAGCTGGGCC |
ERVL47 | HLH4C | 2923 | 2936 | - | 18.08 | GGCCCAGCTGGTCC |
ERVL47 | ZBTB24 | 3031 | 3040 | - | 17.49 | CCCAGGACCC |
ERVL47 | DOF3.2 | 1256 | 1271 | - | 17.44 | TCCCTTTTTCCCTTTT |
ERVL47 | sens | 2035 | 2044 | + | 17.41 | AAATCACAGC |
ERVL47 | Dif | 67 | 76 | - | 17.39 | GGGATTTCCC |
ERVL47 | GT-4 | 4450 | 4461 | - | 17.33 | CAACTATGGTTA |
TFBS enrichment in GRCh38
Use Fisher's exact test to perform enrichment analysis of transcription factor binding sites in the TE family of GRCh38.