ERVL47
Basic information Differential Expression Stage analysis Survival analysis Correlation analysisDF ID | DF0000119 |
---|---|
TE superfamily | ERV3 |
TE class | LTR |
Species | Primates |
Length | 5246 |
Kimura value | 18.41 |
Tau index | 1.0000 |
Description | ERVL endogenous retrovirus, ERVL47 subfamily |
Comment | Associated with LTR47B2 LTRs. ORF coordinates are roughly: Gag (234-1559), Pol (1560-4955). |
Sequence |
AATGGCGAGCCAGCCAGGAGGAGCCNGGGAACCAAAGTATGGGCAAGGGGAAGGAAGCATCCGTGGGGGAAATCCCGGGGCGGCCCTCCTCATCTATGTGGGGTGGTGTTGCCCATTTGCTAGATGCCTGTGGACCACCGCGTGAGTACGGGGACGCCCCGAAAACGCCGGAGGGTCTAGAANGCTTGTTAGTAGAAAGCCGTATGACTCACTGTGGTAAGGAAGGTCGCNAGGCGGCGGCAACAGTGGGTTNGCCGCTGTTATGGNCTGTGCGAGTGGCCACAGAAGCGCAGGTAGCAGCTGAGGCGAGGGTAGGAAAGCTAGAGAAAGAATTACAGTTAGAAAGGGATATGAGAACTTCCACGTCTGTGTTAGCGTCTGGTTTGGTGGATAAGCTCGAAACTCAGGAGGCACAGTTGGAAACCGTAGCCTGCCGCTTTGTGCGGCTGGGAGGGAGGAAGCTGCGCCGGCCGAAAGTGCGGGCTGTCCTTGCCAAACCGGACTGGGATGCNAANACCTGGAATTCTTGGGAACCAACTAGTGAGTCAGATGANGACATAGAGGTTANCTCAGAGGAGGAGGATAATTATCCTCCCCTTTTGAGAGCGAGACCGCTCATGCAGCGGAAAGCAAAAGCCCAACACATGCAGCCGGGCAATAACGGACAGCCTTTTCAGGAAACCTTAACTGTCCGAGAGTACACCTCGGCAGGATTATTGGACATAGCAAAAACTTTTAAACAGCTGCCGAGAGAAAGCTTAGCCACATGGATGGTGCGGCTATGGGACACCGGTGGGGATGGCATCTCCCTAGCAGGGAGTGAAGCTGAAAAAATGAGTAATATAACCACCCATCCNGCTTTGAGACAGCATTTGCATAATGCTAGAACTGTCGAGGGNAATCATAGCCTTATGGACTGGATTATCCTGGCCATGAGAGAGGCTTGGCCNAATGAAGGGGACTTCCCGGGTCAGACCCCGGCATGGCGATCTCTGGAGGAAGCGCAGAGCGATTTANGGGAGTTGGGCATGCGNCAGGCCATCTATGCCCAGCAATTTGNAGGACCAGATAAGGCTGTGTTTACCGTGGGCATGAAAAATAAGTTGTTACAAAGTGCCCCCCGGGAGTGGCATGGCCCTCTCATATCCCTCTTGAGCCCCCTTGTGGGACAAGATGTATATGACGTGGGGGAAGCCATNNCTGATCTTGGAGAAACTGAAAAGGGAAGAGATAAGGTCCGACTGGTTACTAAAGGAAAAGGGAAAAAGGGAGAAACCAAGTCTGCTGGACTGAAAAAGGGAGGTCAAAAAGGCCCAGTTAGGATTACCAGGAAACAAATGTGGTATGATCTGATCTCAGCTGGGGTAGACAAAGAGAAAATAGACCGGCAACCCAATGCCATATTAGTGGGCCTTTGGAAAGACCTGACTCCCGATCAACAGTTTAGACCCCTTCCTAGTGCCCCACCAGAAGAGGGAGAAGGAGAAGAAGAAACTCCGCGTAAAAGAACCCCAATCTCTGCGTTCCAGGGGTGGACTCCTCCCCAGCCTAGAGACTAGGGGTGGGGCCAAGGTTGCCTCCGTGTANGAGCAATAGGGGGCGACCAGAGGCCCCATGTGGAGCTCACAATCTATTGGAGTCCAAAAAACAAGCAGAGAACCTTAGCTTTAGTGGGCACAGGTGCAGAATGCACCTTAATTCATGGAAATCCAGAGAGACACCCTGGTAAGTGGGCAGCTATAGATGGTTACGGGGGGCGAACAATCCGAGTGAAACAAACTCCTCTCCTCCTTGGTNTTGGGCGGAGTCCCCCCGCTTACTNTACTGTCTTTATCTCACCTATTCCAGAAAACATCTTGGGTATGGATGTCCTTTTAGGATGCACTTTACAAACATCTGTGGGGGAATTCCACCTATGAGTTTGGGCGGTAAAGGCTATTTTAAGAGGGGAAGCAAAATGGGAACCTGTACACCTCCCTCCCCCACGGCGTATTGTTAATGTGAAACAATACCATCTTCCTGGTGGGATAGAAGAAATCACAGCCACCATACAGGAACTGGCCAAAGTTAATATTATNCGGCCAGCCCAGAGTCCCTTCAACAGTCCTGTATGGCCGGTAAGAAAACCTGATGGCACCTGGCACATGACAGTAGACTACCGGGAACTAAATAAGGTGGTTCCCGAGATACACGCTGCTGTGCCTAATATAACTCAAGTGATAGAGCAAATAATACAAAATATAGGCACTTATCATGCTGTGTTAGATTTGGCTAATGCCTTCTTCAGCATCCCTTTACACCCTGACTCGCAGGACCAATTTGCTTTTACTTGGAATGGCCAACAATGGACATTCCAAGTGTTGCCCCAGGGNTATCTACANAGCCCCACTATTTGTCATGGAATGATTGCTAGAGATTTAACTTTATGTCCACTGCCACCTGCTGTTAAACAGTTTCATTACATTGATGATATTATGCTAACCTCTGAAGACTTGTCATTGCTACAGCAACACCTTGATGCATTGTGCACCCTTCTCCAATCCAGAGGATGGGCCATCAACCCGCAAAAGATACAAGGCCCAGGACCGGCTGTAAAGTTCCTAGGGGTCACTTGGTCGGGTAAGACACGCCTTATCCCAGGCATAGTCATTGACAAAATACAACAATTTTCCATGCCTAAAACAGTTAAACAGTTACAAAGTTTCCTAGGTCTTTTGGGATATTGGCGGGCTTTTATTCCACATTTAGCTCAATGTTTGCGTCCCCTATACCGACTAGTAAAGAAGGGATCTAGTTGGTGCTGGGATAAAGAACAAGAGGAAGCATTTGAGAAGGCTAAANTATTAGTGGCTCGGGCACAAGCCTTAGGTTCCCCCCTTCCCGGGNTACCGNTNTCTTTGGATGTGACCATAAGCCCTGAGGGGACCAGCTGGGCCCTCGGGCAAGTCCAGCATGGGAAAGCGGTTCCCCTAGGATTCTGGTCACANCTATGGAAGGGCGCTGAAACCCGCTATTCCCCGATTGAACAACGGGTCCTGGGGGTATANAAGGCCTTGCGGCAAGTTGAACCCGTAACTGCCGCTTTNCCGGTAACAGNGAAAACGGGTCTCCCTATNAAGGGCTGGANAGAAGGGTTGTTTNCCAGGCCTGNNTCAGCTATTGCCCAGGCCTCCACTTTACAAAANTGGCATGCANACCTGCAACAACGTAGCGCCCTCTCCACGAGTCCCTTGGGAGATGAACTGCATGCTNTCTTAGGGCCAGTACACTATGAGACCAGTGCTGCCCCTATTGTGGAGCCCCCACGGGGGATGCCTCCGATGNTACACGAAGGCACGGCCCCCATTCCTGAAAACGCTTGGTACTCGGATGGGTNNAGCCGAGGTAACCCTTGTGTATGGACGGCAGTAGCTGTACAACCGCAGACAGATACTATCTGGTTTGAGATGGGAATGCAGCAAAGCAGTCAATGGGCAGAACTCCGAGCTGCATGGTTGGTTTGTACCCATGAGCCATGGCCTATAGTTCTCTGTACAGATAGTTGGGCAGTATTTAAGGGTCTTACAACTTGGCTTGCCCAGTGGGCCCGGGATGACTGGCATGTACTTTAAAAACCCTTATGGGGAGCTGCCATGTGGAAAGACATTTGGGAAAGGCTACAAGAACCCACTGCGAGCCTAATTGTGTATCATGTTTCAGCACACTGGTCAGATTCACCTCCCGGTAACATGGAGGCTGATACCCTAGCAAAAATTAGAACACTGGCTCCCTCGCAATCNTCTGAGCTAGCTGATTGGGTACATAAACACAGTGGGCATCGCAGTGCACGAGTGGGCTGGCAAATAGCAAAGGGAGCAGGATTGCCCCTCCGCTATGCAGATTTAGCGGCGGCAGTAACAAACTGCTTAGTTTGCTCCCGCCTGCGCCCCCGCCGCATCCCACATACACCTGGACACATACATAAGACAGCCGCCCCTGTGAGAGACTGGCAGATAGACTACATCGGACCCCTGCCAGTAAGCTTGGGACAAAAGTATGCACTAACATGTGTAGACACTGCCACGGGATTGTTGCAGGCCTTCCCTTGCAAAAGGGCAAACCAAACAGCCACCATTAAGGGCTTGGAGCAACTCAGTGTCATGTATGGATACCCTCGATGCATTGATAGCGACCGAGGCACGCATTTCACTGGACATGATGTCCAAGATTGGGCACATGAAAAGGATATAGACCGGAGATTTCACTTGCCATATAATCCCCAAGCGGCAGGGTTGATTGAAAGGAAAAATGGCATTTTGAAGGCACAACTGCGAGCACTTTCACAATCCAATACCTTGCATGGGTGGGCGAAGGTTTTGCCTCAAGCCATTAGAAACCTTAATTCGGTTGAGACAAATACGGGGCTGGCACCATACCAATGACTCGGGACCACCGCAGAGGAGGGTCCATTAACCATAGTTGTAAAGAAAGTCCGACCAGACGCATTTCTACCGGAGCTGATAAAAGGCCAATGGCAAATGTTATTTGGGACTCCCCAAGACCTTGAGCCAGGGGAGGGGACACTTGAATGGGGGTTGGACTGGCAACTTCCCCCAGGTTGGATAGGGTATTTCTTGCCAGAGAGCGAGGAATTCCCCGGCCAACTAAAGTGGTCTCCGTTGATCCTGTTGGAGTCTGGGCCAAAACGCTCCACATACCAATACACTGGAACACGGCCCCTTTTAAAGGGCACTCTGGTCGGCCGTTTGACATGGTCCTTTGCTGCCCCTGTAACCTTNACAGATAATACCGGCACCTTCGCCCTTAGGCAACATGTTTGGTATGCACCCCCAGCCCATAATCCTTNGGCTGCCTATGTCCTAACTAACAGAGATGAAGCTACCACAGTCATTTTGCTTGATGGGGAAGAACTGCCCCGCCAAGTACCTACTAAACACTTGTATTTCCGCCCATAGTCTTCTGTTCCTGTTGCTGCTCTGCCCTACAAAAGCCTCTTGGTTTGGATCCTGGGGAACATGGTGGCAAAAGGCATTATTAATTCTGTGTCTTATACTTGGCATAGGTATCATTGCCTGTTGTTGTCTGTATTGTTGCTGCGGCCTCTGCTTACAAGTAGAAAACAAATTGATGCAACGTGTCACCCACGCCATGAAATGACTGCAGCACCCCTCNCCTCTAAAGGCTCAGGGACCATCGCGGAAGAGGNGGGCGCGTGAGATTGTAAGAGCCGGATTAGAGGGGTGGAG
|
TF motifs of the concenus sequence
Use FIMO to detect transcription factor motifs in the concenus sequence of the TE family.
TE_family | TFBS | Start | End | Strand | Score | Matched sequence |
---|---|---|---|---|---|---|
ERVL47 | DOF3.6 | 1252 | 1272 | - | 17.32 | CTCCCTTTTTCCCTTTTCCTT |
ERVL47 | Dif | 67 | 76 | + | 17.21 | GGGAAATCCC |
ERVL47 | Irf1 | 1211 | 1221 | + | 17.10 | AGAAACTGAAA |
ERVL47 | Zm00001d020267 | 233 | 242 | - | 17.04 | TGCCGCCGCC |
ERVL47 | CDF5 | 1252 | 1272 | - | 17.00 | CTCCCTTTTTCCCTTTTCCTT |
ERVL47 | HLH4C | 294 | 307 | - | 16.93 | GCCTCAGCTGCTAC |
ERVL47 | CG7928 | 1525 | 1536 | - | 16.89 | TCCACCCCTGGA |
ERVL47 | ZNF708 | 2196 | 2204 | + | 16.87 | GCTGTGCCT |
ERVL47 | Spz1 | 3735 | 3745 | - | 16.82 | AGGGTATCAGC |
ERVL47 | ZNF257 | 303 | 312 | + | 16.80 | GAGGCGAGGG |
TFBS enrichment in GRCh38
Use Fisher's exact test to perform enrichment analysis of transcription factor binding sites in the TE family of GRCh38.