Arthur1
DF ID | DF0000069 |
---|---|
TE superfamily | Tip100 |
TE class | DNA |
Species | Eutheria |
Length | 3947 |
Kimura value | 24.05 |
Tau index | 1.0000 |
Description | hAT-Tip100 DNA transposon, Arthur1 subfamily |
Comment | Arthur1 has 8 bp TSDs and 11 bp TIRs. A complete ORF (~1150-3711) encodes a transposase related to that of the Tip100-group of DNA transposons. Note that MER69C is an incomplete reconstruction of an autonomous hAT-family DNA transposon related to MER45 and Zaphod. Arthur1 is a more complete reconstruction. |
Sequence |
CAGAGGCGGATTTACCGTGAAGCTAATGAAGCTTAAGCTTCAGGGCCCCTCACTTGCACGGGCCCCTTCCAAGGCCCTGGGAGGGGCCCTAGCAATGTGTTCACATGGTCATATGTTTTTGTAAAATTTGCAAAAGTAAGATATTTTAACCGCAATCGGTTAAGACCGCTGTCTCTTTCCACTCCGACTTCCCCTCCGTCACACTTCCCCTCGTGTCGGGTGGCGTTGGAGTGGCCGCGGGCATTTTGGGGATCCGGCTAAGGGGAAGTTGAGTTGGGGATACATTTAGTTTGGGTTTAGTGGGATATATTTATGTGGTTCGCAGTCACTTCCGTGTATAGTTAAGTTATTGCTAGCCGTCCCGGTGTAGGAATGGCTTCCAGGAATACTCCTACCGCCCACTGTGCCGACTCACCCGGCGTCGTGACACGAAGGTGCAGGGCCAGAGGTCGTATCGCGATATGAACGTGTCCTACGGCGCCCGGCACCGGAAGTATGTGGGTAGTGGAGGAGAAACAAGGTTTGAAATGTACGGAGCCAGAAGCTAGTCTGTGGAAAATTCTTCCAATCATCAGACGTGTAAAATTGTAAGCGGAGGATTCGGTTCTCATCGACGCCTAGTCAAAACGGAAGTTCTCTCCTGTCAGGAATATACTCGATAATGCAGCATATACAATTATAAACGCACCATACATTTTTTTTCTTTTTTGATGGGAATCGCGCGAAATAGAATTTATCAGAATTCCTGTGTTTGTAGGGCACAAACCTGTAGCAGTACTACAAACAGCGAGTACGTCTGTGTGTGAAGTCGCATGTTTTATGCATCCCAACGTATCGGATCGCATCTTAGCGTATCTGACGCATCTTGTCCTGACGAAGTCTGGCGGTTCCAGACGAAACGGCACGCAAAATTGCCACCGACGCAGCGAAACGGCCTGAAGAAACAAGTGTTCCAGTGACAAAACTCCTACACATATTCATCAATAAGTCAATGTATGTAGTATAAGAGTAATTTGATTACACAATTATGTAGCTCAATACAACGTAGCGGCAATTTNAAAATGCATTTGAATTGCTCTTGTGTATGCCTTAATTTCAGATTAATTTTGTATATTTTTTCAGAATTATTGATCATACCAAAATTCAAATATGACGGGAAGGAAGTATCCTAGTGGAAGCCAAAAACGTAAACTGAAAGAAAGAAGACTGGCAGAAGCTTCAAAATGTCAAAAACTCTCTTATTATANCACAACGGAAACTAAAAACTCTGATGGTCAAAATCATGAAAATAAGGAAAAGAATCACGATAGCCAAGAAAATATAGATAATAAAAATGACAACAGAAGAGATAAAGAAGATTATGAGAATCAGGACACTCCCAAACAGGAAACAAAAAAGCAGAAGGAAGATGAAAATAATAGTAAAGAAAAACACAATTTGTATGGTCAACAGAGCACGCACGAAGGTCAGCAGCAGCTTCATTTCGACGAATTCGTTGACTTCGATAAGTACCGTGATCTTTCGTTATGNCCTGTTACAATGAGCAATAATTTCATTCAACACTGTTTAATNAAAGAAGTTTGTTTTTTCAAAATCGTGATCCCAATAATATTTACAGAGAATCAAACAGAACATACAATGGACAAAAGAGATGTTTCTCAAACAGATATTTTGAAAAAAATTAAAAATGGTCAAACTNTTATGAGATCCTGGCTAGCCTATTCCAAGACTAAAGGTTGCATATATTGTTTTGTTTGTAAACTGTTTTCAACAAGACAGACATTACTTANATCAGATGAGTTTTCAGACTGGATGAATATATTGAGAACCCTCAAAAGTCATGAAGACTCCACAGAGCACAAGAAGGCTATGTTTACTTGGATTACACGTAAATCAAATAAAAATGCACTAGACCAATGTCTTGAAGAACAGAGAAAGAACATTCAGTATTATTTTGAAGTGCTAAAAAGAGTAGTTGCGGTTATTAAATTTTTAAGCGA
AAGAGGTTTGCCCTTCAGAGGTCACGATGAAAAGTGGCATTCTTCAAATAATGGAAATTTCATGGGAATCATCAAACTAATTGCTGAATTTGATCCATTTTTGCACGAGCACTTGGAAAAATNTCAAAATGAAAAAACAAATGTGACTTATTTATCCAAAACAGTTTATGAAGAATTAATCGAAATAATGGGAAAACGTGTACAAAATGAAGTGGTAAATCAAATAAATAAACCAGACACCAAATACTACTCCATTATTGTAGATTCTACACCGGATGTGACAAATACTGATCAGTTGGCGATTATTGTGCGCTACTGTTATAACGGAAAACCCTATGAGCGATTTTTAACTTTTTCGCCAACTGAAAACCATCTATCTGTGACTTTATTCAACAAAGTGAAACAAGTTCTGGACGATTGTAACCTGCCGTTGAACAACATTCGCGGTCAGTCATATGACAATACAGCTGTTATGAAAGGTGAGGACAAAGGGCTGCAAGCGCTCTTTAAAAACATTAATAAGTACGCAGAATATGTGCCGTGCGCAGCTCATTCCCTTAATCTCGTCGGAGAAAAAGCTGCCTCCACAGTACCTGAAGTTGTTGACTACTTCGGTATTTTGCAACAGTTATATGTTTTCTTTTCGGGTTCGTCTCGTAGATGGAGTATTTTAAACACACATGCCAACTTGGATTTTTCTTTAAAGAGTCTAAGTGTAACGAGATGGTCTGCCCATTATAAAGCTGTTCAAGCCTTGCAACATGGGTACAACGACATTTTAAGGACTCTAAAATATATTTTCGAAGACTCGGAAGAGAAACCCGAACACAAACGAGATGCAAAAAGTTTATTCAAAAAGTTGATAAAGCTCGAATACGCCATTTTAACTGCAATTTGGAAAGACGTTTTGGAACGTTTCAATAAAACAAGTGAGAAATTACAGACTCCTGATTTGGACGTATATGAAGGGTACCTTCTTTTGTCATCTTTAAATTTATTTATTAAAGAACTGAGAGAAAATTCAGATAAAAAATTAATGGAATATGAAACAAAAGCGATAAAGATGAGCGATGAAATCAATAGAAATTATTCTGACATCGAGAAGCAAATTGTAACAAAAAAATTTTCAGATCATACCAAAAGCAGTAATTCATTAAGAGGAAGAGATAAATTTCGAATAGAAGTAATGAATAGGCTCTTGGATTGTTTGATAATCCAATTAATGAAGAGGAGTGAATCATATGAACACATTGGAAAAAGATTTAAATTTCTTGCTGATTTGACACGAAATACTGACATCGACGAAGATAACATAAAACTCATAATACGCCACTACAACGAGGACATCGACGACAAACTGGTTAATGAGTGTCGTCAATTCAAAGAGTATTTAAGATTAGTCACTGCACAAGAAAACTTGAAATGCCCTGAAATCTTACAGCTCATATATGAAAGAAACTTGATAGAGGTTTTCCCAAATTTGACAACAATCCTAAAAATTTACATGACATTACCAATAACGAGTTGTGAAGCTGAAAGAAACTTTTCTAAACTATCAATAATAAAAAACAAATTTCGATCAACCATGCTAGAGGAAAGACTGAATTATCTTTCTATTCTCTCTATAGAAAATGATATTACAAAATCGTTGTCATATGAAGAGGCGATCAAAGAGTATGCAGCCAAAAAATGTAGGAAAAAAGTATTATAGAGGTGTGTCAGGCAGTTAATTAATAAAAATATTATGTTATTTTTCTGGATTTTGTGATGTTTGTGGTATTTGTCAGCTTTTTAAAATTTGTAATTTGTTGTGATTTCTTTTCTCATTCTAAATAAATATTCACTTTCGTACCTAATTTTGTATTCGTAATTTTGTATTCTTTTTCTTAAAGAGGGCCCCCCAAATTGTATAAGCTTCAGGCCCCACAAAACCTGGATCCGCCCCTG
|
TF motifs of the consensus sequence
Use FIMO to detect transcription factor motifs in the consensus sequence of the TE family.
TE_family | TFBS | Start | End | Strand | Score | Matched sequence |
---|---|---|---|---|---|---|
Arthur1 | TB1 | 3894 | 3902 | + | 17.38 | GGGCCCCCC |
Arthur1 | BRN2 | 1877 | 1892 | + | 17.19 | CTTGGATTACACGTAA |
Arthur1 | NAC031 | 1874 | 1892 | + | 17.10 | TTACTTGGATTACACGTAA |
Arthur1 | FEV | 487 | 495 | + | 17.06 | ACCGGAAGT |
Arthur1 | NAC058 | 1874 | 1892 | + | 17.05 | TTACTTGGATTACACGTAA |
Arthur1 | NAC087 | 1874 | 1892 | - | 17.03 | TTACGTGTAATCCAAGTAA |
Arthur1 | ELK1 | 487 | 495 | + | 17.02 | ACCGGAAGT |
Arthur1 | ETV5 | 487 | 495 | + | 17.00 | ACCGGAAGT |
Arthur1 | PHOX2B | 1009 | 1020 | - | 16.93 | TAATCAAATTAC |
Arthur1 | Prdm15 | 3927 | 3937 | + | 16.85 | CAAAACCTGGA |
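
Several hits in the table above share coordinates (for example the ETS-family hits at 487-495), and minus-strand hits are reported as the reverse complement of the consensus. The following is a minimal sketch of how the rows could be checked and grouped. It assumes 1-based inclusive coordinates, which is consistent with the lengths of the matched sequences; the helper names `reverse_complement`, `plus_strand_sequence`, and `group_overlapping` are illustrative and not part of FIMO or of this database.

```python
# Rows copied from the FIMO table above:
# (TFBS, start, end, strand, score, matched sequence)
HITS = [
    ("TB1", 3894, 3902, "+", 17.38, "GGGCCCCCC"),
    ("BRN2", 1877, 1892, "+", 17.19, "CTTGGATTACACGTAA"),
    ("NAC031", 1874, 1892, "+", 17.10, "TTACTTGGATTACACGTAA"),
    ("FEV", 487, 495, "+", 17.06, "ACCGGAAGT"),
    ("NAC058", 1874, 1892, "+", 17.05, "TTACTTGGATTACACGTAA"),
    ("NAC087", 1874, 1892, "-", 17.03, "TTACGTGTAATCCAAGTAA"),
    ("ELK1", 487, 495, "+", 17.02, "ACCGGAAGT"),
    ("ETV5", 487, 495, "+", 17.00, "ACCGGAAGT"),
    ("PHOX2B", 1009, 1020, "-", 16.93, "TAATCAAATTAC"),
    ("Prdm15", 3927, 3937, "+", 16.85, "CAAAACCTGGA"),
]

_COMPLEMENT = str.maketrans("ACGTN", "TGCAN")


def reverse_complement(seq):
    """Return the reverse complement of a DNA string (A, C, G, T, N)."""
    return seq.translate(_COMPLEMENT)[::-1]


def plus_strand_sequence(hit):
    """Return the matched sequence as it reads on the + strand of the consensus."""
    _name, _start, _end, strand, _score, matched = hit
    return matched if strand == "+" else reverse_complement(matched)


def group_overlapping(hits):
    """Cluster hits whose [start, end] intervals overlap (assumed 1-based, inclusive)."""
    clusters = []
    for hit in sorted(hits, key=lambda h: (h[1], h[2])):
        name, start, end = hit[0], hit[1], hit[2]
        if clusters and start <= clusters[-1]["end"]:
            # Overlaps the current cluster: extend it.
            clusters[-1]["end"] = max(clusters[-1]["end"], end)
            clusters[-1]["names"].append(name)
        else:
            clusters.append({"start": start, "end": end, "names": [name]})
    return clusters


clusters = group_overlapping(HITS)
for cluster in clusters:
    print(cluster["start"], cluster["end"], ",".join(cluster["names"]))
```

Grouping this way makes it easier to see that the ten rows correspond to only a handful of distinct sites on the consensus, with related factors matching the same stretch of sequence.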
TFBS enrichment in GRCh38
Use Fisher's exact test to test for enrichment of transcription factor binding sites in this TE family in the GRCh38 assembly.
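
As a rough illustration of the enrichment test described above, the sketch below computes a one-sided Fisher's exact p-value from a 2x2 table using only the standard library. The page does not state how the contingency table is built or whether the test is one- or two-sided, so the table layout, the one-sided alternative, the function name `fisher_exact_greater`, and the example counts are all assumptions made for illustration.

```python
from math import comb


def fisher_exact_greater(a, b, c, d):
    """One-sided Fisher's exact test p-value (alternative: enrichment).

    Assumed 2x2 layout (not specified by the source):

                          with TFBS   without TFBS
        TE family copies      a            b
        background            c            d

    Returns P(X >= a) for X hypergeometric with fixed margins.
    """
    n_total = a + b + c + d
    n_family = a + b          # row 1 total
    n_with_tfbs = a + c       # column 1 total
    upper = min(n_family, n_with_tfbs)
    # Sum the hypergeometric probabilities of tables at least as extreme as observed.
    numerator = sum(
        comb(n_with_tfbs, k) * comb(n_total - n_with_tfbs, n_family - k)
        for k in range(a, upper + 1)
    )
    return numerator / comb(n_total, n_family)


# Hypothetical counts, for illustration only (not taken from the database).
p_value = fisher_exact_greater(12, 88, 300, 9600)
print(f"one-sided p-value: {p_value:.3g}")
```

In practice an established routine such as `scipy.stats.fisher_exact` would normally be preferred, and p-values computed across many transcription factors would need multiple-testing correction.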