Arthur1

Basic information Differential Expression Stage analysis Survival analysis Correlation analysis

DF ID DF0000069
TE superfamily Tip100
TE class DNA
Species Eutheria
Length 3947
Kimura value 24.05
Tau index 1.0000
Description hAT-Tip100 DNA transposon, Arthur1 subfamily
Comment Arthur1 has 8 bp TSDs and 11 bp TIRs. A complete ORF (~1150-3711) encodes a transposase related to that of the Tip100-group of DNA transposons. Note that MER69C is an incomplete reconstruction of an autonomous hAT-family DNA transposon related to MER45 and Zaphod. Arthur1 is a more complete reconstruction.
Sequence
CAGAGGCGGATTTACCGTGAAGCTAATGAAGCTTAAGCTTCAGGGCCCCTCACTTGCACGGGCCCCTTCCAAGGCCCTGGGAGGGGCCCTAGCAATGTGTTCACATGGTCATATGTTTTTGTAAAATTTGCAAAAGTAAGATATTTTAACCGCAATCGGTTAAGACCGCTGTCTCTTTCCACTCCGACTTCCCCTCCGTCACACTTCCCCTCGTGTCGGGTGGCGTTGGAGTGGCCGCGGGCATTTTGGGGATCCGGCTAAGGGGAAGTTGAGTTGGGGATACATTTAGTTTGGGTTTAGTGGGATATATTTATGTGGTTCGCAGTCACTTCCGTGTATAGTTAAGTTATTGCTAGCCGTCCCGGTGTAGGAATGGCTTCCAGGAATACTCCTACCGCCCACTGTGCCGACTCACCCGGCGTCGTGACACGAAGGTGCAGGGCCAGAGGTCGTATCGCGATATGAACGTGTCCTACGGCGCCCGGCACCGGAAGTATGTGGGTAGTGGAGGAGAAACAAGGTTTGAAATGTACGGAGCCAGAAGCTAGTCTGTGGAAAATTCTTCCAATCATCAGACGTGTAAAATTGTAAGCGGAGGATTCGGTTCTCATCGACGCCTAGTCAAAACGGAAGTTCTCTCCTGTCAGGAATATACTCGATAATGCAGCATATACAATTATAAACGCACCATACATTTTTTTTCTTTTTTGATGGGAATCGCGCGAAATAGAATTTATCAGAATTCCTGTGTTTGTAGGGCACAAACCTGTAGCAGTACTACAAACAGCGAGTACGTCTGTGTGTGAAGTCGCATGTTTTATGCATCCCAACGTATCGGATCGCATCTTAGCGTATCTGACGCATCTTGTCCTGACGAAGTCTGGCGGTTCCAGACGAAACGGCACGCAAAATTGCCACCGACGCAGCGAAACGGCCTGAAGAAACAAGTGTTCCAGTGACAAAACTCCTACACATATTCATCAATAAGTCAATGTATGTAGTATAAGAGTAATTTGATTACACAATTATGTAGCTCAATACAACGTAGCGGCAATTTNAAAATGCATTTGAATTGCTCTTGTGTATGCCTTAATTTCAGATTAATTTTGTATATTTTTTCAGAATTATTGATCATACCAAAATTCAAATATGACGGGAAGGAAGTATCCTAGTGGAAGCCAAAAACGTAAACTGAAAGAAAGAAGACTGGCAGAAGCTTCAAAATGTCAAAAACTCTCTTATTATANCACAACGGAAACTAAAAACTCTGATGGTCAAAATCATGAAAATAAGGAAAAGAATCACGATAGCCAAGAAAATATAGATAATAAAAATGACAACAGAAGAGATAAAGAAGATTATGAGAATCAGGACACTCCCAAACAGGAAACAAAAAAGCAGAAGGAAGATGAAAATAATAGTAAAGAAAAACACAATTTGTATGGTCAACAGAGCACGCACGAAGGTCAGCAGCAGCTTCATTTCGACGAATTCGTTGACTTCGATAAGTACCGTGATCTTTCGTTATGNCCTGTTACAATGAGCAATAATTTCATTCAACACTGTTTAATNAAAGAAGTTTGTTTTTTCAAAATCGTGATCCCAATAATATTTACAGAGAATCAAACAGAACATACAATGGACAAAAGAGATGTTTCTCAAACAGATATTTTGAAAAAAATTAAAAATGGTCAAACTNTTATGAGATCCTGGCTAGCCTATTCCAAGACTAAAGGTTGCATATATTGTTTTGTTTGTAAACTGTTTTCAACAAGACAGACATTACTTANATCAGATGAGTTTTCAGACTGGATGAATATATTGAGAACCCTCAAAAGTCATGAAGACTCCACAGAGCACAAGAAGGCTATGTTTACTTGGATTACACGTAAATCAAATAAAAATGCACTAGACCAATGTCTTGAAGAACAGAGAAAGAACATTCAGTATTATTTTGAAGTGCTAAAAAGAGTAGTTGCGGTTATTAAATTTTTAAGCGAAAGAGGTTTGCCCTTCAGAGGTCACGATGAAAAGTGGCATTCTTCAAATAATGGAAATTTCATGGGAATCATCAAACTAATTGCTGAATTTGATCCATTTTTGCACGAGCACTTGGAAAAATNTCAAAATGAAAAAACAAATGTGACTTATTTATCCAAAACAGTTTATGAAGAATTAATCGAAATAATGGGAAAACGTGTACAAAATGAAGTGGTAAATCAAATAAATAAACCAGACACCAAATACTACTCCATTATTGTAGATTCTACACCGGATGTGACAAATACTGATCAGTTGGCGATTATTGTGCGCTACTGTTATAACGGAAAACCCTATGAGCGATTTTTAACTTTTTCGCCAACTGAAAACCATCTATCTGTGACTTTATTCAACAAAGTGAAACAAGTTCTGGACGATTGTAACCTGCCGTTGAACAACATTCGCGGTCAGTCATATGACAATACAGCTGTTATGAAAGGTGAGGACAAAGGGCTGCAAGCGCTCTTTAAAAACATTAATAAGTACGCAGAATATGTGCCGTGCGCAGCTCATTCCCTTAATCTCGTCGGAGAAAAAGCTGCCTCCACAGTACCTGAAGTTGTTGACTACTTCGGTATTTTGCAACAGTTATATGTTTTCTTTTCGGGTTCGTCTCGTAGATGGAGTATTTTAAACACACATGCCAACTTGGATTTTTCTTTAAAGAGTCTAAGTGTAACGAGATGGTCTGCCCATTATAAAGCTGTTCAAGCCTTGCAACATGGGTACAACGACATTTTAAGGACTCTAAAATATATTTTCGAAGACTCGGAAGAGAAACCCGAACACAAACGAGATGCAAAAAGTTTATTCAAAAAGTTGATAAAGCTCGAATACGCCATTTTAACTGCAATTTGGAAAGACGTTTTGGAACGTTTCAATAAAACAAGTGAGAAATTACAGACTCCTGATTTGGACGTATATGAAGGGTACCTTCTTTTGTCATCTTTAAATTTATTTATTAAAGAACTGAGAGAAAATTCAGATAAAAAATTAATGGAATATGAAACAAAAGCGATAAAGATGAGCGATGAAATCAATAGAAATTATTCTGACATCGAGAAGCAAATTGTAACAAAAAAATTTTCAGATCATACCAAAAGCAGTAATTCATTAAGAGGAAGAGATAAATTTCGAATAGAAGTAATGAATAGGCTCTTGGATTGTTTGATAATCCAATTAATGAAGAGGAGTGAATCATATGAACACATTGGAAAAAGATTTAAATTTCTTGCTGATTTGACACGAAATACTGACATCGACGAAGATAACATAAAACTCATAATACGCCACTACAACGAGGACATCGACGACAAACTGGTTAATGAGTGTCGTCAATTCAAAGAGTATTTAAGATTAGTCACTGCACAAGAAAACTTGAAATGCCCTGAAATCTTACAGCTCATATATGAAAGAAACTTGATAGAGGTTTTCCCAAATTTGACAACAATCCTAAAAATTTACATGACATTACCAATAACGAGTTGTGAAGCTGAAAGAAACTTTTCTAAACTATCAATAATAAAAAACAAATTTCGATCAACCATGCTAGAGGAAAGACTGAATTATCTTTCTATTCTCTCTATAGAAAATGATATTACAAAATCGTTGTCATATGAAGAGGCGATCAAAGAGTATGCAGCCAAAAAATGTAGGAAAAAAGTATTATAGAGGTGTGTCAGGCAGTTAATTAATAAAAATATTATGTTATTTTTCTGGATTTTGTGATGTTTGTGGTATTTGTCAGCTTTTTAAAATTTGTAATTTGTTGTGATTTCTTTTCTCATTCTAAATAAATATTCACTTTCGTACCTAATTTTGTATTCGTAATTTTGTATTCTTTTTCTTAAAGAGGGCCCCCCAAATTGTATAAGCTTCAGGCCCCACAAAACCTGGATCCGCCCCTG



TF motifs of the concenus sequence

Use FIMO to detect transcription factor motifs in the concenus sequence of the TE family.

TE_family TFBS Start End Strand Score Matched sequence
Arthur1 DOF5.8 690 708 + 16.50 ATACATTTTTTTTCTTTTT
Arthur1 TCX6 3790 3804 + 16.40 TTTTAAAATTTGTAA
Arthur1 RVE7 139 147 - 16.38 AAAATATCT
Arthur1 RVE7 1665 1673 - 16.38 AAAATATCT
Arthur1 ERF 487 495 + 16.29 ACCGGAAGT
Arthur1 WRKY55 1494 1502 + 16.27 CGTTGACTT
Arthur1 RVE6 139 147 + 16.23 AGATATTTT
Arthur1 RVE6 1665 1673 + 16.23 AGATATTTT
Arthur1 RVE5 139 147 + 16.21 AGATATTTT
Arthur1 RVE5 1665 1673 + 16.21 AGATATTTT


TFBS enrichment in GRCh38

Use Fisher's exact test to perform enrichment analysis of transcription factor binding sites in the TE family of GRCh38.




GTEx

The promoter activity across 46 body sites from The Genotype-Tissue Expression (GTEx) project.




TCGA

The promoter activity across 33 cancer types from The Cancer Genome Atlas (TCGA).