Arthur2

Basic information Differential Expression Stage analysis Survival analysis Correlation analysis

DF ID DF0001276
TE superfamily Tip100
TE class DNA
Species Mammalia
Length 3700
Kimura value 32.91
Tau index 1.0000
Description Repetitive element conserved in all mammals.
Comment True termini are as yet undefined. ORF from pos 504-3239 encodes a transposases closest to that of hAT-82_HMa in Hydra in the Arthur (Dent/Riggs) sub group of Tip100/Hitchhiker elements.
Sequence
CTTTCTTTTNCAGAGCATTAAAAAATCATAAAGCAGAAGNCTTGCCACTAAGTAGTGGGTGGTGTGNTGGGTGGGGAAGGGGCCATTCTGTCGAAACANAGTGAAGAAGAAAACCANNTAGCGGACAATGTGCTAAGAAGAAGACGGCATTGTGTTTAGAATAAAGACTGGGAAAAGCAGTTCAGTTGGGTGACACCCGATCGAGACAGTCCTACTAATGGAAAGTGCAAACACTGTTTGGGGACATTTACTGTGAAATGGGATGGGGTAAAGGCACTGAAGGTGCATGAAAAATCAGCCTTGCGTAGGCAGAAAACACAAACTTTAGCAGAAAATAAATTGCTGGAAAAGTTTTTATTGAAGAAGAGTAGTTCTGAGTGTGTGATAGCTTTAGTTAAAGCTTACGTAGTTAATCATTCCGTATTTGAGATTCGGGTTTTTTTTCCCCTTGATGTAATTAGCCATTAGGCTTACTAAAGTACATTTTTCATGTCAAGCAGACAATGGCTTCAAAGAAGAGAAAGGGCGGGGCTGCAAGGGAGCGAGAAAAGAAGGCCAGGTTGATGCNTGAAGAGGCAAAACAGTGTAGGAAAATATCAACATTTCTTGCACAATCANTCACTGAAAATGTGTCAGCAAGCACGAGGCCATCTTTAACTGGCAATAATACAACTGAAATAGATCATTTAAGTAAGGGAGGCAGTATTACTGAGNCCAACATGCCTATCGACACTTGTTTGACTGTCTCAGAAGAACAAAATACCGAGAGGCAAGAGCTTAAAGTGAATGAAAATNTATCCAATTCACAATCAAATGAAAATGCTATGGGCCCAGAGCCAGAGAATATTCTTGNATTGGACTTTTTTCAGCGCCCTAAATTTAACCAGCTGGAATTCTTTTTTAAGTACCATCCTATTCAACCTTTTGAAATGAAGGATTTGCCTTTTAATGGGAAATCTGCATTTCATCGTAAAGATGGAACACAACGTCCTTGGCTTAGTTATTCTCCAGAGAAGCAGGCTCTTTTTTGTACTGTATGCTTAGCATATAGTAAAGACACAGATTCCAGCAAATTTATTTCAGGCATGANAGACTGGAGGCATACATACGTACGGATAGAGGAGCATGAAAAATGTAACCTCCACTTCCAGNGCGCTGAATGCCATATAATGAAAACTCTCGGAGGTGACATCACTAAACTCCTGTTTCACAACCAGGAGTCCGCGAGGATAGAACAAGTCAAGAGAAAAAGAGCAGTTTTAGAACGAGTGATTGATGTAATTAAAGTTATTGGTAAAGAAGGGCTCAGTTATCGAGGGGCAAATGAGTCAGCAGCAAGTTTAAGCAATGAAACAGCGCGCCATGGGGTGTTTCTCGAATTCTTGCTTATTTTGGGAAAGTATGATGCCCTACTCAAAGAGCACCTTGAATCTGTAATAGCTAGAGCAAATCGAAAAAGTAGTGGAGGAAGAAGTGCTCATATAACTCTCATTTCTAAAACAACAGTGAACTATGTTATTGACAGTATCGGCAGCTTAATTAAGAAGTCTGTTAGTGATGATGTTAAGAAAGCTGGCACATTTTCAATTCAGATTGACACTACTCAAGACATAGGCATAACTGATGTTTGTTCAGTAATACTAAGATATGTCACAGAATCTGTGAGAGAACGTCTGATATCTATAGTAAGCTTAAAGTCAAGTACCGGGGAAAATATGTTCCAGACTGTTGCAGATGTTCTCAGATCAAATNATATTTCTNTGAAAAATTGTGTAGGATGTTCAACTGATGGAGCATCNAACATGAAAGGACAGTTCAATGGGTTTGCATCNTGGTTAAAAAAAGAATCATCTAGCCAAGTTTATGTGTGGTGTTATTCACATGTTTTGAATCTGGTAATAGTAGATGTAACGGGAGTGTCAGAAGAGGCTACTTCACTATTTGGACTTTTAAATTCCTGTGCTGCATTTTTAAGAGAGTCACATAAAAGAATGGATATTTGGCGTGAACGCTGCAGGGACCTGTCGCACTCTCTCACTGGAGAAACAAGGTGGTGGTCTAAAGAGGTAGCTTTGAGAAAAGCGNTTGGGGTTTTTAAAGAGCCATCTAATTCATTATTTGTGACAATAATNGAAACTTCGAGTAGAATATGTTCCCACACAGAGGATTTTGGTTTAGATACCAGATACAAGGCTAGGACACTAAAACAGTCACTGTGCAGGTTTGAAACCATACTTACTGCTCAGTTGTTTCTGAAAANTTTTAGATCAACTACACCACTTTCAAAGTATCTGCAGACAAGTGGCATGGATATATTACAGGCCCAGAGAATGGTTTCAGAAACAATCAAAACATTACATGCAGAGTCAAGAAATTTTAGAGATGTTCTTGAGGGAGCACAAGCCTTCGTTTCATGGGCAAATNAAGAACTCAGTGCCAAGGAAATTGATGTTATGCTAGAAAGTTCCCTNCCTGAAGTACGTACGCGACTTAAGGGGAAAATGGATGGAGAAAATATAAGTGACCAACCTATGACATCAAATGAAAAATTCAGAATAAATGTCCATAATGTTGTAATGGACACAGTTGTTCAACACCTCAAAGATCGATTTAAAACGCATTCNCATCTTGCTGCAGATATCGCCTATCTAGATCCGAGAAACTTTAGTCACATAAACTCTAATATGCCTGATTTAGCTTTAAGAAACCTTTGTGACCTAATAAACAAACAAGGTCTGGTGGGAACCAAAGTGGAAATTCAGGATCTCAAAGACGAGCTGAGAGATTTTGCTCAGAAATGGGACAGATTGAAACAATCTCTACCAAAGGAGTATGAGACAGAAAAGGAGATTGANTCAGAAGAGGAAGAAGGCCAGTGTCATACTGAAGAGCCACAAATGCCACAAATTGAAAACAAAATGCGCAATACTTGTTATAATTGTTCAGTGTGCTGCTATAACGTTTTAAAACATTACAAACTTTATAGCAGTGCTTATGACAATATTTACCTGGCTTATAAACTGATTTTGACATTGTCTTGTACTCAGGTTGCATGTGAAAGAAGTTTTTCTACCTTAAAGAATATCAGAACCAGGCTCGGGAATAGACTCACTGAAGAACACTTAGAGAGTTTCATGCTTATGAGCATAAATTGTGATATTTTACTACGCCTAAACTATGATGAAATCATTGAAGGCGTCATAGCAAGAAGCAAAACCCTAGGGAGGCTATTAACAATTTAGCATTTCCAGTTCATGTGGGAATGTTAATTCCACTTACCGAAGAGAAGTACAGAAATGGTGTTCATTATGATTAAGACCTTGTAACTGTAGAAATGTAGAAGTGTGAAATTTATAATATTATAGACATGTTTAGTATAAAATGCAAATATCGCAGCAGAGTGGGTTGGGCATCCGGGTCTCAGCTGGGTTTCCCTTGAACCTATTACCTTACAAAGTTAAACACAGGTATAGGTCTTTCACAAAGGCAGTATGTTATTCTAAAGTAGCATCTTAGATCATCTTACTAGGGTTACGATGGCCACTTCAACTGGCTAACAGCTGTGCTAGCCTCACAAATAACCAAAAAGTATATTNATTTTTGGCAGCGGACTAGGGAAGTTGTGTGTGTGTTGGTTTAACAGTAAAATAGCATTTTCAAATTTTAATAAATTATTGGCATGTTATCTGTCATATACATG



TF motifs of the concenus sequence

Use FIMO to detect transcription factor motifs in the concenus sequence of the TE family.

TE_family TFBS Start End Strand Score Matched sequence
Arthur2 cg 3622 3632 - 19.33 ACACACACACA
Arthur2 TSO1 3656 3670 - 17.92 TTATTAAAATTTGAA
Arthur2 MAF::NFE2 1324 1334 + 17.74 ATGAGTCAGCA
Arthur2 SOL1 3655 3669 - 17.72 TATTAAAATTTGAAA
Arthur2 MAFG::NFE2L1 1324 1334 + 17.52 ATGAGTCAGCA
Arthur2 Nfe2l2 1324 1334 + 17.38 ATGAGTCAGCA
Arthur2 TCX2 3655 3669 - 17.37 TATTAAAATTTGAAA
Arthur2 Bach1::Mafk 1323 1334 + 16.96 AATGAGTCAGCA
Arthur2 PHL11 841 850 + 16.78 AGAATATTCT
Arthur2 PHL11 841 850 - 16.78 AGAATATTCT


TFBS enrichment in GRCh38

Use Fisher's exact test to perform enrichment analysis of transcription factor binding sites in the TE family of GRCh38.




GTEx

The promoter activity across 46 body sites from The Genotype-Tissue Expression (GTEx) project.




TCGA

The promoter activity across 33 cancer types from The Cancer Genome Atlas (TCGA).