Arthur2


DF ID DF0001276
TE superfamily Tip100
TE class DNA
Species Mammalia
Length 3700
Kimura value 32.91
Tau index 1.0000
Description Repetitive element conserved in all mammals.
Comment True termini are as yet undefined. The ORF at positions 504-3239 encodes a transposase closest to that of hAT-82_HMa in Hydra, in the Arthur (Dent/Riggs) subgroup of Tip100/Hitchhiker elements.
Sequence
CTTTCTTTTNCAGAGCATTAAAAAATCATAAAGCAGAAGNCTTGCCACTAAGTAGTGGGTGGTGTGNTGGGTGGGGAAGGGGCCATTCTGTCGAAACANAGTGAAGAAGAAAACCANNTAGCGGACAATGTGCTAAGAAGAAGACGGCATTGTGTTTAGAATAAAGACTGGGAAAAGCAGTTCAGTTGGGTGACACCCGATCGAGACAGTCCTACTAATGGAAAGTGCAAACACTGTTTGGGGACATTTACTGTGAAATGGGATGGGGTAAAGGCACTGAAGGTGCATGAAAAATCAGCCTTGCGTAGGCAGAAAACACAAACTTTAGCAGAAAATAAATTGCTGGAAAAGTTTTTATTGAAGAAGAGTAGTTCTGAGTGTGTGATAGCTTTAGTTAAAGCTTACGTAGTTAATCATTCCGTATTTGAGATTCGGGTTTTTTTTCCCCTTGATGTAATTAGCCATTAGGCTTACTAAAGTACATTTTTCATGTCAAGCAGACAATGGCTTCAAAGAAGAGAAAGGGCGGGGCTGCAAGGGAGCGAGAAAAGAAGGCCAGGTTGATGCNTGAAGAGGCAAAACAGTGTAGGAAAATATCAACATTTCTTGCACAATCANTCACTGAAAATGTGTCAGCAAGCACGAGGCCATCTTTAACTGGCAATAATACAACTGAAATAGATCATTTAAGTAAGGGAGGCAGTATTACTGAGNCCAACATGCCTATCGACACTTGTTTGACTGTCTCAGAAGAACAAAATACCGAGAGGCAAGAGCTTAAAGTGAATGAAAATNTATCCAATTCACAATCAAATGAAAATGCTATGGGCCCAGAGCCAGAGAATATTCTTGNATTGGACTTTTTTCAGCGCCCTAAATTTAACCAGCTGGAATTCTTTTTTAAGTACCATCCTATTCAACCTTTTGAAATGAAGGATTTGCCTTTTAATGGGAAATCTGCATTTCATCGTAAAGATGGAACACAACGTCCTTGGCTTAGTTATTCTCCAGAGAAGCAGGCTCTTTTTTGTACTGTATGCTTAGCATATAGTAAAGACACAGATTCCAGCAAATTTATTTCAGGCATGANAGACTGGAGGCATACATACGTACGGATAGAGGAGCATGAAAAATGTAACCTCCACTTCCAGNGCGCTGAATGCCATATAATGAAAACTCTCGGAGGTGACATCACTAAACTCCTGTTTCACAACCAGGAGTCCGCGAGGATAGAACAAGTCAAGAGAAAAAGAGCAGTTTTAGAACGAGTGATTGATGTAATTAAAGTTATTGGTAAAGAAGGGCTCAGTTATCGAGGGGCAAATGAGTCAGCAGCAAGTTTAAGCAATGAAACAGCGCGCCATGGGGTGTTTCTCGAATTCTTGCTTATTTTGGGAAAGTATGATGCCCTACTCAAAGAGCACCTTGAATCTGTAATAGCTAGAGCAAATCGAAAAAGTAGTGGAGGAAGAAGTGCTCATATAACTCTCATTTCTAAAACAACAGTGAACTATGTTATTGACAGTATCGGCAGCTTAATTAAGAAGTCTGTTAGTGATGATGTTAAGAAAGCTGGCACATTTTCAATTCAGATTGACACTACTCAAGACATAGGCATAACTGATGTTTGTTCAGTAATACTAAGATATGTCACAGAATCTGTGAGAGAACGTCTGATATCTATAGTAAGCTTAAAGTCAAGTACCGGGGAAAATATGTTCCAGACTGTTGCAGATGTTCTCAGATCAAATNATATTTCTNTGAAAAATTGTGTAGGATGTTCAACTGATGGAGCATCNAACATGAAAGGACAGTTCAATGGGTTTGCATCNTGGTTAAAAAAAGAATCATCTAGCCAAGTTTATGTGTGGTGTTATTCACATGTTTTGAATCTGGTAATAGTAGATGTAACGGGAGTGTCAGAAGAGGCTACTTCACTATTTGGACTTTTAAATTCCTGTGCTGCATTTTTAAGAGAGTCACATAAAAGAATGGATATT
TGGCGTGAACGCTGCAGGGACCTGTCGCACTCTCTCACTGGAGAAACAAGGTGGTGGTCTAAAGAGGTAGCTTTGAGAAAAGCGNTTGGGGTTTTTAAAGAGCCATCTAATTCATTATTTGTGACAATAATNGAAACTTCGAGTAGAATATGTTCCCACACAGAGGATTTTGGTTTAGATACCAGATACAAGGCTAGGACACTAAAACAGTCACTGTGCAGGTTTGAAACCATACTTACTGCTCAGTTGTTTCTGAAAANTTTTAGATCAACTACACCACTTTCAAAGTATCTGCAGACAAGTGGCATGGATATATTACAGGCCCAGAGAATGGTTTCAGAAACAATCAAAACATTACATGCAGAGTCAAGAAATTTTAGAGATGTTCTTGAGGGAGCACAAGCCTTCGTTTCATGGGCAAATNAAGAACTCAGTGCCAAGGAAATTGATGTTATGCTAGAAAGTTCCCTNCCTGAAGTACGTACGCGACTTAAGGGGAAAATGGATGGAGAAAATATAAGTGACCAACCTATGACATCAAATGAAAAATTCAGAATAAATGTCCATAATGTTGTAATGGACACAGTTGTTCAACACCTCAAAGATCGATTTAAAACGCATTCNCATCTTGCTGCAGATATCGCCTATCTAGATCCGAGAAACTTTAGTCACATAAACTCTAATATGCCTGATTTAGCTTTAAGAAACCTTTGTGACCTAATAAACAAACAAGGTCTGGTGGGAACCAAAGTGGAAATTCAGGATCTCAAAGACGAGCTGAGAGATTTTGCTCAGAAATGGGACAGATTGAAACAATCTCTACCAAAGGAGTATGAGACAGAAAAGGAGATTGANTCAGAAGAGGAAGAAGGCCAGTGTCATACTGAAGAGCCACAAATGCCACAAATTGAAAACAAAATGCGCAATACTTGTTATAATTGTTCAGTGTGCTGCTATAACGTTTTAAAACATTACAAACTTTATAGCAGTGCTTATGACAATATTTACCTGGCTTATAAACTGATTTTGACATTGTCTTGTACTCAGGTTGCATGTGAAAGAAGTTTTTCTACCTTAAAGAATATCAGAACCAGGCTCGGGAATAGACTCACTGAAGAACACTTAGAGAGTTTCATGCTTATGAGCATAAATTGTGATATTTTACTACGCCTAAACTATGATGAAATCATTGAAGGCGTCATAGCAAGAAGCAAAACCCTAGGGAGGCTATTAACAATTTAGCATTTCCAGTTCATGTGGGAATGTTAATTCCACTTACCGAAGAGAAGTACAGAAATGGTGTTCATTATGATTAAGACCTTGTAACTGTAGAAATGTAGAAGTGTGAAATTTATAATATTATAGACATGTTTAGTATAAAATGCAAATATCGCAGCAGAGTGGGTTGGGCATCCGGGTCTCAGCTGGGTTTCCCTTGAACCTATTACCTTACAAAGTTAAACACAGGTATAGGTCTTTCACAAAGGCAGTATGTTATTCTAAAGTAGCATCTTAGATCATCTTACTAGGGTTACGATGGCCACTTCAACTGGCTAACAGCTGTGCTAGCCTCACAAATAACCAAAAAGTATATTNATTTTTGGCAGCGGACTAGGGAAGTTGTGTGTGTGTTGGTTTAACAGTAAAATAGCATTTTCAAATTTTAATAAATTATTGGCATGTTATCTGTCATATACATG
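The ORF coordinates in the Comment field can be sanity-checked with a few lines of Python. This is a minimal sketch, not part of the record: it assumes the coordinates are 1-based and inclusive, and the helper names (`orf_length`, `translate`) are made up for illustration. The 27-nt window `ORF_HEAD` is copied from the Sequence field above; placing it at positions 504-530 is an inference from the KLF5 hit at 523-532 in the motif table, not something the record states.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def orf_length(start, end):
    """Length in nt of a feature given 1-based inclusive coordinates (assumed)."""
    return end - start + 1


def translate(dna):
    """Translate complete codons; codons with N or other ambiguity codes become 'X'."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, usable, 3))


# Coordinates quoted in the Comment field.
ORF_START, ORF_END = 504, 3239

# Assumed to be positions 504-530 of the consensus (see lead-in).
ORF_HEAD = "ATGGCTTCAAAGAAGAGAAAGGGCGGG"

n_nt = orf_length(ORF_START, ORF_END)
print(n_nt, n_nt % 3, n_nt // 3)   # 2736 0 912
print(translate(ORF_HEAD))         # MASKKRKGG
```

The span is a whole number of codons (912). The record does not say whether that count includes the stop codon. Because the consensus contains N positions, translating the full ORF with this helper would produce some `X` residues.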



TF motifs of the consensus sequence

Use FIMO to detect transcription factor motifs in the consensus sequence of the TE family.

TE_family  TFBS    Start  End   Strand  Score  Matched sequence
Arthur2    PHL1    839    850   +       16.59  AGAGAATATTCT
Arthur2    ZNF257  767    776   +       16.56  GAGGCAAGAG
Arthur2    SPDEF   3410   3419  -       16.33  ACCCGGATGC
Arthur2    KLF5    523    532   -       16.29  GCCCCGCCCT
Arthur2    KLF15   524    531   -       16.27  CCCCGCCC
Arthur2    INO4    3051   3059  +       16.24  GCATGTGAA
Arthur2    INO2    3051   3059  +       16.20  GCATGTGAA
Arthur2    PHL12   838    849   +       16.17  CAGAGAATATTC
Arthur2    ZNF85   1060   1071  +       16.15  CAGATTCCAGCA
Arthur2    PHL1    841    852   -       16.11  CAAGAATATTCT
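The table appears to report minus-strand hits as the reverse complement of the consensus, with Start and End always given on the plus strand. The sketch below checks that reading on the three overlapping hits around positions 838-852. It assumes 1-based inclusive coordinates; `WINDOW` is the 15-nt stretch copied from the Sequence field, and the `site` helper is hypothetical, not part of FIMO.

```python
# Complement table for DNA, keeping N as N.
COMPLEMENT = str.maketrans("ACGTN", "TGCAN")


def reverse_complement(dna):
    """Return the reverse complement of a DNA string."""
    return dna.upper().translate(COMPLEMENT)[::-1]


# Plus-strand consensus for positions 838-852 (1-based, inclusive; assumed).
WINDOW_START = 838
WINDOW = "CAGAGAATATTCTTG"


def site(start, end, strand):
    """Sequence of a hit as the table would show it, for coordinates inside WINDOW."""
    sub = WINDOW[start - WINDOW_START:end - WINDOW_START + 1]
    return sub if strand == "+" else reverse_complement(sub)


# Rows copied from the table: (TFBS, Start, End, Strand, Matched sequence).
hits = [
    ("PHL1", 839, 850, "+", "AGAGAATATTCT"),
    ("PHL12", 838, 849, "+", "CAGAGAATATTC"),
    ("PHL1", 841, 852, "-", "CAAGAATATTCT"),
]

for name, start, end, strand, reported in hits:
    # Prints True when the reconstructed site equals the reported one.
    print(name, start, end, strand, site(start, end, strand) == reported)
```

All three rows agree with the window, which supports reading the minus-strand PHL1 hit as the reverse complement of plus-strand positions 841-852.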


TFBS enrichment in GRCh38

Use Fisher's exact test to test whether transcription factor binding sites are enriched in copies of this TE family in the GRCh38 genome assembly.
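As a rough illustration of this kind of test, the sketch below computes a one-sided (enrichment) Fisher's exact p-value for a 2x2 table using only the standard library. The page does not specify the table layout, the background set, or the sidedness of the test, so all of those are assumptions here, and the counts are invented purely for the example.

```python
from math import comb


def fisher_enrichment_p(a, b, c, d):
    """One-sided Fisher's exact p-value, P(X >= a), for the table [[a, b], [c, d]].

    Assumed layout (not taken from the source):
        a = copies of this TE family overlapping a binding site
        b = copies of this TE family not overlapping one
        c = background elements overlapping a binding site
        d = background elements not overlapping one
    """
    n = a + b + c + d
    row1 = a + b            # all copies of the TE family
    col1 = a + c            # all elements overlapping a binding site
    denom = comb(n, row1)
    max_a = min(row1, col1)
    # Hypergeometric upper tail: tables at least as extreme as the observed one.
    tail = sum(comb(col1, k) * comb(n - col1, row1 - k) for k in range(a, max_a + 1))
    return tail / denom


# Hypothetical counts, for illustration only.
p = fisher_enrichment_p(30, 70, 200, 1800)
print(f"one-sided p = {p:.3g}")
```

A small p-value would suggest that binding sites overlap the TE family more often than expected from the background. In practice a multiple-testing correction across transcription factors would usually follow.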




GTEx

Promoter activity across 46 body sites from the Genotype-Tissue Expression (GTEx) project.




TCGA

The promoter activity across 33 cancer types from The Cancer Genome Atlas (TCGA).