Arthur2
Basic information Differential Expression Stage analysis Survival analysis Correlation analysisDF ID | DF0001276 |
---|---|
TE superfamily | Tip100 |
TE class | DNA |
Species | Mammalia |
Length | 3700 |
Kimura value | 32.91 |
Tau index | 1.0000 |
Description | Repetitive element conserved in all mammals. |
Comment | True termini are as yet undefined. ORF from pos 504-3239 encodes a transposases closest to that of hAT-82_HMa in Hydra in the Arthur (Dent/Riggs) sub group of Tip100/Hitchhiker elements. |
Sequence |
CTTTCTTTTNCAGAGCATTAAAAAATCATAAAGCAGAAGNCTTGCCACTAAGTAGTGGGTGGTGTGNTGGGTGGGGAAGGGGCCATTCTGTCGAAACANAGTGAAGAAGAAAACCANNTAGCGGACAATGTGCTAAGAAGAAGACGGCATTGTGTTTAGAATAAAGACTGGGAAAAGCAGTTCAGTTGGGTGACACCCGATCGAGACAGTCCTACTAATGGAAAGTGCAAACACTGTTTGGGGACATTTACTGTGAAATGGGATGGGGTAAAGGCACTGAAGGTGCATGAAAAATCAGCCTTGCGTAGGCAGAAAACACAAACTTTAGCAGAAAATAAATTGCTGGAAAAGTTTTTATTGAAGAAGAGTAGTTCTGAGTGTGTGATAGCTTTAGTTAAAGCTTACGTAGTTAATCATTCCGTATTTGAGATTCGGGTTTTTTTTCCCCTTGATGTAATTAGCCATTAGGCTTACTAAAGTACATTTTTCATGTCAAGCAGACAATGGCTTCAAAGAAGAGAAAGGGCGGGGCTGCAAGGGAGCGAGAAAAGAAGGCCAGGTTGATGCNTGAAGAGGCAAAACAGTGTAGGAAAATATCAACATTTCTTGCACAATCANTCACTGAAAATGTGTCAGCAAGCACGAGGCCATCTTTAACTGGCAATAATACAACTGAAATAGATCATTTAAGTAAGGGAGGCAGTATTACTGAGNCCAACATGCCTATCGACACTTGTTTGACTGTCTCAGAAGAACAAAATACCGAGAGGCAAGAGCTTAAAGTGAATGAAAATNTATCCAATTCACAATCAAATGAAAATGCTATGGGCCCAGAGCCAGAGAATATTCTTGNATTGGACTTTTTTCAGCGCCCTAAATTTAACCAGCTGGAATTCTTTTTTAAGTACCATCCTATTCAACCTTTTGAAATGAAGGATTTGCCTTTTAATGGGAAATCTGCATTTCATCGTAAAGATGGAACACAACGTCCTTGGCTTAGTTATTCTCCAGAGAAGCAGGCTCTTTTTTGTACTGTATGCTTAGCATATAGTAAAGACACAGATTCCAGCAAATTTATTTCAGGCATGANAGACTGGAGGCATACATACGTACGGATAGAGGAGCATGAAAAATGTAACCTCCACTTCCAGNGCGCTGAATGCCATATAATGAAAACTCTCGGAGGTGACATCACTAAACTCCTGTTTCACAACCAGGAGTCCGCGAGGATAGAACAAGTCAAGAGAAAAAGAGCAGTTTTAGAACGAGTGATTGATGTAATTAAAGTTATTGGTAAAGAAGGGCTCAGTTATCGAGGGGCAAATGAGTCAGCAGCAAGTTTAAGCAATGAAACAGCGCGCCATGGGGTGTTTCTCGAATTCTTGCTTATTTTGGGAAAGTATGATGCCCTACTCAAAGAGCACCTTGAATCTGTAATAGCTAGAGCAAATCGAAAAAGTAGTGGAGGAAGAAGTGCTCATATAACTCTCATTTCTAAAACAACAGTGAACTATGTTATTGACAGTATCGGCAGCTTAATTAAGAAGTCTGTTAGTGATGATGTTAAGAAAGCTGGCACATTTTCAATTCAGATTGACACTACTCAAGACATAGGCATAACTGATGTTTGTTCAGTAATACTAAGATATGTCACAGAATCTGTGAGAGAACGTCTGATATCTATAGTAAGCTTAAAGTCAAGTACCGGGGAAAATATGTTCCAGACTGTTGCAGATGTTCTCAGATCAAATNATATTTCTNTGAAAAATTGTGTAGGATGTTCAACTGATGGAGCATCNAACATGAAAGGACAGTTCAATGGGTTTGCATCNTGGTTAAAAAAAGAATCATCTAGCCAAGTTTATGTGTGGTGTTATTCACATGTTTTGAATCTGGTAATAGTAGATGTAACGGGAGTGTCAGAAGAGGCTACTTCACTATTTGGACTTTTAAATTCCTGTGCTGCATTTTTAAGAGAGTCACATAAAAGAATGGATATTTGGCGTGAACGCTGCAGGGACCTGTCGCACTCTCTCACTGGAGAAACAAGGTGGTGGTCTAAAGAGGTAGCTTTGAGAAAAGCGNTTGGGGTTTTTAAAGAGCCATCTAATTCATTATTTGTGACAATAATNGAAACTTCGAGTAGAATATGTTCCCACACAGAGGATTTTGGTTTAGATACCAGATACAAGGCTAGGACACTAAAACAGTCACTGTGCAGGTTTGAAACCATACTTACTGCTCAGTTGTTTCTGAAAANTTTTAGATCAACTACACCACTTTCAAAGTATCTGCAGACAAGTGGCATGGATATATTACAGGCCCAGAGAATGGTTTCAGAAACAATCAAAACATTACATGCAGAGTCAAGAAATTTTAGAGATGTTCTTGAGGGAGCACAAGCCTTCGTTTCATGGGCAAATNAAGAACTCAGTGCCAAGGAAATTGATGTTATGCTAGAAAGTTCCCTNCCTGAAGTACGTACGCGACTTAAGGGGAAAATGGATGGAGAAAATATAAGTGACCAACCTATGACATCAAATGAAAAATTCAGAATAAATGTCCATAATGTTGTAATGGACACAGTTGTTCAACACCTCAAAGATCGATTTAAAACGCATTCNCATCTTGCTGCAGATATCGCCTATCTAGATCCGAGAAACTTTAGTCACATAAACTCTAATATGCCTGATTTAGCTTTAAGAAACCTTTGTGACCTAATAAACAAACAAGGTCTGGTGGGAACCAAAGTGGAAATTCAGGATCTCAAAGACGAGCTGAGAGATTTTGCTCAGAAATGGGACAGATTGAAACAATCTCTACCAAAGGAGTATGAGACAGAAAAGGAGATTGANTCAGAAGAGGAAGAAGGCCAGTGTCATACTGAAGAGCCACAAATGCCACAAATTGAAAACAAAATGCGCAATACTTGTTATAATTGTTCAGTGTGCTGCTATAACGTTTTAAAACATTACAAACTTTATAGCAGTGCTTATGACAATATTTACCTGGCTTATAAACTGATTTTGACATTGTCTTGTACTCAGGTTGCATGTGAAAGAAGTTTTTCTACCTTAAAGAATATCAGAACCAGGCTCGGGAATAGACTCACTGAAGAACACTTAGAGAGTTTCATGCTTATGAGCATAAATTGTGATATTTTACTACGCCTAAACTATGATGAAATCATTGAAGGCGTCATAGCAAGAAGCAAAACCCTAGGGAGGCTATTAACAATTTAGCATTTCCAGTTCATGTGGGAATGTTAATTCCACTTACCGAAGAGAAGTACAGAAATGGTGTTCATTATGATTAAGACCTTGTAACTGTAGAAATGTAGAAGTGTGAAATTTATAATATTATAGACATGTTTAGTATAAAATGCAAATATCGCAGCAGAGTGGGTTGGGCATCCGGGTCTCAGCTGGGTTTCCCTTGAACCTATTACCTTACAAAGTTAAACACAGGTATAGGTCTTTCACAAAGGCAGTATGTTATTCTAAAGTAGCATCTTAGATCATCTTACTAGGGTTACGATGGCCACTTCAACTGGCTAACAGCTGTGCTAGCCTCACAAATAACCAAAAAGTATATTNATTTTTGGCAGCGGACTAGGGAAGTTGTGTGTGTGTTGGTTTAACAGTAAAATAGCATTTTCAAATTTTAATAAATTATTGGCATGTTATCTGTCATATACATG
|
TF motifs of the concenus sequence
Use FIMO to detect transcription factor motifs in the concenus sequence of the TE family.
TE_family | TFBS | Start | End | Strand | Score | Matched sequence |
---|---|---|---|---|---|---|
Arthur2 | PHL1 | 839 | 850 | + | 16.59 | AGAGAATATTCT |
Arthur2 | ZNF257 | 767 | 776 | + | 16.56 | GAGGCAAGAG |
Arthur2 | SPDEF | 3410 | 3419 | - | 16.33 | ACCCGGATGC |
Arthur2 | KLF5 | 523 | 532 | - | 16.29 | GCCCCGCCCT |
Arthur2 | KLF15 | 524 | 531 | - | 16.27 | CCCCGCCC |
Arthur2 | INO4 | 3051 | 3059 | + | 16.24 | GCATGTGAA |
Arthur2 | INO2 | 3051 | 3059 | + | 16.20 | GCATGTGAA |
Arthur2 | PHL12 | 838 | 849 | + | 16.17 | CAGAGAATATTC |
Arthur2 | ZNF85 | 1060 | 1071 | + | 16.15 | CAGATTCCAGCA |
Arthur2 | PHL1 | 841 | 852 | - | 16.11 | CAAGAATATTCT |
TFBS enrichment in GRCh38
Use Fisher's exact test to perform enrichment analysis of transcription factor binding sites in the TE family of GRCh38.