Arthur2
Basic information Differential Expression Stage analysis Survival analysis Correlation analysisDF ID | DF0001276 |
---|---|
TE superfamily | Tip100 |
TE class | DNA |
Species | Mammalia |
Length | 3700 |
Kimura value | 32.91 |
Tau index | 1.0000 |
Description | Repetitive element conserved in all mammals. |
Comment | True termini are as yet undefined. ORF from pos 504-3239 encodes a transposases closest to that of hAT-82_HMa in Hydra in the Arthur (Dent/Riggs) sub group of Tip100/Hitchhiker elements. |
Sequence |
CTTTCTTTTNCAGAGCATTAAAAAATCATAAAGCAGAAGNCTTGCCACTAAGTAGTGGGTGGTGTGNTGGGTGGGGAAGGGGCCATTCTGTCGAAACANAGTGAAGAAGAAAACCANNTAGCGGACAATGTGCTAAGAAGAAGACGGCATTGTGTTTAGAATAAAGACTGGGAAAAGCAGTTCAGTTGGGTGACACCCGATCGAGACAGTCCTACTAATGGAAAGTGCAAACACTGTTTGGGGACATTTACTGTGAAATGGGATGGGGTAAAGGCACTGAAGGTGCATGAAAAATCAGCCTTGCGTAGGCAGAAAACACAAACTTTAGCAGAAAATAAATTGCTGGAAAAGTTTTTATTGAAGAAGAGTAGTTCTGAGTGTGTGATAGCTTTAGTTAAAGCTTACGTAGTTAATCATTCCGTATTTGAGATTCGGGTTTTTTTTCCCCTTGATGTAATTAGCCATTAGGCTTACTAAAGTACATTTTTCATGTCAAGCAGACAATGGCTTCAAAGAAGAGAAAGGGCGGGGCTGCAAGGGAGCGAGAAAAGAAGGCCAGGTTGATGCNTGAAGAGGCAAAACAGTGTAGGAAAATATCAACATTTCTTGCACAATCANTCACTGAAAATGTGTCAGCAAGCACGAGGCCATCTTTAACTGGCAATAATACAACTGAAATAGATCATTTAAGTAAGGGAGGCAGTATTACTGAGNCCAACATGCCTATCGACACTTGTTTGACTGTCTCAGAAGAACAAAATACCGAGAGGCAAGAGCTTAAAGTGAATGAAAATNTATCCAATTCACAATCAAATGAAAATGCTATGGGCCCAGAGCCAGAGAATATTCTTGNATTGGACTTTTTTCAGCGCCCTAAATTTAACCAGCTGGAATTCTTTTTTAAGTACCATCCTATTCAACCTTTTGAAATGAAGGATTTGCCTTTTAATGGGAAATCTGCATTTCATCGTAAAGATGGAACACAACGTCCTTGGCTTAGTTATTCTCCAGAGAAGCAGGCTCTTTTTTGTACTGTATGCTTAGCATATAGTAAAGACACAGATTCCAGCAAATTTATTTCAGGCATGANAGACTGGAGGCATACATACGTACGGATAGAGGAGCATGAAAAATGTAACCTCCACTTCCAGNGCGCTGAATGCCATATAATGAAAACTCTCGGAGGTGACATCACTAAACTCCTGTTTCACAACCAGGAGTCCGCGAGGATAGAACAAGTCAAGAGAAAAAGAGCAGTTTTAGAACGAGTGATTGATGTAATTAAAGTTATTGGTAAAGAAGGGCTCAGTTATCGAGGGGCAAATGAGTCAGCAGCAAGTTTAAGCAATGAAACAGCGCGCCATGGGGTGTTTCTCGAATTCTTGCTTATTTTGGGAAAGTATGATGCCCTACTCAAAGAGCACCTTGAATCTGTAATAGCTAGAGCAAATCGAAAAAGTAGTGGAGGAAGAAGTGCTCATATAACTCTCATTTCTAAAACAACAGTGAACTATGTTATTGACAGTATCGGCAGCTTAATTAAGAAGTCTGTTAGTGATGATGTTAAGAAAGCTGGCACATTTTCAATTCAGATTGACACTACTCAAGACATAGGCATAACTGATGTTTGTTCAGTAATACTAAGATATGTCACAGAATCTGTGAGAGAACGTCTGATATCTATAGTAAGCTTAAAGTCAAGTACCGGGGAAAATATGTTCCAGACTGTTGCAGATGTTCTCAGATCAAATNATATTTCTNTGAAAAATTGTGTAGGATGTTCAACTGATGGAGCATCNAACATGAAAGGACAGTTCAATGGGTTTGCATCNTGGTTAAAAAAAGAATCATCTAGCCAAGTTTATGTGTGGTGTTATTCACATGTTTTGAATCTGGTAATAGTAGATGTAACGGGAGTGTCAGAAGAGGCTACTTCACTATTTGGACTTTTAAATTCCTGTGCTGCATTTTTAAGAGAGTCACATAAAAGAATGGATATTTGGCGTGAACGCTGCAGGGACCTGTCGCACTCTCTCACTGGAGAAACAAGGTGGTGGTCTAAAGAGGTAGCTTTGAGAAAAGCGNTTGGGGTTTTTAAAGAGCCATCTAATTCATTATTTGTGACAATAATNGAAACTTCGAGTAGAATATGTTCCCACACAGAGGATTTTGGTTTAGATACCAGATACAAGGCTAGGACACTAAAACAGTCACTGTGCAGGTTTGAAACCATACTTACTGCTCAGTTGTTTCTGAAAANTTTTAGATCAACTACACCACTTTCAAAGTATCTGCAGACAAGTGGCATGGATATATTACAGGCCCAGAGAATGGTTTCAGAAACAATCAAAACATTACATGCAGAGTCAAGAAATTTTAGAGATGTTCTTGAGGGAGCACAAGCCTTCGTTTCATGGGCAAATNAAGAACTCAGTGCCAAGGAAATTGATGTTATGCTAGAAAGTTCCCTNCCTGAAGTACGTACGCGACTTAAGGGGAAAATGGATGGAGAAAATATAAGTGACCAACCTATGACATCAAATGAAAAATTCAGAATAAATGTCCATAATGTTGTAATGGACACAGTTGTTCAACACCTCAAAGATCGATTTAAAACGCATTCNCATCTTGCTGCAGATATCGCCTATCTAGATCCGAGAAACTTTAGTCACATAAACTCTAATATGCCTGATTTAGCTTTAAGAAACCTTTGTGACCTAATAAACAAACAAGGTCTGGTGGGAACCAAAGTGGAAATTCAGGATCTCAAAGACGAGCTGAGAGATTTTGCTCAGAAATGGGACAGATTGAAACAATCTCTACCAAAGGAGTATGAGACAGAAAAGGAGATTGANTCAGAAGAGGAAGAAGGCCAGTGTCATACTGAAGAGCCACAAATGCCACAAATTGAAAACAAAATGCGCAATACTTGTTATAATTGTTCAGTGTGCTGCTATAACGTTTTAAAACATTACAAACTTTATAGCAGTGCTTATGACAATATTTACCTGGCTTATAAACTGATTTTGACATTGTCTTGTACTCAGGTTGCATGTGAAAGAAGTTTTTCTACCTTAAAGAATATCAGAACCAGGCTCGGGAATAGACTCACTGAAGAACACTTAGAGAGTTTCATGCTTATGAGCATAAATTGTGATATTTTACTACGCCTAAACTATGATGAAATCATTGAAGGCGTCATAGCAAGAAGCAAAACCCTAGGGAGGCTATTAACAATTTAGCATTTCCAGTTCATGTGGGAATGTTAATTCCACTTACCGAAGAGAAGTACAGAAATGGTGTTCATTATGATTAAGACCTTGTAACTGTAGAAATGTAGAAGTGTGAAATTTATAATATTATAGACATGTTTAGTATAAAATGCAAATATCGCAGCAGAGTGGGTTGGGCATCCGGGTCTCAGCTGGGTTTCCCTTGAACCTATTACCTTACAAAGTTAAACACAGGTATAGGTCTTTCACAAAGGCAGTATGTTATTCTAAAGTAGCATCTTAGATCATCTTACTAGGGTTACGATGGCCACTTCAACTGGCTAACAGCTGTGCTAGCCTCACAAATAACCAAAAAGTATATTNATTTTTGGCAGCGGACTAGGGAAGTTGTGTGTGTGTTGGTTTAACAGTAAAATAGCATTTTCAAATTTTAATAAATTATTGGCATGTTATCTGTCATATACATG
|
TF motifs of the concenus sequence
Use FIMO to detect transcription factor motifs in the concenus sequence of the TE family.
TE_family | TFBS | Start | End | Strand | Score | Matched sequence |
---|---|---|---|---|---|---|
Arthur2 | TCX6 | 3655 | 3669 | + | 16.09 | TTTCAAATTTTAATA |
Arthur2 | STB3 | 2541 | 2551 | - | 15.72 | AATTTTTCATT |
Arthur2 | KLF7 | 524 | 531 | + | 15.72 | GGGCGGGG |
Arthur2 | Spps | 522 | 532 | - | 15.62 | GCCCCGCCCTT |
Arthur2 | NR3C2 | 978 | 992 | + | 15.57 | GGAACACAACGTCCT |
Arthur2 | AP3 | 2745 | 2757 | + | 15.57 | ACCAAAGTGGAAA |
Arthur2 | KLF1 | 524 | 531 | + | 15.53 | GGGCGGGG |
Arthur2 | ZNF766 | 2702 | 2710 | + | 15.50 | AAGAAACCT |
Arthur2 | KLF4 | 69 | 76 | - | 15.48 | CCCCACCC |
Arthur2 | AGL6 | 1380 | 1398 | - | 15.35 | TTCCCAAAATAAGCAAGAA |
TFBS enrichment in GRCh38
Use Fisher's exact test to perform enrichment analysis of transcription factor binding sites in the TE family of GRCh38.