Arthur2
DF ID | DF0001276 |
---|---|
TE superfamily | Tip100 |
TE class | DNA |
Species | Mammalia |
Length | 3700 |
Kimura value | 32.91 |
Tau index | 1.0000 |
Description | Repetitive element conserved in all mammals. |
Comment | True termini are as yet undefined. The ORF at positions 504-3239 encodes a transposase closest to that of hAT-82_HMa in Hydra, in the Arthur (Dent/Riggs) subgroup of Tip100/Hitchhiker elements. |
Sequence |
CTTTCTTTTNCAGAGCATTAAAAAATCATAAAGCAGAAGNCTTGCCACTAAGTAGTGGGTGGTGTGNTGGGTGGGGAAGGGGCCATTCTGTCGAAACANAGTGAAGAAGAAAACCANNTAGCGGACAATGTGCTAAGAAGAAGACGGCATTGTGTTTAGAATAAAGACTGGGAAAAGCAGTTCAGTTGGGTGACACCCGATCGAGACAGTCCTACTAATGGAAAGTGCAAACACTGTTTGGGGACATTTACTGTGAAATGGGATGGGGTAAAGGCACTGAAGGTGCATGAAAAATCAGCCTTGCGTAGGCAGAAAACACAAACTTTAGCAGAAAATAAATTGCTGGAAAAGTTTTTATTGAAGAAGAGTAGTTCTGAGTGTGTGATAGCTTTAGTTAAAGCTTACGTAGTTAATCATTCCGTATTTGAGATTCGGGTTTTTTTTCCCCTTGATGTAATTAGCCATTAGGCTTACTAAAGTACATTTTTCATGTCAAGCAGACAATGGCTTCAAAGAAGAGAAAGGGCGGGGCTGCAAGGGAGCGAGAAAAGAAGGCCAGGTTGATGCNTGAAGAGGCAAAACAGTGTAGGAAAATATCAACATTTCTTGCACAATCANTCACTGAAAATGTGTCAGCAAGCACGAGGCCATCTTTAACTGGCAATAATACAACTGAAATAGATCATTTAAGTAAGGGAGGCAGTATTACTGAGNCCAACATGCCTATCGACACTTGTTTGACTGTCTCAGAAGAACAAAATACCGAGAGGCAAGAGCTTAAAGTGAATGAAAATNTATCCAATTCACAATCAAATGAAAATGCTATGGGCCCAGAGCCAGAGAATATTCTTGNATTGGACTTTTTTCAGCGCCCTAAATTTAACCAGCTGGAATTCTTTTTTAAGTACCATCCTATTCAACCTTTTGAAATGAAGGATTTGCCTTTTAATGGGAAATCTGCATTTCATCGTAAAGATGGAACACAACGTCCTTGGCTTAGTTATTCTCCAGAGAAGCAGGCTCTTTTTTGTACTGTATGCTTAGCATATAGTAAAGACACAGATTCCAGCAAATTTATTTCAGGCATGANAGACTGGAGGCATACATACGTACGGATAGAGGAGCATGAAAAATGTAACCTCCACTTCCAGNGCGCTGAATGCCATATAATGAAAACTCTCGGAGGTGACATCACTAAACTCCTGTTTCACAACCAGGAGTCCGCGAGGATAGAACAAGTCAAGAGAAAAAGAGCAGTTTTAGAACGAGTGATTGATGTAATTAAAGTTATTGGTAAAGAAGGGCTCAGTTATCGAGGGGCAAATGAGTCAGCAGCAAGTTTAAGCAATGAAACAGCGCGCCATGGGGTGTTTCTCGAATTCTTGCTTATTTTGGGAAAGTATGATGCCCTACTCAAAGAGCACCTTGAATCTGTAATAGCTAGAGCAAATCGAAAAAGTAGTGGAGGAAGAAGTGCTCATATAACTCTCATTTCTAAAACAACAGTGAACTATGTTATTGACAGTATCGGCAGCTTAATTAAGAAGTCTGTTAGTGATGATGTTAAGAAAGCTGGCACATTTTCAATTCAGATTGACACTACTCAAGACATAGGCATAACTGATGTTTGTTCAGTAATACTAAGATATGTCACAGAATCTGTGAGAGAACGTCTGATATCTATAGTAAGCTTAAAGTCAAGTACCGGGGAAAATATGTTCCAGACTGTTGCAGATGTTCTCAGATCAAATNATATTTCTNTGAAAAATTGTGTAGGATGTTCAACTGATGGAGCATCNAACATGAAAGGACAGTTCAATGGGTTTGCATCNTGGTTAAAAAAAGAATCATCTAGCCAAGTTTATGTGTGGTGTTATTCACATGTTTTGAATCTGGTAATAGTAGATGTAACGGGAGTGTCAGAAGAGGCTACTTCACTATTTGGACTTTTAAATTCCTGTGCTGCATTTTTAAGAGAGTCACATAAAAGAATGGATATT
TGGCGTGAACGCTGCAGGGACCTGTCGCACTCTCTCACTGGAGAAACAAGGTGGTGGTCTAAAGAGGTAGCTTTGAGAAAAGCGNTTGGGGTTTTTAAAGAGCCATCTAATTCATTATTTGTGACAATAATNGAAACTTCGAGTAGAATATGTTCCCACACAGAGGATTTTGGTTTAGATACCAGATACAAGGCTAGGACACTAAAACAGTCACTGTGCAGGTTTGAAACCATACTTACTGCTCAGTTGTTTCTGAAAANTTTTAGATCAACTACACCACTTTCAAAGTATCTGCAGACAAGTGGCATGGATATATTACAGGCCCAGAGAATGGTTTCAGAAACAATCAAAACATTACATGCAGAGTCAAGAAATTTTAGAGATGTTCTTGAGGGAGCACAAGCCTTCGTTTCATGGGCAAATNAAGAACTCAGTGCCAAGGAAATTGATGTTATGCTAGAAAGTTCCCTNCCTGAAGTACGTACGCGACTTAAGGGGAAAATGGATGGAGAAAATATAAGTGACCAACCTATGACATCAAATGAAAAATTCAGAATAAATGTCCATAATGTTGTAATGGACACAGTTGTTCAACACCTCAAAGATCGATTTAAAACGCATTCNCATCTTGCTGCAGATATCGCCTATCTAGATCCGAGAAACTTTAGTCACATAAACTCTAATATGCCTGATTTAGCTTTAAGAAACCTTTGTGACCTAATAAACAAACAAGGTCTGGTGGGAACCAAAGTGGAAATTCAGGATCTCAAAGACGAGCTGAGAGATTTTGCTCAGAAATGGGACAGATTGAAACAATCTCTACCAAAGGAGTATGAGACAGAAAAGGAGATTGANTCAGAAGAGGAAGAAGGCCAGTGTCATACTGAAGAGCCACAAATGCCACAAATTGAAAACAAAATGCGCAATACTTGTTATAATTGTTCAGTGTGCTGCTATAACGTTTTAAAACATTACAAACTTTATAGCAGTGCTTATGACAATATTTACCTGGCTTATAAACTGATTTTGACATTGTCTTGTACTCAGGTTGCATGTGAAAGAAGTTTTTCTACCTTAAAGAATATCAGAACCAGGCTCGGGAATAGACTCACTGAAGAACACTTAGAGAGTTTCATGCTTATGAGCATAAATTGTGATATTTTACTACGCCTAAACTATGATGAAATCATTGAAGGCGTCATAGCAAGAAGCAAAACCCTAGGGAGGCTATTAACAATTTAGCATTTCCAGTTCATGTGGGAATGTTAATTCCACTTACCGAAGAGAAGTACAGAAATGGTGTTCATTATGATTAAGACCTTGTAACTGTAGAAATGTAGAAGTGTGAAATTTATAATATTATAGACATGTTTAGTATAAAATGCAAATATCGCAGCAGAGTGGGTTGGGCATCCGGGTCTCAGCTGGGTTTCCCTTGAACCTATTACCTTACAAAGTTAAACACAGGTATAGGTCTTTCACAAAGGCAGTATGTTATTCTAAAGTAGCATCTTAGATCATCTTACTAGGGTTACGATGGCCACTTCAACTGGCTAACAGCTGTGCTAGCCTCACAAATAACCAAAAAGTATATTNATTTTTGGCAGCGGACTAGGGAAGTTGTGTGTGTGTTGGTTTAACAGTAAAATAGCATTTTCAAATTTTAATAAATTATTGGCATGTTATCTGTCATATACATG
|
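The ORF coordinates in the Comment field can be sanity-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, which the record does not state explicitly; `orf_span` and `slice_1based` are hypothetical helper names introduced for illustration only.

```python
def orf_span(start: int, end: int) -> tuple[int, int, bool]:
    """Return (length in nt, number of whole codons, length divisible by 3?).

    Assumes 1-based, inclusive coordinates (an assumption, not stated in the record).
    """
    length = end - start + 1
    return length, length // 3, length % 3 == 0


def slice_1based(sequence: str, start: int, end: int) -> str:
    """Extract sequence[start..end] using 1-based, inclusive coordinates."""
    return sequence[start - 1:end]


# ORF coordinates quoted in the Comment field of this record.
length, codons, in_frame = orf_span(504, 3239)
print(length, codons, in_frame)  # 2736 912 True
```

Under that assumption the ORF covers 2736 of the 3700 consensus positions and its length is a multiple of three, consistent with an intact reading frame.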
TF motifs of the consensus sequence
FIMO is used to detect transcription factor motifs in the consensus sequence of the TE family.
TE_family | TFBS | Start | End | Strand | Score | Matched sequence |
---|---|---|---|---|---|---|
Arthur2 | Sox21b | 3461 | 3475 | - | -9.00 | ACCTATACCTGTGTT |
Arthur2 | THI2 | 1814 | 1828 | - | -9.44 | GCAAACCCATTGAAC |
Arthur2 | EWSR1-FLI1 | 534 | 551 | + | -9.57 | GCAAGGGAGCGAGAAAAG |
Arthur2 | EWSR1-FLI1 | 687 | 704 | + | -11.51 | TTAAGTAAGGGAGGCAGT |
Arthur2 | EWSR1-FLI1 | 1455 | 1472 | + | -11.70 | AAAAGTAGTGGAGGAAGA |
Arthur2 | EWSR1-FLI1 | 538 | 555 | + | -12.57 | GGGAGCGAGAAAAGAAGG |
Arthur2 | EWSR1-FLI1 | 691 | 708 | + | -15.54 | GTAAGGGAGGCAGTATTA |
Arthur2 | ZSCAN16 | 1301 | 1318 | - | -21.19 | CCTCGATAACTGAGCCCT |
Arthur2 | BPC5 | 515 | 544 | + | -44.00 | GAAGAGAAAGGGCGGGGCTGCAAGGGAGCG |
Arthur2 | BPC5 | 3101 | 3130 | + | -44.34 | GAATAGACTCACTGAAGAACACTTAGAGAG |
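Rows in the table above can be checked against the consensus sequence. The following is a minimal sketch, assuming that Start and End are 1-based inclusive positions on the consensus and that the matched sequence of a minus-strand hit is reported as the reverse complement of that window; the Sox21b and THI2 rows are consistent with this reading. The helper names and the short `fragment` (with coordinates local to the fragment, not to the full consensus) are illustrative assumptions.

```python
# Complement table for DNA, keeping the ambiguity code N unchanged.
COMPLEMENT = str.maketrans("ACGTN", "TGCAN")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def hit_matches(consensus: str, start: int, end: int, strand: str, matched: str) -> bool:
    """Check one motif hit against the sequence it was called on.

    Assumes 1-based inclusive coordinates and that minus-strand hits
    report the reverse complement of the forward-strand window.
    """
    window = consensus[start - 1:end].upper()
    if strand == "-":
        window = reverse_complement(window)
    return window == matched.upper()


# A short stretch of the consensus around the Sox21b hit, used here with
# fragment-local coordinates (2..16) instead of the full 3700 bp sequence.
fragment = "AAACACAGGTATAGGTCTT"
print(hit_matches(fragment, 2, 16, "-", "ACCTATACCTGTGTT"))  # True
```

With the full consensus loaded into `consensus`, the same call with the table's own Start and End values should apply to each row.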
TFBS enrichment in GRCh38
Fisher's exact test is used to test for enrichment of transcription factor binding sites in copies of the TE family in GRCh38.
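As an illustration of how such an enrichment test can be computed, here is a minimal sketch of a one-sided Fisher's exact test on a 2x2 table, implemented with the standard library only. The layout of the contingency table (TE copies versus background regions, with or without a binding site) and the counts in the example are assumptions made for illustration; the page does not state how its table is built or whether a one- or two-sided test is used.

```python
from math import comb


def fisher_exact_greater(a: int, b: int, c: int, d: int) -> float:
    """One-sided Fisher's exact test (enrichment) for the 2x2 table

                     with TFBS   without TFBS
        TE copies        a            b
        background       c            d

    Returns P(X >= a) under the hypergeometric null with fixed margins.
    """
    row1 = a + b          # total TE copies
    row2 = c + d          # total background regions
    col1 = a + c          # total regions with a binding site
    total = row1 + row2
    denom = comb(total, col1)
    upper = min(row1, col1)
    # Sum the probabilities of all tables at least as extreme as the observed one.
    tail = sum(comb(row1, k) * comb(row2, col1 - k) for k in range(a, upper + 1))
    return tail / denom


# Hypothetical counts, not taken from the database.
p_value = fisher_exact_greater(8, 2, 1, 5)
print(f"{p_value:.4f}")  # 0.0245
```

A small p-value here would indicate that binding sites overlap copies of the family more often than expected from the background, given the assumed table layout.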