Eutr5

Basic information Differential Expression Stage analysis Survival analysis Correlation analysis

DF ID DF0001258
TE superfamily Long_Terminal_Repeat_Element
TE class LTR
Species Eutheria
Length 1130
Kimura value 34.16
Tau index 0.9620
Description Ancient interspersed repetitive sequence from eutherian mammals.
Comment >71% identical to consensus. Origin unknown. More than 200 copies in eutherian mammals. Not present in opossum. ; but it shows no similarity to known SINEs and LINEs. Provisionally guessed LTR because of TG...CA termini but no obvious TSDs nor an obvious (AWTAAA) poly A signal in either orientation. GA-rich 5' end and C-rich satellite-like 3' tail support classification as a SINE. Present at orthologous sites in elephants and humans, absent in opossum.
Sequence
TGTGGCGGGATGAACTGAGATCCTCTTATCCATTCCCCTCCCTCCCGCGGTCAGGGGAGAAACCTGGGGATTAGAGAGAAGCGATTTGGGCATGGCGCCATTCATTGTCAGTTAATGTGAGGAAGCTTTGTAATGTAACCCAGGGCAGGGAAGGGGCAAATGCTCTCTTAGCATAATAAGGCCAGGGTTATTATTTCGCAAGAAATAGGGTCAAAGTTATGTTAATAGGAGGAATATGAATTTCTCATGTAGACCAGACTGGGTGGTCTTCGGAAGGAAATTAATCCTCGCGGTGGCGAGGTGTGTTTGATTTTAGATACTTTAGGAGAAAACCAGCCGCAGCCCGGAGTGGAAGCCCCACTCTGTTCCCACCCCCCTCCCCTNCCCCCAGGACCCAGAGCTCACCCGCCTCCTTTCCTGGGGAGCCAGCGGACCCTGAGACCAGAGCCCACCCCTCCTCTGCAGAGCCAGCAGAANCTGGGCAGAGGAGAAAAGCCACTGACATTCAGCGGACGCCTGAGGCTGCGGAATTTGCAAGACACAAGAAGGGACTAGGGAGACTGAGACTTGATTATTTCTGTAGGGAAGGGTGGCGGATCCCGGGAGAAAGAGGGATTGGATGGGGAAGCAGAAGGAGAAGAGGTTAAAGGGAGAGGGCNGNAGAAATGAGGGAGAAGATAGGAAGCGAGGGAGCGTCCACGTGGGACGCCCTCCGAAGGCCTAGGCAGTGAGGCCTCGTCCCTTGTGTGGTACAAGGGAGCGAGGTAGCTAGACCAGAGNCGCAATCTCCCATCTCCTTCCTCCATCCCGCTTTGGGACAATTACTTGAACCCTTCAGTAAAGAACTGCTTAAACCTCCATCGGAGTCCAGCGTGTATTCGGTTCGGGGAAGACGGGCGAGGCGGTATTCGGANCGGGGAGGGATAACCCCGCTCCNCGGCCATCTGATGGGCATCGGCAGCNCCCTCNCTGTGGACTACTGGGCGGCGAGGAGCATCNACCCGCAGACGACGACCCAGCTGGAGAAACGGTAAGGCCGATCCCCCCGACTTCCCTCCCCCCAGAGAGGCGACTAGAGACATNATCTGTGGGACAGAAAAGGGAGGAGCCCGCCACATGGGTCACATACA



TF motifs of the concenus sequence

Use FIMO to detect transcription factor motifs in the concenus sequence of the TE family.

TE_family TFBS Start End Strand Score Matched sequence
Eutr5 MZF1 65 72 - 14.60 AATCCCCA
Eutr5 mxl-3 697 704 + 14.59 CCACGTGG
Eutr5 mxl-3 697 704 - 14.59 CCACGTGG
Eutr5 ZNF417 93 99 - 14.55 GGCGCCA
Eutr5 ZNF417 94 100 + 14.55 GGCGCCA
Eutr5 NFIX 88 101 - 14.53 ATGGCGCCATGCCC
Eutr5 TFAP2E 515 523 - 14.51 GCCTCAGGC
Eutr5 AT1G74840 26 34 - 14.47 AATGGATAA
Eutr5 TFAP2E 515 523 + 14.47 GCCTGAGGC
Eutr5 GCR1 350 357 + 14.47 TGGAAGCC


TFBS enrichment in GRCh38

Use Fisher's exact test to perform enrichment analysis of transcription factor binding sites in the TE family of GRCh38.




GTEx

The promoter activity across 46 body sites from The Genotype-Tissue Expression (GTEx) project.




TCGA

The promoter activity across 33 cancer types from The Cancer Genome Atlas (TCGA).