Eutr5

Basic information Differential Expression Stage analysis Survival analysis Correlation analysis

DF ID DF0001258
TE superfamily Long_Terminal_Repeat_Element
TE class LTR
Species Eutheria
Length 1130
Kimura value 34.16
Tau index 0.9620
Description Ancient interspersed repetitive sequence from eutherian mammals.
Comment >71% identical to consensus. Origin unknown. More than 200 copies in eutherian mammals. Not present in opossum. ; but it shows no similarity to known SINEs and LINEs. Provisionally guessed LTR because of TG...CA termini but no obvious TSDs nor an obvious (AWTAAA) poly A signal in either orientation. GA-rich 5' end and C-rich satellite-like 3' tail support classification as a SINE. Present at orthologous sites in elephants and humans, absent in opossum.
Sequence
TGTGGCGGGATGAACTGAGATCCTCTTATCCATTCCCCTCCCTCCCGCGGTCAGGGGAGAAACCTGGGGATTAGAGAGAAGCGATTTGGGCATGGCGCCATTCATTGTCAGTTAATGTGAGGAAGCTTTGTAATGTAACCCAGGGCAGGGAAGGGGCAAATGCTCTCTTAGCATAATAAGGCCAGGGTTATTATTTCGCAAGAAATAGGGTCAAAGTTATGTTAATAGGAGGAATATGAATTTCTCATGTAGACCAGACTGGGTGGTCTTCGGAAGGAAATTAATCCTCGCGGTGGCGAGGTGTGTTTGATTTTAGATACTTTAGGAGAAAACCAGCCGCAGCCCGGAGTGGAAGCCCCACTCTGTTCCCACCCCCCTCCCCTNCCCCCAGGACCCAGAGCTCACCCGCCTCCTTTCCTGGGGAGCCAGCGGACCCTGAGACCAGAGCCCACCCCTCCTCTGCAGAGCCAGCAGAANCTGGGCAGAGGAGAAAAGCCACTGACATTCAGCGGACGCCTGAGGCTGCGGAATTTGCAAGACACAAGAAGGGACTAGGGAGACTGAGACTTGATTATTTCTGTAGGGAAGGGTGGCGGATCCCGGGAGAAAGAGGGATTGGATGGGGAAGCAGAAGGAGAAGAGGTTAAAGGGAGAGGGCNGNAGAAATGAGGGAGAAGATAGGAAGCGAGGGAGCGTCCACGTGGGACGCCCTCCGAAGGCCTAGGCAGTGAGGCCTCGTCCCTTGTGTGGTACAAGGGAGCGAGGTAGCTAGACCAGAGNCGCAATCTCCCATCTCCTTCCTCCATCCCGCTTTGGGACAATTACTTGAACCCTTCAGTAAAGAACTGCTTAAACCTCCATCGGAGTCCAGCGTGTATTCGGTTCGGGGAAGACGGGCGAGGCGGTATTCGGANCGGGGAGGGATAACCCCGCTCCNCGGCCATCTGATGGGCATCGGCAGCNCCCTCNCTGTGGACTACTGGGCGGCGAGGAGCATCNACCCGCAGACGACGACCCAGCTGGAGAAACGGTAAGGCCGATCCCCCCGACTTCCCTCCCCCCAGAGAGGCGACTAGAGACATNATCTGTGGGACAGAAAAGGGAGGAGCCCGCCACATGGGTCACATACA



TF motifs of the concenus sequence

Use FIMO to detect transcription factor motifs in the concenus sequence of the TE family.

TE_family TFBS Start End Strand Score Matched sequence
Eutr5 ZNF148 374 383 + 16.10 CCCCTCCCCT
Eutr5 MAZ 35 42 + 16.08 CCCCTCCC
Eutr5 MAZ 374 381 + 16.08 CCCCTCCC
Eutr5 ZNF148 35 44 + 16.08 CCCCTCCCTC
Eutr5 TFAP2B 515 523 - 16.07 GCCTCAGGC
Eutr5 SP1 374 382 - 16.06 GGGGAGGGG
Eutr5 ZNF331 461 470 + 15.87 TGCAGAGCCA
Eutr5 SP4 374 382 - 15.85 GGGGAGGGG
Eutr5 Zfx 717 726 - 15.73 GCCTAGGCCT
Eutr5 TfAP-2 515 523 + 15.56 GCCTGAGGC


TFBS enrichment in GRCh38

Use Fisher's exact test to perform enrichment analysis of transcription factor binding sites in the TE family of GRCh38.




GTEx

The promoter activity across 46 body sites from The Genotype-Tissue Expression (GTEx) project.




TCGA

The promoter activity across 33 cancer types from The Cancer Genome Atlas (TCGA).