HERV4_I


DF ID: DF0000172
TE superfamily: ERV1
TE class: LTR
Species: Haplorrhini
Length: 6539
Kimura value: 11.78
Tau index: 0.9815
Description: Internal region of an ERV1 endogenous retrovirus, HERV4 subfamily
Comment: Associated long terminal repeat includes MER51A.
Sequence:
TATTTTGGCGAGCCAGCCAGGAGGTAAGCCCAAAGTTTGGGATTTATTTTTCTCTTTTTCCTCTTTCTCTCTCTCTTTTCCTTTCCAACTCGGGACCCTCGGTGGACAGCGCCTAAGCACGGAGGCAACTGCAGGTTTCTGGCCGGGGCCACTCTCCGGTGAAACTGAAAGGTTTCCGTGTGGAAGCGCCTGACCGCCACCGCCCGGTTCGGGTGAGGGACCTGAGTCCTTTTCTTTTTCAGTCTTTCAGCGGCCGTTTCCTAGTAGCTCCTTGGTAATTGAGGGCAACTGGCCGGGGCCACTCTCCGGTGTTACCTGAAGGCCAAGGAGTGAACGGGGATAGCTGCCCTGCCCGGAAGGGGGAAGGACTCTTTTCTATCTTTTCCGGTTATAGTCCCTGATCCCTACGTGTGACGCAATTGGCAGCGGCAGCTCGTCCAGGGCGAACTCACACACGTTTCAGGCGACTTAAACCTTCTTTTCTTATGCTAAATTCTTCCCTTCCCCTACTCGACTGGCTAAGGACAAGTCAGAGGGTCCGGGCATGTCGTAGATGGTCTGTGTGAGTCATGGGGAGGGGATTCATGAAAGGGAATTTATGTACAATTTAATCTTGCCTAAATTTAGAGAGTTAAAGGATTGTTTTAAGTGGGATAGGAAAAAAAATCCAAAGGTTTGACTGAAAGTTAATTCTAGAAGTCGAGGCCTTCATCCAGGGACAAGAGGGAAAGCTCATAGTAGGTCATCAGTGGTGGAGGGAACCATTCCAAAGCGGTGCCGGCACCCATCTAAGGTCAGAGACGTCTGACAGACTAAGACGGGGCCCTAAAGGGGGGACGCCCCCGGGGACCCCAGTCNGGGCCCAGAATTTTTCCAGGGGGATGCCCCGGGTAAAATTTGGGTCACCTAATGAGCCCTCCACTTTTCAAAGTCCTCTTCTCTTTTCCAGACCACTATGGGCAACTCTCCATCTATTCCACCTGATTCCACTATGGGCAACTCTCCATCTATTCCACCTGATTCCCCGCTTGGCTGCATCCTCAACCATTGGAATCAATTTGACCCTGACAATCTAAGGAGAAAACGTNTGATTTTTTTCTGCAATACTGTTTGGCCCCANTATNAGCTGNNCAGCCAGGAACAATGGGCGGTCAATGGTAGCCTTAATTATGACACCATCCTGCAATTAGACCTATTTTGCAAGAGGCAGGGCAAATGGTCAGAAATCCCATATGTACAGGCCTTCATGGCCCTATACCAAAACCCAACAATCTGCAAAACTCCCAGAACCCGCCCCCCAAAGGAAAGTCCTAAGGCAGAACTAGATATTGTAGATGACCCCCTTTTACAAGGGCCACCTGTCTCTCAGGGNGAACAGCAACCGCCCCCATATAGCCCCTTGCCAAGTGCTCCTGAGGCTAAAACCCAGGAGCAAACACCGGGGACCCTACTAAGTCCCCCTCACACTCGGAGGGGAACACCNTATTCAACTCTCCCTCCAGCCCTGCTACCCCTTAGGGAAGTAGCAGGAGCCGAGGGGCCAGTCCGAGTGCAGGCCCCCTTCTCTATAACTGATATACAACAATGTAAGGAAAAGCTAGGAAGCTATTCTGAGAACCCCGGGAAATTTGCAGATGGGTTCCAAACTTTGACCTTAGCCTTTGATCTCTCATGGAGAGATGTTCAATTCATTCTAGCAACCTGTTGCACCCCCTCGGAAAAGGAACGAATCTTTGAGGCCGCCCGCCGGGAAGCGGACGANTTATTCGCCCGAAACCCTCAGGGCAATCACCCGGGCCCAGACACAGTCCCCACTACTGATCCTAATTGGGACTATAACACCCCCGTGGGAATGAACAACCGGGCTAAATTTCTTGAGGCTCTCCTTGGAGGAATGAGAAAGGGAATAACTAAGGCAGTAAATTATGATAAAGTAAGGGAGGTTACACAAGGCAAGGAGGAAAATCCAGCCATGTTTTATGGCAGGCTGGAGGAAGCCT
TTAAAAAATATACNAATCTGGACCCTTCCTCTCCCGAAGGCAAAATATTAATGGCACAGCATTTCATTAGCCAATCCGCCCCGGACATTAGACGTAAGCTCCAAAAGCTACAGATGGGGCCACAAACTAATCAAAATCAGCTTCTTGATACCGCCTTTATGGTGTATAACAATCGTGACCTGGAGGAAGGAAAAAGGGAACAGAGTAAAGAAAAACGGCAAGCCAAAATTATGGCAGCCATCATTGGCGATGCCCTGAATGCCCAAAGAGCGTCCAAGGGAAACCCGAAGGGCCATAAGGATAATGCCAGCAAAGGCTCTTGCTTCAAGTGCAAGAAAAATGGGCATTGGGCAAAGGACTGTACTAAGCCCCCGCCAGGCCCCTGCCGTCAATGCGAAGGCACCAGTCACGACCCCTGGCACTGGAGAATTGACTGCCCCCGCTCCCACCGAGGGGCTCAGTCAGTCAAAACTCTAGCAGTGCAAAAGGAGGAATTAGATGAAGACTGAAGGGGCCCGGGGCCTTCCTCACCGCCCCTGTCCAGGAACATCGTNATTACTACTGAGGAGCCCCGGGTAACTCTGGACGTCATGGGCACCCAAATTCAGTTTCTTTTTGATACAGGGGCAAATTACTCTGTCCTTACTGCTTATGCAGGAAAACTTTCCTCCCGGTCCACGAGTGTTATGGGAATGGAAGGAAAGCCACAAACAAGATTCTTTACTCCTCCTTTGACTTGTCAATTTGAGAAACAAATCTTCCAACAGGAATTTCTAGTAGTACCAAGCTGCCCAGTCCCCCTGTTGGGAAGAGATATTATGGTTAAAATAGGGGCACTGCTACAATTTAAGCACCGCCCGGCGAAATTGCTAATAGTCAGNAATGCAGACAATGTCCCAGACCACGTTAATAAACAGGTCAACCCGCTGGCATGGTATACTGGGAAACCGGGGAAGGCTAAAACGGCAGTGCCAGTCAAAATACAGCTTAAAGACCCCAGCTATTTTCCCAATCGAAAACAATACCCAATTAAGCTGGAAGCAAGAAAAGGCCTAGCACCCATAGTTGAGGTATTACTTACCCATGGACTCTTAAAACCCTGCAATTCTCCCTGCAATACCCCCATCTTACCCGTTCTAAAGCCTTCGGGGGAATACCGGNTAGTACAGGACCTCAGAATAATTAATGAGGCTGTTATCCCCGTCCACCCATTGGTGGCGGATCCATATACCCTCCTGGCTCAGGTGCCAGGGGATGCAAAATGGTTCTCAGTCCTAGACCTAAAAGATGCTTTCTTCTCCATTCCTCTGGCCCCAGAGTCCCAATACCTTTTTGCCTTTGAATGGGAAAATCCTAATACCAGAGAAAAACAACAATACACTTGGACAGTGCTCCCTCAGGGCTTTCGGGATAGCCCCCATTTCTTTGCCCGAGCCTTAGAGAGGGATCTGAGGGATCTGCAATTGGAGAATGGGAGTATACTCCAGTATGTGGATGACCTTCTTGTGTGTAGCCCAACCCAGGAGGCTTCTGACCAAAATACTATAAAAACTTTGAATTTCCTGGCAGACAGGGGATACAAAGTGTCCAAAAAGAAGGCTCAGATTACCCTCCAACGGGTCCAATATTTAGGGTATGTCTTAACACCCGGAGCCCGGCAAATATCCCCAGAACGAGTGCAAGCCATATGTGGTTTGGGGCCCCCCCACACCAAGCAGCAGCTTCGTTCTTTTTNGGGAATGGCCGGGTTTTGCAGAATATGGGTACCAAATTTTGGGCTCATAGCAAAGCCCCTNTATGAAGCAACAAGGGGGCCTGAAAATGAGCTAATGGAATGGACCCCGGAAATGAGGGAAGCCTTCGCCAAGTTAAAACAGGCTCTCACCCAGGCTCCCGCTCTTGGCATCCCAGACCTNACTAAGCCCTTCTCCTTGTATGTAGCAGAGAAGAAGGGCATAGCTGTGGGAGTGCTAGCCCAGAAATTAGGATCAGAACCCAGA
CCAACCGCCTACTTTTCAAAGAAGTTGGACGGAGTGGCCTCGGGGTGGCCAAGCTGCCTGCGGGCAATAGCAGCCACTGCTATTTTAGTGGAGGAAGCCACTAAAATCACCCTGGGTCAACCACTGGAAGTTCTAACCCCNCATCAGGTAAAGTCAGTCTTAGAGATAAAAGGACACATCTGGATGACGGGGGAAAGGTTAACCAAATACCAGGCCATGCTCCTAGACAATCCAGATGTAACCCTTAAAACCTGTAACACCTTGAATCCAGCTTCATTGCTGCCCACAGGCCCAATAACTGATCATTCCTGCGAGCAGGTCATCGCACACACATATGTTAGCCGGCCTGATTTAAAAGATCAGCCTCTCCCAGATTCTGAGGATGACTGGTTCACAGACGGCAGTAGTTTTGTGTCAAATGGGGAGCGCCGAGCTGGATATGCAATAGTAAATCACAACACCATTATTGAAGCCCAGCCACTGCCCCCTGGCACATCAGCACAAAAGGCTGAAATCATTGCTCTTACCCGAGCATTAATGTTGGGACAAGGGAAAAAGCTTAACATCTATACAGATTCTAAATATGCATTCCTTGTGGTTCATGCTCATGCTGCAATCTGGAAAGAAAGGGGACTACTAACTAGCAAACACTCCCCCATAAAGCATGGGCCTGAAATTCTTCAGCTATTGGAAGCAATACACCTGCCAAAGGCCGTAGCTATAATCCATTGTAGGGGGCATCAAAGGGACTTAACCCCTATAGCACAAGGGAACAGAAAGGCTGATAGAGAAGCCAAAGCCGCAGCCCTCAGGGTGCAATCCCAACAGATCCTAGCACTGCTTCCTTTCTATGATTCCCCAATAGAACCTGAATACACACCACAGGAAGAACAGTTAATAAAGGAGCAAGGGGGACAAAAACAAGGATCCTGGTGGTATATGGGATCAAAAGTATATCTCCCTCAAACAGCCCAATGGAGAGTTATAAAAACCCTGCATGACTCTTTCCATATGGGGAGAGATGCCACCCTGGCCATGGTAAACAGGCTCTTCATTGGGCCTAACTTAGCTTCGGTGGTTAAGCAGGTCTGTCAAGCCTGCTCACTGTGTGCACTTAACAACCCAGGAAACAAAATGCCTCCTCTAATAGAACCAGTCCAGAGGAGAGGAACTTACCCAGGGGAAGACTGGCAATTAGACTTCACCCATATGCCAGCTTGCAGAGGATACAAGTTTTTGCTAGTACTAATAGACACCTTTACTGGCTGGGTCGAAGCTTACCCTACCAGAACAGAGAAGGCTAATGAGGTTATAAAGGTTCTCTTAAAGGAAATAATCCCCCGGTTTGGGTTACCCCAGAGCCTCCAAAGTGATAACGGCCCGTCCTTTATCTCCCAAATAACTCAAGGGGTTGCTAAGGCTCTCGGAATCAAATACTATTTACATTCAGCATGGAGGCCTCAATCCTCCGGGAAAGTAGAAAGGGCTAATCAAACTCTAAAACGGGCGTTAGCTAAGCTATGTCAGGAAACATCAGAAACTTGGGTCAGCTTACTGCCCATAGCCCTCTTAAGGATCCGTAATNCCCCTAGAGCAAAAATTAATATGAGCCCATATGAAATGTTATACGGAAGGCCATTTTTAACTAATGATCTAATTACTGATCCAGAAACAGCCGGTTTAGTAAAATACCTAGTTAACCTGGGACAATTTCAGCAGGCTTTACAAAAGTTTGGAACTCAAAGGCTCCCCACACCGGGAACTAACCAGCAACCCAAAATCAGGCCAGGAGATAAGGTACTTGTTAAAACATGGAAGGAGGGATCACCTGCTCAACAATTACAACCCAAATGGAAGGGACCGTTTTCAGTGGTACTGGCCACGCCTTCTGCGGTCAAAGTACTAGGATTAGATAGTTGGATACATCTTTCCAGGGTCAAGCCTGCGATACCTGAAGCCCCGGACCTGGAACCTGAAGCTCCCATCAGCCACTACACCTG
TGAACCTGTGGAAGACCTGAAGTACCTGTTTAAAAGACAGCCAAAAGATAAGTAAATGCCTACCAACTTTCCTTGGTGTCTTTGTTGCATAGTTACTGTAGGCTGGATAATAGTAGCCATTTTTANNNNTATTTTTGCAGTTTAATTGCCTTCTTCCAAACGGATGGAATCACTTCCTTTGTAATAATTAAGCAGAATGTTTTAATTCATCTCTATAACAAACATTCCTGACAGCATAGGTATCCACCCCCTGAAGTTCCCATTAAATCTTTTAACCAAATTCATTTCCTCTCGCCTAGAGACCATCAAGCTTCAGATGATCATGCGACAAGGNTTCCAGCCAGTTCCAGGTGAAGACACCACCCCTGGCCATCAAGAAGCTACCCTGTCTCCACTAGACAGAGCAGGGCGAGAGTTCCGTGATCCCCAATAGGTAGGGACTACGCCCCAAGTCAGCATGAAGCAGTTACAGAAGAAAGACCATCGGTCCCTCTGCCTCCCATAAAGATTTATGGGGATCACGTCTCTCAGGGGGGAGA



TF motifs of the consensus sequence

Use FIMO to detect transcription factor motifs in the consensus sequence of the TE family.

TE_family  TFBS    Start  End   Strand  Score  Matched sequence
HERV4_I    DREB2D  191    204   +       16.73  TGACCGCCACCGCC
HERV4_I    BPC1    52     75    -       16.70  GAGAGAGAGAAAGAGGAAAAAGAG
HERV4_I    PLAG1   1386   1399  -       16.67  GGGGCTATATGGGG
HERV4_I    TB1     3699   3707  +       16.64  GGCCCCCCC
HERV4_I    Hnf1A   1659   1668  +       16.49  CCTTTGATCT
HERV4_I    DOF5.8  41     59    +       16.45  GATTTATTTTTCTCTTTTT
HERV4_I    DOF3.4  43     59    +       16.35  TTTATTTTTCTCTTTTT
HERV4_I    ASR1    4288   4296  +       16.33  AGGCCCAAT
HERV4_I    ASR1    5054   5062  -       16.33  AGGCCCAAT
HERV4_I    eor-1   64     76    -       16.31  AGAGAGAGAGAAA
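
The hit coordinates can be cross-checked against the consensus sequence. The sketch below is illustrative only and is not part of the database pipeline. It assumes that Start and End are 1-based and inclusive, and that a minus-strand hit reports the reverse complement of the forward-strand window. Both assumptions are consistent with the rows above. Only the first 80 bases of the consensus are embedded, so only the four hits that fall entirely inside that window are checked. The function names are hypothetical.

```python
# First 80 bases of the HERV4_I consensus, copied from the Sequence field.
CONSENSUS_PREFIX = (
    "TATTTTGGCGAGCCAGCCAGGAGGTAAGCCCAAAGTTTGG"
    "GATTTATTTTTCTCTTTTTCCTCTTTCTCTCTCTCTTTTC"
)

# Hits from the table that lie entirely within the first 80 bases:
# (TFBS, start, end, strand, matched sequence)
HITS = [
    ("BPC1", 52, 75, "-", "GAGAGAGAGAAAGAGGAAAAAGAG"),
    ("DOF5.8", 41, 59, "+", "GATTTATTTTTCTCTTTTT"),
    ("DOF3.4", 43, 59, "+", "TTTATTTTTCTCTTTTT"),
    ("eor-1", 64, 76, "-", "AGAGAGAGAGAAA"),
]

_COMPLEMENT = str.maketrans("ACGTN", "TGCAN")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string (N stays N)."""
    return seq.translate(_COMPLEMENT)[::-1]


def site_sequence(seq: str, start: int, end: int, strand: str) -> str:
    """Extract the 1-based inclusive window [start, end], read on `strand`."""
    window = seq[start - 1:end]
    return window if strand == "+" else reverse_complement(window)


def check_hits(seq: str, hits) -> dict:
    """Map each TFBS name to True if the table's matched sequence agrees."""
    return {
        name: site_sequence(seq, start, end, strand) == matched
        for name, start, end, strand, matched in hits
    }


if __name__ == "__main__":
    for name, ok in check_hits(CONSENSUS_PREFIX, HITS).items():
        print(name, "OK" if ok else "MISMATCH")
```

The same check extends to the remaining rows if the full 6539-base consensus is supplied in place of the 80-base prefix.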


TFBS enrichment in GRCh38

Use Fisher's exact test to test for enrichment of transcription factor binding sites within this TE family in the GRCh38 genome.
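
As a rough illustration of this kind of test, the sketch below computes a one-sided Fisher's exact p-value for a 2x2 contingency table using only the Python standard library. The page does not state how the table is built or whether the test is one- or two-sided. The layout used here (TE copies versus background regions, overlapping versus not overlapping a TF binding site), the one-sided alternative, the function name, and the counts are all assumptions made for the example.

```python
from math import comb


def fisher_exact_greater(a: int, b: int, c: int, d: int) -> float:
    """One-sided (enrichment) Fisher's exact p-value for the table

                      overlaps TFBS   no overlap
        TE copies           a             b
        background          c             d

    This layout is an assumed one for illustration. The p-value is the
    hypergeometric probability of seeing `a` or more overlapping TE copies
    when all row and column totals are held fixed.
    """
    row1 = a + b          # total TE copies
    col1 = a + c          # total regions overlapping a TFBS
    n = a + b + c + d     # all regions
    upper = min(row1, col1)
    tail = sum(
        comb(col1, k) * comb(n - col1, row1 - k)
        for k in range(a, upper + 1)
    )
    return tail / comb(n, row1)


if __name__ == "__main__":
    # Hypothetical counts, chosen only to show the call.
    p = fisher_exact_greater(3, 1, 1, 3)
    print(f"one-sided p = {p:.4f}")  # 17/70, about 0.2429
```

For real genome-scale counts, a library routine such as `scipy.stats.fisher_exact` with `alternative="greater"` is the more usual choice, and p-values across many transcription factors would normally be corrected for multiple testing.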




GTEx

The promoter activity across 46 body sites from The Genotype-Tissue Expression (GTEx) project.




TCGA

The promoter activity across 33 cancer types from The Cancer Genome Atlas (TCGA).