HERV9


DF ID: DF0000173
TE superfamily: ERV1
TE class: LTR
Species: Catarrhini
Length: 8436
Kimura value: 5.24
Tau index: 0.9324
Description: Internal region of an ERV1 endogenous retrovirus, HERV9 subfamily
Comment: gag ~1287-2747, pol ~2748-6323, env ~6740-8380.
Sequence:
TTTTGGCGACCACGAAGGGACCATCGCCTATCGCCAAGCGGTGAGACTATCGCCTATCGCCAAGCGGTGAGTACCATCGGACCCCTTTCGCTTGCTATTCTGTCCTATTTTTCCTTAGAATTCGGGGGCTAAATACCGGGCACCTGTCGGCCAGTTAAAAGCGACTAGCGCGGCCGCCGGACTAAAGACACGGGTGTCAGGCTTTCTGGGAAAGGGCTCTCTAACAACCCCCGACTCTTCGGAGTTGGGAGCGTTGGTTTGCCTGGAACCAGCTTCCGCTTTTCCTGTACTTCTGGGCTGAGCCGAGGGTCGACAGAGAGGAAAGCCATTCAGCTCCAGGGGTCCCGACAACAAGTTGGTTGACCCTGCGGCCATGAGCGGAACTCTCAAAGTCATGTCGCCCAAGCGAGACTCGCCCATCTATCCTATCTATCCTGACCCTTGCCTCCTGGGTCCTAATGCCTGTCAGACAAACTTCCTCTCGCCTCTCTTCTCCGAGGCTAGTCCCGCTTCTAAAAACCACTCCCTGTCTCTGGTGCTTTTCTAGTTTCTCCTATAAGAATGATTTCTAGTATAAACTCCAGGACTCTGTTACCTTCTTTAGGCACCCGGGCTCACCAATCAGAAAGACATAATTTTTGCCCAAAGCCCCATCGTAGGGGGGACTATCTGGAATTTTAGGATCCCTCCTCAGACNAGCAGGCCTAACAAAAGCTATTCCTGAAGCTAGGATATGGGGAGCCTCAGAAATTGTATCCTTCCTATTCATATAAGTGAGGACAAAAGGCGTCACTCTTCCAACTCTGGAGATCCCTTCCCTCCCTCAGGGTATGGCCCTCCACTTCATTTTTGGGGCATAACATCTTTATAGGACACGGGTAAGGTCCCAATACTAACAGGAGAATGCTTAGGACTCTAACAGGTTTTCGAGAATGCGTCGGTAAGGGCCACTAAATCCGATTTTTCTCGGTCCTCTTTGTGGTCTAGGAGGACAGGCAAGGGTGCAGGTTTTCGAGAATGCGTCGGTAAGGGCCACTAAATCCGACCTTCCTCGGTCCTCCTTGTGGTCTAGGAGGAAAACTAGTGTTTCTGCTGCTGCGTCGGTGAGCGCAACTATTCCGATCAGCAGGGTCCAGGGACCGTTGCGGGTTCTTGGGCAAGAGGTGTTTCTGCTGCTGCGTCGGTGAGCGCAACTATTCCGATCAGCAGGGTCCAGGGACCGTTGCGGGTTCTTGGGCAGGGGGAGAAACAAACAAACCAAAACCGCGGGCGGTTTTGTCTTTCAGATGGGAAACACTCAGGCATCAACAGGCTCACCCTTGAAATGCATCCTAAGCCATTGGGACCAATTTGACCCGCAAACCCTGAAAAAGAGGCGGCTCATTTTTTTCTGCACTATGGCCTGGCCCCAATATTCTCTCTCTGATGGGGAAAAATGGCCACCTGAGGGAAGTATAAATTACAATACTATCCTGCAGCTTGACCTTTTCTGTAAGAGGGAAGGCAAATGGAGTGAAATACCTTATGTCCAAGCTTTCTTTTCATTGAAGGAGAATNCACAACTATGCAAAGCTTGCAATTTACATCCCACAGGAGGACCTCTCAGCTTACCCCCATATCCTAGCCTCCCTATAGCTCCCCTTCCTATTAATGATAAGCCTCCTCTAATCTCCCCCGCCCAGAAGGAAACAAGCAAAGAAATCTCCAAAGGACCACAAAAACCCCCGGGCTATCGGTTATGTCCCCTTCAAGCTGTAGGGGGAGGGGAATTTGGCCCAACCCGGGTACATGTCCCCTTCTCCCTCTCTGATTTAAAGCAGATCAAGGCAGACCTGGGGAAGTTTTCAGATGATCCTGATAGGTACATAGATGTCCTACAGGGTCTAGGGCAAACCTTCGACCTCACTTGGAGAGATGTCATGCTATTGTTAGATCAAACCCTGGCCTTTAATGAAAAGAATGCGGCTTTAGCTGCAGCCCGAGAGTTTGGAGATACCTGG
TATCTTAGTCAAGTAAATGATAGAATGACAGCCGAAGAAAGGGACAAATTCCCTACCGGTCAGCAAGCCGTCCCCAGTATGGATCCCCACTGGGACCTCGACTCAGATCATGGGGACTGGAGTCGCAAACATCTGTTGACCTGTGTTCTAGAAGGACTAAGGAGAATTAGGAAAAAGCCCATGAATTATTCAATGATGTCCACCATAACTCAGGGAAAGGAAGAAAATCCTTCTGCCTTCCTCGAGCGGCTACGGGAGGCCTTAAGAAAATATACTCCCCTGTCACCCGACTCACTCGAGGGTCAATTGATCCTAAAAGATAAGTTTATTACCCAATCAGCCGCAGATATCAGGAGAAAGCTCCAAAAGCGAGCCCTGGGCCCTGAACAAAATCTGGAGGCATTATTAAACCTGGCAACCTCGGTGTTCTATAATAGGGACCAAGAGGAACAGGCCCAAAAGGAAAAGCGAGATCAGAGAAAGGCCGCAGCCTTAGTCATGGCCCTCAGACAAACAAACCTTGGTGGTTCAGAGAGGACAGAAAATGGAGCAGGCCAATCACCCGGTAGGGCTTGTTATCAGTGTGGTTTGCAAGGACACTTTAAAAAAGATTGTCCAACGAGAAACAAGCCGCCCCCTCGCCCATGTCCACTATGCCGAGGCAATCACTGGAAGGCGCACTGCCCCAGAGGACAAAGGTTCTCTGGGCCAGAAGCCCCCAACCAGATGATCCAACAACAGGACTGAGGGTGCCCGGGGCAAGCGCCAGCTCATGTCATCACCCTCACTGAGCCCCGGGTACGTTTAACCATTGAGGGCCAGGAAATTGACTTCCTCCTGGACACTGGCGCGGCCTTCTCAGTGTTAATCTCCTGTCCCGGACGGCTGTCCTCAAGGTCCGTTACCATCCGAGGAATCCTGGGACAGCCTGTAACCAGGTATTTCTCCCACCTCCTCAGTTGTAATTGGGAGACTTTGCTCTTTTCACATGCCTTTCTTGTTATGCCTGAAAGTCCCACACCCTTATTAGGGAGGGACATATTAGCCAAAGCTGGAGCTATTATCTACATGAATATGGGGAACAAGTTACCCATTTGTTGTCCCCTACTTGAGGAGGGAATCAACCCTGAAGTCTGGGCATTGGAAGGACAATTCGGAAGGGCAAAAAATGCCCGCCCAGTCCAAATCAGGCTAAAAGACCCCACCACTTTTCCTTATCAAAGGCAATATCCCTTAAGGCCTGAAGCTCATAAAGGATTACAGGATATTGTTAGACATTTAAAAGCTCAAGGCTTAGTAAGGAAATGCAGCAGTCCCTGCAACACCCCAATTCTAGGAGTACAAAAACCGAACGGTCAGTGGAGACTAGTGCAAGATCTTAGACTCATCAATGAGGCAGTAATTCCTCTATATCCAGTTGTACCCAACCCCTATACCCTGCTCTCTCAAATACCAGAGGAAGCAGAATGGTTCACTGTTCTGGACCTCAAGGATGCCTTCTTCTGTATTCCCCTGCACTCTGACTCCCAGTTTCTCTTTGCCTTTGAGGATCCCACAGACCACACGTCCCAACTTACGTGGACGGTCTTGCCCCAAGGGTTTAGGGATAGCCCTCATCTGTTTGGTCAGGCACTGGCCCAAGATCTAGGCCACTTCTCAAGTCCAGGCACTCTGGTCCTTCAGTATGTGGATGATTTACTTTTGGCTACCAGTTCGGAAGCCTCATGCCAGCAGGCTACTCTAGATCTCTTGAACTTTCTAGCTAATCAAGGGTACAAGGCGTCTAGGTCGAAGGCCCAGCTCTGCCTACAGCAGGTCAAATATCTAGGCCTAATCTTAGCCAGAGGGACCAGGGCCCTCAGCAAGGAACGAATACAGCCTATACTGGCTTATCCTCGCCCTAAGACATTAAAACAGTTGCGGGGGTTCCTTGGAATCACCGGCTTTTGCCGACTATGGATCCCCGGATACAGCGAGATGGCCAGGCCNCTCTATACTCT
AATCAAGGAGACCCAGAGGGCAAATACTCATCTAGTAGAATGGGAACCAGAGGCAGAAACAGCCTTCAAAACCTTAAAGCAGGCCCTAGTACAAGCTCCAGCCTTAAGCCTTCCCACAGGACAAAACTTCTCTTTATACGTCACAGAGAGAGCGGGGATAGCTCTTGGAGTCCTTACTCAGACTCGTGGGACAACCCCACAACCAGTGGCATACCTAAGTAAGGAAATTGATGTAGTAGCAAAAGGCTGGCCTCACTGTTTACGGGTAGTTGCGGCGGTGGCCGTCTTAGTGTCAGAGGCTATCAAAATAATACAAGGAAAGGATCTCACTGTCTGGACTACTCATGATGTAAATGGCATACTAGGTGCCAAAGGAAGTTTATGGCTATCAGACAACCGCCTGCTTAGATACCAGGCGCTACTCCTTGAGGGACCGGTGCTTCAAATACGCACGTGCGCGGCCCTCAACCCTGCCACTTTTCTCCCAGAGGATGGGGAACCAATCGAGCATGACTGCCAACAAATTATAGTCCAGACTTATGCCGCCCGAGATGATCTCTTAGAAGTCCCCTTAGCTAATCCTGACCTTAACCTATATACCGATGGAAGTTCATTTGTGGAGAATGGGATACGAAGGGCAGGTTATGCCATAGTTAGTGATGTAACNGTACTTGAAAGTAAGCCTCTTCCCCCAGGGACCAGCGCCCAGTTAGCAGAACTAGTGGCACTTACCCGAGCCTTAGAACTGGGAAAGGGAAAAAGAATAAATGTGTATACAGATAGCAAGTATGCTTATCTAATCCTACATGCCCATGCTGCAATATGGAAAGAAAGGGAGTTCCTAACCTCTGGGGGAACCCCCATTAAATACCACAAGGAAATTATGGAGTTATTGCACGCAGTGCAAAAACCCAAGGAGGTGGCAGTCTTACACTGCCAAAGCCATCAGAAAGGTGAAGGAGAAAAGGCAGAAGGAAACCGTCGGGCAGATGCTGAGGCCAAAATTGCTGCCAGGCGGAACCTCCCATTAGAAATACCTACGGAAGGACCCTTGGTATGGAACAACCCCCTCCAAGAGATTAAGCCCCAGTATTCCCCGACTGAAACAGAATGGGGACTTTCACGGGGGCATAGTTTTCTCCCCTCGGGGTGGTTAACGACAGAAGAAGGAAAGGTACTTATACCCGAAGCCAGCCAGTGGAAAATACTTAAAACCCTCCACCAAACTTTTCATATGGGTATTGAAAACACTCATCAAATGGCCAAATCCCTATTTACAGGGCCAAATCTCCTCCGGACCATCCGACAGGTAGTCAAAGCCTGTGAGGTGTGCCAAAGGAATAATCCCTTGGTCCATCGTAAGGCCCCTTTGGGGGAACAAAGAATAGGTCACTATCCCGGAGAGGACTGGCAGTTAGACTTCACCCATATGCCTAAGTCAAAGGGATTTCAATACTTGTTGGTCTGTGTTGATACCTTTACAAATTGGATAGAAGCTTTCCCCTGCAAGACAGAGAAGGCTCAGGAAGTGATTAAAGTCCTAATTCATGAAATAATTCCTAGATTTGGGCTTCCCCAAAGCTTACAGAGTGACAATGGTCCGGCTTTTAAAGCCACGATAACTCAGGGAATTTCCAGGGCGCTAGGGATACAATATCACCTTCACTGCGCCTGGAGGCCACAATCCTCAGGGAAGGTCGAGAAGGCAAATGAAACACTCAAGAGGCACTTAAGGAAACTAACACAAGAAACTCATCTCCCATGGCCTACTCTTTTGCCCATGGCCTTGTTGAGAATCCGAAATTCTCCTCACAAAATGGGGCTCAGTCCATATGAAATGCTGTATGGACGACCTTTTCTCACAAATGACCTCCTACTTGATCAGGAAACGGCCAACTTGGTCAAAGATATAACTTCTTTGGCAAAATATCAACAAAACCTTAAAAACCTACCTGAAGGATGTCACAGAGAAAAGGGAACAGAGTTGTTTCAACCAGGAG
ATCTAGTGTTGGTCAAATCTCTCCCCTCTACCTCCCCATCTATGGACTCTTTGTGGGAAGGACCATACTCGGTAATCCTCTCTACCCCCACTGCAGTTAAGGTGGCAGGAGTGGAATCTTGGATTCACCACACCCGAGTTAAACTTTGGACACCCCCTGAGGAACCTGCGGGACCGTCAGCTCAGGAGTCCCAAGATCAGCCAGACCAGCCTCGATACACCTGCGAACCGTTGGAGGACTTGCATCTCCTATTTCGGAAGGAAACATCCCAGACTAAAAAGGCTCCTACCACTGATCCTGAGGAAAAACCCCTTCCTCCTTAAAAAAGATAAGTGAAAACCTACATAATCTTTATCTTTAACACCTCTCCTTGCCCCTTTAATGGAATCCTTTTACTATTTCATCATATTATTAAGCAGCATACTAACCATACTCTTTGCGATAGGACTATATACTGTAGCTCCTGCCGGGACGAAAATCCTAATCACATCAACCTTCTTTCTATCTTCCTTCCTTCTGACAGCAATTTACTCCTACCTTTAACTCAGACTGGATAAAATGATCTCGTCTTCCAGAGCACCCTCTTTACCTTCCTATTTACTCTTTGCCTATCTATCCCTCCTGCTTCCTTGGATACCTCATACAATCACCCCTCCCCTTCCACTAGCTCCTAATTACCTCTACAAGACTCTCAACTTAACCCACTCTCTGTTAAACCAGTCCAATCCTTCCCTGGCAAATGACTGTTGGCTTTGTATCTCTCTATCAACCTCTGCTTACGTTGCCACTCCCATTCCCGCAAAAAANCTGGGTCTTTACCAACTTAACCTACCACCCTCGTTATGAAGGAAAAGACCCTTTCCGACTTCTAAATATGCAATCATTAGCCGACTTCCCCATCTCTGATAGGACCAAGAATACCCTAACAGGACGTGCAATCCAACTTTTACGTTCTTACATTTCCAACCTCACCTATTACACAAGCAATGAAAAGCCCATACACGGCCCTGTAACTACGAATACCATCTTAACTTTCCAAGCCCCTTTATGCATCCAACGCAACCTGTTATCAGGCCTGCCCCTGGGGCACCTACTACCCCATCAGTGTAATTACACCCTACAACTTCAAGCCCCAACTGATCATAGTAACTTCCGAGTCACCCAAACAGCTCCATTCAGATGGCTTGTCCGCTTCTCAGGGCCCCCAAAAATCATCACCTCCTCCCTGCTTAACAAACAGTCCAGGTTTTGTAATGGCAAACATACTCCCTGCATGACCATTCACCCCTGGACCCCCTGCAGCAGCGCCCCCACCACTAGTGAATGCCTTCTCATCCCCTCTTTCAATCACTCTCTCGAATGGTTCCTAGTAGATACAAAACGGTTTTTTCTCCAATGGGAAAATAGAACACAGGGAGCCACTCAGTTTGCTCCCAACACCCCTTTCCAGCCGCTCACCGGAGCTACCTTGGCAAGTACTCTAGGAGTATGGGAAAATGAAAACAACAAACTCACACACCTTTTTAACATACACAACCAGTTCTGTCTACCCAGCCAAGGCATATTCTTCTTATGTGGAACGTCGACCTATATCTGCCTCCCCACTAACTGGACAGGCACCTGCACCTTAGTCTTCCTAAGTCCCAACATTAACATTGCCCCAGGAAATCAGACCCTATCAGTGCCCCTCAAAGCTCAAGTCCGTCAGCGCAGGGCCATACAACTAATACCCCTACTTATAGGGTTAGGAATGGCTACTGCTACAGGAACCGGAATAGCCGGTTTATCTACTTCATTATCCTACTACCACACACTCTCAAAGGATTTCTCAGACAGTTTGCAAGAAATAACGAAATCTATCCTTACTCTACAATCCCAAATAGACTCTTTGGCAGCAGTGACTCTCCAAAACCGCCGAGGCCTAGACCTCCTCACTGCTGAGAAAGGAGGACTCTGCACCTTCTTAGGGGAAGAGTGTTGTTTTTACACTAACCAGTCAG
GGATAGTACGAGATGCCGCCCGGCGTTTACAGGAAAAGGCTTCTGAAATCAGACAACGCCTTTCAAACTCTTATACCAACCTCTGGAGTTGGGCGACATGGCTTCTCCCCTTTCTAGGTCCCGTGGCAGCCATCTTGCTATTACTCGCCTTCGGGCCCTGTATTTTTAACCTCCTTGTCAAATTTGTTTCCTCTAGGATCGAGGCCATCAAGCTACAGATGGTCTTACAAATGGAACCCCAAATGAGCTCAACTAACAACTTCTACCGAGGACCCCTGGACCGACCCGCTGGCCCTTTCACTGGCCTAGAGAGTTCCCCTCTGGAGGACACTACAACTGCAGGGCCCCTTCTTCGCCCCTATCCAGCAGGAAGTAGCTAGAGCGGTCATCGCCCAATTCCCAACAGCAGTTGGGGTGTCCTGTTTAGAGGGGGGAT



TF motifs of the consensus sequence

FIMO is used to detect transcription factor motifs in the consensus sequence of the TE family.

TE_family TFBS Start End Strand Score Matched sequence
HERV9 ELF1 5523 5531 + 16.58 CAGGAAGTG
HERV9 HRS1 6113 6124 - 16.48 ATCCAAGATTCC
HERV9 ABR1 4269 4280 + 16.47 GTTGCGGCGGTG
HERV9 DOF5.1 4750 4768 + 16.45 GAAAGGGAAAAAGAATAAA
HERV9 ETV5::DRGX 5524 5535 + 16.39 AGGAAGTGATTA
HERV9 ETV5::FOXI1 6590 6601 - 16.38 GTAAATAGGAAG
HERV9 ZNF257 479 488 - 16.37 GAGGCGAGAG
HERV9 Zfx 7914 7923 + 16.36 GCCGAGGCCT
HERV9 Stat5b 204 212 - 16.36 TTCCCAGAA
HERV9 bHLH78 4450 4457 + 16.29 GCACGTGC
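
To put the motif hits in context, each hit can be assigned to the coding region it falls in, using the approximate gag, pol, and env coordinates from the Comment field. The following is a minimal sketch, assuming 1-based inclusive coordinates on the consensus sequence; the helper name `region_of` is hypothetical and not part of the database.

```python
# Approximate coding regions from the Comment field
# (assumed to be 1-based, inclusive coordinates on the consensus).
REGIONS = [("gag", 1287, 2747), ("pol", 2748, 6323), ("env", 6740, 8380)]

# (TFBS, start, end, strand) copied from the FIMO table.
HITS = [
    ("ELF1", 5523, 5531, "+"),
    ("HRS1", 6113, 6124, "-"),
    ("ABR1", 4269, 4280, "+"),
    ("DOF5.1", 4750, 4768, "+"),
    ("ETV5::DRGX", 5524, 5535, "+"),
    ("ETV5::FOXI1", 6590, 6601, "-"),
    ("ZNF257", 479, 488, "-"),
    ("Zfx", 7914, 7923, "+"),
    ("Stat5b", 204, 212, "-"),
    ("bHLH78", 4450, 4457, "+"),
]


def region_of(start, end):
    """Return the first annotated region overlapping [start, end]."""
    for name, r_start, r_end in REGIONS:
        if start <= r_end and end >= r_start:
            return name
    return "outside annotated regions"


for tf, start, end, strand in HITS:
    print(f"{tf}\t{start}-{end}\t{strand}\t{region_of(start, end)}")
```

Under these assumptions, six of the ten hits fall in pol, one (Zfx) falls in env, and three (ZNF257, Stat5b, ETV5::FOXI1) fall outside the annotated regions. Because the region boundaries are approximate, hits near a boundary should be read with caution.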


TFBS enrichment in GRCh38

Fisher's exact test is used to test for enrichment of transcription factor binding sites in copies of the TE family in GRCh38.
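
As an illustration, the enrichment test can be sketched as a one-sided Fisher's exact test on a 2x2 table: family copies with and without a binding site for a given factor, against background regions with and without one. This is an assumption about the setup, not the database's actual pipeline; the function name `fisher_enrichment_p` and the counts in the usage line are hypothetical.

```python
from math import comb


def fisher_enrichment_p(a, b, c, d):
    """One-sided (greater) Fisher's exact test p-value for the table

        [[a, b],   a = family copies with TFBS,   b = family copies without
         [c, d]]   c = background with TFBS,      d = background without

    computed as the hypergeometric upper tail P(X >= a).
    """
    row1 = a + b          # total family copies
    col1 = a + c          # total regions with a TFBS
    n = a + b + c + d     # all regions
    upper = min(row1, col1)
    tail = sum(
        comb(col1, k) * comb(n - col1, row1 - k)
        for k in range(a, upper + 1)
    )
    return tail / comb(n, row1)


# Toy counts for illustration only (not taken from the database).
print(fisher_enrichment_p(3, 1, 1, 3))  # 17/70, about 0.243
```

The sketch uses exact integer arithmetic from the standard library, which is convenient for small examples; for genome-scale counts a statistics library routine would be the more practical choice.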




GTEx

Promoter activity across 46 body sites from the Genotype-Tissue Expression (GTEx) project.




TCGA

Promoter activity across 33 cancer types from The Cancer Genome Atlas (TCGA).