HERV1_I


DF ID: DF0000163
TE superfamily: ERV1
TE class: LTR
Species: Catarrhini
Length: 8801
Kimura value: 13.19
Tau index: 0.0000
Description: Internal region of an ERV1 endogenous retrovirus, HERV1_I subfamily
Comment: The associated LTRs are the HERV1_LTR variants. Target site duplications are 4 bp. The HERV1_I consensus encodes gag, protease, reverse transcriptase, RNase H, integrase and envelope proteins. HERV1 elements in the genome are ~6% diverged from the HERV1_I consensus.
Sequence
TTTCTGGGGGCTCGTCCGGGATTGGAGACGGCAGGTTTCTGTCTCCTTTGCCTGTGGGCTGGAGCCCCGGGNCGCGGGAGACCCGGGACCCNAGGCGCCACCGGGNAAGACTTAGCCCGGAAGGAGANCGGCTCTCCCGCGTCCCGGNGCCCTCCCCCGGCAGCGCAAACGGAACCGANNGAGGGGCTGCAGGACGATCNCAGGAGCAGCGCGCAGNCAGNCCGCTGAACCGCGGTAAGGTTGGGCCCNAGGAAGGCCCGTCCCATAAGGACGGAAGGGGAGCCTGATCACCTCCCGGGGCGCGACNACTAGTCCGACCCAGAGGGGCTGGGGGCGGCGGGAGTGGCCCGCCGATTCGGATGAANCTCACGCCCCCACTACAAGCGAGAGTGGTTCACTGGGTCTGGAGACGGGAACTGGAGGTGTGTGGGTGCGTGCGAACCNACCCGGGACACGAGGGAGGCTCGTTTCATCCGATGAGGTGGGGNAGGAGTGGTGTGTGTATGTGTGTGAATGTGGGAGCCTAACTAGGCTCACCCGGGACACGAGAGAGGCTCGTTTCATCCGATGAGGAGTCCTGGGGCGGGGGAGGTGTGTGAAAGTGTGTGAAAGAGACGGTCTCGGGAGAGGCCAACGCGGGGAGTGACGTGGGGAGGCACAGATCTCTTAGCGCGGACTGTGTGCTCCGAGGCGAGTGTGGGANAAACCAGACCTAGGNCACTGCATACGGCCGATAGGACCAGCTCCACAGCTNCACAGCAGCAGTTGGCTGTGACCTGGCTAAGCAGCGTCCGAACCTCCCGTAATAGGACCCGGTCTGGTGGATCCGAGAGTGAAAGTGAGAGTGAAAGCGCGCCGCGAGGGAGGAAATGGGAGGAAAAGCATCGAAGCCNACTCCATTGGAGTGCATGCTGAAGAACTTTAAGAAAGGTTTTAATGGTGATTATGGGGTNAAGCTAACTCCNCAGAAGCTGAGAACNCTTTGTGAGATAGACTGGCCNTCTTTTAATGTAGGGTGGCCGGCCGAGGGNACNATAGACAGGGAAATAATTGGCCGNGTGTTTCGGGTGGTCACCGGGGTCGGAGAACAGCCNGGGCACCCGGATCAGTTTCCGTATATNGACTCCTGGCTAAGCGTAATTCAGACCCGCCCNAAGTGGCTGCAGGCCTGCTTTGAGGNNTACTGTAAGACTCTAGTGGCCCGGACAAAACNAGGAACCATAGAAAAGACCCGCAAGGCGCAGNCNCAAGAGAAGGAGTCGCAGGGAAAGCAGAAAAAACCTGTCCTACAGGCCCCGCCNGAAGAGTTAGAAANTCCACCCCCCTATGCNCCAATTTATCCATCTCTGGCAAGGCTTAGGCAGGAGGCCGCCCCGGCAGCTGCCTCCGGAGGNTCAGACTCAGAGGAGAGCACCCCTCAGGCNNCACCACGCAGGGAGGAGCCAGAGCCCCTGCCTGANAAGCCAAGGGAGGAACTCCAGGATGAGGTCGGCCGCCTCAGGTCAGGCCGCGCCCGAGCNATGCAGATGCCCCTCCGAGAAACNNGGGGACAAATTTATTTGGATGCACAGAATGAAGTCCAAGGGGGAGAACGGCTCTTCGTTTATCAGCCCTTCTCTACTACTGATCTCTTAAATTGGAGACAGCATACTCCCTCCTATACGGAGAAGCCCCAGGCTCTTATAGACCTAATGCAGTCCATCTTCCTAACTCACAACCCTACCTGGGCTGATTGCAAACAACTTCTTCTGTCATTGTTTAATACGGAAGAGCGCCGNAGAGTTATACAAGCGGCTCNCCAGTGGCTGGAGAGCAATGCGCCTGCAGGCACAGGAGATGTCAGGCAGTATGCACAACAGGCNCTCCCGATAGAGGCTGACCCAGGCTGGGACCCNAACCAGGCTCAAGGGCTACAAAGCTTGCAGNGGTATCGAGAGGCACTCCTAAATGGAATAAAGGCTGGAGGGAAAAAGGCAACGAATATCGGAAAGGTCTCAGAG
GTCCGCCAGAAGCCAGATGAAAGTCCCAGTGAATTTTATGAGAGGCTCTGCGAGGCTTACCGGCTTTACACGCCATTTGACCCAGAGGCTGCAGGNAATCAGTGCATGGTTAATGCGGCATTTGTAAGCCAGGCGCAAGGNGACATNAAGCGAAAGCTTCAGAAGTTGGAAGGNTTTGAAGGTATGAATATTACCCAGCTTATCCAGGTGGCTACTAAGGTGTTTGTAAATCGGGATGAGGAGGCCAAGAGAGAAGCCAAGCGCAGAGCNAAGGAAAAGGCAGANTTGCTGGCNGCAGCCCTGGTTGGAAGAGAAACTGGNTTTGCGAGAGGACGTGGACGTGGTCGTGGATGCGGTCACGGTAGAGGACAAGCTAGGCCAGGCCAGGAGGCCAGGNCAGGNCAAGAGGGCCGGCCTAGGCTNGAGAGAGATCAATGTGCGAGATGCAAGCAGANAGGGCACTGGAAGGATGAATGTCCAGAGAGAGAAAAGGATAAAGGCAACAACCAGGGACAGAATGGCTGGCCAGGGCCCCCTNCNGCCGCCGGNCANGGCGTAGTAGGATCNGACGCGGATCTAATCGGGCTGGCAGGAGTCGATGATTATTNTGAGGACTGAGACAGACCGGGCTCCATCTCATTAGGCCCCGAGGAGCCTATGGTCTCAATGGAGGTAGGGGGCCGAAAAATGGACTTTATGGTNGATACTGGTGCTGAGCACTCGGTNGTGACTCAAGCAATTGGGCCGCTGTCTAAAAACTATGCCAATATNATTGGGGCTACAGGNGTCACAGAAAAGACGCCTTNCTTCAAATCNAAGAGATGTGTGATTGGAGGNCAAGAAGTCCAACACGAGTTTTTATATTTGCCAAATTGTCCGGTGCCCTTGTTAGGAAGAGACTTGCTCCAGAAACTGCAGGCNCAAATCTCCTTTACACCGAGAGGGGACATGACCCTAAACCTAGGTCAAAGAAAGGCCATGGTANTGACCCTTACCGTCCCNANAACAGAGGAATGGAGACTCTATGAGAGNAGTTGCNAGGAATNTGNAAAGANGCACANCGCAGCTGAGAAAGAGGNANTGTNTACGGANTTACTTCTCAAGCTGCCAGGGGTCTGGGCGGAGGACAATCCCCCGGGGCTAGCCGTAAATCAGGCACCCGTAGTGGTGGAGCTGCTGCGAGGNACCTACCCAGTGCGGATCCGTCAGTATCCCATTCCCGTAGAGGCCACCCANGGGATTACAAAACACTTAAANCGGCTCCTTGAATTTGGGATAATAGAAAGATGTGCCTCCTCNTGGAACACTCCNCTGCTGCCGGTGTTAAAGCCCTCTGGNGACTACCGGCCNGTACAGGATTTGCGGGCNGTAAACAAGGTCGCGGCTACACTGCATGCCATTGTGCCCAACCCGTACACNATGCTTGGGCGAATNCCTGCTGATGCTGCTTGGTTTACATGCTTGGACATNAAGGATGCGTTCTTCTGCATCCGACTAGCCCCTGNAAGCCAGGGCATCTTTGCCTTTGAGTGGGGCCCATCNCAGTATACCTGGACCAGACTCCCCCAAGGATTTAAAAACTCCCCAACCATCTTTGAGGAAGCACTAGCCTCAGACCTGAAGGCTTTCACGCCACCAAGTGACCGCTGTGTCCTGTTGCAGTACATAGATGATCTATTGTTGGCCGCACCCACAAGAGAAGAGTGCNTCCAAGGNACAGAGAGCCTCCTTCGNGTNCTGTGGGAGGCTGGCTATAAGGTGTCTAAGGAAAAGGCACAAATCTGTGGCCAAGGAGCNCGGTATCTTGGCTTTNACGTCTCCCAAGGGCAGCGTGAGCTTGGACGNGAGCGAAAAGAGACTGTNTGTAGCATTCCTCGGCCNGACACNAGGCGGCAAGTGCGGGAGTTCCTAGGGGCAGCTGGTTTCTGCCGCATTTGGATTCCAAACTACTCGCTCNTGGCAAAGCCNTTGTATGAGGCTACCAAAGNGGGGGAAAAGGA
ACCCCTCCTGTGGGGAAAAGAGCAGGACATGGCCTTCAAGGAAATCAAGAAGGCTTTGATCCAGGCCCCGGCATTAGGACTGCCAGACATGACAAAGCCTTTTTACCTGTATGTCCATGAAAGAAAAGGAATAGCTACAGGAGTCTTGGTACAAACGCTAGGGTCATGGTATCGGCCCGTGGCATATTTGTCCAAGCGACTAGACTTGGTGGCTATGGGATGGCCACCCTGTTTCAAGGCACTGGCNGCCACTGCCCTGTTAGCNGAAGATGCTAACAAGCTCACATTTGGACAGAGGTTGATAATTCGGGTGCCCCACACGGTCGTCACCCTGATGGAGCAGAGGGGGCATCGCTGGCTCTCTAACCCTAGGATGTTAAGATATCAAGGGCTCTTGTGTGAAAACCCNTACATAACCTTGGAGACTGTGAATACCCTAAATCCGGCCACACTGCTGCCAATAGAATGGGCGGAGCATGGAAAGCCCCCGTTGTGTGGCCCAGGGTATCACTGTTGTGTGGAAACAGTGGATGAAGTTTTCTCAAGCCGGAAAGACTTAAAGGACCAGCCCTTAAAAGACCCAGATGTTGAATACTTTACTGATGGAAGCAGCTTCATATCTGAGGGTGTCAGAAAGGCCGGATATGCAGTGGTNACACTGAACTCAGTAGCCGAAGCCCGCCCTCTGCCGGTCGGAACCTCGGCCCAAAGGGCNGAGCTAATAGCTCTCACNAGAGCACTGCTCCTGGCGAAAGGAAAGTCAGTAAACATCTATACTGACTCAAGGTATGCTTTTGCCACTTTGCATGCCCATGGAGCCATATATAAGGAAAGAGGATTATTAACTACTGAAGGAAAGGAAATCAAAAATAAAAAGGAAATAGAGCAGCTCTTAGAAGCCGTATGGGCTCCAAAAGAAGTAGCAGTCATCCATTGCAAAGGGCATCAAACAGGAGGAGGTGATGAGGCTAGAGGAAACAGAAAGGCGGACAGAGAAGCCAAAAGAGCTGCAATGACAGAGGTAACTAAGAAGGAAGAGACCCNTACCATGCCCTTACTGGAGCTTCCCCTTACAGAACCCCCTAACTACTCCTCTAATGAAAAGGCNTGGTTCGAGCAGGAGAGCGGAAGTTACCAGAAAGGAGGTTGGTGGAAGTTCTCAGATGGGAGGCTTGCCATCCCAGAAGCAATNGCCCCCCGGTTCATAAAGCAGTTTCATCAAGGAACGCATATGGGGAAAACNGCATTAGAGACTCTCGTAGGACGGCATTTCTATGTGCCGCGCCTAACTGCCATCACTCGAGCCGTTTGTGAGCAATGTTTNACTTGTGCCCAAAACAATCCANGGCAGGGGCCAACACGGCCCCCAGGGATTCAAGAAACTGGAGCNACGCCNTGTGAAAACCTGCTTGTGGACTTTACCGAGCTGCCTCGAGCCGGAGGCTACCGGTACATGCTAGTGTTTGTCTGCACTTTCTCAGGGTGGGTCGAGGCATTTCCCACCAGGACAGAGAAGGCTCGGGAAGTAACCAGGATCTTACTAAAGGACATTATTCCTAGATTTGGACTGCCTCTAACTTTAGGNTCAGACAACGGCCCAGCATTTGTGGCAGAAGTAGTACAGCAGCTAACGCAGATGTTAAAAATCAAATGGAAACTGCATACAGCCTATCGCCCACAGAGTTCTGGAAAAGTTGAAAGAATGAACCGGACACTNAAACAGCTGTTAAAGAAGTTTTGCCAAGAAACTCATCTAAGGTGGGATCAGGTGCTGCCCATGGTCCTTCTCCGAGTCAGGTGCACCCCTACTAAATTAACTGGGTATTCACCCTATGAGATAGTGTTCGGCCGACCACCCCCAATCATAACTCAGATAAAAGGGGATTTAAAAGAAATTGGGGAATTAACCTTAAGAAGGCAAATGCAAGCCTTAGGTGAGGCCATGCAGGAAATACAAGGGTGGGTAAGAGAAAGAATACCTGTTAGCCTCACAGATGCAG
TACATCCCTTCCAACCTGGAGACTCTGTCTGGGTCAAACGATGGAACCCAACCACCTTCGGGCCTTTATGGGATGGCCCCCATATTGTGATCNTGTCTACCCCCACTGCTGTTAAAGTTGCAGGTATCACACCTTGGGTTCATCATAGCCGGCTGAAACCNGCAGCCNCAGCTCAGGACCAGTGGACCAGTCAACAAGACCCAGACCACCCGACNCGGCTGATCCTGCGGNGAAACCAAGCCGCNGCNGANAAGGACGACTGCCCTGCTCCGACCACACCGGAGGCTGGTCGGTCCACGCACGGCTGAAGCTTGAGGAAACATCAAGCCCTGCTCTAGTCACACAACTGGAAGCTGACTAGTCTACGCATGGCCGAAGCTTGAGGAAACGTCAAGCCCTGCTCTAGTCACACAACCGGAAGCTGACTAGTCTACGCACGGCCGAAGCCTGAGGAAGTCAACGNTAGATAAGTAAATGTGGATTGAATTTACAAGCGTAGTTATACTCTTACTTGTACTGATTGTTTTGCTGTCATGTTATCTTTGCAAATGCTGCCAAGCTTGTTGCCCAGAAGGGTGCCCGTGCATAGTATAAGCTTAATCATACTAGTAATACTGANGCTAACAGGCATGAAAGGGGACCAAGATGACTGTCATCACTGTATGATAGAAGCCTGGTCCGGAAAAGGTATGACTAAAACTCTGTTATACCAGACCTACTATGAGTGTACAGGGACTCATACGGGAACTTGTGTCTATAACCAGACTAGTTACTCGGTCTGTGATCCNGGAAACGGGCAGCCCCAAGTATGTTATGACCCAGAGTTCTTGCCCTATGACTTCTGGTTTGAAGTCCAAATTGGCGAACCCCTAATGCCATCATATACAAACCCCACAGAAACCGGGGTCGGTAAACTCGTAAACAAAACAGAGGTATTCCCTTACTCGCATAAAGGGCCTGTCTCCATATATTTTGATGCCTGCCAAGCTGCACATCTCAGCAAACTAAACAATATTGGGGCCGTCTGTAAAAATCTAGGACAAGAAAGAGTCAGCAGCAGAGCCGCCAAGGCCGTAACAGGAGAACCCGAAAAGGANTGCCCTGATTGTGACANTCAGTGGACCACACATGAGTTCAGCCAGCGCCTNTACGCAGGAAGAGTNGCTCTGCTTGCCAGCCAAGAGGCGAAGATNGGGTGCGCGACTGGAACATGCAACCCNCTCAATCTGACNATACTAAAGCCAAATATGCCTTTCTGGACTAAAGGGCATAAAGGAGNGCTAANCTTTGATCGGGAAGGAGCAAACCTAGGTATTCCNCTAGTCATTACNAAGAAGACCCAANGGGCCNAAGTTCAAGTTAGCCCAATGCAACAGTTCAGGTTTTNTAAATCCTTCAATGAACACTTTAACCCCGAGGNACCAAAAGTTCAAATTCCNCCNATATCAGCTGAAAACCTGTTCGCTCAGCTAGCCGAAAGTATTGCTANTAATCTNGGAGTCACCTCATGTTATGTATGTGGAGGTACCAATATGGGAGACCAATGGCCCTGGGAGGCTAGAGAATTGATGCCACAAGANAATTTTACCNTACCTGAATTTGTTACAAAGTTCAATGCAAACCCAAGTGTTTGGCTACTAAGGACCCCTATCATTGGAAGATACTGCATAGCACGNTGGGGAAAGGNCTTTCAAACCCAGGTAGGGGANACAACTTGCCTAGGTCAACAATATTTCGAAGAATCCGAGAACAAGACACAGTGGAGAAGCTTTATAGACAATTCCTCTGTGCCAGATTTTAATCCCCTCTTNCAGTTTCCAGCGCTAAATCAGTCATGGTATCAACTAGATGCTCCAAATGTTTGGAGAGCACCNGCAGGACTATATTGGATCTGTGGGACAAAGGCCTATCAACTATTGCCNGANAAGTGGACNGGAGCCTGTGTGTTAGGAACAATAAGGCCATCCTTCTTCCTACTCCCACTGNAGCAAGGGGAAGAT
CTAAGTTACCCGGTCTATGACGAAGANAGAAAAAGGGCCAGAAGAAACGTNTTTACNCAGATAAGTACCGTGGAAAAGATAAACACAAACATNAAGAAGGACATTGAAATAGGGAGCTGGAAAGACAATGAATGGCCTCCTGAAAGAATTATCAAATACTATGGGCCAGCTACNTGGGCNCAAGATGGGTCATGGGGNTACCGTACTCCTATTTACATGTTAAACCGAATCATAAGATTGCAAGCAGTACTAGAAATCATAGTCAATGAAACAGCCCGAGCCTTGGATTTGCTAGCCATACAGGCNACCCAGATGAGAGATGCCATATATCAAAATAGGCTAGCATTAGACTATCTCCTAGCCTCAGAAGGAGGAGTTTGTGGNAAACTTAATTTGACNAACTGCTGCTTACAAATCGATGACAATGGAAGAGCTGTCATGGAAATCACTGCCAGGATGCGGAAGTTAGCCCATGTCCCGGTCCAGACCTGGTCCGGATGGAGCCCAAATTCACTTTTTGGAGGATGGTTCTCATGGTTTGGAGGCTTTAAAACTTTGATAATCGGTTTTATAGCCATAATAGGNGGATGCCTAATNCTNCCTTGTCTCCTGCCTCTTCTCATCAGAAGCATCCAGTCCACCATAGAAGCAATAGTGGACCGGACAACTACCACCCGAATAATGGCNCTGCAAAAGTACCAACCGGTNCCCCAAGAAGAGTATGTACCCACNCAAGAAGAAATAGATAACTGTGGTGCTCTTTATTAATCTACATTTATGNCGAGCACCAAAGGGGGGGAA



TF motifs of the consensus sequence

Transcription factor motifs detected with FIMO in the consensus sequence of the TE family.

TE_family TFBS Start End Strand Score Matched sequence
HERV1_I BPC5 822 851 + -41.50 TGGATCCGAGAGTGAAAGTGAGAGTGAAAG
HERV1_I BPC5 2472 2501 + -42.43 GAATGTCCAGAGAGAGAAAAGGATAAAGGC
HERV1_I BPC5 587 616 + -42.90 GGGAGGTGTGTGAAAGTGTGTGAAAGAGAC
HERV1_I BPC5 826 855 + -43.20 TCCGAGAGTGAAAGTGAGAGTGAAAGCGCG
HERV1_I BPC5 2480 2509 + -43.84 AGAGAGAGAAAAGGATAAAGGCAACAACCA
HERV1_I BPC5 4980 5009 + -43.96 AGAAAGGCGGACAGAGAAGCCAAAAGAGCT
HERV1_I BPC5 8105 8134 + -45.79 TGAAATAGGGAGCTGGAAAGACAATGAATG
HERV1_I BPC5 830 859 + -46.12 AGAGTGAAAGTGAGAGTGAAAGCGCGCCGC
HERV1_I BPC5 8095 8124 + -48.12 AGAAGGACATTGAAATAGGGAGCTGGAAAG
HERV1_I BPC5 589 618 + -48.30 GAGGTGTGTGAAAGTGTGTGAAAGAGACGG
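
Several of the hits listed above overlap one another. The following is a minimal Python sketch, not part of the original analysis, that checks each matched sequence against its Start/End coordinates and collapses overlapping hits into regions. The function names are hypothetical, and the sketch assumes that coordinates are 1-based and inclusive and that a higher (less negative) score is the better match.

```python
# Rows copied from the hit table above: (start, end, score, matched sequence).
HITS = [
    (822, 851, -41.50, "TGGATCCGAGAGTGAAAGTGAGAGTGAAAG"),
    (2472, 2501, -42.43, "GAATGTCCAGAGAGAGAAAAGGATAAAGGC"),
    (587, 616, -42.90, "GGGAGGTGTGTGAAAGTGTGTGAAAGAGAC"),
    (826, 855, -43.20, "TCCGAGAGTGAAAGTGAGAGTGAAAGCGCG"),
    (2480, 2509, -43.84, "AGAGAGAGAAAAGGATAAAGGCAACAACCA"),
    (4980, 5009, -43.96, "AGAAAGGCGGACAGAGAAGCCAAAAGAGCT"),
    (8105, 8134, -45.79, "TGAAATAGGGAGCTGGAAAGACAATGAATG"),
    (830, 859, -46.12, "AGAGTGAAAGTGAGAGTGAAAGCGCGCCGC"),
    (8095, 8124, -48.12, "AGAAGGACATTGAAATAGGGAGCTGGAAAG"),
    (589, 618, -48.30, "GAGGTGTGTGAAAGTGTGTGAAAGAGACGG"),
]


def check_coordinates(hits):
    """Return True if every matched sequence spans exactly end - start + 1 bases.

    Assumes 1-based, inclusive coordinates (an assumption, not stated in the table).
    """
    return all(len(seq) == end - start + 1 for start, end, _, seq in hits)


def merge_overlapping(hits):
    """Collapse overlapping hits into (start, end, best_score, n_hits) regions."""
    regions = []
    for start, end, score, _ in sorted(hits):
        if regions and start <= regions[-1][1]:
            # This hit overlaps the previous region: extend it and keep the
            # highest score seen so far.
            prev_start, prev_end, prev_score, n = regions[-1]
            regions[-1] = (prev_start, max(prev_end, end), max(prev_score, score), n + 1)
        else:
            regions.append((start, end, score, 1))
    return regions


if __name__ == "__main__":
    print("coordinates consistent:", check_coordinates(HITS))
    for region in merge_overlapping(HITS):
        print(region)
```

Under these assumptions the ten BPC5 hits fall into five distinct regions of the consensus.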


TFBS enrichment in GRCh38

Enrichment of transcription factor binding sites in GRCh38 copies of the TE family, assessed with Fisher's exact test.
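
As an illustration of this kind of test, the sketch below computes a one-sided Fisher's exact p-value for a 2x2 contingency table using only the Python standard library. The layout of the table (TE copies versus background regions, with or without an overlapping binding site), the choice of a one-sided test, and the example counts are all assumptions made for illustration. They are not taken from the database, which does not state how its table is built.

```python
from math import comb


def fisher_exact_greater(a, b, c, d):
    """One-sided Fisher's exact p-value for the 2x2 table [[a, b], [c, d]].

    Hypothetical layout (an assumption, not from the source):
        a = TE copies overlapping a binding site
        b = TE copies without an overlap
        c = background regions overlapping a binding site
        d = background regions without an overlap
    Returns the probability of observing a or more overlapping TE copies
    under the hypergeometric null with all margins fixed.
    """
    row1 = a + b            # total TE copies
    col1 = a + c            # total regions with an overlap
    n = a + b + c + d       # all regions
    upper = min(row1, col1)
    tail = sum(comb(col1, k) * comb(n - col1, row1 - k) for k in range(a, upper + 1))
    return tail / comb(n, row1)


if __name__ == "__main__":
    # Made-up counts, for illustration only.
    print(fisher_exact_greater(3, 1, 1, 3))
```

A small p-value from such a test would indicate more overlaps within the TE family than expected from the margins alone. When many transcription factors are tested, the resulting p-values would normally need a multiple-testing correction.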




GTEx

Promoter activity across 46 body sites from the Genotype-Tissue Expression (GTEx) project.




TCGA

Promoter activity across 33 cancer types from The Cancer Genome Atlas (TCGA).