HERV16

Basic information Differential Expression Stage analysis Survival analysis Correlation analysis

DF ID DF0000162
TE superfamily ERV3
TE class LTR
Species Theria_mammals
Length 4996
Kimura value 32.10
Tau index 0.9660
Description ERVL endogenous retrovirus, HERV16 subfamily (internal)
Comment LTRs of HERV16 are listed in Repbase as LTR16 sequences. HERV16 is related to HERVL and is relatively old (>20% divergence from consensus). Bases 300 to 1300 encoded a GAG protein closely similar to the murine "retrovirus restriction polypeptide" (PIDs e24674 and e242676).
Sequence
GTTGGTACCAGGAGTGGTCCGAGAAAGCAGACGNTNCTAAGATGGGATTTTGGAGCTGGATCACCCGCCGNCCGGCTGGCAATGAGGACCCCATCACTGGTGGTAGGTGGAGCACGGATAGCCCCTGGCACGAGGTAGCGGTGCAATTGTTAAAACTTTCACCGGTGGTGAACTGGGATGGNATACCGGTGGAAAGAGAATGCACTGGCNGGTGCAATGTNTCAGGCGTTTGAGAAATATGGGANGAATTAANTACATGNAAGGACAATGGAATTGGATGGCTGTTGCTAAGCNCGACTGACGCTCTGGAAAAAGACAATGAAAGGCTGAGAGCGATTAATCGNCAATTNAAAGCTAAGTGTGAAAGCCAGAGGGCCTCCTTGGCAGCATATAAAGAGACTCTCATCTCCTGCAGCCGGAGGGCAGAGAAAGCTGAGGATCAGGCCCAGGACTTAATCAGAGTAGCAGAGCTCCAGAGAAGGTTGAATTCTCAACCNAGGCAGGTCTGCTATGCCAAGGTCAGGGCCCTGGTTGGGAAAGAATGGGACCCTGANACNTGGGATGGGGACATCTGGGTCGATGCNCCTGAAAATCTTGAATCCCCAGATTCCCCTGAACCCTCTGGGCCTGCAGAAGTGGCCCACTCCTCCCTGTTAAGGGCTAGCACTCNCTCCTTGCTTCGCGNGAAGACGATGCAGAGGCCTCTNCCTTNGCAAGACAACATGCGCCCCCCTCAGGATCTGCCCCCACCTCCCCTCCTGGCCACCAGACCAATAACTAGGGTTAAGTCACAGCATAACCCGGCCGGGGAAGTGCTGGGCCTGNTAAGGGAGGAAAGGGACTATACCCCAAAGGAGCTGCAGGACNCGNCTAGCCAGCATGTACCGGCAGGACCGGGAGAGTACGCATGGGACTGGATTCTGAGGGTGCTGGATCAAGGGGGNCGGAACATAAAGTTGGATAAGGGAGAGTTTATCGATNTGGGAGCACTCTCCCGNGATACAGGATTTAACACCCTGGCAAGGACCCCGGGAGATGGTGCNAACACGCTGCTAGGATGGCTCCTGGAAGCNTGGAGAAAGCGATGGCCCACACTAAGTGAAGTAGAAATGCCAGAACTGCCGTGGCAGACGGTGGAAGAAGGGATCAAAAGGCTCAGAGAAGTGGGCATGCTAGAGTGGATATACTATGTAAGGCCGGAAAACCCACCAGATGACTATGTTCCGCGGGAGGGCCCAGAGGACACNCCATTTACCAAAGCNATAAGGAATGCGCTGGTGAGAGGGGCACCAGCATCACTGAGAAGCTCAGTGGTGGCTNTCCTCTGCAGGCCAGGGCTGACGGTAGGAGANGCCGTTACAGAACTGGGCTCNCTGATAGCAATGGGGATGATAGGACCCCGAAATAATAGAGGCCAGGTGGCGGCGCTTAACCGTCAGAAGCAAGGTGGGCGCAATTATCGTAATGNACGGCAAGGTCGGAGTGGCAGCCGGGGGGGCCTGACCCGCAGAGAGCTATGGAGATGGTTAATAGAACACGGCGTCCCTAGGGGCAAGATAGATGGGCAGCCAACAGGAAGNCATGGACAGTTAATAGAACATGGCGTCCTAGGGGCAAGATAGATGGGCAGCCAACAAGGGTGTATTGCTCAACTNAAACAANCAAAAGAAATCAAGGATGGATGANCAGGAGGCTGAGGGCAGTCGCCCCAATAAAAAGTCACGATCCCTTGCCCAGTTTCCGGACCTGAGCCAGTTTTCAGACCCGGAACCCATTGACTGAAGGAGAGGCCGGGTCCCCAGGAGGAAGGACCCTGCAACACCACGGCAAGTGTACACGGTAATGATTCCCCCAGTCCTTCCCCAAAGGGACCTACGGCCATTTACTCTGGGTAACCGTACACTGGGGAAAGGGGAATACCCAGACATTTCGAGGACTGTTGGACACAGGGTCCGAGTTGACATTGATACCCGGAGACCCGAAGCGTCATCATGGCCCTCCCGTTAGAGTGGGGGCATATGGGGGCCAGGTAATAAATGGAGTCCTGGCCCAGGTCCGGCTCACAGTGGGTCCACTGGGTCCACGGACCCACAGTGGTCATTTCCCCGGTCCCCGAATGTATAATTGGAATGGACATACTTGGTAGTTGGCANAACCCCCACATTGGTTCCTTGGCCTGTGGGGTAAGAGCTATCATAGTGGGGAAGGCCAAGTGGAAGCCTCTGAAACTGCCCTCACCTTCTCCGGNCAAGATAGTAAATCAAAAACAATATCGCATCCTCGGGGTGGAGAATGGCAGAGATTAGTGCCACCCTTAAAGACCTAAAGGATGCAGGAGNTGTGGTCCCCATCATATCTCCATTTAATTCACCAGTNCNGGCCCCTGCAAAAACCAGACGGATCCTGGANGACGACAGTGGACTACCGCAAACTCAACCAAGTAGTANCCCAATTGCAGCTGCTGTGCCAGATGTGGTATCTTTGCTAGAGCAGATTAACACGGCCTCAGGTACGTGGTATGCGGCCATTGATCTGGCGAATGCGTTCTTTTCCATCCCTATCAGAAAGGAGGATCAGAAGCAGTTCGCATTCACNTGGAACGGACAACAGTATACATTTACAGTCTTGCCCCAGGGCTNTGTTAACTCTCCTGCCCTCTGTCATAATATAGTCCGAAGGGACCTGGACCATCTGGACATTCCGCAGAACATCACATTGGTCCACTATATCGATGACATCATGCTAATCGGACCGGAATGAGCAAGAAGTGGCAAGTACGTTGGAGGCCTTGGTAAGACACATGCGCTCCAGAGGGTGGGAGATAAACCCTACGAAGATTCAGGGGCCTGCCACATCAGTGAAGTTTTTAGGGGTCCAGTGGTCTGGGGCATGCCGGGACATCCCCCCAAAGTAAAGGACAAATTATTGCATCTTGCACCTCCCACCACNAAGAAGGAAGCACAACGCCTGGTAGGCCTCTTCGGGTTCTGGAGGCAGCATATTCCACACTTGGGAATACTGCTCCGACCCATNTACCGGGTGACGCGAAAGGCTGCCAGCTTTGAGTGGGGCCCAAGCAGGAAAGGGCTCTGTAGCTNACCGCATATCGTGTNCGAANCGCANANNCGGNCCTCCAAGTCATAAAGTNGGGCGGGCCCAGCAGCAGTCCATCATAAGGTGGAAATAGNATATCCGAGATTAAGCCCGAGNAGGACCAGAGGGCGCAGGTNAGNTNCATGAGCAGGTAGCCCAGACCCCCATGNCACCCACCACNGTTGCACCAGCGCCTCTCCTTCGGCTCGCACCTATGGCCATATNGAGGNNTGTGGTCCCGTATGACCAGCTGAAGGAAAGCTCGAGCTTGGTTTACAGGATCAGCTCGGTATGTGGGCGCAAGCCAAAATANGTGGTGGCTGCACTACAGCCCCATTCAGGGGTGGCCCTGAAAGGCAGCAGGGAGTGAAAATATCTTCCCAGTGGGCAGAGCTGCGAGCAGTGCACCTGGTCATCCACTTTNCGTGGAAGGAGAAGTGGCCCGAGATGGTAATATATACGGATTNCTGGACAGTGGCCAATGGCCTGGTCGGCTGGTCAGGAGCTTGGAAGGAAAAGATTAGAGANAGGGAAGTCTGGGGTAGAGGCATGTGGATAGACATATNGGAAGTGGCACGAAATACGAAGATTTTTGTATCGNTNNTATGTTAACGCTCACANGAAAGCATCNACCATGAAAGAGNCACTGAACAACCAAGTAGACAAAGTGACTTCGACCACTGACGTTAGCCAGCCTTCGTCCCGGCCTCCTCAGANCTGGTNCAACGGACATCTCAANAAAGTAGCCATGGTGGCAGAGATGGAGGCTACGCATGGGCCCAACAGCATGGACTCCTATNCACCAAGNCGATCTAGCTGCTGCCCCGTNTGAATGTCCAACCTGCAGCAACAGAGACCAATGCTCTACCTCNAATACGGCACTATTTATGGAGACCACTTAGCGGCGAATTGACTATATTGGACGCCTTCCATCCTGGAANGGNCAGNGATTCATTCTCACAGGAATANATACNTATTCCGGGTATGGGTTTGCCTTTCCTGCCTGCANAGCCTCATCNAGCATCTGAGGGNCTTAGGAGTGCCTGATCCACAGGCATGGAATCCCACACAACATAGCATCTGANCAGGGCGCTCACTTCACAGCAAAGAAGNGGCACAAGTGGGTCCANAATAACGGGATCCACTTATCNTATCACGTACCGCACCATCCAGAAGNAGCTGACCTNATAGAACGCTTCTGAAGGCACAGNTGAAGCACCAGTNGAGGCGAACTCTGAAAGAATTGGGTGCCATCCTCCAGGATGCAGTATATACATTGAATCANAGGCCTATNCGTNGGGCNTTGNNTCTAATNGGGAGAATGGGATTAGGAATAAGAGGTGAAAGCAGGAGTGTGCCTACTCCCCATCGTAGTGGCCCACAAGGATCACAGCAGTGGTTTGAACACGGATTGGACTACTGATATCCAGGAATCGCCGCACAAGAAGGNGAATCCACATTTGGCAGGAGTAATTGACCNTGATCANCAGGAGGGGTAGGGCTGCTNTCGCANAATGAGGAAGAAAGGAATATGTNTGGAACCAGGTGATCCNCTTGGGTGCCTCCTGGTACTCCCTTGCCCCATTTAACTGTAAATGGACAATNCCACCGAAAAGAGCTCGGACCNCTCCGGAATGAAGGTTTGGNTCACATCGCCNGGTAAGCCACCAAGACCTGTCGAAGTGATAGCTGNGTGAGGGANATCTAGAATGGATAGTAGAAGAGGGAGACGATGAGTACCAGTTNCGGCCCCGAGACCAACTGCAGCAANNGGGGCTGTAGTTCGTCCCACTAACCTCCCTCTTCTAAGTTTCGCTTCAGAAAGAGAAGCTTAAAGGAATAATGGAGGAACTGCTCCNNGAACCTGNGTGGAGAAGTAGATCCGTCGAGTACAAAGGTGGAC



TF motifs of the concenus sequence

Use FIMO to detect transcription factor motifs in the concenus sequence of the TE family.

TE_family TFBS Start End Strand Score Matched sequence
HERV16 RVE6 3463 3471 - 16.23 AGATATTTT
HERV16 TCP3 2341 2348 - 16.23 GGGACCAC
HERV16 TCP3 3326 3333 - 16.23 GGGACCAC
HERV16 RVE5 3463 3471 - 16.21 AGATATTTT
HERV16 TCP24 2341 2348 - 16.21 GGGACCAC
HERV16 TCP24 3326 3333 - 16.21 GGGACCAC
HERV16 ERF105 1419 1428 + 16.17 GTGGCGGCGC
HERV16 TCP5 2341 2348 - 16.15 GGGACCAC
HERV16 TCP5 3326 3333 - 16.15 GGGACCAC
HERV16 NFKB2 1912 1922 + 16.12 GGGGAATACCC


TFBS enrichment in GRCh38

Use Fisher's exact test to perform enrichment analysis of transcription factor binding sites in the TE family of GRCh38.




GTEx

The promoter activity across 46 body sites from The Genotype-Tissue Expression (GTEx) project.




TCGA

The promoter activity across 33 cancer types from The Cancer Genome Atlas (TCGA).