HERVK3

Basic information Differential Expression Stage analysis Survival analysis Correlation analysis

DF ID DF0000195
TE superfamily ERV2
TE class LTR
Species Simiiformes
Length 7202
Kimura value 9.82
Tau index 1.0000
Description Internal region of HERVK endogenous retrovirus, HERVK3 subfamily
Comment The associated long terminal repeat is LTR3, and has 6 bp TSDs.
Sequence
AGTGGCGTCCGAACACAGGGACTTCGAGGACGTGAACGAAGAAGGTCTGCTGGAGCAGAGGAACTGAAATTGACAAGGCGAACGGGGACCCCGGGACGAGTCTGCCGGCAGCGGATATAAGGTCAGTGCCCTAAAGAGGTACTGGGAGCAATATAAGGTCAGTGCCCTAAAGAGGTACTGGGAACGGGAAGTTTTCTGAATCAGNGGTAACATGGGGCAGAATTTGTCTATTGAAGAAAAACATTATCGTGCAGTTGCTTAAAGTTCTGTTGAGACAGTCTGGNGCTCAGGTTAGTTCNCAGACACTAACTAAGATGCCGCAGGAGGTTATTACGCATAACCCATGGTTTCCACAGGCAGGCACTCTTGATGTGGAAAATTGGGACAGAGCAGGAGAAGGATTAAAACGGGCTCATCAAAAAGGTCTTAAAGTTGATTCTTCTGTTTTCTCCACTTGGAGTTTAGTTCGTACTGTNCTTCTGCCATTATCTCCTTNTTATTCTGCNGGACAGCAGGAGTCATGTTCTGAGTCTAAAAATCTGAAAGAATCTGTTGTCCCACCCACAGCTCCAATTGAAAATAAAAAACAGGAGAGGGAGGATAAAAATTGGCCTATACCGCCTCCTCCAGTTGCAGAAACATCTGTACCGCCTCCTTCGGTAGCAGAAATAGAGACCCCAATACAAAGAATTTTACGCTCTGCTGCCATAGCTGGAGAGCCCTTAGGACCTNTGCGCTTTTCCTATTTCCGTAAGGCCTGATCCAAATAATCCACAGCAGNTTATTCATGAACACACCCCACTAGAGTTTAAGTTGTTGAAGGAATTAAAAGCTAAGTGTGGTNAATAATGGCGTACAGAGCCCATTCACTTTAGGATTGCTAGAATCTGTGTTTGGTGCTATGCGTCTTCTACCCTTTGATGTAAAACACTTGGCNCGAACTTGCTTGTCTGCTAGTGCATATCTGACATGGAATTTAAATTGGCAAGAAATGTGTGCAGACCAGGCTAGACAGAACCGTGCTGCTGGACACGGAGACATTACAGAGGATATGCTGTTAGGTAATGGCCCTTNATTCAGACCTGGAACGTCAAATGGCACTCCCAGACGCTGCTTATCAGCAGTGTGCACAGGCCGCTAAACGCGCCTGGGCCACAATTCCTGAAGAGGGAGTCCCAGTACAATCCTTTTTACATATCATGCAAGGGTCGCAGGAACCCTATGCGCAATTTCTTGCAAGATTACAAGAGGCAGTGAAGCGTCAGATTCCTCATACCGCTGCCGCAGAAATGCTAACCTTAACTCTAGCTTTTGAGAATGCAAACGCGGATTGTAAACGTGCACTGGCACCTGTGAGGTGTAAAAAAACTTGGGAAATTTTCTCAGAGCTTGTCAGGATGTAGGAACTGAGCTTCATCGCTCTGCAATGTTAGCNCAAGCAATGGCTAATTTAGCAGTTGACAAATCTAAAAGGAGCCAAGGGTCAAACCCTAAAATGGGAAAATGTTATAATTGTGGAAAAACTGGACATTTTAAAAAGGAATGCCGCCAGATCTCAGGACAGAAAGGACCTTACAATGCAGTNCCCCCCACCCCCGCGNNCCAGCGGAAAAAAACGCCAGGACTTTGTCCTCGCTGTAACAAAGGAAATCACTGGGCTAATCAGTGCCGCTCAAAATTTCATCAGAATGGCACCCCCCTGTCGGGAAACGAGANGGGGGCCTGGACCCGGGCCCCTCAAACAATGAGGGCATTCCCAGTCCAGACCACAACCCCGTTTCAGGGATGGGTTCCCGGAGGNACATTGATTCCCTCACCCCAGGAACACCAGGAAGTGCAGGATTAGATCTCCCCGCCAGAGAAAGAATTACGTTAGTTGGNGGAGACAAACCCACCAAAGTTCCCACTGGCATTTGGGGACCTTTACCAGCAGGATACATGGGACTAATTTTAGGCAAAAGCCGCCTTAACTTGCAAGGCATTACTGTAGTCCCAGGAGTNGTTGACTCCGATTATGAAGGAGAAATTCAAGTAGTTTTAATGTCACAAGATCTTTGGGTTTTTGAACCGGGAGAATATATTGCTCAATTATTGCTTATTCCCTGCAAATTACACCCTTCTCCACGAAAGGAGAAACGAGGAAATAAAGGGTTTGGGAGCACAACTACATGGGAAATCTATCTATCCNCAACCCATAGCCTCTAATAGACCCACCTGTGTAGTACAAATTAAAGGAAAGAAATTTTATGGGCTTATGGATACGGGAGCTGATGTGTCAGTAATATCTAGNAACGACTGGCCCCCATCCTGGCCCCTGCGATTAACTTCTACATCCCTAGTGGGAGTAGGAACAGCTCAAAGTGTTCAACAGAGTGCTGAGATTTTACCTTGTCTTGGTCCGGATGGACAGTCATGTACTTTTCAGCCTTATGTTGCAAATATAGCTATCAATTTATGGGGTCGAGACTTACTTACAGCATGGGATATGAGACTTACAAATGAAAACTTTGATAACCCAGGATTTAAAATGTTGAAGGACATGGGATATCAGAGTGGAAAAGGTTTAGGGAAATTCCTACAAGGAAACCCTAACCCGATATCAGTAACTGGAAAAACAGATAGAAAAGGGCTAGGACGTCAGGATTTCTGACGGGGGTCATTGATATTTCTCCTCCGCCCACTGCCTTACCATTAGAATGGCTNAGTGACAAACCTGTGTGGGTGGATCAATGGCCCCTANCACAGGAGAAGCTAGNTCAACTTCATCNGCTAGTAAAAGAGCAATTGGATGCAGGACATATAGAGAAGAGTTNCAGCCCCTGGAATTCACCGGTGTTTGTTATTCCAAAAAAGTCCGGAAGATGGTGACTGCTGCATGATTTGAGAGCTATTAATGCACAAATTAAACCGATGGGTGCATTACAGCAAGGTCTGCCATCCCCAGCGGCCATTCCAAGAGACTGGCCTCTCGTAGTAATAGATCTTAAGGATTGTTTCTTTACTATACCNTTACACGAGAAGGATAAGCCTCGATTTGCCTTCTCTGTGCCTTCTATTAATCAAAGAGAACCTGTTTCTCGTTATCAATGGAAAGTTTTACCCCAAGGCATGCTTAACAGTCCTACGCTATGTCAGCATTTTGTAGGACGGGCATTAAAGGAGCCTCGGAATATGTTTCCCACTGCTTACATCATTCATTNTATGGATGATATTCTTTTGGCCGCTCCTACAGATCAAATCTTACATCAGTTATTCAGAGAAACAAAGCGGGCTTTGACTAAATGGAATCTCAAAATNGCTCCAGAGAAGGTGCAAACAACTTCCCCATACCANTACTTAGGAACTATTGTTACGGAGAGAAGTGTACGGCCTCAGAAAGTAGTTCTCCGTAAAGACAGGTTACAGACTTTAAATGATTTTCAACAATTATTAGGGGATATTAATTGGCTGCGCCCGATGCTAGGTATTGCTACCTATCAACTTACACATCTTTACCAAACCCTGCAAGGAGATTCTTCNTTAAATTCCCCGCGGCAACTNACTAAAGAGGCAGAAGCCGAGTTACGGCTTGTAGAGCAGATGCTTCAGCAGAGACATGCCTCNCGGCTACAGCCGCAAAAACCTTTGCTTTTGTTTATTCTTCCTACCCCCCACTCTCCAACAGGACTTTTGGGCCAGTTCATAGACAAGTCTGTAACAGTAATAGAATGGCTCTTTCTACCTAATCAGTCAAAACCTTGCAAGTTTATCTTTCTTTAATTACACAAATTGTGACTATGGGCAGGCATAGGTCAAAAATGCTTACGGGATATGATCCNGACAAAATTATTGTTCCCTTAGACTCCCAGCAACAGGCCGCAGCNTGGGAAATGTCGACTGCNTGGCAAATCGCTTTCGCAGACTTCGTGGGAATAATAGATAACCACTATCCCTCAGACAAAATTTTGCAGTTTTATAAAGTCCATTCTTTCATTCTTCCTGTGATTACTCATCACAAGCCTATTCCAGGTGGACAGACTTATTTTACTGATGGCTCTTCCAAAGGTCGTGCAGCTATCTATGGACCTAAACATACTCAAACAATAATGACCTCTGGGGTTTCAGCTCAACGCTCAGAGCTAATTGCAGTCATTCAGGTTTTACAGCTCACAGCTTCAGATCCTATCAACATTGTCTGTGATTCAGCTTATGTTGTAAATGTAGCCAGTCGCATAGAAACTGCTACAATTAAAAGTACACTAGACCCAGAACTGCTTAATTTGTTTCTAAGACTTCANACAGCTATTCGCTCTCGTGCAGCTCCTTTTCATATTTCTCATATTCGCTCTCACACACAACTTCCTGGACCACTATCTCTAGGTAATGATAGAGCAGATAAACTGATTGGTTCTGTGTTTCAGCAAGCTCAAGCNTCTCNATGCGCTACTGCACCAAAACACCTCCGCCCTTACTCGCATGTTTCATCTGCCTCGCAGCCAAGCTAGGGCTATNGTACAAGCCTGTCCTACTTGCCAGCATGTCCCTGGNGCCGCACCTGTAGAAGGNTGTAACCCACGAGGTTTGGCTCCAAATGAAATCTGGCAAATGGATGTTACACACATAGCAGCCTTTGGCAAGCTTAGCTATGTTCTGTGANCTATAGACACTTATTCTCATATGCTGCATGCTACATGCCAAACAGGTGAGACAGCTGGTCATGTACGGCGACATTGTCTGTCATCATTTGCTCATATGGGGATACCTAAACAATTAAAAACTGACAATGGACCCGCTTATACTAGTCATGCTTTTCAAAATTTCTTACAGCTTTGGGCTATAACCCATAAAACAGGAATTCCTTATAATCCTAGAGGACAAGGCATTATAGAGCGGGCACATCAAACATTACAACGCATGTTGAAAAAACAAAAAGGGNGGTATAGGAGGCCAACTACCACCTCAATCAAAACTACATTTAGCCTTATTTACTTTAAATTTTTTNGACTCCTGGTACGGATGGTAAGACTCCAGCAGAAAGACATTGGCAAGTGTTAGAGGAAAAGAGGAAAGTTTATCCGAAAGTGTTATGGAAATCCCCGGAAGAAGNGACAATGGAAAGGTCCGGTGGATTTACTGACGTGGGGANGAGGGTATGCTTGTGTTTTTACAGGAGATGGACAAACCGTGTGGGTGCCCTCAAGGTGCGTGCGACCATGGAACGGGAGACTGGAGGAACCCAGGGTGGCCAACCATGGGCCCGGTCCCTCCGGTACGAGCCATGAGCCAGCTGAGCCTGAGTGCAAAGACGGAGAGAAGGCCGACCGGAGTCACGACGACATCAACCCCCATAACCTGGGGACAACTCAAGAAAACCACGCAGGAAGCTGAGAAACTACTGGAGCGTCAGGGNCAGGCAAAAACCCCTGATTCCATGTTCTTGGCCATGTTAGCCATAATGTCCTGTGCGGTATGTTTTCCCTGTGCAGAGGCAAAAACATATTGGGCATATGTTCCCAATCCCCCAGCAGTACGACCTGTACTTTGGAGTGACACTCCTCCTGAGATTTATCATGATCAGGGAGCGTGGGCTCCAGGACCCCTAACTCCCCTGACANTAGAACAGTTAGACTCTCAGAACAATGTCATCAATTATACCGCTCCACTGGAAGGACTCCCTTTGTGTATCACCACAAAGACGTCGCTCAGCCGTAGCTGTCTTACAATTCAAGCTCAAGCATGGTTGAGTCACTATGGAAAAGTCATGTACTTATTAGGTCTTGGTTCTATTAATGTAACTGGTGTGCTAACCAACCATTCCCGGCCCAATCGCCCTAATTGTGCTGACTATACGGAATGGATTCCCTTCAATAGTTCCTACCCCCCCTCNCGTGGACCCAGTGTCTTGGCCCACTGGCTAGAAAACAATCTATGTTAACTGGAGACATTGTGGATTGGGGACCTAAAGGTCAATTAGATGGAAAAGATGAAAATCAGAAATCATGGCACAAACTTCGCTGGCATTGGTGGCAAGCTTTTAATGCTTCTTCTTTATACNACACCGGGATCCAATCCCAGTCTGCCGCCCAGATTGCTTGGCATGGAGCAGGCTTTAGCCCGCCTCTTCCTCAGTGGCATTATCTAGGGAGGAAAGGACCAATTCAAGAGACGATATGGAAGGCAGCACTCCCATTTACGAATGGAGCATCTGGGTTNGGGATACTATCCAATAATAGCAATAGTAAGCGACACAGTCTTAATGTTACATTTGTAAAGAATATCACCACTCAATTTACGGTTTGTGTTTTTAATCCTTATGTCTTTTTGGCAGCTAAGAAGGACCAGCTCCAGGTAAACAATACCCAATTGACCTGTAAATCTTGCCAGTTATATCACTGCATTAATCATAGCACATTGCAAACACATAATATCTCTACTTTGATGATTTTGGGTCGCATCCCTGGGCTATGGATTCCTGTTAATCTGTCCGAGCCTTGGGCTGCCACACCTGCTTTGCATTTTGTGAAACTTCTTCTAACTCAGCTTACTCATCGTGTCCGTAGAGCCTTAGGCATGATAATTTTTGCTATTGTTTCCTTGGTCACACTAATAACTTCTGTTGTGATGTCCTCTGTAGCTTTGCATAGTTCTGTTCAAACAGCTCAGTACGTGGAGAACTGGACGCGCACAGCCGACCAAGCGTGGCTACTTCAGAATAAAATTAACACTGAGTTACAAACTGAAGTGGCAATGTTGAAATCCACGGTTCTATGGTTAGGGGAACAAGTACAAAGCTTGCAGTTGCAGCAGCAATTGCGTTGTCATTTTAATCACACTCATATTTGTGTAACCAACTTAGAATATAACCAAAGTGAGTATCCGTGGGACCTTGTGAAAGCCCATTTGCAGGGAGCTTTCACATCCAACATCACCTTTGATATTGGTGAATTACAAAACAAAATTCTTGATTTAAATAGGCAAACTCAAGAGTTTCAGCCTTCTTTAGAAGACTGGACCGAATTCCAGCAAGGCCTGGAGAGCCTCAACCCTTGGACCTATCTAAGGCACCACATTAACATCTTATATGTAGTTCTTGGAATAATGTTGTTTTGTCTCTGTCTTCTGTTCATAGTCTGTAAAATCGGATGGACCGCCAATCGGAGAATGAGAGCTGCCCAGCCTGGCCTTACATTCTTTCAATTAATNCATAAACAGAAAGGGGGATA



TF motifs of the concenus sequence

Use FIMO to detect transcription factor motifs in the concenus sequence of the TE family.

TE_family TFBS Start End Strand Score Matched sequence
HERVK3 NR2F1 155 169 + 16.04 AAGGTCAGTGCCCTA
HERVK3 Mafb 3131 3141 - 16.03 AAAATGCTGAC
HERVK3 MYB96 4926 4936 + 16.00 GCCAACTACCA
HERVK3 ATHB-40 2086 2096 - 15.98 AGCAATAATTG
HERVK3 ZHD3 2086 2094 + 15.98 CAATTATTG
HERVK3 Gfi1B 2375 2384 - 15.95 AAATCTCAGC
HERVK3 Stat5a 7067 7075 - 15.91 TTCCAAGAA
HERVK3 NRL 5814 5825 + 15.89 ATTGTGCTGACT
HERVK3 HAT22 2086 2094 - 15.87 CAATAATTG
HERVK3 ATHB-53 2086 2094 - 15.87 CAATAATTG


TFBS enrichment in GRCh38

Use Fisher's exact test to perform enrichment analysis of transcription factor binding sites in the TE family of GRCh38.




GTEx

The promoter activity across 46 body sites from The Genotype-Tissue Expression (GTEx) project.




TCGA

The promoter activity across 33 cancer types from The Cancer Genome Atlas (TCGA).