HERVK3


DF ID: DF0000195
TE superfamily: ERV2
TE class: LTR
Species: Simiiformes
Length: 7202
Kimura value: 9.82
Tau index: 1.0000
Description: Internal region of HERVK endogenous retrovirus, HERVK3 subfamily
Comment: The associated long terminal repeat is LTR3, and has 6 bp TSDs.
Sequence:
AGTGGCGTCCGAACACAGGGACTTCGAGGACGTGAACGAAGAAGGTCTGCTGGAGCAGAGGAACTGAAATTGACAAGGCGAACGGGGACCCCGGGACGAGTCTGCCGGCAGCGGATATAAGGTCAGTGCCCTAAAGAGGTACTGGGAGCAATATAAGGTCAGTGCCCTAAAGAGGTACTGGGAACGGGAAGTTTTCTGAATCAGNGGTAACATGGGGCAGAATTTGTCTATTGAAGAAAAACATTATCGTGCAGTTGCTTAAAGTTCTGTTGAGACAGTCTGGNGCTCAGGTTAGTTCNCAGACACTAACTAAGATGCCGCAGGAGGTTATTACGCATAACCCATGGTTTCCACAGGCAGGCACTCTTGATGTGGAAAATTGGGACAGAGCAGGAGAAGGATTAAAACGGGCTCATCAAAAAGGTCTTAAAGTTGATTCTTCTGTTTTCTCCACTTGGAGTTTAGTTCGTACTGTNCTTCTGCCATTATCTCCTTNTTATTCTGCNGGACAGCAGGAGTCATGTTCTGAGTCTAAAAATCTGAAAGAATCTGTTGTCCCACCCACAGCTCCAATTGAAAATAAAAAACAGGAGAGGGAGGATAAAAATTGGCCTATACCGCCTCCTCCAGTTGCAGAAACATCTGTACCGCCTCCTTCGGTAGCAGAAATAGAGACCCCAATACAAAGAATTTTACGCTCTGCTGCCATAGCTGGAGAGCCCTTAGGACCTNTGCGCTTTTCCTATTTCCGTAAGGCCTGATCCAAATAATCCACAGCAGNTTATTCATGAACACACCCCACTAGAGTTTAAGTTGTTGAAGGAATTAAAAGCTAAGTGTGGTNAATAATGGCGTACAGAGCCCATTCACTTTAGGATTGCTAGAATCTGTGTTTGGTGCTATGCGTCTTCTACCCTTTGATGTAAAACACTTGGCNCGAACTTGCTTGTCTGCTAGTGCATATCTGACATGGAATTTAAATTGGCAAGAAATGTGTGCAGACCAGGCTAGACAGAACCGTGCTGCTGGACACGGAGACATTACAGAGGATATGCTGTTAGGTAATGGCCCTTNATTCAGACCTGGAACGTCAAATGGCACTCCCAGACGCTGCTTATCAGCAGTGTGCACAGGCCGCTAAACGCGCCTGGGCCACAATTCCTGAAGAGGGAGTCCCAGTACAATCCTTTTTACATATCATGCAAGGGTCGCAGGAACCCTATGCGCAATTTCTTGCAAGATTACAAGAGGCAGTGAAGCGTCAGATTCCTCATACCGCTGCCGCAGAAATGCTAACCTTAACTCTAGCTTTTGAGAATGCAAACGCGGATTGTAAACGTGCACTGGCACCTGTGAGGTGTAAAAAAACTTGGGAAATTTTCTCAGAGCTTGTCAGGATGTAGGAACTGAGCTTCATCGCTCTGCAATGTTAGCNCAAGCAATGGCTAATTTAGCAGTTGACAAATCTAAAAGGAGCCAAGGGTCAAACCCTAAAATGGGAAAATGTTATAATTGTGGAAAAACTGGACATTTTAAAAAGGAATGCCGCCAGATCTCAGGACAGAAAGGACCTTACAATGCAGTNCCCCCCACCCCCGCGNNCCAGCGGAAAAAAACGCCAGGACTTTGTCCTCGCTGTAACAAAGGAAATCACTGGGCTAATCAGTGCCGCTCAAAATTTCATCAGAATGGCACCCCCCTGTCGGGAAACGAGANGGGGGCCTGGACCCGGGCCCCTCAAACAATGAGGGCATTCCCAGTCCAGACCACAACCCCGTTTCAGGGATGGGTTCCCGGAGGNACATTGATTCCCTCACCCCAGGAACACCAGGAAGTGCAGGATTAGATCTCCCCGCCAGAGAAAGAATTACGTTAGTTGGNGGAGACAAACCCACCAAAGTTCCCACTGGCATTTGGGGACCTTTACCAGCAGGATACATGGGACTAATTTTAGGCAAAAGCCGCCTTAACTTGCAAGGCATTACTGTAGTCCCAGGAGT
NGTTGACTCCGATTATGAAGGAGAAATTCAAGTAGTTTTAATGTCACAAGATCTTTGGGTTTTTGAACCGGGAGAATATATTGCTCAATTATTGCTTATTCCCTGCAAATTACACCCTTCTCCACGAAAGGAGAAACGAGGAAATAAAGGGTTTGGGAGCACAACTACATGGGAAATCTATCTATCCNCAACCCATAGCCTCTAATAGACCCACCTGTGTAGTACAAATTAAAGGAAAGAAATTTTATGGGCTTATGGATACGGGAGCTGATGTGTCAGTAATATCTAGNAACGACTGGCCCCCATCCTGGCCCCTGCGATTAACTTCTACATCCCTAGTGGGAGTAGGAACAGCTCAAAGTGTTCAACAGAGTGCTGAGATTTTACCTTGTCTTGGTCCGGATGGACAGTCATGTACTTTTCAGCCTTATGTTGCAAATATAGCTATCAATTTATGGGGTCGAGACTTACTTACAGCATGGGATATGAGACTTACAAATGAAAACTTTGATAACCCAGGATTTAAAATGTTGAAGGACATGGGATATCAGAGTGGAAAAGGTTTAGGGAAATTCCTACAAGGAAACCCTAACCCGATATCAGTAACTGGAAAAACAGATAGAAAAGGGCTAGGACGTCAGGATTTCTGACGGGGGTCATTGATATTTCTCCTCCGCCCACTGCCTTACCATTAGAATGGCTNAGTGACAAACCTGTGTGGGTGGATCAATGGCCCCTANCACAGGAGAAGCTAGNTCAACTTCATCNGCTAGTAAAAGAGCAATTGGATGCAGGACATATAGAGAAGAGTTNCAGCCCCTGGAATTCACCGGTGTTTGTTATTCCAAAAAAGTCCGGAAGATGGTGACTGCTGCATGATTTGAGAGCTATTAATGCACAAATTAAACCGATGGGTGCATTACAGCAAGGTCTGCCATCCCCAGCGGCCATTCCAAGAGACTGGCCTCTCGTAGTAATAGATCTTAAGGATTGTTTCTTTACTATACCNTTACACGAGAAGGATAAGCCTCGATTTGCCTTCTCTGTGCCTTCTATTAATCAAAGAGAACCTGTTTCTCGTTATCAATGGAAAGTTTTACCCCAAGGCATGCTTAACAGTCCTACGCTATGTCAGCATTTTGTAGGACGGGCATTAAAGGAGCCTCGGAATATGTTTCCCACTGCTTACATCATTCATTNTATGGATGATATTCTTTTGGCCGCTCCTACAGATCAAATCTTACATCAGTTATTCAGAGAAACAAAGCGGGCTTTGACTAAATGGAATCTCAAAATNGCTCCAGAGAAGGTGCAAACAACTTCCCCATACCANTACTTAGGAACTATTGTTACGGAGAGAAGTGTACGGCCTCAGAAAGTAGTTCTCCGTAAAGACAGGTTACAGACTTTAAATGATTTTCAACAATTATTAGGGGATATTAATTGGCTGCGCCCGATGCTAGGTATTGCTACCTATCAACTTACACATCTTTACCAAACCCTGCAAGGAGATTCTTCNTTAAATTCCCCGCGGCAACTNACTAAAGAGGCAGAAGCCGAGTTACGGCTTGTAGAGCAGATGCTTCAGCAGAGACATGCCTCNCGGCTACAGCCGCAAAAACCTTTGCTTTTGTTTATTCTTCCTACCCCCCACTCTCCAACAGGACTTTTGGGCCAGTTCATAGACAAGTCTGTAACAGTAATAGAATGGCTCTTTCTACCTAATCAGTCAAAACCTTGCAAGTTTATCTTTCTTTAATTACACAAATTGTGACTATGGGCAGGCATAGGTCAAAAATGCTTACGGGATATGATCCNGACAAAATTATTGTTCCCTTAGACTCCCAGCAACAGGCCGCAGCNTGGGAAATGTCGACTGCNTGGCAAATCGCTTTCGCAGACTTCGTGGGAATAATAGATAACCACTATCCCTCAGACAAAATTTTGCAGTTTTATAAAGTCCATTCTTTCATTCTTCCTGTGATTACTCATCACAAGCC
TATTCCAGGTGGACAGACTTATTTTACTGATGGCTCTTCCAAAGGTCGTGCAGCTATCTATGGACCTAAACATACTCAAACAATAATGACCTCTGGGGTTTCAGCTCAACGCTCAGAGCTAATTGCAGTCATTCAGGTTTTACAGCTCACAGCTTCAGATCCTATCAACATTGTCTGTGATTCAGCTTATGTTGTAAATGTAGCCAGTCGCATAGAAACTGCTACAATTAAAAGTACACTAGACCCAGAACTGCTTAATTTGTTTCTAAGACTTCANACAGCTATTCGCTCTCGTGCAGCTCCTTTTCATATTTCTCATATTCGCTCTCACACACAACTTCCTGGACCACTATCTCTAGGTAATGATAGAGCAGATAAACTGATTGGTTCTGTGTTTCAGCAAGCTCAAGCNTCTCNATGCGCTACTGCACCAAAACACCTCCGCCCTTACTCGCATGTTTCATCTGCCTCGCAGCCAAGCTAGGGCTATNGTACAAGCCTGTCCTACTTGCCAGCATGTCCCTGGNGCCGCACCTGTAGAAGGNTGTAACCCACGAGGTTTGGCTCCAAATGAAATCTGGCAAATGGATGTTACACACATAGCAGCCTTTGGCAAGCTTAGCTATGTTCTGTGANCTATAGACACTTATTCTCATATGCTGCATGCTACATGCCAAACAGGTGAGACAGCTGGTCATGTACGGCGACATTGTCTGTCATCATTTGCTCATATGGGGATACCTAAACAATTAAAAACTGACAATGGACCCGCTTATACTAGTCATGCTTTTCAAAATTTCTTACAGCTTTGGGCTATAACCCATAAAACAGGAATTCCTTATAATCCTAGAGGACAAGGCATTATAGAGCGGGCACATCAAACATTACAACGCATGTTGAAAAAACAAAAAGGGNGGTATAGGAGGCCAACTACCACCTCAATCAAAACTACATTTAGCCTTATTTACTTTAAATTTTTTNGACTCCTGGTACGGATGGTAAGACTCCAGCAGAAAGACATTGGCAAGTGTTAGAGGAAAAGAGGAAAGTTTATCCGAAAGTGTTATGGAAATCCCCGGAAGAAGNGACAATGGAAAGGTCCGGTGGATTTACTGACGTGGGGANGAGGGTATGCTTGTGTTTTTACAGGAGATGGACAAACCGTGTGGGTGCCCTCAAGGTGCGTGCGACCATGGAACGGGAGACTGGAGGAACCCAGGGTGGCCAACCATGGGCCCGGTCCCTCCGGTACGAGCCATGAGCCAGCTGAGCCTGAGTGCAAAGACGGAGAGAAGGCCGACCGGAGTCACGACGACATCAACCCCCATAACCTGGGGACAACTCAAGAAAACCACGCAGGAAGCTGAGAAACTACTGGAGCGTCAGGGNCAGGCAAAAACCCCTGATTCCATGTTCTTGGCCATGTTAGCCATAATGTCCTGTGCGGTATGTTTTCCCTGTGCAGAGGCAAAAACATATTGGGCATATGTTCCCAATCCCCCAGCAGTACGACCTGTACTTTGGAGTGACACTCCTCCTGAGATTTATCATGATCAGGGAGCGTGGGCTCCAGGACCCCTAACTCCCCTGACANTAGAACAGTTAGACTCTCAGAACAATGTCATCAATTATACCGCTCCACTGGAAGGACTCCCTTTGTGTATCACCACAAAGACGTCGCTCAGCCGTAGCTGTCTTACAATTCAAGCTCAAGCATGGTTGAGTCACTATGGAAAAGTCATGTACTTATTAGGTCTTGGTTCTATTAATGTAACTGGTGTGCTAACCAACCATTCCCGGCCCAATCGCCCTAATTGTGCTGACTATACGGAATGGATTCCCTTCAATAGTTCCTACCCCCCCTCNCGTGGACCCAGTGTCTTGGCCCACTGGCTAGAAAACAATCTATGTTAACTGGAGACATTGTGGATTGGGGACCTAAAGGTCAATTAGATGGAAAAGATGAAAATCAGAAATCATGGCACAAACTTCGCTGGCAT
TGGTGGCAAGCTTTTAATGCTTCTTCTTTATACNACACCGGGATCCAATCCCAGTCTGCCGCCCAGATTGCTTGGCATGGAGCAGGCTTTAGCCCGCCTCTTCCTCAGTGGCATTATCTAGGGAGGAAAGGACCAATTCAAGAGACGATATGGAAGGCAGCACTCCCATTTACGAATGGAGCATCTGGGTTNGGGATACTATCCAATAATAGCAATAGTAAGCGACACAGTCTTAATGTTACATTTGTAAAGAATATCACCACTCAATTTACGGTTTGTGTTTTTAATCCTTATGTCTTTTTGGCAGCTAAGAAGGACCAGCTCCAGGTAAACAATACCCAATTGACCTGTAAATCTTGCCAGTTATATCACTGCATTAATCATAGCACATTGCAAACACATAATATCTCTACTTTGATGATTTTGGGTCGCATCCCTGGGCTATGGATTCCTGTTAATCTGTCCGAGCCTTGGGCTGCCACACCTGCTTTGCATTTTGTGAAACTTCTTCTAACTCAGCTTACTCATCGTGTCCGTAGAGCCTTAGGCATGATAATTTTTGCTATTGTTTCCTTGGTCACACTAATAACTTCTGTTGTGATGTCCTCTGTAGCTTTGCATAGTTCTGTTCAAACAGCTCAGTACGTGGAGAACTGGACGCGCACAGCCGACCAAGCGTGGCTACTTCAGAATAAAATTAACACTGAGTTACAAACTGAAGTGGCAATGTTGAAATCCACGGTTCTATGGTTAGGGGAACAAGTACAAAGCTTGCAGTTGCAGCAGCAATTGCGTTGTCATTTTAATCACACTCATATTTGTGTAACCAACTTAGAATATAACCAAAGTGAGTATCCGTGGGACCTTGTGAAAGCCCATTTGCAGGGAGCTTTCACATCCAACATCACCTTTGATATTGGTGAATTACAAAACAAAATTCTTGATTTAAATAGGCAAACTCAAGAGTTTCAGCCTTCTTTAGAAGACTGGACCGAATTCCAGCAAGGCCTGGAGAGCCTCAACCCTTGGACCTATCTAAGGCACCACATTAACATCTTATATGTAGTTCTTGGAATAATGTTGTTTTGTCTCTGTCTTCTGTTCATAGTCTGTAAAATCGGATGGACCGCCAATCGGAGAATGAGAGCTGCCCAGCCTGGCCTTACATTCTTTCAATTAATNCATAAACAGAAAGGGGGATA



TF motifs of the consensus sequence

FIMO is used to detect transcription factor motifs in the consensus sequence of the TE family.

TE_family  TFBS       Start  End   Strand  Score  Matched sequence
HERVK3     nfya-1     4380   4389  -       17.16  AACCAATCAG
HERVK3     erm        2775   2785  -       17.15  ATTGCTCTTTT
HERVK3     M1BP       6575   6585  +       17.11  TGGTCACACTA
HERVK3     skn-1      3128   3141  -       16.97  AAAATGCTGACATA
HERVK3     RELA       5068   5077  -       16.96  GGGGATTTCC
HERVK3     eor-1      5281   5293  +       16.87  AAAGACGGAGAGA
HERVK3     NR2F6      119    133   -       16.83  TAGGGCACTGACCTT
HERVK3     NR2F6      155    169   -       16.83  TAGGGCACTGACCTT
HERVK3     AT1G72740  2584   2594  -       16.68  GGTTAGGGTTT
HERVK3     NR2F6      119    133   +       16.63  AAGGTCAGTGCCCTA
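
The Start and End columns appear to be 1-based, inclusive positions on the consensus, with minus-strand hits reported as the reverse complement of the consensus at that position. The sketch below checks this reading against the three NR2F6 rows, using only the first 169 bases of the Sequence field. The helper names (`reverse_complement`, `hit_sequence`) are illustrative and not part of FIMO or of this database.

```python
# First 169 bases of the HERVK3 consensus, copied from the Sequence field.
CONSENSUS_PREFIX = (
    "AGTGGCGTCCGAACACAGGGACTTCGAGGACGTGAACGAAGAAGGTCTGC"
    "TGGAGCAGAGGAACTGAAATTGACAAGGCGAACGGGGACCCCGGGACGAG"
    "TCTGCCGGCAGCGGATATAAGGTCAGTGCCCTAAAGAGGTACTGGGAGCA"
    "ATATAAGGTCAGTGCCCTA"
)

# Translation table for complementing bases; N stays N.
_COMPLEMENT = str.maketrans("ACGTN", "TGCAN")


def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def hit_sequence(consensus, start, end, strand):
    """Extract a hit, assuming 1-based inclusive coordinates (an assumption
    inferred from the table, not stated by the source)."""
    site = consensus[start - 1:end]
    return reverse_complement(site) if strand == "-" else site


# (TFBS, Start, End, Strand, Matched sequence) rows taken from the table.
hits = [
    ("NR2F6", 119, 133, "-", "TAGGGCACTGACCTT"),
    ("NR2F6", 155, 169, "-", "TAGGGCACTGACCTT"),
    ("NR2F6", 119, 133, "+", "AAGGTCAGTGCCCTA"),
]

for name, start, end, strand, matched in hits:
    observed = hit_sequence(CONSENSUS_PREFIX, start, end, strand)
    print(name, start, end, strand, observed == matched)
```

The same check can be run on the remaining rows by passing the full 7202 bp consensus instead of the prefix.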


TFBS enrichment in GRCh38

Fisher's exact test is used to test for enrichment of transcription factor binding sites in copies of the TE family in GRCh38.
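
The page does not say how the contingency table is built, so the following is only a minimal sketch of a one-sided Fisher's exact test on a 2x2 table. It assumes rows are "TE family copies" versus "background regions" and columns are "overlaps a binding site" versus "does not overlap". The function name and all counts are hypothetical and chosen purely for illustration.

```python
from math import comb


def fisher_exact_greater(a, b, c, d):
    """One-sided Fisher's exact test p-value, P(X >= a), for the table

        [[a, b],    a: TE copies overlapping a binding site
         [c, d]]    b: TE copies without an overlap
                    c: background regions overlapping a binding site
                    d: background regions without an overlap

    The row/column meanings are an assumption made for this sketch.
    """
    row1 = a + b          # number of TE copies
    col1 = a + c          # number of regions overlapping a binding site
    n = a + b + c + d     # all regions
    upper = min(row1, col1)
    # Sum the hypergeometric probabilities of tables at least as extreme.
    tail = sum(comb(row1, k) * comb(n - row1, col1 - k)
               for k in range(a, upper + 1))
    return tail / comb(n, col1)


# Hypothetical counts, not taken from the database.
p_value = fisher_exact_greater(12, 88, 30, 870)
print(f"one-sided p-value: {p_value:.3g}")
```

A small p-value here would indicate that binding sites overlap the TE copies more often than expected from the background. In practice a library routine such as SciPy's `scipy.stats.fisher_exact` would normally be used in place of this hand-written version.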




GTEx

Promoter activity across 46 body sites from the Genotype-Tissue Expression (GTEx) project.




TCGA

The promoter activity across 33 cancer types from The Cancer Genome Atlas (TCGA).