HERVK3
Basic information Differential Expression Stage analysis Survival analysis Correlation analysisDF ID | DF0000195 |
---|---|
TE superfamily | ERV2 |
TE class | LTR |
Species | Simiiformes |
Length | 7202 |
Kimura value | 9.82 |
Tau index | 1.0000 |
Description | Internal region of HERVK endogenous retrovirus, HERVK3 subfamily |
Comment | The associated long terminal repeat is LTR3, and has 6 bp TSDs. |
Sequence |
AGTGGCGTCCGAACACAGGGACTTCGAGGACGTGAACGAAGAAGGTCTGCTGGAGCAGAGGAACTGAAATTGACAAGGCGAACGGGGACCCCGGGACGAGTCTGCCGGCAGCGGATATAAGGTCAGTGCCCTAAAGAGGTACTGGGAGCAATATAAGGTCAGTGCCCTAAAGAGGTACTGGGAACGGGAAGTTTTCTGAATCAGNGGTAACATGGGGCAGAATTTGTCTATTGAAGAAAAACATTATCGTGCAGTTGCTTAAAGTTCTGTTGAGACAGTCTGGNGCTCAGGTTAGTTCNCAGACACTAACTAAGATGCCGCAGGAGGTTATTACGCATAACCCATGGTTTCCACAGGCAGGCACTCTTGATGTGGAAAATTGGGACAGAGCAGGAGAAGGATTAAAACGGGCTCATCAAAAAGGTCTTAAAGTTGATTCTTCTGTTTTCTCCACTTGGAGTTTAGTTCGTACTGTNCTTCTGCCATTATCTCCTTNTTATTCTGCNGGACAGCAGGAGTCATGTTCTGAGTCTAAAAATCTGAAAGAATCTGTTGTCCCACCCACAGCTCCAATTGAAAATAAAAAACAGGAGAGGGAGGATAAAAATTGGCCTATACCGCCTCCTCCAGTTGCAGAAACATCTGTACCGCCTCCTTCGGTAGCAGAAATAGAGACCCCAATACAAAGAATTTTACGCTCTGCTGCCATAGCTGGAGAGCCCTTAGGACCTNTGCGCTTTTCCTATTTCCGTAAGGCCTGATCCAAATAATCCACAGCAGNTTATTCATGAACACACCCCACTAGAGTTTAAGTTGTTGAAGGAATTAAAAGCTAAGTGTGGTNAATAATGGCGTACAGAGCCCATTCACTTTAGGATTGCTAGAATCTGTGTTTGGTGCTATGCGTCTTCTACCCTTTGATGTAAAACACTTGGCNCGAACTTGCTTGTCTGCTAGTGCATATCTGACATGGAATTTAAATTGGCAAGAAATGTGTGCAGACCAGGCTAGACAGAACCGTGCTGCTGGACACGGAGACATTACAGAGGATATGCTGTTAGGTAATGGCCCTTNATTCAGACCTGGAACGTCAAATGGCACTCCCAGACGCTGCTTATCAGCAGTGTGCACAGGCCGCTAAACGCGCCTGGGCCACAATTCCTGAAGAGGGAGTCCCAGTACAATCCTTTTTACATATCATGCAAGGGTCGCAGGAACCCTATGCGCAATTTCTTGCAAGATTACAAGAGGCAGTGAAGCGTCAGATTCCTCATACCGCTGCCGCAGAAATGCTAACCTTAACTCTAGCTTTTGAGAATGCAAACGCGGATTGTAAACGTGCACTGGCACCTGTGAGGTGTAAAAAAACTTGGGAAATTTTCTCAGAGCTTGTCAGGATGTAGGAACTGAGCTTCATCGCTCTGCAATGTTAGCNCAAGCAATGGCTAATTTAGCAGTTGACAAATCTAAAAGGAGCCAAGGGTCAAACCCTAAAATGGGAAAATGTTATAATTGTGGAAAAACTGGACATTTTAAAAAGGAATGCCGCCAGATCTCAGGACAGAAAGGACCTTACAATGCAGTNCCCCCCACCCCCGCGNNCCAGCGGAAAAAAACGCCAGGACTTTGTCCTCGCTGTAACAAAGGAAATCACTGGGCTAATCAGTGCCGCTCAAAATTTCATCAGAATGGCACCCCCCTGTCGGGAAACGAGANGGGGGCCTGGACCCGGGCCCCTCAAACAATGAGGGCATTCCCAGTCCAGACCACAACCCCGTTTCAGGGATGGGTTCCCGGAGGNACATTGATTCCCTCACCCCAGGAACACCAGGAAGTGCAGGATTAGATCTCCCCGCCAGAGAAAGAATTACGTTAGTTGGNGGAGACAAACCCACCAAAGTTCCCACTGGCATTTGGGGACCTTTACCAGCAGGATACATGGGACTAATTTTAGGCAAAAGCCGCCTTAACTTGCAAGGCATTACTGTAGTCCCAGGAGTNGTTGACTCCGATTATGAAGGAGAAATTCAAGTAGTTTTAATGTCACAAGATCTTTGGGTTTTTGAACCGGGAGAATATATTGCTCAATTATTGCTTATTCCCTGCAAATTACACCCTTCTCCACGAAAGGAGAAACGAGGAAATAAAGGGTTTGGGAGCACAACTACATGGGAAATCTATCTATCCNCAACCCATAGCCTCTAATAGACCCACCTGTGTAGTACAAATTAAAGGAAAGAAATTTTATGGGCTTATGGATACGGGAGCTGATGTGTCAGTAATATCTAGNAACGACTGGCCCCCATCCTGGCCCCTGCGATTAACTTCTACATCCCTAGTGGGAGTAGGAACAGCTCAAAGTGTTCAACAGAGTGCTGAGATTTTACCTTGTCTTGGTCCGGATGGACAGTCATGTACTTTTCAGCCTTATGTTGCAAATATAGCTATCAATTTATGGGGTCGAGACTTACTTACAGCATGGGATATGAGACTTACAAATGAAAACTTTGATAACCCAGGATTTAAAATGTTGAAGGACATGGGATATCAGAGTGGAAAAGGTTTAGGGAAATTCCTACAAGGAAACCCTAACCCGATATCAGTAACTGGAAAAACAGATAGAAAAGGGCTAGGACGTCAGGATTTCTGACGGGGGTCATTGATATTTCTCCTCCGCCCACTGCCTTACCATTAGAATGGCTNAGTGACAAACCTGTGTGGGTGGATCAATGGCCCCTANCACAGGAGAAGCTAGNTCAACTTCATCNGCTAGTAAAAGAGCAATTGGATGCAGGACATATAGAGAAGAGTTNCAGCCCCTGGAATTCACCGGTGTTTGTTATTCCAAAAAAGTCCGGAAGATGGTGACTGCTGCATGATTTGAGAGCTATTAATGCACAAATTAAACCGATGGGTGCATTACAGCAAGGTCTGCCATCCCCAGCGGCCATTCCAAGAGACTGGCCTCTCGTAGTAATAGATCTTAAGGATTGTTTCTTTACTATACCNTTACACGAGAAGGATAAGCCTCGATTTGCCTTCTCTGTGCCTTCTATTAATCAAAGAGAACCTGTTTCTCGTTATCAATGGAAAGTTTTACCCCAAGGCATGCTTAACAGTCCTACGCTATGTCAGCATTTTGTAGGACGGGCATTAAAGGAGCCTCGGAATATGTTTCCCACTGCTTACATCATTCATTNTATGGATGATATTCTTTTGGCCGCTCCTACAGATCAAATCTTACATCAGTTATTCAGAGAAACAAAGCGGGCTTTGACTAAATGGAATCTCAAAATNGCTCCAGAGAAGGTGCAAACAACTTCCCCATACCANTACTTAGGAACTATTGTTACGGAGAGAAGTGTACGGCCTCAGAAAGTAGTTCTCCGTAAAGACAGGTTACAGACTTTAAATGATTTTCAACAATTATTAGGGGATATTAATTGGCTGCGCCCGATGCTAGGTATTGCTACCTATCAACTTACACATCTTTACCAAACCCTGCAAGGAGATTCTTCNTTAAATTCCCCGCGGCAACTNACTAAAGAGGCAGAAGCCGAGTTACGGCTTGTAGAGCAGATGCTTCAGCAGAGACATGCCTCNCGGCTACAGCCGCAAAAACCTTTGCTTTTGTTTATTCTTCCTACCCCCCACTCTCCAACAGGACTTTTGGGCCAGTTCATAGACAAGTCTGTAACAGTAATAGAATGGCTCTTTCTACCTAATCAGTCAAAACCTTGCAAGTTTATCTTTCTTTAATTACACAAATTGTGACTATGGGCAGGCATAGGTCAAAAATGCTTACGGGATATGATCCNGACAAAATTATTGTTCCCTTAGACTCCCAGCAACAGGCCGCAGCNTGGGAAATGTCGACTGCNTGGCAAATCGCTTTCGCAGACTTCGTGGGAATAATAGATAACCACTATCCCTCAGACAAAATTTTGCAGTTTTATAAAGTCCATTCTTTCATTCTTCCTGTGATTACTCATCACAAGCCTATTCCAGGTGGACAGACTTATTTTACTGATGGCTCTTCCAAAGGTCGTGCAGCTATCTATGGACCTAAACATACTCAAACAATAATGACCTCTGGGGTTTCAGCTCAACGCTCAGAGCTAATTGCAGTCATTCAGGTTTTACAGCTCACAGCTTCAGATCCTATCAACATTGTCTGTGATTCAGCTTATGTTGTAAATGTAGCCAGTCGCATAGAAACTGCTACAATTAAAAGTACACTAGACCCAGAACTGCTTAATTTGTTTCTAAGACTTCANACAGCTATTCGCTCTCGTGCAGCTCCTTTTCATATTTCTCATATTCGCTCTCACACACAACTTCCTGGACCACTATCTCTAGGTAATGATAGAGCAGATAAACTGATTGGTTCTGTGTTTCAGCAAGCTCAAGCNTCTCNATGCGCTACTGCACCAAAACACCTCCGCCCTTACTCGCATGTTTCATCTGCCTCGCAGCCAAGCTAGGGCTATNGTACAAGCCTGTCCTACTTGCCAGCATGTCCCTGGNGCCGCACCTGTAGAAGGNTGTAACCCACGAGGTTTGGCTCCAAATGAAATCTGGCAAATGGATGTTACACACATAGCAGCCTTTGGCAAGCTTAGCTATGTTCTGTGANCTATAGACACTTATTCTCATATGCTGCATGCTACATGCCAAACAGGTGAGACAGCTGGTCATGTACGGCGACATTGTCTGTCATCATTTGCTCATATGGGGATACCTAAACAATTAAAAACTGACAATGGACCCGCTTATACTAGTCATGCTTTTCAAAATTTCTTACAGCTTTGGGCTATAACCCATAAAACAGGAATTCCTTATAATCCTAGAGGACAAGGCATTATAGAGCGGGCACATCAAACATTACAACGCATGTTGAAAAAACAAAAAGGGNGGTATAGGAGGCCAACTACCACCTCAATCAAAACTACATTTAGCCTTATTTACTTTAAATTTTTTNGACTCCTGGTACGGATGGTAAGACTCCAGCAGAAAGACATTGGCAAGTGTTAGAGGAAAAGAGGAAAGTTTATCCGAAAGTGTTATGGAAATCCCCGGAAGAAGNGACAATGGAAAGGTCCGGTGGATTTACTGACGTGGGGANGAGGGTATGCTTGTGTTTTTACAGGAGATGGACAAACCGTGTGGGTGCCCTCAAGGTGCGTGCGACCATGGAACGGGAGACTGGAGGAACCCAGGGTGGCCAACCATGGGCCCGGTCCCTCCGGTACGAGCCATGAGCCAGCTGAGCCTGAGTGCAAAGACGGAGAGAAGGCCGACCGGAGTCACGACGACATCAACCCCCATAACCTGGGGACAACTCAAGAAAACCACGCAGGAAGCTGAGAAACTACTGGAGCGTCAGGGNCAGGCAAAAACCCCTGATTCCATGTTCTTGGCCATGTTAGCCATAATGTCCTGTGCGGTATGTTTTCCCTGTGCAGAGGCAAAAACATATTGGGCATATGTTCCCAATCCCCCAGCAGTACGACCTGTACTTTGGAGTGACACTCCTCCTGAGATTTATCATGATCAGGGAGCGTGGGCTCCAGGACCCCTAACTCCCCTGACANTAGAACAGTTAGACTCTCAGAACAATGTCATCAATTATACCGCTCCACTGGAAGGACTCCCTTTGTGTATCACCACAAAGACGTCGCTCAGCCGTAGCTGTCTTACAATTCAAGCTCAAGCATGGTTGAGTCACTATGGAAAAGTCATGTACTTATTAGGTCTTGGTTCTATTAATGTAACTGGTGTGCTAACCAACCATTCCCGGCCCAATCGCCCTAATTGTGCTGACTATACGGAATGGATTCCCTTCAATAGTTCCTACCCCCCCTCNCGTGGACCCAGTGTCTTGGCCCACTGGCTAGAAAACAATCTATGTTAACTGGAGACATTGTGGATTGGGGACCTAAAGGTCAATTAGATGGAAAAGATGAAAATCAGAAATCATGGCACAAACTTCGCTGGCATTGGTGGCAAGCTTTTAATGCTTCTTCTTTATACNACACCGGGATCCAATCCCAGTCTGCCGCCCAGATTGCTTGGCATGGAGCAGGCTTTAGCCCGCCTCTTCCTCAGTGGCATTATCTAGGGAGGAAAGGACCAATTCAAGAGACGATATGGAAGGCAGCACTCCCATTTACGAATGGAGCATCTGGGTTNGGGATACTATCCAATAATAGCAATAGTAAGCGACACAGTCTTAATGTTACATTTGTAAAGAATATCACCACTCAATTTACGGTTTGTGTTTTTAATCCTTATGTCTTTTTGGCAGCTAAGAAGGACCAGCTCCAGGTAAACAATACCCAATTGACCTGTAAATCTTGCCAGTTATATCACTGCATTAATCATAGCACATTGCAAACACATAATATCTCTACTTTGATGATTTTGGGTCGCATCCCTGGGCTATGGATTCCTGTTAATCTGTCCGAGCCTTGGGCTGCCACACCTGCTTTGCATTTTGTGAAACTTCTTCTAACTCAGCTTACTCATCGTGTCCGTAGAGCCTTAGGCATGATAATTTTTGCTATTGTTTCCTTGGTCACACTAATAACTTCTGTTGTGATGTCCTCTGTAGCTTTGCATAGTTCTGTTCAAACAGCTCAGTACGTGGAGAACTGGACGCGCACAGCCGACCAAGCGTGGCTACTTCAGAATAAAATTAACACTGAGTTACAAACTGAAGTGGCAATGTTGAAATCCACGGTTCTATGGTTAGGGGAACAAGTACAAAGCTTGCAGTTGCAGCAGCAATTGCGTTGTCATTTTAATCACACTCATATTTGTGTAACCAACTTAGAATATAACCAAAGTGAGTATCCGTGGGACCTTGTGAAAGCCCATTTGCAGGGAGCTTTCACATCCAACATCACCTTTGATATTGGTGAATTACAAAACAAAATTCTTGATTTAAATAGGCAAACTCAAGAGTTTCAGCCTTCTTTAGAAGACTGGACCGAATTCCAGCAAGGCCTGGAGAGCCTCAACCCTTGGACCTATCTAAGGCACCACATTAACATCTTATATGTAGTTCTTGGAATAATGTTGTTTTGTCTCTGTCTTCTGTTCATAGTCTGTAAAATCGGATGGACCGCCAATCGGAGAATGAGAGCTGCCCAGCCTGGCCTTACATTCTTTCAATTAATNCATAAACAGAAAGGGGGATA
|
TF motifs of the concenus sequence
Use FIMO to detect transcription factor motifs in the concenus sequence of the TE family.
TE_family | TFBS | Start | End | Strand | Score | Matched sequence |
---|---|---|---|---|---|---|
HERVK3 | NR2F1 | 155 | 169 | + | 16.04 | AAGGTCAGTGCCCTA |
HERVK3 | Mafb | 3131 | 3141 | - | 16.03 | AAAATGCTGAC |
HERVK3 | MYB96 | 4926 | 4936 | + | 16.00 | GCCAACTACCA |
HERVK3 | ATHB-40 | 2086 | 2096 | - | 15.98 | AGCAATAATTG |
HERVK3 | ZHD3 | 2086 | 2094 | + | 15.98 | CAATTATTG |
HERVK3 | Gfi1B | 2375 | 2384 | - | 15.95 | AAATCTCAGC |
HERVK3 | Stat5a | 7067 | 7075 | - | 15.91 | TTCCAAGAA |
HERVK3 | NRL | 5814 | 5825 | + | 15.89 | ATTGTGCTGACT |
HERVK3 | HAT22 | 2086 | 2094 | - | 15.87 | CAATAATTG |
HERVK3 | ATHB-53 | 2086 | 2094 | - | 15.87 | CAATAATTG |
TFBS enrichment in GRCh38
Use Fisher's exact test to perform enrichment analysis of transcription factor binding sites in the TE family of GRCh38.