HERVK3
DF ID | DF0000195 |
---|---|
TE superfamily | ERV2 |
TE class | LTR |
Species | Simiiformes |
Length | 7202 |
Kimura value | 9.82 |
Tau index | 1.0000 |
Description | Internal region of HERVK endogenous retrovirus, HERVK3 subfamily |
Comment | The associated long terminal repeat is LTR3, and has 6 bp TSDs. |
Sequence |
AGTGGCGTCCGAACACAGGGACTTCGAGGACGTGAACGAAGAAGGTCTGCTGGAGCAGAGGAACTGAAATTGACAAGGCGAACGGGGACCCCGGGACGAGTCTGCCGGCAGCGGATATAAGGTCAGTGCCCTAAAGAGGTACTGGGAGCAATATAAGGTCAGTGCCCTAAAGAGGTACTGGGAACGGGAAGTTTTCTGAATCAGNGGTAACATGGGGCAGAATTTGTCTATTGAAGAAAAACATTATCGTGCAGTTGCTTAAAGTTCTGTTGAGACAGTCTGGNGCTCAGGTTAGTTCNCAGACACTAACTAAGATGCCGCAGGAGGTTATTACGCATAACCCATGGTTTCCACAGGCAGGCACTCTTGATGTGGAAAATTGGGACAGAGCAGGAGAAGGATTAAAACGGGCTCATCAAAAAGGTCTTAAAGTTGATTCTTCTGTTTTCTCCACTTGGAGTTTAGTTCGTACTGTNCTTCTGCCATTATCTCCTTNTTATTCTGCNGGACAGCAGGAGTCATGTTCTGAGTCTAAAAATCTGAAAGAATCTGTTGTCCCACCCACAGCTCCAATTGAAAATAAAAAACAGGAGAGGGAGGATAAAAATTGGCCTATACCGCCTCCTCCAGTTGCAGAAACATCTGTACCGCCTCCTTCGGTAGCAGAAATAGAGACCCCAATACAAAGAATTTTACGCTCTGCTGCCATAGCTGGAGAGCCCTTAGGACCTNTGCGCTTTTCCTATTTCCGTAAGGCCTGATCCAAATAATCCACAGCAGNTTATTCATGAACACACCCCACTAGAGTTTAAGTTGTTGAAGGAATTAAAAGCTAAGTGTGGTNAATAATGGCGTACAGAGCCCATTCACTTTAGGATTGCTAGAATCTGTGTTTGGTGCTATGCGTCTTCTACCCTTTGATGTAAAACACTTGGCNCGAACTTGCTTGTCTGCTAGTGCATATCTGACATGGAATTTAAATTGGCAAGAAATGTGTGCAGACCAGGCTAGACAGAACCGTGCTGCTGGACACGGAGACATTACAGAGGATATGCTGTTAGGTAATGGCCCTTNATTCAGACCTGGAACGTCAAATGGCACTCCCAGACGCTGCTTATCAGCAGTGTGCACAGGCCGCTAAACGCGCCTGGGCCACAATTCCTGAAGAGGGAGTCCCAGTACAATCCTTTTTACATATCATGCAAGGGTCGCAGGAACCCTATGCGCAATTTCTTGCAAGATTACAAGAGGCAGTGAAGCGTCAGATTCCTCATACCGCTGCCGCAGAAATGCTAACCTTAACTCTAGCTTTTGAGAATGCAAACGCGGATTGTAAACGTGCACTGGCACCTGTGAGGTGTAAAAAAACTTGGGAAATTTTCTCAGAGCTTGTCAGGATGTAGGAACTGAGCTTCATCGCTCTGCAATGTTAGCNCAAGCAATGGCTAATTTAGCAGTTGACAAATCTAAAAGGAGCCAAGGGTCAAACCCTAAAATGGGAAAATGTTATAATTGTGGAAAAACTGGACATTTTAAAAAGGAATGCCGCCAGATCTCAGGACAGAAAGGACCTTACAATGCAGTNCCCCCCACCCCCGCGNNCCAGCGGAAAAAAACGCCAGGACTTTGTCCTCGCTGTAACAAAGGAAATCACTGGGCTAATCAGTGCCGCTCAAAATTTCATCAGAATGGCACCCCCCTGTCGGGAAACGAGANGGGGGCCTGGACCCGGGCCCCTCAAACAATGAGGGCATTCCCAGTCCAGACCACAACCCCGTTTCAGGGATGGGTTCCCGGAGGNACATTGATTCCCTCACCCCAGGAACACCAGGAAGTGCAGGATTAGATCTCCCCGCCAGAGAAAGAATTACGTTAGTTGGNGGAGACAAACCCACCAAAGTTCCCACTGGCATTTGGGGACCTTTACCAGCAGGATACATGGGACTAATTTTAGGCAAAAGCCGCCTTAACTTGCAAGGCATTACTGTAGTCCCAGGAGT
NGTTGACTCCGATTATGAAGGAGAAATTCAAGTAGTTTTAATGTCACAAGATCTTTGGGTTTTTGAACCGGGAGAATATATTGCTCAATTATTGCTTATTCCCTGCAAATTACACCCTTCTCCACGAAAGGAGAAACGAGGAAATAAAGGGTTTGGGAGCACAACTACATGGGAAATCTATCTATCCNCAACCCATAGCCTCTAATAGACCCACCTGTGTAGTACAAATTAAAGGAAAGAAATTTTATGGGCTTATGGATACGGGAGCTGATGTGTCAGTAATATCTAGNAACGACTGGCCCCCATCCTGGCCCCTGCGATTAACTTCTACATCCCTAGTGGGAGTAGGAACAGCTCAAAGTGTTCAACAGAGTGCTGAGATTTTACCTTGTCTTGGTCCGGATGGACAGTCATGTACTTTTCAGCCTTATGTTGCAAATATAGCTATCAATTTATGGGGTCGAGACTTACTTACAGCATGGGATATGAGACTTACAAATGAAAACTTTGATAACCCAGGATTTAAAATGTTGAAGGACATGGGATATCAGAGTGGAAAAGGTTTAGGGAAATTCCTACAAGGAAACCCTAACCCGATATCAGTAACTGGAAAAACAGATAGAAAAGGGCTAGGACGTCAGGATTTCTGACGGGGGTCATTGATATTTCTCCTCCGCCCACTGCCTTACCATTAGAATGGCTNAGTGACAAACCTGTGTGGGTGGATCAATGGCCCCTANCACAGGAGAAGCTAGNTCAACTTCATCNGCTAGTAAAAGAGCAATTGGATGCAGGACATATAGAGAAGAGTTNCAGCCCCTGGAATTCACCGGTGTTTGTTATTCCAAAAAAGTCCGGAAGATGGTGACTGCTGCATGATTTGAGAGCTATTAATGCACAAATTAAACCGATGGGTGCATTACAGCAAGGTCTGCCATCCCCAGCGGCCATTCCAAGAGACTGGCCTCTCGTAGTAATAGATCTTAAGGATTGTTTCTTTACTATACCNTTACACGAGAAGGATAAGCCTCGATTTGCCTTCTCTGTGCCTTCTATTAATCAAAGAGAACCTGTTTCTCGTTATCAATGGAAAGTTTTACCCCAAGGCATGCTTAACAGTCCTACGCTATGTCAGCATTTTGTAGGACGGGCATTAAAGGAGCCTCGGAATATGTTTCCCACTGCTTACATCATTCATTNTATGGATGATATTCTTTTGGCCGCTCCTACAGATCAAATCTTACATCAGTTATTCAGAGAAACAAAGCGGGCTTTGACTAAATGGAATCTCAAAATNGCTCCAGAGAAGGTGCAAACAACTTCCCCATACCANTACTTAGGAACTATTGTTACGGAGAGAAGTGTACGGCCTCAGAAAGTAGTTCTCCGTAAAGACAGGTTACAGACTTTAAATGATTTTCAACAATTATTAGGGGATATTAATTGGCTGCGCCCGATGCTAGGTATTGCTACCTATCAACTTACACATCTTTACCAAACCCTGCAAGGAGATTCTTCNTTAAATTCCCCGCGGCAACTNACTAAAGAGGCAGAAGCCGAGTTACGGCTTGTAGAGCAGATGCTTCAGCAGAGACATGCCTCNCGGCTACAGCCGCAAAAACCTTTGCTTTTGTTTATTCTTCCTACCCCCCACTCTCCAACAGGACTTTTGGGCCAGTTCATAGACAAGTCTGTAACAGTAATAGAATGGCTCTTTCTACCTAATCAGTCAAAACCTTGCAAGTTTATCTTTCTTTAATTACACAAATTGTGACTATGGGCAGGCATAGGTCAAAAATGCTTACGGGATATGATCCNGACAAAATTATTGTTCCCTTAGACTCCCAGCAACAGGCCGCAGCNTGGGAAATGTCGACTGCNTGGCAAATCGCTTTCGCAGACTTCGTGGGAATAATAGATAACCACTATCCCTCAGACAAAATTTTGCAGTTTTATAAAGTCCATTCTTTCATTCTTCCTGTGATTACTCATCACAAGCC
TATTCCAGGTGGACAGACTTATTTTACTGATGGCTCTTCCAAAGGTCGTGCAGCTATCTATGGACCTAAACATACTCAAACAATAATGACCTCTGGGGTTTCAGCTCAACGCTCAGAGCTAATTGCAGTCATTCAGGTTTTACAGCTCACAGCTTCAGATCCTATCAACATTGTCTGTGATTCAGCTTATGTTGTAAATGTAGCCAGTCGCATAGAAACTGCTACAATTAAAAGTACACTAGACCCAGAACTGCTTAATTTGTTTCTAAGACTTCANACAGCTATTCGCTCTCGTGCAGCTCCTTTTCATATTTCTCATATTCGCTCTCACACACAACTTCCTGGACCACTATCTCTAGGTAATGATAGAGCAGATAAACTGATTGGTTCTGTGTTTCAGCAAGCTCAAGCNTCTCNATGCGCTACTGCACCAAAACACCTCCGCCCTTACTCGCATGTTTCATCTGCCTCGCAGCCAAGCTAGGGCTATNGTACAAGCCTGTCCTACTTGCCAGCATGTCCCTGGNGCCGCACCTGTAGAAGGNTGTAACCCACGAGGTTTGGCTCCAAATGAAATCTGGCAAATGGATGTTACACACATAGCAGCCTTTGGCAAGCTTAGCTATGTTCTGTGANCTATAGACACTTATTCTCATATGCTGCATGCTACATGCCAAACAGGTGAGACAGCTGGTCATGTACGGCGACATTGTCTGTCATCATTTGCTCATATGGGGATACCTAAACAATTAAAAACTGACAATGGACCCGCTTATACTAGTCATGCTTTTCAAAATTTCTTACAGCTTTGGGCTATAACCCATAAAACAGGAATTCCTTATAATCCTAGAGGACAAGGCATTATAGAGCGGGCACATCAAACATTACAACGCATGTTGAAAAAACAAAAAGGGNGGTATAGGAGGCCAACTACCACCTCAATCAAAACTACATTTAGCCTTATTTACTTTAAATTTTTTNGACTCCTGGTACGGATGGTAAGACTCCAGCAGAAAGACATTGGCAAGTGTTAGAGGAAAAGAGGAAAGTTTATCCGAAAGTGTTATGGAAATCCCCGGAAGAAGNGACAATGGAAAGGTCCGGTGGATTTACTGACGTGGGGANGAGGGTATGCTTGTGTTTTTACAGGAGATGGACAAACCGTGTGGGTGCCCTCAAGGTGCGTGCGACCATGGAACGGGAGACTGGAGGAACCCAGGGTGGCCAACCATGGGCCCGGTCCCTCCGGTACGAGCCATGAGCCAGCTGAGCCTGAGTGCAAAGACGGAGAGAAGGCCGACCGGAGTCACGACGACATCAACCCCCATAACCTGGGGACAACTCAAGAAAACCACGCAGGAAGCTGAGAAACTACTGGAGCGTCAGGGNCAGGCAAAAACCCCTGATTCCATGTTCTTGGCCATGTTAGCCATAATGTCCTGTGCGGTATGTTTTCCCTGTGCAGAGGCAAAAACATATTGGGCATATGTTCCCAATCCCCCAGCAGTACGACCTGTACTTTGGAGTGACACTCCTCCTGAGATTTATCATGATCAGGGAGCGTGGGCTCCAGGACCCCTAACTCCCCTGACANTAGAACAGTTAGACTCTCAGAACAATGTCATCAATTATACCGCTCCACTGGAAGGACTCCCTTTGTGTATCACCACAAAGACGTCGCTCAGCCGTAGCTGTCTTACAATTCAAGCTCAAGCATGGTTGAGTCACTATGGAAAAGTCATGTACTTATTAGGTCTTGGTTCTATTAATGTAACTGGTGTGCTAACCAACCATTCCCGGCCCAATCGCCCTAATTGTGCTGACTATACGGAATGGATTCCCTTCAATAGTTCCTACCCCCCCTCNCGTGGACCCAGTGTCTTGGCCCACTGGCTAGAAAACAATCTATGTTAACTGGAGACATTGTGGATTGGGGACCTAAAGGTCAATTAGATGGAAAAGATGAAAATCAGAAATCATGGCACAAACTTCGCTGGCAT
TGGTGGCAAGCTTTTAATGCTTCTTCTTTATACNACACCGGGATCCAATCCCAGTCTGCCGCCCAGATTGCTTGGCATGGAGCAGGCTTTAGCCCGCCTCTTCCTCAGTGGCATTATCTAGGGAGGAAAGGACCAATTCAAGAGACGATATGGAAGGCAGCACTCCCATTTACGAATGGAGCATCTGGGTTNGGGATACTATCCAATAATAGCAATAGTAAGCGACACAGTCTTAATGTTACATTTGTAAAGAATATCACCACTCAATTTACGGTTTGTGTTTTTAATCCTTATGTCTTTTTGGCAGCTAAGAAGGACCAGCTCCAGGTAAACAATACCCAATTGACCTGTAAATCTTGCCAGTTATATCACTGCATTAATCATAGCACATTGCAAACACATAATATCTCTACTTTGATGATTTTGGGTCGCATCCCTGGGCTATGGATTCCTGTTAATCTGTCCGAGCCTTGGGCTGCCACACCTGCTTTGCATTTTGTGAAACTTCTTCTAACTCAGCTTACTCATCGTGTCCGTAGAGCCTTAGGCATGATAATTTTTGCTATTGTTTCCTTGGTCACACTAATAACTTCTGTTGTGATGTCCTCTGTAGCTTTGCATAGTTCTGTTCAAACAGCTCAGTACGTGGAGAACTGGACGCGCACAGCCGACCAAGCGTGGCTACTTCAGAATAAAATTAACACTGAGTTACAAACTGAAGTGGCAATGTTGAAATCCACGGTTCTATGGTTAGGGGAACAAGTACAAAGCTTGCAGTTGCAGCAGCAATTGCGTTGTCATTTTAATCACACTCATATTTGTGTAACCAACTTAGAATATAACCAAAGTGAGTATCCGTGGGACCTTGTGAAAGCCCATTTGCAGGGAGCTTTCACATCCAACATCACCTTTGATATTGGTGAATTACAAAACAAAATTCTTGATTTAAATAGGCAAACTCAAGAGTTTCAGCCTTCTTTAGAAGACTGGACCGAATTCCAGCAAGGCCTGGAGAGCCTCAACCCTTGGACCTATCTAAGGCACCACATTAACATCTTATATGTAGTTCTTGGAATAATGTTGTTTTGTCTCTGTCTTCTGTTCATAGTCTGTAAAATCGGATGGACCGCCAATCGGAGAATGAGAGCTGCCCAGCCTGGCCTTACATTCTTTCAATTAATNCATAAACAGAAAGGGGGATA
|
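The page does not define how the Tau index in the table above is computed. A minimal sketch, assuming it is the commonly used tissue-specificity index (0 for uniform expression, 1 for expression confined to a single tissue); the function name `tau_index` and the example expression values are hypothetical:

```python
def tau_index(expression):
    """Tissue-specificity index: sum(1 - x_i / x_max) / (n - 1).

    `expression` is a list of non-negative expression levels, one per tissue.
    This is an assumed definition; the database may normalize differently.
    """
    n = len(expression)
    if n < 2:
        raise ValueError("need expression values for at least two tissues")
    x_max = max(expression)
    if x_max <= 0:
        raise ValueError("at least one tissue must have positive expression")
    return sum(1 - x / x_max for x in expression) / (n - 1)


# Made-up profiles: expressed in one tissue only vs. equally in all tissues.
print(tau_index([0.0, 0.0, 5.0, 0.0]))
print(tau_index([2.0, 2.0, 2.0, 2.0]))
```

Under this definition, a value of 1.0000 would mean expression was detected in only one of the profiled tissues.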
TF motifs of the consensus sequence
Use FIMO to detect transcription factor motifs in the consensus sequence of the TE family.
TE_family | TFBS | Start | End | Strand | Score | Matched sequence |
---|---|---|---|---|---|---|
HERVK3 | Znf423 | 5211 | 5225 | + | 21.35 | GGAACCCAGGGTGGC |
HERVK3 | Znf423 | 5211 | 5225 | - | 19.02 | GCCACCCTGGGTTCC |
HERVK3 | klu | 2715 | 2725 | - | 18.79 | CCACCCACACA |
HERVK3 | MAFF | 3131 | 3141 | + | 18.24 | GTCAGCATTTT |
HERVK3 | PATZ1 | 1587 | 1597 | - | 17.64 | GGGGGTGGGGG |
HERVK3 | MYB30 | 4927 | 4938 | + | 17.49 | CCAACTACCACC |
HERVK3 | NR2F1 | 119 | 133 | - | 17.47 | TAGGGCACTGACCTT |
HERVK3 | NR2F1 | 155 | 169 | - | 17.47 | TAGGGCACTGACCTT |
HERVK3 | dl | 5068 | 5077 | - | 17.26 | GGGGATTTCC |
HERVK3 | ZNF281 | 1588 | 1597 | - | 17.16 | GGGGGTGGGG |
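The rows above are consistent with 1-based, inclusive coordinates, with minus-strand matches shown as the reverse complement of the consensus. A minimal Python sketch that checks this against the first 170 bases of the consensus (the helper names `reverse_complement` and `site_at` are hypothetical and not part of FIMO):

```python
# First 170 bases of the HERVK3 consensus, copied from the Sequence field
# and split into 10-mers so positions are easy to count.
PREFIX = "".join([
    "AGTGGCGTCC", "GAACACAGGG", "ACTTCGAGGA", "CGTGAACGAA", "GAAGGTCTGC",
    "TGGAGCAGAG", "GAACTGAAAT", "TGACAAGGCG", "AACGGGGACC", "CCGGGACGAG",
    "TCTGCCGGCA", "GCGGATATAA", "GGTCAGTGCC", "CTAAAGAGGT", "ACTGGGAGCA",
    "ATATAAGGTC", "AGTGCCCTAA",
])

_COMPLEMENT = str.maketrans("ACGTN", "TGCAN")


def reverse_complement(seq):
    """Return the reverse complement of a DNA string (N stays N)."""
    return seq.translate(_COMPLEMENT)[::-1]


def site_at(seq, start, end, strand):
    """Return the site at 1-based inclusive [start, end] as read on `strand`."""
    site = seq[start - 1:end]
    return reverse_complement(site) if strand == "-" else site


# Both NR2F1 rows (119-133 and 155-169, minus strand) list TAGGGCACTGACCTT.
print(site_at(PREFIX, 119, 133, "-") == "TAGGGCACTGACCTT")
print(site_at(PREFIX, 155, 169, "-") == "TAGGGCACTGACCTT")
# The two Znf423 rows are reverse complements of each other.
print(reverse_complement("GGAACCCAGGGTGGC") == "GCCACCCTGGGTTCC")
```

The same check explains why Znf423 appears on both strands at identical coordinates: the site is close enough to palindromic that the motif scores well in either orientation.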
TFBS enrichment in GRCh38
Use Fisher's exact test to test whether transcription factor binding sites are enriched in copies of this TE family in GRCh38.
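The page does not state how the contingency table is built. As a rough sketch, assume a 2x2 table that compares copies of the TE family with a background set of regions, each split by whether it overlaps a binding site for a given factor. A one-sided Fisher's exact test can then be computed from the hypergeometric distribution using only the standard library. The function name `fisher_exact_greater` and all counts below are made up for illustration:

```python
from math import comb


def fisher_exact_greater(a, b, c, d):
    """One-sided Fisher's exact test (enrichment) for the table [[a, b], [c, d]].

    a: TE copies with a binding site      b: TE copies without
    c: background regions with a site     d: background regions without
    Returns P(X >= a) under the hypergeometric null with fixed margins.
    """
    row1 = a + b            # number of TE copies
    col1 = a + c            # number of regions with a binding site
    total = a + b + c + d
    upper = min(row1, col1)
    tail = sum(
        comb(col1, k) * comb(total - col1, row1 - k)
        for k in range(a, upper + 1)
    )
    return tail / comb(total, row1)


# Hypothetical counts: 30 of 100 TE copies vs. 200 of 2000 background regions.
p_value = fisher_exact_greater(30, 70, 200, 1800)
odds_ratio = (30 * 1800) / (70 * 200)
print(f"odds ratio = {odds_ratio:.2f}, one-sided p = {p_value:.3g}")
```

In practice the test would be repeated for each transcription factor, so the resulting p-values would normally need a multiple-testing correction.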