HERVK3
Basic information Differential Expression Stage analysis Survival analysis Correlation analysisDF ID | DF0000195 |
---|---|
TE superfamily | ERV2 |
TE class | LTR |
Species | Simiiformes |
Length | 7202 |
Kimura value | 9.82 |
Tau index | 1.0000 |
Description | Internal region of HERVK endogenous retrovirus, HERVK3 subfamily |
Comment | The associated long terminal repeat is LTR3, and has 6 bp TSDs. |
Sequence |
AGTGGCGTCCGAACACAGGGACTTCGAGGACGTGAACGAAGAAGGTCTGCTGGAGCAGAGGAACTGAAATTGACAAGGCGAACGGGGACCCCGGGACGAGTCTGCCGGCAGCGGATATAAGGTCAGTGCCCTAAAGAGGTACTGGGAGCAATATAAGGTCAGTGCCCTAAAGAGGTACTGGGAACGGGAAGTTTTCTGAATCAGNGGTAACATGGGGCAGAATTTGTCTATTGAAGAAAAACATTATCGTGCAGTTGCTTAAAGTTCTGTTGAGACAGTCTGGNGCTCAGGTTAGTTCNCAGACACTAACTAAGATGCCGCAGGAGGTTATTACGCATAACCCATGGTTTCCACAGGCAGGCACTCTTGATGTGGAAAATTGGGACAGAGCAGGAGAAGGATTAAAACGGGCTCATCAAAAAGGTCTTAAAGTTGATTCTTCTGTTTTCTCCACTTGGAGTTTAGTTCGTACTGTNCTTCTGCCATTATCTCCTTNTTATTCTGCNGGACAGCAGGAGTCATGTTCTGAGTCTAAAAATCTGAAAGAATCTGTTGTCCCACCCACAGCTCCAATTGAAAATAAAAAACAGGAGAGGGAGGATAAAAATTGGCCTATACCGCCTCCTCCAGTTGCAGAAACATCTGTACCGCCTCCTTCGGTAGCAGAAATAGAGACCCCAATACAAAGAATTTTACGCTCTGCTGCCATAGCTGGAGAGCCCTTAGGACCTNTGCGCTTTTCCTATTTCCGTAAGGCCTGATCCAAATAATCCACAGCAGNTTATTCATGAACACACCCCACTAGAGTTTAAGTTGTTGAAGGAATTAAAAGCTAAGTGTGGTNAATAATGGCGTACAGAGCCCATTCACTTTAGGATTGCTAGAATCTGTGTTTGGTGCTATGCGTCTTCTACCCTTTGATGTAAAACACTTGGCNCGAACTTGCTTGTCTGCTAGTGCATATCTGACATGGAATTTAAATTGGCAAGAAATGTGTGCAGACCAGGCTAGACAGAACCGTGCTGCTGGACACGGAGACATTACAGAGGATATGCTGTTAGGTAATGGCCCTTNATTCAGACCTGGAACGTCAAATGGCACTCCCAGACGCTGCTTATCAGCAGTGTGCACAGGCCGCTAAACGCGCCTGGGCCACAATTCCTGAAGAGGGAGTCCCAGTACAATCCTTTTTACATATCATGCAAGGGTCGCAGGAACCCTATGCGCAATTTCTTGCAAGATTACAAGAGGCAGTGAAGCGTCAGATTCCTCATACCGCTGCCGCAGAAATGCTAACCTTAACTCTAGCTTTTGAGAATGCAAACGCGGATTGTAAACGTGCACTGGCACCTGTGAGGTGTAAAAAAACTTGGGAAATTTTCTCAGAGCTTGTCAGGATGTAGGAACTGAGCTTCATCGCTCTGCAATGTTAGCNCAAGCAATGGCTAATTTAGCAGTTGACAAATCTAAAAGGAGCCAAGGGTCAAACCCTAAAATGGGAAAATGTTATAATTGTGGAAAAACTGGACATTTTAAAAAGGAATGCCGCCAGATCTCAGGACAGAAAGGACCTTACAATGCAGTNCCCCCCACCCCCGCGNNCCAGCGGAAAAAAACGCCAGGACTTTGTCCTCGCTGTAACAAAGGAAATCACTGGGCTAATCAGTGCCGCTCAAAATTTCATCAGAATGGCACCCCCCTGTCGGGAAACGAGANGGGGGCCTGGACCCGGGCCCCTCAAACAATGAGGGCATTCCCAGTCCAGACCACAACCCCGTTTCAGGGATGGGTTCCCGGAGGNACATTGATTCCCTCACCCCAGGAACACCAGGAAGTGCAGGATTAGATCTCCCCGCCAGAGAAAGAATTACGTTAGTTGGNGGAGACAAACCCACCAAAGTTCCCACTGGCATTTGGGGACCTTTACCAGCAGGATACATGGGACTAATTTTAGGCAAAAGCCGCCTTAACTTGCAAGGCATTACTGTAGTCCCAGGAGTNGTTGACTCCGATTATGAAGGAGAAATTCAAGTAGTTTTAATGTCACAAGATCTTTGGGTTTTTGAACCGGGAGAATATATTGCTCAATTATTGCTTATTCCCTGCAAATTACACCCTTCTCCACGAAAGGAGAAACGAGGAAATAAAGGGTTTGGGAGCACAACTACATGGGAAATCTATCTATCCNCAACCCATAGCCTCTAATAGACCCACCTGTGTAGTACAAATTAAAGGAAAGAAATTTTATGGGCTTATGGATACGGGAGCTGATGTGTCAGTAATATCTAGNAACGACTGGCCCCCATCCTGGCCCCTGCGATTAACTTCTACATCCCTAGTGGGAGTAGGAACAGCTCAAAGTGTTCAACAGAGTGCTGAGATTTTACCTTGTCTTGGTCCGGATGGACAGTCATGTACTTTTCAGCCTTATGTTGCAAATATAGCTATCAATTTATGGGGTCGAGACTTACTTACAGCATGGGATATGAGACTTACAAATGAAAACTTTGATAACCCAGGATTTAAAATGTTGAAGGACATGGGATATCAGAGTGGAAAAGGTTTAGGGAAATTCCTACAAGGAAACCCTAACCCGATATCAGTAACTGGAAAAACAGATAGAAAAGGGCTAGGACGTCAGGATTTCTGACGGGGGTCATTGATATTTCTCCTCCGCCCACTGCCTTACCATTAGAATGGCTNAGTGACAAACCTGTGTGGGTGGATCAATGGCCCCTANCACAGGAGAAGCTAGNTCAACTTCATCNGCTAGTAAAAGAGCAATTGGATGCAGGACATATAGAGAAGAGTTNCAGCCCCTGGAATTCACCGGTGTTTGTTATTCCAAAAAAGTCCGGAAGATGGTGACTGCTGCATGATTTGAGAGCTATTAATGCACAAATTAAACCGATGGGTGCATTACAGCAAGGTCTGCCATCCCCAGCGGCCATTCCAAGAGACTGGCCTCTCGTAGTAATAGATCTTAAGGATTGTTTCTTTACTATACCNTTACACGAGAAGGATAAGCCTCGATTTGCCTTCTCTGTGCCTTCTATTAATCAAAGAGAACCTGTTTCTCGTTATCAATGGAAAGTTTTACCCCAAGGCATGCTTAACAGTCCTACGCTATGTCAGCATTTTGTAGGACGGGCATTAAAGGAGCCTCGGAATATGTTTCCCACTGCTTACATCATTCATTNTATGGATGATATTCTTTTGGCCGCTCCTACAGATCAAATCTTACATCAGTTATTCAGAGAAACAAAGCGGGCTTTGACTAAATGGAATCTCAAAATNGCTCCAGAGAAGGTGCAAACAACTTCCCCATACCANTACTTAGGAACTATTGTTACGGAGAGAAGTGTACGGCCTCAGAAAGTAGTTCTCCGTAAAGACAGGTTACAGACTTTAAATGATTTTCAACAATTATTAGGGGATATTAATTGGCTGCGCCCGATGCTAGGTATTGCTACCTATCAACTTACACATCTTTACCAAACCCTGCAAGGAGATTCTTCNTTAAATTCCCCGCGGCAACTNACTAAAGAGGCAGAAGCCGAGTTACGGCTTGTAGAGCAGATGCTTCAGCAGAGACATGCCTCNCGGCTACAGCCGCAAAAACCTTTGCTTTTGTTTATTCTTCCTACCCCCCACTCTCCAACAGGACTTTTGGGCCAGTTCATAGACAAGTCTGTAACAGTAATAGAATGGCTCTTTCTACCTAATCAGTCAAAACCTTGCAAGTTTATCTTTCTTTAATTACACAAATTGTGACTATGGGCAGGCATAGGTCAAAAATGCTTACGGGATATGATCCNGACAAAATTATTGTTCCCTTAGACTCCCAGCAACAGGCCGCAGCNTGGGAAATGTCGACTGCNTGGCAAATCGCTTTCGCAGACTTCGTGGGAATAATAGATAACCACTATCCCTCAGACAAAATTTTGCAGTTTTATAAAGTCCATTCTTTCATTCTTCCTGTGATTACTCATCACAAGCCTATTCCAGGTGGACAGACTTATTTTACTGATGGCTCTTCCAAAGGTCGTGCAGCTATCTATGGACCTAAACATACTCAAACAATAATGACCTCTGGGGTTTCAGCTCAACGCTCAGAGCTAATTGCAGTCATTCAGGTTTTACAGCTCACAGCTTCAGATCCTATCAACATTGTCTGTGATTCAGCTTATGTTGTAAATGTAGCCAGTCGCATAGAAACTGCTACAATTAAAAGTACACTAGACCCAGAACTGCTTAATTTGTTTCTAAGACTTCANACAGCTATTCGCTCTCGTGCAGCTCCTTTTCATATTTCTCATATTCGCTCTCACACACAACTTCCTGGACCACTATCTCTAGGTAATGATAGAGCAGATAAACTGATTGGTTCTGTGTTTCAGCAAGCTCAAGCNTCTCNATGCGCTACTGCACCAAAACACCTCCGCCCTTACTCGCATGTTTCATCTGCCTCGCAGCCAAGCTAGGGCTATNGTACAAGCCTGTCCTACTTGCCAGCATGTCCCTGGNGCCGCACCTGTAGAAGGNTGTAACCCACGAGGTTTGGCTCCAAATGAAATCTGGCAAATGGATGTTACACACATAGCAGCCTTTGGCAAGCTTAGCTATGTTCTGTGANCTATAGACACTTATTCTCATATGCTGCATGCTACATGCCAAACAGGTGAGACAGCTGGTCATGTACGGCGACATTGTCTGTCATCATTTGCTCATATGGGGATACCTAAACAATTAAAAACTGACAATGGACCCGCTTATACTAGTCATGCTTTTCAAAATTTCTTACAGCTTTGGGCTATAACCCATAAAACAGGAATTCCTTATAATCCTAGAGGACAAGGCATTATAGAGCGGGCACATCAAACATTACAACGCATGTTGAAAAAACAAAAAGGGNGGTATAGGAGGCCAACTACCACCTCAATCAAAACTACATTTAGCCTTATTTACTTTAAATTTTTTNGACTCCTGGTACGGATGGTAAGACTCCAGCAGAAAGACATTGGCAAGTGTTAGAGGAAAAGAGGAAAGTTTATCCGAAAGTGTTATGGAAATCCCCGGAAGAAGNGACAATGGAAAGGTCCGGTGGATTTACTGACGTGGGGANGAGGGTATGCTTGTGTTTTTACAGGAGATGGACAAACCGTGTGGGTGCCCTCAAGGTGCGTGCGACCATGGAACGGGAGACTGGAGGAACCCAGGGTGGCCAACCATGGGCCCGGTCCCTCCGGTACGAGCCATGAGCCAGCTGAGCCTGAGTGCAAAGACGGAGAGAAGGCCGACCGGAGTCACGACGACATCAACCCCCATAACCTGGGGACAACTCAAGAAAACCACGCAGGAAGCTGAGAAACTACTGGAGCGTCAGGGNCAGGCAAAAACCCCTGATTCCATGTTCTTGGCCATGTTAGCCATAATGTCCTGTGCGGTATGTTTTCCCTGTGCAGAGGCAAAAACATATTGGGCATATGTTCCCAATCCCCCAGCAGTACGACCTGTACTTTGGAGTGACACTCCTCCTGAGATTTATCATGATCAGGGAGCGTGGGCTCCAGGACCCCTAACTCCCCTGACANTAGAACAGTTAGACTCTCAGAACAATGTCATCAATTATACCGCTCCACTGGAAGGACTCCCTTTGTGTATCACCACAAAGACGTCGCTCAGCCGTAGCTGTCTTACAATTCAAGCTCAAGCATGGTTGAGTCACTATGGAAAAGTCATGTACTTATTAGGTCTTGGTTCTATTAATGTAACTGGTGTGCTAACCAACCATTCCCGGCCCAATCGCCCTAATTGTGCTGACTATACGGAATGGATTCCCTTCAATAGTTCCTACCCCCCCTCNCGTGGACCCAGTGTCTTGGCCCACTGGCTAGAAAACAATCTATGTTAACTGGAGACATTGTGGATTGGGGACCTAAAGGTCAATTAGATGGAAAAGATGAAAATCAGAAATCATGGCACAAACTTCGCTGGCATTGGTGGCAAGCTTTTAATGCTTCTTCTTTATACNACACCGGGATCCAATCCCAGTCTGCCGCCCAGATTGCTTGGCATGGAGCAGGCTTTAGCCCGCCTCTTCCTCAGTGGCATTATCTAGGGAGGAAAGGACCAATTCAAGAGACGATATGGAAGGCAGCACTCCCATTTACGAATGGAGCATCTGGGTTNGGGATACTATCCAATAATAGCAATAGTAAGCGACACAGTCTTAATGTTACATTTGTAAAGAATATCACCACTCAATTTACGGTTTGTGTTTTTAATCCTTATGTCTTTTTGGCAGCTAAGAAGGACCAGCTCCAGGTAAACAATACCCAATTGACCTGTAAATCTTGCCAGTTATATCACTGCATTAATCATAGCACATTGCAAACACATAATATCTCTACTTTGATGATTTTGGGTCGCATCCCTGGGCTATGGATTCCTGTTAATCTGTCCGAGCCTTGGGCTGCCACACCTGCTTTGCATTTTGTGAAACTTCTTCTAACTCAGCTTACTCATCGTGTCCGTAGAGCCTTAGGCATGATAATTTTTGCTATTGTTTCCTTGGTCACACTAATAACTTCTGTTGTGATGTCCTCTGTAGCTTTGCATAGTTCTGTTCAAACAGCTCAGTACGTGGAGAACTGGACGCGCACAGCCGACCAAGCGTGGCTACTTCAGAATAAAATTAACACTGAGTTACAAACTGAAGTGGCAATGTTGAAATCCACGGTTCTATGGTTAGGGGAACAAGTACAAAGCTTGCAGTTGCAGCAGCAATTGCGTTGTCATTTTAATCACACTCATATTTGTGTAACCAACTTAGAATATAACCAAAGTGAGTATCCGTGGGACCTTGTGAAAGCCCATTTGCAGGGAGCTTTCACATCCAACATCACCTTTGATATTGGTGAATTACAAAACAAAATTCTTGATTTAAATAGGCAAACTCAAGAGTTTCAGCCTTCTTTAGAAGACTGGACCGAATTCCAGCAAGGCCTGGAGAGCCTCAACCCTTGGACCTATCTAAGGCACCACATTAACATCTTATATGTAGTTCTTGGAATAATGTTGTTTTGTCTCTGTCTTCTGTTCATAGTCTGTAAAATCGGATGGACCGCCAATCGGAGAATGAGAGCTGCCCAGCCTGGCCTTACATTCTTTCAATTAATNCATAAACAGAAAGGGGGATA
|
TF motifs of the concenus sequence
Use FIMO to detect transcription factor motifs in the concenus sequence of the TE family.
TE_family | TFBS | Start | End | Strand | Score | Matched sequence |
---|---|---|---|---|---|---|
HERVK3 | nfya-1 | 4380 | 4389 | - | 17.16 | AACCAATCAG |
HERVK3 | erm | 2775 | 2785 | - | 17.15 | ATTGCTCTTTT |
HERVK3 | M1BP | 6575 | 6585 | + | 17.11 | TGGTCACACTA |
HERVK3 | skn-1 | 3128 | 3141 | - | 16.97 | AAAATGCTGACATA |
HERVK3 | RELA | 5068 | 5077 | - | 16.96 | GGGGATTTCC |
HERVK3 | eor-1 | 5281 | 5293 | + | 16.87 | AAAGACGGAGAGA |
HERVK3 | NR2F6 | 119 | 133 | - | 16.83 | TAGGGCACTGACCTT |
HERVK3 | NR2F6 | 155 | 169 | - | 16.83 | TAGGGCACTGACCTT |
HERVK3 | AT1G72740 | 2584 | 2594 | - | 16.68 | GGTTAGGGTTT |
HERVK3 | NR2F6 | 119 | 133 | + | 16.63 | AAGGTCAGTGCCCTA |
TFBS enrichment in GRCh38
Use Fisher's exact test to perform enrichment analysis of transcription factor binding sites in the TE family of GRCh38.