HERVFH21
Basic information Differential Expression Stage analysis Survival analysis Correlation analysisDF ID | DF0000182 |
---|---|
TE superfamily | ERV1 |
TE class | LTR |
Species | Simiiformes |
Length | 6529 |
Kimura value | 12.49 |
Tau index | 0.8642 |
Description | Internal region of ERV1 endogenous retrovirus, HERVFH21 subfamily |
Comment | HERVFH21 is the internal region of HERVH-related endogenous retrovirus, and is flanked by LTR21 long terminal repeats, and has 5 bp TSDs. It has a PBS site similar to Phe tRNA binding site in intracisternal A-particles from mouse and hamster genomes. |
Sequence |
TTCTGGTGCCGAAACCCGGGAAGGGGATAGGCTCTGGCCGGGANTCACTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCCCTCTCCCTCCCCTATCACCCCTCTCCCGGCCGANCTCCCCCTTCCCGAACCTGCCGAAGACCCGGAGGATCTCCTAGACCCTCCCATTGCTGGCGACCNCATCCANCACCAGGGCCTCCGCAGGGGTGAGTNAAAAGAGACTCTTGCCATTCCCCGGAACCCTTGACCGTCCGTCTCTCCNTTCCCGAAAGACCCAGCGCTGGGCCAAGGGCTTCCTCCCGCCTCCGGGCCTCCAGGCGNCCCTCGGTTCCTCCGTTTCAGGGACGCCTGACNCGGCGGTTACCTCCCCTATTCCGGACAAGNCCCGCGGGACAGGGGACGCCCTCTCCCGCCGTCCTTGTCACCGGCAGTCTCTTCTTCCTCTACCCCCCCCTCCTCCATTCACCATGGGAGCCTCTCAGTCTACTCCCTCNAAGACNNCCCCCCTGGGTGTCTCCTCCGCAACCTCAACGCCCTCGGCCTCCGTTCAGAAATCCGNCCNAAAAGGCTTATCTTTTTCAACATNCTCAGACCTAGATAATTTTTGCCATCGAAATGGGAAATGGTCTGAGGTGCCTTACGTCCAGGCATTTTTCACACTTCGTNCCCNCCCTNCCCTCTGTCAGTCCCGTTCCACTTCCCAAATCCTCCTCNCCCNCTCCNANCCTGNCTCGCCTCCTANCCCCCNCCCCACAGCCCCAGCTGACGATTCCTCTTCCTTTGACCCNNCCGACTTTCCCCTTCCCCGNCGGCATCACGNTCCTCCGCCAGAGCGTCCCGATCCTCCACCGTATGCCCCCGCTCCGGCTCTACCTTTCTCCCCTCCTCTCCCCAACCACCCNGCTTCTGACTCTGGGTCCTCTCCNTCTCCACCCCTCACCCGCTCTCNGGCCCAACNTGCCCAGCAACCAGCTCCCCTGCTTCCTCTTAGAGAGGTGGCTGGAGCTGAAGGCATAGTCAGGGTACATGTNCCTTTTTCTCTATCAGACCTTTCCCAAATCAGCCAGCGTTTAGGCTCTTTCTCATCAGACCCCGNCACTTATATACAGGAATTCCAATATCTAACTCNGTCCTACAATTTAACCTGGAGTGACTTAAATGTCATCCTGACTTCTACCCTCTCCCCAGATGAACGGGAAAGATTTNTCTCNTAGCCCAATCTCACGCTGATAACCTTCATGAGCCAGACCCCANNAAGGCNNTAGAGGCCGCCGCAGTTCCCCGAGAGGANCCCCNNCGGGANTACCAGCCCGCAGACCCCGGCCGGGCATCTCGAGATTACATGGTTTCCTGCCTAGTTGAAGGGCTTAAAAAGGCAGCNTACAAAGCTGTTAATTATGACAAGCTTAAAGAAACTACCCAAGGTAAAGACGAAAACCCAGCCCAGTTCATGGCCCGCTTNGCAGCGACCCTNAGACGCTNTACCGCCCTAGACCCNGANGGNCCGGAAGGCCGCCTTATTCTTAATATGCATTTTATCACCCAGTCCGCTCCCGACATTAGGAAAAAGCTTCAAAAATTAGANTCCGGCCCTCAAACCCCACAACAGGACTTAATCAACCTCGCCTTCAAGGTGTNCAATAATAGNGANGAGGNAAGNNAGGCAGCAACANGCAGAGTTNCAANTACTTGCCTCCGCTGTNAGANGCCCTNCAGNCCACGGGACGCAGCTCCACACGGAAGCCTCCTAGCAGTTCACCTCCNCCNGGANCTTGCTTCAAGTGCGGCAANGAAGGCCACTGGGCCAAGGAATGCCCCAGCCCAGGNATTCCTCCTAAGCCGTGCCCCTCTGTGCGGGACCCCACTGGAAGTCGGACTGTNANTNGCCCCCGCAAGGACCGCCCCCANCCCTTCCCGAGCCGGCCGAAACCTCCTTCCCAGATCTCCTCGGCTTAGCGGCTGAAGACTGACGCTGCCCNGGAACGGACGCCCCGGNAGCTACCATCACNGCATCCGAGCTTCGGGTAACTCTTACAGTGGAGGGTAAGTCNGTCCCCTTNNTTTNATCGATACGGGGGCTACCCACTCCGCATTACCTTCTTTTCAAGGGCCCGCTCCGTCCTCCCNNGTCTCTGTTGTGGGTATTGACGGCCAGGCTTCNAAACCCCTTAAAACTCCCCNACTCTGGTGCCACTTGGACAACATTNCTTTTACGCACTCTTTTTTAGTTATCCCCACCTGCCCAGCTCCCTTATTAGGCCGAGACATTTTANCNAAATTNTCCGCTTCCCNTGTNTTCCTGGCNACTACAGCCACANTCCCTCCCGCCCCTCTCCGCCAGTCCAGCCCCTNACCCCTCTCCCCGGCATCCNCTCCCCACCTCCCNCGTTAACCCACAAGTATGGGACACCTCTACTCCCTCCCTGGCGACCGATCACGACCCCATTACCATCCCATTAAAAAATCCCCCTAACCCGCCCAACGNTCNCCAATATCCCATCCCACNAGCACGCTTAAAAGGATTAAAGCCTGTTATCANCNNGCTCTGCTCACGNCGTCTTCTAAAACCTACAAACTCTCCTTACAATTCCCCCATTCTACCTGTCCAAAAACCGGACAAGTCTTACAGGTTAGTTCAGGATCTNCGCCTTATCAACCAAATTGTCTTGCCTATCCACCCCGTGGTGCCNAACCCGTATACTCTCCTNTCNCTCATNCCTTCCTCCACAACCCACTATTCCGTTCTTGACCTNAAAGATGCTTTCTTCACTATTCCCCTGCACCCCTCNTCCCAGCCTCTCTTTGCTTTCACCTGGACTGACCCTGACACCCATCAGTCCCAGCAGCTTACCTGGGCTGTACTGCCGCAAGGCTTCAGGGACAGCCCTCATTACTTCGGCCAAGCTCTTTCTCATGATCTTACTTCCTTNNATCCGTCCCCCNGCCACCTTATTCAATATATTGATGACCTTCTNCTTTGTAGCCCCTCCTTNGAANNCTCCCAAACNCACACTNCCGCCCTTCTAAACTTCTTTNCTAANAAAGGATATCGGGTATCCCCTCCAAAGCNCAAATTTCTNCCTCCATCGTTACCTACCTCGGCATNNAACTCTTCNTNAAAACACGNGCTNTGACCCCTGCCCGAGCGGCCTTAATNGATAANTAATCTACCCCANCCCCTTCTAAAAACGAAATCCTTTCCTTCCTAGGCATGGTTGGATACTTTCGNCTTTGGATACCCGGTTTTGCCATCCTAACNAAACCATTATATAAACTCGCAAAAGGCCCCCTCGATGANCCCCTAAACCCCTCNCNTAACCTACTCCCCANCTTCCGTNNACTCCAAACAGCTCTNGNNACCGCNCCCGCTCTANCCCTNCCCGACNTCTCCCAACCCTTCTCNCTNCACACAGCCGAAGNCCGAGGNNTNGCNCTCGGNGTCTTAGGACAACAGAAGGGAANTCCTCCNTCCTTTGCCCCTGTAGCCTTTCTGTCCAAACAACTTGACCTTACTGTTTTAGGCTGGCCNTCATGTCTCCGNGCGNCGGCNGCNGCCGCCACTTTTAGCCCTCGAAATCACGAAACTAACGCTCAACTNNNNCGNCACTCTCTACAGTTCTCATAACTTCCAAAATCTATTTTCCTCCCGAGCATTANGCTCCCTTTCTGCTCCCCGGCTCCNNTTNCTCTATNCNCTCTTTGTTGAGTCTCCCAAATTCAGTCTTGCCAANAGTGCTCCCTTCAATCCGGCCTCCTTATTCCCGATATCCNCCTCCCCNCCTACCCATTCNTGCACTGATATCCTGATCCACCTGCATTCACNCTTTCCCCATATTTCCTTCTTTCCTGTTCCTCACCCCGNTGATCAACTGTTTATTGATGGCAGTTCCACCAGGCCTAATCGCTCACCCAAAGNNGCAGGCTATGCTATTGTCTTCCACATCTANNTCATTGAGGCTACNNCCCTGCCTCCANNNACTACCTCTCAGCAAGCCGAACTCATTGCCTNTAACTCGAGCCCTCACTCTTGCAAAAGGACTACGCGTCAATATTTATACTGACTCTAAATATGCCTTCCATATCCTGCACCACCATGCTGTTATATGGGCNGAAAGAGGTTTCCTCACTACGCAAGGGTCCTCCATCATTAATGCCTCCNNTTAATTAAAAACTCCTTNAGGCCGCTTTACTTCCAAAGGAAGCTGGAGTCATTCACTGCAAGGGCCATCAAANAGGCTCAGANNNNCCNNTGANNCGNCGNATCAATCTCAAGAGGNAACANNAANGCCGATAAGGCAGCAAAAAACAGNACCTCAAAAGGGATCAGANNTGAAATCTCAAGAGGNAACGNNNANGCCGATGAGGCAGCAAAAGAAGCCTCCCTTTCTTCTGTCCCTGCCTCTCTCCTCCTCATTACCCCNGCANTCCNACCCAAGTACTCCCCCACTNAGAAANNGCTTCGCTACTACAGCAAGGAGCCTCCCTCCAAGGGGACTGGATAGTCAAAGATCAAAAGCTCNTCCTCCCCCAAGAGCAAGCCAANNNNATTCTGACATCNCTTCACCAACCCTTCCATGTNGGTGCGCGCCCCCNGTACCTGCCCCTTCGCCCTCATTTCTCCTCCCCCCATCTATTCACCTCACTAAAGGACATAACCTCNAACTGTCNCATNTGCTNTGCTACTTCCTCCCAAGGGGCCCTCCGCTCTCCCCCCATCCCTACACATCAGCTCAGAGGANCGCTCCCAGGGNAGGACTGGCAAATNGACTTCACCCACATGCCTCCCGTCAAGAAAACAAAATATCTTCTTANTCTNGTAGACACTTTCNCTGGNTGGGTAGAGGCNTTTCCCACNNCNTCNGAGAAGGCCGCNGNAGTCTCCCAAATTCTCGTAACAGANATNATCCCTNGGTTTGGCCTCCCCANCTCCATACAGTCNGACAACGGNCCNTAGCTTCATCNNCCAAATCACCCAACCGGTTTCTCAGNCCCTTGGTATTCAGTGGCGCCTCCATATCCCNTACCGNCCCCAGTCNTCAGGAAAAGTCGAAAGGGCAAATGGNATTCTNAAAACNCAGTTNACCAAACTCACCCTCGAAGTCNAAAAACCNTGGACCTCCCTTTTACCCATAGCNCTGGCCCGCATCAGAGCCAGTCCAAAAGCACCCTCCTTCCTCAGTCCATTTGAGTTAATGTATGGACGCCCTTTCCTCTTACAAAACAGGCCCCCTCCTAACTCTCAGTTAGGAGAATACCTCCCAACACTCTCCCTCATCCGCCATCTCCTCCGCGAACAAGCNGACCAGGNCCCTCCCAAAACCCCACGAAGGCCCTTGACACCTCCCACCTCCTTAACAAAAATTCAGGGTACTGCAACGGCCGACACCTCCCCTGTTTATCCCTCTTCCCTTGGCTCCCCTCTCCGTGCACCATCCCTGCCCNNNACCCACCACCCGCGACTGTCTCCTTATCCCANCGTTCAACAACACTCCCACANGNNTCTTAGTGGACACAAAACGCTTCCTCCTACACTGGGAAAAAAAAAAANNAACNCAAAGAGCCNCCCAGCCTAANCCAAACACCCCCTTACAACCACTCNCCGCAGCGGCCCTAGCCGGCACCCTAGGAGCATGGATGCACGAGGACAACAAAATTNTACACCTTTTTAGCATACACAACCAGTTCTGCCTACCNAGCCAAGGCATATTCTTCCTATGCGGCACCTCNACCTATNTCTGCCTCCCCNCCAACTGGACAGGCACCTNCGCACCCTGGTTTTCCTCAGCCCNAAAATCGACGTTGCCCCAGGAGACCAACCCCTACCAATCCCNGTTAANACCCANATCCGACACCGCCGNGCCATACAGCTCATACCCCTNCTAGTAGGCCTAGGAATNACNGCAGNAGTNGGAACCGGAGTNGCGGGATTNGCNACCTCCCTNTCCTATTACCAATCCCTCTCCAAAGACCTTACGGANAGCTTGGAAGACATNGCCAANTCCATTNCNACCCTCCAANCNCAAATAGACTCTTAGCAGCAGTCGCCCTNCAAAACCGCAGAGGCCTNGACCTNCTCACCGCCGAAAAAGGAGGNCTCTGCNTCTTTCTAGATGAANAGTGCTGCTTTTATCTCAACCAGTCAGGCTTAGTNCAAGATGCNGTCAAAAAACTNAAAGACCGAGCNCAAAAAATCAAAGAAAACGTCCTCTGCTGGCCCNCNTGGCCCTCNTGGTCCCTCAGCNCCTGGGCTCCCTGGCTACTGCCCCTCCTNGGCCCNGCCATAACCNTTCTTCTCCTTCTAGCNTTCGGNCCCTGCCTCNTACGCCTCCTCACCCAGTTTTTACAGGACCGTATCAGAGCCTTCACCCACGGAACAATACAAGATATGATGCTGCTCCAGGAATACCGACGGCTCCAAGAACGGCAGTCCCTACCNTCCAGCCTTTCCCCNCAACCGCCGCCCCTTCCCAGCNNGAAGCAGCCAGACGACAACGGCGCCCCTCTTCTATTACCTATTAAAAGGCTGGAA
|
TF motifs of the concenus sequence
Use FIMO to detect transcription factor motifs in the concenus sequence of the TE family.
TE_family | TFBS | Start | End | Strand | Score | Matched sequence |
---|---|---|---|---|---|---|
HERVFH21 | BPC5 | 48 | 77 | - | 59.00 | AGAGAGAGAGAGAGAGAGAGAGAGAGAGAG |
HERVFH21 | BPC5 | 50 | 79 | - | 59.00 | AGAGAGAGAGAGAGAGAGAGAGAGAGAGAG |
HERVFH21 | BPC5 | 52 | 81 | - | 55.54 | GGAGAGAGAGAGAGAGAGAGAGAGAGAGAG |
HERVFH21 | BPC5 | 46 | 75 | - | 47.17 | AGAGAGAGAGAGAGAGAGAGAGAGAGAGTG |
HERVFH21 | BPC5 | 56 | 85 | - | 47.17 | AGAGGGAGAGAGAGAGAGAGAGAGAGAGAG |
HERVFH21 | BPC5 | 54 | 83 | - | 47.13 | AGGGAGAGAGAGAGAGAGAGAGAGAGAGAG |
HERVFH21 | BPC5 | 58 | 87 | - | 43.72 | GGAGAGGGAGAGAGAGAGAGAGAGAGAGAG |
HERVFH21 | BPC1 | 49 | 72 | - | 43.52 | GAGAGAGAGAGAGAGAGAGAGAGA |
HERVFH21 | BPC1 | 51 | 74 | - | 43.52 | GAGAGAGAGAGAGAGAGAGAGAGA |
HERVFH21 | BPC1 | 53 | 76 | - | 43.52 | GAGAGAGAGAGAGAGAGAGAGAGA |
TFBS enrichment in GRCh38
Use Fisher's exact test to perform enrichment analysis of transcription factor binding sites in the TE family of GRCh38.