Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GGCCCGGTCCAACGCCAATGTTATTGATTTGAACCCAATTAAAAATGGAGTGAAAAGAACGCCAACTCGGTGAACTGCATATGGTGATCGAATTCCTGGCAACACACACTATAAACTCTCAATTCGTCCGGCAATCCTCTCTATCTGTTCGAGTGTTGCAGCAAAAGAGAAGTATTCCTCAAATATGCCGCTAAAGCCAGAACCTTGCAGAAATTTTCTGCGTGGACAGTAAGCTACAATTTCTTTCTCCTCTACCTTTCCTTTTTTTTTTCTTTGTTTTGAATAATTATTTTCTGCGCTGAAGTTTGAACTTCAATGGTAATGCTAAAACTTCTGCTTATTTCTTCAATACCCTTTAATTTATCGGGTGTATGCTATTGTGGGTGTCCTTTGATTTCATTATTTTGTGTAATTTTCGTGTTTTTGCAATTGGGTTTTCGTGGGGTTGGTATTTGGGAAATTTTTGTTCTCAATTTGACATTTAAGTAAGAAGAAAATTACTTATTATTCAGTTAACTGTGAAATTTTGTGTTATATACATCATTGTTAGTGTTAATTTGGTAGAGAATTCGTTGATATATGGAAGAATTTGGAGAAAACTAAGAACAGCACTGTAGCAAAGTTACTTTCTAGAAACAGAATGTAATTTCTGCTAAGATGACGAGTTTTACTGAATAGTTCAGCAGTTCCAAAGAATGAATTGGCTTATGATTGATCCCAATGTGATTTTTTACGAGTTTGGTGGAAAAAAACGAAAGGAAAGAATATAAGGAACTTTCCGATTTATGGTTTTAGTGATTGTACAAGAATTCGTAGGAAAAGTAGTGATAGGGAAGAATTTACATAAGAATTCTGATTACGTAAAGTCGTAAAGTTCCTTCATTGTCACTAAGTTCCCCACAATTAGTTGGTCGGAAAAAAATCCTTTCAGTTCCTGGTAACTCATCCAAACAAATGAAATGCTATTTCTTCCCCCTTCTCATCCCTTTACCCCTTCATCTCATTTTAATTAAAAGTTAATAGAGGTCGATGCATAACCATAATGAGATGAATAAGCAAAGAGAAATGCAGTGGTACTTTTGTTATTTTCTTTTTCCATTCCCTTTTTATTATTTTGTGGGGATGTGGGGTAACAAGAAGTTGTTGAATTGCTACTTTTTGCCGGTATTTGTGGGTTAAGTTTGTAAGATCTAGCAAACATCAACAATGACTAAATCCTTCGTTTTTAATGATCCCGTTAAACACCTTTCTGGCTTTCTCCAAAATAATTGTAAATGTTATTAGTGTTATCTTTATTCAATGTTTTACCTGATCACCAACTTCTTCTCATTACTTGACTTTTCAGTGCTATGAACAAGTCTGCTAGCAGTTTAAGTTATTAACTGCAATTTCCAAAGCATTAAGTGTTTTGTTTGGGGATTGAGAGAATAAATTGAAGATTTCAGTTGTTTGGCCCATATCATGTTACTTTTTTAATGTGACAGTTCATTACTTTTTAGTTATTTATTTACCAAGTATGGACCCCATCATTTTCATTTGATTTGGTAAATAGTTTTACACTCCTCCTTTTACACAATTAGGGTCCTTGGGAGAATAGGCTTGCTTGTTAGAAGTTAGGAAGACTTGGGCTTCCTTGGATTGGGGTTAGACGAATTAAGTTGTGATGGTGTGAGGAGGAGGGAGGACTGGGAGAATTAGTACCCTTATTTTGTGAACTTATTACCTTTTTTGATTGATATGGATGAGCACCATGGATTGATAAATGCTGCAAAGTTTAAGATAGGCTTCCCTTGCAAACCATAAGTTAATATTTGTGAACCCTTAATTTTAGATTAGAGGTCAGACTGGTTCCTCTTGTAGTTCTCTTGATATCCCTTACTCTTATTTGATCCAAGTCTACTTTGTAGAAAAGGAAAGGGTGGTTCATTGAATTTCCTTATTGCCTTATTTTTCCACCATTATTTCTGCATTGGAAAGACTATGAGGCGTCTATATGTGGCATGTTATTACAATAATTTTGCGAATTTGGGATTGAAGTACTGAGAATAAGGAGCTCGTATGAAAGATCTCTGCAAATTCATAGATAAGTCTATGCATGCAACATTAGCGTAACAAAAGGATTTCGTATTTTATTTTCTGCTGCTTGGAGATTAGCTGGCTAATTTTTGGTTTTTTGATAATGACAATGTTCATATTTCTTTCAGTTGCCGTTTTGGTGATAAATGTAACTTTCCGCACGTCACTCAGCAGCAGCCAAAGCCCAATGCTTTTGGCTTTGGCTCGCAGACTGCCACACAATCCCAACAGCAGAAGCCCAATCCTTTTGGTTTTGGTGTCCAGAACTCGACCAGTGGAGCTCCTAATTCTGGAGGATTTGGGCAATCTAATTCACAAAAGGTTCTTGTCCCTACTTATCTGAATTTCTGTGTTCGCTTATTGTGTGCTTGTCTTCTGCTTGTACAGTTGGTCTGAACTTGTTGTGATTTCTATTTAGTTTATTTTAATTTTTTTTTTAAATTTTTAAAATAGAGTTTCTGGCTACTCCGTACCATTTATGAAGAGCTTCAAGTTTTTGTTTGCTGATTTTAATTCATATGAATTCACAGCCTTTTGAGAATAAATGGGTCCGCCCAGAGAAGCCCCAGAATCAGTCTCAGGCTGCTAGTCACAAGTGAGCTTCAAGTTTTTTTTCTTTCTCTTTATGTCCTTTACTGTAAAGTTAGAGGGACAACTTTCTAGTTTGTTAGCTCTAGAATTTTGTCTTTGCTTATGATGTGGGAATGTGATATGTCGATAGCTGCACAGATCCTGATTCTTGCAAACGCCAAATCGCGGACGATTTTCAGCATGAGAGTCCACTTTGGAAGCTAACTTGTTATGGACATAATAAACAGTAAGTCTTTCTTGGTAACTATAGTCGGTGGATTTTGTTTACCAGGTTTGTTTTTCTTAGCCTGTGGTTTTGTTTGTCGAGAAGGCAGGAATTGCTCTTATTTAATTATTTTCATTACTATGAGAACAACGTAAGTGGTTGAGTTCTCCAGTTCACCATGATTCTATTGAACCAATAAATAAATTTTTTATGTAGTTATTAGGAAAGATTGTTTTAAGCCTGTCACATAAATTTCTTATTGTATGGTTCCTTAATCCTTATGCTACATGTATGACTGTATTGACATATATCTGCTGATAGATTTTGTATTAGAGCAAGAAGGGCTACCAAGAGGGTTTTCTGAGTGATTAAGGAAATTAAAATTTTAAAATTTGACCTTCCTCGGATCCTACTGTGCAATCTATTGTGTAAAGTGTCACCCGCTTCTTTAGAGGGTTCAGTGAAGGAAGAGCATGTCGTTTATACAATACAAAGACTTCGAAAGAATGATCTTATAGAAGCTGTTCTTGCCTAATTACACTTGTGAGATCATATTGGATTTGGTGTTCAAATGCTGTTGAATTTCCACCCTCTTACAAAAGGTTTATCTTATTTCAAAAATATATTCATGTTCATGGGATAACTTACGCCTCTAGACATTTTTAAACTGAACCAAGGTTTCCTGCTAAATATGTTGACCCCGTGGATCCTTGGTTATTCTTCTTCTGTTCAATGAATGGATGGAAACAAAAATCAAGCTTGGTATGAGTACATGTGAATGAAAGTTAACGCCTTAACGGCAAGGAAAATCTTTTCAGTATCCAGCTTCTATTAGTAACACAATCAAACTGCCAAACTGGTGGTTTGAGATAGTGAGATTTATATAACCTCACATGCAAATTGTAGACAAGTGGTTCCTGCCTTCCTGGTGAACCCTAGAATAAACTGTTGACTGTCTGAATACTTCCCAAACTATGTACGGAGTATTACTTACTCTGTCTTGAAAAGATGACGATTTTTTTTTTAATTTTGTCAGTATATCCTTCCAAATTATAGTTAAAATTACCTCAGAATATAGCTGCAACTGCACTGTAATTTTGCATAGCAAGATATAAATGAGAGCGAAACCGTACAAACATGTCCCTTTTTTTTCACTCCCTAAATGATTTTAATGTGTTGGGGGAAACGGGAGTATATCGGAAGGTGATATTCCTCTCAAAGAAAAATAGGTGATTACTATCCATTGATATCTTTCTATCTGTGTTAATAATTTTGGAACATTTGGTTGCTCTGAAACTTTTTCGTACCAGTATTCCAAGGAAAGGAGTTCTGAAAATGTTTCTCTTGCAGTTTGCCATGTGATATTTCCGGTGATATTAGCTATGAAGAATTGCGGGCAGTTGCATATGATGATGCAAAACGTGGCACTAGTTTGCCGTCAATTGTAAGCCTTTTACCTTCACTGCTAAAAGAACAAATGTTGCACTAGTTTGCTGTTATTTGAAATTATTGTCCCAAGGGTCTTAGATGATCACCATTTATCACATACTTGTTGCATATTCTAAAATTTCCCTTCCTAAATCCTAGTGCTTGCAAGATACATACAATACTCGTGACGAACTTTTCCTTGTTTATGTTGATTAACAAAATAATTTACACGTGACATAGCTGTATACAGTAGTTTGTCTAAAGTCTGATTAAGTTTTGAATTTTGATCTACTGAGATGAAAACTGTATCTATTGATGCTGATGGGAGTGTACAAATCTTTTTAAGTAAGATTATAGTAGCCTTTTTAATATATTTGATTATCTGCATTGTCAGAAATATAATATTTTTTGGAAAAATTTATATTATTGGTCCCAACTGTGGCCGATTTACTTTAAAGGTTCAACTTTTGATTATCTTAGAGCGGTCCCAACTCCCAACTATAGCGAGTCCTTTCCAAAAATTGTCCCTTAGTTAAAATTAAGTGGTATAACTTAATTAGGACTCATAACTCTCATGCAATAATCATATTTTTTAATTAAATGAAGTTAATGGAATAGTATGATATGCTTAAATCAACTTATTTAACAGTTTAATTAAGCTAAAGGACCATTTTGGGAAAACACACTCAAAGTTGGGACCGCTCTAGGATAATCGAAAGTTAAGACCTTTTAAAGTGAAGTGACCATATTCGGGATTGTCAGTATCAATTTTTCCTTTTTTTAACAGAGAAATTTTTTGGTAGCCTTTTGTGGTAACTCTTTGTTTCATGATCTTTTTCCTGCCAAAGTAGGTCTTCTTTTGGGTCAAAATCTTACTGATTCTCATTGGTTTGGTTCTGTGTATGAGTCTTCAATATTTAGCTGCATGGAAACTCTTTGTTTCTTGGCCTTTTTTATTGAGTTAGTTTGTGATGCATTTGTTCTGTCAGCCAAGGACTGTTGTTTAAGTCTAGAATTAGTAGCTTCATTGTTTCCTTCTGAGATTCAGACATCTGGGTTCTGGCCTTCTGGGTGACATTCTGTACTTTGAGTTTGTTCAGATTTTTTACTACGACCATGGTTAAACTCTTTTTTTGCAGGTTGAAAGAGAGAGGGGTTTAGTGAATTCTAAGAAGATTGAATTTCAAAACTTGCTTAATAATCCTTACACAAAGCACACAGTTCCTGCTCAGTCCAACCAAAGTCCATTTCCTGCAGCGGTTGCCAGCTCACCATCACCGACACCTCAAAATAACGGCTTCTCTTCATTTGCAGCTTCAAATCAACAGAATCCACCACCTAACCTGTCATTTGGTGTAAGGCAAGTGTCTTATTTTTTTATTTTATTGCTAAAATTTAAGGGATGGTTTAAAATTTTGATGGCTTCAACCAGTAGAACACTACTTCCTCCACTTTTTTTTTGTTAGTTGCAACCAAGTGAACTTTTGCACTATTCACATACTCCCTTATGACCATCTTAGGTGATTTATAGGCCAGTTAAAACAGTCTTGTTGGATCTTATTAGATCCATCTGATTGTATATTTTCAAATATCTAGTTTTTATAACTTCTATTTGTGAATAATTAAAGATATTTATGGTCTAAGTAGTGCGTTGACAAGCGTGAAAGTCAAATGGTTGCAACTAAAAAAGAATGGAGGAAGTATATCACTATCTTAGGCTTGGGCATGTAGTTCCAGGTAGTTGTGTGAAGTATGTACATTAAGCTCATTCGTACCAGGACAATGAATATAAACAGTAACCTAAGAGAGGCTTAAAAGAACATGTTTATGCCAAACAAAGTGTTACCCTTTAAACTAGTAGTAGTACCATCTAATATGTGAATTTTGGAGCGTAAGATGAAAATTTCTCTCTACAAAGTTTCTAAACTTCTACTATCTGCTAACAATTAACCTGATTTGGTCGACCATTCACTTATTGCAGGCCTTCTACTCCATCAAATAGTAGTGGTTTTGGGCAATTCCAAGGTCCTGGTCAATTTACGACAAATACTGTTCCTTCAGGTTCGCAAGTGCACCTTTGAACTCTCAATGCATCTAGCATGTACTATCCTCTGTTTTTTTGCCGTTTTATATACTTCGTATTTAAGTACTACTGTACTAGACTCCTACATGAGATGATGAGAATGGATACTTGGACACTTCAAGAGAAAACATGCACATATTTCTGATATACTTCGTATGTGCATGTCCAAGGACTGTACATGTATATTGGATAATTTAGCATACAAGTACCATACATGGTATCCTTTCATCAGTAGTGTCTCCTCCATACATCTCATCTCGCAACATGGAGATAAATTATTTAACTAGGTTCTCCAATCTGTAGGTCCCTTTAGCACTCCAGTTCCTGCACAAACAGGGGGAAATCTGCCTGCTTCCAACAATAGTGGCTTTGGCAATAGTGGCTTCGGCAATAATGGATTGAAAAGTGGTATTCCTCCATACATCTCCGTATCTTGCAACATGGAGATGAATTGTTTAACTAACTTCTACCGTCTCTAGGTTCCTTTGGCACTCAAGTTCCTGCACAAACAGGGGGAAATTTTGCGCCAAACATAGGTGGCTTCGGTAACAGTGCCATGAGTGGTCAAGGTAGTCTATTCCCTGCACAAACTGGGGGGAATACCTTCACTCCAAATTCATTAGGCTTTGGCAATGTTGGGATGACTAGTCAAGTCCCAGTGCAAATAGGTGGAACACCATTTACTTCTAACGTAGGAGGCTTTGGTAATAGCGGCATGAGTAGTCAAGGCAGCCAACCACCTTCAACTTCACTTGGTGGTGCTAATGGTGGTCAACAAACGATGTAAGTTTATGGGTTAAAAGTTAAAACATCAATTATTGGCATTTTTACATATCACAAATAATACACTTAAATTAATATGTTTTCTGTTATTGAGTGATGAAGTCTATGTAATTTGCTTTCTCTTCCAACCTGTTCTGATAGAACAGGACCCTGTATCTGCAGTTCCGGACCCTCCTTTATGTTTAAGCCTTAGATTTTGTGTTATATGCCAATGGCCGTATTGGCACTGCTTTGTTACTGATCCTCTTGTTTGCAGAGGCAGCAATCCTCAAGAGATATCAGCTACAGATACAGGCATTTGGTTGAAAGAGGAGTGGAAACTTGGAGAGGTAAAACCTGATATCACCATTTGTTTTGATCCTTGAACTGAAAATCTTATATTATAGTAGCAGTTCTAGTTTTATTCATGTAAGCTGGCAAAAGTATCCTTATGCTTGTCAAGTGGATCAAGTGGGATATGAAGTTGTAAATTTGTTGACGCTATTACAAGAAAATTCCCAATGCACTGTTTTGATTCTTGTGAGAAGCATCTGTTATTTTTGGATATTGTTCGTAGGATCATAGGAAAGATGGATGCCCTCTGTCTGGAATTAATAGTTATGTACTCTGTATATAGGATGATGGAAAAAGTTATTGTTTTTCCGATAATAGTAATAGTGCATGATTGGATTCCTTACATTCAACTTGCTAAAAATTAAGTGACAGTGGAGAGGATATATTAATAACTCAATTATTTTAATGGGTTAGTAGGTAAAATCATATCAGATTGTTAATCATGGGAGGTTGGGACAATATAGCAACTATTTCGGGAGAGTATCTTGCTGAAAAGTGTTATTAACCTTGAACGTTGCCTCTCTGTATCTTGCAATATTATGAATTATTTGGTTTCAGATTCTTAAGCACAGGATCCTTTTATCATGTATAACCAACCCTTTTAACTTCCTTCTCTACAGATTCCAGAGCAACCACCTCCGGCAGCTTATGTTAAATAGAGTATTTCATGTCTTAGAGTGGTATTTGGACCCAACTTTGCCAGTACCAGAGAGGAGACTTGCCAATGATCGTTGCTGTCTAAAGTAGACAGCAAAAGCTGTTATGATGTTGAGTTCAGTTCATAGACTGTATTCTTTTGTTTATCTGAATGTAGTCGTGAACATTTTGTTTTCCACATGGATTGGTAGATTAGTGTCTTAACACACACATAGGGAGTACATGTGAATAGATTTTAGGGAACTGACATCAACTTATATGAGGATTCGGCCTTTTGTATGGTCGGTGATAGTATCATATTACAATGTTAATGTAGATTGATAGAACTGAGTGCTTTCCTCTTTTGGTTTCTCATAGACTTGTGGGGTTTTCCCAGGAAGTTAAACTTGTTAATTGTTATCCCCTCTTCTCTTTTGGCTCTTTGTTT
mRNA sequence
GGCCCGGTCCAACGCCAATGTTATTGATTTGAACCCAATTAAAAATGGAGTGAAAAGAACGCCAACTCGGTGAACTGCATATGGTGATCGAATTCCTGGCAACACACACTATAAACTCTCAATTCGTCCGGCAATCCTCTCTATCTGTTCGAGTGTTGCAGCAAAAGAGAAGTATTCCTCAAATATGCCGCTAAAGCCAGAACCTTGCAGAAATTTTCTGCGTGGACATTGCCGTTTTGGTGATAAATGTAACTTTCCGCACGTCACTCAGCAGCAGCCAAAGCCCAATGCTTTTGGCTTTGGCTCGCAGACTGCCACACAATCCCAACAGCAGAAGCCCAATCCTTTTGGTTTTGGTGTCCAGAACTCGACCAGTGGAGCTCCTAATTCTGGAGGATTTGGGCAATCTAATTCACAAAAGCCTTTTGAGAATAAATGGGTCCGCCCAGAGAAGCCCCAGAATCAGTCTCAGGCTGCTAGTCACAACTGCACAGATCCTGATTCTTGCAAACGCCAAATCGCGGACGATTTTCAGCATGAGAGTCCACTTTGGAAGCTAACTTGTTATGGACATAATAAACATTTGCCATGTGATATTTCCGGTGATATTAGCTATGAAGAATTGCGGGCAGTTGCATATGATGATGCAAAACGTGGCACTAGTTTGCCGTCAATTGTTGAAAGAGAGAGGGGTTTAGTGAATTCTAAGAAGATTGAATTTCAAAACTTGCTTAATAATCCTTACACAAAGCACACAGTTCCTGCTCAGTCCAACCAAAGTCCATTTCCTGCAGCGGTTGCCAGCTCACCATCACCGACACCTCAAAATAACGGCTTCTCTTCATTTGCAGCTTCAAATCAACAGAATCCACCACCTAACCTGTCATTTGGTGTAAGGCCTTCTACTCCATCAAATAGTAGTGGTTTTGGGCAATTCCAAGGTCCTGGTCAATTTACGACAAATACTGTTCCTTCAGGTCCCTTTAGCACTCCAGTTCCTGCACAAACAGGGGGAAATCTGCCTGCTTCCAACAATAGTGGCTTTGGCAATAGTGGCTTCGGCAATAATGGATTGAAAAGTGGTTCCTTTGGCACTCAAGTTCCTGCACAAACAGGGGGAAATTTTGCGCCAAACATAGGTGGCTTCGGTAACAGTGCCATGAGTGGTCAAGGTAGTCTATTCCCTGCACAAACTGGGGGGAATACCTTCACTCCAAATTCATTAGGCTTTGGCAATGTTGGGATGACTAGTCAAGTCCCAGTGCAAATAGGTGGAACACCATTTACTTCTAACGTAGGAGGCTTTGGTAATAGCGGCATGAGTAGTCAAGGCAGCCAACCACCTTCAACTTCACTTGGTGGTGCTAATGGTGGTCAACAAACGATAGGCAGCAATCCTCAAGAGATATCAGCTACAGATACAGGCATTTGGTTGAAAGAGGAGTGGAAACTTGGAGAGATTCCAGAGCAACCACCTCCGGCAGCTTATGTTAAATAGAGTATTTCATGTCTTAGAGTGGTATTTGGACCCAACTTTGCCAGTACCAGAGAGGAGACTTGCCAATGATCGTTGCTGTCTAAAGTAGACAGCAAAAGCTGTTATGATGTTGAGTTCAGTTCATAGACTGTATTCTTTTGTTTATCTGAATGTAGTCGTGAACATTTTGTTTTCCACATGGATTGGTAGATTAGTGTCTTAACACACACATAGGGAGTACATGTGAATAGATTTTAGGGAACTGACATCAACTTATATGAGGATTCGGCCTTTTGTATGGTCGGTGATAGTATCATATTACAATGTTAATGTAGATTGATAGAACTGAGTGCTTTCCTCTTTTGGTTTCTCATAGACTTGTGGGGTTTTCCCAGGAAGTTAAACTTGTTAATTGTTATCCCCTCTTCTCTTTTGGCTCTTTGTTT
Coding sequence (CDS)
ATGCCGCTAAAGCCAGAACCTTGCAGAAATTTTCTGCGTGGACATTGCCGTTTTGGTGATAAATGTAACTTTCCGCACGTCACTCAGCAGCAGCCAAAGCCCAATGCTTTTGGCTTTGGCTCGCAGACTGCCACACAATCCCAACAGCAGAAGCCCAATCCTTTTGGTTTTGGTGTCCAGAACTCGACCAGTGGAGCTCCTAATTCTGGAGGATTTGGGCAATCTAATTCACAAAAGCCTTTTGAGAATAAATGGGTCCGCCCAGAGAAGCCCCAGAATCAGTCTCAGGCTGCTAGTCACAACTGCACAGATCCTGATTCTTGCAAACGCCAAATCGCGGACGATTTTCAGCATGAGAGTCCACTTTGGAAGCTAACTTGTTATGGACATAATAAACATTTGCCATGTGATATTTCCGGTGATATTAGCTATGAAGAATTGCGGGCAGTTGCATATGATGATGCAAAACGTGGCACTAGTTTGCCGTCAATTGTTGAAAGAGAGAGGGGTTTAGTGAATTCTAAGAAGATTGAATTTCAAAACTTGCTTAATAATCCTTACACAAAGCACACAGTTCCTGCTCAGTCCAACCAAAGTCCATTTCCTGCAGCGGTTGCCAGCTCACCATCACCGACACCTCAAAATAACGGCTTCTCTTCATTTGCAGCTTCAAATCAACAGAATCCACCACCTAACCTGTCATTTGGTGTAAGGCCTTCTACTCCATCAAATAGTAGTGGTTTTGGGCAATTCCAAGGTCCTGGTCAATTTACGACAAATACTGTTCCTTCAGGTCCCTTTAGCACTCCAGTTCCTGCACAAACAGGGGGAAATCTGCCTGCTTCCAACAATAGTGGCTTTGGCAATAGTGGCTTCGGCAATAATGGATTGAAAAGTGGTTCCTTTGGCACTCAAGTTCCTGCACAAACAGGGGGAAATTTTGCGCCAAACATAGGTGGCTTCGGTAACAGTGCCATGAGTGGTCAAGGTAGTCTATTCCCTGCACAAACTGGGGGGAATACCTTCACTCCAAATTCATTAGGCTTTGGCAATGTTGGGATGACTAGTCAAGTCCCAGTGCAAATAGGTGGAACACCATTTACTTCTAACGTAGGAGGCTTTGGTAATAGCGGCATGAGTAGTCAAGGCAGCCAACCACCTTCAACTTCACTTGGTGGTGCTAATGGTGGTCAACAAACGATAGGCAGCAATCCTCAAGAGATATCAGCTACAGATACAGGCATTTGGTTGAAAGAGGAGTGGAAACTTGGAGAGATTCCAGAGCAACCACCTCCGGCAGCTTATGTTAAATAG
Protein sequence
MPLKPEPCRNFLRGHCRFGDKCNFPHVTQQQPKPNAFGFGSQTATQSQQQKPNPFGFGVQNSTSGAPNSGGFGQSNSQKPFENKWVRPEKPQNQSQAASHNCTDPDSCKRQIADDFQHESPLWKLTCYGHNKHLPCDISGDISYEELRAVAYDDAKRGTSLPSIVERERGLVNSKKIEFQNLLNNPYTKHTVPAQSNQSPFPAAVASSPSPTPQNNGFSSFAASNQQNPPPNLSFGVRPSTPSNSSGFGQFQGPGQFTTNTVPSGPFSTPVPAQTGGNLPASNNSGFGNSGFGNNGLKSGSFGTQVPAQTGGNFAPNIGGFGNSAMSGQGSLFPAQTGGNTFTPNSLGFGNVGMTSQVPVQIGGTPFTSNVGGFGNSGMSSQGSQPPSTSLGGANGGQQTIGSNPQEISATDTGIWLKEEWKLGEIPEQPPPAAYVK
Homology
BLAST of Spo02995.1 vs. NCBI nr
Match:
gi|902228915|gb|KNA21326.1| (hypothetical protein SOVF_044430 [Spinacia oleracea])
HSP 1 Score: 880.9 bits (2275), Expect = 8.900e-253
Identity = 437/437 (100.00%), Postives = 437/437 (100.00%), Query Frame = 1
Query: 1 MPLKPEPCRNFLRGHCRFGDKCNFPHVTQQQPKPNAFGFGSQTATQSQQQKPNPFGFGVQ 60
MPLKPEPCRNFLRGHCRFGDKCNFPHVTQQQPKPNAFGFGSQTATQSQQQKPNPFGFGVQ
Sbjct: 1 MPLKPEPCRNFLRGHCRFGDKCNFPHVTQQQPKPNAFGFGSQTATQSQQQKPNPFGFGVQ 60
Query: 61 NSTSGAPNSGGFGQSNSQKPFENKWVRPEKPQNQSQAASHNCTDPDSCKRQIADDFQHES 120
NSTSGAPNSGGFGQSNSQKPFENKWVRPEKPQNQSQAASHNCTDPDSCKRQIADDFQHES
Sbjct: 61 NSTSGAPNSGGFGQSNSQKPFENKWVRPEKPQNQSQAASHNCTDPDSCKRQIADDFQHES 120
Query: 121 PLWKLTCYGHNKHLPCDISGDISYEELRAVAYDDAKRGTSLPSIVERERGLVNSKKIEFQ 180
PLWKLTCYGHNKHLPCDISGDISYEELRAVAYDDAKRGTSLPSIVERERGLVNSKKIEFQ
Sbjct: 121 PLWKLTCYGHNKHLPCDISGDISYEELRAVAYDDAKRGTSLPSIVERERGLVNSKKIEFQ 180
Query: 181 NLLNNPYTKHTVPAQSNQSPFPAAVASSPSPTPQNNGFSSFAASNQQNPPPNLSFGVRPS 240
NLLNNPYTKHTVPAQSNQSPFPAAVASSPSPTPQNNGFSSFAASNQQNPPPNLSFGVRPS
Sbjct: 181 NLLNNPYTKHTVPAQSNQSPFPAAVASSPSPTPQNNGFSSFAASNQQNPPPNLSFGVRPS 240
Query: 241 TPSNSSGFGQFQGPGQFTTNTVPSGPFSTPVPAQTGGNLPASNNSGFGNSGFGNNGLKSG 300
TPSNSSGFGQFQGPGQFTTNTVPSGPFSTPVPAQTGGNLPASNNSGFGNSGFGNNGLKSG
Sbjct: 241 TPSNSSGFGQFQGPGQFTTNTVPSGPFSTPVPAQTGGNLPASNNSGFGNSGFGNNGLKSG 300
Query: 301 SFGTQVPAQTGGNFAPNIGGFGNSAMSGQGSLFPAQTGGNTFTPNSLGFGNVGMTSQVPV 360
SFGTQVPAQTGGNFAPNIGGFGNSAMSGQGSLFPAQTGGNTFTPNSLGFGNVGMTSQVPV
Sbjct: 301 SFGTQVPAQTGGNFAPNIGGFGNSAMSGQGSLFPAQTGGNTFTPNSLGFGNVGMTSQVPV 360
Query: 361 QIGGTPFTSNVGGFGNSGMSSQGSQPPSTSLGGANGGQQTIGSNPQEISATDTGIWLKEE 420
QIGGTPFTSNVGGFGNSGMSSQGSQPPSTSLGGANGGQQTIGSNPQEISATDTGIWLKEE
Sbjct: 361 QIGGTPFTSNVGGFGNSGMSSQGSQPPSTSLGGANGGQQTIGSNPQEISATDTGIWLKEE 420
Query: 421 WKLGEIPEQPPPAAYVK 438
WKLGEIPEQPPPAAYVK
Sbjct: 421 WKLGEIPEQPPPAAYVK 437
BLAST of Spo02995.1 vs. NCBI nr
Match:
gi|731318280|ref|XP_010669635.1| (PREDICTED: zinc finger CCCH domain-containing protein 16 isoform X1 [Beta vulgaris subsp. vulgaris])
HSP 1 Score: 555.1 bits (1429), Expect = 1.100e-154
Identity = 311/464 (67.03%), Postives = 338/464 (72.84%), Query Frame = 1
Query: 1 MPLKPEPCRNFLRGHCRFGDKCNFPHVTQQQPKPNAFGFGSQTATQSQQQKPNPFGFGVQ 60
MPLK EPCRNF RG+CRFGD+C F HVTQQQPK N FGFGSQTAT QQQKPNPFGFGVQ
Sbjct: 1 MPLKKEPCRNFQRGNCRFGDRCKFLHVTQQQPKSNVFGFGSQTATPFQQQKPNPFGFGVQ 60
Query: 61 NS--TSGAPNSGGFGQSNSQKPFENKWVRPEKPQNQSQAASHNCTDPDSCKRQIADDFQH 120
NS TSGAP SGG GQ NS KPFENKW R E+P NQSQAASHNCTDPDSCKRQ +D+QH
Sbjct: 61 NSGPTSGAPFSGGHGQFNSSKPFENKWTRSEQPTNQSQAASHNCTDPDSCKRQFVEDYQH 120
Query: 121 ESPLWKLTCYGHNKHLPCDISGDISYEELRAVAYDDAKRGTSLPSIVERERGLVNSKKIE 180
E PLWKLTCYGH K+LPCDI+GDIS+EELRAVAYDDAKRGTSLPSIVERER LVNSK E
Sbjct: 121 ERPLWKLTCYGHYKYLPCDITGDISFEELRAVAYDDAKRGTSLPSIVERERSLVNSKSSE 180
Query: 181 FQNLLNNPYTKH-TVPAQSNQSPFPAAVASSPSPTPQNNGFSSFAASNQQNPPPNLSFGV 240
+ LL NPYTK+ T PAQ NQSPFPA +SPSPTPQN G S F+ S+QQNP PNLSFGV
Sbjct: 181 YDKLLQNPYTKNATPPAQLNQSPFPALNVNSPSPTPQNIGPSLFSTSSQQNPSPNLSFGV 240
Query: 241 RPSTPSNSSGFGQFQG-PGQFTTNTVPSGPFSTPVPAQTGGNLPASNNSGFGNSGFGNNG 300
RPSTP N SGFGQFQG PG +N PSG F T P QTGG+L A N GFGNSG + G
Sbjct: 241 RPSTPPN-SGFGQFQGHPGPAGSNISPSGSFGTQAPTQTGGSLLAGNVGGFGNSGMNSQG 300
Query: 301 LKSGSFGTQVPAQTGGNFAPNIGGFGNSAMSGQGSLFPAQTGGNTFTPNSLGFGNVGMTS 360
G+ G + Q GN N GFGNS M+ G+ FPAQTGG TFTP++ N+G++S
Sbjct: 301 F--GNSGMNI--QGFGNSGMNSQGFGNSGMNSPGTHFPAQTGGTTFTPSAGSVSNIGVSS 360
Query: 361 QVP---VQIGGTPFTSNVGGFGNSGMSSQGSQPPSTS--------------------LGG 420
Q P VQ G P S VGGF M SQGSQ PSTS L G
Sbjct: 361 QAPQVSVQTSGVPNASFVGGF---AMHSQGSQLPSTSSVSLHTQTSAVNNSSTLFNGLSG 420
Query: 421 ANGGQQTIGSNPQEISATDTGIWLKEEWKLGEIPEQPPPAAYVK 438
NG QQTI +NPQE SATD IWLKEEWKLGEIPEQPPP AYV+
Sbjct: 421 PNGVQQTIDNNPQEASATDASIWLKEEWKLGEIPEQPPPEAYVR 456
BLAST of Spo02995.1 vs. NCBI nr
Match:
gi|731318284|ref|XP_010669637.1| (PREDICTED: zinc finger CCCH domain-containing protein 16 isoform X2 [Beta vulgaris subsp. vulgaris])
HSP 1 Score: 545.4 bits (1404), Expect = 8.900e-152
Identity = 305/464 (65.73%), Postives = 329/464 (70.91%), Query Frame = 1
Query: 1 MPLKPEPCRNFLRGHCRFGDKCNFPHVTQQQPKPNAFGFGSQTATQSQQQKPNPFGFGVQ 60
MPLK EPCRNF RG+CRFGD+C F HVTQQQPK N FGFGSQTAT QQQKPNPFGFGVQ
Sbjct: 1 MPLKKEPCRNFQRGNCRFGDRCKFLHVTQQQPKSNVFGFGSQTATPFQQQKPNPFGFGVQ 60
Query: 61 NS--TSGAPNSGGFGQSNSQKPFENKWVRPEKPQNQSQAASHNCTDPDSCKRQIADDFQH 120
NS TSGAP SGG GQ NS KPFENKW R E+P NQSQAASHNCTDPDSCKRQ +D+QH
Sbjct: 61 NSGPTSGAPFSGGHGQFNSSKPFENKWTRSEQPTNQSQAASHNCTDPDSCKRQFVEDYQH 120
Query: 121 ESPLWKLTCYGHNKHLPCDISGDISYEELRAVAYDDAKRGTSLPSIVERERGLVNSKKIE 180
E PLWKLTCYGH K+LPCDI+GDIS+EELRAVAYDDAKRGTSLPSIVERER LVNSK E
Sbjct: 121 ERPLWKLTCYGHYKYLPCDITGDISFEELRAVAYDDAKRGTSLPSIVERERSLVNSKSSE 180
Query: 181 FQNLLNNPYTKH-TVPAQSNQSPFPAAVASSPSPTPQNNGFSSFAASNQQNPPPNLSFGV 240
+ LL NPYTK+ T PAQ NQSPFPA +SPSPTPQN G S F+ S+QQNP PNLSFGV
Sbjct: 181 YDKLLQNPYTKNATPPAQLNQSPFPALNVNSPSPTPQNIGPSLFSTSSQQNPSPNLSFGV 240
Query: 241 RPSTPSNSSGFGQFQG-PGQFTTNTVPSGPFSTPVPAQTGGNLPASNNSGFGNSGFGNNG 300
RPSTP N SGFGQFQG PG +N PSG F T P QTGG+L A N GFGNSG
Sbjct: 241 RPSTPPN-SGFGQFQGHPGPAGSNISPSGSFGTQAPTQTGGSLLAGNVGGFGNSGM---- 300
Query: 301 LKSGSFGTQVPAQTGGNFAPNIGGFGNSAMSGQGSLFPAQTGGNTFTPNSLGFGNVGMTS 360
N GFGNS M+ G+ FPAQTGG TFTP++ N+G++S
Sbjct: 301 --------------------NSQGFGNSGMNSPGTHFPAQTGGTTFTPSAGSVSNIGVSS 360
Query: 361 QVP---VQIGGTPFTSNVGGFGNSGMSSQGSQPPSTS--------------------LGG 420
Q P VQ G P S VGGF M SQGSQ PSTS L G
Sbjct: 361 QAPQVSVQTSGVPNASFVGGF---AMHSQGSQLPSTSSVSLHTQTSAVNNSSTLFNGLSG 420
Query: 421 ANGGQQTIGSNPQEISATDTGIWLKEEWKLGEIPEQPPPAAYVK 438
NG QQTI +NPQE SATD IWLKEEWKLGEIPEQPPP AYV+
Sbjct: 421 PNGVQQTIDNNPQEASATDASIWLKEEWKLGEIPEQPPPEAYVR 436
BLAST of Spo02995.1 vs. NCBI nr
Match:
gi|731403355|ref|XP_010655028.1| (PREDICTED: zinc finger CCCH domain-containing protein 16 isoform X1 [Vitis vinifera])
HSP 1 Score: 298.5 bits (763), Expect = 1.900e-77
Identity = 201/454 (44.27%), Postives = 249/454 (54.85%), Query Frame = 1
Query: 1 MPLKPEPCRNFLRGHCRFGDKCNFPHVTQQQPKPNAFGFGSQTA-----TQSQQQKPNPF 60
M K EPCRNF RG C++G++C F HVTQQQPKPN GFG+QT+ T QQQ+ NPF
Sbjct: 1 MNRKKEPCRNFQRGSCQYGERCKFLHVTQQQPKPNVSGFGAQTSSNFQHTNVQQQRSNPF 60
Query: 61 GFGVQNSTSGAPNSGGFGQSNSQKPFENKWVR------------PEKPQNQSQAASHNCT 120
GFGVQ+++ S + N KPFENKW R + NQ Q A+H CT
Sbjct: 61 GFGVQSNSLPKGTSDFGSKQNHFKPFENKWTRFSPLTTGSSSSSSRQSDNQVQPANHKCT 120
Query: 121 DPDSCKRQIADDFQHESPLWKLTCYGHNKHLPCDISGDISYEELRAVAYDDAKRGTSLPS 180
DP+SCKRQI +DF+HE P+WKLTCYGH K PCDI GDISYEELR AYDDA RG SL S
Sbjct: 121 DPESCKRQIVEDFEHERPIWKLTCYGHCKSAPCDIVGDISYEELRLAAYDDAGRGLSLQS 180
Query: 181 IVERERGLVNSKKIEFQNLLNNPYTKHTVPAQSNQSPFPAAVASSPSPTPQNNGFSSFAA 240
IVERER L+NSK IEF +LL PY + QSP P AS S T QNN S ++
Sbjct: 181 IVERERSLLNSKLIEFDSLLRKPYAAPPNSTLAIQSPAPNPDAS--SLTAQNNTPPSASS 240
Query: 241 SNQQNPPPNLSFGVRPSTPSNSSGFGQFQGPGQFTTNTVPSGPFSTPVPAQTGGNLPASN 300
+Q + PN+ FG RPSTPSN++ FGQ QF++ T SG F T N
Sbjct: 241 FSQLSTSPNMGFGTRPSTPSNNA-FGQ-SNSIQFSSQT--SGAFGT-------------N 300
Query: 301 NSGFGNSGFGNNGLKSGSFGTQVPAQTGGNFAP-NIGGFGNSAMSGQGSLFPAQTGGNTF 360
N FGN +GSFG+Q+P Q G+ P N GF ++ MS F
Sbjct: 301 NLAFGN---------AGSFGSQLPVQMHGSPLPSNTAGFSHNNMSAGSKAFSPPAA---- 360
Query: 361 TPNSLGFGNVGMTSQVPVQIGGTPFTSNVGGFGNSGMSSQGSQPPSTSLGGANGGQQTIG 420
+P + F N +Q P+ GT NS S++ + +
Sbjct: 361 SPQIISFAN----NQSPILSSGT----------NSMFSAESTM------------HAQLE 396
Query: 421 SNPQEISATDTGIWLKEEWKLGEIPEQPPPAAYV 437
++ + D IWLKEEW GEIPE+ PP A+V
Sbjct: 421 KMQRDNFSGDMSIWLKEEWNPGEIPEEEPPDAFV 396
BLAST of Spo02995.1 vs. NCBI nr
Match:
gi|970069598|ref|XP_015061935.1| (PREDICTED: zinc finger CCCH domain-containing protein 16 [Solanum pennellii])
HSP 1 Score: 287.0 bits (733), Expect = 5.700e-74
Identity = 201/466 (43.13%), Postives = 253/466 (54.29%), Query Frame = 1
Query: 1 MPLKPEPCRNFLRGHCRFGDKCNFPHVTQQQPKPNAFGFGSQT----ATQSQQQKPNPFG 60
MP + EPCRNF+RG C++G++C F H QQQPKPN FGFG Q+ +T QQ K NPFG
Sbjct: 1 MPPRKEPCRNFMRGSCQYGERCKFLHAAQQQPKPNPFGFGIQSTNFQSTNMQQTKSNPFG 60
Query: 61 FGVQNSTSGAPNSGGFG-QSNSQKPFENKWVRP--------EKPQNQSQAASHNCTDPDS 120
FGVQ S S S FG + N KPFENKW R + NQ A +H CTD +S
Sbjct: 61 FGVQ-SNSQPRGSSDFGLKQNQYKPFENKWTRSATTNSSSSRQTDNQPVAPNHTCTDAES 120
Query: 121 CKRQIADDFQHESPLWKLTCYGHNKHLPCDISGDISYEELRAVAYDDAKRGTSLPSIVER 180
C+RQI +DF +E PLW LTCYGH K+ PCDI+GD+SYEELRAVAYDDAKRG SL SIVER
Sbjct: 121 CRRQIVEDFNNEKPLWLLTCYGHRKNGPCDITGDVSYEELRAVAYDDAKRGQSLMSIVER 180
Query: 181 ERGLVNSKKIEFQNLLNNPYTKHTVPAQSNQSPFPAAV------ASSPSPTPQNNGFSSF 240
ER LVNSK EF+NLL NPY + A + QSPFP A A SP P + S F
Sbjct: 181 ERSLVNSKVAEFENLLRNPYASTSTSALNAQSPFPGATPSASLSAQSPFPGAAPSASSPF 240
Query: 241 ------AASNQQNP----PPNLSFGVRPSTPSNSSGFGQFQGPGQFTTNTVPSGPFSTPV 300
A+ + Q+P PN + S P ++S F Q T+T P+ F P
Sbjct: 241 PGAAPNASLSAQSPFPGAAPNALSSAQSSFPPSASSFSQLGTILNTGTSTPPTSTFGQP- 300
Query: 301 PAQTGGNLPASNNSGFGNSGFGNNGLK-SGSFGTQVPAQTGGNFAPNIGGFGNSAMSGQG 360
+ G + SN+SG FGN S FGTQVP Q+ N S
Sbjct: 301 -SMPGNSFKTSNSSGANAFSFGNTSTSGSFGFGTQVPTQSYQN------------PSTPS 360
Query: 361 SLFPAQTGGNTFTPNSLGFGNVGMTSQVPVQIGGTPFTSNVGGFGNSGMSSQGSQPPSTS 420
++F A +G N F+ ++ +P +N G G +SQG P +TS
Sbjct: 361 NIF-ASSGRNLFSTSTT-----------------SPHFANPSG-GQLPTTSQGLFPVTTS 420
Query: 421 LGGANGGQQTIGSNPQEISATDTGIWLKEEWKLGEIPEQPPPAAYV 437
N T ++ ++ S D IW K+EWK+GEIPE+ PP Y+
Sbjct: 421 PVSIN---LTNTASTEDFSG-DNSIWTKKEWKIGEIPEEAPPDRYI 428
BLAST of Spo02995.1 vs. UniProtKB/TrEMBL
Match:
A0A0K9RP59_SPIOL (Uncharacterized protein OS=Spinacia oleracea GN=SOVF_044430 PE=4 SV=1)
HSP 1 Score: 880.9 bits (2275), Expect = 6.200e-253
Identity = 437/437 (100.00%), Postives = 437/437 (100.00%), Query Frame = 1
Query: 1 MPLKPEPCRNFLRGHCRFGDKCNFPHVTQQQPKPNAFGFGSQTATQSQQQKPNPFGFGVQ 60
MPLKPEPCRNFLRGHCRFGDKCNFPHVTQQQPKPNAFGFGSQTATQSQQQKPNPFGFGVQ
Sbjct: 1 MPLKPEPCRNFLRGHCRFGDKCNFPHVTQQQPKPNAFGFGSQTATQSQQQKPNPFGFGVQ 60
Query: 61 NSTSGAPNSGGFGQSNSQKPFENKWVRPEKPQNQSQAASHNCTDPDSCKRQIADDFQHES 120
NSTSGAPNSGGFGQSNSQKPFENKWVRPEKPQNQSQAASHNCTDPDSCKRQIADDFQHES
Sbjct: 61 NSTSGAPNSGGFGQSNSQKPFENKWVRPEKPQNQSQAASHNCTDPDSCKRQIADDFQHES 120
Query: 121 PLWKLTCYGHNKHLPCDISGDISYEELRAVAYDDAKRGTSLPSIVERERGLVNSKKIEFQ 180
PLWKLTCYGHNKHLPCDISGDISYEELRAVAYDDAKRGTSLPSIVERERGLVNSKKIEFQ
Sbjct: 121 PLWKLTCYGHNKHLPCDISGDISYEELRAVAYDDAKRGTSLPSIVERERGLVNSKKIEFQ 180
Query: 181 NLLNNPYTKHTVPAQSNQSPFPAAVASSPSPTPQNNGFSSFAASNQQNPPPNLSFGVRPS 240
NLLNNPYTKHTVPAQSNQSPFPAAVASSPSPTPQNNGFSSFAASNQQNPPPNLSFGVRPS
Sbjct: 181 NLLNNPYTKHTVPAQSNQSPFPAAVASSPSPTPQNNGFSSFAASNQQNPPPNLSFGVRPS 240
Query: 241 TPSNSSGFGQFQGPGQFTTNTVPSGPFSTPVPAQTGGNLPASNNSGFGNSGFGNNGLKSG 300
TPSNSSGFGQFQGPGQFTTNTVPSGPFSTPVPAQTGGNLPASNNSGFGNSGFGNNGLKSG
Sbjct: 241 TPSNSSGFGQFQGPGQFTTNTVPSGPFSTPVPAQTGGNLPASNNSGFGNSGFGNNGLKSG 300
Query: 301 SFGTQVPAQTGGNFAPNIGGFGNSAMSGQGSLFPAQTGGNTFTPNSLGFGNVGMTSQVPV 360
SFGTQVPAQTGGNFAPNIGGFGNSAMSGQGSLFPAQTGGNTFTPNSLGFGNVGMTSQVPV
Sbjct: 301 SFGTQVPAQTGGNFAPNIGGFGNSAMSGQGSLFPAQTGGNTFTPNSLGFGNVGMTSQVPV 360
Query: 361 QIGGTPFTSNVGGFGNSGMSSQGSQPPSTSLGGANGGQQTIGSNPQEISATDTGIWLKEE 420
QIGGTPFTSNVGGFGNSGMSSQGSQPPSTSLGGANGGQQTIGSNPQEISATDTGIWLKEE
Sbjct: 361 QIGGTPFTSNVGGFGNSGMSSQGSQPPSTSLGGANGGQQTIGSNPQEISATDTGIWLKEE 420
Query: 421 WKLGEIPEQPPPAAYVK 438
WKLGEIPEQPPPAAYVK
Sbjct: 421 WKLGEIPEQPPPAAYVK 437
BLAST of Spo02995.1 vs. UniProtKB/TrEMBL
Match:
A0A0J8CV93_BETVU (Uncharacterized protein OS=Beta vulgaris subsp. vulgaris GN=BVRB_2g036110 PE=4 SV=1)
HSP 1 Score: 555.1 bits (1429), Expect = 7.800e-155
Identity = 311/464 (67.03%), Postives = 338/464 (72.84%), Query Frame = 1
Query: 1 MPLKPEPCRNFLRGHCRFGDKCNFPHVTQQQPKPNAFGFGSQTATQSQQQKPNPFGFGVQ 60
MPLK EPCRNF RG+CRFGD+C F HVTQQQPK N FGFGSQTAT QQQKPNPFGFGVQ
Sbjct: 1 MPLKKEPCRNFQRGNCRFGDRCKFLHVTQQQPKSNVFGFGSQTATPFQQQKPNPFGFGVQ 60
Query: 61 NS--TSGAPNSGGFGQSNSQKPFENKWVRPEKPQNQSQAASHNCTDPDSCKRQIADDFQH 120
NS TSGAP SGG GQ NS KPFENKW R E+P NQSQAASHNCTDPDSCKRQ +D+QH
Sbjct: 61 NSGPTSGAPFSGGHGQFNSSKPFENKWTRSEQPTNQSQAASHNCTDPDSCKRQFVEDYQH 120
Query: 121 ESPLWKLTCYGHNKHLPCDISGDISYEELRAVAYDDAKRGTSLPSIVERERGLVNSKKIE 180
E PLWKLTCYGH K+LPCDI+GDIS+EELRAVAYDDAKRGTSLPSIVERER LVNSK E
Sbjct: 121 ERPLWKLTCYGHYKYLPCDITGDISFEELRAVAYDDAKRGTSLPSIVERERSLVNSKSSE 180
Query: 181 FQNLLNNPYTKH-TVPAQSNQSPFPAAVASSPSPTPQNNGFSSFAASNQQNPPPNLSFGV 240
+ LL NPYTK+ T PAQ NQSPFPA +SPSPTPQN G S F+ S+QQNP PNLSFGV
Sbjct: 181 YDKLLQNPYTKNATPPAQLNQSPFPALNVNSPSPTPQNIGPSLFSTSSQQNPSPNLSFGV 240
Query: 241 RPSTPSNSSGFGQFQG-PGQFTTNTVPSGPFSTPVPAQTGGNLPASNNSGFGNSGFGNNG 300
RPSTP N SGFGQFQG PG +N PSG F T P QTGG+L A N GFGNSG + G
Sbjct: 241 RPSTPPN-SGFGQFQGHPGPAGSNISPSGSFGTQAPTQTGGSLLAGNVGGFGNSGMNSQG 300
Query: 301 LKSGSFGTQVPAQTGGNFAPNIGGFGNSAMSGQGSLFPAQTGGNTFTPNSLGFGNVGMTS 360
G+ G + Q GN N GFGNS M+ G+ FPAQTGG TFTP++ N+G++S
Sbjct: 301 F--GNSGMNI--QGFGNSGMNSQGFGNSGMNSPGTHFPAQTGGTTFTPSAGSVSNIGVSS 360
Query: 361 QVP---VQIGGTPFTSNVGGFGNSGMSSQGSQPPSTS--------------------LGG 420
Q P VQ G P S VGGF M SQGSQ PSTS L G
Sbjct: 361 QAPQVSVQTSGVPNASFVGGF---AMHSQGSQLPSTSSVSLHTQTSAVNNSSTLFNGLSG 420
Query: 421 ANGGQQTIGSNPQEISATDTGIWLKEEWKLGEIPEQPPPAAYVK 438
NG QQTI +NPQE SATD IWLKEEWKLGEIPEQPPP AYV+
Sbjct: 421 PNGVQQTIDNNPQEASATDASIWLKEEWKLGEIPEQPPPEAYVR 456
BLAST of Spo02995.1 vs. UniProtKB/TrEMBL
Match:
K4DC27_SOLLC (Uncharacterized protein OS=Solanum lycopersicum PE=4 SV=1)
HSP 1 Score: 286.2 bits (731), Expect = 6.800e-74
Identity = 199/465 (42.80%), Postives = 253/465 (54.41%), Query Frame = 1
Query: 1 MPLKPEPCRNFLRGHCRFGDKCNFPHVTQQQPKPNAFGFGSQT----ATQSQQQKPNPFG 60
MP + EPCRNF+RG C++G++C F H QQQPKPN FGFGSQ+ +T QQ K NPFG
Sbjct: 1 MPPRKEPCRNFMRGSCQYGERCKFLHAAQQQPKPNPFGFGSQSTNFQSTNMQQTKSNPFG 60
Query: 61 FGVQNSTSGAPNSGGFGQSNSQKPFENKWVRP--------EKPQNQSQAASHNCTDPDSC 120
FGVQ+++ +S + N KPFENKW R + NQ A +H CTD +SC
Sbjct: 61 FGVQSNSQPRGSSDLGLKQNQYKPFENKWTRSATTNSSSSRQTDNQPVAPNHTCTDAESC 120
Query: 121 KRQIADDFQHESPLWKLTCYGHNKHLPCDISGDISYEELRAVAYDDAKRGTSLPSIVERE 180
+RQI +DF +E PLW LTCYGH K+ PCDI+GD+SYEELRAVAYDDAKRG SL SIVERE
Sbjct: 121 RRQIVEDFNNEKPLWLLTCYGHRKNGPCDITGDVSYEELRAVAYDDAKRGQSLMSIVERE 180
Query: 181 RGLVNSKKIEFQNLLNNPYTKHTVPAQSNQSPFPAAV------ASSPSPTPQNNGFSSF- 240
R VNSK EF+NLL NPY + A + QSPFP A A SP P + S F
Sbjct: 181 RSQVNSKVAEFENLLQNPYASSSTSALNAQSPFPGATPSASLSAQSPFPGATPSASSPFP 240
Query: 241 -----AASNQQNP----PPNLSFGVRPSTPSNSSGFGQFQGPGQFTTNTVPSGPFSTPVP 300
A+ + Q+P PN + S P ++S F Q T+T P+ F P
Sbjct: 241 GAAPNASLSAQSPFPGAAPNALSSAQSSFPPSASSFSQLGTILNTGTSTPPTSTFGQ--P 300
Query: 301 AQTGGNLPASNNSGFGNSGFGNNGLK-SGSFGTQVPAQTGGNFAPNIGGFGNSAMSGQGS 360
+ G + SN+SG FGN S FGTQVP Q+ N S S
Sbjct: 301 SLPGNSFKTSNSSGVNAFSFGNTSTSGSFGFGTQVPTQSYQN------------PSTPSS 360
Query: 361 LFPAQTGGNTFTPNSLGFGNVGMTSQVPVQIGGTPFTSNVGGFGNSGMSSQGSQPPSTSL 420
+F A +G N F+ ++ +P +N G G +SQG P +TS
Sbjct: 361 IF-ASSGRNLFSTSTT-----------------SPHFANPSG-GQLPTTSQGLFPVATSP 420
Query: 421 GGANGGQQTIGSNPQEISATDTGIWLKEEWKLGEIPEQPPPAAYV 437
N T ++ ++ S D IW K+EWK+GEIPE+ PP YV
Sbjct: 421 VSIN---LTNIASTEDFSG-DNSIWTKKEWKIGEIPEEAPPDRYV 428
BLAST of Spo02995.1 vs. UniProtKB/TrEMBL
Match:
A0A061GS52_THECC (Zinc finger C-x8-C-x5-C-x3-H type family protein, putative isoform 1 OS=Theobroma cacao GN=TCM_039821 PE=4 SV=1)
HSP 1 Score: 285.4 bits (729), Expect = 1.200e-73
Identity = 192/450 (42.67%), Postives = 251/450 (55.78%), Query Frame = 1
Query: 1 MPLKPEPCRNFLRGHCRFGDKCNFPHVTQQQPKPNAFGFGSQTATQSQQQKPNPFGFGVQ 60
M K EPCRNF RG C++G++C F HV QQQPK NAFGFG+Q A+ QQQKPNPFGFGVQ
Sbjct: 1 MHYKKEPCRNFQRGSCQYGERCKFLHVIQQQPKSNAFGFGTQ-ASSHQQQKPNPFGFGVQ 60
Query: 61 NSTSGAPNSGGFGQSNSQKPFENKWVRPE--------KPQNQSQAASHNCTDPDSCKRQI 120
N+ + + FG N Q F+N W R +P NQ QA +H CTDP+ CKR I
Sbjct: 61 NNVQ-SKGANDFG--NKQNQFKNTWTRSSASSAPSLRQPDNQPQATNHQCTDPELCKRTI 120
Query: 121 ADDFQHESPLWKLTCYGHNKHLPCDISGDISYEELRAVAYDDAKRGTSLPSIVERERGLV 180
+DF+HE PLWKLTCY H K+ PCDI GD+S+EELRA AYDDAK G SL SIVERER L+
Sbjct: 121 IEDFEHERPLWKLTCYSHWKNSPCDIVGDVSFEELRATAYDDAKHGLSLQSIVERERNLL 180
Query: 181 NSKKIEFQNLLNNPYTKHTVPAQSNQSPFPAAVASSPSPT---PQNNGF-SSFAASNQQN 240
NSK +EF+NLL NPYT + Q PFP A A++ SP NGF S ++ +Q
Sbjct: 181 NSKLVEFENLLRNPYTGPVGSTLAQQIPFPTATATAFSPQNTGRSQNGFPPSVSSFSQLG 240
Query: 241 PPPNLSFGVRPSTPSNSSGFGQFQGPGQFTTNTVPSGPFSTPVPAQTGGNLPASNNSGFG 300
N VRPS SN++ FGQ P F+++ S F+T N+P +N
Sbjct: 241 ASLNSGSAVRPSIQSNNA-FGQ---PISFSSSAQASSVFAT-------NNIPLAN----- 300
Query: 301 NSGFGNNGLKSGSFGTQVPAQT-GGNFAPNIGGFGNSAMSGQGSLFPAQTGGNTFTPNSL 360
+ SFG Q P Q+ F ++ F NS+++ T N F+ +
Sbjct: 301 ----------AFSFGNQQPNQSVAAAFTTSMSSFSNSSVT--------STALNQFSAPVV 360
Query: 361 GFGNVGMTSQVPVQIGGTPFTSNVGGFGNSGMSSQGSQPPSTSLGGANGGQQTIGSNPQ- 420
N+ ++S P + F S + S+ +T + +GSN Q
Sbjct: 361 STQNLSLSSVQPPAL-----------FNVSNLISKADGQSATDI--------QLGSNLQR 393
Query: 421 EISATDTGIWLKEEWKLGEIPEQPPPAAYV 437
+I + D+ +WLKE+W GEIPE+ PP AYV
Sbjct: 421 KIVSGDSSVWLKEKWIPGEIPEEAPPDAYV 393
BLAST of Spo02995.1 vs. UniProtKB/TrEMBL
Match:
A0A0V0I6S8_SOLCH (Putative ovule protein OS=Solanum chacoense PE=4 SV=1)
HSP 1 Score: 280.4 bits (716), Expect = 3.700e-72
Identity = 197/465 (42.37%), Postives = 250/465 (53.76%), Query Frame = 1
Query: 1 MPLKPEPCRNFLRGHCRFGDKCNFPHVTQQQPKPNAFGFGSQTA----TQSQQQKPNPFG 60
MP + EPCRNF+RG C++G++C F H QQQPKPN FGFGSQ++ T QQ K NPFG
Sbjct: 1 MPPRKEPCRNFMRGSCQYGERCKFLHAAQQQPKPNPFGFGSQSSNFQNTNMQQTKSNPFG 60
Query: 61 FGVQNSTSGAPNSGGFGQSNSQKPFENKWVRP--------EKPQNQSQAASHNCTDPDSC 120
FGVQ+++ +S + N KPFENKW R + NQ A +H CTD +SC
Sbjct: 61 FGVQSNSQPRGSSDLGLKQNQYKPFENKWTRSATTNSSSSRQTDNQPVAPNHTCTDAESC 120
Query: 121 KRQIADDFQHESPLWKLTCYGHNKHLPCDISGDISYEELRAVAYDDAKRGTSLPSIVERE 180
+RQI +DF +E PLW LTCYGH K+ PCDI+GD+S EELRA AYDDAKRG SL SIVERE
Sbjct: 121 RRQIVEDFNNEKPLWLLTCYGHRKNGPCDITGDVSCEELRAAAYDDAKRGQSLMSIVERE 180
Query: 181 RGLVNSKKIEFQNLLNNPYTKHTVPAQSNQSPFPAAV------ASSPSPTPQNNGFSSF- 240
R LVNSK EF+NLL NPY + A + QSPFP A A SPSP + S F
Sbjct: 181 RSLVNSKVAEFENLLRNPYASTSTSAPNAQSPFPGATPSASLSAQSPSPGAAPSASSPFP 240
Query: 241 -----AASNQQNP----PPNLSFGVRPSTPSNSSGFGQFQGPGQFTTNTVPSGPFSTPVP 300
A+ + Q+P PN + S P ++S F Q T+T P+ F P
Sbjct: 241 GAAPSASLSAQSPFPGAAPNALSSAQSSFPPSASSFSQLGTILNTGTSTPPTSTFGQP-- 300
Query: 301 AQTGGNLPASNNSGFGNSGFGNNGLK-SGSFGTQVPAQTGGNFAPNIGGFGNSAMSGQGS 360
+ G + SN SG FGN S FGTQV Q+ N S +
Sbjct: 301 SLPGNSFKTSNLSGANAFSFGNTSASGSFGFGTQVSTQSYQN------------PSTPSN 360
Query: 361 LFPAQTGGNTFTPNSLGFGNVGMTSQVPVQIGGTPFTSNVGGFGNSGMSSQGSQPPSTSL 420
+F A +G N F+ ++ +P +N G G SQG P +TS
Sbjct: 361 IF-ASSGRNLFSTSTT-----------------SPHFANTSG-GQFPTPSQGLFPVTTSP 420
Query: 421 GGANGGQQTIGSNPQEISATDTGIWLKEEWKLGEIPEQPPPAAYV 437
N T ++ ++ S D IW K+EWK+GEIPE+ PP YV
Sbjct: 421 VSIN---LTNTASTEDFSG-DNSIWTKKEWKIGEIPEEAPPDRYV 428
BLAST of Spo02995.1 vs. ExPASy Swiss-Prot
Match:
C3H16_ARATH (Zinc finger CCCH domain-containing protein 16 OS=Arabidopsis thaliana GN=CG1 PE=1 SV=2)
HSP 1 Score: 226.5 bits (576), Expect = 5.700e-58
Identity = 181/464 (39.01%), Postives = 229/464 (49.35%), Query Frame = 1
Query: 3 LKPEPCRNFLRGHCRFGDKCNFPHVTQQQPKPNAFGFGSQTATQSQQQKP----NPFGFG 62
++ E CRNF RG CR+G+ C F H Q +P N FGFG+Q Q QQQ+ NPFGFG
Sbjct: 1 MRKELCRNFQRGSCRYGENCRFLHPQQAKPN-NPFGFGTQNQQQQQQQQQQNSSNPFGFG 60
Query: 63 VQNSTSGAPNSGGFGQSNSQKPFENKWVRP-------------EKPQNQSQAASHNCTDP 122
VQ+ S PN F+N W R ++ Q+Q A H CTDP
Sbjct: 61 VQSGGSSRPNQ-----------FQNTWSRTASTPTGGGAAASTQQTGKQTQPADHKCTDP 120
Query: 123 DSCKRQIADDFQHESPLWKLTCYGHNKHLPCDISGDISYEELRAVAYDDAKRGTSLPSIV 182
+CKR + DDF++E P+WKLTCYGH K+ PCD++GDISYEELRAVAY++AKRG L SIV
Sbjct: 121 AACKRVMQDDFKNERPMWKLTCYGHWKYFPCDVTGDISYEELRAVAYEEAKRGIPLQSIV 180
Query: 183 ERERGLVNSKKIEFQNLLNNPYTKHTVPAQSNQSPFPAAVASSPSPTPQNNGFSS----F 242
ERER L NSK EF+N L NPY K +V A NQSPF A++PS PQ++ +S F
Sbjct: 181 ERERNLQNSKIAEFENFLRNPY-KGSVTA--NQSPF---AATTPSIFPQSSQINSPSPAF 240
Query: 243 AASNQQNPPPNLSF-GVRPSTPSNSSGFGQFQGPGQFTTNTVPSGPFSTPVPAQTGGNLP 302
+ NQQ N + G+ S P N+ F F F NT G S+ P
Sbjct: 241 SGFNQQTAFSNTNAGGLSSSGPPNA--FASFNQQTTF-PNTNAGGVSSSGPPNPFASFTQ 300
Query: 303 ASNNSGFGNSGFGNNGLKSGSFGTQVPAQTGGNFAPNIGGFGNSAMSGQGSLFPAQTGGN 362
SNN S GL S A N PN G
Sbjct: 301 QSNNQQTAFSNTNAGGLSSSG---PPNAFASFNKQPNAFSVNTPQPVPSGPSGFQTNPST 360
Query: 363 TFTPNSL----GFGNVGMTSQVPVQIGGTPFT----SNVGGFGNSGMSSQGSQPPSTSLG 422
TF P S GF + + Q TP T +N F + + + P +
Sbjct: 361 TFKPASFGPGPGFATTPQNNNIFGQSTPTPATNTSQNNQTAFNFNVPVASFTAPAINTTN 420
Query: 423 GANGGQQTIGSNPQEISATDTGIWLKEEWKLGEIPEQPPPAAYV 437
++G + IG +P D+ IWLKE+W GEIPEQ PP A+V
Sbjct: 421 TSSGTELQIGGDP-----VDSSIWLKEKWNPGEIPEQAPPDAFV 435
BLAST of Spo02995.1 vs. ExPASy Swiss-Prot
Match:
C3H46_ORYSJ (Zinc finger CCCH domain-containing protein 46 OS=Oryza sativa subsp. japonica GN=Os06g0704300 PE=2 SV=1)
HSP 1 Score: 194.5 bits (493), Expect = 2.400e-48
Identity = 139/326 (42.64%), Postives = 180/326 (55.21%), Query Frame = 1
Query: 1 MPLKPEPCRNFLRGHCRFGDKCNFPHVT------QQQPKPNAFGFGSQTATQSQ------ 60
M + E CRNF RG C++G +C + H + QQQ KPN FGFG+ + Q Q
Sbjct: 1 MSRRQEICRNFQRGSCKYGAQCRYLHASPHQQQQQQQAKPNPFGFGTGSRQQQQPSFGSQ 60
Query: 61 -------QQKPNPFGFGVQNSTSGAPNSGGFGQSNSQKPFENKWVR-PEKPQNQS----- 120
QQKPNPFGFGVQ + + + N+ G KPF+NKWVR P P Q+
Sbjct: 61 FQQQQQQQQKPNPFGFGVQGANAQSRNAPG-----PAKPFQNKWVRDPSAPTKQTEAVQP 120
Query: 121 ---QAASHNCTDPDSCKRQIADDFQHESPLWKLTCYGHNKHLPCDISGDISYEELRAVAY 180
QAA +C DP SC++QI++DF++E+P+WKLTCY H ++ PC+I GDIS+EELRA AY
Sbjct: 121 PQAQAAHTSCEDPQSCRQQISEDFKNEAPIWKLTCYAHLRNGPCNIKGDISFEELRAKAY 180
Query: 181 DDAKRGTSLPSIVERERGLVNSKKIEFQNLLNNPYTKHTVPAQSNQSP-FPAAVASSPSP 240
++ K+G SL SIVE ER L N+K +EF NLLN+ A+ +Q+P FP S P
Sbjct: 181 EEGKQGHSLQSIVEGERNLQNAKLMEFTNLLNS--------ARPSQTPSFP---TMSSFP 240
Query: 241 TPQNNGFSSFAASNQQNPPPNLSFGVRPSTPSNSSGFGQFQGPGQFTTNTVPSGPFSTPV 297
+NN SSF AS PP SF S ++ G GPG S PF P
Sbjct: 241 EVKNN--SSFGASQTNGPPVFSSF----SQIGAATNIG--PGPGTTAPGMPASSPFGHPS 300
BLAST of Spo02995.1 vs. TAIR (Arabidopsis)
Match:
AT1G75340.1 (Zinc finger C-x8-C-x5-C-x3-H type family protein)
HSP 1 Score: 226.5 bits (576), Expect = 3.200e-59
Identity = 181/464 (39.01%), Postives = 229/464 (49.35%), Query Frame = 1
Query: 3 LKPEPCRNFLRGHCRFGDKCNFPHVTQQQPKPNAFGFGSQTATQSQQQKP----NPFGFG 62
++ E CRNF RG CR+G+ C F H Q +P N FGFG+Q Q QQQ+ NPFGFG
Sbjct: 1 MRKELCRNFQRGSCRYGENCRFLHPQQAKPN-NPFGFGTQNQQQQQQQQQQNSSNPFGFG 60
Query: 63 VQNSTSGAPNSGGFGQSNSQKPFENKWVRP-------------EKPQNQSQAASHNCTDP 122
VQ+ S PN F+N W R ++ Q+Q A H CTDP
Sbjct: 61 VQSGGSSRPNQ-----------FQNTWSRTASTPTGGGAAASTQQTGKQTQPADHKCTDP 120
Query: 123 DSCKRQIADDFQHESPLWKLTCYGHNKHLPCDISGDISYEELRAVAYDDAKRGTSLPSIV 182
+CKR + DDF++E P+WKLTCYGH K+ PCD++GDISYEELRAVAY++AKRG L SIV
Sbjct: 121 AACKRVMQDDFKNERPMWKLTCYGHWKYFPCDVTGDISYEELRAVAYEEAKRGIPLQSIV 180
Query: 183 ERERGLVNSKKIEFQNLLNNPYTKHTVPAQSNQSPFPAAVASSPSPTPQNNGFSS----F 242
ERER L NSK EF+N L NPY K +V A NQSPF A++PS PQ++ +S F
Sbjct: 181 ERERNLQNSKIAEFENFLRNPY-KGSVTA--NQSPF---AATTPSIFPQSSQINSPSPAF 240
Query: 243 AASNQQNPPPNLSF-GVRPSTPSNSSGFGQFQGPGQFTTNTVPSGPFSTPVPAQTGGNLP 302
+ NQQ N + G+ S P N+ F F F NT G S+ P
Sbjct: 241 SGFNQQTAFSNTNAGGLSSSGPPNA--FASFNQQTTF-PNTNAGGVSSSGPPNPFASFTQ 300
Query: 303 ASNNSGFGNSGFGNNGLKSGSFGTQVPAQTGGNFAPNIGGFGNSAMSGQGSLFPAQTGGN 362
SNN S GL S A N PN G
Sbjct: 301 QSNNQQTAFSNTNAGGLSSSG---PPNAFASFNKQPNAFSVNTPQPVPSGPSGFQTNPST 360
Query: 363 TFTPNSL----GFGNVGMTSQVPVQIGGTPFT----SNVGGFGNSGMSSQGSQPPSTSLG 422
TF P S GF + + Q TP T +N F + + + P +
Sbjct: 361 TFKPASFGPGPGFATTPQNNNIFGQSTPTPATNTSQNNQTAFNFNVPVASFTAPAINTTN 420
Query: 423 GANGGQQTIGSNPQEISATDTGIWLKEEWKLGEIPEQPPPAAYV 437
++G + IG +P D+ IWLKE+W GEIPEQ PP A+V
Sbjct: 421 TSSGTELQIGGDP-----VDSSIWLKEKWNPGEIPEQAPPDAFV 435
The following BLAST results are available for this feature: