Spo07204 (gene)

Overview
NameSpo07204
Typegene
OrganismSpinacia oleracea (Spinach)
DescriptionGlycosyl hydrolase family 3 family protein
LocationSpoScf_00689 : 165292 .. 178678 (-)
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRexonpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TTTTGAAATTCAAATGGGACGGTGGATTAAACTCCGACAAACGACAATGAACAACAACTAAACGGCTATTAAATCCCATAACTTCCGTCTATAAACGCCCGTCAAGACCCTATTTTCCTTTCTCCTTTATGTTTTTCCATTTAACCAACATTCTCCTACCTGTGTCCCATTTATTTTCCTTAATACCCCGAATTAACGCTGCTGGTTAATAGGCCAGATTAATTGGGGGCCTCCTAAAATACGTACTCTTTATGTACGGATAACGGAATGGTGTGGACGTCTGGTCTGGTACCGATCGACTACCGAGTAACCGACCTCACCTTTCTCCCGCCATATTTGTGAAATCGTTAGTTATGCTTGAAATGGAAATCTGGTAATAATCAAACCCTCAATTCGTCACTCCCTCCGCTCTTCTTCCTCACAAAGTCAACCCTCTCTCTCCGAAATTGTTGAATCGAATTGGGCGCCATGGAAGAAAACTGTATTTACAGAAACCCTAACTCATCAATTGAAGAACGAATCAAAGACCTCCTTTCTCGTATGACCCTGCAAGAGAAGCTCGCTCAAATGGCGCAAATCGAACGAACCGTCGCTTCTCCTACCTCTCTCAAAGATATTGGCATTGGTTACTTCTTATTCTTCGTTTTTCTCTCTCCTAGTGTTTCTGTTTTGCGTATTTTTTTAATCAGTTTCATGATTGCGTCTATTTTTACTTCAGGTAGTGTATTGAGTGCCGGTGGAAGTGCGCCTTTCGACAAGGCAGTGTCGTCTGATTGGGCTGATATGATTGATAATTTTCAGAAATTGGCCATGGAATCACGATTAGGGATTCCAATTATTTATGGTGTTGATGCTGTTCATGGTAATAATAATGTTTATGGTGCTACCATTTTCCCTCACAATGTTAATCTTGGTGCCACTAGGTTGGTTGATTACTTCTTTTTTAAACTTAATTTTTTCAATTTTGTTTGGTTTTATGTGTGTGAAAACCGTGTTAATTTTAGGGATTGGATGCAGGGATGCTAATTTGGCTCATAGAATTGGTGCTGCAACTGCTTTGGAAGTTAGAGCTAGTGGAGCTCATTACACCTTTGCTCCCTGTGTAGCTGTAAGTAAATTAGCAGTAATTTTGCATAACATATTGCCCGATTACTTAATTGGATGATGTGAATATAATGATACGGCTTACCCTAATACAGACTCATATATATAATAATATCGTATACTAGTACAATTAGGTGTATACTCTTAGTATCCTCTATTACACCCCCTCAATATGGTGGGGGAACCACGTTGAGATTGAACCTTAAAAACTCAAAACGCTGACAAACCCATCGGTTTGTAACTCGGTAGGGATATAATGTACAACTAAATCACCATGAGCAACACGATCCTTAACAAAATGATAATCTATTTTGATATTTTACTGCGATCATGCTGAACCGGATTCACGCCAAGATAAGACGCATAAACATTATCACAAAATAACTTCACGGGTGTGCGGATAACGATTCCCAACTCGGCTAACAAGGAACGAATCCAAAGTGTATCTTGTACTGTGTACGCAACAACGCGGTACTGTACTTCTGTAGAAGATTTGGAGACAGTGGGTTGCTTTTTCGAGCGCTAGGAAATGAGATTGCCTCCAAGAAAATTGCATAACCAGTAGTGGAACGACGGACAACCTCCCCAATCTGCATCTGAGTAGGCAATCATGATCGACACGTCTGTGGCGGGGAGAAGCCATAGACCGAAGTCTTTTTACCTGTATATACGCTTAACAACCAACAAGTAAGATGTGCGGGGTGCATTCATAAATTGTGAAACTAAATGCAGAACATATGTAATATCAGGTCTTGTCATAGTCAAGTACTGTAAGGTCCCCAACCATACTGCGATACTGAGTAGGATCTACCCTCTGTATGAGAAAGTGTGGTTTGACTAGGAAGAGGGGTATGCACAGGTTTAGGTTTAGCAGTGTGCAGATGGATTCTGACACAAAAATAAACCCTTAGAAGTACGCACAACGTGCACTCCTAGGAAGAAGTGAAATTACCTTATATCTTTCATAGCAGACTCGGCAAATAAAAGAGAAATGAATTTTGTGATCAAGGAAGCCGAGTTGCCTGTGACAATTATATCGTCAACATAATGAAGAAGATGCATAATGTATTTCTCACGACAAAAAATAAACACAGAATTATCTTGTCACGACGAAAAACAAACATAAAATTATCTAATTGACAAGAAAAGAAGCCCTGTTGTAACAAACAAAAAACTGCTGAAGCGGAGATACCAAGCTCTAGGAGCCTGTTTCAAACCATAGATTGTTTTCCATAAGGATGAACGAAACCTGCAGGTTGTTTCGTGTAGACCTCCTCAATTAGATCACCATGTAATAGGTACTTTTAACTTCTAATTGACGAATGACCCAATTATAGGAAAGAGCTATAGACAGAACCAACCGAACAGTAGTTTTAACCACAAGGCTAAAAGTGACTAATCTATACCTTCTTGCTGGCCGAACCCAGACGAGCCTTAGGCCTAATGACAGAACCATATTCGTGAGTATTTATTGTATATCCATTTTGTACCAACCAAGTTCTATGACGCATCAAATGGTACTAAATCCCAAATTTTGTTGACAAGCATTGCATTAATCTCTTCCACCATAGATTTCCCTACAAACAAATTGTTAGTATCACGGGAAGCAAGAGCAACAACAACACTATTAGATGACAAAACATAATTTCATAATGATATACCTCGTGAGTTTTGTGTTTGGGCTTGTGTTTGGTTTTGGTTTGCACCCCCACCACGTCCTTTTTCCATGACCACTACAACCTTTACTACAACCTTTACCACCTCCATGACCCCCATTATTGTTCTGAAATCCCTCAGAATTTCTCCGTGCAGTAGAGGTCGAAGATTGAGTCTTTGACTTGGTCGAGCCAGTATGAGTACTTGCCATAAACACCTGATTTGTTGCAAAACCCTGTTCCATAGCCTGGCGTCCTTTGGCGTCCTTTAACAATCTGTTCGTGATTCATTAATGCCCGACTAACCTCATCAAAAGTCAAGCAACCAGTAAAATGATAAATTATGTGGTCAATGCGTCATAATCTTAATCAAGACCCATCACAATGTGTAGAATTAAATCAAATCATGGGTAGAAATAGGAGCATTAATTGCAGCAAGGGAATCGTTAAGAATTTTTATGTCAGGTAAATAAGATTCCATCGACTATGATTCTTGCTTAACAACATTCTGCAACATTTTCAATTCTATAGAACGAGCCATACTAGCATTCATAAAACGAGATTTTAATCTTTCCCACACCAATGCAGAAAACTTGACCTCATGTGTTTCAACAAGTATATCTTTAGACATCGTAGCAAATAACCATGCCCTAACAGTTTGATTAATTCTCAACCAATGATAATAATCCGGATTCACGGTTTCTCTATTTAAATTATCAATTATATAAACATTGGGCATAGAAATCGAACCATCAAGCATGCCTAATAATCCATGAGAAACTAGAAAAGATTCAAATTGAGTACGCCATGGTAAATAATCCTCCACTGTGCTAAGAACGGTTGTAACAATACTTTTAATATGAGGTCGAAATTATCAAATTGGAAGACAATTCAGCAAGATTTTGAGCGTACAGGGAAAACCAGGTGGTCTACCATACCCACTCCGTGTCAACGATTGTGGGAGAAGACCATGTCCTCCCGTAAAAGTAGGGCCGCCATAATTTGTTGATGACGAAGTGGACGCAAAGTCGTGTGCAATTCCTCCAGCAAATTGTGAATTTGAGCCACGAACTCCATTAATAGGAGGGGTTGGAATAGAAATCAACACCACTGGAATACCCACCGGAAAACCCGCCTGTGCCACCGAAAACTACACCATAACCACCAGCAAGCTGCGTAGAGGAAGCGTAATAAAGGCGGCGTGGAGATGGGTGCGTCGGTTGCCACGTGAAGGTGGTTGCATATTGTTGGTCACCAGTTGCTTGGGGTTACATCAGACCTATCACCGGTGTGCCATTGGTGGTGCGTGGCTGCGAACCGGACGGAGACTGCACGATTCCTAGGGCTGGGTTGCGCAGGGGTGGAAGGTGTGTCCGTCGATGTTGTCAATGGTGAGGTCGATGTTGTCAATCGTTCGAAAACTTGAAGATTTCTACATTACGAACAACTAAGCTAATGTAATAACCAATGCATTGATCTTGAAGTTGTATATAGTTCCGGTGGCGAAATGATTTTGTAGCCGTTATTAAGTTTAAAGTTTTTCAGTAGGAATTATGTCGGATAATATGCAAGTATGTAACGATAAAGCTGTGGAGTTAAGAAAATCCATATGTTTGACACGTCGACTTAGAAAGCCACAAATTGTTCCTTGGGGGGGGCTGCATCCCCTCATCAGAATTCAGAAACCCCCCCCCCCCCTCTCTCAAAAAAAAAAAAAAAGATTAACAAGAAGAGAGGGAGAACTGAGAAGTATGCACTCAATAGGAAATCTTGATACGCTTACTTTCCCGGCCACAATAGTGATGGGTCAATAGCATGGAAGATCTGAGTACGAGTTCATGACTTGATAATGAGCCCTGGGCCTAGGCGACGAGTGCAGCGGCCTAGGGTCCAAGGCGGGAAGGGGGCCCAAAATTTATACTGCCTGCAAATTTGTACTTATAAATAAATGAACATAGGAAGTCATCAAATGCAATACAGCAAGCGGTAAATTGTGATTATGAGCTCCTCAATTTGAGGGTCGTGTGTGGTTCGATCCTCCTTGACCCCTTGTAACAGACATGTTTTATTTTTTGGTAAATCAACAGACATGTTTTTGTCTCTACTCTTTTTCAAGTATCATTCAATTTATTTTAAAATGTCTTTCCTATATTTGTTTTTGACATCTTTATGCCTCTTTATCTCTTTTCCATTCTCATATTGGTAGAATTTTCTTTCACCTGTAATTATTTTATATTTTATTCATTTTCTCATATTCATAAAAATGTTTATTTTATTTGCACTTTGTAACCATGAAGGTAACTAACTAAACTAAGTCACATAGTGGTAGTGGGTATATTAACAGGTAAAAAGTGAAAACCTTTAGTGCCATGAAGTATTATCGGAAGTACTTCATACTAACATTTCTACTCGTTCTTCTGTTTTACCATTATAAAAAAAAAATAGTTTCTTCTTTTTTACGTTAGAATACATTCAAAGCACTCCGTATATATTTGTGAGGTTTTATCAGATTTAATAGATTATATAATACTACATTATATAAATTACTTCGTTACAAGGGGCCCTGATCTCTTACTTTGCCTAGGGCCCATGAATTGTCAGGAACGGGGCTGTTGATAATTGGGACTTTGGGAGGAACACAGAAAGTGATGGAATTCCATTGCTTAAATACAAGTATACAACTATAGTGTCCAACTATTGTCATAATAAGTATTTCGTATCTCTTTGTAGTGCTTACTGCTTATGTATCTTGTGATTCTTGTCTCTTTCTGAAAAATATGAAGTTGGTCTTCTAACAGGTCTGCAGAGATCCGAGATGGGGAAGATGCTATGAGTGTTATAGCGAAGACACTGAAATTGTGAGGAAGATGACTTCCATTGTATCAGGGTTGCAGGGGGATCCACCAAAAGGACACCCAAACGGTTATCCGTTTCTGAAAGGAAGGTGAGGGAAAAGTCAGTTTTCAACCAATCCTTTCATAATATTAGAATATATTGTAGCATAGATTACAGTAGACCGTAGTGCCTTATGGTTCAATTTGGTACGATTTTGGAGTCTGAAGATTAGTATCCTTTTTCATTTATGTTTTCTTTTTGGTTTGTATTTTGTTCCAGAAAGAATGCCATTGCTTGTGCGAAACACTTTGTTGGTGATGGAGGAACACACAAAGGTGTAAATGAGGGGAATACAATCTCTTCATATGAAGATTTAGAGAGGATTCATATGGCACCTTATCTAGATTGTATTTCTCAGGGAGTTTGCACTGTCATGGCCTCCTATACCAGTTGGAATGAGAGAAAGCTACATGCTGATCGCTTTCTTCTCACTGAGGTCCTTAAAGATAAGCTAGGATTCAAGGTATATGGTCATATTTACGTTAATTTATATAAGCTTATGTTGCCTCATTTTTGTTTAAGTGGGGTAAGGGGGGGTAGGACGTTCGCAACCTTACTCCTGCAATTTGCAGAGAGGTTGTTTCCAATTGACCCAAAAGAGATTACGGGACGACAACGGGATGACCATCTTCTACTTCATGGAAAGGAAGCGAAGAGGTTTTAAGGGCCATTTTGATTTAGTCCATACATTCCACAGCCAAAAGAACAAATGAATCTTTGGCTCCATCAGAAGTGGACATTGGACATTTGTTTAAGTAACTCAAGAAAATGCTTTACCTTGGATATTATTTTACTTTTGGAACTATTGATTCTACCCTTGCTCCTTGTTTCCCTTTAGATAAACCATTTAGAAGCTGTGCTACCCTATCATTTACCACAATGAGCCTTGCTTAGAATGCTTACCTTCTTCTACTAGCCCTGATAAATGTGACACTGACAAATCCTGCTTTTATTTTACATAAGTTTTAATTTGGAAGCTTGTGTATGTGTTTCAACTGGGAGATGTTCAGATTGTTCACCATCCTGGTTTCCACTGCAGGGATTTGTAATTTCAGATTGGGAAGGCATTGATAGACTCTGTGTACCTCAAGGTGCAGACTTTCGCTTCTGCATTACAGCTGCTGTTAATGCAGGAATTGACATGGTAAGTGGTAGGACACTAAATTGTAGAACTTAGTTTTTCTTATATTTGCATCTTCCAAGTTCCAACCATGATGAAGTAAGCGTCATGCAGGTAATGGTACCTTTTAAGTACGAAAAGTATCTAGAGGACTTGACCTATCTTGTGGATTCAAAAGACATACCGTTGTCTAGGATTGACGATGCTGTTGAACGGATTCTTAGGGTAAAATTCGTTGCAGGCCTTTTTGAGCACCCTTACACTGACCGGTCATTGCTGGATACAGTTGGCTGCAAGGTTAGTTTTTTTTTTGCATTTATGTCTTCCTTATTTTTCTTGAAGTTTTGGATTTACCTCAATGAGAAATTAAATAGGTTGTTTTTTCTGATTTTTTATTTTTATTTTGAAGTGGCTTTCCTTTAATCTCTTTACTGGATTTTGAAAGGCTTGGATCACATACTATTAGATTTTTAATATCTAGCCATTCCTACTATTTTTGCAGCTTGCAGAAATAAATATATTCTTATGGACTGTATTAAGCGGTTCTCTAAACACCATCTTTTGTTCCAAATTTGTGTTCCCATTTCTTACTTATCGTTGGTAAATAGTGCTTTGGTACTTTCAAACTGTCTATTTAATTTATTTCTGGGATAAGAATGATCTCCTAGCTCATTACAACGATGACCTGCGTTTGTGACAGATGTCACTGCCACTGGGCATGGGGAAATTTCCCAAAGAGGTCTAGCTACATGGAAAGCATGACCTAATGGTTGGTTGGAACTTGGAACTGTTGGAAGGCTTTTCACTTTCTGTAATGTGGTGTTAACCCCGTCATATAAAAGCATCCTAAGAAACCTACTTTATTTTGTTTGGTAGTATTCTAAAGACTTTAAGCCTGGGCGGGGTGGTGAAGGGAGCATTTGTGTCTTTCTTACATGTTCTGTATGTCAATCAATGGATGCCACTCCCCTTGCAGCACTTTTTGACGTAGCTCTTTGGACAACAATGTTAAACTTTACTTTGATTGTTACCCTAGGACCTTTTATACCATCACATGACATCACAAGAAAAATGCGTCATTTATGATATCAATCAGTTATTATATTTTTATGAAATCATTGTCTCCATTCTAAATTAATCTTCATTGCTTATGGTTTATCCACTGGCATAGATCTAGAAGTAAAGAACTAACTGCTCGACTTTCTGTCAGTTGCATCGCGAGCTAGCACGTGAAGCAGTCCGGAAGTCTTTGGTGCTCCTCAAAAACGGGAAAGACCCTAAATATCCATTCCTTCCGCTGGGCAGGAATGCCAGAAAAGTTCTTGTTGCTGGAACACATGCCAACAATCTTGGATATCAATGTGGTGGGTGGACTATAACTTGGTACGGCCTTAGTGGCAGAGTTACTATTGGTACGTATTTCTTCATTTCTTGGACAAGATGTTGTTCCTTGCATCTTTTGTACTTGTGTTGCCGATCTTGTTAGCTTTTTGTGTGCTGCTTTGCACTGTTATTAATTGTACTAAAAAGATTGTTATCTCTATACACTTCTACCGATATTAATGATGCACTTGTATAGTTGTATGTATATTACTTAATGATACTATTCACTTTAATTTTATTGAAAATAAACATTGATTTCAGAGTTTCAGACTAATACATCATCTGTGCTCCAGGCACTACCATTTTGGATGCTATTAAACAAGCCGTTGGAGACAACACTCAAGTGATATATGAAGAGTTTCCATCACGTGACACCTTAGCAAGAGAAGATTTTGATTTTGCCATTGTTGCTGCTGGTGAAGCTCCATACGCAGAATTCACAGGGGACAATTCGGTGCTCAATGTTCCCTTGAATGGAGCTGATGTAATTAGTGCAGTTGCAGATAAAATTCCTACACTTGTCATTCTGGTTTCTGGAAGGCCTTTAGTTTTAGAGCCTTGGCTTTTGGGAAAAATCGATGCTTTGGTTGCTGCATTTTTACCTGGAAGTGAAGGAGATGGAGTAACTGATGTTCTTTTTGGAGATTATGAATTTGAGGGTCTTCTCCCTGTGACATGGTTTAAAAGTGTTGATCAACTGCCTATGACGGCTGGACAGAATTCATACGATCCTTTATTCCCCTTGGGTTTTGGGTTGAAAACCAACAACGGAAAACATGTAATTTGATGATTTTGAGTATCAAGAATCGCTACAGGTGTTGTATTTTTTGATAAAGAAAGAATATTTTTGTGAAATAATAAGAGCTTTTATTATTCGTCGATAATTCGTGTATTTATCCTTCTGGTTTAACTCATTCCCGCCGGTCCGAAGCAGCACAATTATGATGTTGATTCATTAGTGCGTTTCAATGGATTCGGAACATACATGTCCATCAGTTTGAACTCACAGTTATTGTCTATGATTACACTAATGTATCGGTCCGTTATAATGTGACCTGAATCGTCATTTTCAGGATTCCAACTGTCAAAGAAAAAGGATCATGAAGTCATCATCGTTTGGTGCTCTAATAAAGTGAGCCTCAGTCGCCAAATCAGTTTTGGTATGTCTATTTTTATCTGCCCTCTAGCTTTCTAAATGGTGCCTTCATTATACTGTTCTGTTACAAAGTCCATTCCCTCTTTCACATTTGTGATTTGTGAATATGTGATGAAGGTTTTCATCATTCATTGACATGAAACTAGTGCTTTGCCATTTGTGTTTCCTTTTAAAGAAATTTTACCTCGCAATGGTGTCCTGACTCTTTTTCTCTCATCTGTTATCCCGACTTCTGATTTTTTATGCCGGTGATTTTCTGGTTTGAAAATTACTCATAGAATAAAGTAGTCTCACCTTTATTTGTGCATGTTGCAAAGAAAAACTTATTGGAGATGAAGTGAGCATTTTTTAATTTCTATAGTTGAGTTTTATGCTCTATGAACAGAACTTACTGCATGTACAACACAGACGTGAGGTAGGCTTGTATGACATCTGATGGTGAAATATGAGCTTGCCTGCTTTCTGGCCTCAGCTAGTTACTTGTCTAATCATTGAAGATGAACATGCAATGCAACTGGGACTACATCTTAGTGTTTGTAAGTTGGTTAGGCATGAGTATTTTCTGTGGAAACGACCTTTGTAATGGATATTTGTTAGTAAGGGGGAAGGAATCCGGTCCTTACCGCGTTGGAATTAAAGGTTCACTAGATTGCTGCAATCATAATAACAGGACGGGATAACTTCCTGTGAAGGTGGTTCTGTGACAGTTATCGTGATCATGAAGAGGTTAGGTATCATCAATTTGAACCTTAGGTGAGTAATTGGCCTCTTTGGTTAATGGGTCATACACTGTAGTAGGTTTTTGGGTGAGTAATTGGCTTCTTTGGTTAATGGGTCAAAGTATTTAATACCTGGGGAATGATTTCGGTACATCAAATTTTGATTTAGGCACACCTGGTGTGAGCACTACTGCTCTATTTGTGAGCAGGTGCACCAAAATCAGTTTTCGTACATATACATTGTTTAGTTGTAAACCCACTCGTGATTAGACTATACTAACTACAAGTACATTATTAATGTGCAATAGTGGCAATTACGTCATGAATGAATGCATCATCACGATAATAAAGGGCCGTGTATGCGTAAAGCATAAAGCATTTTGCTTTGTCTCCCTGAACTATGTCAGTTTTTGAGCCATAGGCTTGAGTGATGAGTCTTTAATCGCTTTTGTTTTTAATGTAAATAGGTTATGCCGTACAGACCGATTGCAAAAGATCGTGATGTTTTCATGAATAAATAAACTATGAAAAAGTAGGCAGTGACATGCTAGTGGAATTGGATTTAGTTACTCCTAATACAGGACGATTTAGTGGAAATGATTCGTTAGGAAAATGGATGGAGTTGTCAAATCTCTTTTTTTTTGTTAGTTTCCTTGTATTTTGATCACCGACAACAACAATAAAGCCTTTGTCCCAAAATCTTTTGGGGTCGGCCAACATGAATGTATTTTGATCATCATTTATAAAAAATCATAGGAGTACTTTTTTCACTCCTATAATTTAAGAAATTGGGTACAAGGAGAGGGAGATGTGAATCATCTACAGTATCTAAGTAAGGAATTTAGGAAGTTTCTAAAAGGGTATGCTTAGGTATTTGCAGTGCAAAAGAAAATATGGGGTAAGTAAAACTTTATTAAATGGTAAACATTGTGATATTTTTAGCGAAATAATACTTTATCCCATCTCCTACTTTCATTCTCAAATGGAAGAGTGACTTTCCGTTTGACAAAGTATGATTTTCAGCATGGCTGCCTGTGATAGAAAGGATGAGACTGAGTTCTTCTATGCTGGAGTTAATTTAGGAAACCATATATTTTCCAACTAGATTTGCCATCTCTGGGTTCCTATGATTCTATGAAATAAATAGTGCAAAAGAAGAGGAGGAAAAGGCAACCAACTGGAATCTTATTGAGTGCGTTTTGACAAAGCCAACCTCTCCCAAGTATCCAATGCCATGGTCATTTCTTTCTTTTGTGCCGAATGTCTTTTCGCATTAATCTTGTGGTTGAATATAGCCGTTTAGCTTAAGGCTTACAGTATGCTGAAAGGACGTTTCTATCTCATTCTGAAGTAGAGATGCAGCTACTGCTCATGACATGGAGAGTTAATGTTTTGATGAAGAGAGTTGCTTAGTTGAATAATTAAGCCAAAAATGGATGTAACTTCATATGTGAATGTGGCAGAATCACCGGGAAACGCTAATTTGGGAGAACACTAATTTGTAGCTATTAAATATGTGCCCCTATGTTTGAAACATAGCGGATACATGACACGTCAGATTCTAGAGGAGTCTGTGCATGCATGACGATGTCTGTGCATAGGCATTCAGGGTAGCTGTTGCTTGATGATAAACTGGCGAACTTCAAAGAGAACAATGTCATAGATTAATTGTCATTGATCATGATATTGAGTCAATGTCCTCATCATACCATATTAGCATTACACCCGCTATAATGGTAATTTTGTGACTAAACCCGCTGAGGTTGAGAGATCCATTTGTTCTTTCCTTCTTGACTCATGCTAAGTTAATTATCTATATAACTATCGGCTGATGACGTATTATAATCTTGTATGCATGCTGATTGACTCTATGCTTGCTAAAGCAGAGTCTGTGAAACCAGAATATTTTGTACCATTTATGATCATAAGTCAAATTAGGGCAAATTCCCTGAGCAAGGATCTTCCATGTTATAATATGAGAGATTCAGAATTTTCTGTATCTACTAATCTACTATATCATGGTGATTCTTCCTTGAAATTCGTATATTGACTAAATTTCAGCTAGTAAATGACATCACCACTCGCACCATACACAGTTACACACACAGACTGAGGGACAACCAAATCTATGCTGGTGAATTTTATAAAAATCAACAGCATCCAAGAACGTCAAGGCTTCTAACTAATGTTTCAGGCCCTGTCAAGGCCTTTACCAGTAGTGGCTCTTTGCAAATTTATGCCCTCATTCTCCCTTTTATATACTTTGTTTTATTTGACTAATTCTGACAAAGATTCTATACTATCCTAGTAAGAAGGCACTACTGTACAAGTATACGAATGGATAGAAAATAGCATCTGCCTATTGTGTCTACAATTCTGCATCATAGATATCCCAAATTGATTAGCACTCTCACTTCCTCCCGAGTTTTATGATGAGTTCCTGACTTATTAATCTATCCTGATAAACTCTATTTACACAACAATACATAATGTATTTCTGAAAATTCAGAAGTCTGTATACTAAGCTGTCAGTTTCCTTTGATAAACTCTATTTGCACAACAATACATAACATATTTCTCAAATATCCAGTATAGTTGTCATTGTTCACTAGCATCGTTGATGCCCTTGAGTTTTCCTCCAAATGCTTTCCTNAGTATAGTTGTCATTGTTCACTAGCATCGTTGATGCCCTTGAGTTTTCCTCCAAATGCTTTCCTCTATCTAATTTCATTTGATATTTGGATTGACTATACCTTTTATATATATATATATATATATTTTTTTTTTTTACTGGAACTCTTGCATTTTCTATGAGTTTTTTATGAATATTTTGTTGCTGTATAAATGGAGCTGCATGCATCTCATGCCGATGTAGCTAATATGGAACAGCTCTTGTTTGGATCCCTGTGTATCAGCAACGGAAAAGTATTTGAGAGACAAAACAAGGAAATAAATTTGCTGACTTTTCACGGAAAAAATATACTAGCTGATGCCTGATGGTGCTTTTGGATATACGAAGTCAGCACGACCCCTCTCCCCCAAACAAAACAAAACAGAAAAGGAAAACTGAAAGACTTGGGTTTTCCCGTAATTCTCTACTCTCTTCTGTGCATGTAGTTATGCATGCCTGCTGATCTAGATATCTGTGTGGTCTCATGATTGGGTCATAAGTCATTCTTATTGTCATTGATCATGTTTTCTGTGGAACTTTATTACTGCTATGAAGTTGAATATTAATCAAGCATTTGACTGATCATAGCTTTTGGATTGAAGTTTGTTATGACATTCTTTACGGATCGCAGGAGTGGCTACACTTTTCTGTGGACTCACATCAAAATATAAAGGCCTCAATACA

mRNA sequence

TTTTGAAATTCAAATGGGACGGTGGATTAAACTCCGACAAACGACAATGAACAACAACTAAACGGCTATTAAATCCCATAACTTCCGTCTATAAACGCCCGTCAAGACCCTATTTTCCTTTCTCCTTTATGTTTTTCCATTTAACCAACATTCTCCTACCTGTGTCCCATTTATTTTCCTTAATACCCCGAATTAACGCTGCTGGTTAATAGGCCAGATTAATTGGGGGCCTCCTAAAATACGTACTCTTTATGTACGGATAACGGAATGGTGTGGACGTCTGGTCTGGTACCGATCGACTACCGAGTAACCGACCTCACCTTTCTCCCGCCATATTTGTGAAATCGTTAGTTATGCTTGAAATGGAAATCTGGTAATAATCAAACCCTCAATTCGTCACTCCCTCCGCTCTTCTTCCTCACAAAGTCAACCCTCTCTCTCCGAAATTGTTGAATCGAATTGGGCGCCATGGAAGAAAACTGTATTTACAGAAACCCTAACTCATCAATTGAAGAACGAATCAAAGACCTCCTTTCTCGTATGACCCTGCAAGAGAAGCTCGCTCAAATGGCGCAAATCGAACGAACCGTCGCTTCTCCTACCTCTCTCAAAGATATTGGCATTGGTAGTGTATTGAGTGCCGGTGGAAGTGCGCCTTTCGACAAGGCAGTGTCGTCTGATTGGGCTGATATGATTGATAATTTTCAGAAATTGGCCATGGAATCACGATTAGGGATTCCAATTATTTATGGTGTTGATGCTGTTCATGGTAATAATAATGTTTATGGTGCTACCATTTTCCCTCACAATGTTAATCTTGGTGCCACTAGGGATGCTAATTTGGCTCATAGAATTGGTGCTGCAACTGCTTTGGAAGTTAGAGCTAGTGGAGCTCATTACACCTTTGCTCCCTGTGTAGCTGTCTGCAGAGATCCGAGATGGGGAAGATGCTATGAGTGTTATAGCGAAGACACTGAAATTGTGAGGAAGATGACTTCCATTGTATCAGGGTTGCAGGGGGATCCACCAAAAGGACACCCAAACGGTTATCCGTTTCTGAAAGGAAGAAAGAATGCCATTGCTTGTGCGAAACACTTTGTTGGTGATGGAGGAACACACAAAGGTGTAAATGAGGGGAATACAATCTCTTCATATGAAGATTTAGAGAGGATTCATATGGCACCTTATCTAGATTGTATTTCTCAGGGAGTTTGCACTGTCATGGCCTCCTATACCAGTTGGAATGAGAGAAAGCTACATGCTGATCGCTTTCTTCTCACTGAGGTCCTTAAAGATAAGCTAGGATTCAAGGGATTTGTAATTTCAGATTGGGAAGGCATTGATAGACTCTGTGTACCTCAAGGTGCAGACTTTCGCTTCTGCATTACAGCTGCTGTTAATGCAGGAATTGACATGGTAATGGTACCTTTTAAGTACGAAAAGTATCTAGAGGACTTGACCTATCTTGTGGATTCAAAAGACATACCGTTGTCTAGGATTGACGATGCTGTTGAACGGATTCTTAGGGTAAAATTCGTTGCAGGCCTTTTTGAGCACCCTTACACTGACCGGTCATTGCTGGATACAGTTGGCTGCAAGTTGCATCGCGAGCTAGCACGTGAAGCAGTCCGGAAGTCTTTGGTGCTCCTCAAAAACGGGAAAGACCCTAAATATCCATTCCTTCCGCTGGGCAGGAATGCCAGAAAAGTTCTTGTTGCTGGAACACATGCCAACAATCTTGGATATCAATGTGGTGGGTGGACTATAACTTGGTACGGCCTTAGTGGCAGAGTTACTATTGGCACTACCATTTTGGATGCTATTAAACAAGCCGTTGGAGACAACACTCAAGTGATATATGAAGAGTTTCCATCACGTGACACCTTAGCAAGAGAAGATTTTGATTTTGCCATTGTTGCTGCTGGTGAAGCTCCATACGCAGAATTCACAGGGGACAATTCGGTGCTCAATGTTCCCTTGAATGGAGCTGATGTAATTAGTGCAGTTGCAGATAAAATTCCTACACTTGTCATTCTGGTTTCTGGAAGGCCTTTAGTTTTAGAGCCTTGGCTTTTGGGAAAAATCGATGCTTTGGTTGCTGCATTTTTACCTGGAAGTGAAGGAGATGGAGTAACTGATGTTCTTTTTGGAGATTATGAATTTGAGGGTCTTCTCCCTGTGACATGGTTTAAAAGTGTTGATCAACTGCCTATGACGGCTGGACAGAATTCATACGATCCTTTATTCCCCTTGGGTTTTGGGTTGAAAACCAACAACGGAAAACATGTAATTTGATGATTTTGAGTATCAAGAATCGCTACAGGTGTTGTATTTTTTGATAAAGAAAGAATATTTTTGTGAAATAATAAGAGCTTTTATTATTCGTCGATAATTCGTGTATTTATCCTTCTGGTTTAACTCATTCCCGCCGGTCCGAAGCAGCACAATTATGATGTTGATTCATTAGTGCGTTTCAATGGATTCGGAACATACATGTCCATCAGTTTGAACTCACAGTTATTGTCTATGATTACACTAATGTATCGGTCCGTTATAATGTGACCTGAATCGTCATTTTCAGGATTCCAACTGTCAAAGAAAAAGGATCATGAAGTCATCATCGTTTGGTGCTCTAATAAAGTGAGCCTCAGTCGCCAAATCAGTTTTGGAGTGGCTACACTTTTCTGTGGACTCACATCAAAATATAAAGGCCTCAATACA

Coding sequence (CDS)

ATGGAAGAAAACTGTATTTACAGAAACCCTAACTCATCAATTGAAGAACGAATCAAAGACCTCCTTTCTCGTATGACCCTGCAAGAGAAGCTCGCTCAAATGGCGCAAATCGAACGAACCGTCGCTTCTCCTACCTCTCTCAAAGATATTGGCATTGGTAGTGTATTGAGTGCCGGTGGAAGTGCGCCTTTCGACAAGGCAGTGTCGTCTGATTGGGCTGATATGATTGATAATTTTCAGAAATTGGCCATGGAATCACGATTAGGGATTCCAATTATTTATGGTGTTGATGCTGTTCATGGTAATAATAATGTTTATGGTGCTACCATTTTCCCTCACAATGTTAATCTTGGTGCCACTAGGGATGCTAATTTGGCTCATAGAATTGGTGCTGCAACTGCTTTGGAAGTTAGAGCTAGTGGAGCTCATTACACCTTTGCTCCCTGTGTAGCTGTCTGCAGAGATCCGAGATGGGGAAGATGCTATGAGTGTTATAGCGAAGACACTGAAATTGTGAGGAAGATGACTTCCATTGTATCAGGGTTGCAGGGGGATCCACCAAAAGGACACCCAAACGGTTATCCGTTTCTGAAAGGAAGAAAGAATGCCATTGCTTGTGCGAAACACTTTGTTGGTGATGGAGGAACACACAAAGGTGTAAATGAGGGGAATACAATCTCTTCATATGAAGATTTAGAGAGGATTCATATGGCACCTTATCTAGATTGTATTTCTCAGGGAGTTTGCACTGTCATGGCCTCCTATACCAGTTGGAATGAGAGAAAGCTACATGCTGATCGCTTTCTTCTCACTGAGGTCCTTAAAGATAAGCTAGGATTCAAGGGATTTGTAATTTCAGATTGGGAAGGCATTGATAGACTCTGTGTACCTCAAGGTGCAGACTTTCGCTTCTGCATTACAGCTGCTGTTAATGCAGGAATTGACATGGTAATGGTACCTTTTAAGTACGAAAAGTATCTAGAGGACTTGACCTATCTTGTGGATTCAAAAGACATACCGTTGTCTAGGATTGACGATGCTGTTGAACGGATTCTTAGGGTAAAATTCGTTGCAGGCCTTTTTGAGCACCCTTACACTGACCGGTCATTGCTGGATACAGTTGGCTGCAAGTTGCATCGCGAGCTAGCACGTGAAGCAGTCCGGAAGTCTTTGGTGCTCCTCAAAAACGGGAAAGACCCTAAATATCCATTCCTTCCGCTGGGCAGGAATGCCAGAAAAGTTCTTGTTGCTGGAACACATGCCAACAATCTTGGATATCAATGTGGTGGGTGGACTATAACTTGGTACGGCCTTAGTGGCAGAGTTACTATTGGCACTACCATTTTGGATGCTATTAAACAAGCCGTTGGAGACAACACTCAAGTGATATATGAAGAGTTTCCATCACGTGACACCTTAGCAAGAGAAGATTTTGATTTTGCCATTGTTGCTGCTGGTGAAGCTCCATACGCAGAATTCACAGGGGACAATTCGGTGCTCAATGTTCCCTTGAATGGAGCTGATGTAATTAGTGCAGTTGCAGATAAAATTCCTACACTTGTCATTCTGGTTTCTGGAAGGCCTTTAGTTTTAGAGCCTTGGCTTTTGGGAAAAATCGATGCTTTGGTTGCTGCATTTTTACCTGGAAGTGAAGGAGATGGAGTAACTGATGTTCTTTTTGGAGATTATGAATTTGAGGGTCTTCTCCCTGTGACATGGTTTAAAAGTGTTGATCAACTGCCTATGACGGCTGGACAGAATTCATACGATCCTTTATTCCCCTTGGGTTTTGGGTTGAAAACCAACAACGGAAAACATGTAATTTGA

Protein sequence

MEENCIYRNPNSSIEERIKDLLSRMTLQEKLAQMAQIERTVASPTSLKDIGIGSVLSAGGSAPFDKAVSSDWADMIDNFQKLAMESRLGIPIIYGVDAVHGNNNVYGATIFPHNVNLGATRDANLAHRIGAATALEVRASGAHYTFAPCVAVCRDPRWGRCYECYSEDTEIVRKMTSIVSGLQGDPPKGHPNGYPFLKGRKNAIACAKHFVGDGGTHKGVNEGNTISSYEDLERIHMAPYLDCISQGVCTVMASYTSWNERKLHADRFLLTEVLKDKLGFKGFVISDWEGIDRLCVPQGADFRFCITAAVNAGIDMVMVPFKYEKYLEDLTYLVDSKDIPLSRIDDAVERILRVKFVAGLFEHPYTDRSLLDTVGCKLHRELAREAVRKSLVLLKNGKDPKYPFLPLGRNARKVLVAGTHANNLGYQCGGWTITWYGLSGRVTIGTTILDAIKQAVGDNTQVIYEEFPSRDTLAREDFDFAIVAAGEAPYAEFTGDNSVLNVPLNGADVISAVADKIPTLVILVSGRPLVLEPWLLGKIDALVAAFLPGSEGDGVTDVLFGDYEFEGLLPVTWFKSVDQLPMTAGQNSYDPLFPLGFGLKTNNGKHVI
Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Spo07204.1Spo07204.1mRNA


Homology
BLAST of Spo07204.1 vs. NCBI nr
Match: gi|902208388|gb|KNA15936.1| (hypothetical protein SOVF_093700 [Spinacia oleracea])

HSP 1 Score: 1239.9 bits (3207), Expect = 0.000e+0
Identity = 607/608 (99.84%), Postives = 607/608 (99.84%), Query Frame = 1

		  

Query: 1   MEENCIYRNPNSSIEERIKDLLSRMTLQEKLAQMAQIERTVASPTSLKDIGIGSVLSAGG 60
           MEENCIYRNPNSSIEERIKDLLSRMTLQEKLAQMAQIERTVASPTSLKDIGIGSVLSAGG
Sbjct: 1   MEENCIYRNPNSSIEERIKDLLSRMTLQEKLAQMAQIERTVASPTSLKDIGIGSVLSAGG 60

Query: 61  SAPFDKAVSSDWADMIDNFQKLAMESRLGIPIIYGVDAVHGNNNVYGATIFPHNVNLGAT 120
           SAPFDKAVSSDWADMIDNFQKLAMESRLGIPIIYGVDAVHGNNNVYGATIFPHNVNLGAT
Sbjct: 61  SAPFDKAVSSDWADMIDNFQKLAMESRLGIPIIYGVDAVHGNNNVYGATIFPHNVNLGAT 120

Query: 121 RDANLAHRIGAATALEVRASGAHYTFAPCVAVCRDPRWGRCYECYSEDTEIVRKMTSIVS 180
           RDANLAHRIGAATALEVRASGAHYTFAPCVAVCRDPRWGRCYECYSEDTEIVRKMTSIVS
Sbjct: 121 RDANLAHRIGAATALEVRASGAHYTFAPCVAVCRDPRWGRCYECYSEDTEIVRKMTSIVS 180

Query: 181 GLQGDPPKGHPNGYPFLKGRKNAIACAKHFVGDGGTHKGVNEGNTISSYEDLERIHMAPY 240
           GLQGDPPKGHPNGYPFLKGRKNAIACAKHFVGDGGTHKGVNEGNTISSYEDLERIHMAPY
Sbjct: 181 GLQGDPPKGHPNGYPFLKGRKNAIACAKHFVGDGGTHKGVNEGNTISSYEDLERIHMAPY 240

Query: 241 LDCISQGVCTVMASYTSWNERKLHADRFLLTEVLKDKLGFKGFVISDWEGIDRLCVPQGA 300
           LDCISQGVCTVMASYTSWNERKLHADRFLLTEVLKDKLGFKGFVISDWEGIDRLCVPQGA
Sbjct: 241 LDCISQGVCTVMASYTSWNERKLHADRFLLTEVLKDKLGFKGFVISDWEGIDRLCVPQGA 300

Query: 301 DFRFCITAAVNAGIDMVMVPFKYEKYLEDLTYLVDSKDIPLSRIDDAVERILRVKFVAGL 360
           DFRF ITAAVNAGIDMVMVPFKYEKYLEDLTYLVDSKDIPLSRIDDAVERILRVKFVAGL
Sbjct: 301 DFRFGITAAVNAGIDMVMVPFKYEKYLEDLTYLVDSKDIPLSRIDDAVERILRVKFVAGL 360

Query: 361 FEHPYTDRSLLDTVGCKLHRELAREAVRKSLVLLKNGKDPKYPFLPLGRNARKVLVAGTH 420
           FEHPYTDRSLLDTVGCKLHRELAREAVRKSLVLLKNGKDPKYPFLPLGRNARKVLVAGTH
Sbjct: 361 FEHPYTDRSLLDTVGCKLHRELAREAVRKSLVLLKNGKDPKYPFLPLGRNARKVLVAGTH 420

Query: 421 ANNLGYQCGGWTITWYGLSGRVTIGTTILDAIKQAVGDNTQVIYEEFPSRDTLAREDFDF 480
           ANNLGYQCGGWTITWYGLSGRVTIGTTILDAIKQAVGDNTQVIYEEFPSRDTLAREDFDF
Sbjct: 421 ANNLGYQCGGWTITWYGLSGRVTIGTTILDAIKQAVGDNTQVIYEEFPSRDTLAREDFDF 480

Query: 481 AIVAAGEAPYAEFTGDNSVLNVPLNGADVISAVADKIPTLVILVSGRPLVLEPWLLGKID 540
           AIVAAGEAPYAEFTGDNSVLNVPLNGADVISAVADKIPTLVILVSGRPLVLEPWLLGKID
Sbjct: 481 AIVAAGEAPYAEFTGDNSVLNVPLNGADVISAVADKIPTLVILVSGRPLVLEPWLLGKID 540

Query: 541 ALVAAFLPGSEGDGVTDVLFGDYEFEGLLPVTWFKSVDQLPMTAGQNSYDPLFPLGFGLK 600
           ALVAAFLPGSEGDGVTDVLFGDYEFEGLLPVTWFKSVDQLPMTAGQNSYDPLFPLGFGLK
Sbjct: 541 ALVAAFLPGSEGDGVTDVLFGDYEFEGLLPVTWFKSVDQLPMTAGQNSYDPLFPLGFGLK 600

Query: 601 TNNGKHVI 609
           TNNGKHVI
Sbjct: 601 TNNGKHVI 608

BLAST of Spo07204.1 vs. NCBI nr
Match: gi|731349751|ref|XP_010686158.1| (PREDICTED: lysosomal beta glucosidase-like [Beta vulgaris subsp. vulgaris])

HSP 1 Score: 1124.4 bits (2907), Expect = 0.000e+0
Identity = 538/608 (88.49%), Postives = 574/608 (94.41%), Query Frame = 1

		  

Query: 1   MEENCIYRNPNSSIEERIKDLLSRMTLQEKLAQMAQIERTVASPTSLKDIGIGSVLSAGG 60
           ME NCIYRNPN+ IEERIKDLLSRMTLQEKLAQM QIER VAS +++KD+GIGS+LSAGG
Sbjct: 373 MEVNCIYRNPNAPIEERIKDLLSRMTLQEKLAQMTQIERRVASSSAIKDLGIGSILSAGG 432

Query: 61  SAPFDKAVSSDWADMIDNFQKLAMESRLGIPIIYGVDAVHGNNNVYGATIFPHNVNLGAT 120
           SAPFDKA+SSDWADMID FQKLAM+SRL IPIIYGVDAVHGNNNVYGATIFPHNVNLGAT
Sbjct: 433 SAPFDKALSSDWADMIDGFQKLAMDSRLAIPIIYGVDAVHGNNNVYGATIFPHNVNLGAT 492

Query: 121 RDANLAHRIGAATALEVRASGAHYTFAPCVAVCRDPRWGRCYECYSEDTEIVRKMTSIVS 180
           RDA+LAHRIG ATALEVRASGAHYTFAPCVAVCRD RWGRCYECY ED+EIVRKMTSIVS
Sbjct: 493 RDADLAHRIGVATALEVRASGAHYTFAPCVAVCRDSRWGRCYECYGEDSEIVRKMTSIVS 552

Query: 181 GLQGDPPKGHPNGYPFLKGRKNAIACAKHFVGDGGTHKGVNEGNTISSYEDLERIHMAPY 240
           GLQG+PP GHPNGYPFL+GRKN IACAKHFVGDGGTHKG+NEGNTISSYEDLERIHMAPY
Sbjct: 553 GLQGEPPTGHPNGYPFLEGRKNVIACAKHFVGDGGTHKGINEGNTISSYEDLERIHMAPY 612

Query: 241 LDCISQGVCTVMASYTSWNERKLHADRFLLTEVLKDKLGFKGFVISDWEGIDRLCVPQGA 300
           LDCISQGVCTVMASY+SWN RKLHAD FLLTEVLKDKLGFKGFVISDWEGIDRLC PQGA
Sbjct: 613 LDCISQGVCTVMASYSSWNGRKLHADHFLLTEVLKDKLGFKGFVISDWEGIDRLCEPQGA 672

Query: 301 DFRFCITAAVNAGIDMVMVPFKYEKYLEDLTYLVDSKDIPLSRIDDAVERILRVKFVAGL 360
           D+RFCI+AAVNAGIDMVMVPF YEKYLED T+LV+SK+IPLSRIDDAVERILRVKFVAGL
Sbjct: 673 DYRFCISAAVNAGIDMVMVPFNYEKYLEDFTHLVESKEIPLSRIDDAVERILRVKFVAGL 732

Query: 361 FEHPYTDRSLLDTVGCKLHRELAREAVRKSLVLLKNGKDPKYPFLPLGRNARKVLVAGTH 420
           FEHPY DRSLLDTVGCKLHR+LAREAVRKSLVLLKNGKD + PFLPL R  +K+LVAGTH
Sbjct: 733 FEHPYADRSLLDTVGCKLHRDLAREAVRKSLVLLKNGKDSRSPFLPLDRGVKKILVAGTH 792

Query: 421 ANNLGYQCGGWTITWYGLSGRVTIGTTILDAIKQAVGDNTQVIYEEFPSRDTLAREDFDF 480
           A+NLGYQCGGWT+TWYGLSGRVTIGTTILDAIK AVGDNTQVIYEE PS D LAREDFDF
Sbjct: 793 ADNLGYQCGGWTVTWYGLSGRVTIGTTILDAIKDAVGDNTQVIYEEMPSSDMLAREDFDF 852

Query: 481 AIVAAGEAPYAEFTGDNSVLNVPLNGADVISAVADKIPTLVILVSGRPLVLEPWLLGKID 540
           AIVA GEAPYAEFTGDNS+LN+P+NGA+VISAVADKIPTLVILVSGRPLVLEPWLL K+D
Sbjct: 853 AIVAVGEAPYAEFTGDNSILNIPMNGANVISAVADKIPTLVILVSGRPLVLEPWLLDKMD 912

Query: 541 ALVAAFLPGSEGDGVTDVLFGDYEFEGLLPVTWFKSVDQLPMTAGQNSYDPLFPLGFGLK 600
           AL+AAFLPG+EG GVTDVLFGDYEFEGLLPVTWFKSVDQLP+ AG +SYDPLFPLGFG+K
Sbjct: 913 ALIAAFLPGTEGSGVTDVLFGDYEFEGLLPVTWFKSVDQLPINAGHSSYDPLFPLGFGMK 972

Query: 601 TNNGKHVI 609
           TN+ K VI
Sbjct: 973 TNDKKSVI 980

BLAST of Spo07204.1 vs. NCBI nr
Match: gi|870853233|gb|KMT05114.1| (hypothetical protein BVRB_7g172650 [Beta vulgaris subsp. vulgaris])

HSP 1 Score: 1124.4 bits (2907), Expect = 0.000e+0
Identity = 538/608 (88.49%), Postives = 574/608 (94.41%), Query Frame = 1

		  

Query: 1   MEENCIYRNPNSSIEERIKDLLSRMTLQEKLAQMAQIERTVASPTSLKDIGIGSVLSAGG 60
           ME NCIYRNPN+ IEERIKDLLSRMTLQEKLAQM QIER VAS +++KD+GIGS+LSAGG
Sbjct: 1   MEVNCIYRNPNAPIEERIKDLLSRMTLQEKLAQMTQIERRVASSSAIKDLGIGSILSAGG 60

Query: 61  SAPFDKAVSSDWADMIDNFQKLAMESRLGIPIIYGVDAVHGNNNVYGATIFPHNVNLGAT 120
           SAPFDKA+SSDWADMID FQKLAM+SRL IPIIYGVDAVHGNNNVYGATIFPHNVNLGAT
Sbjct: 61  SAPFDKALSSDWADMIDGFQKLAMDSRLAIPIIYGVDAVHGNNNVYGATIFPHNVNLGAT 120

Query: 121 RDANLAHRIGAATALEVRASGAHYTFAPCVAVCRDPRWGRCYECYSEDTEIVRKMTSIVS 180
           RDA+LAHRIG ATALEVRASGAHYTFAPCVAVCRD RWGRCYECY ED+EIVRKMTSIVS
Sbjct: 121 RDADLAHRIGVATALEVRASGAHYTFAPCVAVCRDSRWGRCYECYGEDSEIVRKMTSIVS 180

Query: 181 GLQGDPPKGHPNGYPFLKGRKNAIACAKHFVGDGGTHKGVNEGNTISSYEDLERIHMAPY 240
           GLQG+PP GHPNGYPFL+GRKN IACAKHFVGDGGTHKG+NEGNTISSYEDLERIHMAPY
Sbjct: 181 GLQGEPPTGHPNGYPFLEGRKNVIACAKHFVGDGGTHKGINEGNTISSYEDLERIHMAPY 240

Query: 241 LDCISQGVCTVMASYTSWNERKLHADRFLLTEVLKDKLGFKGFVISDWEGIDRLCVPQGA 300
           LDCISQGVCTVMASY+SWN RKLHAD FLLTEVLKDKLGFKGFVISDWEGIDRLC PQGA
Sbjct: 241 LDCISQGVCTVMASYSSWNGRKLHADHFLLTEVLKDKLGFKGFVISDWEGIDRLCEPQGA 300

Query: 301 DFRFCITAAVNAGIDMVMVPFKYEKYLEDLTYLVDSKDIPLSRIDDAVERILRVKFVAGL 360
           D+RFCI+AAVNAGIDMVMVPF YEKYLED T+LV+SK+IPLSRIDDAVERILRVKFVAGL
Sbjct: 301 DYRFCISAAVNAGIDMVMVPFNYEKYLEDFTHLVESKEIPLSRIDDAVERILRVKFVAGL 360

Query: 361 FEHPYTDRSLLDTVGCKLHRELAREAVRKSLVLLKNGKDPKYPFLPLGRNARKVLVAGTH 420
           FEHPY DRSLLDTVGCKLHR+LAREAVRKSLVLLKNGKD + PFLPL R  +K+LVAGTH
Sbjct: 361 FEHPYADRSLLDTVGCKLHRDLAREAVRKSLVLLKNGKDSRSPFLPLDRGVKKILVAGTH 420

Query: 421 ANNLGYQCGGWTITWYGLSGRVTIGTTILDAIKQAVGDNTQVIYEEFPSRDTLAREDFDF 480
           A+NLGYQCGGWT+TWYGLSGRVTIGTTILDAIK AVGDNTQVIYEE PS D LAREDFDF
Sbjct: 421 ADNLGYQCGGWTVTWYGLSGRVTIGTTILDAIKDAVGDNTQVIYEEMPSSDMLAREDFDF 480

Query: 481 AIVAAGEAPYAEFTGDNSVLNVPLNGADVISAVADKIPTLVILVSGRPLVLEPWLLGKID 540
           AIVA GEAPYAEFTGDNS+LN+P+NGA+VISAVADKIPTLVILVSGRPLVLEPWLL K+D
Sbjct: 481 AIVAVGEAPYAEFTGDNSILNIPMNGANVISAVADKIPTLVILVSGRPLVLEPWLLDKMD 540

Query: 541 ALVAAFLPGSEGDGVTDVLFGDYEFEGLLPVTWFKSVDQLPMTAGQNSYDPLFPLGFGLK 600
           AL+AAFLPG+EG GVTDVLFGDYEFEGLLPVTWFKSVDQLP+ AG +SYDPLFPLGFG+K
Sbjct: 541 ALIAAFLPGTEGSGVTDVLFGDYEFEGLLPVTWFKSVDQLPINAGHSSYDPLFPLGFGMK 600

Query: 601 TNNGKHVI 609
           TN+ K VI
Sbjct: 601 TNDKKSVI 608

BLAST of Spo07204.1 vs. NCBI nr
Match: gi|645216291|ref|XP_008220446.1| (PREDICTED: lysosomal beta glucosidase-like [Prunus mume])

HSP 1 Score: 998.8 bits (2581), Expect = 4.100e-288
Identity = 466/602 (77.41%), Postives = 540/602 (89.70%), Query Frame = 1

		  

Query: 4   NCIYRNPNSSIEERIKDLLSRMTLQEKLAQMAQIERTVASPTSLKDIGIGSVLSAGGSAP 63
           NCIYRNPN  IE R+KDLLSRMTL+EK+ QM QIER V++P +++D  IGSVLSAGGS P
Sbjct: 8   NCIYRNPNEPIEARVKDLLSRMTLKEKVGQMTQIERRVSTPDAIRDFSIGSVLSAGGSVP 67

Query: 64  FDKAVSSDWADMIDNFQKLAMESRLGIPIIYGVDAVHGNNNVYGATIFPHNVNLGATRDA 123
           F+KA+SSDWADM+D FQ+ A+ESRLGIP+IYG+DAVHGNN+VYGATIFPHNV LGATRDA
Sbjct: 68  FEKALSSDWADMVDGFQRSALESRLGIPLIYGIDAVHGNNSVYGATIFPHNVGLGATRDA 127

Query: 124 NLAHRIGAATALEVRASGAHYTFAPCVAVCRDPRWGRCYECYSEDTEIVRKMTSIVSGLQ 183
           +L  RIGAATALEVRASG  YTFAPCVAVCRDPRWGRCYE YSEDTEIVRKMTSIV+GLQ
Sbjct: 128 DLVKRIGAATALEVRASGIQYTFAPCVAVCRDPRWGRCYESYSEDTEIVRKMTSIVTGLQ 187

Query: 184 GDPPKGHPNGYPFLKGRKNAIACAKHFVGDGGTHKGVNEGNTISSYEDLERIHMAPYLDC 243
           G PP+G+P GYPF+ GR N IACAKHFVGDGGTHKG+NEGNTISSY+DLERIHMAPYL+C
Sbjct: 188 GQPPQGYPKGYPFVLGRNNTIACAKHFVGDGGTHKGLNEGNTISSYDDLERIHMAPYLNC 247

Query: 244 ISQGVCTVMASYTSWNERKLHADRFLLTEVLKDKLGFKGFVISDWEGIDRLCVPQGADFR 303
           IS GV TVMASY+SWN  KLHADRFLLTE+LKDKLGFKGFVISDWE +D+LC P+GAD+R
Sbjct: 248 ISDGVSTVMASYSSWNGSKLHADRFLLTEILKDKLGFKGFVISDWEALDQLCEPRGADYR 307

Query: 304 FCITAAVNAGIDMVMVPFKYEKYLEDLTYLVDSKDIPLSRIDDAVERILRVKFVAGLFEH 363
           FCI++AVNAGIDMVMVPF+YE++++DL YLV+  +I +SRIDDAVERILRVKFV+GLFEH
Sbjct: 308 FCISSAVNAGIDMVMVPFRYEQFVKDLVYLVEHGNISMSRIDDAVERILRVKFVSGLFEH 367

Query: 364 PYTDRSLLDTVGCKLHRELAREAVRKSLVLLKNGKDPKYPFLPLGRNARKVLVAGTHANN 423
           P++DRSLLD VGCKLHR+LAREAVRKSLVLLKNGKD + PFLPL R A+++LVAGTHA++
Sbjct: 368 PFSDRSLLDMVGCKLHRDLAREAVRKSLVLLKNGKDSRKPFLPLDRKAKRILVAGTHADD 427

Query: 424 LGYQCGGWTITWYGLSGRVTIGTTILDAIKQAVGDNTQVIYEEFPSRDTLAREDFDFAIV 483
           LGYQCGGWT TW G SGR+TIGTTIL+AIK+AVGD+T++IYE++PS DTLARED  FAI+
Sbjct: 428 LGYQCGGWTATWDGRSGRITIGTTILEAIKKAVGDDTEIIYEQYPSADTLAREDISFAIL 487

Query: 484 AAGEAPYAEFTGDNSVLNVPLNGADVISAVADKIPTLVILVSGRPLVLEPWLLGKIDALV 543
           A GE PYAEF GDN  L +P NG DVIS+VAD++PTLVIL+SGRPL LEPWLL K+DALV
Sbjct: 488 AVGEGPYAEFRGDNLELAIPFNGTDVISSVADRLPTLVILISGRPLTLEPWLLEKMDALV 547

Query: 544 AAFLPGSEGDGVTDVLFGDYEFEGLLPVTWFKSVDQLPMTAGQNSYDPLFPLGFGLKTNN 603
           AA+LPGSEG+G+ DV+FGDY+FEGLLPV+WFK V+QLPM A  NSYDPL+PLG+GL  N 
Sbjct: 548 AAWLPGSEGEGIADVIFGDYDFEGLLPVSWFKRVEQLPMNALDNSYDPLYPLGYGLTYNK 607

Query: 604 GK 606
           GK
Sbjct: 608 GK 609

BLAST of Spo07204.1 vs. NCBI nr
Match: gi|595835810|ref|XP_007207129.1| (hypothetical protein PRUPE_ppa003012mg [Prunus persica])

HSP 1 Score: 998.8 bits (2581), Expect = 4.100e-288
Identity = 464/602 (77.08%), Postives = 540/602 (89.70%), Query Frame = 1

		  

Query: 4   NCIYRNPNSSIEERIKDLLSRMTLQEKLAQMAQIERTVASPTSLKDIGIGSVLSAGGSAP 63
           NCIYRNPN  +E R+KDLLSRMTL+EK+ QM QIER V++P +++D  IGSVLSAGGS P
Sbjct: 8   NCIYRNPNEPVEARVKDLLSRMTLKEKVGQMTQIERRVSTPDAIRDFSIGSVLSAGGSVP 67

Query: 64  FDKAVSSDWADMIDNFQKLAMESRLGIPIIYGVDAVHGNNNVYGATIFPHNVNLGATRDA 123
           F+KA+SSDWADM+D FQ+ A+ESRLGIP+IYG+DAVHGNN+VYGATIFPHNV LGATRDA
Sbjct: 68  FEKALSSDWADMVDGFQRSALESRLGIPLIYGIDAVHGNNSVYGATIFPHNVGLGATRDA 127

Query: 124 NLAHRIGAATALEVRASGAHYTFAPCVAVCRDPRWGRCYECYSEDTEIVRKMTSIVSGLQ 183
           +L  RIGAATALEVRASG HYTFAPCVAVCRDPRWGRCYE YSEDTEIVRKMTSIV+GLQ
Sbjct: 128 DLVKRIGAATALEVRASGIHYTFAPCVAVCRDPRWGRCYESYSEDTEIVRKMTSIVTGLQ 187

Query: 184 GDPPKGHPNGYPFLKGRKNAIACAKHFVGDGGTHKGVNEGNTISSYEDLERIHMAPYLDC 243
           G PP+G+P GYPF+ GR N IACAKHFVGDGGTHKG+NEGNTISSY+DLERIHMAPYL+C
Sbjct: 188 GQPPQGYPKGYPFVLGRNNTIACAKHFVGDGGTHKGLNEGNTISSYDDLERIHMAPYLNC 247

Query: 244 ISQGVCTVMASYTSWNERKLHADRFLLTEVLKDKLGFKGFVISDWEGIDRLCVPQGADFR 303
           IS GV TVMASY+SWN  KLHADRFLLTE+LKDKLGFKGFVISDWE +D+LC P+GAD+R
Sbjct: 248 ISDGVSTVMASYSSWNGSKLHADRFLLTEILKDKLGFKGFVISDWEALDQLCEPRGADYR 307

Query: 304 FCITAAVNAGIDMVMVPFKYEKYLEDLTYLVDSKDIPLSRIDDAVERILRVKFVAGLFEH 363
           FCI++AVNAGIDMVMVPF+YE++++DL YLV+  +I +SRIDDAVERILRVKFV+GLFEH
Sbjct: 308 FCISSAVNAGIDMVMVPFRYEQFVKDLVYLVEHGNISMSRIDDAVERILRVKFVSGLFEH 367

Query: 364 PYTDRSLLDTVGCKLHRELAREAVRKSLVLLKNGKDPKYPFLPLGRNARKVLVAGTHANN 423
           P++DRSLLD VGCKLHR+LAREAVRKSLVLLKNGKD + PFLPL R A+++LVAGTHA++
Sbjct: 368 PFSDRSLLDMVGCKLHRDLAREAVRKSLVLLKNGKDSRKPFLPLDRKAKRILVAGTHADD 427

Query: 424 LGYQCGGWTITWYGLSGRVTIGTTILDAIKQAVGDNTQVIYEEFPSRDTLAREDFDFAIV 483
           LGYQCGGWT TW G SGR+T GTT+L+AI++AVGD+T++IYE++PS DTLARED  FAIV
Sbjct: 428 LGYQCGGWTATWDGRSGRITTGTTVLEAIQKAVGDDTEIIYEQYPSADTLAREDISFAIV 487

Query: 484 AAGEAPYAEFTGDNSVLNVPLNGADVISAVADKIPTLVILVSGRPLVLEPWLLGKIDALV 543
           A GE PYAEF GDN  L +P NG DVIS+VAD++PTLVIL+SGRPL LEPWLL K+DALV
Sbjct: 488 AVGEGPYAEFRGDNLELAIPFNGTDVISSVADRLPTLVILISGRPLTLEPWLLEKMDALV 547

Query: 544 AAFLPGSEGDGVTDVLFGDYEFEGLLPVTWFKSVDQLPMTAGQNSYDPLFPLGFGLKTNN 603
           AA+LPGSEG+G+ DV+FGDY+FEGLLPV+WFK V+QLPM A  NSYDPL+PLG+GL  N 
Sbjct: 548 AAWLPGSEGEGIADVIFGDYDFEGLLPVSWFKRVEQLPMNALDNSYDPLYPLGYGLTYNK 607

Query: 604 GK 606
           GK
Sbjct: 608 GK 609

BLAST of Spo07204.1 vs. UniProtKB/TrEMBL
Match: A0A0K9R8W1_SPIOL (Uncharacterized protein OS=Spinacia oleracea GN=SOVF_093700 PE=3 SV=1)

HSP 1 Score: 1239.9 bits (3207), Expect = 0.000e+0
Identity = 607/608 (99.84%), Postives = 607/608 (99.84%), Query Frame = 1

		  

Query: 1   MEENCIYRNPNSSIEERIKDLLSRMTLQEKLAQMAQIERTVASPTSLKDIGIGSVLSAGG 60
           MEENCIYRNPNSSIEERIKDLLSRMTLQEKLAQMAQIERTVASPTSLKDIGIGSVLSAGG
Sbjct: 1   MEENCIYRNPNSSIEERIKDLLSRMTLQEKLAQMAQIERTVASPTSLKDIGIGSVLSAGG 60

Query: 61  SAPFDKAVSSDWADMIDNFQKLAMESRLGIPIIYGVDAVHGNNNVYGATIFPHNVNLGAT 120
           SAPFDKAVSSDWADMIDNFQKLAMESRLGIPIIYGVDAVHGNNNVYGATIFPHNVNLGAT
Sbjct: 61  SAPFDKAVSSDWADMIDNFQKLAMESRLGIPIIYGVDAVHGNNNVYGATIFPHNVNLGAT 120

Query: 121 RDANLAHRIGAATALEVRASGAHYTFAPCVAVCRDPRWGRCYECYSEDTEIVRKMTSIVS 180
           RDANLAHRIGAATALEVRASGAHYTFAPCVAVCRDPRWGRCYECYSEDTEIVRKMTSIVS
Sbjct: 121 RDANLAHRIGAATALEVRASGAHYTFAPCVAVCRDPRWGRCYECYSEDTEIVRKMTSIVS 180

Query: 181 GLQGDPPKGHPNGYPFLKGRKNAIACAKHFVGDGGTHKGVNEGNTISSYEDLERIHMAPY 240
           GLQGDPPKGHPNGYPFLKGRKNAIACAKHFVGDGGTHKGVNEGNTISSYEDLERIHMAPY
Sbjct: 181 GLQGDPPKGHPNGYPFLKGRKNAIACAKHFVGDGGTHKGVNEGNTISSYEDLERIHMAPY 240

Query: 241 LDCISQGVCTVMASYTSWNERKLHADRFLLTEVLKDKLGFKGFVISDWEGIDRLCVPQGA 300
           LDCISQGVCTVMASYTSWNERKLHADRFLLTEVLKDKLGFKGFVISDWEGIDRLCVPQGA
Sbjct: 241 LDCISQGVCTVMASYTSWNERKLHADRFLLTEVLKDKLGFKGFVISDWEGIDRLCVPQGA 300

Query: 301 DFRFCITAAVNAGIDMVMVPFKYEKYLEDLTYLVDSKDIPLSRIDDAVERILRVKFVAGL 360
           DFRF ITAAVNAGIDMVMVPFKYEKYLEDLTYLVDSKDIPLSRIDDAVERILRVKFVAGL
Sbjct: 301 DFRFGITAAVNAGIDMVMVPFKYEKYLEDLTYLVDSKDIPLSRIDDAVERILRVKFVAGL 360

Query: 361 FEHPYTDRSLLDTVGCKLHRELAREAVRKSLVLLKNGKDPKYPFLPLGRNARKVLVAGTH 420
           FEHPYTDRSLLDTVGCKLHRELAREAVRKSLVLLKNGKDPKYPFLPLGRNARKVLVAGTH
Sbjct: 361 FEHPYTDRSLLDTVGCKLHRELAREAVRKSLVLLKNGKDPKYPFLPLGRNARKVLVAGTH 420

Query: 421 ANNLGYQCGGWTITWYGLSGRVTIGTTILDAIKQAVGDNTQVIYEEFPSRDTLAREDFDF 480
           ANNLGYQCGGWTITWYGLSGRVTIGTTILDAIKQAVGDNTQVIYEEFPSRDTLAREDFDF
Sbjct: 421 ANNLGYQCGGWTITWYGLSGRVTIGTTILDAIKQAVGDNTQVIYEEFPSRDTLAREDFDF 480

Query: 481 AIVAAGEAPYAEFTGDNSVLNVPLNGADVISAVADKIPTLVILVSGRPLVLEPWLLGKID 540
           AIVAAGEAPYAEFTGDNSVLNVPLNGADVISAVADKIPTLVILVSGRPLVLEPWLLGKID
Sbjct: 481 AIVAAGEAPYAEFTGDNSVLNVPLNGADVISAVADKIPTLVILVSGRPLVLEPWLLGKID 540

Query: 541 ALVAAFLPGSEGDGVTDVLFGDYEFEGLLPVTWFKSVDQLPMTAGQNSYDPLFPLGFGLK 600
           ALVAAFLPGSEGDGVTDVLFGDYEFEGLLPVTWFKSVDQLPMTAGQNSYDPLFPLGFGLK
Sbjct: 541 ALVAAFLPGSEGDGVTDVLFGDYEFEGLLPVTWFKSVDQLPMTAGQNSYDPLFPLGFGLK 600

Query: 601 TNNGKHVI 609
           TNNGKHVI
Sbjct: 601 TNNGKHVI 608

BLAST of Spo07204.1 vs. UniProtKB/TrEMBL
Match: A0A0J8BYP9_BETVU (Uncharacterized protein OS=Beta vulgaris subsp. vulgaris GN=BVRB_7g172650 PE=3 SV=1)

HSP 1 Score: 1124.4 bits (2907), Expect = 0.000e+0
Identity = 538/608 (88.49%), Postives = 574/608 (94.41%), Query Frame = 1

		  

Query: 1   MEENCIYRNPNSSIEERIKDLLSRMTLQEKLAQMAQIERTVASPTSLKDIGIGSVLSAGG 60
           ME NCIYRNPN+ IEERIKDLLSRMTLQEKLAQM QIER VAS +++KD+GIGS+LSAGG
Sbjct: 1   MEVNCIYRNPNAPIEERIKDLLSRMTLQEKLAQMTQIERRVASSSAIKDLGIGSILSAGG 60

Query: 61  SAPFDKAVSSDWADMIDNFQKLAMESRLGIPIIYGVDAVHGNNNVYGATIFPHNVNLGAT 120
           SAPFDKA+SSDWADMID FQKLAM+SRL IPIIYGVDAVHGNNNVYGATIFPHNVNLGAT
Sbjct: 61  SAPFDKALSSDWADMIDGFQKLAMDSRLAIPIIYGVDAVHGNNNVYGATIFPHNVNLGAT 120

Query: 121 RDANLAHRIGAATALEVRASGAHYTFAPCVAVCRDPRWGRCYECYSEDTEIVRKMTSIVS 180
           RDA+LAHRIG ATALEVRASGAHYTFAPCVAVCRD RWGRCYECY ED+EIVRKMTSIVS
Sbjct: 121 RDADLAHRIGVATALEVRASGAHYTFAPCVAVCRDSRWGRCYECYGEDSEIVRKMTSIVS 180

Query: 181 GLQGDPPKGHPNGYPFLKGRKNAIACAKHFVGDGGTHKGVNEGNTISSYEDLERIHMAPY 240
           GLQG+PP GHPNGYPFL+GRKN IACAKHFVGDGGTHKG+NEGNTISSYEDLERIHMAPY
Sbjct: 181 GLQGEPPTGHPNGYPFLEGRKNVIACAKHFVGDGGTHKGINEGNTISSYEDLERIHMAPY 240

Query: 241 LDCISQGVCTVMASYTSWNERKLHADRFLLTEVLKDKLGFKGFVISDWEGIDRLCVPQGA 300
           LDCISQGVCTVMASY+SWN RKLHAD FLLTEVLKDKLGFKGFVISDWEGIDRLC PQGA
Sbjct: 241 LDCISQGVCTVMASYSSWNGRKLHADHFLLTEVLKDKLGFKGFVISDWEGIDRLCEPQGA 300

Query: 301 DFRFCITAAVNAGIDMVMVPFKYEKYLEDLTYLVDSKDIPLSRIDDAVERILRVKFVAGL 360
           D+RFCI+AAVNAGIDMVMVPF YEKYLED T+LV+SK+IPLSRIDDAVERILRVKFVAGL
Sbjct: 301 DYRFCISAAVNAGIDMVMVPFNYEKYLEDFTHLVESKEIPLSRIDDAVERILRVKFVAGL 360

Query: 361 FEHPYTDRSLLDTVGCKLHRELAREAVRKSLVLLKNGKDPKYPFLPLGRNARKVLVAGTH 420
           FEHPY DRSLLDTVGCKLHR+LAREAVRKSLVLLKNGKD + PFLPL R  +K+LVAGTH
Sbjct: 361 FEHPYADRSLLDTVGCKLHRDLAREAVRKSLVLLKNGKDSRSPFLPLDRGVKKILVAGTH 420

Query: 421 ANNLGYQCGGWTITWYGLSGRVTIGTTILDAIKQAVGDNTQVIYEEFPSRDTLAREDFDF 480
           A+NLGYQCGGWT+TWYGLSGRVTIGTTILDAIK AVGDNTQVIYEE PS D LAREDFDF
Sbjct: 421 ADNLGYQCGGWTVTWYGLSGRVTIGTTILDAIKDAVGDNTQVIYEEMPSSDMLAREDFDF 480

Query: 481 AIVAAGEAPYAEFTGDNSVLNVPLNGADVISAVADKIPTLVILVSGRPLVLEPWLLGKID 540
           AIVA GEAPYAEFTGDNS+LN+P+NGA+VISAVADKIPTLVILVSGRPLVLEPWLL K+D
Sbjct: 481 AIVAVGEAPYAEFTGDNSILNIPMNGANVISAVADKIPTLVILVSGRPLVLEPWLLDKMD 540

Query: 541 ALVAAFLPGSEGDGVTDVLFGDYEFEGLLPVTWFKSVDQLPMTAGQNSYDPLFPLGFGLK 600
           AL+AAFLPG+EG GVTDVLFGDYEFEGLLPVTWFKSVDQLP+ AG +SYDPLFPLGFG+K
Sbjct: 541 ALIAAFLPGTEGSGVTDVLFGDYEFEGLLPVTWFKSVDQLPINAGHSSYDPLFPLGFGMK 600

Query: 601 TNNGKHVI 609
           TN+ K VI
Sbjct: 601 TNDKKSVI 608

BLAST of Spo07204.1 vs. UniProtKB/TrEMBL
Match: M5WGE3_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa003012mg PE=3 SV=1)

HSP 1 Score: 998.8 bits (2581), Expect = 2.800e-288
Identity = 464/602 (77.08%), Postives = 540/602 (89.70%), Query Frame = 1

		  

Query: 4   NCIYRNPNSSIEERIKDLLSRMTLQEKLAQMAQIERTVASPTSLKDIGIGSVLSAGGSAP 63
           NCIYRNPN  +E R+KDLLSRMTL+EK+ QM QIER V++P +++D  IGSVLSAGGS P
Sbjct: 8   NCIYRNPNEPVEARVKDLLSRMTLKEKVGQMTQIERRVSTPDAIRDFSIGSVLSAGGSVP 67

Query: 64  FDKAVSSDWADMIDNFQKLAMESRLGIPIIYGVDAVHGNNNVYGATIFPHNVNLGATRDA 123
           F+KA+SSDWADM+D FQ+ A+ESRLGIP+IYG+DAVHGNN+VYGATIFPHNV LGATRDA
Sbjct: 68  FEKALSSDWADMVDGFQRSALESRLGIPLIYGIDAVHGNNSVYGATIFPHNVGLGATRDA 127

Query: 124 NLAHRIGAATALEVRASGAHYTFAPCVAVCRDPRWGRCYECYSEDTEIVRKMTSIVSGLQ 183
           +L  RIGAATALEVRASG HYTFAPCVAVCRDPRWGRCYE YSEDTEIVRKMTSIV+GLQ
Sbjct: 128 DLVKRIGAATALEVRASGIHYTFAPCVAVCRDPRWGRCYESYSEDTEIVRKMTSIVTGLQ 187

Query: 184 GDPPKGHPNGYPFLKGRKNAIACAKHFVGDGGTHKGVNEGNTISSYEDLERIHMAPYLDC 243
           G PP+G+P GYPF+ GR N IACAKHFVGDGGTHKG+NEGNTISSY+DLERIHMAPYL+C
Sbjct: 188 GQPPQGYPKGYPFVLGRNNTIACAKHFVGDGGTHKGLNEGNTISSYDDLERIHMAPYLNC 247

Query: 244 ISQGVCTVMASYTSWNERKLHADRFLLTEVLKDKLGFKGFVISDWEGIDRLCVPQGADFR 303
           IS GV TVMASY+SWN  KLHADRFLLTE+LKDKLGFKGFVISDWE +D+LC P+GAD+R
Sbjct: 248 ISDGVSTVMASYSSWNGSKLHADRFLLTEILKDKLGFKGFVISDWEALDQLCEPRGADYR 307

Query: 304 FCITAAVNAGIDMVMVPFKYEKYLEDLTYLVDSKDIPLSRIDDAVERILRVKFVAGLFEH 363
           FCI++AVNAGIDMVMVPF+YE++++DL YLV+  +I +SRIDDAVERILRVKFV+GLFEH
Sbjct: 308 FCISSAVNAGIDMVMVPFRYEQFVKDLVYLVEHGNISMSRIDDAVERILRVKFVSGLFEH 367

Query: 364 PYTDRSLLDTVGCKLHRELAREAVRKSLVLLKNGKDPKYPFLPLGRNARKVLVAGTHANN 423
           P++DRSLLD VGCKLHR+LAREAVRKSLVLLKNGKD + PFLPL R A+++LVAGTHA++
Sbjct: 368 PFSDRSLLDMVGCKLHRDLAREAVRKSLVLLKNGKDSRKPFLPLDRKAKRILVAGTHADD 427

Query: 424 LGYQCGGWTITWYGLSGRVTIGTTILDAIKQAVGDNTQVIYEEFPSRDTLAREDFDFAIV 483
           LGYQCGGWT TW G SGR+T GTT+L+AI++AVGD+T++IYE++PS DTLARED  FAIV
Sbjct: 428 LGYQCGGWTATWDGRSGRITTGTTVLEAIQKAVGDDTEIIYEQYPSADTLAREDISFAIV 487

Query: 484 AAGEAPYAEFTGDNSVLNVPLNGADVISAVADKIPTLVILVSGRPLVLEPWLLGKIDALV 543
           A GE PYAEF GDN  L +P NG DVIS+VAD++PTLVIL+SGRPL LEPWLL K+DALV
Sbjct: 488 AVGEGPYAEFRGDNLELAIPFNGTDVISSVADRLPTLVILISGRPLTLEPWLLEKMDALV 547

Query: 544 AAFLPGSEGDGVTDVLFGDYEFEGLLPVTWFKSVDQLPMTAGQNSYDPLFPLGFGLKTNN 603
           AA+LPGSEG+G+ DV+FGDY+FEGLLPV+WFK V+QLPM A  NSYDPL+PLG+GL  N 
Sbjct: 548 AAWLPGSEGEGIADVIFGDYDFEGLLPVSWFKRVEQLPMNALDNSYDPLYPLGYGLTYNK 607

Query: 604 GK 606
           GK
Sbjct: 608 GK 609

BLAST of Spo07204.1 vs. UniProtKB/TrEMBL
Match: A0A061GZX5_THECC (Glycosyl hydrolase family protein isoform 1 OS=Theobroma cacao GN=TCM_041670 PE=3 SV=1)

HSP 1 Score: 989.6 bits (2557), Expect = 1.700e-285
Identity = 459/604 (75.99%), Postives = 538/604 (89.07%), Query Frame = 1

		  

Query: 4   NCIYRNPNSSIEERIKDLLSRMTLQEKLAQMAQIERTVASPTSLKDIGIGSVLSAGGSAP 63
           +C+Y+NPN+ IE+R+KDLLSRMTLQEK+ QM QIER VA P++LKD  IGS+LSAGGS P
Sbjct: 2   DCVYKNPNAPIEDRVKDLLSRMTLQEKIGQMTQIERRVADPSALKDFSIGSILSAGGSGP 61

Query: 64  FDKAVSSDWADMIDNFQKLAMESRLGIPIIYGVDAVHGNNNVYGATIFPHNVNLGATRDA 123
           F+ A+SSDWADM+D FQ+ A+ESRLGIP+IYG+DAVHGNN+VYGATIFPHNV LGATRDA
Sbjct: 62  FENALSSDWADMVDRFQQAALESRLGIPLIYGIDAVHGNNSVYGATIFPHNVGLGATRDA 121

Query: 124 NLAHRIGAATALEVRASGAHYTFAPCVAVCRDPRWGRCYECYSEDTEIVRKMTSIVSGLQ 183
           +LA RIG ATALEVRASG  YTFAPCV VCRDPRWGRCYE YSEDT  VRKMTSIV+GLQ
Sbjct: 122 DLAQRIGTATALEVRASGIQYTFAPCVTVCRDPRWGRCYESYSEDTNSVRKMTSIVTGLQ 181

Query: 184 GDPPKGHPNGYPFLKGRKNAIACAKHFVGDGGTHKGVNEGNTISSYEDLERIHMAPYLDC 243
           G PP GHP GYPF+ GR N IACAKHFVGDGGT KG+NEGNTI SY+DLERIHMAPYLDC
Sbjct: 182 GQPPVGHPKGYPFVAGRNNVIACAKHFVGDGGTEKGINEGNTILSYDDLERIHMAPYLDC 241

Query: 244 ISQGVCTVMASYTSWNERKLHADRFLLTEVLKDKLGFKGFVISDWEGIDRLCVPQGADFR 303
           ISQGV T+MAS++SWN RKLHAD FLLTE+LKDKLGFKGFVISDWE +D+LC PQG++ R
Sbjct: 242 ISQGVSTIMASFSSWNGRKLHADHFLLTEILKDKLGFKGFVISDWEALDQLCEPQGSNNR 301

Query: 304 FCITAAVNAGIDMVMVPFKYEKYLEDLTYLVDSKDIPLSRIDDAVERILRVKFVAGLFEH 363
           +CI++AVNAGIDMVMVPFKY++++EDL +LV+S ++ +SRIDDAVERILRVKFV+GLFEH
Sbjct: 302 YCISSAVNAGIDMVMVPFKYKQFVEDLAFLVESGEVQMSRIDDAVERILRVKFVSGLFEH 361

Query: 364 PYTDRSLLDTVGCKLHRELAREAVRKSLVLLKNGKDPKYPFLPLGRNARKVLVAGTHANN 423
           P++DRSLLD VGCKLHRELAREAVRKSLVLLKNGK+P+ PFLPL +NA+++LVAGTHA++
Sbjct: 362 PFSDRSLLDIVGCKLHRELAREAVRKSLVLLKNGKNPENPFLPLDKNAKRILVAGTHADD 421

Query: 424 LGYQCGGWTITWYGLSGRVTIGTTILDAIKQAVGDNTQVIYEEFPSRDTLAREDFDFAIV 483
           LGYQCGGWT TW+G SGR+TIGTTILDAI++AVGD T+VIY+++PS D+LA ++F FAIV
Sbjct: 422 LGYQCGGWTGTWHGCSGRITIGTTILDAIREAVGDKTEVIYDQYPSPDSLAGKNFSFAIV 481

Query: 484 AAGEAPYAEFTGDNSVLNVPLNGADVISAVADKIPTLVILVSGRPLVLEPWLLGKIDALV 543
             GE PYAE  GDN+ L +P NG+D+IS+VADKIPTL IL+SGRPLVLEPWLL K+DALV
Sbjct: 482 VVGEPPYAETLGDNAELVIPFNGSDIISSVADKIPTLAILISGRPLVLEPWLLEKVDALV 541

Query: 544 AAFLPGSEGDGVTDVLFGDYEFEGLLPVTWFKSVDQLPMTAGQNSYDPLFPLGFGLKTNN 603
           AA+ PGSEG GVTDV+FGD+EFEG LP+TWF+S++QLPM AG NSYDPLFPLGFGL  N 
Sbjct: 542 AAWFPGSEGGGVTDVVFGDFEFEGRLPMTWFRSINQLPMNAGHNSYDPLFPLGFGLTCNK 601

Query: 604 GKHV 608
            K V
Sbjct: 602 EKSV 605

BLAST of Spo07204.1 vs. UniProtKB/TrEMBL
Match: A0A061GWE0_THECC (Glycosyl hydrolase family protein isoform 3 OS=Theobroma cacao GN=TCM_041670 PE=3 SV=1)

HSP 1 Score: 984.9 bits (2545), Expect = 4.200e-284
Identity = 459/605 (75.87%), Postives = 538/605 (88.93%), Query Frame = 1

		  

Query: 4   NCIYRNPNSSIEERIKDLLSRMTLQEKLAQMAQIERTVASPTSLKDIGIGSVLSAGGSAP 63
           +C+Y+NPN+ IE+R+KDLLSRMTLQEK+ QM QIER VA P++LKD  IGS+LSAGGS P
Sbjct: 2   DCVYKNPNAPIEDRVKDLLSRMTLQEKIGQMTQIERRVADPSALKDFSIGSILSAGGSGP 61

Query: 64  FDKAVSSDWADMIDNFQKLAMESRLGIPIIYGVDAVHGNNNVYGATIFPHNVNLGATRDA 123
           F+ A+SSDWADM+D FQ+ A+ESRLGIP+IYG+DAVHGNN+VYGATIFPHNV LGATRDA
Sbjct: 62  FENALSSDWADMVDRFQQAALESRLGIPLIYGIDAVHGNNSVYGATIFPHNVGLGATRDA 121

Query: 124 NLAHRIGAATALEVRASGAHYTFAPCVAVCRDPRWGRCYECYSEDTEIVRKMTSIVSGLQ 183
           +LA RIG ATALEVRASG  YTFAPCV VCRDPRWGRCYE YSEDT  VRKMTSIV+GLQ
Sbjct: 122 DLAQRIGTATALEVRASGIQYTFAPCVTVCRDPRWGRCYESYSEDTNSVRKMTSIVTGLQ 181

Query: 184 GDPPKGHPNGYPFLKGRKNAIACAKHFVGDGGTHKGVNEGNTISSYEDLERIHMAPYLDC 243
           G PP GHP GYPF+ GR N IACAKHFVGDGGT KG+NEGNTI SY+DLERIHMAPYLDC
Sbjct: 182 GQPPVGHPKGYPFVAGRNNVIACAKHFVGDGGTEKGINEGNTILSYDDLERIHMAPYLDC 241

Query: 244 ISQGVCTVMASYTSWNERKLHADRFLLTEVLKDKLGFKGFVISDWEGIDRLCVPQGADFR 303
           ISQGV T+MAS++SWN RKLHAD FLLTE+LKDKLGFKGFVISDWE +D+LC PQG++ R
Sbjct: 242 ISQGVSTIMASFSSWNGRKLHADHFLLTEILKDKLGFKGFVISDWEALDQLCEPQGSNNR 301

Query: 304 FCITAAVNAGIDM-VMVPFKYEKYLEDLTYLVDSKDIPLSRIDDAVERILRVKFVAGLFE 363
           +CI++AVNAGIDM VMVPFKY++++EDL +LV+S ++ +SRIDDAVERILRVKFV+GLFE
Sbjct: 302 YCISSAVNAGIDMVVMVPFKYKQFVEDLAFLVESGEVQMSRIDDAVERILRVKFVSGLFE 361

Query: 364 HPYTDRSLLDTVGCKLHRELAREAVRKSLVLLKNGKDPKYPFLPLGRNARKVLVAGTHAN 423
           HP++DRSLLD VGCKLHRELAREAVRKSLVLLKNGK+P+ PFLPL +NA+++LVAGTHA+
Sbjct: 362 HPFSDRSLLDIVGCKLHRELAREAVRKSLVLLKNGKNPENPFLPLDKNAKRILVAGTHAD 421

Query: 424 NLGYQCGGWTITWYGLSGRVTIGTTILDAIKQAVGDNTQVIYEEFPSRDTLAREDFDFAI 483
           +LGYQCGGWT TW+G SGR+TIGTTILDAI++AVGD T+VIY+++PS D+LA ++F FAI
Sbjct: 422 DLGYQCGGWTGTWHGCSGRITIGTTILDAIREAVGDKTEVIYDQYPSPDSLAGKNFSFAI 481

Query: 484 VAAGEAPYAEFTGDNSVLNVPLNGADVISAVADKIPTLVILVSGRPLVLEPWLLGKIDAL 543
           V  GE PYAE  GDN+ L +P NG+D+IS+VADKIPTL IL+SGRPLVLEPWLL K+DAL
Sbjct: 482 VVVGEPPYAETLGDNAELVIPFNGSDIISSVADKIPTLAILISGRPLVLEPWLLEKVDAL 541

Query: 544 VAAFLPGSEGDGVTDVLFGDYEFEGLLPVTWFKSVDQLPMTAGQNSYDPLFPLGFGLKTN 603
           VAA+ PGSEG GVTDV+FGD+EFEG LP+TWF+S++QLPM AG NSYDPLFPLGFGL  N
Sbjct: 542 VAAWFPGSEGGGVTDVVFGDFEFEGRLPMTWFRSINQLPMNAGHNSYDPLFPLGFGLTCN 601

Query: 604 NGKHV 608
             K V
Sbjct: 602 KEKSV 606

BLAST of Spo07204.1 vs. ExPASy Swiss-Prot
Match: BGH3B_BACO1 (Beta-glucosidase BoGH3B OS=Bacteroides ovatus (strain ATCC 8483 / DSM 1896 / JCM 5824 / NCTC 11153) GN=BACOVA_02659 PE=1 SV=1)

HSP 1 Score: 287.7 bits (735), Expect = 2.900e-76
Identity = 208/652 (31.90%), Postives = 318/652 (48.77%), Query Frame = 1

		  

Query: 13  SIEERIKDLLSRMTLQEKLAQMAQIERTVASP--TSLKD------------IG---IGSV 72
           +IE  I++ L +MTL++K+ QM +I   V S   TS K             IG   +GS+
Sbjct: 34  AIETHIREWLQKMTLEQKIGQMCEITIDVVSDLETSRKKGFCLSEAMLDTVIGKYKVGSL 93

Query: 73  LSAGGSAPFDKAVSSD-WADMIDNFQKLAMESRLGIPIIYGVDAVHGNNNVYGATIFPHN 132
           L+     P   A   + WA+ I   Q+ +M+  +GIP IYGVD +HG       T+FP  
Sbjct: 94  LNV----PLGVAQKKEKWAEAIKQIQEKSMKE-IGIPCIYGVDQIHGTTYTLDGTMFPQG 153

Query: 133 VNLGATRDANLAHRIGAATALEVRASGAHYTFAPCVAVCRDPRWGRCYECYSEDTEIVRK 192
           +N+GAT +  L  R    +A E +A    +TFAP V + RDPRW R +E Y ED  +  +
Sbjct: 154 INMGATFNRELTRRGAKISAYETKAGCIPWTFAPVVDLGRDPRWARMWENYGEDCYVNAE 213

Query: 193 M-TSIVSGLQGDPPKGHPNGYPFLKGRKNAIACAKHFVGDGGTHKGVNEGNTISSYEDLE 252
           M  S V G QG+ P           G  N  AC KH++G G    G +   +  S  D+ 
Sbjct: 214 MGVSAVKGFQGEDPN--------RIGEYNVAACMKHYMGYGVPVSGKDRTPSSISRSDMR 273

Query: 253 RIHMAPYLDCISQGVCTVMASYTSWNERKLHADRFLLTEVLKDKLGFKGFVISDWEGIDR 312
             H AP+L  + QG  +VM +    N    HA+R LLTE LK+ L + G +++DW  I+ 
Sbjct: 274 EKHFAPFLAAVRQGALSVMVNSGVDNGLPFHANRELLTEWLKEDLNWDGLIVTDWADINN 333

Query: 313 LCVPQ--GADFRFCITAAVNAGIDMVMVPFKYEKYLEDLTYLVDSKDIPLSRIDDAVERI 372
           LC      A  +  +   +NAGIDM MVP++   + + L  LV+  ++ + RIDDAV R+
Sbjct: 334 LCTRDHIAATKKEAVKIVINAGIDMSMVPYEV-SFCDYLKELVEEGEVSMERIDDAVARV 393

Query: 373 LRVKFVAGLFEHPYTDRSLLDTVGCKLHRELAREAVRKSLVLLKNGKDPKYPFLPLGRNA 432
           LR+K+  GLF+HPY D    D  G K    +A +A  +S VLLKN  +     LP+ +  
Sbjct: 394 LRLKYRLGLFDHPYWDIKKYDKFGSKEFAAVALQAAEESEVLLKNDGN----ILPIAK-G 453

Query: 433 RKVLVAGTHANNLGYQCGGWTITWYG--LSGRVTIGTTILDAIKQAVGDNTQVIYEEFPS 492
           +K+L+ G +AN++    GGW+ +W G           TI +A+ +  G    +IYE   +
Sbjct: 454 KKILLTGPNANSMRCLNGGWSYSWQGHVADEYAQAYHTIYEALCEKYG-KENIIYEPGVT 513

Query: 493 RDTLAREDF--------DFAIVAAGEAP-YAEFTGDNSVLNVPLNGAD----------VI 552
             +   +++        +  + AA +A       G+NS    P N  D          V 
Sbjct: 514 YASYKNDNWWEENKPETEKPVAAAAQADIIITCIGENSYCETPGNLTDLTLSENQRNLVK 573

Query: 553 SAVADKIPTLVILVSGRPLVLEPWLLGKIDALVAAFLPGS-EGDGVTDVLFGDYEFEGLL 607
           +  A   P +++L  GRP ++   ++    A+V   LP +  GD + ++L GD  F G +
Sbjct: 574 ALAATGKPIVLVLNQGRPRIIND-IVPLAKAVVNIMLPSNYGGDALANLLAGDANFSGKM 633

BLAST of Spo07204.1 vs. ExPASy Swiss-Prot
Match: GLUA_DICDI (Lysosomal beta glucosidase OS=Dictyostelium discoideum GN=gluA PE=1 SV=2)

HSP 1 Score: 239.2 bits (609), Expect = 1.200e-61
Identity = 189/633 (29.86%), Postives = 296/633 (46.76%), Query Frame = 1

		  

Query: 18  IKDLLSRMTLQEKLAQMAQIE-RTVASPTSL-----------KDIGIGSVL----SAGGS 77
           + +L+S+M++ EK+ QM Q++  T+ SP ++           K   IGS L    S G +
Sbjct: 80  VDNLMSKMSITEKIGQMTQLDITTLTSPNTITINETTLAYYAKTYYIGSYLNSPVSGGLA 139

Query: 78  APFDKAVSSDWADMIDNFQKLAME-SRLGIPIIYGVDAVHGNNNVYGATIFPHNVNLGAT 137
                  SS W DMI+  Q + +E S   IP+IYG+D+VHG N V+ AT+FPHN  L AT
Sbjct: 140 GDIHHINSSVWLDMINTIQTIVIEGSPNKIPMIYGLDSVHGANYVHKATLFPHNTGLAAT 199

Query: 138 RDANLAHRIGAATALEVRASGAHYTFAPCVAVCRDPRWGRCYECYSEDTEIVRKM-TSIV 197
            +   A      T+ +  A G  + FAP + +   P W R YE + ED  +   M  + V
Sbjct: 200 FNIEHATTAAQITSKDTVAVGIPWVFAPVLGIGVQPLWSRIYETFGEDPYVASMMGAAAV 259

Query: 198 SGLQGDPPKGHPNGYPFLKGRKNAIACAKHFVGDGGTHKGVNEGNTISSYEDLERIHMAP 257
            G QG       N +       +A+  AKH+ G      G +          L R  +  
Sbjct: 260 RGFQGG-----NNSFDGPINAPSAVCTAKHYFGYSDPTSGKDRTAAWIPERMLRRYFLPS 319

Query: 258 YLDCIS-QGVCTVMASYTSWNERKLHADRFLLTEVLKDKLGFKGFVISDWEGIDRLCV-- 317
           + + I+  G  T+M +    N   +H     LTEVL+ +L F+G  ++DW+ I++L    
Sbjct: 320 FAEAITGAGAGTIMINSGEVNGVPMHTSYKYLTEVLRGELQFEGVAVTDWQDIEKLVYFH 379

Query: 318 PQGADFRFCITAAVNAGIDMVMVPFKYEKYLEDLTYLVDSKDIPLSRIDDAVERILRVKF 377
                    I  A++AGIDM MVP     +   L  +V +  +P SR+D +V RIL +K+
Sbjct: 380 HTAGSAEEAILQALDAGIDMSMVPLDL-SFPIILAEMVAAGTVPESRLDLSVRRILNLKY 439

Query: 378 VAGLFEHPY--TDRSLLDTVGCKLHRELAREAVRKSLVLLKNGKDPKYPFLPLGRNA-RK 437
             GLF +PY   + +++DT+G    RE A     +S+ LL+N    K   LPL  N  + 
Sbjct: 440 ALGLFSNPYPNPNAAIVDTIGQVQDREAAAATAEESITLLQN----KNNILPLNTNTIKN 499

Query: 438 VLVAGTHANNLGYQCGGWTITWYGL--SGRVTIGTTILDAIKQAVGDNTQVIYEEFPSRD 497
           VL+ G  A+++    GGW++ W G         GT+IL  +++   D       +F  + 
Sbjct: 500 VLLTGPSADSIRNLNGGWSVHWQGAYEDSEFPFGTSILTGLREITNDTA-----DFNIQY 559

Query: 498 TLARE--------DFDFAIVAAGEAPYA----------EFTGDNSVLNVPLNGADVISAV 557
           T+  E          D A+  A  +             E  GD   L++  N   ++  +
Sbjct: 560 TIGHEIGVPTNQTSIDEAVELAQSSDVVVVVIGELPEAETPGDIYDLSMDPNEVLLLQQL 619

Query: 558 ADK-IPTLVILVSGRPLVLEPWLLGKIDALVAAFLPGSE-GDGVTDVLFGDYEFEGLLPV 600
            D   P ++ILV  RP +L P L+    A++ A+LPGSE G  + ++L G+    G LP+
Sbjct: 620 VDTGKPVVLILVEARPRILPPDLVYSCAAVLMAYLPGSEGGKPIANILMGNVNPSGRLPL 679

BLAST of Spo07204.1 vs. ExPASy Swiss-Prot
Match: BGLX_SALTY (Periplasmic beta-glucosidase OS=Salmonella typhimurium (strain LT2 / SGSC1412 / ATCC 700720) GN=bglX PE=3 SV=2)

HSP 1 Score: 232.3 bits (591), Expect = 1.400e-59
Identity = 192/650 (29.54%), Postives = 293/650 (45.08%), Query Frame = 1

		  

Query: 18  IKDLLSRMTLQEKLAQMAQIERTVASPTS-----LKDIGIGSVLSAGGSAPFDKAVSSDW 77
           + DLL +MT+ EK+ Q+  I     +P       +KD  +G++        F+     D 
Sbjct: 38  VTDLLKKMTVDEKIGQLRLISVGPDNPKEAIREMIKDGQVGAI--------FNTVTRQDI 97

Query: 78  ADMIDNFQKLAMESRLGIPIIYGVDAVHGNNNVYGATIFPHNVNLGATRDANLAHRIGAA 137
             M D    L   SRL IP+ +  D VHG       T+FP ++ L ++ + +    +G  
Sbjct: 98  RQMQDQVMAL---SRLKIPLFFAYDVVHGQR-----TVFPISLGLASSFNLDAVRTVGRV 157

Query: 138 TALEVRASGAHYTFAPCVAVCRDPRWGRCYECYSEDTEIVRKM-TSIVSGLQGDPPKGHP 197
           +A E    G + T+AP V V RDPRWGR  E + EDT +   M  ++V  +QG  P    
Sbjct: 158 SAYEAADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSIMGETMVKAMQGKSPAD-- 217

Query: 198 NGYPFLKGRKNAIACAKHFVGDGGTHKGVNEGNTISSYEDLERIHMAPYLDCISQGVCTV 257
                   R + +   KHF   G    G        S + L   +M PY   +  G   V
Sbjct: 218 --------RYSVMTSVKHFAAYGAVEGGKEYNTVDMSSQRLFNDYMPPYKAGLDAGSGAV 277

Query: 258 MASYTSWNERKLHADRFLLTEVLKDKLGFKGFVISDWEGIDRLCVPQG--ADFRFCITAA 317
           M +  S N     +D +LL +VL+D+ GFKG  +SD   I  L +  G  AD    +  A
Sbjct: 278 MVALNSLNGTPATSDSWLLKDVLRDEWGFKGITVSDHGAIKEL-IKHGTAADPEDAVRVA 337

Query: 318 VNAGIDMVMVPFKYEKYLEDLTYLVDSKDIPLSRIDDAVERILRVKFVAGLFEHPYT--- 377
           + AG+DM M    Y KYL     L+ S  + ++ +DDA   +L VK+  GLF  PY+   
Sbjct: 338 LKAGVDMSMADEYYSKYLPG---LIKSGKVTMAELDDATRHVLNVKYDMGLFNDPYSHLG 397

Query: 378 --DRSLLDT-VGCKLHRELAREAVRKSLVLLKNGKDPKYPFLPLGRNARKVLVAGTHANN 437
             +   +DT    +LHR+ ARE  R+S+VLLKN    +   LPL + +  + V G  A++
Sbjct: 398 PKESDPVDTNAESRLHRKEAREVARESVVLLKN----RLETLPL-KKSGTIAVVGPLADS 457

Query: 438 LGYQCGGWTITWYGLSGRVTIGTTILDAIKQAVGDNTQV-------------------IY 497
                G W+      +G      T+L  I+ AVGD  ++                   +Y
Sbjct: 458 QRDVMGSWS-----AAGVANQSVTVLAGIQNAVGDGAKILYAKGANITNDKGIVDFLNLY 517

Query: 498 EEFPSRD-----------TLAREDFDFAIVAAGEAP-YAEFTGDNSVLNVPLNGADVISA 557
           EE    D             A +  D  +   GE+   A      + + +P +  D+I+A
Sbjct: 518 EEAVKIDPRSPQAMIDEAVQAAKQADVVVAVVGESQGMAHEASSRTNITIPQSQRDLITA 577

Query: 558 VADKIPTLV-ILVSGRPLVLEPWLLGKIDALVAAFLPGSE-GDGVTDVLFGDYEFEGLLP 600
           +      LV +L++GRPL L      + DA++  +  G+E G+ + DVLFGDY   G LP
Sbjct: 578 LKATGKPLVLVLMNGRPLALVK-EDQQADAILETWFAGTEGGNAIADVLFGDYNPSGKLP 637

BLAST of Spo07204.1 vs. ExPASy Swiss-Prot
Match: BGLX_ECOLI (Periplasmic beta-glucosidase OS=Escherichia coli (strain K12) GN=bglX PE=3 SV=2)

HSP 1 Score: 216.9 bits (551), Expect = 6.300e-55
Identity = 184/613 (30.02%), Postives = 284/613 (46.33%), Query Frame = 1

		  

Query: 18  IKDLLSRMTLQEKLAQMAQIERTVASPTS-----LKDIGIGSVLSAGGSAPFDKAVSSDW 77
           + +LL +MT+ EK+ Q+  I     +P       +KD  +G++        F+     D 
Sbjct: 38  VTELLKKMTVDEKIGQLRLISVGPDNPKEAIREMIKDGQVGAI--------FNTVTRQDI 97

Query: 78  ADMIDNFQKLAMESRLGIPIIYGVDAVHGNNNVYGATIFPHNVNLGATRDANLAHRIGAA 137
             M D   +L   SRL IP+ +  D +HG       T+FP ++ L ++ + +    +G  
Sbjct: 98  RAMQDQVMEL---SRLKIPLFFAYDVLHGQR-----TVFPISLGLASSFNLDAVKTVGRV 157

Query: 138 TALEVRASGAHYTFAPCVAVCRDPRWGRCYECYSEDTEIVRKM-TSIVSGLQGDPPKGHP 197
           +A E    G + T+AP V V RDPRWGR  E + EDT +   M  ++V  +QG  P    
Sbjct: 158 SAYEAADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSTMGKTMVEAMQGKSPAD-- 217

Query: 198 NGYPFLKGRKNAIACAKHFVGDGGTHKGVNEGNTISSYEDLERIHMAPYLDCISQGVCTV 257
                   R + +   KHF   G    G        S + L   +M PY   +  G   V
Sbjct: 218 --------RYSVMTSVKHFAAYGAVEGGKEYNTVDMSPQRLFNDYMPPYKAGLDAGSGAV 277

Query: 258 MASYTSWNERKLHADRFLLTEVLKDKLGFKGFVISDWEGIDRLCVPQG--ADFRFCITAA 317
           M +  S N     +D +LL +VL+D+ GFKG  +SD   I  L +  G  AD    +  A
Sbjct: 278 MVALNSLNGTPATSDSWLLKDVLRDQWGFKGITVSDHGAIKEL-IKHGTAADPEDAVRVA 337

Query: 318 VNAGIDMVMVPFKYEKYLEDLTYLVDSKDIPLSRIDDAVERILRVKFVAGLFEHPYT--- 377
           + +GI+M M    Y KYL     L+ S  + ++ +DDA   +L VK+  GLF  PY+   
Sbjct: 338 LKSGINMSMSDEYYSKYLPG---LIKSGKVTMAELDDAARHVLNVKYDMGLFNDPYSHLG 397

Query: 378 --DRSLLDT-VGCKLHRELAREAVRKSLVLLKNGKDPKYPFLPLGRNARKVLVAGTHANN 437
             +   +DT    +LHR+ ARE  R+SLVLLKN    +   LPL ++A  + V G  A++
Sbjct: 398 PKESDPVDTNAESRLHRKEAREVARESLVLLKN----RLETLPLKKSA-TIAVVGPLADS 457

Query: 438 LGYQCGGWTITWYGLSGRVTIGTTILDAIKQAVGDNTQVIYEEFPSRDTLAREDFDFAIV 497
                G W+      +G      T+L  IK AVG+N +V+Y +  +  T  +   DF + 
Sbjct: 458 KRDVMGSWS-----AAGVADQSVTVLTGIKNAVGENGKVLYAK-GANVTSDKGIIDF-LN 517

Query: 498 AAGEAPYAEFTGDNSVLNVPLNGA---DVISAVADK---------------IP------T 557
              EA   +      +++  +  A   DV+ AV  +               IP       
Sbjct: 518 QYEEAVKVDPRSPQEMIDEAVQTAKQSDVVVAVVGEAQGMAHEASSRTDITIPQSQRDLI 577

Query: 558 LVILVSGRPLVL-----EPWLLGK----IDALVAAFLPGSE-GDGVTDVLFGDYEFEGLL 583
             +  +G+PLVL      P  L K     DA++  +  G+E G+ + DVLFGDY   G L
Sbjct: 578 AALKATGKPLVLVLMNGRPLALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNPSGKL 608

BLAST of Spo07204.1 vs. ExPASy Swiss-Prot
Match: BGLB_CLOTH (Thermostable beta-glucosidase B OS=Clostridium thermocellum (strain ATCC 27405 / DSM 1237 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) GN=bglB PE=1 SV=2)

HSP 1 Score: 160.2 bits (404), Expect = 7.000e-38
Identity = 161/569 (28.30%), Postives = 243/569 (42.71%), Query Frame = 1

		  

Query: 87  RLGIPIIYGVDAVHGN------------NNVYGATIFPHNVNLGATRDANLAHRIGAATA 146
           RLGIP I   D  HG             NN   AT FP    L  + D  L  R+GAA  
Sbjct: 34  RLGIPSIMMTDGPHGLRKQREDAEIADINNSVPATCFPSAAGLACSWDRELVERVGAALG 93

Query: 147 LEVRASGAHYTFAPCVAVCRDPRWGRCYECYSEDTEIVRKMT-SIVSGLQGDPPKGHPNG 206
            E +A        P   + R P  GR +E +SED  +  ++  S + G+Q          
Sbjct: 94  EECQAENVSILLGPGANIKRSPLCGRNFEYFSEDPYLSSELAASHIKGVQS--------- 153

Query: 207 YPFLKGRKNAIACAKHFVGDGGTHKGVNEGNTISSYEDLERIHMAPYLDCISQG-VCTVM 266
                  +   AC KHF  +   H+ +   +TI     L  I+ A + + + +     VM
Sbjct: 154 -------QGVGACLKHFAANNQEHRRMTV-DTIVDERTLREIYFASFENAVKKARPWVVM 213

Query: 267 ASYTSWNERKLHADRFLLTEVLKDKLGFKGFVISDWEGI-DRLCVPQGADFRFCITAAVN 326
            +Y   N      +R+LLTEVLK++    GFV+SDW  + DR+             + ++
Sbjct: 214 CAYNKLNGEYCSENRYLLTEVLKNEWMHDGFVVSDWGAVNDRV-------------SGLD 273

Query: 327 AGIDMVMVPFKYEKYLEDLTYLVDSKDIPLSRIDDAVERILRVKFVA--GLFEHPYTDRS 386
           AG+D+ M P  +    + +   V S  +  + ++ AVERIL+V F+A     E+   D+ 
Sbjct: 274 AGLDLEM-PTSHGITDKKIVEAVKSGKLSENILNRAVERILKVIFMALENKKENAQYDKD 333

Query: 387 LLDTVGCKLHRELAREAVRKSLVLLKNGKDPKYPFLPLGRNARKVLVAGTHANNLGYQCG 446
                    H  LAR+A  +S+VLLKN  D     LPL ++    L+ G       YQ  
Sbjct: 334 --------AHHRLARQAAAESMVLLKNEDD----VLPLKKSGTIALI-GAFVKKPRYQGS 393

Query: 447 GWTITWYGLSGRVTIG--TTILDAIKQAVGDNTQVIY------------EEFPSRDTLAR 506
           G        S  +T      I + IK+A GD   ++Y            EE  +    A 
Sbjct: 394 G--------SSHITPTRLDDIYEEIKKAGGDKVNLVYSEGYRLENDGIDEELINEAKKAA 453

Query: 507 EDFDFAIVAAGEAPYAEFTG-DNSVLNVPLNGADVISAVAD-KIPTLVILVSGRPLVLEP 566
              D A+V AG     E  G D + +++P N   +I AVA+ +   +V+L++G P+ + P
Sbjct: 454 SSSDVAVVFAGLPDEYESEGFDRTHMSIPENQNRLIEAVAEVQSNIVVVLLNGSPVEM-P 513

Query: 567 WLLGKIDALVAAFLPGSE-GDGVTDVLFGDYEFEGLLPVTWFKSVDQLP----------- 600
           W + K+ +++ A+L G   G  + DVLFG+    G L  T+   +   P           
Sbjct: 514 W-IDKVKSVLEAYLGGQALGGALADVLFGEVNPSGKLAETFPVKLSHNPSYLNFPGEDDR 548

BLAST of Spo07204.1 vs. TAIR (Arabidopsis)
Match: AT3G47000.1 (Glycosyl hydrolase family protein)

HSP 1 Score: 914.8 bits (2363), Expect = 2.700e-266
Identity = 428/601 (71.21%), Postives = 516/601 (85.86%), Query Frame = 1

		  

Query: 2   EENCIYRNPNSSIEERIKDLLSRMTLQEKLAQMAQIERTVASPTSLKDIGIGSVLSAGGS 61
           E +C+Y+N ++ +E R+KDLLSRMTL EK+ QM QIER VASP++  D  IGSVL+AGGS
Sbjct: 5   ESSCVYKNGDAPVEARVKDLLSRMTLPEKIGQMTQIERRVASPSAFTDFFIGSVLNAGGS 64

Query: 62  APFDKAVSSDWADMIDNFQKLAMESRLGIPIIYGVDAVHGNNNVYGATIFPHNVNLGATR 121
            PF+ A SSDWADMID FQ+ A+ SRLGIPIIYG DAVHGNNNVYGAT+FPHN+ LGATR
Sbjct: 65  VPFEDAKSSDWADMIDGFQRSALASRLGIPIIYGTDAVHGNNNVYGATVFPHNIGLGATR 124

Query: 122 DANLAHRIGAATALEVRASGAHYTFAPCVAVCRDPRWGRCYECYSEDTEIVRKMTSIVSG 181
           DA+L  RIGAATALEVRASG H+ F+PCVAV RDPRWGRCYE Y ED E+V +MTS+VSG
Sbjct: 125 DADLVRRIGAATALEVRASGVHWAFSPCVAVLRDPRWGRCYESYGEDPELVCEMTSLVSG 184

Query: 182 LQGDPPKGHPNGYPFLKGRKNAIACAKHFVGDGGTHKGVNEGNTISSYEDLERIHMAPYL 241
           LQG PP+ HPNGYPF+ GR N +AC KHFVGDGGT KG+NEGNTI+SYE+LE+IH+ PYL
Sbjct: 185 LQGVPPEEHPNGYPFVAGRNNVVACVKHFVGDGGTDKGINEGNTIASYEELEKIHIPPYL 244

Query: 242 DCISQGVCTVMASYTSWNERKLHADRFLLTEVLKDKLGFKGFVISDWEGIDRLCVPQGAD 301
            C++QGV TVMASY+SWN  +LHADRFLLTE+LK+KLGFKGF++SDWEG+DRL  PQG++
Sbjct: 245 KCLAQGVSTVMASYSSWNGTRLHADRFLLTEILKEKLGFKGFLVSDWEGLDRLSEPQGSN 304

Query: 302 FRFCITAAVNAGIDMVMVPFKYEKYLEDLTYLVDSKDIPLSRIDDAVERILRVKFVAGLF 361
           +R+CI  AVNAGIDMVMVPFKYE++++D+T LV+S +IP++RI+DAVERILRVKFVAGLF
Sbjct: 305 YRYCIKTAVNAGIDMVMVPFKYEQFIQDMTDLVESGEIPMARINDAVERILRVKFVAGLF 364

Query: 362 EHPYTDRSLLDTVGCKLHRELAREAVRKSLVLLKNGKDPKYPFLPLGRNARKVLVAGTHA 421
            HP TDRSLL TVGCK HRELA+EAVRKSLVLLK+GK+   PFLPL RNA+++LV GTHA
Sbjct: 365 GHPLTDRSLLPTVGCKEHRELAQEAVRKSLVLLKSGKNADKPFLPLDRNAKRILVTGTHA 424

Query: 422 NNLGYQCGGWTITWYGLSGRVTIGTTILDAIKQAVGDNTQVIYEEFPSRDTLA-REDFDF 481
           ++LGYQCGGWT TW+GLSGR+TIGTT+LDAIK+AVGD T+VIYE+ PS++TLA  E F +
Sbjct: 425 DDLGYQCGGWTKTWFGLSGRITIGTTLLDAIKEAVGDETEVIYEKTPSKETLASSEGFSY 484

Query: 482 AIVAAGEAPYAEFTGDNSVLNVPLNGADVISAVADKIPTLVILVSGRPLVLEPWLLGKID 541
           AIVA GE PYAE  GDNS L +P NG D+++AVA+ IPTLVIL+SGRP+VLEP +L K +
Sbjct: 485 AIVAVGEPPYAETMGDNSELRIPFNGTDIVTAVAEIIPTLVILISGRPVVLEPTVLEKTE 544

Query: 542 ALVAAFLPGSEGDGVTDVLFGDYEFEGLLPVTWFKSVDQLPMTAGQNSYDPLFPLGFGLK 601
           ALVAA+LPG+EG GV DV+FGDY+F+G LPV+WFK V+ LP+ A  NSYDPLFP GFGL 
Sbjct: 545 ALVAAWLPGTEGQGVADVVFGDYDFKGKLPVSWFKHVEHLPLDAHANSYDPLFPFGFGLN 604

BLAST of Spo07204.1 vs. TAIR (Arabidopsis)
Match: AT3G47010.1 (Glycosyl hydrolase family protein)

HSP 1 Score: 868.6 bits (2243), Expect = 2.200e-252
Identity = 411/604 (68.05%), Postives = 505/604 (83.61%), Query Frame = 1

		  

Query: 2   EENCIYRNPNSSIEERIKDLLSRMTLQEKLAQMAQIERTVASPTSLKDIGIGSVLSAGGS 61
           E + +Y+N ++ +E R+KDLLSRMTL EK+ QM QIER+VASP  + +  IGSV S  GS
Sbjct: 6   ESSWVYKNRDAPVEARVKDLLSRMTLPEKIGQMTQIERSVASPQVITNSFIGSVQSGAGS 65

Query: 62  APFDKAVSSDWADMIDNFQKLAMESRLGIPIIYGVDAVHGNNNVYGATIFPHNVNLGATR 121
            P + A SSDWADMID FQ+ A+ SRLGIPIIYG DAVHGNNNVYGAT+FPHN+ LGATR
Sbjct: 66  WPLEDAKSSDWADMIDGFQRSALASRLGIPIIYGTDAVHGNNNVYGATVFPHNIGLGATR 125

Query: 122 DANLAHRIGAATALEVRASGAHYTFAPCVAVCRDPRWGRCYECYSEDTEIVRKMTSIVSG 181
           DA+L  RIGAATALE+RASG H+TFAPCVAV  DPRWGRCYE YSE  +IV +M+ ++SG
Sbjct: 126 DADLVKRIGAATALEIRASGVHWTFAPCVAVLGDPRWGRCYESYSEAAKIVCEMSLLISG 185

Query: 182 LQGDPPKGHPNGYPFLKGRKNAIACAKHFVGDGGTHKGVNEGNTISSYEDLERIHMAPYL 241
           LQG+PP+ HP GYPFL GR N IACAKHFVGDGGT KG++EGNTI+SYEDLE+IH+APYL
Sbjct: 186 LQGEPPEEHPYGYPFLAGRNNVIACAKHFVGDGGTEKGLSEGNTITSYEDLEKIHVAPYL 245

Query: 242 DCISQGVCTVMASYTSWNERKLHADRFLLTEVLKDKLGFKGFVISDWEGIDRLCVPQGAD 301
           +CI+QGV TVMAS++SWN  +LH+D FLLTEVLK KLGFKGF++SDW+G++ +  P+G++
Sbjct: 246 NCIAQGVSTVMASFSSWNGSRLHSDYFLLTEVLKQKLGFKGFLVSDWDGLETISEPEGSN 305

Query: 302 FRFCITAAVNAGIDMVMVPFKYEKYLEDLTYLVDSKDIPLSRIDDAVERILRVKFVAGLF 361
           +R C+   +NAGIDMVMVPFKYE++++D+T LV+S +IP++R++DAVERILRVKFVAGLF
Sbjct: 306 YRNCVKLGINAGIDMVMVPFKYEQFIQDMTDLVESGEIPMARVNDAVERILRVKFVAGLF 365

Query: 362 EHPYTDRSLLDTVGCKLHRELAREAVRKSLVLLKNGKDPKYPFLPLGRNARKVLVAGTHA 421
           EHP  DRSLL TVGCK HRE+AREAVRKSLVLLKNGK+   PFLPL RNA+++LV G HA
Sbjct: 366 EHPLADRSLLGTVGCKEHREVAREAVRKSLVLLKNGKNADTPFLPLDRNAKRILVVGMHA 425

Query: 422 NNLGYQCGGWTITWYGLSGRVTIGTTILDAIKQAVGDNTQVIYEEFPSRDTLARED-FDF 481
           N+LG QCGGWT    G SGR+TIGTT+LD+IK AVGD T+VI+E+ P+++TLA  D F +
Sbjct: 426 NDLGNQCGGWTKIKSGQSGRITIGTTLLDSIKAAVGDKTEVIFEKTPTKETLASSDGFSY 485

Query: 482 AIVAAGEAPYAEFTGDNSVLNVPLNGADVISAVADKIPTLVILVSGRPLVLEPWLLGKID 541
           AIVA GE PYAE  GDNS L +P NG ++I+AVA+KIPTLVIL SGRP+VLEP +L K +
Sbjct: 486 AIVAVGEPPYAEMKGDNSELTIPFNGNNIITAVAEKIPTLVILFSGRPMVLEPTVLEKTE 545

Query: 542 ALVAAFLPGSEGDGVTDVLFGDYEFEGLLPVTWFKSVDQLPMTAGQNSYDPLFPLGFGLK 601
           ALVAA+ PG+EG G++DV+FGDY+F+G LPV+WFK VDQLP+ A  NSYDPLFPLGFGL 
Sbjct: 546 ALVAAWFPGTEGQGMSDVIFGDYDFKGKLPVSWFKRVDQLPLNAEANSYDPLFPLGFGLT 605

Query: 602 TNNG 605
           +N G
Sbjct: 606 SNFG 609

BLAST of Spo07204.1 vs. TAIR (Arabidopsis)
Match: AT3G47040.2 (Glycosyl hydrolase family protein)

HSP 1 Score: 858.2 bits (2216), Expect = 3.000e-249
Identity = 418/639 (65.41%), Postives = 508/639 (79.50%), Query Frame = 1

		  

Query: 3   ENCIYRNPNSSIEERIKDLLSRMTLQEKLAQMAQIERTVASPTSLKDIGIGSVLSAGGSA 62
           E C+Y+N ++ +E R+KDLLSRMTL EK+ QM QIER V +P  + D  IGSVL+ GGS 
Sbjct: 6   ETCVYKNKDAPVEARVKDLLSRMTLPEKIGQMTQIERVVTTPPVITDNFIGSVLNGGGSW 65

Query: 63  PFDKAVSSDWADMIDNFQKLAMESRLGIPIIYGVDAVHGNNNVYGATIFPHNVNLGAT-- 122
           PF+ A +SDWADMID +Q  A+ SRLGIPIIYG+DAVHGNNNVYGATIFPHN+ LGAT  
Sbjct: 66  PFEDAKTSDWADMIDGYQNAALASRLGIPIIYGIDAVHGNNNVYGATIFPHNIGLGATSL 125

Query: 123 -----------------------RDANLAHRIGAATALEVRASGAHYTFAPCVAVCRDPR 182
                                  RDA+L  R+GAATALEVRA GAH+ FAPCVA     R
Sbjct: 126 VMLLHIDLEPKSLGRNKVVVKCDRDADLIRRVGAATALEVRACGAHWAFAPCVATSIQGR 185

Query: 183 WG-----RCY---ECYSEDTEIVRKMTSIVSGLQGDPPKGHPNGYPFLKGRKNAIACAKH 242
                  + Y   E   ED +I+ +++S+VSGLQG+PPK HPNGYPFL GR N +ACAKH
Sbjct: 186 IPNKKIKKIYMRKELKCEDPDIICELSSLVSGLQGEPPKEHPNGYPFLAGRNNVVACAKH 245

Query: 243 FVGDGGTHKGVNEGNTISSYEDLERIHMAPYLDCISQGVCTVMASYTSWNERKLHADRFL 302
           FVGDGGT KG+NEGNTI SYE+LE+IH+APYL+C++QGV TVMASY+SWN  KLH+D FL
Sbjct: 246 FVGDGGTDKGINEGNTIVSYEELEKIHLAPYLNCLAQGVSTVMASYSSWNGSKLHSDYFL 305

Query: 303 LTEVLKDKLGFKGFVISDWEGIDRLCVPQGADFRFCITAAVNAGIDMVMVPFKYEKYLED 362
           LTE+LK KLGFKGFVISDWE ++RL  P G+++R C+  +VNAG+DMVMVPFKYE++++D
Sbjct: 306 LTELLKQKLGFKGFVISDWEALERLSEPFGSNYRNCVKISVNAGVDMVMVPFKYEQFIKD 365

Query: 363 LTYLVDSKDIPLSRIDDAVERILRVKFVAGLFEHPYTDRSLLDTVGCKLHRELAREAVRK 422
           LT LV+S ++ +SRIDDAVERILRVKFVAGLFEHP TDRSLL TVGCK HRELARE+VRK
Sbjct: 366 LTDLVESGEVTMSRIDDAVERILRVKFVAGLFEHPLTDRSLLGTVGCKEHRELARESVRK 425

Query: 423 SLVLLKNGKDPKYPFLPLGRNARKVLVAGTHANNLGYQCGGWTITWYGLSGRVTIGTTIL 482
           SLVLLKNG + + PFLPL RN +++LV GTHA++LGYQCGGWT  W+GLSGR+TIGTT+L
Sbjct: 426 SLVLLKNGTNSEKPFLPLDRNVKRILVTGTHADDLGYQCGGWTKAWFGLSGRITIGTTLL 485

Query: 483 DAIKQAVGDNTQVIYEEFPSRDTLAR-EDFDFAIVAAGEAPYAEFTGDNSVLNVPLNGAD 542
           DAIK+AVGD T+VIYE+ PS +TLA  + F +AIVA GE PYAE  GDNS L +PLNG D
Sbjct: 486 DAIKEAVGDKTEVIYEKTPSEETLASLQRFSYAIVAVGETPYAETLGDNSELTIPLNGND 545

Query: 543 VISAVADKIPTLVILVSGRPLVLEPWLLGKIDALVAAFLPGSEGDGVTDVLFGDYEFEGL 602
           +++A+A+KIPTLV+L SGRPLVLEP +L K +ALVAA+LPG+EG G+TDV+FGDY+FEG 
Sbjct: 546 IVTALAEKIPTLVVLFSGRPLVLEPLVLEKAEALVAAWLPGTEGQGMTDVIFGDYDFEGK 605

Query: 603 LPVTWFKSVDQLPMTAGQNSYDPLFPLGFGLKTNNGKHV 608
           LPV+WFK VDQLP+TA  NSYDPLFPLGFGL  N+ ++V
Sbjct: 606 LPVSWFKRVDQLPLTADANSYDPLFPLGFGLNYNSSENV 644

BLAST of Spo07204.1 vs. TAIR (Arabidopsis)
Match: AT3G47050.1 (Glycosyl hydrolase family protein)

HSP 1 Score: 845.1 bits (2182), Expect = 2.700e-245
Identity = 409/599 (68.28%), Postives = 487/599 (81.30%), Query Frame = 1

		  

Query: 2   EENCIYRNPNSSIEERIKDLLSRMTLQEKLAQMAQIERTVASPTSLKDIGIGSVLSAGGS 61
           E++ +Y+N  + +E R+KDLLSRMTL EK+ QM  IER+VAS   ++D  IGSVL+  G 
Sbjct: 5   EKSYVYKNREAPVEARVKDLLSRMTLAEKIGQMTLIERSVASEAVIRDFSIGSVLNRAGG 64

Query: 62  APFDKAVSSDWADMIDNFQKLAMESRLGIPIIYGVDAVHGNNNVYGATIFPHNVNLGATR 121
            PF+ A SS+WADMID FQ+ A+ESRLGIPIIYG+DAVHGNN+VYGATIFPHN+ LGATR
Sbjct: 65  WPFEDAKSSNWADMIDGFQRSALESRLGIPIIYGIDAVHGNNDVYGATIFPHNIGLGATR 124

Query: 122 DANLAHRIGAATALEVRASGAHYTFAPCVAVCRDPRWGRCYECYSEDTEIVRKMTSIVSG 181
           DA+L  RIGAATALEVRA GAH+ FAPCVAV +DPRWGRCYE Y E  +IV +MTS+VSG
Sbjct: 125 DADLVKRIGAATALEVRACGAHWAFAPCVAVVKDPRWGRCYESYGEVAQIVSEMTSLVSG 184

Query: 182 LQGDPPKGHPNGYPFLKGRKNAIACAKHFVGDGGTHKGVNEGNTISSYEDLERIHMAPYL 241
           LQG+P K H NGYPFL GRKN +ACAKHFVGDGGT+K +NEGNTI  YEDLER H+APY 
Sbjct: 185 LQGEPSKDHTNGYPFLAGRKNVVACAKHFVGDGGTNKAINEGNTILRYEDLERKHIAPYK 244

Query: 242 DCISQGVCTVMASYTSWNERKLHADRFLLTEVLKDKLGFKGFVISDWEGIDRLCVPQGAD 301
            CISQGV TVMASY+SWN  KLH+  FLLTE+LK KLGFKG+V+SDWEG+DRL  P G++
Sbjct: 245 KCISQGVSTVMASYSSWNGDKLHSHYFLLTEILKQKLGFKGYVVSDWEGLDRLSDPPGSN 304

Query: 302 FRFCITAAVNAGIDMVMVPFKYEKYLEDLTYLVDSKDIPLSRIDDAVERILRVKFVAGLF 361
           +R C+   +NAGIDMVMVPFKYE++  DL  LV+S ++ ++R++DAVERILRVKFVAGLF
Sbjct: 305 YRNCVKIGINAGIDMVMVPFKYEQFRNDLIDLVESGEVSMARVNDAVERILRVKFVAGLF 364

Query: 362 EHPYTDRSLLDTVGCKLHRELAREAVRKSLVLLKNGKDPKYPFLPLGRNARKVLVAGTHA 421
           E P TDRSLL TVGCK HRELAREAVRKSLVLLKNG+  +  FLPL  NA ++LV GTHA
Sbjct: 365 EFPLTDRSLLPTVGCKEHRELAREAVRKSLVLLKNGRYGE--FLPLNCNAERILVVGTHA 424

Query: 422 NNLGYQCGGWTITWYGLSGRVTIGTTILDAIKQAVGDNTQVIYEEFPSRDTLAR-EDFDF 481
           ++LGYQCGGWT T YG SGR+T GTT+LDAIK AVGD T+VIYE+ PS +TLA    F +
Sbjct: 425 DDLGYQCGGWTKTMYGQSGRITDGTTLLDAIKAAVGDETEVIYEKSPSEETLASGYRFSY 484

Query: 482 AIVAAGEAPYAEFTGDNSVLNVPLNGADVISAVADKIPTLVILVSGRPLVLEPWLLGKID 541
           AIVA GE+PYAE  GDNS L +P NG+++I+ VA+KIPTLVIL SGRP+ LEP +L K +
Sbjct: 485 AIVAVGESPYAETMGDNSELVIPFNGSEIITTVAEKIPTLVILFSGRPMFLEPQVLEKAE 544

Query: 542 ALVAAFLPGSEGDGVTDVLFGDYEFEGLLPVTWFKSVDQLPMTAGQNSYDPLFPLGFGL 600
           ALVAA+LPG+EG G+ DV+FGDY+F G LP TWFK VDQLP+    N Y PLFPLGFGL
Sbjct: 545 ALVAAWLPGTEGQGIADVIFGDYDFRGKLPATWFKRVDQLPLDIESNGYLPLFPLGFGL 601

BLAST of Spo07204.1 vs. TAIR (Arabidopsis)
Match: AT5G04885.1 (Glycosyl hydrolase family protein)

HSP 1 Score: 711.1 bits (1834), Expect = 6.000e-205
Identity = 338/607 (55.68%), Postives = 440/607 (72.49%), Query Frame = 1

		  

Query: 3   ENCIYRNPNSSIEERIKDLLSRMTLQEKLAQMAQIERTVASPTSLKDIGIGSVLSAGGSA 62
           E  +Y++P  ++ +R+ DL  RMTL+EK+ QM QI+R+VA+   ++D  IGSVLS GGSA
Sbjct: 26  EYLLYKDPKQTVSDRVADLFGRMTLEEKIGQMVQIDRSVATVNIMRDYFIGSVLSGGGSA 85

Query: 63  PFDKAVSSDWADMIDNFQKLAMESRLGIPIIYGVDAVHGNNNVYGATIFPHNVNLGATRD 122
           P  +A + +W DMI+ +QK A+ SRLGIP+IYG+DAVHG+NNVY ATIFPHNV LGATRD
Sbjct: 86  PLPEASAQNWVDMINEYQKGALVSRLGIPMIYGIDAVHGHNNVYNATIFPHNVGLGATRD 145

Query: 123 ANLAHRIGAATALEVRASGAHYTFAPCVAVCRDPRWGRCYECYSEDTEIVRKMTSIVSGL 182
            +L  RIGAATA+EVRA+G  YTFAPC+AVCRDPRWGRCYE YSED ++V  MT ++ GL
Sbjct: 146 PDLVKRIGAATAVEVRATGIPYTFAPCIAVCRDPRWGRCYESYSEDHKVVEDMTDVILGL 205

Query: 183 QGDPPKGHPNGYPFLKGRKNAIACAKHFVGDGGTHKGVNEGNTISSYEDLERIHMAPYLD 242
           QG+PP  + +G PF+ GR    ACAKH+VGDGGT +GVNE NT++    L  +HM  Y D
Sbjct: 206 QGEPPSNYKHGVPFVGGRDKVAACAKHYVGDGGTTRGVNENNTVTDLHGLLSVHMPAYAD 265

Query: 243 CISQGVCTVMASYTSWNERKLHADRFLLTEVLKDKLGFKGFVISDWEGIDRLCVPQGADF 302
            + +GV TVM SY+SWN  K+HA+  L+T  LK  L FKGFVISDW+G+D++  P    +
Sbjct: 266 AVYKGVSTVMVSYSSWNGEKMHANTELITGYLKGTLKFKGFVISDWQGVDKISTPPHTHY 325

Query: 303 RFCITAAVNAGIDMVMVPFKYEKYLEDLTYLVDSKDIPLSRIDDAVERILRVKFVAGLFE 362
              + AA+ AGIDMVMVPF + +++ DLT LV +  IP++RIDDAV RIL VKF  GLFE
Sbjct: 326 TASVRAAIQAGIDMVMVPFNFTEFVNDLTTLVKNNSIPVTRIDDAVRRILLVKFTMGLFE 385

Query: 363 HPYTDRSLLDTVGCKLHRELAREAVRKSLVLLKNGKDPKYPFLPLGRNARKVLVAGTHAN 422
           +P  D S    +G + HR+LAREAVRKSLVLLKNG     P LPL R   K+LVAGTHA+
Sbjct: 386 NPLADYSFSSELGSQAHRDLAREAVRKSLVLLKNGNKTN-PMLPLPRKTSKILVAGTHAD 445

Query: 423 NLGYQCGGWTITWYGLSG-RVTIGTTILDAIKQAVGDNTQVIYEEFPSRDTLAREDFDFA 482
           NLGYQCGGWTITW G SG + T GTT+L A+K AV  +T+V++ E P  + +   +F +A
Sbjct: 446 NLGYQCGGWTITWQGFSGNKNTRGTTLLSAVKSAVDQSTEVVFRENPDAEFIKSNNFAYA 505

Query: 483 IVAAGEAPYAEFTGDNSVLNVPLNGADVISAVADKIPTLVILVSGRPLVLEPWLLGKIDA 542
           I+A GE PYAE  GD+  L +   G  +IS+    +  +V+++SGRPLV+EP+ +  IDA
Sbjct: 506 IIAVGEPPYAETAGDSDKLTMLDPGPAIISSTCQAVKCVVVVISGRPLVMEPY-VASIDA 565

Query: 543 LVAAFLPGSEGDGVTDVLFGDYEFEGLLPVTWFKSVDQLPMTAGQNSYDPLFPLGFGLKT 602
           LVAA+LPG+EG G+TD LFGD+ F G LPVTWF++ +QLPM+ G   YDPLF  G GL+T
Sbjct: 566 LVAAWLPGTEGQGITDALFGDHGFSGKLPVTWFRNTEQLPMSYGDTHYDPLFAYGSGLET 625

Query: 603 NNGKHVI 609
            +   ++
Sbjct: 626 ESVASIV 630

The following BLAST results are available for this feature:
BLAST of Spo07204.1 vs. NCBI nr
Analysis Date: 2018-06-29 (blastp Spinacia oleracea peptides vs. NCBI nr)
Total hits: 5
Match NameE-valueIdentityDescription
gi|902208388|gb|KNA15936.1|0.0e+099.8hypothetical protein SOVF_0937... [more]
gi|731349751|ref|XP_010686158.1|0.0e+088.4PREDICTED: lysosomal beta gluc... [more]
gi|870853233|gb|KMT05114.1|0.0e+088.4hypothetical protein BVRB_7g17... [more]
gi|645216291|ref|XP_008220446.1|4.1e-28877.4PREDICTED: lysosomal beta gluc... [more]
gi|595835810|ref|XP_007207129.1|4.1e-28877.0hypothetical protein PRUPE_ppa... [more]
back to top
BLAST of Spo07204.1 vs. UniProtKB/TrEMBL
Analysis Date: 2018-06-29 (blastp Spinacia oleracea peptides vs. UniprotKB/TrEMBL)
Total hits: 5
Match NameE-valueIdentityDescription
A0A0K9R8W1_SPIOL0.0e+099.8Uncharacterized protein OS=Spi... [more]
A0A0J8BYP9_BETVU0.0e+088.4Uncharacterized protein OS=Bet... [more]
M5WGE3_PRUPE2.8e-28877.0Uncharacterized protein OS=Pru... [more]
A0A061GZX5_THECC1.7e-28575.9Glycosyl hydrolase family prot... [more]
A0A061GWE0_THECC4.2e-28475.8Glycosyl hydrolase family prot... [more]
back to top
BLAST of Spo07204.1 vs. ExPASy Swiss-Prot
Analysis Date: 2018-06-29 (blastp Spinacia oleracea peptides vs. ExPASy SwissProt)
Total hits: 5
Match NameE-valueIdentityDescription
BGH3B_BACO12.9e-7631.9Beta-glucosidase BoGH3B OS=Bac... [more]
GLUA_DICDI1.2e-6129.8Lysosomal beta glucosidase OS=... [more]
BGLX_SALTY1.4e-5929.5Periplasmic beta-glucosidase O... [more]
BGLX_ECOLI6.3e-5530.0Periplasmic beta-glucosidase O... [more]
BGLB_CLOTH7.0e-3828.3Thermostable beta-glucosidase ... [more]
back to top
BLAST of Spo07204.1 vs. TAIR (Arabidopsis)
Analysis Date: 2018-06-29 (blastp Spinacia oleracea peptides vs. TAIR)
Total hits: 5
Match NameE-valueIdentityDescription
AT3G47000.12.7e-26671.2Glycosyl hydrolase family prot... [more]
AT3G47010.12.2e-25268.0Glycosyl hydrolase family prot... [more]
AT3G47040.23.0e-24965.4Glycosyl hydrolase family prot... [more]
AT3G47050.12.7e-24568.2Glycosyl hydrolase family prot... [more]
AT5G04885.16.0e-20555.6Glycosyl hydrolase family prot... [more]
back to top
InterPro
Analysis Name: InterPro Annotations of S. oleracea
Date Performed: 2018-06-29
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001764Glycoside hydrolase, family 3, N-terminalPRINTSPR00133GLHYDRLASE3coord: 273..291
score: 7.3E-24coord: 111..130
score: 7.3E-24coord: 87..103
score: 7.3E-24coord: 157..173
score: 7.3E-24coord: 203..219
score: 7.3
IPR001764Glycoside hydrolase, family 3, N-terminalGENE3D3.20.20.300coord: 6..370
score: 2.7E
IPR001764Glycoside hydrolase, family 3, N-terminalPFAMPF00933Glyco_hydro_3coord: 26..354
score: 4.1
IPR002772Glycoside hydrolase family 3 C-terminal domainGENE3D3.40.50.1700coord: 377..599
score: 3.3
IPR002772Glycoside hydrolase family 3 C-terminal domainPFAMPF01915Glyco_hydro_3_Ccoord: 391..600
score: 4.1
IPR002772Glycoside hydrolase family 3 C-terminal domainunknownSSF52279Beta-D-glucan exohydrolase, C-terminal domaincoord: 391..599
score: 1.15
IPR017853Glycoside hydrolase superfamilyunknownSSF51445(Trans)glycosidasescoord: 6..390
score: 1.05E
IPR019800Glycoside hydrolase, family 3, active sitePROSITEPS00775GLYCOSYL_HYDROL_F3coord: 273..290
scor
IPR026892Glycoside hydrolase family 3PANTHERPTHR30620PERIPLASMIC BETA-GLUCOSIDASE-RELATEDcoord: 2..41
score: 0.0coord: 60..603
score:
NoneNo IPR availablePANTHERPTHR30620:SF33BETA-D-GLUCAN EXOHYDROLASE-LIKE PROTEIN-RELATEDcoord: 2..41
score: 0.0coord: 60..603
score:

GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0005975 carbohydrate metabolic process
biological_process GO:0006499 N-terminal protein myristoylation
biological_process GO:0044085 cellular component biogenesis
biological_process GO:0009987 cellular process
biological_process GO:0034660 ncRNA metabolic process
biological_process GO:0009657 plastid organization
biological_process GO:0015031 protein transport
biological_process GO:0031323 regulation of cellular metabolic process
biological_process GO:0060255 regulation of macromolecule metabolic process
biological_process GO:0080090 regulation of primary metabolic process
biological_process GO:0006780 uroporphyrinogen III biosynthetic process
biological_process GO:0009058 biosynthetic process
biological_process GO:0006782 protoporphyrinogen IX biosynthetic process
biological_process GO:0006075 (1->3)-beta-D-glucan biosynthetic process
biological_process GO:0005982 starch metabolic process
biological_process GO:0005985 sucrose metabolic process
biological_process GO:0015995 chlorophyll biosynthetic process
biological_process GO:0006508 proteolysis
biological_process GO:0051301 cell division
biological_process GO:0048767 root hair elongation
biological_process GO:0044262 cellular carbohydrate metabolic process
biological_process GO:0043248 proteasome assembly
biological_process GO:0016579 protein deubiquitination
biological_process GO:0043161 proteasome-mediated ubiquitin-dependent protein catabolic process
biological_process GO:0006633 fatty acid biosynthetic process
biological_process GO:2001295 malonyl-CoA biosynthetic process
biological_process GO:0006090 pyruvate metabolic process
biological_process GO:0009407 toxin catabolic process
biological_process GO:0009651 response to salt stress
biological_process GO:0051788 response to misfolded protein
biological_process GO:0046686 response to cadmium ion
biological_process GO:0006144 purine nucleobase metabolic process
biological_process GO:0006206 pyrimidine nucleobase metabolic process
biological_process GO:0006351 transcription, DNA-templated
biological_process GO:0080129 proteasome core complex assembly
biological_process GO:0006096 glycolytic process
biological_process GO:0006094 gluconeogenesis
biological_process GO:0006635 fatty acid beta-oxidation
biological_process GO:0006511 ubiquitin-dependent protein catabolic process
biological_process GO:0008643 carbohydrate transport
biological_process GO:0009793 embryo development ending in seed dormancy
cellular_component GO:0005739 mitochondrion
cellular_component GO:0005839 proteasome core complex
cellular_component GO:0005829 cytosol
cellular_component GO:0000502 proteasome complex
cellular_component GO:0000148 1,3-beta-D-glucan synthase complex
cellular_component GO:0005774 vacuolar membrane
cellular_component GO:0005737 cytoplasm
cellular_component GO:0019773 proteasome core complex, alpha-subunit complex
cellular_component GO:0000785 chromatin
cellular_component GO:0016021 integral component of membrane
cellular_component GO:0032991 protein-containing complex
cellular_component GO:0042651 thylakoid membrane
cellular_component GO:0005730 nucleolus
cellular_component GO:0009507 chloroplast
cellular_component GO:0005665 DNA-directed RNA polymerase II, core complex
cellular_component GO:0009343 biotin carboxylase complex
cellular_component GO:0005634 nucleus
molecular_function GO:0046872 metal ion binding
molecular_function GO:0003677 DNA binding
molecular_function GO:0004075 biotin carboxylase activity
molecular_function GO:0005524 ATP binding
molecular_function GO:0003899 DNA-directed 5'-3' RNA polymerase activity
molecular_function GO:0003989 acetyl-CoA carboxylase activity
molecular_function GO:0004553 hydrolase activity, hydrolyzing O-glycosyl compounds
molecular_function GO:0003682 chromatin binding
molecular_function GO:0008236 serine-type peptidase activity
molecular_function GO:0004852 uroporphyrinogen-III synthase activity
molecular_function GO:0008422 beta-glucosidase activity
molecular_function GO:0036459 thiol-dependent ubiquitinyl hydrolase activity
molecular_function GO:0008234 cysteine-type peptidase activity
molecular_function GO:0004185 serine-type carboxypeptidase activity
molecular_function GO:0003843 1,3-beta-D-glucan synthase activity
molecular_function GO:0008270 zinc ion binding
molecular_function GO:0004843 thiol-dependent ubiquitin-specific protease activity
molecular_function GO:0005515 protein binding
molecular_function GO:0008242 omega peptidase activity
molecular_function GO:0004298 threonine-type endopeptidase activity
RNA-Seq Expression