Spo30040 (gene)

Overview
Name: Spo30040
Type: gene
Organism: Spinacia oleracea (Spinach)
Description: Carboxyl-terminal-processing protease, putative (EC 3.4.21.102)
Location: Super_scaffold_120:1512037..1538854 (+)
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAGACTATGTTCTCTCACTTCACCCAACCCCTGCAACACCACCATTATCATCACCAACAACAAAAATGTCGGAGTAGCCAAGAGCTACACCAACAATTGCAACCCTAATTTCAATTCCAATCATCAAAACTCTCCTAAAAAATCTTCAATTGGTGCTGCCGTAGCTGCTGCTGCACTCTCATTCAGCCTCTTACTATATCCACCTGTTTCGATTGCTGCTAATTCGCCGGCTTCTGATCAATCGTCATCGTCTGATGAGTATTGCCGCGAAGATGGGGGATTATTGGGGGATGATATGGCGTTATCGGCGCCGAAATTGGGGACGAATGAGGGTATTGTGGAGGAAGCTTGGGAGATAGTGAATGATAGTTTTCTTGATACGGGTCGTAGTCGTTGGTCGCCGGAATCGTGGATTGTAAGACCCTCTTCTCTCTCTTTGATTCGGTTATGATAATCGATTTATATTGGTTTATTTTGTATTTTGATGAAATTGCATATTTTTTTTTATTTAGTAATTGAGTGATTGCAGAGAATTAGACTGGATATTGCTAGAAATGCATTTCCATGGTGTAGTTCTGGTGGAGGAAAACTCTTCGAATTTGAGTTTTGATCATATACCTCACTATCTATAGTTACTTATAGTTGGAAAATTTAAGAAAATTGATTATGATCAATGGAAATTAAGATTAATTGGGAAAAGGGTAATGGAAATAGATTTTTGTGGAGCAGTGCATTAAATTATAAAGACATTTTTTTTTTCTTTCTTTCTGGTGTGGTGCGGATGAGGCAGAAGGGCAATAAATTACAAATAGCGGTGGGTGATTCGAACCTGACCTATCGTAGACAGACCCTCAATCTTAATCACTAGGCCAAGACCTCATTGGTCTTAATCACTAGGCCAAGACCTCATTGGTCTTAACCAATAGGCCAAGACCTCATTGGTATGGAGACATTTGATTTGAAACAAGCGAAAATGGGAAGGACATTCATTATGAGATGAAGGGCCTAAAAACTTATTATTGTATCTTATTATTATTTTATACCATATACCTAATTCTTAAACTTCAAACTGCTACGTTTTTAATTGCTGCATCATTCTGCTCTCCAACGTGCATCAACTCATAATAGTTTGTAAGCTCAAGCAGAATGTATGGTGCAACATCGACGCTGTTGAGCTAGAGTTCTAAAGATGGAGGTGTAATGATTAATCAAAAATAGGATTTTGAAGAAGTCTTTAAAGCTTTTCATCTTGTGTGCTGTGTGAAGGAAAGTGGATTCCGCGAACACTGGTCCGCTTGGAGGGTGCAACCGGCACGTTCACGGAGGGAGCGATTAGTGCTAGCTAAGGGAAGGAGCTAACATAAAAAGAGAAGCTTTTTAGCCGGAGCAGATGAAGATTGTCGGCGCGCAGCTCCGATGAGGAGGAGCGGAGCGAAGTCTGTCGAAAATCGGCTATTTCTTCCTCTCGAGCCGCATCAATTTCCTTATCTCATTCCATTTTCTACTTAAGACGTTTGGTTTGTTGAGCATTTGCACCATCCATTTCAAATGGAATTTGTTTCGTGGACACATGAGCAATCATAGTTACCATTTGAAATGGATGTTACCTTTGCTTACTGAACATAGTCAATATGTTCTTGCACCTTAAGTTATGAAATCTTCATGTGCCTGCTCATGAAATTTCAAATGAGCAATATTATCAAGCAAGATTCAATTTAAGAGTATCTTCTTTGACCTTGTTGTTGTGGAGTGTGGATTCAACATTTGGCAGCACCGTCAAGTTATGAGTGTATCACTTTGGTACCTTTTCTCTTATTATGGGGAAGATCACTAGTGTTCATTTGAAGTTTTAATGAGGTGGATGATTTCTGAGGCTCATGTGTACATGTATATATATATTTGTGTCTAAGTATGCATGAATTAGTTATATTGCTTATGGATTCCTTCAAATTTTCTAAGATCAATTATGAAGTGTTCTATATATTTGTGTCTAAGTAT
GCATGAATTAGTTAGATTGCTTATGGATTCCTTTAAATTTTCTAAGATCAATTATGAAGGGTTCTATGCATGTGACCTTGTTGTAAGCCTTCCAAGTACATCATACATGATCATGACCTTTGTTTTGAAATAGGATGACTAAATTTTCACCGGTTGTTTAAAGCCCATTTGTCGTTTGCTACTGTCCACTAAATTTGATGCGCTTCACCTTTATTGTCTAGTATTCCTCCATCCCAATTAATATGTCCCACATTAACCTTTCTTTTTGTCTGAAATTTGAGTTTATCCAACTAATACATACTTCAAGGAACTTCTCTTTGTATCTCCTACCATATTATCCTTCCTTATTTGTATACAAAGTTTAAACGGGGCTCTTAGTTTGGGAGAAAGGGAATAGTAAATTTCTTTGCCATAGGGTGCCTGGCTTGGTATGTTTTGATGAAACCTTAGTAGCTAAAGAGGTCATTGGGGATGTAGCTAAGTATGTTACTAGAACTTATGAATACCTATTGCTCTTGAGTATAAATCATTTTTTGACCTATCATAGGAATTCCAATTTAGGGAGAGCACATGATTTTACTTCCCTTGTGCTTTGATCTTTATCAACATGCCCTAAAGTGACGGGTAAACTTATGGGTTAAGAGAAAAGGTGAAATGGTTCTCACTGAGTGTTTAGTAAGGCTTCATCAAGGTCCACGAGAGTAGATAGAGGAGGTGGGCTCTTTTGTCTTTATAAGATTCTTGTAATTTAAGTGGACTGTCGGAAGTCCTGAGATTCATTTTGTGCTTTTGGAGTTGTTAGATGAGATATCCTCCTTATGCTTTCCTTGTGATGTGATTGTGGATATTTTGAACTACTTCCTCTATTAACTAATATATTGCCACATTTAGAGTTTTACACTTTCTAACCGTTAATATGTTTAATTATCTTTTAGTAAAAATTCTAAAGATTTGATGTTTTAGAAATATAGATCGAGTGACAGGCTAAATCGCACTCCTATCAAAGATAAACCTAATGTTCACCAACTAAAAATATAGCAAGGGAAGTAGGGATCGTATACACAGGAAAACAACGTTCTTTCTACCTAGGATGCTCGGACTCGGGTACGGGTTTCCGACTCGGGTACGGGTCCAAGGGTCGGACCTGAATTTTTTTTAAAATCAAGGACTCGGGGACACGTTTTAAAAGTGTCCAAAATTTAGAAACGGACTTGGGGACACGGCAAGCATTTAAATATTTATATTAATCCGCATAAATATAATAAATGAAAGTATTATTTTTGTGATTTTAAAATAAAATTTAAAAGTAAATGTTTTGTAAGTTGAAACTACGGAGTCATGTGATGATGTGATCTCTTTTCCCAAAATAAAAGTGGTTTAGAAGTCAAAAAGAACGAATATCAAAACTTATTAAAAAAATAAAAATTTACTGACAGTTGTACTGTACCACCTATTTACTTTTTTTTCTTTTTTTTAGTGGGACCACACCTGTTTACTTACCCAGTTATCCCACGTTATTATTTTAAAAAGAGATGAGTAGTAGACAGTTGTACCTTACCTGTCCCACACCTAATTTTACTTATTTTTTTTAAAAAAAAGAAAATCTAAGTTTTTTTTTTTTAATTTTAACATCGAACTTATCGACTTCCATGGAGGTCGCTTCCATGGAGTCTGTCTGCAGCATCCAAGAGTATCAAACCCTGCCGGAGAATATTCCTGGTGAGCTCGCAATTCTCAGTTGATTGCCTTTCCCTTTGTTGAAACCCCCCAACCAAACGCCCCCTAAGTTCTCAGTCTCTGTATCCCTGATTTAAGGCCTCTATTTTTAGATCTGGGTTTCCAAAACCCTAAAAAGACGAGTCCATTTTCCCCATGGCGAGTCCGGTCCGGATTCCGTGTCCGCACCCGCGTCGTGTCGTGTCGACACCTGTACGGCCGGGTGACAGGAAAAGTGGTTAGTTTGAATATTATAACTACGACAATAATTAAATCACGATAAT
ATCAATATAAAAAGGTCTAGGGCATAGGTTCACCAATGAATAATACTCCGGGATGAGACATAACAATTAACAATCAAAGAAATCATTAATCAGACTGACATGCTATCTCGAGTCCATATTAATCATAAAATTAGAAATAACACTCTCACAATTTATTAACACTAATTCTATCTATTGAAACTAGCCTTAGCATCGAGTTGCATCTCTCGATTATTAACTCAATGCTGCCCAACTATAACAATTAAACCTGCCCAAGTCTAATTGCATGGTAATGAAATCAATCAATGGGAAATTAATCAACAACACCAACAATCAGCAATAATAAATCATCCCTTCATATTAATTCATGGATTCCCAAAACTCTAGAAAATAGACTACTCACACATAATAATGAATAAAACAATTAAAATTAGGGAAAGCATGAAATACATAATTAATTAATAATACCAATCTGAAGAAATAAATGTCGAGTCTTGAATTGAAATAACGAGTGTAATCGACAAAAGTTTAGAGCGATATAGATACCAAAAACTACGAAACTTTTAATACTAGAATAAAACTAAGGCCGAGTTAAAGATGTCCCATTAAAAGAAAACTAAGTTTAATATATAGGCTTCCCCTAAAAGCGTAAGAAGAAAACTGACTAAAGCAGGGAAAAGCAACGAAAACCGGTCGGGTCTCGAAAACCGGCCGGTTTTCGGCACAACAAGCTGTTCTTGATTCCTGCGTACTCAAATCCGCTGCGATCGGGTTTCAGAACTCGGACGGTTGCGATTGCATCTCAGATATCTCCTTGTGCTCTAGCCCGGTCGGGCGGAAAACCGGTGCCACCAGTTTTCGTTGCATTTTGTCGTATTTCCGTGGGCCTTGAATCTTAAGGCCCAAGCCTAACTCCTTTGGGTCTAAATTGTCGTTTGTGGCCCCTTGAAGTTGGCTAGATCACCCCGTACATGTTCGGGATTCGTGTAGTCATCCAGGAATAATAGAAACGGGTCTCAAAAGACCAATTACCACCAAAAAGCGGTCCTGCGACTAAACATACTACACAACACATATTAGCATTAAAATTGGTTCCGAAGAGCTCAAATGAGAACAAAAGTGCATACGGGACGGGGATAAAAACTCTATAAAATATGAATGTATCAAACTCCCCCAAGCTAGACCTTTGCTTGTTTCTAAGAATAGACATCATTGATGAATAAATAAATCAAAACTTCACCCCAAATACATTTCAAACCAATAGATACGATTCAAGGCAAATCCAACAAATACGCATCAATGTCAACCAAACAAATCCATCCGATTACCAATAGCAATTGTCAAGCAGGGCAAACCGCAATTGCATATGGAATCCATAACTAGTGCTCACACACTCTTCACTCATTTATGTGGCGGGTAAAAAATGTCCCCGATCGTTCTCCAACCAATCTANTACAATAGCAATTGTCAAGCGGGGCAAACCGCAATTGCATATGGAATCCATAACTAGTGCTCACACACTCTTCACTCATTTATGTGGCGGGTAAAAAATGTCCCCAATCGTTCTCCAACCAATCTATGCGAACAATGCCCTCCCAAACTCACAACACTCGTGTGTTTAAAAAAAAAAGACATACACTCAACCTAATAGTCGATTCGGAATATGTCACGTCATTTGTTGCATTGATAAACAACTAATTTCACATTAATACAACTCTATGCACAAAGGTCGTAAGGTATTCAAAGCTTGTAATGTAAGGCATAGGTAAGGGGTATGGTAAATTTGGGAATATGTGAGAGTAAAGCTTTAGCTTTGGGGAGCCAAAATCCTAATACGTCCCATGGTTAAAAATAAATAATGACCATAAACTCTCCCACACAACTCTACGTATAATCACCAAGCAAACCACACAAAATTTTCCACTAATTTTCATCCGTACAATTGATCACTCATAAAAATATAAACTTGTAACACTTGAAAATGAAACCATGCTCTGCACTTTTCTTCTTTTGAAAGTAAT
GATTTTTTTTTTCAATCTCACAAATTCTTTTGAGGAACCTTCACTAGAACATTTGAACATTGATAATTCTTCTCATTTTTGCTCTTTTTTTCTCTCTTTTTTTTCTCTTTTTTTTTCCACCCAAATTTTGCATGAATGAAACACACTACAACTCCCTCACCTCACTTACTAGTAGCTCCCCAAAGCCAAGCTAATGTGGCTAATAAACAATAAAAGGGGCACATGCAACAACATGGGTAGCAAGAAGGGAACAAATGTATCTAAAATCATCTGAAAGCCACCTAAAAGAACAAATGTCTTCGTCCCCCTCAATATGCATGTATATGAGTCATAAAAACATTACTAAAAGTTGTAAACCAATACTCAAAGTCTATGACATACTAGGAAACTAACACATATAAATTCCTAACCAAATCAACTAGTCAAGTACCAAGTTCACAAACAGTTGGACAAAGTTAACTCATGCTAGAGCATCAAAACAATCGAATATCAACTCATAGCTAACATGTCCACAATATTCATCATTAATAACCAAGAATAAAAACAATTTTCGAAAAGATAATTTCATCAATATTCATCACAGGGAAGCAAGATTTAGTATTCATCGGAACGTTTTTTTTTCTTTTTCAAACCAATAAAGCTAACCTAGGAAGCAATAAAACACACCAAAAATAGACTAATCATACCAAAAAGCAAGAAATAGAATGAAGGTATGGAGGAAGAGAAATCATACCATATATACACGAAAAGTGAACACCCCCCAAGCTAAATCAGACAATGCCCTCATTGGCTCAAACGGATGTGGAGTCATCTCCAATATCATCATCATCAACGTCATCATCATCACCTCGGCCACCACTCGTCCCTACTCCACTACCTCCTCCGGCACCATAACTCCCTTGCCCACCTACATGGTTAGTGGGCATATAAGTGCCATGGGTGTCGGAAACTCGCCCTGAGTGGCGTAATAATCGCATATGGGCCTATGGGGAACATGGCTCTTTGTTGATCACTCTCAAAGCGATCAAATCGACTCTCGGATCGATCGAAACGAGAGGTGAGAGCCTCTTGGTGGAGTTCTAATTGTCGGGACGGAGCATAGATAGTCTCTAGGTCTAGGGGACCGGGCTCACCAGGTTGCCCGTATCACGGAAATAAGCGTGCCGTGGTATAGGTTAGGAGCTGTATAGTTCCAGGAAGTTGCTCATCAAGGATTGCCATATATGTAGTTTGCTTGATCAGGAAGGAAGACCGTGAGATTATTAGGCATTGTGAAAAGCCCTATTCCATTAACCATCCATAAGATCTCACCGGCTTCAAATTATTATGAACTGACTTGATCCATCGGTCCACCCTTGCCGTTCTAGATATGCAAGTGTGATAGTATTGCCGCCTACAACATGGCCATTGGCCGGTGGAACAAAGTCTGCAAGACCCATAGCAAGATGAGTGATCATGCCCCCAATCATGATGGGGGTGTGGGATCCTGACTTCCCGGCGGTGTGGAAGTGGCCATGTGGTGTGTAACATTAATCTTATAAGGATTGGTACTATGGTAGACCCAGCTGCCCATAAGCATCAGTTCAGCACTGCGGACATTGCCGGGCTCAAACCTCCCAAATATGGACAACCCGAAGAACCATTGCCCCAACCTAAGGATAGGGTGTTGGAAAGCAGTAGCATGCAAATGTTACAATCCCCTGTCAAAGTATTCCATATTGTAACCTATTGGGGGGGGGTTCTCTAGGGTATTCCGTTTTCATGCCAAACACACGTGCGAATTGTGCCAAAGTAATATGAAGCAACCGAAATTGCACATGTGTGATTTCACCACTACTAGCTTTCACCACCTTAAAACTACTCAAGCATTCAAGAGTCAAGCTAGCATAAGTTTTCCTACCAATAATAAATCATCCCTTCATATTAATTCATGGATTCCAAAAAACCCTAGAAAATAGACTTCTCATACATAATAATGAACAAAGCAATTAAAATTAGAG
AAAGCATGAAATACATAATTAATCAATAACACATATTGAAATAGAATAGATAAATACCAATCTGAAGAAATAAAGGTTGAGTCTTGAATTGAAATAACAAGCGTAATCGACAAAAGTTAAGAGCGATATAGGTACCAAAAACTACGAAACTTTTAATACTAGAATAAAACTAAGGCCGAAGTAAAGATGTCCCATTAAAAGAAAACTAAGTTTAATATATAGGCTTCCCCTAAAAGCGTACGAAGAAAACCGACTAAAGAAGTAAAAAGCATCGAAAACCGGTGGGGTCTCGAAAACCGGCCAGTTTTCGGCACAACGAGTTGTTCTTGATTCCTGCGTACTCAAATCCGCTGCAATCGCGTTTCAGAACCCGGTCGGTTGCGATTGCATCTCAGATATCTCCTTGTGCTCTAGCCCGGTCGGGCGGCAAAATCCGGTGCCACCCGTTTTCTTTGCATTTTGCCGCATTTTCATGGGCCTTGAATCTTGAGGCCCAAGCCTAACTCTTTTGGGTCTAAATTGTCGTTTGTGGCCCCTTGAAGTTTGCTAGATCATCCGGTATATGTTCGGGATCCGTGTAGTCATCCGGGAATAATAGAAACAGGTCTCGAAAGACCAATTACCACAAAAAAGCGGTCCTGTAACTAAACATACTACACAACATATATTAGCACTAAAATTGGCTCCGAAGAGCTCAAATGAGAACAAAAGTGCATACGGGACGGGGGTAAAAATACTATAATATATGAATGTATCATCGAGATGAATCTAATAAGTTTGAACATGACTAACTATATATTTCCTTACATGTAAATTGGAATAAAAGGTTAAAGGAGAGTGCTTGAATAGTGTAGAAGTCGAAGCGGTGCAATATTAAGAGAACGAAGGAGGTATATAGTATTTCACAAGGGAGAAACAATGGCAGGTTGTCAAATACGAGGTTTTATCGGGACATTCAAAATGTGTTGACGTTACTAAGTTTTTTTATTTTGATGATTTGAGAGTTTAGAAAATAGATAGGCTGGCCTAGCATCACGGGCATTTTTGATAACAAATTTCCATGTAAAAGATAGTGTTTTGACCCAATGGAAACACTATATAAGTTAAAAGTTGTTTTTGGTAAGCTAGTGGTTAGATGTTGAAGTTACTTAAAACTTTAGGTAACATATGCTGGTTTTTTGCTACTATTACCCATGAAATTTGAACTAAAATCTAACTTTAACACATATTCAGACAGCTATCAGCTAATCACTTATGGTTGATTGAATAGTCAACCGTTGGTAGAAAAGCTAACACCACTTTTTACTTTTAAGTTTTTAACATACAAGTTCTTAGGGGCTAGGGTCATGATCAAAGCACAATTATGAACGCCAGCATGTATTGAAGGGGTTTATTTTAAAATTTCATAAAGAATAAGTTCAGTGAAGGAAAAAAATAGAATCAAACCACCGCACCCAGAACGTGGTATGAAAGATCATTGACTTGAAGAAGAAACTACAAAGTACCAAATATTTGTATTTTTACCTTGATGTAGTGATAGAAGTGCAATCTGGATGATGATGCTAGAACTGACTTCAATTAAATGAAGGGAAAATATTTTCAGTTAAAGAACTAGAAACAGAAAAGTCAGACCTAATGCTCGTCTTTATGTCACTTGTTTCACTGAGACTATATACCAGGTATTGCTGCATAGGAATTCTGTTATCTTCAAACAGAACACTAATCCTGTAGTAAGCATTGTAAAGGAAATCCATGCCGTAGGTTCCCTGAAGAGTTCAGGAAGCTACTTGTGGTGTAATGTTTGCTGCAGCTTGTTGCTGTTTTTGATTTTGTTGTTCTGAGGTCTGAGCTGTAATACTTGTCATAATTTTGACAAGGCTGTCCGGTTATTAATCAATTTACTAATTCATCATTATCATTACCCCATTGCCTCAAAGAGGCTCCCGCAAGAAGCGGGGTAAGGGGGGTCGGACATACGCAACCTTACCCCTGTAATTGCA
GAGAGGTTGTTTTCAATTGACCCAAAAGCGATAACAGGACGAGACGGAACGACCATCTTCTACTTCATGGAAAGGAAGCGAAGAGGTATTAAGAGCCATTTTACTAATTGCTAGAATATGAGAAAAAGAAAAGATTGAATAAGGGATTAAAAAGGGCTTCTGATGTCAAATGTGAGGGTTAATTTGGACGCCCATTGTAAATCTATACTAGTATTACCGGAAGTTTTGCATGCCACATTGCCCCATGTCCTCCCTTTGTATTTTCATTTTCATTTTATTTTGAATCACTTGTTCTCCTCTCTCTCCCAATTACCTTATATTCCCCACCCCCACCCCCAACCCCACCCCCAAATTCTCTTCTTCCATGTTTATTGACCTTCCCCAACCAATTACGTCCAATTCAACAATTTCACCTCTTGATTCCACTTCAACAATTTTAATCAAACTCCAAACTCATTCTTCATTTCTGCTATTACGTAGACAAACAAAACTAATTTTAAACTAGATGCATGGGCTAAAAAACTAGTTGACTTCAACTAGTGACTTAAAGTAGAAACTTTAGATTTTGCGTTTGTGTGTGTGTGAGTGATTGTGCATGCTTCAAGAATTTACGTATGAATGATATATTATGAATTTCAATTACTTGTCTATTGCGTCCCATTTTGTGCTGCGATCAACAAATTTCATCAAGTTTCTGACTGGCTAGGAGTTTCCTTGATGCTAATAGTAGTTTGCTCCTTCATGTTTGGACTTGAAAGAATACTCTGGCTTGCAAACTTTTGATTGTATATTGTTAGTTTCTTTCTCTTCTACCAAATCTACGTTTTCTTTCTAAAACATATTTCACTTATGTATATAGCGCAAGAAGGAAGACATATTGAGCACTTCTATTAAGACTAGATCAAAGGCTCACAATATCGTTCAAAAAATGTTGGCAAGTTTGGAAGATCCTTATACACGTTTTTTGTCTCCAGAAGAGGTAATCGGGTTTAAGGATGATGTTTGTATAGTTGTATACTACTCTTTCTGTTTCACTCTAATTGTCTGAGTTTCCAAATACAAACTCCACTAGAATATGCTACTATTGGTTATTCATGTAGCACATTTAATATGCTCAAACAGTTCAAGTGATACAGAGGAGGGTGTAGGTGCTTCCAAAATCTATGCATGTACTGTCTGTTTGATGTCTATGTCTGGTTACACTAGTCTGATAACATGAGAATATTAGGCTGATGTGCTGCAATTCTGAGGTTCTAGGCTTCTAGCTTGATTGGTGTGCTGGATACTTTTCAAAGCTTTAATATGAAACGTACTTGATTGTTTAGGTTTTGTTTGCAATATCTTTTTCTCTATTGTCATTTGTCTTGCGTTCCAATGCCCTCGTATATGTACCACCTATTTGATTGCATGTTTAGCATTTCCCAATGTTTTAAATAGTGTCAGATTGACGGCTACATGCATTTTTAGATTACAAATTTTGTCCTGGTTTATAATTTTAGTAGATTAGTCCCAGGTTTTAACTCACGCATGTTGTAGTATTTTTTCCATGGCATGTGAATCATTGATGGATGAAAGTTCTCGAAGATGGCTAGATATGACATGACTGGTATTGGAATAAATGTCAGGGAGATGCCTGACAGCAGTGGAAGCTTCAAACTGAAGGTGCTTGGACTCGTATTGGATGGGCCTGCTCAAACTGCTGGAATTAGACAGGTAACTAAAACAAGAGTGCTATTCTTTAATGGTTATACTGTATTAATATCTTTATAAATATTAGAGTTGTTGTAATAGTGTAAAATATTATCAATAGGCTTTCTATTGTGTGTCAGTCATAAATTGGGCCTTTAGCATTTAGCTTAGATTTGGGATTCTGTTGCTTGACTCGGTTTCGAACTTTTCTTGTTGGACTCGATTTGAGATTCTGTTGCTTGAATCGGCTTGATTATCTTGCTGGAGCGTCCTGGGTCATTAGACAGGAATCCGAAAATTAAAAGAAACAA
AGAATATTACTACTGTGTAAACAAAGAGTTTGAGAGTAAGTGAACCATTATGGCAAGGAAAAGTTATTCAGCGGGCTAAGAAGATAAAAAATTTGGAACAAAATATCATTTGATTCGTCGATCCTCAAAACAAGAAGTAGGCAAAGTAACGAGTGATAAATGGGATATTCATGGAAAGTTACAATCTCCACCACGTAATAGTTCACCTACGCGTCTTCATCGATGTTGTTTTTTGACGGGCCGGTTCGATGGCTCTATATGAATTAGCTGTTTTTGATCCCTCTGACCCCGTTCTTGATCCAATGTGGAGACAGGGTATGTTCGCACTTTTTTACTTTACGGTCTTTTGTTAAAATGGAAGAGCAATCTCTTTTCACGTGGTACAAAAACATCTCTTGCACAAGGTATAGAGAGGGGTCATTGTAAACTTCCAGTCTAAAGGGCGTTTACAGTGGAGAGAGAGGCTTGTGCAGGTGATACTTTGGACAAAAAGAAAGAGACTTGTGCAGATGATACTTCCCCCATAAAGAAATCTTATATGTTGGATTGAAACGAGCCTATTATAACTGACTGAGAACCCTTCTTTGCCTGTTTTATGTCAAATTTGTAAATAGTGCATCATAGGCTGAACTGAAATACCATTTTACTCTAAGTTGAAGGAAACTCAATCTCAGATAGCTAGGAATGGAGGAGAGACAAGGAGGGATGAAAATTCCATTCAAAAATGAATTGTCAATAGGAGTAATCATGGTGTAAAAACTCGCCCGAGTTAACTCGTATCACCTGAGTTAACAAGGTTTTGGAGGCTAGCGATACGAGATATCCCGAGTTGTAAAGGTGCTTTTAAAAACGGCGAGATCTCGGCCGGTTCAACCGATCCGAACGAATTTTGTCCGCATTAAGCGTTATAAACCTCACATCTACTCGGTTTTCAGCCGAATTCCCCATGTTTTCGACAAAAAGAAAGGGCAAAAGGAAAGAAATGGAGAAACTCCATTGAAATCGAGTAGAGAACTCCATTTTCAAATCTAAAACAAGAGAGGAGAGAGAAGGTAGACAAAGGAACTCATTTTAATTATGTCACGTGTAGGGGTGTGCACCGGTCCAAAACCGGTCCAGTCCGGACCTGAACCGGAATGACCCGGAATCGGAATCTATAAATTTCTTAGTCCGGAGACCGGTCCGGAATTAGACCGGTCCGGACCTGAAAATTCCGTTTCGGACCGGTTTTTGTCCTGTTGGACCTGTTTAGCGATTTTCCTTTGAAAAATATTTTCCTTCAAAAATTCAACAGGCGAGTAGAATACTTAATTTCACACCAAATTTTCCTTCATAACATACAGTTCATATACATTAATTTTGATCTATAAAATAAACTTATACCATAGACTTACAAATTTTGTTCTATAAGATAAACTTATACGAAGTACCATACATTCAATATTCCATTAGCTTTAAAAGCGGATTTAATACCTAAAAAGTAACAACATCGATTTAGAATGACCAAATGATTAAACTTGAATATTAGTTGGATAGATTTGTAACATCAGAACTAAGAACATCCATAAGAAGATTTACCTTTATTTATTAAAAACATCAAATTACATAAAAAAAAGGGGTAAAAGGGAGTGGAAGGTGAACCGATAAGGGTTCGCCTAAATGATTCCGGATCGATTACTAGCTTTAGTTAAGATGAAGGCCACGTGAATGGGATAATATTGCCCTTGACGAACTCACCGATCATCTTTGTAATAGCCGACACTGCCGCATCGAATCGTAGATCTGAAAAATAGGTGGTCTAAGAAGTAGAAAGAAAGTGGTCGGGATAAGAAGTAGAGAAAAACAAAGACTAACAAAGATAGAGGAAGTAAATAATGGAGGTTGGGTTGAGAAGTAGAGAGGAGAAGGAATAACAGAGGAGAGAGGAAGAGAGGAGGGTGATTTAGATCTTGAAAAGTTGGAGGGTTGGCGGAGTAATAGAGGAGTAGGTTAAATATGAG
AGCAGAAATTAAAAATACGAAGCATATACTCCCTCCGTCCCGGAATACTTGACCTGTTTTCCTTATCGGGCCGTCCCTTAATACTTGACCTGTTTCTAAAAATGGAAATATTCTAACAATACTATATTATTTCTCACTCCACCCCTATTAACCCACCTACCCCCTACTCCATACAAAAAATAATTAAAAATTCAACCCCTACTCTCCCCCAACCCCACATCTTAACACATTTCCTACTAACTACATTAAAATAATACCCCACTATCAACTACTACCTATTAAATTAAATAAGTCAATTCAAGTCCCTTAAACTCTGTGCCGGTCAAACCGGGTCGAGTATTTCGGGACAGAGGGAGTATATTTTAATATTTATGTATCCGGGTTTCCGGACTTGACCGGAAAATTCCGGTTTTTACCCGGTCCGGTCCGGAATCCGAAAACTCATTTTCTTTTGGACCGGTGTCCGGTCCTGAATCCACTGGTCCGGTCCGGACCGGACAATTCAGGTCCGGTTCCGGACCGGATTCAGGACCCGGTCAATTTCTGCACACCCCTAGTCACGTGTACGGTTTAGGGGAGGGGGACTAGGGTAGGTAGTGAATGGGGCTGACATGGGGCTTCTTTTTCAAATTTCAAATTCCTACGCGACAACCGCGACACGGTCCGCGACTAACGCATCATTTCCGTTACCGAGACTCCGCGACCGATCCCCCGACCGTGACCGATACCGCATTTTTTATTTATGGGAGTAATAGCCAACTCAGGTTTTAAACTGACGGCTATTATCTTGAACTTTCAATATGGGACAACAGTGACATACACTTGTACCTTACACAAAGATCGAAGCCTAATATTTTAGATCTATGACAGGTTACACTTTCTGGGAATGTGCTTGTCAAAGAATACCCTACGTTAAATTATATTACTCTGTAGTATAAAATCTTCATTAGGCTTCCATAGTTGAAACTAAGTAGTCAAATGAATGAAATATGTTTGAACTCTGCAAGCTTAAACCAATCATGCGAATATGCCATTCAGGGTTTGCTGATTTCAGGTCCTCCATCATATTTCTCTTGTACTTTCGTTCTATATGCTTATGTAAGCTTTAATTTGCCTTCTGTTTTGCAATCTTGGGAATAGAATAGGTTAATGCTGTTGGATATCTGTGTTTGTGCCTGAGATTGTTTTGTCTGGCTTTAAGAGGGTTATGCTAGTGTTTGTTCCAATAGCCATCTGGTGATGGTGAGAGTCTGTGCCTGATTTCTGTGTTTTTTATTGCCTTGTACACATAATGAGAATTGTTATATGGTGGGGAGTTTACCTGATGGAATCATGATAAACTTGTGGCTGATCAATCTGCAGTCTGCCATGAGCTAGTATTTGTGATGAATATCATTGAAGTATATGATATGGATTTTATCTGATGGGACCATATTAAAGTCCGGGTTGACTTTTCCTTAAAAAAAAAGAAGAATTGGATGAGTATTGCATTTACTTAGTATGAATAGCAGATCATGGCTCTCCAGACCTTTAGCCTGATGAGATTCCTTGTAGTCACTATTACATTTTCCGGAGAGATTTATTATAGCTTGGTTCTCTCTTTCCCTTTCCCTTTCACTCCCACTGTTGTGTTTGGTCATGGTTTTCTTTTCCCTCCCTTGTCTTGGGTGTTTTTGTTTCATTGATTTTGTGTGCATGACTGCATGTGCATGTGACTGTGTCAACTGAGAGTTTTTCAAATAGTATCTCTGCAACTATAGCCTATAAGGGTCGGTCAGATTGCACTAGTGCATTATACTACCAGCTTTTTGATGAGAGAAATTTTAGAGCTGTAGCTAATTTTTTCCCCCCAAAGGGCTTCTTCACAACAAACAGTGAAACAAACAGCAGTGACATTACTGTATTGCTTCTAAAGAGGTTCCTCCAGGCGACTAAAGATCCGAAATAGAAGCCCATATATAAAAGGATAACTGGCGACAGCCTCAATATCACATAGCTAA
GCCGCTAAGGTTGTGTGTATCTTAAGGGGTGGGTCTGACTGCACTACTATACTATACTATACTATACTATACTATACTATACTATACTACCGACTCTATTATGAGAACTTTTAGCGATGTTGATAATTTCTTTCTTCTCCAAAGGGCTCCTTCACAAGATACAGCCGCGCTGACATTACCCCCATTGCTTCCAAAGAGTTCATGCCCTTGTGTTAGAATTCAACCTTCCTTTATGTCTATTTTGCTTATCTAAGGCTTTCTAGGACTTTTCAAATAGATTTCTTGGCTTTAAATAGAATTATTATGTTAGTGTAAATCCAATGTTCACTGGTAAGTTTTAGCTTATTCAATATAAACCTCTTTAGGGAGTATGCACCAATACTCCATACCCATAAAAGTTTCCTTGCTTTATTACTTGCGTTGTTTTATACCTTTATAGATATTGTGCTAGCGCTTGTTATAAATCTCCCTACAATCAACAAAGTCTTCCAAAACAAATATATTCTTGTCGTCGGTCTCGCTTGTGCGTGCTGTCCGCACACAAGCGCCCCGATTGCCCCTCGCTCTTGTCACCAATAGGCAAGAGTGCCCATTCTGAATCAAGGACCCGCCAGGGCCCCACGAGCCTTGAGATACACTTCTCGCACAAGACTTTTGCCCTTGCCGCCCTTAGGCAAGAGCGCGCCGCAATATCCGGGGTCAAATCACTTGGACCGTGACTCGGTGGCATGATAAGGATTGGACCTTTAAATGCAGACAACCTTAGCGGCTGATATGAATCTAAACCCCTTAGTTTAGTTAAGAGGTCTAAACTCAAGAACCCACGAAAAATTCACTAATTGGTGCGTTTTATCATTCAGCAAGTACCCACGAAAATTCTAGTCAGCCCGTTAATTGATCAGCAAGTAACCGATATTTCAATGTATGAATCCACTCTTGAACTCACAATTGAATAACGAACCTAAGTGACCACTCTTGATCAATTAGTTCCGCAAGCCTAACCCTAATAACTCCTGATTATTGTCAAGAAAAAAACGCAAGTTCAAAGCTCTCACAAACTTAAGCTAAGCTTTAGATTTTATTGTAGTCAAATTGTGTGTAAAAACAAGCATAAGCACCTCTATGTATAGGGGATAAAGTGTTAGTATTCTGGAAGCTTTAAGCTTAATCCTAAAGCAATAAGGAAACCTAATCCTAGTGCAAGTAAGAAACTAATCCTAGTGCATGTAGGAAACGTAATCCAACTCTAAATAGGAAAGCTAAAACCGAATCCTAATAGGTTTAGCAAACCAACTCCTAATCCTAATAGAATTAGGAAAGCTAACATACATCAAACTGAAATTTAAACTATGCCCAAAATAAGAAAACCGTCCCAAGTGATACCAAAAACTCATGAGCAAAATCTAAATAAGCAATAAAGATATGAAGAGTCTTGAAACGCTACTCATCCACACATCACCATGCATTCTCTCGCATCAACGGCTTAGCCTTGTGATAACAAAGAGGCTGTTAATCTTTTTAAAAGGGCTTCTATGTCAGATCATTAGTCAGCCTGGAGGTATTTATTAGTATATACTGCAGATGAAGTCCCCATCTTTTCTTTTGTTTCTCTTTGTTATCTTTGTATTTCTACTTCCCTCTTGAGCTTTACCTATTTTGTATTATGTACTATATATCAGAACTATGATGAAACTATGAAAGTGGTGGCTCTATGTACGTTATAGCCTATGGCTTTAAATATATCATCTCTGTGGTAGTGTGGTGGTAAATTTTATCTTTTTCAGTGTTACTGTGTTAGTCTCTGAGGCATTCTTGCAAGAAAGAGACGATTTCTACATATGCAGGCAGAAACTTCACATTATTTTCCATTATCTTTTTGATAGGGTGACGAAATCTTGTCTGTTGATGGGGAGGATGTGAGGGGCAAATCAGCTTTTGATGTATCGTCAAAGTTGCAAGGTCCCAGTGAAACTTTTGTAACAGTTGAGGTGATATTTCTG
TGTGTTAAAGTATAAATTCTATATCTGAAGACTTCATTAATCGCGTCACCTACTAAGTAATGAAAATCAAAATTTAGGTGAAGCATGGCAACTGTGGTCCTCTCCAGTCTGTCAAAGTTCAGAGACAGCTGGTTGCTAGACCTCCAGTGTTCTACCGGTTGGAGAAAATTGACAATGGTGCAACCTCTGTTGGATATGTGCGTCTTAAAGAGTTCAATGCACTGGCCAGAAAAGATATAGTAACAGGTAACTACTTCCTCCGTTCTTTTTTGTTGACACGTTTTGACTTTTGACGCTATACAGATACTCTACTTTACTGTCTTTTGTGATTTATACTTCAGAAAAAATATGGTTATGTGAGATCTTGTTACATTCGTCTCATTATATATTTTCAGTATCTATTTAAAAAAAAATTACTTGTCGTCAATTAAAGATATTAATGATTGCAATTGTGCATCGGTAAACGTGATAAAAGAAACGTGTCATCCAAAAAAGAACGGAGGAATCATTACTTTGTATTGATTAACTGTGATAGATAGAGATTTTTGAGAGAGCAAATAATGTTGAAGTGTGTGAGAATCTTTTGCTTACTGTAACAGTGTCTGTAAGCCCTTTAGAAAAAGTTTGTTTTGAAAGAATCAGCGAACCTTTTTATGTAATCAAGAAGTATTAAGCCTAATAAGAAATATTTATAAACTAGTGCCGGCTGGTAATTTTTTAAATAGAAGTTTAGAAAACCACCTTTTGTACTTTCTCTGTTCCAGAATTATTGCAACACCTGCCTTTTTAGGACGTTTCATATGAATTGCAGCATTACTTTATTTGTATAGTGGTTCCCTCTCCTTATCACATGGGCCCTATTTTTATCACTCTCCTCATTATATGGGCTCCTAAATACTTGCACTTCCCACTGTGTTGCAAACTTGCAATTAGTTTTTGTTGACAGTTATTGTCATTTTCCATTGAAAACACCCAAATTGCATTGTTATGAATTTAGTTGCAATTTAGCCAAGAAATAAAGGTAATTGGGCAAAAAAATTTGTTTTTCATTTCACTTAAAGTAGGGATATGAATCAAGCATTCACACGATCACACCCCAAGGTGTAAATAACTGAATACCAAGTGTAAATACTTCTGAAGAAAAGATGTCTGATGTTGACGCCAAATTAAGTCTCTTATAGTATGGCTTTTGTGTGTCACTTTCCCATGATAATAAGCTACATGTTCTTAATGGGCCTGACCATGCCGATTGCTTTTGCCAAGTTACTTTTGTTGAAAAGCATAAACATGCTTCTTTTCAGTGAACTGTGTATAACTGTATATTGTCCCAACTCACAAGTGATTGCATTTTCTCAAACATGACTACTTCGAGATCTTAGGTTATTTTCTCAAAGATGTTATTTCCAACATATACTTTCTCAATTCTTTTCTGGAATTGCAACATAAAGAGTAGGCACGCATATTAAGGAGTGTATGTGATGAGGAGAAACTTAAATAAAGGGACCCCATGTGACGAGTGTGGTGTTGCAGTAAAAATAAGCAGGAGTAATATGTATTTTATGTTTCTATTAGCAAAATCTTTTGAACCATGGAACATGACACATATATGTACCAACTTCTAGTTTTTTCTATTATCTGATTTATCATGACTCAAATATAATCTTGCGCAGCAATGAAGCGCCTCCAGGACCAAGGTGCATCATCTTTTGTTCTGGATCTAAGAGACAATTTTGGTGGCCTAGTGCAGGTATATGCCTTTAAGTGGATGTTCATGCGTTATGTGCCTTAATGTTAATGTTTATGTGTGTCTGTAAACATTCATTTAGTAATATTTTAAGGAAAATATTGTCTTACACTAGTTTGCTTTGGTAAGATGAATTTTGTGACTAATTGAGTATTTCAGTCTACTATACCTAGAGTTAGTTAAACCTAATTTCTCTGCACTTTTCTTTTTGGGAACAACTCAAAGTAGATATTTTCTTCTGTTGCTCTGAATGCCA
TAACTCTAAACTGGGTGAGATTGCCTTTATTGTTATTTCTCTTTGGGAGTAATTTATGGAGTCAACGGAAGGAGTTTACATAGGAAAGACCTCAAAAAACTAAACCAAGTTTTTAGTGAAAAAGTTATATTCCTTTGAAACAATCTATTCTAAAATGTTTGTAAGTTGAATTAGTTTTGAAGAAATGACCGAAAAGTGGTCTTATATCAAGAACAAAAAAGCCGGGATTACTTCGATGTATCCCGCATATCCAACGTAAGACAACCATCAGACTCGATTTTTTAATGAAAGTTTTACCATAAGGCTAATTAATGCATGAGGCACATCCTTTTTTCCATTCTTCTTCTCTGTTTGATTTTCCCTCTCTTCCCCTAGTTCTCTTGCTGTTGCTGTTAAAGAAAGCCCCATCTATGAGAACTATAGAGCATTTTCTCTCTGCCCCTCAGAATTTATAACCATTTGTGTGAATCAGTGGCCTATTGGGTGCGTCTTCTACTTGCACCTTGCACCTAGCTCTCTGTGCCTTCCTTGCGCTTTCTGAAACCAAGGTTAGAATGCAGCACTTGTCAGATGAATAAGAACCATTTCTGATATCTCCTTTAATTTGGCTCTCAATATCTCAAAGGAGTGGTTGGTCCTTAAAGTGGAAGGAGTTGGATTTGGATTGATGATTACTATTCCTTCTTGTTGTACTTGTGGTCTTGGTTAAATCTTGCTGACACTGCGTGCTGTATGTCTGTATTGATGCTCGTTGATGTTGAACCATTGGATGATCTTGTCGGTTTTTGAGCAAAAAGGTTGTAGGGCCTGGCTGCCTGCAGCCTGCTATCATCTTTTATTGTTGAAGCCTGTTTCATGATTTTTATATTTTTAATATGCCTACTTCCAGTTCAATGTTTCAGGGGGCGTTTGGTTCGCAAACTGGTATGGGGTTAGAATCAGGAATGTTATCAAGGTGGCATAGGTTTGGAACTTGATTCTTAATACCATGTCTTTGGTTCGATTTGGGGATGAGCTTTTTTCCTTTTGTTTGATACCTGGAGTAAAGGTATGAGCCATACCCACCTCTCCCCCTAGGTTTCTAAACCCCATACCTTGGGGGTTTGAGGTATGGGTTATAACTCTTAAAATTTCCAACCAAACACATGGTATGGGTTTATTTGCAATTCTAAACCCATGCTGAGTTTGTTAGTGTTCCTCATAGTGTACTCGGAAGAATATATGTATATATTACTTGTCTTTATAGATTTGAATATTGTCGTTTGTTTATGTAGGGCTTTGTGAATTCTTTTTGCAGGCTGGGATAGAAATTGCAAAGCTGTTTCTGGATAAGGGGGATACGGTATGGTAATGCTACCATTTGTGCTTGCCTAATGCACTGGTCTTCTTTTTCATCTTCTTCTCTCCCTTCTTTTGTTTGTGCCGCATGTGTTATCTTGTGTAAGCATGATCTTAGTACAGTTATCTACATAAATTACTATCTCAACAATTTGTTTGTTAAAATATTCAATAATAAGATTGAGACATTGAAATATCTGCCTATTGATTTGCGGGGATAAACAAAAACCATATTCCTTTTACCTGGGAAGAAAGAGAGATCATTGAATGACAAATGCTGAAAGAAAAGGTATTTCATGAAAAGATAAAGAAAAGAAATAATATCCTCTTCTTAATGAATAACAAATATACAGAATAACTGATGTGAGGTTCTGGTGCATTTTAACTATTAGGTCTTTGGGAAACAACTACTATAAGATCTTGAATGGTTAGGTTAAAGTGGTTAGGTTAAAGGATAGGTATATCTTGACCTCTCCAAACCTCTTGTACATGGAGAGATGTTATTGGTGTATTTATGAGATTGGAGGGGCTAGGAGTTTGGATGCTTTCAATTCATTTCTCTGACGTGTGATATTTGTATTTAGAAGATATTGGACAGACTAAGTTTCACATTATGTTAGCATGAGAACCAAATAATTTCTTCCCCGTCAACCTACCTGTTT
CCTCTATTTAAGATATTTTTTAAGCAGATCAGTGCCATGTCTATATATTTTCCAAACTCTAAAGACGTTGATGGTAGTGAGAAAAGAGAAGGATATTATACAGTTATTGATCCTTCTTAACATATTAATTCTCATAATTGTTGACGCATCTTTCTTTTTAGCAAGACTTTTAGCACTATCAGCACTAAATTTTACTACTTAGGTACACAATTTGTAGTATTATTGTTTTTTCTTTTCTTTTATGCACTTCCACTTTACAACATAGATCCACTAACGCTTAATTATTTTGTGAAATTTGTTTGGACTATTTGGCTGTGGGATGTAATGGACTAAATCAAAATGACTATTAATACCTCTTCGCTTCCTTTCCATGAAGTAGAAGATGGTCGTCCCGCTTTTGGGTCAATTTGGAAATAACCTCTTTGCAATTGCAGGGGTAAGGTTGCGTACGTCCGACCCCCCTTACCCCGCTTCTTGCGGGAGCCTCTTTGAGGCAATGGGGTAATGGTAATGATGGTAATGAATGTTTGGACTATTTAACAAAGTTATTTTAAAGTTCAAAAAAAATTATAACTACTTATTATTATAATTTTATCGCATGATTAACAAAGCTATTTATGTTTTCTATTTATTTCCCCTAAATCCAAAATTATATGCTTTAGTCAAGTATTGAAGGTCCAAAAAGTACTACTAACTTACTAAGTCTACTATGAAACTTTTTATTTAAATTGAATATCTTTTCATTTTTTTTAAATAATACTCCGTACGTAGTATTATGTTTAGTAACATTTTCTCTTAAATTCAAGCTTGCTCGTTTATCAAGGCTCATGTATTAAAAGCTTGTTTATTGATCTAGCTCGGCTCGAGCTCAAGCTCGAATTTAAACGAGCTATTCGTGAGCTAAGCTCGAACAGTGAAAAACTTAAACGAGTCAAGCGCGTATAGGGGAAATGAAAACTCCGTTAAGCATGGCTGGAAATTCAGGCCTAGCTTGAACAGCCAAATACTCAACTCGACGTGGCTAGTTGACATCCCTAATAATGTACTATCAAAATTTAAATCGGGTCAGTGGTTAGTTTGCTCGGTTTGGGTTGTTTTTCGGGTTTGTATGGTTGTGGGTTTGCAGTATAGGGTGATAATGATAAGTCTCATCTCTGATTGTGAACTCCTGGACTTTCATTGACTCAACAAGTCAAAAGCATTTTGAGGTCCTTTTTTGTGTGTGTCTCGGTTATCATATTTAGTTGAGCTACTTAAATGTTATAATCAAACTTTACTTACTCGTGATAGTATCCCTTTTCAGTATCTATTCTAAATTTCTAATATCTAAAGAAAATTTACAAACAAAGCAGTCCTACTTTCAGATTATATCAAGTAGTACATTTGACCATATTGATGGATATTACAAATGAGCAGATGATAAAACAACTAGTTTTTGAGGGTATAAATCATCATGAAGAGTTGACCGAACAGCTGGAAAACCATGACTCTGATGGTCATATGTTAAATGTATTGCGTAATCCAGTTATGTGTTGACGAATGGGTTGGGGGGGGGGGGGTTTACAAATGCCAATAAATCGTTCTATAACTTTATTGGGGGATCAACAACTTGATGGTTTTTTCTTTTATTACATTGACGAAGCATGTTGTAATACTTGTAAATCGGTTAGATAGAGAGAGATTCATACGAAGTGAAGCAATTGAGTGAAAAAGGTGTTAAAGATTTTGTAATCTTGAAGCCTGTCATAGTCCTGCTAAAATTCTTAATTTTACCTAGCAGGTATAGACTTTGAGTCAATGGCTATTGTCAAATTGAGGGAATTGTATGAATCTTAATCATTTATAGATAAATTCAATTTAGAATGGCCAATATCTAGGCTGTCGTCTATAGGTGGAAGGAATCTAATTTTAGCATATTGCTTGCTTGCTCTATTTACGCCAGCAATTCCTTCCACAAAAAATTGTAGCTCAACCCATCCCAATGCCTTAACCCGGTTTTT
GGCATGGCTTCTGCCCTGGGGATCGATGCCGAATAGTGTGATCTACTTTCCATTAATTTTATTAAAATAGTAAGTCCTCATTATTATGAAGACAAAAGTTCGGGACTCTTACTGCTCGTTGCTTTGTTCTTTATTCTCAGGCCTGCTCACCTTGAAAAGGCGAAAATTGCTTCCTACTGTGATTTTCCGGGCATTGGATGAAAATATTGATTTTTTTTTTTGCTTATGTTTCCTCCTTATGGCTTGTTTAAGTGACCTTAAATTATTGAAGTCATTTAGGTTTGGTTAAAAATTGATTTTCACAATCCAGTCAAATATTATGCTCAAAATTCAAAAATGTTTCAGTCACCTTCAGTTTTTGAGGTCCTTTTGAAAGGTCCGTGAACAACTCAACAACGATTTCTTTGCTTGCTAATTGTCTCCTATCAACAGGTGATTTATACTGCTGGAAGGGATCTGTCTCAGAATGCTTTTGTGGCAGAGTCTGCACCACTCGTCAAAGCGCCTGTCATGGTATGAATTATCAACTAAGATGTGAGATCATCATAAATAGAATAGATGCTTTTTATATGATACATTTCTTGTTGGAGATATCCATGTAGTGATCAGATCTAAATTCAGAACAAAATAGTGTTAAAGCTGCTGGCCTTTGTGCCACCTCCTTCGGCAAAACTGGGATAAAGTTGAGCCAAAAGATCCCGATTTATGATAATTCATGAGGAAGCGGTATCTCTTTAGGATCTACTCCAGCATTTAGAAATTGGTTAATCGGTAAAGATAAAGGATTTGATGACCCGCCCAAGAAAGAGACCCAAGTCCTAGTAGTCCTACTAAATGGTGATTCAACATAGATTCCACATCTTGGAAACAAGCTAATTTTGGATAAAGTTCCTGTTTTCTTCATCATGCCTCATCATTATGCAAAAACGCATTTATATGATGTTTACTTGTTTTCATTAACATAAAAGAGGAAACAAAAACTTTACAGCTTTTGGCTAGGACCTCGATATCCTAGATTCCTAGTACAAAAGTAAATGCAATACAACTATTACGACATGTGCAGTATAGACAATCGTAGCTGGTCGGTTCTAGTGGTTTGTCGTTGCATTTGTTTACACTTGGGCCCTGTTCTCTTTGATTTTTCCTGAACTGGAATGACACCTTAAATCCCAAGTTTGGGGTGACATCCCATATTATTTGAAGTTTGAGTCGTTACTCATTAAGGTGAAGATCATAAGAGCATAATTGGAGTTTCTCTCATAAGATCATTTACTACGAAGTACTAATTAAAATTGATTCTTACTGAATTATTTTTCTTTAAAAATAAAAATATTTAAGGTGAAGATCATAAGAGCAACTCCAATTTTGAGCTACAAGGTGTTGGTGGCTAATTTTGCCACATTAGATTTTTAGCTACAATTAATTTAAGCTACATATTTTCACATTGGTGGGCTACAATAACTTGTAGCTCCCTATAATATTTAATTAAAAAAAATTATTTATTTGAGCTACATTTTTTTTGAACCACCTGCCCAAAATAGCTACAAGGTTTTATCAGCTGCATATGCCATTGTTGCACCAATGGTGGGCTACATGCTCACATGTTCATTTTTTTGGTGAGCAGGTCTATTTTTTAACCATTGGAGTTGCTCTAAGGCCTTAGTAATAATGGAATGTTAAGTGAGTTTGAGGAATTACTAGTTTGGTTAATTAAATGGCAGATTTCTGACAAGTGGTATCCTGACTCAAGGTGTTCATTTTGCTAAATATTATTGACTCTAAGCAACCAGAGATTATTTTATTTCCAATTTAGATCCAATAAAATTGAAGAAGTTTAACGGAAATTTCATTTGATCCCGTTGAGCACATCATTGCTGGTCAATTTTTTTTTTTATTTTTTTATTTTGCTTGGCTAACATTATTTACCCGAGTTTCTTGTTTGAGTATTGGTCCGTGTAACATGTGTCATCAACTCATCGAATATTTGTTGGATTGATATT
TCTTTGACCTGTTATGGTCTTGAGCATCTATTTATTGGTTTTCAACTATTCAAGGCAAAAAAGAGCAGATATGGAGTTTTTATCTTTTCTTTCTCAGTTAACTTCAATTATTGTTATTATTATGCAGGTTTTGGTGAACCATAATACTGCAAGTGCCAGTGAAATAGTAAGTCCCGCATTGCTAGTACTTCTGATAGTTAGACGACTTAGACCATTCTCTTTGGTAGCTGCTTAACTCTTGCATCAGTACAGGTCGCATCAGCTCTTCATGATAATTGCAAAGCTGTTCTGGTGGGGGAAAAAACTTATGGCAAGGTTTGAATTCCAGTTTCTCGCAACTAGTCTTGCTTCAGTTTTTCCTCTTTCACCTCTGTTCACTTGATTTATTTCTTCATGAAGGGAAGGGTTTTTAATCTTTTTATTTTATTTTTTCTTTCATACATGGTCGTTGAGTCTAGATTAAATTGCGTGGTTAGGGGCCATAGACAGATTTTACAAGATTGAGCAATATTATGCGCAGGGTCTAATTCAATCAGTTTTTGAGCTCCATGATGGATCTGGCGTGGTTGTGACAATAGGGAAGTATGTAACGCCTCACCACTTAGATATCAATGGCAATGGAATTGAACCCGACTTCTCTAACATCCCACGTGAGATTTTATGTTATTTTATTTGTTTTATTTATTTAATAAATGGCACTCTTGGCTTATTTTACCTTATATTGAAGCCTCTTAGTACTTACTGTGGTCTTGGTAAGTTTAGGGTGGAGCGAGGTGAAAGAATATCTTTCTACTTGTAATCGATTGACACAAGGATGA

mRNA sequence

ATGAGACTATGTTCTCTCACTTCACCCAACCCCTGCAACACCACCATTATCATCACCAACAACAAAAATGTCGGAGTAGCCAAGAGCTACACCAACAATTGCAACCCTAATTTCAATTCCAATCATCAAAACTCTCCTAAAAAATCTTCAATTGGTGCTGCCGTAGCTGCTGCTGCACTCTCATTCAGCCTCTTACTATATCCACCTGTTTCGATTGCTGCTAATTCGCCGGCTTCTGATCAATCGTCATCGTCTGATGAGTATTGCCGCGAAGATGGGGGATTATTGGGGGATGATATGGCGTTATCGGCGCCGAAATTGGGGACGAATGAGGGTATTGTGGAGGAAGCTTGGGAGATAGTGAATGATAGTTTTCTTGATACGGGTCGTAGTCGTTGGTCGCCGGAATCGTGGATTCGCAAGAAGGAAGACATATTGAGCACTTCTATTAAGACTAGATCAAAGGCTCACAATATCGTTCAAAAAATGTTGGCAAGTTTGGAAGATCCTTATACACGTTTTTTGTCTCCAGAAGAGTTCTCGAAGATGGCTAGATATGACATGACTGGTATTGGAATAAATGTCAGGGAGATGCCTGACAGCAGTGGAAGCTTCAAACTGAAGGTGCTTGGACTCGTATTGGATGGGCCTGCTCAAACTGCTGGAATTAGACAGGGTGACGAAATCTTGTCTGTTGATGGGGAGGATGTGAGGGGCAAATCAGCTTTTGATGTATCGTCAAAGTTGCAAGGTCCCAGTGAAACTTTTGTAACAGTTGAGGTGAAGCATGGCAACTGTGGTCCTCTCCAGTCTGTCAAAGTTCAGAGACAGCTGGTTGCTAGACCTCCAGTGTTCTACCGGTTGGAGAAAATTGACAATGGTGCAACCTCTGTTGGATATGTGCGTCTTAAAGAGTTCAATGCACTGGCCAGAAAAGATATAGTAACAGCAATGAAGCGCCTCCAGGACCAAGGTGCATCATCTTTTGTTCTGGATCTAAGAGACAATTTTGGTGGCCTAGTGCAGGCTGGGATAGAAATTGCAAAGCTGTTTCTGGATAAGGGGGATACGGTGATTTATACTGCTGGAAGGGATCTGTCTCAGAATGCTTTTGTGGCAGAGTCTGCACCACTCGTCAAAGCGCCTGTCATGGTTTTGGTGAACCATAATACTGCAAGTGCCAGTGAAATAGTCGCATCAGCTCTTCATGATAATTGCAAAGCTGTTCTGGTGGGGGAAAAAACTTATGGCAAGGGTCTAATTCAATCAGTTTTTGAGCTCCATGATGGATCTGGCGTGGTTGTGACAATAGGGAAGTATGTAACGCCTCACCACTTAGATATCAATGGCAATGGAATTGAACCCGACTTCTCTAACATCCCACGGTGGAGCGAGGTGAAAGAATATCTTTCTACTTGTAATCGATTGACACAAGGATGA

Coding sequence (CDS)

ATGAGACTATGTTCTCTCACTTCACCCAACCCCTGCAACACCACCATTATCATCACCAACAACAAAAATGTCGGAGTAGCCAAGAGCTACACCAACAATTGCAACCCTAATTTCAATTCCAATCATCAAAACTCTCCTAAAAAATCTTCAATTGGTGCTGCCGTAGCTGCTGCTGCACTCTCATTCAGCCTCTTACTATATCCACCTGTTTCGATTGCTGCTAATTCGCCGGCTTCTGATCAATCGTCATCGTCTGATGAGTATTGCCGCGAAGATGGGGGATTATTGGGGGATGATATGGCGTTATCGGCGCCGAAATTGGGGACGAATGAGGGTATTGTGGAGGAAGCTTGGGAGATAGTGAATGATAGTTTTCTTGATACGGGTCGTAGTCGTTGGTCGCCGGAATCGTGGATTCGCAAGAAGGAAGACATATTGAGCACTTCTATTAAGACTAGATCAAAGGCTCACAATATCGTTCAAAAAATGTTGGCAAGTTTGGAAGATCCTTATACACGTTTTTTGTCTCCAGAAGAGTTCTCGAAGATGGCTAGATATGACATGACTGGTATTGGAATAAATGTCAGGGAGATGCCTGACAGCAGTGGAAGCTTCAAACTGAAGGTGCTTGGACTCGTATTGGATGGGCCTGCTCAAACTGCTGGAATTAGACAGGGTGACGAAATCTTGTCTGTTGATGGGGAGGATGTGAGGGGCAAATCAGCTTTTGATGTATCGTCAAAGTTGCAAGGTCCCAGTGAAACTTTTGTAACAGTTGAGGTGAAGCATGGCAACTGTGGTCCTCTCCAGTCTGTCAAAGTTCAGAGACAGCTGGTTGCTAGACCTCCAGTGTTCTACCGGTTGGAGAAAATTGACAATGGTGCAACCTCTGTTGGATATGTGCGTCTTAAAGAGTTCAATGCACTGGCCAGAAAAGATATAGTAACAGCAATGAAGCGCCTCCAGGACCAAGGTGCATCATCTTTTGTTCTGGATCTAAGAGACAATTTTGGTGGCCTAGTGCAGGCTGGGATAGAAATTGCAAAGCTGTTTCTGGATAAGGGGGATACGGTGATTTATACTGCTGGAAGGGATCTGTCTCAGAATGCTTTTGTGGCAGAGTCTGCACCACTCGTCAAAGCGCCTGTCATGGTTTTGGTGAACCATAATACTGCAAGTGCCAGTGAAATAGTCGCATCAGCTCTTCATGATAATTGCAAAGCTGTTCTGGTGGGGGAAAAAACTTATGGCAAGGGTCTAATTCAATCAGTTTTTGAGCTCCATGATGGATCTGGCGTGGTTGTGACAATAGGGAAGTATGTAACGCCTCACCACTTAGATATCAATGGCAATGGAATTGAACCCGACTTCTCTAACATCCCACGGTGGAGCGAGGTGAAAGAATATCTTTCTACTTGTAATCGATTGACACAAGGATGA

Protein sequence

MRLCSLTSPNPCNTTIIITNNKNVGVAKSYTNNCNPNFNSNHQNSPKKSSIGAAVAAAALSFSLLLYPPVSIAANSPASDQSSSSDEYCREDGGLLGDDMALSAPKLGTNEGIVEEAWEIVNDSFLDTGRSRWSPESWIRKKEDILSTSIKTRSKAHNIVQKMLASLEDPYTRFLSPEEFSKMARYDMTGIGINVREMPDSSGSFKLKVLGLVLDGPAQTAGIRQGDEILSVDGEDVRGKSAFDVSSKLQGPSETFVTVEVKHGNCGPLQSVKVQRQLVARPPVFYRLEKIDNGATSVGYVRLKEFNALARKDIVTAMKRLQDQGASSFVLDLRDNFGGLVQAGIEIAKLFLDKGDTVIYTAGRDLSQNAFVAESAPLVKAPVMVLVNHNTASASEIVASALHDNCKAVLVGEKTYGKGLIQSVFELHDGSGVVVTIGKYVTPHHLDINGNGIEPDFSNIPRWSEVKEYLSTCNRLTQG
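
The protein sequence above is the conceptual translation of the CDS. As a minimal sketch, assuming the standard nuclear genetic code and reading frame 1 (the `translate` helper and its codon table are illustrative and not part of the database record), the first and last codons of the CDS can be checked against the protein like this:

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
# Assumption: the record uses the standard nuclear code; the page does not state this.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing partial codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # "X" for codons with ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 33 nt and last 18 nt of the CDS shown above.
cds_start = "ATGAGACTATGTTCTCTCACTTCACCCAACCCC"
cds_end = "CGATTGACACAAGGATGA"

print(translate(cds_start))  # MRLCSLTSPNP
print(translate(cds_end))    # RLTQG
```

The two fragments translate to the first eleven and last five residues of the protein sequence listed above; the final TGA codon is the stop and is not emitted.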
Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name    Type
Spo30040.1      Spo30040.1     mRNA


Homology
BLAST of Spo30040.1 vs. NCBI nr
Match: gi|902232875|gb|KNA22820.1| (hypothetical protein SOVF_030520 [Spinacia oleracea])

HSP 1 Score: 941.0 bits (2431), Expect = 8.000e-271
Identity = 478/479 (99.79%), Positives = 478/479 (99.79%), Query Frame = 1

		  

Query: 1   MRLCSLTSPNPCNTTIIITNNKNVGVAKSYTNNCNPNFNSNHQNSPKKSSIGAAVAAAAL 60
           MRLCSLTS NPCNTTIIITNNKNVGVAKSYTNNCNPNFNSNHQNSPKKSSIGAAVAAAAL
Sbjct: 1   MRLCSLTSSNPCNTTIIITNNKNVGVAKSYTNNCNPNFNSNHQNSPKKSSIGAAVAAAAL 60

Query: 61  SFSLLLYPPVSIAANSPASDQSSSSDEYCREDGGLLGDDMALSAPKLGTNEGIVEEAWEI 120
           SFSLLLYPPVSIAANSPASDQSSSSDEYCREDGGLLGDDMALSAPKLGTNEGIVEEAWEI
Sbjct: 61  SFSLLLYPPVSIAANSPASDQSSSSDEYCREDGGLLGDDMALSAPKLGTNEGIVEEAWEI 120

Query: 121 VNDSFLDTGRSRWSPESWIRKKEDILSTSIKTRSKAHNIVQKMLASLEDPYTRFLSPEEF 180
           VNDSFLDTGRSRWSPESWIRKKEDILSTSIKTRSKAHNIVQKMLASLEDPYTRFLSPEEF
Sbjct: 121 VNDSFLDTGRSRWSPESWIRKKEDILSTSIKTRSKAHNIVQKMLASLEDPYTRFLSPEEF 180

Query: 181 SKMARYDMTGIGINVREMPDSSGSFKLKVLGLVLDGPAQTAGIRQGDEILSVDGEDVRGK 240
           SKMARYDMTGIGINVREMPDSSGSFKLKVLGLVLDGPAQTAGIRQGDEILSVDGEDVRGK
Sbjct: 181 SKMARYDMTGIGINVREMPDSSGSFKLKVLGLVLDGPAQTAGIRQGDEILSVDGEDVRGK 240

Query: 241 SAFDVSSKLQGPSETFVTVEVKHGNCGPLQSVKVQRQLVARPPVFYRLEKIDNGATSVGY 300
           SAFDVSSKLQGPSETFVTVEVKHGNCGPLQSVKVQRQLVARPPVFYRLEKIDNGATSVGY
Sbjct: 241 SAFDVSSKLQGPSETFVTVEVKHGNCGPLQSVKVQRQLVARPPVFYRLEKIDNGATSVGY 300

Query: 301 VRLKEFNALARKDIVTAMKRLQDQGASSFVLDLRDNFGGLVQAGIEIAKLFLDKGDTVIY 360
           VRLKEFNALARKDIVTAMKRLQDQGASSFVLDLRDNFGGLVQAGIEIAKLFLDKGDTVIY
Sbjct: 301 VRLKEFNALARKDIVTAMKRLQDQGASSFVLDLRDNFGGLVQAGIEIAKLFLDKGDTVIY 360

Query: 361 TAGRDLSQNAFVAESAPLVKAPVMVLVNHNTASASEIVASALHDNCKAVLVGEKTYGKGL 420
           TAGRDLSQNAFVAESAPLVKAPVMVLVNHNTASASEIVASALHDNCKAVLVGEKTYGKGL
Sbjct: 361 TAGRDLSQNAFVAESAPLVKAPVMVLVNHNTASASEIVASALHDNCKAVLVGEKTYGKGL 420

Query: 421 IQSVFELHDGSGVVVTIGKYVTPHHLDINGNGIEPDFSNIPRWSEVKEYLSTCNRLTQG 480
           IQSVFELHDGSGVVVTIGKYVTPHHLDINGNGIEPDFSNIPRWSEVKEYLSTCNRLTQG
Sbjct: 421 IQSVFELHDGSGVVVTIGKYVTPHHLDINGNGIEPDFSNIPRWSEVKEYLSTCNRLTQG 479
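
The alignment above reports 478 identical residues out of 479, so exactly one position differs. A small sketch like the following can locate such differences by walking the Query and Sbjct strings in parallel; the function name `mismatches` is hypothetical, and the input is just the start of the first alignment row.

```python
def mismatches(query: str, sbjct: str, start: int = 1):
    """Return (query_position, query_residue, subject_residue) for each
    aligned column where the two rows differ. Gaps ('-') in the query do
    not advance the query position."""
    found = []
    pos = start
    for q, s in zip(query, sbjct):
        if q != s:
            found.append((pos, q, s))
        if q != "-":
            pos += 1
    return found


# Leading residues of the first Query/Sbjct row shown above.
print(mismatches("MRLCSLTSPNPCNTT", "MRLCSLTSSNPCNTT"))  # prints [(9, 'P', 'S')]
```

Here the single difference is a proline in the query where the subject has a serine, at query position 9.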

BLAST of Spo30040.1 vs. NCBI nr
Match: gi|731373636|ref|XP_010666703.1| (PREDICTED: carboxyl-terminal-processing peptidase 1, chloroplastic [Beta vulgaris subsp. vulgaris])

HSP 1 Score: 727.6 bits (1877), Expect = 1.400e-206
Identity = 386/475 (81.26%), Positives = 415/475 (87.37%), Query Frame = 1

		  

Query: 6   LTSPNPCNTTIIITNNKNVGVAKSYTNNCNPNFNSNHQNSPKKSSIGAAVAAAALSFSLL 65
           LTSP PCN    I NN      ++Y NN N ++ ++  N  K S + A++AAA LSF+L 
Sbjct: 6   LTSPKPCNIFANINNNNG----RTYNNNNNNSYINSLLNCSKNSLVSASIAAA-LSFTLQ 65

Query: 66  LYPPVSIAANSP------ASDQSSSSDEYCREDGGLLGDDMALSAPKLGTNEGIVEEAWE 125
           L  P+SIAA+SP      +S  SSSS E CRED   L   + LSAP + TNEGIVEEAWE
Sbjct: 66  LPSPLSIAADSPPFRQPSSSSSSSSSVEDCREDE--LVAQLGLSAPNMVTNEGIVEEAWE 125

Query: 126 IVNDSFLDTGRSRWSPESWIRKKEDILSTSIKTRSKAHNIVQKMLASLEDPYTRFLSPEE 185
           IVNDSFL TGR RWSPESW+ KKEDILS+SIKTRSKAHNIVQKMLASL DPYTRFLSPEE
Sbjct: 126 IVNDSFLYTGRDRWSPESWLHKKEDILSSSIKTRSKAHNIVQKMLASLGDPYTRFLSPEE 185

Query: 186 FSKMARYDMTGIGINVREMPDSSGSFKLKVLGLVLDGPAQTAGIRQGDEILSVDGEDVRG 245
           FSKMARYDMTGIGIN+REMPDS+GSFKLKVLGLVLDGPA TAGIRQGDEILSVDGEDVRG
Sbjct: 186 FSKMARYDMTGIGINLREMPDSNGSFKLKVLGLVLDGPANTAGIRQGDEILSVDGEDVRG 245

Query: 246 KSAFDVSSKLQGPSETFVTVEVKHGNCGPLQSVKVQRQLVARPPVFYRLEKIDNGATSVG 305
           KSAF+VSSKLQGPSETFV VEVKHGNCGP+QSVKVQRQLVARPPVF RLE+IDNGATSVG
Sbjct: 246 KSAFEVSSKLQGPSETFVIVEVKHGNCGPVQSVKVQRQLVARPPVFCRLEQIDNGATSVG 305

Query: 306 YVRLKEFNALARKDIVTAMKRLQDQGASSFVLDLRDNFGGLVQAGIEIAKLFLDKGDTVI 365
           YVRLKEFN+LARKDIVTAMKRLQD GAS FVLDLRDN+GGLVQAGIEIAKLFL+KG+TVI
Sbjct: 306 YVRLKEFNSLARKDIVTAMKRLQDLGASCFVLDLRDNYGGLVQAGIEIAKLFLNKGETVI 365

Query: 366 YTAGRDLSQNAFVAESAPLVKAPVMVLVNHNTASASEIVASALHDNCKAVLVGEKTYGKG 425
           YTAGRDLSQNA V+ESAPLV APVMVLVNHNTASASEIVASALHDNCKAVLVGEKTYGKG
Sbjct: 366 YTAGRDLSQNAVVSESAPLVIAPVMVLVNHNTASASEIVASALHDNCKAVLVGEKTYGKG 425

Query: 426 LIQSVFELHDGSGVVVTIGKYVTPHHLDINGNGIEPDFSNIPRWSEVKEYLSTCN 475
           LIQSVFELHDGSGVVVT+GKYVTP H DINGNGIEPDFSNIPRWS+V EYLSTCN
Sbjct: 426 LIQSVFELHDGSGVVVTVGKYVTPGHSDINGNGIEPDFSNIPRWSKVTEYLSTCN 473
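
The percentages in each HSP header follow directly from the residue counts (matches divided by alignment length). The sketch below recomputes them from a header line; the regular expression and the name `hsp_percentages` are assumptions made for illustration, and the pattern tolerates the "Postives" misspelling that appears in some report exports.

```python
import re

# Captures identity and positive counts plus the alignment length.
HSP_STATS = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+).*?Posi?tives\s*=\s*(\d+)/(\d+)"
)


def hsp_percentages(line: str):
    """Return (percent_identity, percent_positives), rounded to 2 decimals."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line")
    ident, ident_len, pos, pos_len = map(int, m.groups())
    return round(100 * ident / ident_len, 2), round(100 * pos / pos_len, 2)


header = "Identity = 386/475 (81.26%), Positives = 415/475 (87.37%), Query Frame = 1"
print(hsp_percentages(header))  # prints (81.26, 87.37)
```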

BLAST of Spo30040.1 vs. NCBI nr
Match: gi|743903903|ref|XP_011045311.1| (PREDICTED: carboxyl-terminal-processing peptidase 1, chloroplastic isoform X1 [Populus euphratica])

HSP 1 Score: 613.2 bits (1580), Expect = 3.800e-172
Identity = 307/439 (69.93%), Positives = 373/439 (84.97%), Query Frame = 1

		  

Query: 44  NSPKKSSIGAAVAAAALSFSLLLYPPVSIAANSPASD--QSSSSDEYCREDGGLLGDDMA 103
           N  +K+ +G A+    LS +LLL  P S+A  SP+    QS S++  CRE+       + 
Sbjct: 31  NWTRKTLLGGAITGV-LSINLLLSSPSSLAFESPSPSLLQSQSTEYLCREEEIQQDFKVE 90

Query: 104 LSAPKLGTNEGIVEEAWEIVNDSFLDTGRSRWSPESWIRKKEDILSTSIKTRSKAHNIVQ 163
             AP++ TNEGIVEEAWEIVNDSFLD+GR RW+P+SW +K+EDILS SI++R+KAH+I++
Sbjct: 91  SEAPQVVTNEGIVEEAWEIVNDSFLDSGRRRWTPQSWQQKREDILSGSIQSRAKAHDIIR 150

Query: 164 KMLASLEDPYTRFLSPEEFSKMARYDMTGIGINVREMPDSSGSFKLKVLGLVLDGPAQTA 223
           +MLASL DPYTRFLSP EFSKM RYD++GIGIN+RE+PD +G  KLKVLGL+LDGPA +A
Sbjct: 151 RMLASLGDPYTRFLSPAEFSKMGRYDISGIGINLREIPDENGEVKLKVLGLLLDGPAYSA 210

Query: 224 GIRQGDEILSVDGEDVRGKSAFDVSSKLQGPSETFVTVEVKHGNCGPLQSVKVQRQLVAR 283
           G+RQGDE+LSV+GEDV+GKSAF+VSS LQGP+ETFVT++VKHGNCGP+ S++VQRQLVAR
Sbjct: 211 GVRQGDELLSVNGEDVKGKSAFEVSSLLQGPNETFVTIKVKHGNCGPVHSIEVQRQLVAR 270

Query: 284 PPVFYRLEKIDNGATSVGYVRLKEFNALARKDIVTAMKRLQDQGASSFVLDLRDNFGGLV 343
            PVFYRLE+IDN   SVGY+RL+EFNALARKD+V AMKRLQD+GAS F+LDLRDN GGLV
Sbjct: 271 TPVFYRLEQIDNSTASVGYMRLREFNALARKDLVIAMKRLQDRGASYFILDLRDNLGGLV 330

Query: 344 QAGIEIAKLFLDKGDTVIYTAGRDLS-QNAFVAESAPLVKAPVMVLVNHNTASASEIVAS 403
           QAGIEIAKLFL++G+ VIYT GRD   QN  VA+SAPLVKAPV+VLVN+ TASASEIVAS
Sbjct: 331 QAGIEIAKLFLNEGEKVIYTVGRDPQYQNTIVADSAPLVKAPVIVLVNNKTASASEIVAS 390

Query: 404 ALHDNCKAVLVGEKTYGKGLIQSVFELHDGSGVVVTIGKYVTPHHLDINGNGIEPDFSNI 463
           ALHDNC+AVLVGE+T+GKGLIQSVFELHDGSGVVVT+GKYVTP+H+DINGNGIEPD+ N 
Sbjct: 391 ALHDNCRAVLVGERTFGKGLIQSVFELHDGSGVVVTVGKYVTPNHMDINGNGIEPDYQNF 450

Query: 464 PRWSEVKEYLSTCNRLTQG 480
           P WS+V+++LS CN   QG
Sbjct: 451 PGWSDVQKHLSECNINQQG 468

BLAST of Spo30040.1 vs. NCBI nr
Match: gi|224114365|ref|XP_002316739.1| (peptidase S41 family protein [Populus trichocarpa])

HSP 1 Score: 611.7 bits (1576), Expect = 1.100e-171
Identity = 306/439 (69.70%), Positives = 373/439 (84.97%), Query Frame = 1

		  

Query: 44  NSPKKSSIGAAVAAAALSFSLLLYPPVSIAANSPAS--DQSSSSDEYCREDGGLLGDDMA 103
           N  +K+ +G A+  A LS +LLL  P  +A  SP+   + S S++  CRE+       + 
Sbjct: 31  NWTRKTLLGGAITGA-LSINLLLSSPSLLALESPSPSLEHSQSTEYLCREEETQQDFKVE 90

Query: 104 LSAPKLGTNEGIVEEAWEIVNDSFLDTGRSRWSPESWIRKKEDILSTSIKTRSKAHNIVQ 163
             AP++ TNEGIVEEAWEIVNDSFLD+GR RW+P+SW +KKEDILS SI++R+KAH+I++
Sbjct: 91  SEAPQVVTNEGIVEEAWEIVNDSFLDSGRRRWTPQSWQQKKEDILSGSIQSRAKAHDIIR 150

Query: 164 KMLASLEDPYTRFLSPEEFSKMARYDMTGIGINVREMPDSSGSFKLKVLGLVLDGPAQTA 223
           +MLASL DPYTRFLSP EFSKM RYD++GIGIN+RE+PD +G  KLKVLGL+LDGPA +A
Sbjct: 151 RMLASLGDPYTRFLSPAEFSKMGRYDVSGIGINLREIPDENGEVKLKVLGLLLDGPAYSA 210

Query: 224 GIRQGDEILSVDGEDVRGKSAFDVSSKLQGPSETFVTVEVKHGNCGPLQSVKVQRQLVAR 283
           G+RQGDE+LSV+GEDV+GKSAF+VSS LQGP+ETFVT++VKHGNCGP+ S++VQRQLVAR
Sbjct: 211 GVRQGDELLSVNGEDVKGKSAFEVSSLLQGPNETFVTIKVKHGNCGPVHSIEVQRQLVAR 270

Query: 284 PPVFYRLEKIDNGATSVGYVRLKEFNALARKDIVTAMKRLQDQGASSFVLDLRDNFGGLV 343
            PV YRLE+I+N   SVGY+RL+EFNALARKD+V AMKRLQD+GAS F+LDLRDN GGLV
Sbjct: 271 TPVSYRLEQIENSTASVGYIRLREFNALARKDLVIAMKRLQDRGASYFILDLRDNLGGLV 330

Query: 344 QAGIEIAKLFLDKGDTVIYTAGRDLS-QNAFVAESAPLVKAPVMVLVNHNTASASEIVAS 403
           QAGIEI+KLFL++G+ VIYTAGRD   QN  VA+SAPLVKAPV+VLVN+ TASASEIVAS
Sbjct: 331 QAGIEISKLFLNEGEKVIYTAGRDPQYQNTIVADSAPLVKAPVIVLVNNKTASASEIVAS 390

Query: 404 ALHDNCKAVLVGEKTYGKGLIQSVFELHDGSGVVVTIGKYVTPHHLDINGNGIEPDFSNI 463
           ALHDNC+AVLVGE+T+GKGLIQSVFELHDGSGVVVT+GKYVTP+H+DINGNGIEPD+ N 
Sbjct: 391 ALHDNCRAVLVGERTFGKGLIQSVFELHDGSGVVVTVGKYVTPNHMDINGNGIEPDYQNF 450

Query: 464 PRWSEVKEYLSTCNRLTQG 480
           P WS+VK++LS CN   QG
Sbjct: 451 PGWSDVKKHLSECNINRQG 468

BLAST of Spo30040.1 vs. NCBI nr
Match: gi|1000965308|ref|XP_015574941.1| (PREDICTED: carboxyl-terminal-processing peptidase 1, chloroplastic isoform X2 [Ricinus communis])

HSP 1 Score: 610.9 bits (1574), Expect = 1.900e-171
Identity = 309/450 (68.67%), Positives = 375/450 (83.33%), Query Frame = 1

		  

Query: 37  NFNSNHQNSPKKSSIGAAVAAAALSFSLLLYPPVSIAANSP------ASDQSSSSDEYCR 96
           N N++     +K+ +GA      LSF+LLL  P S+A+ SP       S  ++SS E C+
Sbjct: 34  NLNNHTTTWARKTFLGALTGV--LSFNLLLSSPFSLASQSPYPQLQLPSPPNNSSIEQCQ 93

Query: 97  EDGGLLGDDMALSAPKLGTNEGIVEEAWEIVNDSFLDTGRSRWSPESWIRKKEDILSTSI 156
           E   +  +  ++      TNEGIVEEAW+IVNDSFLD GR RW+P+SW +KKEDILSTSI
Sbjct: 94  EQEQVEQNQESV------TNEGIVEEAWQIVNDSFLDAGRHRWTPQSWQQKKEDILSTSI 153

Query: 157 KTRSKAHNIVQKMLASLEDPYTRFLSPEEFSKMARYDMTGIGINVREMPDSSGSFKLKVL 216
           ++RSKAH+++++MLASL DPYTRFLSP EFSKMARYDM+GIGIN+RE+P+ +G  KLKVL
Sbjct: 154 QSRSKAHDLIKRMLASLGDPYTRFLSPAEFSKMARYDMSGIGINLREVPEENGEVKLKVL 213

Query: 217 GLVLDGPAQTAGIRQGDEILSVDGEDVRGKSAFDVSSKLQGPSETFVTVEVKHGNCGPLQ 276
           GL+LDGPA TAG++QGDEIL+V+GEDVRGKSAF+VSS LQGP+ETFVT++VKHGNCGP+Q
Sbjct: 214 GLLLDGPAYTAGVKQGDEILAVNGEDVRGKSAFEVSSSLQGPNETFVTIKVKHGNCGPIQ 273

Query: 277 SVKVQRQLVARPPVFYRLEKIDNGATSVGYVRLKEFNALARKDIVTAMKRLQDQGASSFV 336
           S++VQRQLVAR PVFYRLE++D G TSVGY+RLKEFNALARKD+V AMKRL+D GAS F+
Sbjct: 274 SLEVQRQLVARTPVFYRLEQVDKGTTSVGYMRLKEFNALARKDLVIAMKRLKDMGASYFI 333

Query: 337 LDLRDNFGGLVQAGIEIAKLFLDKGDTVIYTAGRDLS-QNAFVAESAPLVKAPVMVLVNH 396
           LDLRDN GGLVQAGIEI+KLFL++G+ VIYT GRD   QN  VA++APLV APV+VLVN+
Sbjct: 334 LDLRDNLGGLVQAGIEISKLFLNEGEKVIYTVGRDPQYQNTIVADTAPLVTAPVIVLVNN 393

Query: 397 NTASASEIVASALHDNCKAVLVGEKTYGKGLIQSVFELHDGSGVVVTIGKYVTPHHLDIN 456
           NTASASEIVASALHDNC+AVLVGE+T+GKGLIQSVFELHDGSGVVVT+GKYVTP+H+DIN
Sbjct: 394 NTASASEIVASALHDNCRAVLVGERTFGKGLIQSVFELHDGSGVVVTVGKYVTPNHMDIN 453

Query: 457 GNGIEPDFSNIPRWSEVKEYLSTCNRLTQG 480
           GNGIEPD+ N P WS+V  +LS CN   QG
Sbjct: 454 GNGIEPDYRNFPAWSDVTRHLSQCNMNRQG 475

BLAST of Spo30040.1 vs. UniProtKB/TrEMBL
Match: A0A0K9RTF0_SPIOL (Uncharacterized protein OS=Spinacia oleracea GN=SOVF_030520 PE=4 SV=1)

HSP 1 Score: 941.0 bits (2431), Expect = 5.500e-271
Identity = 478/479 (99.79%), Positives = 478/479 (99.79%), Query Frame = 1

		  

Query: 1   MRLCSLTSPNPCNTTIIITNNKNVGVAKSYTNNCNPNFNSNHQNSPKKSSIGAAVAAAAL 60
           MRLCSLTS NPCNTTIIITNNKNVGVAKSYTNNCNPNFNSNHQNSPKKSSIGAAVAAAAL
Sbjct: 1   MRLCSLTSSNPCNTTIIITNNKNVGVAKSYTNNCNPNFNSNHQNSPKKSSIGAAVAAAAL 60

Query: 61  SFSLLLYPPVSIAANSPASDQSSSSDEYCREDGGLLGDDMALSAPKLGTNEGIVEEAWEI 120
           SFSLLLYPPVSIAANSPASDQSSSSDEYCREDGGLLGDDMALSAPKLGTNEGIVEEAWEI
Sbjct: 61  SFSLLLYPPVSIAANSPASDQSSSSDEYCREDGGLLGDDMALSAPKLGTNEGIVEEAWEI 120

Query: 121 VNDSFLDTGRSRWSPESWIRKKEDILSTSIKTRSKAHNIVQKMLASLEDPYTRFLSPEEF 180
           VNDSFLDTGRSRWSPESWIRKKEDILSTSIKTRSKAHNIVQKMLASLEDPYTRFLSPEEF
Sbjct: 121 VNDSFLDTGRSRWSPESWIRKKEDILSTSIKTRSKAHNIVQKMLASLEDPYTRFLSPEEF 180

Query: 181 SKMARYDMTGIGINVREMPDSSGSFKLKVLGLVLDGPAQTAGIRQGDEILSVDGEDVRGK 240
           SKMARYDMTGIGINVREMPDSSGSFKLKVLGLVLDGPAQTAGIRQGDEILSVDGEDVRGK
Sbjct: 181 SKMARYDMTGIGINVREMPDSSGSFKLKVLGLVLDGPAQTAGIRQGDEILSVDGEDVRGK 240

Query: 241 SAFDVSSKLQGPSETFVTVEVKHGNCGPLQSVKVQRQLVARPPVFYRLEKIDNGATSVGY 300
           SAFDVSSKLQGPSETFVTVEVKHGNCGPLQSVKVQRQLVARPPVFYRLEKIDNGATSVGY
Sbjct: 241 SAFDVSSKLQGPSETFVTVEVKHGNCGPLQSVKVQRQLVARPPVFYRLEKIDNGATSVGY 300

Query: 301 VRLKEFNALARKDIVTAMKRLQDQGASSFVLDLRDNFGGLVQAGIEIAKLFLDKGDTVIY 360
           VRLKEFNALARKDIVTAMKRLQDQGASSFVLDLRDNFGGLVQAGIEIAKLFLDKGDTVIY
Sbjct: 301 VRLKEFNALARKDIVTAMKRLQDQGASSFVLDLRDNFGGLVQAGIEIAKLFLDKGDTVIY 360

Query: 361 TAGRDLSQNAFVAESAPLVKAPVMVLVNHNTASASEIVASALHDNCKAVLVGEKTYGKGL 420
           TAGRDLSQNAFVAESAPLVKAPVMVLVNHNTASASEIVASALHDNCKAVLVGEKTYGKGL
Sbjct: 361 TAGRDLSQNAFVAESAPLVKAPVMVLVNHNTASASEIVASALHDNCKAVLVGEKTYGKGL 420

Query: 421 IQSVFELHDGSGVVVTIGKYVTPHHLDINGNGIEPDFSNIPRWSEVKEYLSTCNRLTQG 480
           IQSVFELHDGSGVVVTIGKYVTPHHLDINGNGIEPDFSNIPRWSEVKEYLSTCNRLTQG
Sbjct: 421 IQSVFELHDGSGVVVTIGKYVTPHHLDINGNGIEPDFSNIPRWSEVKEYLSTCNRLTQG 479

BLAST of Spo30040.1 vs. UniProtKB/TrEMBL
Match: A0A0J8B7H3_BETVU (Uncharacterized protein OS=Beta vulgaris subsp. vulgaris GN=BVRB_003960 PE=4 SV=1)

HSP 1 Score: 727.6 bits (1877), Expect = 9.600e-207
Identity = 386/475 (81.26%), Positives = 415/475 (87.37%), Query Frame = 1

		  

Query: 6   LTSPNPCNTTIIITNNKNVGVAKSYTNNCNPNFNSNHQNSPKKSSIGAAVAAAALSFSLL 65
           LTSP PCN    I NN      ++Y NN N ++ ++  N  K S + A++AAA LSF+L 
Sbjct: 6   LTSPKPCNIFANINNNNG----RTYNNNNNNSYINSLLNCSKNSLVSASIAAA-LSFTLQ 65

Query: 66  LYPPVSIAANSP------ASDQSSSSDEYCREDGGLLGDDMALSAPKLGTNEGIVEEAWE 125
           L  P+SIAA+SP      +S  SSSS E CRED   L   + LSAP + TNEGIVEEAWE
Sbjct: 66  LPSPLSIAADSPPFRQPSSSSSSSSSVEDCREDE--LVAQLGLSAPNMVTNEGIVEEAWE 125

Query: 126 IVNDSFLDTGRSRWSPESWIRKKEDILSTSIKTRSKAHNIVQKMLASLEDPYTRFLSPEE 185
           IVNDSFL TGR RWSPESW+ KKEDILS+SIKTRSKAHNIVQKMLASL DPYTRFLSPEE
Sbjct: 126 IVNDSFLYTGRDRWSPESWLHKKEDILSSSIKTRSKAHNIVQKMLASLGDPYTRFLSPEE 185

Query: 186 FSKMARYDMTGIGINVREMPDSSGSFKLKVLGLVLDGPAQTAGIRQGDEILSVDGEDVRG 245
           FSKMARYDMTGIGIN+REMPDS+GSFKLKVLGLVLDGPA TAGIRQGDEILSVDGEDVRG
Sbjct: 186 FSKMARYDMTGIGINLREMPDSNGSFKLKVLGLVLDGPANTAGIRQGDEILSVDGEDVRG 245

Query: 246 KSAFDVSSKLQGPSETFVTVEVKHGNCGPLQSVKVQRQLVARPPVFYRLEKIDNGATSVG 305
           KSAF+VSSKLQGPSETFV VEVKHGNCGP+QSVKVQRQLVARPPVF RLE+IDNGATSVG
Sbjct: 246 KSAFEVSSKLQGPSETFVIVEVKHGNCGPVQSVKVQRQLVARPPVFCRLEQIDNGATSVG 305

Query: 306 YVRLKEFNALARKDIVTAMKRLQDQGASSFVLDLRDNFGGLVQAGIEIAKLFLDKGDTVI 365
           YVRLKEFN+LARKDIVTAMKRLQD GAS FVLDLRDN+GGLVQAGIEIAKLFL+KG+TVI
Sbjct: 306 YVRLKEFNSLARKDIVTAMKRLQDLGASCFVLDLRDNYGGLVQAGIEIAKLFLNKGETVI 365

Query: 366 YTAGRDLSQNAFVAESAPLVKAPVMVLVNHNTASASEIVASALHDNCKAVLVGEKTYGKG 425
           YTAGRDLSQNA V+ESAPLV APVMVLVNHNTASASEIVASALHDNCKAVLVGEKTYGKG
Sbjct: 366 YTAGRDLSQNAVVSESAPLVIAPVMVLVNHNTASASEIVASALHDNCKAVLVGEKTYGKG 425

Query: 426 LIQSVFELHDGSGVVVTIGKYVTPHHLDINGNGIEPDFSNIPRWSEVKEYLSTCN 475
           LIQSVFELHDGSGVVVT+GKYVTP H DINGNGIEPDFSNIPRWS+V EYLSTCN
Sbjct: 426 LIQSVFELHDGSGVVVTVGKYVTPGHSDINGNGIEPDFSNIPRWSKVTEYLSTCN 473

BLAST of Spo30040.1 vs. UniProtKB/TrEMBL
Match: B9I114_POPTR (Peptidase S41 family protein OS=Populus trichocarpa GN=POPTR_0011s02860g PE=4 SV=1)

HSP 1 Score: 611.7 bits (1576), Expect = 7.700e-172
Identity = 306/439 (69.70%), Positives = 373/439 (84.97%), Query Frame = 1

		  

Query: 44  NSPKKSSIGAAVAAAALSFSLLLYPPVSIAANSPAS--DQSSSSDEYCREDGGLLGDDMA 103
           N  +K+ +G A+  A LS +LLL  P  +A  SP+   + S S++  CRE+       + 
Sbjct: 31  NWTRKTLLGGAITGA-LSINLLLSSPSLLALESPSPSLEHSQSTEYLCREEETQQDFKVE 90

Query: 104 LSAPKLGTNEGIVEEAWEIVNDSFLDTGRSRWSPESWIRKKEDILSTSIKTRSKAHNIVQ 163
             AP++ TNEGIVEEAWEIVNDSFLD+GR RW+P+SW +KKEDILS SI++R+KAH+I++
Sbjct: 91  SEAPQVVTNEGIVEEAWEIVNDSFLDSGRRRWTPQSWQQKKEDILSGSIQSRAKAHDIIR 150

Query: 164 KMLASLEDPYTRFLSPEEFSKMARYDMTGIGINVREMPDSSGSFKLKVLGLVLDGPAQTA 223
           +MLASL DPYTRFLSP EFSKM RYD++GIGIN+RE+PD +G  KLKVLGL+LDGPA +A
Sbjct: 151 RMLASLGDPYTRFLSPAEFSKMGRYDVSGIGINLREIPDENGEVKLKVLGLLLDGPAYSA 210

Query: 224 GIRQGDEILSVDGEDVRGKSAFDVSSKLQGPSETFVTVEVKHGNCGPLQSVKVQRQLVAR 283
           G+RQGDE+LSV+GEDV+GKSAF+VSS LQGP+ETFVT++VKHGNCGP+ S++VQRQLVAR
Sbjct: 211 GVRQGDELLSVNGEDVKGKSAFEVSSLLQGPNETFVTIKVKHGNCGPVHSIEVQRQLVAR 270

Query: 284 PPVFYRLEKIDNGATSVGYVRLKEFNALARKDIVTAMKRLQDQGASSFVLDLRDNFGGLV 343
            PV YRLE+I+N   SVGY+RL+EFNALARKD+V AMKRLQD+GAS F+LDLRDN GGLV
Sbjct: 271 TPVSYRLEQIENSTASVGYIRLREFNALARKDLVIAMKRLQDRGASYFILDLRDNLGGLV 330

Query: 344 QAGIEIAKLFLDKGDTVIYTAGRDLS-QNAFVAESAPLVKAPVMVLVNHNTASASEIVAS 403
           QAGIEI+KLFL++G+ VIYTAGRD   QN  VA+SAPLVKAPV+VLVN+ TASASEIVAS
Sbjct: 331 QAGIEISKLFLNEGEKVIYTAGRDPQYQNTIVADSAPLVKAPVIVLVNNKTASASEIVAS 390

Query: 404 ALHDNCKAVLVGEKTYGKGLIQSVFELHDGSGVVVTIGKYVTPHHLDINGNGIEPDFSNI 463
           ALHDNC+AVLVGE+T+GKGLIQSVFELHDGSGVVVT+GKYVTP+H+DINGNGIEPD+ N 
Sbjct: 391 ALHDNCRAVLVGERTFGKGLIQSVFELHDGSGVVVTVGKYVTPNHMDINGNGIEPDYQNF 450

Query: 464 PRWSEVKEYLSTCNRLTQG 480
           P WS+VK++LS CN   QG
Sbjct: 451 PGWSDVKKHLSECNINRQG 468

BLAST of Spo30040.1 vs. UniProtKB/TrEMBL
Match: A0A067GHB1_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g012084mg PE=4 SV=1)

HSP 1 Score: 607.8 bits (1566), Expect = 1.100e-170
Identity = 309/445 (69.44%), Positives = 371/445 (83.37%), Query Frame = 1

		  

Query: 41  NHQNSPKKSSIGAAVAAAALSFSLLLYPPVSIAANSPASD--QSSSSDEYCREDGGLLGD 100
           ++ N  KK+ I   V   ALSF+LLL  P+++ ++S       S S    C E     G+
Sbjct: 34  SNTNWAKKAVIN--VLTGALSFNLLLSSPLALESSSSVQSVPPSPSPSLTCHE-----GE 93

Query: 101 DMALSAPK---LGTNEGIVEEAWEIVNDSFLDTGRSRWSPESWIRKKEDILSTSIKTRSK 160
           D A S P+     TNEGIVEEAW+IVNDSFLDTGR RW+P++W RK+EDILS+SI+TRSK
Sbjct: 94  DAAESEPRQVVAKTNEGIVEEAWQIVNDSFLDTGRHRWTPQNWQRKREDILSSSIQTRSK 153

Query: 161 AHNIVQKMLASLEDPYTRFLSPEEFSKMARYDMTGIGINVREMPDSSGSFKLKVLGLVLD 220
           AH I+++MLASL DPYTRFLSP EFSKMARYDM+GIGIN+RE+PD++G   LKVLGL+LD
Sbjct: 154 AHGIIKRMLASLGDPYTRFLSPAEFSKMARYDMSGIGINLREVPDANGVVTLKVLGLILD 213

Query: 221 GPAQTAGIRQGDEILSVDGEDVRGKSAFDVSSKLQGPSETFVTVEVKHGNCGPLQSVKVQ 280
           GPA +AG+RQGDE+L+V+G DVRGKSAF+VSS LQGPSETFVT+EVKHGNCGP++S++VQ
Sbjct: 214 GPAHSAGVRQGDEVLAVNGVDVRGKSAFEVSSLLQGPSETFVTIEVKHGNCGPIESIQVQ 273

Query: 281 RQLVARPPVFYRLEKIDNGATSVGYVRLKEFNALARKDIVTAMKRLQDQGASSFVLDLRD 340
           RQLVAR PVFYRLE +DNG TSVGY+RLKEFNALARKD+VTAMKRLQD GAS F+LDLRD
Sbjct: 274 RQLVARTPVFYRLEHLDNGTTSVGYMRLKEFNALARKDLVTAMKRLQDMGASYFILDLRD 333

Query: 341 NFGGLVQAGIEIAKLFLDKGDTVIYTAGRDLS-QNAFVAESAPLVKAPVMVLVNHNTASA 400
           N GGLVQAGIEIAKLFL++G+T+ YT GRD   Q   VA+++PLV APV+VLVN+ TASA
Sbjct: 334 NLGGLVQAGIEIAKLFLNEGETITYTVGRDPQYQKTIVADNSPLVTAPVIVLVNNRTASA 393

Query: 401 SEIVASALHDNCKAVLVGEKTYGKGLIQSVFELHDGSGVVVTIGKYVTPHHLDINGNGIE 460
           SEIVASALHDNC+AVLVGEKT+GKGLIQSV+ELHDGSGVVVTIGKYVTP+H+DINGNGIE
Sbjct: 394 SEIVASALHDNCRAVLVGEKTFGKGLIQSVYELHDGSGVVVTIGKYVTPNHMDINGNGIE 453

Query: 461 PDFSNIPRWSEVKEYLSTCNRLTQG 480
           PD+ N+P W++V ++LS C    QG
Sbjct: 454 PDYRNLPAWNDVTKHLSQCTMHPQG 471

BLAST of Spo30040.1 vs. UniProtKB/TrEMBL
Match: F6HMG8_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_10s0003g02730 PE=4 SV=1)

HSP 1 Score: 607.4 bits (1565), Expect = 1.500e-170
Identity = 308/441 (69.84%), Positives = 364/441 (82.54%), Query Frame = 1

		  

Query: 44  NSPKKSSIGAAVAAAALSFSLLLYPPVSIAANS----PASDQSSSSDEYCREDGGLLGDD 103
           N P K+ +GA   A  +SFSLL+  P SIA +S    P+    SS+ +YCR+D     + 
Sbjct: 26  NWPHKTLVGALTGA--VSFSLLISSPSSIALDSASVPPSPSSHSSATDYCRQDDDT--EA 85

Query: 104 MALSAPKLGTNEGIVEEAWEIVNDSFLDTGRSRWSPESWIRKKEDILSTSIKTRSKAHNI 163
           M  +AP+L TNE IVEEAW IVNDSFLD+ R RWS + W +KKEDIL TSI+TRSKAH+I
Sbjct: 86  MPETAPELVTNEAIVEEAWNIVNDSFLDSSRRRWSSDIWKQKKEDILGTSIQTRSKAHDI 145

Query: 164 VQKMLASLEDPYTRFLSPEEFSKMARYDMTGIGINVREMPDSSGSFKLKVLGLVLDGPAQ 223
           +++MLASL DPYTRFLSP EFSKMARYDMTGIGIN+RE+ D +G  KLKVLGL+LDGPA 
Sbjct: 146 IRRMLASLGDPYTRFLSPAEFSKMARYDMTGIGINIREVQDDNGGVKLKVLGLILDGPAH 205

Query: 224 TAGIRQGDEILSVDGEDVRGKSAFDVSSKLQGPSETFVTVEVKHGNCGPLQSVKVQRQLV 283
            AG+RQGDEILSV+G DV GKSAF+ SS LQGP+ETFVT+EVKHGNCGP+QS++VQRQLV
Sbjct: 206 AAGVRQGDEILSVNGMDVTGKSAFEASSLLQGPNETFVTLEVKHGNCGPVQSIEVQRQLV 265

Query: 284 ARPPVFYRLEKIDNGATSVGYVRLKEFNALARKDIVTAMKRLQDQGASSFVLDLRDNFGG 343
           AR PVFYRLEKI+NGA SVGY+RLKEFNALARKD+V AMKRLQD GA  F+LDLRDN GG
Sbjct: 266 ARTPVFYRLEKIENGAASVGYMRLKEFNALARKDLVIAMKRLQDMGAKYFILDLRDNLGG 325

Query: 344 LVQAGIEIAKLFLDKGDTVIYTAGRDLS-QNAFVAESAPLVKAPVMVLVNHNTASASEIV 403
           LVQAGIEIAKLFL++G+TV YT GRD   +    AE+APL+ AP++VLVN+ TASASEIV
Sbjct: 326 LVQAGIEIAKLFLNEGETVTYTVGRDPQYEKTITAETAPLITAPLIVLVNNKTASASEIV 385

Query: 404 ASALHDNCKAVLVGEKTYGKGLIQSVFELHDGSGVVVTIGKYVTPHHLDINGNGIEPDFS 463
           ++ALHDNC+AVLVG++T+GKGLIQSVFELHDGSGVVVTIGKYVTP+H+DIN NGIEPDF 
Sbjct: 386 SAALHDNCRAVLVGQRTFGKGLIQSVFELHDGSGVVVTIGKYVTPNHMDINKNGIEPDFR 445

Query: 464 NIPRWSEVKEYLSTCNRLTQG 480
             P WSEV ++L+ CN L QG
Sbjct: 446 EFPAWSEVTQHLAQCNTLRQG 462

BLAST of Spo30040.1 vs. ExPASy Swiss-Prot
Match: CTPA1_ARATH (Carboxyl-terminal-processing peptidase 1, chloroplastic OS=Arabidopsis thaliana GN=CTPA1 PE=1 SV=1)

HSP 1 Score: 551.2 bits (1419), Expect = 1.100e-155
Identity = 286/452 (63.27%), Positives = 352/452 (77.88%), Query Frame = 1

		  

Query: 47  KKSSIGAAVAAAALSFSLLLYPPVS---------IAANSPASDQSSSSDEY------C-- 106
           KKS IG    A  LS +L+   P+S         ++ N P+S   SS + +      C  
Sbjct: 42  KKSVIGTLTGA--LSLTLVFSSPISSVAATNDPYLSVNPPSSSFESSLNHFDSAPEDCPN 101

Query: 107 --REDGGLLGDDMALSAPKLGTNEGIVEEAWEIVNDSFLDTGRSRWSPESWIRKKEDILS 166
               D  +  DD+    P+L TNEGIVEEAWEIVN +FLDT    W+PE+W ++K+DIL+
Sbjct: 102 EEEADTEIQDDDIE---PQLVTNEGIVEEAWEIVNGAFLDTRSHSWTPETWQKQKDDILA 161

Query: 167 TSIKTRSKAHNIVQKMLASLEDPYTRFLSPEEFSKMARYDMTGIGINVREMPDSSGSFKL 226
           + IK+RSKAH +++ MLASL D YTRFLSP+EFS+M++YD+TGIGIN+RE+ D  G+ KL
Sbjct: 162 SPIKSRSKAHEVIKNMLASLGDQYTRFLSPDEFSRMSKYDITGIGINLREVSDGGGNVKL 221

Query: 227 KVLGLVLDGPAQTAGIRQGDEILSVDGEDVRGKSAFDVSSKLQGPSETFVTVEVKHGNCG 286
           KVLGLVLD  A  AG++QGDEIL+V+G DV GKS+F+VSS LQGPS+TFV ++VKHG CG
Sbjct: 222 KVLGLVLDSAADIAGVKQGDEILAVNGMDVSGKSSFEVSSLLQGPSKTFVVLKVKHGKCG 281

Query: 287 PLQSVKVQRQLVARPPVFYRLEKIDNGATSVGYVRLKEFNALARKDIVTAMKRLQDQGAS 346
           P++S+K+QRQ+ A+ PV YRLEK+DNG  SVGY+RLKEFNALARKD+V AMKRL D+GAS
Sbjct: 282 PVKSLKIQRQVNAQTPVSYRLEKVDNGTVSVGYIRLKEFNALARKDLVIAMKRLLDKGAS 341

Query: 347 SFVLDLRDNFGGLVQAGIEIAKLFLDKGDTVIYTAGRD-LSQNAFVAESAPLVKAPVMVL 406
            FV+DLRDN GGLVQAGIE AKLFLD+GDTVIYTAGRD  +Q   V++  PL+ AP++V+
Sbjct: 342 YFVMDLRDNLGGLVQAGIETAKLFLDEGDTVIYTAGRDPEAQKTVVSDKKPLITAPLIVM 401

Query: 407 VNHNTASASEIVASALHDNCKAVLVGEKTYGKGLIQSVFELHDGSGVVVTIGKYVTPHHL 466
           VN+ TASASEIVASALHDNCKAVLVGE+TYGKGLIQSV+EL DGSGVVVTIGKYVTP+H+
Sbjct: 402 VNNRTASASEIVASALHDNCKAVLVGERTYGKGLIQSVYELRDGSGVVVTIGKYVTPNHM 461

Query: 467 DINGNGIEPDFSNIPRWSEVKEYLSTCNRLTQ 479
           DING GIEPDF N+P W EVKE LS C+ L Q
Sbjct: 462 DINGGGIEPDFRNLPAWDEVKERLSKCSILQQ 488

BLAST of Spo30040.1 vs. ExPASy Swiss-Prot
Match: CTPA_SYNP2 (Carboxyl-terminal-processing protease OS=Synechococcus sp. (strain ATCC 27264 / PCC 7002 / PR-6) GN=ctpA PE=3 SV=2)

HSP 1 Score: 267.3 bits (682), Expect = 3.200e-70
Identity = 144/349 (41.26%), Positives = 227/349 (65.04%), Query Frame = 1

		  

Query: 111 EGIVEEAWEIVNDSFLDTGRSRWSPESWIRKKEDILSTSIKTRSKAHNIVQKMLASLEDP 170
           + ++ +AW  V+ +++D     ++ ++W   ++  L   +KTR +A+  V +MLA L+DP
Sbjct: 34  QDLLLQAWRYVSQAYVD---ETFNHQNWWLIRQKFLKRPLKTRDEAYEAVGEMLALLDDP 93

Query: 171 YTRFLSPEEFSKM---ARYDMTGIGINVREMPDSSGSFKLKVLGLVLDGPAQTAGIRQGD 230
           YTR L PE++  +      +++G+G+ +   P+      L+V+  +   PA+ AGI   D
Sbjct: 94  YTRLLRPEQYRSLKVSTSGELSGVGLQINVNPEVD---VLEVILPLPGSPAEAAGIEAKD 153

Query: 231 EILSVDGEDVRGKSAFDVSSKLQGPSETFVTVEVKHGNCGPLQSVKVQRQLVARPPVFYR 290
           +IL++DG D R     + +++++G   + V++ VK      +++VKV R  +A  PV+ +
Sbjct: 154 QILAIDGIDTRNIGLEEAAARMRGKKGSTVSLTVKSPKTDTVRTVKVTRDTIALNPVYDK 213

Query: 291 LEKIDNGATSVGYVRLKEFNALARKDIVTAMKRLQDQGASSFVLDLRDNFGGLVQAGIEI 350
           L+  +     VGY+RL +F+A A+ +I+ ++ +LQ QGA  +VLDLR+N GGL+QAGIEI
Sbjct: 214 LD--EKNGEKVGYIRLNQFSANAKTEIIKSLNQLQKQGADRYVLDLRNNPGGLLQAGIEI 273

Query: 351 AKLFLDKGDTVIYTAGRDLSQNAFVAESAPLVKAPVMVLVNHNTASASEIVASALHDNCK 410
           A+L+LD+ +T++YT  R     ++ A   PL  AP++VLVN  TASASEI+A AL DN +
Sbjct: 274 ARLWLDQ-ETIVYTVNRQGIFESYSAVGQPLTDAPLVVLVNQATASASEILAGALQDNGR 333

Query: 411 AVLVGEKTYGKGLIQSVFELHDGSGVVVTIGKYVTPHHLDINGNGIEPD 457
           A+LVGEKT+GKGLIQS+FEL DG+G+ VT+ KY TP H DIN  GI PD
Sbjct: 334 AMLVGEKTFGKGLIQSLFELPDGAGMAVTVAKYETPLHHDINKLGIMPD 373

BLAST of Spo30040.1 vs. ExPASy Swiss-Prot
Match: CTPA_SYNY3 (Carboxyl-terminal-processing protease OS=Synechocystis sp. (strain PCC 6803 / Kazusa) GN=ctpA PE=3 SV=1)

HSP 1 Score: 266.2 bits (679), Expect = 7.100e-70
Identity = 137/357 (38.38%), Positives = 230/357 (64.43%), Query Frame = 1

		  

Query: 103 SAPKLGTNEGIVEEAWEIVNDSFLDTGRSRWSPESWIRKKEDILSTSIKTRSKAHNIVQK 162
           SA      + ++ ++W +VN S+LD     ++ ++W   +E  +   ++ R + +  +++
Sbjct: 28  SALAFTEEQKLLLQSWRLVNQSYLD---ETFNHQNWWLLREKYVKRPLRNREETYTAIEE 87

Query: 163 MLASLEDPYTRFLSPEEFSKM---ARYDMTGIGINVREMPDSSGSFKLKVLGLVLDGPAQ 222
           MLA+L++P+TR L PE++  +      +++G+G+ +   P+++   +L+++  +   PA+
Sbjct: 88  MLATLDEPFTRLLRPEQYGNLQVTTTGELSGVGLQININPETN---QLEIMAPLAGSPAE 147

Query: 223 TAGIRQGDEILSVDGEDVRGKSAFDVSSKLQGPSETFVTVEVKHGNCGPLQSVKVQRQLV 282
            AG++  D+IL++DG D +  S  + +++++GP  T V++E+        Q   + RQL+
Sbjct: 148 EAGLQPHDQILAIDGVDTQTLSLDEAAARMRGPKNTKVSLEILSAGTEVPQEFTLTRQLI 207

Query: 283 ARPPVFYRLEKIDNGATSVGYVRLKEFNALARKDIVTAMKRLQDQGASSFVLDLRDNFGG 342
           +  PV  +L+    G  SVGY+RL +F+A A K++  A+ +L++QGA  ++LDLR+N GG
Sbjct: 208 SLSPVAAQLDDSRPGQ-SVGYIRLSQFSANAYKEVAHALHQLEEQGADGYILDLRNNPGG 267

Query: 343 LVQAGIEIAKLFLDKGDTVIYTAGRDLSQNAFVAESAPLVKAPVMVLVNHNTASASEIVA 402
           L+QAGI+IA+L+L +  T++YT  R  +Q +F A        P++VLVN  TASASEI+A
Sbjct: 268 LLQAGIDIARLWLPES-TIVYTVNRQGTQESFTANGEAATDRPLVVLVNQGTASASEILA 327

Query: 403 SALHDNCKAVLVGEKTYGKGLIQSVFELHDGSGVVVTIGKYVTPHHLDINGNGIEPD 457
            AL DN +A LVGEKT+GKGLIQS+FEL DG+G+ VT+ KY TP H DI+  GI PD
Sbjct: 328 GALQDNQRATLVGEKTFGKGLIQSLFELSDGAGIAVTVAKYETPQHHDIHKLGIMPD 376

BLAST of Spo30040.1 vs. ExPASy Swiss-Prot
Match: CTPA_ACUOB (C-terminal processing peptidase, chloroplastic OS=Acutodesmus obliquus GN=ctpA PE=1 SV=1)

HSP 1 Score: 217.2 bits (552), Expect = 3.800e-55
Identity = 134/371 (36.12%), Positives = 214/371 (57.68%), Query Frame = 1

		  

Query: 101 ALSAPKLGTNEGIVEEAWEIVNDSFLDTGRSRWSPESWIRKKEDILSTS-IKTRSKAHNI 160
           AL A  + + + +  EAW  V+ +++D     ++ +SW + +E  L    +  R++ ++ 
Sbjct: 72  ALPAQAVTSEQLLFLEAWRAVDRAYVDKS---FNGQSWFKLRETYLKKEPMDRRAQTYDA 131

Query: 161 VQKMLASLEDPYTRFLSPEEFSKMARY---DMTGIGINVREMPDSSGSFKLKVLGLVLDG 220
           ++K+LA L+DP+TRFL P   + + R     +TG+G+ +    D      + VL     G
Sbjct: 132 IRKLLAVLDDPFTRFLEPSRLAALRRGTAGSVTGVGLEITY--DGGSGKDVVVLTPAPGG 191

Query: 221 PAQTAGIRQGDEILSVDGEDVRGKSAFDVSSKLQGPSETFVTVEVKHGNCGP--LQSVKV 280
           PA+ AG R GD I++VDG  V+G S +DVS  LQG +++ V V V H    P   +++++
Sbjct: 192 PAEKAGARAGDVIVTVDGTAVKGLSLYDVSDLLQGEADSQVEV-VLHAPGAPSNTRTLQL 251

Query: 281 QRQLVARPPVFYRL------EKIDNGATS--VGYVRLKEFNALARKDIVTAMKRLQDQGA 340
            RQ V   PV +          +  GA    +GYVRL  FN+        A   L  QG 
Sbjct: 252 TRQKVTINPVTFTTCSNVAAAALPPGAAKQQLGYVRLATFNSNTTAAAQQAFTELSKQGV 311

Query: 341 SSFVLDLRDNFGGLVQAGIEIAKLFLDKGDTVIYTAGRDLSQNAFVAESAPLVKA-PVMV 400
           +  VLD+R+N GGL  AG+ +A++ +D+GD V+    + + ++ + A+   +  A P++V
Sbjct: 312 AGLVLDIRNNGGGLFPAGVNVARMLVDRGDLVLIADSQGI-RDIYSADGNSIDSATPLVV 371

Query: 401 LVNHNTASASEIVASALHDNCKAVLVGEKTYGKGLIQSVFELHDGSGVVVTIGKYVTPHH 457
           LVN  TASASE++A AL D+ + ++ GE+T+GKGLIQ+V +L DGSGV VT+ +Y TP  
Sbjct: 372 LVNRGTASASEVLAGALKDSKRGLIAGERTFGKGLIQTVVDLSDGSGVAVTVARYQTPAG 431

BLAST of Spo30040.1 vs. ExPASy Swiss-Prot
Match: CTPA2_ARATH (Carboxyl-terminal-processing peptidase 2, chloroplastic OS=Arabidopsis thaliana GN=CTPA2 PE=1 SV=1)

HSP 1 Score: 216.9 bits (551), Expect = 5.000e-55
Identity = 138/370 (37.30%), Positives = 204/370 (55.14%), Query Frame = 1

		  

Query: 103 SAPKLGTNEG--IVEEAWEIVNDSFLDTGRSRWSPESWIRKKEDILSTS-IKTRSKAHNI 162
           S P  G  E   +  EAW  ++ +++D     ++ +SW R +E  L    + TR + +  
Sbjct: 121 SPPSWGLTEENLLFLEAWRTIDRAYID---KTFNGQSWFRYRETALRNEPMNTREETYMA 180

Query: 163 VQKMLASLEDPYTRFLSPEEFSKM---ARYDMTGIGINVREMPDSSGS-FKLKVLGLVLD 222
           ++KM+A+L+DP+TRFL P +F  +    +  +TG+G+++     S G    L V+     
Sbjct: 181 IKKMVATLDDPFTRFLEPGKFKSLRSGTQGAVTGVGLSIGYPTASDGPPAGLVVISAAPG 240

Query: 223 GPAQTAGIRQGDEILSVDGEDVRGKSAFDVSSKLQGPSETFVTVEVKHGNCGPLQSVKVQ 282
           GPA  AGI  GD I  +D       + +D +  LQGP  + V + ++ G     + + + 
Sbjct: 241 GPANRAGILPGDVIQGIDNTTTETLTIYDAAQMLQGPEGSAVELAIRSGP--ETRLLTLT 300

Query: 283 RQLVARPPVFYRLEKIDNGATS---VGYVRLKEFNALARKDIVTAMKRLQDQGASSFVLD 342
           R+ V+  PV  RL ++    ++   +GY++L  FN  A   +  A++ L+    ++FVLD
Sbjct: 301 RERVSVNPVKSRLCELPGSGSNSPKIGYIKLTTFNQNASSAVREAIETLRGNNVNAFVLD 360

Query: 343 LRDNFGGLVQAGIEIAKLFLDKGDTVIYTAGR------DLSQNAFVAESAPLVKAPVMVL 402
           LRDN GG    GIEIAK +LDKG  V     R      D   +  +A S PL      VL
Sbjct: 361 LRDNSGGSFPEGIEIAKFWLDKGVIVYICDSRGVRDIYDTDGSNAIATSEPLA-----VL 420

Query: 403 VNHNTASASEIVASALHDNCKAVLVGEKTYGKGLIQSVFELHDGSGVVVTIGKYVTPHHL 457
           VN  TASASEI+A AL DN +A++ GE TYGKG IQSVFEL DGSG+ VT+ +Y TP H 
Sbjct: 421 VNKGTASASEILAGALKDNKRALVYGEPTYGKGKIQSVFELSDGSGLAVTVARYETPAHT 480

BLAST of Spo30040.1 vs. TAIR (Arabidopsis)
Match: AT5G46390.2 (Peptidase S41 family protein)

HSP 1 Score: 551.2 bits (1419), Expect = 6.300e-157
Identity = 286/452 (63.27%), Positives = 352/452 (77.88%), Query Frame = 1

		  

Query: 47  KKSSIGAAVAAAALSFSLLLYPPVS---------IAANSPASDQSSSSDEY------C-- 106
           KKS IG    A  LS +L+   P+S         ++ N P+S   SS + +      C  
Sbjct: 42  KKSVIGTLTGA--LSLTLVFSSPISSVAATNDPYLSVNPPSSSFESSLNHFDSAPEDCPN 101

Query: 107 --REDGGLLGDDMALSAPKLGTNEGIVEEAWEIVNDSFLDTGRSRWSPESWIRKKEDILS 166
               D  +  DD+    P+L TNEGIVEEAWEIVN +FLDT    W+PE+W ++K+DIL+
Sbjct: 102 EEEADTEIQDDDIE---PQLVTNEGIVEEAWEIVNGAFLDTRSHSWTPETWQKQKDDILA 161

Query: 167 TSIKTRSKAHNIVQKMLASLEDPYTRFLSPEEFSKMARYDMTGIGINVREMPDSSGSFKL 226
           + IK+RSKAH +++ MLASL D YTRFLSP+EFS+M++YD+TGIGIN+RE+ D  G+ KL
Sbjct: 162 SPIKSRSKAHEVIKNMLASLGDQYTRFLSPDEFSRMSKYDITGIGINLREVSDGGGNVKL 221

Query: 227 KVLGLVLDGPAQTAGIRQGDEILSVDGEDVRGKSAFDVSSKLQGPSETFVTVEVKHGNCG 286
           KVLGLVLD  A  AG++QGDEIL+V+G DV GKS+F+VSS LQGPS+TFV ++VKHG CG
Sbjct: 222 KVLGLVLDSAADIAGVKQGDEILAVNGMDVSGKSSFEVSSLLQGPSKTFVVLKVKHGKCG 281

Query: 287 PLQSVKVQRQLVARPPVFYRLEKIDNGATSVGYVRLKEFNALARKDIVTAMKRLQDQGAS 346
           P++S+K+QRQ+ A+ PV YRLEK+DNG  SVGY+RLKEFNALARKD+V AMKRL D+GAS
Sbjct: 282 PVKSLKIQRQVNAQTPVSYRLEKVDNGTVSVGYIRLKEFNALARKDLVIAMKRLLDKGAS 341

Query: 347 SFVLDLRDNFGGLVQAGIEIAKLFLDKGDTVIYTAGRD-LSQNAFVAESAPLVKAPVMVL 406
            FV+DLRDN GGLVQAGIE AKLFLD+GDTVIYTAGRD  +Q   V++  PL+ AP++V+
Sbjct: 342 YFVMDLRDNLGGLVQAGIETAKLFLDEGDTVIYTAGRDPEAQKTVVSDKKPLITAPLIVM 401

Query: 407 VNHNTASASEIVASALHDNCKAVLVGEKTYGKGLIQSVFELHDGSGVVVTIGKYVTPHHL 466
           VN+ TASASEIVASALHDNCKAVLVGE+TYGKGLIQSV+EL DGSGVVVTIGKYVTP+H+
Sbjct: 402 VNNRTASASEIVASALHDNCKAVLVGERTYGKGLIQSVYELRDGSGVVVTIGKYVTPNHM 461

Query: 467 DINGNGIEPDFSNIPRWSEVKEYLSTCNRLTQ 479
           DING GIEPDF N+P W EVKE LS C+ L Q
Sbjct: 462 DINGGGIEPDFRNLPAWDEVKERLSKCSILQQ 488

BLAST of Spo30040.1 vs. TAIR (Arabidopsis)
Match: AT4G17740.1 (Peptidase S41 family protein)

HSP 1 Score: 216.9 bits (551), Expect = 2.800e-56
Identity = 138/370 (37.30%), Positives = 204/370 (55.14%), Query Frame = 1

		  

Query: 103 SAPKLGTNEG--IVEEAWEIVNDSFLDTGRSRWSPESWIRKKEDILSTS-IKTRSKAHNI 162
           S P  G  E   +  EAW  ++ +++D     ++ +SW R +E  L    + TR + +  
Sbjct: 121 SPPSWGLTEENLLFLEAWRTIDRAYID---KTFNGQSWFRYRETALRNEPMNTREETYMA 180

Query: 163 VQKMLASLEDPYTRFLSPEEFSKM---ARYDMTGIGINVREMPDSSGS-FKLKVLGLVLD 222
           ++KM+A+L+DP+TRFL P +F  +    +  +TG+G+++     S G    L V+     
Sbjct: 181 IKKMVATLDDPFTRFLEPGKFKSLRSGTQGAVTGVGLSIGYPTASDGPPAGLVVISAAPG 240

Query: 223 GPAQTAGIRQGDEILSVDGEDVRGKSAFDVSSKLQGPSETFVTVEVKHGNCGPLQSVKVQ 282
           GPA  AGI  GD I  +D       + +D +  LQGP  + V + ++ G     + + + 
Sbjct: 241 GPANRAGILPGDVIQGIDNTTTETLTIYDAAQMLQGPEGSAVELAIRSGP--ETRLLTLT 300

Query: 283 RQLVARPPVFYRLEKIDNGATS---VGYVRLKEFNALARKDIVTAMKRLQDQGASSFVLD 342
           R+ V+  PV  RL ++    ++   +GY++L  FN  A   +  A++ L+    ++FVLD
Sbjct: 301 RERVSVNPVKSRLCELPGSGSNSPKIGYIKLTTFNQNASSAVREAIETLRGNNVNAFVLD 360

Query: 343 LRDNFGGLVQAGIEIAKLFLDKGDTVIYTAGR------DLSQNAFVAESAPLVKAPVMVL 402
           LRDN GG    GIEIAK +LDKG  V     R      D   +  +A S PL      VL
Sbjct: 361 LRDNSGGSFPEGIEIAKFWLDKGVIVYICDSRGVRDIYDTDGSNAIATSEPLA-----VL 420

Query: 403 VNHNTASASEIVASALHDNCKAVLVGEKTYGKGLIQSVFELHDGSGVVVTIGKYVTPHHL 457
           VN  TASASEI+A AL DN +A++ GE TYGKG IQSVFEL DGSG+ VT+ +Y TP H 
Sbjct: 421 VNKGTASASEILAGALKDNKRALVYGEPTYGKGKIQSVFELSDGSGLAVTVARYETPAHT 480

BLAST of Spo30040.1 vs. TAIR (Arabidopsis)
Match: AT3G57680.1 (Peptidase S41 family protein)

HSP 1 Score: 211.5 bits (537), Expect = 1.200e-54
Identity = 124/383 (32.38%), Positives = 220/383 (57.44%), Query Frame = 1

		  

Query: 92  DGGLLGDDMALSAP-----KLGTNEGIVEEAWEIVNDSFLDTGRSRWSPESWIRKKEDIL 151
           D   L + + ++ P     ++ T +  + EAW ++ ++F+D     ++ + W  K +  +
Sbjct: 94  DSPALAESLTIAFPVSRAREVTTVQRTLVEAWGLIRETFVDP---TFNHQDWDFKLQQTM 153

Query: 152 STSIKTRSK--AHNIVQKMLASLEDPYTRFLSPEEFSKM---ARYDMTGIGINVREMPDS 211
                 RS   A+  ++ ML++L DP+TR ++P+E+      +  ++ G+G+ +   P +
Sbjct: 154 VEMFPLRSADAAYGKLKAMLSTLGDPFTRLITPKEYQSFRIGSDGNLQGVGLFINSEPRT 213

Query: 212 SGSFKLKVLGLVLDGPAQTAGIRQGDEILSVDGEDVRGKSAFDVSSKLQGPSETFVTVEV 271
                L V+  V   PA  AGI +G+E++ ++GE +    +   + KL+G   TFVT+++
Sbjct: 214 G---HLVVMSCVEGSPADRAGIHEGEELVEINGEKLDDVDSEAAAQKLRGRVGTFVTIKL 273

Query: 272 KH----GNCGPLQSVKVQRQLVARPPVFYRL---EKIDNGATSVGYVRLKEFNALARKDI 331
           K+    G    ++ VK+ R  +   P+   +      D      GYV+L  F+  A  D+
Sbjct: 274 KNVNGSGTDSGIREVKLPRDYIKLSPISSAIIPHTTPDGRLAKTGYVKLTAFSQTAASDM 333

Query: 332 VTAMKRLQDQGASSFVLDLRDNFGGLVQAGIEIAKLFLDKGDTVIYTAGRD-LSQNAFVA 391
             A+  +++Q   S++LDLR+N GGLV+AG+++A+L+LD  +T++YT  R+ ++    + 
Sbjct: 334 ENAVHEMENQDVQSYILDLRNNPGGLVRAGLDVAQLWLDGDETLVYTIDREGVTSPINMI 393

Query: 392 ESAPLVKAPVMVLVNHNTASASEIVASALHDNCKAVLVGEKTYGKGLIQSVFELHDGSGV 451
               +   P++VLVN  +ASASEI+A ALHDN +A+LVG +T+GKG IQS+ EL+DGS +
Sbjct: 394 NGHAVTHDPLVVLVNEGSASASEILAGALHDNGRAILVGNRTFGKGKIQSITELNDGSAL 453

Query: 452 VVTIGKYVTPHHLDINGNGIEPD 457
            VT+ KY++P   +I+  GI PD
Sbjct: 454 FVTVAKYLSPSLHEIDQVGIAPD 470
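
The percentages in the HSP header follow from the match counts divided by the alignment length (383 columns). The following is a minimal Python sketch that recomputes them from the header text. It assumes the header format shown above, and the function name `hsp_fractions` is hypothetical and not part of any BLAST tool.

```python
import re

# Header values taken from the AT3G57680.1 HSP above.
HSP_HEADER = "Identity = 124/383 (32.38%), Positives = 220/383 (57.44%), Query Frame = 1"


def hsp_fractions(header):
    """Return {name: (matches, alignment_length, percent)} for each 'x/y' field.

    Hypothetical helper: it only looks for 'Name = x/y' patterns, so fields
    without a fraction (such as 'Query Frame = 1') are skipped.
    """
    stats = {}
    for name, hits, length in re.findall(r"(\w+) = (\d+)/(\d+)", header):
        hits, length = int(hits), int(length)
        stats[name] = (hits, length, round(100 * hits / length, 2))
    return stats


stats = hsp_fractions(HSP_HEADER)
for name, (hits, length, percent) in stats.items():
    print(f"{name}: {hits}/{length} = {percent}%")
```

The recomputed values can be compared with the percentages printed in the header to catch transcription errors.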

The following BLAST results are available for this feature:
BLAST of Spo30040.1 vs. NCBI nr
Analysis Date: 2018-06-29 (blastp Spinacia oleracea peptides vs. NCBI nr)
Total hits: 5
Match Name                         E-value   Identity (%)  Description
gi|902232875|gb|KNA22820.1|        8.0e-271  99.7          hypothetical protein SOVF_0305...
gi|731373636|ref|XP_010666703.1|   1.4e-206  81.2          PREDICTED: carboxyl-terminal-p...
gi|743903903|ref|XP_011045311.1|   3.8e-172  69.9          PREDICTED: carboxyl-terminal-p...
gi|224114365|ref|XP_002316739.1|   1.1e-171  69.7          peptidase S41 family protein [...
gi|1000965308|ref|XP_015574941.1|  1.9e-171  68.6          PREDICTED: carboxyl-terminal-p...
BLAST of Spo30040.1 vs. UniProtKB/TrEMBL
Analysis Date: 2018-06-29 (blastp Spinacia oleracea peptides vs. UniProtKB/TrEMBL)
Total hits: 5
Match Name        E-value   Identity (%)  Description
A0A0K9RTF0_SPIOL  5.5e-271  99.7          Uncharacterized protein OS=Spi...
A0A0J8B7H3_BETVU  9.6e-207  81.2          Uncharacterized protein OS=Bet...
B9I114_POPTR      7.7e-172  69.7          Peptidase S41 family protein O...
A0A067GHB1_CITSI  1.1e-170  69.4          Uncharacterized protein OS=Cit...
F6HMG8_VITVI      1.5e-170  69.8          Putative uncharacterized prote...
BLAST of Spo30040.1 vs. ExPASy Swiss-Prot
Analysis Date: 2018-06-29 (blastp Spinacia oleracea peptides vs. ExPASy Swiss-Prot)
Total hits: 5
Match Name   E-value   Identity (%)  Description
CTPA1_ARATH  1.1e-155  63.2          Carboxyl-terminal-processing p...
CTPA_SYNP2   3.2e-70   41.2          Carboxyl-terminal-processing p...
CTPA_SYNY3   7.1e-70   38.3          Carboxyl-terminal-processing p...
CTPA_ACUOB   3.8e-55   36.1          C-terminal processing peptidas...
CTPA2_ARATH  5.0e-55   37.3          Carboxyl-terminal-processing p...
BLAST of Spo30040.1 vs. TAIR (Arabidopsis)
Analysis Date: 2018-06-29 (blastp Spinacia oleracea peptides vs. TAIR)
Total hits: 3
Match Name   E-value   Identity (%)  Description
AT5G46390.2  6.3e-157  63.2          Peptidase S41 family protein
AT4G17740.1  2.8e-56   37.3          Peptidase S41 family protein
AT3G57680.1  1.2e-54   32.3          Peptidase S41 family protein
InterPro
Analysis Name: InterPro Annotations of S. oleracea
Date Performed: 2018-06-29
IPR Term   IPR Description                       Source    Source Term    Source Description            Alignment
IPR001478  PDZ domain                            GENE3D    2.30.42.10     -                             coord: 195..275; score: 1.8
IPR001478  PDZ domain                            PFAM      PF00595        PDZ                           coord: 189..261; score: 3.
IPR001478  PDZ domain                            SMART     SM00228        pdz_new                       coord: 189..265; score: 3.8
IPR001478  PDZ domain                            PROFILE   PS50106        PDZ                           coord: 180..250; score: 10
IPR001478  PDZ domain                            unknown   SSF50156       PDZ domain-like               coord: 163..282; score: 7.36
IPR004447  C-terminal-processing peptidase S41A  TIGRFAMs  TIGR00225      TIGR00225                     coord: 143..458; score: 4.7
IPR005151  Tail specific protease                PFAM      PF03572        Peptidase_S41                 coord: 298..457; score: 2.3
IPR005151  Tail specific protease                SMART     SM00245        tsp_4                         coord: 267..461; score: 1.0
None       No IPR available                      GENE3D    3.30.750.44    -                             coord: 109..194; score: 1.5
None       No IPR available                      PANTHER   PTHR32060      FAMILY NOT NAMED              coord: 53..474; score: 1.7E
None       No IPR available                      PANTHER   PTHR32060:SF8  PEPTIDASE S41 FAMILY PROTEIN  coord: 53..474; score: 1.7E
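
The `coord` ranges in the InterPro alignments give the position of each predicted domain on the protein. The following is a minimal Python sketch that converts a few of them into lengths. It assumes the coordinates are 1-based and inclusive, which this page does not state; the `span` helper and the `HITS` list are hypothetical names used only for illustration.

```python
import re

# (source, source term, alignment) triples copied from the InterPro entries above.
HITS = [
    ("PFAM", "PF00595", "coord: 189..261"),
    ("SMART", "SM00228", "coord: 189..265"),
    ("PFAM", "PF03572", "coord: 298..457"),
    ("SMART", "SM00245", "coord: 267..461"),
]


def span(alignment):
    """Return (start, end, length) for a 'coord: a..b' string.

    Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
    """
    match = re.search(r"coord: (\d+)\.\.(\d+)", alignment)
    start, end = int(match.group(1)), int(match.group(2))
    return start, end, end - start + 1


for source, term, alignment in HITS:
    start, end, length = span(alignment)
    print(f"{source:6} {term:8} {start:>4}-{end:<4} {length} aa")
```

Under that assumption, the PDZ hits (PF00595, SM00228) end before the Peptidase_S41 hit (PF03572) begins.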

GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0016556      mRNA modification
biological_process  GO:0006508      proteolysis
biological_process  GO:0008152      metabolic process
biological_process  GO:0007165      signal transduction
cellular_component  GO:0031977      thylakoid lumen
cellular_component  GO:0016021      integral component of membrane
cellular_component  GO:0016459      myosin complex
molecular_function  GO:0005515      protein binding
molecular_function  GO:0008236      serine-type peptidase activity
molecular_function  GO:0003779      actin binding
molecular_function  GO:0005524      ATP binding
molecular_function  GO:0003774      motor activity
molecular_function  GO:0004871      obsolete signal transducer activity
RNA-Seq Expression
   



Co-expression
Gene      r value
Spo24964  0.67