Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: polypeptideCDSexonthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGAAGATAGCGGAGGTGAGTGTCATCACCTGATCGAAGACTGCAGAAGGCTAATTCGATTGAATGACTGGGAAGTCGACATAATCCATGTGTATAGGGAGGAAAACCGAGCTGCGGACTGGCTTGCTAATAATGGGGTCGATAATGCCAACAATTTGTCCATCTTAGGGTCAACACCTGTAGCACTCGCAAGGATCTTGGAGGAAGATAGAATGGGGATCAACAACCCTCGTTTCATCCCCCTTGAAGGCTTAAATAGGCTAGCAAAGTAGGAGGTTGTTAGTGTTTTGTTCTTTTCCTTTCGATTTTTGCTTTACTTTATTTTTCTTCTTTGTAATTTTGCCTTTTTCTGGATTTCTATATCCTCATGAGAATCAAAAAAATAAAAATAAATAAAAAACTTTTCTTTAAGAAAATAAAAAGAAAAACTCTTCTCAAATCAACCGACTTTCCAGTCATAAACTCTCACGGGTCACTCAGCTTTCAAGAGAGATTTCCTCCGCTATTGCTCACTGCCTTCGCTGCAAATACTCCGGCAAGGTGGCTCTGCGGCGACGCTTGCTACCACACCGCTCGCCGGCTCTCCGAATATCTCAACATGGGTCGACCCGAACCCTGTGTACTCTTTGCCCAGACCTTCGTTCACCCTCAGTTGGACGAATACGTTGACGAAGTAATCCCTTTTTTTCTCTCTTCTATTGTTCGTCATTGTTGCAATTTAGTAATTTTTGTTAATTTTTAAATAAGTTGGAAATTTCCCGTCACTTATATTCCAAGATATTTGGGATTTGGGGTTTATTGGATGCTGATTGAATTAGTTTAGTTTTTTTTTTGTATTATGAAAATTTATGGTCTAAGTACTTTTTGTGAATTTCTGCCTTAATTTTGTGGATGTTTCTGATTGATTTAATTCTTTGTTAGGTCATGTTTGCTGAACCTATTGTGATATCCGCCTGTGAGTTTTTGGAGCAAAATGCGCCGTCAGCTTCGTCTGCCGTCACTCTTCTCGGGTATATATATGACATAACACTTGCTTACATTGTTCTTCTTCTTCTTCTTCCTTTTTCTTTTTATTTATTTATTTAATATTACTTTTATTTCTGATGGCAATATTCAAAGCTAACCAATTGGACAAGAAAATTTGCAGTCTTTGCACTAACTTTCATTTTTGTTACCGTGTCCTGTAAATACAATTTGAGGGTTTAGCTTTTGAGTCAAGGGTTAGCTGAAATGAAAAATATCATGATATCATCAAGGTATCCTGGGGATGAACAAGTTTACTTGTAGGTCGGATCAACAAGGAATATCTAATGGAAAAATCTTAACAAAGGTGGGTAGAATGGTAGATTATCCATCTAATATTAAGTGCATATTTTGTTTTCTTCTAAAATCTATCTCATGTTAGTAGATGCCTGTGGGTAAAAGTAGTCTAAAGCCCAATTTATGCCTGTTTAGTTTTAAGAAAAGATATCAGTTTGAAAAGCAAGCCTTTATAAAGAGGCTAGCAAGCAGTTAGGAAAGATAAGTGTAACCAACATTTAGAAACGAAGGTAGGATATCTTATTTTTGTTTTATTATAATCATCTTTAAACCTAACTATCTTTCTTCTAAAACAGTGTTAAAGTAGTAATTATCTCATAGGCATCTGATTCCTTGCCCATTATACTTTATGGATAGAAGATTATTAATTTTGTGAAAAAATTGCCGTCTTTGATTTCTTTTTTCCAGGGCTACTTCACCTCCTTCTTTCGCTTTGGAGGTATTTGTTCAGTGTGAGGGGGAAGCGAGATTTAGGCGTCTTTGCCAACCATTTCTATACTCACATTCGTCATCAAACGTTCTTGAAGTTGAGGTAGTCAATGCTTCATGTTTTGCAAATTTGTACTTGGGAATGAATTTCTTTAATTGTTATAATAGTTATTTACATACATATTTTCACATAGCCGTGTTTATGGATATAACAATCAAATTTGTGTATGTTATGATGACATTTCTCTTACAAGTAAGTAACACAATAGGTGAAAAAGGGGATTTTAAGTTTTAAGGATGGTTCTAAATTACATCATGATATTTTTATATTACATTTATGTGGAATACTAGACCTATGCAATGGACCGGGTCAGGGTCGGGTCAGAGGTTAAGTGGACCGGCAGATTGGGTCATGCTCATCTTTTAATGGACCGGGTCGGGTCATGTTCCAGTTATCATGACCCGACCCACCTCGTCCCGTTTAAATTTCTGAAAATAAATATAAAATCAATTCCATTTTCTTCTTTTTCATCTTTGAAGCAAGACAGTAACACTTTAGTCAACTAAATAAAACGCATCCTCTAAATAAAAACAAATATAGTACCCTAATCGGGAAAAACTTATTCACGACAACTTAAAGCAAGACAGTAACAAATTAGTTAACTAAATAAAAAACCCAAATTCAGCAAAAAAAGAACAATAAAACCAATTCTTAGCGCATAACGCCAAGATCTTTGAAGCAAGATCTATCAATAAATTTACCTCAATTAGCGCATAAAATATATGCAAACCTTCAAAAAAAAAGGCAGGGAATGTGAATTATTATTATTATTATTATTATTATATAAACTGAGATAATTTTCAAACTAGCTTAGAAACATCGACAAATGTTTAATTCAGTTACCCATTATTATAATCTAATAATGGAAGAATATCATCAAAATGAAATAATGGGTATCCTGAATTTTAAAATTTGGGAATTCAATATAATAATTTTGCGGAGATTTTAGTCGGACAATGAAATTACAATTAAGCTGGTTAAAAAATTTTGAGGAACTCATTTGGGGTATTGACGTTGCAAATTTAACAGCAGAGGAAAGAGAAAGAGAGAAAGCTAGAGGAGTTGGGATGATATTCATATTTTACGTTCGTTTCTTCAACAGAGAGAACTCGAAATAAAATTGACTCTTTAAAAGCCCATCAGGGCCCGGCCCAACCCATTTAGTGTTAAAATTATGTTGACCCTTTAAGGACCCACACCCATTTATGTAGAGGGTCGGGTTTGACCTTGTAATTTCAGGGTCGGGTCAGGGCCCTTAATGGGCGTCCGCCCCATTGCACAGGTCTAGTAAATACTGATATATATTAGAATGCAACCTTCACTGAAGGATCACAAAAATTAGGAGGTTATGAGTCTAGTTTTTGTTGTTTTAATTTGGATCTTCACTGTGTCTGACAATATTTTCAACAACATTAAGCTACCAAAGAATATTTTGGATTGTCTTCCTATGGATCTGTCTTGATTCTGTCGGTTAATATTTAAAGAGGCTAATTCACCTCTCTTAAGTTGCACTCCTGCTATTAAAATACTTGGCATAGGAACTTGTAATGTGTTTTACTTTGTGTGCAGTGCCCTGAAGTTGTCAACTAGTACCTTTGTGATTCTTGAGGTTACGAGTTTAGTCTTTTGGTATAAAGAGGGCGACTAATAAAGGAAAAATTAGGTCGTTACTTGTCTCTACTAAATCAAACAAAAACCCAACATCACCAAACCAAGTAAAGTAATAAGTAAGGGTCGAAATCCACAAGGACTAAGGTACACCAGTTTATGCTAATCAAAGAATAAGCTAAGTCGGAATCAAGTTGAAAATTGTCAACTAACAAGTAACAACTAATGACTAATAAAGTTAACTACAAAGGACAAACAAGGTTAAGGAATGAGATCAAACACCAACAATTCTGAAACAAGTATATGAGGAACAAATAAGAGTAATGATGAACAAACAAAGAAGGCGTTGGTTAGAGATATATCCCCACATGACTCCCGCCTTGGACCAATTCTTAAAAAGCTGAAAAGTCTTTGTACGATCAACATCACTAATCGCCTAACTCTAACAAAGGGGCGCGCTAACACCTGAAAAAGGGAGGATGATCAATCCAACTAAAACTATAGTCAAATGCGAACTCCTGCCCTACACACAAGTACACAACTATCCTTAGGTCAAGTTTAGACACGAACTAAACAATAGTCAAGATTTAGGTCACGATCCCTTGATGTTCCTAAAGTTAATTACCCCAAAGTTAAGGTCACACGCAAGTAGCTAATTTGCATATAACAAAGTTGGAGTATGCTCAAACAAGCAATGATTTAATCATACAAAATCACCAAATTCAAATTCAAATTCAAATTCAAATACCCAAATCAATTATTGTTGGGTTTTCACCGAAACCCTAACCACGAAATTACTCCATCAGCATAAAAATAGGAAATTGAAAGAAATTAACAACTAGAAACATGAAACTAATCAATAGTAACAATTATGCTGACCATGAATCATAAAACTACGCAAAAACAATTATACTAAGCATGAACAATTAAACTAATCAAAGTAACAAATAAATCAATAAATGTGAAGAGAGAGTAAGAAAATCTCATTGATTAACCGGTGGATGTTCAAATCATCAAGTTCCCCATCTTTGATTCTTCAAAATCCATTTTCAACTCATTGTTTTTCTCACTTTCTCTCTCTAAAACCAATTAAAATCCTAGAAATATTGAAATTAATCTAATATGTACAATCTAATTGTCATAACCAAAAGTTTGGTATTATACTACAAGCCTAAAAAGAATTAAAAGATAGGAGATTAAAAAGGAAAAGATAAAGAAAAATAATTTAAAAGGTATTAAAAGGAGAAAAGAGAAAAAAGAAAAAGAAACTAAAACCCAAACCAAACATATCAAACCCAAGTTCCTTCGTTTCTTTACTGCTCTTCCCCTTCTTTCATCTTCCTTCATCCACCATCATCATGATCGTTCATCTTCCTCAGGGGCGGATCCAGGAATTTACAAATTGGGGTCCAAAATATTTTAATTTCTCATATTTGCTTTTTGTAAGGGGTTAAAGTTGAAAGGTAAGTAGTCAAAATATTACAAACGCCATAATAAAATCGGGGGGATTTAGATTAGTGGAGGATTTATTAGGTTTCATAACAATAAAACTCAATACCTAGTCTAAATTACAACAATAACTTGGCTCGATTTCAATAATAAAAAAATATGAGTGTAAAGGACGAGTTTTCATACTTTGAAGGAGATTTAAGTTATTACAATGGTCGTTGAACGTTACACAATGACACTAATAACAAATGGAACATCATTCTTAATGCTTATGGTGCGTTAAGGTATTATATAATTAGGCCTTTAAATAGTTCAAAGAAACATGTAACAAATAAGTCTATAATAAGCAACAAAAGAAAAAAGAAAGTGATAAAAAAAAAAGACAAACAAAGAGAAAGTAAATTAACAAAAATAAAACCATGAAAAATTTAGAGTTCAACAAAGGACAAATAAAAGACCAAAAAATGAAGGAAAATTTGTCATAAAGGACCTTTTAAAACACAAAAATTGTGAGAAAGGACCTTTTAAAAAAAAGTTGTGAAAGAAGGCCCAATTGGGTTTTTTTTGTTGTGAATAAGGACCATAAGCGATTTTTCGGTAGTCAACGACTGTTTTCCGGCGTTGACCATCACGTGTCTTAAATTTTAACATTTTTTTAAAAAAAAGTTTTTCTCCAATATTTTCCCCCATCCCCAACTAACGTTTTCAGATGTAAAAAAAAACAAGCAATTGTGTAAAATCAGAATTAAACTCCTGTATAGTAAAAGTAAAATCAGAAGAAAAAAAAAACCTAAAATACCCTAATTTTAAGAATCCGAAAATTGGAGAAATTTAAGGAAAAAAAATTGGGGCTTTTAGTGATACCTCGTACGAAAAGGACCGACAAGCAATTGGGTTGCGCGAATTTGATTGATTTCAACTGAGGAGAAGAAATTCGTGGTTGGAAGGAAGAAGAAAATGGCGGAGCAGCACATAGCAGGAGAGAGAAAATCGGTTTTTTTTTTTTTTTTTTTTTACATCTGAATAAGTTAGTTGAAGGTGGGGGAATATTTTTTTTAAAAAATAATTGTTAAAATTTAAGACACGTGCGTGTCACGTGATGGTCAACGCCGGAAAACAGTCGTTGACCACCGGAAAATCGCTTATGGTCCTTATTCACAACAAAAAAACCCAATTGGGCCTTCTTTCACAACTTTTTTTTTAAAAGGTCCTTTCTCACAATTTTTGTGTTTTAAAAGGTCCTTTCTGACAAATTTTTCAAAAATGAATGAAAAAAGCCCCCATTATTGAGAATCGAACTTGTGTCCCCTATTTAGCATATTTATTAAGCAACCAATAAGCTACAAGTATATTTCATTAACTTTAGTGTAAGTTAATGATTAATATATTAATTATTTCAAGTGCAATTAATTATTTCAACATTAATTTAACAGTTTTTTTTAAAAAAATTGGGGATCCCTTTGGACCCCTCCAGGACCATGTAGGTCCGCCACTGATCTTCCTCTTCTAAGTTAATCATACTGAATTTGTCACTCAACATCAGTTCTACCGGACCGTTCATTTTCATTATTCCGGTACAAAAATCCGCGCTCAAATGTTTGATCGAACTTAAAAGAGGTTCAATCGGATCCAATTTTGGTCCGGGCAAACTTAAGGGAGGTTCAGTAGGAACCGAAATCTTCAATTTTGAAAGAATTTTTCCTAAATTTGGGTTTGGTCTGTATTTGGGCTGTAATGTATGGTGTGGGGAGTGCTGAAGGATGCTGCAATTGGCGTTTGGTAGAGGTGGAGCTTAGACTGTTGTGCAATGATGCTGCGAGACTGCTGTTGTTGGCGGAGGAGGAGAGATGGAGGTGCAACGCTGTCTTGTTTGGGTGGTGTAGGCGAGTTGGGGAAGATGAGGGAGTGTTGGGGCTTCCATTGACTGTTTGTGTGGGCGAAGAGTGCTGATGCCCGGTGGTGCATGCGGGAAAGAGAAAGGCTGTGTTGTTTGGAGAGGGAGAAGATGGAGATGATGCATGGCTTATTGATGTGGTGTAGGATAAGATAAAGGTGGTGTATGATAAGGCTGTTGTTGACCATGTTGCATAATTAATTTTTGCCTCGAAACTTGTACTCCGGCACGGCGAACAAAAATCACCTACACAATTAATACTGAAAATTATTAGGTACACCCGGGTTCACCATGCACCCAAGGATATTTTTATGGCCATTGATAGTCAACTATCACATTTCCTCCTCACATGTAAGAAGAAAACGTGAGAGGGTGCACCGTGCAACCGGGTGCATGTAAATATGTAATATGTTCTAAATTAATACGTGGGAAAAAATGGAAAATAGAAAATAAAACGAAAATAAAACAATAAAGACGAGAGTTACTAAAAAGAAATAAAAATAGAAAGTCAATTACGGAAAGTAAGCTAATTAAACTATAAATTATAAAATAAAGAAAAAGAGATAAATTACGCAATTAAGGCCTAAAATCTAATAAAAACCAATAGAAACATTAGCTAAGCTATAAAATAATAAAATAATAAGAAGTAAAGGCTAAAAGGGATTAAAATAGATTAAAATATAAACAAAAGTACCTAAAAACTACGCCTAGGAGTGACAATTTTTGACACTCATCAGCGACATTGTCTACAACTATTTTACTGATGAATTTCTGTAGGAGTATGAGCTGTAAATCTGCAATATGTCTAATATTCTCCATGTTACCTTTTCCAGACCCAGAAAAATTGATTTTAACACTGATTTTATTCTTTTCAAGAAGGTAGCGTAATGCGTTAAGTATTCATTTAGACTTTAGGAGTTGGACTGCCAGTTTCCTATGAGAAATGTGGTCTGCGGAAACGTAGCATGCTGTTGCTGTTTGGTACTAGCTGGTTTAACATTATAATTACCTCTTTTTTGGATAGAGAGGGAGTGCATAGAAACATGTTTCCACGAGACAACCATGGCTGTCAGTGTTATGGTGAATGTTGACATTTTTCTCTTGCAGTTTCTTCTTAATCGACCAGAGGCCTTTGTTTAATTACTTCTATAGAATACATGGAAGTATAAACAGTTCACAACATTTGGCTTCAAAGGAACTCGAACATTTTTAGCAATGATAATTTTATGTAGTACTTAGTAAAATTCTGTTTAATATTGCTTGTAGATGTGATGATGAGATGAGAGCTGTTCTCATTTATTTGGGGCTTGTGGGGCTTCATCAACCCTTCCTAGAAGGTGGGGGGAAGTTGTTACCACTAATAGACTTAGGTGTTTTGGTGTATTTACTACTTTTGGTTATAAAACTTTATTGCCAAAAAAGAGTTTCTCTTCCTTTAGATTGGAGGGGAGGGTTTCGTGAAAAGTTGTGGTTGATGCCTTGAAGAAGGGTGCATGATTTTTGCTGATAGATAAAACTTTGCAGTTTTCATGCACAGCAACCAACTCAGAAATTGAAACCTATAAAGCTCCTGTGCCTGAATATGTGTGGGGCAGGAAAACATGGAGATTTCGATTCTGGAATGACTTATTCTTTTAGGAATAAATTATTCATTATGTTATGAAGCCTTTTAAATTTATAAGATCACGTGTTCTACCTTACCATGTAAAATTACTTCTCATCTTACGTTAAAAATAATAATCCGTTTATAAATTTGAAAGGAGGTTGGTTCAGAAAATAAAAAGGGTCTGATTTAGCACCTTCATTACCATGTGCATGTGCGATAATGATATAATGCATTTTACTTTGCAAGAGTGCTTACATCATGTCTAACTAATTTCTCATTGATAAGTTCTATTCATTCTTGAGTATCCTTCTCCCTCAAATAGAAAAATTTGAAAAGTACTTTCTGTTTAGCTACTGACATATGTTTTGGCAGGCTGTGGTGACTAATCATCTAGTCGTGAGGGGCAGTTACCGCAGTCTAAGCTTGGTTGTATACGGAAACACAGCTGAAGATCTTGGGCAGTTTAATGTTGAATTTGACTTGGACAACTCTGTAGCAAAACTTGTTTCTTCTTCTGAGGCAAAGCTTGAGGACCTGCCACCTCTATTACATTCTAGAAATCCTAAATTTGAGGATGCGATATCCTCTCTGAAGGCATTGTCTTCAATGCTCCCAGCGGCAGATATTTCTGTACAGATGCGGCAGTTTTTGCAACTTATATTTAAGATATTGGATATCCCAAACTTTGGTGAATTTAGTCAGAAGGTTATGGCTTTGATTACTTCAGCAGTTGCTTCATTTTTCACAGATGACTTAGCTAGTGCTGGAATCACATTGCAAGAGTTAAATTTGGGAGCATTAGCGAATGTCAAAGGGTCTTTTCATGTTTTCACTGAGGCTCAAAGGAATCTTTTTGAGCTTTGTAAAAGATACGTAGGAGCAGGCCATATATATGGAGATTTTTTGGTAGGGTGCAGTTTTCTCGAGTCTGAAGCCGAGTTAGCCACTTCTAAAGAGTTGGTCGATATGTTGTTCCAGCATTTCCCGTTCAATAGGGATACTGTATTTGTTGGACATCCTCATCTATCAAAGGTAAATTATGATTTTTCTATTGCTATGGAAAGTCTTGTCAAATAAGTAGGAAACTAAGATATAGAATTCTGGCCCTTCTTCTGGTCATGTTTTCTCGAGGTTGGTTGATATAACTATTACTTGGAGAACTTATTTCTAGCCCCAAAATCAGTTTATAAAAATATTTTTTGATTCCTAATGCTTCTTTGTACTTGTCAAGTTTCACAGGACTATTGTTTTTGCGATCCTAGTCCCCTCTTTTTATTAACTGTAAAATTTAGAAGATGCACTATTCCTTCCTTCTGTAAAGCATATAAATGGAAGTATGGCACTTCATTATTTCTCAATACACTTGGTTTCTTATTGATGTTTTGATATGTTGACATCTTACTGATCCGTTCTTCCGTTGTTTTTTCTTTCATTTCCAGAGTAATAATGTACTTCTTTGGTTGAGTTTAGCTCTGATTCTATGTTCGGGGAAAGAGAGTTGTTTTCATTATGTAAATGGTGGAGGAATGGAGCAGCTTGCACATTTCTTTAGCCAGGAAATGCTAAATAGTGCGACAGTTACTTTGATGCTTCTTGGAGCTATGGAACAGGCAACTCGATACTCTGTTGGGTGTGAAGGTTTGTTAGGTTGGTGGCCTCGTGAAGGTGAAAAGGTCCCATCTGGTGTTAGTGAAGGATATAGCCTACTGTTAAAGTTACTTATGCAAAAACAACGGCATGATATTGCTGCATTGACGACTTATATACTTCATCGGTTGCGGTTCTATGAAGTCGCTTCGAGATATGAGGTATATTTTCTGGATTATGAATTCTTCATTATGCTACAAGAAGCTATTCTGATTTTTCTGATTGCAGTATGCAGTACTTTCCATATTGGGTGGACTCTCTGGTGATGGGAATGCTACAACCACGACCTTAAGCCTGCTATTGAAAGCCAAGTTACAGCTAAGAAAGCTCTTGGTTAGTTTTTTTTTTTGGGGTGATATCAGGGCTGCCATTATTCTATCATAAAACTAAAATCAATTTCAGTTTTTTTGATATGTTGATGCTTTGGTACTCCCTCCCATATAAGTTTCTTTTTGCATACAGACAAAGGAGTAGGTTTTGCATTGGTATTCAAAGGTGGCTTCTACTTTTCTTTTGGTACCTAAACTGACAAGTAGTCAAGTAGTAGTATATTGATAATTTATTTGACAAATATGGAATAAAGGAGTAAGTTAAGTGGTAATATAATGTGGAGTGTAGGGAGTAGAAGAGTTTGAGTCAAAATGTCTTTAAATTTTGTTTCAACCTTTGTTTCTACTTTTATGTGATGGTTAAAGGAAGTAAGATGCAAAATTGACTACTTAGATGCTTGACCAGGAAATTTCGAAGTTCTAAGGAATTTTTTGGTTGGTTGCTGTTTCAGTTCAAGGTCTTTTTATTTGTTGGGGGGTGTGAAGTGAAAATATAATGAAGTTTTTTTTAACAAACAAATTGGGTAGAAAACTACCAACCTAAGATCAATGGACTCTGAAACAACACCACCTGATTTCTAGATTTTGAAAACCATTTGTCACTTATTTTAACATTATGCACCTTACGTGTGGTAAAGGTGTTCAATTAGCAGCCATTTGTTTTTTCCCTTTTTTTTTTCTTCCTCCGATAATAGGATTCTAAAACACAACAGTTAATCTCTAGATTTGAAAACCGCACCTGGCTTATTACGTTTATAAAATATTTCGACCAGTTCATGATTGTGAGACGTGTGTTTAACTTGCACCCTTTTGTCTTTGTTGGAGGTGTCCACCCTTCTTTGCTTGTCAGAAAAAAATTTACTTTTAAGTGAACCCGCTGCGGTCTAATCTGTAGTAATGTATAAAAAAACGAGGCATTGTTTTTTTGGTGATTTCTGACTAAGTGTTCCAACTGTATGTCTTATAGTATCTAGGTAATCTATGTCTATCACGTGTTATCTTGTTCTCATGTAAAGTTGGATTATCACTTCTTGGTTTCTGTTCGTTGACAAATGTATATGCAGAAAATGATTAACTCGTCGGGTTGGGTTGAGGATCCTTCTCCAATGGCCTGTGCAATCCGATCATTAATGGTGGCATCTTCTGAAGGATCATTGTCTTTCAAAGCAACCAGCAAGCTTATTGCTTCGTCAAGTTGTCGTTTTCTGCATCAGAAAATTGATGCAAAGTTGCTCTCCCTTTTGAAGGTACTTTCTCCTAGATACATGGAATTAAAACTCGGCCGTTCCGTTATGTAACGGCCGTTCTGTAACGGGTTTTGGATTTGCCGTTCCAGGAAAGCCGAACCCATCCGCGTTTTTTTTTCTATGTCTAGATAGATTCTTGTTGAGTTTCACTTTCATTTCTTTAAATTTTTATCTTTATTTTACAGTCAGCTACTAGTGAATTATTTAATCCTGATTGCTTAATTTCGTTACAGTCCAGATTGAAACCATCAGAAAATTCAGAATCAGGGTATTAAATACGTGACCGTGGAAGTTGTAGTTCAATGCTTATAAAGATGTATAAAGGCTATTCACATATCTCATTTTCTTGCATGGTTAGAAAATATTGATCTAGTCAATAGAATTAAAGTAATTCAATACACTTACGTTCTTTTGTTGTCAAATAATATCTTTAACTTATTTGAATTTGTTATATTTATAGAATTCTCTATCACAACGATTGGTGTGATACTATGATGTGGTTGTCAAAGGGCATAACAGTGACATGATTTAAGTCTTGATAATTAAAGATATTAATGATTGAAATATTGCATTGGCAAACGTGATAAAAGAATTTTGTCAACTTATAAAGAACGGAGGAAATATATACTAAGCTGACTTCATGCTTGAATCTTGGAAATTTTGGTATCATGCCCCGAGCGCTGACAAATACGAGTTTATATTGATGGTCGATTGACTTCAGATGTTTTACTTCCCTCTTTCTTTCATTGACTTGACTTTACCAAGTCCAGTTTTTACTGAACCTTTTTCATTGGGCTGAACCCACTGTTTTTCTTGTGCTAGATTTGACACAGTTTGCTTTTATTGTAGGAGAGAGGGTTTCTTCCCTTATCAGTTGCATTAATATCGTCAACAGAGTTGCGGGCTGAAAGAGGACATGCTCTCGAAGCTTTTTTGGATTTGACGTCTTCTATTGAAGCCACAATTCTTTCACTTATGTTTTGCCGTTCAGGTTATTATGTTTCCTTTTTGTTTTGCTTCCTTGTTTTTCTTTACTTTGTGCTGTTAATAACATGCATTAGATTTGGCAGGTCTTGTCTTCCTGCTGCAGCATCCTGATCTTTCCTCCACAATCATTCGTGCATTCAAGGGGGATGATAATGTCACTAAATTTATTCCTCTTCGATATGCATATGTTTTGGTATCAAAAGGGTTTTTTTGCCATCTGCAAGAGGTTGGAACAAACCTTGAGATGCATCTGAGAGTGGTTAGTCCTTGCTGGTCATTTATTTTTCCGATTTCTCCTATTTATCAATTTTCCACATTATCATTTCTTGTGTTTTCCTTCTTTCATCTTGGGATAATGGGGTGGTTGTTGCTAACTTGTTGGTGCTTTCTTCTGTGCTGATCAGGTTCATGCTGTTGATCGCTTGCTTACATCAACACCATGTTCAGAGGAACTTCTTTGGGTGTTGTGGGAGCTTTGTGGTCTGTCCAGGTAGACTAGGCTCTGCGAACTTTTCACCTTTGTTTCTCTCTCCCCCCTCCCCTCCCAGATAAAATCAGTTATCTGTCATAAAACCATCATTATGTTAGCTTGTTTACAGTAGAAGCATGGCATATCTTTTAATCTTCCTCCCAACTGTATTTTGTCTTCGTGATGTGTGCAAGAGTTCAATTATTATCTGTGTAAGATTTGAACTTATGGAACTGATCAAAGTTTGTGTTTTCAACTTCCTATGCTATTTGACTCAGACAAAAGAGAAAAGAATGATTTCTATTCCTCTCACTGGAAGCCTTCGCACCAACTCAAGCTCCTTGCTTTCCTTCTTCCTGTCTATGCTTGTCCAAACCCACCTTTCATCTGTTTAGTAGGGTTGTCGGAGGCAAGATAAGTTTAGATTAACGCAAGCAATTTAAGAGATATTTTGTTAAAAATTACTAATCTTATGATGTGGCTCATACTTTTTTTTAATAATCTCAAGTGCATTACTATTTAATGACACTCATCTAAATTTCCTGGTCAATATCCCATATTTTGGTGCACATGCAGTTTTTAAAACAAGATTAAGAGTATGGTATGGACAACACATAGTGATCAGAAATCTTCCATTTTAAAAGAACTTCAACATCAATCGTTTTTTTCCTTTGCCTTTTTGACACTTTTCAACTAATCTCACTCAAACTTTGATTAAAGATTGAGTAATCACAAAGATTACTGTTTTCCTTAAAAGAACAATTTACTCACAAAGATCACAGATGGAGAGAGAAAGTCTCATATATTAATAATCATCAAAATTTGCACTTACAAAATGATTCTCTTCTTTAGGTAACCCTTAGGAACTTCTAAACAGCCCTTGGACTATAAATGAATGGAACATCATATTATTTTTTAAAAGACGCACTACGAAATTCTGGGATTAAAGCCGTATTAACCTACTGATGTAAATTTCTTGGACACAACTTTCATCTTCAAGGGGTCTCACCCATGAATAATGCGTGGCTATTCAACGTATTAACCATGCCCTTACGTTCCTTTTCATCCTTGCTTTCCAATTTCCTTTTATAGTCTCATAGCTTCACCCAGTTCCAAGCCTAGTGTTTCATGCAACAATAAAAACAGTCAAAACATAGCTTCACCTCACTCTTACAAATCTCACGATACGGGTGAGTGGGACTGGGTGCCGGTCAAAATCTAAGGAACCGTTACCGGAACCGATCAGAACCGGGTCCCGAATTTTGGAACCGGTCCCGGAACCATTCAAACCGGGTCCAAATTTTTTGGAACCAGAACCGGTTCCGGGTACCCGGTTAGCCTTCACTTTTAATTAATGGATGCTTAATTAATTAATGATTGCTTAGTTAATTAATGCTTGCTTAATTAACAATTAAGTTAAAGATGAATTTTATACTTCAGTTTTTTTTTCTAATAAACTATCAACTTGACAATTTTTAAGTTGTTTTTTTCTAGAATTTTTACCTCAAAATAACAAATAATTATTATATTTTTGAACTATTTGTCTCCGCTGGTACCCGGTACCGGAACCGGAACCGGTACCAACAGCTACCCACTGGGTACCGAGTCCTAGAAATCTAGATCCGGAACCAAAACCGGTTATGACCGGTTCCAAAAATTTGGAACCGGAACCGGTCCAACGAGCTGGTTCCGGCTCCGGACCCGCCGGGTCCTTTGCGGTTCCGAGTCCGGGACCGGGTACCCCTATTACTGCTCACCCCTATTTCACGATAAGGTAGACATGAAGCAACCGTTTCACAACAATTTATCTATACCAAACTCAACAATTAAGGGTCCAGTGCACATGCGAAGCACATCATAATGTTGTGAATATCAAGGATTTGGATATCATTATCACAATCCGCTCCTAAATCTCTCAACATTCCAATGTATGTAAGGTCTTCCTCATTATACCTGACAGCATAAAGAGAAGTGAAAAGAATGGACTACCTCATGAAGTGAAGATAAGTCGGAGGTTGGTCTTACTCGTTATATCTTGGTGGTGGAATGGTGGTAGTGATTAAAGATTGAGTAGTCACAATGAAGGTTTTTCGTTCGAGGCATACTTGGTCTTGCTCGACGACTAAATGGAATAAATACTACCTCCGTTCCACAATACATGCAAAATTTCTTTTTGCACGTATTCCAACACGCTTCTTTGGGCATTAATATCTCTAAATGCGTAAAAGTAAAATTATATTAGTTAATATTTGAAATCCTCGCATTGATACGAATTTAACAAGATCTCACTTGAGTATGTTTTTTTTACATAATTGTGTAAGAAAAATTGTCTAACTTGATTGATGAATAGTGCGCAGAAGAGAAATGATGCATATATTGCGAAACTGAGGTAGTAGAAAGTAAAGTATTGTCAATCAGATCACAAAAGGCATAATGCTTAACCATTTGGCTAGCAAGCTAAATGACTTTGATTAATTGATTTTATTAGACCATTTTGGTTAGATGGTAGTGATTAAAGATGGCTAGGAAGATCCATCATGTCTAGTCCCTAATGGAATTTTTTTATACACCAGCATTATAGTTTTAGGGGTGAGCTTTCTCTGGAACAAATAATTGCTCTTTAACTTGGAATTCCTTTGCATTATGTGGAAAAGCTCAGGCCCTTCTTTGATGATAGGTTCTTACTCCATGCCAGTGTTTAAAAAGCGTAAAGCGGTAGAGCAAAGCGCTTAATTAGGCCTACACGAAGCGTTTTAAAGAAGCGCGCCGAAGCGCACGAAGCGTGCTTTTTTGCGCTTCATTGAAATTCTTTCACTTTTTTTATTTTACGTTGTAAACTTTTTTTTTATTTTTATGTTGTTATTGTGTTTTGGTCCACACCCTATTAATCCTTAACAACAGAAAGTGGGACAAGAGTTTAAAAAAAATGGAAAAAGAAGAAAAGAAGAAGACAACGGTCACAACAGCCGACCCTAGCTTCTTGATAAACACTACTCCCCCCTTCCCTAGAAAAAGATTGCTGCATTGCAATTTTTTGGGTCAAACTTTGGAAACTTTGACCATGAATTCTCATTAGTCTATAAGAAATAATATATTCATGTGGTGTCTTGCTAGATTAGTTTCAATATATATTTTCGGTAGATCAACTTTTTACAATTTTTACGTGTGTAGAAATCATAGATAATGTAAACTCCCTCTAATAGTACCTATGGACATATTAGAGGGACTTTTTTTTTTGAAAAGTTACATGTAAACTCATTATCATTTACATCAATTAATTATCGCCTACCAAGCGGACGTAAGAGATAATGAGGCAAAGGGTGTATTTTTCAGCATGATCTTGAATTCTTGATAGGGACTTAAAGATTCGAGAGTTTCACATTTATCCACTGTATTTTAGTTGTTTTTTTATTTTCACCTTATTTTCCCTTCTCTGAATTTATATGAATAGCAACCGTTATCTGAACTAATATACTTTTGTGGAACTCTTTTGCCTTGTTTCAATTTCATATAATTGTGAATTGAACTTAGTTGCCCCTATTCGAATTTATTGCTCCTTATTTTCCCTTGTTGAAGTTATTGGCCCTTATATGAATTTTCTTGAATCGTTTTTATTTTTTTGGTTACTTGAAAAAGACAAGGCCTACATCTCTCTTTATCTACTGTCAATCATTCTCCGTGGTAGTTTCTGCTTACATGTGCAAGTTATATGCAGATCTGATGGTGGACGTCAAGCATTACTGGCTCTCACACACTTTCCAGAGGTATGGATGTGATGTCTTGTATCAGGAGTATCTGATATTTATAATCATGTTGCTGGACTTGAGTTCCCTCCCATGCTAGCTAGGGTTTCGGATTAAATCCTTACTGTTTTACTGTGTACTTTATGGGTTATTTGCAGAATACTAGGAGTGGGGGCCTTTGATTTAAAAATAAAAATTCCATCTTGAGATGCAAAAATTGCAGGGAACACTTTTAAAGTCCATATATCCTTTTAAATACATTTTGTTAACAATTTATTGTCAAGTATCAGATTTAGTGTGTAAATTGTTGTTTAACCTCGCTTATGACATTCTCTTTTCCCATGTCGCTTCTACTTTGGAAGACAATTATTGTGTACCAAAACATGGGTGGAAGGGGAAAACAAGAAATGGTTAAAAACGGGTTTTCATGTTATAATCTATTATGTTGGAGACAGAGACTGATCAAAACAGTTAAAAGTTCTTTNGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGTGGGCCCATTAAGGATGGTTTGTTACTTATAATCCTTATCATGTTATGCCATGATGTATACATTCAAAACTTAGTTCTTTCATTATTGTATTATATGAAGTATTGGTGTATATGCTGTGGAATCTGATACACTTATTACATGAATAAACTGCCTTTGAGAATTGAGACATTCGGGTTATAGAAGGAAACAATTACGCGTATGAGTATGCCGACAGCTCAGTATTCATAAACATGATAAAAGTACATGATAACTGGTTGAGTGATGTAAAAACAATTCACTCTAAGGATACTTGTTTTGAACTTGCAGGCTTTTACAATATTGATTGAAGCACTGCATTCTGCCAAGGAGTTGGAACCTATTTCTTTGTCTTGCGGTATTCCAATATCAAACTTCTGTAACAAGTGTTACACTTCTACAGTGCTTCTGTAGTGTTTTCTATATTATGATTGACATGTCGTGTTCTTTTGTGGTTTCTAGGCACCCCTTTGAATCTTGCTATCTTTCATGCTGCTGCTGAGATATGTGAGATCATTGTTACAGATCCAACTTCATCTTCAGTCAGTGTCTGGATTGACCATGCAACTGATCTCCATAGAGCTCTGCATTCCTCTTCTCCTGGATCTAATAGAAAAGATGCCCCTAGTAGACTCTTGGAGTGGATAGATGCTGGGGTTGTTTACCATAAAAATGGGGCTGTTGGTCTTCTACGTTATGCCGCTGTTTTAGCCTTTGGGGGTGATGCTCATATAACTTCTACAAACATTTTGGTATCAGACTCAATGGATGTGGAGGATGTAGTTGGAGATACTTCTAATGAATCAGATGTCAATGTTGTTGAAAGTCTGCTTGGGAAACTTGTTTCTGACAAGTCATTTGATGGGGTTGCGCTTCGCGATTCATCTGTGGCTCAGTTAATAACAGCAATTAGGATTTTGGCGCTCATTTCTGAAAATGCGGTGAGATTTGTCAATGCACATATTTTAGGAACATATTGGCTTTATAGCTTGCGTATATCTGGTGTCTGGACCTCCCTACTGGTTTCCATGTAGTGGAACTCATTACATTGGAATGGATCCTGTAAAATGCTCATTAAAGTAGATGTTTTAATTTCTAAATGCTCGACAATGCTTACAATAACTTATATGGTCCATTAAATTCCCTCCCGTTTTCAATTTACACAGGACGTTGCTGCTTCACTGTTTGATGAAGGTGCTATGACAGTTGTCTATATCATTCTCGTGAACACCAGATTGATGCTTGAAAGATCTTCCAATAATTATGGTAATTATATTGTTGTTGGAATTTGTGATTTAAGGCTACATTTGTTTGATGGACCGATTACTGTGGACAGCAGTATGACTATTCTTGCACCAGGAGAGCAATTAAGAAATTGACAAACTTGTTTCTTTTTAGTTCTCTTATTAACTTTCGATCGTTTTTGCACATTTGAGTAATATCCAACTTTATCTAAGAGTCATAGCAAAGAGTAGTATTAATTTATAAATTGTTGTGTATAAGTGAAAACTTTATTTTTCGTGCATCAATGTTTGGCATGGTGATTTGTTGCTTGTTGAATTTTGGGAAGATAATTCATTGGGAGGTTGAGATTAGTGGGTTTGAGTCGGATGTTTATGTGGGTAATTGTATGTTGGCATTGTACTTGTAACTCGGGAAATGGGTAATGCACAGTTAGTGTTTGATGGAAATTAGGTTGAACCGTAAGCACAGGATTTTGAATAGCAGAGTTGTGGTGATGCTGCAGTGGTTGGGCGGCTAACGGTGGTGACGGCAGTGGTGGTGTTCAGTTGATGGAAGAGGAGATGAAAGGAAAGTGATTAAGGTGGTGGATGTAGTAACGGTGTTGGTGGTGACGAGTGATTGAGGAGAGAGGGGATTTGGGAAGTTTAAGGAGAGAGAAAAATATTGAAGTTAAGAGGTGAGAAAATAAATATTTTGTTTTTTTCCTTCTCCAAAAATCATTTTGTTGTTATTTTAATGAAGTTAACTATATTGCAGCGGAGTTGTATCCTAAGGTTCTTAACTATACCGTTTGAAAGTTGAGGTTCTTAACTAAAACTTGTGGCAAATTTTAGGTCCTTGAAGTATGATTTATGCTTTACTCTTGGGTTTTCCCATATGGAGGACCGAATTTTTTTTAATTTCATACATAACCTTGTGTTATTGTTAACACTTCAACTTTCTTCTGTAACATAAATGATCCTTTTCTGTTCATGCACTGTTGGTTTTCTCATATATTGTGATAAAATTAGTTTGCAGAGCCTGGATTCATTAAATGCCCCTTTCTCCTCAGCTGTTAGTTTCCTCCACCCCCATTGGGTAATTCCTTGAGCAATTCTACACCAAGACCACCACCACAGCAACATCTCCCTGACCTATCCAGCAACATCATGCCATAATTCACTCCAAAAGAATTCACTTCATTTCATCGATGATGTTGCTGGCGGCAGAGAATTGGAGAGGTCCTGTGTATTCTCCTTGACCACCCATGTTTCACTAGCATCTAATTGCCTTCACTTGTTCCCCTTGGATTCACCGGCATCCTCTTGTTTCCTTCTTCAAATATGTGCGTGTTTTAGTTTTCACTGCCAAAGATTGCATGTAGTTTTCACCTCTTGGTTCCGTATTTGAACGTTTGGAACTTGATTTAGAAAACACATCCACCTAAATGATTATGAAATATTAGATGTTTCTTTCAGAGGTCATGCAGTCAATAATAGTAATCTATCTACAGTCCTTTTTCTGTTAATGCATATGCTATATTTTCTGGGTTTTAAGAGGTGGTTGGATTCCAGGCCTTAAGTCAAGAATGGATAAGAAGGGTGGTAGGAGGTCCAGGGAGAAGAGTTCTTATGTTTCCTTTTTCTTTGATTTCTCGTGGAGGATGGTTGTGGTTTCTGCAGTTCATGACTAGTAGTGGCTATGAGCTTGTGGTAAGGGGAAGGGGAAGGTTTGGATTGGGGTAGAAACGGGAAAGAGGGGGGGGGGGGGAANNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNAAGGGGGGGGGGGGGGTAATGTTGATTCATACGTAAAGTATTTCTCGATTTTTTTTACAAGAGCTAGCAGTATATTTGACGGTTTTACATTTTATGATTAAGAAATTTTGAACAGGTCCAAGTGGTTATTTTGTGAAATTTATCTTTGTCCCCTATCCACCAAAGAGAATGTGCAAAAACATAGGGTTACCAAGTAGAATTAACCTTTCTTTTCTGGTGGTTGGGGGGAGGGAGAGGGTGAGTAGGGATGAAGAGACACTTGCGTGAATGTAGAGCCAAGATGAGCGCTACATAAATATATGATATCTGGAAAAATTAACTGGAAATGATACGAATAGAAGTTCTAATTGGCTAGAAAGAGTAGAGAGGCACAACAGGGTTTGGCGCTGACTAACAATGCAGAAATCTTCTCTTTTGGGATCGTAAATGTGAATGAATAGGAGTCTTTCCTTTTCTGTCTTTCATGAATTAATAAAGTAACCGAATTATATTAGATTTGGAAAGATTAAATATAGGAAATTTTGCAGTCCGACATTTAACATCTCTTGCAAGGAGATGTCATTTATGCTACTGTAACTTCCATGCAAAGTTTGATGTTTCCTCTTTCAAGTAATAAAATTGTTTGTAGGACCTTATGTAGCTATATTGGGTTTAATTGATTATTGAACTTTGGCTTTCTTAGGTTTTCATGATGTGCACCCTAAAAGGAAATTTACCAATCTTGCTATATTTTAATATATCAAGTTTGATATTTACTTTGAATTGTTCTTAGGACTTTGGCTAGCTATATACTGGTTTTGGTTTTTGAATGTTTTGCTTCCTTCCGTTTCCATGTTATTGACTTGGATTTTTTTTCTGGGTAAACTCTACTTATGAGACAATTTAATTTGTTATCGGAACAGATTATCTTGTTGATGATGGCATAGAGTGCAGTTCTACATCTGATCTATTGCTGGAGCGTAATCGCGAGAAATACCTGGTTGATCTTTTGATTCCTTCGCTTCTGCTGCTTATAGAGCTTCTGCAAAGATTAAAGGTTGTTGCACAATTTTCTGAAATTCATCTTCCACTAAAATTTTGTTACGCATGATTATCACCCTGTGTACAAGTGGAATCTGGAAACCTCATACTGTGATAGTATTTGACTGGATGTAGGTGTTGGTGGCTTCAATATTAGCTCATAGAAAGTTAGTATATGTATAGATCAGGTTTTGAAAAGGCAAAATTGTCAAAAAGTACCTTACTTTTGTACCTTTTTGTGGCCTAGTACCTTAACTGTTTGATTTTCTCAAATATTACCTTGTTTTCAACTTCTTTTAAATATGGCACCTTTAAGACATTTTCCGATGTAAAAGTCAATTTCTGGGGAGTTGATTAACCTATTTCTTCTTTTTTCTTGCGTGTTCCGCACGTACTCCTCCAACACAAATCTCACTCGCTTCTTTTTCTTCTCTCTTCTCTCTATATTTACCAATCCTTCCAGATTATTTTGATCTCACCTCCTGTTCGTCTTTATCATCACCATTATTTCCCGTTCTTTAGCCCCTCTCTTTCTTAACAAATACGAGAACTACAAATTTTTTGGTCTCGATTTCAAGAACTATATATATTTTCTTGGTCATTTACAAATTAAACATCAAAGATTTTTAGTTTTTACTGTTTTTTTTTATAATTTCCTAGAAAAGGGGGGTCGGGGAGGTTAGATGATGCATAAATAATTTTCTAGAGTTGAAGATTTAAAAGTTGATCTAAAAAATAAAATCAAAATATATGCTCCACTTTAAAGTGCAAGTGTAATATGATATGAATAATATAATGATAGAAACCAAATAAATATGCATTAAAAAAATAACTAATACACAAGTCAGTGCAATCATAGAGAACCATCCAAGTCTACAAAACACGGACATGGGATGCGTGATGAATGGGAAAAGAATAAAGGAGAAAGAATGAGCGAAATGTAAAGAGTAGATGCCCTAGCTAGATGAGCATATCATTTTGTAAGAAGAAAAGGGTTTGAGTTCCTTTTGAAAGAAGAAAAAAGAGAGATTGAGGTTCACGCACAAGACACAATAAAAAGAGAAGGAAAATGCTAGTCAACTACCGGAAAATTGAATTTTCTATCAGAAAAAGTATTTTCATGGTGCCATTTTAAAAGAAATTGAAAAGTGAGGTATTATTTGACAAAATCATACAATTAAGGTACTAGTTCACAAAACTAATATAAAGCAAGGTACTTTTTGACAAAATTTCCTTTTGGAGATGGAAGTGAAAGTATGATTGTTGGGGTGTGGATAGAGCTGGCATTGCTAAATGTCATAAAGAAAATGGCATGTGAAATTTTCAATAGATTGATGATTGTCTAGGGTTCATCATGCACACGGATTTGTTCACAGGCTAAATGTTATCCACATATGGTACAAGAGTAAAAGTTAATTCATTTTGCATGCACCAATTTCCATTCCCACAGCCTGAAACTTACATTTTATATTTTGTAAGATGTAGGTCTGCTGAGTGAAAACCGTGTGTGTTTTTGTCATTTTGTAGCACAAGCATTTGCTTGTATTTCAAGCAACTTTCCTTTTACTTCGTTTGAGCATAATTCCATATTATAGCACAGGCATTTCCTTGTATTTTGTGGTAGAAATTCGTTTGAGCATAATTCCATATTATAATATGATGCTGTGATGTTCCAGTTGTGTTTCGATCATTGTTTGATAATAGTTGAATTCTTCGTTTGTCAAGTTAATAATCTTAGTAAAATGTATTAAGTTTAATACATTAAAGTTAATAATGTTAACATGTATTCCCTCCGTTCTTAAATGTTAGTCCTAGTTTTATTTTTCCACCGTTTCAACTCAATATTTAAACGTAAATATCTACAAATATGTATGTTAATTTTTTTTTTAAAATTTGATAATCTGAAACTATTGATTGAGACGAATAAAACAAGATTCCACATGAATATGTTTTCAAATACATATTGGAAGGAATTTAGAAAATTCTCTACAATTGTGAATAGTGTCAAGATTCCAAAGAGGACTAACATTCAAGAACAGAAGGAGTATACTCCCTCCGTCCCGGAATACTCGAACCGGTTTGACCGGCACAGAGTTTAATACAAAGTAATTGACTTATTATTTAATTAGATGGTAGTTGATAGTTGGGTATTTTTTAATATAGATAGAGATATGTGTCAACTTTTTAAAGGGGGTAAGGGGGTAGGGGGGAGGTGTATGGGACCACAAAATGACATGTGAAGGGGCGTAGATGGGTAGTTGATATAATATTTATAAAAGTTTACCATTTATAGAAACGGTACAAGTAATCCGGGACGGCTTTATAACGAAAGCGGTGCAAGTGTTCCTGGACGGAGGAAGTATTTAGTAAGCTGAAATTCCAAATAATTATTTGTTGAAATTTCACAATTTTGTTCCCATTTTCATTTGCTAAATTTGACAACTACTATACGACGGAGTAGTTTTTAGCATGCTTTACTTTAGAAAAAGTATTATCCAATTTTTGAATCCAAATTCTGTATCACAATTATCTCAAGGAATTGGTGTTAGAAATCTAATTATTTTGTGGAAGAAACGGATTATGAACCATCATACTTATGTAGGCATTTACAGTATCATCAACTTGTTTTAATTTCTTTTTGTCTGCCTAATTTTCTCCCCTTTGGCTGCTCTAATACACTACTACCATGGAAAAATCAGGTGTGCGAAAGCTTTATGAAAAATCGTTGGGTCAACCCATTATTATTTATGTATGAGTTAGAGCCCAGGTGTGGATCAAGTAAATCTATGGGCAGTCTTTGTCTAATGTGAAAATATCAGTAGAGTCTAATATCTTTGTCACTTTGACTACTTTGATATAACATTTTAGAGTTCTTATCTTACTTAAATATGTTGATCACCCCTTTGCATTTGTTTATTATAAGGTTTGTGTAATGAATAACATCCCCGTCTTCTGTATCTTTTAATGTTTTCCTTCATGTTTCTGGAATGAGATGGTATGCGTGCATATTTTCGAGTTGCATTCATTTATATCCGTAAATTATCTCTTACGATGTTATCACATTTGTAGGAAGCCAAAGAGCAGCACAGGAATACCAAACTAATGAATGTCCTCTTACGACTGCACCGTGAATTAAGGTAATTGCCGCTTATCCTTCTCCTTTGGATCTCTAGTTACAAAAGTACAATATTTACATGATCAGATGGAATTGACAAAACTGTGCCTCTTGCAGCCCCAAGTTAGCAGCTTGTTCTGTGGATCTATCCTCATCTTATCCTGACTCTGCACTTGGTTTAGAAACAGTTTGCCATCTGATTGTGTCAGCTCTAGCTTGCTGGCCTGCACATGCTTGGGCTCCAGGTCTTTTCCACACACTTCTTGATAGTGTTCAGGCAGCTTCCTTGCTAGCATTGGGTCCGAAGGAAACTTGTAGCATGCTGTATTTATTGGTACGTAAGCTTGTATAGCTGATTGGCTAGTGTAGGTTTGCTACATATTTATCTTAAACAATGTTGGTGGAAAGTCTGAAATTCCCTTTTTAGGTGTATAATGAAGCTTTTCTCTTTTATCTTCAAACATTAGCATCCATTTACTGTTGTTTGATTTGTATGATCTTTATTTTACTGCCACATCCAAAACTGCAGAAGCTGAATCTTATATTTGTTTCATTTTATTTGGTTGATAGATTGATCTGTTTCCTGATGAGGGTGTCTGGCATTGGAAGAATGGTATGCCCTTGCTAACAGCTTCCCGATCGTTGGCTGTTGGGACATTGTTAGGTCCCCTAGAGGAGAGGAAAGTTAACTGGTACCTGGAGCGTCCATGTGTTGAGATTCTTCTTGGTCAATTGAGTCCACATCTTGATAAAATTGCGCAAGTTATACATCATTATGCTGTTGGTGTACGTATTTCTCTGACATCTTTGTTTTCCTTTTTTCTTTGGAGGGGGTTTTGGGAAGGTGAATTATGCAATATGTTCTATTACATACTTTCTGCTTCTTTTTTTTCATTTTAATTTTACCATTGGGTTGGGCACAAATATTGATAGAGATAAAATATTAAAACTAACCATGTTATAACGGACAAAAAAGCGAGAAAAGGTTAGGTAACAACAGTAAATATAAAATGTGCACAAGGGTAAAATGGGTATTAGCTGTGTGAACACACCAAAAAGTGTGTGAAAGCATATCATAAAAGGATATGGGTGTAATTAATATGCAACTTCCAAATAAGGCAAATATCGCAATTAATAAAAAACGGAGGAAGCATCTTTTTTCCTTTTGCTGGGAGAGAGCTTGTCGTCTTTAAGGCCTTCTTTCATGCAAGTGTATATATGTTTCTCCTTATGTCCTTTTCTTAATTGTTTCTCTCTTTCTAGACGCTAGTCGTGGTACAAGATATACTTCGCATACTCATACTTCGTATTGCACGTCAAAGAACAGAATTTGCGACAGTGCTTCTGCAGCCTATGATTTCTTGGATTGAGAACCACTTGATTGGAACTTCACCTCTGGACGAGATAGACGCATACAAGGTGGTTTGAGTTAACTGTTTCCACAGCGTCTTGTTTATGATGTGCTGAATATTTAATTGTGTCTTTATGATTGTGCAGGGTTACAAGTTGCTAGATTTCCTTTCTCTCTTGCTTGAACATCCACTTGCTAAGGTTCTTTCTTCTACCTTCTATTTTCTTACGTCCAATCACCTCTTGCACTCATTTTTTATTGTTGATGTTTTATTTCATCAGACCTTACTACTGAAAGAGGGTACACTCCAGATTCTGACAAAAGTGCTTGGAAGAATTGATGCTCATGTGAAATCTGTTTCAAATGGAAGAAGCTCATTAAACTCTAGATTCAATATCTATAATTGCTGCCTCCCAGTATTGAGGTCCTTGTCTTTAATTTGTGAAACTCGATCTGCTCAGAAGCTACATTTGTTGCTCGACGGGTTAGTGAATGAACAATTGTTCCCAAGCTTATTTAAATATCATTGCTTGTTTTACCATTAACAACTAGCTCCCTATTTTACACTGATGGTTGGTAGAAATTGGTTGGGAAGAGAATGAAAGGAACTGGAAAGAGAAAAGTCATATCACTTTTATGTTCGGCTTTTAATTAAATCATAGAATTTAGGAAGAAAAAAGAAGTGACAGGTAGGAAAAGGTGGTCTATTGAAATATCATTTTACTTTTTTCAGAAACTAGAGAAAGCTAAGGCCCTATTCCTGTTTGGTTTCATTTTAAATTTTTTGTTTTTGTAAAACTGGAAGATGATTTCATTTTGTAAATTGAAGTATGGAGTGCAAAACTACCGCATTAATAAATGAATAAAGTACTATACTACGTTCGTAATACTTGCCCAACATAAATGAAGCTGAGATTAAGGATGGAAAAAAGATAGATCAGAAAGCTATTGGAATCCATTGCAAAGTCTGCAATAAATAATTGAAGGGCTTTGGACTCGTGGCTTGTATACAAGCATTGCCTCTCTGCTTGTAGACATTAATATCAATCTGGATGAAACTTGGTTGATTTTGGTCCTTTTTAAATGTTTTGAATAACAGATAAAAGTGGAGTTGGAAAACAAATTAACCCAGTTGCTGGAAATTGTAAAAGTTTATCGATCGTCACCGCTACCACCTAGCTGCGCAAGCTTCAGTTGTCGGCCATAACTCGATTCACACTCCATTCAATTGTTTCTGCCGGCAAATTGCCACAGTTTTTCCCAGAAATATAAGAGAAAACTAGAGAGAAGGAAGGGCTGAGGTTTAAGAGAGAGAAAGCTGGAGAGAGAGAGAGGCAGTAGGATGGGAAAGAGACGACTTGAGCGAGACTGACAAGGGAAAAAGAGATAATAAAGGGGTTCTGTCTATGTGACGCACACGTTTCTCGAAACCAAGAATCCGAAACAACTGCAGTTTTGTATTTACTGCTGCTGTTTGTTAAAGTAGCTTGTGATTGTTTTTTCTTAAGCAGTACAGCCGAACACATTTTTTTTTTAGGGAAGGAGAAATGGAAAACATAGAGAAATGTTTTATAATTGGCTAACAAAACAGAAGATATCTCTCAGTTTCTACTTATTAAACTGTCCAGTAGCAGAATTATTTATACCCATACGTTCTCGTATTATCCTTTCTGTGTCCTCTCTGATCATTGAGTAAAAGCTTATACGATTAACCAAGTTTTTTTTCCTTTACTTTTTAACTAACATCAACAGGCTCAGTGAGCTGGTAAATGTGTTCCTAATTGTTCAGGAAAACAAACCAGCAGGGACCACCCACGTTCATAATTTTTACGTTTATCATAGAATGAGCTTTAATCTGGTTAACAAAAATGAAGAATTCTTTGCTGGGGTCTAATGTAAAGCTCCCAGATTACGTACACCCCTTTTTTTGAGGAGTTCCCTCTCATTGTCAGATTCTTCTGCAATAAGGAACTTGAAAATTTCTGGTCTATGTCCATTCAGGATGATTGTCAAATATGATTGTAATGCATAACCATTAAATACTCCCTCCGTTTTGTGTTGATAATACCCATTTAGTAAACTGTTGTTTTTAGGAGAGGTGAGTGATTGTAAGAAAAATGTGGATAAAGTAAGAGTCTAAGAGATATGTATTAAAAAAGGTGGTTGAGTGAGAAAAAGGTGGATAAAGTAAGAGATGGTGAGGGGAAAGTAAATATTGTGGGGGTAAGGGGGGTAAGAGGGTAAAATATATGTACCTAAAATAGAATAAAGTAGAAGTGTGTATATCAAATAAAAATACCGCAAATGGGAAGTGGGTGCATGATTAAAAAAACTGGAGTACTTCCTTTTCTATTTGGCTTTTATTTCAGGTTCTTCTATTTTTCTCTATCCTAAGAGACGTGATTATTGTTTAATGTTTCATTTTCTTTATTGCGATATGCACAAGGTGAAGGACTGGAGGACTAAAAAGTTTTTCTGTTTTTTGTAGCAATGAGACTCTGAGTACTGAAGATTCTGCTGTGATTTTTAGCTACATATTCAAACTCTGTCAGGTATAAGCAGTTTATTCATGCACCTTTTTCCTTACTCAAATTTTCAGAGATTCATACTCAGTGTTTGCTGATGCGCGAAGCATGTATTCACATTTACTAATGCATTTATGCAGAGCAAGTATTCATATTAACTAATGCATTTACGCATCGCAAGTATTTTATTTCTGCATAATGGTCCGAGCAGGGGATAGTCAGCTGCTGATGTGGAGGGTGAATAAGTATTTATTTTATCGTCTGCTTCTTCATGTCCCGATTATATTTGACGATTTTTTTGTTTACTGAATACCGCAGCATCTACCAGCTGGAAATGAGTTGGTTGCTTGTTTAGGCACTTTCAAAGATCTAGCTTCTACTGCTATAGGTCAGAGCACATTGCTCGACACTTACCTTTTTATCAGATCCAGTAGTAACAACGAAAACCAACCGGGACGATCACAAGAAAGTTATCAATTTTATAATCGCCTAATTGATGCTGAATGGAAGGGGTGTCCTCTGCTAACGTGCTGGATTAATTTATATGGGTCAGTTGAGACTGAGGGTCTATCCACTTGTTCAGTTGAAGCTATTGGTCTGCTATCAATTGGTGTGTTGAGATTTTGCATGAGTGGCAGAAGGTGAATATCTTTGCTTATTTTTATTTCCAGTCATATATCTCAAACATTTCGAACTGTTGAAATTCAATTATACTGTAAGTAGGCCCTTGATTAACACTGGAATAAGGTACTGTGTAAACCAGACCAACCCCCACCCTCCCCCTATCCCTCTTCCCCTTCCTCTTCTCACTTGTCAAGGACCGGAGTTCTGAAAAGGCATTAGGATTTAATGTTGACACTGCATTAAAAACTTCGAAAATCTTCCTAAGATGATGCATCAGAGGACTAGATAACTAGTCATTAAGAAACCAGGCTTCAAAAGTTTTTTCCACTATGAGTTATAGTAAATTAAAGGAAGAAAGATCAACTATGATTAAATTTCGAAGAACTATCATGGTCTACAATCTCTCCAGATCATAGGCATTTCTGAAAACACTTCTAAGGAACTTTATTTCTGAGAGCTTTATATTTGAGGGATGCGGTAATATTTATTTCTTGGGCTTGTAATACCAGGATTAGTGTTCAAGCTATTGGTTGTTACACCTACCTACTAGCTACTACATTTACACTGAAAAATAAGAAAGCTATGCTTTCTCTCCACCCAGAACACTCGCATTAGACCTATCAATGGACTAACGTGTACCTTGAATGACCACCTGTTTGCATGCTGTATACTCCTTTTGTACGTTGTTGATATTCTCATGCAGGTATCCCAAAAAGGACTTGGTAAAACTTGTGCCTTATGACCTGAGGTAGACTTAAATATTTTTCCTCTTTCCATACACCTGAAATACACTGGAAGCTTGTTGCTAGAAGCACAAAACAAACAGTTTCATCGTGTTCGTGATCGAACCACAAGTGATGCTATTAGTTTCGGTATAGTGCGTGACGCTTTATGTTCCGTAAGCTAAAAAAGTTGTGTCATACTACTGTTTTATGTCCACTTGCTTCTACATTTATTATTTTTCTGCCAAATAGTGATAAGTGGCTACAACTACAAGTGTGGGATTGGAATGCCTTCACCTTCTGGATCCTAAAATCTTTTCTTTGCCCAGTTTCAACGAGGATAGAATTGATGCCTTGAAGTATTTCTTTGGGCTTTCAAGTGATGTTGATGGAGCAAATGGTGTTCTCGAGGAAAACATTAAGTATGTCCAAGATATGTGTACTTTGTTGAGTTCAAAGATAACTGATGATGAATATGCTGTTTCATCACATCTGAAGGCGACCTTGCATCAGGTTTATTACATTTATCTTTAGCTTATGTCCTTTTACAGACTTGCGAATGCTTCTAATCATCTTACAATTATTTTTTTTGTAATACGCAGGCCGTGGAATCTGTGAAGTTCCTTTCTTTACTGCTACTGAAACCTATTGATTCATTTAAGGATGATGATGCTTTCACTGATGAAGTTCTCTTGTTTTCATCTGATGCTAAACTTTCTTCAAAGATACATTTGCTAACAGATAGCAGTTCAGAAAGGGTTGAAAATGAATTGAACTTTGGTGAGTTAGGTGAAAGTTTTCAATGGGAGTGCCCTGAAAATTTACGCAATAGGCTGTCACAGGCTGGTCTCCTGGGCAAGAGGAAGATTTCAGCAGTGGACGGAGTCAACAGACATGGCAAGGGAGAAAATGCTCCAGTTGACGTACGAGGATCAGGGTCATCTATGGCACCTCCTATTCCAACTCGAAGAGATACCTTTCGACAACGCAAGCCGAATACAAGCAGGCCTCCATCTATGCACGTAGATGATTATGTTGCCAGAGAAAGGAATGAGTCTTCAAATCCTAACGTTATTGCAGTTCCACGTTTGGGGTCATCTGGTGGGCGGCCTCCATCAATTCATGTTGATGAGTTCATGGCTAGGCAGAGGGAGCGTCACAATTCTGTTCCTATAGCAGGCGTAGAGGCTTCATCCCAAGGGAAAAAGATTACGCTTGAGATTGATAACGATACAGAAAAGTCCAACCGATCTAAGCAGCTCAAAGTTGATCTTGATGACGATCTTCAAGAAATTATGTTTGATGCTGAGGAAAATGAAGCAGATGATAAGCTGCCTTTTCCTCAAGCAGATGATGACGTACAGCAGACTGCATCTGCTTCAGCTGAGACAAGTCCTCAACATTCAATTGTTGAAGAGACAGAGAGTGAAGCCAAGGACCTGGAAGCAACTGTGGCTTCTAATGCGGTTGAGAATGCACAAAGTGAACTTTCTTCTAGAATGTCAGTATCACGTCCAGAGATGCAACTGAGACGTGAACCCAGTGTTTCTTCAGAAAATAAATATTTTGAGCAACCTGATGAGACGAATAATAAAAATCCAACTATGGTCTCTGGTGGCTTTGATTCTGTTGCAGTAGCAAAGACGTCTGGATTCCCTGCCTCTGTATATGGAAAACATTCTCCATCAGGGGCTCAGGTTCCTGCCGATGCAAGGATACCACCTTCCAGTTTTATTCCTGATAATAATCTTCAAAGAGTTGGGAATGTGCTCCTTGGCAGGTCACAAGGCCGCCATGAGCAGAAACATCTGCCAAATCAACCACCATTACCTCCAACACCACCTCCATCAACAATTTCACCTGTTAGATCACAGGTTTCTGAAAACCAAACTTCAAGATCTCCATTCGTTAATCCTCATAATGATGTGCAGCCGCCTCTGTATCCAACTTATGTAAGAGAAGAAACTGTGTCCCTTCTGCTTGTTCTCTTTTATTGGGGGAGAGAGGAAGGTTGGGACGGGGGAGCATGGGGTTGTAATGTATAAAAAGATTTTTCTATCTCATATTTATGGCTTCTGAAGTTTTTCTTCTTTTGCAGGCACAGACTGATTATCAGTCTTCTTTTGGCAATGTCCCCACATCATTTGCATCTTCCAACTTCATGTCGGATTCACGCTATGCACGAACAACCAACTCTTCTCCCGGGAGCTCTAATAGACCTTTGCCACCACTTCCTCCAACACCACCTTCTTTCTCAGCTACCCCATTGAATGTTCCATCCAGGTCTCCTACATCCCAATCCTCGATGTATGTTCAGCATAACGCCGGAGCAACTGACGCTTCTCAGTATTCATCTCCGCCGCTGATTCCCCCAGGTTTTAGCAGGCTACCTTCAATGCCATTTAATATGCATGGAAATATTCCTGTTCAGCAAAGTGATAATATAACGGGCATAGCTCAGAATAGCCGTGGTCCACAACCCTCAATGCAATCAATGCAACCACTTCCCCACCTGCAACCCTTGCAACCCCCTCAACTCCCTTGCCCTCAGCAACTAAGATTGCCAATACAAAATTCGCTACATACAGAGCAAAGTGGTTCCATGTCACAGAGCCATGTCCAGATGCAGGTTCAGCCACCGCAAATTCTGCAACAATCTCAAACGTCTCCATATCATTTGTATTATCAGACCCAGTCGCAGGAGAACATCCCACATCCACAGCAACCACAAGCAGCACATCACCAGGTAGATGCTGCTACCCAGCAGCAGATGGATTCTTCAATGTCACTGCAACAGTATTTCAGTTCTCCTGAGGCGATTCAGGTGTGTACTGGCTATTTTGTTTGATTCATGCATCCTTTCTATCATTCTCTAATCAATGATTATATCATCACAGTCCTTGTTGAGTGATCGTGATAAGCTTTGCCAACTTCTAGAGCAGCACCCAAAGTTGATGCAGATGCTTCAGGTTTGAGAACCACCTCTATCTTGATCATTGTGGTTTTAATATTTCTTTGCAGTTTTTTAACATATGTTCTCATTATCCCTCTTTTCAACGAAGCAATATTGACACTCCTTTTTTCTTGTGTTTGTTAGTTGTTTAAATGCATGGATGCATATAGTAGTTATAGATTTCTAGGTATGGAGACGATTATCGTAGGTCACACAGACTATTGAACAATAAGTGTAGGCATAACTAATTTGAAGTTAAACCAGATTTTTCGCCTTTTGGAGAACTCTAGAATATGGCACTTTAAAAGTTGGTTGTGTTGTCAAGACTTGTGTTATCACTTGTGTTATTTTCTAACTTTTTACTTTGTTGTGCGACTTTCCTACTTAGAACATAGAGTTGTTGTTTTTTTGTTTCCAATGCTGTTGATTACCATTTTCTCTTATAAACGTGTCTTCTATGGCATGCAGGACCAACTTGGCCATCTCTAAACTTCAACCTGAGGCAACTAGGCAACTGAAGGAATATTTTCTTACACCTGCCCCTTATTATATGCATAACATTTTGTGTGGGCGGCATTTTCTCAACTATCAATGTATCAGGGTTTCACCGCTTTTTGTATCAGTGCTACAATTCGTTTGTACATTTCTTCATTGTAATTTGTGAATAAATTAGTGCTTCTAATGGCCATTCCATATTCATTGTTGACCCAAAACTTCATAGTGTAAGGCTACAAGGTGGGCAAGTTGATGAAAATACTCAAACCTCTGTGTTTGTGACTCAGAGGACTGTGGGACGTGGAAATTAAGGACTGTTATTTTTAAACGAAGGCCAGTATAAAGCAGATACAGTAACAGGTAGATTTGATGTTTTTGGAGTTCGGAATCATTCCCGAATAGTTGAATACGATGAACATAAGCTTTATTGAGTCTCTCATTTAATAATGCACCAGCACCACCCGTATCAGTAGGCACCGGAAGTGTGTGATTAAAATTTGAAATCATTGCCTAAAAATGTTTAGTTACTAATCCTAGGATCAAGTTCTAGATTCCGTATCTATTCAATGTTGCACTAACCCT
mRNA sequence
ATGGAAGATAGCGGAGGTGAGTGTCATCACCTGATCGAAGACTGCAGAAGGCTAATTCGATTGAATGACTGGGAAGTCGACATAATCCATGTGTATAGGGAGGAAAACCGAGCTGCGGACTGGCTTGCTAATAATGGGGTCGATAATGCCAACAATTTGTCCATCTTAGGGTCAACACCTGTAGCACTCGCAAGGATCTTGGAGGAAGATAGAATGGGGATCAACAACCCTCGTTTCATCCCCCTTGAAGGCTTAAATAGGCTAGCAAACTTTCAAGAGAGATTTCCTCCGCTATTGCTCACTGCCTTCGCTGCAAATACTCCGGCAAGGTGGCTCTGCGGCGACGCTTGCTACCACACCGCTCGCCGGCTCTCCGAATATCTCAACATGGGTCGACCCGAACCCTGTGTACTCTTTGCCCAGACCTTCGTTCACCCTCAGTTGGACGAATACGTTGACGAAGTCATGTTTGCTGAACCTATTGTGATATCCGCCTGTGAGTTTTTGGAGCAAAATGCGCCGTCAGCTTCGTCTGCCGTCACTCTTCTCGGGGCTACTTCACCTCCTTCTTTCGCTTTGGAGGTATTTGTTCAGTGTGAGGGGGAAGCGAGATTTAGGCGTCTTTGCCAACCATTTCTATACTCACATTCGTCATCAAACGTTCTTGAAGTTGAGGCTGTGGTGACTAATCATCTAGTCGTGAGGGGCAGTTACCGCAGTCTAAGCTTGGTTGTATACGGAAACACAGCTGAAGATCTTGGGCAGTTTAATGTTGAATTTGACTTGGACAACTCTGTAGCAAAACTTGTTTCTTCTTCTGAGGCAAAGCTTGAGGACCTGCCACCTCTATTACATTCTAGAAATCCTAAATTTGAGGATGCGATATCCTCTCTGAAGGCATTGTCTTCAATGCTCCCAGCGGCAGATATTTCTGTACAGATGCGGCAGTTTTTGCAACTTATATTTAAGATATTGGATATCCCAAACTTTGGTGAATTTAGTCAGAAGGTTATGGCTTTGATTACTTCAGCAGTTGCTTCATTTTTCACAGATGACTTAGCTAGTGCTGGAATCACATTGCAAGAGTTAAATTTGGGAGCATTAGCGAATGTCAAAGGGTCTTTTCATGTTTTCACTGAGGCTCAAAGGAATCTTTTTGAGCTTTGTAAAAGATACGTAGGAGCAGGCCATATATATGGAGATTTTTTGGTAGGGTGCAGTTTTCTCGAGTCTGAAGCCGAGTTAGCCACTTCTAAAGAGTTGGTCGATATGTTGTTCCAGCATTTCCCGTTCAATAGGGATACTGTATTTGTTGGACATCCTCATCTATCAAAGAGTAATAATGTACTTCTTTGGTTGAGTTTAGCTCTGATTCTATGTTCGGGGAAAGAGAGTTGTTTTCATTATGTAAATGGTGGAGGAATGGAGCAGCTTGCACATTTCTTTAGCCAGGAAATGCTAAATAGTGCGACAGTTACTTTGATGCTTCTTGGAGCTATGGAACAGGCAACTCGATACTCTGTTGGGTGTGAAGGTTTGTTAGGTTGGTGGCCTCGTGAAGGTGAAAAGGTCCCATCTGGTGTTAGTGAAGGATATAGCCTACTGTTAAAGTTACTTATGCAAAAACAACGGCATGATATTGCTGCATTGACGACTTATATACTTCATCGGTTGCGGTTCTATGAAGTCGCTTCGAGATATGAGTATGCAGTACTTTCCATATTGGGTGGACTCTCTGGTGATGGGAATGCTACAACCACGACCTTAAGCCTGCTATTGAAAGCCAAGTTACAGCTAAGAAAGCTCTTGAAAATGATTAACTCGTCGGGTTGGGTTGAGGATCCTTCTCCAATGGCCTGTGCAATCCGATCATTAATGGTGGCATCTTCTGAAGGATCATTGTCTTTCAAAGCAACCAGCAAGCTTATTGCTTCGTCAAGTTGTCGTTTTCTGCATCAGAAAATTGATGCAAAGTTGCTCTCCCTTTTGAAGGAGAGAGGGTTTCTTCCCTTATCAGTTGCATTAATATCGTCAACAGAGTTGCGGGCTGAAAGAGGACATGCTCTCGAAGCTTTTTTGGATTTGACGTCTTCTATTGAAGCCACAATTCTTTCACTTATGTTTTGCCGTTCAGGTCTTGTCTTCCTGCTGCAGCATCCTGATCTTTCCTCCACAATCATTCGTGCATTCAAGGGGGATGATAATGTCACTAAATTTATTCCTCTTCGATATGCATATGTTTTGGTATCAAAAGGGTTTTTTTGCCATCTGCAAGAGGTTGGAACAAACCTTGAGATGCATCTGAGAGTGGTTCATGCTGTTGATCGCTTGCTTACATCAACACCATGTTCAGAGGAACTTCTTTGGGTGTTGTGGGAGCTTTGTGGTCTGTCCAGATCTGATGGTGGACGTCAAGCATTACTGGCTCTCACACACTTTCCAGAGGCTTTTACAATATTGATTGAAGCACTGCATTCTGCCAAGGAGTTGGAACCTATTTCTTTGTCTTGCGGCACCCCTTTGAATCTTGCTATCTTTCATGCTGCTGCTGAGATATGTGAGATCATTGTTACAGATCCAACTTCATCTTCAGTCAGTGTCTGGATTGACCATGCAACTGATCTCCATAGAGCTCTGCATTCCTCTTCTCCTGGATCTAATAGAAAAGATGCCCCTAGTAGACTCTTGGAGTGGATAGATGCTGGGGTTGTTTACCATAAAAATGGGGCTGTTGGTCTTCTACGTTATGCCGCTGTTTTAGCCTTTGGGGGTGATGCTCATATAACTTCTACAAACATTTTGGTATCAGACTCAATGGATGTGGAGGATGTAGTTGGAGATACTTCTAATGAATCAGATGTCAATGTTGTTGAAAGTCTGCTTGGGAAACTTGTTTCTGACAAGTCATTTGATGGGGTTGCGCTTCGCGATTCATCTGTGGCTCAGTTAATAACAGCAATTAGGATTTTGGCGCTCATTTCTGAAAATGCGGACGTTGCTGCTTCACTGTTTGATGAAGGTGCTATGACAGTTGTCTATATCATTCTCGTGAACACCAGATTGATGCTTGAAAGATCTTCCAATAATTATGATTATCTTGTTGATGATGGCATAGAGTGCAGTTCTACATCTGATCTATTGCTGGAGCGTAATCGCGAGAAATACCTGGTTGATCTTTTGATTCCTTCGCTTCTGCTGCTTATAGAGCTTCTGCAAAGATTAAAGGAAGCCAAAGAGCAGCACAGGAATACCAAACTAATGAATGTCCTCTTACGACTGCACCGTGAATTAAGCCCCAAGTTAGCAGCTTGTTCTGTGGATCTATCCTCATCTTATCCTGACTCTGCACTTGGTTTAGAAACAGTTTGCCATCTGATTGTGTCAGCTCTAGCTTGCTGGCCTGCACATGCTTGGGCTCCAGGTCTTTTCCACACACTTCTTGATAGTGTTCAGGCAGCTTCCTTGCTAGCATTGGGTCCGAAGGAAACTTGTAGCATGCTGTATTTATTGATTGATCTGTTTCCTGATGAGGGTGTCTGGCATTGGAAGAATGGTATGCCCTTGCTAACAGCTTCCCGATCGTTGGCTGTTGGGACATTGTTAGGTCCCCTAGAGGAGAGGAAAGTTAACTGGTACCTGGAGCGTCCATGTGTTGAGATTCTTCTTGGTCAATTGAGTCCACATCTTGATAAAATTGCGCAAGTTATACATCATTATGCTGTTGGTACGCTAGTCGTGGTACAAGATATACTTCGCATACTCATACTTCGTATTGCACGTCAAAGAACAGAATTTGCGACAGTGCTTCTGCAGCCTATGATTTCTTGGATTGAGAACCACTTGATTGGAACTTCACCTCTGGACGAGATAGACGCATACAAGGGTTACAAGTTGCTAGATTTCCTTTCTCTCTTGCTTGAACATCCACTTGCTAAGACCTTACTACTGAAAGAGGGTACACTCCAGATTCTGACAAAAGTGCTTGGAAGAATTGATGCTCATGTGAAATCTGTTTCAAATGGAAGAAGCTCATTAAACTCTAGATTCAATATCTATAATTGCTGCCTCCCAGTATTGAGGTCCTTGTCTTTAATTTGTGAAACTCGATCTGCTCAGAAGCTACATTTGTTGCTCGACGGCAATGAGACTCTGAGTACTGAAGATTCTGCTGTGATTTTTAGCTACATATTCAAACTCTGTCAGCATCTACCAGCTGGAAATGAGTTGGTTGCTTGTTTAGGCACTTTCAAAGATCTAGCTTCTACTGCTATAGGTCAGAGCACATTGCTCGACACTTACCTTTTTATCAGATCCAGTAGTAACAACGAAAACCAACCGGGACGATCACAAGAAAGTTATCAATTTTATAATCGCCTAATTGATGCTGAATGGAAGGGGTGTCCTCTGCTAACGTGCTGGATTAATTTATATGGGTCAGTTGAGACTGAGGGTCTATCCACTTGTTCAGTTGAAGCTATTGGTCTGCTATCAATTGGTGTGTTGAGATTTTGCATGAGTGGCAGAAGTTTCAACGAGGATAGAATTGATGCCTTGAAGTATTTCTTTGGGCTTTCAAGTGATGTTGATGGAGCAAATGGTGTTCTCGAGGAAAACATTAAGTATGTCCAAGATATGTGTACTTTGTTGAGTTCAAAGATAACTGATGATGAATATGCTGTTTCATCACATCTGAAGGCGACCTTGCATCAGGCCGTGGAATCTGTGAAGTTCCTTTCTTTACTGCTACTGAAACCTATTGATTCATTTAAGGATGATGATGCTTTCACTGATGAAGTTCTCTTGTTTTCATCTGATGCTAAACTTTCTTCAAAGATACATTTGCTAACAGATAGCAGTTCAGAAAGGGTTGAAAATGAATTGAACTTTGGTGAGTTAGGTGAAAGTTTTCAATGGGAGTGCCCTGAAAATTTACGCAATAGGCTGTCACAGGCTGGTCTCCTGGGCAAGAGGAAGATTTCAGCAGTGGACGGAGTCAACAGACATGGCAAGGGAGAAAATGCTCCAGTTGACGTACGAGGATCAGGGTCATCTATGGCACCTCCTATTCCAACTCGAAGAGATACCTTTCGACAACGCAAGCCGAATACAAGCAGGCCTCCATCTATGCACGTAGATGATTATGTTGCCAGAGAAAGGAATGAGTCTTCAAATCCTAACGTTATTGCAGTTCCACGTTTGGGGTCATCTGGTGGGCGGCCTCCATCAATTCATGTTGATGAGTTCATGGCTAGGCAGAGGGAGCGTCACAATTCTGTTCCTATAGCAGGCGTAGAGGCTTCATCCCAAGGGAAAAAGATTACGCTTGAGATTGATAACGATACAGAAAAGTCCAACCGATCTAAGCAGCTCAAAGTTGATCTTGATGACGATCTTCAAGAAATTATGTTTGATGCTGAGGAAAATGAAGCAGATGATAAGCTGCCTTTTCCTCAAGCAGATGATGACGTACAGCAGACTGCATCTGCTTCAGCTGAGACAAGTCCTCAACATTCAATTGTTGAAGAGACAGAGAGTGAAGCCAAGGACCTGGAAGCAACTGTGGCTTCTAATGCGGTTGAGAATGCACAAAGTGAACTTTCTTCTAGAATGTCAGTATCACGTCCAGAGATGCAACTGAGACGTGAACCCAGTGTTTCTTCAGAAAATAAATATTTTGAGCAACCTGATGAGACGAATAATAAAAATCCAACTATGGTCTCTGGTGGCTTTGATTCTGTTGCAGTAGCAAAGACGTCTGGATTCCCTGCCTCTGTATATGGAAAACATTCTCCATCAGGGGCTCAGGTTCCTGCCGATGCAAGGATACCACCTTCCAGTTTTATTCCTGATAATAATCTTCAAAGAGTTGGGAATGTGCTCCTTGGCAGGTCACAAGGCCGCCATGAGCAGAAACATCTGCCAAATCAACCACCATTACCTCCAACACCACCTCCATCAACAATTTCACCTGTTAGATCACAGGTTTCTGAAAACCAAACTTCAAGATCTCCATTCGTTAATCCTCATAATGATGTGCAGCCGCCTCTGTATCCAACTTATGCACAGACTGATTATCAGTCTTCTTTTGGCAATGTCCCCACATCATTTGCATCTTCCAACTTCATGTCGGATTCACGCTATGCACGAACAACCAACTCTTCTCCCGGGAGCTCTAATAGACCTTTGCCACCACTTCCTCCAACACCACCTTCTTTCTCAGCTACCCCATTGAATGTTCCATCCAGGTCTCCTACATCCCAATCCTCGATGTATGTTCAGCATAACGCCGGAGCAACTGACGCTTCTCAGTATTCATCTCCGCCGCTGATTCCCCCAGGTTTTAGCAGGCTACCTTCAATGCCATTTAATATGCATGGAAATATTCCTGTTCAGCAAAGTGATAATATAACGGGCATAGCTCAGAATAGCCGTGGTCCACAACCCTCAATGCAATCAATGCAACCACTTCCCCACCTGCAACCCTTGCAACCCCCTCAACTCCCTTGCCCTCAGCAACTAAGATTGCCAATACAAAATTCGCTACATACAGAGCAAAGTGGTTCCATGTCACAGAGCCATGTCCAGATGCAGGTTCAGCCACCGCAAATTCTGCAACAATCTCAAACGTCTCCATATCATTTGTATTATCAGACCCAGTCGCAGGAGAACATCCCACATCCACAGCAACCACAAGCAGCACATCACCAGGTAGATGCTGCTACCCAGCAGCAGATGGATTCTTCAATGTCACTGCAACAGTATTTCAGTTCTCCTGAGGCGATTCAGTCCTTGTTGAGTGATCGTGATAAGCTTTGCCAACTTCTAGAGCAGCACCCAAAGTTGATGCAGATGCTTCAGGACCAACTTGGCCATCTCTAAACTTCAACCTGAGGCAACTAGGCAACTGAAGGAATATTTTCTTACACCTGCCCCTTATTATATGCATAACATTTTGTGTGGGCGGCATTTTCTCAACTATCAATGTATCAGGGTTTCACCGCTTTTTGTATCAGTGCTACAATTCGTTTGTACATTTCTTCATTGTAATTTGTGAATAAATTAGTGCTTCTAATGGCCATTCCATATTCATTGTTGACCCAAAACTTCATAGTGTAAGGCTACAAGGTGGGCAAGTTGATGAAAATACTCAAACCTCTGTGTTTGTGACTCAGAGGACTGTGGGACGTGGAAATTAAGGACTGTTATTTTTAAACGAAGGCCAGTATAAAGCAGATACAGTAACAGGTAGATTTGATGTTTTTGGAGTTCGGAATCATTCCCGAATAGTTGAATACGATGAACATAAGCTTTATTGAGTCTCTCATTTAATAATGCACCAGCACCACCCGTATCAGTAGGCACCGGAAGTGTGTGATTAAAATTTGAAATCATTGCCTAAAAATGTTTAGTTACTAATCCTAGGATCAAGTTCTAGATTCCGTATCTATTCAATGTTGCACTAACCCT
Coding sequence (CDS)
ATGGAAGATAGCGGAGGTGAGTGTCATCACCTGATCGAAGACTGCAGAAGGCTAATTCGATTGAATGACTGGGAAGTCGACATAATCCATGTGTATAGGGAGGAAAACCGAGCTGCGGACTGGCTTGCTAATAATGGGGTCGATAATGCCAACAATTTGTCCATCTTAGGGTCAACACCTGTAGCACTCGCAAGGATCTTGGAGGAAGATAGAATGGGGATCAACAACCCTCGTTTCATCCCCCTTGAAGGCTTAAATAGGCTAGCAAACTTTCAAGAGAGATTTCCTCCGCTATTGCTCACTGCCTTCGCTGCAAATACTCCGGCAAGGTGGCTCTGCGGCGACGCTTGCTACCACACCGCTCGCCGGCTCTCCGAATATCTCAACATGGGTCGACCCGAACCCTGTGTACTCTTTGCCCAGACCTTCGTTCACCCTCAGTTGGACGAATACGTTGACGAAGTCATGTTTGCTGAACCTATTGTGATATCCGCCTGTGAGTTTTTGGAGCAAAATGCGCCGTCAGCTTCGTCTGCCGTCACTCTTCTCGGGGCTACTTCACCTCCTTCTTTCGCTTTGGAGGTATTTGTTCAGTGTGAGGGGGAAGCGAGATTTAGGCGTCTTTGCCAACCATTTCTATACTCACATTCGTCATCAAACGTTCTTGAAGTTGAGGCTGTGGTGACTAATCATCTAGTCGTGAGGGGCAGTTACCGCAGTCTAAGCTTGGTTGTATACGGAAACACAGCTGAAGATCTTGGGCAGTTTAATGTTGAATTTGACTTGGACAACTCTGTAGCAAAACTTGTTTCTTCTTCTGAGGCAAAGCTTGAGGACCTGCCACCTCTATTACATTCTAGAAATCCTAAATTTGAGGATGCGATATCCTCTCTGAAGGCATTGTCTTCAATGCTCCCAGCGGCAGATATTTCTGTACAGATGCGGCAGTTTTTGCAACTTATATTTAAGATATTGGATATCCCAAACTTTGGTGAATTTAGTCAGAAGGTTATGGCTTTGATTACTTCAGCAGTTGCTTCATTTTTCACAGATGACTTAGCTAGTGCTGGAATCACATTGCAAGAGTTAAATTTGGGAGCATTAGCGAATGTCAAAGGGTCTTTTCATGTTTTCACTGAGGCTCAAAGGAATCTTTTTGAGCTTTGTAAAAGATACGTAGGAGCAGGCCATATATATGGAGATTTTTTGGTAGGGTGCAGTTTTCTCGAGTCTGAAGCCGAGTTAGCCACTTCTAAAGAGTTGGTCGATATGTTGTTCCAGCATTTCCCGTTCAATAGGGATACTGTATTTGTTGGACATCCTCATCTATCAAAGAGTAATAATGTACTTCTTTGGTTGAGTTTAGCTCTGATTCTATGTTCGGGGAAAGAGAGTTGTTTTCATTATGTAAATGGTGGAGGAATGGAGCAGCTTGCACATTTCTTTAGCCAGGAAATGCTAAATAGTGCGACAGTTACTTTGATGCTTCTTGGAGCTATGGAACAGGCAACTCGATACTCTGTTGGGTGTGAAGGTTTGTTAGGTTGGTGGCCTCGTGAAGGTGAAAAGGTCCCATCTGGTGTTAGTGAAGGATATAGCCTACTGTTAAAGTTACTTATGCAAAAACAACGGCATGATATTGCTGCATTGACGACTTATATACTTCATCGGTTGCGGTTCTATGAAGTCGCTTCGAGATATGAGTATGCAGTACTTTCCATATTGGGTGGACTCTCTGGTGATGGGAATGCTACAACCACGACCTTAAGCCTGCTATTGAAAGCCAAGTTACAGCTAAGAAAGCTCTTGAAAATGATTAACTCGTCGGGTTGGGTTGAGGATCCTTCTCCAATGGCCTGTGCAATCCGATCATTAATGGTGGCATCTTCTGAAGGATCATTGTCTTTCAAAGCAACCAGCAAGCTTATTGCTTCGTCAAGTTGTCGTTTTCTGCATCAGAAAATTGATGCAAAGTTGCTCTCCCTTTTGAAGGAGAGAGGGTTTCTTCCCTTATCAGTTGCATTAATATCGTCAACAGAGTTGCGGGCTGAAAGAGGACATGCTCTCGAAGCTTTTTTGGATTTGACGTCTTCTATTGAAGCCACAATTCTTTCACTTATGTTTTGCCGTTCAGGTCTTGTCTTCCTGCTGCAGCATCCTGATCTTTCCTCCACAATCATTCGTGCATTCAAGGGGGATGATAATGTCACTAAATTTATTCCTCTTCGATATGCATATGTTTTGGTATCAAAAGGGTTTTTTTGCCATCTGCAAGAGGTTGGAACAAACCTTGAGATGCATCTGAGAGTGGTTCATGCTGTTGATCGCTTGCTTACATCAACACCATGTTCAGAGGAACTTCTTTGGGTGTTGTGGGAGCTTTGTGGTCTGTCCAGATCTGATGGTGGACGTCAAGCATTACTGGCTCTCACACACTTTCCAGAGGCTTTTACAATATTGATTGAAGCACTGCATTCTGCCAAGGAGTTGGAACCTATTTCTTTGTCTTGCGGCACCCCTTTGAATCTTGCTATCTTTCATGCTGCTGCTGAGATATGTGAGATCATTGTTACAGATCCAACTTCATCTTCAGTCAGTGTCTGGATTGACCATGCAACTGATCTCCATAGAGCTCTGCATTCCTCTTCTCCTGGATCTAATAGAAAAGATGCCCCTAGTAGACTCTTGGAGTGGATAGATGCTGGGGTTGTTTACCATAAAAATGGGGCTGTTGGTCTTCTACGTTATGCCGCTGTTTTAGCCTTTGGGGGTGATGCTCATATAACTTCTACAAACATTTTGGTATCAGACTCAATGGATGTGGAGGATGTAGTTGGAGATACTTCTAATGAATCAGATGTCAATGTTGTTGAAAGTCTGCTTGGGAAACTTGTTTCTGACAAGTCATTTGATGGGGTTGCGCTTCGCGATTCATCTGTGGCTCAGTTAATAACAGCAATTAGGATTTTGGCGCTCATTTCTGAAAATGCGGACGTTGCTGCTTCACTGTTTGATGAAGGTGCTATGACAGTTGTCTATATCATTCTCGTGAACACCAGATTGATGCTTGAAAGATCTTCCAATAATTATGATTATCTTGTTGATGATGGCATAGAGTGCAGTTCTACATCTGATCTATTGCTGGAGCGTAATCGCGAGAAATACCTGGTTGATCTTTTGATTCCTTCGCTTCTGCTGCTTATAGAGCTTCTGCAAAGATTAAAGGAAGCCAAAGAGCAGCACAGGAATACCAAACTAATGAATGTCCTCTTACGACTGCACCGTGAATTAAGCCCCAAGTTAGCAGCTTGTTCTGTGGATCTATCCTCATCTTATCCTGACTCTGCACTTGGTTTAGAAACAGTTTGCCATCTGATTGTGTCAGCTCTAGCTTGCTGGCCTGCACATGCTTGGGCTCCAGGTCTTTTCCACACACTTCTTGATAGTGTTCAGGCAGCTTCCTTGCTAGCATTGGGTCCGAAGGAAACTTGTAGCATGCTGTATTTATTGATTGATCTGTTTCCTGATGAGGGTGTCTGGCATTGGAAGAATGGTATGCCCTTGCTAACAGCTTCCCGATCGTTGGCTGTTGGGACATTGTTAGGTCCCCTAGAGGAGAGGAAAGTTAACTGGTACCTGGAGCGTCCATGTGTTGAGATTCTTCTTGGTCAATTGAGTCCACATCTTGATAAAATTGCGCAAGTTATACATCATTATGCTGTTGGTACGCTAGTCGTGGTACAAGATATACTTCGCATACTCATACTTCGTATTGCACGTCAAAGAACAGAATTTGCGACAGTGCTTCTGCAGCCTATGATTTCTTGGATTGAGAACCACTTGATTGGAACTTCACCTCTGGACGAGATAGACGCATACAAGGGTTACAAGTTGCTAGATTTCCTTTCTCTCTTGCTTGAACATCCACTTGCTAAGACCTTACTACTGAAAGAGGGTACACTCCAGATTCTGACAAAAGTGCTTGGAAGAATTGATGCTCATGTGAAATCTGTTTCAAATGGAAGAAGCTCATTAAACTCTAGATTCAATATCTATAATTGCTGCCTCCCAGTATTGAGGTCCTTGTCTTTAATTTGTGAAACTCGATCTGCTCAGAAGCTACATTTGTTGCTCGACGGCAATGAGACTCTGAGTACTGAAGATTCTGCTGTGATTTTTAGCTACATATTCAAACTCTGTCAGCATCTACCAGCTGGAAATGAGTTGGTTGCTTGTTTAGGCACTTTCAAAGATCTAGCTTCTACTGCTATAGGTCAGAGCACATTGCTCGACACTTACCTTTTTATCAGATCCAGTAGTAACAACGAAAACCAACCGGGACGATCACAAGAAAGTTATCAATTTTATAATCGCCTAATTGATGCTGAATGGAAGGGGTGTCCTCTGCTAACGTGCTGGATTAATTTATATGGGTCAGTTGAGACTGAGGGTCTATCCACTTGTTCAGTTGAAGCTATTGGTCTGCTATCAATTGGTGTGTTGAGATTTTGCATGAGTGGCAGAAGTTTCAACGAGGATAGAATTGATGCCTTGAAGTATTTCTTTGGGCTTTCAAGTGATGTTGATGGAGCAAATGGTGTTCTCGAGGAAAACATTAAGTATGTCCAAGATATGTGTACTTTGTTGAGTTCAAAGATAACTGATGATGAATATGCTGTTTCATCACATCTGAAGGCGACCTTGCATCAGGCCGTGGAATCTGTGAAGTTCCTTTCTTTACTGCTACTGAAACCTATTGATTCATTTAAGGATGATGATGCTTTCACTGATGAAGTTCTCTTGTTTTCATCTGATGCTAAACTTTCTTCAAAGATACATTTGCTAACAGATAGCAGTTCAGAAAGGGTTGAAAATGAATTGAACTTTGGTGAGTTAGGTGAAAGTTTTCAATGGGAGTGCCCTGAAAATTTACGCAATAGGCTGTCACAGGCTGGTCTCCTGGGCAAGAGGAAGATTTCAGCAGTGGACGGAGTCAACAGACATGGCAAGGGAGAAAATGCTCCAGTTGACGTACGAGGATCAGGGTCATCTATGGCACCTCCTATTCCAACTCGAAGAGATACCTTTCGACAACGCAAGCCGAATACAAGCAGGCCTCCATCTATGCACGTAGATGATTATGTTGCCAGAGAAAGGAATGAGTCTTCAAATCCTAACGTTATTGCAGTTCCACGTTTGGGGTCATCTGGTGGGCGGCCTCCATCAATTCATGTTGATGAGTTCATGGCTAGGCAGAGGGAGCGTCACAATTCTGTTCCTATAGCAGGCGTAGAGGCTTCATCCCAAGGGAAAAAGATTACGCTTGAGATTGATAACGATACAGAAAAGTCCAACCGATCTAAGCAGCTCAAAGTTGATCTTGATGACGATCTTCAAGAAATTATGTTTGATGCTGAGGAAAATGAAGCAGATGATAAGCTGCCTTTTCCTCAAGCAGATGATGACGTACAGCAGACTGCATCTGCTTCAGCTGAGACAAGTCCTCAACATTCAATTGTTGAAGAGACAGAGAGTGAAGCCAAGGACCTGGAAGCAACTGTGGCTTCTAATGCGGTTGAGAATGCACAAAGTGAACTTTCTTCTAGAATGTCAGTATCACGTCCAGAGATGCAACTGAGACGTGAACCCAGTGTTTCTTCAGAAAATAAATATTTTGAGCAACCTGATGAGACGAATAATAAAAATCCAACTATGGTCTCTGGTGGCTTTGATTCTGTTGCAGTAGCAAAGACGTCTGGATTCCCTGCCTCTGTATATGGAAAACATTCTCCATCAGGGGCTCAGGTTCCTGCCGATGCAAGGATACCACCTTCCAGTTTTATTCCTGATAATAATCTTCAAAGAGTTGGGAATGTGCTCCTTGGCAGGTCACAAGGCCGCCATGAGCAGAAACATCTGCCAAATCAACCACCATTACCTCCAACACCACCTCCATCAACAATTTCACCTGTTAGATCACAGGTTTCTGAAAACCAAACTTCAAGATCTCCATTCGTTAATCCTCATAATGATGTGCAGCCGCCTCTGTATCCAACTTATGCACAGACTGATTATCAGTCTTCTTTTGGCAATGTCCCCACATCATTTGCATCTTCCAACTTCATGTCGGATTCACGCTATGCACGAACAACCAACTCTTCTCCCGGGAGCTCTAATAGACCTTTGCCACCACTTCCTCCAACACCACCTTCTTTCTCAGCTACCCCATTGAATGTTCCATCCAGGTCTCCTACATCCCAATCCTCGATGTATGTTCAGCATAACGCCGGAGCAACTGACGCTTCTCAGTATTCATCTCCGCCGCTGATTCCCCCAGGTTTTAGCAGGCTACCTTCAATGCCATTTAATATGCATGGAAATATTCCTGTTCAGCAAAGTGATAATATAACGGGCATAGCTCAGAATAGCCGTGGTCCACAACCCTCAATGCAATCAATGCAACCACTTCCCCACCTGCAACCCTTGCAACCCCCTCAACTCCCTTGCCCTCAGCAACTAAGATTGCCAATACAAAATTCGCTACATACAGAGCAAAGTGGTTCCATGTCACAGAGCCATGTCCAGATGCAGGTTCAGCCACCGCAAATTCTGCAACAATCTCAAACGTCTCCATATCATTTGTATTATCAGACCCAGTCGCAGGAGAACATCCCACATCCACAGCAACCACAAGCAGCACATCACCAGGTAGATGCTGCTACCCAGCAGCAGATGGATTCTTCAATGTCACTGCAACAGTATTTCAGTTCTCCTGAGGCGATTCAGTCCTTGTTGAGTGATCGTGATAAGCTTTGCCAACTTCTAGAGCAGCACCCAAAGTTGATGCAGATGCTTCAGGACCAACTTGGCCATCTCTAA
Protein sequence
MEDSGGECHHLIEDCRRLIRLNDWEVDIIHVYREENRAADWLANNGVDNANNLSILGSTPVALARILEEDRMGINNPRFIPLEGLNRLANFQERFPPLLLTAFAANTPARWLCGDACYHTARRLSEYLNMGRPEPCVLFAQTFVHPQLDEYVDEVMFAEPIVISACEFLEQNAPSASSAVTLLGATSPPSFALEVFVQCEGEARFRRLCQPFLYSHSSSNVLEVEAVVTNHLVVRGSYRSLSLVVYGNTAEDLGQFNVEFDLDNSVAKLVSSSEAKLEDLPPLLHSRNPKFEDAISSLKALSSMLPAADISVQMRQFLQLIFKILDIPNFGEFSQKVMALITSAVASFFTDDLASAGITLQELNLGALANVKGSFHVFTEAQRNLFELCKRYVGAGHIYGDFLVGCSFLESEAELATSKELVDMLFQHFPFNRDTVFVGHPHLSKSNNVLLWLSLALILCSGKESCFHYVNGGGMEQLAHFFSQEMLNSATVTLMLLGAMEQATRYSVGCEGLLGWWPREGEKVPSGVSEGYSLLLKLLMQKQRHDIAALTTYILHRLRFYEVASRYEYAVLSILGGLSGDGNATTTTLSLLLKAKLQLRKLLKMINSSGWVEDPSPMACAIRSLMVASSEGSLSFKATSKLIASSSCRFLHQKIDAKLLSLLKERGFLPLSVALISSTELRAERGHALEAFLDLTSSIEATILSLMFCRSGLVFLLQHPDLSSTIIRAFKGDDNVTKFIPLRYAYVLVSKGFFCHLQEVGTNLEMHLRVVHAVDRLLTSTPCSEELLWVLWELCGLSRSDGGRQALLALTHFPEAFTILIEALHSAKELEPISLSCGTPLNLAIFHAAAEICEIIVTDPTSSSVSVWIDHATDLHRALHSSSPGSNRKDAPSRLLEWIDAGVVYHKNGAVGLLRYAAVLAFGGDAHITSTNILVSDSMDVEDVVGDTSNESDVNVVESLLGKLVSDKSFDGVALRDSSVAQLITAIRILALISENADVAASLFDEGAMTVVYIILVNTRLMLERSSNNYDYLVDDGIECSSTSDLLLERNREKYLVDLLIPSLLLLIELLQRLKEAKEQHRNTKLMNVLLRLHRELSPKLAACSVDLSSSYPDSALGLETVCHLIVSALACWPAHAWAPGLFHTLLDSVQAASLLALGPKETCSMLYLLIDLFPDEGVWHWKNGMPLLTASRSLAVGTLLGPLEERKVNWYLERPCVEILLGQLSPHLDKIAQVIHHYAVGTLVVVQDILRILILRIARQRTEFATVLLQPMISWIENHLIGTSPLDEIDAYKGYKLLDFLSLLLEHPLAKTLLLKEGTLQILTKVLGRIDAHVKSVSNGRSSLNSRFNIYNCCLPVLRSLSLICETRSAQKLHLLLDGNETLSTEDSAVIFSYIFKLCQHLPAGNELVACLGTFKDLASTAIGQSTLLDTYLFIRSSSNNENQPGRSQESYQFYNRLIDAEWKGCPLLTCWINLYGSVETEGLSTCSVEAIGLLSIGVLRFCMSGRSFNEDRIDALKYFFGLSSDVDGANGVLEENIKYVQDMCTLLSSKITDDEYAVSSHLKATLHQAVESVKFLSLLLLKPIDSFKDDDAFTDEVLLFSSDAKLSSKIHLLTDSSSERVENELNFGELGESFQWECPENLRNRLSQAGLLGKRKISAVDGVNRHGKGENAPVDVRGSGSSMAPPIPTRRDTFRQRKPNTSRPPSMHVDDYVARERNESSNPNVIAVPRLGSSGGRPPSIHVDEFMARQRERHNSVPIAGVEASSQGKKITLEIDNDTEKSNRSKQLKVDLDDDLQEIMFDAEENEADDKLPFPQADDDVQQTASASAETSPQHSIVEETESEAKDLEATVASNAVENAQSELSSRMSVSRPEMQLRREPSVSSENKYFEQPDETNNKNPTMVSGGFDSVAVAKTSGFPASVYGKHSPSGAQVPADARIPPSSFIPDNNLQRVGNVLLGRSQGRHEQKHLPNQPPLPPTPPPSTISPVRSQVSENQTSRSPFVNPHNDVQPPLYPTYAQTDYQSSFGNVPTSFASSNFMSDSRYARTTNSSPGSSNRPLPPLPPTPPSFSATPLNVPSRSPTSQSSMYVQHNAGATDASQYSSPPLIPPGFSRLPSMPFNMHGNIPVQQSDNITGIAQNSRGPQPSMQSMQPLPHLQPLQPPQLPCPQQLRLPIQNSLHTEQSGSMSQSHVQMQVQPPQILQQSQTSPYHLYYQTQSQENIPHPQQPQAAHHQVDAATQQQMDSSMSLQQYFSSPEAIQSLLSDRDKLCQLLEQHPKLMQMLQDQLGHL
Homology
BLAST of Spo15904.1 vs. NCBI nr
Match:
gi|902195668|gb|KNA12921.1| (hypothetical protein SOVF_121890, partial [Spinacia oleracea])
HSP 1 Score: 4260.7 bits (11049), Expect = 0.000e+0
Identity = 2202/2203 (99.95%), Postives = 2203/2203 (100.00%), Query Frame = 1
Query: 90 NFQERFPPLLLTAFAANTPARWLCGDACYHTARRLSEYLNMGRPEPCVLFAQTFVHPQLD 149
+FQERFPPLLLTAFAANTPARWLCGDACYHTARRLSEYLNMGRPEPCVLFAQTFVHPQLD
Sbjct: 40 SFQERFPPLLLTAFAANTPARWLCGDACYHTARRLSEYLNMGRPEPCVLFAQTFVHPQLD 99
Query: 150 EYVDEVMFAEPIVISACEFLEQNAPSASSAVTLLGATSPPSFALEVFVQCEGEARFRRLC 209
EYVDEVMFAEPIVISACEFLEQNAPSASSAVTLLGATSPPSFALEVFVQCEGEARFRRLC
Sbjct: 100 EYVDEVMFAEPIVISACEFLEQNAPSASSAVTLLGATSPPSFALEVFVQCEGEARFRRLC 159
Query: 210 QPFLYSHSSSNVLEVEAVVTNHLVVRGSYRSLSLVVYGNTAEDLGQFNVEFDLDNSVAKL 269
QPFLYSHSSSNVLEVEAVVTNHLVVRGSYRSLSLVVYGNTAEDLGQFNVEFDLDNSVAKL
Sbjct: 160 QPFLYSHSSSNVLEVEAVVTNHLVVRGSYRSLSLVVYGNTAEDLGQFNVEFDLDNSVAKL 219
Query: 270 VSSSEAKLEDLPPLLHSRNPKFEDAISSLKALSSMLPAADISVQMRQFLQLIFKILDIPN 329
VSSSEAKLEDLPPLLHSRNPKFEDAISSLKALSSMLPAADISVQMRQFLQLIFKILDIPN
Sbjct: 220 VSSSEAKLEDLPPLLHSRNPKFEDAISSLKALSSMLPAADISVQMRQFLQLIFKILDIPN 279
Query: 330 FGEFSQKVMALITSAVASFFTDDLASAGITLQELNLGALANVKGSFHVFTEAQRNLFELC 389
FGEFSQKVMALITSAVASFFTDDLASAGITLQELNLGALANVKGSFHVFTEAQRNLFELC
Sbjct: 280 FGEFSQKVMALITSAVASFFTDDLASAGITLQELNLGALANVKGSFHVFTEAQRNLFELC 339
Query: 390 KRYVGAGHIYGDFLVGCSFLESEAELATSKELVDMLFQHFPFNRDTVFVGHPHLSKSNNV 449
KRYVGAGHIYGDFLVGCSFLESEAELATSKELVDMLFQHFPFNRDTVFVGHPHLSKSNNV
Sbjct: 340 KRYVGAGHIYGDFLVGCSFLESEAELATSKELVDMLFQHFPFNRDTVFVGHPHLSKSNNV 399
Query: 450 LLWLSLALILCSGKESCFHYVNGGGMEQLAHFFSQEMLNSATVTLMLLGAMEQATRYSVG 509
LLWLSLALILCSGKESCFHYVNGGGMEQLAHFFSQEMLNSATVTLMLLGAMEQATRYSVG
Sbjct: 400 LLWLSLALILCSGKESCFHYVNGGGMEQLAHFFSQEMLNSATVTLMLLGAMEQATRYSVG 459
Query: 510 CEGLLGWWPREGEKVPSGVSEGYSLLLKLLMQKQRHDIAALTTYILHRLRFYEVASRYEY 569
CEGLLGWWPREGEKVPSGVSEGYSLLLKLLMQKQRHDIAALTTYILHRLRFYEVASRYEY
Sbjct: 460 CEGLLGWWPREGEKVPSGVSEGYSLLLKLLMQKQRHDIAALTTYILHRLRFYEVASRYEY 519
Query: 570 AVLSILGGLSGDGNATTTTLSLLLKAKLQLRKLLKMINSSGWVEDPSPMACAIRSLMVAS 629
AVLSILGGLSGDGNATTTTLSLLLKAKLQLRKLLKMINSSGWVEDPSPMACAIRSLMVAS
Sbjct: 520 AVLSILGGLSGDGNATTTTLSLLLKAKLQLRKLLKMINSSGWVEDPSPMACAIRSLMVAS 579
Query: 630 SEGSLSFKATSKLIASSSCRFLHQKIDAKLLSLLKERGFLPLSVALISSTELRAERGHAL 689
SEGSLSFKATSKLIASSSCRFLHQKIDAKLLSLLKERGFLPLSVALISSTELRAERGHAL
Sbjct: 580 SEGSLSFKATSKLIASSSCRFLHQKIDAKLLSLLKERGFLPLSVALISSTELRAERGHAL 639
Query: 690 EAFLDLTSSIEATILSLMFCRSGLVFLLQHPDLSSTIIRAFKGDDNVTKFIPLRYAYVLV 749
EAFLDLTSSIEATILSLMFCRSGLVFLLQHPDLSSTIIRAFKGDDNVTKFIPLRYAYVLV
Sbjct: 640 EAFLDLTSSIEATILSLMFCRSGLVFLLQHPDLSSTIIRAFKGDDNVTKFIPLRYAYVLV 699
Query: 750 SKGFFCHLQEVGTNLEMHLRVVHAVDRLLTSTPCSEELLWVLWELCGLSRSDGGRQALLA 809
SKGFFCHLQEVGTNLEMHLRVVHAVDRLLTSTPCSEELLWVLWELCGLSRSDGGRQALLA
Sbjct: 700 SKGFFCHLQEVGTNLEMHLRVVHAVDRLLTSTPCSEELLWVLWELCGLSRSDGGRQALLA 759
Query: 810 LTHFPEAFTILIEALHSAKELEPISLSCGTPLNLAIFHAAAEICEIIVTDPTSSSVSVWI 869
LTHFPEAFTILIEALHSAKELEPISLSCGTPLNLAIFHAAAEICEIIVTDPTSSSVSVWI
Sbjct: 760 LTHFPEAFTILIEALHSAKELEPISLSCGTPLNLAIFHAAAEICEIIVTDPTSSSVSVWI 819
Query: 870 DHATDLHRALHSSSPGSNRKDAPSRLLEWIDAGVVYHKNGAVGLLRYAAVLAFGGDAHIT 929
DHATDLHRALHSSSPGSNRKDAPSRLLEWIDAGVVYHKNGAVGLLRYAAVLAFGGDAHIT
Sbjct: 820 DHATDLHRALHSSSPGSNRKDAPSRLLEWIDAGVVYHKNGAVGLLRYAAVLAFGGDAHIT 879
Query: 930 STNILVSDSMDVEDVVGDTSNESDVNVVESLLGKLVSDKSFDGVALRDSSVAQLITAIRI 989
STNILVSDSMDVEDVVGDTSNESDVNVVESLLGKLVSDKSFDGVALRDSSVAQLITAIRI
Sbjct: 880 STNILVSDSMDVEDVVGDTSNESDVNVVESLLGKLVSDKSFDGVALRDSSVAQLITAIRI 939
Query: 990 LALISENADVAASLFDEGAMTVVYIILVNTRLMLERSSNNYDYLVDDGIECSSTSDLLLE 1049
LALISENADVAASLFDEGAMTVVYIILVNTRLMLERSSNNYDYLVDDGIECSSTSDLLLE
Sbjct: 940 LALISENADVAASLFDEGAMTVVYIILVNTRLMLERSSNNYDYLVDDGIECSSTSDLLLE 999
Query: 1050 RNREKYLVDLLIPSLLLLIELLQRLKEAKEQHRNTKLMNVLLRLHRELSPKLAACSVDLS 1109
RNREKYLVDLLIPSLLLLIELLQRLKEAKEQHRNTKLMNVLLRLHRELSPKLAACSVDLS
Sbjct: 1000 RNREKYLVDLLIPSLLLLIELLQRLKEAKEQHRNTKLMNVLLRLHRELSPKLAACSVDLS 1059
Query: 1110 SSYPDSALGLETVCHLIVSALACWPAHAWAPGLFHTLLDSVQAASLLALGPKETCSMLYL 1169
SSYPDSALGLETVCHLIVSALACWPAHAWAPGLFHTLLDSVQAASLLALGPKETCSMLYL
Sbjct: 1060 SSYPDSALGLETVCHLIVSALACWPAHAWAPGLFHTLLDSVQAASLLALGPKETCSMLYL 1119
Query: 1170 LIDLFPDEGVWHWKNGMPLLTASRSLAVGTLLGPLEERKVNWYLERPCVEILLGQLSPHL 1229
LIDLFPDEGVWHWKNGMPLLTASRSLAVGTLLGPLEERKVNWYLERPCVEILLGQLSPHL
Sbjct: 1120 LIDLFPDEGVWHWKNGMPLLTASRSLAVGTLLGPLEERKVNWYLERPCVEILLGQLSPHL 1179
Query: 1230 DKIAQVIHHYAVGTLVVVQDILRILILRIARQRTEFATVLLQPMISWIENHLIGTSPLDE 1289
DKIAQVIHHYAVGTLVVVQDILRILILRIARQRTEFATVLLQPMISWIENHLIGTSPLDE
Sbjct: 1180 DKIAQVIHHYAVGTLVVVQDILRILILRIARQRTEFATVLLQPMISWIENHLIGTSPLDE 1239
Query: 1290 IDAYKGYKLLDFLSLLLEHPLAKTLLLKEGTLQILTKVLGRIDAHVKSVSNGRSSLNSRF 1349
IDAYKGYKLLDFLSLLLEHPLAKTLLLKEGTLQILTKVLGRIDAHVKSVSNGRSSLNSRF
Sbjct: 1240 IDAYKGYKLLDFLSLLLEHPLAKTLLLKEGTLQILTKVLGRIDAHVKSVSNGRSSLNSRF 1299
Query: 1350 NIYNCCLPVLRSLSLICETRSAQKLHLLLDGNETLSTEDSAVIFSYIFKLCQHLPAGNEL 1409
NIYNCCLPVLRSLSLICETRSAQKLHLLLDGNETLSTEDSAVIFSYIFKLCQHLPAGNEL
Sbjct: 1300 NIYNCCLPVLRSLSLICETRSAQKLHLLLDGNETLSTEDSAVIFSYIFKLCQHLPAGNEL 1359
Query: 1410 VACLGTFKDLASTAIGQSTLLDTYLFIRSSSNNENQPGRSQESYQFYNRLIDAEWKGCPL 1469
VACLGTFKDLASTAIGQSTLLDTYLFIRSSSNNENQPGRSQESYQFYNRLIDAEWKGCPL
Sbjct: 1360 VACLGTFKDLASTAIGQSTLLDTYLFIRSSSNNENQPGRSQESYQFYNRLIDAEWKGCPL 1419
Query: 1470 LTCWINLYGSVETEGLSTCSVEAIGLLSIGVLRFCMSGRSFNEDRIDALKYFFGLSSDVD 1529
LTCWINLYGSVETEGLSTCSVEAIGLLSIGVLRFCMSGRSFNEDRIDALKYFFGLSSDVD
Sbjct: 1420 LTCWINLYGSVETEGLSTCSVEAIGLLSIGVLRFCMSGRSFNEDRIDALKYFFGLSSDVD 1479
Query: 1530 GANGVLEENIKYVQDMCTLLSSKITDDEYAVSSHLKATLHQAVESVKFLSLLLLKPIDSF 1589
GANGVLEENIKYVQDMCTLLSSKITDDEYAVSSHLKATLHQAVESVKFLSLLLLKPIDSF
Sbjct: 1480 GANGVLEENIKYVQDMCTLLSSKITDDEYAVSSHLKATLHQAVESVKFLSLLLLKPIDSF 1539
Query: 1590 KDDDAFTDEVLLFSSDAKLSSKIHLLTDSSSERVENELNFGELGESFQWECPENLRNRLS 1649
KDDDAFTDEVLLFSSDAKLSSKIHLLTDSSSERVENELNFGELGESFQWECPENLRNRLS
Sbjct: 1540 KDDDAFTDEVLLFSSDAKLSSKIHLLTDSSSERVENELNFGELGESFQWECPENLRNRLS 1599
Query: 1650 QAGLLGKRKISAVDGVNRHGKGENAPVDVRGSGSSMAPPIPTRRDTFRQRKPNTSRPPSM 1709
QAGLLGKRKISAVDGVNRHGKGENAPVDVRGSGSSMAPPIPTRRDTFRQRKPNTSRPPSM
Sbjct: 1600 QAGLLGKRKISAVDGVNRHGKGENAPVDVRGSGSSMAPPIPTRRDTFRQRKPNTSRPPSM 1659
Query: 1710 HVDDYVARERNESSNPNVIAVPRLGSSGGRPPSIHVDEFMARQRERHNSVPIAGVEASSQ 1769
HVDDYVARERNESSNPNVIAVPRLGSSGGRPPSIHVDEFMARQRERHNSVPIAGVEASSQ
Sbjct: 1660 HVDDYVARERNESSNPNVIAVPRLGSSGGRPPSIHVDEFMARQRERHNSVPIAGVEASSQ 1719
Query: 1770 GKKITLEIDNDTEKSNRSKQLKVDLDDDLQEIMFDAEENEADDKLPFPQADDDVQQTASA 1829
GKKITLEIDNDTEKSNRSKQLKVDLDDDLQEIMFDAEENEADDKLPFPQADDDVQQTASA
Sbjct: 1720 GKKITLEIDNDTEKSNRSKQLKVDLDDDLQEIMFDAEENEADDKLPFPQADDDVQQTASA 1779
Query: 1830 SAETSPQHSIVEETESEAKDLEATVASNAVENAQSELSSRMSVSRPEMQLRREPSVSSEN 1889
SAETSPQHSIVEETESEAKDLEATVASNAVENAQSELSSRMSVSRPEMQLRREPSVSSEN
Sbjct: 1780 SAETSPQHSIVEETESEAKDLEATVASNAVENAQSELSSRMSVSRPEMQLRREPSVSSEN 1839
Query: 1890 KYFEQPDETNNKNPTMVSGGFDSVAVAKTSGFPASVYGKHSPSGAQVPADARIPPSSFIP 1949
KYFEQPDETNNKNPTMVSGGFDSVAVAKTSGFPASVYGKHSPSGAQVPADARIPPSSFIP
Sbjct: 1840 KYFEQPDETNNKNPTMVSGGFDSVAVAKTSGFPASVYGKHSPSGAQVPADARIPPSSFIP 1899
Query: 1950 DNNLQRVGNVLLGRSQGRHEQKHLPNQPPLPPTPPPSTISPVRSQVSENQTSRSPFVNPH 2009
DNNLQRVGNVLLGRSQGRHEQKHLPNQPPLPPTPPPSTISPVRSQVSENQTSRSPFVNPH
Sbjct: 1900 DNNLQRVGNVLLGRSQGRHEQKHLPNQPPLPPTPPPSTISPVRSQVSENQTSRSPFVNPH 1959
Query: 2010 NDVQPPLYPTYAQTDYQSSFGNVPTSFASSNFMSDSRYARTTNSSPGSSNRPLPPLPPTP 2069
NDVQPPLYPTYAQTDYQSSFGNVPTSFASSNFMSDSRYARTTNSSPGSSNRPLPPLPPTP
Sbjct: 1960 NDVQPPLYPTYAQTDYQSSFGNVPTSFASSNFMSDSRYARTTNSSPGSSNRPLPPLPPTP 2019
Query: 2070 PSFSATPLNVPSRSPTSQSSMYVQHNAGATDASQYSSPPLIPPGFSRLPSMPFNMHGNIP 2129
PSFSATPLNVPSRSPTSQSSMYVQHNAGATDASQYSSPPLIPPGFSRLPSMPFNMHGNIP
Sbjct: 2020 PSFSATPLNVPSRSPTSQSSMYVQHNAGATDASQYSSPPLIPPGFSRLPSMPFNMHGNIP 2079
Query: 2130 VQQSDNITGIAQNSRGPQPSMQSMQPLPHLQPLQPPQLPCPQQLRLPIQNSLHTEQSGSM 2189
VQQSDNITGIAQNSRGPQPSMQSMQPLPHLQPLQPPQLPCPQQLRLPIQNSLHTEQSGSM
Sbjct: 2080 VQQSDNITGIAQNSRGPQPSMQSMQPLPHLQPLQPPQLPCPQQLRLPIQNSLHTEQSGSM 2139
Query: 2190 SQSHVQMQVQPPQILQQSQTSPYHLYYQTQSQENIPHPQQPQAAHHQVDAATQQQMDSSM 2249
SQSHVQMQVQPPQILQQSQTSPYHLYYQTQSQENIPHPQQPQAAHHQVDAATQQQMDSSM
Sbjct: 2140 SQSHVQMQVQPPQILQQSQTSPYHLYYQTQSQENIPHPQQPQAAHHQVDAATQQQMDSSM 2199
Query: 2250 SLQQYFSSPEAIQSLLSDRDKLCQLLEQHPKLMQMLQDQLGHL 2293
SLQQYFSSPEAIQSLLSDRDKLCQLLEQHPKLMQMLQDQLGHL
Sbjct: 2200 SLQQYFSSPEAIQSLLSDRDKLCQLLEQHPKLMQMLQDQLGHL 2242
BLAST of Spo15904.1 vs. NCBI nr
Match:
gi|731333628|ref|XP_010677830.1| (PREDICTED: uncharacterized protein LOC104893420 [Beta vulgaris subsp. vulgaris])
HSP 1 Score: 3389.4 bits (8787), Expect = 0.000e+0
Identity = 1777/2171 (81.85%), Postives = 1923/2171 (88.58%), Query Frame = 1
Query: 130 MGRPEPCVLFAQTFVHPQLDEYVDEVMFAEPIVISACEFLEQNAPSASSAVTLLGATSPP 189
M RPEPCVLFAQTFVHPQLDEYVDEV+FAEPIVI+ACEFLEQNA SAS+AVTLLGAT PP
Sbjct: 1 MVRPEPCVLFAQTFVHPQLDEYVDEVIFAEPIVITACEFLEQNASSASAAVTLLGATLPP 60
Query: 190 SFALEVFVQCEGEARFRRLCQPFLYSHSSSNVLEVEAVVTNHLVVRGSYRSLSLVVYGNT 249
SFALEVFVQCEGEARFRRLCQPFLYSHSSSNVLEVEAVVTNHLVVRGSYRSLSLVVYGNT
Sbjct: 61 SFALEVFVQCEGEARFRRLCQPFLYSHSSSNVLEVEAVVTNHLVVRGSYRSLSLVVYGNT 120
Query: 250 AEDLGQFNVEFDLDNSVAKLVSSSEAKLEDLPPLLHSRNPKFEDAISSLKALSSMLPAAD 309
AEDLGQFNVEFDLDNSV LVSSSE KLEDLPPLLHSRNPK ED ISSLKALSSMLPA D
Sbjct: 121 AEDLGQFNVEFDLDNSVTNLVSSSEMKLEDLPPLLHSRNPKHEDTISSLKALSSMLPATD 180
Query: 310 ISVQMRQFLQLIFKILDIPNFGEFSQKVMALITSAVASFFTDDLASAGITLQELNLGALA 369
IS + RQFLQLI KILD+P+ G+F QK+MAL+TS ASFFT DLAS GIT QELNLGA+
Sbjct: 181 ISAERRQFLQLILKILDVPDCGDFFQKIMALVTSVAASFFTIDLASTGITFQELNLGAVL 240
Query: 370 NVKGSFHVFTEAQRNLFELCKRYVGAGHIYGDFLVGCSFLESEAELATSKELVDMLFQHF 429
N KG+ HVFTEA+ NL +LCK YV AGHI GDFLV CSFLESEAELATSKELVD+ FQHF
Sbjct: 241 NFKGASHVFTEAKINLLDLCKTYVEAGHICGDFLVDCSFLESEAELATSKELVDIFFQHF 300
Query: 430 PFNRDTVFVGHPHLSKSNNVLLWLSLALILCSGKESCFHYVNGGGMEQLAHFFSQEMLNS 489
F+R+T+ VGHPH+S+SNNVLLWLSLALIL SGKESCFHYVNGGGM+QLAHF S +M NS
Sbjct: 301 HFDRETI-VGHPHISQSNNVLLWLSLALILSSGKESCFHYVNGGGMKQLAHFLSHDMQNS 360
Query: 490 ATVTLMLLGAMEQATRYSVGCEGLLGWWPREGEKVPSGVSEGYSLLLKLLMQKQRHDIAA 549
ATVTLMLLG +EQATRYSVGCEGLLGWWPRE E VPS VSEGYS LLKLLMQKQRHDIAA
Sbjct: 361 ATVTLMLLGVIEQATRYSVGCEGLLGWWPRESEDVPSSVSEGYSQLLKLLMQKQRHDIAA 420
Query: 550 LTTYILHRLRFYEVASRYEYAVLSILGGLSGDGNATTTTLSLLLKAKLQLRKLLKMINSS 609
L TYILHRLR YEVASRYEYAVLSILGG SGDGNATTTTLS+L+ AKLQLRKLLK+INSS
Sbjct: 421 LVTYILHRLRVYEVASRYEYAVLSILGGHSGDGNATTTTLSMLVNAKLQLRKLLKLINSS 480
Query: 610 GWVEDPSPMACAIRSLMVASSEGSLSFKATSKLIASSSCRFLHQKIDAKLLSLLKERGFL 669
GWVEDPSP+ACAIRS M+++SEG SFKAT KLIASSSCRFLHQ+IDAKLLSL+KERGFL
Sbjct: 481 GWVEDPSPLACAIRSSMMSASEGVSSFKATGKLIASSSCRFLHQEIDAKLLSLMKERGFL 540
Query: 670 PLSVALISSTELRAERGHALEAFLDLTSSIEATILSLMFCRSGLVFLLQHPDLSSTIIRA 729
PLSVALIS +LR ERGHALE+FLD+TSSIEATILS MFCRSGLVFLLQHPDLSSTII A
Sbjct: 541 PLSVALISLPKLRDERGHALESFLDVTSSIEATILSFMFCRSGLVFLLQHPDLSSTIIHA 600
Query: 730 FKGDDNVTKFIPLRYAYVLVSKGFFCHLQEVGTNLEMHLRVVHAVDRLLTSTPCSEELLW 789
FKG+D V + IPLRYA +LVSKGFFC LQEVG N+EMHLRVV+AV RLLTSTPCSEELLW
Sbjct: 601 FKGNDEVIECIPLRYASLLVSKGFFCRLQEVGLNIEMHLRVVNAVGRLLTSTPCSEELLW 660
Query: 790 VLWELCGLSRSDGGRQALLALTHFPEAFTILIEALHSAKELEPISLSCGTPLNLAIFHAA 849
VLWELCGLSRSDGGRQA+LAL HFPEA TILIEALHSAKE E ISLS GTPLNLAIFHAA
Sbjct: 661 VLWELCGLSRSDGGRQAVLALAHFPEAITILIEALHSAKESESISLSSGTPLNLAIFHAA 720
Query: 850 AEICEIIVTDPTSSSVSVWIDHATDLHRALHSSSPGSNRKDAPSRLLEWIDAGVVYHKNG 909
AEICEIIVTDPTSSSVSVWIDHA +LH+ALHSSSPGS+RKDAPSRLLEWIDAGVVYHKNG
Sbjct: 721 AEICEIIVTDPTSSSVSVWIDHAAELHKALHSSSPGSSRKDAPSRLLEWIDAGVVYHKNG 780
Query: 910 AVGLLRYAAVLAFGGDAHITSTNILVSDSMDVEDVVGDTSNESDVNVVESLLGKLVSDKS 969
AVGLLRYAA LA GGDAHITSTNILVSDSMDVE+VVGD+SNESDVNVVE+LLGKLVSDK+
Sbjct: 781 AVGLLRYAAFLASGGDAHITSTNILVSDSMDVENVVGDSSNESDVNVVENLLGKLVSDKT 840
Query: 970 FDGVALRDSSVAQLITAIRILALISENADVAASLFDEGAMTVVYIILVNTRLMLERSSNN 1029
FDGV LRDSSVAQL TAIRILALISEN DVAASLFDEGAMTV+Y ILVN+RLMLERSSNN
Sbjct: 841 FDGVPLRDSSVAQLTTAIRILALISENVDVAASLFDEGAMTVIYAILVNSRLMLERSSNN 900
Query: 1030 YDYLVDDGIECSSTSDLLLERNREKYLVDLLIPSLLLLIELLQRLKEAKEQHRNTKLMNV 1089
YDYLVDDG+ECSSTSDLLLERNREKYLVDLLIPSLLLL+ELLQRLKEAKEQHRNTKLM V
Sbjct: 901 YDYLVDDGVECSSTSDLLLERNREKYLVDLLIPSLLLLLELLQRLKEAKEQHRNTKLMTV 960
Query: 1090 LLRLHRELSPKLAACSVDLSSSYPDSALGLETVCHLIVSALACWPAHAWAPGLFHTLLDS 1149
LLRLH ELSPKLAA +VDLSSSYP+SALGLETVCHLI SALACWPAHAWAPGLFHTLLDS
Sbjct: 961 LLRLHCELSPKLAASAVDLSSSYPESALGLETVCHLIASALACWPAHAWAPGLFHTLLDS 1020
Query: 1150 VQAASLLALGPKETCSMLYLLIDLFPDEGVWHWKNGMPLLTASRSLAVGTLLGPLEERKV 1209
VQAASLLALGPKETCSML +LIDLFP+EG WHWKNGMPL+TA+RSLA+GTLLGPL+E+KV
Sbjct: 1021 VQAASLLALGPKETCSMLCVLIDLFPEEGFWHWKNGMPLMTAARSLALGTLLGPLKEKKV 1080
Query: 1210 NWYLERPCVEILLGQLSPHLDKIAQVIHHYAVGTLVVVQDILRILILRIARQRTEFATVL 1269
NWYLE +EILLGQLSPHLDKIAQVI HYAV TLVVVQD+LRILILRIARQR E+A VL
Sbjct: 1081 NWYLEHSYIEILLGQLSPHLDKIAQVILHYAVSTLVVVQDMLRILILRIARQRAEYAIVL 1140
Query: 1270 LQPMISWIENHLIGTSPLDEIDAYKGYKLLDFLSLLLEHPLAKTLLLKEGTLQILTKVLG 1329
LQP+ISW++NH+ S L ++D YKGYKLLDFLSLL EHPLAKTLLLKEGTLQILTKVLG
Sbjct: 1141 LQPIISWVQNHVAEPSSLGDMDTYKGYKLLDFLSLLFEHPLAKTLLLKEGTLQILTKVLG 1200
Query: 1330 RIDAHVKSVSNGRSSLNSRFNIYNCCLPVLRSLSLICETRSAQKLHLLLDGNETLSTEDS 1389
RI AH KSVS+GRSSLN+ F+IYNCCLPVLRSL+LIC+TR QKL+ + D NE LSTEDS
Sbjct: 1201 RIYAHGKSVSDGRSSLNAGFSIYNCCLPVLRSLTLICDTRPTQKLYGVFDCNEILSTEDS 1260
Query: 1390 AVIFSYIFKLCQHLPAGNELVACLGTFKDLASTAIGQSTLLDTYLFIRSSSNNENQPGRS 1449
AVI +Y+FKLCQ LP GNELVACL TFK++AS A GQ L D +LFIRS + +ENQPGR
Sbjct: 1261 AVILNYVFKLCQLLPVGNELVACLTTFKEVASHAEGQRALFDAFLFIRSGNTSENQPGRP 1320
Query: 1450 QESYQFYNRLIDAEWKGCPLLTCWINLYGSVETEGLSTCSVEAIGLLSIGVLRFCMSGRS 1509
QE + FYN LID EW+ CPLL CWINLY SVE EGLST SV+A+G+LS+G +RFC+ G+S
Sbjct: 1321 QEFHGFYNHLIDVEWRRCPLLCCWINLYRSVENEGLSTYSVDAVGVLSVGAMRFCIEGKS 1380
Query: 1510 FNEDRIDALKYFFGLSSDVDGANGVLEENIKYVQDMCTLLSSKITDDEYAVSSHLKATLH 1569
FN+D+IDALKYFFGL D D NG LEEN+KYVQDM +LLSSK+TD+EYA SSHLKATLH
Sbjct: 1381 FNKDKIDALKYFFGLWCDTDETNGTLEENMKYVQDMSSLLSSKMTDNEYAASSHLKATLH 1440
Query: 1570 QAVESVKFLSLLLLKPIDSFKDDDAFTDEVLLFSSDAKLSSKIHLLTDSSSERVENELNF 1629
Q VES + LSLLLLKPID+ K +D ++ VLLFS DAKLS+KIHLLTDSSSERVENE N
Sbjct: 1441 QVVESARCLSLLLLKPIDALKVNDIISEGVLLFSPDAKLSTKIHLLTDSSSERVENEWNI 1500
Query: 1630 GELGESFQWECPENLRNRLSQAGLLGKRKISAVDGVNRHGKGENAPVDVRGSGSSMAPPI 1689
GEL + FQWECPENLRNRLSQAGLLGKRKIS VDG NRHGKG+NAPVDVRGSG S+APPI
Sbjct: 1501 GELSDQFQWECPENLRNRLSQAGLLGKRKISTVDGANRHGKGDNAPVDVRGSGPSIAPPI 1560
Query: 1690 PTRRDTFRQRKPNTSRPPSMHVDDYVARERNES--SNPNVIAVPRLGSSGGRPPSIHVDE 1749
PTRRDTFRQRKPNTSRPPSMHVDDYVARERN SNPNVIAVPRLGSSGGRPPSIHVDE
Sbjct: 1561 PTRRDTFRQRKPNTSRPPSMHVDDYVARERNSDGVSNPNVIAVPRLGSSGGRPPSIHVDE 1620
Query: 1750 FMARQRERHNSVPIAGVEASSQGKKITLEIDNDTEKSNRSKQLKVDLDDDLQ--EIMFDA 1809
FMARQRERHNSV +AGVE +S GKK+T E DN TEKSNRSKQLKVDLDDDLQ +I+FDA
Sbjct: 1621 FMARQRERHNSVAMAGVEVASLGKKVTPENDNGTEKSNRSKQLKVDLDDDLQGIDIVFDA 1680
Query: 1810 EENEADDKLPFPQADDDVQQTASASAETSPQHSIVEETESEAKDLEATVASNAVENAQSE 1869
EENE DDKLPFPQ DD++Q+ ASASAE SPQHSIV ETES+AKDLEATVASNA EN QSE
Sbjct: 1681 EENEPDDKLPFPQEDDEIQRAASASAERSPQHSIVAETESDAKDLEATVASNADENTQSE 1740
Query: 1870 LSSRMSVSRPEMQLRREPSVSSENKYFEQPDETNNKNPTMVSGGFDSVAVAKTSGFPASV 1929
LSSRMSVS P MQLRREPSVSSE KYF + D+ N ++S GFDS AVAKTSG AS
Sbjct: 1741 LSSRMSVSHPGMQLRREPSVSSEKKYFGKLDQNNGT--AIISDGFDS-AVAKTSGLSASK 1800
Query: 1930 YGKHSPSGAQVPADARIPPSSFIPDNNLQRVGNVLLGRSQGRHEQKHLPNQPPLPPTPPP 1989
YGK + S AQVP D+R PP S+ PDNNLQR+GNV LG QG HEQK++PNQPPLPPTPPP
Sbjct: 1801 YGKGT-SMAQVPMDSRTPPPSYCPDNNLQRIGNVALGGPQGHHEQKYMPNQPPLPPTPPP 1860
Query: 1990 STISPVRSQVSENQTSRSPFVNPHNDVQPPLYPTYAQTDYQSSFGNVPTSFASSNFMSDS 2049
STISP +SQV N S SPFVNPHNDVQPPLY Y QTDYQS FG+VP+S ASSNFM DS
Sbjct: 1861 STISPGKSQVDTNSGSASPFVNPHNDVQPPLYQNYTQTDYQS-FGSVPSSLASSNFMLDS 1920
Query: 2050 RYARTTNSSPGSSNRPLPPLPPTPPSFSATPLNVPSRSPTSQSSMYVQHNAGATDASQYS 2109
RYARTT+SSPG SNRPLPPLPPTPP FS TPLN+PSRS SQSS+Y Q+N G + SQ++
Sbjct: 1921 RYARTTHSSPGGSNRPLPPLPPTPPPFSTTPLNLPSRSSASQSSVYAQNNTGVNEVSQFA 1980
Query: 2110 SPPLIPPGFSRLPSMPFNMHGNIPVQQSDNITGIAQNSRGPQPSMQSMQPLPHLQPLQPP 2169
PPL+PPGF+R SMPF MHGNIPVQQSDN+TG+AQNS QP MQSMQPLPHLQPLQPP
Sbjct: 1981 PPPLMPPGFNRPSSMPFTMHGNIPVQQSDNVTGVAQNSNASQPPMQSMQPLPHLQPLQPP 2040
Query: 2170 QLPCPQ---QLRLPIQNSLHTEQSGSMSQSHVQMQVQPPQILQQSQTSPYHLYYQTQSQE 2229
QLPCPQ QLR P+QN LHTEQ GS SQSH VQP QILQQSQ SPYHLYYQ Q Q+
Sbjct: 2041 QLPCPQLPHQLRPPMQNPLHTEQRGSASQSH----VQPQQILQQSQMSPYHLYYQNQLQD 2100
Query: 2230 NIPHPQQP-QAAHHQVDAATQQQMDSSMSLQQYFSSPEAIQSLLSDRDKLCQLLEQHPKL 2289
N PH QQ Q AHHQ +A TQQQ+DSSMSLQQYFSSPEAIQSLLSDRDKLCQLLEQHPKL
Sbjct: 2101 NFPHSQQQLQVAHHQGEAVTQQQVDSSMSLQQYFSSPEAIQSLLSDRDKLCQLLEQHPKL 2160
Query: 2290 MQMLQDQLGHL 2293
MQMLQ++LG L
Sbjct: 2161 MQMLQERLGRL 2161
BLAST of Spo15904.1 vs. NCBI nr
Match:
gi|731374428|ref|XP_010653013.1| (PREDICTED: uncharacterized protein LOC100246722 [Vitis vinifera])
HSP 1 Score: 2369.0 bits (6138), Expect = 0.000e+0
Identity = 1320/2232 (59.14%), Postives = 1634/2232 (73.21%), Query Frame = 1
Query: 130 MGRPEPCVLFAQTFVHPQLDEYVDEVMFAEPIVISACEFLEQNAPSASSAVTLLGATSPP 189
MGRPEPCVLFAQTFVHPQLDEYVDEV+FAEP+VI++CEFLEQNA S S +TLLGATSPP
Sbjct: 1 MGRPEPCVLFAQTFVHPQLDEYVDEVIFAEPVVITSCEFLEQNASSVSPVITLLGATSPP 60
Query: 190 SFALEVFVQCEGEARFRRLCQPFLYSHSSSNVLEVEAVVTNHLVVRGSYRSLSLVVYGNT 249
SFALEVFVQ EGE RFRRLCQPFLYSHSSSNVLEVEAVVTNHLVVRGSYRSLSLV+YGNT
Sbjct: 61 SFALEVFVQSEGETRFRRLCQPFLYSHSSSNVLEVEAVVTNHLVVRGSYRSLSLVIYGNT 120
Query: 250 AEDLGQFNVEFDLDNSVAKLVSSSEAKLEDLPPLLHSRNPKFEDAISSLKALSSMLPAAD 309
AEDLGQ+N+EFDLD+S+ +V SSE KL+DLPP LHS+N E++ISSLKALS + A+D
Sbjct: 121 AEDLGQYNIEFDLDSSLTNVVCSSEGKLDDLPPALHSKNLTIEESISSLKALSLPVAASD 180
Query: 310 ISVQMRQFLQLIFKILDIPNFGEFSQKVMALITSAVASFFTDDLASAGITLQELNLGALA 369
IS++++QFLQL+FKIL++ N G+ KV+ + SA +S+ DL A + ++
Sbjct: 181 ISIEIKQFLQLMFKILELTNLGDAVHKVLDTVVSAASSYSAHDLHYAAVNQKKFTQSTNN 240
Query: 370 NVKGSFHVFTEAQRNLFELCKRYVG-AGHIYGDFLVGCSFLESEAELATSKELVDMLFQH 429
+ + S V A++ L +L K +G+ + L CSFLESE +LA+SKEL+DML QH
Sbjct: 241 SNEESHFVLDAAKKELLDLYKTLQDESGNSSVELLEECSFLESEIDLASSKELMDMLIQH 300
Query: 430 FPFNRDTVFVGHPHLSKSNNVLLWLSLALILCSGKESCFHYVNGGGMEQLAHFFSQEMLN 489
F F R+ + VGH HLS+ V+L LS+AL LCS KESCF +VNGGGMEQLA FS ++ N
Sbjct: 301 FLFKRNFLSVGHYHLSQKKKVILVLSVALFLCSAKESCFQFVNGGGMEQLACVFSDDLQN 360
Query: 490 SATVTLMLLGAMEQATRYSVGCEGLLGWWPREGEKVPSGVSEGYSLLLKLLMQKQRHDIA 549
S +TLMLLG +EQATRYS+GCEG LGWWPRE + VPSG+SEGYS LLKLL++KQRHDIA
Sbjct: 361 STAITLMLLGVVEQATRYSIGCEGFLGWWPREDDNVPSGISEGYSRLLKLLLEKQRHDIA 420
Query: 550 ALTTYILHRLRFYEVASRYEYAVLSILGGLSGDGNATTTTLSLLLKAKLQLRKLLKMINS 609
+L TY LHRLRFYEV SRYE AVLS+LGGLS G T TL +L+ AK+QL+KLLK+INS
Sbjct: 421 SLATYALHRLRFYEVVSRYECAVLSVLGGLSTVGRVTGATLDMLISAKVQLKKLLKLINS 480
Query: 610 SGWVEDPSPMACAIRSLMVASSEGSLSFKATSKLIASSSCRFLHQKIDAKLLSLLKERGF 669
G +EDPSP+ACA RSL++ +EG LS+KATS LI S+C F + ID LLSL+KERGF
Sbjct: 481 RGPIEDPSPVACASRSLILGQTEGLLSYKATSNLIGLSNCCFSSRDIDIHLLSLVKERGF 540
Query: 670 LPLSVALISSTELRAERGHALEAFLDLTSSIEATILSLMFCRSGLVFLLQHPDLSSTIIR 729
LPLS AL+SS+ LR+E GHA++ F+D+TSSIEA ILSL+FCRSGL+FLL HP+LS+T+I
Sbjct: 541 LPLSAALLSSSILRSEVGHAMDIFVDITSSIEAIILSLLFCRSGLIFLLLHPELSATVIL 600
Query: 730 AFKGDDNVTK--FIPLRYAYVLVSKGFFCHLQEVGTNLEMHLRVVHAVDRLLTSTPCSEE 789
A +G D+ K PLRYA +L+SKGFFC +EVG +EMHLRVV+AVDRLL+STP SEE
Sbjct: 601 ALRGVDDFHKEDCAPLRYASILISKGFFCRPREVGLVVEMHLRVVNAVDRLLSSTPQSEE 660
Query: 790 LLWVLWELCGLSRSDGGRQALLALTHFPEAFTILIEALHSAKELEPISLSCGTPLNLAIF 849
LWVLWELCGLSRSD GRQALLAL HFPEA +L+EALHS KELEP++ + +PLNLAIF
Sbjct: 661 FLWVLWELCGLSRSDSGRQALLALGHFPEAVLVLMEALHSVKELEPVTTTGTSPLNLAIF 720
Query: 850 HAAAEICEIIVTDPTSSSVSVWIDHATDLHRALHSSSPGSNRKDAPSRLLEWIDAGVVYH 909
H+A+EI E++VTD T+SS++ WI HA +LH+ALHSSSPGSNRKDAP+RLLEWIDAGVV+H
Sbjct: 721 HSASEIFEVLVTDSTASSLASWIGHAMELHKALHSSSPGSNRKDAPTRLLEWIDAGVVFH 780
Query: 910 KNGAVGLLRYAAVLAFGGDAHITSTNILVSDSMDVEDVVGDTSNESDVNVVESLLGKLVS 969
KNG GLLRYAAVLA GGDAH+TST+IL SDSMDVE+ VGD+S+ SD NV+E+L GKL+S
Sbjct: 781 KNGVTGLLRYAAVLASGGDAHLTSTSILGSDSMDVENAVGDSSSGSDTNVIENL-GKLIS 840
Query: 970 DKSFDGVALRDSSVAQLITAIRILALISENADVAASLFDEGAMTVVYIILVNTRLMLERS 1029
+KSFDGV LRDSSVAQL TA RILA ISEN+ VAA+L+DEGA+ ++Y +LV+ R MLERS
Sbjct: 841 EKSFDGVTLRDSSVAQLTTAFRILAFISENSAVAAALYDEGAIIIIYAVLVDCRFMLERS 900
Query: 1030 SNNYDYLVDDGIECSSTSDLLLERNREKYLVDLLIPSLLLLIELLQRLKEAKEQHRNTKL 1089
SNNYDYLVD+G EC+STSDLLLER+REK LVDLLIP L+LLI LL++L+EA+EQHRNTKL
Sbjct: 901 SNNYDYLVDEGTECNSTSDLLLERSREKSLVDLLIPLLVLLITLLKKLQEAQEQHRNTKL 960
Query: 1090 MNVLLRLHRELSPKLAACSVDLSSSYPDSALGLETVCHLIVSALACWPAHAWAPGLFHTL 1149
MN LLRLHRE+SPKLAAC+ DLSSSYPD+ALG VC+L+VSALACWP + W PGLFH+L
Sbjct: 961 MNALLRLHREVSPKLAACAADLSSSYPDAALGFGAVCNLLVSALACWPIYGWTPGLFHSL 1020
Query: 1150 LDSVQAASLLALGPKETCSMLYLLIDLFPDEGVWHWKNGMPLLTASRSLAVGTLLGPLEE 1209
L SVQA S LALGPKETCS+L +L DLFP+EGVW WKNGMPLL+A R+LAVGTLLGP +E
Sbjct: 1021 LASVQATSSLALGPKETCSLLCILNDLFPEEGVWLWKNGMPLLSAVRTLAVGTLLGPQKE 1080
Query: 1210 RKVNWYLERPCVEILLGQLSPHLDKIAQVIHHYAVGTLVVVQDILRILILRIARQRTEFA 1269
R+VNWYL E+LL QL+P LDKI+QVI HYA+ +LVV+QD+LR+ I+RIA Q+ + A
Sbjct: 1081 REVNWYLHPGHPEVLLNQLTPQLDKISQVILHYAMTSLVVIQDMLRVFIIRIACQKADNA 1140
Query: 1270 TVLLQPMISWIENHLIGTSPLDEIDAYKGYKLLDFLSLLLEHPLAKTLLLKEGTLQILTK 1329
++LLQP++SWI L +S ++DAYK Y+LLDFL+ LLEHP AK LLLKEG +Q+L K
Sbjct: 1141 SLLLQPIMSWIRMRLSESSCQTDVDAYKIYRLLDFLACLLEHPCAKPLLLKEGAIQMLIK 1200
Query: 1330 VLGR----IDAHVKSVSNGRSSLNSRFNIYNCCLPVLRSLSLICETRSAQKL--HLLLDG 1389
L R ++ K +S+GR+S ++ CLP+ +SLSLIC + ++ + +
Sbjct: 1201 ALERCVDATESDGKQLSDGRNSAKCSLTAFSWCLPLCKSLSLICGSHMSRHYIGNYAKND 1260
Query: 1390 NETLSTEDSAVIFSYIFKLCQHLPAGNELVACLGTFKDLASTAIGQSTLLDTYLFIRSSS 1449
E LS+ED ++I Y+ KLCQ LP G EL+ACL FK+L S GQ+ L+ +L R SS
Sbjct: 1261 FEHLSSEDCSLILPYLLKLCQILPVGRELLACLTVFKELGSCNEGQNALMAVFLRAR-SS 1320
Query: 1450 NNENQPGRSQESYQFYNRLIDAEW-KGCPLLTCWINLYGSVE-TEGLSTCSVEAIGLLSI 1509
+ E + + E YN L + EW K PLL CW L SV+ ++G ++EA+G LS+
Sbjct: 1321 DEELELEKGHERGGNYNVLNEYEWMKLPPLLCCWTKLLRSVDPSDGFPAYAIEAVGALSL 1380
Query: 1510 GVLRFCMSGRSFNEDRIDALKYFFGLSSDVDGANGVLEENIKYVQDMCTLLSSKITDDEY 1569
G LRFCM G+S N DR+ A+K+ FGL D+ G + EENI+Y+Q++ TLL SK+TD++Y
Sbjct: 1381 GALRFCMDGKSLNLDRVFAMKFLFGLPHDLSGMDDFPEENIRYIQELTTLLGSKVTDEDY 1440
Query: 1570 AVSSHLKATLHQAVESVKFLSLLLLKPIDSFKDDDAFTDE-VLLFSSDAKLSSKIHLLTD 1629
+ S +K TL +A + K L L+L P S D + E V L +D LSS+IH + D
Sbjct: 1441 SAKSDMKTTLCRASDYAKSLLLMLQNPAGSLDLGDIISSEDVPLSPNDVILSSRIHQMID 1500
Query: 1630 SSSERVENELNFGELGESFQWECPENLRNRLSQAGLLGKRKISAVDGVNRHGKGENAPVD 1689
+S+E+VE+ G L + F WECPE L +RL Q L KRK+S+++G +R +G+N+P +
Sbjct: 1501 NSAEKVEDYCCLGGLEDKFLWECPETLPDRLLQTTLPAKRKMSSLEGPSRRARGDNSPAE 1560
Query: 1690 VRGSGS---SMAPPI----PTRRDTFRQRKPNTSRPPSMHVDDYVARERNES--SNPNVI 1749
G+ ++ PP P+RRDTFR RKPNTSRPPSMHVDDYVARERN SN NVI
Sbjct: 1561 TVAQGAFSRALGPPSASSGPSRRDTFRLRKPNTSRPPSMHVDDYVARERNVDGVSNSNVI 1620
Query: 1750 AVPRLGSSGGRPPSIHVDEFMARQRERHNSVPIAGVEASSQGKKITLEIDNDTEKSNRSK 1809
AV R+G++GGRPPSIHVDEFMARQRER N V A E ++Q K E D D EK N+S+
Sbjct: 1621 AVQRIGTTGGRPPSIHVDEFMARQRERQNPVVSAVGEVAAQAKNAAPENDADMEKFNKSR 1680
Query: 1810 QLKVDLDDDLQ--EIMFDAEENEADDKLPFPQADDDVQQTASASAETSPQHSIVEETESE 1869
Q+K DLDDDLQ +I+FD EE+E D+KLPFPQ DD++QQ AS E S SIVEETES+
Sbjct: 1681 QIKADLDDDLQGIDIVFDGEESEPDEKLPFPQPDDNLQQPASVIVEQSSPRSIVEETESD 1740
Query: 1870 AKD------LEATVASNAVENAQSELSSRMSVSRPEMQLRREPSVSSENKYFEQPDETNN 1929
+ L +A N EN +SE SSRMSVSRPE L REPSVSSE KYFEQ D+ N
Sbjct: 1741 VNENSQFSRLGTPLALNVNENPESEFSSRMSVSRPERPLTREPSVSSEKKYFEQSDDMKN 1800
Query: 1930 KNPTMVSGGFDSVAVAKTSGFPASVYGKHSPSGAQVPADARIPPSSFIPDNNLQRVGNVL 1989
P M +DS A +SGFPAS YGK S S + D+R+ +F N+ Q+ GN+
Sbjct: 1801 VIPAMTPSRYDSAGAAISSGFPASTYGKASVSSVPLMVDSRMVQPNFYLKNSSQQAGNMA 1860
Query: 1990 LGR-SQGRHEQKHLPNQPPLPPTPPPSTISPVRSQVSENQTSR-SPFVNPHNDVQPPLYP 2049
L SQG ++QK + NQPPLPP PPP TISP+ SQ + S+ S FVN DVQPPL
Sbjct: 1861 LATGSQGLYDQKFMLNQPPLPPMPPPPTISPIISQAPDPALSQSSSFVNTATDVQPPLPT 1920
Query: 2050 TY-AQTDYQSSFGNVPTSFASSNFMSDSRYARTTNSSPGSSNRPLPPLPPTPPSFSATPL 2109
+ Q++Y S+F N TS ASS M DS+Y+R + SSP S RP PPLPPTPP FSA P
Sbjct: 1921 AFQVQSEYLSAFTNSSTSLASSLSMPDSKYSRASLSSPSGSARPPPPLPPTPPPFSAAPF 1980
Query: 2110 NVPS-RSPTSQSSMYVQHNAGATDASQYSS----------------------PPLIPP-G 2169
+ S + S SS+Y Q + TD Q S PPL+PP
Sbjct: 1981 TLASLKVSVSSSSVYNQTSGATTDLPQISGASLTDARLGNLSASGTRLSSYPPPLVPPLV 2040
Query: 2170 FSRLPSMPFNMHGNIPV-QQSDNITGIAQNSRGPQPSMQSMQPLPHLQPLQPPQLP---- 2229
FSR S+P +++G+ QQ +N + QN PQ S+QS+Q LQPLQPPQLP
Sbjct: 2041 FSRPASIPVSIYGSTTTQQQGENPSNTIQNPPIPQLSIQSIQSFAQLQPLQPPQLPRPPQ 2100
Query: 2230 CPQQLRLPIQNSLHTEQSGSMSQSHVQMQVQPPQILQQSQTSPYHLYYQTQSQENIPHPQ 2289
PQ LR P+Q S EQ S+ QS +Q+ VQP Q+LQQ Q SP H+YYQ Q QEN PH Q
Sbjct: 2101 PPQHLRPPVQPSQQPEQGVSLLQSPIQLPVQPLQMLQQPQVSPLHVYYQQQQQENFPHVQ 2160
Query: 2290 QPQAAHH--------QVDAATQQQMDSSMSLQQYFSSPEAIQSLLSDRDKLCQLLEQHPK 2293
Q Q H Q D+++Q + DS MSLQQYFSSPEAIQSLL DRDKLCQLLEQHPK
Sbjct: 2161 QQQQVEHGQHQVLRQQGDSSSQLEQDSGMSLQQYFSSPEAIQSLLCDRDKLCQLLEQHPK 2220
BLAST of Spo15904.1 vs. NCBI nr
Match:
gi|1009164275|ref|XP_015900411.1| (PREDICTED: uncharacterized protein LOC107433626 isoform X2 [Ziziphus jujuba])
HSP 1 Score: 2243.8 bits (5813), Expect = 0.000e+0
Identity = 1267/2234 (56.71%), Postives = 1606/2234 (71.89%), Query Frame = 1
Query: 130 MGRPEPCVLFAQTFVHPQLDEYVDEVMFAEPIVISACEFLEQNAPSASSAVTLLGATSPP 189
MGRPEPCVLFAQTFVHP LDEYVDEV+FAEPIVI+ACEFLEQNA SAS +VTL+GATSPP
Sbjct: 1 MGRPEPCVLFAQTFVHPHLDEYVDEVLFAEPIVITACEFLEQNASSASQSVTLVGATSPP 60
Query: 190 SFALEVFVQCEGEARFRRLCQPFLYSHSSSNVLEVEAVVTNHLVVRGSYRSLSLVVYGNT 249
SFALEVFVQCEGE RFRRLCQPFLYSHSSSNVLEVEAVVTNHLVVRGSYRSLSL++YGNT
Sbjct: 61 SFALEVFVQCEGETRFRRLCQPFLYSHSSSNVLEVEAVVTNHLVVRGSYRSLSLIIYGNT 120
Query: 250 AEDLGQFNVEFDLDNSVAKLVSSSEAKLEDLPPLLHSRNPKFEDAISSLKALSSMLPAAD 309
AEDLGQFN+EFD D+S+ LV+S+E KLEDLP LHS + E++IS LKALS A+D
Sbjct: 121 AEDLGQFNIEFD-DSSITNLVTSAEGKLEDLPLALHSTDLTIEESISPLKALSLPPLASD 180
Query: 310 ISVQMRQFLQLIFKILDIPNFGEFSQKVMALITSAVASFFTD--DLASAGITLQELNLGA 369
IS++ +QFL L+ KIL++P+ G+ K+++ + S +S+ + D S LQ
Sbjct: 181 ISIEAKQFLLLLLKILELPHLGDGLHKIVSSLVSVASSYVSHAWDRCSDRQELQS----- 240
Query: 370 LANVKGSFHVFTEAQRNLFELCKRYVGAGHIYGDFLVGCSFLESEAELATSKELVDMLFQ 429
V +EA++ L EL + + + LV FLESEA+LATSK+LVD+L Q
Sbjct: 241 ---------VISEARKELLELFQHK--SRDESAELLVDGRFLESEADLATSKQLVDLLSQ 300
Query: 430 HFPFNRDTVFVGHPHLSKSNNVLLWLSLALILCSGKESCFHYVNGGGMEQLAHFFSQEML 489
+F FNR++ F GH +SK+ NV++ LS+AL+LCSG+ESCFH+VNGGGMEQLAH F +M
Sbjct: 301 YFCFNRESSFFGHHQISKTTNVIMGLSVALLLCSGRESCFHFVNGGGMEQLAHVFCGDMQ 360
Query: 490 NSATVTLMLLGAMEQATRYSVGCEGLLGWWPREGEKVPSGVSEGYSLLLKLLMQKQRHDI 549
NS+ TL+LLG +EQATR+S GCEG LGWWPRE E +P G SE Y+ LL LL+QK RHD+
Sbjct: 361 NSSATTLLLLGVVEQATRHSYGCEGFLGWWPREDENIPPGSSEAYNRLLNLLLQKPRHDV 420
Query: 550 AALTTYILHRLRFYEVASRYEYAVLSILGGLSGDGNATTTTLSLLLKAKLQLRKLLKMIN 609
A+L TYILHRLRFYEV RYE AVLS+LGGLS G AT TL +L+ AK QL+KLLK++N
Sbjct: 421 ASLATYILHRLRFYEVVLRYECAVLSVLGGLSAVGKATNVTLGMLISAKSQLKKLLKLMN 480
Query: 610 SSGWVEDPSPMACAIRSLMVASSEGSLSFKATSKLIASSSCRFLHQKIDAKLLSLLKERG 669
G +EDPSP + A RSL + +EG LS+KAT+ LIASS C F + +D+ LL+LLKERG
Sbjct: 481 LRGPIEDPSPASLATRSLNLGQTEGLLSYKATNSLIASSHCCFSNWDVDSHLLALLKERG 540
Query: 670 FLPLSVALISSTELRAERGHALEAFLDLTSSIEATILSLMFCRSGLVFLLQHPDLSSTII 729
FLPLSVAL+SS+ LR+E GH + F+D+ SSIEA ILSL+FCRSGLVFLLQ P+LS+T++
Sbjct: 541 FLPLSVALLSSSSLRSEVGHVNDIFMDIASSIEAIILSLLFCRSGLVFLLQQPELSATVV 600
Query: 730 RAFKGDDNVTK--FIPLRYAYVLVSKGFFCHLQEVGTNLEMHLRVVHAVDRLLTSTPCSE 789
A +G D+V K +PLRYA VL++KGFFC +EVG + MHLRVV+A+DRLLTS P SE
Sbjct: 601 HALRGSDDVNKDACVPLRYASVLIAKGFFCGPREVGMIIGMHLRVVNAIDRLLTSAPHSE 660
Query: 790 ELLWVLWELCGLSRSDGGRQALLALTHFPEAFTILIEALHSAKELEPISLSCGT-PLNLA 849
E LWVLWELCGL+RSD GRQALLAL +FPEA ++LIEALHSAKE EP++ + G P+NLA
Sbjct: 661 EFLWVLWELCGLARSDCGRQALLALGYFPEAISLLIEALHSAKEPEPVAKNSGALPINLA 720
Query: 850 IFHAAAEICEIIVTDPTSSSVSVWIDHATDLHRALHSSSPGSNRKDAPSRLLEWIDAGVV 909
IFH+AAEI E+IVTD T+SS+ WI HAT+LHRALHSSSPGSNRKDAP+RLLEWIDAGVV
Sbjct: 721 IFHSAAEIFEVIVTDSTASSLGSWIGHATELHRALHSSSPGSNRKDAPARLLEWIDAGVV 780
Query: 910 YHKNGAVGLLRYAAVLAFGGDAHITSTNILVSDSMDVEDVVGDTSNESDVNVVESLLGKL 969
YH+NGA+GLLRYAAVLA GDAH+T+ +VSD D+E+++GDT+ +SDVNV+E+L GK
Sbjct: 781 YHRNGAIGLLRYAAVLASEGDAHLTT---IVSDLTDLENIIGDTTGDSDVNVMENL-GKF 840
Query: 970 VSDKSFDGVALRDSSVAQLITAIRILALISENADVAASLFDEGAMTVVYIILVNTRLMLE 1029
+S+K+FDGV LRD SVAQL A+RILA ISEN+ +AA+L+DEGA+TV+Y +LVN R MLE
Sbjct: 841 ISEKTFDGVTLRDCSVAQLTIALRILAFISENSAIAAALYDEGAITVIYTVLVNCRFMLE 900
Query: 1030 RSSNNYDYLVDDGIECSSTSDLLLERNREKYLVDLLIPSLLLLIELLQRLKEAKEQHRNT 1089
RSSN+YDYLVD+G EC+STSD+LLERNRE+ LVDLL+PSL+LLI LLQ+L+EA+EQHRNT
Sbjct: 901 RSSNSYDYLVDEGTECNSTSDILLERNREQSLVDLLVPSLVLLINLLQKLQEAEEQHRNT 960
Query: 1090 KLMNVLLRLHRELSPKLAACSVDLSSSYPDSALGLETVCHLIVSALACWPAHAWAPGLFH 1149
KLMN LLRLHRE+SPKLAAC+ DLSSSYPD+ALG +CHL+ SALACWP + W+PGLFH
Sbjct: 961 KLMNALLRLHREVSPKLAACAADLSSSYPDAALGFGAICHLVASALACWPVYGWSPGLFH 1020
Query: 1150 TLLDSVQAASLLALGPKETCSMLYLLIDLFPDEGVWHWKNGMPLLTASRSLAVGTLLGPL 1209
+LL SVQA S+LALGPKETCS+L LL DLFP+EG+W WK GMPLL+A R+L++GTLLGP
Sbjct: 1021 SLLASVQATSMLALGPKETCSLLCLLNDLFPEEGIWLWKKGMPLLSALRTLSIGTLLGPQ 1080
Query: 1210 EERKVNWYLERPCVEILLGQLSPHLDKIAQVIHHYAVGTLVVVQDILRILILRIARQRTE 1269
+E+++NWYL+ +E LLGQL+P L+KIA VI HYA+ LVV+QD+LR+ ++RIA Q+ E
Sbjct: 1081 KEKQINWYLQPLHLEKLLGQLTPQLEKIALVIQHYAISALVVIQDMLRVFVIRIAYQKAE 1140
Query: 1270 FATVLLQPMISWIENHLIGTSPLDEIDAYKGYKLLDFLSLLLEHPLAKTLLLKEGTLQIL 1329
+VLL+P++SWI H+ + L E+D +K +K LDFL+ LLEHP AKTLL+KEG +Q+L
Sbjct: 1141 SCSVLLRPILSWISGHVSDSCSLSEMDTFKVFKYLDFLASLLEHPHAKTLLVKEGIIQLL 1200
Query: 1330 TKVLGRIDAHVKSVSNGRSSLNSR------FNIYNCCLPVLRSLSLICETRSAQKLHLLL 1389
+ VL + + V + S+G+ + R F + CLPV +S SL+C++R++ + H +
Sbjct: 1201 STVLRQ--SLVAANSDGKHGGDDRNLSKFCFTTLSWCLPVFKSFSLLCDSRTSIQ-HTVR 1260
Query: 1390 DGNETLSTEDSAVIFSYIFKLCQHLPAGNELVACLGTFKDLASTAIGQSTLLDTYLFIRS 1449
+E L++E+ ++ +Y+ K CQ LP G EL+ACL FK+L S G+S+L Y S
Sbjct: 1261 HNSENLTSEECLLMLAYLLKFCQVLPPGKELLACLLAFKELGSCYEGRSSLAAFYRIHSS 1320
Query: 1450 SSNNENQPGRSQESYQFYNRLIDAEW-KGCPLLTCWINLYGSVET-EGLSTCSVEAIGLL 1509
+Q G + Y+ L D EW K PLL CW L SV++ + S+EAI +L
Sbjct: 1321 FEEAGSQRGHERNGNANYDILSDFEWRKRPPLLCCWKELLRSVDSRDSFLDYSIEAINVL 1380
Query: 1510 SIGVLRFCMSGRSFNEDRIDALKYFFGLSSDVDGANGVLEENIKYVQDMCTLLSSKITDD 1569
S+G L FC+ G+S D++ A+K+ FGL D + +GV EE I Y+ + ++L SKI D
Sbjct: 1381 SVGSLCFCVDGKSLKLDQVAAVKFLFGLPGDTNADDGVSEETIDYINKLTSILHSKIAAD 1440
Query: 1570 EYAVSSHLKATLHQAVESVKFLSLLLLKPIDSFKDDDAFTDEVLLFSSDAKLSSKIHLLT 1629
+Y V+ ++ +L+QA++S K L+LLL P S + DD F + S D + SKIHL++
Sbjct: 1441 DY-VNYDMQPSLYQALDSAKALALLLQTPAHSVRFDDVFPSGFPV-SFDDLIFSKIHLMS 1500
Query: 1630 DSSSERVENELNFGELGESFQWECPENLRNRLSQAGLLGKRKISAVDGV-NRHGKGENAP 1689
D S+ER E+ L G LG+ F W+CPE L +R Q + KRK+ +V+G NR +GENAP
Sbjct: 1501 DGSAERSEDYLCQGGLGDKFLWDCPETLPDRFPQTTV--KRKLPSVEGPPNRRSRGENAP 1560
Query: 1690 VDV-----RGSGSSMAPPIPTRRDTFRQRKPNTSRPPSMHVDDYVARERNESS--NPNVI 1749
++ RG GSS P PTRRDTFRQRKPNTSRPPSMHVDDYVARERN S N NVI
Sbjct: 1561 AEITGQNARGLGSSTTPSGPTRRDTFRQRKPNTSRPPSMHVDDYVARERNVDSATNSNVI 1620
Query: 1750 AVPRLGSSGGRPPSIHVDEFMARQRERHNSVPIAGVEASSQGKKITLEIDNDTEKSNRSK 1809
AV R+G++GGRPPSIHVDEFMARQRER N +A+ Q K D EKS++SK
Sbjct: 1621 AVQRIGATGGRPPSIHVDEFMARQRERQNPASTVIGDAAPQSKIAASVNDMAIEKSSKSK 1680
Query: 1810 QLKVDLDDDLQ--EIMFDAEENEADDKLPFPQADDDVQQTASASAETSPQHSIVEETESE 1869
+LK D DDDL +I+FD EE+E+DDKLPFPQ DD++QQ A E S HSIVEETES+
Sbjct: 1681 ELKNDPDDDLHGIDIVFDGEESESDDKLPFPQPDDNLQQPAPLIIEQSSPHSIVEETESD 1740
Query: 1870 AKD------LEATVASNAVENAQSELSSRMSVSRPEMQLRREPSVSSENKYFEQPDETNN 1929
+ + +ASN EN SE SSRMS+SRPEM L REPSVSS+ K+ EQ D+ N
Sbjct: 1741 IHEGGQFSHIGTPLASNIDENTHSEFSSRMSISRPEMPLTREPSVSSDKKFSEQADDMKN 1800
Query: 1930 KNPTMVSGGFDSVAVAKTSGFPASVYGKHSPSGAQVPADARIPPSSFIPDNNLQRVGNVL 1989
N S GFDS + GFP SVY S S AQ+ D R+ P +F P N GNV
Sbjct: 1801 VNTVKTSNGFDSTVAINSPGFPGSVYNNVSASSAQLLVDNRMTPQNFYPKNGPLPAGNVP 1860
Query: 1990 LGR-SQGRHEQKHLPNQPPLPPTPPPSTISPVRSQVSENQTSRS-PFVNPHNDVQ-PPLY 2049
+ SQG ++Q+ +PNQPPLPP PPP T++PV +Q S+ S+S PFVN DV P +
Sbjct: 1861 MATGSQGLYDQRFMPNQPPLPPMPPPPTVAPVVAQTSDPVPSQSSPFVNSMTDVPLPTAF 1920
Query: 2050 PTYAQTDYQSSFGNVPTSFASSNFMSDSRYARTTNSSPGSSNRPLPPLPPTPPSFSATPL 2109
+A+ Y S+ N TS +S M DS+Y+RT+ SSPG S RP PPLPPTPP +S+ P
Sbjct: 1921 QVHAE--YLSASNNSSTSLGASLHMPDSKYSRTSLSSPGGSARPPPPLPPTPPPYSSNPY 1980
Query: 2110 NV-PSRSPTSQSSMYVQHNAGATDASQ----------------------YSSPPLIPPG- 2169
N+ P+R+ TSQSS+Y Q + G T+ Q Y PPL P
Sbjct: 1981 NLTPNRTSTSQSSVYNQTSVGTTELPQSTTPLNDSRLGSHSTSGARINTYPPPPLGLPHL 2040
Query: 2170 -FSRLPSMPFNMHGNIPVQQS-DNITGIAQNSRGPQPSMQSMQPLPHLQPLQPPQLPCP- 2229
FSR S+P ++GN+P QQ +N+ + Q+ PQ SMQ++ + LQPLQPPQLP P
Sbjct: 2041 VFSRPGSIPSGIYGNMPTQQQGENLPSVLQSLSIPQSSMQTIHSVAQLQPLQPPQLPRPP 2100
Query: 2230 ---QQLRLPIQNSLHTEQSGSMSQSHVQMQVQPPQILQQSQTSPYHLYYQTQSQE----- 2289
Q LR P+Q S EQ VQMQ+ Q+LQQ S H+++Q+Q QE
Sbjct: 2101 QPPQHLRPPVQASQQLEQGT------VQMQMHSLQMLQQPHVSSMHMFHQSQQQEFSHVQ 2160
Query: 2290 ----NIPHPQQPQAAHHQVDAATQQQMDSSMSLQQYFSSPEAIQSLLSDRDKLCQLLEQH 2293
++ H Q QA VD A+QQQ DS MSL +YF SPEAIQSLLSDRDKLCQLLEQH
Sbjct: 2161 KQQQHVEHAPQ-QAMSTTVDNASQQQ-DSGMSLHEYFKSPEAIQSLLSDRDKLCQLLEQH 2196
BLAST of Spo15904.1 vs. NCBI nr
Match:
gi|1009164273|ref|XP_015900410.1| (PREDICTED: uncharacterized protein LOC107433626 isoform X1 [Ziziphus jujuba])
HSP 1 Score: 2240.7 bits (5805), Expect = 0.000e+0
Identity = 1268/2237 (56.68%), Postives = 1607/2237 (71.84%), Query Frame = 1
Query: 130 MGRPEPCVLFAQTFVHPQLDEYVDEVMFAEPIVISACEFLEQNAPSASSAVTLLGATSPP 189
MGRPEPCVLFAQTFVHP LDEYVDEV+FAEPIVI+ACEFLEQNA SAS +VTL+GATSPP
Sbjct: 1 MGRPEPCVLFAQTFVHPHLDEYVDEVLFAEPIVITACEFLEQNASSASQSVTLVGATSPP 60
Query: 190 SFALEVFVQCEGEARFRRLCQPFLYSHSSSNVLEVEAVVTNHLVVRGSYRSLSLVVYGNT 249
SFALEVFVQCEGE RFRRLCQPFLYSHSSSNVLEVEAVVTNHLVVRGSYRSLSL++YGNT
Sbjct: 61 SFALEVFVQCEGETRFRRLCQPFLYSHSSSNVLEVEAVVTNHLVVRGSYRSLSLIIYGNT 120
Query: 250 AEDLGQFNVEFDLDNSVAKLVSSSEAKLEDLPPLLHSRNPKFEDAISSLKALSSMLPAAD 309
AEDLGQFN+EFD D+S+ LV+S+E KLEDLP LHS + E++IS LKALS A+D
Sbjct: 121 AEDLGQFNIEFD-DSSITNLVTSAEGKLEDLPLALHSTDLTIEESISPLKALSLPPLASD 180
Query: 310 ISVQMRQFLQLIFKILDIPNFGEFSQKVMALITSAVASFFTD--DLASAGITLQELNLGA 369
IS++ +QFL L+ KIL++P+ G+ K+++ + S +S+ + D S LQ
Sbjct: 181 ISIEAKQFLLLLLKILELPHLGDGLHKIVSSLVSVASSYVSHAWDRCSDRQELQS----- 240
Query: 370 LANVKGSFHVFTEAQRNLFELCKRYVGAGHIYGDFLVGCSFLESEAELATSKELVDMLFQ 429
V +EA++ L EL + + + LV FLESEA+LATSK+LVD+L Q
Sbjct: 241 ---------VISEARKELLELFQHK--SRDESAELLVDGRFLESEADLATSKQLVDLLSQ 300
Query: 430 HFPFNRDTVFVGHPHLSKSNNVLLWLSLALILCSGKESCFHYVNGGGMEQLAHFFSQEML 489
+F FNR++ F GH +SK+ NV++ LS+AL+LCSG+ESCFH+VNGGGMEQLAH F +M
Sbjct: 301 YFCFNRESSFFGHHQISKTTNVIMGLSVALLLCSGRESCFHFVNGGGMEQLAHVFCGDMQ 360
Query: 490 NSATVTLMLLGAMEQATRYSVGCEGLLGWWPREGEKVPSGVSEGYSLLLKLLMQKQRHDI 549
NS+ TL+LLG +EQATR+S GCEG LGWWPRE E +P G SE Y+ LL LL+QK RHD+
Sbjct: 361 NSSATTLLLLGVVEQATRHSYGCEGFLGWWPREDENIPPGSSEAYNRLLNLLLQKPRHDV 420
Query: 550 AALTTYILHRLRFYEVASRYEYAVLSILGGLSGDGNATTTTLSLLLKAKLQLRKLLKMIN 609
A+L TYILHRLRFYEV RYE AVLS+LGGLS G AT TL +L+ AK QL+KLLK++N
Sbjct: 421 ASLATYILHRLRFYEVVLRYECAVLSVLGGLSAVGKATNVTLGMLISAKSQLKKLLKLMN 480
Query: 610 SSGWVEDPSPMACAIRSLMVASSEGSLSFKATSKLIASSSCRFLHQKIDAKLLSLLKERG 669
G +EDPSP + A RSL + +EG LS+KAT+ LIASS C F + +D+ LL+LLKERG
Sbjct: 481 LRGPIEDPSPASLATRSLNLGQTEGLLSYKATNSLIASSHCCFSNWDVDSHLLALLKERG 540
Query: 670 FLPLSVALISSTELRAERGHALEAFLDLTSSIEATILSLMFCRSGLVFLLQHPDLSSTII 729
FLPLSVAL+SS+ LR+E GH + F+D+ SSIEA ILSL+FCRSGLVFLLQ P+LS+T++
Sbjct: 541 FLPLSVALLSSSSLRSEVGHVNDIFMDIASSIEAIILSLLFCRSGLVFLLQQPELSATVV 600
Query: 730 RAFKGDDNVTK--FIPLRYAYVLVSKGFFCHLQEVGTNLEMHLRVVHAVDRLLTSTPCSE 789
A +G D+V K +PLRYA VL++KGFFC +EVG + MHLRVV+A+DRLLTS P SE
Sbjct: 601 HALRGSDDVNKDACVPLRYASVLIAKGFFCGPREVGMIIGMHLRVVNAIDRLLTSAPHSE 660
Query: 790 ELLWVLWELCGLSRSDGGRQALLALTHFPEAFTILIEALHSAKELEPISLSCGT-PLNLA 849
E LWVLWELCGL+RSD GRQALLAL +FPEA ++LIEALHSAKE EP++ + G P+NLA
Sbjct: 661 EFLWVLWELCGLARSDCGRQALLALGYFPEAISLLIEALHSAKEPEPVAKNSGALPINLA 720
Query: 850 IFHAAAEICEIIVTDPTSSSVSVWIDHATDLHRALHSSSPGSNRKDAPSRLLEWIDAGVV 909
IFH+AAEI E+IVTD T+SS+ WI HAT+LHRALHSSSPGSNRKDAP+RLLEWIDAGVV
Sbjct: 721 IFHSAAEIFEVIVTDSTASSLGSWIGHATELHRALHSSSPGSNRKDAPARLLEWIDAGVV 780
Query: 910 YHKNGAVGLLRYAAVLAFGGDAHITSTNILVSDSMDVEDVVGDTSNESDVNVVESLLGKL 969
YH+NGA+GLLRYAAVLA GDAH+T+ +VSD D+E+++GDT+ +SDVNV+E+L GK
Sbjct: 781 YHRNGAIGLLRYAAVLASEGDAHLTT---IVSDLTDLENIIGDTTGDSDVNVMENL-GKF 840
Query: 970 VSDKSFDGVALRDSSVAQLITAIRILALISENADVAASLFDEGAMTVVYIILVNTRLMLE 1029
+S+K+FDGV LRD SVAQL A+RILA ISEN+ +AA+L+DEGA+TV+Y +LVN R MLE
Sbjct: 841 ISEKTFDGVTLRDCSVAQLTIALRILAFISENSAIAAALYDEGAITVIYTVLVNCRFMLE 900
Query: 1030 RSSNNYDYLVDDGIECSSTSDLLLERNREKYLVDLLIPSLLLLIELLQRLKEAKEQHRNT 1089
RSSN+YDYLVD+G EC+STSD+LLERNRE+ LVDLL+PSL+LLI LLQ+L+EA+EQHRNT
Sbjct: 901 RSSNSYDYLVDEGTECNSTSDILLERNREQSLVDLLVPSLVLLINLLQKLQEAEEQHRNT 960
Query: 1090 KLMNVLLRLHRELSPKLAACSVDLSSSYPDSALGLETVCHLIVSALACWPAHAWAPGLFH 1149
KLMN LLRLHRE+SPKLAAC+ DLSSSYPD+ALG +CHL+ SALACWP + W+PGLFH
Sbjct: 961 KLMNALLRLHREVSPKLAACAADLSSSYPDAALGFGAICHLVASALACWPVYGWSPGLFH 1020
Query: 1150 TLLDSVQAASLLALGPKETCSMLYLLIDLFPDEGVWHWKNGMPLLTASRSLAVGTLLGPL 1209
+LL SVQA S+LALGPKETCS+L LL DLFP+EG+W WK GMPLL+A R+L++GTLLGP
Sbjct: 1021 SLLASVQATSMLALGPKETCSLLCLLNDLFPEEGIWLWKKGMPLLSALRTLSIGTLLGPQ 1080
Query: 1210 EERKVNWYLERPCVEILLGQLSPHLDKIAQVIHHYAVGTLVVVQDILRILILRIARQRTE 1269
+E+++NWYL+ +E LLGQL+P L+KIA VI HYA+ LVV+QD+LR+ ++RIA Q+ E
Sbjct: 1081 KEKQINWYLQPLHLEKLLGQLTPQLEKIALVIQHYAISALVVIQDMLRVFVIRIAYQKAE 1140
Query: 1270 FATVLLQPMISWIENHLIGTSPLDEIDAYKGYKLLDFLSLLLEHPLAKTLLLKEGTLQIL 1329
+VLL+P++SWI H+ + L E+D +K +K LDFL+ LLEHP AKTLL+KEG +Q+L
Sbjct: 1141 SCSVLLRPILSWISGHVSDSCSLSEMDTFKVFKYLDFLASLLEHPHAKTLLVKEGIIQLL 1200
Query: 1330 TKVLGRIDAHVKSVSNGRSSLNSR------FNIYNCCLPVLRSLSLICETRSAQKLHLL- 1389
+ VL + + V + S+G+ + R F + CLPV +S SL+C++R++ + H +
Sbjct: 1201 STVLRQ--SLVAANSDGKHGGDDRNLSKFCFTTLSWCLPVFKSFSLLCDSRTSIQ-HTVR 1260
Query: 1390 --LDGNETLSTEDSAVIFSYIFKLCQHLPAGNELVACLGTFKDLASTAIGQSTLLDTYLF 1449
L +E L++E+ ++ +Y+ K CQ LP G EL+ACL FK+L S G+S+L Y
Sbjct: 1261 HDLHNSENLTSEECLLMLAYLLKFCQVLPPGKELLACLLAFKELGSCYEGRSSLAAFYRI 1320
Query: 1450 IRSSSNNENQPGRSQESYQFYNRLIDAEW-KGCPLLTCWINLYGSVET-EGLSTCSVEAI 1509
S +Q G + Y+ L D EW K PLL CW L SV++ + S+EAI
Sbjct: 1321 HSSFEEAGSQRGHERNGNANYDILSDFEWRKRPPLLCCWKELLRSVDSRDSFLDYSIEAI 1380
Query: 1510 GLLSIGVLRFCMSGRSFNEDRIDALKYFFGLSSDVDGANGVLEENIKYVQDMCTLLSSKI 1569
+LS+G L FC+ G+S D++ A+K+ FGL D + +GV EE I Y+ + ++L SKI
Sbjct: 1381 NVLSVGSLCFCVDGKSLKLDQVAAVKFLFGLPGDTNADDGVSEETIDYINKLTSILHSKI 1440
Query: 1570 TDDEYAVSSHLKATLHQAVESVKFLSLLLLKPIDSFKDDDAFTDEVLLFSSDAKLSSKIH 1629
D+Y V+ ++ +L+QA++S K L+LLL P S + DD F + S D + SKIH
Sbjct: 1441 AADDY-VNYDMQPSLYQALDSAKALALLLQTPAHSVRFDDVFPSGFPV-SFDDLIFSKIH 1500
Query: 1630 LLTDSSSERVENELNFGELGESFQWECPENLRNRLSQAGLLGKRKISAVDGV-NRHGKGE 1689
L++D S+ER E+ L G LG+ F W+CPE L +R Q + KRK+ +V+G NR +GE
Sbjct: 1501 LMSDGSAERSEDYLCQGGLGDKFLWDCPETLPDRFPQTTV--KRKLPSVEGPPNRRSRGE 1560
Query: 1690 NAPVDV-----RGSGSSMAPPIPTRRDTFRQRKPNTSRPPSMHVDDYVARERNESS--NP 1749
NAP ++ RG GSS P PTRRDTFRQRKPNTSRPPSMHVDDYVARERN S N
Sbjct: 1561 NAPAEITGQNARGLGSSTTPSGPTRRDTFRQRKPNTSRPPSMHVDDYVARERNVDSATNS 1620
Query: 1750 NVIAVPRLGSSGGRPPSIHVDEFMARQRERHNSVPIAGVEASSQGKKITLEIDNDTEKSN 1809
NVIAV R+G++GGRPPSIHVDEFMARQRER N +A+ Q K D EKS+
Sbjct: 1621 NVIAVQRIGATGGRPPSIHVDEFMARQRERQNPASTVIGDAAPQSKIAASVNDMAIEKSS 1680
Query: 1810 RSKQLKVDLDDDLQ--EIMFDAEENEADDKLPFPQADDDVQQTASASAETSPQHSIVEET 1869
+SK+LK D DDDL +I+FD EE+E+DDKLPFPQ DD++QQ A E S HSIVEET
Sbjct: 1681 KSKELKNDPDDDLHGIDIVFDGEESESDDKLPFPQPDDNLQQPAPLIIEQSSPHSIVEET 1740
Query: 1870 ESEAKD------LEATVASNAVENAQSELSSRMSVSRPEMQLRREPSVSSENKYFEQPDE 1929
ES+ + + +ASN EN SE SSRMS+SRPEM L REPSVSS+ K+ EQ D+
Sbjct: 1741 ESDIHEGGQFSHIGTPLASNIDENTHSEFSSRMSISRPEMPLTREPSVSSDKKFSEQADD 1800
Query: 1930 TNNKNPTMVSGGFDSVAVAKTSGFPASVYGKHSPSGAQVPADARIPPSSFIPDNNLQRVG 1989
N N S GFDS + GFP SVY S S AQ+ D R+ P +F P N G
Sbjct: 1801 MKNVNTVKTSNGFDSTVAINSPGFPGSVYNNVSASSAQLLVDNRMTPQNFYPKNGPLPAG 1860
Query: 1990 NVLLGR-SQGRHEQKHLPNQPPLPPTPPPSTISPVRSQVSENQTSRS-PFVNPHNDVQ-P 2049
NV + SQG ++Q+ +PNQPPLPP PPP T++PV +Q S+ S+S PFVN DV P
Sbjct: 1861 NVPMATGSQGLYDQRFMPNQPPLPPMPPPPTVAPVVAQTSDPVPSQSSPFVNSMTDVPLP 1920
Query: 2050 PLYPTYAQTDYQSSFGNVPTSFASSNFMSDSRYARTTNSSPGSSNRPLPPLPPTPPSFSA 2109
+ +A+ Y S+ N TS +S M DS+Y+RT+ SSPG S RP PPLPPTPP +S+
Sbjct: 1921 TAFQVHAE--YLSASNNSSTSLGASLHMPDSKYSRTSLSSPGGSARPPPPLPPTPPPYSS 1980
Query: 2110 TPLNV-PSRSPTSQSSMYVQHNAGATDASQ----------------------YSSPPLIP 2169
P N+ P+R+ TSQSS+Y Q + G T+ Q Y PPL
Sbjct: 1981 NPYNLTPNRTSTSQSSVYNQTSVGTTELPQSTTPLNDSRLGSHSTSGARINTYPPPPLGL 2040
Query: 2170 PG--FSRLPSMPFNMHGNIPVQQS-DNITGIAQNSRGPQPSMQSMQPLPHLQPLQPPQLP 2229
P FSR S+P ++GN+P QQ +N+ + Q+ PQ SMQ++ + LQPLQPPQLP
Sbjct: 2041 PHLVFSRPGSIPSGIYGNMPTQQQGENLPSVLQSLSIPQSSMQTIHSVAQLQPLQPPQLP 2100
Query: 2230 CP----QQLRLPIQNSLHTEQSGSMSQSHVQMQVQPPQILQQSQTSPYHLYYQTQSQE-- 2289
P Q LR P+Q S EQ VQMQ+ Q+LQQ S H+++Q+Q QE
Sbjct: 2101 RPPQPPQHLRPPVQASQQLEQGT------VQMQMHSLQMLQQPHVSSMHMFHQSQQQEFS 2160
Query: 2290 -------NIPHPQQPQAAHHQVDAATQQQMDSSMSLQQYFSSPEAIQSLLSDRDKLCQLL 2293
++ H Q QA VD A+QQQ DS MSL +YF SPEAIQSLLSDRDKLCQLL
Sbjct: 2161 HVQKQQQHVEHAPQ-QAMSTTVDNASQQQ-DSGMSLHEYFKSPEAIQSLLSDRDKLCQLL 2199
BLAST of Spo15904.1 vs. UniProtKB/TrEMBL
Match:
A0A0K9R074_SPIOL (Uncharacterized protein (Fragment) OS=Spinacia oleracea GN=SOVF_121890 PE=4 SV=1)
HSP 1 Score: 4260.7 bits (11049), Expect = 0.000e+0
Identity = 2202/2203 (99.95%), Postives = 2203/2203 (100.00%), Query Frame = 1
Query: 90 NFQERFPPLLLTAFAANTPARWLCGDACYHTARRLSEYLNMGRPEPCVLFAQTFVHPQLD 149
+FQERFPPLLLTAFAANTPARWLCGDACYHTARRLSEYLNMGRPEPCVLFAQTFVHPQLD
Sbjct: 40 SFQERFPPLLLTAFAANTPARWLCGDACYHTARRLSEYLNMGRPEPCVLFAQTFVHPQLD 99
Query: 150 EYVDEVMFAEPIVISACEFLEQNAPSASSAVTLLGATSPPSFALEVFVQCEGEARFRRLC 209
EYVDEVMFAEPIVISACEFLEQNAPSASSAVTLLGATSPPSFALEVFVQCEGEARFRRLC
Sbjct: 100 EYVDEVMFAEPIVISACEFLEQNAPSASSAVTLLGATSPPSFALEVFVQCEGEARFRRLC 159
Query: 210 QPFLYSHSSSNVLEVEAVVTNHLVVRGSYRSLSLVVYGNTAEDLGQFNVEFDLDNSVAKL 269
QPFLYSHSSSNVLEVEAVVTNHLVVRGSYRSLSLVVYGNTAEDLGQFNVEFDLDNSVAKL
Sbjct: 160 QPFLYSHSSSNVLEVEAVVTNHLVVRGSYRSLSLVVYGNTAEDLGQFNVEFDLDNSVAKL 219
Query: 270 VSSSEAKLEDLPPLLHSRNPKFEDAISSLKALSSMLPAADISVQMRQFLQLIFKILDIPN 329
VSSSEAKLEDLPPLLHSRNPKFEDAISSLKALSSMLPAADISVQMRQFLQLIFKILDIPN
Sbjct: 220 VSSSEAKLEDLPPLLHSRNPKFEDAISSLKALSSMLPAADISVQMRQFLQLIFKILDIPN 279
Query: 330 FGEFSQKVMALITSAVASFFTDDLASAGITLQELNLGALANVKGSFHVFTEAQRNLFELC 389
FGEFSQKVMALITSAVASFFTDDLASAGITLQELNLGALANVKGSFHVFTEAQRNLFELC
Sbjct: 280 FGEFSQKVMALITSAVASFFTDDLASAGITLQELNLGALANVKGSFHVFTEAQRNLFELC 339
Query: 390 KRYVGAGHIYGDFLVGCSFLESEAELATSKELVDMLFQHFPFNRDTVFVGHPHLSKSNNV 449
KRYVGAGHIYGDFLVGCSFLESEAELATSKELVDMLFQHFPFNRDTVFVGHPHLSKSNNV
Sbjct: 340 KRYVGAGHIYGDFLVGCSFLESEAELATSKELVDMLFQHFPFNRDTVFVGHPHLSKSNNV 399
Query: 450 LLWLSLALILCSGKESCFHYVNGGGMEQLAHFFSQEMLNSATVTLMLLGAMEQATRYSVG 509
LLWLSLALILCSGKESCFHYVNGGGMEQLAHFFSQEMLNSATVTLMLLGAMEQATRYSVG
Sbjct: 400 LLWLSLALILCSGKESCFHYVNGGGMEQLAHFFSQEMLNSATVTLMLLGAMEQATRYSVG 459
Query: 510 CEGLLGWWPREGEKVPSGVSEGYSLLLKLLMQKQRHDIAALTTYILHRLRFYEVASRYEY 569
CEGLLGWWPREGEKVPSGVSEGYSLLLKLLMQKQRHDIAALTTYILHRLRFYEVASRYEY
Sbjct: 460 CEGLLGWWPREGEKVPSGVSEGYSLLLKLLMQKQRHDIAALTTYILHRLRFYEVASRYEY 519
Query: 570 AVLSILGGLSGDGNATTTTLSLLLKAKLQLRKLLKMINSSGWVEDPSPMACAIRSLMVAS 629
AVLSILGGLSGDGNATTTTLSLLLKAKLQLRKLLKMINSSGWVEDPSPMACAIRSLMVAS
Sbjct: 520 AVLSILGGLSGDGNATTTTLSLLLKAKLQLRKLLKMINSSGWVEDPSPMACAIRSLMVAS 579
Query: 630 SEGSLSFKATSKLIASSSCRFLHQKIDAKLLSLLKERGFLPLSVALISSTELRAERGHAL 689
SEGSLSFKATSKLIASSSCRFLHQKIDAKLLSLLKERGFLPLSVALISSTELRAERGHAL
Sbjct: 580 SEGSLSFKATSKLIASSSCRFLHQKIDAKLLSLLKERGFLPLSVALISSTELRAERGHAL 639
Query: 690 EAFLDLTSSIEATILSLMFCRSGLVFLLQHPDLSSTIIRAFKGDDNVTKFIPLRYAYVLV 749
EAFLDLTSSIEATILSLMFCRSGLVFLLQHPDLSSTIIRAFKGDDNVTKFIPLRYAYVLV
Sbjct: 640 EAFLDLTSSIEATILSLMFCRSGLVFLLQHPDLSSTIIRAFKGDDNVTKFIPLRYAYVLV 699
Query: 750 SKGFFCHLQEVGTNLEMHLRVVHAVDRLLTSTPCSEELLWVLWELCGLSRSDGGRQALLA 809
SKGFFCHLQEVGTNLEMHLRVVHAVDRLLTSTPCSEELLWVLWELCGLSRSDGGRQALLA
Sbjct: 700 SKGFFCHLQEVGTNLEMHLRVVHAVDRLLTSTPCSEELLWVLWELCGLSRSDGGRQALLA 759
Query: 810 LTHFPEAFTILIEALHSAKELEPISLSCGTPLNLAIFHAAAEICEIIVTDPTSSSVSVWI 869
LTHFPEAFTILIEALHSAKELEPISLSCGTPLNLAIFHAAAEICEIIVTDPTSSSVSVWI
Sbjct: 760 LTHFPEAFTILIEALHSAKELEPISLSCGTPLNLAIFHAAAEICEIIVTDPTSSSVSVWI 819
Query: 870 DHATDLHRALHSSSPGSNRKDAPSRLLEWIDAGVVYHKNGAVGLLRYAAVLAFGGDAHIT 929
DHATDLHRALHSSSPGSNRKDAPSRLLEWIDAGVVYHKNGAVGLLRYAAVLAFGGDAHIT
Sbjct: 820 DHATDLHRALHSSSPGSNRKDAPSRLLEWIDAGVVYHKNGAVGLLRYAAVLAFGGDAHIT 879
Query: 930 STNILVSDSMDVEDVVGDTSNESDVNVVESLLGKLVSDKSFDGVALRDSSVAQLITAIRI 989
STNILVSDSMDVEDVVGDTSNESDVNVVESLLGKLVSDKSFDGVALRDSSVAQLITAIRI
Sbjct: 880 STNILVSDSMDVEDVVGDTSNESDVNVVESLLGKLVSDKSFDGVALRDSSVAQLITAIRI 939
Query: 990 LALISENADVAASLFDEGAMTVVYIILVNTRLMLERSSNNYDYLVDDGIECSSTSDLLLE 1049
LALISENADVAASLFDEGAMTVVYIILVNTRLMLERSSNNYDYLVDDGIECSSTSDLLLE
Sbjct: 940 LALISENADVAASLFDEGAMTVVYIILVNTRLMLERSSNNYDYLVDDGIECSSTSDLLLE 999
Query: 1050 RNREKYLVDLLIPSLLLLIELLQRLKEAKEQHRNTKLMNVLLRLHRELSPKLAACSVDLS 1109
RNREKYLVDLLIPSLLLLIELLQRLKEAKEQHRNTKLMNVLLRLHRELSPKLAACSVDLS
Sbjct: 1000 RNREKYLVDLLIPSLLLLIELLQRLKEAKEQHRNTKLMNVLLRLHRELSPKLAACSVDLS 1059
Query: 1110 SSYPDSALGLETVCHLIVSALACWPAHAWAPGLFHTLLDSVQAASLLALGPKETCSMLYL 1169
SSYPDSALGLETVCHLIVSALACWPAHAWAPGLFHTLLDSVQAASLLALGPKETCSMLYL
Sbjct: 1060 SSYPDSALGLETVCHLIVSALACWPAHAWAPGLFHTLLDSVQAASLLALGPKETCSMLYL 1119
Query: 1170 LIDLFPDEGVWHWKNGMPLLTASRSLAVGTLLGPLEERKVNWYLERPCVEILLGQLSPHL 1229
LIDLFPDEGVWHWKNGMPLLTASRSLAVGTLLGPLEERKVNWYLERPCVEILLGQLSPHL
Sbjct: 1120 LIDLFPDEGVWHWKNGMPLLTASRSLAVGTLLGPLEERKVNWYLERPCVEILLGQLSPHL 1179
Query: 1230 DKIAQVIHHYAVGTLVVVQDILRILILRIARQRTEFATVLLQPMISWIENHLIGTSPLDE 1289
DKIAQVIHHYAVGTLVVVQDILRILILRIARQRTEFATVLLQPMISWIENHLIGTSPLDE
Sbjct: 1180 DKIAQVIHHYAVGTLVVVQDILRILILRIARQRTEFATVLLQPMISWIENHLIGTSPLDE 1239
Query: 1290 IDAYKGYKLLDFLSLLLEHPLAKTLLLKEGTLQILTKVLGRIDAHVKSVSNGRSSLNSRF 1349
IDAYKGYKLLDFLSLLLEHPLAKTLLLKEGTLQILTKVLGRIDAHVKSVSNGRSSLNSRF
Sbjct: 1240 IDAYKGYKLLDFLSLLLEHPLAKTLLLKEGTLQILTKVLGRIDAHVKSVSNGRSSLNSRF 1299
Query: 1350 NIYNCCLPVLRSLSLICETRSAQKLHLLLDGNETLSTEDSAVIFSYIFKLCQHLPAGNEL 1409
NIYNCCLPVLRSLSLICETRSAQKLHLLLDGNETLSTEDSAVIFSYIFKLCQHLPAGNEL
Sbjct: 1300 NIYNCCLPVLRSLSLICETRSAQKLHLLLDGNETLSTEDSAVIFSYIFKLCQHLPAGNEL 1359
Query: 1410 VACLGTFKDLASTAIGQSTLLDTYLFIRSSSNNENQPGRSQESYQFYNRLIDAEWKGCPL 1469
VACLGTFKDLASTAIGQSTLLDTYLFIRSSSNNENQPGRSQESYQFYNRLIDAEWKGCPL
Sbjct: 1360 VACLGTFKDLASTAIGQSTLLDTYLFIRSSSNNENQPGRSQESYQFYNRLIDAEWKGCPL 1419
Query: 1470 LTCWINLYGSVETEGLSTCSVEAIGLLSIGVLRFCMSGRSFNEDRIDALKYFFGLSSDVD 1529
LTCWINLYGSVETEGLSTCSVEAIGLLSIGVLRFCMSGRSFNEDRIDALKYFFGLSSDVD
Sbjct: 1420 LTCWINLYGSVETEGLSTCSVEAIGLLSIGVLRFCMSGRSFNEDRIDALKYFFGLSSDVD 1479
Query: 1530 GANGVLEENIKYVQDMCTLLSSKITDDEYAVSSHLKATLHQAVESVKFLSLLLLKPIDSF 1589
GANGVLEENIKYVQDMCTLLSSKITDDEYAVSSHLKATLHQAVESVKFLSLLLLKPIDSF
Sbjct: 1480 GANGVLEENIKYVQDMCTLLSSKITDDEYAVSSHLKATLHQAVESVKFLSLLLLKPIDSF 1539
Query: 1590 KDDDAFTDEVLLFSSDAKLSSKIHLLTDSSSERVENELNFGELGESFQWECPENLRNRLS 1649
KDDDAFTDEVLLFSSDAKLSSKIHLLTDSSSERVENELNFGELGESFQWECPENLRNRLS
Sbjct: 1540 KDDDAFTDEVLLFSSDAKLSSKIHLLTDSSSERVENELNFGELGESFQWECPENLRNRLS 1599
Query: 1650 QAGLLGKRKISAVDGVNRHGKGENAPVDVRGSGSSMAPPIPTRRDTFRQRKPNTSRPPSM 1709
QAGLLGKRKISAVDGVNRHGKGENAPVDVRGSGSSMAPPIPTRRDTFRQRKPNTSRPPSM
Sbjct: 1600 QAGLLGKRKISAVDGVNRHGKGENAPVDVRGSGSSMAPPIPTRRDTFRQRKPNTSRPPSM 1659
Query: 1710 HVDDYVARERNESSNPNVIAVPRLGSSGGRPPSIHVDEFMARQRERHNSVPIAGVEASSQ 1769
HVDDYVARERNESSNPNVIAVPRLGSSGGRPPSIHVDEFMARQRERHNSVPIAGVEASSQ
Sbjct: 1660 HVDDYVARERNESSNPNVIAVPRLGSSGGRPPSIHVDEFMARQRERHNSVPIAGVEASSQ 1719
Query: 1770 GKKITLEIDNDTEKSNRSKQLKVDLDDDLQEIMFDAEENEADDKLPFPQADDDVQQTASA 1829
GKKITLEIDNDTEKSNRSKQLKVDLDDDLQEIMFDAEENEADDKLPFPQADDDVQQTASA
Sbjct: 1720 GKKITLEIDNDTEKSNRSKQLKVDLDDDLQEIMFDAEENEADDKLPFPQADDDVQQTASA 1779
Query: 1830 SAETSPQHSIVEETESEAKDLEATVASNAVENAQSELSSRMSVSRPEMQLRREPSVSSEN 1889
SAETSPQHSIVEETESEAKDLEATVASNAVENAQSELSSRMSVSRPEMQLRREPSVSSEN
Sbjct: 1780 SAETSPQHSIVEETESEAKDLEATVASNAVENAQSELSSRMSVSRPEMQLRREPSVSSEN 1839
Query: 1890 KYFEQPDETNNKNPTMVSGGFDSVAVAKTSGFPASVYGKHSPSGAQVPADARIPPSSFIP 1949
KYFEQPDETNNKNPTMVSGGFDSVAVAKTSGFPASVYGKHSPSGAQVPADARIPPSSFIP
Sbjct: 1840 KYFEQPDETNNKNPTMVSGGFDSVAVAKTSGFPASVYGKHSPSGAQVPADARIPPSSFIP 1899
Query: 1950 DNNLQRVGNVLLGRSQGRHEQKHLPNQPPLPPTPPPSTISPVRSQVSENQTSRSPFVNPH 2009
DNNLQRVGNVLLGRSQGRHEQKHLPNQPPLPPTPPPSTISPVRSQVSENQTSRSPFVNPH
Sbjct: 1900 DNNLQRVGNVLLGRSQGRHEQKHLPNQPPLPPTPPPSTISPVRSQVSENQTSRSPFVNPH 1959
Query: 2010 NDVQPPLYPTYAQTDYQSSFGNVPTSFASSNFMSDSRYARTTNSSPGSSNRPLPPLPPTP 2069
NDVQPPLYPTYAQTDYQSSFGNVPTSFASSNFMSDSRYARTTNSSPGSSNRPLPPLPPTP
Sbjct: 1960 NDVQPPLYPTYAQTDYQSSFGNVPTSFASSNFMSDSRYARTTNSSPGSSNRPLPPLPPTP 2019
Query: 2070 PSFSATPLNVPSRSPTSQSSMYVQHNAGATDASQYSSPPLIPPGFSRLPSMPFNMHGNIP 2129
PSFSATPLNVPSRSPTSQSSMYVQHNAGATDASQYSSPPLIPPGFSRLPSMPFNMHGNIP
Sbjct: 2020 PSFSATPLNVPSRSPTSQSSMYVQHNAGATDASQYSSPPLIPPGFSRLPSMPFNMHGNIP 2079
Query: 2130 VQQSDNITGIAQNSRGPQPSMQSMQPLPHLQPLQPPQLPCPQQLRLPIQNSLHTEQSGSM 2189
VQQSDNITGIAQNSRGPQPSMQSMQPLPHLQPLQPPQLPCPQQLRLPIQNSLHTEQSGSM
Sbjct: 2080 VQQSDNITGIAQNSRGPQPSMQSMQPLPHLQPLQPPQLPCPQQLRLPIQNSLHTEQSGSM 2139
Query: 2190 SQSHVQMQVQPPQILQQSQTSPYHLYYQTQSQENIPHPQQPQAAHHQVDAATQQQMDSSM 2249
SQSHVQMQVQPPQILQQSQTSPYHLYYQTQSQENIPHPQQPQAAHHQVDAATQQQMDSSM
Sbjct: 2140 SQSHVQMQVQPPQILQQSQTSPYHLYYQTQSQENIPHPQQPQAAHHQVDAATQQQMDSSM 2199
Query: 2250 SLQQYFSSPEAIQSLLSDRDKLCQLLEQHPKLMQMLQDQLGHL 2293
SLQQYFSSPEAIQSLLSDRDKLCQLLEQHPKLMQMLQDQLGHL
Sbjct: 2200 SLQQYFSSPEAIQSLLSDRDKLCQLLEQHPKLMQMLQDQLGHL 2242
BLAST of Spo15904.1 vs. UniProtKB/TrEMBL
Match:
A0A0J8CH03_BETVU (Uncharacterized protein OS=Beta vulgaris subsp. vulgaris GN=BVRB_5g110090 PE=4 SV=1)
HSP 1 Score: 3389.4 bits (8787), Expect = 0.000e+0
Identity = 1777/2171 (81.85%), Postives = 1923/2171 (88.58%), Query Frame = 1
Query: 130 MGRPEPCVLFAQTFVHPQLDEYVDEVMFAEPIVISACEFLEQNAPSASSAVTLLGATSPP 189
M RPEPCVLFAQTFVHPQLDEYVDEV+FAEPIVI+ACEFLEQNA SAS+AVTLLGAT PP
Sbjct: 1 MVRPEPCVLFAQTFVHPQLDEYVDEVIFAEPIVITACEFLEQNASSASAAVTLLGATLPP 60
Query: 190 SFALEVFVQCEGEARFRRLCQPFLYSHSSSNVLEVEAVVTNHLVVRGSYRSLSLVVYGNT 249
SFALEVFVQCEGEARFRRLCQPFLYSHSSSNVLEVEAVVTNHLVVRGSYRSLSLVVYGNT
Sbjct: 61 SFALEVFVQCEGEARFRRLCQPFLYSHSSSNVLEVEAVVTNHLVVRGSYRSLSLVVYGNT 120
Query: 250 AEDLGQFNVEFDLDNSVAKLVSSSEAKLEDLPPLLHSRNPKFEDAISSLKALSSMLPAAD 309
AEDLGQFNVEFDLDNSV LVSSSE KLEDLPPLLHSRNPK ED ISSLKALSSMLPA D
Sbjct: 121 AEDLGQFNVEFDLDNSVTNLVSSSEMKLEDLPPLLHSRNPKHEDTISSLKALSSMLPATD 180
Query: 310 ISVQMRQFLQLIFKILDIPNFGEFSQKVMALITSAVASFFTDDLASAGITLQELNLGALA 369
IS + RQFLQLI KILD+P+ G+F QK+MAL+TS ASFFT DLAS GIT QELNLGA+
Sbjct: 181 ISAERRQFLQLILKILDVPDCGDFFQKIMALVTSVAASFFTIDLASTGITFQELNLGAVL 240
Query: 370 NVKGSFHVFTEAQRNLFELCKRYVGAGHIYGDFLVGCSFLESEAELATSKELVDMLFQHF 429
N KG+ HVFTEA+ NL +LCK YV AGHI GDFLV CSFLESEAELATSKELVD+ FQHF
Sbjct: 241 NFKGASHVFTEAKINLLDLCKTYVEAGHICGDFLVDCSFLESEAELATSKELVDIFFQHF 300
Query: 430 PFNRDTVFVGHPHLSKSNNVLLWLSLALILCSGKESCFHYVNGGGMEQLAHFFSQEMLNS 489
F+R+T+ VGHPH+S+SNNVLLWLSLALIL SGKESCFHYVNGGGM+QLAHF S +M NS
Sbjct: 301 HFDRETI-VGHPHISQSNNVLLWLSLALILSSGKESCFHYVNGGGMKQLAHFLSHDMQNS 360
Query: 490 ATVTLMLLGAMEQATRYSVGCEGLLGWWPREGEKVPSGVSEGYSLLLKLLMQKQRHDIAA 549
ATVTLMLLG +EQATRYSVGCEGLLGWWPRE E VPS VSEGYS LLKLLMQKQRHDIAA
Sbjct: 361 ATVTLMLLGVIEQATRYSVGCEGLLGWWPRESEDVPSSVSEGYSQLLKLLMQKQRHDIAA 420
Query: 550 LTTYILHRLRFYEVASRYEYAVLSILGGLSGDGNATTTTLSLLLKAKLQLRKLLKMINSS 609
L TYILHRLR YEVASRYEYAVLSILGG SGDGNATTTTLS+L+ AKLQLRKLLK+INSS
Sbjct: 421 LVTYILHRLRVYEVASRYEYAVLSILGGHSGDGNATTTTLSMLVNAKLQLRKLLKLINSS 480
Query: 610 GWVEDPSPMACAIRSLMVASSEGSLSFKATSKLIASSSCRFLHQKIDAKLLSLLKERGFL 669
GWVEDPSP+ACAIRS M+++SEG SFKAT KLIASSSCRFLHQ+IDAKLLSL+KERGFL
Sbjct: 481 GWVEDPSPLACAIRSSMMSASEGVSSFKATGKLIASSSCRFLHQEIDAKLLSLMKERGFL 540
Query: 670 PLSVALISSTELRAERGHALEAFLDLTSSIEATILSLMFCRSGLVFLLQHPDLSSTIIRA 729
PLSVALIS +LR ERGHALE+FLD+TSSIEATILS MFCRSGLVFLLQHPDLSSTII A
Sbjct: 541 PLSVALISLPKLRDERGHALESFLDVTSSIEATILSFMFCRSGLVFLLQHPDLSSTIIHA 600
Query: 730 FKGDDNVTKFIPLRYAYVLVSKGFFCHLQEVGTNLEMHLRVVHAVDRLLTSTPCSEELLW 789
FKG+D V + IPLRYA +LVSKGFFC LQEVG N+EMHLRVV+AV RLLTSTPCSEELLW
Sbjct: 601 FKGNDEVIECIPLRYASLLVSKGFFCRLQEVGLNIEMHLRVVNAVGRLLTSTPCSEELLW 660
Query: 790 VLWELCGLSRSDGGRQALLALTHFPEAFTILIEALHSAKELEPISLSCGTPLNLAIFHAA 849
VLWELCGLSRSDGGRQA+LAL HFPEA TILIEALHSAKE E ISLS GTPLNLAIFHAA
Sbjct: 661 VLWELCGLSRSDGGRQAVLALAHFPEAITILIEALHSAKESESISLSSGTPLNLAIFHAA 720
Query: 850 AEICEIIVTDPTSSSVSVWIDHATDLHRALHSSSPGSNRKDAPSRLLEWIDAGVVYHKNG 909
AEICEIIVTDPTSSSVSVWIDHA +LH+ALHSSSPGS+RKDAPSRLLEWIDAGVVYHKNG
Sbjct: 721 AEICEIIVTDPTSSSVSVWIDHAAELHKALHSSSPGSSRKDAPSRLLEWIDAGVVYHKNG 780
Query: 910 AVGLLRYAAVLAFGGDAHITSTNILVSDSMDVEDVVGDTSNESDVNVVESLLGKLVSDKS 969
AVGLLRYAA LA GGDAHITSTNILVSDSMDVE+VVGD+SNESDVNVVE+LLGKLVSDK+
Sbjct: 781 AVGLLRYAAFLASGGDAHITSTNILVSDSMDVENVVGDSSNESDVNVVENLLGKLVSDKT 840
Query: 970 FDGVALRDSSVAQLITAIRILALISENADVAASLFDEGAMTVVYIILVNTRLMLERSSNN 1029
FDGV LRDSSVAQL TAIRILALISEN DVAASLFDEGAMTV+Y ILVN+RLMLERSSNN
Sbjct: 841 FDGVPLRDSSVAQLTTAIRILALISENVDVAASLFDEGAMTVIYAILVNSRLMLERSSNN 900
Query: 1030 YDYLVDDGIECSSTSDLLLERNREKYLVDLLIPSLLLLIELLQRLKEAKEQHRNTKLMNV 1089
YDYLVDDG+ECSSTSDLLLERNREKYLVDLLIPSLLLL+ELLQRLKEAKEQHRNTKLM V
Sbjct: 901 YDYLVDDGVECSSTSDLLLERNREKYLVDLLIPSLLLLLELLQRLKEAKEQHRNTKLMTV 960
Query: 1090 LLRLHRELSPKLAACSVDLSSSYPDSALGLETVCHLIVSALACWPAHAWAPGLFHTLLDS 1149
LLRLH ELSPKLAA +VDLSSSYP+SALGLETVCHLI SALACWPAHAWAPGLFHTLLDS
Sbjct: 961 LLRLHCELSPKLAASAVDLSSSYPESALGLETVCHLIASALACWPAHAWAPGLFHTLLDS 1020
Query: 1150 VQAASLLALGPKETCSMLYLLIDLFPDEGVWHWKNGMPLLTASRSLAVGTLLGPLEERKV 1209
VQAASLLALGPKETCSML +LIDLFP+EG WHWKNGMPL+TA+RSLA+GTLLGPL+E+KV
Sbjct: 1021 VQAASLLALGPKETCSMLCVLIDLFPEEGFWHWKNGMPLMTAARSLALGTLLGPLKEKKV 1080
Query: 1210 NWYLERPCVEILLGQLSPHLDKIAQVIHHYAVGTLVVVQDILRILILRIARQRTEFATVL 1269
NWYLE +EILLGQLSPHLDKIAQVI HYAV TLVVVQD+LRILILRIARQR E+A VL
Sbjct: 1081 NWYLEHSYIEILLGQLSPHLDKIAQVILHYAVSTLVVVQDMLRILILRIARQRAEYAIVL 1140
Query: 1270 LQPMISWIENHLIGTSPLDEIDAYKGYKLLDFLSLLLEHPLAKTLLLKEGTLQILTKVLG 1329
LQP+ISW++NH+ S L ++D YKGYKLLDFLSLL EHPLAKTLLLKEGTLQILTKVLG
Sbjct: 1141 LQPIISWVQNHVAEPSSLGDMDTYKGYKLLDFLSLLFEHPLAKTLLLKEGTLQILTKVLG 1200
Query: 1330 RIDAHVKSVSNGRSSLNSRFNIYNCCLPVLRSLSLICETRSAQKLHLLLDGNETLSTEDS 1389
RI AH KSVS+GRSSLN+ F+IYNCCLPVLRSL+LIC+TR QKL+ + D NE LSTEDS
Sbjct: 1201 RIYAHGKSVSDGRSSLNAGFSIYNCCLPVLRSLTLICDTRPTQKLYGVFDCNEILSTEDS 1260
Query: 1390 AVIFSYIFKLCQHLPAGNELVACLGTFKDLASTAIGQSTLLDTYLFIRSSSNNENQPGRS 1449
AVI +Y+FKLCQ LP GNELVACL TFK++AS A GQ L D +LFIRS + +ENQPGR
Sbjct: 1261 AVILNYVFKLCQLLPVGNELVACLTTFKEVASHAEGQRALFDAFLFIRSGNTSENQPGRP 1320
Query: 1450 QESYQFYNRLIDAEWKGCPLLTCWINLYGSVETEGLSTCSVEAIGLLSIGVLRFCMSGRS 1509
QE + FYN LID EW+ CPLL CWINLY SVE EGLST SV+A+G+LS+G +RFC+ G+S
Sbjct: 1321 QEFHGFYNHLIDVEWRRCPLLCCWINLYRSVENEGLSTYSVDAVGVLSVGAMRFCIEGKS 1380
Query: 1510 FNEDRIDALKYFFGLSSDVDGANGVLEENIKYVQDMCTLLSSKITDDEYAVSSHLKATLH 1569
FN+D+IDALKYFFGL D D NG LEEN+KYVQDM +LLSSK+TD+EYA SSHLKATLH
Sbjct: 1381 FNKDKIDALKYFFGLWCDTDETNGTLEENMKYVQDMSSLLSSKMTDNEYAASSHLKATLH 1440
Query: 1570 QAVESVKFLSLLLLKPIDSFKDDDAFTDEVLLFSSDAKLSSKIHLLTDSSSERVENELNF 1629
Q VES + LSLLLLKPID+ K +D ++ VLLFS DAKLS+KIHLLTDSSSERVENE N
Sbjct: 1441 QVVESARCLSLLLLKPIDALKVNDIISEGVLLFSPDAKLSTKIHLLTDSSSERVENEWNI 1500
Query: 1630 GELGESFQWECPENLRNRLSQAGLLGKRKISAVDGVNRHGKGENAPVDVRGSGSSMAPPI 1689
GEL + FQWECPENLRNRLSQAGLLGKRKIS VDG NRHGKG+NAPVDVRGSG S+APPI
Sbjct: 1501 GELSDQFQWECPENLRNRLSQAGLLGKRKISTVDGANRHGKGDNAPVDVRGSGPSIAPPI 1560
Query: 1690 PTRRDTFRQRKPNTSRPPSMHVDDYVARERNES--SNPNVIAVPRLGSSGGRPPSIHVDE 1749
PTRRDTFRQRKPNTSRPPSMHVDDYVARERN SNPNVIAVPRLGSSGGRPPSIHVDE
Sbjct: 1561 PTRRDTFRQRKPNTSRPPSMHVDDYVARERNSDGVSNPNVIAVPRLGSSGGRPPSIHVDE 1620
Query: 1750 FMARQRERHNSVPIAGVEASSQGKKITLEIDNDTEKSNRSKQLKVDLDDDLQ--EIMFDA 1809
FMARQRERHNSV +AGVE +S GKK+T E DN TEKSNRSKQLKVDLDDDLQ +I+FDA
Sbjct: 1621 FMARQRERHNSVAMAGVEVASLGKKVTPENDNGTEKSNRSKQLKVDLDDDLQGIDIVFDA 1680
Query: 1810 EENEADDKLPFPQADDDVQQTASASAETSPQHSIVEETESEAKDLEATVASNAVENAQSE 1869
EENE DDKLPFPQ DD++Q+ ASASAE SPQHSIV ETES+AKDLEATVASNA EN QSE
Sbjct: 1681 EENEPDDKLPFPQEDDEIQRAASASAERSPQHSIVAETESDAKDLEATVASNADENTQSE 1740
Query: 1870 LSSRMSVSRPEMQLRREPSVSSENKYFEQPDETNNKNPTMVSGGFDSVAVAKTSGFPASV 1929
LSSRMSVS P MQLRREPSVSSE KYF + D+ N ++S GFDS AVAKTSG AS
Sbjct: 1741 LSSRMSVSHPGMQLRREPSVSSEKKYFGKLDQNNGT--AIISDGFDS-AVAKTSGLSASK 1800
Query: 1930 YGKHSPSGAQVPADARIPPSSFIPDNNLQRVGNVLLGRSQGRHEQKHLPNQPPLPPTPPP 1989
YGK + S AQVP D+R PP S+ PDNNLQR+GNV LG QG HEQK++PNQPPLPPTPPP
Sbjct: 1801 YGKGT-SMAQVPMDSRTPPPSYCPDNNLQRIGNVALGGPQGHHEQKYMPNQPPLPPTPPP 1860
Query: 1990 STISPVRSQVSENQTSRSPFVNPHNDVQPPLYPTYAQTDYQSSFGNVPTSFASSNFMSDS 2049
STISP +SQV N S SPFVNPHNDVQPPLY Y QTDYQS FG+VP+S ASSNFM DS
Sbjct: 1861 STISPGKSQVDTNSGSASPFVNPHNDVQPPLYQNYTQTDYQS-FGSVPSSLASSNFMLDS 1920
Query: 2050 RYARTTNSSPGSSNRPLPPLPPTPPSFSATPLNVPSRSPTSQSSMYVQHNAGATDASQYS 2109
RYARTT+SSPG SNRPLPPLPPTPP FS TPLN+PSRS SQSS+Y Q+N G + SQ++
Sbjct: 1921 RYARTTHSSPGGSNRPLPPLPPTPPPFSTTPLNLPSRSSASQSSVYAQNNTGVNEVSQFA 1980
Query: 2110 SPPLIPPGFSRLPSMPFNMHGNIPVQQSDNITGIAQNSRGPQPSMQSMQPLPHLQPLQPP 2169
PPL+PPGF+R SMPF MHGNIPVQQSDN+TG+AQNS QP MQSMQPLPHLQPLQPP
Sbjct: 1981 PPPLMPPGFNRPSSMPFTMHGNIPVQQSDNVTGVAQNSNASQPPMQSMQPLPHLQPLQPP 2040
Query: 2170 QLPCPQ---QLRLPIQNSLHTEQSGSMSQSHVQMQVQPPQILQQSQTSPYHLYYQTQSQE 2229
QLPCPQ QLR P+QN LHTEQ GS SQSH VQP QILQQSQ SPYHLYYQ Q Q+
Sbjct: 2041 QLPCPQLPHQLRPPMQNPLHTEQRGSASQSH----VQPQQILQQSQMSPYHLYYQNQLQD 2100
Query: 2230 NIPHPQQP-QAAHHQVDAATQQQMDSSMSLQQYFSSPEAIQSLLSDRDKLCQLLEQHPKL 2289
N PH QQ Q AHHQ +A TQQQ+DSSMSLQQYFSSPEAIQSLLSDRDKLCQLLEQHPKL
Sbjct: 2101 NFPHSQQQLQVAHHQGEAVTQQQVDSSMSLQQYFSSPEAIQSLLSDRDKLCQLLEQHPKL 2160
Query: 2290 MQMLQDQLGHL 2293
MQMLQ++LG L
Sbjct: 2161 MQMLQERLGRL 2161
BLAST of Spo15904.1 vs. UniProtKB/TrEMBL
Match:
A0A067FWV3_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g000104mg PE=4 SV=1)
HSP 1 Score: 2167.5 bits (5615), Expect = 0.000e+0
Identity = 1239/2238 (55.36%), Postives = 1572/2238 (70.24%), Query Frame = 1
Query: 130 MGRPEPCVLFAQTFVHPQLDEYVDEVMFAEPIVISACEFLEQNAPSASSAVTLLGATSPP 189
MGRPEPCVLFAQTFVHPQLDEYVDEV+FAEPIVI+ACEFLEQNA S S AV+L+GATSPP
Sbjct: 1 MGRPEPCVLFAQTFVHPQLDEYVDEVLFAEPIVITACEFLEQNASSTSQAVSLVGATSPP 60
Query: 190 SFALEVFVQCEGEARFRRLCQPFLYSHSSSNVLEVEAVVTNHLVVRGSYRSLSLVVYGNT 249
SFALEVFVQCEGE RFRRLCQPFLYSHSSSNVLEVEAVVTNHLVVRGSYRSLSL++YGNT
Sbjct: 61 SFALEVFVQCEGETRFRRLCQPFLYSHSSSNVLEVEAVVTNHLVVRGSYRSLSLIIYGNT 120
Query: 250 AEDLGQFNVEFDLDNSVAKLVSSSEAKLEDLPPLLHSRNPKFEDAISSLKALSSMLPAAD 309
AEDLGQFN++FD D+S+ LV+S+E KLEDLP LHS N E+++SSLK LS + +D
Sbjct: 121 AEDLGQFNIDFD-DSSLTDLVTSAEGKLEDLPLALHSINRTIEESLSSLKVLSLPVAPSD 180
Query: 310 ISVQMRQFLQLIFKILDIPNFGEFSQKVMALITSAVASFFTDDLASAGITLQELNLGA-- 369
IS++++Q L L+ K+ ++PN K ++ + A +SF T DL G T ++ +L +
Sbjct: 181 ISIEVKQLLHLMLKVFELPNPESAVHKTVSTVVLAASSFVTHDL---GDTTRQKHLASGK 240
Query: 370 -LANVKGSFHVFTEAQRNLFELCKRYVGAGHIYG----DFLVGCSFLESEAELATSKELV 429
+N H +A++ L +L Y H G + L C+FLE+EA++A+SKELV
Sbjct: 241 RKSNEDDLQHAVIKARKELIQL---YAALQHDSGNDSAELLGDCTFLETEADMASSKELV 300
Query: 430 DMLFQHFPFNRDTVFVGHPHLSKSNNVLLWLSLALILCSGKESCFHYVNGGGMEQLAHFF 489
DML Q+F F+ ++ +GH +L ++ +V+L LS+AL+LCSG+E CFH+V+ GGM+QLA+ F
Sbjct: 301 DMLSQYFNFSGNSTSLGHHNLPQNKSVILGLSVALLLCSGREGCFHFVDSGGMDQLAYVF 360
Query: 490 SQEMLNSATVTLMLLGAMEQATRYSVGCEGLLGWWPREGEKVPSGVSEGYSLLLKLLMQK 549
S+++ NS + L++LGA+EQATR+S+GCEG LGWWPRE E +PSG+SEGYS LL LL+QK
Sbjct: 361 SRDIENSTVIMLLVLGAIEQATRHSIGCEGFLGWWPREDENIPSGISEGYSRLLNLLLQK 420
Query: 550 QRHDIAALTTYILHRLRFYEVASRYEYAVLSILGGLSGDGNATTTTLSLLLKAKLQLRKL 609
RHD+A+L T++LHRLR+YEVASRYE AVLS+LGGL G T T ++L+ AK QL+KL
Sbjct: 421 PRHDVASLATFVLHRLRYYEVASRYESAVLSVLGGLPAAGKVTAATSNMLISAKSQLKKL 480
Query: 610 LKMINSSGWVEDPSPMACAIRSLMVASSEGSLSFKATSKLIASSSCRFLHQKIDAKLLSL 669
LK+IN G +EDPSP++ A RSL + +EG LS+K TS LIASS+C F + ID LL+L
Sbjct: 481 LKLINLRGPIEDPSPVSSARRSLTLVQAEGLLSYKVTSNLIASSTCSFSNSDIDPYLLTL 540
Query: 670 LKERGFLPLSVALISSTELRAERGHALEAFLDLTSSIEATILSLMFCRSGLVFLLQHPDL 729
LKERGFL LS AL+SS+ LR E G A++ +LD+ SSI A ILSL+FC SGLVFLL H ++
Sbjct: 541 LKERGFLSLSAALLSSSILRTEVGDAMDVYLDIASSIGAIILSLLFCHSGLVFLLHHYEI 600
Query: 730 SSTIIRAFKG--DDNVTKFIPLRYAYVLVSKGFFCHLQEVGTNLEMHLRVVHAVDRLLTS 789
S+T+I A +G D N + +PLRYAYVL+SKGF C LQEV T +EMHLRVV+A+DRLLTS
Sbjct: 601 SATLIHALRGVTDMNKEECVPLRYAYVLMSKGFTCGLQEVATIVEMHLRVVNAIDRLLTS 660
Query: 790 TPCSEELLWVLWELCGLSRSDGGRQALLALTHFPEAFTILIEALHSAKELEPISLSCGT- 849
TP SEE LWVLWELCG+SRSD GRQALL L FPEA ++LIEALHSAKE EP + S GT
Sbjct: 661 TPQSEEFLWVLWELCGVSRSDCGRQALLTLGFFPEAVSMLIEALHSAKEQEPSTKSGGTS 720
Query: 850 PLNLAIFHAAAEICEIIVTDPTSSSVSVWIDHATDLHRALHSSSPGSNRKDAPSRLLEWI 909
PL+LAI H+AAEI EIIVTD T+SS+ WI A +LH+ALHSSSPGSNRKDAP+RLLEWI
Sbjct: 721 PLSLAILHSAAEIFEIIVTDSTASSLGSWIGRAMELHKALHSSSPGSNRKDAPTRLLEWI 780
Query: 910 DAGVVYHKNGAVGLLRYAAVLAFGGDAHITSTNILVSDSMDVEDVVGDTSNESDVNVVES 969
D GVVYHK+G +GLLRYAAVLA GGDAH++ST+ LVSD M+VE+ G+ S+ SD NV+E+
Sbjct: 781 DPGVVYHKSGVIGLLRYAAVLASGGDAHLSSTSNLVSDLMEVENATGEPSSGSDFNVMEN 840
Query: 970 LLGKLVSDKSFDGVALRDSSVAQLITAIRILALISENADVAASLFDEGAMTVVYIILVNT 1029
L+ K++S+KSFDGV LRDSS+AQL TA+RILA ISEN+ VAA+L++EGA+TVVY ILVN
Sbjct: 841 LV-KIISEKSFDGVTLRDSSIAQLTTALRILAFISENSAVAAALYEEGAVTVVYTILVNC 900
Query: 1030 RLMLERSSNNYDYLVDDGIECSSTSDLLLERNREKYLVDLLIPSLLLLIELLQRLKEAKE 1089
R MLERSSNNYDYL+DDG EC+S+SDLLLERNRE+ LVDLL+PSL+ LI +LQ+L+E E
Sbjct: 901 RFMLERSSNNYDYLIDDGTECNSSSDLLLERNREQCLVDLLVPSLVFLITILQKLQEGNE 960
Query: 1090 QHRNTKLMNVLLRLHRELSPKLAACSVDLSSSYPDSALGLETVCHLIVSALACWPAHAWA 1149
QH+NTKLMN LLRLHRE+SPKLAAC+ DLSS YP+SAL VC L VSALA WP + W
Sbjct: 961 QHKNTKLMNALLRLHREVSPKLAACAADLSSPYPNSALSFGAVCRLGVSALAFWPIYGWT 1020
Query: 1150 PGLFHTLLDSVQAASLLALGPKETCSMLYLLIDLFPDEGVWHWKNGMPLLTASRSLAVGT 1209
PGLFH+LL SVQ SLLALGPKETCS+L LL DLFP+E +W W+NGMP L+A R+LAVG+
Sbjct: 1021 PGLFHSLLVSVQTTSLLALGPKETCSLLCLLNDLFPEEDIWLWRNGMPSLSALRTLAVGS 1080
Query: 1210 LLGPLEERKVNWYLERPCVEILLGQLSPHLDKIAQVIHHYAVGTLVVVQDILRILILRIA 1269
LLGP +ER+V WYLE C E LL QL PHLDKIAQ+I HYA+ L+V+QD+LR+LI+R+A
Sbjct: 1081 LLGPQKEREVEWYLEPGCREKLLTQLRPHLDKIAQIIRHYAISALIVIQDMLRVLIIRVA 1140
Query: 1270 RQRTEFATVLLQPMISWIENHLIGTSPLDEIDAYKGYKLLDFLSLLLEHPLAKTLLLKEG 1329
Q++E A++LLQP+++WI +H+ +S ++D YK ++LLDFL+ LLEHP AK +LLKEG
Sbjct: 1141 SQKSENASLLLQPILAWIRDHVSDSSSPSDMDVYKVHRLLDFLASLLEHPCAKAVLLKEG 1200
Query: 1330 TLQILTKVLGRI----DAHVKSVSNGRSSLNSRFNIYNCCLPVLRSLSLICETRSAQK-- 1389
Q+L +VL R D+ K S+ +S+ + + CLPV +S SL+C +++ +
Sbjct: 1201 VPQMLIEVLKRCFEATDSDGKQFSDQLNSVKIGSTLTSWCLPVFKSFSLLCCSQTPMQHP 1260
Query: 1390 -LHLLLDGNETLSTEDSAVIFSYIFKLCQHLPAGNELVACLGTFKDLASTAIGQSTLLDT 1449
H L + LS +D ++I +I K CQ LP G ELV CL F++L S GQS L+
Sbjct: 1261 GRHDLYKF-DNLSADDCSLILPHILKFCQVLPVGKELVFCLTAFRELVSCGEGQSALISI 1320
Query: 1450 YLFIRSSSNNENQPGRSQESYQFYNRLIDAEW-KGCPLLTCWINLYGSVET-EGLSTCSV 1509
S+ E GR E + L + EW K PLL CW L SV++ +GLST +V
Sbjct: 1321 ICHTHSTLE-EFDSGRGHERNDDRSLLNEFEWRKNPPLLCCWTKLLNSVDSNDGLSTYAV 1380
Query: 1510 EAIGLLSIGVLRFCMSGRSFNEDRIDALKYFFGLSSDVDGANGVLEENIKYVQDMCTLLS 1569
EA+ LS+G LRFC+ +S N + I ALKY FGL D G EEN+K +Q M T+LS
Sbjct: 1381 EAVCALSLGSLRFCLDRKSLNSNAIAALKYLFGLPDDKSGTESFPEENVKLIQKMYTVLS 1440
Query: 1570 SKITDDEYAVSSHLKATLHQAVESVKFLSLLLLKPIDSFKDDDA-FTDEVLLFSSDAKLS 1629
D+ Y+ L+ +L Q +E VK L LLL KP S D+ FT+ +L +D +
Sbjct: 1441 KINDDNYYSAIPDLQTSLCQVLEFVKVLLLLLQKPTGSVDVDNVIFTEGILPSPNDILVL 1500
Query: 1630 SKIHLLTDSSSERVENELNFGELGESFQWECPENLRNRLSQAGLLGKRKISAVDGVNRHG 1689
S IH + + E+ +++L L + F WECPE L RLSQ L KRK+ V+G++R
Sbjct: 1501 SNIHQMAGGNVEKDDDKLYLVGLEDKFMWECPETLPERLSQTALPAKRKMPPVEGLSRRA 1560
Query: 1690 KGENAPVDV------RGSGSSMAPPIPTRRDTFRQRKPNTSRPPSMHVDDYVARERNES- 1749
+GEN+ + RG G + AP PT+RD+FRQRKPNTSR PS+HVDDY+A+ER+
Sbjct: 1561 RGENSAAETTQNTFSRGLGPTTAPSGPTKRDSFRQRKPNTSRAPSLHVDDYIAKERSGEG 1620
Query: 1750 -SNPNVIAVPRLGSSGGRPPSIHVDEFMARQRERHNSVPIAGVEASSQGKKITLEIDNDT 1809
SN NVI R+GS+GGR PS+HVDEFMAR+RER + EA+ Q K
Sbjct: 1621 VSNSNVIIAQRVGSAGGRAPSVHVDEFMARERERQKRIVTVVGEATIQVKNEAPTSGTQK 1680
Query: 1810 EKSNRSKQLKVDLDDDLQ--EIMFDAEENEADDKLPFPQADDDVQQTASASAETSPQHSI 1869
EK ++ KQLK D DDDLQ +I+FD EE+E DDKLPFPQ DD++QQ A E S HSI
Sbjct: 1681 EKVDKPKQLKTDPDDDLQGIDIVFDDEESEPDDKLPFPQLDDNLQQPAPVIVEQSSPHSI 1740
Query: 1870 VEETESEAKD------LEATVASNAVENAQSELSSRMSVSRPEMQLRREPSVSSENKYFE 1929
VEETES+ + + +ASNA ENAQSE SSRMSVSRP++ L REPSVSS+ K+FE
Sbjct: 1741 VEETESDVNESGQFSRMSTPLASNADENAQSEFSSRMSVSRPDVPLTREPSVSSDKKFFE 1800
Query: 1930 QPDETNNKNPTMVSGGFDSVAVAKTSGFPASVYGKHSPSGAQVPADARIPPSSFIPDNNL 1989
Q D++ N SG FDS A A + GF AS+Y ++ +G+ +P D+R+ +F P N+
Sbjct: 1801 QSDDSKNVITAKASGVFDSGAAANSPGFSASLY--NNATGSSMPTDSRMN-QNFYPKNSP 1860
Query: 1990 QRVGNVLLGR-SQGRHEQKHLPNQPPLPPTPPPSTISPVRSQVSENQTSRS-PFVNPHND 2049
Q N+ +G S+G ++QK +PNQPPLPP PPP ISP SQ S++ S S P+VN +
Sbjct: 1861 QHAANLPVGTGSRGLYDQKVMPNQPPLPPMPPPQAISPGMSQASDSIPSHSSPYVNSLTE 1920
Query: 2050 VQPPLYPTY-AQTDYQSSFGNVPTSFASSNFMSDSRYARTTNSSPGSSNRPLPPLPPTPP 2109
VQ + P + DY S+F +S+PG S+RP PLPPTPP
Sbjct: 1921 VQMSVPPGFQVHADYLSAFSG--------------------SSTPGGSSRP--PLPPTPP 1980
Query: 2110 SFSATPLNVPSRSPTSQSSMYVQHNAGATDASQ------------------------YSS 2169
FS++P N+PS SQ SMY Q+ G TD Q Y
Sbjct: 1981 PFSSSPYNLPSFKANSQMSMYNQNIGGTTDLPQAQSSVVPMIDARLGSVSASAAGVSYPP 2040
Query: 2170 PPLIPP-GFSRLPSMPFNMHGNIPVQQSDNITGIAQNSRGPQPSMQSMQPLPHLQPLQPP 2229
P ++PP F+R S+P ++GN P QQ I QN PQ S+QSM L LQPLQPP
Sbjct: 2041 PHIMPPLVFNRPASIPATLYGNTPAQQQGEIM---QNLSIPQSSIQSMHSLAPLQPLQPP 2100
Query: 2230 QLPCPQQ----LRLPIQNSLHTEQSGSMSQSHVQMQVQPPQILQQSQTSPYHLYYQTQSQ 2289
Q+P PQQ LR P+Q S EQ S+ Q+ VQMQV P Q +Q Q SP H YYQ+Q Q
Sbjct: 2101 QVPRPQQPPQHLRPPMQASQQLEQVTSL-QNPVQMQVHPLQTMQPPQVSPIHTYYQSQQQ 2160
Query: 2290 ENIPHPQQ-------PQAAHHQVDAATQQQMDSSMSLQQYFSSPEAIQSLLSDRDKLCQL 2293
E P QQ PQ HHQ D +QQQ D +MSL +YF SPEAIQSLLSDR+KLCQL
Sbjct: 2161 EFSPAQQQQQVERNQPQVQHHQGDIGSQQQQDPAMSLHEYFKSPEAIQSLLSDREKLCQL 2199
BLAST of Spo15904.1 vs. UniProtKB/TrEMBL
Match:
V4S738_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10010887mg PE=4 SV=1)
HSP 1 Score: 2164.0 bits (5606), Expect = 0.000e+0
Identity = 1234/2237 (55.16%), Postives = 1569/2237 (70.14%), Query Frame = 1
Query: 130 MGRPEPCVLFAQTFVHPQLDEYVDEVMFAEPIVISACEFLEQNAPSASSAVTLLGATSPP 189
MGRPEPCVLFAQTFVHPQLDEYVDEV+FAEPIVI+ACEFLEQNA S S AV+L+GATSPP
Sbjct: 1 MGRPEPCVLFAQTFVHPQLDEYVDEVLFAEPIVITACEFLEQNASSTSQAVSLVGATSPP 60
Query: 190 SFALEVFVQCEGEARFRRLCQPFLYSHSSSNVLEVEAVVTNHLVVRGSYRSLSLVVYGNT 249
SFALEVFVQCEGE RFRRLCQPFLYSHSSSNVLEVEAVVTNHLVVRGSYRSLSL++YGNT
Sbjct: 61 SFALEVFVQCEGETRFRRLCQPFLYSHSSSNVLEVEAVVTNHLVVRGSYRSLSLIIYGNT 120
Query: 250 AEDLGQFNVEFDLDNSVAKLVSSSEAKLEDLPPLLHSRNPKFEDAISSLKALSSMLPAAD 309
AEDLGQFN++FD D+S+ LV+S+E KLEDLP LHS N E+++SSLK LS + +D
Sbjct: 121 AEDLGQFNIDFD-DSSLTDLVTSAEGKLEDLPLALHSINRTIEESLSSLKVLSLPVAPSD 180
Query: 310 ISVQMRQFLQLIFKILDIPNFGEFSQKVMALITSAVASFFTDDLASAGITLQELNLGA-- 369
IS++++Q L L+ K+ ++PN K ++ + A +SF T DL G T ++ +L +
Sbjct: 181 ISIEVKQLLHLMLKVFELPNPESAVHKTVSTVVLAASSFVTHDL---GDTTRQKHLASGK 240
Query: 370 -LANVKGSFHVFTEAQRNLFELCKRYVGAGHIYG----DFLVGCSFLESEAELATSKELV 429
+N H +A++ L +L Y H G + L C+F E+EA++A+SKELV
Sbjct: 241 RKSNEDDLQHAVIKARKELIQL---YAALQHDSGNDSAELLGDCAFFETEADMASSKELV 300
Query: 430 DMLFQHFPFNRDTVFVGHPHLSKSNNVLLWLSLALILCSGKESCFHYVNGGGMEQLAHFF 489
DML Q+F F+ ++ +GH +L ++ +V+L LS+AL+LCSG+E CFH+V+ GGM+QLA+ F
Sbjct: 301 DMLSQYFNFSGNSTSLGHHNLPQNKSVILGLSVALLLCSGREGCFHFVDSGGMDQLAYVF 360
Query: 490 SQEMLNSATVTLMLLGAMEQATRYSVGCEGLLGWWPREGEKVPSGVSEGYSLLLKLLMQK 549
S+++ NS + L++LGA+EQATR+S+GCEG LGWWPRE E +PSG+SEGYS LL LL+QK
Sbjct: 361 SRDIENSTVIMLLVLGAIEQATRHSIGCEGFLGWWPREDENIPSGISEGYSRLLNLLLQK 420
Query: 550 QRHDIAALTTYILHRLRFYEVASRYEYAVLSILGGLSGDGNATTTTLSLLLKAKLQLRKL 609
RHD+A+L T++L RLR+YEVASRYE AVLS+LGGL G T T ++L+ AK QL+KL
Sbjct: 421 PRHDVASLATFVLRRLRYYEVASRYESAVLSVLGGLPAAGKVTAATSNMLISAKSQLKKL 480
Query: 610 LKMINSSGWVEDPSPMACAIRSLMVASSEGSLSFKATSKLIASSSCRFLHQKIDAKLLSL 669
LK+IN G +EDPSP++ A RSL + +EG LS+K TS LIASS+C F + ID LL+L
Sbjct: 481 LKLINLRGPIEDPSPVSSARRSLTLVQAEGLLSYKVTSNLIASSTCSFSNSDIDPYLLTL 540
Query: 670 LKERGFLPLSVALISSTELRAERGHALEAFLDLTSSIEATILSLMFCRSGLVFLLQHPDL 729
LKERGFL LS AL+SS+ LR E G A++ +LD+ SSI A ILSL+FC SGLVFLL H ++
Sbjct: 541 LKERGFLSLSAALLSSSILRTEVGDAMDVYLDIASSIGAIILSLLFCHSGLVFLLHHYEI 600
Query: 730 SSTIIRAFKG--DDNVTKFIPLRYAYVLVSKGFFCHLQEVGTNLEMHLRVVHAVDRLLTS 789
S+T+I A +G D N + +PLRYAYVL+SKGF C LQEV T +EMHLRVV+A+DRLLTS
Sbjct: 601 SATLIHALRGVTDMNKEECVPLRYAYVLMSKGFTCGLQEVATIVEMHLRVVNAIDRLLTS 660
Query: 790 TPCSEELLWVLWELCGLSRSDGGRQALLALTHFPEAFTILIEALHSAKELEPISLSCGTP 849
TP SEE LWVLWELCG+SRSD GRQALL L FPEA ++LIEALHSAKE EP + S +P
Sbjct: 661 TPQSEEFLWVLWELCGVSRSDCGRQALLTLGFFPEAVSMLIEALHSAKEQEPSTKSGASP 720
Query: 850 LNLAIFHAAAEICEIIVTDPTSSSVSVWIDHATDLHRALHSSSPGSNRKDAPSRLLEWID 909
L+LAI H+AAEI EIIVTD T+SS+ WI A +LH+ALHSSSPGSNRKDAP+RLLEWID
Sbjct: 721 LSLAILHSAAEIFEIIVTDSTASSLGSWIGRAMELHKALHSSSPGSNRKDAPTRLLEWID 780
Query: 910 AGVVYHKNGAVGLLRYAAVLAFGGDAHITSTNILVSDSMDVEDVVGDTSNESDVNVVESL 969
GVVYHK+G +GLLRYAAVLA GGDAH++ST+ LVSD M+VE+ G+ S+ SD NV+E+L
Sbjct: 781 PGVVYHKSGVIGLLRYAAVLASGGDAHLSSTSNLVSDLMEVENATGEPSSGSDFNVMENL 840
Query: 970 LGKLVSDKSFDGVALRDSSVAQLITAIRILALISENADVAASLFDEGAMTVVYIILVNTR 1029
+ K++S+KSFDGV LRDSS+AQL TA+RILA ISEN+ VAA+L++EGA+TVVY ILVN R
Sbjct: 841 V-KIISEKSFDGVTLRDSSIAQLTTALRILAFISENSAVAAALYEEGAVTVVYTILVNCR 900
Query: 1030 LMLERSSNNYDYLVDDGIECSSTSDLLLERNREKYLVDLLIPSLLLLIELLQRLKEAKEQ 1089
MLERSSNNYDYL+DDG EC+S+SDLLLERNRE+ LVDLL+PSL+ LI +LQ+L+E EQ
Sbjct: 901 FMLERSSNNYDYLIDDGTECNSSSDLLLERNREQCLVDLLVPSLVFLITILQKLQEGNEQ 960
Query: 1090 HRNTKLMNVLLRLHRELSPKLAACSVDLSSSYPDSALGLETVCHLIVSALACWPAHAWAP 1149
H+NTKLMN LLRLHRE+SPKLAAC+ DLSS YP+SAL VC L+VSALA WP + W P
Sbjct: 961 HKNTKLMNALLRLHREVSPKLAACAADLSSPYPNSALSFGAVCRLVVSALAFWPIYGWTP 1020
Query: 1150 GLFHTLLDSVQAASLLALGPKETCSMLYLLIDLFPDEGVWHWKNGMPLLTASRSLAVGTL 1209
GLFH+LL SVQ SLLALGPKETCS+L LL DLFP+E +W W+NGMP L+A R+LAVG+L
Sbjct: 1021 GLFHSLLVSVQTTSLLALGPKETCSLLCLLNDLFPEEDIWLWRNGMPSLSALRTLAVGSL 1080
Query: 1210 LGPLEERKVNWYLERPCVEILLGQLSPHLDKIAQVIHHYAVGTLVVVQDILRILILRIAR 1269
LGP +ER+V WYLE C E LL QL PHLDKIAQ+I HYA+ L+V+QD+LR+LI+R+A
Sbjct: 1081 LGPQKEREVEWYLEPGCREKLLTQLRPHLDKIAQIIRHYAISALIVIQDMLRVLIIRVAS 1140
Query: 1270 QRTEFATVLLQPMISWIENHLIGTSPLDEIDAYKGYKLLDFLSLLLEHPLAKTLLLKEGT 1329
Q++E A++LL+P+++WI +H+ +S ++D YK ++LLDFLS LLEHP AK +LLKEG
Sbjct: 1141 QKSENASLLLRPILAWIRDHVSDSSSPSDMDVYKVHRLLDFLSSLLEHPCAKAVLLKEGV 1200
Query: 1330 LQILTKVLGRI----DAHVKSVSNGRSSLNSRFNIYNCCLPVLRSLSLICETRSAQK--- 1389
Q+L +VL R D+ K S+ +S+ + + CLPV +S SL+C +++ +
Sbjct: 1201 PQMLIEVLKRCFEATDSDGKQFSDQLNSVKIGSTLTSWCLPVFKSFSLLCCSQTPMQHPG 1260
Query: 1390 LHLLLDGNETLSTEDSAVIFSYIFKLCQHLPAGNELVACLGTFKDLASTAIGQSTLLDTY 1449
H L + LS +D ++I +I K CQ LP G ELV CL F++L S GQS L+
Sbjct: 1261 RHDLYKF-DNLSADDCSLILPHILKFCQVLPVGKELVFCLTAFRELVSCGEGQSALISII 1320
Query: 1450 LFIRSSSNNENQPGRSQESYQFYNRLIDAEW-KGCPLLTCWINLYGSVET-EGLSTCSVE 1509
S+ E GR E + L + EW K PLL CW L SV++ +GLST +VE
Sbjct: 1321 CHTHSTLE-EFDSGRGHERNDDRSLLNEFEWRKNPPLLCCWTKLLNSVDSNDGLSTYAVE 1380
Query: 1510 AIGLLSIGVLRFCMSGRSFNEDRIDALKYFFGLSSDVDGANGVLEENIKYVQDMCTLLSS 1569
A+ L +G LRFC+ +S N + I ALKY FGL D G EEN+K +Q M T+LS
Sbjct: 1381 AVCALLLGSLRFCLDRKSLNSNAIAALKYLFGLPDDKSGTESFPEENVKLIQKMFTVLSK 1440
Query: 1570 KITDDEYAVSSHLKATLHQAVESVKFLSLLLLKPIDSFKDDDA-FTDEVLLFSSDAKLSS 1629
D+ Y+ L+ +L Q +E VK L LLL KP S D+ FT+ +L +D + S
Sbjct: 1441 INDDNYYSAIPDLQTSLCQVLEFVKVLLLLLQKPTGSVDVDNVIFTEGILPSPNDVLVLS 1500
Query: 1630 KIHLLTDSSSERVENELNFGELGESFQWECPENLRNRLSQAGLLGKRKISAVDGVNRHGK 1689
IH + + E+ +++L L + F WECPE L RLSQ L KRK+ V+G++R +
Sbjct: 1501 NIHQMAGGNVEKDDDKLYLVGLEDKFMWECPETLPERLSQTALPAKRKMPPVEGLSRRAR 1560
Query: 1690 GENAPVDV------RGSGSSMAPPIPTRRDTFRQRKPNTSRPPSMHVDDYVARERNES-- 1749
GEN+ + RG G + AP PT+RD+FRQRKPNTSR PS+HVDDY+A+ER+
Sbjct: 1561 GENSAAETTQNTFSRGLGPTTAPSGPTKRDSFRQRKPNTSRAPSLHVDDYIAKERSGEGV 1620
Query: 1750 SNPNVIAVPRLGSSGGRPPSIHVDEFMARQRERHNSVPIAGVEASSQGKKITLEIDNDTE 1809
SN NVI R+GS+GGR PS+HVDEFMAR+RER + EA+ Q K E
Sbjct: 1621 SNSNVIIAQRVGSAGGRAPSVHVDEFMARERERQKRIVTVVGEATIQVKNEAPTSGTQKE 1680
Query: 1810 KSNRSKQLKVDLDDDLQ--EIMFDAEENEADDKLPFPQADDDVQQTASASAETSPQHSIV 1869
K ++ KQLK D DDDLQ +I+FD EE+E DDKLPFPQ DD++QQ A E S HSIV
Sbjct: 1681 KVDKPKQLKTDPDDDLQGIDIVFDDEESEPDDKLPFPQLDDNLQQPAPVIVEQSSPHSIV 1740
Query: 1870 EETESEAKD------LEATVASNAVENAQSELSSRMSVSRPEMQLRREPSVSSENKYFEQ 1929
EETES+ + + +ASNA ENAQSE SSRMSVSRP++ L REPSVSS+ K+FEQ
Sbjct: 1741 EETESDVNESGQFSRMSTPLASNADENAQSEFSSRMSVSRPDVPLTREPSVSSDKKFFEQ 1800
Query: 1930 PDETNNKNPTMVSGGFDSVAVAKTSGFPASVYGKHSPSGAQVPADARIPPSSFIPDNNLQ 1989
D++ N SG FDS A A + GF AS+Y ++ +G+ +P D+R+ +F P N+ Q
Sbjct: 1801 SDDSKNVITAKASGVFDSGAAANSPGFSASLY--NNATGSSMPTDSRMN-QNFYPKNSPQ 1860
Query: 1990 RVGNVLLGR-SQGRHEQKHLPNQPPLPPTPPPSTISPVRSQVSENQTSRS-PFVNPHNDV 2049
N+ +G S+G ++QK +PNQPPLPP PPP ISP SQ S++ S S P+VN +V
Sbjct: 1861 HAANLPVGTGSRGLYDQKVMPNQPPLPPMPPPQAISPGMSQASDSIPSHSSPYVNSLTEV 1920
Query: 2050 QPPLYPTY-AQTDYQSSFGNVPTSFASSNFMSDSRYARTTNSSPGSSNRPLPPLPPTPPS 2109
Q + P + DY S+F +S+PG S+RP PLPPTPP
Sbjct: 1921 QMSVPPGFQVHADYLSAFSG--------------------SSTPGGSSRP--PLPPTPPP 1980
Query: 2110 FSATPLNVPSRSPTSQSSMYVQHNAGATDASQ------------------------YSSP 2169
FS++P N+PS SQ SMY Q+ G TD Q Y P
Sbjct: 1981 FSSSPYNLPSFKANSQMSMYNQNIGGTTDLPQAQSSVVPMIDARLGSVSASAAGVSYPPP 2040
Query: 2170 PLIPP-GFSRLPSMPFNMHGNIPVQQSDNITGIAQNSRGPQPSMQSMQPLPHLQPLQPPQ 2229
++PP F+R S+P ++GN P QQ I QN PQ S+QSM L LQPLQPPQ
Sbjct: 2041 HIMPPLVFNRPASIPATLYGNTPAQQQGEIM---QNLSIPQSSIQSMHSLAPLQPLQPPQ 2100
Query: 2230 LPCPQQ----LRLPIQNSLHTEQSGSMSQSHVQMQVQPPQILQQSQTSPYHLYYQTQSQE 2289
+P PQQ LR P+Q S EQ S+ Q+ VQMQV P Q +Q Q SP H YYQ+Q QE
Sbjct: 2101 VPRPQQPPQHLRPPMQASQQLEQVTSL-QNPVQMQVHPLQTMQPPQVSPIHTYYQSQQQE 2160
Query: 2290 NIPHPQQ-------PQAAHHQVDAATQQQMDSSMSLQQYFSSPEAIQSLLSDRDKLCQLL 2293
P QQ PQ HHQ D +QQQ D +MSL +YF SPEAIQSLLSDR+KLCQLL
Sbjct: 2161 FSPAQQQQQVERTQPQVQHHQGDIGSQQQQDPAMSLHEYFKSPEAIQSLLSDREKLCQLL 2198
BLAST of Spo15904.1 vs. UniProtKB/TrEMBL
Match:
K7K9K2_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_02G195600 PE=4 SV=1)
HSP 1 Score: 2140.5 bits (5545), Expect = 0.000e+0
Identity = 1231/2212 (55.65%), Postives = 1553/2212 (70.21%), Query Frame = 1
Query: 130 MGRPEPCVLFAQTFVHPQLDEYVDEVMFAEPIVISACEFLEQNAPSASSAVTLLGATSPP 189
MGRPEPCVLFAQ FVHP LDEYVDEVMF+EPIVI+ACEFLEQ+A S + AVTL+GATSPP
Sbjct: 1 MGRPEPCVLFAQNFVHPHLDEYVDEVMFSEPIVITACEFLEQSASSVAQAVTLVGATSPP 60
Query: 190 SFALEVFVQCEGEARFRRLCQPFLYSHSSSNVLEVEAVVTNHLVVRGSYRSLSLVVYGNT 249
SFA+EVFV CEGE RFRRLCQPFLYSHSSSNVLEVEAVVT+HLVVRGSYRSLSLV+YGNT
Sbjct: 61 SFAIEVFVHCEGETRFRRLCQPFLYSHSSSNVLEVEAVVTSHLVVRGSYRSLSLVIYGNT 120
Query: 250 AEDLGQFNVEFDLDNSVAKLVSSSEAKLEDLPPLLHSRNPKFEDAISSLKALSSMLPAAD 309
AEDLGQFN++ D DN++ LV S+E KLEDLPP L S + +D+ S L LS +PA D
Sbjct: 121 AEDLGQFNIDID-DNALTDLVDSTEGKLEDLPPALRSTSFTIDDSRSYLNVLSIPVPATD 180
Query: 310 ISVQMRQFLQLIFKILDIPNFGEFSQKVMALITSAVASFFTDDLASAGITLQELNLGALA 369
ISV++ FL L+ K L+ + G+ K++ + SA++S+ + D+ + ++ +
Sbjct: 181 ISVEVNLFLGLMLKFLEFSDLGDAGHKIVNTVVSAISSYISSDICESIGGRYQMRKRS-E 240
Query: 370 NVKGSFHVFTEAQRNLFELCKRYVGAGHIYGDFLVGCS----FLESEAELATSKELVDML 429
N++ V EA++ L E+ Y + CS +LE +AE+ SK LVDM
Sbjct: 241 NLEELHIVVDEARKELLEV---YKVLHKKFRSESSECSSDAYYLEMDAEMLDSKTLVDMF 300
Query: 430 FQHFPFNRDTVFVGHPHLSKSNNVLLWLSLALILCSGKESCFHYVNGGGMEQLAHFFSQE 489
Q+F F R++ +G LS+S + LL LS+A +LCSG++S F +V+ GGMEQLA FFS++
Sbjct: 301 NQYFHFQRNSSCIGDHCLSQSEHALLGLSMAYLLCSGRKSGFQFVSSGGMEQLALFFSKD 360
Query: 490 MLNSATVTLMLLGAMEQATRYSVGCEGLLGWWPREGEKVPSGVSEGYSLLLKLLMQKQRH 549
NS T+ L+LLG +E+ATRYSVGCE LGWWPRE + +PS +SEGYS LLKL++ K RH
Sbjct: 361 GQNSTTIMLLLLGVVERATRYSVGCEAFLGWWPREDDSIPSSISEGYSHLLKLILSKPRH 420
Query: 550 DIAALTTYILHRLRFYEVASRYEYAVLSILGGLSGDGNATTTTLSLLLKAKLQLRKLLKM 609
D+A+L TY+LHRLRFYE+ASRYE AVLS+LG +S G T TL++L +++ LRKLLK+
Sbjct: 421 DVASLATYLLHRLRFYEIASRYESAVLSVLGNISTVGRVTDVTLNMLSSSEILLRKLLKL 480
Query: 610 INSSGWVEDPSPMACAIRSLMVASSEGSLSFKATSKLIASSSCRFLHQKIDAKLLSLLKE 669
INS G +EDPSP+ACA RSL+ ++G LS+K TS LI+SSSC F ID+ LL LLKE
Sbjct: 481 INSRGPIEDPSPIACASRSLITGQTDGLLSYKTTSSLISSSSCCFSDCDIDSHLLGLLKE 540
Query: 670 RGFLPLSVALISSTELRAERGHALEAFLDLTSSIEATILSLMFCRSGLVFLLQHPDLSST 729
RGFL LS AL+SS++LR E GHA+E F+D+TSSIEA ILS +FCRSGL+FLLQ P+LSST
Sbjct: 541 RGFLSLSTALLSSSKLRMESGHAMEIFMDVTSSIEAVILSFLFCRSGLIFLLQDPELSST 600
Query: 730 IIRAFKGDDNVTK--FIPLRYAYVLVSKGFFCHLQEVGTNLEMHLRVVHAVDRLLTSTPC 789
+I A + K IPLRYA +L+SKGFFC E+G +EMHL++V+A+D LL+S P
Sbjct: 601 LIHALRSGHRGNKEDCIPLRYASILISKGFFCSPLEIGMIIEMHLKMVNAIDSLLSSNPQ 660
Query: 790 SEELLWVLWELCGLSRSDGGRQALLALTHFPEAFTILIEALHSAKELEPISLSCGTP-LN 849
SEE LWV+WEL LSRSD GRQALLAL +FPEA +ILIEAL S KE E + + G+ +N
Sbjct: 661 SEEFLWVVWELSTLSRSDCGRQALLALGNFPEAVSILIEALSSFKESESVGKNSGSSAVN 720
Query: 850 LAIFHAAAEICEIIVTDPTSSSVSVWIDHATDLHRALHSSSPGSNRKDAPSRLLEWIDAG 909
L IFH+AAEI E IVTD T+SS+ WI HA +LHRALH SSPGSNRKDAPSRLLEWIDAG
Sbjct: 721 LTIFHSAAEIIEAIVTDSTASSLGSWIGHALELHRALHFSSPGSNRKDAPSRLLEWIDAG 780
Query: 910 VVYHKNGAVGLLRYAAVLAFGGDAHITSTNILVSDSMDVEDVVGDTSNESDVNVVESLLG 969
VVYHK G +GLLRYAAVLA GGDA +T+ +LVSD DVE+VVG++S+ SD+NV+E+L G
Sbjct: 781 VVYHKQGGIGLLRYAAVLASGGDAQLTT--VLVSDLTDVENVVGESSSGSDINVMENL-G 840
Query: 970 KLVSDKSFDGVALRDSSVAQLITAIRILALISENADVAASLFDEGAMTVVYIILVNTRLM 1029
K +S+KSFDGV LRDSS+AQL TA+RIL+ ISEN VAA+L+DEGA+ V+Y ILVN R M
Sbjct: 841 KFISEKSFDGVTLRDSSLAQLTTALRILSFISENPTVAATLYDEGAVIVIYAILVNCRFM 900
Query: 1030 LERSSNNYDYLVDDGIECSSTSDLLLERNREKYLVDLLIPSLLLLIELLQRLKEAKEQHR 1089
LERSSNNYDYLVD+G EC++TSDLLLERNRE +VDLL+PSL+LLI LLQ+L+EAKEQHR
Sbjct: 901 LERSSNNYDYLVDEGTECNATSDLLLERNRELNIVDLLVPSLVLLITLLQKLQEAKEQHR 960
Query: 1090 NTKLMNVLLRLHRELSPKLAACSVDLSSSYPDSALGLETVCHLIVSALACWPAHAWAPGL 1149
NTKLMN LLRLH E+SPKLAAC+ DLSS YPD A+G VCHL+ SALA WP H W+PGL
Sbjct: 961 NTKLMNALLRLHSEISPKLAACADDLSSPYPDYAIGYGAVCHLVASALAFWPVHGWSPGL 1020
Query: 1150 FHTLLDSVQAASLLALGPKETCSMLYLLIDLFPDEGVWHWKNGMPLLTASRSLAVGTLLG 1209
FHTLL SVQ+ SLL LGPKETCS+LYLLIDLFP+E +W W +GMPLLTA R LAVG +LG
Sbjct: 1021 FHTLLASVQSTSLLTLGPKETCSLLYLLIDLFPEEDIWLWTSGMPLLTARRMLAVGNILG 1080
Query: 1210 PLEERKVNWYLERPCVEILLGQLSPHLDKIAQVIHHYAVGTLVVVQDILRILILRIARQR 1269
P +ER VNWYLE E L+GQL+PHLDKIA++I HYAV LVV+QD+LR+ ++RIA Q
Sbjct: 1081 PQKERHVNWYLESGHQEKLVGQLAPHLDKIAEIILHYAVSALVVIQDLLRVFVIRIACQN 1140
Query: 1270 TEFATVLLQPMISWIENHLIGTSPLDEIDAYKGYKLLDFLSLLLEHPLAKTLLLKEGTLQ 1329
++A++L++P +S + +H+ +S + DAYK +LLDFL LLEHPL K LLL+EGTLQ
Sbjct: 1141 AKYASMLIKPALSSVIHHVSESSCPSDTDAYKVLRLLDFLVSLLEHPLGKGLLLREGTLQ 1200
Query: 1330 ILTKVLGR----IDAHVKSVSNGRSSLNSRFNIYNCCLPVLRSLSLICETRSAQKLHLLL 1389
ILTKVL R +D K + + RSS FN ++ CLP+ + L+ + ++
Sbjct: 1201 ILTKVLDRCFVIVDVDGKQI-HDRSSAKCSFNFFSWCLPIFNFMMLLFRSEISRHYPRRD 1260
Query: 1390 D--GNETLSTEDSAVIFSYIFKLCQHLPAGNELVACLGTFKDLASTAIGQSTLLDTYLFI 1449
D E LS ED A+I Y+ K CQ LP G EL+ACL FK+LAS GQ T+ I
Sbjct: 1261 DFKNFEKLSDEDCALILRYLLKSCQVLPVGKELLACLTAFKELASCGEGQMAFGATHFGI 1320
Query: 1450 RSSSNNENQPGRSQESYQFYNRLIDAEWKGC-PLLTCWINLYGSVET-EGLSTCSVEAIG 1509
S + E +P + + YN AEW C PLL+CW+ L+ S++T EGLS ++EA
Sbjct: 1321 HSHA-LELEPRKDDRNVN-YNVSSVAEWIKCPPLLSCWMKLFRSIDTKEGLSAYAIEAAY 1380
Query: 1510 LLSIGVLRFCMSGRSFNEDRIDALKYFFGLSSDVDGANGVLEENIKYVQDMCTLLSSKIT 1569
LS+G L+FCM G S N DR+ ALKY FG+S+D+ ++G EENI Y+ + LLSSK +
Sbjct: 1381 ALSVGSLQFCMDGDSLNSDRVVALKYLFGISNDMTRSDGFPEENINYILEFSALLSSKAS 1440
Query: 1570 DDEYAVSSHLKATLHQAVESVKFLSLLLLKPIDSFKDDDAFTDEVLLFSSDAKLSSKIHL 1629
D+ V+S + L+Q ESVK LSL+L +P+DS K +D V+L ++ + SK H
Sbjct: 1441 MDDCLVNSQSQIPLYQVSESVKSLSLVLQRPVDSMKLED-----VVLHQNEVLVFSKTHQ 1500
Query: 1630 LTDSSSERVENELNFGELGESFQWECPENLRNRLSQAGLLGKRKISAVDGVNRHGKGENA 1689
L ++S E++++ LN G LG+ F WECPE L +RL+Q L KRK+ ++DG R +GE+
Sbjct: 1501 LLENSVEKIDDHLNVGGLGDKFLWECPETLPDRLTQTTLAAKRKLPSMDGPVRRARGESF 1560
Query: 1690 PVDV-------RGSGSSMAPPIPTRRDTFRQRKPNTSRPPSMHVDDYVARERNESSNPNV 1749
D+ RG S PTRRD FRQRKPNTSRPPSMHVDDYVARE+N NV
Sbjct: 1561 QADMSSQNAFSRGVAQSAVSSGPTRRDAFRQRKPNTSRPPSMHVDDYVAREKNVEGVTNV 1620
Query: 1750 IAVPRLGSSGGRPPSIHVDEFMARQRERHNSVPIAGVEASSQGKKITLEIDNDTEKSNRS 1809
I+VPR GS+GGRPPSIHVDEFMARQRERHN EA K + DTEK N+S
Sbjct: 1621 ISVPRAGSTGGRPPSIHVDEFMARQRERHNPSATVVGEAVGHPKDASPVKPTDTEKLNKS 1680
Query: 1810 KQLKVDLDDDLQ--EIMFDAEENEADDKLPFPQADDDVQQTASASAETSPQHSIVEETES 1869
KQLK DL DDLQ +I+FD EE++ DDKLPFPQ DDD+QQ A E S HSIVEETES
Sbjct: 1681 KQLKTDLYDDLQGIDIVFDGEESDPDDKLPFPQLDDDLQQPAPVIIEQSSPHSIVEETES 1740
Query: 1870 EAKD------LEATVASNAVENAQSELSSRMSVSRPEMQLRREPSVSSENKYFEQPDETN 1929
+ D + + SN EN Q+E SS+MS SRP+M L RE SVSS+ KY EQ D+T
Sbjct: 1741 DVVDSSQFSQMGTPLGSNIDENGQTEFSSKMSGSRPDMSLTRESSVSSDRKYVEQADDTK 1800
Query: 1930 NKNPTMVSGGFDSVAVAKTSGFPASVYGKHSPSGAQVPADARIPPSSFIPDNNLQRVGNV 1989
N SG +DS V+ + FP S+Y S S Q PAD+R+ +++ N+ Q G
Sbjct: 1801 NVQ-ARPSGRYDS--VSSNTSFPMSLYNNPSTS-MQSPADSRMVSQNYLLKNSPQHAG-- 1860
Query: 1990 LLGRSQGRHEQKHLPNQPPLPPTPPPSTISPVRSQVSENQTSR-SPFVNPHNDVQPPLYP 2049
+ SQG ++Q+ L NQPPLPP PPP T+SPV S +++ SPFVN Q P+
Sbjct: 1861 IASGSQGLYDQRFLTNQPPLPPMPPPPTVSPVISHATDSVPGHSSPFVNSLAGTQRPV-A 1920
Query: 2050 TYAQTDYQSSFGNVPTSFASSNFMSDSRYARTTNSSPGSSNRPLPPLPPTPPSFSATPLN 2109
++DY S F N T+ ASS + DS+Y+RT+ SSPG +R PPLPPTPP F++ N
Sbjct: 1921 FQVRSDYSSPFINGSTA-ASSVPVPDSKYSRTSVSSPGGPSRVAPPLPPTPPPFASNQYN 1980
Query: 2110 VPS-RSPTSQSSMYVQHNAGATDASQ------------YSSPPLIPPGFSRLPSMPFNMH 2169
+PS ++ SQ SMY Q + GAT+ SQ Y +PP++ GFSR SMP M
Sbjct: 1981 LPSVKTSASQPSMYNQTSIGATELSQASISSSGARLSSYPNPPMMSAGFSRSASMPLTMF 2040
Query: 2170 GNIP-VQQSDNITGIAQNSRGPQPSMQSMQPLPHLQPLQPPQLPCPQQ----LRLPIQNS 2229
GN P QQ++N I Q+ P S QSM P+ LQPLQPPQLP P Q LR P+ ++
Sbjct: 2041 GNSPNQQQTENQPSILQSISVPPASFQSMHPVTQLQPLQPPQLPRPPQPPQLLRPPV-HA 2100
Query: 2230 LHTEQSGSMSQSHVQMQVQPPQILQQSQTSPYHLYYQTQSQENIPHPQQPQAAHHQVDAA 2289
L + G QS+VQ+ Q Q+LQQ Q YYQTQ Q+ QQ + ++
Sbjct: 2101 LQQLEQGMAVQSNVQVHHQ-LQMLQQPQVPSMQTYYQTQQQQFSHEQQQVEYTQQPGNSL 2160
Query: 2290 TQQQMDSSMSLQQYFSSPEAIQSLLSDRDKLCQLLEQHPKLMQMLQDQLGHL 2293
+QQQ D++MSL +YF SPEAIQSLLSDRDKLCQLLEQHPKLMQMLQ++LG L
Sbjct: 2161 SQQQQDAAMSLHEYFKSPEAIQSLLSDRDKLCQLLEQHPKLMQMLQEKLGQL 2186
BLAST of Spo15904.1 vs. TAIR (Arabidopsis)
Match:
AT3G05680.2 (embryo defective 2016)
HSP 1 Score: 1782.7 bits (4616), Expect = 0.000e+0
Identity = 1113/2232 (49.87%), Postives = 1432/2232 (64.16%), Query Frame = 1
Query: 130 MGRPEPCVLFAQTFVHPQLDEYVDEVMFAEPIVISACEFLEQNAPSASSAVTLLGATSPP 189
M R EPCVLFAQTFVHPQLDEYVDEV+FAEP++I+ACEFLEQNA S+S AV+L+GATSPP
Sbjct: 1 MVRSEPCVLFAQTFVHPQLDEYVDEVIFAEPVIITACEFLEQNASSSSQAVSLVGATSPP 60
Query: 190 SFALEVFVQCEGEARFRRLCQPFLYSHSSSNVLEVEAVVTNHLVVRGSYRSLSLVVYGNT 249
SFALEVFV+CEGE++F+RLC PFLY+ S+ LEVEAVVTNHLVVRGSYRSLSL+VYGN
Sbjct: 61 SFALEVFVRCEGESKFKRLCNPFLYTPSAPYPLEVEAVVTNHLVVRGSYRSLSLIVYGNI 120
Query: 250 AEDLGQFNVEFDLDNSVAKLVSSSEAKLEDLPPLLHSRNPKFEDAISSLKALSSMLPAAD 309
+DLGQ+N+ + SV +VSS+E LEDLP +LHS N E+ +SSL +S L A D
Sbjct: 121 VKDLGQYNIILE-GRSVTDIVSSTEGNLEDLPLVLHSVNRTIEECLSSLDIVSLPLAAVD 180
Query: 310 ISVQMRQFLQLIFKILDIPNFGEFSQKVMALITSAVASFFTDDLASAGITLQELNLGALA 369
+ V++++ LQL+ KI D + K + + S V+S+ TD++ L+ N A+
Sbjct: 181 LPVEVKRLLQLLLKIFDKLATNDVVNKFVDTVVSGVSSYVTDNV---DFFLKNKNCSAVT 240
Query: 370 NV--KGSFHVFTEAQR----NLFELCKRYVGAGHIYGDFLVGCSFLESEAELATSKELVD 429
+ G FH + + +L E+ + V G SFLESE LATS++LV
Sbjct: 241 SSLDSGLFHDIVDRVKEDILDLNEIQESDVALGLF--------SFLESETYLATSQQLVV 300
Query: 430 MLFQHFPFNRDTVFVGHPHLSKSNNVLLWLSLALILCSGKESCFHYVNGGGMEQLAHFFS 489
ML + F RD++ P LSK LL LSLA +LCSG+E C +VN GGM+QL + F
Sbjct: 301 MLSPYIQFERDSLCTVLPKLSKGKATLLGLSLAFLLCSGREGCLQFVNSGGMDQLVYLFG 360
Query: 490 QEMLNSATVTLMLLGAMEQATRYSVGCEGLLGWWPREGEKVPSGVSEGYSLLLKLLMQKQ 549
+ NS T+TL+LLG +EQATR+SVGCEG LGWWPRE +PSG SEGY LLLKLLMQK
Sbjct: 361 HDGQNSTTITLLLLGVVEQATRHSVGCEGFLGWWPREDGSIPSGKSEGYCLLLKLLMQKP 420
Query: 550 RHDIAALTTYILHRLRFYEVASRYEYAVLSILGGLSGDGNATTTTLSLLLKAKLQLRKLL 609
H+IA+L YIL RLR YEV SRYE+AVLS L GLS A T L++L AK QL+KL
Sbjct: 421 CHEIASLAIYILRRLRIYEVISRYEFAVLSALEGLSNSHGAATHNLNMLSDAKSQLQKLQ 480
Query: 610 KMINSSGWVEDPSPMACAIRSLMVASSEGSLSFKATSKLIASSSCRFLHQKIDAKLLSLL 669
++ S G VEDPSP A A RSL+ SEG LS+KATSKL +S +C F ID+ +L+LL
Sbjct: 481 NLMKSLGSVEDPSPSAYAERSLVSDHSEGWLSYKATSKLTSSWTCPFYSSGIDSHILALL 540
Query: 670 KERGFLPLSVALISSTELRAERGHALEAFLDLTSSIEATILSLMFCRSGLVFLLQHPDLS 729
KERGFLPLS AL+S EL ++ G ++ F D+ I ILS MF R+GL FLL HP+L+
Sbjct: 541 KERGFLPLSAALLSMPELHSKVGDIMDVFTDIAMFIGNIILSFMFSRTGLSFLLHHPELT 600
Query: 730 STIIRAFKG--DDNVTKFIPLRYAYVLVSKGFFCHLQEVGTNLEMHLRVVHAVDRLLTST 789
+TII++ KG D N + +PL YA +L+SKGF C L E+G NLEMHLRVV AVDRLL S
Sbjct: 601 ATIIQSLKGSVDLNKEECVPLHYASILISKGFTCSLLEIGINLEMHLRVVSAVDRLLKSI 660
Query: 790 PCSEELLWVLWELCGLSRSDGGRQALLALTHFPEAFTILIEALHSAKELEPISLSCG-TP 849
+EE LW+LWEL +SRSD GR+ALL L FPEA +LIEALHSAK++EP + G +P
Sbjct: 661 QQTEEFLWILWELRDVSRSDCGREALLTLGVFPEALAVLIEALHSAKDMEPAVENSGISP 720
Query: 850 LNLAIFHAAAEICEIIVTDPTSSSVSVWIDHATDLHRALHSSSPG-SNRKDAPSRLLEWI 909
LNLAI H+AAEI E+IV+D T+S + WI+HA LH+ALH+ SPG SNRKDAPSRLL+WI
Sbjct: 721 LNLAICHSAAEIFEVIVSDSTASCLHAWIEHAPVLHKALHTLSPGGSNRKDAPSRLLKWI 780
Query: 910 DAGVVYHKNGAVGLLRYAAVLAFGGDAHITSTNILVSDSMDVEDVVGDTSNESDVNVVES 969
DAGVVYHK+G GLLRYAAVLA GGDA ++S++IL D E+ G+++N S++NV+++
Sbjct: 781 DAGVVYHKHGVGGLLRYAAVLASGGDAQLSSSSILALDLTPAENGAGESTNVSEMNVLDN 840
Query: 970 LLGKLVSDKSFDGVALRDSSVAQLITAIRILALISENADVAASLFDEGAMTVVYIILVNT 1029
L GK++ +KSF+GV L DSS++QL TA+RILALIS+N+ VAA+L+DEGA+TVVY ILVN
Sbjct: 841 L-GKVIFEKSFEGVNLSDSSISQLTTALRILALISDNSTVAAALYDEGAVTVVYAILVNC 900
Query: 1030 RLMLERSSNNYDYLVDDGIECSSTSDLLLERNREKYLVDLLIPSLLLLIELLQRLKEAKE 1089
M ERSSN YDYLVDD CSS SD L ERNRE+ LVDLLIPSL+LLI +LQRL+ KE
Sbjct: 901 SFMFERSSNIYDYLVDDDHGCSSISDFLSERNREQSLVDLLIPSLVLLISVLQRLQGTKE 960
Query: 1090 QHRNTKLMNVLLRLHRELSPKLAACSVDLSSSYPDSALGLETVCHLIVSALACWPAHAWA 1149
Q+RNTKLM LLRLHRE+SPKLAAC+ DLSS YPDSALG VCHLIVSAL CWP + W
Sbjct: 961 QYRNTKLMKALLRLHREVSPKLAACAADLSSHYPDSALGFGAVCHLIVSALVCWPVYGWI 1020
Query: 1150 PGLFHTLLDSVQAASLLALGPKETCSMLYLLIDLFPDEGVWHWKNGMPLLTASRSLAVGT 1209
PGLFHTLL VQ +S+ ALGPKETCS L +L D+ P+EGVW WK+GMPLL+ R LAVGT
Sbjct: 1021 PGLFHTLLSGVQTSSVPALGPKETCSFLCILSDILPEEGVWFWKSGMPLLSGLRKLAVGT 1080
Query: 1210 LLGPLEERKVNWYLERPCVEILLGQLSPHLDKIAQVIHHYAVGTLVVVQDILRILILRIA 1269
L+GP +E+++NWYLE +E L+ L+P+LDKIA++I H+AV LVV+QD+LR+ I+RIA
Sbjct: 1081 LMGPQKEKQINWYLEPGPLEKLINHLTPNLDKIAKIIQHHAVSALVVIQDMLRVFIVRIA 1140
Query: 1270 RQRTEFATVLLQPMISWIENHLIGTSPLDEIDAYKGYKLLDFLSLLLEHPLAKTLLLKEG 1329
QR E A++LL+P+ S I + ++ S + +AY Y+ L+FL+ LLEHP AK LLL+EG
Sbjct: 1141 CQRVEHASILLRPIFSSIRDGILDQSSTRDTEAYMVYRYLNFLASLLEHPHAKGLLLEEG 1200
Query: 1330 TLQILTKVLGR-IDAHVKSVSNG-RSSLNSRFNIYNCCLPVLRSLSLICETR----SAQK 1389
+Q+L +VL R DA S + + S ++ C+P RS+SL+C+++ QK
Sbjct: 1201 IVQLLVEVLERCYDATYPSENRVLEYGIVSASSVIQWCIPAFRSISLLCDSQVPLLCFQK 1260
Query: 1390 LHLLLDGNETLSTEDSAVIFSYIFKLCQHLPAGNELVACLGTFKDLASTAIGQSTLLDTY 1449
LL +LS +D A+IF ++ K CQ LP GNEL++CLG FKDL+S GQ L+ +
Sbjct: 1261 KELL----ASLSAKDCALIFPFVLKFCQVLPVGNELLSCLGAFKDLSSCGEGQDGLV-SL 1320
Query: 1450 LFIRSSSNNENQPGRSQESYQFYNRLIDAEWKGCPLLTCWINLYGSVET-EGLSTCSVEA 1509
LF S E+ R ++ +D + K P L+CWI L S+ + +GLS+ +++A
Sbjct: 1321 LFHLFSGTEESVSERWCDTNSLSLDQLDMK-KNPPFLSCWIKLLNSINSKDGLSSLAMKA 1380
Query: 1510 IGLLSIGVLRFCMSG--------------RSFNEDRIDALKYFFGLSSDVDGANGVLEEN 1569
+ +LS+G +R C+ G S ++ ALK FGL S+ G + EEN
Sbjct: 1381 VNVLSVGSIRLCLDGDISLKIIKQKMCLVASDAVHKVAALKSLFGLPSEFSGTDTFREEN 1440
Query: 1570 IKYVQDMCTLLSSKITDDEYAVSSHLKATLHQAVESVKFLSLLLLKPIDSFKDDDAFTDE 1629
I ++ M TLLSS + + + ++ +K LH+A +S+ LSLL D D+
Sbjct: 1441 IGLIEQMVTLLSSMTSGSDSSATAEMKPYLHEASQSL--LSLL----------KDGNIDD 1500
Query: 1630 VL----LFSSDAKLSSKIHLLTDSSSERVENELNFGELGESFQWECPENLRNRLSQAGLL 1689
++ +F S L + D S +E++L L + F WECPE L RL Q+ L
Sbjct: 1501 IISCKGVFVSPGNLD-----MDDLVSRNIEDDLYQRGLEDKFWWECPETLPERLPQSSLP 1560
Query: 1690 GKRKISAVDGVNRHGKGENAPVDV-------RGSGSSMAPPIPTRRDTFRQRKPNTSRPP 1749
KRK+ ++ +R KGEN+ VD+ RG GS PP PTRRD FRQRKPNTSRPP
Sbjct: 1561 AKRKLPTLESSSRRAKGENSSVDIPTQNSIQRGMGSVSLPPAPTRRDAFRQRKPNTSRPP 1620
Query: 1750 SMHVDDYVARER--NESSNPNVIAVPRLGSSGGRPPSIHVDEFMARQRER-HNSVPIAGV 1809
SMHVDDYVARER + + N N I + R GSS GRPPSIHVDEFMARQRER N I
Sbjct: 1621 SMHVDDYVARERSVDTAGNSNAITISRAGSSSGRPPSIHVDEFMARQRERGQNPSTIVVG 1680
Query: 1810 EASSQGKKITLEIDNDTEK-SNRSKQLKVDLDDDLQ--EIMFDAEENEA-DDKLPFPQAD 1869
EA Q K T DTEK + + KQ K D DDDLQ +I+FD EE E DDKLPF Q D
Sbjct: 1681 EAVVQVKNPTPA--RDTEKVAGKPKQFKADPDDDLQGIDIVFDGEECEGPDDKLPFLQPD 1740
Query: 1870 DDVQQTASASAETSPQHSIVEETESEA------KDLEATVASNAVENAQSELSSRMSVSR 1929
+++ Q A E + HSIVEETES+A + VASN ENAQSE SSR+SVSR
Sbjct: 1741 ENLMQPAPVMVEQNSPHSIVEETESDANGSSQFSHMGTPVASNVDENAQSEFSSRISVSR 1800
Query: 1930 PEMQLRREPSVSSENKYFEQPDETNNKNPTMVSGGFDSVAVAKTSGFPASVYGKHSP-SG 1989
PEM L REPS+SS+ K+ EQ DE P +G SGF + H P S
Sbjct: 1801 PEMSLIREPSISSDRKFVEQADEAKKMAPLKSAG-------ISESGF---IPAYHMPGSS 1860
Query: 1990 AQVPADARIPPSSFIPDNNLQRVGNVLLGRS-QGRHEQKHLPNQPPLPPTPPPSTISPVR 2049
Q D R+ P F + Q G++ G S +G +EQK +PNQPPLP PPPS +SPV
Sbjct: 1861 GQNSIDPRVGPQGFYSKSGQQHTGHIHGGFSGRGVYEQKVMPNQPPLPLVPPPS-VSPVI 1920
Query: 2050 SQVSENQTSR-SPFVNPHND-------VQPPL---YPTYAQTDYQSSFGNVPTSFASSNF 2109
S++ +++ SPF++ + PPL P Y+ Y S P + +F
Sbjct: 1921 PHSSDSLSNQSSPFISHGTQSSGGPTRLMPPLPSAIPQYSSNPYASL---PPRTSTVQSF 1980
Query: 2110 MSDSRYARTTNSSPGSSNRPLPPLPPTPPSFSATPLNVPSRSPTSQSSMYVQHNAGATDA 2169
+ TT PT S L+V + ++ HN
Sbjct: 1981 GYNHAGVGTTEQQQSG---------PTIDHQSGN-LSVTGMTSYPPPNLMPSHN------ 2040
Query: 2170 SQYSSPPLIPPGFSRLPSMPFNMHGNIPVQQSDNITGIAQNSRGPQ-PSMQSMQPLPHLQ 2229
+S P +P F PS + G+ P Q + I Q+ P + SMQ +
Sbjct: 2041 --FSRPSSLPVPFYGNPS---HQGGDKP-QTMLLVPSIPQSLNTQSIPQLPSMQLSQLQR 2100
Query: 2230 PLQPPQLPCPQQLRLPIQNSLHTEQSGSMSQSHVQMQVQPPQILQQSQTSPYHLYYQTQS 2289
P+QP PQ +R PIQ S +EQ SM Q+ Q+ + Q++QQ+Q PY+ Q Q
Sbjct: 2101 PMQP-----PQHVRPPIQISQPSEQGVSM-QNPFQIPMHQMQLMQQTQVQPYYHPPQQQE 2152
Query: 2290 QENIPHPQQPQAAHHQVDAATQQQMDSSMSLQQYFSSPEAIQSLLSDRDKLCQLLEQHPK 2293
+ QQ A Q A T QQ +S MSL YF SPEAIQ+LLSDRDKLCQLLEQHPK
Sbjct: 2161 ISQVQQQQQHHAVQGQQGAGTSQQQESGMSLHDYFKSPEAIQALLSDRDKLCQLLEQHPK 2152
The following BLAST results are available for this feature: