Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: five_prime_UTRexonpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
AACTCCAAATCTGGCAAATCACCCAGGCGCATCTGCTCTGCATCTCCTCCTCCTCCTCCTCCTCCTCCTTGTCGGGAAATTACAACAGCAGCATCTTGTAGTTGTATCACCATTTCACTACCTCTAGTCGGCGGTGCCTCTGTCAACCATCTCCTCCACTGCTGCTGCTATTCTCGGTAATGGATCAGGGACGGGCCTGATTTTAGAGAGAAGGAAAAACGGAGCAATCGAAAATGCTGAGATTACGGGCATTTCGGCCGACGAATGAGAAGATTGTCAAAATTCAACTTCATCCAACTCATCCATGGCTAGTCACCTCTGATGCCTCTGATCATGTCTCCGTTTGGAATTGGGAACATCGCCAGGTCTATACATACTTACTCTTACATTTCCCCATTTTTTGGGTGTGAATTTAAATGTATGAAATGAGTATGTATGAATTTGTGTTTTTTTTTTGTTGCTGTTTGGTTTTGATTATAGGTTATTTATGAGCTAAAAGCTGGTGGAGTTGATGAAAGGCGGCTTGTTGGTGCCAAATTGGAGAAGCTTGCCGAGGGCGAGTCTGGTAAATATCCAACGATCTCTTGTTCATTTATCCAGCATATGAATTGTGTTTGGGGGTATTTAATGTTATCACTTCTGTATTGAGTTAGAGTGCTGTTTGTTGCTTTGGTTGTATGATCATGAGATTAAACTAAGGCAGCTTTTATTTGAACTCTATGGTTTTCATGAATAGTGATATAGATATACAAATGAACACTCTCGATGCCCACTTCACTTCGTAAGAAACTCATAGTTTGTGTAGTGTGGATAAGCTCTTGAGTCTTGCCTTGCTCGATGGAGCGAGTACATGTTTTGTAGTTTGAACATGCAAAAAAGTATAATTTGCCATTTAACCATTCAATACATGAACACCGCCTCCTCTATTGTCTATGAAATGTTGATTTTAACATGCAAAAGCTGTTGCTACTTGCAAGGGATTAGATTGTAGTTTGATGTTGGAGTAAGGTATCCTCTATTTTATCAGAAGGGACTAATAATATCTTTCTACAAACCATCCTCTTAACAAACTGTGAAGACGGTATTGCCAACTTTTCCATCCATTGTTTTAGATTATGCTCTCCACCGACTTCAGTTGATAGGCTTTTCTGCAACCTAATTTTTTTCCACCCGTTTTAGCCACTATTTCTTTCCTCAATTTTCCATTTTTTTTTTGTGAGGGGACAATTTTGCTTTCTTAATGCTAAAATGTCTACTCAAATGTAGCAGGCTGTTGATTCTTGTATGATTTCAGAATTTCTTTTATTTTGGTTACCCATCTAGTATTGTATTTTGTATATATATGAAGGAAATTGGGTTTCTATGTTTTGGCATAATTTGGTCTCTTTATTTTCTTGTTTAATAATACTTCCTTTGTTTATTTTAGCTGTTTCATATTGACTTTGGGCACTATTCACATGTTTCTCTCTGACCTTTGTTAGGCTTTTATAAAATAAATAGAAAACAATCATTTGGATCTTGTTAGATTCGTCTCAATATGTAGGTTACACATATAACTTTTTATAATTTTTACTAATCAATAATTAGAGTTGTTAATCGTCAAAGTTGTGCGTTGACAATCGTGCTAGTCAAAGTGGAACAACTAAAAGAAAACAGAGGAAGTACAATATATCTGTACATTGAAATTGCAAAAAGGAAAATATTAAAACATTCCCACCATTTTCCAATTCCATCAGAGCACCAATGTCCTTAAATCGCAATTTACATAAAAGAGATACCAAGAAGAAGTAATTCACATACAAAATTTGATGCTTTACTCCGGGCCTCTCAAGACTACAACATTGTCTCCTCAATTGAGGTTAATAGTTGATTCCGCACGACAAAATACAGTGGCTTGTCCTTTTTCTATATTGTTACATCATTTTTGTGTGTGTGCACTGTTTTAGATGCAACTTATTTTGATTGATGAGAGAATTGGTATCCACCCTTCTGGCTTCCAAGTACTTCATTTTTCTTTTCTAAGCCATAAAGCAGATTGTTGGCTGTTTTGCAATAAGCAGAACAATTCCAATTAAGGGCAACCTCTGTTTAGTACATTAGTAACCAAGGACACACAAGTACATAATAAAGTGACCCCTTGCCTTCTTCCAAAGGTCGCACCTACTACTATAAATCAAGTGGAGTATTATTTATTTGAGAATCTTATGTAACAGCTTTCCCCTTGATACTTTCCTAGCTGTAATGAAATGCTTTTTGAACTCCAATTCTCCAAATGGCGAGGGAGGGAAGTCTGATCAATTGGCTTGGCTTCTTCAGAAGCAAGTATTGTATATCAAGCCTATATCCTTATGCAACCAGACCATATCTGCATGCGAATAGGATGGTAGTAGAACTGAGTAGATTTAACTAGATTCATAATGGATAGCTTGTCCTTAGATTTATAAGGAACTGAGTTATCTTGTGAGATTATAACCGAGCCTGGCTTTAGACACACAAATATGAGCTCTGGCGTAGATTGACACCCTTCCTCTTTTACTGCGTGTTAGAGTAAAAATTGTTACCCGAAGATAACATTGTAAGAAAAGGCGTGAAATTCTTGAAAGTGCACTAGATGGATGCTGAATAATTGGTTTACCTATACGAGGATATTTAATTTCCCAGCTTGGCATATTTACGAGTTAGCGGTTAGCCACATCAAGACATCAAGTTGAGATTTTACCTAGGCTAGCCTTAGTGTTGGTCAAGCTGACTAAACTTTGCTAGAGGCTATCACACATCTAACAGATAAAATCGTGTTATATCTGGTGTACTCAGTGAGTTATAACGTACTAGTATTTCTCCCTCCGTCTAATACTAGTTTTGACGCTCATCAAGACACAAAGTTTTGGGAGATAGTTGTTGTTTAGTTATCAAATATTATTTATTGGAAATGTAGACGTGAACAGATAATGAGGTATTATTTTATTGAAAAGTAAATGTGATAGAAGATAGTGGAATACTTTTTTAATTAAGTGTGAGAGAGTGTGGGACCCTACATTTTTGGCAGTGGGAAGAGAGTGGTATTAAAATAATTGTTGGGCTTTTCCCAAAATAGAAGTGTAAAAACTACTCCCTCCGTCCCTTAATACTCGCACCGGTTTGACCGGTGCGGAGTTTAAGACACTTGAACTGACTTATTAATTTAATGGGTGTTAGTTGATACTTGATAGTGGGGTATTTTTTTTAATATAGTTAGTGGGGAATGTGCAAGGGGTGGGGAGTGGGGGGTGTCAATTTTTAAATGATTTTTTGTAGGGAGTAGGAGTGTAGGTGGATTAGTAGAATAGTAGGTAAGTGTGAGAAATAATATAATATTGGTAAAGAATATCCATGTATAGAAGCGGCGCAAGTATTAAGGGACGACCCGAAAAGGAAAGCGGTGCGAGTATTAAGGGACGGAGGGAGTAATCTGGAATGGATAACAATGGAAAGTGTAAATTCTAAATATAGGAGAGAGGGAGTAGTTTATAACTTCATACTATAATGTGATATACTAGTGTGTCATTTACTTGCTTTTTTTTCCCTCAAAGATTATTATCATGATTGGTTGCTTATCAATTATTAGCAAGTTAGAAAGTAAAGGTTATAGTGTTATACCATTGAGTGTAAAATATTTTTTGATTTATCATGTGACTGTTCTTGTCATTAATGTGTTTTGATTAGCCTTCGCTGATACTATTTTACTGTCGTTGACATCTTTCATCATGCTGTTGTTAGAGTCTAGAGGGAAGCCTACTGAAGCTATACGTGGAGGAAGGTGAGGTGTTCTTACTTTCTTTACTTGCAATGGTTGTTTAGTTTCCTACTGTTTCGAGGTACAAATTGAATTTTACGTGGCTGTTTTTGGTTGAAGTGTTAAGCAAGTCAACTTTTATGATGATGATGTACGGTTTTGGCAACTTTGGCGTAATCGAGCAGCAGCTGCTGAGTCTCCATCGGCAGTTAGTAATGTCACTTCTGTTTTGAGTCCTCTCGCACCAGCAACAAAAGGAAGACATTTTCTGGTCATCTGTTGTGAGAACAAAGCTATATTTTTGGACTTGGTAACAATGCGTGGTCGTGATGTTCCCAAGCAAGAACTTGATAATAGATCCCTAATGTGGTAATGCTACTAGAATCTCTAAATAAATTGTCTTTGTATATATACCCAAGATATTGGCATGAGGGGAAGAAGAAATTAGCAGTAATGCCTGAGTAGGAGGTGTGCATTTAGGCAAAACACGGTGCACAACTGCATAGCTAAGAGTAGTCATTCATATAAAAGCCATTGTTTTTTTTTTATATTAGTGCATCACGATATACGTGTCACATAAAGAAGAAAAGTACTTCCTCCGATTAGAAATAGTTGCACCACTTTGACTTTCACGTTTGCCAATGCACACATTTGACCGTTAATGTCTCTAATTATCTATTAGTAAATATTTTGTGATTATTTGAAAAGTAGACACTGAGACTAATCCAACCACGTTTTTATATGATAACATTTATATTTTATGTATGTGTAAAAAGAGAGTCAAAGTTAATATATGCATAGTGCATAAAGTGAAAATGGTGCAACTATTTCTAAACGGAGGAAGTAGGATTTATGTCTTTTGAATCTTGAAGCATTTCAGAGCCTGATATAATTACTTAATCTGTTACTCCGGCACATCTCTTGTGCCTCAAGCACTGCAATGCTTTAGACTGCCTCGAGGCCCTCAACCGTGCCTTTTGACATGCTTACCATGTTACTTGGCAACTGGCAAGTCATACATGATCTATTTTAAGTCTTTGCATCTCCGCTTTACCATCAAAGAGAAATCAATGAAAAAATATGAAAGGGGTTTTGATCCTGATATAAGATTGGACTCTAGGGTGGCTGTTGCATATGCCATTGCTTACTCGTGTTTAAATTTGGATTTGCAGTATGGAATTCTTGTATAGATCCACTGCAGTTGAAGGTCCACTAGTTGCTTTTGGTGGATCAGATGGTGTTATACGAGTTCTTTCGATGATAACTTGGAAGGTCAGACTGTGAAGCTTTTCTGATTCCTGTTGACTTCTGATATTTTGTTTCATAGAGTGTCTGATGCTTCGTTACAGCTTGCCCGAAGGTACACTGGAGGCCACAAAGGAGCTATAAATTGTCTCATGACCTTCATGGCTTCTTCAGGCGAGGTAATCCTTCGTAGAATTTGTATAAACAGCTTAACACCGACTACTGGTGCATTTGTCAAACTAATTTGTTGTTTTTTGACATCTTCTCTACTGTTTGGAGGAGTAAAAAGCATGAATCTGGTAAGATGGAGGGTCAAATAAGCCTCATCAGTTTCTTTGCCATCATTGCCCCTTGGGTCTGAGTTTCTTTTGATAATGGTATATTGGTTTAAGCCTGTGTACACTTGTTTTACTGTGGTCTGAAGTATCATTTCCTTCAGTTGCTGTTATATTCCATACTTCACAGTTCACACCAATGTCACCAACCTCTCCATGAAACATATGCATGCTTTAGGCATAGGACCATATGGTTTCTCATTCAATTGTTTCAACCATAAAGGTTGAGCTAGTTTTGGAAAACTCCACCTCCGTCCTGAAATAGATTTGACCCTTGGGTTTGGCGGGAAGATTAAAAATTGTGGATAGAGTGATAGACAGAGAAAAAGCAAGAGAAAGTTAGTTGAAAGCAAGTGTTGAAGTTTCTCAGACATCTATTTAAGGCAAGGGTGAAATCTTCTTAAAAACCAAGGGAGTAATATTTTAGCTGTCTATATTGAATTCATGCTGACTGTTAAGAATTTCCTGGCTGTTGGAGGGGAATGTGGAAGGGATTTGGAGGCTCCGAAGAACAGATGTTCTATTGTTCTTGGATATGACTACAGGAGAAGAGGTGCTAGATTATGTTTGTTACTTTGGAACTTGCTATTATAAATTAGCACACCATGTTTGGATAGTGGAATGGATATTTTTAGAGTCCTGTCACATGATGGGGATTACCTCTCTATATACGTGGATCTTGTTTGGTTGGTGGCTACCATGTCAATATTGTTGATTGATCATCACTTTTCACGTGATTATAGGCATTGCTGGTCTCAGGTGGAAGTGATGGTTTACTAATACTTTGGAGTGCCGATCATGGTCACGATTCACGAGAGCTTGTACCCAAACTTAGTCTAAAGGTAGCTGCTCTGTTCTCTTTCTTCTGATCTTATAGTCGAATATGGTAATTTCATGATGATGTTTGTTTCTTATTGGCTACTTTCATCGAGAGTTAAATTTGAAACCGTTCAACTTGCAAGCCTGTCATCATTTCTTAATCTTTTTGTGATCCATCTGAAATTTCTTTATGTTTCTTGGTACTTTTGAAGGCACATGATGGTGGAGTTGTGGCTGTAGAATTATCAAGAGTTTCTGGTGGTGCACCGCAGCTTATCACGATCGGTGCTGATAAAACATTGGCTATATGGGATACAATCTCATTCAAGGTAGGTTGTGAATGCTGCTGAGTTTTCTTTTTGGCAGTTTAATGCTTTCTGTTGTGCAGTTTCTGATTAAACTATGAGTACTATACTGGTATGTGAACAGAAAAATTAATTTGCTGTTTTATTTCTGGTATAAGATTGAGGAAACTAGGCCATTGATTGCAAATGTTGATATATGCTATCATAACAGTTGAGTTCAAGATTTTGGAATTTTTAATTTAAAGGGGCAATATCTGCTGTATAGTCAGTGTAGACGAGATAAATCTTTTTCCTGATGATTGTTCTAGATTTGCACTCTCACATAGTCACATGATTGTTCTTCTTTCTCAGGAGATGCGGCGCATAAAGCCAGTTCCTAAGATGTCTTGCCATAGTGTGGCATCTTGGTGCCATCCTCGTGCTCCAAACCTTGATATATTAACCTGTGTGAAGGATTCACATATATGGTAATGCATCTGCACTACTATTTAGATTTTCCTGACATCCACTTCTATGTTATGTTGGAAAGTGCTTTAGCAAGTTGCTTGGAGATTATTTTTGTCGTGATTAGAAATGACTGGCACTCTTTCTTACATGGGGATTATTGTCCTTTTGGCAGGGCCATTGAACATCCAACTTATTCAGTCCTGACAAGACCATTATGCGAACTTTCCTCCCTTGTACCTCCACATGCTCTTGCACCGAGCAAAAAGCTTCGGGTATGTACCATTGCAAGTTTTTTCGGTGTCGTTAGTAAGTCTGAAGTAGTAATTATGGGAGGCTTGATAATTTTGGTGAAGCTGATCATGTTAAATTATGGTTTTTTTTTTGGGTCCATGTGTTCGTGTTCGCGTTTGTGTTTTATTTTGTTCGTGTGTTTGCGTTTTTATTTTTATTTTGTGCTATTTCTTTTTATTTTGTATAGTGATATTGTAGCATGTAAATCTTAGAGCAGTTGACCCGAAAGCTAACAATATCTCAATACTTCCTCCGTCTCTTAATAGATGCATCATATTGACTTTTCACACTAGTTATTTATTAACTTTGACTACATTTATTACTAATATGTAAAGACAGATTTTGTATGTTTCATTGTAACTGGGTTAATCTCAATATATAACTTTAAAATATCTAATTTTTTTATTTTATTTTATTAATAATGTAATTAAAGATATCAGTTAAAGTTAGTGTATAAATAGTTAAAAAGTCAACATGATGTATCTATTGTGAAATGGAGGAAGTAAGTGGATAGGGGAAAATTCTAGCTTCTGATAATATCCCCTGTGAAAAAGGCTCCTGATCTTGTAGGGATTGAACGTAGATGTATGGATAAAAAGTTGATATGATGTAGGGCCCTAAGCTATGGTTGGATTCAAATCTTGTCGGTGGAGATAATGCTTAGTTCATAGGCCACATTTTCACTCAAGTTATTCATTTTTAAGATTTAGTTTGTTCAACATAGCAACCTCAGGTAAGCTTATTGTTAAAATTGCCAAACAGGATTAACATTTGTTAACAAGTATTGCTCCTTGCAGTAAGAGAGTGGTTGAAATAGTAGTACGGAGTAATATTTATTGGTAAATGTTGAATGTTAATATGATTGTAAGTCATTTGGTCTCACGGATCCTAGTGGCGGTTAACACATGTTTGAAGTCTTGTTTTGTGTTTTTGGAACCACCACAGGTATTTTCTTTTTACTGTATAACCTCAGGAAGAGCATATTGATTTCTGTCCTATATTTAAAGTTTCACTAAAATTTAAGCTTTCAAGTACTGTTCAAGTTGGTGTACTAGTATCCTGGCTGCATGGGAAGTACCGGAGCATTCACCAATGCATTAGTTTTGGGGGAAAAGGTTGTCTTCCCTGAATCCCCGCCAAGAATTAGAAAGGTAAAAAAATAGTAGACACAATTAGAGGCCTGTTAATTCTGGTAAAGGAAAGGGAAAGAAACGAAGAAAGAGAATTTGATTAGTTTATTTAGTTTCAACAGGGAATTAAATCTCATTCCTTTGAGCAGAGTTTTTCTTCTTAAATTACAAGGGAAAGTACCCTCATCTGTAATTTATCTATTTATCTGTTTGACCCTTTTTGGTTACTCAGTTCTCCACGTTCCTCCCTCTCCCATCCTCCATCTTCGGATCTTCCACAACTACCACATCCTCCAACCAGTTAAGACCACTGTCTCCTAGTCCTTCCTTGAAGCAGCCAGACTCTGGCATCCAGAAATATTATTGTTCTTTTGGATGTCAATATTTTTTTTCCCATTGAGACTCAATTTGGAGTTCGGTCAATTGGTGCCTTGCTGGTTGGGGGATTTGTTTCTTCTACTTGTTTGATTTTTGTCATTTCTCTCATGAGATATTTATATCTACTCGTATCTCTCCTAAAGTGTACGGTGTACCTTTAAACTTTTCCGGAAAGACCCACTTTGTTTTCCTCTTATCTCTTTTGATCTGATTTGGAGTTTGAATTTGCAAAGTACCAACTACGAAGTATTAAAGGTTTTGAAAACAGAGAGGCTGAAGAACTGGTGGTGGATTTGGGTCGAACTGTTCTCTATAAATGAGTTTCACATTGTTTTGGTTGGGGCCTTGGGGGGAGATGTTTTTGGTGTCTTTAGTAAAGAGAGAAGTGGATTTAGGTTGCTGGAGAGGGCTGTGCTGGGGCTAGCAGTCAAGCTGGGGTTGGGTTGATATTATAACATATTGGACGGAAGATAAGTTGGGGCTGGTCAGCTGGTGGCTTGAGGTCCATTATGTCAGAGGCCAGTTGCCAGGATGGTGAGGGTTGGGGTGTCTGGCAAAAGAGAGTGGCGGTGTTGACGAACCTGGTCGTGGTGGGTGATTTTGATCAGCTGAGAGTCAAAACTAAAAACCAAATTTGGGGAATCTACACTTTGTTTGGATAAGAGGAAATGGAGAGAAAGGGAGGGAAAGGGGAGGGAAAGGAAAGGGAGAGGGACAAGCTCTCTTTAATCTTTATTGGATAAGGGGTTGGCAGAAGGAGACGGAGACATCCATATTCTCTGCCTTTTTTAATCAACATCATCCCTCAAAATTGGAGAGTTTTGGTGCTCCACCTCCTCCTCCCTTTCTTTCCCCTCTCCTCCCCTTCCTTTCCCTTTTATTAGTATAAGTCCCAAGTGCGGTAGGGGGGGATCTTAGAAAGGGCATTTGTGGTGCACCCATTTAAACAAAGCAGAGAAGGGAACCAAAGCAAAAGTTAACTAAAATCTAGTAGGGGTAAAAGAAAATTCCCTTCCCTACTCTGAAGTCTGAAAACCTCTAACCAAATGTCCAGTTGCGGTTTGTCTTCCATATGAAATGTTCTACATGCTTCTGTCAAAATTTGAATGTCAATGCGGTTGATGCTTAATTTGATTGTTTTAGAGCTGATGTAAATCTCTCAATACATTATTGGTAAGGATAAGGGGTAATTTTCTCTAATTAGTTATTAATCAGTAAGCTTTTGTAAGTTTGGATTTGTCAGCAACTCAACGTTGACAATTTTTTTGCATTGACTGAAATAGAATTTTTTTCAACATGGATTCAAGGATTGATGCAATGTAATTGTTAATCTCTTGTAGCGTGTTGGCAGTTAGTAGGGGTAGGTATAGACGTATAGTGACTGTGTAGTATTGTAATCCCCAAAGGATTCTTAATTAGATGCTGTCATGCAGGACATGGTATTAGTAAAATTGATGGCAACCCCTTTGACTTTCTCCTTTCCATTTTAGATATGTATTTTACTTTAGAGAAATCTGTTTCACGCTGCTTCTGAAGCACAGATCATGAAGCTTGTTTAGGTAATATCTTTGCTGAACTTATAGTGTATGTCTTTTTCTTATTAAATTTCTGTACCGATCTAGTCATTGTTAATATAGTTTTTTAGGTAATATCTTTGCTGAAGTTATAGTTGATGTCTTTTTCTTATTAAATTTTTGTACTGGTTTAGTCATTGTTAATATGGTTTTTTGAAATATACTGAGATCTATTCCGTAAGGTGACAGAAAATCAAAGATTAAATAGTGAAATTATTACTTTCTACTGCAGGTTTACTGTATGGTTGCACATCCTTTACAGCCACATTTGGTTGCTACTGGAACAAATATTGGTGTAATTATTAGCGAGTTTGATTTTCGAGCTCTTCCACCTGTGGTTGCTCTACCATCACCACCCGGAAGTCGAGAGCATTCTGCTGTATATGTAGTTGAAAGAGAACTCAAATTATTGACCTTTCAGTTGTCTAATACAGCAAACCCATCTTTGGGTAGCAACGGCTCGTTGACCGAATCAGCTAGGTCTAAGGTGGATTCAGAAGCATTGCAGGTCAAACAGATAAAGAAGCACATAAGCACACCTGTTCCACATGATTCATATTCAGTTCTTTCTTTAAGCAGTACAGGAAAGTAAGTGTCTAGTACGTGTCTTAATTTTTGCTAGCTTAACTCCATGCATGTATGGCATATTGCCTACAGCCCTACACTATTCAACTCCGGGACGATACTCATGGTTTGGATTTTGAAAGTTACCACTCTTGAGGTCAAAATATTTACAAGTACAACATTGTTTCTCCAAGTGCTGATGGCTATAAAGCTGTTTGAAATTAATAGATTGATTTTCTACAGATGTTTTAAAAATGGTACTTGATAATACTTTGGAGGGTATTGGTTTGGTATAAACAATATATCGTTTGATGTATGCACATAAAAGGTTGTGACATAGTGGTAAAGTATTACAAAATTGTCTATGGTACAAAAATGTTTGTCAAACAGTTTTTTATTAGGTATTATAAGCATAAAGTTTCCTCATACATTATCATATTTATTCTATTTGATCATAAGAGAGTTAAAGAGACTAAAATTAACACCTAATCTATCAAAAAGGGACCAAGAGATCCCTATATAATCTGAAGTGTCCCATCTTATTGTGTTATCCACCAGAAATTTATATTCTGTACCTTATCTTACTTATGGTTTGATGTAGGTATCTCTCAGTGGTGTGGCCCGATATCCCGTACTTTTCTATATACAAGGTCAGCGACTGGTCCATTGTTGATTCAGGCAGTGCGCGACTCTTGGCATGGGATACCTGTCGTGATAGATTTGCAATATTAGAATCTGCTGTTGTTCCACGAATACCTGTAATTCCAAAGGGGGGGTCATCTAGAAAAGCGAAGGAAGCAGCAGCAGCTGCAGCTGCGGCTGCTGCAGCTAGTGCGGCGTCTTCTGCCTCTGTTCAAGTGCGTATTGTACTTGATGATGGGACATCTAACATATTGATGAGATCGATTGGTGAAAGGAGTGAACCGGTTTGTTCTGTATTCACTCTGTCAATTAGTGTTTCTATTTTCTACAACTTGATATGCAAAAATGTGTTAATTATGGAAGGTTTGACATGATTGGCTTTTACGGTTTCATTCTCTTCACCTTGCCTTATTTTTTCACCTAGGTTAGATGTTAACTGTTACATACTCTCTCCAATTTTGTTTTTCCGACAAGAAGTGAGGGGGTGGAAGAAAAAGATGGATAAAGTAAGAGAGAGTGAGTGGAATGGTGTAGATATTATGGGAAAAAGGTTAAGATAGTAAAATGTGTAAGCCAAAAATAGAACAAAGTAGAAGTGTGTAGAACCAAAAAGAACACCCCTTGATGGAAAATTGGTATAACAAAAGAAAACAGAGGGAGTATATATATAAAAGGGGGTAAATTTGAATAGGTTACTTCAGAATTTCATTTGTAGTTTTTGCATTGCTCACCCTGGTTTCAGCTGATGACATTAGTGGTCCTATTGGTAAATGAGAATTAGGGAACTGCTTTTGACAAAATTGTGTAGCAAATGAAACCATATGTGCAGTTAATGAAGTACTATCACCTCTTCCATTATCTTTTCCTTTTCCATTTCCTTTCAACAATAAGCTTCTACCGTTGTACTTATCAAGATTATTGATCAGAAGCCGAATTTGATTAGGGTGACCCTTTTCATGTCAGTGGAGTTATGTGTAGGATTATAAAATCGAATTGAAGTGGAAAAATTGCCCACCTGTTTTGGAATTAGATGGATGTCAGAGGAGGTAAAAGCTTTCTTGTTCAAGAAGACAGGTTATGAATGAGTGTCTATCTTCTGAGTGGTAGCTTCTTGTTCTGTGTGCGTGCCATGAATGGCGGGGGACTGAAGACTCTTCTATGGGTCCTGGGACTTGGGAGAGTGATGTCCATGCGCTTCTGGTTGGTGATTCTGTGCAGGGTAGTGGCTAGAATTGAGGTTATGCTGCATGGGTCAGGGTTTGGGAGCAGTAGAGGCTAATGAGTTTTGGAATGCAGAACGTCAGGGAGAAATCCTCCCTTGGTATGGTTTGGAAGTTGGTGTGCTTCTTGACACGTGGAGAACACATGGAACAGGATTTGGCACTATTTTTCTTAAATGTGGAAAGTGGTAGATTGATGATAATCACCTTATAATGATAATGGTCACTAGAACAAGATCTTGGGATTTCAAGCAATCAACTTGTATCTTATTATCTTCTATCTGATTATGATTGTGCATAAATAGGTTTTCCATTACGGGTGTCGATGATGCATTCTTCAAATCACCCAGTTTGTAATTTTCTATGTATGGTTTGTATAACTGACGTTAGGGGATCTGTTCAGCCTATTTTGCACATCATAATGTAATTGTAATGATTTTTCAACCAGGTTATCGGTTTGCACGGTGGTTCACTCCTTGGTGTTGCATATAGAACTTCTCGCCGTGTTAGTGCTGTTGCAGCTACTGCAATTTCAACAATTCAGTCAATGCCATTGTCAGGGTTTGGCTCCAGTGCATCTTCATTTAACACATTTGATGACGGCGTCGGTTCTGGTCCTTTATCCACTCAAAATTTTCAGCTATATAGGTGAGCCATTCTGCTTCCGTTTGGCACAGTGTGTCCTGTATTATGTTAAAAAATTATTAACTACATGCTAGTTTTTGGATTATCTGCTGATAGATGAGCTATATCTATGCTTTACTCTATATTGACTGTTAATGGTTATGTATAATGCATTATTTATTATATCTCTTTACATATAGAAAAGTTGGGTAATGTCCATTTCCGATGGAGCCAAAGATTCATTTGTTCTTTTGGTTGTGGATGTAATGGACTAAATCAAAATGGCTCTTAAAACCTCTTTGCTTCCTTTCCATGAAGTAGGAGATGGTCGTCCCGGCCGTAGTTTCCCGTTATCGCTTTTGGGTCAATTGGAAACAACCTCTCTGCAATTGCAGGGGTAAGGTTGCGTACACCCGATCCCCCCTTACCCCGCTTCTTGCGGGAGCTTCTTTGAGGCAATGGGGTAATGATAATGAATGAACATATAGAAAAGTTGGGTATACAATCAATAGGAATTTATATTATTATCTCTGTACCTAGTATATGTCTTTGTGTAGCATTAGTAGTTGTAATCCAAATCAAATGGATGAAGGGGCATGCACTTTCTTTTCTTTTCGTGTATTCTAGGCGTTGTATGCTTCTCTAGAATATCATTAAGTTTTGTTCGTCATAGCCCATGAAAGTTCATCTCTGAGGTATCTTCTTTGTATATTCCCTTGGTCAAGGGTCCGACCCTCTGGTGGGTATAGGGACAAATGTGGATTTTAGAAGGGTTATCACTTATTATGCTTCTTTTTCATAGTTCTTGGAAGGCGAGAACTAGTTTTTCTGAACCAGTAGCTTAGAAGAAATCCACAAGATGAGCTTTGCTTAGTCGTAAATCTCGTAAAATCTTGGATACTGGTCCTCAATGCAAATGATAGACTTTATGCATATTTAACCAGTTAAGTAGGGTCGTGTGTAAGTTAGGAATTGATATGGCATAAAATTTCATATTTACCCAGAAAAGATGGACTTCCGGAGCAGATGGTGTAAGTTGATTTTCTTCTGCACCCTTTTCATTTGCTTTTCAGTGTGGTTAATGTAAACCTATATGTTTTAAATCTCTACCGCTTGAGGACTTCAGGTTCATTTCAAGTTGATCCATTATTCTAGTTATTCTTTCTGAGGGAGACTGGGTTCTTCAGTGCTTAGGTGAATAATGCGAGATTATCTGCTGATTGTTTATCCATAGGAATGCTCATTTGCAGCTGCTTGTGATCTTGTATATCTCATTTGCTTCAGTTTGGTTGGTGATATCCTACCTGAAAACACAGCGTCCTTTCCCTTTTAATTTTCTGTTCAGTTTCTTATATTTGGTCATGTTAGCTTACTTTATCTTTTCAATGTTTATCTGATTAATTTTCTGCCTGCAGCTGGGAAACTTTTCAACCTGTTGGCAGTCTTCTCCCACAGCCAGAATGGACAGCTTGGGACCAAACTGCTGAGTATTGTGCTTTTGCGTATCAGAAATATATTGTCATATCTTCCTTGCGTCCTCAGTATAGATACTTGGGAGATGTTGCAATTCCTTATGCAACTAGTGCTGTTTGGCATCGCAGACAGCTATTTGTTGCTACTCCAACAACCATCGAGTAAGCTATTTGATCTTCAGCTATGTTGTTCAACACCATTAGCTATGATTTAAAATAAGTGTCCCTTCTTTGTCCTTTTTAACTCTTTTTTTAGTTGCACCATTTTTCTTGTTTCATGGTCTCCAACACACGCCTTTGACCATTAAACCTTTAATTTTGTATTTTTAATTTTTTTTATCTAATAAAACCCCACATGATAATAATTTTGTATATATTCGTAGATAAATGTGGTTGAGGTTCACAATATTTGACAAGGAAAAAAGAAAGGGTGCAACTAAAAAGTTATCCGGAGGTTGTATACACAAACATCTATAGCGACATTAATATTGAACGGATATTCTTATGAGAGTACCTTTTAAATAATATTAATATTACTTCCCGTTTTCTATCTATGTGCACATAACTTTTGTAAATAAGTGGTCAAAATATACAATTTTCCCTTCTACCATTTCTGTTTTATTTTTGATAAAACAACTTACTACATTGTTACTTTGTCTGAACTTCGGTTTTTGATATTTCTCTCGGTATATACATGTTTCCCACTTTAAGCTTTTTACTATATATGTCATAACTTTTGCACATAACTTTTGTAATCGAGTGTGATAGTTTTGCTCTATCTTTTCTTACACCTTGTTTGGTTAGCATTTGTGGTTGGAGAGGGAGGGAGGATAAAGAGGGAAGAGAAAGGTGAGGTGAGGGAAAGAGTGAACTTCTGTTTTCCTCCCAAATCTTTCCAAAAATGAAAGGATTTGATTAATGAAAAAGGGAAAGAAAATGGATCCTTGAAATTCCCTTCCCCTATGTCTTATCCAAATAATGCAACATGTATCCTTCACCTTCCTCTCCCTCTTGTGTCCTTCAAATCTTCCTATCAAAATATAGTGTTATACTATGGTAGTCTCTAGGCGTCTTTCTCACGTCATTCTATTTTATTAAAGAAGCTAAAGGAAAGGAGCTATTTATGGGCAAGTCACAGTCAAGAAGGTTTAACAATTTAACATTAATTGGTAATGCTATATAACTATATTAACCTAGTTAAGGACTTAAGTATATGTATTGCAACATGAAAACAGATGTTAGGGTTTAGTGGAGGTGTGACTGAGAATTCTTCAATTATGACTGCATTACACGAAGGAAGGTTTTGTGGAACAATTTTTCAATTTCGACTGCATTACACAAACGAACTGTAAGTCTTTTCCTCTTCGCAACATTTTTGGATGAAGTAGTAGGTATATCCAAAGAGATACCGTGGTGTTTGCTGATGTTATTGTGTTGATTGATGAGACTGGAGCTGGGGGGTAAGTAATGTGGAGTGTTTGGAGACAAAAGAGAGAATCTCAAGGGTTCAGGTTAAGTTGGAGTAAAACCGAGTACACGGAGTGTCATTTAGTATTGGGAGGAAGCGTAGTGAGGCTCTTGTGACTTTAGATGGTAATTAGTTAATGTTTTAAGTATCTCAGATCGATTTTACAGGGATAAATATCAGGATGTAACTCATAGGATTAATACCTTGTGGCTAAAATGGTAGCAAACTGGGTTATTGTGTGTTCTTAGTGTGCGTTTAAGGTTGAAAGGAAAATTCTATAGACTGAGTATTAGACGGACGTTACTATGTTGGTCCAAATATTGGCATTTAGCAAGTAAGAAGATGATTGTGGCAGAGATGCGTTAAGATGGATGTGTGGGAATACCTTCAGGGATGAAATTAGGAATGAAGGTATTCAGATGAAAGTTGAAGAAGCAAACAAATGAATATGAGATAAGGGAGAAACAACTGCATTGGTTTAGTCATGTACACCGTTGACAAGAGGATGCACCTGCCAGGAATGTGGAAGTTGGGATAAAGGAGATTTCAGAAGAAGCCGAGAGAGGCCAAATATGGCTCGAATAGAGGGATTTAAGAATGACATGAACAAAGTGGCGTAGCAAGAGCATGTAGCAAGATAGAAGTGAGTGGAGAAAGAAGATCTGCGTGGATGACCACAAGAAATAGATTCCTGATAGTGCATCCGACCCCATCCAGCTGGGAATAAAGCTTTGGTTTTGTTGTTGTTGCAAATACCTAGTGAAAAGCATGTAAATAAAAGCGATGTAGTACAGGAAGAACTAGGATCGATTTTAAAGAGCATAGATCCTTAGAAGAGACGATGTTGATTAGAATGATAACAAAAATGTCTGCATGTAGTGCTAGCAGAATTCGTGTGCTCTGTGATAGTAGATTGGTTTGCTGTTACAGCTCTAATTTCAGGGTTTTGTTGGCAATGCCAGTTGGTGCTGCCCCTTCTTTATTAATTCTCTGGGCTTGTGCACTATTGTTCACCAGAATACTCTGATTCTTTTATACCAACAATTATCGTGGGCTCTTTAGGCTTGAATTTTCATGAAGTTTGCTTCCTCGAACGTGTTTACGTTATCTTTATAAACTTTGCCAGGGTTGTTTTTGTGGATGCTGGAATTGCTCCAATTGATCTAGAGACAAAGAGGATGAAAGAAGAACTGAGACTAAAAGAGGTACAAGCTAGAGCTGTTGCTGAGCATGGTGAATTAGCTCTGATTTCAGTTGATGGTCCAAAAGCCGATACAAATGAAAGGGTATCTTTTAGACCACCAATGCTTCAGGTATATTGTTTTCTTCAACTATGTATGCTCCCATTGGATAGTAAATTAATTGTTTTTGGAGCAATGAAACTAGTTACAACTTACAAGGATAATTTCCGATATTTCTACATGTGATGAGGTTACAGAGTATTCAGTTAAATTTGGGATTACATTGTCCTTTCTATGAGAGCATCTGTTGCAAGATCTTTTATAATTGGTTGATCTTTTAATACAATGTTCCACTCATGACGTAGCGACTCTGCTATCACTTGTATGATTAACAAATGATTGATGCATGGAATATCATTGCAGGTCGTGAGATTAGCTTCATTTCAGCATGCTCCTTCTGTGCCACCTTACTTAACATTGCCAAAACTATCGAAAATTGACGGGGATGATTCAGGGATGCCCGAAGATAGAAGAGCTAATGACATTGCTGTTGGTGGTGGGGGTGTCGCTGTTGCGGTAACTCGCTTTCCTATGGAGCAGAAACGACCAATCGGTCCACTTGTAGTCGTGGGGGTTAGAGATGGAGTACTCTGGCTGATTGACAGGTACCAATCACATCTTAACTATAGTAGAACCGTAAAATCATGTGTGGCTGCAAAAAGCTAAATCTTTGGGTGCTTTCTTCTTTTTCTTCTTCTTCTTCTTTTTCTTCTTCTTTTGTCCTTTGGAAATTTAAGGTTTTTTGATGCTTACCAGAAAAGAAAAACCAGAAAATTGACTTTGGCATTTCAACCCATCTTGTCTTTAGATGTTTGATATTTCTACTTTGAGTTGGTTCTTTATATTTCAGTTTTCATTTTCATGCATTGCTGGAAATTTCCTCCAGGTATATGCGGGCCCATGCTTTATCTCTTAGTCATCCTGGTATTCGCTGCCGCTGTCTTGCTGCCTATGGGGATGCTGTCAGTGCAGTTAAATGGTAATGTTTTTCTTCAAATTTCCAATGTTAGCTGGATGTATATGCTTAATGAGGTGGTCCATTAATTACTTCTGCACCTTCAGGGCTACTAGACTTGGCAGAGAGCATCATGATGACTTGGCTGAATTTATGCTCGGAATGGGTTATGCTGCTGAAGCTCTTCATTTGCCTGGAATATCTAAGAGGTGAAGCTCATGCCAAAAGCTATATAAGATGTTCATTTCAACAAAGTGCTACTTCTGCTTTTGACTGGTCTTTATGATTTGTTGTGGATGGCAATTATAATGCTTGTTTTCATTGCATAGAATTTGGTCAACGTTTTGACTATGAATTAATACTCCAACATTTCTAGAATATGTGATAATCTTTAATATGCGATTCCATCTTTTCTATCGCAGATTGGAGTTTGATCTGGCCATGCAGGGAAATGATCTGAAAAGAGCCCTTCAGTGTCTTTTGACAATGAGCAACAGCAGGGATATAGGGCAAGAAACTTCAGGGTTGGATTTGACAAACCTGTTAAATGTAGCTGCTAAGAAGGAAAATATTGTAGATGCAGTTCAAGGGATAGCAAAATATGCAAAACAGTTCTTGGAACTTATTGATGCTGCAGATGCAACGGCACAATCTGAGATTGCCCGTGAAGCTCTTAAGCGGTTAGCTGCTGCTGGTTCTGTAAAGGGATCTTTGCAGAGTCATGAATTGAGGGGCTTGGCCTTGCGACTTGCAAATCATGGGGAGTTAACACGCTTAAGTGTATGTTCCTTCTATTCTCTTCTACTATAGTGTACTTGGATACTGTTGTTCAAGCTATTAGAGTATCGACCCTGAAAAGGAGCTTCTGGATTTACTTAGTATGCATTTTATTTTATTTTTCATATTTTCCTCAGCTATTTATGATAATTCAGGACAAGTGAGGCGTCTCTTGGGAATTTCTATCTATAAATGTGGATATCAATGGCTCTCCTTTATTCCGTTGCATCTAGCAACTTTTAGAATATCTCCACTCTTCAATAACAATAAAAACCCATTTTTTAATATAAACTGGCAATAATTGCATCCCATCATTGAAACCTGAATTTTAACTGAGGTGAAAAGAGGGAAGTGGGCCATTTTGACTTTTGAAGTAGGAATAGCATCATCTTTATTTAAGTTTTTACGAGTCTGCCTTATATCAGTGCATATCCTTGGACAAAAAATCCGTTCCCTCTTCCCGGAAAGTAAATGTTTCACAGAACATGGTGATGGATCTTTGCCTTTGGATCTTGGTCGGCTTGGTTTATTCCCTATCTTTGTCTCGGTCTCTAACCACATATGACCTTTTAGATGATACTGGGCTAGTGGTACCCATCCTTGCGACCTTGCACGACTTCAGAAATGTACTTATTGTTTCATTTTCTCTTCTTCGTGTCTCTCCTTTCTGTAATCATTTGTTTTGAATGTTCCAACATAGCTGCTGTACAAGTGGTAGAAATAGTACAGCAAAATTTGTATTTTTAGAAGTTAGAATTGTTAAACGGAGTAGGTATTTGGGAGAGACTAGGAAATTCAAATTTTCAGCTTCTCTTTGTTGCAGTGGATGCATTAACTGAAGTGGTGAGAAATAAACTCGTGAATTTCTATCTGAAGTGATAGTTTATGTCCGGCATCACTTTATTTAGTTTTCAAATTCGGAGGTCTTTTGGTAACGGATTGTGGACCCAAGCATGTATTCCTTCATCTTATTCTTCTGGTAGTAGTTATACTTGGAGTTTTAGGTGTGTTGGAGAAATGTTGTTTAATGTAAGGGTCAGTTGAGAACAGCCTCCTCTATTGTCACAGAGGTATGTCTGCAAAGATCAGAATCCCCCTCCTCCCTTTTTACCTCGCCGAGGGTGGGAGCTTTATAGGAGGTGTTGGAGAGATGTTGTTTGTATTTATGGAGTGATATGCTTTGGAGGTTTCTTACCTTCTAAGTTGGTCTTTTAGCTTATCAATAGGACTTTAAACTTTTTTTTTTTGCCTTTAACTCCATGCTGTGAAATGAGAATATTGTTTGATTTCGCGGGAGTACTGGTCGCTGTGACGCTCTCACATCCCCTGGGTGTTATAGAACTTCTTGGCTACTTGAAAAAGGGTGTTTGTCTTTCTAGAGGGACCACTCTCCATTTTTCCAAAGCTGCTGAAAGAGAGAATAATTTCACTCTTCATTTAATATTACCTTCTCTTATCGTACAAGTAATACATACCCAGAATTTCTCATAATTCTCACATCAATGTTTTGCTCCACTTATGAATTGATTAATTAATTTACTCCTTTTTCTTGGAACTAAGCTTAATTTCCAAAACTTGATTACACCCTTTAACTATGTTGTCGAAAATGGCCAAATTTCAACAAAGCATACAACAAAGAAGAATGAGTGTTCCTCTTCCTCGTCACGTGAGCTTCATTTTTCTTCACTTCCTCTGCTTCAACATCTTATCAGCCCTCTCTTTCTCTCTCCTCATTTTTCCATTTTTTAACTAACAAGCACAACTATTTTAGTCTAGAAATAAAACGGGTTTCCATTTTTAGTCTTAAAACGGTGTAAGGGTGAAAGAGGGTAGTGTTGAGGGATTGGAGGAAGAGAGCGGTGACTGGAGTTAGGGTTTCGAGGGCGGTAAGGATGGTGGTCTGGTGGAAAAGTTTCAGATCTGAAAATTCCGGCTAAAATCGACCACCATAGGGTCGAAACTCAACCACTAGAGAAAGGAGGCAAAGGAGGAGAGAGAAAATTCAGTGTTGAAAAGAGAAGCATGAGGGCTCGTCGGTGGTGAATGGTGATGGTGGACCGGCAATATGAGGGGGGTTGTGGGAAATCAACTATTTTTAAAGTGCTGATGTCAAAGAAAATGATAGCAATCATTCTTGATATGGGTCACGTCGTACATGGTGGAGTTACGCTGGTGGGGGCACCGGAGAGATGATGTCGGTAGAGCCGGTGGTTAGAGGTGAGAGGAAAGAGGTGTTTGGGATGGTGGAGACACTTTGTATGTTATGGAGAAGGGGGAGGGGGGGGNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNGGGGGGGGGGGGGGGGGGGGGGAAGAAGAGAATTAGAGAAAATGTGAGTTTACGATTGACATTTCATTATACAACACTTACCTAAAATAGGAACACGTCAACTAAAAAGAAATTTCCCAGAAAGGAATACGGGTCAACTAATTTGGAACGGAGGAAGTATGGTCTATAAGAGGCTGAAGAGCATTGTCTTCCCGCATTGTGTAGAGGACCTAAATATCTTTCTGTTATTTTCTAGTTCCCTCTCATTATGTAGATAGAGCTATGGATTTTTTCATATCAGAATTTGTTGAAGCCAATAATTTGTGATGAGAAACTTTGAAATACCAATTATTTGGTGCTTCTCTGAAATATGCTGTAAATTTCAAGCAACTCCTTTGCTGTGATTTTATTTAATGTATTATATTTCATGCGTAGTAAGATATAGTACTTCCTAGCTTTTGATTTAGAGTCGTTGAACATATTTCACTTATTTTCATATGAACACCAGCAGCCCAACCCAATATAGGCACCTTTCTGCTGAGGTAGTTGAGGGGAGGGTCCGAGTAAGGAAGGAAATAGTTTTCTTTTAATTTTATTCATTTACGCCAATAGTAGCCCTTTTCAATTTTTCATCATCAAAACCTTCTTTTTTCTCTCCTTTGTGATTACATGGCTTGTGGGACTCAAAATTATATGATGCTCCAATTACTTGCAGACTCTGGTGAATAATTTAGTCTCACTTGGTTTGGGACGTGAAGCTGCATTCTCAGCTGCAGTTTTGGGTGACAACGCTCTCATGGAGAAGGCATGGCATGAGACAGGGATGCTTGCGGAGGCTGTACTTCATTCTCATGTATGCTATTTCAAGTTTCAAGAATCCCTTTTTTTTATTGGTATTACTGCTTTATCTGACCGGTCCATATTTTGTCAGGCCCATGGACGACCAACTCTTAAGAACTTGGTTGAGTCTTGGAATAAAATGTTGCAGAAGGATCTTGAGCATACTCGAACAACCAAAACAGATGCAACTGCTGCATTCCTGGCTTCATTGGAGGATCCCAAGCTTCCTAGTCTAGGCGACACAGGTAAAAAAGCTCCTATTGAAATTCTTCCTCCAGGGATGAGTGCTCTCTCGCTCTCTATATCTGCTCAAAAGAAGCCGGTGCCTGCTACAAAGGGTTTACAACAGGAGCCTGGTAAGCCATTGTTATTAGGAGCACCTCCTACTGCAGCACCGATTAATGGTGTTACACTGCCACCGAAAGATTCTGGTGATTCGAGTGTGCCACCTCCAGCAGATTCCAGCCAACCTGGAGCGTCGCCTCCAGCAGATTCCAGTCAACCTGGAGTTCCAACTATAGCAGAATCCAGCCAATCTGGAGCTGCTCCTCCAGCAGAGTCCATCCTGCCTGGAGCTGCTCCTCCAGCAGCTGCCAGCCAACCTGGACCAGCTCTATCAGATTCTATCCAGTCTGGTGAAGTTGGAGCTGCTTCTGCGGAAGTTTCTGATCAACCACAACCAGGAGCACCTTCTTTAGCAGATTTAAGTGATCCAGGATCTTCAACTCATGTTGATTCTGGCCAGTTGGAAGCTTCTAAACCCACGGATTTTAGTCAGTCCGTTGCTCTACCACCAGTTGAATCAAATCAACCTGAAGCTTCACAACCTACATCTGACTCGACTCAATCAGTCTTGGATCTCAAGGCTTCTGAGCAACAATTAATTGACAGTATAGGTTCAGGGGTCACGAAACCAAGCAACTCAAGGTCTCCATTCGCGGATCTAGAACAACTGCTTATGTAGCCAATAAGTGAAGAGATATGTACTTTCTAGTATATTCCATTGAAATTTAACATTTTGGTGTGTTAAGATATCAGTTTGTCTATGTAATATTTGTCCCTGAGCTTCTTTGCTGGGATTTCTTGGGTATGTTGCAGGCAATTTCACCATTGATTTTTACGATCAATATTCTTTGTATTCAGTAAATCAAGGTACAGTTAGGGTTTTCTGGAAGCTGTATACTAAATTTGTAGATTTTCCTATCATTTTTGTGAGTGATTTGTTACATATTTGTCGGACTAATTCTGGTTTGGCTATTTGACTAGGAACAACTAAGTGCTTATTTTGTTTCTAACATATATGATCCTCTCGACTATCCCTTTCCCTCTATACGCCTATGCTGTGGAGTTTTCATTGGTTTTGGTTATTTGTCGGATTTTGTCACCTGATACTTTGTGCCTCTTTCTACACGCAAAAAAAAAAACATTTAGTTACCCGCTATGATGATTATGAAATCAAAAGGCTGTGCACGCTTAAGGTTTACAATAAATTTATCATACATTTGGATAACTTTTACCTCTTTTTTTTGTGACTTTTAATACAAAGTATGTTTTTATTAACTTTTATATTATGAAAAAAGTTGATGGGTAAAGGTTAAATAATTACCTTTATACATTATAAGTGATTTTGAAGGAAAATGCTTTTATAAAATGAAAATTTATCACTAAAAAATAGATAACTTTTACATAGAGTAGAGTAATTTTTAACCATTTTGAGTTAACTTTTACTCCGACTTATAATTTTCATTGTAGAACGATTTCCTTCAATATTGCTTAAAGAACGTTAAATTCCCAAATCACTTTATAACATATTTCCCACAATAATAGATAACTTTTACATAGAGTAGAGTAATTTTTAATCATTTTGAGTTAACTTTTACTCCGACTTATAATTTTCATTGTAGAACGATTTCTTTCAATATTGCTTAAAGAACGTTAAATTCCCAAAATACATCACTTTATAACATATTTCCCACCATACCGTCCACGTCAGCATCAAAACCGGTCTATTTTTTTTAATCAATCGGTACTAAATCGTCAACTGAATAGCGGTTGACCCAACTAAACCGCTATTTCAATTAGCGGCTTAGATTTCAACCATTTTATGGCTGTAAACTGGGAACAGCAGGTTGTGAGGTATGATGCATACAAAACCGCCAACTCAAATGGGGGTTTTGTTTTAAAGCAAACCGCCATTTGAATTGGCGGTTCGTGCTGAATCGGGAAAAAAATATTTTTCTTTCTTCCATGCTGACGTAGACGATATGGTAAAAAATATGTTATAAAATGGCGTATTTTGAAAATTTAACGTTCTTTAAGCAATATTGAAGGAAATCGTTCTTCGTTATATACCACTAATAAATAAAAATGTAACGTTATGAAATTCACTAACGGTTAAGAAAATGGTATACACCACTAATATACTGGTTTAACCAATTTTTGGACTTATTGCCTTATTGGCTTGGCAATTCGTGGACTATGGACCGTGGTGCACATAGAGCTACGTACACCAAAAGAACATATGTGCATAAGTAAAAGAACACGACACTACGACAAAAGAACATGCGATATAATATTTCACTTTTTTATTATAAAAATAATTTATTATTTTACTCTTTATTAAAACCTAAATAAAAAAATACTCAGGTTCTTTTCTCGTATCCAGATGTTCTTTCAATTATATACTGTGTTCTTTTGTTGCACGTAGCTCTATGTGCACCATGATCCACCTTCTAAATTGCGTTGGCTTGGGCATCATATAGATTTTAGAAAGCCTATCCAATATCGATATCAGTTCGGATACTCCGTATTCGAATACATACGGAGAATTACAAGGGTGTCAAGAATCTCAACTAAAGTTAGGGGTAAGAAGATTAGAGTGACAAGAGAGAAGGAGATAAGGATTAAAGCTTCCAGCACAAAAATGGAGGTCGCATTCCATAACCGCTCGATGTCGGTCATCCCTCTCATCAGAGGTTAGCCTCGTTTTTCACTTTTCTCTCTTTTTATCTTAATTCTGCTTAATTTTATTTTTAAGTTTTGCTTAATTATTCGAATATTTAGTTGTCATTACTACACTATCAAGGTGACTGTTAAATTCAGACGGATGAAGTATTTAGTTCTAGTATCAACTTAATTTGTATAGTTACATAAATTACATAATAGTTTGATGATGTTTTTGTGCAATAGTTCAATGCATAATTTGCAAATTAATGGTAAATGTTGCGAGTTTTGACAACTAAAAGCTAAATGCGGCCTATTATTTTGAGGAGGTAGTAACATAAAACAGGATTTCAAGTTTGCTGACCTTAGTTAGAAACTTAGAACATACTAATGTGGATGCCAAATTACCCTATGGTTAATAAGATCGAATCCTGTCTATATTATTGGGGTGATGATAAAGGTCGCAAGTTGCGACAACATTTAGTTGTCGCAATCTTATGTGTCACAGTTTAATTGGTACATATAGGCGCCACGTAGGCTATGTGTCATCCGAATATTTTTACTACCAATGCAATATTTTTACTATGAAAATATGTAAATAAGGTAGTCGATATAAAAAAGGAAACTAATTAGAATATTAAGATTTTCTCACCTTTTTTGTACATATATAGTATGTTATATATAAGTTAAATATTTCAGTTTCCAATTTAAATCATGACACGTAAACATGCCATGTCATCATTGTGACACGTCTTAAAGGTCGCATGTNTGTGGATGCCAAATTACCCTATGGTTAATAAGATCGAATCCTGTCTATATTATTGGGGTGATGATAAAGGTCGCAAGTTGCGACAACATTTAGTTGTCGCAATCTTATGTGTCACAGTTTAATTGATACATATAGGCGCCACGTAGGCTATGTGTCATCCGAATATTTTTACTACCAATGCAATATTTTTACTACCAATGCAATATTTTTACTATGAAAATATGTAAATAAGGTAGTCGATATAAAAAAGGAAACTAATTAGAATATTAAGATTTTCTCACCTTTTTTGTACATATATAATATGTTATATATAAGTTAAATATTTCAGTTTCCAATTTAAATCATGACACGTAGACATGTCATGTCATCATTGTGACACGTCTTAAAGGTCGCATGTTGCGACCTTTATCATTTTCGATATTATTGTTAGGAAATTTGGTGAAACAGTTTGATGGTTTTGTAAATTGCTACCTTTTAAAAAAGAAAATTATATTTTACTACCTTATAAGTTTGGTTTGCTTGACTAGCACTACCATTTGCCATTTTTTATTGATGTTTTCCGTCTTTGAACTCCACTACAACGGTTTTTTAGTATGGCAAATGAACAAAACGACAAATGAACAGAGATATTTAAAAGAATAGAACAAATACACGGTGGAAAACATTAACGGAAAACGTCAAATGGTAGTGTTAATCGAACAAACCAAACTTATAAGGTAGCTACATTTAAAAAAGTTTTTATAAGGTCGCAATTCACAAAACCACCAAACTTAAAGGTAGTGTTTCACAAACTTTTCTTATTGTTATTCTCCTTAGTGGCTCGATATATACCAAAAGCAAAAGTTCACTCGTATGTCTCTAGTACGTTGTTGACCCCGTATACATTTGGGGAGCTTGTAAAAGGTTTCTCTTGTATTAACTCCGTTCTCAAAAGTTCTTTACGCTATAAGTTCACTTGTATGTCCCTAGTACGTTGTACATCCTGTGTACGGAGTACATTTGAGGAGCTTGTAAAAGGTTTCTCTTATACTAAGTCCGTTCCCAAAAGTTCTTTACACTTTTGAATATGGGTCAAAAGTACGAAAACTTTGACCGTAAATTCTCATTGTTATATACATAAAAATATGATCATGTGGAAATCTTATTAGATTCGTGTTATTAATTATTTTCAAAATTTCAAGTGTTATAATTTTTCACATACAAAATTGGATAAATTAGTGGTTGAATATTGCATTGTAGTCCGTGCAAATTAGTGAGTGTAAACAACTCTAAGAACGGAGGTAGTATGAATCAATATAAGGGAAAGAAACGTACTCCATAATAACTTGTTGGGAAGTTTCCTTGCCAATTGTGTGTTCTCCTCGTAAATATTTATCGACAGTTCACTCATTGTCATCTTTTCCTTACTATATCACACAAACTCCATTACTAAGAGCTTTAGAATAAGGTAGTGTACACCCAATTTTTTCTGTTGTATGGCCTGTATCCATGTGCATCTATACTGTAAAGGTTGTATGGCAGTGTCAAAATTTAAGAATAAATAGAAGGTCAAGCCGAGGTAATTTATGTGCTTTATTATCTTTCCTAAGAGGCTAAGAAACTGTCTTTTCATATCAAGTCTTTTGCAGCTTTCATATTTGGTTTGGGTTCACAACGCCTCTGCTATTGTACTTCAATTTAGCAGGGTTTGTCTTTACTTGATTTATGAGTATGTCTGTGCAATGTTTGAGTCGAGTTATTAGACGATAATTGAAAATAGTGACTCTCTTGAGGGAACAGGTAACTAAGCCAGGTAGACCTAAAAAATCATTGGTTATTCTAAACATTTTGGGAGGAAATTATCTTAAGTCTACAAGTCTTTTTACATTCAATGTTAGTTTCAAGCACTATAAATGATGACTTTGTGTTCTTCTCTGACTTCTCTCCTCAATAGCAGTGCACTTATTAAGAAGTTATGTTGCTAGATCGGAGTAACATGTAAACAGTCATATGTTACGGATTCTACTTCAAACTGTTTTGAGTTGTTTAGTTAGTGTTCATGTTCCTTGTAGTTTAAATGCTGGTTTTTTTTGCCCAGAATATGATCTGATTAGTTCGTGCAGTTCACGTAGTAAATTGTCTTGTGATTTTCATTATGTAATCTAATGAAGTGACTAGGACCCAAGGGCTTGAAGTTGCATGACCACAGAAGGAAATTCTTCAGAATATCTTCTGCATTGCCAGAAACAATTATGTCTGTGACATTAGCAACTGCAGTTGTTGGTGCAGCGGCCACTATTCTCGTTAAAAGAACCAAAGAATCTGAAAAAAGGTGTGTGCTAAAATTCTCATTACTTGCAGTGCATTTTGTTGTATCTGGGAACTCCACTGGCAGTAAAAGGATCTGTAACTATATTATATGCTTCTCGGGACACCTATTAATATAAGAGAGGTCCTCTTTTTCCTTGTCTTCTAGCCTTCTGGCCAGAGTCCTTAATGTTGAATTGTGTATCTGACTGTTAAGTCCTAAATGTTAACATGAAGTTCTGTGAATCAGGTGCAGGGGATTTTATCTACAAGTATATACCATTGCTAGTAGTCAAATGCTACTTGTGCTGGTTAGTTTACTGTTACTGGTTTCACTTAGGCTCCGTTTGTTTCAGAGAAAAACATATTCAAAGGAAAATGATTTTCCATTTCAAGGGAAAACATCAAGGGAAAATGGTTTTCCTTTGTTTGTTTTCTTACAAGAAAAATTGTAAAGAAACAAAACATAGAAGTGGGAATGAGATGATAGAAAAGGAAATGTAAGAGATTAGAGTGAAAAATGTGGCGTTCCTTCTTTTCAAAGGAAAGTTGGTTTTCCCCTAGAGAAAGTGGTTTTCCACCTTTCGGAAAATTGTTTTTCACTCTGAACTAAACAAACAAAGGAAAATGGGAACATTAATTTATGGGGAAGTGTTTTCCGTCAAAACAAACGGAGCCTTAATGGAGTCGGAAAACGACTAGTATTCTAGAAGACTCAAGAGCGTCATAGTGTCAAGTCAAGCCTTTGTTAACTCTTTCTATTTGCTTCCAATTGTTTAATCCGTGTCTTTGTAGCACTGGATACGAGAGGCCGACATAAAAAATGTGTTTGTTTTGATCTTGATGCTATAGATTTATTTCACTCGAATGTCAACATCTAATTTGTTCCTAATTCATCTCAGTTACAATAATTTGCTGTGGAACTGCGTAACTTCTTGCAAATGTGAAAGTTATAATCAATACATCACATGTGCTTGAATTTAATGGGAATACAATTTAGATGTCTGTTTTGAATCCACTCTTGATGCTGTTTGTTTGAATTCAGCTACCCTATAAGTATCTACTTCAGTTAGTTGTTGAATTTTCCTAAGATATAACCGAAGCATATTTTTGGTAACTGAATGAAGTCAGACTCCAGTAAAAATGTGTGAAGATTGTGGCGGTTCTGGTATATGCTCAGAGTGCAAAGGAGAGGGCTTTGTGCTTCAGAAACTATCAGAAGAAAGGGCTGCCCGAGCTAGGATGATTTCTAAGGATGCAGCTACTCGCTATACATCTGGGTACGTAATGATCTCCTTGTAAATATTAAAGTAGAATGTATTATACTTCGTATTTATTTTGGTTTCCTGACAGAAGTTACTGATGTGAAACAGGCTTCCTAGGAAATGGAGTTATTGCACACGGTGTTCATCTGCTCGGAATTGTAGAACCTGCAATGGGAGTGGAAACCTGAGTTTGTAATTATTTCTAACATCTCTAGTTTTATCTGGAGTTTTGATGGAATCTGTGATTAATTGAAGGCTACTATTTGTTGCTGTATAAACTACATGCATATACTCCGTAATAGATAAAGCAAGGTAGTTGTATAAAACTTTGTGCCGTGGTATACTGGTAGGGGTTAAAATTAATAAGGGACGGAGGGAGTAATTGAAGCACTTGGCATTTGATTGAAATTATATGGTACTCCCTCCGTATTTATTTAAGAGATACACTTGGTCGGGCACGGGTATTAAGAAAAAGAATTGAATGAAATAAAGTAATAAAACAAGTGGGGTTGGGTAGATATTTTAATAAGTAAAATAAGTGGGGACCATGTCATTTTAGGGGATGGAGGTGGGGGTGGAGTGTAAATAGATTATTTAATTAGATGGTAGGGTTGATAAGTTACCAAAAATAGCAAGTGTATCTCTTAAATAAATACGGCCGGAAAAGACAAGTGTATCTCTTAAAAAAATACAGAGGGAGTATTCATTTATA
mRNA sequence
AACTCCAAATCTGGCAAATCACCCAGGCGCATCTGCTCTGCATCTCCTCCTCCTCCTCCTCCTCCTCCTTGTCGGGAAATTACAACAGCAGCATCTTGTAGTTGTATCACCATTTCACTACCTCTAGTCGGCGGTGCCTCTGTCAACCATCTCCTCCACTGCTGCTGCTATTCTCGGTAATGGATCAGGGACGGGCCTGATTTTAGAGAGAAGGAAAAACGGAGCAATCGAAAATGCTGAGATTACGGGCATTTCGGCCGACGAATGAGAAGATTGTCAAAATTCAACTTCATCCAACTCATCCATGGCTAGTCACCTCTGATGCCTCTGATCATGTCTCCGTTTGGAATTGGGAACATCGCCAGGTTATTTATGAGCTAAAAGCTGGTGGAGTTGATGAAAGGCGGCTTGTTGGTGCCAAATTGGAGAAGCTTGCCGAGGGCGAGTCTGAGTCTAGAGGGAAGCCTACTGAAGCTATACGTGGAGGAAGTGTTAAGCAAGTCAACTTTTATGATGATGATGTACGGTTTTGGCAACTTTGGCGTAATCGAGCAGCAGCTGCTGAGTCTCCATCGGCAGTTAGTAATGTCACTTCTGTTTTGAGTCCTCTCGCACCAGCAACAAAAGGAAGACATTTTCTGGTCATCTGTTGTGAGAACAAAGCTATATTTTTGGACTTGGTAACAATGCGTGGTCGTGATGTTCCCAAGCAAGAACTTGATAATAGATCCCTAATGTGTATGGAATTCTTGTATAGATCCACTGCAGTTGAAGGTCCACTAGTTGCTTTTGGTGGATCAGATGGTGTTATACGAGTTCTTTCGATGATAACTTGGAAGCTTGCCCGAAGGTACACTGGAGGCCACAAAGGAGCTATAAATTGTCTCATGACCTTCATGGCTTCTTCAGGCGAGGCATTGCTGGTCTCAGGTGGAAGTGATGGTTTACTAATACTTTGGAGTGCCGATCATGGTCACGATTCACGAGAGCTTGTACCCAAACTTAGTCTAAAGGCACATGATGGTGGAGTTGTGGCTGTAGAATTATCAAGAGTTTCTGGTGGTGCACCGCAGCTTATCACGATCGGTGCTGATAAAACATTGGCTATATGGGATACAATCTCATTCAAGGAGATGCGGCGCATAAAGCCAGTTCCTAAGATGTCTTGCCATAGTGTGGCATCTTGGTGCCATCCTCGTGCTCCAAACCTTGATATATTAACCTGTGTGAAGGATTCACATATATGGGCCATTGAACATCCAACTTATTCAGTCCTGACAAGACCATTATGCGAACTTTCCTCCCTTGTACCTCCACATGCTCTTGCACCGAGCAAAAAGCTTCGGGTTTACTGTATGGTTGCACATCCTTTACAGCCACATTTGGTTGCTACTGGAACAAATATTGGTGTAATTATTAGCGAGTTTGATTTTCGAGCTCTTCCACCTGTGGTTGCTCTACCATCACCACCCGGAAGTCGAGAGCATTCTGCTGTATATGTAGTTGAAAGAGAACTCAAATTATTGACCTTTCAGTTGTCTAATACAGCAAACCCATCTTTGGGTAGCAACGGCTCGTTGACCGAATCAGCTAGGTCTAAGGTGGATTCAGAAGCATTGCAGGTCAAACAGATAAAGAAGCACATAAGCACACCTGTTCCACATGATTCATATTCAGTTCTTTCTTTAAGCAGTACAGGAAAGTATCTCTCAGTGGTGTGGCCCGATATCCCGTACTTTTCTATATACAAGGTCAGCGACTGGTCCATTGTTGATTCAGGCAGTGCGCGACTCTTGGCATGGGATACCTGTCGTGATAGATTTGCAATATTAGAATCTGCTGTTGTTCCACGAATACCTGTAATTCCAAAGGGGGGGTCATCTAGAAAAGCGAAGGAAGCAGCAGCAGCTGCAGCTGCGGCTGCTGCAGCTAGTGCGGCGTCTTCTGCCTCTGTTCAAGTGCGTATTGTACTTGATGATGGGACATCTAACATATTGATGAGATCGATTGGTGAAAGGAGTGAACCGGTTATCGGTTTGCACGGTGGTTCACTCCTTGGTGTTGCATATAGAACTTCTCGCCGTGTTAGTGCTGTTGCAGCTACTGCAATTTCAACAATTCAGTCAATGCCATTGTCAGGGTTTGGCTCCAGTGCATCTTCATTTAACACATTTGATGACGGCGTCGGTTCTGGTCCTTTATCCACTCAAAATTTTCAGCTATATAGCTGGGAAACTTTTCAACCTGTTGGCAGTCTTCTCCCACAGCCAGAATGGACAGCTTGGGACCAAACTGCTGAGTATTGTGCTTTTGCGTATCAGAAATATATTGTCATATCTTCCTTGCGTCCTCAGTATAGATACTTGGGAGATGTTGCAATTCCTTATGCAACTAGTGCTGTTTGGCATCGCAGACAGCTATTTGTTGCTACTCCAACAACCATCGAGGTTGTTTTTGTGGATGCTGGAATTGCTCCAATTGATCTAGAGACAAAGAGGATGAAAGAAGAACTGAGACTAAAAGAGGTACAAGCTAGAGCTGTTGCTGAGCATGGTGAATTAGCTCTGATTTCAGTTGATGGTCCAAAAGCCGATACAAATGAAAGGGTATCTTTTAGACCACCAATGCTTCAGGTCGTGAGATTAGCTTCATTTCAGCATGCTCCTTCTGTGCCACCTTACTTAACATTGCCAAAACTATCGAAAATTGACGGGGATGATTCAGGGATGCCCGAAGATAGAAGAGCTAATGACATTGCTGTTGGTGGTGGGGGTGTCGCTGTTGCGGTAACTCGCTTTCCTATGGAGCAGAAACGACCAATCGGTCCACTTGTAGTCGTGGGGGTTAGAGATGGAGTACTCTGGCTGATTGACAGGTATATGCGGGCCCATGCTTTATCTCTTAGTCATCCTGGTATTCGCTGCCGCTGTCTTGCTGCCTATGGGGATGCTGTCAGTGCAGTTAAATGGGCTACTAGACTTGGCAGAGAGCATCATGATGACTTGGCTGAATTTATGCTCGGAATGGGTTATGCTGCTGAAGCTCTTCATTTGCCTGGAATATCTAAGAGATTGGAGTTTGATCTGGCCATGCAGGGAAATGATCTGAAAAGAGCCCTTCAGTGTCTTTTGACAATGAGCAACAGCAGGGATATAGGGCAAGAAACTTCAGGGTTGGATTTGACAAACCTGTTAAATGTAGCTGCTAAGAAGGAAAATATTGTAGATGCAGTTCAAGGGATAGCAAAATATGCAAAACAGTTCTTGGAACTTATTGATGCTGCAGATGCAACGGCACAATCTGAGATTGCCCGTGAAGCTCTTAAGCGGTTAGCTGCTGCTGGTTCTGTAAAGGGATCTTTGCAGAGTCATGAATTGAGGGGCTTGGCCTTGCGACTTGCAAATCATGGGGAGTTAACACGCTTAAGTACTCTGGTGAATAATTTAGTCTCACTTGGTTTGGGACGTGAAGCTGCATTCTCAGCTGCAGTTTTGGGTGACAACGCTCTCATGGAGAAGGCATGGCATGAGACAGGGATGCTTGCGGAGGCTGTACTTCATTCTCATGCCCATGGACGACCAACTCTTAAGAACTTGGTTGAGTCTTGGAATAAAATGTTGCAGAAGGATCTTGAGCATACTCGAACAACCAAAACAGATGCAACTGCTGCATTCCTGGCTTCATTGGAGGATCCCAAGCTTCCTAGTCTAGGCGACACAGGTAAAAAAGCTCCTATTGAAATTCTTCCTCCAGGGATGAGTGCTCTCTCGCTCTCTATATCTGCTCAAAAGAAGCCGGTGCCTGCTACAAAGGGTTTACAACAGGAGCCTGGTAAGCCATTGTTATTAGGAGCACCTCCTACTGCAGCACCGATTAATGGTGTTACACTGCCACCGAAAGATTCTGGTGATTCGAGTGTGCCACCTCCAGCAGATTCCAGCCAACCTGGAGCGTCGCCTCCAGCAGATTCCAGTCAACCTGGAGTTCCAACTATAGCAGAATCCAGCCAATCTGGAGCTGCTCCTCCAGCAGAGTCCATCCTGCCTGGAGCTGCTCCTCCAGCAGCTGCCAGCCAACCTGGACCAGCTCTATCAGATTCTATCCAGTCTGGTGAAGTTGGAGCTGCTTCTGCGGAAGTTTCTGATCAACCACAACCAGGAGCACCTTCTTTAGCAGATTTAAGTGATCCAGGATCTTCAACTCATGTTGATTCTGGCCAGTTGGAAGCTTCTAAACCCACGGATTTTAGTCAGTCCGTTGCTCTACCACCAGTTGAATCAAATCAACCTGAAGCTTCACAACCTACATCTGACTCGACTCAATCAGTCTTGGATCTCAAGGCTTCTGAGCAACAATTAATTGACAGTATAGGTTCAGGGGTCACGAAACCAAGCAACTCAAGGTCTCCATTCGCGGATCTAGAACAACTGCTTATATATCAGTTTGTCTATGTAATATTTGTCCCTGAGCTTCTTTGCTGGGATTTCTTGGATTTTAGAAAGCCTATCCAATATCGATATCAGTTCGGATACTCCGTATTCGAATACATACGGAGAATTACAAGGGTGTCAAGAATCTCAACTAAAGTTAGGGGTAAGAAGATTAGAGTGACAAGAGAGAAGGAGATAAGGATTAAAGCTTCCAGCACAAAAATGGAGGTCGCATTCCATAACCGCTCGATGTCGGTCATCCCTCTCATCAGAGGACCCAAGGGCTTGAAGTTGCATGACCACAGAAGGAAATTCTTCAGAATATCTTCTGCATTGCCAGAAACAATTATGTCTGTGACATTAGCAACTGCAGTTGTTGGTGCAGCGGCCACTATTCTCGTTAAAAGAACCAAAGAATCTGAAAAAAGTCAGACTCCAGTAAAAATGTGTGAAGATTGTGGCGGTTCTGGTATATGCTCAGAGTGCAAAGGAGAGGGCTTTGTGCTTCAGAAACTATCAGAAGAAAGGGCTGCCCGAGCTAGGATGATTTCTAAGGATGCAGCTACTCGCTATACATCTGGGCTTCCTAGGAAATGGAGTTATTGCACACGGTGTTCATCTGCTCGGAATTGTAGAACCTGCAATGGGAGTGGAAACCTGAGTTTGTAATTATTTCTAACATCTCTAGTTTTATCTGGAGTTTTGATGGAATCTGTGATTAATTGAAGGCTACTATTTGTTGCTGTATAAACTACATGCATATACTCCGTAATAGATAAAGCAAGGTAGTTGTATAAAACTTTGTGCCGTGGTATACTGGTAGGGGTTAAAATTAATAAGGGACGGAGGGAGTAATTGAAGCACTTGGCATTTGATTGAAATTATATGGTACTCCCTCCGTATTTATTTAAGAGATACACTTGGTCGGGCACGGGTATTAAGAAAAAGAATTGAATGAAATAAAGTAATAAAACAAGTGGGGTTGGGTAGATATTTTAATAAGTAAAATAAGTGGGGACCATGTCATTTTAGGGGATGGAGGTGGGGGTGGAGTGTAAATAGATTATTTAATTAGATGGTAGGGTTGATAAGTTACCAAAAATAGCAAGTGTATCTCTTAAATAAATACGGCCGGAAAAGACAAGTGTATCTCTTAAAAAAATACAGAGGGAGTATTCATTTATA
Coding sequence (CDS)
ATGCTGAGATTACGGGCATTTCGGCCGACGAATGAGAAGATTGTCAAAATTCAACTTCATCCAACTCATCCATGGCTAGTCACCTCTGATGCCTCTGATCATGTCTCCGTTTGGAATTGGGAACATCGCCAGGTTATTTATGAGCTAAAAGCTGGTGGAGTTGATGAAAGGCGGCTTGTTGGTGCCAAATTGGAGAAGCTTGCCGAGGGCGAGTCTGAGTCTAGAGGGAAGCCTACTGAAGCTATACGTGGAGGAAGTGTTAAGCAAGTCAACTTTTATGATGATGATGTACGGTTTTGGCAACTTTGGCGTAATCGAGCAGCAGCTGCTGAGTCTCCATCGGCAGTTAGTAATGTCACTTCTGTTTTGAGTCCTCTCGCACCAGCAACAAAAGGAAGACATTTTCTGGTCATCTGTTGTGAGAACAAAGCTATATTTTTGGACTTGGTAACAATGCGTGGTCGTGATGTTCCCAAGCAAGAACTTGATAATAGATCCCTAATGTGTATGGAATTCTTGTATAGATCCACTGCAGTTGAAGGTCCACTAGTTGCTTTTGGTGGATCAGATGGTGTTATACGAGTTCTTTCGATGATAACTTGGAAGCTTGCCCGAAGGTACACTGGAGGCCACAAAGGAGCTATAAATTGTCTCATGACCTTCATGGCTTCTTCAGGCGAGGCATTGCTGGTCTCAGGTGGAAGTGATGGTTTACTAATACTTTGGAGTGCCGATCATGGTCACGATTCACGAGAGCTTGTACCCAAACTTAGTCTAAAGGCACATGATGGTGGAGTTGTGGCTGTAGAATTATCAAGAGTTTCTGGTGGTGCACCGCAGCTTATCACGATCGGTGCTGATAAAACATTGGCTATATGGGATACAATCTCATTCAAGGAGATGCGGCGCATAAAGCCAGTTCCTAAGATGTCTTGCCATAGTGTGGCATCTTGGTGCCATCCTCGTGCTCCAAACCTTGATATATTAACCTGTGTGAAGGATTCACATATATGGGCCATTGAACATCCAACTTATTCAGTCCTGACAAGACCATTATGCGAACTTTCCTCCCTTGTACCTCCACATGCTCTTGCACCGAGCAAAAAGCTTCGGGTTTACTGTATGGTTGCACATCCTTTACAGCCACATTTGGTTGCTACTGGAACAAATATTGGTGTAATTATTAGCGAGTTTGATTTTCGAGCTCTTCCACCTGTGGTTGCTCTACCATCACCACCCGGAAGTCGAGAGCATTCTGCTGTATATGTAGTTGAAAGAGAACTCAAATTATTGACCTTTCAGTTGTCTAATACAGCAAACCCATCTTTGGGTAGCAACGGCTCGTTGACCGAATCAGCTAGGTCTAAGGTGGATTCAGAAGCATTGCAGGTCAAACAGATAAAGAAGCACATAAGCACACCTGTTCCACATGATTCATATTCAGTTCTTTCTTTAAGCAGTACAGGAAAGTATCTCTCAGTGGTGTGGCCCGATATCCCGTACTTTTCTATATACAAGGTCAGCGACTGGTCCATTGTTGATTCAGGCAGTGCGCGACTCTTGGCATGGGATACCTGTCGTGATAGATTTGCAATATTAGAATCTGCTGTTGTTCCACGAATACCTGTAATTCCAAAGGGGGGGTCATCTAGAAAAGCGAAGGAAGCAGCAGCAGCTGCAGCTGCGGCTGCTGCAGCTAGTGCGGCGTCTTCTGCCTCTGTTCAAGTGCGTATTGTACTTGATGATGGGACATCTAACATATTGATGAGATCGATTGGTGAAAGGAGTGAACCGGTTATCGGTTTGCACGGTGGTTCACTCCTTGGTGTTGCATATAGAACTTCTCGCCGTGTTAGTGCTGTTGCAGCTACTGCAATTTCAACAATTCAGTCAATGCCATTGTCAGGGTTTGGCTCCAGTGCATCTTCATTTAACACATTTGATGACGGCGTCGGTTCTGGTCCTTTATCCACTCAAAATTTTCAGCTATATAGCTGGGAAACTTTTCAACCTGTTGGCAGTCTTCTCCCACAGCCAGAATGGACAGCTTGGGACCAAACTGCTGAGTATTGTGCTTTTGCGTATCAGAAATATATTGTCATATCTTCCTTGCGTCCTCAGTATAGATACTTGGGAGATGTTGCAATTCCTTATGCAACTAGTGCTGTTTGGCATCGCAGACAGCTATTTGTTGCTACTCCAACAACCATCGAGGTTGTTTTTGTGGATGCTGGAATTGCTCCAATTGATCTAGAGACAAAGAGGATGAAAGAAGAACTGAGACTAAAAGAGGTACAAGCTAGAGCTGTTGCTGAGCATGGTGAATTAGCTCTGATTTCAGTTGATGGTCCAAAAGCCGATACAAATGAAAGGGTATCTTTTAGACCACCAATGCTTCAGGTCGTGAGATTAGCTTCATTTCAGCATGCTCCTTCTGTGCCACCTTACTTAACATTGCCAAAACTATCGAAAATTGACGGGGATGATTCAGGGATGCCCGAAGATAGAAGAGCTAATGACATTGCTGTTGGTGGTGGGGGTGTCGCTGTTGCGGTAACTCGCTTTCCTATGGAGCAGAAACGACCAATCGGTCCACTTGTAGTCGTGGGGGTTAGAGATGGAGTACTCTGGCTGATTGACAGGTATATGCGGGCCCATGCTTTATCTCTTAGTCATCCTGGTATTCGCTGCCGCTGTCTTGCTGCCTATGGGGATGCTGTCAGTGCAGTTAAATGGGCTACTAGACTTGGCAGAGAGCATCATGATGACTTGGCTGAATTTATGCTCGGAATGGGTTATGCTGCTGAAGCTCTTCATTTGCCTGGAATATCTAAGAGATTGGAGTTTGATCTGGCCATGCAGGGAAATGATCTGAAAAGAGCCCTTCAGTGTCTTTTGACAATGAGCAACAGCAGGGATATAGGGCAAGAAACTTCAGGGTTGGATTTGACAAACCTGTTAAATGTAGCTGCTAAGAAGGAAAATATTGTAGATGCAGTTCAAGGGATAGCAAAATATGCAAAACAGTTCTTGGAACTTATTGATGCTGCAGATGCAACGGCACAATCTGAGATTGCCCGTGAAGCTCTTAAGCGGTTAGCTGCTGCTGGTTCTGTAAAGGGATCTTTGCAGAGTCATGAATTGAGGGGCTTGGCCTTGCGACTTGCAAATCATGGGGAGTTAACACGCTTAAGTACTCTGGTGAATAATTTAGTCTCACTTGGTTTGGGACGTGAAGCTGCATTCTCAGCTGCAGTTTTGGGTGACAACGCTCTCATGGAGAAGGCATGGCATGAGACAGGGATGCTTGCGGAGGCTGTACTTCATTCTCATGCCCATGGACGACCAACTCTTAAGAACTTGGTTGAGTCTTGGAATAAAATGTTGCAGAAGGATCTTGAGCATACTCGAACAACCAAAACAGATGCAACTGCTGCATTCCTGGCTTCATTGGAGGATCCCAAGCTTCCTAGTCTAGGCGACACAGGTAAAAAAGCTCCTATTGAAATTCTTCCTCCAGGGATGAGTGCTCTCTCGCTCTCTATATCTGCTCAAAAGAAGCCGGTGCCTGCTACAAAGGGTTTACAACAGGAGCCTGGTAAGCCATTGTTATTAGGAGCACCTCCTACTGCAGCACCGATTAATGGTGTTACACTGCCACCGAAAGATTCTGGTGATTCGAGTGTGCCACCTCCAGCAGATTCCAGCCAACCTGGAGCGTCGCCTCCAGCAGATTCCAGTCAACCTGGAGTTCCAACTATAGCAGAATCCAGCCAATCTGGAGCTGCTCCTCCAGCAGAGTCCATCCTGCCTGGAGCTGCTCCTCCAGCAGCTGCCAGCCAACCTGGACCAGCTCTATCAGATTCTATCCAGTCTGGTGAAGTTGGAGCTGCTTCTGCGGAAGTTTCTGATCAACCACAACCAGGAGCACCTTCTTTAGCAGATTTAAGTGATCCAGGATCTTCAACTCATGTTGATTCTGGCCAGTTGGAAGCTTCTAAACCCACGGATTTTAGTCAGTCCGTTGCTCTACCACCAGTTGAATCAAATCAACCTGAAGCTTCACAACCTACATCTGACTCGACTCAATCAGTCTTGGATCTCAAGGCTTCTGAGCAACAATTAATTGACAGTATAGGTTCAGGGGTCACGAAACCAAGCAACTCAAGGTCTCCATTCGCGGATCTAGAACAACTGCTTATATATCAGTTTGTCTATGTAATATTTGTCCCTGAGCTTCTTTGCTGGGATTTCTTGGATTTTAGAAAGCCTATCCAATATCGATATCAGTTCGGATACTCCGTATTCGAATACATACGGAGAATTACAAGGGTGTCAAGAATCTCAACTAAAGTTAGGGGTAAGAAGATTAGAGTGACAAGAGAGAAGGAGATAAGGATTAAAGCTTCCAGCACAAAAATGGAGGTCGCATTCCATAACCGCTCGATGTCGGTCATCCCTCTCATCAGAGGACCCAAGGGCTTGAAGTTGCATGACCACAGAAGGAAATTCTTCAGAATATCTTCTGCATTGCCAGAAACAATTATGTCTGTGACATTAGCAACTGCAGTTGTTGGTGCAGCGGCCACTATTCTCGTTAAAAGAACCAAAGAATCTGAAAAAAGTCAGACTCCAGTAAAAATGTGTGAAGATTGTGGCGGTTCTGGTATATGCTCAGAGTGCAAAGGAGAGGGCTTTGTGCTTCAGAAACTATCAGAAGAAAGGGCTGCCCGAGCTAGGATGATTTCTAAGGATGCAGCTACTCGCTATACATCTGGGCTTCCTAGGAAATGGAGTTATTGCACACGGTGTTCATCTGCTCGGAATTGTAGAACCTGCAATGGGAGTGGAAACCTGAGTTTGTAA
Protein sequence
MLRLRAFRPTNEKIVKIQLHPTHPWLVTSDASDHVSVWNWEHRQVIYELKAGGVDERRLVGAKLEKLAEGESESRGKPTEAIRGGSVKQVNFYDDDVRFWQLWRNRAAAAESPSAVSNVTSVLSPLAPATKGRHFLVICCENKAIFLDLVTMRGRDVPKQELDNRSLMCMEFLYRSTAVEGPLVAFGGSDGVIRVLSMITWKLARRYTGGHKGAINCLMTFMASSGEALLVSGGSDGLLILWSADHGHDSRELVPKLSLKAHDGGVVAVELSRVSGGAPQLITIGADKTLAIWDTISFKEMRRIKPVPKMSCHSVASWCHPRAPNLDILTCVKDSHIWAIEHPTYSVLTRPLCELSSLVPPHALAPSKKLRVYCMVAHPLQPHLVATGTNIGVIISEFDFRALPPVVALPSPPGSREHSAVYVVERELKLLTFQLSNTANPSLGSNGSLTESARSKVDSEALQVKQIKKHISTPVPHDSYSVLSLSSTGKYLSVVWPDIPYFSIYKVSDWSIVDSGSARLLAWDTCRDRFAILESAVVPRIPVIPKGGSSRKAKEAAAAAAAAAAASAASSASVQVRIVLDDGTSNILMRSIGERSEPVIGLHGGSLLGVAYRTSRRVSAVAATAISTIQSMPLSGFGSSASSFNTFDDGVGSGPLSTQNFQLYSWETFQPVGSLLPQPEWTAWDQTAEYCAFAYQKYIVISSLRPQYRYLGDVAIPYATSAVWHRRQLFVATPTTIEVVFVDAGIAPIDLETKRMKEELRLKEVQARAVAEHGELALISVDGPKADTNERVSFRPPMLQVVRLASFQHAPSVPPYLTLPKLSKIDGDDSGMPEDRRANDIAVGGGGVAVAVTRFPMEQKRPIGPLVVVGVRDGVLWLIDRYMRAHALSLSHPGIRCRCLAAYGDAVSAVKWATRLGREHHDDLAEFMLGMGYAAEALHLPGISKRLEFDLAMQGNDLKRALQCLLTMSNSRDIGQETSGLDLTNLLNVAAKKENIVDAVQGIAKYAKQFLELIDAADATAQSEIAREALKRLAAAGSVKGSLQSHELRGLALRLANHGELTRLSTLVNNLVSLGLGREAAFSAAVLGDNALMEKAWHETGMLAEAVLHSHAHGRPTLKNLVESWNKMLQKDLEHTRTTKTDATAAFLASLEDPKLPSLGDTGKKAPIEILPPGMSALSLSISAQKKPVPATKGLQQEPGKPLLLGAPPTAAPINGVTLPPKDSGDSSVPPPADSSQPGASPPADSSQPGVPTIAESSQSGAAPPAESILPGAAPPAAASQPGPALSDSIQSGEVGAASAEVSDQPQPGAPSLADLSDPGSSTHVDSGQLEASKPTDFSQSVALPPVESNQPEASQPTSDSTQSVLDLKASEQQLIDSIGSGVTKPSNSRSPFADLEQLLIYQFVYVIFVPELLCWDFLDFRKPIQYRYQFGYSVFEYIRRITRVSRISTKVRGKKIRVTREKEIRIKASSTKMEVAFHNRSMSVIPLIRGPKGLKLHDHRRKFFRISSALPETIMSVTLATAVVGAAATILVKRTKESEKSQTPVKMCEDCGGSGICSECKGEGFVLQKLSEERAARARMISKDAATRYTSGLPRKWSYCTRCSSARNCRTCNGSGNLSL
Homology
BLAST of Spo17192.1 vs. NCBI nr
Match:
gi|902204523|gb|KNA14896.1| (hypothetical protein SOVF_102910 [Spinacia oleracea])
HSP 1 Score: 2702.2 bits (7003), Expect = 0.000e+0
Identity = 1397/1401 (99.71%), Postives = 1400/1401 (99.93%), Query Frame = 1
Query: 1 MLRLRAFRPTNEKIVKIQLHPTHPWLVTSDASDHVSVWNWEHRQVIYELKAGGVDERRLV 60
MLRLRAFRPTNEKIVKIQLHPTHPWLVTSDASDHVSVWNWEHRQVIYELKAGGVDERRLV
Sbjct: 1 MLRLRAFRPTNEKIVKIQLHPTHPWLVTSDASDHVSVWNWEHRQVIYELKAGGVDERRLV 60
Query: 61 GAKLEKLAEGESESRGKPTEAIRGGSVKQVNFYDDDVRFWQLWRNRAAAAESPSAVSNVT 120
GAKLEKLAEGESESRGKPTEAIRGGSVKQVNFYDDDVRFWQLWRNRAAAAESPSAVSNVT
Sbjct: 61 GAKLEKLAEGESESRGKPTEAIRGGSVKQVNFYDDDVRFWQLWRNRAAAAESPSAVSNVT 120
Query: 121 SVLSPLAPATKGRHFLVICCENKAIFLDLVTMRGRDVPKQELDNRSLMCMEFLYRSTAVE 180
SVLSPLAPATKGRHFLVICCENKAIFLDLVTMRGRDVPKQELDNRSLMCMEFLYRSTAVE
Sbjct: 121 SVLSPLAPATKGRHFLVICCENKAIFLDLVTMRGRDVPKQELDNRSLMCMEFLYRSTAVE 180
Query: 181 GPLVAFGGSDGVIRVLSMITWKLARRYTGGHKGAINCLMTFMASSGEALLVSGGSDGLLI 240
GPLVAFGGSDGVIRVLSMITWKLARRYTGGHKGAINCLMTFMASSGEALLVSGGSDGLLI
Sbjct: 181 GPLVAFGGSDGVIRVLSMITWKLARRYTGGHKGAINCLMTFMASSGEALLVSGGSDGLLI 240
Query: 241 LWSADHGHDSRELVPKLSLKAHDGGVVAVELSRVSGGAPQLITIGADKTLAIWDTISFKE 300
LWSADHGHDSRELVPKLSLKAHDGGVVAVELSRVSGGAPQLITIGADKTLAIWDTISFKE
Sbjct: 241 LWSADHGHDSRELVPKLSLKAHDGGVVAVELSRVSGGAPQLITIGADKTLAIWDTISFKE 300
Query: 301 MRRIKPVPKMSCHSVASWCHPRAPNLDILTCVKDSHIWAIEHPTYSVLTRPLCELSSLVP 360
MRRIKPVPKMSCHSVASWCHPRAPNLDILTCVKDSHIWAIEHPTYSVLTRPLCELSSLVP
Sbjct: 301 MRRIKPVPKMSCHSVASWCHPRAPNLDILTCVKDSHIWAIEHPTYSVLTRPLCELSSLVP 360
Query: 361 PHALAPSKKLRVYCMVAHPLQPHLVATGTNIGVIISEFDFRALPPVVALPSPPGSREHSA 420
PHALAPSKKLRVYCMVAHPLQPHLVATGTNIGVIISEFDFRALPPVVALPSPPGSREHSA
Sbjct: 361 PHALAPSKKLRVYCMVAHPLQPHLVATGTNIGVIISEFDFRALPPVVALPSPPGSREHSA 420
Query: 421 VYVVERELKLLTFQLSNTANPSLGSNGSLTESARSKVDSEALQVKQIKKHISTPVPHDSY 480
VYVVERELKLLTFQLSNTANPSLGSNGSLTESARSKVDSEALQVKQIKKHISTPVPHDSY
Sbjct: 421 VYVVERELKLLTFQLSNTANPSLGSNGSLTESARSKVDSEALQVKQIKKHISTPVPHDSY 480
Query: 481 SVLSLSSTGKYLSVVWPDIPYFSIYKVSDWSIVDSGSARLLAWDTCRDRFAILESAVVPR 540
SVLSLSSTGKYLSVVWPDIPYFSIYKVSDWSIVDSGSARLLAWDTCRDRFAILESAVVPR
Sbjct: 481 SVLSLSSTGKYLSVVWPDIPYFSIYKVSDWSIVDSGSARLLAWDTCRDRFAILESAVVPR 540
Query: 541 IPVIPKGGSSRKAKEAAAAAAAAAAASAASSASVQVRIVLDDGTSNILMRSIGERSEPVI 600
+P+IPKGGSSRKAKEAAAAAAAAA ASAASSASVQVRIVLDDGTSNILMRSIGERSEPVI
Sbjct: 541 VPIIPKGGSSRKAKEAAAAAAAAA-ASAASSASVQVRIVLDDGTSNILMRSIGERSEPVI 600
Query: 601 GLHGGSLLGVAYRTSRRVSAVAATAISTIQSMPLSGFGSSASSFNTFDDGVGSGPLSTQN 660
GLHGGSLLGVAYRTSRRVSAVAATAISTIQSMPLSGFGSSASSFNTFDDGVGSGPLSTQN
Sbjct: 601 GLHGGSLLGVAYRTSRRVSAVAATAISTIQSMPLSGFGSSASSFNTFDDGVGSGPLSTQN 660
Query: 661 FQLYSWETFQPVGSLLPQPEWTAWDQTAEYCAFAYQKYIVISSLRPQYRYLGDVAIPYAT 720
FQLYSWETFQPVGSLLPQPEWTAWDQTAEYCAFAYQKYIVISSLRPQYRYLGDVAIPYAT
Sbjct: 661 FQLYSWETFQPVGSLLPQPEWTAWDQTAEYCAFAYQKYIVISSLRPQYRYLGDVAIPYAT 720
Query: 721 SAVWHRRQLFVATPTTIEVVFVDAGIAPIDLETKRMKEELRLKEVQARAVAEHGELALIS 780
SAVWHRRQLFVATPTTIEVVFVDAGIAPIDLETKRMKEELRLKEVQARAVAEHGELALIS
Sbjct: 721 SAVWHRRQLFVATPTTIEVVFVDAGIAPIDLETKRMKEELRLKEVQARAVAEHGELALIS 780
Query: 781 VDGPKADTNERVSFRPPMLQVVRLASFQHAPSVPPYLTLPKLSKIDGDDSGMPEDRRAND 840
VDGPKADTNERVSFRPPMLQVVRLASFQHAPSVPPYLTLPKLSKIDGDDSGMPEDRRAND
Sbjct: 781 VDGPKADTNERVSFRPPMLQVVRLASFQHAPSVPPYLTLPKLSKIDGDDSGMPEDRRAND 840
Query: 841 IAVGGGGVAVAVTRFPMEQKRPIGPLVVVGVRDGVLWLIDRYMRAHALSLSHPGIRCRCL 900
IAVGGGGVAVAVTRFPMEQKRPIGPLVVVGVRDGVLWLIDRYMRAHALSLSHPGIRCRCL
Sbjct: 841 IAVGGGGVAVAVTRFPMEQKRPIGPLVVVGVRDGVLWLIDRYMRAHALSLSHPGIRCRCL 900
Query: 901 AAYGDAVSAVKWATRLGREHHDDLAEFMLGMGYAAEALHLPGISKRLEFDLAMQGNDLKR 960
AAYGDAVSAVKWATRLGREHHDDLAEFMLGMGYAAEALHLPGISKRLEFDLAMQGNDLKR
Sbjct: 901 AAYGDAVSAVKWATRLGREHHDDLAEFMLGMGYAAEALHLPGISKRLEFDLAMQGNDLKR 960
Query: 961 ALQCLLTMSNSRDIGQETSGLDLTNLLNVAAKKENIVDAVQGIAKYAKQFLELIDAADAT 1020
ALQCLLTMSNSRDIGQETSGLDLTNLLNVAAKKENIVDAVQGIAKYAKQFLELIDAADAT
Sbjct: 961 ALQCLLTMSNSRDIGQETSGLDLTNLLNVAAKKENIVDAVQGIAKYAKQFLELIDAADAT 1020
Query: 1021 AQSEIAREALKRLAAAGSVKGSLQSHELRGLALRLANHGELTRLSTLVNNLVSLGLGREA 1080
AQSEIAREALKRLAAAGSVKGSLQSHELRGLALRLANHGELTRLSTLVNNLVSLGLGREA
Sbjct: 1021 AQSEIAREALKRLAAAGSVKGSLQSHELRGLALRLANHGELTRLSTLVNNLVSLGLGREA 1080
Query: 1081 AFSAAVLGDNALMEKAWHETGMLAEAVLHSHAHGRPTLKNLVESWNKMLQKDLEHTRTTK 1140
AFSAAVLGDNALMEKAWHETGMLAEAVLHSHAHGRPTLKNLVESWNKMLQKDLEHTRTTK
Sbjct: 1081 AFSAAVLGDNALMEKAWHETGMLAEAVLHSHAHGRPTLKNLVESWNKMLQKDLEHTRTTK 1140
Query: 1141 TDATAAFLASLEDPKLPSLGDTGKKAPIEILPPGMSALSLSISAQKKPVPATKGLQQEPG 1200
TDATAAFLASLEDPKLPSLGDTGKKAPIEILPPGMSALSLSISAQKKPVPATKGLQQEPG
Sbjct: 1141 TDATAAFLASLEDPKLPSLGDTGKKAPIEILPPGMSALSLSISAQKKPVPATKGLQQEPG 1200
Query: 1201 KPLLLGAPPTAAPINGVTLPPKDSGDSSVPPPADSSQPGASPPADSSQPGVPTIAESSQS 1260
KPLLLGAPPTAAPINGVTLPPKDSGDSSVPPPADSSQPGASPPADSSQPGVPTIAESSQS
Sbjct: 1201 KPLLLGAPPTAAPINGVTLPPKDSGDSSVPPPADSSQPGASPPADSSQPGVPTIAESSQS 1260
Query: 1261 GAAPPAESILPGAAPPAAASQPGPALSDSIQSGEVGAASAEVSDQPQPGAPSLADLSDPG 1320
GAAPPAESILPGAAPPAAASQPGPALSDSIQSGEVGAASAEVSDQPQPGAPSLADLSDPG
Sbjct: 1261 GAAPPAESILPGAAPPAAASQPGPALSDSIQSGEVGAASAEVSDQPQPGAPSLADLSDPG 1320
Query: 1321 SSTHVDSGQLEASKPTDFSQSVALPPVESNQPEASQPTSDSTQSVLDLKASEQQLIDSIG 1380
SSTHVDSGQLEASKPTDFSQSVALPPVESNQPEASQPTSDSTQSVLDLKASEQQLIDSIG
Sbjct: 1321 SSTHVDSGQLEASKPTDFSQSVALPPVESNQPEASQPTSDSTQSVLDLKASEQQLIDSIG 1380
Query: 1381 SGVTKPSNSRSPFADLEQLLI 1402
SGVTKPSNSRSPFADLEQLL+
Sbjct: 1381 SGVTKPSNSRSPFADLEQLLM 1400
BLAST of Spo17192.1 vs. NCBI nr
Match:
gi|731324833|ref|XP_010673184.1| (PREDICTED: uncharacterized protein LOC104889623 [Beta vulgaris subsp. vulgaris])
HSP 1 Score: 2431.8 bits (6301), Expect = 0.000e+0
Identity = 1269/1407 (90.19%), Postives = 1316/1407 (93.53%), Query Frame = 1
Query: 1 MLRLRAFRPTNEKIVKIQLHPTHPWLVTSDASDHVSVWNWEHRQVIYELKAGGVDERRLV 60
MLRLRAFRPTNEKIVKIQ+HPTHPWLVTSDASDHVSVWNWEHRQVIYELKAGGVDERRLV
Sbjct: 1 MLRLRAFRPTNEKIVKIQVHPTHPWLVTSDASDHVSVWNWEHRQVIYELKAGGVDERRLV 60
Query: 61 GAKLEKLAEGESESRGKPTEAIRGGSVKQVNFYDDDVRFWQLWRNRAAAAESPSAVSNVT 120
GAKLEKLAEGESESRGKPTEAIRGGSVKQVNFYDDDVRFWQLWRNRAAAAE+PSAVS VT
Sbjct: 61 GAKLEKLAEGESESRGKPTEAIRGGSVKQVNFYDDDVRFWQLWRNRAAAAEAPSAVSLVT 120
Query: 121 SVLSPLAPATKGRHFLVICCENKAIFLDLVTMRGRDVPKQELDNRSLMCMEFLYRSTAVE 180
S LSPLAPATKGRHFLVICCENKAIFLDLVTMRGRDVPKQELDNRSLMCMEFLYRSTA E
Sbjct: 121 SALSPLAPATKGRHFLVICCENKAIFLDLVTMRGRDVPKQELDNRSLMCMEFLYRSTAGE 180
Query: 181 GPLVAFGGSDGVIRVLSMITWKLARRYTGGHKGAINCLMTFMASSGEALLVSGGSDGLLI 240
GPLVAFGGSDGVIRVLSMITWKLARRYTGGHKGAINCLMTFMASSGEALLVSGGSDGLLI
Sbjct: 181 GPLVAFGGSDGVIRVLSMITWKLARRYTGGHKGAINCLMTFMASSGEALLVSGGSDGLLI 240
Query: 241 LWSADHGHDSRELVPKLSLKAHDGGVVAVELSRVSGGAPQLITIGADKTLAIWDTISFKE 300
LWSADHGHDSRELVPKLSLKAHDGGVV VELSRVSGGAPQLITIGADKTLAIWDTISFKE
Sbjct: 241 LWSADHGHDSRELVPKLSLKAHDGGVVGVELSRVSGGAPQLITIGADKTLAIWDTISFKE 300
Query: 301 MRRIKPVPKMSCHSVASWCHPRAPNLDILTCVKDSHIWAIEHPTYSVLTRPLCELSSLVP 360
MRRIKPVPKMSCHSVASWCHPRAPNLDILTCVKDSHIWAIEHPTYSVLTRPLCELSSLVP
Sbjct: 301 MRRIKPVPKMSCHSVASWCHPRAPNLDILTCVKDSHIWAIEHPTYSVLTRPLCELSSLVP 360
Query: 361 PHALAPSKKLRVYCMVAHPLQPHLVATGTNIGVIISEFDFRALPPVVALPSPPGSREHSA 420
P ALAP+KKLRVYCMVAHPLQPHLVATGTNIGVIISEFDFRALPPVVALPSPPG+REHSA
Sbjct: 361 PQALAPNKKLRVYCMVAHPLQPHLVATGTNIGVIISEFDFRALPPVVALPSPPGNREHSA 420
Query: 421 VYVVERELKLLTFQLSNTANPSLGSNGSLTESARSKVDSEALQVKQIKKHISTPVPHDSY 480
VYVVERELK+ TFQLSNTAN SLGSNGSLTESARSKVDSEALQVKQIKK +STPVPHDSY
Sbjct: 421 VYVVERELKVSTFQLSNTANQSLGSNGSLTESARSKVDSEALQVKQIKKPVSTPVPHDSY 480
Query: 481 SVLSLSSTGKYLSVVWPDIPYFSIYKVSDWSIVDSGSARLLAWDTCRDRFAILESAVVPR 540
SVLSLSS+GKYL+VVWPDIPYFSIYKVSDWSIVDSGSARLL+WDTCRDRFAILESAVVPR
Sbjct: 481 SVLSLSSSGKYLAVVWPDIPYFSIYKVSDWSIVDSGSARLLSWDTCRDRFAILESAVVPR 540
Query: 541 IPVIPKGGSSRKAKE-AAAAAAAAAAASAASSASVQVRIVLDDGTSNILMRSIGERSEPV 600
+PVIPKGGSSRKAKE AAAAAAAAAAASAASSASVQVRI+LDDGTSNILMRSIG RSEPV
Sbjct: 541 VPVIPKGGSSRKAKEAAAAAAAAAAAASAASSASVQVRILLDDGTSNILMRSIGGRSEPV 600
Query: 601 IGLHGGSLLGVAYRTSRRVSAVAATAISTIQSMPLSGFGSSASSFNTFDDGVGSGPLSTQ 660
IGLHGG+LLG+AYRTSRRVSAVAATAISTIQSMPLSGFGS ASSFNTFDDGVGSG LSTQ
Sbjct: 601 IGLHGGALLGIAYRTSRRVSAVAATAISTIQSMPLSGFGSGASSFNTFDDGVGSGTLSTQ 660
Query: 661 NFQLYSWETFQPVGSLLPQPEWTAWDQTAEYCAFAYQKYIVISSLRPQYRYLGDVAIPYA 720
NFQLYSWETFQPVGSLLPQPEWTAWDQT EYCAFAYQKYIVISSLRPQYRYLGDVAIPYA
Sbjct: 661 NFQLYSWETFQPVGSLLPQPEWTAWDQTVEYCAFAYQKYIVISSLRPQYRYLGDVAIPYA 720
Query: 721 TSAVWHRRQLFVATPTTIEVVFVDAGIAPIDLETKRMKEELRLKEVQARAVAEHGELALI 780
TSAVWHRRQLFVATPTTIEVVFVDAG+A ID+ETKRMKEELRLKEVQARAVAEHGELALI
Sbjct: 721 TSAVWHRRQLFVATPTTIEVVFVDAGVAQIDIETKRMKEELRLKEVQARAVAEHGELALI 780
Query: 781 SVDGPKADTNERVSFRPPMLQVVRLASFQHAPSVPPYLTLPKLSKIDGDDSGMPEDRRAN 840
SVDGPK +TNER+ RPPMLQVVRLASFQHAPSVPPYLTLPKLSKIDGDDSGM D +AN
Sbjct: 781 SVDGPKTETNERIPLRPPMLQVVRLASFQHAPSVPPYLTLPKLSKIDGDDSGM-SDGKAN 840
Query: 841 DIAVGGGGVAVAVTRFPMEQKRPIGPLVVVGVRDGVLWLIDRYMRAHALSLSHPGIRCRC 900
+IAVGGGGVAVAVTRFPMEQKRPIGPLVVVGV+DGVLWLIDRYMRAHALSLSHPGIRCRC
Sbjct: 841 EIAVGGGGVAVAVTRFPMEQKRPIGPLVVVGVKDGVLWLIDRYMRAHALSLSHPGIRCRC 900
Query: 901 LAAYGDAVSAVKWATRLGREHHDDLAEFMLGMGYAAEALHLPGISKRLEFDLAMQGNDLK 960
LAAYGDAVSAVKWATRLGREHHDDLA+FMLGMGYAAEALHLPGISKRLEFDLAMQGNDLK
Sbjct: 901 LAAYGDAVSAVKWATRLGREHHDDLAQFMLGMGYAAEALHLPGISKRLEFDLAMQGNDLK 960
Query: 961 RALQCLLTMSNSRDIGQETSGLDLTNLLNVAAKKENIVDAVQGIAKYAKQFLELIDAADA 1020
RALQCLLTMSNSRDIG+ETSGLDLTNLLNVAAKKENIVDAVQGI K+AKQFL+LIDAADA
Sbjct: 961 RALQCLLTMSNSRDIGKETSGLDLTNLLNVAAKKENIVDAVQGITKFAKQFLDLIDAADA 1020
Query: 1021 TAQSEIAREALKRLAAAGSVKGSLQSHELRGLALRLANHGELTRLSTLVNNLVSLGLGRE 1080
TAQ+EIAREALKRLAAAGS+KGSLQSHELRGLALRLANHGELTRLSTLVNNLVSLGLGRE
Sbjct: 1021 TAQAEIAREALKRLAAAGSIKGSLQSHELRGLALRLANHGELTRLSTLVNNLVSLGLGRE 1080
Query: 1081 AAFSAAVLGDNALMEKAWHETGMLAEAVLHSHAHGRPTLKNLVESWNKMLQKDLEHTR-T 1140
AAFSAAVLGDNALMEKAWHETGMLAEAVLHSHAHGRPTLKNLVE+WNKMLQK++EHTR T
Sbjct: 1081 AAFSAAVLGDNALMEKAWHETGMLAEAVLHSHAHGRPTLKNLVEAWNKMLQKEIEHTRTT 1140
Query: 1141 TKTDATAAFLASLEDPKLPSLGDTGKKAPIEILPPGMSALSLSISAQKKPVPATKGLQQE 1200
TKTDATAAFLASLEDPKLPSLGDT KK PIEILPPGMS+LSLSISA KKP+P KG QQE
Sbjct: 1141 TKTDATAAFLASLEDPKLPSLGDTDKKPPIEILPPGMSSLSLSISAPKKPLPTVKGSQQE 1200
Query: 1201 PGKPLLLGAPPTAAPINGVTLPPKDSGDSSVPPPADSSQPGASPPADSSQPGVPTIAESS 1260
PGKPLLLGAPPT APINGVT PP DSGDSSV PPA+SSQPGA A+SSQPG P+ AESS
Sbjct: 1201 PGKPLLLGAPPTTAPINGVTPPPTDSGDSSVVPPAESSQPGAPSTAESSQPGAPSTAESS 1260
Query: 1261 QSGAAPPAESILPGAAPPAAASQPGPALSDSIQ----SGEVGAASAEVSDQPQP-GAPSL 1320
Q GA ES GA +SQP DS Q SGE GA SAEVS+QP+P G SL
Sbjct: 1261 QPGAPSTVESSQSGAPSTVESSQPEVPPPDSTQVGEESGEAGAPSAEVSNQPKPEGTSSL 1320
Query: 1321 ADLSDPGSSTHVDSGQLEASKPTDFSQSVALPPVESNQPEASQPTSDSTQSVLDLKASEQ 1380
D S+ G+S V+S + E +K D SQS A PP +SNQPEA PT+D TQ LD K +EQ
Sbjct: 1321 PDSSETGASNQVNSSESEPTKLADSSQSGAPPPADSNQPEAPPPTTDLTQLPLDPKVAEQ 1380
Query: 1381 QLIDSIGSGVTKPSNSRSPFADLEQLL 1401
QL+DSIG V KPSNSRSPFADL L+
Sbjct: 1381 QLLDSIGPEVAKPSNSRSPFADLALLM 1406
BLAST of Spo17192.1 vs. NCBI nr
Match:
gi|225439033|ref|XP_002263744.1| (PREDICTED: uncharacterized protein LOC100248418 [Vitis vinifera])
HSP 1 Score: 2043.5 bits (5293), Expect = 0.000e+0
Identity = 1040/1273 (81.70%), Postives = 1147/1273 (90.10%), Query Frame = 1
Query: 1 MLRLRAFRPTNEKIVKIQLHPTHPWLVTSDASDHVSVWNWEHRQVIYELKAGGVDERRLV 60
MLRLR FRPTN+KIVKIQLHPTHPWLVT+DASDHVSVWNWEHRQVIYELKAGG+DERRLV
Sbjct: 1 MLRLRTFRPTNDKIVKIQLHPTHPWLVTADASDHVSVWNWEHRQVIYELKAGGIDERRLV 60
Query: 61 GAKLEKLAEGESESRGKPTEAIRGGSVKQVNFYDDDVRFWQLWRNRAAAAESPSAVSNVT 120
GAKLEKLAEGESE +GKPTEA+RGGSVKQV+FYDDDVRFWQLWRNR+AAAE+PSAV++VT
Sbjct: 61 GAKLEKLAEGESEPKGKPTEAMRGGSVKQVDFYDDDVRFWQLWRNRSAAAEAPSAVNHVT 120
Query: 121 SVLSPLAPATKGRHFLVICCENKAIFLDLVTMRGRDVPKQELDNRSLMCMEFLYRSTAVE 180
S S AP+TKGRHFLVICCENKAIFLDLVTMRGRDVPKQELDN+SL+CMEFL RS +
Sbjct: 121 SAFSSPAPSTKGRHFLVICCENKAIFLDLVTMRGRDVPKQELDNKSLLCMEFLSRSAGGD 180
Query: 181 GPLVAFGGSDGVIRVLSMITWKLARRYTGGHKGAINCLMTFMASSGEALLVSGGSDGLLI 240
PLVAFGGSDGVIRVLSMITWKL RRYTGGHKG+I+CLMTFMASSGEALL+SG SDGLLI
Sbjct: 181 APLVAFGGSDGVIRVLSMITWKLVRRYTGGHKGSISCLMTFMASSGEALLISGASDGLLI 240
Query: 241 LWSADHGHDSRELVPKLSLKAHDGGVVAVELSRVSGGAPQLITIGADKTLAIWDTISFKE 300
LWSADHG DSRELVPKLSLKAHDGGVVAVELSRV GGAPQLITIGADKTLAIWDTISFKE
Sbjct: 241 LWSADHGQDSRELVPKLSLKAHDGGVVAVELSRVIGGAPQLITIGADKTLAIWDTISFKE 300
Query: 301 MRRIKPVPKMSCHSVASWCHPRAPNLDILTCVKDSHIWAIEHPTYSVLTRPLCELSSLVP 360
+RRIKPVPK++CHSVASWCHPRAPNLDILTCVKDSHIWAIEHPTYS LTRPLCELSSLVP
Sbjct: 301 LRRIKPVPKLACHSVASWCHPRAPNLDILTCVKDSHIWAIEHPTYSALTRPLCELSSLVP 360
Query: 361 PHALAPSKKLRVYCMVAHPLQPHLVATGTNIGVIISEFDFRALPPVVALPSPPGSREHSA 420
P LAP+KKLRVYCMVAHPLQPHLVATGTNIGVI+SEFD R+LP V ALP+P GSREHSA
Sbjct: 361 PQVLAPNKKLRVYCMVAHPLQPHLVATGTNIGVIVSEFDARSLPAVAALPTPVGSREHSA 420
Query: 421 VYVVERELKLLTFQLSNTANPSLGSNGSLTESARSKVDS-EALQVKQIKKHISTPVPHDS 480
VYVVERELKLL FQLS+TANPSLGSNGSL+E+ R + DS E L VKQIKKHISTPVPHDS
Sbjct: 421 VYVVERELKLLNFQLSSTANPSLGSNGSLSETGRFRGDSLEPLHVKQIKKHISTPVPHDS 480
Query: 481 YSVLSLSSTGKYLSVVWPDIPYFSIYKVSDWSIVDSGSARLLAWDTCRDRFAILESAVVP 540
YSVLS+SS+GKYL++VWPDIPYFSIYKVSDWSIVDSGSARLLAWDTCRDRFA+LES++ P
Sbjct: 481 YSVLSISSSGKYLAIVWPDIPYFSIYKVSDWSIVDSGSARLLAWDTCRDRFALLESSLPP 540
Query: 541 RIPVIPKGGSSRKAKEAAAAA--AAAAAASAASSASVQVRIVLDDGTSNILMRSIGERSE 600
RIP+IPKGG SRKAKEAAAAA AAAAAASAAS+A+VQ+RI+LDDGTSN+ MRSIG RS+
Sbjct: 541 RIPIIPKGG-SRKAKEAAAAAAQAAAAAASAASTATVQLRILLDDGTSNVYMRSIGGRSD 600
Query: 601 PVIGLHGGSLLGVAYRTSRRVSAVAATAISTIQSMPLSGFGSSA-SSFNTFDDGVGSGPL 660
PVIGLHGG+LLGVAYRTSRR+S VAATAISTIQSMPLSGFGSS SSF T DDG S
Sbjct: 601 PVIGLHGGALLGVAYRTSRRISPVAATAISTIQSMPLSGFGSSGLSSFTTLDDGFSSHKS 660
Query: 661 ST----QNFQLYSWETFQPVGSLLPQPEWTAWDQTAEYCAFAYQKYIVISSLRPQYRYLG 720
T QNFQLYSWETF+PVG LLPQPEWTAWDQT EYCAF YQ+YIVISSLRPQYRYLG
Sbjct: 661 PTEAAPQNFQLYSWETFEPVGGLLPQPEWTAWDQTVEYCAFGYQQYIVISSLRPQYRYLG 720
Query: 721 DVAIPYATSAVWHRRQLFVATPTTIEVVFVDAGIAPIDLETKRMKEELRLKEVQARAVAE 780
DVAIPYAT AVWHRRQLFVATPTTIE VFVDAG+APID+ET++MKEE++ KE +ARAVAE
Sbjct: 721 DVAIPYATGAVWHRRQLFVATPTTIECVFVDAGVAPIDIETRKMKEEMKSKEARARAVAE 780
Query: 781 HGELALISVDGPKADTNERVSFRPPMLQVVRLASFQHAPSVPPYLTLPKLSKIDGDDSGM 840
HGELALI+VDGP+ NER++ RPPMLQVVRLASFQH PSVPP+LTLPK SK+DGDDS +
Sbjct: 781 HGELALITVDGPQTVANERIALRPPMLQVVRLASFQHPPSVPPFLTLPKQSKVDGDDSVL 840
Query: 841 P---EDRRANDIAVGGGGVAVAVTRFPMEQKRPIGPLVVVGVRDGVLWLIDRYMRAHALS 900
E+R+ N+IAVGGGGV+VAVTRFP EQ+RP+GPLVVVGVRDGVLWLIDRYM AHALS
Sbjct: 841 QKEMEERKTNEIAVGGGGVSVAVTRFPTEQRRPVGPLVVVGVRDGVLWLIDRYMCAHALS 900
Query: 901 LSHPGIRCRCLAAYGDAVSAVKWATRLGREHHDDLAEFMLGMGYAAEALHLPGISKRLEF 960
LSHPGIRCRCLAAYGDAVSAVKWA+RL REHHDDLA+FMLGMGYA EALHLPGISKRLEF
Sbjct: 901 LSHPGIRCRCLAAYGDAVSAVKWASRLAREHHDDLAQFMLGMGYATEALHLPGISKRLEF 960
Query: 961 DLAMQGNDLKRALQCLLTMSNSRDIGQETSGLDLTNLLNVAAKKENIVDAVQGIAKYAKQ 1020
DLAMQ NDLKRALQCLLTMSNSRDIGQE +GL L ++L++ KKENI+DAVQGI K+AK+
Sbjct: 961 DLAMQSNDLKRALQCLLTMSNSRDIGQENTGLSLNDILSLTTKKENILDAVQGIVKFAKE 1020
Query: 1021 FLELIDAADATAQSEIAREALKRLAAAGSVKGSLQSHELRGLALRLANHGELTRLSTLVN 1080
FL+LIDAADATAQ++IAREALKRLAAAGS+KG+LQ HELRGLALRLANHGELT+LS LVN
Sbjct: 1021 FLDLIDAADATAQADIAREALKRLAAAGSMKGALQGHELRGLALRLANHGELTQLSGLVN 1080
Query: 1081 NLVSLGLGREAAFSAAVLGDNALMEKAWHETGMLAEAVLHSHAHGRPTLKNLVESWNKML 1140
NL+S+GLGREAAF+AAVLGDNALMEKAW +TGMLAEAVLH+HAHGRPTLKNLV++WNKML
Sbjct: 1081 NLISVGLGREAAFAAAVLGDNALMEKAWQDTGMLAEAVLHAHAHGRPTLKNLVQAWNKML 1140
Query: 1141 QKDLEHTRTTKTDATAAFLASLEDPKLPSLGDTGKKAPIEILPPGMSALSLSISAQKKPV 1200
QK++EHT +TKTDA AAFLASLE+PKL SL + GKK PIEILPPGM +LS IS QKKPV
Sbjct: 1141 QKEIEHTPSTKTDAAAAFLASLEEPKLTSLAEAGKKPPIEILPPGMLSLSAPISVQKKPV 1200
Query: 1201 PATKGLQQEPGKPLLLGAPPTAAPINGVTLPPKDSGDSSVP---PPADSSQPGASP---- 1253
PA +G QQ+PGKPLLL APPT ++ T P +S +++ P + + PG P
Sbjct: 1201 PAIQGSQQQPGKPLLLEAPPTTTSVSAPT--PSESSEATAEDNNPSSSVTDPGPDPVALA 1260
BLAST of Spo17192.1 vs. NCBI nr
Match:
gi|296085801|emb|CBI31125.3| (unnamed protein product [Vitis vinifera])
HSP 1 Score: 2043.5 bits (5293), Expect = 0.000e+0
Identity = 1040/1273 (81.70%), Postives = 1147/1273 (90.10%), Query Frame = 1
Query: 1 MLRLRAFRPTNEKIVKIQLHPTHPWLVTSDASDHVSVWNWEHRQVIYELKAGGVDERRLV 60
MLRLR FRPTN+KIVKIQLHPTHPWLVT+DASDHVSVWNWEHRQVIYELKAGG+DERRLV
Sbjct: 1 MLRLRTFRPTNDKIVKIQLHPTHPWLVTADASDHVSVWNWEHRQVIYELKAGGIDERRLV 60
Query: 61 GAKLEKLAEGESESRGKPTEAIRGGSVKQVNFYDDDVRFWQLWRNRAAAAESPSAVSNVT 120
GAKLEKLAEGESE +GKPTEA+RGGSVKQV+FYDDDVRFWQLWRNR+AAAE+PSAV++VT
Sbjct: 61 GAKLEKLAEGESEPKGKPTEAMRGGSVKQVDFYDDDVRFWQLWRNRSAAAEAPSAVNHVT 120
Query: 121 SVLSPLAPATKGRHFLVICCENKAIFLDLVTMRGRDVPKQELDNRSLMCMEFLYRSTAVE 180
S S AP+TKGRHFLVICCENKAIFLDLVTMRGRDVPKQELDN+SL+CMEFL RS +
Sbjct: 121 SAFSSPAPSTKGRHFLVICCENKAIFLDLVTMRGRDVPKQELDNKSLLCMEFLSRSAGGD 180
Query: 181 GPLVAFGGSDGVIRVLSMITWKLARRYTGGHKGAINCLMTFMASSGEALLVSGGSDGLLI 240
PLVAFGGSDGVIRVLSMITWKL RRYTGGHKG+I+CLMTFMASSGEALL+SG SDGLLI
Sbjct: 181 APLVAFGGSDGVIRVLSMITWKLVRRYTGGHKGSISCLMTFMASSGEALLISGASDGLLI 240
Query: 241 LWSADHGHDSRELVPKLSLKAHDGGVVAVELSRVSGGAPQLITIGADKTLAIWDTISFKE 300
LWSADHG DSRELVPKLSLKAHDGGVVAVELSRV GGAPQLITIGADKTLAIWDTISFKE
Sbjct: 241 LWSADHGQDSRELVPKLSLKAHDGGVVAVELSRVIGGAPQLITIGADKTLAIWDTISFKE 300
Query: 301 MRRIKPVPKMSCHSVASWCHPRAPNLDILTCVKDSHIWAIEHPTYSVLTRPLCELSSLVP 360
+RRIKPVPK++CHSVASWCHPRAPNLDILTCVKDSHIWAIEHPTYS LTRPLCELSSLVP
Sbjct: 301 LRRIKPVPKLACHSVASWCHPRAPNLDILTCVKDSHIWAIEHPTYSALTRPLCELSSLVP 360
Query: 361 PHALAPSKKLRVYCMVAHPLQPHLVATGTNIGVIISEFDFRALPPVVALPSPPGSREHSA 420
P LAP+KKLRVYCMVAHPLQPHLVATGTNIGVI+SEFD R+LP V ALP+P GSREHSA
Sbjct: 361 PQVLAPNKKLRVYCMVAHPLQPHLVATGTNIGVIVSEFDARSLPAVAALPTPVGSREHSA 420
Query: 421 VYVVERELKLLTFQLSNTANPSLGSNGSLTESARSKVDS-EALQVKQIKKHISTPVPHDS 480
VYVVERELKLL FQLS+TANPSLGSNGSL+E+ R + DS E L VKQIKKHISTPVPHDS
Sbjct: 421 VYVVERELKLLNFQLSSTANPSLGSNGSLSETGRFRGDSLEPLHVKQIKKHISTPVPHDS 480
Query: 481 YSVLSLSSTGKYLSVVWPDIPYFSIYKVSDWSIVDSGSARLLAWDTCRDRFAILESAVVP 540
YSVLS+SS+GKYL++VWPDIPYFSIYKVSDWSIVDSGSARLLAWDTCRDRFA+LES++ P
Sbjct: 481 YSVLSISSSGKYLAIVWPDIPYFSIYKVSDWSIVDSGSARLLAWDTCRDRFALLESSLPP 540
Query: 541 RIPVIPKGGSSRKAKEAAAAA--AAAAAASAASSASVQVRIVLDDGTSNILMRSIGERSE 600
RIP+IPKGG SRKAKEAAAAA AAAAAASAAS+A+VQ+RI+LDDGTSN+ MRSIG RS+
Sbjct: 541 RIPIIPKGG-SRKAKEAAAAAAQAAAAAASAASTATVQLRILLDDGTSNVYMRSIGGRSD 600
Query: 601 PVIGLHGGSLLGVAYRTSRRVSAVAATAISTIQSMPLSGFGSSA-SSFNTFDDGVGSGPL 660
PVIGLHGG+LLGVAYRTSRR+S VAATAISTIQSMPLSGFGSS SSF T DDG S
Sbjct: 601 PVIGLHGGALLGVAYRTSRRISPVAATAISTIQSMPLSGFGSSGLSSFTTLDDGFSSHKS 660
Query: 661 ST----QNFQLYSWETFQPVGSLLPQPEWTAWDQTAEYCAFAYQKYIVISSLRPQYRYLG 720
T QNFQLYSWETF+PVG LLPQPEWTAWDQT EYCAF YQ+YIVISSLRPQYRYLG
Sbjct: 661 PTEAAPQNFQLYSWETFEPVGGLLPQPEWTAWDQTVEYCAFGYQQYIVISSLRPQYRYLG 720
Query: 721 DVAIPYATSAVWHRRQLFVATPTTIEVVFVDAGIAPIDLETKRMKEELRLKEVQARAVAE 780
DVAIPYAT AVWHRRQLFVATPTTIE VFVDAG+APID+ET++MKEE++ KE +ARAVAE
Sbjct: 721 DVAIPYATGAVWHRRQLFVATPTTIECVFVDAGVAPIDIETRKMKEEMKSKEARARAVAE 780
Query: 781 HGELALISVDGPKADTNERVSFRPPMLQVVRLASFQHAPSVPPYLTLPKLSKIDGDDSGM 840
HGELALI+VDGP+ NER++ RPPMLQVVRLASFQH PSVPP+LTLPK SK+DGDDS +
Sbjct: 781 HGELALITVDGPQTVANERIALRPPMLQVVRLASFQHPPSVPPFLTLPKQSKVDGDDSVL 840
Query: 841 P---EDRRANDIAVGGGGVAVAVTRFPMEQKRPIGPLVVVGVRDGVLWLIDRYMRAHALS 900
E+R+ N+IAVGGGGV+VAVTRFP EQ+RP+GPLVVVGVRDGVLWLIDRYM AHALS
Sbjct: 841 QKEMEERKTNEIAVGGGGVSVAVTRFPTEQRRPVGPLVVVGVRDGVLWLIDRYMCAHALS 900
Query: 901 LSHPGIRCRCLAAYGDAVSAVKWATRLGREHHDDLAEFMLGMGYAAEALHLPGISKRLEF 960
LSHPGIRCRCLAAYGDAVSAVKWA+RL REHHDDLA+FMLGMGYA EALHLPGISKRLEF
Sbjct: 901 LSHPGIRCRCLAAYGDAVSAVKWASRLAREHHDDLAQFMLGMGYATEALHLPGISKRLEF 960
Query: 961 DLAMQGNDLKRALQCLLTMSNSRDIGQETSGLDLTNLLNVAAKKENIVDAVQGIAKYAKQ 1020
DLAMQ NDLKRALQCLLTMSNSRDIGQE +GL L ++L++ KKENI+DAVQGI K+AK+
Sbjct: 961 DLAMQSNDLKRALQCLLTMSNSRDIGQENTGLSLNDILSLTTKKENILDAVQGIVKFAKE 1020
Query: 1021 FLELIDAADATAQSEIAREALKRLAAAGSVKGSLQSHELRGLALRLANHGELTRLSTLVN 1080
FL+LIDAADATAQ++IAREALKRLAAAGS+KG+LQ HELRGLALRLANHGELT+LS LVN
Sbjct: 1021 FLDLIDAADATAQADIAREALKRLAAAGSMKGALQGHELRGLALRLANHGELTQLSGLVN 1080
Query: 1081 NLVSLGLGREAAFSAAVLGDNALMEKAWHETGMLAEAVLHSHAHGRPTLKNLVESWNKML 1140
NL+S+GLGREAAF+AAVLGDNALMEKAW +TGMLAEAVLH+HAHGRPTLKNLV++WNKML
Sbjct: 1081 NLISVGLGREAAFAAAVLGDNALMEKAWQDTGMLAEAVLHAHAHGRPTLKNLVQAWNKML 1140
Query: 1141 QKDLEHTRTTKTDATAAFLASLEDPKLPSLGDTGKKAPIEILPPGMSALSLSISAQKKPV 1200
QK++EHT +TKTDA AAFLASLE+PKL SL + GKK PIEILPPGM +LS IS QKKPV
Sbjct: 1141 QKEIEHTPSTKTDAAAAFLASLEEPKLTSLAEAGKKPPIEILPPGMLSLSAPISVQKKPV 1200
Query: 1201 PATKGLQQEPGKPLLLGAPPTAAPINGVTLPPKDSGDSSVP---PPADSSQPGASP---- 1253
PA +G QQ+PGKPLLL APPT ++ T P +S +++ P + + PG P
Sbjct: 1201 PAIQGSQQQPGKPLLLEAPPTTTSVSAPT--PSESSEATAEDNNPSSSVTDPGPDPVALA 1260
BLAST of Spo17192.1 vs. NCBI nr
Match:
gi|590582372|ref|XP_007014606.1| (Transducin/WD40 repeat-like superfamily protein isoform 2 [Theobroma cacao])
HSP 1 Score: 2025.8 bits (5247), Expect = 0.000e+0
Identity = 1060/1365 (77.66%), Postives = 1176/1365 (86.15%), Query Frame = 1
Query: 1 MLRLRAFRPTNEKIVKIQLHPTHPWLVTSDASDHVSVWNWEHRQVIYELKAGGVDERRLV 60
MLRLRAFR TNEKIVKI +HPTHPWLVT+DASDHVSVWNWEHRQVIYELKAGGVD+RRLV
Sbjct: 1 MLRLRAFRATNEKIVKIAVHPTHPWLVTADASDHVSVWNWEHRQVIYELKAGGVDQRRLV 60
Query: 61 GAKLEKLAEGESESRGKPTEAIRGGSVKQVNFYDDDVRFWQLWRNRAAAAESPSAVSNVT 120
GAKLEKLAEGESE +GKPTEAIRGGSVKQV F+DDDVRFWQLWRNR+AAAE+P+AV+++T
Sbjct: 61 GAKLEKLAEGESEPKGKPTEAIRGGSVKQVTFFDDDVRFWQLWRNRSAAAEAPTAVNHLT 120
Query: 121 SVLSPLAPATKGRHFLVICCENKAIFLDLVTMRGRDVPKQELDNRSLMCMEFLYRSTAVE 180
S + AP+TKGRHFLVICCENKAIFLDLVTMRGRDVPKQELDN+SL+C+EFL RS+A +
Sbjct: 121 SAFASPAPSTKGRHFLVICCENKAIFLDLVTMRGRDVPKQELDNKSLLCLEFLSRSSAGD 180
Query: 181 GPLVAFGGSDGVIRVLSMITWKLARRYTGGHKGAINCLMTFMASSGEALLVSGGSDGLLI 240
PLVAFGGSDGVIRVLSMITWKL RRYTGGHKG+I+CLMTFMASSGEALL SG SDGLLI
Sbjct: 181 SPLVAFGGSDGVIRVLSMITWKLVRRYTGGHKGSISCLMTFMASSGEALLASGASDGLLI 240
Query: 241 LWSADHGHDSRELVPKLSLKAHDGGVVAVELSRVSGGAPQLITIGADKTLAIWDTISFKE 300
LWSADHG DSRELVPKLSLKAHDGGVVAVELSRV GG PQLITIGADKTLAIWDTISFKE
Sbjct: 241 LWSADHGQDSRELVPKLSLKAHDGGVVAVELSRVIGGTPQLITIGADKTLAIWDTISFKE 300
Query: 301 MRRIKPVPKMSCHSVASWCHPRAPNLDILTCVKDSHIWAIEHPTYSVLTRPLCELSSLVP 360
+RRIKPVPK++CHSV SWCHPRAPNLDILTCVKDS+IWAIEHPTYS LTRPLC+LSSLVP
Sbjct: 301 LRRIKPVPKLACHSVVSWCHPRAPNLDILTCVKDSYIWAIEHPTYSALTRPLCDLSSLVP 360
Query: 361 PHALAPSKKLRVYCMVAHPLQPHLVATGTNIGVIISEFDFRALPPVVALPSPPGSREHSA 420
+AP+KKLRVYCMVAHPLQPHLVATGTNIG+I+SEFD R+LPPVV L +PPGSREHSA
Sbjct: 361 -QVVAPNKKLRVYCMVAHPLQPHLVATGTNIGIIVSEFDARSLPPVVPLLTPPGSREHSA 420
Query: 421 VYVVERELKLLTFQLSNTANPSLGSNGSLTESARSKVDS-EALQVKQIKKHISTPVPHDS 480
VY+VERELKLL FQLSNTANPSLG+NGSL+E+ + K DS E L VKQIKKHISTPVPHDS
Sbjct: 421 VYIVERELKLLNFQLSNTANPSLGNNGSLSETGKLKGDSFEPLHVKQIKKHISTPVPHDS 480
Query: 481 YSVLSLSSTGKYLSVVWPDIPYFSIYKVSDWSIVDSGSARLLAWDTCRDRFAILESAVVP 540
YSVLS+SS+GKYL++VWPDIPYFSIYKVSDWSIVDSGSARLLAWDTC DRFAILESA+ P
Sbjct: 481 YSVLSVSSSGKYLAIVWPDIPYFSIYKVSDWSIVDSGSARLLAWDTCCDRFAILESALPP 540
Query: 541 RIPVIPKGGSSRKAKEAAAAAA-AAAAASAASSASVQVRIVLDDGTSNILMRSIGERSEP 600
R+P++PKG SSRKAKEAAAAAA AAAAA+ A+SA+VQVRI+LDDGTSNILMRSIG RSEP
Sbjct: 541 RMPILPKGSSSRKAKEAAAAAAQAAAAAATAASANVQVRILLDDGTSNILMRSIGSRSEP 600
Query: 601 VIGLHGGSLLGVAYRTSRRVSAVAATAISTIQSMPLSGFGSSASSFNTFDDGVGSGPLST 660
VIGLHGG+LLGVAYRTSRR+S +ATAISTIQSMPLSGFGSS S F FDDG S +
Sbjct: 601 VIGLHGGALLGVAYRTSRRISPGSATAISTIQSMPLSGFGSSGS-FAAFDDGFSSNRSPS 660
Query: 661 ----QNFQLYSWETFQPVGSLLPQPEWTAWDQTAEYCAFAYQKYIVISSLRPQYRYLGDV 720
QNFQL+SWETFQPVG LLPQPEWTAWDQT EYCAFAYQ YIVISSLRPQYRYLGDV
Sbjct: 661 EAVPQNFQLFSWETFQPVGGLLPQPEWTAWDQTVEYCAFAYQHYIVISSLRPQYRYLGDV 720
Query: 721 AIPYATSAVWHRRQLFVATPTTIEVVFVDAGIAPIDLETKRMKEELRLKEVQARAVAEHG 780
AI YAT AVW RRQLFVATPTTIE VFVDAG+AP+D+ET++MKEE++LKE QARAVAEHG
Sbjct: 721 AIAYATGAVWQRRQLFVATPTTIECVFVDAGVAPMDIETRKMKEEMKLKEAQARAVAEHG 780
Query: 781 ELALISVDGPKADTNERVSFRPPMLQVVRLASFQHAPSVPPYLTLPKLSKIDGDDSGM-- 840
ELALI+VDGP+ T ER++ RPP+LQVVRLASFQHAPSVPP+L+LPK SK+DGDD+ M
Sbjct: 781 ELALITVDGPQTATQERITLRPPILQVVRLASFQHAPSVPPFLSLPKQSKVDGDDATMLK 840
Query: 841 -PEDRRANDIAVGGGGVAVAVTRFPMEQKRPIGPLVVVGVRDGVLWLIDRYMRAHALSLS 900
E+R+ N++AVGGGGV+VAVTRFP EQKRP+GPL+VVGVRDGVLWLIDRYM AHALSLS
Sbjct: 841 EMEERKVNELAVGGGGVSVAVTRFPTEQKRPVGPLIVVGVRDGVLWLIDRYMTAHALSLS 900
Query: 901 HPGIRCRCLAAYGDAVSAVKWATRLGREHHDDLAEFMLGMGYAAEALHLPGISKRLEFDL 960
HPGIRCRCLAAYGDAVSAVKWATRLGREHHDDLA+FMLGMGYA EALHLPGISKRLEFDL
Sbjct: 901 HPGIRCRCLAAYGDAVSAVKWATRLGREHHDDLAQFMLGMGYATEALHLPGISKRLEFDL 960
Query: 961 AMQGNDLKRALQCLLTMSNSRDIGQETSGLDLTNLLNVAAKKENIVDAVQGIAKYAKQFL 1020
AM+ NDLKRALQCLLTMSNSRDIGQ+ GLDL ++LN+ AKKEN+V+AVQGI K+A +FL
Sbjct: 961 AMKSNDLKRALQCLLTMSNSRDIGQDNPGLDLNDILNLTAKKENLVEAVQGIVKFANEFL 1020
Query: 1021 ELIDAADATAQSEIAREALKRLAAAGSVKGSLQSHELRGLALRLANHGELTRLSTLVNNL 1080
ELIDAADATAQ++IAREALKRLA AGSVKGSLQ HELRGLALRLANHGELTRLS LVNNL
Sbjct: 1021 ELIDAADATAQADIAREALKRLATAGSVKGSLQGHELRGLALRLANHGELTRLSGLVNNL 1080
Query: 1081 VSLGLGREAAFSAAVLGDNALMEKAWHETGMLAEAVLHSHAHGRPTLKNLVESWNKMLQK 1140
+SLGLGREAAFSAAVLGDNALMEKAW +TGMLAEAVLH+HAHGRPTLKNLVE+WN++LQK
Sbjct: 1081 ISLGLGREAAFSAAVLGDNALMEKAWQDTGMLAEAVLHAHAHGRPTLKNLVEAWNRVLQK 1140
Query: 1141 DLEHTRTTKTDATAAFLASLEDPKLPSLGDTGKKAPIEILPPGMSALSLSISAQKKPVPA 1200
++EHT + KTDATAAFLASLEDPKL SL + GKK PIEILPPGMSALS SI+ +KKP P
Sbjct: 1141 EVEHTPSAKTDATAAFLASLEDPKLTSLSEAGKKPPIEILPPGMSALSASITVKKKPAPV 1200
Query: 1201 TKGLQQEPGKPLLLGAPPTAAPING-VTLPPKDSGDSSVPPPADSSQPGASPPADSSQPG 1260
T QQ+PGKPL L APP + P + PP + ++ P + PG A ++ PG
Sbjct: 1201 THSSQQQPGKPLALEAPPPSGPAEAPIGAPPPGASAAAAGTPIGAPPPG----APAATPG 1260
Query: 1261 VPTIAESSQSGAAPPAESILPGAAPPAAASQPGPALSDSIQSGEVGA-----ASAE---- 1320
P A S + AA P GA P + AS+ PAL D S G+ ASAE
Sbjct: 1261 TPIGAPPSGAPAAAPI-----GAPPTSKASE--PALDDKAPSSSAGSNPDMIASAESNPA 1320
Query: 1321 --VSDQPQPGAPSLADLSDPGSSTHVDSGQLEASKPTDFSQSVAL 1345
SD P P A ++AD T + Q E S PT S L
Sbjct: 1321 VTASDTPAPDA-TVADKPLAEVPTVIPDNQ-ETSVPTTLPTSEPL 1350
BLAST of Spo17192.1 vs. UniProtKB/TrEMBL
Match:
A0A0K9R5W0_SPIOL (Uncharacterized protein OS=Spinacia oleracea GN=SOVF_102910 PE=4 SV=1)
HSP 1 Score: 2702.2 bits (7003), Expect = 0.000e+0
Identity = 1397/1401 (99.71%), Postives = 1400/1401 (99.93%), Query Frame = 1
Query: 1 MLRLRAFRPTNEKIVKIQLHPTHPWLVTSDASDHVSVWNWEHRQVIYELKAGGVDERRLV 60
MLRLRAFRPTNEKIVKIQLHPTHPWLVTSDASDHVSVWNWEHRQVIYELKAGGVDERRLV
Sbjct: 1 MLRLRAFRPTNEKIVKIQLHPTHPWLVTSDASDHVSVWNWEHRQVIYELKAGGVDERRLV 60
Query: 61 GAKLEKLAEGESESRGKPTEAIRGGSVKQVNFYDDDVRFWQLWRNRAAAAESPSAVSNVT 120
GAKLEKLAEGESESRGKPTEAIRGGSVKQVNFYDDDVRFWQLWRNRAAAAESPSAVSNVT
Sbjct: 61 GAKLEKLAEGESESRGKPTEAIRGGSVKQVNFYDDDVRFWQLWRNRAAAAESPSAVSNVT 120
Query: 121 SVLSPLAPATKGRHFLVICCENKAIFLDLVTMRGRDVPKQELDNRSLMCMEFLYRSTAVE 180
SVLSPLAPATKGRHFLVICCENKAIFLDLVTMRGRDVPKQELDNRSLMCMEFLYRSTAVE
Sbjct: 121 SVLSPLAPATKGRHFLVICCENKAIFLDLVTMRGRDVPKQELDNRSLMCMEFLYRSTAVE 180
Query: 181 GPLVAFGGSDGVIRVLSMITWKLARRYTGGHKGAINCLMTFMASSGEALLVSGGSDGLLI 240
GPLVAFGGSDGVIRVLSMITWKLARRYTGGHKGAINCLMTFMASSGEALLVSGGSDGLLI
Sbjct: 181 GPLVAFGGSDGVIRVLSMITWKLARRYTGGHKGAINCLMTFMASSGEALLVSGGSDGLLI 240
Query: 241 LWSADHGHDSRELVPKLSLKAHDGGVVAVELSRVSGGAPQLITIGADKTLAIWDTISFKE 300
LWSADHGHDSRELVPKLSLKAHDGGVVAVELSRVSGGAPQLITIGADKTLAIWDTISFKE
Sbjct: 241 LWSADHGHDSRELVPKLSLKAHDGGVVAVELSRVSGGAPQLITIGADKTLAIWDTISFKE 300
Query: 301 MRRIKPVPKMSCHSVASWCHPRAPNLDILTCVKDSHIWAIEHPTYSVLTRPLCELSSLVP 360
MRRIKPVPKMSCHSVASWCHPRAPNLDILTCVKDSHIWAIEHPTYSVLTRPLCELSSLVP
Sbjct: 301 MRRIKPVPKMSCHSVASWCHPRAPNLDILTCVKDSHIWAIEHPTYSVLTRPLCELSSLVP 360
Query: 361 PHALAPSKKLRVYCMVAHPLQPHLVATGTNIGVIISEFDFRALPPVVALPSPPGSREHSA 420
PHALAPSKKLRVYCMVAHPLQPHLVATGTNIGVIISEFDFRALPPVVALPSPPGSREHSA
Sbjct: 361 PHALAPSKKLRVYCMVAHPLQPHLVATGTNIGVIISEFDFRALPPVVALPSPPGSREHSA 420
Query: 421 VYVVERELKLLTFQLSNTANPSLGSNGSLTESARSKVDSEALQVKQIKKHISTPVPHDSY 480
VYVVERELKLLTFQLSNTANPSLGSNGSLTESARSKVDSEALQVKQIKKHISTPVPHDSY
Sbjct: 421 VYVVERELKLLTFQLSNTANPSLGSNGSLTESARSKVDSEALQVKQIKKHISTPVPHDSY 480
Query: 481 SVLSLSSTGKYLSVVWPDIPYFSIYKVSDWSIVDSGSARLLAWDTCRDRFAILESAVVPR 540
SVLSLSSTGKYLSVVWPDIPYFSIYKVSDWSIVDSGSARLLAWDTCRDRFAILESAVVPR
Sbjct: 481 SVLSLSSTGKYLSVVWPDIPYFSIYKVSDWSIVDSGSARLLAWDTCRDRFAILESAVVPR 540
Query: 541 IPVIPKGGSSRKAKEAAAAAAAAAAASAASSASVQVRIVLDDGTSNILMRSIGERSEPVI 600
+P+IPKGGSSRKAKEAAAAAAAAA ASAASSASVQVRIVLDDGTSNILMRSIGERSEPVI
Sbjct: 541 VPIIPKGGSSRKAKEAAAAAAAAA-ASAASSASVQVRIVLDDGTSNILMRSIGERSEPVI 600
Query: 601 GLHGGSLLGVAYRTSRRVSAVAATAISTIQSMPLSGFGSSASSFNTFDDGVGSGPLSTQN 660
GLHGGSLLGVAYRTSRRVSAVAATAISTIQSMPLSGFGSSASSFNTFDDGVGSGPLSTQN
Sbjct: 601 GLHGGSLLGVAYRTSRRVSAVAATAISTIQSMPLSGFGSSASSFNTFDDGVGSGPLSTQN 660
Query: 661 FQLYSWETFQPVGSLLPQPEWTAWDQTAEYCAFAYQKYIVISSLRPQYRYLGDVAIPYAT 720
FQLYSWETFQPVGSLLPQPEWTAWDQTAEYCAFAYQKYIVISSLRPQYRYLGDVAIPYAT
Sbjct: 661 FQLYSWETFQPVGSLLPQPEWTAWDQTAEYCAFAYQKYIVISSLRPQYRYLGDVAIPYAT 720
Query: 721 SAVWHRRQLFVATPTTIEVVFVDAGIAPIDLETKRMKEELRLKEVQARAVAEHGELALIS 780
SAVWHRRQLFVATPTTIEVVFVDAGIAPIDLETKRMKEELRLKEVQARAVAEHGELALIS
Sbjct: 721 SAVWHRRQLFVATPTTIEVVFVDAGIAPIDLETKRMKEELRLKEVQARAVAEHGELALIS 780
Query: 781 VDGPKADTNERVSFRPPMLQVVRLASFQHAPSVPPYLTLPKLSKIDGDDSGMPEDRRAND 840
VDGPKADTNERVSFRPPMLQVVRLASFQHAPSVPPYLTLPKLSKIDGDDSGMPEDRRAND
Sbjct: 781 VDGPKADTNERVSFRPPMLQVVRLASFQHAPSVPPYLTLPKLSKIDGDDSGMPEDRRAND 840
Query: 841 IAVGGGGVAVAVTRFPMEQKRPIGPLVVVGVRDGVLWLIDRYMRAHALSLSHPGIRCRCL 900
IAVGGGGVAVAVTRFPMEQKRPIGPLVVVGVRDGVLWLIDRYMRAHALSLSHPGIRCRCL
Sbjct: 841 IAVGGGGVAVAVTRFPMEQKRPIGPLVVVGVRDGVLWLIDRYMRAHALSLSHPGIRCRCL 900
Query: 901 AAYGDAVSAVKWATRLGREHHDDLAEFMLGMGYAAEALHLPGISKRLEFDLAMQGNDLKR 960
AAYGDAVSAVKWATRLGREHHDDLAEFMLGMGYAAEALHLPGISKRLEFDLAMQGNDLKR
Sbjct: 901 AAYGDAVSAVKWATRLGREHHDDLAEFMLGMGYAAEALHLPGISKRLEFDLAMQGNDLKR 960
Query: 961 ALQCLLTMSNSRDIGQETSGLDLTNLLNVAAKKENIVDAVQGIAKYAKQFLELIDAADAT 1020
ALQCLLTMSNSRDIGQETSGLDLTNLLNVAAKKENIVDAVQGIAKYAKQFLELIDAADAT
Sbjct: 961 ALQCLLTMSNSRDIGQETSGLDLTNLLNVAAKKENIVDAVQGIAKYAKQFLELIDAADAT 1020
Query: 1021 AQSEIAREALKRLAAAGSVKGSLQSHELRGLALRLANHGELTRLSTLVNNLVSLGLGREA 1080
AQSEIAREALKRLAAAGSVKGSLQSHELRGLALRLANHGELTRLSTLVNNLVSLGLGREA
Sbjct: 1021 AQSEIAREALKRLAAAGSVKGSLQSHELRGLALRLANHGELTRLSTLVNNLVSLGLGREA 1080
Query: 1081 AFSAAVLGDNALMEKAWHETGMLAEAVLHSHAHGRPTLKNLVESWNKMLQKDLEHTRTTK 1140
AFSAAVLGDNALMEKAWHETGMLAEAVLHSHAHGRPTLKNLVESWNKMLQKDLEHTRTTK
Sbjct: 1081 AFSAAVLGDNALMEKAWHETGMLAEAVLHSHAHGRPTLKNLVESWNKMLQKDLEHTRTTK 1140
Query: 1141 TDATAAFLASLEDPKLPSLGDTGKKAPIEILPPGMSALSLSISAQKKPVPATKGLQQEPG 1200
TDATAAFLASLEDPKLPSLGDTGKKAPIEILPPGMSALSLSISAQKKPVPATKGLQQEPG
Sbjct: 1141 TDATAAFLASLEDPKLPSLGDTGKKAPIEILPPGMSALSLSISAQKKPVPATKGLQQEPG 1200
Query: 1201 KPLLLGAPPTAAPINGVTLPPKDSGDSSVPPPADSSQPGASPPADSSQPGVPTIAESSQS 1260
KPLLLGAPPTAAPINGVTLPPKDSGDSSVPPPADSSQPGASPPADSSQPGVPTIAESSQS
Sbjct: 1201 KPLLLGAPPTAAPINGVTLPPKDSGDSSVPPPADSSQPGASPPADSSQPGVPTIAESSQS 1260
Query: 1261 GAAPPAESILPGAAPPAAASQPGPALSDSIQSGEVGAASAEVSDQPQPGAPSLADLSDPG 1320
GAAPPAESILPGAAPPAAASQPGPALSDSIQSGEVGAASAEVSDQPQPGAPSLADLSDPG
Sbjct: 1261 GAAPPAESILPGAAPPAAASQPGPALSDSIQSGEVGAASAEVSDQPQPGAPSLADLSDPG 1320
Query: 1321 SSTHVDSGQLEASKPTDFSQSVALPPVESNQPEASQPTSDSTQSVLDLKASEQQLIDSIG 1380
SSTHVDSGQLEASKPTDFSQSVALPPVESNQPEASQPTSDSTQSVLDLKASEQQLIDSIG
Sbjct: 1321 SSTHVDSGQLEASKPTDFSQSVALPPVESNQPEASQPTSDSTQSVLDLKASEQQLIDSIG 1380
Query: 1381 SGVTKPSNSRSPFADLEQLLI 1402
SGVTKPSNSRSPFADLEQLL+
Sbjct: 1381 SGVTKPSNSRSPFADLEQLLM 1400
BLAST of Spo17192.1 vs. UniProtKB/TrEMBL
Match:
A0A0J8FGB7_BETVU (Uncharacterized protein OS=Beta vulgaris subsp. vulgaris GN=BVRB_3g063920 PE=4 SV=1)
HSP 1 Score: 2431.8 bits (6301), Expect = 0.000e+0
Identity = 1269/1407 (90.19%), Postives = 1316/1407 (93.53%), Query Frame = 1
Query: 1 MLRLRAFRPTNEKIVKIQLHPTHPWLVTSDASDHVSVWNWEHRQVIYELKAGGVDERRLV 60
MLRLRAFRPTNEKIVKIQ+HPTHPWLVTSDASDHVSVWNWEHRQVIYELKAGGVDERRLV
Sbjct: 1 MLRLRAFRPTNEKIVKIQVHPTHPWLVTSDASDHVSVWNWEHRQVIYELKAGGVDERRLV 60
Query: 61 GAKLEKLAEGESESRGKPTEAIRGGSVKQVNFYDDDVRFWQLWRNRAAAAESPSAVSNVT 120
GAKLEKLAEGESESRGKPTEAIRGGSVKQVNFYDDDVRFWQLWRNRAAAAE+PSAVS VT
Sbjct: 61 GAKLEKLAEGESESRGKPTEAIRGGSVKQVNFYDDDVRFWQLWRNRAAAAEAPSAVSLVT 120
Query: 121 SVLSPLAPATKGRHFLVICCENKAIFLDLVTMRGRDVPKQELDNRSLMCMEFLYRSTAVE 180
S LSPLAPATKGRHFLVICCENKAIFLDLVTMRGRDVPKQELDNRSLMCMEFLYRSTA E
Sbjct: 121 SALSPLAPATKGRHFLVICCENKAIFLDLVTMRGRDVPKQELDNRSLMCMEFLYRSTAGE 180
Query: 181 GPLVAFGGSDGVIRVLSMITWKLARRYTGGHKGAINCLMTFMASSGEALLVSGGSDGLLI 240
GPLVAFGGSDGVIRVLSMITWKLARRYTGGHKGAINCLMTFMASSGEALLVSGGSDGLLI
Sbjct: 181 GPLVAFGGSDGVIRVLSMITWKLARRYTGGHKGAINCLMTFMASSGEALLVSGGSDGLLI 240
Query: 241 LWSADHGHDSRELVPKLSLKAHDGGVVAVELSRVSGGAPQLITIGADKTLAIWDTISFKE 300
LWSADHGHDSRELVPKLSLKAHDGGVV VELSRVSGGAPQLITIGADKTLAIWDTISFKE
Sbjct: 241 LWSADHGHDSRELVPKLSLKAHDGGVVGVELSRVSGGAPQLITIGADKTLAIWDTISFKE 300
Query: 301 MRRIKPVPKMSCHSVASWCHPRAPNLDILTCVKDSHIWAIEHPTYSVLTRPLCELSSLVP 360
MRRIKPVPKMSCHSVASWCHPRAPNLDILTCVKDSHIWAIEHPTYSVLTRPLCELSSLVP
Sbjct: 301 MRRIKPVPKMSCHSVASWCHPRAPNLDILTCVKDSHIWAIEHPTYSVLTRPLCELSSLVP 360
Query: 361 PHALAPSKKLRVYCMVAHPLQPHLVATGTNIGVIISEFDFRALPPVVALPSPPGSREHSA 420
P ALAP+KKLRVYCMVAHPLQPHLVATGTNIGVIISEFDFRALPPVVALPSPPG+REHSA
Sbjct: 361 PQALAPNKKLRVYCMVAHPLQPHLVATGTNIGVIISEFDFRALPPVVALPSPPGNREHSA 420
Query: 421 VYVVERELKLLTFQLSNTANPSLGSNGSLTESARSKVDSEALQVKQIKKHISTPVPHDSY 480
VYVVERELK+ TFQLSNTAN SLGSNGSLTESARSKVDSEALQVKQIKK +STPVPHDSY
Sbjct: 421 VYVVERELKVSTFQLSNTANQSLGSNGSLTESARSKVDSEALQVKQIKKPVSTPVPHDSY 480
Query: 481 SVLSLSSTGKYLSVVWPDIPYFSIYKVSDWSIVDSGSARLLAWDTCRDRFAILESAVVPR 540
SVLSLSS+GKYL+VVWPDIPYFSIYKVSDWSIVDSGSARLL+WDTCRDRFAILESAVVPR
Sbjct: 481 SVLSLSSSGKYLAVVWPDIPYFSIYKVSDWSIVDSGSARLLSWDTCRDRFAILESAVVPR 540
Query: 541 IPVIPKGGSSRKAKE-AAAAAAAAAAASAASSASVQVRIVLDDGTSNILMRSIGERSEPV 600
+PVIPKGGSSRKAKE AAAAAAAAAAASAASSASVQVRI+LDDGTSNILMRSIG RSEPV
Sbjct: 541 VPVIPKGGSSRKAKEAAAAAAAAAAAASAASSASVQVRILLDDGTSNILMRSIGGRSEPV 600
Query: 601 IGLHGGSLLGVAYRTSRRVSAVAATAISTIQSMPLSGFGSSASSFNTFDDGVGSGPLSTQ 660
IGLHGG+LLG+AYRTSRRVSAVAATAISTIQSMPLSGFGS ASSFNTFDDGVGSG LSTQ
Sbjct: 601 IGLHGGALLGIAYRTSRRVSAVAATAISTIQSMPLSGFGSGASSFNTFDDGVGSGTLSTQ 660
Query: 661 NFQLYSWETFQPVGSLLPQPEWTAWDQTAEYCAFAYQKYIVISSLRPQYRYLGDVAIPYA 720
NFQLYSWETFQPVGSLLPQPEWTAWDQT EYCAFAYQKYIVISSLRPQYRYLGDVAIPYA
Sbjct: 661 NFQLYSWETFQPVGSLLPQPEWTAWDQTVEYCAFAYQKYIVISSLRPQYRYLGDVAIPYA 720
Query: 721 TSAVWHRRQLFVATPTTIEVVFVDAGIAPIDLETKRMKEELRLKEVQARAVAEHGELALI 780
TSAVWHRRQLFVATPTTIEVVFVDAG+A ID+ETKRMKEELRLKEVQARAVAEHGELALI
Sbjct: 721 TSAVWHRRQLFVATPTTIEVVFVDAGVAQIDIETKRMKEELRLKEVQARAVAEHGELALI 780
Query: 781 SVDGPKADTNERVSFRPPMLQVVRLASFQHAPSVPPYLTLPKLSKIDGDDSGMPEDRRAN 840
SVDGPK +TNER+ RPPMLQVVRLASFQHAPSVPPYLTLPKLSKIDGDDSGM D +AN
Sbjct: 781 SVDGPKTETNERIPLRPPMLQVVRLASFQHAPSVPPYLTLPKLSKIDGDDSGM-SDGKAN 840
Query: 841 DIAVGGGGVAVAVTRFPMEQKRPIGPLVVVGVRDGVLWLIDRYMRAHALSLSHPGIRCRC 900
+IAVGGGGVAVAVTRFPMEQKRPIGPLVVVGV+DGVLWLIDRYMRAHALSLSHPGIRCRC
Sbjct: 841 EIAVGGGGVAVAVTRFPMEQKRPIGPLVVVGVKDGVLWLIDRYMRAHALSLSHPGIRCRC 900
Query: 901 LAAYGDAVSAVKWATRLGREHHDDLAEFMLGMGYAAEALHLPGISKRLEFDLAMQGNDLK 960
LAAYGDAVSAVKWATRLGREHHDDLA+FMLGMGYAAEALHLPGISKRLEFDLAMQGNDLK
Sbjct: 901 LAAYGDAVSAVKWATRLGREHHDDLAQFMLGMGYAAEALHLPGISKRLEFDLAMQGNDLK 960
Query: 961 RALQCLLTMSNSRDIGQETSGLDLTNLLNVAAKKENIVDAVQGIAKYAKQFLELIDAADA 1020
RALQCLLTMSNSRDIG+ETSGLDLTNLLNVAAKKENIVDAVQGI K+AKQFL+LIDAADA
Sbjct: 961 RALQCLLTMSNSRDIGKETSGLDLTNLLNVAAKKENIVDAVQGITKFAKQFLDLIDAADA 1020
Query: 1021 TAQSEIAREALKRLAAAGSVKGSLQSHELRGLALRLANHGELTRLSTLVNNLVSLGLGRE 1080
TAQ+EIAREALKRLAAAGS+KGSLQSHELRGLALRLANHGELTRLSTLVNNLVSLGLGRE
Sbjct: 1021 TAQAEIAREALKRLAAAGSIKGSLQSHELRGLALRLANHGELTRLSTLVNNLVSLGLGRE 1080
Query: 1081 AAFSAAVLGDNALMEKAWHETGMLAEAVLHSHAHGRPTLKNLVESWNKMLQKDLEHTR-T 1140
AAFSAAVLGDNALMEKAWHETGMLAEAVLHSHAHGRPTLKNLVE+WNKMLQK++EHTR T
Sbjct: 1081 AAFSAAVLGDNALMEKAWHETGMLAEAVLHSHAHGRPTLKNLVEAWNKMLQKEIEHTRTT 1140
Query: 1141 TKTDATAAFLASLEDPKLPSLGDTGKKAPIEILPPGMSALSLSISAQKKPVPATKGLQQE 1200
TKTDATAAFLASLEDPKLPSLGDT KK PIEILPPGMS+LSLSISA KKP+P KG QQE
Sbjct: 1141 TKTDATAAFLASLEDPKLPSLGDTDKKPPIEILPPGMSSLSLSISAPKKPLPTVKGSQQE 1200
Query: 1201 PGKPLLLGAPPTAAPINGVTLPPKDSGDSSVPPPADSSQPGASPPADSSQPGVPTIAESS 1260
PGKPLLLGAPPT APINGVT PP DSGDSSV PPA+SSQPGA A+SSQPG P+ AESS
Sbjct: 1201 PGKPLLLGAPPTTAPINGVTPPPTDSGDSSVVPPAESSQPGAPSTAESSQPGAPSTAESS 1260
Query: 1261 QSGAAPPAESILPGAAPPAAASQPGPALSDSIQ----SGEVGAASAEVSDQPQP-GAPSL 1320
Q GA ES GA +SQP DS Q SGE GA SAEVS+QP+P G SL
Sbjct: 1261 QPGAPSTVESSQSGAPSTVESSQPEVPPPDSTQVGEESGEAGAPSAEVSNQPKPEGTSSL 1320
Query: 1321 ADLSDPGSSTHVDSGQLEASKPTDFSQSVALPPVESNQPEASQPTSDSTQSVLDLKASEQ 1380
D S+ G+S V+S + E +K D SQS A PP +SNQPEA PT+D TQ LD K +EQ
Sbjct: 1321 PDSSETGASNQVNSSESEPTKLADSSQSGAPPPADSNQPEAPPPTTDLTQLPLDPKVAEQ 1380
Query: 1381 QLIDSIGSGVTKPSNSRSPFADLEQLL 1401
QL+DSIG V KPSNSRSPFADL L+
Sbjct: 1381 QLLDSIGPEVAKPSNSRSPFADLALLM 1406
BLAST of Spo17192.1 vs. UniProtKB/TrEMBL
Match:
A0A061GQY0_THECC (Transducin/WD40 repeat-like superfamily protein isoform 2 OS=Theobroma cacao GN=TCM_039839 PE=4 SV=1)
HSP 1 Score: 2025.8 bits (5247), Expect = 0.000e+0
Identity = 1060/1365 (77.66%), Postives = 1176/1365 (86.15%), Query Frame = 1
Query: 1 MLRLRAFRPTNEKIVKIQLHPTHPWLVTSDASDHVSVWNWEHRQVIYELKAGGVDERRLV 60
MLRLRAFR TNEKIVKI +HPTHPWLVT+DASDHVSVWNWEHRQVIYELKAGGVD+RRLV
Sbjct: 1 MLRLRAFRATNEKIVKIAVHPTHPWLVTADASDHVSVWNWEHRQVIYELKAGGVDQRRLV 60
Query: 61 GAKLEKLAEGESESRGKPTEAIRGGSVKQVNFYDDDVRFWQLWRNRAAAAESPSAVSNVT 120
GAKLEKLAEGESE +GKPTEAIRGGSVKQV F+DDDVRFWQLWRNR+AAAE+P+AV+++T
Sbjct: 61 GAKLEKLAEGESEPKGKPTEAIRGGSVKQVTFFDDDVRFWQLWRNRSAAAEAPTAVNHLT 120
Query: 121 SVLSPLAPATKGRHFLVICCENKAIFLDLVTMRGRDVPKQELDNRSLMCMEFLYRSTAVE 180
S + AP+TKGRHFLVICCENKAIFLDLVTMRGRDVPKQELDN+SL+C+EFL RS+A +
Sbjct: 121 SAFASPAPSTKGRHFLVICCENKAIFLDLVTMRGRDVPKQELDNKSLLCLEFLSRSSAGD 180
Query: 181 GPLVAFGGSDGVIRVLSMITWKLARRYTGGHKGAINCLMTFMASSGEALLVSGGSDGLLI 240
PLVAFGGSDGVIRVLSMITWKL RRYTGGHKG+I+CLMTFMASSGEALL SG SDGLLI
Sbjct: 181 SPLVAFGGSDGVIRVLSMITWKLVRRYTGGHKGSISCLMTFMASSGEALLASGASDGLLI 240
Query: 241 LWSADHGHDSRELVPKLSLKAHDGGVVAVELSRVSGGAPQLITIGADKTLAIWDTISFKE 300
LWSADHG DSRELVPKLSLKAHDGGVVAVELSRV GG PQLITIGADKTLAIWDTISFKE
Sbjct: 241 LWSADHGQDSRELVPKLSLKAHDGGVVAVELSRVIGGTPQLITIGADKTLAIWDTISFKE 300
Query: 301 MRRIKPVPKMSCHSVASWCHPRAPNLDILTCVKDSHIWAIEHPTYSVLTRPLCELSSLVP 360
+RRIKPVPK++CHSV SWCHPRAPNLDILTCVKDS+IWAIEHPTYS LTRPLC+LSSLVP
Sbjct: 301 LRRIKPVPKLACHSVVSWCHPRAPNLDILTCVKDSYIWAIEHPTYSALTRPLCDLSSLVP 360
Query: 361 PHALAPSKKLRVYCMVAHPLQPHLVATGTNIGVIISEFDFRALPPVVALPSPPGSREHSA 420
+AP+KKLRVYCMVAHPLQPHLVATGTNIG+I+SEFD R+LPPVV L +PPGSREHSA
Sbjct: 361 -QVVAPNKKLRVYCMVAHPLQPHLVATGTNIGIIVSEFDARSLPPVVPLLTPPGSREHSA 420
Query: 421 VYVVERELKLLTFQLSNTANPSLGSNGSLTESARSKVDS-EALQVKQIKKHISTPVPHDS 480
VY+VERELKLL FQLSNTANPSLG+NGSL+E+ + K DS E L VKQIKKHISTPVPHDS
Sbjct: 421 VYIVERELKLLNFQLSNTANPSLGNNGSLSETGKLKGDSFEPLHVKQIKKHISTPVPHDS 480
Query: 481 YSVLSLSSTGKYLSVVWPDIPYFSIYKVSDWSIVDSGSARLLAWDTCRDRFAILESAVVP 540
YSVLS+SS+GKYL++VWPDIPYFSIYKVSDWSIVDSGSARLLAWDTC DRFAILESA+ P
Sbjct: 481 YSVLSVSSSGKYLAIVWPDIPYFSIYKVSDWSIVDSGSARLLAWDTCCDRFAILESALPP 540
Query: 541 RIPVIPKGGSSRKAKEAAAAAA-AAAAASAASSASVQVRIVLDDGTSNILMRSIGERSEP 600
R+P++PKG SSRKAKEAAAAAA AAAAA+ A+SA+VQVRI+LDDGTSNILMRSIG RSEP
Sbjct: 541 RMPILPKGSSSRKAKEAAAAAAQAAAAAATAASANVQVRILLDDGTSNILMRSIGSRSEP 600
Query: 601 VIGLHGGSLLGVAYRTSRRVSAVAATAISTIQSMPLSGFGSSASSFNTFDDGVGSGPLST 660
VIGLHGG+LLGVAYRTSRR+S +ATAISTIQSMPLSGFGSS S F FDDG S +
Sbjct: 601 VIGLHGGALLGVAYRTSRRISPGSATAISTIQSMPLSGFGSSGS-FAAFDDGFSSNRSPS 660
Query: 661 ----QNFQLYSWETFQPVGSLLPQPEWTAWDQTAEYCAFAYQKYIVISSLRPQYRYLGDV 720
QNFQL+SWETFQPVG LLPQPEWTAWDQT EYCAFAYQ YIVISSLRPQYRYLGDV
Sbjct: 661 EAVPQNFQLFSWETFQPVGGLLPQPEWTAWDQTVEYCAFAYQHYIVISSLRPQYRYLGDV 720
Query: 721 AIPYATSAVWHRRQLFVATPTTIEVVFVDAGIAPIDLETKRMKEELRLKEVQARAVAEHG 780
AI YAT AVW RRQLFVATPTTIE VFVDAG+AP+D+ET++MKEE++LKE QARAVAEHG
Sbjct: 721 AIAYATGAVWQRRQLFVATPTTIECVFVDAGVAPMDIETRKMKEEMKLKEAQARAVAEHG 780
Query: 781 ELALISVDGPKADTNERVSFRPPMLQVVRLASFQHAPSVPPYLTLPKLSKIDGDDSGM-- 840
ELALI+VDGP+ T ER++ RPP+LQVVRLASFQHAPSVPP+L+LPK SK+DGDD+ M
Sbjct: 781 ELALITVDGPQTATQERITLRPPILQVVRLASFQHAPSVPPFLSLPKQSKVDGDDATMLK 840
Query: 841 -PEDRRANDIAVGGGGVAVAVTRFPMEQKRPIGPLVVVGVRDGVLWLIDRYMRAHALSLS 900
E+R+ N++AVGGGGV+VAVTRFP EQKRP+GPL+VVGVRDGVLWLIDRYM AHALSLS
Sbjct: 841 EMEERKVNELAVGGGGVSVAVTRFPTEQKRPVGPLIVVGVRDGVLWLIDRYMTAHALSLS 900
Query: 901 HPGIRCRCLAAYGDAVSAVKWATRLGREHHDDLAEFMLGMGYAAEALHLPGISKRLEFDL 960
HPGIRCRCLAAYGDAVSAVKWATRLGREHHDDLA+FMLGMGYA EALHLPGISKRLEFDL
Sbjct: 901 HPGIRCRCLAAYGDAVSAVKWATRLGREHHDDLAQFMLGMGYATEALHLPGISKRLEFDL 960
Query: 961 AMQGNDLKRALQCLLTMSNSRDIGQETSGLDLTNLLNVAAKKENIVDAVQGIAKYAKQFL 1020
AM+ NDLKRALQCLLTMSNSRDIGQ+ GLDL ++LN+ AKKEN+V+AVQGI K+A +FL
Sbjct: 961 AMKSNDLKRALQCLLTMSNSRDIGQDNPGLDLNDILNLTAKKENLVEAVQGIVKFANEFL 1020
Query: 1021 ELIDAADATAQSEIAREALKRLAAAGSVKGSLQSHELRGLALRLANHGELTRLSTLVNNL 1080
ELIDAADATAQ++IAREALKRLA AGSVKGSLQ HELRGLALRLANHGELTRLS LVNNL
Sbjct: 1021 ELIDAADATAQADIAREALKRLATAGSVKGSLQGHELRGLALRLANHGELTRLSGLVNNL 1080
Query: 1081 VSLGLGREAAFSAAVLGDNALMEKAWHETGMLAEAVLHSHAHGRPTLKNLVESWNKMLQK 1140
+SLGLGREAAFSAAVLGDNALMEKAW +TGMLAEAVLH+HAHGRPTLKNLVE+WN++LQK
Sbjct: 1081 ISLGLGREAAFSAAVLGDNALMEKAWQDTGMLAEAVLHAHAHGRPTLKNLVEAWNRVLQK 1140
Query: 1141 DLEHTRTTKTDATAAFLASLEDPKLPSLGDTGKKAPIEILPPGMSALSLSISAQKKPVPA 1200
++EHT + KTDATAAFLASLEDPKL SL + GKK PIEILPPGMSALS SI+ +KKP P
Sbjct: 1141 EVEHTPSAKTDATAAFLASLEDPKLTSLSEAGKKPPIEILPPGMSALSASITVKKKPAPV 1200
Query: 1201 TKGLQQEPGKPLLLGAPPTAAPING-VTLPPKDSGDSSVPPPADSSQPGASPPADSSQPG 1260
T QQ+PGKPL L APP + P + PP + ++ P + PG A ++ PG
Sbjct: 1201 THSSQQQPGKPLALEAPPPSGPAEAPIGAPPPGASAAAAGTPIGAPPPG----APAATPG 1260
Query: 1261 VPTIAESSQSGAAPPAESILPGAAPPAAASQPGPALSDSIQSGEVGA-----ASAE---- 1320
P A S + AA P GA P + AS+ PAL D S G+ ASAE
Sbjct: 1261 TPIGAPPSGAPAAAPI-----GAPPTSKASE--PALDDKAPSSSAGSNPDMIASAESNPA 1320
Query: 1321 --VSDQPQPGAPSLADLSDPGSSTHVDSGQLEASKPTDFSQSVAL 1345
SD P P A ++AD T + Q E S PT S L
Sbjct: 1321 VTASDTPAPDA-TVADKPLAEVPTVIPDNQ-ETSVPTTLPTSEPL 1350
BLAST of Spo17192.1 vs. UniProtKB/TrEMBL
Match:
A0A061GRW4_THECC (Transducin/WD40 repeat-like superfamily protein isoform 3 OS=Theobroma cacao GN=TCM_039839 PE=4 SV=1)
HSP 1 Score: 2021.1 bits (5235), Expect = 0.000e+0
Identity = 1058/1365 (77.51%), Postives = 1175/1365 (86.08%), Query Frame = 1
Query: 1 MLRLRAFRPTNEKIVKIQLHPTHPWLVTSDASDHVSVWNWEHRQVIYELKAGGVDERRLV 60
MLRLRAFR TNEKIVKI +HPTHPWLVT+DASDHVSVWNWEHRQVIYELKAGGVD+RRLV
Sbjct: 1 MLRLRAFRATNEKIVKIAVHPTHPWLVTADASDHVSVWNWEHRQVIYELKAGGVDQRRLV 60
Query: 61 GAKLEKLAEGESESRGKPTEAIRGGSVKQVNFYDDDVRFWQLWRNRAAAAESPSAVSNVT 120
GAKLEKLAEGESE +GKPTEAIRGGSVKQV F+DDDVRFWQLWRNR+AAAE+P+AV+++T
Sbjct: 61 GAKLEKLAEGESEPKGKPTEAIRGGSVKQVTFFDDDVRFWQLWRNRSAAAEAPTAVNHLT 120
Query: 121 SVLSPLAPATKGRHFLVICCENKAIFLDLVTMRGRDVPKQELDNRSLMCMEFLYRSTAVE 180
S + AP+TKGRHFLVICCENKAIFLDLVTMRGRDVPKQELDN+SL+C+EFL RS+A +
Sbjct: 121 SAFASPAPSTKGRHFLVICCENKAIFLDLVTMRGRDVPKQELDNKSLLCLEFLSRSSAGD 180
Query: 181 GPLVAFGGSDGVIRVLSMITWKLARRYTGGHKGAINCLMTFMASSGEALLVSGGSDGLLI 240
PLVAFGGSDGVIRVLSMITWKL RRYTGGHKG+I+CLMTFMASS +ALL SG SDGLLI
Sbjct: 181 SPLVAFGGSDGVIRVLSMITWKLVRRYTGGHKGSISCLMTFMASSVQALLASGASDGLLI 240
Query: 241 LWSADHGHDSRELVPKLSLKAHDGGVVAVELSRVSGGAPQLITIGADKTLAIWDTISFKE 300
LWSADHG DSRELVPKLSLKAHDGGVVAVELSRV GG PQLITIGADKTLAIWDTISFKE
Sbjct: 241 LWSADHGQDSRELVPKLSLKAHDGGVVAVELSRVIGGTPQLITIGADKTLAIWDTISFKE 300
Query: 301 MRRIKPVPKMSCHSVASWCHPRAPNLDILTCVKDSHIWAIEHPTYSVLTRPLCELSSLVP 360
+RRIKPVPK++CHSV SWCHPRAPNLDILTCVKDS+IWAIEHPTYS LTRPLC+LSSLVP
Sbjct: 301 LRRIKPVPKLACHSVVSWCHPRAPNLDILTCVKDSYIWAIEHPTYSALTRPLCDLSSLVP 360
Query: 361 PHALAPSKKLRVYCMVAHPLQPHLVATGTNIGVIISEFDFRALPPVVALPSPPGSREHSA 420
+AP+KKLRVYCMVAHPLQPHLVATGTNIG+I+SEFD R+LPPVV L +PPGSREHSA
Sbjct: 361 -QVVAPNKKLRVYCMVAHPLQPHLVATGTNIGIIVSEFDARSLPPVVPLLTPPGSREHSA 420
Query: 421 VYVVERELKLLTFQLSNTANPSLGSNGSLTESARSKVDS-EALQVKQIKKHISTPVPHDS 480
VY+VERELKLL FQLSNTANPSLG+NGSL+E+ + K DS E L VKQIKKHISTPVPHDS
Sbjct: 421 VYIVERELKLLNFQLSNTANPSLGNNGSLSETGKLKGDSFEPLHVKQIKKHISTPVPHDS 480
Query: 481 YSVLSLSSTGKYLSVVWPDIPYFSIYKVSDWSIVDSGSARLLAWDTCRDRFAILESAVVP 540
YSVLS+SS+GKYL++VWPDIPYFSIYKVSDWSIVDSGSARLLAWDTC DRFAILESA+ P
Sbjct: 481 YSVLSVSSSGKYLAIVWPDIPYFSIYKVSDWSIVDSGSARLLAWDTCCDRFAILESALPP 540
Query: 541 RIPVIPKGGSSRKAKEAAAAAA-AAAAASAASSASVQVRIVLDDGTSNILMRSIGERSEP 600
R+P++PKG SSRKAKEAAAAAA AAAAA+ A+SA+VQVRI+LDDGTSNILMRSIG RSEP
Sbjct: 541 RMPILPKGSSSRKAKEAAAAAAQAAAAAATAASANVQVRILLDDGTSNILMRSIGSRSEP 600
Query: 601 VIGLHGGSLLGVAYRTSRRVSAVAATAISTIQSMPLSGFGSSASSFNTFDDGVGSGPLST 660
VIGLHGG+LLGVAYRTSRR+S +ATAISTIQSMPLSGFGSS S F FDDG S +
Sbjct: 601 VIGLHGGALLGVAYRTSRRISPGSATAISTIQSMPLSGFGSSGS-FAAFDDGFSSNRSPS 660
Query: 661 ----QNFQLYSWETFQPVGSLLPQPEWTAWDQTAEYCAFAYQKYIVISSLRPQYRYLGDV 720
QNFQL+SWETFQPVG LLPQPEWTAWDQT EYCAFAYQ YIVISSLRPQYRYLGDV
Sbjct: 661 EAVPQNFQLFSWETFQPVGGLLPQPEWTAWDQTVEYCAFAYQHYIVISSLRPQYRYLGDV 720
Query: 721 AIPYATSAVWHRRQLFVATPTTIEVVFVDAGIAPIDLETKRMKEELRLKEVQARAVAEHG 780
AI YAT AVW RRQLFVATPTTIE VFVDAG+AP+D+ET++MKEE++LKE QARAVAEHG
Sbjct: 721 AIAYATGAVWQRRQLFVATPTTIECVFVDAGVAPMDIETRKMKEEMKLKEAQARAVAEHG 780
Query: 781 ELALISVDGPKADTNERVSFRPPMLQVVRLASFQHAPSVPPYLTLPKLSKIDGDDSGM-- 840
ELALI+VDGP+ T ER++ RPP+LQVVRLASFQHAPSVPP+L+LPK SK+DGDD+ M
Sbjct: 781 ELALITVDGPQTATQERITLRPPILQVVRLASFQHAPSVPPFLSLPKQSKVDGDDATMLK 840
Query: 841 -PEDRRANDIAVGGGGVAVAVTRFPMEQKRPIGPLVVVGVRDGVLWLIDRYMRAHALSLS 900
E+R+ N++AVGGGGV+VAVTRFP EQKRP+GPL+VVGVRDGVLWLIDRYM AHALSLS
Sbjct: 841 EMEERKVNELAVGGGGVSVAVTRFPTEQKRPVGPLIVVGVRDGVLWLIDRYMTAHALSLS 900
Query: 901 HPGIRCRCLAAYGDAVSAVKWATRLGREHHDDLAEFMLGMGYAAEALHLPGISKRLEFDL 960
HPGIRCRCLAAYGDAVSAVKWATRLGREHHDDLA+FMLGMGYA EALHLPGISKRLEFDL
Sbjct: 901 HPGIRCRCLAAYGDAVSAVKWATRLGREHHDDLAQFMLGMGYATEALHLPGISKRLEFDL 960
Query: 961 AMQGNDLKRALQCLLTMSNSRDIGQETSGLDLTNLLNVAAKKENIVDAVQGIAKYAKQFL 1020
AM+ NDLKRALQCLLTMSNSRDIGQ+ GLDL ++LN+ AKKEN+V+AVQGI K+A +FL
Sbjct: 961 AMKSNDLKRALQCLLTMSNSRDIGQDNPGLDLNDILNLTAKKENLVEAVQGIVKFANEFL 1020
Query: 1021 ELIDAADATAQSEIAREALKRLAAAGSVKGSLQSHELRGLALRLANHGELTRLSTLVNNL 1080
ELIDAADATAQ++IAREALKRLA AGSVKGSLQ HELRGLALRLANHGELTRLS LVNNL
Sbjct: 1021 ELIDAADATAQADIAREALKRLATAGSVKGSLQGHELRGLALRLANHGELTRLSGLVNNL 1080
Query: 1081 VSLGLGREAAFSAAVLGDNALMEKAWHETGMLAEAVLHSHAHGRPTLKNLVESWNKMLQK 1140
+SLGLGREAAFSAAVLGDNALMEKAW +TGMLAEAVLH+HAHGRPTLKNLVE+WN++LQK
Sbjct: 1081 ISLGLGREAAFSAAVLGDNALMEKAWQDTGMLAEAVLHAHAHGRPTLKNLVEAWNRVLQK 1140
Query: 1141 DLEHTRTTKTDATAAFLASLEDPKLPSLGDTGKKAPIEILPPGMSALSLSISAQKKPVPA 1200
++EHT + KTDATAAFLASLEDPKL SL + GKK PIEILPPGMSALS SI+ +KKP P
Sbjct: 1141 EVEHTPSAKTDATAAFLASLEDPKLTSLSEAGKKPPIEILPPGMSALSASITVKKKPAPV 1200
Query: 1201 TKGLQQEPGKPLLLGAPPTAAPING-VTLPPKDSGDSSVPPPADSSQPGASPPADSSQPG 1260
T QQ+PGKPL L APP + P + PP + ++ P + PG A ++ PG
Sbjct: 1201 THSSQQQPGKPLALEAPPPSGPAEAPIGAPPPGASAAAAGTPIGAPPPG----APAATPG 1260
Query: 1261 VPTIAESSQSGAAPPAESILPGAAPPAAASQPGPALSDSIQSGEVGA-----ASAE---- 1320
P A S + AA P GA P + AS+ PAL D S G+ ASAE
Sbjct: 1261 TPIGAPPSGAPAAAPI-----GAPPTSKASE--PALDDKAPSSSAGSNPDMIASAESNPA 1320
Query: 1321 --VSDQPQPGAPSLADLSDPGSSTHVDSGQLEASKPTDFSQSVAL 1345
SD P P A ++AD T + Q E S PT S L
Sbjct: 1321 VTASDTPAPDA-TVADKPLAEVPTVIPDNQ-ETSVPTTLPTSEPL 1350
BLAST of Spo17192.1 vs. UniProtKB/TrEMBL
Match:
B9S5A6_RICCO (Putative uncharacterized protein OS=Ricinus communis GN=RCOM_0888260 PE=4 SV=1)
HSP 1 Score: 2018.0 bits (5227), Expect = 0.000e+0
Identity = 1043/1318 (79.14%), Postives = 1163/1318 (88.24%), Query Frame = 1
Query: 1 MLRLRAFRPTNEKIVKIQLHPTHPWLVTSDASDHVSVWNWEHRQVIYELKAGGVDERRLV 60
MLRLRA+RP++EKIVKIQLHPTHPWLVT+DASD VSVWNWEHRQVIYELKAGGVDERRLV
Sbjct: 1 MLRLRAYRPSSEKIVKIQLHPTHPWLVTADASDRVSVWNWEHRQVIYELKAGGVDERRLV 60
Query: 61 GAKLEKLAEGESESRGKPTEAIRGGSVKQVNFYDDDVRFWQLWRNRAAAAESPSAVSNVT 120
GAKLEKLAEGES+ +GKPTEA+RGGSVKQV+FYDDDVRFWQLW NR+AAAE+PSAV+NV+
Sbjct: 61 GAKLEKLAEGESDIKGKPTEAMRGGSVKQVSFYDDDVRFWQLWHNRSAAAEAPSAVNNVS 120
Query: 121 SVLSPLAPATKGRHFLVICCENKAIFLDLVTMRGRDVPKQELDNRSLMCMEFLYRSTAVE 180
+ SP AP+TKGRHFLVICCENKAIFLDLVTMRGRDV KQELDN+SL+CMEFL RSTA +
Sbjct: 121 TFTSP-APSTKGRHFLVICCENKAIFLDLVTMRGRDVLKQELDNKSLLCMEFLCRSTAGD 180
Query: 181 GPLVAFGGSDGVIRVLSMITWKLARRYTGGHKGAINCLMTFMASSGEALLVSGGSDGLLI 240
GPLVAFGGSDGVIRVLSMITWKL RRYTGGHKG+I+CLMTFMASSGE LL+SGGSDGLL+
Sbjct: 181 GPLVAFGGSDGVIRVLSMITWKLVRRYTGGHKGSISCLMTFMASSGEGLLISGGSDGLLV 240
Query: 241 LWSADHGHDSRELVPKLSLKAHDGGVVAVELSRVSGGAPQLITIGADKTLAIWDTISFKE 300
LWSADHG DSRELVPKLSLKAHDGGVVA+ELSRV GGAPQLITIGADKTLAIWDTISFKE
Sbjct: 241 LWSADHGQDSRELVPKLSLKAHDGGVVAIELSRVIGGAPQLITIGADKTLAIWDTISFKE 300
Query: 301 MRRIKPVPKMSCHSVASWCHPRAPNLDILTCVKDSHIWAIEHPTYSVLTRPLCELSSLVP 360
+RRIKPVPK++CHSVASWCHPRAPNLDILTCVKDSHIWAIEHPTYS LTRPLCELSSLVP
Sbjct: 301 LRRIKPVPKLTCHSVASWCHPRAPNLDILTCVKDSHIWAIEHPTYSALTRPLCELSSLVP 360
Query: 361 PHALAPSKKLRVYCMVAHPLQPHLVATGTNIGVIISEFDFRALPPVVALPSPPGSREHSA 420
P LAP+KKLRVYCMVAH LQPHLV TGTNIGVI+SEFD R+LP V ALP+P G+REHSA
Sbjct: 361 PQVLAPNKKLRVYCMVAHSLQPHLVVTGTNIGVIVSEFDPRSLPAVAALPTPSGNREHSA 420
Query: 421 VYVVERELKLLTFQLSNTANPSLGSNGSLTESARSKVDS-EALQVKQIKKHISTPVPHDS 480
VYVVERELKLL FQLSNTAN SLGSNGSL+E+ + K DS E L VKQIKKHISTPVPHDS
Sbjct: 421 VYVVERELKLLNFQLSNTANLSLGSNGSLSETGKYKGDSSEPLLVKQIKKHISTPVPHDS 480
Query: 481 YSVLSLSSTGKYLSVVWPDIPYFSIYKVSDWSIVDSGSARLLAWDTCRDRFAILESAVVP 540
YSVLS+SS+GKYL++VWPDIPYFSIYKVSDWSIVDSGSARLLAWDTCRDRFAILESA+ P
Sbjct: 481 YSVLSVSSSGKYLAIVWPDIPYFSIYKVSDWSIVDSGSARLLAWDTCRDRFAILESALAP 540
Query: 541 RIPVIPKGGSSRKAKEAAAAAA--AAAAASAASSASVQVRIVLDDGTSNILMRSIGERSE 600
RIPVIPKG SSRKAKEAAAAAA AAAAASAAS+ASVQVRI+L+DGTSNILMRSIG RSE
Sbjct: 541 RIPVIPKGVSSRKAKEAAAAAAQAAAAAASAASAASVQVRILLEDGTSNILMRSIGSRSE 600
Query: 601 PVIGLHGGSLLGVAYRTSRRVSAVAATAISTIQSMPLSGFGSSA-SSFNTFDDGVGSGPL 660
PVIGLHGG+LLGVAYRTSRRVS +AATAISTIQSMPLSGFG S SSF+TF+DG S
Sbjct: 601 PVIGLHGGALLGVAYRTSRRVSPIAATAISTIQSMPLSGFGGSGVSSFSTFEDGFSSQRS 660
Query: 661 ST----QNFQLYSWETFQPVGSLLPQPEWTAWDQTAEYCAFAYQKYIVISSLRPQYRYLG 720
+T QNF+LYSWETF+PVG LLPQPEWTAWDQT EYCAFAYQ+YIVISSLRPQYRYLG
Sbjct: 661 ATEAAPQNFELYSWETFEPVGGLLPQPEWTAWDQTVEYCAFAYQQYIVISSLRPQYRYLG 720
Query: 721 DVAIPYATSAVWHRRQLFVATPTTIEVVFVDAGIAPIDLETKRMKEELRLKEVQARAVAE 780
DVAIPYAT AVWHRRQLFVATPTTIE VFVDAGIA ID+ET++MKEE+++KE QARA+AE
Sbjct: 721 DVAIPYATGAVWHRRQLFVATPTTIECVFVDAGIAAIDIETRKMKEEMKMKEAQARAIAE 780
Query: 781 HGELALISVDGPKADTNERVSFRPPMLQVVRLASFQHAPSVPPYLTLPKLSKIDGDDSGM 840
HG+LALI+V+GP++ + ER+ RPPMLQVVRLASFQH PSVPP+LTLPK +K+D DS +
Sbjct: 781 HGDLALITVEGPQSASQERIKLRPPMLQVVRLASFQHVPSVPPFLTLPKQTKVDDGDSAL 840
Query: 841 PED-RRANDIAVGGGGVAVAVTRFPMEQKRPIGPLVVVGVRDGVLWLIDRYMRAHALSLS 900
P++ R N+IAVGGGGV+VAVTRFP EQKRP+GPLV+VGVRDGVLWLIDRYM AHALSL+
Sbjct: 841 PKEIERVNEIAVGGGGVSVAVTRFPAEQKRPVGPLVMVGVRDGVLWLIDRYMSAHALSLN 900
Query: 901 HPGIRCRCLAAYGDAVSAVKWATRLGREHHDDLAEFMLGMGYAAEALHLPGISKRLEFDL 960
HPGIRCRCLAAYGDAVSAVKWA+RLGREHHDDLA+FMLGMGYA EALHLPGISKRLEFDL
Sbjct: 901 HPGIRCRCLAAYGDAVSAVKWASRLGREHHDDLAQFMLGMGYATEALHLPGISKRLEFDL 960
Query: 961 AMQGNDLKRALQCLLTMSNSRDIGQETSGLDLTNLLNVAAKKENIVDAVQGIAKYAKQFL 1020
AMQ NDLKRALQCLLTMSNSRDIGQ+ +GL LT++LN+ AKKENIV+AVQG+ K+AK+FL
Sbjct: 961 AMQSNDLKRALQCLLTMSNSRDIGQDGTGLGLTDILNLTAKKENIVEAVQGVVKFAKEFL 1020
Query: 1021 ELIDAADATAQSEIAREALKRLAAAGSVKGSLQSHELRGLALRLANHGELTRLSTLVNNL 1080
ELIDAADATAQ++IAREALKRLAAAGSVKG+LQ HELRGLALRLANHGELTRLS+LVNNL
Sbjct: 1021 ELIDAADATAQADIAREALKRLAAAGSVKGALQGHELRGLALRLANHGELTRLSSLVNNL 1080
Query: 1081 VSLGLGREAAFSAAVLGDNALMEKAWHETGMLAEAVLHSHAHGRPTLKNLVESWNKMLQK 1140
+S+GLGREAAFSAAVLGDNALMEKAW +TGMLAE+VLH+ AHGRPTLKNLV++WNKMLQK
Sbjct: 1081 ISIGLGREAAFSAAVLGDNALMEKAWQDTGMLAESVLHAQAHGRPTLKNLVQAWNKMLQK 1140
Query: 1141 DLEHTRTTKTDATAAFLASLEDPKLPSLGDTGKKAPIEILPPGMSALSLSISAQKKPVPA 1200
++EH+ +TK DA AFLASLE+PKL SL + GKK PIEILPPGM +LS I++QKKP PA
Sbjct: 1141 EVEHSPSTKADAATAFLASLEEPKLTSLAEAGKKPPIEILPPGMPSLSAFITSQKKPTPA 1200
Query: 1201 TKGLQQEPGKPLLLGAPPTA--------APINGVTLPPKDSGDSSVPPPA-DSSQP---G 1260
T+ QQ+PG+PL + PP A PI P+++ SS P A SS P
Sbjct: 1201 TQSSQQQPGQPLQIEGPPPANSETITESTPITATETAPENTPQSSAPENAPQSSAPELET 1260
Query: 1261 ASPPADSSQPGVPTIAESSQSGAAPPAESILPGAAPPAAASQPG--PALSDSIQSGEV 1296
ASPP ++S+P +G+ G+ P A S P +DSI S E+
Sbjct: 1261 ASPPLEASEP----------NGSDDKTPISTSGSNPDLATSGDNIPPTSTDSITSTEI 1307
BLAST of Spo17192.1 vs. TAIR (Arabidopsis)
Match:
AT5G24710.1 (Transducin/WD40 repeat-like superfamily protein)
HSP 1 Score: 1916.4 bits (4963), Expect = 0.000e+0
Identity = 1004/1367 (73.45%), Postives = 1143/1367 (83.61%), Query Frame = 1
Query: 1 MLRLRAFRPTNEKIVKIQLHPTHPWLVTSDASDHVSVWNWEHRQVIYELKAGGVDERRLV 60
MLR RAFR TN KIVKIQ+HPTHPWLVT+D SDHVSVWNWEHRQVIYELKAGGVDERRLV
Sbjct: 1 MLRARAFRQTNGKIVKIQVHPTHPWLVTADDSDHVSVWNWEHRQVIYELKAGGVDERRLV 60
Query: 61 GAKLEKLAEGESESRGKPTEAIRGGSVKQVNFYDDDVRFWQLWRNRAAAAESPSAVSNVT 120
GAKLEKLAEGES+ + KPTEAIRGGSVKQV FYDDDVR+WQLWRNR+AAAESPSAV+++T
Sbjct: 61 GAKLEKLAEGESDYKAKPTEAIRGGSVKQVKFYDDDVRYWQLWRNRSAAAESPSAVNHLT 120
Query: 121 SVLSPLAPATKGRHFLVICCENKAIFLDLVTMRGRDVPKQELDNRSLMCMEFLYRSTAVE 180
S + AP+TKGRHFLVICCENKAIFLDLVTMRGRDVPKQELDN+SL+CMEFL RS+ +
Sbjct: 121 SAFTSPAPSTKGRHFLVICCENKAIFLDLVTMRGRDVPKQELDNKSLLCMEFLSRSSGGD 180
Query: 181 GPLVAFGGSDGVIRVLSMITWKLARRYTGGHKGAINCLMTFMASSGEALLVSGGSDGLLI 240
GPLVAFG +DGVIRVLSMITWKLARRYTGGHKG+I CLM FMASSGEALLVSGGSDGLL+
Sbjct: 181 GPLVAFGSTDGVIRVLSMITWKLARRYTGGHKGSIYCLMNFMASSGEALLVSGGSDGLLV 240
Query: 241 LWSADHGHDSRELVPKLSLKAHDGGVVAVELSRVSGGAPQLITIGADKTLAIWDTISFKE 300
LWSADHG DSRELVPKLSLKAHDGGVVAVELSRVSG APQLITIGADKTLAIWDT++FKE
Sbjct: 241 LWSADHGADSRELVPKLSLKAHDGGVVAVELSRVSGSAPQLITIGADKTLAIWDTMTFKE 300
Query: 301 MRRIKPVPKMSCHSVASWCHPRAPNLDILTCVKDSHIWAIEHPTYSVLTRPLCELSSLVP 360
+RRIKPVPK++CHSVASWCHPRAPNLDILTCVKDSHIW+IEHPTYS LTRPLCELSSLVP
Sbjct: 301 LRRIKPVPKLACHSVASWCHPRAPNLDILTCVKDSHIWSIEHPTYSALTRPLCELSSLVP 360
Query: 361 PHALAPSKKLRVYCMVAHPLQPHLVATGTNIGVIISEFDFRALPPVVALPSPPGSREHSA 420
P LA +KLRVYCMVAHPLQPHLVATGTN+G+I+SEFD RA+P LP+ PGSRE+SA
Sbjct: 361 PQVLATHRKLRVYCMVAHPLQPHLVATGTNVGIIVSEFDPRAIPSAAPLPALPGSRENSA 420
Query: 421 VYVVERELKLLTFQLSNTANPSLGSNGSLTESARSKVD-SEALQVKQIKKHISTPVPHDS 480
+Y++ RELKLL FQLSNTANPSLG+N +L+ES SK D E L VKQ KK I PVPHDS
Sbjct: 421 IYILGRELKLLNFQLSNTANPSLGNNSALSESGLSKGDPGEQLTVKQTKKQIVAPVPHDS 480
Query: 481 YSVLSLSSTGKYLSVVWPDIPYFSIYKVSDWSIVDSGSARLLAWDTCRDRFAILESAVVP 540
YSVLS+SS+GKY++VVWPDI YFSIYKVSDWSIVDSGSARLLAWDTCRDRFAILES +
Sbjct: 481 YSVLSVSSSGKYVAVVWPDILYFSIYKVSDWSIVDSGSARLLAWDTCRDRFAILESVLPH 540
Query: 541 RIPVIPKGGSSRKAKEAAAAAA-AAAAASAASSASVQVRIVLDDGTSNILMRSIGERSEP 600
R+P+IPKGGSSRKAKEAAAAAA AAAAASAASSASVQVRI+LDDGTSNILMRS+G RSEP
Sbjct: 541 RMPIIPKGGSSRKAKEAAAAAAQAAAAASAASSASVQVRILLDDGTSNILMRSVGGRSEP 600
Query: 601 VIGLHGGSLLGVAYRTSRRVSAVAATAISTIQSMPLSGFG-SSASSFNTFDDGVG---SG 660
VIGLHGG+LLG+ YRTSRR+S VAATAISTIQSMPLSGFG S+ SSF+++DDG S
Sbjct: 601 VIGLHGGALLGIGYRTSRRISPVAATAISTIQSMPLSGFGNSNVSSFSSYDDGFSSQKSA 660
Query: 661 PLSTQNFQLYSWETFQPVGSLLPQPEWTAWDQTAEYCAFAYQKYIVISSLRPQYRYLGDV 720
+ N+QLYSWE F+PVG +LPQPEWTAWDQT EYCAFAYQ+Y+VISSLRPQYRYLGDV
Sbjct: 661 ESAPLNYQLYSWENFEPVGGMLPQPEWTAWDQTVEYCAFAYQQYMVISSLRPQYRYLGDV 720
Query: 721 AIPYATSAVWHRRQLFVATPTTIEVVFVDAGIAPIDLETKRMKEELRLKEVQARAVAEHG 780
AI +AT AVWHRRQLFVATPTTIE VFVDAG++ ID+ET++MKEE++LKE QARAVAEHG
Sbjct: 721 AIAHATGAVWHRRQLFVATPTTIECVFVDAGVSEIDIETRKMKEEMKLKEAQARAVAEHG 780
Query: 781 ELALISVDGPKADTNERVSFRPPMLQVVRLASFQHAPSVPPYLTLPKLSKIDGDDSGMPE 840
ELALI+V+G +A ER+S RPPMLQVVRLASFQ+APSVPP+L+LP+ S+ D DD + +
Sbjct: 781 ELALITVEGSQAAKQERISLRPPMLQVVRLASFQNAPSVPPFLSLPRQSRGDSDD--IMD 840
Query: 841 DRRANDIAVGGGGVAVAVTRFPMEQKRPIGPLVVVGVRDGVLWLIDRYMRAHALSLSHPG 900
+RR N++AVGGGGV+VAVTRFP+EQKRP+GPLVV GVRDGVLWLIDRYM AHA+SL+HPG
Sbjct: 841 ERRVNEVAVGGGGVSVAVTRFPVEQKRPVGPLVVAGVRDGVLWLIDRYMCAHAISLNHPG 900
Query: 901 IRCRCLAAYGDAVSAVKWATRLGREHHDDLAEFMLGMGYAAEALHLPGISKRLEFDLAMQ 960
IRCRCLAAYGDAVSAVKWA+RLGREHHDDLA+FMLGMGYA EALHLPGISKRLEFDLAMQ
Sbjct: 901 IRCRCLAAYGDAVSAVKWASRLGREHHDDLAQFMLGMGYATEALHLPGISKRLEFDLAMQ 960
Query: 961 GNDLKRALQCLLTMSNSRDIGQETSGLDLTNLLNV-AAKKENIVDAVQGIAKYAKQFLEL 1020
NDLKRAL CLLTMSNS+DIGQ+ GLDL+++L++ A KKE++V+AV+GI K+AK+FL+L
Sbjct: 961 SNDLKRALHCLLTMSNSKDIGQDGVGLDLSDILSLTATKKEDVVEAVEGIVKFAKEFLDL 1020
Query: 1021 IDAADATAQSEIAREALKRLAAAGSVKGSLQSHELRGLALRLANHGELTRLSTLVNNLVS 1080
IDAADAT ++IAREALKRLA AGSVKG+LQ HELRGL+LRLANHGELTRLS LVNNL+S
Sbjct: 1021 IDAADATGHADIAREALKRLATAGSVKGALQGHELRGLSLRLANHGELTRLSGLVNNLIS 1080
Query: 1081 LGLGREAAFSAAVLGDNALMEKAWHETGMLAEAVLHSHAHGRPTLKNLVESWNKMLQKDL 1140
+GLGRE+AFSAAVLGDNALMEKAW +TGMLAEAVLH+HAHGRPTLKNLV++WNK LQK++
Sbjct: 1081 IGLGRESAFSAAVLGDNALMEKAWQDTGMLAEAVLHAHAHGRPTLKNLVQAWNKTLQKEV 1140
Query: 1141 EHTRTTKTDATAAFLASLEDPKLPSLGDTGKKAPIEILPPGMSALSLSISAQKKPVPATK 1200
E ++KTDA +AFLASLEDPKL SL D +K PIEILPPGMS++ SI+A KKP+ K
Sbjct: 1141 EKAPSSKTDAASAFLASLEDPKLTSLSDASRKPPIEILPPGMSSIFASITAPKKPLLTQK 1200
Query: 1201 GLQQEPGKPLLLGAPPTAAPINGVTLPPKDSGDSSVPPPADSSQPGASPPADSSQPGVPT 1260
Q E KPL L P I PP S P S P + A+S P
Sbjct: 1201 TAQPEVAKPLALEEPTKPLAIEA---PP------SSEAPQTESAPETAAAAESPAPETAA 1260
Query: 1261 IAESSQSGAAPPAESILPGAAPPAAASQPGPALSDSIQSGEVGAASAEVSDQPQPGAPSL 1320
+AES G A AE+ A+ AAA GP + V + ++ P
Sbjct: 1261 VAESPAPGTAAVAEA---PASETAAAPVDGPVTETVSEPPPVEKEETSLEEKSDPS---- 1320
Query: 1321 ADLSDPGSSTHVDSGQLEASKPTDFSQSVALP-PVESNQPEASQPTS 1360
S P + T + + T S + A P P+ + PE T+
Sbjct: 1321 ---STPNTETATSTENTSQTTTTPESVTTAPPEPITTAPPETVTTTA 1346
BLAST of Spo17192.1 vs. TAIR (Arabidopsis)
Match:
AT3G45050.2 (unknown protein)
HSP 1 Score: 141.0 bits (354), Expect = 6.600e-33
Identity = 69/125 (55.20%), Postives = 91/125 (72.80%), Query Frame = 1
Query: 1498 HDHRRKFFRISSALPETIMSVTLATAVVGAAATILVKRT-KESEKSQTPVKMCEDCGGSG 1557
H ++ + AL ET +S+ +A VVG AATILV+R K SE+++ +K CE C GSG
Sbjct: 34 HQRAKRSSTVVPALAETAVSIAIAATVVGTAATILVRRNNKASEEAEASMKECEACLGSG 93
Query: 1558 ICSECKGEGFVLQKLSEERAARARMISKDAATRYTSGLPRKWSYCTRCSSARNCRTCNGS 1617
IC ECKGEGFVL+KLS+ A +AR+ +K+ ATRYT+GLP+KWSYCT+CSS R+C C GS
Sbjct: 94 ICPECKGEGFVLKKLSDANAEKARLAAKNMATRYTAGLPKKWSYCTKCSSTRSCMICGGS 153
Query: 1618 GNLSL 1622
G S+
Sbjct: 154 GKTSI 158
The following BLAST results are available for this feature: