Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: polypeptide, exon, CDS, three_prime_UTR
ATGAGGAATTTGATGGCGGAGCCTTCTCCTAGTACTGCTCGGGTAAGCATTTTAAACACAGAATTCCTATCCTTTGCATGCAATGTGACCATTGTTATAGTGCTTAGTTTGACTCATTTCTGATTATATAAAAAGTTGGTTTAGGTTTTAAGCGGAAATAATCTAAAGTGTAGAAACATGAATATGTAGGTGTTTGGGTTCACTTGTTGTAGCTGCAGTATTTCTAGTTTAGTTGTAGATAAGAAATTGAAACGACTTTTCAAAGCTCCTATTGTTTGCCAGACAATCCAATATTTGTTACTTAGCCCACCTGCAAAAGGCATGCAAAGTATACCAGAAATGTTAATTCTTGAAGTGTATGAAGCTATAACTTACATATCCGTGCCTTCTCGGCATAAATTACATATCTCAAGCAATGTATTATGTGTAAGGGTACAAACTGGAGATTTACAACAAGGAAAACCGTTAGAGAGGTTAGGGATTGAGAGAGATCCACATGTATGTTTATAGAGAGGCCTTAACCTGCTAGTCAAAAACCATTAGAGAAGTTGTTTATGGAAGACCCTCATATAGTAATTTTTATATGAAAGTAGGCTTATTTTGCAAAAGTTTGAACAAATTTTTTTCTTGAAATAGGAGTGGATATGAAAGCATAGTATGACAGTCAAAAGTAACAATAATGGGCGAAAATATTTTGGAGACCATCTTCCATTCTATGTGCATTTATCGGCAACGTGCTGACTTAAAAGCCGCATGTTTGTGCAAAATTTTTACTGTATTGTTTTTATCTTTTCGTAATTCCATTTGTATGTATTATACTATTATGTAAATGAGTGACTGGGTTGTCCTGGAGTTTGCTTATATAGAATAGAATGCCAGACAAGGTTTCAAAACAGTTAAGCAATACGTTCATAGTTTTGATTCTCCCGTATGGTTTCTTTTGTGGAAGGGGTTGATAAAAAATCTAGAGACAACAAGAAGGTTGGTCTTATGGAATGTTACCAGAAAGGAATTTGTGGGATAAGTTGCTTGCATAAGTTTCTCTGGGCTAAGATACAGTAGTATTCCTTCATTATATCTTTCTAATTGGTGCTGAAGATGTCTTAACTCTGAATCTGCTTGGATGACTGGACTGTAGAGTAGTCAACTGTTATCCTTCTGCTGTTTAGATGAAGAATGCTTTGATGATAATTAGCTTCTTGTTTTTGCTTATGTTGCGGGTTTTTTTTTTGTTGCTGGGTGTCCTGTTTCGGCTTTTTGTTTGACCTATGTTTATCGCATCTTACCTCTCTGGTATCTACTATAGAACTGTCTATGTCTATCTTCCTTACTGTAGTTACGGTCGTATGTTATTCTGGGCCTCTGGATGTTGTGTATTTCAGTGGTTCTCTTTCTTCTTCTCCCTTATCCATGATGTTTCCATCTTTGACAGGCTATTGACTTGATCACAGAAGTAAAAGAGTTGCAGAGATTTAACTCTCAGGAGCTTAGTAAGCTGTTAAAGGACTCTGAAAACTTCTCCCTCTTCTACATTACAGCAAAGTCTTTGCAGGTGGGTTGACTATGTTATTTCTATTTACTCATCATATATCATAATTACTATTCTTCCTTGGTTTTATATTACAGAGCATTTGAAATTGTAGAGTACAAATTAAGGAGCCTCCTATGCCCTTATATTAAACTCTACTGTATAATAAAAAAGAAAAAGTAGGACAACATTGATATGTAACACATTGGTTTTGTGAAGTGACTTACATTAAGAAAGAAATATCAAAAGGTTATATGCCCTTCATTTTGAAATGGAAGGAAGTTATTTTTTTTTTGTTACTTTTTTGATTGCTTTTGTGGATGAGAATATTGAAAGCAGTTGTGTCACAGATTGACATGGAAAAGCTTGTTCGACTACTTCCTTTGCACCTTACTGCCGTGCTTTTGTCCTCTCAAAGAGATGAAGCCTCATTGAGATATCTATTATCTGGACTCAGGCTTCTATATTCTTT
ATGTGAAATATCTTCTCGCCACCCTAAGCTTGAACAGGTCAGTGACTTGTTTGGACTATATGTCCTTTTCAAACTATACATTCCTTTTGTCCCATGTAAATGTTCCAATTTTGAGAGGTGCCAACTAATGTTCCCAATTGTTATTCAGTGTACAATTTCTTCATCATTTTGGCCTTTTGGTCTAGTGGCTTATGTATTCTCAAATTATTATTAAATGTGTTTAGATTAACTACTTCGACACTATTTTTGTCAAAAAAAGTTTATGCACATAATTGATGATCTTTCTTAATACATGTTCTACAGTAATTAGGGATATTTATCTGGGATGGAGGGAGTACTACATAGTTATTTTTGTGGGAAGGCTGTATTTGTTTGCATGCATTTGGAGTCCTTATTGACGTTGCAACTGTAGTTTCAACAAATTGTAAAGGCTTAAGGATGTACTCCGTAGCCGTATAAGAGCTGAAGTGTAAAAAGAATGTTTTAATGCTTCAAGATGTTTGATGTTTACAAATTGTAGTACTCCCTCCGTATTTTAAAAAGAGATACACTTTCTTTAAACGACGATATTTTAAAAAGAGATACACTTTCCTTTTTTGACATGTCTTGGTCCCCACTTCCAATTATATTAATATTTCTCTCTTTCTTATAGTCCCCACACTTACTCACATCATCTTTTTACTATAATAAACAATTCACCCACTACCCCACTTTCATCTTATTTTAATAAATTCAACTCACTCTCCTAAAACTATGTGCCGGTCAAAATGTATCTCTTTTTAAAATGCGCGGGGAGTAACTTTTTTGGGTCATTTTAGTTCATATAAGGAGCGAGTTGTCACATAGCAGAGAATTTTTTTTTACCTTTCGTTATTCTCATTATAATGCAACTGAAGCCTTTAAGGTTTGTGATGCAAATATTGTCCCCTTATCCCCCTTTAGAATTGCTTTCAAATAATATCTCATTTTCTACAATACCTTTACCACACACTAAATGTTATAATGATCGAAAATGAATCACTACTGTTTAACTAAAAAAATATAAAGTTTGGACTTTAGTTACTTAGTAAGTTATTGTTTGTCATTGGGGCATATGGGTTTGAGGATCATTGTATACTTTTCTAATATGTGAATTCCCATTTTGTTGATGGATAAGAAACCCCATGTTAAAAAAAAGGATGAAATACCCAAGTTCTGTGTCTTCACTGTTATTATCAATGCCAATATGCCATAATGTTGTGGGAGGGGGTCAATTTTAACACGTAGAAGTTCCCGTAAAATTTGTTCTGTTGTGAAGGTCACCAAACTTTTTATGAAAGTTGCTGTTATATTCTTAAGCTATATATTGCTTAGTCTTGGCTGGGAAGATCACTATACTCAAATATGTGATTTTCCTTTTCCTTCAGAGTTAAGAAACCTTTGCCAAGCCCTGTGCCCTATGCGAATATTACCATGTCATGTAGGATAAACTATTGGAAATAGAGGGGTGCTCATGTCAAGCTTATGGGCAAGGAAACACGTTTTATATAATAGTATACATACGCAGTATCTTTATGTCGTATTACTCATATTGTCTAATCCAATTTTTAAAGTAATATGTAAGATTTACTTAAAACAGTACCCGATGTTCATATATGATGTATTTATATATGCAGATGGTTGAACCTAATATAGCTACTGTTACAAAGTAGCAGTTTAGAAAATAATTTCTACTTTTACGAGCAGCATTTTGGATATATAATGTAAATCTGCTAGCTGTTATTTCCAAAAAAAAGGAATTGGGATATCTATGGTCCAGGAGTTAACCTAACGTTGGAGACCTAACCTTTTGCACCTTCCACTTTTGGGACTTCAACCAAAGAAGATATCTGCTTGCTTAACCATGATTCATCAATAACTTTTCAAACTTTGAGTTAAGCATAACAACTAGTGAACTGCAACCTAACCTTTTGCAATTTTCACTATTCGGGACATCACCTTTTTTTTTTCCCACCTTTTGA
CACCTTACCCGTGATATTTCACATGCTAGTTGAATTTTTTTGGACGATCTGATAAACGTAGTTAAATCGGTGGGCCATTTACTATAAAGTAGGAAGATCAATTTTATTTTTTCCCGTCTTATGCTTTGTTCTCTTCACCTAGCCTAGTTTGATTTTTCTAGTTTCTGGTCGTCTGAATGTTTTCTGAGAGTGCTTTTTGCTTTTTCTAGTCTGGTCTGATTTAGTAAGCAATAAGAATAAGGTGAAGAGAACATAGCTTATATCTCCTCAATCCACCTCCATTAACCCAATCCTTGACATCAAGTTTTCACGACTTCACACTGTTGCATCACTTTCCCTCTCGTCTATCATCGACTTCAATCTATTGTTAGTTCTCTAATCCCCTTCTCCAATTTCTTATAAAATTGGGATATTTTTGCCAATAATTTTTTTGATTTGATCTTCTCATTGTCCAGTGTAAAACCACCATTTTCCGAATTCAATGTCATGTTGTATATCACCTCCATCTATATCTTCCGCATTGGCTTTATCTCGGGCAAATCAGTGTTGAATCAGGTGAACCGGCCAAAATGTGTTCTGTTTTTAAGCCTTTACAAGCCGGGGAATATCGCAATGATAAACTTGAAAAGCGTTACAACTCTCCACACCCTGATGGTTATTTTACACCATGGACTGGAATGTAATATGAGAAAATTACGCATGAGTGTCTCATGTATAGAGATCCTCTGCCTTCTTTTAAGTTTGCCATTTCTAATTTTGTACTCTGTATGCTCCCATCCCTTCCCAATTGTCCTATCCCAACTGTACTCGATCTTTTCTCGGTGTTAAGTGGTCACAAACTCGCTTGATGGTAGCTTGGTAGAACCATGAGCTTTGTGCTTATAGTTTCGGTTGATTCACAAAGATTGAAGAGAAAGGAAAGAATGTATCAAGATTTCAAGAGTTTGATGGCTTTTCAGGGATGCTAGTACTGCTAGGTGGTTAGCTGGCGTTGATTCCATAAGTTTTCTTTTGATTTTCCACCATTTATCTTTCTGAATTTCAAGGAAAGAGAATATAAATGCAAGGGGGAGTCTGGCATCTTGGTTTTTCCACAAGGAAAAAAAGAGTGGAGAAAGTGGAAAAGATTTTAACTAGGAAAGCATTATGCTATCTTGCTTTTTCCTAGCCAAAGGATCAGACTCCTGTCAATCATTTTATATATAATAACTGTGAGTTGTTAAAACCTGTGTGGTATTTAGATGGTAATAGTTTCGGGTTTGCAGCATTTCCATTGCAGCTTATATGTGAACTCGTACTGTTCTCTATTTCCTCTATCTGTTTCAGTTGGGGTCAAGTTCTAATGTAGCTTTCTCCAAAAGTGGCATATAAACTTATGAACAGGAATGCACACTCATCCTTGTTTATGTTAACAGTTCTTTTATCTTACAGATTTTGCTCGACGATCTGAAAGTCTCAGAACAGCTGCTTGATTTGGTGTTCTATTTACTTATCCTCTGCTGTAAACAGGTTCTCCATCTTTATTAGCTTGCTTAATTATGTTTACGAAAATTCGATATCATGGCCATATAACTGCAACAAAACTCTTAGCCGTGACATTTTATCTTTTCAAAATTCTGGTTTTGGATTATGAGCGAGTATAACCTGCGGTCTACATTATTTTGAAGTTATTAAGAATTTGGTGGTCCCATTCCAACAGAAAAATAATTTTTAAAAAAGAACTGAATTAAAAAAGAAAGAAAAATCTAAGCCGTTAGCCATCTTTCTTTCCATGCCAGCCCCCTTCCCTCTTGTCGCTCCTAATACTAGTTTGTGCGCTGTGGTGATGCCAAAGGGTACTTACAAGCAAGTGGCAACTCCTTCTGATTGATATTTTCCGTTCCTAGCAAAAAGCCTTCTGTCTCTAGCCATGCTTGTCAATGACTTAAACCACCCTGAAGCCACTGTCATGGGCCTGGTGTGTTAGATGTGATAAAGAGTGAAATAAAAAGAAAGAGGTG
GGACATGGAAGATCTAACCAGTGTTAGTGTTAAGCTCTGTTTGGTTCAACTAAATTCAACTGGAATTAACTTATCAAACTTAATTGAACTTATCTTTGTTGAAAAAAAACTTATTTATGTGTGTAAATTGTCCTAAAAACCTATTTTTGCTGAACTGATCTTATCTGAACTTGTTTTTGCTGACATAAATCAAAATAAGTTAAATATGTTGAAACAAACAGACCCTTACACAATTTGAGAGTTATCCTCGCATCATATGTGGACTTTTAAACACGCAGGTTGATTCTAAATTGTGTCTCGATGATATGGTGGTTGCGGATGATATATTTGGGGTTGACAATAATAGAGTTATTGATTATGATATATTAGGAGACAAGGAATGTATGGGGAATAGATTTGATGAGGGAGATAGGAGAGGAAGATGGTACGGACCATTGTGGATTATATATTTGCCTCGGAGTTGGCAATAATGGAATTATTGATCATGAGAGTTATACTACATGGGGTATTAGCCATCAGATATATGGGGTTTCTTATTTGTAAAAAAGCACTGATGGAAATGAAATATAAAAGGCTGAAATTCTAAAATGTTTCGAAGCAAGCTAAAATTTGGAAAGCTTGCTCACCATACACCTCTTTGATGCTAATCTAGCTAATTTGACCAAAGTCTGAGTTATCAACCAGCTCAAATGGTTGTCAAACACGAGCATTAATGAACATTCATGAGTTTCGTTGAATACTACTTAATTGATTATTCAACCTAATAACTTAACATTCGGTGAGGGTAAGTGATTGCTATTCTTATCTTTGTTCCACGAACCAGTGAACCACAATGTTCCGAATTATCACCTGGACTTAAGACAAAGGGACATTTTGTTTGGTTGTCAATGTTGTCACATTTACAGGTCAACATGTTTCTTTACTGAAATGACATATTCCTGCCTGAACATTAACTACAAAAAGAACTTCCATGTACAAATTTTGGACCAAAGACACCTTGTGAATGACATAAAGCTAAGGGGTAAACATTAGGCAAAAGGCACTATTGCCGAAATCAACCCATCAAGTGCTAAGACAACTTCCTCCATTCAAAAAGAAATTCCATTCCATCCATCTTTATGAAAATCTCCTTTTCCCCTTCACTCTCACTTATCTATCTCCCGACTTGCTGACTTTGAGCCATCACTGCCCCCATTGATCTCCTTCTCCGCAACTATAAATTTATGCTATTGTGGCCCTTACTTGCCCACCTTAGTGCTGATGGGTTCTCTCCCTGCCAAGAGGTTGCCAATTTTCATGGAGTGTTAGATCCTGGCCATTTGGTGGTAAATTCGACCATCAAGAGCTCTGTTTTGGAGCCTTGTGGTGGATTTCCACTACCTACTGCCCATGAATAATGGTGGTAAGGATGAAATCTTATGATGGTTAGGTTGAGTTTCATCCAATGAAGGATAATGTTTTTCCCCCAAATCTAATTTTTTTCCATCTTCCCATTGTTTTAGGCTATGGCTTTGGGCTACAGAATGGGGATTATTGGTTGAACTGGCTGTCTGGCTGTGGAAGGGGGGGGGGGGGGGNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNGGGGGGGGGGGGGGG
GGGGGGGGGGGGGGGGGGAATAGCTAGTGGATTTTATTAGGAGTAGAATATGAAAAATGTGAAATAATACGATATGAATTGAATGGTTAGGATGGTGATGTGGCCCCCTTCCCTCCAACGCCCCTCTGTATGTATGTTATTATTTATTTTAGTACCTATTGATTCAGTGATAGAGCCTATCAACCAATTAGTGCAATGCTCCATAAGCATAACCAAAGATTTCTTGAAAAAAGAGACCACTGAATAGTTGTTTACAAGGGCACAATTTTGAATTCTTTTATTTGTTTATTAGTAAGATATTTGATGATACAACTGATATGTGAAACTGCTTCATCTCTTGCATGATCTAGGGATACAGTTTTTCTGGTGCTATGCCACTTCTTTACTCAGCTCTTGTAGCTTGCAGTATGTATCTACTGACCGCTTGCATTTCATCACAGTGGAGTGAACTCACTTATGTTCTCCAGGCACACCCTAAGGTATATTATTATAATTTGTCAATTCTGTTTTTTTTTACTTTTAATCTGATAATTTACCCTAGCTGTTTTAACTTTTATGTGCTTGTTCTGGTATCAGTTTTTTTTGCTTTTCTTTCTTATTCCGTTCCTCTGGTAAATTTACTGATTATCTGCAATTCTGTACCATCTCTTCTCCCTTTACGTCAATGCCACCCTTGGTGTGTGGAGAAGGAAAGTATACCAATCACAATCTTGGCATCTCGGAGTATATATGAACTGCTAATCAAATTTCAGACCTAGAGCATGTTCTGAAGCTAGTGTTTACAGTTTTGCATCTACATATGAATTGCTGAATAAGTCTCTCTTTGACAGGTTGACATATTTTTAGATGCAGCCTTTGGAGCTGTCCGCATGAATATCAACTTCCTTCAGATCAAGCTGTCAGCAGAGGAAAATGAGTTCACAAATTCTAGTCTAACAGCTGAAAAAGCAGTCCACTTTTGTTTCCAGCAATGTGAGGCTTCTTTGCAGTTCCTCCATTCATTATGCCAAAATAAGTCTTTTCGAGAACGCCTTCTCAAGAATAAGGTTTCCACTATTTCCTTTAAAAAAATTACCTTTTCTTCTTCTTTTATTATCTTTTGTTTTCTTCTAAATATGATTTGTGGCTGGCTTTTAAGATGAGAGAAGTAGACTGCTGATAAGCAAAACCACATTAGCAAGACTGCAAGATCCATATTAGGATTTCCACACAAGAGACAAGGGAGCTTGACAGCTGCTTTACTTTATATATTCATTTGAAAGGGGCTATTGCATTCTGTTCACAATGCATCTTATCATGCTTTTAGATCTCATTAAGTTCCTCGTTTTTATGATTTATATGATAGTCCAGTCCAGCTATTAAAGAATAGTAAGAGGGGAAAAATATGGTGGTCAGGTCTTATGAAGAAAACCTTTCAGTGCAATGGATAAAACGATAGCTGAAGAGGTGCTACAAAGTCTAATAATGGCGATAGTGGATTATTCAATATGTCAGTTGTATAATCCAACTATGAGAATTTGTACGGTTTGTAGAAAAGGGAAGGAGGAAATTTGGACAGGTGGTTGTGTTGCGGCTTGCTGAATGTGTGAGTTCTAGAATTGTCCTTATCAGGCTGGTTAGATTTGAGAAGCAAAGGAAGTTTCGATGAGCTCAAATCAAACTCCAGAGGAAGATTTGTTGTGCTTGGCTTCTCCTGGAAGGGTAAACCAGCCACCATTTTCTTTCTAAGAAGGTTGTCAGGGGCTCCTTCTAGCCATCAGGGTCGACTAAAACAAAGTGGTCAGGAGGATGACAAGGCTTGTGGTACCTCTTCCAAGATGTTTGCCTGTTTAGTGAATCCGAGTCTGGTCGTTGGAGGATCAAGGGAGAAAGGCTGAGTTTTGAGGAGCTTAAAGTTGATGAGATGCATGTAAAAATCACATAAATGGACTTACATTACAGTAATACATTTTACATAGAAGTCCCTGGTCAGATGTCCACTGATCCTCTTTTAAGAGGTT
GCGGGGGAGGGATTTTCTGGGTTATACTGGTAACAATGTAGTGGAACCCTGTTGGTAGATTTGAAGGATAAATATCCTGCTGAAGCATTGTTGGGAGACTAGAGATACATTTTTTTCTTCTTTATGAGCCCTCAAGTTCTCTCCAGATAGATTGTTTGGGATAACGGAGATTCCATTGGTGTTTTCACTTCCTATCACGTGTCATGTGCAGGTTATCTATGTGTGCTTTGGGGGAATAAGGGAAAGAAGTTCTATTACTATTATGAAAATTGATATGGGGAGTCAAGGAAACCTGGGAGATTTGGCCGATGGATCTTCTTGTTATGAATTTTTTAAAAAGAAAGGAGATAAGAGGATAGCTTCGCTACTGATTGCCGTAAAATCTAGAAGGACCGCACATCAGCGATTACCGTTATACCAAAATACTCCTGTTCCTTAAGTGCCTTTTGGTCTTGAGAAATCTGTAATCCTCTTTATTATTTGTTGTTTCATTTGGAAGTGATTGTTTTGGAGATCATTGTTATCTTATTTTAGTTTTTGGACCACTGTCTTTTTTATTTTTCTTACAAATTTTTTTGTTTTCTTATATATCTAGGAATTATGTGGGAAAGGAGGAGTCCTTCATCTTGTTCAAGCCACCTTGAAGTTGAATATCTCGCCATATCTGAAGGAGCCACTTGAAATGATAGCTGCTATTTCTAGACTGAAGGCTCGAGTTTTATCAATTGTAAGTCCAAATCTTGTAAATTTCTGCATGTGGTCCTCTCGATTTGGACTCATTGTTGCAGTTCATAGTATCTATGATTTGGCTTAGTAGCTTAATTGTATTATCAAGTTTTCAACTAAGAAACAAATCAAATTATGGCAGAAAATCGTAACAGTGAACTTCAGAACTGAGGGATTTCATTTGATTGCAATTGTTAAGTGGGCTACTAATTGACTTAAGAAGTATTCGATTTTACAGCTGCTGCATCTTTGTGAATCAGAAAGTGTGTCTTATCTGGATGAGGTAGCCAGCATACCAGAAAGCCTGGATCTGGCAAAGTCTGTTGCAGTAGAGGTTAGTATTACTTGAACGGTTCTGTTAGATGTATCAAGAGGAAAAAACATGTTATTCATCTATGTTACTCCTTATAAAAGATTGTCATCATACGGGTGTTATGTTCGTTGTTGTCGTGTCGACAACACACCTTTCTCTCTCAAAAATGAAAAAGAATGTTAATTGTTGTGATGTTTAGTTGTTAGATTCAACTGCCTCCCCTTCCTTGAGGGCATTGTCCTACTGCTTCTTGAGGGGTCTTTCTTTTTCGTTTGAAAGGACGACAGCCAAAAAAAGCAGCTGAAGATGTGATATGCTGCAATTTTTCACTGTCTATGGTCATGTGGCCCTTTTTTCTGTACTTATGGTTATGTTTTCTTCAACTTAACAGAATTCTAGTATTACCTTTTTCTCCTTCACTATTAATTATAGAACTTTTTTTTTTGAAAGGAATTATAGAACATTAACTTGGAGTTAAAATTGTTCACCTAATCGACAATCTAAAATATACGGAAACACACTTCATATTTAAAAAAAAAATACACATACTATGCTAAATTAATTAATTTGAACTTATTGTTTACACAACTTTGGATTCTATGCTATCGCACATGTCATTCATCTCACATTTCCAATATCAAATCTCAAAATTTAAGTTTATTTTGTTTAATTAGACAAATTATTTATCATTTTTAAAGTATACTGTATAAATAATAATAAATTAATTTAGTAGCATCCGTGCATTACGCAGGTCTTAAATTACCCTCTTCATAAAAAGTAGGAGATATAACCAAACAAAGTTTTTTTTCCTATTCATTTCTGAAGTTGAATTCCTCATTCAAGAATTTTTTTTTATTCAATCTGGTTGTCCCTTCAGAACCAAGGATCTACAGATTACGTGTACTAATGATAACTGTGATGATCTAGGTTATTACATTACTGAAGACCATGGTCAGCAGAG
ATCTTAAACTTCTGGCGAATCACTCGGGAAAAACATATCCAAGTGGGCTTTTGCAGCTCAATGCATTGCGCCTGACAGATATCTTCTCTGATGATTCAAATTTTAGATCGTACATTACAATATACTTTGTAAGCTCTCTTGTTCATTGCCTTATTGTTTTATCATATTATTTAATATTGCACTTATCACATGGTTTCTTCTCAACTCTTAGTTGATTTTGCTGATACTTGACATGTCATTACACAGTTATTTTAGGAATATAAAGGCCAAATTTTCCACAATGCATGTTTCACACCTATATACCTTCATGTTTCCTTGTCCTGTAAATCACTTTGTATAGATATATTCCTGTCGTGCCTATTTTTCTCTATTACCTTCATACACTGTCATGTTTTTTTCTTTCAATGCGCCCTGATTATGGTATTGGTTTTCCGTTTTTTTTTTCCCCAGGCTGAAGTGTTGGCAGCAATCTTGTCAACTCCTTACCAACAGTTTTTGGCTAGTTGGTGTTCATCAGAATTACCTCTGAAGGAAGAGGATGCTTCATTAGAATATGAGCCTTTTACTATGGCTGGATTGATTTTGGACTCAACTTCAAGTCTAGATACAATAAGATCCTTTGAATCCAACTTTATTCCAACCAACAATGCGCAATCATCTTATGCCCATCAACGAACTTCTCTATTGGTTAAAGTGATATCCAATCTGCATTGTTTTGTGCCTAAGATTTGCAAAGGTTAATTTCCTAGACCAAAATCTTGTCGTATTTGCATCAAGGATTCATCTGAGAAATCATCATCTATTCATTGTTTTATGTAGTTAACAATCTTGATTTCACCCTAAACTGCAGAGCAGGAGAGGAACTTTTTTCTTCACAAGTTTCTTGAACGTCTGAAAATGGACTGGCAAGACCCAAAGGATGGATTTGCGTTTAGTTCTGCACCTCAAAAAGCTGCCACTATTTGCAAGAATCTTCGTAAGTATAACTTCTTGACCCCCTCTTTCAGTTTGTGGAATAAAAAATTATACCTTATTCATGAATTTCTTGTTTGTGTAGGTTCGTTGTTGGGTCATGCAGAATCTTTAATTCCGACTTCTCTTAACGAGGAAGATGTACAGCTTTTAAGGTAATTTTGCATCTTCTCTGTCACTGTTGAATTCTGATTATTCTTGCAAGAATCAAGCTCATGTTTATCCGGACTTAAAAATCTAGGGTTAGGGGGGTTGGTCGTGATGATCAATGGCAAGATGTTTATGAGGGTATCCCACAATCCATCTTAATCCTACGGATTTTTCTCTTGAACATGGTGATTCCATTCTTAGTGTTGTTGCATGTAAGAGTGCTTTCATCATGATGAACAGCAAAAATGAGGTAACATATTGCAAAGGGTAGCCTCGGTGCTTTTTCTGAAGTATTTGCATTACCTTCGTTGAAATTTTAGAGAAATACTATGTGTTGTTCAAGAAAGAAAGGAAAACCAATGTTTCTACCTCTATTTTACAGTTCTACCTTTTCATCTTAAGTTCTGAACGCTATCTTTCTCACCTGTTTTAGACCCCAACAACCCATTTGATGACTTCTGAAAGGTTAGGAATGGAGGTTTTTCAGTTCTAATGGTAGGTAGAGATCATCAGACATAAAGGTCTGAATAACCATTGTGTAAAGTCGCCCTTGCTGATCCATAGGCCTAGGGTTCCCATGAGGTTCAGATGAGGTGTACAAAGACAAATTAAGTGCCACTTTTCCGTGTTGTCTTTTTTTTGTATACCGAATTTGTGTGTCATACTGTCTCTCTGCTGTATTTATTTCCTTGCTAGGGGCCTGTGTCTATCCTTCCTCCTCTCTGTCTGTAGCCTCTCTAATTAACTCTTTCCCGTCTTTCTCTAATTCAACTTTGACCTTGCCTATAATCAGGTTCTAAAACAACGAAAAACAGATTGCTCACCCTCCATCCCTTCCTCTTTCCTCTGAGTCTACTCATCTGAAGTCTGAAGATACT
TTTAGTCCTGTATCCCTTCTGCCCTTCCTATCCTTGCCTCTTTTCCCTTAACCCTCAACTTCTTCACATGAGCTTCTTGTGTCGTTAAGGCACAAGTCTATGCCTTGCACTCAATGGCCTTTTTACCTCAGCATGACTCCAGTTTGTTAAGACTTAAGACCCTTTTTCTTTCTTTTTTCTACAAGTAATATTTTGGTTATACATTAGTTTAAGAAGGCTGATAAATTGTAAGCCACCAAAGGAGGCGTAGATCAAGGAATTTCTATGAACTTGTGATATTTTCCCTTCAGTCACAAGTTTATTTAGCTATTTTCATTGATAAAAGGCATAAAGGAACAACTTCATGATGCTTGTTTGAAATTATGGATGATTTTTCATTACTATATTGCCAACAGTTGAGAAATTTGGTGTGAACCTAGAGTACCGAAAATGGCCTTACTCTTTATGTACTTTTTAAAATTTAATATAAATATATCAGAATTGTTGCACATGTTATTTGAAATGTGAAGCATGCATCGCTTGCATTTAAATATAGTTTTGAAATGTGAAGCATGCATCGCTTGCATTTAAATATAGTAGAGACAATGTGAACAAAATATAGGAGTTAGAAGAACAAAAAAATCGGTTCTAGCGTAAGAAACTGTGACAGAACTTGGTTGAAGTCCAACCAGAAACTAATGTGCAAAATGTGCCAGGACTCAAAGCAAAAAGACTAATGATCCTTTTGGCTGGAATTGGATGGAAATTGCTGTATATGAAAGTGAAAGCACCTTAACAGTGTAGCTACTTTTATTTGGTGGTTAATGTTTAGTGCTTACTGGTCAATACTCAGAAGACTATGGCCTGGGGAAAAAAAATCTTCCATTTTGCAGAATGAACCCAAGTCTAAACTCTAGATGCATCTTACTTTAAGACCTGGTTCTTCAGGGTTAACCATTGTATAGTGCACATGTGTTTGCTCCTTGTGCTTCTATTTTCTTAAATCATTCCTCCTGATCTATATCCAAAATTCCAAACAAGTCTTCAACCTTAATCTGCTATGTTTTATTTTTTTCACCATGCTTTATCCTGGCTCATCTATGTTCCAATATGCCAATGTGGAGATGAATATTCTAAACGATGCTTTATTTTCTTCACCATGCTTTATTTTTTTGTTTCTGTTGCAGGCTTTTCTTTACCCAACTGCAGTCACTCATTGGTCCGGTTGATTATGAAGTTAATAGAGCACTGGTAAAAGTGTATCCTTGATGTGATGTGGAACGGTTTGCAAACTTGACCTATCTTTGGCTGCATGATTCTTGACCGTGGTAATTATTGTTGTTAGGAACTTCAAATTGCAAGAGGCTGTACATCATCTCTTCAAGGAAATGTGGACTTTGATCACAAGAACAAACACTCTAACTTGAGAGAAGGTATGTCAGAAAATTCAGCTTTTCCAGGAGTGGACAATTCTTGCACTAAAGTTGAGGTTACTGCTGAAGCTGATGGTGCAATGCACAATCAGAAGACATGTGAAGGAAAATCTGTAAAATATTCAGCTGAAATTTTAGTAGAGACTGAAAAGGAGTTTCATCATGAAGAGACAAGTGGATCAGATTCAAGCACTACAAGAGGAAAGAATCCGACTGATCAAGTTGGTAATGGTGAAAATACAAAACCAGGTGAGCATTTCCATAGAAGTATAGTTGGAGTTGTTCAACAAGATGGGAGGGTTGAAAGTGTGAATCACGAAGAGAAGCAAGTACGGAAACGTAAACGACATATAATGAACGATAAACAAATATCGTTGATTGAAAGGGGGCTGCAGGAGGAACCTGATATGCATAAGAACGCAGCCTCTTTACAATCATGGTCTGACAGATTAAGTTTACATGTAAGAGAATTTTAGCCATGTTCTTTTTTATTTGTTTGAAGCTATTTTACCTTGATAAATAGTCTCTATGTGTTAATGCCTAGATAGATAGTATTTACGTATTATGATTCTTACGCGTGGTCCTTGC
AGGGTTCTGAAGTCACATCTTCACAGCTGAAAAATTGGTGAGGCAACTTTCCTTTTCGTTGTTATATATACATGTAACACATCATTGAACACATAACTTGTTGTCATGCTCATCGAGGTCTCTATTTTGCGGGAATTTCACTGGTGTGGAAAGCTTTAAAACGAGTGGCGGCACTATATCTATGCCTATAATACAAGAAGGCTTTATTTTTAATTCATTCATCATCGCCCTCTAAAGAGGCTTATTTTAAATGTACAATAAACTTGACACTGGATTGACATCAACATAGTAAAACCTTCTAAATCTTAGAAAGTATTAAATAAGGAGAATACAGGTAAATGAAGGTTGGGTTCTAACACCTAGCGTGCTTCCACTCTTAAATTCCGTGGAAGCTTAACTGCTTCTTAAAATTTTTCCTGGAAATAGCAAATATACGATGTCTGATAGCACCTTCAAAGACTGTTTATGTGAAGAAAGCAAACTCGTCATTAATGGGTACACTTTTTTTTTTTTTTTGGTTCAGATAGGGGTAAATTCCCGTGAGAGAAGGCATGGAGTGCTAAAATATGCAATATGATGGATACTTACTCTAATTTAGGCTTTGAAAGTGCTTTTAGCCTATCAATGCCTGCTTGTCTCTTCTCCGTAGAGTGAAGTCTGAACCTAATTATACACTAAAAATGTAAACTGTTTTTTTTCCTTGCTTGGCCCTTTGCTTTTCTGAGTTTCTCTCTTGGTTGAAGCCAAAATCTAGCTGCCTAGGCGTATGAGTACATAACTTGTATTTTCAAAAGAAATAAAATAAATAATTGGACATGAGATAGGAGTGTACCATTTAACTATATGAATATCAACTTCCTTGCAACTCTTGCAAAATAGGTTACTTCATACATTACCACTGTTCAAGCTATCAAATCTGATGATATGTATAACCACCGTTACTCCATGTGAAATCGGATGATAGATTCACTAGGCTGACCAAAGCATGTAAAAAAGGAAAAAAAAAAAAGGTAAACTCTGCTACATTAAAAGTTGAATATGGTTATGCAAAAAAACCAACCTTTGCTACTTGTTATCAATTTTAGAAAGTCACCCCTGCTTCCCCTGTTCCCGGTTCTAGACTAGAGTTTTCATTCTGATTATCATGGTGCTTCCGGTGAACAGGTTGAATAACAGGAAAGCGAAGCTTGCGCGAGCAGCCAAGGACGGTGGTCACCCATTGTCTGAGGAGAATGCTATAGCTGACGCACAAGCTGGGTCCCTCATGCATACTACTTCGGGTTCTCCTGAAAGTCAGAATGAGGACTTGCCTCCTTCCTCTGCACCCAAGGACCACAACCAGGTGTCTGCAGTCCGGAGAACCAGTTCAGGGATAACTATGAGCCCAGTTCTTGAGATTCCCCGACCTGAAAGTTCAAGCAGATTAAATATGCAAATTGAGGGTCCGGCTATAAGTTCGAGTTGTGTAAACTTTGAGGCGGGTCAGGCTGTTCTACTTACTGATACCCAAGGGAAAGAAATAGCCAAAGGAACTGTATTTCAGGTTGAAGGTGAATGGCATGACTGCAATCTGTCAGATACGAGAACCTGTGTTGTAGATATTACAGAACTTAGGTCTGAGAGGATTCTGAGGCTTCCCCATCCATCAATAGAAGCAGGCGCCACATTTGAACAGGCTGAAGTCAAGATTGGGACTATGAGAGTGCTATGGGATTCTGGCCGGATGCTGAAGCCGTAGATAGTACCATGAATATCATCTTGCTTAGAAATGGAATGTTTATTGGAGGTTGTTTAAAGTAGAGTAAGCTGGTGATGTTGATCTAAGTCAAAGTAGAGCCACAAATTTGGCAGAATATAATTAACAGGTTTAGGTTGGCGCTCTTTTTAAAAGTAGAGCTCCATTTCTTGTATAGCATCTCTGTTGGGAAGTTGTGATCTTAACTATGTCATTTGTAAATTACTAATGTCTGATGATATCCGGGAATGATAATCTTGTATGA
TTTTGAGTTCTGACCGTGTTGCAATTTCAATGAGTAGGGTTTGCTTGCTAAGAGTGTTGCAATTTCAATGAGTAGGGTTTGCTTGCTAAGAGCTAAGCTAAGGTTGGTGAAATAACTGTGAATGACCATTGGTGAGGTATCAGCAATGGCGGAGGGCAGGATCGGGGTGTCGACATGTTGGTAAAACAATTATTTAATCCGTATTAATTTACGTTTTCTAATAAATATTACCTTTGTCCTTATTGTTCGCCCCATGTAACAAATGTGTTTTATTAAGATATGTAGTCCG
mRNA sequence
ATGAGGAATTTGATGGCGGAGCCTTCTCCTAGTACTGCTCGGGCTATTGACTTGATCACAGAAGTAAAAGAGTTGCAGAGATTTAACTCTCAGGAGCTTAGTAAGCTGTTAAAGGACTCTGAAAACTTCTCCCTCTTCTACATTACAGCAAAGTCTTTGCAGATTGACATGGAAAAGCTTGTTCGACTACTTCCTTTGCACCTTACTGCCGTGCTTTTGTCCTCTCAAAGAGATGAAGCCTCATTGAGATATCTATTATCTGGACTCAGGCTTCTATATTCTTTATGTGAAATATCTTCTCGCCACCCTAAGCTTGAACAGATTTTGCTCGACGATCTGAAAGTCTCAGAACAGCTGCTTGATTTGGTGTTCTATTTACTTATCCTCTGCTGTAAACAGGGATACAGTTTTTCTGGTGCTATGCCACTTCTTTACTCAGCTCTTGTAGCTTGCAGTATGTATCTACTGACCGCTTGCATTTCATCACAGTGGAGTGAACTCACTTATGTTCTCCAGGCACACCCTAAGGTTGACATATTTTTAGATGCAGCCTTTGGAGCTGTCCGCATGAATATCAACTTCCTTCAGATCAAGCTGTCAGCAGAGGAAAATGAGTTCACAAATTCTAGTCTAACAGCTGAAAAAGCAGTCCACTTTTGTTTCCAGCAATGTGAGGCTTCTTTGCAGTTCCTCCATTCATTATGCCAAAATAAGTCTTTTCGAGAACGCCTTCTCAAGAATAAGGAATTATGTGGGAAAGGAGGAGTCCTTCATCTTGTTCAAGCCACCTTGAAGTTGAATATCTCGCCATATCTGAAGGAGCCACTTGAAATGATAGCTGCTATTTCTAGACTGAAGGCTCGAGTTTTATCAATTCTGCTGCATCTTTGTGAATCAGAAAGTGTGTCTTATCTGGATGAGGTAGCCAGCATACCAGAAAGCCTGGATCTGGCAAAGTCTGTTGCAGTAGAGGTTATTACATTACTGAAGACCATGGTCAGCAGAGATCTTAAACTTCTGGCGAATCACTCGGGAAAAACATATCCAAGTGGGCTTTTGCAGCTCAATGCATTGCGCCTGACAGATATCTTCTCTGATGATTCAAATTTTAGATCGTACATTACAATATACTTTGCTGAAGTGTTGGCAGCAATCTTGTCAACTCCTTACCAACAGTTTTTGGCTAGTTGGTGTTCATCAGAATTACCTCTGAAGGAAGAGGATGCTTCATTAGAATATGAGCCTTTTACTATGGCTGGATTGATTTTGGACTCAACTTCAAGTCTAGATACAATAAGATCCTTTGAATCCAACTTTATTCCAACCAACAATGCGCAATCATCTTATGCCCATCAACGAACTTCTCTATTGGTTAAAGTGATATCCAATCTGCATTGTTTTGTGCCTAAGATTTGCAAAGAGCAGGAGAGGAACTTTTTTCTTCACAAGTTTCTTGAACGTCTGAAAATGGACTGGCAAGACCCAAAGGATGGATTTGCGTTTAGTTCTGCACCTCAAAAAGCTGCCACTATTTGCAAGAATCTTCGTTCGTTGTTGGGTCATGCAGAATCTTTAATTCCGACTTCTCTTAACGAGGAAGATGTACAGCTTTTAAGGCTTTTCTTTACCCAACTGCAGTCACTCATTGGTCCGGTTGATTATGAAGTTAATAGAGCACTGGAACTTCAAATTGCAAGAGGCTGTACATCATCTCTTCAAGGAAATGTGGACTTTGATCACAAGAACAAACACTCTAACTTGAGAGAAGGTATGTCAGAAAATTCAGCTTTTCCAGGAGTGGACAATTCTTGCACTAAAGTTGAGGTTACTGCTGAAGCTGATGGTGCAATGCACAATCAGAAGACATGTGAAGGAAAATCTGTAAAATATTCAGCTGAAATTTTAGTAGAGACTGAAAAGGAGTTTCATCATGAAGAGACAAGTGGATCAGATTCAAGCACTACAAGAGGAAAGAATCCGACTGATCAAGTTGGTAATGG
TGAAAATACAAAACCAGGTGAGCATTTCCATAGAAGTATAGTTGGAGTTGTTCAACAAGATGGGAGGGTTGAAAGTGTGAATCACGAAGAGAAGCAAGTACGGAAACGTAAACGACATATAATGAACGATAAACAAATATCGTTGATTGAAAGGGGGCTGCAGGAGGAACCTGATATGCATAAGAACGCAGCCTCTTTACAATCATGGTCTGACAGATTAAGTTTACATGGTTCTGAAGTCACATCTTCACAGCTGAAAAATTGGTTGAATAACAGGAAAGCGAAGCTTGCGCGAGCAGCCAAGGACGGTGGTCACCCATTGTCTGAGGAGAATGCTATAGCTGACGCACAAGCTGGGTCCCTCATGCATACTACTTCGGGTTCTCCTGAAAGTCAGAATGAGGACTTGCCTCCTTCCTCTGCACCCAAGGACCACAACCAGGTGTCTGCAGTCCGGAGAACCAGTTCAGGGATAACTATGAGCCCAGTTCTTGAGATTCCCCGACCTGAAAGTTCAAGCAGATTAAATATGCAAATTGAGGGTCCGGCTATAAGTTCGAGTTGTGTAAACTTTGAGGCGGGTCAGGCTGTTCTACTTACTGATACCCAAGGGAAAGAAATAGCCAAAGGAACTGTATTTCAGGTTGAAGGTGAATGGCATGACTGCAATCTGTCAGATACGAGAACCTGTGTTGTAGATATTACAGAACTTAGGTCTGAGAGGATTCTGAGGCTTCCCCATCCATCAATAGAAGCAGGCGCCACATTTGAACAGGCTGAAGTCAAGATTGGGACTATGAGAGTGCTATGGGATTCTGGCCGGATGCTGAAGCCGTAGATAGTACCATGAATATCATCTTGCTTAGAAATGGAATGTTTATTGGAGGTTGTTTAAAGTAGAGTAAGCTGGTGATGTTGATCTAAGTCAAAGTAGAGCCACAAATTTGGCAGAATATAATTAACAGGTTTAGGTTGGCGCTCTTTTTAAAAGTAGAGCTCCATTTCTTGTATAGCATCTCTGTTGGGAAGTTGTGATCTTAACTATGTCATTTGTAAATTACTAATGTCTGATGATATCCGGGAATGATAATCTTGTATGATTTTGAGTTCTGACCGTGTTGCAATTTCAATGAGTAGGGTTTGCTTGCTAAGAGTGTTGCAATTTCAATGAGTAGGGTTTGCTTGCTAAGAGCTAAGCTAAGGTTGGTGAAATAACTGTGAATGACCATTGGTGAGGTATCAGCAATGGCGGAGGGCAGGATCGGGGTGTCGACATGTTGGTAAAACAATTATTTAATCCGTATTAATTTACGTTTTCTAATAAATATTACCTTTGTCCTTATTGTTCGCCCCATGTAACAAATGTGTTTTATTAAGATATGTAGTCCG
Coding sequence (CDS)
ATGAGGAATTTGATGGCGGAGCCTTCTCCTAGTACTGCTCGGGCTATTGACTTGATCACAGAAGTAAAAGAGTTGCAGAGATTTAACTCTCAGGAGCTTAGTAAGCTGTTAAAGGACTCTGAAAACTTCTCCCTCTTCTACATTACAGCAAAGTCTTTGCAGATTGACATGGAAAAGCTTGTTCGACTACTTCCTTTGCACCTTACTGCCGTGCTTTTGTCCTCTCAAAGAGATGAAGCCTCATTGAGATATCTATTATCTGGACTCAGGCTTCTATATTCTTTATGTGAAATATCTTCTCGCCACCCTAAGCTTGAACAGATTTTGCTCGACGATCTGAAAGTCTCAGAACAGCTGCTTGATTTGGTGTTCTATTTACTTATCCTCTGCTGTAAACAGGGATACAGTTTTTCTGGTGCTATGCCACTTCTTTACTCAGCTCTTGTAGCTTGCAGTATGTATCTACTGACCGCTTGCATTTCATCACAGTGGAGTGAACTCACTTATGTTCTCCAGGCACACCCTAAGGTTGACATATTTTTAGATGCAGCCTTTGGAGCTGTCCGCATGAATATCAACTTCCTTCAGATCAAGCTGTCAGCAGAGGAAAATGAGTTCACAAATTCTAGTCTAACAGCTGAAAAAGCAGTCCACTTTTGTTTCCAGCAATGTGAGGCTTCTTTGCAGTTCCTCCATTCATTATGCCAAAATAAGTCTTTTCGAGAACGCCTTCTCAAGAATAAGGAATTATGTGGGAAAGGAGGAGTCCTTCATCTTGTTCAAGCCACCTTGAAGTTGAATATCTCGCCATATCTGAAGGAGCCACTTGAAATGATAGCTGCTATTTCTAGACTGAAGGCTCGAGTTTTATCAATTCTGCTGCATCTTTGTGAATCAGAAAGTGTGTCTTATCTGGATGAGGTAGCCAGCATACCAGAAAGCCTGGATCTGGCAAAGTCTGTTGCAGTAGAGGTTATTACATTACTGAAGACCATGGTCAGCAGAGATCTTAAACTTCTGGCGAATCACTCGGGAAAAACATATCCAAGTGGGCTTTTGCAGCTCAATGCATTGCGCCTGACAGATATCTTCTCTGATGATTCAAATTTTAGATCGTACATTACAATATACTTTGCTGAAGTGTTGGCAGCAATCTTGTCAACTCCTTACCAACAGTTTTTGGCTAGTTGGTGTTCATCAGAATTACCTCTGAAGGAAGAGGATGCTTCATTAGAATATGAGCCTTTTACTATGGCTGGATTGATTTTGGACTCAACTTCAAGTCTAGATACAATAAGATCCTTTGAATCCAACTTTATTCCAACCAACAATGCGCAATCATCTTATGCCCATCAACGAACTTCTCTATTGGTTAAAGTGATATCCAATCTGCATTGTTTTGTGCCTAAGATTTGCAAAGAGCAGGAGAGGAACTTTTTTCTTCACAAGTTTCTTGAACGTCTGAAAATGGACTGGCAAGACCCAAAGGATGGATTTGCGTTTAGTTCTGCACCTCAAAAAGCTGCCACTATTTGCAAGAATCTTCGTTCGTTGTTGGGTCATGCAGAATCTTTAATTCCGACTTCTCTTAACGAGGAAGATGTACAGCTTTTAAGGCTTTTCTTTACCCAACTGCAGTCACTCATTGGTCCGGTTGATTATGAAGTTAATAGAGCACTGGAACTTCAAATTGCAAGAGGCTGTACATCATCTCTTCAAGGAAATGTGGACTTTGATCACAAGAACAAACACTCTAACTTGAGAGAAGGTATGTCAGAAAATTCAGCTTTTCCAGGAGTGGACAATTCTTGCACTAAAGTTGAGGTTACTGCTGAAGCTGATGGTGCAATGCACAATCAGAAGACATGTGAAGGAAAATCTGTAAAATATTCAGCTGAAATTTTAGTAGAGACTGAAAAGGAGTTTCATCATGAAGAGACAAGTGGATCAGATTCAAGCACTACAAGAGGAAAGAATCCGACTGATCAAGTTGGTAATGG
TGAAAATACAAAACCAGGTGAGCATTTCCATAGAAGTATAGTTGGAGTTGTTCAACAAGATGGGAGGGTTGAAAGTGTGAATCACGAAGAGAAGCAAGTACGGAAACGTAAACGACATATAATGAACGATAAACAAATATCGTTGATTGAAAGGGGGCTGCAGGAGGAACCTGATATGCATAAGAACGCAGCCTCTTTACAATCATGGTCTGACAGATTAAGTTTACATGGTTCTGAAGTCACATCTTCACAGCTGAAAAATTGGTTGAATAACAGGAAAGCGAAGCTTGCGCGAGCAGCCAAGGACGGTGGTCACCCATTGTCTGAGGAGAATGCTATAGCTGACGCACAAGCTGGGTCCCTCATGCATACTACTTCGGGTTCTCCTGAAAGTCAGAATGAGGACTTGCCTCCTTCCTCTGCACCCAAGGACCACAACCAGGTGTCTGCAGTCCGGAGAACCAGTTCAGGGATAACTATGAGCCCAGTTCTTGAGATTCCCCGACCTGAAAGTTCAAGCAGATTAAATATGCAAATTGAGGGTCCGGCTATAAGTTCGAGTTGTGTAAACTTTGAGGCGGGTCAGGCTGTTCTACTTACTGATACCCAAGGGAAAGAAATAGCCAAAGGAACTGTATTTCAGGTTGAAGGTGAATGGCATGACTGCAATCTGTCAGATACGAGAACCTGTGTTGTAGATATTACAGAACTTAGGTCTGAGAGGATTCTGAGGCTTCCCCATCCATCAATAGAAGCAGGCGCCACATTTGAACAGGCTGAAGTCAAGATTGGGACTATGAGAGTGCTATGGGATTCTGGCCGGATGCTGAAGCCGTAG
Protein sequence
MRNLMAEPSPSTARAIDLITEVKELQRFNSQELSKLLKDSENFSLFYITAKSLQIDMEKLVRLLPLHLTAVLLSSQRDEASLRYLLSGLRLLYSLCEISSRHPKLEQILLDDLKVSEQLLDLVFYLLILCCKQGYSFSGAMPLLYSALVACSMYLLTACISSQWSELTYVLQAHPKVDIFLDAAFGAVRMNINFLQIKLSAEENEFTNSSLTAEKAVHFCFQQCEASLQFLHSLCQNKSFRERLLKNKELCGKGGVLHLVQATLKLNISPYLKEPLEMIAAISRLKARVLSILLHLCESESVSYLDEVASIPESLDLAKSVAVEVITLLKTMVSRDLKLLANHSGKTYPSGLLQLNALRLTDIFSDDSNFRSYITIYFAEVLAAILSTPYQQFLASWCSSELPLKEEDASLEYEPFTMAGLILDSTSSLDTIRSFESNFIPTNNAQSSYAHQRTSLLVKVISNLHCFVPKICKEQERNFFLHKFLERLKMDWQDPKDGFAFSSAPQKAATICKNLRSLLGHAESLIPTSLNEEDVQLLRLFFTQLQSLIGPVDYEVNRALELQIARGCTSSLQGNVDFDHKNKHSNLREGMSENSAFPGVDNSCTKVEVTAEADGAMHNQKTCEGKSVKYSAEILVETEKEFHHEETSGSDSSTTRGKNPTDQVGNGENTKPGEHFHRSIVGVVQQDGRVESVNHEEKQVRKRKRHIMNDKQISLIERGLQEEPDMHKNAASLQSWSDRLSLHGSEVTSSQLKNWLNNRKAKLARAAKDGGHPLSEENAIADAQAGSLMHTTSGSPESQNEDLPPSSAPKDHNQVSAVRRTSSGITMSPVLEIPRPESSSRLNMQIEGPAISSSCVNFEAGQAVLLTDTQGKEIAKGTVFQVEGEWHDCNLSDTRTCVVDITELRSERILRLPHPSIEAGATFEQAEVKIGTMRVLWDSGRMLKP
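The protein sequence above corresponds to the CDS read three bases at a time up to the terminal stop codon (TAG). The following is a minimal sketch of that translation, assuming the standard genetic code; the names `CODON_TABLE` and `translate` are illustrative and are not part of the database.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon.

    Codons containing characters other than A, C, G, T (for example N)
    are reported as 'X'.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the CDS listed above.
print(translate("ATGAGGAATTTGATGGCGGAG"))
```

Applied to the first seven codons of the CDS, the sketch yields MRNLMAE, which matches the start of the listed protein; the final codons AAG CCG TAG give KP followed by the stop.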
Homology
BLAST of Spo03238.1 vs. NCBI nr
Match:
gi|902232064|gb|KNA22502.1| (hypothetical protein SOVF_033670 [Spinacia oleracea])
HSP 1 Score: 1820.4 bits (4714), Expect = 0.000e+0
Identity = 945/945 (100.00%), Positives = 945/945 (100.00%), Query Frame = 1
Query: 1 MRNLMAEPSPSTARAIDLITEVKELQRFNSQELSKLLKDSENFSLFYITAKSLQIDMEKL 60
MRNLMAEPSPSTARAIDLITEVKELQRFNSQELSKLLKDSENFSLFYITAKSLQIDMEKL
Sbjct: 1 MRNLMAEPSPSTARAIDLITEVKELQRFNSQELSKLLKDSENFSLFYITAKSLQIDMEKL 60
Query: 61 VRLLPLHLTAVLLSSQRDEASLRYLLSGLRLLYSLCEISSRHPKLEQILLDDLKVSEQLL 120
VRLLPLHLTAVLLSSQRDEASLRYLLSGLRLLYSLCEISSRHPKLEQILLDDLKVSEQLL
Sbjct: 61 VRLLPLHLTAVLLSSQRDEASLRYLLSGLRLLYSLCEISSRHPKLEQILLDDLKVSEQLL 120
Query: 121 DLVFYLLILCCKQGYSFSGAMPLLYSALVACSMYLLTACISSQWSELTYVLQAHPKVDIF 180
DLVFYLLILCCKQGYSFSGAMPLLYSALVACSMYLLTACISSQWSELTYVLQAHPKVDIF
Sbjct: 121 DLVFYLLILCCKQGYSFSGAMPLLYSALVACSMYLLTACISSQWSELTYVLQAHPKVDIF 180
Query: 181 LDAAFGAVRMNINFLQIKLSAEENEFTNSSLTAEKAVHFCFQQCEASLQFLHSLCQNKSF 240
LDAAFGAVRMNINFLQIKLSAEENEFTNSSLTAEKAVHFCFQQCEASLQFLHSLCQNKSF
Sbjct: 181 LDAAFGAVRMNINFLQIKLSAEENEFTNSSLTAEKAVHFCFQQCEASLQFLHSLCQNKSF 240
Query: 241 RERLLKNKELCGKGGVLHLVQATLKLNISPYLKEPLEMIAAISRLKARVLSILLHLCESE 300
RERLLKNKELCGKGGVLHLVQATLKLNISPYLKEPLEMIAAISRLKARVLSILLHLCESE
Sbjct: 241 RERLLKNKELCGKGGVLHLVQATLKLNISPYLKEPLEMIAAISRLKARVLSILLHLCESE 300
Query: 301 SVSYLDEVASIPESLDLAKSVAVEVITLLKTMVSRDLKLLANHSGKTYPSGLLQLNALRL 360
SVSYLDEVASIPESLDLAKSVAVEVITLLKTMVSRDLKLLANHSGKTYPSGLLQLNALRL
Sbjct: 301 SVSYLDEVASIPESLDLAKSVAVEVITLLKTMVSRDLKLLANHSGKTYPSGLLQLNALRL 360
Query: 361 TDIFSDDSNFRSYITIYFAEVLAAILSTPYQQFLASWCSSELPLKEEDASLEYEPFTMAG 420
TDIFSDDSNFRSYITIYFAEVLAAILSTPYQQFLASWCSSELPLKEEDASLEYEPFTMAG
Sbjct: 361 TDIFSDDSNFRSYITIYFAEVLAAILSTPYQQFLASWCSSELPLKEEDASLEYEPFTMAG 420
Query: 421 LILDSTSSLDTIRSFESNFIPTNNAQSSYAHQRTSLLVKVISNLHCFVPKICKEQERNFF 480
LILDSTSSLDTIRSFESNFIPTNNAQSSYAHQRTSLLVKVISNLHCFVPKICKEQERNFF
Sbjct: 421 LILDSTSSLDTIRSFESNFIPTNNAQSSYAHQRTSLLVKVISNLHCFVPKICKEQERNFF 480
Query: 481 LHKFLERLKMDWQDPKDGFAFSSAPQKAATICKNLRSLLGHAESLIPTSLNEEDVQLLRL 540
LHKFLERLKMDWQDPKDGFAFSSAPQKAATICKNLRSLLGHAESLIPTSLNEEDVQLLRL
Sbjct: 481 LHKFLERLKMDWQDPKDGFAFSSAPQKAATICKNLRSLLGHAESLIPTSLNEEDVQLLRL 540
Query: 541 FFTQLQSLIGPVDYEVNRALELQIARGCTSSLQGNVDFDHKNKHSNLREGMSENSAFPGV 600
FFTQLQSLIGPVDYEVNRALELQIARGCTSSLQGNVDFDHKNKHSNLREGMSENSAFPGV
Sbjct: 541 FFTQLQSLIGPVDYEVNRALELQIARGCTSSLQGNVDFDHKNKHSNLREGMSENSAFPGV 600
Query: 601 DNSCTKVEVTAEADGAMHNQKTCEGKSVKYSAEILVETEKEFHHEETSGSDSSTTRGKNP 660
DNSCTKVEVTAEADGAMHNQKTCEGKSVKYSAEILVETEKEFHHEETSGSDSSTTRGKNP
Sbjct: 601 DNSCTKVEVTAEADGAMHNQKTCEGKSVKYSAEILVETEKEFHHEETSGSDSSTTRGKNP 660
Query: 661 TDQVGNGENTKPGEHFHRSIVGVVQQDGRVESVNHEEKQVRKRKRHIMNDKQISLIERGL 720
TDQVGNGENTKPGEHFHRSIVGVVQQDGRVESVNHEEKQVRKRKRHIMNDKQISLIERGL
Sbjct: 661 TDQVGNGENTKPGEHFHRSIVGVVQQDGRVESVNHEEKQVRKRKRHIMNDKQISLIERGL 720
Query: 721 QEEPDMHKNAASLQSWSDRLSLHGSEVTSSQLKNWLNNRKAKLARAAKDGGHPLSEENAI 780
QEEPDMHKNAASLQSWSDRLSLHGSEVTSSQLKNWLNNRKAKLARAAKDGGHPLSEENAI
Sbjct: 721 QEEPDMHKNAASLQSWSDRLSLHGSEVTSSQLKNWLNNRKAKLARAAKDGGHPLSEENAI 780
Query: 781 ADAQAGSLMHTTSGSPESQNEDLPPSSAPKDHNQVSAVRRTSSGITMSPVLEIPRPESSS 840
ADAQAGSLMHTTSGSPESQNEDLPPSSAPKDHNQVSAVRRTSSGITMSPVLEIPRPESSS
Sbjct: 781 ADAQAGSLMHTTSGSPESQNEDLPPSSAPKDHNQVSAVRRTSSGITMSPVLEIPRPESSS 840
Query: 841 RLNMQIEGPAISSSCVNFEAGQAVLLTDTQGKEIAKGTVFQVEGEWHDCNLSDTRTCVVD 900
RLNMQIEGPAISSSCVNFEAGQAVLLTDTQGKEIAKGTVFQVEGEWHDCNLSDTRTCVVD
Sbjct: 841 RLNMQIEGPAISSSCVNFEAGQAVLLTDTQGKEIAKGTVFQVEGEWHDCNLSDTRTCVVD 900
Query: 901 ITELRSERILRLPHPSIEAGATFEQAEVKIGTMRVLWDSGRMLKP 946
ITELRSERILRLPHPSIEAGATFEQAEVKIGTMRVLWDSGRMLKP
Sbjct: 901 ITELRSERILRLPHPSIEAGATFEQAEVKIGTMRVLWDSGRMLKP 945
BLAST of Spo03238.1 vs. NCBI nr
Match:
gi|731369258|ref|XP_010696495.1| (PREDICTED: uncharacterized protein LOC104909017 isoform X1 [Beta vulgaris subsp. vulgaris])
HSP 1 Score: 1434.9 bits (3713), Expect = 0.000e+0
Identity = 763/950 (80.32%), Positives = 835/950 (87.89%), Query Frame = 1
Query: 1 MRNLMAEPSPSTARAIDLITEVKELQRFNSQELSKLLKDSENFSLFYITAK--SLQIDME 60
MRN + E S S+ AIDLI+EVKELQRFNSQELSKLLKD ENFSLF+ITA S+QIDM+
Sbjct: 2 MRNSITEASHSSRGAIDLISEVKELQRFNSQELSKLLKDCENFSLFHITANGLSVQIDMD 61
Query: 61 KLVRLLPLHLTAVLLSSQRDEASLRYLLSGLRLLYSLCEISSRHPKLEQILLDDLKVSEQ 120
KLVRLLPLHL AVLLSSQRDEASLRYLLSGLRLLY+LCEI+SRH KLEQIL DDLKVSEQ
Sbjct: 62 KLVRLLPLHLIAVLLSSQRDEASLRYLLSGLRLLYTLCEIASRHSKLEQILFDDLKVSEQ 121
Query: 121 LLDLVFYLLILCCKQGYSFSGAMPLLYSALVACSMYLLTACISSQWSELTYVLQAHPKVD 180
LLDLVF+LL++CCKQG SFSG MPLLYSALVACSMYLLT CISSQWSEL YVLQAHPKVD
Sbjct: 122 LLDLVFHLLLVCCKQGNSFSGGMPLLYSALVACSMYLLTTCISSQWSELAYVLQAHPKVD 181
Query: 181 IFLDAAFGAVRMNINFLQIKLSAEENEF-TNSSLTAEKAVHFCFQQCEASLQFLHSLCQN 240
IFLDAAFGAVRM+IN LQ KLS EN+F + SSLTAE+ V+FC QQCEASLQFL SLCQN
Sbjct: 182 IFLDAAFGAVRMSINILQNKLSEAENDFHSKSSLTAERVVNFCCQQCEASLQFLQSLCQN 241
Query: 241 KSFRERLLKNKELCGKGGVLHLVQATLKLNISPYLKEPLEMIAAISRLKARVLSILLHLC 300
KSFRER+L+NKELCGKGGVLHLVQ+TLKLNISP+LKEPL ++AA+SRLKARVLSILLHLC
Sbjct: 242 KSFRERILRNKELCGKGGVLHLVQSTLKLNISPFLKEPLAVVAAVSRLKARVLSILLHLC 301
Query: 301 ESESVSYLDEVASIPESLDLAKSVAVEVITLLKTMVSRDLKLLANHSGKTYPSGLLQLNA 360
E+ESVSYLDEVASIPESLDLAKSVAVEVITLLKTM+SRDLKLL NHSGK YP GLLQLNA
Sbjct: 302 EAESVSYLDEVASIPESLDLAKSVAVEVITLLKTMLSRDLKLLENHSGKIYPRGLLQLNA 361
Query: 361 LRLTDIFSDDSNFRSYITIYFAEVLAAILSTPYQQFLASWCSSELPLKEEDASLEYEPFT 420
LRL DIFSDDSNFRSYITIYFAEVLAAIL PYQQFL+SWCSS+LPLKE DASLEY+PFT
Sbjct: 362 LRLADIFSDDSNFRSYITIYFAEVLAAILLIPYQQFLSSWCSSDLPLKEVDASLEYDPFT 421
Query: 421 MAGLILDSTSSLDTI--RSFESNFIPTNNAQSSYAHQRTSLLVKVISNLHCFVPKICKEQ 480
MAG ILDS+ LD + ++ ESNF PTNNAQ+SYAHQRTSLLVKVI+NLHCFVP ICKEQ
Sbjct: 422 MAGWILDSSPMLDIVSSKTCESNFNPTNNAQASYAHQRTSLLVKVIANLHCFVPNICKEQ 481
Query: 481 ERNFFLHKFLERLKMDWQDPKDGFAFSSAPQKAATICKNLRSLLGHAESLIPTSLNEEDV 540
ERNFFLHKFLERLK+D QD + GF+FSSAPQKA TICKNLRSLLGHAESLIPT LNEEDV
Sbjct: 482 ERNFFLHKFLERLKVDRQDQQAGFSFSSAPQKAVTICKNLRSLLGHAESLIPTFLNEEDV 541
Query: 541 QLLRLFFTQLQSLIGPVDYEVNRALELQIARGCTSSLQGNVDFDHKNKHSNLREGMSENS 600
QLLRLFFTQLQSLIGPVDYEVN+AL + SL GNV+ D+K HSNLREG SEN
Sbjct: 542 QLLRLFFTQLQSLIGPVDYEVNKALVNEY-----PSLPGNVELDNKIGHSNLREGTSENL 601
Query: 601 AFPGVDNSCTKVEVTAEADGAMHNQKTCEGKSVKYSAEILVETEKEFHHEETSGSDSSTT 660
AF GVDNSC KVE+ EADGA+H++KTCEGKS+K E LVET+KE H+ ETSGSDSS+T
Sbjct: 602 AFSGVDNSCVKVEIIGEADGALHDEKTCEGKSIKALGESLVETDKEVHNAETSGSDSSST 661
Query: 661 RGKNPTDQVGNGENTKPGEHFHRSIVGVVQQDGRVESVNHEEKQVRKRKRHIMNDKQISL 720
RGKNPTDQVGNG+NTK EH HRS+VG Q+D R++++N EEK+VRKRKR+IMNDKQIS+
Sbjct: 662 RGKNPTDQVGNGDNTKSSEHIHRSVVGGFQEDERIDNLNCEEKRVRKRKRNIMNDKQISM 721
Query: 721 IERGLQEEPDMHKNAASLQSWSDRLSLHGSEVTSSQLKNWLNNRKAKLARAAKDGGHPLS 780
IER LQEEPDMHKNAASLQSW+DRLS HG+EVT SQLKNWLNNRKAKLARAAKD G PLS
Sbjct: 722 IERALQEEPDMHKNAASLQSWADRLSFHGAEVTFSQLKNWLNNRKAKLARAAKD-GRPLS 781
Query: 781 EENAIADAQAGSLMHTTSGSPESQNEDLPPSSAPKDHNQVSAVRRTSSGITMSPVLEIPR 840
EEN AD GSL+ T S SPES EDLP SSA KD QVSAVR+TS G MS LE
Sbjct: 782 EENVTADKLVGSLVRTASDSPESHIEDLPTSSASKDRIQVSAVRKTSLGTIMSQSLETTP 841
Query: 841 PESSSRLNMQIEGPAISSSCVNFEAGQAVLLTDTQGKEIAKGTVFQVEGEWHDCNLSDTR 900
PESS RLN++ E P ++ CV FEAGQAV+LTDTQGKEIAKGTV+QVEGEWH CNL+DTR
Sbjct: 842 PESSIRLNLRNECPTSNTCCVIFEAGQAVVLTDTQGKEIAKGTVYQVEGEWHGCNLADTR 901
Query: 901 TCVVDITELRSERILRLPHPSIEAGATFEQAEVKIGTMRVLWDSGRMLKP 946
TCVVDITELRSERI+RLPHP+IEAGATFEQAEVKIGTMRVLWDSGRMLKP
Sbjct: 902 TCVVDITELRSERIVRLPHPTIEAGATFEQAEVKIGTMRVLWDSGRMLKP 945
BLAST of Spo03238.1 vs. NCBI nr
Match:
gi|731369262|ref|XP_010696496.1| (PREDICTED: uncharacterized protein LOC104909017 isoform X2 [Beta vulgaris subsp. vulgaris])
HSP 1 Score: 1229.2 bits (3179), Expect = 0.000e+0
Identity = 647/808 (80.07%), Positives = 708/808 (87.62%), Query Frame = 1
Query: 141 MPLLYSALVACSMYLLTACISSQWSELTYVLQAHPKVDIFLDAAFGAVRMNINFLQIKLS 200
MPLLYSALVACSMYLLT CISSQWSEL YVLQAHPKVDIFLDAAFGAVRM+IN LQ KLS
Sbjct: 1 MPLLYSALVACSMYLLTTCISSQWSELAYVLQAHPKVDIFLDAAFGAVRMSINILQNKLS 60
Query: 201 AEENEF-TNSSLTAEKAVHFCFQQCEASLQFLHSLCQNKSFRERLLKNKELCGKGGVLHL 260
EN+F + SSLTAE+ V+FC QQCEASLQFL SLCQNKSFRER+L+NKELCGKGGVLHL
Sbjct: 61 EAENDFHSKSSLTAERVVNFCCQQCEASLQFLQSLCQNKSFRERILRNKELCGKGGVLHL 120
Query: 261 VQATLKLNISPYLKEPLEMIAAISRLKARVLSILLHLCESESVSYLDEVASIPESLDLAK 320
VQ+TLKLNISP+LKEPL ++AA+SRLKARVLSILLHLCE+ESVSYLDEVASIPESLDLAK
Sbjct: 121 VQSTLKLNISPFLKEPLAVVAAVSRLKARVLSILLHLCEAESVSYLDEVASIPESLDLAK 180
Query: 321 SVAVEVITLLKTMVSRDLKLLANHSGKTYPSGLLQLNALRLTDIFSDDSNFRSYITIYFA 380
SVAVEVITLLKTM+SRDLKLL NHSGK YP GLLQLNALRL DIFSDDSNFRSYITIYFA
Sbjct: 181 SVAVEVITLLKTMLSRDLKLLENHSGKIYPRGLLQLNALRLADIFSDDSNFRSYITIYFA 240
Query: 381 EVLAAILSTPYQQFLASWCSSELPLKEEDASLEYEPFTMAGLILDSTSSLDTI--RSFES 440
EVLAAIL PYQQFL+SWCSS+LPLKE DASLEY+PFTMAG ILDS+ LD + ++ ES
Sbjct: 241 EVLAAILLIPYQQFLSSWCSSDLPLKEVDASLEYDPFTMAGWILDSSPMLDIVSSKTCES 300
Query: 441 NFIPTNNAQSSYAHQRTSLLVKVISNLHCFVPKICKEQERNFFLHKFLERLKMDWQDPKD 500
NF PTNNAQ+SYAHQRTSLLVKVI+NLHCFVP ICKEQERNFFLHKFLERLK+D QD +
Sbjct: 301 NFNPTNNAQASYAHQRTSLLVKVIANLHCFVPNICKEQERNFFLHKFLERLKVDRQDQQA 360
Query: 501 GFAFSSAPQKAATICKNLRSLLGHAESLIPTSLNEEDVQLLRLFFTQLQSLIGPVDYEVN 560
GF+FSSAPQKA TICKNLRSLLGHAESLIPT LNEEDVQLLRLFFTQLQSLIGPVDYEVN
Sbjct: 361 GFSFSSAPQKAVTICKNLRSLLGHAESLIPTFLNEEDVQLLRLFFTQLQSLIGPVDYEVN 420
Query: 561 RALELQIARGCTSSLQGNVDFDHKNKHSNLREGMSENSAFPGVDNSCTKVEVTAEADGAM 620
+AL + SL GNV+ D+K HSNLREG SEN AF GVDNSC KVE+ EADGA+
Sbjct: 421 KALVNEY-----PSLPGNVELDNKIGHSNLREGTSENLAFSGVDNSCVKVEIIGEADGAL 480
Query: 621 HNQKTCEGKSVKYSAEILVETEKEFHHEETSGSDSSTTRGKNPTDQVGNGENTKPGEHFH 680
H++KTCEGKS+K E LVET+KE H+ ETSGSDSS+TRGKNPTDQVGNG+NTK EH H
Sbjct: 481 HDEKTCEGKSIKALGESLVETDKEVHNAETSGSDSSSTRGKNPTDQVGNGDNTKSSEHIH 540
Query: 681 RSIVGVVQQDGRVESVNHEEKQVRKRKRHIMNDKQISLIERGLQEEPDMHKNAASLQSWS 740
RS+VG Q+D R++++N EEK+VRKRKR+IMNDKQIS+IER LQEEPDMHKNAASLQSW+
Sbjct: 541 RSVVGGFQEDERIDNLNCEEKRVRKRKRNIMNDKQISMIERALQEEPDMHKNAASLQSWA 600
Query: 741 DRLSLHGSEVTSSQLKNWLNNRKAKLARAAKDGGHPLSEENAIADAQAGSLMHTTSGSPE 800
DRLS HG+EVT SQLKNWLNNRKAKLARAAKD G PLSEEN AD GSL+ T S SPE
Sbjct: 601 DRLSFHGAEVTFSQLKNWLNNRKAKLARAAKD-GRPLSEENVTADKLVGSLVRTASDSPE 660
Query: 801 SQNEDLPPSSAPKDHNQVSAVRRTSSGITMSPVLEIPRPESSSRLNMQIEGPAISSSCVN 860
S EDLP SSA KD QVSAVR+TS G MS LE PESS RLN++ E P ++ CV
Sbjct: 661 SHIEDLPTSSASKDRIQVSAVRKTSLGTIMSQSLETTPPESSIRLNLRNECPTSNTCCVI 720
Query: 861 FEAGQAVLLTDTQGKEIAKGTVFQVEGEWHDCNLSDTRTCVVDITELRSERILRLPHPSI 920
FEAGQAV+LTDTQGKEIAKGTV+QVEGEWH CNL+DTRTCVVDITELRSERI+RLPHP+I
Sbjct: 721 FEAGQAVVLTDTQGKEIAKGTVYQVEGEWHGCNLADTRTCVVDITELRSERIVRLPHPTI 780
Query: 921 EAGATFEQAEVKIGTMRVLWDSGRMLKP 946
EAGATFEQAEVKIGTMRVLWDSGRMLKP
Sbjct: 781 EAGATFEQAEVKIGTMRVLWDSGRMLKP 802
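The Identity figure in each HSP header counts the alignment columns where the Query and Sbjct rows carry the same residue, divided by the aligned length (gap columns included). A minimal Python sketch of that count for one row pair, copied from the first row of the alignment above; the function name `count_identities` is hypothetical and is not part of any BLAST tool. The Positives figure is not reproduced here because it also needs a substitution matrix.

```python
def count_identities(query: str, sbjct: str) -> tuple[int, int]:
    """Return (identical columns, aligned length) for two gapped alignment rows."""
    if len(query) != len(sbjct):
        raise ValueError("alignment rows must have the same length")
    # A column is an identity when both rows hold the same residue (not a gap).
    identical = sum(1 for q, s in zip(query, sbjct) if q == s and q != "-")
    return identical, len(query)


# Query 141-200 and Sbjct 1-60 from the Beta vulgaris isoform X2 hit.
query = "MPLLYSALVACSMYLLTACISSQWSELTYVLQAHPKVDIFLDAAFGAVRMNINFLQIKLS"
sbjct = "MPLLYSALVACSMYLLTTCISSQWSELAYVLQAHPKVDIFLDAAFGAVRMSINILQNKLS"

identical, length = count_identities(query, sbjct)
print(f"{identical}/{length} ({100 * identical / length:.2f}%)")  # 55/60 (91.67%)
```

Summing the two counts over every row of an HSP should give the totals printed in its header, for example 647/808 for this hit.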
BLAST of Spo03238.1 vs. NCBI nr
Match:
gi|870843757|gb|KMS96882.1| (hypothetical protein BVRB_7g180730 isoform A [Beta vulgaris subsp. vulgaris])
HSP 1 Score: 1138.3 bits (2943), Expect = 0.000e+0
Identity = 601/759 (79.18%), Positives = 660/759 (86.96%), Query Frame = 1
Query: 190 MNINFLQIKLSAEENEF-TNSSLTAEKAVHFCFQQCEASLQFLHSLCQNKSFRERLLKNK 249
M+IN LQ KLS EN+F + SSLTAE+ V+FC QQCEASLQFL SLCQNKSFRER+L+NK
Sbjct: 1 MSINILQNKLSEAENDFHSKSSLTAERVVNFCCQQCEASLQFLQSLCQNKSFRERILRNK 60
Query: 250 ELCGKGGVLHLVQATLKLNISPYLKEPLEMIAAISRLKARVLSILLHLCESESVSYLDEV 309
ELCGKGGVLHLVQ+TLKLNISP+LKEPL ++AA+SRLKARVLSILLHLCE+ESVSYLDEV
Sbjct: 61 ELCGKGGVLHLVQSTLKLNISPFLKEPLAVVAAVSRLKARVLSILLHLCEAESVSYLDEV 120
Query: 310 ASIPESLDLAKSVAVEVITLLKTMVSRDLKLLANHSGKTYPSGLLQLNALRLTDIFSDDS 369
ASIPESLDLAKSVAVEVITLLKTM+SRDLKLL NHSGK YP GLLQLNALRL DIFSDDS
Sbjct: 121 ASIPESLDLAKSVAVEVITLLKTMLSRDLKLLENHSGKIYPRGLLQLNALRLADIFSDDS 180
Query: 370 NFRSYITIYFAEVLAAILSTPYQQFLASWCSSELPLKEEDASLEYEPFTMAGLILDSTSS 429
NFRSYITIYFAEVLAAIL PYQQFL+SWCSS+LPLKE DASLEY+PFTMAG ILDS+
Sbjct: 181 NFRSYITIYFAEVLAAILLIPYQQFLSSWCSSDLPLKEVDASLEYDPFTMAGWILDSSPM 240
Query: 430 LDTIRS--FESNFIPTNNAQSSYAHQRTSLLVKVISNLHCFVPKICKEQERNFFLHKFLE 489
LD + S ESNF PTNNAQ+SYAHQRTSLLVKVI+NLHCFVP ICKEQERNFFLHKFLE
Sbjct: 241 LDIVSSKTCESNFNPTNNAQASYAHQRTSLLVKVIANLHCFVPNICKEQERNFFLHKFLE 300
Query: 490 RLKMDWQDPKDGFAFSSAPQKAATICKNLRSLLGHAESLIPTSLNEEDVQLLRLFFTQLQ 549
RLK+D QD + GF+FSSAPQKA TICKNLRSLLGHAESLIPT LNEEDVQLLRLFFTQLQ
Sbjct: 301 RLKVDRQDQQAGFSFSSAPQKAVTICKNLRSLLGHAESLIPTFLNEEDVQLLRLFFTQLQ 360
Query: 550 SLIGPVDYEVNRALELQIARGCTSSLQGNVDFDHKNKHSNLREGMSENSAFPGVDNSCTK 609
SLIGPVDYEVN+AL + SL GNV+ D+K HSNLREG SEN AF GVDNSC K
Sbjct: 361 SLIGPVDYEVNKALVNEYP-----SLPGNVELDNKIGHSNLREGTSENLAFSGVDNSCVK 420
Query: 610 VEVTAEADGAMHNQKTCEGKSVKYSAEILVETEKEFHHEETSGSDSSTTRGKNPTDQVGN 669
VE+ EADGA+H++KTCEGKS+K E LVET+KE H+ ETSGSDSS+TRGKNPTDQVGN
Sbjct: 421 VEIIGEADGALHDEKTCEGKSIKALGESLVETDKEVHNAETSGSDSSSTRGKNPTDQVGN 480
Query: 670 GENTKPGEHFHRSIVGVVQQDGRVESVNHEEKQVRKRKRHIMNDKQISLIERGLQEEPDM 729
G+NTK EH HRS+VG Q+D R++++N EEK+VRKRKR+IMNDKQIS+IER LQEEPDM
Sbjct: 481 GDNTKSSEHIHRSVVGGFQEDERIDNLNCEEKRVRKRKRNIMNDKQISMIERALQEEPDM 540
Query: 730 HKNAASLQSWSDRLSLHGSEVTSSQLKNWLNNRKAKLARAAKDGGHPLSEENAIADAQAG 789
HKNAASLQSW+DRLS HG+EVT SQLKNWLNNRKAKLARAAKD G PLSEEN AD G
Sbjct: 541 HKNAASLQSWADRLSFHGAEVTFSQLKNWLNNRKAKLARAAKD-GRPLSEENVTADKLVG 600
Query: 790 SLMHTTSGSPESQNEDLPPSSAPKDHNQVSAVRRTSSGITMSPVLEIPRPESSSRLNMQI 849
SL+ T S SPES EDLP SSA KD QVSAVR+TS G MS LE PESS RLN++
Sbjct: 601 SLVRTASDSPESHIEDLPTSSASKDRIQVSAVRKTSLGTIMSQSLETTPPESSIRLNLRN 660
Query: 850 EGPAISSSCVNFEAGQAVLLTDTQGKEIAKGTVFQVEGEWHDCNLSDTRTCVVDITELRS 909
E P ++ CV FEAGQAV+LTDTQGKEIAKGTV+QVEGEWH CNL+DTRTCVVDITELRS
Sbjct: 661 ECPTSNTCCVIFEAGQAVVLTDTQGKEIAKGTVYQVEGEWHGCNLADTRTCVVDITELRS 720
Query: 910 ERILRLPHPSIEAGATFEQAEVKIGTMRVLWDSGRMLKP 946
ERI+RLPHP+IEAGATFEQAEVKIGTMRVLWDSGRMLKP
Sbjct: 721 ERIVRLPHPTIEAGATFEQAEVKIGTMRVLWDSGRMLKP 753
BLAST of Spo03238.1 vs. NCBI nr
Match:
gi|731395171|ref|XP_010652083.1| (PREDICTED: uncharacterized protein LOC100259581 [Vitis vinifera])
HSP 1 Score: 902.9 bits (2332), Expect = 4.700e-259
Identity = 531/966 (54.97%), Positives = 663/966 (68.63%), Query Frame = 1
Query: 1 MRNLMAEPSPSTARAIDLITEVKELQRFNSQELSKLLKDSENFSLFYITAK--SLQIDME 60
MR+ E S T + IDL++ VK L NSQEL+KLL+DSENF++ Y T K SLQID E
Sbjct: 1 MRHNKEEQSYCTEQVIDLVSAVKGLHTLNSQELNKLLRDSENFTIQYTTEKGPSLQIDAE 60
Query: 61 KLVRLLPLHLTAVLLSSQRDEASLRYLLSGLRLLYSLCEISSRHPKLEQILLDDLKVSEQ 120
KL LPLHL AVL+SS +DEA +YLL GLRLL+SLC+++ R KLEQILLDD+KVSEQ
Sbjct: 61 KLAGFLPLHLIAVLISSDKDEALFKYLLCGLRLLHSLCDLAPRQNKLEQILLDDVKVSEQ 120
Query: 121 LLDLVFYLLILC--CKQGYSFSGAMPLLYSALVACSMYLLTACISSQWSELTYVLQAHPK 180
LLDLVF LLI+ ++ + S PLL+SALVACS+YLLT IS+QW +L +VL AHPK
Sbjct: 121 LLDLVFALLIVLGSSREEHQLSSHAPLLHSALVACSLYLLTGFISTQWQDLGHVLTAHPK 180
Query: 181 VDIFLDAAFGAVRMNINFLQIKLSAEENEFTNSSLTAEKAVHFCFQQCEASLQFLHSLCQ 240
VDIF++AAF AV ++I LQIKLSA+ +F + AE+ V+ QQCEASLQFL SLCQ
Sbjct: 181 VDIFMEAAFRAVHLSIRSLQIKLSAQCVDFPSP---AEQVVNSLCQQCEASLQFLQSLCQ 240
Query: 241 NKSFRERLLKNKELCGKGGVLHLVQATLKLNISPYLKEPLEMIAAISRLKARVLSILLHL 300
K FRERLLKNKELCGKGGVL L QA LKL I+P KE ++AA+SRLKA+VLSI+L L
Sbjct: 241 QKMFRERLLKNKELCGKGGVLLLAQAILKLCITPLFKESSTIVAAVSRLKAKVLSIVLCL 300
Query: 301 CESESVSYLDEVASIPESLDLAKSVAVEVITLLKTMVSRDLKLLANHSGKTYPSGLLQLN 360
CE+ES+SYLDEVAS P SLDLAKS+A+EV+ LLKT D K L+ S KT+P+GLLQLN
Sbjct: 301 CEAESISYLDEVASYPGSLDLAKSIALEVLELLKTAFGGDQKYLSGGSEKTHPTGLLQLN 360
Query: 361 ALRLTDIFSDDSNFRSYITIYFAEVLAAILSTPYQQFLASWCSSELPLKEEDASLEYEPF 420
A+RL DIFSDDSNFRS+IT+YF EVLAAI S P+ +FL+SWCSS+LP++EEDASLEY+PF
Sbjct: 361 AMRLADIFSDDSNFRSFITVYFTEVLAAIFSLPHGEFLSSWCSSDLPVREEDASLEYDPF 420
Query: 421 TMAGLILDSTSSLDTIR--SFESNFIPTNNAQSSYAHQRTSLLVKVISNLHCFVPKICKE 480
AG +LDS SS D + S ES FI N +Q+ YAHQRTSLLVKVI+NLHCFVP IC+E
Sbjct: 421 VAAGWVLDSFSSPDLLNLMSSESTFIQNNMSQAPYAHQRTSLLVKVIANLHCFVPNICEE 480
Query: 481 QERNFFLHKFLERLKMDWQDPKDGFAFSSAPQKAATICKNLRSLLGHAESLIPTSLNEED 540
QE++ FLHK LE L+M+ + F+FSS QKAAT+CKNLRSLLGHAESLIP LNEED
Sbjct: 481 QEKDLFLHKCLECLQME----RPRFSFSSDAQKAATVCKNLRSLLGHAESLIPLFLNEED 540
Query: 541 VQLLRLFFTQLQSLIGPVDYEVNRA------------------LELQIARGCTSSLQGNV 600
VQLLR+FF ++QSLI P + E ++ E Q GC+S L
Sbjct: 541 VQLLRVFFKEIQSLITPTELEESKLEGSMSWDKFSRLDIGEHHQEAQSTGGCSSPLLRKA 600
Query: 601 DFDHKNKHSNLREGMSENSAFPGVDNSCTKVEVTAEADGAMHNQKTCEGKSVKYSAEILV 660
D N+ +NL+EG SENS VD + +AD M + K L
Sbjct: 601 APDVTNRSANLKEGTSENSTLQEVDQFFGR--NMDQADDVMRQDRR---KDKNKLGRALR 660
Query: 661 ETEKEFHHEETSGSDSSTTRGKNPTDQVGNGENTKPGEHFHRSIVGVVQQDGRVESVNHE 720
+ EK+ + ETSGSDSS+TRGKN TDQ+ N E K EH S G VQ+D +VE + E
Sbjct: 661 DGEKDVQNVETSGSDSSSTRGKNSTDQIDNSEFPKSNEHIKASGSGGVQEDEKVEIIPSE 720
Query: 721 EKQVRKRKRHIMNDKQISLIERGLQEEPDMHKNAASLQSWSDRLSLHGSEVTSSQLKNWL 780
EKQ RKRKR IMND Q++LIE+ L +EPDM +NAA +QSW+D+LS HG E+T+SQLKNWL
Sbjct: 721 EKQRRKRKRTIMNDTQMTLIEKALVDEPDMQRNAALIQSWADKLSFHGPELTASQLKNWL 780
Query: 781 NNRKAKLARAAKDGGHPLSEENAIADAQAGSLMHTTSGSPESQNEDLPPSSAPKDHNQVS 840
NNRKA+LARAAKD ++ D Q GS + + SPES ED S + S
Sbjct: 781 NNRKARLARAAKDVRVASEVDSTFPDKQVGSGVGSLHDSPESPGEDFFAPSTARGGTHQS 840
Query: 841 AVRRTSSGITMSPVLEIPRPESSSRLNMQIEGPAISSSCVNFEAGQAVLLTDTQGKEIAK 900
A+ + S E+++ + I + V E GQ V+L D QG +I K
Sbjct: 841 AIGGSVSRAGAD------NAEAATAEFVDIN----PAEFVRREPGQYVVLLDGQGDDIGK 900
Query: 901 GTVFQVEGEWHDCNLSDTRTCVVDITELRSERILRLPHPSIEAGATFEQAEVKIGTMRVL 943
G V QV+G+W+ NL +++TCVVD+ EL++ER RLPHPS G +F++AE K+G MRV
Sbjct: 901 GKVHQVQGKWYGKNLEESQTCVVDVMELKAERWSRLPHPSETTGTSFDEAETKLGVMRVS 944
BLAST of Spo03238.1 vs. UniProtKB/TrEMBL
Match:
A0A0K9RUI3_SPIOL (Uncharacterized protein OS=Spinacia oleracea GN=SOVF_033670 PE=4 SV=1)
HSP 1 Score: 1820.4 bits (4714), Expect = 0.000e+0
Identity = 945/945 (100.00%), Positives = 945/945 (100.00%), Query Frame = 1
Query: 1 MRNLMAEPSPSTARAIDLITEVKELQRFNSQELSKLLKDSENFSLFYITAKSLQIDMEKL 60
MRNLMAEPSPSTARAIDLITEVKELQRFNSQELSKLLKDSENFSLFYITAKSLQIDMEKL
Sbjct: 1 MRNLMAEPSPSTARAIDLITEVKELQRFNSQELSKLLKDSENFSLFYITAKSLQIDMEKL 60
Query: 61 VRLLPLHLTAVLLSSQRDEASLRYLLSGLRLLYSLCEISSRHPKLEQILLDDLKVSEQLL 120
VRLLPLHLTAVLLSSQRDEASLRYLLSGLRLLYSLCEISSRHPKLEQILLDDLKVSEQLL
Sbjct: 61 VRLLPLHLTAVLLSSQRDEASLRYLLSGLRLLYSLCEISSRHPKLEQILLDDLKVSEQLL 120
Query: 121 DLVFYLLILCCKQGYSFSGAMPLLYSALVACSMYLLTACISSQWSELTYVLQAHPKVDIF 180
DLVFYLLILCCKQGYSFSGAMPLLYSALVACSMYLLTACISSQWSELTYVLQAHPKVDIF
Sbjct: 121 DLVFYLLILCCKQGYSFSGAMPLLYSALVACSMYLLTACISSQWSELTYVLQAHPKVDIF 180
Query: 181 LDAAFGAVRMNINFLQIKLSAEENEFTNSSLTAEKAVHFCFQQCEASLQFLHSLCQNKSF 240
LDAAFGAVRMNINFLQIKLSAEENEFTNSSLTAEKAVHFCFQQCEASLQFLHSLCQNKSF
Sbjct: 181 LDAAFGAVRMNINFLQIKLSAEENEFTNSSLTAEKAVHFCFQQCEASLQFLHSLCQNKSF 240
Query: 241 RERLLKNKELCGKGGVLHLVQATLKLNISPYLKEPLEMIAAISRLKARVLSILLHLCESE 300
RERLLKNKELCGKGGVLHLVQATLKLNISPYLKEPLEMIAAISRLKARVLSILLHLCESE
Sbjct: 241 RERLLKNKELCGKGGVLHLVQATLKLNISPYLKEPLEMIAAISRLKARVLSILLHLCESE 300
Query: 301 SVSYLDEVASIPESLDLAKSVAVEVITLLKTMVSRDLKLLANHSGKTYPSGLLQLNALRL 360
SVSYLDEVASIPESLDLAKSVAVEVITLLKTMVSRDLKLLANHSGKTYPSGLLQLNALRL
Sbjct: 301 SVSYLDEVASIPESLDLAKSVAVEVITLLKTMVSRDLKLLANHSGKTYPSGLLQLNALRL 360
Query: 361 TDIFSDDSNFRSYITIYFAEVLAAILSTPYQQFLASWCSSELPLKEEDASLEYEPFTMAG 420
TDIFSDDSNFRSYITIYFAEVLAAILSTPYQQFLASWCSSELPLKEEDASLEYEPFTMAG
Sbjct: 361 TDIFSDDSNFRSYITIYFAEVLAAILSTPYQQFLASWCSSELPLKEEDASLEYEPFTMAG 420
Query: 421 LILDSTSSLDTIRSFESNFIPTNNAQSSYAHQRTSLLVKVISNLHCFVPKICKEQERNFF 480
LILDSTSSLDTIRSFESNFIPTNNAQSSYAHQRTSLLVKVISNLHCFVPKICKEQERNFF
Sbjct: 421 LILDSTSSLDTIRSFESNFIPTNNAQSSYAHQRTSLLVKVISNLHCFVPKICKEQERNFF 480
Query: 481 LHKFLERLKMDWQDPKDGFAFSSAPQKAATICKNLRSLLGHAESLIPTSLNEEDVQLLRL 540
LHKFLERLKMDWQDPKDGFAFSSAPQKAATICKNLRSLLGHAESLIPTSLNEEDVQLLRL
Sbjct: 481 LHKFLERLKMDWQDPKDGFAFSSAPQKAATICKNLRSLLGHAESLIPTSLNEEDVQLLRL 540
Query: 541 FFTQLQSLIGPVDYEVNRALELQIARGCTSSLQGNVDFDHKNKHSNLREGMSENSAFPGV 600
FFTQLQSLIGPVDYEVNRALELQIARGCTSSLQGNVDFDHKNKHSNLREGMSENSAFPGV
Sbjct: 541 FFTQLQSLIGPVDYEVNRALELQIARGCTSSLQGNVDFDHKNKHSNLREGMSENSAFPGV 600
Query: 601 DNSCTKVEVTAEADGAMHNQKTCEGKSVKYSAEILVETEKEFHHEETSGSDSSTTRGKNP 660
DNSCTKVEVTAEADGAMHNQKTCEGKSVKYSAEILVETEKEFHHEETSGSDSSTTRGKNP
Sbjct: 601 DNSCTKVEVTAEADGAMHNQKTCEGKSVKYSAEILVETEKEFHHEETSGSDSSTTRGKNP 660
Query: 661 TDQVGNGENTKPGEHFHRSIVGVVQQDGRVESVNHEEKQVRKRKRHIMNDKQISLIERGL 720
TDQVGNGENTKPGEHFHRSIVGVVQQDGRVESVNHEEKQVRKRKRHIMNDKQISLIERGL
Sbjct: 661 TDQVGNGENTKPGEHFHRSIVGVVQQDGRVESVNHEEKQVRKRKRHIMNDKQISLIERGL 720
Query: 721 QEEPDMHKNAASLQSWSDRLSLHGSEVTSSQLKNWLNNRKAKLARAAKDGGHPLSEENAI 780
QEEPDMHKNAASLQSWSDRLSLHGSEVTSSQLKNWLNNRKAKLARAAKDGGHPLSEENAI
Sbjct: 721 QEEPDMHKNAASLQSWSDRLSLHGSEVTSSQLKNWLNNRKAKLARAAKDGGHPLSEENAI 780
Query: 781 ADAQAGSLMHTTSGSPESQNEDLPPSSAPKDHNQVSAVRRTSSGITMSPVLEIPRPESSS 840
ADAQAGSLMHTTSGSPESQNEDLPPSSAPKDHNQVSAVRRTSSGITMSPVLEIPRPESSS
Sbjct: 781 ADAQAGSLMHTTSGSPESQNEDLPPSSAPKDHNQVSAVRRTSSGITMSPVLEIPRPESSS 840
Query: 841 RLNMQIEGPAISSSCVNFEAGQAVLLTDTQGKEIAKGTVFQVEGEWHDCNLSDTRTCVVD 900
RLNMQIEGPAISSSCVNFEAGQAVLLTDTQGKEIAKGTVFQVEGEWHDCNLSDTRTCVVD
Sbjct: 841 RLNMQIEGPAISSSCVNFEAGQAVLLTDTQGKEIAKGTVFQVEGEWHDCNLSDTRTCVVD 900
Query: 901 ITELRSERILRLPHPSIEAGATFEQAEVKIGTMRVLWDSGRMLKP 946
ITELRSERILRLPHPSIEAGATFEQAEVKIGTMRVLWDSGRMLKP
Sbjct: 901 ITELRSERILRLPHPSIEAGATFEQAEVKIGTMRVLWDSGRMLKP 945
BLAST of Spo03238.1 vs. UniProtKB/TrEMBL
Match:
A0A0J8B6I9_BETVU (Uncharacterized protein OS=Beta vulgaris subsp. vulgaris GN=BVRB_7g180730 PE=4 SV=1)
HSP 1 Score: 1434.9 bits (3713), Expect = 0.000e+0
Identity = 763/950 (80.32%), Positives = 835/950 (87.89%), Query Frame = 1
Query: 1 MRNLMAEPSPSTARAIDLITEVKELQRFNSQELSKLLKDSENFSLFYITAK--SLQIDME 60
MRN + E S S+ AIDLI+EVKELQRFNSQELSKLLKD ENFSLF+ITA S+QIDM+
Sbjct: 2 MRNSITEASHSSRGAIDLISEVKELQRFNSQELSKLLKDCENFSLFHITANGLSVQIDMD 61
Query: 61 KLVRLLPLHLTAVLLSSQRDEASLRYLLSGLRLLYSLCEISSRHPKLEQILLDDLKVSEQ 120
KLVRLLPLHL AVLLSSQRDEASLRYLLSGLRLLY+LCEI+SRH KLEQIL DDLKVSEQ
Sbjct: 62 KLVRLLPLHLIAVLLSSQRDEASLRYLLSGLRLLYTLCEIASRHSKLEQILFDDLKVSEQ 121
Query: 121 LLDLVFYLLILCCKQGYSFSGAMPLLYSALVACSMYLLTACISSQWSELTYVLQAHPKVD 180
LLDLVF+LL++CCKQG SFSG MPLLYSALVACSMYLLT CISSQWSEL YVLQAHPKVD
Sbjct: 122 LLDLVFHLLLVCCKQGNSFSGGMPLLYSALVACSMYLLTTCISSQWSELAYVLQAHPKVD 181
Query: 181 IFLDAAFGAVRMNINFLQIKLSAEENEF-TNSSLTAEKAVHFCFQQCEASLQFLHSLCQN 240
IFLDAAFGAVRM+IN LQ KLS EN+F + SSLTAE+ V+FC QQCEASLQFL SLCQN
Sbjct: 182 IFLDAAFGAVRMSINILQNKLSEAENDFHSKSSLTAERVVNFCCQQCEASLQFLQSLCQN 241
Query: 241 KSFRERLLKNKELCGKGGVLHLVQATLKLNISPYLKEPLEMIAAISRLKARVLSILLHLC 300
KSFRER+L+NKELCGKGGVLHLVQ+TLKLNISP+LKEPL ++AA+SRLKARVLSILLHLC
Sbjct: 242 KSFRERILRNKELCGKGGVLHLVQSTLKLNISPFLKEPLAVVAAVSRLKARVLSILLHLC 301
Query: 301 ESESVSYLDEVASIPESLDLAKSVAVEVITLLKTMVSRDLKLLANHSGKTYPSGLLQLNA 360
E+ESVSYLDEVASIPESLDLAKSVAVEVITLLKTM+SRDLKLL NHSGK YP GLLQLNA
Sbjct: 302 EAESVSYLDEVASIPESLDLAKSVAVEVITLLKTMLSRDLKLLENHSGKIYPRGLLQLNA 361
Query: 361 LRLTDIFSDDSNFRSYITIYFAEVLAAILSTPYQQFLASWCSSELPLKEEDASLEYEPFT 420
LRL DIFSDDSNFRSYITIYFAEVLAAIL PYQQFL+SWCSS+LPLKE DASLEY+PFT
Sbjct: 362 LRLADIFSDDSNFRSYITIYFAEVLAAILLIPYQQFLSSWCSSDLPLKEVDASLEYDPFT 421
Query: 421 MAGLILDSTSSLDTI--RSFESNFIPTNNAQSSYAHQRTSLLVKVISNLHCFVPKICKEQ 480
MAG ILDS+ LD + ++ ESNF PTNNAQ+SYAHQRTSLLVKVI+NLHCFVP ICKEQ
Sbjct: 422 MAGWILDSSPMLDIVSSKTCESNFNPTNNAQASYAHQRTSLLVKVIANLHCFVPNICKEQ 481
Query: 481 ERNFFLHKFLERLKMDWQDPKDGFAFSSAPQKAATICKNLRSLLGHAESLIPTSLNEEDV 540
ERNFFLHKFLERLK+D QD + GF+FSSAPQKA TICKNLRSLLGHAESLIPT LNEEDV
Sbjct: 482 ERNFFLHKFLERLKVDRQDQQAGFSFSSAPQKAVTICKNLRSLLGHAESLIPTFLNEEDV 541
Query: 541 QLLRLFFTQLQSLIGPVDYEVNRALELQIARGCTSSLQGNVDFDHKNKHSNLREGMSENS 600
QLLRLFFTQLQSLIGPVDYEVN+AL + SL GNV+ D+K HSNLREG SEN
Sbjct: 542 QLLRLFFTQLQSLIGPVDYEVNKALVNEY-----PSLPGNVELDNKIGHSNLREGTSENL 601
Query: 601 AFPGVDNSCTKVEVTAEADGAMHNQKTCEGKSVKYSAEILVETEKEFHHEETSGSDSSTT 660
AF GVDNSC KVE+ EADGA+H++KTCEGKS+K E LVET+KE H+ ETSGSDSS+T
Sbjct: 602 AFSGVDNSCVKVEIIGEADGALHDEKTCEGKSIKALGESLVETDKEVHNAETSGSDSSST 661
Query: 661 RGKNPTDQVGNGENTKPGEHFHRSIVGVVQQDGRVESVNHEEKQVRKRKRHIMNDKQISL 720
RGKNPTDQVGNG+NTK EH HRS+VG Q+D R++++N EEK+VRKRKR+IMNDKQIS+
Sbjct: 662 RGKNPTDQVGNGDNTKSSEHIHRSVVGGFQEDERIDNLNCEEKRVRKRKRNIMNDKQISM 721
Query: 721 IERGLQEEPDMHKNAASLQSWSDRLSLHGSEVTSSQLKNWLNNRKAKLARAAKDGGHPLS 780
IER LQEEPDMHKNAASLQSW+DRLS HG+EVT SQLKNWLNNRKAKLARAAKD G PLS
Sbjct: 722 IERALQEEPDMHKNAASLQSWADRLSFHGAEVTFSQLKNWLNNRKAKLARAAKD-GRPLS 781
Query: 781 EENAIADAQAGSLMHTTSGSPESQNEDLPPSSAPKDHNQVSAVRRTSSGITMSPVLEIPR 840
EEN AD GSL+ T S SPES EDLP SSA KD QVSAVR+TS G MS LE
Sbjct: 782 EENVTADKLVGSLVRTASDSPESHIEDLPTSSASKDRIQVSAVRKTSLGTIMSQSLETTP 841
Query: 841 PESSSRLNMQIEGPAISSSCVNFEAGQAVLLTDTQGKEIAKGTVFQVEGEWHDCNLSDTR 900
PESS RLN++ E P ++ CV FEAGQAV+LTDTQGKEIAKGTV+QVEGEWH CNL+DTR
Sbjct: 842 PESSIRLNLRNECPTSNTCCVIFEAGQAVVLTDTQGKEIAKGTVYQVEGEWHGCNLADTR 901
Query: 901 TCVVDITELRSERILRLPHPSIEAGATFEQAEVKIGTMRVLWDSGRMLKP 946
TCVVDITELRSERI+RLPHP+IEAGATFEQAEVKIGTMRVLWDSGRMLKP
Sbjct: 902 TCVVDITELRSERIVRLPHPTIEAGATFEQAEVKIGTMRVLWDSGRMLKP 945
BLAST of Spo03238.1 vs. UniProtKB/TrEMBL
Match:
A0A0J8BA86_BETVU (Uncharacterized protein OS=Beta vulgaris subsp. vulgaris GN=BVRB_7g180730 PE=4 SV=1)
HSP 1 Score: 1229.2 bits (3179), Expect = 0.000e+0
Identity = 647/808 (80.07%), Positives = 708/808 (87.62%), Query Frame = 1
Query: 141 MPLLYSALVACSMYLLTACISSQWSELTYVLQAHPKVDIFLDAAFGAVRMNINFLQIKLS 200
MPLLYSALVACSMYLLT CISSQWSEL YVLQAHPKVDIFLDAAFGAVRM+IN LQ KLS
Sbjct: 1 MPLLYSALVACSMYLLTTCISSQWSELAYVLQAHPKVDIFLDAAFGAVRMSINILQNKLS 60
Query: 201 AEENEF-TNSSLTAEKAVHFCFQQCEASLQFLHSLCQNKSFRERLLKNKELCGKGGVLHL 260
EN+F + SSLTAE+ V+FC QQCEASLQFL SLCQNKSFRER+L+NKELCGKGGVLHL
Sbjct: 61 EAENDFHSKSSLTAERVVNFCCQQCEASLQFLQSLCQNKSFRERILRNKELCGKGGVLHL 120
Query: 261 VQATLKLNISPYLKEPLEMIAAISRLKARVLSILLHLCESESVSYLDEVASIPESLDLAK 320
VQ+TLKLNISP+LKEPL ++AA+SRLKARVLSILLHLCE+ESVSYLDEVASIPESLDLAK
Sbjct: 121 VQSTLKLNISPFLKEPLAVVAAVSRLKARVLSILLHLCEAESVSYLDEVASIPESLDLAK 180
Query: 321 SVAVEVITLLKTMVSRDLKLLANHSGKTYPSGLLQLNALRLTDIFSDDSNFRSYITIYFA 380
SVAVEVITLLKTM+SRDLKLL NHSGK YP GLLQLNALRL DIFSDDSNFRSYITIYFA
Sbjct: 181 SVAVEVITLLKTMLSRDLKLLENHSGKIYPRGLLQLNALRLADIFSDDSNFRSYITIYFA 240
Query: 381 EVLAAILSTPYQQFLASWCSSELPLKEEDASLEYEPFTMAGLILDSTSSLDTI--RSFES 440
EVLAAIL PYQQFL+SWCSS+LPLKE DASLEY+PFTMAG ILDS+ LD + ++ ES
Sbjct: 241 EVLAAILLIPYQQFLSSWCSSDLPLKEVDASLEYDPFTMAGWILDSSPMLDIVSSKTCES 300
Query: 441 NFIPTNNAQSSYAHQRTSLLVKVISNLHCFVPKICKEQERNFFLHKFLERLKMDWQDPKD 500
NF PTNNAQ+SYAHQRTSLLVKVI+NLHCFVP ICKEQERNFFLHKFLERLK+D QD +
Sbjct: 301 NFNPTNNAQASYAHQRTSLLVKVIANLHCFVPNICKEQERNFFLHKFLERLKVDRQDQQA 360
Query: 501 GFAFSSAPQKAATICKNLRSLLGHAESLIPTSLNEEDVQLLRLFFTQLQSLIGPVDYEVN 560
GF+FSSAPQKA TICKNLRSLLGHAESLIPT LNEEDVQLLRLFFTQLQSLIGPVDYEVN
Sbjct: 361 GFSFSSAPQKAVTICKNLRSLLGHAESLIPTFLNEEDVQLLRLFFTQLQSLIGPVDYEVN 420
Query: 561 RALELQIARGCTSSLQGNVDFDHKNKHSNLREGMSENSAFPGVDNSCTKVEVTAEADGAM 620
+AL + SL GNV+ D+K HSNLREG SEN AF GVDNSC KVE+ EADGA+
Sbjct: 421 KALVNEY-----PSLPGNVELDNKIGHSNLREGTSENLAFSGVDNSCVKVEIIGEADGAL 480
Query: 621 HNQKTCEGKSVKYSAEILVETEKEFHHEETSGSDSSTTRGKNPTDQVGNGENTKPGEHFH 680
H++KTCEGKS+K E LVET+KE H+ ETSGSDSS+TRGKNPTDQVGNG+NTK EH H
Sbjct: 481 HDEKTCEGKSIKALGESLVETDKEVHNAETSGSDSSSTRGKNPTDQVGNGDNTKSSEHIH 540
Query: 681 RSIVGVVQQDGRVESVNHEEKQVRKRKRHIMNDKQISLIERGLQEEPDMHKNAASLQSWS 740
RS+VG Q+D R++++N EEK+VRKRKR+IMNDKQIS+IER LQEEPDMHKNAASLQSW+
Sbjct: 541 RSVVGGFQEDERIDNLNCEEKRVRKRKRNIMNDKQISMIERALQEEPDMHKNAASLQSWA 600
Query: 741 DRLSLHGSEVTSSQLKNWLNNRKAKLARAAKDGGHPLSEENAIADAQAGSLMHTTSGSPE 800
DRLS HG+EVT SQLKNWLNNRKAKLARAAKD G PLSEEN AD GSL+ T S SPE
Sbjct: 601 DRLSFHGAEVTFSQLKNWLNNRKAKLARAAKD-GRPLSEENVTADKLVGSLVRTASDSPE 660
Query: 801 SQNEDLPPSSAPKDHNQVSAVRRTSSGITMSPVLEIPRPESSSRLNMQIEGPAISSSCVN 860
S EDLP SSA KD QVSAVR+TS G MS LE PESS RLN++ E P ++ CV
Sbjct: 661 SHIEDLPTSSASKDRIQVSAVRKTSLGTIMSQSLETTPPESSIRLNLRNECPTSNTCCVI 720
Query: 861 FEAGQAVLLTDTQGKEIAKGTVFQVEGEWHDCNLSDTRTCVVDITELRSERILRLPHPSI 920
FEAGQAV+LTDTQGKEIAKGTV+QVEGEWH CNL+DTRTCVVDITELRSERI+RLPHP+I
Sbjct: 721 FEAGQAVVLTDTQGKEIAKGTVYQVEGEWHGCNLADTRTCVVDITELRSERIVRLPHPTI 780
Query: 921 EAGATFEQAEVKIGTMRVLWDSGRMLKP 946
EAGATFEQAEVKIGTMRVLWDSGRMLKP
Sbjct: 781 EAGATFEQAEVKIGTMRVLWDSGRMLKP 802
BLAST of Spo03238.1 vs. UniProtKB/TrEMBL
Match:
A0A0J8BAD3_BETVU (Uncharacterized protein OS=Beta vulgaris subsp. vulgaris GN=BVRB_7g180730 PE=4 SV=1)
HSP 1 Score: 1138.3 bits (2943), Expect = 0.000e+0
Identity = 601/759 (79.18%), Positives = 660/759 (86.96%), Query Frame = 1
Query: 190 MNINFLQIKLSAEENEF-TNSSLTAEKAVHFCFQQCEASLQFLHSLCQNKSFRERLLKNK 249
M+IN LQ KLS EN+F + SSLTAE+ V+FC QQCEASLQFL SLCQNKSFRER+L+NK
Sbjct: 1 MSINILQNKLSEAENDFHSKSSLTAERVVNFCCQQCEASLQFLQSLCQNKSFRERILRNK 60
Query: 250 ELCGKGGVLHLVQATLKLNISPYLKEPLEMIAAISRLKARVLSILLHLCESESVSYLDEV 309
ELCGKGGVLHLVQ+TLKLNISP+LKEPL ++AA+SRLKARVLSILLHLCE+ESVSYLDEV
Sbjct: 61 ELCGKGGVLHLVQSTLKLNISPFLKEPLAVVAAVSRLKARVLSILLHLCEAESVSYLDEV 120
Query: 310 ASIPESLDLAKSVAVEVITLLKTMVSRDLKLLANHSGKTYPSGLLQLNALRLTDIFSDDS 369
ASIPESLDLAKSVAVEVITLLKTM+SRDLKLL NHSGK YP GLLQLNALRL DIFSDDS
Sbjct: 121 ASIPESLDLAKSVAVEVITLLKTMLSRDLKLLENHSGKIYPRGLLQLNALRLADIFSDDS 180
Query: 370 NFRSYITIYFAEVLAAILSTPYQQFLASWCSSELPLKEEDASLEYEPFTMAGLILDSTSS 429
NFRSYITIYFAEVLAAIL PYQQFL+SWCSS+LPLKE DASLEY+PFTMAG ILDS+
Sbjct: 181 NFRSYITIYFAEVLAAILLIPYQQFLSSWCSSDLPLKEVDASLEYDPFTMAGWILDSSPM 240
Query: 430 LDTIRS--FESNFIPTNNAQSSYAHQRTSLLVKVISNLHCFVPKICKEQERNFFLHKFLE 489
LD + S ESNF PTNNAQ+SYAHQRTSLLVKVI+NLHCFVP ICKEQERNFFLHKFLE
Sbjct: 241 LDIVSSKTCESNFNPTNNAQASYAHQRTSLLVKVIANLHCFVPNICKEQERNFFLHKFLE 300
Query: 490 RLKMDWQDPKDGFAFSSAPQKAATICKNLRSLLGHAESLIPTSLNEEDVQLLRLFFTQLQ 549
RLK+D QD + GF+FSSAPQKA TICKNLRSLLGHAESLIPT LNEEDVQLLRLFFTQLQ
Sbjct: 301 RLKVDRQDQQAGFSFSSAPQKAVTICKNLRSLLGHAESLIPTFLNEEDVQLLRLFFTQLQ 360
Query: 550 SLIGPVDYEVNRALELQIARGCTSSLQGNVDFDHKNKHSNLREGMSENSAFPGVDNSCTK 609
SLIGPVDYEVN+AL + SL GNV+ D+K HSNLREG SEN AF GVDNSC K
Sbjct: 361 SLIGPVDYEVNKALVNEYP-----SLPGNVELDNKIGHSNLREGTSENLAFSGVDNSCVK 420
Query: 610 VEVTAEADGAMHNQKTCEGKSVKYSAEILVETEKEFHHEETSGSDSSTTRGKNPTDQVGN 669
VE+ EADGA+H++KTCEGKS+K E LVET+KE H+ ETSGSDSS+TRGKNPTDQVGN
Sbjct: 421 VEIIGEADGALHDEKTCEGKSIKALGESLVETDKEVHNAETSGSDSSSTRGKNPTDQVGN 480
Query: 670 GENTKPGEHFHRSIVGVVQQDGRVESVNHEEKQVRKRKRHIMNDKQISLIERGLQEEPDM 729
G+NTK EH HRS+VG Q+D R++++N EEK+VRKRKR+IMNDKQIS+IER LQEEPDM
Sbjct: 481 GDNTKSSEHIHRSVVGGFQEDERIDNLNCEEKRVRKRKRNIMNDKQISMIERALQEEPDM 540
Query: 730 HKNAASLQSWSDRLSLHGSEVTSSQLKNWLNNRKAKLARAAKDGGHPLSEENAIADAQAG 789
HKNAASLQSW+DRLS HG+EVT SQLKNWLNNRKAKLARAAKD G PLSEEN AD G
Sbjct: 541 HKNAASLQSWADRLSFHGAEVTFSQLKNWLNNRKAKLARAAKD-GRPLSEENVTADKLVG 600
Query: 790 SLMHTTSGSPESQNEDLPPSSAPKDHNQVSAVRRTSSGITMSPVLEIPRPESSSRLNMQI 849
SL+ T S SPES EDLP SSA KD QVSAVR+TS G MS LE PESS RLN++
Sbjct: 601 SLVRTASDSPESHIEDLPTSSASKDRIQVSAVRKTSLGTIMSQSLETTPPESSIRLNLRN 660
Query: 850 EGPAISSSCVNFEAGQAVLLTDTQGKEIAKGTVFQVEGEWHDCNLSDTRTCVVDITELRS 909
E P ++ CV FEAGQAV+LTDTQGKEIAKGTV+QVEGEWH CNL+DTRTCVVDITELRS
Sbjct: 661 ECPTSNTCCVIFEAGQAVVLTDTQGKEIAKGTVYQVEGEWHGCNLADTRTCVVDITELRS 720
Query: 910 ERILRLPHPSIEAGATFEQAEVKIGTMRVLWDSGRMLKP 946
ERI+RLPHP+IEAGATFEQAEVKIGTMRVLWDSGRMLKP
Sbjct: 721 ERIVRLPHPTIEAGATFEQAEVKIGTMRVLWDSGRMLKP 753
BLAST of Spo03238.1 vs. UniProtKB/TrEMBL
Match:
F6HQ32_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_07s0104g01050 PE=4 SV=1)
HSP 1 Score: 902.9 bits (2332), Expect = 3.300e-259
Identity = 531/966 (54.97%), Positives = 663/966 (68.63%), Query Frame = 1
Query: 1 MRNLMAEPSPSTARAIDLITEVKELQRFNSQELSKLLKDSENFSLFYITAK--SLQIDME 60
MR+ E S T + IDL++ VK L NSQEL+KLL+DSENF++ Y T K SLQID E
Sbjct: 1 MRHNKEEQSYCTEQVIDLVSAVKGLHTLNSQELNKLLRDSENFTIQYTTEKGPSLQIDAE 60
Query: 61 KLVRLLPLHLTAVLLSSQRDEASLRYLLSGLRLLYSLCEISSRHPKLEQILLDDLKVSEQ 120
KL LPLHL AVL+SS +DEA +YLL GLRLL+SLC+++ R KLEQILLDD+KVSEQ
Sbjct: 61 KLAGFLPLHLIAVLISSDKDEALFKYLLCGLRLLHSLCDLAPRQNKLEQILLDDVKVSEQ 120
Query: 121 LLDLVFYLLILC--CKQGYSFSGAMPLLYSALVACSMYLLTACISSQWSELTYVLQAHPK 180
LLDLVF LLI+ ++ + S PLL+SALVACS+YLLT IS+QW +L +VL AHPK
Sbjct: 121 LLDLVFALLIVLGSSREEHQLSSHAPLLHSALVACSLYLLTGFISTQWQDLGHVLTAHPK 180
Query: 181 VDIFLDAAFGAVRMNINFLQIKLSAEENEFTNSSLTAEKAVHFCFQQCEASLQFLHSLCQ 240
VDIF++AAF AV ++I LQIKLSA+ +F + AE+ V+ QQCEASLQFL SLCQ
Sbjct: 181 VDIFMEAAFRAVHLSIRSLQIKLSAQCVDFPSP---AEQVVNSLCQQCEASLQFLQSLCQ 240
Query: 241 NKSFRERLLKNKELCGKGGVLHLVQATLKLNISPYLKEPLEMIAAISRLKARVLSILLHL 300
K FRERLLKNKELCGKGGVL L QA LKL I+P KE ++AA+SRLKA+VLSI+L L
Sbjct: 241 QKMFRERLLKNKELCGKGGVLLLAQAILKLCITPLFKESSTIVAAVSRLKAKVLSIVLCL 300
Query: 301 CESESVSYLDEVASIPESLDLAKSVAVEVITLLKTMVSRDLKLLANHSGKTYPSGLLQLN 360
CE+ES+SYLDEVAS P SLDLAKS+A+EV+ LLKT D K L+ S KT+P+GLLQLN
Sbjct: 301 CEAESISYLDEVASYPGSLDLAKSIALEVLELLKTAFGGDQKYLSGGSEKTHPTGLLQLN 360
Query: 361 ALRLTDIFSDDSNFRSYITIYFAEVLAAILSTPYQQFLASWCSSELPLKEEDASLEYEPF 420
A+RL DIFSDDSNFRS+IT+YF EVLAAI S P+ +FL+SWCSS+LP++EEDASLEY+PF
Sbjct: 361 AMRLADIFSDDSNFRSFITVYFTEVLAAIFSLPHGEFLSSWCSSDLPVREEDASLEYDPF 420
Query: 421 TMAGLILDSTSSLDTIR--SFESNFIPTNNAQSSYAHQRTSLLVKVISNLHCFVPKICKE 480
AG +LDS SS D + S ES FI N +Q+ YAHQRTSLLVKVI+NLHCFVP IC+E
Sbjct: 421 VAAGWVLDSFSSPDLLNLMSSESTFIQNNMSQAPYAHQRTSLLVKVIANLHCFVPNICEE 480
Query: 481 QERNFFLHKFLERLKMDWQDPKDGFAFSSAPQKAATICKNLRSLLGHAESLIPTSLNEED 540
QE++ FLHK LE L+M+ + F+FSS QKAAT+CKNLRSLLGHAESLIP LNEED
Sbjct: 481 QEKDLFLHKCLECLQME----RPRFSFSSDAQKAATVCKNLRSLLGHAESLIPLFLNEED 540
Query: 541 VQLLRLFFTQLQSLIGPVDYEVNRA------------------LELQIARGCTSSLQGNV 600
VQLLR+FF ++QSLI P + E ++ E Q GC+S L
Sbjct: 541 VQLLRVFFKEIQSLITPTELEESKLEGSMSWDKFSRLDIGEHHQEAQSTGGCSSPLLRKA 600
Query: 601 DFDHKNKHSNLREGMSENSAFPGVDNSCTKVEVTAEADGAMHNQKTCEGKSVKYSAEILV 660
D N+ +NL+EG SENS VD + +AD M + K L
Sbjct: 601 APDVTNRSANLKEGTSENSTLQEVDQFFGR--NMDQADDVMRQDRR---KDKNKLGRALR 660
Query: 661 ETEKEFHHEETSGSDSSTTRGKNPTDQVGNGENTKPGEHFHRSIVGVVQQDGRVESVNHE 720
+ EK+ + ETSGSDSS+TRGKN TDQ+ N E K EH S G VQ+D +VE + E
Sbjct: 661 DGEKDVQNVETSGSDSSSTRGKNSTDQIDNSEFPKSNEHIKASGSGGVQEDEKVEIIPSE 720
Query: 721 EKQVRKRKRHIMNDKQISLIERGLQEEPDMHKNAASLQSWSDRLSLHGSEVTSSQLKNWL 780
EKQ RKRKR IMND Q++LIE+ L +EPDM +NAA +QSW+D+LS HG E+T+SQLKNWL
Sbjct: 721 EKQRRKRKRTIMNDTQMTLIEKALVDEPDMQRNAALIQSWADKLSFHGPELTASQLKNWL 780
Query: 781 NNRKAKLARAAKDGGHPLSEENAIADAQAGSLMHTTSGSPESQNEDLPPSSAPKDHNQVS 840
NNRKA+LARAAKD ++ D Q GS + + SPES ED S + S
Sbjct: 781 NNRKARLARAAKDVRVASEVDSTFPDKQVGSGVGSLHDSPESPGEDFFAPSTARGGTHQS 840
Query: 841 AVRRTSSGITMSPVLEIPRPESSSRLNMQIEGPAISSSCVNFEAGQAVLLTDTQGKEIAK 900
A+ + S E+++ + I + V E GQ V+L D QG +I K
Sbjct: 841 AIGGSVSRAGAD------NAEAATAEFVDIN----PAEFVRREPGQYVVLLDGQGDDIGK 900
Query: 901 GTVFQVEGEWHDCNLSDTRTCVVDITELRSERILRLPHPSIEAGATFEQAEVKIGTMRVL 943
G V QV+G+W+ NL +++TCVVD+ EL++ER RLPHPS G +F++AE K+G MRV
Sbjct: 901 GKVHQVQGKWYGKNLEESQTCVVDVMELKAERWSRLPHPSETTGTSFDEAETKLGVMRVS 944
BLAST of Spo03238.1 vs. ExPASy Swiss-Prot
Match:
NDX_ARATH (Nodulin homeobox OS=Arabidopsis thaliana GN=NDX PE=2 SV=1)
HSP 1 Score: 548.1 bits (1411), Expect = 1.900e-154
Identity = 384/957 (40.13%), Positives = 554/957 (57.89%), Query Frame = 1
Query: 18 LITEVKELQRFNSQELSKLLKDSENFSLFYITAKSL--QIDMEKLVRLLPLHLTAVLLSS 77
++ V L NS E KLLKD+ +FS+ + + + L +I +EK+V++LP HL AV+++
Sbjct: 10 MVQAVNALHWRNSVEFHKLLKDNGDFSICFNSEQVLPQKISVEKMVKMLPRHLIAVVMTP 69
Query: 78 QRDEASLRYLLSGLRLLYSLCEISSRHPKLEQILLDDLKVSEQLLDLVFYLLILCCKQGY 137
+D S RY+L G+RLL +LC+++ R+ KLEQ+LLDD+K+S Q++DLV ++I +
Sbjct: 70 NKDGKS-RYILCGIRLLQTLCDLTPRNAKLEQVLLDDVKLSAQMIDLVILVIIALGRNRK 129
Query: 138 SF--SGAMPLLYSALVACSMYLLTACISSQWSELTYVLQAHPKVDIFLDAAFGAVRMNIN 197
S LL + LVA ++L IS +L VL AHP+VD+F+D+AFGAV +
Sbjct: 130 ESCNSNKESLLEATLVASCLHLFHGFISPNSQDLVLVLLAHPRVDVFIDSAFGAVLNVVI 189
Query: 198 FLQIKLSAEENEFTNS-SLTAEKAVHFCFQQCEASLQFLHSLCQNKSFRERLLKNKELCG 257
L+ KL + + ++ + V+F QQ EA+LQFLHSLCQ+K FRER+ KNKELCG
Sbjct: 190 SLKAKLLYRQTDSPKKLGASSVEEVNFHCQQAEAALQFLHSLCQHKPFRERVAKNKELCG 249
Query: 258 KGGVLHLVQATLKLNISPYLKEPLEMIAAISRLKARVLSILLHLCESESVSYLDEVASIP 317
KGGVL L Q+ L L I+P IA+ SR+KA+VLSIL HL E+ESVS+LDEVA+
Sbjct: 250 KGGVLRLAQSILSLTITPEFVGATVTIASTSRMKAKVLSILQHLFEAESVSFLDEVANAG 309
Query: 318 ESLDLAKSVAVEVITLLKTMVSRDLKLLANHSGKTYPSGLLQLNALRLTDIFSDDSNFRS 377
+L LAK+VA EV+ LL+ +S+ A+ YP G + LNA+RL D+ +DDSNFRS
Sbjct: 310 -NLHLAKTVASEVLKLLRLGLSKASMATASPD---YPMGFVLLNAMRLADVLTDDSNFRS 369
Query: 378 YITIYFAEVLAAILSTPYQQFLASWCSSELPLKEEDASLEYEPFTMAGLILD--STSSLD 437
+ T +F+ VL+A+ + FL+ CSS+L +E+DA+++Y+ F AG IL S+S
Sbjct: 370 FFTEHFSMVLSAVFCLSHGDFLSMLCSSDLSSREDDANVDYDLFKSAGWILSVFSSSGQS 429
Query: 438 TIRSFESNFIPTNNAQSSYAHQRTSLLVKVISNLHCFVPKICKEQERNFFLHKFLERLKM 497
F+ + + N SSYAHQRTSL +K+I+NLHCFVP +C+EQ+RN F+ + L+
Sbjct: 430 VTPQFKLS-LQNNLTMSSYAHQRTSLFIKMIANLHCFVPNVCQEQDRNRFIQNVMSGLR- 489
Query: 498 DWQDPKD-------GFAFSSAPQKAATICKNLRSLLGHAESLIPTSLNEEDVQLLRLFFT 557
+DP G +++ Q+ +C+NL SLL HAESLIP+SLNEED LLR+F
Sbjct: 490 --KDPSSILIKMLPGSSYTPVAQRGTGVCRNLGSLLRHAESLIPSSLNEEDFLLLRVFCD 549
Query: 558 QLQSLI------GPVDYEVNRALELQIARG------CTSSLQGNVDFDHKNKHSNLREGM 617
QLQ LI V +V + L C +L +++ N L+E +
Sbjct: 550 QLQPLIHSEFEESQVQVKVKKLFALLYIGFTILWLICLVTLIQDIEGRGGNLSGKLKELL 609
Query: 618 SENSAFPGVDNSCTKVEVTAEADGAMHNQKTCEGKSVKY-SAEILVETEKEFHHEETSGS 677
+ N+ E + + D + T +G + + + E L E++ + + ETSGS
Sbjct: 610 NLNNE-----------EASEDCDVRVEGVMTKQGVNEEIDTVERLKESDADASNLETSGS 669
Query: 678 DSSTTRGKNPTDQVGNGENTKPGEHFHRSIVGVVQQDGRVESVNHEEKQVRKRKRHIMND 737
D+S+ RGK ++ +N + F S G V++D + E+ EKQ +KRKR IMN
Sbjct: 670 DTSSNRGKGLVEEGELVQNMS--KRFKGSASGEVKEDEKSETFLVFEKQKKKRKRSIMNA 729
Query: 738 KQISLIERGLQEEPDMHKNAASLQSWSDRLSLHGSEV-TSSQLKNWLNNRKAKLARAAKD 797
Q+ +IE+ L EEPD+ +N+AS Q W+D++S GSEV TSSQLKNWLNNRKAKLARA K
Sbjct: 730 DQMGMIEKALAEEPDLQRNSASRQLWADKISQKGSEVITSSQLKNWLNNRKAKLARANKQ 789
Query: 798 GGHPLSEENAIADAQAGSLMHTTSGSPESQNEDLPPSSAPKDHNQVSAVRRTSSGITMSP 857
G P + N+ D PES P D N ++ S+ I
Sbjct: 790 TG-PAHDNNSSGDL------------PES----------PGDENTWQ--QKPSTPIKDQT 849
Query: 858 VLEIPRP-ESSSRLNMQIEGPAISSSCVNFEAGQAVLLTDTQGKEIAKGTVFQVEGEWHD 917
V E P+ E+ R + SSS + GQ V L D +G EI KGTV + +GEW+
Sbjct: 850 VTETPKTGENLMRTS--------SSSEEGIKQGQQVRLMDERGDEIGKGTVLRTDGEWNG 909
Query: 918 CNLSDTRTCVVDITELRSE---RILRLPHPSIEAGATFEQAEVKIGTMRVLWDSGRM 943
+L + CVVD+ EL +P+ S + G TF +A + G MRV WD ++
Sbjct: 910 LSLETRQICVVDVMELSESYDGSKKMIPYGSDDVGRTFTEANSRFGVMRVAWDVNKL 911
BLAST of Spo03238.1 vs. TAIR (Arabidopsis)
Match:
AT4G03090.1 (sequence-specific DNA binding;sequence-specific DNA binding transcription factors)
HSP 1 Score: 548.1 bits (1411), Expect = 1.000e-155
Identity = 384/957 (40.13%), Positives = 554/957 (57.89%), Query Frame = 1
Query: 18 LITEVKELQRFNSQELSKLLKDSENFSLFYITAKSL--QIDMEKLVRLLPLHLTAVLLSS 77
++ V L NS E KLLKD+ +FS+ + + + L +I +EK+V++LP HL AV+++
Sbjct: 10 MVQAVNALHWRNSVEFHKLLKDNGDFSICFNSEQVLPQKISVEKMVKMLPRHLIAVVMTP 69
Query: 78 QRDEASLRYLLSGLRLLYSLCEISSRHPKLEQILLDDLKVSEQLLDLVFYLLILCCKQGY 137
+D S RY+L G+RLL +LC+++ R+ KLEQ+LLDD+K+S Q++DLV ++I +
Sbjct: 70 NKDGKS-RYILCGIRLLQTLCDLTPRNAKLEQVLLDDVKLSAQMIDLVILVIIALGRNRK 129
Query: 138 SF--SGAMPLLYSALVACSMYLLTACISSQWSELTYVLQAHPKVDIFLDAAFGAVRMNIN 197
S LL + LVA ++L IS +L VL AHP+VD+F+D+AFGAV +
Sbjct: 130 ESCNSNKESLLEATLVASCLHLFHGFISPNSQDLVLVLLAHPRVDVFIDSAFGAVLNVVI 189
Query: 198 FLQIKLSAEENEFTNS-SLTAEKAVHFCFQQCEASLQFLHSLCQNKSFRERLLKNKELCG 257
L+ KL + + ++ + V+F QQ EA+LQFLHSLCQ+K FRER+ KNKELCG
Sbjct: 190 SLKAKLLYRQTDSPKKLGASSVEEVNFHCQQAEAALQFLHSLCQHKPFRERVAKNKELCG 249
Query: 258 KGGVLHLVQATLKLNISPYLKEPLEMIAAISRLKARVLSILLHLCESESVSYLDEVASIP 317
KGGVL L Q+ L L I+P IA+ SR+KA+VLSIL HL E+ESVS+LDEVA+
Sbjct: 250 KGGVLRLAQSILSLTITPEFVGATVTIASTSRMKAKVLSILQHLFEAESVSFLDEVANAG 309
Query: 318 ESLDLAKSVAVEVITLLKTMVSRDLKLLANHSGKTYPSGLLQLNALRLTDIFSDDSNFRS 377
+L LAK+VA EV+ LL+ +S+ A+ YP G + LNA+RL D+ +DDSNFRS
Sbjct: 310 -NLHLAKTVASEVLKLLRLGLSKASMATASPD---YPMGFVLLNAMRLADVLTDDSNFRS 369
Query: 378 YITIYFAEVLAAILSTPYQQFLASWCSSELPLKEEDASLEYEPFTMAGLILD--STSSLD 437
+ T +F+ VL+A+ + FL+ CSS+L +E+DA+++Y+ F AG IL S+S
Sbjct: 370 FFTEHFSMVLSAVFCLSHGDFLSMLCSSDLSSREDDANVDYDLFKSAGWILSVFSSSGQS 429
Query: 438 TIRSFESNFIPTNNAQSSYAHQRTSLLVKVISNLHCFVPKICKEQERNFFLHKFLERLKM 497
F+ + + N SSYAHQRTSL +K+I+NLHCFVP +C+EQ+RN F+ + L+
Sbjct: 430 VTPQFKLS-LQNNLTMSSYAHQRTSLFIKMIANLHCFVPNVCQEQDRNRFIQNVMSGLR- 489
Query: 498 DWQDPKD-------GFAFSSAPQKAATICKNLRSLLGHAESLIPTSLNEEDVQLLRLFFT 557
+DP G +++ Q+ +C+NL SLL HAESLIP+SLNEED LLR+F
Sbjct: 490 --KDPSSILIKMLPGSSYTPVAQRGTGVCRNLGSLLRHAESLIPSSLNEEDFLLLRVFCD 549
Query: 558 QLQSLI------GPVDYEVNRALELQIARG------CTSSLQGNVDFDHKNKHSNLREGM 617
QLQ LI V +V + L C +L +++ N L+E +
Sbjct: 550 QLQPLIHSEFEESQVQVKVKKLFALLYIGFTILWLICLVTLIQDIEGRGGNLSGKLKELL 609
Query: 618 SENSAFPGVDNSCTKVEVTAEADGAMHNQKTCEGKSVKY-SAEILVETEKEFHHEETSGS 677
+ N+ E + + D + T +G + + + E L E++ + + ETSGS
Sbjct: 610 NLNNE-----------EASEDCDVRVEGVMTKQGVNEEIDTVERLKESDADASNLETSGS 669
Query: 678 DSSTTRGKNPTDQVGNGENTKPGEHFHRSIVGVVQQDGRVESVNHEEKQVRKRKRHIMND 737
D+S+ RGK ++ +N + F S G V++D + E+ EKQ +KRKR IMN
Sbjct: 670 DTSSNRGKGLVEEGELVQNMS--KRFKGSASGEVKEDEKSETFLVFEKQKKKRKRSIMNA 729
Query: 738 KQISLIERGLQEEPDMHKNAASLQSWSDRLSLHGSEV-TSSQLKNWLNNRKAKLARAAKD 797
Q+ +IE+ L EEPD+ +N+AS Q W+D++S GSEV TSSQLKNWLNNRKAKLARA K
Sbjct: 730 DQMGMIEKALAEEPDLQRNSASRQLWADKISQKGSEVITSSQLKNWLNNRKAKLARANKQ 789
Query: 798 GGHPLSEENAIADAQAGSLMHTTSGSPESQNEDLPPSSAPKDHNQVSAVRRTSSGITMSP 857
G P + N+ D PES P D N ++ S+ I
Sbjct: 790 TG-PAHDNNSSGDL------------PES----------PGDENTWQ--QKPSTPIKDQT 849
Query: 858 VLEIPRP-ESSSRLNMQIEGPAISSSCVNFEAGQAVLLTDTQGKEIAKGTVFQVEGEWHD 917
V E P+ E+ R + SSS + GQ V L D +G EI KGTV + +GEW+
Sbjct: 850 VTETPKTGENLMRTS--------SSSEEGIKQGQQVRLMDERGDEIGKGTVLRTDGEWNG 909
Query: 918 CNLSDTRTCVVDITELRSE---RILRLPHPSIEAGATFEQAEVKIGTMRVLWDSGRM 943
+L + CVVD+ EL +P+ S + G TF +A + G MRV WD ++
Sbjct: 910 LSLETRQICVVDVMELSESYDGSKKMIPYGSDDVGRTFTEANSRFGVMRVAWDVNKL 911
The following BLAST results are available for this feature: