Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: five_prime_UTRexonpolypeptideCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
AACGAAAACTCCAAAAACACCTGGAATCTTTCCCCTCTTTCTCACACACATTTGTCTCCTCTTCAAGAACCCTAATCTCAGCTAAAAATCCAATCTCCCCCAATTCAATTTCAAAAAATCACATTCACCCCACTCTTTCTCTCTCCTCCACAGTCATTGCTGTCGCCCTCAATCGCGACTTTCGACCACCACTACCCACAATTTTCCCCCTTTTCTCCCTCCTTTGTTTCCCCACTTTATTTTCTCTTTCCTCCTTCCCCATATAAATTAGGGCATCTTTTAGTGTTCCAAGAGATGACTTCTTCCTCTTCCACTAACAATAATAATAACCCTGAACCTGATTCTCCCCCTCCTCCTAATCCCCCTGCTTCTGAAACTGTAGCTGTTGAAACTGACAATCTTGAAGCTGAAGAAGAAGCTTTCATGGAGAAATCCATGGTTTCTGATGATCATATGGATGAAGATTCTGTTAATAATCATCCTGCTACTGTTTTTTGTATCACATTGAACCAATCTCGCTCCAATTTGCTTCATAAAATGAGTGTTCCTGATCTTTGTCGCAATTTTAGGTACTTTACTTCATGTTAACATATTTTGCGTCTACTTGTTAAAATTTGTATACTACTCCGTATATAATAATTGTTTTTGAGAGTTTTTTTGGGTGTGATTGATAATTTTAGTTATTGCCCTGTGTTTATGTCACCTCTTTTGAATCAATTTAGCGTCAACTTGTTGGAATGCGGGATGCTTTTTGTTTACTTTATCTTTGAAGTCCACAACTTTAGTAATCTAAATATCTATTTGTTGAAATTTGAAATGGTTTTTTTTGTTATACTCCATAATTTGATTTTGAGTTTTTTCCCCCGACGAGATTGCTAATATCTATTTGTTGAAATTTAACCTCATTTGCATCATTTTTTAGCATTTACTTGTTTAAATATGAAGTGGTTTTTGTTTTTGTTGTTGGTACTCTATGAATTACTAATTGAGTAAGCTTATTTTCGTGTCATTAATAAAATATGCAAATTTTGATGGGCTGAGTGTGTGAAGATGATGGGTATTACGGGCACTAGTGAAATATAAGGCAGCTTGTGTCTACTTAGAAATCACAATATATAGGTTAGCTCTGCAGAGGAGTAAGTTACTACTACTAGTAATTTACGGAGGCAGCTTCCAACGGTGTATAAGGGTTTTAATTTTTACTTGGTGGGGAACTGAAATTATCCTTGGAAGCAATTAAAGATTTCCCTATTTGGGACTGGATCTTATTTGATTTTGATTATATGTAGGGTAACCTCGTGTCTGAATGGATTACGTTGTTAATAAGTGTGATCGTCAGAGGTGCTTTTGGAAGGTGATTTATTGATCACCGAAAACGGTCCTGATCCTGTTTCAGTTTTCTACTACTAGTAATTTACCGAGGCAGCTTCCATCGGTGTATAAGGGTTATAAGTTTTACTTGTTGGGGCATTGAAATTACCCTTGGAGTAGCTTCTGGAAGCAAATTGAAGATTCCCTAAATGGGACTGGATCTAATTTGATTTTGATTATATATAGGGTAACGTCGTGTAATCTGAATGGATTATAACATTATGTTTTTAATAAAGTACAATCATCAAAGGTGTCGTGGAAGTGATTTGTTGATTGCCGAAAACAGTTCTGACCCCGTTTCAGTTATAATTGTGAAATCTCATGTAATCAGAATGGACGGTTTTATCCCAGTAAATATTTGTCGTTGACTAGTTGCTTGTTTAGGTAACTTGCCTTCTTCTTTATAGCTTTCTCTCGTACAAATATCTTACCAGGATCGTATTGTTTTCTATCTTCAAGTGCTGTAGCCTGGTGTGGGAAGCTGAACGCCGTAGCATGTGCTTCAGAAACTTGTGCTCGAATTCCAAGGTTAGCTTACTTTTAAACTCTTTCTTTGGCTCAATTTTTATAATTTATTATCATATACCCTTCTTTCTGAACCTCTGAAAGTCAAAGCAGTGGAGTGTTTAACTGTTCACTACCACATTATGTTGAATTGGGTAAAATTCTCCTCATATTATTAAAAATTAAGAAGATGAGATAAAGCATAAACTTCAGCTAGTGTATATCGTAATGATGATTATCAATAAGTTTCACTTGCACAGTAAAATGAGACACCTTCACATTTTATCCAGTAAGCTTCCAGCACATGATTTCTTATTCATGACTGTGAATTGAATATATGTCAAACTTGAGGTTTACGTACTTATTGGTTTTATTACTAGCGTTTTTAATTTAAGGGAAAACCCGAATTTAGGGATTAAAAATAATATGTGGGATGGAAGATTATAAGAGATTATGGTGGAAGAACGATAGATAAACTTAGATGAAATTGCAATCACCTACAGGTTCTAATATAGAAATCTCGATCGAATACTAGATATTCTTCAATCGAATCAAACACAAAATAATCTACAGAATGGGTAATTTCATTTCTCATAAAACGTCTTCCCATATGAAACAATCATGAGCATATTTATACCATGAAACCCCTGTGGTCCTAGGAACCCTAGAGCCTTGACGTGCTAGCTAGGCCTCTAATTATTACTTCCTCCGTCTTTTAATACTCGCAACGTTTGGACTTTTGCCACTATTCATATAATCTACTTTGACTATTCGTAGTGTTTTTTGTATAAGATAAAACATAGTCATGTGGGATCTTGTTAGATTCGTCTCAATGTGTATTTTCAAAATATCAACTTTTTATAATTTTTGCATAAAGAGAATTTAAGATATAAATGATCAAAGTTGTGCATTGGCATGCGTGAAACTAACAAACGTTGCGAGTATTAAAAGACGGAGGAAGTATTAAACTAATAGACAAGGCTTAAAAGCTCAAACTTAACCAATCTACCCGGTAACAAACTAAGTTGCACGGAAACGGGTACGGAGACGGGTACCGGGACGGGTACAGGGACGGAAAACGGCAAAATCAAAATTGCAAAAAACGGGTACGGCATGGATTCGGCAAAAAATAAATAAAAAAAATAAAAAAAATAAAAAATACAAGAATTTAACTAAGTCAAATACATGTGAGAAGTGAGAATCAAACGGTCCCAAAATAAACTCTTGTTTATTTAAAACTAGATTAGATCCCGTGCACGCACGGATTTTGATATTTTTTTCACAAAATATTTAACTGAATATCTCCCAAAGTACTACATAGTTATAACATTTTTAACATACTCGTTTAAATTTATTCATCTAATATGCATATTTAAAATGGATATAACAATTTGACCGAAATATTGAGTCTACCAGGGGTGTGGTAAGTGTTAATTTTGTAAGTAGTTAAGGACTTAATAAGTCAACCTTCATGTCAATCTTCATGCCTCCCTTCTTTCCTCTCCCCTTAATAAGTCAATCTTTATGTCTCCCTTCTTTCCTCTTCCCTTAGATTATTATTCTTTGTATAAGCGCTACAATTTTTCTTTCTCGCCTTCATTTTGCACTTTGCCTTCAAGGAATCCCCCAATCAGCTAAGATAATCTAGTGTAAATTTTGACTTTTTTACTTGGATTCGAGAGTAAGGTTATGGATCAAGGTTTGGTTGCCAGGGAATTGAAAATAGGGCCGTGACATGTCCATGTCCCTAAACTCTTTACTTTTTTTTTTGTAAAATATGAACATGGTAAAATTGTTTCCGTGACTCAGTCAATTGTTATACTTCACCAGAAGTCCAATACTTCCTCCATTCTTTTTTAGTTGATTCTTATTTTGTCACGTTTGCCAATGCAATATTTCCACAATTAAGATCTTTAATACTTCGTAATCAATAGGTAAATATTATAAAAAATTGATATTTATAAATCTATGGAAAAATTAATTCTAATAATCCAGCCTTTGGCCGGCCCTCCAATAGCAATCCTACTAGTACGATATCTCGTGCTAATCCAACTTTTGATACCTATTATCTTTAAACGAACCTCGTTAAATTGCGACCTTGTCCTACGTGGCTGTGACTGTTGGCAATTAAGCCGACCCCTTTAAGAGCGCTTCCTTCCTCTCTTCCCTTCCCTGCTTTTGCTCCAAACTACGCCATTAACACACCGAAAAAAGCTTTCTCTTCTTCTCCAAACCATGAATTAATTGGCCATGGCTTTCAATTTTATTCGTTCCAAGGATGAATTAATTAGCCACAGATCATCACCAAGATACTTCTGGGTCAATTCACTACCAACAACACGTAGAAAAGTAGCTGATAAACACATGAAGAACACCATTTGCACAACAAATTATCGGCAAAACTCAACCAGATCGTCAAGGCAACCAGATCGTCAAGGCAGACACACACGAACACATGAAGAACACCATTTGCAGAACGAATACCCTGTGCTTAATCATATTATTATATACCCTTCTACATTGACTCCTCGAGCACCATCAAATGATGAATAAGGGACAAATACAAAAACAAGAGAAAAAAATCCAACAAATCATCATCAACAAGGACAATTACAAATGCAAAGAATCATGTATCGCCCTGCTTTGATTATGGTTGTTTCGAATGTTCTCCCAATTCTGGAATCATTGCTGGCATTCTTCCCCTCACAACGAGCTTATCAATCAAGTTATCAAAGCTTTCGAGGAGCATATTTCGAGAAACAAGAACGAACCGGTCAACCTCCATCATCTCCGGTCAATCCTTCGAACAAAACGCCAAGGAACGCGGAAACACAACATGCATCAATAATCTATCACCAATACCTAGATTCTTGCAAGCACACTAAAAACTTCAAGAACCCTAGAATTTGGGAAATTCAATAATCAACATTTATTTGATTCAAGATTCGAAAGAAATGATCAATGAATGAAATCAGAAAATGTCAGGGCAAATTAAATCTCTAATTCGTCGAGTTTTGACCCAATTCGAGAAAAAAAATTGAAGTAGAAATTTGGGGGGAAAATGATGAGTTGATTTGAATGAAAATTTTATGCGGAATTAGAGAGCCAGGACCGTCCCACTCTGATATCATGTTAGAATTGTCTAATTGAGAGTTTAGGGAAGGAAAACGGAGAAGGAAGAAAGGAAAGAAGATAAGTGAAGCCATTGTTGAGCTTGAGAGGTCGTGAGTCTCAGAGAACAAAAGGGAGAGGGAGTCAAATGAAGGGGAGTCAACAATTAATTAAAAGAAAGTAAAAAAAAACCAACTTAGATAAATAATTTCAATTTAAAAGTTGTTGACTTTCTTTTCAGGTAACCGTTCATAAGTCCGTTTAAAGATAATAGGTATCAAGGGTTGGATTAACATGAGATATCGTATTAGTGGGATTATTAGAATTAATTTTTCCTAAATCTATACAATGAAACTATTATAACAAGGCACCACATGATTTTTTTTTCTTCTCATGTATAAATCACAATTGATAGTCAAAGTAGTTTATATGAATAGTGTCAAAAGTCAAACCGTGTCAACTAAAAAAGAACAGAGGAAATATAATTTAATTTGAACAATAAAGAAAGGATGCTTTGAATGAAAAAAGGTCGCCAACTACGGAATGTTGAGCTATTGAACAAAGAAAATAAGGGGGCTGTAGTACATCTTGTTGGGTTGAGCTGTAAAATGTAAGCCACTGCAAGTTAGTTATTGTTCGTGGCCTTCATGCAGTATGGAGTTCTCCAATTGGGTCTCTTTTTCTAGCAGCTTGAATGCTAGGCAAAGAAAAGAACAAAAGTCTGTTGATGTCAAAGGATTTATGTTCCTTTTTCAGTCATACTTCTAACTTTAAAGGGAAATGGAAAAGAAAATTAGAGGGGTATGGTGGCAGAGGTAAGTAAGAGGAAAAATGTACCCACCTGAAATTTATGGAATGTCTGCCATTTGTTTCCAGTTGTTTTTATCAGATGGTCTGAGCATCTTTTTTGTTAATGTCTTGACTGCTGAGAATCTGTTGCGACTTATGAGTAATTTTTTTGAGCGCCTCTCAGCAGCTTCATTAACTTAAATCTGATTAGAAATGCTGCTTCTAATGATGTTGATCTCCTTTATTGGTGTAGTTCTAATGCCAATCCACCATTTTGGATCCCTATACACATAGTGATTCCAGAGCGACCCACAGAGAGTTCCATATTCAATGTCATAGCAGGTATCTTTACTATGCTGCTGATGTAATTACTCTGGTTAATTTATCTTTTACCCTTCACTTTGCTTTTGCTTAAATGTAACTTAAAACTTTCATTTTACACCTTTCTGCGACGGATTCTATAGCTGGAGGTGTTACCATGTTAATGGATGATATCCACGCTATAGTTTGTCATGTATATTTTCTGGGAAGTGTAAATTTCATTCGGATTACTTGGTATTCATCTTTCCTTTTTAACGGGACTTAGAGATCTGAAGATATTCAATTTTCCCCTGAAGTTATGTACGCATATAACTTTTCATTTACTCTCTTTCAAACAAATGTTAATTCATGTAATTATCTAGTTCAGTTATGCTCTATTTGATAATGTCTAGATGAAAGTACCCATATCGACTTGACCTTAACAATAATGGTGATTTATGTTCCTCAATGTTTATAGCTTCAATAAAACTTCTGATATTAGCAAAATATTTAATATTATTTGAGATTGTATAAACAATTTTTTTTTACTAGTACGCATTATTTGAAAGACTAAGGGTGTTTTTGTCTTTTTAGTATCCTTTATGACCAAAAGGGACTGGATTAACATAACTTGGGGGTTGTTGTTAAGTATGTCTCAGTGTGTGTGTGTTGGGGGGGGGGGGGGGGGTTACNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNGGGGGGGGGGGGGGGGGGGGGTGTTAAATAGAATATGTGCAAATTTTATGATGTGAAATGACAATCTTCATATCTCTGGCATTGAGATGAAACTTGGTCAAATGTCAAGGTGACAGGAGGAAAATAAAAATAGTATATGTTGTGTGTGTGTACTTTATGCAATGATGTTAACAACTAGGGAAGTATTGGAGTCATAAATTGAGGACATGAAACTTGGAAGTTAGCCTTATTGAGAACACATATAGTGAGACAGTGAGGGGATAGCCGGGAAGTACATTAAAGTTGCACTTCTTACTATATTGTGGTAAGAAGGATACATATGAGAAGTAGGACTTATTTGTAAGTGAATACATATAGTGAGACAGTGAGGGGAAAGCCGGAATTTCATTAAAGTTGCACTTCTTACTATGTTGAGGTAAGAAGGATACATATGAGAAGTAGGACTTATTTGTAAGTGTCCTACTCCTAGCACAAGAAGTCAAAATTTTGAAGGGAGGGAGGAAGCGAGAGAGTATAAATGTGAAACATGCTTTCGTTTCATAATGTAATTTTTACTGTTATTTTGCTTTGGATGACTTTTGTACATTGTTCGAAAAAAATAATTTATGAGTAAGGAGAACCTACAATACAGAGGACACGTCCTCGTAGCAGCAGAAAGAAATTACAAATTAGCCTAATGTGTTGAATTTCTCCCAGCGCATTACTGTTGAAAGCCCCAGTTGACTTAGCCCACAAAGATGCTAGCTAGTAATTATACGGTACCCCAAAGCAAAACACAAGTTACTTATAATCTTGCAATGCCATTTTGACCAGAACTACAGAATCGCCATAAAAGCATTGCTACGCTACACCTTCTCTACTGCCTTTCCAAAATCCCATAAAACGAACTTGAATATTATTCACCATCTCAAAATGAAGGTAGACATGCGACTGATGTGAGCCAGACTCACTATTGGCACAAGACAATAAGACATGCACAGTTCGGGTGATATAGCCATTTCTCTGGTTGCTATTTTGGAGCGTGTCAGTACACAATAGTATTTACCCAACGCCAAAATCACACAAAAGCTCTCACATTTAGAGGAACTCTAGCCTTCCAAATAAATCTGTCAACACTCAACAGGGATGAGAGGGTACGTCAGAATCAAAAAGATAAGGAAAACTAAAGGGACTTGCAAGTAAAGCTGCCTGATGAATCCCCCACCCACCTCCACTTTTTGTACATTTTTTAATCACAGTAAATCAATGGAACTGAATAACTGATTCGTTTCACGTATGGTCCTCATCTTTGTAATGTATATACTTTTCTTTGAAATCACCTGATTTGGATATCATTTACAGATAGTCCCCGCGATTCAGTTCAGTTTATTGAATGGTCTCCACCTTCTTGTCCGCGGGCATTATTGATTGCAAACTTCTATGGAAGGGTTACTATTTGGTCTCAGCAGACTCAGGTTAGATTGCCCACTGAATCCTTGGCACTTAGTTTTTTTTCTCCCGCTGTTTACTTATTTCCTTCCTCAACTTCTTAGGTTGGCTTCAGTCCTCCTCTTAATTTCATTTGTTACTTTATCTGTTGGTCTTTGAAGTACCTTAAGAGACCCTAATAAAATTCTATTCAGGGACCAGCTAACCTAGTGCGAGATGTTAGTCGCTGGCAGCCTGAACATGAATGGAGGCAAGATATTGCAGTTGTGACAAAGTGGCTCTCTGGTGTTTCTCCGGTAAATATCCTTCCTGTGTGTTTTATGGTCATTGCATAGTGAATACCCTTTTTGAATGTTTCTGTTGTGCTACATGGAATCCGTTCTAAGAGTGAGACCAGGGAGGCAGGGAGTTCCGACTCCTGGGAAAAAGAGCAGGTATTGACGAAAAACGGGGAAGGAAATGTTGGGTCTCTAATAAAAGTTTGAGCCTTTGAGGCCTAGAAGGGTTTACCAAGTGCTAGACAGTGGCAGTCAATTGACCATTGTTCCACAAATAACCATTACCTTTTCTGGAGATTGAGAGGGGAAAATCACTATGAAATCTTGTTTTATTTCCACCAACACAAGCTAGCAGAGAGGCCCAATACATGAAGGAACATTTTCGGATGTCCACATTTTTGCATAATTGCTCGATTATTTTGCAACATGATGTAATTAACCATTTCCAAATCTGGCTGATTGAACCGTTGAACTCATTCTTTGTGATTACTGTCAGAATGCAGACATGATGAGTGATTAGAAATACTAAGAAAAAGGGACGAATGTAGCTGCTAGGCCACAGGAATTAGAAAAGGAAAATTGAAACCAAAAGATGCTGAGAAAATAAGAAAACGCTAAGCTGAAGTTGAGTCACGCTAGTTTTGTCCTGTTTTTTAAGTATTGTGTTATCAGATGTCCTCGTGTACCCATGTTGACATTACTCCAGGATTGCAGGTTGGACTCTTCATTTGTCGCCAGAAACAGTTACCCATATGAGCCACAGACTGGGGAGACAGATTGGGAGGGAGGGAAGGGGGGGGGGGGGGGGGGGGGGGGGGGGNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNGGGTGGGGCTAGGGAAAGAGATCTGGTACTTATTTTGGATGGGAACTACCATGTTCTAAAATTTGGCGCCTTGACAAATCATATACTCAGACAAGTACCTGTAACTGATAATGTCACTTGAGTTCATCAGGTAGCATAAGTCTATAGTCTGTATTTGTGTCCTGTTTGTTTTCAGTTTGTAACATGCTTGTTACATCACTTTAGTTGTCATCATGCAAGTTTTCTACTTGCAGTGCTGCATAGTTTGGTTTCATATACTCTTATGTATTCAACAACTATGTTGCTACGATATTCAACAGTTTCCTTGTAAAAAATTATATTCAACATTTTATTGTTCCCACCCAACAGAAGATTGAGTTTCTGTAAGTGATGCACAGGTTTTAAAATTTCTCTCTTTTGGTATGTTGATTTTTTTTTAAATAAGCTGCCAGGTTTTTTTTATCTTGTTTAATTATCTTTTGCTGCTTTAGCAATAAACTTAGATGTTCTTAATGTAGTACAGGTTTGTGGTTATATGCTTCGAGATTTCTGTCTAACCTGAGGGCGGGAATACCTTTTTTCATGTCATTTCTTATATCTGAAGTCTGTTATATGCGCAAGTGTAATAGTGGTGACCAATTCTTCCTCTTATACTCCCCTGTAAATATGTTTCAAAGATTTCGTTCTGTTTCCTTGTTGTTATCTCTATAGGGATTGTTACCTTTCTTGATGATAAGTTTTTGGTCAATACGTGTCATCGTCGGTTCCGACACTTCTTCAGCCTAGTTTGTTCCCTGGCATGACTTTTTACTTATCCTTGATCAGTATAGGTGGCTTTCTGCAAAATCCAATAATTCTACAAGCTTGAAGTCAACTTTTGAGGAGAAGTTCCTTCCTCAGCATCCTCAAACGTCAGGTTGGTTATCTCTGTAGTTCTGCGTGGAAAATATTGTTGGATGTAAGTAACTGTGTTTCTATCCAAGTTATGGATGTATTGTTCATTGTGAAATGTGAAGTAATTTGAGTAGTATGCTATGATGTAAAGAACTGATAATTTTGAGCAGAAAATTAATGGTAGAAATATACGAAGTATTAATTTGCTTGAAAGATGATGAACACAAAAAGGGGTACAAAAGGTAGTGACTTGAGGCCACCAATCTCCAAGGGTTAGTGACTTGAGGTCCACTCTCCAAAGCCTAAGCTTTACTAATTTTTCCACATCCCTTCTAAACCTAGTGCTCCCTTATTTATAGCCTCTCTAGTCTCTACTCTAACCACCTAATTTGTCTTCTAGCATTAATACTACTTGCACTACCCATTATAACCTTTACTATTTAATTCATTACATATGGATTCATTATTCTTTATTTCTAAAATGTTATTATTTTATTGAGAAAAAAGTAAGTAGGAGAAAGTAGGTGCAAATCACAATTTTTTTTGAGTAGTATGGACTTATCAAAGTGCTCCTTAGTGAAGTCTACAACTTTTATAGTGCTTGAAGAAAGCTGGTTGCTTAAGACATATCTGTCCTAATACACGTGCCATAGTTGTGAATAGAATAGTATGATTTGTAAGATGAATTGGAATAAGTTCCATTACTCTTAGATTTATAAATATTCCGAGGCCTTAGTTTTACTTTCTCAATTTGGTGAAAGGCAGCTAGGTTGCACGGAAACGGATACGGGGAAACGGGAAATTCTCAAATATTAGGAAACGGGGAAACGGTTAAAAATATATATTTTTAATAATTTTTCTTTTTTTAGCATCATACTTACCAACTTTAAAATTCTCACTAATTTCTTCTTTCTTAAACAAAAAAAAAAAATAAATATATTTTAAGCTAAAAAGATATCCGTAAAATAGAAATGTGAAAGTATCTTAGGACGCAAATTTTAAAGAAAAATTAAAAAAAAAAATTAAAAAAACTCTGAGTCACAGTCTCAGTGTCTCACAGACTCACCAGTCACCGTGATTTAATAAAAATCAGTCACATATTCACCATCACTGACCATCGTTAAAAAAACTTGAAAAAATCAGATTTAAATGTTAAAAATAAAAAAGTTAATTTGTCCACTCAAACCCTTCCACCTGTTACAAGAAGGTCCACACCCTCTTTCTTTTTTGTACTTTCTTCTTCCTTCTTTCATTTTCTTACTTTTACAGAGAACTAACCATTTTCATGGTTTCTGCACCTCCCCTTTTCTTTTTCAATTACATTTGCAACAAGAGGAAAACAGATAACCTAGCATATTTAGAGTTTTTGATAAGGGAAACGTCTTTTAGTCTTTGAAAAGACGTTTCTTGTGCGTGTCTTGGGGCTTCAGAAGTGCGAAACGTTTCTACAGAAAGGGAAACATTTTTTACCCGTTTCGAAACGCGTTTCGAAACGTTTCTGAAACGGATGTATCTCGGAAAATGGCGTTTCCGTGCATCATAGAAAGGCAGCCTCTTGCTCCTTGGATGCTAGCAGTTGCAATGCAATTTTATTCTTTTGCTTAATTGAATATTTTGGTGGTCATGGTTATCTGTATGGAACTTCTGCACTTTCCTGTAGTGCACATATGGTTCAGATATTTAATATCACACCAATCTCTTTACCCTATGTGTAACACTTTGGATAAGGTGATGTTATGTTTCTAATGCAATTTCTCTCTTTTGCTTGTCTTCTCAGCTAGGTGGCCGAATATCCTATGTGTATGTTCTGTATTCTCTTCAGGGTCGGTCCAACTTCATTGGTCTCAGTGGCCACCTCGAGAGGGTGCATCAGCGAAATGGTTTTGCACAAGCAAAGGGCTTTTGGGGGCTGGACCCAGCGGTATAATGGCAGCTGATGCTATTGTGACTGACTCAGGAGCCATGCATATCGCAGGTGTCCCTATTGTAAATCCATCAACTGTCGTAATCTGGGAGGTCACACCAGGACCTGGCAATGGATTAGAAGCAACTCCAAAGACGACAATAAACAATGCAGTCCCTCCGTCTGTTGACCCCCATAGTTGGTCAGGATTTGCCCCTCTTGCAGCTTATTTGTTTGACTGGCAAGAATACTTCATATCAGAAGGAAAACTAGGGAAAAAGCAGTCATATAAAGACTTCACGGAAAATGTATCATTGTATTGTTCACCAGTCTCAAATTTTTCGGCATATGTGAGTCCAGAGGCTGCATCACAATCAACTGCAACTACTACATGGGGTTCTGGTGTAACGGCCGTTTCATTTGATCCAACTCGTGGGGGTTCAGTCATCACTGTGGTGATAGTTGAGGGTATAACATACTGCACTTCTTATTATGTACAATGAATTGAATATGCTTCACATTTTGGCGTTTTCGATGTTCTTGATTTACACTTATTTTTCCTAGAGGGTGCAGAATCATCACTAGATTTATTCTCTCCTTCGTTGTTGGACACTTAGATTTAAGTCAGCATATTGTGCATATCATATACAATTACTTAACGCCTTGGATGAGGCTTTTGCCTATGGAGGGGGGTGGATCTTAGATGTATAATTGTATACAATCATGCTTCTTCAAATTGACTTGTTTCTTATAGACCCTCAAAGATTTTGTGGAATTATTTTATGCTGATGATATCCTAGGGTTGAAGCTGTTTAGCATTAGATACGGTTGTGCCATGTTGATCATCATTACCATACTTGGTCTCTTTTGATTTGGCACCATCAGGCAATGAGACCTGCAAATGAAAGAACATTTTCTTTTGATTATGTGCAAGGGAAGTTTTTAGCATTTGAAACATGCAGACTGAAGAGGAACCAAATTGATTTAAAGGGATTCCGTTTGCACGCTCTTTGTATAGAATGCAGAAGTTTGATGGATACTCTGCTCTATTTGGCTGTTTTTTGAACACGTCATAAGGAAGTCTTGTGATTTGGCAATACCTTGTAAAATTACGGGCGGGTTGGCTTTTACGTAATTTTGACTAGCAAAAGATATACACATCATAAGAAAACCTTATGGTGATTTGTCACTACGTTATCAGTTAGGGGTTGGATTGGCTTGTTCGATTACTTCTCCTTTCAGTTTGATTATGCTGGGCCTGGTGGCCTTGGTCTGCTTATTTGCAATGGGTGCTCCTTATCTTTCCTCAAATTTTTCTTTTTGTCGAACACGGTAATGGGGAGCTTTGTTGTATACACCTTTATTTATTGCCTGGAAATGCATTTTTATGGTTTTGGTGTTCTGTTAACTAAATTTCTTGATTTGTCTTCTTTGGGTTCTTGTATAGCTTGCTTGACTTTTGTGGTGGCTTTCTTATAGGGGATTGTGCTTTTTACATGTATTACTTTTCTTGCTTGACCAACATGTTCTATGTATTAGGTACTGATTTTCTTAACTTATCAATATCCATCTTATTTTTACAAATTGATTAAATATTTGGCACTTGAGTACTTATTGTCATGTCTATCAGAAGCTTTATGCAAATTGATTTTTGCCTTTCTGGATGTGCAACAGAAAGCTCCTTTCGAGCTGAGAGGCTCATTGACTATAATCATATTGGCATTACTTCCTTTTACGTTAGACGCAAATTGAATACAATAACAAAAAATTATAAACCCTAATTGATTGGTGATTTGTTTACCTCTCGTTCCATACCGCTAACTGAGTTTGGAATTTGAGCTGCTTTTTGAGGTATACTTATGAAACGTGTTGATTCATATAACTACACATTTGTTGCAGGACAGTACATGTCCCCATATGACCCTGATGAGGGGCCCTCAATCACAGGATGGAGAGTGCAGCGTTGGGAATCATGCCTTAAACCTGTTGTCCTTCACCAGATATTTGGTAGTCCTACCAATTTCGGAGGGCAGCCACCTATGCAAACAGTTTGGGAGACCAAGGTCAACAAGAGCATGCCACCAACTGATGATTTAAAGTCTCAGCAAGCTGCATCAGTAGGACCAACTTCTGATCTGCAGCCAAAACCCGAGTCTTCTGGTGATAAGGCAAAAAGGGTCACCTTTGATCACTTTAACATGCCCAGTGACGTTAGGACACTTGCTCGCATTGTATACTCGGCTCATGGTGGTGAAATTGCTATTGCTTTTTTACGTGGCGGGATCCATATTTTTTCAGGGCCAAACTTTACACCTGTTGATACATACCAGATTAATGTTGGCTCAGCGATAGCTGCTCCTGCTTTTTCTTCAACAAGCTGTTGTTCAGCTTCTGTATGGCATGACACCAGTAAAGACTGCACTATGCTGAAGATAATCCGTGTTCTTCCACCTGCTGTTCCCAGCAGTCAAGCAAAAGCCAGTTCTCCTTCATGGGAACGTGCAATTGCAGAAAGGTAAACAATATTTGCAGTTTTATGCATTGTGGGTGTCCCAAAATTTCAGCCGTCTTTCTCTTTATGATTGCCCCAAACCTTGTCCTGATCTAAAACGAGAGCCAAAAGAGCCTATAGGTTTCTGAAGTGCCCCCTTATTGAACTGAATTGTTAATAATAGTTGAAGAAAGACACATCCCTTAAGATTCTAAGATTTTGGGACAAAGGGAATAGTTTTTGTGTTTGACTACAGTTTGCAGTCTCCCTGGTCTGTCTGTTGAAGTTTCCTAGTCTGGTTATAACTATGATGTTACTGAGTTTCATGTTTGTCTTCACTCCCTTTCTTTTTTGAGCAGGTTTTGGTGGAGTCTTTTGGTCGGAGTAGATTGGTGGGATGCTGTCGGGTGTACACAAAGTGCTGCTGAAGATGGAATTGGTATTGAAATATACCTTTTTCAAAGTGCCAAATTAATATTTATCAACTATTATTCTTTGTTATTCAATTTGATCATGGAGAGGAAGAAGTGGAATATTAACTTGTCAGCTGTTATTCATATTCTTTGATGGAAAGTTGTGCTAATGGGTCTCTTATGTTCTATCTGAAGTTGTTTTACTCTCTTTCTGTTTTCAGTTTCCTTGAACAGCGTTATTGCAGTTTTGGATGCTGACTTCCATTCCTTGCCCTCTACTCAGCACAGGCTGCAATATGGACCTGTATGTTACTCTTGTTTTCTTTTCAATGATAAATACCTTCCATTTGGAAATAATTCTTCGTGCTTTACTGTTCGTAACGTGGGAGCGTTCTAGCTCATCTTTTATATGTATGCTATTTCGATGATTGCTGCTGTTGCAACATGTACGAGTTCATTGCTTAATCTGACTTTTTATCATTTCAGTGTGGTTCTTTTATTTTATGCGTGTTGTTGTTATAGCTAGGGTTTGTTCATGTAGTTATTTTGTAGGGGTGTATACTGAAGAGAAAGGGATTAATGTTTAACCATATAGATCTCTATGTTCCAAAATTTGTCTGACCAGGTCATAATTAATCATTTGTCAAATTAGGTCATAATTAATCAACTGATTTTGGCTATTTTGGTGATTGCTTCTGTTGCTAGATATACTGGTCCAATGTTCTGTGTATTGTCTTCTCATTTTCATTTTGGTGCTTTCATTATTTCTGCATTTAGGTTTTATTTATGTTATCTTTTGGTAGAAGTGTAATAAAATGGGAAAAGGGATATAAAAGAAATTAATCGTGTATTTCTGGAACTGTCAAATGTGGTGTTTGGGTTGAGTTATGTCGGGTCAATTAGCTAGTTAGCATAGATAAAAAATGCGGTATCGGTCACAGTCGCGGGATCGGTCGCGGAGTCTCGGTAATGGAACTGATGCGTAAGTCGCGGACCGTGTCGCGCTTGTCGCGTAGGAATTTGAAATTTAAAAAGTTCAGCCCCATTCACTACCTACCCTAGTCCCCCTCCCCTAAACCGTACACGCGACATAATTAAAATGAATTCCTTTCTCTACCTTCTCTCTCCTCTCTTGTTTTAGATTTGAAAATGGAGTTCTCTACTCTATTTCAATGGAGTTTCTCCGTTTCTTTCATTTTTCCCTTTCTTTTTGTCGAAAACATGGGAATTCGACTGAAAACCAAGAAGATGTGAGGTTTATAACGTTTAATGCGGACAAAATTCGTTCGGATCGGTTGAACCCGGCCGAGATCTCGCCGTTTTTAAAAACACCTTTACAACTCGGGATATCTCGTATCGCCAGCCTCCAAAACCTTGTTAACTCGGGCAATACGAGTTAACTCGGGCTAGTTTTTACACCATGCTAGTTAGTCATGTTACTCATGTCAGGTAATTTCGTCCAGGCTAGTTTCTGACAGACTGATATTTCTATGTTCCAAATTTCCCCTAATCAGGTCATGGTTAAGCTATTGATTTTGGTTTCATTTCTTGAGGGATGGCAAAGTTGATCTATGCTTTTGTATTGCAGAGCCTTGACAGGATAAAATGCAGACTTTTGGAAGGTACAAACGCTCAGGAAGTCAGAGCTATGGTTCTTGATATGCAAGCAAGATTGTTGTTAGATATGCTTGGGAAAGGAATTGAATCAGCATTGATAAATCCTTCAGCTTTAGTTTCTGAGCCATGGCAGGCATCTGGCGAGATGTTGAATTCGATTGATTGTGAATCCATGGCTGTTGATCCAGCGCTGGTTTTAAGTGTGCAGGTCTTCATCCTGTGCTAGTCATTACGAAAAACGAAGTATTATTTATATGAGAAAATTCTAACTCGAAACACCATGAACCTAATTTTTAGCATTAATTCAGGCCTATGTTGATGCTGTTCTTGATCTTGCTTCGCATTTCATCACACGCCTGCGTCGTTATGCGAGCTTTTGTCGTACACTAGCGAATCATGCTGTTCAAGCAGGAACTGGTGGCAACCGGAGTATGGTGGCTAGTCCTGCACAAAGTTCTGCATCTCCTGCTCCAAGTCAGGGTGAGTGGTTGAATCTCTTCTTCACTTTGCTACCCCTCTCACCCAATATACAAGTCTCAATTATTTCTCCAGTCAAGCTCAAAGTTGCTTTCTGCTTGTAGAACTTCTTGAAAGTCTTTCCGGTGGTGGATAATAATGTGAAGATTGAATGTGTATAAATTATTATGTCTTTAATCTTCTAAGTTGTGGTTTCTCTAGATCAGTTTTTTGCTGTTTCTTGCGTAGACGGTGATGGATTTAGGAATCAAGTGTTGTCATTCCTAACCCTAAGCATGTGTGGAACTGTGGACTACATGCGTTCCGTTATTTTACTTTCTTTTCTTCTTCTTTCGTGTGTGTGTGTGTACCTGAGTACGATTAACAGAAGATATGTGTGTTGTGCTACGCTTTCAGGAGCTCAAAGCGGTACTACAAATTCAACAGGAAGCACACAAATGCATGCTTGGGTTCAAGGTGCTATTGCTAAGATTAGTGGTACAACAGATGGAGTTTCTACTTCAACTCCTAACCCTTTGAGCGGGCCAGCTCCGTATATGCCAATCAGCATAAATACTGGAACGTTTCCTGGTACACCTGCTGTCAGGCTTATAGGCGACTGCCATTTCCTCCATAGGTTGTGCCAACTTTTGCTATTCTGCTTTTTTTTCCGACGAGCACAGCTACCACGCTACATTCAGAGTTCCCAGAGAATGGCTGATGCAAGTATGCAAAAGCCCCAGTCCAATGCTGCAGGCAAGGTTGAAGAGAACCAGACAAAATCCATGCCAACAGCAGGCAGGCCAGAAGATTCTCAGGGTGCTCGTAGCAATCAGTTAGTTGCTGGTGTCAAAGTGGAAGATGGTCCTGCCAGCAGAGCAAGACTGGGGACTGGGAATGCTGGTCAAGGATACACTTTTGATGAGGTAAACATTTACTTATTTTTGTTTGCACCAGCATATGTCATCTTCAAAGTCTTATGTTTGCACTTCAATTTTATTTTCTGGCTTAATTATTTTGATTGACTATGAACAATCTGGAAATTGTTGCTCTGTACTCGTTGGAATTAATTATATCTTGGCGAGTCTGTCAAGTTCATAAAATGTAAGTTCATTCTGCTCTTGACTGCAGGTCAAAGTGCTGTTTCTGATATTGATGGATCTCTGTAAGCGAACAGCAGGCCTTGCGCACCCGTTGCCTGTTTCACAGGTTGGAATGTCCAGTATCCAGGTCCGGCTACATTATATTGATGGAAACTATACCGTCTTGCCAGAGGTTGTTGAAGCATCACTTGGCCCACATATGCAGGTAGTAAGCAATTATCTTTCTCTGGACATTTTTGTGATTCTCTGTTGGATCTTGCAGTAGCAACCTGACAAAGCAATAAAATGAGTTAAATCAATTATTATGCACTGTGAATCCCAAGTCCAAGTAGAGAAATTTTCTTGTTGTTTCTGATATATAATTGAATTAAGAACCTGGGAAACACTTGTTATTTCTTGGTTGTCGCTGTTAGACATTAATAGTGCTTCCGTCCCAAATTGCAAAATACGTCTGTTTTTGGTAGGGATAAGAAGTAGCTTATTTGTGGATTGTTTTTCTCGTGATCCGTTAATTGTGGTTGGAAAGGAATGGATACAGTTTCACGACTGCAGATGAGTGTTTGTTTTGTTGGGATGGGAATGTGAGTATTAATCCCATTTGCAACAAACTCCCATGTTTGTTATGTGGGATTGGGATTATAAGTGGGTAATGGGATTTGGAATGAAAAACCTTCTATTCCCTTGAAAAAAATATTTGTGGGGGGGGGGGGGGGGGGGNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNGCTTGTATTATGGGGTGGGAATTGGGAGGGGCATAAAATGACTAAAATGTGACTAAAGAAAAAGGAGGAGAATTGTTGATAAGGATAACTTTGTAAATTCCTTATTTATTTCTTACATATTTCCATCAACCACGAAACAAGGTACTGGTCATATTTTCATGAAATTCCCAACAACCAATTCCCAATCTTCATCCCACCTTCTAATCCCCAAACTCCCTTTCCCGCATACCAAATTCCCCAAAGGGTTTTATCTGTTTATGTTTAAGAGCCTTGAGGTCAGCCCACCAATTTACTGATGTTGACGCTTTGTACCCAAGGTCATGAGTTATAGCACCCCCCATCCTCTGCTGGGTAACCACCTACTGTTCTAATGTTTTTATGTTCTCTGCAGAATATGCCTAGGCCTAGAGGTGCAGATGCTGCTGGTTTACTTCTACGTGAGTTGGAACTCCACCCCCCTGCAGAAGAATGGAATCGACGGAACATGTTTGGGGGTCCATGGTCCGATCTTGATGAAATGGGCTCTGTAGATGATAGCTCTAAGTTAAGCGCCTCCATTGATCCAGTTGATTTAACTTCGATGGAAACTTGTGATGTTTCTTATGGAGTGCAAAGCCTATGGCCAAGGAAACGCAGGCTGTCTGAAAGGGATGCTGCTTTTGGCTTTAACACTTCAGTTGGTCTAGGAGCTTATCTTGGCATTATGGGATCTCGGCGAGATGTGGTGACTGCTGTTTGGAAGACTGGTTTGGAGGGTGTTTGGTTCAAGGTATGTTCGTCAGTTTATTTCTTTATCAACTTTCAAGGTAAGATCCCAGTGTGGACAATAAAACGCAGTCTAGTATAAGTATTGAAGTATGTTTTAGGTTGTATTTGAAGATGACCAACAAATAAAGTTCATGGGTTGTGTTGCGTACTTGCGTGGATTCTGTACCTCCAGAAATTCTATGCTGTCCAACCCTCTGCTAATAGAAAGAATAGGAAATATGCCTTCCCTGCATGCAATAGGAAAACTGAAAAAGAAGAGTATAAGAACAAATGTGCAAGAAAAGGGGGAAGAGAGGATAGGCAAAATTATAACACTGATAGAAAATATGAAACTATGTAAATTTGTATTTACTTATTTAGACAAGGAAACTTGAAGCCTATTATCTTTATATTTTAAAATAAATAGATAAATTTGCAGGGTCCTTATTAAGAAATATAAGATTTTGAGAAAGGAAGTAGTTAACCCGAAAACTCGATATTAAGAAAAAAATATAAGATTTTGAGAAAGGAAGTAGTTAACCCGAAAACTCAATATTAAGAAATGATTAGTATATAAATGTCTCTTGTACCAAAAGACATTCATTACAGTCGATTCAATAGCTTTGATATACTGGTTCTCATGATCTCTTGCACCAGAAGACACATTACAGAAATACAATAGCTTTTAATATACTGGTTCTTATGGTTTGCTAAAAACATTGACATCTCCATGATTCTTTGCTGCGACCCTCAAAAATGTTTACGCTTATATATATATGCTCTTTCTGCTAAGCTTCTAACACGTATTTTAATTCTTCATGCAGTGTATAAGGTGTTTGCGACAAACCTCGGCCTTTTCCCCAACAGATTCTAATGGTCCATCCAATCAGAATGACCGAGAGACATACTGGATAAGTCGCTGGGCTTATGGTTGCCCAATGTGTGGTGGAGCATGGGTCCGAGTTGTATAATTATTGTTAGCTAGGTAGGTTTTGTCTCTAACCACGTTTATCTGAGTTCTTCTTTCCATGGGTCAAAGTGAAGAAATTATGGATAAATGCTTTGCTCTTGTTGCTTGCTTCTACACCATTTGATTTTGACTGGGATTAATGTTTTGCTGACATGTTATTTCATCTTGTTCTTTAAATCACAGTTAGTTAGTGTACTGCTAGATGATGCATTGAAACTTTAGGTATGTTAAAAGTGAAAATTTGTCAATCACAACCTTTTGAAACACTTGACATTGCTACTTCTTTCCGGATTTTGGTGGAAAATTTCACCCCAACATTTGTCATGAGGTAAATTTCTGAGGTCACTCTGGAACCAAAAAAGATTTAGTAGGAAATACTCGTAGGAATGATAAATGTTAGTTGTTTGAAAAGGTAATGTACATTCTTTTTCCCTAAAAAAAGGCAACTCACAAAAGTAACCTACTTGCAGGTATTTGACAAATTGTCCTTTTCAGTATGTTAAATTTTAACTTGTATATTTTGTTTATCTTCCAGGTTGTACATTCTTTATTTTGAACTCTGGCACGAGAATAAGCATTATCTCTTCGTCGAAGCTGTTATGACAAGGCTTTATTGTCTGAGAAATACAGCAGATTACATTAGAGTGACTGAGGACATTGTAAACATATGATGTCATTATAGTCTCGACTCTGGTGGACGAAGAGGCTTGCATCGTCAGCATGCCCTCCAACTCTTCTGAGACTTGGTGGGGAAGAAGTTTACAAGGATCTTCGCCCTGGTTGA
mRNA sequence
AACGAAAACTCCAAAAACACCTGGAATCTTTCCCCTCTTTCTCACACACATTTGTCTCCTCTTCAAGAACCCTAATCTCAGCTAAAAATCCAATCTCCCCCAATTCAATTTCAAAAAATCACATTCACCCCACTCTTTCTCTCTCCTCCACAGTCATTGCTGTCGCCCTCAATCGCGACTTTCGACCACCACTACCCACAATTTTCCCCCTTTTCTCCCTCCTTTGTTTCCCCACTTTATTTTCTCTTTCCTCCTTCCCCATATAAATTAGGGCATCTTTTAGTGTTCCAAGAGATGACTTCTTCCTCTTCCACTAACAATAATAATAACCCTGAACCTGATTCTCCCCCTCCTCCTAATCCCCCTGCTTCTGAAACTGTAGCTGTTGAAACTGACAATCTTGAAGCTGAAGAAGAAGCTTTCATGGAGAAATCCATGGTTTCTGATGATCATATGGATGAAGATTCTGTTAATAATCATCCTGCTACTGTTTTTTGTATCACATTGAACCAATCTCGCTCCAATTTGCTTCATAAAATGAGTGTTCCTGATCTTTGTCGCAATTTTAGTGCTGTAGCCTGGTGTGGGAAGCTGAACGCCGTAGCATGTGCTTCAGAAACTTGTGCTCGAATTCCAAGTTCTAATGCCAATCCACCATTTTGGATCCCTATACACATAGTGATTCCAGAGCGACCCACAGAGAGTTCCATATTCAATGTCATAGCAGATAGTCCCCGCGATTCAGTTCAGTTTATTGAATGGTCTCCACCTTCTTGTCCGCGGGCATTATTGATTGCAAACTTCTATGGAAGGGTTACTATTTGGTCTCAGCAGACTCAGGGACCAGCTAACCTAGTGCGAGATGTTAGTCGCTGGCAGCCTGAACATGAATGGAGGCAAGATATTGCAGTTGTGACAAAGTGGCTCTCTGGTGTTTCTCCGTATAGGTGGCTTTCTGCAAAATCCAATAATTCTACAAGCTTGAAGTCAACTTTTGAGGAGAAGTTCCTTCCTCAGCATCCTCAAACGTCAGCTAGGTGGCCGAATATCCTATGTGTATGTTCTGTATTCTCTTCAGGGTCGGTCCAACTTCATTGGTCTCAGTGGCCACCTCGAGAGGGTGCATCAGCGAAATGGTTTTGCACAAGCAAAGGGCTTTTGGGGGCTGGACCCAGCGGTATAATGGCAGCTGATGCTATTGTGACTGACTCAGGAGCCATGCATATCGCAGGTGTCCCTATTGTAAATCCATCAACTGTCGTAATCTGGGAGGTCACACCAGGACCTGGCAATGGATTAGAAGCAACTCCAAAGACGACAATAAACAATGCAGTCCCTCCGTCTGTTGACCCCCATAGTTGGTCAGGATTTGCCCCTCTTGCAGCTTATTTGTTTGACTGGCAAGAATACTTCATATCAGAAGGAAAACTAGGGAAAAAGCAGTCATATAAAGACTTCACGGAAAATGTATCATTGTATTGTTCACCAGTCTCAAATTTTTCGGCATATGTGAGTCCAGAGGCTGCATCACAATCAACTGCAACTACTACATGGGGTTCTGGTGTAACGGCCGTTTCATTTGATCCAACTCGTGGGGGTTCAGTCATCACTGTGGTGATAGTTGAGGGACAGTACATGTCCCCATATGACCCTGATGAGGGGCCCTCAATCACAGGATGGAGAGTGCAGCGTTGGGAATCATGCCTTAAACCTGTTGTCCTTCACCAGATATTTGGTAGTCCTACCAATTTCGGAGGGCAGCCACCTATGCAAACAGTTTGGGAGACCAAGGTCAACAAGAGCATGCCACCAACTGATGATTTAAAGTCTCAGCAAGCTGCATCAGTAGGACCAACTTCTGATCTGCAGCCAAAACCCGAGTCTTCTGGTGATAAGGCAAAAAGGGTCACCTTTGATCACTTTAACATGCCCAGTGACGTTAGGACACTTGCTCGCATTGTATACTCGGCTCATGGTGGTGAAATTGCTATTGCTTTTTTACGTGGCGGGATCCATATTTTTTCAGGGCCAAACTTTACACCTGTTGATACATACCAGATTAATGTTGGCTCAGCGATAGCTGCTCCTGCTTTTTCTTCAACAAGCTGTTGTTCAGCTTCTGTATGGCATGACACCAGTAAAGACTGCACTATGCTGAAGATAATCCGTGTTCTTCCACCTGCTGTTCCCAGCAGTCAAGCAAAAGCCAGTTCTCCTTCATGGGAACGTGCAATTGCAGAAAGGTTTTGGTGGAGTCTTTTGGTCGGAGTAGATTGGTGGGATGCTGTCGGGTGTACACAAAGTGCTGCTGAAGATGGAATTGTTTCCTTGAACAGCGTTATTGCAGTTTTGGATGCTGACTTCCATTCCTTGCCCTCTACTCAGCACAGGCTGCAATATGGACCTAGCCTTGACAGGATAAAATGCAGACTTTTGGAAGGTACAAACGCTCAGGAAGTCAGAGCTATGGTTCTTGATATGCAAGCAAGATTGTTGTTAGATATGCTTGGGAAAGGAATTGAATCAGCATTGATAAATCCTTCAGCTTTAGTTTCTGAGCCATGGCAGGCATCTGGCGAGATGTTGAATTCGATTGATTGTGAATCCATGGCTGTTGATCCAGCGCTGGTTTTAAGTGTGCAGGCCTATGTTGATGCTGTTCTTGATCTTGCTTCGCATTTCATCACACGCCTGCGTCGTTATGCGAGCTTTTGTCGTACACTAGCGAATCATGCTGTTCAAGCAGGAACTGGTGGCAACCGGAGTATGGTGGCTAGTCCTGCACAAAGTTCTGCATCTCCTGCTCCAAGTCAGGGAGCTCAAAGCGGTACTACAAATTCAACAGGAAGCACACAAATGCATGCTTGGGTTCAAGGTGCTATTGCTAAGATTAGTGGTACAACAGATGGAGTTTCTACTTCAACTCCTAACCCTTTGAGCGGGCCAGCTCCGTATATGCCAATCAGCATAAATACTGGAACGTTTCCTGGTACACCTGCTGTCAGGCTTATAGGCGACTGCCATTTCCTCCATAGGTTGTGCCAACTTTTGCTATTCTGCTTTTTTTTCCGACGAGCACAGCTACCACGCTACATTCAGAGTTCCCAGAGAATGGCTGATGCAAGTATGCAAAAGCCCCAGTCCAATGCTGCAGGCAAGGTTGAAGAGAACCAGACAAAATCCATGCCAACAGCAGGCAGGCCAGAAGATTCTCAGGGTGCTCGTAGCAATCAGTTAGTTGCTGGTGTCAAAGTGGAAGATGGTCCTGCCAGCAGAGCAAGACTGGGGACTGGGAATGCTGGTCAAGGATACACTTTTGATGAGGTCAAAGTGCTGTTTCTGATATTGATGGATCTCTGTAAGCGAACAGCAGGCCTTGCGCACCCGTTGCCTGTTTCACAGGTTGGAATGTCCAGTATCCAGGTCCGGCTACATTATATTGATGGAAACTATACCGTCTTGCCAGAGGTTGTTGAAGCATCACTTGGCCCACATATGCAGAATATGCCTAGGCCTAGAGGTGCAGATGCTGCTGGTTTACTTCTACGTGAGTTGGAACTCCACCCCCCTGCAGAAGAATGGAATCGACGGAACATGTTTGGGGGTCCATGGTCCGATCTTGATGAAATGGGCTCTGTAGATGATAGCTCTAAGTTAAGCGCCTCCATTGATCCAGTTGATTTAACTTCGATGGAAACTTGTGATGTTTCTTATGGAGTGCAAAGCCTATGGCCAAGGAAACGCAGGCTGTCTGAAAGGGATGCTGCTTTTGGCTTTAACACTTCAGTTGGTCTAGGAGCTTATCTTGGCATTATGGGATCTCGGCGAGATGTGGTGACTGCTGTTTGGAAGACTGGTTTGGAGGGTGTTTGGTTCAAGTGTATAAGGTGTTTGCGACAAACCTCGGCCTTTTCCCCAACAGATTCTAATGGTCCATCCAATCAGAATGACCGAGAGACATACTGGATAAGTCGCTGGGCTTATGGTTGCCCAATGTGTGGTGGAGCATGGTCTCGACTCTGGTGGACGAAGAGGCTTGCATCGTCAGCATGCCCTCCAACTCTTCTGAGACTTGGTGGGGAAGAAGTTTACAAGGATCTTCGCCCTGGTTGA
Coding sequence (CDS)
ATGACTTCTTCCTCTTCCACTAACAATAATAATAACCCTGAACCTGATTCTCCCCCTCCTCCTAATCCCCCTGCTTCTGAAACTGTAGCTGTTGAAACTGACAATCTTGAAGCTGAAGAAGAAGCTTTCATGGAGAAATCCATGGTTTCTGATGATCATATGGATGAAGATTCTGTTAATAATCATCCTGCTACTGTTTTTTGTATCACATTGAACCAATCTCGCTCCAATTTGCTTCATAAAATGAGTGTTCCTGATCTTTGTCGCAATTTTAGTGCTGTAGCCTGGTGTGGGAAGCTGAACGCCGTAGCATGTGCTTCAGAAACTTGTGCTCGAATTCCAAGTTCTAATGCCAATCCACCATTTTGGATCCCTATACACATAGTGATTCCAGAGCGACCCACAGAGAGTTCCATATTCAATGTCATAGCAGATAGTCCCCGCGATTCAGTTCAGTTTATTGAATGGTCTCCACCTTCTTGTCCGCGGGCATTATTGATTGCAAACTTCTATGGAAGGGTTACTATTTGGTCTCAGCAGACTCAGGGACCAGCTAACCTAGTGCGAGATGTTAGTCGCTGGCAGCCTGAACATGAATGGAGGCAAGATATTGCAGTTGTGACAAAGTGGCTCTCTGGTGTTTCTCCGTATAGGTGGCTTTCTGCAAAATCCAATAATTCTACAAGCTTGAAGTCAACTTTTGAGGAGAAGTTCCTTCCTCAGCATCCTCAAACGTCAGCTAGGTGGCCGAATATCCTATGTGTATGTTCTGTATTCTCTTCAGGGTCGGTCCAACTTCATTGGTCTCAGTGGCCACCTCGAGAGGGTGCATCAGCGAAATGGTTTTGCACAAGCAAAGGGCTTTTGGGGGCTGGACCCAGCGGTATAATGGCAGCTGATGCTATTGTGACTGACTCAGGAGCCATGCATATCGCAGGTGTCCCTATTGTAAATCCATCAACTGTCGTAATCTGGGAGGTCACACCAGGACCTGGCAATGGATTAGAAGCAACTCCAAAGACGACAATAAACAATGCAGTCCCTCCGTCTGTTGACCCCCATAGTTGGTCAGGATTTGCCCCTCTTGCAGCTTATTTGTTTGACTGGCAAGAATACTTCATATCAGAAGGAAAACTAGGGAAAAAGCAGTCATATAAAGACTTCACGGAAAATGTATCATTGTATTGTTCACCAGTCTCAAATTTTTCGGCATATGTGAGTCCAGAGGCTGCATCACAATCAACTGCAACTACTACATGGGGTTCTGGTGTAACGGCCGTTTCATTTGATCCAACTCGTGGGGGTTCAGTCATCACTGTGGTGATAGTTGAGGGACAGTACATGTCCCCATATGACCCTGATGAGGGGCCCTCAATCACAGGATGGAGAGTGCAGCGTTGGGAATCATGCCTTAAACCTGTTGTCCTTCACCAGATATTTGGTAGTCCTACCAATTTCGGAGGGCAGCCACCTATGCAAACAGTTTGGGAGACCAAGGTCAACAAGAGCATGCCACCAACTGATGATTTAAAGTCTCAGCAAGCTGCATCAGTAGGACCAACTTCTGATCTGCAGCCAAAACCCGAGTCTTCTGGTGATAAGGCAAAAAGGGTCACCTTTGATCACTTTAACATGCCCAGTGACGTTAGGACACTTGCTCGCATTGTATACTCGGCTCATGGTGGTGAAATTGCTATTGCTTTTTTACGTGGCGGGATCCATATTTTTTCAGGGCCAAACTTTACACCTGTTGATACATACCAGATTAATGTTGGCTCAGCGATAGCTGCTCCTGCTTTTTCTTCAACAAGCTGTTGTTCAGCTTCTGTATGGCATGACACCAGTAAAGACTGCACTATGCTGAAGATAATCCGTGTTCTTCCACCTGCTGTTCCCAGCAGTCAAGCAAAAGCCAGTTCTCCTTCATGGGAACGTGCAATTGCAGAAAGGTTTTGGTGGAGTCTTTTGGTCGGAGTAGATTGGTGGGATGCTGTCGGGTGTACACAAAGTGCTGCTGAAGATGGAATTGTTTCCTTGAACAGCGTTATTGCAGTTTTGGATGCTGACTTCCATTCCTTGCCCTCTACTCAGCACAGGCTGCAATATGGACCTAGCCTTGACAGGATAAAATGCAGACTTTTGGAAGGTACAAACGCTCAGGAAGTCAGAGCTATGGTTCTTGATATGCAAGCAAGATTGTTGTTAGATATGCTTGGGAAAGGAATTGAATCAGCATTGATAAATCCTTCAGCTTTAGTTTCTGAGCCATGGCAGGCATCTGGCGAGATGTTGAATTCGATTGATTGTGAATCCATGGCTGTTGATCCAGCGCTGGTTTTAAGTGTGCAGGCCTATGTTGATGCTGTTCTTGATCTTGCTTCGCATTTCATCACACGCCTGCGTCGTTATGCGAGCTTTTGTCGTACACTAGCGAATCATGCTGTTCAAGCAGGAACTGGTGGCAACCGGAGTATGGTGGCTAGTCCTGCACAAAGTTCTGCATCTCCTGCTCCAAGTCAGGGAGCTCAAAGCGGTACTACAAATTCAACAGGAAGCACACAAATGCATGCTTGGGTTCAAGGTGCTATTGCTAAGATTAGTGGTACAACAGATGGAGTTTCTACTTCAACTCCTAACCCTTTGAGCGGGCCAGCTCCGTATATGCCAATCAGCATAAATACTGGAACGTTTCCTGGTACACCTGCTGTCAGGCTTATAGGCGACTGCCATTTCCTCCATAGGTTGTGCCAACTTTTGCTATTCTGCTTTTTTTTCCGACGAGCACAGCTACCACGCTACATTCAGAGTTCCCAGAGAATGGCTGATGCAAGTATGCAAAAGCCCCAGTCCAATGCTGCAGGCAAGGTTGAAGAGAACCAGACAAAATCCATGCCAACAGCAGGCAGGCCAGAAGATTCTCAGGGTGCTCGTAGCAATCAGTTAGTTGCTGGTGTCAAAGTGGAAGATGGTCCTGCCAGCAGAGCAAGACTGGGGACTGGGAATGCTGGTCAAGGATACACTTTTGATGAGGTCAAAGTGCTGTTTCTGATATTGATGGATCTCTGTAAGCGAACAGCAGGCCTTGCGCACCCGTTGCCTGTTTCACAGGTTGGAATGTCCAGTATCCAGGTCCGGCTACATTATATTGATGGAAACTATACCGTCTTGCCAGAGGTTGTTGAAGCATCACTTGGCCCACATATGCAGAATATGCCTAGGCCTAGAGGTGCAGATGCTGCTGGTTTACTTCTACGTGAGTTGGAACTCCACCCCCCTGCAGAAGAATGGAATCGACGGAACATGTTTGGGGGTCCATGGTCCGATCTTGATGAAATGGGCTCTGTAGATGATAGCTCTAAGTTAAGCGCCTCCATTGATCCAGTTGATTTAACTTCGATGGAAACTTGTGATGTTTCTTATGGAGTGCAAAGCCTATGGCCAAGGAAACGCAGGCTGTCTGAAAGGGATGCTGCTTTTGGCTTTAACACTTCAGTTGGTCTAGGAGCTTATCTTGGCATTATGGGATCTCGGCGAGATGTGGTGACTGCTGTTTGGAAGACTGGTTTGGAGGGTGTTTGGTTCAAGTGTATAAGGTGTTTGCGACAAACCTCGGCCTTTTCCCCAACAGATTCTAATGGTCCATCCAATCAGAATGACCGAGAGACATACTGGATAAGTCGCTGGGCTTATGGTTGCCCAATGTGTGGTGGAGCATGGTCTCGACTCTGGTGGACGAAGAGGCTTGCATCGTCAGCATGCCCTCCAACTCTTCTGAGACTTGGTGGGGAAGAAGTTTACAAGGATCTTCGCCCTGGTTGA
Protein sequence
MTSSSSTNNNNNPEPDSPPPPNPPASETVAVETDNLEAEEEAFMEKSMVSDDHMDEDSVNNHPATVFCITLNQSRSNLLHKMSVPDLCRNFSAVAWCGKLNAVACASETCARIPSSNANPPFWIPIHIVIPERPTESSIFNVIADSPRDSVQFIEWSPPSCPRALLIANFYGRVTIWSQQTQGPANLVRDVSRWQPEHEWRQDIAVVTKWLSGVSPYRWLSAKSNNSTSLKSTFEEKFLPQHPQTSARWPNILCVCSVFSSGSVQLHWSQWPPREGASAKWFCTSKGLLGAGPSGIMAADAIVTDSGAMHIAGVPIVNPSTVVIWEVTPGPGNGLEATPKTTINNAVPPSVDPHSWSGFAPLAAYLFDWQEYFISEGKLGKKQSYKDFTENVSLYCSPVSNFSAYVSPEAASQSTATTTWGSGVTAVSFDPTRGGSVITVVIVEGQYMSPYDPDEGPSITGWRVQRWESCLKPVVLHQIFGSPTNFGGQPPMQTVWETKVNKSMPPTDDLKSQQAASVGPTSDLQPKPESSGDKAKRVTFDHFNMPSDVRTLARIVYSAHGGEIAIAFLRGGIHIFSGPNFTPVDTYQINVGSAIAAPAFSSTSCCSASVWHDTSKDCTMLKIIRVLPPAVPSSQAKASSPSWERAIAERFWWSLLVGVDWWDAVGCTQSAAEDGIVSLNSVIAVLDADFHSLPSTQHRLQYGPSLDRIKCRLLEGTNAQEVRAMVLDMQARLLLDMLGKGIESALINPSALVSEPWQASGEMLNSIDCESMAVDPALVLSVQAYVDAVLDLASHFITRLRRYASFCRTLANHAVQAGTGGNRSMVASPAQSSASPAPSQGAQSGTTNSTGSTQMHAWVQGAIAKISGTTDGVSTSTPNPLSGPAPYMPISINTGTFPGTPAVRLIGDCHFLHRLCQLLLFCFFFRRAQLPRYIQSSQRMADASMQKPQSNAAGKVEENQTKSMPTAGRPEDSQGARSNQLVAGVKVEDGPASRARLGTGNAGQGYTFDEVKVLFLILMDLCKRTAGLAHPLPVSQVGMSSIQVRLHYIDGNYTVLPEVVEASLGPHMQNMPRPRGADAAGLLLRELELHPPAEEWNRRNMFGGPWSDLDEMGSVDDSSKLSASIDPVDLTSMETCDVSYGVQSLWPRKRRLSERDAAFGFNTSVGLGAYLGIMGSRRDVVTAVWKTGLEGVWFKCIRCLRQTSAFSPTDSNGPSNQNDRETYWISRWAYGCPMCGGAWSRLWWTKRLASSACPPTLLRLGGEEVYKDLRPG
Homology
BLAST of Spo14124.1 vs. NCBI nr
Match:
gi|902151894|gb|KNA05569.1| (hypothetical protein SOVF_189130 [Spinacia oleracea])
HSP 1 Score: 2506.9 bits (6496), Expect = 0.000e+0
Identity = 1240/1242 (99.84%), Postives = 1241/1242 (99.92%), Query Frame = 1
Query: 1 MTSSSSTNNNNNPEPDSPPPPNPPASETVAVETDNLEAEEEAFMEKSMVSDDHMDEDSVN 60
MTSSSSTNNNNNPEPDSPPPPNPPASETVAVETDNLEAEEEAFMEKSMVSDDHMDEDSVN
Sbjct: 1 MTSSSSTNNNNNPEPDSPPPPNPPASETVAVETDNLEAEEEAFMEKSMVSDDHMDEDSVN 60
Query: 61 NHPATVFCITLNQSRSNLLHKMSVPDLCRNFSAVAWCGKLNAVACASETCARIPSSNANP 120
NHPATVFCITLNQSRSNLLHKMSVPDLCRNFSAVAWCGKLNAVACASETCARIPSSNANP
Sbjct: 61 NHPATVFCITLNQSRSNLLHKMSVPDLCRNFSAVAWCGKLNAVACASETCARIPSSNANP 120
Query: 121 PFWIPIHIVIPERPTESSIFNVIADSPRDSVQFIEWSPPSCPRALLIANFYGRVTIWSQQ 180
PFWIPIHIVIPERPTESSIFNVIADSPRDSVQFIEWSPPSCPRALLIANFYGRVTIWSQQ
Sbjct: 121 PFWIPIHIVIPERPTESSIFNVIADSPRDSVQFIEWSPPSCPRALLIANFYGRVTIWSQQ 180
Query: 181 TQGPANLVRDVSRWQPEHEWRQDIAVVTKWLSGVSPYRWLSAKSNNSTSLKSTFEEKFLP 240
TQGPANLVRDVSRWQPEHEWRQDIAVVTKWLSGVSPYRWLSAKSNNSTSLKSTFEEKFLP
Sbjct: 181 TQGPANLVRDVSRWQPEHEWRQDIAVVTKWLSGVSPYRWLSAKSNNSTSLKSTFEEKFLP 240
Query: 241 QHPQTSARWPNILCVCSVFSSGSVQLHWSQWPPREGASAKWFCTSKGLLGAGPSGIMAAD 300
QHPQTSARWPNILCVCSVFSSGSVQLHWSQWPPREGASAKWFCTSKGLLGAGPSGIMAAD
Sbjct: 241 QHPQTSARWPNILCVCSVFSSGSVQLHWSQWPPREGASAKWFCTSKGLLGAGPSGIMAAD 300
Query: 301 AIVTDSGAMHIAGVPIVNPSTVVIWEVTPGPGNGLEATPKTTINNAVPPSVDPHSWSGFA 360
AIVTDSGAMHIAGVPIVNPSTVVIWEVTPGPGNGLEATPKTTINNAVPPSVDPHSWSGFA
Sbjct: 301 AIVTDSGAMHIAGVPIVNPSTVVIWEVTPGPGNGLEATPKTTINNAVPPSVDPHSWSGFA 360
Query: 361 PLAAYLFDWQEYFISEGKLGKKQSYKDFTENVSLYCSPVSNFSAYVSPEAASQSTATTTW 420
PLAAYLFDWQEYFISEGKLGKKQSYKDFTENVSLYCSPVSNFSAYVSPEAASQSTATTTW
Sbjct: 361 PLAAYLFDWQEYFISEGKLGKKQSYKDFTENVSLYCSPVSNFSAYVSPEAASQSTATTTW 420
Query: 421 GSGVTAVSFDPTRGGSVITVVIVEGQYMSPYDPDEGPSITGWRVQRWESCLKPVVLHQIF 480
GSGVTAVSFDPTRGGSVITVVIVEGQYMSPYDPDEGPSITGWRVQRWESCLKPVVLHQIF
Sbjct: 421 GSGVTAVSFDPTRGGSVITVVIVEGQYMSPYDPDEGPSITGWRVQRWESCLKPVVLHQIF 480
Query: 481 GSPTNFGGQPPMQTVWETKVNKSMPPTDDLKSQQAASVGPTSDLQPKPESSGDKAKRVTF 540
GSPTNFGGQPPMQTVWETKVNKSMPPTDDLKSQQAASVGPTSDLQPKPESSGDKAKRVTF
Sbjct: 481 GSPTNFGGQPPMQTVWETKVNKSMPPTDDLKSQQAASVGPTSDLQPKPESSGDKAKRVTF 540
Query: 541 DHFNMPSDVRTLARIVYSAHGGEIAIAFLRGGIHIFSGPNFTPVDTYQINVGSAIAAPAF 600
DHFNMPSDVRTLARIVYSAHGGEIAIAFLRGGIHIFSGPNFTPVDTYQINVGSAIAAPAF
Sbjct: 541 DHFNMPSDVRTLARIVYSAHGGEIAIAFLRGGIHIFSGPNFTPVDTYQINVGSAIAAPAF 600
Query: 601 SSTSCCSASVWHDTSKDCTMLKIIRVLPPAVPSSQAKASSPSWERAIAERFWWSLLVGVD 660
SSTSCCSASVWHDTSKDCTMLKIIRVLPPAVPSSQAKASSPSWERAIAERFWWSLLVGVD
Sbjct: 601 SSTSCCSASVWHDTSKDCTMLKIIRVLPPAVPSSQAKASSPSWERAIAERFWWSLLVGVD 660
Query: 661 WWDAVGCTQSAAEDGIVSLNSVIAVLDADFHSLPSTQHRLQYGPSLDRIKCRLLEGTNAQ 720
WWDAVGCTQSAAEDGIVSLNSVIAVLDADFHSLPSTQHRLQYGPSLDRIKCRLLEGTNAQ
Sbjct: 661 WWDAVGCTQSAAEDGIVSLNSVIAVLDADFHSLPSTQHRLQYGPSLDRIKCRLLEGTNAQ 720
Query: 721 EVRAMVLDMQARLLLDMLGKGIESALINPSALVSEPWQASGEMLNSIDCESMAVDPALVL 780
EVRAMVLDMQARLLLDMLGKGIESALINPSALVSEPWQASGEMLNSIDCESMAVDPALVL
Sbjct: 721 EVRAMVLDMQARLLLDMLGKGIESALINPSALVSEPWQASGEMLNSIDCESMAVDPALVL 780
Query: 781 SVQAYVDAVLDLASHFITRLRRYASFCRTLANHAVQAGTGGNRSMVASPAQSSASPAPSQ 840
SVQAYVDAVLDLASHFITRLRRYASFCRTLANHAVQAGTGGNRSMVASPAQSSASPAPSQ
Sbjct: 781 SVQAYVDAVLDLASHFITRLRRYASFCRTLANHAVQAGTGGNRSMVASPAQSSASPAPSQ 840
Query: 841 GAQSGTTNSTGSTQMHAWVQGAIAKISGTTDGVSTSTPNPLSGPAPYMPISINTGTFPGT 900
GAQSGTTNSTGSTQMHAWVQGAIAKISGTTDGVSTSTPNPLSGPAPYMPISINTGTFPGT
Sbjct: 841 GAQSGTTNSTGSTQMHAWVQGAIAKISGTTDGVSTSTPNPLSGPAPYMPISINTGTFPGT 900
Query: 901 PAVRLIGDCHFLHRLCQLLLFCFFFRRAQLPRYIQSSQRMADASMQKPQSNAAGKVEENQ 960
PAVRLIGDCHFLHRLCQLLLFCFFFRRAQLPRYIQSSQRMADASMQKPQSNAAGKVEENQ
Sbjct: 901 PAVRLIGDCHFLHRLCQLLLFCFFFRRAQLPRYIQSSQRMADASMQKPQSNAAGKVEENQ 960
Query: 961 TKSMPTAGRPEDSQGARSNQLVAGVKVEDGPASRARLGTGNAGQGYTFDEVKVLFLILMD 1020
TKSMPTAGRPEDSQGARSNQLVAGVKVEDGPASRARLGTGNAGQGYTFDEVKVLFLILMD
Sbjct: 961 TKSMPTAGRPEDSQGARSNQLVAGVKVEDGPASRARLGTGNAGQGYTFDEVKVLFLILMD 1020
Query: 1021 LCKRTAGLAHPLPVSQVGMSSIQVRLHYIDGNYTVLPEVVEASLGPHMQNMPRPRGADAA 1080
LCKRTAGLAHPLPVSQVGMSSIQVRLHYIDGNYTVLPEVVEASLGPHMQNMPRPRGADAA
Sbjct: 1021 LCKRTAGLAHPLPVSQVGMSSIQVRLHYIDGNYTVLPEVVEASLGPHMQNMPRPRGADAA 1080
Query: 1081 GLLLRELELHPPAEEWNRRNMFGGPWSDLDEMGSVDDSSKLSASIDPVDLTSMETCDVSY 1140
GLLLRELELHPPAEEWNRRNMFGGPWSDLDEMGSVDDSSKLSASIDPVDLTSMETCDVSY
Sbjct: 1081 GLLLRELELHPPAEEWNRRNMFGGPWSDLDEMGSVDDSSKLSASIDPVDLTSMETCDVSY 1140
Query: 1141 GVQSLWPRKRRLSERDAAFGFNTSVGLGAYLGIMGSRRDVVTAVWKTGLEGVWFKCIRCL 1200
GVQSLWPRKRRLSERDAAFGFNTSVGLGAYLGIMGSRRDVVTAVWKTGLEGVWFKCIRCL
Sbjct: 1141 GVQSLWPRKRRLSERDAAFGFNTSVGLGAYLGIMGSRRDVVTAVWKTGLEGVWFKCIRCL 1200
Query: 1201 RQTSAFSPTDSNGPSNQNDRETYWISRWAYGCPMCGGAWSRL 1243
RQTSAFSPTDSNGPSNQNDRETYWISRWAYGCPMCGGAW R+
Sbjct: 1201 RQTSAFSPTDSNGPSNQNDRETYWISRWAYGCPMCGGAWVRV 1242
BLAST of Spo14124.1 vs. NCBI nr
Match:
gi|731350504|ref|XP_010686537.1| (PREDICTED: mediator of RNA polymerase II transcription subunit 16 isoform X1 [Beta vulgaris subsp. vulgaris])
HSP 1 Score: 2253.4 bits (5838), Expect = 0.000e+0
Identity = 1114/1245 (89.48%), Postives = 1165/1245 (93.57%), Query Frame = 1
Query: 2 TSSSSTNNNNNPEPDSPPPP--NPPASETVAVETDNLEAEEEAFMEKSMVSDDHMDEDSV 61
T++++TN NN+ EPDS P +PP+SE + VE + + EEE FMEKS D M+EDS+
Sbjct: 10 TNNNNTNTNNDSEPDSSSSPVLHPPSSELLVVEAEQ-QQEEEEFMEKS--DDPTMEEDSI 69
Query: 62 NNH--PATVFCITLNQSRSNLLHKMSVPDLCRNFSAVAWCGKLNAVACASETCARIPSSN 121
NN+ PATVFCI L Q R+NLLHKMSVPDLCRNFSAVAWCGKLNA+ACASETCARIPSSN
Sbjct: 70 NNNNNPATVFCIKLKQPRANLLHKMSVPDLCRNFSAVAWCGKLNAIACASETCARIPSSN 129
Query: 122 ANPPFWIPIHIVIPERPTESSIFNVIADSPRDSVQFIEWSPPSCPRALLIANFYGRVTIW 181
ANPPFWIPIHIVIPERPTES++FNVIAD PRDSVQFIEWSPPSCPRALLIANFYGRVTIW
Sbjct: 130 ANPPFWIPIHIVIPERPTESAVFNVIADCPRDSVQFIEWSPPSCPRALLIANFYGRVTIW 189
Query: 182 SQQTQGPANLVRDVSRWQPEHEWRQDIAVVTKWLSGVSPYRWLSAKSNNSTSLKSTFEEK 241
SQ +QGPANLVRD S WQ EHEWRQDIAVVTKWLSGVSPYRWLSAKSNN+TS KSTFEEK
Sbjct: 190 SQPSQGPANLVRDASSWQHEHEWRQDIAVVTKWLSGVSPYRWLSAKSNNNTSSKSTFEEK 249
Query: 242 FLPQHPQTSARWPNILCVCSVFSSGSVQLHWSQWPPREGASAKWFCTSKGLLGAGPSGIM 301
FLPQ PQTSARWPNILCVCSVFSSGS+QLHWSQWP R+GA+AKWFCTSKGLLGAGPSGIM
Sbjct: 250 FLPQQPQTSARWPNILCVCSVFSSGSIQLHWSQWPSRDGAAAKWFCTSKGLLGAGPSGIM 309
Query: 302 AADAIVTDSGAMHIAGVPIVNPSTVVIWEVTPGPGNGLEATPKTTINNAVPPSVDPHSWS 361
AADAIVTDSGAMH+AGVPIVNPSTVV+WEVTPGPGNG EATPKTTI+N VPPS +P SWS
Sbjct: 310 AADAIVTDSGAMHVAGVPIVNPSTVVVWEVTPGPGNGFEATPKTTISNVVPPSANPPSWS 369
Query: 362 GFAPLAAYLFDWQEYFISEGKLGKKQSYKDFTENVSLYCSPVSNFSAYVSPEAASQSTAT 421
GFAPLAAYLFDWQEYFISE K GKK SYKDFTE +SLYCSPVSNFSAYVSPEAASQS AT
Sbjct: 370 GFAPLAAYLFDWQEYFISEAKQGKKHSYKDFTECISLYCSPVSNFSAYVSPEAASQSAAT 429
Query: 422 TTWGSGVTAVSFDPTRGGSVITVVIVEGQYMSPYDPDEGPSITGWRVQRWESCLKPVVLH 481
TTWGSGVTAVSFDPTRGGSVITVVIVEGQYMSPYDPDEGPSITGWRVQRWESCLKPVVLH
Sbjct: 430 TTWGSGVTAVSFDPTRGGSVITVVIVEGQYMSPYDPDEGPSITGWRVQRWESCLKPVVLH 489
Query: 482 QIFGSPTNFGGQPPMQTVWETKVNKSMPPTDDLKSQQAASVGPTSDLQPKPESSGDKAKR 541
QIFGSP NFGGQPPMQTVWETKVNKS+PPTDD KSQQAAS GPTSD+ PKPESS DK KR
Sbjct: 490 QIFGSPNNFGGQPPMQTVWETKVNKSIPPTDDFKSQQAASAGPTSDVHPKPESSADKTKR 549
Query: 542 VTFDHFNMPSDVRTLARIVYSAHGGEIAIAFLRGGIHIFSGPNFTPVDTYQINVGSAIAA 601
V+FDHFNMPSDVRTLARIVYSAHGGEIAIAFLRGGIHIFSGP FTPVDTYQI VGS IAA
Sbjct: 550 VSFDHFNMPSDVRTLARIVYSAHGGEIAIAFLRGGIHIFSGPTFTPVDTYQIYVGSTIAA 609
Query: 602 PAFSSTSCCSASVWHDTSKDCTMLKIIRVLPPAVPSSQAKASSPSWERAIAERFWWSLLV 661
PAFSSTSCCSASVWHDTSKDCTMLKIIRVLPPA+P +QAKASSP+WERAIAERFWWSLLV
Sbjct: 610 PAFSSTSCCSASVWHDTSKDCTMLKIIRVLPPALPKNQAKASSPTWERAIAERFWWSLLV 669
Query: 662 GVDWWDAVGCTQSAAEDGIVSLNSVIAVLDADFHSLPSTQHRLQYGPSLDRIKCRLLEGT 721
GVDWWDAVGCTQSAAEDGIVSLNSVIAVLDADFHSLPSTQHRLQYGPSLDRIKCRLLEGT
Sbjct: 670 GVDWWDAVGCTQSAAEDGIVSLNSVIAVLDADFHSLPSTQHRLQYGPSLDRIKCRLLEGT 729
Query: 722 NAQEVRAMVLDMQARLLLDMLGKGIESALINPSALVSEPWQASGEMLNSIDCESMAVDPA 781
NAQEVRAMVLDMQARLLLDMLGKGIESALINPSALVSEPWQASGEMLNS + ESMAVDPA
Sbjct: 730 NAQEVRAMVLDMQARLLLDMLGKGIESALINPSALVSEPWQASGEMLNSFEIESMAVDPA 789
Query: 782 LVLSVQAYVDAVLDLASHFITRLRRYASFCRTLANHAVQAGTGGNRSMVASPAQSSASPA 841
LVLSVQAYVDAVLDLASHFITRLRRYASFCRTLANHAVQAGTGGNRSMVASP S+ASPA
Sbjct: 790 LVLSVQAYVDAVLDLASHFITRLRRYASFCRTLANHAVQAGTGGNRSMVASPTLSAASPA 849
Query: 842 PSQGAQSGTTNSTGSTQMHAWVQGAIAKISGTTDGVSTSTPNPLSGPAPYMPISINTGTF 901
PSQGAQSGTT+STGSTQM AWVQGAIAKISGTTDGV +S PNPLSGP YMPISINTGTF
Sbjct: 850 PSQGAQSGTTSSTGSTQMQAWVQGAIAKISGTTDGVPSSAPNPLSGPTSYMPISINTGTF 909
Query: 902 PGTPAVRLIGDCHFLHRLCQLLLFCFFFRRAQLPRYIQSSQRMADASMQKPQSNAAGKVE 961
PGTPAVRLIGDCHFLHRLCQLLLFCFFFRRAQLPRY+QSSQRM DASMQKPQ N A KVE
Sbjct: 910 PGTPAVRLIGDCHFLHRLCQLLLFCFFFRRAQLPRYLQSSQRMTDASMQKPQPNGASKVE 969
Query: 962 ENQTKSMPTAGRPEDSQGARSNQLVAGVKVEDGPASRARLGTGNAGQGYTFDEVKVLFLI 1021
ENQTK M AGR ED QG R NQLVAGVKVEDGPASRARLGTGNAGQGYT+DEVKVLFLI
Sbjct: 970 ENQTKPMTVAGRAEDVQGVRPNQLVAGVKVEDGPASRARLGTGNAGQGYTYDEVKVLFLI 1029
Query: 1022 LMDLCKRTAGLAHPLPVSQVGMSSIQVRLHYIDGNYTVLPEVVEASLGPHMQNMPRPRGA 1081
LMDLCKRTAGLAHPLPVSQVG S+IQVRLHYIDGNYTVLPEVVEASLGPHMQNMPRPRGA
Sbjct: 1030 LMDLCKRTAGLAHPLPVSQVGSSNIQVRLHYIDGNYTVLPEVVEASLGPHMQNMPRPRGA 1089
Query: 1082 DAAGLLLRELELHPPAEEWNRRNMFGGPWSDLDEMGSVDDSSKLSASIDPVDLTSMETCD 1141
DAAGLLLRELELHPPAEEW+RRNMFGGPWSDLD+MGSVDDSSKLSAS+DP+D +++E+CD
Sbjct: 1090 DAAGLLLRELELHPPAEEWHRRNMFGGPWSDLDDMGSVDDSSKLSASVDPLDSSAVESCD 1149
Query: 1142 VSYGVQSLWPRKRRLSERDAAFGFNTSVGLGAYLGIMGSRRDVVTAVWKTGLEGVWFKCI 1201
VSYGVQSLWPRKRRLSERDAAFGFNTSVGLGAYLGIMGSRRDVVTAVW+TGLEGVW+KCI
Sbjct: 1150 VSYGVQSLWPRKRRLSERDAAFGFNTSVGLGAYLGIMGSRRDVVTAVWRTGLEGVWYKCI 1209
Query: 1202 RCLRQTSAFSPTDSNGPSNQNDRETYWISRWAYGCPMCGGAWSRL 1243
RCLRQTSAFS T S SNQNDRETYWISRWAYGCPMCGG W R+
Sbjct: 1210 RCLRQTSAFSSTGSTNSSNQNDRETYWISRWAYGCPMCGGVWIRV 1251
BLAST of Spo14124.1 vs. NCBI nr
Match:
gi|731350508|ref|XP_010686539.1| (PREDICTED: mediator of RNA polymerase II transcription subunit 16 isoform X2 [Beta vulgaris subsp. vulgaris])
HSP 1 Score: 1979.1 bits (5126), Expect = 0.000e+0
Identity = 971/1058 (91.78%), Postives = 1004/1058 (94.90%), Query Frame = 1
Query: 185 ANLVRDVSRWQPEHEWRQDIAVVTKWLSGVSPYRWLSAKSNNSTSLKSTFEEKFLPQHPQ 244
ANLVRD S WQ EHEWRQDIAVVTKWLSGVSPYRWLSAKSNN+TS KSTFEEKFLPQ PQ
Sbjct: 28 ANLVRDASSWQHEHEWRQDIAVVTKWLSGVSPYRWLSAKSNNNTSSKSTFEEKFLPQQPQ 87
Query: 245 TSARWPNILCVCSVFSSGSVQLHWSQWPPREGASAKWFCTSKGLLGAGPSGIMAADAIVT 304
TSARWPNILCVCSVFSSGS+QLHWSQWP R+GA+AKWFCTSKGLLGAGPSGIMAADAIVT
Sbjct: 88 TSARWPNILCVCSVFSSGSIQLHWSQWPSRDGAAAKWFCTSKGLLGAGPSGIMAADAIVT 147
Query: 305 DSGAMHIAGVPIVNPSTVVIWEVTPGPGNGLEATPKTTINNAVPPSVDPHSWSGFAPLAA 364
DSGAMH+AGVPIVNPSTVV+WEVTPGPGNG EATPKTTI+N VPPS +P SWSGFAPLAA
Sbjct: 148 DSGAMHVAGVPIVNPSTVVVWEVTPGPGNGFEATPKTTISNVVPPSANPPSWSGFAPLAA 207
Query: 365 YLFDWQEYFISEGKLGKKQSYKDFTENVSLYCSPVSNFSAYVSPEAASQSTATTTWGSGV 424
YLFDWQEYFISE K GKK SYKDFTE +SLYCSPVSNFSAYVSPEAASQS ATTTWGSGV
Sbjct: 208 YLFDWQEYFISEAKQGKKHSYKDFTECISLYCSPVSNFSAYVSPEAASQSAATTTWGSGV 267
Query: 425 TAVSFDPTRGGSVITVVIVEGQYMSPYDPDEGPSITGWRVQRWESCLKPVVLHQIFGSPT 484
TAVSFDPTRGGSVITVVIVEGQYMSPYDPDEGPSITGWRVQRWESCLKPVVLHQIFGSP
Sbjct: 268 TAVSFDPTRGGSVITVVIVEGQYMSPYDPDEGPSITGWRVQRWESCLKPVVLHQIFGSPN 327
Query: 485 NFGGQPPMQTVWETKVNKSMPPTDDLKSQQAASVGPTSDLQPKPESSGDKAKRVTFDHFN 544
NFGGQPPMQTVWETKVNKS+PPTDD KSQQAAS GPTSD+ PKPESS DK KRV+FDHFN
Sbjct: 328 NFGGQPPMQTVWETKVNKSIPPTDDFKSQQAASAGPTSDVHPKPESSADKTKRVSFDHFN 387
Query: 545 MPSDVRTLARIVYSAHGGEIAIAFLRGGIHIFSGPNFTPVDTYQINVGSAIAAPAFSSTS 604
MPSDVRTLARIVYSAHGGEIAIAFLRGGIHIFSGP FTPVDTYQI VGS IAAPAFSSTS
Sbjct: 388 MPSDVRTLARIVYSAHGGEIAIAFLRGGIHIFSGPTFTPVDTYQIYVGSTIAAPAFSSTS 447
Query: 605 CCSASVWHDTSKDCTMLKIIRVLPPAVPSSQAKASSPSWERAIAERFWWSLLVGVDWWDA 664
CCSASVWHDTSKDCTMLKIIRVLPPA+P +QAKASSP+WERAIAERFWWSLLVGVDWWDA
Sbjct: 448 CCSASVWHDTSKDCTMLKIIRVLPPALPKNQAKASSPTWERAIAERFWWSLLVGVDWWDA 507
Query: 665 VGCTQSAAEDGIVSLNSVIAVLDADFHSLPSTQHRLQYGPSLDRIKCRLLEGTNAQEVRA 724
VGCTQSAAEDGIVSLNSVIAVLDADFHSLPSTQHRLQYGPSLDRIKCRLLEGTNAQEVRA
Sbjct: 508 VGCTQSAAEDGIVSLNSVIAVLDADFHSLPSTQHRLQYGPSLDRIKCRLLEGTNAQEVRA 567
Query: 725 MVLDMQARLLLDMLGKGIESALINPSALVSEPWQASGEMLNSIDCESMAVDPALVLSVQA 784
MVLDMQARLLLDMLGKGIESALINPSALVSEPWQASGEMLNS + ESMAVDPALVLSVQA
Sbjct: 568 MVLDMQARLLLDMLGKGIESALINPSALVSEPWQASGEMLNSFEIESMAVDPALVLSVQA 627
Query: 785 YVDAVLDLASHFITRLRRYASFCRTLANHAVQAGTGGNRSMVASPAQSSASPAPSQGAQS 844
YVDAVLDLASHFITRLRRYASFCRTLANHAVQAGTGGNRSMVASP S+ASPAPSQGAQS
Sbjct: 628 YVDAVLDLASHFITRLRRYASFCRTLANHAVQAGTGGNRSMVASPTLSAASPAPSQGAQS 687
Query: 845 GTTNSTGSTQMHAWVQGAIAKISGTTDGVSTSTPNPLSGPAPYMPISINTGTFPGTPAVR 904
GTT+STGSTQM AWVQGAIAKISGTTDGV +S PNPLSGP YMPISINTGTFPGTPAVR
Sbjct: 688 GTTSSTGSTQMQAWVQGAIAKISGTTDGVPSSAPNPLSGPTSYMPISINTGTFPGTPAVR 747
Query: 905 LIGDCHFLHRLCQLLLFCFFFRRAQLPRYIQSSQRMADASMQKPQSNAAGKVEENQTKSM 964
LIGDCHFLHRLCQLLLFCFFFRRAQLPRY+QSSQRM DASMQKPQ N A KVEENQTK M
Sbjct: 748 LIGDCHFLHRLCQLLLFCFFFRRAQLPRYLQSSQRMTDASMQKPQPNGASKVEENQTKPM 807
Query: 965 PTAGRPEDSQGARSNQLVAGVKVEDGPASRARLGTGNAGQGYTFDEVKVLFLILMDLCKR 1024
AGR ED QG R NQLVAGVKVEDGPASRARLGTGNAGQGYT+DEVKVLFLILMDLCKR
Sbjct: 808 TVAGRAEDVQGVRPNQLVAGVKVEDGPASRARLGTGNAGQGYTYDEVKVLFLILMDLCKR 867
Query: 1025 TAGLAHPLPVSQVGMSSIQVRLHYIDGNYTVLPEVVEASLGPHMQNMPRPRGADAAGLLL 1084
TAGLAHPLPVSQVG S+IQVRLHYIDGNYTVLPEVVEASLGPHMQNMPRPRGADAAGLLL
Sbjct: 868 TAGLAHPLPVSQVGSSNIQVRLHYIDGNYTVLPEVVEASLGPHMQNMPRPRGADAAGLLL 927
Query: 1085 RELELHPPAEEWNRRNMFGGPWSDLDEMGSVDDSSKLSASIDPVDLTSMETCDVSYGVQS 1144
RELELHPPAEEW+RRNMFGGPWSDLD+MGSVDDSSKLSAS+DP+D +++E+CDVSYGVQS
Sbjct: 928 RELELHPPAEEWHRRNMFGGPWSDLDDMGSVDDSSKLSASVDPLDSSAVESCDVSYGVQS 987
Query: 1145 LWPRKRRLSERDAAFGFNTSVGLGAYLGIMGSRRDVVTAVWKTGLEGVWFKCIRCLRQTS 1204
LWPRKRRLSERDAAFGFNTSVGLGAYLGIMGSRRDVVTAVW+TGLEGVW+KCIRCLRQTS
Sbjct: 988 LWPRKRRLSERDAAFGFNTSVGLGAYLGIMGSRRDVVTAVWRTGLEGVWYKCIRCLRQTS 1047
Query: 1205 AFSPTDSNGPSNQNDRETYWISRWAYGCPMCGGAWSRL 1243
AFS T S SNQNDRETYWISRWAYGCPMCGG W R+
Sbjct: 1048 AFSSTGSTNSSNQNDRETYWISRWAYGCPMCGGVWIRV 1085
BLAST of Spo14124.1 vs. NCBI nr
Match:
gi|255556001|ref|XP_002519035.1| (PREDICTED: mediator of RNA polymerase II transcription subunit 16 isoform X1 [Ricinus communis])
HSP 1 Score: 1973.0 bits (5110), Expect = 0.000e+0
Identity = 986/1254 (78.63%), Postives = 1086/1254 (86.60%), Query Frame = 1
Query: 5 SSTNNNNNPEPDSPPPP----------NPPASETVAVETDNLEAEEEAFMEKSMVSDDHM 64
S T N PE +S + PA E V+ E D++E DD M
Sbjct: 14 SGTGGNKEPEEESVGQSLEIVAKGAGSDKPAGEPVSSEEDSVEKP-----------DDPM 73
Query: 65 DEDSVNNHPATVFCITLNQSRSNLLHKMSVPDLCRNFSAVAWCGKLNAVACASETCARIP 124
+EDSV+ PATVFCI L Q RSNL HKMSVP+LCRNFSAVAWCGKLNA+ACASETCARIP
Sbjct: 74 EEDSVS--PATVFCIRLKQPRSNLQHKMSVPELCRNFSAVAWCGKLNAIACASETCARIP 133
Query: 125 SSNANPPFWIPIHIVIPERPTESSIFNVIADSPRDSVQFIEWSPPSCPRALLIANFYGRV 184
SSNANPPFWIPIHIVIPERPTE ++FNVIADSPRDSVQFIEWSP SCPRALLIANF+GR+
Sbjct: 134 SSNANPPFWIPIHIVIPERPTECAVFNVIADSPRDSVQFIEWSPTSCPRALLIANFHGRI 193
Query: 185 TIWSQQTQGPANLVRDVSRWQPEHEWRQDIAVVTKWLSGVSPYRWLSAKSNNSTSLKSTF 244
TIW+Q +QGP N+VRD S WQ EHEWRQDIAVVTKWLSGVSPYRWLS+KS++ST+ KSTF
Sbjct: 194 TIWTQPSQGPVNMVRDASCWQREHEWRQDIAVVTKWLSGVSPYRWLSSKSSSSTNSKSTF 253
Query: 245 EEKFLPQHPQTSARWPNILCVCSVFSSGSVQLHWSQWPP-REGASAKWFCTSKGLLGAGP 304
EEKFL Q QTSARWPN LCVCSVFSSGSVQLHWSQWPP R A+ +WFCTSKGLLGAGP
Sbjct: 254 EEKFLSQQSQTSARWPNFLCVCSVFSSGSVQLHWSQWPPSRTNATPEWFCTSKGLLGAGP 313
Query: 305 SGIMAADAIVTDSGAMHIAGVPIVNPSTVVIWEVTPGPGNGLEATPKTTINNAVPPSVDP 364
SGIMAADAIVTDSGAMH+AGVPIVNPSTVV+WEVTPG G+G +ATPKT+I+N VPPS++P
Sbjct: 314 SGIMAADAIVTDSGAMHVAGVPIVNPSTVVVWEVTPGLGHGFQATPKTSISNGVPPSLNP 373
Query: 365 HSWSGFAPLAAYLFDWQEYFISEGKLGKKQSYKDFTENVSLYCSPVSNFSAYVSPEAASQ 424
+WSGFAPLAAYLF WQEY ISE K G+K + +DF+ VSL+CSPVSNFSAYVSPEAA+Q
Sbjct: 374 PNWSGFAPLAAYLFSWQEYLISEAKQGRKHTDQDFSNTVSLHCSPVSNFSAYVSPEAAAQ 433
Query: 425 STATTTWGSGVTAVSFDPTRGGSVITVVIVEGQYMSPYDPDEGPSITGWRVQRWESCLKP 484
S ATTTWGSGVTAV+FDPTRGGSVI VVIVEGQYMSPYDPDEGPSITGWRVQRWES L+P
Sbjct: 434 SAATTTWGSGVTAVAFDPTRGGSVIAVVIVEGQYMSPYDPDEGPSITGWRVQRWESSLQP 493
Query: 485 VVLHQIFGSPTN-FGGQPPMQTVWETKVNKSMPPTDDLKSQQAASVGPTSDLQPKPESSG 544
VVLHQIFG+PT+ FGGQ PMQTVW +KV+ S+PPT+D K+ Q S GP D + +S
Sbjct: 494 VVLHQIFGNPTSSFGGQAPMQTVWVSKVDTSIPPTNDFKNHQTVSAGPAPDARKASDSGV 553
Query: 545 DKAKRVTFDHFNMPSDVRTLARIVYSAHGGEIAIAFLRGGIHIFSGPNFTPVDTYQINVG 604
+KAK +TFD F++PSDVR+LARIVYSAHGGEIAIAFLRGG+HIFSGPNFTPVD+YQINVG
Sbjct: 554 EKAKSLTFDPFDLPSDVRSLARIVYSAHGGEIAIAFLRGGVHIFSGPNFTPVDSYQINVG 613
Query: 605 SAIAAPAFSSTSCCSASVWHDTSKDCTMLKIIRVLPPAVPSSQAKASSPSWERAIAERFW 664
SAIAAPAFSSTSCCSASVWHDTSKD T+LKIIRVLPPAVPSSQ KA+S +WERAIAERFW
Sbjct: 614 SAIAAPAFSSTSCCSASVWHDTSKDRTILKIIRVLPPAVPSSQVKANSSTWERAIAERFW 673
Query: 665 WSLLVGVDWWDAVGCTQSAAEDGIVSLNSVIAVLDADFHSLPSTQHRLQYGPSLDRIKCR 724
WSLLVGVDWWDAVGCTQSAAED IVSLNSVIAVLDADFHSLPSTQHR QYGPSLDRIKCR
Sbjct: 674 WSLLVGVDWWDAVGCTQSAAEDNIVSLNSVIAVLDADFHSLPSTQHRQQYGPSLDRIKCR 733
Query: 725 LLEGTNAQEVRAMVLDMQARLLLDMLGKGIESALINPSALVSEPWQASGEMLNSIDCESM 784
LLEGTNAQEVRAMVLDMQARLLLDMLGKGIESALINPSALV EPWQASGE L+ ID E+M
Sbjct: 734 LLEGTNAQEVRAMVLDMQARLLLDMLGKGIESALINPSALVPEPWQASGETLSGIDPEAM 793
Query: 785 AVDPALVLSVQAYVDAVLDLASHFITRLRRYASFCRTLANHAVQAGTGGNRSMVASPAQS 844
AV+P+LV S+QAYVDAVLDLASHFITRLRRYASFCRTLA+HAV AGTG NRSMV SP QS
Sbjct: 794 AVEPSLVPSIQAYVDAVLDLASHFITRLRRYASFCRTLASHAVTAGTGSNRSMVTSPTQS 853
Query: 845 SASPAPSQGAQSGTTNSTGSTQMHAWVQGAIAKISGTTDGVSTSTPNPLSGPAPYMPISI 904
+ASPA SQG Q+GTT+STGSTQM AWVQGAIAKIS T DGVS +TPNP+SGP+ +MPISI
Sbjct: 854 AASPATSQGGQNGTTSSTGSTQMQAWVQGAIAKISSTNDGVSNATPNPISGPSSFMPISI 913
Query: 905 NTGTFPGTPAVRLIGDCHFLHRLCQLLLFCFFFRRAQLPRYIQSSQRMADASMQKPQSNA 964
NTGTFPGTPAVRLIGDCHFLHRLCQLLLFCFFFRR QLPR+I +QR D +MQKPQS A
Sbjct: 914 NTGTFPGTPAVRLIGDCHFLHRLCQLLLFCFFFRRTQLPRFIGVAQRSTDTNMQKPQSGA 973
Query: 965 AGKVEE-NQTKSMPTAG--RPEDSQGARSNQLVAGVK-VEDGPASRARLGTGNAGQGYTF 1024
GKVEE N S P R ++ Q AR QLV G K VE+GPA R+RLG GNAGQGYTF
Sbjct: 974 PGKVEEANSVSSKPAQAMVRSDEVQTARGGQLVPGGKGVEEGPAGRSRLGYGNAGQGYTF 1033
Query: 1025 DEVKVLFLILMDLCKRTAGLAHPLPVSQVGMSSIQVRLHYIDGNYTVLPEVVEASLGPHM 1084
+EVKVLFLILMDLC+RTA LAHPLPVSQVG S+IQVRLHYI+GNYTVLPEVVEASLGPHM
Sbjct: 1034 EEVKVLFLILMDLCRRTAALAHPLPVSQVGSSNIQVRLHYINGNYTVLPEVVEASLGPHM 1093
Query: 1085 QNMPRPRGADAAGLLLRELELHPPAEEWNRRNMFGGPWSDLDEMGSVDDSSKLSASIDPV 1144
QNMPRPRGADAAGLLLRELELHPP+EEW+RRNMFGGPWSD +++ S DD+ ++S+ D +
Sbjct: 1094 QNMPRPRGADAAGLLLRELELHPPSEEWHRRNMFGGPWSDPEDITSADDTPRMSSYTDSL 1153
Query: 1145 DLTSMETCDVSYGVQSLWPRKRRLSERDAAFGFNTSVGLGAYLGIMGSRRDVVTAVWKTG 1204
D +S+E CDV YGV LWPRKRR+SERDAAFG NTSVGLGAYLGIMGSRRDVVTAVWKTG
Sbjct: 1154 DFSSLENCDVYYGVNGLWPRKRRMSERDAAFGLNTSVGLGAYLGIMGSRRDVVTAVWKTG 1213
Query: 1205 LEGVWFKCIRCLRQTSAFSPTDSNGPSNQNDRETYWISRWAYGCPMCGGAWSRL 1243
LEGVW+KCIRCLRQTSAF+ + P NQNDRE +WISRWAYGCPMCGG W R+
Sbjct: 1214 LEGVWYKCIRCLRQTSAFASPGATNPPNQNDREAWWISRWAYGCPMCGGTWVRV 1254
BLAST of Spo14124.1 vs. NCBI nr
Match:
gi|1009109435|ref|XP_015890204.1| (PREDICTED: mediator of RNA polymerase II transcription subunit 16 [Ziziphus jujuba])
HSP 1 Score: 1957.2 bits (5069), Expect = 0.000e+0
Identity = 972/1230 (79.02%), Postives = 1076/1230 (87.48%), Query Frame = 1
Query: 23 PPASETVAVETDNLEAE----EEAFMEKSMVSDDHMDEDSVNNHPATVFCITLNQSRSNL 82
P A V V + +AE +E +EK +D M+EDSV+ PATVFCI L Q RSNL
Sbjct: 14 PAAQSLVEVSKGSDKAETLSPDEVAVEKP---NDPMEEDSVS--PATVFCIRLKQPRSNL 73
Query: 83 LHKMSVPDLCRNFSAVAWCGKLNAVACASETCARIPSSNANPPFWIPIHIVIPERPTESS 142
+HKMSVP+LCRNFSAVAWCGKLNA+ACASETCARIPSSNANPPFWIPIHIVIPERPTE
Sbjct: 74 MHKMSVPELCRNFSAVAWCGKLNAIACASETCARIPSSNANPPFWIPIHIVIPERPTECE 133
Query: 143 IFNVIADSPRDSVQFIEWSPPSCPRALLIANFYGRVTIWSQQTQGPANLVRDVSRWQPEH 202
+FNVIADSPRDSVQFIEWSP SCPRALLIANF+GR+TIW+Q +QG ANLVRD + WQ EH
Sbjct: 134 VFNVIADSPRDSVQFIEWSPTSCPRALLIANFHGRITIWTQPSQGSANLVRDTNCWQREH 193
Query: 203 EWRQDIAVVTKWLSGVSPYRWLSAKSNNSTSLKSTFEEKFLPQHPQTSARWPNILCVCSV 262
EWRQDIAVV KWLSGVSPYRWLS+KS+ +++ KSTFEEKFL Q Q SARWPN LCVCSV
Sbjct: 194 EWRQDIAVVMKWLSGVSPYRWLSSKSSATSNSKSTFEEKFLSQQSQNSARWPNFLCVCSV 253
Query: 263 FSSGSVQLHWSQWPPREGASA-KWFCTSKGLLGAGPSGIMAADAIVTDSGAMHIAGVPIV 322
FSSGSVQLHWSQWPP + +S KWF TSKGLLGAGPSGIMAADAI+TDSGAMH+AGVPIV
Sbjct: 254 FSSGSVQLHWSQWPPTQNSSPPKWFWTSKGLLGAGPSGIMAADAIITDSGAMHVAGVPIV 313
Query: 323 NPSTVVIWEVTPGPGNGLEATPKTTINNAVPPSVDPHSWSGFAPLAAYLFDWQEYFISEG 382
NPSTVV+WEVTPGPGNG TPKT+ + VPPS++P +W+GF+PLAAYLF WQEY ISE
Sbjct: 314 NPSTVVVWEVTPGPGNGFHTTPKTSTTSGVPPSINPPAWAGFSPLAAYLFSWQEYLISEA 373
Query: 383 KLGKKQSYKDFTENVSLYCSPVSNFSAYVSPEAASQSTATTTWGSGVTAVSFDPTRGGSV 442
K G+KQ+ ++ V L+CSPVSNFSAYVSPEAA+QS TTTWGSGVTAV+FDPT GGSV
Sbjct: 374 KQGRKQADPGVSDTVPLHCSPVSNFSAYVSPEAAAQSATTTTWGSGVTAVAFDPTCGGSV 433
Query: 443 ITVVIVEGQYMSPYDPDEGPSITGWRVQRWESCLKPVVLHQIFGSPTN-FGGQPPMQTVW 502
I VVIVEGQYMSPYDPDEGPSITGWRVQRWES L+PVVLHQIFG+PT+ FGGQ PMQTVW
Sbjct: 434 IAVVIVEGQYMSPYDPDEGPSITGWRVQRWESSLQPVVLHQIFGNPTSSFGGQAPMQTVW 493
Query: 503 ETKVNKSMPPTDDLKSQQAASVGPTSDLQPKPESSGDKAKRVTFDHFNMPSDVRTLARIV 562
+KV+ S+ PT+D K+QQAA+ GPTSD + ES +KAKRV FD F++PSDVRTL+RIV
Sbjct: 494 VSKVDTSIQPTNDFKNQQAAATGPTSDGRKISESIVEKAKRVAFDPFDLPSDVRTLSRIV 553
Query: 563 YSAHGGEIAIAFLRGGIHIFSGPNFTPVDTYQINVGSAIAAPAFSSTSCCSASVWHDTSK 622
YSAHGGEIA+AFLRGG+HIFSGPNF PVD YQINVGSAIAAPAFSSTSCCSASVWHDT+K
Sbjct: 554 YSAHGGEIAVAFLRGGVHIFSGPNFAPVDNYQINVGSAIAAPAFSSTSCCSASVWHDTTK 613
Query: 623 DCTMLKIIRVLPPAVPSSQAKASSPSWERAIAERFWWSLLVGVDWWDAVGCTQSAAEDGI 682
D T+LKIIRVLPPAVP SQ KA+S +WERAIAERFWWSLLVGVDWWDAVGCTQSAAEDGI
Sbjct: 614 DRTILKIIRVLPPAVPVSQLKANSSAWERAIAERFWWSLLVGVDWWDAVGCTQSAAEDGI 673
Query: 683 VSLNSVIAVLDADFHSLPSTQHRLQYGPSLDRIKCRLLEGTNAQEVRAMVLDMQARLLLD 742
VSLNSVIAVLDADFHSLPSTQHR QYGPSLDRIKCRLLEGT AQEVRAMVLDMQARLLLD
Sbjct: 674 VSLNSVIAVLDADFHSLPSTQHRQQYGPSLDRIKCRLLEGTTAQEVRAMVLDMQARLLLD 733
Query: 743 MLGKGIESALINPSALVSEPWQASGEMLNSIDCESMAVDPALVLSVQAYVDAVLDLASHF 802
MLGKGIESALINPSALV EPW SGE L+ ID E+MAV+PALV S+QAYVDAVLDLASHF
Sbjct: 734 MLGKGIESALINPSALVPEPWHESGETLSVIDPEAMAVEPALVPSIQAYVDAVLDLASHF 793
Query: 803 ITRLRRYASFCRTLANHAVQAGTGGNRSMVASPAQSSASPAPSQGAQSGTTNSTGSTQMH 862
ITRLRRYASFCRTLA+HAV AGTG NR+MVASP QSSA+PA SQG QSGTT+STGSTQM
Sbjct: 794 ITRLRRYASFCRTLASHAVTAGTGNNRNMVASPNQSSAAPATSQGGQSGTTSSTGSTQMQ 853
Query: 863 AWVQGAIAKISGTTDGVSTSTPNPLSGPAPYMPISINTGTFPGTPAVRLIGDCHFLHRLC 922
AWVQGAIAKIS TTDGVS+STPNP+SGP+ +MPISINTGTFPGTPAVRLIGDCHFLHRLC
Sbjct: 854 AWVQGAIAKISSTTDGVSSSTPNPISGPSSFMPISINTGTFPGTPAVRLIGDCHFLHRLC 913
Query: 923 QLLLFCFFFRRAQLPRYIQSSQRMADASMQKPQSNAAGKVEE---NQTKSMPTAGRPEDS 982
QLLLFCFFFRR+Q+PR++ +QR ADA+MQKPQS GKVEE K RP+++
Sbjct: 914 QLLLFCFFFRRSQVPRFVGGAQRNADANMQKPQSVPPGKVEEVGSVSAKPSSAIARPDEN 973
Query: 983 QGARSNQLVAGVK-VEDGPASRARLGTGNAGQGYTFDEVKVLFLILMDLCKRTAGLAHPL 1042
Q R+ Q++ G K E+GPA R+RLG GNAGQGYTF+EVKVLFLILMDLC+RTAGL HPL
Sbjct: 974 QVVRTGQVMPGAKGAEEGPAGRSRLGAGNAGQGYTFEEVKVLFLILMDLCRRTAGLTHPL 1033
Query: 1043 PVSQVGMSSIQVRLHYIDGNYTVLPEVVEASLGPHMQNMPRPRGADAAGLLLRELELHPP 1102
PVSQVG S+IQVRLHYIDGNYTVLPEVVEASLGPHMQNMPRPRGADAAGLLLRELELHPP
Sbjct: 1034 PVSQVGSSNIQVRLHYIDGNYTVLPEVVEASLGPHMQNMPRPRGADAAGLLLRELELHPP 1093
Query: 1103 AEEWNRRNMFGGPWSDLDEMGSVDDSSKLSASIDPVDLTSMETCDVSYGVQSLWPRKRRL 1162
AEEW+RRNMFGGPWSD +++G VDD+ KLS S DP+D +S+E+CDV YG Q LWPRKRRL
Sbjct: 1094 AEEWHRRNMFGGPWSDPEDVGPVDDNPKLSNSGDPLDFSSVESCDVYYGAQRLWPRKRRL 1153
Query: 1163 SERDAAFGFNTSVGLGAYLGIMGSRRDVVTAVWKTGLEGVWFKCIRCLRQTSAFSPTDSN 1222
SERDAAFG NTSVGLGAYLGIMGSRRDVVTAVWKTGLEGVW+KCIRCLRQTSAF+ +
Sbjct: 1154 SERDAAFGLNTSVGLGAYLGIMGSRRDVVTAVWKTGLEGVWYKCIRCLRQTSAFASPGAT 1213
Query: 1223 GPSNQNDRETYWISRWAYGCPMCGGAWSRL 1243
P +QNDRE +WISRWAYGCPMCGG W R+
Sbjct: 1214 NPPSQNDREVWWISRWAYGCPMCGGTWVRV 1238
BLAST of Spo14124.1 vs. UniProtKB/TrEMBL
Match:
A0A0K9QE69_SPIOL (Uncharacterized protein OS=Spinacia oleracea GN=SOVF_189130 PE=4 SV=1)
HSP 1 Score: 2506.9 bits (6496), Expect = 0.000e+0
Identity = 1240/1242 (99.84%), Postives = 1241/1242 (99.92%), Query Frame = 1
Query: 1 MTSSSSTNNNNNPEPDSPPPPNPPASETVAVETDNLEAEEEAFMEKSMVSDDHMDEDSVN 60
MTSSSSTNNNNNPEPDSPPPPNPPASETVAVETDNLEAEEEAFMEKSMVSDDHMDEDSVN
Sbjct: 1 MTSSSSTNNNNNPEPDSPPPPNPPASETVAVETDNLEAEEEAFMEKSMVSDDHMDEDSVN 60
Query: 61 NHPATVFCITLNQSRSNLLHKMSVPDLCRNFSAVAWCGKLNAVACASETCARIPSSNANP 120
NHPATVFCITLNQSRSNLLHKMSVPDLCRNFSAVAWCGKLNAVACASETCARIPSSNANP
Sbjct: 61 NHPATVFCITLNQSRSNLLHKMSVPDLCRNFSAVAWCGKLNAVACASETCARIPSSNANP 120
Query: 121 PFWIPIHIVIPERPTESSIFNVIADSPRDSVQFIEWSPPSCPRALLIANFYGRVTIWSQQ 180
PFWIPIHIVIPERPTESSIFNVIADSPRDSVQFIEWSPPSCPRALLIANFYGRVTIWSQQ
Sbjct: 121 PFWIPIHIVIPERPTESSIFNVIADSPRDSVQFIEWSPPSCPRALLIANFYGRVTIWSQQ 180
Query: 181 TQGPANLVRDVSRWQPEHEWRQDIAVVTKWLSGVSPYRWLSAKSNNSTSLKSTFEEKFLP 240
TQGPANLVRDVSRWQPEHEWRQDIAVVTKWLSGVSPYRWLSAKSNNSTSLKSTFEEKFLP
Sbjct: 181 TQGPANLVRDVSRWQPEHEWRQDIAVVTKWLSGVSPYRWLSAKSNNSTSLKSTFEEKFLP 240
Query: 241 QHPQTSARWPNILCVCSVFSSGSVQLHWSQWPPREGASAKWFCTSKGLLGAGPSGIMAAD 300
QHPQTSARWPNILCVCSVFSSGSVQLHWSQWPPREGASAKWFCTSKGLLGAGPSGIMAAD
Sbjct: 241 QHPQTSARWPNILCVCSVFSSGSVQLHWSQWPPREGASAKWFCTSKGLLGAGPSGIMAAD 300
Query: 301 AIVTDSGAMHIAGVPIVNPSTVVIWEVTPGPGNGLEATPKTTINNAVPPSVDPHSWSGFA 360
AIVTDSGAMHIAGVPIVNPSTVVIWEVTPGPGNGLEATPKTTINNAVPPSVDPHSWSGFA
Sbjct: 301 AIVTDSGAMHIAGVPIVNPSTVVIWEVTPGPGNGLEATPKTTINNAVPPSVDPHSWSGFA 360
Query: 361 PLAAYLFDWQEYFISEGKLGKKQSYKDFTENVSLYCSPVSNFSAYVSPEAASQSTATTTW 420
PLAAYLFDWQEYFISEGKLGKKQSYKDFTENVSLYCSPVSNFSAYVSPEAASQSTATTTW
Sbjct: 361 PLAAYLFDWQEYFISEGKLGKKQSYKDFTENVSLYCSPVSNFSAYVSPEAASQSTATTTW 420
Query: 421 GSGVTAVSFDPTRGGSVITVVIVEGQYMSPYDPDEGPSITGWRVQRWESCLKPVVLHQIF 480
GSGVTAVSFDPTRGGSVITVVIVEGQYMSPYDPDEGPSITGWRVQRWESCLKPVVLHQIF
Sbjct: 421 GSGVTAVSFDPTRGGSVITVVIVEGQYMSPYDPDEGPSITGWRVQRWESCLKPVVLHQIF 480
Query: 481 GSPTNFGGQPPMQTVWETKVNKSMPPTDDLKSQQAASVGPTSDLQPKPESSGDKAKRVTF 540
GSPTNFGGQPPMQTVWETKVNKSMPPTDDLKSQQAASVGPTSDLQPKPESSGDKAKRVTF
Sbjct: 481 GSPTNFGGQPPMQTVWETKVNKSMPPTDDLKSQQAASVGPTSDLQPKPESSGDKAKRVTF 540
Query: 541 DHFNMPSDVRTLARIVYSAHGGEIAIAFLRGGIHIFSGPNFTPVDTYQINVGSAIAAPAF 600
DHFNMPSDVRTLARIVYSAHGGEIAIAFLRGGIHIFSGPNFTPVDTYQINVGSAIAAPAF
Sbjct: 541 DHFNMPSDVRTLARIVYSAHGGEIAIAFLRGGIHIFSGPNFTPVDTYQINVGSAIAAPAF 600
Query: 601 SSTSCCSASVWHDTSKDCTMLKIIRVLPPAVPSSQAKASSPSWERAIAERFWWSLLVGVD 660
SSTSCCSASVWHDTSKDCTMLKIIRVLPPAVPSSQAKASSPSWERAIAERFWWSLLVGVD
Sbjct: 601 SSTSCCSASVWHDTSKDCTMLKIIRVLPPAVPSSQAKASSPSWERAIAERFWWSLLVGVD 660
Query: 661 WWDAVGCTQSAAEDGIVSLNSVIAVLDADFHSLPSTQHRLQYGPSLDRIKCRLLEGTNAQ 720
WWDAVGCTQSAAEDGIVSLNSVIAVLDADFHSLPSTQHRLQYGPSLDRIKCRLLEGTNAQ
Sbjct: 661 WWDAVGCTQSAAEDGIVSLNSVIAVLDADFHSLPSTQHRLQYGPSLDRIKCRLLEGTNAQ 720
Query: 721 EVRAMVLDMQARLLLDMLGKGIESALINPSALVSEPWQASGEMLNSIDCESMAVDPALVL 780
EVRAMVLDMQARLLLDMLGKGIESALINPSALVSEPWQASGEMLNSIDCESMAVDPALVL
Sbjct: 721 EVRAMVLDMQARLLLDMLGKGIESALINPSALVSEPWQASGEMLNSIDCESMAVDPALVL 780
Query: 781 SVQAYVDAVLDLASHFITRLRRYASFCRTLANHAVQAGTGGNRSMVASPAQSSASPAPSQ 840
SVQAYVDAVLDLASHFITRLRRYASFCRTLANHAVQAGTGGNRSMVASPAQSSASPAPSQ
Sbjct: 781 SVQAYVDAVLDLASHFITRLRRYASFCRTLANHAVQAGTGGNRSMVASPAQSSASPAPSQ 840
Query: 841 GAQSGTTNSTGSTQMHAWVQGAIAKISGTTDGVSTSTPNPLSGPAPYMPISINTGTFPGT 900
GAQSGTTNSTGSTQMHAWVQGAIAKISGTTDGVSTSTPNPLSGPAPYMPISINTGTFPGT
Sbjct: 841 GAQSGTTNSTGSTQMHAWVQGAIAKISGTTDGVSTSTPNPLSGPAPYMPISINTGTFPGT 900
Query: 901 PAVRLIGDCHFLHRLCQLLLFCFFFRRAQLPRYIQSSQRMADASMQKPQSNAAGKVEENQ 960
PAVRLIGDCHFLHRLCQLLLFCFFFRRAQLPRYIQSSQRMADASMQKPQSNAAGKVEENQ
Sbjct: 901 PAVRLIGDCHFLHRLCQLLLFCFFFRRAQLPRYIQSSQRMADASMQKPQSNAAGKVEENQ 960
Query: 961 TKSMPTAGRPEDSQGARSNQLVAGVKVEDGPASRARLGTGNAGQGYTFDEVKVLFLILMD 1020
TKSMPTAGRPEDSQGARSNQLVAGVKVEDGPASRARLGTGNAGQGYTFDEVKVLFLILMD
Sbjct: 961 TKSMPTAGRPEDSQGARSNQLVAGVKVEDGPASRARLGTGNAGQGYTFDEVKVLFLILMD 1020
Query: 1021 LCKRTAGLAHPLPVSQVGMSSIQVRLHYIDGNYTVLPEVVEASLGPHMQNMPRPRGADAA 1080
LCKRTAGLAHPLPVSQVGMSSIQVRLHYIDGNYTVLPEVVEASLGPHMQNMPRPRGADAA
Sbjct: 1021 LCKRTAGLAHPLPVSQVGMSSIQVRLHYIDGNYTVLPEVVEASLGPHMQNMPRPRGADAA 1080
Query: 1081 GLLLRELELHPPAEEWNRRNMFGGPWSDLDEMGSVDDSSKLSASIDPVDLTSMETCDVSY 1140
GLLLRELELHPPAEEWNRRNMFGGPWSDLDEMGSVDDSSKLSASIDPVDLTSMETCDVSY
Sbjct: 1081 GLLLRELELHPPAEEWNRRNMFGGPWSDLDEMGSVDDSSKLSASIDPVDLTSMETCDVSY 1140
Query: 1141 GVQSLWPRKRRLSERDAAFGFNTSVGLGAYLGIMGSRRDVVTAVWKTGLEGVWFKCIRCL 1200
GVQSLWPRKRRLSERDAAFGFNTSVGLGAYLGIMGSRRDVVTAVWKTGLEGVWFKCIRCL
Sbjct: 1141 GVQSLWPRKRRLSERDAAFGFNTSVGLGAYLGIMGSRRDVVTAVWKTGLEGVWFKCIRCL 1200
Query: 1201 RQTSAFSPTDSNGPSNQNDRETYWISRWAYGCPMCGGAWSRL 1243
RQTSAFSPTDSNGPSNQNDRETYWISRWAYGCPMCGGAW R+
Sbjct: 1201 RQTSAFSPTDSNGPSNQNDRETYWISRWAYGCPMCGGAWVRV 1242
BLAST of Spo14124.1 vs. UniProtKB/TrEMBL
Match:
A0A0J8BSY5_BETVU (Uncharacterized protein OS=Beta vulgaris subsp. vulgaris GN=BVRB_8g183580 PE=4 SV=1)
HSP 1 Score: 2253.4 bits (5838), Expect = 0.000e+0
Identity = 1114/1245 (89.48%), Postives = 1165/1245 (93.57%), Query Frame = 1
Query: 2 TSSSSTNNNNNPEPDSPPPP--NPPASETVAVETDNLEAEEEAFMEKSMVSDDHMDEDSV 61
T++++TN NN+ EPDS P +PP+SE + VE + + EEE FMEKS D M+EDS+
Sbjct: 10 TNNNNTNTNNDSEPDSSSSPVLHPPSSELLVVEAEQ-QQEEEEFMEKS--DDPTMEEDSI 69
Query: 62 NNH--PATVFCITLNQSRSNLLHKMSVPDLCRNFSAVAWCGKLNAVACASETCARIPSSN 121
NN+ PATVFCI L Q R+NLLHKMSVPDLCRNFSAVAWCGKLNA+ACASETCARIPSSN
Sbjct: 70 NNNNNPATVFCIKLKQPRANLLHKMSVPDLCRNFSAVAWCGKLNAIACASETCARIPSSN 129
Query: 122 ANPPFWIPIHIVIPERPTESSIFNVIADSPRDSVQFIEWSPPSCPRALLIANFYGRVTIW 181
ANPPFWIPIHIVIPERPTES++FNVIAD PRDSVQFIEWSPPSCPRALLIANFYGRVTIW
Sbjct: 130 ANPPFWIPIHIVIPERPTESAVFNVIADCPRDSVQFIEWSPPSCPRALLIANFYGRVTIW 189
Query: 182 SQQTQGPANLVRDVSRWQPEHEWRQDIAVVTKWLSGVSPYRWLSAKSNNSTSLKSTFEEK 241
SQ +QGPANLVRD S WQ EHEWRQDIAVVTKWLSGVSPYRWLSAKSNN+TS KSTFEEK
Sbjct: 190 SQPSQGPANLVRDASSWQHEHEWRQDIAVVTKWLSGVSPYRWLSAKSNNNTSSKSTFEEK 249
Query: 242 FLPQHPQTSARWPNILCVCSVFSSGSVQLHWSQWPPREGASAKWFCTSKGLLGAGPSGIM 301
FLPQ PQTSARWPNILCVCSVFSSGS+QLHWSQWP R+GA+AKWFCTSKGLLGAGPSGIM
Sbjct: 250 FLPQQPQTSARWPNILCVCSVFSSGSIQLHWSQWPSRDGAAAKWFCTSKGLLGAGPSGIM 309
Query: 302 AADAIVTDSGAMHIAGVPIVNPSTVVIWEVTPGPGNGLEATPKTTINNAVPPSVDPHSWS 361
AADAIVTDSGAMH+AGVPIVNPSTVV+WEVTPGPGNG EATPKTTI+N VPPS +P SWS
Sbjct: 310 AADAIVTDSGAMHVAGVPIVNPSTVVVWEVTPGPGNGFEATPKTTISNVVPPSANPPSWS 369
Query: 362 GFAPLAAYLFDWQEYFISEGKLGKKQSYKDFTENVSLYCSPVSNFSAYVSPEAASQSTAT 421
GFAPLAAYLFDWQEYFISE K GKK SYKDFTE +SLYCSPVSNFSAYVSPEAASQS AT
Sbjct: 370 GFAPLAAYLFDWQEYFISEAKQGKKHSYKDFTECISLYCSPVSNFSAYVSPEAASQSAAT 429
Query: 422 TTWGSGVTAVSFDPTRGGSVITVVIVEGQYMSPYDPDEGPSITGWRVQRWESCLKPVVLH 481
TTWGSGVTAVSFDPTRGGSVITVVIVEGQYMSPYDPDEGPSITGWRVQRWESCLKPVVLH
Sbjct: 430 TTWGSGVTAVSFDPTRGGSVITVVIVEGQYMSPYDPDEGPSITGWRVQRWESCLKPVVLH 489
Query: 482 QIFGSPTNFGGQPPMQTVWETKVNKSMPPTDDLKSQQAASVGPTSDLQPKPESSGDKAKR 541
QIFGSP NFGGQPPMQTVWETKVNKS+PPTDD KSQQAAS GPTSD+ PKPESS DK KR
Sbjct: 490 QIFGSPNNFGGQPPMQTVWETKVNKSIPPTDDFKSQQAASAGPTSDVHPKPESSADKTKR 549
Query: 542 VTFDHFNMPSDVRTLARIVYSAHGGEIAIAFLRGGIHIFSGPNFTPVDTYQINVGSAIAA 601
V+FDHFNMPSDVRTLARIVYSAHGGEIAIAFLRGGIHIFSGP FTPVDTYQI VGS IAA
Sbjct: 550 VSFDHFNMPSDVRTLARIVYSAHGGEIAIAFLRGGIHIFSGPTFTPVDTYQIYVGSTIAA 609
Query: 602 PAFSSTSCCSASVWHDTSKDCTMLKIIRVLPPAVPSSQAKASSPSWERAIAERFWWSLLV 661
PAFSSTSCCSASVWHDTSKDCTMLKIIRVLPPA+P +QAKASSP+WERAIAERFWWSLLV
Sbjct: 610 PAFSSTSCCSASVWHDTSKDCTMLKIIRVLPPALPKNQAKASSPTWERAIAERFWWSLLV 669
Query: 662 GVDWWDAVGCTQSAAEDGIVSLNSVIAVLDADFHSLPSTQHRLQYGPSLDRIKCRLLEGT 721
GVDWWDAVGCTQSAAEDGIVSLNSVIAVLDADFHSLPSTQHRLQYGPSLDRIKCRLLEGT
Sbjct: 670 GVDWWDAVGCTQSAAEDGIVSLNSVIAVLDADFHSLPSTQHRLQYGPSLDRIKCRLLEGT 729
Query: 722 NAQEVRAMVLDMQARLLLDMLGKGIESALINPSALVSEPWQASGEMLNSIDCESMAVDPA 781
NAQEVRAMVLDMQARLLLDMLGKGIESALINPSALVSEPWQASGEMLNS + ESMAVDPA
Sbjct: 730 NAQEVRAMVLDMQARLLLDMLGKGIESALINPSALVSEPWQASGEMLNSFEIESMAVDPA 789
Query: 782 LVLSVQAYVDAVLDLASHFITRLRRYASFCRTLANHAVQAGTGGNRSMVASPAQSSASPA 841
LVLSVQAYVDAVLDLASHFITRLRRYASFCRTLANHAVQAGTGGNRSMVASP S+ASPA
Sbjct: 790 LVLSVQAYVDAVLDLASHFITRLRRYASFCRTLANHAVQAGTGGNRSMVASPTLSAASPA 849
Query: 842 PSQGAQSGTTNSTGSTQMHAWVQGAIAKISGTTDGVSTSTPNPLSGPAPYMPISINTGTF 901
PSQGAQSGTT+STGSTQM AWVQGAIAKISGTTDGV +S PNPLSGP YMPISINTGTF
Sbjct: 850 PSQGAQSGTTSSTGSTQMQAWVQGAIAKISGTTDGVPSSAPNPLSGPTSYMPISINTGTF 909
Query: 902 PGTPAVRLIGDCHFLHRLCQLLLFCFFFRRAQLPRYIQSSQRMADASMQKPQSNAAGKVE 961
PGTPAVRLIGDCHFLHRLCQLLLFCFFFRRAQLPRY+QSSQRM DASMQKPQ N A KVE
Sbjct: 910 PGTPAVRLIGDCHFLHRLCQLLLFCFFFRRAQLPRYLQSSQRMTDASMQKPQPNGASKVE 969
Query: 962 ENQTKSMPTAGRPEDSQGARSNQLVAGVKVEDGPASRARLGTGNAGQGYTFDEVKVLFLI 1021
ENQTK M AGR ED QG R NQLVAGVKVEDGPASRARLGTGNAGQGYT+DEVKVLFLI
Sbjct: 970 ENQTKPMTVAGRAEDVQGVRPNQLVAGVKVEDGPASRARLGTGNAGQGYTYDEVKVLFLI 1029
Query: 1022 LMDLCKRTAGLAHPLPVSQVGMSSIQVRLHYIDGNYTVLPEVVEASLGPHMQNMPRPRGA 1081
LMDLCKRTAGLAHPLPVSQVG S+IQVRLHYIDGNYTVLPEVVEASLGPHMQNMPRPRGA
Sbjct: 1030 LMDLCKRTAGLAHPLPVSQVGSSNIQVRLHYIDGNYTVLPEVVEASLGPHMQNMPRPRGA 1089
Query: 1082 DAAGLLLRELELHPPAEEWNRRNMFGGPWSDLDEMGSVDDSSKLSASIDPVDLTSMETCD 1141
DAAGLLLRELELHPPAEEW+RRNMFGGPWSDLD+MGSVDDSSKLSAS+DP+D +++E+CD
Sbjct: 1090 DAAGLLLRELELHPPAEEWHRRNMFGGPWSDLDDMGSVDDSSKLSASVDPLDSSAVESCD 1149
Query: 1142 VSYGVQSLWPRKRRLSERDAAFGFNTSVGLGAYLGIMGSRRDVVTAVWKTGLEGVWFKCI 1201
VSYGVQSLWPRKRRLSERDAAFGFNTSVGLGAYLGIMGSRRDVVTAVW+TGLEGVW+KCI
Sbjct: 1150 VSYGVQSLWPRKRRLSERDAAFGFNTSVGLGAYLGIMGSRRDVVTAVWRTGLEGVWYKCI 1209
Query: 1202 RCLRQTSAFSPTDSNGPSNQNDRETYWISRWAYGCPMCGGAWSRL 1243
RCLRQTSAFS T S SNQNDRETYWISRWAYGCPMCGG W R+
Sbjct: 1210 RCLRQTSAFSSTGSTNSSNQNDRETYWISRWAYGCPMCGGVWIRV 1251
BLAST of Spo14124.1 vs. UniProtKB/TrEMBL
Match:
B9RZ66_RICCO (Putative uncharacterized protein OS=Ricinus communis GN=RCOM_0935900 PE=4 SV=1)
HSP 1 Score: 1973.0 bits (5110), Expect = 0.000e+0
Identity = 986/1254 (78.63%), Postives = 1086/1254 (86.60%), Query Frame = 1
Query: 5 SSTNNNNNPEPDSPPPP----------NPPASETVAVETDNLEAEEEAFMEKSMVSDDHM 64
S T N PE +S + PA E V+ E D++E DD M
Sbjct: 14 SGTGGNKEPEEESVGQSLEIVAKGAGSDKPAGEPVSSEEDSVEKP-----------DDPM 73
Query: 65 DEDSVNNHPATVFCITLNQSRSNLLHKMSVPDLCRNFSAVAWCGKLNAVACASETCARIP 124
+EDSV+ PATVFCI L Q RSNL HKMSVP+LCRNFSAVAWCGKLNA+ACASETCARIP
Sbjct: 74 EEDSVS--PATVFCIRLKQPRSNLQHKMSVPELCRNFSAVAWCGKLNAIACASETCARIP 133
Query: 125 SSNANPPFWIPIHIVIPERPTESSIFNVIADSPRDSVQFIEWSPPSCPRALLIANFYGRV 184
SSNANPPFWIPIHIVIPERPTE ++FNVIADSPRDSVQFIEWSP SCPRALLIANF+GR+
Sbjct: 134 SSNANPPFWIPIHIVIPERPTECAVFNVIADSPRDSVQFIEWSPTSCPRALLIANFHGRI 193
Query: 185 TIWSQQTQGPANLVRDVSRWQPEHEWRQDIAVVTKWLSGVSPYRWLSAKSNNSTSLKSTF 244
TIW+Q +QGP N+VRD S WQ EHEWRQDIAVVTKWLSGVSPYRWLS+KS++ST+ KSTF
Sbjct: 194 TIWTQPSQGPVNMVRDASCWQREHEWRQDIAVVTKWLSGVSPYRWLSSKSSSSTNSKSTF 253
Query: 245 EEKFLPQHPQTSARWPNILCVCSVFSSGSVQLHWSQWPP-REGASAKWFCTSKGLLGAGP 304
EEKFL Q QTSARWPN LCVCSVFSSGSVQLHWSQWPP R A+ +WFCTSKGLLGAGP
Sbjct: 254 EEKFLSQQSQTSARWPNFLCVCSVFSSGSVQLHWSQWPPSRTNATPEWFCTSKGLLGAGP 313
Query: 305 SGIMAADAIVTDSGAMHIAGVPIVNPSTVVIWEVTPGPGNGLEATPKTTINNAVPPSVDP 364
SGIMAADAIVTDSGAMH+AGVPIVNPSTVV+WEVTPG G+G +ATPKT+I+N VPPS++P
Sbjct: 314 SGIMAADAIVTDSGAMHVAGVPIVNPSTVVVWEVTPGLGHGFQATPKTSISNGVPPSLNP 373
Query: 365 HSWSGFAPLAAYLFDWQEYFISEGKLGKKQSYKDFTENVSLYCSPVSNFSAYVSPEAASQ 424
+WSGFAPLAAYLF WQEY ISE K G+K + +DF+ VSL+CSPVSNFSAYVSPEAA+Q
Sbjct: 374 PNWSGFAPLAAYLFSWQEYLISEAKQGRKHTDQDFSNTVSLHCSPVSNFSAYVSPEAAAQ 433
Query: 425 STATTTWGSGVTAVSFDPTRGGSVITVVIVEGQYMSPYDPDEGPSITGWRVQRWESCLKP 484
S ATTTWGSGVTAV+FDPTRGGSVI VVIVEGQYMSPYDPDEGPSITGWRVQRWES L+P
Sbjct: 434 SAATTTWGSGVTAVAFDPTRGGSVIAVVIVEGQYMSPYDPDEGPSITGWRVQRWESSLQP 493
Query: 485 VVLHQIFGSPTN-FGGQPPMQTVWETKVNKSMPPTDDLKSQQAASVGPTSDLQPKPESSG 544
VVLHQIFG+PT+ FGGQ PMQTVW +KV+ S+PPT+D K+ Q S GP D + +S
Sbjct: 494 VVLHQIFGNPTSSFGGQAPMQTVWVSKVDTSIPPTNDFKNHQTVSAGPAPDARKASDSGV 553
Query: 545 DKAKRVTFDHFNMPSDVRTLARIVYSAHGGEIAIAFLRGGIHIFSGPNFTPVDTYQINVG 604
+KAK +TFD F++PSDVR+LARIVYSAHGGEIAIAFLRGG+HIFSGPNFTPVD+YQINVG
Sbjct: 554 EKAKSLTFDPFDLPSDVRSLARIVYSAHGGEIAIAFLRGGVHIFSGPNFTPVDSYQINVG 613
Query: 605 SAIAAPAFSSTSCCSASVWHDTSKDCTMLKIIRVLPPAVPSSQAKASSPSWERAIAERFW 664
SAIAAPAFSSTSCCSASVWHDTSKD T+LKIIRVLPPAVPSSQ KA+S +WERAIAERFW
Sbjct: 614 SAIAAPAFSSTSCCSASVWHDTSKDRTILKIIRVLPPAVPSSQVKANSSTWERAIAERFW 673
Query: 665 WSLLVGVDWWDAVGCTQSAAEDGIVSLNSVIAVLDADFHSLPSTQHRLQYGPSLDRIKCR 724
WSLLVGVDWWDAVGCTQSAAED IVSLNSVIAVLDADFHSLPSTQHR QYGPSLDRIKCR
Sbjct: 674 WSLLVGVDWWDAVGCTQSAAEDNIVSLNSVIAVLDADFHSLPSTQHRQQYGPSLDRIKCR 733
Query: 725 LLEGTNAQEVRAMVLDMQARLLLDMLGKGIESALINPSALVSEPWQASGEMLNSIDCESM 784
LLEGTNAQEVRAMVLDMQARLLLDMLGKGIESALINPSALV EPWQASGE L+ ID E+M
Sbjct: 734 LLEGTNAQEVRAMVLDMQARLLLDMLGKGIESALINPSALVPEPWQASGETLSGIDPEAM 793
Query: 785 AVDPALVLSVQAYVDAVLDLASHFITRLRRYASFCRTLANHAVQAGTGGNRSMVASPAQS 844
AV+P+LV S+QAYVDAVLDLASHFITRLRRYASFCRTLA+HAV AGTG NRSMV SP QS
Sbjct: 794 AVEPSLVPSIQAYVDAVLDLASHFITRLRRYASFCRTLASHAVTAGTGSNRSMVTSPTQS 853
Query: 845 SASPAPSQGAQSGTTNSTGSTQMHAWVQGAIAKISGTTDGVSTSTPNPLSGPAPYMPISI 904
+ASPA SQG Q+GTT+STGSTQM AWVQGAIAKIS T DGVS +TPNP+SGP+ +MPISI
Sbjct: 854 AASPATSQGGQNGTTSSTGSTQMQAWVQGAIAKISSTNDGVSNATPNPISGPSSFMPISI 913
Query: 905 NTGTFPGTPAVRLIGDCHFLHRLCQLLLFCFFFRRAQLPRYIQSSQRMADASMQKPQSNA 964
NTGTFPGTPAVRLIGDCHFLHRLCQLLLFCFFFRR QLPR+I +QR D +MQKPQS A
Sbjct: 914 NTGTFPGTPAVRLIGDCHFLHRLCQLLLFCFFFRRTQLPRFIGVAQRSTDTNMQKPQSGA 973
Query: 965 AGKVEE-NQTKSMPTAG--RPEDSQGARSNQLVAGVK-VEDGPASRARLGTGNAGQGYTF 1024
GKVEE N S P R ++ Q AR QLV G K VE+GPA R+RLG GNAGQGYTF
Sbjct: 974 PGKVEEANSVSSKPAQAMVRSDEVQTARGGQLVPGGKGVEEGPAGRSRLGYGNAGQGYTF 1033
Query: 1025 DEVKVLFLILMDLCKRTAGLAHPLPVSQVGMSSIQVRLHYIDGNYTVLPEVVEASLGPHM 1084
+EVKVLFLILMDLC+RTA LAHPLPVSQVG S+IQVRLHYI+GNYTVLPEVVEASLGPHM
Sbjct: 1034 EEVKVLFLILMDLCRRTAALAHPLPVSQVGSSNIQVRLHYINGNYTVLPEVVEASLGPHM 1093
Query: 1085 QNMPRPRGADAAGLLLRELELHPPAEEWNRRNMFGGPWSDLDEMGSVDDSSKLSASIDPV 1144
QNMPRPRGADAAGLLLRELELHPP+EEW+RRNMFGGPWSD +++ S DD+ ++S+ D +
Sbjct: 1094 QNMPRPRGADAAGLLLRELELHPPSEEWHRRNMFGGPWSDPEDITSADDTPRMSSYTDSL 1153
Query: 1145 DLTSMETCDVSYGVQSLWPRKRRLSERDAAFGFNTSVGLGAYLGIMGSRRDVVTAVWKTG 1204
D +S+E CDV YGV LWPRKRR+SERDAAFG NTSVGLGAYLGIMGSRRDVVTAVWKTG
Sbjct: 1154 DFSSLENCDVYYGVNGLWPRKRRMSERDAAFGLNTSVGLGAYLGIMGSRRDVVTAVWKTG 1213
Query: 1205 LEGVWFKCIRCLRQTSAFSPTDSNGPSNQNDRETYWISRWAYGCPMCGGAWSRL 1243
LEGVW+KCIRCLRQTSAF+ + P NQNDRE +WISRWAYGCPMCGG W R+
Sbjct: 1214 LEGVWYKCIRCLRQTSAFASPGATNPPNQNDREAWWISRWAYGCPMCGGTWVRV 1254
BLAST of Spo14124.1 vs. UniProtKB/TrEMBL
Match:
A0A067JN45_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_21415 PE=4 SV=1)
HSP 1 Score: 1950.3 bits (5051), Expect = 0.000e+0
Identity = 975/1243 (78.44%), Postives = 1081/1243 (86.97%), Query Frame = 1
Query: 7 TNNNNNPEPDSPPPPNPPASETVAVETDNLEAEEEAFMEKSMVSDDHMDEDSVNNHPATV 66
+ N + E +S ++ E + EE +EK DD M+EDS++ PATV
Sbjct: 11 SGGNKDHEEESVGQSLEIVAKAAGSEKPEPVSSEEESVEKP---DDPMEEDSMS--PATV 70
Query: 67 FCITLNQSRSNLLHKMSVPDLCRNFSAVAWCGKLNAVACASETCARIPSSNANPPFWIPI 126
FCI L Q RSNL HKMSVP+LCRNFSAVAWCGKLNA+ACASETCARIP SNANPPFWIPI
Sbjct: 71 FCIRLKQPRSNLQHKMSVPELCRNFSAVAWCGKLNAIACASETCARIPCSNANPPFWIPI 130
Query: 127 HIVIPERPTESSIFNVIADSPRDSVQFIEWSPPSCPRALLIANFYGRVTIWSQQTQGPAN 186
HIVIPERPTE ++FNVIADSPRDSVQFIEWSP SCPRALLIANF+GR+TIW+Q +QG AN
Sbjct: 131 HIVIPERPTECAVFNVIADSPRDSVQFIEWSPTSCPRALLIANFHGRITIWTQPSQGAAN 190
Query: 187 LVRDVSRWQPEHEWRQDIAVVTKWLSGVSPYRWLSAKSNNSTSLKSTFEEKFLPQHPQTS 246
LVRD S WQ EHEWRQDIAVVTKWLSGVSPYRWLS+KS++ST+ KSTFEEKFL Q QTS
Sbjct: 191 LVRDASCWQREHEWRQDIAVVTKWLSGVSPYRWLSSKSSSSTNSKSTFEEKFLSQQSQTS 250
Query: 247 ARWPNILCVCSVFSSGSVQLHWSQWPPREG-ASAKWFCTSKGLLGAGPSGIMAADAIVTD 306
ARWPN LCVCSVFSSGSVQLHWSQWPP + A+ KWF TSKGLLGAGPSGIMAADAIVTD
Sbjct: 251 ARWPNFLCVCSVFSSGSVQLHWSQWPPSQNNATPKWFSTSKGLLGAGPSGIMAADAIVTD 310
Query: 307 SGAMHIAGVPIVNPSTVVIWEVTPGPGNGLEATPKTTINNAVPPSVDPHSWSGFAPLAAY 366
SGAMH+AGVPIVNPSTVV+WEVTPGPG G +A KT+ +N VPPS++P +WSGFAPLAAY
Sbjct: 311 SGAMHVAGVPIVNPSTVVVWEVTPGPGPGFQAIAKTSTSNGVPPSLNPPTWSGFAPLAAY 370
Query: 367 LFDWQEYFISEGKLGKKQSYKDFTENVSLYCSPVSNFSAYVSPEAASQSTATTTWGSGVT 426
LF WQEY ISE K GKKQ+ +DF++ VSL+CSPVSNFSAYVSPEAA+QS ATTTWGSGVT
Sbjct: 371 LFSWQEYLISEAKHGKKQTDQDFSDAVSLHCSPVSNFSAYVSPEAAAQSAATTTWGSGVT 430
Query: 427 AVSFDPTRGGSVITVVIVEGQYMSPYDPDEGPSITGWRVQRWESCLKPVVLHQIFGSPTN 486
AV+FDPTRGGSVI VVIVEGQYMSPYDPDEGPSITGWRVQRWES L+PVVLHQIFG+PT+
Sbjct: 431 AVAFDPTRGGSVIAVVIVEGQYMSPYDPDEGPSITGWRVQRWESSLQPVVLHQIFGNPTS 490
Query: 487 -FGGQPPMQTVWETKVNKSMPPTDDLKSQQAASVGPTSDLQPKPESSGDKAKRVTFDHFN 546
FGGQ PMQTVW +KV+ S+ PT+D K+ QA + GPTSD++ +S +KAK +TFD F+
Sbjct: 491 SFGGQAPMQTVWVSKVDTSIHPTNDFKNHQAVAAGPTSDVRKTSDSGVEKAKSLTFDPFD 550
Query: 547 MPSDVRTLARIVYSAHGGEIAIAFLRGGIHIFSGPNFTPVDTYQINVGSAIAAPAFSSTS 606
+PSDVR+LARIVYSAHGGEIAIAFLRGG+HIFSGPNF VD+YQINVGSAIAAPAFSSTS
Sbjct: 551 LPSDVRSLARIVYSAHGGEIAIAFLRGGVHIFSGPNFALVDSYQINVGSAIAAPAFSSTS 610
Query: 607 CCSASVWHDTSKDCTMLKIIRVLPPAVPSSQAKASSPSWERAIAERFWWSLLVGVDWWDA 666
CCSASVWHD SKD T+LKIIRVLPPAVPSSQ KA+S +WERAIAERFWWSLLVGVDWWDA
Sbjct: 611 CCSASVWHDASKDRTILKIIRVLPPAVPSSQVKANSSTWERAIAERFWWSLLVGVDWWDA 670
Query: 667 VGCTQSAAEDGIVSLNSVIAVLDADFHSLPSTQHRLQYGPSLDRIKCRLLEGTNAQEVRA 726
VGCTQSAAEDGIVSLNSVIAVLDADFHSLPSTQHR QYGPSLDRIKCRLLEGTNAQEVRA
Sbjct: 671 VGCTQSAAEDGIVSLNSVIAVLDADFHSLPSTQHRQQYGPSLDRIKCRLLEGTNAQEVRA 730
Query: 727 MVLDMQARLLLDMLGKGIESALINPSALVSEPWQASGEMLNSIDCESMAVDPALVLSVQA 786
MVLDMQARLLLDMLGKGIESALINPSALV EPWQASGE L+SID E+MAV+P LV S+QA
Sbjct: 731 MVLDMQARLLLDMLGKGIESALINPSALVPEPWQASGETLSSIDPEAMAVEPNLVPSIQA 790
Query: 787 YVDAVLDLASHFITRLRRYASFCRTLANHAVQAGTGGNRSMVASPAQSSASPAPSQGAQS 846
YVDAVLDLASHFITRLRRYASFCRTLA+HAV AGTG NRS+V SP QSSASPA SQG Q+
Sbjct: 791 YVDAVLDLASHFITRLRRYASFCRTLASHAVTAGTGSNRSVVTSPTQSSASPAASQGGQN 850
Query: 847 GTTNSTGSTQMHAWVQGAIAKISGTTDGVSTSTPNPLSGPAPYMPISINTGTFPGTPAVR 906
GTT+STGSTQM AWVQGAIAKIS T+DGVS STPNP+SGP+ +MPISINTGTFPGTPAVR
Sbjct: 851 GTTSSTGSTQMQAWVQGAIAKISSTSDGVSNSTPNPISGPSSFMPISINTGTFPGTPAVR 910
Query: 907 LIGDCHFLHRLCQLLLFCFFFRRA-QLPRYIQSSQRMADASMQKPQSNAAGKVEEN---Q 966
LIGDCHFLHRLCQLLLFC+FFRR QLPR+I +QR D++MQKPQ A GKVEE
Sbjct: 911 LIGDCHFLHRLCQLLLFCYFFRRTQQLPRFIGGAQRNPDSNMQKPQPGAPGKVEEGNSVS 970
Query: 967 TKSMPTAGRPEDSQGARSNQLVAGVK-VEDGPASRARLGTGNAGQGYTFDEVKVLFLILM 1026
+K PT R ++ Q AR QLV GVK EDGPA R+RLG+GNAGQGYTF+EV+VLFLILM
Sbjct: 971 SKLAPTMVRSDEGQAARGAQLVPGVKGAEDGPAGRSRLGSGNAGQGYTFEEVRVLFLILM 1030
Query: 1027 DLCKRTAGLAHPLPVSQVGMSSIQVRLHYIDGNYTVLPEVVEASLGPHMQNMPRPRGADA 1086
DLC+RT+ LAHPLPVSQVG +IQVRLHYIDGNYTVLPEVVEASLGPHMQNMPRPRGADA
Sbjct: 1031 DLCRRTSALAHPLPVSQVGSGNIQVRLHYIDGNYTVLPEVVEASLGPHMQNMPRPRGADA 1090
Query: 1087 AGLLLRELELHPPAEEWNRRNMFGGPWSDLDEMGSVDDSSKLSASIDPVDLTSMETCDVS 1146
AGLLLRELELHPP+EEW+RRNMFGGPWSDL++ S DD+ + S+ D +D +S+E CDV
Sbjct: 1091 AGLLLRELELHPPSEEWHRRNMFGGPWSDLEDTSSADDTPRQSSYTDSLDFSSLENCDVY 1150
Query: 1147 YGVQSLWPRKRRLSERDAAFGFNTSVGLGAYLGIMGSRRDVVTAVWKTGLEGVWFKCIRC 1206
GV LWP+KRR+SERDAAFG NTSVGLGAYLGIMGSRRDVVTAVWKTGLEGVW+KCIRC
Sbjct: 1151 CGVNGLWPKKRRMSERDAAFGLNTSVGLGAYLGIMGSRRDVVTAVWKTGLEGVWYKCIRC 1210
Query: 1207 LRQTSAFSPTDSNGPSNQNDRETYWISRWAYGCPMCGGAWSRL 1243
LRQTSAF+ + P NQN+RE +WISRWAYGCPMCGG W R+
Sbjct: 1211 LRQTSAFASPGATSPPNQNEREAWWISRWAYGCPMCGGTWVRV 1248
BLAST of Spo14124.1 vs. UniProtKB/TrEMBL
Match:
A0A061GMG9_THECC (Sensitive to freezing 6 OS=Theobroma cacao GN=TCM_029906 PE=4 SV=1)
HSP 1 Score: 1941.4 bits (5028), Expect = 0.000e+0
Identity = 972/1259 (77.20%), Postives = 1083/1259 (86.02%), Query Frame = 1
Query: 1 MTSSSSTNNNNNPEPD-----------SPPPPNPPASETVAVETDNLEAEEEAFMEKSMV 60
M + NN +PE + P P A E VAV E+E
Sbjct: 1 MNQQQGSGNNKDPEEEPVTQSVVDSTVKTGPDRPVAPEPVAVT-----GEDEVVTSSEKT 60
Query: 61 SDDHMDEDSVNNHPATVFCITLNQSRSNLLHKMSVPDLCRNFSAVAWCGKLNAVACASET 120
D M++DSVN PATVFCI L Q RSNL HKMSVP+LCRNFSAVAWCGKLNA+ACASET
Sbjct: 61 EDTPMEDDSVN--PATVFCIRLKQPRSNLQHKMSVPELCRNFSAVAWCGKLNAIACASET 120
Query: 121 CARIPSSNANPPFWIPIHIVIPERPTESSIFNVIADSPRDSVQFIEWSPPSCPRALLIAN 180
CARIPSSNANPPFWIPIHIVIPERPTE ++FNVIADSPRDSVQFIEWSP SCPRALLIAN
Sbjct: 121 CARIPSSNANPPFWIPIHIVIPERPTECAVFNVIADSPRDSVQFIEWSPTSCPRALLIAN 180
Query: 181 FYGRVTIWSQQTQGPANLVRDVSRWQPEHEWRQDIAVVTKWLSGVSPYRWLSAKSNNSTS 240
F+GR+TIW+Q +QGPA+LVRD S WQ EHEWRQDIAVVTKWLSGVS YRWLS+KS+N +
Sbjct: 181 FHGRITIWTQPSQGPAHLVRDASCWQREHEWRQDIAVVTKWLSGVSLYRWLSSKSSNPAN 240
Query: 241 LKSTFEEKFLPQHPQTSARWPNILCVCSVFSSGSVQLHWSQWPPREGASA-KWFCTSKGL 300
KSTFEEKFL Q Q SARWPN LCVCSVFSSGSVQLHWSQWPP +G++A KWFCTSKG+
Sbjct: 241 SKSTFEEKFLSQQSQNSARWPNFLCVCSVFSSGSVQLHWSQWPPTQGSTARKWFCTSKGI 300
Query: 301 LGAGPSGIMAADAIVTDSGAMHIAGVPIVNPSTVVIWEVTPGPGNGLEATPKTTINNAVP 360
LGAGPSGIMAADAI+TDSGAMH+AGVPIVNPSTVV+WEVTPGPGNG +AT KT+ ++ +P
Sbjct: 301 LGAGPSGIMAADAIITDSGAMHVAGVPIVNPSTVVVWEVTPGPGNGFQATAKTSTSSGIP 360
Query: 361 PSVDPHSWSGFAPLAAYLFDWQEYFISEGKLGKKQSYKDFTENVSLYCSPVSNFSAYVSP 420
PSV+P +W+GFAPLAAYLF WQEY ISE K GKK + +DF + SL+CSPVSNFSAYVSP
Sbjct: 361 PSVNPPNWAGFAPLAAYLFSWQEYLISEAKQGKKSTDQDFNDAASLHCSPVSNFSAYVSP 420
Query: 421 EAASQSTATTTWGSGVTAVSFDPTRGGSVITVVIVEGQYMSPYDPDEGPSITGWRVQRWE 480
EAA+QS ATTTWGSGVTAV+FDPTRGGSVI VVIVEGQYMSPYDPDEGP+ITGWRVQRWE
Sbjct: 421 EAAAQSAATTTWGSGVTAVAFDPTRGGSVIAVVIVEGQYMSPYDPDEGPTITGWRVQRWE 480
Query: 481 SCLKPVVLHQIFGSPTN-FGGQPPMQTVWETKVNKSMPPTDDLKSQQAASVGPTSDLQPK 540
S L+PVV+H IFG+P++ FGGQ PMQTVW +KV+ S+PPT+D K QQAA+ GPT D++
Sbjct: 481 SSLQPVVIHHIFGNPSSSFGGQAPMQTVWVSKVDTSIPPTNDFKIQQAAAAGPTPDVRKA 540
Query: 541 PESSGDKAKRVTFDHFNMPSDVRTLARIVYSAHGGEIAIAFLRGGIHIFSGPNFTPVDTY 600
+ S +KAKRV+FD F++PSDVRTLARIVYSAHGGEIAI+FLRGG+HIFSGP+FT VD Y
Sbjct: 541 SDLSAEKAKRVSFDPFDLPSDVRTLARIVYSAHGGEIAISFLRGGVHIFSGPDFTAVDNY 600
Query: 601 QINVGSAIAAPAFSSTSCCSASVWHDTSKDCTMLKIIRVLPPAVPSSQAKASSPSWERAI 660
QINVGSAIAAPAFSSTSCCSASVWHDTSKD T+LKIIRVLPPAV SS+ KA+S +WERAI
Sbjct: 601 QINVGSAIAAPAFSSTSCCSASVWHDTSKDRTILKIIRVLPPAVSSSEIKANSSTWERAI 660
Query: 661 AERFWWSLLVGVDWWDAVGCTQSAAEDGIVSLNSVIAVLDADFHSLPSTQHRLQYGPSLD 720
AERFWWSLLVGVDWWDAVGCTQSAAEDGIVSLNSVIAVLDADFHSLPS QHR QYGPSLD
Sbjct: 661 AERFWWSLLVGVDWWDAVGCTQSAAEDGIVSLNSVIAVLDADFHSLPSIQHRQQYGPSLD 720
Query: 721 RIKCRLLEGTNAQEVRAMVLDMQARLLLDMLGKGIESALINPSALVSEPWQASGEMLNSI 780
RIKCRLLEGTNAQEVRAMVLDMQARLLLDMLGKGIESAL+NPSALVSEPW ASGE L SI
Sbjct: 721 RIKCRLLEGTNAQEVRAMVLDMQARLLLDMLGKGIESALVNPSALVSEPWHASGETLASI 780
Query: 781 DCESMAVDPALVLSVQAYVDAVLDLASHFITRLRRYASFCRTLANHAVQAGTGGNRSMVA 840
D E+MAVDPALV S+QAYVDAVLDLASHFITRLRRYASFCRTLA+HAV AG+G NR+MVA
Sbjct: 781 DLEAMAVDPALVPSIQAYVDAVLDLASHFITRLRRYASFCRTLASHAVNAGSGSNRNMVA 840
Query: 841 SPAQSSASPAPSQGAQSGTTNSTGSTQMHAWVQGAIAKISGTTDGVSTSTPNPLSGPAPY 900
SP QSSA+PA SQ QSGTT+STGSTQM AWVQGAIAKIS +TDGV+ STP +SGP+ +
Sbjct: 841 SPTQSSATPATSQAGQSGTTSSTGSTQMQAWVQGAIAKISSSTDGVANSTPT-ISGPSTF 900
Query: 901 MPISINTGTFPGTPAVRLIGDCHFLHRLCQLLLFCFFFRRAQLPRYIQSSQRMADASMQK 960
MPISINTGTFPGTPAVRLIGDCHFLHRLCQLLLFCFFFRRAQ PR +QR ADA+ QK
Sbjct: 901 MPISINTGTFPGTPAVRLIGDCHFLHRLCQLLLFCFFFRRAQHPR---CAQRTADANHQK 960
Query: 961 PQSNAAGKVEENQT---KSMPTAGRPEDSQGARSNQLVAGVK-VEDGPASRARLGTGNAG 1020
Q A GK+EE + K T R +++QG+R+ Q+V G K E+GPA R ++G+GNAG
Sbjct: 961 SQPGAPGKMEEVNSVSVKPTTTMTRSDEAQGSRTGQVVPGAKGFEEGPAGRLKMGSGNAG 1020
Query: 1021 QGYTFDEVKVLFLILMDLCKRTAGLAHPLPVSQVGMSSIQVRLHYIDGNYTVLPEVVEAS 1080
QGYTF+EVKVLFLILMDLC+RTA LAHPLPVSQVG SSIQVRLHYIDGNYTVLPEVVEAS
Sbjct: 1021 QGYTFEEVKVLFLILMDLCRRTAALAHPLPVSQVGSSSIQVRLHYIDGNYTVLPEVVEAS 1080
Query: 1081 LGPHMQNMPRPRGADAAGLLLRELELHPPAEEWNRRNMFGGPWSDLDEMGSVDDSSKLSA 1140
LGPHMQNMPRPRGADAAGLLLRELELHPP+EEW+RRNMFGGPWSD ++MG +DDS +LS
Sbjct: 1081 LGPHMQNMPRPRGADAAGLLLRELELHPPSEEWHRRNMFGGPWSDPEDMGPIDDSPRLSN 1140
Query: 1141 SIDPVDLTSMETCDVSYGVQSLWPRKRRLSERDAAFGFNTSVGLGAYLGIMGSRRDVVTA 1200
SID +D++S+E D YG Q+LWPRKRRLSERDAAFG NTSVGLGAYLGIMGSRRDVVTA
Sbjct: 1141 SIDSIDMSSLENFDGYYGAQTLWPRKRRLSERDAAFGLNTSVGLGAYLGIMGSRRDVVTA 1200
Query: 1201 VWKTGLEGVWFKCIRCLRQTSAFSPTDSNGPSNQNDRETYWISRWAYGCPMCGGAWSRL 1243
VWKTGLEGVW+KCIRCLRQTSAF+ S +QN+RET+WISRWA+GCPMCGG W R+
Sbjct: 1201 VWKTGLEGVWYKCIRCLRQTSAFASPGSTSQRSQNERETWWISRWAHGCPMCGGTWVRV 1248
BLAST of Spo14124.1 vs. ExPASy Swiss-Prot
Match:
MED16_ARATH (Mediator of RNA polymerase II transcription subunit 16 OS=Arabidopsis thaliana GN=MED16 PE=1 SV=1)
HSP 1 Score: 1802.3 bits (4667), Expect = 0.000e+0
Identity = 898/1243 (72.24%), Postives = 1018/1243 (81.90%), Query Frame = 1
Query: 27 ETVAVETDNLEAEEEAFMEKSMVSDDHMDEDSVNN------HPATVFCITLNQSRSNLLH 86
ET+ L EE +EKS+ + D S +N PATVFC+ L Q SNLLH
Sbjct: 42 ETIESTDPILVVVEEKLLEKSVDGEKEDDNSSSSNMEIDPVSPATVFCVKLKQPNSNLLH 101
Query: 87 KMSVPDLCRNFSAVAWCGKLNAVACASETCARIPSSNANPPFWIPIHIVIPERPTESSIF 146
KMSVP+LCRNFSAVAWCGKLNA+ACASETCARIPSS AN PFWIPIHI+IPERPTE ++F
Sbjct: 102 KMSVPELCRNFSAVAWCGKLNAIACASETCARIPSSKANTPFWIPIHILIPERPTECAVF 161
Query: 147 NVIADSPRDSVQFIEWSPPSCPRALLIANFYGRVTIWSQQTQGPANLVRDVSRWQPEHEW 206
NV+ADSPRDSVQFIEWSP SCPRALLIANF+GR+TIW+Q TQG ANLV D + WQ EHEW
Sbjct: 162 NVVADSPRDSVQFIEWSPTSCPRALLIANFHGRITIWTQPTQGSANLVHDATSWQCEHEW 221
Query: 207 RQDIAVVTKWLSGVSPYRWLSAKSNNSTSLKSTFEEKFLPQHPQTSARWPNILCVCSVFS 266
RQDIAVVTKWL+G SPYRWLS+K ++ T+ KSTFEEKFL Q ++SARWPN LCVCSVFS
Sbjct: 222 RQDIAVVTKWLTGASPYRWLSSKPSSGTNAKSTFEEKFLSQSSESSARWPNFLCVCSVFS 281
Query: 267 SGSVQLHWSQWPPREGASA-KWFCTSKGLLGAGPSGIMAADAIVTDSGAMHIAGVPIVNP 326
SGSVQ+HWSQWP +G++A KWF T KGLLGAGPSGIMAADAI+TDSGAMH+AGVPIVNP
Sbjct: 282 SGSVQIHWSQWPSNQGSTAPKWFSTKKGLLGAGPSGIMAADAIITDSGAMHVAGVPIVNP 341
Query: 327 STVVIWEVTPGPGNGLEATPKTTINNAVPPSVDPHSWSGFAPLAAYLFDWQEYFISEGKL 386
ST+V+WEVTPGPGNGL+ATPK + + VPPS+ SW+GFAPLAAYLF WQEY ISE K
Sbjct: 342 STIVVWEVTPGPGNGLQATPKISTGSRVPPSLSSSSWTGFAPLAAYLFSWQEYLISEIKQ 401
Query: 387 GKKQSYKDFTENVSLYCSPVSNFSAYVSPEAASQSTATTTWGSGVTAVSFDPTRGGSVIT 446
GKK S +D ++ +SL CSPVSNFSAYVSPEAA+QS ATTTWGSGVTAV+FDPTRGGSVI
Sbjct: 402 GKKPSDQDSSDAISLSCSPVSNFSAYVSPEAAAQSAATTTWGSGVTAVAFDPTRGGSVIA 461
Query: 447 VVIVEGQYMSPYDPDEGPSITGWRVQRWESCLKPVVLHQIFGSPT-NFGGQPPMQTVWET 506
VVIVEGQYMSPYDPDEGPSITGWRVQRWES ++PVVLHQIFG+PT NFGGQ P QTVW +
Sbjct: 462 VVIVEGQYMSPYDPDEGPSITGWRVQRWESSVQPVVLHQIFGNPTSNFGGQVPTQTVWVS 521
Query: 507 KVNKSMPPTDDLKSQQAASVGPTSDLQPKPESSGDKAKRVTFDHFNMPSDVRTLARIVYS 566
+V+ S+PPT D K+ Q A+ GP+ D +P+S +KA +V FD F++PSD+RTLARIVYS
Sbjct: 522 RVDMSIPPTKDFKNHQVAAAGPSVDAPKEPDSGDEKANKVVFDPFDLPSDIRTLARIVYS 581
Query: 567 AHGGEIAIAFLRGGIHIFSGPNFTPVDTYQINVGSAIAAPAFSSTSCCSASVWHDTSKDC 626
AHGGEIAIAFLRGG+HIFSGP F+PV+ YQINVGSAIAAPAFS TSCCSASVWHD +KDC
Sbjct: 582 AHGGEIAIAFLRGGVHIFSGPTFSPVENYQINVGSAIAAPAFSPTSCCSASVWHDAAKDC 641
Query: 627 TMLKIIRVLPPAVPSSQAKASSPSWERAIAERFWWSLLVGVDWWDAVGCTQSAAEDGIVS 686
MLKIIRVLPPA+P +Q+K +WERAIAERFWWSLLVGVDWWDAVGCTQSAAEDGIVS
Sbjct: 642 AMLKIIRVLPPALPRNQSKVDQSTWERAIAERFWWSLLVGVDWWDAVGCTQSAAEDGIVS 701
Query: 687 LNSVIAVLDADFHSLPSTQHRLQYGPSLDRIKCRLLEGTNAQEVRAMVLDMQARLLLDML 746
LNSVIAV+DADFHSLPSTQHR QYGP+LDRIKCRLLEGTNAQEVRAMVLDMQARLLLDML
Sbjct: 702 LNSVIAVMDADFHSLPSTQHRQQYGPNLDRIKCRLLEGTNAQEVRAMVLDMQARLLLDML 761
Query: 747 GKGIESALINPSALVSEPWQASGEMLNSIDCESMAVDPALVLSVQAYVDAVLDLASHFIT 806
GKGIESAL+NPSALV EPW+ GE + I+ E+MAVDPALV S+QAYVDAVLDLASHFIT
Sbjct: 762 GKGIESALVNPSALVFEPWRVDGETITGINPEAMAVDPALVSSIQAYVDAVLDLASHFIT 821
Query: 807 RLRRYASFCRTLANHAVQAGTGGNRSMVASPAQSSASPAPSQ-----------------G 866
RLRRYASFCRTLA+HA AGTG NR+ V SP Q+++SPA Q
Sbjct: 822 RLRRYASFCRTLASHAASAGTGSNRNNVTSPTQNASSPATPQVFPDKSLYLAVGQPTTTT 881
Query: 867 AQSGTTNSTGSTQMHAWVQGAIAKISGTTDGVSTSTPNPLSGPAPYMPISINTGTFPGTP 926
+ TTNS+GS+ + AW+QGAIAKIS + DG S ST +P+SG +MPISINTGTFPGTP
Sbjct: 882 TTTATTNSSGSSHVQAWMQGAIAKISSSNDG-SNSTASPISGSPTFMPISINTGTFPGTP 941
Query: 927 AVRLIGDCHFLHRLCQLLLFCFFFRRAQLPRYIQSSQRMADASMQKPQSNAAGKVEE-NQ 986
AVRLIGDCHFLHRLCQLLLFCF R ++ P QR AD S QK Q+ A K+EE N
Sbjct: 942 AVRLIGDCHFLHRLCQLLLFCFLQRSSRFP------QRNADVSSQKLQTGATSKLEEVNS 1001
Query: 987 TKSMPTAGRPEDSQGARSNQLVAGVK-VEDGPASRARLGTGNAGQGYTFDEVKVLFLILM 1046
K P R ED+QG R QL GVK +++ A ++G+GNAGQGYT++EV+VLF ILM
Sbjct: 1002 AKPTPALNRIEDAQGFRGAQLGTGVKGIDENSARTTKMGSGNAGQGYTYEEVRVLFHILM 1061
Query: 1047 DLCKRTAGLAHPLPVSQVGMSSIQVRLHYIDGNYTVLPEVVEASLGPHMQNMPRPRGADA 1106
DLCKRT+GLAHPLP SQVG +IQVRLHYIDGNYTVLPEVVEA+LGPHMQNMPRPRGADA
Sbjct: 1062 DLCKRTSGLAHPLPGSQVGSGNIQVRLHYIDGNYTVLPEVVEAALGPHMQNMPRPRGADA 1121
Query: 1107 AGLLLRELELHPPAEEWNRRNMFGGPWSDLDEMGSVDDSSKLSASIDPVDLTSMETCDVS 1166
AGLLLRELELHPP+EEW+RRN+FGGP S+ ++M DD SKLS S+D D CD
Sbjct: 1122 AGLLLRELELHPPSEEWHRRNLFGGPGSEPEDMILTDDVSKLSNSLDLPDTNFSGICDGY 1181
Query: 1167 YGVQSLWPRKRRLSERDAAFGFNTSVGLGAYLGIMGSRRDVVTAVWKTGLEGVWFKCIRC 1226
V SLWPRKRR+SERDAAFG NTSVGLGAYLGIMGSRRDVVTA WKTGLEGVW+KCIRC
Sbjct: 1182 NRVHSLWPRKRRMSERDAAFGSNTSVGLGAYLGIMGSRRDVVTATWKTGLEGVWYKCIRC 1241
Query: 1227 LRQTSAFSPTDSNGPSNQNDRETYWISRWAYGCPMCGGAWSRL 1243
LRQTSAF+ + N N+RET+W SRW Y CPMCGG W R+
Sbjct: 1242 LRQTSAFASPGATKQPNPNERETWWTSRWVYCCPMCGGTWVRV 1277
BLAST of Spo14124.1 vs. TAIR (Arabidopsis)
Match:
AT4G04920.1 (sensitive to freezing 6)
HSP 1 Score: 1802.3 bits (4667), Expect = 0.000e+0
Identity = 898/1243 (72.24%), Postives = 1018/1243 (81.90%), Query Frame = 1
Query: 27 ETVAVETDNLEAEEEAFMEKSMVSDDHMDEDSVNN------HPATVFCITLNQSRSNLLH 86
ET+ L EE +EKS+ + D S +N PATVFC+ L Q SNLLH
Sbjct: 42 ETIESTDPILVVVEEKLLEKSVDGEKEDDNSSSSNMEIDPVSPATVFCVKLKQPNSNLLH 101
Query: 87 KMSVPDLCRNFSAVAWCGKLNAVACASETCARIPSSNANPPFWIPIHIVIPERPTESSIF 146
KMSVP+LCRNFSAVAWCGKLNA+ACASETCARIPSS AN PFWIPIHI+IPERPTE ++F
Sbjct: 102 KMSVPELCRNFSAVAWCGKLNAIACASETCARIPSSKANTPFWIPIHILIPERPTECAVF 161
Query: 147 NVIADSPRDSVQFIEWSPPSCPRALLIANFYGRVTIWSQQTQGPANLVRDVSRWQPEHEW 206
NV+ADSPRDSVQFIEWSP SCPRALLIANF+GR+TIW+Q TQG ANLV D + WQ EHEW
Sbjct: 162 NVVADSPRDSVQFIEWSPTSCPRALLIANFHGRITIWTQPTQGSANLVHDATSWQCEHEW 221
Query: 207 RQDIAVVTKWLSGVSPYRWLSAKSNNSTSLKSTFEEKFLPQHPQTSARWPNILCVCSVFS 266
RQDIAVVTKWL+G SPYRWLS+K ++ T+ KSTFEEKFL Q ++SARWPN LCVCSVFS
Sbjct: 222 RQDIAVVTKWLTGASPYRWLSSKPSSGTNAKSTFEEKFLSQSSESSARWPNFLCVCSVFS 281
Query: 267 SGSVQLHWSQWPPREGASA-KWFCTSKGLLGAGPSGIMAADAIVTDSGAMHIAGVPIVNP 326
SGSVQ+HWSQWP +G++A KWF T KGLLGAGPSGIMAADAI+TDSGAMH+AGVPIVNP
Sbjct: 282 SGSVQIHWSQWPSNQGSTAPKWFSTKKGLLGAGPSGIMAADAIITDSGAMHVAGVPIVNP 341
Query: 327 STVVIWEVTPGPGNGLEATPKTTINNAVPPSVDPHSWSGFAPLAAYLFDWQEYFISEGKL 386
ST+V+WEVTPGPGNGL+ATPK + + VPPS+ SW+GFAPLAAYLF WQEY ISE K
Sbjct: 342 STIVVWEVTPGPGNGLQATPKISTGSRVPPSLSSSSWTGFAPLAAYLFSWQEYLISEIKQ 401
Query: 387 GKKQSYKDFTENVSLYCSPVSNFSAYVSPEAASQSTATTTWGSGVTAVSFDPTRGGSVIT 446
GKK S +D ++ +SL CSPVSNFSAYVSPEAA+QS ATTTWGSGVTAV+FDPTRGGSVI
Sbjct: 402 GKKPSDQDSSDAISLSCSPVSNFSAYVSPEAAAQSAATTTWGSGVTAVAFDPTRGGSVIA 461
Query: 447 VVIVEGQYMSPYDPDEGPSITGWRVQRWESCLKPVVLHQIFGSPT-NFGGQPPMQTVWET 506
VVIVEGQYMSPYDPDEGPSITGWRVQRWES ++PVVLHQIFG+PT NFGGQ P QTVW +
Sbjct: 462 VVIVEGQYMSPYDPDEGPSITGWRVQRWESSVQPVVLHQIFGNPTSNFGGQVPTQTVWVS 521
Query: 507 KVNKSMPPTDDLKSQQAASVGPTSDLQPKPESSGDKAKRVTFDHFNMPSDVRTLARIVYS 566
+V+ S+PPT D K+ Q A+ GP+ D +P+S +KA +V FD F++PSD+RTLARIVYS
Sbjct: 522 RVDMSIPPTKDFKNHQVAAAGPSVDAPKEPDSGDEKANKVVFDPFDLPSDIRTLARIVYS 581
Query: 567 AHGGEIAIAFLRGGIHIFSGPNFTPVDTYQINVGSAIAAPAFSSTSCCSASVWHDTSKDC 626
AHGGEIAIAFLRGG+HIFSGP F+PV+ YQINVGSAIAAPAFS TSCCSASVWHD +KDC
Sbjct: 582 AHGGEIAIAFLRGGVHIFSGPTFSPVENYQINVGSAIAAPAFSPTSCCSASVWHDAAKDC 641
Query: 627 TMLKIIRVLPPAVPSSQAKASSPSWERAIAERFWWSLLVGVDWWDAVGCTQSAAEDGIVS 686
MLKIIRVLPPA+P +Q+K +WERAIAERFWWSLLVGVDWWDAVGCTQSAAEDGIVS
Sbjct: 642 AMLKIIRVLPPALPRNQSKVDQSTWERAIAERFWWSLLVGVDWWDAVGCTQSAAEDGIVS 701
Query: 687 LNSVIAVLDADFHSLPSTQHRLQYGPSLDRIKCRLLEGTNAQEVRAMVLDMQARLLLDML 746
LNSVIAV+DADFHSLPSTQHR QYGP+LDRIKCRLLEGTNAQEVRAMVLDMQARLLLDML
Sbjct: 702 LNSVIAVMDADFHSLPSTQHRQQYGPNLDRIKCRLLEGTNAQEVRAMVLDMQARLLLDML 761
Query: 747 GKGIESALINPSALVSEPWQASGEMLNSIDCESMAVDPALVLSVQAYVDAVLDLASHFIT 806
GKGIESAL+NPSALV EPW+ GE + I+ E+MAVDPALV S+QAYVDAVLDLASHFIT
Sbjct: 762 GKGIESALVNPSALVFEPWRVDGETITGINPEAMAVDPALVSSIQAYVDAVLDLASHFIT 821
Query: 807 RLRRYASFCRTLANHAVQAGTGGNRSMVASPAQSSASPAPSQ-----------------G 866
RLRRYASFCRTLA+HA AGTG NR+ V SP Q+++SPA Q
Sbjct: 822 RLRRYASFCRTLASHAASAGTGSNRNNVTSPTQNASSPATPQVFPDKSLYLAVGQPTTTT 881
Query: 867 AQSGTTNSTGSTQMHAWVQGAIAKISGTTDGVSTSTPNPLSGPAPYMPISINTGTFPGTP 926
+ TTNS+GS+ + AW+QGAIAKIS + DG S ST +P+SG +MPISINTGTFPGTP
Sbjct: 882 TTTATTNSSGSSHVQAWMQGAIAKISSSNDG-SNSTASPISGSPTFMPISINTGTFPGTP 941
Query: 927 AVRLIGDCHFLHRLCQLLLFCFFFRRAQLPRYIQSSQRMADASMQKPQSNAAGKVEE-NQ 986
AVRLIGDCHFLHRLCQLLLFCF R ++ P QR AD S QK Q+ A K+EE N
Sbjct: 942 AVRLIGDCHFLHRLCQLLLFCFLQRSSRFP------QRNADVSSQKLQTGATSKLEEVNS 1001
Query: 987 TKSMPTAGRPEDSQGARSNQLVAGVK-VEDGPASRARLGTGNAGQGYTFDEVKVLFLILM 1046
K P R ED+QG R QL GVK +++ A ++G+GNAGQGYT++EV+VLF ILM
Sbjct: 1002 AKPTPALNRIEDAQGFRGAQLGTGVKGIDENSARTTKMGSGNAGQGYTYEEVRVLFHILM 1061
Query: 1047 DLCKRTAGLAHPLPVSQVGMSSIQVRLHYIDGNYTVLPEVVEASLGPHMQNMPRPRGADA 1106
DLCKRT+GLAHPLP SQVG +IQVRLHYIDGNYTVLPEVVEA+LGPHMQNMPRPRGADA
Sbjct: 1062 DLCKRTSGLAHPLPGSQVGSGNIQVRLHYIDGNYTVLPEVVEAALGPHMQNMPRPRGADA 1121
Query: 1107 AGLLLRELELHPPAEEWNRRNMFGGPWSDLDEMGSVDDSSKLSASIDPVDLTSMETCDVS 1166
AGLLLRELELHPP+EEW+RRN+FGGP S+ ++M DD SKLS S+D D CD
Sbjct: 1122 AGLLLRELELHPPSEEWHRRNLFGGPGSEPEDMILTDDVSKLSNSLDLPDTNFSGICDGY 1181
Query: 1167 YGVQSLWPRKRRLSERDAAFGFNTSVGLGAYLGIMGSRRDVVTAVWKTGLEGVWFKCIRC 1226
V SLWPRKRR+SERDAAFG NTSVGLGAYLGIMGSRRDVVTA WKTGLEGVW+KCIRC
Sbjct: 1182 NRVHSLWPRKRRMSERDAAFGSNTSVGLGAYLGIMGSRRDVVTATWKTGLEGVWYKCIRC 1241
Query: 1227 LRQTSAFSPTDSNGPSNQNDRETYWISRWAYGCPMCGGAWSRL 1243
LRQTSAF+ + N N+RET+W SRW Y CPMCGG W R+
Sbjct: 1242 LRQTSAFASPGATKQPNPNERETWWTSRWVYCCPMCGGTWVRV 1277
The following BLAST results are available for this feature: