Spo25396 (gene)

Overview
NameSpo25396
Typegene
OrganismSpinacia oleracea (Spinach)
DescriptionDNA-3-methyladenine glycosylase
Locationchr3 : 30865603 .. 30888412 (-)
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRexonpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
AAAATCCCATGCAAAAAAGTCAATCCTTAACCAAATTCCATAAATCTCTTTTTAATTGTTTATGCTGCTTCTTCTTCTTCCAACTAATATTCAACTCCTCCCTTCTCAACTCCGGTGTAGGTGGGCCCCAATGACCGGACCTGAACGCTCCAAACTGCGCTCCAAATCTAATTCCGGCGAAAACCTATCTCAAGCAAAACTCAAAATTCGCGCCACGAAGTCTAAATCGAAATCGAAATCGAAATCGAAAGTCCCGTTATCTCCAGAACATCCAATTTCCAACATTTCTGAGATTGATCAACAGAATTTGGTTATTCTAGGGCCCGATTTCTGCTCCGTTGATGCTCTTGATCTTGCTCCACTTCTTCTTGGCAAGTACCTTCGTCGTGATGATGTTATACTTCAGATAACCGAGGTGAGATTTCATTTGCTATTGATGTTGCACGCTAACTGTTCGACGAAATTACCCAACAAGCTAGAATTAGAATTGCATAATCAGTGATTTTTCTATTTTTGGGGTCATAAATAATGATCAATTTTCTTATTAATGAATTACTTTCTGTGGTCAGACCATTTTTCTTCAGTTCATGGTTAATAAAATTCAGGTTCTTTTTCGACTTTTGTTTAGTTTCTGAGGGTCATGAACGTATAGTTTAAAACCCGATTATTTGAGTCGGTTATGTCAAGGTTTTTGCACACCAACTGTTCGAAGAAAATTACTCAATGAGCTAAAATGTTTTTCTTGTCTTTTTGGATTTGCTTATCTTTGTATGAATACTATATAGAGTTTCCAAACAAGTCAGTTTTGGGGTAACAAAATCATGGTCAAGTTCTTTTTCTATTTGTCTATTAGATCAATTTTCTCTAGTTCTTAGTCCAATAAATTATGGTTCTTTCTGTTTTATGATATTGGTGTTTGGGTGATATTTCAAACCCCATTTCTGGAATCAGGTTTTTAAGGTTCACAGGCAAGTCCAAAGCTTAAATGGGTGTAAACTTTGCAGGTGGAAGCATACAGGCCGAATGATTCAGCATGCCATGGTCGCTTTGGCAATACAGCAAGGACAGCTCCTGTTGTAAGATATACATTATATTCACATGGTCACTTTTCTCTAGTGGTTGGATATTGATTTAGTAATTTTGTCTCATGAACAAAATTTATACCCTTTTTGATGAATTATGGAGTTTCTTATTTTATCTTGTGCAATTTGGGTAGTTTGGACCATGTGGGCATGCGTATGTGTATCTCTGCTATGGTCTTCACATGATGCTTAACATTGTTGCTGATAAGGAGGGTGTTGGTGCAGCTGTATTGATACGTTCTTGTGCTCCTATTAGCGGTAATTGCATGTAACCTATGTGTTCTGTAACCTACGTATTGTCCTGCCATTAGCATAAAGCTATGCATGACAACATGTGCAGTAACAAGTTGAATAAAATCATGCTATTAGGTTGGTTGGATGTGGAAATTAGACAGTACTGGTCAATCAACTCTGACTAAAACAATTTTTTTTTCTTATATATGGTAATCTGTCATTCACTCATTTGCACAAGCATTAACGTTTCATATTGAAGAAGAGCATTCCACTGTTTCTATTGATAATGGAAGAAGGGAGGTGATAGGAGTCAAGGTTATTGCATTCTTAAATCCTTGACCAGTAGTTGAGTGTTCTGAATACGCGAGACTGATGTTGCAGGTTTGACGACTATTCAGCAGCGCCGTGGTCAGATTACCAATAAGCCTGTTCTCCTCGCAGGGCCAGGAAAGGTTAGCAGAAAATCCTTACAGTCTCTTTTTTCATTATGTTGGGTAAAATTAGATGCTTGGTAGTGTTGTTTAAATTTTTTTTTTTGGCGTGAAAGAGGAAAACCTCTCAACAAAAATAAAGCAAGAAAAGACAACAATATAAATAAAAAGCAGCAAAACTAGCTACGAAGTCCTAAGCCGACGGAGCCGAAACTCTTCTTGCTATAGTCATGCCTCAACAGGTCCTCCAATAAAACTGCACGAAGCACGGTTTGCAGCAAATACATTGAAGGGTTCCCCTCTAATCCGTCGGATAAGACCACCACATCGCGCAATACCCGGGTTGCCCTTTGCAGCTCCATCGGTATTCAATTTCACCCAACCTTCTCTCGGCTTAAACCATCGCACAAACACCTCAACTTTGCCTTGTTTTCTTTTGCCCAACATCAGATCATCTTTGTTCATTGCTTGCTTAATAAAGCCCACTGTCCTCATAATGAAGCTATATTGGTCGAAAGGTATATCAATATTAGCATTGAAGCACTTCATTTCTCCATTTCCATATCCACAAAAGCGAATTTGCAAACACAATGTACCAATCTTCAACCCAAGATTGATTCGTTCGTTGAGATTATATTGCAACCACTCAATAAATGGCTTCGAAAATAGCTCGTTATCTGTCATCCAAGGGAATTCTTTCCAAACGAGTGTGGCTGCTAGACAAGTTCTCAAAATGTGCTCAATGTTTCTTCGCTTGCTCCACATATAGTACAGCTCAGATCCTGGATTAAACCTCTCATCAACCTAGTTGTGTTCGATATCAACCTGTTTCGAAACACCAGCCAAAAAGAAACAAAACGAATGCGCTGTGGCAAGCGGAGCTTCCATAACCTCCCCAACGGGCAGTGCAGGGGTTCTTCATCATTAGCTGCAACGCTGAGTTAATAGAGAAGCCCCACGAGGCCTCTCCGTGGAGTTAAAAGGACTTAATCATCTTGAGTATGTTCGAAGGAAGAAAGTCACTAAACAGATCGATTTTCCAGCCACTACTCGCGTCCCACATATCAACTCTTGAGGTGGAGCAGAAGTAACATGTTCTAGCAGAGGAACGTTTGTAGCCCACGGATGTCGCCAAAAAATGGTGTCATTACCATTACCAACTACCATACCCATCCCTCTCCTGAGGACATCTATGTTATCTAGTATACCCCTCCACGCATTAGAGGCATCATTCCTTGGCATGAACATGTCGATATCACACCGACCTTCACAATATTTGCTTCTCAATACTCGAGACCACAAGGCATCCGGTTCTGCAAGCACACGCCATCCAAGCATGGTAAGAAAAGCGGCATTGGCTTGTCTCAAAGATCGAATGCCAAGTCCTAGCTTTTTATGCTTTGTGATCGTCTCCCAGGAAACCAAGTGAACCTTGCGTTTATCTTCTGAACCACCCTACAAAAGTCTCCAAGAAATACGATCGATCTCGTCACAAGTAGACCGAGGTAATTTGGTTGTCTGCATAGCATAGTACGACATTGAGGAAAGACACGATTGGAATAAAGTAGCTCAACCAGCAATAGAAATCATCAATCATCTCTGCCATCCAACCTGCCAGTTTACCATTCACACGATCCACCAAGAATTGTTACTCCTTTTTAGAAGACATGTCGTTAATCATAGGGACACCGAGATACATCCCCAAATCATCCGTCTCTGTAATATGAAAGGTCTCACAAATGCCAAACCTTATATCAACAGACGTATTGTTGGAGAAAAAAATTCGGGACTTCGAACAACTAAGCTTGTTGCCTAAGCTTGTTGCCTGATACTTTACAGAATTGTTCCAAGCAAGCTGCTACCACCTTGGCTTGGTCAATTGTAGTTTCCGCAAATAAAATTAAATCATCCGCGAAAGCTATATTAGAAATATTCGGACCATTGCGGCTTGCCCGGACAGGTTTTCAAGCTCCCAGGGTAACTGCTTTCGGTCCATACAAATGACATAGAGCTAAGGTGTAACACCCCAAATTTATCTCTTTTAAAAATCTATATTTTGTTTAATAAAATCAACATTGAGAGAAACTTAGGAGTATTACCGCCACATGATAACATTAAAGGCTAACCAAACTCCAAACAATGCAGCGGAAACTCAAATGATACTCTTTTATTATAAATAAACTATAATGTAATTAAGTATTTAAAACCTTAATAAATAAAAATTTAACTTGAAATGATACCCTAGTAGGTTATTCTCTATCACAAATCCCGCATATGAAGATGCCACATTAATCACCAAGTTCTTGCAAATCTATTCATCATAAGATGAATACGGCAAAGGCAAAACAAAAAGAATACGGATTAGCAAAAACGCTAAGTACATATATGTACAAGATATGTAAAAACATTTAAACATACTTGAACATACTTTCATAATTCATTTATTTCATAAACTTTTGCATTTTGAATTATAGAAAGGACAAATTGTTTTTTACCACCTAAAATAATCGAAAAATTGTTTTTTACCACCTAAAAAACTAAAACTTGTATTTTACCACCTAAAAAAATTTAAAACTTTGTTTTTACCACCTAAAATGGAAAAAATAAACAAAACCGTTATGTGGAGGGCCACAATGACGGTAATATATCTAATATTTTCTCTTTTTCCACAATTTCATGTATTTAACTCCCTTCATTCTTTTTAAGAATTCACATGATTTTTTCTCTCATTAATTCACAAAATTAACAAAATTCAATGGAAATGAAGAGAGGAGGGTGAAGGAGTTAAAGAAAATCGCACGGGGAGTGTATAAAATTGGTGGGATATTGAATTAGTAAGCCCCACATAACGATTTCATCTTTTTTTTTTCCGTTTTCACGTGGTAAAATACAACTTTTAATTTTTTTTGTAGGTGGTAAAATACAACTTTTATTTTTTAGAAGGTAAAAAACAAATTTTCAACTTTTCTAGGTGGTAAAAAGCAACATGTCCTTATAGAAAATCCATTTTTCAAAAGATGGTGACTTTCCAAAATAGTAACTTTCCAAAAATAGAAAACATTTCAAAAGTAGAAACCTCCCAATCAAGACAACTTTCCAAAAATAGGAATTTCCAAAATAAATCTTTCAAAATAGAACTTTCCAAAAATAGAAATTACAAAAAATATCTTTCCAAAAATAGAACTTTCCAAAAATAATTACTTCAAAAAATAAGGTTTCGAAAAGTAAAAACTTTCCAAACATAGATTCCAATATAAATATGGAATCATACCCTCTTCTTATCTCTCCTTCCTTCCTAAATGTGCACACCTCCAAGTTAGTATTCCATTTTACAAATATATGCATTTCGGTACTAACCAAAATAAATGTAACAAGTGACCTCGACTTACCCAACAAATGATGTGTATGAACATTAGTTCCATAGATCTAGAAAGATCCATTATACCAATAATAATAAGTCATGAAAGACTTTGAAAACTTTCTAAAATATAGTTTTTCCATAAAAGACTTTCTATAAAATAAATATGCCACATAAAGGCTTTCCATAAAATAACTTTTCCATGTCAACCTTACTTGTCCACCAAGTATTTTAAAAACACTCATTTTGTTTTTGAAACATATTTTTAAAGATCAACAAACTATTATTTGAATAAAAGTTGTACGTTCAAACCATTTCAGTCACCAGCAACATACATGAAAATCATCATCATCATATTTGAACATCATCAACAATATTATTCAAAAATCATAATTATCATAATTAAAAATACCAATTAAATCACACATAGATTTTAATCATAATTAAAGATCTAGCCGCATACACATACCTTGATCAATCAACCACTTCTCACGAGATCAAATAGGCTTAGGAGATTCATTCATGAACTCCTAATCAATTAATTCACTAAACCTAATTAATAATAATACTAAACATAACTAATTAAATATGATTAAATCTAAAGATGATAAATTTAAATCATGATTCAGTTGATAAAAACATTTTTATTTTATAAACATGTTTTTAAATCACAAACGTTTATCTAAAATAATTGAATTACTAATCGGAAAAGAGTGATTTAAATCAACGATTTGAAGAACGACAAACGACAACGAAGAACGACGAACGAAGAAGCCAACAACACGTGGTGGTGTTGAGCAAGGTGGACGGATGGTGGTGGCTGCCTCCTTGGGGCGGCGGCGGAGGGTTTTAGAAAAGAGAGAGAGTGAGAGTTTGGGAGAATTATAAAATTTAGGTTTTTTTTAAAGAAACTCTAAAGCTTTTTGCGTGGAAAATAATAAAGGCCATGAAGGGGTATTTATAGTTTAATGGGGTATGGGTTATGGTAATTAGGGTTTAGAGGATTTAGGTAAAAAAATATTCTACAATGTTCATTGGAAGAATTAAGGAAAGAGAGAATAATTAATTATGGAAAATGCCAATAGAATAGAATTTAGAGTTATTTTCAAATTATTAAAAATCAGATTTTTCCTAAATCGTTCCTCAAAAATTCAGAATAATTTTAGTTTCCAAATTTCCAATGTTGATTTTCGATTTTAGGAAAACCCTTTAAAATATTCGTATTTTAGGAATTGTTTTCGAAAATCTTTTCTTATTTTTAAAAGTCTGATTTATTTTCGAAAATTCAGATTTCTGAAAGAATATTTTTCTTAATTTTTTTAATTAAAAATCCCTTTTAAACTAGTTATTCCAAAATTACATAAATATCCTTAATGAATTTTGAAACAATACTTCCCTCCAAATTAATTTGAAAAGGTTTTTCCCTCCAAAATCATTTTGGATATAATTACGTAATTAATCAAAATTAAGTTTTTGAAATGCATGAGCTAATCAAATTAATTACGCCAAAAATACGAGGTATTACATAAGGTGAGAGGGGGTTTCCTTGTCGTATACCCCGACTAGGGGTGAAGCTCTTTGTGGGCTCACTGTTCCATAGTACCTGCCGGCTCGTGGATGAGACACACCTCATAATTATCTCAACCATTAATTGAGGCAGTCAAAGCTCCATAATTTATTCGCGGATAAATTTCGATCTTAGGCGGTCTTACGCTTTCTCGAAGTCAATTTAAATGGCCATGTATCCCACTTTACCTTGATTCTTCCTCACAGTATGGAACATTTCTTGCGCTATTATAACCTTGTCTGTAATTTGCCTCCACAGAACAAAGGAATATTGAGTGGGGGAGATGACTGCAGGGAGCATAAACGTCAAGCAATTCACTAGAACCTTGGTAACCGCTTTGTAAACGATGTTGCAAAGACAAATTGAGGTGAGTTCTCCTTAGGGATTAGGACTAAAAAAGCTTTATTAAGGTCAGTAGGAAAATCTCGACCTTCCAACACATCTAACACCGTCGGTTACTAGTAACCGACGGGGAAAGAGGTGTTGATAGAAAACAGGTTGAAAGTCGTTCGGTCCCGGAGCCTTGTGGCCTCATGCTCATAATAGAAGCCTTTACTTCACAAGCAGTGTAGGGTCGAGTAAAATTCTTCCAATCTAATCATCAGCAGGAATTGGGGAAGCAATTGGGCATAAATACACTTAATTGAGCTAGTGGATTTTTTTCTTGGACTAAGGATTTCTAGTAATCCACAACTAAGATTTAAACCTCTTCAGCATTCGAAACTCATTGCCCTTCAGAATTTTGGAGCATCTCAATACGGTTATGTCTTCGTCGGATCACCGTCGAGAGATGGAAGAATTTCGTGTTCCGATCCCCATCTCTCAAGGGCCTCTAATCTTGACTTTTGGAACCAAAGAAGTTCTTCCTGTTGCAGAACACATCCATCACCTTCGAGTTTAGACCAAAACTGGGCTTTTTTGCGAAAATATATTATGAAATATATTTCTATTCTAGTAATTGAGTTTGGTAGCCAAGGTTGCCAGGAAATGAATGAATGGGCTAGAATTATCCCAATTATTATGCATGAACGAACCAGGCATCTTGAAACCGAAAAGGTTTGCATGCCATGGGAATTGGCGCATAACCACAAGTACTCCACAATATTGGACAGTGGTCCAAGCATGATTTTGGAAGATGTTTCCCCGCCCCTTCAGTGAAGCGAAGCCTTTAATCCTCATTCACAATGAACCTATCAAGTCGAGCTGATTTAAAGGTGGCTCGAGTATCACATCGAAACCAAGAGTGGTTGGGACCCGAAAACCCCAAATCTATCTAACCCATGCTCCTCCATCCAGTTACTAAAGCATCGAAATTGCATTTCCGATCCTCCAACCCCATGTCTTTCATCCATCGTTCTTGTTTCATTGAAGTCACCCACATTAAGCCATGGTCCATGAAAATTTTGTTTATCCCTCTCAAGGGCACTCCACAATTCACGACAAGTAAAAGAGTCAGGCTATGTTACTCAGACACGGGTGTCGGTGTCGTGTCAGACACGGGTGTCGGAGTATCCGACACTCCGATCCAAATTTTTGTAGACACTTCGACACTTTCAAAGGAGTGTCTTGACTAGGTTTTTAGACACGGCTACCATATAAAATGTCGGAAAAAAATCGACACTTTTTATAAATTAAATTTTTTCTAAAAGTAAACATTAAAAGTAATAATAATATGGTATTATGTATTTAATAGGCTAAGGTAAAAGTCTTTGTATAGGGAAGATAAAAGTCTTATCTAGGAAAGGTGAATGGTGTAAAGAAAATAAAAACTAAAAAATAAAATACGGAATAGTTATTAAAAACTAAAAAAAAGACCTCAAAAAGATAAAAGATTCCCTCAAAACAGTGAGTTGTACAGCTACAAAACTTTAATTAATAAAAGAAAAATAAAAAATATCAAAAAAATTAATAGAAGATGAAGGGTTGGACCATAACCCACCTTCTGTCTTTCTTCTTTCTCTCTCCTCAACTCTTCTTTCTGTTTTCTTTCTTCTCTTTCTTCTTTCTCTCCTGTACCCTTATTTCAGCCATTTTTTCCGGCCGATTCGCCTCACCAAGGAGATCAAATGCCCATTATTTTTTCGGTGAAGTTCAGATCTACAACAAAGAGAAAGAGTTCGACACCTGATTTTTGGATTCTTTTCCGACATCGGTGTCGACACCTCCCCGGACACGTGTCGAGTGTCGTGTCGCGGGTCGGGTCGTGTCAACACTGATACCCACCGCCGGACGCCGTGTCGGAGTAACATAGGAGTCAGGGCTCGCGTAAACTACTGAAATGAGTCATGGTTCTTCACTCACCTTAGCTATTTTAATAGTAATGTGCTGAGTGTGGAGTTCGATTGGATTGACTGTAACCACCAAAGCCCCGTGCTTCCACTCTAAGTTGTCTAGAAAAGCCAATACAGTCACAAATACTTTGCGCTTTATCTCCACTGATGTGAGTTTCGACGAGTGCTAGCAAATTAGGTTTATCAATTTTAATGAGTTCCTTAATCGTAGGCATAAAAGCCTGCTTCGCTTCCCCTTGAACATTCCAAGTCATGATTTTTATGGACATATTAAACATTGGTATACGGGGGTAACAATCATAGACATTCAACGACACACTAAGTAGTATGCTCAGGAAGGCGTCTCTCGACGTTCTCATTGGAATCACTAAGTCCGCCGGGTGTTGAATTGTAGCCATTAACACCTTGGGAACATGCATCGGAGGCAGAAGCGAGCTCACATTGAACTCCACTCGGTTGAGTGTGGGATGAAGCGTAATCAATGTTGGACCCTCCATTCGATAATCTAAGAACATTGTGTATGCCGAGGAAATATAAACTTCTAAGGTTGTTGGAGGATGTAGGGGGTTCATGGCTAACTTCTCTTCCTCCCGTCTCGAAAGGCTGAGGAACAATTCTATGAGGAAGGGAAGAAACAAATGTAGGGAAAATAGGGGTAGAGTATTCCTTTCCTACGACAGTATGGTGTGCAATAGGAGAAGGAAGGGACTGACTTATTTTCTGGGGATTTATTTTTGATGTTCTGATTACTTTCCTTAAATGGAAACTGTCTTGGCCCATAGGATTTTCTTCCTTGATTTACTGCCGAATTTAACTTAGGAACAAAGGACATTAGCCTAATATCTCGTTTCTCCCTAAAGGAAGGTGTAGCCCCGGAGTTCAAATTGACCGTTGATACAAAATCATCCTTCCTATCCCTAGAAGATATTTCCATATTACCCGTAATATTTGCCATAGAACAAGAGTCCTTAGTTAGGTCCACTATGGGAATTTCCGCATTAATATCAACCTCCGTAATTATACGATGTTCCATATCAATATCTTTTACCGTAACATTCCTTTCTAAAACAAACTCTTGATTACCAAGATCTTCCTCATTATTCCCCTTCAAAATAGGCTATAATTGGGCTTGGTTAATTTATGTACTATCATTGGCCATGTTTCTATACTCTCTCCTTGATTTAGTACATCAAATCTTGACCCCCTTGACCCATTTGCGACTGATTTTGCTCTCTTCCTCTTTGATCTGTTGACATGGTTCTATGGGTAGTATTCTGTTGACCTTGTGTGTTGGAGCCATGTCCTCTGTTAGGGTTCCCTTTGTGCTTTTGTGGTTACCTTCTTGGAGGACGTTGAACAACATCCATGCGCGTATTGTTCCGATGTATCATGTCTTAATGGAGTTGAGTTCGATGGATTCAAATGACGTACATTTTTCCTCCTTGTGGCCAAGCTTGCCACACTTGAAACAGATCTGTTGAAGCCCTTCGTATTGGACCAGCCATACACGTTCATTGAGTCGAAACTTTGAGAGTTGATGTCTAATGATATCAACCTCGACACATTAGTCGAATGTATTGTCCTCTACTCATGGATTATGTGTGCTTGGCAATTTTAGTGACTCTGCCGATTCGTTCACCTATTTTCATGAGGAACTTTTAATTAAAGTACTCCACAGATAAGTTAGGAATTCTAACTCAAGCTATCAAGACATTGATAGGTTCTTCGTCTAGGACAAAATTTGGGACCATTTAGGGATCGTGAGGTAACTGTCGCCAATAACCCATGGCCTTTGTGTCACTGTGTCAGCACCAGGGTTACCTAATTCGGTGGAGACCCTACCGTACTCTGATTTAGAACGGTTTGTATCGATTCGAGTACGAATTCAGTCTAATTCGACCAAATAGTCGAGTCGACCGATTTATACTCTTTTTCGCTGGTCTTCTTCGCCGAATAACGTTTCCCATGGAGATACGAAAGAATTAATTAAACTGGTTTGTAATGTTTTTTAAATGGTGGATGTCAGAAGGTTGAAAGAACTATAACCATGAAAGTAGATCTGAAAGAAAAGCGGGGAAGGAGAAGGAGGAAGTGGGTGACAACTTTAGAAGTGCGATCCACCACAAATTTGGATTAATTGTGATTTTTTCCCCTTAAACAGTACGTGTCATCGTTTTGGTGTTGTACTGCTACTAGGAGTCTAGCTGTTAACATTTGTCCACTTTATATTATTTATTTGTTTATTTTTTAATTTATTTATAAAGGATTAATATTTTAACTTATTTGAGAAACTTTTAATCTATACAAGAGCTCCTATTTTAGGCTTTTTTTTTCCGCAGGACTCTTGTATTTTATTTTTTAGTATTTTATCTTTTTTTTAGTTTCTATTAAACATAAATATGTACCTGTTTTTTTTAATTTAAATGAAATCGTATTTTAAAATTACTCATAACGTATTTGTAATTCGACCCTCTCTCGTATTCCGTATTCGTACTAAATCGTACTCCCTCCGTCCTAGAATACTCGCAACGGTTTGATTGGACACGCTTGCCAATGCACAACTTTGACCATCAATATCTTCAATTATATATTATAAAAATTTATAAAATATTAATATTTTGAAAATATATATTAAGATAAAGTCAACAATATATTATATGCTAACACATACTTCGTATATACTATAAATAAAATAAAGTCAAAGTAAATTATGTTAATAGTGCAAAAAATCAAACCGTTTCGAGTATTCCGGGACGGAGGGAGTATTACCTATTAGGTAACCCTGGTCAGCACAAACGAGTAGTCATCTATGTTTGAGAATCGAACAACATAGTATGAACACCCCACATCAGTTAGGGAAATATCTCCCTTCAGACTCCGCTTATTCCTTAGCCTTCTAATAAGATCTTCATAATCAAAGCATTCAACCATTGCGTGCGCAACATGCCTTTGAAAGACGGATGGTTCGACAGTCCGGGTGTTCCAAGTCCTTCATTAGCCATTTCATCATCTGAGATATCTTTTGTGAACATCCCCCCATCAAGATCTAATATCGCATTGTTTAATCCAATACCACCGGCAGCATGAGTATATGAAAGGCCTTCTCATTTCAAACCATTATCTGATGGAACTCCATCTTGTGCGGGAGATACATGAGTTTGAATGTCTTAATCCATAGCCATGGAGTACTCCTTATCATCACGATGCTTTATTTTCTTATTCAATCTCCTCACAATGCTTTATTTTCTTATTCGATCTCCTCATGAGATCAGTTTCCTACGAAGAAAGTCCTTCATTTTCTCTCATTTCCTCCTAGGTTCATGATAAAGTAGGTCACAAATAAGACCATTTACATTATTCTTAGGGAATCACTCAACCAGTAGTGTTCTTTCAATCTTTTCAAGTATAAGTTATTTCACTGATAAAATAAAAACTATAATTCAAAATTCAAATTCAATTAATGCAAAATAGTTATTTGGGGCTGTCTTGAACCTACGAACAGTTATAGAATCTTTTGGTAAATATTTATTTTTATCAACCAAAGTAAAGAGCAATGCTAACAACAAAACTTATAAACTAAAGGGCAAAGTGAAGCCTGATAACCCTTCTAGGAAGGCTCGTCCGGTTTTAAACGCTCACACCATTACAACTCTAGCCAAATTCTTGAGCTTACTACATGATGTAAGTCTAGAAACACCTTCCAAGGAGATGGGTGAGGATTTTAAACTCAAAGGAACAAGAATACTCACAAGAACACTTTTAATTTTTTAAACACTCGAAACAAGGATAGTGACCAAATCACACGAACACAATCAATGATTTGGGGTTTTAAGGAAATAGAAGACTCACTCAATCTCTCCCATACTTATCACTCAAGACTAACTCAAAGAATAAGGGAAGTTTAAACTTGAGAAACAAAGCATAAACTATGAAAACTCAAGGTTTTTTTTTTATCAAATTCAAAAATGATTTTTATTCATCATTTTCTCTCCTTATGTATAGGATGGAGCAGCAAAAGGAAGCCAAAAGAATTTACAAAGAAACCCTAAAAACCGTCCAAGAGTGAAACGGTTTTAGAGCTAGAAAGGCTCCTTGATTCATGCCCCATTACAAGTTTCATCAATGAGCTGTTACAAAGATGTTACACCAGCAATAGATGACCTAATTAGGCTCATAATCACACGCCTAATCATCACAATCTCACAGCCAAAATGTTACTTTAGGAGTAATCTCTTGGGCTAATTTTTTTTTTTTGGCAAATAAGATTTTATTACTTACCAAAGATAGATAACCAACCTAACACCAAAACTCCTAGGCTTAAGATCCCAAACTTTTAAGAAGACTTGGGACCAACTGAAGCCAAACTACATGAAGCACTGTTTCAAACATTAAACAAAATTCTATTTGCAACTACCTTACAAAGTTACAATCATCTAGCTTATTGCTAAAATGTTTAGCATTTCTCTGCAACCATAAAGCATACACAATTTCTATAAATGTCATGCTATACACTTTGTCAGCATCTTTCTTACCTGGAAACCCATGGGCTGCCTGTGTACACCGACATTCTTCATTACTAAGCACCAGATCCCTTGAAAAAGCGCAACTAAAGATGGTGCAACGTTTCTGCTTGCTGCTGACAGAGGCAACAAATACCATCAGTAGGCACTCCCCAATTAAGCAGTCTGTCTTTTGTAGTGATTCTATTCTGTACCGCAAGCCAAACAATGAAAATACTCTTTGGACTCGCTTTGCTGTTACATATGATACTCTTCCACTCCACCTAGTTCCCAGTGTCCCTCAAAAACCTATACATTTTCTGCACTTTAAACTTACCAACAATAATTTAAAGCCACCAACATCATGAATAACTTCTCTACTTTGCCGTATCTTCCTTAAAGTCCAAGTGAGTCCACTAGGAATAGACATGGACCAAAAATCCTTCTGCTTGATGTATGAACTGTGCAGCGCCACCAGCATCATGAATAACTTCTCTACTTTGCCATATCTTCCTTAAAGATCAAGTGAGTCCACTAGGAATAGACATGGACCGAAAATCCTTCTGCTTGATGTGACCCATTTAATCCAAAGCCGATCTTATTTCAAAGATAAAGCCCAACAATAAGGTCTTTCAGGTTCCACCCACCACATACCTTAGGCAAGCACATCGGTCCCAAGCTACTGGGGCTCTTCTTGAATTTGAATCACCACCAGAGGAACACCCTGCAAATCCTCTGAATTTCTTTGATCACCTTCTTAGGCATTATGAATATCTGGCACCAGCATAGTTGAATGCCAAAGAGCACTGATTTGATCAATTGAACCCTGCCTGCATAAGAAAGAAGCTTAGCGGACCAACTTCAAATTCTAGCTATTGTCCTTTCAATCAAAGGTTTGCAGTCTATGTAACTCAGTTTTCTTGTAGTCAAAGGGACTCTAGGTACGTAAAGGGAAAAGTACTTTTGGGAATTCCCAAAGTCTCAACAATATGATTTCCCTCTGTATCTGAAATCCCAGCCACATACACTTCACTTTTATGAAGATTGGCAACCAAACCTGATGCTCCAGAGTACTTATTAAAAACATCAAACATCACTTTAACAAAATGTGTATCAACTTTACAGAACATCAACAGATCGTCTACGAACATCGAACGAGTGACCCCTAACATCTTGCATCTTGGATGGAATTTGAAAGCAGTAGAACCAGAAGCAACTTCTAAGCAACAAGACAAATATTCCATACCCAAAGCAAAAAGGAAGGAAGACGTTGGGTCGCCTTGTCTTAACCCTTTCCCTCTTGGGCTTCATTACAAGCTAAAAAAATAAAATAAAATACTCTTGCCCATAACAGAAATATTGGCTGCTCATATTTACTCACTTTCACGCCACTTTGTTGCTTCATTCATGTTGGGCCTTCAATCCAAGTAATGCAAGTCATAAACTCGGCCTTTGCGACTTGAATTGTAAGATGATCCAAATCCAATAATTTGAAAGGTCATCTAAATTAATTATTTTAAGTCGCCTTGTCGCATCTGTATCATTCTCTCTTTCTTTGGAAAAAAATCGACCCCGACTTTGCCGATTCTCGAAAGCCGAAACCGATCAAAAGAAATGATAGCCTTGTCAAACATGGTATGTGGTATCATGGAACTCTAGAGCGTGTCAAAAGAAATTCTTGGAACATCATCTTGAACCTCAACATAATCATAGTCATGCTCAATAATGCTCTCGTTAACCCCAAAAGTGGGAACTAGCCAATCTGGAGGGGGTTCGGAAATTGTATCAAAATGTTTCATGAAAGCCTTACGTGGAAAGTTGCAAGCACTAGAGTCATGGAATGACTCTTTGTGACAGCGACCACCAAAAAAGGAACTACAAATTTTTCCTTAATCACCTCATTTTCATTTTTCTTGTGAATTCGTTCAACAACAATGTTCTCTTCAACAGTAGGGTCTCAGTCTAATAAATCAGAAGCATCCCCACGTTTATGCGTGATATCTGTTTTAGGAAAAGCGTCGAGAACTAGGTGTTTATCTCATGAAGATTGGTTTTAAATTCTTCGATTCAAGGACGATGAGTTTCATACTCCACACTCCTGTTATTTTGCTAGAGATTGGTTAACATTAAATGTTCTTTTTCTCATATTCCCTAGGAAAAATTACTTTAAATAATCCAACCTTTTGACGATTTTACGTTGATAATCCAATCTTTGGATTATTATCTAATAATCCAACCTTTGTACCCCACTAACTCTTATTGCACCCAACCGGTCACCAACCTGATATAGTAGGTAATCTATACACGTGTCATGCAACTATTCGCCCTAAGTTTTTTTTTTTTTGCCTTCTTCCTTCTAGTTTATTTATTTTCATGTAGTTTTTTTTTTATTCTTCATCATCATGTTACTCCTCCAGCAGAGCCACTCTCTCTCCTACCCTTCGGTCCTCCGCATCACCGACGTTGTACACCATTATTGCTGGTAAGTCTCTTCCACCCATTAACAACCAACCCAGCCAATTAACACTCACCACACCACAAACCCCCTCCTCCGTTCCTCCTTTCTATCTGATTGTCTCCTCCCCTTCACAACCCCCTTTTTCTTCTCCTCCAAATACGTACACCATCAAAGACTAATTAGACCAATAAACAAAAAAAAAGTATTACAACATACAACACCACCAACCAACAACGAAAATAAAAACTGTAGCAGGGTTACTGGGGAGATGAGAGGCAAGGCGAAGCGTAAAAATTCTCTAGCAATTGAACTCGGGCGGTGGTGTGCGGTAGCAGCAAAATGCAGCAATAAGAAAGAACCTCGTCGACAAAAAATTAGTCGTCTTTGGGCTCTTATTGTTTTTAGTTTGAATAACACATTTTTGTTAGAGTTACCATTGATGATTGATTTATTAAAAGTTGTTCATTGAGAAATTGAAAACAAAATTACTATTGACCAGACACAAAAGAGTAGGAGAAAGGAAGATGAGTGTGCCTCCATTGGAATGGACCTCCACCTTCTTCTCATTGTTCTATGCTTCGTGGAAATTAAACCACCGGAACTCAAGTTCCGATTTTGGAACTCCTTTCCCAGTTAATTAGCCTAATTCCAAAAGCTTTGACCTAGTATCTATTTGGACAATTTATAACTGTATTTTCTCCTTTAAATTTTATTAAGCTCTTGTAGACTTTAAAGCCATTGATGAAATCTTGGAAGAGGAGAGAGTCTGGGATTTGGGGATAAGTAAGGGTAGTTTTAAAAAAAATTAGTTCGTAATGAAATAGAGATGTTGACCTATTTTGCCGGTTGTAAGTCCTTTAGGTGCAATAAAAGCGAATGGGGTATAAAGGTTGGATTATTATATAATAATCGATAAGTTGGATTATTAGCGGAAAATCGTTAAAAGGTTGGATTATTTGAAGTAATTTTTCCCATTCCCTACCCACCCATTTGTCTCTCATTTTCTTCTTTAATCTCTCCCACAAATCAATTTTCTCCTTCCTTTTTTTTCTTCTTCGTGACTTAAGATTTTCATACCATATGACAACTCCTTTATTGAGCTACGAAATAGCAAGTTTAAACCTCTTTGTGTCATCATAACTCTTGTACTCAAAGACGGTCTCAATTTTTCTTACCTAATCAAGGAAAGACTCACTATCAATGACCACCATCTAAGTTTGGCAAATCAATCATAGGAGCTCTATCTTCATCATTCTGCCTAAGGTTATAAGTCTTACTATTATCAACATTTCTCAAGCAATTCAGTTTCCTATTAGTTTGCAACAACATTTCCGCCATAGTATCCATTCTAGTCTTTTCCCTAGCCTTCTCCACCTCATTTTCTTCTGTTTTTTCCTCCAAATTCTATAGTTTTTAAGTTAGTAATTGTATCATCCTAGCAAGATCCCCCTATTTTCCATTATTTTGCTAGACTATTACAATTACAATTGAGTTTATGCCCGAAACTTTAAGCAACAAAACGGAAATGAAAACGACGTTGAGACTCTCAAAGGCTTACAGGTTATGAAACTTCAGGAAATTCCAAGCAATTTCAAAATAAAGAAACTCCTCAAAAGCTGTCCGGATCAAAGAGTAGTCCTCGTTTAAAATCTGATGCGCTGGATTGGCTTCGTTTTGCGCAGTTTTCGGAAATCTGCCCAGTTTAATAGCAAAAACCAAAAATCTGAAATTAAATATGAGTATTCTACACAGAAAAATACCCAGATTAAAAAAAAAATAAAAAAAAAATCTGTCAATTTGGTGCCCCGAGCAAAGATAAATCGCTGTTCAAAGTTGACAGGAAAATTGTTTGTATCCGGCAGTTTTCTTCTTCTCCAAAAATCAGATTTTCTTCAAGACCTCCACGCCAAAGCTCTGATACCAGATGATGCAAGTCTAAAAGCACCTTCCAAGGAGATGGATAATGATTCTAAACTCAAAGGAACAAGAACACTCACAAGAACATTTTTAAACACTCGAAATAAGACCAATGACCAAATCACAAGAAAACACTCAATGATTTGGGGTTTTAGGCCAGATATAAGACTCACTCAATCTCTCCTAAGCTTATCACTCAAGACTAACTCAAAGCGATAAGGGAAGTTTAAACTTGAGACTAACTCAAAGCATAAACTATGAAAATTCAAGGTTTTTTTTTATCAAACTCAAAATTGATTTCTATTTCATCATTTGCTCTCCTTATATAGAGGAATGGAGAAGCAAAACGAAGCCAAAGACTTCACAACCCTATAAACCGTCCAAGAGTGAAACTGTTTTGGAGCTAGAAAGGCTCCTTGATTCATGCCCCATTATTAGCTTCATCAATGAGCTGCTACAAAGATGTTACAACAAGTCAACAACGATGTTACAGCAGCAATAGAGGACCTAGTTAGGCTCCTACTCACACGCCTAATCATCACAAACTCACCACCAAAATGAGGTAATTTAGGAGTAATCTCTTGGCTTTATTACAAGCTAAAAACAGCAAATAAAATATTCTTGACCATAACAGAACTATGGGCTTTTCATATTAAATCCCTTTCACGCCACTTTGTTGCTTCATTCATGTTGGGCCTTCAATCCAAGTAATGCAAGTCACAAACTTGGCCTTTGCTACCTTGAATTGTAAGATGATCAAATCCAATAATTTGAAAGGTCATCTAAATTAATTATTTTAAGTCGCCTTGTCGCATCTGTTTCACTACACATAAAACATCTACTGTATACAAGAAATTGGTTTTTTTTCTTCTGACAATACTCTGTGCATCAAGTCTGCAGTTCTGAAACCTCCTTTGATTCCTTTCCTTCCAAATATGACAAATGGTGGTGATCAAAGCCATCCTATAGAGTTGAAGCTTCACCGAGTTACCCTTGGCACGAGCTAACATGACTGAACCTTCATAAGCCATCAGTTCTCCAGATGCCAGCAGACATACTACATTCAAAAAAAAAAAAGTTCTCATTAATTACTCTCCATGCTCAGATTGAAAAGAATACATATTCTGTGATACTGAATCCAAAAGTTGTTCAGTCAATCACATGTAGATAATCTTTCCTGGGCTGCCATAATTATTACAAAGTTACCTCTCAAAGCGTCATACACTTTCCAAATAGGAAAGTTTCCTTAAACATCTGCAAGCCTTGAAGAATTTGTATGTACTGCACAGCTCCTAGAATCTTTTTTTTTATCACACATGAATCCTGATTTAGTACTGCCATCAGCAGTACAGAATTGTTCTTGATATGTAAGTGTGAATCGATTTGATCAACACTTTATCCTTTTTCTTATGAACATTCCATAAGAGTTTGCAAAGTGCCGCTTGATTCCAGTATGATTAACAATCCAACCACCTGCGTTTTTTTTGGTAAGCCAACATTTTACCAAGCAATGAGGGATCTAGAAGCTGTGAAGTTTGGATCATTTTCAGAACTTTCTTAGGTATCAAGAAAACTTGGCACGTAGCTTGGATGCTGAAGATATAACTAATTTAGTCAGTTGAATTCCACCAGCATAGCTCAACAACTTGGCTAACCATCTGCCCACTAGTCAATTCTCTACAAATTCTTCTGCATTCAACTTCCTGTATTGCAGTCAACTTCCTACTTGATAAAGGTACTCCGAGTTACCTAAAAGGTAAGGTACCTTTAGTTATACCCTTATACTCATCCGAATCTCTTCCTTGACCCCACCACAATTCACTGAACTTTTACTGAAAATTAACCGTCAAGCCAGAGACAGCAGAAGAGCTTTGAAAAACACCATATAACTTTATCACAGATTCAACATCACCCTTAGCAAAGAGGAGCAAGTCATCCGCAAAGCAAACACAAGTCACGTAAATTTTTAAAATCTAGTATGGAAATGAAATCTTGGATTTTCTTGTAAATGAGATAATCATCTGTTCAAGTACTCCATGCTTAAGACCAAAAGATAGGGAAACACTGTATCTCCCTGCCTAAGGCCTTTTTCTCTTTGAAAGGGCATGATATTTGTCCGTTAACTGTGAAAGTGTAGGATGCAGTGCTAATACATTAAATACACTCCATAACCCAATTTCCTGTAATTTTCCATATCCAAGTGGAAGTAAATGACTGAACTCAACATTCACCTAAAAGAAAGAAATCCATAGTTCAGCCAGCGGAAGTCAGCGAGATAACGGGTTGTCCAACCCCTTTATCCAGATGCCATGTATATCGCACTAGAAGTAACATATTAGAAGTACTTACGGACTTTGTCTATAGGTTAATTTCTATTATTAAGCAGAATTCGAAAGCTTCAACTTTAGCTCAACTTCGAGCTTTAGGGAGTCAACTACCAAAATAATACTCCCTCCGTCCCAGATTAGTTGTTACACTTTCCTTTTTCGTCTGTCCCAGATTAGTTGTTACACTTCTAAATTAGGAGTGACCCTACAATTATTATATTGTCTCTCTCTTCCCACTAATTTTTTTTTGTCCCCACACCATCTTTCTTTCAATTAAAAAATACCCCACTAACTCTTATCACATCTACTTTTTCAATAAAATAACAATTGCTAACCACACTACTACTTATCGCATTAAACTGTGTGCCCATGAGAGTGTAACAACTATTCTGGGACGGAGGGAGTAGCTTTAGAGACCCATCACCCATGCCATCTTGTCCAAAGCAAGAGAGCCCCCGCTCGTTTGAAGTCATTGCGAGTCGGAAGGGGTGCACTAAATTGAGACGGATAGAAGCAGTATTTCATAAAATTATTGAACCTTTTGCTCGGCCAAACAACAATAACAACAACAACAACAACAAAGCCTTAGTCCCAAAATGATTTGGGGTCGGCTAACATGAATCGTCGTAGGAGATCGTTATTGTCACCAATCAAACCAGAAAGGAGCAGTAAGTAAAAAAAGAAAAAGCAAGGAAGAGAAAGTGGAATTAATGAAAGTAAGATAAGAGAAAACACCATTATTTTAACCTCCCGATTTGCATTGAATCCTTGGTGCAACAAATGTTTCATGTCTTTCAACATCAAAAGTGTGTATACTTGGTGTATATATCATGATCAGAAGTACATAGTGCTGGTTAACGGTAGTTCAAATGGAGCAGAAAGCCGCGATGATAACAAAATATGCTTAGACTGGTCTTGGTGGTCTTGGAGTAACTGCTTTTCTATGTTACTAGGCCTTGTATGGTACCCATGCCAATTCAAGTAAACTGGTTGACTAGTTGAGGAAGGAGGGAATCTATATATTATCTTTCTTCCCAATGCTTTAACTCCAGAAAGTACTAACCTTGACATGCTCGCATGCTTGATTCATTTTCTTGAAGATTAGATTTAAAGGTGCCGTTGTGCTAATCCTAATTTGATAAAAAAAGAAGTTAATCATGGGTTTGAGTGAGCTAAACAAAAATTATTAAACATCCTCTGGATGAAAGGATAAGAAATATTTTTTCGATGGTAAGCAGTTTTTGTTGAACTAGTCCAGGTGAACTTCTCAACATTAGAAATTTAGAATCATATGTTCATTTACAGGTTGGCCAAGCACTTGGATTATCAACTGAATGGTCGAATCATCCCCTTTATGTGCCTGGTAAGATACCTTGCCTTGGTAGGTAAAATTCTGAATTTTGCATAATCAGTCTAAAATGGGACAAGTTTTTCTATGATATATCATTTATGCAGGTGGTTTGGAGCTTCTGGATGGGCCAAAGCCTGAAAAGATACTGGTTGGTCCCCGTGTTGGCATCAATTATGCTTTGCCCGAGCATGTAAATGCTTTATGGAGATTTGCTGTTGCAGGTAGCCCATGGATAAGTGCCCCAAAGAACACGCTTAGGTTTCCCTGATAGAAAGAAAATTCTTTCAACCATGTAAGAGGTAACCGAATGAATGCAATCTTAGAGGACAAAAATGGAGCTTACACATCTCTTGTTTATTTACTACTGGGAGAAGGAAGGAAATGTTACGCCTCCCCTCCCCCCCTCCTCCTCCTTTTGTGGTGTTTAGGAATCCAAAAAATTTAGCAGTTGATGATTTAGCTACTAGCTAGAGCTCAAGTTGATGTTGTTTTCCTTGTTTCATTTTCTACTGGAAATTGGAAAACAAGGAGTCCTCATCAATGACATCATATGTACACAGAAATCATATCGTATTCTTTGCCTTGCGTTTTTACTTGAACAAACCAGAAGCAAAAATTCTTCATGCATTGTTGACAATATGGTGGTAATTAGTACTTTAGTAGTGATGGTTTTTAACTTCATTTGAAGTTTTGGAACCCATGGGTTGCACACACAAGCAAAATGACAGTAATATACGCAGAGGCGATTTATTCTATAAGAAATTGGTAATCCCCTGCATGCTTATGGTAAGAGATTGATGTTAATTTACTTTTTTTTAAAAAGTTTTTTCCACCGAGTGGTGTTTACTTGAAAAACGAAAGCAGGTAAAATGTGGTTTTTGTTAGTACCGTTTCTTTATAAGTGAGACATGAACATGGAGCTCCATGGCCTACTCTTGAAAAAAGTACAAAAACACAACTTCAACTCCTTTGATTCAGGTAAAAGTTGAAGTTAAGAGAACAATCGTTCTTTTACTATCCCAAGTGCGCTTCCACTGTTCTACACTGAGTGAGACTGAGAACTCGATTGCAGGTGTAGCTGAATGCCAATTTGCTTTTAATGCAGTAAAGTCTTTTATCTTCTAAGTAGCTCGTAATCCTTTCTTTCCTAACAGAACCCTGTTTATATTGTTCACGGAAGCACATTTTTTTTTTTTGGCAATTTTATAATCAGGTTTTTTTTTTTAAGGAGTCACAATTACTACTTTCGGCCGTATTTTCTAGAAGTTATAATTTTATTATTGGTACTACTCCGG

mRNA sequence

AAAATCCCATGCAAAAAAGTCAATCCTTAACCAAATTCCATAAATCTCTTTTTAATTGTTTATGCTGCTTCTTCTTCTTCCAACTAATATTCAACTCCTCCCTTCTCAACTCCGGTGTAGGTGGGCCCCAATGACCGGACCTGAACGCTCCAAACTGCGCTCCAAATCTAATTCCGGCGAAAACCTATCTCAAGCAAAACTCAAAATTCGCGCCACGAAGTCTAAATCGAAATCGAAATCGAAATCGAAAGTCCCGTTATCTCCAGAACATCCAATTTCCAACATTTCTGAGATTGATCAACAGAATTTGGTTATTCTAGGGCCCGATTTCTGCTCCGTTGATGCTCTTGATCTTGCTCCACTTCTTCTTGGCAAGTACCTTCGTCGTGATGATGTTATACTTCAGATAACCGAGGTGGAAGCATACAGGCCGAATGATTCAGCATGCCATGGTCGCTTTGGCAATACAGCAAGGACAGCTCCTGTTTTTGGACCATGTGGGCATGCGTATGTGTATCTCTGCTATGGTCTTCACATGATGCTTAACATTGTTGCTGATAAGGAGGGTGTTGGTGCAGCTGTATTGATACGTTCTTGTGCTCCTATTAGCGGTTTGACGACTATTCAGCAGCGCCGTGGTCAGATTACCAATAAGCCTGTTCTCCTCGCAGGGCCAGGAAAGGTTGGCCAAGCACTTGGATTATCAACTGAATGGTCGAATCATCCCCTTTATGTGCCTGGTGGTTTGGAGCTTCTGGATGGGCCAAAGCCTGAAAAGATACTGGTTGGTCCCCGTGTTGGCATCAATTATGCTTTGCCCGAGCATGTAAATGCTTTATGGAGATTTGCTGTTGCAGGTAGCCCATGGATAAGTGCCCCAAAGAACACGCTTAGGTTTCCCTGATAGAAAGAAAATTCTTTCAACCATGTAAGAGGTAACCGAATGAATGCAATCTTAGAGGACAAAAATGGAGCTTACACATCTCTTGTTTATTTACTACTGGGAGAAGGAAGGAAATGTTACGCCTCCCCTCCCCCCCTCCTCCTCCTTTTGTGGTGTTTAGGAATCCAAAAAATTTAGCAGTTGATGATTTAGCTACTAGCTAGAGCTCAAGTTGATGTTGTTTTCCTTGTTTCATTTTCTACTGGAAATTGGAAAACAAGGAGTCCTCATCAATGACATCATATGTACACAGAAATCATATCGTATTCTTTGCCTTGCGTTTTTACTTGAACAAACCAGAAGCAAAAATTCTTCATGCATTGTTGACAATATGGTGGTAATTAGTACTTTAGTAGTGATGGTTTTTAACTTCATTTGAAGTTTTGGAACCCATGGGTTGCACACACAAGCAAAATGACAGTAATATACGCAGAGGCGATTTATTCTATAAGAAATTGGTAATCCCCTGCATGCTTATGGTAAGAGATTGATGTTAATTTACTTTTTTTTAAAAAGTTTTTTCCACCGAGTGGTGTTTACTTGAAAAACGAAAGCAGGTAAAATGTGGTTTTTGTTAGTACCGTTTCTTTATAAGTGAGACATGAACATGGAGCTCCATGGCCTACTCTTGAAAAAAGTACAAAAACACAACTTCAACTCCTTTGATTCAGGTAAAAGTTGAAGTTAAGAGAACAATCGTTCTTTTACTATCCCAAGTGCGCTTCCACTGTTCTACACTGAGTGAGACTGAGAACTCGATTGCAGGTGTAGCTGAATGCCAATTTGCTTTTAATGCAGTAAAGTCTTTTATCTTCTAAGTAGCTCGTAATCCTTTCTTTCCTAACAGAACCCTGTTTATATTGTTCACGGAAGCACATTTTTTTTTTTTGGCAATTTTATAATCAGGTTTTTTTTTTTAAGGAGTCACAATTACTACTTTCGGCCGTATTTTCTAGAAGTTATAATTTTATTATTGGTACTACTCCGG

Coding sequence (CDS)

ATGCTGCTTCTTCTTCTTCCAACTAATATTCAACTCCTCCCTTCTCAACTCCGGTGTAGGTGGGCCCCAATGACCGGACCTGAACGCTCCAAACTGCGCTCCAAATCTAATTCCGGCGAAAACCTATCTCAAGCAAAACTCAAAATTCGCGCCACGAAGTCTAAATCGAAATCGAAATCGAAATCGAAAGTCCCGTTATCTCCAGAACATCCAATTTCCAACATTTCTGAGATTGATCAACAGAATTTGGTTATTCTAGGGCCCGATTTCTGCTCCGTTGATGCTCTTGATCTTGCTCCACTTCTTCTTGGCAAGTACCTTCGTCGTGATGATGTTATACTTCAGATAACCGAGGTGGAAGCATACAGGCCGAATGATTCAGCATGCCATGGTCGCTTTGGCAATACAGCAAGGACAGCTCCTGTTTTTGGACCATGTGGGCATGCGTATGTGTATCTCTGCTATGGTCTTCACATGATGCTTAACATTGTTGCTGATAAGGAGGGTGTTGGTGCAGCTGTATTGATACGTTCTTGTGCTCCTATTAGCGGTTTGACGACTATTCAGCAGCGCCGTGGTCAGATTACCAATAAGCCTGTTCTCCTCGCAGGGCCAGGAAAGGTTGGCCAAGCACTTGGATTATCAACTGAATGGTCGAATCATCCCCTTTATGTGCCTGGTGGTTTGGAGCTTCTGGATGGGCCAAAGCCTGAAAAGATACTGGTTGGTCCCCGTGTTGGCATCAATTATGCTTTGCCCGAGCATGTAAATGCTTTATGGAGATTTGCTGTTGCAGGTAGCCCATGGATAAGTGCCCCAAAGAACACGCTTAGGTTTCCCTGA

Protein sequence

MLLLLLPTNIQLLPSQLRCRWAPMTGPERSKLRSKSNSGENLSQAKLKIRATKSKSKSKSKSKVPLSPEHPISNISEIDQQNLVILGPDFCSVDALDLAPLLLGKYLRRDDVILQITEVEAYRPNDSACHGRFGNTARTAPVFGPCGHAYVYLCYGLHMMLNIVADKEGVGAAVLIRSCAPISGLTTIQQRRGQITNKPVLLAGPGKVGQALGLSTEWSNHPLYVPGGLELLDGPKPEKILVGPRVGINYALPEHVNALWRFAVAGSPWISAPKNTLRFP
Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Spo25396.1Spo25396.1mRNA


Homology
BLAST of Spo25396.1 vs. NCBI nr
Match: gi|731323806|ref|XP_010672632.1| (PREDICTED: DNA-3-methyladenine glycosylase [Beta vulgaris subsp. vulgaris])

HSP 1 Score: 441.8 bits (1135), Expect = 8.900e-121
Identity = 223/284 (78.52%), Postives = 244/284 (85.92%), Query Frame = 1

		  

Query: 4   LLLPTNIQLLPSQLRCRWAPMTGPERSK-------LRSKSNSGENLSQAKLKIRATKSKS 63
           +LLPT +QLLPS+L CR  PMT P+ S+       L+S + + EN SQ+KLK R+    S
Sbjct: 1   MLLPTTLQLLPSRLHCRCVPMTKPQPSQRCAKFKSLQSNAITDENPSQSKLKFRS----S 60

Query: 64  KSKSKSKVPLSPEHPISNISEIDQQNLVILGPDFCSVDALDLAPLLLGKYLRRDDVILQI 123
           K K K K PLSPE PI N SE+  QNL ILG +F SVDALDLAPLLLGK++RRDDVILQI
Sbjct: 61  KIKFKLKAPLSPERPIPNTSELIYQNLAILGANFYSVDALDLAPLLLGKFIRRDDVILQI 120

Query: 124 TEVEAYRPNDSACHGRFGNTARTAPVFGPCGHAYVYLCYGLHMMLNIVADKEGVGAAVLI 183
           TEVEAYRPNDSACHGRFG T RTAPVFGP GHAYVYLCYGLHMMLN+VADK+GVGAAVLI
Sbjct: 121 TEVEAYRPNDSACHGRFGITVRTAPVFGPGGHAYVYLCYGLHMMLNVVADKDGVGAAVLI 180

Query: 184 RSCAPISGLTTIQQRRGQITNKPVLLAGPGKVGQALGLSTEWSNHPLYVPGGLELLDGPK 243
           RSCAP+SGL TIQQRRGQIT+KPVLLAGPGKVGQALGLSTEWSNHPLY PGGLELLDGPK
Sbjct: 181 RSCAPVSGLKTIQQRRGQITDKPVLLAGPGKVGQALGLSTEWSNHPLYAPGGLELLDGPK 240

Query: 244 PEKILVGPRVGINYALPEHVNALWRFAVAGSPWISAPKNTLRFP 281
           PEKI+VGPRVGINYALPEHVNALWRFA+AG+PWISAPKNTLR P
Sbjct: 241 PEKIVVGPRVGINYALPEHVNALWRFAIAGTPWISAPKNTLRSP 280

BLAST of Spo25396.1 vs. NCBI nr
Match: gi|870864166|gb|KMT15286.1| (hypothetical protein BVRB_3g059950 [Beta vulgaris subsp. vulgaris])

HSP 1 Score: 416.0 bits (1068), Expect = 5.200e-113
Identity = 210/264 (79.55%), Postives = 228/264 (86.36%), Query Frame = 1

		  

Query: 24  MTGPERSK-------LRSKSNSGENLSQAKLKIRATKSKSKSKSKSKVPLSPEHPISNIS 83
           MT P+ S+       L+S + + EN SQ+KLK R+    SK K K K PLSPE PI N S
Sbjct: 1   MTKPQPSQRCAKFKSLQSNAITDENPSQSKLKFRS----SKIKFKLKAPLSPERPIPNTS 60

Query: 84  EIDQQNLVILGPDFCSVDALDLAPLLLGKYLRRDDVILQITEVEAYRPNDSACHGRFGNT 143
           E+  QNL ILG +F SVDALDLAPLLLGK++RRDDVILQITEVEAYRPNDSACHGRFG T
Sbjct: 61  ELIYQNLAILGANFYSVDALDLAPLLLGKFIRRDDVILQITEVEAYRPNDSACHGRFGIT 120

Query: 144 ARTAPVFGPCGHAYVYLCYGLHMMLNIVADKEGVGAAVLIRSCAPISGLTTIQQRRGQIT 203
            RTAPVFGP GHAYVYLCYGLHMMLN+VADK+GVGAAVLIRSCAP+SGL TIQQRRGQIT
Sbjct: 121 VRTAPVFGPGGHAYVYLCYGLHMMLNVVADKDGVGAAVLIRSCAPVSGLKTIQQRRGQIT 180

Query: 204 NKPVLLAGPGKVGQALGLSTEWSNHPLYVPGGLELLDGPKPEKILVGPRVGINYALPEHV 263
           +KPVLLAGPGKVGQALGLSTEWSNHPLY PGGLELLDGPKPEKI+VGPRVGINYALPEHV
Sbjct: 181 DKPVLLAGPGKVGQALGLSTEWSNHPLYAPGGLELLDGPKPEKIVVGPRVGINYALPEHV 240

Query: 264 NALWRFAVAGSPWISAPKNTLRFP 281
           NALWRFA+AG+PWISAPKNTLR P
Sbjct: 241 NALWRFAIAGTPWISAPKNTLRSP 260

BLAST of Spo25396.1 vs. NCBI nr
Match: gi|902233240|gb|KNA22966.1| (hypothetical protein SOVF_029050, partial [Spinacia oleracea])

HSP 1 Score: 406.8 bits (1044), Expect = 3.200e-110
Identity = 207/207 (100.00%), Postives = 207/207 (100.00%), Query Frame = 1

		  

Query: 1   MLLLLLPTNIQLLPSQLRCRWAPMTGPERSKLRSKSNSGENLSQAKLKIRATKSKSKSKS 60
           MLLLLLPTNIQLLPSQLRCRWAPMTGPERSKLRSKSNSGENLSQAKLKIRATKSKSKSKS
Sbjct: 1   MLLLLLPTNIQLLPSQLRCRWAPMTGPERSKLRSKSNSGENLSQAKLKIRATKSKSKSKS 60

Query: 61  KSKVPLSPEHPISNISEIDQQNLVILGPDFCSVDALDLAPLLLGKYLRRDDVILQITEVE 120
           KSKVPLSPEHPISNISEIDQQNLVILGPDFCSVDALDLAPLLLGKYLRRDDVILQITEVE
Sbjct: 61  KSKVPLSPEHPISNISEIDQQNLVILGPDFCSVDALDLAPLLLGKYLRRDDVILQITEVE 120

Query: 121 AYRPNDSACHGRFGNTARTAPVFGPCGHAYVYLCYGLHMMLNIVADKEGVGAAVLIRSCA 180
           AYRPNDSACHGRFGNTARTAPVFGPCGHAYVYLCYGLHMMLNIVADKEGVGAAVLIRSCA
Sbjct: 121 AYRPNDSACHGRFGNTARTAPVFGPCGHAYVYLCYGLHMMLNIVADKEGVGAAVLIRSCA 180

Query: 181 PISGLTTIQQRRGQITNKPVLLAGPGK 208
           PISGLTTIQQRRGQITNKPVLLAGPGK
Sbjct: 181 PISGLTTIQQRRGQITNKPVLLAGPGK 207

BLAST of Spo25396.1 vs. NCBI nr
Match: gi|568879813|ref|XP_006492840.1| (PREDICTED: DNA-3-methyladenine glycosylase isoform X1 [Citrus sinensis])

HSP 1 Score: 362.5 bits (929), Expect = 6.800e-97
Identity = 178/244 (72.95%), Postives = 204/244 (83.61%), Query Frame = 1

		  

Query: 34  SKSNSGENLSQAKLKIRATKSKSKSKSKSKVPLSPEHPISNISEIDQQNLVILGPDFCSV 93
           ++S + +++ +   + ++   K KSKS  K  ++P  PIS     +  N+ IL   F  +
Sbjct: 25  TESTNDDSIIRTGHRPKSVTVKIKSKSSPKPKIAPSVPISAGPSFE--NMTILPHHFFQI 84

Query: 94  DALDLAPLLLGKYLRRDDVILQITEVEAYRPNDSACHGRFGNTARTAPVFGPCGHAYVYL 153
           DALDLAP LLGK+LRRDDV+LQITEVEAYRPNDSACHGRFG TARTAPVFGP G AYVYL
Sbjct: 85  DALDLAPRLLGKFLRRDDVVLQITEVEAYRPNDSACHGRFGITARTAPVFGPGGLAYVYL 144

Query: 154 CYGLHMMLNIVADKEGVGAAVLIRSCAPISGLTTIQQRRGQITNKPVLLAGPGKVGQALG 213
           CYGLH MLN+VADKEGVGAAVLIRSCAP+SGL TIQQRR Q+T KPVLL GPGKVGQALG
Sbjct: 145 CYGLHTMLNVVADKEGVGAAVLIRSCAPVSGLKTIQQRRAQLTEKPVLLTGPGKVGQALG 204

Query: 214 LSTEWSNHPLYVPGGLELLDGPKPEKILVGPRVGINYALPEHVNALWRFAVAGSPWISAP 273
           +STEWSNHPLY+PGGLE+LDGPKPEKIL+GPRVGI YALPEHVNALWRFA+AG+PWISAP
Sbjct: 205 ISTEWSNHPLYMPGGLEILDGPKPEKILIGPRVGIEYALPEHVNALWRFAIAGTPWISAP 264

Query: 274 KNTL 278
           KNTL
Sbjct: 265 KNTL 266

BLAST of Spo25396.1 vs. NCBI nr
Match: gi|823256402|ref|XP_012460848.1| (PREDICTED: DNA-3-methyladenine glycosylase isoform X1 [Gossypium raimondii])

HSP 1 Score: 362.1 bits (928), Expect = 8.900e-97
Identity = 180/261 (68.97%), Postives = 206/261 (78.93%), Query Frame = 1

		  

Query: 20  RWAPMTGPERSKLRSKSNSGENLSQAKLKIRATKSKSKSKSKSKVPLSPEHPISNISEID 79
           ++ P+    R  +RS     E  SQ+  K    ++++K   K   P S     S +S   
Sbjct: 6   QFKPVASKSRKTVRSSQCLEERRSQSS-KAATVRTRTKPSRKQTRPQS-----SPVSLPC 65

Query: 80  QQNLVILGPDFCSVDALDLAPLLLGKYLRRDDVILQITEVEAYRPNDSACHGRFGNTART 139
            +N  IL P+F  +DALDLAP LLGK+LRRDDV+LQITEVEAYRPNDSACHGRFG TART
Sbjct: 66  FRNTTILPPEFFQIDALDLAPHLLGKFLRRDDVVLQITEVEAYRPNDSACHGRFGVTART 125

Query: 140 APVFGPCGHAYVYLCYGLHMMLNIVADKEGVGAAVLIRSCAPISGLTTIQQRRGQITNKP 199
           APVFGP GHAYVYLCYGLH MLN+VADKEG GAAVLIR+CAP+ GL TIQ+RRGQ T KP
Sbjct: 126 APVFGPGGHAYVYLCYGLHTMLNVVADKEGAGAAVLIRACAPVCGLETIQERRGQQTEKP 185

Query: 200 VLLAGPGKVGQALGLSTEWSNHPLYVPGGLELLDGPKPEKILVGPRVGINYALPEHVNAL 259
           +LL GPGKVGQALG++TEWSNHPLY PGGLELLDGP+P+KIL+GPRVGI YALPEHVNAL
Sbjct: 186 ILLTGPGKVGQALGITTEWSNHPLYTPGGLELLDGPEPDKILIGPRVGIEYALPEHVNAL 245

Query: 260 WRFAVAGSPWISAPKNTLRFP 281
           WRFA+AGSPWISAPKNTLR P
Sbjct: 246 WRFAIAGSPWISAPKNTLRLP 260

BLAST of Spo25396.1 vs. UniProtKB/TrEMBL
Match: A0A0J8FHB0_BETVU (Uncharacterized protein OS=Beta vulgaris subsp. vulgaris GN=BVRB_3g059950 PE=3 SV=1)

HSP 1 Score: 416.0 bits (1068), Expect = 3.600e-113
Identity = 210/264 (79.55%), Postives = 228/264 (86.36%), Query Frame = 1

		  

Query: 24  MTGPERSK-------LRSKSNSGENLSQAKLKIRATKSKSKSKSKSKVPLSPEHPISNIS 83
           MT P+ S+       L+S + + EN SQ+KLK R+    SK K K K PLSPE PI N S
Sbjct: 1   MTKPQPSQRCAKFKSLQSNAITDENPSQSKLKFRS----SKIKFKLKAPLSPERPIPNTS 60

Query: 84  EIDQQNLVILGPDFCSVDALDLAPLLLGKYLRRDDVILQITEVEAYRPNDSACHGRFGNT 143
           E+  QNL ILG +F SVDALDLAPLLLGK++RRDDVILQITEVEAYRPNDSACHGRFG T
Sbjct: 61  ELIYQNLAILGANFYSVDALDLAPLLLGKFIRRDDVILQITEVEAYRPNDSACHGRFGIT 120

Query: 144 ARTAPVFGPCGHAYVYLCYGLHMMLNIVADKEGVGAAVLIRSCAPISGLTTIQQRRGQIT 203
            RTAPVFGP GHAYVYLCYGLHMMLN+VADK+GVGAAVLIRSCAP+SGL TIQQRRGQIT
Sbjct: 121 VRTAPVFGPGGHAYVYLCYGLHMMLNVVADKDGVGAAVLIRSCAPVSGLKTIQQRRGQIT 180

Query: 204 NKPVLLAGPGKVGQALGLSTEWSNHPLYVPGGLELLDGPKPEKILVGPRVGINYALPEHV 263
           +KPVLLAGPGKVGQALGLSTEWSNHPLY PGGLELLDGPKPEKI+VGPRVGINYALPEHV
Sbjct: 181 DKPVLLAGPGKVGQALGLSTEWSNHPLYAPGGLELLDGPKPEKIVVGPRVGINYALPEHV 240

Query: 264 NALWRFAVAGSPWISAPKNTLRFP 281
           NALWRFA+AG+PWISAPKNTLR P
Sbjct: 241 NALWRFAIAGTPWISAPKNTLRSP 260

BLAST of Spo25396.1 vs. UniProtKB/TrEMBL
Match: A0A0K9RTU4_SPIOL (Uncharacterized protein (Fragment) OS=Spinacia oleracea GN=SOVF_029050 PE=4 SV=1)

HSP 1 Score: 406.8 bits (1044), Expect = 2.200e-110
Identity = 207/207 (100.00%), Postives = 207/207 (100.00%), Query Frame = 1

		  

Query: 1   MLLLLLPTNIQLLPSQLRCRWAPMTGPERSKLRSKSNSGENLSQAKLKIRATKSKSKSKS 60
           MLLLLLPTNIQLLPSQLRCRWAPMTGPERSKLRSKSNSGENLSQAKLKIRATKSKSKSKS
Sbjct: 1   MLLLLLPTNIQLLPSQLRCRWAPMTGPERSKLRSKSNSGENLSQAKLKIRATKSKSKSKS 60

Query: 61  KSKVPLSPEHPISNISEIDQQNLVILGPDFCSVDALDLAPLLLGKYLRRDDVILQITEVE 120
           KSKVPLSPEHPISNISEIDQQNLVILGPDFCSVDALDLAPLLLGKYLRRDDVILQITEVE
Sbjct: 61  KSKVPLSPEHPISNISEIDQQNLVILGPDFCSVDALDLAPLLLGKYLRRDDVILQITEVE 120

Query: 121 AYRPNDSACHGRFGNTARTAPVFGPCGHAYVYLCYGLHMMLNIVADKEGVGAAVLIRSCA 180
           AYRPNDSACHGRFGNTARTAPVFGPCGHAYVYLCYGLHMMLNIVADKEGVGAAVLIRSCA
Sbjct: 121 AYRPNDSACHGRFGNTARTAPVFGPCGHAYVYLCYGLHMMLNIVADKEGVGAAVLIRSCA 180

Query: 181 PISGLTTIQQRRGQITNKPVLLAGPGK 208
           PISGLTTIQQRRGQITNKPVLLAGPGK
Sbjct: 181 PISGLTTIQQRRGQITNKPVLLAGPGK 207

BLAST of Spo25396.1 vs. UniProtKB/TrEMBL
Match: A0A0D2V3S9_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_012G099400 PE=3 SV=1)

HSP 1 Score: 362.1 bits (928), Expect = 6.200e-97
Identity = 180/261 (68.97%), Postives = 206/261 (78.93%), Query Frame = 1

		  

Query: 20  RWAPMTGPERSKLRSKSNSGENLSQAKLKIRATKSKSKSKSKSKVPLSPEHPISNISEID 79
           ++ P+    R  +RS     E  SQ+  K    ++++K   K   P S     S +S   
Sbjct: 6   QFKPVASKSRKTVRSSQCLEERRSQSS-KAATVRTRTKPSRKQTRPQS-----SPVSLPC 65

Query: 80  QQNLVILGPDFCSVDALDLAPLLLGKYLRRDDVILQITEVEAYRPNDSACHGRFGNTART 139
            +N  IL P+F  +DALDLAP LLGK+LRRDDV+LQITEVEAYRPNDSACHGRFG TART
Sbjct: 66  FRNTTILPPEFFQIDALDLAPHLLGKFLRRDDVVLQITEVEAYRPNDSACHGRFGVTART 125

Query: 140 APVFGPCGHAYVYLCYGLHMMLNIVADKEGVGAAVLIRSCAPISGLTTIQQRRGQITNKP 199
           APVFGP GHAYVYLCYGLH MLN+VADKEG GAAVLIR+CAP+ GL TIQ+RRGQ T KP
Sbjct: 126 APVFGPGGHAYVYLCYGLHTMLNVVADKEGAGAAVLIRACAPVCGLETIQERRGQQTEKP 185

Query: 200 VLLAGPGKVGQALGLSTEWSNHPLYVPGGLELLDGPKPEKILVGPRVGINYALPEHVNAL 259
           +LL GPGKVGQALG++TEWSNHPLY PGGLELLDGP+P+KIL+GPRVGI YALPEHVNAL
Sbjct: 186 ILLTGPGKVGQALGITTEWSNHPLYTPGGLELLDGPEPDKILIGPRVGIEYALPEHVNAL 245

Query: 260 WRFAVAGSPWISAPKNTLRFP 281
           WRFA+AGSPWISAPKNTLR P
Sbjct: 246 WRFAIAGSPWISAPKNTLRLP 260

BLAST of Spo25396.1 vs. UniProtKB/TrEMBL
Match: V4SVX9_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10012466mg PE=3 SV=1)

HSP 1 Score: 359.8 bits (922), Expect = 3.100e-96
Identity = 177/244 (72.54%), Postives = 203/244 (83.20%), Query Frame = 1

		  

Query: 34  SKSNSGENLSQAKLKIRATKSKSKSKSKSKVPLSPEHPISNISEIDQQNLVILGPDFCSV 93
           ++S + +++ +   + ++     KSKS  K  ++P  PIS     +  N+ IL   F  +
Sbjct: 25  TESTNDDSIIRTGHRPKSVTVDIKSKSSPKPKIAPSVPISAGPSFE--NMTILPHHFFQI 84

Query: 94  DALDLAPLLLGKYLRRDDVILQITEVEAYRPNDSACHGRFGNTARTAPVFGPCGHAYVYL 153
           DALDLAP LLGK+LRRDDV+LQITEVEAYRPNDSACHGRFG TARTAPVFGP G AYVYL
Sbjct: 85  DALDLAPRLLGKFLRRDDVLLQITEVEAYRPNDSACHGRFGITARTAPVFGPGGLAYVYL 144

Query: 154 CYGLHMMLNIVADKEGVGAAVLIRSCAPISGLTTIQQRRGQITNKPVLLAGPGKVGQALG 213
           CYGLH MLN+VADKEGVGAAVLIRSCAP+SGL TIQQRR Q+T KPVLL GPGKVGQALG
Sbjct: 145 CYGLHTMLNVVADKEGVGAAVLIRSCAPVSGLKTIQQRRAQLTEKPVLLTGPGKVGQALG 204

Query: 214 LSTEWSNHPLYVPGGLELLDGPKPEKILVGPRVGINYALPEHVNALWRFAVAGSPWISAP 273
           +STEWSNHPLY+PGGLE+LDGPKPEKIL+GPRVGI YALPEHVNALWRFA+AG+PWISAP
Sbjct: 205 ISTEWSNHPLYMPGGLEILDGPKPEKILIGPRVGIEYALPEHVNALWRFAIAGTPWISAP 264

Query: 274 KNTL 278
           KNTL
Sbjct: 265 KNTL 266

BLAST of Spo25396.1 vs. UniProtKB/TrEMBL
Match: B9SWJ1_RICCO (DNA-3-methyladenine glycosylase, putative OS=Ricinus communis GN=RCOM_0278080 PE=3 SV=1)

HSP 1 Score: 359.4 bits (921), Expect = 4.000e-96
Identity = 181/258 (70.16%), Postives = 203/258 (78.68%), Query Frame = 1

		  

Query: 24  MTGPERSKLRSKSNSGENLSQAKLKIRATKSKSKSKSKSKVPLSPE-HPISNISEIDQQN 83
           M    R K  +KS     L+   + ++      K KSK + P +P  H + N + +  + 
Sbjct: 1   MKRSRRFKRVAKSIKPIQLNDPNINVQPDSITVKDKSKPRQPKTPVLHSVPNSNPLSFEK 60

Query: 84  LVILGPDFCSVDALDLAPLLLGKYLRRDDVILQITEVEAYRPNDSACHGRFGNTARTAPV 143
             IL   F  +DALDLAP LLGK+LRRDDVILQITEVEAYRPNDSACHGRFG TARTAPV
Sbjct: 61  SRILPSQFFQIDALDLAPRLLGKFLRRDDVILQITEVEAYRPNDSACHGRFGVTARTAPV 120

Query: 144 FGPCGHAYVYLCYGLHMMLNIVADKEGVGAAVLIRSCAPISGLTTIQQRRGQITNKPVLL 203
           FGP GHAYVYLCYGLH MLN+VADKEGVGAAVLIRSCAPISGL TIQQRRGQ T KPVLL
Sbjct: 121 FGPGGHAYVYLCYGLHTMLNVVADKEGVGAAVLIRSCAPISGLDTIQQRRGQKTEKPVLL 180

Query: 204 AGPGKVGQALGLSTEWSNHPLYVPGGLELLDGPKPEKILVGPRVGINYALPEHVNALWRF 263
            GPGK+GQALG+STEWS+HPLY PGGLELLDGP+P+ ILVGPR+GI YALPEHVNALWRF
Sbjct: 181 TGPGKIGQALGISTEWSSHPLYSPGGLELLDGPEPKNILVGPRIGIEYALPEHVNALWRF 240

Query: 264 AVAGSPWISAPKNTLRFP 281
           A+A +PWISAPKNTLR P
Sbjct: 241 AIADTPWISAPKNTLRPP 258

BLAST of Spo25396.1 vs. ExPASy Swiss-Prot
Match: 3MG_ARATH (DNA-3-methyladenine glycosylase OS=Arabidopsis thaliana GN=MAG PE=2 SV=1)

HSP 1 Score: 338.6 bits (867), Expect = 6.600e-92
Identity = 168/256 (65.62%), Postives = 202/256 (78.91%), Query Frame = 1

		  

Query: 24  MTGPERSKLRSKSNSGENLSQAKLKIRATKSKSKSKSKSKVPLSPEHPISNISEIDQQNL 83
           M  P R   R      E     ++ +R  K+       ++V   P++P++  +   +  +
Sbjct: 1   MKTPARRSKRVNQEESETNVTTRVVLRTRKTNCSKTRAARV--RPDYPLTRTTS--ESEM 60

Query: 84  VILGPDFCSVDALDLAPLLLGKYLRRDDVILQITEVEAYRPNDSACHGRFGNTARTAPVF 143
            ++ P+F  +DALDLAP LLGK++RRD+V+L+ITEVEAYRPNDSACHGRFG T RTAPVF
Sbjct: 61  KLMPPEFFQIDALDLAPRLLGKFMRRDNVVLRITEVEAYRPNDSACHGRFGVTPRTAPVF 120

Query: 144 GPCGHAYVYLCYGLHMMLNIVADKEGVGAAVLIRSCAPISGLTTIQQRRGQITNKPVLLA 203
           GP GHAYVYLCYGLHMMLNIVADKEGVGAAVLIRSC+P+SG+ TIQ+RRG  T+KPVLL 
Sbjct: 121 GPGGHAYVYLCYGLHMMLNIVADKEGVGAAVLIRSCSPVSGMETIQERRGLKTDKPVLLN 180

Query: 204 GPGKVGQALGLSTEWSNHPLYVPGGLELLDGPKP-EKILVGPRVGINYALPEHVNALWRF 263
           GPGKVGQALGLSTEWS+HPLY PGGLELLDG +  EK++VGPRVGI+YALPEHVNALWRF
Sbjct: 181 GPGKVGQALGLSTEWSHHPLYSPGGLELLDGGEDVEKVMVGPRVGIDYALPEHVNALWRF 240

Query: 264 AVAGSPWISAPKNTLR 279
           AVA +PWISAPKNTL+
Sbjct: 241 AVADTPWISAPKNTLK 252

BLAST of Spo25396.1 vs. ExPASy Swiss-Prot
Match: 3MGH_NITOC (Putative 3-methyladenine DNA glycosylase OS=Nitrosococcus oceani (strain ATCC 19707 / NCIMB 11848) GN=Noc_0079 PE=3 SV=1)

HSP 1 Score: 214.5 bits (545), Expect = 1.400e-54
Identity = 107/189 (56.61%), Postives = 135/189 (71.43%), Query Frame = 1

		  

Query: 85  ILGPDFCSVDALDLAPLLLGKYLRRDDVILQITEVEAYR-PNDSACHGRFGNTARTAPVF 144
           +L P F + DAL++A  LLG  L R+ V+L+ITEVEAYR P D+A HGR G T R  P++
Sbjct: 4   LLPPRFYARDALEVAADLLGASLCREQVVLRITEVEAYRWPEDTANHGRHGQTLRNEPLW 63

Query: 145 GPCGHAYVYLCYGLHMMLNIVADKEGVGAAVLIRSCAPISGLTTIQQRR-GQITNKPVLL 204
           GP G  Y+YLCYG+H +LN+V  +EG  AAVLIR+C P++GL  IQ+RR G+I  KP LL
Sbjct: 64  GPPGRVYLYLCYGIHHLLNLVTGEEGQAAAVLIRACEPVAGLDLIQRRRRGKI--KPGLL 123

Query: 205 AGPGKVGQALGLSTEWSNHPLYVPGGLELLDGPKPEKILVGPRVGINYALPEHVNALWRF 264
            GPGKVG ALGL   W++HPLY PGGLE+  G     +L GPRVGI YA PEH +A WR 
Sbjct: 124 TGPGKVGAALGLDLSWNHHPLYEPGGLEVRRGTPVAALLAGPRVGIAYAHPEHRDAPWRL 183

Query: 265 AVAGSPWIS 272
           A+  +PW+S
Sbjct: 184 AIPDNPWVS 190

BLAST of Spo25396.1 vs. ExPASy Swiss-Prot
Match: 3MGH_NITMU (Putative 3-methyladenine DNA glycosylase OS=Nitrosospira multiformis (strain ATCC 25196 / NCIMB 11849) GN=Nmul_A1264 PE=3 SV=1)

HSP 1 Score: 128.6 bits (322), Expect = 1.000e-28
Identity = 72/182 (39.56%), Postives = 108/182 (59.34%), Query Frame = 1

		  

Query: 92  SVDALDLAPLLLGKYLRRDDVILQITEVEAYRPNDSACHGRFGNTARTAPVFGPCGHAYV 151
           S  ++D+A  L+G  L  + V  +I E EAY  +D A H   G T R   +FGP  HAY+
Sbjct: 11  SASSVDVARSLIGATLLVNGVGGRIVETEAYDHDDPASHSFSGPTRRNQVMFGPPCHAYI 70

Query: 152 YLCYGLHMMLNIVADKEGVGAAVLIRSCAPISGLTTIQQRRGQITNKPVLLAGPGKVGQA 211
           Y  YG+H  LN V    G GA VLIR+  P+ GL  +++RRG ++++ +L +GPG+V +A
Sbjct: 71  YRSYGIHWCLNFVCRPAGHGAGVLIRAIEPLVGLDIMRKRRG-LSDERLLCSGPGRVCEA 130

Query: 212 LGLSTEWSNHPLYVPGGLELLDGPKPEKILVGPRVGINYALPEHVNALWRFAVAGSPWIS 271
           LG++ E+S   +  P   +L     P  ++ GPR+GI+ A     +  WRF +AGSP++S
Sbjct: 131 LGITQEYSGMSIDTP-PFQLTPPLDPVPVVTGPRIGISKA----KDVPWRFGLAGSPFLS 186

Query: 272 AP 274
            P
Sbjct: 191 RP 186

BLAST of Spo25396.1 vs. ExPASy Swiss-Prot
Match: 3MGH_CHLTE (Putative 3-methyladenine DNA glycosylase OS=Chlorobium tepidum (strain ATCC 49652 / DSM 12025 / NBRC 103806 / TLS) GN=CT1853 PE=3 SV=1)

HSP 1 Score: 128.3 bits (321), Expect = 1.400e-28
Identity = 78/193 (40.41%), Postives = 108/193 (55.96%), Query Frame = 1

		  

Query: 86  LGPDFCSVDALDLAPLLLGKYLRRDDVILQIT-----EVEAYRPN-DSACHGRFGNTART 145
           LG DF  +  + LA  LLGK     +V  ++T     E EAY  + D ACH   G T R 
Sbjct: 4   LGADFYQMPTILLAERLLGKIFVHHEVSGRVTKGRIVETEAYLGDGDEACHAWRGMTERN 63

Query: 146 APVFGPCGHAYVYLCYGLHMMLNIVADKEGVGAAVLIRSCAPISGLTTIQQRRGQITNKP 205
             +FGP GH Y+Y  YG H + NIV++++G+  AVL+R+  PI G+  +Q+RRG  T++ 
Sbjct: 64  HVMFGPPGHLYIYFSYGCHYLANIVSEQKGIAGAVLLRAMEPIEGIEWMQERRGT-TDER 123

Query: 206 VLLAGPGKVGQALGLS-TEWSNHPLYVPGGLELLDGPKPEKILVGPRVGINYALPEHVNA 265
            L++GPGK+ QALGL    +    L     LE      PE I   PR+GI+       + 
Sbjct: 124 ALMSGPGKLTQALGLGPAHYGESLLGDICWLEEAPDIPPELIGTSPRIGIS----RSTDL 183

Query: 266 LWRFAVAGSPWIS 272
           LWR  +AGSP+IS
Sbjct: 184 LWRKFIAGSPYIS 191

BLAST of Spo25396.1 vs. ExPASy Swiss-Prot
Match: 3MGH_METAR (Putative 3-methyladenine DNA glycosylase OS=Methanocella arvoryzae (strain DSM 22066 / NBRC 105507 / MRE50) GN=UNCMA_17850 PE=3 SV=1)

HSP 1 Score: 127.1 bits (318), Expect = 3.000e-28
Identity = 77/192 (40.10%), Postives = 105/192 (54.69%), Query Frame = 1

		  

Query: 85  ILGPDFCSVDALDLAPLLLGKYLRRD----DVILQITEVEAY-RPNDSACHGRFGNTART 144
           +L  DF     L++A  LLGK L R      V L+I E EAY   ND ACH   G TAR 
Sbjct: 3   VLPRDFYDRPTLEVARDLLGKTLVRQLPAGRVALRIVETEAYIGENDKACHASKGMTARN 62

Query: 145 APVFGPCGHAYVYLCYGLHMMLNIVADKEGVGAAVLIRSCAPISGLTTIQQRRGQITNKP 204
             +FG  GHAYVYL YG++  LN+V +K+G  AAVLIR+  PI G   +   R +     
Sbjct: 63  RVMFGQPGHAYVYLIYGMYNCLNLVTEKDGYPAAVLIRAGEPIEGEEIMSSLRPKARKHH 122

Query: 205 VLLAGPGKVGQALGLSTEWSNHPLYVPGGLELLDGPKPEKILVGPRVGINYALPEHVNAL 264
            + +GPGK+  A+ ++   +   +   G L + DGP  +KI+  PR+G++YA  E     
Sbjct: 123 EIASGPGKLCGAMSITRALNGADVCASGELYVEDGPAVKKIVACPRIGVDYA-GEDALRP 182

Query: 265 WRFAVAGSPWIS 272
           WRF    SP +S
Sbjct: 183 WRFYDKNSPCVS 193

BLAST of Spo25396.1 vs. TAIR (Arabidopsis)
Match: AT3G12040.1 (DNA-3-methyladenine glycosylase (MAG))

HSP 1 Score: 338.6 bits (867), Expect = 3.700e-93
Identity = 168/256 (65.62%), Postives = 202/256 (78.91%), Query Frame = 1

		  

Query: 24  MTGPERSKLRSKSNSGENLSQAKLKIRATKSKSKSKSKSKVPLSPEHPISNISEIDQQNL 83
           M  P R   R      E     ++ +R  K+       ++V   P++P++  +   +  +
Sbjct: 1   MKTPARRSKRVNQEESETNVTTRVVLRTRKTNCSKTRAARV--RPDYPLTRTTS--ESEM 60

Query: 84  VILGPDFCSVDALDLAPLLLGKYLRRDDVILQITEVEAYRPNDSACHGRFGNTARTAPVF 143
            ++ P+F  +DALDLAP LLGK++RRD+V+L+ITEVEAYRPNDSACHGRFG T RTAPVF
Sbjct: 61  KLMPPEFFQIDALDLAPRLLGKFMRRDNVVLRITEVEAYRPNDSACHGRFGVTPRTAPVF 120

Query: 144 GPCGHAYVYLCYGLHMMLNIVADKEGVGAAVLIRSCAPISGLTTIQQRRGQITNKPVLLA 203
           GP GHAYVYLCYGLHMMLNIVADKEGVGAAVLIRSC+P+SG+ TIQ+RRG  T+KPVLL 
Sbjct: 121 GPGGHAYVYLCYGLHMMLNIVADKEGVGAAVLIRSCSPVSGMETIQERRGLKTDKPVLLN 180

Query: 204 GPGKVGQALGLSTEWSNHPLYVPGGLELLDGPKP-EKILVGPRVGINYALPEHVNALWRF 263
           GPGKVGQALGLSTEWS+HPLY PGGLELLDG +  EK++VGPRVGI+YALPEHVNALWRF
Sbjct: 181 GPGKVGQALGLSTEWSHHPLYSPGGLELLDGGEDVEKVMVGPRVGIDYALPEHVNALWRF 240

Query: 264 AVAGSPWISAPKNTLR 279
           AVA +PWISAPKNTL+
Sbjct: 241 AVADTPWISAPKNTLK 252

The following BLAST results are available for this feature:
BLAST of Spo25396.1 vs. NCBI nr
Analysis Date: 2018-06-29 (blastp Spinacia oleracea peptides vs. NCBI nr)
Total hits: 5
Match NameE-valueIdentityDescription
gi|731323806|ref|XP_010672632.1|8.9e-12178.5PREDICTED: DNA-3-methyladenine... [more]
gi|870864166|gb|KMT15286.1|5.2e-11379.5hypothetical protein BVRB_3g05... [more]
gi|902233240|gb|KNA22966.1|3.2e-110100.hypothetical protein SOVF_0290... [more]
gi|568879813|ref|XP_006492840.1|6.8e-9772.9PREDICTED: DNA-3-methyladenine... [more]
gi|823256402|ref|XP_012460848.1|8.9e-9768.9PREDICTED: DNA-3-methyladenine... [more]
back to top
BLAST of Spo25396.1 vs. UniProtKB/TrEMBL
Analysis Date: 2018-06-29 (blastp Spinacia oleracea peptides vs. UniprotKB/TrEMBL)
Total hits: 5
Match NameE-valueIdentityDescription
A0A0J8FHB0_BETVU3.6e-11379.5Uncharacterized protein OS=Bet... [more]
A0A0K9RTU4_SPIOL2.2e-110100.Uncharacterized protein (Fragm... [more]
A0A0D2V3S9_GOSRA6.2e-9768.9Uncharacterized protein OS=Gos... [more]
V4SVX9_9ROSI3.1e-9672.5Uncharacterized protein OS=Cit... [more]
B9SWJ1_RICCO4.0e-9670.1DNA-3-methyladenine glycosylas... [more]
back to top
BLAST of Spo25396.1 vs. ExPASy Swiss-Prot
Analysis Date: 2018-06-29 (blastp Spinacia oleracea peptides vs. ExPASy SwissProt)
Total hits: 5
Match NameE-valueIdentityDescription
3MG_ARATH6.6e-9265.6DNA-3-methyladenine glycosylas... [more]
3MGH_NITOC1.4e-5456.6Putative 3-methyladenine DNA g... [more]
3MGH_NITMU1.0e-2839.5Putative 3-methyladenine DNA g... [more]
3MGH_CHLTE1.4e-2840.4Putative 3-methyladenine DNA g... [more]
3MGH_METAR3.0e-2840.1Putative 3-methyladenine DNA g... [more]
back to top
BLAST of Spo25396.1 vs. TAIR (Arabidopsis)
Analysis Date: 2018-06-29 (blastp Spinacia oleracea peptides vs. TAIR)
Total hits: 1
Match NameE-valueIdentityDescription
AT3G12040.13.7e-9365.6DNA-3-methyladenine glycosylas... [more]
back to top
InterPro
Analysis Name: InterPro Annotations of S. oleracea
Date Performed: 2018-06-29
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR003180Methylpurine-DNA glycosylase (MPG)GENE3D3.10.300.10coord: 85..275
score: 3.5
IPR003180Methylpurine-DNA glycosylase (MPG)HAMAPMF_005273MGHcoord: 88..271
score: 26
IPR003180Methylpurine-DNA glycosylase (MPG)PANTHERPTHR10429DNA-3-METHYLADENINE GLYCOSYLASEcoord: 63..278
score: 1.8
IPR003180Methylpurine-DNA glycosylase (MPG)PFAMPF02245Pur_DNA_glycocoord: 90..268
score: 1.6
IPR003180Methylpurine-DNA glycosylase (MPG)TIGRFAMsTIGR00567TIGR00567coord: 86..271
score: 5.5
IPR011034Formyl transferase, C-terminal-likeunknownSSF50486FMT C-terminal domain-likecoord: 84..276
score: 2.9

GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006284 base-excision repair
biological_process GO:0009909 regulation of flower development
biological_process GO:0006144 purine nucleobase metabolic process
biological_process GO:0006206 pyrimidine nucleobase metabolic process
biological_process GO:0051301 cell division
biological_process GO:0048825 cotyledon development
biological_process GO:0009880 embryonic pattern specification
biological_process GO:0032264 IMP salvage
biological_process GO:0019915 lipid storage
biological_process GO:0009640 photomorphogenesis
biological_process GO:0010072 primary shoot apical meristem specification
biological_process GO:0016567 protein ubiquitination
biological_process GO:0045595 regulation of cell differentiation
biological_process GO:0009737 response to abscisic acid
biological_process GO:0042967 obsolete acyl-carrier-protein biosynthetic process
biological_process GO:0050826 response to freezing
biological_process GO:0010162 seed dormancy process
biological_process GO:0009845 seed germination
biological_process GO:0010182 sugar mediated signaling pathway
biological_process GO:0010228 vegetative to reproductive phase transition of meristem
biological_process GO:0006508 proteolysis
biological_process GO:0005975 carbohydrate metabolic process
biological_process GO:0008152 metabolic process
biological_process GO:0006094 gluconeogenesis
biological_process GO:0006096 glycolytic process
biological_process GO:0001522 pseudouridine synthesis
biological_process GO:0042254 ribosome biogenesis
biological_process GO:0006351 transcription, DNA-templated
biological_process GO:0006855 drug transmembrane transport
biological_process GO:0006139 nucleobase-containing compound metabolic process
biological_process GO:0051026 chiasma assembly
biological_process GO:0000911 cytokinesis by cell plate formation
biological_process GO:0032508 DNA duplex unwinding
biological_process GO:0006302 double-strand break repair
biological_process GO:0000278 mitotic cell cycle
biological_process GO:0006312 mitotic recombination
biological_process GO:0010564 regulation of cell cycle process
biological_process GO:0007062 sister chromatid cohesion
biological_process GO:0022618 ribonucleoprotein complex assembly
biological_process GO:0042138 meiotic DNA double-strand break formation
cellular_component GO:0031307 integral component of mitochondrial outer membrane
cellular_component GO:0005657 replication fork
cellular_component GO:0000785 chromatin
cellular_component GO:0005783 endoplasmic reticulum
cellular_component GO:0005829 cytosol
cellular_component GO:0016021 integral component of membrane
cellular_component GO:0005886 plasma membrane
cellular_component GO:0005774 vacuolar membrane
cellular_component GO:0005730 nucleolus
cellular_component GO:0031225 anchored component of membrane
cellular_component GO:0005634 nucleus
cellular_component GO:0016020 membrane
molecular_function GO:0004553 hydrolase activity, hydrolyzing O-glycosyl compounds
molecular_function GO:0043916 DNA-7-methylguanine glycosylase activity
molecular_function GO:0030246 carbohydrate binding
molecular_function GO:0008568 microtubule-severing ATPase activity
molecular_function GO:0052821 DNA-7-methyladenine glycosylase activity
molecular_function GO:0003677 DNA binding
molecular_function GO:0052822 DNA-3-methylguanine glycosylase activity
molecular_function GO:0097159 organic cyclic compound binding
molecular_function GO:1901363 heterocyclic compound binding
molecular_function GO:0003682 chromatin binding
molecular_function GO:0004034 aldose 1-epimerase activity
molecular_function GO:0008725 DNA-3-methyladenine glycosylase activity
molecular_function GO:0008270 zinc ion binding
molecular_function GO:0008236 serine-type peptidase activity
molecular_function GO:0005515 protein binding
molecular_function GO:0043424 protein histidine kinase binding
molecular_function GO:0008026 ATP-dependent helicase activity
molecular_function GO:0003899 DNA-directed 5'-3' RNA polymerase activity
molecular_function GO:0005488 binding
molecular_function GO:0016887 ATPase activity
molecular_function GO:0005525 GTP binding
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0003678 DNA helicase activity
molecular_function GO:0005524 ATP binding
molecular_function GO:0046872 metal ion binding
molecular_function GO:0003905 alkylbase DNA N-glycosylase activity
molecular_function GO:0008080 N-acetyltransferase activity
molecular_function GO:0003824 catalytic activity
molecular_function GO:0015238 drug transmembrane transporter activity
molecular_function GO:0015297 antiporter activity
molecular_function GO:0003723 RNA binding
molecular_function GO:0003876 AMP deaminase activity
molecular_function GO:0009982 pseudouridine synthase activity
RNA-Seq Expression