You are on page 1of 18

A = promotor

1 accattagaa aatcacgaaa agagctatca acatggccta aagtgaccaa tgaattggag
61 tttatttcac ggcgttgtga aatatgctcc aactttaacc caaaaaccca cacacactca
121 ttgccaatgt attaatgttt ttttttattt ttatttatac ttgggggaga tgtgtatttt
181 tttagtgggg cgaatgttct agaaagctct ttaaagtacc tcgatttgtg tgggaatcta
241 acgctcgttg tttgcccctt cctcgcaata gttgtttgtg aatatgttgc gatgtatacg
301 tatccacgat ttcagtgcat cttcactacc aaatccaaga agaagaaact taaaattcat
361 caatggaaaa gtttttgcaa tccccctgtc tcgctcactt gctctggccc cctcaatcgc
421 cgctctcttt ccgccgcaag gcgcacaaca agtggcaaac cgaagcggcg gcgtcgtctt
481 caaaccagtt ttactgtagc gttgttgttg ctgcttcagc ttacactgaa aaaataacgc
541 agtcaataac gcaaattagt ccatagcttg ttggtaccat ttcgtttgta tattttataa
601 gcacattttg taaacaggaa ttaatggaga atacgcaaca caaatctaag gcacgatttc
661 taaaaaagtt taaatgatgt ggtgattaat gtttgtttat ttatcacgac acttaaattg
721 ttctctgtgt aattctgcgg ctgctgcgtc tccgcgataa ttggtgcgga ggttgcccct
781 tcagccctcc ccccaaactc ctccatctac cattccatct accgttccgc cgatcagtat
841 ttgcttttat gggcaaaagt tcttcaatga gtggtcacag catttgtgaa gggagggggg
901 cggagggggc tgcggtacgt gcatgcaatt ccttattggc gacaatgttg aaatgtgtgc
961 gcctgtttga aggaaaacac cagagatcaa accaatagct ttaaaacttt gattgccttt
1021 tgattaaagc catgtggaac ttagtgccat caccgttatt gggcagattc taattggttt
1081 tccaagtaag atgctgaaaa tgcaattttt tctcgtaacg ttaatcggtg aacagggaat
1141 ccccctcatt gatctcccag cgcagtggag ccaaagtcct aagttttttc agcatggatg
1201 acagtgccta ttagtgtgac cgctggtgtt tgtggtggct tgggccaaaa gtgaattatt
1261 catgttgttg ctgctgtcgc tctgccagtg gaaaattccc aaaaaaaatt cgcatgaact
1321 gcgccggctg ctcctccgct cctccgctgc cccacatgct tctccttcca ctcgttcttc
1381 ttcttctttg agtgcccccc aaaggcaaaa agtcggagtc gaagttactt ggtgtgttag
1441 ttgttgttgc tcttgcccac tcattcgctt cggttttttt ttattttctt ttcttgtctt
1501 taacattttt tctttttata aaaatagcac agacgaagaa atatttgcac acacacacac
1561 acaccacgca cacacgtgta ttgtagcatt tgctaatttt cttcgtttgt ttcgcttggc
1621 gttttttgtt tgttgattcg cctattgcct ccgtcttccc tctcgctctc cgccactttt
1681 atattttcaa agacagcaat ttgtatcttt agtgcgggga tcacttttct ctcttaaacc
1741 cgactttctc acgctctctt ccggcgctct ttcgtctcct tatccaccac ccacccacgc
1801 cctaaatgtg tcaatggttc aaggcggacg aatgaataat caaagaagaa ttaactcgga
1861 gcttttcaca ttaaaaagct tctcaattgt ggagctttgc tttcgtcaga cttttttcgg
1921 gggccaaaac tgcatgacgt gccgccaaaa tcaataaaca tagtattgtt tcgcactttg
1981 tgtatacata ctatgtactt tcacacttgt tcttcccatc caatttatgc aagatcatat
2041 gcttagttct tagtactgcc aaatccttac aaaaaaaagt cgggtttaat gcggctgtca
2101 ccgcctctta tctttcattt ccctccaata tgcctcccac ttgcaggcag ttgactttgt
2161 ttcactggaa gccacgcgtt gcttgtcgat ttgatcctct tataaatatt tgcacgtctc
2221 gatttacatt gtttgcttgt tgtttttgta tgtatggcat aatgacgttt acgatgacgc
2281 gcttgaaacg actaaaagtc ggaattatat ggctggcccc ccgtcgattt ctcacgccca
2341 tttttggaag tgaatcgcgg ctttggcatt tttggtatcc agaaatgttg tcgccactgc
2401 aaatacgaac aaaatataca taataacatg cggaatctgg cccatggcgt cgtaaaccgg
2461 tgcaacctta atgacaaatt gcaaatgtga atagagtgcg tttcataatg cacttccggt
2521 cataaaatat aataattcct aaggcgaatt aattgtttat caatcaattg ttttgataca
2581 acaatttgtc aataaagaag cacacatggg caccaagaat ctgctgacgt tgtcgcattc
2641 ttgttatcga tcaattaacg tttttcactt caatttgcaa taaagtaatt cttacgataa
2701 taatgacagg gcaggacggg atggtacgta ttcgaatcgg taatatgagg caagggaaag
2761 gaagACGGgt aaaaataaag ttaacgatat ggcagtaaac aatgtgggtt ggacacaagc
2821 aagctgtcca agaagccact tgatgcgacc gacaacttgg gttataaaac ttcgacaaca
2881 gttttcacag cacgcgctgt tccattccgt tttattccgt tcgaaaattg ctttttcgct
2941 caattaatta acttattttt tattgcacac aattctatta aatgtgtgtt gtctgttgat
3001 tggattgctg ggattgtgtt gggagcatac aaatctattg gagaatgtgc cccccatcat
3061 gctggcacta aaatcaactt aatagctgca cacagcttgg gaacatgttc attgtgtgga
3121 tttcttcagg acttactctt cctgtatgac gggaattaac tcgtcgatat ccactcgtcg
3181 tttgttcagt aactgtcatc gggtggtggt ttcgattgaa ttgattgaca gcgagtgaca
3241 agttaaaact aatctttaaa ttgagttcta aacgaaaata ttgtcaataa cgtgaggagg
3301 gtggggaggg ggggttgctc gatgccgctt gcttggcttt attgtgcccg tttttgcgga
3361 aactcgtagc gcagcacgaa aaacgcaatt aatttgtgcg agaatgtgga aatggcggaa
3421 aagcatttgc ctttgtgacg caccgctcct ccttacattt tatttgtgca ggtgaatgga
3481 atggaaaaag gtctatactt ctatgactta agttgtgaga ttacattata tatatattat
3541 atggtgtact tgtatcgcat cgaatgcgct tttaaccttt gcggtttaga tttggctttg
3601 gccatttgct gttcttttgt tggtgggctg gctgtttggt ttgcgagtgg cgaggccagt
3661 aagccacatt gaaaccagtt gtgtggtgtt cgcaactacg ccgcccacca gcccctttac
3721 catcgttccc gccccccaat tggcaaggcc gattatcgac cgctgaccga aggctgcgaa
3781 gaaaaagcaa acaacgagcc gtttagaaag ttaccaactg tttaagttgc cttaattgat

3841 ctgttgcgtt tattcctcgt ttcattttgc tgtttcaggt taacggtcgt gcctggcctt 3901 ggctgcgttc tattccgctg tgccaacgca acgtttgaaa tttagccagc taaaaactgt 3961 gcatttaccc aactgtaagg atttgtattt ggtttttatt gcgcttaatt tgtttcgttg 4021 tggttatcac ctccgggttt aatttcactg tttcaggcta aatcacattt ggccttgact 4081 gtccgtgatc cattccgatt cgtttcgccc attccactta gcacattgcc tggcgtcaat 4141 taattgatca acagtaggaa cgttttgcat ccaaaaatag acgccaaata ttatcacttg 4201 ccttttgtct atttttaatg caaattgtgt gggcaggtaa tggggagaaa aggacgttta 4261 atcagcggcg cgtggcgtct aacgaaatta tcgcatgacc aactaaatcg gctgcccccc 4321 cccccctccc ctcccctcct caaccaccga ttacactgag atttttcgcc gctcgccaaa 4381 actttccagt ctgctgcaca acaatgtgaa acggaacacg aaacgttcct cctcttcttg 4441 attcacatat gtgtgcatat atagaccgac agacagacac aaacgtattt tctgcgtttc 4501 gttgcgaatc ttttggcgca attagcgtgc acaattagca catctctagt actcgtagcg 4561 ccgtcctttt ccttccagct cgcttgcgtg caaatttaat tagcggccca tgctcgacga 4621 gaaagagacg gcatgagatg ggaagtacta gatgaattat tggatggaaa tttaatggct 4681 tcgaggtgaa gtcttttatt ccacgactcc caattatcga ttaaaaagag aaaagaaaga 4741 cacacaaatt ttccactggg tcgagatttg caattagcac ttcgttattt atgcgccgaa 4801 ttcattttcc gccttgcaat ttacctacga caaatataaa catacagtat tgggactagc 4861 acttagcgtg aatccatcaa aaaatccgct tatcacttat cacgtgcttc cccaaacgcc 4921 cccgagtgtc gtctacgagt atttgcgatg tcgtttgtaa ctcgcagtaa atttcgacca 4981 gacttttgtg ggtctggtgg tccggtggca tctggtctga accgaatctg acgcaacttg 5041 acccaagttg cttggcatcg atcttccagt gcactagcac cttgggcgaa aagtgggtct 5101 tgctgtcgct gttggatttt gtctgtctgt ggggaaatct tatcgcacgc catatagagt 5161 tttgggctcg cctgccgtct ggtgtcatca agtcatcaag tgtctgcgcc acctaacaga 5221 tacgatgtgt aaaaaactac ttgagtggaa tcttggtaaa gtttatttac tatacccaga 5281 ataatattga tttggttttc ttgcttttct actctagtac tgcgttcatg caatgcacta 5341 tactcgcatt ttgtttcaat ttaagttcgc gcttttacga aatcatgaaa ccttactgca 5401 taaacgtaca tagcgcatga agtcggtgtg tcggcgaact ttgtaagggg ttatgtggct 5461 aacttctgtg ctttcgattt taaacaataa tgcggggaag cacaatgaaa atgaaacata 5521 acgagaaaca agcaacagaa acaaaaacaa aaacaacaca aaggcgtttc atcaagaaac 5581 aggggaagcg gcgagaaaga agcagaaact tcagcattcg cctgacgtaa atgctgccac 5641 gcattgcgtc attgtggggc tgaagtggga gcgagagggg ctagactgtg ggagtgagat 5701 ggttgcaact gcgtgagagg gagagcagct gtcgttcctt ttctcatgag agcatttgtc 5761 ggctttttca gctcattaat gattcgattt ttcttttata ttttgtggtt ttgtgtgtgt 5821 gtgccttttg tttttttttt gttttgctgt tggatgacac atggttcact tattcaacga 5881 ggggggcggg ggtgtgagcc aaaaaataaa aacaaaaaaa agtggagaga ttgtctataa 5941 ataaaccaac attttggtaa actttatgcg gtgcaagcgc ttttctctgt gataaactgt 6001 gcgtaatttt tgtgggaaaa atgggagaaa aatgacgaaa atctaaattt catggtgacc 6061 tgaatttttg agttcactat cctaacgatc gtgcttattg aattggctga tgcaattcat 6121 tgatggcttg caaatattat tattaagtta taagtttttt tttttttttt ttttttgtta 6181 ttgttttact tgactaatag ctataagggt ctttaaaagt ctgggttgaa ggcagttcgc 6241 tggcttactt ttgcatattt tttcttcttt gttttgtttc tcaggtttct tggctcgagt 6301 ttttgcattt gggggacaac ccaagtactt ggaactccac atttagtcag cgctcgttga 6361 atttgtcttg gtttttttcc ttcggtcgtc tttgaatggg taaaaatggg cagtgactaa 6421 gccaatgtca atttaagtgg ccacttgaag tagtagaatt gtttcgacaa aaatcactgt 6481 gttaagcaac ataacataat acattcaaac actctttcgc tttatgagtt gactcactgt 6541 gttttttgtt ttttatgcat attgtgtttc gttttatttt tacaatcgca gtcgtcgtga 6601 ctttcccata aattccaaaa aataaacaaa aatattttgt aatttcttat caatgtggaa 6661 aatgcttctt ctcgtcgctg ccttgtttat ttgtttacat taagcaacaa cccaacattg 6721 ttgttgtttt gtcatgtaat tatgataatc aagtgcagta aaaacgaaaa caaagaactt 6781 ggcaacaaga aggggttttt tttcgagggg gttttgggga ttggcatacg ttcgcatata 6841 catatgtatg gtgtgaggtt tttagagtgg attgccgctt atcatcgcaa gttaatctac 6901 tttcccagtt catatactct cgtacacgat catgtactcc tatatgttct atatgcatac 6961 ttaaaaacga cgggccatat agataggcga tacatgatcc gaggtgcttc aatataaata 7021 acccccagtc ccatttcact ctgtttcccg ttcccaatta acagggaaac gtacttagca 7081 caacacacac atgaccaaag ttcacttttg gctaagcgtt ttatattatt tttgattaat 7141 taggcaattt gcctgaagcg ttgttgttgc tactttttat ttgctgataa gattgggcca 7201 tgaattgcga catgcacgtg tggttttggc ataagtcgag ttcctcctgc ttccgcccgc 7261 ccaccagaac agaagcaaag ccactccaga atcggtgctt tggtttggcg acaacttgac 7321 attcaggcca gccacccacc atcccacctg gttggtcttt ggttttgttg gttcgtatcg 7381 tattgagtgc acgactaagg accgaaagtg aatttcaagt agaagggagt gggtttttaa 7441 gtggatgagt gacacgaata gcgttattat ttgattggtc atagtttcat ggagtggagt 7501 ggctctctga aactgtttgg tagtctagca acataatatg ggctcctcat ttttttagag 7561 cttatatcta ctatatagct tttctggaat ggaatcgaat actagaattg ttagctcttg 7621 tttgcatcag ggttctttgt gagtagtgct ctctttatat ttcctcgcct tgagttcaat 7681 ttattttttt ttttttgttt ccattgtttg gccgttatct cttgctgcaa ttattgattt 7741 tcagctgaaa ttactaatca attgataatg gattgctctt aatgtatata tttaaagtca 7801 tcgtttcgaa gtcagagcga aaaactttgt tctccaagtt tcctcgtttc cattttatca .

7861 ctttgttgtc atataagtgg gttaaacaat tatcatataa ttgccgttct aatgatacga 7921 gcaaacgcat acgccgatgc ctaatcaaca aatattgatt gttttcatta tacatgcatt 7981 ccgtttacat agaaggattt ctgaatatat ataaaatata tatatttaca tgtgttctag 8041 acgacgagaa tcaaggtatt attattacca caaattatcg cattttgaga agccagtttg 8101 gacgttgagc ttgttgtttc ggtttttacc agtttttggt tgttcttgct gccatttggg 8161 actataaagg tgtgggtggt ggtggtaatt ggtgattggg tggcggttgc caccagggtt 8221 gggcattcac taatcgcgtg ttggctttgg caccacctaa ccaccatcta cctcccctca 8281 catttctccc atcacctttc ctttgtgttg ttcgcttcct tttcctttag ttttccagag 8341 ctgctggctc gtgtttttgc cattggctct ctttaccatt ttgacggcat tgtgttggtg 8401 gctttgtgcc ctgtaacttt ccttttgtgt taaatgtccc attgtctttt gtttgtcgag 8461 ttccctggca agaggcaact acgcgttgca atatcaatta ttaaaggctt ttttctagaa 8521 gaatttgtac agatatggta tacgaaaaat gtaataaggt agagaatctt ctttgtttca 8581 atctatttct gcttttaacc agtttacgtt tgtttatcgt tatcgctgta atcggcattc 8641 gccaccattg ctatcgaagt catatgcctt taagatgtta atcgacgtcg gtgtgatgcg 8701 atgtgctgca gtctgtcatg ggttaattgt ctggtcagct ctaattattg attgccgact 8761 cccggtcatt agcagctggc gaaggaagtc aacgatagcg atagcgatag ccatgcacaa 8821 gctttttcgg tttcttcttt ttcgttatac atacatatgt atgtacatag tagaatatga 8881 tgtatgtatg tatgaatatg tttgatattt ttatttacca ccccctgcgt agggtgggct 8941 tttgtgtatc accacaccag ggcctcccac ttttggttgc ttcttctgtt gttgactggt 9001 tggtattctc aagtctttgg caaatatgta gatgcgcaaa atttcctgtg gcgctttact 9061 cttcaattcg tctcatttaa tgattcttca cctatttctt tcccaaatga aatgcaaatg 9121 tttagctgtt tgctattttc ttttgcacat ttcatctact gttctatttt atgctgttct 9181 atgcgctcgt tgttgttgta ctttcacttt cgcgcggttg atgaggtggc caacagatgg 9241 tgctggcgtg caagcaacag gcgttgatgg aactgtggaa tcggaacgca ttcgatgtat 9301 ttttgggtct cctttattcc ttttgctaat tatttgctgc ttatttatat ttcactttcg 9361 atttcgaatg ccgcgccgcg tctacagtat atttttagtt tcgattcagt tcgatagcct 9421 agacctcaaa aaggggcacg gtacgaatgg caaatggtaa aaaatatttt gctttttgac 9481 atttttcctg cattcggaat caatatattc cccagccggt ataaatagcc tgttaaagca 9541 tgtcttgagg caattaccgc tgttaataat accaaccacg gtaaatgtca gagcgcatta 9601 tcgacatgca agttcaagac caagtaaaca agaaccacag ccgcttggct cacaggttac 9661 cccccccccc ccccccctca caagaccacc ccaagacgac cagcaaaggc tttctctgtg 9721 gaaaccccac tacttaagcc cgacccaact catttggtat ttaaatggaa gttgactcaa 9781 aactggaacc caccggcacg tgggtaatat cgattgaaat ggaaaataaa cagtattgtc 9841 ctgttttgtg acacatttca aaattttccg tttcctgccg ctacagatgt gatactcaga 9901 agcagttgta atttgaggca ctgcacacga aagtatattt taatgcattt taatatcact 9961 ttcataaaaa ctaagctgca tgcgtatagc ggaaacttgt ttttcttact tacgtaactt 10021 aattcccagg tgttcattag agaaacaacc aatttgtaga tatttattac aatgaattat 10081 ttctagagtg aatgtcccga actagtacgg cctaataagt caccttctgc tcaagccaac 10141 aagagcccat tcgatggtgt atacagcaag taataacttg taggcattat atatttaagg 10201 attggggctc tgcgtctctt ttctaagctc caatgtcaag tcaataaaca ttaaaccaaa 10261 tgacgcagtg gcgaatggaa ttggttgctg ttagtagggg ttgcttgtag gcgaaagtgc 10321 aagaggtgaa tgggcttacc aatggaaagt tcgtgaacat ggaaccgatt gatataaatt 10381 atatgatatt taacaaagta agcattatta ttgtacaaaa agaaaagtat ttaaaagttg 10441 atgcccgagg tccgagtggc gacatgtaca ttgaacaccc tgtacgaact acatatcacc 10501 ctgtaggtag cttttcttca acccccaact agcagggttt cgtatttctg tttctttatt 10561 attattttcc cgtttcttgt tgtttttgtg ccatggcatg acctaaccaa ctaagccaac 10621 aaactgaccc aagcgcaccc ttcacagcta cccccgtctc gtcccgttgt tggacacttg 10681 tcatcgtctg caaaaccgct ttcaacctga aaccaacaac ccaccgtacg ccgtttaggg 10741 gaatttgtac ctacaggcgg agactcgtgc aataattgtg ttgcctcgtt gatggggttt 10801 gtctttggtc tattgcgcac tattggtgtt gaccattgtt ggtttttttt tgttaatgtt 10861 taccttctac acgtttgccg tagttccttg tgtagtagtt ttcttggata actactcaat 10921 cttatttaaa taaataagac ttgcggacta tccccttatt cgcagtaatg cggcgatgat 10981 gttccttaac catttaggat cgatatctat agatagaatc tagatggtac gataccaatt 11041 ggtatatagg aaggatatat gtccttctat aatttccgtt tttgatattg ttagttgcag 11101 tgctattttt tttttctttt gcacatcttt ccagatgccg ttaagctgtt gcttaattta 11161 taattcattc ctgtttgatt taacacctcc gcactttagg tctcttgata aatatgtatg 11221 tgcagttgta aattaatggg gccggcaaca aaaagatttc ctacccgccc cacccatagg 11281 cggcatgcat aagtaaataa aaaaagcctg ttgttttatt gttgtttctt tttttggctt 11341 ttagtgtagt atttttaatg ctgataaatg gcagtttaaa gtagtgataa gattaaggtt 11401 tttacggcaa gaggaataaa ggaagaagaa gataccagga gcttatcaca aatctgtttg 11461 ttccagaatt gcatctgtgt tccattccat tgtgttccgg tctctgcagg tgtttcgctg 11521 ccaaagcctt tacacacatt tgaccgagga aagtttgctc gccatctcgc tctgaatccg 11581 tgcccctctc attgttctct gctctccctt tctctttctg tcactgcgtt gtgttccatt 11641 accacacgtt cctccgtcca tcacatatac atatgaacat atgtgtggtt ggcgtgcgtc 11701 agcgataata atttggaatt tgagttgtcg ttttcgctct gtttcgctct ttctctctgc 11761 gtttctgttt ctcggtttcg tttgagcttc tcgaGTTCTC AGTTCATTCG CGACCTTAAA 11821 GGCGGCCGCA CATGTTGCAC GCTGAGAAAA ACGTACACCA GACCAGACCA GAACACAAAT .

11881 AAATAACCCA AATAGACAGT AAAATATTGA AAATCACAAA GATCTCCGCA TTTCTGTTAT 11941 TTTTATTTTT TTTTCGTTTT TGTTTCGTGA GAGTGTGTTT AAATTCGAAT GCTTTTTGTT 12001 GTTTGGCTTT TCTCTATGGT TTTTACGGTC TTAACAAACC GCAGTGCTGG TCTAAATTTA 12061 GCCAGAAAGT CAAAATAGAA CAAATTGGTG TTTGAAAATG CAGCAAAAAC AGCAACAATT 12121 CGTTTAACAA ATCGAAAACA ACCACTAATT TGTTTACTTG ATTTGAATAA TATTAGGCAA 12181 TGTGACTGTG AAGCGCCAAT ACTAAACAAA ATAAAAAACA AAAGTAATCG AATCGAAACT 12241 AAACTAAAAT CAAAAGAAGT GATTTAAAAT ATACCCAAAA CAGAAAAACT GTGCCGCTTT 12301 AGACGCTTTA TCAATTTCAA AGAACCGAAA AGGAAATACT CTAACGCCTA GAGTATTTAA 12361 CAGACCATTA AAAACCTGAT GGCAACAACA ACAACTACGC AGGCAGCAGG AGCTGCACCA 12421 GCTCTCAATT TATTGCCCGC CAGCAATAAC AATATAAATA ATACACTGAT CAACAACAAC 12481 AATAATAATA ATAATACTAG TAACAGTAAT AATAATAATA ACAACGTTAT AAGCCAGCCG 12541 ATTAAAATAC CGCTAACCGA GCGCTTCTCA TCGCAAACAT CGACGGGCTC GGCGGATAGC 12601 GGTGTAATTG TTTCCAGTGC ATCGCAGCAG CAACTGCAGT TGCCACCACC ACGCAGTAGC 12661 AGTGGATCGC TGAGTCTGCC ACAAGCGCCA CCTGGCGGCA AGTGGCGGCA GAAGCAGCAG 12721 CGCCAACAGT TGCTGCTCAG CCAGGACAGC GGCATCGAAA ATGGTGTCAC CACTCGTCCA 12781 TCGAAAGCCA AGGACAACCA GGGTGCGGGA AAAGCCAGTC ACAATGCCAC AAGCTCGAAG 12841 GAGAGCGGCG CGCAGTCGAA CAGCAGCAGC GAGAGCCTGG GCAGCAATTG CTCCGAGGCC 12901 CAGGAGCAGC AGAGAGTAAG AGCCTCCTCC GCTCTGGAGC TCAGCAGCGT GGACACTCCC 12961 GTGATCGTCG GCGGTGTGGT CAGTGGAGGC AACAGCATCT TGCGCAGCCG CATTAAGTAC 13021 AAGAGTACGA ACAGCACCGG AACCCAGGGA TTCGATGTGG AGGATCGCAT CGATGAGGTG 13081 GATATCTGTG ATGATGATGA TGTCGACTGC GATGATCGCG GATCGGAGAT CGAGGAGGAG 13141 GAGGAGGAGG AGGAGGACGA CGGCGTCAAT GTGGACGACG ATGTCGAGGA GGCCGACAAC 13201 CAGTCGGACA ATCAGTCGGG TATTATAATA AACCTCAAGA GCCAAACCGA ACAAGAGGAG 13261 GAGGTCGATG AGGTGGATGC CAAGCCGAAG AACCGACTTT TGCCACCGGA TCAGGCGGAA 13321 CTCACAGTGG CGGCGGCCAT GGCACGTCGA CGCGATGCCA AGAGCCTGGC CACCGACGGT 13381 CACATATATT TCCCACTGCT CAAGATCAGC GAGGATCCGC ACATTGATTC GAAGCTGATC 13441 AATCGCAAGG ATGGCCTCCA GGACACCATG TATTATTTGG ACGAATTCGG CAGTCCAAAG 13501 TTGCGAGAGA AGTTCGCCCG CAAGCAGAAG CAGCTGCTCG CCAAGCAGCA GAAGCAGTTG 13561 ATGAAACGTG AAAGGAGGAG CGAGGAGCAG CGCAAGAAGC GAAACACCAC CGTGGCATCC 13621 AACTTGGCGG CCAGCGGAGC GGTGGTGGAC GACACCAAAG ATGATTACAA ACAACAACCA 13681 CACTGTGATA CTAGCTCTAG GAGCAAAAAT AACTCGGTAC CCAATCCACC CAGCAGCCAT 13741 CTCCATCAGA ACCACAATCA TCTCGTTGTG GATGTGCAAG AGGATGTGGA TGATGTGAAT 13801 GTGGTTGCCA CCAGCGACGT GGACAGTGGT GTCGTCAAGA TGCGCCGCCA TAGCCACGAT 13861 AACCACTACG ACCGAATTCC CCGGAGCAAT GCTGCCACCA TTACCACCCG CCCTCAAATC 13921 GACCAACAGT CGTCGCACCA CCAGAACACC GAGGATGTGG AGCAAGGAGC TGAGCCCCAA 13981 ATCGATGGCG AAGCGGATCT GGATGCGGAT GCGGATGCGG ACAGCGATGG GAGTGGCGAG 14041 AACGTTAAGA CTGCCAAATT GGCCAGAACA CAGTCCTGCG TCAGTTGGAC CAAAGTGGTG 14101 CAAAAGTTCA AGAATATATT AGgtaaaatc tatgccctaa gcttaatctg tgacttaaac 14161 ataagcggaa gttatgtata ggatagtaat tagtacggac tatatagtat atagacttca 14221 aagccgcaga ctttgcccca tcatatagat tttccacaat ggccagtgca ctgggctttt 14281 tcagcaaata gcagaatgaa tatctcttat caattgtttt taaactgtgc ttttctaata 14341 tctttcattt cctttcttct atttgtaggt aagtgtcaat tgaatggaaa cacaaattcc 14401 tcataactcc cgagggttat cgtatcgggt gagattgttc tatgtacaca cacacacaca 14461 catatatata tatatatata tatatatata tatacacgca tagataagta ttctttatgg 14521 gtgcggaaaa ttttcggaga tttaatgtaa agtgataaag gtctggctgc tctctgcttc 14581 ttcttctatt cttagtctta gtgaattagt cagcgaatga aatgttatgc ataatgcaaa 14641 atggttttat gaaacgtact ttgttgcata ttttctatct tgttgaagtt gggtcaataa 14701 cccagtagca acaagtgaat ttttccgata gctttgtctt cagccttctc cgcgctgaga 14761 taaacgtggg tgggatatat atgcctggat aatatatatg atcttattct cccttttatt 14821 gtcatatgta aaatgtaaaa tttgatcttg gctttattgt ggagcgcagt ctcgacggct 14881 ggaacacgat aaggcaaagc ggaatattct cgctcgtctc tacaacaggc catttaactg 14941 ttgttgcatt tgtcgctgat aagaaaatcc gacaatggtt agaggcaggg aggggggggg 15001 gggtagcagc ggcgcggcag gagcagggca acagcactga cttacgccag ctgatgttga 15061 tttcttcgcc aactcatcta catagtacac tttatgactg gcaaactttg gattaagtgt 15121 gtgtataggt tataattata atacgctgct gttctcaacg atagtagaaa taagtgcgca 15181 caaaatgacc gacgagtttt tggactggct tatggataat ataacatttc cgacgagcca 15241 ccgcatccat cgggatccag ttacccagct gcccaccagt gtcatacgat ggggtagtac 15301 ttgcagtcga ttgtttttaa agattccact aggtgtccgt tttagatata cgataaactg 15361 gtggctgtca gttgcatcag ttcggaatct tggccccgtg agcttagcga acgaagcgtt 15421 aacgccgcgg tgcgaaaagt ggcaagtggc tagtggctag tggcacattc cccttgctgc 15481 tgactgctac gattacagcg tatggaaggc actgtgtcac agcgtgcttg tcacctcggt 15541 atcatcatca tatcatcatc agcccaaagc agttcccatc cggcgagtcg tagtggtgat 15601 ttgcagcttt cgcagatgct gactgatagc aggaaggttg gcgaactata tatatatata 15661 tatgttgtag aacttgccta gcttactttt ggtacttcac ttcgcccaac taccgcccac 15721 ccccgatttc tcgctgtgta gaggttaagt caaagctacg acagcctgca tgatcgaaga 15781 tcatcggcag taaagagatc aagaggcacg cggctatatt gcttacagcg ctgattatat 15841 atagccatag gtatatatat atatatgtat atatattccc gtcgaggggg tggcaggaaa .

15901 gaggatggga agatggaagg aattggggat tcgggagcag agattacgcc gtgttagctg 15961 ctgttgtttc tttttttttt aacgctcgac aatcacgcgc tcgtttcata aaacggatga 16021 cagagcaaaa aacttttcca aaaatatcaa agcccaacaa catgcatata ttgaacagct 16081 tgtgaaatgt atttttacgt atatgtatat atgtattata tatgtatgta cattcgtata 16141 ttcactaaat ctgtgaccaa acggaggcgt gtctctgtcg tctttgtttt tgttttttgt 16201 tttttatgcc gccctctcac ctcatttcct ttctaccctg ccctgcctca cgttcgattt 16261 ggaatttgaa atggaaaaat atgtttttgt gcttttcact gggagtagta gtagtaggaa 16321 tatttgcaat tcggattatt atttgttgga aaagagcagg gaatccatat aaaacctatt 16381 gatgctgctg accattgaag agtaagagta aatgtgggct tgtattgatg gatattgatg 16441 ctccatattg atctgctaag aacaatatag caatttgtgt atttatattt gcacactgat 16501 agcgatgcaa ccaagtgatt agtgattagt gttgctattt tcgtcactta cagcgatgga 16561 aatgtttttt atttagcatt tgcccagctg tttttccatt gccgttacgc gtctgctttt 16621 atattaattt gctttatttt attttatgtt atcatgtcaa ctcgcattca atactcgtgt 16681 ttgatcagct gtgaaaagag cattttcaat ggattctagt tactagatgc cagatatcag 16741 atacaagtag ttctctttaa aatgcacctc acaaactatt gatttctctt tggcgaaatg 16801 gcttcctctt caccgatttg tttgtgtttt gtatatttgc tgttgctgtt ttctgggcac 16861 tttgaggtct tctaacgatt ggggatgatg gggaggcatt tgggctttac gtagatacaa 16921 aagagaaaaa aatgttgccc gattttattg gtggcattca aaagtatgct atggaaagtt 16981 ccaccgaagc tgcgatagaa gagacgaccg accaccgaca cccgttgtgg tgatgtgata 17041 acgatgcagg tgacaatggc tggcggattc atttctataa ggttgccaag tgctgcagat 17101 tgtggggcat ctcgtgatct ccgcccgcgt gggtcgtcat ctggccatcc cttttccacc 17161 gcctcgctta accgatctct gccctctctt ttgcccaaaa accgtgatcg acagatagat 17221 ttcttagcca ataaaataga ataaaatcaa ttcgtttcaa ttaaatgaag tctatatatg 17281 ctaacactag tctatactgc atatgcatta gcattagtag caaacagaaa aatggcactg 17341 atagagtata ttttttaatt gtcagcacaa cggcaaggtg aagattgcga atcaaatata 17401 taaaaaaata ctctaaaata aaaaaaacaa ggtgcacact ggggctatac atactcgtac 17461 atacatttac caaagaccct agaataacaa gatgcgtaac ggccatacat tggtttggca 17521 ctatgcagcc acttttttgg tgacggccaa aattactctc tttcggctca ctcccgctga 17581 gagcgtaaga aatctaaaaa tataatttgc ttgcttgtgt gagtaaaaac aagagacaag 17641 aacgcgtata agtgtgcgtg ttgtgctaga agacgatttt cgggccgaaa tcaattctga 17701 tcgaagaaac gaatttacat ggtacatatt agggtagttt ttgccaattt cctagcaata 17761 tgataaatta aaaaaaaatt attataattt taaagctttt taaatttgtt tgttaaaatt 17821 gttgctcgaa ttagctaccg tttacacatt tatatttatg tttaattcta atttgtctct 17881 catctgacaa ttttttaaaa gctaaatatt ttttttgaaa cacttttaat gttaatgtta 17941 catcatatta agtcaaatga tttaataaat atactaaata attaaatatg ataactgttt 18001 attgcaaaag taatatcaaa gacactagaa ttattctagt ttctttgctt tggtcatatt 18061 ttgaggcacg aagtgcggac acaagcactc aacaatcatt accttattaa ttattcacac 18121 gccgcaagat gaatactcta atgacaaata ttctaatata aagccatttt tgaaatttat 18181 ttttgtgata atatgtacat agatttggct atttctaacc tattttcaaa taataataac 18241 gttaaggcat gcaaaacaag aatttttcgc atggtgccaa ttgatcaaaa ataatataga 18301 tttaaagtct aagaacttct gaggtgaagg gcatattttg tcaaatttac aatgcatgag 18361 catacgtgtg cacacataca gttgtctgct atcacacttt gtgcgttgaa aagagctgtt 18421 cgctgtagcg ctcttcgctc tctcgctctc taacaaaaat tcgagagagc ctggagccac 18481 ctctagagcc acggccaaaa aattgtgtgc caaaaaatcg tatggcgtta cgcatcttgt 18541 tattctagtg tctttgcatt tacccttcag acgttccagt cttggctaat cttaagtgaa 18601 atccaaggga tacatctaca tctacatcct tgaaataaaa ctagtttgct attgggtaag 18661 ggttttcatt caatttcatt caacttggtt ggggttctga cggatagggc atttattttg 18721 ggcgtggttc aactgaaacc gaagttcgtg cggctaaact gcggcgatga ctccacttcc 18781 acgtccatgc cacacagata ttgaatggtg ggggtgcaaa atgccagttt tcgggttaga 18841 tcccgattat tgtttagtga acgcgctttc cattttcctt tattgctctc catccatcca 18901 tccgtctgtg ttagcttcct ccacctttcg ttccgttccg ttccgaatgg ggttttcgcc 18961 ccctttaatg tggtgatact gcgtcatggc attttgcatg ccaccgcact gcgccgccca 19021 caaacgccca cgcccccttt ttgcggctga gtggtgccaa atgcttgtat atcgattagt 19081 ttccgtcgac ggtggcggca acacaacaaa ttgctttgcc gctcggacag ctccaactgt 19141 tccatcagca gcagcggccc atgtttcacc ttttcgatat agtttctcaa tgtttggtca 19201 ggggggattg tggggggggg gtgaggagtg tgtttaaccc attgataatt tgattgattt 19261 ggtcgcagtc ctgtagaaac tcagttgatt aatgtgagaa tggcagcgga ggcaacaaaa 19321 cataaccgat ttacaattga tgaatcgatc aaatcgataa atgcacatcg atatatgtat 19381 gtatattgat cctttgcgat tctttcgaaa gtgcgaaggt cacattttcg tttaggcaat 19441 aattttaata tcgattgaca aagtatatgg gccaatcaag taaattggtt tattagctag 19501 cacgcaaata ttttattatc aattttatta tcattttttc aactttgagt acttttgcta 19561 acattacgca tatggatctc ttatctcttc gctattgcag atgcaataat gagaaacaat 19621 gtcaatgcta ttgggccttt aattttcatc gtcgcatttg tgattctaat tgattcatta 19681 aaataaaata aatttgattt gcacttgcac ttgcatgtgc tgcgtggttg tatcctttga 19741 ttggtcttct ccgtcgtatt ctcttcgaat tcttctggcg aatcttcgtt ctccgctctg 19801 cctcgttctt ctccgccgcc gcccgccgca ttgaactgtc agttgtctgt cgtcaaaaaa 19861 aaaaaaaaaa aaggcagggg ccataagggg gcgtggctgt cgcatgcttg accacgcttg .

19921 gctgccactg tctttcttct gtcgttctgt cttctgtctt ctgcgtctcc gaaccagcgt 19981 gtcttctctg ctttggcctt atcgcaattt tccctcaatt aaaattttcc ccaaaatcgc 20041 aacaaattgt acgcgattat tatgggtatg gtctaagatt agtaggcttt tatattaatt 20101 tatttttttt tttcaagcgc gttttaaacg atcgaagaat tggtgaggat cgcattcgtg 20161 tgtgggtcgc gtgtggaacc catcgatcgc ccttcgtgtg gcattcgagt gcgatctctg 20221 ggtctctcag ctgtcgttcg tcgttcgtgt tgtcctctgg cttcttccat ttttttgtcg 20281 ccaattgtgc aacggtagct cgagcgatcg gatcgttgga tcgaaagcgg aacgaaaagc 20341 ggaacgcatc ggatttgatc tgatcagagg ggaggagcac tcctctttct cttttttcct 20401 cgcttcttct tcttcttgct gcttcttcct aaaaaaaaat aaaaataaaa aaaaatacaa 20461 catccgaata atcgaagatc gtactatttc ggttcgcaat tcttcttttt ttttttactg 20521 tgatattcca ctttgttgtt gttttcggta gtgccgcgtg tttgcatttt tgctatttga 20581 agatcgatcg atcgattgat ctgcacacaa agcgaacgaa acgtcaagtg accgaaaata 20641 aaaactaggc caaagcctgc caccgtacac agagagaaaa agcatttaaa gtttgtatgc 20701 agtaaaccaa accttaacaa aaaaaaaaaa aaaatttttt tttttggcat tcgagactgg 20761 aaaattggac gttagagatt atccattgta aaacacaagc acttgcatat gtgcatatac 20821 attttgtatc tgtgaaatAA TGCTCTTAAG AGTGAGCTTG CGTAATAAgt aggatttaaa 20881 gcaattgctt acgcatgtgt ggtatacact ttgtatattc tccaagatat gttgcttaaa 20941 ttctataatt tataatttgc ctgccttcgg aattttgtgg cattaagttc ttttgtatta 21001 atgatttcca aatatatttc atcccataat gagtttcgaa tataggtctc aactaattta 21061 gttgccgttc tgcgcaagct ttgatttcaa gcattgatac atatatattt tttttttttt 21121 tgccgtgtaa ggcaggcatc actagagcgc tatacccact ctaccgccac tcaaacggtg 21181 gttcagtgct ttagTGATTC AAGAAACCAG TGGCGGCCAG TACTCCACGC TCAAGGTGGA 21241 TAAGTCACAG GTGGTGCCCG TGGCGGTGCC GCGCGGTGTC CGCAAGGTGG TGCGAGTGGT 21301 TCGCAAAAAG AAGCTGGCAC CCGGCAGTGG GTCAGTGAAC GAGGCCAGTG AATCGGATGG 21361 TGCTGGCTCT GGCACCACCA CCTCCGGCAG GCAGAACTCC ATAGACGCCA GCAAACCGCC 21421 CGCCAAGGTC ATCAAGAATA AAAGGGGCTC CCTGGGCGGT GGTGGTGCAG CGCCACCCAT 21481 ACCGTTGGTC ACTAAAAAAA AGAACAAACG TCGATCGTCC AGCGAGGAGG AGGCGAGCAC 21541 CGGCAACGAA AGCCCCATCG AATCCGAGCC GGAATCCGGT TCGGGCAGCA GCTCCTCCAG 21601 CGCAGGCAGC GAATCTGATA CGGATAGCGA AACCACCAGC AGTAGTTCCT CCACCACCGA 21661 GTCCTCCAAT GCGGATCGCA AGTCGAAGGC CAATGGCAAG CACAAACCCC TTTCCGCCGC 21721 GGCCAAGGCG GCCCAGTCCG CCTGCTCATC GTCGTTGGTG GTGGCCGCCG CTCTGAAGAA 21781 AAGCAAACCG CCGAACCGGA GCGGCAGTGG GGCCAGCATC GCCAGTGCCG GCGGAGGAGG 21841 AAAGCAGTAC AAGGACCCAA gtacagagat aggacctaga agaacatccc cgaatatccc 21901 agatcccaaa atgtgatcca cgcacagaga taagagttgt gttaaaaaaa aaaaaacccg 21961 ccagcaagca gcagttgaat cctttatcga tgtagatata aagatataaa gatagaacga 22021 tagaatttgt tatggaaaag cgaaaagtac gagagatctg aagatttgtg accaccaaaa 22081 tcgacagtga tgcgagaggc agtagcagct gccccatcat catcatcatc acatcgtcgt 22141 catcatgaac acatgaacat ttacatcggc atctttggcg ccaatgccaa aattaattag 22201 caacaaaaaa aaaatgcaaa gggctcggtt gcatattgca gagaaaaatc agacacttag 22261 tttagttgct gatcttatgc taaatgtatt caatttaacc taatcggtat gttcgcaagc 22321 gatagttaga gatgcaggtg tgacataact ttaatctctt caattctcaa acgattttat 22381 tttgcacctc gtcgcgcggt gtacaacact atccacatct ttttatctct gtttctcgtg 22441 cttctaacct cctaacttat ttgcctgtcc ctccaatccg aactcttctg tctgtatctc 22501 caactcgtgt ttgtttagtt caacccgtga cttcttctat cttcctatcc ctcgttccgc 22561 tgttttcgtt tcgagattat ccctgagcac agggactgag tgactgactg agatcgttat 22621 cctgacaata tcttgtgtga ttcgtgtctg atttgtgtgc agGCATGAAA CAACTGATCG 22681 GCAAGCTGAA CGACCTGTGG CCCGAGCACA GTGTTGCACT GTCCATTCCA AAGGAGGTCG 22741 ATAGGAGCAA GGAGAAGCTG GAGGGCGCCT GGGAGACGAC AGgtgtataa atcatctaaa 22801 tttatcttta gatatatata tgtgtgtgtg tggtgtgcct aggataagcg gggtgcctac 22861 catgagggaa ccaactacca aataccaaat acttgaacta acaaattatc atttcactct 22921 tgtgccccaa aaaccaaaca gGTCGCGATG GTTCTAAAAT CACAACAGTT GTTGCAACAC 22981 CCGGCCAAGG CACCGATCGC GTACAAGAGG TCTCCTATAC AGACACAAAG GTCATCGGCA 23041 ATGGCAGCTT CGGCGTCGTG TTCCAGGCAA AGCTCTGCGA TACCGGCGAA CTGGTGGCAA 23101 TCAAAAAAGT TTTACAAGAC AGACGATTTA AGgtgggtgc atcaattgaa tctggcgcta 23161 aaagtattaa ttctaactaa caatgacact ttcccttcac agAATCGCGA ATTGCAAATA 23221 ATGCGCAAAT TGGAGCATTG TAATATTGTG AAGCTTTTGT ACTTTTTCTA TTCGAGTGGT 23281 GAAAAGgtaa gaaaaggggg tcaagtagct aatcaagtag agaacccttt ttttttgggg 23341 ggggaggtgg actgctgaag ttgccagcat tggtaattac ttagtacgtt ttcagttgaa 23401 aatccatcaa ccatcagcga atggaattcg taagaaagtg tgtatataaa tgatgagttt 23461 tgagccaatt ttgttgatgg ttaaaaccgt actaagtatt tatttttagc taaactgagc 23521 tttcgcttag cattttatgt gttattgagt ttatgagtta agcgattacc gaactgaaac 23581 gaatctcttt cacttgcata tcgaaactga aactgaaacc gaatccaaaa ccgaatccga 23641 aatcgaaatc ggtcactagc tggccgagtt ccccatcgat ttgggcatat tcgcgagtgt 23701 gctgaagccc caggtatccc agttctcaaa gttatcagtt cttcagctca gatcattcag 23761 cgacttgctc acactcgatc tagatctttc tctatctctc ataaacgaat ccgaaaaaaa 23821 aaccctgcta tctattagag aaatgtacaa tgaaacgtac agaagatcac atcgcatgtg 23881 gtacccgtac aatctaatct cccgtgaatc aatgatatcc tcctgctggt atgctagtta .

23941 tgctagactc agtttaaatg tcgccgactg tacaaacccg atcaaataaa acgcatgttg 24001 atagctcgca actgcatcgg tcagcttcct ccaaaatcca gactcacata cgcatccaca 24061 catggatcat ttacgctcaa aaaaaaaaaa aaaactctga tatcgtaacc taatcgattc 24121 ttttttgtgc gcccctccct ccattgcagC GTGATGAAGT ATTTTTGAAT TTAGTCCTCG 24181 AATATATACC AGAAACCGTA TACAAAGTGG CTCGCCAATA TGCCAAAACC AAGCAAACGA 24241 TACCAATCAA CTTTATTCGG gtgagtactg atctgctatc catctttgtg tagtcgacac 24301 taacttgcat cttcctgttg tttcgcccga ttatagCTCT ACATGTATCA ACTGTTCAGA 24361 AGTTTGGCCT ACATCCACTC GCTGGGCATT TGCCATCGTG ATATCAAGCC GCAGAATCTT 24421 CTGCTCGATC CGGAGACGGC TGTGCTGAAG CTCTGTGACT TTGGCAGCGC CAAACAGCTG 24481 CTGCACGGCG AGCCGAATGT ATCGTATATC TGCTCCCGGT ATTACCGCGC CCCCGAGCTC 24541 ATCTTTGGCG CCATCAATTA TACAACAAAG ATCGgtgagt atttcacaca taccatcaca 24601 ttattgcatt caaattccat gtaaattcta atatcctaaa ttgacgctac atttcagATG 24661 TCTGGAGTGC CGGTTGCGTT TTGGCCGAAC TGCTGCTGGG CCAGCCCATC TTCCCTGGCG 24721 ATTCCGGTGT GGATCAGCTC GTCGAGGTCA TCAAGGTCCT GGGCACACCG ACAAGAGAAC 24781 AGATACGCGA AATGAATCCA AACTACACGG AATTCAAGTT CCCTCAGATT AAGAGTCATC 24841 CATGGCAGAA Agtaagtggc tagttgccgc caaaatgtga attggatcga ttatgatttt 24901 cgatattccc cttgcgcttt tcacgcgctc atctgacccc cgcccctcgg cattcttctc 24961 gggatgggga ggaaacttca tttttccccc ggtacaagca ccccccccca ccccccccaa 25021 aaaaaaagca ctcattgtta atttaccaca agtgtttcta accgaaaatt gtgtgtacat 25081 acaaactcgt tgcagTCACT ACTCGAACGC ACCCAATTTC CAAACGCCCT AAACCAGAAA 25141 CAACGATTGC GAgtaagcca agaaacgaat tgaaaacatc aaacaaaaat ttaaaaaaaa 25201 gcgttaagca aaatgaccaa attatataag cacaatgcca atcgcagcca catttgctca 25261 acaaatcatg caggggggtt gaatgcattt gaaatctatg tcgtatcatg actactcaag 25321 tgattgtggg gccatccaga gccagatcca gattcccatt cttgcttgat cttgatcttt 25381 caactgataa gcctggttta ttcgaaaagc tttctgggtt cactcactca gttatgtatt 25441 aagtgtgcga gcctacaaat gcagctgcct caatcaaaca atcaattcgc atttcatgct 25501 caaccacata cataggaatg ggtataagta tccagatatc cagattccga gatgcaattc 25561 catggcttaa gattcatgct tagcttttcg ccagccctcc cagctgtaca aagaagttat 25621 aaattggttt aatttaattg atttattgga tttgatttga tttcgagttg aatttaattg 25681 attgttttgg tttcttgccc tgcctactgc cgacgtttgt tttgatccgt aaaaaaaaaa 25741 aaatgagtgt tcgatcaaat ctgtaaatac gaacgcttcc tctgtgttct cgtttgtgtc 25801 ctgtccatat ctacgatgat tgcctcactg cccgccccgc cccccttttg cccaccagcc 25861 caattctgtc ccgcaagaaa gtcagtgcaa agatgtcaga tggttaaaac gccacttaac 25921 cgaatgctcc tttcacacac agGTTTTCCG TATACGCACT CCTACAGAAG CTATCAACTT 25981 GGTGTCCCTG CTGCTCGAGT ATACGCCCAG TGCCAGGATC ACACCGCTCA AGGCCTGCGC 26041 ACATCCGTTC TTCGATGAGC TACGCATGGA GGGTAATCAC ACCTTGCCCA ACGGTCGCGA 26101 TATGCCGCCG CTGTTCAACT TCACAGAGCA TGgtgagtga gatcagatcg atcagccagg 26161 tggcagaatt tgttgcaaca ctaatgtcgc cttcaatccg cagAGCTCTC AATACAGCCC 26221 AGCCTAGTGC CGCAGTTGTT GCCCAAGCAT CTGCAGAACG CATCCGGACC TGGCGGCAAT 26281 CGACCCTCGG CCGGCGGAGC AGCCTCCATT GCGGCCAGCG GCTCCACCAG CGTCTCGTCA 26341 ACGGGCAGTG GTGCCTCGGT GGAAGGATCC GCCCAGCCAC AGTCGCAGGG TACAGCAGCA 26401 GCTGCGGGAT CCGGATCGGG CGGAGCAACA GCAGGAACCG GCGGAGCGAG TGCCGGTGGA 26461 CCCGGATCTG GTAACAACAG TAGCAGCGGC GGAGCATCGG GAGCGCCGTC CGCTGTGGCT 26521 GCCGGAGGAG CCAATGCCGC CGTCGCTGGC GGTGCTGGTG GTGGTGGCGG AGCCGGTGCG 26581 GCGACCGCAG CTGCAACAGC AACTGGCGCT ATAGGCGCGA CTAATGCCGG CGGCGCCAAT 26641 GTAACAGgtg agtaagcggt tggccatgca gctccatccc cgccttccgt cgcctgctcc 26701 ttttcacctc ctctcctctt tttcccagtc attgtatcat cagtattgta tcgtatcgta 26761 tcgtatcttc agtttctaac ttgtgtgcca tgtgacaaga ggggcactag ttttccaaat 26821 ctagccttcc atatttgcaa taaaaagtag catttaatca cgatacgata ctttgatctt 26881 ggcaattcgc ttgccacaca caatcccata ctttatccat tgcccgatcc ttagCTGGTG 26941 TCCATCTCAT GATGCGGCAA CATCGCAAGT TGCCGTTGTC GGGGAAGCCC TTCGTCCGCT 27001 ATACGGCCAA CATTTGATTG CGATGCTATC CGAGCATCGT TCCTCTATCC ATCTATTATA 27061 AGCAAACGCC TGAAATCCAA GCAAAACCCA GGAAAGATAT ACGAATAGAT CTAGTAAGGA 27121 AGCCGTAGAA GAAGCATGCT ATTTGGGGGC AAAGAGTGGA AATCATGAAT CAAGAATTGA 27181 GACAAGAATT GCGAATCAAG ACATAACATT TTGCAGATTC AAGAAGAAGA TGTTCTCAAA 27241 AACAAATTAT TGCTGTAGTT ATCTTTTCAG TTGAATATAA ATCAGTTATT TCCAAGTCAT 27301 TTaaaaagtt tcagtgaggt tttgtaaatg gttcctaaag tcgagatttt caatagagtt 27361 cttcggagtc tttccaaggt ttgccctaca atctatgcac tgaaaatcct tgtctgcaat 27421 tgtccttttc acgatatttc gaatcaatcg ggatatattg atcgtccatg gctatggaac 27481 acttgataga ttgaattggt tttcattcga tttcatttga tttcattaat tcatgtttac 27541 tttggtattt tggtgtgtca ttagacggct aaagatttat gattttaatt tgtttttttt 27601 ttgtttcctt ttgttttatt tacagATTCA TAGGGGAAAT AGTAACATAC ATACACACAC 27661 TAAATATATA TCCAAGCATA TATATATAGT AATCATTATA TATAACACCT ACACCCACAA 27721 CAACAACAAC AGCAATTATA TATAATAACC ATAAACAAGA ATGGAGAAAG CCAATCCAGC 27781 AATCACAGCA AACTATATAC ACAACAACAA CAATTAAATT AATTAATGCA ATTGATGAAA 27841 GAACAGCAGC AGCAGCAGCA GCAGCAGCAG CAGCAGCATC AACCGCAATT TCAAAAGAAC 27901 TCTAGAAACA GCAAAGGCAT AAAATATAAC AAAAGAAATA TTTTACTTAG GTAAAACATT .

27961 AAATTTATTT TAAATCTAAA ATAAACTAAT AAGCATTAAA TAATACATGA TAATGGTAAA 28021 TAAACACACA ATAATTATAA TAGTAGAGCG AGCGCTGATC GATTGTCATT TTATTGCTGC 28081 CGCGCGTGGC GATATATATA TATATATATA TATATATCTT TTAATTAATA TTTTAAGTGA 28141 TCCTCTCCGC AACTCTCTTC GTTAATTAAT GTATCCCTCC TATTTTTTTT GACGCCTTGA 28201 AAAAGAAATG AACCAATGTA TATGTATATT TAAAAGAGTC ACTGCATATT TTTTTTACAA 28261 CACCACCTTG ATTTAGTACG TTTAACTTAT GATAACTGAT GGTAATAGAA TGGCGGACGA 28321 GTTTTGTGTG GTTAGAGCGA GCTAAGATCT AACTAACTAA GAGTTTGCGT AGGTTAAACA 28381 AGGCATGGTC TTGCAACAAC GTGCAGCATG CAGCATGCAA CACACTCACA CACACACACA 28441 CACCACCCTA AGAAAGCAAG AGGAAGGCAG AAGAGGACAG AAGAGGAGGC GAAGATGAAG 28501 TAAAGTGGAA CAGATTGAGA AAGAGAAGGA GAATGAGAAG GAGGAACAGA AACAAAGCAA 28561 AGCCCCGAGC ATAATGTTAA TGTTATGTTA AAACCTAAAT TTAATGCAAA TTATTAACGC 28621 AGAAAAGAAC GAAAGAGAAA AGGAAAAAAC AAAAAAAAAA AAAGCAAAAA AACAAAGCAA 28681 AAGCGAAAGC GAAACTGTTT AAACTATACC AATATAATAT ATAATCATTA TGAATATAAA 28741 CTATATATTT TTTTTTTTTT TCAACACACA CATTATGTAT ACATACATAC TGAATAAATA 28801 AATTATTTTT ATTTTTATTA TATATCGTAT CGTATATCGT ATATATTTTT TTTCGCGCTA 28861 AAATATATAT GTACAGATAC CATTATTTAG CCAGTAGATG AGTTATACGG ACACATACGG 28921 ACATACAGAA CGGAGGAGGA AGAGAGCGCA GGCGTGGAGC TGGCGCCTGA GAACCGGCGG 28981 ATTAGTTGCA ATATGTAGAT AAGGTCCAAA TAACCGGGTT CCCGCCACCG TAGAGCTCCA 29041 TTATTATTAT TCGCATAGAT AGTCAGTGTC GTCCTGCCTC GCCCCCAAAA AGCTCCTTCC 29101 CCGCTCCCCC GCTCGCCACA ATTTCCGCAC AGAGCTGCTA CTGAATATTA TTAACAATTT 29161 CTTGCTCAGA GTGGCAAGGG AAGAAGAAGA AAAGAAAAAA AAGAAAAAAT GAGAAGAACG 29221 AAGCGAATTG CATAAGCGAT ATGATGAAAA ATGATGAGCA AAAAACTTAT ATTTATTTCT 29281 ATAGCTTATT ATAATCGACC TAAAACTAAT TATTATGCTA ATTATAAACG ATTATTGAAT 29341 ACACACACGC AGAATAAAAT ACATTTTCCT AGTAACTTAG GCACACGCGA GTAAAAAAAA 29401 AAAAAAGAAC AACTGGAAAA CCTTGAAAAA AAATGCAAAA AAAAAGAATC ATGAAAAATT 29461 AAACACGTTA GCTTATTTTT AGACTCGCTA ATTACATAAC ACACACACAC ACACTCTATA 29521 CACAGACACG TGCATACACC GACAATTGTA TATGTAATGC TGTAATAATC ATGATAATAT 29581 TTAGATTCGT TGATGATAAT GAGCAAAGAA GCCGTAATGA TAATGATAAT AAATGAATAC 29641 AACAAAATCC AACAAATAAA AAGAGAAACA AATACAATAT TTAAACaaag taaaaccttc 29701 tgttgttgcc tttcctttta accaccgtat ccacagtatc attttgttca gtgtgtgaac 29761 ctttcgtctg catcttcatt tcccctctcc gaagtaattg ggctcggaat gggtgggata 29821 acatcgattc taatcgattt aatcgattcg ccatacttaa tgtttaagta taattatttg 29881 taatttcagt taaacatgcg tttttttttt aaatcatttc aatattattt cttacttaaa 29941 cgatggcttg tgcattctga ttgagcttct aggaaatggt ggcttgcaac tgtgtgaaat 30001 ggctgggtct gctctgtttt gggcctccca agcgaatcac ccacaaaaac aacataaagc 30061 ttaatccata cgaatagcaa tccacatctg tatatctttt tccttctttg gctctgaatc 30121 gttttgtagG CTCGCAATCG AACAGCGCCC TCAATAGCAG CGGAAGTGGC GGAAGCGGAA 30181 ACGGAGAGGC AGCCGGCTCG GGTTCCGGAT CCGGATCGGG CTCAGGAGGC GGGAACGGCG 30241 GGGATAACGA TGCTGGCGAC AGTGGAGCAA TCGCATCTGG AGGCGGAGCA GCAGAAACCG 30301 AGGCAGCGGC GTCGGGTTAG CGCGAGCGAG TAGCTCTTTA AATGTAATGT TATTAGCAGG 30361 TTTTTCGCTC GGCCCGGGGA TTCAGTTAAC CCATGTCGGC CAAGAGCGAG ATGACAACAC 30421 CACACACCAC ACACACACAT ACACACACGC AGCTACACTC GAAATATGAA AGAGATGTCG 30481 GATGTCCCAG CAGAAGCTAT GAGTAAATGA AATGAGACAA AGGAATTCAC TACACAAAAC 30541 GCCCAGTATC CTTACACCCC CACACAACAA TAAACCCATC CACACACACA CACACACACT 30601 AACACAAACA CACACACACA CACATGTATG TATGTATATC TAGCTATATG CATTGGGCGC 30661 AAGCAAATAT TTAGCATAAA ATCGAAATAA AACCAAAAAT CCACTTTAAA CTATGCATAA 30721 ATAATTAAAA TAATTATCTG TACTATTAAA GAGAAAGAGA AATCCCCGAA GGAATGTTGA 30781 GAAATAATCG GAAAACCCCT CGCCCGCGTC CCAAACCTTC AATATAGTAA ATAACACTTA 30841 ACAACCAGAT CGCGGAACGT AATATAAATT AAGTCAAAAA AAAAAACAAA AAGCAGAAGC 30901 AACTTGAATG AAATACTTAG TGAAATAAAT CAAAATTTTT GCCCATTTAA CGTTTATATA 30961 TATGCGCGGT TATACAAATA TATATAACGA TCACAGCAGT TAGCAATCCA TAGTAAAAGT 31021 AAACAATTAA AGGCGGCAAG TAAGAGGAAC TAGCAAAAGG GCGGATACAA CATAAACTAA Atgc =ekson Atgc =intron Analisa potongan DNA di atas dan jawab pertanyaan berikut : a. Dari organisme apa dan pada kromosom berapa fragmen DNA tersebut berasal? (5) .

ketika saya melihat query cover serta identity yang dimiliki yaitu mencapai 100% pada Drosophila melanogaster yang memiliki kromosom X. pada kasus ini Drosophila melanogaster adalah eukariotik. Karena sebenarnya ORF dari NCBI lebih cocok untuk prokariot daripada eukariot. Kemudian pada web tersebut. saya memilih BLAST. Ada berapa prediksi open reading frame (ORF)? Berapa exon diprediksi dari setiap ORF? Tandai prediksi daerah promoter. b. Sehingga pada kromosom 10 fragmen DNA itu berasal. Melalui sekuens tersebut. apabila mencari ORF menggunakan ORF Finder dari NCBI ditemukan hasil seperti berikut: . Selanjutnya menggunakan pilihan blastn. saya mencari data tersebut menggunakan NCBI. (35) Untuk mengetahui prediksi open reading frame (ORF). Setelah itu. untuk mencari organismenya apa. start dan stop codon. Sehingga. Seperti yang kita ketahui pencarian menggunakan ORF dari NCBI hanya memberikan alternatif mengenai start dan stop kodonnya saja. Hasil yang saya dapatkan yaitu. sekuens ini dimiliki oleh organisme Drosophila melanogaster. exon dan intron dan daerah terminator (penambahan polyA). dan dimasukkan sekuens yang sudah saya dapatkan dan di RUN BLAST.

Sehingga melalui GENSCAN. kita dapat mengetahui coding ORFnya. start.Sehingga digunakan GENSCAN. intron. hingga daerah terminator (penambahan poly A). . untuk mengetahui exon. stop kodon.

ORF 1 ORF 2 Pada gambar diatas. Jawaban untuk menandai. Apakah prediksi urutan asam aminonya dari setiap ORF dan apakah fungsinya? Berikan data dukung (45) PREDIKSI URUTAN ASAM AMINO ORF MENURUT GENSCAN . pada prediksi ORF 2 terdapat 14 ekson. lihat sekuens di atas. Dilihat dari ORF 1: diprediksi terdapat 5 ekson. Dimana Init menandakan adanya ekson pertama yang terdapat start kodon. sedangkan term yaituekson terakhir dan terdapat stop kodon. Dapat diketahui yaitu ada dua prediksi ORF. c. Sedangkan.

Prediksi urutan asam amino dari ORF 2. Sedangkan urutan asam amino ORF 2 ditemukan data sebagai berikut: .Prediksi urutan asam amino dari ORF 1. Tidak adanya fungsi yang ditemukan dari urutan asam amino ORF 1.

Penjelasan yang didapatkan dari protein katalitik domain (accnumber: cd14137) yaitu sebagai berikut: .

polypeptide substrate binding site dengan 12 residu. dimer interface dengan 25 residu. dan activation loop (Aloop) dengan 22 residu. hasil yang ditemukan sebagai berikut: . Kemudian terdapat axin binding site dengan 13 residu. Namun. apabila kita mencari fungsi dari urutan asam amino ORF yang didapatkan dari CDS di GenBank dari Drosophila melanogaster. active site yaitu 13 residu.Pada gambar di atas juga menunjukkan bahwa protein tersebut memiliki ATP binding site dengan 18 residu.

Urutan asam aminonya yaitu: Dan apabila menurut salah satu referensi dari Pubmed yaitu: .

Prediksi daerah functional domainnya dan berikan penjelasan (15) Untuk memprediksi daerah functional domain. maka menggunakan program Prosite. hasil yang diperoleh yaitu sebagai berikut: . prediksi urutan asam amino ORF 1 pada GENSCAN.walaupun memiliki urutan asam amino yang berbeda. Menurut data BLAST. ataupun glikogen synthase kinase 3. Sehingga sama dengan functional domain yang ditunjukkan oleh Prosite maupun conserved domain dari NCBI yaitu menunjukkan urutan asam amino tersebut mengkode protein kinase serine/thereonin. Sedangkan untuk urutan asam amino pada ORF 2 GENSCAN. d. urutan asam amino dari ORF mengkode protein glikogen synthase kinase/ Shaggy. tidak memunculkan hasil apapun (no hit) sehingga dapat diartikan bahwa urutan asam amino tersebut tidak terdapat pada database.

Umumnya protein ini memberikan deskripsi yaitu: Eukaryotic protein kinases are enzymes that belong to a very extensive family of proteins which share a conserved catalytic core common to both serine/threonine and tyrosine protein kinases. There are a number of conserved regions in the catalytic domain of protein kinases. is a glycine-rich stretch of residues in the vicinity of a lysine residue. The first region. We have selected two of these regions to build signature patterns. dan Protein Kinase ST (Serine/Threonine protein kinases active-site signature) (aa1182- aa1194).Daerah functional domain yang didapatkan yaitu mengkodekan protein kinase dom yaitu pada asam amino 1061 hingga 1345. which has been shown . Juga diprediksi terdapat functional domain lainnya yang ditemukan pada prosite yaitu Protein kinase ATP(Protein kinases ATP-binding region signature ) (aa1067-aa1091). which is located in the N-terminal extremity of the catalytic domain.

to be involved in ATP binding. contains a conserved aspartic acid residue which is important for the catalytic activity of the enzyme. The second region. . we have derived two signature patterns for that region: one specific for serine/ threonine kinases and the other for tyrosine kinases. We also developed a profile which is based on the alignment in and covers the entire catalytic domain. which is located in the central part of the catalytic domain.