U.S. patent application number 13/861814 was filed with the patent office on 2013-11-07 for plants having enhanced yield-related traits and a method for making the same. This patent application is currently assigned to Crop Functional Genomics Center. The applicant listed for this patent is Yang Do Choi, In Gyu Hwang, Seok Won Jeong Jeong, Jonghee Oh, Youn-Il Park. Invention is credited to Yang Do Choi, In Gyu Hwang, Seok Won Jeong Jeong, Jonghee Oh, Youn-Il Park.
Application Number | 20130298287 13/861814 |
Document ID | / |
Family ID | 39420317 |
Filed Date | 2013-11-07 |
United States Patent Application | 20130298287 |
Kind Code | A1 |
Park; Youn-Il ; et al. | November 7, 2013 |
The present invention relates generally to the field of molecular biology and concerns a method for enhancing various economically important yield-related traits in plants. More specifically, the present invention concerns a method for enhancing yield-related traits in plants by modulating expression in a plant of a nucleic acid encoding a Harpin-associated Factor G polypeptide (hereinafter termed HpaG"). The present invention also concerns plants having modulated expression of a nucleic acid encoding an HpaG polypeptide, which plants have enhanced yield-related traits relative to control plants. The invention also provides constructs comprising HpaG-encoding nucleic acids, useful in performing the methods of the invention. The present invention also provides a method for enhancing yield-related traits in plants relative to control plants, by modulating (preferably increasing) expression in a plant of a nucleic acid sequence encoding a SWITCH 2/SUCROSE NON-FERMENTING 2 (SWI2/SNF2) polypeptide. The present invention also concerns plants having modulated expression of a nucleic acid sequence encoding a SWI2/SNF2 polypeptide, which plants have enhanced yield-related traits relative to control plants. The invention also provides constructs useful in performing the methods of the invention.
Inventors: | Park; Youn-Il; (Daejeon, KR) ; Choi; Yang Do; (Seoul, KR) ; Jeong; Seok Won Jeong; (Seoul, KR) ; Hwang; In Gyu; (Seoul, KR) ; Oh; Jonghee; (Seoul, KR) | ||||||||||
Applicant: |
|
||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|
Assignee: | Crop Functional Genomics
Center Seoul KR CropDesign N.V. Zwijnaarde BE |
||||||||||
Family ID: | 39420317 | ||||||||||
Appl. No.: | 13/861814 | ||||||||||
Filed: | April 25, 2013 |
Application Number | Filing Date | Patent Number | ||
---|---|---|---|---|
12528809 | Nov 10, 2009 | 8440881 | ||
PCT/EP2008/052450 | Feb 28, 2008 | |||
13861814 | ||||
60896050 | Mar 21, 2007 | |||
60909510 | Apr 2, 2007 | |||
Current U.S. Class: | 800/290 ; 435/320.1; 435/419; 800/298; 800/320; 800/320.1; 800/320.3 |
Current CPC Class: | C12N 15/8273 20130101; C07K 14/21 20130101; Y02A 40/146 20180101; C12N 15/8261 20130101; C12N 15/827 20130101 |
Class at Publication: | 800/290 ; 800/298; 800/320.1; 800/320.3; 800/320; 435/320.1; 435/419 |
International Class: | C12N 15/82 20060101 C12N015/82 |
Date | Code | Application Number |
---|---|---|
Feb 28, 2007 | EP | 07103271.8 |
Mar 15, 2007 | EP | 07104197.4 |
Sequence CWU 1
1
1141402DNAXanthomonas axonopodis 1atgaattctt tgaacacaca gctcggcgcc
aactcgtcct tctttcaggt tgaccccggc 60cagaacacgc aatctagtcc gaaccagggc
aaccagggca tctcggaaaa gcaactggac 120cagctgctga cccagctcat
catggccctg cttcagcaga gcaacaatgc cgagcagggt 180cagggtcaag
gccagggtgg tgactctggc ggtcagggcg gcaatccgcg gcaggccggg
240cagtccaacg gctccccctc gcaatacacc caggcgctga tgaatatcgt
cggagacatt 300ctccaggcgc agaatggtgg cggcttcggc ggcggctttg
gtggtggctt cggtggcatc 360ctcgtcacca gccttgcgag cgacaccgga
tcgatgcagt aa 4022133PRTXanthomonas axonopodis 2Met Asn Ser Leu Asn
Thr Gln Leu Gly Ala Asn Ser Ser Phe Phe Gln 1 5 10 15 Val Asp Pro
Gly Gln Asn Thr Gln Ser Ser Pro Asn Gln Gly Asn Gln 20 25 30 Gly
Ile Ser Glu Lys Gln Leu Asp Gln Leu Leu Thr Gln Leu Ile Met 35 40
45 Ala Leu Leu Gln Gln Ser Asn Asn Ala Glu Gln Gly Gln Gly Gln Gly
50 55 60 Gln Gly Gly Asp Ser Gly Gly Gln Gly Gly Asn Pro Arg Gln
Ala Gly 65 70 75 80 Gln Ser Asn Gly Ser Pro Ser Gln Tyr Thr Gln Ala
Leu Met Asn Ile 85 90 95 Val Gly Asp Ile Leu Gln Ala Gln Asn Gly
Gly Gly Phe Gly Gly Gly 100 105 110 Phe Gly Gly Gly Phe Gly Gly Ile
Leu Val Thr Ser Leu Ala Ser Asp 115 120 125 Thr Gly Ser Met Gln 130
311PRTArtificial sequenceconserved motif 1 of the HpaG protein 3Gly
Xaa Xaa Xaa Xaa Gln Xaa Gly Xaa Xaa Gly 1 5 10 414PRTArtificial
sequenceconserved motif 2 of the HpaG protein 4Xaa Ser Xaa Xaa Thr
Gln Xaa Leu Met Xaa Ile Val Xaa Xaa 1 5 10 52194DNAOryza sativa
5aatccgaaaa gtttctgcac cgttttcacc ccctaactaa caatataggg aacgtgtgct
60aaatataaaa tgagacctta tatatgtagc gctgataact agaactatgc aagaaaaact
120catccaccta ctttagtggc aatcgggcta aataaaaaag agtcgctaca
ctagtttcgt 180tttccttagt aattaagtgg gaaaatgaaa tcattattgc
ttagaatata cgttcacatc 240tctgtcatga agttaaatta ttcgaggtag
ccataattgt catcaaactc ttcttgaata 300aaaaaatctt tctagctgaa
ctcaatgggt aaagagagag atttttttta aaaaaataga 360atgaagatat
tctgaacgta ttggcaaaga tttaaacata taattatata attttatagt
420ttgtgcattc gtcatatcgc acatcattaa ggacatgtct tactccatcc
caatttttat 480ttagtaatta aagacaattg acttattttt attatttatc
ttttttcgat tagatgcaag 540gtacttacgc acacactttg tgctcatgtg
catgtgtgag tgcacctcct caatacacgt 600tcaactagca acacatctct
aatatcactc gcctatttaa tacatttagg tagcaatatc 660tgaattcaag
cactccacca tcaccagacc acttttaata atatctaaaa tacaaaaaat
720aattttacag aatagcatga aaagtatgaa acgaactatt taggtttttc
acatacaaaa 780aaaaaaagaa ttttgctcgt gcgcgagcgc caatctccca
tattgggcac acaggcaaca 840acagagtggc tgcccacaga acaacccaca
aaaaacgatg atctaacgga ggacagcaag 900tccgcaacaa ccttttaaca
gcaggctttg cggccaggag agaggaggag aggcaaagaa 960aaccaagcat
cctccttctc ccatctataa attcctcccc ccttttcccc tctctatata
1020ggaggcatcc aagccaagaa gagggagagc accaaggaca cgcgactagc
agaagccgag 1080cgaccgcctt ctcgatccat atcttccggt cgagttcttg
gtcgatctct tccctcctcc 1140acctcctcct cacagggtat gtgcctccct
tcggttgttc ttggatttat tgttctaggt 1200tgtgtagtac gggcgttgat
gttaggaaag gggatctgta tctgtgatga ttcctgttct 1260tggatttggg
atagaggggt tcttgatgtt gcatgttatc ggttcggttt gattagtagt
1320atggttttca atcgtctgga gagctctatg gaaatgaaat ggtttaggga
tcggaatctt 1380gcgattttgt gagtaccttt tgtttgaggt aaaatcagag
caccggtgat tttgcttggt 1440gtaataaagt acggttgttt ggtcctcgat
tctggtagtg atgcttctcg atttgacgaa 1500gctatccttt gtttattccc
tattgaacaa aaataatcca actttgaaga cggtcccgtt 1560gatgagattg
aatgattgat tcttaagcct gtccaaaatt tcgcagctgg cttgtttaga
1620tacagtagtc cccatcacga aattcatgga aacagttata atcctcagga
acaggggatt 1680ccctgttctt ccgatttgct ttagtcccag aatttttttt
cccaaatatc ttaaaaagtc 1740actttctggt tcagttcaat gaattgattg
ctacaaataa tgcttttata gcgttatcct 1800agctgtagtt cagttaatag
gtaatacccc tatagtttag tcaggagaag aacttatccg 1860atttctgatc
tccattttta attatatgaa atgaactgta gcataagcag tattcatttg
1920gattattttt tttattagct ctcacccctt cattattctg agctgaaagt
ctggcatgaa 1980ctgtcctcaa ttttgttttc aaattcacat cgattatcta
tgcattatcc tcttgtatct 2040acctgtagaa gtttcttttt ggttattcct
tgactgcttg attacagaaa gaaatttatg 2100aagctgtaat cgggatagtt
atactgcttg ttcttatgat tcatttcctt tgtgcagttc 2160ttggtgtagc
ttgccacttt caccagcaaa gttc 219461179DNAOryza sativa 6ttgcagttgt
gaccaagtaa gctgagcatg cccttaactt cacctagaaa aaagtatact 60tggcttaact
gctagtaaga catttcagaa ctgagactgg tgtacgcatt tcatgcaagc
120cattaccact ttacctgaca ttttggacag agattagaaa tagtttcgta
ctacctgcaa 180gttgcaactt gaaaagtgaa atttgttcct tgctaatata
ttggcgtgta attcttttat 240gcgttagcgt aaaaagttga aatttgggtc
aagttactgg tcagattaac cagtaactgg 300ttaaagttga aagatggtct
tttagtaatg gagggagtac tacactatcc tcagctgatt 360taaatcttat
tccgtcggtg gtgatttcgt caatctccca acttagtttt tcaatatatt
420cataggatag agtgtgcata tgtgtgttta tagggatgag tctacgcgcc
ttatgaacac 480ctacttttgt actgtatttg tcaatgaaaa gaaaatctta
ccaatgctgc gatgctgaca 540ccaagaagag gcgatgaaaa gtgcaacgga
tatcgtgcca cgtcggttgc caagtcagca 600cagacccaat gggcctttcc
tacgtgtctc ggccacagcc agtcgtttac cgcacgttca 660catgggcacg
aactcgcgtc atcttcccac gcaaaacgac agatctgccc tatctggtcc
720cacccatcag tggcccacac ctcccatgct gcattatttg cgactcccat
cccgtcctcc 780acgcccaaac accgcacacg ggtcgcgata gccacgaccc
aatcacacaa cgccacgtca 840ccatatgtta cgggcagcca tgcgcagaag
atcccgcgac gtcgctgtcc cccgtgtcgg 900ttacgaaaaa atatcccacc
acgtgtcgct ttcacaggac aatatctcga aggaaaaaaa 960tcgtagcgga
aaatccgagg cacgagctgc gattggctgg gaggcgtcca gcgtggtggg
1020gggcccaccc ccttatcctt agcccgtggc gctcctcgct cctcgggtcc
gtgtataaat 1080accctccgga actcactctt gctggtcacc aacacgaagc
aaaaggacac cagaaacata 1140gtacacttga gctcactcca aactcaaaca
ctcacacca 11797402DNAArtificial sequencesynthetic construct mutant
elicitor of hypersensitive response HpaG_T44C gene 7atgaattctt
tgaacacaca gctcggcgcc aactcgtcct tctttcaggt tgaccccggc 60cagaacacgc
aatctagtcc gaaccagggc aaccagggca tctcggaaaa gcaactggac
120cagctgctgt gccagctcat catggccctg cttcagcaga gcaacaatgc
cgagcagggt 180cagggtcaag gccagggtgg tgactctggc ggtcagggcg
gcaatccgcg gcaggccggg 240cagtccaacg gctccccctc gcaatacacc
caggcgctga tgaatatcgt cggagacatt 300ctccaggcgc agaatggtgg
cggcttcggc ggcggctttg gtggtggctt cggtggcatc 360ctcgtcacca
gccttgcgag cgacaccgga tcgatgcagt aa 4028133PRTArtificial
sequencemutant elicitor of hypersensitive response HpaG_T44C 8Met
Asn Ser Leu Asn Thr Gln Leu Gly Ala Asn Ser Ser Phe Phe Gln 1 5 10
15 Val Asp Pro Gly Gln Asn Thr Gln Ser Ser Pro Asn Gln Gly Asn Gln
20 25 30 Gly Ile Ser Glu Lys Gln Leu Asp Gln Leu Leu Cys Gln Leu
Ile Met 35 40 45 Ala Leu Leu Gln Gln Ser Asn Asn Ala Glu Gln Gly
Gln Gly Gln Gly 50 55 60 Gln Gly Gly Asp Ser Gly Gly Gln Gly Gly
Asn Pro Arg Gln Ala Gly 65 70 75 80 Gln Ser Asn Gly Ser Pro Ser Gln
Tyr Thr Gln Ala Leu Met Asn Ile 85 90 95 Val Gly Asp Ile Leu Gln
Ala Gln Asn Gly Gly Gly Phe Gly Gly Gly 100 105 110 Phe Gly Gly Gly
Phe Gly Gly Ile Leu Val Thr Ser Leu Ala Ser Asp 115 120 125 Thr Gly
Ser Met Gln 130 9378DNAArtificial sequencesynthetic construct
mutant elicitor of hypersensitive response HpaG-T gene 9atgaattctt
tgaacacaca gctcggcgcc aactcgtcct tctttcaggt tgaccccggc 60cagaacacgc
aatctagtcc gaaccagggc aaccagggca tctcggaaaa gcaactggac
120cagctgctga cccagctcat catggccctg cttcagcaga gcaacaatgc
cgagcagggt 180cagggtcaag gccagggtgg tgactctggc ggtcagggcg
gcaatccgcg gcaggccggg 240cagtccaacg gctccccctc gcaatacacc
caggcgctga tgaatatcgt cggagacggc 300ttcggcggcg gctttggtgg
tggcttcggt ggcatcctcg tcaccagcct tgcgagcgac 360accggatcga tgcagtaa
37810125PRTArtificial sequencemutant elicitor of hypersensitive
response HpaG-T 10Met Asn Ser Leu Asn Thr Gln Leu Gly Ala Asn Ser
Ser Phe Phe Gln 1 5 10 15 Val Asp Pro Gly Gln Asn Thr Gln Ser Ser
Pro Asn Gln Gly Asn Gln 20 25 30 Gly Ile Ser Glu Lys Gln Leu Asp
Gln Leu Leu Thr Gln Leu Ile Met 35 40 45 Ala Leu Leu Gln Gln Ser
Asn Asn Ala Glu Gln Gly Gln Gly Gln Gly 50 55 60 Gln Gly Gly Asp
Ser Gly Gly Gln Gly Gly Asn Pro Arg Gln Ala Gly 65 70 75 80 Gln Ser
Asn Gly Ser Pro Ser Gln Tyr Thr Gln Ala Leu Met Asn Ile 85 90 95
Val Gly Asp Gly Phe Gly Gly Gly Phe Gly Gly Gly Phe Gly Gly Ile 100
105 110 Leu Val Thr Ser Leu Ala Ser Asp Thr Gly Ser Met Gln 115 120
125 11414DNAXanthomonas axonopodisXanthomonas axonopodis pv. citri
11ttactgcatc gatccggtgt cgctcgcaag gctggtgccg aggctggtgc cgaggccgcc
60gccgaagcca ccaccaaagc cgccgccgaa gccaccacca ttctgcgcct ggagaatgtc
120tccgacgata ttcatcagca tctgggtgta ttgcgagggg gagccgttgg
actgaccggc 180ctgctgccga ttgccgccct gaccaccaga gtcaccaccc
tggccttgac cctgaccctg 240ctcggcattg ttgctctgct gaagcagggc
catgatgagc tgggtcagca gctggtccag 300ttgcttttcc gagatgccct
ggttgccctg gttcgaacca gattgcgtgt tctggctggg 360gtcaacctga
aagaaggacg agttggcgcc gagctgtgtg ttcaaagaat tcat
41412137PRTXanthomonas axonopodisXanthomonas axonopodis pv. citri
12Met Asn Ser Leu Asn Thr Gln Leu Gly Ala Asn Ser Ser Phe Phe Gln 1
5 10 15 Val Asp Pro Ser Gln Asn Thr Gln Ser Gly Ser Asn Gln Gly Asn
Gln 20 25 30 Gly Ile Ser Glu Lys Gln Leu Asp Gln Leu Leu Thr Gln
Leu Ile Met 35 40 45 Ala Leu Leu Gln Gln Ser Asn Asn Ala Glu Gln
Gly Gln Gly Gln Gly 50 55 60 Gln Gly Gly Asp Ser Gly Gly Gln Gly
Gly Asn Arg Gln Gln Ala Gly 65 70 75 80 Gln Ser Asn Gly Ser Pro Ser
Gln Tyr Thr Gln Met Leu Met Asn Ile 85 90 95 Val Gly Asp Ile Leu
Gln Ala Gln Asn Gly Gly Gly Phe Gly Gly Gly 100 105 110 Phe Gly Gly
Gly Phe Gly Gly Gly Leu Gly Thr Ser Leu Gly Thr Ser 115 120 125 Leu
Ala Ser Asp Thr Gly Ser Met Gln 130 135 13366DNAArtificial
sequencesynthetic construct mutant elicitor of hypersensitive
response HpaG-N gene 13atgaattctt tgaacacaca gctcggcgcc aactcgtcct
tctttcaggt tgaccccggc 60cagaacacgc aatctagtcc gaaccagggc aacacccagc
tcatcatggc cctgcttcag 120cagagcaaca atgccgagca gggtcagggt
caaggccagg gtggtgactc tggcggtcag 180ggcggcaatc cgcggcaggc
cgggcagtcc aacggctccc cctcgcaata cacccaggcg 240ctgatgaata
tcgtcggaga cattctccag gcgcagaatg gtggcggctt cggcggcggc
300tttggtggtg gcttcggtgg catcctcgtc accagccttg cgagcgacac
cggatcgatg 360cagtaa 36614121PRTArtificial sequencemutant elicitor
of hypersensitive response HpaG-N 14Met Asn Ser Leu Asn Thr Gln Leu
Gly Ala Asn Ser Ser Phe Phe Gln 1 5 10 15 Val Asp Pro Gly Gln Asn
Thr Gln Ser Ser Pro Asn Gln Gly Asn Thr 20 25 30 Gln Leu Ile Met
Ala Leu Leu Gln Gln Ser Asn Asn Ala Glu Gln Gly 35 40 45 Gln Gly
Gln Gly Gln Gly Gly Asp Ser Gly Gly Gln Gly Gly Asn Pro 50 55 60
Arg Gln Ala Gly Gln Ser Asn Gly Ser Pro Ser Gln Tyr Thr Gln Ala 65
70 75 80 Leu Met Asn Ile Val Gly Asp Ile Leu Gln Ala Gln Asn Gly
Gly Gly 85 90 95 Phe Gly Gly Gly Phe Gly Gly Gly Phe Gly Gly Ile
Leu Val Thr Ser 100 105 110 Leu Ala Ser Asp Thr Gly Ser Met Gln 115
120 15366DNAXanthomonas axonopodis 15atgaattctt tgaacacaca
gctcggcgcc aactcgtcct tctttcaggt tgaccccggc 60cagaacacgc aatctagtcc
gaaccagggc aaccagggca tctcggaaaa gcaactggac 120cagctgctga
cccagctcat catggccctg cttcagcaga gcaacaatgc cgagcagggt
180cagggtcaag gccagggtgg tgactctggc ggtcagggcg gcaatccgcg
gcaggccggg 240cagtccaacg gctccccctc gcaatacacc caggcgctga
tgaatatcgt cggagacatt 300ctccaggcgc agaatggctt tatcctcgtc
accagccttg cgagcgacac cggatcgatg 360cagtaa 36616121PRTXanthomonas
axonopodis 16Met Asn Ser Leu Asn Thr Gln Leu Gly Ala Asn Ser Ser
Phe Phe Gln 1 5 10 15 Val Asp Pro Gly Gln Asn Thr Gln Ser Ser Pro
Asn Gln Gly Asn Gln 20 25 30 Gly Ile Ser Glu Lys Gln Leu Asp Gln
Leu Leu Thr Gln Leu Ile Met 35 40 45 Ala Leu Leu Gln Gln Ser Asn
Asn Ala Glu Gln Gly Gln Gly Gln Gly 50 55 60 Gln Gly Gly Asp Ser
Gly Gly Gln Gly Gly Asn Pro Arg Gln Ala Gly 65 70 75 80 Gln Ser Asn
Gly Ser Pro Ser Gln Tyr Thr Gln Ala Leu Met Asn Ile 85 90 95 Val
Gly Asp Ile Leu Gln Ala Gln Asn Gly Phe Ile Leu Val Thr Ser 100 105
110 Leu Ala Ser Asp Thr Gly Ser Met Gln 115 120 17402DNAXanthomonas
smithiiXanthomonas smithii subsp. smithii 17atgaattctt tgaacacaca
gatcggcgcc aactcgtcct tcttgcaggt cgacccgagc 60cagaacacgc aattcggtcc
gaaccagggc aatcaaggca tctcggaaaa gcagctggac 120cagctgctga
cccagctcat catggccctg cttcagcaga gcaacaatgc cgaccagggt
180cagggtggtg actctggtgg tcaaggcggc aattcgcggc aggccgggca
gcccaatggt 240tccccctcgg catacaccca gatgctgatg aatatcgtcg
gagacattct ccaggcgcag 300aatggtggtg gcttcggcgg cgggttcggc
ggtggctttg gtggcgggct cggcaccagc 360ctcggcagca gccttgcgag
cgacaccgga tcgatgcagt aa 40218133PRTXanthomonas smithiiXanthomonas
smithii subsp. smithii 18Met Asn Ser Leu Asn Thr Gln Ile Gly Ala
Asn Ser Ser Phe Leu Gln 1 5 10 15 Val Asp Pro Ser Gln Asn Thr Gln
Phe Gly Pro Asn Gln Gly Asn Gln 20 25 30 Gly Ile Ser Glu Lys Gln
Leu Asp Gln Leu Leu Thr Gln Leu Ile Met 35 40 45 Ala Leu Leu Gln
Gln Ser Asn Asn Ala Asp Gln Gly Gln Gly Gly Asp 50 55 60 Ser Gly
Gly Gln Gly Gly Asn Ser Arg Gln Ala Gly Gln Pro Asn Gly 65 70 75 80
Ser Pro Ser Ala Tyr Thr Gln Met Leu Met Asn Ile Val Gly Asp Ile 85
90 95 Leu Gln Ala Gln Asn Gly Gly Gly Phe Gly Gly Gly Phe Gly Gly
Gly 100 105 110 Phe Gly Gly Gly Leu Gly Thr Ser Leu Gly Ser Ser Leu
Ala Ser Asp 115 120 125 Thr Gly Ser Met Gln 130 19420DNAXanthomonas
oryzaeXanthomonas oryzae pv. oryzae 19atgaactctt tgaacacaca
attcggcggc agcacgtcca accttcaggt tggcccaagc 60caggacacaa cgttcggttc
gaaccagggc ggcaaccagg gcatctcgga aaagcaactg 120gaccagttgc
tgtgccagct catctcggcc ctgcttcagt cgagcaaaaa tgctgaggag
180ggtaagggtc agggtggcga taatggcggt ggccagggcg gcaattcgca
gcaggccggg 240cagcagaatg gcccctcgcc attcacccag atgctgatgc
atatcgtcgg agagattctc 300caggcgcaga atggtggtgg tgctggtggc
ggcggtttcg gcggcgggtt cggcggcgac 360tttagtggcg acctcggcct
cggcaccaac ctctcgagcg acagcgcatc aatgcagtaa 42020139PRTXanthomonas
oryzaeXanthomonas oryzae pv. oryzae 20Met Asn Ser Leu Asn Thr Gln
Phe Gly Gly Ser Thr Ser Asn Leu Gln 1 5 10 15 Val Gly Pro Ser Gln
Asp Thr Thr Phe Gly Ser Asn Gln Gly Gly Asn 20 25 30 Gln Gly Ile
Ser Glu Lys Gln Leu Asp Gln Leu Leu Cys Gln Leu Ile 35 40 45 Ser
Ala Leu Leu Gln Ser Ser Lys Asn Ala Glu Glu Gly Lys Gly Gln 50 55
60 Gly Gly Asp Asn Gly Gly Gly Gln Gly Gly Asn Ser Gln Gln Ala Gly
65 70 75 80 Gln Gln Asn Gly Pro Ser Pro Phe Thr Gln Met Leu Met His
Ile Val 85 90 95 Gly Glu Ile Leu Gln Ala Gln Asn Gly Gly Gly Ala
Gly Gly Gly Gly 100 105 110 Phe Gly Gly Gly Phe Gly Gly Asp Phe Ser
Gly Asp Leu Gly Leu Gly 115 120 125 Thr Asn Leu Ser Ser Asp Ser Ala
Ser Met Gln 130 135 21420DNAXanthomonas oryzaeXanthomonas oryzae
pv. oryzae 21atgaattctt tgaacacaca attcggcggc agcacgtcca accttcaggt
tggcccaagc 60caggacacaa cgttcggttc gaaccagggc ggcaaccagg
gcatctcgga aaagcaactg 120gaccagttgc tgtgccagct catctcggcc
ctgcttcagt cgagcaaaaa tgctgaggag 180ggtaagggtc agggtggcga
taatggcggt ggccagggcg gcaattcgca gcaggctggg 240cagcagaatg
gcccctcgcc attcacccag atgctgatgc atatcgtcgg agagattctc
300caggcgcaga atggtggtgg tgctggtggc ggcgggttcg gcggcgggtt
cggcggtgac 360tttagtggcg acctcggcct cggcaccaac ctctcgagcg
acagcgcatc gatgcagtaa 42022139PRTXanthomonas oryzaeXanthomonas
oryzae pv. oryzae 22Met Asn Ser Leu Asn Thr Gln Phe Gly Gly Ser Thr
Ser Asn Leu Gln 1 5 10 15 Val Gly Pro Ser Gln Asp Thr Thr Phe Gly
Ser Asn Gln Gly Gly Asn 20 25 30 Gln Gly Ile Ser Glu Lys Gln Leu
Asp Gln Leu Leu Cys Gln Leu Ile 35 40 45 Ser Ala Leu Leu Gln Ser
Ser Lys Asn Ala Glu Glu Gly Lys Gly Gln 50 55 60 Gly Gly Asp Asn
Gly Gly Gly Gln Gly Gly Asn Ser Gln Gln Ala Gly 65 70 75 80 Gln Gln
Asn Gly Pro Ser Pro Phe Thr Gln Met Leu Met His Ile Val 85 90 95
Gly Glu Ile Leu Gln Ala Gln Asn Gly Gly Gly Ala Gly Gly Gly Gly 100
105 110 Phe Gly Gly Gly Phe Gly Gly Asp Phe Ser Gly Asp Leu Gly Leu
Gly 115 120 125 Thr Asn Leu Ser Ser Asp Ser Ala Ser Met Gln 130 135
23420DNAXanthomonas oryzaeXanthomonas oryzae pv. oryzae
23atgaattctt tgaacacaca attcggcggc agcacgtcca accttcaggt tggcccaagc
60caggacacaa cgttcggttc gaaccagggc ggcaaccagg gcatctcgga aaagcaactg
120gaccagttgc tgtgccagct catctcggcc ctgcttcagt cgagcaaaaa
tgctgaggag 180ggtaagggtc agggtggcga taatggcggt ggccagggcg
gcaattcgca gcaggccggg 240cagcagaatg gcccctcgcc attcacccag
atgctgatgc atatcgtcgg agagattctc 300caggcgcaga atggtggtgg
tgctggtggc ggcgggttcg gcggcgggtt cggcggtgac 360tttagtggcg
acctcggcct cggcaccaac ctctcgagcg acagcgcatc gatgcagtaa
42024139PRTXanthomonas oryzaeXanthomonas oryzae pv. oryzae 24Met
Asn Ser Leu Asn Thr Gln Phe Gly Gly Ser Thr Ser Asn Leu Gln 1 5 10
15 Val Gly Pro Ser Gln Asp Thr Thr Phe Gly Ser Asn Gln Gly Gly Asn
20 25 30 Gln Gly Ile Ser Glu Lys Gln Leu Asp Gln Leu Leu Cys Gln
Leu Ile 35 40 45 Ser Ala Leu Leu Gln Ser Ser Lys Asn Ala Glu Glu
Gly Lys Gly Gln 50 55 60 Gly Gly Asp Asn Gly Gly Gly Gln Gly Gly
Asn Ser Gln Gln Ala Gly 65 70 75 80 Gln Gln Asn Gly Pro Ser Pro Phe
Thr Gln Met Leu Met His Ile Val 85 90 95 Gly Glu Ile Leu Gln Ala
Gln Asn Gly Gly Gly Ala Gly Gly Gly Gly 100 105 110 Phe Gly Gly Gly
Phe Gly Gly Asp Phe Ser Gly Asp Leu Gly Leu Gly 115 120 125 Thr Asn
Leu Ser Ser Asp Ser Ala Ser Met Gln 130 135 25378DNAXanthomonas
oryzae pv.Xanthomonas oryzae pv. oryzicola 25atgaattctt tgaacacaca
attcggcggc agcgcgtcca acttccaggt tgaccaaagc 60cagaacgcgc aatccgattc
gagccagggc agcaatggca gccagggtat ctcggaaaag 120caactggacc
agttgctgtg ccagctcatc caggccctgc ttcagccgaa caaaaatgct
180gaggaaggta agggtcagca gggtggcgag aataatcagc aggccgggaa
ggagaatggc 240gcctcgccac tcacccagat gctgatgaat atcgtcggag
agattctcca ggcgcagaat 300gccggcggca gcagcggcgg cgactttggt
ggcagtttcg ccagcagctt ctcgaacgac 360agcggatcga tgcagtaa
37826125PRTXanthomonas oryzaeXanthomonas oryzae pv. oryzicola 26Met
Asn Ser Leu Asn Thr Gln Phe Gly Gly Ser Ala Ser Asn Phe Gln 1 5 10
15 Val Asp Gln Ser Gln Asn Ala Gln Ser Asp Ser Ser Gln Gly Ser Asn
20 25 30 Gly Ser Gln Gly Ile Ser Glu Lys Gln Leu Asp Gln Leu Leu
Cys Gln 35 40 45 Leu Ile Gln Ala Leu Leu Gln Pro Asn Lys Asn Ala
Glu Glu Gly Lys 50 55 60 Gly Gln Gln Gly Gly Glu Asn Asn Gln Gln
Ala Gly Lys Glu Asn Gly 65 70 75 80 Ala Ser Pro Leu Thr Gln Met Leu
Met Asn Ile Val Gly Glu Ile Leu 85 90 95 Gln Ala Gln Asn Ala Gly
Gly Ser Ser Gly Gly Asp Phe Gly Gly Ser 100 105 110 Phe Ala Ser Ser
Phe Ser Asn Asp Ser Gly Ser Met Gln 115 120 125 27366DNAXanthomonas
campestrisXanthomonas campestris pv. campestris 27tcaggcttgg
ccggtgatgc tcgacaggtt ggcattgaag ccgccaccca agctggtgcc 60gcccatgccg
gcgccgcctt ggttctgcat cagctgcatc acgatctgca tcagcatctg
120cgtcaacgga ctcacaccgt cctgttgacc gctctgcggt tgttcgtctc
cgcactcctg 180atcggcatcg ctgccctggc tctgttggag catcatcatg
atgaacatgg cgagcagctg 240atccagctgc tgctcggagt cagccgaagg
cgagcgctga ctggagttct gggtttgctg 300gggcccgatg cccatcgtct
gcaggttgat gaagttggaa aatttgtttc cgatagatga 360gtccat
36628121PRTXanthomonas campestrisXanthomonas campestris pv.
campestris 28Met Asp Ser Ser Ile Gly Asn Lys Phe Ser Asn Phe Ile
Asn Leu Gln 1 5 10 15 Thr Met Gly Ile Gly Pro Gln Gln Thr Gln Asn
Ser Ser Gln Arg Ser 20 25 30 Pro Ser Ala Asp Ser Glu Gln Gln Leu
Asp Gln Leu Leu Ala Met Phe 35 40 45 Ile Met Met Met Leu Gln Gln
Ser Gln Gly Ser Asp Ala Asp Gln Glu 50 55 60 Cys Gly Asp Glu Gln
Pro Gln Ser Gly Gln Gln Asp Gly Val Ser Pro 65 70 75 80 Leu Thr Gln
Met Leu Met Gln Ile Val Met Gln Leu Met Gln Asn Gln 85 90 95 Gly
Gly Ala Gly Met Gly Gly Thr Ser Leu Gly Gly Gly Phe Asn Ala 100 105
110 Asn Leu Ser Ser Ile Thr Gly Gln Ala 115 120
293282DNASynechocystis sp. 29tgttcgttgc acaaattgat gagcaatgct
tttttataat gccaactttg tacaaaaaag 60caggcttaaa caatggcgac tatccacggt
aattggcaac cctcccacgg ggaaaacggc 120ggcaaactgt ttctttgggc
ggatacctgg ggtcatcctt tgccagaaac cattggcgat 180cgccatccct
ttgcgttgga tctgccggat ttgctacagg cctggtcgaa tttgcccctg
240gccttcccca aggcggatgg ggtgacagag gcagccctta ctctgcattt
acccagccat 300cgccagcaaa aaattcccct accctttgtc acagggcaag
atccggtggc catggatgcg 360aaatatctcc actggcgatc gtggcaggta
accggggtaa atctgacccc aagccaaacg 420ttaacgttgc tccaatctat
tcccctgggg ggccaagcct tagctaactt aggatcagag 480ttttactttt
acggtcaact gcaccgctgg tgtttagatt tggtgctacg gggtaaattt
540gtgccgggac tggagcaaag gggggaagac ggtaattact atgcccaatg
gattcctatc 600ctcgatagca tccaagacca aacccattta gcccaattta
gccagagagt acctgcctgc 660gccctggcca acctgactga ctcccaggag
ccccaaatgt tggtggtgga tttactacaa 720aaattattgc aagcccaaat
tggtgccgtc agtcccagcc tagccaacgt taaagaagtc 780tggttgaatg
attggctccg gggattaacc catggggggc aaacctccct cggcacaagc
840aaagctctac aacgattagc cacatcctta gaccattggt atttaccagt
ccagaattat 900ttgggccaaa aaaataacca agctttagcc caacggcaat
ggcggggggc tctgcggtta 960caacctccag cggacgatgg ggggggaacc
tggcaactgg attatggttt acaagccctg 1020gatgacgggg aattttggct
cccggcggct tccctctggg ccatggccgg cgatcgcctg 1080gtgtggcagg
gaaggagggt tgaccagggg gcggaaagtt tactgcgggg cttaggggta
1140gctgcccaaa tttacgaacc cattgctgca agtttgacgg aaaggtgtcc
cacgggctgt 1200gggctagatg ccatccaagc ctacgaattt atcctggcaa
tcgcccatca attgcgggat 1260cgggggttag gggtaatcct cccgccgggg
ttagaacggg gcggcaccgc caaacggtta 1320ggggtaaaag tggtggggga
agtgcaacgg caaaggggcc agcggctaac tctgcaaagt 1380ttaattaatt
acgacttgca actaatgatg gggagcgggg acaatgcccg gttattgacg
1440gccaaggact ttgaagcgtt actagcccaa aaatctcccc tggtggtgct
ggacggagaa 1500tggattaccc tgcaaccggc ggacgtgcgg gcggccaagg
tcattttaca gcagcaacaa 1560tctgccccgc ccctcacagt ggaggatgct
ctgcgcctca gcattggtga tttacaaacc 1620gtctctaaac tgccggtgac
ccagtttgct gctcggggca tattacagga attgatcgac 1680accctccgta
acccggaagg agtgaaagcc attgctgacc caccgggctt tcagggtact
1740ttacggccct accaagctcg gggagtgggc tggttagctt ttctggaacg
gtgggggctg 1800ggggcctgtt tggcagacga tatgggtttg ggaaaaacac
cccagttgct ggcttttctg 1860ctccatttag ccgcggagga tatgttagtt
aagccggtgt tgattgtttg tcctacgtcg 1920gtgctgagca attggggtca
tgaaattaat aagtttgcgc cccaacttaa aaccctattg 1980caccatggcg
atcgccggaa aaaagggcaa ccgttggtta aacaggtcaa agaccagcaa
2040attgtcctca ccagttacgc tttactgcaa cgggatttta gtagtttgaa
attggtggac 2100tggcagggga tcgtgctgga cgaagcccaa aatatcaaaa
atccccaagc taaacagtcc 2160caggcggccc ggcaattgcc agcgggtttt
cgcattgccc tcacggggac tccggtggaa 2220aatcgcctga cggaattgtg
gtcaatttta gaatttttaa atcccggttt cctgggtaat 2280cagagctttt
tccaacggcg ctttgccaat cccatcgaaa aatttggcga tcgccagtcg
2340ttgttaattt tgcggaattt agtgcggccg tttattttgc ggcggttaaa
aaccgaccaa 2400accattattc aagatttacc agaaaaacaa gaaatgaccg
tcttctgtga cctttcccaa 2460gagcaagctg gtttatatca acaattggtg
gaggaatccc tccaggcgat cgccgacagc 2520gaaggcattc aaaggcacgg
tttagtttta accctattaa ccaaactcaa acaggtttgt 2580aaccatcccg
atctattgct gaaaaagccc gccatcaccc acgggcacca gtccggcaag
2640ctaattcgtc tggcggaaat gctggaagaa atcatcagcg aaggcgatcg
ggtgttaatt 2700ttcacccaat ttgccagttg gggtcattta ctcaaaccct
atctggaaaa atactttaac 2760caagaggtgc tctatctcca cgggggcact
ccagcagagc aacggcaagc tctggtggaa 2820cgattccaac aggaccccaa
cagtccctat ttatttatcc tttctctcaa ggctggcggc 2880acagggttga
acctcacgag ggctaaccat gtgttccatg tggaccggtg gtggaatccg
2940gcggtggaaa atcaggctac cgatcgtgct tttcgcattg gccaaactcg
caacgtccag 3000gtgcacaaat ttgtctgtac aggcaccttg gaagaaaaaa
ttaacgccat gatggcggat 3060aaacaacaat tggcagaaca aaccgtggat
gccggggaaa attggctcac ccgcctagac 3120accgataaac tccgtcagtt
gcttaccctc tccgccaccc cggtggatta ccaagccgaa 3180gcgtccgatt
gaacccagct ttcttgtaca aagttggcat gataagaaag cattgcttat
3240caatttgttg caacgaacag gtcactatca gtcaaaataa at
3282301039PRTSynechocystis sp. 30Met Ala Thr Ile His Gly Asn Trp
Gln Pro Ser His Gly Glu Asn Gly 1 5 10 15 Gly Lys Leu Phe Leu Trp
Ala Asp Thr Trp Gly His Pro Leu Pro Glu 20 25 30 Thr Ile Gly Asp
Arg His Pro Phe Ala Leu Asp Leu Pro Asp Leu Leu 35 40 45 Gln Ala
Trp Ser Asn Leu Pro Leu Ala Phe Pro Lys Ala Asp Gly Val 50 55 60
Thr Glu Ala Ala Leu Thr Leu His Leu Pro Ser His Arg Gln Gln Lys 65
70 75 80 Ile Pro Leu Pro Phe Val Thr Gly Gln Asp Pro Val Ala Met
Asp Ala 85 90 95 Lys Tyr Leu His Trp Arg Ser Trp Gln Val Thr Gly
Val Asn Leu Thr 100 105 110 Pro Ser Gln Thr Leu Thr Leu Leu Gln Ser
Ile Pro Leu Gly Gly Gln 115 120 125 Ala Leu Ala Asn Leu Gly Ser Glu
Phe Tyr Phe Tyr Gly Gln Leu His 130 135 140 Arg Trp Cys Leu Asp Leu
Val Leu Arg Gly Lys Phe Val Pro Gly Leu 145 150 155 160 Glu Gln Arg
Gly Glu Asp Gly Asn Tyr Tyr Ala Gln Trp Ile Pro Ile 165 170 175 Leu
Asp Ser Ile Gln Asp Gln Thr His Leu Ala Gln Phe Ser Gln Arg 180 185
190 Val Pro Ala Cys Ala Leu Ala Asn Leu Thr Asp Ser Gln Glu Pro Gln
195 200 205 Met Leu Val Val Asp Leu Leu Gln Lys Leu Leu Gln Ala Gln
Ile Gly 210 215 220 Ala Val Ser Pro Ser Leu Ala Asn Val Lys Glu Val
Trp Leu Asn Asp 225 230 235 240 Trp Leu Arg Gly Leu Thr His Gly Gly
Gln Thr Ser Leu Gly Thr Ser 245 250 255 Lys Ala Leu Gln Arg Leu Ala
Thr Ser Leu Asp His Trp Tyr Leu Pro 260 265 270 Val Gln Asn Tyr Leu
Gly Gln Lys Asn Asn Gln Ala Leu Ala Gln Arg 275 280 285 Gln Trp Arg
Gly Ala Leu Arg Leu Gln Pro Pro Ala Asp Asp Gly Gly 290 295 300 Gly
Thr Trp Gln Leu Asp Tyr Gly Leu Gln Ala Leu Asp Asp Gly Glu 305 310
315 320 Phe Trp Leu Pro Ala Ala Ser Leu Trp Ala Met Ala Gly Asp Arg
Leu 325 330 335 Val Trp Gln Gly Arg Arg Val Asp Gln Gly Ala Glu Ser
Leu Leu Arg 340 345 350 Gly Leu Gly Val Ala Ala Gln Ile Tyr Glu Pro
Ile Ala Ala Ser Leu 355 360 365 Thr Glu Arg Cys Pro Thr Gly Cys Gly
Leu Asp Ala Ile Gln Ala Tyr 370 375 380 Glu Phe Ile Leu Ala Ile Ala
His Gln Leu Arg Asp Arg Gly Leu Gly 385 390 395 400 Val Ile Leu Pro
Pro Gly Leu Glu Arg Gly Gly Thr Ala Lys Arg Leu 405 410 415 Gly Val
Lys Val Val Gly Glu Val Gln Arg Gln Arg Gly Gln Arg Leu 420 425 430
Thr Leu Gln Ser Leu Ile Asn Tyr Asp Leu Gln Leu Met Met Gly Ser 435
440 445 Gly Asp Asn Ala Arg Leu Leu Thr Ala Lys Asp Phe Glu Ala Leu
Leu 450 455 460 Ala Gln Lys Ser Pro Leu Val Val Leu Asp Gly Glu Trp
Ile Thr Leu 465 470 475 480 Gln Pro Ala Asp Val Arg Ala Ala Lys Val
Ile Leu Gln Gln Gln Gln 485 490 495 Ser Ala Pro Pro Leu Thr Val Glu
Asp Ala Leu Arg Leu Ser Ile Gly 500 505 510 Asp Leu Gln Thr Val Ser
Lys Leu Pro Val Thr Gln Phe Ala Ala Arg 515 520 525 Gly Ile Leu Gln
Glu Leu Ile Asp Thr Leu Arg Asn Pro Glu Gly Val 530 535 540 Lys Ala
Ile Ala Asp Pro Pro Gly Phe Gln Gly Thr Leu Arg Pro Tyr 545 550 555
560 Gln Ala Arg Gly Val Gly Trp Leu Ala Phe Leu Glu Arg Trp Gly Leu
565 570 575 Gly Ala Cys Leu Ala Asp Asp Met Gly Leu Gly Lys Thr Pro
Gln Leu 580 585 590 Leu Ala Phe Leu Leu His Leu Ala Ala Glu Asp Met
Leu Val Lys Pro 595 600 605 Val Leu Ile Val Cys Pro Thr Ser Val Leu
Ser Asn Trp Gly His Glu 610 615 620 Ile Asn Lys Phe Ala Pro Gln Leu
Lys Thr Leu Leu His His Gly Asp 625 630 635 640 Arg Arg Lys Lys Gly
Gln Pro Leu Val Lys Gln Val Lys Asp Gln Gln 645 650 655 Ile Val Leu
Thr Ser Tyr Ala Leu Leu Gln Arg Asp Phe Ser Ser Leu 660 665 670 Lys
Leu Val Asp Trp Gln Gly Ile Val Leu Asp Glu Ala Gln Asn Ile 675 680
685 Lys Asn Pro Gln Ala Lys Gln Ser Gln Ala Ala Arg Gln Leu Pro Ala
690 695 700 Gly Phe Arg Ile Ala Leu Thr Gly Thr Pro Val Glu Asn Arg
Leu Thr 705 710 715 720 Glu Leu Trp Ser Ile Leu Glu Phe Leu Asn Pro
Gly Phe Leu Gly Asn 725 730 735 Gln Ser Phe Phe Gln Arg Arg Phe Ala
Asn Pro Ile Glu Lys Phe Gly 740 745 750 Asp Arg Gln Ser Leu Leu Ile
Leu Arg Asn Leu Val Arg Pro Phe Ile 755 760 765 Leu Arg Arg Leu Lys
Thr Asp Gln Thr Ile Ile Gln Asp Leu Pro Glu 770 775 780 Lys Gln Glu
Met Thr Val Phe Cys Asp Leu Ser Gln Glu Gln Ala Gly 785 790 795 800
Leu Tyr Gln Gln Leu Val Glu Glu Ser Leu Gln Ala Ile Ala Asp Ser 805
810 815 Glu Gly Ile Gln Arg His Gly Leu Val Leu Thr Leu Leu Thr Lys
Leu 820 825 830 Lys Gln Val Cys Asn His Pro Asp Leu Leu Leu Lys Lys
Pro Ala Ile 835 840 845 Thr His Gly His Gln Ser Gly Lys Leu Ile Arg
Leu Ala Glu Met Leu 850 855 860 Glu Glu Ile Ile Ser Glu Gly Asp Arg
Val Leu Ile Phe Thr Gln Phe 865 870 875 880 Ala Ser Trp Gly His Leu
Leu Lys Pro Tyr Leu Glu Lys Tyr Phe Asn 885 890 895 Gln Glu Val Leu
Tyr Leu His Gly Gly Thr Pro Ala Glu Gln Arg Gln 900 905 910 Ala Leu
Val Glu Arg Phe Gln Gln Asp Pro Asn Ser Pro Tyr Leu Phe 915 920 925
Ile Leu Ser Leu Lys Ala Gly Gly Thr Gly Leu Asn Leu Thr Arg Ala 930
935 940 Asn His Val Phe His Val Asp Arg Trp Trp Asn Pro Ala Val Glu
Asn 945 950 955 960 Gln Ala Thr Asp Arg Ala Phe Arg Ile Gly Gln
Thr Arg Asn Val Gln 965 970 975 Val His Lys Phe Val Cys Thr Gly Thr
Leu Glu Glu Lys Ile Asn Ala 980 985 990 Met Met Ala Asp Lys Gln Gln
Leu Ala Glu Gln Thr Val Asp Ala Gly 995 1000 1005 Glu Asn Trp Leu
Thr Arg Leu Asp Thr Asp Lys Leu Arg Gln Leu 1010 1015 1020 Leu Thr
Leu Ser Ala Thr Pro Val Asp Tyr Gln Ala Glu Ala Ser 1025 1030 1035
Asp 313237DNAAnaebena variabilis 31atggcaattt tacacggtag ttggatatta
agtgagcagg atagttattt atttatttgg 60ggggaaactt ggcgatcgcc acaagtaaat
tttagttttg aggaaatagc cctcaatccc 120ttggctctgt ctgcatctga
attaagcgag tggttgcagt ctcaacatca ggcgatcgct 180cagattttac
cacaacagtt ggcaaaaaaa acctccaaag cagcaagttc cccaacaaca
240aatttaccaa ttcactcgca aataattgtt ctgccaacgg aaatttctca
acctcgtaag 300aaagaaacaa ttttcatttc tcctgtgcat tctgccgctt
tagaatctga tgcagactct 360gaagtttatt tacaaccttg gcgtgtagaa
ggtttttgtc ttcctcctag tgcagcagtt 420aaatttctaa cttctttacc
tttaaatatc actagcacag agaatgcttt tttaggtgga 480gatttacgtt
tttggtcaca aattgcccgt tggagtttag atttaatttc taggtctaag
540tttctcccaa ttatccaacg acaacctaat aattctgtaa gtgccaaatg
gcaagtactg 600ttagatagtg ctgtagatgg aactcgttta gaaaagttcg
ccgcgaagat gcctttggtt 660tgtcggactt atcagagatt agggaacgag
gaattatctc catctcctat atatatagat 720tttcctagtc agccgcagga
attaatattg ggttttctca atagtgcaat agatacgcaa 780ttacgggaaa
tggtggggaa tcagcctgtg gtggaaactc gcttgatggc atctttaccg
840tcggcggtac gacagtggct gcaagggtta agtggtgcat ctaattcagt
tgatgcagat 900gcagttggtt tggaaaggct ggaagcagcg ctcaaggctt
ggacgatgcc gctacaatat 960caactagcaa gtaaaaatca atttcgcacc
tgttttgaat tacgttctcc agaaccagga 1020gaaactgaat ggacactagc
ctatttcctg caagcagccg ataatccaga atttctagta 1080gatgcgggca
ctatttggca acatcctgtt gaacagctaa tttatcaaca gcgatcgatt
1140caagaacccc aggaaacatt tttacgaggt ttggggttag cttctcgatt
gtatccggtc 1200attgccccca ctttagatac agaatcaccg caattttgtc
atctcaaccc catgcaggct 1260tatgaattta tcaaggctgt ggcttggcga
tttgaagata gcggtttagg ggtgatttta 1320cctcctagtt tggcgaaccg
ggaaggctgg gcaaaccgct tgggattgaa aatctccgcc 1380gaaaccccaa
agaaaaagcc aggacgcttg ggattgcaga gtttgcttaa ttttcaatgg
1440cacttagcaa ttggtgggca aactatttct aaaggggaat ttgacagact
agtagcttta 1500aaaagcccat tggtagaaat aaatggcgaa tgggtggagt
tgcgtcccca agatatcaag 1560acagccgaag ccttttttgc tgcacgtaaa
gaccaaatgg ccttatcttt agaagatgct 1620ttacgtctga gtagtgggga
tactcaagta attgagaaat taccagtagt cagctttgaa 1680gcctctggcg
cattacaaga attaattggg gcgctgacaa ataatcaagc agttgcacca
1740ttacctacgc caaagaactt ccaaggaaag ttgcgtcctt atcaagaaag
gggtgcggct 1800tggttggcat tcctcgaacg ctggggttta ggtgcttgtc
tcgccgacga catgggactg 1860ggaaaaacga tacagttcat tgctttcctt
ctccatctta aagaacagga tgtattagaa 1920aaaccaactt tactagtgtg
tcctacttct gttttaggta actgggaacg agaagtgaaa 1980aaatttgcac
ctacacttaa agttctccaa tatcatggtg ataaacgtcc taaaggtaaa
2040gcttttccag aagcagtaaa aaatcatgat ttagttatca ccagttactc
actaattcat 2100agagacatca aatcattgca gggtctttct tggcagataa
ttgttttaga tgaagcccag 2160aatgtgaaga atgcggaagc caaacaatca
caagcagtcc gacaattaga cacaaccttt 2220cgcattgctt taacggggac
accagtcgaa aatagactac aggaactttg gtcaatttta 2280gatttcctca
accctggtta tttaggtaat aagcaattct tccaaagacg ctttgccatg
2340ccaattgaaa agtatggtga tgcagcatct ttaaatcaat tgcgtgcctt
agtacaacca 2400tttattctgc gtcgcctgaa aacagaccgt gatattattc
aagacttgcc agataagcaa 2460gaaatgacag tattttgcgg tttgactgga
gaacaagctg cactttatca aaaagtggta 2520gaaacatctt tagcagaaat
tgaatcggcc gaaggattgc aacgccgagg gatgatttta 2580gctttattaa
ttaaactcaa acaaatctgc aatcatccag cccaatatct gaaaacaaat
2640accttagaac aatacagttc aggaaaactg caacgattag aagaaatgtt
agaagaggtg 2700ttagcggaga gtaatactta tggtgttgct ggtgcgggac
gtgctttaat cttcacccag 2760tttgcagaat ggggtaagtt actcaaacca
catttagaaa aacaactagg gcgggaagta 2820tttttcttat atggtagtac
cagtaaaaag caacgtgaag aaatgattga ccgttttcaa 2880cacgaccctc
aggggccacc aattatgatt ctctctctca aagcaggtgg tgtagggttg
2940aacttaacca gagcaaatca tgtatttcac tttgatagat ggtggaatcc
agccgtagag 3000aaccaagcca cagaccgcgt atttcgtatt ggtcaaaccc
gcaatgtaca ggtgcataaa 3060tttgtttgca atggtacctt agaagaaaaa
atccacgaca tgattgaaag taaaaaacaa 3120ctagcggaac aggttgttgg
tgcaggcgaa gagtggttaa ctgaattaga tacagatcaa 3180ctccgcaact
tactgatact tgatcgtagt gcagtaattg atgaagaagc agagtaa
3237321078PRTAnaebena variabilis 32Met Ala Ile Leu His Gly Ser Trp
Ile Leu Ser Glu Gln Asp Ser Tyr 1 5 10 15 Leu Phe Ile Trp Gly Glu
Thr Trp Arg Ser Pro Gln Val Asn Phe Ser 20 25 30 Phe Glu Glu Ile
Ala Leu Asn Pro Leu Ala Leu Ser Ala Ser Glu Leu 35 40 45 Ser Glu
Trp Leu Gln Ser Gln His Gln Ala Ile Ala Gln Ile Leu Pro 50 55 60
Gln Gln Leu Ala Lys Lys Thr Ser Lys Ala Ala Ser Ser Pro Thr Thr 65
70 75 80 Asn Leu Pro Ile His Ser Gln Ile Ile Val Leu Pro Thr Glu
Ile Ser 85 90 95 Gln Pro Arg Lys Lys Glu Thr Ile Phe Ile Ser Pro
Val His Ser Ala 100 105 110 Ala Leu Glu Ser Asp Ala Asp Ser Glu Val
Tyr Leu Gln Pro Trp Arg 115 120 125 Val Glu Gly Phe Cys Leu Pro Pro
Ser Ala Ala Val Lys Phe Leu Thr 130 135 140 Ser Leu Pro Leu Asn Ile
Thr Ser Thr Glu Asn Ala Phe Leu Gly Gly 145 150 155 160 Asp Leu Arg
Phe Trp Ser Gln Ile Ala Arg Trp Ser Leu Asp Leu Ile 165 170 175 Ser
Arg Ser Lys Phe Leu Pro Ile Ile Gln Arg Gln Pro Asn Asn Ser 180 185
190 Val Ser Ala Lys Trp Gln Val Leu Leu Asp Ser Ala Val Asp Gly Thr
195 200 205 Arg Leu Glu Lys Phe Ala Ala Lys Met Pro Leu Val Cys Arg
Thr Tyr 210 215 220 Gln Arg Leu Gly Asn Glu Glu Leu Ser Pro Ser Pro
Ile Tyr Ile Asp 225 230 235 240 Phe Pro Ser Gln Pro Gln Glu Leu Ile
Leu Gly Phe Leu Asn Ser Ala 245 250 255 Ile Asp Thr Gln Leu Arg Glu
Met Val Gly Asn Gln Pro Val Val Glu 260 265 270 Thr Arg Leu Met Ala
Ser Leu Pro Ser Ala Val Arg Gln Trp Leu Gln 275 280 285 Gly Leu Ser
Gly Ala Ser Asn Ser Val Asp Ala Asp Ala Val Gly Leu 290 295 300 Glu
Arg Leu Glu Ala Ala Leu Lys Ala Trp Thr Met Pro Leu Gln Tyr 305 310
315 320 Gln Leu Ala Ser Lys Asn Gln Phe Arg Thr Cys Phe Glu Leu Arg
Ser 325 330 335 Pro Glu Pro Gly Glu Thr Glu Trp Thr Leu Ala Tyr Phe
Leu Gln Ala 340 345 350 Ala Asp Asn Pro Glu Phe Leu Val Asp Ala Gly
Thr Ile Trp Gln His 355 360 365 Pro Val Glu Gln Leu Ile Tyr Gln Gln
Arg Ser Ile Gln Glu Pro Gln 370 375 380 Glu Thr Phe Leu Arg Gly Leu
Gly Leu Ala Ser Arg Leu Tyr Pro Val 385 390 395 400 Ile Ala Pro Thr
Leu Asp Thr Glu Ser Pro Gln Phe Cys His Leu Asn 405 410 415 Pro Met
Gln Ala Tyr Glu Phe Ile Lys Ala Val Ala Trp Arg Phe Glu 420 425 430
Asp Ser Gly Leu Gly Val Ile Leu Pro Pro Ser Leu Ala Asn Arg Glu 435
440 445 Gly Trp Ala Asn Arg Leu Gly Leu Lys Ile Ser Ala Glu Thr Pro
Lys 450 455 460 Lys Lys Pro Gly Arg Leu Gly Leu Gln Ser Leu Leu Asn
Phe Gln Trp 465 470 475 480 His Leu Ala Ile Gly Gly Gln Thr Ile Ser
Lys Gly Glu Phe Asp Arg 485 490 495 Leu Val Ala Leu Lys Ser Pro Leu
Val Glu Ile Asn Gly Glu Trp Val 500 505 510 Glu Leu Arg Pro Gln Asp
Ile Lys Thr Ala Glu Ala Phe Phe Ala Ala 515 520 525 Arg Lys Asp Gln
Met Ala Leu Ser Leu Glu Asp Ala Leu Arg Leu Ser 530 535 540 Ser Gly
Asp Thr Gln Val Ile Glu Lys Leu Pro Val Val Ser Phe Glu 545 550 555
560 Ala Ser Gly Ala Leu Gln Glu Leu Ile Gly Ala Leu Thr Asn Asn Gln
565 570 575 Ala Val Ala Pro Leu Pro Thr Pro Lys Asn Phe Gln Gly Lys
Leu Arg 580 585 590 Pro Tyr Gln Glu Arg Gly Ala Ala Trp Leu Ala Phe
Leu Glu Arg Trp 595 600 605 Gly Leu Gly Ala Cys Leu Ala Asp Asp Met
Gly Leu Gly Lys Thr Ile 610 615 620 Gln Phe Ile Ala Phe Leu Leu His
Leu Lys Glu Gln Asp Val Leu Glu 625 630 635 640 Lys Pro Thr Leu Leu
Val Cys Pro Thr Ser Val Leu Gly Asn Trp Glu 645 650 655 Arg Glu Val
Lys Lys Phe Ala Pro Thr Leu Lys Val Leu Gln Tyr His 660 665 670 Gly
Asp Lys Arg Pro Lys Gly Lys Ala Phe Pro Glu Ala Val Lys Asn 675 680
685 His Asp Leu Val Ile Thr Ser Tyr Ser Leu Ile His Arg Asp Ile Lys
690 695 700 Ser Leu Gln Gly Leu Ser Trp Gln Ile Ile Val Leu Asp Glu
Ala Gln 705 710 715 720 Asn Val Lys Asn Ala Glu Ala Lys Gln Ser Gln
Ala Val Arg Gln Leu 725 730 735 Asp Thr Thr Phe Arg Ile Ala Leu Thr
Gly Thr Pro Val Glu Asn Arg 740 745 750 Leu Gln Glu Leu Trp Ser Ile
Leu Asp Phe Leu Asn Pro Gly Tyr Leu 755 760 765 Gly Asn Lys Gln Phe
Phe Gln Arg Arg Phe Ala Met Pro Ile Glu Lys 770 775 780 Tyr Gly Asp
Ala Ala Ser Leu Asn Gln Leu Arg Ala Leu Val Gln Pro 785 790 795 800
Phe Ile Leu Arg Arg Leu Lys Thr Asp Arg Asp Ile Ile Gln Asp Leu 805
810 815 Pro Asp Lys Gln Glu Met Thr Val Phe Cys Gly Leu Thr Gly Glu
Gln 820 825 830 Ala Ala Leu Tyr Gln Lys Val Val Glu Thr Ser Leu Ala
Glu Ile Glu 835 840 845 Ser Ala Glu Gly Leu Gln Arg Arg Gly Met Ile
Leu Ala Leu Leu Ile 850 855 860 Lys Leu Lys Gln Ile Cys Asn His Pro
Ala Gln Tyr Leu Lys Thr Asn 865 870 875 880 Thr Leu Glu Gln Tyr Ser
Ser Gly Lys Leu Gln Arg Leu Glu Glu Met 885 890 895 Leu Glu Glu Val
Leu Ala Glu Ser Asn Thr Tyr Gly Val Ala Gly Ala 900 905 910 Gly Arg
Ala Leu Ile Phe Thr Gln Phe Ala Glu Trp Gly Lys Leu Leu 915 920 925
Lys Pro His Leu Glu Lys Gln Leu Gly Arg Glu Val Phe Phe Leu Tyr 930
935 940 Gly Ser Thr Ser Lys Lys Gln Arg Glu Glu Met Ile Asp Arg Phe
Gln 945 950 955 960 His Asp Pro Gln Gly Pro Pro Ile Met Ile Leu Ser
Leu Lys Ala Gly 965 970 975 Gly Val Gly Leu Asn Leu Thr Arg Ala Asn
His Val Phe His Phe Asp 980 985 990 Arg Trp Trp Asn Pro Ala Val Glu
Asn Gln Ala Thr Asp Arg Val Phe 995 1000 1005 Arg Ile Gly Gln Thr
Arg Asn Val Gln Val His Lys Phe Val Cys 1010 1015 1020 Asn Gly Thr
Leu Glu Glu Lys Ile His Asp Met Ile Glu Ser Lys 1025 1030 1035 Lys
Gln Leu Ala Glu Gln Val Val Gly Ala Gly Glu Glu Trp Leu 1040 1045
1050 Thr Glu Leu Asp Thr Asp Gln Leu Arg Asn Leu Leu Ile Leu Asp
1055 1060 1065 Arg Ser Ala Val Ile Asp Glu Glu Ala Glu 1070 1075
333129DNAmethanogenic archaeonuncultured methanogenic archaeon
33atgattacac ttcacggaac ctggactact gtcgatcccc tgaatggcac atttttcctc
60tggggagaga gtgatccggc cacgcagcat aaaagaagag gcaggcctcg gaaaagtgca
120ggggagaaac agcacccgtt tcacgccggc atcaaagagc tggaagctgg
agcgggggct 180atcaattcat cgtgtataag acatatagca gatgcgggag
cacgggcgga gcaggtttta 240attttgccgt cagctacgga caggcccctg
agatctgcga gcccttcagc actggagtca 300ggtgaagaaa ccaaccctga
cagcagttta caatttcttc cgtggacggt gaccggcatc 360aacattaagc
ccgggaatgc tctggtactt ctatcctcta tagccgaatc acaaaagcgg
420atcggagata tggcgatagg cccagacctg ctttactgga gtaaggtagc
caagtttacg 480cttaagctcc tgataagcca gcagttcagg ccggaggttg
tcgaagtaat gagcggaaaa 540gcatatagcc gttggagatt tgcgctcacc
gatgaaactg accggaaaca ctatgcctcg 600ctcgaaaact ccatgccgct
ggcatgtatt gcggtttcag gaaaggctgg catttataat 660cgaaaagaag
ccttagattt gttcattaat accgcccttg acacatttat ccgggaccag
720attgccctgc ccgctgacag caggatgacg aacctgctat cgcaagcatg
gctagattcg 780ctcggcaccg gagagagtat ccgcctgtcg gctcctgaga
tgaagaaact caaagattcg 840gcaggccgct ggacatcccg catgaaaaca
gagagcaaac aagctttaaa gacctgcttc 900atcctggagc cgccagcccc
ggatacagag tatcctgaag cgccgtggaa cctacggtac 960tgcttgcagg
catccgatga ccccagtctg gtaattccgg ctgagactgt gtggaaagag
1020ttgaagaaga cgctgaagta cctgaataag agatacgata accctcagga
gcaattgtta 1080caggatctcg gaaaagcgat gcagatgttt cccgaaatcg
agcccagcct caacacgtca 1140aaacctctgt ccgcaacgct gagcaccagt
gaagcctaca agttcctgac agaagcggcg 1200cctctgctgc aggacagcgg
gtatagcatt atcctaccgg aatggtggcg caacagcact 1260ggcaggctca
agctcggcgc caggcttcgc ttcaagccga aagccgaagg taaagcgggt
1320aaaagccagt tcaccatgga taccctcgtc agctacgact ggcgcctggc
gctgggcgat 1380caggagatca ccgaaacaga gttcaggaag ctggcagccc
tgaaagagcc gcttctgcag 1440ataggcggga aatggtttgc gctgaaaaag
gaagacatag acagcatcat gaaagcattc 1500agggcgaaga agactggaga
gatggcttta tcggaggcac tgcgcctcaa cggcgggctg 1560gaagacttca
acggcatccc cgtcagcggc atgaaatcgt caggatggct ggcagaactt
1620ttcgacaggc tggcagccgg cgaaaaaata acgagccttg ccccgccgga
cggtttcaac 1680ggggagctta gagattacca ggttaaaggc tactcctggc
tggccttcat gaaaaagtat 1740ggcctgggct ccattctggc tgacgacatg
ggcctgggta agacgataca gctgctggcg 1800ttgctcctga aagagaagga
aagaggcact aaaggcccta ctctgttgat ctgccccacc 1860tcgattctcg
gaaactggca gcgggaggcg aagaaatttg ccccggccct gaaagtccac
1920atacaccatg gggcaggaag ggctgataaa gagcagttcg gaaaaatcgt
caaggctcac 1980gacctgatcc tgagcactta cgctcacgcc taccgggacg
aggaactgct taaagaggtg 2040aactggaagc tggtagtgct cgacgaggct
cagaatatca agaatcatca tacccggcag 2100gccagagcta tccgggctct
taaggccgat caccgaatag ccatgacggg aacgccgata 2160gagaacagac
tctcggagct gtggtcgatc gtggacttcc tgaaccccgg ctacctgggc
2220aaggcggaga cattcaggaa acaattcgcc atacctatcg agagatacga
tgacgctgcc 2280cggtcggaaa aattgaagca ggccatcaag cccctggtgc
tgcgcagagt gaagacggat 2340ccggccatca tcaaagacct gccggacaag
atcgagatca aggagccctg caacctcacc 2400aaagaacagg ccacgctcta
cgaggccatc gtagagaaca tgctgaaaag tatagataag 2460gccacggcaa
tgcagagacg gggaatcgtc ttagcgtccc tgatgaagct caaacaggtc
2520tgcgatcacc cgtcgctgta catcaaaacg ggcgctgtga ccgacgataa
gacgctgatc 2580aggtctggca agctgaagcg cctcacggag ctgctcgaag
aagcgctggc cgaaggcgac 2640agcgtgctga tcttcaccca gttcgtggaa
atgggggaga tgctgaaagc ctacctgcag 2700agcacgttcg acgaagaagc
cctctttttg cacggcggag taccgcagaa ggccagagac 2760aagatggtcc
tccgtttcgg ggaaaaggac gggccacgga tctttatcgt ctcgctgaaa
2820gccggcggcg tcggcctcaa cctgacgaag gcaagccacg tgttccactt
cgatcgctgg 2880tggaacccgg cggtcgagaa ccaggcgaca gatcgagctt
acaggatagg ccagagcaaa 2940aatgtactgg tccataaatt cgtctgcgcc
ggcacgctgg aagaaaagat cgacgagctg 3000atcgagagca aaaaggcgct
gtcggcgaac atcctcggca cgggagaaga ctggatcacg 3060gagttgtcga
ccgaacagct gagggacatg gtcatgctga gatgggacga ggtagccgat
3120gatggctaa 3129341042PRTmethanogenic archaeonuncultured
methanogenic archaeon 34Met Ile Thr Leu His Gly Thr Trp Thr Thr Val
Asp Pro Leu Asn Gly 1 5 10 15 Thr Phe Phe Leu Trp Gly Glu Ser Asp
Pro Ala Thr Gln His Lys Arg 20 25 30 Arg Gly Arg Pro Arg Lys Ser
Ala Gly Glu Lys Gln His Pro Phe His 35 40 45 Ala Gly Ile Lys Glu
Leu Glu Ala Gly Ala Gly Ala Ile Asn Ser Ser 50 55 60 Cys Ile Arg
His Ile Ala Asp Ala Gly Ala Arg Ala Glu Gln Val Leu 65 70 75 80 Ile
Leu Pro Ser Ala Thr Asp Arg Pro Leu Arg Ser Ala Ser Pro Ser 85 90
95 Ala Leu Glu Ser Gly Glu Glu Thr Asn Pro Asp Ser Ser Leu Gln Phe
100 105 110 Leu Pro Trp Thr Val Thr Gly Ile Asn Ile Lys Pro Gly Asn
Ala Leu 115 120 125 Val Leu Leu Ser Ser Ile Ala Glu Ser Gln Lys Arg
Ile Gly Asp Met 130
135 140 Ala Ile Gly Pro Asp Leu Leu Tyr Trp Ser Lys Val Ala Lys Phe
Thr 145 150 155 160 Leu Lys Leu Leu Ile Ser Gln Gln Phe Arg Pro Glu
Val Val Glu Val 165 170 175 Met Ser Gly Lys Ala Tyr Ser Arg Trp Arg
Phe Ala Leu Thr Asp Glu 180 185 190 Thr Asp Arg Lys His Tyr Ala Ser
Leu Glu Asn Ser Met Pro Leu Ala 195 200 205 Cys Ile Ala Val Ser Gly
Lys Ala Gly Ile Tyr Asn Arg Lys Glu Ala 210 215 220 Leu Asp Leu Phe
Ile Asn Thr Ala Leu Asp Thr Phe Ile Arg Asp Gln 225 230 235 240 Ile
Ala Leu Pro Ala Asp Ser Arg Met Thr Asn Leu Leu Ser Gln Ala 245 250
255 Trp Leu Asp Ser Leu Gly Thr Gly Glu Ser Ile Arg Leu Ser Ala Pro
260 265 270 Glu Met Lys Lys Leu Lys Asp Ser Ala Gly Arg Trp Thr Ser
Arg Met 275 280 285 Lys Thr Glu Ser Lys Gln Ala Leu Lys Thr Cys Phe
Ile Leu Glu Pro 290 295 300 Pro Ala Pro Asp Thr Glu Tyr Pro Glu Ala
Pro Trp Asn Leu Arg Tyr 305 310 315 320 Cys Leu Gln Ala Ser Asp Asp
Pro Ser Leu Val Ile Pro Ala Glu Thr 325 330 335 Val Trp Lys Glu Leu
Lys Lys Thr Leu Lys Tyr Leu Asn Lys Arg Tyr 340 345 350 Asp Asn Pro
Gln Glu Gln Leu Leu Gln Asp Leu Gly Lys Ala Met Gln 355 360 365 Met
Phe Pro Glu Ile Glu Pro Ser Leu Asn Thr Ser Lys Pro Leu Ser 370 375
380 Ala Thr Leu Ser Thr Ser Glu Ala Tyr Lys Phe Leu Thr Glu Ala Ala
385 390 395 400 Pro Leu Leu Gln Asp Ser Gly Tyr Ser Ile Ile Leu Pro
Glu Trp Trp 405 410 415 Arg Asn Ser Thr Gly Arg Leu Lys Leu Gly Ala
Arg Leu Arg Phe Lys 420 425 430 Pro Lys Ala Glu Gly Lys Ala Gly Lys
Ser Gln Phe Thr Met Asp Thr 435 440 445 Leu Val Ser Tyr Asp Trp Arg
Leu Ala Leu Gly Asp Gln Glu Ile Thr 450 455 460 Glu Thr Glu Phe Arg
Lys Leu Ala Ala Leu Lys Glu Pro Leu Leu Gln 465 470 475 480 Ile Gly
Gly Lys Trp Phe Ala Leu Lys Lys Glu Asp Ile Asp Ser Ile 485 490 495
Met Lys Ala Phe Arg Ala Lys Lys Thr Gly Glu Met Ala Leu Ser Glu 500
505 510 Ala Leu Arg Leu Asn Gly Gly Leu Glu Asp Phe Asn Gly Ile Pro
Val 515 520 525 Ser Gly Met Lys Ser Ser Gly Trp Leu Ala Glu Leu Phe
Asp Arg Leu 530 535 540 Ala Ala Gly Glu Lys Ile Thr Ser Leu Ala Pro
Pro Asp Gly Phe Asn 545 550 555 560 Gly Glu Leu Arg Asp Tyr Gln Val
Lys Gly Tyr Ser Trp Leu Ala Phe 565 570 575 Met Lys Lys Tyr Gly Leu
Gly Ser Ile Leu Ala Asp Asp Met Gly Leu 580 585 590 Gly Lys Thr Ile
Gln Leu Leu Ala Leu Leu Leu Lys Glu Lys Glu Arg 595 600 605 Gly Thr
Lys Gly Pro Thr Leu Leu Ile Cys Pro Thr Ser Ile Leu Gly 610 615 620
Asn Trp Gln Arg Glu Ala Lys Lys Phe Ala Pro Ala Leu Lys Val His 625
630 635 640 Ile His His Gly Ala Gly Arg Ala Asp Lys Glu Gln Phe Gly
Lys Ile 645 650 655 Val Lys Ala His Asp Leu Ile Leu Ser Thr Tyr Ala
His Ala Tyr Arg 660 665 670 Asp Glu Glu Leu Leu Lys Glu Val Asn Trp
Lys Leu Val Val Leu Asp 675 680 685 Glu Ala Gln Asn Ile Lys Asn His
His Thr Arg Gln Ala Arg Ala Ile 690 695 700 Arg Ala Leu Lys Ala Asp
His Arg Ile Ala Met Thr Gly Thr Pro Ile 705 710 715 720 Glu Asn Arg
Leu Ser Glu Leu Trp Ser Ile Val Asp Phe Leu Asn Pro 725 730 735 Gly
Tyr Leu Gly Lys Ala Glu Thr Phe Arg Lys Gln Phe Ala Ile Pro 740 745
750 Ile Glu Arg Tyr Asp Asp Ala Ala Arg Ser Glu Lys Leu Lys Gln Ala
755 760 765 Ile Lys Pro Leu Val Leu Arg Arg Val Lys Thr Asp Pro Ala
Ile Ile 770 775 780 Lys Asp Leu Pro Asp Lys Ile Glu Ile Lys Glu Pro
Cys Asn Leu Thr 785 790 795 800 Lys Glu Gln Ala Thr Leu Tyr Glu Ala
Ile Val Glu Asn Met Leu Lys 805 810 815 Ser Ile Asp Lys Ala Thr Ala
Met Gln Arg Arg Gly Ile Val Leu Ala 820 825 830 Ser Leu Met Lys Leu
Lys Gln Val Cys Asp His Pro Ser Leu Tyr Ile 835 840 845 Lys Thr Gly
Ala Val Thr Asp Asp Lys Thr Leu Ile Arg Ser Gly Lys 850 855 860 Leu
Lys Arg Leu Thr Glu Leu Leu Glu Glu Ala Leu Ala Glu Gly Asp 865 870
875 880 Ser Val Leu Ile Phe Thr Gln Phe Val Glu Met Gly Glu Met Leu
Lys 885 890 895 Ala Tyr Leu Gln Ser Thr Phe Asp Glu Glu Ala Leu Phe
Leu His Gly 900 905 910 Gly Val Pro Gln Lys Ala Arg Asp Lys Met Val
Leu Arg Phe Gly Glu 915 920 925 Lys Asp Gly Pro Arg Ile Phe Ile Val
Ser Leu Lys Ala Gly Gly Val 930 935 940 Gly Leu Asn Leu Thr Lys Ala
Ser His Val Phe His Phe Asp Arg Trp 945 950 955 960 Trp Asn Pro Ala
Val Glu Asn Gln Ala Thr Asp Arg Ala Tyr Arg Ile 965 970 975 Gly Gln
Ser Lys Asn Val Leu Val His Lys Phe Val Cys Ala Gly Thr 980 985 990
Leu Glu Glu Lys Ile Asp Glu Leu Ile Glu Ser Lys Lys Ala Leu Ser 995
1000 1005 Ala Asn Ile Leu Gly Thr Gly Glu Asp Trp Ile Thr Glu Leu
Ser 1010 1015 1020 Thr Glu Gln Leu Arg Asp Met Val Met Leu Arg Trp
Asp Glu Val 1025 1030 1035 Ala Asp Asp Gly 1040 352757DNABacillus
cereus 35atgatcaatc aaactgaagt aacaattagg ctccagcacg ttagtcacgg
ttggttcctt 60tggggagaag atgatagcgg tactccatta tccgtaacaa gttggaaacg
aaatgcattt 120acatggcact ccacttcctt ctacggcacg tttctaaaag
aagcaagctt tgaaggaaga 180caaggtgtta tgctaacaaa cgcacaagca
tttgaataca tcgcgaataa accgatgaac 240tcctttgccc gtattcaaat
gaacggccct attacagcac ttacggaaga tgcgaacgaa 300ttgtgggatg
ccttcacaag cggtagcttc gtacctgata tggagcgttg gcctaaacaa
360ccatcttgga aagttcaaaa tactccaatc gaagatgaaa cattggcatc
tcttttctcg 420gctgcagtaa atgaaagcat attacaagat aaccgttcaa
atgacggatg ggaagatgca 480aagagacttt atgaacatta cgactttacg
aaaagacaat tagacgcagc actacatgaa 540gaagattggc ttcgaaaaat
tggttacatt gaagatgacc ttccctttac aatcggacta 600cgactacaag
agccgcaaga agaatttgaa atgtggaagc ttgaaacaat tgttacgcca
660aagcgcgggg cacatcgcat atatgtatat gagagtatcg attctttacc
aaaacgatgg 720cacgattatg aagaacgtat tctggaaaca caagaaagct
tcagtaagct cgtaccgtgg 780ctaaaagatg gtgatacatt ccgaagtgaa
ctctttgaaa cagaagcgtg gaacttctta 840acagaagcaa gtaacgaatt
actcgccgca ggtattacaa tcttattacc atcgtggtgg 900caaaatttaa
aagcgacaaa accaaaatta cgtgtgcaac tgaagcaaaa tgctacacaa
960acgcaatctt tcttcggcat gaatacactc gttaattttg actggcgcat
ttcaacgaac 1020ggcattgatt tatcagaaag cgaatttttt gaactcgttg
aacaaaacaa gcggttattc 1080aatataaatg gtcaatggat gcgactagat
ccagccttta ttgaagaagt acgaaagctc 1140atgaatcgtg ctgataagta
tggacttgaa atgaaagatg tcctgcagca acatttatca 1200aacacggctg
aaacagaaat tgtagaagag gatagtccgt ttacagatat tgaaattgaa
1260ctagatggat attatgaaga cttattccaa aaactattgc acattggaga
tattccgaaa 1320gtagatgtcc cttcatcact aaacgccaca ctccgtccgt
atcaacaaca tggcattgag 1380tggttattat atttaagaaa gcttggattc
ggcgcattgt tagctgacga catgggactt 1440ggaaagagta ttcaaacgat
cacttactta ctatatataa aagaaaacaa tctccaaaca 1500ggtcctgctt
taatcgtggc tccgacatct gttcttggaa attggcaaaa agaatttgag
1560cgtttcgcac cgaatttacg tgttcagtta cattatggaa gtaaccgagc
taaaggggaa 1620ccctttaaag atttccttca atcagcagat gttgtattaa
catcttatgc attagctcag 1680cttgatgagg aagaacttag tacgttatgc
tgggatgctg ttattttgga tgaagcacaa 1740aatattaaaa acccacatac
gaaacagtct aaagcagtac gaaacttaca agcaaatcac 1800aaaatcgcat
taactgggac accgatggaa aaccgccttg ccgagctttg gtctattttc
1860gacttcatta atcatggata tcttggcagc ttaggacaat tccagcgccg
cttcgtctca 1920ccaattgaaa aggaccgtga cgaaggaaaa atccaacaag
ttcaacgttt tatctcaccg 1980tttttactgc gtcgtacgaa gaaagatcaa
acagtcgcat taaacttacc agataaacaa 2040gaacagaaag cttactgtcc
actaactggt gaacaagctt ccttatatga acaacttgtt 2100caagatacgt
tgcaaaatgt agaaggatta agcggaattg aacgacgcgg atttatatta
2160ctcatgctga acaaacttaa acaaatttgt aatcatcccg ctctttattt
aaaagaaaca 2220gaaccgaaag acatcatcga gcgttccatg aaaacgagca
cgctcatgga actcattgaa 2280aatataaaag atcaaaatga aagttgctta
atcttcacgc aatacatcgg tatggggaac 2340atgctaaaag atgtgttaga
agaacatttc ggtcagcgcg tcctcttctt aaacggtagt 2400gtaccgaaga
aagaacgtga caaaatgatc gaacagttcc aaaacggaac gtatgacatc
2460ttcattttat cgttaaaagc aggtggtaca ggattaaact taacagctgc
caaccatgtc 2520attcactacg atcgttggtg gaatccagcg gtagaaaacc
aagcaacaga ccgtgcatat 2580cgcattggtc aaaagcgctt cgttcacgtt
cataaactga ttacaacggg gacacttgaa 2640gagaaaatcg atgaaatgtt
agaaagaaaa caatcattaa acaacgccgt cattacaagc 2700gatagttgga
tgacagaact atctacagat gaactaaaag aattacttgg tgtataa
275736918PRTBacillus cereus 36Met Ile Asn Gln Thr Glu Val Thr Ile
Arg Leu Gln His Val Ser His 1 5 10 15 Gly Trp Phe Leu Trp Gly Glu
Asp Asp Ser Gly Thr Pro Leu Ser Val 20 25 30 Thr Ser Trp Lys Arg
Asn Ala Phe Thr Trp His Ser Thr Ser Phe Tyr 35 40 45 Gly Thr Phe
Leu Lys Glu Ala Ser Phe Glu Gly Arg Gln Gly Val Met 50 55 60 Leu
Thr Asn Ala Gln Ala Phe Glu Tyr Ile Ala Asn Lys Pro Met Asn 65 70
75 80 Ser Phe Ala Arg Ile Gln Met Asn Gly Pro Ile Thr Ala Leu Thr
Glu 85 90 95 Asp Ala Asn Glu Leu Trp Asp Ala Phe Thr Ser Gly Ser
Phe Val Pro 100 105 110 Asp Met Glu Arg Trp Pro Lys Gln Pro Ser Trp
Lys Val Gln Asn Thr 115 120 125 Pro Ile Glu Asp Glu Thr Leu Ala Ser
Leu Phe Ser Ala Ala Val Asn 130 135 140 Glu Ser Ile Leu Gln Asp Asn
Arg Ser Asn Asp Gly Trp Glu Asp Ala 145 150 155 160 Lys Arg Leu Tyr
Glu His Tyr Asp Phe Thr Lys Arg Gln Leu Asp Ala 165 170 175 Ala Leu
His Glu Glu Asp Trp Leu Arg Lys Ile Gly Tyr Ile Glu Asp 180 185 190
Asp Leu Pro Phe Thr Ile Gly Leu Arg Leu Gln Glu Pro Gln Glu Glu 195
200 205 Phe Glu Met Trp Lys Leu Glu Thr Ile Val Thr Pro Lys Arg Gly
Ala 210 215 220 His Arg Ile Tyr Val Tyr Glu Ser Ile Asp Ser Leu Pro
Lys Arg Trp 225 230 235 240 His Asp Tyr Glu Glu Arg Ile Leu Glu Thr
Gln Glu Ser Phe Ser Lys 245 250 255 Leu Val Pro Trp Leu Lys Asp Gly
Asp Thr Phe Arg Ser Glu Leu Phe 260 265 270 Glu Thr Glu Ala Trp Asn
Phe Leu Thr Glu Ala Ser Asn Glu Leu Leu 275 280 285 Ala Ala Gly Ile
Thr Ile Leu Leu Pro Ser Trp Trp Gln Asn Leu Lys 290 295 300 Ala Thr
Lys Pro Lys Leu Arg Val Gln Leu Lys Gln Asn Ala Thr Gln 305 310 315
320 Thr Gln Ser Phe Phe Gly Met Asn Thr Leu Val Asn Phe Asp Trp Arg
325 330 335 Ile Ser Thr Asn Gly Ile Asp Leu Ser Glu Ser Glu Phe Phe
Glu Leu 340 345 350 Val Glu Gln Asn Lys Arg Leu Phe Asn Ile Asn Gly
Gln Trp Met Arg 355 360 365 Leu Asp Pro Ala Phe Ile Glu Glu Val Arg
Lys Leu Met Asn Arg Ala 370 375 380 Asp Lys Tyr Gly Leu Glu Met Lys
Asp Val Leu Gln Gln His Leu Ser 385 390 395 400 Asn Thr Ala Glu Thr
Glu Ile Val Glu Glu Asp Ser Pro Phe Thr Asp 405 410 415 Ile Glu Ile
Glu Leu Asp Gly Tyr Tyr Glu Asp Leu Phe Gln Lys Leu 420 425 430 Leu
His Ile Gly Asp Ile Pro Lys Val Asp Val Pro Ser Ser Leu Asn 435 440
445 Ala Thr Leu Arg Pro Tyr Gln Gln His Gly Ile Glu Trp Leu Leu Tyr
450 455 460 Leu Arg Lys Leu Gly Phe Gly Ala Leu Leu Ala Asp Asp Met
Gly Leu 465 470 475 480 Gly Lys Ser Ile Gln Thr Ile Thr Tyr Leu Leu
Tyr Ile Lys Glu Asn 485 490 495 Asn Leu Gln Thr Gly Pro Ala Leu Ile
Val Ala Pro Thr Ser Val Leu 500 505 510 Gly Asn Trp Gln Lys Glu Phe
Glu Arg Phe Ala Pro Asn Leu Arg Val 515 520 525 Gln Leu His Tyr Gly
Ser Asn Arg Ala Lys Gly Glu Pro Phe Lys Asp 530 535 540 Phe Leu Gln
Ser Ala Asp Val Val Leu Thr Ser Tyr Ala Leu Ala Gln 545 550 555 560
Leu Asp Glu Glu Glu Leu Ser Thr Leu Cys Trp Asp Ala Val Ile Leu 565
570 575 Asp Glu Ala Gln Asn Ile Lys Asn Pro His Thr Lys Gln Ser Lys
Ala 580 585 590 Val Arg Asn Leu Gln Ala Asn His Lys Ile Ala Leu Thr
Gly Thr Pro 595 600 605 Met Glu Asn Arg Leu Ala Glu Leu Trp Ser Ile
Phe Asp Phe Ile Asn 610 615 620 His Gly Tyr Leu Gly Ser Leu Gly Gln
Phe Gln Arg Arg Phe Val Ser 625 630 635 640 Pro Ile Glu Lys Asp Arg
Asp Glu Gly Lys Ile Gln Gln Val Gln Arg 645 650 655 Phe Ile Ser Pro
Phe Leu Leu Arg Arg Thr Lys Lys Asp Gln Thr Val 660 665 670 Ala Leu
Asn Leu Pro Asp Lys Gln Glu Gln Lys Ala Tyr Cys Pro Leu 675 680 685
Thr Gly Glu Gln Ala Ser Leu Tyr Glu Gln Leu Val Gln Asp Thr Leu 690
695 700 Gln Asn Val Glu Gly Leu Ser Gly Ile Glu Arg Arg Gly Phe Ile
Leu 705 710 715 720 Leu Met Leu Asn Lys Leu Lys Gln Ile Cys Asn His
Pro Ala Leu Tyr 725 730 735 Leu Lys Glu Thr Glu Pro Lys Asp Ile Ile
Glu Arg Ser Met Lys Thr 740 745 750 Ser Thr Leu Met Glu Leu Ile Glu
Asn Ile Lys Asp Gln Asn Glu Ser 755 760 765 Cys Leu Ile Phe Thr Gln
Tyr Ile Gly Met Gly Asn Met Leu Lys Asp 770 775 780 Val Leu Glu Glu
His Phe Gly Gln Arg Val Leu Phe Leu Asn Gly Ser 785 790 795 800 Val
Pro Lys Lys Glu Arg Asp Lys Met Ile Glu Gln Phe Gln Asn Gly 805 810
815 Thr Tyr Asp Ile Phe Ile Leu Ser Leu Lys Ala Gly Gly Thr Gly Leu
820 825 830 Asn Leu Thr Ala Ala Asn His Val Ile His Tyr Asp Arg Trp
Trp Asn 835 840 845 Pro Ala Val Glu Asn Gln Ala Thr Asp Arg Ala Tyr
Arg Ile Gly Gln 850 855 860 Lys Arg Phe Val His Val His Lys Leu Ile
Thr Thr Gly Thr Leu Glu 865 870 875 880 Glu Lys Ile Asp Glu Met Leu
Glu Arg Lys Gln Ser Leu Asn Asn Ala 885 890 895 Val Ile Thr Ser Asp
Ser Trp Met Thr Glu Leu Ser Thr Asp Glu Leu 900 905 910 Lys Glu Leu
Leu Gly Val 915 373141DNACrocosphaera watsonii 37atgacaatat
tacatggaac ttggattgaa aatacctctg aaaaacattt ttttatttgg 60ggggaaactt
ggcgttcttt atcctctgat atttcctcag atgattctat tttaatgtat
120ccattttctg tagataaaca gggaattatt gaacaattaa actcgaataa
gattaagatt 180gaaaaaaaca aaaatattga atctgtttct caaatatttt
atttgcctag taaatttatt 240gctaaatcga agcaaagtat ccctttacta
tcaacagaat taaaagataa agattttgaa 300caaggggata ttcagttaat
tgcttggaaa atcgaaggga taaaattaaa tgttgatgat 360acaattaata
ttttaagtca gttaccgttg ggattaacca ataatgacga aaattacata
420ggcgataatt taaaattttg gacacatatt tatcgttgga gtctagattt
attaactaga 480ggtaaatatt taccgcaaat ggaagaacaa gataataact
gttatggaca atgggaacct 540ttactagata gtttagttga tcagcaacgg
ttctctaaat ttatacaaac tatgccaaat 600agttctcttg cttatcataa
tttaatggag ggtgaattat cctcttcttt actcaaacaa 660actactattc
ttgatttttt atctactatc attaatcaac aagtacgtca atttattgat
720gttgctatta cccctagttc atttatccaa aagtggttat actctttaac
acaagactta 780tctaaatttg aagcatcaga agttgaaaga aagggattaa
agaatgctat taataattgg 840aaatcttctt taagtgaata tattataaag
tctgataatc aaccattagg aattaaccag 900tttcgtgttt gttttaaact
agaaaatcca gctaaaagtg gtaagaaatt agaacaaagt 960aattggcagt
tacactacta tctccaagct ttagatgatc ctaattttct gatctctgcc
1020aaggttattt gggaaaatcc tgttactaga ttaatctgca ataatagaac
aattaatcat 1080cctcaagaaa ccttgctaaa aggactaggt ttagcttcac
gtctatatta tctaattgaa 1140gaaagtttac aagacaataa gcctagtttt
tctgagttag atcccataca agtctatgaa 1200tttttacgtt caattgctaa
tattcttaaa gataatggct taggggttat cttaccagct 1260agtctagagc
aaggagtcga agaaaaacgc ttaggaatta gtctaaccgc agaagttaag
1320tcgaaaaaag gacaaagact tagcttacaa agtttgttaa gttataagct
aaatttagca 1380attggtgata aaacaatatc gaaaaaagac tttgaaaaac
tattagcgca aaagtcacct 1440ttagttgaag taaaaggaga atggatagca
ttacaacctg ctgatgtcaa ggccgcacaa 1500caaattttaa ataagtccta
tgatccccta gaactttctg tagaagatgc tttacgcttc 1560agcacaggag
atatttcaac tgttgccaaa ctgccgatta ctaactttga agcaaaaggg
1620gaattagcca atctaattaa tgctataaat aataatgaat caatccctat
gatcgaaaat 1680cccagaggat ttaaaggtca attacgtccc tatcaacagc
gaggagtcgg ttggttatcg 1740ttcttagaaa aatggggttt aggggcttgt
cttgccgatg atatgggatt aggaaaaaca 1800ccacaattaa ttgggtttct
cttacattta agaagcgaag gaatgttaga tcaacctacc 1860ttagttattt
gtcctacatc tgttttaaat aactgggaaa gagaagttca aaaatttgcc
1920ccaacccttt ctactttgat tcatcatgga gataaacgta gtaaagggaa
agcttttgtt 1980aaagcagtta gtaaaaaaaa tgttatcatt actagctatt
ctttaattta tcgagatatt 2040aaaagctttg aacaggtaga atggcaaggt
attgtcttag atgaagcaca aaatataaaa 2100aatccccagg caaaacaatc
ccaagcagtg cgtcaaattt ccacacagtt tcgtattgct 2160ttaacaggaa
ctcctgtaga aaatcgccta acagaattat ggtcaattct tgactttctt
2220aacccaggat ttttagggac acagcagttt ttccgtcgtc gttttgccac
tcctatcgaa 2280aaatatgggg ataaagaatc actgcaaatt atgcgttctt
tggtacgtcc tttcattctc 2340agacgattga aaacagataa aactattatt
caagatttac ccgaaaaaca agaaatgacc 2400attttttgtg ggttatcctc
agaacaagga aaactttatc aacaattagt agataattct 2460ctggtagcaa
tagaagagaa aacaggaatt gaacgcaaag gcttaatttt aagcttactg
2520ctaaaactca aacaaatttg taaccatcct gctcattttc tcaagcaaaa
gagcttaaaa 2580acagcagaac aatctggtaa attattaaga ctagaagaaa
tgctagaaga attaatcgaa 2640gaaggagatc atgctttaat ctttacccaa
ttttctgaat ggggtaaact gctgcaacct 2700tatttacaga aaaaatttca
gcaagacgtt ctctttttgt atggtgctac tcgcagagtt 2760caaagacaag
aaatgatcga tcgctttcaa caggatccca acggacccag aatttttatt
2820ctctccttaa aagcaggggg aaccggatta aatttaaccc gcgctaacca
tgtatttcat 2880attgatcgtt ggtggaaccc agcagtagaa aatcaagcaa
ccgatcgcgc gtttcgttta 2940ggacaaaaac gcaatgttca agtacataaa
tttgtctgta caggaaccct agaagaaaaa 3000attaacgaaa tgttagaaag
taaacaaaaa ttagccgaac aaaccgttga cgcaggggaa 3060caatggttga
cagaattaga tacagatcaa ctgcgtaacc tcttattatt ggatcgagat
3120accattattg acgaacaata a 3141381046PRTCrocosphaera watsonii
38Met Thr Ile Leu His Gly Thr Trp Ile Glu Asn Thr Ser Glu Lys His 1
5 10 15 Phe Phe Ile Trp Gly Glu Thr Trp Arg Ser Leu Ser Ser Asp Ile
Ser 20 25 30 Ser Asp Asp Ser Ile Leu Met Tyr Pro Phe Ser Val Asp
Lys Gln Gly 35 40 45 Ile Ile Glu Gln Leu Asn Ser Asn Lys Ile Lys
Ile Glu Lys Asn Lys 50 55 60 Asn Ile Glu Ser Val Ser Gln Ile Phe
Tyr Leu Pro Ser Lys Phe Ile 65 70 75 80 Ala Lys Ser Lys Gln Ser Ile
Pro Leu Leu Ser Thr Glu Leu Lys Asp 85 90 95 Lys Asp Phe Glu Gln
Gly Asp Ile Gln Leu Ile Ala Trp Lys Ile Glu 100 105 110 Gly Ile Lys
Leu Asn Val Asp Asp Thr Ile Asn Ile Leu Ser Gln Leu 115 120 125 Pro
Leu Gly Leu Thr Asn Asn Asp Glu Asn Tyr Ile Gly Asp Asn Leu 130 135
140 Lys Phe Trp Thr His Ile Tyr Arg Trp Ser Leu Asp Leu Leu Thr Arg
145 150 155 160 Gly Lys Tyr Leu Pro Gln Met Glu Glu Gln Asp Asn Asn
Cys Tyr Gly 165 170 175 Gln Trp Glu Pro Leu Leu Asp Ser Leu Val Asp
Gln Gln Arg Phe Ser 180 185 190 Lys Phe Ile Gln Thr Met Pro Asn Ser
Ser Leu Ala Tyr His Asn Leu 195 200 205 Met Glu Gly Glu Leu Ser Ser
Ser Leu Leu Lys Gln Thr Thr Ile Leu 210 215 220 Asp Phe Leu Ser Thr
Ile Ile Asn Gln Gln Val Arg Gln Phe Ile Asp 225 230 235 240 Val Ala
Ile Thr Pro Ser Ser Phe Ile Gln Lys Trp Leu Tyr Ser Leu 245 250 255
Thr Gln Asp Leu Ser Lys Phe Glu Ala Ser Glu Val Glu Arg Lys Gly 260
265 270 Leu Lys Asn Ala Ile Asn Asn Trp Lys Ser Ser Leu Ser Glu Tyr
Ile 275 280 285 Ile Lys Ser Asp Asn Gln Pro Leu Gly Ile Asn Gln Phe
Arg Val Cys 290 295 300 Phe Lys Leu Glu Asn Pro Ala Lys Ser Gly Lys
Lys Leu Glu Gln Ser 305 310 315 320 Asn Trp Gln Leu His Tyr Tyr Leu
Gln Ala Leu Asp Asp Pro Asn Phe 325 330 335 Leu Ile Ser Ala Lys Val
Ile Trp Glu Asn Pro Val Thr Arg Leu Ile 340 345 350 Cys Asn Asn Arg
Thr Ile Asn His Pro Gln Glu Thr Leu Leu Lys Gly 355 360 365 Leu Gly
Leu Ala Ser Arg Leu Tyr Tyr Leu Ile Glu Glu Ser Leu Gln 370 375 380
Asp Asn Lys Pro Ser Phe Ser Glu Leu Asp Pro Ile Gln Val Tyr Glu 385
390 395 400 Phe Leu Arg Ser Ile Ala Asn Ile Leu Lys Asp Asn Gly Leu
Gly Val 405 410 415 Ile Leu Pro Ala Ser Leu Glu Gln Gly Val Glu Glu
Lys Arg Leu Gly 420 425 430 Ile Ser Leu Thr Ala Glu Val Lys Ser Lys
Lys Gly Gln Arg Leu Ser 435 440 445 Leu Gln Ser Leu Leu Ser Tyr Lys
Leu Asn Leu Ala Ile Gly Asp Lys 450 455 460 Thr Ile Ser Lys Lys Asp
Phe Glu Lys Leu Leu Ala Gln Lys Ser Pro 465 470 475 480 Leu Val Glu
Val Lys Gly Glu Trp Ile Ala Leu Gln Pro Ala Asp Val 485 490 495 Lys
Ala Ala Gln Gln Ile Leu Asn Lys Ser Tyr Asp Pro Leu Glu Leu 500 505
510 Ser Val Glu Asp Ala Leu Arg Phe Ser Thr Gly Asp Ile Ser Thr Val
515 520 525 Ala Lys Leu Pro Ile Thr Asn Phe Glu Ala Lys Gly Glu Leu
Ala Asn 530 535 540 Leu Ile Asn Ala Ile Asn Asn Asn Glu Ser Ile Pro
Met Ile Glu Asn 545 550 555 560 Pro Arg Gly Phe Lys Gly Gln Leu Arg
Pro Tyr Gln Gln Arg Gly Val 565 570 575 Gly Trp Leu Ser Phe Leu Glu
Lys Trp Gly Leu Gly Ala Cys Leu Ala 580 585 590 Asp Asp Met Gly Leu
Gly Lys Thr Pro Gln Leu Ile Gly Phe Leu Leu 595 600 605 His Leu Arg
Ser Glu Gly Met Leu Asp Gln Pro Thr Leu Val Ile Cys 610 615 620 Pro
Thr Ser Val Leu Asn Asn Trp Glu Arg Glu Val Gln Lys Phe Ala 625 630
635 640 Pro Thr Leu Ser Thr Leu Ile His His Gly Asp Lys Arg Ser Lys
Gly 645 650 655 Lys Ala Phe Val Lys Ala Val Ser Lys Lys Asn Val Ile
Ile Thr Ser 660 665 670 Tyr Ser Leu Ile Tyr Arg Asp Ile Lys Ser Phe
Glu Gln Val Glu Trp 675 680 685 Gln Gly Ile Val Leu Asp Glu Ala Gln
Asn Ile Lys Asn Pro Gln Ala 690 695 700 Lys Gln Ser Gln Ala Val Arg
Gln Ile Ser Thr Gln Phe Arg Ile Ala 705 710 715 720 Leu Thr Gly Thr
Pro Val Glu Asn Arg Leu Thr Glu Leu Trp Ser Ile 725 730 735 Leu Asp
Phe Leu Asn Pro Gly Phe Leu Gly Thr Gln Gln Phe Phe Arg 740 745 750
Arg Arg Phe Ala Thr Pro Ile Glu Lys Tyr Gly Asp Lys Glu Ser Leu 755
760 765 Gln Ile Met Arg Ser Leu Val Arg Pro Phe Ile Leu Arg Arg Leu
Lys 770 775 780 Thr Asp Lys Thr Ile Ile Gln Asp Leu Pro Glu Lys Gln
Glu Met Thr 785 790 795 800 Ile Phe Cys Gly Leu Ser Ser Glu Gln Gly
Lys Leu Tyr Gln Gln Leu 805 810 815 Val Asp Asn Ser Leu Val Ala Ile
Glu Glu Lys Thr Gly Ile Glu Arg 820 825 830 Lys Gly Leu Ile Leu Ser
Leu Leu Leu Lys Leu Lys Gln Ile Cys Asn 835 840 845 His Pro Ala His
Phe Leu Lys Gln Lys Ser Leu Lys Thr Ala Glu Gln 850 855 860 Ser Gly
Lys Leu Leu Arg Leu Glu Glu Met Leu Glu Glu Leu Ile Glu 865 870 875
880 Glu Gly Asp His Ala Leu Ile Phe Thr Gln Phe Ser Glu Trp Gly Lys
885 890 895 Leu Leu Gln Pro Tyr Leu Gln Lys Lys Phe Gln Gln Asp Val
Leu Phe 900 905 910 Leu Tyr Gly Ala Thr Arg Arg Val Gln Arg Gln Glu
Met Ile Asp Arg 915 920 925 Phe Gln Gln Asp Pro Asn Gly Pro Arg Ile
Phe Ile Leu Ser Leu Lys 930 935 940 Ala Gly Gly Thr Gly Leu Asn Leu
Thr Arg Ala Asn His Val Phe His 945 950 955 960 Ile Asp Arg Trp Trp
Asn Pro Ala Val Glu Asn Gln Ala Thr Asp Arg 965 970 975 Ala Phe Arg
Leu Gly Gln Lys Arg Asn Val Gln Val His Lys Phe Val 980 985 990 Cys
Thr Gly Thr Leu Glu Glu Lys Ile Asn Glu Met Leu Glu Ser Lys 995
1000 1005 Gln Lys Leu Ala Glu Gln Thr Val Asp Ala Gly Glu Gln Trp
Leu 1010 1015 1020 Thr Glu Leu Asp Thr Asp Gln Leu Arg Asn Leu Leu
Leu Leu Asp 1025 1030 1035 Arg Asp Thr Ile Ile Asp Glu Gln 1040
1045 393027DNAGloeobacter violaceus 39atggctatct tgcacggtat
ctgggttcac caaccccccc gggccgggct tttcctttgg 60ggagaaacct ggaggcaggt
cgcaaagcgg cgcaagcgct ccgaagcacc cgctccgcat 120ccctatgtcc
agcaaccggc cgagttgtcc ccccgcctgg ctgcccagtt tccccagata
180ccgctcagct tgctggtacc cgagacgctt gcactccagt tgcccgccac
ggtcgaaaac 240gtggtctact ccgcaagcat tgctcccgag ggcaagcttt
tggagttgga accgtggctg 300gtggaaggtt tctggctcga cggtcaccag
gcttttgaac tgttgctcgg ggtacccctg 360ggcggcgggg acgcatcgat
tggcgacgac ctgcgcttct ggtcgcagtg cgcccgctgg 420gtgcttgact
tgctggtgcg cgccaagtac ctgcccgacc tggagagcgg cgacggccag
480gaaatcccca cagcccgctg ggtgcccctg ctcgacagcg ccgtcgatca
agcccgcctc 540aaagaatttg ccgcccgttt gccgggcgcc tgccgcgccg
ctacccccga actatctccg 600caccagattc tcaagagttt cctgagcgcc
atgctcgacg cgcgggtgcg cacgctgctc 660gcttgcgagc ctcccgatcc
gcgcacgctg cctgccggag cggtgcgccc ctggcttctg 720gccctggccc
atgcccagcc ccagctcaaa tctccggacc cggagacgcc ggctctggcg
780gaagccctgg ccacctggcg cgcccccctg agctatcagg ttcgctcgcg
cacctgcttc 840cgtctgcagc cgcccgagga gagccagggc gagtggaagc
tgcactttct attgcaaaca 900ggcgacgatc ccgattcgct gatggctgcc
cagcaagtct ggagcagcgc gggtgagctg 960caggaggtgt ttctcgcggg
cttgggcctc gcctcgcgta tctttgtgcc cgtcgagcgg 1020ggattgctcg
tcccccagcc cacctgctgc accatgagca ccgtcgaggc gtttcagttt
1080ctcaaagccg ccacctggcg gttgcgcgac agcggcttcg gggtgttgtt
gcccgagagc 1140ctcgcggacg cgggcagcct gcgcaaccgc ctgggcctca
aactcgaagc gaacgcgccg 1200gggcgcaacg gttcgggcct cggcatgcag
agcttgctcg cttttaaatg ggagctgtcg 1260ctcgcgggca agaccctgag
ccgcgccgag ttcgaccgcc tcgccgctag ttctgaaccc 1320ctggtcaaag
tcaacgacaa ctgggtcgaa ttgcgccccc aggacgtgcg cgccgcccac
1380agctttttgc agtcgcgcaa agatcaggtc ggactctcgt tggaggatgt
gctgcgcctc 1440aacttcggcg acacccccaa aatcgacggt ctccccatcg
tcaacttcga cagctccggc 1500cccattcagc aactgctgga gaccctcacc
gatcagcgca aactcacccc catcgacgaa 1560ccgccggggt tcaagggcac
cctgcggccc tatcaaaaaa ttggcgtcgg ctggctcgcc 1620tttttgcaga
agtggggcct gggtgcttgc ctagccgacg acatgggact cgggaagacc
1680gtagagttga tagcatttct tctttttctc aaatccaaaa atgagctgga
cggccctata 1740ttgttaattt gtccgacttc agtgatggga aactgggaaa
gagaaataaa gaaattttct 1800cctagtttat ctgtacatgt ccatcatggg
gcgcggcggc cgaaggggcg caattttgtc 1860gagacggccc agaaaaagca
aatcatcgtc agcagctacg ccctggtaca gcgcgacagc 1920aaagatctca
agcgcgtcga atggttgggc ctggtgctcg acgaagccca gaacatcaaa
1980aaccccgacg ccaagcagac ccagtcgatt cgggaactga cagcgcgctt
tcgcatcgcc 2040ctcaccggca caccggtcga gaatcgcctc gcggaactgt
ggtcgatcct cgattttctc 2100aatcccggct atctgggggc gcgcaacttc
tttcagcgcc gcttcgcagt tccgatcgaa 2160aagtacgggg atcgctcctc
ggcgaacgcc ctcaaagctc tggtgcagcc gtttatcctg 2220cggcggctca
aatccgaccc gcagattatt caagatctgc ccgagaagca ggagacgaat
2280gtcttctgtc cgctcacacc cgagcaggcg gccctctacg agcgggtggt
gaacgaatcg 2340ctcgccaaga tcgagcagag caccggcatc cagcggcgcg
ggacggtgct ggccaccttg 2400gtcaaactca agcagatctg caaccacccg
agccactacc tgggtgacga cggaccgctc 2460gccaaccgct cgggcaaact
cagccgcctg ggcgagatgc tcgaagaagt gctcgccgac 2520gaggagcggg
cgctgatttt tacccagttc gccgagtggg gccacctgct gcaggcgcac
2580ctgagccgcc agttgggttc agaagtgttt ttcctctacg gcggcaccag
caaaaaccag 2640cgcgaggcga tgatcgagcg cttccagagc gatccgcagg
ggccgcggat ttttattctt 2700tcgctgaagg cagggggtgt cggcctcaac
ctcacccgcg ccaaccacgt cttccacttc 2760gaccgctggt ggaacccggc
ggtcgagaat caggccaccg accgcgtctt ccgcatcggc 2820caaaccaaga
acgtacaagt ctacaagtac gtgtgcaccg gcacgctcga agagcgcatc
2880aacgccctga tcgaaagcaa aaaggccctg gctgagcagg tggtgagcgc
cggtgagaac 2940tggctgtcgg atctaaatac cgatcaactg cggcaactgt
tggtactcga tcgctcggag 3000attatcgaca cggaggacac cgcgtga
3027401008PRTGloeobacter violaceus 40Met Ala Ile Leu His Gly Ile
Trp Val His Gln Pro Pro Arg Ala Gly 1 5 10 15 Leu Phe Leu Trp Gly
Glu Thr Trp Arg Gln Val Ala Lys Arg Arg Lys 20 25 30 Arg Ser Glu
Ala Pro Ala Pro His Pro Tyr Val Gln Gln Pro Ala Glu 35 40 45 Leu
Ser Pro Arg Leu Ala Ala Gln Phe Pro Gln Ile Pro Leu Ser Leu 50 55
60 Leu Val Pro Glu Thr Leu Ala Leu Gln Leu Pro Ala Thr Val Glu Asn
65 70 75 80 Val Val Tyr Ser Ala Ser Ile Ala Pro Glu Gly Lys Leu Leu
Glu Leu 85 90 95 Glu Pro Trp Leu Val Glu Gly Phe Trp Leu Asp Gly
His Gln Ala Phe 100 105 110 Glu Leu Leu Leu Gly Val Pro Leu Gly Gly
Gly Asp Ala Ser Ile Gly 115 120 125 Asp Asp Leu Arg Phe Trp Ser Gln
Cys Ala Arg Trp Val Leu Asp Leu 130 135 140 Leu Val Arg Ala Lys Tyr
Leu Pro Asp Leu Glu Ser Gly Asp Gly Gln 145 150 155 160 Glu Ile Pro
Thr Ala Arg Trp Val Pro Leu Leu Asp Ser Ala Val Asp 165 170 175 Gln
Ala Arg Leu Lys Glu Phe Ala Ala Arg Leu Pro Gly Ala Cys Arg 180 185
190 Ala Ala Thr Pro Glu Leu Ser Pro His Gln Ile Leu Lys Ser Phe Leu
195 200 205 Ser Ala Met Leu Asp Ala Arg Val Arg Thr Leu Leu Ala Cys
Glu Pro 210 215 220 Pro Asp Pro Arg Thr Leu Pro Ala Gly Ala Val Arg
Pro Trp Leu Leu 225 230 235 240 Ala Leu Ala His Ala Gln Pro Gln Leu
Lys Ser Pro Asp Pro Glu Thr 245 250 255 Pro Ala Leu Ala Glu Ala Leu
Ala Thr Trp Arg Ala Pro Leu Ser Tyr 260 265 270 Gln Val Arg Ser Arg
Thr Cys Phe Arg Leu Gln Pro Pro Glu Glu Ser 275 280 285 Gln Gly Glu
Trp Lys Leu His Phe Leu Leu Gln Thr Gly Asp Asp Pro 290 295 300 Asp
Ser Leu Met Ala Ala Gln Gln Val Trp Ser Ser Ala Gly Glu Leu 305 310
315 320 Gln Glu Val Phe Leu Ala Gly Leu Gly Leu Ala Ser Arg Ile Phe
Val 325 330
335 Pro Val Glu Arg Gly Leu Leu Val Pro Gln Pro Thr Cys Cys Thr Met
340 345 350 Ser Thr Val Glu Ala Phe Gln Phe Leu Lys Ala Ala Thr Trp
Arg Leu 355 360 365 Arg Asp Ser Gly Phe Gly Val Leu Leu Pro Glu Ser
Leu Ala Asp Ala 370 375 380 Gly Ser Leu Arg Asn Arg Leu Gly Leu Lys
Leu Glu Ala Asn Ala Pro 385 390 395 400 Gly Arg Asn Gly Ser Gly Leu
Gly Met Gln Ser Leu Leu Ala Phe Lys 405 410 415 Trp Glu Leu Ser Leu
Ala Gly Lys Thr Leu Ser Arg Ala Glu Phe Asp 420 425 430 Arg Leu Ala
Ala Ser Ser Glu Pro Leu Val Lys Val Asn Asp Asn Trp 435 440 445 Val
Glu Leu Arg Pro Gln Asp Val Arg Ala Ala His Ser Phe Leu Gln 450 455
460 Ser Arg Lys Asp Gln Val Gly Leu Ser Leu Glu Asp Val Leu Arg Leu
465 470 475 480 Asn Phe Gly Asp Thr Pro Lys Ile Asp Gly Leu Pro Ile
Val Asn Phe 485 490 495 Asp Ser Ser Gly Pro Ile Gln Gln Leu Leu Glu
Thr Leu Thr Asp Gln 500 505 510 Arg Lys Leu Thr Pro Ile Asp Glu Pro
Pro Gly Phe Lys Gly Thr Leu 515 520 525 Arg Pro Tyr Gln Lys Ile Gly
Val Gly Trp Leu Ala Phe Leu Gln Lys 530 535 540 Trp Gly Leu Gly Ala
Cys Leu Ala Asp Asp Met Gly Leu Gly Lys Thr 545 550 555 560 Val Glu
Leu Ile Ala Phe Leu Leu Phe Leu Lys Ser Lys Asn Glu Leu 565 570 575
Asp Gly Pro Ile Leu Leu Ile Cys Pro Thr Ser Val Met Gly Asn Trp 580
585 590 Glu Arg Glu Ile Lys Lys Phe Ser Pro Ser Leu Ser Val His Val
His 595 600 605 His Gly Ala Arg Arg Pro Lys Gly Arg Asn Phe Val Glu
Thr Ala Gln 610 615 620 Lys Lys Gln Ile Ile Val Ser Ser Tyr Ala Leu
Val Gln Arg Asp Ser 625 630 635 640 Lys Asp Leu Lys Arg Val Glu Trp
Leu Gly Leu Val Leu Asp Glu Ala 645 650 655 Gln Asn Ile Lys Asn Pro
Asp Ala Lys Gln Thr Gln Ser Ile Arg Glu 660 665 670 Leu Thr Ala Arg
Phe Arg Ile Ala Leu Thr Gly Thr Pro Val Glu Asn 675 680 685 Arg Leu
Ala Glu Leu Trp Ser Ile Leu Asp Phe Leu Asn Pro Gly Tyr 690 695 700
Leu Gly Ala Arg Asn Phe Phe Gln Arg Arg Phe Ala Val Pro Ile Glu 705
710 715 720 Lys Tyr Gly Asp Arg Ser Ser Ala Asn Ala Leu Lys Ala Leu
Val Gln 725 730 735 Pro Phe Ile Leu Arg Arg Leu Lys Ser Asp Pro Gln
Ile Ile Gln Asp 740 745 750 Leu Pro Glu Lys Gln Glu Thr Asn Val Phe
Cys Pro Leu Thr Pro Glu 755 760 765 Gln Ala Ala Leu Tyr Glu Arg Val
Val Asn Glu Ser Leu Ala Lys Ile 770 775 780 Glu Gln Ser Thr Gly Ile
Gln Arg Arg Gly Thr Val Leu Ala Thr Leu 785 790 795 800 Val Lys Leu
Lys Gln Ile Cys Asn His Pro Ser His Tyr Leu Gly Asp 805 810 815 Asp
Gly Pro Leu Ala Asn Arg Ser Gly Lys Leu Ser Arg Leu Gly Glu 820 825
830 Met Leu Glu Glu Val Leu Ala Asp Glu Glu Arg Ala Leu Ile Phe Thr
835 840 845 Gln Phe Ala Glu Trp Gly His Leu Leu Gln Ala His Leu Ser
Arg Gln 850 855 860 Leu Gly Ser Glu Val Phe Phe Leu Tyr Gly Gly Thr
Ser Lys Asn Gln 865 870 875 880 Arg Glu Ala Met Ile Glu Arg Phe Gln
Ser Asp Pro Gln Gly Pro Arg 885 890 895 Ile Phe Ile Leu Ser Leu Lys
Ala Gly Gly Val Gly Leu Asn Leu Thr 900 905 910 Arg Ala Asn His Val
Phe His Phe Asp Arg Trp Trp Asn Pro Ala Val 915 920 925 Glu Asn Gln
Ala Thr Asp Arg Val Phe Arg Ile Gly Gln Thr Lys Asn 930 935 940 Val
Gln Val Tyr Lys Tyr Val Cys Thr Gly Thr Leu Glu Glu Arg Ile 945 950
955 960 Asn Ala Leu Ile Glu Ser Lys Lys Ala Leu Ala Glu Gln Val Val
Ser 965 970 975 Ala Gly Glu Asn Trp Leu Ser Asp Leu Asn Thr Asp Gln
Leu Arg Gln 980 985 990 Leu Leu Val Leu Asp Arg Ser Glu Ile Ile Asp
Thr Glu Asp Thr Ala 995 1000 1005 413186DNALyngbya sp. 41atggcaattt
tacacggaag ttggctccag caccccaaaa attatttgtt tatttgggga 60gaaacctggc
gtcgcattac acccaatgaa tttaatccgg ctgatggtgt tttgggttat
120ccttttgctt taagccctgt tgaattggaa aagtggtgca gtgaaaagca
gttatctata 180gagagtaaag ttgtcgttac agaaactctc gcccttccca
ctaaactctc cccaaaaata 240ggactatatc cccttcaatc tacgcctcaa
actgattctg aaactgattc tgagtcgatc 300tgtctttatc cctggaaaat
tgaaggtatt tgtctcaaca gtacagaagc ctttgacttt 360ttacaatccc
ttcctctggg aaacctgacc acagaaaact catttattgg ctcagattta
420cagttttggt ctcatctttc ccgttggagt ttagacttac tcgcccggag
taaattttta 480cccagtctca cttttaaccc ctcaaaagat cactttatcg
ctgaatggaa acctttactc 540gatagtgcga cagatcaagc cagattaatt
cgtttttcta aacaaatacc ctctgcttgt 600cggatctatc aactctggtc
aaaagaggct caaaatcaat ttgaaaattt agccctagat 660ttacctcaaa
atccccaaaa cttaattgat gattttttaa cggcaattat tgatagtcaa
720gtcaagaaag ttgcagaaga aagtgaaaaa aaagcgatta caaatctaac
cgctattcaa 780ccgattgttc agagttggtt acacgcttta gccagtgaat
ctaatctagc aaaatccaaa 840aaatctgaat caaaaaccct agaaaaaatt
ctttccaatt ggacggctcc tcttcaacaa 900actctcgctg aacataattt
gtttagaacg ggatttcgac tctctcctcc ggaaaataat 960caaaaaaatt
ggacgctaga ttattgttta caagcaattg atgaacccga atttttagtg
1020gatgctcaaa ctatttggac tcatccagtc gaagcctttg ttcacaatgg
acgtatgatt 1080aaacgtcctc aagaaaccct cctcaaaggt ttaggtttag
cctcaaaact atatcctctc 1140ctagaaccca gtttacaaga agcccgtcct
caaacttgct tattaacgcc cctacaagcc 1200tatgaattta ttaaaagtat
taattggcgg tttacagata gcggtttagg agtgatttta 1260cccccgagtt
tagtcagtca aaatggatgg gcgaaccgtt taggtttaag tgttcaagcg
1320gcgacatcaa aatccaaaca aaatgttagc ttgggattag atagtctgct
gaattttaaa 1380tgggaattgt caattggggg tcaaacctta tcaaaaacag
aatttaaccg tttagtcgct 1440caagaaagtc cgttagttga aattaatggc
gaatgggtgg aattacgtcc tactgatatt 1500aaagccgcta aagccttctt
ttcgagtcgc aaagatcaac tttcacttac ccttgaagat 1560gctttacgtt
tatcgacggg tgactcgcaa atggtggaaa agttaccgat tgttaacttt
1620gaagcgggtg gaaaattaga agaacttctc aatactttaa cgaataaccg
ttcgctcgat 1680gagatcaaaa ctcctagtaa ttttcaagga gaactacgcc
cctatcaagc ccgaggggtg 1740agttggttag cctttttaga agaatggggt
ttaggggctt gtttagctga tgatatgggg 1800ctaggaaaaa ccatagaatt
aattgctttt ctcttgtatt tgcaggaaaa agaaacctta 1860gacgctcctg
ttttactggt ttgtccgaca tcagttttag gaaactggga acgagaagtt
1920aaacgattta gtccgagttt aaaagttact gttcatcacg gggataaacg
ccagaaaggg 1980aaaaactttg ctcaatttgc ccagaaatat aatttaatta
ttaccagtta tccgttaact 2040tttcgagatg agaaagaact caaaacggta
aattggaaag gattagtttt agacgaagct 2100caaaatatta aaaatcccga
ggctaaacaa tcaaaaacgg tgagaaatct acaggcgagt 2160tttaaaattg
ctctgactgg aacacctgtc gaaaaccgtc tgtctgaatt atggtcaatt
2220atggattttc tcaacccagg ttatttagga cagcgacaat tttttcagcg
aagatttgct 2280attccgattg aaaaatacgg cgatacagac tccttaaaaa
cattgcgatc tttggttcaa 2340ccgtttattt tacggcgctt aaaaacagat
agagagatta tccaagactt acccgaaaaa 2400caggaaaata cgatcttttg
ttctctgtct acagaacaag caacgcttta tcaaaagatt 2460gttgatcagt
ctttagctga catagactca gccgcaggaa ttcaacgtcg agggatgatt
2520ttagcgttgt tagtgaaatt aaaacaggtt tgtaatcatc ccattttatt
gaatggaaaa 2580gcgacaaaaa ctggaaagaa aaaggtcgag actcagggtt
taagcctgca aagttcaggg 2640aagttacaac gcttcaaaga aatgctggaa
gaattgttgt cagaaggaga tcgcgccatt 2700gtatttaccc agtttgcaga
atggggaaaa gttttacaac cttatttaga acagcaatta 2760aaccgagagg
tattattttt gtatggcgca actcgtaaaa ataaacgaga agaaatgatt
2820gatcgttttc aacaagatcc tcaagggcca ccgattttta ttctatcttt
aaaagcggga 2880ggtgtgggtt taaatttgac tcgtgctaat catgtttttc
actttgatcg ttggtggaac 2940cctgcggttg aaaatcaagc aacagatcgg
gtgtttagaa ttggtcaaac gcgcaatgtt 3000caggttcata agtttgtctg
taccggaacg ttggaagaaa aaatccatga tttaattgaa 3060agtaaaaaag
tgttggctga acaagttgtg ggttcaggag aaaattggtt aactgaattg
3120gatacggatc aactcagaaa cttactcatt attgaccgaa atgcggtgat
tgatgaagaa 3180gaataa 3186421061PRTLyngbya sp. 42Met Ala Ile Leu
His Gly Ser Trp Leu Gln His Pro Lys Asn Tyr Leu 1 5 10 15 Phe Ile
Trp Gly Glu Thr Trp Arg Arg Ile Thr Pro Asn Glu Phe Asn 20 25 30
Pro Ala Asp Gly Val Leu Gly Tyr Pro Phe Ala Leu Ser Pro Val Glu 35
40 45 Leu Glu Lys Trp Cys Ser Glu Lys Gln Leu Ser Ile Glu Ser Lys
Val 50 55 60 Val Val Thr Glu Thr Leu Ala Leu Pro Thr Lys Leu Ser
Pro Lys Ile 65 70 75 80 Gly Leu Tyr Pro Leu Gln Ser Thr Pro Gln Thr
Asp Ser Glu Thr Asp 85 90 95 Ser Glu Ser Ile Cys Leu Tyr Pro Trp
Lys Ile Glu Gly Ile Cys Leu 100 105 110 Asn Ser Thr Glu Ala Phe Asp
Phe Leu Gln Ser Leu Pro Leu Gly Asn 115 120 125 Leu Thr Thr Glu Asn
Ser Phe Ile Gly Ser Asp Leu Gln Phe Trp Ser 130 135 140 His Leu Ser
Arg Trp Ser Leu Asp Leu Leu Ala Arg Ser Lys Phe Leu 145 150 155 160
Pro Ser Leu Thr Phe Asn Pro Ser Lys Asp His Phe Ile Ala Glu Trp 165
170 175 Lys Pro Leu Leu Asp Ser Ala Thr Asp Gln Ala Arg Leu Ile Arg
Phe 180 185 190 Ser Lys Gln Ile Pro Ser Ala Cys Arg Ile Tyr Gln Leu
Trp Ser Lys 195 200 205 Glu Ala Gln Asn Gln Phe Glu Asn Leu Ala Leu
Asp Leu Pro Gln Asn 210 215 220 Pro Gln Asn Leu Ile Asp Asp Phe Leu
Thr Ala Ile Ile Asp Ser Gln 225 230 235 240 Val Lys Lys Val Ala Glu
Glu Ser Glu Lys Lys Ala Ile Thr Asn Leu 245 250 255 Thr Ala Ile Gln
Pro Ile Val Gln Ser Trp Leu His Ala Leu Ala Ser 260 265 270 Glu Ser
Asn Leu Ala Lys Ser Lys Lys Ser Glu Ser Lys Thr Leu Glu 275 280 285
Lys Ile Leu Ser Asn Trp Thr Ala Pro Leu Gln Gln Thr Leu Ala Glu 290
295 300 His Asn Leu Phe Arg Thr Gly Phe Arg Leu Ser Pro Pro Glu Asn
Asn 305 310 315 320 Gln Lys Asn Trp Thr Leu Asp Tyr Cys Leu Gln Ala
Ile Asp Glu Pro 325 330 335 Glu Phe Leu Val Asp Ala Gln Thr Ile Trp
Thr His Pro Val Glu Ala 340 345 350 Phe Val His Asn Gly Arg Met Ile
Lys Arg Pro Gln Glu Thr Leu Leu 355 360 365 Lys Gly Leu Gly Leu Ala
Ser Lys Leu Tyr Pro Leu Leu Glu Pro Ser 370 375 380 Leu Gln Glu Ala
Arg Pro Gln Thr Cys Leu Leu Thr Pro Leu Gln Ala 385 390 395 400 Tyr
Glu Phe Ile Lys Ser Ile Asn Trp Arg Phe Thr Asp Ser Gly Leu 405 410
415 Gly Val Ile Leu Pro Pro Ser Leu Val Ser Gln Asn Gly Trp Ala Asn
420 425 430 Arg Leu Gly Leu Ser Val Gln Ala Ala Thr Ser Lys Ser Lys
Gln Asn 435 440 445 Val Ser Leu Gly Leu Asp Ser Leu Leu Asn Phe Lys
Trp Glu Leu Ser 450 455 460 Ile Gly Gly Gln Thr Leu Ser Lys Thr Glu
Phe Asn Arg Leu Val Ala 465 470 475 480 Gln Glu Ser Pro Leu Val Glu
Ile Asn Gly Glu Trp Val Glu Leu Arg 485 490 495 Pro Thr Asp Ile Lys
Ala Ala Lys Ala Phe Phe Ser Ser Arg Lys Asp 500 505 510 Gln Leu Ser
Leu Thr Leu Glu Asp Ala Leu Arg Leu Ser Thr Gly Asp 515 520 525 Ser
Gln Met Val Glu Lys Leu Pro Ile Val Asn Phe Glu Ala Gly Gly 530 535
540 Lys Leu Glu Glu Leu Leu Asn Thr Leu Thr Asn Asn Arg Ser Leu Asp
545 550 555 560 Glu Ile Lys Thr Pro Ser Asn Phe Gln Gly Glu Leu Arg
Pro Tyr Gln 565 570 575 Ala Arg Gly Val Ser Trp Leu Ala Phe Leu Glu
Glu Trp Gly Leu Gly 580 585 590 Ala Cys Leu Ala Asp Asp Met Gly Leu
Gly Lys Thr Ile Glu Leu Ile 595 600 605 Ala Phe Leu Leu Tyr Leu Gln
Glu Lys Glu Thr Leu Asp Ala Pro Val 610 615 620 Leu Leu Val Cys Pro
Thr Ser Val Leu Gly Asn Trp Glu Arg Glu Val 625 630 635 640 Lys Arg
Phe Ser Pro Ser Leu Lys Val Thr Val His His Gly Asp Lys 645 650 655
Arg Gln Lys Gly Lys Asn Phe Ala Gln Phe Ala Gln Lys Tyr Asn Leu 660
665 670 Ile Ile Thr Ser Tyr Pro Leu Thr Phe Arg Asp Glu Lys Glu Leu
Lys 675 680 685 Thr Val Asn Trp Lys Gly Leu Val Leu Asp Glu Ala Gln
Asn Ile Lys 690 695 700 Asn Pro Glu Ala Lys Gln Ser Lys Thr Val Arg
Asn Leu Gln Ala Ser 705 710 715 720 Phe Lys Ile Ala Leu Thr Gly Thr
Pro Val Glu Asn Arg Leu Ser Glu 725 730 735 Leu Trp Ser Ile Met Asp
Phe Leu Asn Pro Gly Tyr Leu Gly Gln Arg 740 745 750 Gln Phe Phe Gln
Arg Arg Phe Ala Ile Pro Ile Glu Lys Tyr Gly Asp 755 760 765 Thr Asp
Ser Leu Lys Thr Leu Arg Ser Leu Val Gln Pro Phe Ile Leu 770 775 780
Arg Arg Leu Lys Thr Asp Arg Glu Ile Ile Gln Asp Leu Pro Glu Lys 785
790 795 800 Gln Glu Asn Thr Ile Phe Cys Ser Leu Ser Thr Glu Gln Ala
Thr Leu 805 810 815 Tyr Gln Lys Ile Val Asp Gln Ser Leu Ala Asp Ile
Asp Ser Ala Ala 820 825 830 Gly Ile Gln Arg Arg Gly Met Ile Leu Ala
Leu Leu Val Lys Leu Lys 835 840 845 Gln Val Cys Asn His Pro Ile Leu
Leu Asn Gly Lys Ala Thr Lys Thr 850 855 860 Gly Lys Lys Lys Val Glu
Thr Gln Gly Leu Ser Leu Gln Ser Ser Gly 865 870 875 880 Lys Leu Gln
Arg Phe Lys Glu Met Leu Glu Glu Leu Leu Ser Glu Gly 885 890 895 Asp
Arg Ala Ile Val Phe Thr Gln Phe Ala Glu Trp Gly Lys Val Leu 900 905
910 Gln Pro Tyr Leu Glu Gln Gln Leu Asn Arg Glu Val Leu Phe Leu Tyr
915 920 925 Gly Ala Thr Arg Lys Asn Lys Arg Glu Glu Met Ile Asp Arg
Phe Gln 930 935 940 Gln Asp Pro Gln Gly Pro Pro Ile Phe Ile Leu Ser
Leu Lys Ala Gly 945 950 955 960 Gly Val Gly Leu Asn Leu Thr Arg Ala
Asn His Val Phe His Phe Asp 965 970 975 Arg Trp Trp Asn Pro Ala Val
Glu Asn Gln Ala Thr Asp Arg Val Phe 980 985 990 Arg Ile Gly Gln Thr
Arg Asn Val Gln Val His Lys Phe Val Cys Thr 995 1000 1005 Gly Thr
Leu Glu Glu Lys Ile His Asp Leu Ile Glu Ser Lys Lys 1010 1015 1020
Val Leu Ala Glu Gln Val Val Gly Ser Gly Glu Asn Trp Leu Thr 1025
1030 1035 Glu Leu Asp Thr Asp Gln Leu Arg Asn Leu Leu Ile Ile Asp
Arg 1040 1045 1050 Asn Ala Val Ile Asp Glu Glu Glu 1055 1060
433237DNAMethanosarcina acetivorans 43atgataattt tgcatgcagg
aagagtcgga aaacagttct ttctgtgggg cgaaagcccg 60gctgaaaatg aaactccgcc
tgtccggcgc gggagaaagc ctaagaagcc ggttgcaaaa 120ccttatcctt
acgattcggg tgttgaaaac ctgtcttctg ctcttgagct gctgctgggc
180agtactggcc ggaaaaaggc agaggaaatc aatgtctgga tcccgacagc
aggctggaat 240ccaatcccct ccagtcctct cgttgctgaa attccggctt
cgaaagcaga actttcccta 300gctccctgga ctgttcacgc atatcctctg
gaagctgaag aagctattgt tctcctctgc 360gcctgtatgg gaaaaaaggt
tcttgctccc
ggcataatct cgggaaatga tcttctctgg 420tgggcggatg ccctgaaatt
tgcaggctcg ctggtagcag gacagaaata cctgcctggc 480gtcaggggcg
gggaaggaga gtacaaggct ttctgggaac ccgtattttc cggagaagat
540gcgggggagc tggcaagact tgcaaagcaa atgcctccgg ctgcaaaggc
tcttgctctt 600gaaacctctt ccgtgcagcc ggaaatactt gctgctgtag
cggcaaggca gtttatcgaa 660gaggctcttg actggatagt ccggtccgag
atcggggaaa aagagcttgc aaaagaggcg 720cgtaaaagaa aatcctttga
tagcgtccat gacgcctggg tttccgctct taaaagccct 780gacgggttga
tccacggaga agaaaaagaa ctcctgcagc ttgcgttccg gacccgtgaa
840tggcagcgcc cccttactgt acttacaact tctcccttca ggttctgttt
ccggcttgaa 900gagccagctg cggaagaaga actcgaagaa accgaggaat
ccgaagccgg aaaaatggat 960actaaaaaag gcaggaaagg gatagctgac
atagaagttc ccgaagaact ctggtacgtc 1020cgctatatgc ttcagtccta
cgaagaccca agccttctga ttcctgtaaa agaggcctgg 1080aaaccaaaga
agggcagccc gttgaaaaga tatgatgtaa aaaacattcg ccaatttctg
1140ttatcttccc ttggacaggc tgctggcatc agtgcaggaa ttgcttccag
ccttgaagct 1200cccaacccgt ccggatattc ccttgatacg aaagaagctt
accgcttcct gactgaaagt 1260gcagcggatt taagccaggc gggcttcggg
ttacttctcc ccggctggtg gacccgtaaa 1320ggtacaaaga cccacttaaa
agcccaggct aatgttaagg gcaagaagtt gaaggccgga 1380tacgggctta
cactcgataa aatcgtcagc tttgactggg aaattgccct tggagaccgt
1440gcactcacag tcagggaact gcaggctctt gcaaagctca aagctccgct
tgtgaaattc 1500cgcgggcagt gggtcgaggt caacgatgcg gaaatccggg
ctgcccttga gttctggaag 1560aaaaaccccc acggggaagc aagtctgcgc
gaagttctaa aactggctgt gggagtctcc 1620gaaaaagccg atggtgtaga
cgttgaaggg cttaatgcag ccggctggat cgaagaatta 1680atccgccgcc
tgaaggacaa aaccgggttt gaagaacttc cggctcctga cggtttttca
1740ggcaccctca ggccctacca gttcagaggt tactcctggc tggctttcct
gaggcagtgg 1800ggcataggag cctgccttgc agacgacatg gggcttggta
aaaccatcca gacccttgcc 1860cttatccagc acgacctgga acaggttaaa
gggcaggttg aagaaaaggt tatagaaaat 1920gctgaagaaa aagttgaagg
acttaaagct gcaaaaccgg ttcttctggt ctgtccgacc 1980tctgtcatca
acaactggaa aaaagaggcg gctcgcttta ccccggaact ttcggtaatg
2040gtccaccacg ggaccagccg gaaaaaggaa gaggaattca aaaaggaagc
cacgaatcat 2100tctattgtcg tctcaagcta cgggcttttg cagcgggatc
ttaagttttt aaaaggggtt 2160tcctgggccg gagtggtact tgacgaagcc
cagaatatca aaaacccgga aaccaaacag 2220gcaaaggcag ccagagctct
tgaagccgat taccgcatag ctcttacggg gactccggtt 2280gaaaacaacg
tgggagacct ctggtctatc atggagtttt taaaccccgg cttcctaggc
2340aaccaggcag gtttcaagcg gaatttcttt attcccattc aggccgaaag
ggatcaggaa 2400gctgcaagga ggttaaaaga aattacgggc ccctttatcc
tgcgccgtct gaagaccgat 2460acttcgatta tctccgacct gccggaaaag
atggaaatga aaacctattg tacgctgaca 2520aaagaacagg cttccctcta
tgccgcagtc ctcgaagaca tcgaagagac gatggaagag 2580gctgaagaag
gcatccagag aaaaggtata atcctgtccg cccttaccag gctcaaacag
2640gtctgcaacc atccggcgca gtttttgaag gataactctg ctgtacccgg
caggtcagga 2700aaacttgcaa ggcttaccga aatgctggat gtaatcctgg
aaaatgggga aaaagccctt 2760gtgttcaccc agtttgcgga gatgggaaaa
atgctaaaag aacacctgca ggcaagtttt 2820ggctgtgaag tccttttcct
gcacggcggg gtccccagaa agcagaggga tcggatgctt 2880gagcgtttcc
aggagggaaa agaatacctc cctatctttg tcctctccct taaagctgga
2940ggcacggggc ttaaccttac aggagcgaac cacgttttcc attttgaccg
ctggtggaac 3000cctgctgttg aaaaccaggc tacggacagg gctttccgta
taggccagac gaaaaatgta 3060gaggtgcata agttcatctg tgcgggtacg
cttgaagaaa aaatcgatga gattatcgag 3120cgcaaagtgc aggttgcaga
gaacgttgtc ggaacaggtg aaggttggct gacagaactt 3180tccaacgagg
aattgaagga tattcttgct ctccgagaag aagcggtagg tgaataa
3237441078PRTMethanosarcina acetivorans 44Met Ile Ile Leu His Ala
Gly Arg Val Gly Lys Gln Phe Phe Leu Trp 1 5 10 15 Gly Glu Ser Pro
Ala Glu Asn Glu Thr Pro Pro Val Arg Arg Gly Arg 20 25 30 Lys Pro
Lys Lys Pro Val Ala Lys Pro Tyr Pro Tyr Asp Ser Gly Val 35 40 45
Glu Asn Leu Ser Ser Ala Leu Glu Leu Leu Leu Gly Ser Thr Gly Arg 50
55 60 Lys Lys Ala Glu Glu Ile Asn Val Trp Ile Pro Thr Ala Gly Trp
Asn 65 70 75 80 Pro Ile Pro Ser Ser Pro Leu Val Ala Glu Ile Pro Ala
Ser Lys Ala 85 90 95 Glu Leu Ser Leu Ala Pro Trp Thr Val His Ala
Tyr Pro Leu Glu Ala 100 105 110 Glu Glu Ala Ile Val Leu Leu Cys Ala
Cys Met Gly Lys Lys Val Leu 115 120 125 Ala Pro Gly Ile Ile Ser Gly
Asn Asp Leu Leu Trp Trp Ala Asp Ala 130 135 140 Leu Lys Phe Ala Gly
Ser Leu Val Ala Gly Gln Lys Tyr Leu Pro Gly 145 150 155 160 Val Arg
Gly Gly Glu Gly Glu Tyr Lys Ala Phe Trp Glu Pro Val Phe 165 170 175
Ser Gly Glu Asp Ala Gly Glu Leu Ala Arg Leu Ala Lys Gln Met Pro 180
185 190 Pro Ala Ala Lys Ala Leu Ala Leu Glu Thr Ser Ser Val Gln Pro
Glu 195 200 205 Ile Leu Ala Ala Val Ala Ala Arg Gln Phe Ile Glu Glu
Ala Leu Asp 210 215 220 Trp Ile Val Arg Ser Glu Ile Gly Glu Lys Glu
Leu Ala Lys Glu Ala 225 230 235 240 Arg Lys Arg Lys Ser Phe Asp Ser
Val His Asp Ala Trp Val Ser Ala 245 250 255 Leu Lys Ser Pro Asp Gly
Leu Ile His Gly Glu Glu Lys Glu Leu Leu 260 265 270 Gln Leu Ala Phe
Arg Thr Arg Glu Trp Gln Arg Pro Leu Thr Val Leu 275 280 285 Thr Thr
Ser Pro Phe Arg Phe Cys Phe Arg Leu Glu Glu Pro Ala Ala 290 295 300
Glu Glu Glu Leu Glu Glu Thr Glu Glu Ser Glu Ala Gly Lys Met Asp 305
310 315 320 Thr Lys Lys Gly Arg Lys Gly Ile Ala Asp Ile Glu Val Pro
Glu Glu 325 330 335 Leu Trp Tyr Val Arg Tyr Met Leu Gln Ser Tyr Glu
Asp Pro Ser Leu 340 345 350 Leu Ile Pro Val Lys Glu Ala Trp Lys Pro
Lys Lys Gly Ser Pro Leu 355 360 365 Lys Arg Tyr Asp Val Lys Asn Ile
Arg Gln Phe Leu Leu Ser Ser Leu 370 375 380 Gly Gln Ala Ala Gly Ile
Ser Ala Gly Ile Ala Ser Ser Leu Glu Ala 385 390 395 400 Pro Asn Pro
Ser Gly Tyr Ser Leu Asp Thr Lys Glu Ala Tyr Arg Phe 405 410 415 Leu
Thr Glu Ser Ala Ala Asp Leu Ser Gln Ala Gly Phe Gly Leu Leu 420 425
430 Leu Pro Gly Trp Trp Thr Arg Lys Gly Thr Lys Thr His Leu Lys Ala
435 440 445 Gln Ala Asn Val Lys Gly Lys Lys Leu Lys Ala Gly Tyr Gly
Leu Thr 450 455 460 Leu Asp Lys Ile Val Ser Phe Asp Trp Glu Ile Ala
Leu Gly Asp Arg 465 470 475 480 Ala Leu Thr Val Arg Glu Leu Gln Ala
Leu Ala Lys Leu Lys Ala Pro 485 490 495 Leu Val Lys Phe Arg Gly Gln
Trp Val Glu Val Asn Asp Ala Glu Ile 500 505 510 Arg Ala Ala Leu Glu
Phe Trp Lys Lys Asn Pro His Gly Glu Ala Ser 515 520 525 Leu Arg Glu
Val Leu Lys Leu Ala Val Gly Val Ser Glu Lys Ala Asp 530 535 540 Gly
Val Asp Val Glu Gly Leu Asn Ala Ala Gly Trp Ile Glu Glu Leu 545 550
555 560 Ile Arg Arg Leu Lys Asp Lys Thr Gly Phe Glu Glu Leu Pro Ala
Pro 565 570 575 Asp Gly Phe Ser Gly Thr Leu Arg Pro Tyr Gln Phe Arg
Gly Tyr Ser 580 585 590 Trp Leu Ala Phe Leu Arg Gln Trp Gly Ile Gly
Ala Cys Leu Ala Asp 595 600 605 Asp Met Gly Leu Gly Lys Thr Ile Gln
Thr Leu Ala Leu Ile Gln His 610 615 620 Asp Leu Glu Gln Val Lys Gly
Gln Val Glu Glu Lys Val Ile Glu Asn 625 630 635 640 Ala Glu Glu Lys
Val Glu Gly Leu Lys Ala Ala Lys Pro Val Leu Leu 645 650 655 Val Cys
Pro Thr Ser Val Ile Asn Asn Trp Lys Lys Glu Ala Ala Arg 660 665 670
Phe Thr Pro Glu Leu Ser Val Met Val His His Gly Thr Ser Arg Lys 675
680 685 Lys Glu Glu Glu Phe Lys Lys Glu Ala Thr Asn His Ser Ile Val
Val 690 695 700 Ser Ser Tyr Gly Leu Leu Gln Arg Asp Leu Lys Phe Leu
Lys Gly Val 705 710 715 720 Ser Trp Ala Gly Val Val Leu Asp Glu Ala
Gln Asn Ile Lys Asn Pro 725 730 735 Glu Thr Lys Gln Ala Lys Ala Ala
Arg Ala Leu Glu Ala Asp Tyr Arg 740 745 750 Ile Ala Leu Thr Gly Thr
Pro Val Glu Asn Asn Val Gly Asp Leu Trp 755 760 765 Ser Ile Met Glu
Phe Leu Asn Pro Gly Phe Leu Gly Asn Gln Ala Gly 770 775 780 Phe Lys
Arg Asn Phe Phe Ile Pro Ile Gln Ala Glu Arg Asp Gln Glu 785 790 795
800 Ala Ala Arg Arg Leu Lys Glu Ile Thr Gly Pro Phe Ile Leu Arg Arg
805 810 815 Leu Lys Thr Asp Thr Ser Ile Ile Ser Asp Leu Pro Glu Lys
Met Glu 820 825 830 Met Lys Thr Tyr Cys Thr Leu Thr Lys Glu Gln Ala
Ser Leu Tyr Ala 835 840 845 Ala Val Leu Glu Asp Ile Glu Glu Thr Met
Glu Glu Ala Glu Glu Gly 850 855 860 Ile Gln Arg Lys Gly Ile Ile Leu
Ser Ala Leu Thr Arg Leu Lys Gln 865 870 875 880 Val Cys Asn His Pro
Ala Gln Phe Leu Lys Asp Asn Ser Ala Val Pro 885 890 895 Gly Arg Ser
Gly Lys Leu Ala Arg Leu Thr Glu Met Leu Asp Val Ile 900 905 910 Leu
Glu Asn Gly Glu Lys Ala Leu Val Phe Thr Gln Phe Ala Glu Met 915 920
925 Gly Lys Met Leu Lys Glu His Leu Gln Ala Ser Phe Gly Cys Glu Val
930 935 940 Leu Phe Leu His Gly Gly Val Pro Arg Lys Gln Arg Asp Arg
Met Leu 945 950 955 960 Glu Arg Phe Gln Glu Gly Lys Glu Tyr Leu Pro
Ile Phe Val Leu Ser 965 970 975 Leu Lys Ala Gly Gly Thr Gly Leu Asn
Leu Thr Gly Ala Asn His Val 980 985 990 Phe His Phe Asp Arg Trp Trp
Asn Pro Ala Val Glu Asn Gln Ala Thr 995 1000 1005 Asp Arg Ala Phe
Arg Ile Gly Gln Thr Lys Asn Val Glu Val His 1010 1015 1020 Lys Phe
Ile Cys Ala Gly Thr Leu Glu Glu Lys Ile Asp Glu Ile 1025 1030 1035
Ile Glu Arg Lys Val Gln Val Ala Glu Asn Val Val Gly Thr Gly 1040
1045 1050 Glu Gly Trp Leu Thr Glu Leu Ser Asn Glu Glu Leu Lys Asp
Ile 1055 1060 1065 Leu Ala Leu Arg Glu Glu Ala Val Gly Glu 1070
1075 453147DNAMethanospirillum hungatei 45gtgaccgcga aacgaccagc
accaatccac gataaagaag aagagaccat acccgatact 60tcgcttccgg tctttcatgc
cctgatttac ccggccgttg aaggggtagc gatatgtgcc 120gaatatataa
ctgataaacc tgcaccggtc aggaaaaaag gctacgcaaa ggataaacct
180ggcgaatatc catattccct ggatcatacc gcccttaaaa cgctcataga
gaactgtttt 240ggagcatatg atgacctgaa ggctaccaga tggattatct
atctccccgc tgaagaaacg 300gttcctcctt cctctcagtt ctcatcaaaa
aagaagccat caccaaagga gaaaaaactc 360ccccttgttc cgatgtatat
ccccgttctt ctctgcccgt atgaaacctt ttttcaaatc 420tggaaagccg
ctcagaatac agataaaaat tatattgctg gcgattcctt ccagtacatc
480tccattctga tggagagtac cgtccggctc atacaaaacg gacggttcaa
accatctcta 540gaacggacct ttgccggata tcatgccgta tgggtacctg
ccctttctcc tcaggatatg 600gaatgggtat cagatttttc aagccggatg
ccaacggtct gcaagtacgc tatcccccgg 660gtcgcaaaag atccctacat
ttataaacct gagaccagat tagagaaatt catcgttgag 720atgatgcggg
tgatcatccg tactgccctt ggtggttata cactgaaaga agagacagat
780cccttttatg aaccctcaga aaacgagatg cagttcatga ctgaccttct
cggggtaacc 840gacccaataa ggaacaaagg atttgagaga actttcttac
gggcgatgca ggactggctg 900accttctcaa gttcaggacg gtttgctccc
tttgagttct gcatgatcat aaaagatcca 960ccagaaggac agacagaacc
atgggatttc actctcgcgg tcagatcaga ggcagaacca 1020tctcttctca
tcccggcaga aataatctgg gaattgcctg atcaccagag cgggctcttc
1080ccccaggcag cctatctcaa acatatcctc cttgctggta tcgggctctt
gacctcatca 1140tcatcggcat tatggcgtcc cctgtccgga tcgaaaccca
ccgggggaag tatgaccctg 1200aaagaggctg caacgttctt gggttcagac
ctcgcaagag ccaggaggaa gggagtaacg 1260gtgctcctgc cagactggtg
gactgatacg acctatacac cacgggttga aatccatgca 1320aggcggcggg
atcccaccca tacgcagaca cggataggac tgcaggaact cctttctttt
1380gattaccgga ttgcaatcgg tgatgagtca ttttcaccgg atgagttctg
ggaaaaggta 1440aaagaaaagg ctccctttat ctggctgggg aaccggtgga
tatcctttca tccggatgcg 1500atacaacatg ccctggattc tttcagcagg
catcagagca aaggagggga tacaatagga 1560gatctgctcc ggctctccct
gaaaaaaatg gaggattccg cggtaccggt atcgattcat 1620gcaaaagatg
actgggttgc ggatcttctg gattttttca ggaccgaaac aaatcaggca
1680gttccagtcc caaagaaatt taaagggata ctcaggccat accaggaaga
ggggttctcc 1740ttcctttgtc aatgtaccag aaggggcttt ggagcctgcc
ttgcagatga catggggctt 1800ggaaaaactc cccagacact tgcatggctg
gtctatctca aggagaaaga aaaacccacg 1860actccgtccc tccttatatg
cccgatgtcg gttgttggga actgggagcg ggagatacag 1920cggtttgcgc
catcactccg ttcatgggtg catcatggga ctgaccgatg caaaggcgat
1980gattttgtga gacatgtcgg ttcatatgac ctggtcctga ccacctatca
tctggcagca 2040cgggacgtag accacctcaa aaccgttccc tggtctgcaa
tcattcttga cgaggcacaa 2100aatatcaaga acctccatgc aaaccagacc
gtagcagtca aatctctcac cggtgagaga 2160cgggttgctc tgaccggaac
cccggtggag aaccggttac tcgaactctg gtctatcatg 2220gactttttaa
atccaggata ccttggttca cagagtgcat ttacaaaccg ctattcccgc
2280ccgattgagc aggaaaaaaa tacggaactg atacaggaat taaggtccct
catccgtccg 2340ttcctgctca ggcggatgaa aacagacaag catgttatcg
atgatcttcc ggaaaagatg 2400gagaaccggg tatattgcac cctcacaccc
gaacaggcaa ccttatatca ggctgttgtg 2460cttgatatgg caaagaacct
tgataaagtg gagggtattg ccaggaaagg ggcaatcctt 2520gctgcgatca
cacgactgaa acagatctgt aaccatccgg gacgtgttgg cagggataaa
2580acaataaagg ctgagcggtc cgggaaggtg agccggctgc ttgagatgat
tgaggagatc 2640acttccgaag gggactcagc actcatattc agtcagtatg
caacatttgc tgaggaactg 2700gcagggatga tagagaaaca gggagatacg
cccgttcttc tcctgaccgg gtcaacacca 2760cggaaaaaac gggaacagat
gatagaggag tttcaggcct caaccacccc gataatcttt 2820gttatttctc
tgaaagccgg gggaacgggt ctgaacctga cgaaagcgac tcatgtgttt
2880catgtagacc ggtggtggaa tccggcggtt gaagaccagg ctactgaccg
gacgtaccgg 2940atcggacaaa agagaaatgt ccaagttcac ctgatgataa
ccgccggaac cctggaggaa 3000cggatagatc tgataaacca ggagaaacgg
acgcttgcaa aggaagtcct tgcacagagt 3060gatgagtatc tgacaaatct
ctcaacaaaa gaacttctgg agattgtatc acttcgtgac 3120agtctctttc
gcggggagga tgcatga 3147461048PRTMethanospirillum hungatei 46Val Thr
Ala Lys Arg Pro Ala Pro Ile His Asp Lys Glu Glu Glu Thr 1 5 10 15
Ile Pro Asp Thr Ser Leu Pro Val Phe His Ala Leu Ile Tyr Pro Ala 20
25 30 Val Glu Gly Val Ala Ile Cys Ala Glu Tyr Ile Thr Asp Lys Pro
Ala 35 40 45 Pro Val Arg Lys Lys Gly Tyr Ala Lys Asp Lys Pro Gly
Glu Tyr Pro 50 55 60 Tyr Ser Leu Asp His Thr Ala Leu Lys Thr Leu
Ile Glu Asn Cys Phe 65 70 75 80 Gly Ala Tyr Asp Asp Leu Lys Ala Thr
Arg Trp Ile Ile Tyr Leu Pro 85 90 95 Ala Glu Glu Thr Val Pro Pro
Ser Ser Gln Phe Ser Ser Lys Lys Lys 100 105 110 Pro Ser Pro Lys Glu
Lys Lys Leu Pro Leu Val Pro Met Tyr Ile Pro 115 120 125 Val Leu Leu
Cys Pro Tyr Glu Thr Phe Phe Gln Ile Trp Lys Ala Ala 130 135 140 Gln
Asn Thr Asp Lys Asn Tyr Ile Ala Gly Asp Ser Phe Gln Tyr Ile 145 150
155 160 Ser Ile Leu Met Glu Ser Thr Val Arg Leu Ile Gln Asn Gly Arg
Phe 165 170 175 Lys Pro Ser Leu Glu Arg Thr Phe Ala Gly Tyr His Ala
Val Trp Val 180 185 190 Pro Ala Leu Ser Pro Gln Asp Met Glu Trp Val
Ser Asp Phe Ser Ser 195 200 205 Arg Met Pro Thr Val Cys Lys Tyr Ala
Ile Pro Arg Val Ala Lys Asp 210 215 220 Pro Tyr Ile Tyr Lys Pro Glu
Thr Arg Leu Glu Lys Phe Ile Val Glu 225 230 235 240 Met Met Arg Val
Ile Ile Arg Thr Ala Leu Gly Gly Tyr Thr Leu Lys 245 250 255 Glu Glu
Thr Asp Pro Phe Tyr Glu Pro Ser Glu Asn Glu Met Gln Phe 260 265 270
Met Thr Asp Leu Leu Gly Val Thr Asp Pro Ile Arg Asn Lys Gly Phe 275
280
285 Glu Arg Thr Phe Leu Arg Ala Met Gln Asp Trp Leu Thr Phe Ser Ser
290 295 300 Ser Gly Arg Phe Ala Pro Phe Glu Phe Cys Met Ile Ile Lys
Asp Pro 305 310 315 320 Pro Glu Gly Gln Thr Glu Pro Trp Asp Phe Thr
Leu Ala Val Arg Ser 325 330 335 Glu Ala Glu Pro Ser Leu Leu Ile Pro
Ala Glu Ile Ile Trp Glu Leu 340 345 350 Pro Asp His Gln Ser Gly Leu
Phe Pro Gln Ala Ala Tyr Leu Lys His 355 360 365 Ile Leu Leu Ala Gly
Ile Gly Leu Leu Thr Ser Ser Ser Ser Ala Leu 370 375 380 Trp Arg Pro
Leu Ser Gly Ser Lys Pro Thr Gly Gly Ser Met Thr Leu 385 390 395 400
Lys Glu Ala Ala Thr Phe Leu Gly Ser Asp Leu Ala Arg Ala Arg Arg 405
410 415 Lys Gly Val Thr Val Leu Leu Pro Asp Trp Trp Thr Asp Thr Thr
Tyr 420 425 430 Thr Pro Arg Val Glu Ile His Ala Arg Arg Arg Asp Pro
Thr His Thr 435 440 445 Gln Thr Arg Ile Gly Leu Gln Glu Leu Leu Ser
Phe Asp Tyr Arg Ile 450 455 460 Ala Ile Gly Asp Glu Ser Phe Ser Pro
Asp Glu Phe Trp Glu Lys Val 465 470 475 480 Lys Glu Lys Ala Pro Phe
Ile Trp Leu Gly Asn Arg Trp Ile Ser Phe 485 490 495 His Pro Asp Ala
Ile Gln His Ala Leu Asp Ser Phe Ser Arg His Gln 500 505 510 Ser Lys
Gly Gly Asp Thr Ile Gly Asp Leu Leu Arg Leu Ser Leu Lys 515 520 525
Lys Met Glu Asp Ser Ala Val Pro Val Ser Ile His Ala Lys Asp Asp 530
535 540 Trp Val Ala Asp Leu Leu Asp Phe Phe Arg Thr Glu Thr Asn Gln
Ala 545 550 555 560 Val Pro Val Pro Lys Lys Phe Lys Gly Ile Leu Arg
Pro Tyr Gln Glu 565 570 575 Glu Gly Phe Ser Phe Leu Cys Gln Cys Thr
Arg Arg Gly Phe Gly Ala 580 585 590 Cys Leu Ala Asp Asp Met Gly Leu
Gly Lys Thr Pro Gln Thr Leu Ala 595 600 605 Trp Leu Val Tyr Leu Lys
Glu Lys Glu Lys Pro Thr Thr Pro Ser Leu 610 615 620 Leu Ile Cys Pro
Met Ser Val Val Gly Asn Trp Glu Arg Glu Ile Gln 625 630 635 640 Arg
Phe Ala Pro Ser Leu Arg Ser Trp Val His His Gly Thr Asp Arg 645 650
655 Cys Lys Gly Asp Asp Phe Val Arg His Val Gly Ser Tyr Asp Leu Val
660 665 670 Leu Thr Thr Tyr His Leu Ala Ala Arg Asp Val Asp His Leu
Lys Thr 675 680 685 Val Pro Trp Ser Ala Ile Ile Leu Asp Glu Ala Gln
Asn Ile Lys Asn 690 695 700 Leu His Ala Asn Gln Thr Val Ala Val Lys
Ser Leu Thr Gly Glu Arg 705 710 715 720 Arg Val Ala Leu Thr Gly Thr
Pro Val Glu Asn Arg Leu Leu Glu Leu 725 730 735 Trp Ser Ile Met Asp
Phe Leu Asn Pro Gly Tyr Leu Gly Ser Gln Ser 740 745 750 Ala Phe Thr
Asn Arg Tyr Ser Arg Pro Ile Glu Gln Glu Lys Asn Thr 755 760 765 Glu
Leu Ile Gln Glu Leu Arg Ser Leu Ile Arg Pro Phe Leu Leu Arg 770 775
780 Arg Met Lys Thr Asp Lys His Val Ile Asp Asp Leu Pro Glu Lys Met
785 790 795 800 Glu Asn Arg Val Tyr Cys Thr Leu Thr Pro Glu Gln Ala
Thr Leu Tyr 805 810 815 Gln Ala Val Val Leu Asp Met Ala Lys Asn Leu
Asp Lys Val Glu Gly 820 825 830 Ile Ala Arg Lys Gly Ala Ile Leu Ala
Ala Ile Thr Arg Leu Lys Gln 835 840 845 Ile Cys Asn His Pro Gly Arg
Val Gly Arg Asp Lys Thr Ile Lys Ala 850 855 860 Glu Arg Ser Gly Lys
Val Ser Arg Leu Leu Glu Met Ile Glu Glu Ile 865 870 875 880 Thr Ser
Glu Gly Asp Ser Ala Leu Ile Phe Ser Gln Tyr Ala Thr Phe 885 890 895
Ala Glu Glu Leu Ala Gly Met Ile Glu Lys Gln Gly Asp Thr Pro Val 900
905 910 Leu Leu Leu Thr Gly Ser Thr Pro Arg Lys Lys Arg Glu Gln Met
Ile 915 920 925 Glu Glu Phe Gln Ala Ser Thr Thr Pro Ile Ile Phe Val
Ile Ser Leu 930 935 940 Lys Ala Gly Gly Thr Gly Leu Asn Leu Thr Lys
Ala Thr His Val Phe 945 950 955 960 His Val Asp Arg Trp Trp Asn Pro
Ala Val Glu Asp Gln Ala Thr Asp 965 970 975 Arg Thr Tyr Arg Ile Gly
Gln Lys Arg Asn Val Gln Val His Leu Met 980 985 990 Ile Thr Ala Gly
Thr Leu Glu Glu Arg Ile Asp Leu Ile Asn Gln Glu 995 1000 1005 Lys
Arg Thr Leu Ala Lys Glu Val Leu Ala Gln Ser Asp Glu Tyr 1010 1015
1020 Leu Thr Asn Leu Ser Thr Lys Glu Leu Leu Glu Ile Val Ser Leu
1025 1030 1035 Arg Asp Ser Leu Phe Arg Gly Glu Asp Ala 1040 1045
473270DNAMethanosarcina mazei 47atgataattc ttcatgcagg aagagttgga
aaacagttct tcttatgggg tgaaagcccg 60gcagaaaatg aaactccggt tgttcggcgc
gggagaaagc ctaaaacccc tatcgtaaaa 120ccttaccctt acgattcggg
ctttgaaaac ctgtcttctg cccttgagct gctgctgggc 180agtactgacc
ggaaaaaggc ggagaaaatc aacgtctgga ccccaactat cggagggaat
240cctgtccctt ccagccctct tgttgctgaa atttcggatt cgaaagcaga
acctgcactg 300gctccctgta ctgttcacgc atatcctctg gaagctgaag
aagctattgt tctcctctgc 360acctgtatgg aaaaaaaggt tctggctccc
ggtatcatct cgggaaatga ccttctctgg 420tgggcagatg ccctgaaatt
tgcaggctcg ctggtagcag ggcagaaata tttgcctggc 480gtcaggggcg
gggaaggaga gtacagggct ttctgggaac ccgtattttc cggcgaagat
540gccggaaagc tggcaaaact tgcaaagcaa atgcctcctg ctgcaagggc
tcttgctcct 600gaagcctctt ccatgccgcc ggaaatgcct gctgctttag
cggcaaagca gtttattgaa 660gactctctcg actggatagt ccggtccgag
atcggggaaa aaaagcttgc aaaagagacg 720cgcaaaagaa aatcctttga
tagcgtccat gatgcctggg tttctgctct tagaagccct 780gaagggctga
tctatggaga cgaaaacgaa cttctgcagc ttgcggcccg gacccgcgaa
840tggcagcgcc cactcaccat ccttaccact tctcctttca ggttctgttt
ccgtcttgaa 900gaaccggctt tagaagaaga gatcgaagaa actgaagaaa
ccgaagaaat agaagaaaat 960gaagccggga aaagagatac taaaaaaggc
agggaaggga tagctgatat agaagttccc 1020gaagggctct ggtacgtccg
ttatatgctt cagtcctacg aagacccgag ccttctgatc 1080cctgtaaaag
aagcctggaa gccaaaaaaa ggcagcccgt tgaaaaaata cgatgtgaaa
1140aacattcgcc aattcctgtt atcttccctt ggacaggctt ccagtataag
tgcaggaatt 1200gcttcgagtc ttgaagctcc caacccatct ggatattccc
ttgatactaa agaggcttac 1260cgctttctga ctgaaagtgc agcgaattta
agtcaggccg gtttcggggt acttctccct 1320ggctggtgga cccgtaaagg
tacaaagaca cacttaaaag cccaggctaa tgttaagggc 1380aagaagaagt
tgcaggccgg atacgggctt acactcgatg aaatcgtcag ctttgactgg
1440gaaatcgccc ttggagacag ggtactgaca gtcagagaac tgcaggctct
tgcaaagctt 1500aaagctccgc ttgtgaaatt ccgcgggcag tgggttgagg
taaacgatgc ggaaatcagg 1560gctgcccttg agttctggaa gaaaaatccc
aacggtgaag caagtctgcg tgaagttcta 1620aaactggcag tgggagtttc
cgaaaaagcc gatggtgtga acgttgaagg gctcaatgca 1680accggatgga
ttggagaatt aatcagccgc ttaaaagaca aaaccgggtt tgaagaactt
1740cctgctccca acggcttttc aggcaccctt cggccatatc agttcagagg
ttactcctgg 1800ctggcttttc tgaggcagtg gggtatagga gcctgccttg
cagacgatat ggggcttggt 1860aaaaccgtcc agactcttgc tcttattcag
cacgatctgg aacaggctaa agagaaagct 1920gaagaaaaga ttgaagaacc
ggctgaagaa aagattgaag aaaaagttga cggacgtaag 1980gccccaaaac
ctgttcttct ggtttgtcct acctctgtta tcaacaactg gaaaaaagag
2040gcttcccgct ttacgccaga actttcggta atggtccacc acgggaccag
ccggaaaaag 2100gaagaggaat tcaagaagga agccatgaat catgctattg
tcatctcaag ctatggcctt 2160gtgcagcggg atcttaaatt tttaaaagag
gttcattggg caggagttgt acttgacgaa 2220gcccagaaca tcaaaaaccc
ggaaaccaaa caggcaaagg cagccagggc tcttgaatcc 2280gattaccgct
tagctcttac agggactccg gttgaaaata acgtgggaga cctctggtcc
2340ataatggagt ttttaaaccc cggcttcctc ggaagtcagg cgggtttcaa
gcggaatttc 2400tttatcccca ttcaggcaga aagggatcag gaggctgcaa
ggaggctgaa agaaattaca 2460ggtcccttca tccttcgccg tttgaagact
gacacttcga ttatctccga cctgccggaa 2520aaaatggaga tgaagaccta
ttgtacgctg acaaaagaac aggcctccct ctatgctgca 2580gtccttgaag
acatcagaga agcgattgaa ggagccgaag aaggcatcca gaggaaaggt
2640ataatcctgt ctgccctttc caggctcaag caggtctgca accaccctgc
gcagtttttg 2700aaggacaact ccactatccc cggcaggtcc ggaaaactcg
caaggcttac cgaaatgctg 2760gatgtagtcc tggaaaacgg ggaaaaagcc
cttgttttta cccagtttgc ggagatgggc 2820aaaatggtga aagaacacct
gcaagcaagc tttggctgtg aagtcctttt cctgcacggc 2880ggggtcccca
ggaagcagag agaccggatg cttgagaggt tccaggaagg aaaagaatac
2940ctccctattt ttgtcctctc ccttaaagcc ggcggcacgg ggcttaacct
cacaggggca 3000aaccacgttt tccactttga tcgctggtgg aacccggctg
ttgaaaacca ggctacagac 3060agggcattcc gtataggcca gaagaaaaac
gttgaggtcc ataaattcat ctgcgcaggt 3120acgcttgaag aaaaaatcga
tgagattatc gaacgcaaag tgcaggtcgc agagaacgtt 3180gttgggacag
gtgaagactg gctgacagag ctttccaacg atgaactgaa ggatattctt
3240gctcttagag aagaagcggt aggtgaataa 3270481089PRTMethanosarcina
mazei 48Met Ile Ile Leu His Ala Gly Arg Val Gly Lys Gln Phe Phe Leu
Trp 1 5 10 15 Gly Glu Ser Pro Ala Glu Asn Glu Thr Pro Val Val Arg
Arg Gly Arg 20 25 30 Lys Pro Lys Thr Pro Ile Val Lys Pro Tyr Pro
Tyr Asp Ser Gly Phe 35 40 45 Glu Asn Leu Ser Ser Ala Leu Glu Leu
Leu Leu Gly Ser Thr Asp Arg 50 55 60 Lys Lys Ala Glu Lys Ile Asn
Val Trp Thr Pro Thr Ile Gly Gly Asn 65 70 75 80 Pro Val Pro Ser Ser
Pro Leu Val Ala Glu Ile Ser Asp Ser Lys Ala 85 90 95 Glu Pro Ala
Leu Ala Pro Cys Thr Val His Ala Tyr Pro Leu Glu Ala 100 105 110 Glu
Glu Ala Ile Val Leu Leu Cys Thr Cys Met Glu Lys Lys Val Leu 115 120
125 Ala Pro Gly Ile Ile Ser Gly Asn Asp Leu Leu Trp Trp Ala Asp Ala
130 135 140 Leu Lys Phe Ala Gly Ser Leu Val Ala Gly Gln Lys Tyr Leu
Pro Gly 145 150 155 160 Val Arg Gly Gly Glu Gly Glu Tyr Arg Ala Phe
Trp Glu Pro Val Phe 165 170 175 Ser Gly Glu Asp Ala Gly Lys Leu Ala
Lys Leu Ala Lys Gln Met Pro 180 185 190 Pro Ala Ala Arg Ala Leu Ala
Pro Glu Ala Ser Ser Met Pro Pro Glu 195 200 205 Met Pro Ala Ala Leu
Ala Ala Lys Gln Phe Ile Glu Asp Ser Leu Asp 210 215 220 Trp Ile Val
Arg Ser Glu Ile Gly Glu Lys Lys Leu Ala Lys Glu Thr 225 230 235 240
Arg Lys Arg Lys Ser Phe Asp Ser Val His Asp Ala Trp Val Ser Ala 245
250 255 Leu Arg Ser Pro Glu Gly Leu Ile Tyr Gly Asp Glu Asn Glu Leu
Leu 260 265 270 Gln Leu Ala Ala Arg Thr Arg Glu Trp Gln Arg Pro Leu
Thr Ile Leu 275 280 285 Thr Thr Ser Pro Phe Arg Phe Cys Phe Arg Leu
Glu Glu Pro Ala Leu 290 295 300 Glu Glu Glu Ile Glu Glu Thr Glu Glu
Thr Glu Glu Ile Glu Glu Asn 305 310 315 320 Glu Ala Gly Lys Arg Asp
Thr Lys Lys Gly Arg Glu Gly Ile Ala Asp 325 330 335 Ile Glu Val Pro
Glu Gly Leu Trp Tyr Val Arg Tyr Met Leu Gln Ser 340 345 350 Tyr Glu
Asp Pro Ser Leu Leu Ile Pro Val Lys Glu Ala Trp Lys Pro 355 360 365
Lys Lys Gly Ser Pro Leu Lys Lys Tyr Asp Val Lys Asn Ile Arg Gln 370
375 380 Phe Leu Leu Ser Ser Leu Gly Gln Ala Ser Ser Ile Ser Ala Gly
Ile 385 390 395 400 Ala Ser Ser Leu Glu Ala Pro Asn Pro Ser Gly Tyr
Ser Leu Asp Thr 405 410 415 Lys Glu Ala Tyr Arg Phe Leu Thr Glu Ser
Ala Ala Asn Leu Ser Gln 420 425 430 Ala Gly Phe Gly Val Leu Leu Pro
Gly Trp Trp Thr Arg Lys Gly Thr 435 440 445 Lys Thr His Leu Lys Ala
Gln Ala Asn Val Lys Gly Lys Lys Lys Leu 450 455 460 Gln Ala Gly Tyr
Gly Leu Thr Leu Asp Glu Ile Val Ser Phe Asp Trp 465 470 475 480 Glu
Ile Ala Leu Gly Asp Arg Val Leu Thr Val Arg Glu Leu Gln Ala 485 490
495 Leu Ala Lys Leu Lys Ala Pro Leu Val Lys Phe Arg Gly Gln Trp Val
500 505 510 Glu Val Asn Asp Ala Glu Ile Arg Ala Ala Leu Glu Phe Trp
Lys Lys 515 520 525 Asn Pro Asn Gly Glu Ala Ser Leu Arg Glu Val Leu
Lys Leu Ala Val 530 535 540 Gly Val Ser Glu Lys Ala Asp Gly Val Asn
Val Glu Gly Leu Asn Ala 545 550 555 560 Thr Gly Trp Ile Gly Glu Leu
Ile Ser Arg Leu Lys Asp Lys Thr Gly 565 570 575 Phe Glu Glu Leu Pro
Ala Pro Asn Gly Phe Ser Gly Thr Leu Arg Pro 580 585 590 Tyr Gln Phe
Arg Gly Tyr Ser Trp Leu Ala Phe Leu Arg Gln Trp Gly 595 600 605 Ile
Gly Ala Cys Leu Ala Asp Asp Met Gly Leu Gly Lys Thr Val Gln 610 615
620 Thr Leu Ala Leu Ile Gln His Asp Leu Glu Gln Ala Lys Glu Lys Ala
625 630 635 640 Glu Glu Lys Ile Glu Glu Pro Ala Glu Glu Lys Ile Glu
Glu Lys Val 645 650 655 Asp Gly Arg Lys Ala Pro Lys Pro Val Leu Leu
Val Cys Pro Thr Ser 660 665 670 Val Ile Asn Asn Trp Lys Lys Glu Ala
Ser Arg Phe Thr Pro Glu Leu 675 680 685 Ser Val Met Val His His Gly
Thr Ser Arg Lys Lys Glu Glu Glu Phe 690 695 700 Lys Lys Glu Ala Met
Asn His Ala Ile Val Ile Ser Ser Tyr Gly Leu 705 710 715 720 Val Gln
Arg Asp Leu Lys Phe Leu Lys Glu Val His Trp Ala Gly Val 725 730 735
Val Leu Asp Glu Ala Gln Asn Ile Lys Asn Pro Glu Thr Lys Gln Ala 740
745 750 Lys Ala Ala Arg Ala Leu Glu Ser Asp Tyr Arg Leu Ala Leu Thr
Gly 755 760 765 Thr Pro Val Glu Asn Asn Val Gly Asp Leu Trp Ser Ile
Met Glu Phe 770 775 780 Leu Asn Pro Gly Phe Leu Gly Ser Gln Ala Gly
Phe Lys Arg Asn Phe 785 790 795 800 Phe Ile Pro Ile Gln Ala Glu Arg
Asp Gln Glu Ala Ala Arg Arg Leu 805 810 815 Lys Glu Ile Thr Gly Pro
Phe Ile Leu Arg Arg Leu Lys Thr Asp Thr 820 825 830 Ser Ile Ile Ser
Asp Leu Pro Glu Lys Met Glu Met Lys Thr Tyr Cys 835 840 845 Thr Leu
Thr Lys Glu Gln Ala Ser Leu Tyr Ala Ala Val Leu Glu Asp 850 855 860
Ile Arg Glu Ala Ile Glu Gly Ala Glu Glu Gly Ile Gln Arg Lys Gly 865
870 875 880 Ile Ile Leu Ser Ala Leu Ser Arg Leu Lys Gln Val Cys Asn
His Pro 885 890 895 Ala Gln Phe Leu Lys Asp Asn Ser Thr Ile Pro Gly
Arg Ser Gly Lys 900 905 910 Leu Ala Arg Leu Thr Glu Met Leu Asp Val
Val Leu Glu Asn Gly Glu 915 920 925 Lys Ala Leu Val Phe Thr Gln Phe
Ala Glu Met Gly Lys Met Val Lys 930 935 940 Glu His Leu Gln Ala Ser
Phe Gly Cys Glu Val Leu Phe Leu His Gly 945 950 955 960 Gly Val Pro
Arg Lys Gln Arg Asp Arg Met Leu Glu Arg Phe Gln Glu 965 970 975 Gly
Lys Glu Tyr Leu Pro Ile Phe Val Leu Ser Leu Lys Ala Gly Gly 980 985
990 Thr Gly Leu Asn Leu Thr Gly Ala Asn His Val Phe His Phe Asp Arg
995 1000 1005 Trp Trp Asn Pro Ala Val Glu Asn Gln Ala Thr Asp Arg
Ala Phe 1010 1015 1020 Arg Ile Gly Gln Lys Lys Asn Val
Glu Val His Lys Phe Ile Cys 1025 1030 1035 Ala Gly Thr Leu Glu Glu
Lys Ile Asp Glu Ile Ile Glu Arg Lys 1040 1045 1050 Val Gln Val Ala
Glu Asn Val Val Gly Thr Gly Glu Asp Trp Leu 1055 1060 1065 Thr Glu
Leu Ser Asn Asp Glu Leu Lys Asp Ile Leu Ala Leu Arg 1070 1075 1080
Glu Glu Ala Val Gly Glu 1085 493042DNAMycobacterium bovis
49atgctggttt tgcacggctt ctggtccaac tccggcggga tgcggctgtg ggcggaggac
60tccgatctgc tggtgaagag cccgagtcag gcgctgcgct ccgcgcggcc acacccgttc
120gcggcgcccg ctgacctgat cgccggcata catccgggca aacccgcaac
cgccgttttg 180ctgttgccgt cgttgcgatc ggcgccgctg gactcgccgg
agctgatccg gctcgccccg 240cgcccggccg cgcgaaccga tccgatgctg
ttggcgtgga cggtaccggt ggtggacctg 300gaccccaccg cggcgttggc
cgccttcgac cagcccgccc ccgacgtccg ctacggcgcg 360tccgtcgact
acctggccga gctggccgtt ttcgcgcgcg agttggtcga gcgtggtcgc
420gtgctgcccc agctgcgccg cgacacccac ggcgcggccg cctgctggcg
tccggtgttg 480cagggacgcg acgtggtcgc gatgacctcg ctggtctcgg
cgatgccgcc ggtctgccgc 540gccgaagttg gtgggcacga cccgcacgaa
ctggcaacct cggctctgga cgcgatggtc 600gacgccgccg tgcgcgcggc
gctgtcaccg atggacctgc tgcccccgcg acggggtcgc 660tccaaacggc
atcgggccgt ggaggcttgg ctgaccgcgt tgacctgccc ggacggccgg
720ttcgacgcgg agcccgacga actcgacgcg ctggccgagg cgttgcggcc
atgggacgac 780gtcggtatcg gcaccgtcgg cccggcgcgg gcgacgtttc
ggctgtccga agtcgagacc 840gaaaacgagg agacgcccgc gggctcgttg
tggaggctgg agttcttatt gcagtcgacg 900caggacccca gcctgctggt
ccccgccgag caggcatgga acgacgacgg cagcctgcgc 960cgctggctgg
accggccgca ggagctgctg ctgaccgaac tgggccgggc ctctcggatt
1020ttccccgagc tcgtcccggc gctgcgcacc gcgtgcccgt ccgggcttga
gctcgacgcc 1080gacggcgcct accgattcct gtcgggtacg gccgcggtgc
tcgacgaggc tgggtttggc 1140gtgctgctgc cgtcctggtg ggaccgccgc
cgcaagctgg gcttggtcct gtccgcatat 1200accccggtcg acggcgtggt
gggcaaggcc agcaagttcg gccgcgagca gctcgtcgag 1260ttccgctggg
agctggccgt gggcgacgat ccgctcagcg aggaggagat cgcggcgctg
1320accgaaacca agtccccgct gatccggctg cgtggccagt gggtggcgct
cgataccgaa 1380cagctgcgcc gcgggctgga gtttttggag cgtaagccaa
ccggccgcaa gaccaccgcc 1440gagatcctcg cgctggccgc cagccacccc
gacgacgtgg acaccccgct cgaggtcacc 1500gccgtacgcg ccgacggctg
gctcggggac ctgctcgccg gggccgccgc ggcgtcgctg 1560cagccgttgg
acccgcccga cggattcacc gcgacgctgc gtccctacca gcagcgcggt
1620ctggcgtggc tggcgttttt gtcctcgctc ggtttgggca gctgcctggc
cgacgacatg 1680ggcctgggca agacggtgca gctattggcc ctggaaacct
tggaatccgt tcagcgccac 1740caggatcgcg gcgtcggacc cacactgcta
ctgtgcccga tgtcgttggt gggcaactgg 1800cagcaggaag cggccaggtt
tgcacccaac ctgcgggtgt acgcccacca cgggggcgcc 1860cggctgcacg
gcgaggcgtt gcgcgaccac ctcgagcgca ccgacctggt cgtgagcacc
1920tataccaccg ccacccgcga catcgacgag ctgtcggaat acgaatggaa
ccgggtggtg 1980ctggacgagg cccaggcggt gaagaacagc ctgtcccggg
cggccaaggc ggtgcgacgg 2040ctacgcgcgg cgcaccgggt cgcgctgacc
gggacaccga tggagaaccg gctcgccgag 2100ctgtggtcga tcatggactt
cctcaacccg ggcctgctcg gatcctccga acgcttccgc 2160acccgctacg
cgatcccgat cgagcggcac gggcacaccg aaccggccga acggctgcgc
2220gcatcgacgc ggccctacat cctgcgccgg ctcaagaccg acccggcgat
catcgacgat 2280ctgccggaga agatcgagat caagcagtac tgccaactca
ccaccgagca ggcgtcgctg 2340tatcaggccg tcgtcgccga catgatggaa
aagatcgaaa acaccgaagg gatcgagcgg 2400cgcggcaacg tgctggccgc
gatggccaag ctcaaacagg tgtgcaacca ccccgcccag 2460ctgctgcacg
atcgctcccc ggtcggtcgg cggtccggga aggtgatccg gctcgaggag
2520atcctggaag agatcctggc cgagggcgac cgggtgctgt gttttaccca
gttcaccgag 2580ttcgccgagc tgctggtgcc gcacctggcc gcacgcttcg
gccgtgccgc ccgagacatt 2640gcctacctgc acggtggcac cccgaggaag
cggcgtgacg agatggtggc ccggttccag 2700tccggtgacg gcccgcccat
ttttctgctg tcgttgaagg cgggcggtac cgggctgaac 2760ctcaccgccg
ccaatcatgt tgtgcacctg gaccgctggt ggaacccggc ggtcgagaac
2820caggcgacgg accgggcgtt tcggatcggg cagcggcgca cggtgcaggt
ccgcaagttc 2880atctgcaccg gcaccctcga ggagaagatc gacgaaatga
tcgaggagaa aaaggcgctg 2940gccgacttgg tggtcaccga cggcgaaggc
tggctgaccg aactgtccac ccgcgatctg 3000cgcgaggtgt tcgcgctgtc
cgaaggcgcc gtcggtgagt ag 3042501013PRTMycobacterium bovis 50Met Leu
Val Leu His Gly Phe Trp Ser Asn Ser Gly Gly Met Arg Leu 1 5 10 15
Trp Ala Glu Asp Ser Asp Leu Leu Val Lys Ser Pro Ser Gln Ala Leu 20
25 30 Arg Ser Ala Arg Pro His Pro Phe Ala Ala Pro Ala Asp Leu Ile
Ala 35 40 45 Gly Ile His Pro Gly Lys Pro Ala Thr Ala Val Leu Leu
Leu Pro Ser 50 55 60 Leu Arg Ser Ala Pro Leu Asp Ser Pro Glu Leu
Ile Arg Leu Ala Pro 65 70 75 80 Arg Pro Ala Ala Arg Thr Asp Pro Met
Leu Leu Ala Trp Thr Val Pro 85 90 95 Val Val Asp Leu Asp Pro Thr
Ala Ala Leu Ala Ala Phe Asp Gln Pro 100 105 110 Ala Pro Asp Val Arg
Tyr Gly Ala Ser Val Asp Tyr Leu Ala Glu Leu 115 120 125 Ala Val Phe
Ala Arg Glu Leu Val Glu Arg Gly Arg Val Leu Pro Gln 130 135 140 Leu
Arg Arg Asp Thr His Gly Ala Ala Ala Cys Trp Arg Pro Val Leu 145 150
155 160 Gln Gly Arg Asp Val Val Ala Met Thr Ser Leu Val Ser Ala Met
Pro 165 170 175 Pro Val Cys Arg Ala Glu Val Gly Gly His Asp Pro His
Glu Leu Ala 180 185 190 Thr Ser Ala Leu Asp Ala Met Val Asp Ala Ala
Val Arg Ala Ala Leu 195 200 205 Ser Pro Met Asp Leu Leu Pro Pro Arg
Arg Gly Arg Ser Lys Arg His 210 215 220 Arg Ala Val Glu Ala Trp Leu
Thr Ala Leu Thr Cys Pro Asp Gly Arg 225 230 235 240 Phe Asp Ala Glu
Pro Asp Glu Leu Asp Ala Leu Ala Glu Ala Leu Arg 245 250 255 Pro Trp
Asp Asp Val Gly Ile Gly Thr Val Gly Pro Ala Arg Ala Thr 260 265 270
Phe Arg Leu Ser Glu Val Glu Thr Glu Asn Glu Glu Thr Pro Ala Gly 275
280 285 Ser Leu Trp Arg Leu Glu Phe Leu Leu Gln Ser Thr Gln Asp Pro
Ser 290 295 300 Leu Leu Val Pro Ala Glu Gln Ala Trp Asn Asp Asp Gly
Ser Leu Arg 305 310 315 320 Arg Trp Leu Asp Arg Pro Gln Glu Leu Leu
Leu Thr Glu Leu Gly Arg 325 330 335 Ala Ser Arg Ile Phe Pro Glu Leu
Val Pro Ala Leu Arg Thr Ala Cys 340 345 350 Pro Ser Gly Leu Glu Leu
Asp Ala Asp Gly Ala Tyr Arg Phe Leu Ser 355 360 365 Gly Thr Ala Ala
Val Leu Asp Glu Ala Gly Phe Gly Val Leu Leu Pro 370 375 380 Ser Trp
Trp Asp Arg Arg Arg Lys Leu Gly Leu Val Leu Ser Ala Tyr 385 390 395
400 Thr Pro Val Asp Gly Val Val Gly Lys Ala Ser Lys Phe Gly Arg Glu
405 410 415 Gln Leu Val Glu Phe Arg Trp Glu Leu Ala Val Gly Asp Asp
Pro Leu 420 425 430 Ser Glu Glu Glu Ile Ala Ala Leu Thr Glu Thr Lys
Ser Pro Leu Ile 435 440 445 Arg Leu Arg Gly Gln Trp Val Ala Leu Asp
Thr Glu Gln Leu Arg Arg 450 455 460 Gly Leu Glu Phe Leu Glu Arg Lys
Pro Thr Gly Arg Lys Thr Thr Ala 465 470 475 480 Glu Ile Leu Ala Leu
Ala Ala Ser His Pro Asp Asp Val Asp Thr Pro 485 490 495 Leu Glu Val
Thr Ala Val Arg Ala Asp Gly Trp Leu Gly Asp Leu Leu 500 505 510 Ala
Gly Ala Ala Ala Ala Ser Leu Gln Pro Leu Asp Pro Pro Asp Gly 515 520
525 Phe Thr Ala Thr Leu Arg Pro Tyr Gln Gln Arg Gly Leu Ala Trp Leu
530 535 540 Ala Phe Leu Ser Ser Leu Gly Leu Gly Ser Cys Leu Ala Asp
Asp Met 545 550 555 560 Gly Leu Gly Lys Thr Val Gln Leu Leu Ala Leu
Glu Thr Leu Glu Ser 565 570 575 Val Gln Arg His Gln Asp Arg Gly Val
Gly Pro Thr Leu Leu Leu Cys 580 585 590 Pro Met Ser Leu Val Gly Asn
Trp Gln Gln Glu Ala Ala Arg Phe Ala 595 600 605 Pro Asn Leu Arg Val
Tyr Ala His His Gly Gly Ala Arg Leu His Gly 610 615 620 Glu Ala Leu
Arg Asp His Leu Glu Arg Thr Asp Leu Val Val Ser Thr 625 630 635 640
Tyr Thr Thr Ala Thr Arg Asp Ile Asp Glu Leu Ser Glu Tyr Glu Trp 645
650 655 Asn Arg Val Val Leu Asp Glu Ala Gln Ala Val Lys Asn Ser Leu
Ser 660 665 670 Arg Ala Ala Lys Ala Val Arg Arg Leu Arg Ala Ala His
Arg Val Ala 675 680 685 Leu Thr Gly Thr Pro Met Glu Asn Arg Leu Ala
Glu Leu Trp Ser Ile 690 695 700 Met Asp Phe Leu Asn Pro Gly Leu Leu
Gly Ser Ser Glu Arg Phe Arg 705 710 715 720 Thr Arg Tyr Ala Ile Pro
Ile Glu Arg His Gly His Thr Glu Pro Ala 725 730 735 Glu Arg Leu Arg
Ala Ser Thr Arg Pro Tyr Ile Leu Arg Arg Leu Lys 740 745 750 Thr Asp
Pro Ala Ile Ile Asp Asp Leu Pro Glu Lys Ile Glu Ile Lys 755 760 765
Gln Tyr Cys Gln Leu Thr Thr Glu Gln Ala Ser Leu Tyr Gln Ala Val 770
775 780 Val Ala Asp Met Met Glu Lys Ile Glu Asn Thr Glu Gly Ile Glu
Arg 785 790 795 800 Arg Gly Asn Val Leu Ala Ala Met Ala Lys Leu Lys
Gln Val Cys Asn 805 810 815 His Pro Ala Gln Leu Leu His Asp Arg Ser
Pro Val Gly Arg Arg Ser 820 825 830 Gly Lys Val Ile Arg Leu Glu Glu
Ile Leu Glu Glu Ile Leu Ala Glu 835 840 845 Gly Asp Arg Val Leu Cys
Phe Thr Gln Phe Thr Glu Phe Ala Glu Leu 850 855 860 Leu Val Pro His
Leu Ala Ala Arg Phe Gly Arg Ala Ala Arg Asp Ile 865 870 875 880 Ala
Tyr Leu His Gly Gly Thr Pro Arg Lys Arg Arg Asp Glu Met Val 885 890
895 Ala Arg Phe Gln Ser Gly Asp Gly Pro Pro Ile Phe Leu Leu Ser Leu
900 905 910 Lys Ala Gly Gly Thr Gly Leu Asn Leu Thr Ala Ala Asn His
Val Val 915 920 925 His Leu Asp Arg Trp Trp Asn Pro Ala Val Glu Asn
Gln Ala Thr Asp 930 935 940 Arg Ala Phe Arg Ile Gly Gln Arg Arg Thr
Val Gln Val Arg Lys Phe 945 950 955 960 Ile Cys Thr Gly Thr Leu Glu
Glu Lys Ile Asp Glu Met Ile Glu Glu 965 970 975 Lys Lys Ala Leu Ala
Asp Leu Val Val Thr Asp Gly Glu Gly Trp Leu 980 985 990 Thr Glu Leu
Ser Thr Arg Asp Leu Arg Glu Val Phe Ala Leu Ser Glu 995 1000 1005
Gly Ala Val Gly Glu 1010 513042DNAMycobacterium tuberculosis
51atgctggttt tgcacggctt ctggtccaac tccggcggga tgcggctgtg ggcggaggac
60tccgatctgc tggtgaagag cccgagtcag gcgctgcgct ccgcgcggcc acacccgttc
120gcggcgcccg ctgacctgat cgccggcata catccgggca aacccgcaac
cgccgttttg 180ctgttgccgt cgttgcgatc ggcgccgctg gactcgccgg
agctgatccg gctcgccccg 240cgcccggccg cgcgaaccga tccgatgctg
ttggcgtgga cggtaccggt ggtggacctg 300gaccccaccg cggcgttggc
cgccttcgac cagcccgccc ccgacgtccg ctacggcgcg 360tccgtcgact
acctggccga gctggccgtt ttcgcgcgcg agttggtcga gcgtggtcgc
420gtgctgcccc agctgcgccg cgacacccac ggcgcggccg cctgctggcg
tccggtgttg 480cagggacgcg acgtggtcgc gatgacctcg ctggtctcgg
cgatgccgcc ggtctgccgc 540gccgaagttg gtgggcacga cccgcacgaa
ctggcaacct cggctctgga cgcgatggtc 600gacgccgccg tgcgcgcggc
gctgtcaccg atggacctgc tgcccccgcg acggggtcgc 660tccaaacggc
atcgggccgt ggaggcttgg ctgaccgcgt tgacctgccc ggacggccgg
720ttcgacgcgg agcccgacga actcgacgcg ctggccgagg cgttgcggcc
atgggacgac 780gtcggtatcg gcaccgtcgg cccggcgcgg gcgacgtttc
ggctgtccga agtcgagacc 840gaaaacgagg agacgcccgc gggctcgttg
tggaggctgg agttcttatt gcagtcgacg 900caggacccca gcctgctggt
ccccgccgag caggcatgga acgacgacgg cagcctgcgc 960cgctggctgg
accggccgca ggagctgctg ctgaccgaac tgggccgggc ctctcggatt
1020ttccccgagc tcgtcccggc gctgcgcacc gcgtgcccgt ccgggcttga
gctcgacgcc 1080gacggcgcct accgattcct gtcgggtacg gccgcggtgc
tcgacgaggc tgggtttggc 1140gtgctgctgc cgtcctggtg ggaccgccgc
cgcaagctgg gcttggtcct gtccgcatat 1200accccggtcg acggcgtggt
gggcaaggcc agcaagttcg gccgcgagca gctcgtcgag 1260ttccgctggg
agctggccgt gggcgacgat ccgctcagcg aggaggagat cgcggcgctg
1320accgaaacca agtccccgct gatccggctg cgtggccagt gggtcgcgct
cgataccgaa 1380cagatgcgcc gcgggctgga gtttttggag cgtaagccaa
ccggccgcaa gaccaccgcc 1440gagatcctcg cgctggccgc cagccacccc
gacgacgtgg acaccccgct cgaggtcacc 1500gccgtacgcg ccgacggctg
gctcggggac ctgctcgccg gggccgccgc ggcgtcgctg 1560cagccgttgg
acccgcccga cggattcacc gcgacgctgc gtccctacca gcagcgcggt
1620ctggcgtggc tggcgttttt gtcctcgctc ggtttgggca gctgcctggc
cgacgacatg 1680ggcctgggca agacggtgca gctattggcc ctggaaacct
tggaatccgt tcagcgccac 1740caggatcgcg gcgtcggacc cacactgcta
ctgtgcccga tgtcgttggt gggcaactgg 1800ccgcaggaag cggccaggtt
tgcacccaac ctgcgggtgt acgcccacca cgggggcgcc 1860cggctgcacg
gcgaggcgtt gcgcgaccac ctcgagcgca ccgacctggt cgtgagcacc
1920tataccaccg ccacccgcga catcgacgag ctggcggaat acgaatggaa
ccgggtggtg 1980ctggacgagg cccaggcggt gaagaacagc ctgtcccggg
cggccaaggc ggtgcgacgg 2040ctacgcgcgg cgcaccgggt cgcgctgacc
gggacaccga tggagaaccg gctcgccgag 2100ctgtggtcga tcatggactt
cctcaacccg ggcctgctcg gatcctccga acgcttccgc 2160acccgctacg
cgatcccgat cgagcggcac gggcacaccg aaccggccga acggctgcgc
2220gcatcgacgc ggccctacat cctgcgccgg ctcaagaccg acccggcgat
catcgacgat 2280ctgccggaga agatcgagat caagcagtac tgccaactca
ccaccgagca ggcgtcgctg 2340tatcaggccg tcgtcgccga catgatggaa
aagatcgaaa acaccgaagg gatcgagcgg 2400cgcggcaacg tgctggccgc
gatggccaag ctcaaacagg tgtgcaacca ccccgcccag 2460ctgctgcacg
atcgctcccc ggtcggtcgg cggtccggga aggtgatccg gctcgaggag
2520atcctggaag agatcctggc cgagggcgac cgggtgctgt gttttaccca
gttcaccgag 2580ttcgccgagc tgctggtgcc gcacctggcc gcacgcttcg
gccgtgccgc ccgagacatt 2640gcctacctgc acggtggcac cccgaggaag
cggcgtgacg agatggtggc ccggttccag 2700tccggtgacg gcccgcccat
ttttctgctg tcgttgaagg cgggcggtac cgggctgaac 2760ctcaccgccg
ccaatcatgt tgtgcacctg gaccgctggt ggaacccggc ggtcgagaac
2820caggcgacgg accgggcgtt tcggatcggg cagcggcgca cggtgcaggt
ccgcaagttc 2880atctgcaccg gcaccctcga ggagaagatc gacgaaatga
tcgaggagaa aaaggcgctg 2940gccgacttgg tggtcaccga cggcgaaggc
tggctgaccg aactgtccac ccgcgatctg 3000cgcgaggtgt tcgcgctgtc
cgaaggcgcc gtcggtgagt ag 3042521013PRTMycobacterium tuberculosis
52Met Leu Val Leu His Gly Phe Trp Ser Asn Ser Gly Gly Met Arg Leu 1
5 10 15 Trp Ala Glu Asp Ser Asp Leu Leu Val Lys Ser Pro Ser Gln Ala
Leu 20 25 30 Arg Ser Ala Arg Pro His Pro Phe Ala Ala Pro Ala Asp
Leu Ile Ala 35 40 45 Gly Ile His Pro Gly Lys Pro Ala Thr Ala Val
Leu Leu Leu Pro Ser 50 55 60 Leu Arg Ser Ala Pro Leu Asp Ser Pro
Glu Leu Ile Arg Leu Ala Pro 65 70 75 80 Arg Pro Ala Ala Arg Thr Asp
Pro Met Leu Leu Ala Trp Thr Val Pro 85 90 95 Val Val Asp Leu Asp
Pro Thr Ala Ala Leu Ala Ala Phe Asp Gln Pro 100 105 110 Ala Pro Asp
Val Arg Tyr Gly Ala Ser Val Asp Tyr Leu Ala Glu Leu 115 120 125 Ala
Val Phe Ala Arg Glu Leu Val Glu Arg Gly Arg Val Leu Pro Gln 130 135
140 Leu Arg Arg Asp Thr His Gly Ala Ala Ala Cys Trp Arg Pro Val Leu
145 150 155 160 Gln Gly Arg Asp Val Val Ala Met Thr Ser Leu Val Ser
Ala Met Pro 165 170 175 Pro Val Cys Arg Ala Glu Val Gly Gly His Asp
Pro His Glu Leu Ala 180 185 190 Thr Ser Ala Leu Asp Ala Met Val Asp
Ala Ala Val Arg Ala Ala Leu 195 200 205 Ser Pro Met Asp Leu Leu Pro
Pro Arg Arg Gly Arg Ser Lys Arg His 210 215 220 Arg Ala Val Glu Ala
Trp Leu Thr Ala Leu Thr Cys Pro Asp Gly Arg 225 230 235 240 Phe Asp
Ala Glu Pro Asp Glu Leu Asp Ala Leu Ala Glu Ala Leu Arg 245 250 255
Pro Trp Asp Asp Val Gly Ile Gly Thr Val Gly Pro Ala Arg Ala Thr 260
265
270 Phe Arg Leu Ser Glu Val Glu Thr Glu Asn Glu Glu Thr Pro Ala Gly
275 280 285 Ser Leu Trp Arg Leu Glu Phe Leu Leu Gln Ser Thr Gln Asp
Pro Ser 290 295 300 Leu Leu Val Pro Ala Glu Gln Ala Trp Asn Asp Asp
Gly Ser Leu Arg 305 310 315 320 Arg Trp Leu Asp Arg Pro Gln Glu Leu
Leu Leu Thr Glu Leu Gly Arg 325 330 335 Ala Ser Arg Ile Phe Pro Glu
Leu Val Pro Ala Leu Arg Thr Ala Cys 340 345 350 Pro Ser Gly Leu Glu
Leu Asp Ala Asp Gly Ala Tyr Arg Phe Leu Ser 355 360 365 Gly Thr Ala
Ala Val Leu Asp Glu Ala Gly Phe Gly Val Leu Leu Pro 370 375 380 Ser
Trp Trp Asp Arg Arg Arg Lys Leu Gly Leu Val Leu Ser Ala Tyr 385 390
395 400 Thr Pro Val Asp Gly Val Val Gly Lys Ala Ser Lys Phe Gly Arg
Glu 405 410 415 Gln Leu Val Glu Phe Arg Trp Glu Leu Ala Val Gly Asp
Asp Pro Leu 420 425 430 Ser Glu Glu Glu Ile Ala Ala Leu Thr Glu Thr
Lys Ser Pro Leu Ile 435 440 445 Arg Leu Arg Gly Gln Trp Val Ala Leu
Asp Thr Glu Gln Met Arg Arg 450 455 460 Gly Leu Glu Phe Leu Glu Arg
Lys Pro Thr Gly Arg Lys Thr Thr Ala 465 470 475 480 Glu Ile Leu Ala
Leu Ala Ala Ser His Pro Asp Asp Val Asp Thr Pro 485 490 495 Leu Glu
Val Thr Ala Val Arg Ala Asp Gly Trp Leu Gly Asp Leu Leu 500 505 510
Ala Gly Ala Ala Ala Ala Ser Leu Gln Pro Leu Asp Pro Pro Asp Gly 515
520 525 Phe Thr Ala Thr Leu Arg Pro Tyr Gln Gln Arg Gly Leu Ala Trp
Leu 530 535 540 Ala Phe Leu Ser Ser Leu Gly Leu Gly Ser Cys Leu Ala
Asp Asp Met 545 550 555 560 Gly Leu Gly Lys Thr Val Gln Leu Leu Ala
Leu Glu Thr Leu Glu Ser 565 570 575 Val Gln Arg His Gln Asp Arg Gly
Val Gly Pro Thr Leu Leu Leu Cys 580 585 590 Pro Met Ser Leu Val Gly
Asn Trp Pro Gln Glu Ala Ala Arg Phe Ala 595 600 605 Pro Asn Leu Arg
Val Tyr Ala His His Gly Gly Ala Arg Leu His Gly 610 615 620 Glu Ala
Leu Arg Asp His Leu Glu Arg Thr Asp Leu Val Val Ser Thr 625 630 635
640 Tyr Thr Thr Ala Thr Arg Asp Ile Asp Glu Leu Ala Glu Tyr Glu Trp
645 650 655 Asn Arg Val Val Leu Asp Glu Ala Gln Ala Val Lys Asn Ser
Leu Ser 660 665 670 Arg Ala Ala Lys Ala Val Arg Arg Leu Arg Ala Ala
His Arg Val Ala 675 680 685 Leu Thr Gly Thr Pro Met Glu Asn Arg Leu
Ala Glu Leu Trp Ser Ile 690 695 700 Met Asp Phe Leu Asn Pro Gly Leu
Leu Gly Ser Ser Glu Arg Phe Arg 705 710 715 720 Thr Arg Tyr Ala Ile
Pro Ile Glu Arg His Gly His Thr Glu Pro Ala 725 730 735 Glu Arg Leu
Arg Ala Ser Thr Arg Pro Tyr Ile Leu Arg Arg Leu Lys 740 745 750 Thr
Asp Pro Ala Ile Ile Asp Asp Leu Pro Glu Lys Ile Glu Ile Lys 755 760
765 Gln Tyr Cys Gln Leu Thr Thr Glu Gln Ala Ser Leu Tyr Gln Ala Val
770 775 780 Val Ala Asp Met Met Glu Lys Ile Glu Asn Thr Glu Gly Ile
Glu Arg 785 790 795 800 Arg Gly Asn Val Leu Ala Ala Met Ala Lys Leu
Lys Gln Val Cys Asn 805 810 815 His Pro Ala Gln Leu Leu His Asp Arg
Ser Pro Val Gly Arg Arg Ser 820 825 830 Gly Lys Val Ile Arg Leu Glu
Glu Ile Leu Glu Glu Ile Leu Ala Glu 835 840 845 Gly Asp Arg Val Leu
Cys Phe Thr Gln Phe Thr Glu Phe Ala Glu Leu 850 855 860 Leu Val Pro
His Leu Ala Ala Arg Phe Gly Arg Ala Ala Arg Asp Ile 865 870 875 880
Ala Tyr Leu His Gly Gly Thr Pro Arg Lys Arg Arg Asp Glu Met Val 885
890 895 Ala Arg Phe Gln Ser Gly Asp Gly Pro Pro Ile Phe Leu Leu Ser
Leu 900 905 910 Lys Ala Gly Gly Thr Gly Leu Asn Leu Thr Ala Ala Asn
His Val Val 915 920 925 His Leu Asp Arg Trp Trp Asn Pro Ala Val Glu
Asn Gln Ala Thr Asp 930 935 940 Arg Ala Phe Arg Ile Gly Gln Arg Arg
Thr Val Gln Val Arg Lys Phe 945 950 955 960 Ile Cys Thr Gly Thr Leu
Glu Glu Lys Ile Asp Glu Met Ile Glu Glu 965 970 975 Lys Lys Ala Leu
Ala Asp Leu Val Val Thr Asp Gly Glu Gly Trp Leu 980 985 990 Thr Glu
Leu Ser Thr Arg Asp Leu Arg Glu Val Phe Ala Leu Ser Glu 995 1000
1005 Gly Ala Val Gly Glu 1010 533282DNAMyxococcus xanthus
53gtgcgagcct ggaggggcgt cctccgctgg gctgccgctg gcctctccct gtccgcggct
60cggagtccga ccggccacct cccagtgttt tcaggttttt ccgtggcgac cgatggcgtc
120gggctgttcg cgggtctgtc tgttcgggcc cttgtccatc aagggcctgg
aggaggaccg 180ctacgagcgc ctcacggaca acccggcagg cctgcggctc
acggagccgg caatcccgtg 240caggggcgct cgcaggcctg cttgcgtgtg
ccgcttgccc ggacggagtt tacattcgca 300gcgatgcccc tcgtgttcct
gcccgacgcc gagacgctgt tcctctgggg gcccgaccgg 360ctgccacgtg
agctcgccgg cctgccggag acgggggacc gcgcctccgc gctgctcgtg
420acgcccgagg gattgcgtga atgcgagggg cacgggctgc ccctggccgc
caccgtcgag 480cggctcgcgg tggtgcaaac ctccgaggcc gagtcctttc
ctggctccat cgccctgtgg 540acgctggcca gcaagctcgc gctggagttg
gtggcgcgcg agcgcgtggt gcccacgctc 600ctgcggcggg gcgagcgcat
cgaggctcgc tgggcggcgg ccctctccgc caccgaggac 660gccggccgcg
tcgccgcgct cgcccggagc atgccgcccg gcgcgcacgc cgtccccgca
720ggcgccaggc caggccgcgc cgtctgggcc ccggacgcct tgctgcgcgc
cttcctcgac 780gccaccgtcg acgccttcgt gcgcgccgcg cgcggtgcgc
cttcgttgcc ggcccggcgc 840gcggcctcgt gggacgagcg ctggcgcgag
gcgctcaccg gcgcgcgacg cgacttcgcg 900ccggagggct tcgccgagcg
ctccgtcgtc gatgagctga cgcgctggag cgaacccgcg 960ctcggcgccc
gggacaagct gcgcgcctgc ttccggctgg agcccccgac ggaggagcgc
1020gagcccttcg tgctgagctt ccacctccag tccccggacg acccaagcct
gctcgtcccg 1080gccgcggacg tctggaagac gcgcgggcgc agcctggaga
agctcggccg cgccttccgt 1140gacccgcagg agtccctgct cgaggcactc
ggccgcgccg cccggctctt ccccccgctg 1200gcgctcgtgc tggagagccc
acgtccccag gcgctcctgc tcgagcccga caccgcgtgg 1260acgttcctct
cggagggcgc ccgcgtgctc tcagacgccg gcttcggcgt catcgtccct
1320ggcgagctca ccacctcggg ccgacgccgc ctgcgcctgc gcatgcgcgt
gggcgcgagc 1380acgaaggccg cgggggccgt cggtggcacc gcggggctcg
ggctcgacgc gctgctgcgc 1440gtggactggg acgccgtgct gggcgaccaa
cccctctccg cccaggagct ggcgctgctg 1500gcccagcgca aggccccgct
cgtgcgattc cgcggcgagt gggtcgcggt ggatcccctc 1560gaactcgacg
ccatccagcg ccacctcgcc cagggccccg gccgcatggc gctgagcgag
1620gcggtgcggg tgtccctgct aggcgaaacg cgccacggac agctccccgt
caccgttctc 1680gccaccgggg cgctggagga gcgcctgcgc ctgcttcggg
agggcggggc caccgctcag 1740gacgcccccc gcgcgctgcg cgccacgctg
cggccctacc agtcgcgcgg tctgcactgg 1800ctggacacgc tggcctcatt
ggggctcggc gcctgcctcg cggacgacat gggcctgggc 1860aagacggtgc
aggtgctggc cttcctgctg cggcggctcg agcaggcgcc tgacgaggcg
1920cgccccacgc tgctggtggc ccccacctcc gtggtgggca actgggagcg
tgagctcgcc 1980cgcttcgccc ccaccttgcg cctgacgcgg cactacggcg
ccgagcgcgc ccgcgcggcg 2040aaccgcttcc cccgcgcgcc cggcgccgtc
gtgctcacca cctacggctt gctgcgccgg 2100gacgccgcgc tgctcgcgcg
cgtggactgg ggcgcggtgg tgctcgacga ggcgcagaac 2160atcaagaacg
cggcgtcggc taccgcccgc gcggcccggg cgttgcgcgc cagccagcgc
2220ttcgcgctca cgggcacgcc ggtggagaac cgcctggcgg agctgtggtc
catcctcgag 2280ttcgccaacc cgggcctgct cgggccgctg gagacgttcc
ggcgggagct ggcgctgccc 2340attgaacgcc atggcaatca ggaggcctcg
gcccggctgc gccggctcgt gagccccttc 2400gtcctgcgcc gcctcaagag
cgacccgacc atcatcacgg acctgcccgc gaagaatgag 2460atgaaggtcg
tctgcacgct cacgcgcgag caggcctcgc tctacaaggc ggtggtggac
2520gaggagctgc ggcgcatcga ggaggccgac ggcatggagc gccggggccg
cgtgctcgcg 2580ctgctgctgt acacgaagca gatcgccaac cacccggcgc
agtacctcgg ggagtccggg 2640cccctgccgg ggcgctcggg gaagctggcg
cgcgtggtgg agatgctcga ggagtccctg 2700gccgctggcg acaaggcgct
cgtcttcacg cagttccggg agatgggcga caagctggtg 2760gcgcacctgt
cggagtacct gggccacgag gtgctcttcc tccacggcgg cacgccccgc
2820aaggcgcgcg acgagatggt gcggcgcttc caggaggacg tccacggtcc
gcgtgtgttc 2880gtgctgtccg tcaaggcggg aggcacgggg ctcaacctga
cggcggcgag ccatgtgttc 2940cattacgacc gctggtggaa cccggccgtc
gaggaccagg ccaccgaccg cgcgtaccgc 3000atcgggcaga cgcgcgcggt
gcaggtccac aagctggtgt gtgcgggcac tgtcgaggag 3060aaggtggacc
ggctgctcga acagaagcgc cagctcgccg agaaggtcgt gggcgcgggc
3120gagcactggg tgaccgagct ggacacgacg gcgctgcgcg agctgttctc
gctgtccgag 3180ggcgccgtgg cggacgatgg cgacgcggaa ggggaagacg
acgcgcgggt gcgcgccccg 3240cgacggcgcg gccgtgcgag cgcgaaggcg
gtgtcgcgat ga 3282541093PRTMyxococcus xanthus 54Val Arg Ala Trp Arg
Gly Val Leu Arg Trp Ala Ala Ala Gly Leu Ser 1 5 10 15 Leu Ser Ala
Ala Arg Ser Pro Thr Gly His Leu Pro Val Phe Ser Gly 20 25 30 Phe
Ser Val Ala Thr Asp Gly Val Gly Leu Phe Ala Gly Leu Ser Val 35 40
45 Arg Ala Leu Val His Gln Gly Pro Gly Gly Gly Pro Leu Arg Ala Pro
50 55 60 His Gly Gln Pro Gly Arg Pro Ala Ala His Gly Ala Gly Asn
Pro Val 65 70 75 80 Gln Gly Arg Ser Gln Ala Cys Leu Arg Val Pro Leu
Ala Arg Thr Glu 85 90 95 Phe Thr Phe Ala Ala Met Pro Leu Val Phe
Leu Pro Asp Ala Glu Thr 100 105 110 Leu Phe Leu Trp Gly Pro Asp Arg
Leu Pro Arg Glu Leu Ala Gly Leu 115 120 125 Pro Glu Thr Gly Asp Arg
Ala Ser Ala Leu Leu Val Thr Pro Glu Gly 130 135 140 Leu Arg Glu Cys
Glu Gly His Gly Leu Pro Leu Ala Ala Thr Val Glu 145 150 155 160 Arg
Leu Ala Val Val Gln Thr Ser Glu Ala Glu Ser Phe Pro Gly Ser 165 170
175 Ile Ala Leu Trp Thr Leu Ala Ser Lys Leu Ala Leu Glu Leu Val Ala
180 185 190 Arg Glu Arg Val Val Pro Thr Leu Leu Arg Arg Gly Glu Arg
Ile Glu 195 200 205 Ala Arg Trp Ala Ala Ala Leu Ser Ala Thr Glu Asp
Ala Gly Arg Val 210 215 220 Ala Ala Leu Ala Arg Ser Met Pro Pro Gly
Ala His Ala Val Pro Ala 225 230 235 240 Gly Ala Arg Pro Gly Arg Ala
Val Trp Ala Pro Asp Ala Leu Leu Arg 245 250 255 Ala Phe Leu Asp Ala
Thr Val Asp Ala Phe Val Arg Ala Ala Arg Gly 260 265 270 Ala Pro Ser
Leu Pro Ala Arg Arg Ala Ala Ser Trp Asp Glu Arg Trp 275 280 285 Arg
Glu Ala Leu Thr Gly Ala Arg Arg Asp Phe Ala Pro Glu Gly Phe 290 295
300 Ala Glu Arg Ser Val Val Asp Glu Leu Thr Arg Trp Ser Glu Pro Ala
305 310 315 320 Leu Gly Ala Arg Asp Lys Leu Arg Ala Cys Phe Arg Leu
Glu Pro Pro 325 330 335 Thr Glu Glu Arg Glu Pro Phe Val Leu Ser Phe
His Leu Gln Ser Pro 340 345 350 Asp Asp Pro Ser Leu Leu Val Pro Ala
Ala Asp Val Trp Lys Thr Arg 355 360 365 Gly Arg Ser Leu Glu Lys Leu
Gly Arg Ala Phe Arg Asp Pro Gln Glu 370 375 380 Ser Leu Leu Glu Ala
Leu Gly Arg Ala Ala Arg Leu Phe Pro Pro Leu 385 390 395 400 Ala Leu
Val Leu Glu Ser Pro Arg Pro Gln Ala Leu Leu Leu Glu Pro 405 410 415
Asp Thr Ala Trp Thr Phe Leu Ser Glu Gly Ala Arg Val Leu Ser Asp 420
425 430 Ala Gly Phe Gly Val Ile Val Pro Gly Glu Leu Thr Thr Ser Gly
Arg 435 440 445 Arg Arg Leu Arg Leu Arg Met Arg Val Gly Ala Ser Thr
Lys Ala Ala 450 455 460 Gly Ala Val Gly Gly Thr Ala Gly Leu Gly Leu
Asp Ala Leu Leu Arg 465 470 475 480 Val Asp Trp Asp Ala Val Leu Gly
Asp Gln Pro Leu Ser Ala Gln Glu 485 490 495 Leu Ala Leu Leu Ala Gln
Arg Lys Ala Pro Leu Val Arg Phe Arg Gly 500 505 510 Glu Trp Val Ala
Val Asp Pro Leu Glu Leu Asp Ala Ile Gln Arg His 515 520 525 Leu Ala
Gln Gly Pro Gly Arg Met Ala Leu Ser Glu Ala Val Arg Val 530 535 540
Ser Leu Leu Gly Glu Thr Arg His Gly Gln Leu Pro Val Thr Val Leu 545
550 555 560 Ala Thr Gly Ala Leu Glu Glu Arg Leu Arg Leu Leu Arg Glu
Gly Gly 565 570 575 Ala Thr Ala Gln Asp Ala Pro Arg Ala Leu Arg Ala
Thr Leu Arg Pro 580 585 590 Tyr Gln Ser Arg Gly Leu His Trp Leu Asp
Thr Leu Ala Ser Leu Gly 595 600 605 Leu Gly Ala Cys Leu Ala Asp Asp
Met Gly Leu Gly Lys Thr Val Gln 610 615 620 Val Leu Ala Phe Leu Leu
Arg Arg Leu Glu Gln Ala Pro Asp Glu Ala 625 630 635 640 Arg Pro Thr
Leu Leu Val Ala Pro Thr Ser Val Val Gly Asn Trp Glu 645 650 655 Arg
Glu Leu Ala Arg Phe Ala Pro Thr Leu Arg Leu Thr Arg His Tyr 660 665
670 Gly Ala Glu Arg Ala Arg Ala Ala Asn Arg Phe Pro Arg Ala Pro Gly
675 680 685 Ala Val Val Leu Thr Thr Tyr Gly Leu Leu Arg Arg Asp Ala
Ala Leu 690 695 700 Leu Ala Arg Val Asp Trp Gly Ala Val Val Leu Asp
Glu Ala Gln Asn 705 710 715 720 Ile Lys Asn Ala Ala Ser Ala Thr Ala
Arg Ala Ala Arg Ala Leu Arg 725 730 735 Ala Ser Gln Arg Phe Ala Leu
Thr Gly Thr Pro Val Glu Asn Arg Leu 740 745 750 Ala Glu Leu Trp Ser
Ile Leu Glu Phe Ala Asn Pro Gly Leu Leu Gly 755 760 765 Pro Leu Glu
Thr Phe Arg Arg Glu Leu Ala Leu Pro Ile Glu Arg His 770 775 780 Gly
Asn Gln Glu Ala Ser Ala Arg Leu Arg Arg Leu Val Ser Pro Phe 785 790
795 800 Val Leu Arg Arg Leu Lys Ser Asp Pro Thr Ile Ile Thr Asp Leu
Pro 805 810 815 Ala Lys Asn Glu Met Lys Val Val Cys Thr Leu Thr Arg
Glu Gln Ala 820 825 830 Ser Leu Tyr Lys Ala Val Val Asp Glu Glu Leu
Arg Arg Ile Glu Glu 835 840 845 Ala Asp Gly Met Glu Arg Arg Gly Arg
Val Leu Ala Leu Leu Leu Tyr 850 855 860 Thr Lys Gln Ile Ala Asn His
Pro Ala Gln Tyr Leu Gly Glu Ser Gly 865 870 875 880 Pro Leu Pro Gly
Arg Ser Gly Lys Leu Ala Arg Val Val Glu Met Leu 885 890 895 Glu Glu
Ser Leu Ala Ala Gly Asp Lys Ala Leu Val Phe Thr Gln Phe 900 905 910
Arg Glu Met Gly Asp Lys Leu Val Ala His Leu Ser Glu Tyr Leu Gly 915
920 925 His Glu Val Leu Phe Leu His Gly Gly Thr Pro Arg Lys Ala Arg
Asp 930 935 940 Glu Met Val Arg Arg Phe Gln Glu Asp Val His Gly Pro
Arg Val Phe 945 950 955 960 Val Leu Ser Val Lys Ala Gly Gly Thr Gly
Leu Asn Leu Thr Ala Ala 965 970 975 Ser His Val Phe His Tyr Asp Arg
Trp Trp Asn Pro Ala Val Glu Asp 980 985 990 Gln Ala Thr Asp Arg Ala
Tyr Arg Ile Gly Gln Thr Arg Ala Val Gln 995 1000 1005 Val His Lys
Leu Val Cys Ala Gly Thr Val Glu Glu Lys Val Asp 1010 1015 1020 Arg
Leu Leu Glu Gln Lys Arg Gln Leu Ala Glu Lys Val Val Gly 1025 1030
1035 Ala Gly Glu His Trp Val Thr Glu Leu Asp Thr Thr Ala Leu Arg
1040
1045 1050 Glu Leu Phe Ser Leu Ser Glu Gly Ala Val Ala Asp Asp Gly
Asp 1055 1060 1065 Ala Glu Gly Glu Asp Asp Ala Arg Val Arg Ala Pro
Arg Arg Arg 1070 1075 1080 Gly Arg Ala Ser Ala Lys Ala Val Ser Arg
1085 1090 552871DNANocardia farcinica 55atggtgggcg ccggcggccc
gccgggtgtc ggtgccacct gcttggatgg acggatgctg 60cacggactgt ggtcgccggg
ttccggcctg gtgctgtgga ccgagggcga ggtgccgccc 120gcgctgcccg
acccggccgg tgcgttgctg cgcgcatcgc ggttccggca tcgggcgcag
180gtgctggtgc cgggccccgc cggcccacag ctcacgcagg tgcgcgcgca
cgccctggtg 240ccacaggccg cggtcgacgt gctgcggcag cggttacccg
tcgaatcggt ggcgggtgac 300ctgcgctttc tcgctcacgt cgccgacggg
atcgatcggt gggtgcgggc cggtcgcgtg 360gtgcccgacc tgcaccgggc
cgacggacag tggtgggcgc gctggcggct ggtcggcggt 420gcccggcagc
gggcctggct ggccgaactc gcggtggcga tgcccgcggc gctgcgggtg
480gccgggcagc ccgcggcggt gctcgacgat ctggtcaccg agctgaccga
tccgatcgtg 540cgcaccaggc tcgccgacgc gccggtgacg cacccgctgg
tgcgcgcact ggtgcgggac 600cagccgctcg agacgggtag ccaccagctg
gccgaggtgc tgcggcgctg gcgcgagagc 660ctcaccgtcg acgagccgga
gctggtgttg cggctgctgg aaccggacgg ggagaccggt 720atcgacgggg
acggcgggga cgaccgggac gacaccgtgg cgctgtggcg gctggaggtc
780tgcctccgca ccgagggcga ggccccggcc ccggtgccgg cgaccgccga
cccgaacctg 840ctgcgcatcg ccgtcgagca gctcggccgg gcgcagcggg
cctacccccg gctgcgcgat 900ctgcccggcg atccgcacag cctcgacctg
ctgttgccca ccgaggtggt ggccgatctc 960gtcgcgcacg gtgcgcaggc
gttgcgcgag gcgggggtgc ggctgctgct gccgcgcgcc 1020tggaccatcg
ccgaacccac cctgcggctc gcggtgagca gcgccgcgcc cgccgcggag
1080agcaccgtgg gcatgcaggg tctgctgtcc tatcggtggg aactggcggt
cggcgacaag 1140gtgctcaccc gcgccgagat ggagcgcctg gtccgcgcca
aatccgacct ggtgcagttg 1200cgcggggaat gggtgcaggc cgaccacaag
gtgctcgccg ccgccgcccg ctacgtcgcc 1260gcgcatctgg acacgtcgcc
ggtcaccctc gccgacctgc tcggcgagat cgccgccacc 1320cgcgtcgaca
aggtgccgct caccgaggtc accgccaccg gctgggcggg cgagttgttc
1380gacggcggcc gcgagccggt ggcgaccccg ggtgggctga aggcgcagct
gcgcccgtat 1440cagctgcgcg gcctgagctg gctggcgacg atgagccgga
tgggctgcgg cggcatcctc 1500gccgacgaca tgggtctcgg caagacggtg
caggtgctgg ccctgctggt gcacgagcgc 1560gagaccagca cggcaccgcc
cggcccgaca ctgctggtgt gcccgatgtc ggtggtcggc 1620aactggcagc
gcgaggcgca gcggttcgcc cccgggctgc gggtgctggt gcaccacggc
1680gccgaccgcc gtcgcgacgc cgaactcgat gccgcggtgg cggattcgga
cctggtgctc 1740accacctacg ccatcctggc cagggatgcg gccgaactgt
cgcgccagtc gtgggaccgg 1800gtggtgctcg acgaggcgca gcacatcaag
aacgccgcga ccaggcaggc acgtgccgcc 1860cgtgccctgc cggcccggca
tcgcctggcg ctcaccggaa ccccggtgga gaaccggctc 1920gaagagttgc
gctcgatcat ggatttcgcg gtgcccaagc tgctcggtac cgcaccgacc
1980ttccgcgccc ggttcgccgt ccccatcgaa cgcgggcagg atcccaacgc
cctgtcccgc 2040ctgcgcttcc tcacccaacc gttcgtgctg cgccgggtca
aggccgatcc ggcggtcatc 2100ggcgatctgc ccgacaagct cgagatgacg
gtgcgggcga acctgaccgt cgagcaggcc 2160gccctgtacc aagccgtcgt
cgacgacatg ctggtgaaac tgcgcagtgc caagggcatg 2220gcccgcaagg
gtgcggtgct cggcgcgctc acccggctca agcaggtgtg caaccatccc
2280gcgcacttcc tcggtgacgg ttccccggtg ctgcatcgcg gcaggcaccg
ctccggcaag 2340ctcgccttgg tcgaggacgt gctcgacacc gtcgtcgcgg
acggggagaa ggcgttgctg 2400ttcacccagt tccgtgagtt cggcgacctg
ctcgcgccct atctgtccga gcggttcggc 2460gcgccgatcc cgttcctgca
cggcggcgtg accaagaaga accgggacac gatggtcgag 2520cgcttccagt
ccggcgacgg cccgccggtc atgctgctgt ccctcaaggc cggcggcacc
2580gggctcaccc tcaccgccgc caatcacgtg gtgcacctgg atcgctggtg
gaatccggcg 2640gtggagaacc aggccaccga tcgcgccttc cgcatcggcc
agcgccgcga cgtccaggtg 2700cgcaagctgg tctgcgtcga caccatcgag
gaacggatcg acgagatgat caccggcaag 2760agcaggctcg cggacctggc
cgtggacgcg ggggagaact ggatcaccga gctgggcacc 2820gaggagctgc
gcgagttgtt caccctcggc gccgaggcgg tgggggagtg a 287156956PRTNocardia
farcinica 56Met Val Gly Ala Gly Gly Pro Pro Gly Val Gly Ala Thr Cys
Leu Asp 1 5 10 15 Gly Arg Met Leu His Gly Leu Trp Ser Pro Gly Ser
Gly Leu Val Leu 20 25 30 Trp Thr Glu Gly Glu Val Pro Pro Ala Leu
Pro Asp Pro Ala Gly Ala 35 40 45 Leu Leu Arg Ala Ser Arg Phe Arg
His Arg Ala Gln Val Leu Val Pro 50 55 60 Gly Pro Ala Gly Pro Gln
Leu Thr Gln Val Arg Ala His Ala Leu Val 65 70 75 80 Pro Gln Ala Ala
Val Asp Val Leu Arg Gln Arg Leu Pro Val Glu Ser 85 90 95 Val Ala
Gly Asp Leu Arg Phe Leu Ala His Val Ala Asp Gly Ile Asp 100 105 110
Arg Trp Val Arg Ala Gly Arg Val Val Pro Asp Leu His Arg Ala Asp 115
120 125 Gly Gln Trp Trp Ala Arg Trp Arg Leu Val Gly Gly Ala Arg Gln
Arg 130 135 140 Ala Trp Leu Ala Glu Leu Ala Val Ala Met Pro Ala Ala
Leu Arg Val 145 150 155 160 Ala Gly Gln Pro Ala Ala Val Leu Asp Asp
Leu Val Thr Glu Leu Thr 165 170 175 Asp Pro Ile Val Arg Thr Arg Leu
Ala Asp Ala Pro Val Thr His Pro 180 185 190 Leu Val Arg Ala Leu Val
Arg Asp Gln Pro Leu Glu Thr Gly Ser His 195 200 205 Gln Leu Ala Glu
Val Leu Arg Arg Trp Arg Glu Ser Leu Thr Val Asp 210 215 220 Glu Pro
Glu Leu Val Leu Arg Leu Leu Glu Pro Asp Gly Glu Thr Gly 225 230 235
240 Ile Asp Gly Asp Gly Gly Asp Asp Arg Asp Asp Thr Val Ala Leu Trp
245 250 255 Arg Leu Glu Val Cys Leu Arg Thr Glu Gly Glu Ala Pro Ala
Pro Val 260 265 270 Pro Ala Thr Ala Asp Pro Asn Leu Leu Arg Ile Ala
Val Glu Gln Leu 275 280 285 Gly Arg Ala Gln Arg Ala Tyr Pro Arg Leu
Arg Asp Leu Pro Gly Asp 290 295 300 Pro His Ser Leu Asp Leu Leu Leu
Pro Thr Glu Val Val Ala Asp Leu 305 310 315 320 Val Ala His Gly Ala
Gln Ala Leu Arg Glu Ala Gly Val Arg Leu Leu 325 330 335 Leu Pro Arg
Ala Trp Thr Ile Ala Glu Pro Thr Leu Arg Leu Ala Val 340 345 350 Ser
Ser Ala Ala Pro Ala Ala Glu Ser Thr Val Gly Met Gln Gly Leu 355 360
365 Leu Ser Tyr Arg Trp Glu Leu Ala Val Gly Asp Lys Val Leu Thr Arg
370 375 380 Ala Glu Met Glu Arg Leu Val Arg Ala Lys Ser Asp Leu Val
Gln Leu 385 390 395 400 Arg Gly Glu Trp Val Gln Ala Asp His Lys Val
Leu Ala Ala Ala Ala 405 410 415 Arg Tyr Val Ala Ala His Leu Asp Thr
Ser Pro Val Thr Leu Ala Asp 420 425 430 Leu Leu Gly Glu Ile Ala Ala
Thr Arg Val Asp Lys Val Pro Leu Thr 435 440 445 Glu Val Thr Ala Thr
Gly Trp Ala Gly Glu Leu Phe Asp Gly Gly Arg 450 455 460 Glu Pro Val
Ala Thr Pro Gly Gly Leu Lys Ala Gln Leu Arg Pro Tyr 465 470 475 480
Gln Leu Arg Gly Leu Ser Trp Leu Ala Thr Met Ser Arg Met Gly Cys 485
490 495 Gly Gly Ile Leu Ala Asp Asp Met Gly Leu Gly Lys Thr Val Gln
Val 500 505 510 Leu Ala Leu Leu Val His Glu Arg Glu Thr Ser Thr Ala
Pro Pro Gly 515 520 525 Pro Thr Leu Leu Val Cys Pro Met Ser Val Val
Gly Asn Trp Gln Arg 530 535 540 Glu Ala Gln Arg Phe Ala Pro Gly Leu
Arg Val Leu Val His His Gly 545 550 555 560 Ala Asp Arg Arg Arg Asp
Ala Glu Leu Asp Ala Ala Val Ala Asp Ser 565 570 575 Asp Leu Val Leu
Thr Thr Tyr Ala Ile Leu Ala Arg Asp Ala Ala Glu 580 585 590 Leu Ser
Arg Gln Ser Trp Asp Arg Val Val Leu Asp Glu Ala Gln His 595 600 605
Ile Lys Asn Ala Ala Thr Arg Gln Ala Arg Ala Ala Arg Ala Leu Pro 610
615 620 Ala Arg His Arg Leu Ala Leu Thr Gly Thr Pro Val Glu Asn Arg
Leu 625 630 635 640 Glu Glu Leu Arg Ser Ile Met Asp Phe Ala Val Pro
Lys Leu Leu Gly 645 650 655 Thr Ala Pro Thr Phe Arg Ala Arg Phe Ala
Val Pro Ile Glu Arg Gly 660 665 670 Gln Asp Pro Asn Ala Leu Ser Arg
Leu Arg Phe Leu Thr Gln Pro Phe 675 680 685 Val Leu Arg Arg Val Lys
Ala Asp Pro Ala Val Ile Gly Asp Leu Pro 690 695 700 Asp Lys Leu Glu
Met Thr Val Arg Ala Asn Leu Thr Val Glu Gln Ala 705 710 715 720 Ala
Leu Tyr Gln Ala Val Val Asp Asp Met Leu Val Lys Leu Arg Ser 725 730
735 Ala Lys Gly Met Ala Arg Lys Gly Ala Val Leu Gly Ala Leu Thr Arg
740 745 750 Leu Lys Gln Val Cys Asn His Pro Ala His Phe Leu Gly Asp
Gly Ser 755 760 765 Pro Val Leu His Arg Gly Arg His Arg Ser Gly Lys
Leu Ala Leu Val 770 775 780 Glu Asp Val Leu Asp Thr Val Val Ala Asp
Gly Glu Lys Ala Leu Leu 785 790 795 800 Phe Thr Gln Phe Arg Glu Phe
Gly Asp Leu Leu Ala Pro Tyr Leu Ser 805 810 815 Glu Arg Phe Gly Ala
Pro Ile Pro Phe Leu His Gly Gly Val Thr Lys 820 825 830 Lys Asn Arg
Asp Thr Met Val Glu Arg Phe Gln Ser Gly Asp Gly Pro 835 840 845 Pro
Val Met Leu Leu Ser Leu Lys Ala Gly Gly Thr Gly Leu Thr Leu 850 855
860 Thr Ala Ala Asn His Val Val His Leu Asp Arg Trp Trp Asn Pro Ala
865 870 875 880 Val Glu Asn Gln Ala Thr Asp Arg Ala Phe Arg Ile Gly
Gln Arg Arg 885 890 895 Asp Val Gln Val Arg Lys Leu Val Cys Val Asp
Thr Ile Glu Glu Arg 900 905 910 Ile Asp Glu Met Ile Thr Gly Lys Ser
Arg Leu Ala Asp Leu Ala Val 915 920 925 Asp Ala Gly Glu Asn Trp Ile
Thr Glu Leu Gly Thr Glu Glu Leu Arg 930 935 940 Glu Leu Phe Thr Leu
Gly Ala Glu Ala Val Gly Glu 945 950 955 573264DNANodularia
spumigena 57atggcaattt tacacggtaa ttggttagta agaaatcaaa atggttgttt
atttatttgg 60ggtgaaactt ggcgttcatc acgagtcgat tttgctctga atgtatctca
agatatacca 120ctacatccat tggtaatgtc accaattgat ttgagtgagt
tgttaagtta tcataatatc 180aaaattccta gcttaataca gcaatcccaa
gttgctttat ctggcactgg gcgaactcgt 240aaaagtacaa gtactactaa
atttagctgg acaactcact ctctaatcat tgatttacca 300actcatatct
cagaaaataa tccccaagaa atagaattta tttccccttt gcattctgct
360actttgggtt ctgaaataaa ttcaccccaa tatctccaac cgtggcgagt
cgagggtttt 420tgtctcaacc ccactgaagc gataaaattt ctcgctgctg
ttcctttaaa tgctgctaga 480gaagaagata ctttgttcgg tggagattta
cgtttttggt cacaaattgc ccgttggagt 540ttggatttaa tctctcggtg
taagtttttg ccaactattc aaagacagtt tgatagttct 600attgttgcta
ggtggcaagt gcttttagac agtgcaatag atggaacacg cctggaaaaa
660ttttctgcaa aaatgccatt agcttgtcgt acttatcgga agggaatggg
gagtggggag 720tggggagtgg ggagtgggga ggaatcttcc ccatccataa
tgtatgtaga ttttccaact 780gaaccccagg aactattatt aggatttctc
aacagtacca tagatgccca agtgcgagaa 840atgttagctt ctcaacctct
actagaaact agagtgatgg catctttacc atctgcggtg 900cgacagtggt
tgcaaggttt aaccagtgca tctcacacag tgaatgcaga tgcaatggaa
960gtagaaagat tagaagcagc cctgaaatct tggactatgc cgttgcaata
tcaactggta 1020ggaaaaccct cgtttcgcgc ctgttttcaa ctgcttcccc
ctgcttctgg ggcaacagat 1080tggatattgg catattttct ccaagctgcg
gatgatgaaa atttattagt ggatgcggca 1140actatttggc atcacccagt
tgaacaatta gtttatcaaa atcgcaccat tgatcaaccc 1200caagaaactt
tattgcgggg cttgggttta gcttcgcgat tatatccagt tcttacaccg
1260agtttagaaa cagaatatcc ccaatgttgt cgcctcaacc cattacaagc
ttatgaattt 1320atcaagtctg tagcttggcg atttgaagat agtggtttgg
gggtaatttt acctcctagt 1380ttgactaacc gcgaaggatg ggcgaaccgt
ttggggttaa aaattagtgc tgaaactcaa 1440aagaaaaaac agggacgctt
gggtttacaa agtttactga attttcaatg gcaattggca 1500attggtggac
aaacaatttc taaaaccgag tttaataaac tggtagcttt aaatagccca
1560ctggtagaaa ttaacggcga atgggtggaa ttgcgacccc aggatattaa
aacagcacag 1620acattttttg cttctcgtaa agacgaaatg acgctttctt
tggaagatgc tttacgcctc 1680agttctggcg atacccaagc gattgaaaag
ttacctgtgg tcagttttga agcatctggg 1740acattgcaag agttaattgg
ggcgttaacc aataatcaag ccatttcacc cctcccaaca 1800cctgcaaatt
ttcaaggaca gttacgacct tatcaagaaa gaggggcggc ttggctggct
1860ttcttagaac gttggggttt aggtgcttgt ttggctgatg atatggggct
gggaaaaaca 1920attcagttaa ttgccttttt actgcacctc aaagaacaag
acgcactgga aaatcccaca 1980ttacttgttt gtccgacttc tattttaggt
aactgggaac gggaaattaa aaaatttgct 2040cctactctca aagttttaca
gcaccacggc gataaacgtc tcaaaggtaa agcgtttgta 2100gaagcagtca
aaaaacacga tgtaattatt accagttact cactcgttca ccgggatatt
2160aaatctttgc agagtgtcga ttggcaaaca gttgtattag atgaagccca
gaatgtgaaa 2220aatcctgaag ctaaacaatc gcaggctgtg aggggattaa
aaactacatt tcgcatagct 2280ttaacaggga caccagtaga aaacaaactg
caagaattgt ggtctatttt agattttctt 2340aatcctgggt atttgggaaa
tcgtcaattt ttccagagac ggtttgctat gccaattgaa 2400aagtatggtg
atacagcatc tttaaatcaa ttgcggggtt tagttcaacc gtttattcta
2460cgtcgtctga aaacagatcg tgatattatt caagatttgc cagaaaagca
agaaatgacg 2520gttttttgtg ggcttgcggc tgaacaagct gcactttatc
aacaagtagt tgaagcatct 2580ttagtagaaa ttgaatctgc tgagggtttg
caacgtcgag ggatgatttt agctttactt 2640gtgaaactta aacaaatctg
taatcatcca gcccaatatt tgaaagccgc gacattacaa 2700gaacatagtt
ctgctaaact gcaacggcta gatgaaatgt taacggtagc tttggaggaa
2760ggagataggg ctttaatttt cactcaattt gctgaatggg gtaagttatt
aaaagctcat 2820ttacaacaaa cacttgggaa agaaatattc tttttatatg
gtggtagcag taaaaaacaa 2880cgcgaggaaa tgattgaccg tttccaacat
gacccccaag gacctccgat tatgattctt 2940tctttaaaag cgggtggggt
aggcttgaat ttaaccaggg ctaatcatgt atttcacttt 3000gatagatggt
ggaatcccgc agtggaaaat caagcgacag atagagtatt tcgtattggt
3060caaacccgga atgtgcaagt gcataaattt gtctgtactg gcacattaga
agaaaaaatt 3120catgacatga ttgaaagtaa aaaacaatta gcggaacaag
tagttggtgc tggtgaggag 3180tggctgactg aaatgaatac tgaccaattg
cgtgatttac tcattcttga tcgcagtgcc 3240ataattgatg aggatgaagt ttaa
3264581087PRTNodularia spumigena 58Met Ala Ile Leu His Gly Asn Trp
Leu Val Arg Asn Gln Asn Gly Cys 1 5 10 15 Leu Phe Ile Trp Gly Glu
Thr Trp Arg Ser Ser Arg Val Asp Phe Ala 20 25 30 Leu Asn Val Ser
Gln Asp Ile Pro Leu His Pro Leu Val Met Ser Pro 35 40 45 Ile Asp
Leu Ser Glu Leu Leu Ser Tyr His Asn Ile Lys Ile Pro Ser 50 55 60
Leu Ile Gln Gln Ser Gln Val Ala Leu Ser Gly Thr Gly Arg Thr Arg 65
70 75 80 Lys Ser Thr Ser Thr Thr Lys Phe Ser Trp Thr Thr His Ser
Leu Ile 85 90 95 Ile Asp Leu Pro Thr His Ile Ser Glu Asn Asn Pro
Gln Glu Ile Glu 100 105 110 Phe Ile Ser Pro Leu His Ser Ala Thr Leu
Gly Ser Glu Ile Asn Ser 115 120 125 Pro Gln Tyr Leu Gln Pro Trp Arg
Val Glu Gly Phe Cys Leu Asn Pro 130 135 140 Thr Glu Ala Ile Lys Phe
Leu Ala Ala Val Pro Leu Asn Ala Ala Arg 145 150 155 160 Glu Glu Asp
Thr Leu Phe Gly Gly Asp Leu Arg Phe Trp Ser Gln Ile 165 170 175 Ala
Arg Trp Ser Leu Asp Leu Ile Ser Arg Cys Lys Phe Leu Pro Thr 180 185
190 Ile Gln Arg Gln Phe Asp Ser Ser Ile Val Ala Arg Trp Gln Val Leu
195 200 205 Leu Asp Ser Ala Ile Asp Gly Thr Arg Leu Glu Lys Phe Ser
Ala Lys 210 215 220 Met Pro Leu Ala Cys Arg Thr Tyr Arg Lys Gly Met
Gly Ser Gly Glu 225 230 235 240 Trp Gly Val Gly Ser Gly Glu Glu Ser
Ser Pro Ser Ile Met Tyr Val 245 250 255 Asp Phe Pro Thr Glu Pro Gln
Glu Leu Leu Leu Gly Phe Leu Asn Ser 260 265 270 Thr Ile Asp Ala Gln
Val Arg Glu Met Leu Ala Ser Gln Pro Leu Leu 275 280 285 Glu Thr Arg
Val Met Ala Ser Leu Pro Ser Ala Val Arg Gln Trp Leu 290 295 300 Gln
Gly Leu Thr Ser Ala Ser His Thr Val Asn Ala Asp Ala Met Glu 305 310
315 320 Val Glu Arg Leu Glu Ala Ala Leu Lys Ser Trp Thr Met Pro Leu
Gln 325 330 335 Tyr Gln Leu Val Gly Lys Pro Ser Phe
Arg Ala Cys Phe Gln Leu Leu 340 345 350 Pro Pro Ala Ser Gly Ala Thr
Asp Trp Ile Leu Ala Tyr Phe Leu Gln 355 360 365 Ala Ala Asp Asp Glu
Asn Leu Leu Val Asp Ala Ala Thr Ile Trp His 370 375 380 His Pro Val
Glu Gln Leu Val Tyr Gln Asn Arg Thr Ile Asp Gln Pro 385 390 395 400
Gln Glu Thr Leu Leu Arg Gly Leu Gly Leu Ala Ser Arg Leu Tyr Pro 405
410 415 Val Leu Thr Pro Ser Leu Glu Thr Glu Tyr Pro Gln Cys Cys Arg
Leu 420 425 430 Asn Pro Leu Gln Ala Tyr Glu Phe Ile Lys Ser Val Ala
Trp Arg Phe 435 440 445 Glu Asp Ser Gly Leu Gly Val Ile Leu Pro Pro
Ser Leu Thr Asn Arg 450 455 460 Glu Gly Trp Ala Asn Arg Leu Gly Leu
Lys Ile Ser Ala Glu Thr Gln 465 470 475 480 Lys Lys Lys Gln Gly Arg
Leu Gly Leu Gln Ser Leu Leu Asn Phe Gln 485 490 495 Trp Gln Leu Ala
Ile Gly Gly Gln Thr Ile Ser Lys Thr Glu Phe Asn 500 505 510 Lys Leu
Val Ala Leu Asn Ser Pro Leu Val Glu Ile Asn Gly Glu Trp 515 520 525
Val Glu Leu Arg Pro Gln Asp Ile Lys Thr Ala Gln Thr Phe Phe Ala 530
535 540 Ser Arg Lys Asp Glu Met Thr Leu Ser Leu Glu Asp Ala Leu Arg
Leu 545 550 555 560 Ser Ser Gly Asp Thr Gln Ala Ile Glu Lys Leu Pro
Val Val Ser Phe 565 570 575 Glu Ala Ser Gly Thr Leu Gln Glu Leu Ile
Gly Ala Leu Thr Asn Asn 580 585 590 Gln Ala Ile Ser Pro Leu Pro Thr
Pro Ala Asn Phe Gln Gly Gln Leu 595 600 605 Arg Pro Tyr Gln Glu Arg
Gly Ala Ala Trp Leu Ala Phe Leu Glu Arg 610 615 620 Trp Gly Leu Gly
Ala Cys Leu Ala Asp Asp Met Gly Leu Gly Lys Thr 625 630 635 640 Ile
Gln Leu Ile Ala Phe Leu Leu His Leu Lys Glu Gln Asp Ala Leu 645 650
655 Glu Asn Pro Thr Leu Leu Val Cys Pro Thr Ser Ile Leu Gly Asn Trp
660 665 670 Glu Arg Glu Ile Lys Lys Phe Ala Pro Thr Leu Lys Val Leu
Gln His 675 680 685 His Gly Asp Lys Arg Leu Lys Gly Lys Ala Phe Val
Glu Ala Val Lys 690 695 700 Lys His Asp Val Ile Ile Thr Ser Tyr Ser
Leu Val His Arg Asp Ile 705 710 715 720 Lys Ser Leu Gln Ser Val Asp
Trp Gln Thr Val Val Leu Asp Glu Ala 725 730 735 Gln Asn Val Lys Asn
Pro Glu Ala Lys Gln Ser Gln Ala Val Arg Gly 740 745 750 Leu Lys Thr
Thr Phe Arg Ile Ala Leu Thr Gly Thr Pro Val Glu Asn 755 760 765 Lys
Leu Gln Glu Leu Trp Ser Ile Leu Asp Phe Leu Asn Pro Gly Tyr 770 775
780 Leu Gly Asn Arg Gln Phe Phe Gln Arg Arg Phe Ala Met Pro Ile Glu
785 790 795 800 Lys Tyr Gly Asp Thr Ala Ser Leu Asn Gln Leu Arg Gly
Leu Val Gln 805 810 815 Pro Phe Ile Leu Arg Arg Leu Lys Thr Asp Arg
Asp Ile Ile Gln Asp 820 825 830 Leu Pro Glu Lys Gln Glu Met Thr Val
Phe Cys Gly Leu Ala Ala Glu 835 840 845 Gln Ala Ala Leu Tyr Gln Gln
Val Val Glu Ala Ser Leu Val Glu Ile 850 855 860 Glu Ser Ala Glu Gly
Leu Gln Arg Arg Gly Met Ile Leu Ala Leu Leu 865 870 875 880 Val Lys
Leu Lys Gln Ile Cys Asn His Pro Ala Gln Tyr Leu Lys Ala 885 890 895
Ala Thr Leu Gln Glu His Ser Ser Ala Lys Leu Gln Arg Leu Asp Glu 900
905 910 Met Leu Thr Val Ala Leu Glu Glu Gly Asp Arg Ala Leu Ile Phe
Thr 915 920 925 Gln Phe Ala Glu Trp Gly Lys Leu Leu Lys Ala His Leu
Gln Gln Thr 930 935 940 Leu Gly Lys Glu Ile Phe Phe Leu Tyr Gly Gly
Ser Ser Lys Lys Gln 945 950 955 960 Arg Glu Glu Met Ile Asp Arg Phe
Gln His Asp Pro Gln Gly Pro Pro 965 970 975 Ile Met Ile Leu Ser Leu
Lys Ala Gly Gly Val Gly Leu Asn Leu Thr 980 985 990 Arg Ala Asn His
Val Phe His Phe Asp Arg Trp Trp Asn Pro Ala Val 995 1000 1005 Glu
Asn Gln Ala Thr Asp Arg Val Phe Arg Ile Gly Gln Thr Arg 1010 1015
1020 Asn Val Gln Val His Lys Phe Val Cys Thr Gly Thr Leu Glu Glu
1025 1030 1035 Lys Ile His Asp Met Ile Glu Ser Lys Lys Gln Leu Ala
Glu Gln 1040 1045 1050 Val Val Gly Ala Gly Glu Glu Trp Leu Thr Glu
Met Asn Thr Asp 1055 1060 1065 Gln Leu Arg Asp Leu Leu Ile Leu Asp
Arg Ser Ala Ile Ile Asp 1070 1075 1080 Glu Asp Glu Val 1085
593228DNANostoc sp. 59atggcaattc tacacggtag ttggatatta aatgagcagg
agagttgttt atttatttgg 60ggggaaactt ggcgatcgcc acaagtggat tttaattttg
cggagatatc cctcaatccc 120ttggcgctgt ctgcactgga attaagtgag
tggttgcagt ctcaacatca ggcgatcgct 180aagttgttac cgcaacaatt
ggaaaaacga acctccaaag cagcaagttc tgtaaaaata 240aatttattaa
ctcattcaca aataattgcc ctgccaacgg aaatttccca acctcgtaaa
300aaagaaacca ttttaatttc tcctgtgcat tctgccgctt tagcatctga
gtcagactct 360gaagtttatt tacaaacttg gcgtgtagaa ggtttttgtc
ttcctcctag tgcagcaatt 420aaattgctaa cttctttacc tttaaatata
actagtgggg agaatgcttt tttaggtgga 480gatttacgtt tctggtcaca
aattgcccgt tggagtttag atttaatttc taggtctaag 540tttctcccaa
ttatccaacg acaacctaat aattctgtaa gtgctaaatg gcaagtactt
600ttagatagtg ccgtagatgg aactcgttta gaaaagtttg ctgcgaagat
gcccttggtt 660tgtcggactt atcaagaaat tgggagtggg gaatctccta
tatatataga ttttcctagt 720cagccgcagg atttaatctt gggttttctc
aatagtgcga tagatacgca attgcgggag 780atggtgggga atcagcctgt
ggtggaaact cggttgatgg catctttacc atcggcggtg 840cgacagtggt
tgcaagcgtt aattgctgca tctaattcaa ttgatgcaga tgctgttggt
900ttagaaaggc tggaagcggc gctcaaggct tggacgatgc cgctacaata
tcaactagca 960agtaaaaatc aatttcgcac ttgttttgaa ttacgttctc
cagaaccaga cgaaactgaa 1020tggacgctgg cgtatttcct gcaagcagcc
gatgatccag aatttttagt agatgcggcg 1080actatttggc aaaatcctgt
tgaacagcta atttatcaac agcgaacgat tgaagaaccc 1140caggaaacgt
ttttgcgagg tttggggtta gcttctcgat tgtatccggt cattgccccc
1200actttagata cagaatcacc ccaattttgt catctcaagc ccatgcaggc
ttatgaattt 1260atcaaggctg tggcttggcg atttgaagat agcggcttag
gggtgatttt acctcctagt 1320ttggcgaatc gtgaaggctg ggcaaatcgc
ttgggtttga aaatctccgc cgaaacgccg 1380aagaaaaaac caggacgctt
aggattgcag agtttgctca atttccaatg gcacttagcg 1440attggtgggc
aaactatttc taaagctgaa tttgacagac tggtagcttt aaaaagccca
1500ttggtagaaa ttaacggcga gtgggtggaa ttacgtcccc aagatatcaa
aacagctgaa 1560gcctttttta ctgcgcgtaa agaccaaatg gccttatctt
tagaagatgc cttacgtcta 1620agtagtggcg atacacaagt aattgagaaa
ttaccagtag tcagctttga agcctctggc 1680gcattacaag aattgattgg
ggcgctgaca aataatcaag cagttgcacc attacctacg 1740ccgaaaaact
tccaaggaca gttacgtcct tatcaagaaa ggggtgcggc ttggttggcg
1800ttcctcgaac gctggggttt aggtgcttgt ctcgccgacg acatgggact
gggaaaaacg 1860atacagttca ttgctttcct tctccatctt aaagaacagg
atgtattaga aaaaccaact 1920ttactagtgt gtcctacttc tgttttaggt
aactgggaac gagaggtgag aaaatttgca 1980cctacactta aagttctcca
gtatcatggt gacaaacgtc ctaaaggtaa agcatttcaa 2040gaagcagtaa
aaaaacatga tttagttatt acaagttact cattaattca tagagatatc
2100aaatcattgc agggtattcc ttggcaaata attgttttag atgaagccca
aaatgtgaag 2160aatgcggaag ccaaacaatc acaagcagtc agacaattag
aaacaacatt tcgtattgct 2220ttaacaggta caccagtaga aaatagacta
caagaacttt ggtcaatttt agattttctt 2280aatcctggtt acttaggtaa
taagcaattc tttcaaagac gttttgctat gccaattgaa 2340aagtatggtg
atgcagcatc tttaaatcaa ttgcgtgctt tagtgcaacc atttattctg
2400cgtcggctga aaacagaccg tgatattatt caagacttgc ccgataagca
agaaatgaca 2460gtattttgtg gtttgactgg agaacaagct gcactttatc
aaaaagcggt agaaacatct 2520ttagcagaaa ttgaatcagc cgaaggattg
caacgccgag ggatgatttt agctttatta 2580attaaactca aacaaatctg
caatcatcca gcccaatatc tgaaaataaa tacattagaa 2640caacacagtt
ctggaaaact gcaaagatta gaagaaatgt tagaagaggt gttagcagag
2700agtaatactt acggtgttgc cggtgcggga cgtgctttga tttttaccca
atttgcagaa 2760tggggtaagt tactcaaacc acatttagaa aaacaactag
ggcgggaaat atttttctta 2820tatggtggta cgagtaaaaa gcaacgagaa
gaaatgattg accgttttca acacgacccc 2880caagggccac caattatgat
tctctccctc aaagcaggtg gtgtagggtt gaacttaacc 2940agggcaaatc
atgtatttca ctttgataga tggtggaatc cagccgtaga gaatcaagct
3000acagaccgcg tatttcgcat tggtcaaact cgcaatgtac aggtgcataa
atttgtttgt 3060aatggcacct tagaagagaa aattcacgac atgattgaaa
gtaaaaaaca actagcggaa 3120caggttgttg gagcaggcga agaatggtta
actgaattag atacagatca actccgcaac 3180ttactgatac ttgatcgtag
tacagtaatt gatgaagaag cagattga 3228601075PRTNostoc sp. 60Met Ala
Ile Leu His Gly Ser Trp Ile Leu Asn Glu Gln Glu Ser Cys 1 5 10 15
Leu Phe Ile Trp Gly Glu Thr Trp Arg Ser Pro Gln Val Asp Phe Asn 20
25 30 Phe Ala Glu Ile Ser Leu Asn Pro Leu Ala Leu Ser Ala Leu Glu
Leu 35 40 45 Ser Glu Trp Leu Gln Ser Gln His Gln Ala Ile Ala Lys
Leu Leu Pro 50 55 60 Gln Gln Leu Glu Lys Arg Thr Ser Lys Ala Ala
Ser Ser Val Lys Ile 65 70 75 80 Asn Leu Leu Thr His Ser Gln Ile Ile
Ala Leu Pro Thr Glu Ile Ser 85 90 95 Gln Pro Arg Lys Lys Glu Thr
Ile Leu Ile Ser Pro Val His Ser Ala 100 105 110 Ala Leu Ala Ser Glu
Ser Asp Ser Glu Val Tyr Leu Gln Thr Trp Arg 115 120 125 Val Glu Gly
Phe Cys Leu Pro Pro Ser Ala Ala Ile Lys Leu Leu Thr 130 135 140 Ser
Leu Pro Leu Asn Ile Thr Ser Gly Glu Asn Ala Phe Leu Gly Gly 145 150
155 160 Asp Leu Arg Phe Trp Ser Gln Ile Ala Arg Trp Ser Leu Asp Leu
Ile 165 170 175 Ser Arg Ser Lys Phe Leu Pro Ile Ile Gln Arg Gln Pro
Asn Asn Ser 180 185 190 Val Ser Ala Lys Trp Gln Val Leu Leu Asp Ser
Ala Val Asp Gly Thr 195 200 205 Arg Leu Glu Lys Phe Ala Ala Lys Met
Pro Leu Val Cys Arg Thr Tyr 210 215 220 Gln Glu Ile Gly Ser Gly Glu
Ser Pro Ile Tyr Ile Asp Phe Pro Ser 225 230 235 240 Gln Pro Gln Asp
Leu Ile Leu Gly Phe Leu Asn Ser Ala Ile Asp Thr 245 250 255 Gln Leu
Arg Glu Met Val Gly Asn Gln Pro Val Val Glu Thr Arg Leu 260 265 270
Met Ala Ser Leu Pro Ser Ala Val Arg Gln Trp Leu Gln Ala Leu Ile 275
280 285 Ala Ala Ser Asn Ser Ile Asp Ala Asp Ala Val Gly Leu Glu Arg
Leu 290 295 300 Glu Ala Ala Leu Lys Ala Trp Thr Met Pro Leu Gln Tyr
Gln Leu Ala 305 310 315 320 Ser Lys Asn Gln Phe Arg Thr Cys Phe Glu
Leu Arg Ser Pro Glu Pro 325 330 335 Asp Glu Thr Glu Trp Thr Leu Ala
Tyr Phe Leu Gln Ala Ala Asp Asp 340 345 350 Pro Glu Phe Leu Val Asp
Ala Ala Thr Ile Trp Gln Asn Pro Val Glu 355 360 365 Gln Leu Ile Tyr
Gln Gln Arg Thr Ile Glu Glu Pro Gln Glu Thr Phe 370 375 380 Leu Arg
Gly Leu Gly Leu Ala Ser Arg Leu Tyr Pro Val Ile Ala Pro 385 390 395
400 Thr Leu Asp Thr Glu Ser Pro Gln Phe Cys His Leu Lys Pro Met Gln
405 410 415 Ala Tyr Glu Phe Ile Lys Ala Val Ala Trp Arg Phe Glu Asp
Ser Gly 420 425 430 Leu Gly Val Ile Leu Pro Pro Ser Leu Ala Asn Arg
Glu Gly Trp Ala 435 440 445 Asn Arg Leu Gly Leu Lys Ile Ser Ala Glu
Thr Pro Lys Lys Lys Pro 450 455 460 Gly Arg Leu Gly Leu Gln Ser Leu
Leu Asn Phe Gln Trp His Leu Ala 465 470 475 480 Ile Gly Gly Gln Thr
Ile Ser Lys Ala Glu Phe Asp Arg Leu Val Ala 485 490 495 Leu Lys Ser
Pro Leu Val Glu Ile Asn Gly Glu Trp Val Glu Leu Arg 500 505 510 Pro
Gln Asp Ile Lys Thr Ala Glu Ala Phe Phe Thr Ala Arg Lys Asp 515 520
525 Gln Met Ala Leu Ser Leu Glu Asp Ala Leu Arg Leu Ser Ser Gly Asp
530 535 540 Thr Gln Val Ile Glu Lys Leu Pro Val Val Ser Phe Glu Ala
Ser Gly 545 550 555 560 Ala Leu Gln Glu Leu Ile Gly Ala Leu Thr Asn
Asn Gln Ala Val Ala 565 570 575 Pro Leu Pro Thr Pro Lys Asn Phe Gln
Gly Gln Leu Arg Pro Tyr Gln 580 585 590 Glu Arg Gly Ala Ala Trp Leu
Ala Phe Leu Glu Arg Trp Gly Leu Gly 595 600 605 Ala Cys Leu Ala Asp
Asp Met Gly Leu Gly Lys Thr Ile Gln Phe Ile 610 615 620 Ala Phe Leu
Leu His Leu Lys Glu Gln Asp Val Leu Glu Lys Pro Thr 625 630 635 640
Leu Leu Val Cys Pro Thr Ser Val Leu Gly Asn Trp Glu Arg Glu Val 645
650 655 Arg Lys Phe Ala Pro Thr Leu Lys Val Leu Gln Tyr His Gly Asp
Lys 660 665 670 Arg Pro Lys Gly Lys Ala Phe Gln Glu Ala Val Lys Lys
His Asp Leu 675 680 685 Val Ile Thr Ser Tyr Ser Leu Ile His Arg Asp
Ile Lys Ser Leu Gln 690 695 700 Gly Ile Pro Trp Gln Ile Ile Val Leu
Asp Glu Ala Gln Asn Val Lys 705 710 715 720 Asn Ala Glu Ala Lys Gln
Ser Gln Ala Val Arg Gln Leu Glu Thr Thr 725 730 735 Phe Arg Ile Ala
Leu Thr Gly Thr Pro Val Glu Asn Arg Leu Gln Glu 740 745 750 Leu Trp
Ser Ile Leu Asp Phe Leu Asn Pro Gly Tyr Leu Gly Asn Lys 755 760 765
Gln Phe Phe Gln Arg Arg Phe Ala Met Pro Ile Glu Lys Tyr Gly Asp 770
775 780 Ala Ala Ser Leu Asn Gln Leu Arg Ala Leu Val Gln Pro Phe Ile
Leu 785 790 795 800 Arg Arg Leu Lys Thr Asp Arg Asp Ile Ile Gln Asp
Leu Pro Asp Lys 805 810 815 Gln Glu Met Thr Val Phe Cys Gly Leu Thr
Gly Glu Gln Ala Ala Leu 820 825 830 Tyr Gln Lys Ala Val Glu Thr Ser
Leu Ala Glu Ile Glu Ser Ala Glu 835 840 845 Gly Leu Gln Arg Arg Gly
Met Ile Leu Ala Leu Leu Ile Lys Leu Lys 850 855 860 Gln Ile Cys Asn
His Pro Ala Gln Tyr Leu Lys Ile Asn Thr Leu Glu 865 870 875 880 Gln
His Ser Ser Gly Lys Leu Gln Arg Leu Glu Glu Met Leu Glu Glu 885 890
895 Val Leu Ala Glu Ser Asn Thr Tyr Gly Val Ala Gly Ala Gly Arg Ala
900 905 910 Leu Ile Phe Thr Gln Phe Ala Glu Trp Gly Lys Leu Leu Lys
Pro His 915 920 925 Leu Glu Lys Gln Leu Gly Arg Glu Ile Phe Phe Leu
Tyr Gly Gly Thr 930 935 940 Ser Lys Lys Gln Arg Glu Glu Met Ile Asp
Arg Phe Gln His Asp Pro 945 950 955 960 Gln Gly Pro Pro Ile Met Ile
Leu Ser Leu Lys Ala Gly Gly Val Gly 965 970 975 Leu Asn Leu Thr Arg
Ala Asn His Val Phe His Phe Asp Arg Trp Trp 980 985 990 Asn Pro Ala
Val Glu Asn Gln Ala Thr Asp Arg Val Phe Arg Ile Gly 995 1000 1005
Gln Thr Arg Asn Val Gln Val His Lys Phe Val Cys Asn Gly Thr 1010
1015 1020 Leu Glu Glu Lys Ile His Asp Met Ile Glu Ser Lys Lys Gln
Leu 1025 1030 1035 Ala Glu Gln Val Val Gly Ala Gly Glu Glu Trp Leu
Thr Glu Leu 1040 1045 1050
Asp Thr Asp Gln Leu Arg Asn Leu Leu Ile Leu Asp Arg Ser Thr 1055
1060 1065 Val Ile Asp Glu Glu Ala Asp 1070 1075 613168DNANostoc sp.
61atgaaagtcc ttcatggctc gtggatacca aaccaatata gcgattttgt gcagtctgga
60gcattttatc tatgggtaga aactccgatt aataacaaaa agcgtactca tacacaagtt
120catcccggac atctatcttc tcttgaatta ctcaattttc tgactcaaac
tttggggatt 180aaagaaactg aagcgcaatt aaaacaacgg atatgttcta
aatattttgc cctaccaact 240gctaataatg agccattacc ttcaccagag
ttagtcaaat atttagaagt agaagttcct 300gaagagtatg aaaattttca
atattggcag gtaacttgtt atgaaactgt tacttctgtg 360aaagcagtga
tagcaattaa tattattaaa ttactcaaag atattcattt tttagccctg
420tacaatgcta gtgaatttca attagggtca gatttattat tttggtatca
ttatacgcaa 480tcatttagac aaataattac taaggatcaa tatattccat
ctttaaaata tagagcgaac 540gcagcgacta caaagaaaaa acctaaacaa
ccacccccag gatttgaaat atatgctggt 600tgggaaataa tttccgagca
atacgaagcc aatattcaaa aatatattga atatatgcca 660ttgatttgtg
tagcaggtaa cagcacacaa actgataaat tagaattttt tgctccagaa
720actctattac gccacttcag cgagtatctg cttaataatt tagtgagtaa
gacaccattg 780accgcagcat ttgaaaaaca aattgatgat tctttaattc
actattgtct ttatccccaa 840aaacacaacc cactcaaaac ccatactgct
ctccaagagt atcagcagtg gttgggatgg 900aaaaacagga ttatccgtac
tcaagctgaa tcaccatttc atctttgctt ccaattacat 960tcacctgatg
ctgaacaaat tgacaattgg cagatgcaat ttttagtatc aagtaaaaaa
1020gatccgtctc taaaattagc tttggcagat tactggataa tgaattccaa
aaccaaagct 1080ggtgtacata aagagtttgg caaagatttc gatactaatt
tactgctgaa tttaggctat 1140gcagcaagaa tgtatcccaa actttggcaa
ggtttagaaa cggactctcc cacaggaatg 1200cagctaagtt tagatgaggc
gtttgatttt ctcaaagata gtgcttgggt gttggaagac 1260tcaggattta
aggtcattgt cccggcttgg tatactccgg ctggtcgtcg tcgtgcgaaa
1320atccgcctca aagcttctag tggtcgcaag gtagctgcta cggtagggga
aagcaaaagt 1380tatttcggtt tagattcact agtgcagtat cagtatgaat
tagcaattgg agagcaaact 1440ctcacacctc aagaatggga acaattgatt
aatactaaag caccactagt gcattttcgc 1500ggtcaatgga tggaattaga
ccgggataaa atgcagcagt tattagaatt ttggcagtcc 1560cacggcgatg
aacagcccca aatgagcttg ttagagttca tgcaacgcag cgcccaaggg
1620gaagatgact gggaaattga atatgatgca gctttatcag aaataatggc
aaagttacaa 1680gataagagtc agctagagcc aatttctgaa gacttaaatt
tgcaaggcaa cctgcgagaa 1740tatcaaaagc ggggtgtagc ctggttacaa
tatttagaaa aattgggatt aaatggctgt 1800ttagccgatg atatgggact
gggtaagtcc gtgcaggtaa ttgcgagatt agtacaggag 1860aaagatagcc
aaagttcccc attaccgaca ttattaattg cgccgacttc ggttgttggt
1920aactggcaaa gagaaattgc taagtttgca ccccatttaa aaactatggt
gcatcatggt 1980agcgatcgcc tgcaagatgc tgcggagttt aagtccgcct
gtcaacagca tgatgtggtg 2040ataagttcct ttactttggc tcgcttagat
gaaaaactcc taaatagtgt gacatggcaa 2100cggttagttt tagatgaagc
acaaaacatt aaaaatccca aagcagcgca gactaaagct 2160atactcaaac
tcagtgctaa acaccgtcta gctttaactg gtacaccagt tgagaaccgc
2220ttacttgatt tgtggtcaat ttttaatttt ctcaatcccg gttatttagg
gaaagaagca 2280cagtttcgca aatcctttga aattcccatc cagaaggaca
acgataaagt aaaatcgact 2340accttaaaga aactggttga accgttaatt
ttacgacggg tcaaaacaga ccaatcaatt 2400attaaagact taccagataa
agttgaacaa aaactctata ccaacctcac caaagaacag 2460gcttcgctat
atgaagtggt agtcagagat gtggaagaaa aattgcaaga agctgaggga
2520atacaacgca aaggtttaat tctctcaacg ctgatgaaat taaaacagat
ttgcaatcat 2580cccagacagt tcctccaaga taatagcgaa tttttaccgg
agcgctcgca caaactttcc 2640cgcttagtcg aaatggtaga tgaagccatt
tctgaaggag aaagtctttt aatatttagt 2700caatttacag aagtctgcga
acaaatagaa aaatatctca aacacaactt acattgcaat 2760acctactacc
tacatggggg tacaagtcgc caacgtcggg aacaaatgat tagtgacttt
2820caaaatcctg atacggaagc atctgtattt gtcctttccc taaaagctgg
cggcgtgggg 2880attactttaa ctaaagccaa ccacgtcttt cattttgacc
gttggtggaa tccagccgtt 2940gaagaccaag ccacagaccg cgcttttcgc
ataggtcaga aaaaaaatgt gtttgtacat 3000aaatttgtcg cccttgggac
tttagaagaa agaatcgacc aaatgattga agataagaaa 3060aaactttctt
ccgccgtagt tggtagtgat gaatcgtggc taaccgaatt agataacgaa
3120gcctttaaga aactaattgc cttgaataaa agcacaatta tggagtag
3168621055PRTNostoc sp. 62Met Lys Val Leu His Gly Ser Trp Ile Pro
Asn Gln Tyr Ser Asp Phe 1 5 10 15 Val Gln Ser Gly Ala Phe Tyr Leu
Trp Val Glu Thr Pro Ile Asn Asn 20 25 30 Lys Lys Arg Thr His Thr
Gln Val His Pro Gly His Leu Ser Ser Leu 35 40 45 Glu Leu Leu Asn
Phe Leu Thr Gln Thr Leu Gly Ile Lys Glu Thr Glu 50 55 60 Ala Gln
Leu Lys Gln Arg Ile Cys Ser Lys Tyr Phe Ala Leu Pro Thr 65 70 75 80
Ala Asn Asn Glu Pro Leu Pro Ser Pro Glu Leu Val Lys Tyr Leu Glu 85
90 95 Val Glu Val Pro Glu Glu Tyr Glu Asn Phe Gln Tyr Trp Gln Val
Thr 100 105 110 Cys Tyr Glu Thr Val Thr Ser Val Lys Ala Val Ile Ala
Ile Asn Ile 115 120 125 Ile Lys Leu Leu Lys Asp Ile His Phe Leu Ala
Leu Tyr Asn Ala Ser 130 135 140 Glu Phe Gln Leu Gly Ser Asp Leu Leu
Phe Trp Tyr His Tyr Thr Gln 145 150 155 160 Ser Phe Arg Gln Ile Ile
Thr Lys Asp Gln Tyr Ile Pro Ser Leu Lys 165 170 175 Tyr Arg Ala Asn
Ala Ala Thr Thr Lys Lys Lys Pro Lys Gln Pro Pro 180 185 190 Pro Gly
Phe Glu Ile Tyr Ala Gly Trp Glu Ile Ile Ser Glu Gln Tyr 195 200 205
Glu Ala Asn Ile Gln Lys Tyr Ile Glu Tyr Met Pro Leu Ile Cys Val 210
215 220 Ala Gly Asn Ser Thr Gln Thr Asp Lys Leu Glu Phe Phe Ala Pro
Glu 225 230 235 240 Thr Leu Leu Arg His Phe Ser Glu Tyr Leu Leu Asn
Asn Leu Val Ser 245 250 255 Lys Thr Pro Leu Thr Ala Ala Phe Glu Lys
Gln Ile Asp Asp Ser Leu 260 265 270 Ile His Tyr Cys Leu Tyr Pro Gln
Lys His Asn Pro Leu Lys Thr His 275 280 285 Thr Ala Leu Gln Glu Tyr
Gln Gln Trp Leu Gly Trp Lys Asn Arg Ile 290 295 300 Ile Arg Thr Gln
Ala Glu Ser Pro Phe His Leu Cys Phe Gln Leu His 305 310 315 320 Ser
Pro Asp Ala Glu Gln Ile Asp Asn Trp Gln Met Gln Phe Leu Val 325 330
335 Ser Ser Lys Lys Asp Pro Ser Leu Lys Leu Ala Leu Ala Asp Tyr Trp
340 345 350 Ile Met Asn Ser Lys Thr Lys Ala Gly Val His Lys Glu Phe
Gly Lys 355 360 365 Asp Phe Asp Thr Asn Leu Leu Leu Asn Leu Gly Tyr
Ala Ala Arg Met 370 375 380 Tyr Pro Lys Leu Trp Gln Gly Leu Glu Thr
Asp Ser Pro Thr Gly Met 385 390 395 400 Gln Leu Ser Leu Asp Glu Ala
Phe Asp Phe Leu Lys Asp Ser Ala Trp 405 410 415 Val Leu Glu Asp Ser
Gly Phe Lys Val Ile Val Pro Ala Trp Tyr Thr 420 425 430 Pro Ala Gly
Arg Arg Arg Ala Lys Ile Arg Leu Lys Ala Ser Ser Gly 435 440 445 Arg
Lys Val Ala Ala Thr Val Gly Glu Ser Lys Ser Tyr Phe Gly Leu 450 455
460 Asp Ser Leu Val Gln Tyr Gln Tyr Glu Leu Ala Ile Gly Glu Gln Thr
465 470 475 480 Leu Thr Pro Gln Glu Trp Glu Gln Leu Ile Asn Thr Lys
Ala Pro Leu 485 490 495 Val His Phe Arg Gly Gln Trp Met Glu Leu Asp
Arg Asp Lys Met Gln 500 505 510 Gln Leu Leu Glu Phe Trp Gln Ser His
Gly Asp Glu Gln Pro Gln Met 515 520 525 Ser Leu Leu Glu Phe Met Gln
Arg Ser Ala Gln Gly Glu Asp Asp Trp 530 535 540 Glu Ile Glu Tyr Asp
Ala Ala Leu Ser Glu Ile Met Ala Lys Leu Gln 545 550 555 560 Asp Lys
Ser Gln Leu Glu Pro Ile Ser Glu Asp Leu Asn Leu Gln Gly 565 570 575
Asn Leu Arg Glu Tyr Gln Lys Arg Gly Val Ala Trp Leu Gln Tyr Leu 580
585 590 Glu Lys Leu Gly Leu Asn Gly Cys Leu Ala Asp Asp Met Gly Leu
Gly 595 600 605 Lys Ser Val Gln Val Ile Ala Arg Leu Val Gln Glu Lys
Asp Ser Gln 610 615 620 Ser Ser Pro Leu Pro Thr Leu Leu Ile Ala Pro
Thr Ser Val Val Gly 625 630 635 640 Asn Trp Gln Arg Glu Ile Ala Lys
Phe Ala Pro His Leu Lys Thr Met 645 650 655 Val His His Gly Ser Asp
Arg Leu Gln Asp Ala Ala Glu Phe Lys Ser 660 665 670 Ala Cys Gln Gln
His Asp Val Val Ile Ser Ser Phe Thr Leu Ala Arg 675 680 685 Leu Asp
Glu Lys Leu Leu Asn Ser Val Thr Trp Gln Arg Leu Val Leu 690 695 700
Asp Glu Ala Gln Asn Ile Lys Asn Pro Lys Ala Ala Gln Thr Lys Ala 705
710 715 720 Ile Leu Lys Leu Ser Ala Lys His Arg Leu Ala Leu Thr Gly
Thr Pro 725 730 735 Val Glu Asn Arg Leu Leu Asp Leu Trp Ser Ile Phe
Asn Phe Leu Asn 740 745 750 Pro Gly Tyr Leu Gly Lys Glu Ala Gln Phe
Arg Lys Ser Phe Glu Ile 755 760 765 Pro Ile Gln Lys Asp Asn Asp Lys
Val Lys Ser Thr Thr Leu Lys Lys 770 775 780 Leu Val Glu Pro Leu Ile
Leu Arg Arg Val Lys Thr Asp Gln Ser Ile 785 790 795 800 Ile Lys Asp
Leu Pro Asp Lys Val Glu Gln Lys Leu Tyr Thr Asn Leu 805 810 815 Thr
Lys Glu Gln Ala Ser Leu Tyr Glu Val Val Val Arg Asp Val Glu 820 825
830 Glu Lys Leu Gln Glu Ala Glu Gly Ile Gln Arg Lys Gly Leu Ile Leu
835 840 845 Ser Thr Leu Met Lys Leu Lys Gln Ile Cys Asn His Pro Arg
Gln Phe 850 855 860 Leu Gln Asp Asn Ser Glu Phe Leu Pro Glu Arg Ser
His Lys Leu Ser 865 870 875 880 Arg Leu Val Glu Met Val Asp Glu Ala
Ile Ser Glu Gly Glu Ser Leu 885 890 895 Leu Ile Phe Ser Gln Phe Thr
Glu Val Cys Glu Gln Ile Glu Lys Tyr 900 905 910 Leu Lys His Asn Leu
His Cys Asn Thr Tyr Tyr Leu His Gly Gly Thr 915 920 925 Ser Arg Gln
Arg Arg Glu Gln Met Ile Ser Asp Phe Gln Asn Pro Asp 930 935 940 Thr
Glu Ala Ser Val Phe Val Leu Ser Leu Lys Ala Gly Gly Val Gly 945 950
955 960 Ile Thr Leu Thr Lys Ala Asn His Val Phe His Phe Asp Arg Trp
Trp 965 970 975 Asn Pro Ala Val Glu Asp Gln Ala Thr Asp Arg Ala Phe
Arg Ile Gly 980 985 990 Gln Lys Lys Asn Val Phe Val His Lys Phe Val
Ala Leu Gly Thr Leu 995 1000 1005 Glu Glu Arg Ile Asp Gln Met Ile
Glu Asp Lys Lys Lys Leu Ser 1010 1015 1020 Ser Ala Val Val Gly Ser
Asp Glu Ser Trp Leu Thr Glu Leu Asp 1025 1030 1035 Asn Glu Ala Phe
Lys Lys Leu Ile Ala Leu Asn Lys Ser Thr Ile 1040 1045 1050 Met Glu
1055 632856DNANostoc punctiforme 63atggcgattt tacacagtaa ttggttacta
aaaagtcaaa aaggttgttt atttatttgg 60ggagaaactt ggcgatcgcc acgagttaat
ttcgagtcta atggatctgg agatatccca 120ctaaatccat tggcaatgac
atcactagag ttgagcgagt ggttggtttc ccagaagatg 180gccattacca
actttatcca gcaaccccaa attgccatcg ctactactgg gcgaacacgt
240aaagcagcca ctgccactga gataaactta ccaacgcatt cacaaataat
tgccttacca 300acttatattc ccgaagagag tgcagaagga acatctgcaa
ttttccctgt gcattctgcc 360agcttgagac tagaaacaga ctctccgcaa
tatttgcaac cgtggctagt tgagggtttt 420tgtcttaacc ccagcgaagc
agtaaaattt ctcgctgctg ttcccctgaa tgctgctaaa 480ggggaagatg
cttttttagg aggagattta cgtttttggt cgcaagtttc ccgatggagt
540ttagatttaa tctcgcggtg taagttttta ccaagaattg aacggcaatc
agacggtgca 600tttgctgcta aatggcaagt acttctagac agtgctgtag
atggaactcg cctagaaaag 660ttttctgcgg atatgccgtt ggtttgccgc
acttatcagg agggagtggg gactggggac 720tggggactga ggactgggga
ggagttttcc caatccctaa tccctaattc ccaatcccta 780ctttatgtaa
acttccctac tgaacctcaa gaattgttgc tgggatttct caacagtacg
840atagatgccc aagtgcgagg gatggtgggt tctcagcctc caatggaagc
taaggcaatg 900gcatctttac catctggggt gcggcagtgg ttgcaaggct
tgactagtac atctggtaca 960gttaacgcag atgccattga agtggaacga
ctggaagcgg cactgaaggc ttggatgatg 1020ccgctacaat accaattaac
tcttaaaact ctatttcgta cctgttttca actgcgttct 1080ccagaagctg
gcgaaacaga ttggacattg gcgtattttc tgcaagcggc tgacgatcct
1140gattttttgg tggatgcggc aactatttgg aacaatccag ttgaacgttt
ggtttatgaa 1200aatcgaacaa ttgagcaacc acaggaaaca tttttgcgag
gtttaggggt agcttcccga 1260ttatatccag cgatcgcacc cagttttgaa
accgaatatc cccaatcttc tcggatcaca 1320cccatgcaag cttatgagtt
tatcaaggct gtagcttgga ggttggaaga cagtggtttg 1380ggggtaattt
tgcctcctag tttagcgaac cgcgaaggat gggcaaatcg tttgggtttg
1440aaaattactg ctgaaacccc aaagaaaaag cagggacgtt tagggttgca
aagtctgctg 1500aatttccaat ggcaattggc aattggcgga cagactattt
ccaaagctga gtttgataaa 1560cttgtggctt taaatagtcc actagtggaa
attaacggtg agtgggtaga attgcggccc 1620caagatatca agacagccca
aacatttttt accactcgca aagaccaaat ggcgctttcc 1680ttggaagatg
ccttgcgttt cagtacagga gatacccagg taattgaaaa attaccagtg
1740gtcagctttg aggcatctgg ggcattgcaa gagttgattg gggcgctaaa
taataatcaa 1800gcgatcgcac ctttaccgac accagtaggc tttaaaggac
agttgcgacc ttatcaagaa 1860cgtggtgctg cttggctgtc cttcttggaa
cgttggggct taggcgcgtg tctcgccgac 1920gatatgggac tcggtaaaac
tattcagttt attgcttttt tgctacatct taaagaacag 1980gatgcactag
aaaattcaac actgctagtt tgtccaactt ctgttttagg caactgggaa
2040agggaagtca ataaatttgc accaagcctg aaaattttgc aatatcacgg
tgacaaacgt 2100ccaaaaggga aagcgttttt agaagcagtg aaaaatcacg
atttaatcgt taccagctac 2160tcactgcttc atcgggatat caagtcattg
caaagtgttc cttggcagat aattgtttta 2220gacgaagccc agaatgtgaa
aaatccagag gcgaagcagt caaaagctgt gcggcaatta 2280gaagctacat
ttcgcattgc attaacgggg acaccagtag aaaatagact gcaagaacta
2340tggtctattt tggattttct caatccaggg tatttaggta ataagcaatt
tttccagcgg 2400cggtttgcca tgccaattga aaagtatggt gatacggctt
ctttgggtca attacgttca 2460ttagttcagc catttatact gcggcgatta
aaaagcgatc gcgaaattat tcaagacttg 2520ccagataagc aagagatgac
cgtattttgc ggtttaactg ccgaccaagc tgcactttat 2580caacaagttg
tagaacaatc tttagtagag atagaatctg ctgaaggatt gcaacgtcgg
2640gggatgattt tggctttgct aatcaaactg aagcaaatct gcaatcatcc
agcccaatat 2700ttgaaacagg cgacattaga gcaacataat tcagccaaac
ttctgcggct agaagaaatg 2760ttagaagaag ttttagcaga aagtgaccgg
gctttaatct ttacacaatt tgcagagtgg 2820ggtaagttac ttaaacccaa
aagtgttgaa tgttaa 285664951PRTNostoc punctiforme 64Met Ala Ile Leu
His Ser Asn Trp Leu Leu Lys Ser Gln Lys Gly Cys 1 5 10 15 Leu Phe
Ile Trp Gly Glu Thr Trp Arg Ser Pro Arg Val Asn Phe Glu 20 25 30
Ser Asn Gly Ser Gly Asp Ile Pro Leu Asn Pro Leu Ala Met Thr Ser 35
40 45 Leu Glu Leu Ser Glu Trp Leu Val Ser Gln Lys Met Ala Ile Thr
Asn 50 55 60 Phe Ile Gln Gln Pro Gln Ile Ala Ile Ala Thr Thr Gly
Arg Thr Arg 65 70 75 80 Lys Ala Ala Thr Ala Thr Glu Ile Asn Leu Pro
Thr His Ser Gln Ile 85 90 95 Ile Ala Leu Pro Thr Tyr Ile Pro Glu
Glu Ser Ala Glu Gly Thr Ser 100 105 110 Ala Ile Phe Pro Val His Ser
Ala Ser Leu Arg Leu Glu Thr Asp Ser 115 120 125 Pro Gln Tyr Leu Gln
Pro Trp Leu Val Glu Gly Phe Cys Leu Asn Pro 130 135 140 Ser Glu Ala
Val Lys Phe Leu Ala Ala Val Pro Leu Asn Ala Ala Lys 145 150 155 160
Gly Glu Asp Ala Phe Leu Gly Gly Asp Leu Arg Phe Trp Ser Gln Val 165
170 175 Ser Arg Trp Ser Leu Asp Leu Ile Ser Arg Cys Lys Phe Leu Pro
Arg 180 185 190 Ile Glu Arg Gln Ser Asp Gly Ala Phe Ala Ala Lys Trp
Gln Val Leu 195 200 205 Leu Asp Ser Ala Val Asp Gly Thr Arg Leu Glu
Lys Phe Ser Ala Asp 210 215 220 Met Pro Leu Val Cys Arg Thr Tyr Gln
Glu Gly Val Gly Thr Gly Asp 225 230 235 240 Trp Gly Leu Arg Thr Gly
Glu Glu Phe Ser Gln Ser Leu Ile Pro Asn 245 250 255 Ser Gln Ser Leu
Leu Tyr Val Asn Phe Pro Thr Glu Pro Gln Glu Leu 260 265 270 Leu Leu
Gly Phe Leu Asn Ser Thr Ile Asp Ala Gln Val Arg Gly Met
275 280 285 Val Gly Ser Gln Pro Pro Met Glu Ala Lys Ala Met Ala Ser
Leu Pro 290 295 300 Ser Gly Val Arg Gln Trp Leu Gln Gly Leu Thr Ser
Thr Ser Gly Thr 305 310 315 320 Val Asn Ala Asp Ala Ile Glu Val Glu
Arg Leu Glu Ala Ala Leu Lys 325 330 335 Ala Trp Met Met Pro Leu Gln
Tyr Gln Leu Thr Leu Lys Thr Leu Phe 340 345 350 Arg Thr Cys Phe Gln
Leu Arg Ser Pro Glu Ala Gly Glu Thr Asp Trp 355 360 365 Thr Leu Ala
Tyr Phe Leu Gln Ala Ala Asp Asp Pro Asp Phe Leu Val 370 375 380 Asp
Ala Ala Thr Ile Trp Asn Asn Pro Val Glu Arg Leu Val Tyr Glu 385 390
395 400 Asn Arg Thr Ile Glu Gln Pro Gln Glu Thr Phe Leu Arg Gly Leu
Gly 405 410 415 Val Ala Ser Arg Leu Tyr Pro Ala Ile Ala Pro Ser Phe
Glu Thr Glu 420 425 430 Tyr Pro Gln Ser Ser Arg Ile Thr Pro Met Gln
Ala Tyr Glu Phe Ile 435 440 445 Lys Ala Val Ala Trp Arg Leu Glu Asp
Ser Gly Leu Gly Val Ile Leu 450 455 460 Pro Pro Ser Leu Ala Asn Arg
Glu Gly Trp Ala Asn Arg Leu Gly Leu 465 470 475 480 Lys Ile Thr Ala
Glu Thr Pro Lys Lys Lys Gln Gly Arg Leu Gly Leu 485 490 495 Gln Ser
Leu Leu Asn Phe Gln Trp Gln Leu Ala Ile Gly Gly Gln Thr 500 505 510
Ile Ser Lys Ala Glu Phe Asp Lys Leu Val Ala Leu Asn Ser Pro Leu 515
520 525 Val Glu Ile Asn Gly Glu Trp Val Glu Leu Arg Pro Gln Asp Ile
Lys 530 535 540 Thr Ala Gln Thr Phe Phe Thr Thr Arg Lys Asp Gln Met
Ala Leu Ser 545 550 555 560 Leu Glu Asp Ala Leu Arg Phe Ser Thr Gly
Asp Thr Gln Val Ile Glu 565 570 575 Lys Leu Pro Val Val Ser Phe Glu
Ala Ser Gly Ala Leu Gln Glu Leu 580 585 590 Ile Gly Ala Leu Asn Asn
Asn Gln Ala Ile Ala Pro Leu Pro Thr Pro 595 600 605 Val Gly Phe Lys
Gly Gln Leu Arg Pro Tyr Gln Glu Arg Gly Ala Ala 610 615 620 Trp Leu
Ser Phe Leu Glu Arg Trp Gly Leu Gly Ala Cys Leu Ala Asp 625 630 635
640 Asp Met Gly Leu Gly Lys Thr Ile Gln Phe Ile Ala Phe Leu Leu His
645 650 655 Leu Lys Glu Gln Asp Ala Leu Glu Asn Ser Thr Leu Leu Val
Cys Pro 660 665 670 Thr Ser Val Leu Gly Asn Trp Glu Arg Glu Val Asn
Lys Phe Ala Pro 675 680 685 Ser Leu Lys Ile Leu Gln Tyr His Gly Asp
Lys Arg Pro Lys Gly Lys 690 695 700 Ala Phe Leu Glu Ala Val Lys Asn
His Asp Leu Ile Val Thr Ser Tyr 705 710 715 720 Ser Leu Leu His Arg
Asp Ile Lys Ser Leu Gln Ser Val Pro Trp Gln 725 730 735 Ile Ile Val
Leu Asp Glu Ala Gln Asn Val Lys Asn Pro Glu Ala Lys 740 745 750 Gln
Ser Lys Ala Val Arg Gln Leu Glu Ala Thr Phe Arg Ile Ala Leu 755 760
765 Thr Gly Thr Pro Val Glu Asn Arg Leu Gln Glu Leu Trp Ser Ile Leu
770 775 780 Asp Phe Leu Asn Pro Gly Tyr Leu Gly Asn Lys Gln Phe Phe
Gln Arg 785 790 795 800 Arg Phe Ala Met Pro Ile Glu Lys Tyr Gly Asp
Thr Ala Ser Leu Gly 805 810 815 Gln Leu Arg Ser Leu Val Gln Pro Phe
Ile Leu Arg Arg Leu Lys Ser 820 825 830 Asp Arg Glu Ile Ile Gln Asp
Leu Pro Asp Lys Gln Glu Met Thr Val 835 840 845 Phe Cys Gly Leu Thr
Ala Asp Gln Ala Ala Leu Tyr Gln Gln Val Val 850 855 860 Glu Gln Ser
Leu Val Glu Ile Glu Ser Ala Glu Gly Leu Gln Arg Arg 865 870 875 880
Gly Met Ile Leu Ala Leu Leu Ile Lys Leu Lys Gln Ile Cys Asn His 885
890 895 Pro Ala Gln Tyr Leu Lys Gln Ala Thr Leu Glu Gln His Asn Ser
Ala 900 905 910 Lys Leu Leu Arg Leu Glu Glu Met Leu Glu Glu Val Leu
Ala Glu Ser 915 920 925 Asp Arg Ala Leu Ile Phe Thr Gln Phe Ala Glu
Trp Gly Lys Leu Leu 930 935 940 Lys Pro Lys Ser Val Glu Cys 945 950
653024DNAPelodictyon phaeoclathratiforme 65atgattgcgc tgcacatctc
catcattgac ggagtcccgc tactctggag tgagggaaaa 60aagatcggga tgctgaagga
gttacgcctc gcaacggctg gaatcggcat gttttccctg 120ctcgacaaca
ccacaaaaga gttttgtgtc tggctgccct gccgcgagaa aaaagctgtc
180ccatcatctc cgcttgtcgg cgccatgccc gacctgagtg atgaagagca
actccatgcc 240tttccgatta ccgcgcttcg gctgaatttc aacgctctgt
tcgagctttc cctgcttacg 300gaaaagggca acatccccgg cagtggcatc
atcttcggaa gctctctcca ctgggcacgg 360caggtagtaa aaattgcact
gaacattgtc agaacccagt cgctgctccc ttcgatcatc 420aaaaacgata
cattctggga ggccttgtgg ttgcccctcc ccgacagtgc cacatccctc
480gcagttgaac agcttgccga tgccatgcct gcggtctgtc gctctctcgg
ccgcaccgac 540acgcaaccgc cggaaacacc aaaaaagtta ctgctcaaag
gacttctctc tttccttgtc 600aatacactgt cacgtacttt tgaaagagca
ggggtgccaa aaatcagtga cttcgagagt 660atccatgacg cgtggcttca
tgcattatca aacagtgatc cccggctgaa atggaaaaat 720gagcaggaga
ttgagcagtt tgcctgtcag ctcaacgcat ggcggcgtcc cattgacctg
780catgagcgat cacccttcag gttttgcctg caactgacag agccaccact
gaaagggcgg 840aaaaaggagc gctggcatgt tgcctatcaa ctgcagttga
aagcggatcc aagcctgatt 900cttgacgccg gggatctctg gaaccccgaa
agcgaggcat cacagcacgc tttaacgtat 960acctccgatt gtaccgaatt
cctgcttact tccctgggac aagcctccgg cctctgcccc 1020gcagtcaccc
aaagcctgaa aaagaagcag ccgggtggct ttgatcttga taccgaaggg
1080gcttacagat ttttgctgga gtatgcggaa ctgttgcgaa gcgcaggatt
tgtggtcaag 1140cttccctcgt ggtggatcgg tcgcagagga gtcaaccgta
tcgggatcaa gacaaaagtg 1200aagcttccct ctatgaaagg aagcgggtcg
ggtctcacgc tggatcgcat ggttgcctgc 1260gattatgctg ctgcacttgg
caatgaggag cttgacctgc aggagctgaa aacactggca 1320aacctgaaag
ttccgctggt acgggtgcgc ggacagtgga cacagattga ccataaggag
1380cttgccaatg ctctccattt tcttgaaaaa catccaactg gtgaactttc
tgccagagaa 1440ctcctctcaa cagctctcgg agcacaaaaa aaggaggatg
ctctctttct tcgatcggtt 1500gaaatcgagg ggtggcttca ggaactgctt
gaaaaacttt cctctcaggg acaatttgaa 1560ctgcttccac cacctgagca
tttcgaggga acgcttcgcc tctatcagga gcgaggcttt 1620tcatggctct
catttctccg caagtgggga ctgggcgcct gtcttgccga cgacatgggc
1680cttggcaaaa ccattcagac gcttgcactg ctgcagcggg agcgtgaact
tggagaaaaa 1740agggcggtgc tcctgatctg ccccacctct gtagtcaaca
actggcgaaa ggaggcggag 1800cggttcactc cggatttagc ggtgctggtg
catcatggta tcgaccggat gaaaacagca 1860gattttcgca aagctgcaag
cgcttcagcc cttgtcattt caagctatgg attgttacag 1920cgcgaccttg
aatttctgtc gaaggttccc tgggcaggca ttattctcga tgaagcgcag
1980aacatcaaaa accctgagac aaaacagtca aaagctgccc gaacaatccg
ggctgattac 2040cgtattgccc tgaccggcac tcccgttgaa aatcatgtcg
gcgacctttg ggcactcatg 2100gattttctca atcccggttt tcttggaacc
cagcactttt tcaaacagaa cttctacacg 2160ccgattcagt ggtatggcga
ccctgaggct tcagcacgac tgaagtcgct gaccggcccg 2220tttattctgc
gccgcatgaa aagcgacaag tcgattattt ccgatctgcc cgacaagatc
2280gaaatgaaag agtattgctc gctgaccaaa gagcaggcat cgctctacaa
ggctgttgtc 2340gatgaactgc aggagaaaat tgaaagcgcc gaagggattg
accggcgggg ccttgtactt 2400gcgctgctgg tcaagctcaa gcaggtctgc
aaccatccgg cacatttgct tggcgacaac 2460tctgccattg cacatcgttc
aggaaaaata aaacgcctga ccgaactgct tggcgacatc 2520cgcgaagctg
gcgaaaaaac gctgctcttt acacagttta ccatgatggg aacgatgctc
2580cagcactatc ttcaggagtt gtacggtgaa gaggtactgt ttctgcacgg
tggcgtaacc 2640aaaaaaaggc gggatgagat ggtagagagc ttccagaagg
aagagggcag ttcaccctcc 2700atctttattc tctcactgaa agccggagga
acgggtctta acctgacaac agcgaaccac 2760gttgttcact ttgaccgatg
gtggaacccg gcagtagaga atcaggcaac tgaccgggct 2820ttccgtatcg
ggcagcacaa aaacgttgaa gttcataaat ttattacgac gggcacgctc
2880gaagagcgca ttgatgagat gattgagaaa aaaacaacgg tcgccggcca
ggttctcgga 2940acgggtgagc agtggctgac cgaactgtcg aacaatgatc
tgcgcaagct cattatgctc 3000ggacaggaag caatgggaga ataa
3024661007PRTPelodictyon phaeoclathratiforme 66Met Ile Ala Leu His
Ile Ser Ile Ile Asp Gly Val Pro Leu Leu Trp 1 5 10 15 Ser Glu Gly
Lys Lys Ile Gly Met Leu Lys Glu Leu Arg Leu Ala Thr 20 25 30 Ala
Gly Ile Gly Met Phe Ser Leu Leu Asp Asn Thr Thr Lys Glu Phe 35 40
45 Cys Val Trp Leu Pro Cys Arg Glu Lys Lys Ala Val Pro Ser Ser Pro
50 55 60 Leu Val Gly Ala Met Pro Asp Leu Ser Asp Glu Glu Gln Leu
His Ala 65 70 75 80 Phe Pro Ile Thr Ala Leu Arg Leu Asn Phe Asn Ala
Leu Phe Glu Leu 85 90 95 Ser Leu Leu Thr Glu Lys Gly Asn Ile Pro
Gly Ser Gly Ile Ile Phe 100 105 110 Gly Ser Ser Leu His Trp Ala Arg
Gln Val Val Lys Ile Ala Leu Asn 115 120 125 Ile Val Arg Thr Gln Ser
Leu Leu Pro Ser Ile Ile Lys Asn Asp Thr 130 135 140 Phe Trp Glu Ala
Leu Trp Leu Pro Leu Pro Asp Ser Ala Thr Ser Leu 145 150 155 160 Ala
Val Glu Gln Leu Ala Asp Ala Met Pro Ala Val Cys Arg Ser Leu 165 170
175 Gly Arg Thr Asp Thr Gln Pro Pro Glu Thr Pro Lys Lys Leu Leu Leu
180 185 190 Lys Gly Leu Leu Ser Phe Leu Val Asn Thr Leu Ser Arg Thr
Phe Glu 195 200 205 Arg Ala Gly Val Pro Lys Ile Ser Asp Phe Glu Ser
Ile His Asp Ala 210 215 220 Trp Leu His Ala Leu Ser Asn Ser Asp Pro
Arg Leu Lys Trp Lys Asn 225 230 235 240 Glu Gln Glu Ile Glu Gln Phe
Ala Cys Gln Leu Asn Ala Trp Arg Arg 245 250 255 Pro Ile Asp Leu His
Glu Arg Ser Pro Phe Arg Phe Cys Leu Gln Leu 260 265 270 Thr Glu Pro
Pro Leu Lys Gly Arg Lys Lys Glu Arg Trp His Val Ala 275 280 285 Tyr
Gln Leu Gln Leu Lys Ala Asp Pro Ser Leu Ile Leu Asp Ala Gly 290 295
300 Asp Leu Trp Asn Pro Glu Ser Glu Ala Ser Gln His Ala Leu Thr Tyr
305 310 315 320 Thr Ser Asp Cys Thr Glu Phe Leu Leu Thr Ser Leu Gly
Gln Ala Ser 325 330 335 Gly Leu Cys Pro Ala Val Thr Gln Ser Leu Lys
Lys Lys Gln Pro Gly 340 345 350 Gly Phe Asp Leu Asp Thr Glu Gly Ala
Tyr Arg Phe Leu Leu Glu Tyr 355 360 365 Ala Glu Leu Leu Arg Ser Ala
Gly Phe Val Val Lys Leu Pro Ser Trp 370 375 380 Trp Ile Gly Arg Arg
Gly Val Asn Arg Ile Gly Ile Lys Thr Lys Val 385 390 395 400 Lys Leu
Pro Ser Met Lys Gly Ser Gly Ser Gly Leu Thr Leu Asp Arg 405 410 415
Met Val Ala Cys Asp Tyr Ala Ala Ala Leu Gly Asn Glu Glu Leu Asp 420
425 430 Leu Gln Glu Leu Lys Thr Leu Ala Asn Leu Lys Val Pro Leu Val
Arg 435 440 445 Val Arg Gly Gln Trp Thr Gln Ile Asp His Lys Glu Leu
Ala Asn Ala 450 455 460 Leu His Phe Leu Glu Lys His Pro Thr Gly Glu
Leu Ser Ala Arg Glu 465 470 475 480 Leu Leu Ser Thr Ala Leu Gly Ala
Gln Lys Lys Glu Asp Ala Leu Phe 485 490 495 Leu Arg Ser Val Glu Ile
Glu Gly Trp Leu Gln Glu Leu Leu Glu Lys 500 505 510 Leu Ser Ser Gln
Gly Gln Phe Glu Leu Leu Pro Pro Pro Glu His Phe 515 520 525 Glu Gly
Thr Leu Arg Leu Tyr Gln Glu Arg Gly Phe Ser Trp Leu Ser 530 535 540
Phe Leu Arg Lys Trp Gly Leu Gly Ala Cys Leu Ala Asp Asp Met Gly 545
550 555 560 Leu Gly Lys Thr Ile Gln Thr Leu Ala Leu Leu Gln Arg Glu
Arg Glu 565 570 575 Leu Gly Glu Lys Arg Ala Val Leu Leu Ile Cys Pro
Thr Ser Val Val 580 585 590 Asn Asn Trp Arg Lys Glu Ala Glu Arg Phe
Thr Pro Asp Leu Ala Val 595 600 605 Leu Val His His Gly Ile Asp Arg
Met Lys Thr Ala Asp Phe Arg Lys 610 615 620 Ala Ala Ser Ala Ser Ala
Leu Val Ile Ser Ser Tyr Gly Leu Leu Gln 625 630 635 640 Arg Asp Leu
Glu Phe Leu Ser Lys Val Pro Trp Ala Gly Ile Ile Leu 645 650 655 Asp
Glu Ala Gln Asn Ile Lys Asn Pro Glu Thr Lys Gln Ser Lys Ala 660 665
670 Ala Arg Thr Ile Arg Ala Asp Tyr Arg Ile Ala Leu Thr Gly Thr Pro
675 680 685 Val Glu Asn His Val Gly Asp Leu Trp Ala Leu Met Asp Phe
Leu Asn 690 695 700 Pro Gly Phe Leu Gly Thr Gln His Phe Phe Lys Gln
Asn Phe Tyr Thr 705 710 715 720 Pro Ile Gln Trp Tyr Gly Asp Pro Glu
Ala Ser Ala Arg Leu Lys Ser 725 730 735 Leu Thr Gly Pro Phe Ile Leu
Arg Arg Met Lys Ser Asp Lys Ser Ile 740 745 750 Ile Ser Asp Leu Pro
Asp Lys Ile Glu Met Lys Glu Tyr Cys Ser Leu 755 760 765 Thr Lys Glu
Gln Ala Ser Leu Tyr Lys Ala Val Val Asp Glu Leu Gln 770 775 780 Glu
Lys Ile Glu Ser Ala Glu Gly Ile Asp Arg Arg Gly Leu Val Leu 785 790
795 800 Ala Leu Leu Val Lys Leu Lys Gln Val Cys Asn His Pro Ala His
Leu 805 810 815 Leu Gly Asp Asn Ser Ala Ile Ala His Arg Ser Gly Lys
Ile Lys Arg 820 825 830 Leu Thr Glu Leu Leu Gly Asp Ile Arg Glu Ala
Gly Glu Lys Thr Leu 835 840 845 Leu Phe Thr Gln Phe Thr Met Met Gly
Thr Met Leu Gln His Tyr Leu 850 855 860 Gln Glu Leu Tyr Gly Glu Glu
Val Leu Phe Leu His Gly Gly Val Thr 865 870 875 880 Lys Lys Arg Arg
Asp Glu Met Val Glu Ser Phe Gln Lys Glu Glu Gly 885 890 895 Ser Ser
Pro Ser Ile Phe Ile Leu Ser Leu Lys Ala Gly Gly Thr Gly 900 905 910
Leu Asn Leu Thr Thr Ala Asn His Val Val His Phe Asp Arg Trp Trp 915
920 925 Asn Pro Ala Val Glu Asn Gln Ala Thr Asp Arg Ala Phe Arg Ile
Gly 930 935 940 Gln His Lys Asn Val Glu Val His Lys Phe Ile Thr Thr
Gly Thr Leu 945 950 955 960 Glu Glu Arg Ile Asp Glu Met Ile Glu Lys
Lys Thr Thr Val Ala Gly 965 970 975 Gln Val Leu Gly Thr Gly Glu Gln
Trp Leu Thr Glu Leu Ser Asn Asn 980 985 990 Asp Leu Arg Lys Leu Ile
Met Leu Gly Gln Glu Ala Met Gly Glu 995 1000 1005
673189DNAProchlorococcus marinus 67atgactctgc tgcacgccac ttggatttca
actaattggc atccatctaa tttaggtcaa 60tcagaattgt tcctttgggc agaccaatgg
cgcgtagtaa ctccaaaaca aataatacaa 120acaccttcac ctcacccgtt
tagcctatct tcagatgaat taaaagaatg gctcaatagc 180aaaaaattat
tgcctaatga gagtattaat acatctgcat gtctcactct tcctagtaaa
240cccattcaca aaaaaaataa ccaaaaatct aagaatcaaa aaactggtat
tgaatctgaa 300tggaagggac tccctttaca agctcatgaa gaaatagcaa
cacaatatga atgttggcca 360tggaaagtag atggaatttc actcactact
gtcgaagcaa cagaatggct tacaaaatta 420cctttatcaa aaaaagattc
tgatcttagt gaagaattac tttggtgggc tcatttagag 480cgttggtctc
ttaatctaat tgcgagtgga ctatggctac ctcaagttaa attacacaag
540aaagaaggaa atgaatatcg tgcatcatgg atacctctgc tgaatcaaga
aaatgaaaga 600aatcgcttag aagagtttgc aaaaaatatt cccttggtcg
ctatttgtgc agtcccatgg 660atagaagcta aaggacaaat agtcaatact
gagcaagtct caaattcaaa caataataca 720ctctctttat ataggccaag
acacaatcgc gtagaagtga tggatcttct cgaagaactt 780attgatgcac
aacttcgaaa agattttcaa ccaagaacta aaaacttgga tccattgtta
840aaagcgtggc aagaagcact tggcacgaaa gatggaataa ttaacctatc
gaatgaaaac 900gctaaaagat tagaaaaagc aagtaagaat
tggaaaagag ggttgtctag taatgttcaa 960cctgcgaaaa catgtctaga
gctaattgca ccgattgatg atctagattt atgggactta 1020aacttttcat
tgcaatcaga atcagatccg agtatcagac tagctgcaga tcaaatttgg
1080gaagcaggcg tagaagtaac caaagttggc ggaataacaa ttgacaaccc
aagtgaaatt 1140cttttagaag gcctaggaag aagtcttgaa attttccctc
caattgaaaa aggactagaa 1200agcccaactc ctcacacaat gaaactgtct
gcatcagaag catttgtact tattagaaca 1260gcagcagcaa aacttcgtga
catgggtatt ggtgtaatac tgcctaatag tttgtccaaa 1320ggatttgcaa
gtcgacttgg tcttgctatt caagccgaat taccagagtc ttcactaggc
1380gtaatgctag gagaaagttt gaactgggat tgggagttaa tgatcggagg
tataaattta 1440agcatgaaag aactagaaat gcttgcaaaa aaaaatagtc
ctctactcaa tcacaaaggg 1500acatggatcg aattacgtcc taatgatctg
aaaaatgctt caaaattttt tgctaatact 1560ccagaattaa acctcgataa
agcattaagg cttagtgcta ataaaggcaa cacttttatg 1620aaacttccag
tacatcattt tgaatctgga ccaagattac aaagtgtctt agagcaatat
1680caccatcaga aagcgcctga acctttacca gcacctaatg gattccatgg
gcaattaagg 1740ccttaccaag aaagaggtct tgggtggctt gcatttcttt
atcgttttaa gcaaggagca 1800tgcttagcag atgacatggg gcttggtaaa
actattcaat tattatgttt tattcagcac 1860ctaaaagttc aaaacgagct
tactaagcct gtactcctaa ttgcgcctac atctgtgctg 1920acaaattgga
aaagagaggc tgccactttt actccagaac tatgtataca tgaacactat
1980ggtagtaaga gacattcttc aataccaaaa ttacaaaatt atctaaaaaa
agttgacatt 2040atgatcacaa gttatgggtt actttatcga gatggcgagc
tgctacaaga aatcgactgg 2100caaggaatag ttattgatga agctcaagct
attaaaaatt ccaaatcaaa gcaaagtatt 2160ataactagag caataagcaa
aaatctcata agtaatccct ttagaattgc tttaacagga 2220acgccagtag
aaaatcgtat tagtgaacta tgggcactaa tggatttcct taatccaaaa
2280gtattaggtg aagaagattt ttttaatcag cgatacaagt taccgattga
gcattatggc 2340gacatctctt cattaaaaga tctcaaaaca caggtcagtc
cttttatttt aagaagattg 2400aaaaccgatc aatctattat ttctgatttg
cctcaaaaga ttgaattaaa tgagtgggtt 2460ggactaagcc aagagcaaga
gcttctatat aaacaaacgg tagagaaaag cttagatgaa 2520ctcgcctcat
tacccattgg tcaacgccag ggtaaaacat tgggtctact tactcgtctt
2580aaacaaattt gtaatcatcc agcaattgct ttaaaagaaa ctcaagtcga
gaagaatttc 2640ttattaagat cttcaaaatt acaaagactg gaagaaatac
tacaagaagt gaaagaatct 2700catgatagag ctctgctctt tactcaattt
gctgaatggg ggcatttatt gcaagcgtac 2760ttacaaacaa aatgggaatc
agaagtacct ttcctacacg gaggcactcc taaagggaag 2820cgacaagaaa
tgatagatcg ttttcaagat gatcctagag ggccaaatat ctttttactt
2880tcactaaaag caggaggagt gggtcttaat ctaactcgtg cgaatcatgt
ttttcatatt 2940gatcgttggt ggaatccagc agtagaaaat caagcaacag
atcgtgcata ccgaattggt 3000caaaaaaaaa gtgttatcgt ccataagttt
ataaccaccg gcacaatcga agaaaaaatc 3060aatcaaatga ttctcgaaaa
gactgaacta gcagaaaata ttgtcggatc aggagaaagc 3120tggttagggc
aattaagtct tgaaaaattg agtgaattag ttgctttaga tagcaatcca
3180gaattctaa 3189681062PRTProchlorococcus marinus 68Met Thr Leu
Leu His Ala Thr Trp Ile Ser Thr Asn Trp His Pro Ser 1 5 10 15 Asn
Leu Gly Gln Ser Glu Leu Phe Leu Trp Ala Asp Gln Trp Arg Val 20 25
30 Val Thr Pro Lys Gln Ile Ile Gln Thr Pro Ser Pro His Pro Phe Ser
35 40 45 Leu Ser Ser Asp Glu Leu Lys Glu Trp Leu Asn Ser Lys Lys
Leu Leu 50 55 60 Pro Asn Glu Ser Ile Asn Thr Ser Ala Cys Leu Thr
Leu Pro Ser Lys 65 70 75 80 Pro Ile His Lys Lys Asn Asn Gln Lys Ser
Lys Asn Gln Lys Thr Gly 85 90 95 Ile Glu Ser Glu Trp Lys Gly Leu
Pro Leu Gln Ala His Glu Glu Ile 100 105 110 Ala Thr Gln Tyr Glu Cys
Trp Pro Trp Lys Val Asp Gly Ile Ser Leu 115 120 125 Thr Thr Val Glu
Ala Thr Glu Trp Leu Thr Lys Leu Pro Leu Ser Lys 130 135 140 Lys Asp
Ser Asp Leu Ser Glu Glu Leu Leu Trp Trp Ala His Leu Glu 145 150 155
160 Arg Trp Ser Leu Asn Leu Ile Ala Ser Gly Leu Trp Leu Pro Gln Val
165 170 175 Lys Leu His Lys Lys Glu Gly Asn Glu Tyr Arg Ala Ser Trp
Ile Pro 180 185 190 Leu Leu Asn Gln Glu Asn Glu Arg Asn Arg Leu Glu
Glu Phe Ala Lys 195 200 205 Asn Ile Pro Leu Val Ala Ile Cys Ala Val
Pro Trp Ile Glu Ala Lys 210 215 220 Gly Gln Ile Val Asn Thr Glu Gln
Val Ser Asn Ser Asn Asn Asn Thr 225 230 235 240 Leu Ser Leu Tyr Arg
Pro Arg His Asn Arg Val Glu Val Met Asp Leu 245 250 255 Leu Glu Glu
Leu Ile Asp Ala Gln Leu Arg Lys Asp Phe Gln Pro Arg 260 265 270 Thr
Lys Asn Leu Asp Pro Leu Leu Lys Ala Trp Gln Glu Ala Leu Gly 275 280
285 Thr Lys Asp Gly Ile Ile Asn Leu Ser Asn Glu Asn Ala Lys Arg Leu
290 295 300 Glu Lys Ala Ser Lys Asn Trp Lys Arg Gly Leu Ser Ser Asn
Val Gln 305 310 315 320 Pro Ala Lys Thr Cys Leu Glu Leu Ile Ala Pro
Ile Asp Asp Leu Asp 325 330 335 Leu Trp Asp Leu Asn Phe Ser Leu Gln
Ser Glu Ser Asp Pro Ser Ile 340 345 350 Arg Leu Ala Ala Asp Gln Ile
Trp Glu Ala Gly Val Glu Val Thr Lys 355 360 365 Val Gly Gly Ile Thr
Ile Asp Asn Pro Ser Glu Ile Leu Leu Glu Gly 370 375 380 Leu Gly Arg
Ser Leu Glu Ile Phe Pro Pro Ile Glu Lys Gly Leu Glu 385 390 395 400
Ser Pro Thr Pro His Thr Met Lys Leu Ser Ala Ser Glu Ala Phe Val 405
410 415 Leu Ile Arg Thr Ala Ala Ala Lys Leu Arg Asp Met Gly Ile Gly
Val 420 425 430 Ile Leu Pro Asn Ser Leu Ser Lys Gly Phe Ala Ser Arg
Leu Gly Leu 435 440 445 Ala Ile Gln Ala Glu Leu Pro Glu Ser Ser Leu
Gly Val Met Leu Gly 450 455 460 Glu Ser Leu Asn Trp Asp Trp Glu Leu
Met Ile Gly Gly Ile Asn Leu 465 470 475 480 Ser Met Lys Glu Leu Glu
Met Leu Ala Lys Lys Asn Ser Pro Leu Leu 485 490 495 Asn His Lys Gly
Thr Trp Ile Glu Leu Arg Pro Asn Asp Leu Lys Asn 500 505 510 Ala Ser
Lys Phe Phe Ala Asn Thr Pro Glu Leu Asn Leu Asp Lys Ala 515 520 525
Leu Arg Leu Ser Ala Asn Lys Gly Asn Thr Phe Met Lys Leu Pro Val 530
535 540 His His Phe Glu Ser Gly Pro Arg Leu Gln Ser Val Leu Glu Gln
Tyr 545 550 555 560 His His Gln Lys Ala Pro Glu Pro Leu Pro Ala Pro
Asn Gly Phe His 565 570 575 Gly Gln Leu Arg Pro Tyr Gln Glu Arg Gly
Leu Gly Trp Leu Ala Phe 580 585 590 Leu Tyr Arg Phe Lys Gln Gly Ala
Cys Leu Ala Asp Asp Met Gly Leu 595 600 605 Gly Lys Thr Ile Gln Leu
Leu Cys Phe Ile Gln His Leu Lys Val Gln 610 615 620 Asn Glu Leu Thr
Lys Pro Val Leu Leu Ile Ala Pro Thr Ser Val Leu 625 630 635 640 Thr
Asn Trp Lys Arg Glu Ala Ala Thr Phe Thr Pro Glu Leu Cys Ile 645 650
655 His Glu His Tyr Gly Ser Lys Arg His Ser Ser Ile Pro Lys Leu Gln
660 665 670 Asn Tyr Leu Lys Lys Val Asp Ile Met Ile Thr Ser Tyr Gly
Leu Leu 675 680 685 Tyr Arg Asp Gly Glu Leu Leu Gln Glu Ile Asp Trp
Gln Gly Ile Val 690 695 700 Ile Asp Glu Ala Gln Ala Ile Lys Asn Ser
Lys Ser Lys Gln Ser Ile 705 710 715 720 Ile Thr Arg Ala Ile Ser Lys
Asn Leu Ile Ser Asn Pro Phe Arg Ile 725 730 735 Ala Leu Thr Gly Thr
Pro Val Glu Asn Arg Ile Ser Glu Leu Trp Ala 740 745 750 Leu Met Asp
Phe Leu Asn Pro Lys Val Leu Gly Glu Glu Asp Phe Phe 755 760 765 Asn
Gln Arg Tyr Lys Leu Pro Ile Glu His Tyr Gly Asp Ile Ser Ser 770 775
780 Leu Lys Asp Leu Lys Thr Gln Val Ser Pro Phe Ile Leu Arg Arg Leu
785 790 795 800 Lys Thr Asp Gln Ser Ile Ile Ser Asp Leu Pro Gln Lys
Ile Glu Leu 805 810 815 Asn Glu Trp Val Gly Leu Ser Gln Glu Gln Glu
Leu Leu Tyr Lys Gln 820 825 830 Thr Val Glu Lys Ser Leu Asp Glu Leu
Ala Ser Leu Pro Ile Gly Gln 835 840 845 Arg Gln Gly Lys Thr Leu Gly
Leu Leu Thr Arg Leu Lys Gln Ile Cys 850 855 860 Asn His Pro Ala Ile
Ala Leu Lys Glu Thr Gln Val Glu Lys Asn Phe 865 870 875 880 Leu Leu
Arg Ser Ser Lys Leu Gln Arg Leu Glu Glu Ile Leu Gln Glu 885 890 895
Val Lys Glu Ser His Asp Arg Ala Leu Leu Phe Thr Gln Phe Ala Glu 900
905 910 Trp Gly His Leu Leu Gln Ala Tyr Leu Gln Thr Lys Trp Glu Ser
Glu 915 920 925 Val Pro Phe Leu His Gly Gly Thr Pro Lys Gly Lys Arg
Gln Glu Met 930 935 940 Ile Asp Arg Phe Gln Asp Asp Pro Arg Gly Pro
Asn Ile Phe Leu Leu 945 950 955 960 Ser Leu Lys Ala Gly Gly Val Gly
Leu Asn Leu Thr Arg Ala Asn His 965 970 975 Val Phe His Ile Asp Arg
Trp Trp Asn Pro Ala Val Glu Asn Gln Ala 980 985 990 Thr Asp Arg Ala
Tyr Arg Ile Gly Gln Lys Lys Ser Val Ile Val His 995 1000 1005 Lys
Phe Ile Thr Thr Gly Thr Ile Glu Glu Lys Ile Asn Gln Met 1010 1015
1020 Ile Leu Glu Lys Thr Glu Leu Ala Glu Asn Ile Val Gly Ser Gly
1025 1030 1035 Glu Ser Trp Leu Gly Gln Leu Ser Leu Glu Lys Leu Ser
Glu Leu 1040 1045 1050 Val Ala Leu Asp Ser Asn Pro Glu Phe 1055
1060 693204DNAProchlorococcus marinus 69atgagtctgc tacacgctac
ttggctgcca gcaatgcgaa ccggaagttc gcataatcca 60ggactactca tctgggctga
ttcatggaga gttgcaaaac caagcatagt cagcaatcag 120cctgtaatac
atccatttgc cttatcagca gcagatttac gtatttggct attgcaaaaa
180aagcttttac ctaaagaaag tattgaatgt acagccttat taactctacc
tagtaaatct 240attaaaaact cattagacaa aaaattaaat ggagtaacgg
actcacaaaa tactagcgat 300caacctcaat ggagtggact acctttacaa
gcaggagagc cagtaactaa acaatgtgaa 360tggtggccct ggcaagttga
aggtatagca atcaaaccca gtgaagctgc atcgtggctt 420gcaaacttac
ctctcacgaa aaaagatcct gagcttagtg aagagatcct atggtggagt
480catttagaac gttggtctct aagtttaatt gctcgtggcc tttggttgcc
acaagttgaa 540ttaaatacaa ttgataatat tggagctaga gctaggtgga
gtcctttact taataacgaa 600aacgagcgca aaagattaga agaattctct
atcaggcttc cattagtagc aacatgtgcc 660ataaaaagag aggaaacttc
tgaagaaaat caaaaccata tattaaagac tactcctagg 720gaaacactcg
atgaatacgg acttgcagta tgtcgaccaa tcaatagtcg acttcaagtg
780gcttatctct tagaagaact cgtggatgga cagctaagaa aagattttga
ggaaagttct 840gaagaccttg atccattgct gaaagcttgg caagaggcat
taggatcaca taatggagtc 900attcgtcttc cgttggaaga ttgtgaaaga
ttagccaagg caagtaaaaa ttggaaagaa 960aatttatcag gcaatgttaa
aggtgcaaga gcatgccttg agctttttgc accacttgaa 1020ggagaagatt
tatgggactt acaattctct ttacaagctg aagcagatcc atcactaaag
1080gtagcagcag aagcagtatg gaatgcagac tcagcagttc tacagattgg
tgatattcaa 1140atagcgcagc ctggagaaat tctactagaa ggtcttggca
gagcactcaa tatctttcaa 1200ccaatagaaa ggggtctgga aaatgctact
ccaaataata tgcaactcac acctgcagaa 1260gcttttgttc tagtacgtac
agcctcaaag caattacgtg atattggtat tggtgtaata 1320ctacctagaa
gtttatcagg aggattagca agtcgactag gtatagctat taaagcagag
1380ttagcgacta gtgccagagg attaacactt cgagagaatc tagaatggag
ttgggagcta 1440atgatagggg gaagcatatt aagccttaaa gatctagaac
aactggcaag taaacgcagc 1500cctctagttc gctataagga ttcatggctt
gaattacgtc caaatgatct taaaatcgcc 1560gaaaaattct gtagcaataa
tcctgaatta agcctagatg acgcattaag acttaccgca 1620actaaagggg
agactctaat gaagcttcca gtacatcaat ttaatgctgg gccaaagctc
1680caaggcgttt tagagcaata ccaccaacat acaagtcctg agcctctagc
tgcaccagat 1740ggcttctatg gacaactgag gccttatcaa gaacgtggca
taggatggtt ggctttcttg 1800catcgtttta atcaaggtgc atgtttagca
gatgacatgg gcctgggcaa aacaattcaa 1860gtgcttgctt ttattcagca
cttaaaaagt aacaaggacc tcaagaaacc tgttttgcta 1920attgcaccta
cgtcagtatt aacaaactgg aaacgagaag cttattcatt tacaccagag
1980ttatctgtat tagagcatta cggtcctaat cgttcatcta catcaacact
cttgaaaaag 2040attctcaaaa aagtagacat tcttattact agctatggcc
tactacatag agataaacag 2100cttctgaaaa caattgattg gcaaggtgta
attattgatg aagcacaagc tataaaaaat 2160ccaaattcaa aacaaagtca
aacaactcgt gaaattgtta aaggcggaaa aataatccct 2220tttcgtattg
cattaactgg tacccctata gaaaatcgtg taagtgagct ttggtcatta
2280atggattttt taaatccatc agtacttgga gaaaaagaat tttttgatca
acgctacaaa 2340ttaccgattg aacgttatgg tgatatttct tcgttaaccg
atctcaaagc tcgtgtcagt 2400ccctttattc ttagaaggtt aaaaagtgat
aaatcaatta tctcggatct accaagcaaa 2460gtcgaactaa aagaatggat
tactcttagt caagagcaaa gagctcttta taacaaaact 2520gtagacaata
ccttacagga aatcgcaaga agtcctattg gtcagcgtca tgcgaaaacc
2580ttaggtctat taacacgtct caaacaaata tgtaatcatc ctgctcttgc
cctcaaagaa 2640aaaaacatta gcgatgattt tggaatacga tcaaccaaac
ttcaaaggct ggaagaactt 2700cttgatgtga tattcgcaac agaggacaga
gctcttcttt ttacccaatt cgctgaatgg 2760ggtcacttac tacaagctta
tctagaaaaa aagtggggac atagcatact ttttctacat 2820ggaggaactc
gcaaaataga tagacaatca atggttgatc aatttcaaga agatcccaga
2880ggcccaaaat tatttttact ttctctcaaa gcaggtggta ttggtctgaa
cctgactcga 2940gctaaccacg tgttgcatat tgatcgatgg tggaaccctg
ccgtagaaaa tcaggcaaca 3000gatcgtgctt atagaattgg tcaaaaaaat
agcgtaatgg ttcacaaatt tattgctaca 3060gggtcagtag aagaaaaaat
tgatcaaatg attactgaaa agtctaagct cgcagaaaat 3120ataattggtg
caggtgaaga ttggcttggc aaacttggca tcaatgaatt acgtgaatta
3180gtttccttag aaaaagagag ttaa 3204701067PRTProchlorococcus marinus
70Met Ser Leu Leu His Ala Thr Trp Leu Pro Ala Met Arg Thr Gly Ser 1
5 10 15 Ser His Asn Pro Gly Leu Leu Ile Trp Ala Asp Ser Trp Arg Val
Ala 20 25 30 Lys Pro Ser Ile Val Ser Asn Gln Pro Val Ile His Pro
Phe Ala Leu 35 40 45 Ser Ala Ala Asp Leu Arg Ile Trp Leu Leu Gln
Lys Lys Leu Leu Pro 50 55 60 Lys Glu Ser Ile Glu Cys Thr Ala Leu
Leu Thr Leu Pro Ser Lys Ser 65 70 75 80 Ile Lys Asn Ser Leu Asp Lys
Lys Leu Asn Gly Val Thr Asp Ser Gln 85 90 95 Asn Thr Ser Asp Gln
Pro Gln Trp Ser Gly Leu Pro Leu Gln Ala Gly 100 105 110 Glu Pro Val
Thr Lys Gln Cys Glu Trp Trp Pro Trp Gln Val Glu Gly 115 120 125 Ile
Ala Ile Lys Pro Ser Glu Ala Ala Ser Trp Leu Ala Asn Leu Pro 130 135
140 Leu Thr Lys Lys Asp Pro Glu Leu Ser Glu Glu Ile Leu Trp Trp Ser
145 150 155 160 His Leu Glu Arg Trp Ser Leu Ser Leu Ile Ala Arg Gly
Leu Trp Leu 165 170 175 Pro Gln Val Glu Leu Asn Thr Ile Asp Asn Ile
Gly Ala Arg Ala Arg 180 185 190 Trp Ser Pro Leu Leu Asn Asn Glu Asn
Glu Arg Lys Arg Leu Glu Glu 195 200 205 Phe Ser Ile Arg Leu Pro Leu
Val Ala Thr Cys Ala Ile Lys Arg Glu 210 215 220 Glu Thr Ser Glu Glu
Asn Gln Asn His Ile Leu Lys Thr Thr Pro Arg 225 230 235 240 Glu Thr
Leu Asp Glu Tyr Gly Leu Ala Val Cys Arg Pro Ile Asn Ser 245 250 255
Arg Leu Gln Val Ala Tyr Leu Leu Glu Glu Leu Val Asp Gly Gln Leu 260
265 270 Arg Lys Asp Phe Glu Glu Ser Ser Glu Asp Leu Asp Pro Leu Leu
Lys 275 280 285 Ala Trp Gln Glu Ala Leu Gly Ser His Asn Gly Val Ile
Arg Leu Pro 290 295 300 Leu Glu Asp Cys Glu Arg Leu Ala Lys Ala Ser
Lys Asn Trp Lys Glu 305 310 315 320 Asn Leu Ser Gly Asn Val Lys Gly
Ala Arg Ala Cys Leu Glu Leu Phe 325 330 335 Ala Pro Leu Glu Gly Glu
Asp Leu Trp Asp Leu Gln Phe Ser Leu Gln 340 345 350 Ala Glu Ala Asp
Pro Ser Leu Lys Val Ala Ala Glu Ala Val Trp Asn 355 360 365 Ala Asp
Ser Ala Val Leu Gln Ile Gly Asp Ile Gln Ile Ala Gln Pro 370
375 380 Gly Glu Ile Leu Leu Glu Gly Leu Gly Arg Ala Leu Asn Ile Phe
Gln 385 390 395 400 Pro Ile Glu Arg Gly Leu Glu Asn Ala Thr Pro Asn
Asn Met Gln Leu 405 410 415 Thr Pro Ala Glu Ala Phe Val Leu Val Arg
Thr Ala Ser Lys Gln Leu 420 425 430 Arg Asp Ile Gly Ile Gly Val Ile
Leu Pro Arg Ser Leu Ser Gly Gly 435 440 445 Leu Ala Ser Arg Leu Gly
Ile Ala Ile Lys Ala Glu Leu Ala Thr Ser 450 455 460 Ala Arg Gly Leu
Thr Leu Arg Glu Asn Leu Glu Trp Ser Trp Glu Leu 465 470 475 480 Met
Ile Gly Gly Ser Ile Leu Ser Leu Lys Asp Leu Glu Gln Leu Ala 485 490
495 Ser Lys Arg Ser Pro Leu Val Arg Tyr Lys Asp Ser Trp Leu Glu Leu
500 505 510 Arg Pro Asn Asp Leu Lys Ile Ala Glu Lys Phe Cys Ser Asn
Asn Pro 515 520 525 Glu Leu Ser Leu Asp Asp Ala Leu Arg Leu Thr Ala
Thr Lys Gly Glu 530 535 540 Thr Leu Met Lys Leu Pro Val His Gln Phe
Asn Ala Gly Pro Lys Leu 545 550 555 560 Gln Gly Val Leu Glu Gln Tyr
His Gln His Thr Ser Pro Glu Pro Leu 565 570 575 Ala Ala Pro Asp Gly
Phe Tyr Gly Gln Leu Arg Pro Tyr Gln Glu Arg 580 585 590 Gly Ile Gly
Trp Leu Ala Phe Leu His Arg Phe Asn Gln Gly Ala Cys 595 600 605 Leu
Ala Asp Asp Met Gly Leu Gly Lys Thr Ile Gln Val Leu Ala Phe 610 615
620 Ile Gln His Leu Lys Ser Asn Lys Asp Leu Lys Lys Pro Val Leu Leu
625 630 635 640 Ile Ala Pro Thr Ser Val Leu Thr Asn Trp Lys Arg Glu
Ala Tyr Ser 645 650 655 Phe Thr Pro Glu Leu Ser Val Leu Glu His Tyr
Gly Pro Asn Arg Ser 660 665 670 Ser Thr Ser Thr Leu Leu Lys Lys Ile
Leu Lys Lys Val Asp Ile Leu 675 680 685 Ile Thr Ser Tyr Gly Leu Leu
His Arg Asp Lys Gln Leu Leu Lys Thr 690 695 700 Ile Asp Trp Gln Gly
Val Ile Ile Asp Glu Ala Gln Ala Ile Lys Asn 705 710 715 720 Pro Asn
Ser Lys Gln Ser Gln Thr Thr Arg Glu Ile Val Lys Gly Gly 725 730 735
Lys Ile Ile Pro Phe Arg Ile Ala Leu Thr Gly Thr Pro Ile Glu Asn 740
745 750 Arg Val Ser Glu Leu Trp Ser Leu Met Asp Phe Leu Asn Pro Ser
Val 755 760 765 Leu Gly Glu Lys Glu Phe Phe Asp Gln Arg Tyr Lys Leu
Pro Ile Glu 770 775 780 Arg Tyr Gly Asp Ile Ser Ser Leu Thr Asp Leu
Lys Ala Arg Val Ser 785 790 795 800 Pro Phe Ile Leu Arg Arg Leu Lys
Ser Asp Lys Ser Ile Ile Ser Asp 805 810 815 Leu Pro Ser Lys Val Glu
Leu Lys Glu Trp Ile Thr Leu Ser Gln Glu 820 825 830 Gln Arg Ala Leu
Tyr Asn Lys Thr Val Asp Asn Thr Leu Gln Glu Ile 835 840 845 Ala Arg
Ser Pro Ile Gly Gln Arg His Ala Lys Thr Leu Gly Leu Leu 850 855 860
Thr Arg Leu Lys Gln Ile Cys Asn His Pro Ala Leu Ala Leu Lys Glu 865
870 875 880 Lys Asn Ile Ser Asp Asp Phe Gly Ile Arg Ser Thr Lys Leu
Gln Arg 885 890 895 Leu Glu Glu Leu Leu Asp Val Ile Phe Ala Thr Glu
Asp Arg Ala Leu 900 905 910 Leu Phe Thr Gln Phe Ala Glu Trp Gly His
Leu Leu Gln Ala Tyr Leu 915 920 925 Glu Lys Lys Trp Gly His Ser Ile
Leu Phe Leu His Gly Gly Thr Arg 930 935 940 Lys Ile Asp Arg Gln Ser
Met Val Asp Gln Phe Gln Glu Asp Pro Arg 945 950 955 960 Gly Pro Lys
Leu Phe Leu Leu Ser Leu Lys Ala Gly Gly Ile Gly Leu 965 970 975 Asn
Leu Thr Arg Ala Asn His Val Leu His Ile Asp Arg Trp Trp Asn 980 985
990 Pro Ala Val Glu Asn Gln Ala Thr Asp Arg Ala Tyr Arg Ile Gly Gln
995 1000 1005 Lys Asn Ser Val Met Val His Lys Phe Ile Ala Thr Gly
Ser Val 1010 1015 1020 Glu Glu Lys Ile Asp Gln Met Ile Thr Glu Lys
Ser Lys Leu Ala 1025 1030 1035 Glu Asn Ile Ile Gly Ala Gly Glu Asp
Trp Leu Gly Lys Leu Gly 1040 1045 1050 Ile Asn Glu Leu Arg Glu Leu
Val Ser Leu Glu Lys Glu Ser 1055 1060 1065 713300DNAProchlorococcus
marinus 71atgattggtt gtggaactcc tgcgtggatg gttgccgttg atcggcagtg
cactcctgct 60ccaagaaacc caacacatac tttttgcgtc gcggccatga gcctgctgca
cgccacctgg 120cttccagcca tccgtactcc gaccagctcc ggtcgccctg
cgctccttgt gtgggcagat 180acctggcgag tcgctacccc agcaggacca
gcagcaactc ccgcactcca ccccttcaca 240ctcaacccag acgatctacg
tgcctggctg attgagcgcg atctactgcc cgatgaaatc 300atcgacgcca
cagcatgtct gaccctgcct agccgaacag tcaaaccgcg cagcaaagcc
360aagaacgtat ccactgaatc cgacgaagac aaagaccaca aaacaagttg
gacaggactg 420cccttacaag caggcgaacc cattcccaaa cagactgaat
ggtggccctg gcaggtgcaa 480ggcctggcag tggagcctgc tgctgcaacg
gcctggcttt cgaaactgcc tctttcagga 540gatcatcctg atctcgccga
tgaattgcgc tggtggagcc atctacagcg ctgggccctg 600agcatgattg
ctcgcggacg ttggctaccc caggtggaac tcagcaaggg agagggctat
660ccccaccgag cacgctggac accgctactc aaccgtgaag atgatcgccg
ccgcctcgaa 720gaccttgccg ctcagctccc cttagtggcc acctgcgccc
tcccctggcg ggagcccacc 780ggaaggcgta gcaaccgaat gacccgccta
agaccagagg cgatgcgagc cgctaaccct 840gtggcttcat gccgaccccg
cagcggtcgc cttcgcgtag ccagcctgct ggaagaactc 900ttggatgccc
aactgcgcac cggatttgaa gcgagtgagc aaggcctaga cccattgctc
960acagcctggc aggaagcact ggggtcggac agcggcgtga tcaacctccc
cgatgaggaa 1020gccgaacgtc tagcgacagc aagcaaccat tggcgagaag
gcgtggctgg caacgtcgca 1080ccagccaggg cctgcttaga actcttcact
cccggcgaag gggaagacct ctgggagctg 1140cgcttcgcct tacaggctga
ggctgatccc acgatcaaag taccggccgc agcagcctgg 1200gcagcgggtc
ccaaggtcct gcaactaggc gaaatccgtg tggaacatcc aggcgaggtg
1260ctactggaag gcatggggcg agccctcacg gtgtttgcac cgatcgaacg
aggcctcgac 1320agcgccacac cagaagcaat gcagctcacc cctgctgaag
cctttgtatt ggtgcgcact 1380gcagcggccc aactgcgtga tgttggcgtt
ggcgtggaat tgcctgccag cctctcggga 1440gggctggcca gtcgcctagg
cctagcgatc aaggcggagc tatcggagag atctagaggt 1500ttcactttgg
gcgaaaccct cgactggagt tgggagctca tgatcggtgg cgtcaccctg
1560acgcttcgcg agctggagcg actagcaagc aagcgcagcc cgcttgtcaa
ccacaagggc 1620gcctggatcg aattacgccc caacgatctc aaaaatgcgg
aacacttctg cagcgtcaat 1680ccaggcatca gcctcgacga tgccttgcgc
cttaccgcaa ccgatggcga cacgctgatg 1740agactgcccg ttcaccgctt
tgaggccggt ccacgactac aggcggtgtt ggagcagtac 1800caccagcaaa
aagctcccga ccccctacct gctcccgaag gcttctgcgg tcagctaagg
1860ccttatcagg aaaggggtct gggttggctg gccttcctgc atcgcttcga
tcaaggggca 1920tgcctggccg acgacatggg cctgggcaaa acgatccagc
tactggcatt cctgcaacat 1980ctcaaggcgg aacaggaact caaacggccg
gtattgctta tcgctcccac atccgtactt 2040accaactgga agagagaggc
attggccttc acaccagagt taaacgtccg agaacactat 2100gggccgcgtc
ggccctctac ccccgccgcc ttaaagaaag cactcaaagg cttagacctc
2160gttctcacca gttacgggct cctgcagcga gatagtgagc tcctggaaac
ggtcgactgg 2220caaggagtgg tcatcgatga agcccaagcc attaagaacc
ccaacgccaa acagagccaa 2280gcagcacgcg atatgggccg cccagacaaa
aacaatcgct tcaggattgc tcttaccggc 2340acacccgtcg aaaaccgagt
cagtgaactt tgggcactga tggacttcct caacccaagg 2400gttctcggtg
aagaagactt cttccgccag cgctaccggc tgccaattga acgctatggc
2460gacatgtctt ccctgcgaga cctcaaaggc cgtgttggtc ccttcatcct
gagacgacta 2520aaaaccgaca aggcaatcat ctccgaccta cctgaaaagg
tagagctgag cgaatgggtg 2580ggtctgagca aagaacaggc agccctctat
cgcaacacag tggatgaaac actggaggcc 2640attgcccgcg cacccagtgg
tcaacgtcat ggcaaggtgc tcggcttgct tacccgactg 2700aagcaaatct
gcaaccatcc cgccctagcc ctcaaagaaa aaaccgttgc aaaaggcttc
2760atggaccgct ccgccaagct gctgcgtttg gaagaaattc tcgaggaagt
gatcgaggca 2820ggagatcgcg ctctgttatt cacccaattc gcagaatggg
gtcatctcct taaggcctac 2880ctgcaacaac gctggcgctt tgaagttccc
ttcctgcacg gcagcacaag caaaactgaa 2940cgtcaggcca tggttgatcg
cttccaggag gatccacgtg gaccccaact gttcctgctg 3000tcactcaaag
ccggtggcgt aggcctaaac ctcacgcggg ctagccatgt gtttcatgtc
3060gatcgctggt ggaatcctgc cgtagaaaac caggccactg atcgcgctta
caggatcgga 3120caaaccaatc gggtgatggt gcacaaattc atcaccagcg
gctcagttga agagaaaatt 3180gatcgcatga ttcgcgaaaa atctcgactt
gccgaagaca tcattggctc tggagaagac 3240tggttaggtg gcttaggcgt
cagtcaattg cgcgaactag tggccctaga agacagctga
3300721099PRTProchlorococcus marinus 72Met Ile Gly Cys Gly Thr Pro
Ala Trp Met Val Ala Val Asp Arg Gln 1 5 10 15 Cys Thr Pro Ala Pro
Arg Asn Pro Thr His Thr Phe Cys Val Ala Ala 20 25 30 Met Ser Leu
Leu His Ala Thr Trp Leu Pro Ala Ile Arg Thr Pro Thr 35 40 45 Ser
Ser Gly Arg Pro Ala Leu Leu Val Trp Ala Asp Thr Trp Arg Val 50 55
60 Ala Thr Pro Ala Gly Pro Ala Ala Thr Pro Ala Leu His Pro Phe Thr
65 70 75 80 Leu Asn Pro Asp Asp Leu Arg Ala Trp Leu Ile Glu Arg Asp
Leu Leu 85 90 95 Pro Asp Glu Ile Ile Asp Ala Thr Ala Cys Leu Thr
Leu Pro Ser Arg 100 105 110 Thr Val Lys Pro Arg Ser Lys Ala Lys Asn
Val Ser Thr Glu Ser Asp 115 120 125 Glu Asp Lys Asp His Lys Thr Ser
Trp Thr Gly Leu Pro Leu Gln Ala 130 135 140 Gly Glu Pro Ile Pro Lys
Gln Thr Glu Trp Trp Pro Trp Gln Val Gln 145 150 155 160 Gly Leu Ala
Val Glu Pro Ala Ala Ala Thr Ala Trp Leu Ser Lys Leu 165 170 175 Pro
Leu Ser Gly Asp His Pro Asp Leu Ala Asp Glu Leu Arg Trp Trp 180 185
190 Ser His Leu Gln Arg Trp Ala Leu Ser Met Ile Ala Arg Gly Arg Trp
195 200 205 Leu Pro Gln Val Glu Leu Ser Lys Gly Glu Gly Tyr Pro His
Arg Ala 210 215 220 Arg Trp Thr Pro Leu Leu Asn Arg Glu Asp Asp Arg
Arg Arg Leu Glu 225 230 235 240 Asp Leu Ala Ala Gln Leu Pro Leu Val
Ala Thr Cys Ala Leu Pro Trp 245 250 255 Arg Glu Pro Thr Gly Arg Arg
Ser Asn Arg Met Thr Arg Leu Arg Pro 260 265 270 Glu Ala Met Arg Ala
Ala Asn Pro Val Ala Ser Cys Arg Pro Arg Ser 275 280 285 Gly Arg Leu
Arg Val Ala Ser Leu Leu Glu Glu Leu Leu Asp Ala Gln 290 295 300 Leu
Arg Thr Gly Phe Glu Ala Ser Glu Gln Gly Leu Asp Pro Leu Leu 305 310
315 320 Thr Ala Trp Gln Glu Ala Leu Gly Ser Asp Ser Gly Val Ile Asn
Leu 325 330 335 Pro Asp Glu Glu Ala Glu Arg Leu Ala Thr Ala Ser Asn
His Trp Arg 340 345 350 Glu Gly Val Ala Gly Asn Val Ala Pro Ala Arg
Ala Cys Leu Glu Leu 355 360 365 Phe Thr Pro Gly Glu Gly Glu Asp Leu
Trp Glu Leu Arg Phe Ala Leu 370 375 380 Gln Ala Glu Ala Asp Pro Thr
Ile Lys Val Pro Ala Ala Ala Ala Trp 385 390 395 400 Ala Ala Gly Pro
Lys Val Leu Gln Leu Gly Glu Ile Arg Val Glu His 405 410 415 Pro Gly
Glu Val Leu Leu Glu Gly Met Gly Arg Ala Leu Thr Val Phe 420 425 430
Ala Pro Ile Glu Arg Gly Leu Asp Ser Ala Thr Pro Glu Ala Met Gln 435
440 445 Leu Thr Pro Ala Glu Ala Phe Val Leu Val Arg Thr Ala Ala Ala
Gln 450 455 460 Leu Arg Asp Val Gly Val Gly Val Glu Leu Pro Ala Ser
Leu Ser Gly 465 470 475 480 Gly Leu Ala Ser Arg Leu Gly Leu Ala Ile
Lys Ala Glu Leu Ser Glu 485 490 495 Arg Ser Arg Gly Phe Thr Leu Gly
Glu Thr Leu Asp Trp Ser Trp Glu 500 505 510 Leu Met Ile Gly Gly Val
Thr Leu Thr Leu Arg Glu Leu Glu Arg Leu 515 520 525 Ala Ser Lys Arg
Ser Pro Leu Val Asn His Lys Gly Ala Trp Ile Glu 530 535 540 Leu Arg
Pro Asn Asp Leu Lys Asn Ala Glu His Phe Cys Ser Val Asn 545 550 555
560 Pro Gly Ile Ser Leu Asp Asp Ala Leu Arg Leu Thr Ala Thr Asp Gly
565 570 575 Asp Thr Leu Met Arg Leu Pro Val His Arg Phe Glu Ala Gly
Pro Arg 580 585 590 Leu Gln Ala Val Leu Glu Gln Tyr His Gln Gln Lys
Ala Pro Asp Pro 595 600 605 Leu Pro Ala Pro Glu Gly Phe Cys Gly Gln
Leu Arg Pro Tyr Gln Glu 610 615 620 Arg Gly Leu Gly Trp Leu Ala Phe
Leu His Arg Phe Asp Gln Gly Ala 625 630 635 640 Cys Leu Ala Asp Asp
Met Gly Leu Gly Lys Thr Ile Gln Leu Leu Ala 645 650 655 Phe Leu Gln
His Leu Lys Ala Glu Gln Glu Leu Lys Arg Pro Val Leu 660 665 670 Leu
Ile Ala Pro Thr Ser Val Leu Thr Asn Trp Lys Arg Glu Ala Leu 675 680
685 Ala Phe Thr Pro Glu Leu Asn Val Arg Glu His Tyr Gly Pro Arg Arg
690 695 700 Pro Ser Thr Pro Ala Ala Leu Lys Lys Ala Leu Lys Gly Leu
Asp Leu 705 710 715 720 Val Leu Thr Ser Tyr Gly Leu Leu Gln Arg Asp
Ser Glu Leu Leu Glu 725 730 735 Thr Val Asp Trp Gln Gly Val Val Ile
Asp Glu Ala Gln Ala Ile Lys 740 745 750 Asn Pro Asn Ala Lys Gln Ser
Gln Ala Ala Arg Asp Met Gly Arg Pro 755 760 765 Asp Lys Asn Asn Arg
Phe Arg Ile Ala Leu Thr Gly Thr Pro Val Glu 770 775 780 Asn Arg Val
Ser Glu Leu Trp Ala Leu Met Asp Phe Leu Asn Pro Arg 785 790 795 800
Val Leu Gly Glu Glu Asp Phe Phe Arg Gln Arg Tyr Arg Leu Pro Ile 805
810 815 Glu Arg Tyr Gly Asp Met Ser Ser Leu Arg Asp Leu Lys Gly Arg
Val 820 825 830 Gly Pro Phe Ile Leu Arg Arg Leu Lys Thr Asp Lys Ala
Ile Ile Ser 835 840 845 Asp Leu Pro Glu Lys Val Glu Leu Ser Glu Trp
Val Gly Leu Ser Lys 850 855 860 Glu Gln Ala Ala Leu Tyr Arg Asn Thr
Val Asp Glu Thr Leu Glu Ala 865 870 875 880 Ile Ala Arg Ala Pro Ser
Gly Gln Arg His Gly Lys Val Leu Gly Leu 885 890 895 Leu Thr Arg Leu
Lys Gln Ile Cys Asn His Pro Ala Leu Ala Leu Lys 900 905 910 Glu Lys
Thr Val Ala Lys Gly Phe Met Asp Arg Ser Ala Lys Leu Leu 915 920 925
Arg Leu Glu Glu Ile Leu Glu Glu Val Ile Glu Ala Gly Asp Arg Ala 930
935 940 Leu Leu Phe Thr Gln Phe Ala Glu Trp Gly His Leu Leu Lys Ala
Tyr 945 950 955 960 Leu Gln Gln Arg Trp Arg Phe Glu Val Pro Phe Leu
His Gly Ser Thr 965 970 975 Ser Lys Thr Glu Arg Gln Ala Met Val Asp
Arg Phe Gln Glu Asp Pro 980 985 990 Arg Gly Pro Gln Leu Phe Leu Leu
Ser Leu Lys Ala Gly Gly Val Gly 995 1000 1005 Leu Asn Leu Thr Arg
Ala Ser His Val Phe His Val Asp Arg Trp 1010 1015 1020 Trp Asn Pro
Ala Val Glu Asn Gln Ala Thr Asp Arg Ala Tyr Arg 1025 1030 1035 Ile
Gly Gln Thr Asn Arg Val Met Val His Lys Phe Ile Thr Ser 1040 1045
1050 Gly Ser Val Glu Glu Lys Ile Asp Arg Met Ile Arg Glu Lys Ser
1055 1060 1065 Arg Leu Ala Glu Asp Ile Ile Gly Ser Gly Glu Asp Trp
Leu Gly 1070 1075 1080 Gly Leu Gly Val Ser Gln Leu Arg Glu Leu Val
Ala Leu Glu Asp 1085
1090 1095 Ser 733300DNAProchlorococcus marinus 73atgattggtt
gtggaactcc tgcgtggatg gttgccgttg atcggcagtg cactcctgct 60ccaagaaacc
caacacatac tttttgcgtc gcggccatga gcctgctgca cgccacctgg
120cttccagcca tccgtactcc gaccagctcc ggtcgccctg cgctccttgt
gtgggcagat 180acctggcgag tcgctacccc agcaggacca gcagcaactc
ccgcactcca ccccttcacc 240ctcagcccag acgatctacg tgcctggctc
attgagcgcg atctactgcc tgatgaaatc 300atcgacgcca cagcatgtct
gaccctgcct agccgaacag tcaaaccgcg caacaaaacc 360aagaacgtat
ccactgaatc cgacgaagcc aaagacaaca aaacaagttg gacaggactg
420cccttacaag caggcgaacc cattcccaaa caaacagaat ggtggccctg
gcaggtgcaa 480ggcctggcag tggaacctgc tgccgcaacg gcctggcttt
cgaaactgcc tctttcagga 540aatcatcctg atctggccga tgaattgcgc
tggtggagcc atctacagcg ctgggccctg 600agcatgattg ctcgcggacg
ttggctaccc caggtggaac tcagcaaggg agagggctat 660ccccaccgag
cacgctggac accgctactc aaccgtgaag atgatcgccg ccgcctcgaa
720gaccttgccg ctcagcttcc cttagtggcc acctgcgccc tcccctggcg
ggagcccacc 780ggaaggcgta gcaaccgaat gacccgccta agaccagagg
cgatgcgagc cgctaaccct 840gtggcttcat gccgaccccg cagcggtcgc
cttcgcgtag ccagcttgct ggaagaactc 900ttggatgccc aactgcgcac
cggatttgaa gcgagtgagc aaggcctaga cccattgctc 960acagcctggc
aggaagcact ggggtccgac agcggcgtga tcaacctccc cgatgaggaa
1020gccgaacgtc tagctacagc aagcaaccat tggcgtgaag gcgtggctgg
caacgtcgca 1080ccagccagag cctgcttaga actcttcact cccggagaag
gggaagacct ctgggagctg 1140cgcttctcct tacaggctga ggctgatccc
acaatcaaag taccggccgc agcagcctgg 1200gcagctggtc ccaaggtgtt
gcaactaggc gaaatccgtg tggaacatcc aggcgaggtg 1260ctactggaag
gcatggggcg agccctcacg gtgtttgcac cgatcgaacg aggcctcgac
1320agcgccacac cagaagcaat gcagctcacc cctgctgaag cctttgtatt
ggtgcgcact 1380gcagcgaccc aactgcgtga tgttggcgtt ggcgtggaat
tgcctgccag cctctcggga 1440gggctggcca gtcgcctagg cctagcgatc
aaggcggagc tatcggagag atctagaggt 1500ttcactctgg gcgaaaccct
cgactggagt tgggagctca tgatcggtgg cgtcaccctg 1560acgcttcgcg
aactggagcg actagcaagc aagcgcagcc cgcttgtcaa ccacaagggc
1620gcctggatcg aattacgccc caacgatctc aaacatgcgg aacacttctg
cagcgtcaat 1680ccaggcatca gcctcgacga tgccttgcgc cttaccgcaa
cagatggcga cacgctgatg 1740agactgcccg ttcaccgctt tgaggccggt
ccacgactac aggcggtgtt ggagcagtac 1800caccagcaaa aagcaccaga
ccccctacct gctcccgaag gcttctgcgg tcagctaagg 1860ccttatcagg
aaaggggtct gggttggctg gccttcctgc atcgcttcga tcaaggggca
1920tgcctggccg acgacatggg ccttggcaaa acgatccagc tactggcatt
cctgcaacat 1980ctcaaggcgg aacaggaact caaacggccg gtattgctta
tcgctcccac gtccgtactc 2040accaactgga agagagaggc gttggccttc
acaccagagt taaacgtccg cgaacactat 2100gggccgcgtc ggccctctac
ccccgccgcc ttaaagaaag cactcaaagg cttagacctc 2160gttctcacca
gttatgggct cctgcagcga gatagtgagc tcctggaaac ggtcgactgg
2220caaggcgtgg tcatcgatga agcccaagcc attaagaacc ccaacgccaa
acagagccaa 2280gcagcacgcg atatgggccg cccagacaaa aacaatcgct
tcaggattgc tcttaccggc 2340acacccgtcg aaaaccgagt aagtgaactt
tgggcactaa tggacttcct taacccaagg 2400gttctcggtg aagaagactt
cttccgccag cgctaccggc tgccgattga gcgctatggc 2460gacatgtctt
ccctgcgaga cctcaagggc cgtgttggtc ccttcatcct gagacgactc
2520aaaaccgaca aggcaatcat ctccgaccta cccgaaaaag tagagctgag
cgaatgggtg 2580gggctgagca aagaacaggc agccctctat cgcaacacag
tggatgaaac actggaggcc 2640attgcccgcg cacccagggg tcaacgccat
ggcaaggtgc tcggattgct taccagactg 2700aagcaaatct gcaaccatcc
cgccctagcc ctcaaagaac aaaccgttgc aaaagggttc 2760atggaccgct
ccgccaagct gctgcgtttg gaagaaattc tcgaagaagt aatcgaggca
2820ggagatcgcg ctctgttatt cacccaattc gcagaatggg gtcatctcct
taaggcctac 2880ctgcaacaac gctggcgctt tgaagttccc ttcctgcacg
gcagcacaag caaaactgaa 2940cgtcaggcca tggttgatcg cttccaggag
gatccacgtg gaccccaact gttcctgctg 3000tcactcaaag ccggtggtgt
aggcctcaac ctgacgcggg ctagccatgt gtttcatgtt 3060gatcgctggt
ggaatcctgc cgtagaaaac caggccactg atcgcgctta caggatcggg
3120caaaccagtc gggtgatggt gcacaaattc atcaccagcg gctcagttga
agagaaaatt 3180gatcgcatga ttcgtgaaaa atctcgactt gccgaagaca
tcattggctc tggagaagac 3240tggttaggtg gcttaggcgt cagtcaattg
cgcgaactag tggccctaga agacagctga 3300741099PRTProchlorococcus
marinus 74Met Ile Gly Cys Gly Thr Pro Ala Trp Met Val Ala Val Asp
Arg Gln 1 5 10 15 Cys Thr Pro Ala Pro Arg Asn Pro Thr His Thr Phe
Cys Val Ala Ala 20 25 30 Met Ser Leu Leu His Ala Thr Trp Leu Pro
Ala Ile Arg Thr Pro Thr 35 40 45 Ser Ser Gly Arg Pro Ala Leu Leu
Val Trp Ala Asp Thr Trp Arg Val 50 55 60 Ala Thr Pro Ala Gly Pro
Ala Ala Thr Pro Ala Leu His Pro Phe Thr 65 70 75 80 Leu Ser Pro Asp
Asp Leu Arg Ala Trp Leu Ile Glu Arg Asp Leu Leu 85 90 95 Pro Asp
Glu Ile Ile Asp Ala Thr Ala Cys Leu Thr Leu Pro Ser Arg 100 105 110
Thr Val Lys Pro Arg Asn Lys Thr Lys Asn Val Ser Thr Glu Ser Asp 115
120 125 Glu Ala Lys Asp Asn Lys Thr Ser Trp Thr Gly Leu Pro Leu Gln
Ala 130 135 140 Gly Glu Pro Ile Pro Lys Gln Thr Glu Trp Trp Pro Trp
Gln Val Gln 145 150 155 160 Gly Leu Ala Val Glu Pro Ala Ala Ala Thr
Ala Trp Leu Ser Lys Leu 165 170 175 Pro Leu Ser Gly Asn His Pro Asp
Leu Ala Asp Glu Leu Arg Trp Trp 180 185 190 Ser His Leu Gln Arg Trp
Ala Leu Ser Met Ile Ala Arg Gly Arg Trp 195 200 205 Leu Pro Gln Val
Glu Leu Ser Lys Gly Glu Gly Tyr Pro His Arg Ala 210 215 220 Arg Trp
Thr Pro Leu Leu Asn Arg Glu Asp Asp Arg Arg Arg Leu Glu 225 230 235
240 Asp Leu Ala Ala Gln Leu Pro Leu Val Ala Thr Cys Ala Leu Pro Trp
245 250 255 Arg Glu Pro Thr Gly Arg Arg Ser Asn Arg Met Thr Arg Leu
Arg Pro 260 265 270 Glu Ala Met Arg Ala Ala Asn Pro Val Ala Ser Cys
Arg Pro Arg Ser 275 280 285 Gly Arg Leu Arg Val Ala Ser Leu Leu Glu
Glu Leu Leu Asp Ala Gln 290 295 300 Leu Arg Thr Gly Phe Glu Ala Ser
Glu Gln Gly Leu Asp Pro Leu Leu 305 310 315 320 Thr Ala Trp Gln Glu
Ala Leu Gly Ser Asp Ser Gly Val Ile Asn Leu 325 330 335 Pro Asp Glu
Glu Ala Glu Arg Leu Ala Thr Ala Ser Asn His Trp Arg 340 345 350 Glu
Gly Val Ala Gly Asn Val Ala Pro Ala Arg Ala Cys Leu Glu Leu 355 360
365 Phe Thr Pro Gly Glu Gly Glu Asp Leu Trp Glu Leu Arg Phe Ser Leu
370 375 380 Gln Ala Glu Ala Asp Pro Thr Ile Lys Val Pro Ala Ala Ala
Ala Trp 385 390 395 400 Ala Ala Gly Pro Lys Val Leu Gln Leu Gly Glu
Ile Arg Val Glu His 405 410 415 Pro Gly Glu Val Leu Leu Glu Gly Met
Gly Arg Ala Leu Thr Val Phe 420 425 430 Ala Pro Ile Glu Arg Gly Leu
Asp Ser Ala Thr Pro Glu Ala Met Gln 435 440 445 Leu Thr Pro Ala Glu
Ala Phe Val Leu Val Arg Thr Ala Ala Thr Gln 450 455 460 Leu Arg Asp
Val Gly Val Gly Val Glu Leu Pro Ala Ser Leu Ser Gly 465 470 475 480
Gly Leu Ala Ser Arg Leu Gly Leu Ala Ile Lys Ala Glu Leu Ser Glu 485
490 495 Arg Ser Arg Gly Phe Thr Leu Gly Glu Thr Leu Asp Trp Ser Trp
Glu 500 505 510 Leu Met Ile Gly Gly Val Thr Leu Thr Leu Arg Glu Leu
Glu Arg Leu 515 520 525 Ala Ser Lys Arg Ser Pro Leu Val Asn His Lys
Gly Ala Trp Ile Glu 530 535 540 Leu Arg Pro Asn Asp Leu Lys His Ala
Glu His Phe Cys Ser Val Asn 545 550 555 560 Pro Gly Ile Ser Leu Asp
Asp Ala Leu Arg Leu Thr Ala Thr Asp Gly 565 570 575 Asp Thr Leu Met
Arg Leu Pro Val His Arg Phe Glu Ala Gly Pro Arg 580 585 590 Leu Gln
Ala Val Leu Glu Gln Tyr His Gln Gln Lys Ala Pro Asp Pro 595 600 605
Leu Pro Ala Pro Glu Gly Phe Cys Gly Gln Leu Arg Pro Tyr Gln Glu 610
615 620 Arg Gly Leu Gly Trp Leu Ala Phe Leu His Arg Phe Asp Gln Gly
Ala 625 630 635 640 Cys Leu Ala Asp Asp Met Gly Leu Gly Lys Thr Ile
Gln Leu Leu Ala 645 650 655 Phe Leu Gln His Leu Lys Ala Glu Gln Glu
Leu Lys Arg Pro Val Leu 660 665 670 Leu Ile Ala Pro Thr Ser Val Leu
Thr Asn Trp Lys Arg Glu Ala Leu 675 680 685 Ala Phe Thr Pro Glu Leu
Asn Val Arg Glu His Tyr Gly Pro Arg Arg 690 695 700 Pro Ser Thr Pro
Ala Ala Leu Lys Lys Ala Leu Lys Gly Leu Asp Leu 705 710 715 720 Val
Leu Thr Ser Tyr Gly Leu Leu Gln Arg Asp Ser Glu Leu Leu Glu 725 730
735 Thr Val Asp Trp Gln Gly Val Val Ile Asp Glu Ala Gln Ala Ile Lys
740 745 750 Asn Pro Asn Ala Lys Gln Ser Gln Ala Ala Arg Asp Met Gly
Arg Pro 755 760 765 Asp Lys Asn Asn Arg Phe Arg Ile Ala Leu Thr Gly
Thr Pro Val Glu 770 775 780 Asn Arg Val Ser Glu Leu Trp Ala Leu Met
Asp Phe Leu Asn Pro Arg 785 790 795 800 Val Leu Gly Glu Glu Asp Phe
Phe Arg Gln Arg Tyr Arg Leu Pro Ile 805 810 815 Glu Arg Tyr Gly Asp
Met Ser Ser Leu Arg Asp Leu Lys Gly Arg Val 820 825 830 Gly Pro Phe
Ile Leu Arg Arg Leu Lys Thr Asp Lys Ala Ile Ile Ser 835 840 845 Asp
Leu Pro Glu Lys Val Glu Leu Ser Glu Trp Val Gly Leu Ser Lys 850 855
860 Glu Gln Ala Ala Leu Tyr Arg Asn Thr Val Asp Glu Thr Leu Glu Ala
865 870 875 880 Ile Ala Arg Ala Pro Arg Gly Gln Arg His Gly Lys Val
Leu Gly Leu 885 890 895 Leu Thr Arg Leu Lys Gln Ile Cys Asn His Pro
Ala Leu Ala Leu Lys 900 905 910 Glu Gln Thr Val Ala Lys Gly Phe Met
Asp Arg Ser Ala Lys Leu Leu 915 920 925 Arg Leu Glu Glu Ile Leu Glu
Glu Val Ile Glu Ala Gly Asp Arg Ala 930 935 940 Leu Leu Phe Thr Gln
Phe Ala Glu Trp Gly His Leu Leu Lys Ala Tyr 945 950 955 960 Leu Gln
Gln Arg Trp Arg Phe Glu Val Pro Phe Leu His Gly Ser Thr 965 970 975
Ser Lys Thr Glu Arg Gln Ala Met Val Asp Arg Phe Gln Glu Asp Pro 980
985 990 Arg Gly Pro Gln Leu Phe Leu Leu Ser Leu Lys Ala Gly Gly Val
Gly 995 1000 1005 Leu Asn Leu Thr Arg Ala Ser His Val Phe His Val
Asp Arg Trp 1010 1015 1020 Trp Asn Pro Ala Val Glu Asn Gln Ala Thr
Asp Arg Ala Tyr Arg 1025 1030 1035 Ile Gly Gln Thr Ser Arg Val Met
Val His Lys Phe Ile Thr Ser 1040 1045 1050 Gly Ser Val Glu Glu Lys
Ile Asp Arg Met Ile Arg Glu Lys Ser 1055 1060 1065 Arg Leu Ala Glu
Asp Ile Ile Gly Ser Gly Glu Asp Trp Leu Gly 1070 1075 1080 Gly Leu
Gly Val Ser Gln Leu Arg Glu Leu Val Ala Leu Glu Asp 1085 1090 1095
Ser 752886DNARhodococcus sp. 75atggcgcgag cagggacttc acgcgctgtc
ggtcgcacct gcttggatgg gtgcatgctg 60cacggcctct ggacaccggg ttcgggtctc
atgctgtggg tggaggatcg gaatccggca 120gctccggagc cgacggacgc
ggtcgggcgg atgctggcgc ggaagttccg gcatcacgtg 180aaggtgccga
tgccgacgcc gtcggggccg gagatgctcg agtgggccgc ggttgcgctc
240gcaccaccgg atgcgacgga gttcctgctg tcggtgtcgt cccgcgaccc
ccggatcgcc 300ggggatctgc gctacctcgc ccacgtcgcc cgcggtgtcg
agcggtgggc acgggccggg 360cgggtggtgc ccgaggtaca ccgggcggag
ggcggctggt ggccgcgctg gcggctgctc 420ggcggtgaac ggcagcgtgc
gtggctcacg gagctggccg tggcgatgcc gccggtccag 480cgtcacggca
cgaccccccg ggccgtgctc gacgacatgg tcaccgagct gaccgacccc
540gtcgcccgcc gtgtcctcga acgacggcac ccggacgatt ccggcggcga
cgtggatcat 600ccgctgatcg acgcgctcgt gcggggtgac cagttcgccg
agggcaccgc ccagctgtcg 660ggatcgctgg acgggtggcg cgacagcctc
aaggtggacg agcccgaact ggtgctgcgg 720ctcctcgagc cggaagacgt
ggacgtggag ggggattggg acccggacac ggtgctgtgg 780cgactggagg
tctgccttcg accggaaggc gaagccccgg tgccgattcc gttgcaccgc
840acggaggcga gtcgtctgca gatcggggtg cgcaagctga cggaggccgt
ggccgcctac 900ccgcgactgc aggacgttcc cagtgacccc gacagcctgg
acctgatgtt gcccaccgcc 960gtggtcatcg accttgtcgg gcacggtgcg
gtggcgttga aggagaaggg catcagcctg 1020ctgctgccgc gggcgtggag
tgtggcgtcg ccgtcgatgc gtctgcgggt gagctcgccg 1080agcactccgg
cgagcgcgga gaaccgggcc gtcggcaaag accagttggt gcaatacaac
1140tgggagctgg cactcggcga cacggtgctc accgccgcgg agatgaatcg
actggtcaac 1200tccaagagcg atctcgtgcg gttgcgcggt gagtgggttc
gggcggatca ggaggtgctc 1260tcccgcgccg cgcgctacgt ggcggagcgg
cacgccagcg gcgaccgggc catcgtggac 1320ctgctgaagg acctgatcgc
ggacgatctg tccgatcttc ccgtggagga ggtcacggcc 1380accggctggg
cggccgcgtt gctggacggc gacacgaagc cgcaggacgt gccgaccccg
1440gacgggttgg acgccacgct gcgcccgtac cagaagcggg ggctcgactg
gctggtgttc 1500atgagccgtc tcggcctcgg ggccgtcctc gccgacgaca
tgggactcgg caagacgctg 1560cagttgctgg cgctgctggc acacgagaag
gcgcccacgc ccacgctgct ggtgtgcccg 1620atgtcggtgg tcggcaactg
gcagcgcgag gcagcgcgct tcgtcccctc gctgcgggtg 1680ctcgtccacc
acggtccgca gcggctgagc ggcgcggagt tcaccgccgc cgtgacacag
1740agcgatctgg tgatcaccac gtatgcgctg ctggcccgcg acgtcgcgca
cctgaaggag 1800caggactggc ggcgtgtcgt gctggacgag gcgcagcaca
tcaagaacgc gaagacgtcg 1860caggcgcggg cggcgcggag cattccggcg
gcgcaccgcg tcgcgctgac cggcactccg 1920gtcgagaacc gcctcgacga
actgcgctcg atcctcgact tcgcgaactc gggcatcctg 1980ggctcggagg
tgatgttccg caagcgcttc gtggtgccga tcgagcggga gcaggacgag
2040acagccgtcg cccggctccg cgcggtcacg tccccgttcg tgctgcgccg
ggtcaagacc 2100gatcccgcgg tcatcgccga cctccccgac aagttcgaga
tgacggtgcg cgccaacctc 2160accgcggagc aggccgcgct gtaccgggcg
gtggtcgacg acatgatggc gcagatcaag 2220gacaagaagg ggatgaagcg
caagggcgcc gtcctcgccg ccctgacgaa actcaagcag 2280gtgtgcaacc
acccggcaca cttcctgcgc gacgggtcgg cggtgatgcg gcgcggacag
2340caccgctccg gcaagctggg gctcgtcgag gacatcctgg attccgtggt
cgcggacggc 2400gagaaggcgt tgctgttcac ccagttccgg gaattcggcg
acctcgtcac cccgtacctc 2460gcggagcgtt tcggtactcc cgtgccgttt
ctgcacgggg gcgtgtccaa gcagaagcgc 2520gacgacatgg tggcctcgtt
ccagggcgac gacgggccgc cgatcatgat gctctcgctg 2580aaggcgggcg
ggacgggttt gaacctcacc gcggccaatc acgtcgtcca cctcgaccgg
2640tggtggaatc cggcggtcga gaaccaggcc acggacaggg cgttccggat
cggccagcgg 2700cgggacgtgc aggtgcgcaa gctcgtgtgc gtcggcaccc
tggaggagcg gatcgacgcg 2760atgatcgcca ccaagcagga gctggccgat
ctcgccgtcg ggacgggcga gaactgggtg 2820acggagatga gcaccgaaca
actgggcgaa ctgctccgcc tcggtgacga ggcggtgggc 2880gaatga
288676961PRTRhodococcus sp. 76Met Ala Arg Ala Gly Thr Ser Arg Ala
Val Gly Arg Thr Cys Leu Asp 1 5 10 15 Gly Cys Met Leu His Gly Leu
Trp Thr Pro Gly Ser Gly Leu Met Leu 20 25 30 Trp Val Glu Asp Arg
Asn Pro Ala Ala Pro Glu Pro Thr Asp Ala Val 35 40 45 Gly Arg Met
Leu Ala Arg Lys Phe Arg His His Val Lys Val Pro Met 50 55 60 Pro
Thr Pro Ser Gly Pro Glu Met Leu Glu Trp Ala Ala Val Ala Leu 65 70
75 80 Ala Pro Pro Asp Ala Thr Glu Phe Leu Leu Ser Val Ser Ser Arg
Asp 85 90 95 Pro Arg Ile Ala Gly Asp Leu Arg Tyr Leu Ala His Val
Ala Arg Gly 100 105 110 Val Glu Arg Trp Ala Arg Ala Gly Arg Val Val
Pro Glu Val His Arg 115 120 125 Ala Glu Gly Gly Trp Trp Pro Arg Trp
Arg Leu Leu Gly Gly Glu Arg 130 135 140 Gln Arg Ala Trp Leu Thr Glu
Leu Ala Val Ala Met Pro Pro Val Gln 145 150 155 160 Arg His Gly Thr
Thr Pro Arg Ala Val Leu Asp Asp Met Val Thr Glu 165 170 175 Leu Thr
Asp Pro Val Ala Arg Arg Val Leu Glu Arg Arg His Pro Asp 180 185 190
Asp Ser Gly Gly Asp Val Asp His Pro Leu Ile Asp Ala Leu Val Arg 195
200 205 Gly Asp Gln Phe Ala Glu Gly Thr Ala Gln Leu Ser Gly Ser Leu
Asp 210 215
220 Gly Trp Arg Asp Ser Leu Lys Val Asp Glu Pro Glu Leu Val Leu Arg
225 230 235 240 Leu Leu Glu Pro Glu Asp Val Asp Val Glu Gly Asp Trp
Asp Pro Asp 245 250 255 Thr Val Leu Trp Arg Leu Glu Val Cys Leu Arg
Pro Glu Gly Glu Ala 260 265 270 Pro Val Pro Ile Pro Leu His Arg Thr
Glu Ala Ser Arg Leu Gln Ile 275 280 285 Gly Val Arg Lys Leu Thr Glu
Ala Val Ala Ala Tyr Pro Arg Leu Gln 290 295 300 Asp Val Pro Ser Asp
Pro Asp Ser Leu Asp Leu Met Leu Pro Thr Ala 305 310 315 320 Val Val
Ile Asp Leu Val Gly His Gly Ala Val Ala Leu Lys Glu Lys 325 330 335
Gly Ile Ser Leu Leu Leu Pro Arg Ala Trp Ser Val Ala Ser Pro Ser 340
345 350 Met Arg Leu Arg Val Ser Ser Pro Ser Thr Pro Ala Ser Ala Glu
Asn 355 360 365 Arg Ala Val Gly Lys Asp Gln Leu Val Gln Tyr Asn Trp
Glu Leu Ala 370 375 380 Leu Gly Asp Thr Val Leu Thr Ala Ala Glu Met
Asn Arg Leu Val Asn 385 390 395 400 Ser Lys Ser Asp Leu Val Arg Leu
Arg Gly Glu Trp Val Arg Ala Asp 405 410 415 Gln Glu Val Leu Ser Arg
Ala Ala Arg Tyr Val Ala Glu Arg His Ala 420 425 430 Ser Gly Asp Arg
Ala Ile Val Asp Leu Leu Lys Asp Leu Ile Ala Asp 435 440 445 Asp Leu
Ser Asp Leu Pro Val Glu Glu Val Thr Ala Thr Gly Trp Ala 450 455 460
Ala Ala Leu Leu Asp Gly Asp Thr Lys Pro Gln Asp Val Pro Thr Pro 465
470 475 480 Asp Gly Leu Asp Ala Thr Leu Arg Pro Tyr Gln Lys Arg Gly
Leu Asp 485 490 495 Trp Leu Val Phe Met Ser Arg Leu Gly Leu Gly Ala
Val Leu Ala Asp 500 505 510 Asp Met Gly Leu Gly Lys Thr Leu Gln Leu
Leu Ala Leu Leu Ala His 515 520 525 Glu Lys Ala Pro Thr Pro Thr Leu
Leu Val Cys Pro Met Ser Val Val 530 535 540 Gly Asn Trp Gln Arg Glu
Ala Ala Arg Phe Val Pro Ser Leu Arg Val 545 550 555 560 Leu Val His
His Gly Pro Gln Arg Leu Ser Gly Ala Glu Phe Thr Ala 565 570 575 Ala
Val Thr Gln Ser Asp Leu Val Ile Thr Thr Tyr Ala Leu Leu Ala 580 585
590 Arg Asp Val Ala His Leu Lys Glu Gln Asp Trp Arg Arg Val Val Leu
595 600 605 Asp Glu Ala Gln His Ile Lys Asn Ala Lys Thr Ser Gln Ala
Arg Ala 610 615 620 Ala Arg Ser Ile Pro Ala Ala His Arg Val Ala Leu
Thr Gly Thr Pro 625 630 635 640 Val Glu Asn Arg Leu Asp Glu Leu Arg
Ser Ile Leu Asp Phe Ala Asn 645 650 655 Ser Gly Ile Leu Gly Ser Glu
Val Met Phe Arg Lys Arg Phe Val Val 660 665 670 Pro Ile Glu Arg Glu
Gln Asp Glu Thr Ala Val Ala Arg Leu Arg Ala 675 680 685 Val Thr Ser
Pro Phe Val Leu Arg Arg Val Lys Thr Asp Pro Ala Val 690 695 700 Ile
Ala Asp Leu Pro Asp Lys Phe Glu Met Thr Val Arg Ala Asn Leu 705 710
715 720 Thr Ala Glu Gln Ala Ala Leu Tyr Arg Ala Val Val Asp Asp Met
Met 725 730 735 Ala Gln Ile Lys Asp Lys Lys Gly Met Lys Arg Lys Gly
Ala Val Leu 740 745 750 Ala Ala Leu Thr Lys Leu Lys Gln Val Cys Asn
His Pro Ala His Phe 755 760 765 Leu Arg Asp Gly Ser Ala Val Met Arg
Arg Gly Gln His Arg Ser Gly 770 775 780 Lys Leu Gly Leu Val Glu Asp
Ile Leu Asp Ser Val Val Ala Asp Gly 785 790 795 800 Glu Lys Ala Leu
Leu Phe Thr Gln Phe Arg Glu Phe Gly Asp Leu Val 805 810 815 Thr Pro
Tyr Leu Ala Glu Arg Phe Gly Thr Pro Val Pro Phe Leu His 820 825 830
Gly Gly Val Ser Lys Gln Lys Arg Asp Asp Met Val Ala Ser Phe Gln 835
840 845 Gly Asp Asp Gly Pro Pro Ile Met Met Leu Ser Leu Lys Ala Gly
Gly 850 855 860 Thr Gly Leu Asn Leu Thr Ala Ala Asn His Val Val His
Leu Asp Arg 865 870 875 880 Trp Trp Asn Pro Ala Val Glu Asn Gln Ala
Thr Asp Arg Ala Phe Arg 885 890 895 Ile Gly Gln Arg Arg Asp Val Gln
Val Arg Lys Leu Val Cys Val Gly 900 905 910 Thr Leu Glu Glu Arg Ile
Asp Ala Met Ile Ala Thr Lys Gln Glu Leu 915 920 925 Ala Asp Leu Ala
Val Gly Thr Gly Glu Asn Trp Val Thr Glu Met Ser 930 935 940 Thr Glu
Gln Leu Gly Glu Leu Leu Arg Leu Gly Asp Glu Ala Val Gly 945 950 955
960 Glu 773153DNASalinispora tropica 77gtgctggttg tccacgggtc
gtggcggctc ggcatcgggc tcgccatctg ggccgaggac 60agcgcgtcgc cgcctcgggc
gccgcgccgg gccgggcggg cgccccgcga gcgaccccac 120ccgttcgccg
ccggtcaccc cgtgcttgcg gcagctctgg ccgaggtcgc cgagccgacc
180gagcccggca cggcactgct caccctgccc acccgagctg gttcgccgct
ggactcgccg 240gagctggtcc gcaccgcgtc ggtcgagccg ctccgtgggc
cggtcacgtt ggccgggtgg 300cgggtgcccg ccctggttta cgccccggac
gccgccctgt cgctgctctc ccagatcacc 360gcggccggcg ctctacctga
cgccgtaccc ggtgccactc tgcgtcacct cgcggagctg 420gcggccttcg
ccgtggacct cgccgcccgt ggtcgggtcc tgcccggcgt ccggccaccg
480aaggaacgtg ccagcgccgc ctgggcggtg tggcagcccc tgctcaccgg
cgtggacgct 540ggctgggccc gggccctcgc cctcgccctg ccgcccgcgg
tccgtgccgc cgtcgagatc 600gatccggctc cactcgccgt acccggcgga
ccggaaacgc ccgccaacgg tggtgtgccg 660ccgcaggctc gtacgaggcg
accgaccgca gccgccgggg aaccaggtga actggtggtc 720gaggcgctcg
acgcgctcac cgacgcggcc gtacgggctg ccctcgcgga gacctccctt
780acccggggag cccgtccgcg gggcgcggtc gcggcctggc tcgcggcgct
caccggcccg 840cgtcgtgact tcaccgccga ctcggcggag ctcgacaccc
tgcgcggtga gttggacgcc 900tggcagcgcg acgctgtggg aggttcggtc
cgggccagct tccggctggt ggagccgccg 960acggacggac tctttgaggc
ggcggccggg gggctggccg cggccgaggg gtcgtggcgg 1020gtcgagttcg
gcctacagcc ggccgaccag ccgggtctgc atgttgacgc cgtgcggatc
1080tggcacgagt cggcggccct accgggcccg gccgctccgc aggaggccct
gctgaccgag 1140ttggggcggg ccagccgact ctggccggag ctgaactcgg
ccctgcgcac cgccactcca 1200gaggcgctgg agctggacgc cgcgggcgcg
catcgctttc tacgcgacgg cgcgccggtg 1260ctgcacgcag ccgggttcgc
ggtgctgttg ccctcgtggt ggcagcgtcc gtcgtcccgg 1320ctcggcgctc
gactacaggc ccagagccgt accgccccgg gcaccgtcgc cggggctggc
1380gacggggtgg ggttggatgc cctggtcgac taccgctggg aggtgtccct
cggcgaccag 1440ccgctgaccg ccgaggaact ggagtcgctg gccgcgctga
aatctccgtt ggtccgcctg 1500cgtgggcgct gggtggagct ggacccgaaa
cgtctcgccg ccggcctgcg gctgctccgt 1560tccgccggcg agctgaccgt
cggcgacctg ctgcggctcg gcctctccga ccctgctacc 1620gacgcgctgc
cggtgctcga ggtggcggcc gacggtgcgt tgggtgactt gctcgccgga
1680gctgtggagc ggcaactcac cccggtggac gcggttccgt cgttccaggg
cgttctccgc 1740ccctaccagc ggcgagggct ggcctggctg tcctttctgc
agtccctcgg cctcggcggg 1800gtgctcgctg acgacatggg tctcggcaag
acggtacagc tactcgcgtt gctcgctggt 1860gacccgccgg gcgtcggtcc
gaccctgttg gtctgtccga tgtcactggt cggtaactgg 1920cagcgggagg
cggcgacctt caccccgggc gtacgggtcc atgtgcatca cggcgccgag
1980cgggcccgcg gggcggcgtt caccgcggcg gtggaggcag cggacctggt
cctcaccacc 2040tacacggtgg ctgcccgcga tgcgggggag ctggccgggg
tcgactggca tcgggtggtg 2100gtggacgagg cacaggccat caagaacgcc
tcgacgcggc aagccgaggc ggtccgggcg 2160ttgcccgccc ggcatcggat
cgcggtcacc ggcaccccgg tggagaatcg gctcgccgac 2220ctctggtcga
tcatgcagtt cgccaatccc ggtctgctcg gcccggccgc cgagttcaag
2280aagcggtacg ccgaaccgat cgagcgacac ggcgacgcgg aggcggccga
gcggctgcgc 2340cggatcaccg gcccgttcgt gctgcgtcgc ctcaagaccg
actcttcggt tatctccgac 2400ctgccagaga agctggagat ggaggtggtg
tgcaacctga ccgcggaaca ggctgccctc 2460taccgtgcgg tggtggacga
catgatggcc cagatcgagt ccagcgaggg catcgagcga 2520cgtgggctcg
tgctggccgc catgacccgg ctcaagcagg tctgcaacca cccggcgcac
2580ctgctgcggg acaactcggc gctggtcggc cgctccggca agctggcccg
gctggaggag 2640atcctcgacg aggtgcttgt cgcgggggag aaggccctgc
tcttcaccca gtacgccgag 2700ttcggcggca tgctgcgcgg ccacctgtcg
gcccggttcg gacaggagac gctgttcctg 2760cacggcggcg tcggtaaggc
cgaccgggac gcgatggtga cgcggttcca gtccccggac 2820ggccccgcgc
tcttcgtact ctcgctcaag gccggtggta ccggtctcac cctgaccgcg
2880gccaaccatg tcgtgcacgt tgaccgctgg tggaatccgg cggtggagga
ccaggccacg 2940gaccgggcgt tccgcatcgg gcagcggcgg cgcgttcagg
tccgcaagtt tgtctgcgcc 3000ggcacggtgg aggagaaggt cgccgcgctc
atcgccgaca agcgtcggct cgcctcgacg 3060gtggtgggtg ccggtgagca
gtgggttacc gagctgtcca cggcgcagct gcgggagctg 3120ttccagctgg
agtccggggc ggtggccgaa tga 3153781050PRTSalinispora tropica 78Val
Leu Val Val His Gly Ser Trp Arg Leu Gly Ile Gly Leu Ala Ile 1 5 10
15 Trp Ala Glu Asp Ser Ala Ser Pro Pro Arg Ala Pro Arg Arg Ala Gly
20 25 30 Arg Ala Pro Arg Glu Arg Pro His Pro Phe Ala Ala Gly His
Pro Val 35 40 45 Leu Ala Ala Ala Leu Ala Glu Val Ala Glu Pro Thr
Glu Pro Gly Thr 50 55 60 Ala Leu Leu Thr Leu Pro Thr Arg Ala Gly
Ser Pro Leu Asp Ser Pro 65 70 75 80 Glu Leu Val Arg Thr Ala Ser Val
Glu Pro Leu Arg Gly Pro Val Thr 85 90 95 Leu Ala Gly Trp Arg Val
Pro Ala Leu Val Tyr Ala Pro Asp Ala Ala 100 105 110 Leu Ser Leu Leu
Ser Gln Ile Thr Ala Ala Gly Ala Leu Pro Asp Ala 115 120 125 Val Pro
Gly Ala Thr Leu Arg His Leu Ala Glu Leu Ala Ala Phe Ala 130 135 140
Val Asp Leu Ala Ala Arg Gly Arg Val Leu Pro Gly Val Arg Pro Pro 145
150 155 160 Lys Glu Arg Ala Ser Ala Ala Trp Ala Val Trp Gln Pro Leu
Leu Thr 165 170 175 Gly Val Asp Ala Gly Trp Ala Arg Ala Leu Ala Leu
Ala Leu Pro Pro 180 185 190 Ala Val Arg Ala Ala Val Glu Ile Asp Pro
Ala Pro Leu Ala Val Pro 195 200 205 Gly Gly Pro Glu Thr Pro Ala Asn
Gly Gly Val Pro Pro Gln Ala Arg 210 215 220 Thr Arg Arg Pro Thr Ala
Ala Ala Gly Glu Pro Gly Glu Leu Val Val 225 230 235 240 Glu Ala Leu
Asp Ala Leu Thr Asp Ala Ala Val Arg Ala Ala Leu Ala 245 250 255 Glu
Thr Ser Leu Thr Arg Gly Ala Arg Pro Arg Gly Ala Val Ala Ala 260 265
270 Trp Leu Ala Ala Leu Thr Gly Pro Arg Arg Asp Phe Thr Ala Asp Ser
275 280 285 Ala Glu Leu Asp Thr Leu Arg Gly Glu Leu Asp Ala Trp Gln
Arg Asp 290 295 300 Ala Val Gly Gly Ser Val Arg Ala Ser Phe Arg Leu
Val Glu Pro Pro 305 310 315 320 Thr Asp Gly Leu Phe Glu Ala Ala Ala
Gly Gly Leu Ala Ala Ala Glu 325 330 335 Gly Ser Trp Arg Val Glu Phe
Gly Leu Gln Pro Ala Asp Gln Pro Gly 340 345 350 Leu His Val Asp Ala
Val Arg Ile Trp His Glu Ser Ala Ala Leu Pro 355 360 365 Gly Pro Ala
Ala Pro Gln Glu Ala Leu Leu Thr Glu Leu Gly Arg Ala 370 375 380 Ser
Arg Leu Trp Pro Glu Leu Asn Ser Ala Leu Arg Thr Ala Thr Pro 385 390
395 400 Glu Ala Leu Glu Leu Asp Ala Ala Gly Ala His Arg Phe Leu Arg
Asp 405 410 415 Gly Ala Pro Val Leu His Ala Ala Gly Phe Ala Val Leu
Leu Pro Ser 420 425 430 Trp Trp Gln Arg Pro Ser Ser Arg Leu Gly Ala
Arg Leu Gln Ala Gln 435 440 445 Ser Arg Thr Ala Pro Gly Thr Val Ala
Gly Ala Gly Asp Gly Val Gly 450 455 460 Leu Asp Ala Leu Val Asp Tyr
Arg Trp Glu Val Ser Leu Gly Asp Gln 465 470 475 480 Pro Leu Thr Ala
Glu Glu Leu Glu Ser Leu Ala Ala Leu Lys Ser Pro 485 490 495 Leu Val
Arg Leu Arg Gly Arg Trp Val Glu Leu Asp Pro Lys Arg Leu 500 505 510
Ala Ala Gly Leu Arg Leu Leu Arg Ser Ala Gly Glu Leu Thr Val Gly 515
520 525 Asp Leu Leu Arg Leu Gly Leu Ser Asp Pro Ala Thr Asp Ala Leu
Pro 530 535 540 Val Leu Glu Val Ala Ala Asp Gly Ala Leu Gly Asp Leu
Leu Ala Gly 545 550 555 560 Ala Val Glu Arg Gln Leu Thr Pro Val Asp
Ala Val Pro Ser Phe Gln 565 570 575 Gly Val Leu Arg Pro Tyr Gln Arg
Arg Gly Leu Ala Trp Leu Ser Phe 580 585 590 Leu Gln Ser Leu Gly Leu
Gly Gly Val Leu Ala Asp Asp Met Gly Leu 595 600 605 Gly Lys Thr Val
Gln Leu Leu Ala Leu Leu Ala Gly Asp Pro Pro Gly 610 615 620 Val Gly
Pro Thr Leu Leu Val Cys Pro Met Ser Leu Val Gly Asn Trp 625 630 635
640 Gln Arg Glu Ala Ala Thr Phe Thr Pro Gly Val Arg Val His Val His
645 650 655 His Gly Ala Glu Arg Ala Arg Gly Ala Ala Phe Thr Ala Ala
Val Glu 660 665 670 Ala Ala Asp Leu Val Leu Thr Thr Tyr Thr Val Ala
Ala Arg Asp Ala 675 680 685 Gly Glu Leu Ala Gly Val Asp Trp His Arg
Val Val Val Asp Glu Ala 690 695 700 Gln Ala Ile Lys Asn Ala Ser Thr
Arg Gln Ala Glu Ala Val Arg Ala 705 710 715 720 Leu Pro Ala Arg His
Arg Ile Ala Val Thr Gly Thr Pro Val Glu Asn 725 730 735 Arg Leu Ala
Asp Leu Trp Ser Ile Met Gln Phe Ala Asn Pro Gly Leu 740 745 750 Leu
Gly Pro Ala Ala Glu Phe Lys Lys Arg Tyr Ala Glu Pro Ile Glu 755 760
765 Arg His Gly Asp Ala Glu Ala Ala Glu Arg Leu Arg Arg Ile Thr Gly
770 775 780 Pro Phe Val Leu Arg Arg Leu Lys Thr Asp Ser Ser Val Ile
Ser Asp 785 790 795 800 Leu Pro Glu Lys Leu Glu Met Glu Val Val Cys
Asn Leu Thr Ala Glu 805 810 815 Gln Ala Ala Leu Tyr Arg Ala Val Val
Asp Asp Met Met Ala Gln Ile 820 825 830 Glu Ser Ser Glu Gly Ile Glu
Arg Arg Gly Leu Val Leu Ala Ala Met 835 840 845 Thr Arg Leu Lys Gln
Val Cys Asn His Pro Ala His Leu Leu Arg Asp 850 855 860 Asn Ser Ala
Leu Val Gly Arg Ser Gly Lys Leu Ala Arg Leu Glu Glu 865 870 875 880
Ile Leu Asp Glu Val Leu Val Ala Gly Glu Lys Ala Leu Leu Phe Thr 885
890 895 Gln Tyr Ala Glu Phe Gly Gly Met Leu Arg Gly His Leu Ser Ala
Arg 900 905 910 Phe Gly Gln Glu Thr Leu Phe Leu His Gly Gly Val Gly
Lys Ala Asp 915 920 925 Arg Asp Ala Met Val Thr Arg Phe Gln Ser Pro
Asp Gly Pro Ala Leu 930 935 940 Phe Val Leu Ser Leu Lys Ala Gly Gly
Thr Gly Leu Thr Leu Thr Ala 945 950 955 960 Ala Asn His Val Val His
Val Asp Arg Trp Trp Asn Pro Ala Val Glu 965 970 975 Asp Gln Ala Thr
Asp Arg Ala Phe Arg Ile Gly Gln Arg Arg Arg Val 980 985 990 Gln Val
Arg Lys Phe Val Cys Ala Gly Thr Val Glu Glu Lys Val Ala 995 1000
1005 Ala Leu Ile Ala Asp Lys Arg Arg Leu Ala Ser Thr Val Val Gly
1010 1015 1020 Ala Gly Glu Gln Trp Val Thr Glu Leu Ser Thr Ala Gln
Leu Arg 1025 1030 1035 Glu Leu Phe Gln Leu Glu Ser Gly Ala Val Ala
Glu 1040 1045 1050 792970DNASymbiobacterium thermophilum
79atgatcacgg ttcacggcag tttcgtcccc tccggcgcgt ccggcttctt cttcctgtgg
60ggcctggacg
gcgtggccgc ccgggatgcc gctcctcccg gccggcgccg ccgcggggtt
120ccgcgccacc catgcgcaac cgagccggaa gcgctctacc ccgccctgag
aggattgccc 180tacctgaaca ccctgtccct ggtccagtgg cagcccggac
cggacggcgt cagcccggcc 240cgggtcccgg ggatcgccct gtccgtgccc
aacgccgtgc agtggctgtt ggatctgccc 300gaccacttcc gcggcacgcc
cctccggccg gggcacagcc tgcagctctg gtgcgtcgca 360tccaagctgc
ttctggagtt cctggggcgg ggcctgatgc tgccggtgct gcaggccgag
420gccggggtgc tgagcgcggg ctgggcgctc cacctgaccg acgccgacga
cgtccgccgc 480ctgacccggc tggccgctgg attgccggag gcctgccgcg
cccttgtgcc ccccgaccga 540acccccaaca cctaccccct gccggtcgcc
gacggcctgg tccaccagtt catgcgtacg 600gcggccgccg gcgtgatccg
gctcctcctg gaggaagagc ccctgcccga ggcccagtcg 660ctacaggata
ccgccctgcg ccactggctg gcggcgctga ccggggcgga ggcccgggac
720ctgccgccgg gcctgcccgg cgcgcaggag ctgtacgccg ccctggaccg
ctggagcgcc 780cccgccaccg gcgtgctgag ccacgccagt ctgcggacgg
gggtccgcct ccacctgccc 840ggccccgaga ccgacggcga gtgggagctg
gagctcacgc tccatgcgcc ggacgagggt 900gcgctgcccg tcaccgccga
tgcggtctgg gccagcctgg gcgccgaggt ggagatcggc 960gggcagcggt
accagggcgc cgagcagcgg ctgctggccg acctgccggc catggcccgc
1020ctcttcccgc cactggcgcc gctgctccgg gaccccgcgc ccagccgcat
gcgcattccg 1080gcggacgacg tgctggccct gatccaggaa ggggccatgc
tgctccagca ggccggccac 1140cccgtgctgc tgccggccgc ccttgcgaag
cccgccgccc tccgggtcgg aatgcgcctc 1200agccccgccg ggggcagccc
ctccatgttc gggctgcacc agatcgtgaa cgtgcgctgg 1260gacgtggccc
tgggcggcac cccgctcacg ctggacgagc tgcgccacct ggcgcggcag
1320aagcggcccc tggtacagat gcagggccgg tgggtgcggg tggacgaacg
caccctggct 1380gcggtcctcc gccggatcga gcagcacggc gggcagatgg
agctgggcac ggcgctgcgc 1440ctggcacccg aggcggacga ggccaccgcg
accggctgga tcgccgagct gctggagcgg 1500ctgcaggagc cagcccggat
ggagccggtg ccgacccccg ggggcttcgc cggcaccctg 1560cggccgtacc
agcagcgggg cctcgcctgg ctggcgttcc tgcgccgctg gggcctgggc
1620gcgtgcctcg ccgacgacat ggggctgggc aagaccgtgc agctcatcgc
ccttctcctg 1680cacgagcggg aggccgggtg ggccgcgggc ccgaccctgc
tggtctgccc cgtctcggtc 1740ctgggcaact ggtgccggga gctggcccgc
ttcgccccgg gcctgcgggt cctggtgcac 1800catggccccg ggaggctggg
cgagccggac ttcgcccggc aggccggggc ccacgacgtg 1860gtgctgacca
cgtactccct gctggcccgg gatgccgcgc tgctgggcca ggtgacctgg
1920aacgggatcg tcgccgacga ggcgcagaac ctgaaaaacc ccgacacaca
gcacgcccgg 1980gcgctgcgaa gcctttccgg cggctaccgc atcgccctca
ccggtacgcc cgtcgaaaac 2040cacctgggcg acctgtggtc gctcttccag
ttcctcaacc cggggctgct gggcagccgc 2100gaggagttcg agcggcgcta
cgccgtgccg atccagcggt accaggacga ggaggctgcg 2160gcccggctcc
gccggcaggt gggtcccttc atcctgcgcc ggcagaagaa cgaccccgcc
2220atcgcgccgg acctgcccga caagctggag aacaccgagc tggtgaccct
ctcggtggaa 2280caggcggcgc tgtacgaggc catcgtgcag gagacgctgg
agcgggccgc gcaggccgac 2340ggcatccagc ggcaggcggc ggtcctggca
ggcctcacgc ggctgaagca ggtgtgcaac 2400catcccgcag ccgccaccgg
cgacggcccc ctggtggggc ggagcggcaa gatcgaccgg 2460ctggtgcaac
tgctgcagga ggtgctggcg gcgggcgagc aggccctgct cttcacccag
2520ttcgcccgct tcggcgggcg gctgcaggcc tacctggcgg agacgctggg
ctgcgaggtg 2580ctcttcctgc acggcggcac gccccagccc gagcgggacc
ggctcgtcgc ccggttccag 2640gccggcgagg cgcccctctt catcctctcg
ctgaaagccg gcggccttgg cctcaacctc 2700accgccgcga cccacgtctt
tcacgtggac cggtggtgga atccggcggt ggaggatcag 2760gccacagacc
gggcctaccg catcggccag acgcgcaggg tgctggtgca ccggctgatc
2820accgccggca cgctggagga gcgcatcgac cggctgctgg ccgagaagcg
tgccctggcg 2880ggccaggtga tcatcagcgg cgagtcgtgg ctcggccagc
tctccaccga ggagctgcgg 2940gccctgatcg ccctggaccg ggaggtgtag
297080989PRTSymbiobacterium thermophilum 80Met Ile Thr Val His Gly
Ser Phe Val Pro Ser Gly Ala Ser Gly Phe 1 5 10 15 Phe Phe Leu Trp
Gly Leu Asp Gly Val Ala Ala Arg Asp Ala Ala Pro 20 25 30 Pro Gly
Arg Arg Arg Arg Gly Val Pro Arg His Pro Cys Ala Thr Glu 35 40 45
Pro Glu Ala Leu Tyr Pro Ala Leu Arg Gly Leu Pro Tyr Leu Asn Thr 50
55 60 Leu Ser Leu Val Gln Trp Gln Pro Gly Pro Asp Gly Val Ser Pro
Ala 65 70 75 80 Arg Val Pro Gly Ile Ala Leu Ser Val Pro Asn Ala Val
Gln Trp Leu 85 90 95 Leu Asp Leu Pro Asp His Phe Arg Gly Thr Pro
Leu Arg Pro Gly His 100 105 110 Ser Leu Gln Leu Trp Cys Val Ala Ser
Lys Leu Leu Leu Glu Phe Leu 115 120 125 Gly Arg Gly Leu Met Leu Pro
Val Leu Gln Ala Glu Ala Gly Val Leu 130 135 140 Ser Ala Gly Trp Ala
Leu His Leu Thr Asp Ala Asp Asp Val Arg Arg 145 150 155 160 Leu Thr
Arg Leu Ala Ala Gly Leu Pro Glu Ala Cys Arg Ala Leu Val 165 170 175
Pro Pro Asp Arg Thr Pro Asn Thr Tyr Pro Leu Pro Val Ala Asp Gly 180
185 190 Leu Val His Gln Phe Met Arg Thr Ala Ala Ala Gly Val Ile Arg
Leu 195 200 205 Leu Leu Glu Glu Glu Pro Leu Pro Glu Ala Gln Ser Leu
Gln Asp Thr 210 215 220 Ala Leu Arg His Trp Leu Ala Ala Leu Thr Gly
Ala Glu Ala Arg Asp 225 230 235 240 Leu Pro Pro Gly Leu Pro Gly Ala
Gln Glu Leu Tyr Ala Ala Leu Asp 245 250 255 Arg Trp Ser Ala Pro Ala
Thr Gly Val Leu Ser His Ala Ser Leu Arg 260 265 270 Thr Gly Val Arg
Leu His Leu Pro Gly Pro Glu Thr Asp Gly Glu Trp 275 280 285 Glu Leu
Glu Leu Thr Leu His Ala Pro Asp Glu Gly Ala Leu Pro Val 290 295 300
Thr Ala Asp Ala Val Trp Ala Ser Leu Gly Ala Glu Val Glu Ile Gly 305
310 315 320 Gly Gln Arg Tyr Gln Gly Ala Glu Gln Arg Leu Leu Ala Asp
Leu Pro 325 330 335 Ala Met Ala Arg Leu Phe Pro Pro Leu Ala Pro Leu
Leu Arg Asp Pro 340 345 350 Ala Pro Ser Arg Met Arg Ile Pro Ala Asp
Asp Val Leu Ala Leu Ile 355 360 365 Gln Glu Gly Ala Met Leu Leu Gln
Gln Ala Gly His Pro Val Leu Leu 370 375 380 Pro Ala Ala Leu Ala Lys
Pro Ala Ala Leu Arg Val Gly Met Arg Leu 385 390 395 400 Ser Pro Ala
Gly Gly Ser Pro Ser Met Phe Gly Leu His Gln Ile Val 405 410 415 Asn
Val Arg Trp Asp Val Ala Leu Gly Gly Thr Pro Leu Thr Leu Asp 420 425
430 Glu Leu Arg His Leu Ala Arg Gln Lys Arg Pro Leu Val Gln Met Gln
435 440 445 Gly Arg Trp Val Arg Val Asp Glu Arg Thr Leu Ala Ala Val
Leu Arg 450 455 460 Arg Ile Glu Gln His Gly Gly Gln Met Glu Leu Gly
Thr Ala Leu Arg 465 470 475 480 Leu Ala Pro Glu Ala Asp Glu Ala Thr
Ala Thr Gly Trp Ile Ala Glu 485 490 495 Leu Leu Glu Arg Leu Gln Glu
Pro Ala Arg Met Glu Pro Val Pro Thr 500 505 510 Pro Gly Gly Phe Ala
Gly Thr Leu Arg Pro Tyr Gln Gln Arg Gly Leu 515 520 525 Ala Trp Leu
Ala Phe Leu Arg Arg Trp Gly Leu Gly Ala Cys Leu Ala 530 535 540 Asp
Asp Met Gly Leu Gly Lys Thr Val Gln Leu Ile Ala Leu Leu Leu 545 550
555 560 His Glu Arg Glu Ala Gly Trp Ala Ala Gly Pro Thr Leu Leu Val
Cys 565 570 575 Pro Val Ser Val Leu Gly Asn Trp Cys Arg Glu Leu Ala
Arg Phe Ala 580 585 590 Pro Gly Leu Arg Val Leu Val His His Gly Pro
Gly Arg Leu Gly Glu 595 600 605 Pro Asp Phe Ala Arg Gln Ala Gly Ala
His Asp Val Val Leu Thr Thr 610 615 620 Tyr Ser Leu Leu Ala Arg Asp
Ala Ala Leu Leu Gly Gln Val Thr Trp 625 630 635 640 Asn Gly Ile Val
Ala Asp Glu Ala Gln Asn Leu Lys Asn Pro Asp Thr 645 650 655 Gln His
Ala Arg Ala Leu Arg Ser Leu Ser Gly Gly Tyr Arg Ile Ala 660 665 670
Leu Thr Gly Thr Pro Val Glu Asn His Leu Gly Asp Leu Trp Ser Leu 675
680 685 Phe Gln Phe Leu Asn Pro Gly Leu Leu Gly Ser Arg Glu Glu Phe
Glu 690 695 700 Arg Arg Tyr Ala Val Pro Ile Gln Arg Tyr Gln Asp Glu
Glu Ala Ala 705 710 715 720 Ala Arg Leu Arg Arg Gln Val Gly Pro Phe
Ile Leu Arg Arg Gln Lys 725 730 735 Asn Asp Pro Ala Ile Ala Pro Asp
Leu Pro Asp Lys Leu Glu Asn Thr 740 745 750 Glu Leu Val Thr Leu Ser
Val Glu Gln Ala Ala Leu Tyr Glu Ala Ile 755 760 765 Val Gln Glu Thr
Leu Glu Arg Ala Ala Gln Ala Asp Gly Ile Gln Arg 770 775 780 Gln Ala
Ala Val Leu Ala Gly Leu Thr Arg Leu Lys Gln Val Cys Asn 785 790 795
800 His Pro Ala Ala Ala Thr Gly Asp Gly Pro Leu Val Gly Arg Ser Gly
805 810 815 Lys Ile Asp Arg Leu Val Gln Leu Leu Gln Glu Val Leu Ala
Ala Gly 820 825 830 Glu Gln Ala Leu Leu Phe Thr Gln Phe Ala Arg Phe
Gly Gly Arg Leu 835 840 845 Gln Ala Tyr Leu Ala Glu Thr Leu Gly Cys
Glu Val Leu Phe Leu His 850 855 860 Gly Gly Thr Pro Gln Pro Glu Arg
Asp Arg Leu Val Ala Arg Phe Gln 865 870 875 880 Ala Gly Glu Ala Pro
Leu Phe Ile Leu Ser Leu Lys Ala Gly Gly Leu 885 890 895 Gly Leu Asn
Leu Thr Ala Ala Thr His Val Phe His Val Asp Arg Trp 900 905 910 Trp
Asn Pro Ala Val Glu Asp Gln Ala Thr Asp Arg Ala Tyr Arg Ile 915 920
925 Gly Gln Thr Arg Arg Val Leu Val His Arg Leu Ile Thr Ala Gly Thr
930 935 940 Leu Glu Glu Arg Ile Asp Arg Leu Leu Ala Glu Lys Arg Ala
Leu Ala 945 950 955 960 Gly Gln Val Ile Ile Ser Gly Glu Ser Trp Leu
Gly Gln Leu Ser Thr 965 970 975 Glu Glu Leu Arg Ala Leu Ile Ala Leu
Asp Arg Glu Val 980 985 813114DNASynechococcus sp. 81atgagcctgc
tgcacgccac ctggctgtcg gccgacaccg ccgccgtgcc cgccctggga 60ggcggctacc
ggccgggctt gctgctctgg gccgacacct ggcgggtggc ggaaccccag
120acaccggcca gcgaggcgcc ccagcacccc ctcagcctcg accaggacga
cctcggcgcc 180tggcttgagg aggccgacct ctggacggag gatttccgcc
cggccggagc caccctctgc 240ctgcccagcc gccgccaggg ggccaggggg
aaaaagaaaa gcgacaccag cagctggagc 300ggcctgcccc tgcaggcggg
cgagccgatc ccgaaatccg tggagtggtg gccctggcgg 360gtggagggct
ggtggctgga gcccggcgcc gccaccctct ggcttgggcg cctgcccctc
420tcaggcgacc atcccgacct ggccgatgac ctgcgctggt ggagccatct
gcagcgctgg 480tcgctgagcc tgctggcccg gggccggctg ctgccccagg
tggagggggg ccgcgcccgc 540tggctgccgt tgatcaaccg cgaagacgac
cggcgccgcc tggaggatct ggcctcgcgt 600ctgccccagg tggcggtggc
ggccctggag cccggccagg gggaggccgg cgtcgcgatg 660gcgtgctggc
ggccgggatc cgggcgtcgg cggctggcct cgatcctcac gcacctggtg
720gatgcacgca tgcgtgcggg cttcaccccc agcgaagagg ggctggatcc
gctgctggcg 780gcctggcagc gggccctcgg ccccggtgac ggccgcctcg
atctcgggga cgacgactgc 840gaacgcctgc aggtggccac tcaccactgg
cgcgaagcgg tggctggccg ggtcgagccg 900gcccgggcct gtcttgagct
cgacacaccc gatgaggggg aagatctctg gcccctgcgc 960ttcagcctcc
aggccgaggc cgatcccagt ctgctgctgc ccgcagccgg ggtctgggcc
1020gccggggccg gctgcctgca gctgggtgaa accgaactcc agcaacccgg
tgaactgctg 1080ctggaaggcc tcgggagagc cctgcaggtg ttcgagccga
tcgagagggg tctcgacacc 1140gccacaccgg agcggatggc tctcaccccg
gccgaagcct tcgtgctggt gcgcaccgcc 1200gcgctgaagc tgcgtgatgt
gggcgtcggc gtggtcctgc cccccagcct cagcggtggc 1260ctggccagcc
ggctcggcct ctcgatcgag gccgatctgc ccgagcgctc ccgcggcttc
1320agcctcggtg aaagcctgca gtggagctgg gagctgatga tcggcggcgt
cacgctcacc 1380ctgcgggacc tggagcggct ggcgggcaag cgcagcccgc
tggtgcagca caagggggcc 1440tggatcgagc tgcgtccggg tgatctgcgc
aatgccgaga agttctgcgc cctcgatccg 1500gtcctcagcc tcgatgacgc
cctgcgcctg accggcaacg agggggagac cctgcagcgg 1560ctgccggtgc
accgcttcac agccggcccg aggctgaagg cggtgctgga gcagtaccac
1620cagcagaagg cccccgatcc cctgccggcc cccgagggct tcgccggcca
gctgcggccc 1680taccaggagc gcggcctggg ctggctggcc ttcctgcacc
gcttcgatca gggggcctgc 1740ctggccgacg acatgggcct gggcaagaca
atccagctgc tggccttcct gcagcacctc 1800aaggcggagc aggaactgaa
gcgtcccgta ctgctggtgg cccccacctc ggtgctcacc 1860aactggctgc
gggaagcgaa ggccttcacg ccggaactga acgtggtgga gcactacggc
1920ccccggcggc cctccacccc cgccgccctg aagaagaagc tggaggggat
ggatctggtg 1980ctcaccagct acggcctgct gcagcgcgac agcgagttac
tgagcagcct cgactggcag 2040ggggtggtga ttgatgaggc ccaggcgatc
aagaattcct cagcgcgcca gtcgcaggca 2100gcccgcgatc tggcacgccc
gctcaagcag agccgcttcc gtatcgcact caccggcacc 2160ccggtggaga
accgggtcag tgagctctgg gccctgatgg acttcctcaa tccgaaggtg
2220cttggggagg aggagttctt ccgccagcgc taccgcctgc cgatcgagcg
ctatggcgac 2280atggcctcgg tgcgcgacct caaggcccgc gtcggcccgt
tcatcctgcg gcgcctcaag 2340actgaccgct cgatcatctc cgacctgccc
gagaaggtgg aactgaagga gtgggttgga 2400ctctcacccg agcaggtcaa
gctctaccgc cgcaccgtgg aggacaccct cgatgcgatc 2460gcgcgggcac
ccgtgggcca gaagcacggc caggtgctgg ggctgctcac caagctcaag
2520caggtctgca accacccggc cctgatgctc aaggaagggg aggtgggggc
cggcttcagc 2580gcccgctcgg ccaagttgca gcggctcgag gaaatcgtcg
aggaggtgat cgcggccggc 2640gatcgggccc tcctgtttac ccagttcgcc
gaatggggcc acctgctcca gacccacctg 2700cagcagcgct tccaccagga
ggtgcccttt ctctatggca gtaccagcaa gggggagcgt 2760caggcgatgg
tggatcgctt ccaggacgac ccccggggac cacagctgtt cctgctctcg
2820ctcaaggcag gcggcgtggg gctcaacctc acccgggcca gtcatgtgtt
ccacatcgac 2880cgctggtgga atccggcggt ggagaaccag gccaccgacc
gggcctaccg catcggccag 2940accaaccggg tgatggtgca caagttcatc
accagcggct cggtggagga gaagatcgac 3000cgcatgatcc gcgaaaaggc
ccgcctggcc gaagacatcg tcggcagcgg tgaggagtgg 3060ctcggaggcc
tcgatcccgg ccagctgcgc gacctggtgg ccctggagga gtga
3114821037PRTSynechococcus sp. 82Met Ser Leu Leu His Ala Thr Trp
Leu Ser Ala Asp Thr Ala Ala Val 1 5 10 15 Pro Ala Leu Gly Gly Gly
Tyr Arg Pro Gly Leu Leu Leu Trp Ala Asp 20 25 30 Thr Trp Arg Val
Ala Glu Pro Gln Thr Pro Ala Ser Glu Ala Pro Gln 35 40 45 His Pro
Leu Ser Leu Asp Gln Asp Asp Leu Gly Ala Trp Leu Glu Glu 50 55 60
Ala Asp Leu Trp Thr Glu Asp Phe Arg Pro Ala Gly Ala Thr Leu Cys 65
70 75 80 Leu Pro Ser Arg Arg Gln Gly Ala Arg Gly Lys Lys Lys Ser
Asp Thr 85 90 95 Ser Ser Trp Ser Gly Leu Pro Leu Gln Ala Gly Glu
Pro Ile Pro Lys 100 105 110 Ser Val Glu Trp Trp Pro Trp Arg Val Glu
Gly Trp Trp Leu Glu Pro 115 120 125 Gly Ala Ala Thr Leu Trp Leu Gly
Arg Leu Pro Leu Ser Gly Asp His 130 135 140 Pro Asp Leu Ala Asp Asp
Leu Arg Trp Trp Ser His Leu Gln Arg Trp 145 150 155 160 Ser Leu Ser
Leu Leu Ala Arg Gly Arg Leu Leu Pro Gln Val Glu Gly 165 170 175 Gly
Arg Ala Arg Trp Leu Pro Leu Ile Asn Arg Glu Asp Asp Arg Arg 180 185
190 Arg Leu Glu Asp Leu Ala Ser Arg Leu Pro Gln Val Ala Val Ala Ala
195 200 205 Leu Glu Pro Gly Gln Gly Glu Ala Gly Val Ala Met Ala Cys
Trp Arg 210 215 220 Pro Gly Ser Gly Arg Arg Arg Leu Ala Ser Ile Leu
Thr His Leu Val 225 230 235 240 Asp Ala Arg Met Arg Ala Gly Phe Thr
Pro Ser Glu Glu Gly Leu Asp 245 250 255 Pro Leu Leu Ala Ala Trp Gln
Arg Ala Leu Gly Pro Gly Asp Gly Arg 260 265 270 Leu Asp Leu Gly Asp
Asp Asp Cys Glu Arg Leu Gln Val Ala Thr His 275 280 285 His Trp Arg
Glu Ala Val Ala Gly Arg Val Glu Pro Ala Arg Ala Cys 290 295 300 Leu
Glu Leu Asp Thr Pro Asp Glu Gly Glu Asp Leu Trp Pro Leu Arg 305 310
315 320 Phe Ser Leu Gln Ala Glu Ala Asp Pro Ser Leu Leu Leu Pro Ala
Ala 325 330 335 Gly Val Trp Ala Ala Gly Ala Gly Cys Leu Gln Leu Gly
Glu Thr Glu 340 345 350 Leu Gln Gln Pro Gly Glu Leu Leu Leu Glu Gly
Leu Gly Arg Ala Leu 355 360 365 Gln Val Phe Glu Pro Ile Glu Arg Gly
Leu Asp Thr Ala Thr Pro Glu 370
375 380 Arg Met Ala Leu Thr Pro Ala Glu Ala Phe Val Leu Val Arg Thr
Ala 385 390 395 400 Ala Leu Lys Leu Arg Asp Val Gly Val Gly Val Val
Leu Pro Pro Ser 405 410 415 Leu Ser Gly Gly Leu Ala Ser Arg Leu Gly
Leu Ser Ile Glu Ala Asp 420 425 430 Leu Pro Glu Arg Ser Arg Gly Phe
Ser Leu Gly Glu Ser Leu Gln Trp 435 440 445 Ser Trp Glu Leu Met Ile
Gly Gly Val Thr Leu Thr Leu Arg Asp Leu 450 455 460 Glu Arg Leu Ala
Gly Lys Arg Ser Pro Leu Val Gln His Lys Gly Ala 465 470 475 480 Trp
Ile Glu Leu Arg Pro Gly Asp Leu Arg Asn Ala Glu Lys Phe Cys 485 490
495 Ala Leu Asp Pro Val Leu Ser Leu Asp Asp Ala Leu Arg Leu Thr Gly
500 505 510 Asn Glu Gly Glu Thr Leu Gln Arg Leu Pro Val His Arg Phe
Thr Ala 515 520 525 Gly Pro Arg Leu Lys Ala Val Leu Glu Gln Tyr His
Gln Gln Lys Ala 530 535 540 Pro Asp Pro Leu Pro Ala Pro Glu Gly Phe
Ala Gly Gln Leu Arg Pro 545 550 555 560 Tyr Gln Glu Arg Gly Leu Gly
Trp Leu Ala Phe Leu His Arg Phe Asp 565 570 575 Gln Gly Ala Cys Leu
Ala Asp Asp Met Gly Leu Gly Lys Thr Ile Gln 580 585 590 Leu Leu Ala
Phe Leu Gln His Leu Lys Ala Glu Gln Glu Leu Lys Arg 595 600 605 Pro
Val Leu Leu Val Ala Pro Thr Ser Val Leu Thr Asn Trp Leu Arg 610 615
620 Glu Ala Lys Ala Phe Thr Pro Glu Leu Asn Val Val Glu His Tyr Gly
625 630 635 640 Pro Arg Arg Pro Ser Thr Pro Ala Ala Leu Lys Lys Lys
Leu Glu Gly 645 650 655 Met Asp Leu Val Leu Thr Ser Tyr Gly Leu Leu
Gln Arg Asp Ser Glu 660 665 670 Leu Leu Ser Ser Leu Asp Trp Gln Gly
Val Val Ile Asp Glu Ala Gln 675 680 685 Ala Ile Lys Asn Ser Ser Ala
Arg Gln Ser Gln Ala Ala Arg Asp Leu 690 695 700 Ala Arg Pro Leu Lys
Gln Ser Arg Phe Arg Ile Ala Leu Thr Gly Thr 705 710 715 720 Pro Val
Glu Asn Arg Val Ser Glu Leu Trp Ala Leu Met Asp Phe Leu 725 730 735
Asn Pro Lys Val Leu Gly Glu Glu Glu Phe Phe Arg Gln Arg Tyr Arg 740
745 750 Leu Pro Ile Glu Arg Tyr Gly Asp Met Ala Ser Val Arg Asp Leu
Lys 755 760 765 Ala Arg Val Gly Pro Phe Ile Leu Arg Arg Leu Lys Thr
Asp Arg Ser 770 775 780 Ile Ile Ser Asp Leu Pro Glu Lys Val Glu Leu
Lys Glu Trp Val Gly 785 790 795 800 Leu Ser Pro Glu Gln Val Lys Leu
Tyr Arg Arg Thr Val Glu Asp Thr 805 810 815 Leu Asp Ala Ile Ala Arg
Ala Pro Val Gly Gln Lys His Gly Gln Val 820 825 830 Leu Gly Leu Leu
Thr Lys Leu Lys Gln Val Cys Asn His Pro Ala Leu 835 840 845 Met Leu
Lys Glu Gly Glu Val Gly Ala Gly Phe Ser Ala Arg Ser Ala 850 855 860
Lys Leu Gln Arg Leu Glu Glu Ile Val Glu Glu Val Ile Ala Ala Gly 865
870 875 880 Asp Arg Ala Leu Leu Phe Thr Gln Phe Ala Glu Trp Gly His
Leu Leu 885 890 895 Gln Thr His Leu Gln Gln Arg Phe His Gln Glu Val
Pro Phe Leu Tyr 900 905 910 Gly Ser Thr Ser Lys Gly Glu Arg Gln Ala
Met Val Asp Arg Phe Gln 915 920 925 Asp Asp Pro Arg Gly Pro Gln Leu
Phe Leu Leu Ser Leu Lys Ala Gly 930 935 940 Gly Val Gly Leu Asn Leu
Thr Arg Ala Ser His Val Phe His Ile Asp 945 950 955 960 Arg Trp Trp
Asn Pro Ala Val Glu Asn Gln Ala Thr Asp Arg Ala Tyr 965 970 975 Arg
Ile Gly Gln Thr Asn Arg Val Met Val His Lys Phe Ile Thr Ser 980 985
990 Gly Ser Val Glu Glu Lys Ile Asp Arg Met Ile Arg Glu Lys Ala Arg
995 1000 1005 Leu Ala Glu Asp Ile Val Gly Ser Gly Glu Glu Trp Leu
Gly Gly 1010 1015 1020 Leu Asp Pro Gly Gln Leu Arg Asp Leu Val Ala
Leu Glu Glu 1025 1030 1035 833090DNASynechococcus sp. 83atgagcctgc
tgcacgccac ctggcttccc gccattcgta cttccagcag ttccggacaa 60ccggcactgc
tcgtttgggc tgacacctgg cgtgtcgcct caccggaggg acctggactc
120acacccgctc tgcatccctt cacccttggc tcgaacgatc tcaaggcttg
gttgaccgaa 180cgggacctga tgcctggggg cagcatcgat gccaccgcct
gcctcaccct cccaagccgc 240accgtcaaac cccgcaaaag tcgaacccaa
tcgagcgaac cagatccgga ggggccagcc 300tggaccgggt tgccaatgca
agcgggagaa cccattccaa aacaaatgga atggtggcca 360tggcaagtgc
aaggcctggc ggtcgagcca tcggccgcca cggaatggct ggcccgttta
420cccctatcgg gccgacatcc agaccttggg gatgaactgc gctggtggag
tcacctccaa 480cgttggtccc tcagcttggt ggcccgtggt cgctggattc
cccaaatgga attaagcaaa 540ggcgaggggt acccccaccg agcgcgctgg
gttcccctgc tgaaccgtga ggaggatcga 600cgccggctcg aagacctcgc
cgcgacgctg cccctcgtag cgacctgtgc cctcccttgg 660cgtgagccac
tcggacgccg cagcaaccgc accaccaggc ttcgaccgga agcgatgcga
720gccgccaatc cggtcgcctg ctgtcgccca cgaagcggtc gcctcagggt
ggccaccttg 780cttgaagact tggtggatgc ggagctgcgc aagggatttg
aaccaagcac ggaaggcctc 840gaccccttac tcaccttgtg gcaagaggcc
ctggcctcag aaaccggtgt tgtggaggtg 900ggcaacgaag acgcagaacg
cctcaccgcg gcaagcctgc actggcgcga gggaattgcc 960ggaggcttcg
cggccgcccg cacctgcctc gaactcaaca ccccaaacga aggcgaagaa
1020ctctgggacc tgaagtttgg attgcaagcg gaggccgatc ccagcctcaa
gctgccggcc 1080gccgcggcct gggcctcagg agcggaaacc cttcaactgg
gggaaatcca agttgaccag 1140gcgggggaag tgctgctgga gggtcttggc
cgagccctca cggtgttccc tccgatcgaa 1200cgcggactgg aaagcgcaac
accggaaacg atgcagctca ctccagcgga ggcatttgtg 1260ttggtgcgaa
cagcaacgca ccagctccgc aatgccggca tcggcgtcga actgcccccc
1320agtctttcag ggggcctcgc cagccggctt ggcttagcga ttaaagcgga
tctaccggat 1380cgatccagcg gcttcaccct cggcgaatct cttgactgga
gctgggatct catgatcggc 1440ggcgtcacac tcaccctccg agagctcgaa
cgtctcagcg gtaagcgaag tccgctggta 1500cgccacaagg gcgcctggat
cgaactacgg cccaacgatc tccgcaacgc cgaacgcttt 1560tgtggagcca
atccagaact gagcctcgac gacgcactac ggctcacggc cacagaaggg
1620gagctcatga tgcgcctgcc ggtgcatcgc tttgatgcag ggcctcgtct
tcagggagtt 1680ctcgagcaat accaccagca aaaagccccc gatcccctgc
cagctccaga gggattttcc 1740ggacaactcc gtccctatca agaacgtggc
ttgggctggc tggccttcct gcatcgcttc 1800gatcagggcg cctgcctggc
ggacgacatg ggcttgggca agaccatcca gttattggcg 1860ttcctgcagc
acctcaaagc ggaaaacgaa ctcaaacgcc cggtgctgtt ggtggcccca
1920acctcggtgc tcacgaattg gcgacgggaa gcggaagcct tcacccctga
gctgtcggtg 1980agagagcact acgggccacg ccggccttcc acgccggccg
ccttgaaaaa agagctcaaa 2040ggtgtggatc tggtgctcac cagttacgga
ctgatgcaac gcgacagtga gctgctggac 2100aacctcgact ggcaaggggt
tgtgatcgat gaagctcagg cgatcaagaa ccctggggca 2160aagcaaagcc
aagcggcccg agacctagcg cgagccggga agagcagcag gttccgcatt
2220gcactcacgg gcacaccggt ggaaaaccgc gtcagcgagc tgtgggcgct
gatggatttc 2280ctcaacccca aagtgttggg tgaggaagac ttttttcgtc
agcgctaccg catgccaatt 2340gagcgctacg gcgatatgtc gtcgttacgc
gatctcaaag cacgggttgg tcccttcatc 2400ctgcgccgcc tcaaaaccga
caagtcgatc atttccgacc tgcctgaaaa ggtggagctc 2460agcgaatggg
tggggctcag caaagaacag aaatcgctgt acaacaaaac cgttgaagac
2520accctcgatg ccattgccac cgcacctcga gggcaacgcc atggccaggt
gctggcgctc 2580ttgacccgtt taaaacagat ttgcaatcac ccggccttag
cccaacgcga aggtgccgtt 2640gacgccgaat tccttagccg gtccgccaag
ctcatgcggc tggaagaaat ccttgaagag 2700gtgattgaag ccggcgatcg
cgctttgctg ttcacccagt tcgccgaatg gggacacctc 2760ttgcaggcct
ggatgcaaca acgctggaag tctgaggttc cctttctgca cggcggaacc
2820cgcaaaagtg atcggcaagc gatggtggat cgattccaag aggacccccg
gggacctcaa 2880ctcttccttc tctccctcaa ggccggtggt gttggcctaa
acctcacccg ggccagccac 2940gtgttccacg ttggatcgct ggtggaatcc
agcggtggaa aaccaagcca ccgaccgggc 3000ctatcgaatt ggtcaaacca
accgggtgat ggtgcacaaa ttcgtcaccc gtggctcggt 3060ggaagaaaaa
atcgaccaaa tgattcgtga 3090841029PRTSynechococcus sp. 84Met Ser Leu
Leu His Ala Thr Trp Leu Pro Ala Ile Arg Thr Ser Ser 1 5 10 15 Ser
Ser Gly Gln Pro Ala Leu Leu Val Trp Ala Asp Thr Trp Arg Val 20 25
30 Ala Ser Pro Glu Gly Pro Gly Leu Thr Pro Ala Leu His Pro Phe Thr
35 40 45 Leu Gly Ser Asn Asp Leu Lys Ala Trp Leu Thr Glu Arg Asp
Leu Met 50 55 60 Pro Gly Gly Ser Ile Asp Ala Thr Ala Cys Leu Thr
Leu Pro Ser Arg 65 70 75 80 Thr Val Lys Pro Arg Lys Ser Arg Thr Gln
Ser Ser Glu Pro Asp Pro 85 90 95 Glu Gly Pro Ala Trp Thr Gly Leu
Pro Met Gln Ala Gly Glu Pro Ile 100 105 110 Pro Lys Gln Met Glu Trp
Trp Pro Trp Gln Val Gln Gly Leu Ala Val 115 120 125 Glu Pro Ser Ala
Ala Thr Glu Trp Leu Ala Arg Leu Pro Leu Ser Gly 130 135 140 Arg His
Pro Asp Leu Gly Asp Glu Leu Arg Trp Trp Ser His Leu Gln 145 150 155
160 Arg Trp Ser Leu Ser Leu Val Ala Arg Gly Arg Trp Ile Pro Gln Met
165 170 175 Glu Leu Ser Lys Gly Glu Gly Tyr Pro His Arg Ala Arg Trp
Val Pro 180 185 190 Leu Leu Asn Arg Glu Glu Asp Arg Arg Arg Leu Glu
Asp Leu Ala Ala 195 200 205 Thr Leu Pro Leu Val Ala Thr Cys Ala Leu
Pro Trp Arg Glu Pro Leu 210 215 220 Gly Arg Arg Ser Asn Arg Thr Thr
Arg Leu Arg Pro Glu Ala Met Arg 225 230 235 240 Ala Ala Asn Pro Val
Ala Cys Cys Arg Pro Arg Ser Gly Arg Leu Arg 245 250 255 Val Ala Thr
Leu Leu Glu Asp Leu Val Asp Ala Glu Leu Arg Lys Gly 260 265 270 Phe
Glu Pro Ser Thr Glu Gly Leu Asp Pro Leu Leu Thr Leu Trp Gln 275 280
285 Glu Ala Leu Ala Ser Glu Thr Gly Val Val Glu Val Gly Asn Glu Asp
290 295 300 Ala Glu Arg Leu Thr Ala Ala Ser Leu His Trp Arg Glu Gly
Ile Ala 305 310 315 320 Gly Gly Phe Ala Ala Ala Arg Thr Cys Leu Glu
Leu Asn Thr Pro Asn 325 330 335 Glu Gly Glu Glu Leu Trp Asp Leu Lys
Phe Gly Leu Gln Ala Glu Ala 340 345 350 Asp Pro Ser Leu Lys Leu Pro
Ala Ala Ala Ala Trp Ala Ser Gly Ala 355 360 365 Glu Thr Leu Gln Leu
Gly Glu Ile Gln Val Asp Gln Ala Gly Glu Val 370 375 380 Leu Leu Glu
Gly Leu Gly Arg Ala Leu Thr Val Phe Pro Pro Ile Glu 385 390 395 400
Arg Gly Leu Glu Ser Ala Thr Pro Glu Thr Met Gln Leu Thr Pro Ala 405
410 415 Glu Ala Phe Val Leu Val Arg Thr Ala Thr His Gln Leu Arg Asn
Ala 420 425 430 Gly Ile Gly Val Glu Leu Pro Pro Ser Leu Ser Gly Gly
Leu Ala Ser 435 440 445 Arg Leu Gly Leu Ala Ile Lys Ala Asp Leu Pro
Asp Arg Ser Ser Gly 450 455 460 Phe Thr Leu Gly Glu Ser Leu Asp Trp
Ser Trp Asp Leu Met Ile Gly 465 470 475 480 Gly Val Thr Leu Thr Leu
Arg Glu Leu Glu Arg Leu Ser Gly Lys Arg 485 490 495 Ser Pro Leu Val
Arg His Lys Gly Ala Trp Ile Glu Leu Arg Pro Asn 500 505 510 Asp Leu
Arg Asn Ala Glu Arg Phe Cys Gly Ala Asn Pro Glu Leu Ser 515 520 525
Leu Asp Asp Ala Leu Arg Leu Thr Ala Thr Glu Gly Glu Leu Met Met 530
535 540 Arg Leu Pro Val His Arg Phe Asp Ala Gly Pro Arg Leu Gln Gly
Val 545 550 555 560 Leu Glu Gln Tyr His Gln Gln Lys Ala Pro Asp Pro
Leu Pro Ala Pro 565 570 575 Glu Gly Phe Ser Gly Gln Leu Arg Pro Tyr
Gln Glu Arg Gly Leu Gly 580 585 590 Trp Leu Ala Phe Leu His Arg Phe
Asp Gln Gly Ala Cys Leu Ala Asp 595 600 605 Asp Met Gly Leu Gly Lys
Thr Ile Gln Leu Leu Ala Phe Leu Gln His 610 615 620 Leu Lys Ala Glu
Asn Glu Leu Lys Arg Pro Val Leu Leu Val Ala Pro 625 630 635 640 Thr
Ser Val Leu Thr Asn Trp Arg Arg Glu Ala Glu Ala Phe Thr Pro 645 650
655 Glu Leu Ser Val Arg Glu His Tyr Gly Pro Arg Arg Pro Ser Thr Pro
660 665 670 Ala Ala Leu Lys Lys Glu Leu Lys Gly Val Asp Leu Val Leu
Thr Ser 675 680 685 Tyr Gly Leu Met Gln Arg Asp Ser Glu Leu Leu Asp
Asn Leu Asp Trp 690 695 700 Gln Gly Val Val Ile Asp Glu Ala Gln Ala
Ile Lys Asn Pro Gly Ala 705 710 715 720 Lys Gln Ser Gln Ala Ala Arg
Asp Leu Ala Arg Ala Gly Lys Ser Ser 725 730 735 Arg Phe Arg Ile Ala
Leu Thr Gly Thr Pro Val Glu Asn Arg Val Ser 740 745 750 Glu Leu Trp
Ala Leu Met Asp Phe Leu Asn Pro Lys Val Leu Gly Glu 755 760 765 Glu
Asp Phe Phe Arg Gln Arg Tyr Arg Met Pro Ile Glu Arg Tyr Gly 770 775
780 Asp Met Ser Ser Leu Arg Asp Leu Lys Ala Arg Val Gly Pro Phe Ile
785 790 795 800 Leu Arg Arg Leu Lys Thr Asp Lys Ser Ile Ile Ser Asp
Leu Pro Glu 805 810 815 Lys Val Glu Leu Ser Glu Trp Val Gly Leu Ser
Lys Glu Gln Lys Ser 820 825 830 Leu Tyr Asn Lys Thr Val Glu Asp Thr
Leu Asp Ala Ile Ala Thr Ala 835 840 845 Pro Arg Gly Gln Arg His Gly
Gln Val Leu Ala Leu Leu Thr Arg Leu 850 855 860 Lys Gln Ile Cys Asn
His Pro Ala Leu Ala Gln Arg Glu Gly Ala Val 865 870 875 880 Asp Ala
Glu Phe Leu Ser Arg Ser Ala Lys Leu Met Arg Leu Glu Glu 885 890 895
Ile Leu Glu Glu Val Ile Glu Ala Gly Asp Arg Ala Leu Leu Phe Thr 900
905 910 Gln Phe Ala Glu Trp Gly His Leu Leu Gln Ala Trp Met Gln Gln
Arg 915 920 925 Trp Lys Ser Glu Val Pro Phe Leu His Gly Gly Thr Arg
Lys Ser Asp 930 935 940 Arg Gln Ala Met Val Asp Arg Phe Gln Glu Asp
Pro Arg Gly Pro Gln 945 950 955 960 Leu Phe Leu Leu Ser Leu Lys Ala
Gly Gly Val Gly Leu Asn Leu Thr 965 970 975 Arg Ala Ser His Val Phe
His Val Gly Ser Leu Val Glu Ser Ser Gly 980 985 990 Gly Lys Pro Ser
His Arg Pro Gly Leu Ser Asn Trp Ser Asn Gln Pro 995 1000 1005 Gly
Asp Gly Ala Gln Ile Arg His Pro Trp Leu Gly Gly Arg Lys 1010 1015
1020 Asn Arg Pro Asn Asp Ser 1025 853195DNASynechococcus sp.
85atgagcctgc tgcacgccac ctggcttccg gccattcgta ctcctaccag ctctggacga
60gctgcccttt tggtgtgggc cgacacctgg cgcgttgccg agcctgcagg cccaagtaca
120acccctgcgc ttcacccgtt caccctcagc ccagacgatc tccgggcctt
gctcacggaa 180cgggatcttt tacccgacgg catcattgat gccacggcat
gcctcaccct gccgagccgc 240agcgtgaagc cccgaaaaaa acgcgaaaca
gagaccagca gcactgaaca gcccagctgg 300acaggccttc ccttacaggc
tggagaaccg atccccaaac aaacagagtg gtggccttgg 360caggttcagg
ggctcgcaat tgaccccatg gcggccaccg cctggctgtc caaactgcct
420ctgtcaggac gacatcctga tttggctgat gagttgcgct ggtggagtca
catgcagcgt 480tggtccctca gcctcgtagc ccgaagtcgc tggctccccc
aagtggagct gagcaagggc 540gagggctatc cccatcgcgc ccgctgggta
ccgcttctga atcgggaaga agacaggcgc 600cgtctagaag acttggccgc
agggctccct ctcgttgcca cctgtgccct gccttggcga 660gaaccaacgg
gcaaacgcag caaccgaatc accaggctca gaccagaagc catgcgcgcc
720gcgaatcccg tggcttgctg caggcctcgc agcggacgac taagggttgc
cacgttattg 780gccgacctga tggacgcgca
gctgcgcaag ggctttactc ctgaccctga cggcttggac 840cccctgctac
gcgcctggga ggaggccttg agctcggata caggtgaaat ccaactcagc
900gatgaagaaa ccgaacgcct agccaccgcc agtaatcatt ggcgtgaagg
ggtcgctgga 960aatgttgctg cagcccgcgc ctgcctggag ctggcaacac
cagcggacga tgaggacctt 1020tggccactgc gcttctttct gcaggcggaa
gcagatccaa ccctcaagct gcccgcagga 1080gcggcatggg ctgcaggccc
cagcggcctc caacttgggg aaatcaaggt ggagcacccc 1140agcgaggtct
tgctcgaggg tatggggcga gccctgaccg tgttccaacc gatcgagcgc
1200ggactggaca gtgccacgcc agagagcatg cagctcacac cagctgaagc
gtttgttttg 1260gtgcgcacag cagtccgaca actgcgggat gtgggcgttg
gcgttgacct gccaccaagc 1320ctgtctggag ggctggctag caggcttggc
ctcgccatca aggcagaact ctccgagcgt 1380tcgcgaggct tcacgctcgg
tgaaaacctt gactggagct gggagctgat gatcggcggg 1440gtgacgctga
ccttgcgaga gcttgagcga ttggctggta agcgcagccc tctggtgcgt
1500cacaaagggg cttggatcga actacggccc aatgacctca aaaatgccga
gcgcttttgc 1560gccgccaatc cagacctgag cctcgacgac gcgcttcggc
tcaccgccac cgaaggcgac 1620acgatgatgc gcctgcccgt gcatcaattt
gatgccggtc cgcggctgca agccgtgctg 1680gagcagtacc accagcagaa
agcgccagac ccactccccg ctcccgaggg cttttcgggt 1740caactcaggc
cctatcaaga gagaggactc ggctggcttg ccttcctgca tcgcttcgac
1800caaggcgcct gcttggccga tgacatgggc cttggcaaaa ccatccagct
gctggctttt 1860ctgcaacacc tcaaggcaga aaacgaactc aagcgatcag
tgcttttaat tgcacccaca 1920tctgtcctta cgaactggaa acgagaggca
acagcgttta cacccgagct caaggtgcat 1980gagcactacg gtccaaaacg
cccgagcacc ccagcagcac tgaaaaaggc gctgaaagac 2040gtggatctcg
tgctcaccag ctatggcctg ttacaacgcg acagtgagct cctcgaaagt
2100cacgattggc aaggcctcgt gatcgatgaa gcgcaggcga taaaaaaccc
ctccgcgaag 2160caaagccaag ccgcccgtga tctggcccgc ccgaaaaaga
acagccgttt tcgcatcgca 2220ctcaccggca caccagttga gaaccgcgtc
agcgagctct gggccctgat ggacttcctc 2280aaccctcggg tactgggaga
ggaagaattt ttccgacatc gctatcgcat gccgattgag 2340cgttacggag
acctgtcctc gctgcgcgac ctcaaagccc gagtgggacc tttcatcctc
2400agacgactca aaacagacaa agcgatcatc tcggatctac ccgagaaggt
ggaattgagc 2460gagtgggttg ggctgagcaa agagcagaag tcgctgtatg
ccaaaaccgt tgaagacacc 2520ttggatgcca ttgcccgcgc gccacgcggc
aaacgtcatg gtcaggtgtt gggtctgctc 2580accaagctca agcagatttg
caaccaccct gcgcttgccc tcaaggagca gggcgccagc 2640gaagatttcc
tcaaacggtc cgtgaagctg caacgtctcg aagaaatttt ggacgaggtt
2700gtagaagctg gggatcgagc cttgctgttt acccagttcg cggaatgggg
caagttgctc 2760caggattatt tgcaacgacg ctggcgcagc gaagttccct
tcctcagcgg cagcaccagc 2820aaaagtgaac ggcaagccat ggtcgatcgc
ttccaggagg atccgcgcgg gccccagctt 2880ttcctgttat cactcaaagc
tggcggagtc ggcctcaacc tcacgcgcgc cagtcatgtc 2940tttcacatcg
accgttggtg gaaccccgcc gttgaaaatc aagccacgga ccgtgcctat
3000cgcatcggcc aaacgaaccg ggtcatggtg cataagttca tcaccagcgg
ctccgttgag 3060gagaaaattg accgcatgat ccgcgagaag tccagactgg
cggaagacat cattggctcc 3120ggcgaagact ggcttggagg cctggaaatg
ggacaactca aagagctagt gagcctggag 3180gacaaccaag catga
3195861064PRTSynechococcus sp. 86Met Ser Leu Leu His Ala Thr Trp
Leu Pro Ala Ile Arg Thr Pro Thr 1 5 10 15 Ser Ser Gly Arg Ala Ala
Leu Leu Val Trp Ala Asp Thr Trp Arg Val 20 25 30 Ala Glu Pro Ala
Gly Pro Ser Thr Thr Pro Ala Leu His Pro Phe Thr 35 40 45 Leu Ser
Pro Asp Asp Leu Arg Ala Leu Leu Thr Glu Arg Asp Leu Leu 50 55 60
Pro Asp Gly Ile Ile Asp Ala Thr Ala Cys Leu Thr Leu Pro Ser Arg 65
70 75 80 Ser Val Lys Pro Arg Lys Lys Arg Glu Thr Glu Thr Ser Ser
Thr Glu 85 90 95 Gln Pro Ser Trp Thr Gly Leu Pro Leu Gln Ala Gly
Glu Pro Ile Pro 100 105 110 Lys Gln Thr Glu Trp Trp Pro Trp Gln Val
Gln Gly Leu Ala Ile Asp 115 120 125 Pro Met Ala Ala Thr Ala Trp Leu
Ser Lys Leu Pro Leu Ser Gly Arg 130 135 140 His Pro Asp Leu Ala Asp
Glu Leu Arg Trp Trp Ser His Met Gln Arg 145 150 155 160 Trp Ser Leu
Ser Leu Val Ala Arg Ser Arg Trp Leu Pro Gln Val Glu 165 170 175 Leu
Ser Lys Gly Glu Gly Tyr Pro His Arg Ala Arg Trp Val Pro Leu 180 185
190 Leu Asn Arg Glu Glu Asp Arg Arg Arg Leu Glu Asp Leu Ala Ala Gly
195 200 205 Leu Pro Leu Val Ala Thr Cys Ala Leu Pro Trp Arg Glu Pro
Thr Gly 210 215 220 Lys Arg Ser Asn Arg Ile Thr Arg Leu Arg Pro Glu
Ala Met Arg Ala 225 230 235 240 Ala Asn Pro Val Ala Cys Cys Arg Pro
Arg Ser Gly Arg Leu Arg Val 245 250 255 Ala Thr Leu Leu Ala Asp Leu
Met Asp Ala Gln Leu Arg Lys Gly Phe 260 265 270 Thr Pro Asp Pro Asp
Gly Leu Asp Pro Leu Leu Arg Ala Trp Glu Glu 275 280 285 Ala Leu Ser
Ser Asp Thr Gly Glu Ile Gln Leu Ser Asp Glu Glu Thr 290 295 300 Glu
Arg Leu Ala Thr Ala Ser Asn His Trp Arg Glu Gly Val Ala Gly 305 310
315 320 Asn Val Ala Ala Ala Arg Ala Cys Leu Glu Leu Ala Thr Pro Ala
Asp 325 330 335 Asp Glu Asp Leu Trp Pro Leu Arg Phe Phe Leu Gln Ala
Glu Ala Asp 340 345 350 Pro Thr Leu Lys Leu Pro Ala Gly Ala Ala Trp
Ala Ala Gly Pro Ser 355 360 365 Gly Leu Gln Leu Gly Glu Ile Lys Val
Glu His Pro Ser Glu Val Leu 370 375 380 Leu Glu Gly Met Gly Arg Ala
Leu Thr Val Phe Gln Pro Ile Glu Arg 385 390 395 400 Gly Leu Asp Ser
Ala Thr Pro Glu Ser Met Gln Leu Thr Pro Ala Glu 405 410 415 Ala Phe
Val Leu Val Arg Thr Ala Val Arg Gln Leu Arg Asp Val Gly 420 425 430
Val Gly Val Asp Leu Pro Pro Ser Leu Ser Gly Gly Leu Ala Ser Arg 435
440 445 Leu Gly Leu Ala Ile Lys Ala Glu Leu Ser Glu Arg Ser Arg Gly
Phe 450 455 460 Thr Leu Gly Glu Asn Leu Asp Trp Ser Trp Glu Leu Met
Ile Gly Gly 465 470 475 480 Val Thr Leu Thr Leu Arg Glu Leu Glu Arg
Leu Ala Gly Lys Arg Ser 485 490 495 Pro Leu Val Arg His Lys Gly Ala
Trp Ile Glu Leu Arg Pro Asn Asp 500 505 510 Leu Lys Asn Ala Glu Arg
Phe Cys Ala Ala Asn Pro Asp Leu Ser Leu 515 520 525 Asp Asp Ala Leu
Arg Leu Thr Ala Thr Glu Gly Asp Thr Met Met Arg 530 535 540 Leu Pro
Val His Gln Phe Asp Ala Gly Pro Arg Leu Gln Ala Val Leu 545 550 555
560 Glu Gln Tyr His Gln Gln Lys Ala Pro Asp Pro Leu Pro Ala Pro Glu
565 570 575 Gly Phe Ser Gly Gln Leu Arg Pro Tyr Gln Glu Arg Gly Leu
Gly Trp 580 585 590 Leu Ala Phe Leu His Arg Phe Asp Gln Gly Ala Cys
Leu Ala Asp Asp 595 600 605 Met Gly Leu Gly Lys Thr Ile Gln Leu Leu
Ala Phe Leu Gln His Leu 610 615 620 Lys Ala Glu Asn Glu Leu Lys Arg
Ser Val Leu Leu Ile Ala Pro Thr 625 630 635 640 Ser Val Leu Thr Asn
Trp Lys Arg Glu Ala Thr Ala Phe Thr Pro Glu 645 650 655 Leu Lys Val
His Glu His Tyr Gly Pro Lys Arg Pro Ser Thr Pro Ala 660 665 670 Ala
Leu Lys Lys Ala Leu Lys Asp Val Asp Leu Val Leu Thr Ser Tyr 675 680
685 Gly Leu Leu Gln Arg Asp Ser Glu Leu Leu Glu Ser His Asp Trp Gln
690 695 700 Gly Leu Val Ile Asp Glu Ala Gln Ala Ile Lys Asn Pro Ser
Ala Lys 705 710 715 720 Gln Ser Gln Ala Ala Arg Asp Leu Ala Arg Pro
Lys Lys Asn Ser Arg 725 730 735 Phe Arg Ile Ala Leu Thr Gly Thr Pro
Val Glu Asn Arg Val Ser Glu 740 745 750 Leu Trp Ala Leu Met Asp Phe
Leu Asn Pro Arg Val Leu Gly Glu Glu 755 760 765 Glu Phe Phe Arg His
Arg Tyr Arg Met Pro Ile Glu Arg Tyr Gly Asp 770 775 780 Leu Ser Ser
Leu Arg Asp Leu Lys Ala Arg Val Gly Pro Phe Ile Leu 785 790 795 800
Arg Arg Leu Lys Thr Asp Lys Ala Ile Ile Ser Asp Leu Pro Glu Lys 805
810 815 Val Glu Leu Ser Glu Trp Val Gly Leu Ser Lys Glu Gln Lys Ser
Leu 820 825 830 Tyr Ala Lys Thr Val Glu Asp Thr Leu Asp Ala Ile Ala
Arg Ala Pro 835 840 845 Arg Gly Lys Arg His Gly Gln Val Leu Gly Leu
Leu Thr Lys Leu Lys 850 855 860 Gln Ile Cys Asn His Pro Ala Leu Ala
Leu Lys Glu Gln Gly Ala Ser 865 870 875 880 Glu Asp Phe Leu Lys Arg
Ser Val Lys Leu Gln Arg Leu Glu Glu Ile 885 890 895 Leu Asp Glu Val
Val Glu Ala Gly Asp Arg Ala Leu Leu Phe Thr Gln 900 905 910 Phe Ala
Glu Trp Gly Lys Leu Leu Gln Asp Tyr Leu Gln Arg Arg Trp 915 920 925
Arg Ser Glu Val Pro Phe Leu Ser Gly Ser Thr Ser Lys Ser Glu Arg 930
935 940 Gln Ala Met Val Asp Arg Phe Gln Glu Asp Pro Arg Gly Pro Gln
Leu 945 950 955 960 Phe Leu Leu Ser Leu Lys Ala Gly Gly Val Gly Leu
Asn Leu Thr Arg 965 970 975 Ala Ser His Val Phe His Ile Asp Arg Trp
Trp Asn Pro Ala Val Glu 980 985 990 Asn Gln Ala Thr Asp Arg Ala Tyr
Arg Ile Gly Gln Thr Asn Arg Val 995 1000 1005 Met Val His Lys Phe
Ile Thr Ser Gly Ser Val Glu Glu Lys Ile 1010 1015 1020 Asp Arg Met
Ile Arg Glu Lys Ser Arg Leu Ala Glu Asp Ile Ile 1025 1030 1035 Gly
Ser Gly Glu Asp Trp Leu Gly Gly Leu Glu Met Gly Gln Leu 1040 1045
1050 Lys Glu Leu Val Ser Leu Glu Asp Asn Gln Ala 1055 1060
873198DNASynechococcus sp. 87atgagcctgc tgcacgccac ctggcttccc
gccatccgca cctccagcag ttccggtcaa 60ccggcactgc tcgtttgggc tgacacctgg
cgggtggcca caccggaagg cccgggcctt 120accccagcgc tgcacccctt
caccctaagc catgaagacc tcagggcctg gctgagcgaa 180cgcgacctct
tgcccggcgg ctgcatcgat gccacggcgt gcctcaccct gccgagccgc
240acggtgaagc tgcgcaaaag ccgcagcaca aaagaggagc caacaccgga
accaccgggt 300tggaccgggc taccgatgca ggccggcgaa ccgatcccca
agcaaaccga atggtggccc 360tggcaggtgc aggggctcgc ggtggaaccg
tcggcagcca cggagtggct gtcccgattg 420ccgctctccg gcaccaatcc
agacctggct gatgaactgc gctggtggag ccatctgcag 480cgctgggcct
tgagtctggt ggcccggggc cgctggattc cccagatgga gttcagcaaa
540ggggagggct atccccatcg ggcccgttgg gtgccgcttc tcaaccggga
agaagaccgg 600cgccggctgg aggatctggc ggccagcctg ccgctggtgg
ccacctgcgc cttgccctgg 660cgggaacccc tggggcgccg cagcaaccgc
accacccggt tacgaccgga ggcgatgcga 720gccgccaacc ctgtggccag
ctgccggccc cgcagcggac gcctgcgggt ggcgacgctg 780ctggaagatc
tagtggacgc gcagctgcgc aaggactttg aaccctccac cgatgggctt
840gatcccctgc tgaccctctg gcaggaggcc ctggggtcgg agaccggggt
gatcgagatc 900ggcgatgaag aggccgaacg cctggccacc gccagccatc
actggcggga gggcatcgcc 960ggcgattttg ctgcggcccg cacctgcctt
gaactgcaca ccccaccgga tggggaggat 1020ctctgggagc tgcgcttcgg
gctgcaggcg gaagctgacc ccagcctgaa gctcccggcc 1080gccgcggcct
gggcggctgg tgcggaaccg ctacagcttg gagagatccg ggtggaccaa
1140ccgggtgaag tgctgctgga aggcatgggc cgcgccctga gcgtgtttcc
ggcaattgag 1200cggggtctgg agagcgccac acctgaaacg atgcagctca
ccccggccga ggccttcgtg 1260ctggtgcgca cggccgcccg gcagctgcgg
gatgccggcg tgggagtgga gctgccgccc 1320agcctctccg gtggcctggc
cagccgactg ggcctgtcga tcaaagcgga actgcccgaa 1380cgctcgagcg
gtttcacgtt gggtgagtgt ctggcctggg agtgggatct gatgatcggc
1440ggggtgacgc tcaccctgcg ggaattggag cgcctgagcg gcaagcgcag
ccccctggtg 1500cgccacaagg gggcctggat cgaactgcgg cccaacgacc
tcaaaaatgc cgaacgcttc 1560tgtggggcga aacctgaact gagcctcgac
gacgcgctgc ggctgacggg gacggaaggg 1620gaactgttga tgcggatgcc
ggtgcaccgc ttcgacgccg gcccacggct gcaatcggtg 1680ttgcagcaat
accaccagca gaaggccccc gaccccttgc cggccccgga aggattcagc
1740gggcagctgc ggccttatca ggagcggggc ctcggctggc tcgccttcct
gcaccgcttc 1800gatcaagggg cctgtctagc tgacgacatg ggcttgggca
aaaccattca gttgctagcg 1860ttcctgcagc acctcaaagc ggagcaagaa
ctgaaacgcc cggtgctgct ggtggccccc 1920acatcggtgc tcaccaactg
gcgacgggag gcggaatcgt tcactccaga gttgaaggtc 1980accgagcatt
acgggcctcg ccggccctcc acacccgccg aactcaaaaa agcgttgaag
2040gaggtggatc tggtgctcac cagctacggg ctgctgcagc gtgacagcga
actgctggaa 2100acccaggact ggcagggggt ggtgattgac gaagcccagg
cgatcaagaa ccctggcgcc 2160aaacagagcc aagccgcccg ggatctggcc
cgcaccggcc gcatcaagag caaccgcttc 2220cgcatcgcac tcaccggcac
ccccgtggaa aaccgggtga gcgaactgtg ggccttgatg 2280gacttcctca
acccaaaggt gcttggggaa gaagacttct tccgccagcg ctatcggatg
2340ccgattgagc gctacggcga catgtcgtcc ctgcgggacc tgaaaggccg
cgtgggtccg 2400ttcatcctgc gccggctgaa aaccgacaag acgatcattt
ccgacctgcc tgaaaaggtg 2460gagctgagcg aatgggtggg gctgagcaag
gagcagaaat ctctgtacag caagaccgtg 2520gaagacaccc tcgatgccat
tgcccgggcg ccgcgcgggc agcgccacgg gcaggtgctg 2580gccctgctca
cccggctgaa acagatctgc aaccatcccg ccctggccct gagcgaaggg
2640gccgtggacg atggcttcct gggccgttcg gccaagctgc agcggctgga
ggagatcctc 2700gatgaggtga tcgaagcggg cgatcgggcc ctgctgttca
cccagttcgc cgaatggggg 2760catttgctaa gggcctggat gcagcagcgc
tggaaatcag aagtgccctt cctgcacggc 2820ggcacccgca agaacgaacg
ccaggcgatg gtggatcgct tccaggagga tccccgcggt 2880ccacagctgt
tcctgctctc gctcaaggcc ggtggtgtgg gcctcaacct cacgcgggcc
2940agccatgtgt tccacatcga tcgctggtgg aaccctgccg tggaaaacca
ggccaccgac 3000cgggcctatc ggatcggcca aacgaaccga gtgatggttc
ataaattcat caccagcggt 3060tcggtggagg aaaaaatcga tcgcatgatc
cgcgagaaat cacgcctggc cgaagatgtg 3120atcggctccg gcgaagattg
gctgggaagc ctcggtggcg atcaattgcg cgatctcgtt 3180tctttggagg acacctga
3198881065PRTSynechococcus sp. 88Met Ser Leu Leu His Ala Thr Trp
Leu Pro Ala Ile Arg Thr Ser Ser 1 5 10 15 Ser Ser Gly Gln Pro Ala
Leu Leu Val Trp Ala Asp Thr Trp Arg Val 20 25 30 Ala Thr Pro Glu
Gly Pro Gly Leu Thr Pro Ala Leu His Pro Phe Thr 35 40 45 Leu Ser
His Glu Asp Leu Arg Ala Trp Leu Ser Glu Arg Asp Leu Leu 50 55 60
Pro Gly Gly Cys Ile Asp Ala Thr Ala Cys Leu Thr Leu Pro Ser Arg 65
70 75 80 Thr Val Lys Leu Arg Lys Ser Arg Ser Thr Lys Glu Glu Pro
Thr Pro 85 90 95 Glu Pro Pro Gly Trp Thr Gly Leu Pro Met Gln Ala
Gly Glu Pro Ile 100 105 110 Pro Lys Gln Thr Glu Trp Trp Pro Trp Gln
Val Gln Gly Leu Ala Val 115 120 125 Glu Pro Ser Ala Ala Thr Glu Trp
Leu Ser Arg Leu Pro Leu Ser Gly 130 135 140 Thr Asn Pro Asp Leu Ala
Asp Glu Leu Arg Trp Trp Ser His Leu Gln 145 150 155 160 Arg Trp Ala
Leu Ser Leu Val Ala Arg Gly Arg Trp Ile Pro Gln Met 165 170 175 Glu
Phe Ser Lys Gly Glu Gly Tyr Pro His Arg Ala Arg Trp Val Pro 180 185
190 Leu Leu Asn Arg Glu Glu Asp Arg Arg Arg Leu Glu Asp Leu Ala Ala
195 200 205 Ser Leu Pro Leu Val Ala Thr Cys Ala Leu Pro Trp Arg Glu
Pro Leu 210 215 220 Gly Arg Arg Ser Asn Arg Thr Thr Arg Leu Arg Pro
Glu Ala Met Arg 225 230 235 240 Ala Ala Asn Pro Val Ala Ser Cys Arg
Pro Arg Ser Gly Arg Leu Arg 245 250 255 Val Ala Thr Leu Leu Glu Asp
Leu Val Asp Ala Gln Leu Arg Lys Asp 260 265 270 Phe Glu Pro Ser Thr
Asp Gly Leu Asp Pro Leu Leu Thr Leu Trp Gln 275 280 285 Glu Ala Leu
Gly Ser Glu Thr Gly Val Ile Glu Ile Gly Asp Glu Glu 290 295 300 Ala
Glu Arg Leu Ala Thr Ala Ser His His Trp Arg Glu Gly Ile Ala 305 310
315 320 Gly Asp Phe Ala Ala Ala Arg Thr Cys Leu Glu Leu His Thr Pro
Pro 325 330 335 Asp Gly Glu Asp Leu Trp Glu Leu Arg Phe Gly Leu Gln
Ala Glu Ala 340 345 350 Asp Pro Ser Leu Lys Leu Pro Ala Ala Ala Ala
Trp Ala
Ala Gly Ala 355 360 365 Glu Pro Leu Gln Leu Gly Glu Ile Arg Val Asp
Gln Pro Gly Glu Val 370 375 380 Leu Leu Glu Gly Met Gly Arg Ala Leu
Ser Val Phe Pro Ala Ile Glu 385 390 395 400 Arg Gly Leu Glu Ser Ala
Thr Pro Glu Thr Met Gln Leu Thr Pro Ala 405 410 415 Glu Ala Phe Val
Leu Val Arg Thr Ala Ala Arg Gln Leu Arg Asp Ala 420 425 430 Gly Val
Gly Val Glu Leu Pro Pro Ser Leu Ser Gly Gly Leu Ala Ser 435 440 445
Arg Leu Gly Leu Ser Ile Lys Ala Glu Leu Pro Glu Arg Ser Ser Gly 450
455 460 Phe Thr Leu Gly Glu Cys Leu Ala Trp Glu Trp Asp Leu Met Ile
Gly 465 470 475 480 Gly Val Thr Leu Thr Leu Arg Glu Leu Glu Arg Leu
Ser Gly Lys Arg 485 490 495 Ser Pro Leu Val Arg His Lys Gly Ala Trp
Ile Glu Leu Arg Pro Asn 500 505 510 Asp Leu Lys Asn Ala Glu Arg Phe
Cys Gly Ala Lys Pro Glu Leu Ser 515 520 525 Leu Asp Asp Ala Leu Arg
Leu Thr Gly Thr Glu Gly Glu Leu Leu Met 530 535 540 Arg Met Pro Val
His Arg Phe Asp Ala Gly Pro Arg Leu Gln Ser Val 545 550 555 560 Leu
Gln Gln Tyr His Gln Gln Lys Ala Pro Asp Pro Leu Pro Ala Pro 565 570
575 Glu Gly Phe Ser Gly Gln Leu Arg Pro Tyr Gln Glu Arg Gly Leu Gly
580 585 590 Trp Leu Ala Phe Leu His Arg Phe Asp Gln Gly Ala Cys Leu
Ala Asp 595 600 605 Asp Met Gly Leu Gly Lys Thr Ile Gln Leu Leu Ala
Phe Leu Gln His 610 615 620 Leu Lys Ala Glu Gln Glu Leu Lys Arg Pro
Val Leu Leu Val Ala Pro 625 630 635 640 Thr Ser Val Leu Thr Asn Trp
Arg Arg Glu Ala Glu Ser Phe Thr Pro 645 650 655 Glu Leu Lys Val Thr
Glu His Tyr Gly Pro Arg Arg Pro Ser Thr Pro 660 665 670 Ala Glu Leu
Lys Lys Ala Leu Lys Glu Val Asp Leu Val Leu Thr Ser 675 680 685 Tyr
Gly Leu Leu Gln Arg Asp Ser Glu Leu Leu Glu Thr Gln Asp Trp 690 695
700 Gln Gly Val Val Ile Asp Glu Ala Gln Ala Ile Lys Asn Pro Gly Ala
705 710 715 720 Lys Gln Ser Gln Ala Ala Arg Asp Leu Ala Arg Thr Gly
Arg Ile Lys 725 730 735 Ser Asn Arg Phe Arg Ile Ala Leu Thr Gly Thr
Pro Val Glu Asn Arg 740 745 750 Val Ser Glu Leu Trp Ala Leu Met Asp
Phe Leu Asn Pro Lys Val Leu 755 760 765 Gly Glu Glu Asp Phe Phe Arg
Gln Arg Tyr Arg Met Pro Ile Glu Arg 770 775 780 Tyr Gly Asp Met Ser
Ser Leu Arg Asp Leu Lys Gly Arg Val Gly Pro 785 790 795 800 Phe Ile
Leu Arg Arg Leu Lys Thr Asp Lys Thr Ile Ile Ser Asp Leu 805 810 815
Pro Glu Lys Val Glu Leu Ser Glu Trp Val Gly Leu Ser Lys Glu Gln 820
825 830 Lys Ser Leu Tyr Ser Lys Thr Val Glu Asp Thr Leu Asp Ala Ile
Ala 835 840 845 Arg Ala Pro Arg Gly Gln Arg His Gly Gln Val Leu Ala
Leu Leu Thr 850 855 860 Arg Leu Lys Gln Ile Cys Asn His Pro Ala Leu
Ala Leu Ser Glu Gly 865 870 875 880 Ala Val Asp Asp Gly Phe Leu Gly
Arg Ser Ala Lys Leu Gln Arg Leu 885 890 895 Glu Glu Ile Leu Asp Glu
Val Ile Glu Ala Gly Asp Arg Ala Leu Leu 900 905 910 Phe Thr Gln Phe
Ala Glu Trp Gly His Leu Leu Arg Ala Trp Met Gln 915 920 925 Gln Arg
Trp Lys Ser Glu Val Pro Phe Leu His Gly Gly Thr Arg Lys 930 935 940
Asn Glu Arg Gln Ala Met Val Asp Arg Phe Gln Glu Asp Pro Arg Gly 945
950 955 960 Pro Gln Leu Phe Leu Leu Ser Leu Lys Ala Gly Gly Val Gly
Leu Asn 965 970 975 Leu Thr Arg Ala Ser His Val Phe His Ile Asp Arg
Trp Trp Asn Pro 980 985 990 Ala Val Glu Asn Gln Ala Thr Asp Arg Ala
Tyr Arg Ile Gly Gln Thr 995 1000 1005 Asn Arg Val Met Val His Lys
Phe Ile Thr Ser Gly Ser Val Glu 1010 1015 1020 Glu Lys Ile Asp Arg
Met Ile Arg Glu Lys Ser Arg Leu Ala Glu 1025 1030 1035 Asp Val Ile
Gly Ser Gly Glu Asp Trp Leu Gly Ser Leu Gly Gly 1040 1045 1050 Asp
Gln Leu Arg Asp Leu Val Ser Leu Glu Asp Thr 1055 1060 1065
893192DNASynechococcus sp. 89atgagcctgc tgcacgccac ctggcttccc
gccattcgta cttccagcag ttccggacag 60ccggcactgc tcatttgggc tgacacctgg
cgtgtcgcct caccggaggg gcccggactc 120acacccgctc tgcatccctt
cacccttggc tcggacgatc tcaaagcttg gttgaccgaa 180cgggacctga
tgcctggggg cagcatcgat gccaccgcct gcctcaccct cccaagccgc
240agcgtcaaac cccgcaaaag tcgaacccaa ccgagcgaac cagccccaga
gggaccggcc 300tggaccggat tgccaatgca agcaggagag cccattccga
agcaaatgga atggtggccc 360tggcaggtac aaggcctcgc ggtggagcca
tcggccgcaa cggaatggct cgcccgttta 420cccctatcgg gccgacatcc
agacctcgga gatgaattgc gctggtggag ccatctccaa 480cgttggtccc
tcagcttggt ggcccggggg cgctggattc cccagatgga attaagcaaa
540ggcgagggtt acccccaccg agcgcgctgg gttcccttgt tgaaccgtga
ggaagatcga 600cgacggctcg aagacctcgc ggccacgctg cccctcgtgg
cgacctgtgc cctcccttgg 660cgtgagccac ttggacgccg tagcaaccgc
accaccaggc ttcgaccgga agcgatgcga 720gccgccaacc cggtggcttg
ctgccgcccc cggagcggtc gcctcagggt ggccaccttg 780cttgaagact
tggtggatgc agagctgcgc aagggatttg aacccaccac agaggggctc
840gaccccctac tcaccctgtg gcaagaggcc ctggcctcag aaaccggtgt
tgtggaggtg 900ggcaacgagg atgcagaacg ccttaccgcg gcaagcctgc
actggcgcga agggattgcc 960ggaggcttcg ctgctgcccg cacctgcctc
gaactaaaca ccccaaacga aggcgaagaa 1020ctctgggacc tgaagtttgg
cttgcaagcg gaggccgatc ccagcctcaa gctgccggcc 1080gccgcggcct
gggcctcagg agccgaaaca ctccagctcg gggagatcaa agttgaccag
1140gcgggggaag tgctgctgga gggtcttggc cgagccctca cggtgttccc
tccgatcgaa 1200cgcggactgg aaagcgcaac gccagaaacg atgcagctca
cgccagcgga ggcgtttgtc 1260ttggtgcgaa cagcaacgca ccagctccgc
aatgccggca tcggcgtcga actgcccccc 1320agcctttcag ggggcctcgc
cagccggctt ggtttagcca tcaaggcaga tttaccagat 1380cgatccagcg
gcttcaccct cggagaatct ctggactgga gctgggatct gatgatcggc
1440ggcgtcacac tcaccctgcg agagctcgaa cggctcagcg gtaagcgcag
tccgcttgtg 1500cgccacaagg gagcctggat cgaactgcga cccaacgatc
tccgcaacgc cgaacgcttc 1560tgtggagcca atccagaact gagcctcgac
gatgccctaa ggctcacggc cacagaaggg 1620gagctaatga tgcgcttgcc
ggtgcatcgc tttgatgcgg ggcctcggct tcagggagtt 1680ctcgagcaat
atcaccagca aaaagccccc gatccccttc ccgctccaga gggattttcc
1740ggacaactgc gtccttatca agaacgtggc ttgggctggc tggccttctt
acatcgcttc 1800gatcaaggcg cctgcctggc ggacgacatg ggcttgggca
agaccatcca attgttggcc 1860ttcctgcagc acctcaaagc cgagcacgaa
ctcaaacgcc cggtgctgtt ggtggcccca 1920acctcggtgc tcacgaattg
gcgacgggag gcggaagcct tcacccccga gctgtcggtg 1980aaagagcact
acggcccacg ccggccttcc acgccggccg ccttgaaaaa agaactcaaa
2040gatgtggatc tggtgctcac cagttacggc ctgatgcaac gcgacagcga
gctgctggac 2100agcgtcgact ggcaaggggt tgtgatcgac gaagcgcagg
cgatcaaaaa ccctggggcg 2160aaacaaagcc aagcagcccg agacctggcc
cgagctggaa agagcagcag gttccgcatc 2220gcactcaccg gcacaccggt
ggaaaaccgc gtcagcgagc tgtgggcgct gatggatttc 2280ctcaacccaa
aggtgttggg agaggaagac ttctttcgtc agcgctaccg catgccaatt
2340gagcgctacg gcgatatgtc gtcgttacgc gatctcaaag cgcgggtcgg
ccccttcatc 2400ctgcgccgtc tcaaaaccga caagtcgatc atttccgacc
tgcctgaaaa ggtggagctc 2460agtgaatggg tgggtctcag caaagaacag
aaatcgctgt acaacaaaac cgttgaagac 2520accctcgacg ccattgccac
cgcaccgcgg gggcaacgcc atggccaggt gctagccctc 2580ttgacccggt
taaagcagat ttgcaatcac ccggctttag cccaacgcga aggggccgtt
2640gacagcgaat tccttggccg ttccgccaag ctgatgcgac tcgaagaaat
cctcgaagag 2700gtgattgaag ccggcgatcg cgctttgcta ttcacccaat
tcgccgaatg ggggcatctc 2760ctgcaggcct ggatgcaaca acgctggaag
tctgaggttc ccttcctgca cggcggaacc 2820cgcaagagtg atcggcaagc
gatggtggat cgattccaag aggacccccg gggacctcaa 2880ctctttcttc
tgtccctcaa ggccggtggt gtaggcctca acctcacccg ggccagtcat
2940gtgttccacg tcgatcgctg gtggaatcca gcggtggaaa accaagccac
cgaccgggcc 3000tatcgaattg gtcaaaccaa ccgggtaatg gtgcacaaat
tcgtcacccg tggctcggtg 3060gaagaaaaaa tcgaccaaat gattcgtgaa
aaagctcgaa tggctgaaga cgtgatcggc 3120tccggtgaag actggctcgg
gagccttggc ggcgatcagc tgcgcaatct tgttgccctc 3180gaggacacct aa
3192901063PRTSynechococcus sp. 90Met Ser Leu Leu His Ala Thr Trp
Leu Pro Ala Ile Arg Thr Ser Ser 1 5 10 15 Ser Ser Gly Gln Pro Ala
Leu Leu Ile Trp Ala Asp Thr Trp Arg Val 20 25 30 Ala Ser Pro Glu
Gly Pro Gly Leu Thr Pro Ala Leu His Pro Phe Thr 35 40 45 Leu Gly
Ser Asp Asp Leu Lys Ala Trp Leu Thr Glu Arg Asp Leu Met 50 55 60
Pro Gly Gly Ser Ile Asp Ala Thr Ala Cys Leu Thr Leu Pro Ser Arg 65
70 75 80 Ser Val Lys Pro Arg Lys Ser Arg Thr Gln Pro Ser Glu Pro
Ala Pro 85 90 95 Glu Gly Pro Ala Trp Thr Gly Leu Pro Met Gln Ala
Gly Glu Pro Ile 100 105 110 Pro Lys Gln Met Glu Trp Trp Pro Trp Gln
Val Gln Gly Leu Ala Val 115 120 125 Glu Pro Ser Ala Ala Thr Glu Trp
Leu Ala Arg Leu Pro Leu Ser Gly 130 135 140 Arg His Pro Asp Leu Gly
Asp Glu Leu Arg Trp Trp Ser His Leu Gln 145 150 155 160 Arg Trp Ser
Leu Ser Leu Val Ala Arg Gly Arg Trp Ile Pro Gln Met 165 170 175 Glu
Leu Ser Lys Gly Glu Gly Tyr Pro His Arg Ala Arg Trp Val Pro 180 185
190 Leu Leu Asn Arg Glu Glu Asp Arg Arg Arg Leu Glu Asp Leu Ala Ala
195 200 205 Thr Leu Pro Leu Val Ala Thr Cys Ala Leu Pro Trp Arg Glu
Pro Leu 210 215 220 Gly Arg Arg Ser Asn Arg Thr Thr Arg Leu Arg Pro
Glu Ala Met Arg 225 230 235 240 Ala Ala Asn Pro Val Ala Cys Cys Arg
Pro Arg Ser Gly Arg Leu Arg 245 250 255 Val Ala Thr Leu Leu Glu Asp
Leu Val Asp Ala Glu Leu Arg Lys Gly 260 265 270 Phe Glu Pro Thr Thr
Glu Gly Leu Asp Pro Leu Leu Thr Leu Trp Gln 275 280 285 Glu Ala Leu
Ala Ser Glu Thr Gly Val Val Glu Val Gly Asn Glu Asp 290 295 300 Ala
Glu Arg Leu Thr Ala Ala Ser Leu His Trp Arg Glu Gly Ile Ala 305 310
315 320 Gly Gly Phe Ala Ala Ala Arg Thr Cys Leu Glu Leu Asn Thr Pro
Asn 325 330 335 Glu Gly Glu Glu Leu Trp Asp Leu Lys Phe Gly Leu Gln
Ala Glu Ala 340 345 350 Asp Pro Ser Leu Lys Leu Pro Ala Ala Ala Ala
Trp Ala Ser Gly Ala 355 360 365 Glu Thr Leu Gln Leu Gly Glu Ile Lys
Val Asp Gln Ala Gly Glu Val 370 375 380 Leu Leu Glu Gly Leu Gly Arg
Ala Leu Thr Val Phe Pro Pro Ile Glu 385 390 395 400 Arg Gly Leu Glu
Ser Ala Thr Pro Glu Thr Met Gln Leu Thr Pro Ala 405 410 415 Glu Ala
Phe Val Leu Val Arg Thr Ala Thr His Gln Leu Arg Asn Ala 420 425 430
Gly Ile Gly Val Glu Leu Pro Pro Ser Leu Ser Gly Gly Leu Ala Ser 435
440 445 Arg Leu Gly Leu Ala Ile Lys Ala Asp Leu Pro Asp Arg Ser Ser
Gly 450 455 460 Phe Thr Leu Gly Glu Ser Leu Asp Trp Ser Trp Asp Leu
Met Ile Gly 465 470 475 480 Gly Val Thr Leu Thr Leu Arg Glu Leu Glu
Arg Leu Ser Gly Lys Arg 485 490 495 Ser Pro Leu Val Arg His Lys Gly
Ala Trp Ile Glu Leu Arg Pro Asn 500 505 510 Asp Leu Arg Asn Ala Glu
Arg Phe Cys Gly Ala Asn Pro Glu Leu Ser 515 520 525 Leu Asp Asp Ala
Leu Arg Leu Thr Ala Thr Glu Gly Glu Leu Met Met 530 535 540 Arg Leu
Pro Val His Arg Phe Asp Ala Gly Pro Arg Leu Gln Gly Val 545 550 555
560 Leu Glu Gln Tyr His Gln Gln Lys Ala Pro Asp Pro Leu Pro Ala Pro
565 570 575 Glu Gly Phe Ser Gly Gln Leu Arg Pro Tyr Gln Glu Arg Gly
Leu Gly 580 585 590 Trp Leu Ala Phe Leu His Arg Phe Asp Gln Gly Ala
Cys Leu Ala Asp 595 600 605 Asp Met Gly Leu Gly Lys Thr Ile Gln Leu
Leu Ala Phe Leu Gln His 610 615 620 Leu Lys Ala Glu His Glu Leu Lys
Arg Pro Val Leu Leu Val Ala Pro 625 630 635 640 Thr Ser Val Leu Thr
Asn Trp Arg Arg Glu Ala Glu Ala Phe Thr Pro 645 650 655 Glu Leu Ser
Val Lys Glu His Tyr Gly Pro Arg Arg Pro Ser Thr Pro 660 665 670 Ala
Ala Leu Lys Lys Glu Leu Lys Asp Val Asp Leu Val Leu Thr Ser 675 680
685 Tyr Gly Leu Met Gln Arg Asp Ser Glu Leu Leu Asp Ser Val Asp Trp
690 695 700 Gln Gly Val Val Ile Asp Glu Ala Gln Ala Ile Lys Asn Pro
Gly Ala 705 710 715 720 Lys Gln Ser Gln Ala Ala Arg Asp Leu Ala Arg
Ala Gly Lys Ser Ser 725 730 735 Arg Phe Arg Ile Ala Leu Thr Gly Thr
Pro Val Glu Asn Arg Val Ser 740 745 750 Glu Leu Trp Ala Leu Met Asp
Phe Leu Asn Pro Lys Val Leu Gly Glu 755 760 765 Glu Asp Phe Phe Arg
Gln Arg Tyr Arg Met Pro Ile Glu Arg Tyr Gly 770 775 780 Asp Met Ser
Ser Leu Arg Asp Leu Lys Ala Arg Val Gly Pro Phe Ile 785 790 795 800
Leu Arg Arg Leu Lys Thr Asp Lys Ser Ile Ile Ser Asp Leu Pro Glu 805
810 815 Lys Val Glu Leu Ser Glu Trp Val Gly Leu Ser Lys Glu Gln Lys
Ser 820 825 830 Leu Tyr Asn Lys Thr Val Glu Asp Thr Leu Asp Ala Ile
Ala Thr Ala 835 840 845 Pro Arg Gly Gln Arg His Gly Gln Val Leu Ala
Leu Leu Thr Arg Leu 850 855 860 Lys Gln Ile Cys Asn His Pro Ala Leu
Ala Gln Arg Glu Gly Ala Val 865 870 875 880 Asp Ser Glu Phe Leu Gly
Arg Ser Ala Lys Leu Met Arg Leu Glu Glu 885 890 895 Ile Leu Glu Glu
Val Ile Glu Ala Gly Asp Arg Ala Leu Leu Phe Thr 900 905 910 Gln Phe
Ala Glu Trp Gly His Leu Leu Gln Ala Trp Met Gln Gln Arg 915 920 925
Trp Lys Ser Glu Val Pro Phe Leu His Gly Gly Thr Arg Lys Ser Asp 930
935 940 Arg Gln Ala Met Val Asp Arg Phe Gln Glu Asp Pro Arg Gly Pro
Gln 945 950 955 960 Leu Phe Leu Leu Ser Leu Lys Ala Gly Gly Val Gly
Leu Asn Leu Thr 965 970 975 Arg Ala Ser His Val Phe His Val Asp Arg
Trp Trp Asn Pro Ala Val 980 985 990 Glu Asn Gln Ala Thr Asp Arg Ala
Tyr Arg Ile Gly Gln Thr Asn Arg 995 1000 1005 Val Met Val His Lys
Phe Val Thr Arg Gly Ser Val Glu Glu Lys 1010 1015 1020 Ile Asp Gln
Met Ile Arg Glu Lys Ala Arg Met Ala Glu Asp Val 1025 1030 1035 Ile
Gly Ser Gly Glu Asp Trp Leu Gly Ser Leu Gly Gly Asp Gln 1040 1045
1050 Leu Arg Asn Leu Val Ala Leu Glu Asp Thr 1055 1060
913198DNASynechococcus sp. 91atgagcctgc tgcacgccac ctggctcccg
gccatccgta cacccaccag ttccgggcgt 60gccgccctgc tggtgtgggc ggacacctgg
cgtgtggcgg agccggcggg ccccggcgtg 120accccggcca
cccatccctt caccctcagc gccgatgacc tgcgcgcctg gctgagcgaa
180cgggagctgc tgcccgacgg catcatcgat gccaccgcct gcctcaccct
gcccagccgc 240acggtgaaac cgaagcggaa gcgtggcgag accgcccctg
tggatgaggg ctggacgggt 300ctgcccctgc aggcgggaga accgattccg
aagcagaccg aatggtggcc ctggcaggta 360cagggcctgg cggtcgaacc
cggtgcagcc accgcctggc tggcccgctt gcccctctcc 420ggccgccacc
ccgacctcgc cgatgagctg cgctggtgga gccacatgca gcgctgggcc
480ctcagcctga ttgctcgcag tcgctggatt ccccaggtgg agctgagcaa
aggggagggc 540tacccccacc gcgcccgttg ggtgcctctg ctcaatcgcg
aagacgatcg ccgccgcctg 600gaagacatgg cggcccgcct gccgctggtg
gccacctgcg ctctcccctg gcgcgaaccc 660accgggaagc gcagcaaccg
caccacccgg ctgcggcctg aggcgatgcg ggcggccaat 720ccggtggcct
gttgtcgtcc ccgcagcggc cgactgcgcg tcgccaccct gctcgaagac
780ctggtggatg cccagctgcg cacgggtttc acagcccaga cggacgggct
cgatcccctg 840cttgccgcct gggaggaggc cctcggcagc gacaccggcg
tgatccacct gggcgatgaa 900gacgcagagc gtctggccac cgccagccat
cactggcgcg aaggggtggc cggcactgtg 960gcggcggcgc gggcctgcct
ggaactggag acccccgacg acggcgatga cctctggacc 1020ctgcggttcg
cactgcaggc cgaagcggat cccacgctca aggtgccggc cgccctcgcc
1080tgggcggccg gtccgaaggg actccagctc ggcgaaatcg ccgtggagca
tccgggcgaa 1140ctgctgctgg aaggcatggg ccgggcgctc acggtgtttc
caccgatcga acgcggtctc 1200gacagcgcca cgccggaagg gatgcaactc
acccccgccg aagccttcgt gctggtgcgc 1260accgcagccc gcgaactccg
cgatgtgggg gtgggcgtgg agcttccagc cagcctctcg 1320ggtggcctgg
cgagcaggct cggcctggcg attcaggcgg aactaccgga gaaatcccgc
1380ggtttcacgc tgggcgaaac cctcgactgg agctgggagc tgatgatcgg
cggcgtcacc 1440ctgacgctgc gggaactgga gcgcctggcg ggcaagcgca
gccccctggt gcggcacaag 1500ggcacctgga tcgagctgcg ccccaacgat
ctcaagaatg cggagcggtt tttcgccgcg 1560aagcccgatc tcagcctcga
cgatgccctg cgcctcaccg ccagcgaagg cgacacgctg 1620atgcgcatgc
cggtgcaccg cctggaagcg ggcccacggc tgcaggcggt gctcgagcag
1680tatcaccaac agaaagctcc cgatcccctg ccggcgccgg agggcttctg
cggccagctg 1740cggccttacc aggagcgggg cctcggctgg ctggcctttc
tgcaccgctt tgatcaaggc 1800gcctgcctgg ccgacgacat gggtctgggc
aagaccatcc agctgctcgc ctttctgcag 1860cacctgaagg ccgagcagga
gctgaagagg ccggtgttgc tcgtggcgcc cacctcggtg 1920ctcaccaact
ggaagcggga ggccgccgcc ttcacgccgg agctcgaggt gaaggagcac
1980tacgggccca ggcgccctgc cacccctgca gcactcaaga agagcctcaa
ggatgtggat 2040ctggtgctca ccagctacgg cctgctccaa cgcgacagcg
aactgctcga aagtctcgat 2100tggcaggggg tggtgatcga cgaagcgcag
gcaatcaaga atccgagcgc caaacagagc 2160atggcggccc gagacctggc
ccgcgcagga cgcagcagcc gtttccgcat tgccctcacc 2220ggcacgccgg
tggagaaccg ggtgagcgag ctctgggcct tgatggattt cctcaacccg
2280cgggtgctcg gcgaagagga cttcttccgc cagcgctacc gcatgccgat
tgagcgctat 2340ggcgacatgt cgtcgctgcg ggatctgaaa tcccgcgtgg
gacctttcat tcttcgccgg 2400ctcaaaaccg acaaagcgat catttccgac
ctgcccgaaa aggtggaact gagcgaatgg 2460gtgggattga gcagggagca
gaaagcgctc tatgccaaaa ccgtcgagga caccctcgat 2520gcgattgccc
gggcgccccg cggacaacgg catggccagg tgctggggtt gctcaccaag
2580ctgaagcaga tctgtaacca tcccgccctg gccctgaaag aggaggcggc
cggcgacgag 2640ttcctgcagc gctccatgaa actgcagcgc ctggaggaaa
tcctcgagga ggtgatcgac 2700gccggcgacc gcgccctgct cttcacccag
ttcgccgaat ggggccatct gctgcagggt 2760tacctgcaac ggcgctggcg
cagcgaagtg ccgttcctga acggcagcac cagcaagagc 2820gaacgccagg
cgatggtcga tcgcttccag gaagacccgc gggggcctca gctgttcctg
2880ctgtcactga aagccggtgg tgtgggcctc aacctcaccc gcgccagcca
tgtgtttcac 2940atcgatcgct ggtggaatcc ggcggtggaa aaccaggcca
ccgaccgcgc ctaccggatc 3000ggccagacga accgggtgat ggtgcacaag
ttcatcacca gtggatcggt cgaagaaaaa 3060atcgaccgga tgatccgcga
gaaatcacgc ctcgccgaag acatcatcgg ctcaggcgaa 3120gattggctcg
gcgggctcga catgggccag ctgaaggaac tggtgagcct cgacgacaac
3180ggatcacttt cagcatga 3198921065PRTSynechococcus sp. 92Met Ser
Leu Leu His Ala Thr Trp Leu Pro Ala Ile Arg Thr Pro Thr 1 5 10 15
Ser Ser Gly Arg Ala Ala Leu Leu Val Trp Ala Asp Thr Trp Arg Val 20
25 30 Ala Glu Pro Ala Gly Pro Gly Val Thr Pro Ala Thr His Pro Phe
Thr 35 40 45 Leu Ser Ala Asp Asp Leu Arg Ala Trp Leu Ser Glu Arg
Glu Leu Leu 50 55 60 Pro Asp Gly Ile Ile Asp Ala Thr Ala Cys Leu
Thr Leu Pro Ser Arg 65 70 75 80 Thr Val Lys Pro Lys Arg Lys Arg Gly
Glu Thr Ala Pro Val Asp Glu 85 90 95 Gly Trp Thr Gly Leu Pro Leu
Gln Ala Gly Glu Pro Ile Pro Lys Gln 100 105 110 Thr Glu Trp Trp Pro
Trp Gln Val Gln Gly Leu Ala Val Glu Pro Gly 115 120 125 Ala Ala Thr
Ala Trp Leu Ala Arg Leu Pro Leu Ser Gly Arg His Pro 130 135 140 Asp
Leu Ala Asp Glu Leu Arg Trp Trp Ser His Met Gln Arg Trp Ala 145 150
155 160 Leu Ser Leu Ile Ala Arg Ser Arg Trp Ile Pro Gln Val Glu Leu
Ser 165 170 175 Lys Gly Glu Gly Tyr Pro His Arg Ala Arg Trp Val Pro
Leu Leu Asn 180 185 190 Arg Glu Asp Asp Arg Arg Arg Leu Glu Asp Met
Ala Ala Arg Leu Pro 195 200 205 Leu Val Ala Thr Cys Ala Leu Pro Trp
Arg Glu Pro Thr Gly Lys Arg 210 215 220 Ser Asn Arg Thr Thr Arg Leu
Arg Pro Glu Ala Met Arg Ala Ala Asn 225 230 235 240 Pro Val Ala Cys
Cys Arg Pro Arg Ser Gly Arg Leu Arg Val Ala Thr 245 250 255 Leu Leu
Glu Asp Leu Val Asp Ala Gln Leu Arg Thr Gly Phe Thr Ala 260 265 270
Gln Thr Asp Gly Leu Asp Pro Leu Leu Ala Ala Trp Glu Glu Ala Leu 275
280 285 Gly Ser Asp Thr Gly Val Ile His Leu Gly Asp Glu Asp Ala Glu
Arg 290 295 300 Leu Ala Thr Ala Ser His His Trp Arg Glu Gly Val Ala
Gly Thr Val 305 310 315 320 Ala Ala Ala Arg Ala Cys Leu Glu Leu Glu
Thr Pro Asp Asp Gly Asp 325 330 335 Asp Leu Trp Thr Leu Arg Phe Ala
Leu Gln Ala Glu Ala Asp Pro Thr 340 345 350 Leu Lys Val Pro Ala Ala
Leu Ala Trp Ala Ala Gly Pro Lys Gly Leu 355 360 365 Gln Leu Gly Glu
Ile Ala Val Glu His Pro Gly Glu Leu Leu Leu Glu 370 375 380 Gly Met
Gly Arg Ala Leu Thr Val Phe Pro Pro Ile Glu Arg Gly Leu 385 390 395
400 Asp Ser Ala Thr Pro Glu Gly Met Gln Leu Thr Pro Ala Glu Ala Phe
405 410 415 Val Leu Val Arg Thr Ala Ala Arg Glu Leu Arg Asp Val Gly
Val Gly 420 425 430 Val Glu Leu Pro Ala Ser Leu Ser Gly Gly Leu Ala
Ser Arg Leu Gly 435 440 445 Leu Ala Ile Gln Ala Glu Leu Pro Glu Lys
Ser Arg Gly Phe Thr Leu 450 455 460 Gly Glu Thr Leu Asp Trp Ser Trp
Glu Leu Met Ile Gly Gly Val Thr 465 470 475 480 Leu Thr Leu Arg Glu
Leu Glu Arg Leu Ala Gly Lys Arg Ser Pro Leu 485 490 495 Val Arg His
Lys Gly Thr Trp Ile Glu Leu Arg Pro Asn Asp Leu Lys 500 505 510 Asn
Ala Glu Arg Phe Phe Ala Ala Lys Pro Asp Leu Ser Leu Asp Asp 515 520
525 Ala Leu Arg Leu Thr Ala Ser Glu Gly Asp Thr Leu Met Arg Met Pro
530 535 540 Val His Arg Leu Glu Ala Gly Pro Arg Leu Gln Ala Val Leu
Glu Gln 545 550 555 560 Tyr His Gln Gln Lys Ala Pro Asp Pro Leu Pro
Ala Pro Glu Gly Phe 565 570 575 Cys Gly Gln Leu Arg Pro Tyr Gln Glu
Arg Gly Leu Gly Trp Leu Ala 580 585 590 Phe Leu His Arg Phe Asp Gln
Gly Ala Cys Leu Ala Asp Asp Met Gly 595 600 605 Leu Gly Lys Thr Ile
Gln Leu Leu Ala Phe Leu Gln His Leu Lys Ala 610 615 620 Glu Gln Glu
Leu Lys Arg Pro Val Leu Leu Val Ala Pro Thr Ser Val 625 630 635 640
Leu Thr Asn Trp Lys Arg Glu Ala Ala Ala Phe Thr Pro Glu Leu Glu 645
650 655 Val Lys Glu His Tyr Gly Pro Arg Arg Pro Ala Thr Pro Ala Ala
Leu 660 665 670 Lys Lys Ser Leu Lys Asp Val Asp Leu Val Leu Thr Ser
Tyr Gly Leu 675 680 685 Leu Gln Arg Asp Ser Glu Leu Leu Glu Ser Leu
Asp Trp Gln Gly Val 690 695 700 Val Ile Asp Glu Ala Gln Ala Ile Lys
Asn Pro Ser Ala Lys Gln Ser 705 710 715 720 Met Ala Ala Arg Asp Leu
Ala Arg Ala Gly Arg Ser Ser Arg Phe Arg 725 730 735 Ile Ala Leu Thr
Gly Thr Pro Val Glu Asn Arg Val Ser Glu Leu Trp 740 745 750 Ala Leu
Met Asp Phe Leu Asn Pro Arg Val Leu Gly Glu Glu Asp Phe 755 760 765
Phe Arg Gln Arg Tyr Arg Met Pro Ile Glu Arg Tyr Gly Asp Met Ser 770
775 780 Ser Leu Arg Asp Leu Lys Ser Arg Val Gly Pro Phe Ile Leu Arg
Arg 785 790 795 800 Leu Lys Thr Asp Lys Ala Ile Ile Ser Asp Leu Pro
Glu Lys Val Glu 805 810 815 Leu Ser Glu Trp Val Gly Leu Ser Arg Glu
Gln Lys Ala Leu Tyr Ala 820 825 830 Lys Thr Val Glu Asp Thr Leu Asp
Ala Ile Ala Arg Ala Pro Arg Gly 835 840 845 Gln Arg His Gly Gln Val
Leu Gly Leu Leu Thr Lys Leu Lys Gln Ile 850 855 860 Cys Asn His Pro
Ala Leu Ala Leu Lys Glu Glu Ala Ala Gly Asp Glu 865 870 875 880 Phe
Leu Gln Arg Ser Met Lys Leu Gln Arg Leu Glu Glu Ile Leu Glu 885 890
895 Glu Val Ile Asp Ala Gly Asp Arg Ala Leu Leu Phe Thr Gln Phe Ala
900 905 910 Glu Trp Gly His Leu Leu Gln Gly Tyr Leu Gln Arg Arg Trp
Arg Ser 915 920 925 Glu Val Pro Phe Leu Asn Gly Ser Thr Ser Lys Ser
Glu Arg Gln Ala 930 935 940 Met Val Asp Arg Phe Gln Glu Asp Pro Arg
Gly Pro Gln Leu Phe Leu 945 950 955 960 Leu Ser Leu Lys Ala Gly Gly
Val Gly Leu Asn Leu Thr Arg Ala Ser 965 970 975 His Val Phe His Ile
Asp Arg Trp Trp Asn Pro Ala Val Glu Asn Gln 980 985 990 Ala Thr Asp
Arg Ala Tyr Arg Ile Gly Gln Thr Asn Arg Val Met Val 995 1000 1005
His Lys Phe Ile Thr Ser Gly Ser Val Glu Glu Lys Ile Asp Arg 1010
1015 1020 Met Ile Arg Glu Lys Ser Arg Leu Ala Glu Asp Ile Ile Gly
Ser 1025 1030 1035 Gly Glu Asp Trp Leu Gly Gly Leu Asp Met Gly Gln
Leu Lys Glu 1040 1045 1050 Leu Val Ser Leu Asp Asp Asn Gly Ser Leu
Ser Ala 1055 1060 1065 933213DNASynechococcus sp. 93atgagcctgc
tgcacgccac ctggctaccc gccatccgca ctcccagcag ctccggaagg 60gctgctttgc
tggtatgggc tgacacctgg cgtgtggccg accccctcgg ccccggggcc
120acacccgccc ttcatccgtt caccctgagc gcggaggatc tgcgcgcctg
gctcacagag 180cgcgatttgc ttccggacgg aatcatcgat gcgaccgcat
gcctcaccct gccgagccgc 240agtgtgaaac cacggcggcc ccgtggctca
gctgccgcca ccccctcatc agaagagcag 300cccccttggt gcgggctgcc
gctgcaagcc ggcgaaccga tcccgaaaac caccgagtgg 360tggccatggc
aggtgcaggg gctggcgatc gaaccgatgg ccgccacggc atggctggcc
420aagcttccac tgtcaggcca tcaccctgat ctggccgatg agttgcgctg
gtggagtcac 480atgcagcgat gggccctcag tcttgtggct agggggcgct
ggctgcccca ggtggaattg 540agccgaggtg aggggtatcc acaccgggcc
cgctgggtcc cgcttctcaa tcgagaggaa 600gaccggcgcc gcctggagga
ccttgccgcc cgtctgcccc tggttgccac gtgtgcgttg 660ccctggagag
agcccacagg aaagcgcagc aatcgcatca ccaggctgcg cccagaggcc
720atgcgcgctg ccaatcccgt ggcctgctgt cgtccccgca gcggtcgatt
gcgggtggcc 780acattgctgg aggatctggt agatgcccag ctgcgcaagg
gcttccatcc cgatgacgag 840gggctcgacc ccctgctctg cgcctgggaa
aacgccctga gttcggagac cggggtgatc 900gatctgaatg atgaagatgc
cgaacgcctt gccacggcga gccaccactg gcgcgaggga 960gtggctggca
atgtggcggc tgccagggcc tgccttgaac tcgccacacc gaacgagggg
1020gaagagctct gggatctgcg cttctatctg caggccgaag ccgatccaac
gctgaaggta 1080ccggccggag cagcctgggc cgctggaccc gaaggccttc
aactcgggga gattcctgtg 1140gagcatcccg gtgaggtgct gctcgaaggc
atggggcgtg ctctcacggt gttcgaacca 1200atcgaacggg gcctggatag
cgccacgccg gaagcgatgc agctcacccc ggcggaagcc 1260ttcgtgctgg
tgcgcaccgc cgcccgtcag ctccgggacg tgggcgttgg tgtggatctc
1320cctcccagcc tctcgggagg cctggccagc cgcctcggtc tggcgatcaa
ggccgaacta 1380cccaaacgct cgcgggggtt cacccttggg gaaaatctcg
actggaactg ggagctgatg 1440atcgggggcg tcaccctgac gctgcgggag
ctggaacggc tggccggcaa gcgcagcccc 1500ttggtgcgcc acaagggggc
ctggatcgaa ctcaggccca atgatctcaa aaatgcagaa 1560cgattctgtg
ccgccaatcc tgatctgagc ctggacgatg cccttcgcct gacggccagc
1620gaaggggaca cgctgatgcg cctccccgtt catgcctttg atgctggccc
tcgccttcaa 1680ggggtgttgg agcaatacca ccagcagaaa gcaccggatc
cacttcctgc gcccgagggt 1740ttctgcggtc agcttcgccc ttaccaggaa
cgaggcctgg gctggctggc cttcctgcac 1800cgcttcgatc agggagcctg
cctcgccgac gacatgggcc tgggcaagac gatccagctg 1860ctggccttcc
tccagcacct gaagatggaa caagaactga aacggccggt gctgctggtg
1920gctcccacct ccgtgctcac caactggaaa cgggaagccg cggccttcac
ccccgagctc 1980acagtgcatg agcactacgg ccccaaacga ccctccaccc
cagcagcact gaaaaaagcc 2040ctgaaagacg ttgacctggt gctcaccagc
tacgggcttc tgcaaagaga cagtgaactg 2100cttgaaagtt tcgactggca
gggaaccgtg atcgatgaag ctcaggcgat caagaaccct 2160tcggccaagc
aaagccaggc agcccgtgat ctggctcgca cccgcaaggg ctccaggttc
2220cgcattgccc tcactggcac accggttgaa aacagagtga gcgagctctg
ggccctgatg 2280gatttcctca atccgaacgt gctcggcgaa gaggaatttt
tccggcagcg ctaccgcatg 2340ccgatcgaac gctatggcga tatgtcgtcg
cttcgcgatc tcaagtcgcg ggtgggacca 2400ttcattctgc ggcgcttgaa
aaccgacaag gcgatcatct ccgacctccc cgaaaaagtg 2460gagctgagtg
aatgggtggg gctgagcaag gaacagaagt ccctttacgc gaaaaccgtg
2520gagaacaccc tcgatgccat cgcccgagct ccccgaggca agcgtcacgg
ccaggtgctg 2580ggactgctga cgcgcctcaa acagatctgc aatcacccgg
ctctggcctt aaaggaagag 2640gtggcaggcg acgacttcct gcagcgatcg
gtgaagctgc agcggctcga agagattctc 2700gaagaggtga ttgcagcggg
ggatcgagcc ctgctgttca cccagttcgc ggaatggggg 2760catctgctgc
agggctacct gcaacgccgc tggcgcagcg aggtgccgtt cctgagcggc
2820agcactagca aaggagaacg tcaggccatg gtggatcgct tccaggaaga
cccgcgcggc 2880ccccagctgt tcctgttgtc cctcaaagcc ggcggtgtgg
gattgaacct gacccgggcc 2940agccacgtgt tccacatcga ccgctggtgg
aatcctgcag ttgaaaacca ggccactgac 3000cgtgcttacc ggattggcca
gaccaatcgg gtgatggtgc ataagttcat caccagtggc 3060tcagtggaag
agaagatcga ccggatgatc cgggagaagt ccagactggc ggaagacatc
3120gtgggctccg gcgaggagtg gctcggtggc ttcgacatgg gccaactcaa
ggagctggtg 3180agcctcgagg acaacgaaac acgcaaccca tga
3213941070PRTSynechococcus sp. 94Met Ser Leu Leu His Ala Thr Trp
Leu Pro Ala Ile Arg Thr Pro Ser 1 5 10 15 Ser Ser Gly Arg Ala Ala
Leu Leu Val Trp Ala Asp Thr Trp Arg Val 20 25 30 Ala Asp Pro Leu
Gly Pro Gly Ala Thr Pro Ala Leu His Pro Phe Thr 35 40 45 Leu Ser
Ala Glu Asp Leu Arg Ala Trp Leu Thr Glu Arg Asp Leu Leu 50 55 60
Pro Asp Gly Ile Ile Asp Ala Thr Ala Cys Leu Thr Leu Pro Ser Arg 65
70 75 80 Ser Val Lys Pro Arg Arg Pro Arg Gly Ser Ala Ala Ala Thr
Pro Ser 85 90 95 Ser Glu Glu Gln Pro Pro Trp Cys Gly Leu Pro Leu
Gln Ala Gly Glu 100 105 110 Pro Ile Pro Lys Thr Thr Glu Trp Trp Pro
Trp Gln Val Gln Gly Leu 115 120 125 Ala Ile Glu Pro Met Ala Ala Thr
Ala Trp Leu Ala Lys Leu Pro Leu 130 135 140 Ser Gly His His Pro Asp
Leu Ala Asp Glu Leu Arg Trp Trp Ser His 145 150 155 160 Met Gln Arg
Trp Ala Leu Ser Leu Val Ala Arg Gly Arg Trp Leu Pro 165 170 175 Gln
Val Glu Leu Ser Arg Gly Glu Gly Tyr Pro His Arg Ala Arg Trp 180 185
190 Val Pro Leu Leu Asn Arg Glu Glu Asp Arg Arg Arg Leu Glu Asp Leu
195 200 205 Ala Ala Arg Leu Pro Leu Val Ala Thr Cys Ala Leu Pro Trp
Arg Glu 210 215 220 Pro Thr Gly Lys Arg Ser Asn Arg Ile Thr Arg Leu
Arg Pro Glu Ala 225 230 235 240 Met Arg Ala Ala Asn Pro Val Ala Cys
Cys Arg Pro Arg Ser Gly Arg 245 250
255 Leu Arg Val Ala Thr Leu Leu Glu Asp Leu Val Asp Ala Gln Leu Arg
260 265 270 Lys Gly Phe His Pro Asp Asp Glu Gly Leu Asp Pro Leu Leu
Cys Ala 275 280 285 Trp Glu Asn Ala Leu Ser Ser Glu Thr Gly Val Ile
Asp Leu Asn Asp 290 295 300 Glu Asp Ala Glu Arg Leu Ala Thr Ala Ser
His His Trp Arg Glu Gly 305 310 315 320 Val Ala Gly Asn Val Ala Ala
Ala Arg Ala Cys Leu Glu Leu Ala Thr 325 330 335 Pro Asn Glu Gly Glu
Glu Leu Trp Asp Leu Arg Phe Tyr Leu Gln Ala 340 345 350 Glu Ala Asp
Pro Thr Leu Lys Val Pro Ala Gly Ala Ala Trp Ala Ala 355 360 365 Gly
Pro Glu Gly Leu Gln Leu Gly Glu Ile Pro Val Glu His Pro Gly 370 375
380 Glu Val Leu Leu Glu Gly Met Gly Arg Ala Leu Thr Val Phe Glu Pro
385 390 395 400 Ile Glu Arg Gly Leu Asp Ser Ala Thr Pro Glu Ala Met
Gln Leu Thr 405 410 415 Pro Ala Glu Ala Phe Val Leu Val Arg Thr Ala
Ala Arg Gln Leu Arg 420 425 430 Asp Val Gly Val Gly Val Asp Leu Pro
Pro Ser Leu Ser Gly Gly Leu 435 440 445 Ala Ser Arg Leu Gly Leu Ala
Ile Lys Ala Glu Leu Pro Lys Arg Ser 450 455 460 Arg Gly Phe Thr Leu
Gly Glu Asn Leu Asp Trp Asn Trp Glu Leu Met 465 470 475 480 Ile Gly
Gly Val Thr Leu Thr Leu Arg Glu Leu Glu Arg Leu Ala Gly 485 490 495
Lys Arg Ser Pro Leu Val Arg His Lys Gly Ala Trp Ile Glu Leu Arg 500
505 510 Pro Asn Asp Leu Lys Asn Ala Glu Arg Phe Cys Ala Ala Asn Pro
Asp 515 520 525 Leu Ser Leu Asp Asp Ala Leu Arg Leu Thr Ala Ser Glu
Gly Asp Thr 530 535 540 Leu Met Arg Leu Pro Val His Ala Phe Asp Ala
Gly Pro Arg Leu Gln 545 550 555 560 Gly Val Leu Glu Gln Tyr His Gln
Gln Lys Ala Pro Asp Pro Leu Pro 565 570 575 Ala Pro Glu Gly Phe Cys
Gly Gln Leu Arg Pro Tyr Gln Glu Arg Gly 580 585 590 Leu Gly Trp Leu
Ala Phe Leu His Arg Phe Asp Gln Gly Ala Cys Leu 595 600 605 Ala Asp
Asp Met Gly Leu Gly Lys Thr Ile Gln Leu Leu Ala Phe Leu 610 615 620
Gln His Leu Lys Met Glu Gln Glu Leu Lys Arg Pro Val Leu Leu Val 625
630 635 640 Ala Pro Thr Ser Val Leu Thr Asn Trp Lys Arg Glu Ala Ala
Ala Phe 645 650 655 Thr Pro Glu Leu Thr Val His Glu His Tyr Gly Pro
Lys Arg Pro Ser 660 665 670 Thr Pro Ala Ala Leu Lys Lys Ala Leu Lys
Asp Val Asp Leu Val Leu 675 680 685 Thr Ser Tyr Gly Leu Leu Gln Arg
Asp Ser Glu Leu Leu Glu Ser Phe 690 695 700 Asp Trp Gln Gly Thr Val
Ile Asp Glu Ala Gln Ala Ile Lys Asn Pro 705 710 715 720 Ser Ala Lys
Gln Ser Gln Ala Ala Arg Asp Leu Ala Arg Thr Arg Lys 725 730 735 Gly
Ser Arg Phe Arg Ile Ala Leu Thr Gly Thr Pro Val Glu Asn Arg 740 745
750 Val Ser Glu Leu Trp Ala Leu Met Asp Phe Leu Asn Pro Asn Val Leu
755 760 765 Gly Glu Glu Glu Phe Phe Arg Gln Arg Tyr Arg Met Pro Ile
Glu Arg 770 775 780 Tyr Gly Asp Met Ser Ser Leu Arg Asp Leu Lys Ser
Arg Val Gly Pro 785 790 795 800 Phe Ile Leu Arg Arg Leu Lys Thr Asp
Lys Ala Ile Ile Ser Asp Leu 805 810 815 Pro Glu Lys Val Glu Leu Ser
Glu Trp Val Gly Leu Ser Lys Glu Gln 820 825 830 Lys Ser Leu Tyr Ala
Lys Thr Val Glu Asn Thr Leu Asp Ala Ile Ala 835 840 845 Arg Ala Pro
Arg Gly Lys Arg His Gly Gln Val Leu Gly Leu Leu Thr 850 855 860 Arg
Leu Lys Gln Ile Cys Asn His Pro Ala Leu Ala Leu Lys Glu Glu 865 870
875 880 Val Ala Gly Asp Asp Phe Leu Gln Arg Ser Val Lys Leu Gln Arg
Leu 885 890 895 Glu Glu Ile Leu Glu Glu Val Ile Ala Ala Gly Asp Arg
Ala Leu Leu 900 905 910 Phe Thr Gln Phe Ala Glu Trp Gly His Leu Leu
Gln Gly Tyr Leu Gln 915 920 925 Arg Arg Trp Arg Ser Glu Val Pro Phe
Leu Ser Gly Ser Thr Ser Lys 930 935 940 Gly Glu Arg Gln Ala Met Val
Asp Arg Phe Gln Glu Asp Pro Arg Gly 945 950 955 960 Pro Gln Leu Phe
Leu Leu Ser Leu Lys Ala Gly Gly Val Gly Leu Asn 965 970 975 Leu Thr
Arg Ala Ser His Val Phe His Ile Asp Arg Trp Trp Asn Pro 980 985 990
Ala Val Glu Asn Gln Ala Thr Asp Arg Ala Tyr Arg Ile Gly Gln Thr 995
1000 1005 Asn Arg Val Met Val His Lys Phe Ile Thr Ser Gly Ser Val
Glu 1010 1015 1020 Glu Lys Ile Asp Arg Met Ile Arg Glu Lys Ser Arg
Leu Ala Glu 1025 1030 1035 Asp Ile Val Gly Ser Gly Glu Glu Trp Leu
Gly Gly Phe Asp Met 1040 1045 1050 Gly Gln Leu Lys Glu Leu Val Ser
Leu Glu Asp Asn Glu Thr Arg 1055 1060 1065 Asn Pro 1070
953192DNASynechococcus sp. 95atgagcctgc tgcacgccac ctggcttccc
gccatccgta cctctggcag ttccggccaa 60ccggcactgc tcatttgggc tgacacctgg
cgggtggcga caccagaggg ccccgggcta 120actccggcgc tgcacccgtt
caccctggaa cccgacgacc tcaaggcctg gcttcaggaa 180cgcgacctgt
tgccaggcgg cagcatcgat gccaccgcct gcctcaccct gcccagtcgc
240acggtaaaac cccgcaagag ccgcagcaaa acggccgaac cagcgcccga
agagcccatc 300tggaccggtc tgccgatgca ggccggagag ccgattccga
aacagacaga atggtggccg 360tggcaagtcc agggcctcgc tgtcgagccc
tctgccgcca cggagtggct ctcacgcctt 420cccctgtcag gacggaatcc
agacctggcc gatgagctgc gctggtggag ccacctgcag 480cgctgggccc
tcagccttgt ggcccggggg cgctggattc cccagatgga actgagcaaa
540ggcgagggat atccccaccg ggcccgttgg gtgcctctgc tcaaccgcga
ggaggaccgg 600cgacgtctgg aggatctggc cgccagcctg ccgctggtgg
ccacctgcgc cctgccctgg 660cgggaaccga tgggtcggcg cagcaaccgc
atgacacggc tgcgtccgga ggccatgcgt 720gccgccaacc cggtggcctg
ctgccggccc cgcagtggcc gcctgcgggt ggccacgctg 780ctggaggatc
tggtcgacgc acagctgcgc aaggactttg aaccatccac cgacggcctc
840gatcccctgt tgaccctgtg gcaagacgcc ctgggctccg aaacaggggt
gattgagatc 900ggtgatgaac aggccgaacg gctggccagc gccagcttcc
attggcgcga gggcatcgct 960ggagatttcg ccgctgcacg cacctgcctg
gaactgcaga cacctgcaga gggagaagag 1020ctctgggagc tgcggtttgg
gctgcaggcg gagtcggatc cgagcctcaa gctgcccgcc 1080gctgcggcct
gggcctccgg tgccgaccaa ctccagttgg gagaagtgac agtcgagcag
1140cccggtgaag tgctgctgga gggtctggga cgcgccctca ccgtgttccc
accgatcgaa 1200aggggcctgg agaccgctac gcctgacacg atgcagctga
cccccgccga agccttcgtg 1260ctggtgcgga ccgcagcgcg gcagctgcgg
gatgccggcg tcggcgtcga ccttcccccc 1320agcctgtcgg ggggcctggc
cagccgcctg ggtctggcga tcaaggcgga gctgccagag 1380cgctccagcg
gcttcagcct cggcgaatcc ctcgactgga gctgggatct gatgatcggc
1440ggggtgacgc tcaccctgcg ggaactggag cggttgagcg gcaaacgcag
ccccctcgtg 1500cgccacaagg gggcctggat cgaattgcga ccgaacgatc
tgagaaacgc cgaacgcttc 1560tgcggtgcca acccggagct cagcctggac
gatgccctgc ggatcaccgc caccgaaggc 1620gatctgctga tgcgtctgcc
ggtgcatcgc tttgaggccg gccccaggct gcaggcggtg 1680ctggagcagt
accaccagca gaaggccccg gatccgttgc cagcgccgga ggggttctgc
1740ggccagctgc ggccttacca ggagcgtggc ctgggctggc tggccttcct
caaccgcttc 1800gaccaaggcg cctgcctggc ggacgacatg ggtctgggta
agaccatcca gctgctggcc 1860ttcctgcagc acctgaaagc agagcaggaa
ctgaagcgcc cggtgctgct ggtggccccc 1920acatcggtgc tcacaaactg
gcgacgggaa gcggaagcct tcacccccga actggcggtg 1980cgcgagcact
acggaccgcg gcgtccctcc actccggctg cgctgaagaa ggcgttgaag
2040gatgtcgact tagtcctcac cagctacggc ctactgcaga gggacagtga
attgctggag 2100tctcaggatt ggcagggggt tgtgatcgat gaagcccaag
cgatcaagaa tcccagtgcc 2160aagcagagcc aggcagcccg agacctggcc
agaccagcca aaggcaaccg cttccgcatc 2220gccctcacgg gcacaccggt
ggagaacagg gtcagcgagc tctgggcttt gatggatttc 2280ctcagtccca
aggtgctggg agaagaagac ttcttccgtc agcgctaccg gatgccgatc
2340gagcgctatg gcgacatggc atccctacgg gacttaaaag ccagggtcgg
ccccttcatc 2400ctgcgccggc tgaaaaccga caagacgatc atttccgatc
tgcccgagaa ggtggaactc 2460agcgaatggg tggggttgag caaggagcag
aaatcgctgt acagcaaaac cgttgaagac 2520accctggatg ccattgcccg
ggcgcctcgt ggacagcgcc atggtcaggt gctgggactg 2580ctcacccgcc
tgaagcagat ctgcaaccat ccggccctgg cattgagtga aaacgctgtt
2640gacgacggct ttctggggcg ctccgccaag ttgcaacggc ttgaggaaat
cctcgatgag 2700gtgatcgaag caggggatcg ggcgctgctg ttcacccagt
tcgccgagtg gggccatctg 2760ctgcagtcct ggatgcaaca acgttggaag
gcggatgtgc ccttcctgca tggagggacg 2820cgcaaaaacg aacggcaggc
catggtggat cgttttcagg aggacccccg cggcccgcag 2880ctgttcctgc
tgtcgctcaa agccggcggg gtgggtctga acctgaccag ggccagccac
2940gtgttccaca tcgatcgctg gtggaaccct gcggtagaga accaggccac
cgaccgtgct 3000tatcggatcg gccagaccaa ccgggtgatg gtgcacaaat
tcatcacaag cggatccgta 3060gaagaaaaaa ttgaccggat gatccgagag
aagtcgcgcc tggcagagga tgtgatcggt 3120tccggtgaag actggctcgg
gtgcctggcc ggtgatcagc tgcgcaatct cgttgccctg 3180gaggacacct ga
3192961063PRTSynechococcus sp. 96Met Ser Leu Leu His Ala Thr Trp
Leu Pro Ala Ile Arg Thr Ser Gly 1 5 10 15 Ser Ser Gly Gln Pro Ala
Leu Leu Ile Trp Ala Asp Thr Trp Arg Val 20 25 30 Ala Thr Pro Glu
Gly Pro Gly Leu Thr Pro Ala Leu His Pro Phe Thr 35 40 45 Leu Glu
Pro Asp Asp Leu Lys Ala Trp Leu Gln Glu Arg Asp Leu Leu 50 55 60
Pro Gly Gly Ser Ile Asp Ala Thr Ala Cys Leu Thr Leu Pro Ser Arg 65
70 75 80 Thr Val Lys Pro Arg Lys Ser Arg Ser Lys Thr Ala Glu Pro
Ala Pro 85 90 95 Glu Glu Pro Ile Trp Thr Gly Leu Pro Met Gln Ala
Gly Glu Pro Ile 100 105 110 Pro Lys Gln Thr Glu Trp Trp Pro Trp Gln
Val Gln Gly Leu Ala Val 115 120 125 Glu Pro Ser Ala Ala Thr Glu Trp
Leu Ser Arg Leu Pro Leu Ser Gly 130 135 140 Arg Asn Pro Asp Leu Ala
Asp Glu Leu Arg Trp Trp Ser His Leu Gln 145 150 155 160 Arg Trp Ala
Leu Ser Leu Val Ala Arg Gly Arg Trp Ile Pro Gln Met 165 170 175 Glu
Leu Ser Lys Gly Glu Gly Tyr Pro His Arg Ala Arg Trp Val Pro 180 185
190 Leu Leu Asn Arg Glu Glu Asp Arg Arg Arg Leu Glu Asp Leu Ala Ala
195 200 205 Ser Leu Pro Leu Val Ala Thr Cys Ala Leu Pro Trp Arg Glu
Pro Met 210 215 220 Gly Arg Arg Ser Asn Arg Met Thr Arg Leu Arg Pro
Glu Ala Met Arg 225 230 235 240 Ala Ala Asn Pro Val Ala Cys Cys Arg
Pro Arg Ser Gly Arg Leu Arg 245 250 255 Val Ala Thr Leu Leu Glu Asp
Leu Val Asp Ala Gln Leu Arg Lys Asp 260 265 270 Phe Glu Pro Ser Thr
Asp Gly Leu Asp Pro Leu Leu Thr Leu Trp Gln 275 280 285 Asp Ala Leu
Gly Ser Glu Thr Gly Val Ile Glu Ile Gly Asp Glu Gln 290 295 300 Ala
Glu Arg Leu Ala Ser Ala Ser Phe His Trp Arg Glu Gly Ile Ala 305 310
315 320 Gly Asp Phe Ala Ala Ala Arg Thr Cys Leu Glu Leu Gln Thr Pro
Ala 325 330 335 Glu Gly Glu Glu Leu Trp Glu Leu Arg Phe Gly Leu Gln
Ala Glu Ser 340 345 350 Asp Pro Ser Leu Lys Leu Pro Ala Ala Ala Ala
Trp Ala Ser Gly Ala 355 360 365 Asp Gln Leu Gln Leu Gly Glu Val Thr
Val Glu Gln Pro Gly Glu Val 370 375 380 Leu Leu Glu Gly Leu Gly Arg
Ala Leu Thr Val Phe Pro Pro Ile Glu 385 390 395 400 Arg Gly Leu Glu
Thr Ala Thr Pro Asp Thr Met Gln Leu Thr Pro Ala 405 410 415 Glu Ala
Phe Val Leu Val Arg Thr Ala Ala Arg Gln Leu Arg Asp Ala 420 425 430
Gly Val Gly Val Asp Leu Pro Pro Ser Leu Ser Gly Gly Leu Ala Ser 435
440 445 Arg Leu Gly Leu Ala Ile Lys Ala Glu Leu Pro Glu Arg Ser Ser
Gly 450 455 460 Phe Ser Leu Gly Glu Ser Leu Asp Trp Ser Trp Asp Leu
Met Ile Gly 465 470 475 480 Gly Val Thr Leu Thr Leu Arg Glu Leu Glu
Arg Leu Ser Gly Lys Arg 485 490 495 Ser Pro Leu Val Arg His Lys Gly
Ala Trp Ile Glu Leu Arg Pro Asn 500 505 510 Asp Leu Arg Asn Ala Glu
Arg Phe Cys Gly Ala Asn Pro Glu Leu Ser 515 520 525 Leu Asp Asp Ala
Leu Arg Ile Thr Ala Thr Glu Gly Asp Leu Leu Met 530 535 540 Arg Leu
Pro Val His Arg Phe Glu Ala Gly Pro Arg Leu Gln Ala Val 545 550 555
560 Leu Glu Gln Tyr His Gln Gln Lys Ala Pro Asp Pro Leu Pro Ala Pro
565 570 575 Glu Gly Phe Cys Gly Gln Leu Arg Pro Tyr Gln Glu Arg Gly
Leu Gly 580 585 590 Trp Leu Ala Phe Leu Asn Arg Phe Asp Gln Gly Ala
Cys Leu Ala Asp 595 600 605 Asp Met Gly Leu Gly Lys Thr Ile Gln Leu
Leu Ala Phe Leu Gln His 610 615 620 Leu Lys Ala Glu Gln Glu Leu Lys
Arg Pro Val Leu Leu Val Ala Pro 625 630 635 640 Thr Ser Val Leu Thr
Asn Trp Arg Arg Glu Ala Glu Ala Phe Thr Pro 645 650 655 Glu Leu Ala
Val Arg Glu His Tyr Gly Pro Arg Arg Pro Ser Thr Pro 660 665 670 Ala
Ala Leu Lys Lys Ala Leu Lys Asp Val Asp Leu Val Leu Thr Ser 675 680
685 Tyr Gly Leu Leu Gln Arg Asp Ser Glu Leu Leu Glu Ser Gln Asp Trp
690 695 700 Gln Gly Val Val Ile Asp Glu Ala Gln Ala Ile Lys Asn Pro
Ser Ala 705 710 715 720 Lys Gln Ser Gln Ala Ala Arg Asp Leu Ala Arg
Pro Ala Lys Gly Asn 725 730 735 Arg Phe Arg Ile Ala Leu Thr Gly Thr
Pro Val Glu Asn Arg Val Ser 740 745 750 Glu Leu Trp Ala Leu Met Asp
Phe Leu Ser Pro Lys Val Leu Gly Glu 755 760 765 Glu Asp Phe Phe Arg
Gln Arg Tyr Arg Met Pro Ile Glu Arg Tyr Gly 770 775 780 Asp Met Ala
Ser Leu Arg Asp Leu Lys Ala Arg Val Gly Pro Phe Ile 785 790 795 800
Leu Arg Arg Leu Lys Thr Asp Lys Thr Ile Ile Ser Asp Leu Pro Glu 805
810 815 Lys Val Glu Leu Ser Glu Trp Val Gly Leu Ser Lys Glu Gln Lys
Ser 820 825 830 Leu Tyr Ser Lys Thr Val Glu Asp Thr Leu Asp Ala Ile
Ala Arg Ala 835 840 845 Pro Arg Gly Gln Arg His Gly Gln Val Leu Gly
Leu Leu Thr Arg Leu 850 855 860 Lys Gln Ile Cys Asn His Pro Ala Leu
Ala Leu Ser Glu Asn Ala Val 865 870 875 880 Asp Asp Gly Phe Leu Gly
Arg Ser Ala Lys Leu Gln Arg Leu Glu Glu 885 890 895 Ile Leu Asp Glu
Val Ile Glu Ala Gly Asp Arg Ala Leu Leu Phe Thr 900 905 910 Gln Phe
Ala Glu Trp Gly His Leu Leu Gln Ser Trp Met Gln Gln Arg 915 920 925
Trp Lys Ala Asp Val Pro Phe Leu His Gly Gly Thr Arg Lys Asn Glu 930
935 940 Arg Gln Ala Met Val Asp Arg Phe Gln Glu Asp Pro Arg Gly Pro
Gln 945 950 955 960 Leu Phe Leu Leu Ser Leu Lys Ala Gly Gly Val Gly
Leu Asn Leu Thr 965 970 975 Arg Ala Ser His Val Phe His Ile Asp Arg
Trp Trp Asn Pro Ala
Val 980 985 990 Glu Asn Gln Ala Thr Asp Arg Ala Tyr Arg Ile Gly Gln
Thr Asn Arg 995 1000 1005 Val Met Val His Lys Phe Ile Thr Ser Gly
Ser Val Glu Glu Lys 1010 1015 1020 Ile Asp Arg Met Ile Arg Glu Lys
Ser Arg Leu Ala Glu Asp Val 1025 1030 1035 Ile Gly Ser Gly Glu Asp
Trp Leu Gly Cys Leu Ala Gly Asp Gln 1040 1045 1050 Leu Arg Asn Leu
Val Ala Leu Glu Asp Thr 1055 1060 973060DNASynechococcus elongatus
97atggcagtgc tgcacggtgg ctggctcggc gatcgcttct gcgtttgggc cgaggcttgg
60caggctggtg agcctcagtc ggcagcagaa attgcgattc atccctacgc gatcgcggcc
120actgacttaa atgattggtg ccagaagtac cgtctgggat ccctgacggg
gacgccaaca 180gaagtcctgc tctctattcc cagtgacctg aagaaagagg
cggttctacc gtttctgagt 240ggtcaggaaa ttccagatgg ggcgctgctt
tggtcttggc agatccccgt gctgtcgcta 300gaagccgcga tcgccggtca
atggctggcg accttgccgc tgggttcggc ggaggatcat 360ccttggctgg
ggccagatct acgcttttgg agccacatct accgctgggc acaaagtttg
420ctggctcggg ggcgctttta tccggcgctg gagtcgagcg atcgcggttt
aacggcagtt 480tggttgccac tgtttaatca agcgggcgat cgccagcgct
tcgatcgcta tagtcagcag 540ctgcccttta gtcagttttg ctatcaggca
atcgaaacag cggcagcttg tccttggcag 600cctcaaccgc aggatctgtt
gctgcgagtc ctacagactt ggttgacagc acgactacaa 660ccggcgatcg
cggcgggaac tctcgtgtct gctgatctgc tggcggcttg gcagcaatcg
720ctagcgaatg gaaaaccgct aaagctagaa gacagtgaag ccagtcgctt
gcaaacggcg 780atcgatcgct ggttactacc agtgcagaat ggcgcagctc
aggcttggcg gatggttttg 840cgccttgtcc cgcctacgga gcaagagcag
ccctggcaat tggagtttgg cttacaagca 900gcgaccgatc ccgatcgctt
tcggccggcc tctctcctct ggcaggatcc gctgccacct 960gggctaccag
atcaatctca ggaattgctg ttacgcggct tgggacaggc ttgtcggctc
1020tatccccaat tgcaaaccag tctggcgaca gcctgtccag aattccatcc
actgaccaca 1080gcggaggtct atcagctgct caagcaggtg attcctcagt
ggcaagagca gggcattgaa 1140gtgcaactgc cgccgggctt gcgtggtcaa
gggcgacacc ggctgggagt ggaagtcagc 1200gccacgttgc cgagcgatcg
cccgagtgtg gggctggaag cactactgca gtttcgttgg 1260gagctgagtc
tgggcggtca gcggctgacc aaagcagaag tggaacgctt ggcagccctg
1320gaaacgccct tggtggaaat caacggcgac tggattgagg tgcggccgca
ggatattgag 1380tcggcgcgag agtttttccg taagcgcaag gatcagccaa
atttgacctt ggcggatgcg 1440atcgcgatcg ccagtggtga gtcgccgaat
gttggtcgcc tgccggtggt caattttgaa 1500gcggcgggct tactcgaaga
agccttggcc gtgtttcagg ggcagcgatc gcctgcggct 1560ttgcccgctc
cgcccacctt tcagggcgag ctgcgaccct atcaagagcg gggggtgggc
1620tggctcagct ttttgcagcg cttcgggatt ggggcttgcc tcgccgacga
catgggcttg 1680ggtaagacga ttcagctgct ggccttttta ctgcatctca
aacacagcaa cgagctgacg 1740cggccggtgc tgctagtctg tccgacttcg
gtgctgggca actgggaacg ggaggtgcag 1800aaatttgcac cggagcttcg
ctggaagctg cactatggcc ccgatcgcgc tcagggtaag 1860gctttggcga
cagcgctcaa ggactgcgat ttggtgctga ccagttactc cttggtggcg
1920cgagatcaga aagcgatcgc ggcgatcgac tggcaaggca ttgtgctgga
tgaagcccag 1980aacatcaaga atgaccaggc gaaacagacg caggcggtgc
gagcgatcgc ccaaagtccg 2040acgcaaaagc cccgctttcg gattgccctg
acagggacgc cggttgagaa tcgcctcagt 2100gagttgtggt cgattgtcga
gtttttgcag ccgggacatt taggcaccaa gccattcttt 2160caaaagcgct
ttgtcacgcc gatcgagcgt tttggcgatg cggattcgct gacagcattg
2220cggcagcgcg tgcaaccgtt aatcctacgg cgactgaaaa ccgatcgcag
cattattgcc 2280gacttgcctg agaagcaaga aatgacggtc ttttgtccgt
tggtacagga gcaggccgat 2340cgctatcagg tgctagtcaa tgaagcgcta
gccaatattg aagcaagtga aggcattcag 2400cggcgcggcc agattttggc
attgctaacg cgactgaagc agctctgtaa tcatccgtcg 2460ttgttgctcg
aaaagccgaa gctcgatccg aattttggcg atcgctcagc caagttgcag
2520cgcttactag aaatgttggc ggagctaacg gatgcgggcg atcgcgcttt
ggtgtttacg 2580cagtttgcgg gctggggtag tttgctgcag caatttttgc
aggaacagct agggcgagag 2640gtgctgtttt tgtcgggcag taccaagaag
ggcgatcgcc aacagatggt tgatcgcttc 2700caaaatgatc cgcaggcacc
ggcaattttc atcctgtcat tgaaggctgg cggggtgggg 2760ctcaacctga
cgaaagccaa tcatgtcttt cattacgatc gctggtggaa tccggcagtt
2820gaaaaccaag cgaccgatcg cgcgtttcgg attgggcaac gacgcaatgt
acaggtgcac 2880aagtttgtct gcgctggcac tctagaagaa aaaattgatc
agatgatcgc tagcaagcaa 2940gcattagcac agcagattgt cggtagtggt
gaggattggc taacggaact agacaccaat 3000caactccggc aactcttgat
cctcgatcgc tcagcttggg tagaagagga agagccttag
3060981019PRTSynechococcus elongatus 98Met Ala Val Leu His Gly Gly
Trp Leu Gly Asp Arg Phe Cys Val Trp 1 5 10 15 Ala Glu Ala Trp Gln
Ala Gly Glu Pro Gln Ser Ala Ala Glu Ile Ala 20 25 30 Ile His Pro
Tyr Ala Ile Ala Ala Thr Asp Leu Asn Asp Trp Cys Gln 35 40 45 Lys
Tyr Arg Leu Gly Ser Leu Thr Gly Thr Pro Thr Glu Val Leu Leu 50 55
60 Ser Ile Pro Ser Asp Leu Lys Lys Glu Ala Val Leu Pro Phe Leu Ser
65 70 75 80 Gly Gln Glu Ile Pro Asp Gly Ala Leu Leu Trp Ser Trp Gln
Ile Pro 85 90 95 Val Leu Ser Leu Glu Ala Ala Ile Ala Gly Gln Trp
Leu Ala Thr Leu 100 105 110 Pro Leu Gly Ser Ala Glu Asp His Pro Trp
Leu Gly Pro Asp Leu Arg 115 120 125 Phe Trp Ser His Ile Tyr Arg Trp
Ala Gln Ser Leu Leu Ala Arg Gly 130 135 140 Arg Phe Tyr Pro Ala Leu
Glu Ser Ser Asp Arg Gly Leu Thr Ala Val 145 150 155 160 Trp Leu Pro
Leu Phe Asn Gln Ala Gly Asp Arg Gln Arg Phe Asp Arg 165 170 175 Tyr
Ser Gln Gln Leu Pro Phe Ser Gln Phe Cys Tyr Gln Ala Ile Glu 180 185
190 Thr Ala Ala Ala Cys Pro Trp Gln Pro Gln Pro Gln Asp Leu Leu Leu
195 200 205 Arg Val Leu Gln Thr Trp Leu Thr Ala Arg Leu Gln Pro Ala
Ile Ala 210 215 220 Ala Gly Thr Leu Val Ser Ala Asp Leu Leu Ala Ala
Trp Gln Gln Ser 225 230 235 240 Leu Ala Asn Gly Lys Pro Leu Lys Leu
Glu Asp Ser Glu Ala Ser Arg 245 250 255 Leu Gln Thr Ala Ile Asp Arg
Trp Leu Leu Pro Val Gln Asn Gly Ala 260 265 270 Ala Gln Ala Trp Arg
Met Val Leu Arg Leu Val Pro Pro Thr Glu Gln 275 280 285 Glu Gln Pro
Trp Gln Leu Glu Phe Gly Leu Gln Ala Ala Thr Asp Pro 290 295 300 Asp
Arg Phe Arg Pro Ala Ser Leu Leu Trp Gln Asp Pro Leu Pro Pro 305 310
315 320 Gly Leu Pro Asp Gln Ser Gln Glu Leu Leu Leu Arg Gly Leu Gly
Gln 325 330 335 Ala Cys Arg Leu Tyr Pro Gln Leu Gln Thr Ser Leu Ala
Thr Ala Cys 340 345 350 Pro Glu Phe His Pro Leu Thr Thr Ala Glu Val
Tyr Gln Leu Leu Lys 355 360 365 Gln Val Ile Pro Gln Trp Gln Glu Gln
Gly Ile Glu Val Gln Leu Pro 370 375 380 Pro Gly Leu Arg Gly Gln Gly
Arg His Arg Leu Gly Val Glu Val Ser 385 390 395 400 Ala Thr Leu Pro
Ser Asp Arg Pro Ser Val Gly Leu Glu Ala Leu Leu 405 410 415 Gln Phe
Arg Trp Glu Leu Ser Leu Gly Gly Gln Arg Leu Thr Lys Ala 420 425 430
Glu Val Glu Arg Leu Ala Ala Leu Glu Thr Pro Leu Val Glu Ile Asn 435
440 445 Gly Asp Trp Ile Glu Val Arg Pro Gln Asp Ile Glu Ser Ala Arg
Glu 450 455 460 Phe Phe Arg Lys Arg Lys Asp Gln Pro Asn Leu Thr Leu
Ala Asp Ala 465 470 475 480 Ile Ala Ile Ala Ser Gly Glu Ser Pro Asn
Val Gly Arg Leu Pro Val 485 490 495 Val Asn Phe Glu Ala Ala Gly Leu
Leu Glu Glu Ala Leu Ala Val Phe 500 505 510 Gln Gly Gln Arg Ser Pro
Ala Ala Leu Pro Ala Pro Pro Thr Phe Gln 515 520 525 Gly Glu Leu Arg
Pro Tyr Gln Glu Arg Gly Val Gly Trp Leu Ser Phe 530 535 540 Leu Gln
Arg Phe Gly Ile Gly Ala Cys Leu Ala Asp Asp Met Gly Leu 545 550 555
560 Gly Lys Thr Ile Gln Leu Leu Ala Phe Leu Leu His Leu Lys His Ser
565 570 575 Asn Glu Leu Thr Arg Pro Val Leu Leu Val Cys Pro Thr Ser
Val Leu 580 585 590 Gly Asn Trp Glu Arg Glu Val Gln Lys Phe Ala Pro
Glu Leu Arg Trp 595 600 605 Lys Leu His Tyr Gly Pro Asp Arg Ala Gln
Gly Lys Ala Leu Ala Thr 610 615 620 Ala Leu Lys Asp Cys Asp Leu Val
Leu Thr Ser Tyr Ser Leu Val Ala 625 630 635 640 Arg Asp Gln Lys Ala
Ile Ala Ala Ile Asp Trp Gln Gly Ile Val Leu 645 650 655 Asp Glu Ala
Gln Asn Ile Lys Asn Asp Gln Ala Lys Gln Thr Gln Ala 660 665 670 Val
Arg Ala Ile Ala Gln Ser Pro Thr Gln Lys Pro Arg Phe Arg Ile 675 680
685 Ala Leu Thr Gly Thr Pro Val Glu Asn Arg Leu Ser Glu Leu Trp Ser
690 695 700 Ile Val Glu Phe Leu Gln Pro Gly His Leu Gly Thr Lys Pro
Phe Phe 705 710 715 720 Gln Lys Arg Phe Val Thr Pro Ile Glu Arg Phe
Gly Asp Ala Asp Ser 725 730 735 Leu Thr Ala Leu Arg Gln Arg Val Gln
Pro Leu Ile Leu Arg Arg Leu 740 745 750 Lys Thr Asp Arg Ser Ile Ile
Ala Asp Leu Pro Glu Lys Gln Glu Met 755 760 765 Thr Val Phe Cys Pro
Leu Val Gln Glu Gln Ala Asp Arg Tyr Gln Val 770 775 780 Leu Val Asn
Glu Ala Leu Ala Asn Ile Glu Ala Ser Glu Gly Ile Gln 785 790 795 800
Arg Arg Gly Gln Ile Leu Ala Leu Leu Thr Arg Leu Lys Gln Leu Cys 805
810 815 Asn His Pro Ser Leu Leu Leu Glu Lys Pro Lys Leu Asp Pro Asn
Phe 820 825 830 Gly Asp Arg Ser Ala Lys Leu Gln Arg Leu Leu Glu Met
Leu Ala Glu 835 840 845 Leu Thr Asp Ala Gly Asp Arg Ala Leu Val Phe
Thr Gln Phe Ala Gly 850 855 860 Trp Gly Ser Leu Leu Gln Gln Phe Leu
Gln Glu Gln Leu Gly Arg Glu 865 870 875 880 Val Leu Phe Leu Ser Gly
Ser Thr Lys Lys Gly Asp Arg Gln Gln Met 885 890 895 Val Asp Arg Phe
Gln Asn Asp Pro Gln Ala Pro Ala Ile Phe Ile Leu 900 905 910 Ser Leu
Lys Ala Gly Gly Val Gly Leu Asn Leu Thr Lys Ala Asn His 915 920 925
Val Phe His Tyr Asp Arg Trp Trp Asn Pro Ala Val Glu Asn Gln Ala 930
935 940 Thr Asp Arg Ala Phe Arg Ile Gly Gln Arg Arg Asn Val Gln Val
His 945 950 955 960 Lys Phe Val Cys Ala Gly Thr Leu Glu Glu Lys Ile
Asp Gln Met Ile 965 970 975 Ala Ser Lys Gln Ala Leu Ala Gln Gln Ile
Val Gly Ser Gly Glu Asp 980 985 990 Trp Leu Thr Glu Leu Asp Thr Asn
Gln Leu Arg Gln Leu Leu Ile Leu 995 1000 1005 Asp Arg Ser Ala Trp
Val Glu Glu Glu Glu Pro 1010 1015 993060DNASynechococcus elongatus
99atggcagtgc tgcacggtgg ctggctcggc gatcgcttct gcgtttgggc cgaggcttgg
60caggctggtg agcctcagtc ggcagcagaa attgcgattc atccctacgc gatcgcggcc
120actgacttaa atgattggtg ccagaagtac cgtctgggat ccctgacggg
gacgccaaca 180gaagtcctgc tctctattcc cagtgacctg aagaaagagg
cggttctacc gtttctgagt 240ggtcaggaaa ttccagatgg ggcgctgctt
tggtcttggc agatccccgt gctgtcacta 300gaagccgcga tcgccggtca
atggctggcg accttgccgc tgggttcggc ggaggatcat 360ccttggctgg
ggccagatct acgcttttgg agccacatct accgctgggc acaaagtttg
420ctggctcggg ggcgctttta tccggcgctg gagtcgagcg atcgcggttt
aacggcagtt 480tggttgccac tgtttaatca agcgggcgat cgccagcgct
tcgatcgcta tagtcagcag 540ctgcccttta gtcagttttg ctatcaggca
atcgaaacag cggcagcttg tccttggcag 600cctcaaccgc aggatctgtt
gctgcgagtc ctacagactt ggttgacagc acgactacaa 660ccggcgatcg
cggcgggaac tctcgtgtct gctgatctgc tggcggcttg gcagcaatcg
720ctagcgaatg gaaaaccgct aaagctagaa gacagtgaag ccagtcgctt
gcaaacggcg 780atcgatcgct ggttactacc agtgcagaat ggcgcagctc
aggcttggcg gatggttttg 840cgccttgtcc cgcctacgga gcaagagcag
ccctggcaat tggagtttgg cttacaagca 900gcgaccgatc ccgatcgctt
ttggccggcc tctctcctct ggcaggatcc gctgccacct 960gggctaccag
atcaatctca ggaattgctg ttacgcggct tgggacaggc ttgtcggctc
1020tatccccaat tgcaaaccag tctggcgaca gcctgtccag aattccatcc
actgaccaca 1080gcggaggtct atcagctgct caagcaggtg attcctcagt
ggcaagagca gggcattgaa 1140gtgcaactgc cgccgggctt gcgtggtcaa
gggcgacacc ggctgggagt ggaagtcagc 1200gccacgttgc cgagcgatcg
cccgagtgtg gggctggaag cactactgca gtttcgttgg 1260gagctgagtc
tgggcggtca gcggctgacc aaagcagaag tggaacgctt ggcagccctg
1320gaaacgccct tggtggaaat caacggcgac tggattgagg tgcggccgca
ggatattgag 1380tcggcgcgag agtttttccg taagcgcaag gatcagccaa
atttgacctt ggcggatgcg 1440atcgcgatcg ccagtggtga gtcgccgaat
gttggtcgcc tgccggtggt caattttgaa 1500gcggcgggct tactcgaaga
agccttggcc gtgtttcagg ggcagcgatc gcctgcggct 1560ttgcccgctc
cgcccacctt tcagggcgag ctgcgaccct atcaagagcg gggggtgggc
1620tggctcagct ttttgcagcg cttcgggatt ggggcttgcc tcgccgacga
catgggcttg 1680ggtaagacga ttcagctgct ggccttttta ctgcatctca
aacacagcaa cgagctgacg 1740cggccggtgc tgctagtctg tccgacttcg
gtgctgggca actgggaacg ggaggtgcag 1800aaatttgcac cggagcttcg
ctggaagctg cactatggcc ccgatcgcgc tcagggtaag 1860gctttggcga
cagcgctcaa ggactgcgat ttggtgctga ccagttactc cttggtggcg
1920cgagatcaga aagcgatcgc ggcgatcgac tggcaaggca ttgtgctgga
tgaagcccag 1980aacatcaaga atgaccaggc gaaacagacg caggcggtgc
gagcgatcgc ccaaagtccg 2040acgcaaaagc cccgctttcg gattgccctg
acagggacgc cggttgagaa tcgcctcagt 2100gagttgtggt cgattgtcga
gtttttgcag ccgggacatt taggcaccaa gccattcttt 2160caaaagcgct
ttgtcacgcc gatcgagcgt tttggcgatg cggattcgct gacagcattg
2220cggcagcgcg tgcaaccgtt aatcctacgg cgactgaaaa ccgatcgcag
cattattgcc 2280gacttgcctg agaagcaaga aatgacggtc ttttgtccgt
tggtacagga gcaggccgat 2340cgctatcagg tgctagtcaa tgaagcgcta
gccaatattg aagcaagtga aggcattcag 2400cggcgcggcc agattttggc
attgctaacg cgactgaagc agctctgtaa tcatccgtcg 2460ttgttgctcg
aaaagccgaa gctcgatccg aattttggcg atcgctcagc caagttgcag
2520cgcttactag aaatgttggc ggagctaacg gatgcgggcg atcgcgcttt
ggtgtttacg 2580cagtttgcgg gctggggtag tttgctgcag caatttttgc
aggaacagct agggcgagag 2640gtgctgtttt tgtcgggcag taccaagaag
ggcgatcgcc aacagatggt tgatcgcttc 2700caaaatgatc cgcaggcacc
ggcaattttc atcctgtcat tgaaggctgg cggggtgggg 2760ctcaacctga
cgaaagccaa tcatgtcttt cattacgatc gctggtggaa tccggcagtt
2820gaaaaccaag cgaccgatcg cgcgtttcgg attgggcaac gacgcaatgt
acaggtgcac 2880aagtttgtct gcgctggcac tctagaagaa aaaattgatc
agatgatcgc tagcaagcaa 2940gcattagcac agcagattgt cggtagtggt
gaggattggc taacggaact agacaccaat 3000caactccggc aactcttgat
cctcgatcgc tcagcttggg tagaagagga agagccttag
30601001019PRTSynechococcus elongatus 100Met Ala Val Leu His Gly
Gly Trp Leu Gly Asp Arg Phe Cys Val Trp 1 5 10 15 Ala Glu Ala Trp
Gln Ala Gly Glu Pro Gln Ser Ala Ala Glu Ile Ala 20 25 30 Ile His
Pro Tyr Ala Ile Ala Ala Thr Asp Leu Asn Asp Trp Cys Gln 35 40 45
Lys Tyr Arg Leu Gly Ser Leu Thr Gly Thr Pro Thr Glu Val Leu Leu 50
55 60 Ser Ile Pro Ser Asp Leu Lys Lys Glu Ala Val Leu Pro Phe Leu
Ser 65 70 75 80 Gly Gln Glu Ile Pro Asp Gly Ala Leu Leu Trp Ser Trp
Gln Ile Pro 85 90 95 Val Leu Ser Leu Glu Ala Ala Ile Ala Gly Gln
Trp Leu Ala Thr Leu 100 105 110 Pro Leu Gly Ser Ala Glu Asp His Pro
Trp Leu Gly Pro Asp Leu Arg 115 120 125 Phe Trp Ser His Ile Tyr Arg
Trp Ala Gln Ser Leu Leu Ala Arg Gly 130 135 140 Arg Phe Tyr Pro Ala
Leu Glu Ser Ser Asp Arg Gly Leu Thr Ala Val 145 150 155 160 Trp Leu
Pro Leu Phe Asn Gln Ala Gly Asp Arg Gln Arg Phe Asp Arg 165 170 175
Tyr Ser Gln Gln Leu Pro Phe Ser Gln Phe Cys Tyr Gln Ala Ile Glu 180
185 190 Thr Ala Ala Ala Cys Pro Trp Gln Pro Gln Pro Gln Asp Leu Leu
Leu 195 200 205 Arg Val Leu Gln Thr Trp Leu Thr Ala Arg Leu Gln Pro
Ala Ile Ala 210 215 220 Ala Gly Thr Leu Val Ser Ala Asp Leu Leu Ala
Ala Trp Gln Gln Ser 225 230 235 240 Leu Ala Asn Gly Lys
Pro Leu Lys Leu Glu Asp Ser Glu Ala Ser Arg 245 250 255 Leu Gln Thr
Ala Ile Asp Arg Trp Leu Leu Pro Val Gln Asn Gly Ala 260 265 270 Ala
Gln Ala Trp Arg Met Val Leu Arg Leu Val Pro Pro Thr Glu Gln 275 280
285 Glu Gln Pro Trp Gln Leu Glu Phe Gly Leu Gln Ala Ala Thr Asp Pro
290 295 300 Asp Arg Phe Trp Pro Ala Ser Leu Leu Trp Gln Asp Pro Leu
Pro Pro 305 310 315 320 Gly Leu Pro Asp Gln Ser Gln Glu Leu Leu Leu
Arg Gly Leu Gly Gln 325 330 335 Ala Cys Arg Leu Tyr Pro Gln Leu Gln
Thr Ser Leu Ala Thr Ala Cys 340 345 350 Pro Glu Phe His Pro Leu Thr
Thr Ala Glu Val Tyr Gln Leu Leu Lys 355 360 365 Gln Val Ile Pro Gln
Trp Gln Glu Gln Gly Ile Glu Val Gln Leu Pro 370 375 380 Pro Gly Leu
Arg Gly Gln Gly Arg His Arg Leu Gly Val Glu Val Ser 385 390 395 400
Ala Thr Leu Pro Ser Asp Arg Pro Ser Val Gly Leu Glu Ala Leu Leu 405
410 415 Gln Phe Arg Trp Glu Leu Ser Leu Gly Gly Gln Arg Leu Thr Lys
Ala 420 425 430 Glu Val Glu Arg Leu Ala Ala Leu Glu Thr Pro Leu Val
Glu Ile Asn 435 440 445 Gly Asp Trp Ile Glu Val Arg Pro Gln Asp Ile
Glu Ser Ala Arg Glu 450 455 460 Phe Phe Arg Lys Arg Lys Asp Gln Pro
Asn Leu Thr Leu Ala Asp Ala 465 470 475 480 Ile Ala Ile Ala Ser Gly
Glu Ser Pro Asn Val Gly Arg Leu Pro Val 485 490 495 Val Asn Phe Glu
Ala Ala Gly Leu Leu Glu Glu Ala Leu Ala Val Phe 500 505 510 Gln Gly
Gln Arg Ser Pro Ala Ala Leu Pro Ala Pro Pro Thr Phe Gln 515 520 525
Gly Glu Leu Arg Pro Tyr Gln Glu Arg Gly Val Gly Trp Leu Ser Phe 530
535 540 Leu Gln Arg Phe Gly Ile Gly Ala Cys Leu Ala Asp Asp Met Gly
Leu 545 550 555 560 Gly Lys Thr Ile Gln Leu Leu Ala Phe Leu Leu His
Leu Lys His Ser 565 570 575 Asn Glu Leu Thr Arg Pro Val Leu Leu Val
Cys Pro Thr Ser Val Leu 580 585 590 Gly Asn Trp Glu Arg Glu Val Gln
Lys Phe Ala Pro Glu Leu Arg Trp 595 600 605 Lys Leu His Tyr Gly Pro
Asp Arg Ala Gln Gly Lys Ala Leu Ala Thr 610 615 620 Ala Leu Lys Asp
Cys Asp Leu Val Leu Thr Ser Tyr Ser Leu Val Ala 625 630 635 640 Arg
Asp Gln Lys Ala Ile Ala Ala Ile Asp Trp Gln Gly Ile Val Leu 645 650
655 Asp Glu Ala Gln Asn Ile Lys Asn Asp Gln Ala Lys Gln Thr Gln Ala
660 665 670 Val Arg Ala Ile Ala Gln Ser Pro Thr Gln Lys Pro Arg Phe
Arg Ile 675 680 685 Ala Leu Thr Gly Thr Pro Val Glu Asn Arg Leu Ser
Glu Leu Trp Ser 690 695 700 Ile Val Glu Phe Leu Gln Pro Gly His Leu
Gly Thr Lys Pro Phe Phe 705 710 715 720 Gln Lys Arg Phe Val Thr Pro
Ile Glu Arg Phe Gly Asp Ala Asp Ser 725 730 735 Leu Thr Ala Leu Arg
Gln Arg Val Gln Pro Leu Ile Leu Arg Arg Leu 740 745 750 Lys Thr Asp
Arg Ser Ile Ile Ala Asp Leu Pro Glu Lys Gln Glu Met 755 760 765 Thr
Val Phe Cys Pro Leu Val Gln Glu Gln Ala Asp Arg Tyr Gln Val 770 775
780 Leu Val Asn Glu Ala Leu Ala Asn Ile Glu Ala Ser Glu Gly Ile Gln
785 790 795 800 Arg Arg Gly Gln Ile Leu Ala Leu Leu Thr Arg Leu Lys
Gln Leu Cys 805 810 815 Asn His Pro Ser Leu Leu Leu Glu Lys Pro Lys
Leu Asp Pro Asn Phe 820 825 830 Gly Asp Arg Ser Ala Lys Leu Gln Arg
Leu Leu Glu Met Leu Ala Glu 835 840 845 Leu Thr Asp Ala Gly Asp Arg
Ala Leu Val Phe Thr Gln Phe Ala Gly 850 855 860 Trp Gly Ser Leu Leu
Gln Gln Phe Leu Gln Glu Gln Leu Gly Arg Glu 865 870 875 880 Val Leu
Phe Leu Ser Gly Ser Thr Lys Lys Gly Asp Arg Gln Gln Met 885 890 895
Val Asp Arg Phe Gln Asn Asp Pro Gln Ala Pro Ala Ile Phe Ile Leu 900
905 910 Ser Leu Lys Ala Gly Gly Val Gly Leu Asn Leu Thr Lys Ala Asn
His 915 920 925 Val Phe His Tyr Asp Arg Trp Trp Asn Pro Ala Val Glu
Asn Gln Ala 930 935 940 Thr Asp Arg Ala Phe Arg Ile Gly Gln Arg Arg
Asn Val Gln Val His 945 950 955 960 Lys Phe Val Cys Ala Gly Thr Leu
Glu Glu Lys Ile Asp Gln Met Ile 965 970 975 Ala Ser Lys Gln Ala Leu
Ala Gln Gln Ile Val Gly Ser Gly Glu Asp 980 985 990 Trp Leu Thr Glu
Leu Asp Thr Asn Gln Leu Arg Gln Leu Leu Ile Leu 995 1000 1005 Asp
Arg Ser Ala Trp Val Glu Glu Glu Glu Pro 1010 1015
1013000DNAThermosynechococcus elongatus 101atggctattt tccatggcac
atggctccca gagccggcgc cacagttttt catttgggcg 60gaagaatggc gatcgctggc
tcaggcaatc acgccttggg ctcccccggc gattccggtt 120tatccctacg
ccacccagag aaaaacacct cttaggaaga cagcccgccc aagtgccacc
180tacgttgctt taccggccca gattcagggg catcaactgt taccaccacc
gctggcggaa 240gtgcaggggg aactcctatt tttgtggcag gtgcccggct
ggtcaattcc cgcttcagaa 300gttttagaac aactgcatca actgagtctt
cacggccaag acagtggcag tattggcgat 360gatttgcgct attggctgca
cgtgagtcgc tggttgctgg atttaattgt gcgtggccaa 420tacctgccaa
caccagaggg ctggcggatt ctgctgaccc acgggggcga tcgcgatcgc
480ctgcgccact tcagccaatt gatgccggat ctgtgtcgct gttatcaagc
cgatggcaca 540gcgttgcagt tgccacccca tgctgcagat ctcctggcgg
attttctaca gcacacccta 600cagggttatc tccacactgc ccttgctgac
ctcgaattgc ccaaagtagg cttagccaaa 660gaacatggcc actggctagc
cttcctgaaa acgggtcaaa ccccggaact gccacctccc 720ctcattgaac
gcctgcaccg ctggcaagaa ccctaccgcg agcagttgca tctgcgtccc
780caatggcgac tggctctgca attggttccc ccagatactg ccgatggtga
ctggcacttg 840gcctttgggc tgcaaacgga aggggaaacg gacaccatgc
taagggccgc cgagatttgg 900caatgcaccc aagaggccct cctctatcaa
gggcaggtgc tctggcagcc ccaagaaacc 960ctgttgcggg gactgggctt
ggcctcccgc atctatcgtc ccctcgatcg cagtcttcaa 1020gaacgctccc
ccgtggctct gactttgcac accacggaag tttatgcctt cttgcaaagt
1080gcaattgcgc cccttgagca gcagggggtt gcgatcattt tgccaccgag
tctgcgccgc 1140aatagcgccc aacatcgctt gggtctgaaa ataattgcca
cattgccgcc gccggccact 1200aacggcttga cgattgacag cttgatgcag
tttcagtggc agttgcagtt ggggcagcat 1260cccctctcgg aggcggattt
tgatcaactg cgccgccaag ggacgcccct ggtttatctc 1320aatggtgagt
gggtcttgct gcgcccccaa gaggtcaagg ccgctcaaga gtttctccag
1380tctcccccaa agacccaact ctcccttgca gagacactgc gcattgctac
gggggatacg 1440gtaacggtgg ccaagttgcc gattcttggc ttagacacca
atgatgcact ccagaccctc 1500ttggatggcc tcacgggcaa acaaagcctt
gatccagtgc caacaccgca ggagttttgc 1560ggtgaactgc gcccctacca
ggcacggggg gtggcgtggc tgagtttctt ggaacgctgg 1620cggctggggg
cttgcttggc ggacgatatg ggcttgggga aaaccattca actgttggcc
1680tttttgctcc acctcaagga aacgggacgg gcctaccgac cgacactgtt
gatctgtcct 1740acctcggtgc tggggaactg gctgcgggag tgccaaaagt
ttgccccaac cttgcgggcc 1800tatgtccacc atgggagcga tcgccccaag
ggcaaggcat ttctgaaaaa ggttgaaact 1860cacgatctaa ttttgaccag
ttatgccctc ctccagcgcg atcgcaccac cttgcagcag 1920gttctgtggc
agcatttggt actggatgaa gcccaaaaca tcaagaatgc caacacccag
1980cagtcccaag cagcgcggga actttccgcc cagtttcgca ttgccctgac
gggaaccccc 2040ctagaaaacc gcctcctcga actttggtcc attatggact
tcctccatcc ggggtacttg 2100ggccatcgca cctactttca acaccgctat
gtccgtccca ttgaacgcta tggcgacacc 2160acctccctca atgctctgcg
cacctatgtc cagcccttta ttctgcggcg cctgaaaacc 2220gaccgcagta
ttattcaaga cctgccggaa aaacaggaga tgctggtgta ttgtggcctc
2280accctagagc agatgcagct ttacactgct gtggtggaag actcccttgc
tgctatcgaa 2340aatagtcaag gcattcagcg gcggggcaat atcttggcca
ccctgaccaa gttgaagcaa 2400atctgtaacc atcccgccca gtatctcaag
caagaagact atgcccccga tcgctcaggt 2460aaattgcaac ggcttataga
aatgctgcaa gcgcttcagg aagtgggcga tcgcgccctt 2520gtctttaccc
aatttgccga gtttggcacc cacctgaaaa cctatctgga aaaggcgctc
2580cagcaggagg tgtttttcct ctcaggacgc acccccaaag cccagcggga
actcatggtg 2640gaacgctttc aacacgatcc cgaggccccc agggtcttta
ttctttccct caaggcaggg 2700ggcgtcggtc tcaatttgac tcgcgctaac
catgtctttc actacgatcg ctggtggaac 2760ccagcggtag aaaatcaggc
cagcgatcgc gtcttccgca ttggtcaggc ccgcaatgtc 2820caaatccata
aatttatctg cacgggtacc ctcgaagaaa agatccacga gcaaatcgaa
2880cagaaaaaag cccttgcgga aatgattgtg ggtagtggcg aacactggct
gactgaactc 2940aacctcgacc agttgcggca actgctcacc ttagacaaag
agcggctgat caccctctag 3000102999PRTThermosynechococcus elongatus
102Met Ala Ile Phe His Gly Thr Trp Leu Pro Glu Pro Ala Pro Gln Phe
1 5 10 15 Phe Ile Trp Ala Glu Glu Trp Arg Ser Leu Ala Gln Ala Ile
Thr Pro 20 25 30 Trp Ala Pro Pro Ala Ile Pro Val Tyr Pro Tyr Ala
Thr Gln Arg Lys 35 40 45 Thr Pro Leu Arg Lys Thr Ala Arg Pro Ser
Ala Thr Tyr Val Ala Leu 50 55 60 Pro Ala Gln Ile Gln Gly His Gln
Leu Leu Pro Pro Pro Leu Ala Glu 65 70 75 80 Val Gln Gly Glu Leu Leu
Phe Leu Trp Gln Val Pro Gly Trp Ser Ile 85 90 95 Pro Ala Ser Glu
Val Leu Glu Gln Leu His Gln Leu Ser Leu His Gly 100 105 110 Gln Asp
Ser Gly Ser Ile Gly Asp Asp Leu Arg Tyr Trp Leu His Val 115 120 125
Ser Arg Trp Leu Leu Asp Leu Ile Val Arg Gly Gln Tyr Leu Pro Thr 130
135 140 Pro Glu Gly Trp Arg Ile Leu Leu Thr His Gly Gly Asp Arg Asp
Arg 145 150 155 160 Leu Arg His Phe Ser Gln Leu Met Pro Asp Leu Cys
Arg Cys Tyr Gln 165 170 175 Ala Asp Gly Thr Ala Leu Gln Leu Pro Pro
His Ala Ala Asp Leu Leu 180 185 190 Ala Asp Phe Leu Gln His Thr Leu
Gln Gly Tyr Leu His Thr Ala Leu 195 200 205 Ala Asp Leu Glu Leu Pro
Lys Val Gly Leu Ala Lys Glu His Gly His 210 215 220 Trp Leu Ala Phe
Leu Lys Thr Gly Gln Thr Pro Glu Leu Pro Pro Pro 225 230 235 240 Leu
Ile Glu Arg Leu His Arg Trp Gln Glu Pro Tyr Arg Glu Gln Leu 245 250
255 His Leu Arg Pro Gln Trp Arg Leu Ala Leu Gln Leu Val Pro Pro Asp
260 265 270 Thr Ala Asp Gly Asp Trp His Leu Ala Phe Gly Leu Gln Thr
Glu Gly 275 280 285 Glu Thr Asp Thr Met Leu Arg Ala Ala Glu Ile Trp
Gln Cys Thr Gln 290 295 300 Glu Ala Leu Leu Tyr Gln Gly Gln Val Leu
Trp Gln Pro Gln Glu Thr 305 310 315 320 Leu Leu Arg Gly Leu Gly Leu
Ala Ser Arg Ile Tyr Arg Pro Leu Asp 325 330 335 Arg Ser Leu Gln Glu
Arg Ser Pro Val Ala Leu Thr Leu His Thr Thr 340 345 350 Glu Val Tyr
Ala Phe Leu Gln Ser Ala Ile Ala Pro Leu Glu Gln Gln 355 360 365 Gly
Val Ala Ile Ile Leu Pro Pro Ser Leu Arg Arg Asn Ser Ala Gln 370 375
380 His Arg Leu Gly Leu Lys Ile Ile Ala Thr Leu Pro Pro Pro Ala Thr
385 390 395 400 Asn Gly Leu Thr Ile Asp Ser Leu Met Gln Phe Gln Trp
Gln Leu Gln 405 410 415 Leu Gly Gln His Pro Leu Ser Glu Ala Asp Phe
Asp Gln Leu Arg Arg 420 425 430 Gln Gly Thr Pro Leu Val Tyr Leu Asn
Gly Glu Trp Val Leu Leu Arg 435 440 445 Pro Gln Glu Val Lys Ala Ala
Gln Glu Phe Leu Gln Ser Pro Pro Lys 450 455 460 Thr Gln Leu Ser Leu
Ala Glu Thr Leu Arg Ile Ala Thr Gly Asp Thr 465 470 475 480 Val Thr
Val Ala Lys Leu Pro Ile Leu Gly Leu Asp Thr Asn Asp Ala 485 490 495
Leu Gln Thr Leu Leu Asp Gly Leu Thr Gly Lys Gln Ser Leu Asp Pro 500
505 510 Val Pro Thr Pro Gln Glu Phe Cys Gly Glu Leu Arg Pro Tyr Gln
Ala 515 520 525 Arg Gly Val Ala Trp Leu Ser Phe Leu Glu Arg Trp Arg
Leu Gly Ala 530 535 540 Cys Leu Ala Asp Asp Met Gly Leu Gly Lys Thr
Ile Gln Leu Leu Ala 545 550 555 560 Phe Leu Leu His Leu Lys Glu Thr
Gly Arg Ala Tyr Arg Pro Thr Leu 565 570 575 Leu Ile Cys Pro Thr Ser
Val Leu Gly Asn Trp Leu Arg Glu Cys Gln 580 585 590 Lys Phe Ala Pro
Thr Leu Arg Ala Tyr Val His His Gly Ser Asp Arg 595 600 605 Pro Lys
Gly Lys Ala Phe Leu Lys Lys Val Glu Thr His Asp Leu Ile 610 615 620
Leu Thr Ser Tyr Ala Leu Leu Gln Arg Asp Arg Thr Thr Leu Gln Gln 625
630 635 640 Val Leu Trp Gln His Leu Val Leu Asp Glu Ala Gln Asn Ile
Lys Asn 645 650 655 Ala Asn Thr Gln Gln Ser Gln Ala Ala Arg Glu Leu
Ser Ala Gln Phe 660 665 670 Arg Ile Ala Leu Thr Gly Thr Pro Leu Glu
Asn Arg Leu Leu Glu Leu 675 680 685 Trp Ser Ile Met Asp Phe Leu His
Pro Gly Tyr Leu Gly His Arg Thr 690 695 700 Tyr Phe Gln His Arg Tyr
Val Arg Pro Ile Glu Arg Tyr Gly Asp Thr 705 710 715 720 Thr Ser Leu
Asn Ala Leu Arg Thr Tyr Val Gln Pro Phe Ile Leu Arg 725 730 735 Arg
Leu Lys Thr Asp Arg Ser Ile Ile Gln Asp Leu Pro Glu Lys Gln 740 745
750 Glu Met Leu Val Tyr Cys Gly Leu Thr Leu Glu Gln Met Gln Leu Tyr
755 760 765 Thr Ala Val Val Glu Asp Ser Leu Ala Ala Ile Glu Asn Ser
Gln Gly 770 775 780 Ile Gln Arg Arg Gly Asn Ile Leu Ala Thr Leu Thr
Lys Leu Lys Gln 785 790 795 800 Ile Cys Asn His Pro Ala Gln Tyr Leu
Lys Gln Glu Asp Tyr Ala Pro 805 810 815 Asp Arg Ser Gly Lys Leu Gln
Arg Leu Ile Glu Met Leu Gln Ala Leu 820 825 830 Gln Glu Val Gly Asp
Arg Ala Leu Val Phe Thr Gln Phe Ala Glu Phe 835 840 845 Gly Thr His
Leu Lys Thr Tyr Leu Glu Lys Ala Leu Gln Gln Glu Val 850 855 860 Phe
Phe Leu Ser Gly Arg Thr Pro Lys Ala Gln Arg Glu Leu Met Val 865 870
875 880 Glu Arg Phe Gln His Asp Pro Glu Ala Pro Arg Val Phe Ile Leu
Ser 885 890 895 Leu Lys Ala Gly Gly Val Gly Leu Asn Leu Thr Arg Ala
Asn His Val 900 905 910 Phe His Tyr Asp Arg Trp Trp Asn Pro Ala Val
Glu Asn Gln Ala Ser 915 920 925 Asp Arg Val Phe Arg Ile Gly Gln Ala
Arg Asn Val Gln Ile His Lys 930 935 940 Phe Ile Cys Thr Gly Thr Leu
Glu Glu Lys Ile His Glu Gln Ile Glu 945 950 955 960 Gln Lys Lys Ala
Leu Ala Glu Met Ile Val Gly Ser Gly Glu His Trp 965 970 975 Leu Thr
Glu Leu Asn Leu Asp Gln Leu Arg Gln Leu Leu Thr Leu Asp 980 985 990
Lys Glu Arg Leu Ile Thr Leu 995 10310PRTArtificial sequencemotif 1
of SWI2/SNF2 polypeptides 103Leu Ala Asp Asp Met Gly Leu Gly Lys
Xaa 1 5 10 10412PRTArtificial sequencemotif 1a of SWI2/SNF2
polypeptides 104Leu Xaa Xaa Xaa Pro Xaa Ser Xaa Xaa Xaa Asn Trp 1 5
10 1058PRTArtificial sequencemotif 2 of SWI2/SNF2 polypeptides
105Asp Glu Ala Gln Xaa Xaa Lys Asn 1 5 1069PRTArtificial
sequencemotif 3 of SWI2/SNF2 polypeptides 106Ala Xaa Thr Gly Thr
Pro Xaa Glu Asn 1 5 1076PRTArtificial sequencemotif 4 of SWI2/SNF2
polypeptides 107Xaa Xaa Phe Xaa Gln Xaa 1 5 10817PRTArtificial
sequencemotif 5 of SWI2/SNF2 polypeptides 108Ser Xaa Lys Ala Gly
Gly Xaa Gly Xaa Xaa Leu Thr Xaa Ala Asn His 1 5 10 15 Val
1099PRTArtificial sequencemotif 5a of SWI2/SNF2 polypeptides 109Asp
Arg Trp Trp Asn Pro Ala Val Glu 1 5 11011PRTArtificial
sequencemotif 6 of SWI2/SNF2 polypeptides 110Gln Ala Xaa Asp Arg
Xaa Xaa Arg Xaa Gly Gln 1 5 10 111460PRTArtificial sequenceATPase
domain of SEQ ID NO 2 111Leu Ala Asp Asp Met Gly Leu Gly Lys Thr
Pro Gln Leu Leu Ala Phe 1 5 10 15 Leu Leu His Leu Ala Ala Glu Asp
Met Leu Val Lys Pro Val Leu Ile 20 25 30 Val Cys Pro Thr Ser Val
Leu Ser Asn Trp Gly His Glu Ile Asn Lys 35 40 45 Phe Ala Pro Gln
Leu Lys Thr Leu Leu His His Gly Asp Arg Arg Lys 50 55 60 Lys Gly
Gln Pro Leu Val Lys Gln Val Lys Asp Gln Gln Ile Val Leu 65 70 75 80
Thr Ser Tyr Ala Leu Leu Gln Arg Asp Phe Ser Ser Leu Lys Leu Val 85
90 95 Asp Trp Gln Gly Ile Val Leu Asp Glu Ala Gln Asn Ile Lys Asn
Pro 100 105 110 Gln Ala Lys Gln Ser Gln Ala Ala Arg Gln Leu Pro Ala
Gly Phe Arg 115 120 125 Ile Ala Leu Thr Gly Thr Pro Val Glu Asn Arg
Leu Thr Glu Leu Trp 130 135 140 Ser Ile Leu Glu Phe Leu Asn Pro Gly
Phe Leu Gly Asn Gln Ser Phe 145 150 155 160 Phe Gln Arg Arg Phe Ala
Asn Pro Ile Glu Lys Phe Gly Asp Arg Gln 165 170 175 Ser Leu Leu Ile
Leu Arg Asn Leu Val Arg Pro Phe Ile Leu Arg Arg 180 185 190 Leu Lys
Thr Asp Gln Thr Ile Ile Gln Asp Leu Pro Glu Lys Gln Glu 195 200 205
Met Thr Val Phe Cys Asp Leu Ser Gln Glu Gln Ala Gly Leu Tyr Gln 210
215 220 Gln Leu Val Glu Glu Ser Leu Gln Ala Ile Ala Asp Ser Glu Gly
Ile 225 230 235 240 Gln Arg His Gly Leu Val Leu Thr Leu Leu Thr Lys
Leu Lys Gln Val 245 250 255 Cys Asn His Pro Asp Leu Leu Leu Lys Lys
Pro Ala Ile Thr His Gly 260 265 270 His Gln Ser Gly Lys Leu Ile Arg
Leu Ala Glu Met Leu Glu Glu Ile 275 280 285 Ile Ser Glu Gly Asp Arg
Val Leu Ile Phe Thr Gln Phe Ala Ser Trp 290 295 300 Gly His Leu Leu
Lys Pro Tyr Leu Glu Lys Tyr Phe Asn Gln Glu Val 305 310 315 320 Leu
Tyr Leu His Gly Gly Thr Pro Ala Glu Gln Arg Gln Ala Leu Val 325 330
335 Glu Arg Phe Gln Gln Asp Pro Asn Ser Pro Tyr Leu Phe Ile Leu Ser
340 345 350 Leu Lys Ala Gly Gly Thr Gly Leu Asn Leu Thr Arg Ala Asn
His Val 355 360 365 Phe His Val Asp Arg Trp Trp Asn Pro Ala Val Glu
Asn Gln Ala Thr 370 375 380 Asp Arg Ala Phe Arg Ile Gly Gln Thr Arg
Asn Val Gln Val His Lys 385 390 395 400 Phe Val Cys Thr Gly Thr Leu
Glu Glu Lys Ile Asn Ala Met Met Ala 405 410 415 Asp Lys Gln Gln Leu
Ala Glu Gln Thr Val Asp Ala Gly Glu Asn Trp 420 425 430 Leu Thr Arg
Leu Asp Thr Asp Lys Leu Arg Gln Leu Leu Thr Leu Ser 435 440 445 Ala
Thr Pro Val Asp Tyr Gln Ala Glu Ala Ser Asp 450 455 460
1121244DNAOryza sativa 112aaaaccaccg agggacctga tctgcaccgg
ttttgatagt tgagggaccc gttgtgtctg 60gttttccgat cgagggacga aaatcggatt
cggtgtaaag ttaagggacc tcagatgaac 120ttattccgga gcatgattgg
gaagggagga cataaggccc atgtcgcatg tgtttggacg 180gtccagatct
ccagatcact cagcaggatc ggccgcgttc gcgtagcacc cgcggtttga
240ttcggcttcc cgcaaggcgg cggccggtgg ccgtgccgcc gtagcttccg
ccggaagcga 300gcacgccgcc gccgccgacc cggctctgcg tttgcaccgc
cttgcacgcg atacatcggg 360atagatagct actactctct ccgtttcaca
atgtaaatca ttctactatt ttccacattc 420atattgatgt taatgaatat
agacatatat atctatttag attcattaac atcaatatga 480atgtaggaaa
tgctagaatg acttacattg tgaattgtga aatggacgaa gtacctacga
540tggatggatg caggatcatg aaagaattaa tgcaagatcg tatctgccgc
atgcaaaatc 600ttactaattg cgctgcatat atgcatgaca gcctgcatgc
gggcgtgtaa gcgtgttcat 660ccattaggaa gtaaccttgt cattacttat
accagtacta catactatat agtattgatt 720tcatgagcaa atctacaaaa
ctggaaagca ataagaaata cgggactgga aaagactcaa 780cattaatcac
caaatatttc gccttctcca gcagaatata tatctctcca tcttgatcac
840tgtacacact gacagtgtac gcataaacgc agcagccagc ttaactgtcg
tctcaccgtc 900gcacactggc cttccatctc aggctagctt tctcagccac
ccatcgtaca tgtcaactcg 960gcgcgcgcac aggcacaaat tacgtacaaa
acgcatgacc aaatcaaaac caccggagaa 1020gaatcgctcc cgcgcgcggc
ggcgacgcgc acgtacgaac gcacgcacgc acgcccaacc 1080ccacgacacg
atcgcgcgcg acgccggcga caccggccgt ccacccgcgc cctcacctcg
1140ccgactataa atacgtaggc atctgcttga tcttgtcatc catctcacca
ccaaaaaaaa 1200aaggaaaaaa aaacaaaaca caccaagcca aataaaagcg acaa
124411359DNAArtificial sequenceprimer prm08774 113ggggacaagt
ttgtacaaaa aagcaggctt aaacaatggc gactatccac ggtaattgg
5911449DNAArtificial sequenceprimer prm08779 114ggggaccact
ttgtacaaga aagctgggtt caatcggacg cttcggctt 49
uspto.report is an independent third-party trademark research tool that is not affiliated, endorsed, or sponsored by the United States Patent and Trademark Office (USPTO) or any other governmental organization. The information provided by uspto.report is based on publicly available data at the time of writing and is intended for informational purposes only.
While we strive to provide accurate and up-to-date information, we do not guarantee the accuracy, completeness, reliability, or suitability of the information displayed on this site. The use of this site is at your own risk. Any reliance you place on such information is therefore strictly at your own risk.
All official trademark data, including owner information, should be verified by visiting the official USPTO website at www.uspto.gov. This site is not intended to replace professional legal advice and should not be used as a substitute for consulting with a legal professional who is knowledgeable about trademark law.