U.S. patent application number 15/438520 was filed with the patent office on 2017-08-10 for plants with enhanced size and growth rate. The applicant listed for this patent is Mendel Biotechnology, Inc.. Invention is credited to Robert A. Creelman, Neal I. Gutterson, Oliver J. Ratcliffe, Peter P. Repetti.
Application Number | 20170226528 15/438520 |
Document ID | / |
Family ID | 39082532 |
Filed Date | 2017-08-10 |
United States Patent Application | 20170226528 |
Kind Code | A1 |
Ratcliffe; Oliver J. ; et al. | August 10, 2017 |
Polynucleotides and polypeptides incorporated into expression vectors have been introduced into plants and were ectopically expressed. The polypeptides of the invention regulate transcription in these plants and have been shown to confer at least one regulatory activity that results in increased size, biomass, growth rate, and/or yield as compared to a control plant.
Inventors: | Ratcliffe; Oliver J.; (Hayward, CA) ; Repetti; Peter P.; (Emeryville, CA) ; Gutterson; Neal I.; (Oakland, CA) ; Creelman; Robert A.; (Castro Valley, CA) | ||||||||||
Applicant: |
|
||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|
Family ID: | 39082532 | ||||||||||
Appl. No.: | 15/438520 | ||||||||||
Filed: | February 21, 2017 |
Application Number | Filing Date | Patent Number | ||
---|---|---|---|---|
14590872 | Jan 6, 2015 | |||
15438520 | ||||
12376569 | Feb 5, 2009 | 8927811 | ||
PCT/US07/17321 | Aug 3, 2007 | |||
14590872 | ||||
60836243 | Aug 7, 2006 | |||
Current U.S. Class: | 1/1 |
Current CPC Class: | C12N 15/8261 20130101; Y02A 40/146 20180101; C07K 14/415 20130101; C12N 15/8262 20130101; A01H 1/02 20130101 |
International Class: | C12N 15/82 20060101 C12N015/82; A01H 1/02 20060101 A01H001/02 |
Sequence CWU 1
1
2161953DNAArabidopsis thalianaG929 1ggagagacct ttaacaattt
tctgagggta agatccagag attgattgaa tcagcttact 60attttatata attcagtttg
ttgttcctca gacttgtaac taggacagtc ttctcatgaa 120tcatgacttc
ttcagtacat gagctctctg ataacaatga aagtcatgcg aagaaagaac
180gtccagattc ccaaacccga ccacaggttc cttcaggacg aagttcggaa
tctattgata 240caaactctgt ctactcagag cccatggcac atggattata
cccgtatcca gatccttact 300acagaagcgt ctttgcacag caagcgtatc
ttccacatcc ctatcctggg gtccaattgc 360agttaatggg aatgcagcag
ccaggagttc cattgcaatg tgatgcagtc gaggaacctg 420tttttgttaa
cgcaaagcaa taccatggta tactcaggcg caggcaatcc cgggcaaaac
480ttgaggcacg aaatagagcc atcaaagcaa aaaagccata catgcatgaa
tctcggcatt 540tacatgcgat aagacggcca agaggatgtg gtggccggtt
tctcaatgcc aagaaggaaa 600atggagacca caaggaggag gaggaggcaa
cctctgatga gaacacttca gaagcaagtt 660ccagcctcag gtccgagaaa
ttagctatgg ctacttctgg tcctaatggt agatcttgag 720gaaggtttct
gcacaaccac aagtttagtt tctattttgg gtggatgttc tcagggcatc
780atcgtcttta gtgtttttgg atacgctgtg tacaggttat ttgctagggt
aaactttgtt 840ttagcgatta gaaataaaac taagcaaaga aatgaaaagt
gtgattggaa gtattgttgt 900accaaattga tattctttgc caatgaactc
atgttttgga aagtaaaaaa aaa 9532198PRTArabidopsis thalianaG929
polypeptide (domain in aa coordinates 98-157) 2Met Thr Ser Ser Val
His Glu Leu Ser Asp Asn Asn Glu Ser His Ala 1 5 10 15 Lys Lys Glu
Arg Pro Asp Ser Gln Thr Arg Pro Gln Val Pro Ser Gly 20 25 30 Arg
Ser Ser Glu Ser Ile Asp Thr Asn Ser Val Tyr Ser Glu Pro Met 35 40
45 Ala His Gly Leu Tyr Pro Tyr Pro Asp Pro Tyr Tyr Arg Ser Val Phe
50 55 60 Ala Gln Gln Ala Tyr Leu Pro His Pro Tyr Pro Gly Val Gln
Leu Gln 65 70 75 80 Leu Met Gly Met Gln Gln Pro Gly Val Pro Leu Gln
Cys Asp Ala Val 85 90 95 Glu Glu Pro Val Phe Val Asn Ala Lys Gln
Tyr His Gly Ile Leu Arg 100 105 110 Arg Arg Gln Ser Arg Ala Lys Leu
Glu Ala Arg Asn Arg Ala Ile Lys 115 120 125 Ala Lys Lys Pro Tyr Met
His Glu Ser Arg His Leu His Ala Ile Arg 130 135 140 Arg Pro Arg Gly
Cys Gly Gly Arg Phe Leu Asn Ala Lys Lys Glu Asn 145 150 155 160 Gly
Asp His Lys Glu Glu Glu Glu Ala Thr Ser Asp Glu Asn Thr Ser 165 170
175 Glu Ala Ser Ser Ser Leu Arg Ser Glu Lys Leu Ala Met Ala Thr Ser
180 185 190 Gly Pro Asn Gly Arg Ser 195 3573DNAArabidopsis
thalianaG2344 3atgacttctt caatccatga gctttctgat aacattggaa
gtcatgagaa gcaagaacag 60agagattctc atttccaacc accaatccct tctgcaagaa
attatgaatc aattgttaca 120agtttagtct actcagaccc ggggactaca
aattccatgg cacctggaca atatccatat 180ccagatcctt actacagaag
catatttgca ccgcctccac aaccgtatac cggggtacat 240ctacagttga
tgggagtgca gcaacaaggc gttcctttac catctgatgc agtcgaggaa
300cctgtttttg ttaacgcaaa gcaataccac ggtatactaa ggcgcagaca
atcaagagca 360agacttgagt ctcagaataa agtcatcaag tcacgtaagc
cgtatttgca tgaatctcgg 420catttgcatg cgataagacg accaagagga
tgtggcgggc ggtttctaaa tgccaagaag 480gaggatgagc atcacgaaga
cagtagtcat gaagaaaaat ccaaccttag cgctggtaaa 540tccgccatgg
ctgcttctag tggtacatct tga 5734190PRTArabidopsis thalianaG2344
polypeptide (domain in aa coordinates 100-159) 4Met Thr Ser Ser Ile
His Glu Leu Ser Asp Asn Ile Gly Ser His Glu 1 5 10 15 Lys Gln Glu
Gln Arg Asp Ser His Phe Gln Pro Pro Ile Pro Ser Ala 20 25 30 Arg
Asn Tyr Glu Ser Ile Val Thr Ser Leu Val Tyr Ser Asp Pro Gly 35 40
45 Thr Thr Asn Ser Met Ala Pro Gly Gln Tyr Pro Tyr Pro Asp Pro Tyr
50 55 60 Tyr Arg Ser Ile Phe Ala Pro Pro Pro Gln Pro Tyr Thr Gly
Val His 65 70 75 80 Leu Gln Leu Met Gly Val Gln Gln Gln Gly Val Pro
Leu Pro Ser Asp 85 90 95 Ala Val Glu Glu Pro Val Phe Val Asn Ala
Lys Gln Tyr His Gly Ile 100 105 110 Leu Arg Arg Arg Gln Ser Arg Ala
Arg Leu Glu Ser Gln Asn Lys Val 115 120 125 Ile Lys Ser Arg Lys Pro
Tyr Leu His Glu Ser Arg His Leu His Ala 130 135 140 Ile Arg Arg Pro
Arg Gly Cys Gly Gly Arg Phe Leu Asn Ala Lys Lys 145 150 155 160 Glu
Asp Glu His His Glu Asp Ser Ser His Glu Glu Lys Ser Asn Leu 165 170
175 Ser Ala Gly Lys Ser Ala Met Ala Ala Ser Ser Gly Thr Ser 180 185
190 51180DNAArabidopsis thalianaG931 5ggaggttctt tgacagacac
atgtatcatc aatcttctct gttgaagcag agagagagag 60agctaattgt tgcctctgag
tcacatggat aagaaagttt catttactag ctctgtggca 120cattcaactc
caccatacct tagtacttcc atctcatggg gacttccaac caaatccaat
180ggtgtgactg aatcactgag tttgaaggtg gtagatgcaa gaccagaacg
tcttataaac 240acaaagaata tcagtttcca ggaccaggat tcatcttcaa
ctctgtcctc tgctcaatct 300tctaacgatg ttacaagtag tggagatgat
aacccctcaa gacaaatctc atttttagca 360cattcagatg tttgtaaagg
atttgaagaa actcaaagga agcgatttgc aattaaatca 420ggctcctcca
cggcaggaat cgctgatatt cactcttctc cttccaaggc taacttctca
480tttcactatg ccgatccaca ttttggtggt ttaatgcctg cggcttacct
accacaggca 540acaatatgga atccccaaat gactcgagtt ccgctaccat
tcgatctcat agagaatgag 600cctgtctttg tcaatgcaaa gcaattccat
gcaattatga ggaggaggca acagcgtgct 660aagctagagg cgcaaaacaa
actaatcaaa gcccgtaagc cgtatcttca tgaatctcga 720catgttcacg
ctcttaaacg acctagagga tctggtggaa gattcctaaa caccaaaaag
780cttcaagaat ctacagatcc aaaacaagac atgccaatcc aacagcaaca
cgcaacggga 840aacatgtcaa gatttgtgct ttatcagttg cagaacagca
atgactgtga ttgttcaacc 900acttctcgct ctgacatcac atctgcttct
gacagcgtta atctctttgg acactctgaa 960tttctgatat cagattgccc
atctcagaca aacccaacaa tgtatgttca tggtcaatca 1020aatgacatgc
atggaggtag gaacacacac catttctctg tccatatctg agccggtgga
1080atctggtaat gtgtacgttc ctacaaaaaa agggaagtca tccttggctg
ctacttcgct 1140tattagctag ttcttatttc acacgctttg tccagatatc
11806328PRTArabidopsis thalianaG931 polypeptide (domain in aa
coordinates 172-231) 6Met Asp Lys Lys Val Ser Phe Thr Ser Ser Val
Ala His Ser Thr Pro 1 5 10 15 Pro Tyr Leu Ser Thr Ser Ile Ser Trp
Gly Leu Pro Thr Lys Ser Asn 20 25 30 Gly Val Thr Glu Ser Leu Ser
Leu Lys Val Val Asp Ala Arg Pro Glu 35 40 45 Arg Leu Ile Asn Thr
Lys Asn Ile Ser Phe Gln Asp Gln Asp Ser Ser 50 55 60 Ser Thr Leu
Ser Ser Ala Gln Ser Ser Asn Asp Val Thr Ser Ser Gly 65 70 75 80 Asp
Asp Asn Pro Ser Arg Gln Ile Ser Phe Leu Ala His Ser Asp Val 85 90
95 Cys Lys Gly Phe Glu Glu Thr Gln Arg Lys Arg Phe Ala Ile Lys Ser
100 105 110 Gly Ser Ser Thr Ala Gly Ile Ala Asp Ile His Ser Ser Pro
Ser Lys 115 120 125 Ala Asn Phe Ser Phe His Tyr Ala Asp Pro His Phe
Gly Gly Leu Met 130 135 140 Pro Ala Ala Tyr Leu Pro Gln Ala Thr Ile
Trp Asn Pro Gln Met Thr 145 150 155 160 Arg Val Pro Leu Pro Phe Asp
Leu Ile Glu Asn Glu Pro Val Phe Val 165 170 175 Asn Ala Lys Gln Phe
His Ala Ile Met Arg Arg Arg Gln Gln Arg Ala 180 185 190 Lys Leu Glu
Ala Gln Asn Lys Leu Ile Lys Ala Arg Lys Pro Tyr Leu 195 200 205 His
Glu Ser Arg His Val His Ala Leu Lys Arg Pro Arg Gly Ser Gly 210 215
220 Gly Arg Phe Leu Asn Thr Lys Lys Leu Gln Glu Ser Thr Asp Pro Lys
225 230 235 240 Gln Asp Met Pro Ile Gln Gln Gln His Ala Thr Gly Asn
Met Ser Arg 245 250 255 Phe Val Leu Tyr Gln Leu Gln Asn Ser Asn Asp
Cys Asp Cys Ser Thr 260 265 270 Thr Ser Arg Ser Asp Ile Thr Ser Ala
Ser Asp Ser Val Asn Leu Phe 275 280 285 Gly His Ser Glu Phe Leu Ile
Ser Asp Cys Pro Ser Gln Thr Asn Pro 290 295 300 Thr Met Tyr Val His
Gly Gln Ser Asn Asp Met His Gly Gly Arg Asn 305 310 315 320 Thr His
His Phe Ser Val His Ile 325 7956DNAGlycine maxG3920 7agttggtgct
aagatgccag ggaaacctga cactgatgat tggcgtgtag agcgtgggga 60gcagattcag
tttcagtctt ccatttactc tcatcatcag ccttggtggc gcggagtggg
120ggaaaatgcc tccaaatcat cttcagatga tcagttaaat ggttcaatcg
tgaatggtat 180cacgcggtct gagaccaatg ataagtcagg cggaggtgtt
gccaaagaat accaaaacat 240caaacatgcc atgttgtcaa ccccatttac
catggagaaa catcttgctc caaatcccca 300gatggaactt gttggtcatt
cagttgtttt aacatctcct tattcagatg cacagtatgg 360tcaaatcttg
actacttacg ggcaacaagt tatgataaat cctcagttgt atggaatgca
420tcatgctaga atgcctttgc cacttgaaat ggaagaggag cctgtttatg
tcaatgcgaa 480gcagtatcat ggtattttga ggcgaagaca gtcacgtgct
aaggctgaga ttgaaaagaa 540agtaatcaaa aacaggaagc catacctcca
tgaatcccgt caccttcatg caatgagaag 600ggcaagaggc aacggtggtc
gctttctcaa cacaaagaag cttgaaaata acaattctaa 660ttccacttca
gacaaaggca acaatactcg tgcaaacgcc tcaacaaact cgcctaacac
720tcaacttttg ttcaccaaca atttgaatct aggctcatca aatgtttcac
aagccacagt 780tcagcacatg cacacagagc agagtttcac tataggttac
cataatggaa atggtcttac 840agcactatac cgttcacaag caaatgggaa
aaaggaggga aactgctttg gtaaagagag 900ggaccctaat ggggatttca
aataacactt ccctcagcca tacagcaaga gttagg 9568303PRTGlycine maxG3920
polypeptide (domain in aa coordinates 149-208) 8Met Pro Gly Lys Pro
Asp Thr Asp Asp Trp Arg Val Glu Arg Gly Glu 1 5 10 15 Gln Ile Gln
Phe Gln Ser Ser Ile Tyr Ser His His Gln Pro Trp Trp 20 25 30 Arg
Gly Val Gly Glu Asn Ala Ser Lys Ser Ser Ser Asp Asp Gln Leu 35 40
45 Asn Gly Ser Ile Val Asn Gly Ile Thr Arg Ser Glu Thr Asn Asp Lys
50 55 60 Ser Gly Gly Gly Val Ala Lys Glu Tyr Gln Asn Ile Lys His
Ala Met 65 70 75 80 Leu Ser Thr Pro Phe Thr Met Glu Lys His Leu Ala
Pro Asn Pro Gln 85 90 95 Met Glu Leu Val Gly His Ser Val Val Leu
Thr Ser Pro Tyr Ser Asp 100 105 110 Ala Gln Tyr Gly Gln Ile Leu Thr
Thr Tyr Gly Gln Gln Val Met Ile 115 120 125 Asn Pro Gln Leu Tyr Gly
Met His His Ala Arg Met Pro Leu Pro Leu 130 135 140 Glu Met Glu Glu
Glu Pro Val Tyr Val Asn Ala Lys Gln Tyr His Gly 145 150 155 160 Ile
Leu Arg Arg Arg Gln Ser Arg Ala Lys Ala Glu Ile Glu Lys Lys 165 170
175 Val Ile Lys Asn Arg Lys Pro Tyr Leu His Glu Ser Arg His Leu His
180 185 190 Ala Met Arg Arg Ala Arg Gly Asn Gly Gly Arg Phe Leu Asn
Thr Lys 195 200 205 Lys Leu Glu Asn Asn Asn Ser Asn Ser Thr Ser Asp
Lys Gly Asn Asn 210 215 220 Thr Arg Ala Asn Ala Ser Thr Asn Ser Pro
Asn Thr Gln Leu Leu Phe 225 230 235 240 Thr Asn Asn Leu Asn Leu Gly
Ser Ser Asn Val Ser Gln Ala Thr Val 245 250 255 Gln His Met His Thr
Glu Gln Ser Phe Thr Ile Gly Tyr His Asn Gly 260 265 270 Asn Gly Leu
Thr Ala Leu Tyr Arg Ser Gln Ala Asn Gly Lys Lys Glu 275 280 285 Gly
Asn Cys Phe Gly Lys Glu Arg Asp Pro Asn Gly Asp Phe Lys 290 295 300
91516DNAArabidopsis thalianaG928 9ctcatggcga tgttggtttc ccaggaaagg
taaaagagac ggagacgaac caaaacaagg 60aagaaagaag aagatcttac atacgaagat
cactctctga ttcactctga gagacaaact 120ggtttacttt ggttctgttt
gacaaaagga gacatgcaaa aataaatctc tatcccttgt 180ttttcttctt
cgcttcatcg attactcaaa gaggttgttg gttgtgagaa taattagctt
240gttaaggaag acgttatgat gcatcagatg ttgaataaga aagattcagc
tactcattcc 300actttgccat accttaatac tagcatctct tggggagtgg
ttccaactga ttccgttgct 360aatcgtcgcg gtcctgctga atcactaagc
ttgaaggttg attcaagacc tgggcatata 420caaactacaa agcaaatcag
ttttcaggac caagattcat cttcaacaca gtccactggt 480caatcttata
ctgaagttgc tagtagtggt gatgataatc cttccagaca aatctccttt
540tcggctaaat caggatctga aataactcaa cggaaggggt ttgcaagtaa
tcctaaacaa 600ggctcgatga ctggatttcc gaatattcac tttgctcctg
cacaggctaa tttctcattt 660cactatgctg atccacatta tggtggttta
ttagctgcaa cttacctacc acaggcacca 720acatgcaatc ctcaaatggt
gagtatgatt cctggtcgtg ttcctttacc agcagagctc 780acagaaactg
atccagtctt tgtcaatgcg aagcaatacc acgcaattat gaggaggaga
840cagcaacgtg ctaagcttga ggctcaaaac aaactaatca gagcccgtaa
gccctatctt 900catgagtctc gacatgttca tgctcttaaa aggccaagag
gatctggtgg aagattccta 960aacaccaaaa aacttcttca agaatccgaa
caggctgctg ctagagaaca agaacaggac 1020aagttaggcc aacaggtaaa
cagaaagacc aacatgtcta gattcgaagc tcatatgctg 1080cagaacaaca
aagaccgcag ctcaaccact tctggctcag acatcacctc tgtttccgac
1140ggtgctgata tctttggaca cactgaattc cagttttcag gtttcccaac
tccgataaac 1200cgagccatgc ttgttcatgg tcagtctaat gacatgcatg
gaggtggaga catgcaccat 1260ttctctgtcc atatctgaga cagtggatct
tggtgctgtg ttcatgttcc caccaagaag 1320gggaagtcat ccttggctac
tactagttct ttcgcttgtt gtaacttcag tgtttttatt 1380tcatattatg
tctgtgttag acatcacaag aacgaccaag atcttcactt tgaaacactc
1440tattaccttt tcatcttctg ttaccatgga tctcttgtct aaactagtga
tatgattctt 1500ctgataaaaa aaaaaa 151610340PRTArabidopsis
thalianaG928 polypeptide (domain in aa coordinates 179-238) 10Met
Met His Gln Met Leu Asn Lys Lys Asp Ser Ala Thr His Ser Thr 1 5 10
15 Leu Pro Tyr Leu Asn Thr Ser Ile Ser Trp Gly Val Val Pro Thr Asp
20 25 30 Ser Val Ala Asn Arg Arg Gly Pro Ala Glu Ser Leu Ser Leu
Lys Val 35 40 45 Asp Ser Arg Pro Gly His Ile Gln Thr Thr Lys Gln
Ile Ser Phe Gln 50 55 60 Asp Gln Asp Ser Ser Ser Thr Gln Ser Thr
Gly Gln Ser Tyr Thr Glu 65 70 75 80 Val Ala Ser Ser Gly Asp Asp Asn
Pro Ser Arg Gln Ile Ser Phe Ser 85 90 95 Ala Lys Ser Gly Ser Glu
Ile Thr Gln Arg Lys Gly Phe Ala Ser Asn 100 105 110 Pro Lys Gln Gly
Ser Met Thr Gly Phe Pro Asn Ile His Phe Ala Pro 115 120 125 Ala Gln
Ala Asn Phe Ser Phe His Tyr Ala Asp Pro His Tyr Gly Gly 130 135 140
Leu Leu Ala Ala Thr Tyr Leu Pro Gln Ala Pro Thr Cys Asn Pro Gln 145
150 155 160 Met Val Ser Met Ile Pro Gly Arg Val Pro Leu Pro Ala Glu
Leu Thr 165 170 175 Glu Thr Asp Pro Val Phe Val Asn Ala Lys Gln Tyr
His Ala Ile Met 180 185 190 Arg Arg Arg Gln Gln Arg Ala Lys Leu Glu
Ala Gln Asn Lys Leu Ile 195 200 205 Arg Ala Arg Lys Pro Tyr Leu His
Glu Ser Arg His Val His Ala Leu 210 215 220 Lys Arg Pro Arg Gly Ser
Gly Gly Arg Phe Leu Asn Thr Lys Lys Leu 225 230 235 240 Leu Gln Glu
Ser Glu Gln Ala Ala Ala Arg Glu Gln Glu Gln Asp Lys 245 250 255 Leu
Gly Gln Gln Val Asn Arg Lys Thr Asn Met Ser Arg Phe Glu Ala 260 265
270 His Met Leu Gln Asn Asn Lys Asp Arg Ser Ser Thr Thr Ser Gly Ser
275 280 285 Asp Ile Thr Ser Val Ser Asp Gly Ala Asp Ile Phe Gly His
Thr Glu 290 295 300 Phe Gln Phe Ser Gly Phe Pro Thr Pro Ile Asn Arg
Ala Met Leu Val 305 310 315 320 His Gly Gln Ser Asn Asp Met His Gly
Gly Gly Asp Met His His Phe 325 330 335 Ser Val His Ile 340
11927DNAArabidopsis thalianaG1782 11atgcaagtgt ttcaaaggaa
agaagattca tcttggggaa actcaatgcc tacaacaaat 60tcaaatattc aaggatctga
atctttcagc ttgactaagg atatgataat gtctacaaca 120caattacccg
cgatgaaaca ttcgggtttg cagctgcaaa atcaagattc aacctcatca
180caatctactg aagaagaatc aggcggcggt gaagttgcaa gctttggaga
atataagcgt 240tatggatgca gcattgttaa taacaatctc tcaggttaca
tcgaaaactt gggaaagcct 300attgaaaatt atactaagtc aattactacc
tcgtcgatgg tgtctcaaga ctctgtgttt 360cctgctccta
cttctggtca aatatcttgg tctcttcaat gtgctgaaac gtcacatttc
420aatggtttct tggctcctga atatgcatca acaccaacgg cgctgccaca
tttagagatg 480atgggtttgg tttcttcaag agtgccattg cctcatcaca
ttcaagagaa tgaaccaata 540tttgtcaatg cgaaacagta tcatgcgatt
ctccgtcgca ggaagcaccg tgctaaactc 600gaagctcaga acaaactcat
caaatgccgt aaaccgtacc ttcatgagtc tcgccatctt 660catgctttaa
agagagctag aggctccggt ggacgtttcc tcaatacaaa gaagcttcaa
720gaatcatcaa actcactgtg ttcttctcaa atggcaaatg gacaaaattt
ctctatgagc 780cctcacggtg gtggaagcgg aatcgggtct agttcgatct
caccgagctc caattcaaac 840tgtatcaaca tgttccaaaa cccgcagttc
agattctcag gttatccgtc aacacaccat 900gcctcagctc tcatgtcagg gacttga
92712308PRTArabidopsis thalianaG1782 polypeptide (domain in aa
coordinates 178-237) 12Met Gln Val Phe Gln Arg Lys Glu Asp Ser Ser
Trp Gly Asn Ser Met 1 5 10 15 Pro Thr Thr Asn Ser Asn Ile Gln Gly
Ser Glu Ser Phe Ser Leu Thr 20 25 30 Lys Asp Met Ile Met Ser Thr
Thr Gln Leu Pro Ala Met Lys His Ser 35 40 45 Gly Leu Gln Leu Gln
Asn Gln Asp Ser Thr Ser Ser Gln Ser Thr Glu 50 55 60 Glu Glu Ser
Gly Gly Gly Glu Val Ala Ser Phe Gly Glu Tyr Lys Arg 65 70 75 80 Tyr
Gly Cys Ser Ile Val Asn Asn Asn Leu Ser Gly Tyr Ile Glu Asn 85 90
95 Leu Gly Lys Pro Ile Glu Asn Tyr Thr Lys Ser Ile Thr Thr Ser Ser
100 105 110 Met Val Ser Gln Asp Ser Val Phe Pro Ala Pro Thr Ser Gly
Gln Ile 115 120 125 Ser Trp Ser Leu Gln Cys Ala Glu Thr Ser His Phe
Asn Gly Phe Leu 130 135 140 Ala Pro Glu Tyr Ala Ser Thr Pro Thr Ala
Leu Pro His Leu Glu Met 145 150 155 160 Met Gly Leu Val Ser Ser Arg
Val Pro Leu Pro His His Ile Gln Glu 165 170 175 Asn Glu Pro Ile Phe
Val Asn Ala Lys Gln Tyr His Ala Ile Leu Arg 180 185 190 Arg Arg Lys
His Arg Ala Lys Leu Glu Ala Gln Asn Lys Leu Ile Lys 195 200 205 Cys
Arg Lys Pro Tyr Leu His Glu Ser Arg His Leu His Ala Leu Lys 210 215
220 Arg Ala Arg Gly Ser Gly Gly Arg Phe Leu Asn Thr Lys Lys Leu Gln
225 230 235 240 Glu Ser Ser Asn Ser Leu Cys Ser Ser Gln Met Ala Asn
Gly Gln Asn 245 250 255 Phe Ser Met Ser Pro His Gly Gly Gly Ser Gly
Ile Gly Ser Ser Ser 260 265 270 Ile Ser Pro Ser Ser Asn Ser Asn Cys
Ile Asn Met Phe Gln Asn Pro 275 280 285 Gln Phe Arg Phe Ser Gly Tyr
Pro Ser Thr His His Ala Ser Ala Leu 290 295 300 Met Ser Gly Thr 305
131011DNAArabidopsis thalianaG1363 13cgtctaccta ctatggtctg
gagattagtt cgtttattga actaatgttt tagacaatgc 60aagagttcca tagtagcaaa
gattcattgc cttgtcctgc aacttcttgg gataactctg 120tcttcaccaa
ctcaaatgtc caaggatcat catccttgac cgataacaac actttaagct
180tgacaatgga gatgaaacaa actggttttc aaatgcagca ctatgattcc
tcctctactc 240aatccactgg aggagaatca tatagtgaag ttgctagctt
aagtgaacct actaatcgtt 300atggccacaa cattgttgtc actcatctct
caggttacaa agaaaacccg gaaaatccta 360ttggaagtca ttcgatatca
aaggtgtctc aagattcagt ggttcttcct attgaggcgg 420cttcttggcc
tttacacggc aatgtaacgc cacatttcaa tggtttcttg tcttttcctt
480atgcatcaca acacacggtg cagcatcctc aaatcagagg gttggttccg
tctagaatgc 540ctttgcctca caacattcca gagaacgaac caattttcgt
caatgcaaaa cagtaccaag 600ccattctccg ccgcagagag cgccgtgcaa
agcttgaagc tcagaacaag ctcatcaaag 660tccgcaaacc atatcttcac
gagtcgcggc acctccatgc actaaagaga gttagaggct 720ctggtggacg
tttcctcaac acaaagaagc atcaagaatc aaattcctca ctatctcctc
780cattcttgat tccacctcat gtcttcaaga actctccagg aaagttccgg
caaatggaca 840tttcaagggg tggggttgtg tctagtgtct cgacaacatc
ttgctcggac ataaccggga 900acaacaacga catgttccag caaaacccac
aattcaggtt ctcaggttat ccatcaaacc 960accatgtctc agtcctcatg
tgagagagct cccgcaagtg gtggatgagg c 101114308PRTArabidopsis
thalianaG1363 polypeptide (domain in aa coordinates 171-230) 14Met
Gln Glu Phe His Ser Ser Lys Asp Ser Leu Pro Cys Pro Ala Thr 1 5 10
15 Ser Trp Asp Asn Ser Val Phe Thr Asn Ser Asn Val Gln Gly Ser Ser
20 25 30 Ser Leu Thr Asp Asn Asn Thr Leu Ser Leu Thr Met Glu Met
Lys Gln 35 40 45 Thr Gly Phe Gln Met Gln His Tyr Asp Ser Ser Ser
Thr Gln Ser Thr 50 55 60 Gly Gly Glu Ser Tyr Ser Glu Val Ala Ser
Leu Ser Glu Pro Thr Asn 65 70 75 80 Arg Tyr Gly His Asn Ile Val Val
Thr His Leu Ser Gly Tyr Lys Glu 85 90 95 Asn Pro Glu Asn Pro Ile
Gly Ser His Ser Ile Ser Lys Val Ser Gln 100 105 110 Asp Ser Val Val
Leu Pro Ile Glu Ala Ala Ser Trp Pro Leu His Gly 115 120 125 Asn Val
Thr Pro His Phe Asn Gly Phe Leu Ser Phe Pro Tyr Ala Ser 130 135 140
Gln His Thr Val Gln His Pro Gln Ile Arg Gly Leu Val Pro Ser Arg 145
150 155 160 Met Pro Leu Pro His Asn Ile Pro Glu Asn Glu Pro Ile Phe
Val Asn 165 170 175 Ala Lys Gln Tyr Gln Ala Ile Leu Arg Arg Arg Glu
Arg Arg Ala Lys 180 185 190 Leu Glu Ala Gln Asn Lys Leu Ile Lys Val
Arg Lys Pro Tyr Leu His 195 200 205 Glu Ser Arg His Leu His Ala Leu
Lys Arg Val Arg Gly Ser Gly Gly 210 215 220 Arg Phe Leu Asn Thr Lys
Lys His Gln Glu Ser Asn Ser Ser Leu Ser 225 230 235 240 Pro Pro Phe
Leu Ile Pro Pro His Val Phe Lys Asn Ser Pro Gly Lys 245 250 255 Phe
Arg Gln Met Asp Ile Ser Arg Gly Gly Val Val Ser Ser Val Ser 260 265
270 Thr Thr Ser Cys Ser Asp Ile Thr Gly Asn Asn Asn Asp Met Phe Gln
275 280 285 Gln Asn Pro Gln Phe Arg Phe Ser Gly Tyr Pro Ser Asn His
His Val 290 295 300 Ser Val Leu Met 305 151959DNAOryza sativaG3924
15gactggatcc tgattggcgt tgcggccaat caggatcctg ctggatcctg attggatcct
60gattggcgtt gcggccgtgt ctctactgtt gttgttgttg tgtttttttt ttctttttta
120cctttttttg ccgccttggt tttgatgcgg agtctggatg tttctacttt
tggatggggt 180ttttttacct tacccgaccg aattcgtggg tggattggat
gcggttcatg gaagggaacg 240ggatcttggt cggctactcg gatggggggt
ttgccggccg gttgggattt cgggaaccgg 300atcgaggaag aggcggagga
gaattgatcc gcggcggcgg cggcggagga ggaggagaga 360atatgtggtg
attttcatct gagcccggtt gcagaagtcc aactgtatcg gagaatctta
420ctcggttcgt actacagtcc cccacgctgg tattcaagga atcttctctg
gatggaccag 480ttcttggttg gtcccggttc tttcatgtcc agttccccat
ggtggttcag ttggcagttg 540tgccctagtt gttgtaggag taattgtcgg
tggctttaaa tggttcatgc tcgtcagttc 600ttccgagcat tccgaggtga
gcgagcatgg agtcgaggcc ggggggaacc aacctcgtgg 660agccgagggg
gcagggcgcg ctgccgtccg gcataccgat ccagcagccg tggtggacga
720cctccgccgg ggtcggggcg gtgtcgcccg ccgtcgtggc gccggggagc
ggtgcgggga 780tcagcctgtc gggcagggat ggcggcggcg acgacgcggc
agaggagagc agcgatgact 840cacgaagatc aggggagacc aaagatggaa
gcactgatca agaaaagcat catgcaacat 900cgcagatgac tgctttggca
tcagactatt taacaccatt ttcacagctg gaactaaacc 960aaccaattgc
ttcggcagca taccagtacc ctgactctta ctatatgggc atggttggtc
1020cctatggacc tcaagctatg tccgcacaga ctcatttcca gctacctgga
ttaactcact 1080ctcgtatgcc gttgcctctt gaaatatctg aggagcctgt
ttatgtaaat gctaagcaat 1140atcatggaat tttaagacgg aggcagtcac
gtgcgaaggc tgaacttgag aaaaaagttg 1200ttaaatcaag aaagccctat
cttcatgagt ctcgtcatca acatgctatg cgaagggcaa 1260gaggaacggg
tggacgcttc ctgaacacaa agaaaaatga agatggtgct cccagtgaga
1320aagccgaacc aaacaaagga gagcagaact ccgggtatcg ccggatccct
cctgacttac 1380agctcctaca gaaggaaaca tgaagtagcg gctcgaaacc
tagaacagtg gcttctgtcc 1440accggcattc actcttgagg gtggattctt
gctccagaat tgtgctgcca tctttcaaat 1500gatcttcatc gtgcaaagta
attatatgta cattcctctg aatgatctat gcaccaattg 1560ttgatcctgg
cagggtaata atctggatgt attgagtcca tcacagtgcg aatgtcacgg
1620gtagatctgc tgttttcagg caattcattc ttggctttct atcccacccg
ttgttgttgc 1680aagttaagct agcagtactt gtctcagtgt ccgtgagacg
tttgtgtaag attaggttaa 1740actagaagtt gtaatgctgt attaagtgtt
tgtatttcta atatgaaccg taacaaggcc 1800agagcagaac tcgttataca
tacaaaaatt gatggccagg tcagtgttac cgtattatta 1860tgcaatggca
gaagcttgca taaggcgtgg tgccactcgt tgctttgctg tatgtttttg
1920agtttcattc gatttatttt cactgttgag tttgtgggt 195916258PRTOryza
sativaG3924 polypeptide (domain in aa coordinates 163-222) 16Met
Glu Ser Arg Pro Gly Gly Thr Asn Leu Val Glu Pro Arg Gly Gln 1 5 10
15 Gly Ala Leu Pro Ser Gly Ile Pro Ile Gln Gln Pro Trp Trp Thr Thr
20 25 30 Ser Ala Gly Val Gly Ala Val Ser Pro Ala Val Val Ala Pro
Gly Ser 35 40 45 Gly Ala Gly Ile Ser Leu Ser Gly Arg Asp Gly Gly
Gly Asp Asp Ala 50 55 60 Ala Glu Glu Ser Ser Asp Asp Ser Arg Arg
Ser Gly Glu Thr Lys Asp 65 70 75 80 Gly Ser Thr Asp Gln Glu Lys His
His Ala Thr Ser Gln Met Thr Ala 85 90 95 Leu Ala Ser Asp Tyr Leu
Thr Pro Phe Ser Gln Leu Glu Leu Asn Gln 100 105 110 Pro Ile Ala Ser
Ala Ala Tyr Gln Tyr Pro Asp Ser Tyr Tyr Met Gly 115 120 125 Met Val
Gly Pro Tyr Gly Pro Gln Ala Met Ser Ala Gln Thr His Phe 130 135 140
Gln Leu Pro Gly Leu Thr His Ser Arg Met Pro Leu Pro Leu Glu Ile 145
150 155 160 Ser Glu Glu Pro Val Tyr Val Asn Ala Lys Gln Tyr His Gly
Ile Leu 165 170 175 Arg Arg Arg Gln Ser Arg Ala Lys Ala Glu Leu Glu
Lys Lys Val Val 180 185 190 Lys Ser Arg Lys Pro Tyr Leu His Glu Ser
Arg His Gln His Ala Met 195 200 205 Arg Arg Ala Arg Gly Thr Gly Gly
Arg Phe Leu Asn Thr Lys Lys Asn 210 215 220 Glu Asp Gly Ala Pro Ser
Glu Lys Ala Glu Pro Asn Lys Gly Glu Gln 225 230 235 240 Asn Ser Gly
Tyr Arg Arg Ile Pro Pro Asp Leu Gln Leu Leu Gln Lys 245 250 255 Glu
Thr 171631DNAOryza sativaG3926 17gagtacaaag gagagagaga gagagactct
agtgcatatt tggcaggaag agccgaagag 60gggaggtgag gatcagagga ggcagcctca
tcgtcatgct tcagcactag tagtagtagt 120gccaccattt ctttgccctc
gatctttccc cagagagaga gagagagaga gagagagtct 180tgattggggg
aggagagagg gagagagaga aagagagagg acagaaaatg tttgtggatc
240ttgagtaatg ccttctaata atgataatgc tgttgcaaga aatggagaat
catcctgtcc 300aatgcatggc caagaccaac tatgattttc ttgccaggaa
taactatcca atgaaacagt 360tagttcagag gaactctgat ggtgactcgt
caccaacaaa gtctggggag tctcaccaag 420aagcatctgc agtaagtgac
agcagtctca acggacaaca cacctcacca caatcagtgt 480ttgtcccctc
agatattaac aacaatgata gttgtgggga gcgggaccat ggcactaagt
540cggtattgtc tttggggaac acagaagctg cctttcctcc ttcaaagttc
gattacaacc 600agccttttgc atgtgtttct tatccatatg gtactgatcc
atattatggt ggagtatcaa 660caggatacac ttcacatgca tttgttcatc
ctcaaattac tggtgctgca aactctagga 720tgccattggc tgttgatcct
tctgtagaag agcccatatt tgtcaatgca aagcaataca 780atgcgatcct
tagaagaagg caaacgcgtg caaaattgga ggcccaaaat aaggcggtga
840aaggtcggaa gccttacctc catgaatctc gacatcatca tgctatgaag
cgagcccgtg 900gatcaggtgg tcggttcctt accaaaaagg agctgctgga
acagcagcag cagcagcagc 960agcagaagcc accaccggca tcagctcagt
ctccaacagg tagagccaga acgagcggcg 1020gtgccgttgt ccttggcaag
aacctgtgcc cagagaacag cacatcctgc tcgccatcga 1080caccgacagg
ctccgagatc tccagcatct catttggggg cggcatgctg gctcaccaag
1140agcacatcag cttcgcatcc gctgatcgcc accccacaat gaaccagaac
caccgtgtcc 1200ccgtcatgag gtgaaaacct cgggatcgcg ggacacgggc
ggttctggtt taccctcact 1260ggcgcactcc ggtgtgcccg tggcaattca
tccttggctt atgaagtatc tacctgataa 1320tagtctgctg tcagtttata
tgcaatgcaa cctctgtcag ataaactctt atagtttgtt 1380ttattgtaag
ctatgactga acgaactgtc gagcagatgg ctaatttgta tgttgtgggt
1440acagaaatcc tgaagctttt gatgtaccta attgcctttt gcttatactc
ttggtgtata 1500cccattacca agttgcctta aaaaccctcc aattatgtaa
tcagtcatgg ttttatagaa 1560ccttgccaca tgtaatcaat cacctgtttt
tgtaaattga tctataaacg ctataggctg 1620ctgtgttatc t 163118317PRTOryza
sativaG3926 polypeptide (domain in aa coordinates 164-222) 18Met
Ile Met Leu Leu Gln Glu Met Glu Asn His Pro Val Gln Cys Met 1 5 10
15 Ala Lys Thr Asn Tyr Asp Phe Leu Ala Arg Asn Asn Tyr Pro Met Lys
20 25 30 Gln Leu Val Gln Arg Asn Ser Asp Gly Asp Ser Ser Pro Thr
Lys Ser 35 40 45 Gly Glu Ser His Gln Glu Ala Ser Ala Val Ser Asp
Ser Ser Leu Asn 50 55 60 Gly Gln His Thr Ser Pro Gln Ser Val Phe
Val Pro Ser Asp Ile Asn 65 70 75 80 Asn Asn Asp Ser Cys Gly Glu Arg
Asp His Gly Thr Lys Ser Val Leu 85 90 95 Ser Leu Gly Asn Thr Glu
Ala Ala Phe Pro Pro Ser Lys Phe Asp Tyr 100 105 110 Asn Gln Pro Phe
Ala Cys Val Ser Tyr Pro Tyr Gly Thr Asp Pro Tyr 115 120 125 Tyr Gly
Gly Val Ser Thr Gly Tyr Thr Ser His Ala Phe Val His Pro 130 135 140
Gln Ile Thr Gly Ala Ala Asn Ser Arg Met Pro Leu Ala Val Asp Pro 145
150 155 160 Ser Val Glu Glu Pro Ile Phe Val Asn Ala Lys Gln Tyr Asn
Ala Ile 165 170 175 Leu Arg Arg Arg Gln Thr Arg Ala Lys Leu Glu Ala
Gln Asn Lys Ala 180 185 190 Val Lys Gly Arg Lys Pro Tyr Leu His Glu
Ser Arg His His His Ala 195 200 205 Met Lys Arg Ala Arg Gly Ser Gly
Gly Arg Phe Leu Thr Lys Lys Glu 210 215 220 Leu Leu Glu Gln Gln Gln
Gln Gln Gln Gln Gln Lys Pro Pro Pro Ala 225 230 235 240 Ser Ala Gln
Ser Pro Thr Gly Arg Ala Arg Thr Ser Gly Gly Ala Val 245 250 255 Val
Leu Gly Lys Asn Leu Cys Pro Glu Asn Ser Thr Ser Cys Ser Pro 260 265
270 Ser Thr Pro Thr Gly Ser Glu Ile Ser Ser Ile Ser Phe Gly Gly Gly
275 280 285 Met Leu Ala His Gln Glu His Ile Ser Phe Ala Ser Ala Asp
Arg His 290 295 300 Pro Thr Met Asn Gln Asn His Arg Val Pro Val Met
Arg 305 310 315 191020DNAOryza sativaG3925 19ccacgcgtcc ggcaaggtga
gaagtgagga ggcagcaagg gaggaggttt gccggagagg 60ggacatgctc cctcctcatc
tcacagaaaa tggcacagta atgattcagt ttggtcataa 120aatgcctgac
tacgagtcat cagctaccca atcaactagt ggatctcctc gtgaagtgtc
180tggaatgagc gaaggaagcc tcaatgagca gaatgatcaa tctggtaatc
ttgatggtta 240cacgaagagt gatgaaggta agatgatgtc agctttatct
ctgggcaaat cagaaactgt 300gtatgcacat tcggaacctg accgtagcca
accctttggc atatcatatc catatgctga 360ttcgttctat ggtggtgctg
tagcgactta tggcacacat gctattatgc atccccagat 420tgtgggcgtg
atgtcatcct cccgagtccc gctaccaata gaaccagcca ccgaagagcc
480tatttatgta aatgcaaagc aataccatgc gattctccga aggagacagc
tccgtgcaaa 540gttagaggct gaaaacaagc tggtgaaaaa ccgcaagccg
tacctccatg aatcccggca 600tcaacacgcg atgaaaagag ctcggggaac
aggggggaga ttcctcaaca caaagcagca 660gcctgaagct tcagatggtg
gcaccccaag gctcgtctct gcaaacggcg ttgtgttctc 720aaagcacgag
cacagcttgt cgtccagtga tctccatcat cgtgcgaaag agggcgcttg
780agatcctcgc cgtttctgtc atggcaaatc atccttggct tatgtgtggt
gcccagcaaa 840aaaaaatctg actgaacctg tgtgtaaact gatgggtatg
ggtgggtttt gtgcaactgt 900tacttgggtg cttgaaatct gtttctgtgt
ttcctctgcc tccttagttt ggagacggtg 960cagctgcagc tggtaccagt
aatctgatca tgctagactt gtgacaaaaa aaaaaaaaaa 102020238PRTOryza
sativaG3925 polypeptide (domain in aa coordinates 138-197) 20Met
Leu Pro Pro His Leu Thr Glu Asn Gly Thr Val Met Ile Gln Phe 1 5 10
15 Gly His Lys Met Pro Asp Tyr Glu Ser Ser Ala Thr Gln Ser Thr Ser
20 25 30 Gly Ser Pro Arg Glu Val Ser Gly Met Ser Glu Gly Ser Leu
Asn Glu 35 40 45 Gln Asn Asp Gln Ser Gly Asn Leu Asp Gly Tyr Thr
Lys Ser Asp Glu 50 55 60 Gly Lys Met Met Ser Ala Leu Ser Leu Gly
Lys Ser Glu Thr Val Tyr 65 70 75
80 Ala His Ser Glu Pro Asp Arg Ser Gln Pro Phe Gly Ile Ser Tyr Pro
85 90 95 Tyr Ala Asp Ser Phe Tyr Gly Gly Ala Val Ala Thr Tyr Gly
Thr His 100 105 110 Ala Ile Met His Pro Gln Ile Val Gly Val Met Ser
Ser Ser Arg Val 115 120 125 Pro Leu Pro Ile Glu Pro Ala Thr Glu Glu
Pro Ile Tyr Val Asn Ala 130 135 140 Lys Gln Tyr His Ala Ile Leu Arg
Arg Arg Gln Leu Arg Ala Lys Leu 145 150 155 160 Glu Ala Glu Asn Lys
Leu Val Lys Asn Arg Lys Pro Tyr Leu His Glu 165 170 175 Ser Arg His
Gln His Ala Met Lys Arg Ala Arg Gly Thr Gly Gly Arg 180 185 190 Phe
Leu Asn Thr Lys Gln Gln Pro Glu Ala Ser Asp Gly Gly Thr Pro 195 200
205 Arg Leu Val Ser Ala Asn Gly Val Val Phe Ser Lys His Glu His Ser
210 215 220 Leu Ser Ser Ser Asp Leu His His Arg Ala Lys Glu Gly Ala
225 230 235 21885DNAZea maysG3921 21atggaggatc attctgtcca
tcccatgtct aagtctaacc atggctcctt gtcaggaaat 60ggttatgaga tgaaacatcc
aggccatgaa gtttgcgata gggattcatc atcggagtct 120gatcggtctc
accaagaagc atcagcagcg agtgaaagca gtccagatga acacacatca
180actcaatcag acaatgatga agatcatggg aaggataatc aggacacatt
gaagccagta 240ttgtccttgg ggaaggaagg ctctgccact ggggccccaa
aattacatta cagcccatct 300tttgcttgta ttccttatac tgctgacgct
tactatggtg ccgttggggt cttgacagga 360tatcctccac atgccattgt
ccatccccag caaaatgata caacgaacac tccgggtatg 420ttacctgtgg
aacctgcaga agaaccaata tatgttaatg caaaacaata ccatgcaatc
480cttaggagga ggcaaacacg tgctaaattg gaggcccaga acaagatggt
gaaaggtcgg 540aagccatacc ttcatgagtc ccgacatcgt catgccatga
aacgggcccg tggctcagga 600gggcggttcc tcaacacaaa gcagctccag
gaccaaaacc agcagtttca ggaagcgtcg 660agtggttcaa tgtgctcaaa
gatcattggc aacagcataa tctcccaaag tggccccacc 720tgcacgccct
cttctggcac tgcaggtgct tcaacagcca gccaggaccg cagctgcttg
780ccctcagttg gcttccgccc cacaaccaac ttcagtgacc aaggtcgagg
aggcttgaag 840ctggccgtga tcggcatgca gcagcgtgtt tccaccataa ggtga
88522294PRTZea maysG3921 polypeptide (domain in aa coordinates
148-207) 22Met Glu Asp His Ser Val His Pro Met Ser Lys Ser Asn His
Gly Ser 1 5 10 15 Leu Ser Gly Asn Gly Tyr Glu Met Lys His Pro Gly
His Glu Val Cys 20 25 30 Asp Arg Asp Ser Ser Ser Glu Ser Asp Arg
Ser His Gln Glu Ala Ser 35 40 45 Ala Ala Ser Glu Ser Ser Pro Asp
Glu His Thr Ser Thr Gln Ser Asp 50 55 60 Asn Asp Glu Asp His Gly
Lys Asp Asn Gln Asp Thr Leu Lys Pro Val 65 70 75 80 Leu Ser Leu Gly
Lys Glu Gly Ser Ala Thr Gly Ala Pro Lys Leu His 85 90 95 Tyr Ser
Pro Ser Phe Ala Cys Ile Pro Tyr Thr Ala Asp Ala Tyr Tyr 100 105 110
Gly Ala Val Gly Val Leu Thr Gly Tyr Pro Pro His Ala Ile Val His 115
120 125 Pro Gln Gln Asn Asp Thr Thr Asn Thr Pro Gly Met Leu Pro Val
Glu 130 135 140 Pro Ala Glu Glu Pro Ile Tyr Val Asn Ala Lys Gln Tyr
His Ala Ile 145 150 155 160 Leu Arg Arg Arg Gln Thr Arg Ala Lys Leu
Glu Ala Gln Asn Lys Met 165 170 175 Val Lys Gly Arg Lys Pro Tyr Leu
His Glu Ser Arg His Arg His Ala 180 185 190 Met Lys Arg Ala Arg Gly
Ser Gly Gly Arg Phe Leu Asn Thr Lys Gln 195 200 205 Leu Gln Asp Gln
Asn Gln Gln Phe Gln Glu Ala Ser Ser Gly Ser Met 210 215 220 Cys Ser
Lys Ile Ile Gly Asn Ser Ile Ile Ser Gln Ser Gly Pro Thr 225 230 235
240 Cys Thr Pro Ser Ser Gly Thr Ala Gly Ala Ser Thr Ala Ser Gln Asp
245 250 255 Arg Ser Cys Leu Pro Ser Val Gly Phe Arg Pro Thr Thr Asn
Phe Ser 260 265 270 Asp Gln Gly Arg Gly Gly Leu Lys Leu Ala Val Ile
Gly Met Gln Gln 275 280 285 Arg Val Ser Thr Ile Arg 290
231405DNAZea maysG3922 23tgggaccctc gaggccggcc gggatacgat
tccgaagaag gtagcgaccc acgcgcgcgg 60gccagaggcc ggaagaggga gatacaggtt
aatttttagg taccagatca tctgatttct 120cagaagcaaa atgttgtttg
gagctcagtg acaccatctt gtaatgcctg tgattttacg 180ggaaatggag
gatcattctg tccatcccat gtctaagtct aaccatggct ccttgtcagg
240aaatggttat gagatgaaac attcaggcca taaagtttgc gatagggatt
catcatcgga 300gtctgatcgg tctcaccaag aagcatcagc agcaagtgaa
agcagtccaa atgaacacac 360atcaactcaa tcagacaatg atgaagatca
tgggaaagat aatcaggaca caatgaagcc 420agtattgtcc ttggggaagg
aaggctctgc ctttttggcc ccaaaattac attacagccc 480atcttttgct
tgtattcctt atactgctga tgcttattat agtggggttg gggtctcgac
540aggatatgct ccacatgcca ttgtatgttc actcttaatc tttcagtttc
tgtcttcctg 600gccacattct gtccatcccc agcaaaatga tacaacgaac
actccgggta tgttacctgt 660ggaacctgca gaagaaccaa tatatgttaa
tgcaaaacaa taccatgcaa tccttaggag 720gaggcaaaca cgtgctaaat
tggaggccca gaacaagatg gtgaaaaatc ggaagccata 780tcttcatgag
tcccgacatc gtcatgccat gaaacgggct cgtggatcag gaggacggtt
840cctcaacaca aagcagctcc aggagcagaa ccagcagtat caggcatcga
gtggttcatt 900gtgctcaaag atcattgcca acagcataat ctcccaaagt
ggccccacct gcacgccctc 960ttctgacact gcaggtcttc agcagccagc
caggaccgcg gctgcttgcc ctcggtgggc 1020ttccgcccca cagccaactt
cagtgagcaa ggtggaggcg gctcgaagct ggtcgtgaac 1080ggcatgcagc
agcgtgtttc caccataagg tgaagagaag tgggcacgac accattccca
1140ggcgcgcact gcctgtggca actcatcctt ggcttttgaa actatggata
tgcaatggac 1200atgtagcttc gagttcctca gaataaccaa acgtgaagaa
tatgcaaagt ccttttgaga 1260tttgctgtag ctgaaagaac tgtggttagg
ttgagtttct tcctggagac tgatccatac 1320atgacatgct acctcgtgct
gagtttctga ggtgaagcca tcgaaacatg accgtgtggt 1380tcagtaaaaa
aaaaaaaaaa aaaaa 140524304PRTZea maysG3922 polypeptide (domain in
aa coordinates 171-230) 24Met Pro Val Ile Leu Arg Glu Met Glu Asp
His Ser Val His Pro Met 1 5 10 15 Ser Lys Ser Asn His Gly Ser Leu
Ser Gly Asn Gly Tyr Glu Met Lys 20 25 30 His Ser Gly His Lys Val
Cys Asp Arg Asp Ser Ser Ser Glu Ser Asp 35 40 45 Arg Ser His Gln
Glu Ala Ser Ala Ala Ser Glu Ser Ser Pro Asn Glu 50 55 60 His Thr
Ser Thr Gln Ser Asp Asn Asp Glu Asp His Gly Lys Asp Asn 65 70 75 80
Gln Asp Thr Met Lys Pro Val Leu Ser Leu Gly Lys Glu Gly Ser Ala 85
90 95 Phe Leu Ala Pro Lys Leu His Tyr Ser Pro Ser Phe Ala Cys Ile
Pro 100 105 110 Tyr Thr Ala Asp Ala Tyr Tyr Ser Gly Val Gly Val Ser
Thr Gly Tyr 115 120 125 Ala Pro His Ala Ile Val Cys Ser Leu Leu Ile
Phe Gln Phe Leu Ser 130 135 140 Ser Trp Pro His Ser Val His Pro Gln
Gln Asn Asp Thr Thr Asn Thr 145 150 155 160 Pro Gly Met Leu Pro Val
Glu Pro Ala Glu Glu Pro Ile Tyr Val Asn 165 170 175 Ala Lys Gln Tyr
His Ala Ile Leu Arg Arg Arg Gln Thr Arg Ala Lys 180 185 190 Leu Glu
Ala Gln Asn Lys Met Val Lys Asn Arg Lys Pro Tyr Leu His 195 200 205
Glu Ser Arg His Arg His Ala Met Lys Arg Ala Arg Gly Ser Gly Gly 210
215 220 Arg Phe Leu Asn Thr Lys Gln Leu Gln Glu Gln Asn Gln Gln Tyr
Gln 225 230 235 240 Ala Ser Ser Gly Ser Leu Cys Ser Lys Ile Ile Ala
Asn Ser Ile Ile 245 250 255 Ser Gln Ser Gly Pro Thr Cys Thr Pro Ser
Ser Asp Thr Ala Gly Leu 260 265 270 Gln Gln Pro Ala Arg Thr Ala Ala
Ala Cys Pro Arg Trp Ala Ser Ala 275 280 285 Pro Gln Pro Thr Ser Val
Ser Lys Val Glu Ala Ala Arg Ser Trp Ser 290 295 300 251155DNAZea
maysG4264 25tggtttcggc aatttgggtt aatttttagg taccagatca tctgatttct
cagaagcaaa 60atgttgtttg gagctcagtg acaccatctt gtaatgcctg tgattttacg
ggaaatggag 120gatcattctg tccatcccat gtctaagtct aaccatggct
ccttgtcagg aaatggttat 180gagatgaaac attcaggcca taaagtttgc
gatagggatt catcatcgga gtctgatcgg 240tctcaccaag aagcatcagc
agcaagtgaa agcagtccaa atgaacacac atcaactcaa 300tcagacaatg
atgaagatca tgggaaagat aatcaggaca caatgaagcc agtattgtcc
360ttggggaagg aaggctctgc ctttttggcc ccaaaattac attacagccc
atcttttgct 420tgtattcctt atacttctga tgcttattat agtgcggttg
gggtcttgac aggatatcct 480ccacatgcca ttgtccatcc ccagcaaaat
gatacaacga acactccggg tatgttacct 540gtggaacctg cagaagaacc
aatatatgtt aatgcaaaac aataccatgc aatccttagg 600aggaggcaaa
cacgtgctaa attggaggcc cagaacaaga tggtgaaaaa tcggaagcca
660tatcttcatg agtcccgaca tcgtcatgcc atgaaacggg ctcgtggatc
aggaggacgg 720ttcctcaaca caaagcagct ccaggagcag aaccagcagt
atcaggcatc gagtggttca 780ttgtgctcaa agatcattgc caacagcata
atctcccaaa gtggccccac ctgcacgccc 840tcttctggca ctgcaggtgc
ttcaacagcc ggccaggacc gcagctgctt gccctcagtt 900ggcttccgcc
ccacgacaaa cttcagtgac caaggtcgag gaggcttgaa gttggccgtg
960atcggcatgc agcagcgtgt ttccaccata aggtgaagag aagtgggcac
aacaccattc 1020ccaggcacac tgcctgtggc aactcatcct tggctcttgg
aactttgaat atgcaatcga 1080catgtagctt gagatcctca gaataaacca
aaccttcagt tatatgcaag ccttttttga 1140ggttgctgtt gctgt
115526300PRTZea maysG4264 polypeptide (domain in aa coordinates
155-214) 26Met Pro Val Ile Leu Arg Glu Met Glu Asp His Ser Val His
Pro Met 1 5 10 15 Ser Lys Ser Asn His Gly Ser Leu Ser Gly Asn Gly
Tyr Glu Met Lys 20 25 30 His Ser Gly His Lys Val Cys Asp Arg Asp
Ser Ser Ser Glu Ser Asp 35 40 45 Arg Ser His Gln Glu Ala Ser Ala
Ala Ser Glu Ser Ser Pro Asn Glu 50 55 60 His Thr Ser Thr Gln Ser
Asp Asn Asp Glu Asp His Gly Lys Asp Asn 65 70 75 80 Gln Asp Thr Met
Lys Pro Val Leu Ser Leu Gly Lys Glu Gly Ser Ala 85 90 95 Phe Leu
Ala Pro Lys Leu His Tyr Ser Pro Ser Phe Ala Cys Ile Pro 100 105 110
Tyr Thr Ser Asp Ala Tyr Tyr Ser Ala Val Gly Val Leu Thr Gly Tyr 115
120 125 Pro Pro His Ala Ile Val His Pro Gln Gln Asn Asp Thr Thr Asn
Thr 130 135 140 Pro Gly Met Leu Pro Val Glu Pro Ala Glu Glu Pro Ile
Tyr Val Asn 145 150 155 160 Ala Lys Gln Tyr His Ala Ile Leu Arg Arg
Arg Gln Thr Arg Ala Lys 165 170 175 Leu Glu Ala Gln Asn Lys Met Val
Lys Asn Arg Lys Pro Tyr Leu His 180 185 190 Glu Ser Arg His Arg His
Ala Met Lys Arg Ala Arg Gly Ser Gly Gly 195 200 205 Arg Phe Leu Asn
Thr Lys Gln Leu Gln Glu Gln Asn Gln Gln Tyr Gln 210 215 220 Ala Ser
Ser Gly Ser Leu Cys Ser Lys Ile Ile Ala Asn Ser Ile Ile 225 230 235
240 Ser Gln Ser Gly Pro Thr Cys Thr Pro Ser Ser Gly Thr Ala Gly Ala
245 250 255 Ser Thr Ala Gly Gln Asp Arg Ser Cys Leu Pro Ser Val Gly
Phe Arg 260 265 270 Pro Thr Thr Asn Phe Ser Asp Gln Gly Arg Gly Gly
Leu Lys Leu Ala 275 280 285 Val Ile Gly Met Gln Gln Arg Val Ser Thr
Ile Arg 290 295 300 27912DNAArabidopsis thalianaG2632 27atgggaattg
aagacatgca ttcaaaatct gacagtggtg ggaacaaggt tgattcagag 60gttcatggta
cagtatcgtc gtcgataaat agtttaaacc cttggcatcg tgctgctgct
120gcttgcaatg caaattctag tgtggaagct ggagataaat cttctaagtc
aatagcatta 180gcattggaat caaacggttc caaatcacca tccaatagag
ataatactgt taacaaggaa 240tcacaagtca caacgtctcc acaatcagct
ggagattata gtgataaaaa ccaagaatct 300ctgcatcatg gcatcacaca
acctcctcct caccctcaac ttgttggcca cacagttgga 360tgggcatcct
caaatccata ccaggatcca tattatgcag gagtgatggg agcctatgga
420catcatcccc tggggtttgt tccatatggt gggatgcctc attcaagaat
gccactgccg 480cctgagatgg cacaagaacc agttttcgtg aatgctaaac
agtaccaggc gattctgagg 540cgaaggcagg cacgcgccaa ggcagagcta
gagaagaagc taataaaatc cagaaagcct 600tatctacatg aatctcggca
tcaacatgct atgaggaggc caaggggtac tggaggacgg 660tttgcaaaga
aaaccaacac cgaagcttca aagcgtaaag ctgaagaaaa gagcaatggt
720catgttactc agtccccgtc atcatctaat tctgatcaag gtgaagcttg
gaatggtgac 780tatagaacac ctcagggaga tgagatgcag agctcagctt
ataagagaag ggaagaagga 840gagtgttcag ggcagcaatg gaacagcctt
tcctcaaacc atccttctca agctcgtcta 900gccattaaat ga
91228303PRTArabidopsis thalianaG2632 polypeptide (domain in aa
coordinates 166-223) 28Met Gly Ile Glu Asp Met His Ser Lys Ser Asp
Ser Gly Gly Asn Lys 1 5 10 15 Val Asp Ser Glu Val His Gly Thr Val
Ser Ser Ser Ile Asn Ser Leu 20 25 30 Asn Pro Trp His Arg Ala Ala
Ala Ala Cys Asn Ala Asn Ser Ser Val 35 40 45 Glu Ala Gly Asp Lys
Ser Ser Lys Ser Ile Ala Leu Ala Leu Glu Ser 50 55 60 Asn Gly Ser
Lys Ser Pro Ser Asn Arg Asp Asn Thr Val Asn Lys Glu 65 70 75 80 Ser
Gln Val Thr Thr Ser Pro Gln Ser Ala Gly Asp Tyr Ser Asp Lys 85 90
95 Asn Gln Glu Ser Leu His His Gly Ile Thr Gln Pro Pro Pro His Pro
100 105 110 Gln Leu Val Gly His Thr Val Gly Trp Ala Ser Ser Asn Pro
Tyr Gln 115 120 125 Asp Pro Tyr Tyr Ala Gly Val Met Gly Ala Tyr Gly
His His Pro Leu 130 135 140 Gly Phe Val Pro Tyr Gly Gly Met Pro His
Ser Arg Met Pro Leu Pro 145 150 155 160 Pro Glu Met Ala Gln Glu Pro
Val Phe Val Asn Ala Lys Gln Tyr Gln 165 170 175 Ala Ile Leu Arg Arg
Arg Gln Ala Arg Ala Lys Ala Glu Leu Glu Lys 180 185 190 Lys Leu Ile
Lys Ser Arg Lys Pro Tyr Leu His Glu Ser Arg His Gln 195 200 205 His
Ala Met Arg Arg Pro Arg Gly Thr Gly Gly Arg Phe Ala Lys Lys 210 215
220 Thr Asn Thr Glu Ala Ser Lys Arg Lys Ala Glu Glu Lys Ser Asn Gly
225 230 235 240 His Val Thr Gln Ser Pro Ser Ser Ser Asn Ser Asp Gln
Gly Glu Ala 245 250 255 Trp Asn Gly Asp Tyr Arg Thr Pro Gln Gly Asp
Glu Met Gln Ser Ser 260 265 270 Ala Tyr Lys Arg Arg Glu Glu Gly Glu
Cys Ser Gly Gln Gln Trp Asn 275 280 285 Ser Leu Ser Ser Asn His Pro
Ser Gln Ala Arg Leu Ala Ile Lys 290 295 300 291274DNAArabidopsis
thalianaG1334 29atagctccca actaatagga atctcaagct tctcactctc
tcttgttttt ccattggact 60tttggaacat aagctatgca aactgaggag cttttgtcgc
caccacagac tccttggtgg 120aatgcttttg gatctcagcc gttgactaca
gagagccttt ccggcgaagc ttctgattca 180ttcaccggag ttaaggcagt
tactacggag gcagaacaag gtgtggtgga taaacaaact 240tctacaactc
tcttcacttt ctcacctggt ggtgaaaaga gttcaagaga tgtgccaaag
300cctcatgttg ctttcgcgat gcaatcagct tgcttcgagt ttggatttgc
tcagccaatg 360atgtacacaa agcatcctca tgttgaacaa tactatggag
ttgtttcagc atacggatct 420cagaggtctt cgggccgagt aatgattcca
ctgaagatgg agacagaaga agatggtacc 480atctatgtga actcaaagca
gtaccatgga attatcaggc gacgccagtc ccgagcaaag 540gctgaaaaac
tgagtagatg ccgtaagcca tatatgcatc actcacgcca tctccatgct
600atgcgccgtc ctagaggatc tggcgggcgt ttcttgaaca ccaagacagc
tgatgcggct 660aagcagtcta agccgagtaa ttctcagagt tctgaagtct
ttcatccgga aaatgagacc 720ataaactcat cgagggaagc aaatgagtca
aatctctcgg attctgcagt tacaagtatg 780gattactttc taagttcgtc
ggcttattct cctggtggca tggtcatgcc tatcaagtgg 840aatgcagcag
caatggatat tggctgctgc aaacttaata tatgatcagc agatagggga
900caagacatga ttggtcacca gtccttttgt cttgtccctt atctttcagc
caaacggaaa 960gagaacttgt gtcttggaaa aaagacattg agtttccttg
gtttataaga ttggtccttt 1020taccatccgt ttggctgtaa acaggcaaat
catctttggc tcatgcttca tcaagttctt 1080atcttcgtct gttttcttct
acgcatcttc ataagatctc tgaactagtg aataacattt 1140cctagcatca
tgtttcaact agtgtgtgtt gtaagaaact ctgccttatt tccagatgat
1200gtattgtgtg taacgtgttt atgaaacaaa cgtaagactt tcaagttaaa
aaaaaaaaaa 1260aaaaaaaaaa
aaaa 127430269PRTArabidopsis thalianaG1334 polypeptide (domain in
aa coordinates 133-190) 30Met Gln Thr Glu Glu Leu Leu Ser Pro Pro
Gln Thr Pro Trp Trp Asn 1 5 10 15 Ala Phe Gly Ser Gln Pro Leu Thr
Thr Glu Ser Leu Ser Gly Glu Ala 20 25 30 Ser Asp Ser Phe Thr Gly
Val Lys Ala Val Thr Thr Glu Ala Glu Gln 35 40 45 Gly Val Val Asp
Lys Gln Thr Ser Thr Thr Leu Phe Thr Phe Ser Pro 50 55 60 Gly Gly
Glu Lys Ser Ser Arg Asp Val Pro Lys Pro His Val Ala Phe 65 70 75 80
Ala Met Gln Ser Ala Cys Phe Glu Phe Gly Phe Ala Gln Pro Met Met 85
90 95 Tyr Thr Lys His Pro His Val Glu Gln Tyr Tyr Gly Val Val Ser
Ala 100 105 110 Tyr Gly Ser Gln Arg Ser Ser Gly Arg Val Met Ile Pro
Leu Lys Met 115 120 125 Glu Thr Glu Glu Asp Gly Thr Ile Tyr Val Asn
Ser Lys Gln Tyr His 130 135 140 Gly Ile Ile Arg Arg Arg Gln Ser Arg
Ala Lys Ala Glu Lys Leu Ser 145 150 155 160 Arg Cys Arg Lys Pro Tyr
Met His His Ser Arg His Leu His Ala Met 165 170 175 Arg Arg Pro Arg
Gly Ser Gly Gly Arg Phe Leu Asn Thr Lys Thr Ala 180 185 190 Asp Ala
Ala Lys Gln Ser Lys Pro Ser Asn Ser Gln Ser Ser Glu Val 195 200 205
Phe His Pro Glu Asn Glu Thr Ile Asn Ser Ser Arg Glu Ala Asn Glu 210
215 220 Ser Asn Leu Ser Asp Ser Ala Val Thr Ser Met Asp Tyr Phe Leu
Ser 225 230 235 240 Ser Ser Ala Tyr Ser Pro Gly Gly Met Val Met Pro
Ile Lys Trp Asn 245 250 255 Ala Ala Ala Met Asp Ile Gly Cys Cys Lys
Leu Asn Ile 260 265 311388DNAArabidopsis thalianaG926 31ccaaaatcta
gggttttctt ctcgcccaat ttcacttttc ttctacgaaa ttctccattc 60ctgccggctg
tcgggttttc tgaatcgatt ctccttcacc aacttcttct ctggttctgt
120tcgattctga ttttttttca aggtcaattt tttcttctct ttaaactctg
caaaatcgtg 180atcgattaaa ttcacctcag ggttttttga tttctgaaag
aagttaatct tcttcgaagg 240cgattgcaaa agagtgctct gctgtgaatt
tccactgaga tgcaatcaaa accgggaaga 300gaaaacgaag aggaagtcaa
taatcaccat gctgttcagc agccgatgat gtatgcagag 360ccctggtgga
aaaacaactc ctttggtgtt gtacctcaag cgagaccttc tggaattcca
420tcaaattcct cttctttgga ttgccccaat ggttccgagt caaacgatgt
tcattcagca 480tctgaagacg gtgcgttgaa tggtgaaaac gatggcactt
ggaaggattc acaagctgca 540acttcctctc gttcagataa tcacggaatg
gaaggaaatg acccagcgct ctctatccgt 600aacatgcatg atcagccact
tgtacaacca ccagagcttg ttggacacta tatcgcttgt 660gtcccaaacc
catatcagga tccatattat gggggattga tgggagcata tggtcatcag
720caattgggtt ttcgtccata tcttggaatg cctcgtgaaa gaacagctct
gccacttgac 780atggcacaag agcccgttta tgtgaatgca aagcagtacg
agggaattct aaggcgaaga 840aaagcacgtg ccaaggcaga gctagagagg
aaagtcatcc gggacagaaa gccatatctt 900cacgagtcaa gacacaagca
tgcaatgaga agggcacgag cgagtggagg ccggtttgcg 960aagaaaagtg
aggtagaagc gggagaggat gcaggaggga gagacagaga aaggggttca
1020gcaaccaact catcaggctc tgaacaagtt gagacagact ctaatgagac
cctgaattct 1080tctggtgcac cataataaaa aaagccaaag ctctgagagg
agagagagac acacactttg 1140gctaatataa tccattgcct caaaccggca
aatcattctt ggctttttcg tttttgtgtt 1200tgctagttgt tcttgtcaga
gtctcatatt gtgtgggttt aacagttatg atgaatgtac 1260aaagagcgag
ttatgttagg tgttagattt tggagacaag agacaaagga atagcaagta
1320ggtcttgttt ttattctttg accttttttt tctcttttgc aaaattgaaa
aatacgtttg 1380cttaaaaa 138832271PRTArabidopsis thalianaG926
polypeptide (domain in aa coordinates 171-228) 32Met Gln Ser Lys
Pro Gly Arg Glu Asn Glu Glu Glu Val Asn Asn His 1 5 10 15 His Ala
Val Gln Gln Pro Met Met Tyr Ala Glu Pro Trp Trp Lys Asn 20 25 30
Asn Ser Phe Gly Val Val Pro Gln Ala Arg Pro Ser Gly Ile Pro Ser 35
40 45 Asn Ser Ser Ser Leu Asp Cys Pro Asn Gly Ser Glu Ser Asn Asp
Val 50 55 60 His Ser Ala Ser Glu Asp Gly Ala Leu Asn Gly Glu Asn
Asp Gly Thr 65 70 75 80 Trp Lys Asp Ser Gln Ala Ala Thr Ser Ser Arg
Ser Asp Asn His Gly 85 90 95 Met Glu Gly Asn Asp Pro Ala Leu Ser
Ile Arg Asn Met His Asp Gln 100 105 110 Pro Leu Val Gln Pro Pro Glu
Leu Val Gly His Tyr Ile Ala Cys Val 115 120 125 Pro Asn Pro Tyr Gln
Asp Pro Tyr Tyr Gly Gly Leu Met Gly Ala Tyr 130 135 140 Gly His Gln
Gln Leu Gly Phe Arg Pro Tyr Leu Gly Met Pro Arg Glu 145 150 155 160
Arg Thr Ala Leu Pro Leu Asp Met Ala Gln Glu Pro Val Tyr Val Asn 165
170 175 Ala Lys Gln Tyr Glu Gly Ile Leu Arg Arg Arg Lys Ala Arg Ala
Lys 180 185 190 Ala Glu Leu Glu Arg Lys Val Ile Arg Asp Arg Lys Pro
Tyr Leu His 195 200 205 Glu Ser Arg His Lys His Ala Met Arg Arg Ala
Arg Ala Ser Gly Gly 210 215 220 Arg Phe Ala Lys Lys Ser Glu Val Glu
Ala Gly Glu Asp Ala Gly Gly 225 230 235 240 Arg Asp Arg Glu Arg Gly
Ser Ala Thr Asn Ser Ser Gly Ser Glu Gln 245 250 255 Val Glu Thr Asp
Ser Asn Glu Thr Leu Asn Ser Ser Gly Ala Pro 260 265 270
331385DNAArabidopsis thalianaG927 33ggaatctgaa gctcttctct
actctctact ctatcactcc atctgtgaac atatctttct 60tattcttcta ggcactatct
atttttcact ttttgtaatt ggaatttgga gatggctatg 120caaactgtga
gagaaggtct cttctctgct ccacagactt cttggtggac tgcttttgga
180tctcagccgt tggctccgga gagtctcgcc ggcgattctg actcattcgc
cggagttaag 240gtcggatctg tcggagagac aagacaacgt gtggataaac
agagcaactc tgcaacgcac 300ttagctttct cacttggtga tgtaaagagt
ccaagacttg tgccaaagcc tcatggagct 360actttctcaa tgcaatcacc
ttgcttggaa cttggatttt ctcagccacc gatctataca 420aagtatccct
atggagaaca acaatactat ggagttgttt cagcctatgg atctcagagc
480agggtaatgc ttcctctaaa catggaaacg gaagatagta ccatctatgt
gaactcaaag 540caataccatg gaatcataag gagacgccaa tcccgcgcaa
aggctgctgc tgttcttgat 600cagaagaaat tgagtagtag atgccgcaag
ccatatatgc atcattcgcg ccatctccat 660gcattgcggc gtcctagagg
atccggtggg agattcttga acactaaaag tcagaacttg 720gaaaatagcg
gaaccaatgc aaagaaaggt gatggaagta tgcagattca gtctcagcct
780aagcctcagc aaagtaactc tcagaattct gaagttgttc atccggaaaa
cgggaccatg 840aacttatcga acggattaaa tgtgtcggga tcagaagtta
ctagcatgaa ctacttccta 900agttctcccg ttcattctct tggtggcatg
gtaatgccta gcaagtggat agcagcagca 960gcagcaatgg ataatggctg
ctgcaatttc aaaacctgat cctttaccgt ttcacagtca 1020aacggagaga
gataaagaac tcttgccttg gtataaagga ttttcctttt tgccaatccg
1080ctttggctgt gaacaggcaa atcatctttg gctcattctc tattaaggta
acttcgccgt 1140gaggtgaaaa aagctttgat atatttatct tcagtgtaaa
agtagttaaa actggtgaag 1200aacaatgatg tgtttggtca ctaaacccac
ttgttccaac tagtagtgtg tgttttaaga 1260aaactctgtt atctgatttt
gtagctctct ctggctttgt gtgtttctca aacaactgta 1320acaactttta
agttatgtgg tttatgtaac atatttaaga cctgtgtttt tgtataaaaa 1380aaaaa
138534295PRTArabidopsis thalianaG927 polypeptide (domain in aa
coordinates 136-199) 34Met Ala Met Gln Thr Val Arg Glu Gly Leu Phe
Ser Ala Pro Gln Thr 1 5 10 15 Ser Trp Trp Thr Ala Phe Gly Ser Gln
Pro Leu Ala Pro Glu Ser Leu 20 25 30 Ala Gly Asp Ser Asp Ser Phe
Ala Gly Val Lys Val Gly Ser Val Gly 35 40 45 Glu Thr Arg Gln Arg
Val Asp Lys Gln Ser Asn Ser Ala Thr His Leu 50 55 60 Ala Phe Ser
Leu Gly Asp Val Lys Ser Pro Arg Leu Val Pro Lys Pro 65 70 75 80 His
Gly Ala Thr Phe Ser Met Gln Ser Pro Cys Leu Glu Leu Gly Phe 85 90
95 Ser Gln Pro Pro Ile Tyr Thr Lys Tyr Pro Tyr Gly Glu Gln Gln Tyr
100 105 110 Tyr Gly Val Val Ser Ala Tyr Gly Ser Gln Ser Arg Val Met
Leu Pro 115 120 125 Leu Asn Met Glu Thr Glu Asp Ser Thr Ile Tyr Val
Asn Ser Lys Gln 130 135 140 Tyr His Gly Ile Ile Arg Arg Arg Gln Ser
Arg Ala Lys Ala Ala Ala 145 150 155 160 Val Leu Asp Gln Lys Lys Leu
Ser Ser Arg Cys Arg Lys Pro Tyr Met 165 170 175 His His Ser Arg His
Leu His Ala Leu Arg Arg Pro Arg Gly Ser Gly 180 185 190 Gly Arg Phe
Leu Asn Thr Lys Ser Gln Asn Leu Glu Asn Ser Gly Thr 195 200 205 Asn
Ala Lys Lys Gly Asp Gly Ser Met Gln Ile Gln Ser Gln Pro Lys 210 215
220 Pro Gln Gln Ser Asn Ser Gln Asn Ser Glu Val Val His Pro Glu Asn
225 230 235 240 Gly Thr Met Asn Leu Ser Asn Gly Leu Asn Val Ser Gly
Ser Glu Val 245 250 255 Thr Ser Met Asn Tyr Phe Leu Ser Ser Pro Val
His Ser Leu Gly Gly 260 265 270 Met Val Met Pro Ser Lys Trp Ile Ala
Ala Ala Ala Ala Met Asp Asn 275 280 285 Gly Cys Cys Asn Phe Lys Thr
290 295 351446DNAZea maysG3911 35accacacgtc cgcccacgcg tccgcctacg
cgtcggcgga ctcgcgtgcc cccacgcggg 60cgggcttggc ttgggactgg gccgcccggc
cgcgaggaat aaactcactc ctgccttcat 120acgtatccat agccgcggca
gtacgtgtat gtggttagct atacgcgacc tcagctcggg 180cgcaagctac
aacgccgacc aggcgagaag aagcatcgat agtgtgacga gctaacccac
240cagcagcaac gtaatccaaa tccatggaca accagccgct gccctactcc
acaggccagc 300cccctgcccc cggaggagcc ccggtggcgg gcatgcctgg
cgcggccggc ctcccacccg 360tgccgcacca cctacccgtt ccatctcaag
tgaaagagat gacaactgtc ctaacaaaca 420aactagggct caaaactaac
ttcaaaaaaa tcacccacta aaagcacctt cctcttcctc 480ttcctccgcc
cccaatcccc ctcgtctcac aaccctagct gcccccgaat ccatggatcc
540taacaaatcc agcaccccgc cgccgcctcc agtcatgggt gcccccgttg
cctaccctcc 600gcctgcgtac cctcccggtg tggccgccgg cgccggcgcc
tacccgccgc agctctacgc 660accgccggct gctgccgcgg cccagcaggc
ggcggccgcg cagcagcagc agctgcagat 720attctgggcg gagcagtacc
gcgagatcga ggccactacc gacttcaaga atcacaacct 780cccgctcgcc
cgcatcaaga agatcatgaa agccgacgag gacgtccgca tgatcgccgc
840cgaggctccc gtggtgttcg cccgggcctg cgagatgttc atcctcgagc
tcacccatcg 900cggctgggcg cacgccgaag agaacaagcg ccgcacgctc
cagaaatccg acattgccgc 960tgccatcgcc cgcaccgagg tattcgactt
ccttgtggac atcgttccgc gcgacgacgg 1020taaagacgct gatgcggcgg
ccgccgcagc tgccgcggct gccgggatcc cgcgccccgc 1080cgccggagta
ccagccaccg accctctcgc ctactactac gtgcctcagc agtaatgtat
1140catcatcacg ttattgttcc gtctatgtgc ctgagcaata atgtatcatc
attgccttat 1200tgttccgggg cagttgtgtt atttgtgtct gtttagttgc
tgctgctgtt accgcgtaat 1260agcatatgtg ttatctgtgt ctgtttagtt
gctgctgctg ttgccgcgta ataaaacttg 1320gtcatttacg gggctccctc
aagattaaga attgagttgt ttgatggtag aatcctggta 1380aggttgttgt
aactgggggg cgcctttgtt tgggctggta gtgtatgcct aggcctcact 1440tatctg
144636200PRTZea maysG3911 polypeptide (domain in aa coordinates
83-148) 36Met Asp Pro Asn Lys Ser Ser Thr Pro Pro Pro Pro Pro Val
Met Gly 1 5 10 15 Ala Pro Val Ala Tyr Pro Pro Pro Ala Tyr Pro Pro
Gly Val Ala Ala 20 25 30 Gly Ala Gly Ala Tyr Pro Pro Gln Leu Tyr
Ala Pro Pro Ala Ala Ala 35 40 45 Ala Ala Gln Gln Ala Ala Ala Ala
Gln Gln Gln Gln Leu Gln Ile Phe 50 55 60 Trp Ala Glu Gln Tyr Arg
Glu Ile Glu Ala Thr Thr Asp Phe Lys Asn 65 70 75 80 His Asn Leu Pro
Leu Ala Arg Ile Lys Lys Ile Met Lys Ala Asp Glu 85 90 95 Asp Val
Arg Met Ile Ala Ala Glu Ala Pro Val Val Phe Ala Arg Ala 100 105 110
Cys Glu Met Phe Ile Leu Glu Leu Thr His Arg Gly Trp Ala His Ala 115
120 125 Glu Glu Asn Lys Arg Arg Thr Leu Gln Lys Ser Asp Ile Ala Ala
Ala 130 135 140 Ile Ala Arg Thr Glu Val Phe Asp Phe Leu Val Asp Ile
Val Pro Arg 145 150 155 160 Asp Asp Gly Lys Asp Ala Asp Ala Ala Ala
Ala Ala Ala Ala Ala Ala 165 170 175 Ala Gly Ile Pro Arg Pro Ala Ala
Gly Val Pro Ala Thr Asp Pro Leu 180 185 190 Ala Tyr Tyr Tyr Val Pro
Gln Gln 195 200 37618DNAOryza sativaG3546 37atggagccca aatccaccac
ccctcctccg cctcctccgc cccccgtgct gggcgccccc 60gtcccttacc cgccggcggg
agcctacccc ccacccgtcg ggccctacgc ccacgcgccg 120ccgctctacg
ccccgcctcc ccccgccgcc gccgccgcct ccgccgccgc caccgccgcc
180tcgcagcagg ccgccgccgc gcagctccag aacttctggg cggagcagta
ccgcgagatc 240gagcacacca ccgacttcaa gaaccacaac ctccccctcg
cccgcatcaa gaagatcatg 300aaggccgacg aggacgtccg catgatcgcc
gccgaggccc ccgtcgtgtt cgccagggcg 360tgcgagatgt tcatcctcga
gctcacccac cgcggctggg cgcacgccga ggagaacaag 420cgccgcacgc
tccagaagtc cgacatcgcc gccgccatcg cccgcaccga ggtcttcgac
480ttcctcgtcg acatcgtgcc ccgcgacgag gccaaggacg ccgaggccgc
cgccgccgtt 540gccgccggga tcccccaccc cgccgccggt ttgcccgcca
ccgaccccat ggcctactac 600tatgtccagc cgcagtaa 61838205PRTOryza
sativaG3546 polypeptide (domain in aa coordinates 91-156) 38Met Glu
Pro Lys Ser Thr Thr Pro Pro Pro Pro Pro Pro Pro Pro Val 1 5 10 15
Leu Gly Ala Pro Val Pro Tyr Pro Pro Ala Gly Ala Tyr Pro Pro Pro 20
25 30 Val Gly Pro Tyr Ala His Ala Pro Pro Leu Tyr Ala Pro Pro Pro
Pro 35 40 45 Ala Ala Ala Ala Ala Ser Ala Ala Ala Thr Ala Ala Ser
Gln Gln Ala 50 55 60 Ala Ala Ala Gln Leu Gln Asn Phe Trp Ala Glu
Gln Tyr Arg Glu Ile 65 70 75 80 Glu His Thr Thr Asp Phe Lys Asn His
Asn Leu Pro Leu Ala Arg Ile 85 90 95 Lys Lys Ile Met Lys Ala Asp
Glu Asp Val Arg Met Ile Ala Ala Glu 100 105 110 Ala Pro Val Val Phe
Ala Arg Ala Cys Glu Met Phe Ile Leu Glu Leu 115 120 125 Thr His Arg
Gly Trp Ala His Ala Glu Glu Asn Lys Arg Arg Thr Leu 130 135 140 Gln
Lys Ser Asp Ile Ala Ala Ala Ile Ala Arg Thr Glu Val Phe Asp 145 150
155 160 Phe Leu Val Asp Ile Val Pro Arg Asp Glu Ala Lys Asp Ala Glu
Ala 165 170 175 Ala Ala Ala Val Ala Ala Gly Ile Pro His Pro Ala Ala
Gly Leu Pro 180 185 190 Ala Thr Asp Pro Met Ala Tyr Tyr Tyr Val Gln
Pro Gln 195 200 205 39865DNAZea maysG3909 39ccctgaccgc cgtaacaccc
taggcaatgg agcccaaatc caccacccct cccccgcccc 60ccgtgatggg cgcgcccatc
gcgtatcctc ccccgcccgg cgccgcgtac cccgccgggc 120cgtacgtgca
cgcgccggcg gccgcgctct accctcctcc tcccctgccg ccggcgcccc
180cctcctcgca gcagggcgcc gcggcggcgc accagcagca gctattctgg
gcggagcaat 240accgcgagat cgaggccacc accgacttca agaaccacaa
cctgccgctc gcccgcatca 300agaagatcat gaaggccgac gaggacgtgc
gcatgatcgc cgccgaggcg cccgtcgtct 360tctcccgcgc ctgcgagatg
ttcatcctcg agctcaccca ccgcggctgg gcacacgccg 420aggagaacaa
gcgccgcacg ctgcagaagt ccgacatcgc cgccgccgtc gcgcgcaccg
480aggtcttcga cttcctcgtc gacatcgtgc cgcgggacga ggccaaggac
gccgactccg 540ccgccatggg agcagccggg atcccgcacc ccgccgccgg
cctgcccgcc gccgatccca 600tgggctacta ctacgtccag ccgccgcagt
aacgaatttg cttccttatc atggcttcgc 660ttccatgcag cctttgcggg
ttttagtaaa ctattattat tactgagagt gccctgttgt 720tacccatgct
ctgttgttgc cacccaataa ctcgatgacc tgatgatcat ctgatgtgcc
780ccccgttccg taacaagtga ttccatttct gatttcagag aaaaaaaaaa
aaaaaaaaaa 840aaaaaaaaaa aaaaaaaaaa aaaaa 86540201PRTZea maysG3909
polypeptide (domain in aa coordinates 86-151) 40Met Glu Pro Lys Ser
Thr Thr Pro Pro Pro Pro Pro Val Met Gly Ala 1 5 10 15 Pro Ile Ala
Tyr Pro Pro Pro Pro Gly Ala Ala Tyr Pro Ala Gly Pro 20 25 30 Tyr
Val His Ala Pro Ala Ala Ala Leu Tyr Pro Pro Pro Pro Leu Pro 35 40
45 Pro Ala Pro Pro Ser Ser Gln Gln Gly Ala Ala Ala Ala His Gln Gln
50 55
60 Gln Leu Phe Trp Ala Glu Gln Tyr Arg Glu Ile Glu Ala Thr Thr Asp
65 70 75 80 Phe Lys Asn His Asn Leu Pro Leu Ala Arg Ile Lys Lys Ile
Met Lys 85 90 95 Ala Asp Glu Asp Val Arg Met Ile Ala Ala Glu Ala
Pro Val Val Phe 100 105 110 Ser Arg Ala Cys Glu Met Phe Ile Leu Glu
Leu Thr His Arg Gly Trp 115 120 125 Ala His Ala Glu Glu Asn Lys Arg
Arg Thr Leu Gln Lys Ser Asp Ile 130 135 140 Ala Ala Ala Val Ala Arg
Thr Glu Val Phe Asp Phe Leu Val Asp Ile 145 150 155 160 Val Pro Arg
Asp Glu Ala Lys Asp Ala Asp Ser Ala Ala Met Gly Ala 165 170 175 Ala
Gly Ile Pro His Pro Ala Ala Gly Leu Pro Ala Ala Asp Pro Met 180 185
190 Gly Tyr Tyr Tyr Val Gln Pro Pro Gln 195 200 411256DNAZea
maysG3552 41ttaagaacct agtaggcaga cccggccggc gtagagaggg ggggaggtcg
acgagacaga 60gagagaaggc caagaggctt cctctcccca ttcctccctt ccgtgcccta
gccgagccag 120ccgcgaggaa ggaggcatcc cgccgtctcg cctggcgccc
gcccgtcggc cgaccttctg 180ccgcagcttc caattctaaa aagatcatag
atttttgtgc aagagcgagt ggatatggaa 240ccatcccctc agcctatggg
tgtcgctgcc ggtgggtcac aagtgtatcc tgcctctgcc 300tatccgcctg
cagcaacagt agctcctgct tctgttgtat ctgctggttt acagtcaggg
360cagccattcc cagccaaccc tggtcatatg agtgctcagc accagattgt
ctaccaacaa 420gctcaacaat tccaccaaca gctccagcag cagcaacagc
agcagcttca gcagttctgg 480gtcgaacgca tgactgaaat tgaggcgacg
actgatttca agaaccacaa cttgccactt 540gcgaggataa agaagatcat
gaaggccgat gaagatgttc gcatgatctc agccgaagct 600cctgtggtct
tcgcaaaagc ttgtgagata ttcatactgg agctgacact taggtcgtgg
660atgcacactg aggagaacaa gcgccgcacc ttgcaaaaga atgacattgc
agcagcgatc 720actaggactg acatttatga cttcttggtc gacattgttc
ccagggatga gatgaaggag 780gacggaattg ggcttcctag ggctggtctg
ccacccatgg gagccccagc tgatgcatat 840ccatactact acatgccaca
gcagcaggtg cctggttctg gaatggttta tggtgcccag 900caagggcacc
cagtgactta tttgtggcag gagcctcagc aacagcagga gcaagctcct
960gaagagcagc aatctgcatg aaagtggctg agaatattgc tcagaagcta
tcacctgatt 1020cagagttctc attttaggtt gtccaaactg caggttttct
tagtaatatc gttggttatc 1080aaactgaaac aggcgattct aagtagggtg
tagcatcatg gtagtttcat ttctgcttgt 1140gatgttagtt gaaaggataa
tgattagtgg ctagtggatt aaagttacca taccatttcc 1200ttctattccg
aaagtttgcc tccatgaggc ctctgatatg acgaaaaaat aaaaaa 125642248PRTZea
maysG3552 polypeptide (domain in aa coordinates 100-165) 42Met Glu
Pro Ser Pro Gln Pro Met Gly Val Ala Ala Gly Gly Ser Gln 1 5 10 15
Val Tyr Pro Ala Ser Ala Tyr Pro Pro Ala Ala Thr Val Ala Pro Ala 20
25 30 Ser Val Val Ser Ala Gly Leu Gln Ser Gly Gln Pro Phe Pro Ala
Asn 35 40 45 Pro Gly His Met Ser Ala Gln His Gln Ile Val Tyr Gln
Gln Ala Gln 50 55 60 Gln Phe His Gln Gln Leu Gln Gln Gln Gln Gln
Gln Gln Leu Gln Gln 65 70 75 80 Phe Trp Val Glu Arg Met Thr Glu Ile
Glu Ala Thr Thr Asp Phe Lys 85 90 95 Asn His Asn Leu Pro Leu Ala
Arg Ile Lys Lys Ile Met Lys Ala Asp 100 105 110 Glu Asp Val Arg Met
Ile Ser Ala Glu Ala Pro Val Val Phe Ala Lys 115 120 125 Ala Cys Glu
Ile Phe Ile Leu Glu Leu Thr Leu Arg Ser Trp Met His 130 135 140 Thr
Glu Glu Asn Lys Arg Arg Thr Leu Gln Lys Asn Asp Ile Ala Ala 145 150
155 160 Ala Ile Thr Arg Thr Asp Ile Tyr Asp Phe Leu Val Asp Ile Val
Pro 165 170 175 Arg Asp Glu Met Lys Glu Asp Gly Ile Gly Leu Pro Arg
Ala Gly Leu 180 185 190 Pro Pro Met Gly Ala Pro Ala Asp Ala Tyr Pro
Tyr Tyr Tyr Met Pro 195 200 205 Gln Gln Gln Val Pro Gly Ser Gly Met
Val Tyr Gly Ala Gln Gln Gly 210 215 220 His Pro Val Thr Tyr Leu Trp
Gln Glu Pro Gln Gln Gln Gln Glu Gln 225 230 235 240 Ala Pro Glu Glu
Gln Gln Ser Ala 245 43748DNAArabidopsis thalianaG483 43gagatagctt
agcaatggag cagtcagaag agggtcaaca gcaacagcaa cagggagtga 60tggattatgt
acctcctcat gcttatcaga gtgggccagt aaatgcagct tcccatatgg
120cattccaaca agctcaccac ttccaccacc accatcagca gcaacaacag
cagcagcttc 180agatgttctg ggctaaccaa atgcaagaga tcgagcatac
cactgatttc aagaaccaca 240cccttcccct agcccgcatc aagaagatca
tgaaagctga tgaagatgtg aggatgatct 300ctgcggaggc tcctgtgatt
tttgccaagg cctgtgagat gttcattttg gagctcactc 360tacgtgcttg
gatccacacc gaggagaaca agaggaggac cttgcagaag aacgacatcg
420ccgctgccat ttccaggacc gacgtgtttg atttccttgt ggacataatc
ccgagggacg 480agctgaaaga agaaggttta ggcgtgacca aagggaccat
accatcggtg gtgggttccc 540cgccatacta ttacttgcaa caacagggga
tgatgcaaca ctggccccag gagcaacacc 600ctgatgagtc ttaaaacttt
tcccctttcg tttgtttggt tgtatcgtag taaggtagct 660ctgctctgct
gggaaccatt tctattgtgt tctgtaatga catgttagta tatccccagt
720ctatatctat ggcaatgcag tttctgtt 74844199PRTArabidopsis
thalianaG483 polypeptide (domain in aa coordinates 77-142) 44Met
Glu Gln Ser Glu Glu Gly Gln Gln Gln Gln Gln Gln Gly Val Met 1 5 10
15 Asp Tyr Val Pro Pro His Ala Tyr Gln Ser Gly Pro Val Asn Ala Ala
20 25 30 Ser His Met Ala Phe Gln Gln Ala His His Phe His His His
His Gln 35 40 45 Gln Gln Gln Gln Gln Gln Leu Gln Met Phe Trp Ala
Asn Gln Met Gln 50 55 60 Glu Ile Glu His Thr Thr Asp Phe Lys Asn
His Thr Leu Pro Leu Ala 65 70 75 80 Arg Ile Lys Lys Ile Met Lys Ala
Asp Glu Asp Val Arg Met Ile Ser 85 90 95 Ala Glu Ala Pro Val Ile
Phe Ala Lys Ala Cys Glu Met Phe Ile Leu 100 105 110 Glu Leu Thr Leu
Arg Ala Trp Ile His Thr Glu Glu Asn Lys Arg Arg 115 120 125 Thr Leu
Gln Lys Asn Asp Ile Ala Ala Ala Ile Ser Arg Thr Asp Val 130 135 140
Phe Asp Phe Leu Val Asp Ile Ile Pro Arg Asp Glu Leu Lys Glu Glu 145
150 155 160 Gly Leu Gly Val Thr Lys Gly Thr Ile Pro Ser Val Val Gly
Ser Pro 165 170 175 Pro Tyr Tyr Tyr Leu Gln Gln Gln Gly Met Met Gln
His Trp Pro Gln 180 185 190 Glu Gln His Pro Asp Glu Ser 195
451123DNAGlycine maxG3547 45acagcttttg ttctagcact tcgctgtctg
aggttctgga ttctcagtgt ttgcggagcg 60cagcatcatc ttttagggaa gaatggatca
tcaagggcat agccagaacc catctatggg 120ggttgttggt agtggagctc
aattagcata tggttctaac ccatatcagc caggccaaat 180aactgggcca
ccggggtctg ttgtgacatc agttgggacc attcaatcca ccggtcaacc
240tgctggagct cagcttggac agcatcaact tgcttatcag catattcatc
agcaacaaca 300gcaccagctt cagcaacagc tccaacaatt ttggtcaagc
cagtaccaag aaattgagaa 360ggttactgat tttaagaacc acagtcttcc
cctggcaagg atcaagaaga ttatgaaggc 420tgacgaggat gttaggatga
tatcagctga agcaccagtc atttttgcaa gggcatgtga 480aatgttcata
ttagagttaa ccctgcgctc ttggaatcac actgaagaga acaaaaggcg
540aacacttcag aaaaatgata ttgctgctgc tatcacaagg actgacatct
ttgatttctt 600ggttgacatt gtgcctcgtg aggacttgaa agatgaagtg
cttgcatcaa tcccaagagg 660aacaatgcct gttgcagggc ctgctgatgc
ccttccatac tgctacatgc cgcctcagca 720tccgtcccaa gttggagctg
ctggtgtcat aatgggtaag cctgtgatgg acccaaacat 780gtatgctcag
cagtctcacc cttacatggc accacaaatg tggccacagc caccagacca
840acgacagtcg tctccagaac attagctgat gtgtcgtgga aattaagata
accaggcact 900ggaatcagtt gtgaatgtca aactgaatgg ttgggaaggt
cgatactaca tagcgagcag 960aagctgtagc tgatagttta catgcaatgc
agactataaa catatgtaga taatgtgcta 1020gggaaaactt aaccttatct
ttgatttagc tggaaaaaat ggtatttttc attttaattc 1080acaggtcatc
agatgataat atttatttac tggtgcatag cag 112346260PRTGlycine maxG3547
polypeptide (domain in aa coordinates 102-167) 46Met Asp His Gln
Gly His Ser Gln Asn Pro Ser Met Gly Val Val Gly 1 5 10 15 Ser Gly
Ala Gln Leu Ala Tyr Gly Ser Asn Pro Tyr Gln Pro Gly Gln 20 25 30
Ile Thr Gly Pro Pro Gly Ser Val Val Thr Ser Val Gly Thr Ile Gln 35
40 45 Ser Thr Gly Gln Pro Ala Gly Ala Gln Leu Gly Gln His Gln Leu
Ala 50 55 60 Tyr Gln His Ile His Gln Gln Gln Gln His Gln Leu Gln
Gln Gln Leu 65 70 75 80 Gln Gln Phe Trp Ser Ser Gln Tyr Gln Glu Ile
Glu Lys Val Thr Asp 85 90 95 Phe Lys Asn His Ser Leu Pro Leu Ala
Arg Ile Lys Lys Ile Met Lys 100 105 110 Ala Asp Glu Asp Val Arg Met
Ile Ser Ala Glu Ala Pro Val Ile Phe 115 120 125 Ala Arg Ala Cys Glu
Met Phe Ile Leu Glu Leu Thr Leu Arg Ser Trp 130 135 140 Asn His Thr
Glu Glu Asn Lys Arg Arg Thr Leu Gln Lys Asn Asp Ile 145 150 155 160
Ala Ala Ala Ile Thr Arg Thr Asp Ile Phe Asp Phe Leu Val Asp Ile 165
170 175 Val Pro Arg Glu Asp Leu Lys Asp Glu Val Leu Ala Ser Ile Pro
Arg 180 185 190 Gly Thr Met Pro Val Ala Gly Pro Ala Asp Ala Leu Pro
Tyr Cys Tyr 195 200 205 Met Pro Pro Gln His Pro Ser Gln Val Gly Ala
Ala Gly Val Ile Met 210 215 220 Gly Lys Pro Val Met Asp Pro Asn Met
Tyr Ala Gln Gln Ser His Pro 225 230 235 240 Tyr Met Ala Pro Gln Met
Trp Pro Gln Pro Pro Asp Gln Arg Gln Ser 245 250 255 Ser Pro Glu His
260 47925DNAArabidopsis thalianaG714 47ccacgcgtcc gcgtcaatct
ttgagtttgg tagagaaatg gatcaacaag gacaatcatc 60agctatgaac tatggttcaa
acccatatca aaccaacgcc atgaccacta caccaaccgg 120ttcagaccat
ccagcttacc atcagatcca ccagcaacaa caacaacagc tcactcaaca
180gcttcaatct ttctgggaga ctcaattcaa agagattgag aaaaccactg
atttcaagaa 240ccatagcctt ccattggcaa gaatcaagaa aatcatgaaa
gctgatgaag atgtgcgtat 300gatctcggcc gaggcgcctg ttgtgttcgc
cagggcctgc gagatgttta ttctggagct 360tacgttaagg tcttggaacc
atactgagga gaacaagaga aggacgttgc agaagaatga 420tatcgcggct
gcggtgacta gaactgatat ttttgatttt cttgtggata ttgttcctcg
480ggaggatctt cgtgatgaag tcttgggtgg tgttggtgct gaagctgcta
cagctgcggg 540ttatccgtat ggatacttgc ctcctggaac agctccaatt
gggaacccgg gaatggttat 600gggtaacccg ggcgcgtatc cgccgaaggc
gtatatgggt cagccaatgt ggcaacaacc 660aggacctgag cagcaggatc
ctgacaatta gcttggccta ataaactagc cgtctaattc 720gaagctctcc
ccggtggatc tactcaagaa gaagaatgtt aatagaaaac tattgcgaca
780taaaaagttt ggtgtagtag aataatttct gttttatgat ccatggattt
atcaattgtt 840attcagtttg gtttatcttg tcatcaaact gttttcggtc
aatgtaacaa attcataaat 900tgagaattga acttacaaaa ggcta
92548217PRTArabidopsis thalianaG714 polypeptide (domain in aa
coordinates 71-136) 48Met Asp Gln Gln Gly Gln Ser Ser Ala Met Asn
Tyr Gly Ser Asn Pro 1 5 10 15 Tyr Gln Thr Asn Ala Met Thr Thr Thr
Pro Thr Gly Ser Asp His Pro 20 25 30 Ala Tyr His Gln Ile His Gln
Gln Gln Gln Gln Gln Leu Thr Gln Gln 35 40 45 Leu Gln Ser Phe Trp
Glu Thr Gln Phe Lys Glu Ile Glu Lys Thr Thr 50 55 60 Asp Phe Lys
Asn His Ser Leu Pro Leu Ala Arg Ile Lys Lys Ile Met 65 70 75 80 Lys
Ala Asp Glu Asp Val Arg Met Ile Ser Ala Glu Ala Pro Val Val 85 90
95 Phe Ala Arg Ala Cys Glu Met Phe Ile Leu Glu Leu Thr Leu Arg Ser
100 105 110 Trp Asn His Thr Glu Glu Asn Lys Arg Arg Thr Leu Gln Lys
Asn Asp 115 120 125 Ile Ala Ala Ala Val Thr Arg Thr Asp Ile Phe Asp
Phe Leu Val Asp 130 135 140 Ile Val Pro Arg Glu Asp Leu Arg Asp Glu
Val Leu Gly Gly Val Gly 145 150 155 160 Ala Glu Ala Ala Thr Ala Ala
Gly Tyr Pro Tyr Gly Tyr Leu Pro Pro 165 170 175 Gly Thr Ala Pro Ile
Gly Asn Pro Gly Met Val Met Gly Asn Pro Gly 180 185 190 Ala Tyr Pro
Pro Lys Ala Tyr Met Gly Gln Pro Met Trp Gln Gln Pro 195 200 205 Gly
Pro Glu Gln Gln Asp Pro Asp Asn 210 215 49780DNAOryza sativaG3542
49atggaaccat cctcacagcc tcagcctgtg atgggtgttg ccactggtgg gtcacaagca
60tatcctcctc ctgctgctgc atatccacct caagccatgg ttcctggagc tcctgctgtt
120gttcctcctg gctcacagcc atcagcacca ttccccacta atccagctca
actcagtgct 180cagcaccagc tagtctacca acaagcccag caatttcatc
agcagctgca gcaacagcaa 240cagcagcaac tccgtgagtt ctgggctaac
caaatggaag agattgagca aacaaccgac 300ttcaagaacc acagcttgcc
actcgcaagg ataaagaaga taatgaaggc tgatgaggat 360gtccggatga
tctcggcaga agcccccgtt gtcttcgcaa aggcatgcga ggtattcata
420ttagagttaa cattgaggtc gtggatgcac acggaggaga acaagcgccg
gaccttgcag 480aagaatgaca ttgcagctgc catcaccagg actgatatct
atgacttctt ggtggacata 540gttcccaggg atgaaatgaa agaagaaggg
cttgggcttc cgagggttgg cctaccgcct 600aatgtggggg gcgcagcaga
cacatatcca tattactacg tgccagcgca gcaggggcct 660ggatcaggaa
tgatgtacgg tggacagcaa ggtcacccgg tgacgtatgt gtggcagcag
720cctcaagagc aacaggaaga ggcccctgaa gagcagcact ctctgccaga
aagtagctaa 78050259PRTOryza sativaG3542 polypeptide (domain in aa
coordinates 106-171) 50Met Glu Pro Ser Ser Gln Pro Gln Pro Val Met
Gly Val Ala Thr Gly 1 5 10 15 Gly Ser Gln Ala Tyr Pro Pro Pro Ala
Ala Ala Tyr Pro Pro Gln Ala 20 25 30 Met Val Pro Gly Ala Pro Ala
Val Val Pro Pro Gly Ser Gln Pro Ser 35 40 45 Ala Pro Phe Pro Thr
Asn Pro Ala Gln Leu Ser Ala Gln His Gln Leu 50 55 60 Val Tyr Gln
Gln Ala Gln Gln Phe His Gln Gln Leu Gln Gln Gln Gln 65 70 75 80 Gln
Gln Gln Leu Arg Glu Phe Trp Ala Asn Gln Met Glu Glu Ile Glu 85 90
95 Gln Thr Thr Asp Phe Lys Asn His Ser Leu Pro Leu Ala Arg Ile Lys
100 105 110 Lys Ile Met Lys Ala Asp Glu Asp Val Arg Met Ile Ser Ala
Glu Ala 115 120 125 Pro Val Val Phe Ala Lys Ala Cys Glu Val Phe Ile
Leu Glu Leu Thr 130 135 140 Leu Arg Ser Trp Met His Thr Glu Glu Asn
Lys Arg Arg Thr Leu Gln 145 150 155 160 Lys Asn Asp Ile Ala Ala Ala
Ile Thr Arg Thr Asp Ile Tyr Asp Phe 165 170 175 Leu Val Asp Ile Val
Pro Arg Asp Glu Met Lys Glu Glu Gly Leu Gly 180 185 190 Leu Pro Arg
Val Gly Leu Pro Pro Asn Val Gly Gly Ala Ala Asp Thr 195 200 205 Tyr
Pro Tyr Tyr Tyr Val Pro Ala Gln Gln Gly Pro Gly Ser Gly Met 210 215
220 Met Tyr Gly Gly Gln Gln Gly His Pro Val Thr Tyr Val Trp Gln Gln
225 230 235 240 Pro Gln Glu Gln Gln Glu Glu Ala Pro Glu Glu Gln His
Ser Leu Pro 245 250 255 Glu Ser Ser 51927DNAArabidopsis
thalianaG489 51gaattcggca cgagacccac caagaggatc aaccagtctc
ttccccttca gattctcctt 60tccacagtaa tggatcaaca agaccatgga cagtctggag
ctatgaacta tggcacaaac 120ccataccaaa ccaacccgat gagcaccact
gctgctactg tagcaggagg tgcggcacaa 180ccaggccagc tggcgttcca
ccagatccat cagcagcagc agcagcaaca gctggcacag 240cagcttcaag
cattttggga gaaccaattc aaagagattg agaagactac cgatttcaag
300aagcacagcc ttccccttgc gagaatcaag aaaatcatga aagcggatga
agatgtccgt 360atgatctcgg ctgaggcgcc tgtcgtgttt gcaagggcct
gtgagatgtt catcctggag 420ctgacactca ggtcgtggaa ccacactgag
gagaataaga ggcggacgtt gcagaagaac 480gatattgctg ctgctgtgac
tagaaccgat atttttgatt tccttgtgga tattgttccc 540cgggaggatc
tccgagatga agtcttggga agtattccga ggggcactgt cccggaagct
600gctgctgctg gttacccgta tggatacttg cctgcaggaa ctgctccaat
aggaaatccg 660ggaatggtta tgggtaatcc cggtggtgcg tatccaccta
atccttatat gggtcaacca 720atgtggcaac aacaggcacc tgaccaacct
gaccaggaaa attagcaaga aactgtgagt 780cttcccgctt cttttaggcc
taccttgtag tcttggggtt ttgtttctgt tttcgaataa 840tggtaacctt
tgtataactt atttcagtat cgtctcagtt tggtactatg tcagttttgg
900taaaaaaaaa aaaaaaaaaa aaaaaaa 92752231PRTArabidopsis
thalianaG489 polypeptide (domain in aa
coordinates 81-146) 52Met Asp Gln Gln Asp His Gly Gln Ser Gly Ala
Met Asn Tyr Gly Thr 1 5 10 15 Asn Pro Tyr Gln Thr Asn Pro Met Ser
Thr Thr Ala Ala Thr Val Ala 20 25 30 Gly Gly Ala Ala Gln Pro Gly
Gln Leu Ala Phe His Gln Ile His Gln 35 40 45 Gln Gln Gln Gln Gln
Gln Leu Ala Gln Gln Leu Gln Ala Phe Trp Glu 50 55 60 Asn Gln Phe
Lys Glu Ile Glu Lys Thr Thr Asp Phe Lys Lys His Ser 65 70 75 80 Leu
Pro Leu Ala Arg Ile Lys Lys Ile Met Lys Ala Asp Glu Asp Val 85 90
95 Arg Met Ile Ser Ala Glu Ala Pro Val Val Phe Ala Arg Ala Cys Glu
100 105 110 Met Phe Ile Leu Glu Leu Thr Leu Arg Ser Trp Asn His Thr
Glu Glu 115 120 125 Asn Lys Arg Arg Thr Leu Gln Lys Asn Asp Ile Ala
Ala Ala Val Thr 130 135 140 Arg Thr Asp Ile Phe Asp Phe Leu Val Asp
Ile Val Pro Arg Glu Asp 145 150 155 160 Leu Arg Asp Glu Val Leu Gly
Ser Ile Pro Arg Gly Thr Val Pro Glu 165 170 175 Ala Ala Ala Ala Gly
Tyr Pro Tyr Gly Tyr Leu Pro Ala Gly Thr Ala 180 185 190 Pro Ile Gly
Asn Pro Gly Met Val Met Gly Asn Pro Gly Gly Ala Tyr 195 200 205 Pro
Pro Asn Pro Tyr Met Gly Gln Pro Met Trp Gln Gln Gln Ala Pro 210 215
220 Asp Gln Pro Asp Gln Glu Asn 225 230 53753DNAOryza sativaG3544
53atggagccat catcacaacc tcagccggca attggtgttg ttgctggtgg atcacaagtg
60taccctgcat accggcctgc agcaacagtg cctacagctc ctgctgtcat tcctgccggt
120tcacagccag caccgtcgtt ccctgccaac cctgatcaac tgagtgctca
gcaccagctc 180gtctatcagc aagcccagca atttcaccag cagcttcagc
agcagcaaca gcgtcaactc 240cagcagtttt gggctgaacg tctggtcgat
attgaacaaa ctactgactt caagaaccac 300agcttgccac ttgctaggat
aaagaagatc atgaaggcag atgaggacgt tcgcatgatc 360tccgcagagg
ctcctgtgat ctttgcgaaa gcatgtgaga tattcatact ggagctgacc
420ctgagatcat ggatgcacac ggaggagaac aagcgccgta ccttgcagaa
gaatgacata 480gcagctgcca tcaccaggac ggatatgtac gatttcttgg
tagatatagt tcccagggat 540gacttgaagg aggagggagt tgggctccct
agggctggat tgccgccctt gggtgtccct 600gctgactcat atccgtatgg
ctactatgtg ccacagcagc aggtcccagg tgcaggaata 660gcgtatggtg
gtcagcaagg tcatccgggg tatctgtggc aggatcctca ggaacagcag
720gaagagcctc ctgcagagca gcaaagtgat taa 75354250PRTOryza
sativaG3544 polypeptide (domain in aa coordinates 102-167) 54Met
Glu Pro Ser Ser Gln Pro Gln Pro Ala Ile Gly Val Val Ala Gly 1 5 10
15 Gly Ser Gln Val Tyr Pro Ala Tyr Arg Pro Ala Ala Thr Val Pro Thr
20 25 30 Ala Pro Ala Val Ile Pro Ala Gly Ser Gln Pro Ala Pro Ser
Phe Pro 35 40 45 Ala Asn Pro Asp Gln Leu Ser Ala Gln His Gln Leu
Val Tyr Gln Gln 50 55 60 Ala Gln Gln Phe His Gln Gln Leu Gln Gln
Gln Gln Gln Arg Gln Leu 65 70 75 80 Gln Gln Phe Trp Ala Glu Arg Leu
Val Asp Ile Glu Gln Thr Thr Asp 85 90 95 Phe Lys Asn His Ser Leu
Pro Leu Ala Arg Ile Lys Lys Ile Met Lys 100 105 110 Ala Asp Glu Asp
Val Arg Met Ile Ser Ala Glu Ala Pro Val Ile Phe 115 120 125 Ala Lys
Ala Cys Glu Ile Phe Ile Leu Glu Leu Thr Leu Arg Ser Trp 130 135 140
Met His Thr Glu Glu Asn Lys Arg Arg Thr Leu Gln Lys Asn Asp Ile 145
150 155 160 Ala Ala Ala Ile Thr Arg Thr Asp Met Tyr Asp Phe Leu Val
Asp Ile 165 170 175 Val Pro Arg Asp Asp Leu Lys Glu Glu Gly Val Gly
Leu Pro Arg Ala 180 185 190 Gly Leu Pro Pro Leu Gly Val Pro Ala Asp
Ser Tyr Pro Tyr Gly Tyr 195 200 205 Tyr Val Pro Gln Gln Gln Val Pro
Gly Ala Gly Ile Ala Tyr Gly Gly 210 215 220 Gln Gln Gly His Pro Gly
Tyr Leu Trp Gln Asp Pro Gln Glu Gln Gln 225 230 235 240 Glu Glu Pro
Pro Ala Glu Gln Gln Ser Asp 245 250 551126DNAGlycine maxG3550
55cttgcgccca atttccatgg aactgtaaag agaggatagt tagaagatta aatcttaaag
60cagtaagtca tcatggataa atcagagcag actcaacagc agcagcagca acaacagcat
120gtgatgggag ttgccgcagg ggctagccaa atggcctatt cttctcacta
cccgactgct 180tccatggtgg cttctggcac gcccgctgta actgctcctt
ccccaactca ggctccagct 240gccttctcta gttctgctca ccagcttgca
taccagcaag cacagcattt ccaccaccaa 300cagcagcaac accaacaaca
gcagcttcaa atgttctggt caaaccaaat gcaagaaatt 360gagcaaacaa
ttgactttaa aaaccatagc cttcctcttg ctcggataaa aaagataatg
420aaagctgatg aagatgtccg gatgatttca gcagaagctc cggtcatatt
tgcaaaagct 480tgtgaaatgt tcatattaga gttgacgttg cgatcttgga
tccacacaga agagaacaag 540aggagaactc tacaaaagaa tgatatagca
gctgctattt cgagaaacga tgtttttgat 600ttcttggttg atattattcc
aagagatgag ttgaaagagg aaggacttgg aataaccaag 660gctactattc
cgttagtggg ttctccagct gatatgccat attactatgt ccctccacag
720catcctgttg taggaccacc tgggatgatc atgggcaagc ccattggcgc
tgagcaagca 780acactatatt ctacacagca gcctcgacct cctgtggcgt
tcatgccatg gcctcataca 840caacccctgc aacagcagcc accccaacat
caacaaacag actcatgatg actatgcaat 900tcaattaggt tggaaagtag
cctgcacctt ttgattatta caaatttact taatgccttt 960cagccagctg
tagtttagtg ttgtgcattg aaaaaaagca aaagattgtt ttgaggtttc
1020ttgcactcat ttatgattgt atgagctctt gtgatgagtt acttttggtt
gtgtttacta 1080ttggtgtagt ggttaaatta tttggcagct gtccataacc agagag
112656271PRTGlycine maxG3550 polypeptide (domain in aa coordinates
107-172) 56Met Asp Lys Ser Glu Gln Thr Gln Gln Gln Gln Gln Gln Gln
Gln His 1 5 10 15 Val Met Gly Val Ala Ala Gly Ala Ser Gln Met Ala
Tyr Ser Ser His 20 25 30 Tyr Pro Thr Ala Ser Met Val Ala Ser Gly
Thr Pro Ala Val Thr Ala 35 40 45 Pro Ser Pro Thr Gln Ala Pro Ala
Ala Phe Ser Ser Ser Ala His Gln 50 55 60 Leu Ala Tyr Gln Gln Ala
Gln His Phe His His Gln Gln Gln Gln His 65 70 75 80 Gln Gln Gln Gln
Leu Gln Met Phe Trp Ser Asn Gln Met Gln Glu Ile 85 90 95 Glu Gln
Thr Ile Asp Phe Lys Asn His Ser Leu Pro Leu Ala Arg Ile 100 105 110
Lys Lys Ile Met Lys Ala Asp Glu Asp Val Arg Met Ile Ser Ala Glu 115
120 125 Ala Pro Val Ile Phe Ala Lys Ala Cys Glu Met Phe Ile Leu Glu
Leu 130 135 140 Thr Leu Arg Ser Trp Ile His Thr Glu Glu Asn Lys Arg
Arg Thr Leu 145 150 155 160 Gln Lys Asn Asp Ile Ala Ala Ala Ile Ser
Arg Asn Asp Val Phe Asp 165 170 175 Phe Leu Val Asp Ile Ile Pro Arg
Asp Glu Leu Lys Glu Glu Gly Leu 180 185 190 Gly Ile Thr Lys Ala Thr
Ile Pro Leu Val Gly Ser Pro Ala Asp Met 195 200 205 Pro Tyr Tyr Tyr
Val Pro Pro Gln His Pro Val Val Gly Pro Pro Gly 210 215 220 Met Ile
Met Gly Lys Pro Ile Gly Ala Glu Gln Ala Thr Leu Tyr Ser 225 230 235
240 Thr Gln Gln Pro Arg Pro Pro Val Ala Phe Met Pro Trp Pro His Thr
245 250 255 Gln Pro Leu Gln Gln Gln Pro Pro Gln His Gln Gln Thr Asp
Ser 260 265 270 571223DNAGlycine maxG3548 57caaaccaaac ctctctttct
cagtttctct ctcttagggt tttctcctcc cccattgacc 60caccgtccat cgcaaaggaa
gtcgcgccca atttccatgg actcagcagc aacatcagca 120tgggatgggc
gttgccacag gtgctagcca aatggcctat tcttctcact acccgactgc
180tcccatggtg gcttctggca cgcctgctgt agctgttcct tccccaactc
aggctccagc 240tgccttctct agttctgctc accagcttgc ataccagcaa
gcacagcatt tccaccacca 300acagcagcaa caccaacaac agcagcttca
aatgttctgg tcaaaccaaa tgcaagaaat 360tgagcaaaca attgacttta
aaaaccacag tcttcctctt gctcggataa aaaagataat 420gaaagctgat
gaagatgtcc ggatgatttc tgcagaagct ccagtcatat ttgcaaaagc
480atgtgaaatg ttcatattag agttgacgtt gagatcttgg atccacacag
aagagaacaa 540gaggagaact ctacaaaaga atgatatagc agctgctatt
tcgagaaacg atgtttttga 600tttcttggtt gatattatcc caagagatga
gttgaaagag gaaggacttg gaataaccaa 660ggctactatt ccattggtga
attctccagc tgatatgcca tattactatg tccctccaca 720gcatcctgtt
gtaggacctc ctgggatgat catgggcaag cccgttggtg ctgagcaagc
780aacgctgtat tctacacagc agcctcgacc tcccatggcg ttcatgccat
ggccccatac 840acaaccccag caacagcagc caccccaaca tcaacaaaca
gactcatgat gaccatgcaa 900ttcaattagg tcggaaagta gcatgcacct
tatgattatt acaaatttac ttaatgcctt 960taagtcagct gtagtttagt
gttttgcatt gaaaaatgcc aaagattgtt tgaggtttct 1020tgcactcatt
tatgattgta tgagctctta tgctgagtta cttttggttg tgtttatttg
1080aggtactggt gtggtagtta aattagtttg tagctgtcca taagtaaaca
gcgtagctgc 1140ttaattagga ggtctgaaat gatgaaatag tttgtattgt
tattgcagaa ggtaggtttt 1200attcagtatt tcattctact gca
122358254PRTGlycine maxG3548 polypeptide (domain in aa coordinates
90-155) 58Met Gly Val Ala Thr Gly Ala Ser Gln Met Ala Tyr Ser Ser
His Tyr 1 5 10 15 Pro Thr Ala Pro Met Val Ala Ser Gly Thr Pro Ala
Val Ala Val Pro 20 25 30 Ser Pro Thr Gln Ala Pro Ala Ala Phe Ser
Ser Ser Ala His Gln Leu 35 40 45 Ala Tyr Gln Gln Ala Gln His Phe
His His Gln Gln Gln Gln His Gln 50 55 60 Gln Gln Gln Leu Gln Met
Phe Trp Ser Asn Gln Met Gln Glu Ile Glu 65 70 75 80 Gln Thr Ile Asp
Phe Lys Asn His Ser Leu Pro Leu Ala Arg Ile Lys 85 90 95 Lys Ile
Met Lys Ala Asp Glu Asp Val Arg Met Ile Ser Ala Glu Ala 100 105 110
Pro Val Ile Phe Ala Lys Ala Cys Glu Met Phe Ile Leu Glu Leu Thr 115
120 125 Leu Arg Ser Trp Ile His Thr Glu Glu Asn Lys Arg Arg Thr Leu
Gln 130 135 140 Lys Asn Asp Ile Ala Ala Ala Ile Ser Arg Asn Asp Val
Phe Asp Phe 145 150 155 160 Leu Val Asp Ile Ile Pro Arg Asp Glu Leu
Lys Glu Glu Gly Leu Gly 165 170 175 Ile Thr Lys Ala Thr Ile Pro Leu
Val Asn Ser Pro Ala Asp Met Pro 180 185 190 Tyr Tyr Tyr Val Pro Pro
Gln His Pro Val Val Gly Pro Pro Gly Met 195 200 205 Ile Met Gly Lys
Pro Val Gly Ala Glu Gln Ala Thr Leu Tyr Ser Thr 210 215 220 Gln Gln
Pro Arg Pro Pro Met Ala Phe Met Pro Trp Pro His Thr Gln 225 230 235
240 Pro Gln Gln Gln Gln Pro Pro Gln His Gln Gln Thr Asp Ser 245 250
59705DNAArabidopsis thalianaG715 59atggatacca acaaccagca accacctccc
tccgccgccg gaatccctcc tccaccacct 60ggaaccacca tctccgccgc aggaggagga
gcttcttacc accaccttct ccaacaacaa 120caacaacagc tccaactatt
ctggacctac caacgccaag agatcgaaca agttaacgat 180ttcaaaaacc
atcagcttcc actagctagg ataaaaaaga tcatgaaagc cgatgaagat
240gttcgtatga tctccgcaga agcaccgatt ctcttcgcga aagcttgtga
gcttttcatt 300ctcgagctca cgatcagatc ttggcttcac gctgaggaga
ataaacgtcg tacgcttcag 360aaaaacgata tcgctgctgc gattactagg
actgatatct tcgatttcct tgttgatatt 420gttcctagag atgagattaa
ggacgaagcc gcagtcctcg gtggtggaat ggtggtggct 480cctaccgcga
gcggcgtgcc ttactattat ccgccgatgg gacaaccagc tggtcctgga
540gggatgatga ttgggagacc agctatggat ccgaatggtg tttatgtcca
gcctccgtct 600caggcgtggc agagtgtttg gcagacttcg acggggacgg
gagatgatgt ctcttatggt 660agtggtggaa gttccggtca agggaatctc
gacggccaag gttaa 70560234PRTArabidopsis thalianaG715 polypeptide
(domain in aa coordinates 66-131) 60Met Asp Thr Asn Asn Gln Gln Pro
Pro Pro Ser Ala Ala Gly Ile Pro 1 5 10 15 Pro Pro Pro Pro Gly Thr
Thr Ile Ser Ala Ala Gly Gly Gly Ala Ser 20 25 30 Tyr His His Leu
Leu Gln Gln Gln Gln Gln Gln Leu Gln Leu Phe Trp 35 40 45 Thr Tyr
Gln Arg Gln Glu Ile Glu Gln Val Asn Asp Phe Lys Asn His 50 55 60
Gln Leu Pro Leu Ala Arg Ile Lys Lys Ile Met Lys Ala Asp Glu Asp 65
70 75 80 Val Arg Met Ile Ser Ala Glu Ala Pro Ile Leu Phe Ala Lys
Ala Cys 85 90 95 Glu Leu Phe Ile Leu Glu Leu Thr Ile Arg Ser Trp
Leu His Ala Glu 100 105 110 Glu Asn Lys Arg Arg Thr Leu Gln Lys Asn
Asp Ile Ala Ala Ala Ile 115 120 125 Thr Arg Thr Asp Ile Phe Asp Phe
Leu Val Asp Ile Val Pro Arg Asp 130 135 140 Glu Ile Lys Asp Glu Ala
Ala Val Leu Gly Gly Gly Met Val Val Ala 145 150 155 160 Pro Thr Ala
Ser Gly Val Pro Tyr Tyr Tyr Pro Pro Met Gly Gln Pro 165 170 175 Ala
Gly Pro Gly Gly Met Met Ile Gly Arg Pro Ala Met Asp Pro Asn 180 185
190 Gly Val Tyr Val Gln Pro Pro Ser Gln Ala Trp Gln Ser Val Trp Gln
195 200 205 Thr Ser Thr Gly Thr Gly Asp Asp Val Ser Tyr Gly Ser Gly
Gly Ser 210 215 220 Ser Gly Gln Gly Asn Leu Asp Gly Gln Gly 225 230
61711DNAGlycine maxG3886 61atggagacca acaaccagca acaacaacaa
caaggagctc aagcccaatc gggaccctac 60cccgtcgccg gcgccggcgg cagtgcaggt
gcaggtgcag gcgctcctcc ccctttccag 120caccttctcc agcagcagca
gcagcagctc cagatgttct ggtcttacca gcgtcaagaa 180atcgagcacg
tgaacgactt taagaatcac cagctccctc ttgcccgcat caagaagatc
240atgaaggccg acgaggatgt ccgcatgatc tccgccgagg cccccatcct
cttcgccaag 300gcctgcgagc tcttcatcct cgagctcacc atccgctcct
ggctccacgc cgaggagaac 360aagcgccgca ccctccagaa gaacgacatc
gccgccgcca tcacccgcac cgacattttc 420gacttcctcg ttgatattgt
cccccgcgac gagatcaagg acgacgctgc tcttgtgggg 480gccaccgcca
gtggggtgcc ttactactac ccgcccattg gacagcctgc cgggatgatg
540attggccgcc ccgccgtcga tcccgccacc ggggtttatg tccagccgcc
ctcccaggca 600tggcagtccg tctggcagtc cgctgccgag gacgcttcct
atggcaccgg ccccgccggt 660gcccagcgga gccttgatgg ccagagctag
ctcgagcctg caggaagggc g 71162229PRTGlycine maxG3886 polypeptide
(domain in aa coordinates 72-137) 62Met Glu Thr Asn Asn Gln Gln Gln
Gln Gln Gln Gly Ala Gln Ala Gln 1 5 10 15 Ser Gly Pro Tyr Pro Val
Ala Gly Ala Gly Gly Ser Ala Gly Ala Gly 20 25 30 Ala Gly Ala Pro
Pro Pro Phe Gln His Leu Leu Gln Gln Gln Gln Gln 35 40 45 Gln Leu
Gln Met Phe Trp Ser Tyr Gln Arg Gln Glu Ile Glu His Val 50 55 60
Asn Asp Phe Lys Asn His Gln Leu Pro Leu Ala Arg Ile Lys Lys Ile 65
70 75 80 Met Lys Ala Asp Glu Asp Val Arg Met Ile Ser Ala Glu Ala
Pro Ile 85 90 95 Leu Phe Ala Lys Ala Cys Glu Leu Phe Ile Leu Glu
Leu Thr Ile Arg 100 105 110 Ser Trp Leu His Ala Glu Glu Asn Lys Arg
Arg Thr Leu Gln Lys Asn 115 120 125 Asp Ile Ala Ala Ala Ile Thr Arg
Thr Asp Ile Phe Asp Phe Leu Val 130 135 140 Asp Ile Val Pro Arg Asp
Glu Ile Lys Asp Asp Ala Ala Leu Val Gly 145 150 155 160 Ala Thr Ala
Ser Gly Val Pro Tyr Tyr Tyr Pro Pro Ile Gly Gln Pro 165 170 175 Ala
Gly Met Met Ile Gly Arg Pro Ala Val Asp Pro Ala Thr Gly Val 180 185
190 Tyr Val Gln Pro Pro Ser Gln Ala Trp Gln Ser Val Trp Gln Ser Ala
195 200 205 Ala Glu Asp Ala Ser Tyr Gly Thr Gly Pro Ala Gly Ala Gln
Arg Ser 210 215 220 Leu Asp Gly Gln Ser 225 63760DNAZea maysG3889
63cccagcagca acgtaatcca aatccatgga caaccagccg ctgccctact ccacaggcca
60gccccctgcc cccggaggag ccccggtggc gggcatgcct ggcgcggccg gcctcccacc
120cgtgccgcac caccacctgc tccagcagca ggcccagctg caggcgttct
gggcgtacca 180gcgccaggag gcggagcgcg cgtccgcgtc ggacttcaag
aaccaccagc tgcctctggc 240ccggatcaag aagatcatga aggccgacga
ggacgtgcgc atgatctccg ccgaggcgcc
300cgtgctgttc gccaaggcct gcgagctctt catcctcgag ctcactatcc
gctcctggct 360ccacgccgag gagaacaagc gccgcaccct gcagcgcaac
gacgtcgccg cggccatcgc 420gcgcaccgac gtcttcgatt tcctcgtcga
catcgtgccc cgcgaggagg ccaaggagga 480gcccggcagc gccctcggct
tcgcggcgcc tgggaccggc gtcgtcgggg ctggcgcccc 540gggcggggcg
ccagccgccg ggatgcccta ctactatccg ccgatggggc agccggcgcc
600gatgatgccg gcctggcatg ttccggcctg ggacccggcc tggcagcaag
gggcagcgga 660tgtcgatcag agcggcagct tcagcgagga aggacaaggg
tttggagcag gccatggcgg 720cgccgctagc ttccctcctg cgcctccgac
ctccgagtga 76064244PRTZea maysG3889 polypeptide (domain in aa
coordinates 69-134) 64Met Asp Asn Gln Pro Leu Pro Tyr Ser Thr Gly
Gln Pro Pro Ala Pro 1 5 10 15 Gly Gly Ala Pro Val Ala Gly Met Pro
Gly Ala Ala Gly Leu Pro Pro 20 25 30 Val Pro His His His Leu Leu
Gln Gln Gln Ala Gln Leu Gln Ala Phe 35 40 45 Trp Ala Tyr Gln Arg
Gln Glu Ala Glu Arg Ala Ser Ala Ser Asp Phe 50 55 60 Lys Asn His
Gln Leu Pro Leu Ala Arg Ile Lys Lys Ile Met Lys Ala 65 70 75 80 Asp
Glu Asp Val Arg Met Ile Ser Ala Glu Ala Pro Val Leu Phe Ala 85 90
95 Lys Ala Cys Glu Leu Phe Ile Leu Glu Leu Thr Ile Arg Ser Trp Leu
100 105 110 His Ala Glu Glu Asn Lys Arg Arg Thr Leu Gln Arg Asn Asp
Val Ala 115 120 125 Ala Ala Ile Ala Arg Thr Asp Val Phe Asp Phe Leu
Val Asp Ile Val 130 135 140 Pro Arg Glu Glu Ala Lys Glu Glu Pro Gly
Ser Ala Leu Gly Phe Ala 145 150 155 160 Ala Pro Gly Thr Gly Val Val
Gly Ala Gly Ala Pro Gly Gly Ala Pro 165 170 175 Ala Ala Gly Met Pro
Tyr Tyr Tyr Pro Pro Met Gly Gln Pro Ala Pro 180 185 190 Met Met Pro
Ala Trp His Val Pro Ala Trp Asp Pro Ala Trp Gln Gln 195 200 205 Gly
Ala Ala Asp Val Asp Gln Ser Gly Ser Phe Ser Glu Glu Gly Gln 210 215
220 Gly Phe Gly Ala Gly His Gly Gly Ala Ala Ser Phe Pro Pro Ala Pro
225 230 235 240 Pro Thr Ser Glu 65800DNAArabidopsis thalianaG1646
65gatcttttga tccaatcaca aggcaaagat ccaatggaca ataacaacaa caacaacaac
60cagcaaccac caccaacctc cgtctatcca cctggctccg ccgtcacaac cgtaatccct
120cctccaccat ctggatctgc atcaatagtc accggaggag gagcgacata
ccaccacctc 180ctccagcaac aacagcaaca gcttcaaatg ttctggacat
accagagaca agagatcgaa 240caggtaaacg atttcaaaaa ccatcagctc
cctctagctc gtatcaaaaa aatcatgaaa 300gctgatgaag atgtgcgtat
gatctccgcc gaagcaccga ttctcttcgc gaaagcttgt 360gagcttttca
ttctcgaact tacgattaga tcttggcttc acgctgaaga gaacaaacgt
420cgtacgcttc agaaaaacga tatcgctgct gcgattacta gaaccgatat
cttcgatttc 480cttgttgata ttgttcctag ggaagagatc aaggaagagg
aagatgcagc atcggctctt 540ggtggaggag gtatggttgc tcccgccgcg
agcggtgttc cttattatta tccaccgatg 600ggacaaccgg cggttcctgg
agggatgatg attggaagac cggcgatgga tcctagcggt 660gtttatgctc
agcctccttc tcaggcatgg caaagcgttt ggcagaattc agctggtggt
720ggtgatgatg tgtcttatgg aagtggagga agtagcggcc atggtaatct
cgatagccaa 780gggtaagtga attctagtag 80066250PRTArabidopsis
thalianaG1646 polypeptide (domain in aa coordinates 79-144) 66Met
Asp Asn Asn Asn Asn Asn Asn Asn Gln Gln Pro Pro Pro Thr Ser 1 5 10
15 Val Tyr Pro Pro Gly Ser Ala Val Thr Thr Val Ile Pro Pro Pro Pro
20 25 30 Ser Gly Ser Ala Ser Ile Val Thr Gly Gly Gly Ala Thr Tyr
His His 35 40 45 Leu Leu Gln Gln Gln Gln Gln Gln Leu Gln Met Phe
Trp Thr Tyr Gln 50 55 60 Arg Gln Glu Ile Glu Gln Val Asn Asp Phe
Lys Asn His Gln Leu Pro 65 70 75 80 Leu Ala Arg Ile Lys Lys Ile Met
Lys Ala Asp Glu Asp Val Arg Met 85 90 95 Ile Ser Ala Glu Ala Pro
Ile Leu Phe Ala Lys Ala Cys Glu Leu Phe 100 105 110 Ile Leu Glu Leu
Thr Ile Arg Ser Trp Leu His Ala Glu Glu Asn Lys 115 120 125 Arg Arg
Thr Leu Gln Lys Asn Asp Ile Ala Ala Ala Ile Thr Arg Thr 130 135 140
Asp Ile Phe Asp Phe Leu Val Asp Ile Val Pro Arg Glu Glu Ile Lys 145
150 155 160 Glu Glu Glu Asp Ala Ala Ser Ala Leu Gly Gly Gly Gly Met
Val Ala 165 170 175 Pro Ala Ala Ser Gly Val Pro Tyr Tyr Tyr Pro Pro
Met Gly Gln Pro 180 185 190 Ala Val Pro Gly Gly Met Met Ile Gly Arg
Pro Ala Met Asp Pro Ser 195 200 205 Gly Val Tyr Ala Gln Pro Pro Ser
Gln Ala Trp Gln Ser Val Trp Gln 210 215 220 Asn Ser Ala Gly Gly Gly
Asp Asp Val Ser Tyr Gly Ser Gly Gly Ser 225 230 235 240 Ser Gly His
Gly Asn Leu Asp Ser Gln Gly 245 250 67915DNAOryza sativaG3543
67atggacaacc agcagctacc ctacgccggt cagccggcgg ccgcaggcgc cggagccccg
60gtgccgggcg tgcctggcgc gggcgggccg ccggcggtgc cgcaccacca cctgctccag
120cagcagcagg cgcagctgca ggcgttctgg gcgtaccagc ggcaggaggc
ggagcgcgcg 180tcggcgtcgg acttcaagaa ccaccagctg ccgctggcgg
ggatcaagaa gatcatgaag 240gcggacgagg acgtgcgcat gatctcggcg
gaggcgcccg tgctgttcgc caaggcgtgc 300gagctcttca tcctggagct
caccatccgc tcgtggctgc acgccgagga gaacaagcgc 360cgcaccctgc
agcgcaagga cgtcgccgcc gccatcgcgc gcaccgacgt gttcgacttc
420ctcgtcgaca tcgtgccgcg ggaggaggcc aaggaggagc ccggcagcgc
gctcgggttc 480gcggcgggag ggcccgccgg cgccgttgga gcggccggcc
ccgccgcggg gctgccgtac 540tactacccgc cgatggggca gccggcgccg
atgatgccgg cgtggcatgt tccggcgtgg 600gacccggcgt ggcagcaagg
agcagcgccg gatgtggacc agggcgccgc cggcagcttc 660agcgaggaag
ggcagcaagg ttttgcaggc catggcggtg cggcagctag cttccctcct
720gcacctccaa gctccgaata gtgatgatcc atatggttcc atgcatgcat
cgctgaggtg 780ctagctagct actatagctg ctcaaatcaa atgctcaatg
tgtcggtaat taattaatgt 840ggtacgtatt aacttaaccg atgtacgtaa
tggacgctca agctaattaa gggatgtaca 900atttactaaa aaaaa
91568246PRTOryza sativaG3543 polypeptide (domain in aa coordinates
70-135) 68Met Asp Asn Gln Gln Leu Pro Tyr Ala Gly Gln Pro Ala Ala
Ala Gly 1 5 10 15 Ala Gly Ala Pro Val Pro Gly Val Pro Gly Ala Gly
Gly Pro Pro Ala 20 25 30 Val Pro His His His Leu Leu Gln Gln Gln
Gln Ala Gln Leu Gln Ala 35 40 45 Phe Trp Ala Tyr Gln Arg Gln Glu
Ala Glu Arg Ala Ser Ala Ser Asp 50 55 60 Phe Lys Asn His Gln Leu
Pro Leu Ala Gly Ile Lys Lys Ile Met Lys 65 70 75 80 Ala Asp Glu Asp
Val Arg Met Ile Ser Ala Glu Ala Pro Val Leu Phe 85 90 95 Ala Lys
Ala Cys Glu Leu Phe Ile Leu Glu Leu Thr Ile Arg Ser Trp 100 105 110
Leu His Ala Glu Glu Asn Lys Arg Arg Thr Leu Gln Arg Lys Asp Val 115
120 125 Ala Ala Ala Ile Ala Arg Thr Asp Val Phe Asp Phe Leu Val Asp
Ile 130 135 140 Val Pro Arg Glu Glu Ala Lys Glu Glu Pro Gly Ser Ala
Leu Gly Phe 145 150 155 160 Ala Ala Gly Gly Pro Ala Gly Ala Val Gly
Ala Ala Gly Pro Ala Ala 165 170 175 Gly Leu Pro Tyr Tyr Tyr Pro Pro
Met Gly Gln Pro Ala Pro Met Met 180 185 190 Pro Ala Trp His Val Pro
Ala Trp Asp Pro Ala Trp Gln Gln Gly Ala 195 200 205 Ala Pro Asp Val
Asp Gln Gly Ala Ala Gly Ser Phe Ser Glu Glu Gly 210 215 220 Gln Gln
Gly Phe Ala Gly His Gly Gly Ala Ala Ala Ser Phe Pro Pro 225 230 235
240 Ala Pro Pro Ser Ser Glu 245 69732DNAArabidopsis thalianaG1820
69ctcacttcca acatccaaat ccctagaaat tgtaaatggc tgagaacaac aacaacaacg
60gcgacaacat gaacaacgac aaccaccagc aaccaccgtc gtactcgcag ctgccgccga
120tggcatcatc caaccctcag ttacgtaatt actggattga gcagatggaa
accgtctcgg 180atttcaaaaa ccgtcagctt ccattggctc gaattaagaa
gatcatgaag gctgatccag 240atgtgcacat ggtctccgca gaggctccga
tcatcttcgc aaaggcttgc gaaatgttca 300tcgttgatct cacgatgcgg
tcgtggctca aagccgagga gaacaaacgc cacacgcttc 360agaaatcgga
tatctccaac gcagtggcta gctctttcac ctacgatttc cttcttgatg
420ttgtccctaa ggacgagtct atcgccaccg ctgatcctgg ctttgtggct
atgccacatc 480ctgacggtgg aggagtaccg caatattatt atccaccggg
agtggtgatg ggaactccta 540tggttggtag tggaatgtac gcgccatcgc
aggcgtggcc agcagcggct ggtgacgggg 600aggatgatgc tgaggataat
ggaggaaacg gcggcggaaa ttgaagtgta gatttagggt 660ttgtaaccgc
ctatgtggga aatttgaaat ttggtggtgt ttattagggt tcttcaattc
720gtcggatttg ct 73270202PRTArabidopsis thalianaG1820 polypeptide
(domain in aa coordinates 55-120) 70Met Ala Glu Asn Asn Asn Asn Asn
Gly Asp Asn Met Asn Asn Asp Asn 1 5 10 15 His Gln Gln Pro Pro Ser
Tyr Ser Gln Leu Pro Pro Met Ala Ser Ser 20 25 30 Asn Pro Gln Leu
Arg Asn Tyr Trp Ile Glu Gln Met Glu Thr Val Ser 35 40 45 Asp Phe
Lys Asn Arg Gln Leu Pro Leu Ala Arg Ile Lys Lys Ile Met 50 55 60
Lys Ala Asp Pro Asp Val His Met Val Ser Ala Glu Ala Pro Ile Ile 65
70 75 80 Phe Ala Lys Ala Cys Glu Met Phe Ile Val Asp Leu Thr Met
Arg Ser 85 90 95 Trp Leu Lys Ala Glu Glu Asn Lys Arg His Thr Leu
Gln Lys Ser Asp 100 105 110 Ile Ser Asn Ala Val Ala Ser Ser Phe Thr
Tyr Asp Phe Leu Leu Asp 115 120 125 Val Val Pro Lys Asp Glu Ser Ile
Ala Thr Ala Asp Pro Gly Phe Val 130 135 140 Ala Met Pro His Pro Asp
Gly Gly Gly Val Pro Gln Tyr Tyr Tyr Pro 145 150 155 160 Pro Gly Val
Val Met Gly Thr Pro Met Val Gly Ser Gly Met Tyr Ala 165 170 175 Pro
Ser Gln Ala Trp Pro Ala Ala Ala Gly Asp Gly Glu Asp Asp Ala 180 185
190 Glu Asp Asn Gly Gly Asn Gly Gly Gly Asn 195 200
71668DNAArabidopsis thalianaG1836 71ataacaagcc tagaacacta
gaaacttcaa aaaagaaaaa aatcttatgg agaacaacaa 60cggcaacaac cagctgccac
cgaaaggtaa cgagcaactg aagagtttct ggtcaaaaga 120gatggaaggt
aacttagatt tcaaaaatca cgaccttcct ataactcgta tcaagaagat
180tatgaagtat gatccggatg tgactatgat agctagtgag gctccaatcc
tcctctcgaa 240agcatgtgag atgtttatca tggatctcac gatgcgttcg
tggctccatg ctcaggaaag 300caaacgagtc acgctacaga aatctaatgt
cgatgccgca gtggctcaaa ctgttatctt 360tgatttcttg cttgatgatg
acattgaggt aaagagagag tctgttgccg ccgctgctga 420tcctgtggcc
atgccaccta ttgacgatgg agagctgcct ccaggaatgg taattggaac
480tcctgtttgt tgtagtcttg gaatccacca accacaacca caaatgcagg
catggcctgg 540agcttggacc tcggtgtctg gtgaggagga agaagcgcgt
gggaaaaaag gaggtgacga 600cggaaactaa taagtggaat acgttttagg
gtattttcaa gggaatatgt agtaaatagt 660catggatc 66872187PRTArabidopsis
thalianaG1836 polypeptide (domain in aa coordinates 37-102) 72Met
Glu Asn Asn Asn Gly Asn Asn Gln Leu Pro Pro Lys Gly Asn Glu 1 5 10
15 Gln Leu Lys Ser Phe Trp Ser Lys Glu Met Glu Gly Asn Leu Asp Phe
20 25 30 Lys Asn His Asp Leu Pro Ile Thr Arg Ile Lys Lys Ile Met
Lys Tyr 35 40 45 Asp Pro Asp Val Thr Met Ile Ala Ser Glu Ala Pro
Ile Leu Leu Ser 50 55 60 Lys Ala Cys Glu Met Phe Ile Met Asp Leu
Thr Met Arg Ser Trp Leu 65 70 75 80 His Ala Gln Glu Ser Lys Arg Val
Thr Leu Gln Lys Ser Asn Val Asp 85 90 95 Ala Ala Val Ala Gln Thr
Val Ile Phe Asp Phe Leu Leu Asp Asp Asp 100 105 110 Ile Glu Val Lys
Arg Glu Ser Val Ala Ala Ala Ala Asp Pro Val Ala 115 120 125 Met Pro
Pro Ile Asp Asp Gly Glu Leu Pro Pro Gly Met Val Ile Gly 130 135 140
Thr Pro Val Cys Cys Ser Leu Gly Ile His Gln Pro Gln Pro Gln Met 145
150 155 160 Gln Ala Trp Pro Gly Ala Trp Thr Ser Val Ser Gly Glu Glu
Glu Glu 165 170 175 Ala Arg Gly Lys Lys Gly Gly Asp Asp Gly Asn 180
185 73639DNAArabidopsis thalianaG1819 73atggaagaga acaacggcaa
caacaaccac tacctgccgc aaccatcgtc ttcccaactg 60ccgccgccac cattgtatta
tcaatcaatg ccgttgccgt catattcact gccgctgccg 120tactcaccgc
agatgcggaa ttattggatt gcgcagatgg gaaacgcaac tgatgttaag
180catcatgcgt ttccactaac caggataaag aaaatcatga agtccaaccc
ggaagtgaac 240atggtcactg cagaggctcc ggtccttata tcgaaggcct
gtgagatgct cattcttgat 300ctcacaatgc gatcgtggct tcataccgtg
gagggcggtc gccaaactct caagagatcc 360gatacgctca cgagatccga
tatctccgcc gcaacgactc gtagtttcaa atttaccttc 420cttggcgacg
ttgtcccaag agacccttcc gtcgttaccg atgatcccgt gctacatccg
480gacggtgaag tacttcctcc gggaacggtg ataggatatc cggtgtttga
ttgtaatggt 540gtgtacgcgt caccgccaca gatgcaggag tggccggcgg
tgcctggtga cggagaggag 600gcagctgggg aaattggagg aagcagcggc ggtaattga
63974212PRTArabidopsis thalianaG1819 polypeptide (domain in aa
coordinates 64-135) 74Met Glu Glu Asn Asn Gly Asn Asn Asn His Tyr
Leu Pro Gln Pro Ser 1 5 10 15 Ser Ser Gln Leu Pro Pro Pro Pro Leu
Tyr Tyr Gln Ser Met Pro Leu 20 25 30 Pro Ser Tyr Ser Leu Pro Leu
Pro Tyr Ser Pro Gln Met Arg Asn Tyr 35 40 45 Trp Ile Ala Gln Met
Gly Asn Ala Thr Asp Val Lys His His Ala Phe 50 55 60 Pro Leu Thr
Arg Ile Lys Lys Ile Met Lys Ser Asn Pro Glu Val Asn 65 70 75 80 Met
Val Thr Ala Glu Ala Pro Val Leu Ile Ser Lys Ala Cys Glu Met 85 90
95 Leu Ile Leu Asp Leu Thr Met Arg Ser Trp Leu His Thr Val Glu Gly
100 105 110 Gly Arg Gln Thr Leu Lys Arg Ser Asp Thr Leu Thr Arg Ser
Asp Ile 115 120 125 Ser Ala Ala Thr Thr Arg Ser Phe Lys Phe Thr Phe
Leu Gly Asp Val 130 135 140 Val Pro Arg Asp Pro Ser Val Val Thr Asp
Asp Pro Val Leu His Pro 145 150 155 160 Asp Gly Glu Val Leu Pro Pro
Gly Thr Val Ile Gly Tyr Pro Val Phe 165 170 175 Asp Cys Asn Gly Val
Tyr Ala Ser Pro Pro Gln Met Gln Glu Trp Pro 180 185 190 Ala Val Pro
Gly Asp Gly Glu Glu Ala Ala Gly Glu Ile Gly Gly Ser 195 200 205 Ser
Gly Gly Asn 210 751740DNAArabidopsis thalianaG1818 75taacaaatca
aataattaga gaaataacca aaatttaact tttagaggga ctacaggatt 60tgtactttgt
acattcatat attattgtta tatatcgttt catacattaa tttgaaccaa
120tgtaaattaa gtaaaattca atttaacatc atgagcaaat tcttattaaa
attctcttaa 180aattttgagc aaattatgct ttcacattta acatttgaaa
acatcatttt taacaagata 240ttcaaaacta agttttgtac agcaaaattt
taactttcaa ttttatagag aaaaaggtat 300tttttttttt gtttcatttt
tataagacta ttatttggta tataatatac actttaagta 360aaaacaaatc
tctttctttt ttcttcttat aataccaacc acaagtctgt cagtcacaca
420catacagtta ataacattaa atattcttaa caaactacta aataggttga
gattcatata 480tgtaaagaga tcacttctta atcttatcct accatatctt
atatacgctt aattttcctt 540tatatatgca aacctccaca taaaaatatc
tcaaacccaa acacttcaaa caaaaaaaaa 600atggagaaca acaacaacaa
ccaccaacag ccaccgaaag ataacgagca actaaagagt 660ttctggtcaa
aggggatgga aggtgacttg aatgtcaaga atcacgagtt ccccatctct
720cgtatcaaga ggataatgaa gtttgatccg gatgtgagta tgatcgctgc
tgaggctcca 780aatctcttat ctaaggcttg tgaaatgttt gtcatggacc
tcacgatgcg ttcatggctc 840catgctcaag agagcaaccg actcacgata
cggaaatctg atgttgatgc cgtagtgtct 900caaaccgtca tctttgattt
cttgcgtgat gatgtcccta aggacgaggg agagcccgtt 960gtcgccgctg
ctgatcctgt ggacgatgtt gctgatcatg tggctgtgcc agatcttaac
1020aatgaagaac tgccgccggg aacggtgata ggaactccgg tttgttacgg
tttaggaata 1080cacgcgccac acccgcagat gcctggagct tggaccgagg
aggatgcgac tggggcaaat 1140ggaggaaacg gtgggaatta atatttggat
tgggttttgt aaccgctgtt gtgagaactt 1200gaatttcttt ttgagttctg
cttatgtttt caatgttatg ttttttagtt gttgaatgta
1260tttctgttgt tttgtccaaa aaaaaaaaag aatgtatttc tgttgttgtc
tttcaaatga 1320atctaatggt ttatgaatat tggctttaga ttaatttatg
catacaaaaa cacaaggatt 1380acggataaaa aagtcctcag tttacccatg
gaaacataat cttctagtga ttccttatga 1440gagtagaaaa gaatcatata
ttataatcta tttcataaga gatagggtac tgtaaacaag 1500gatgtttatt
cggctatttc tttttttttt aatcactttt acttgtcaag actcttttgt
1560gtttgcagct ttttgttaga ttacattcta gaggcaacaa gatccagaga
tctagcaaaa 1620aaaacttatt ttgaaacctg aatctatttt aaaaattttc
caactcattt ttcgttctta 1680ttctttgttt tccaacggaa tttggcgcac
aaacgattta tttgaatttt gtctttcaag 174076186PRTArabidopsis
thalianaG1818 polypeptide (domain in aa coordinates 38-102) 76Met
Glu Asn Asn Asn Asn Asn His Gln Gln Pro Pro Lys Asp Asn Glu 1 5 10
15 Gln Leu Lys Ser Phe Trp Ser Lys Gly Met Glu Gly Asp Leu Asn Val
20 25 30 Lys Asn His Glu Phe Pro Ile Ser Arg Ile Lys Arg Ile Met
Lys Phe 35 40 45 Asp Pro Asp Val Ser Met Ile Ala Ala Glu Ala Pro
Asn Leu Leu Ser 50 55 60 Lys Ala Cys Glu Met Phe Val Met Asp Leu
Thr Met Arg Ser Trp Leu 65 70 75 80 His Ala Gln Glu Ser Asn Arg Leu
Thr Ile Arg Lys Ser Asp Val Asp 85 90 95 Ala Val Val Ser Gln Thr
Val Ile Phe Asp Phe Leu Arg Asp Asp Val 100 105 110 Pro Lys Asp Glu
Gly Glu Pro Val Val Ala Ala Ala Asp Pro Val Asp 115 120 125 Asp Val
Ala Asp His Val Ala Val Pro Asp Leu Asn Asn Glu Glu Leu 130 135 140
Pro Pro Gly Thr Val Ile Gly Thr Pro Val Cys Tyr Gly Leu Gly Ile 145
150 155 160 His Ala Pro His Pro Gln Met Pro Gly Ala Trp Thr Glu Glu
Asp Ala 165 170 175 Thr Gly Ala Asn Gly Gly Asn Gly Gly Asn 180 185
77588DNAArabidopsis thalianaG490 77atgaggaggc caaagtcatc tcacgtcagg
atggaacctg ttgcgcctcg ttcacataac 60acgatgccaa tgcttgatca atttcgatct
aatcatcctg aaacaagcaa gatcgagggg 120gtctcttcgt tggacacagc
tctgaaggtg ttttggaata atcaaaggga gcagctagga 180aactttgcag
gccaaactca tttgccgcta tctagggtca gaaagatttt gaaatctgat
240cctgaagtca agaagataag ctgtgatgtt cctgctttgt tttcgaaagc
ctgtgaatac 300ttcattctag aggtaacatt acgagcttgg atgcatactc
aatcatgcac tcgtgagacc 360atccggcgtt gtgatatctt ccaggccgta
aagaactcag gaacttatga tttcctgatt 420gatcgtgtcc cttttggacc
gcactgtgtc acccatcagg gtgtgcaacc tcctgctgaa 480atgattttgc
cggatatgaa tgttccaatc gatatggacc agattgagga ggagaatatg
540atggaagagc gctctgtcgg gtttgacctc aactgtgatc tccagtga
58878195PRTArabidopsis thalianaG490 polypeptide (domain in aa
coordinates 68-133) 78Met Arg Arg Pro Lys Ser Ser His Val Arg Met
Glu Pro Val Ala Pro 1 5 10 15 Arg Ser His Asn Thr Met Pro Met Leu
Asp Gln Phe Arg Ser Asn His 20 25 30 Pro Glu Thr Ser Lys Ile Glu
Gly Val Ser Ser Leu Asp Thr Ala Leu 35 40 45 Lys Val Phe Trp Asn
Asn Gln Arg Glu Gln Leu Gly Asn Phe Ala Gly 50 55 60 Gln Thr His
Leu Pro Leu Ser Arg Val Arg Lys Ile Leu Lys Ser Asp 65 70 75 80 Pro
Glu Val Lys Lys Ile Ser Cys Asp Val Pro Ala Leu Phe Ser Lys 85 90
95 Ala Cys Glu Tyr Phe Ile Leu Glu Val Thr Leu Arg Ala Trp Met His
100 105 110 Thr Gln Ser Cys Thr Arg Glu Thr Ile Arg Arg Cys Asp Ile
Phe Gln 115 120 125 Ala Val Lys Asn Ser Gly Thr Tyr Asp Phe Leu Ile
Asp Arg Val Pro 130 135 140 Phe Gly Pro His Cys Val Thr His Gln Gly
Val Gln Pro Pro Ala Glu 145 150 155 160 Met Ile Leu Pro Asp Met Asn
Val Pro Ile Asp Met Asp Gln Ile Glu 165 170 175 Glu Glu Asn Met Met
Glu Glu Arg Ser Val Gly Phe Asp Leu Asn Cys 180 185 190 Asp Leu Gln
195 79882DNAArabidopsis thalianaG3074 79atgaggaaga agctcgatac
tcggttccca gctgctcgta ttaaaaagat tatgcaagct 60gatgaggatg ttggcaagat
agctttggca gtgcctgtct tagtctcaaa atctttggag 120ttgttcttgc
aagacctttg tgatcgtaca tatgaaatta cccttgaaag aggtgccaag
180actgtgagct cattgcacct aaaacattgt gtggaaagat ataacgtgtt
tgattttctg 240agggaagttg tgagtaaggt gcctgactat ggccattccc
aagggcaagg acatggtgat 300gttaccatgg atgatcgcag catctccaag
agaaggaagc ccatcagcga tgaagtgaat 360gacagtgacg aggaatataa
gaaaagcaaa acgcaagaga tagggagtgc taagaccagt 420ggcaggggtg
gtagaggaag agggcgagga agaggtcgtg gtggacgagc tgcaaaagca
480gccgaaagag agggtctcaa ccgcgagatg gaagtagaag ccgccaattc
tggacagcca 540ccaccagaag acaatgtcaa gatgcatgcg tcagagtcat
caccacaaga ggatgagaag 600aaaggcatcg acggcacagc agcatcgaac
gaagacacca agcaacacct tcaaagtccc 660aaagaaggca ttgactttga
tctcaacgct gaatccctcg acctaaacga gaccaaactg 720gcaccagcca
caggcacaac cacaaccaca actgcagcaa cagactctga ggagtattcg
780ggctggccta tgatggacat aagcaaaatg gatccagcac agcttgctag
tctgggtaag 840aggatagacg aggatgagga agattatgac gaagaaggct aa
88280293PRTArabidopsis thalianaG3074 polypeptide (domain in aa
coordinates 9-73) 80Met Arg Lys Lys Leu Asp Thr Arg Phe Pro Ala Ala
Arg Ile Lys Lys 1 5 10 15 Ile Met Gln Ala Asp Glu Asp Val Gly Lys
Ile Ala Leu Ala Val Pro 20 25 30 Val Leu Val Ser Lys Ser Leu Glu
Leu Phe Leu Gln Asp Leu Cys Asp 35 40 45 Arg Thr Tyr Glu Ile Thr
Leu Glu Arg Gly Ala Lys Thr Val Ser Ser 50 55 60 Leu His Leu Lys
His Cys Val Glu Arg Tyr Asn Val Phe Asp Phe Leu 65 70 75 80 Arg Glu
Val Val Ser Lys Val Pro Asp Tyr Gly His Ser Gln Gly Gln 85 90 95
Gly His Gly Asp Val Thr Met Asp Asp Arg Ser Ile Ser Lys Arg Arg 100
105 110 Lys Pro Ile Ser Asp Glu Val Asn Asp Ser Asp Glu Glu Tyr Lys
Lys 115 120 125 Ser Lys Thr Gln Glu Ile Gly Ser Ala Lys Thr Ser Gly
Arg Gly Gly 130 135 140 Arg Gly Arg Gly Arg Gly Arg Gly Arg Gly Gly
Arg Ala Ala Lys Ala 145 150 155 160 Ala Glu Arg Glu Gly Leu Asn Arg
Glu Met Glu Val Glu Ala Ala Asn 165 170 175 Ser Gly Gln Pro Pro Pro
Glu Asp Asn Val Lys Met His Ala Ser Glu 180 185 190 Ser Ser Pro Gln
Glu Asp Glu Lys Lys Gly Ile Asp Gly Thr Ala Ala 195 200 205 Ser Asn
Glu Asp Thr Lys Gln His Leu Gln Ser Pro Lys Glu Gly Ile 210 215 220
Asp Phe Asp Leu Asn Ala Glu Ser Leu Asp Leu Asn Glu Thr Lys Leu 225
230 235 240 Ala Pro Ala Thr Gly Thr Thr Thr Thr Thr Thr Ala Ala Thr
Asp Ser 245 250 255 Glu Glu Tyr Ser Gly Trp Pro Met Met Asp Ile Ser
Lys Met Asp Pro 260 265 270 Ala Gln Leu Ala Ser Leu Gly Lys Arg Ile
Asp Glu Asp Glu Glu Asp 275 280 285 Tyr Asp Glu Glu Gly 290
81738DNAArabidopsis thalianaG1249 81tcgaccgttc ttctcaatct
caccaatcgg tttaagctga aaacccgaat tagcaaaatc 60ttcgttcggg ctgttttggt
taatccggtt tacatgtttt ctcattgctc attttcattt 120tcccgccgtg
acagagcgcg taaatctcaa aaccctaaaa atgtcgaaca tatacaattc
180attaccttaa tcagattttc tcaacagaat caaaatcaaa atccatggag
gaagaagaag 240gatcaatccg accagagttt ccaatcggaa gagtaaagaa
gataatgaaa ctggacaaag 300acatcaacaa aatcaactca gaagctcttc
acgtcatcac ttactccacc gaactcttcc 360tccacttcct cgccgagaaa
tctgctgttg ttacggcgga gaagaagcgt aagactgtta 420atctcgatca
tttaagaatc gccgtgaaaa gacaccaacc tactagtgat ttcctcttag
480actcgcttcc gttgccggct cagcctgtca aacataccaa atcggtttcc
gacaagaaga 540ttccggcgcc gccaattggg actcgtcgta tcgatgattt
cttcagtaaa gggaaagcaa 600agactgattc agcctaaagt aaaatttctc
attttgttca caattgcaaa ttttactctg 660ttctcaaatc aaaatcttgt
tttgctaaaa gtgtagtgag aatgtatgga tcatgaggaa 720cttttatagg aagcggcc
73882130PRTArabidopsis thalianaG1249 polypeptide (domain in aa
coordinates 12-76) 82Met Glu Glu Glu Glu Gly Ser Ile Arg Pro Glu
Phe Pro Ile Gly Arg 1 5 10 15 Val Lys Lys Ile Met Lys Leu Asp Lys
Asp Ile Asn Lys Ile Asn Ser 20 25 30 Glu Ala Leu His Val Ile Thr
Tyr Ser Thr Glu Leu Phe Leu His Phe 35 40 45 Leu Ala Glu Lys Ser
Ala Val Val Thr Ala Glu Lys Lys Arg Lys Thr 50 55 60 Val Asn Leu
Asp His Leu Arg Ile Ala Val Lys Arg His Gln Pro Thr 65 70 75 80 Ser
Asp Phe Leu Leu Asp Ser Leu Pro Leu Pro Ala Gln Pro Val Lys 85 90
95 His Thr Lys Ser Val Ser Asp Lys Lys Ile Pro Ala Pro Pro Ile Gly
100 105 110 Thr Arg Arg Ile Asp Asp Phe Phe Ser Lys Gly Lys Ala Lys
Thr Asp 115 120 125 Ser Ala 130 83621DNAArabidopsis thalianaG3075
83atggtgtcgt caaagaaacc caaggagaag aaggcgagga gcgatgtcgt cgtcaataaa
60gcgagtggtc ggagtaaacg cagctccggt tccagaacga agaagacgtc gaacaaggtt
120aacattgtga agaagaagcc ggagatttac gagatctcag aatcatcgag
cagtgactct 180gtggaagaag caataagagg cgatgaggcg aagaaaagta
acggcgtcgt gagcaagagg 240ggtaacggaa agagtgtagg aattccgacg
aagacgagta aaaatcgaga agaggacgat 300ggaggcgcgg aagatgctaa
gatcaagttt ccgatgaatc ggattcggcg gatcatgaga 360agcgataatt
ctgctcctca gattatgcag gatgctgtat ttcttgtcaa caaagccacg
420gagatgttca ttgagcggtt ttctgaagaa gcttatgata gttccgtcaa
ggacaaaaag 480aaattcatcc actacaaaca cctctcatcc gtagtgagta
acgaccagag atacgagttc 540cttgcagata gtgttcccga gaaacttaaa
gcagaggccg cgttggagga atgggaaaga 600ggcatgacag atgcaggctg a
62184206PRTArabidopsis thalianaG3075 polypeptide (domain in aa
coordinates 110-173) 84Met Val Ser Ser Lys Lys Pro Lys Glu Lys Lys
Ala Arg Ser Asp Val 1 5 10 15 Val Val Asn Lys Ala Ser Gly Arg Ser
Lys Arg Ser Ser Gly Ser Arg 20 25 30 Thr Lys Lys Thr Ser Asn Lys
Val Asn Ile Val Lys Lys Lys Pro Glu 35 40 45 Ile Tyr Glu Ile Ser
Glu Ser Ser Ser Ser Asp Ser Val Glu Glu Ala 50 55 60 Ile Arg Gly
Asp Glu Ala Lys Lys Ser Asn Gly Val Val Ser Lys Arg 65 70 75 80 Gly
Asn Gly Lys Ser Val Gly Ile Pro Thr Lys Thr Ser Lys Asn Arg 85 90
95 Glu Glu Asp Asp Gly Gly Ala Glu Asp Ala Lys Ile Lys Phe Pro Met
100 105 110 Asn Arg Ile Arg Arg Ile Met Arg Ser Asp Asn Ser Ala Pro
Gln Ile 115 120 125 Met Gln Asp Ala Val Phe Leu Val Asn Lys Ala Thr
Glu Met Phe Ile 130 135 140 Glu Arg Phe Ser Glu Glu Ala Tyr Asp Ser
Ser Val Lys Asp Lys Lys 145 150 155 160 Lys Phe Ile His Tyr Lys His
Leu Ser Ser Val Val Ser Asn Asp Gln 165 170 175 Arg Tyr Glu Phe Leu
Ala Asp Ser Val Pro Glu Lys Leu Lys Ala Glu 180 185 190 Ala Ala Leu
Glu Glu Trp Glu Arg Gly Met Thr Asp Ala Gly 195 200 205
8560PRTArabidopsis thalianaG929 conserved domain 85Glu Pro Val Phe
Val Asn Ala Lys Gln Tyr His Gly Ile Leu Arg Arg 1 5 10 15 Arg Gln
Ser Arg Ala Lys Leu Glu Ala Arg Asn Arg Ala Ile Lys Ala 20 25 30
Lys Lys Pro Tyr Met His Glu Ser Arg His Leu His Ala Ile Arg Arg 35
40 45 Pro Arg Gly Cys Gly Gly Arg Phe Leu Asn Ala Lys 50 55 60
8660PRTArabidopsis thalianaG2344 conserved domain 86Glu Pro Val Phe
Val Asn Ala Lys Gln Tyr His Gly Ile Leu Arg Arg 1 5 10 15 Arg Gln
Ser Arg Ala Arg Leu Glu Ser Gln Asn Lys Val Ile Lys Ser 20 25 30
Arg Lys Pro Tyr Leu His Glu Ser Arg His Leu His Ala Ile Arg Arg 35
40 45 Pro Arg Gly Cys Gly Gly Arg Phe Leu Asn Ala Lys 50 55 60
8760PRTArabidopsis thalianaG931 conserved domain 87Glu Pro Val Phe
Val Asn Ala Lys Gln Phe His Ala Ile Met Arg Arg 1 5 10 15 Arg Gln
Gln Arg Ala Lys Leu Glu Ala Gln Asn Lys Leu Ile Lys Ala 20 25 30
Arg Lys Pro Tyr Leu His Glu Ser Arg His Val His Ala Leu Lys Arg 35
40 45 Pro Arg Gly Ser Gly Gly Arg Phe Leu Asn Thr Lys 50 55 60
8860PRTGlycine maxG3920 conserved domain 88Glu Pro Val Tyr Val Asn
Ala Lys Gln Tyr His Gly Ile Leu Arg Arg 1 5 10 15 Arg Gln Ser Arg
Ala Lys Ala Glu Ile Glu Lys Lys Val Ile Lys Asn 20 25 30 Arg Lys
Pro Tyr Leu His Glu Ser Arg His Leu His Ala Met Arg Arg 35 40 45
Ala Arg Gly Asn Gly Gly Arg Phe Leu Asn Thr Lys 50 55 60
8960PRTArabidopsis thalianaG928 conserved domain 89Asp Pro Val Phe
Val Asn Ala Lys Gln Tyr His Ala Ile Met Arg Arg 1 5 10 15 Arg Gln
Gln Arg Ala Lys Leu Glu Ala Gln Asn Lys Leu Ile Arg Ala 20 25 30
Arg Lys Pro Tyr Leu His Glu Ser Arg His Val His Ala Leu Lys Arg 35
40 45 Pro Arg Gly Ser Gly Gly Arg Phe Leu Asn Thr Lys 50 55 60
9060PRTArabidopsis thalianaG1782 conserved domain 90Glu Pro Ile Phe
Val Asn Ala Lys Gln Tyr His Ala Ile Leu Arg Arg 1 5 10 15 Arg Lys
His Arg Ala Lys Leu Glu Ala Gln Asn Lys Leu Ile Lys Cys 20 25 30
Arg Lys Pro Tyr Leu His Glu Ser Arg His Leu His Ala Leu Lys Arg 35
40 45 Ala Arg Gly Ser Gly Gly Arg Phe Leu Asn Thr Lys 50 55 60
9160PRTArabidopsis thalianaG1363 conserved domain 91Glu Pro Ile Phe
Val Asn Ala Lys Gln Tyr Gln Ala Ile Leu Arg Arg 1 5 10 15 Arg Glu
Arg Arg Ala Lys Leu Glu Ala Gln Asn Lys Leu Ile Lys Val 20 25 30
Arg Lys Pro Tyr Leu His Glu Ser Arg His Leu His Ala Leu Lys Arg 35
40 45 Val Arg Gly Ser Gly Gly Arg Phe Leu Asn Thr Lys 50 55 60
9260PRTOryza sativaG3924 conserved domain 92Glu Pro Val Tyr Val Asn
Ala Lys Gln Tyr His Gly Ile Leu Arg Arg 1 5 10 15 Arg Gln Ser Arg
Ala Lys Ala Glu Leu Glu Lys Lys Val Val Lys Ser 20 25 30 Arg Lys
Pro Tyr Leu His Glu Ser Arg His Gln His Ala Met Arg Arg 35 40 45
Ala Arg Gly Thr Gly Gly Arg Phe Leu Asn Thr Lys 50 55 60
9359PRTOryza sativaG3926 conserved domain 93Glu Pro Ile Phe Val Asn
Ala Lys Gln Tyr Asn Ala Ile Leu Arg Arg 1 5 10 15 Arg Gln Thr Arg
Ala Lys Leu Glu Ala Gln Asn Lys Ala Val Lys Gly 20 25 30 Arg Lys
Pro Tyr Leu His Glu Ser Arg His His His Ala Met Lys Arg 35 40 45
Ala Arg Gly Ser Gly Gly Arg Phe Leu Thr Lys 50 55 9460PRTOryza
sativaG3925 conserved domain 94Glu Pro Ile Tyr Val Asn Ala Lys Gln
Tyr His Ala Ile Leu Arg Arg 1 5 10 15 Arg Gln Leu Arg Ala Lys Leu
Glu Ala Glu Asn Lys Leu Val Lys Asn 20 25 30 Arg Lys Pro Tyr Leu
His Glu Ser Arg His Gln His Ala Met Lys Arg 35 40 45 Ala Arg Gly
Thr Gly Gly Arg Phe Leu Asn Thr Lys 50 55 60 9560PRTZea maysG3921
conserved domain 95Glu Pro Ile Tyr Val Asn Ala Lys Gln Tyr His Ala
Ile Leu Arg Arg 1 5 10 15 Arg Gln Thr Arg Ala Lys Leu Glu Ala Gln
Asn Lys Met Val Lys Gly 20
25 30 Arg Lys Pro Tyr Leu His Glu Ser Arg His Arg His Ala Met Lys
Arg 35 40 45 Ala Arg Gly Ser Gly Gly Arg Phe Leu Asn Thr Lys 50 55
60 9660PRTZea maysG3922 conserved domain 96Glu Pro Ile Tyr Val Asn
Ala Lys Gln Tyr His Ala Ile Leu Arg Arg 1 5 10 15 Arg Gln Thr Arg
Ala Lys Leu Glu Ala Gln Asn Lys Met Val Lys Asn 20 25 30 Arg Lys
Pro Tyr Leu His Glu Ser Arg His Arg His Ala Met Lys Arg 35 40 45
Ala Arg Gly Ser Gly Gly Arg Phe Leu Asn Thr Lys 50 55 60 9760PRTZea
maysG4264 conserved domain 97Glu Pro Ile Tyr Val Asn Ala Lys Gln
Tyr His Ala Ile Leu Arg Arg 1 5 10 15 Arg Gln Thr Arg Ala Lys Leu
Glu Ala Gln Asn Lys Met Val Lys Asn 20 25 30 Arg Lys Pro Tyr Leu
His Glu Ser Arg His Arg His Ala Met Lys Arg 35 40 45 Ala Arg Gly
Ser Gly Gly Arg Phe Leu Asn Thr Lys 50 55 60 9858PRTArabidopsis
thalianaG2632 conserved domain 98Glu Pro Val Phe Val Asn Ala Lys
Gln Tyr Gln Ala Ile Leu Arg Arg 1 5 10 15 Arg Gln Ala Arg Ala Lys
Ala Glu Leu Glu Lys Lys Leu Ile Lys Ser 20 25 30 Arg Lys Pro Tyr
Leu His Glu Ser Arg His Gln His Ala Met Arg Arg 35 40 45 Pro Arg
Gly Thr Gly Gly Arg Phe Ala Lys 50 55 9958PRTArabidopsis
thalianaG1334 conserved domain 99Asp Gly Thr Ile Tyr Val Asn Ser
Lys Gln Tyr His Gly Ile Ile Arg 1 5 10 15 Arg Arg Gln Ser Arg Ala
Lys Ala Glu Lys Leu Ser Arg Cys Arg Lys 20 25 30 Pro Tyr Met His
His Ser Arg His Leu His Ala Met Arg Arg Pro Arg 35 40 45 Gly Ser
Gly Gly Arg Phe Leu Asn Thr Lys 50 55 10058PRTArabidopsis
thalianaG926 conserved domain 100Glu Pro Val Tyr Val Asn Ala Lys
Gln Tyr Glu Gly Ile Leu Arg Arg 1 5 10 15 Arg Lys Ala Arg Ala Lys
Ala Glu Leu Glu Arg Lys Val Ile Arg Asp 20 25 30 Arg Lys Pro Tyr
Leu His Glu Ser Arg His Lys His Ala Met Arg Arg 35 40 45 Ala Arg
Ala Ser Gly Gly Arg Phe Ala Lys 50 55 10164PRTArabidopsis
thalianaG927 conserved domain 101Ser Thr Ile Tyr Val Asn Ser Lys
Gln Tyr His Gly Ile Ile Arg Arg 1 5 10 15 Arg Gln Ser Arg Ala Lys
Ala Ala Ala Val Leu Asp Gln Lys Lys Leu 20 25 30 Ser Ser Arg Cys
Arg Lys Pro Tyr Met His His Ser Arg His Leu His 35 40 45 Ala Leu
Arg Arg Pro Arg Gly Ser Gly Gly Arg Phe Leu Asn Thr Lys 50 55 60
10266PRTZea maysG3911 conserved domain 102Leu Pro Leu Ala Arg Ile
Lys Lys Ile Met Lys Ala Asp Glu Asp Val 1 5 10 15 Arg Met Ile Ala
Ala Glu Ala Pro Val Val Phe Ala Arg Ala Cys Glu 20 25 30 Met Phe
Ile Leu Glu Leu Thr His Arg Gly Trp Ala His Ala Glu Glu 35 40 45
Asn Lys Arg Arg Thr Leu Gln Lys Ser Asp Ile Ala Ala Ala Ile Ala 50
55 60 Arg Thr 65 10366PRTOryza sativaG3546 conserved domain 103Leu
Pro Leu Ala Arg Ile Lys Lys Ile Met Lys Ala Asp Glu Asp Val 1 5 10
15 Arg Met Ile Ala Ala Glu Ala Pro Val Val Phe Ala Arg Ala Cys Glu
20 25 30 Met Phe Ile Leu Glu Leu Thr His Arg Gly Trp Ala His Ala
Glu Glu 35 40 45 Asn Lys Arg Arg Thr Leu Gln Lys Ser Asp Ile Ala
Ala Ala Ile Ala 50 55 60 Arg Thr 65 10466PRTZea maysG3909 conserved
domain 104Leu Pro Leu Ala Arg Ile Lys Lys Ile Met Lys Ala Asp Glu
Asp Val 1 5 10 15 Arg Met Ile Ala Ala Glu Ala Pro Val Val Phe Ser
Arg Ala Cys Glu 20 25 30 Met Phe Ile Leu Glu Leu Thr His Arg Gly
Trp Ala His Ala Glu Glu 35 40 45 Asn Lys Arg Arg Thr Leu Gln Lys
Ser Asp Ile Ala Ala Ala Val Ala 50 55 60 Arg Thr 65 10566PRTZea
maysG3552 conserved domain 105Leu Pro Leu Ala Arg Ile Lys Lys Ile
Met Lys Ala Asp Glu Asp Val 1 5 10 15 Arg Met Ile Ser Ala Glu Ala
Pro Val Val Phe Ala Lys Ala Cys Glu 20 25 30 Ile Phe Ile Leu Glu
Leu Thr Leu Arg Ser Trp Met His Thr Glu Glu 35 40 45 Asn Lys Arg
Arg Thr Leu Gln Lys Asn Asp Ile Ala Ala Ala Ile Thr 50 55 60 Arg
Thr 65 10666PRTArabidopsis thalianaG483 conserved domain 106Leu Pro
Leu Ala Arg Ile Lys Lys Ile Met Lys Ala Asp Glu Asp Val 1 5 10 15
Arg Met Ile Ser Ala Glu Ala Pro Val Ile Phe Ala Lys Ala Cys Glu 20
25 30 Met Phe Ile Leu Glu Leu Thr Leu Arg Ala Trp Ile His Thr Glu
Glu 35 40 45 Asn Lys Arg Arg Thr Leu Gln Lys Asn Asp Ile Ala Ala
Ala Ile Ser 50 55 60 Arg Thr 65 10766PRTGlycine maxG3547 conserved
domain 107Leu Pro Leu Ala Arg Ile Lys Lys Ile Met Lys Ala Asp Glu
Asp Val 1 5 10 15 Arg Met Ile Ser Ala Glu Ala Pro Val Ile Phe Ala
Arg Ala Cys Glu 20 25 30 Met Phe Ile Leu Glu Leu Thr Leu Arg Ser
Trp Asn His Thr Glu Glu 35 40 45 Asn Lys Arg Arg Thr Leu Gln Lys
Asn Asp Ile Ala Ala Ala Ile Thr 50 55 60 Arg Thr 65
10866PRTArabidopsis thalianaG714 conserved domain 108Leu Pro Leu
Ala Arg Ile Lys Lys Ile Met Lys Ala Asp Glu Asp Val 1 5 10 15 Arg
Met Ile Ser Ala Glu Ala Pro Val Val Phe Ala Arg Ala Cys Glu 20 25
30 Met Phe Ile Leu Glu Leu Thr Leu Arg Ser Trp Asn His Thr Glu Glu
35 40 45 Asn Lys Arg Arg Thr Leu Gln Lys Asn Asp Ile Ala Ala Ala
Val Thr 50 55 60 Arg Thr 65 10966PRTOryza sativaG3542 conserved
domain 109Leu Pro Leu Ala Arg Ile Lys Lys Ile Met Lys Ala Asp Glu
Asp Val 1 5 10 15 Arg Met Ile Ser Ala Glu Ala Pro Val Val Phe Ala
Lys Ala Cys Glu 20 25 30 Val Phe Ile Leu Glu Leu Thr Leu Arg Ser
Trp Met His Thr Glu Glu 35 40 45 Asn Lys Arg Arg Thr Leu Gln Lys
Asn Asp Ile Ala Ala Ala Ile Thr 50 55 60 Arg Thr 65
11066PRTArabidopsis thalianaG489 conserved domain 110Leu Pro Leu
Ala Arg Ile Lys Lys Ile Met Lys Ala Asp Glu Asp Val 1 5 10 15 Arg
Met Ile Ser Ala Glu Ala Pro Val Val Phe Ala Arg Ala Cys Glu 20 25
30 Met Phe Ile Leu Glu Leu Thr Leu Arg Ser Trp Asn His Thr Glu Glu
35 40 45 Asn Lys Arg Arg Thr Leu Gln Lys Asn Asp Ile Ala Ala Ala
Val Thr 50 55 60 Arg Thr 65 11166PRTOryza sativaG3544 conserved
domain 111Leu Pro Leu Ala Arg Ile Lys Lys Ile Met Lys Ala Asp Glu
Asp Val 1 5 10 15 Arg Met Ile Ser Ala Glu Ala Pro Val Ile Phe Ala
Lys Ala Cys Glu 20 25 30 Ile Phe Ile Leu Glu Leu Thr Leu Arg Ser
Trp Met His Thr Glu Glu 35 40 45 Asn Lys Arg Arg Thr Leu Gln Lys
Asn Asp Ile Ala Ala Ala Ile Thr 50 55 60 Arg Thr 65 11266PRTGlycine
maxG3550 conserved domain 112Leu Pro Leu Ala Arg Ile Lys Lys Ile
Met Lys Ala Asp Glu Asp Val 1 5 10 15 Arg Met Ile Ser Ala Glu Ala
Pro Val Ile Phe Ala Lys Ala Cys Glu 20 25 30 Met Phe Ile Leu Glu
Leu Thr Leu Arg Ser Trp Ile His Thr Glu Glu 35 40 45 Asn Lys Arg
Arg Thr Leu Gln Lys Asn Asp Ile Ala Ala Ala Ile Ser 50 55 60 Arg
Asn 65 11366PRTGlycine maxG3548 conserved domain 113Leu Pro Leu Ala
Arg Ile Lys Lys Ile Met Lys Ala Asp Glu Asp Val 1 5 10 15 Arg Met
Ile Ser Ala Glu Ala Pro Val Ile Phe Ala Lys Ala Cys Glu 20 25 30
Met Phe Ile Leu Glu Leu Thr Leu Arg Ser Trp Ile His Thr Glu Glu 35
40 45 Asn Lys Arg Arg Thr Leu Gln Lys Asn Asp Ile Ala Ala Ala Ile
Ser 50 55 60 Arg Asn 65 11466PRTArabidopsis thalianaG715 conserved
domain 114Leu Pro Leu Ala Arg Ile Lys Lys Ile Met Lys Ala Asp Glu
Asp Val 1 5 10 15 Arg Met Ile Ser Ala Glu Ala Pro Ile Leu Phe Ala
Lys Ala Cys Glu 20 25 30 Leu Phe Ile Leu Glu Leu Thr Ile Arg Ser
Trp Leu His Ala Glu Glu 35 40 45 Asn Lys Arg Arg Thr Leu Gln Lys
Asn Asp Ile Ala Ala Ala Ile Thr 50 55 60 Arg Thr 65 11566PRTGlycine
maxG3886 conserved domain 115Leu Pro Leu Ala Arg Ile Lys Lys Ile
Met Lys Ala Asp Glu Asp Val 1 5 10 15 Arg Met Ile Ser Ala Glu Ala
Pro Ile Leu Phe Ala Lys Ala Cys Glu 20 25 30 Leu Phe Ile Leu Glu
Leu Thr Ile Arg Ser Trp Leu His Ala Glu Glu 35 40 45 Asn Lys Arg
Arg Thr Leu Gln Lys Asn Asp Ile Ala Ala Ala Ile Thr 50 55 60 Arg
Thr 65 11666PRTZea maysG3889 conserved domain 116Leu Pro Leu Ala
Arg Ile Lys Lys Ile Met Lys Ala Asp Glu Asp Val 1 5 10 15 Arg Met
Ile Ser Ala Glu Ala Pro Val Leu Phe Ala Lys Ala Cys Glu 20 25 30
Leu Phe Ile Leu Glu Leu Thr Ile Arg Ser Trp Leu His Ala Glu Glu 35
40 45 Asn Lys Arg Arg Thr Leu Gln Arg Asn Asp Val Ala Ala Ala Ile
Ala 50 55 60 Arg Thr 65 11766PRTArabidopsis thalianaG1646 conserved
domain 117Leu Pro Leu Ala Arg Ile Lys Lys Ile Met Lys Ala Asp Glu
Asp Val 1 5 10 15 Arg Met Ile Ser Ala Glu Ala Pro Ile Leu Phe Ala
Lys Ala Cys Glu 20 25 30 Leu Phe Ile Leu Glu Leu Thr Ile Arg Ser
Trp Leu His Ala Glu Glu 35 40 45 Asn Lys Arg Arg Thr Leu Gln Lys
Asn Asp Ile Ala Ala Ala Ile Thr 50 55 60 Arg Thr 65 11866PRTOryza
sativaG3543 conserved domain 118Leu Pro Leu Ala Gly Ile Lys Lys Ile
Met Lys Ala Asp Glu Asp Val 1 5 10 15 Arg Met Ile Ser Ala Glu Ala
Pro Val Leu Phe Ala Lys Ala Cys Glu 20 25 30 Leu Phe Ile Leu Glu
Leu Thr Ile Arg Ser Trp Leu His Ala Glu Glu 35 40 45 Asn Lys Arg
Arg Thr Leu Gln Arg Lys Asp Val Ala Ala Ala Ile Ala 50 55 60 Arg
Thr 65 11966PRTArabidopsis thalianaG1820 conserved domain 119Leu
Pro Leu Ala Arg Ile Lys Lys Ile Met Lys Ala Asp Pro Asp Val 1 5 10
15 His Met Val Ser Ala Glu Ala Pro Ile Ile Phe Ala Lys Ala Cys Glu
20 25 30 Met Phe Ile Val Asp Leu Thr Met Arg Ser Trp Leu Lys Ala
Glu Glu 35 40 45 Asn Lys Arg His Thr Leu Gln Lys Ser Asp Ile Ser
Asn Ala Val Ala 50 55 60 Ser Ser 65 12066PRTArabidopsis
thalianaG1836 conserved domain 120Leu Pro Ile Thr Arg Ile Lys Lys
Ile Met Lys Tyr Asp Pro Asp Val 1 5 10 15 Thr Met Ile Ala Ser Glu
Ala Pro Ile Leu Leu Ser Lys Ala Cys Glu 20 25 30 Met Phe Ile Met
Asp Leu Thr Met Arg Ser Trp Leu His Ala Gln Glu 35 40 45 Ser Lys
Arg Val Thr Leu Gln Lys Ser Asn Val Asp Ala Ala Val Ala 50 55 60
Gln Thr 65 12165PRTArabidopsis thalianaG1819 conserved domain
121Phe Pro Leu Thr Arg Ile Lys Lys Ile Met Lys Ser Asn Pro Glu Val
1 5 10 15 Asn Met Val Thr Ala Glu Ala Pro Val Leu Ile Ser Lys Ala
Cys Glu 20 25 30 Met Leu Ile Leu Asp Leu Thr Met Arg Ser Trp Leu
His Thr Val Glu 35 40 45 Gly Gly Arg Gln Thr Leu Lys Arg Ser Asp
Thr Leu Thr Arg Ser Asp 50 55 60 Ile 65 12265PRTArabidopsis
thalianaG1818 conserved domain 122Pro Ile Ser Arg Ile Lys Arg Ile
Met Lys Phe Asp Pro Asp Val Ser 1 5 10 15 Met Ile Ala Ala Glu Ala
Pro Asn Leu Leu Ser Lys Ala Cys Glu Met 20 25 30 Phe Val Met Asp
Leu Thr Met Arg Ser Trp Leu His Ala Gln Glu Ser 35 40 45 Asn Arg
Leu Thr Ile Arg Lys Ser Asp Val Asp Ala Val Val Ser Gln 50 55 60
Thr 65 12366PRTArabidopsis thalianaG490 conserved domain 123Leu Pro
Leu Ser Arg Val Arg Lys Ile Leu Lys Ser Asp Pro Glu Val 1 5 10 15
Lys Lys Ile Ser Cys Asp Val Pro Ala Leu Phe Ser Lys Ala Cys Glu 20
25 30 Tyr Phe Ile Leu Glu Val Thr Leu Arg Ala Trp Met His Thr Gln
Ser 35 40 45 Cys Thr Arg Glu Thr Ile Arg Arg Cys Asp Ile Phe Gln
Ala Val Lys 50 55 60 Asn Ser 65 12464PRTArabidopsis thalianaG3074
conserved domain 124Pro Ala Ala Arg Ile Lys Lys Ile Met Gln Ala Asp
Glu Asp Val Gly 1 5 10 15 Lys Ile Ala Leu Ala Val Pro Val Leu Val
Ser Lys Ser Leu Glu Leu 20 25 30 Phe Leu Gln Asp Leu Cys Asp Arg
Thr Tyr Glu Ile Thr Leu Glu Arg 35 40 45 Gly Ala Lys Thr Val Ser
Ser Leu His Leu Lys His Cys Val Glu Arg 50 55 60
12564PRTArabidopsis thalianaG1249 conserved domain 125Pro Ile Gly
Arg Val Lys Lys Ile Met Lys Leu Asp Lys Asp Ile Asn 1 5 10 15 Lys
Ile Asn Ser Glu Ala Leu His Val Ile Thr Tyr Ser Thr Glu Leu 20 25
30 Phe Leu His Phe Leu Ala Glu Lys Ser Ala Val Val Thr Ala Glu Lys
35 40 45 Lys Arg Lys Thr Val Asn Leu Asp His Leu Arg Ile Ala Val
Lys Arg 50 55 60 12663PRTArabidopsis thalianaG3075 conserved domain
126Pro Met Asn Arg Ile Arg Arg Ile Met Arg Ser Asp Asn Ser Ala Pro
1 5 10 15 Gln Ile Met Gln Asp Ala Val Phe Leu Val Asn Lys Ala Thr
Glu Met 20 25 30 Phe Ile Glu Arg Phe Ser Glu Glu Ala Tyr Asp Ser
Ser Val Lys Asp 35 40 45 Lys Lys Lys Phe Ile His Tyr Lys His Leu
Ser Ser Val Val Ser 50 55 60 127953DNAArtificial sequencesynthetic
sequence 127ggagagacct ttaacaattt tctgagggta agatccagag attgattgaa
tcagcttact 60attttatata attcagtttg ttgttcctca gacttgtaac taggacagtc
ttctcatgaa 120tcatgacttc ttcagtacat gagctctctg ataacaatga
aagtcatgcg aagaaagaac 180gtccagattc ccaaacccga ccacaggttc
cttcaggacg aagttcggaa tctattgata 240caaactctgt ctactcagag
cccatggcac atggattata cccgtatcca gatccttact 300acagaagcgt
ctttgcacag
caagcgtatc ttccacatcc ctatcctggg gtccaattgc 360agttaatggg
aatgcagcag ccaggagttc cattgcaatg tgatgcagtc gaggaacctg
420tttttgttaa cgcaaagcaa taccatggta tactcaggcg caggcaatcc
cgggcaaaac 480ttgaggcacg aaatagagcc atcaaagcaa aaaagccata
catgcatgaa tctcggcatt 540tacatgcgat aagacggcca agaggatgtg
gtggccggtt tctcaatgcc aagaaggaaa 600atggagacca caaggaggag
gaggaggcaa cctctgatga gaacacttca gaagcaagtt 660ccagcctcag
gtccgagaaa ttagctatgg ctacttctgg tcctaatggt agatcttgag
720gaaggtttct gcacaaccac aagtttagtt tctattttgg gtggatgttc
tcagggcatc 780atcgtcttta gtgtttttgg atacgctgtg tacaggttat
ttgctagggt aaactttgtt 840ttagcgatta gaaataaaac taagcaaaga
aatgaaaagt gtgattggaa gtattgttgt 900accaaattga tattctttgc
caatgaactc atgttttgga aagtaaaaaa aaa 953128812DNAArtificial
sequencesynthetic sequence 128ttattctaag tagcttgact tgtttagttt
aaatatgagg ttaatgattt tgtggggatt 60tgatagttct ggttcttgag tttatttaaa
ataggtttac caggatcatg tactgactct 120gttctttgga acttttcaga
attctgcttc ggacattaag ctcatgagtc atgacttctt 180caatccatga
gctttctgat aacattggaa gtcatgagaa gcaagaacag agagattctc
240atttccaacc accaatccct tctgcaagaa attatgaatc aattgttaca
agtttagtct 300actcagaccc ggggactaca aattccatgg cacctggaca
atatccatat ccagatcctt 360actacagaag catatttgca ccgcctccac
aaccgtatac cggggtacat ctacagttga 420tgggagtgca gcaacaaggc
gttcctttac catctgatgc agtcgaggaa cctgtttttg 480ttaacgcaaa
gcaataccac ggtatactaa ggcgcagaca atcaagagca agacttgagt
540ctcagaataa agtcatcaag tcacgtaagc cgtatttgca tgaatctcgg
catttgcatg 600cgataagacg accaagagga tgtggcgggc ggtttctaaa
tgccaagaag gaggatgagc 660atcacgaaga cagtagtcat gaagaaaaat
ccaaccttag cgctggtaaa tccgccatgg 720ctgcttctag tggtacatct
tgagaaggtc ctacaagtag ctttgttgta ttttggctct 780gtttggtctc
agatcatcta tgtcttttag tg 8121291138DNAArtificial sequencesynthetic
sequence 129tgacagacac atgtatcatc aatcttctct gttgaagcag agagagagag
agctaattgt 60tgcctctgag tcacatggat aagaaagttt catttactag ctctgtggca
cattcaactc 120caccatacct tagtacttcc atctcatggg gacttccaac
caaatccaat ggtgtgactg 180aatcactgag tttgaaggtg gtagatgcaa
gaccagaacg tcttataaac acaaagaata 240tcagtttcca ggaccaggat
tcatcttcaa ctctgtcctc tgctcaatct tctaacgatg 300ttacaagtag
tggagatgat aacccctcaa gacaaatctc atttttagca cattcagatg
360tttgtaaagg atttgaagaa actcaaagga agcgatttgc aattaaatca
ggctcctcca 420cggcaggaat cgctgatatt cactcttctc cttccaaggc
taacttctca tttcactatg 480ccgatccaca ttttggtggt ttaatgcctg
cggcttacct accacaggca acaatatgga 540atccccaaat gactcgagtt
ccgctaccat tcgatctcat agagaatgag cctgtctttg 600tcaatgcaaa
gcaattccat gcaattatga ggaggaggca acagcgtgct aagctagagg
660cgcaaaacaa actaatcaaa gcccgtaagc cgtatcttca tgaatctcga
catgttcacg 720ctcttaaacg acctagagga tctggtggaa gattcctaaa
caccaaaaag cttcaagaat 780ctacagatcc aaaacaagac atgccaatcc
aacagcaaca cgcaacggga aacatgtcaa 840gatttgtgct ttatcagttg
cagaacagca atgactgtga ttgttcaacc acttctcgct 900ctgacatcac
atctgcttct gacagcgtta atctctttgg acactctgaa tttctgatat
960cagattgccc atctcagaca aacccaacaa tgtatgttca tggtcaatca
aatgacatgc 1020atggaggtag gaacacacac catttctctg tccatatctg
agccggtgga atctggtaat 1080gtgtacgttc ctacaaaaaa agggaagtca
tccttggctg ctacttcgct tattagct 1138130956DNAArtificial
sequencesynthetic sequence 130agttggtgct aagatgccag ggaaacctga
cactgatgat tggcgtgtag agcgtgggga 60gcagattcag tttcagtctt ccatttactc
tcatcatcag ccttggtggc gcggagtggg 120ggaaaatgcc tccaaatcat
cttcagatga tcagttaaat ggttcaatcg tgaatggtat 180cacgcggtct
gagaccaatg ataagtcagg cggaggtgtt gccaaagaat accaaaacat
240caaacatgcc atgttgtcaa ccccatttac catggagaaa catcttgctc
caaatcccca 300gatggaactt gttggtcatt cagttgtttt aacatctcct
tattcagatg cacagtatgg 360tcaaatcttg actacttacg ggcaacaagt
tatgataaat cctcagttgt atggaatgca 420tcatgctaga atgcctttgc
cacttgaaat ggaagaggag cctgtttatg tcaatgcgaa 480gcagtatcat
ggtattttga ggcgaagaca gtcacgtgct aaggctgaga ttgaaaagaa
540agtaatcaaa aacaggaagc catacctcca tgaatcccgt caccttcatg
caatgagaag 600ggcaagaggc aacggtggtc gctttctcaa cacaaagaag
cttgaaaata acaattctaa 660ttccacttca gacaaaggca acaatactcg
tgcaaacgcc tcaacaaact cgcctaacac 720tcaacttttg ttcaccaaca
atttgaatct aggctcatca aatgtttcac aagccacagt 780tcagcacatg
cacacagagc agagtttcac tataggttac cataatggaa atggtcttac
840agcactatac cgttcacaag caaatgggaa aaaggaggga aactgctttg
gtaaagagag 900ggaccctaat ggggatttca aataacactt ccctcagcca
tacagcaaga gttagg 9561311486DNAArtificial sequencesynthetic
sequence 131cccaggaaag gtaaaagaga cggagacgaa ccaaaacaag gaagaaagaa
gaagatctta 60catacgaaga tcactctctg attcactctg agagacaaac tggtttactt
tggttctgtt 120tgacaaaagg agacatgcaa aaataaatct ctatcccttg
tttttcttct tcgcttcatc 180gattactcaa agaggttgtt ggttgtgaga
ataattagct tgttaaggaa gacgttatga 240tgcatcagat gttgaataag
aaagattcag ctactcattc cactttgcca taccttaata 300ctagcatctc
ttggggagtg gttccaactg attccgttgc taatcgtcgc ggtcctgctg
360aatcactaag cttgaaggtt gattcaagac ctgggcatat acaaactaca
aagcaaatca 420gttttcagga ccaagattca tcttcaacac agtccactgg
tcaatcttat actgaagttg 480ctagtagtgg tgatgataat ccttccagac
aaatctcctt ttcggctaaa tcaggatctg 540aaataactca acggaagggg
tttgcaagta atcctaaaca aggctcgatg actggatttc 600cgaatattca
ctttgctcct gcacaggcta atttctcatt tcactatgct gatccacatt
660atggtggttt attagctgca acttacctac cacaggcacc aacatgcaat
cctcaaatgg 720tgagtatgat tcctggtcgt gttcctttac cagcagagct
cacagaaact gatccagtct 780ttgtcaatgc gaagcaatac cacgcaatta
tgaggaggag acagcaacgt gctaagcttg 840aggctcaaaa caaactaatc
agagcccgta agccctatct tcatgagtct cgacatgttc 900atgctcttaa
aaggccaaga ggatctggtg gaagattcct aaacaccaaa aaacttcttc
960aagaatccga acaggctgct gctagagaac aagaacagga caagttaggc
caacaggtaa 1020acagaaagac caacatgtct agattcgaag ctcatatgct
gcagaacaac aaagaccgca 1080gctcaaccac ttctggctca gacatcacct
ctgtttccga cggtgctgat atctttggac 1140acactgaatt ccagttttca
ggtttcccaa ctccgataaa ccgagccatg cttgttcatg 1200gtcagtctaa
tgacatgcat ggaggtggag acatgcacca tttctctgtc catatctgag
1260acagtggatc ttggtgctgt gttcatgttc ccaccaagaa ggggaagtca
tccttggcta 1320ctactagttc tttcgcttgt tgtaacttca gtgtttttat
ttcatattat gtctgtgtta 1380gacatcacaa gaacgaccaa gatcttcact
ttgaaacact ctattacctt ttcatcttct 1440gttaccatgg atctcttgtc
taaactagtg atatgattct tctgat 14861321007DNAArtificial
sequencesynthetic sequence 132gatttgtgac tggacttgtt ggtttggaca
tttagtttat tgaagtaaag atttgaagac 60aatgcaagtg tttcaaagga aagaagattc
atcttgggga aactcaatgc ctacaacaaa 120ttcaaatatt caaggatctg
aatctttcag cttgactaag gatatgataa tgtctacaac 180acaattaccc
gcgatgaaac attcgggttt gcagctgcaa aatcaagatt caacctcatc
240acaatctact gaagaagaat caggcggcgg tgaagttgca agctttggag
aatataagcg 300ttatggatgc agcattgtta ataacaatct ctcaggttac
atcgaaaact tgggaaagcc 360tattgaaaat tatactaagt caattactac
ctcgtcgatg gtgtctcaag actctgtgtt 420tcctgctcct acttctggtc
aaatatcttg gtctcttcaa tgtgctgaaa cgtcacattt 480caatggtttc
ttggctcctg aatatgcatc aacaccaacg gcgctgccac atttagagat
540gatgggtttg gtttcttcaa gagtgccatt gcctcatcac attcaagaga
atgaaccaat 600atttgtcaat gcgaaacagt atcatgcgat tctccgtcgc
aggaagcacc gtgctaaact 660cgaagctcag aacaaactca tcaaatgccg
taaaccgtac cttcatgagt ctcgccatct 720tcatgcttta aagagagcta
gaggctccgg tggacgtttc ctcaatacaa agaagcttca 780agaatcatca
aactcactgt gttcttctca aatggcaaat ggacaaaatt tctctatgag
840ccctcacggt ggtggaagcg gaatcgggtc tagttcgatc tcaccgagct
ccaattcaaa 900ctgtatcaac atgttccaaa acccgcagtt cagattctca
ggttatccgt caacacacca 960tgcctcagct ctcatgtcag ggacttgagg
cacatgagaa gaccttg 10071331671DNAArtificial sequencesynthetic
sequence 133atgcaagagt tccatagtag caaagattca ttgccttgtc ctgcaacttc
ttgggataac 60tctgtcttca ccaactcaaa tgtccaagga tcatcatcct tgaccgataa
caacacttta 120agcttgacaa tggagatgaa acaaactggt tttcaaatgc
agcactatga ttcctcctct 180actcaatcca ctggaggaga atcatatagt
gaagttgcta gcttaagtga acctactaat 240cgttatggcc acaacattgt
tgtcactcat ctctcaggtt acaaagaaaa cccggaaaat 300cctattggaa
gtcattcgat atcaaaggtg tctcaagatt cagtggttct tcctattgag
360gcggcttctt ggcctttaca cggcaatgta acgccacatt tcaatggttt
cttgtctttt 420ccttatgcat cacaacacac ggtgcagcat cctcaaatca
gagggttggt tccgtctaga 480atgcctttgc ctcacaacat tccagagaac
gaaccaattt tcgtcaatgc aaaacagtac 540caagccattc tccgccgcag
agagcgccgt gcaaagcttg aagctcagaa caagctcatc 600aaagtccgca
aaccatatct tcacgagtcg cggcacctcc atgcactaaa gagagttaga
660ggctctggtg gacgtttcct caacacaaag aagcatcaag aatcaaattc
ctcactatct 720cctccattct tgattccacc tcatgtcttc aagaactctc
caggaaagtt ccggcaaatg 780gacatttcaa ggggtggggt tgtgtctagt
gtctcgacaa catcttgctc ggacataacc 840gggaacaaca acgacatgtt
ccagcaaaac ccacaattca ggttctcagg ttatccatca 900aaccaccatg
tctcagtcct catggcggcc gctgccgctg cggcagcggc catggtgagc
960aagggcgagg agctgttcac cggggtggtg cccatcctgg tcgagctgga
cggcgacgta 1020aacggccaca agttcagcgt gtccggcgag ggcgagggcg
atgccaccta cggcaagctg 1080accctgaagt tcatctgcac caccggcaag
ctgcccgtgc cctggcccac cctcgtgacc 1140accttcggct acggcctgca
gtgcttcgcc cgctaccccg accacatgaa gcagcacgac 1200ttcttcaagt
ccgccatgcc cgaaggctac gtccaggagc gcaccatctt cttcaaggac
1260gacggcaact acaagacccg cgccgaggtg aagttcgagg gcgacaccct
ggataaccga 1320atcgagctga agggcatcga cttcaaggag gacggcaaca
tcctggggca caagctggag 1380tacaactaca acagccacaa cgtctatatc
atggccgaca agcagaagaa cggcatcaag 1440gtgaacttca agatccgcca
caacatcgag gacggcagcg tgcagctcgc cgaccactac 1500cagcagaaca
cccccatcgg cgacggcccc gtggtgctgc ccgacaacca ctacctgagc
1560taccagtccg ccctgagcaa agaccccaac gagaagcgcg atcacatggt
cctgctggag 1620ttcgtgaccg ccgccgggat cactctcggc atggacgagc
tgtacaagta a 1671134824DNAArtificial sequencesynthetic sequence
134gcattccgag gtgagcgagc atggagtcga ggccgggggg aaccaacctc
gtggagccga 60gggggcaggg cgcgctgccg tccggcatac cgatccagca gccgtggtgg
acgacctccg 120ccggggtcgg ggcggtgtcg cccgccgtcg tggcgccggg
gagcggtgcg gggatcagcc 180tgtcgggcag ggatggcggc ggcgacgacg
cggcagagga gagcagcgat gactcacgaa 240gatcagggga gaccaaagat
ggaagcactg atcaagaaaa gcatcatgca acatcgcaga 300tgactgcttt
ggcatcagac tatttaacac cattttcaca gctggaacta aaccaaccaa
360ttgcttcggc agcataccag taccctgact cttactatat gggcatggtt
ggtccctatg 420gacctcaagc tatgtccgca cagactcatt tccagctacc
tggattaact cactctcgta 480tgccgttgcc tcttgaaata tctgaggagc
ctgtttatgt aaatgctaag caatatcatg 540gaattttaag acggaggcag
tcacgtgcga aggctgaact tgagaaaaaa gttgttaaat 600caagaaagcc
ctatcttcat gagtctcgtc atcaacatgc tatgcgaagg gcaagaggaa
660cgggtggacg cttcctgaac acaaagaaaa atgaagatgg tgctcccagt
gagaaagccg 720aaccaaacaa aggagagcag aactccgggt atcgccggat
ccctcctgac ttacagctcc 780tacagaagga aacatgaagt agcggctcga
aacctagaac agtg 8241351000DNAArtificial sequencesynthetic sequence
135gtggatcttg agtaatgcct tctaataatg ataatgctgt tgcaagaaat
ggagaatcat 60cctgtccaat gcatggccaa gaccaactat gattttcttg ccaggaataa
ctatccaatg 120aaacagttag ttcagaggaa ctctgatggt gactcgtcac
caacaaagtc tggggagtct 180caccaagaag catctgcagt aagtgacagc
agtctcaacg gacaacacac ctcaccacaa 240tcagtgtttg tcccctcaga
tattaacaac aatgatagtt gtggggagcg ggaccatggc 300actaagtcgg
tattgtcttt ggggaacaca gaagctgcct ttcctccttc aaagttcgat
360tacaaccagc cttttgcatg tgtttcttat ccatatggta ctgatccata
ttatggtgga 420gtattaacag gatacacttc acatgcattt gttcatcctc
aaattactgg tgctgcaaac 480tctaggatgc cattgcctgt tgatccttct
gtagaagagc ccatatttgt caatgcaaag 540caatacaatg cgatccttag
aagaaggcaa acgcgtgcaa aattggaggc ccaaaataag 600gcggtgaaag
gtcggaagcc ttacctccat gaatctcgac atcatcatgc tatgaagcga
660gcccgtggat caggtggtcg gttccttacc aaaaaggagc tgctggaaca
gcagcagcag 720cagcagcagc agaagccacc accggcatca gctcagtctc
caacaggtag agccagaacg 780agcggcggtg ccgttgtcct tggcaagaac
ctgtgcccag agaacagcac atcctgctcg 840ccatcgacac cgacaggctc
cgagatctcc agcatctcat ttgggggcgg catgctggct 900caccaagagc
acatcagctt cgcatccgct gatcgccacc ccacaatgaa ccagaaccac
960cgtgtccccg tcatgaggtg aaaacctcgg gatcgcggga
1000136756DNAArtificial sequencesynthetic sequence 136ggaggaggtt
tgccggagag gggacatgct ccctcctcat ctcacagaaa atggcacagt 60aatgattcag
tttggtcata aaatgcctga ctacgagtca tcagctaccc aatcaactag
120tggatctcct cgtgaagtgt ctggaatgag cgaaggaagc ctcaatgagc
agaatgatca 180atctggtaat cttgatggtt acacgaagag tgatgaaggt
aagatgatgt cagctttatc 240tctgggcaaa tcagaaactg tgtatgcaca
ttcggaacct gaccgtagcc aaccctttgg 300catatcatat ccatatgctg
attcgttcta tggtggtgct gtagcgactt atggcacaca 360tgctattatg
catccccaga ttgtgggcgt gatgtcatcc tcccgagtcc cgctaccaat
420agaaccagcc accgaagagc ctatttatgt aaatgcaaag caataccatg
cgattctccg 480aaggagacag ctccgtgcaa agttagaggc tgaaaacaag
ctggtgaaaa accgcaagcc 540gtacctccat gaatcccggc atcaacacgc
gatgaagaga gctcggggaa caggggggag 600attcctcaac acaaagcagc
agcctgaagc ttcagatggt ggcaccccaa ggctcgtctc 660tgcaaacggc
gttgtgttct caaagcacga gcacagcttg tcgtccagtg atctccatca
720tcgtcgtgcg aaagagggcg cttgagatcc tcgccg 7561371091DNAArtificial
sequencesynthetic sequence 137agatcatctg atttctcaga agcaaaatgt
tgtttggagc tcagtgacac catcttgtaa 60tgcctgtgat tttacgggaa atggaggatc
attctgtcca tcccatgtct aagtctaacc 120atggctcctt gtcaggaaat
ggttatgaga tgaaacattc aggccataaa gtttgcgata 180gggattcatc
atcggagtct gatcggtctc accaagaagc atcagcagca agtgaaagca
240gtccaaatga acacacatca actcaatcag acaatgatga agatcatggg
aaagataatc 300aggacacaat gaagccagta ttgtccttgg ggaaggaagg
ctctgccttt ttggccccaa 360aattacatta cagcccatct tttgcttgta
ttccttatac ttctgatgct tattatagtg 420cggttggggt cttgacagga
tatcctccac atgccattgt ccatccccag caaaatgata 480caacgaacac
tccgggtatg ttacctgtgg aacctgcaga agaaccaata tatgttaatg
540caaaacaata ccatgcaatc cttaggagga ggcaaacacg tgctaaattg
gaggcccaga 600acaagatggt gaaaaatcgg aagccatatc ttcatgagtc
ccgacatcgt catgccatga 660aacgggctcg tggatcagga ggacggttcc
tcaacacaaa gcagctccag gagcagaacc 720agcagtatca ggcatcgagt
ggttcattgt gctcaaagat cattgccaac agcataatct 780cccaaagtgg
ccccacctgc acgccctctt ctggcactgc aggtgcttca acagccggcc
840aggaccgcag ctgcttgccc tcagttggct tccgccccac gacaaacttc
agtgaccaag 900gtcgaggagg cttgaagctg gccgtgatcg gcatgcagca
gcgtgtttcc accataaggt 960gaagagaagt gggcacaaca ccattcccag
gcacactgcc tgtggcaact catccttggc 1020tcttggaact ttgaatatgc
aatcgacatg tagcttgagt tcctcagaat aaccaaaccg 1080tgaagaatat g
10911381149DNAArtificial sequencesynthetic sequence 138agatcgtttc
atcgtccaag cggaagaagc gtctccttca atcaccgtgg acacctccga 60gactgtctcc
gattgaatgg gaattgaaga catgcattca aaatctgaca gtggtgggaa
120caaggttgat tcagaggttc atggtacagt atcgtcgtcg ataaatagtt
taaacccttg 180gcatcgtgct gctgctgctt gcaatgcaaa ttctagtgtg
gaagctggag ataaatcttc 240taagtcaata gcattagcat tggaatcaaa
cggttccaaa tcaccatcca atagagataa 300tactgttaac aaggaatcac
aagtcacaac gtctccacaa tcagctggag attatagtga 360taaaaaccaa
gaatctctgc atcatggcat cacacaacct cctcctcacc ctcaacttgt
420tggccacaca gttgtaactt cccaatatag atatattgtt cttatcattt
cctttgggaa 480aattttagct atggttgctt acaatttagt tttttcccgg
ttatgaatca tgcagggatg 540ggcatcctca aatccatacc aggatccata
ttatgcagga gtgatgggag cctatggaca 600tcatcccctg gggtttgttc
catatggtgg gatgcctcat tcaagaatgc cactgccgcc 660tgagatggca
caagaaccag ttttcgtgaa tgctaaacag taccaggcga ttctgaggcg
720aaggcaggca cgcgccaagg cagagctaga gaagaagcta ataaaatcca
gaaagcctta 780tctacatgaa tctcggcatc aacatgctat gaggaggcca
aggggtactg gaggacggtt 840tgcaaagaaa accaacaccg aagcttcaaa
gcgtaaagct gaagaaaaga gcaatggtca 900tgttactcag tccccgtcat
catctaattc tgatcaaggt gaagcttgga atggtgacta 960tagaacacct
cagggagatg agatgcagag ctcagcttat aagagaaggg aagaaggaga
1020gtgttcaggg cagcaatgga acagcctttc ctcaaaccat ccttctcaag
ctcgtctagc 1080cattaaatga cctcacaagg cggcaattca ttcttggctt
tctctttgtt ggcttattcg 1140gtagcagcc 1149139873DNAArtificial
sequencesynthetic sequence 139ccattggact tttggaacat aagctatgca
aactgaggag cttttgtcgc caccacagac 60tccttggtgg aatgcttttg gatctcagcc
gttgactaca gagagccttt ccggcgaagc 120ttctgattca ttcaccggag
ttaaggcagt tactacggag gcagaacaag gtgtggtgga 180taaacaaact
tctacaactc tcttcacttt ctcacctggt ggtgaaaaga gttcaagaga
240tgtgccaaag cctcatgttg ctttcgcgat gcaatcagct tgcttcgagt
ttggatttgc 300tcagccaatg atgtacacaa agcatcctca tgttgaacaa
tactatggag ttgtttcagc 360atacggatct cagaggtctt cgggccgagt
aatgattcca ctgaagatgg agacagaaga 420agatggtacc atctatgtga
actcaaagca gtaccatgga attatcaggc gacgccagtc 480ccgagcaaag
gctgaaaaac tgagtagatg ccgtaagcca tatatgcatc actcacgcca
540tctccatgct atgcgccgtc ctagaggatc tggcgggcgt ttcttgaaca
ccaagacagc 600tgatgcggct aagcagtcta agccgagtaa ttctcagagt
tctgaagtct ttcatccgga 660aaatgagacc ataaactcat cgagggaagc
aaatgagtca aatctctcgg attctgcagt 720tacaagtatg gattactttc
taagttcgtc ggcttattct cctggtggca tggtcatgcc 780tatcaagtgg
aatgcagcag caatggatat tggctgctgc aaacttaata tatgatcagc
840agatagggga caagacatga ttggtcacca gtc 8731401394DNAArtificial
sequencesynthetic sequence 140tgtttgatct ttgtccagag aactccaaga
tagaccaaaa aggttcaacc tcaaaacaaa 60caacacaaaa acagccaaat agctagagac
caagatgaga tcaagcagcc aaaattctga 120aaactccaag acttgtctat
ctaacaacat caaagcaacc accaagaatg aagaagataa 180agatgaagag
gatgatgaag aaggcgaaga ggatgaagaa gagagatctg gagatcagag
240cccatctagc aatagctatg aggaagagag tgggagtcac caccatgatc
agaacaagaa 300gaatggagga tccgtgaggc cgtacaaccg ctcaaagact
ccgaggctgc gatggacgcc 360ggagctccat atttgctttc ttcaagctgt
ggagagattg ggtggcccag atagagcaac 420accgaagctt gttctccaat
tgatgaacgt caaggggcta agtattgccc atgttaagag 480tcatcttcag
atgtacagaa gcaagaagac cgatgagcct aatgaaggag atcaaggatt
540ttcgtttgaa cacggagctg gttacactta caaccttagc caacttccaa
tgctacaaag 600ttttgatcaa aggccttctt ctagtttagg atatggtggt
ggttcgtgga ctgaccacag 660acgacagatc
taccgtagcc cttggagagg attaacgaca cgagaaaata caagaacaag
720acaaacaatg tttagctcac agcctggtga gagatatcac ggagttagca
atagtattct 780taacgataag aacaaaacta tttcatttcg aatcaattct
catgaagggg ttcatgataa 840caatggagta gctggagctg ttccaagaat
tcatagaagt tttcttgaag gtatgaaaac 900gtttaacaaa tcatggggac
agagcctctc ttccaatctt aagtcctcca ccgcaacaat 960accacaagat
catattgcta caacgctaaa ttcttatcaa tgggagaatg ctggagtggc
1020agaaggatca gagaatgttt tgaagaggaa gaggttatta ttttctgatg
actgcaataa 1080gtcagaccaa gatttggatc taagcttgtc ccttaaggta
cctcggacac acgacaatct 1140tggagaatgc ttgttagaag atgaagtaaa
agaacatgat gatcatcaag atatcaagag 1200tttgtctctt tcgttatcat
cttcaggttc atcaaaactc gaccgaacca ttaggaaaga 1260agatcaaact
gatcacaaaa agagaaagat ttcggtcttg gcaagtcccc ttgatctcac
1320tctgtgaata tgtataacaa cttatatacg tatattctaa gtgagatctt
gtggtacttg 1380ttgatggaag acgg 13941411560DNAArtificial
sequencesynthetic sequence 141atgcaatcaa aaccgggaag agaaaacgaa
gaggaagtca ataatcacca tgctgttcag 60cagccgatga tgtatgcaga gccctggtgg
aaaaacaact cctttggtgt tgtacctcaa 120gcgagacctt ctggaattcc
atcaaattcc tcttctttgg attgccccaa tggttccgag 180tcaaacgatg
ttcattcagc atctgaagac ggtgcgttga atggtgaaaa cgatggcact
240tggaaggatt cacaagctgc aacttcctct cgttcagata atcacggaat
ggaaggaaat 300gacccagcgc tctctatccg taacatgcat gatcagccac
ttgtacaacc accagagctt 360gttggacact atatcgcttg tgtcccaaac
ccatatcagg atccatatta tgggggattg 420atgggagcat atggtcatca
gcaattgggt tttcgtccat atcttggaat gcctcgtgaa 480agaacagctc
tgccacttga catggcacaa gagcccgttt atgtgaatgc aaagcagtac
540gagggaattc taaggcgaag aaaagcacgt gccaaggcag agctagagag
gaaagtcatc 600cgggacagaa agccatatct tcacgagtca agacacaagc
atgcaatgag aagggcacga 660gcgagtggag gccggtttgc gaagaaaagt
gaggtagaag cgggagagga tgcaggaggg 720agagacagag aaaggggttc
agcaaccaac tcatcaggct ctgaacaagt tgagacagac 780tctaatgaga
ccctgaattc ttctggtgca ccagcggccg ctgccgctgc ggcagcggcc
840atggtgagca agggcgagga gctgttcacc ggggtggtgc ccatcctggt
cgagctggac 900ggcgacgtaa acggccacaa gttcagcgtg tccggcgagg
gcgagggcga tgccacctac 960ggcaagctga ccctgaagtt catctgcacc
accggcaagc tgcccgtgcc ctggcccacc 1020ctcgtgacca ccttcggcta
cggcctgcag tgcttcgccc gctaccccga ccacatgaag 1080cagcacgact
tcttcaagtc cgccatgccc gaaggctacg tccaggagcg caccatcttc
1140ttcaaggacg acggcaacta caagacccgc gccgaggtga agttcgaggg
cgacaccctg 1200gataaccgaa tcgagctgaa gggcatcgac ttcaaggagg
acggcaacat cctggggcac 1260aagctggagt acaactacaa cagccacaac
gtctatatca tggccgacaa gcagaagaac 1320ggcatcaagg tgaacttcaa
gatccgccac aacatcgagg acggcagcgt gcagctcgcc 1380gaccactacc
agcagaacac ccccatcggc gacggccccg tggtgctgcc cgacaaccac
1440tacctgagct accagtccgc cctgagcaaa gaccccaacg agaagcgcga
tcacatggtc 1500ctgctggagt tcgtgaccgc cgccgggatc actctcggca
tggacgagct gtacaagtaa 15601421304DNAArtificial sequencesynthetic
sequence 142ggaatctgaa gctcttctct actctctact ctatcactcc atctgtgaac
atatctttct 60tattcttcta ggcactatct atttttcact ttttgtaatt ggaatttgga
gatggctatg 120caaactgtga gagaaggtct cttctctgct ccacagactt
cttggtggac tgcttttgga 180tctcagccgt tggctccgga gagtctcgcc
ggcgattctg actcattcgc cggagttaag 240gtcggatctg tcggagagac
aagacaacgt gtggataaac agagcaactc tgcaacgcac 300ttagctttct
cacttggtga tgtaaagagt ccaagacttg tgccaaagcc tcatggagct
360actttctcaa tgcaatcacc ttgcttggaa cttggatttt ctcagccacc
gatctataca 420aagtatccct atggagaaca acaatactat ggagttgttt
cagcctatgg atctcagagc 480agggtaatgc ttcctctaaa catggaaacg
gaagatagta ccatctatgt gaactcaaag 540caataccatg gaatcataag
gagacgccaa tcccgcgcaa aggctgctgc tgttcttgat 600cagaagaaat
tgagtagtag atgccgcaag ccatatatgc atcattcgcg ccatctccat
660gcattgcggc gtcctagagg atccggtggg agattcttga acactaaaag
tcagaacttg 720gaaaatagcg gaaccaatgc aaagaaaggt gatggaagta
tgcagattca gtctcagcct 780aagcctcagc aaagtaactc tcagaattct
gaagttgttc atccggaaaa cgggaccatg 840aacttatcga acggattaaa
tgtgtcggga tcagaagtta ctagcatgaa ctacttccta 900agttctcccg
ttcattctct tggtggcatg gtaatgccta gcaagtggat agcagcagca
960gcagcaatgg ataatggctg ctgcaatttc aaaacctgat cctttaccgt
ttcacagtca 1020aacggagaga gataaagaac tcttgccttg gtataaagga
ttttcctttt tgccaatccg 1080ctttggctgt gaacaggcaa atcatctttg
gctcattctc tattaaggta acttcgccgt 1140gaggtgaaaa aagctttgat
atatttatct tcagtgtaaa agtagttaaa actggtgaag 1200aacaatgatg
tgtttggtca ctaaacccac ttgttccaac tagtagtgtg tgttttaaga
1260aaactctgtt atctgatttt gtagctctct ctggctttgt gtgt
1304143627DNAArtificial sequencesynthetic sequence 143acaaccctag
ctgcccccga atccatggat cctaacaaat ccagcacccc gccgccgcct 60ccagtcatgg
gtgcccccgt tgcctaccct ccgcctgcgt accctcccgg tgtggccgcc
120ggcgccggcg cctacccgcc gcagctctac gcaccgccgg ctgctgccgc
ggcccagcag 180gcggcggccg cgcagcagca gcagctgcag atattctggg
cggagcagta ccgcgagatc 240gaggccacta ccgacttcaa gaatcacaac
ctcccgctcg cccgcatcaa gaagatcatg 300aaagccgacg aggacgtccg
catgatcgcc gccgaggctc ccgtggtgtt cgcccgggcc 360tgcgagatgt
tcatcctcga gctcacccat cgcggctggg cgcacgccga agagaacaag
420cgccgcacgc tccagaaatc cgacattgcc gctgccatcg cccgcaccga
ggtattcgac 480ttccttgtgg acatcgttcc gcgcgacgac ggtaaagacg
ctgatgcggc ggccgccgca 540gctgccgcgg ctgccgggat cccgcgcccc
gccgccggag taccagccac cgaccctctc 600gcctactact acgtgcctca gcagtaa
627144649DNAArtificial sequencesynthetic sequence 144gagaaaccct
agcaatggag cccaaatcca ccacccctcc tccgcctcct ccgccccccg 60tgctgggcgc
ccccgtccct tacccgccgg cgggagccta ccccccaccc gtcgggccct
120acgcccacgc gccgccgctc tacgccccgc ctccccccgc cgccgccgcc
gcctccgccg 180ccgccaccgc cgcctcgcag caggccgccg ccgcgcagct
ccagaacttc tgggcggagc 240agtaccgcga gatcgagcac accaccgact
tcaagaacca caacctcccc ctcgcccgca 300tcaagaagat catgaaggcc
gacgaggacg tccgcatgat cgccgccgag gcccccgtcg 360tgttcgccag
ggcgtgcgag atgttcatcc tcgagctcac ccaccgcggc tgggcgcacg
420ccgaggagaa caagcgccgc acgctccaga agtccgacat cgccgccgcc
atcgcccgca 480ccgaggtctt cgacttcctc gtcgacatcg tgccccgcga
cgaggccaag gacgccgagg 540ccgccgccgc cgttgccgcc gggatccccc
accccgccgc cggtttgccc gccaccgacc 600ccatggccta ctactatgtc
cagccgcagt aacattttcc taccgtata 649145638DNAArtificial
sequencesynthetic sequence 145tgaccgccgt aacaccctag gcaatggagc
ccaaatccac cacccctccc ccgccccccg 60tgatgggcgc gcccatcgcg tatcctcccc
cgcccggcgc cgcgtacccc gccgggccgt 120acgtgcacgc gccggcggcc
gcgctctacc ctcctcctcc cctgccgccg gcgcccccct 180cctcgcagca
gggcgccgcg gcggcgcacc agcagcagct attctgggcg gagcaatacc
240gcgagatcga ggccaccacc gacttcaaga accacaacct gccgctcgcc
cgcatcaaga 300agatcatgaa ggccgacgag gacgtgcgca tgatcgccgc
cgaggcgccc gtcgtcttct 360cccgcgcctg cgagatgttc atcctcgagc
tcacccaccg cggctgggca cacgccgagg 420agaacaagcg ccgcacgctg
cagaagtccg acatcgccgc cgccgtcgcg cgcaccgagg 480tcttcgactt
cctcgtcgac atcgtgccgc gggacgaggc caaggacgcc gactccgccg
540ccatgggagc agccgggatc ccgcaccccg ccgccggcct gcccgccgcc
gatcccatgg 600gctactacta cgtccagccg ccgcagtaac gaatttgc
638146778DNAArtificial sequencesynthetic sequence 146gagtggatat
ggaaccatcc cctcagccta tgggtgtcgc tgccggtggg tcacaagtgt 60atcctgcctc
tgcctatccg cctgcagcaa cagtagctcc tgcttctgtt gtatctgctg
120gtttacagtc agggcagcca ttcccagcca atcctggtca tatgagtgct
cagcaccaga 180ttgtctacca acaagctcaa caattccacc aacagctcca
gcagcaacaa caacagcagc 240ttcagcagtt ctgggttgaa cgcatgactg
aaattgaggc gacgactgat ttcaagaacc 300acaacttgcc acttgcgagg
ataaagaaga tcatgaaggc cgatgaagat gttcgcatga 360tctcagctga
agctcctgta gtctttgcaa aagcttgtga gatattcata ctggagctga
420cacttaggtc gtggatgcac actgaggaga acaagcgccg caccttgcaa
aagaatgaca 480ttgcagcagc gatcactagg actgacattt atgacttctt
ggtcgacatt gttcccaggg 540atgagatgaa ggaggacgga attgggcttc
ctagggctgg tctgccaccc atgggagccc 600cagctgatgc atatccatac
tactacatgc cacagcagca ggtgcctggt tctggaatgg 660tttatggtgc
ccagcaaggg cacccagtga cttatttgtg gcaggagcct cagcaacagc
720aggagcaagc tcctgaagag cagcaatctg catgaaagtg gctgagaata ttgctcag
778147553DNAArtificial sequencesynthetic sequence 147tgggctaacc
aaatgcaaga gatcgagcat accactgatt tcaagaacca cacccttccc 60ctagcccgca
tcaagaagat catgaaagct gatgaagatg tgaggatgat ctctgcggag
120gctcctgtga tttttgccaa ggcctgtgag atgttcattt tggagctcac
tctacgtgct 180tggatccaca ccgaggagaa caagaggagg accttgcaga
agaacgacat cgccgctgcc 240atttccagga ccgacgtgtt tgatttcctt
gtggacataa tcccgaggga cgagctgaaa 300gaagaaggtt taggcgtgac
caaagggacc ataccatcgg tggtgggttc cccgccatac 360tattacttgc
aacaacaggg gatgatgcaa cactggcccc aggagtaaca ccctgatgag
420tcttaaaact tttccccttt cgtttgtttg gttgtatcgt agtaaggtag
ctctgctctg 480ctgggaacca tttctattgt gttctgtaat gacatgttag
tatatcccca gtctatatct 540atggcaatgc agt 553148806DNAArtificial
sequencesynthetic sequence 148gggaagaatg gatcatcaag ggcatagcca
gaacccatct atgggggttg ttggtagtgg 60agctcaatta gcatatggtt ctaacccata
tcagccaggc caaataactg ggccaccggg 120gtctgttgtg acatcagttg
ggaccattca atccaccggt caacctgctg gagctcagct 180tggacagcat
caacttgctt atcagcatat tcatcagcaa caacagcacc agcttcagca
240acagctccaa caattttggt caagccagta ccaagaaatt gagaaggtta
ctgattttaa 300gaaccacagt cttcccctgg caaggatcaa gaagattatg
aaggctgacg aggatgttag 360gatgatatca gctgaagcac cagtcatttt
tgcaagggca tgtgaaatgt tcatattaga 420gttaaccctg cgctcttgga
atcacactga agagaacaaa aggcgaacac ttcagaaaaa 480tgatattgct
gctgctatca caaggactga catctttgat ttcttggttg acattgtgcc
540tcgtgaggac ttgaaagatg aagtgcttgc atcaatccca agaggaacaa
tgcctgttgc 600agggcctgct gatgcccttc catactgcta catgccgcct
cagcatccgt cccaagttgg 660agctgctggt gtcataatgg gtaagcctgt
gatggaccca aacatgtatg ctcagcagtc 720tcacccttac atggcaccac
aaatgtggcc acagccacca gaccaacgac agtcatctcc 780agaacattag
ctgatgtgtc gtggaa 806149925DNAArtificial sequencesynthetic sequence
149ccacgcgtcc gcgtcaatct ttgagtttgg tagagaaatg gatcaacaag
gacaatcatc 60agctatgaac tatggttcaa acccatatca aaccaacgcc atgaccacta
caccaaccgg 120ttcagaccat ccagcttacc atcagatcca ccagcaacaa
caacaacagc tcactcaaca 180gcttcaatct ttctgggaga ctcaattcaa
agagattgag aaaaccactg atttcaagaa 240ccatagcctt ccattggcaa
gaatcaagaa aatcatgaaa gctgatgaag atgtgcgtat 300gatctcggcc
gaggcgcctg ttgtgttcgc cagggcctgc gagatgttta ttctggagct
360tacgttaagg tcttggaacc atactgagga gaacaagaga aggacgttgc
agaagaatga 420tatcgcggct gcggtgacta gaactgatat ttttgatttt
cttgtggata ttgttcctcg 480ggaggatctt cgtgatgaag tcttgggtgg
tgttggtgct gaagctgcta cagctgcggg 540ttatccgtat ggatacttgc
ctcctggaac agctccaatt gggaacccgg gaatggttat 600gggtaacccg
ggcgcgtatc cgccgaaggc gtatatgggt cagccaatgt ggcaacaacc
660aggacctgag cagcaggatc ctgacaatta gcttggccta ataaactagc
cgtctaattc 720gaagctctcc ccggtggatc tactcaagaa gaagaatgtt
aatagaaaac tattgcgaca 780taaaaagttt ggtgtagtag aataatttct
gttttatgat ccatggattt atcaattgtt 840attcagtttg gtttatcttg
tcatcaaact gttttcggtc aatgtaacaa attcataaat 900tgagaattga
acttacaaaa ggcta 925150798DNAArtificial sequencesynthetic sequence
150agctgacatg gaaccatcct cacagcctca gcctgtgatg ggtgttgcca
ctggtgggtc 60acaagcatat cctcctcctg ctgctgcata tccacctcaa gccatggttc
ctggagctcc 120tgctgttgtt cctcctggct cacagccatc agcaccattc
cccactaatc cagctcaact 180cagtgctcag caccagctag tctaccaaca
agcccagcaa tttcatcagc agctgcagca 240acagcaacag cagcaactcc
gtgagttctg ggctaaccaa atggaagaga ttgagcaaac 300aaccgacttc
aagaaccaca gcttgccact cgcaaggata aagaagataa tgaaggctga
360tgaggatgtc cggatgatct cggcagaagc ccccgttgtc ttcgcaaagg
catgcgaggt 420attcatatta gagttaacat tgaggtcgtg gatgcacacg
gaggagaaca agcgccggac 480cttgcagaag aatgacattg cagctgccat
caccaggact gatatctatg acttcttggt 540ggacatagtt cccagggatg
aaatgaaaga agaagggctt gggcttccga gggttggcct 600accgcctaat
gtggggggcg cagcagacac atatccatat tactacgtgc cagcgcagca
660ggggcctgga tcaggaatga tgtacggtgg acagcaaggt cacccggtga
cgtatgtgtg 720gcagcagcct caagagcaac aggaagaggc ccctgaagag
cagcactctc tgccagaaag 780tagctaaaga tgatacag
7981511407DNAArtificial sequencesynthetic sequence 151atgaactatg
gcacaaaccc ataccaaacc aacccgatga gcaccactgc tgctactgta 60gcaggaggtg
cggcacaacc aggccagctg gcgttccacc agatccatca gcagcagcag
120cagcaacagc tggcacagca gcttcaagca ttttgggaga accaattcaa
agagattgag 180aagactaccg atttcaagaa ccacagcctt ccccttgcga
gaatcaagaa aatcatgaaa 240gcggatgaag atgtccgtat gatctcggct
gaggcgcctg tcgtgtttgc aagggcctgt 300gagatgttca tcctggagct
gacactcagg tcgtggaacc acactgagga gaataagagg 360cggacgttgc
agaagaacga tattgctgct gctgtgacta gaaccgatat ttttgatttc
420cttgtggata ttgttccccg ggaggatctc cgagatgaag tcttgggaag
tattccgagg 480ggcactgtcc cggaagctgc tgctgctggt tacccgtatg
gatacttgcc tgcaggaact 540gctccaatag gaaatccggg aatggttatg
ggtaatcccg gtggtgcgta tccacctaat 600ccttatatgg gtcaaccaat
gtggcaacaa caggcacctg accaacctga ccaggaaaat 660gcggccgctg
ccgctgcggc agcggccatg gtgagcaagg gcgaggagct gttcaccggg
720gtggtgccca tcctggtcga gctggacggc gacgtaaacg gccacaagtt
cagcgtgtcc 780ggcgagggcg agggcgatgc cacctacggc aagctgaccc
tgaagttcat ctgcaccacc 840ggcaagctgc ccgtgccctg gcccaccctc
gtgaccacct tcggctacgg cctgcagtgc 900ttcgcccgct accccgacca
catgaagcag cacgacttct tcaagtccgc catgcccgaa 960ggctacgtcc
aggagcgcac catcttcttc aaggacgacg gcaactacaa gacccgcgcc
1020gaggtgaagt tcgagggcga caccctggat aaccgaatcg agctgaaggg
catcgacttc 1080aaggaggacg gcaacatcct ggggcacaag ctggagtaca
actacaacag ccacaacgtc 1140tatatcatgg ccgacaagca gaagaacggc
atcaaggtga acttcaagat ccgccacaac 1200atcgaggacg gcagcgtgca
gctcgccgac cactaccagc agaacacccc catcggcgac 1260ggccccgtgg
tgctgcccga caaccactac ctgagctacc agtccgccct gagcaaagac
1320cccaacgaga agcgcgatca catggtcctg ctggagttcg tgaccgccgc
cgggatcact 1380ctcggcatgg acgagctgta caagtaa
1407152760DNAArtificial sequencesynthetic sequence 152agaggacatg
gagccatcat cacaacctca gccggcaatt ggtgttgttg ctggtggatc 60acaagtgtac
cctgcatacc ggcctgcagc aacagtgcct acagctcctg ctgtcattcc
120tgccggttca cagccagcac cgtcgttccc tgccaaccct gatcaactga
gtgctcagca 180ccagctcgtc tatcagcaag cccagcaatt tcaccagcag
cttcagcagc agcaacagcg 240tcaactccag cagttttggg ctgaacgtct
ggtcgatatt gaacaaacta ctgacttcaa 300gaaccacagc ttgccacttg
ctaggataaa gaagatcatg aaggcagatg aggacgttcg 360catgatctcc
gcagaggctc ctgtgatctt tgcgaaagca tgtgagatat tcatactgga
420gctgaccctg agatcatgga tgcacacgga ggagaacaag cgccgtacct
tgcagaagaa 480tgacatagca gctgccatca ccaggacgga tatgtacgat
ttcttggtag atatagttcc 540cagggatgac ttgaaggagg agggagttgg
gctccctagg gctggattgc cgcccttggg 600tgtccctgct gactcatatc
cgtatggcta ctatgtgcca cagcagcagg tcccaggtgc 660aggaatagcg
tatggtggtc agcaaggtca tccggggtat ctgtggcagg atcctcagga
720acagcaggaa gagcctcctg cagagcagca aagtgattaa
760153847DNAArtificial sequencesynthetic sequence 153agtaagtcat
catggataaa tcagagcaga ctcaacagca gcagcagcaa caacagcatg 60tgatgggagt
tgccgcaggg gctagccaaa tggcctattc ttctcactac ccgactgctt
120ccatggtggc ttctggcacg cccgctgtaa ctgctccttc cccaactcag
gctccagctg 180ccttctctag ttctgctcac cagcttgcat accagcaagc
acagcatttc caccaccaac 240agcagcaaca ccaacaacag cagcttcaaa
tgttctggtc aaaccaaatg caagaaattg 300agcaaacaat tgactttaaa
aaccatagcc ttcctcttgc tcggataaaa aagataatga 360aagctgatga
agatgtccgg atgatttcag cagaagctcc ggtcatattt gcaaaagctt
420gtgaaatgtt catattagag ttgacgttgc gatcttggat ccacacagaa
gagaacaaga 480ggagaactct acaaaagaat gatatagcag ctgctatttc
gagaaacgat gtttttgatt 540tcttggttga tattattcca agagatgagt
tgaaagagga aggacttgga ataaccaagg 600ctactattcc gttagtgggt
tctccagctg atatgccata ttactatgtc cctccacagc 660atcctgttgt
aggaccacct gggatgatca tgggcaagcc cattggcgct gagcaagcaa
720cactatattc tacacagcag cctcgacctc ctgtggcgtt catgccatgg
cctcatacac 780aacccctgca acagcagcca ccccaacatc aacaaacaga
ctcatgatga ctatgcaatt 840caattag 847154786DNAArtificial
sequencesynthetic sequence 154aacatcaggg gatgggcgtt gccacaggtg
ctagccaaat ggcctattct tctcactacc 60cgactgctcc catggtggct tctggcacgc
ctgctgtagc tgttccttcc ccaactcagg 120ctccagctgc cttctctagt
tctgctcacc agcttgcata ccagcaagca cagcatttcc 180accaccaaca
gcagcaacac caacaacagc agcttcaaat gttctggtca aaccaaatgc
240aagaaattga gcaaacaatt gactttaaaa accacagtct tcctcttgct
cggataaaaa 300agataatgaa agctgatgaa gatgtccgga tgatttctgc
agaagctcca gtcatatttg 360caaaagcatg tgaaatgttc atattagagt
tgacgttgag atcttggatc cacacagaag 420agaacaagag gagaactcta
caaaagaatg atatagcagc tgctatttcg agaaacgatg 480tttttgattt
cttggttgat attatcccaa gagatgagtt gaaagaggaa ggacttggaa
540taaccaaggc tactattcca ttggtgaatt ctccagctga tatgccatat
tactatgtcc 600ctccacagca tcctgttgta ggacctcctg ggatgatcat
gggcaagccc gttggtgctg 660agcaagcaac gctgtattct acacagcagc
ctcgacctcc catggcgttc atgccatggc 720cccatacaca accccagcaa
cagcagccac cccaacatca acaaacagac tcatgatgac 780catgca
786155748DNAArtificial sequencesynthetic sequence 155ccgaccaatg
gataccaaca accagcaacc acctccctcc gccgccggaa tccctcctcc 60accacctgga
accaccatct ccgccgcagg aggaggagct tcttaccacc accttctcca
120acaacaacaa caacagctcc aactattctg gacctaccaa cgccaagaga
tcgaacaagt 180taacgatttc aaaaaccatc agcttccact agctaggata
aaaaagatca tgaaagccga 240tgaagatgtt cgtatgatct ccgcagaagc
accgattctc ttcgcgaaag cttgtgagct 300tttcattctc gagctcacga
tcagatcttg gcttcacgct gaggagaata aacgtcgtac 360gcttcagaaa
aacgatatcg ctgctgcgat tactaggact gatatcttcg atttccttgt
420tgatattgtt cctagagatg agattaagga cgaagccgca gtcctcggtg
gtggaatggt 480ggtggctcct accgcgagcg gcgtgcctta ctattatccg
ccgatgggac aaccagctgg 540tcctggaggg
atgatgattg ggagaccagc tatggatccg aatggtgttt atgtccagcc
600tccgtctcag gcgtggcaga gtgtttggca gacttcgacg gggacgggag
atgatgtctc 660ttatggtagt ggtggaagtt ccggtcaagg gaatctcgac
ggccaaggtt aagctcagag 720tattccagat gatgcttgac ctgcttga
748156750DNAArtificial sequencesynthetic sequence 156attgggggaa
tggagaccaa caaccagcaa caacaacaac aaggagctca agcccaatcg 60ggaccctacc
ccgtcgccgg cgccggcggc agtgcaggtg caggtgcagg cgctcctccc
120cctttccagc accttctcca gcagcagcag cagcagctcc agatgttctg
gtcttaccag 180cgtcaagaaa tcgagcacgt gaacgacttt aagaatcacc
agctccctct tgcccgcatc 240aagaagatca tgaaggccga cgaggatgtc
cgcatgatct ccgccgaggc ccccatcctc 300ttcgccaagg cctgcgagct
cttcatcctc gagctcacca tccgctcctg gctccacgcc 360gaggagaaca
agcgccgcac cctccagaag aacgacatcg ccgccgccat cacccgcacc
420gacattttcg acttcctcgt tgatattgtc ccccgcgacg agatcaagga
cgacgctgct 480cttgtggggg ccaccgccag tggggtgcct tactactacc
cgcccattgg acagcctgcc 540gggatgatga ttggccgccc cgccgtcgat
cccgccaccg gggtttatgt ccagccgccc 600tcccaggcat ggcagtccgt
ctggcagtcc gctgccgagg acgcttccta tggcaccggc 660ggggccggtg
cccagcggag ccttgatggc cagagttgag tgacatcgat gccgatgatg
720gacagtcagg agttatgaag attctgaact 750157768DNAArtificial
sequencesynthetic sequence 157aatccatgga caaccagccg ctgccctact
ccacaggcca gccccctgcc cccggaggag 60ccccggtggc gggcatgcct ggcgcggccg
gcctcccacc cgtgccgcac caccacctgc 120tccagcagca gcaggcccag
ctgcaggcgt tctgggcgta ccagcgccag gaggcggagc 180gcgcgtccgc
gtcggacttc aagaaccacc agctgcctct ggcccggatc aagaagatca
240tgaaggccga cgaggacgtg cgcatgatct ccgccgaggc gcccgtgctg
ttcgccaagg 300cctgcgagct cttcatcctc gagctcacta tccgctcctg
gctccacgcc gaggagaaca 360agcgccgcac cctgcagcgc aacgacgtcg
ccgcggccat cgcgcgcacc gacgtcttcg 420atttcctcgt cgacatcgtg
ccccgcgagg aggccaagga ggagcccggc agcgccctcg 480gcttcgcggc
gcctgggacc ggcgtcgtcg gggctggcgc cccgggcggg gcgccagccg
540ccgggatgcc ctactactat ccgccgatgg ggcagccggc gccgatgatg
ccggcctggc 600atgttccggc ctgggacccg gcctggcagc aaggggcagc
ggatgtcgat cagagcggca 660gcttcagcga ggaaggacaa gggtttggag
caggccatgg cggcgccgct agcttccctc 720ctgcgcctcc gacctccgag
tgatcgatcg gcgcgtctct tggtcctg 768158800DNAArtificial
sequencesynthetic sequence 158gatcttttga tccaatcaca aggcaaagat
ccaatggaca ataacaacaa caacaacaac 60cagcaaccac caccaacctc cgtctatcca
cctggctccg ccgtcacaac cgtaatccct 120cctccaccat ctggatctgc
atcaatagtc accggaggag gagcgacata ccaccacctc 180ctccagcaac
aacagcaaca gcttcaaatg ttctggacat accagagaca agagatcgaa
240caggtaaacg atttcaaaaa ccatcagctc cctctagctc gtatcaaaaa
aatcatgaaa 300gctgatgaag atgtgcgtat gatctccgcc gaagcaccga
ttctcttcgc gaaagcttgt 360gagcttttca ttctcgaact tacgattaga
tcttggcttc acgctgaaga gaacaaacgt 420cgtacgcttc agaaaaacga
tatcgctgct gcgattacta gaaccgatat cttcgatttc 480cttgttgata
ttgttcctag ggaagagatc aaggaagagg aagatgcagc atcggctctt
540ggtggaggag gtatggttgc tcccgccgcg agcggtgttc cttattatta
tccaccgatg 600ggacaaccgg cggttcctgg agggatgatg attggaagac
cggcgatgga tcctagcggt 660gtttatgctc agcctccttc tcaggcatgg
caaagcgttt ggcagaattc agctggtggt 720ggtgatgatg tgtcttatgg
aagtggagga agtagcggcc atggtaatct cgatagccaa 780gggtaagtga
attctagtag 800159762DNAArtificial sequencesynthetic sequence
159ggtgacaatg gacaaccagc agctacccta cgccggtcag ccggcggccg
caggcgccgg 60agccccggtg ccgggcgtgc ctggcgcggg cgggccgccg gcggtgccgc
accaccacct 120gctccagcag cagcaggcgc agctgcaggc gttctgggcg
taccagcggc aggaggcgga 180gcgcgcgtcg gcgtcggact tcaagaacca
ccagctgccg ctggcgcgga tcaagaagat 240catgaaggcg gacgaggacg
tgcgcatgat ctcggcggag gcgcccgtgc tgttcgccaa 300ggcgtgcgag
ctcttcatcc tggagctcac catccgctcg tggctgcacg ccgaggagaa
360caagcgccgc accctgcagc gcaacgacgt cgccgccgcc atcgcgcgca
ccgacgtgtt 420cgacttcctc gtcgacatcg tgccgcggga ggaggccaag
gaggagcccg gcagcgcgct 480cgggttcgcg gcgggagggc ccgccggcgc
cgttggagcg gccggccccg ccgcggggct 540gccgtactac tacccgccga
tggggcagcc ggcgccgatg atgccggcgt ggcatgttcc 600ggcgtgggac
ccggcgtggc agcaaggagc agcgccggat gtggaccagg gcgccgccgg
660cagcttcagc gaggaagggc agcaaggttt tgcaggccat ggcggtgcgg
cagctagctt 720ccctcctgca cctccaagct ccgaatagtg atgatccata tg
762160734DNAArtificial sequencesynthetic sequence 160tctcacttcc
aacatccaaa tccctagaaa ttgtaaatgg ctgagaacaa caacaacaac 60ggcgacaaca
tgaacaacga caaccaccag caaccaccgt cgtactcgca gctgccgccg
120atggcatcat ccaaccctca gttacgtaat tactggattg agcagatgga
aaccgtctcg 180gatttcaaaa accgtcagct tccattggct cgaattaaga
agatcatgaa ggctgatcca 240gatgtgcaca tggtctccgc agaggctccg
atcatcttcg caaaggcttg cgaaatgttc 300atcgttgatc tcacgatgcg
gtcgtggctc aaagccgagg agaacaaacg ccacacgctt 360cagaaatcgg
atatctccaa cgcagtggct agctctttca cctacgattt ccttcttgat
420gttgtcccta aggacgagtc tatcgccacc gctgatcctg gctttgtggc
tatgccacat 480cctgacggtg gaggagtacc gcaatattat tatccaccgg
gagtggtgat gggaactcct 540atggttggta gtggaatgta cgcgccatcg
caggcgtggc cagcagcggc tggtgacggg 600gaggatgatg ctgaggataa
tggaggaaac ggcggcggaa attgaagtgt agatttaggg 660tttgtaaccg
cctatgtggg aaatttgaaa tttggtggtg tttattaggg ttcttcaatt
720cgtcggattt gctt 734161668DNAArtificial sequencesynthetic
sequence 161ataacaagcc tagaacacta gaaacttcaa aaaagaaaaa aatcttatgg
agaacaacaa 60cggcaacaac cagctgccac cgaaaggtaa cgagcaactg aagagtttct
ggtcaaaaga 120gatggaaggt aacttagatt tcaaaaatca cgaccttcct
ataactcgta tcaagaagat 180tatgaagtat gatccggatg tgactatgat
agctagtgag gctccaatcc tcctctcgaa 240agcatgtgag atgtttatca
tggatctcac gatgcgttcg tggctccatg ctcaggaaag 300caaacgagtc
acgctacaga aatctaatgt cgatgccgca gtggctcaaa ctgttatctt
360tgatttcttg cttgatgatg acattgaggt aaagagagag tctgttgccg
ccgctgctga 420tcctgtggcc atgccaccta ttgacgatgg agagctgcct
ccaggaatgg taattggaac 480tcctgtttgt tgtagtcttg gaatccacca
accacaacca caaatgcagg catggcctgg 540agcttggacc tcggtgtctg
gtgaggagga agaagcgcgt gggaaaaaag gaggtgacga 600cggaaactaa
taagtggaat acgttttagg gtattttcaa gggaatatgt agtaaatagt 660catggatc
668162800DNAArtificial sequencesynthetic sequence 162aatagttggg
ctgatttcgt agcccactta atcagccttt aaatatggaa accctagcct 60agaaagtgaa
caagaaaaac gtaaagatca aaatggaaga gaacaacggc aacaacaacc
120actacctgcc gcaaccatcg tcttcccaac tgccgccgcc accattgtat
tatcaatcaa 180tgccgttgcc gtcatattca ctgccgctgc cgtactcacc
gcagatgcgg aattattgga 240ttgcgcagat gggaaacgca actgatgtta
agcatcatgc gtttccacta accaggataa 300agaaaatcat gaagtccaac
ccggaagtga acatggtcac tgcagaggct ccggtcctta 360tatcgaaggc
ctgtgagatg ctcattcttg atctcacaat gcgatcgtgg cttcataccg
420tggagggcgg tcgccaaact ctcaagagat ccgatacgct cacgagatcc
gatatctccg 480ccgcaacgac tcgtagtttc aaatttacct tccttggcga
cgttgtccca agagaccctt 540ccgtcgttac cgatgatccc gtgctacatc
cggacggtga agtacttcct ccgggaacgg 600tgataggata tccggtgttt
gattgtaatg gtgtgtacgc gtcaccgcca cagatgcagg 660agtggccggc
ggtgcctggt gacggagagg aggcagctgg ggaaattgga ggaagcagcg
720gcggtaattg aaaagtgttg attgggtttt agggttgtaa tgcttttgtg
agaatttgta 780tctctatgga gtcatgtttg 800163736DNAArtificial
sequencesynthetic sequence 163taggttgaga ttcatatatg taaagagatc
acttcttaat cttatcctac catatcttat 60atacgcttaa ttttccttta tatatgcaaa
cctccacata aaaatatctc aaacccaaac 120acttcaaaca aaaaaaaaat
ggagaacaac aacaacaacc accaacagcc accgaaagat 180aacgagcaac
taaagagttt ctggtcaaag gggatggaag gtgacttgaa tgtcaagaat
240cacgagttcc ccatctctcg tatcaagagg ataatgaagt ttgatccgga
tgtgagtatg 300atcgctgctg aggctccaaa tctcttatct aaggcttgtg
aaatgtttgt catggacctc 360acgatgcgtt catggctcca tgctcaagag
agcaaccgac tcacgatacg gaaatctgat 420gttgatgccg tagtgtctca
aaccgtcatc tttgatttct tgcgtgatga tgtccctaag 480gacgagggag
agcccgttgt cgccgctgct gatcctgtgg acgatgttgc tgatcatgtg
540gctgtgccag atcttaacaa tgaagaactg ccgccgggaa cggtgatagg
aactccggtt 600tgttacggtt taggaataca cgcgccacac ccgcagatgc
ctggagcttg gaccgaggag 660gatgcgactg gggcaaatgg aggaaacggt
gggaattaat atttggattg ggttttgtaa 720ccgctgttgt gagaac
736164779DNAArtificial sequencesynthetic sequence 164acccttgttc
ttgccaaact ctacatcttc aaagattttt catcatctgt tgtgaaatat 60aacatgagga
ggccaaagtc atctcacgtc aggatggaac ctgttgcgcc tcgttcacat
120aacacgatgc caatgcttga tcaatttcga tctaatcatc ctgaaacaag
caagatcgag 180ggggtctctt cgttggacac agctctgaag gtgttttgga
ataatcaaag ggagcagcta 240ggaaactttg caggccaaac tcatttgccg
ctatctaggg tcagaaagat tttgaaatct 300gatcctgaag tcaagaagat
aagctgtgat gttcctgctt tgttttcgaa agcctgtgaa 360tacttcattc
tagaggtaac attacgagct tggatgcata ctcaatcatg cactcgtgag
420accatccggc gttgtgatat cttccaggcc gtaaagaact caggaactta
tgatttcctg 480attgatcgtg tcccttttgg accgcactgt gtcacccatc
agggtgtgca acctcctgct 540gaaatgattt tgccggatat gaatgttcca
atcgatatgg accagattga ggaggagaat 600atgatggaag agcgctctgt
cgggtttgac ctcaactgtg atctccagtg aacatgaagc 660tgctctggaa
gacaaaaact tgaagaagag aagaaatctg aagaggaatc acccaacaac
720tctatgttat gttcacctta taatagttta tcataaactc attcactaaa ctatgtgta
779165715DNAArtificial sequencesynthetic sequence 165cattgtgtgg
aaagatataa cgtgtttgat tttctgaggg aagttgtgag taaggtgcct 60gactatggcc
attcccaagg gcaaggacat ggtgatgtta ccatggatga tcgcagcatc
120tccaagagaa ggaagcccat cagcgatgaa gtgaatgaca gtgacgagga
atataagaaa 180agcaaaacgc aagagatagg gagtgctaag accagtggca
ggggtggtag aggaagaggg 240cgaggaagag gtcgtggtgg acgagctgca
aaagcagccg aaagagaggg tctcaaccgc 300gagatggaag tagaagccgc
caattctgga cagccaccac cagaagacaa tgtcaagatg 360catgcgtcag
agtcatcacc acaagaggat gagaagaaag gcatcgacgg cacagcagca
420tcgaacgaag acaccaagca acaccttcaa agtcccaaag aaggcattga
ctttgatctc 480aacgctgaat ccctcgacct aaacgagacc aaactggcac
cagccacagg cacaaccaca 540accacaactg cagcaacaga ctctgaggag
tattcgggct ggcctatgat ggacataagc 600aaaatggatc cagcacagct
tgctagtctg ggtaagagga tagacgagga tgaggaagat 660tatgacgaag
aaggctaagt cataacagca ctataggata tatagagtag cagcg
715166738DNAArtificial sequencesynthetic sequence 166tcgaccgttc
ttctcaatct caccaatcgg tttaagctga aaacccgaat tagcaaaatc 60ttcgttcggg
ctgttttggt taatccggtt tacatgtttt ctcattgctc attttcattt
120tcccgccgtg acagagcgcg taaatctcaa aaccctaaaa atgtcgaaca
tatacaattc 180attaccttaa tcagattttc tcaacagaat caaaatcaaa
atccatggag gaagaagaag 240gatcaatccg accagagttt ccaatcggaa
gagtaaagaa gataatgaaa ctggacaaag 300acatcaacaa aatcaactca
gaagctcttc acgtcatcac ttactccacc gaactcttcc 360tccacttcct
cgccgagaaa tctgctgttg ttacggcgga gaagaagcgt aagactgtta
420atctcgatca tttaagaatc gccgtgaaaa gacaccaacc tactagtgat
ttcctcttag 480actcgcttcc gttgccggct cagcctgtca aacataccaa
atcggtttcc gacaagaaga 540ttccggcgcc gccaattggg actcgtcgta
tcgatgattt cttcagtaaa gggaaagcaa 600agactgattc agcctaaagt
aaaatttctc attttgttca caattgcaaa ttttactctg 660ttctcaaatc
aaaatcttgt tttgctaaaa gtgtagtgag aatgtatgga tcatgaggaa
720cttttatagg aagcggcc 7381671461DNAArtificial sequencesynthetic
sequence 167ggcgaaaggg tgaaacatac tgtagttcta aaaaattaat ggtgtcgtca
aagaaaccca 60aggagaagaa ggcgaggagc gatgtcgtcg tcaataaagc gagtggtcgg
agtaaacgca 120gctccggttc cagaacgaag aagacgtcga acaaggttaa
cattgtgaag aagaagccgg 180agatttacga gatctcagaa tcatcgagca
gtgactctgt ggaagaagca ataagaggcg 240atgaggcgaa gaaaagtaac
ggcgtcgtga gcaagagggg taacggaaag agtgtaggaa 300ttccgacgaa
gacgagtaaa aatcgagaag aggacgatgg aggcgcggaa gatgctaaga
360tcaagtttcc gatgaatcgg attcggcgga tcatgagaag cgataattct
gctcctcaga 420ttatgcagga tgctgtattt cttgtcaaca aagccacggt
atagtactaa ttagacatat 480aaatttagct tgaggaactt aatttcacat
ggtttttgat gaatttggga gtatcaattg 540ctagtcagtg ttaattgggc
gttataatct cgcaaattgc tacagtaatg agattgtttc 600tgttaattaa
gccgggaatt aacgttattg cttcatctcc atgtgtgttt gatgaaaccg
660tagttactgt gcttgattgt cataggttta gcttttacat ccagaacttg
tagacaccta 720acacatgaga aagtccttat atatgattta ggttcatatt
ttcaaagctt aagtgatgag 780tgtttaattt tcctgtattg gaacttggca
ttgttttttc tttcttttga tttcatcttg 840tgttcatcga aattgatgtt
cattgcgttt acagtacatt aactggtttt tgtgattgaa 900ggagatgttc
attgagcggt tttctgaaga agcttatgat agttccgtca aggacaaaaa
960gaaattcatc cactacaaac acctctgtaa gctctaatct cgtccctatt
tgccaatatc 1020tggttacctc aatactgaat cccatgtcga aaatctcatg
ctgctgcacc aattgctgaa 1080cttaggattt ctggcttctg cacaagggcc
agtagttagt cttaggaaga gtttagataa 1140tgagttaatc cgttcgtatc
acttgcaaga tttgttgcac tcctaacata attgatgaca 1200ctgtcatttc
tactcgttgg cagcatccgt agtgagtaac gaccagagat acgagttcct
1260tgcaggtact taataacctc tgaagtattc gttttatgag ttccatgtgg
tttacgaaca 1320gcattttaat acctgtaata tcttgaatgc agatagtgtt
cccgagaaac ttaaagcaga 1380ggccgcgttg gaggaatggg aaagaggcat
gacagatgca ggctgaaata aatccggttg 1440gaatcgaact gaaccatttg g
1461168747DNAArtificial sequencesynthetic sequence 168gcggccgctg
ccgctgcggc agcggccatg gtgagcaagg gcgaggagct gttcaccggg 60gtggtgccca
tcctggtcga gctggacggc gacgtaaacg gccacaagtt cagcgtgtcc
120ggcgagggcg agggcgatgc cacctacggc aagctgaccc tgaagttcat
ctgcaccacc 180ggcaagctgc ccgtgccctg gcccaccctc gtgaccacct
tcggctacgg cctgcagtgc 240ttcgcccgct accccgacca catgaagcag
cacgacttct tcaagtccgc catgcccgaa 300ggctacgtcc aggagcgcac
catcttcttc aaggacgacg gcaactacaa gacccgcgcc 360gaggtgaagt
tcgagggcga caccctggtg aaccgcatcg agctgaaggg catcgacttc
420aaggaggacg gcaacatcct ggggcacaag ctggagtaca actacaacag
ccacaacgtc 480tatatcatgg ccgacaagca gaagaacggc atcaaggtga
acttcaagat ccgccacaac 540atcgaggacg gcagcgtgca gctcgccgac
cactaccagc agaacacccc catcggcgac 600ggccccgtgc tgctgcccga
caaccactac ctgagctacc agtccgccct gagcaaagac 660cccaacgaga
agcgcgatca catggtcctg ctggagttcg tgaccgccgc cgggatcact
720ctcggcatgg acgagctgta caagtaa 7471694856DNAArtificial
sequencesynthetic sequence 169aagcttnnnn ctgcagnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nntcggattc 60cattgcccag ctatctgtca ctttattgtg
aagatagtga aaaagaaggt ggctcctaca 120aatgccatca ttgcgataaa
ggaaaggcca tcgttgaaga tgcctctgcc gacagtggtc 180ccaaagatgg
acccccaccc acgaggagca tcgtggaaaa agaagacgtt ccaaccacgt
240cttcaaagca agtggattga tgtgatggtc cgattgagac ttttcaacaa
agggtaatat 300ccggaaacct cctcggattc cattgcccag ctatctgtca
ctttattgtg aagatagtgg 360aaaaggaagg tggctcctac aaatgccatc
attgcgataa aggaaaggcc atcgttgaag 420atgcctctgc cgacagtggt
cccaaagatg gacccccacc cacgaggagc atcgtggaaa 480aagaagacgt
tccaaccacg tcttcaaagc aagtggattg atgtgatatc tccactgacg
540taagggatga cgcacaatcc cactatcctt cgcaagaccc ttcctctata
taaggaagtt 600catttcattt ggagaggaca cgctgacaag ctgactctag
cagatctggt accgtcgaat 660cacaagtttg tacaaaaaag ctgaacgaga
aacgtaaaat gatataaata tcaatatatt 720aaattagatt ttgcataaaa
aacagactac ataatactgt aaaacacaac atatccagtc 780actatggcgg
ccgcattagg caccccaggc tttacacttt atgcttccgg ctcgtataat
840gtgtggattt tgagttagga tccgtcgaga ttttcaggag ctaaggaagc
taaaatggag 900aaaaaaatca ctggatatac caccgttgat atatcccaat
ggcatcgtaa agaacatttt 960gaggcatttc agtcagttgc tcaatgtacc
tataaccaga ccgttcagct ggatattacg 1020gcctttttaa agaccgtaaa
gaaaaataag cacaagtttt atccggcctt tattcacatt 1080cttgcccgcc
tgatgaatgc tcatccggaa ttccgtatgg caatgaaaga cggtgagctg
1140gtgatatggg atagtgttca cccttgttac accgttttcc atgagcaaac
tgaaacgttt 1200tcatcgctct ggagtgaata ccacgacgat ttccggcagt
ttctacacat atattcgcaa 1260gatgtggcgt gttacggtga aaacctggcc
tatttcccta aagggtttat tgagaatatg 1320tttttcgtct cagccaatcc
ctgggtgagt ttcaccagtt ttgatttaaa cgtggccaat 1380atggacaact
tcttcgcccc cgttttcacc atgggcaaat attatacgca aggcgacaag
1440gtgctgatgc cgctggcgat tcaggttcat catgccgtct gtgatggctt
ccatgtcggc 1500agaatgctta atgaattaca acagtactgc gatgagtggc
agggcggggc gtaaagatct 1560ggatccggct tactaaaagc cagataacag
tatgcgtatt tgcgcgctga tttttgcggt 1620ataagaatat atactgatat
gtatacccga agtatgtcaa aaagaggtat gctatgaagc 1680agcgtattac
agtgacagtt gacagcgaca gctatcagtt gctcaaggca tatatgatgt
1740caatatctcc ggtctggtaa gcacaaccat gcagaatgaa gcccgtcgtc
tgcgtgccga 1800acgctggaaa gcggaaaatc aggaagggat ggctgaggtc
gcccggttta ttgaaatgaa 1860cggctctttt gctgacgaga acaggggctg
gtgaaatgca gtttaaggtt tacacctata 1920aaagagagag ccgttatcgt
ctgtttgtgg atgtacagag tgatattatt gacacgcccg 1980ggcgacggat
ggtgatcccc ctggccagtg cacgtctgct gtcagataaa gtctcccgtg
2040aactttaccc ggtggtgcat atcggggatg aaagctggcg catgatgacc
accgatatgg 2100ccagtgtgcc ggtctccgtt atcggggaag aagtggctga
tctcagccac cgcgaaaatg 2160acatcaaaaa cgccattaac ctgatgttct
ggggaatata aatgtcaggc tcccttatac 2220acagccagtc tgcaggtcga
ccatagtgac tggatatgtt gtgttttaca gtattatgta 2280gtctgttttt
tatgcaaaat ctaatttaat atattgatat ttatatcatt ttacgtttct
2340cgttcagctt tcttgtacaa agtggtgatg gccgctctag acaggcctcg
taccggatcc 2400tctagctaga gctttcgttc gtatcatcgg tttcgacaac
gttcgtcaag ttcaatgcat 2460cagtttcatt gcgcacacac cagaatccta
ctgagtttga gtattatggc attgggaaaa 2520ctgtttttct tgtaccattt
gttgtgcttg taatttactg tgttttttat tcggttttcg 2580ctatcgaact
gtgaaatgga aatggatgga gaagagttaa tgaatgatat ggtccttttg
2640ttcattctca aattaatatt atttgttttt tctcttattt gttgtgtgtt
gaatttgaaa 2700ttataagaga tatgcaaaca ttttgttttg agtaaaaatg
tgtcaaatcg tggcctctaa 2760tgaccgaagt taatatgagg agtaaaacac
ttgtagttgt accattatgc ttattcacta 2820ggcaacaaat atattttcag
acctagaaaa gctgcaaatg ttactgaata caagtatgtc 2880ctcttgtgtt
ttagacattt atgaactttc ctttatgtaa ttttccagaa tccttgtcag
2940attctaatca ttgctttata attatagtta tactcatgga tttgtagttg
agtatgaaaa 3000tattttttaa tgcattttat gacttgccaa ttgattgaca
acatgcatca atcgacctgc 3060agccactcga agcggccggc cgccactcga
gatcatgagc ggagaattaa gggagtcacg 3120ttatgacccc
cgccgatgac gcgggacaag ccgttttacg tttggaactg acagaaccgc
3180aacgttgaag gagccactca gccgcgggtt tctggagttt aatgagctaa
gcacatacgt 3240cagaaaccat tattgcgcgt tcaaaagtcg cctaaggtca
ctatcagcta gcaaatattt 3300cttgtcaaaa atgctccact gacgttccat
aaattcccct cggtatccaa ttagagtctc 3360atattcactc tcaatccaaa
taatctgcac cggatctgga tcgtttcgca tgattgaaca 3420agatggattg
cacgcaggtt ctccggccgc ttgggtggag aggctattcg gctatgactg
3480ggcacaacag acaatcggct gctctgatgc cgccgtgttc cggctgtcag
cgcaggggcg 3540cccggttctt tttgtcaaga ccgacctgtc cggtgccctg
aatgaactgc aggacgaggc 3600agcgcggcta tcgtggctgg ccacgacggg
cgttccttgc gcagctgtgc tcgacgttgt 3660cactgaagcg ggaagggact
ggctgctatt gggcgaagtg ccggggcagg atctcctgtc 3720atctcacctt
gctcctgccg agaaagtatc catcatggct gatgcaatgc ggcggctgca
3780tacgcttgat ccggctacct gcccattcga ccaccaagcg aaacatcgca
tcgagcgagc 3840acgtactcgg atggaagccg gtcttgtcga tcaggatgat
ctggacgaag agcatcaggg 3900gctcgcgcca gccgaactgt tcgccaggct
caaggcgcgc atgcccgacg gcgaggatct 3960cgtcgtgacc catggcgatg
cctgcttgcc gaatatcatg gtggaaaatg gccgcttttc 4020tggattcatc
gactgtggcc ggctgggtgt ggcggaccgc tatcaggaca tagcgttggc
4080tacccgtgat attgctgaag agcttggcgg cgaatgggct gaccgcttcc
tcgtgcttta 4140cggtatcgcc gctcccgatt cgcagcgcat cgccttctat
cgccttcttg acgagttctt 4200ctgagcggga ctctggggtt cgaaatgacc
gaccaagcga cgcccaacct gccatcacga 4260gatttcgatt ccaccgccgc
cttctatgaa aggttgggct tcggaatcgt tttccgggac 4320gccggctgga
tgatcctcca gcgcggggat ctcatgctgg agttcttcgc ccacgggatc
4380tctgcggaac aggcggtcga aggtgccgat atcattacga cagcaacggc
cgacaagcac 4440aacgccacga tcctgagcga caatatgatc gggcccggcg
tccacatcaa cggcgtcggc 4500ggcgactgcc caggcaagac cgagatgcac
cgcgatatct tgctgcgttc ggatattttc 4560gtggagttcc cgccacagac
ccggatgatc cccgatcgtt caaacatttg gcaataaagt 4620ttcttaagat
tgaatcctgt tgccggtctt gcgatgatta tcatataatt tctgttgaat
4680tacgttaagc atgtaataat taacatgtaa tgcatgacgt tatttatgag
atgggttttt 4740atgattagag tcccgcaatt atacatttaa tacgcgatag
aaaacaaaat atagcgcgca 4800aactaggata aattatcgcg cgcggtgtca
tctatgttac tagatcgggc tcgaga 48561701394DNAArtificial
sequencesynthetic sequence 170aagcttnnnn ctgcagnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nntcggattc 60cattgcccag ctatctgtca ctttattgtg
aagatagtga aaaagaaggt ggctcctaca 120aatgccatca ttgcgataaa
ggaaaggcca tcgttgaaga tgcctctgcc gacagtggtc 180ccaaagatgg
acccccaccc acgaggagca tcgtggaaaa agaagacgtt ccaaccacgt
240cttcaaagca agtggattga tgtgatggtc cgattgagac ttttcaacaa
agggtaatat 300ccggaaacct cctcggattc cattgcccag ctatctgtca
ctttattgtg aagatagtgg 360aaaaggaagg tggctcctac aaatgccatc
attgcgataa aggaaaggcc atcgttgaag 420atgcctctgc cgacagtggt
cccaaagatg gacccccacc cacgaggagc atcgtggaaa 480aagaagacgt
tccaaccacg tcttcaaagc aagtggattg atgtgatatc tccactgacg
540taagggatga cgcacaatcc cactatcctt cgcaagaccc ttcctctata
taaggaagtt 600catttcattt ggagaggaca cgctgacaag ctgactctag
cagatctggt accgtcgacg 660gtgagctccg cggccgctct agacaggcct
cgtaccggat cctctagcta gagctttcgt 720tcgtatcatc ggtttcgaca
acgttcgtca agttcaatgc atcagtttca ttgcgcacac 780accagaatcc
tactgagttt gagtattatg gcattgggaa aactgttttt cttgtaccat
840ttgttgtgct tgtaatttac tgtgtttttt attcggtttt cgctatcgaa
ctgtgaaatg 900gaaatggatg gagaagagtt aatgaatgat atggtccttt
tgttcattct caaattaata 960ttatttgttt tttctcttat ttgttgtgtg
ttgaatttga aattataaga gatatgcaaa 1020cattttgttt tgagtaaaaa
tgtgtcaaat cgtggcctct aatgaccgaa gttaatatga 1080ggagtaaaac
acttgtagtt gtaccattat gcttattcac taggcaacaa atatattttc
1140agacctagaa aagctgcaaa tgttactgaa tacaagtatg tcctcttgtg
ttttagacat 1200ttatgaactt tcctttatgt aattttccag aatccttgtc
agattctaat cattgcttta 1260taattatagt tatactcatg gatttgtagt
tgagtatgaa aatatttttt aatgcatttt 1320atgacttgcc aattgattga
caacatgcat caatcgacct gcagccactc gaagcggccg 1380gccgccactc gaga
13941713158DNAArtificial sequencesynthetic sequence 171aagcttnnnn
ctgcagnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nntcggattc 60cattgcccag
ctatctgtca ctttattgtg aagatagtga aaaagaaggt ggctcctaca
120aatgccatca ttgcgataaa ggaaaggcca tcgttgaaga tgcctctgcc
gacagtggtc 180ccaaagatgg acccccaccc acgaggagca tcgtggaaaa
agaagacgtt ccaaccacgt 240cttcaaagca agtggattga tgtgatggtc
cgattgagac ttttcaacaa agggtaatat 300ccggaaacct cctcggattc
cattgcccag ctatctgtca ctttattgtg aagatagtgg 360aaaaggaagg
tggctcctac aaatgccatc attgcgataa aggaaaggcc atcgttgaag
420atgcctctgc cgacagtggt cccaaagatg gacccccacc cacgaggagc
atcgtggaaa 480aagaagacgt tccaaccacg tcttcaaagc aagtggattg
atgtgatatc tccactgacg 540taagggatga cgcacaatcc cactatcctt
cgcaagaccc ttcctctata taaggaagtt 600catttcattt ggagaggaca
cgctgacaag ctgactctag cagatctggt accgtcgacg 660gtgagctccg
cggccgctct agacaggcct cgtaccggat cctctagcta gagctttcgt
720tcgtatcatc ggtttcgaca acgttcgtca agttcaatgc atcagtttca
ttgcgcacac 780accagaatcc tactgagttt gagtattatg gcattgggaa
aactgttttt cttgtaccat 840ttgttgtgct tgtaatttac tgtgtttttt
attcggtttt cgctatcgaa ctgtgaaatg 900gaaatggatg gagaagagtt
aatgaatgat atggtccttt tgttcattct caaattaata 960ttatttgttt
tttctcttat ttgttgtgtg ttgaatttga aattataaga gatatgcaaa
1020cattttgttt tgagtaaaaa tgtgtcaaat cgtggcctct aatgaccgaa
gttaatatga 1080ggagtaaaac acttgtagtt gtaccattat gcttattcac
taggcaacaa atatattttc 1140agacctagaa aagctgcaaa tgttactgaa
tacaagtatg tcctcttgtg ttttagacat 1200ttatgaactt tcctttatgt
aattttccag aatccttgtc agattctaat cattgcttta 1260taattatagt
tatactcatg gatttgtagt tgagtatgaa aatatttttt aatgcatttt
1320atgacttgcc aattgattga caacatgcat caatcgacct gcagccactc
gaagcggccg 1380gccgccactc gagatcatga gcggagaatt aagggagtca
cgttatgacc cccgccgatg 1440acgcgggaca agccgtttta cgtttggaac
tgacagaacc gcaacgttga aggagccact 1500cagccgcggg tttctggagt
ttaatgagct aagcacatac gtcagaaacc attattgcgc 1560gttcaaaagt
cgcctaaggt cactatcagc tagcaaatat ttcttgtcaa aaatgctcca
1620ctgacgttcc ataaattccc ctcggtatcc aattagagtc tcatattcac
tctcaatcca 1680aataatctgc accggatctg gatcgtttcg catgattgaa
caagatggat tgcacgcagg 1740ttctccggcc gcttgggtgg agaggctatt
cggctatgac tgggcacaac agacaatcgg 1800ctgctctgat gccgccgtgt
tccggctgtc agcgcagggg cgcccggttc tttttgtcaa 1860gaccgacctg
tccggtgccc tgaatgaact gcaggacgag gcagcgcggc tatcgtggct
1920ggccacgacg ggcgttcctt gcgcagctgt gctcgacgtt gtcactgaag
cgggaaggga 1980ctggctgcta ttgggcgaag tgccggggca ggatctcctg
tcatctcacc ttgctcctgc 2040cgagaaagta tccatcatgg ctgatgcaat
gcggcggctg catacgcttg atccggctac 2100ctgcccattc gaccaccaag
cgaaacatcg catcgagcga gcacgtactc ggatggaagc 2160cggtcttgtc
gatcaggatg atctggacga agagcatcag gggctcgcgc cagccgaact
2220gttcgccagg ctcaaggcgc gcatgcccga cggcgaggat ctcgtcgtga
cccatggcga 2280tgcctgcttg ccgaatatca tggtggaaaa tggccgcttt
tctggattca tcgactgtgg 2340ccggctgggt gtggcggacc gctatcagga
catagcgttg gctacccgtg atattgctga 2400agagcttggc ggcgaatggg
ctgaccgctt cctcgtgctt tacggtatcg ccgctcccga 2460ttcgcagcgc
atcgccttct atcgccttct tgacgagttc ttctgagcgg gactctgggg
2520ttcgaaatga ccgaccaagc gacgcccaac ctgccatcac gagatttcga
ttccaccgcc 2580gccttctatg aaaggttggg cttcggaatc gttttccggg
acgccggctg gatgatcctc 2640cagcgcgggg atctcatgct ggagttcttc
gcccacggga tctctgcgga acaggcggtc 2700gaaggtgccg atatcattac
gacagcaacg gccgacaagc acaacgccac gatcctgagc 2760gacaatatga
tcgggcccgg cgtccacatc aacggcgtcg gcggcgactg cccaggcaag
2820accgagatgc accgcgatat cttgctgcgt tcggatattt tcgtggagtt
cccgccacag 2880acccggatga tccccgatcg ttcaaacatt tggcaataaa
gtttcttaag attgaatcct 2940gttgccggtc ttgcgatgat tatcatataa
tttctgttga attacgttaa gcatgtaata 3000attaacatgt aatgcatgac
gttatttatg agatgggttt ttatgattag agtcccgcaa 3060ttatacattt
aatacgcgat agaaaacaaa atatagcgcg caaactagga taaattatcg
3120cgcgcggtgt catctatgtt actagatcgg gctcgaga
3158172230PRTLycopersicon esculentumG3553 polypeptide 172Met Asp
Gln His Gly Asn Gly Gln Pro Pro Val Ser Ala Gly Ala Ile 1 5 10 15
Gln Ser Pro Gln Ala Ala Gly Leu Ala Ala Ser Ser Ala Gln Met Ala 20
25 30 Gln His Gln Leu Ala Tyr Gln His Ile His Gln Gln Gln Gln Gln
Gln 35 40 45 Leu Gln Gln Gln Leu Gln Thr Phe Trp Ala Asn Gln Tyr
Gln Glu Ile 50 55 60 Glu His Val Thr Asp Phe Lys Asn His Ser Leu
Pro Leu Ala Arg Ile 65 70 75 80 Lys Lys Ile Met Lys Ala Asp Glu Asp
Val Arg Met Ile Ser Ala Glu 85 90 95 Ala Pro Val Val Phe Ala Arg
Ala Cys Glu Met Phe Ile Leu Glu Leu 100 105 110 Thr Leu Arg Ala Trp
Asn His Thr Glu Glu Asn Lys Arg Arg Thr Leu 115 120 125 Gln Lys Asn
Asp Ile Ala Ala Ala Ile Thr Arg Thr Asp Ile Phe Asp 130 135 140 Phe
Leu Val Asp Ile Val Pro Arg Glu Asp Leu Lys Asp Glu Val Leu 145 150
155 160 Ala Thr Ile Pro Arg Gly Thr Leu Pro Val Gly Gly Pro Thr Glu
Gly 165 170 175 Leu Pro Phe Tyr Tyr Gly Met Pro Pro Gln Ser Ala Gln
Pro Ile Gly 180 185 190 Ala Pro Gly Met Tyr Met Gly Lys Pro Val Asp
Gln Ala Leu Tyr Ala 195 200 205 Gln Gln Pro Arg Pro Tyr Met Ala Gln
Pro Ile Trp Pro Gln Gln Gln 210 215 220 Gln Pro Pro Ser Asp Ser 225
230 173271PRTLycopersicon esculentumG3554 polypeptide 173Met Asp
Gln His Gly Asn Gly Gln Pro Pro Gly Ile Gly Val Val Thr 1 5 10 15
Ser Ser Ala Pro Ile Tyr Gly Ala Pro Tyr Gln Ala Asn Gln Met Ala 20
25 30 Gly Pro Ser Pro Pro Ala Val Ser Ala Gly Ala Ile Gln Ser Pro
Gln 35 40 45 Ala Ala Gly Leu Ala Ala Ser Ser Ala Gln Met Ala Gln
His Gln Leu 50 55 60 Ala Tyr Gln His Ile His Gln Gln Gln Gln Gln
Gln Leu Gln Gln Gln 65 70 75 80 Leu Gln Thr Phe Trp Ala Asn Gln Tyr
Gln Glu Ile Glu His Val Thr 85 90 95 Asp Phe Lys Asn His Ser Leu
Pro Leu Ala Arg Ile Lys Lys Ile Met 100 105 110 Lys Ala Asp Glu Asp
Val Arg Met Ile Ser Ala Glu Ala Pro Val Val 115 120 125 Phe Ala Arg
Ala Cys Glu Met Phe Ile Leu Glu Leu Thr Leu Arg Ala 130 135 140 Trp
Asn His Thr Glu Glu Asn Lys Arg Arg Thr Leu Gln Lys Asn Asp 145 150
155 160 Ile Ala Ala Ala Ile Thr Arg Thr Asp Ile Phe Asp Phe Leu Val
Asp 165 170 175 Ile Val Pro Arg Glu Asp Leu Lys Asp Glu Val Leu Ala
Thr Ile Pro 180 185 190 Arg Gly Thr Leu Pro Val Gly Gly Pro Thr Glu
Gly Leu Pro Phe Tyr 195 200 205 Tyr Gly Met Pro Pro Gln Ser Ala Gln
Pro Ile Gly Ala Pro Gly Met 210 215 220 Tyr Met Gly Lys Ala Cys Arg
Ser Ser Ser Val Cys Pro Ala Ala Pro 225 230 235 240 Pro Ile Tyr Gly
Thr Ala Asn Leu Ala Pro Ala Ala Ala Thr Thr Leu 245 250 255 Arg Phe
Leu Ser Ser Ser Lys Leu Arg Leu Gln Glu Ser Arg Ser 260 265 270
174258PRTLycopersicon esculentumG3894 polypeptide 174Met Asp Gln
His Gly Asn Gly Gln Pro Pro Gly Ile Gly Val Val Thr 1 5 10 15 Ser
Ser Ala Pro Ile Tyr Gly Ala Pro Tyr Gln Ala Asn Gln Met Ala 20 25
30 Gly Pro Ser Pro Pro Ala Val Ser Ala Gly Ala Ile Gln Ser Pro Gln
35 40 45 Ala Ala Gly Leu Ala Ala Ser Ser Ala Gln Met Ala Gln His
Gln Leu 50 55 60 Ala Tyr Gln His Ile His Gln Gln Gln Gln Gln Gln
Leu Gln Gln Gln 65 70 75 80 Leu Gln Thr Phe Trp Ala Asn Gln Tyr Gln
Glu Ile Glu His Val Thr 85 90 95 Asp Phe Lys Asn His Ser Leu Pro
Leu Ala Arg Ile Lys Lys Ile Met 100 105 110 Lys Ala Asp Glu Asp Val
Arg Met Ile Ser Ala Glu Ala Pro Val Val 115 120 125 Phe Ala Arg Ala
Cys Glu Met Phe Ile Leu Glu Leu Thr Leu Arg Ala 130 135 140 Trp Asn
His Thr Glu Glu Asn Lys Arg Arg Thr Leu Gln Lys Asn Asp 145 150 155
160 Ile Ala Ala Ala Ile Thr Arg Thr Asp Ile Phe Asp Phe Leu Val Asp
165 170 175 Ile Val Pro Arg Glu Asp Leu Lys Asp Glu Val Leu Ala Thr
Ile Pro 180 185 190 Arg Gly Thr Leu Pro Val Gly Gly Pro Thr Glu Gly
Leu Pro Phe Tyr 195 200 205 Tyr Gly Met Pro Pro Gln Ser Ala Gln Pro
Ile Gly Ala Pro Gly Met 210 215 220 Tyr Met Gly Lys Pro Val Asp Gln
Ala Leu Tyr Ala Gln Gln Pro Arg 225 230 235 240 Pro Tyr Met Ala Gln
Pro Ile Trp Pro Gln Gln Gln Gln Pro Pro Ser 245 250 255 Asp Ser
175230PRTSolanum tuberosumG3892 polypeptide 175Met Asp His His Gly
Asn Gly Gln Pro Pro Val Ser Ala Gly Ala Ile 1 5 10 15 Gln Ser Pro
Gln Ala Ala Gly Leu Ser Ala Ser Ser Ala Gln Met Ala 20 25 30 Gln
His Gln Leu Ala Tyr Gln His Ile His Gln Gln Gln Gln Gln Gln 35 40
45 Leu Gln Gln Gln Leu Gln Thr Phe Trp Ala Asn Gln Tyr Gln Glu Ile
50 55 60 Glu His Val Thr Asp Phe Lys Asn His Ser Leu Pro Leu Ala
Arg Ile 65 70 75 80 Lys Lys Ile Met Lys Ala Asp Glu Asp Val Arg Met
Ile Ser Ala Glu 85 90 95 Ala Pro Val Val Phe Ala Arg Ala Cys Glu
Met Phe Ile Leu Glu Leu 100 105 110 Thr Leu Arg Ala Trp Asn His Thr
Glu Glu Asn Lys Arg Arg Thr Leu 115 120 125 Gln Lys Asn Asp Ile Ala
Ala Ala Ile Thr Arg Thr Asp Ile Phe Asp 130 135 140 Phe Leu Val Asp
Ile Val Pro Arg Glu Asp Leu Lys Asp Glu Val Leu 145 150 155 160 Ala
Thr Ile Pro Arg Gly Thr Leu Pro Val Gly Gly Pro Thr Glu Gly 165 170
175 Leu Pro Phe Tyr Tyr Gly Met Pro Pro Gln Ser Ala Gln Pro Ile Gly
180 185 190 Ala Pro Gly Met Tyr Met Gly Lys Pro Val Asp Gln Ala Leu
Tyr Ala 195 200 205 Gln Gln Pro Arg Pro Phe Met Ala Gln Pro Ile Trp
Pro Gln Gln Gln 210 215 220 Gln Pro Pro Ser Asp Ser 225 230
176228PRTSolanum tuberosumG3893 polypeptide 176Met Asp His His Gly
Asn Gly Gln Pro Pro Gly Ile Gly Val Val Thr 1 5 10 15 Ser Ser Ala
Pro Ile Tyr Gly Ala Pro Tyr Gln Ala Asn Gln Met Ala 20 25 30 Gly
Pro Pro Ala Val Ser Ala Gly Ala Ile Gln Ser Pro Gln Ala Ala 35 40
45 Gly Leu Ser Ala Ser Ser Ala Gln Met Ala Gln His Gln Leu Ala Tyr
50 55 60 Gln His Ile His Gln Gln Gln Gln Gln Gln Leu Gln Gln Gln
Leu Gln 65 70 75 80 Thr Phe Trp Ala Asn Gln Tyr Gln Glu Ile Glu His
Val Thr Asp Phe 85 90 95 Lys Asn His Ser Leu Pro Leu Ala Arg Ile
Lys Lys Ile Met Lys Ala 100 105 110 Asp Glu Asp Val Arg Met Ile Ser
Ala Glu Ala Pro Val Val Phe Ala 115 120 125 Arg Ala Cys Glu Met Phe
Ile Leu Glu Leu Thr Leu Arg Ala Trp Asn 130 135 140 His Thr Glu Glu
Asn Lys Arg Arg Thr Leu Gln Lys Asn Asp Ile Ala 145 150 155 160 Ala
Ala Ile Thr Arg Thr Asp Ile Phe Asp Phe Leu Val Asp Ile Val 165 170
175 Pro Arg Glu Asp Leu Lys Asp Glu Val Leu Ala Thr Ile Pro Arg Gly
180 185 190 Thr Leu Pro Val Gly Gly Pro Thr Glu Gly Leu Pro Phe Tyr
Tyr Gly 195 200 205 Met Pro Pro Gln Ser Ala Gln Pro Ile Gly Ala Pro
Gly Met Tyr Met 210 215 220 Gly Lys Pro Val 225 177260PRTMedicago
truncatulaG3896 polypeptide 177Met Asp His Gln Gly His Asn Gln Asn
Pro Gln Met Gly Val Val Gly 1 5 10 15 Ser Gly Ser Gln Met Pro Tyr
Gly Ser Asn Pro Tyr Gln Ser Asn Gln 20 25 30 Met Thr Gly Ala Pro
Gly Ser Val Val Thr Ser Val Gly Gly Met Gln 35 40 45 Ser Thr Gly
Gln Pro Ala Gly Ala Gln Leu Gly Gln His Gln Leu Ala 50
55 60 Tyr Gln His Ile His Gln Gln Gln Gln Gln Gln Leu Gln Gln Gln
Leu 65 70 75 80 Gln Ser Phe Trp Ser Asn Gln Tyr Gln Glu Ile Glu Lys
Val Thr Asp 85 90 95 Phe Lys Asn His Ser Leu Pro Leu Ala Arg Ile
Lys Lys Ile Met Lys 100 105 110 Ala Asp Glu Asp Val Arg Met Ile Ser
Ala Glu Ala Pro Val Ile Phe 115 120 125 Ala Arg Ala Cys Glu Met Phe
Ile Leu Glu Leu Thr Leu Arg Ser Trp 130 135 140 Asn His Thr Glu Glu
Asn Lys Arg Arg Thr Leu Gln Lys Asn Asp Ile 145 150 155 160 Ala Ala
Ala Ile Thr Arg Thr Asp Ile Phe Asp Phe Leu Val Asp Ile 165 170 175
Val Pro Arg Glu Asp Leu Lys Asp Glu Val Leu Ala Ser Ile Pro Arg 180
185 190 Gly Thr Met Pro Val Ala Gly Pro Ala Asp Ala Leu Pro Tyr Cys
Tyr 195 200 205 Met Pro Pro Gln His Ala Ser Gln Val Gly Thr Ala Gly
Val Ile Met 210 215 220 Gly Lys Pro Val Met Asp Pro Asn Met Tyr Ala
Gln Gln Pro His Pro 225 230 235 240 Tyr Met Ala Pro Gln Met Trp Pro
Gln Pro Pro Glu Gln Arg Pro Pro 245 250 255 Ser Pro Asp His 260
178248PRTZea maysG3551 polypeptide 178Met Glu Pro Ser Pro Gln Pro
Met Gly Val Ala Ala Gly Gly Ser Gln 1 5 10 15 Val Tyr Pro Ala Ser
Ala Tyr Pro Pro Ala Ala Thr Val Ala Pro Ala 20 25 30 Ser Val Val
Ser Ala Gly Leu Gln Ser Gly Gln Pro Phe Pro Ala Asn 35 40 45 Pro
Gly His Met Ser Ala Gln His Gln Ile Val Tyr Gln Gln Ala Gln 50 55
60 Gln Phe His Gln Gln Leu Gln Gln Gln Gln Gln Gln Gln Leu Gln Gln
65 70 75 80 Phe Trp Val Glu Arg Met Thr Glu Ile Glu Ala Thr Thr Asp
Phe Lys 85 90 95 Asn His Asn Leu Pro Leu Ala Arg Ile Lys Lys Ile
Met Lys Ala Asp 100 105 110 Glu Asp Val Arg Met Ile Ser Ala Glu Ala
Pro Val Val Phe Ala Lys 115 120 125 Ala Cys Glu Ile Phe Ile Leu Glu
Leu Thr Leu Arg Ser Trp Met His 130 135 140 Thr Glu Val Asn Lys Arg
Arg Thr Leu Gln Lys Asn Asp Ile Ala Ala 145 150 155 160 Ala Ile Thr
Arg Thr Asp Ile Tyr Asp Phe Leu Val Asp Ile Val Pro 165 170 175 Arg
Asp Glu Met Lys Glu Asp Gly Ile Gly Leu Pro Arg Ala Gly Leu 180 185
190 Pro Pro Met Gly Ala Pro Ala Asp Ala Tyr Pro Tyr Tyr Tyr Met Pro
195 200 205 Gln Gln Gln Val Pro Gly Ser Gly Met Val Tyr Gly Ala Gln
Gln Gly 210 215 220 His Pro Val Thr Tyr Leu Trp Gln Glu Pro Gln Gln
Gln Gln Glu Gln 225 230 235 240 Ala Pro Glu Glu Gln Gln Ser Ala 245
179249PRTDaucus carotaG3899 polypeptide 179Met Asp Glu Ser Glu Glu
Pro Gln Gln Gln Gln Glu Ala Val Ile Asp 1 5 10 15 Ser Ala Ser Gln
Met Thr Tyr Gly Val Pro His Tyr His Ala Val Gly 20 25 30 Leu Gly
Val Ala Thr Gly Thr Pro Val Val Pro Val Ser Ala Pro Thr 35 40 45
Gln His Pro Thr Gly Thr Thr Ser Gln Gln Gln Pro Glu Tyr Tyr Glu 50
55 60 Ala Gln His Val Tyr Gln Gln Gln Gln Leu Gln Leu Arg Thr Gln
Leu 65 70 75 80 Gln Ala Phe Trp Ala Asn Gln Ile Gln Glu Ile Gly Gln
Thr Pro Asp 85 90 95 Phe Lys Asn His Ser Leu Pro Leu Ala Arg Ile
Lys Lys Ile Met Lys 100 105 110 Ala Asp Glu Asp Val Arg Met Ile Ser
Ser Glu Ala Pro Val Ile Phe 115 120 125 Ala Lys Ala Cys Glu Met Phe
Ile Leu Glu Leu Thr Met Arg Ser Trp 130 135 140 Leu Leu Thr Glu Glu
Asn Lys Arg Arg Thr Leu Gln Lys Asn Asp Ile 145 150 155 160 Ala Ala
Ala Ile Ser Arg Thr Asp Ile Phe Asp Phe Leu Val Asp Ile 165 170 175
Ile Pro Arg Asp Glu Leu Lys Glu Glu Gly Leu Gly Ile Thr Lys Ala 180
185 190 Thr Ile Pro Leu Leu Gly Ser Pro Ala Asp Ser Ala Pro Tyr Tyr
Tyr 195 200 205 Val Pro Gln Gln His Ala Val Glu Gln Ala Gly Phe Tyr
Pro Asp Gln 210 215 220 Gln Ala His Pro Gln Leu Pro Tyr Met Ser Trp
Gln Gln Pro His Glu 225 230 235 240 His Lys Asp Gln Glu Glu Asn Gly
Asp 245 180229PRTDaucus carotaG3900 polypeptide 180Met Asp His Ser
Glu Glu Ser Gln Gln Gln Gln Glu Glu Val Ile Asp 1 5 10 15 Ile Ala
Tyr Gly Met Pro Gln Tyr His Ala Gly Pro Gly Val Ala Thr 20 25 30
Gly Thr Pro Val Val Pro Val Ser Ala Ala Thr Gln Ala Gln His Phe 35
40 45 Phe Gln Gln Lys Leu Gln Leu Gln Gln Gln Asp Gln Leu Gln Ala
Phe 50 55 60 Trp Ala Asn Gln Met Gln Glu Ile Glu Gln Thr Thr Asp
Phe Lys Asn 65 70 75 80 His Ser Leu Pro Leu Ala Arg Ile Lys Lys Ile
Met Lys Ala Asp Glu 85 90 95 Asp Val Arg Met Ile Ser Ser Glu Ala
Pro Val Val Phe Ala Lys Ala 100 105 110 Cys Glu Met Phe Ile Met Asp
Leu Thr Met Arg Ser Trp Ser His Thr 115 120 125 Glu Glu Asn Lys Arg
Arg Thr Leu Gln Lys Asn Asp Ile Ala Ala Ala 130 135 140 Val Ser Arg
Thr Asp Val Phe Asp Phe Leu Val Asp Ile Ile Pro Lys 145 150 155 160
Asp Glu Met Lys Glu Asp Thr Arg Ala Ser Ile Pro Leu Met Gly Gln 165
170 175 Pro Pro Ala Asp Ser Val Pro Tyr Tyr Tyr Val Pro Gln Gln His
Ala 180 185 190 Ala Gly Gln Ala Gly Phe Tyr Pro Asp Gln His Gln Gln
Gln Pro Leu 195 200 205 Pro Tyr Met Gln Trp Gln Gln Pro Gln Gln Asp
Gln Asn Gln Gln Gln 210 215 220 Gln Glu Asn Gly Asn 225
181184PRTGossypium arboreumG3907 polypeptide 181Met Asp Gln Arg Glu
Lys Thr Gln Gln Gln Gln Gln Gln Pro Val Met 1 5 10 15 Gly Val Val
Pro Gly Ala Gly Gln Met Gly Tyr Ser Thr Ala Tyr Gln 20 25 30 Thr
Ala Ser Met Val Ala Ser Gly Thr Thr Gly Val Ala Val Pro Ile 35 40
45 Gln Thr Gln Pro Ser Ala Thr Phe Ser Ser Ser Pro His Gln Leu Ala
50 55 60 Tyr Gln Gln Ala Gln His Phe His His Gln Gln Gln Gln Gln
Gln Gln 65 70 75 80 Gln Gln Leu Gln Met Phe Trp Ala Asn Gln Met His
Glu Ile Glu Gln 85 90 95 Thr Thr Asp Phe Lys Asn His Ser Leu Pro
Leu Ala Arg Ile Lys Lys 100 105 110 Ile Met Lys Ala Asp Glu Asp Val
Arg Met Ile Ser Ala Glu Ala Pro 115 120 125 Val Ile Phe Ala Lys Ala
Cys Glu Met Phe Val Leu Glu Leu Thr Leu 130 135 140 Arg Ser Trp Ile
His Thr Glu Glu Asn Lys Arg Arg Thr Leu Gln Lys 145 150 155 160 Asn
Asp Ile Ala Ala Ala Ile Ser Arg Thr Asp Val Phe Asp Phe Leu 165 170
175 Val Asp Ile Ile Pro Gly Thr Glu 180 182270PRTGlycine maxG3549
polypeptide 182Met Asp Lys Ser Glu Gln Thr Gln Gln Gln Gln Gln Gln
Gln His Val 1 5 10 15 Met Gly Val Ala Ala Gly Ala Ser Gln Met Ala
Tyr Ser Ser His Tyr 20 25 30 Pro Thr Ala Ser Met Val Ala Ser Gly
Thr Pro Ala Val Thr Ala Pro 35 40 45 Ser Pro Thr Gln Ala Pro Ala
Ala Phe Ser Ser Ser Ala His Gln Leu 50 55 60 Ala Tyr Gln Gln Ala
Gln His Phe His His Gln Gln Gln Gln His Gln 65 70 75 80 Gln Gln Gln
Leu Gln Met Phe Trp Ser Asn Gln Met Gln Glu Ile Glu 85 90 95 Gln
Thr Ile Asp Phe Lys Asn His Ser Leu Pro Leu Ala Arg Ile Lys 100 105
110 Lys Ile Met Lys Ala Asp Glu Asp Val Arg Met Ile Ser Ala Glu Ala
115 120 125 Pro Val Ile Phe Ala Lys Ala Cys Glu Met Phe Ile Leu Glu
Leu Thr 130 135 140 Leu Arg Ser Trp Ile His Thr Glu Glu Asn Lys Arg
Arg Thr Leu Gln 145 150 155 160 Lys Asn Asp Ile Ala Ala Ala Ile Ser
Arg Asn Asp Val Phe Asp Phe 165 170 175 Leu Val Asp Ile Ile Pro Arg
Asp Glu Leu Lys Glu Glu Gly Leu Gly 180 185 190 Ile Thr Lys Ala Thr
Ile Pro Leu Val Asn Ser Pro Ala Asp Met Pro 195 200 205 Tyr Tyr Tyr
Val Pro Pro Gln His Pro Val Val Gly Pro Pro Gly Met 210 215 220 Ile
Met Gly Lys Pro Val Gly Ala Glu Gln Ala Thr Leu Tyr Ser Thr 225 230
235 240 Gln Gln Pro Arg Pro Pro Met Ala Phe Met Pro Trp Pro His Thr
Gln 245 250 255 Pro Gln Gln Gln Gln Pro Pro Gln His Gln Gln Thr Asp
Ser 260 265 270 183201PRTSorghum bicolorG3910 polypeptide 183Met
Glu Pro Lys Ser Thr Thr Pro Pro Pro Pro Pro Val Met Gly Ala 1 5 10
15 Pro Val Ala Tyr Pro Pro Pro Pro Gly Ala Ala Tyr Pro Ala Gly Pro
20 25 30 Tyr Ala His Ala Pro Ala Ala Ala Leu Tyr Pro Pro Pro Pro
Pro Pro 35 40 45 Pro Ala Pro Pro Thr Ser Gln Gln Gly Ala Ala Ala
Ala Gln Gln Leu 50 55 60 Gln Leu Phe Trp Ala Glu Gln Tyr Arg Glu
Ile Glu Ala Thr Thr Asp 65 70 75 80 Phe Lys Asn His Asn Leu Pro Leu
Ala Arg Ile Lys Lys Ile Met Lys 85 90 95 Ala Asp Glu Asp Val Arg
Met Ile Ala Ala Glu Ala Pro Val Val Phe 100 105 110 Ala Arg Ala Cys
Glu Met Phe Ile Leu Glu Leu Thr His Arg Gly Trp 115 120 125 Ala His
Ala Glu Glu Asn Lys Arg Arg Thr Leu Gln Lys Ser Asp Ile 130 135 140
Ala Ala Ala Val Ala Arg Thr Glu Val Phe Asp Phe Leu Val Asp Ile 145
150 155 160 Val Pro Arg Asp Glu Ala Lys Glu Ala Asp Ser Ala Ala Ala
Met Gly 165 170 175 Pro Ala Gly Ile Pro His Pro Ala Ala Gly Leu Pro
Ala Thr Asp Pro 180 185 190 Met Gly Tyr Tyr Tyr Val Gln Pro Gln 195
200 184232PRTLycopersicon esculentumG3555 polypeptide 184Met Asp
Asn Asn Pro His Gln Ser Pro Thr Glu Ala Ala Ala Ala Ala 1 5 10 15
Ala Ala Ala Ala Ala Ala Ala Gln Ser Ala Thr Tyr Pro Ser Gln Thr 20
25 30 Pro Tyr His His Leu Leu Gln Gln Gln Gln Gln Gln Leu Gln Met
Phe 35 40 45 Trp Thr Tyr Gln Arg Gln Glu Ile Glu Gln Val Asn Asp
Phe Lys Asn 50 55 60 His Gln Leu Pro Leu Ala Arg Ile Lys Lys Ile
Met Lys Ala Asp Glu 65 70 75 80 Asp Val Arg Met Ile Ser Ala Glu Ala
Pro Val Leu Phe Ala Lys Ala 85 90 95 Cys Glu Leu Phe Ile Leu Glu
Leu Thr Ile Arg Ser Trp Leu His Ala 100 105 110 Glu Glu Asn Lys Arg
Arg Thr Leu Gln Lys Asn Asp Ile Ala Ala Ala 115 120 125 Ile Thr Arg
Thr Asp Ile Phe Asp Phe Leu Val Asp Ile Val Pro Arg 130 135 140 Asp
Glu Ile Lys Asp Glu Gly Val Gly Leu Gly Pro Gly Ile Val Gly 145 150
155 160 Ser Thr Ala Ser Gly Val Pro Tyr Tyr Tyr Pro Pro Met Gly Gln
Pro 165 170 175 Ala Pro Gly Gly Val Met Leu Gly Arg Pro Ala Val Pro
Gly Val Asp 180 185 190 Pro Ser Met Tyr Val His Pro Pro Pro Ser Gln
Ala Trp Gln Ser Val 195 200 205 Trp Gln Thr Gly Asp Asp Asn Ser Tyr
Ala Ser Gly Gly Ser Ser Gly 210 215 220 Gln Gly Asn Leu Asp Gly Gln
Ile 225 230 185232PRTSolanum tuberosumG3885 polypeptide 185Met Asp
Asn Asn Pro His Gln Ser Pro Thr Glu Ala Ala Ala Ala Ala 1 5 10 15
Ala Ala Ala Ala Ala Ala Ala Gln Ser Ala Thr Tyr Pro Pro Gln Thr 20
25 30 Pro Tyr His His Leu Leu Gln Gln Gln Gln Gln Gln Leu Gln Met
Phe 35 40 45 Trp Thr Tyr Gln Arg Gln Glu Ile Glu Gln Val Asn Asp
Phe Lys Asn 50 55 60 His Gln Leu Pro Leu Ala Arg Ile Lys Lys Ile
Met Lys Ala Asp Glu 65 70 75 80 Asp Val Arg Met Ile Ser Ala Glu Ala
Pro Val Leu Phe Ala Lys Ala 85 90 95 Cys Glu Leu Phe Ile Leu Glu
Leu Thr Ile Arg Ser Trp Leu His Ala 100 105 110 Glu Glu Asn Lys Arg
Arg Thr Leu Gln Lys Asn Asp Ile Ala Ala Ala 115 120 125 Ile Thr Arg
Thr Asp Ile Phe Asp Phe Leu Val Asp Ile Val Pro Arg 130 135 140 Asp
Glu Ile Lys Asp Glu Gly Val Val Leu Gly Pro Gly Ile Val Gly 145 150
155 160 Ser Thr Ala Ser Gly Val Pro Tyr Tyr Tyr Pro Pro Met Gly Gln
Pro 165 170 175 Ala Pro Gly Gly Val Met Leu Gly Arg Pro Ala Val Pro
Gly Val Asp 180 185 190 Pro Ser Met Tyr Val His Pro Pro Pro Ser Gln
Ala Trp Gln Ser Val 195 200 205 Trp Gln Thr Gly Asp Asp Asn Ser Tyr
Ala Ser Gly Gly Ser Ser Gly 210 215 220 Gln Gly Asn Leu Asp Gly Gln
Ile 225 230 186233PRTGossypium raimondiiG3883 polypeptide 186Met
Asp Ser Asn Gln Gln Thr Gln Ser Thr Pro Tyr Pro Pro Gln Pro 1 5 10
15 Pro Thr Ser Ala Ile Thr Pro Pro Ser Ser Ala Thr Ala Thr Ala Pro
20 25 30 Pro Phe His His Leu Leu Gln Gln Gln Gln Gln Gln Leu Gln
Met Phe 35 40 45 Trp Ser Tyr Gln Arg Gln Glu Ile Glu Gln Val Asn
Asp Phe Lys Asn 50 55 60 His Gln Leu Pro Leu Ala Arg Ile Lys Lys
Ile Met Lys Ala Asp Glu 65 70 75 80 Asp Val Arg Met Ile Ser Ala Glu
Ala Pro Ile Leu Phe Ala Lys Ala 85 90 95 Cys Glu Leu Phe Ile Leu
Glu Leu Thr Ile Arg Ser Trp Leu His Ala 100 105 110 Glu Glu Asn Lys
Arg Arg Thr Leu Gln Lys Asn Asp Ile Ala Ala Ala 115 120 125 Ile Thr
Arg Thr Asp Ile Phe Asp Phe Leu Val Asp Ile Val Pro Arg 130 135 140
Asp Glu Ile Lys Asp Glu Thr Gly Leu Ala Pro Met Val Gly Ala Thr 145
150 155 160 Ala Ser Gly Val Pro Tyr Phe Tyr Pro Pro Met Gly Gln Pro
Ala Ala 165 170 175 Gly Gly Pro Gly Gly Met Met Ile Gly Arg Pro Ala
Val Asp Pro Thr 180 185 190 Gly Gly Ile Tyr Gly Gln Pro Pro Ser Gln
Ala Trp Gln Ser Val Trp 195
200 205 Gln Thr Ala Gly Thr Asp Asp Gly Ser Tyr Gly Ser Gly Val Thr
Gly 210 215 220 Gly Gln Gly Asn Leu Asp Gly Gln Gly 225 230
187227PRTNicotiana benthamianaG3884 polypeptide 187Met Glu Asn Asn
Gln Gln Ser Ala Ala Asn Ala Ala Ala Ala Ala Ala 1 5 10 15 Ala Ala
Ala Ala Tyr Pro Ala Gln Pro Pro Tyr His His Leu Leu Gln 20 25 30
Gln Gln Gln Gln Gln Leu Gln Met Phe Trp Thr Tyr Gln Arg Gln Glu 35
40 45 Ile Glu Gln Val Asn Asp Phe Lys Asn His Gln Leu Pro Leu Ala
Arg 50 55 60 Ile Lys Lys Ile Met Lys Ala Asp Glu Asp Val Arg Met
Ile Ser Ala 65 70 75 80 Glu Ala Pro Ile Leu Phe Ala Lys Ala Cys Glu
Leu Phe Ile Leu Glu 85 90 95 Leu Thr Ile Arg Ser Trp Leu His Ala
Glu Glu Asn Lys Arg Arg Thr 100 105 110 Leu Gln Lys Asn Asp Ile Ala
Ala Ala Ile Thr Arg Thr Asp Ile Phe 115 120 125 Asp Phe Leu Val Asp
Ile Val Pro Arg Asp Glu Ile Lys Glu Glu Gly 130 135 140 Gly Val Gly
Leu Gly Pro Ala Gly Ile Val Gly Ser Thr Ala Ser Gly 145 150 155 160
Val Pro Tyr Tyr Tyr Pro Pro Met Gly Gln Pro Ala Pro Pro Gly Val 165
170 175 Met Met Gly Arg Pro Ala Met Pro Gly Val Asp Pro Ser Met Tyr
Val 180 185 190 Gln Pro Pro Pro Ser Gln Ala Trp Gln Ser Val Trp Gln
Thr Ala Glu 195 200 205 Asp Asn Ser Tyr Ala Ser Gly Gly Ser Ser Gly
Gln Gly Asn Leu Asp 210 215 220 Gly Gln Ser 225
188214PRTPhyscomitrella patensG3867 polypeptide 188Met Ser His Pro
Gly Ala Val Met Pro Leu Gln Met His Tyr Pro Gln 1 5 10 15 Ala Gln
Gln Gln Met Met Pro Gln Leu Gly Asp Gln Gln Met Gln Pro 20 25 30
Gln Leu His Tyr Gln Gln Ile Gln Lys Gln Gln Leu Ser Gln Phe Trp 35
40 45 Gln Gln Gln Met Gln Glu Met Glu Gln Val Asn Asp Phe Lys Thr
His 50 55 60 Gln Leu Pro Leu Ala Arg Ile Lys Lys Ile Met Lys Ser
Asp Glu Asp 65 70 75 80 Val Lys Met Ile Ala Ala Glu Ala Pro Val Leu
Phe Ser Lys Ala Cys 85 90 95 Glu Met Phe Ile Leu Glu Leu Thr Leu
Arg Ser Trp Ile His Thr Glu 100 105 110 Glu Asn Lys Arg Arg Thr Leu
Gln Arg Asn Asp Ile Ala Gly Ala Ile 115 120 125 Thr Arg Gly Asp Ile
Phe Asp Phe Leu Val Asp Ile Val Pro Arg Asp 130 135 140 Glu Leu Lys
Glu Glu Asp Leu Gly Val Pro Trp Thr Gly Val Pro Gly 145 150 155 160
Asp Gly Ser Val Pro Tyr Gly Gly Ile Phe Tyr Pro Pro Met Ala Gly 165
170 175 Gln Gln Met His His Ser Met Gly Ala Pro Glu Met Met Val Gly
Gln 180 185 190 Pro Pro Asn Pro Gln Met Met Tyr Gln Pro Pro Gln Thr
Ala Phe Val 195 200 205 Pro Glu Gln Gln Gln Gln 210 189249PRTOryza
sativaG3545 polypeptide 189Met Asp Pro His Ser His Lys Lys Ala His
Glu Gly Leu Ile Gly Asp 1 5 10 15 Asn Pro Asp Ala Tyr Ala Val Thr
Thr Tyr Gln Pro Val Leu Met Val 20 25 30 Glu Pro Ser Ala Ala Ala
Ala Phe Pro Pro Ala Pro Gln Val Ala Pro 35 40 45 Ala Tyr Pro Val
Asn Pro Met Gln Leu Pro Glu His Gln Gln His Ala 50 55 60 Ile Gln
Gln Val Gln Gln Leu Gln Gln Gln Gln Lys Glu Gln Leu Gln 65 70 75 80
Ala Phe Trp Ala Asp Gln Met Ala Glu Val Glu Gln Met Thr Glu Phe 85
90 95 Lys Leu Pro Asn Leu Pro Leu Ala Arg Ile Lys Lys Ile Met Lys
Ala 100 105 110 Asp Glu Asp Val Lys Met Ile Ala Gly Glu Ala Pro Ala
Leu Phe Ala 115 120 125 Lys Ala Cys Glu Met Phe Ile Leu Asp Met Thr
Leu Arg Ser Trp Gln 130 135 140 His Thr Glu Glu Gly Arg Arg Arg Thr
Leu Gln Arg Ser Asp Val Glu 145 150 155 160 Ala Val Ile Lys Lys Thr
Asp Ile Phe Asp Phe Leu Val Asp Ile Ile 165 170 175 Thr Asp Asp Lys
Met Lys Asp Asp Gly Met Gly Ser Gln Ala Ala Ser 180 185 190 Met Val
Ser Pro Tyr Thr Ser Gly Gly Met Gly Phe Ser Phe Asp Leu 195 200 205
Tyr Pro Asn Gln His His Leu Ala Tyr Met Trp Pro Pro Gln Glu Gln 210
215 220 Gln Glu Gln Trp Pro Pro Gln Glu Gln Gln Glu Gln Lys Gln Lys
Gln 225 230 235 240 Asp Ser Asp Gly Gly Gly Gln Asp Glu 245
190275PRTArabidopsis thalianaG2637 polypeptide 190Met Glu Ser Glu
Lys Val Val Val Asp Glu Leu Pro Leu Ala Ile Val 1 5 10 15 Arg Arg
Val Val Lys Lys Lys Leu Ser Glu Cys Ser Pro Asp Tyr Asp 20 25 30
Val Ser Ile His Lys Glu Ala Leu Leu Ala Phe Ser Glu Ser Ala Arg 35
40 45 Ile Phe Ile His Tyr Leu Ser Ala Thr Ala Asn Asp Phe Cys Lys
Asp 50 55 60 Ala Arg Arg Gln Thr Met Lys Ala Asp Asp Val Phe Lys
Ala Leu Glu 65 70 75 80 Glu Met Asp Phe Ser Glu Phe Leu Glu Pro Leu
Lys Ser Ser Leu Glu 85 90 95 Asp Phe Lys Lys Lys Asn Ala Gly Lys
Lys Ala Gly Ala Ala Ala Ala 100 105 110 Ser Tyr Pro Ala Gly Gly Ala
Ala Leu Lys Ser Ser Ser Gly Thr Ala 115 120 125 Ser Lys Pro Lys Glu
Thr Lys Lys Arg Lys Gln Glu Glu Pro Ser Thr 130 135 140 Gln Lys Gly
Ala Arg Lys Ser Lys Ile Asp Glu Glu Thr Lys Arg Asn 145 150 155 160
Asp Glu Glu Thr Glu Asn Asp Asn Thr Glu Glu Glu Asn Gly Asn Asp 165
170 175 Glu Glu Asp Glu Asn Gly Asn Asp Glu Glu Asp Glu Asn Asp Asp
Glu 180 185 190 Asn Thr Glu Glu Asn Gly Asn Asp Glu Glu Asn Asp Asp
Glu Asn Thr 195 200 205 Glu Glu Asn Gly Asn Asp Glu Glu Asn Glu Lys
Glu Asp Glu Glu Asn 210 215 220 Ser Met Glu Glu Asn Gly Asn Glu Ser
Glu Glu Ser Gly Asn Glu Asp 225 230 235 240 His Ser Met Glu Glu Asn
Gly Ser Gly Val Gly Glu Asp Asn Glu Asn 245 250 255 Glu Asp Gly Ser
Val Ser Gly Ser Gly Glu Glu Val Glu Ser Asp Glu 260 265 270 Glu Asp
Glu 275 1913446DNAArtificial sequencesynthetic sequence
191cacggacctt ggatctgaag ttatgaacaa taacatattt ggcaaaacaa
agaaaaaaga 60aacaacaata ctaacatatt ttggtaaaag aacattgaga agtctcaaaa
attaacttct 120tcttattttg tttcctaata agaccgtttg cttcatttca
agttcttagg aaataatttc 180atgtaacgtg tatgtagata tgtttatgta
cagataaaga gagatctgaa aatgatatat 240agagcttttg tggtgataag
tgcaacaagc aggatatata tatcgaacgt ggtggttaga 300agatagcgtc
aaaatagatg ctagctgctg cgtatacatc atattcatat catatgtact
360tctcttttgt gatttctcat gtgattgaac atactacata aatcttgata
gatttataaa 420aatgcaacaa attgttgttt atataagaaa aataaaacac
tgatatgata tttcattagt 480tattatcaaa tttgcaatat aatgtttaac
atccaagatt tgttttacat aatcgttacg 540gttactaaag tttaatttat
gatgttttaa aacaaattga gactaaattt ctaaaagaaa 600catatacgta
catgtgtgta gctgcgtata tatatagaat ggtggggcta aaagctaatg
660atgtgtacat taattggaca tttgatgtgg ctggattgga cccaacttgc
tctttgatag 720agacctaact aagacaattt tgctcttcat tcatttctcc
cgtatacata attgaattaa 780ctgtacataa tgtttcacaa caagcgatct
agctatatat ttcaaaataa cagagactga 840tattttaatc tggtcttcta
agctctaacg tcaaattaaa aaaaaaatcc gatcttctaa 900ttaattagaa
gaaatcaatt atagaacctc tctctttaat ttcatttatt taaaactgct
960tggaaattta attattcact aaagactcac tattctcctt aatttatgat
aatttgtaga 1020tcatatgttc agtttttatt tatttgccat tcgaatgttg
agttttaatt aaaccaatat 1080gttaatattc gaattaaaaa aacttaccta
taattcactt atttaaaaac ataaaataat 1140aataattgca tcaccgtgat
acaaagcaac ctcacaagtc acaactctcg tgactacaaa 1200gatcactcat
taaacaaacc ttcctgcctt ctttttttct acttgggcac ctcgaccgat
1260cgaagactat tcttgggatc tgcttcaaaa acgactatat gttctaaatc
cacttcgtat 1320gatgacgaac atttggttta ctactgaaga tagagattac
gtccttctaa ttagaagtaa 1380ttaattattt tagtatttgg aagctaatgg
tggagatgta accgtatctt agtggatcga 1440gatattgtat ataaaatatg
tatgctacat cgaataataa actgaaagag agtaaaaagg 1500gatatttaat
gggaagaaaa gaagggtgga gatgtaacaa aggcgaagat aatggatatt
1560cttgggatgt tgtcttcaag gccacgagct tagattcttt tagttttgct
caatttgtta 1620agtttctact tttccttttg ttgcttacta cttttgctca
tgatctccat atacatatca 1680tacatatata tagtatacta tctttagact
gatttctcta tacactatct tttaacttat 1740gtatcgtttc aaaactcagg
acgtacatgt ttaaatttgg ttatataacc acgaccattt 1800caagtatata
tgtcatacca taccagattt aatataactt ctatgaagaa aatacataaa
1860gttggattaa aatgcaagtg acatcttttt agcataggtt catttggcat
agaagaaata 1920tataactaaa aatgaacttt aacttaaata gattttacta
tattacaatt ttttcttttt 1980acatggtcta atttattttt ctaaaattag
tataattgtt gttttgatga aacaataata 2040ccgtaagcaa tagttgctaa
aagatgtcca aatatttata aattacaaag taaatcaaat 2100aaggaagaag
acacgtggaa aacaccaaat aagagaagaa atggaaaaaa cagaaagaaa
2160ttttttaaca agaaaaatca attagtcctc aaacctgaga tatttaaagt
aatcaactaa 2220aacaggaaca cttgactaac aaagaaattt gaaacgtggt
ccaactttca cttaattata 2280ttgttttctc taaggcttat gcaatatatg
ccttaagcaa atgccgaatc tgtttttttt 2340ttttttgtta ttggatattg
actgaaaata aggggttttt tcacacttga agatctcaaa 2400agagaaaact
attacaacgg aaattcattg taaaagaagt gattaagcaa attgagcaaa
2460ggtttttatg tggtttattt cattatatga ttgacatcaa attgtatata
tatggttgtt 2520ttatttaaca atatatatgg atataacgta caaactaaat
atgtttgatt gacgaaaaaa 2580aatatatgta tgtttgatta acaacatagc
acatattcaa ctgatttttg tcctgatcat 2640ctacaactta ataagaacac
acaacattga acaaatcttt gacaaaatac tatttttggg 2700tttgaaattt
tgaatactta caattattct tctcgatctt cctctctttc cttaaatcct
2760gcgtacaaat ccgtcgacgc aatacattac acagttgtca attggttctc
agctctacca 2820aaaacatcta ttgccaaaag aaaggtctat ttgtacttca
ctgttacagc tgagaacatt 2880aaatataata agcaaatttg ataaaacaaa
gggttctcac cttattccaa aagaatagtg 2940taaaataggg taatagagaa
atgttaataa aaggaaatta aaaatagata ttttggttgg 3000ttcagatttt
gtttcgtaga tctacaggga aatctccgcc gtcaatgcaa agcgaaggtg
3060acacttgggg aaggaccagt ggtccgtaca atgttactta cccatttctc
ttcacgagac 3120gtcgataatc aaattgttta ttttcatatt tttaagtccg
cagttttatt aaaaaatcat 3180ggacccgaca ttagtacgag atataccaat
gagaagtcga cacgcaaatc ctaaagaaac 3240cactgtggtt tttgcaaaca
agagaaacca gctttagctt ttccctaaaa ccactcttac 3300ccaaatctct
ccataaataa agatcccgag actcaaacac aagtcttttt ataaaggaaa
3360gaaagaaaaa ctttcctaat tggttcatac caaagtctga gctcttcttt
atatctctct 3420tgtagtttct tattgggggt ctttgt 3446192800DNAArtificial
sequencesynthetic sequence 192aatagttggg ctgatttcgt agcccactta
atcagccttt aaatatggaa accctagcct 60agaaagtgaa caagaaaaac gtaaagatca
aaatggaaga gaacaacggc aacaacaacc 120actacctgcc gcaaccatcg
tcttcccaac tgccgccgcc accattgtat tatcaatcaa 180tgccgttgcc
gtcatattca ctgccgctgc cgtactcacc gcagatgcgg aattattgga
240ttgcgcagat gggaaacgca actgatgtta agcatcatgc gtttccacta
accaggataa 300agaaaatcat gaagtccaac ccggaagtga acatggtcac
tgcagaggct ccggtcctta 360tatcgaaggc ctgtgagatg ctcattcttg
atctcacaat gcgatcgtgg cttcataccg 420tggagggcgg tcgccaaact
ctcaagagat ccgatacgct cacgagatcc gatatctccg 480ccgcaacgac
tcgtagtttc aaatttacct tccttggcga cgttgtccca agagaccctt
540ccgtcgttac cgatgatccc gtgctacatc cggacggtga agtacttcct
ccgggaacgg 600tgataggata tccggtgttt gattgtaatg gtgtgtacgc
gtcaccgcca cagatgcagg 660agtggccggc ggtgcctggt gacggagagg
aggcagctgg ggaaattgga ggaagcagcg 720gcggtaattg aaaagtgttg
attgggtttt agggttgtaa tgcttttgtg agaatttgta 780tctctatgga
gtcatgtttg 8001931183DNAArtificial sequencesynthetic sequence
193gatatgacca aaatgattaa cttgcattac agttgggaag tatcaagtaa
acaacatttt 60gtttttgttt gatatcggga atctcaaaac caaagtccac actagttttt
ggactatata 120atgataaaag tcagatatct actaatacta gttgatcagt
atattcgaaa acatgacttt 180ccaaatgtaa gttatttact ttttttttgc
tattataatt aagatcaata aaaatgtcta 240agttttaaat ctttatcatt
atatccaaac aatcataatc ttattgttaa tctctcatca 300acacacagtt
tttaaaataa attaattacc ctttgcatga taccgaagag aaacgaattc
360gttcaaataa ttttataaca ggaaataaaa tagataaccg aaataaacga
tagaatgatt 420tcttagtact aactcttaac aacagtttta tttaaatgac
ttttgtaaaa aaaacaaagt 480taacttatac acgtacacgt gtcgaaaata
ttattgacaa tggatagcat gattcttatt 540agagtcatgt aaaagataaa
cacatgcaaa tatatatatg aataatatgt tgttaagata 600aactagacga
ttagaatata tagcacatct atagtttgta aaataactat ttctcaacta
660gacttaagtc ttcgaaatac ataaataaac aaaactataa aaattcagaa
aaaaacatga 720gagtacgtta gtaaaatgta tttttttggt aaaataatca
cttttcatca ggtcttttgt 780aaagcagttt tcatgttaga taaacgagat
tttaattttt tttaaaaaaa gaagtaaact 840aactatgttc ctatctacac
acctataatt ttgaacaatt acaaaacaac aatgaaatgc 900aaagaagacg
tagggcactg tcacactaca atacgattaa taaatgtatt ttggtcgaat
960taataacttt ccatacgata aagttgaatt aacatgtcaa acaaaagaga
tgagtggtcc 1020tatacatagt taggaattag gaacctctaa attaaatgag
tacaaccacc aactactcct 1080tccctctata atctatcgca ttcacaccac
ataacatata cgtacctact ctatataaca 1140ctcactcccc aaactctctt
catcatccat cactacacac atc 1183194812DNAArtificial sequencesynthetic
sequence 194ttattctaag tagcttgact tgtttagttt aaatatgagg ttaatgattt
tgtggggatt 60tgatagttct ggttcttgag tttatttaaa ataggtttac caggatcatg
tactgactct 120gttctttgga acttttcaga attctgcttc ggacattaag
ctcatgagtc atgacttctt 180caatccatga gctttctgat aacattggaa
gtcatgagaa gcaagaacag agagattctc 240atttccaacc accaatccct
tctgcaagaa attatgaatc aattgttaca agtttagtct 300actcagaccc
ggggactaca aattccatgg cacctggaca atatccatat ccagatcctt
360actacagaag catatttgca ccgcctccac aaccgtatac cggggtacat
ctacagttga 420tgggagtgca gcaacaaggc gttcctttac catctgatgc
agtcgaggaa cctgtttttg 480ttaacgcaaa gcaataccac ggtatactaa
ggcgcagaca atcaagagca agacttgagt 540ctcagaataa agtcatcaag
tcacgtaagc cgtatttgca tgaatctcgg catttgcatg 600cgataagacg
accaagagga tgtggcgggc ggtttctaaa tgccaagaag gaggatgagc
660atcacgaaga cagtagtcat gaagaaaaat ccaaccttag cgctggtaaa
tccgccatgg 720ctgcttctag tggtacatct tgagaaggtc ctacaagtag
ctttgttgta ttttggctct 780gtttggtctc agatcatcta tgtcttttag tg
8121951110DNAArtificial sequencesynthetic sequence 195aagctttgag
ctccgcggcc gcaagaccct tcctctatat aaggaagttc atttcatttg 60gagaggacac
gctcgagtat aagagctcat ttttacaaca attaccaaca acaacaaaca
120acaaacaaca ttacaattac atttacaatt accatggaag cgttaacggc
caggcaacaa 180gaggtgtttg atctcatccg tgatcacatc agccagacag
gtatgccgcc gacgcgtgcg 240gaaatcgcgc agcgtttggg gttccgttcc
ccaaacgcgg ctgaagaaca tctgaaggcg 300ctggcacgca aaggcgttat
tgaaattgtt tccggcgcat cacgcgggat tcgtctgttg 360caggaagagg
aagaagggtt gccgctggta ggtcgtgtgg ctgccggtga accacttctg
420gcgcaacagc atattgaagg tcattatcag gtcgatcctt ccttattcaa
gccgaatgct 480gatttcctgc tgcgcgtcag cgggatgtcg atgaaagata
tcggcattat ggatggtgac 540ttgctggcag tgcataaaac tcaggatgta
cgtaacggtc aggtcgttgt cgcacgtatt 600gatgacgaag ttaccgttaa
gcgcctgaaa aaacagggca ataaagtcga actgttgcca 660gaaaatagcg
agtttaaacc aattgtcgta gatcttcgtc agcagagctt caccattgaa
720gggctggcgg ttggggttat tcgcaacggc gactggctgg aattccccaa
ttttaatcaa 780agtgggaata ttgctgatag ctcattgtcc ttcactttca
ctaacagtag caacggtccg 840aacctcataa caactcaaac aaattctcaa
gcgctttcac aaccaattgc ctcctctaac 900gttcatgata acttcatgaa
taatgaaatc acggctagta aaattgatga tggtaataat 960tcaaaaccac
tgtcacctgg ttggacggac caaactgcgt ataacgcgtt tggaatcact
1020acagggatgt ttaataccac tacaatggat gatgtatata actatctatt
cgatgatgaa 1080gataccccac caaacccaaa aaaagagtag
1110196333DNAArtificial sequencesynthetic sequence 196acatatccat
atctaatctt acctcgactg ctgtatataa aaccagtggt tatatgtcca 60gtactgctgt
atataaaacc agtggttata tgtacagtac gtcgatcgat cgacgactgc
120tgtatataaa accagtggtt atatgtacag tactgctgta tataaaacca
gtggttatat 180gtacagtacg tcgaggggat gatcaagacc cttcctctat
ataaggaagt tcatttcatt 240tggagaggac acgctgacaa gctgactcta
gcagatctgg taccgtcgac ggtgagctcc 300gcggccgctc tagacaggcc
tcgtaccgga tcc 3331971098DNAGossypium raimondiiG3883 197aaataataat
aataaacaaa gccagcgccc attacaatgg ccgctgtacc ttatcccatc 60catatctgac
ctttaaaaat atccaccgcc gccaccacca ccacgatcac caccgccaca
120tccccatctc ccgccatttg ttcaccagcc
aatggacagt aaccagcaaa ctcaatccac 180cccataccca cctcagcctc
ccacatccgc cattacccct ccttcatccg ccacagcaac 240cgcgcctcct
ttccaccacc tccttcaaca acaacagcaa cagctccaaa tgttttggtc
300ataccaacgc caagaaatcg agcaagttaa cgattttaag aaccaccaac
tcccattagc 360tcgcattaag aagataatga aagccgacga agacgtccgt
atgatctccg ccgaggctcc 420cattctcttc gccaaagctt gtgagctttt
cattttggaa ctcactatcc gttcttggct 480tcacgccgag gaaaacaagc
gacggacact tcagaaaaac gacatcgctg cggctattac 540gaggaccgac
attttcgatt tcttggtaga tattgtgcct agggatgaga tcaaggatga
600aactggtttg gctccgatgg ttggggctac cgccagtggg gtaccttact
tttatccccc 660tatgggtcaa cctgctgctg gtggtcctgg tgggatgatg
attggccggc ctgccgtcga 720tcccaccgga ggtatttacg gtcagccacc
ttctcaggct tggcagagtg tttggcagac 780ggcgggaact gatgatggct
cgtatggcag tggagttacc ggtggtcaag ggaatcttga 840cggtcaaggc
taacctaaaa tcatgggtcc gatatcgtac agtggatggt gtggaaaacg
900cgtggaactc aggtgatcta ctggggaatt tatgcttttg tgcttattga
tttatgaatg 960cagttgtgtt ggtattgttt atgggaaaaa agaaaagcta
ccttgaattt gatgacactt 1020ctatagtaac ttgttaaaaa aacaaactct
tttaactcat ttttagtgca gctaaaacaa 1080tatcttgctg ccatgcca
1098198233PRTGossypium raimondiiG3883 polypeptide (domain in aa
coordinates 67-132) 198Met Asp Ser Asn Gln Gln Thr Gln Ser Thr Pro
Tyr Pro Pro Gln Pro 1 5 10 15 Pro Thr Ser Ala Ile Thr Pro Pro Ser
Ser Ala Thr Ala Thr Ala Pro 20 25 30 Pro Phe His His Leu Leu Gln
Gln Gln Gln Gln Gln Leu Gln Met Phe 35 40 45 Trp Ser Tyr Gln Arg
Gln Glu Ile Glu Gln Val Asn Asp Phe Lys Asn 50 55 60 His Gln Leu
Pro Leu Ala Arg Ile Lys Lys Ile Met Lys Ala Asp Glu 65 70 75 80 Asp
Val Arg Met Ile Ser Ala Glu Ala Pro Ile Leu Phe Ala Lys Ala 85 90
95 Cys Glu Leu Phe Ile Leu Glu Leu Thr Ile Arg Ser Trp Leu His Ala
100 105 110 Glu Glu Asn Lys Arg Arg Thr Leu Gln Lys Asn Asp Ile Ala
Ala Ala 115 120 125 Ile Thr Arg Thr Asp Ile Phe Asp Phe Leu Val Asp
Ile Val Pro Arg 130 135 140 Asp Glu Ile Lys Asp Glu Thr Gly Leu Ala
Pro Met Val Gly Ala Thr 145 150 155 160 Ala Ser Gly Val Pro Tyr Phe
Tyr Pro Pro Met Gly Gln Pro Ala Ala 165 170 175 Gly Gly Pro Gly Gly
Met Met Ile Gly Arg Pro Ala Val Asp Pro Thr 180 185 190 Gly Gly Ile
Tyr Gly Gln Pro Pro Ser Gln Ala Trp Gln Ser Val Trp 195 200 205 Gln
Thr Ala Gly Thr Asp Asp Gly Ser Tyr Gly Ser Gly Val Thr Gly 210 215
220 Gly Gln Gly Asn Leu Asp Gly Gln Gly 225 230 19966PRTGossypium
raimondiiG3883 conserved domain 199Leu Pro Leu Ala Arg Ile Lys Lys
Ile Met Lys Ala Asp Glu Asp Val 1 5 10 15 Arg Met Ile Ser Ala Glu
Ala Pro Ile Leu Phe Ala Lys Ala Cys Glu 20 25 30 Leu Phe Ile Leu
Glu Leu Thr Ile Arg Ser Trp Leu His Ala Glu Glu 35 40 45 Asn Lys
Arg Arg Thr Leu Gln Lys Asn Asp Ile Ala Ala Ala Ile Thr 50 55 60
Arg Thr 65 200719DNAArtificial sequencesynthetic sequence
200gttcaccagc caatggacag taaccagcaa actcaatcca ccccataccc
acctcagcct 60cccacatccg ccattacccc tccttcatcc gccacagcaa cagcgcctcc
tttccaccac 120ctccttcaac aacaacagca acagctccaa atgttttggt
cataccaacg ccaagaaatc 180gagcaagtta acgattttaa gaaccaccaa
ctcccattag ctcgcattaa gaagataatg 240aaagccgacg aagacgtccg
tatgatctcc gccgaggctc ccattctctt cgccaaagct 300tgtgagcttt
tcattttgga actcactatc cgttcttggc ttcacgccga ggaaaacaag
360cgacggacac ttcagaaaaa cgacatcgct gcggctatta cgaggaccga
cattttcgat 420ttcttggtag atattgtgcc tagggatgag atcaaggatg
aaactgcttt ggctccgatg 480gtcggtgcta ccgccagtgg ggtaccttac
ttttatcccc ctatgggtca acctgctggt 540ggtggtcctg gtgggatgat
gattggccgg cctgccgtcg atcccaccgg agttatttac 600ggtcagccac
cttctcaggc ttggcagagt gtttggcaga cggcggggac tgatgatggc
660tcgtatggca gtggagttac cggtggtcaa gggaatcttg acggtcaagg ctaacctaa
7192011221DNALycopersicon esculentumG3894 201tatcaatcat ctacctcttt
gattgaaccc tagctctcac tctctctttc tctctctaaa 60aaaaaagcat aaaagtttca
atctttcagg taatttgatt ggagaggcgt atccagaatt 120tggagcttca
atcagtgggg agcctaagtt agctttggat tctccaaaag ggaatcttgc
180tcaaaatgga tcaacatgga aatggacagc ctccaggtat tggagtcgtt
actagctcag 240ctccaatata tggtgctcca taccaagcta accaaatggc
agggccctct cctcctgcag 300tttcagctgg tgcaattcaa tctcctcaag
cagctggtct tgctgcttcg tcagctcaga 360tggcgcaaca tcagctcgct
tatcagcaca ttcatcagca gcagcaacaa cagttgcagc 420aacaactcca
gactttctgg gcaaatcaat atcaagaaat cgagcatgtt actgatttca
480agaatcatag cttgccattg gcaaggatca agaaaatcat gaaagcggat
gaagatgtta 540ggatgatatc tgctgaagca ccagtcgtat ttgctcgtgc
ctgtgagatg ttcatacttg 600aattgacact gcgtgcatgg aaccacactg
aggagaacaa aaggaggacg ctgcagaaaa 660atgatatcgc tgcagccata
acaaggactg acatatttga tttcttagtt gacattgtcc 720caagagagga
cttgaaagat gaggtgcttg caacaattcc tagaggaacg cttcctgttg
780gaggcccaac tgagggtctg ccattctatt atggcatgcc accacaatct
gctcaaccga 840ttggagctcc agggatgtac atgggaaagc ctgtcgatca
agctctgtat gcccagcagc 900cccgcccata tatggcacag ccaatttggc
cccagcagca gcaaccaccc tcagattctt 960aagcagctca aagcttagat
tacaggaatc cagaagctag gagcagtgag tagtctgagt 1020agcgaagtac
tggagaacac atagcagtct gcatactttg aactttattt taatatatat
1080cgacatgaag cactagttat taaaagtcga gtgttccctt agtgtagcta
aaactttaca 1140ccatgagctt gatgttcttg gtgatgttta tgaacaaata
tgtttttaag tgtgtgattt 1200aaaagtaaaa aaaaaaaaaa a
1221202258PRTLycopersicon esculentumG3894 polypeptide (domain in aa
coordinates 103-168) 202Met Asp Gln His Gly Asn Gly Gln Pro Pro Gly
Ile Gly Val Val Thr 1 5 10 15 Ser Ser Ala Pro Ile Tyr Gly Ala Pro
Tyr Gln Ala Asn Gln Met Ala 20 25 30 Gly Pro Ser Pro Pro Ala Val
Ser Ala Gly Ala Ile Gln Ser Pro Gln 35 40 45 Ala Ala Gly Leu Ala
Ala Ser Ser Ala Gln Met Ala Gln His Gln Leu 50 55 60 Ala Tyr Gln
His Ile His Gln Gln Gln Gln Gln Gln Leu Gln Gln Gln 65 70 75 80 Leu
Gln Thr Phe Trp Ala Asn Gln Tyr Gln Glu Ile Glu His Val Thr 85 90
95 Asp Phe Lys Asn His Ser Leu Pro Leu Ala Arg Ile Lys Lys Ile Met
100 105 110 Lys Ala Asp Glu Asp Val Arg Met Ile Ser Ala Glu Ala Pro
Val Val 115 120 125 Phe Ala Arg Ala Cys Glu Met Phe Ile Leu Glu Leu
Thr Leu Arg Ala 130 135 140 Trp Asn His Thr Glu Glu Asn Lys Arg Arg
Thr Leu Gln Lys Asn Asp 145 150 155 160 Ile Ala Ala Ala Ile Thr Arg
Thr Asp Ile Phe Asp Phe Leu Val Asp 165 170 175 Ile Val Pro Arg Glu
Asp Leu Lys Asp Glu Val Leu Ala Thr Ile Pro 180 185 190 Arg Gly Thr
Leu Pro Val Gly Gly Pro Thr Glu Gly Leu Pro Phe Tyr 195 200 205 Tyr
Gly Met Pro Pro Gln Ser Ala Gln Pro Ile Gly Ala Pro Gly Met 210 215
220 Tyr Met Gly Lys Pro Val Asp Gln Ala Leu Tyr Ala Gln Gln Pro Arg
225 230 235 240 Pro Tyr Met Ala Gln Pro Ile Trp Pro Gln Gln Gln Gln
Pro Pro Ser 245 250 255 Asp Ser 20366PRTLycopersicon
esculentumG3894 conserved domain 203Leu Pro Leu Ala Arg Ile Lys Lys
Ile Met Lys Ala Asp Glu Asp Val 1 5 10 15 Arg Met Ile Ser Ala Glu
Ala Pro Val Val Phe Ala Arg Ala Cys Glu 20 25 30 Met Phe Ile Leu
Glu Leu Thr Leu Arg Ala Trp Asn His Thr Glu Glu 35 40 45 Asn Lys
Arg Arg Thr Leu Gln Lys Asn Asp Ile Ala Ala Ala Ile Thr 50 55 60
Arg Thr 65 204799DNAArtificial sequencesynthetic sequence
204gctcaaaatg gatcaacatg gaaatggaca gcctccaggt attggagtcg
ttactagctc 60agctccaata tatggtgctc cataccaagc taaccaaatg gcagggccct
ctcctcctgc 120agtttcagct ggtgcaattc aatctcctca agcagctggt
cttgctgctt cgtcagctca 180gatggcgcaa catcagctcg cttatcagca
cattcatcag cagcagcaac aacagttgca 240gcaacaactc cagactttct
gggcaaatca atatcaagaa atcgagcatg ttactgattt 300caagaatcat
agcttgccat tggcaaggat caagaaaatc atgaaagcgg atgaagatgt
360taggatgata tctgctgaag caccagtcgt atttgctcgt gcctgtgaga
tgttcatact 420tgaattgaca ctgcgtgcat ggaaccacac tgaggagaac
aaaaggagga cgctgcagaa 480aaatgatatc gctgcagcca taacaaggac
tgacatattt gatttcttag ttgacattgt 540cccaagagag gacttgaaag
atgaggtgct tgcaacaatt cctagaggaa cgcttcctgt 600tggaggccca
actgagggtc tgccattcta ttatggcatg ccaccacaat ctgctcaacc
660gattggagct ccagggatgt acatgggaaa gcctgtcgat caagctctgt
atgcccagca 720gccccgccca tatatggcac agccaatttg gccccagcag
cagcaaccac cctcagattc 780ttaagcagct caaagctta
7992051109DNAArtificial sequencesynthetic sequence 205ctacacgttt
tgaaaagtta acctgttggt taaatggtta gctatgactc tcgcaacaaa 60cccaaccctt
aagatgatga tggtttaaca tttgacaaca tagttaagac tgtgtctata
120taatagtcaa caaattcaga ttgtagtatt atggagtcaa catatttcga
gatcaaaaac 180attcaaaacg taaatctatc gacgtctcac atagttttgt
tatgaagctg atgaaaaaag 240ttggaagaca tagttttgca aacatcattt
gttgctaacg tataaacgtt ggtttgatta 300aatgtaatag gataaggata
tccgtttgtt catataattg agttaaatta tattttggtt 360attataatat
gttaagttga aaataaatag gtccaacaac cttgtttaaa tagatttttt
420aggagtgatt cccttttaat agtatagatt atactctctt cctaatcgac
cttccgtggg 480gtaaagtggt caattatatt ctttatggat gagcttgatt
gagaatgggt ttatgggtta 540tgacaagggc atgtacaaat gtcactgcct
cttgacatgc aaccgaacag ttggcgactc 600aagtcgcaga agatacaacg
gaccaaaccc tccgagtgtc gccgcgtctg ttatgtgtca 660cctttttgtc
tcctttcctt aaaaattggt aactcatttt tcaaaaaaag aagaggatag
720ttttggctgt atctcctaaa ctattcgatc acaacgccag atattttaat
actggatact 780agtgatgtaa tttgatttgt taattgtcaa aaagtagatt
ctcctatctc gtttttagtt 840caattattat atggttaaat gaatttaagt
cgattagaaa tgattagtta atcaaccaga 900gttgctctat aagtctatac
tgataacatg aaccattttc taaaaatgag atagatacat 960ttgaattttg
tcgtggtttg gagtatgcgg agatagtcgt acgcgcatga acatcatgag
1020acacttgctt cagctcacag agtgacgtgt aaagaccata gacccacgac
ttcatgcaaa 1080cccattccta cgtggcacaa accttcatg
11092061341DNAArtificial sequencesynthetic sequence 206atgacttctt
cagtacatga gctctctgat aacaatgaaa gtcatgcgaa gaaagaacgt 60ccagattccc
aaacccgacc acaggttcct tcaggacgaa gttcggaatc tattgataca
120aactctgtct actcagagcc catggcacat ggattatacc cgtatccaga
tccttactac 180agaagcgtct ttgcacagca agcgtatctt ccacatccct
atcctggggt ccaattgcag 240ttaatgggaa tgcagcagcc aggagttcca
ttgcaatgtg atgcagtcga ggaacctgtt 300tttgttaacg caaagcaata
ccatggtata ctcaggcgca ggcaatcccg ggcaaaactt 360gaggcacgaa
atagagccat caaagcaaaa aagccataca tgcatgaatc tcggcattta
420catgcgataa gacggccaag aggatgtggt ggccggtttc tcaatgccaa
gaaggaaaat 480ggagaccaca aggaggagga ggaggcaacc tctgatgaga
acacttcaga agcaagttcc 540agcctcaggt ccgagaaatt agctatggct
acttctggtc ctaatggtag atctgcggcc 600gctgccgctg cggcagcggc
catggtgagc aagggcgagg agctgttcac cggggtggtg 660cccatcctgg
tcgagctgga cggcgacgta aacggccaca agttcagcgt gtccggcgag
720ggcgagggcg atgccaccta cggcaagctg accctgaagt tcatctgcac
caccggcaag 780ctgcccgtgc cctggcccac cctcgtgacc accttcggct
acggcctgca gtgcttcgcc 840cgctaccccg accacatgaa gcagcacgac
ttcttcaagt ccgccatgcc cgaaggctac 900gtccaggagc gcaccatctt
cttcaaggac gacggcaact acaagacccg cgccgaggtg 960aagttcgagg
gcgacaccct ggataaccga atcgagctga agggcatcga cttcaaggag
1020gacggcaaca tcctggggca caagctggag tacaactaca acagccacaa
cgtctatatc 1080atggccgaca agcagaagaa cggcatcaag gtgaacttca
agatccgcca caacatcgag 1140gacggcagcg tgcagctcgc cgaccactac
cagcagaaca cccccatcgg cgacggcccc 1200gtggtgctgc ccgacaacca
ctacctgagc taccagtccg ccctgagcaa agaccccaac 1260gagaagcgcg
atcacatggt cctgctggag ttcgtgaccg ccgccgggat cactctcggc
1320atggacgagc tgtacaagta a 13412071386DNAArtificial
sequencesynthetic sequence 207ccaaaatcta gggttttctt ctcgcccaat
ttcacttttc ttctacgaaa ttctccattc 60ctgccggctg tcgggttttc tgaatcgatt
ctccttcacc aacttcttct ctggttctgt 120tcgattctga ttttttttca
aggtcaattt tttcttctct ttaaactctg caaaatcgtg 180atcgattaaa
ttcacctcag ggttttttga tttctgaaag aagttaatct tcttcgaagg
240cgattgcaaa agagtgctct gctgtgaatt tccactgaga tgcaatcaaa
accgggaaga 300gaaaacgaag aggaagtcaa taatcaccat gctgttcagc
agccgatgat gtatgcagag 360ccctggtgga aaaacaactc ctttggtgtt
gtacctcaag cgagaccttc tggaattcca 420tcaaattcct cttctttgga
ttgccccaat ggttccgagt caaacgatgt tcattcagca 480tctgaagacg
gtgcgttgaa tggtgaaaac gatggcactt ggaaggattc acaagctgca
540acttcctctc gttcagataa tcacggaatg gaaggaaatg acccagcgct
ctctatccgt 600aacatgcatg atcagccact tgtacaacca ccagagcttg
ttggacacta tatcgcttgt 660gtcccaaacc catatcagga tccatattat
gggggattga tgggagcata tggtcatcag 720caattgggtt ttcgtccata
tcttggaatg cctcgtgaaa gaacagctct gccacttgac 780atggcacaag
agcccgttta tgtgaatgca aagcagtacg agggaattct aaggcgaaga
840aaagcacgtg ccaaggcaga gctagagagg aaagtcatcc gggacagaaa
gccatatctt 900cacgagtcaa gacacaagca tgcaatgaga agggcacgag
cgagtggagg ccggtttgcg 960aagaaaagtg aggtagaagc gggagaggat
gcaggaggga gagacagaga aaggggttca 1020gcaaccaact catcaggctc
tgaacaagtt gagacagact ctaatgagac cctgaattct 1080tctggtgcac
cataataaaa aaagccaaag ctctgagagg agagagagac acacactttg
1140gctaatataa tccattgcct caaaccggca aatcattctt ggctttttcg
tttttgtgtt 1200tgctagttgt tcttgtcaga gtctcatatt gtgtgggttt
aacagttatg atgaatgtac 1260aaagagcgag ttatgttagg tgttagattt
tggagacaag agacaaagga atagcaagta 1320ggtcttgttt ttattctttg
accttttttt tctcttttgc aaaattgaaa aatacgtttg 1380cttaaa
13862084361DNAArtificial sequencesynthetic sequence 208agaatgtagc
aatacaaata tatgacggta ccgttatcca tcaccattat atgtatatat 60gtataatttg
ataaatattc actttgtgtt tcgtcgtttg cttaataaac agctcatttc
120catggtattg agtcttctat atgcgagaga atcagattcc cgctgggata
acaaaagaac 180aaggtactga aaaaaataga caaaactttt ttttaaatta
tataagctat aaaagaaaag 240agtatagaga gagattagcc ctactgttta
agagggagag agtagggtca ttagggcttt 300agagagagaa gacattcgga
ctgtccccac ttgcttttct gtagaataac attatttaaa 360tcttattttt
aattaaatat tacaactaaa agaagaaacc aacttttaaa ataaatgcag
420attatatgct ctgacttgga ctaaataaaa cttgcaagta acagtttcaa
gtccttttgt 480tttagaactt tttctttcgt agaagtgata aatgattgcc
ctagacctga tagattctct 540aaaattctac gtattacagc ataagttacc
tcctttattt gactattaga ccatccatat 600tggtgggctt ttagcaaatg
ttcttaacaa taattttata atttatttta atgttaagag 660gtttgataat
tttttttttt taagagtgta ttttgtttat taaaatgtgt tttgtttctt
720atataagaac caaatcttaa ctattttacc aattaaacat taaatttaaa
ttttaatatc 780tctaagaatt atattaagag ccaatataga tgcttttaaa
accattggtt gaataaataa 840atctaacctt cttaattatt tctgtgtgaa
tattttctaa attttcattt taatttagca 900caatataatc catgttctaa
aaagaacaat taacataata tttacaaacc taaaaagatt 960ataaaacaca
attttatttt ttacagctta taatgtttta aagttcaggt ttatttttta
1020aaagttcagg tttattacat taggtttgac ttgtaatcat catttatcac
aacgatcaaa 1080ctattattac aatcacaata gtagacaaaa tttaggatat
atatatatat atataattat 1140gtataaacta tgaacattta aagtgagatt
tttcaaaata atatataaat tcaaatagaa 1200atagactatt tggttcttaa
atgagagacc cccgaaaaaa tctttttttt tttctcatca 1260agctgtttac
atttttagat ataaaatcat attctttata gtttagaata tgaattaaat
1320agttttatat gttattaact tatcataaga tatgcgtgag gttggccaaa
aactcatcaa 1380ttaaccaaat aagaaaagta aaattgtatt ttgctttgct
aaaaatgtaa atatttcatt 1440gaaaaatgaa aaaggtttag gtaatacaat
taagtaaatc ctacaatttt ggttccatgg 1500caaaagaata aaattgtatt
gctttggtaa aagttgatcc aactaatata ttcagtagaa 1560actgcaaaac
tgaagaaata agtttgttta gtagaattgc tttcggttat gtaatgaata
1620tacatccaaa atggcttttt agtaatgatg tcttttcata ctctttccaa
tccctactac 1680tttcagatta tttgtcctac tattatagag atatacgttc
gttttcaata atatgaaaag 1740tgatatatat ttaaatagtg tgatatatat
ataagttttg caagtgcatc acttcccaaa 1800atcgcataaa tcattaatca
tattgtcgaa aacagtataa taacttctta aacgaaaacg 1860cagcgcaatt
aaaaataaca actagagata attgacaaaa cattgattaa tatttaccta
1920taagttaatt attgtattta aaatttattt aaagttcata aggaaaacat
atgcaaaaat 1980atttatatct aatattttgc tatgttatcc tttttttttt
ttacgttatc ctaattttgt 2040ttatcctaat ttgttgtggt taaaatctta
ttattgataa aaagagaact tttttttttg 2100tcatcataaa aaagagaact
tattacttcg attttaaaat tctatgagcg taggagacaa 2160agaaaaaaaa
aataaaaaaa aaaagaagag aaaaatcact tcttttcttc tttttagtcc
2220agatccaaca tattttggat aactaaatga agatttttta aaaaaatata
ttttagggta 2280tatataaatc ataatttgaa gcaaatgaaa taaaatccag
tttggtaata tataaatatg 2340atttgatggg ttccttgtaa tctctctcta
tctattagtt tctcagttat cttttctttg 2400ccagaaatgg cagtgaaggc
agtggctgag gagagagttt tttttcttct ttcatgggga 2460aagtaaaact
ttgccttgaa gatttctctc ttcaatattt ttctaagact tttgatttca
2520acgaatcact gtccttaacc taaaagcaag aaaaattagc tttatactgg
tctttacttt 2580tttttaacat atttattttt atatagttta cttataaaca
tagacatacg agtatgggaa 2640tatatagtat
atccaacttc taaataatat ttcgaatagt gataacaaaa ttagcaatac
2700atacggctag tgaaatgttg atcgaataaa cggcactgat gtaatgtact
tatcaatttt 2760gataatttta attgtattgt ttttcttttt ttcccacagt
attgaactag acaattaaat 2820ttaaagtaaa attatacatt tctttcgttg
tgtattaaag taacatgcat aatatcattt 2880tccttcgtac aatcctccaa
attgacaatt gatgaattac tttgtcaatc gtaaatgaat 2940ttttctcaag
tctgtatact attttcaggg ataaacaggt acaggtgtcc catgcttatt
3000ctcttgatag taacatgtgt cctatgttga gtcaattcta cgttcgaaga
agtgctaaca 3060attgttaata gcctcgtata ttattctaat taaaatgcct
cgatagattt ggttagtggt 3120ctgaatgtga ttggttattt tttcaagtgg
caagaggtct accatctaat attacaatca 3180atcgaccaaa aaggtcgaga
acatgataat ggtggcaaat acaaatggtt cattgttgtc 3240taatataaca
agccatcagt tgtcactttt taaaaacaat acagaataca agatactttt
3300tttttaaggt aaaatgtgtg tttaatattt tcgtttatat aacaaataaa
cagttacatg 3360ttttactcta tgattatatt tatgacattt ttcttcttct
taacaacatt tttttcccat 3420aagaacattt acaatagtat taaaactttg
attgcaatca aatgttagat cacttattat 3480aaaattacta agactgctat
cttttcctat tgacaaaagc gaatccaata tatgttactg 3540aaacaaatgc
gtaaattata ctatatggag atctatcggt taattattga gagaatctaa
3600gaaagttttt gagtacaaca gtcctaataa tatcttcaca taccatataa
tatacatata 3660tacatataca caaatgtact ttttaaacca acatcagcat
acgtatatcc catcaggaaa 3720cttagacttt tgggaattca tggtatgaaa
accaaaacca aatgacaaca ttcgatttga 3780tactcccgac ccatggtaaa
gaaataacaa attccaatat atctttcact ggactttccg 3840aggcacattc
cggttttctc catttcaaga aattgtcaaa aataaattga gatccggttt
3900attacctcaa aaaagaagaa gagaaattac aacattaatt tccgaaaagg
cataaatgag 3960aaatcatatt tcagcagaag aacacaaaag agttaagaac
ccacagatca cacaacctct 4020gtccatgtct gctttttaca cttttttaaa
ataagtttct cctaaaaagt tatttcctat 4080ttataataat ttccttagat
ttatcttcct ggtctctctt ctgctgcttc cctctccccc 4140ataactatca
ctatttagaa ttttcaatgt ggaaaaggaa gctgattgtt gaagcataaa
4200tcccgggaga ccacttttgc attttcaaat aattaaatta aaccatagat
acacacacac 4260agttacttac tcttttaggg tttcccaata aatttatagt
actttaatgt gtttcatgat 4320attgatgata aatgctagct gtatttacaa
tgggggctcc t 43612091243DNAZea maysG4259 209aaaaaaagaa gcttgccatt
tcgctcaggg ccctgcaacg cgcggcagcg cgccacgcgc 60cgagcttggc ttgggactgg
gccgcccggc cgcgaggaat aaactcactc ctgccttcat 120acgtatccaa
atagccgcgg cagtacgtgt atgtggttag ctatacgcga cctcagctcg
180ggcgcaagct acaacgccga ccaggcgaga agaagcatcg atagtgtgac
gagctaaccc 240accagcagca acgtaatcca aatccatgga caaccagccg
ctgccctact ccacaggcca 300gccccctgcc cccggaggag ccccggtggc
gggcatgcct ggcgcggccg gcctcccacc 360cgtgccgcac caccacctgc
tccagcagca gcaggcccag ctgcaggcgt tctgggcgta 420ccagcgccag
gaggcggagc gcgcgtccgc gtcggacttc aagaaccacc agctgcctct
480ggcccggatc aagaagatca tgaaggccga cgaggacgtg cgcatgatct
ccgccgaggc 540gcccgtgctg ttcgccaagg cctgcgagct cttcatcctc
gagctcacta tccgctcctg 600gctccacgcc gaggagaaca agcgccgcac
cctgcagcgc aacgacgtcg ccgcggccat 660cgcgcgcacc gacgtcttcg
atttcctcgt agacatcgtg ccccgcgagg aggccaagga 720ggagcccggc
agcgccctcg gcttcgcggc gcctgggacc ggcgtcgtcg gggctggcgc
780cccgggcggg gcgccagccg ccgggatgcc ctactactat ccgccgatgg
ggcagccggc 840gccgatgatg ccggcctggc atgttccggc ctgggacccg
gcctggcagc aaggggcagc 900ggatgtcgat cagagcggca gcttcagcga
ggaaggacaa gggtttggag caggccatgg 960cggcgccgct agcttccctc
ctgcgcctcc gacctccgag tgatcgatcg gcgcgtctct 1020tggtcctggc
ctcctggctt agctacatgt gcatgatgtc aatcgttcaa tgtgccatgc
1080tgtgtatatt ctacagcaaa cgtggtaatg gagctgctat gcatacagaa
cgaataaggc 1140gtgacgtgtg agaccgtaag agtacgtagt actaatatgt
agatgcacgt gacgtgccaa 1200ttaatcaaag attaacatgc agttaattaa
ttagatcctc cct 1243210245PRTZea maysG4259 polypeptide (domain in aa
coordinates 70-135) 210Met Asp Asn Gln Pro Leu Pro Tyr Ser Thr Gly
Gln Pro Pro Ala Pro 1 5 10 15 Gly Gly Ala Pro Val Ala Gly Met Pro
Gly Ala Ala Gly Leu Pro Pro 20 25 30 Val Pro His His His Leu Leu
Gln Gln Gln Gln Ala Gln Leu Gln Ala 35 40 45 Phe Trp Ala Tyr Gln
Arg Gln Glu Ala Glu Arg Ala Ser Ala Ser Asp 50 55 60 Phe Lys Asn
His Gln Leu Pro Leu Ala Arg Ile Lys Lys Ile Met Lys 65 70 75 80 Ala
Asp Glu Asp Val Arg Met Ile Ser Ala Glu Ala Pro Val Leu Phe 85 90
95 Ala Lys Ala Cys Glu Leu Phe Ile Leu Glu Leu Thr Ile Arg Ser Trp
100 105 110 Leu His Ala Glu Glu Asn Lys Arg Arg Thr Leu Gln Arg Asn
Asp Val 115 120 125 Ala Ala Ala Ile Ala Arg Thr Asp Val Phe Asp Phe
Leu Val Asp Ile 130 135 140 Val Pro Arg Glu Glu Ala Lys Glu Glu Pro
Gly Ser Ala Leu Gly Phe 145 150 155 160 Ala Ala Pro Gly Thr Gly Val
Val Gly Ala Gly Ala Pro Gly Gly Ala 165 170 175 Pro Ala Ala Gly Met
Pro Tyr Tyr Tyr Pro Pro Met Gly Gln Pro Ala 180 185 190 Pro Met Met
Pro Ala Trp His Val Pro Ala Trp Asp Pro Ala Trp Gln 195 200 205 Gln
Gly Ala Ala Asp Val Asp Gln Ser Gly Ser Phe Ser Glu Glu Gly 210 215
220 Gln Gly Phe Gly Ala Gly His Gly Gly Ala Ala Ser Phe Pro Pro Ala
225 230 235 240 Pro Pro Thr Ser Glu 245 21166PRTZea maysG4259
conserved domain 211Leu Pro Leu Ala Arg Ile Lys Lys Ile Met Lys Ala
Asp Glu Asp Val 1 5 10 15 Arg Met Ile Ser Ala Glu Ala Pro Val Leu
Phe Ala Lys Ala Cys Glu 20 25 30 Leu Phe Ile Leu Glu Leu Thr Ile
Arg Ser Trp Leu His Ala Glu Glu 35 40 45 Asn Lys Arg Arg Thr Leu
Gln Arg Asn Asp Val Ala Ala Ala Ile Ala 50 55 60 Arg Thr 65
2121505DNAZea maysG4261 212gcgcgaggga gagacagagt gaggaaacga
gggaaggaga cgacgcgctc gcctattggc 60cgccggctcc gctccttcgc gcccagtgcg
acggccacgg cctgagcggc gctgccagca 120aggcggctag tatgagcagc
atggagtcgc ggccgggccg aacgaacctg gtggagccca 180tagggcacgg
cgccgcgctg ccgtccggcg gccaggcagt gcagccgtgg tggacgagct
240ccggggctgt gctcggtgca gtctcgccag ccgtcgtggc ggtggcgccc
gggagcggga 300cggggattag cctgtcgagc agcccggcag gtggtagtgg
tggtggcggc gcggctaaag 360gagccgcgag tgacgagagc agcgaggatt
cacggagatc tggggaacca aaagatggaa 420gcgctagtca agaaaagaac
catgccacat cgcagatacc cgctctggcg ccagagtatt 480tggcaccata
ctcgcagctg gaactgaacc aatcaattgc ttctgcagca tatcagtacc
540cagatcctta ctatgcaggc atggttgctc cctatggaag tcatgctgtg
gctcattttc 600agctacctgg actaactcaa tctcgaatgc cattacctct
tgaagtatcc gaggagcctg 660tttatgtaaa tgccaagcag taccatggta
tcttaagacg acggcagtcc cgtgctaagg 720ctgaacttga gaaaaaggtg
gtcaaagcca gaaagccata ccttcacgag tctcgtcatc 780agcacgcgat
gaggagggca agaggaaacg ggggacgctt cctgaacaca aagaaaagtg
840acagtggtgc tcccaatgga ggcgaaaacg ccgagcatct ccatgtccct
cccgacttac 900tacagctacg acagaacgag gcttgaagta gcggtatggc
tctggcatcc ttgaacagca 960gttcctgtcc acgggcgtag gcattcgaga
ccggattcat atagctctcc acagcatacg 1020cgcagccatc tctgcggtaa
cgcacgttct cctgaacgag ctttgtagcg agataggtat 1080gcaagtgcaa
tctgggcgca ggaatccatc atcaagtgcc caatgcccat ggggtaggta
1140cgctgtttca ggcaattcat tcttggcttt cacgttccac ccttgtgtaa
ctggtgtgtt 1200gtaaatgtgt ggaaaactaa gcttgtgctc tgtatcgggc
cgttcagcgg aactgcaaaa 1260cgcctgtata attaagatcg aactttggat
taactcggta atgctttgtc tggttttctt 1320ttagcttttc aactgtaaca
cggccacagc tgattcatgt gatgtgcttg ctaatattta 1380aataaacacc
ttgacccggc cgggcgcgga gtaattttat attttttata ttcgggagtc
1440cacacaatcg tgtaaggttc ctgcgaacca gttctgactt taattgaacc
gcccggattc 1500atatg 1505213264PRTZea maysG4261 polypeptide (domain
in aa coordinates 175-231) 213Met Ser Ser Met Glu Ser Arg Pro Gly
Arg Thr Asn Leu Val Glu Pro 1 5 10 15 Ile Gly His Gly Ala Ala Leu
Pro Ser Gly Gly Gln Ala Val Gln Pro 20 25 30 Trp Trp Thr Ser Ser
Gly Ala Val Leu Gly Ala Val Ser Pro Ala Val 35 40 45 Val Ala Val
Ala Pro Gly Ser Gly Thr Gly Ile Ser Leu Ser Ser Ser 50 55 60 Pro
Ala Gly Gly Ser Gly Gly Gly Gly Ala Ala Lys Gly Ala Ala Ser 65 70
75 80 Asp Glu Ser Ser Glu Asp Ser Arg Arg Ser Gly Glu Pro Lys Asp
Gly 85 90 95 Ser Ala Ser Gln Glu Lys Asn His Ala Thr Ser Gln Ile
Pro Ala Leu 100 105 110 Ala Pro Glu Tyr Leu Ala Pro Tyr Ser Gln Leu
Glu Leu Asn Gln Ser 115 120 125 Ile Ala Ser Ala Ala Tyr Gln Tyr Pro
Asp Pro Tyr Tyr Ala Gly Met 130 135 140 Val Ala Pro Tyr Gly Ser His
Ala Val Ala His Phe Gln Leu Pro Gly 145 150 155 160 Leu Thr Gln Ser
Arg Met Pro Leu Pro Leu Glu Val Ser Glu Glu Pro 165 170 175 Val Tyr
Val Asn Ala Lys Gln Tyr His Gly Ile Leu Arg Arg Arg Gln 180 185 190
Ser Arg Ala Lys Ala Glu Leu Glu Lys Lys Val Val Lys Ala Arg Lys 195
200 205 Pro Tyr Leu His Glu Ser Arg His Gln His Ala Met Arg Arg Ala
Arg 210 215 220 Gly Asn Gly Gly Arg Phe Leu Asn Thr Lys Lys Ser Asp
Ser Gly Ala 225 230 235 240 Pro Asn Gly Gly Glu Asn Ala Glu His Leu
His Val Pro Pro Asp Leu 245 250 255 Leu Gln Leu Arg Gln Asn Glu Ala
260 21457PRTZea maysG4261 conserved domain 214Glu Pro Val Tyr Val
Asn Ala Lys Gln Tyr His Gly Ile Leu Arg Arg 1 5 10 15 Arg Gln Ser
Arg Ala Lys Ala Glu Leu Glu Lys Lys Val Val Lys Ala 20 25 30 Arg
Lys Pro Tyr Leu His Glu Ser Arg His Gln His Ala Met Arg Arg 35 40
45 Ala Arg Gly Asn Gly Gly Arg Phe Leu 50 55 21516PRTArtificial
sequencederived from wheat, rye, and tomato 215Pro Lys Xaa Pro Ala
Gly Arg Xaa Lys Phe Xaa Glu Thr Arg His Pro 1 5 10 15
2165PRTarabidopsis thaliana 216Asp Ser Ala Trp Arg 1 5
uspto.report is an independent third-party trademark research tool that is not affiliated, endorsed, or sponsored by the United States Patent and Trademark Office (USPTO) or any other governmental organization. The information provided by uspto.report is based on publicly available data at the time of writing and is intended for informational purposes only.
While we strive to provide accurate and up-to-date information, we do not guarantee the accuracy, completeness, reliability, or suitability of the information displayed on this site. The use of this site is at your own risk. Any reliance you place on such information is therefore strictly at your own risk.
All official trademark data, including owner information, should be verified by visiting the official USPTO website at www.uspto.gov. This site is not intended to replace professional legal advice and should not be used as a substitute for consulting with a legal professional who is knowledgeable about trademark law.