U.S. patent application number 15/148399 was filed with the patent office on 2016-10-20 for insecticidal proteins. This patent application is currently assigned to Syngenta Participations AG. The applicant listed for this patent is Syngenta Participations AG. Invention is credited to Jeng Shong CHEN, Cheryl M. DEFONTES, Hope Hart.
Application Number | 20160304568 15/148399 |
Document ID | / |
Family ID | 50340330 |
Filed Date | 2016-10-20 |
United States Patent Application | 20160304568 |
Kind Code | A1 |
Hart; Hope ; et al. | October 20, 2016 |
Compositions and methods for controlling plant pests are disclosed. In particular, novel engineered hybrid insecticidal proteins (eHIPs) having toxicity to at least corn rootworm are provided. By fusing unique combinations of complete or partial variable regions and conserved blocks of at least two different Bacillus thuringiensis (Bt) Cry proteins or a modified Cry proteins an eHIP having activity against corn rootworm is designed. Nucleic acid molecules encoding the novel eHIPs are also provided. Methods of making the eHIPs and methods of using the eHIPs and nucleic acids encoding the eHIPs of the invention, for example in transgenic plants to confer protection from insect damage are also disclosed.
Inventors: | Hart; Hope; (Durham, NC) ; CHEN; Jeng Shong; (Durham, NC) ; DEFONTES; Cheryl M.; (Riehen, CH) | ||||||||||
Applicant: |
|
||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|
Assignee: | Syngenta Participations AG Basel CH |
||||||||||
Family ID: | 50340330 | ||||||||||
Appl. No.: | 15/148399 | ||||||||||
Filed: | May 6, 2016 |
Application Number | Filing Date | Patent Number | ||
---|---|---|---|---|
13623921 | Sep 21, 2012 | |||
15148399 | ||||
12529246 | Sep 25, 2009 | 8309516 | ||
PCT/US08/58182 | Mar 26, 2008 | |||
13623921 | ||||
60920493 | Mar 28, 2007 | |||
Current U.S. Class: | 1/1 |
Current CPC Class: | Y02A 40/162 20180101; C12N 15/8286 20130101; C07K 14/325 20130101; C07K 2319/55 20130101; A01N 63/10 20200101; Y02A 40/146 20180101 |
International Class: | C07K 14/325 20060101 C07K014/325; C12N 15/82 20060101 C12N015/82; A01N 63/02 20060101 A01N063/02 |
Sequence CWU 1
1
16012001DNAArtificial Sequence2OL-8a coding sequence 1atggctagca
tgactggtgg acagcaaatg ggtcgcggat ccactagtaa cggccgccag 60tgtgctggaa
ttcgccctta tgacggccga caacaacacc gaggcctgga cagcagcacc
120accaaggacg tgatccagaa gggcatcagc gtggtgggcg acctgctggg
cgtggtgggc 180ttccccttcg gcggcgccct ggtgagcttc tacaccaact
tcctgaacac catctggccc 240agcgaggacc cctggaaggc cttcatggag
caggtggagg ccctgatgga ccagaagatc 300gccgactacg ccaagaacaa
ggcactggcc gagctacagg gcctccagaa caacgtggag 360gactatgtga
gcgccctgag cagctggcag aagaaccccg ctgcaccgtt ccgcaacccc
420cacagccagg gccgcatccg cgagctgttc agccaggccg agagccactt
ccgcaacagc 480atgcccagct tcgccatcag cggctacgag gtgctgttcc
tgaccaccta cgcccaggcc 540gccaacaccc acctgttcct gctgaaggac
gcccaaatct acggagagga gtggggctac 600gagaaggagg acatcgccga
gttctacaag cgccagctga agctgaccca ggagtacacc 660gaccactgcg
tgaagtggta caacgtgggt ctagacaagc tccgcggcag cagctacgag
720agctgggtga acttcaaccg ctaccgccgc gagatgaccc tgaccgtgct
ggacctgatc 780gccctgttcc ccctgtacga cgtgcgcctg taccccaagg
aggtgaagac cgagctgacc 840cgcgacgtgc tgaccgaccc catcgtgggc
gtgaacaacc tgcgcggcta cggcaccacc 900ttcagcaaca tcgagaacta
catccgcaag ccccacctgt tcgactacct gcaccgcatc 960cagttccaca
cgcgtttcca gcccggctac tacggcaacg acagcttcaa ctactggagc
1020ggcaactacg tgagcacccg ccccagcatc ggcagcaacg acatcatcac
cagccccttc 1080tacggcaaca agagcagcga gcccgtgcag aaccttgagt
tcaacggcga gaaggtgtac 1140cgcgccgtgg ctaacaccaa cctggccgtg
tggccctctg cagtgtacag cggcgtgacc 1200aaggtggagt tcagccagta
caacgaccag accgacgagg ccagcaccca gacctacgac 1260agcaagcgca
acgtgggcgc cgtgagctgg gacagcatcg accagctgcc ccccgagacc
1320accgacgagc ccctggagaa gggctacagc caccagctga actacgtgat
gtgcttcctg 1380atgcagggca gccgcggcac catccccgtg ctgacctgga
cccacaagag cgtcgacttc 1440ttcaacatga tcgacagcaa gaagatcacc
cagctgcccc tgaccaagag caccaacctg 1500ggcagcggca ccagcgtggt
gaagggcccc ggcttcaccg gcggcgacat cctgcgccgc 1560accagccccg
gccagatcag caccctgcgc gtgaacatca ccgcccccct gagccagcgc
1620taccgcgtcc gcatccgcta cgccagcacc accaacctgc agttccacac
cagcatcgac 1680ggccgcccca tcaaccaggg caacttcagc gccaccatga
gcagcggcag caacctgcag 1740agcggcagct tccgcaccgt gggcttcacc
acccccttca acttcagcaa cggcagcagc 1800gtgttcaccc tgagcgccca
cgtgttcaac agcggcaacg aggtgtacat cgaccgcatc 1860gagttcgtgc
ccgccgaggt gaccttcgag gccgagtacg acctggagag ggctcagaag
1920gccgtgaacg agctgttcac cagcagcaac cagatcggcc tgaagaccga
cgtgaccgac 1980taccacatcg atcaggtgta g 20012668PRTArtificial
Sequence2OL-8a protein 2Met Ala Ser Met Thr Gly Gly Gln Gln Met Gly
Arg Gly Ser Gly Ser 1 5 10 15 Thr Ser Asn Gly Arg Gln Cys Ala Gly
Ile Arg Pro Tyr Asp Gly Arg 20 25 30 Gln Gln His Arg Gly Leu Asp
Ser Ser Thr Thr Lys Asp Val Ile Gln 35 40 45 Lys Gly Ile Ser Val
Val Gly Asp Leu Leu Gly Val Val Gly Phe Pro 50 55 60 Phe Gly Gly
Ala Leu Val Ser Phe Tyr Thr Asn Phe Leu Asn Thr Ile 65 70 75 80 Trp
Pro Ser Glu Asp Pro Trp Lys Ala Phe Met Glu Gln Val Glu Ala 85 90
95 Leu Met Asp Gln Lys Ile Ala Asp Tyr Ala Lys Asn Lys Ala Leu Ala
100 105 110 Glu Leu Gln Gly Leu Gln Asn Asn Val Glu Asp Tyr Val Ser
Ala Leu 115 120 125 Ser Ser Trp Gln Lys Asn Pro Ala Ala Pro Phe Arg
Asn Pro His Ser 130 135 140 Gln Gly Arg Ile Arg Glu Leu Phe Ser Gln
Ala Glu Ser His Phe Arg 145 150 155 160 Asn Ser Met Pro Ser Phe Ala
Ile Ser Gly Tyr Glu Val Leu Phe Leu 165 170 175 Thr Thr Tyr Ala Gln
Ala Ala Asn Thr His Leu Phe Leu Leu Lys Asp 180 185 190 Ala Gln Ile
Tyr Gly Glu Glu Trp Gly Tyr Glu Lys Glu Asp Ile Ala 195 200 205 Glu
Phe Tyr Lys Arg Gln Leu Lys Leu Thr Gln Glu Tyr Thr Asp His 210 215
220 Cys Val Lys Trp Tyr Asn Val Gly Leu Asp Lys Leu Arg Gly Ser Ser
225 230 235 240 Tyr Glu Ser Trp Val Asn Phe Asn Arg Tyr Arg Arg Glu
Met Thr Leu 245 250 255 Thr Val Leu Asp Leu Ile Ala Leu Phe Pro Leu
Tyr Asp Val Arg Leu 260 265 270 Tyr Pro Lys Glu Val Lys Thr Glu Leu
Thr Arg Asp Val Leu Thr Asp 275 280 285 Pro Ile Val Gly Val Asn Asn
Leu Arg Gly Tyr Gly Thr Thr Phe Ser 290 295 300 Asn Ile Glu Asn Tyr
Ile Arg Lys Pro His Leu Phe Asp Tyr Leu His 305 310 315 320 Arg Ile
Gln Phe His Thr Arg Phe Gln Pro Gly Tyr Tyr Gly Asn Asp 325 330 335
Ser Phe Asn Tyr Trp Ser Gly Asn Tyr Val Ser Thr Arg Pro Ser Ile 340
345 350 Gly Ser Asn Asp Ile Ile Thr Ser Pro Phe Tyr Gly Asn Lys Ser
Ser 355 360 365 Glu Pro Val Gln Asn Leu Glu Phe Asn Gly Glu Lys Val
Tyr Arg Ala 370 375 380 Val Ala Asn Thr Asn Leu Ala Val Trp Pro Ser
Ala Val Tyr Ser Gly 385 390 395 400 Val Thr Lys Val Glu Phe Ser Gln
Tyr Asn Asp Gln Thr Asp Glu Ala 405 410 415 Ser Thr Gln Thr Tyr Asp
Ser Lys Arg Asn Val Gly Ala Val Ser Trp 420 425 430 Asp Ser Ile Asp
Gln Leu Pro Pro Glu Thr Thr Asp Glu Pro Leu Glu 435 440 445 Lys Gly
Tyr Ser His Gln Leu Asn Tyr Val Met Cys Phe Leu Met Gln 450 455 460
Gly Ser Arg Gly Thr Ile Pro Val Leu Thr Trp Thr His Lys Ser Val 465
470 475 480 Asp Phe Phe Asn Met Ile Asp Ser Lys Lys Ile Thr Gln Leu
Pro Leu 485 490 495 Thr Lys Ser Thr Asn Leu Gly Ser Gly Thr Ser Val
Val Lys Gly Pro 500 505 510 Gly Phe Thr Gly Gly Asp Ile Leu Arg Arg
Thr Ser Pro Gly Gln Ile 515 520 525 Ser Thr Leu Arg Val Asn Ile Thr
Ala Pro Leu Ser Gln Arg Tyr Arg 530 535 540 Val Arg Ile Arg Tyr Ala
Ser Thr Thr Asn Leu Gln Phe His Thr Ser 545 550 555 560 Ile Asp Gly
Arg Pro Ile Asn Gln Gly Asn Phe Ser Ala Thr Met Ser 565 570 575 Ser
Gly Ser Asn Leu Gln Ser Gly Ser Phe Arg Thr Val Gly Phe Thr 580 585
590 Thr Pro Phe Asn Phe Ser Asn Gly Ser Ser Val Phe Thr Leu Ser Ala
595 600 605 His Val Phe Asn Ser Gly Asn Glu Val Tyr Ile Asp Arg Ile
Glu Phe 610 615 620 Val Pro Ala Glu Val Thr Phe Glu Ala Glu Tyr Asp
Leu Glu Arg Ala 625 630 635 640 Gln Lys Ala Val Asn Glu Leu Phe Thr
Ser Ser Asn Gln Ile Gly Leu 645 650 655 Lys Thr Asp Val Thr Asp Tyr
His Ile Asp Gln Val 660 665 31962DNAArtificial SequenceFR8a coding
sequence 3atgactagta acggccgcca gtgtgctggt attcgccctt atgacggccg
acaacaacac 60cgaggcctgg acagcagcac caccaaggac gtgatccaga agggcatcag
cgtggtgggc 120gacctgctgg gcgtggtggg cttccccttc ggcggcgccc
tggtgagctt ctacaccaac 180ttcctgaaca ccatctggcc cagcgaggac
ccctggaagg ccttcatgga gcaggtggag 240gccctgatgg accagaagat
cgccgactac gccaagaaca aggcactggc cgagctacag 300ggcctccaga
acaacgtgga ggactatgtg agcgccctga gcagctggca gaagaacccc
360gctgcaccgt tccgcaaccc ccacagccag ggccgcatcc gcgagctgtt
cagccaggcc 420gagagccact tccgcaacag catgcccagc ttcgccatca
gcggctacga ggtgctgttc 480ctgaccacct acgcccaggc cgccaacacc
cacctgttcc tgctgaagga cgcccaaatc 540tacggagagg agtggggcta
cgagaaggag gacatcgccg agttctacaa gcgccagctg 600aagctgaccc
aggagtacac cgaccactgc gtgaagtggt acaacgtggg tctagacaag
660ctccgcggca gcagctacga gagctgggtg aacttcaacc gctaccgccg
cgagatgacc 720ctgaccgtgc tggacctgat cgccctgttc cccctgtacg
acgtgcgcct gtaccccaag 780gaggtgaaga ccgagctgac ccgcgacgtg
ctgaccgacc ccatcgtggg cgtgaacaac 840ctgcgcggct acggcaccac
cttcagcaac atcgagaact acatccgcaa gccccacctg 900ttcgactacc
tgcaccgcat ccagttccac acgcgtttcc agcccggcta ctacggcaac
960gacagcttca actactggag cggcaactac gtgagcaccc gccccagcat
cggcagcaac 1020gacatcatca ccagcccctt ctacggcaac aagagcagcg
agcccgtgca gaaccttgag 1080ttcaacggcg agaaggtgta ccgcgccgtg
gctaacacca acctggccgt gtggccctct 1140gcagtgtaca gcggcgtgac
caaggtggag ttcagccagt acaacgacca gaccgacgag 1200gccagcaccc
agacctacga cagcaagcgc aacgtgggcg ccgtgagctg ggacagcatc
1260gaccagctgc cccccgagac caccgacgag cccctggaga agggctacag
ccaccagctg 1320aactacgtga tgtgcttcct gatgcagggc agccgcggca
ccatccccgt gctgacctgg 1380acccacaaga gcgtcgactt cttcaacatg
atcgacagca agaagatcac ccagctgccc 1440ctgaccaaga gcaccaacct
gggcagcggc accagcgtgg tgaagggccc cggcttcacc 1500ggcggcgaca
tcctgcgccg caccagcccc ggccagatca gcaccctgcg cgtgaacatc
1560accgcccccc tgagccagcg ctaccgcgtc cgcatccgct acgccagcac
caccaacctg 1620cagttccaca ccagcatcga cggccgcccc atcaaccagg
gcaacttcag cgccaccatg 1680agcagcggca gcaacctgca gagcggcagc
ttccgcaccg tgggcttcac cacccccttc 1740aacttcagca acggcagcag
cgtgttcacc ctgagcgccc acgtgttcaa cagcggcaac 1800gaggtgtaca
tcgaccgcat cgagttcgtg cccgccgagg tgaccttcga ggccgagtac
1860gacctggaga gggctcagaa ggccgtgaac gagctgttca ccagcagcaa
ccagatcggc 1920ctgaagaccg acgtgaccga ctaccacatc gatcaggtgt ag
19624653PRTArtificial SequenceFR8a protein 4Met Thr Ser Asn Gly Arg
Gln Cys Ala Gly Ile Arg Pro Tyr Asp Gly 1 5 10 15 Arg Gln Gln His
Arg Gly Leu Asp Ser Ser Thr Thr Lys Asp Val Ile 20 25 30 Gln Lys
Gly Ile Ser Val Val Gly Asp Leu Leu Gly Val Val Gly Phe 35 40 45
Pro Phe Gly Gly Ala Leu Val Ser Phe Tyr Thr Asn Phe Leu Asn Thr 50
55 60 Ile Trp Pro Ser Glu Asp Pro Trp Lys Ala Phe Met Glu Gln Val
Glu 65 70 75 80 Ala Leu Met Asp Gln Lys Ile Ala Asp Tyr Ala Lys Asn
Lys Ala Leu 85 90 95 Ala Glu Leu Gln Gly Leu Gln Asn Asn Val Glu
Asp Tyr Val Ser Ala 100 105 110 Leu Ser Ser Trp Gln Lys Asn Pro Ala
Ala Pro Phe Arg Asn Pro His 115 120 125 Ser Gln Gly Arg Ile Arg Glu
Leu Phe Ser Gln Ala Glu Ser His Phe 130 135 140 Arg Asn Ser Met Pro
Ser Phe Ala Ile Ser Gly Tyr Glu Val Leu Phe 145 150 155 160 Leu Thr
Thr Tyr Ala Gln Ala Ala Asn Thr His Leu Phe Leu Leu Lys 165 170 175
Asp Ala Gln Ile Tyr Gly Glu Glu Trp Gly Tyr Glu Lys Glu Asp Ile 180
185 190 Ala Glu Phe Tyr Lys Arg Gln Leu Lys Leu Thr Gln Glu Tyr Thr
Asp 195 200 205 His Cys Val Lys Trp Tyr Asn Val Gly Leu Asp Lys Leu
Arg Gly Ser 210 215 220 Ser Tyr Glu Ser Trp Val Asn Phe Asn Arg Tyr
Arg Arg Glu Met Thr 225 230 235 240 Leu Thr Val Leu Asp Leu Ile Ala
Leu Phe Pro Leu Tyr Asp Val Arg 245 250 255 Leu Tyr Pro Lys Glu Val
Lys Thr Glu Leu Thr Arg Asp Val Leu Thr 260 265 270 Asp Pro Ile Val
Gly Val Asn Asn Leu Arg Gly Tyr Gly Thr Thr Phe 275 280 285 Ser Asn
Ile Glu Asn Tyr Ile Arg Lys Pro His Leu Phe Asp Tyr Leu 290 295 300
His Arg Ile Gln Phe His Thr Arg Phe Gln Pro Gly Tyr Tyr Gly Asn 305
310 315 320 Asp Ser Phe Asn Tyr Trp Ser Gly Asn Tyr Val Ser Thr Arg
Pro Ser 325 330 335 Ile Gly Ser Asn Asp Ile Ile Thr Ser Pro Phe Tyr
Gly Asn Lys Ser 340 345 350 Ser Glu Pro Val Gln Asn Leu Glu Phe Asn
Gly Glu Lys Val Tyr Arg 355 360 365 Ala Val Ala Asn Thr Asn Leu Ala
Val Trp Pro Ser Ala Val Tyr Ser 370 375 380 Gly Val Thr Lys Val Glu
Phe Ser Gln Tyr Asn Asp Gln Thr Asp Glu 385 390 395 400 Ala Ser Thr
Gln Thr Tyr Asp Ser Lys Arg Asn Val Gly Ala Val Ser 405 410 415 Trp
Asp Ser Ile Asp Gln Leu Pro Pro Glu Thr Thr Asp Glu Pro Leu 420 425
430 Glu Lys Gly Tyr Ser His Gln Leu Asn Tyr Val Met Cys Phe Leu Met
435 440 445 Gln Gly Ser Arg Gly Thr Ile Pro Val Leu Thr Trp Thr His
Lys Ser 450 455 460 Val Asp Phe Phe Asn Met Ile Asp Ser Lys Lys Ile
Thr Gln Leu Pro 465 470 475 480 Leu Thr Lys Ser Thr Asn Leu Gly Ser
Gly Thr Ser Val Val Lys Gly 485 490 495 Pro Gly Phe Thr Gly Gly Asp
Ile Leu Arg Arg Thr Ser Pro Gly Gln 500 505 510 Ile Ser Thr Leu Arg
Val Asn Ile Thr Ala Pro Leu Ser Gln Arg Tyr 515 520 525 Arg Val Arg
Ile Arg Tyr Ala Ser Thr Thr Asn Leu Gln Phe His Thr 530 535 540 Ser
Ile Asp Gly Arg Pro Ile Asn Gln Gly Asn Phe Ser Ala Thr Met 545 550
555 560 Ser Ser Gly Ser Asn Leu Gln Ser Gly Ser Phe Arg Thr Val Gly
Phe 565 570 575 Thr Thr Pro Phe Asn Phe Ser Asn Gly Ser Ser Val Phe
Thr Leu Ser 580 585 590 Ala His Val Phe Asn Ser Gly Asn Glu Val Tyr
Ile Asp Arg Ile Glu 595 600 605 Phe Val Pro Ala Glu Val Thr Phe Glu
Ala Glu Tyr Asp Leu Glu Arg 610 615 620 Ala Gln Lys Ala Val Asn Glu
Leu Phe Thr Ser Ser Asn Gln Ile Gly 625 630 635 640 Leu Lys Thr Asp
Val Thr Asp Tyr His Ile Asp Gln Val 645 650 51959DNAArtificial
SequenceFRCG coding sequence 5atgactagta acggccgcca gtgtgctggt
attcgccctt atgacggccg acaacaacac 60cgaggcctgg acagcagcac caccaaggac
gtgatccaga agggcatcag cgtggtgggc 120gacctgctgg gcgtggtggg
cttccccttc ggcggcgccc tggtgagctt ctacaccaac 180ttcctgaaca
ccatctggcc cagcgaggac ccctggaagg ccttcatgga gcaggtggag
240gccctgatgg accagaagat cgccgactac gccaagaaca aggcactggc
cgagctacag 300ggcctccaga acaacgtgga ggactatgtg agcgccctga
gcagctggca gaagaacccc 360gtctcgagcc gcaaccccca cagccagggc
cgcatccgcg agctgttcag ccaggccgag 420agccacttcc gcaacagcat
gcccagcttc gccatcagcg gctacgaggt gctgttcctg 480accacctacg
cccaggccgc caacacccac ctgttcctgc tgaaggacgc ccaaatctac
540ggagaggagt ggggctacga gaaggaggac atcgccgagt tctacaagcg
ccagctgaag 600ctgacccagg agtacaccga ccactgcgtg aagtggtaca
acgtgggtct agacaagctc 660cgcggcagca gctacgagag ctgggtgaac
ttcaaccgct accgccgcga gatgaccctg 720accgtgctgg acctgatcgc
cctgttcccc ctgtacgacg tgcgcctgta ccccaaggag 780gtgaagaccg
agctgacccg cgacgtgctg accgacccca tcgtgggcgt gaacaacctg
840cgcggctacg gcaccacctt cagcaacatc gagaactaca tccgcaagcc
ccacctgttc 900gactacctgc accgcatcca gttccacacg cgtttccagc
ccggctacta cggcaacgac 960agcttcaact actggagcgg caactacgtg
agcacccgcc ccagcatcgg cagcaacgac 1020atcatcacca gccccttcta
cggcaacaag agcagcgagc ccgtgcagaa ccttgagttc 1080aacggcgaga
aggtgtaccg cgccgtggct aacaccaacc tggccgtgtg gccctctgca
1140gtgtacagcg gcgtgaccaa ggtggagttc agccagtaca acgaccagac
cgacgaggcc 1200agcacccaga cctacgacag caagcgcaac gtgggcgccg
tgagctggga cagcatcgac 1260cagctgcccc ccgagaccac cgacgagccc
ctggagaagg gctacagcca ccagctgaac 1320tacgtgatgt gcttcctgat
gcagggcagc cgcggcacca tccccgtgct gacctggacc 1380cacaagagcg
tcgacttctt caacatgatc gacagcaaga agatcaccca gctgcccctg
1440accaagagca ccaacctggg cagcggcacc agcgtggtga agggccccgg
cttcaccggc 1500ggcgacatcc tgcgccgcac cagccccggc cagatcagca
ccctgcgcgt gaacatcacc 1560gcccccctga gccagcgcta ccgcgtccgc
atccgctacg ccagcaccac caacctgcag 1620ttccacacca gcatcgacgg
ccgccccatc aaccagggca acttcagcgc caccatgagc 1680agcggcagca
acctgcagag cggcagcttc cgcaccgtgg gcttcaccac ccccttcaac
1740ttcagcaacg gcagcagcgt gttcaccctg agcgcccacg tgttcaacag
cggcaacgag 1800gtgtacatcg accgcatcga gttcgtgccc gccgaggtga
ccttcgaggc cgagtacgac 1860ctggagaggg ctcagaaggc cgtgaacgag
ctgttcacca gcagcaacca gatcggcctg 1920aagaccgacg tgaccgacta
ccacatcgat caggtgtag 19596652PRTArtificial SequenceFRCG protein
6Met Thr Ser Asn Gly Arg Gln Cys Ala Gly Ile Arg Pro Tyr Asp Gly 1
5 10 15 Arg Gln Gln His Arg Gly Leu Asp Ser Ser Thr Thr Lys Asp Val
Ile
20 25 30 Gln Lys Gly Ile Ser Val Val Gly Asp Leu Leu Gly Val Val
Gly Phe 35 40 45 Pro Phe Gly Gly Ala Leu Val Ser Phe Tyr Thr Asn
Phe Leu Asn Thr 50 55 60 Ile Trp Pro Ser Glu Asp Pro Trp Lys Ala
Phe Met Glu Gln Val Glu 65 70 75 80 Ala Leu Met Asp Gln Lys Ile Ala
Asp Tyr Ala Lys Asn Lys Ala Leu 85 90 95 Ala Glu Leu Gln Gly Leu
Gln Asn Asn Val Glu Asp Tyr Val Ser Ala 100 105 110 Leu Ser Ser Trp
Gln Lys Asn Pro Val Ser Ser Arg Asn Pro His Ser 115 120 125 Gln Gly
Arg Ile Arg Glu Leu Phe Ser Gln Ala Glu Ser His Phe Arg 130 135 140
Asn Ser Met Pro Ser Phe Ala Ile Ser Gly Tyr Glu Val Leu Phe Leu 145
150 155 160 Thr Thr Tyr Ala Gln Ala Ala Asn Thr His Leu Phe Leu Leu
Lys Asp 165 170 175 Ala Gln Ile Tyr Gly Glu Glu Trp Gly Tyr Glu Lys
Glu Asp Ile Ala 180 185 190 Glu Phe Tyr Lys Arg Gln Leu Lys Leu Thr
Gln Glu Tyr Thr Asp His 195 200 205 Cys Val Lys Trp Tyr Asn Val Gly
Leu Asp Lys Leu Arg Gly Ser Ser 210 215 220 Tyr Glu Ser Trp Val Asn
Phe Asn Arg Tyr Arg Arg Glu Met Thr Leu 225 230 235 240 Thr Val Leu
Asp Leu Ile Ala Leu Phe Pro Leu Tyr Asp Val Arg Leu 245 250 255 Tyr
Pro Lys Glu Val Lys Thr Glu Leu Thr Arg Asp Val Leu Thr Asp 260 265
270 Pro Ile Val Gly Val Asn Asn Leu Arg Gly Tyr Gly Thr Thr Phe Ser
275 280 285 Asn Ile Glu Asn Tyr Ile Arg Lys Pro His Leu Phe Asp Tyr
Leu His 290 295 300 Arg Ile Gln Phe His Thr Arg Phe Gln Pro Gly Tyr
Tyr Gly Asn Asp 305 310 315 320 Ser Phe Asn Tyr Trp Ser Gly Asn Tyr
Val Ser Thr Arg Pro Ser Ile 325 330 335 Gly Ser Asn Asp Ile Ile Thr
Ser Pro Phe Tyr Gly Asn Lys Ser Ser 340 345 350 Glu Pro Val Gln Asn
Leu Glu Phe Asn Gly Glu Lys Val Tyr Arg Ala 355 360 365 Val Ala Asn
Thr Asn Leu Ala Val Trp Pro Ser Ala Val Tyr Ser Gly 370 375 380 Val
Thr Lys Val Glu Phe Ser Gln Tyr Asn Asp Gln Thr Asp Glu Ala 385 390
395 400 Ser Thr Gln Thr Tyr Asp Ser Lys Arg Asn Val Gly Ala Val Ser
Trp 405 410 415 Asp Ser Ile Asp Gln Leu Pro Pro Glu Thr Thr Asp Glu
Pro Leu Glu 420 425 430 Lys Gly Tyr Ser His Gln Leu Asn Tyr Val Met
Cys Phe Leu Met Gln 435 440 445 Gly Ser Arg Gly Thr Ile Pro Val Leu
Thr Trp Thr His Lys Ser Val 450 455 460 Asp Phe Phe Asn Met Ile Asp
Ser Lys Lys Ile Thr Gln Leu Pro Leu 465 470 475 480 Thr Lys Ser Thr
Asn Leu Gly Ser Gly Thr Ser Val Val Lys Gly Pro 485 490 495 Gly Phe
Thr Gly Gly Asp Ile Leu Arg Arg Thr Ser Pro Gly Gln Ile 500 505 510
Ser Thr Leu Arg Val Asn Ile Thr Ala Pro Leu Ser Gln Arg Tyr Arg 515
520 525 Val Arg Ile Arg Tyr Ala Ser Thr Thr Asn Leu Gln Phe His Thr
Ser 530 535 540 Ile Asp Gly Arg Pro Ile Asn Gln Gly Asn Phe Ser Ala
Thr Met Ser 545 550 555 560 Ser Gly Ser Asn Leu Gln Ser Gly Ser Phe
Arg Thr Val Gly Phe Thr 565 570 575 Thr Pro Phe Asn Phe Ser Asn Gly
Ser Ser Val Phe Thr Leu Ser Ala 580 585 590 His Val Phe Asn Ser Gly
Asn Glu Val Tyr Ile Asp Arg Ile Glu Phe 595 600 605 Val Pro Ala Glu
Val Thr Phe Glu Ala Glu Tyr Asp Leu Glu Arg Ala 610 615 620 Gln Lys
Ala Val Asn Glu Leu Phe Thr Ser Ser Asn Gln Ile Gly Leu 625 630 635
640 Lys Thr Asp Val Thr Asp Tyr His Ile Asp Gln Val 645 650
71962DNAArtificial SequenceFR8a-9F coding sequence 7atgactagta
acggccgcca gtgtgctggt attcgcccta tgacggccga caacaacacc 60gaggccctgg
acagcagcac caccaaggac gtgatccaga agggcatcag cgtggtgggc
120gacctgctgg gcgtggtggg cttccccttc ggcggcgccc tggtgagctt
ctacaccaac 180ttcctgaaca ccatctggcc cagcgaggac ccctggaagg
ccttcatgga gcaggtggag 240gccctgatgg accagaagat cgccgactac
gccaagaaca aggcactggc cgagctacag 300ggcctccaga acaacgtgga
ggactatgtg agcgccctga gcagctggca gaagaacccc 360gctgcaccgt
tccgcaaccc ccacagccag ggccgcatcc gcgagctgtt cagccaggcc
420gagagccact tccgcaacag catgcccagc ttcgccatca gcggctacga
ggtgctgttc 480ctgaccacct acgcccaggc cgccaacacc cacctgttcc
tgctgaagga cgcccaaatc 540tacggagagg agtggggcta cgagaaggag
gacatcgccg agttctacaa gcgccagctg 600aagctgaccc aggagtacac
cgaccactgc gtgaagtggt acaacgtggg tctagacaag 660ctccgcggca
gcagctacga gagctgggtg aacttcaacc gctaccgccg cgagatgacc
720ctgaccgtgc tggacctgat cgccctgttc cccctgtacg acgtgcgcct
gtaccccaag 780gaggtgaaga ccgagctgac ccgcgacgtg ctgaccgacc
ccatcgtggg cgtgaacaac 840ctgcgcggct acggcaccac cttcagcaac
atcgagaact acatccgcaa gccccacctg 900ttcgactacc tgcaccgcat
ccagttccac acgcgtttcc agcccggcta ctacggcaac 960gacagcttca
actactggag cggcaactac gtgagcaccc gccccagcat cggcagcaac
1020gacatcatca ccagcccctt ctacggcaac aagagcagcg agcccgtgca
gaaccttgag 1080ttcaacggcg agaaggtgta ccgcgccgtg gctaacacca
acctggccgt gtggccctct 1140gcagtgtaca gcggcgtgac caaggtggag
ttcagccagt acaacgacca gaccgacgag 1200gccagcaccc agacctacga
cagcaagcgc aacgtgggcg ccgtgagctg ggacagcatc 1260gaccagctgc
cccccgagac caccgacgag cccctggaga agggctacag ccaccagctg
1320aactacgtga tgtgcttcct gatgcagggc agccgcggca ccatccccgt
gctgacctgg 1380acccacaaga gcgtcgactt cttcaacatg atcgacagca
agaagatcac ccagctgccc 1440ctgaccaaga gcaccaacct gggcagcggc
accagcgtgg tgaagggccc cggcttcacc 1500ggcggcgaca tcctgcgccg
caccagcccc ggccagatca gcaccctgcg cgtgaacatc 1560accgcccccc
tgagccagcg ctaccgcgtc cgcatccgct acgccagcac caccaacctg
1620cagttccaca ccagcatcga cggccgcccc atcaaccagg gcaacttcag
cgccaccatg 1680agcagcggca gcaacctgca gagcggcagc ttccgcaccg
tgggcttcac cacccccttc 1740aacttcagca acggcagcag cgtgttcacc
ctgagcgccc acgtgttcaa cagcggcaac 1800gaggtgtaca tcgaccgcat
cgagttcgtg cccgccgagg tgaccttcga ggccgagtac 1860gacctggaga
gggctcagaa ggccgtgaac gagctgttca ccagcagcaa ccagatcggc
1920ctgaagaccg acgtgaccga ctaccacatc gatcaggtgt ag
19628653PRTArtificial SequenceFR8a-9F protein 8Met Thr Ser Asn Gly
Arg Gln Cys Ala Gly Ile Arg Pro Met Thr Ala 1 5 10 15 Asp Asn Asn
Thr Glu Ala Leu Asp Ser Ser Thr Thr Lys Asp Val Ile 20 25 30 Gln
Lys Gly Ile Ser Val Val Gly Asp Leu Leu Gly Val Val Gly Phe 35 40
45 Pro Phe Gly Gly Ala Leu Val Ser Phe Tyr Thr Asn Phe Leu Asn Thr
50 55 60 Ile Trp Pro Ser Glu Asp Pro Trp Lys Ala Phe Met Glu Gln
Val Glu 65 70 75 80 Ala Leu Met Asp Gln Lys Ile Ala Asp Tyr Ala Lys
Asn Lys Ala Leu 85 90 95 Ala Glu Leu Gln Gly Leu Gln Asn Asn Val
Glu Asp Tyr Val Ser Ala 100 105 110 Leu Ser Ser Trp Gln Lys Asn Pro
Ala Ala Pro Phe Arg Asn Pro His 115 120 125 Ser Gln Gly Arg Ile Arg
Glu Leu Phe Ser Gln Ala Glu Ser His Phe 130 135 140 Arg Asn Ser Met
Pro Ser Phe Ala Ile Ser Gly Tyr Glu Val Leu Phe 145 150 155 160 Leu
Thr Thr Tyr Ala Gln Ala Ala Asn Thr His Leu Phe Leu Leu Lys 165 170
175 Asp Ala Gln Ile Tyr Gly Glu Glu Trp Gly Tyr Glu Lys Glu Asp Ile
180 185 190 Ala Glu Phe Tyr Lys Arg Gln Leu Lys Leu Thr Gln Glu Tyr
Thr Asp 195 200 205 His Cys Val Lys Trp Tyr Asn Val Gly Leu Asp Lys
Leu Arg Gly Ser 210 215 220 Ser Tyr Glu Ser Trp Val Asn Phe Asn Arg
Tyr Arg Arg Glu Met Thr 225 230 235 240 Leu Thr Val Leu Asp Leu Ile
Ala Leu Phe Pro Leu Tyr Asp Val Arg 245 250 255 Leu Tyr Pro Lys Glu
Val Lys Thr Glu Leu Thr Arg Asp Val Leu Thr 260 265 270 Asp Pro Ile
Val Gly Val Asn Asn Leu Arg Gly Tyr Gly Thr Thr Phe 275 280 285 Ser
Asn Ile Glu Asn Tyr Ile Arg Lys Pro His Leu Phe Asp Tyr Leu 290 295
300 His Arg Ile Gln Phe His Thr Arg Phe Gln Pro Gly Tyr Tyr Gly Asn
305 310 315 320 Asp Ser Phe Asn Tyr Trp Ser Gly Asn Tyr Val Ser Thr
Arg Pro Ser 325 330 335 Ile Gly Ser Asn Asp Ile Ile Thr Ser Pro Phe
Tyr Gly Asn Lys Ser 340 345 350 Ser Glu Pro Val Gln Asn Leu Glu Phe
Asn Gly Glu Lys Val Tyr Arg 355 360 365 Ala Val Ala Asn Thr Asn Leu
Ala Val Trp Pro Ser Ala Val Tyr Ser 370 375 380 Gly Val Thr Lys Val
Glu Phe Ser Gln Tyr Asn Asp Gln Thr Asp Glu 385 390 395 400 Ala Ser
Thr Gln Thr Tyr Asp Ser Lys Arg Asn Val Gly Ala Val Ser 405 410 415
Trp Asp Ser Ile Asp Gln Leu Pro Pro Glu Thr Thr Asp Glu Pro Leu 420
425 430 Glu Lys Gly Tyr Ser His Gln Leu Asn Tyr Val Met Cys Phe Leu
Met 435 440 445 Gln Gly Ser Arg Gly Thr Ile Pro Val Leu Thr Trp Thr
His Lys Ser 450 455 460 Val Asp Phe Phe Asn Met Ile Asp Ser Lys Lys
Ile Thr Gln Leu Pro 465 470 475 480 Leu Thr Lys Ser Thr Asn Leu Gly
Ser Gly Thr Ser Val Val Lys Gly 485 490 495 Pro Gly Phe Thr Gly Gly
Asp Ile Leu Arg Arg Thr Ser Pro Gly Gln 500 505 510 Ile Ser Thr Leu
Arg Val Asn Ile Thr Ala Pro Leu Ser Gln Arg Tyr 515 520 525 Arg Val
Arg Ile Arg Tyr Ala Ser Thr Thr Asn Leu Gln Phe His Thr 530 535 540
Ser Ile Asp Gly Arg Pro Ile Asn Gln Gly Asn Phe Ser Ala Thr Met 545
550 555 560 Ser Ser Gly Ser Asn Leu Gln Ser Gly Ser Phe Arg Thr Val
Gly Phe 565 570 575 Thr Thr Pro Phe Asn Phe Ser Asn Gly Ser Ser Val
Phe Thr Leu Ser 580 585 590 Ala His Val Phe Asn Ser Gly Asn Glu Val
Tyr Ile Asp Arg Ile Glu 595 600 605 Phe Val Pro Ala Glu Val Thr Phe
Glu Ala Glu Tyr Asp Leu Glu Arg 610 615 620 Ala Gln Lys Ala Val Asn
Glu Leu Phe Thr Ser Ser Asn Gln Ile Gly 625 630 635 640 Leu Lys Thr
Asp Val Thr Asp Tyr His Ile Asp Gln Val 645 650 91959DNAArtificial
SequenceFR-9F-catg coding sequence 9atgactagta acggccgcca
gtgtgctggt attcgcccta tgacggccga caacaacacc 60gaggccctgg acagcagcac
caccaaggac gtgatccaga agggcatcag cgtggtgggc 120gacctgctgg
gcgtggtggg cttccccttc ggcggcgccc tggtgagctt ctacaccaac
180ttcctgaaca ccatctggcc cagcgaggac ccctggaagg ccttcatgga
gcaggtggag 240gccctgatgg accagaagat cgccgactac gccaagaaca
aggcactggc cgagctacag 300ggcctccaga acaacgtgga ggactatgtg
agcgccctga gcagctggca gaagaacccc 360gtctcgagcc gcaaccccca
cagccagggc cgcatccgcg agctgttcag ccaggccgag 420agccacttcc
gcaacagcat gcccagcttc gccatcagcg gctacgaggt gctgttcctg
480accacctacg cccaggccgc caacacccac ctgttcctgc tgaaggacgc
ccaaatctac 540ggagaggagt ggggctacga gaaggaggac atcgccgagt
tctacaagcg ccagctgaag 600ctgacccagg agtacaccga ccactgcgtg
aagtggtaca acgtgggtct agacaagctc 660cgcggcagca gctacgagag
ctgggtgaac ttcaaccgct accgccgcga gatgaccctg 720accgtgctgg
acctgatcgc cctgttcccc ctgtacgacg tgcgcctgta ccccaaggag
780gtgaagaccg agctgacccg cgacgtgctg accgacccca tcgtgggcgt
gaacaacctg 840cgcggctacg gcaccacctt cagcaacatc gagaactaca
tccgcaagcc ccacctgttc 900gactacctgc accgcatcca gttccacacg
cgtttccagc ccggctacta cggcaacgac 960agcttcaact actggagcgg
caactacgtg agcacccgcc ccagcatcgg cagcaacgac 1020atcatcacca
gccccttcta cggcaacaag agcagcgagc ccgtgcagaa ccttgagttc
1080aacggcgaga aggtgtaccg cgccgtggct aacaccaacc tggccgtgtg
gccctctgca 1140gtgtacagcg gcgtgaccaa ggtggagttc agccagtaca
acgaccagac cgacgaggcc 1200agcacccaga cctacgacag caagcgcaac
gtgggcgccg tgagctggga cagcatcgac 1260cagctgcccc ccgagaccac
cgacgagccc ctggagaagg gctacagcca ccagctgaac 1320tacgtgatgt
gcttcctgat gcagggcagc cgcggcacca tccccgtgct gacctggacc
1380cacaagagcg tcgacttctt caacatgatc gacagcaaga agatcaccca
gctgcccctg 1440accaagagca ccaacctggg cagcggcacc agcgtggtga
agggccccgg cttcaccggc 1500ggcgacatcc tgcgccgcac cagccccggc
cagatcagca ccctgcgcgt gaacatcacc 1560gcccccctga gccagcgcta
ccgcgtccgc atccgctacg ccagcaccac caacctgcag 1620ttccacacca
gcatcgacgg ccgccccatc aaccagggca acttcagcgc caccatgagc
1680agcggcagca acctgcagag cggcagcttc cgcaccgtgg gcttcaccac
ccccttcaac 1740ttcagcaacg gcagcagcgt gttcaccctg agcgcccacg
tgttcaacag cggcaacgag 1800gtgtacatcg accgcatcga gttcgtgccc
gccgaggtga ccttcgaggc cgagtacgac 1860ctggagaggg ctcagaaggc
cgtgaacgag ctgttcacca gcagcaacca gatcggcctg 1920aagaccgacg
tgaccgacta ccacatcgat caggtgtag 195910652PRTArtificial
SequenceFR-9F-catg protein 10Met Thr Ser Asn Gly Arg Gln Cys Ala
Gly Ile Arg Pro Met Thr Ala 1 5 10 15 Asp Asn Asn Thr Glu Ala Leu
Asp Ser Ser Thr Thr Lys Asp Val Ile 20 25 30 Gln Lys Gly Ile Ser
Val Val Gly Asp Leu Leu Gly Val Val Gly Phe 35 40 45 Pro Phe Gly
Gly Ala Leu Val Ser Phe Tyr Thr Asn Phe Leu Asn Thr 50 55 60 Ile
Trp Pro Ser Glu Asp Pro Trp Lys Ala Phe Met Glu Gln Val Glu 65 70
75 80 Ala Leu Met Asp Gln Lys Ile Ala Asp Tyr Ala Lys Asn Lys Ala
Leu 85 90 95 Ala Glu Leu Gln Gly Leu Gln Asn Asn Val Glu Asp Tyr
Val Ser Ala 100 105 110 Leu Ser Ser Trp Gln Lys Asn Pro Val Ser Ser
Arg Asn Pro His Ser 115 120 125 Gln Gly Arg Ile Arg Glu Leu Phe Ser
Gln Ala Glu Ser His Phe Arg 130 135 140 Asn Ser Met Pro Ser Phe Ala
Ile Ser Gly Tyr Glu Val Leu Phe Leu 145 150 155 160 Thr Thr Tyr Ala
Gln Ala Ala Asn Thr His Leu Phe Leu Leu Lys Asp 165 170 175 Ala Gln
Ile Tyr Gly Glu Glu Trp Gly Tyr Glu Lys Glu Asp Ile Ala 180 185 190
Glu Phe Tyr Lys Arg Gln Leu Lys Leu Thr Gln Glu Tyr Thr Asp His 195
200 205 Cys Val Lys Trp Tyr Asn Val Gly Leu Asp Lys Leu Arg Gly Ser
Ser 210 215 220 Tyr Glu Ser Trp Val Asn Phe Asn Arg Tyr Arg Arg Glu
Met Thr Leu 225 230 235 240 Thr Val Leu Asp Leu Ile Ala Leu Phe Pro
Leu Tyr Asp Val Arg Leu 245 250 255 Tyr Pro Lys Glu Val Lys Thr Glu
Leu Thr Arg Asp Val Leu Thr Asp 260 265 270 Pro Ile Val Gly Val Asn
Asn Leu Arg Gly Tyr Gly Thr Thr Phe Ser 275 280 285 Asn Ile Glu Asn
Tyr Ile Arg Lys Pro His Leu Phe Asp Tyr Leu His 290 295 300 Arg Ile
Gln Phe His Thr Arg Phe Gln Pro Gly Tyr Tyr Gly Asn Asp 305 310 315
320 Ser Phe Asn Tyr Trp Ser Gly Asn Tyr Val Ser Thr Arg Pro Ser Ile
325 330 335 Gly Ser Asn Asp Ile Ile Thr Ser Pro Phe Tyr Gly Asn Lys
Ser Ser 340 345 350 Glu Pro Val Gln Asn Leu Glu Phe Asn Gly Glu Lys
Val Tyr Arg Ala 355 360 365 Val Ala Asn Thr Asn Leu Ala Val Trp Pro
Ser Ala Val Tyr Ser Gly 370 375 380 Val Thr Lys Val Glu Phe Ser Gln
Tyr Asn Asp
Gln Thr Asp Glu Ala 385 390 395 400 Ser Thr Gln Thr Tyr Asp Ser Lys
Arg Asn Val Gly Ala Val Ser Trp 405 410 415 Asp Ser Ile Asp Gln Leu
Pro Pro Glu Thr Thr Asp Glu Pro Leu Glu 420 425 430 Lys Gly Tyr Ser
His Gln Leu Asn Tyr Val Met Cys Phe Leu Met Gln 435 440 445 Gly Ser
Arg Gly Thr Ile Pro Val Leu Thr Trp Thr His Lys Ser Val 450 455 460
Asp Phe Phe Asn Met Ile Asp Ser Lys Lys Ile Thr Gln Leu Pro Leu 465
470 475 480 Thr Lys Ser Thr Asn Leu Gly Ser Gly Thr Ser Val Val Lys
Gly Pro 485 490 495 Gly Phe Thr Gly Gly Asp Ile Leu Arg Arg Thr Ser
Pro Gly Gln Ile 500 505 510 Ser Thr Leu Arg Val Asn Ile Thr Ala Pro
Leu Ser Gln Arg Tyr Arg 515 520 525 Val Arg Ile Arg Tyr Ala Ser Thr
Thr Asn Leu Gln Phe His Thr Ser 530 535 540 Ile Asp Gly Arg Pro Ile
Asn Gln Gly Asn Phe Ser Ala Thr Met Ser 545 550 555 560 Ser Gly Ser
Asn Leu Gln Ser Gly Ser Phe Arg Thr Val Gly Phe Thr 565 570 575 Thr
Pro Phe Asn Phe Ser Asn Gly Ser Ser Val Phe Thr Leu Ser Ala 580 585
590 His Val Phe Asn Ser Gly Asn Glu Val Tyr Ile Asp Arg Ile Glu Phe
595 600 605 Val Pro Ala Glu Val Thr Phe Glu Ala Glu Tyr Asp Leu Glu
Arg Ala 610 615 620 Gln Lys Ala Val Asn Glu Leu Phe Thr Ser Ser Asn
Gln Ile Gly Leu 625 630 635 640 Lys Thr Asp Val Thr Asp Tyr His Ile
Asp Gln Val 645 650 111926DNAArtificial SequenceFR8a-12AA coding
sequence 11atgtatgacg gccgacaaca acaccgaggc ctggacagca gcaccaccaa
ggacgtgatc 60cagaagggca tcagcgtggt gggcgacctg ctgggcgtgg tgggcttccc
cttcggcggc 120gccctggtga gcttctacac caacttcctg aacaccatct
ggcccagcga ggacccctgg 180aaggccttca tggagcaggt ggaggccctg
atggaccaga agatcgccga ctacgccaag 240aacaaggcac tggccgagct
acagggcctc cagaacaacg tggaggacta tgtgagcgcc 300ctgagcagct
ggcagaagaa ccccgctgca ccgttccgca acccccacag ccagggccgc
360atccgcgagc tgttcagcca ggccgagagc cacttccgca acagcatgcc
cagcttcgcc 420atcagcggct acgaggtgct gttcctgacc acctacgccc
aggccgccaa cacccacctg 480ttcctgctga aggacgccca aatctacgga
gaggagtggg gctacgagaa ggaggacatc 540gccgagttct acaagcgcca
gctgaagctg acccaggagt acaccgacca ctgcgtgaag 600tggtacaacg
tgggtctaga caagctccgc ggcagcagct acgagagctg ggtgaacttc
660aaccgctacc gccgcgagat gaccctgacc gtgctggacc tgatcgccct
gttccccctg 720tacgacgtgc gcctgtaccc caaggaggtg aagaccgagc
tgacccgcga cgtgctgacc 780gaccccatcg tgggcgtgaa caacctgcgc
ggctacggca ccaccttcag caacatcgag 840aactacatcc gcaagcccca
cctgttcgac tacctgcacc gcatccagtt ccacacgcgt 900ttccagcccg
gctactacgg caacgacagc ttcaactact ggagcggcaa ctacgtgagc
960acccgcccca gcatcggcag caacgacatc atcaccagcc ccttctacgg
caacaagagc 1020agcgagcccg tgcagaacct tgagttcaac ggcgagaagg
tgtaccgcgc cgtggctaac 1080accaacctgg ccgtgtggcc ctctgcagtg
tacagcggcg tgaccaaggt ggagttcagc 1140cagtacaacg accagaccga
cgaggccagc acccagacct acgacagcaa gcgcaacgtg 1200ggcgccgtga
gctgggacag catcgaccag ctgccccccg agaccaccga cgagcccctg
1260gagaagggct acagccacca gctgaactac gtgatgtgct tcctgatgca
gggcagccgc 1320ggcaccatcc ccgtgctgac ctggacccac aagagcgtcg
acttcttcaa catgatcgac 1380agcaagaaga tcacccagct gcccctgacc
aagagcacca acctgggcag cggcaccagc 1440gtggtgaagg gccccggctt
caccggcggc gacatcctgc gccgcaccag ccccggccag 1500atcagcaccc
tgcgcgtgaa catcaccgcc cccctgagcc agcgctaccg cgtccgcatc
1560cgctacgcca gcaccaccaa cctgcagttc cacaccagca tcgacggccg
ccccatcaac 1620cagggcaact tcagcgccac catgagcagc ggcagcaacc
tgcagagcgg cagcttccgc 1680accgtgggct tcaccacccc cttcaacttc
agcaacggca gcagcgtgtt caccctgagc 1740gcccacgtgt tcaacagcgg
caacgaggtg tacatcgacc gcatcgagtt cgtgcccgcc 1800gaggtgacct
tcgaggccga gtacgacctg gagagggctc agaaggccgt gaacgagctg
1860ttcaccagca gcaaccagat cggcctgaag accgacgtga ccgactacca
catcgatcag 1920gtgtag 192612641PRTArtificial SequenceFR8a-12AA
protein 12Met Tyr Asp Gly Arg Gln Gln His Arg Gly Leu Asp Ser Ser
Thr Thr 1 5 10 15 Lys Asp Val Ile Gln Lys Gly Ile Ser Val Val Gly
Asp Leu Leu Gly 20 25 30 Val Val Gly Phe Pro Phe Gly Gly Ala Leu
Val Ser Phe Tyr Thr Asn 35 40 45 Phe Leu Asn Thr Ile Trp Pro Ser
Glu Asp Pro Trp Lys Ala Phe Met 50 55 60 Glu Gln Val Glu Ala Leu
Met Asp Gln Lys Ile Ala Asp Tyr Ala Lys 65 70 75 80 Asn Lys Ala Leu
Ala Glu Leu Gln Gly Leu Gln Asn Asn Val Glu Asp 85 90 95 Tyr Val
Ser Ala Leu Ser Ser Trp Gln Lys Asn Pro Ala Ala Pro Phe 100 105 110
Arg Asn Pro His Ser Gln Gly Arg Ile Arg Glu Leu Phe Ser Gln Ala 115
120 125 Glu Ser His Phe Arg Asn Ser Met Pro Ser Phe Ala Ile Ser Gly
Tyr 130 135 140 Glu Val Leu Phe Leu Thr Thr Tyr Ala Gln Ala Ala Asn
Thr His Leu 145 150 155 160 Phe Leu Leu Lys Asp Ala Gln Ile Tyr Gly
Glu Glu Trp Gly Tyr Glu 165 170 175 Lys Glu Asp Ile Ala Glu Phe Tyr
Lys Arg Gln Leu Lys Leu Thr Gln 180 185 190 Glu Tyr Thr Asp His Cys
Val Lys Trp Tyr Asn Val Gly Leu Asp Lys 195 200 205 Leu Arg Gly Ser
Ser Tyr Glu Ser Trp Val Asn Phe Asn Arg Tyr Arg 210 215 220 Arg Glu
Met Thr Leu Thr Val Leu Asp Leu Ile Ala Leu Phe Pro Leu 225 230 235
240 Tyr Asp Val Arg Leu Tyr Pro Lys Glu Val Lys Thr Glu Leu Thr Arg
245 250 255 Asp Val Leu Thr Asp Pro Ile Val Gly Val Asn Asn Leu Arg
Gly Tyr 260 265 270 Gly Thr Thr Phe Ser Asn Ile Glu Asn Tyr Ile Arg
Lys Pro His Leu 275 280 285 Phe Asp Tyr Leu His Arg Ile Gln Phe His
Thr Arg Phe Gln Pro Gly 290 295 300 Tyr Tyr Gly Asn Asp Ser Phe Asn
Tyr Trp Ser Gly Asn Tyr Val Ser 305 310 315 320 Thr Arg Pro Ser Ile
Gly Ser Asn Asp Ile Ile Thr Ser Pro Phe Tyr 325 330 335 Gly Asn Lys
Ser Ser Glu Pro Val Gln Asn Leu Glu Phe Asn Gly Glu 340 345 350 Lys
Val Tyr Arg Ala Val Ala Asn Thr Asn Leu Ala Val Trp Pro Ser 355 360
365 Ala Val Tyr Ser Gly Val Thr Lys Val Glu Phe Ser Gln Tyr Asn Asp
370 375 380 Gln Thr Asp Glu Ala Ser Thr Gln Thr Tyr Asp Ser Lys Arg
Asn Val 385 390 395 400 Gly Ala Val Ser Trp Asp Ser Ile Asp Gln Leu
Pro Pro Glu Thr Thr 405 410 415 Asp Glu Pro Leu Glu Lys Gly Tyr Ser
His Gln Leu Asn Tyr Val Met 420 425 430 Cys Phe Leu Met Gln Gly Ser
Arg Gly Thr Ile Pro Val Leu Thr Trp 435 440 445 Thr His Lys Ser Val
Asp Phe Phe Asn Met Ile Asp Ser Lys Lys Ile 450 455 460 Thr Gln Leu
Pro Leu Thr Lys Ser Thr Asn Leu Gly Ser Gly Thr Ser 465 470 475 480
Val Val Lys Gly Pro Gly Phe Thr Gly Gly Asp Ile Leu Arg Arg Thr 485
490 495 Ser Pro Gly Gln Ile Ser Thr Leu Arg Val Asn Ile Thr Ala Pro
Leu 500 505 510 Ser Gln Arg Tyr Arg Val Arg Ile Arg Tyr Ala Ser Thr
Thr Asn Leu 515 520 525 Gln Phe His Thr Ser Ile Asp Gly Arg Pro Ile
Asn Gln Gly Asn Phe 530 535 540 Ser Ala Thr Met Ser Ser Gly Ser Asn
Leu Gln Ser Gly Ser Phe Arg 545 550 555 560 Thr Val Gly Phe Thr Thr
Pro Phe Asn Phe Ser Asn Gly Ser Ser Val 565 570 575 Phe Thr Leu Ser
Ala His Val Phe Asn Ser Gly Asn Glu Val Tyr Ile 580 585 590 Asp Arg
Ile Glu Phe Val Pro Ala Glu Val Thr Phe Glu Ala Glu Tyr 595 600 605
Asp Leu Glu Arg Ala Gln Lys Ala Val Asn Glu Leu Phe Thr Ser Ser 610
615 620 Asn Gln Ile Gly Leu Lys Thr Asp Val Thr Asp Tyr His Ile Asp
Gln 625 630 635 640 Val 131800DNAArtificial SequenceWR-9mut coding
sequence 13atgtatgacg gccgacaaca acaccgaggc ctggacagca gcaccaccaa
ggacgtgatc 60cagaagggca tcagcgtggt gggcgacctg ctgggcgtgg tgggcttccc
cttcggcggc 120gccctggtga gcttctacac caacttcctg aacaccatct
ggcccagcga ggacccctgg 180aaggccttca tggagcaggt ggaggccctg
atggaccaga agatcgccga ctacgccaag 240aacaaggcac tggccgagct
acagggcctc cagaacaacg tggaggacta tgtgagcgcc 300ctgagcagct
ggcagaagaa ccccgctgca ccgttccgca acccccacag ccagggccgc
360atccgcgagc tgttcagcca ggccgagagc cacttccgca acagcatgcc
cagcttcgcc 420atcagcggct acgaggtgct gttcctgacc acctacgccc
aggccgccaa cacccacctg 480ttcctgctga aggacgccca aatctacgga
gaggagtggg gctacgagaa ggaggacatc 540gccgagttct acaagcgcca
gctgaagctg acccaggagt acaccgacca ctgcgtgaag 600tggtacaacg
tgggtctaga caagctccgc ggcagcagct acgagagctg ggtgaacttc
660aaccgctacc gccgcgagat gaccctgacc gtgctggacc tgatcgccct
gttccccctg 720tacgacgtgc gcctgtaccc caaggaggtg aagaccgagc
tgacccgcga cgtgctgacc 780gaccccatcg tgggcgtgaa caacctgcgc
ggctacggca ccaccttcag caacatcgag 840aactacatcc gcaagcccca
cctgttcgac tacctgcacc gcatccagtt ccacacgcgt 900ttccagcccg
gctactacgg caacgacagc ttcaactact ggagcggcaa ctacgtgagc
960acccgcccca gcatcggcag caacgacatc atcaccagcc ccttctacgg
caacaagagc 1020agcgagcccg tgcagaacct tgagttcaac ggcgagaagg
tgtaccgcgc cgtggctaac 1080accaacctgg ccgtgtggcc ctctgcagtg
tacagcggcg tgaccaaggt ggagttcagc 1140cagtacaacg accagaccga
cgaggccagc acccagacct acgacagcaa gcgcaacgtg 1200ggcgccgtga
gctgggacag catcgaccag ctgccccccg agaccaccga cgagcccctg
1260gagaagggct acagccacca gctgaactac gtgatgtgct tcctgatgca
gggcagccgc 1320ggcaccatcc ccgtgctgac ctggacccac aagagcgtcg
acttcttcaa catgatcgac 1380agcaagaaga tcacccagct gcccctggtg
aaggcctaca agctccagag cggcgccagc 1440gtggtggcag gcccccgctt
caccggcggc gacatcatcc agtgcaccga gaacggcagc 1500gccgccacca
tctacgtgac ccccgacgtg agctacagcc agaagtaccg cgcccgcatc
1560cactacgcca gcaccagcca gatcaccttc accctgagcc tggacggggc
ccccttcaac 1620caatactact tcgacaagac catcaacaag ggcgacaccc
tgacctacaa cagcttcaac 1680ctggccagct tcagcacccc tttcgagctg
agcggcaaca acctccagat cggcgtgacc 1740ggcctgagcg ccggcgacaa
ggtgtacatc gacaagatcg agttcatccc cgtgaactag 180014599PRTArtificial
SequenceWR-9mut protein 14Met Tyr Asp Gly Arg Gln Gln His Arg Gly
Leu Asp Ser Ser Thr Thr 1 5 10 15 Lys Asp Val Ile Gln Lys Gly Ile
Ser Val Val Gly Asp Leu Leu Gly 20 25 30 Val Val Gly Phe Pro Phe
Gly Gly Ala Leu Val Ser Phe Tyr Thr Asn 35 40 45 Phe Leu Asn Thr
Ile Trp Pro Ser Glu Asp Pro Trp Lys Ala Phe Met 50 55 60 Glu Gln
Val Glu Ala Leu Met Asp Gln Lys Ile Ala Asp Tyr Ala Lys 65 70 75 80
Asn Lys Ala Leu Ala Glu Leu Gln Gly Leu Gln Asn Asn Val Glu Asp 85
90 95 Tyr Val Ser Ala Leu Ser Ser Trp Gln Lys Asn Pro Ala Ala Pro
Phe 100 105 110 Arg Asn Pro His Ser Gln Gly Arg Ile Arg Glu Leu Phe
Ser Gln Ala 115 120 125 Glu Ser His Phe Arg Asn Ser Met Pro Ser Phe
Ala Ile Ser Gly Tyr 130 135 140 Glu Val Leu Phe Leu Thr Thr Tyr Ala
Gln Ala Ala Asn Thr His Leu 145 150 155 160 Phe Leu Leu Lys Asp Ala
Gln Ile Tyr Gly Glu Glu Trp Gly Tyr Glu 165 170 175 Lys Glu Asp Ile
Ala Glu Phe Tyr Lys Arg Gln Leu Lys Leu Thr Gln 180 185 190 Glu Tyr
Thr Asp His Cys Val Lys Trp Tyr Asn Val Gly Leu Asp Lys 195 200 205
Leu Arg Gly Ser Ser Tyr Glu Ser Trp Val Asn Phe Asn Arg Tyr Arg 210
215 220 Arg Glu Met Thr Leu Thr Val Leu Asp Leu Ile Ala Leu Phe Pro
Leu 225 230 235 240 Tyr Asp Val Arg Leu Tyr Pro Lys Glu Val Lys Thr
Glu Leu Thr Arg 245 250 255 Asp Val Leu Thr Asp Pro Ile Val Gly Val
Asn Asn Leu Arg Gly Tyr 260 265 270 Gly Thr Thr Phe Ser Asn Ile Glu
Asn Tyr Ile Arg Lys Pro His Leu 275 280 285 Phe Asp Tyr Leu His Arg
Ile Gln Phe His Thr Arg Phe Gln Pro Gly 290 295 300 Tyr Tyr Gly Asn
Asp Ser Phe Asn Tyr Trp Ser Gly Asn Tyr Val Ser 305 310 315 320 Thr
Arg Pro Ser Ile Gly Ser Asn Asp Ile Ile Thr Ser Pro Phe Tyr 325 330
335 Gly Asn Lys Ser Ser Glu Pro Val Gln Asn Leu Glu Phe Asn Gly Glu
340 345 350 Lys Val Tyr Arg Ala Val Ala Asn Thr Asn Leu Ala Val Trp
Pro Ser 355 360 365 Ala Val Tyr Ser Gly Val Thr Lys Val Glu Phe Ser
Gln Tyr Asn Asp 370 375 380 Gln Thr Asp Glu Ala Ser Thr Gln Thr Tyr
Asp Ser Lys Arg Asn Val 385 390 395 400 Gly Ala Val Ser Trp Asp Ser
Ile Asp Gln Leu Pro Pro Glu Thr Thr 405 410 415 Asp Glu Pro Leu Glu
Lys Gly Tyr Ser His Gln Leu Asn Tyr Val Met 420 425 430 Cys Phe Leu
Met Gln Gly Ser Arg Gly Thr Ile Pro Val Leu Thr Trp 435 440 445 Thr
His Lys Ser Val Asp Phe Phe Asn Met Ile Asp Ser Lys Lys Ile 450 455
460 Thr Gln Leu Pro Leu Val Lys Ala Tyr Lys Leu Gln Ser Gly Ala Ser
465 470 475 480 Val Val Ala Gly Pro Arg Phe Thr Gly Gly Asp Ile Ile
Gln Cys Thr 485 490 495 Glu Asn Gly Ser Ala Ala Thr Ile Tyr Val Thr
Pro Asp Val Ser Tyr 500 505 510 Ser Gln Lys Tyr Arg Ala Arg Ile His
Tyr Ala Ser Thr Ser Gln Ile 515 520 525 Thr Phe Thr Leu Ser Leu Asp
Gly Ala Pro Phe Asn Gln Tyr Tyr Phe 530 535 540 Asp Lys Thr Ile Asn
Lys Gly Asp Thr Leu Thr Tyr Asn Ser Phe Asn 545 550 555 560 Leu Ala
Ser Phe Ser Thr Pro Phe Glu Leu Ser Gly Asn Asn Leu Gln 565 570 575
Ile Gly Val Thr Gly Leu Ser Ala Gly Asp Lys Val Tyr Ile Asp Lys 580
585 590 Ile Glu Phe Ile Pro Val Asn 595 15 1848DNAArtificial
SequenceFRD3 coding sequence 15atgactagta acggccgcca gtgtgctggt
attcgccctt atgacggccg acaacaacac 60cgaggcctgg acagcagcac caccaaggac
gtgatccaga agggcatcag cgtggtgggc 120gacctgctgg gcgtggtggg
cttccccttc ggcggcgccc tggtgagctt ctacaccaac 180ttcctgaaca
ccatctggcc cagcgaggac ccctggaagg ccttcatgga gcaggtggag
240gccctgatgg accagaagat cgccgactac gccaagaaca aggcactggc
cgagctacag 300ggcctccaga acaacgtgga ggactatgtg agcgccctga
gcagctggca gaagaacccc 360gctgcaccgt tccgcaaccc ccacagccag
ggccgcatcc gcgagctgtt cagccaggcc 420gagagccact tccgcaacag
catgcccagc ttcgccatca gcggctacga ggtgctgttc 480ctgaccacct
acgcccaggc cgccaacacc cacctgttcc tgctgaagga cgcccaaatc
540tacggagagg agtggggcta cgagaaggag gacatcgccg agttctacaa
gcgccagctg 600aagctgaccc aggagtacac cgaccactgc gtgaagtggt
acaacgtggg tctagacaag 660ctccgcggca gcagctacga gagctgggtg
aacttcaacc gctaccgccg cgagatgacc 720ctgaccgtgc tggacctgat
cgccctgttc cccctgtacg acgtgcgcct gtaccccaag 780gaggtgaaga
ccgagctgac ccgcgacgtg ctgaccgacc ccatcgtggg cgtgaacaac
840ctgcgcggct acggcaccac cttcagcaac atcgagaact acatccgcaa
gccccacctg 900ttcgactacc tgcaccgcat ccagttccac acgcgtttcc
agcccggcta ctacggcaac 960gacagcttca actactggag cggcaactac
gtgagcaccc gccccagcat cggcagcaac 1020gacatcatca ccagcccctt
ctacggcaac aagagcagcg agcccgtgca gaaccttgag 1080ttcaacggcg
agaaggtgta ccgcgccgtg gctaacacca acctggccgt gtggccctct
1140gcagtgtaca gcggcgtgac caaggtggag ttcagccagt acaacgacca
gaccgacgag 1200gccagcaccc agacctacga
cagcaagcgc aacgtgggcg ccgtgagctg ggacagcatc 1260gaccagctgc
cccccgagac caccgacgag cccctggaga agggctacag ccaccagctg
1320aactacgtga tgtgcttcct gatgcagggc agccgcggca ccatccccgt
gctgacctgg 1380acccacaaga gcgtcgactt cttcaacatg atcgacagca
agaagatcac ccagctgccc 1440ctgaccaaga gcaccaacct gggcagcggc
accagcgtgg tgaagggccc cggcttcacc 1500ggcggcgaca tcctgcgccg
caccagcccc ggccagatca gcaccctgcg cgtgaacatc 1560accgcccccc
tgagccagcg ctaccgcgtc cgcatccgct acgccagcac caccaacctg
1620cagttccaca ccagcatcga cggccgcccc atcaaccagg gcaacttcag
cgccaccatg 1680agcagcggca gcaacctgca gagcggcagc ttccgcaccg
tgggcttcac cacccccttc 1740aacttcagca acggcagcag cgtgttcacc
ctgagcgccc acgtgttcaa cagcggcaac 1800gaggtgtaca tcgaccgcat
cgagttcgtg cccgccgagg tgacctag 184816615PRTArtificial SequenceFRD3
protein 16Met Thr Ser Asn Gly Arg Gln Cys Ala Gly Ile Arg Pro Tyr
Asp Gly 1 5 10 15 Arg Gln Gln His Arg Gly Leu Asp Ser Ser Thr Thr
Lys Asp Val Ile 20 25 30 Gln Lys Gly Ile Ser Val Val Gly Asp Leu
Leu Gly Val Val Gly Phe 35 40 45 Pro Phe Gly Gly Ala Leu Val Ser
Phe Tyr Thr Asn Phe Leu Asn Thr 50 55 60 Ile Trp Pro Ser Glu Asp
Pro Trp Lys Ala Phe Met Glu Gln Val Glu 65 70 75 80 Ala Leu Met Asp
Gln Lys Ile Ala Asp Tyr Ala Lys Asn Lys Ala Leu 85 90 95 Ala Glu
Leu Gln Gly Leu Gln Asn Asn Val Glu Asp Tyr Val Ser Ala 100 105 110
Leu Ser Ser Trp Gln Lys Asn Pro Ala Ala Pro Phe Arg Asn Pro His 115
120 125 Ser Gln Gly Arg Ile Arg Glu Leu Phe Ser Gln Ala Glu Ser His
Phe 130 135 140 Arg Asn Ser Met Pro Ser Phe Ala Ile Ser Gly Tyr Glu
Val Leu Phe 145 150 155 160 Leu Thr Thr Tyr Ala Gln Ala Ala Asn Thr
His Leu Phe Leu Leu Lys 165 170 175 Asp Ala Gln Ile Tyr Gly Glu Glu
Trp Gly Tyr Glu Lys Glu Asp Ile 180 185 190 Ala Glu Phe Tyr Lys Arg
Gln Leu Lys Leu Thr Gln Glu Tyr Thr Asp 195 200 205 His Cys Val Lys
Trp Tyr Asn Val Gly Leu Asp Lys Leu Arg Gly Ser 210 215 220 Ser Tyr
Glu Ser Trp Val Asn Phe Asn Arg Tyr Arg Arg Glu Met Thr 225 230 235
240 Leu Thr Val Leu Asp Leu Ile Ala Leu Phe Pro Leu Tyr Asp Val Arg
245 250 255 Leu Tyr Pro Lys Glu Val Lys Thr Glu Leu Thr Arg Asp Val
Leu Thr 260 265 270 Asp Pro Ile Val Gly Val Asn Asn Leu Arg Gly Tyr
Gly Thr Thr Phe 275 280 285 Ser Asn Ile Glu Asn Tyr Ile Arg Lys Pro
His Leu Phe Asp Tyr Leu 290 295 300 His Arg Ile Gln Phe His Thr Arg
Phe Gln Pro Gly Tyr Tyr Gly Asn 305 310 315 320 Asp Ser Phe Asn Tyr
Trp Ser Gly Asn Tyr Val Ser Thr Arg Pro Ser 325 330 335 Ile Gly Ser
Asn Asp Ile Ile Thr Ser Pro Phe Tyr Gly Asn Lys Ser 340 345 350 Ser
Glu Pro Val Gln Asn Leu Glu Phe Asn Gly Glu Lys Val Tyr Arg 355 360
365 Ala Val Ala Asn Thr Asn Leu Ala Val Trp Pro Ser Ala Val Tyr Ser
370 375 380 Gly Val Thr Lys Val Glu Phe Ser Gln Tyr Asn Asp Gln Thr
Asp Glu 385 390 395 400 Ala Ser Thr Gln Thr Tyr Asp Ser Lys Arg Asn
Val Gly Ala Val Ser 405 410 415 Trp Asp Ser Ile Asp Gln Leu Pro Pro
Glu Thr Thr Asp Glu Pro Leu 420 425 430 Glu Lys Gly Tyr Ser His Gln
Leu Asn Tyr Val Met Cys Phe Leu Met 435 440 445 Gln Gly Ser Arg Gly
Thr Ile Pro Val Leu Thr Trp Thr His Lys Ser 450 455 460 Val Asp Phe
Phe Asn Met Ile Asp Ser Lys Lys Ile Thr Gln Leu Pro 465 470 475 480
Leu Thr Lys Ser Thr Asn Leu Gly Ser Gly Thr Ser Val Val Lys Gly 485
490 495 Pro Gly Phe Thr Gly Gly Asp Ile Leu Arg Arg Thr Ser Pro Gly
Gln 500 505 510 Ile Ser Thr Leu Arg Val Asn Ile Thr Ala Pro Leu Ser
Gln Arg Tyr 515 520 525 Arg Val Arg Ile Arg Tyr Ala Ser Thr Thr Asn
Leu Gln Phe His Thr 530 535 540 Ser Ile Asp Gly Arg Pro Ile Asn Gln
Gly Asn Phe Ser Ala Thr Met 545 550 555 560 Ser Ser Gly Ser Asn Leu
Gln Ser Gly Ser Phe Arg Thr Val Gly Phe 565 570 575 Thr Thr Pro Phe
Asn Phe Ser Asn Gly Ser Ser Val Phe Thr Leu Ser 580 585 590 Ala His
Val Phe Asn Ser Gly Asn Glu Val Tyr Ile Asp Arg Ile Glu 595 600 605
Phe Val Pro Ala Glu Val Thr 610 615 171809DNAArtificial
SequenceFR-12-cg-dm3 coding sequence 17atgtatgacg gccgacaaca
acaccgaggc ctggacagca gcaccaccaa ggacgtgatc 60cagaagggca tcagcgtggt
gggcgacctg ctgggcgtgg tgggcttccc cttcggcggc 120gccctggtga
gcttctacac caacttcctg aacaccatct ggcccagcga ggacccctgg
180aaggccttca tggagcaggt ggaggccctg atggaccaga agatcgccga
ctacgccaag 240aacaaggcac tggccgagct acagggcctc cagaacaacg
tggaggacta tgtgagcgcc 300ctgagcagct ggcagaagaa ccccgtctcg
agccgcaacc cccacagcca gggccgcatc 360cgcgagctgt tcagccaggc
cgagagccac ttccgcaaca gcatgcccag cttcgccatc 420agcggctacg
aggtgctgtt cctgaccacc tacgcccagg ccgccaacac ccacctgttc
480ctgctgaagg acgcccaaat ctacggagag gagtggggct acgagaagga
ggacatcgcc 540gagttctaca agcgccagct gaagctgacc caggagtaca
ccgaccactg cgtgaagtgg 600tacaacgtgg gtctagacaa gctccgcggc
agcagctacg agagctgggt gaacttcaac 660cgctaccgcc gcgagatgac
cctgaccgtg ctggacctga tcgccctgtt ccccctgtac 720gacgtgcgcc
tgtaccccaa ggaggtgaag accgagctga cccgcgacgt gctgaccgac
780cccatcgtgg gcgtgaacaa cctgcgcggc tacggcacca ccttcagcaa
catcgagaac 840tacatccgca agccccacct gttcgactac ctgcaccgca
tccagttcca cacgcgtttc 900cagcccggct actacggcaa cgacagcttc
aactactgga gcggcaacta cgtgagcacc 960cgccccagca tcggcagcaa
cgacatcatc accagcccct tctacggcaa caagagcagc 1020gagcccgtgc
agaaccttga gttcaacggc gagaaggtgt accgcgccgt ggctaacacc
1080aacctggccg tgtggccctc tgcagtgtac agcggcgtga ccaaggtgga
gttcagccag 1140tacaacgacc agaccgacga ggccagcacc cagacctacg
acagcaagcg caacgtgggc 1200gccgtgagct gggacagcat cgaccagctg
ccccccgaga ccaccgacga gcccctggag 1260aagggctaca gccaccagct
gaactacgtg atgtgcttcc tgatgcaggg cagccgcggc 1320accatccccg
tgctgacctg gacccacaag agcgtcgact tcttcaacat gatcgacagc
1380aagaagatca cccagctgcc cctgaccaag agcaccaacc tgggcagcgg
caccagcgtg 1440gtgaagggcc ccggcttcac cggcggcgac atcctgcgcc
gcaccagccc cggccagatc 1500agcaccctgc gcgtgaacat caccgccccc
ctgagccagc gctaccgcgt ccgcatccgc 1560tacgccagca ccaccaacct
gcagttccac accagcatcg acggccgccc catcaaccag 1620ggcaacttca
gcgccaccat gagcagcggc agcaacctgc agagcggcag cttccgcacc
1680gtgggcttca ccaccccctt caacttcagc aacggcagca gcgtgttcac
cctgagcgcc 1740cacgtgttca acagcggcaa cgaggtgtac atcgaccgca
tcgagttcgt gcccgccgag 1800gtgacctag 180918602PRTArtificial
SequenceFR-12-cg-dm3 protein 18Met Tyr Asp Gly Arg Gln Gln His Arg
Gly Leu Asp Ser Ser Thr Thr 1 5 10 15 Lys Asp Val Ile Gln Lys Gly
Ile Ser Val Val Gly Asp Leu Leu Gly 20 25 30 Val Val Gly Phe Pro
Phe Gly Gly Ala Leu Val Ser Phe Tyr Thr Asn 35 40 45 Phe Leu Asn
Thr Ile Trp Pro Ser Glu Asp Pro Trp Lys Ala Phe Met 50 55 60 Glu
Gln Val Glu Ala Leu Met Asp Gln Lys Ile Ala Asp Tyr Ala Lys 65 70
75 80 Asn Lys Ala Leu Ala Glu Leu Gln Gly Leu Gln Asn Asn Val Glu
Asp 85 90 95 Tyr Val Ser Ala Leu Ser Ser Trp Gln Lys Asn Pro Val
Ser Ser Arg 100 105 110 Asn Pro His Ser Gln Gly Arg Ile Arg Glu Leu
Phe Ser Gln Ala Glu 115 120 125 Ser His Phe Arg Asn Ser Met Pro Ser
Phe Ala Ile Ser Gly Tyr Glu 130 135 140 Val Leu Phe Leu Thr Thr Tyr
Ala Gln Ala Ala Asn Thr His Leu Phe 145 150 155 160 Leu Leu Lys Asp
Ala Gln Ile Tyr Gly Glu Glu Trp Gly Tyr Glu Lys 165 170 175 Glu Asp
Ile Ala Glu Phe Tyr Lys Arg Gln Leu Lys Leu Thr Gln Glu 180 185 190
Tyr Thr Asp His Cys Val Lys Trp Tyr Asn Val Gly Leu Asp Lys Leu 195
200 205 Arg Gly Ser Ser Tyr Glu Ser Trp Val Asn Phe Asn Arg Tyr Arg
Arg 210 215 220 Glu Met Thr Leu Thr Val Leu Asp Leu Ile Ala Leu Phe
Pro Leu Tyr 225 230 235 240 Asp Val Arg Leu Tyr Pro Lys Glu Val Lys
Thr Glu Leu Thr Arg Asp 245 250 255 Val Leu Thr Asp Pro Ile Val Gly
Val Asn Asn Leu Arg Gly Tyr Gly 260 265 270 Thr Thr Phe Ser Asn Ile
Glu Asn Tyr Ile Arg Lys Pro His Leu Phe 275 280 285 Asp Tyr Leu His
Arg Ile Gln Phe His Thr Arg Phe Gln Pro Gly Tyr 290 295 300 Tyr Gly
Asn Asp Ser Phe Asn Tyr Trp Ser Gly Asn Tyr Val Ser Thr 305 310 315
320 Arg Pro Ser Ile Gly Ser Asn Asp Ile Ile Thr Ser Pro Phe Tyr Gly
325 330 335 Asn Lys Ser Ser Glu Pro Val Gln Asn Leu Glu Phe Asn Gly
Glu Lys 340 345 350 Val Tyr Arg Ala Val Ala Asn Thr Asn Leu Ala Val
Trp Pro Ser Ala 355 360 365 Val Tyr Ser Gly Val Thr Lys Val Glu Phe
Ser Gln Tyr Asn Asp Gln 370 375 380 Thr Asp Glu Ala Ser Thr Gln Thr
Tyr Asp Ser Lys Arg Asn Val Gly 385 390 395 400 Ala Val Ser Trp Asp
Ser Ile Asp Gln Leu Pro Pro Glu Thr Thr Asp 405 410 415 Glu Pro Leu
Glu Lys Gly Tyr Ser His Gln Leu Asn Tyr Val Met Cys 420 425 430 Phe
Leu Met Gln Gly Ser Arg Gly Thr Ile Pro Val Leu Thr Trp Thr 435 440
445 His Lys Ser Val Asp Phe Phe Asn Met Ile Asp Ser Lys Lys Ile Thr
450 455 460 Gln Leu Pro Leu Thr Lys Ser Thr Asn Leu Gly Ser Gly Thr
Ser Val 465 470 475 480 Val Lys Gly Pro Gly Phe Thr Gly Gly Asp Ile
Leu Arg Arg Thr Ser 485 490 495 Pro Gly Gln Ile Ser Thr Leu Arg Val
Asn Ile Thr Ala Pro Leu Ser 500 505 510 Gln Arg Tyr Arg Val Arg Ile
Arg Tyr Ala Ser Thr Thr Asn Leu Gln 515 520 525 Phe His Thr Ser Ile
Asp Gly Arg Pro Ile Asn Gln Gly Asn Phe Ser 530 535 540 Ala Thr Met
Ser Ser Gly Ser Asn Leu Gln Ser Gly Ser Phe Arg Thr 545 550 555 560
Val Gly Phe Thr Thr Pro Phe Asn Phe Ser Asn Gly Ser Ser Val Phe 565
570 575 Thr Leu Ser Ala His Val Phe Asn Ser Gly Asn Glu Val Tyr Ile
Asp 580 585 590 Arg Ile Glu Phe Val Pro Ala Glu Val Thr 595 600 19
1941DNAArtificial Sequence9F-cg-del6 coding sequence 19atgtgtgctg
gtattcgccc tatgacggcc gacaacaaca ccgaggccct ggacagcagc 60accaccaagg
acgtgatcca gaagggcatc agcgtggtgg gcgacctgct gggcgtggtg
120ggcttcccct tcggcggcgc cctggtgagc ttctacacca acttcctgaa
caccatctgg 180cccagcgagg acccctggaa ggccttcatg gagcaggtgg
aggccctgat ggaccagaag 240atcgccgact acgccaagaa caaggcactg
gccgagctac agggcctcca gaacaacgtg 300gaggactatg tgagcgccct
gagcagctgg cagaagaacc ccgtctcgag ccgcaacccc 360cacagccagg
gccgcatccg cgagctgttc agccaggccg agagccactt ccgcaacagc
420atgcccagct tcgccatcag cggctacgag gtgctgttcc tgaccaccta
cgcccaggcc 480gccaacaccc acctgttcct gctgaaggac gcccaaatct
acggagagga gtggggctac 540gagaaggagg acatcgccga gttctacaag
cgccagctga agctgaccca ggagtacacc 600gaccactgcg tgaagtggta
caacgtgggt ctagacaagc tccgcggcag cagctacgag 660agctgggtga
acttcaaccg ctaccgccgc gagatgaccc tgaccgtgct ggacctgatc
720gccctgttcc ccctgtacga cgtgcgcctg taccccaagg aggtgaagac
cgagctgacc 780cgcgacgtgc tgaccgaccc catcgtgggc gtgaacaacc
tgcgcggcta cggcaccacc 840ttcagcaaca tcgagaacta catccgcaag
ccccacctgt tcgactacct gcaccgcatc 900cagttccaca cgcgtttcca
gcccggctac tacggcaacg acagcttcaa ctactggagc 960ggcaactacg
tgagcacccg ccccagcatc ggcagcaacg acatcatcac cagccccttc
1020tacggcaaca agagcagcga gcccgtgcag aaccttgagt tcaacggcga
gaaggtgtac 1080cgcgccgtgg ctaacaccaa cctggccgtg tggccctctg
cagtgtacag cggcgtgacc 1140aaggtggagt tcagccagta caacgaccag
accgacgagg ccagcaccca gacctacgac 1200agcaagcgca acgtgggcgc
cgtgagctgg gacagcatcg accagctgcc ccccgagacc 1260accgacgagc
ccctggagaa gggctacagc caccagctga actacgtgat gtgcttcctg
1320atgcagggca gccgcggcac catccccgtg ctgacctgga cccacaagag
cgtcgacttc 1380ttcaacatga tcgacagcaa gaagatcacc cagctgcccc
tgaccaagag caccaacctg 1440ggcagcggca ccagcgtggt gaagggcccc
ggcttcaccg gcggcgacat cctgcgccgc 1500accagccccg gccagatcag
caccctgcgc gtgaacatca ccgcccccct gagccagcgc 1560taccgcgtcc
gcatccgcta cgccagcacc accaacctgc agttccacac cagcatcgac
1620ggccgcccca tcaaccaggg caacttcagc gccaccatga gcagcggcag
caacctgcag 1680agcggcagct tccgcaccgt gggcttcacc acccccttca
acttcagcaa cggcagcagc 1740gtgttcaccc tgagcgccca cgtgttcaac
agcggcaacg aggtgtacat cgaccgcatc 1800gagttcgtgc ccgccgaggt
gaccttcgag gccgagtacg acctggagag ggctcagaag 1860gccgtgaacg
agctgttcac cagcagcaac cagatcggcc tgaagaccga cgtgaccgac
1920taccacatcg atcaggtgta g 194120646PRTArtificial
Sequence9F-cg-del6 protein 20Met Cys Ala Gly Ile Arg Pro Met Thr
Ala Asp Asn Asn Thr Glu Ala 1 5 10 15 Leu Asp Ser Ser Thr Thr Lys
Asp Val Ile Gln Lys Gly Ile Ser Val 20 25 30 Val Gly Asp Leu Leu
Gly Val Val Gly Phe Pro Phe Gly Gly Ala Leu 35 40 45 Val Ser Phe
Tyr Thr Asn Phe Leu Asn Thr Ile Trp Pro Ser Glu Asp 50 55 60 Pro
Trp Lys Ala Phe Met Glu Gln Val Glu Ala Leu Met Asp Gln Lys 65 70
75 80 Ile Ala Asp Tyr Ala Lys Asn Lys Ala Leu Ala Glu Leu Gln Gly
Leu 85 90 95 Gln Asn Asn Val Glu Asp Tyr Val Ser Ala Leu Ser Ser
Trp Gln Lys 100 105 110 Asn Pro Val Ser Ser Arg Asn Pro His Ser Gln
Gly Arg Ile Arg Glu 115 120 125 Leu Phe Ser Gln Ala Glu Ser His Phe
Arg Asn Ser Met Pro Ser Phe 130 135 140 Ala Ile Ser Gly Tyr Glu Val
Leu Phe Leu Thr Thr Tyr Ala Gln Ala 145 150 155 160 Ala Asn Thr His
Leu Phe Leu Leu Lys Asp Ala Gln Ile Tyr Gly Glu 165 170 175 Glu Trp
Gly Tyr Glu Lys Glu Asp Ile Ala Glu Phe Tyr Lys Arg Gln 180 185 190
Leu Lys Leu Thr Gln Glu Tyr Thr Asp His Cys Val Lys Trp Tyr Asn 195
200 205 Val Gly Leu Asp Lys Leu Arg Gly Ser Ser Tyr Glu Ser Trp Val
Asn 210 215 220 Phe Asn Arg Tyr Arg Arg Glu Met Thr Leu Thr Val Leu
Asp Leu Ile 225 230 235 240 Ala Leu Phe Pro Leu Tyr Asp Val Arg Leu
Tyr Pro Lys Glu Val Lys 245 250 255 Thr Glu Leu Thr Arg Asp Val Leu
Thr Asp Pro Ile Val Gly Val Asn 260 265 270 Asn Leu Arg Gly Tyr Gly
Thr Thr Phe Ser Asn Ile Glu Asn Tyr Ile 275 280 285 Arg Lys Pro His
Leu Phe Asp Tyr Leu His Arg Ile Gln Phe His Thr 290 295 300 Arg Phe
Gln Pro Gly Tyr Tyr Gly Asn Asp Ser Phe Asn Tyr Trp Ser 305 310 315
320 Gly Asn Tyr Val Ser Thr Arg Pro Ser Ile Gly Ser Asn Asp Ile Ile
325 330 335 Thr Ser Pro Phe Tyr Gly Asn Lys Ser Ser Glu Pro Val Gln
Asn Leu 340 345 350 Glu Phe Asn Gly Glu Lys Val Tyr Arg Ala Val Ala
Asn Thr Asn Leu 355 360 365 Ala Val Trp Pro Ser Ala
Val Tyr Ser Gly Val Thr Lys Val Glu Phe 370 375 380 Ser Gln Tyr Asn
Asp Gln Thr Asp Glu Ala Ser Thr Gln Thr Tyr Asp 385 390 395 400 Ser
Lys Arg Asn Val Gly Ala Val Ser Trp Asp Ser Ile Asp Gln Leu 405 410
415 Pro Pro Glu Thr Thr Asp Glu Pro Leu Glu Lys Gly Tyr Ser His Gln
420 425 430 Leu Asn Tyr Val Met Cys Phe Leu Met Gln Gly Ser Arg Gly
Thr Ile 435 440 445 Pro Val Leu Thr Trp Thr His Lys Ser Val Asp Phe
Phe Asn Met Ile 450 455 460 Asp Ser Lys Lys Ile Thr Gln Leu Pro Leu
Thr Lys Ser Thr Asn Leu 465 470 475 480 Gly Ser Gly Thr Ser Val Val
Lys Gly Pro Gly Phe Thr Gly Gly Asp 485 490 495 Ile Leu Arg Arg Thr
Ser Pro Gly Gln Ile Ser Thr Leu Arg Val Asn 500 505 510 Ile Thr Ala
Pro Leu Ser Gln Arg Tyr Arg Val Arg Ile Arg Tyr Ala 515 520 525 Ser
Thr Thr Asn Leu Gln Phe His Thr Ser Ile Asp Gly Arg Pro Ile 530 535
540 Asn Gln Gly Asn Phe Ser Ala Thr Met Ser Ser Gly Ser Asn Leu Gln
545 550 555 560 Ser Gly Ser Phe Arg Thr Val Gly Phe Thr Thr Pro Phe
Asn Phe Ser 565 570 575 Asn Gly Ser Ser Val Phe Thr Leu Ser Ala His
Val Phe Asn Ser Gly 580 585 590 Asn Glu Val Tyr Ile Asp Arg Ile Glu
Phe Val Pro Ala Glu Val Thr 595 600 605 Phe Glu Ala Glu Tyr Asp Leu
Glu Arg Ala Gln Lys Ala Val Asn Glu 610 615 620 Leu Phe Thr Ser Ser
Asn Gln Ile Gly Leu Lys Thr Asp Val Thr Asp 625 630 635 640 Tyr His
Ile Asp Gln Val 645 211845DNAArtificial SequenceFR-cg-dm3 coding
sequence 21atgactagta acggccgcca gtgtgctggt attcgccctt atgacggccg
acaacaacac 60cgaggcctgg acagcagcac caccaaggac gtgatccaga agggcatcag
cgtggtgggc 120gacctgctgg gcgtggtggg cttccccttc ggcggcgccc
tggtgagctt ctacaccaac 180ttcctgaaca ccatctggcc cagcgaggac
ccctggaagg ccttcatgga gcaggtggag 240gccctgatgg accagaagat
cgccgactac gccaagaaca aggcactggc cgagctacag 300ggcctccaga
acaacgtgga ggactatgtg agcgccctga gcagctggca gaagaacccc
360gtctcgagcc gcaaccccca cagccagggc cgcatccgcg agctgttcag
ccaggccgag 420agccacttcc gcaacagcat gcccagcttc gccatcagcg
gctacgaggt gctgttcctg 480accacctacg cccaggccgc caacacccac
ctgttcctgc tgaaggacgc ccaaatctac 540ggagaggagt ggggctacga
gaaggaggac atcgccgagt tctacaagcg ccagctgaag 600ctgacccagg
agtacaccga ccactgcgtg aagtggtaca acgtgggtct agacaagctc
660cgcggcagca gctacgagag ctgggtgaac ttcaaccgct accgccgcga
gatgaccctg 720accgtgctgg acctgatcgc cctgttcccc ctgtacgacg
tgcgcctgta ccccaaggag 780gtgaagaccg agctgacccg cgacgtgctg
accgacccca tcgtgggcgt gaacaacctg 840cgcggctacg gcaccacctt
cagcaacatc gagaactaca tccgcaagcc ccacctgttc 900gactacctgc
accgcatcca gttccacacg cgtttccagc ccggctacta cggcaacgac
960agcttcaact actggagcgg caactacgtg agcacccgcc ccagcatcgg
cagcaacgac 1020atcatcacca gccccttcta cggcaacaag agcagcgagc
ccgtgcagaa ccttgagttc 1080aacggcgaga aggtgtaccg cgccgtggct
aacaccaacc tggccgtgtg gccctctgca 1140gtgtacagcg gcgtgaccaa
ggtggagttc agccagtaca acgaccagac cgacgaggcc 1200agcacccaga
cctacgacag caagcgcaac gtgggcgccg tgagctggga cagcatcgac
1260cagctgcccc ccgagaccac cgacgagccc ctggagaagg gctacagcca
ccagctgaac 1320tacgtgatgt gcttcctgat gcagggcagc cgcggcacca
tccccgtgct gacctggacc 1380cacaagagcg tcgacttctt caacatgatc
gacagcaaga agatcaccca gctgcccctg 1440accaagagca ccaacctggg
cagcggcacc agcgtggtga agggccccgg cttcaccggc 1500ggcgacatcc
tgcgccgcac cagccccggc cagatcagca ccctgcgcgt gaacatcacc
1560gcccccctga gccagcgcta ccgcgtccgc atccgctacg ccagcaccac
caacctgcag 1620ttccacacca gcatcgacgg ccgccccatc aaccagggca
acttcagcgc caccatgagc 1680agcggcagca acctgcagag cggcagcttc
cgcaccgtgg gcttcaccac ccccttcaac 1740ttcagcaacg gcagcagcgt
gttcaccctg agcgcccacg tgttcaacag cggcaacgag 1800gtgtacatcg
accgcatcga gttcgtgccc gccgaggtga cctag 184522614PRTArtificial
SequenceFR-cg-dm3 protein 22Met Thr Ser Asn Gly Arg Gln Cys Ala Gly
Ile Arg Pro Tyr Asp Gly 1 5 10 15 Arg Gln Gln His Arg Gly Leu Asp
Ser Ser Thr Thr Lys Asp Val Ile 20 25 30 Gln Lys Gly Ile Ser Val
Val Gly Asp Leu Leu Gly Val Val Gly Phe 35 40 45 Pro Phe Gly Gly
Ala Leu Val Ser Phe Tyr Thr Asn Phe Leu Asn Thr 50 55 60 Ile Trp
Pro Ser Glu Asp Pro Trp Lys Ala Phe Met Glu Gln Val Glu 65 70 75 80
Ala Leu Met Asp Gln Lys Ile Ala Asp Tyr Ala Lys Asn Lys Ala Leu 85
90 95 Ala Glu Leu Gln Gly Leu Gln Asn Asn Val Glu Asp Tyr Val Ser
Ala 100 105 110 Leu Ser Ser Trp Gln Lys Asn Pro Val Ser Ser Arg Asn
Pro His Ser 115 120 125 Gln Gly Arg Ile Arg Glu Leu Phe Ser Gln Ala
Glu Ser His Phe Arg 130 135 140 Asn Ser Met Pro Ser Phe Ala Ile Ser
Gly Tyr Glu Val Leu Phe Leu 145 150 155 160 Thr Thr Tyr Ala Gln Ala
Ala Asn Thr His Leu Phe Leu Leu Lys Asp 165 170 175 Ala Gln Ile Tyr
Gly Glu Glu Trp Gly Tyr Glu Lys Glu Asp Ile Ala 180 185 190 Glu Phe
Tyr Lys Arg Gln Leu Lys Leu Thr Gln Glu Tyr Thr Asp His 195 200 205
Cys Val Lys Trp Tyr Asn Val Gly Leu Asp Lys Leu Arg Gly Ser Ser 210
215 220 Tyr Glu Ser Trp Val Asn Phe Asn Arg Tyr Arg Arg Glu Met Thr
Leu 225 230 235 240 Thr Val Leu Asp Leu Ile Ala Leu Phe Pro Leu Tyr
Asp Val Arg Leu 245 250 255 Tyr Pro Lys Glu Val Lys Thr Glu Leu Thr
Arg Asp Val Leu Thr Asp 260 265 270 Pro Ile Val Gly Val Asn Asn Leu
Arg Gly Tyr Gly Thr Thr Phe Ser 275 280 285 Asn Ile Glu Asn Tyr Ile
Arg Lys Pro His Leu Phe Asp Tyr Leu His 290 295 300 Arg Ile Gln Phe
His Thr Arg Phe Gln Pro Gly Tyr Tyr Gly Asn Asp 305 310 315 320 Ser
Phe Asn Tyr Trp Ser Gly Asn Tyr Val Ser Thr Arg Pro Ser Ile 325 330
335 Gly Ser Asn Asp Ile Ile Thr Ser Pro Phe Tyr Gly Asn Lys Ser Ser
340 345 350 Glu Pro Val Gln Asn Leu Glu Phe Asn Gly Glu Lys Val Tyr
Arg Ala 355 360 365 Val Ala Asn Thr Asn Leu Ala Val Trp Pro Ser Ala
Val Tyr Ser Gly 370 375 380 Val Thr Lys Val Glu Phe Ser Gln Tyr Asn
Asp Gln Thr Asp Glu Ala 385 390 395 400 Ser Thr Gln Thr Tyr Asp Ser
Lys Arg Asn Val Gly Ala Val Ser Trp 405 410 415 Asp Ser Ile Asp Gln
Leu Pro Pro Glu Thr Thr Asp Glu Pro Leu Glu 420 425 430 Lys Gly Tyr
Ser His Gln Leu Asn Tyr Val Met Cys Phe Leu Met Gln 435 440 445 Gly
Ser Arg Gly Thr Ile Pro Val Leu Thr Trp Thr His Lys Ser Val 450 455
460 Asp Phe Phe Asn Met Ile Asp Ser Lys Lys Ile Thr Gln Leu Pro Leu
465 470 475 480 Thr Lys Ser Thr Asn Leu Gly Ser Gly Thr Ser Val Val
Lys Gly Pro 485 490 495 Gly Phe Thr Gly Gly Asp Ile Leu Arg Arg Thr
Ser Pro Gly Gln Ile 500 505 510 Ser Thr Leu Arg Val Asn Ile Thr Ala
Pro Leu Ser Gln Arg Tyr Arg 515 520 525 Val Arg Ile Arg Tyr Ala Ser
Thr Thr Asn Leu Gln Phe His Thr Ser 530 535 540 Ile Asp Gly Arg Pro
Ile Asn Gln Gly Asn Phe Ser Ala Thr Met Ser 545 550 555 560 Ser Gly
Ser Asn Leu Gln Ser Gly Ser Phe Arg Thr Val Gly Phe Thr 565 570 575
Thr Pro Phe Asn Phe Ser Asn Gly Ser Ser Val Phe Thr Leu Ser Ala 580
585 590 His Val Phe Asn Ser Gly Asn Glu Val Tyr Ile Asp Arg Ile Glu
Phe 595 600 605 Val Pro Ala Glu Val Thr 610 231845DNAArtificial
Sequence9F-cg-dm3 coding sequence 23atgactagta acggccgcca
gtgtgctggt attcgcccta tgacggccga caacaacacc 60gaggccctgg acagcagcac
caccaaggac gtgatccaga agggcatcag cgtggtgggc 120gacctgctgg
gcgtggtggg cttccccttc ggcggcgccc tggtgagctt ctacaccaac
180ttcctgaaca ccatctggcc cagcgaggac ccctggaagg ccttcatgga
gcaggtggag 240gccctgatgg accagaagat cgccgactac gccaagaaca
aggcactggc cgagctacag 300ggcctccaga acaacgtgga ggactatgtg
agcgccctga gcagctggca gaagaacccc 360gtctcgagcc gcaaccccca
cagccagggc cgcatccgcg agctgttcag ccaggccgag 420agccacttcc
gcaacagcat gcccagcttc gccatcagcg gctacgaggt gctgttcctg
480accacctacg cccaggccgc caacacccac ctgttcctgc tgaaggacgc
ccaaatctac 540ggagaggagt ggggctacga gaaggaggac atcgccgagt
tctacaagcg ccagctgaag 600ctgacccagg agtacaccga ccactgcgtg
aagtggtaca acgtgggtct agacaagctc 660cgcggcagca gctacgagag
ctgggtgaac ttcaaccgct accgccgcga gatgaccctg 720accgtgctgg
acctgatcgc cctgttcccc ctgtacgacg tgcgcctgta ccccaaggag
780gtgaagaccg agctgacccg cgacgtgctg accgacccca tcgtgggcgt
gaacaacctg 840cgcggctacg gcaccacctt cagcaacatc gagaactaca
tccgcaagcc ccacctgttc 900gactacctgc accgcatcca gttccacacg
cgtttccagc ccggctacta cggcaacgac 960agcttcaact actggagcgg
caactacgtg agcacccgcc ccagcatcgg cagcaacgac 1020atcatcacca
gccccttcta cggcaacaag agcagcgagc ccgtgcagaa ccttgagttc
1080aacggcgaga aggtgtaccg cgccgtggct aacaccaacc tggccgtgtg
gccctctgca 1140gtgtacagcg gcgtgaccaa ggtggagttc agccagtaca
acgaccagac cgacgaggcc 1200agcacccaga cctacgacag caagcgcaac
gtgggcgccg tgagctggga cagcatcgac 1260cagctgcccc ccgagaccac
cgacgagccc ctggagaagg gctacagcca ccagctgaac 1320tacgtgatgt
gcttcctgat gcagggcagc cgcggcacca tccccgtgct gacctggacc
1380cacaagagcg tcgacttctt caacatgatc gacagcaaga agatcaccca
gctgcccctg 1440accaagagca ccaacctggg cagcggcacc agcgtggtga
agggccccgg cttcaccggc 1500ggcgacatcc tgcgccgcac cagccccggc
cagatcagca ccctgcgcgt gaacatcacc 1560gcccccctga gccagcgcta
ccgcgtccgc atccgctacg ccagcaccac caacctgcag 1620ttccacacca
gcatcgacgg ccgccccatc aaccagggca acttcagcgc caccatgagc
1680agcggcagca acctgcagag cggcagcttc cgcaccgtgg gcttcaccac
ccccttcaac 1740ttcagcaacg gcagcagcgt gttcaccctg agcgcccacg
tgttcaacag cggcaacgag 1800gtgtacatcg accgcatcga gttcgtgccc
gccgaggtga cctag 184524614PRTArtificial Sequence9F-cg-dm3 protein
24Met Thr Ser Asn Gly Arg Gln Cys Ala Gly Ile Arg Pro Met Thr Ala 1
5 10 15 Asp Asn Asn Thr Glu Ala Leu Asp Ser Ser Thr Thr Lys Asp Val
Ile 20 25 30 Gln Lys Gly Ile Ser Val Val Gly Asp Leu Leu Gly Val
Val Gly Phe 35 40 45 Pro Phe Gly Gly Ala Leu Val Ser Phe Tyr Thr
Asn Phe Leu Asn Thr 50 55 60 Ile Trp Pro Ser Glu Asp Pro Trp Lys
Ala Phe Met Glu Gln Val Glu 65 70 75 80 Ala Leu Met Asp Gln Lys Ile
Ala Asp Tyr Ala Lys Asn Lys Ala Leu 85 90 95 Ala Glu Leu Gln Gly
Leu Gln Asn Asn Val Glu Asp Tyr Val Ser Ala 100 105 110 Leu Ser Ser
Trp Gln Lys Asn Pro Val Ser Ser Arg Asn Pro His Ser 115 120 125 Gln
Gly Arg Ile Arg Glu Leu Phe Ser Gln Ala Glu Ser His Phe Arg 130 135
140 Asn Ser Met Pro Ser Phe Ala Ile Ser Gly Tyr Glu Val Leu Phe Leu
145 150 155 160 Thr Thr Tyr Ala Gln Ala Ala Asn Thr His Leu Phe Leu
Leu Lys Asp 165 170 175 Ala Gln Ile Tyr Gly Glu Glu Trp Gly Tyr Glu
Lys Glu Asp Ile Ala 180 185 190 Glu Phe Tyr Lys Arg Gln Leu Lys Leu
Thr Gln Glu Tyr Thr Asp His 195 200 205 Cys Val Lys Trp Tyr Asn Val
Gly Leu Asp Lys Leu Arg Gly Ser Ser 210 215 220 Tyr Glu Ser Trp Val
Asn Phe Asn Arg Tyr Arg Arg Glu Met Thr Leu 225 230 235 240 Thr Val
Leu Asp Leu Ile Ala Leu Phe Pro Leu Tyr Asp Val Arg Leu 245 250 255
Tyr Pro Lys Glu Val Lys Thr Glu Leu Thr Arg Asp Val Leu Thr Asp 260
265 270 Pro Ile Val Gly Val Asn Asn Leu Arg Gly Tyr Gly Thr Thr Phe
Ser 275 280 285 Asn Ile Glu Asn Tyr Ile Arg Lys Pro His Leu Phe Asp
Tyr Leu His 290 295 300 Arg Ile Gln Phe His Thr Arg Phe Gln Pro Gly
Tyr Tyr Gly Asn Asp 305 310 315 320 Ser Phe Asn Tyr Trp Ser Gly Asn
Tyr Val Ser Thr Arg Pro Ser Ile 325 330 335 Gly Ser Asn Asp Ile Ile
Thr Ser Pro Phe Tyr Gly Asn Lys Ser Ser 340 345 350 Glu Pro Val Gln
Asn Leu Glu Phe Asn Gly Glu Lys Val Tyr Arg Ala 355 360 365 Val Ala
Asn Thr Asn Leu Ala Val Trp Pro Ser Ala Val Tyr Ser Gly 370 375 380
Val Thr Lys Val Glu Phe Ser Gln Tyr Asn Asp Gln Thr Asp Glu Ala 385
390 395 400 Ser Thr Gln Thr Tyr Asp Ser Lys Arg Asn Val Gly Ala Val
Ser Trp 405 410 415 Asp Ser Ile Asp Gln Leu Pro Pro Glu Thr Thr Asp
Glu Pro Leu Glu 420 425 430 Lys Gly Tyr Ser His Gln Leu Asn Tyr Val
Met Cys Phe Leu Met Gln 435 440 445 Gly Ser Arg Gly Thr Ile Pro Val
Leu Thr Trp Thr His Lys Ser Val 450 455 460 Asp Phe Phe Asn Met Ile
Asp Ser Lys Lys Ile Thr Gln Leu Pro Leu 465 470 475 480 Thr Lys Ser
Thr Asn Leu Gly Ser Gly Thr Ser Val Val Lys Gly Pro 485 490 495 Gly
Phe Thr Gly Gly Asp Ile Leu Arg Arg Thr Ser Pro Gly Gln Ile 500 505
510 Ser Thr Leu Arg Val Asn Ile Thr Ala Pro Leu Ser Gln Arg Tyr Arg
515 520 525 Val Arg Ile Arg Tyr Ala Ser Thr Thr Asn Leu Gln Phe His
Thr Ser 530 535 540 Ile Asp Gly Arg Pro Ile Asn Gln Gly Asn Phe Ser
Ala Thr Met Ser 545 550 555 560 Ser Gly Ser Asn Leu Gln Ser Gly Ser
Phe Arg Thr Val Gly Phe Thr 565 570 575 Thr Pro Phe Asn Phe Ser Asn
Gly Ser Ser Val Phe Thr Leu Ser Ala 580 585 590 His Val Phe Asn Ser
Gly Asn Glu Val Tyr Ile Asp Arg Ile Glu Phe 595 600 605 Val Pro Ala
Glu Val Thr 610 251863DNAArtificial SequenceB8a coding sequence
25atgacggccg acaacaacac cgaggccctg gacagcagca ccaccaagga cgtgatccag
60aagggcatca gcgtggtggg cgacctgctg ggcgtggtgg gcttcccctt cggcggcgcc
120ctggtgagct tctacaccaa cttcctgaac accatctggc ccagcgagga
cccctggaag 180gccttcatgg agcaggtgga ggccctgatg gaccagaaga
tcgccgacta cgccaagaac 240aaggcactgg ccgagctaca gggcctccag
aacaacgtgg aggactatgt gagcgccctg 300agcagctggc agaagaaccc
cgctgcaccg ttccgcaacc cccacagcca gggccgcatc 360cgcgagctgt
tcagccaggc cgagagccac ttccgcaaca gcatgcccag cttcgccatc
420agcggctacg aggtgctgtt cctgaccacc tacgcccagg ccgccaacac
ccacctgttc 480ctgctgaagg acgcccaaat ctacggagag gagtggggct
acgagaagga ggacatcgcc 540gagttctaca agcgccagct gaagctgacc
caggagtaca ccgaccactg cgtgaagtgg 600tacaacgtgg gtctagacaa
gctccgcggc agcagctacg agagctgggt gaacttcaac 660cgctaccgcc
gcgagatgac cctgaccgtg ctggacctga tcgccctgtt ccccctgtac
720gacgtgcgcc tgtaccccaa ggaggtgaag accgagctga cccgcgacgt
gctgaccgac 780cccatcgtgg gcgtgaacaa cctgcgcggc tacggcacca
ccttcagcaa catcgagaac 840tacatccgca agccccacct gttcgactac
ctgcaccgca tccagttcca cacgcgtttc 900cagcccggct actacggcaa
cgacagcttc aactactgga gcggcaacta cgtgagcacc 960cgccccagca
tcggcagcaa cgacatcatc accagcccct tctacggcaa caagagcagc
1020gagcccgtgc agaaccttga gttcaacggc gagaaggtgt accgcgccgt
ggctaacacc 1080aacctggccg tgtggccctc tgcagtgtac agcggcgtga
ccaaggtgga gttcagccag 1140tacaacgacc agaccgacga ggccagcacc
cagacctacg acagcaagcg caacgtgggc 1200gccgtgagct gggacagcat
cgaccagctg ccccccgaga ccaccgacga gcccctggag 1260aagggctaca
gccaccagct gaactacgtg atgtgcttcc
tgatgcaggg cagccgcggc 1320accatccccg tgctgacctg gacccacaag
agcgtcgact tcttcaacat gatcgacagc 1380aagaagatca cccagctgcc
cctggtgaag gccagcgagc tgccccaggg caccaccgtg 1440gttcgcggcc
ccggcttcac cggaggcgac atcctgcgac gcaccaacac cggcggcttc
1500ggccccatcc gcgtgaccgt gaacggcccc ctgacccagc gctaccgcat
cggcttccgc 1560tacgccagca ccgtggactt cgacttcttc gtgagccgcg
gcggcaccac cgtgaacaac 1620ttccgcttcc tgcgcaccat gaacagcggc
gacgagctga agtacggcaa cttcgtgcgc 1680cgcgccttca ccaccccctt
caccttcacc cagatccagg acatcatccg caccagcatc 1740cagggcctga
gcggcaacgg cgaggtgtac atcgacaaga tcgagatcat ccccgtgacc
1800gccaccttcg aggccgagta cgacctagag cgcgcccagg aggccgtgaa
cgccctgttc 1860tag 186326620PRTArtificial SequenceB8a protein 26Met
Thr Ala Asp Asn Asn Thr Glu Ala Leu Asp Ser Ser Thr Thr Lys 1 5 10
15 Asp Val Ile Gln Lys Gly Ile Ser Val Val Gly Asp Leu Leu Gly Val
20 25 30 Val Gly Phe Pro Phe Gly Gly Ala Leu Val Ser Phe Tyr Thr
Asn Phe 35 40 45 Leu Asn Thr Ile Trp Pro Ser Glu Asp Pro Trp Lys
Ala Phe Met Glu 50 55 60 Gln Val Glu Ala Leu Met Asp Gln Lys Ile
Ala Asp Tyr Ala Lys Asn 65 70 75 80 Lys Ala Leu Ala Glu Leu Gln Gly
Leu Gln Asn Asn Val Glu Asp Tyr 85 90 95 Val Ser Ala Leu Ser Ser
Trp Gln Lys Asn Pro Ala Ala Pro Phe Arg 100 105 110 Asn Pro His Ser
Gln Gly Arg Ile Arg Glu Leu Phe Ser Gln Ala Glu 115 120 125 Ser His
Phe Arg Asn Ser Met Pro Ser Phe Ala Ile Ser Gly Tyr Glu 130 135 140
Val Leu Phe Leu Thr Thr Tyr Ala Gln Ala Ala Asn Thr His Leu Phe 145
150 155 160 Leu Leu Lys Asp Ala Gln Ile Tyr Gly Glu Glu Trp Gly Tyr
Glu Lys 165 170 175 Glu Asp Ile Ala Glu Phe Tyr Lys Arg Gln Leu Lys
Leu Thr Gln Glu 180 185 190 Tyr Thr Asp His Cys Val Lys Trp Tyr Asn
Val Gly Leu Asp Lys Leu 195 200 205 Arg Gly Ser Ser Tyr Glu Ser Trp
Val Asn Phe Asn Arg Tyr Arg Arg 210 215 220 Glu Met Thr Leu Thr Val
Leu Asp Leu Ile Ala Leu Phe Pro Leu Tyr 225 230 235 240 Asp Val Arg
Leu Tyr Pro Lys Glu Val Lys Thr Glu Leu Thr Arg Asp 245 250 255 Val
Leu Thr Asp Pro Ile Val Gly Val Asn Asn Leu Arg Gly Tyr Gly 260 265
270 Thr Thr Phe Ser Asn Ile Glu Asn Tyr Ile Arg Lys Pro His Leu Phe
275 280 285 Asp Tyr Leu His Arg Ile Gln Phe His Thr Arg Phe Gln Pro
Gly Tyr 290 295 300 Tyr Gly Asn Asp Ser Phe Asn Tyr Trp Ser Gly Asn
Tyr Val Ser Thr 305 310 315 320 Arg Pro Ser Ile Gly Ser Asn Asp Ile
Ile Thr Ser Pro Phe Tyr Gly 325 330 335 Asn Lys Ser Ser Glu Pro Val
Gln Asn Leu Glu Phe Asn Gly Glu Lys 340 345 350 Val Tyr Arg Ala Val
Ala Asn Thr Asn Leu Ala Val Trp Pro Ser Ala 355 360 365 Val Tyr Ser
Gly Val Thr Lys Val Glu Phe Ser Gln Tyr Asn Asp Gln 370 375 380 Thr
Asp Glu Ala Ser Thr Gln Thr Tyr Asp Ser Lys Arg Asn Val Gly 385 390
395 400 Ala Val Ser Trp Asp Ser Ile Asp Gln Leu Pro Pro Glu Thr Thr
Asp 405 410 415 Glu Pro Leu Glu Lys Gly Tyr Ser His Gln Leu Asn Tyr
Val Met Cys 420 425 430 Phe Leu Met Gln Gly Ser Arg Gly Thr Ile Pro
Val Leu Thr Trp Thr 435 440 445 His Lys Ser Val Asp Phe Phe Asn Met
Ile Asp Ser Lys Lys Ile Thr 450 455 460 Gln Leu Pro Leu Val Lys Ala
Ser Glu Leu Pro Gln Gly Thr Thr Val 465 470 475 480 Val Arg Gly Pro
Gly Phe Thr Gly Gly Asp Ile Leu Arg Arg Thr Asn 485 490 495 Thr Gly
Gly Phe Gly Pro Ile Arg Val Thr Val Asn Gly Pro Leu Thr 500 505 510
Gln Arg Tyr Arg Ile Gly Phe Arg Tyr Ala Ser Thr Val Asp Phe Asp 515
520 525 Phe Phe Val Ser Arg Gly Gly Thr Thr Val Asn Asn Phe Arg Phe
Leu 530 535 540 Arg Thr Met Asn Ser Gly Asp Glu Leu Lys Tyr Gly Asn
Phe Val Arg 545 550 555 560 Arg Ala Phe Thr Thr Pro Phe Thr Phe Thr
Gln Ile Gln Asp Ile Ile 565 570 575 Arg Thr Ser Ile Gln Gly Leu Ser
Gly Asn Gly Glu Val Tyr Ile Asp 580 585 590 Lys Ile Glu Ile Ile Pro
Val Thr Ala Thr Phe Glu Ala Glu Tyr Asp 595 600 605 Leu Glu Arg Ala
Gln Glu Ala Val Asn Ala Leu Phe 610 615 620 271902DNAArtificial
Sequence5*B8a coding sequence 27atgactagta acggccgcca gtgtgctggt
attcgccctt atgacggccg acaacaacac 60cgaggcctgg acagcagcac caccaaggac
gtgatccaga agggcatcag cgtggtgggc 120gacctgctgg gcgtggtggg
cttccccttc ggcggcgccc tggtgagctt ctacaccaac 180ttcctgaaca
ccatctggcc cagcgaggac ccctggaagg ccttcatgga gcaggtggag
240gccctgatgg accagaagat cgccgactac gccaagaaca aggcactggc
cgagctacag 300ggcctccaga acaacgtgga ggactatgtg agcgccctga
gcagctggca gaagaacccc 360gctgcaccgt tccgcaaccc ccacagccag
ggccgcatcc gcgagctgtt cagccaggcc 420gagagccact tccgcaacag
catgcccagc ttcgccatca gcggctacga ggtgctgttc 480ctgaccacct
acgcccaggc cgccaacacc cacctgttcc tgctgaagga cgcccaaatc
540tacggagagg agtggggcta cgagaaggag gacatcgccg agttctacaa
gcgccagctg 600aagctgaccc aggagtacac cgaccactgc gtgaagtggt
acaacgtggg tctagacaag 660ctccgcggca gcagctacga gagctgggtg
aacttcaacc gctaccgccg cgagatgacc 720ctgaccgtgc tggacctgat
cgccctgttc cccctgtacg acgtgcgcct gtaccccaag 780gaggtgaaga
ccgagctgac ccgcgacgtg ctgaccgacc ccatcgtggg cgtgaacaac
840ctgcgcggct acggcaccac cttcagcaac atcgagaact acatccgcaa
gccccacctg 900ttcgactacc tgcaccgcat ccagttccac acgcgtttcc
agcccggcta ctacggcaac 960gacagcttca actactggag cggcaactac
gtgagcaccc gccccagcat cggcagcaac 1020gacatcatca ccagcccctt
ctacggcaac aagagcagcg agcccgtgca gaaccttgag 1080ttcaacggcg
agaaggtgta ccgcgccgtg gctaacacca acctggccgt gtggccctct
1140gcagtgtaca gcggcgtgac caaggtggag ttcagccagt acaacgacca
gaccgacgag 1200gccagcaccc agacctacga cagcaagcgc aacgtgggcg
ccgtgagctg ggacagcatc 1260gaccagctgc cccccgagac caccgacgag
cccctggaga agggctacag ccaccagctg 1320aactacgtga tgtgcttcct
gatgcagggc agccgcggca ccatccccgt gctgacctgg 1380acccacaaga
gcgtcgactt cttcaacatg atcgacagca agaagatcac ccagctgccc
1440ctggtgaagg ccagcgagct gccccagggc accaccgtgg ttcgcggccc
cggcttcacc 1500ggaggcgaca tcctgcgacg caccaacacc ggcggcttcg
gccccatccg cgtgaccgtg 1560aacggccccc tgacccagcg ctaccgcatc
ggcttccgct acgccagcac cgtggacttc 1620gacttcttcg tgagccgcgg
cggcaccacc gtgaacaact tccgcttcct gcgcaccatg 1680aacagcggcg
acgagctgaa gtacggcaac ttcgtgcgcc gcgccttcac cacccccttc
1740accttcaccc agatccagga catcatccgc accagcatcc agggcctgag
cggcaacggc 1800gaggtgtaca tcgacaagat cgagatcatc cccgtgaccg
ccaccttcga ggccgagtac 1860gacctagagc gcgcccagga ggccgtgaac
gccctgttct ag 190228633PRTArtificial Sequence5*B8a Protein 28Met
Thr Ser Asn Gly Arg Gln Cys Ala Gly Ile Arg Pro Tyr Asp Gly 1 5 10
15 Arg Gln Gln His Arg Gly Leu Asp Ser Ser Thr Thr Lys Asp Val Ile
20 25 30 Gln Lys Gly Ile Ser Val Val Gly Asp Leu Leu Gly Val Val
Gly Phe 35 40 45 Pro Phe Gly Gly Ala Leu Val Ser Phe Tyr Thr Asn
Phe Leu Asn Thr 50 55 60 Ile Trp Pro Ser Glu Asp Pro Trp Lys Ala
Phe Met Glu Gln Val Glu 65 70 75 80 Ala Leu Met Asp Gln Lys Ile Ala
Asp Tyr Ala Lys Asn Lys Ala Leu 85 90 95 Ala Glu Leu Gln Gly Leu
Gln Asn Asn Val Glu Asp Tyr Val Ser Ala 100 105 110 Leu Ser Ser Trp
Gln Lys Asn Pro Ala Ala Pro Phe Arg Asn Pro His 115 120 125 Ser Gln
Gly Arg Ile Arg Glu Leu Phe Ser Gln Ala Glu Ser His Phe 130 135 140
Arg Asn Ser Met Pro Ser Phe Ala Ile Ser Gly Tyr Glu Val Leu Phe 145
150 155 160 Leu Thr Thr Tyr Ala Gln Ala Ala Asn Thr His Leu Phe Leu
Leu Lys 165 170 175 Asp Ala Gln Ile Tyr Gly Glu Glu Trp Gly Tyr Glu
Lys Glu Asp Ile 180 185 190 Ala Glu Phe Tyr Lys Arg Gln Leu Lys Leu
Thr Gln Glu Tyr Thr Asp 195 200 205 His Cys Val Lys Trp Tyr Asn Val
Gly Leu Asp Lys Leu Arg Gly Ser 210 215 220 Ser Tyr Glu Ser Trp Val
Asn Phe Asn Arg Tyr Arg Arg Glu Met Thr 225 230 235 240 Leu Thr Val
Leu Asp Leu Ile Ala Leu Phe Pro Leu Tyr Asp Val Arg 245 250 255 Leu
Tyr Pro Lys Glu Val Lys Thr Glu Leu Thr Arg Asp Val Leu Thr 260 265
270 Asp Pro Ile Val Gly Val Asn Asn Leu Arg Gly Tyr Gly Thr Thr Phe
275 280 285 Ser Asn Ile Glu Asn Tyr Ile Arg Lys Pro His Leu Phe Asp
Tyr Leu 290 295 300 His Arg Ile Gln Phe His Thr Arg Phe Gln Pro Gly
Tyr Tyr Gly Asn 305 310 315 320 Asp Ser Phe Asn Tyr Trp Ser Gly Asn
Tyr Val Ser Thr Arg Pro Ser 325 330 335 Ile Gly Ser Asn Asp Ile Ile
Thr Ser Pro Phe Tyr Gly Asn Lys Ser 340 345 350 Ser Glu Pro Val Gln
Asn Leu Glu Phe Asn Gly Glu Lys Val Tyr Arg 355 360 365 Ala Val Ala
Asn Thr Asn Leu Ala Val Trp Pro Ser Ala Val Tyr Ser 370 375 380 Gly
Val Thr Lys Val Glu Phe Ser Gln Tyr Asn Asp Gln Thr Asp Glu 385 390
395 400 Ala Ser Thr Gln Thr Tyr Asp Ser Lys Arg Asn Val Gly Ala Val
Ser 405 410 415 Trp Asp Ser Ile Asp Gln Leu Pro Pro Glu Thr Thr Asp
Glu Pro Leu 420 425 430 Glu Lys Gly Tyr Ser His Gln Leu Asn Tyr Val
Met Cys Phe Leu Met 435 440 445 Gln Gly Ser Arg Gly Thr Ile Pro Val
Leu Thr Trp Thr His Lys Ser 450 455 460 Val Asp Phe Phe Asn Met Ile
Asp Ser Lys Lys Ile Thr Gln Leu Pro 465 470 475 480 Leu Val Lys Ala
Ser Glu Leu Pro Gln Gly Thr Thr Val Val Arg Gly 485 490 495 Pro Gly
Phe Thr Gly Gly Asp Ile Leu Arg Arg Thr Asn Thr Gly Gly 500 505 510
Phe Gly Pro Ile Arg Val Thr Val Asn Gly Pro Leu Thr Gln Arg Tyr 515
520 525 Arg Ile Gly Phe Arg Tyr Ala Ser Thr Val Asp Phe Asp Phe Phe
Val 530 535 540 Ser Arg Gly Gly Thr Thr Val Asn Asn Phe Arg Phe Leu
Arg Thr Met 545 550 555 560 Asn Ser Gly Asp Glu Leu Lys Tyr Gly Asn
Phe Val Arg Arg Ala Phe 565 570 575 Thr Thr Pro Phe Thr Phe Thr Gln
Ile Gln Asp Ile Ile Arg Thr Ser 580 585 590 Ile Gln Gly Leu Ser Gly
Asn Gly Glu Val Tyr Ile Asp Lys Ile Glu 595 600 605 Ile Ile Pro Val
Thr Ala Thr Phe Glu Ala Glu Tyr Asp Leu Glu Arg 610 615 620 Ala Gln
Glu Ala Val Asn Ala Leu Phe 625 630 291791DNAArtificial SequenceV3A
coding sequence 29atgacggccg acaacaacac cgaggccctg gacagcagca
ccaccaagga cgtgatccag 60aagggcatca gcgtggtggg cgacctgctg ggcgtggtgg
gcttcccctt cggcggcgcc 120ctggtgagct tctacaccaa cttcctgaac
accatctggc ccagcgagga cccctggaag 180gccttcatgg agcaggtgga
ggccctgatg gaccagaaga tcgccgacta cgccaagaac 240aaggcactgg
ccgagctaca gggcctccag aacaacgtgg aggactatgt gagcgccctg
300agcagctggc agaagaaccc cgctgcaccg ttccgcaacc cccacagcca
gggccgcatc 360cgcgagctgt tcagccaggc cgagagccac ttccgcaaca
gcatgcccag cttcgccatc 420agcggctacg aggtgctgtt cctgaccacc
tacgcccagg ccgccaacac ccacctgttc 480ctgctgaagg acgcccaaat
ctacggagag gagtggggct acgagaagga ggacatcgcc 540gagttctaca
agcgccagct gaagctgacc caggagtaca ccgaccactg cgtgaagtgg
600tacaacgtgg gtctagacaa gctccgcggc agcagctacg agagctgggt
gaacttcaac 660cgctaccgcc gcgagatgac cctgaccgtg ctggacatcg
tgagcctgtt ccccaactac 720gacagccgca cctaccccat ccgcaccgtg
agccagctga cccgcgagat ttacaccaac 780cccgtgctgg agaacttcga
cggcagcttc cgcggcagcg cccagggcat cgagggcagc 840atccgcagcc
cccacctgat ggacatcctg aacagcatca ccatctacac cgacgcccac
900cgcggcgagt actactggag cggccaccag atcatggcca gccccgtcgg
cttcagcggc 960cccgagttca ccttccccct gtacggcacc atgggcaacg
ctgcacctca gcagcgcatc 1020gtggcacagc tgggccaggg agtgtaccgc
accctgagca gcaccctgta ccgtcgacct 1080ttcaacatcg gcatcaacaa
ccagcagctg agcgtgctgg acggcaccga gttcgcctac 1140ggcaccagca
gcaacctgcc cagcgccgtg taccgcaaga gcggcaccgt ggacagcctg
1200gacgagatcc cccctcagaa caacaacgtg ccacctcgac agggcttcag
ccaccgtctg 1260agccacgtga gcatgttccg cagtggcttc agcaacagca
gcgtgagcat catccgtgca 1320cctatgttca gctggattca ccgcagtgcc
gagttcaaca acatcatccc cagcagccag 1380atcacccaga tccccctggt
gaaggcctac aagctccaga gcggcgccag cgtggtggca 1440ggcccccgct
tcaccggcgg cgacatcatc cagtgcaccg agaacggcag cgccgccacc
1500atctacgtga cccccgacgt gagctacagc cagaagtacc gcgcccgcat
ccactacgcc 1560agcaccagcc agatcacctt caccctgagc ctggacgggg
cccccttcaa ccaatactac 1620ttcgacaaga ccatcaacaa gggcgacacc
ctgacctaca acagcttcaa cctggccagc 1680ttcagcaccc ctttcgagct
gagcggcaac aacctccaga tcggcgtgac cggcctgagc 1740gccggcgaca
aggtgtacat cgacaagatc gagttcatcc ccgtgaacta g
179130596PRTArtificial SequenceV3A protein 30Met Thr Ala Asp Asn
Asn Thr Glu Ala Leu Asp Ser Ser Thr Thr Lys 1 5 10 15 Asp Val Ile
Gln Lys Gly Ile Ser Val Val Gly Asp Leu Leu Gly Val 20 25 30 Val
Gly Phe Pro Phe Gly Gly Ala Leu Val Ser Phe Tyr Thr Asn Phe 35 40
45 Leu Asn Thr Ile Trp Pro Ser Glu Asp Pro Trp Lys Ala Phe Met Glu
50 55 60 Gln Val Glu Ala Leu Met Asp Gln Lys Ile Ala Asp Tyr Ala
Lys Asn 65 70 75 80 Lys Ala Leu Ala Glu Leu Gln Gly Leu Gln Asn Asn
Val Glu Asp Tyr 85 90 95 Val Ser Ala Leu Ser Ser Trp Gln Lys Asn
Pro Ala Ala Pro Phe Arg 100 105 110 Asn Pro His Ser Gln Gly Arg Ile
Arg Glu Leu Phe Ser Gln Ala Glu 115 120 125 Ser His Phe Arg Asn Ser
Met Pro Ser Phe Ala Ile Ser Gly Tyr Glu 130 135 140 Val Leu Phe Leu
Thr Thr Tyr Ala Gln Ala Ala Asn Thr His Leu Phe 145 150 155 160 Leu
Leu Lys Asp Ala Gln Ile Tyr Gly Glu Glu Trp Gly Tyr Glu Lys 165 170
175 Glu Asp Ile Ala Glu Phe Tyr Lys Arg Gln Leu Lys Leu Thr Gln Glu
180 185 190 Tyr Thr Asp His Cys Val Lys Trp Tyr Asn Val Gly Leu Asp
Lys Leu 195 200 205 Arg Gly Ser Ser Tyr Glu Ser Trp Val Asn Phe Asn
Arg Tyr Arg Arg 210 215 220 Glu Met Thr Leu Thr Val Leu Asp Ile Val
Ser Leu Phe Pro Asn Tyr 225 230 235 240 Asp Ser Arg Thr Tyr Pro Ile
Arg Thr Val Ser Gln Leu Thr Arg Glu 245 250 255 Ile Tyr Thr Asn Pro
Val Leu Glu Asn Phe Asp Gly Ser Phe Arg Gly 260 265 270 Ser Ala Gln
Gly Ile Glu Gly Ser Ile Arg Ser Pro His Leu Met Asp 275 280 285 Ile
Leu Asn Ser Ile Thr Ile Tyr Thr Asp Ala His Arg Gly Glu Tyr 290 295
300 Tyr Trp Ser Gly His Gln Ile Met Ala Ser Pro Val Gly Phe Ser Gly
305 310 315 320 Pro Glu Phe Thr Phe Pro Leu Tyr Gly Thr Met Gly Asn
Ala Ala Pro 325 330 335 Gln Gln Arg Ile Val Ala Gln Leu Gly Gln Gly
Val Tyr Arg Thr Leu 340 345 350 Ser Ser Thr Leu Tyr Arg Arg Pro Phe
Asn Ile Gly Ile Asn Asn Gln
355 360 365 Gln Leu Ser Val Leu Asp Gly Thr Glu Phe Ala Tyr Gly Thr
Ser Ser 370 375 380 Asn Leu Pro Ser Ala Val Tyr Arg Lys Ser Gly Thr
Val Asp Ser Leu 385 390 395 400 Asp Glu Ile Pro Pro Gln Asn Asn Asn
Val Pro Pro Arg Gln Gly Phe 405 410 415 Ser His Arg Leu Ser His Val
Ser Met Phe Arg Ser Gly Phe Ser Asn 420 425 430 Ser Ser Val Ser Ile
Ile Arg Ala Pro Met Phe Ser Trp Ile His Arg 435 440 445 Ser Ala Glu
Phe Asn Asn Ile Ile Pro Ser Ser Gln Ile Thr Gln Ile 450 455 460 Pro
Leu Val Lys Ala Tyr Lys Leu Gln Ser Gly Ala Ser Val Val Ala 465 470
475 480 Gly Pro Arg Phe Thr Gly Gly Asp Ile Ile Gln Cys Thr Glu Asn
Gly 485 490 495 Ser Ala Ala Thr Ile Tyr Val Thr Pro Asp Val Ser Tyr
Ser Gln Lys 500 505 510 Tyr Arg Ala Arg Ile His Tyr Ala Ser Thr Ser
Gln Ile Thr Phe Thr 515 520 525 Leu Ser Leu Asp Gly Ala Pro Phe Asn
Gln Tyr Tyr Phe Asp Lys Thr 530 535 540 Ile Asn Lys Gly Asp Thr Leu
Thr Tyr Asn Ser Phe Asn Leu Ala Ser 545 550 555 560 Phe Ser Thr Pro
Phe Glu Leu Ser Gly Asn Asn Leu Gln Ile Gly Val 565 570 575 Thr Gly
Leu Ser Ala Gly Asp Lys Val Tyr Ile Asp Lys Ile Glu Phe 580 585 590
Ile Pro Val Asn 595 31 1797DNAArtificial SequenceV4F coding
sequence 31atgacggccg acaacaacac cgaggccctg gacagcagca ccaccaagga
cgtgatccag 60aagggcatca gcgtggtggg cgacctgctg ggcgtggtgg gcttcccctt
cggcggcgcc 120ctggtgagct tctacaccaa cttcctgaac accatctggc
ccagcgagga cccctggaag 180gccttcatgg agcaggtgga ggccctgatg
gaccagaaga tcgccgacta cgccaagaac 240aaggcactgg ccgagctaca
gggcctccag aacaacgtgg aggactatgt gagcgccctg 300agcagctggc
agaagaaccc cgctgcaccg ttccgcaacc cccacagcca gggccgcatc
360cgcgagctgt tcagccaggc cgagagccac ttccgcaaca gcatgcccag
cttcgccatc 420agcggctacg aggtgctgtt cctgaccacc tacgcccagg
ccgccaacac ccacctgttc 480ctgctgaagg acgcccaaat ctacggagag
gagtggggct acgagaagga ggacatcgcc 540gagttctaca agcgccagct
gaagctgacc caggagtaca ccgaccactg cgtgaagtgg 600tacaacgtgg
gtctagacaa gctccgcggc agcagctacg agagctgggt gaacttcaac
660cgctaccgcc gcgagatgac cctgaccgtg ctggacctga tcgccctgtt
ccccctgtac 720gacgtgcgcc tgtaccccaa ggaggtgaag accgagctga
cccgcgacgt gctgaccgac 780cccatcgtgg gcgtgaacaa cctgcgcggc
tacggcacca ccttcagcaa catcgagaac 840tacatccgca agccccacct
gttcgactac ctgcaccgca tccagttcca cacgcgtttc 900cagcccggct
actacggcaa cgacagcttc aactactgga gcggcaacta cgtgagcacc
960cgccccagca tcggcagcaa cgacatcatc accagcccct tctacggcaa
caagagcagc 1020gagcccgtgc agaaccttga gttcaacggc gagaaggtgt
accgcgccgt ggctaacacc 1080aacctggccg tgtggccctc tgcagtgtac
agcggcgtga ccaaggtgga gttcagccag 1140tacaacgacc agaccgacga
ggccagcacc cagacctacg acagcaagcg caacgtgggc 1200gccgtgagct
gggacagcat cgaccagctg ccccccgaga ccaccgacga gcccctggag
1260aagggctaca gccaccagct gaactacgtg atgtgcttcc tgatgcaggg
cagccgcggc 1320accatccccg tgctgacctg gacccacaag agcgtcgact
tcttcaacat gatcgacagc 1380aagaagatca cccagctcgc cctgaccaag
agcaccaacc tgggcagcgg caccagcgtg 1440gtgaagggcc ccggcttcac
cggcggcgac atcctgcgcc gcaccagccc cggccagatc 1500agcaccctgc
gcgtgaacat caccgccccc ctgagccagc gctaccgcgt ccgcatccac
1560tacgccagca ccagccagat caccttcacc ctgagcctgg acggggcccc
cttcaaccaa 1620tactacttcg acaagaccat caacaagggc gacaccctga
cctacaacag cttcaacctg 1680gccagcttca gcaccccttt cgagctgagc
ggcaacaacc tccagatcgg cgtgaccggc 1740ctgagcgccg gcgacaaggt
gtacatcgac aagatcgagt tcatccccgt gaactag 179732598PRTArtificial
SequenceV4F protein 32Met Thr Ala Asp Asn Asn Thr Glu Ala Leu Asp
Ser Ser Thr Thr Lys 1 5 10 15 Asp Val Ile Gln Lys Gly Ile Ser Val
Val Gly Asp Leu Leu Gly Val 20 25 30 Val Gly Phe Pro Phe Gly Gly
Ala Leu Val Ser Phe Tyr Thr Asn Phe 35 40 45 Leu Asn Thr Ile Trp
Pro Ser Glu Asp Pro Trp Lys Ala Phe Met Glu 50 55 60 Gln Val Glu
Ala Leu Met Asp Gln Lys Ile Ala Asp Tyr Ala Lys Asn 65 70 75 80 Lys
Ala Leu Ala Glu Leu Gln Gly Leu Gln Asn Asn Val Glu Asp Tyr 85 90
95 Val Ser Ala Leu Ser Ser Trp Gln Lys Asn Pro Ala Ala Pro Phe Arg
100 105 110 Asn Pro His Ser Gln Gly Arg Ile Arg Glu Leu Phe Ser Gln
Ala Glu 115 120 125 Ser His Phe Arg Asn Ser Met Pro Ser Phe Ala Ile
Ser Gly Tyr Glu 130 135 140 Val Leu Phe Leu Thr Thr Tyr Ala Gln Ala
Ala Asn Thr His Leu Phe 145 150 155 160 Leu Leu Lys Asp Ala Gln Ile
Tyr Gly Glu Glu Trp Gly Tyr Glu Lys 165 170 175 Glu Asp Ile Ala Glu
Phe Tyr Lys Arg Gln Leu Lys Leu Thr Gln Glu 180 185 190 Tyr Thr Asp
His Cys Val Lys Trp Tyr Asn Val Gly Leu Asp Lys Leu 195 200 205 Arg
Gly Ser Ser Tyr Glu Ser Trp Val Asn Phe Asn Arg Tyr Arg Arg 210 215
220 Glu Met Thr Leu Thr Val Leu Asp Leu Ile Ala Leu Phe Pro Leu Tyr
225 230 235 240 Asp Val Arg Leu Tyr Pro Lys Glu Val Lys Thr Glu Leu
Thr Arg Asp 245 250 255 Val Leu Thr Asp Pro Ile Val Gly Val Asn Asn
Leu Arg Gly Tyr Gly 260 265 270 Thr Thr Phe Ser Asn Ile Glu Asn Tyr
Ile Arg Lys Pro His Leu Phe 275 280 285 Asp Tyr Leu His Arg Ile Gln
Phe His Thr Arg Phe Gln Pro Gly Tyr 290 295 300 Tyr Gly Asn Asp Ser
Phe Asn Tyr Trp Ser Gly Asn Tyr Val Ser Thr 305 310 315 320 Arg Pro
Ser Ile Gly Ser Asn Asp Ile Ile Thr Ser Pro Phe Tyr Gly 325 330 335
Asn Lys Ser Ser Glu Pro Val Gln Asn Leu Glu Phe Asn Gly Glu Lys 340
345 350 Val Tyr Arg Ala Val Ala Asn Thr Asn Leu Ala Val Trp Pro Ser
Ala 355 360 365 Val Tyr Ser Gly Val Thr Lys Val Glu Phe Ser Gln Tyr
Asn Asp Gln 370 375 380 Thr Asp Glu Ala Ser Thr Gln Thr Tyr Asp Ser
Lys Arg Asn Val Gly 385 390 395 400 Ala Val Ser Trp Asp Ser Ile Asp
Gln Leu Pro Pro Glu Thr Thr Asp 405 410 415 Glu Pro Leu Glu Lys Gly
Tyr Ser His Gln Leu Asn Tyr Val Met Cys 420 425 430 Phe Leu Met Gln
Gly Ser Arg Gly Thr Ile Pro Val Leu Thr Trp Thr 435 440 445 His Lys
Ser Val Asp Phe Phe Asn Met Ile Asp Ser Lys Lys Ile Thr 450 455 460
Gln Leu Ala Leu Thr Lys Ser Thr Asn Leu Gly Ser Gly Thr Ser Val 465
470 475 480 Val Lys Gly Pro Gly Phe Thr Gly Gly Asp Ile Leu Arg Arg
Thr Ser 485 490 495 Pro Gly Gln Ile Ser Thr Leu Arg Val Asn Ile Thr
Ala Pro Leu Ser 500 505 510 Gln Arg Tyr Arg Val Arg Ile His Tyr Ala
Ser Thr Ser Gln Ile Thr 515 520 525 Phe Thr Leu Ser Leu Asp Gly Ala
Pro Phe Asn Gln Tyr Tyr Phe Asp 530 535 540 Lys Thr Ile Asn Lys Gly
Asp Thr Leu Thr Tyr Asn Ser Phe Asn Leu 545 550 555 560 Ala Ser Phe
Ser Thr Pro Phe Glu Leu Ser Gly Asn Asn Leu Gln Ile 565 570 575 Gly
Val Thr Gly Leu Ser Ala Gly Asp Lys Val Tyr Ile Asp Lys Ile 580 585
590 Glu Phe Ile Pro Val Asn 595 33 1836DNAArtificial Sequence5*V4F
coding sequence 33atgactagta acggccgcca gtgtgctggt attcgccctt
atgacggccg acaacaacac 60cgaggcctgg acagcagcac caccaaggac gtgatccaga
agggcatcag cgtggtgggc 120gacctgctgg gcgtggtggg cttccccttc
ggcggcgccc tggtgagctt ctacaccaac 180ttcctgaaca ccatctggcc
cagcgaggac ccctggaagg ccttcatgga gcaggtggag 240gccctgatgg
accagaagat cgccgactac gccaagaaca aggcactggc cgagctacag
300ggcctccaga acaacgtgga ggactatgtg agcgccctga gcagctggca
gaagaacccc 360gctgcaccgt tccgcaaccc ccacagccag ggccgcatcc
gcgagctgtt cagccaggcc 420gagagccact tccgcaacag catgcccagc
ttcgccatca gcggctacga ggtgctgttc 480ctgaccacct acgcccaggc
cgccaacacc cacctgttcc tgctgaagga cgcccaaatc 540tacggagagg
agtggggcta cgagaaggag gacatcgccg agttctacaa gcgccagctg
600aagctgaccc aggagtacac cgaccactgc gtgaagtggt acaacgtggg
tctagacaag 660ctccgcggca gcagctacga gagctgggtg aacttcaacc
gctaccgccg cgagatgacc 720ctgaccgtgc tggacctgat cgccctgttc
cccctgtacg acgtgcgcct gtaccccaag 780gaggtgaaga ccgagctgac
ccgcgacgtg ctgaccgacc ccatcgtggg cgtgaacaac 840ctgcgcggct
acggcaccac cttcagcaac atcgagaact acatccgcaa gccccacctg
900ttcgactacc tgcaccgcat ccagttccac acgcgtttcc agcccggcta
ctacggcaac 960gacagcttca actactggag cggcaactac gtgagcaccc
gccccagcat cggcagcaac 1020gacatcatca ccagcccctt ctacggcaac
aagagcagcg agcccgtgca gaaccttgag 1080ttcaacggcg agaaggtgta
ccgcgccgtg gctaacacca acctggccgt gtggccctct 1140gcagtgtaca
gcggcgtgac caaggtggag ttcagccagt acaacgacca gaccgacgag
1200gccagcaccc agacctacga cagcaagcgc aacgtgggcg ccgtgagctg
ggacagcatc 1260gaccagctgc cccccgagac caccgacgag cccctggaga
agggctacag ccaccagctg 1320aactacgtga tgtgcttcct gatgcagggc
agccgcggca ccatccccgt gctgacctgg 1380acccacaaga gcgtcgactt
cttcaacatg atcgacagca agaagatcac ccagctcgcc 1440ctgaccaaga
gcaccaacct gggcagcggc accagcgtgg tgaagggccc cggcttcacc
1500ggcggcgaca tcctgcgccg caccagcccc ggccagatca gcaccctgcg
cgtgaacatc 1560accgcccccc tgagccagcg ctaccgcgtc cgcatccact
acgccagcac cagccagatc 1620accttcaccc tgagcctgga cggggccccc
ttcaaccaat actacttcga caagaccatc 1680aacaagggcg acaccctgac
ctacaacagc ttcaacctgg ccagcttcag cacccctttc 1740gagctgagcg
gcaacaacct ccagatcggc gtgaccggcc tgagcgccgg cgacaaggtg
1800tacatcgaca agatcgagtt catccccgtg aactag 183634611PRTArtificial
Sequence5*V4F Protein 34Met Thr Ser Asn Gly Arg Gln Cys Ala Gly Ile
Arg Pro Tyr Asp Gly 1 5 10 15 Arg Gln Gln His Arg Gly Leu Asp Ser
Ser Thr Thr Lys Asp Val Ile 20 25 30 Gln Lys Gly Ile Ser Val Val
Gly Asp Leu Leu Gly Val Val Gly Phe 35 40 45 Pro Phe Gly Gly Ala
Leu Val Ser Phe Tyr Thr Asn Phe Leu Asn Thr 50 55 60 Ile Trp Pro
Ser Glu Asp Pro Trp Lys Ala Phe Met Glu Gln Val Glu 65 70 75 80 Ala
Leu Met Asp Gln Lys Ile Ala Asp Tyr Ala Lys Asn Lys Ala Leu 85 90
95 Ala Glu Leu Gln Gly Leu Gln Asn Asn Val Glu Asp Tyr Val Ser Ala
100 105 110 Leu Ser Ser Trp Gln Lys Asn Pro Ala Ala Pro Phe Arg Asn
Pro His 115 120 125 Ser Gln Gly Arg Ile Arg Glu Leu Phe Ser Gln Ala
Glu Ser His Phe 130 135 140 Arg Asn Ser Met Pro Ser Phe Ala Ile Ser
Gly Tyr Glu Val Leu Phe 145 150 155 160 Leu Thr Thr Tyr Ala Gln Ala
Ala Asn Thr His Leu Phe Leu Leu Lys 165 170 175 Asp Ala Gln Ile Tyr
Gly Glu Glu Trp Gly Tyr Glu Lys Glu Asp Ile 180 185 190 Ala Glu Phe
Tyr Lys Arg Gln Leu Lys Leu Thr Gln Glu Tyr Thr Asp 195 200 205 His
Cys Val Lys Trp Tyr Asn Val Gly Leu Asp Lys Leu Arg Gly Ser 210 215
220 Ser Tyr Glu Ser Trp Val Asn Phe Asn Arg Tyr Arg Arg Glu Met Thr
225 230 235 240 Leu Thr Val Leu Asp Leu Ile Ala Leu Phe Pro Leu Tyr
Asp Val Arg 245 250 255 Leu Tyr Pro Lys Glu Val Lys Thr Glu Leu Thr
Arg Asp Val Leu Thr 260 265 270 Asp Pro Ile Val Gly Val Asn Asn Leu
Arg Gly Tyr Gly Thr Thr Phe 275 280 285 Ser Asn Ile Glu Asn Tyr Ile
Arg Lys Pro His Leu Phe Asp Tyr Leu 290 295 300 His Arg Ile Gln Phe
His Thr Arg Phe Gln Pro Gly Tyr Tyr Gly Asn 305 310 315 320 Asp Ser
Phe Asn Tyr Trp Ser Gly Asn Tyr Val Ser Thr Arg Pro Ser 325 330 335
Ile Gly Ser Asn Asp Ile Ile Thr Ser Pro Phe Tyr Gly Asn Lys Ser 340
345 350 Ser Glu Pro Val Gln Asn Leu Glu Phe Asn Gly Glu Lys Val Tyr
Arg 355 360 365 Ala Val Ala Asn Thr Asn Leu Ala Val Trp Pro Ser Ala
Val Tyr Ser 370 375 380 Gly Val Thr Lys Val Glu Phe Ser Gln Tyr Asn
Asp Gln Thr Asp Glu 385 390 395 400 Ala Ser Thr Gln Thr Tyr Asp Ser
Lys Arg Asn Val Gly Ala Val Ser 405 410 415 Trp Asp Ser Ile Asp Gln
Leu Pro Pro Glu Thr Thr Asp Glu Pro Leu 420 425 430 Glu Lys Gly Tyr
Ser His Gln Leu Asn Tyr Val Met Cys Phe Leu Met 435 440 445 Gln Gly
Ser Arg Gly Thr Ile Pro Val Leu Thr Trp Thr His Lys Ser 450 455 460
Val Asp Phe Phe Asn Met Ile Asp Ser Lys Lys Ile Thr Gln Leu Ala 465
470 475 480 Leu Thr Lys Ser Thr Asn Leu Gly Ser Gly Thr Ser Val Val
Lys Gly 485 490 495 Pro Gly Phe Thr Gly Gly Asp Ile Leu Arg Arg Thr
Ser Pro Gly Gln 500 505 510 Ile Ser Thr Leu Arg Val Asn Ile Thr Ala
Pro Leu Ser Gln Arg Tyr 515 520 525 Arg Val Arg Ile His Tyr Ala Ser
Thr Ser Gln Ile Thr Phe Thr Leu 530 535 540 Ser Leu Asp Gly Ala Pro
Phe Asn Gln Tyr Tyr Phe Asp Lys Thr Ile 545 550 555 560 Asn Lys Gly
Asp Thr Leu Thr Tyr Asn Ser Phe Asn Leu Ala Ser Phe 565 570 575 Ser
Thr Pro Phe Glu Leu Ser Gly Asn Asn Leu Gln Ile Gly Val Thr 580 585
590 Gly Leu Ser Ala Gly Asp Lys Val Tyr Ile Asp Lys Ile Glu Phe Ile
595 600 605 Pro Val Asn 610 351901DNAArtificial Sequence2OL-7
coding sequence 35atgacggccg acaacaacac cgaggccctg gacagcagca
ccaccaagga cgtgatccag 60aagggcatca gcgtggtggg cgacctgctg ggcgtggtgg
gcttcccctt cggcggcgcc 120ctggtgagct tctacaccaa cttcctgaac
accatctggc ccagcgagga cccctggaag 180gccttcatgg agcaggtgga
ggccctgatg gaccagaaga tcgccgacta cgccaagaac 240aaggcactgg
ccgagctaca gggcctccag aacaacgtgg aggactatgt gagcgccctg
300agcagctggc agaagaaccc cgctgcaccg ttccgcaacc cccacagcca
gggccgcatc 360cgcgagctgt tcagccaggc cgagagccac ttccgcaaca
gcatgcccag cttcgccatc 420agcggctacg aggtgctgtt cctgaccacc
tacgtgcagg ccgccaacct gcacctgagc 480gtgctgcgcg acgtcagcgt
gttcggccag cgctggggct tcgacgccgc caccatcaac 540agccgctaca
acgacctgac ccgcctgatc ggcaactaca ccgaccacgc cgtgcgctgg
600tacaacaccg gcctggagcg cgtgtggggt cccgacagcc gcgactggat
caggtacaac 660cagttccgcc gcgagctgac cctgaccgtg ctggacatcg
tgagcctgtt ccccaactac 720gacagccgca cctaccccat ccgcaccgtg
agccagctga cccgcgagat ttacaccaac 780cccgtgctgg agaacttcga
cggcagcttc cgcggcagcg cccagggcat cgagggcagc 840atccgcagcc
cccacctgat ggacatcctg aacagcatca ccatctacac cgacgcccac
900cgcggcgagt actactggag cggccaccag atcatggcca gccccgtcgg
cttcagcggc 960cccgagttca ccttccccct gtacggcacc atgggcaacg
ctgcacctca gcagcgcatc 1020gtggcacagc tgggccaggg agtgtaccgc
accctgagca gcaccctgta ccgtcgacct 1080ttcaacatcg gcatcaacaa
ccagcagctg agcgtgctgg acggcaccga gttcgcctac 1140ggcaccagca
gcaacctgcc cagcgccgtg taccgcaaga gcggcaccgt ggacagcctg
1200gacgagatcc cccctcagaa caacaacgtg ccacctcgac agggcttcag
ccaccgtctg 1260agccacgtga gcatgttccg cagtggcttc agcaacagca
gcgtgagcat catccgtgca 1320cctatgttca gctggattca ccgcagtgcc
gagttcaaca acatcatccc cagcagccag 1380atcacccaga tccccctgac
caagagcacc aacctgggca gcggcaccag cgtggtgaag 1440ggccccggct
tcaccggcgg cgacatcctg cgccgcacca gccccggcca gatcagcacc
1500ctgcgcgtga acatcaccgc ccccctgagc cagcgctacc gcgtccgcat
ccgctacgcc 1560agcaccacca acctgcagtt ccacaccagc atcgacggcc
gccccatcaa ccagggcaac 1620ttcagcgcca ccatgagcag cggcagcaac
ctgcagagcg gcagcttccg caccgtgggc 1680ttcaccaccc ccttcaactt
cagcaacggc agcagcgtgt tcaccctgag cgcccacgtg 1740ttcaacagcg
gcaacgaggt gtacatcgac
cgcatcgagt tcgtgcccgc cgaggtgacc 1800ttcgaggccg agtacgacct
ggagagggct cagaaggccg tgaacgagct gttcaccagc 1860agcaaccaga
tcggcctgaa gaccgacgtg accgactacc a 190136633PRTArtificial
Sequence2OL-7 protein 36Met Thr Ala Asp Asn Asn Thr Glu Ala Leu Asp
Ser Ser Thr Thr Lys 1 5 10 15 Asp Val Ile Gln Lys Gly Ile Ser Val
Val Gly Asp Leu Leu Gly Val 20 25 30 Val Gly Phe Pro Phe Gly Gly
Ala Leu Val Ser Phe Tyr Thr Asn Phe 35 40 45 Leu Asn Thr Ile Trp
Pro Ser Glu Asp Pro Trp Lys Ala Phe Met Glu 50 55 60 Gln Val Glu
Ala Leu Met Asp Gln Lys Ile Ala Asp Tyr Ala Lys Asn 65 70 75 80 Lys
Ala Leu Ala Glu Leu Gln Gly Leu Gln Asn Asn Val Glu Asp Tyr 85 90
95 Val Ser Ala Leu Ser Ser Trp Gln Lys Asn Pro Ala Ala Pro Phe Arg
100 105 110 Asn Pro His Ser Gln Gly Arg Ile Arg Glu Leu Phe Ser Gln
Ala Glu 115 120 125 Ser His Phe Arg Asn Ser Met Pro Ser Phe Ala Ile
Ser Gly Tyr Glu 130 135 140 Val Leu Phe Leu Thr Thr Tyr Val Gln Ala
Ala Asn Leu His Leu Ser 145 150 155 160 Val Leu Arg Asp Val Ser Val
Phe Gly Gln Arg Trp Gly Phe Asp Ala 165 170 175 Ala Thr Ile Asn Ser
Arg Tyr Asn Asp Leu Thr Arg Leu Ile Gly Asn 180 185 190 Tyr Thr Asp
His Ala Val Arg Trp Tyr Asn Thr Gly Leu Glu Arg Val 195 200 205 Trp
Gly Pro Asp Ser Arg Asp Trp Ile Arg Tyr Asn Gln Phe Arg Arg 210 215
220 Glu Leu Thr Leu Thr Val Leu Asp Ile Val Ser Leu Phe Pro Asn Tyr
225 230 235 240 Asp Ser Arg Thr Tyr Pro Ile Arg Thr Val Ser Gln Leu
Thr Arg Glu 245 250 255 Ile Tyr Thr Asn Pro Val Leu Glu Asn Phe Asp
Gly Ser Phe Arg Gly 260 265 270 Ser Ala Gln Gly Ile Glu Gly Ser Ile
Arg Ser Pro His Leu Met Asp 275 280 285 Ile Leu Asn Ser Ile Thr Ile
Tyr Thr Asp Ala His Arg Gly Glu Tyr 290 295 300 Tyr Trp Ser Gly His
Gln Ile Met Ala Ser Pro Val Gly Phe Ser Gly 305 310 315 320 Pro Glu
Phe Thr Phe Pro Leu Tyr Gly Thr Met Gly Asn Ala Ala Pro 325 330 335
Gln Gln Arg Ile Val Ala Gln Leu Gly Gln Gly Val Tyr Arg Thr Leu 340
345 350 Ser Ser Thr Leu Tyr Arg Arg Pro Phe Asn Ile Gly Ile Asn Asn
Gln 355 360 365 Gln Leu Ser Val Leu Asp Gly Thr Glu Phe Ala Tyr Gly
Thr Ser Ser 370 375 380 Asn Leu Pro Ser Ala Val Tyr Arg Lys Ser Gly
Thr Val Asp Ser Leu 385 390 395 400 Asp Glu Ile Pro Pro Gln Asn Asn
Asn Val Pro Pro Arg Gln Gly Phe 405 410 415 Ser His Arg Leu Ser His
Val Ser Met Phe Arg Ser Gly Phe Ser Asn 420 425 430 Ser Ser Val Ser
Ile Ile Arg Ala Pro Met Phe Ser Trp Ile His Arg 435 440 445 Ser Ala
Glu Phe Asn Asn Ile Ile Pro Ser Ser Gln Ile Thr Gln Ile 450 455 460
Pro Leu Thr Lys Ser Thr Asn Leu Gly Ser Gly Thr Ser Val Val Lys 465
470 475 480 Gly Pro Gly Phe Thr Gly Gly Asp Ile Leu Arg Arg Thr Ser
Pro Gly 485 490 495 Gln Ile Ser Thr Leu Arg Val Asn Ile Thr Ala Pro
Leu Ser Gln Arg 500 505 510 Tyr Arg Val Arg Ile Arg Tyr Ala Ser Thr
Thr Asn Leu Gln Phe His 515 520 525 Thr Ser Ile Asp Gly Arg Pro Ile
Asn Gln Gly Asn Phe Ser Ala Thr 530 535 540 Met Ser Ser Gly Ser Asn
Leu Gln Ser Gly Ser Phe Arg Thr Val Gly 545 550 555 560 Phe Thr Thr
Pro Phe Asn Phe Ser Asn Gly Ser Ser Val Phe Thr Leu 565 570 575 Ser
Ala His Val Phe Asn Ser Gly Asn Glu Val Tyr Ile Asp Arg Ile 580 585
590 Glu Phe Val Pro Ala Glu Val Thr Phe Glu Ala Glu Tyr Asp Leu Glu
595 600 605 Arg Ala Gln Lys Ala Val Asn Glu Leu Phe Thr Ser Ser Asn
Gln Ile 610 615 620 Gly Leu Lys Thr Asp Val Thr Asp Tyr 625 630
371943DNAArtificial SequenceT7-2OL-7 coding sequence 37atggctagca
tgactggtgg acagcaaatg ggtcgcggat ccatgacggc cgacaacaac 60accgaggccc
tggacagcag caccaccaag gacgtgatcc agaagggcat cagcgtggtg
120ggcgacctgc tgggcgtggt gggcttcccc ttcggcggcg ccctggtgag
cttctacacc 180aacttcctga acaccatctg gcccagcgag gacccctgga
aggccttcat ggagcaggtg 240gaggccctga tggaccagaa gatcgccgac
tacgccaaga acaaggcact ggccgagcta 300cagggcctcc agaacaacgt
ggaggactat gtgagcgccc tgagcagctg gcagaagaac 360cccgctgcac
cgttccgcaa cccccacagc cagggccgca tccgcgagct gttcagccag
420gccgagagcc acttccgcaa cagcatgccc agcttcgcca tcagcggcta
cgaggtgctg 480ttcctgacca cctacgtgca ggccgccaac ctgcacctga
gcgtgctgcg cgacgtcagc 540gtgttcggcc agcgctgggg cttcgacgcc
gccaccatca acagccgcta caacgacctg 600acccgcctga tcggcaacta
caccgaccac gccgtgcgct ggtacaacac cggcctggag 660cgcgtgtggg
gtcccgacag ccgcgactgg atcaggtaca accagttccg ccgcgagctg
720accctgaccg tgctggacat cgtgagcctg ttccccaact acgacagccg
cacctacccc 780atccgcaccg tgagccagct gacccgcgag atttacacca
accccgtgct ggagaacttc 840gacggcagct tccgcggcag cgcccagggc
atcgagggca gcatccgcag cccccacctg 900atggacatcc tgaacagcat
caccatctac accgacgccc accgcggcga gtactactgg 960agcggccacc
agatcatggc cagccccgtc ggcttcagcg gccccgagtt caccttcccc
1020ctgtacggca ccatgggcaa cgctgcacct cagcagcgca tcgtggcaca
gctgggccag 1080ggagtgtacc gcaccctgag cagcaccctg taccgtcgac
ctttcaacat cggcatcaac 1140aaccagcagc tgagcgtgct ggacggcacc
gagttcgcct acggcaccag cagcaacctg 1200cccagcgccg tgtaccgcaa
gagcggcacc gtggacagcc tggacgagat cccccctcag 1260aacaacaacg
tgccacctcg acagggcttc agccaccgtc tgagccacgt gagcatgttc
1320cgcagtggct tcagcaacag cagcgtgagc atcatccgtg cacctatgtt
cagctggatt 1380caccgcagtg ccgagttcaa caacatcatc cccagcagcc
agatcaccca gatccccctg 1440accaagagca ccaacctggg cagcggcacc
agcgtggtga agggccccgg cttcaccggc 1500ggcgacatcc tgcgccgcac
cagccccggc cagatcagca ccctgcgcgt gaacatcacc 1560gcccccctga
gccagcgcta ccgcgtccgc atccgctacg ccagcaccac caacctgcag
1620ttccacacca gcatcgacgg ccgccccatc aaccagggca acttcagcgc
caccatgagc 1680agcggcagca acctgcagag cggcagcttc cgcaccgtgg
gcttcaccac ccccttcaac 1740ttcagcaacg gcagcagcgt gttcaccctg
agcgcccacg tgttcaacag cggcaacgag 1800gtgtacatcg accgcatcga
gttcgtgccc gccgaggtga ccttcgaggc cgagtacgac 1860ctggagaggg
ctcagaaggc cgtgaacgag ctgttcacca gcagcaacca gatcggcctg
1920aagaccgacg tgaccgacta cca 194338647PRTArtificial
SequenceT7-2OL-7 protein 38Met Ala Ser Met Thr Gly Gly Gln Gln Met
Gly Arg Gly Ser Met Thr 1 5 10 15 Ala Asp Asn Asn Thr Glu Ala Leu
Asp Ser Ser Thr Thr Lys Asp Val 20 25 30 Ile Gln Lys Gly Ile Ser
Val Val Gly Asp Leu Leu Gly Val Val Gly 35 40 45 Phe Pro Phe Gly
Gly Ala Leu Val Ser Phe Tyr Thr Asn Phe Leu Asn 50 55 60 Thr Ile
Trp Pro Ser Glu Asp Pro Trp Lys Ala Phe Met Glu Gln Val 65 70 75 80
Glu Ala Leu Met Asp Gln Lys Ile Ala Asp Tyr Ala Lys Asn Lys Ala 85
90 95 Leu Ala Glu Leu Gln Gly Leu Gln Asn Asn Val Glu Asp Tyr Val
Ser 100 105 110 Ala Leu Ser Ser Trp Gln Lys Asn Pro Ala Ala Pro Phe
Arg Asn Pro 115 120 125 His Ser Gln Gly Arg Ile Arg Glu Leu Phe Ser
Gln Ala Glu Ser His 130 135 140 Phe Arg Asn Ser Met Pro Ser Phe Ala
Ile Ser Gly Tyr Glu Val Leu 145 150 155 160 Phe Leu Thr Thr Tyr Val
Gln Ala Ala Asn Leu His Leu Ser Val Leu 165 170 175 Arg Asp Val Ser
Val Phe Gly Gln Arg Trp Gly Phe Asp Ala Ala Thr 180 185 190 Ile Asn
Ser Arg Tyr Asn Asp Leu Thr Arg Leu Ile Gly Asn Tyr Thr 195 200 205
Asp His Ala Val Arg Trp Tyr Asn Thr Gly Leu Glu Arg Val Trp Gly 210
215 220 Pro Asp Ser Arg Asp Trp Ile Arg Tyr Asn Gln Phe Arg Arg Glu
Leu 225 230 235 240 Thr Leu Thr Val Leu Asp Ile Val Ser Leu Phe Pro
Asn Tyr Asp Ser 245 250 255 Arg Thr Tyr Pro Ile Arg Thr Val Ser Gln
Leu Thr Arg Glu Ile Tyr 260 265 270 Thr Asn Pro Val Leu Glu Asn Phe
Asp Gly Ser Phe Arg Gly Ser Ala 275 280 285 Gln Gly Ile Glu Gly Ser
Ile Arg Ser Pro His Leu Met Asp Ile Leu 290 295 300 Asn Ser Ile Thr
Ile Tyr Thr Asp Ala His Arg Gly Glu Tyr Tyr Trp 305 310 315 320 Ser
Gly His Gln Ile Met Ala Ser Pro Val Gly Phe Ser Gly Pro Glu 325 330
335 Phe Thr Phe Pro Leu Tyr Gly Thr Met Gly Asn Ala Ala Pro Gln Gln
340 345 350 Arg Ile Val Ala Gln Leu Gly Gln Gly Val Tyr Arg Thr Leu
Ser Ser 355 360 365 Thr Leu Tyr Arg Arg Pro Phe Asn Ile Gly Ile Asn
Asn Gln Gln Leu 370 375 380 Ser Val Leu Asp Gly Thr Glu Phe Ala Tyr
Gly Thr Ser Ser Asn Leu 385 390 395 400 Pro Ser Ala Val Tyr Arg Lys
Ser Gly Thr Val Asp Ser Leu Asp Glu 405 410 415 Ile Pro Pro Gln Asn
Asn Asn Val Pro Pro Arg Gln Gly Phe Ser His 420 425 430 Arg Leu Ser
His Val Ser Met Phe Arg Ser Gly Phe Ser Asn Ser Ser 435 440 445 Val
Ser Ile Ile Arg Ala Pro Met Phe Ser Trp Ile His Arg Ser Ala 450 455
460 Glu Phe Asn Asn Ile Ile Pro Ser Ser Gln Ile Thr Gln Ile Pro Leu
465 470 475 480 Thr Lys Ser Thr Asn Leu Gly Ser Gly Thr Ser Val Val
Lys Gly Pro 485 490 495 Gly Phe Thr Gly Gly Asp Ile Leu Arg Arg Thr
Ser Pro Gly Gln Ile 500 505 510 Ser Thr Leu Arg Val Asn Ile Thr Ala
Pro Leu Ser Gln Arg Tyr Arg 515 520 525 Val Arg Ile Arg Tyr Ala Ser
Thr Thr Asn Leu Gln Phe His Thr Ser 530 535 540 Ile Asp Gly Arg Pro
Ile Asn Gln Gly Asn Phe Ser Ala Thr Met Ser 545 550 555 560 Ser Gly
Ser Asn Leu Gln Ser Gly Ser Phe Arg Thr Val Gly Phe Thr 565 570 575
Thr Pro Phe Asn Phe Ser Asn Gly Ser Ser Val Phe Thr Leu Ser Ala 580
585 590 His Val Phe Asn Ser Gly Asn Glu Val Tyr Ile Asp Arg Ile Glu
Phe 595 600 605 Val Pro Ala Glu Val Thr Phe Glu Ala Glu Tyr Asp Leu
Glu Arg Ala 610 615 620 Gln Lys Ala Val Asn Glu Leu Phe Thr Ser Ser
Asn Gln Ile Gly Leu 625 630 635 640 Lys Thr Asp Val Thr Asp Tyr 645
391940DNAArtificial Sequence5*2OL-7 coding sequence 39atgactagta
acggccgcca gtgtgctggt attcgccctt atgacggccg acaacaacac 60cgaggcctgg
acagcagcac caccaaggac gtgatccaga agggcatcag cgtggtgggc
120gacctgctgg gcgtggtggg cttccccttc ggcggcgccc tggtgagctt
ctacaccaac 180ttcctgaaca ccatctggcc cagcgaggac ccctggaagg
ccttcatgga gcaggtggag 240gccctgatgg accagaagat cgccgactac
gccaagaaca aggcactggc cgagctacag 300ggcctccaga acaacgtgga
ggactatgtg agcgccctga gcagctggca gaagaacccc 360gctgcaccgt
tccgcaaccc ccacagccag ggccgcatcc gcgagctgtt cagccaggcc
420gagagccact tccgcaacag catgcccagc ttcgccatca gcggctacga
ggtgctgttc 480ctgaccacct acgtgcaggc cgccaacctg cacctgagcg
tgctgcgcga cgtcagcgtg 540ttcggccagc gctggggctt cgacgccgcc
accatcaaca gccgctacaa cgacctgacc 600cgcctgatcg gcaactacac
cgaccacgcc gtgcgctggt acaacaccgg cctggagcgc 660gtgtggggtc
ccgacagccg cgactggatc aggtacaacc agttccgccg cgagctgacc
720ctgaccgtgc tggacatcgt gagcctgttc cccaactacg acagccgcac
ctaccccatc 780cgcaccgtga gccagctgac ccgcgagatt tacaccaacc
ccgtgctgga gaacttcgac 840ggcagcttcc gcggcagcgc ccagggcatc
gagggcagca tccgcagccc ccacctgatg 900gacatcctga acagcatcac
catctacacc gacgcccacc gcggcgagta ctactggagc 960ggccaccaga
tcatggccag ccccgtcggc ttcagcggcc ccgagttcac cttccccctg
1020tacggcacca tgggcaacgc tgcacctcag cagcgcatcg tggcacagct
gggccaggga 1080gtgtaccgca ccctgagcag caccctgtac cgtcgacctt
tcaacatcgg catcaacaac 1140cagcagctga gcgtgctgga cggcaccgag
ttcgcctacg gcaccagcag caacctgccc 1200agcgccgtgt accgcaagag
cggcaccgtg gacagcctgg acgagatccc ccctcagaac 1260aacaacgtgc
cacctcgaca gggcttcagc caccgtctga gccacgtgag catgttccgc
1320agtggcttca gcaacagcag cgtgagcatc atccgtgcac ctatgttcag
ctggattcac 1380cgcagtgccg agttcaacaa catcatcccc agcagccaga
tcacccagat ccccctgacc 1440aagagcacca acctgggcag cggcaccagc
gtggtgaagg gccccggctt caccggcggc 1500gacatcctgc gccgcaccag
ccccggccag atcagcaccc tgcgcgtgaa catcaccgcc 1560cccctgagcc
agcgctaccg cgtccgcatc cgctacgcca gcaccaccaa cctgcagttc
1620cacaccagca tcgacggccg ccccatcaac cagggcaact tcagcgccac
catgagcagc 1680ggcagcaacc tgcagagcgg cagcttccgc accgtgggct
tcaccacccc cttcaacttc 1740agcaacggca gcagcgtgtt caccctgagc
gcccacgtgt tcaacagcgg caacgaggtg 1800tacatcgacc gcatcgagtt
cgtgcccgcc gaggtgacct tcgaggccga gtacgacctg 1860gagagggctc
agaaggccgt gaacgagctg ttcaccagca gcaaccagat cggcctgaag
1920accgacgtga ccgactacca 194040646PRTArtificial Sequence5*2OL-7
protein 40Met Thr Ser Asn Gly Arg Gln Cys Ala Gly Ile Arg Pro Tyr
Asp Gly 1 5 10 15 Arg Gln Gln His Arg Gly Leu Asp Ser Ser Thr Thr
Lys Asp Val Ile 20 25 30 Gln Lys Gly Ile Ser Val Val Gly Asp Leu
Leu Gly Val Val Gly Phe 35 40 45 Pro Phe Gly Gly Ala Leu Val Ser
Phe Tyr Thr Asn Phe Leu Asn Thr 50 55 60 Ile Trp Pro Ser Glu Asp
Pro Trp Lys Ala Phe Met Glu Gln Val Glu 65 70 75 80 Ala Leu Met Asp
Gln Lys Ile Ala Asp Tyr Ala Lys Asn Lys Ala Leu 85 90 95 Ala Glu
Leu Gln Gly Leu Gln Asn Asn Val Glu Asp Tyr Val Ser Ala 100 105 110
Leu Ser Ser Trp Gln Lys Asn Pro Ala Ala Pro Phe Arg Asn Pro His 115
120 125 Ser Gln Gly Arg Ile Arg Glu Leu Phe Ser Gln Ala Glu Ser His
Phe 130 135 140 Arg Asn Ser Met Pro Ser Phe Ala Ile Ser Gly Tyr Glu
Val Leu Phe 145 150 155 160 Leu Thr Thr Tyr Val Gln Ala Ala Asn Leu
His Leu Ser Val Leu Arg 165 170 175 Asp Val Ser Val Phe Gly Gln Arg
Trp Gly Phe Asp Ala Ala Thr Ile 180 185 190 Asn Ser Arg Tyr Asn Asp
Leu Thr Arg Leu Ile Gly Asn Tyr Thr Asp 195 200 205 His Ala Val Arg
Trp Tyr Asn Thr Gly Leu Glu Arg Val Trp Gly Pro 210 215 220 Asp Ser
Arg Asp Trp Ile Arg Tyr Asn Gln Phe Arg Arg Glu Leu Thr 225 230 235
240 Leu Thr Val Leu Asp Ile Val Ser Leu Phe Pro Asn Tyr Asp Ser Arg
245 250 255 Thr Tyr Pro Ile Arg Thr Val Ser Gln Leu Thr Arg Glu Ile
Tyr Thr 260 265 270 Asn Pro Val Leu Glu Asn Phe Asp Gly Ser Phe Arg
Gly Ser Ala Gln 275 280 285 Gly Ile Glu Gly Ser Ile Arg Ser Pro His
Leu Met Asp Ile Leu Asn 290 295 300 Ser Ile Thr Ile Tyr Thr Asp Ala
His Arg Gly Glu Tyr Tyr Trp Ser 305 310 315 320 Gly His Gln Ile Met
Ala Ser Pro Val Gly Phe Ser Gly Pro Glu Phe 325 330 335 Thr Phe Pro
Leu Tyr Gly Thr Met Gly Asn Ala Ala Pro Gln Gln Arg 340 345 350 Ile
Val Ala Gln Leu Gly Gln Gly Val Tyr Arg Thr Leu Ser Ser Thr 355
360 365 Leu Tyr Arg Arg Pro Phe Asn Ile Gly Ile Asn Asn Gln Gln Leu
Ser 370 375 380 Val Leu Asp Gly Thr Glu Phe Ala Tyr Gly Thr Ser Ser
Asn Leu Pro 385 390 395 400 Ser Ala Val Tyr Arg Lys Ser Gly Thr Val
Asp Ser Leu Asp Glu Ile 405 410 415 Pro Pro Gln Asn Asn Asn Val Pro
Pro Arg Gln Gly Phe Ser His Arg 420 425 430 Leu Ser His Val Ser Met
Phe Arg Ser Gly Phe Ser Asn Ser Ser Val 435 440 445 Ser Ile Ile Arg
Ala Pro Met Phe Ser Trp Ile His Arg Ser Ala Glu 450 455 460 Phe Asn
Asn Ile Ile Pro Ser Ser Gln Ile Thr Gln Ile Pro Leu Thr 465 470 475
480 Lys Ser Thr Asn Leu Gly Ser Gly Thr Ser Val Val Lys Gly Pro Gly
485 490 495 Phe Thr Gly Gly Asp Ile Leu Arg Arg Thr Ser Pro Gly Gln
Ile Ser 500 505 510 Thr Leu Arg Val Asn Ile Thr Ala Pro Leu Ser Gln
Arg Tyr Arg Val 515 520 525 Arg Ile Arg Tyr Ala Ser Thr Thr Asn Leu
Gln Phe His Thr Ser Ile 530 535 540 Asp Gly Arg Pro Ile Asn Gln Gly
Asn Phe Ser Ala Thr Met Ser Ser 545 550 555 560 Gly Ser Asn Leu Gln
Ser Gly Ser Phe Arg Thr Val Gly Phe Thr Thr 565 570 575 Pro Phe Asn
Phe Ser Asn Gly Ser Ser Val Phe Thr Leu Ser Ala His 580 585 590 Val
Phe Asn Ser Gly Asn Glu Val Tyr Ile Asp Arg Ile Glu Phe Val 595 600
605 Pro Ala Glu Val Thr Phe Glu Ala Glu Tyr Asp Leu Glu Arg Ala Gln
610 615 620 Lys Ala Val Asn Glu Leu Phe Thr Ser Ser Asn Gln Ile Gly
Leu Lys 625 630 635 640 Thr Asp Val Thr Asp Tyr 645
411917DNAArtificial Sequence2OL-10 coding sequence 41atgacggccg
acaacaacac cgaggccctg gacagcagca ccaccaagga cgtgatccag 60aagggcatca
gcgtggtggg cgacctgctg ggcgtggtgg gcttcccctt cggcggcgcc
120ctggtgagct tctacaccaa cttcctgaac accatctggc ccagcgagga
cccctggaag 180gccttcatgg agcaggtgga ggccctgatg gaccagaaga
tcgccgacta cgccaagaac 240aaggcactgg ccgagctaca gggcctccag
aacaacgtgg aggactatgt gagcgccctg 300agcagctggc agaagaaccc
cgctgcaccg ttccgcaacc cccacagcca gggccgcatc 360cgcgagctgt
tcagccaggc cgagagccac ttccgcaaca gcatgcccag cttcgccatc
420agcggctacg aggtgctgtt cctgaccacc tacgcccagg ccgccaacac
ccacctgttc 480ctgctgaagg acgcccaaat ctacggagag gagtggggct
acgagaagga ggacatcgcc 540gagttctaca agcgccagct gaagctgacc
caggagtaca ccgaccactg cgtgaagtgg 600tacaacgtgg gtctagacaa
gctccgcggc agcagctacg agagctgggt gaacttcaac 660cgctaccgcc
gcgagatgac cctgaccgtg ctggacatcg tgagcctgtt ccccaactac
720gacagccgca cctaccccat ccgcaccgtg agccagctga cccgcgagat
ttacaccaac 780cccgtgctgg agaacttcga cggcagcttc cgcggcagcg
cccagggcat cgagggcagc 840atccgcagcc cccacctgat ggacatcctg
aacagcatca ccatctacac cgacgcccac 900cgcggcgagt actactggag
cggccaccag atcatggcca gccccgtcgg cttcagcggc 960cccgagttca
ccttccccct gtacggcacc atgggcaacg ctgcacctca gcagcgcatc
1020gtggcacagc tgggccaggg agtgtaccgc accctgagca gcaccctgta
ccgtcgacct 1080ttcaacatcg gcatcaacaa ccagcagctg agcgtgctgg
acggcaccga gttcgcctac 1140ggcaccagca gcaacctgcc cagcgccgtg
taccgcaaga gcggcaccgt ggacagcctg 1200gacgagatcc cccctcagaa
caacaacgtg ccacctcgac agggcttcag ccaccgtctg 1260agccacgtga
gcatgttccg cagtggcttc agcaacagca gcgtgagcat catccgtgca
1320cctatgttca gctggattca ccgcagtgcc gagttcaaca acatcatccc
cagcagccag 1380atcacccaga tccccctgac caagagcacc aacctgggca
gcggcaccag cgtggtgaag 1440ggccccggct tcaccggcgg cgacatcctg
cgccgcacca gccccggcca gatcagcacc 1500ctgcgcgtga acatcaccgc
ccccctgagc cagcgctacc gcgtccgcat ccgctacgcc 1560agcaccacca
acctgcagtt ccacaccagc atcgacggcc gccccatcaa ccagggcaac
1620ttcagcgcca ccatgagcag cggcagcaac ctgcagagcg gcagcttccg
caccgtgggc 1680ttcaccaccc ccttcaactt cagcaacggc agcagcgtgt
tcaccctgag cgcccacgtg 1740ttcaacagcg gcaacgaggt gtacatcgac
cgcatcgagt tcgtgcccgc cgaggtgacc 1800ttcgaggccg agtacgacct
ggagagggct cagaaggccg tgaacgagct gttcaccagc 1860agcaaccaga
tcggcctgaa gaccgacgtg accgactacc acatcgatca ggtgtag
191742638PRTArtificial Sequence2OL-10 protein 42Met Thr Ala Asp Asn
Asn Thr Glu Ala Leu Asp Ser Ser Thr Thr Lys 1 5 10 15 Asp Val Ile
Gln Lys Gly Ile Ser Val Val Gly Asp Leu Leu Gly Val 20 25 30 Val
Gly Phe Pro Phe Gly Gly Ala Leu Val Ser Phe Tyr Thr Asn Phe 35 40
45 Leu Asn Thr Ile Trp Pro Ser Glu Asp Pro Trp Lys Ala Phe Met Glu
50 55 60 Gln Val Glu Ala Leu Met Asp Gln Lys Ile Ala Asp Tyr Ala
Lys Asn 65 70 75 80 Lys Ala Leu Ala Glu Leu Gln Gly Leu Gln Asn Asn
Val Glu Asp Tyr 85 90 95 Val Ser Ala Leu Ser Ser Trp Gln Lys Asn
Pro Ala Ala Pro Phe Arg 100 105 110 Asn Pro His Ser Gln Gly Arg Ile
Arg Glu Leu Phe Ser Gln Ala Glu 115 120 125 Ser His Phe Arg Asn Ser
Met Pro Ser Phe Ala Ile Ser Gly Tyr Glu 130 135 140 Val Leu Phe Leu
Thr Thr Tyr Ala Gln Ala Ala Asn Thr His Leu Phe 145 150 155 160 Leu
Leu Lys Asp Ala Gln Ile Tyr Gly Glu Glu Trp Gly Tyr Glu Lys 165 170
175 Glu Asp Ile Ala Glu Phe Tyr Lys Arg Gln Leu Lys Leu Thr Gln Glu
180 185 190 Tyr Thr Asp His Cys Val Lys Trp Tyr Asn Val Gly Leu Asp
Lys Leu 195 200 205 Arg Gly Ser Ser Tyr Glu Ser Trp Val Asn Phe Asn
Arg Tyr Arg Arg 210 215 220 Glu Met Thr Leu Thr Val Leu Asp Ile Val
Ser Leu Phe Pro Asn Tyr 225 230 235 240 Asp Ser Arg Thr Tyr Pro Ile
Arg Thr Val Ser Gln Leu Thr Arg Glu 245 250 255 Ile Tyr Thr Asn Pro
Val Leu Glu Asn Phe Asp Gly Ser Phe Arg Gly 260 265 270 Ser Ala Gln
Gly Ile Glu Gly Ser Ile Arg Ser Pro His Leu Met Asp 275 280 285 Ile
Leu Asn Ser Ile Thr Ile Tyr Thr Asp Ala His Arg Gly Glu Tyr 290 295
300 Tyr Trp Ser Gly His Gln Ile Met Ala Ser Pro Val Gly Phe Ser Gly
305 310 315 320 Pro Glu Phe Thr Phe Pro Leu Tyr Gly Thr Met Gly Asn
Ala Ala Pro 325 330 335 Gln Gln Arg Ile Val Ala Gln Leu Gly Gln Gly
Val Tyr Arg Thr Leu 340 345 350 Ser Ser Thr Leu Tyr Arg Arg Pro Phe
Asn Ile Gly Ile Asn Asn Gln 355 360 365 Gln Leu Ser Val Leu Asp Gly
Thr Glu Phe Ala Tyr Gly Thr Ser Ser 370 375 380 Asn Leu Pro Ser Ala
Val Tyr Arg Lys Ser Gly Thr Val Asp Ser Leu 385 390 395 400 Asp Glu
Ile Pro Pro Gln Asn Asn Asn Val Pro Pro Arg Gln Gly Phe 405 410 415
Ser His Arg Leu Ser His Val Ser Met Phe Arg Ser Gly Phe Ser Asn 420
425 430 Ser Ser Val Ser Ile Ile Arg Ala Pro Met Phe Ser Trp Ile His
Arg 435 440 445 Ser Ala Glu Phe Asn Asn Ile Ile Pro Ser Ser Gln Ile
Thr Gln Ile 450 455 460 Pro Leu Thr Lys Ser Thr Asn Leu Gly Ser Gly
Thr Ser Val Val Lys 465 470 475 480 Gly Pro Gly Phe Thr Gly Gly Asp
Ile Leu Arg Arg Thr Ser Pro Gly 485 490 495 Gln Ile Ser Thr Leu Arg
Val Asn Ile Thr Ala Pro Leu Ser Gln Arg 500 505 510 Tyr Arg Val Arg
Ile Arg Tyr Ala Ser Thr Thr Asn Leu Gln Phe His 515 520 525 Thr Ser
Ile Asp Gly Arg Pro Ile Asn Gln Gly Asn Phe Ser Ala Thr 530 535 540
Met Ser Ser Gly Ser Asn Leu Gln Ser Gly Ser Phe Arg Thr Val Gly 545
550 555 560 Phe Thr Thr Pro Phe Asn Phe Ser Asn Gly Ser Ser Val Phe
Thr Leu 565 570 575 Ser Ala His Val Phe Asn Ser Gly Asn Glu Val Tyr
Ile Asp Arg Ile 580 585 590 Glu Phe Val Pro Ala Glu Val Thr Phe Glu
Ala Glu Tyr Asp Leu Glu 595 600 605 Arg Ala Gln Lys Ala Val Asn Glu
Leu Phe Thr Ser Ser Asn Gln Ile 610 615 620 Gly Leu Lys Thr Asp Val
Thr Asp Tyr His Ile Asp Gln Val 625 630 635 431956DNAArtificial
Sequence5*2OL-10 coding sequence 43atgactagta acggccgcca gtgtgctggt
attcgccctt atgacggccg acaacaacac 60cgaggcctgg acagcagcac caccaaggac
gtgatccaga agggcatcag cgtggtgggc 120gacctgctgg gcgtggtggg
cttccccttc ggcggcgccc tggtgagctt ctacaccaac 180ttcctgaaca
ccatctggcc cagcgaggac ccctggaagg ccttcatgga gcaggtggag
240gccctgatgg accagaagat cgccgactac gccaagaaca aggcactggc
cgagctacag 300ggcctccaga acaacgtgga ggactatgtg agcgccctga
gcagctggca gaagaacccc 360gctgcaccgt tccgcaaccc ccacagccag
ggccgcatcc gcgagctgtt cagccaggcc 420gagagccact tccgcaacag
catgcccagc ttcgccatca gcggctacga ggtgctgttc 480ctgaccacct
acgcccaggc cgccaacacc cacctgttcc tgctgaagga cgcccaaatc
540tacggagagg agtggggcta cgagaaggag gacatcgccg agttctacaa
gcgccagctg 600aagctgaccc aggagtacac cgaccactgc gtgaagtggt
acaacgtggg tctagacaag 660ctccgcggca gcagctacga gagctgggtg
aacttcaacc gctaccgccg cgagatgacc 720ctgaccgtgc tggacatcgt
gagcctgttc cccaactacg acagccgcac ctaccccatc 780cgcaccgtga
gccagctgac ccgcgagatt tacaccaacc ccgtgctgga gaacttcgac
840ggcagcttcc gcggcagcgc ccagggcatc gagggcagca tccgcagccc
ccacctgatg 900gacatcctga acagcatcac catctacacc gacgcccacc
gcggcgagta ctactggagc 960ggccaccaga tcatggccag ccccgtcggc
ttcagcggcc ccgagttcac cttccccctg 1020tacggcacca tgggcaacgc
tgcacctcag cagcgcatcg tggcacagct gggccaggga 1080gtgtaccgca
ccctgagcag caccctgtac cgtcgacctt tcaacatcgg catcaacaac
1140cagcagctga gcgtgctgga cggcaccgag ttcgcctacg gcaccagcag
caacctgccc 1200agcgccgtgt accgcaagag cggcaccgtg gacagcctgg
acgagatccc ccctcagaac 1260aacaacgtgc cacctcgaca gggcttcagc
caccgtctga gccacgtgag catgttccgc 1320agtggcttca gcaacagcag
cgtgagcatc atccgtgcac ctatgttcag ctggattcac 1380cgcagtgccg
agttcaacaa catcatcccc agcagccaga tcacccagat ccccctgacc
1440aagagcacca acctgggcag cggcaccagc gtggtgaagg gccccggctt
caccggcggc 1500gacatcctgc gccgcaccag ccccggccag atcagcaccc
tgcgcgtgaa catcaccgcc 1560cccctgagcc agcgctaccg cgtccgcatc
cgctacgcca gcaccaccaa cctgcagttc 1620cacaccagca tcgacggccg
ccccatcaac cagggcaact tcagcgccac catgagcagc 1680ggcagcaacc
tgcagagcgg cagcttccgc accgtgggct tcaccacccc cttcaacttc
1740agcaacggca gcagcgtgtt caccctgagc gcccacgtgt tcaacagcgg
caacgaggtg 1800tacatcgacc gcatcgagtt cgtgcccgcc gaggtgacct
tcgaggccga gtacgacctg 1860gagagggctc agaaggccgt gaacgagctg
ttcaccagca gcaaccagat cggcctgaag 1920accgacgtga ccgactacca
catcgatcag gtgtag 195644651PRTArtificial Sequnece5*2OL-10 protein
44Met Thr Ser Asn Gly Arg Gln Cys Ala Gly Ile Arg Pro Tyr Asp Gly 1
5 10 15 Arg Gln Gln His Arg Gly Leu Asp Ser Ser Thr Thr Lys Asp Val
Ile 20 25 30 Gln Lys Gly Ile Ser Val Val Gly Asp Leu Leu Gly Val
Val Gly Phe 35 40 45 Pro Phe Gly Gly Ala Leu Val Ser Phe Tyr Thr
Asn Phe Leu Asn Thr 50 55 60 Ile Trp Pro Ser Glu Asp Pro Trp Lys
Ala Phe Met Glu Gln Val Glu 65 70 75 80 Ala Leu Met Asp Gln Lys Ile
Ala Asp Tyr Ala Lys Asn Lys Ala Leu 85 90 95 Ala Glu Leu Gln Gly
Leu Gln Asn Asn Val Glu Asp Tyr Val Ser Ala 100 105 110 Leu Ser Ser
Trp Gln Lys Asn Pro Ala Ala Pro Phe Arg Asn Pro His 115 120 125 Ser
Gln Gly Arg Ile Arg Glu Leu Phe Ser Gln Ala Glu Ser His Phe 130 135
140 Arg Asn Ser Met Pro Ser Phe Ala Ile Ser Gly Tyr Glu Val Leu Phe
145 150 155 160 Leu Thr Thr Tyr Ala Gln Ala Ala Asn Thr His Leu Phe
Leu Leu Lys 165 170 175 Asp Ala Gln Ile Tyr Gly Glu Glu Trp Gly Tyr
Glu Lys Glu Asp Ile 180 185 190 Ala Glu Phe Tyr Lys Arg Gln Leu Lys
Leu Thr Gln Glu Tyr Thr Asp 195 200 205 His Cys Val Lys Trp Tyr Asn
Val Gly Leu Asp Lys Leu Arg Gly Ser 210 215 220 Ser Tyr Glu Ser Trp
Val Asn Phe Asn Arg Tyr Arg Arg Glu Met Thr 225 230 235 240 Leu Thr
Val Leu Asp Ile Val Ser Leu Phe Pro Asn Tyr Asp Ser Arg 245 250 255
Thr Tyr Pro Ile Arg Thr Val Ser Gln Leu Thr Arg Glu Ile Tyr Thr 260
265 270 Asn Pro Val Leu Glu Asn Phe Asp Gly Ser Phe Arg Gly Ser Ala
Gln 275 280 285 Gly Ile Glu Gly Ser Ile Arg Ser Pro His Leu Met Asp
Ile Leu Asn 290 295 300 Ser Ile Thr Ile Tyr Thr Asp Ala His Arg Gly
Glu Tyr Tyr Trp Ser 305 310 315 320 Gly His Gln Ile Met Ala Ser Pro
Val Gly Phe Ser Gly Pro Glu Phe 325 330 335 Thr Phe Pro Leu Tyr Gly
Thr Met Gly Asn Ala Ala Pro Gln Gln Arg 340 345 350 Ile Val Ala Gln
Leu Gly Gln Gly Val Tyr Arg Thr Leu Ser Ser Thr 355 360 365 Leu Tyr
Arg Arg Pro Phe Asn Ile Gly Ile Asn Asn Gln Gln Leu Ser 370 375 380
Val Leu Asp Gly Thr Glu Phe Ala Tyr Gly Thr Ser Ser Asn Leu Pro 385
390 395 400 Ser Ala Val Tyr Arg Lys Ser Gly Thr Val Asp Ser Leu Asp
Glu Ile 405 410 415 Pro Pro Gln Asn Asn Asn Val Pro Pro Arg Gln Gly
Phe Ser His Arg 420 425 430 Leu Ser His Val Ser Met Phe Arg Ser Gly
Phe Ser Asn Ser Ser Val 435 440 445 Ser Ile Ile Arg Ala Pro Met Phe
Ser Trp Ile His Arg Ser Ala Glu 450 455 460 Phe Asn Asn Ile Ile Pro
Ser Ser Gln Ile Thr Gln Ile Pro Leu Thr 465 470 475 480 Lys Ser Thr
Asn Leu Gly Ser Gly Thr Ser Val Val Lys Gly Pro Gly 485 490 495 Phe
Thr Gly Gly Asp Ile Leu Arg Arg Thr Ser Pro Gly Gln Ile Ser 500 505
510 Thr Leu Arg Val Asn Ile Thr Ala Pro Leu Ser Gln Arg Tyr Arg Val
515 520 525 Arg Ile Arg Tyr Ala Ser Thr Thr Asn Leu Gln Phe His Thr
Ser Ile 530 535 540 Asp Gly Arg Pro Ile Asn Gln Gly Asn Phe Ser Ala
Thr Met Ser Ser 545 550 555 560 Gly Ser Asn Leu Gln Ser Gly Ser Phe
Arg Thr Val Gly Phe Thr Thr 565 570 575 Pro Phe Asn Phe Ser Asn Gly
Ser Ser Val Phe Thr Leu Ser Ala His 580 585 590 Val Phe Asn Ser Gly
Asn Glu Val Tyr Ile Asp Arg Ile Glu Phe Val 595 600 605 Pro Ala Glu
Val Thr Phe Glu Ala Glu Tyr Asp Leu Glu Arg Ala Gln 610 615 620 Lys
Ala Val Asn Glu Leu Phe Thr Ser Ser Asn Gln Ile Gly Leu Lys 625 630
635 640 Thr Asp Val Thr Asp Tyr His Ile Asp Gln Val 645 650
451821DNAArtificial Sequence2OL-12A coding sequence 45atggacaaca
accccaacat caacgagtgc atcccctaca actgcctgag caaccccgag 60gtggaggtgc
tgggcggcga gcgcatcgag accggctaca cccccatcga catcagcctg
120agcctgaccc agttcctgct gagcgagttc gtgcccggcg ccggcttcgt
gctgggcctg 180gtggacatca tctggggcat cttcggcccc agccagtggg
acgccttcct ggtgcagatc 240gagcagttga taaaccaacg catagaggaa
ttcgcccgca accaggccat cagccgcctg 300gagggcctga gcaacctgta
ccaaatctac gccgagagct tccgcgagtg ggaggccgac 360cccaccaacc
ccgccctgcg cgaggagatg cgcatccagt tcaacgacat gaacagcgcc
420ctgaccaccg ccatccccct gttcgccgtg cagaactacc aggtgcccct
gctgagcgtg 480tacgtgcagg ccgccaacct gcacctgagc gtgctgcgcg
acgtcagcgt gttcggccag 540cgctggggct tcgacgccgc caccatcaac
agccgctaca acgacctgac ccgcctgatc 600ggcaactaca ccgaccacgc
cgtgcgctgg tacaacaccg gcctggagcg
cgtgtggggt 660cccgacagcc gcgactggat caggtacaac cagttccgcc
gcgagctgac cctgaccgtg 720ctggacatcg tgagcctgtt ccccaactac
gacagccgca cctaccccat ccgcaccgtg 780agccagctga cccgcgagat
ttacaccaac cccgtgctgg agaacttcga cggcagcttc 840cgcggcagcg
cccagggcat cgagggcagc atccgcagcc cccacctgat ggacatcctg
900aacagcatca ccatctacac cgacgcccac cgcggcgagt actactggag
cggccaccag 960atcatggcca gccccgtcgg cttcagcggc cccgagttca
ccttccccct gtacggcacc 1020atgggcaacg ctgcacctca gcagcgcatc
gtggcacagc tgggccaggg agtgtaccgc 1080accctgagca gcaccctgta
ccgtcgacct ttcaacatcg gcatcaacaa ccagcagctg 1140agcgtgctgg
acggcaccga gttcgcctac ggcaccagca gcaacctgcc cagcgccgtg
1200taccgcaaga gcggcaccgt ggacagcctg gacgagatcc cccctcagaa
caacaacgtg 1260ccacctcgac agggcttcag ccaccgtctg agccacgtga
gcatgttccg cagtggcttc 1320agcaacagca gcgtgagcat catccgtgca
cctatgttca gctggattca ccgcagtgcc 1380gagttcaaca acatcatccc
cagcagccag atcacccaga tccccctggt gaaggcctac 1440aagctccaga
gcggcgccag cgtggtggca ggcccccgct tcaccggcgg cgacatcatc
1500cagtgcaccg agaacggcag cgccgccacc atctacgtga cccccgacgt
gagctacagc 1560cagaagtacc gcgcccgcat ccactacgcc agcaccagcc
agatcacctt caccctgagc 1620ctggacgggg cccccttcaa ccaatactac
ttcgacaaga ccatcaacaa gggcgacacc 1680ctgacctaca acagcttcaa
cctggccagc ttcagcaccc ctttcgagct gagcggcaac 1740aacctccaga
tcggcgtgac cggcctgagc gccggcgaca aggtgtacat cgacaagatc
1800gagttcatcc ccgtgaacta g 182146606PRTArtificial Sequence2OL-12A
protein 46Met Asp Asn Asn Pro Asn Ile Asn Glu Cys Ile Pro Tyr Asn
Cys Leu 1 5 10 15 Ser Asn Pro Glu Val Glu Val Leu Gly Gly Glu Arg
Ile Glu Thr Gly 20 25 30 Tyr Thr Pro Ile Asp Ile Ser Leu Ser Leu
Thr Gln Phe Leu Leu Ser 35 40 45 Glu Phe Val Pro Gly Ala Gly Phe
Val Leu Gly Leu Val Asp Ile Ile 50 55 60 Trp Gly Ile Phe Gly Pro
Ser Gln Trp Asp Ala Phe Leu Val Gln Ile 65 70 75 80 Glu Gln Leu Ile
Asn Gln Arg Ile Glu Glu Phe Ala Arg Asn Gln Ala 85 90 95 Ile Ser
Arg Leu Glu Gly Leu Ser Asn Leu Tyr Gln Ile Tyr Ala Glu 100 105 110
Ser Phe Arg Glu Trp Glu Ala Asp Pro Thr Asn Pro Ala Leu Arg Glu 115
120 125 Glu Met Arg Ile Gln Phe Asn Asp Met Asn Ser Ala Leu Thr Thr
Ala 130 135 140 Ile Pro Leu Phe Ala Val Gln Asn Tyr Gln Val Pro Leu
Leu Ser Val 145 150 155 160 Tyr Val Gln Ala Ala Asn Leu His Leu Ser
Val Leu Arg Asp Val Ser 165 170 175 Val Phe Gly Gln Arg Trp Gly Phe
Asp Ala Ala Thr Ile Asn Ser Arg 180 185 190 Tyr Asn Asp Leu Thr Arg
Leu Ile Gly Asn Tyr Thr Asp His Ala Val 195 200 205 Arg Trp Tyr Asn
Thr Gly Leu Glu Arg Val Trp Gly Pro Asp Ser Arg 210 215 220 Asp Trp
Ile Arg Tyr Asn Gln Phe Arg Arg Glu Leu Thr Leu Thr Val 225 230 235
240 Leu Asp Ile Val Ser Leu Phe Pro Asn Tyr Asp Ser Arg Thr Tyr Pro
245 250 255 Ile Arg Thr Val Ser Gln Leu Thr Arg Glu Ile Tyr Thr Asn
Pro Val 260 265 270 Leu Glu Asn Phe Asp Gly Ser Phe Arg Gly Ser Ala
Gln Gly Ile Glu 275 280 285 Gly Ser Ile Arg Ser Pro His Leu Met Asp
Ile Leu Asn Ser Ile Thr 290 295 300 Ile Tyr Thr Asp Ala His Arg Gly
Glu Tyr Tyr Trp Ser Gly His Gln 305 310 315 320 Ile Met Ala Ser Pro
Val Gly Phe Ser Gly Pro Glu Phe Thr Phe Pro 325 330 335 Leu Tyr Gly
Thr Met Gly Asn Ala Ala Pro Gln Gln Arg Ile Val Ala 340 345 350 Gln
Leu Gly Gln Gly Val Tyr Arg Thr Leu Ser Ser Thr Leu Tyr Arg 355 360
365 Arg Pro Phe Asn Ile Gly Ile Asn Asn Gln Gln Leu Ser Val Leu Asp
370 375 380 Gly Thr Glu Phe Ala Tyr Gly Thr Ser Ser Asn Leu Pro Ser
Ala Val 385 390 395 400 Tyr Arg Lys Ser Gly Thr Val Asp Ser Leu Asp
Glu Ile Pro Pro Gln 405 410 415 Asn Asn Asn Val Pro Pro Arg Gln Gly
Phe Ser His Arg Leu Ser His 420 425 430 Val Ser Met Phe Arg Ser Gly
Phe Ser Asn Ser Ser Val Ser Ile Ile 435 440 445 Arg Ala Pro Met Phe
Ser Trp Ile His Arg Ser Ala Glu Phe Asn Asn 450 455 460 Ile Ile Pro
Ser Ser Gln Ile Thr Gln Ile Pro Leu Val Lys Ala Tyr 465 470 475 480
Lys Leu Gln Ser Gly Ala Ser Val Val Ala Gly Pro Arg Phe Thr Gly 485
490 495 Gly Asp Ile Ile Gln Cys Thr Glu Asn Gly Ser Ala Ala Thr Ile
Tyr 500 505 510 Val Thr Pro Asp Val Ser Tyr Ser Gln Lys Tyr Arg Ala
Arg Ile His 515 520 525 Tyr Ala Ser Thr Ser Gln Ile Thr Phe Thr Leu
Ser Leu Asp Gly Ala 530 535 540 Pro Phe Asn Gln Tyr Tyr Phe Asp Lys
Thr Ile Asn Lys Gly Asp Thr 545 550 555 560 Leu Thr Tyr Asn Ser Phe
Asn Leu Ala Ser Phe Ser Thr Pro Phe Glu 565 570 575 Leu Ser Gly Asn
Asn Leu Gln Ile Gly Val Thr Gly Leu Ser Ala Gly 580 585 590 Asp Lys
Val Tyr Ile Asp Lys Ile Glu Phe Ile Pro Val Asn 595 600 605 47
1791DNAArtificial Sequence2OL-13 coding sequence 47atgacggccg
acaacaacac cgaggccctg gacagcagca ccaccaagga cgtgatccag 60aagggcatca
gcgtggtggg cgacctgctg ggcgtggtgg gcttcccctt cggcggcgcc
120ctggtgagct tctacaccaa cttcctgaac accatctggc ccagcgagga
cccctggaag 180gccttcatgg agcaggtgga ggccctgatg gaccagaaga
tcgccgacta cgccaagaac 240aaggcactgg ccgagctaca gggcctccag
aacaacgtgg aggactatgt gagcgccctg 300agcagctggc agaagaaccc
cgctgcaccg ttccgcaacc cccacagcca gggccgcatc 360cgcgagctgt
tcagccaggc cgagagccac ttccgcaaca gcatgcccag cttcgccatc
420agcggctacg aggtgctgtt cctgaccacc tacgcccagg ccgccaacac
ccacctgagc 480gtgctgcgcg acgtcagcgt gttcggccag cgctggggct
tcgacgccgc caccatcaac 540agccgctaca acgacctgac ccgcctgatc
ggcaactaca ccgaccacgc cgtgcgctgg 600tacaacaccg gcctggagcg
cgtgtggggt cccgacagcc gcgactggat caggtacaac 660cagttccgcc
gcgagctgac cctgaccgtg ctggacatcg tgagcctgtt ccccaactac
720gacagccgca cctaccccat ccgcaccgtg agccagctga cccgcgagat
ttacaccaac 780cccgtgctgg agaacttcga cggcagcttc cgcggcagcg
cccagggcat cgagggcagc 840atccgcagcc cccacctgat ggacatcctg
aacagcatca ccatctacac cgacgcccac 900cgcggcgagt actactggag
cggccaccag atcatggcca gccccgtcgg cttcagcggc 960cccgagttca
ccttccccct gtacggcacc atgggcaacg ctgcacctca gcagcgcatc
1020gtggcacagc tgggccaggg agtgtaccgc accctgagca gcaccctgta
ccgtcgacct 1080ttcaacatcg gcatcaacaa ccagcagctg agcgtgctgg
acggcaccga gttcgcctac 1140ggcaccagca gcaacctgcc cagcgccgtg
taccgcaaga gcggcaccgt ggacagcctg 1200gacgagatcc cccctcagaa
caacaacgtg ccacctcgac agggcttcag ccaccgtctg 1260agccacgtga
gcatgttccg cagtggcttc agcaacagca gcgtgagcat catccgtgca
1320cctatgttca gctggattca ccgcagtgcc gagttcaaca acatcatccc
cagcagccag 1380atcacccaga tccccctgac caagagcacc aacctgggca
gcggcaccag cgtggtgaag 1440ggccccggct tcaccggcgg cgacatcctg
cgccgcacca gccccggcca gatcagcacc 1500ctgcgcgtga acatcaccgc
ccccctgagc cagcgctacc gcgcccgcat ccactacgcc 1560agcaccagcc
agatcacctt caccctgagc ctggacgggg cccccttcaa ccaatactac
1620ttcgacaaga ccatcaacaa gggcgacacc ctgacctaca acagcttcaa
cctggccagc 1680ttcagcaccc ctttcgagct gagcggcaac aacctccaga
tcggcgtgac cggcctgagc 1740gccggcgaca aggtgtacat cgacaagatc
gagttcatcc ccgtgaacta g 179148596PRTArtificial Sequence2OL-13
protein 48Met Thr Ala Asp Asn Asn Thr Glu Ala Leu Asp Ser Ser Thr
Thr Lys 1 5 10 15 Asp Val Ile Gln Lys Gly Ile Ser Val Val Gly Asp
Leu Leu Gly Val 20 25 30 Val Gly Phe Pro Phe Gly Gly Ala Leu Val
Ser Phe Tyr Thr Asn Phe 35 40 45 Leu Asn Thr Ile Trp Pro Ser Glu
Asp Pro Trp Lys Ala Phe Met Glu 50 55 60 Gln Val Glu Ala Leu Met
Asp Gln Lys Ile Ala Asp Tyr Ala Lys Asn 65 70 75 80 Lys Ala Leu Ala
Glu Leu Gln Gly Leu Gln Asn Asn Val Glu Asp Tyr 85 90 95 Val Ser
Ala Leu Ser Ser Trp Gln Lys Asn Pro Ala Ala Pro Phe Arg 100 105 110
Asn Pro His Ser Gln Gly Arg Ile Arg Glu Leu Phe Ser Gln Ala Glu 115
120 125 Ser His Phe Arg Asn Ser Met Pro Ser Phe Ala Ile Ser Gly Tyr
Glu 130 135 140 Val Leu Phe Leu Thr Thr Tyr Ala Gln Ala Ala Asn Thr
His Leu Ser 145 150 155 160 Val Leu Arg Asp Val Ser Val Phe Gly Gln
Arg Trp Gly Phe Asp Ala 165 170 175 Ala Thr Ile Asn Ser Arg Tyr Asn
Asp Leu Thr Arg Leu Ile Gly Asn 180 185 190 Tyr Thr Asp His Ala Val
Arg Trp Tyr Asn Thr Gly Leu Glu Arg Val 195 200 205 Trp Gly Pro Asp
Ser Arg Asp Trp Ile Arg Tyr Asn Gln Phe Arg Arg 210 215 220 Glu Leu
Thr Leu Thr Val Leu Asp Ile Val Ser Leu Phe Pro Asn Tyr 225 230 235
240 Asp Ser Arg Thr Tyr Pro Ile Arg Thr Val Ser Gln Leu Thr Arg Glu
245 250 255 Ile Tyr Thr Asn Pro Val Leu Glu Asn Phe Asp Gly Ser Phe
Arg Gly 260 265 270 Ser Ala Gln Gly Ile Glu Gly Ser Ile Arg Ser Pro
His Leu Met Asp 275 280 285 Ile Leu Asn Ser Ile Thr Ile Tyr Thr Asp
Ala His Arg Gly Glu Tyr 290 295 300 Tyr Trp Ser Gly His Gln Ile Met
Ala Ser Pro Val Gly Phe Ser Gly 305 310 315 320 Pro Glu Phe Thr Phe
Pro Leu Tyr Gly Thr Met Gly Asn Ala Ala Pro 325 330 335 Gln Gln Arg
Ile Val Ala Gln Leu Gly Gln Gly Val Tyr Arg Thr Leu 340 345 350 Ser
Ser Thr Leu Tyr Arg Arg Pro Phe Asn Ile Gly Ile Asn Asn Gln 355 360
365 Gln Leu Ser Val Leu Asp Gly Thr Glu Phe Ala Tyr Gly Thr Ser Ser
370 375 380 Asn Leu Pro Ser Ala Val Tyr Arg Lys Ser Gly Thr Val Asp
Ser Leu 385 390 395 400 Asp Glu Ile Pro Pro Gln Asn Asn Asn Val Pro
Pro Arg Gln Gly Phe 405 410 415 Ser His Arg Leu Ser His Val Ser Met
Phe Arg Ser Gly Phe Ser Asn 420 425 430 Ser Ser Val Ser Ile Ile Arg
Ala Pro Met Phe Ser Trp Ile His Arg 435 440 445 Ser Ala Glu Phe Asn
Asn Ile Ile Pro Ser Ser Gln Ile Thr Gln Ile 450 455 460 Pro Leu Thr
Lys Ser Thr Asn Leu Gly Ser Gly Thr Ser Val Val Lys 465 470 475 480
Gly Pro Gly Phe Thr Gly Gly Asp Ile Leu Arg Arg Thr Ser Pro Gly 485
490 495 Gln Ile Ser Thr Leu Arg Val Asn Ile Thr Ala Pro Leu Ser Gln
Arg 500 505 510 Tyr Arg Ala Arg Ile His Tyr Ala Ser Thr Ser Gln Ile
Thr Phe Thr 515 520 525 Leu Ser Leu Asp Gly Ala Pro Phe Asn Gln Tyr
Tyr Phe Asp Lys Thr 530 535 540 Ile Asn Lys Gly Asp Thr Leu Thr Tyr
Asn Ser Phe Asn Leu Ala Ser 545 550 555 560 Phe Ser Thr Pro Phe Glu
Leu Ser Gly Asn Asn Leu Gln Ile Gly Val 565 570 575 Thr Gly Leu Ser
Ala Gly Asp Lys Val Tyr Ile Asp Lys Ile Glu Phe 580 585 590 Ile Pro
Val Asn 595 49 1923DNAArtificial SequenceV5&6 coding sequence
49atgacggccg acaacaacac cgaggccctg gacagcagca ccaccaagga cgtgatccag
60aagggcatca gcgtggtggg cgacctgctg ggcgtggtgg gcttcccctt cggcggcgcc
120ctggtgagct tctacaccaa cttcctgaac accatctggc ccagcgagga
cccctggaag 180gccttcatgg agcaggtgga ggccctgatg gaccagaaga
tcgccgacta cgccaagaac 240aaggcactgg ccgagctaca gggcctccag
aacaacgtgg aggactatgt gagcgccctg 300agcagctggc agaagaaccc
cgctgcaccg ttccgcaacc cccacagcca gggccgcatc 360cgcgagctgt
tcagccaggc cgagagccac ttccgcaaca gcatgcccag cttcgccatc
420agcggctacg aggtgctgtt cctgaccacc tacgcccagg ccgccaacac
ccacctgttc 480ctgctgaagg acgcccaaat ctacggagag gagtggggct
acgagaagga ggacatcgcc 540gagttctaca agcgccagct gaagctgacc
caggagtaca ccgaccactg cgtgaagtgg 600tacaacgtgg gtctagacaa
gctccgcggc agcagctacg agagctgggt gaacttcaac 660cgctaccgcc
gcgagatgac cctgaccgtg ctggacctga tcgccctgtt ccccctgtac
720gacgtgcgcc tgtaccccaa ggaggtgaag accgagctga cccgcgacgt
gctgaccgac 780cccatcgtgg gcgtgaacaa cctgcgcggc tacggcacca
ccttcagcaa catcgagaac 840tacatccgca agccccacct gttcgactac
ctgcaccgca tccagttcca cacgcgtttc 900cagcccggct actacggcaa
cgacagcttc aactactgga gcggcaacta cgtgagcacc 960cgccccagca
tcggcagcaa cgacatcatc accagcccct tctacggcaa caagagcagc
1020gagcccgtgc agaaccttga gttcaacggc gagaaggtgt accgcgccgt
ggctaacacc 1080aacctggccg tgtggccctc tgcagtgtac agcggcgtga
ccaaggtgga gttcagccag 1140tacaacgacc agaccgacga ggccagcacc
cagacctacg acagcaagcg caacgtgggc 1200gccgtgagct gggacagcat
cgaccagctg ccccccgaga ccaccgacga gcccctggag 1260aagggctaca
gccaccagct gaactacgtg atgtgcttcc tgatgcaggg cagccgcggc
1320accatccccg tgctgacctg gacccacaag agcgtcgact tcttcaacat
gatcgacagc 1380aagaagatca cccagctgcc cctggtgaag gcctacaagc
tccagagcgg cgccagcgtg 1440gtggcaggcc cccgcttcac cggcggcgac
atcatccagt gcaccgagaa cggcagcgcc 1500gccaccatct acgtgacccc
cgacgtgagc tacagccaga agtaccgcgc ccgcatccac 1560tacgccagca
ccaccaacct gcagttccac accagcatcg acggccgccc catcaaccag
1620ggcaacttca gcgccaccat gagcagcggc agcaacctgc agagcggcag
cttccgcacc 1680gtgggcttca ccaccccctt caacttcagc aacggcagca
gcgtgttcac cctgagcgcc 1740cacgtgttca acagcggcaa cgaggtgtac
atcgaccgca tcgagttcgt gcccgccgag 1800gtgaccttcg aggccgagta
cgacctggag agggctcaga aggccgtgaa cgagctgttc 1860accagcagca
accagatcgg cctgaagacc gacgtgaccg actaccacat cgatcaggtg 1920tag
192350640PRTArtificial SequenceV5&6 protein 50Met Thr Ala Asp
Asn Asn Thr Glu Ala Leu Asp Ser Ser Thr Thr Lys 1 5 10 15 Asp Val
Ile Gln Lys Gly Ile Ser Val Val Gly Asp Leu Leu Gly Val 20 25 30
Val Gly Phe Pro Phe Gly Gly Ala Leu Val Ser Phe Tyr Thr Asn Phe 35
40 45 Leu Asn Thr Ile Trp Pro Ser Glu Asp Pro Trp Lys Ala Phe Met
Glu 50 55 60 Gln Val Glu Ala Leu Met Asp Gln Lys Ile Ala Asp Tyr
Ala Lys Asn 65 70 75 80 Lys Ala Leu Ala Glu Leu Gln Gly Leu Gln Asn
Asn Val Glu Asp Tyr 85 90 95 Val Ser Ala Leu Ser Ser Trp Gln Lys
Asn Pro Ala Ala Pro Phe Arg 100 105 110 Asn Pro His Ser Gln Gly Arg
Ile Arg Glu Leu Phe Ser Gln Ala Glu 115 120 125 Ser His Phe Arg Asn
Ser Met Pro Ser Phe Ala Ile Ser Gly Tyr Glu 130 135 140 Val Leu Phe
Leu Thr Thr Tyr Ala Gln Ala Ala Asn Thr His Leu Phe 145 150 155 160
Leu Leu Lys Asp Ala Gln Ile Tyr Gly Glu Glu Trp Gly Tyr Glu Lys 165
170 175 Glu Asp Ile Ala Glu Phe Tyr Lys Arg Gln Leu Lys Leu Thr Gln
Glu 180 185 190 Tyr Thr Asp His Cys Val Lys Trp Tyr Asn Val Gly Leu
Asp Lys Leu 195 200 205 Arg Gly Ser Ser Tyr Glu Ser Trp Val Asn Phe
Asn Arg Tyr Arg Arg 210 215 220 Glu Met Thr Leu Thr Val Leu Asp Leu
Ile Ala Leu Phe Pro Leu Tyr 225 230 235 240 Asp Val Arg Leu Tyr Pro
Lys Glu Val Lys Thr Glu Leu Thr Arg Asp 245 250 255 Val Leu Thr Asp
Pro Ile Val Gly Val Asn Asn Leu Arg Gly Tyr Gly 260 265 270 Thr Thr
Phe Ser Asn Ile Glu Asn Tyr Ile Arg Lys Pro His Leu Phe 275 280 285
Asp Tyr Leu His Arg Ile Gln Phe His Thr Arg Phe Gln Pro Gly Tyr 290
295 300 Tyr Gly Asn Asp Ser
Phe Asn Tyr Trp Ser Gly Asn Tyr Val Ser Thr 305 310 315 320 Arg Pro
Ser Ile Gly Ser Asn Asp Ile Ile Thr Ser Pro Phe Tyr Gly 325 330 335
Asn Lys Ser Ser Glu Pro Val Gln Asn Leu Glu Phe Asn Gly Glu Lys 340
345 350 Val Tyr Arg Ala Val Ala Asn Thr Asn Leu Ala Val Trp Pro Ser
Ala 355 360 365 Val Tyr Ser Gly Val Thr Lys Val Glu Phe Ser Gln Tyr
Asn Asp Gln 370 375 380 Thr Asp Glu Ala Ser Thr Gln Thr Tyr Asp Ser
Lys Arg Asn Val Gly 385 390 395 400 Ala Val Ser Trp Asp Ser Ile Asp
Gln Leu Pro Pro Glu Thr Thr Asp 405 410 415 Glu Pro Leu Glu Lys Gly
Tyr Ser His Gln Leu Asn Tyr Val Met Cys 420 425 430 Phe Leu Met Gln
Gly Ser Arg Gly Thr Ile Pro Val Leu Thr Trp Thr 435 440 445 His Lys
Ser Val Asp Phe Phe Asn Met Ile Asp Ser Lys Lys Ile Thr 450 455 460
Gln Leu Pro Leu Val Lys Ala Tyr Lys Leu Gln Ser Gly Ala Ser Val 465
470 475 480 Val Ala Gly Pro Arg Phe Thr Gly Gly Asp Ile Ile Gln Cys
Thr Glu 485 490 495 Asn Gly Ser Ala Ala Thr Ile Tyr Val Thr Pro Asp
Val Ser Tyr Ser 500 505 510 Gln Lys Tyr Arg Ala Arg Ile His Tyr Ala
Ser Thr Thr Asn Leu Gln 515 520 525 Phe His Thr Ser Ile Asp Gly Arg
Pro Ile Asn Gln Gly Asn Phe Ser 530 535 540 Ala Thr Met Ser Ser Gly
Ser Asn Leu Gln Ser Gly Ser Phe Arg Thr 545 550 555 560 Val Gly Phe
Thr Thr Pro Phe Asn Phe Ser Asn Gly Ser Ser Val Phe 565 570 575 Thr
Leu Ser Ala His Val Phe Asn Ser Gly Asn Glu Val Tyr Ile Asp 580 585
590 Arg Ile Glu Phe Val Pro Ala Glu Val Thr Phe Glu Ala Glu Tyr Asp
595 600 605 Leu Glu Arg Ala Gln Lys Ala Val Asn Glu Leu Phe Thr Ser
Ser Asn 610 615 620 Gln Ile Gly Leu Lys Thr Asp Val Thr Asp Tyr His
Ile Asp Gln Val 625 630 635 640 511962DNAArtificial
Sequence5*V5&6 coding sequence 51atgactagta acggccgcca
gtgtgctggt attcgccctt atgacggccg acaacaacac 60cgaggcctgg acagcagcac
caccaaggac gtgatccaga agggcatcag cgtggtgggc 120gacctgctgg
gcgtggtggg cttccccttc ggcggcgccc tggtgagctt ctacaccaac
180ttcctgaaca ccatctggcc cagcgaggac ccctggaagg ccttcatgga
gcaggtggag 240gccctgatgg accagaagat cgccgactac gccaagaaca
aggcactggc cgagctacag 300ggcctccaga acaacgtgga ggactatgtg
agcgccctga gcagctggca gaagaacccc 360gctgcaccgt tccgcaaccc
ccacagccag ggccgcatcc gcgagctgtt cagccaggcc 420gagagccact
tccgcaacag catgcccagc ttcgccatca gcggctacga ggtgctgttc
480ctgaccacct acgcccaggc cgccaacacc cacctgttcc tgctgaagga
cgcccaaatc 540tacggagagg agtggggcta cgagaaggag gacatcgccg
agttctacaa gcgccagctg 600aagctgaccc aggagtacac cgaccactgc
gtgaagtggt acaacgtggg tctagacaag 660ctccgcggca gcagctacga
gagctgggtg aacttcaacc gctaccgccg cgagatgacc 720ctgaccgtgc
tggacctgat cgccctgttc cccctgtacg acgtgcgcct gtaccccaag
780gaggtgaaga ccgagctgac ccgcgacgtg ctgaccgacc ccatcgtggg
cgtgaacaac 840ctgcgcggct acggcaccac cttcagcaac atcgagaact
acatccgcaa gccccacctg 900ttcgactacc tgcaccgcat ccagttccac
acgcgtttcc agcccggcta ctacggcaac 960gacagcttca actactggag
cggcaactac gtgagcaccc gccccagcat cggcagcaac 1020gacatcatca
ccagcccctt ctacggcaac aagagcagcg agcccgtgca gaaccttgag
1080ttcaacggcg agaaggtgta ccgcgccgtg gctaacacca acctggccgt
gtggccctct 1140gcagtgtaca gcggcgtgac caaggtggag ttcagccagt
acaacgacca gaccgacgag 1200gccagcaccc agacctacga cagcaagcgc
aacgtgggcg ccgtgagctg ggacagcatc 1260gaccagctgc cccccgagac
caccgacgag cccctggaga agggctacag ccaccagctg 1320aactacgtga
tgtgcttcct gatgcagggc agccgcggca ccatccccgt gctgacctgg
1380acccacaaga gcgtcgactt cttcaacatg atcgacagca agaagatcac
ccagctgccc 1440ctggtgaagg cctacaagct ccagagcggc gccagcgtgg
tggcaggccc ccgcttcacc 1500ggcggcgaca tcatccagtg caccgagaac
ggcagcgccg ccaccatcta cgtgaccccc 1560gacgtgagct acagccagaa
gtaccgcgcc cgcatccact acgccagcac caccaacctg 1620cagttccaca
ccagcatcga cggccgcccc atcaaccagg gcaacttcag cgccaccatg
1680agcagcggca gcaacctgca gagcggcagc ttccgcaccg tgggcttcac
cacccccttc 1740aacttcagca acggcagcag cgtgttcacc ctgagcgccc
acgtgttcaa cagcggcaac 1800gaggtgtaca tcgaccgcat cgagttcgtg
cccgccgagg tgaccttcga ggccgagtac 1860gacctggaga gggctcagaa
ggccgtgaac gagctgttca ccagcagcaa ccagatcggc 1920ctgaagaccg
acgtgaccga ctaccacatc gatcaggtgt ag 196252653PRTArtificial
Sequence5*V5&6 protein 52Met Thr Ser Asn Gly Arg Gln Cys Ala
Gly Ile Arg Pro Tyr Asp Gly 1 5 10 15 Arg Gln Gln His Arg Gly Leu
Asp Ser Ser Thr Thr Lys Asp Val Ile 20 25 30 Gln Lys Gly Ile Ser
Val Val Gly Asp Leu Leu Gly Val Val Gly Phe 35 40 45 Pro Phe Gly
Gly Ala Leu Val Ser Phe Tyr Thr Asn Phe Leu Asn Thr 50 55 60 Ile
Trp Pro Ser Glu Asp Pro Trp Lys Ala Phe Met Glu Gln Val Glu 65 70
75 80 Ala Leu Met Asp Gln Lys Ile Ala Asp Tyr Ala Lys Asn Lys Ala
Leu 85 90 95 Ala Glu Leu Gln Gly Leu Gln Asn Asn Val Glu Asp Tyr
Val Ser Ala 100 105 110 Leu Ser Ser Trp Gln Lys Asn Pro Ala Ala Pro
Phe Arg Asn Pro His 115 120 125 Ser Gln Gly Arg Ile Arg Glu Leu Phe
Ser Gln Ala Glu Ser His Phe 130 135 140 Arg Asn Ser Met Pro Ser Phe
Ala Ile Ser Gly Tyr Glu Val Leu Phe 145 150 155 160 Leu Thr Thr Tyr
Ala Gln Ala Ala Asn Thr His Leu Phe Leu Leu Lys 165 170 175 Asp Ala
Gln Ile Tyr Gly Glu Glu Trp Gly Tyr Glu Lys Glu Asp Ile 180 185 190
Ala Glu Phe Tyr Lys Arg Gln Leu Lys Leu Thr Gln Glu Tyr Thr Asp 195
200 205 His Cys Val Lys Trp Tyr Asn Val Gly Leu Asp Lys Leu Arg Gly
Ser 210 215 220 Ser Tyr Glu Ser Trp Val Asn Phe Asn Arg Tyr Arg Arg
Glu Met Thr 225 230 235 240 Leu Thr Val Leu Asp Leu Ile Ala Leu Phe
Pro Leu Tyr Asp Val Arg 245 250 255 Leu Tyr Pro Lys Glu Val Lys Thr
Glu Leu Thr Arg Asp Val Leu Thr 260 265 270 Asp Pro Ile Val Gly Val
Asn Asn Leu Arg Gly Tyr Gly Thr Thr Phe 275 280 285 Ser Asn Ile Glu
Asn Tyr Ile Arg Lys Pro His Leu Phe Asp Tyr Leu 290 295 300 His Arg
Ile Gln Phe His Thr Arg Phe Gln Pro Gly Tyr Tyr Gly Asn 305 310 315
320 Asp Ser Phe Asn Tyr Trp Ser Gly Asn Tyr Val Ser Thr Arg Pro Ser
325 330 335 Ile Gly Ser Asn Asp Ile Ile Thr Ser Pro Phe Tyr Gly Asn
Lys Ser 340 345 350 Ser Glu Pro Val Gln Asn Leu Glu Phe Asn Gly Glu
Lys Val Tyr Arg 355 360 365 Ala Val Ala Asn Thr Asn Leu Ala Val Trp
Pro Ser Ala Val Tyr Ser 370 375 380 Gly Val Thr Lys Val Glu Phe Ser
Gln Tyr Asn Asp Gln Thr Asp Glu 385 390 395 400 Ala Ser Thr Gln Thr
Tyr Asp Ser Lys Arg Asn Val Gly Ala Val Ser 405 410 415 Trp Asp Ser
Ile Asp Gln Leu Pro Pro Glu Thr Thr Asp Glu Pro Leu 420 425 430 Glu
Lys Gly Tyr Ser His Gln Leu Asn Tyr Val Met Cys Phe Leu Met 435 440
445 Gln Gly Ser Arg Gly Thr Ile Pro Val Leu Thr Trp Thr His Lys Ser
450 455 460 Val Asp Phe Phe Asn Met Ile Asp Ser Lys Lys Ile Thr Gln
Leu Pro 465 470 475 480 Leu Val Lys Ala Tyr Lys Leu Gln Ser Gly Ala
Ser Val Val Ala Gly 485 490 495 Pro Arg Phe Thr Gly Gly Asp Ile Ile
Gln Cys Thr Glu Asn Gly Ser 500 505 510 Ala Ala Thr Ile Tyr Val Thr
Pro Asp Val Ser Tyr Ser Gln Lys Tyr 515 520 525 Arg Ala Arg Ile His
Tyr Ala Ser Thr Thr Asn Leu Gln Phe His Thr 530 535 540 Ser Ile Asp
Gly Arg Pro Ile Asn Gln Gly Asn Phe Ser Ala Thr Met 545 550 555 560
Ser Ser Gly Ser Asn Leu Gln Ser Gly Ser Phe Arg Thr Val Gly Phe 565
570 575 Thr Thr Pro Phe Asn Phe Ser Asn Gly Ser Ser Val Phe Thr Leu
Ser 580 585 590 Ala His Val Phe Asn Ser Gly Asn Glu Val Tyr Ile Asp
Arg Ile Glu 595 600 605 Phe Val Pro Ala Glu Val Thr Phe Glu Ala Glu
Tyr Asp Leu Glu Arg 610 615 620 Ala Gln Lys Ala Val Asn Glu Leu Phe
Thr Ser Ser Asn Gln Ile Gly 625 630 635 640 Leu Lys Thr Asp Val Thr
Asp Tyr His Ile Asp Gln Val 645 650 531845DNAArtificial
Sequence88A-dm3 coding sequence 53atgactagta acggccgcca gtgtgctggt
attcgccctt atgacggccg acaacaacac 60cgaggcctgg acagcagcac caccaaggac
gtgatccaga agggcatcag cgtggtgggc 120gacctgctgg gcgtggtggg
cttccccttc ggcggcgccc tggtgagctt ctacaccaac 180ttcctgaaca
ccatctggcc cagcgaggac ccctggaagg ccttcatgga gcaggtggag
240gccctgatgg accagaagat cgccgactac gccaagaaca aggcactggc
cgagctacag 300ggcctccaga acaacgtgga ggactatgtg agcgccctga
gcagctggca gaagaacccc 360gctgcaccgt tccgcaaccc ccacagccag
ggccgcatcc gcgagctgtt cagccaggcc 420gagagccact tccgcaacag
catgcccagc ttcgccatca gcggctacga ggtgctgttc 480ctgaccacct
acgcccaggc cgccaacacc cacctgttcc tgctgaagga cgcccaaatc
540tacggagagg agtggggcta cgagaaggag gacatcgccg agttctacaa
gcgccagctg 600aagctgaccc aggagtacac cgaccactgc gtgaagtggt
acaacgtggg tctagacaag 660ctccgcggca gcagctacga gagctgggtg
aacttcaacc gctaccgccg cgagatgacc 720ctgaccgtgc tggacctgat
cgccctgttc cccctgtacg acgtgcgcct gtaccccaag 780gaggtgaaga
ccgagctgac ccgcgacgtg ctgaccgacc ccatcgtggg cgtgaacaac
840ctgcgcggct acggcaccac cttcagcaac atcgagaact acatccgcaa
gccccacctg 900ttcgactacc tgcaccgcat ccagttccac acgcgtttcc
agcccggcta ctacggcaac 960gacagcttca actactggag cggcaactac
gtgagcaccc gccccagcat cggcagcaac 1020gacatcatca ccagcccctt
ctacggcaac aagagcagcg agcccgtgca gaaccttgag 1080ttcaacggcg
agaaggtgta ccgcgccgtg gctaacacca acctggccgt gtggccctct
1140gcagtgtaca gcggcgtgac caaggtggag ttcagccagt acaacgacca
gaccgacgag 1200gccagcaccc agacctacga cagcaagcgc aacgtgggcg
ccgtgagctg ggacagcatc 1260gaccagctgc cccccgagac caccgacgag
cccctggaga agggctacag ccaccagctg 1320aactacgtga tgtgcttcct
gatgcagggc agccgcggca ccatccccgt gctgacctgg 1380acccacaaga
gcgtcgactt cttcaacatg atcgacagca agaagatcac ccagctgccc
1440ctggtaaagg gagacatgtt atatctaggg ggttccgtag tacagggtcc
tggatttaca 1500ggaggagata tattaaaaag aaccaatcct agcatattag
ggacctttgc ggttacagta 1560aatgggtcgt tatcacaaag atatcgtgta
agaattcgct atgcctctac aacagatttt 1620gaatttactc tataccttgg
cgacacaata gaaaaaaata gatttaacaa aactatggat 1680aatggggcat
ctttaacgta tgaaacattt aaattcgcaa gtttcattac tgatttccaa
1740ttcagagaaa cacaagataa aatactccta tccatgggtg attttagctc
cggtcaagaa 1800gtttatatag accgaatcga attcatccca gtagatgaga catag
184554614PRTArtificial Sequence88A-dm3 protein 54Met Thr Ser Asn
Gly Arg Gln Cys Ala Gly Ile Arg Pro Tyr Asp Gly 1 5 10 15 Arg Gln
Gln His Arg Gly Leu Asp Ser Ser Thr Thr Lys Asp Val Ile 20 25 30
Gln Lys Gly Ile Ser Val Val Gly Asp Leu Leu Gly Val Val Gly Phe 35
40 45 Pro Phe Gly Gly Ala Leu Val Ser Phe Tyr Thr Asn Phe Leu Asn
Thr 50 55 60 Ile Trp Pro Ser Glu Asp Pro Trp Lys Ala Phe Met Glu
Gln Val Glu 65 70 75 80 Ala Leu Met Asp Gln Lys Ile Ala Asp Tyr Ala
Lys Asn Lys Ala Leu 85 90 95 Ala Glu Leu Gln Gly Leu Gln Asn Asn
Val Glu Asp Tyr Val Ser Ala 100 105 110 Leu Ser Ser Trp Gln Lys Asn
Pro Ala Ala Pro Phe Arg Asn Pro His 115 120 125 Ser Gln Gly Arg Ile
Arg Glu Leu Phe Ser Gln Ala Glu Ser His Phe 130 135 140 Arg Asn Ser
Met Pro Ser Phe Ala Ile Ser Gly Tyr Glu Val Leu Phe 145 150 155 160
Leu Thr Thr Tyr Ala Gln Ala Ala Asn Thr His Leu Phe Leu Leu Lys 165
170 175 Asp Ala Gln Ile Tyr Gly Glu Glu Trp Gly Tyr Glu Lys Glu Asp
Ile 180 185 190 Ala Glu Phe Tyr Lys Arg Gln Leu Lys Leu Thr Gln Glu
Tyr Thr Asp 195 200 205 His Cys Val Lys Trp Tyr Asn Val Gly Leu Asp
Lys Leu Arg Gly Ser 210 215 220 Ser Tyr Glu Ser Trp Val Asn Phe Asn
Arg Tyr Arg Arg Glu Met Thr 225 230 235 240 Leu Thr Val Leu Asp Leu
Ile Ala Leu Phe Pro Leu Tyr Asp Val Arg 245 250 255 Leu Tyr Pro Lys
Glu Val Lys Thr Glu Leu Thr Arg Asp Val Leu Thr 260 265 270 Asp Pro
Ile Val Gly Val Asn Asn Leu Arg Gly Tyr Gly Thr Thr Phe 275 280 285
Ser Asn Ile Glu Asn Tyr Ile Arg Lys Pro His Leu Phe Asp Tyr Leu 290
295 300 His Arg Ile Gln Phe His Thr Arg Phe Gln Pro Gly Tyr Tyr Gly
Asn 305 310 315 320 Asp Ser Phe Asn Tyr Trp Ser Gly Asn Tyr Val Ser
Thr Arg Pro Ser 325 330 335 Ile Gly Ser Asn Asp Ile Ile Thr Ser Pro
Phe Tyr Gly Asn Lys Ser 340 345 350 Ser Glu Pro Val Gln Asn Leu Glu
Phe Asn Gly Glu Lys Val Tyr Arg 355 360 365 Ala Val Ala Asn Thr Asn
Leu Ala Val Trp Pro Ser Ala Val Tyr Ser 370 375 380 Gly Val Thr Lys
Val Glu Phe Ser Gln Tyr Asn Asp Gln Thr Asp Glu 385 390 395 400 Ala
Ser Thr Gln Thr Tyr Asp Ser Lys Arg Asn Val Gly Ala Val Ser 405 410
415 Trp Asp Ser Ile Asp Gln Leu Pro Pro Glu Thr Thr Asp Glu Pro Leu
420 425 430 Glu Lys Gly Tyr Ser His Gln Leu Asn Tyr Val Met Cys Phe
Leu Met 435 440 445 Gln Gly Ser Arg Gly Thr Ile Pro Val Leu Thr Trp
Thr His Lys Ser 450 455 460 Val Asp Phe Phe Asn Met Ile Asp Ser Lys
Lys Ile Thr Gln Leu Pro 465 470 475 480 Leu Val Lys Gly Asp Met Leu
Tyr Leu Gly Gly Ser Val Val Gln Gly 485 490 495 Pro Gly Phe Thr Gly
Gly Asp Ile Leu Lys Arg Thr Asn Pro Ser Ile 500 505 510 Leu Gly Thr
Phe Ala Val Thr Val Asn Gly Ser Leu Ser Gln Arg Tyr 515 520 525 Arg
Val Arg Ile Arg Tyr Ala Ser Thr Thr Asp Phe Glu Phe Thr Leu 530 535
540 Tyr Leu Gly Asp Thr Ile Glu Lys Asn Arg Phe Asn Lys Thr Met Asp
545 550 555 560 Asn Gly Ala Ser Leu Thr Tyr Glu Thr Phe Lys Phe Ala
Ser Phe Ile 565 570 575 Thr Asp Phe Gln Phe Arg Glu Thr Gln Asp Lys
Ile Leu Leu Ser Met 580 585 590 Gly Asp Phe Ser Ser Gly Gln Glu Val
Tyr Ile Asp Arg Ile Glu Phe 595 600 605 Ile Pro Val Asp Glu Thr 610
551986DNAArtificial SequenceFR(1Fa) coding sequence 55atgactagta
acggccgcca gtgtgctggt attcgccctt atgacggccg acaacaacac 60cgaggcctgg
acagcagcac caccaaggac gtgatccaga agggcatcag cgtggtgggc
120gacctgctgg gcgtggtggg cttccccttc ggcggcgccc tggtgagctt
ctacaccaac 180ttcctgaaca ccatctggcc cagcgaggac ccctggaagg
ccttcatgga gcaggtggag 240gccctgatgg accagaagat cgccgactac
gccaagaaca aggcactggc cgagctacag 300ggcctccaga acaacgtgga
ggactatgtg agcgccctga gcagctggca gaagaacccc 360gctgcaccgt
tccgcaaccc ccacagccag ggccgcatcc gcgagctgtt cagccaggcc
420gagagccact tccgcaacag catgcccagc ttcgccatca gcggctacga
ggtgctgttc 480ctgaccacct acgcccaggc cgccaacacc cacctgttcc
tgctgaagga
cgcccaaatc 540tacggagagg agtggggcta cgagaaggag gacatcgccg
agttctacaa gcgccagctg 600aagctgaccc aggagtacac cgaccactgc
gtgaagtggt acaacgtggg tctagacaag 660ctccgcggca gcagctacga
gagctgggtg aacttcaacc gctaccgccg cgagatgacc 720ctgaccgtgc
tggacctgat cgccctgttc cccctgtacg acgtgcgcct gtaccccaag
780gaggtgaaga ccgagctgac ccgcgacgtg ctgaccgacc ccatcgtggg
cgtgaacaac 840ctgcgcggct acggcaccac cttcagcaac atcgagaact
acatccgcaa gccccacctg 900ttcgactacc tgcaccgcat ccagttccac
acgcgtttcc agcccggcta ctacggcaac 960gacagcttca actactggag
cggcaactac gtgagcaccc gccccagcat cggcagcaac 1020gacatcatca
ccagcccctt ctacggcaac aagagcagcg agcccgtgca gaaccttgag
1080ttcaacggcg agaaggtgta ccgcgccgtg gctaacacca acctggccgt
gtggccctct 1140gcagtgtaca gcggcgtgac caaggtggag ttcagccagt
acaacgacca gaccgacgag 1200gccagcaccc agacctacga cagcaagcgc
aacgtgggcg ccgtgagctg ggacagcatc 1260gaccagctgc cccccgagac
caccgacgag cccctggaga agggctacag ccaccagctg 1320aactacgtga
tgtgcttcct gatgcagggc agccgcggca ccatccccgt gctgacctgg
1380acccacaaga gcgtcgactt cttcaacatg atcgacagca agaagatcac
ccagctgccc 1440ctggtgaagg cccacaccct ccagtccggc accaccgtgg
tgcgcggccc gggcttcacc 1500ggcggcgaca tcctccgccg cacctccggc
ggcccgttcg cctacaccat cgtgaacatc 1560aacggccagc tcccgcagcg
ctaccgcgcc cgcatccgct acgcctccac caccaacctc 1620cgcatctacg
tgaccgtggc cggcgagcgc atcttcgccg gccagttcaa caagaccatg
1680gacaccggcg acccgctcac cttccagtcc ttctcctacg ccaccatcaa
caccgccttc 1740accttcccga tgtcccagtc ctccttcacc gtgggcgccg
acaccttctc ctccggcaac 1800gaggtgtaca tcgaccgctt cgagctgatc
ccggtgaccg ccaccttcga ggccgagtac 1860gacctggagc gcgcccagaa
ggccgtgaac gccctcttca cctccatcaa ccagatcggc 1920atcaagaccg
acgtgaccga ctaccacatc gaccaggtgt ccaacctcgt ggactgctta 1980agctag
198656661PRTArtificial SequenceFR(1Fa) protein 56Met Thr Ser Asn
Gly Arg Gln Cys Ala Gly Ile Arg Pro Tyr Asp Gly 1 5 10 15 Arg Gln
Gln His Arg Gly Leu Asp Ser Ser Thr Thr Lys Asp Val Ile 20 25 30
Gln Lys Gly Ile Ser Val Val Gly Asp Leu Leu Gly Val Val Gly Phe 35
40 45 Pro Phe Gly Gly Ala Leu Val Ser Phe Tyr Thr Asn Phe Leu Asn
Thr 50 55 60 Ile Trp Pro Ser Glu Asp Pro Trp Lys Ala Phe Met Glu
Gln Val Glu 65 70 75 80 Ala Leu Met Asp Gln Lys Ile Ala Asp Tyr Ala
Lys Asn Lys Ala Leu 85 90 95 Ala Glu Leu Gln Gly Leu Gln Asn Asn
Val Glu Asp Tyr Val Ser Ala 100 105 110 Leu Ser Ser Trp Gln Lys Asn
Pro Ala Ala Pro Phe Arg Asn Pro His 115 120 125 Ser Gln Gly Arg Ile
Arg Glu Leu Phe Ser Gln Ala Glu Ser His Phe 130 135 140 Arg Asn Ser
Met Pro Ser Phe Ala Ile Ser Gly Tyr Glu Val Leu Phe 145 150 155 160
Leu Thr Thr Tyr Ala Gln Ala Ala Asn Thr His Leu Phe Leu Leu Lys 165
170 175 Asp Ala Gln Ile Tyr Gly Glu Glu Trp Gly Tyr Glu Lys Glu Asp
Ile 180 185 190 Ala Glu Phe Tyr Lys Arg Gln Leu Lys Leu Thr Gln Glu
Tyr Thr Asp 195 200 205 His Cys Val Lys Trp Tyr Asn Val Gly Leu Asp
Lys Leu Arg Gly Ser 210 215 220 Ser Tyr Glu Ser Trp Val Asn Phe Asn
Arg Tyr Arg Arg Glu Met Thr 225 230 235 240 Leu Thr Val Leu Asp Leu
Ile Ala Leu Phe Pro Leu Tyr Asp Val Arg 245 250 255 Leu Tyr Pro Lys
Glu Val Lys Thr Glu Leu Thr Arg Asp Val Leu Thr 260 265 270 Asp Pro
Ile Val Gly Val Asn Asn Leu Arg Gly Tyr Gly Thr Thr Phe 275 280 285
Ser Asn Ile Glu Asn Tyr Ile Arg Lys Pro His Leu Phe Asp Tyr Leu 290
295 300 His Arg Ile Gln Phe His Thr Arg Phe Gln Pro Gly Tyr Tyr Gly
Asn 305 310 315 320 Asp Ser Phe Asn Tyr Trp Ser Gly Asn Tyr Val Ser
Thr Arg Pro Ser 325 330 335 Ile Gly Ser Asn Asp Ile Ile Thr Ser Pro
Phe Tyr Gly Asn Lys Ser 340 345 350 Ser Glu Pro Val Gln Asn Leu Glu
Phe Asn Gly Glu Lys Val Tyr Arg 355 360 365 Ala Val Ala Asn Thr Asn
Leu Ala Val Trp Pro Ser Ala Val Tyr Ser 370 375 380 Gly Val Thr Lys
Val Glu Phe Ser Gln Tyr Asn Asp Gln Thr Asp Glu 385 390 395 400 Ala
Ser Thr Gln Thr Tyr Asp Ser Lys Arg Asn Val Gly Ala Val Ser 405 410
415 Trp Asp Ser Ile Asp Gln Leu Pro Pro Glu Thr Thr Asp Glu Pro Leu
420 425 430 Glu Lys Gly Tyr Ser His Gln Leu Asn Tyr Val Met Cys Phe
Leu Met 435 440 445 Gln Gly Ser Arg Gly Thr Ile Pro Val Leu Thr Trp
Thr His Lys Ser 450 455 460 Val Asp Phe Phe Asn Met Ile Asp Ser Lys
Lys Ile Thr Gln Leu Pro 465 470 475 480 Leu Val Lys Ala His Thr Leu
Gln Ser Gly Thr Thr Val Val Arg Gly 485 490 495 Pro Gly Phe Thr Gly
Gly Asp Ile Leu Arg Arg Thr Ser Gly Gly Pro 500 505 510 Phe Ala Tyr
Thr Ile Val Asn Ile Asn Gly Gln Leu Pro Gln Arg Tyr 515 520 525 Arg
Ala Arg Ile Arg Tyr Ala Ser Thr Thr Asn Leu Arg Ile Tyr Val 530 535
540 Thr Val Ala Gly Glu Arg Ile Phe Ala Gly Gln Phe Asn Lys Thr Met
545 550 555 560 Asp Thr Gly Asp Pro Leu Thr Phe Gln Ser Phe Ser Tyr
Ala Thr Ile 565 570 575 Asn Thr Ala Phe Thr Phe Pro Met Ser Gln Ser
Ser Phe Thr Val Gly 580 585 590 Ala Asp Thr Phe Ser Ser Gly Asn Glu
Val Tyr Ile Asp Arg Phe Glu 595 600 605 Leu Ile Pro Val Thr Ala Thr
Phe Glu Ala Glu Tyr Asp Leu Glu Arg 610 615 620 Ala Gln Lys Ala Val
Asn Ala Leu Phe Thr Ser Ile Asn Gln Ile Gly 625 630 635 640 Ile Lys
Thr Asp Val Thr Asp Tyr His Ile Asp Gln Val Ser Asn Leu 645 650 655
Val Asp Cys Leu Ser 660 571842DNAArtificial SequenceFR(1Ac) coding
sequence 57atgactagta acggccgcca gtgtgctggt attcgccctt atgacggccg
acaacaacac 60cgaggcctgg acagcagcac caccaaggac gtgatccaga agggcatcag
cgtggtgggc 120gacctgctgg gcgtggtggg cttccccttc ggcggcgccc
tggtgagctt ctacaccaac 180ttcctgaaca ccatctggcc cagcgaggac
ccctggaagg ccttcatgga gcaggtggag 240gccctgatgg accagaagat
cgccgactac gccaagaaca aggcactggc cgagctacag 300ggcctccaga
acaacgtgga ggactatgtg agcgccctga gcagctggca gaagaacccc
360gctgcaccgt tccgcaaccc ccacagccag ggccgcatcc gcgagctgtt
cagccaggcc 420gagagccact tccgcaacag catgcccagc ttcgccatca
gcggctacga ggtgctgttc 480ctgaccacct acgcccaggc cgccaacacc
cacctgttcc tgctgaagga cgcccaaatc 540tacggagagg agtggggcta
cgagaaggag gacatcgccg agttctacaa gcgccagctg 600aagctgaccc
aggagtacac cgaccactgc gtgaagtggt acaacgtggg tctagacaag
660ctccgcggca gcagctacga gagctgggtg aacttcaacc gctaccgccg
cgagatgacc 720ctgaccgtgc tggacctgat cgccctgttc cccctgtacg
acgtgcgcct gtaccccaag 780gaggtgaaga ccgagctgac ccgcgacgtg
ctgaccgacc ccatcgtggg cgtgaacaac 840ctgcgcggct acggcaccac
cttcagcaac atcgagaact acatccgcaa gccccacctg 900ttcgactacc
tgcaccgcat ccagttccac acgcgtttcc agcccggcta ctacggcaac
960gacagcttca actactggag cggcaactac gtgagcaccc gccccagcat
cggcagcaac 1020gacatcatca ccagcccctt ctacggcaac aagagcagcg
agcccgtgca gaaccttgag 1080ttcaacggcg agaaggtgta ccgcgccgtg
gctaacacca acctggccgt gtggccctct 1140gcagtgtaca gcggcgtgac
caaggtggag ttcagccagt acaacgacca gaccgacgag 1200gccagcaccc
agacctacga cagcaagcgc aacgtgggcg ccgtgagctg ggacagcatc
1260gaccagctgc cccccgagac caccgacgag cccctggaga agggctacag
ccaccagctg 1320aactacgtga tgtgcttcct gatgcagggc agccgcggca
ccatccccgt gctgacctgg 1380acccacaaga gcgtcgactt cttcaacatg
atcgacagca agaagatcac ccagctgccc 1440ctggtgaagg gaaactttct
ttttaatggt tctgtaattt caggaccagg atttactggt 1500ggggacttag
ttagattaaa tagtagtgga aataacattc agaatagagg gtatattgaa
1560gttccaattc acttcccatc gacatctacc agatatcgag ttcgtgtacg
gtatgcttct 1620gtaaccccga ttcacctcaa cgttaattgg ggtaattcat
ccattttttc caatacagta 1680ccagctacag ctacgtcatt agataatcta
caatcaagtg attttggtta ttttgaaagt 1740gccaatgctt ttacatcttc
attaggtaat atagtaggtg ttagaaattt tagtgggact 1800gcaggagtga
taatagacag atttgaattt attccagttt ag 184258613PRTArtificial
SequenceFR(1Ac) protein 58Met Thr Ser Asn Gly Arg Gln Cys Ala Gly
Ile Arg Pro Tyr Asp Gly 1 5 10 15 Arg Gln Gln His Arg Gly Leu Asp
Ser Ser Thr Thr Lys Asp Val Ile 20 25 30 Gln Lys Gly Ile Ser Val
Val Gly Asp Leu Leu Gly Val Val Gly Phe 35 40 45 Pro Phe Gly Gly
Ala Leu Val Ser Phe Tyr Thr Asn Phe Leu Asn Thr 50 55 60 Ile Trp
Pro Ser Glu Asp Pro Trp Lys Ala Phe Met Glu Gln Val Glu 65 70 75 80
Ala Leu Met Asp Gln Lys Ile Ala Asp Tyr Ala Lys Asn Lys Ala Leu 85
90 95 Ala Glu Leu Gln Gly Leu Gln Asn Asn Val Glu Asp Tyr Val Ser
Ala 100 105 110 Leu Ser Ser Trp Gln Lys Asn Pro Ala Ala Pro Phe Arg
Asn Pro His 115 120 125 Ser Gln Gly Arg Ile Arg Glu Leu Phe Ser Gln
Ala Glu Ser His Phe 130 135 140 Arg Asn Ser Met Pro Ser Phe Ala Ile
Ser Gly Tyr Glu Val Leu Phe 145 150 155 160 Leu Thr Thr Tyr Ala Gln
Ala Ala Asn Thr His Leu Phe Leu Leu Lys 165 170 175 Asp Ala Gln Ile
Tyr Gly Glu Glu Trp Gly Tyr Glu Lys Glu Asp Ile 180 185 190 Ala Glu
Phe Tyr Lys Arg Gln Leu Lys Leu Thr Gln Glu Tyr Thr Asp 195 200 205
His Cys Val Lys Trp Tyr Asn Val Gly Leu Asp Lys Leu Arg Gly Ser 210
215 220 Ser Tyr Glu Ser Trp Val Asn Phe Asn Arg Tyr Arg Arg Glu Met
Thr 225 230 235 240 Leu Thr Val Leu Asp Leu Ile Ala Leu Phe Pro Leu
Tyr Asp Val Arg 245 250 255 Leu Tyr Pro Lys Glu Val Lys Thr Glu Leu
Thr Arg Asp Val Leu Thr 260 265 270 Asp Pro Ile Val Gly Val Asn Asn
Leu Arg Gly Tyr Gly Thr Thr Phe 275 280 285 Ser Asn Ile Glu Asn Tyr
Ile Arg Lys Pro His Leu Phe Asp Tyr Leu 290 295 300 His Arg Ile Gln
Phe His Thr Arg Phe Gln Pro Gly Tyr Tyr Gly Asn 305 310 315 320 Asp
Ser Phe Asn Tyr Trp Ser Gly Asn Tyr Val Ser Thr Arg Pro Ser 325 330
335 Ile Gly Ser Asn Asp Ile Ile Thr Ser Pro Phe Tyr Gly Asn Lys Ser
340 345 350 Ser Glu Pro Val Gln Asn Leu Glu Phe Asn Gly Glu Lys Val
Tyr Arg 355 360 365 Ala Val Ala Asn Thr Asn Leu Ala Val Trp Pro Ser
Ala Val Tyr Ser 370 375 380 Gly Val Thr Lys Val Glu Phe Ser Gln Tyr
Asn Asp Gln Thr Asp Glu 385 390 395 400 Ala Ser Thr Gln Thr Tyr Asp
Ser Lys Arg Asn Val Gly Ala Val Ser 405 410 415 Trp Asp Ser Ile Asp
Gln Leu Pro Pro Glu Thr Thr Asp Glu Pro Leu 420 425 430 Glu Lys Gly
Tyr Ser His Gln Leu Asn Tyr Val Met Cys Phe Leu Met 435 440 445 Gln
Gly Ser Arg Gly Thr Ile Pro Val Leu Thr Trp Thr His Lys Ser 450 455
460 Val Asp Phe Phe Asn Met Ile Asp Ser Lys Lys Ile Thr Gln Leu Pro
465 470 475 480 Leu Val Lys Gly Asn Phe Leu Phe Asn Gly Ser Val Ile
Ser Gly Pro 485 490 495 Gly Phe Thr Gly Gly Asp Leu Val Arg Leu Asn
Ser Ser Gly Asn Asn 500 505 510 Ile Gln Asn Arg Gly Tyr Ile Glu Val
Pro Ile His Phe Pro Ser Thr 515 520 525 Ser Thr Arg Tyr Arg Val Arg
Val Arg Tyr Ala Ser Val Thr Pro Ile 530 535 540 His Leu Asn Val Asn
Trp Gly Asn Ser Ser Ile Phe Ser Asn Thr Val 545 550 555 560 Pro Ala
Thr Ala Thr Ser Leu Asp Asn Leu Gln Ser Ser Asp Phe Gly 565 570 575
Tyr Phe Glu Ser Ala Asn Ala Phe Thr Ser Ser Leu Gly Asn Ile Val 580
585 590 Gly Val Arg Asn Phe Ser Gly Thr Ala Gly Val Ile Ile Asp Arg
Phe 595 600 605 Glu Phe Ile Pro Val 610 592067DNAArtificial
SequenceFR(1Ia) coding sequence 59atgactagta acggccgcca gtgtgctggt
attcgccctt atgacggccg acaacaacac 60cgaggcctgg acagcagcac caccaaggac
gtgatccaga agggcatcag cgtggtgggc 120gacctgctgg gcgtggtggg
cttccccttc ggcggcgccc tggtgagctt ctacaccaac 180ttcctgaaca
ccatctggcc cagcgaggac ccctggaagg ccttcatgga gcaggtggag
240gccctgatgg accagaagat cgccgactac gccaagaaca aggcactggc
cgagctacag 300ggcctccaga acaacgtgga ggactatgtg agcgccctga
gcagctggca gaagaacccc 360gctgcaccgt tccgcaaccc ccacagccag
ggccgcatcc gcgagctgtt cagccaggcc 420gagagccact tccgcaacag
catgcccagc ttcgccatca gcggctacga ggtgctgttc 480ctgaccacct
acgcccaggc cgccaacacc cacctgttcc tgctgaagga cgcccaaatc
540tacggagagg agtggggcta cgagaaggag gacatcgccg agttctacaa
gcgccagctg 600aagctgaccc aggagtacac cgaccactgc gtgaagtggt
acaacgtggg tctagacaag 660ctccgcggca gcagctacga gagctgggtg
aacttcaacc gctaccgccg cgagatgacc 720ctgaccgtgc tggacctgat
cgccctgttc cccctgtacg acgtgcgcct gtaccccaag 780gaggtgaaga
ccgagctgac ccgcgacgtg ctgaccgacc ccatcgtggg cgtgaacaac
840ctgcgcggct acggcaccac cttcagcaac atcgagaact acatccgcaa
gccccacctg 900ttcgactacc tgcaccgcat ccagttccac acgcgtttcc
agcccggcta ctacggcaac 960gacagcttca actactggag cggcaactac
gtgagcaccc gccccagcat cggcagcaac 1020gacatcatca ccagcccctt
ctacggcaac aagagcagcg agcccgtgca gaaccttgag 1080ttcaacggcg
agaaggtgta ccgcgccgtg gctaacacca acctggccgt gtggccctct
1140gcagtgtaca gcggcgtgac caaggtggag ttcagccagt acaacgacca
gaccgacgag 1200gccagcaccc agacctacga cagcaagcgc aacgtgggcg
ccgtgagctg ggacagcatc 1260gaccagctgc cccccgagac caccgacgag
cccctggaga agggctacag ccaccagctg 1320aactacgtga tgtgcttcct
gatgcagggc agccgcggca ccatccccgt gctgacctgg 1380acccacaaga
gcgtcgactt cttcaacatg atcgacagca agaagatcac ccagctgccc
1440ctggtaaaag ctttcaatct gtcttcaggt gccgctgtag tgagaggacc
aggatttaca 1500ggtggggata tccttcgaag aacgaatact ggtacatttg
gggatatacg agtaaatatt 1560aatccaccat ttgcacaaag atatcgcgtg
aggattcgct atgcttctac cacagattta 1620caattccata cgtcaattaa
cggtaaagct attaatcaag gtaatttttc agcaactatg 1680aatagaggag
aggacttaga ctataaaacc tttagaactg taggctttac cactccattt
1740agctttttag atgtacaaag tacattcaca ataggtgctt ggaacttctc
ttcaggtaac 1800gaagtttata tagatagaat tgaatttgtt ccggtagaag
taacatatga ggcagaatat 1860gattttgaaa aagcgcaaga gaaggttact
gcactgttta catctacgaa tccaagagga 1920ttaaaaacag atgtaaagga
ttatcatatt gaccaggtat caaatttagt agagtctcta 1980tcagatgaat
tctatcttga tgaaaagaga gaattattcg agatagttaa atacgcgaag
2040caactccata ttgagcgtaa catgtag 206760688PRTArtificial
SequenceFR(1Ia) protein 60Met Thr Ser Asn Gly Arg Gln Cys Ala Gly
Ile Arg Pro Tyr Asp Gly 1 5 10 15 Arg Gln Gln His Arg Gly Leu Asp
Ser Ser Thr Thr Lys Asp Val Ile 20 25 30 Gln Lys Gly Ile Ser Val
Val Gly Asp Leu Leu Gly Val Val Gly Phe 35 40 45 Pro Phe Gly Gly
Ala Leu Val Ser Phe Tyr Thr Asn Phe Leu Asn Thr 50 55 60 Ile Trp
Pro Ser Glu Asp Pro Trp Lys Ala Phe Met Glu Gln Val Glu 65 70 75 80
Ala Leu Met Asp Gln Lys Ile Ala Asp Tyr Ala Lys Asn Lys Ala Leu 85
90 95 Ala Glu Leu Gln Gly Leu Gln Asn Asn Val Glu Asp Tyr Val Ser
Ala 100 105 110 Leu Ser Ser Trp Gln Lys Asn Pro Ala Ala Pro Phe Arg
Asn Pro His 115 120 125 Ser Gln Gly Arg Ile Arg Glu Leu Phe Ser Gln
Ala Glu Ser His Phe 130 135 140 Arg Asn Ser Met Pro Ser Phe Ala Ile
Ser Gly Tyr Glu Val Leu Phe 145 150 155
160 Leu Thr Thr Tyr Ala Gln Ala Ala Asn Thr His Leu Phe Leu Leu Lys
165 170 175 Asp Ala Gln Ile Tyr Gly Glu Glu Trp Gly Tyr Glu Lys Glu
Asp Ile 180 185 190 Ala Glu Phe Tyr Lys Arg Gln Leu Lys Leu Thr Gln
Glu Tyr Thr Asp 195 200 205 His Cys Val Lys Trp Tyr Asn Val Gly Leu
Asp Lys Leu Arg Gly Ser 210 215 220 Ser Tyr Glu Ser Trp Val Asn Phe
Asn Arg Tyr Arg Arg Glu Met Thr 225 230 235 240 Leu Thr Val Leu Asp
Leu Ile Ala Leu Phe Pro Leu Tyr Asp Val Arg 245 250 255 Leu Tyr Pro
Lys Glu Val Lys Thr Glu Leu Thr Arg Asp Val Leu Thr 260 265 270 Asp
Pro Ile Val Gly Val Asn Asn Leu Arg Gly Tyr Gly Thr Thr Phe 275 280
285 Ser Asn Ile Glu Asn Tyr Ile Arg Lys Pro His Leu Phe Asp Tyr Leu
290 295 300 His Arg Ile Gln Phe His Thr Arg Phe Gln Pro Gly Tyr Tyr
Gly Asn 305 310 315 320 Asp Ser Phe Asn Tyr Trp Ser Gly Asn Tyr Val
Ser Thr Arg Pro Ser 325 330 335 Ile Gly Ser Asn Asp Ile Ile Thr Ser
Pro Phe Tyr Gly Asn Lys Ser 340 345 350 Ser Glu Pro Val Gln Asn Leu
Glu Phe Asn Gly Glu Lys Val Tyr Arg 355 360 365 Ala Val Ala Asn Thr
Asn Leu Ala Val Trp Pro Ser Ala Val Tyr Ser 370 375 380 Gly Val Thr
Lys Val Glu Phe Ser Gln Tyr Asn Asp Gln Thr Asp Glu 385 390 395 400
Ala Ser Thr Gln Thr Tyr Asp Ser Lys Arg Asn Val Gly Ala Val Ser 405
410 415 Trp Asp Ser Ile Asp Gln Leu Pro Pro Glu Thr Thr Asp Glu Pro
Leu 420 425 430 Glu Lys Gly Tyr Ser His Gln Leu Asn Tyr Val Met Cys
Phe Leu Met 435 440 445 Gln Gly Ser Arg Gly Thr Ile Pro Val Leu Thr
Trp Thr His Lys Ser 450 455 460 Val Asp Phe Phe Asn Met Ile Asp Ser
Lys Lys Ile Thr Gln Leu Pro 465 470 475 480 Leu Val Lys Ala Phe Asn
Leu Ser Ser Gly Ala Ala Val Val Arg Gly 485 490 495 Pro Gly Phe Thr
Gly Gly Asp Ile Leu Arg Arg Thr Asn Thr Gly Thr 500 505 510 Phe Gly
Asp Ile Arg Val Asn Ile Asn Pro Pro Phe Ala Gln Arg Tyr 515 520 525
Arg Val Arg Ile Arg Tyr Ala Ser Thr Thr Asp Leu Gln Phe His Thr 530
535 540 Ser Ile Asn Gly Lys Ala Ile Asn Gln Gly Asn Phe Ser Ala Thr
Met 545 550 555 560 Asn Arg Gly Glu Asp Leu Asp Tyr Lys Thr Phe Arg
Thr Val Gly Phe 565 570 575 Thr Thr Pro Phe Ser Phe Leu Asp Val Gln
Ser Thr Phe Thr Ile Gly 580 585 590 Ala Trp Asn Phe Ser Ser Gly Asn
Glu Val Tyr Ile Asp Arg Ile Glu 595 600 605 Phe Val Pro Val Glu Val
Thr Tyr Glu Ala Glu Tyr Asp Phe Glu Lys 610 615 620 Ala Gln Glu Lys
Val Thr Ala Leu Phe Thr Ser Thr Asn Pro Arg Gly 625 630 635 640 Leu
Lys Thr Asp Val Lys Asp Tyr His Ile Asp Gln Val Ser Asn Leu 645 650
655 Val Glu Ser Leu Ser Asp Glu Phe Tyr Leu Asp Glu Lys Arg Glu Leu
660 665 670 Phe Glu Ile Val Lys Tyr Ala Lys Gln Leu His Ile Glu Arg
Asn Met 675 680 685 61 1962DNAArtificial SequenceDM23A coding
sequence 61atgactagta acggccgcca gtgtgctggt attcgccctt atgacggccg
acaacaacac 60cgaggcctgg acagcagcac caccaaggac gtgatccaga agggcatcag
cgtggtgggc 120gacctgctgg gcgtggtggg cttccccttc ggcggcgccc
tggtgagctt ctacaccaac 180ttcctgaaca ccatctggcc cagcgaggac
ccctggaagg ccttcatgga gcaggtggag 240gccctgatgg accagaagat
cgccgactac gccaagaaca aggcactggc cgagctacag 300ggcctccaga
acaacgtgga ggactatgtg agcgccctga gcagctggca gaagaacccc
360gctgcaccgt tccgcaaccc ccacagccag ggccgcatcc gcgagctgtt
cagccaggcc 420gagagccact tccgcaacag catgcccagc ttcgccatca
gcggctacga ggtgctgttc 480ctgaccacct acgcccaggc cgccaacacc
cacctgttcc tgctgaagga cgcccaaatc 540tacggagagg agtggggcta
cgagaaggag gacatcgccg agttctacaa gcgccagctg 600aagctgaccc
aggagtacac cgaccactgc gtgaagtggt acaacgtggg tctagacaag
660ctccgcggca gcagctacga gagctgggtg aacttcaacc gctaccgccg
cgagatgacc 720ctgaccgtgc tggacctgat cgccctgttc cccctgtacg
acgtgcgcct gtaccccaag 780gaggtgaaga ccgagctgac ccgcgacgtg
ctgaccgacc ccatcgtggg cgtgaacaac 840ctgcgcggct acggcaccac
cttcagcaac atcgagaact acatccgcaa gccccacctg 900ttcgactacc
tgcaccgcat ccagttccac acgcgtttcc agcccggcta ctacggcaac
960gacagcttca actactggag cggcaactac gtgagcaccc gccccagcat
cggcagcaac 1020gacatcatca ccagcccctt ctacggcaac aagagcagcg
agcccgtgca gaaccttgag 1080ttcaacggcg agaaggtgta ccgcgccgtg
gctaacacca acctggccgt gtggccctct 1140gcagtgtaca gcggcgtgac
caaggtggag ttcagccagt acaacgacca gaccgacgag 1200gccagcaccc
agacctacga cagcaagcgc aacgtgggcg ccgtgagctg ggacagcatc
1260gaccagctgc cccccgagac caccgacgag cccctggaga agggctacag
ccaccagctg 1320aactacgtga tgtgcttcct gatgcagggc agccgcggca
ccatccccgt gctgacctgg 1380acccacaaga gcgccgagtt caacaacatc
atccccagca gccagatcac ccagatcccc 1440ctgaccaaga gcaccaacct
gggcagcggc accagcgtgg tgaagggccc cggcttcacc 1500ggcggcgaca
tcctgcgccg caccagcccc ggccagatca gcaccctgcg cgtgaacatc
1560accgcccccc tgagccagcg ctaccgcgtc cgcatccgct acgccagcac
caccaacctg 1620cagttccaca ccagcatcga cggccgcccc atcaaccagg
gcaacttcag cgccaccatg 1680agcagcggca gcaacctgca gagcggcagc
ttccgcaccg tgggcttcac cacccccttc 1740aacttcagca acggcagcag
cgtgttcacc ctgagcgccc acgtgttcaa cagcggcaac 1800gaggtgtaca
tcgaccgcat cgagttcgtg cccgccgagg tgaccttcga ggccgagtac
1860gacctggaga gggctcagaa ggccgtgaac gagctgttca ccagcagcaa
ccagatcggc 1920ctgaagaccg acgtgaccga ctaccacatc gatcaggtgt ag
196262653PRTArtificial SequenceDM23A protein 62Met Thr Ser Asn Gly
Arg Gln Cys Ala Gly Ile Arg Pro Tyr Asp Gly 1 5 10 15 Arg Gln Gln
His Arg Gly Leu Asp Ser Ser Thr Thr Lys Asp Val Ile 20 25 30 Gln
Lys Gly Ile Ser Val Val Gly Asp Leu Leu Gly Val Val Gly Phe 35 40
45 Pro Phe Gly Gly Ala Leu Val Ser Phe Tyr Thr Asn Phe Leu Asn Thr
50 55 60 Ile Trp Pro Ser Glu Asp Pro Trp Lys Ala Phe Met Glu Gln
Val Glu 65 70 75 80 Ala Leu Met Asp Gln Lys Ile Ala Asp Tyr Ala Lys
Asn Lys Ala Leu 85 90 95 Ala Glu Leu Gln Gly Leu Gln Asn Asn Val
Glu Asp Tyr Val Ser Ala 100 105 110 Leu Ser Ser Trp Gln Lys Asn Pro
Ala Ala Pro Phe Arg Asn Pro His 115 120 125 Ser Gln Gly Arg Ile Arg
Glu Leu Phe Ser Gln Ala Glu Ser His Phe 130 135 140 Arg Asn Ser Met
Pro Ser Phe Ala Ile Ser Gly Tyr Glu Val Leu Phe 145 150 155 160 Leu
Thr Thr Tyr Ala Gln Ala Ala Asn Thr His Leu Phe Leu Leu Lys 165 170
175 Asp Ala Gln Ile Tyr Gly Glu Glu Trp Gly Tyr Glu Lys Glu Asp Ile
180 185 190 Ala Glu Phe Tyr Lys Arg Gln Leu Lys Leu Thr Gln Glu Tyr
Thr Asp 195 200 205 His Cys Val Lys Trp Tyr Asn Val Gly Leu Asp Lys
Leu Arg Gly Ser 210 215 220 Ser Tyr Glu Ser Trp Val Asn Phe Asn Arg
Tyr Arg Arg Glu Met Thr 225 230 235 240 Leu Thr Val Leu Asp Leu Ile
Ala Leu Phe Pro Leu Tyr Asp Val Arg 245 250 255 Leu Tyr Pro Lys Glu
Val Lys Thr Glu Leu Thr Arg Asp Val Leu Thr 260 265 270 Asp Pro Ile
Val Gly Val Asn Asn Leu Arg Gly Tyr Gly Thr Thr Phe 275 280 285 Ser
Asn Ile Glu Asn Tyr Ile Arg Lys Pro His Leu Phe Asp Tyr Leu 290 295
300 His Arg Ile Gln Phe His Thr Arg Phe Gln Pro Gly Tyr Tyr Gly Asn
305 310 315 320 Asp Ser Phe Asn Tyr Trp Ser Gly Asn Tyr Val Ser Thr
Arg Pro Ser 325 330 335 Ile Gly Ser Asn Asp Ile Ile Thr Ser Pro Phe
Tyr Gly Asn Lys Ser 340 345 350 Ser Glu Pro Val Gln Asn Leu Glu Phe
Asn Gly Glu Lys Val Tyr Arg 355 360 365 Ala Val Ala Asn Thr Asn Leu
Ala Val Trp Pro Ser Ala Val Tyr Ser 370 375 380 Gly Val Thr Lys Val
Glu Phe Ser Gln Tyr Asn Asp Gln Thr Asp Glu 385 390 395 400 Ala Ser
Thr Gln Thr Tyr Asp Ser Lys Arg Asn Val Gly Ala Val Ser 405 410 415
Trp Asp Ser Ile Asp Gln Leu Pro Pro Glu Thr Thr Asp Glu Pro Leu 420
425 430 Glu Lys Gly Tyr Ser His Gln Leu Asn Tyr Val Met Cys Phe Leu
Met 435 440 445 Gln Gly Ser Arg Gly Thr Ile Pro Val Leu Thr Trp Thr
His Lys Ser 450 455 460 Ala Glu Phe Asn Asn Ile Ile Pro Ser Ser Gln
Ile Thr Gln Ile Pro 465 470 475 480 Leu Thr Lys Ser Thr Asn Leu Gly
Ser Gly Thr Ser Val Val Lys Gly 485 490 495 Pro Gly Phe Thr Gly Gly
Asp Ile Leu Arg Arg Thr Ser Pro Gly Gln 500 505 510 Ile Ser Thr Leu
Arg Val Asn Ile Thr Ala Pro Leu Ser Gln Arg Tyr 515 520 525 Arg Val
Arg Ile Arg Tyr Ala Ser Thr Thr Asn Leu Gln Phe His Thr 530 535 540
Ser Ile Asp Gly Arg Pro Ile Asn Gln Gly Asn Phe Ser Ala Thr Met 545
550 555 560 Ser Ser Gly Ser Asn Leu Gln Ser Gly Ser Phe Arg Thr Val
Gly Phe 565 570 575 Thr Thr Pro Phe Asn Phe Ser Asn Gly Ser Ser Val
Phe Thr Leu Ser 580 585 590 Ala His Val Phe Asn Ser Gly Asn Glu Val
Tyr Ile Asp Arg Ile Glu 595 600 605 Phe Val Pro Ala Glu Val Thr Phe
Glu Ala Glu Tyr Asp Leu Glu Arg 610 615 620 Ala Gln Lys Ala Val Asn
Glu Leu Phe Thr Ser Ser Asn Gln Ile Gly 625 630 635 640 Leu Lys Thr
Asp Val Thr Asp Tyr His Ile Asp Gln Val 645 650 631923DNAArtificial
Sequence8AF coding sequence 63atgacggccg acaacaacac cgaggccctg
gacagcagca ccaccaagga cgtgatccag 60aagggcatca gcgtggtggg cgacctgctg
ggcgtggtgg gcttcccctt cggcggcgcc 120ctggtgagct tctacaccaa
cttcctgaac accatctggc ccagcgagga cccctggaag 180gccttcatgg
agcaggtgga ggccctgatg gaccagaaga tcgccgacta cgccaagaac
240aaggcactgg ccgagctaca gggcctccag aacaacgtgg aggactatgt
gagcgccctg 300agcagctggc agaagaaccc cgctgcaccg ttccgcaacc
cccacagcca gggccgcatc 360cgcgagctgt tcagccaggc cgagagccac
ttccgcaaca gcatgcccag cttcgccatc 420agcggctacg aggtgctgtt
cctgaccacc tacgcccagg ccgccaacac ccacctgttc 480ctgctgaagg
acgcccaaat ctacggagag gagtggggct acgagaagga ggacatcgcc
540gagttctaca agcgccagct gaagctgacc caggagtaca ccgaccactg
cgtgaagtgg 600tacaacgtgg gtctagacaa gctccgcggc agcagctacg
agagctgggt gaacttcaac 660cgctaccgcc gcgagatgac cctgaccgtg
ctggacctga tcgccctgtt ccccctgtac 720gacgtgcgcc tgtaccccaa
ggaggtgaag accgagctga cccgcgacgt gctgaccgac 780cccatcgtgg
gcgtgaacaa cctgcgcggc tacggcacca ccttcagcaa catcgagaac
840tacatccgca agccccacct gttcgactac ctgcaccgca tccagttcca
cacgcgtttc 900cagcccggct actacggcaa cgacagcttc aactactgga
gcggcaacta cgtgagcacc 960cgccccagca tcggcagcaa cgacatcatc
accagcccct tctacggcaa caagagcagc 1020gagcccgtgc agaaccttga
gttcaacggc gagaaggtgt accgcgccgt ggctaacacc 1080aacctggccg
tgtggccctc tgcagtgtac agcggcgtga ccaaggtgga gttcagccag
1140tacaacgacc agaccgacga ggccagcacc cagacctacg acagcaagcg
caacgtgggc 1200gccgtgagct gggacagcat cgaccagctg ccccccgaga
ccaccgacga gcccctggag 1260aagggctaca gccaccagct gaactacgtg
atgtgcttcc tgatgcaggg cagccgcggc 1320accatccccg tgctgacctg
gacccacaag agcgtcgact tcttcaacat gatcgacagc 1380aagaagatca
cccagctgcc cctgaccaag agcaccaacc tgggcagcgg caccagcgtg
1440gtgaagggcc ccggcttcac cggcggcgac atcctgcgcc gcaccagccc
cggccagatc 1500agcaccctgc gcgtgaacat caccgccccc ctgagccagc
gctaccgcgt ccgcatccgc 1560tacgccagca ccaccaacct gcagttccac
accagcatcg acggccgccc catcaaccag 1620ggcaacttca gcgccaccat
gagcagcggc agcaacctgc agagcggcag cttccgcacc 1680gtgggcttca
ccaccccctt caacttcagc aacggcagca gcgtgttcac cctgagcgcc
1740cacgtgttca acagcggcaa cgaggtgtac atcgaccgca tcgagttcgt
gcccgccgag 1800gtgaccttcg aggccgagta cgacctggag agggctcaga
aggccgtgaa cgagctgttc 1860accagcagca accagatcgg cctgaagacc
gacgtgaccg actaccacat cgatcaggtg 1920tag 192364640PRTArtificial
Sequence8AF protein 64Met Thr Ala Asp Asn Asn Thr Glu Ala Leu Asp
Ser Ser Thr Thr Lys 1 5 10 15 Asp Val Ile Gln Lys Gly Ile Ser Val
Val Gly Asp Leu Leu Gly Val 20 25 30 Val Gly Phe Pro Phe Gly Gly
Ala Leu Val Ser Phe Tyr Thr Asn Phe 35 40 45 Leu Asn Thr Ile Trp
Pro Ser Glu Asp Pro Trp Lys Ala Phe Met Glu 50 55 60 Gln Val Glu
Ala Leu Met Asp Gln Lys Ile Ala Asp Tyr Ala Lys Asn 65 70 75 80 Lys
Ala Leu Ala Glu Leu Gln Gly Leu Gln Asn Asn Val Glu Asp Tyr 85 90
95 Val Ser Ala Leu Ser Ser Trp Gln Lys Asn Pro Ala Ala Pro Phe Arg
100 105 110 Asn Pro His Ser Gln Gly Arg Ile Arg Glu Leu Phe Ser Gln
Ala Glu 115 120 125 Ser His Phe Arg Asn Ser Met Pro Ser Phe Ala Ile
Ser Gly Tyr Glu 130 135 140 Val Leu Phe Leu Thr Thr Tyr Ala Gln Ala
Ala Asn Thr His Leu Phe 145 150 155 160 Leu Leu Lys Asp Ala Gln Ile
Tyr Gly Glu Glu Trp Gly Tyr Glu Lys 165 170 175 Glu Asp Ile Ala Glu
Phe Tyr Lys Arg Gln Leu Lys Leu Thr Gln Glu 180 185 190 Tyr Thr Asp
His Cys Val Lys Trp Tyr Asn Val Gly Leu Asp Lys Leu 195 200 205 Arg
Gly Ser Ser Tyr Glu Ser Trp Val Asn Phe Asn Arg Tyr Arg Arg 210 215
220 Glu Met Thr Leu Thr Val Leu Asp Leu Ile Ala Leu Phe Pro Leu Tyr
225 230 235 240 Asp Val Arg Leu Tyr Pro Lys Glu Val Lys Thr Glu Leu
Thr Arg Asp 245 250 255 Val Leu Thr Asp Pro Ile Val Gly Val Asn Asn
Leu Arg Gly Tyr Gly 260 265 270 Thr Thr Phe Ser Asn Ile Glu Asn Tyr
Ile Arg Lys Pro His Leu Phe 275 280 285 Asp Tyr Leu His Arg Ile Gln
Phe His Thr Arg Phe Gln Pro Gly Tyr 290 295 300 Tyr Gly Asn Asp Ser
Phe Asn Tyr Trp Ser Gly Asn Tyr Val Ser Thr 305 310 315 320 Arg Pro
Ser Ile Gly Ser Asn Asp Ile Ile Thr Ser Pro Phe Tyr Gly 325 330 335
Asn Lys Ser Ser Glu Pro Val Gln Asn Leu Glu Phe Asn Gly Glu Lys 340
345 350 Val Tyr Arg Ala Val Ala Asn Thr Asn Leu Ala Val Trp Pro Ser
Ala 355 360 365 Val Tyr Ser Gly Val Thr Lys Val Glu Phe Ser Gln Tyr
Asn Asp Gln 370 375 380 Thr Asp Glu Ala Ser Thr Gln Thr Tyr Asp Ser
Lys Arg Asn Val Gly 385 390 395 400 Ala Val Ser Trp Asp Ser Ile Asp
Gln Leu Pro Pro Glu Thr Thr Asp 405 410 415 Glu Pro Leu Glu Lys Gly
Tyr Ser His Gln Leu Asn Tyr Val Met Cys 420 425 430 Phe Leu Met Gln
Gly Ser Arg Gly Thr Ile Pro Val Leu Thr Trp Thr 435 440 445 His Lys
Ser Val Asp Phe Phe Asn Met Ile Asp Ser Lys Lys Ile Thr 450 455 460
Gln Leu Pro Leu Thr Lys Ser Thr Asn Leu Gly Ser Gly Thr Ser Val 465
470 475 480 Val Lys Gly Pro Gly Phe Thr Gly Gly Asp Ile Leu Arg Arg
Thr Ser 485
490 495 Pro Gly Gln Ile Ser Thr Leu Arg Val Asn Ile Thr Ala Pro Leu
Ser 500 505 510 Gln Arg Tyr Arg Val Arg Ile Arg Tyr Ala Ser Thr Thr
Asn Leu Gln 515 520 525 Phe His Thr Ser Ile Asp Gly Arg Pro Ile Asn
Gln Gly Asn Phe Ser 530 535 540 Ala Thr Met Ser Ser Gly Ser Asn Leu
Gln Ser Gly Ser Phe Arg Thr 545 550 555 560 Val Gly Phe Thr Thr Pro
Phe Asn Phe Ser Asn Gly Ser Ser Val Phe 565 570 575 Thr Leu Ser Ala
His Val Phe Asn Ser Gly Asn Glu Val Tyr Ile Asp 580 585 590 Arg Ile
Glu Phe Val Pro Ala Glu Val Thr Phe Glu Ala Glu Tyr Asp 595 600 605
Leu Glu Arg Ala Gln Lys Ala Val Asn Glu Leu Phe Thr Ser Ser Asn 610
615 620 Gln Ile Gly Leu Lys Thr Asp Val Thr Asp Tyr His Ile Asp Gln
Val 625 630 635 640 651836DNAArtificial Sequence5*cry3A055 coding
sequence 65atgactagta acggccgcca gtgtgctgga attcgccctt atgacggccg
acaacaacac 60cgaggcctgg acagcagcac caccaaggac gtgatccaga agggcatcag
cgtggtgggc 120gacctgctgg gcgtggtggg cttccccttc ggcggcgccc
tggtgagctt ctacaccaac 180ttcctgaaca ccatctggcc cagcgaggac
ccctggaagg ccttcatgga gcaggtggag 240gccctgatgg accagaagat
cgccgactac gccaagaaca aggcactggc cgagctacag 300ggcctccaga
acaacgtgga ggactatgtg agcgccctga gcagctggca gaagaacccc
360gctgcaccgt tccgcaaccc ccacagccag ggccgcatcc gcgagctgtt
cagccaggcc 420gagagccact tccgcaacag catgcccagc ttcgccatca
gcggctacga ggtgctgttc 480ctgaccacct acgcccaggc cgccaacacc
cacctgttcc tgctgaagga cgcccaaatc 540tacggagagg agtggggcta
cgagaaggag gacatcgccg agttctacaa gcgccagctg 600aagctgaccc
aggagtacac cgaccactgc gtgaagtggt acaacgtggg tctagacaag
660ctccgcggca gcagctacga gagctgggtg aacttcaacc gctaccgccg
cgagatgacc 720ctgaccgtgc tggacctgat cgccctgttc cccctgtacg
acgtgcgcct gtaccccaag 780gaggtgaaga ccgagctgac ccgcgacgtg
ctgaccgacc ccatcgtggg cgtgaacaac 840ctgcgcggct acggcaccac
cttcagcaac atcgagaact acatccgcaa gccccacctg 900ttcgactacc
tgcaccgcat ccagttccac acgcgtttcc agcccggcta ctacggcaac
960gacagcttca actactggag cggcaactac gtgagcaccc gccccagcat
cggcagcaac 1020gacatcatca ccagcccctt ctacggcaac aagagcagcg
agcccgtgca gaaccttgag 1080ttcaacggcg agaaggtgta ccgcgccgtg
gctaacacca acctggccgt gtggccctct 1140gcagtgtaca gcggcgtgac
caaggtggag ttcagccagt acaacgacca gaccgacgag 1200gccagcaccc
agacctacga cagcaagcgc aacgtgggcg ccgtgagctg ggacagcatc
1260gaccagctgc cccccgagac caccgacgag cccctggaga agggctacag
ccaccagctg 1320aactacgtga tgtgcttcct gatgcagggc agccgcggca
ccatccccgt gctgacctgg 1380acccacaaga gcgtcgactt cttcaacatg
atcgacagca agaagatcac ccagctgccc 1440ctggtgaagg cctacaagct
ccagagcggc gccagcgtgg tggcaggccc ccgcttcacc 1500ggcggcgaca
tcatccagtg caccgagaac ggcagcgccg ccaccatcta cgtgaccccc
1560gacgtgagct acagccagaa gtaccgcgcc cgcatccact acgccagcac
cagccagatc 1620accttcaccc tgagcctgga cggggccccc ttcaaccaat
actacttcga caagaccatc 1680aacaagggcg acaccctgac ctacaacagc
ttcaacctgg ccagcttcag cacccctttc 1740gagctgagcg gcaacaacct
ccagatcggc gtgaccggcc tgagcgccgg cgacaaggtg 1800tacatcgaca
agatcgagtt catccccgtg aactag 183666611PRTArtificial
Sequence5*Cry3A055 protein 66Met Thr Ser Asn Gly Arg Gln Cys Ala
Gly Ile Arg Pro Tyr Asp Gly 1 5 10 15 Arg Gln Gln His Arg Gly Leu
Asp Ser Ser Thr Thr Lys Asp Val Ile 20 25 30 Gln Lys Gly Ile Ser
Val Val Gly Asp Leu Leu Gly Val Val Gly Phe 35 40 45 Pro Phe Gly
Gly Ala Leu Val Ser Phe Tyr Thr Asn Phe Leu Asn Thr 50 55 60 Ile
Trp Pro Ser Glu Asp Pro Trp Lys Ala Phe Met Glu Gln Val Glu 65 70
75 80 Ala Leu Met Asp Gln Lys Ile Ala Asp Tyr Ala Lys Asn Lys Ala
Leu 85 90 95 Ala Glu Leu Gln Gly Leu Gln Asn Asn Val Glu Asp Tyr
Val Ser Ala 100 105 110 Leu Ser Ser Trp Gln Lys Asn Pro Ala Ala Pro
Phe Arg Asn Pro His 115 120 125 Ser Gln Gly Arg Ile Arg Glu Leu Phe
Ser Gln Ala Glu Ser His Phe 130 135 140 Arg Asn Ser Met Pro Ser Phe
Ala Ile Ser Gly Tyr Glu Val Leu Phe 145 150 155 160 Leu Thr Thr Tyr
Ala Gln Ala Ala Asn Thr His Leu Phe Leu Leu Lys 165 170 175 Asp Ala
Gln Ile Tyr Gly Glu Glu Trp Gly Tyr Glu Lys Glu Asp Ile 180 185 190
Ala Glu Phe Tyr Lys Arg Gln Leu Lys Leu Thr Gln Glu Tyr Thr Asp 195
200 205 His Cys Val Lys Trp Tyr Asn Val Gly Leu Asp Lys Leu Arg Gly
Ser 210 215 220 Ser Tyr Glu Ser Trp Val Asn Phe Asn Arg Tyr Arg Arg
Glu Met Thr 225 230 235 240 Leu Thr Val Leu Asp Leu Ile Ala Leu Phe
Pro Leu Tyr Asp Val Arg 245 250 255 Leu Tyr Pro Lys Glu Val Lys Thr
Glu Leu Thr Arg Asp Val Leu Thr 260 265 270 Asp Pro Ile Val Gly Val
Asn Asn Leu Arg Gly Tyr Gly Thr Thr Phe 275 280 285 Ser Asn Ile Glu
Asn Tyr Ile Arg Lys Pro His Leu Phe Asp Tyr Leu 290 295 300 His Arg
Ile Gln Phe His Thr Arg Phe Gln Pro Gly Tyr Tyr Gly Asn 305 310 315
320 Asp Ser Phe Asn Tyr Trp Ser Gly Asn Tyr Val Ser Thr Arg Pro Ser
325 330 335 Ile Gly Ser Asn Asp Ile Ile Thr Ser Pro Phe Tyr Gly Asn
Lys Ser 340 345 350 Ser Glu Pro Val Gln Asn Leu Glu Phe Asn Gly Glu
Lys Val Tyr Arg 355 360 365 Ala Val Ala Asn Thr Asn Leu Ala Val Trp
Pro Ser Ala Val Tyr Ser 370 375 380 Gly Val Thr Lys Val Glu Phe Ser
Gln Tyr Asn Asp Gln Thr Asp Glu 385 390 395 400 Ala Ser Thr Gln Thr
Tyr Asp Ser Lys Arg Asn Val Gly Ala Val Ser 405 410 415 Trp Asp Ser
Ile Asp Gln Leu Pro Pro Glu Thr Thr Asp Glu Pro Leu 420 425 430 Glu
Lys Gly Tyr Ser His Gln Leu Asn Tyr Val Met Cys Phe Leu Met 435 440
445 Gln Gly Ser Arg Gly Thr Ile Pro Val Leu Thr Trp Thr His Lys Ser
450 455 460 Val Asp Phe Phe Asn Met Ile Asp Ser Lys Lys Ile Thr Gln
Leu Pro 465 470 475 480 Leu Val Lys Ala Tyr Lys Leu Gln Ser Gly Ala
Ser Val Val Ala Gly 485 490 495 Pro Arg Phe Thr Gly Gly Asp Ile Ile
Gln Cys Thr Glu Asn Gly Ser 500 505 510 Ala Ala Thr Ile Tyr Val Thr
Pro Asp Val Ser Tyr Ser Gln Lys Tyr 515 520 525 Arg Ala Arg Ile His
Tyr Ala Ser Thr Ser Gln Ile Thr Phe Thr Leu 530 535 540 Ser Leu Asp
Gly Ala Pro Phe Asn Gln Tyr Tyr Phe Asp Lys Thr Ile 545 550 555 560
Asn Lys Gly Asp Thr Leu Thr Tyr Asn Ser Phe Asn Leu Ala Ser Phe 565
570 575 Ser Thr Pro Phe Glu Leu Ser Gly Asn Asn Leu Gln Ile Gly Val
Thr 580 585 590 Gly Leu Ser Ala Gly Asp Lys Val Tyr Ile Asp Lys Ile
Glu Phe Ile 595 600 605 Pro Val Asn 610 671803DNAArtificial
Sequencemocry3A coding sequence 67atgacggccg acaacaacac cgaggccctg
gacagcagca ccaccaagga cgtgatccag 60aagggcatca gcgtggtggg cgacctgctg
ggcgtggtgg gcttcccctt cggcggcgcc 120ctggtgagct tctacaccaa
cttcctgaac accatctggc ccagcgagga cccctggaag 180gccttcatgg
agcaggtgga ggccctgatg gaccagaaga tcgccgacta cgccaagaac
240aaggcactgg ccgagctaca gggcctccag aacaacgtgg aggactatgt
gagcgccctg 300agcagctggc agaagaaccc cgtctcgagc cgcaaccccc
acagccaggg ccgcatccgc 360gagctgttca gccaggccga gagccacttc
cgcaacagca tgcccagctt cgccatcagc 420ggctacgagg tgctgttcct
gaccacctac gcccaggccg ccaacaccca cctgttcctg 480ctgaaggacg
cccaaatcta cggagaggag tggggctacg agaaggagga catcgccgag
540ttctacaagc gccagctgaa gctgacccag gagtacaccg accactgcgt
gaagtggtac 600aacgtgggtc tagacaagct ccgcggcagc agctacgaga
gctgggtgaa cttcaaccgc 660taccgccgcg agatgaccct gaccgtgctg
gacctgatcg ccctgttccc cctgtacgac 720gtgcgcctgt accccaagga
ggtgaagacc gagctgaccc gcgacgtgct gaccgacccc 780atcgtgggcg
tgaacaacct gcgcggctac ggcaccacct tcagcaacat cgagaactac
840atccgcaagc cccacctgtt cgactacctg caccgcatcc agttccacac
gcgtttccag 900cccggctact acggcaacga cagcttcaac tactggagcg
gcaactacgt gagcacccgc 960cccagcatcg gcagcaacga catcatcacc
agccccttct acggcaacaa gagcagcgag 1020cccgtgcaga accttgagtt
caacggcgag aaggtgtacc gcgccgtggc taacaccaac 1080ctggccgtgt
ggccctctgc agtgtacagc ggcgtgacca aggtggagtt cagccagtac
1140aacgaccaga ccgacgaggc cagcacccag acctacgaca gcaagcgcaa
cgtgggcgcc 1200gtgagctggg acagcatcga ccagctgccc cccgagacca
ccgacgagcc cctggagaag 1260ggctacagcc accagctgaa ctacgtgatg
tgcttcctga tgcagggcag ccgcggcacc 1320atccccgtgc tgacctggac
ccacaagagc gtcgacttct tcaacatgat cgacagcaag 1380aagatcaccc
agctgcccct ggtgaaggcc tacaagctcc agagcggcgc cagcgtggtg
1440gcaggccccc gcttcaccgg cggcgacatc atccagtgca ccgagaacgg
cagcgccgcc 1500accatctacg tgacccccga cgtgagctac agccagaagt
accgcgcccg catccactac 1560gccagcacca gccagatcac cttcaccctg
agcctggacg gggccccctt caaccaatac 1620tacttcgaca agaccatcaa
caagggcgac accctgacct acaacagctt caacctggcc 1680agcttcagca
cccctttcga gctgagcggc aacaacctcc agatcggcgt gaccggcctg
1740agcgccggcg acaaggtgta catcgacaag atcgagttca tccccgtgaa
ctagatctga 1800gct 180368597PRTBacillus thuringiensismoCry3A 68Met
Thr Ala Asp Asn Asn Thr Glu Ala Leu Asp Ser Ser Thr Thr Lys 1 5 10
15 Asp Val Ile Gln Lys Gly Ile Ser Val Val Gly Asp Leu Leu Gly Val
20 25 30 Val Gly Phe Pro Phe Gly Gly Ala Leu Val Ser Phe Tyr Thr
Asn Phe 35 40 45 Leu Asn Thr Ile Trp Pro Ser Glu Asp Pro Trp Lys
Ala Phe Met Glu 50 55 60 Gln Val Glu Ala Leu Met Asp Gln Lys Ile
Ala Asp Tyr Ala Lys Asn 65 70 75 80 Lys Ala Leu Ala Glu Leu Gln Gly
Leu Gln Asn Asn Val Glu Asp Tyr 85 90 95 Val Ser Ala Leu Ser Ser
Trp Gln Lys Asn Pro Val Ser Ser Arg Asn 100 105 110 Pro His Ser Gln
Gly Arg Ile Arg Glu Leu Phe Ser Gln Ala Glu Ser 115 120 125 His Phe
Arg Asn Ser Met Pro Ser Phe Ala Ile Ser Gly Tyr Glu Val 130 135 140
Leu Phe Leu Thr Thr Tyr Ala Gln Ala Ala Asn Thr His Leu Phe Leu 145
150 155 160 Leu Lys Asp Ala Gln Ile Tyr Gly Glu Glu Trp Gly Tyr Glu
Lys Glu 165 170 175 Asp Ile Ala Glu Phe Tyr Lys Arg Gln Leu Lys Leu
Thr Gln Glu Tyr 180 185 190 Thr Asp His Cys Val Lys Trp Tyr Asn Val
Gly Leu Asp Lys Leu Arg 195 200 205 Gly Ser Ser Tyr Glu Ser Trp Val
Asn Phe Asn Arg Tyr Arg Arg Glu 210 215 220 Met Thr Leu Thr Val Leu
Asp Leu Ile Ala Leu Phe Pro Leu Tyr Asp 225 230 235 240 Val Arg Leu
Tyr Pro Lys Glu Val Lys Thr Glu Leu Thr Arg Asp Val 245 250 255 Leu
Thr Asp Pro Ile Val Gly Val Asn Asn Leu Arg Gly Tyr Gly Thr 260 265
270 Thr Phe Ser Asn Ile Glu Asn Tyr Ile Arg Lys Pro His Leu Phe Asp
275 280 285 Tyr Leu His Arg Ile Gln Phe His Thr Arg Phe Gln Pro Gly
Tyr Tyr 290 295 300 Gly Asn Asp Ser Phe Asn Tyr Trp Ser Gly Asn Tyr
Val Ser Thr Arg 305 310 315 320 Pro Ser Ile Gly Ser Asn Asp Ile Ile
Thr Ser Pro Phe Tyr Gly Asn 325 330 335 Lys Ser Ser Glu Pro Val Gln
Asn Leu Glu Phe Asn Gly Glu Lys Val 340 345 350 Tyr Arg Ala Val Ala
Asn Thr Asn Leu Ala Val Trp Pro Ser Ala Val 355 360 365 Tyr Ser Gly
Val Thr Lys Val Glu Phe Ser Gln Tyr Asn Asp Gln Thr 370 375 380 Asp
Glu Ala Ser Thr Gln Thr Tyr Asp Ser Lys Arg Asn Val Gly Ala 385 390
395 400 Val Ser Trp Asp Ser Ile Asp Gln Leu Pro Pro Glu Thr Thr Asp
Glu 405 410 415 Pro Leu Glu Lys Gly Tyr Ser His Gln Leu Asn Tyr Val
Met Cys Phe 420 425 430 Leu Met Gln Gly Ser Arg Gly Thr Ile Pro Val
Leu Thr Trp Thr His 435 440 445 Lys Ser Val Asp Phe Phe Asn Met Ile
Asp Ser Lys Lys Ile Thr Gln 450 455 460 Leu Pro Leu Val Lys Ala Tyr
Lys Leu Gln Ser Gly Ala Ser Val Val 465 470 475 480 Ala Gly Pro Arg
Phe Thr Gly Gly Asp Ile Ile Gln Cys Thr Glu Asn 485 490 495 Gly Ser
Ala Ala Thr Ile Tyr Val Thr Pro Asp Val Ser Tyr Ser Gln 500 505 510
Lys Tyr Arg Ala Arg Ile His Tyr Ala Ser Thr Ser Gln Ile Thr Phe 515
520 525 Thr Leu Ser Leu Asp Gly Ala Pro Phe Asn Gln Tyr Tyr Phe Asp
Lys 530 535 540 Thr Ile Asn Lys Gly Asp Thr Leu Thr Tyr Asn Ser Phe
Asn Leu Ala 545 550 555 560 Ser Phe Ser Thr Pro Phe Glu Leu Ser Gly
Asn Asn Leu Gln Ile Gly 565 570 575 Val Thr Gly Leu Ser Ala Gly Asp
Lys Val Tyr Ile Asp Lys Ile Glu 580 585 590 Phe Ile Pro Val Asn 595
69 1807DNAArtificial Sequencecry3A055 coding sequence 69atgacggccg
acaacaacac cgaggccctg gacagcagca ccaccaagga cgtgatccag 60aagggcatca
gcgtggtggg cgacctgctg ggcgtggtgg gcttcccctt cggcggcgcc
120ctggtgagct tctacaccaa cttcctgaac accatctggc ccagcgagga
cccctggaag 180gccttcatgg agcaggtgga ggccctgatg gaccagaaga
tcgccgacta cgccaagaac 240aaggcactgg ccgagctaca gggcctccag
aacaacgtgg aggactatgt gagcgccctg 300agcagctggc agaagaaccc
cgctgcaccg ttccgcaacc cccacagcca gggccgcatc 360cgcgagctgt
tcagccaggc cgagagccac ttccgcaaca gcatgcccag cttcgccatc
420agcggctacg aggtgctgtt cctgaccacc tacgcccagg ccgccaacac
ccacctgttc 480ctgctgaagg acgcccaaat ctacggagag gagtggggct
acgagaagga ggacatcgcc 540gagttctaca agcgccagct gaagctgacc
caggagtaca ccgaccactg cgtgaagtgg 600tacaacgtgg gtctagacaa
gctccgcggc agcagctacg agagctgggt gaacttcaac 660cgctaccgcc
gcgagatgac cctgaccgtg ctggacctga tcgccctgtt ccccctgtac
720gacgtgcgcc tgtaccccaa ggaggtgaag accgagctga cccgcgacgt
gctgaccgac 780cccatcgtgg gcgtgaacaa cctgcgcggc tacggcacca
ccttcagcaa catcgagaac 840tacatccgca agccccacct gttcgactac
ctgcaccgca tccagttcca cacgcgtttc 900cagcccggct actacggcaa
cgacagcttc aactactgga gcggcaacta cgtgagcacc 960cgccccagca
tcggcagcaa cgacatcatc accagcccct tctacggcaa caagagcagc
1020gagcccgtgc agaaccttga gttcaacggc gagaaggtgt accgcgccgt
ggctaacacc 1080aacctggccg tgtggccctc tgcagtgtac agcggcgtga
ccaaggtgga gttcagccag 1140tacaacgacc agaccgacga ggccagcacc
cagacctacg acagcaagcg caacgtgggc 1200gccgtgagct gggacagcat
cgaccagctg ccccccgaga ccaccgacga gcccctggag 1260aagggctaca
gccaccagct gaactacgtg atgtgcttcc tgatgcaggg cagccgcggc
1320accatccccg tgctgacctg gacccacaag agcgtcgact tcttcaacat
gatcgacagc 1380aagaagatca cccagctgcc cctggtgaag gcctacaagc
tccagagcgg cgccagcgtg 1440gtggcaggcc cccgcttcac cggcggcgac
atcatccagt gcaccgagaa cggcagcgcc 1500gccaccatct acgtgacccc
cgacgtgagc tacagccaga agtaccgcgc ccgcatccac 1560tacgccagca
ccagccagat caccttcacc ctgagcctgg acggggcccc cttcaaccaa
1620tactacttcg acaagaccat caacaagggc gacaccctga cctacaacag
cttcaacctg 1680gccagcttca gcaccccttt cgagctgagc ggcaacaacc
tccagatcgg cgtgaccggc 1740ctgagcgccg gcgacaaggt gtacatcgac
aagatcgagt tcatccccgt gaactagatc 1800tgagctc 180770598PRTArtificial
SequenceCry3A055 protein 70Met Thr Ala Asp Asn Asn Thr Glu Ala Leu
Asp Ser Ser Thr Thr Lys 1 5 10 15 Asp Val Ile Gln Lys Gly Ile Ser
Val Val Gly Asp Leu Leu Gly Val 20 25 30 Val Gly Phe Pro Phe Gly
Gly Ala Leu Val Ser Phe Tyr Thr Asn Phe 35 40 45 Leu
Asn Thr Ile Trp Pro Ser Glu Asp Pro Trp Lys Ala Phe Met Glu 50 55
60 Gln Val Glu Ala Leu Met Asp Gln Lys Ile Ala Asp Tyr Ala Lys Asn
65 70 75 80 Lys Ala Leu Ala Glu Leu Gln Gly Leu Gln Asn Asn Val Glu
Asp Tyr 85 90 95 Val Ser Ala Leu Ser Ser Trp Gln Lys Asn Pro Ala
Ala Pro Phe Arg 100 105 110 Asn Pro His Ser Gln Gly Arg Ile Arg Glu
Leu Phe Ser Gln Ala Glu 115 120 125 Ser His Phe Arg Asn Ser Met Pro
Ser Phe Ala Ile Ser Gly Tyr Glu 130 135 140 Val Leu Phe Leu Thr Thr
Tyr Ala Gln Ala Ala Asn Thr His Leu Phe 145 150 155 160 Leu Leu Lys
Asp Ala Gln Ile Tyr Gly Glu Glu Trp Gly Tyr Glu Lys 165 170 175 Glu
Asp Ile Ala Glu Phe Tyr Lys Arg Gln Leu Lys Leu Thr Gln Glu 180 185
190 Tyr Thr Asp His Cys Val Lys Trp Tyr Asn Val Gly Leu Asp Lys Leu
195 200 205 Arg Gly Ser Ser Tyr Glu Ser Trp Val Asn Phe Asn Arg Tyr
Arg Arg 210 215 220 Glu Met Thr Leu Thr Val Leu Asp Leu Ile Ala Leu
Phe Pro Leu Tyr 225 230 235 240 Asp Val Arg Leu Tyr Pro Lys Glu Val
Lys Thr Glu Leu Thr Arg Asp 245 250 255 Val Leu Thr Asp Pro Ile Val
Gly Val Asn Asn Leu Arg Gly Tyr Gly 260 265 270 Thr Thr Phe Ser Asn
Ile Glu Asn Tyr Ile Arg Lys Pro His Leu Phe 275 280 285 Asp Tyr Leu
His Arg Ile Gln Phe His Thr Arg Phe Gln Pro Gly Tyr 290 295 300 Tyr
Gly Asn Asp Ser Phe Asn Tyr Trp Ser Gly Asn Tyr Val Ser Thr 305 310
315 320 Arg Pro Ser Ile Gly Ser Asn Asp Ile Ile Thr Ser Pro Phe Tyr
Gly 325 330 335 Asn Lys Ser Ser Glu Pro Val Gln Asn Leu Glu Phe Asn
Gly Glu Lys 340 345 350 Val Tyr Arg Ala Val Ala Asn Thr Asn Leu Ala
Val Trp Pro Ser Ala 355 360 365 Val Tyr Ser Gly Val Thr Lys Val Glu
Phe Ser Gln Tyr Asn Asp Gln 370 375 380 Thr Asp Glu Ala Ser Thr Gln
Thr Tyr Asp Ser Lys Arg Asn Val Gly 385 390 395 400 Ala Val Ser Trp
Asp Ser Ile Asp Gln Leu Pro Pro Glu Thr Thr Asp 405 410 415 Glu Pro
Leu Glu Lys Gly Tyr Ser His Gln Leu Asn Tyr Val Met Cys 420 425 430
Phe Leu Met Gln Gly Ser Arg Gly Thr Ile Pro Val Leu Thr Trp Thr 435
440 445 His Lys Ser Val Asp Phe Phe Asn Met Ile Asp Ser Lys Lys Ile
Thr 450 455 460 Gln Leu Pro Leu Val Lys Ala Tyr Lys Leu Gln Ser Gly
Ala Ser Val 465 470 475 480 Val Ala Gly Pro Arg Phe Thr Gly Gly Asp
Ile Ile Gln Cys Thr Glu 485 490 495 Asn Gly Ser Ala Ala Thr Ile Tyr
Val Thr Pro Asp Val Ser Tyr Ser 500 505 510 Gln Lys Tyr Arg Ala Arg
Ile His Tyr Ala Ser Thr Ser Gln Ile Thr 515 520 525 Phe Thr Leu Ser
Leu Asp Gly Ala Pro Phe Asn Gln Tyr Tyr Phe Asp 530 535 540 Lys Thr
Ile Asn Lys Gly Asp Thr Leu Thr Tyr Asn Ser Phe Asn Leu 545 550 555
560 Ala Ser Phe Ser Thr Pro Phe Glu Leu Ser Gly Asn Asn Leu Gln Ile
565 570 575 Gly Val Thr Gly Leu Ser Ala Gly Asp Lys Val Tyr Ile Asp
Lys Ile 580 585 590 Glu Phe Ile Pro Val Asn 595 71
1947DNAArtificial Sequencemocry1Ab coding sequence 71atggacaaca
accccaacat caacgagtgc atcccctaca actgcctgag caaccccgag 60gtggaggtgc
tgggcggcga gcgcatcgag accggctaca cccccatcga catcagcctg
120agcctgaccc agttcctgct gagcgagttc gtgcccggcg ccggcttcgt
gctgggcctg 180gtggacatca tctggggcat cttcggcccc agccagtggg
acgccttcct ggtgcagatc 240gagcagttga taaaccaacg catagaggaa
ttcgcccgca accaggccat cagccgcctg 300gagggcctga gcaacctgta
ccaaatctac gccgagagct tccgcgagtg ggaggccgac 360cccaccaacc
ccgccctgcg cgaggagatg cgcatccagt tcaacgacat gaacagcgcc
420ctgaccaccg ccatccccct gttcgccgtg cagaactacc aggtgcccct
gctgagcgtg 480tacgtgcagg ccgccaacct gcacctgagc gtgctgcgcg
acgtcagcgt gttcggccag 540cgctggggct tcgacgccgc caccatcaac
agccgctaca acgacctgac ccgcctgatc 600ggcaactaca ccgaccacgc
cgtgcgctgg tacaacaccg gcctggagcg cgtgtggggt 660cccgacagcc
gcgactggat caggtacaac cagttccgcc gcgagctgac cctgaccgtg
720ctggacatcg tgagcctgtt ccccaactac gacagccgca cctaccccat
ccgcaccgtg 780agccagctga cccgcgagat ttacaccaac cccgtgctgg
agaacttcga cggcagcttc 840cgcggcagcg cccagggcat cgagggcagc
atccgcagcc cccacctgat ggacatcctg 900aacagcatca ccatctacac
cgacgcccac cgcggcgagt actactggag cggccaccag 960atcatggcca
gccccgtcgg cttcagcggc cccgagttca ccttccccct gtacggcacc
1020atgggcaacg ctgcacctca gcagcgcatc gtggcacagc tgggccaggg
agtgtaccgc 1080accctgagca gcaccctgta ccgtcgacct ttcaacatcg
gcatcaacaa ccagcagctg 1140agcgtgctgg acggcaccga gttcgcctac
ggcaccagca gcaacctgcc cagcgccgtg 1200taccgcaaga gcggcaccgt
ggacagcctg gacgagatcc cccctcagaa caacaacgtg 1260ccacctcgac
agggcttcag ccaccgtctg agccacgtga gcatgttccg cagtggcttc
1320agcaacagca gcgtgagcat catccgtgca cctatgttca gctggattca
ccgcagtgcc 1380gagttcaaca acatcatccc cagcagccag atcacccaga
tccccctgac caagagcacc 1440aacctgggca gcggcaccag cgtggtgaag
ggccccggct tcaccggcgg cgacatcctg 1500cgccgcacca gccccggcca
gatcagcacc ctgcgcgtga acatcaccgc ccccctgagc 1560cagcgctacc
gcgtccgcat ccgctacgcc agcaccacca acctgcagtt ccacaccagc
1620atcgacggcc gccccatcaa ccagggcaac ttcagcgcca ccatgagcag
cggcagcaac 1680ctgcagagcg gcagcttccg caccgtgggc ttcaccaccc
ccttcaactt cagcaacggc 1740agcagcgtgt tcaccctgag cgcccacgtg
ttcaacagcg gcaacgaggt gtacatcgac 1800cgcatcgagt tcgtgcccgc
cgaggtgacc ttcgaggccg agtacgacct ggagagggct 1860cagaaggccg
tgaacgagct gttcaccagc agcaaccaga tcggcctgaa gaccgacgtg
1920accgactacc acatcgatca ggtgtag 194772648PRTBacillus
thuringiensisCry1Ab protein 72Met Asp Asn Asn Pro Asn Ile Asn Glu
Cys Ile Pro Tyr Asn Cys Leu 1 5 10 15 Ser Asn Pro Glu Val Glu Val
Leu Gly Gly Glu Arg Ile Glu Thr Gly 20 25 30 Tyr Thr Pro Ile Asp
Ile Ser Leu Ser Leu Thr Gln Phe Leu Leu Ser 35 40 45 Glu Phe Val
Pro Gly Ala Gly Phe Val Leu Gly Leu Val Asp Ile Ile 50 55 60 Trp
Gly Ile Phe Gly Pro Ser Gln Trp Asp Ala Phe Leu Val Gln Ile 65 70
75 80 Glu Gln Leu Ile Asn Gln Arg Ile Glu Glu Phe Ala Arg Asn Gln
Ala 85 90 95 Ile Ser Arg Leu Glu Gly Leu Ser Asn Leu Tyr Gln Ile
Tyr Ala Glu 100 105 110 Ser Phe Arg Glu Trp Glu Ala Asp Pro Thr Asn
Pro Ala Leu Arg Glu 115 120 125 Glu Met Arg Ile Gln Phe Asn Asp Met
Asn Ser Ala Leu Thr Thr Ala 130 135 140 Ile Pro Leu Phe Ala Val Gln
Asn Tyr Gln Val Pro Leu Leu Ser Val 145 150 155 160 Tyr Val Gln Ala
Ala Asn Leu His Leu Ser Val Leu Arg Asp Val Ser 165 170 175 Val Phe
Gly Gln Arg Trp Gly Phe Asp Ala Ala Thr Ile Asn Ser Arg 180 185 190
Tyr Asn Asp Leu Thr Arg Leu Ile Gly Asn Tyr Thr Asp His Ala Val 195
200 205 Arg Trp Tyr Asn Thr Gly Leu Glu Arg Val Trp Gly Pro Asp Ser
Arg 210 215 220 Asp Trp Ile Arg Tyr Asn Gln Phe Arg Arg Glu Leu Thr
Leu Thr Val 225 230 235 240 Leu Asp Ile Val Ser Leu Phe Pro Asn Tyr
Asp Ser Arg Thr Tyr Pro 245 250 255 Ile Arg Thr Val Ser Gln Leu Thr
Arg Glu Ile Tyr Thr Asn Pro Val 260 265 270 Leu Glu Asn Phe Asp Gly
Ser Phe Arg Gly Ser Ala Gln Gly Ile Glu 275 280 285 Gly Ser Ile Arg
Ser Pro His Leu Met Asp Ile Leu Asn Ser Ile Thr 290 295 300 Ile Tyr
Thr Asp Ala His Arg Gly Glu Tyr Tyr Trp Ser Gly His Gln 305 310 315
320 Ile Met Ala Ser Pro Val Gly Phe Ser Gly Pro Glu Phe Thr Phe Pro
325 330 335 Leu Tyr Gly Thr Met Gly Asn Ala Ala Pro Gln Gln Arg Ile
Val Ala 340 345 350 Gln Leu Gly Gln Gly Val Tyr Arg Thr Leu Ser Ser
Thr Leu Tyr Arg 355 360 365 Arg Pro Phe Asn Ile Gly Ile Asn Asn Gln
Gln Leu Ser Val Leu Asp 370 375 380 Gly Thr Glu Phe Ala Tyr Gly Thr
Ser Ser Asn Leu Pro Ser Ala Val 385 390 395 400 Tyr Arg Lys Ser Gly
Thr Val Asp Ser Leu Asp Glu Ile Pro Pro Gln 405 410 415 Asn Asn Asn
Val Pro Pro Arg Gln Gly Phe Ser His Arg Leu Ser His 420 425 430 Val
Ser Met Phe Arg Ser Gly Phe Ser Asn Ser Ser Val Ser Ile Ile 435 440
445 Arg Ala Pro Met Phe Ser Trp Ile His Arg Ser Ala Glu Phe Asn Asn
450 455 460 Ile Ile Pro Ser Ser Gln Ile Thr Gln Ile Pro Leu Thr Lys
Ser Thr 465 470 475 480 Asn Leu Gly Ser Gly Thr Ser Val Val Lys Gly
Pro Gly Phe Thr Gly 485 490 495 Gly Asp Ile Leu Arg Arg Thr Ser Pro
Gly Gln Ile Ser Thr Leu Arg 500 505 510 Val Asn Ile Thr Ala Pro Leu
Ser Gln Arg Tyr Arg Val Arg Ile Arg 515 520 525 Tyr Ala Ser Thr Thr
Asn Leu Gln Phe His Thr Ser Ile Asp Gly Arg 530 535 540 Pro Ile Asn
Gln Gly Asn Phe Ser Ala Thr Met Ser Ser Gly Ser Asn 545 550 555 560
Leu Gln Ser Gly Ser Phe Arg Thr Val Gly Phe Thr Thr Pro Phe Asn 565
570 575 Phe Ser Asn Gly Ser Ser Val Phe Thr Leu Ser Ala His Val Phe
Asn 580 585 590 Ser Gly Asn Glu Val Tyr Ile Asp Arg Ile Glu Phe Val
Pro Ala Glu 595 600 605 Val Thr Phe Glu Ala Glu Tyr Asp Leu Glu Arg
Ala Gln Lys Ala Val 610 615 620 Asn Glu Leu Phe Thr Ser Ser Asn Gln
Ile Gly Leu Lys Thr Asp Val 625 630 635 640 Thr Asp Tyr His Ile Asp
Gln Val 645 731971DNAArtificial Sequencemocry1Ba coding sequence
73atgaccagca accgcaagaa cgagaacgag atcatcaacg ccgtgagcaa ccacagcgcc
60cagatggacc tgctgcccga cgcccgcatc gaggacagcc tgtgcatcgc cgagggcaac
120aacatcgacc ccttcgtgag cgctagcacc gtgcagaccg gtatcaacat
cgctggccgc 180atcctgggcg tgctgggcgt gcccttcgcc ggccagctgg
ctagcttcta cagcttcctg 240gtcggtgagc tgtggccacg cggccgcgac
cagtgggaaa tcttcctgga gcacgtggag 300cagctgatca accagcagat
caccgagaac gcccgcaaca ccgctcttgc ccgcctgcag 360ggtctgggcg
acagcttccg cgcctaccag cagagcctgg aggactggct ggagaaccgc
420gacgacgccc gcacccgcag cgtgctgtac acccagtaca tcgccctgga
gctggacttc 480ctgaacgcca tgcccctgtt cgccattcga aaccaggagg
tgcccctgct gatggtgtac 540gcccaggccg ccaacctgca cctgctgctg
ctgcgcgacg ccagcctgtt cggcagcgag 600ttcggcctga ccagccagga
gatccagcgg tactacgagc gccaggtgga gcgcacccgc 660gactacagcg
actactgcgt ggagtggtac aacaccggcc tgaacagctt aaggggcacc
720aacgccgcca gctgggtgcg ctacaaccag ttccgccgcg acctgaccct
gggcgtgctg 780gacctggtgg ccctgttccc cagctacgac acccgcacct
accccatcaa caccagcgcc 840cagctgaccc gcgaggtgta caccgacgcc
atcggcgcca ccggcgtgaa catggccagc 900atgaactggt acaacaacaa
cgcccccagc ttcagcgcca tcgaggccgc cgccatccgc 960agcccccacc
tgctggactt cctggagcag ctgaccatct tcagtgccag cagccgctgg
1020agcaacaccc gccacatgac ctactggcgc ggccacacca tccagtctag
acccatcggc 1080ggcggcctga acaccagcac ccacggcgcc accaacacca
gcatcaaccc cgtgaccctg 1140cgcttcgcct cccgagacgt ctaccgcacc
gagagctacg ccggcgtgct gctgtggggc 1200atctacctgg agcccatcca
tggcgtgccc accgtgcgct tcaacttcac caacccccag 1260aacatcagcg
accgcggcac cgccaactac agccagccct acgagagccc cgggttgcag
1320ctgaaggaca gcgagaccga gctgcccccc gagaccaccg agcgccccaa
ctacgagagc 1380tacagccacc gcctgagcca catcggcatc atcttgcaga
gccgcgtgaa cgtgcccgtg 1440tacagctgga cccaccgcag cgccgaccgc
accaacacca tcggccccaa ccgcatcacc 1500cagatcccca tggtgaaggc
cagcgagctg ccccagggca ccaccgtggt tcgcggcccc 1560ggcttcaccg
gaggcgacat cctgcgacgc accaacaccg gcggcttcgg ccccatccgc
1620gtgaccgtga acggccccct gacccagcgc taccgcatcg gcttccgcta
cgccagcacc 1680gtggacttcg acttcttcgt gagccgcggc ggcaccaccg
tgaacaactt ccgcttcctg 1740cgcaccatga acagcggcga cgagctgaag
tacggcaact tcgtgcgccg cgccttcacc 1800acccccttca ccttcaccca
gatccaggac atcatccgca ccagcatcca gggcctgagc 1860ggcaacggcg
aggtgtacat cgacaagatc gagatcatcc ccgtgaccgc caccttcgag
1920gccgagtacg acctagagcg cgcccaggag gccgtgaacg ccctgttcta g
197174656PRTBacillus thuringiensisCry1B protein 74Met Thr Ser Asn
Arg Lys Asn Glu Asn Glu Ile Ile Asn Ala Val Ser 1 5 10 15 Asn His
Ser Ala Gln Met Asp Leu Leu Pro Asp Ala Arg Ile Glu Asp 20 25 30
Ser Leu Cys Ile Ala Glu Gly Asn Asn Ile Asp Pro Phe Val Ser Ala 35
40 45 Ser Thr Val Gln Thr Gly Ile Asn Ile Ala Gly Arg Ile Leu Gly
Val 50 55 60 Leu Gly Val Pro Phe Ala Gly Gln Leu Ala Ser Phe Tyr
Ser Phe Leu 65 70 75 80 Val Gly Glu Leu Trp Pro Arg Gly Arg Asp Gln
Trp Glu Ile Phe Leu 85 90 95 Glu His Val Glu Gln Leu Ile Asn Gln
Gln Ile Thr Glu Asn Ala Arg 100 105 110 Asn Thr Ala Leu Ala Arg Leu
Gln Gly Leu Gly Asp Ser Phe Arg Ala 115 120 125 Tyr Gln Gln Ser Leu
Glu Asp Trp Leu Glu Asn Arg Asp Asp Ala Arg 130 135 140 Thr Arg Ser
Val Leu Tyr Thr Gln Tyr Ile Ala Leu Glu Leu Asp Phe 145 150 155 160
Leu Asn Ala Met Pro Leu Phe Ala Ile Arg Asn Gln Glu Val Pro Leu 165
170 175 Leu Met Val Tyr Ala Gln Ala Ala Asn Leu His Leu Leu Leu Leu
Arg 180 185 190 Asp Ala Ser Leu Phe Gly Ser Glu Phe Gly Leu Thr Ser
Gln Glu Ile 195 200 205 Gln Arg Tyr Tyr Glu Arg Gln Val Glu Arg Thr
Arg Asp Tyr Ser Asp 210 215 220 Tyr Cys Val Glu Trp Tyr Asn Thr Gly
Leu Asn Ser Leu Arg Gly Thr 225 230 235 240 Asn Ala Ala Ser Trp Val
Arg Tyr Asn Gln Phe Arg Arg Asp Leu Thr 245 250 255 Leu Gly Val Leu
Asp Leu Val Ala Leu Phe Pro Ser Tyr Asp Thr Arg 260 265 270 Thr Tyr
Pro Ile Asn Thr Ser Ala Gln Leu Thr Arg Glu Val Tyr Thr 275 280 285
Asp Ala Ile Gly Ala Thr Gly Val Asn Met Ala Ser Met Asn Trp Tyr 290
295 300 Asn Asn Asn Ala Pro Ser Phe Ser Ala Ile Glu Ala Ala Ala Ile
Arg 305 310 315 320 Ser Pro His Leu Leu Asp Phe Leu Glu Gln Leu Thr
Ile Phe Ser Ala 325 330 335 Ser Ser Arg Trp Ser Asn Thr Arg His Met
Thr Tyr Trp Arg Gly His 340 345 350 Thr Ile Gln Ser Arg Pro Ile Gly
Gly Gly Leu Asn Thr Ser Thr His 355 360 365 Gly Ala Thr Asn Thr Ser
Ile Asn Pro Val Thr Leu Arg Phe Ala Ser 370 375 380 Arg Asp Val Tyr
Arg Thr Glu Ser Tyr Ala Gly Val Leu Leu Trp Gly 385 390 395 400 Ile
Tyr Leu Glu Pro Ile His Gly Val Pro Thr Val Arg Phe Asn Phe 405 410
415 Thr Asn Pro Gln Asn Ile Ser Asp Arg Gly Thr Ala Asn Tyr Ser Gln
420 425 430 Pro Tyr Glu Ser Pro Gly Leu Gln Leu Lys Asp Ser Glu Thr
Glu Leu 435 440 445 Pro Pro Glu Thr Thr Glu Arg Pro Asn Tyr Glu Ser
Tyr Ser His Arg 450 455 460 Leu Ser His Ile Gly Ile Ile Leu Gln Ser
Arg Val Asn Val Pro Val 465 470
475 480 Tyr Ser Trp Thr His Arg Ser Ala Asp Arg Thr Asn Thr Ile Gly
Pro 485 490 495 Asn Arg Ile Thr Gln Ile Pro Met Val Lys Ala Ser Glu
Leu Pro Gln 500 505 510 Gly Thr Thr Val Val Arg Gly Pro Gly Phe Thr
Gly Gly Asp Ile Leu 515 520 525 Arg Arg Thr Asn Thr Gly Gly Phe Gly
Pro Ile Arg Val Thr Val Asn 530 535 540 Gly Pro Leu Thr Gln Arg Tyr
Arg Ile Gly Phe Arg Tyr Ala Ser Thr 545 550 555 560 Val Asp Phe Asp
Phe Phe Val Ser Arg Gly Gly Thr Thr Val Asn Asn 565 570 575 Phe Arg
Phe Leu Arg Thr Met Asn Ser Gly Asp Glu Leu Lys Tyr Gly 580 585 590
Asn Phe Val Arg Arg Ala Phe Thr Thr Pro Phe Thr Phe Thr Gln Ile 595
600 605 Gln Asp Ile Ile Arg Thr Ser Ile Gln Gly Leu Ser Gly Asn Gly
Glu 610 615 620 Val Tyr Ile Asp Lys Ile Glu Ile Ile Pro Val Thr Ala
Thr Phe Glu 625 630 635 640 Ala Glu Tyr Asp Leu Glu Arg Ala Gln Glu
Ala Val Asn Ala Leu Phe 645 650 655 751950DNAArtificial
Sequencemocry1Fa coding sequence 75atggagaaca acatccagaa ccagtgcgtg
ccgtacaact gcctcaacaa cccggaggtg 60gagatcctca acgaggagcg ctccaccggc
cgcctcccgc tcgacatctc cctctccctc 120acccgcttcc tcctctccga
gttcgtgccg ggcgtgggcg tggccttcgg cctcttcgac 180ctcatctggg
gcttcatcac cccgtccgac tggtccctct tcctcctcca gatcgagcag
240ctcatcgagc agcgcatcga gaccctggag cgcaaccgcg ccatcaccac
cctccgcggc 300ctcgccgact cctacgaaat ctacatcgag gccctccgcg
agtgggaggc caacccgaac 360aacgcccagc tccgcgagga cgtgcgcatc
cgcttcgcca acaccgacga cgccctcatc 420accgccatca acaacttcac
cctcacctcc ttcgagatcc cgctcctctc cgtgtacgtg 480caggccgcca
acctccacct ctccctcctc cgcgacgccg tgtccttcgg ccagggctgg
540ggcctcgaca tcgccaccgt gaacaaccac tacaaccgcc tcatcaacct
catccaccgc 600tacaccaagc actgcctcga cacctacaac cagggcctgg
agaacctccg cggcaccaac 660acccgccagt gggcccgctt caaccagttc
cgccgcgacc tcaccctcac cgtgctcgac 720atcgtggccc tcttcccgaa
ctacgacgtg cgcacctacc cgatccagac ctcctcccag 780ctcacccgcg
aaatctacac ctcctccgtg atcgaggact ccccggtgtc cgccaacatc
840ccgaacggct tcaaccgcgc cgagttcggc gtgcgcccgc cgcacctcat
ggacttcatg 900aactccctct tcgtgaccgc cgagaccgtg cgctcccaga
ccgtgtgggg cggccacctc 960gtgtcctccc gcaacaccgc cggcaaccgc
atcaacttcc cgtcctacgg cgtgttcaac 1020ccgggcggcg ccatctggat
cgccgacgag gacccgcgcc cgttctaccg caccctctcc 1080gacccggtgt
tcgtgcgcgg cggcttcggc aacccgcact acgtgctcgg cctccgcggc
1140gtggccttcc agcagaccgg caccaaccac acccgcacct tccgcaactc
cggcaccatc 1200gactccctcg acgagatccc gccgcaggac aactccggcg
ccccgtggaa cgactactcc 1260cacgtgctca accacgtgac cttcgtgcgc
tggccgggcg agatatccgg ctccgactcc 1320tggcgtgcac cgatgttctc
ctggacccac cgctccgcca ccccgaccaa caccatcgac 1380ccggagcgca
tcacccagat cccgctcgtg aaggcccaca ccctccagtc cggcaccacc
1440gtggtgcgcg gcccgggctt caccggcggc gacatcctcc gccgcacctc
cggcggcccg 1500ttcgcctaca ccatcgtgaa catcaacggc cagctcccgc
agcgctaccg cgcccgcatc 1560cgctacgcct ccaccaccaa cctccgcatc
tacgtgaccg tggccggcga gcgcatcttc 1620gccggccagt tcaacaagac
catggacacc ggcgacccgc tcaccttcca gtccttctcc 1680tacgccacca
tcaacaccgc cttcaccttc ccgatgtccc agtcctcctt caccgtgggc
1740gccgacacct tctcctccgg caacgaggtg tacatcgacc gcttcgagct
gatcccggtg 1800accgccacct tcgaggccga gtacgacctg gagcgcgccc
agaaggccgt gaacgccctc 1860ttcacctcca tcaaccagat cggcatcaag
accgacgtga ccgactacca catcgaccag 1920gtgtccaacc tcgtggactg
cttaagctag 195076649PRTBacillus thguringiensisCry1F protein 76Met
Glu Asn Asn Ile Gln Asn Gln Cys Val Pro Tyr Asn Cys Leu Asn 1 5 10
15 Asn Pro Glu Val Glu Ile Leu Asn Glu Glu Arg Ser Thr Gly Arg Leu
20 25 30 Pro Leu Asp Ile Ser Leu Ser Leu Thr Arg Phe Leu Leu Ser
Glu Phe 35 40 45 Val Pro Gly Val Gly Val Ala Phe Gly Leu Phe Asp
Leu Ile Trp Gly 50 55 60 Phe Ile Thr Pro Ser Asp Trp Ser Leu Phe
Leu Leu Gln Ile Glu Gln 65 70 75 80 Leu Ile Glu Gln Arg Ile Glu Thr
Leu Glu Arg Asn Arg Ala Ile Thr 85 90 95 Thr Leu Arg Gly Leu Ala
Asp Ser Tyr Glu Ile Tyr Ile Glu Ala Leu 100 105 110 Arg Glu Trp Glu
Ala Asn Pro Asn Asn Ala Gln Leu Arg Glu Asp Val 115 120 125 Arg Ile
Arg Phe Ala Asn Thr Asp Asp Ala Leu Ile Thr Ala Ile Asn 130 135 140
Asn Phe Thr Leu Thr Ser Phe Glu Ile Pro Leu Leu Ser Val Tyr Val 145
150 155 160 Gln Ala Ala Asn Leu His Leu Ser Leu Leu Arg Asp Ala Val
Ser Phe 165 170 175 Gly Gln Gly Trp Gly Leu Asp Ile Ala Thr Val Asn
Asn His Tyr Asn 180 185 190 Arg Leu Ile Asn Leu Ile His Arg Tyr Thr
Lys His Cys Leu Asp Thr 195 200 205 Tyr Asn Gln Gly Leu Glu Asn Leu
Arg Gly Thr Asn Thr Arg Gln Trp 210 215 220 Ala Arg Phe Asn Gln Phe
Arg Arg Asp Leu Thr Leu Thr Val Leu Asp 225 230 235 240 Ile Val Ala
Leu Phe Pro Asn Tyr Asp Val Arg Thr Tyr Pro Ile Gln 245 250 255 Thr
Ser Ser Gln Leu Thr Arg Glu Ile Tyr Thr Ser Ser Val Ile Glu 260 265
270 Asp Ser Pro Val Ser Ala Asn Ile Pro Asn Gly Phe Asn Arg Ala Glu
275 280 285 Phe Gly Val Arg Pro Pro His Leu Met Asp Phe Met Asn Ser
Leu Phe 290 295 300 Val Thr Ala Glu Thr Val Arg Ser Gln Thr Val Trp
Gly Gly His Leu 305 310 315 320 Val Ser Ser Arg Asn Thr Ala Gly Asn
Arg Ile Asn Phe Pro Ser Tyr 325 330 335 Gly Val Phe Asn Pro Gly Gly
Ala Ile Trp Ile Ala Asp Glu Asp Pro 340 345 350 Arg Pro Phe Tyr Arg
Thr Leu Ser Asp Pro Val Phe Val Arg Gly Gly 355 360 365 Phe Gly Asn
Pro His Tyr Val Leu Gly Leu Arg Gly Val Ala Phe Gln 370 375 380 Gln
Thr Gly Thr Asn His Thr Arg Thr Phe Arg Asn Ser Gly Thr Ile 385 390
395 400 Asp Ser Leu Asp Glu Ile Pro Pro Gln Asp Asn Ser Gly Ala Pro
Trp 405 410 415 Asn Asp Tyr Ser His Val Leu Asn His Val Thr Phe Val
Arg Trp Pro 420 425 430 Gly Glu Ile Ser Gly Ser Asp Ser Trp Arg Ala
Pro Met Phe Ser Trp 435 440 445 Thr His Arg Ser Ala Thr Pro Thr Asn
Thr Ile Asp Pro Glu Arg Ile 450 455 460 Thr Gln Ile Pro Leu Val Lys
Ala His Thr Leu Gln Ser Gly Thr Thr 465 470 475 480 Val Val Arg Gly
Pro Gly Phe Thr Gly Gly Asp Ile Leu Arg Arg Thr 485 490 495 Ser Gly
Gly Pro Phe Ala Tyr Thr Ile Val Asn Ile Asn Gly Gln Leu 500 505 510
Pro Gln Arg Tyr Arg Ala Arg Ile Arg Tyr Ala Ser Thr Thr Asn Leu 515
520 525 Arg Ile Tyr Val Thr Val Ala Gly Glu Arg Ile Phe Ala Gly Gln
Phe 530 535 540 Asn Lys Thr Met Asp Thr Gly Asp Pro Leu Thr Phe Gln
Ser Phe Ser 545 550 555 560 Tyr Ala Thr Ile Asn Thr Ala Phe Thr Phe
Pro Met Ser Gln Ser Ser 565 570 575 Phe Thr Val Gly Ala Asp Thr Phe
Ser Ser Gly Asn Glu Val Tyr Ile 580 585 590 Asp Arg Phe Glu Leu Ile
Pro Val Thr Ala Thr Phe Glu Ala Glu Tyr 595 600 605 Asp Leu Glu Arg
Ala Gln Lys Ala Val Asn Ala Leu Phe Thr Ser Ile 610 615 620 Asn Gln
Ile Gly Ile Lys Thr Asp Val Thr Asp Tyr His Ile Asp Gln 625 630 635
640 Val Ser Asn Leu Val Asp Cys Leu Ser 645 773469DNABacillus
thuringiensismisc_feature(1)..(3469)cry8Aa coding sequence
77atgagtccaa ataatcaaaa tgaatatgaa attatagatg cgacaccttc tacatctgta
60tccagtgatt ctaacagata cccttttgcg aatgagccaa cagatgcgtt acaaaatatg
120aattataaag attatctgaa aatgtctggg ggagagaatc ctgaattatt
tggaaatccg 180gagacgttta ttagttcatc cacgattcaa actggaattg
gcattgttgg tcgaatacta 240ggagctttag gggttccatt tgctagtcag
atagctagtt tctatagttt cattgttggt 300caattatggc cgtcaaagag
cgtagatata tggggagaaa ttatggaacg agtggaagaa 360ctcgttgatc
aaaaaataga aaaatatgta aaagataagg ctcttgctga attaaaaggg
420ctaggaaatg ctttggatgt atatcagcag tcacttgaag attggctgga
aaatcgcaat 480gatgcaagaa ctagaagtgt tgtttctaat caatttatag
ctttagatct taactttgtt 540agttcaattc catcttttgc agtatccgga
cacgaagtac tattattagc agtatatgca 600caggctgtga acctacattt
attgttatta agagatgctt ctatttttgg agaagagtgg 660ggatttacac
caggtgaaat ttctagattt tataatcgtc aagtgcaact taccgctgaa
720tattcagact attgtgtaaa gtggtataaa atcggcttag ataaattgaa
aggtaccact 780tctaaaagtt ggctgaatta tcatcagttc cgtagagaga
tgacattact ggtattagat 840ttggtggcgt tatttccaaa ctatgacaca
catatgtatc caatcgaaac aacagctcaa 900cttacacggg atgtgtatac
agatccgata gcatttaaca tagtgacaag tactggattc 960tgcaaccctt
ggtcaaccca cagtggtatt cttttttatg aagttgaaaa caacgtaatt
1020cgtccgccac acttgtttga tatactcagc tcagtagaaa ttaatacaag
tagagggggt 1080attacgttaa ataatgatgc atatataaac tactggtcag
gacataccct aaaatatcgt 1140agaacagctg attcgaccgt aacatacaca
gctaattacg gtcgaatcac ttcagaaaag 1200aattcatttg cacttgagga
tagggatatt tttgaaatta attcaactgt ggcaaaccta 1260gctaattact
accaaaaggc atatggtgtg ccgggatctt ggttccatat ggtaaaaagg
1320ggaacctcat caacaacagc gtatttatat tcaaaaacac atacagctct
ccaagggtgt 1380acacaggttt atgaatcaag tgatgaaata cctctagata
gaactgtacc ggtagctgaa 1440agctatagtc atagattatc tcatattacc
tcccattctt tctctaaaaa tgggagtgca 1500tactatggga gtttccctgt
atttgtttgg acacatacta gtgcggattt aaataataca 1560atatattcag
ataaaatcac tcaaattcca gcggtaaagg gagacatgtt atatctaggg
1620ggttccgtag tacagggtcc tggatttaca ggaggagata tattaaaaag
aaccaatcct 1680agcatattag ggacctttgc ggttacagta aatgggtcgt
tatcacaaag atatcgtgta 1740agaattcgct atgcctctac aacagatttt
gaatttactc tataccttgg cgacacaata 1800gaaaaaaata gatttaacaa
aactatggat aatggggcat ctttaacgta tgaaacattt 1860aaattcgcaa
gtttcattac tgatttccaa ttcagagaaa cacaagataa aatactccta
1920tccatgggtg attttagctc cggtcaagaa gtttatatag accgaatcga
attcatccca 1980gtagatgaga catatgaggc ggaacaagat ttagaagcgg
cgaagaaagc agtgaatgcc 2040ttgtttacga atacaaaaga tggcttacga
ccaggtgtaa cggattatga agtaaatcaa 2100gcggcaaact tagtggaatg
cctatcggat gatttatatc caaatgaaaa acgattgtta 2160tttgatgcgg
tgagagaggc aaaacgcctc agtggggcac gtaacttact acaagatcca
2220gatttccaag agataaacgg agaaaatgga tgggcggcaa gtacgggaat
tgagattgta 2280gaaggggatg ctgtatttaa aggacgttat ctacgcctac
caggtgcacg agaaattgat 2340acggaaacgt atccaacgta tctgtatcaa
aaagtagagg aaggtgtatt aaaaccatac 2400acaagatata gactgagagg
gtttgtggga agtagtcaag gattagaaat ttatacgata 2460cgtcaccaaa
cgaatcgaat tgtaaagaat gtaccagatg atttattgcc agatgtatct
2520cctgtaaact ctgatggcag tatcaatcga tgcagcgaac aaaagtatgt
gaatagccgt 2580ttagaaggag aaaaccgttc tggtgatgca catgagttct
cgctccctat cgatatagga 2640gagctggatt acaatgaaaa tgcaggaata
tgggttggat ttaagattac ggacccagag 2700ggatacgcaa cacttggaaa
tcttgaatta gtcgaagagg gacctttgtc aggagacgca 2760ttagagcgct
tgcaaagaga agaacaacag tggaagattc aaatgacaag aagacgtgaa
2820gagacagata gaagatacat ggcatcgaaa caagcggtag atcgtttata
tgccgattat 2880caggatcaac aactgaatcc tgatgtagag attacagatc
ttactgcggc tcaagatctg 2940atacagtcca ttccttacgt atataacgaa
atgttcccag aaataccagg gatgaactat 3000acgaagttta cagaattaac
agatcgactc caacaagcgt ggaatttgta tgatcagcga 3060aatgccatac
caaatggtga ttttcgaaat gggttaagta attggaatgc aacgcctggc
3120gtagaagtac aacaaatcaa tcatacatct gtccttgtga ttccaaactg
ggatgaacaa 3180gtttcacaac agtttacagt tcaaccgaat caaagatatg
tattacgagt tactgcaaga 3240aaagaagggg taggaaatgg atatgtaagt
attcgtgatg gtggaaatca atcagaaacg 3300cttactttta gtgcaagcga
ttatgataca aatggtgtgt ataatgacca aaccggctat 3360atcacaaaaa
cagtgacatt catcccgtat acagatcaaa tgtggattga aataagtgaa
3420acagaaggta cgttctatat agaaagtgta gaattgattg tagacgtag
3469781156PRTBacillus thuringiensisMISC_FEATURE(1)..(1156)Cry8Aa
protein 78Met Ser Pro Asn Asn Gln Asn Glu Tyr Glu Ile Ile Asp Ala
Thr Pro 1 5 10 15 Ser Thr Ser Val Ser Ser Asp Ser Asn Arg Tyr Pro
Phe Ala Asn Glu 20 25 30 Pro Thr Asp Ala Leu Gln Asn Met Asn Tyr
Lys Asp Tyr Leu Lys Met 35 40 45 Ser Gly Gly Glu Asn Pro Glu Leu
Phe Gly Asn Pro Glu Thr Phe Ile 50 55 60 Ser Ser Ser Thr Ile Gln
Thr Gly Ile Gly Ile Val Gly Arg Ile Leu 65 70 75 80 Gly Ala Leu Gly
Val Pro Phe Ala Ser Gln Ile Ala Ser Phe Tyr Ser 85 90 95 Phe Ile
Val Gly Gln Leu Trp Pro Ser Lys Ser Val Asp Ile Trp Gly 100 105 110
Glu Ile Met Glu Arg Val Glu Glu Leu Val Asp Gln Lys Ile Glu Lys 115
120 125 Tyr Val Lys Asp Lys Ala Leu Ala Glu Leu Lys Gly Leu Gly Asn
Ala 130 135 140 Leu Asp Val Tyr Gln Gln Ser Leu Glu Asp Trp Leu Glu
Asn Arg Asn 145 150 155 160 Asp Ala Arg Thr Arg Ser Val Val Ser Asn
Gln Phe Ile Ala Leu Asp 165 170 175 Leu Asn Phe Val Ser Ser Ile Pro
Ser Phe Ala Val Ser Gly His Glu 180 185 190 Val Leu Leu Leu Ala Val
Tyr Ala Gln Ala Val Asn Leu His Leu Leu 195 200 205 Leu Leu Arg Asp
Ala Ser Ile Phe Gly Glu Glu Trp Gly Phe Thr Pro 210 215 220 Gly Glu
Ile Ser Arg Phe Tyr Asn Arg Gln Val Gln Leu Thr Ala Glu 225 230 235
240 Tyr Ser Asp Tyr Cys Val Lys Trp Tyr Lys Ile Gly Leu Asp Lys Leu
245 250 255 Lys Gly Thr Thr Ser Lys Ser Trp Leu Asn Tyr His Gln Phe
Arg Arg 260 265 270 Glu Met Thr Leu Leu Val Leu Asp Leu Val Ala Leu
Phe Pro Asn Tyr 275 280 285 Asp Thr His Met Tyr Pro Ile Glu Thr Thr
Ala Gln Leu Thr Arg Asp 290 295 300 Val Tyr Thr Asp Pro Ile Ala Phe
Asn Ile Val Thr Ser Thr Gly Phe 305 310 315 320 Cys Asn Pro Trp Ser
Thr His Ser Gly Ile Leu Phe Tyr Glu Val Glu 325 330 335 Asn Asn Val
Ile Arg Pro Pro His Leu Phe Asp Ile Leu Ser Ser Val 340 345 350 Glu
Ile Asn Thr Ser Arg Gly Gly Ile Thr Leu Asn Asn Asp Ala Tyr 355 360
365 Ile Asn Tyr Trp Ser Gly His Thr Leu Lys Tyr Arg Arg Thr Ala Asp
370 375 380 Ser Thr Val Thr Tyr Thr Ala Asn Tyr Gly Arg Ile Thr Ser
Glu Lys 385 390 395 400 Asn Ser Phe Ala Leu Glu Asp Arg Asp Ile Phe
Glu Ile Asn Ser Thr 405 410 415 Val Ala Asn Leu Ala Asn Tyr Tyr Gln
Lys Ala Tyr Gly Val Pro Gly 420 425 430 Ser Trp Phe His Met Val Lys
Arg Gly Thr Ser Ser Thr Thr Ala Tyr 435 440 445 Leu Tyr Ser Lys Thr
His Thr Ala Leu Gln Gly Cys Thr Gln Val Tyr 450 455 460 Glu Ser Ser
Asp Glu Ile Pro Leu Asp Arg Thr Val Pro Val Ala Glu 465 470 475 480
Ser Tyr Ser His Arg Leu Ser His Ile Thr Ser His Ser Phe Ser Lys 485
490 495 Asn Gly Ser Ala Tyr Tyr Gly Ser Phe Pro Val Phe Val Trp Thr
His 500 505 510 Thr Ser Ala Asp Leu Asn Asn Thr Ile Tyr Ser Asp Lys
Ile Thr Gln 515 520 525 Ile Pro Ala Val Lys Gly Asp Met Leu Tyr Leu
Gly Gly Ser Val Val 530 535 540 Gln Gly Pro Gly Phe Thr Gly Gly Asp
Ile Leu Lys Arg Thr Asn Pro 545 550 555 560 Ser Ile Leu Gly Thr Phe
Ala Val Thr Val Asn Gly Ser Leu Ser Gln 565 570 575 Arg Tyr Arg Val
Arg Ile Arg Tyr Ala Ser Thr Thr Asp Phe Glu Phe 580 585 590 Thr Leu
Tyr Leu Gly Asp Thr Ile Glu Lys Asn Arg Phe
Asn Lys Thr 595 600 605 Met Asp Asn Gly Ala Ser Leu Thr Tyr Glu Thr
Phe Lys Phe Ala Ser 610 615 620 Phe Ile Thr Asp Phe Gln Phe Arg Glu
Thr Gln Asp Lys Ile Leu Leu 625 630 635 640 Ser Met Gly Asp Phe Ser
Ser Gly Gln Glu Val Tyr Ile Asp Arg Ile 645 650 655 Glu Phe Ile Pro
Val Asp Glu Thr Tyr Glu Ala Glu Gln Asp Leu Glu 660 665 670 Ala Ala
Lys Lys Ala Val Asn Ala Leu Phe Thr Asn Thr Lys Asp Gly 675 680 685
Leu Arg Pro Gly Val Thr Asp Tyr Glu Val Asn Gln Ala Ala Asn Leu 690
695 700 Val Glu Cys Leu Ser Asp Asp Leu Tyr Pro Asn Glu Lys Arg Leu
Leu 705 710 715 720 Phe Asp Ala Val Arg Glu Ala Lys Arg Leu Ser Gly
Ala Arg Asn Leu 725 730 735 Leu Gln Asp Pro Asp Phe Gln Glu Ile Asn
Gly Glu Asn Gly Trp Ala 740 745 750 Ala Ser Thr Gly Ile Glu Ile Val
Glu Gly Asp Ala Val Phe Lys Gly 755 760 765 Arg Tyr Leu Arg Leu Pro
Gly Ala Arg Glu Ile Asp Thr Glu Thr Tyr 770 775 780 Pro Thr Tyr Leu
Tyr Gln Lys Val Glu Glu Gly Val Leu Lys Pro Tyr 785 790 795 800 Thr
Arg Tyr Arg Leu Arg Gly Phe Val Gly Ser Ser Gln Gly Leu Glu 805 810
815 Ile Tyr Thr Ile Arg His Gln Thr Asn Arg Ile Val Lys Asn Val Pro
820 825 830 Asp Asp Leu Leu Pro Asp Val Ser Pro Val Asn Ser Asp Gly
Ser Ile 835 840 845 Asn Arg Cys Ser Glu Gln Lys Tyr Val Asn Ser Arg
Leu Glu Gly Glu 850 855 860 Asn Arg Ser Gly Asp Ala His Glu Phe Ser
Leu Pro Ile Asp Ile Gly 865 870 875 880 Glu Leu Asp Tyr Asn Glu Asn
Ala Gly Ile Trp Val Gly Phe Lys Ile 885 890 895 Thr Asp Pro Glu Gly
Tyr Ala Thr Leu Gly Asn Leu Glu Leu Val Glu 900 905 910 Glu Gly Pro
Leu Ser Gly Asp Ala Leu Glu Arg Leu Gln Arg Glu Glu 915 920 925 Gln
Gln Trp Lys Ile Gln Met Thr Arg Arg Arg Glu Glu Thr Asp Arg 930 935
940 Arg Tyr Met Ala Ser Lys Gln Ala Val Asp Arg Leu Tyr Ala Asp Tyr
945 950 955 960 Gln Asp Gln Gln Leu Asn Pro Asp Val Glu Ile Thr Asp
Leu Thr Ala 965 970 975 Ala Gln Asp Leu Ile Gln Ser Ile Pro Tyr Val
Tyr Asn Glu Met Phe 980 985 990 Pro Glu Ile Pro Gly Met Asn Tyr Thr
Lys Phe Thr Glu Leu Thr Asp 995 1000 1005 Arg Leu Gln Gln Ala Trp
Asn Leu Tyr Asp Gln Arg Asn Ala Ile 1010 1015 1020 Pro Asn Gly Asp
Phe Arg Asn Gly Leu Ser Asn Trp Asn Ala Thr 1025 1030 1035 Pro Gly
Val Glu Val Gln Gln Ile Asn His Thr Ser Val Leu Val 1040 1045 1050
Ile Pro Asn Trp Asp Glu Gln Val Ser Gln Gln Phe Thr Val Gln 1055
1060 1065 Pro Asn Gln Arg Tyr Val Leu Arg Val Thr Ala Arg Lys Glu
Gly 1070 1075 1080 Val Gly Asn Gly Tyr Val Ser Ile Arg Asp Gly Gly
Asn Gln Ser 1085 1090 1095 Glu Thr Leu Thr Phe Ser Ala Ser Asp Tyr
Asp Thr Asn Gly Val 1100 1105 1110 Tyr Asn Asp Gln Thr Gly Tyr Ile
Thr Lys Thr Val Thr Phe Ile 1115 1120 1125 Pro Tyr Thr Asp Gln Met
Trp Ile Glu Ile Ser Glu Thr Glu Gly 1130 1135 1140 Thr Phe Tyr Ile
Glu Ser Val Glu Leu Ile Val Asp Val 1145 1150 1155
793537DNABacillus thuringiensismisc_feature(1)..(3537)cry1Ac coding
sequence 79atggataaca atccgaacat caatgaatgc attccttata attgtttaag
taaccctgaa 60gtagaagtat taggtggaga aagaatagaa actggttaca ccccaatcga
tatttccttg 120tcgctaacgc aatttctttt gagtgaattt gttcccggtg
ctggatttgt gttaggacta 180gttgatataa tatggggaat ttttggtccc
tctcaatggg acgcatttct tgtacaaatt 240gaacagttaa ttaaccaaag
aatagaagaa ttcgctagga accaagccat ttctagatta 300gaaggactaa
gcaatcttta tcaaatttac gcagaatctt ttagagagtg ggaagcagat
360cctactaatc cagcattaag agaagagatg cgtattcaat tcaatgacat
gaacagtgcc 420cttacaaccg ctattcctct ttttgcagtt caaaattatc
aagttcctct tttatcagta 480tatgttcaag ctgcaaattt acatttatca
gttttgagag atgtttcagt gtttggacaa 540aggtggggat ttgatgccgc
gactatcaat agtcgttata atgatttaac taggcttatt 600ggcaactata
cagattatgc tgtacgctgg tacaatacgg gattagaacg tgtatgggga
660ccggattcta gagattgggt aaggtataat caatttagaa gagaattaac
actaactgta 720ttagatatcg ttgctctgtt cccgaattat gatagtagaa
gatatccaat tcgaacagtt 780tcccaattaa caagagaaat ttatacaaac
ccagtattag aaaattttga tggtagtttt 840cgaggctcgg ctcagggcat
agaaagaagt attaggagtc cacatttgat ggatatactt 900aacagtataa
ccatctatac ggatgctcat aggggttatt attattggtc agggcatcaa
960ataatggctt ctcctgtagg gttttcgggg ccagaattca cttttccgct
atatggaact 1020atgggaaatg cagctccaca acaacgtatt gttgctcaac
taggtcaggg cgtgtataga 1080acattatcgt ccactttata tagaagacct
tttaatatag ggataaataa tcaacaacta 1140tctgttcttg acgggacaga
atttgcttat ggaacctcct caaatttgcc atccgctgta 1200tacagaaaaa
gcggaacggt agattcgctg gatgaaatac cgccacagaa taacaacgtg
1260ccacctaggc aaggatttag tcatcgatta agccatgttt caatgtttcg
ttcaggcttt 1320agtaatagta gtgtaagtat aataagagct cctatgttct
cttggataca tcgtagtgct 1380gaatttaata atataattgc atcggatagt
attactcaaa tccctgcagt gaagggaaac 1440tttcttttta atggttctgt
aatttcagga ccaggattta ctggtgggga cttagttaga 1500ttaaatagta
gtggaaataa cattcagaat agagggtata ttgaagttcc aattcacttc
1560ccatcgacat ctaccagata tcgagttcgt gtacggtatg cttctgtaac
cccgattcac 1620ctcaacgtta attggggtaa ttcatccatt ttttccaata
cagtaccagc tacagctacg 1680tcattagata atctacaatc aagtgatttt
ggttattttg aaagtgccaa tgcttttaca 1740tcttcattag gtaatatagt
aggtgttaga aattttagtg ggactgcagg agtgataata 1800gacagatttg
aatttattcc agttactgca acactcgagg ctgaatataa tctggaaaga
1860gcgcagaagg cggtgaatgc gctgtttacg tctacaaacc aactagggct
aaaaacaaat 1920gtaacggatt atcatattga tcaagtgtcc aatttagtta
cgtatttatc ggatgaattt 1980tgtctggatg aaaagcgaga attgtccgag
aaagtcaaac atgcgaagcg actcagtgat 2040gaacgcaatt tactccaaga
ttcaaatttc aaagacatta ataggcaacc agaacgtggg 2100tggggcggaa
gtacagggat taccatccaa ggaggggatg acgtatttaa agaaaattac
2160gtcacactat caggtacctt tgatgagtgc tatccaacat atttgtatca
aaaaatcgat 2220gaatcaaaat taaaagcctt tacccgttat caattaagag
ggtatatcga agatagtcaa 2280gacttagaaa tctatttaat tcgctacaat
gcaaaacatg aaacagtaaa tgtgccaggt 2340acgggttcct tatggccgct
ttcagcccaa agtccaatcg gaaagtgtgg agagccgaat 2400cgatgcgcgc
cacaccttga atggaatcct gacttagatt gttcgtgtag ggatggagaa
2460aagtgtgccc atcattcgca tcatttctcc ttagacattg atgtaggatg
tacagactta 2520aatgaggacc taggtgtatg ggtgatcttt aagattaaga
cgcaagatgg gcacgcaaga 2580ctagggaatc tagagtttct cgaagagaaa
ccattagtag gagaagcgct agctcgtgtg 2640aaaagagcgg agaaaaaatg
gagagacaaa cgtgaaaaat tggaatggga aacaaatatc 2700gtttataaag
aggcaaaaga atctgtagat gctttatttg taaactctca atatgatcaa
2760ttacaagcgg atacgaatat tgccatgatt catgcggcag ataaacgtgt
tcatagcatt 2820cgagaagctt atctgcctga gctgtctgtg attccgggtg
tcaatgcggc tatttttgaa 2880gaattagaag ggcgtatttt cactgcattc
tccctatatg atgcgagaaa tgtcattaaa 2940aatggtgatt ttaataatgg
cttatcctgc tggaacgtga aagggcatgt agatgtagaa 3000gaacaaaaca
accaacgttc ggtccttgtt gttccggaat gggaagcaga agtgtcacaa
3060gaagttcgtg tctgtccggg tcgtggctat atccttcgtg tcacagcgta
caaggaggga 3120tatggagaag gttgcgtaac cattcatgag atcgagaaca
atacagacga actgaagttt 3180agcaactgcg tagaagagga aatctatcca
aataacacgg taacgtgtaa tgattatact 3240gtaaatcaag aagaatacgg
aggtgcgtac acttctcgta atcgaggata taacgaagct 3300ccttccgtac
cagctgatta tgcgtcagtc tatgaagaaa aatcgtatac agatggacga
3360agagagaatc cttgtgaatt taacagaggg tatagggatt acacgccact
accagttggt 3420tatgtgacaa aagaattaga atacttccca gaaaccgata
aggtatggat tgagattgga 3480gaaacggaag gaacatttat cgtggacagc
gtggaattac tccttatgga ggaatag 3537801178PRTBacillus
thuringiensisMISC_FEATURE(1)..(1178)Cry1Ac protein 80Met Asp Asn
Asn Pro Asn Ile Asn Glu Cys Ile Pro Tyr Asn Cys Leu 1 5 10 15 Ser
Asn Pro Glu Val Glu Val Leu Gly Gly Glu Arg Ile Glu Thr Gly 20 25
30 Tyr Thr Pro Ile Asp Ile Ser Leu Ser Leu Thr Gln Phe Leu Leu Ser
35 40 45 Glu Phe Val Pro Gly Ala Gly Phe Val Leu Gly Leu Val Asp
Ile Ile 50 55 60 Trp Gly Ile Phe Gly Pro Ser Gln Trp Asp Ala Phe
Leu Val Gln Ile 65 70 75 80 Glu Gln Leu Ile Asn Gln Arg Ile Glu Glu
Phe Ala Arg Asn Gln Ala 85 90 95 Ile Ser Arg Leu Glu Gly Leu Ser
Asn Leu Tyr Gln Ile Tyr Ala Glu 100 105 110 Ser Phe Arg Glu Trp Glu
Ala Asp Pro Thr Asn Pro Ala Leu Arg Glu 115 120 125 Glu Met Arg Ile
Gln Phe Asn Asp Met Asn Ser Ala Leu Thr Thr Ala 130 135 140 Ile Pro
Leu Phe Ala Val Gln Asn Tyr Gln Val Pro Leu Leu Ser Val 145 150 155
160 Tyr Val Gln Ala Ala Asn Leu His Leu Ser Val Leu Arg Asp Val Ser
165 170 175 Val Phe Gly Gln Arg Trp Gly Phe Asp Ala Ala Thr Ile Asn
Ser Arg 180 185 190 Tyr Asn Asp Leu Thr Arg Leu Ile Gly Asn Tyr Thr
Asp Tyr Ala Val 195 200 205 Arg Trp Tyr Asn Thr Gly Leu Glu Arg Val
Trp Gly Pro Asp Ser Arg 210 215 220 Asp Trp Val Arg Tyr Asn Gln Phe
Arg Arg Glu Leu Thr Leu Thr Val 225 230 235 240 Leu Asp Ile Val Ala
Leu Phe Pro Asn Tyr Asp Ser Arg Arg Tyr Pro 245 250 255 Ile Arg Thr
Val Ser Gln Leu Thr Arg Glu Ile Tyr Thr Asn Pro Val 260 265 270 Leu
Glu Asn Phe Asp Gly Ser Phe Arg Gly Ser Ala Gln Gly Ile Glu 275 280
285 Arg Ser Ile Arg Ser Pro His Leu Met Asp Ile Leu Asn Ser Ile Thr
290 295 300 Ile Tyr Thr Asp Ala His Arg Gly Tyr Tyr Tyr Trp Ser Gly
His Gln 305 310 315 320 Ile Met Ala Ser Pro Val Gly Phe Ser Gly Pro
Glu Phe Thr Phe Pro 325 330 335 Leu Tyr Gly Thr Met Gly Asn Ala Ala
Pro Gln Gln Arg Ile Val Ala 340 345 350 Gln Leu Gly Gln Gly Val Tyr
Arg Thr Leu Ser Ser Thr Leu Tyr Arg 355 360 365 Arg Pro Phe Asn Ile
Gly Ile Asn Asn Gln Gln Leu Ser Val Leu Asp 370 375 380 Gly Thr Glu
Phe Ala Tyr Gly Thr Ser Ser Asn Leu Pro Ser Ala Val 385 390 395 400
Tyr Arg Lys Ser Gly Thr Val Asp Ser Leu Asp Glu Ile Pro Pro Gln 405
410 415 Asn Asn Asn Val Pro Pro Arg Gln Gly Phe Ser His Arg Leu Ser
His 420 425 430 Val Ser Met Phe Arg Ser Gly Phe Ser Asn Ser Ser Val
Ser Ile Ile 435 440 445 Arg Ala Pro Met Phe Ser Trp Ile His Arg Ser
Ala Glu Phe Asn Asn 450 455 460 Ile Ile Ala Ser Asp Ser Ile Thr Gln
Ile Pro Ala Val Lys Gly Asn 465 470 475 480 Phe Leu Phe Asn Gly Ser
Val Ile Ser Gly Pro Gly Phe Thr Gly Gly 485 490 495 Asp Leu Val Arg
Leu Asn Ser Ser Gly Asn Asn Ile Gln Asn Arg Gly 500 505 510 Tyr Ile
Glu Val Pro Ile His Phe Pro Ser Thr Ser Thr Arg Tyr Arg 515 520 525
Val Arg Val Arg Tyr Ala Ser Val Thr Pro Ile His Leu Asn Val Asn 530
535 540 Trp Gly Asn Ser Ser Ile Phe Ser Asn Thr Val Pro Ala Thr Ala
Thr 545 550 555 560 Ser Leu Asp Asn Leu Gln Ser Ser Asp Phe Gly Tyr
Phe Glu Ser Ala 565 570 575 Asn Ala Phe Thr Ser Ser Leu Gly Asn Ile
Val Gly Val Arg Asn Phe 580 585 590 Ser Gly Thr Ala Gly Val Ile Ile
Asp Arg Phe Glu Phe Ile Pro Val 595 600 605 Thr Ala Thr Leu Glu Ala
Glu Tyr Asn Leu Glu Arg Ala Gln Lys Ala 610 615 620 Val Asn Ala Leu
Phe Thr Ser Thr Asn Gln Leu Gly Leu Lys Thr Asn 625 630 635 640 Val
Thr Asp Tyr His Ile Asp Gln Val Ser Asn Leu Val Thr Tyr Leu 645 650
655 Ser Asp Glu Phe Cys Leu Asp Glu Lys Arg Glu Leu Ser Glu Lys Val
660 665 670 Lys His Ala Lys Arg Leu Ser Asp Glu Arg Asn Leu Leu Gln
Asp Ser 675 680 685 Asn Phe Lys Asp Ile Asn Arg Gln Pro Glu Arg Gly
Trp Gly Gly Ser 690 695 700 Thr Gly Ile Thr Ile Gln Gly Gly Asp Asp
Val Phe Lys Glu Asn Tyr 705 710 715 720 Val Thr Leu Ser Gly Thr Phe
Asp Glu Cys Tyr Pro Thr Tyr Leu Tyr 725 730 735 Gln Lys Ile Asp Glu
Ser Lys Leu Lys Ala Phe Thr Arg Tyr Gln Leu 740 745 750 Arg Gly Tyr
Ile Glu Asp Ser Gln Asp Leu Glu Ile Tyr Leu Ile Arg 755 760 765 Tyr
Asn Ala Lys His Glu Thr Val Asn Val Pro Gly Thr Gly Ser Leu 770 775
780 Trp Pro Leu Ser Ala Gln Ser Pro Ile Gly Lys Cys Gly Glu Pro Asn
785 790 795 800 Arg Cys Ala Pro His Leu Glu Trp Asn Pro Asp Leu Asp
Cys Ser Cys 805 810 815 Arg Asp Gly Glu Lys Cys Ala His His Ser His
His Phe Ser Leu Asp 820 825 830 Ile Asp Val Gly Cys Thr Asp Leu Asn
Glu Asp Leu Gly Val Trp Val 835 840 845 Ile Phe Lys Ile Lys Thr Gln
Asp Gly His Ala Arg Leu Gly Asn Leu 850 855 860 Glu Phe Leu Glu Glu
Lys Pro Leu Val Gly Glu Ala Leu Ala Arg Val 865 870 875 880 Lys Arg
Ala Glu Lys Lys Trp Arg Asp Lys Arg Glu Lys Leu Glu Trp 885 890 895
Glu Thr Asn Ile Val Tyr Lys Glu Ala Lys Glu Ser Val Asp Ala Leu 900
905 910 Phe Val Asn Ser Gln Tyr Asp Gln Leu Gln Ala Asp Thr Asn Ile
Ala 915 920 925 Met Ile His Ala Ala Asp Lys Arg Val His Ser Ile Arg
Glu Ala Tyr 930 935 940 Leu Pro Glu Leu Ser Val Ile Pro Gly Val Asn
Ala Ala Ile Phe Glu 945 950 955 960 Glu Leu Glu Gly Arg Ile Phe Thr
Ala Phe Ser Leu Tyr Asp Ala Arg 965 970 975 Asn Val Ile Lys Asn Gly
Asp Phe Asn Asn Gly Leu Ser Cys Trp Asn 980 985 990 Val Lys Gly His
Val Asp Val Glu Glu Gln Asn Asn Gln Arg Ser Val 995 1000 1005 Leu
Val Val Pro Glu Trp Glu Ala Glu Val Ser Gln Glu Val Arg 1010 1015
1020 Val Cys Pro Gly Arg Gly Tyr Ile Leu Arg Val Thr Ala Tyr Lys
1025 1030 1035 Glu Gly Tyr Gly Glu Gly Cys Val Thr Ile His Glu Ile
Glu Asn 1040 1045 1050 Asn Thr Asp Glu Leu Lys Phe Ser Asn Cys Val
Glu Glu Glu Ile 1055 1060 1065 Tyr Pro Asn Asn Thr Val Thr Cys Asn
Asp Tyr Thr Val Asn Gln 1070 1075 1080 Glu Glu Tyr Gly Gly Ala Tyr
Thr Ser Arg Asn Arg Gly Tyr Asn 1085 1090 1095 Glu Ala Pro Ser Val
Pro Ala Asp Tyr Ala Ser Val Tyr Glu Glu 1100 1105 1110 Lys Ser Tyr
Thr Asp Gly Arg Arg Glu Asn Pro Cys Glu Phe Asn 1115 1120 1125 Arg
Gly Tyr Arg Asp Tyr Thr Pro Leu Pro Val Gly Tyr Val Thr 1130 1135
1140 Lys Glu Leu Glu Tyr Phe Pro Glu Thr Asp Lys Val Trp Ile Glu
1145 1150 1155 Ile Gly Glu Thr Glu Gly Thr Phe Ile Val Asp Ser Val
Glu Leu 1160
1165 1170 Leu Leu Met Glu Glu 1175 812160DNABacillus
thuringiensismisc_feature(1)..(2160)cry1Ia coding sequence
81atgaaactaa agaatcaaga taagcatcaa agtttttcta gcaatgcgaa agtagataaa
60atctctacgg attcactaaa aaatgaaaca gatatagaat tacaaaacat taatcatgaa
120gattgtttga aaatgtctga gtatgaaaat gtagagccgt ttgttagtgc
atcaacaatt 180caaacaggta ttggtattgc gggtaaaata cttggtaccc
taggcgttcc ttttgcagga 240caagtagcta gtctttatag ttttatctta
ggtgagctat ggcctaaggg gaaaaatcaa 300tgggaaatct ttatggaaca
tgtagaagag attattaatc aaaaaatatc aacttatgca 360agaaataaag
cacttacaga cttgaaagga ttaggagatg ccttagctgt ctaccatgat
420tcgcttgaaa gttgggttgg aaatcgtaat aacacaaggg ctaggagtgt
tgtcaagagc 480caatatatcg cattagaatt gatgttcgtt cagaaactac
cttcttttgc agtgtctgga 540gaggaggtac cattattacc gatatatgcc
caagctgcaa atttacattt gttgctatta 600agagatgcat ctatttttgg
aaaagagtgg ggattatcat cttcagaaat ttcaacattt 660tataaccgtc
aagtcgaacg agcaggagat tattcctacc attgtgtgaa atggtatagc
720acaggtctaa ataacttgag gggtacaaat gccgaaagtt gggtacgata
taatcaattc 780cgtagagaca tgactttaat ggtactagat ttagtggcac
tatttccaag ctatgataca 840caaatgtatc caattaaaac tacagcccaa
cttacaagag aagtatatac agacgcaatt 900gggacagtac atccgcatcc
aagttttaca agtacgactt ggtataataa taatgcacct 960tcgttctctg
ccatagaggc tgctgttgtt cgaaacccgc atctactcga ttttctagaa
1020caagttacaa tttacagctt attaagtcga tggagtaaca ctcagtatat
gaatatgtgg 1080ggaggacata aactagaatt ccgaacaata ggaggaacgt
taaatatctc aacacaagga 1140tctactaata cttctattaa tcctgtaaca
ttaccgttca cttctcgaga cgtctatagg 1200actgaatcat tggcagggct
gaatctattt ttaactcaac ctgttaatgg agtacctagg 1260gttgattttc
attggaaatt cgtcacacat ccgatcgcat ctgataattt ctattatcca
1320gggtatgctg gaattgggac gcaattacag gattcagaaa atgaattacc
acctgaagca 1380acaggacagc caaattatga atcttatagt catagattat
ctcatatagg actcatttca 1440gcatcacatg tgaaagcatt ggtatattct
tggacgcatc gtagtgcaga tcgtacaaat 1500acaattgagc caaatagcat
tacacaaata ccattagtaa aagctttcaa tctgtcttca 1560ggtgccgctg
tagtgagagg accaggattt acaggtgggg atatccttcg aagaacgaat
1620actggtacat ttggggatat acgagtaaat attaatccac catttgcaca
aagatatcgc 1680gtgaggattc gctatgcttc taccacagat ttacaattcc
atacgtcaat taacggtaaa 1740gctattaatc aaggtaattt ttcagcaact
atgaatagag gagaggactt agactataaa 1800acctttagaa ctgtaggctt
taccactcca tttagctttt tagatgtaca aagtacattc 1860acaataggtg
cttggaactt ctcttcaggt aacgaagttt atatagatag aattgaattt
1920gttccggtag aagtaacata tgaggcagaa tatgattttg aaaaagcgca
agagaaggtt 1980actgcactgt ttacatctac gaatccaaga ggattaaaaa
cagatgtaaa ggattatcat 2040attgaccagg tatcaaattt agtagagtct
ctatcagatg aattctatct tgatgaaaag 2100agagaattat tcgagatagt
taaatacgcg aagcaactcc atattgagcg taacatgtag 216082719PRTBacillus
thuringiensisMISC_FEATURE(1)..(719)Cry1Ia protein 82Met Lys Leu Lys
Asn Gln Asp Lys His Gln Ser Phe Ser Ser Asn Ala 1 5 10 15 Lys Val
Asp Lys Ile Ser Thr Asp Ser Leu Lys Asn Glu Thr Asp Ile 20 25 30
Glu Leu Gln Asn Ile Asn His Glu Asp Cys Leu Lys Met Ser Glu Tyr 35
40 45 Glu Asn Val Glu Pro Phe Val Ser Ala Ser Thr Ile Gln Thr Gly
Ile 50 55 60 Gly Ile Ala Gly Lys Ile Leu Gly Thr Leu Gly Val Pro
Phe Ala Gly 65 70 75 80 Gln Val Ala Ser Leu Tyr Ser Phe Ile Leu Gly
Glu Leu Trp Pro Lys 85 90 95 Gly Lys Asn Gln Trp Glu Ile Phe Met
Glu His Val Glu Glu Ile Ile 100 105 110 Asn Gln Lys Ile Ser Thr Tyr
Ala Arg Asn Lys Ala Leu Thr Asp Leu 115 120 125 Lys Gly Leu Gly Asp
Ala Leu Ala Val Tyr His Asp Ser Leu Glu Ser 130 135 140 Trp Val Gly
Asn Arg Asn Asn Thr Arg Ala Arg Ser Val Val Lys Ser 145 150 155 160
Gln Tyr Ile Ala Leu Glu Leu Met Phe Val Gln Lys Leu Pro Ser Phe 165
170 175 Ala Val Ser Gly Glu Glu Val Pro Leu Leu Pro Ile Tyr Ala Gln
Ala 180 185 190 Ala Asn Leu His Leu Leu Leu Leu Arg Asp Ala Ser Ile
Phe Gly Lys 195 200 205 Glu Trp Gly Leu Ser Ser Ser Glu Ile Ser Thr
Phe Tyr Asn Arg Gln 210 215 220 Val Glu Arg Ala Gly Asp Tyr Ser Tyr
His Cys Val Lys Trp Tyr Ser 225 230 235 240 Thr Gly Leu Asn Asn Leu
Arg Gly Thr Asn Ala Glu Ser Trp Val Arg 245 250 255 Tyr Asn Gln Phe
Arg Arg Asp Met Thr Leu Met Val Leu Asp Leu Val 260 265 270 Ala Leu
Phe Pro Ser Tyr Asp Thr Gln Met Tyr Pro Ile Lys Thr Thr 275 280 285
Ala Gln Leu Thr Arg Glu Val Tyr Thr Asp Ala Ile Gly Thr Val His 290
295 300 Pro His Pro Ser Phe Thr Ser Thr Thr Trp Tyr Asn Asn Asn Ala
Pro 305 310 315 320 Ser Phe Ser Ala Ile Glu Ala Ala Val Val Arg Asn
Pro His Leu Leu 325 330 335 Asp Phe Leu Glu Gln Val Thr Ile Tyr Ser
Leu Leu Ser Arg Trp Ser 340 345 350 Asn Thr Gln Tyr Met Asn Met Trp
Gly Gly His Lys Leu Glu Phe Arg 355 360 365 Thr Ile Gly Gly Thr Leu
Asn Ile Ser Thr Gln Gly Ser Thr Asn Thr 370 375 380 Ser Ile Asn Pro
Val Thr Leu Pro Phe Thr Ser Arg Asp Val Tyr Arg 385 390 395 400 Thr
Glu Ser Leu Ala Gly Leu Asn Leu Phe Leu Thr Gln Pro Val Asn 405 410
415 Gly Val Pro Arg Val Asp Phe His Trp Lys Phe Val Thr His Pro Ile
420 425 430 Ala Ser Asp Asn Phe Tyr Tyr Pro Gly Tyr Ala Gly Ile Gly
Thr Gln 435 440 445 Leu Gln Asp Ser Glu Asn Glu Leu Pro Pro Glu Ala
Thr Gly Gln Pro 450 455 460 Asn Tyr Glu Ser Tyr Ser His Arg Leu Ser
His Ile Gly Leu Ile Ser 465 470 475 480 Ala Ser His Val Lys Ala Leu
Val Tyr Ser Trp Thr His Arg Ser Ala 485 490 495 Asp Arg Thr Asn Thr
Ile Glu Pro Asn Ser Ile Thr Gln Ile Pro Leu 500 505 510 Val Lys Ala
Phe Asn Leu Ser Ser Gly Ala Ala Val Val Arg Gly Pro 515 520 525 Gly
Phe Thr Gly Gly Asp Ile Leu Arg Arg Thr Asn Thr Gly Thr Phe 530 535
540 Gly Asp Ile Arg Val Asn Ile Asn Pro Pro Phe Ala Gln Arg Tyr Arg
545 550 555 560 Val Arg Ile Arg Tyr Ala Ser Thr Thr Asp Leu Gln Phe
His Thr Ser 565 570 575 Ile Asn Gly Lys Ala Ile Asn Gln Gly Asn Phe
Ser Ala Thr Met Asn 580 585 590 Arg Gly Glu Asp Leu Asp Tyr Lys Thr
Phe Arg Thr Val Gly Phe Thr 595 600 605 Thr Pro Phe Ser Phe Leu Asp
Val Gln Ser Thr Phe Thr Ile Gly Ala 610 615 620 Trp Asn Phe Ser Ser
Gly Asn Glu Val Tyr Ile Asp Arg Ile Glu Phe 625 630 635 640 Val Pro
Val Glu Val Thr Tyr Glu Ala Glu Tyr Asp Phe Glu Lys Ala 645 650 655
Gln Glu Lys Val Thr Ala Leu Phe Thr Ser Thr Asn Pro Arg Gly Leu 660
665 670 Lys Thr Asp Val Lys Asp Tyr His Ile Asp Gln Val Ser Asn Leu
Val 675 680 685 Glu Ser Leu Ser Asp Glu Phe Tyr Leu Asp Glu Lys Arg
Glu Leu Phe 690 695 700 Glu Ile Val Lys Tyr Ala Lys Gln Leu His Ile
Glu Arg Asn Met 705 710 715 8334DNAArtificial Sequence53A-1-bam
83ccggatccat gacggccgac aacaacaccg aggc 348420DNAArtificial
SequenceC3-3a-6 primer 84caggggcagc tgggtgatct 208520DNAArtificial
SequenceC3-1Ab-3 primer 85agatcaccca gatccccctg 208639DNAArtificial
Sequence1Ab-6-sac primer 86ccgagctcag ctcctacacc tgatcgatgt
ggtagtcgg 398756DNAArtificial Sequence8a-atg-delri primer
87ccggatccac catgactagt aacggccgcc agtgtgctgg tattcgccct tatgac
568820DNAArtificial SequenceC2-3A-4 primer 88gtccagcacg gtcagggtca
208920DNAArtificial Sequencereverse primer 89gcgtgcagtc aagtcagatc
209038DNAArtificial SequenceFR8a-OL-1 primer 90ggtgttgttg
tcggccgtca tagggcgaat accagcac 389139DNAArtificial
SequenceFR8a-OL-2 primer 91gccgacaaca acaccgaggc cctggacagc
agcaccacc 399227DNAArtificial SequenceC1-3a-2 primer 92caggtgggtg
ttggcggcct gggcgta 279321DNAArtificial Sequence5'FR8a primer
93ggatccacca tgactagtaa c 219436DNAArtificial Sequence5'fr8a-12aa
primer 94ccggatccac catgtatgac ggccgacaac aacacc
369520DNAArtificial SequenceC2-3A-3 primer 95tgaccctgac cgtgctggac
209627DNAArtificial Sequence3'1Ab-dm3 primer 96gagctcctag
gtcacctcgg cgggcac 279732DNAArtificial Sequence5'FR-del6 primer
97ggatccacca tgtgtgctgg tattcgccct at 329832DNAArtificial
Sequence5'1Ab-bam primer 98ccggatccat ggacaacaac cccaacatca ac
329920DNAArtificial SequenceC3-3a-7 primer 99gcttcaccgg cggcgacatc
2010020DNAArtificial SequenceC3-3a-8 primer 100gatgtcgccg
ccggtgaagc 2010123DNAArtificial SequenceC4-3a-9 primer
101ccgcatccac tacgccagca cca 2310223DNAArtificial SequenceC4-3a-10
primer 102tggtgctggc gtagtggatg cgg 2310344DNAArtificial
Sequence3a-12-sac primer 103ccgagctcag ctcagatcta gttcacgggg
atgaactcga tctt 4410427DNAArtificial Sequence3a-22 primer
104ggccttcacc aggggcagct gggtgat 2710531DNAArtificial Sequence1B-5
primer 105ccgccgcgac ctgaccctgg gcgtgctgga c 3110627DNAArtificial
Sequence1B-7 primer 106atcacccaga tccccatggt gaaggcc
2710726DNAArtificial Sequence1B-10 primer 107ccgagctcct agaacagggc
gttcac 2610820DNAArtificial SequenceC3-1Ab-2 primer 108cagggggatc
tgggtgatct 2010920DNAArtificial SequenceC3-3A-5 primer
109agatcaccca gctgcccctg 2011027DNAArtificial SequenceC1-1Ab-1
primer 110tacgtgcagg ccgccaacct gcacctg 2711142DNAArtificial
Sequence5'8Aa-dm3 primer 111agatcaccca gctgcccctg gtaaagggag
acatgttata tc 4211230DNAArtificial Sequence3'8Aa-dm3 primer
112gagctcctat gtctcatcta ctgggatgaa 3011333DNAArtificial
SequenceTant-OL-1 primer 113acccagctgc ccctggtgaa ggcccacacc ctc
3311433DNAArtificial SequenceTant-OL-2 primer 114gagggtgtgg
gccttcacca ggggcagctg ggt 3311529DNAArtificial SequenceTant-3'sac
primer 115gagctctagc ttaagcagtc cacgaggtt 2911637DNAArtificial
Sequence1Ac-OL-1 primer 116acccagctgc ccctggtgaa gggaaacttt cttttta
3711737DNAArtificial Sequence1Ac-OL-2 primer 117taaaaagaaa
gtttcccttc accaggggca gctgggt 3711830DNAArtificial Sequnce1Ac-3'sac
primer 118gagctcctat gttgcagtaa ctggaataaa 3011938DNAArtificial
Sequence1Ia-OL-1 primer 119acccagctgc ccctgagtaa aagctttcaa
tctgtctt 3812038DNAArtificial Sequence1Ia-OL-2 primer 120aagacagatt
gaaagctttt actcaggggc agctgggt 3812131DNAArtificial
Sequence1Ia-3'sac primer 121gagctcctac atgttacgct caatatggag t
3112233DNAArtificial SequenceFR-1Ab-1 primer 122tggacccaca
agagcgccga gttcaacaac atc 3312333DNAArtificial SequenceFR-1Ab-2
primer 123gatgttgttg aactcggcgc tcttgtgggt cca 3312440DNAArtificial
SequenceFR-1Ab-3 primer 124ccacaagagc gtcgacttca acacatcatc
cccagcagcc 4012541DNAArtificial SequenceFR-1Ab-4 primer
125ggctcgtggg gatgatgttg ttgaagtcga cgctcttgtg g
4112635PRTArtificial SequencePeptidyl fragment 1 126Met Ala Ser Met
Thr Gly Gly Gln Gln Met Gly Arg Gly Ser Thr Ser 1 5 10 15 Asn Gly
Arg Gln Cys Ala Gly Ile Arg Pro Tyr Asp Gly Arg Gln Gln 20 25 30
His Arg Gly 35 12722PRTArtificial SequencePeptidyl Fragment 2
127Met Thr Ser Asn Gly Arg Gln Cys Ala Gly Ile Arg Pro Tyr Asp Gly
1 5 10 15 Arg Gln Gln His Arg Gly 20 12810PRTArtificial
SequencePeptidyl Fragment 3 128Met Tyr Asp Gly Arg Gln Gln His Arg
Gly 1 5 10 12913PRTArtificial SequencePeptidyl Fragment 4 129Met
Thr Ser Asn Gly Arg Gln Cys Ala Gly Ile Arg Pro 1 5 10
1307PRTArtificial SequencePeptidyl fragment 5 130Met Cys Ala Gly
Ile Arg Pro 1 5 13155PRTArtificial SequencePeptidyl fragment 6
131Met Lys Glu Thr Ala Ala Ala Lys Phe Glu Arg Gln His Met Asp Ser
1 5 10 15 Pro Asp Leu Gly Thr Leu Val Pro Arg Gly Ser Met Ala Asp
Ile Gly 20 25 30 Ser Thr Thr Ser Asn Gly Arg Gln Cys Ala Gly Ile
Arg Pro Tyr Asp 35 40 45 Gly Arg Gln Gln His Arg Gly 50 55
13214PRTArtificial SequenceChemically synthesized peptidyl fragment
7 132Met Ala Ser Met Thr Gly Gly Gln Gln Met Gly Arg Gly Ser 1 5 10
1339PRTArtificial SequencePeptidyl fragment 8 133Tyr Asp Gly Arg
Gln Gln His Arg Gly 1 5 13412PRTArtificial SequencePeptidyl
fragment 9 134Thr Ser Asn Gly Arg Gln Cys Ala Gly Ile Arg Pro 1 5
10 135644PRTBacillus thuringiensisFull-length Cry3A protein 135Met
Asn Pro Asn Asn Arg Ser Glu His Asp Thr Ile Lys Thr Thr Glu 1 5 10
15 Asn Asn Glu Val Pro Thr Asn His Val Gln Tyr Pro Leu Ala Glu Thr
20 25 30 Pro Asn Pro Thr Leu Glu Asp Leu Asn Tyr Lys Glu Phe Leu
Arg Met 35 40 45 Thr Ala Asp Asn Asn Thr Glu Ala Leu Asp Ser Ser
Thr Thr Lys Asp 50 55 60 Val Ile Gln Lys Gly Ile Ser Val Val Gly
Asp Leu Leu Gly Val Val 65 70 75 80 Gly Phe Pro Phe Gly Gly Ala Leu
Val Ser Phe Tyr Thr Asn Phe Leu 85 90 95 Asn Thr Ile Trp Pro Ser
Glu Asp Pro Trp Lys Ala Phe Met Glu Gln 100 105 110 Val Glu Ala Leu
Met Asp Gln Lys Ile Ala Asp Tyr Ala Lys Asn Lys 115 120 125 Ala Leu
Ala Glu Leu Gln Gly Leu Gln Asn Asn Val Glu Asp Tyr Val 130 135 140
Ser Ala Leu Ser Ser Trp Gln Lys Asn Pro Val Ser Ser Arg Asn Pro 145
150 155 160 His Ser Gln Gly Arg Ile Arg Glu Leu Phe Ser Gln Ala Glu
Ser His 165 170 175 Phe Arg Asn Ser Met Pro Ser Phe Ala Ile Ser Gly
Tyr Glu Val Leu 180 185 190 Phe Leu Thr Thr Tyr Ala Gln Ala Ala Asn
Thr His Leu Phe Leu Leu 195 200 205 Lys Asp Ala Gln Ile Tyr Gly Glu
Glu Trp Gly Tyr Glu Lys Glu Asp 210 215 220 Ile Ala Glu Phe Tyr Lys
Arg Gln Leu Lys Leu Thr Gln Glu Tyr Thr 225 230 235 240 Asp His Cys
Val Lys Trp Tyr Asn Val Gly Leu Asp Lys Leu Arg Gly 245 250 255 Ser
Ser Tyr Glu Ser Trp Val Asn Phe Asn Arg Tyr Arg Arg Glu Met 260 265
270 Thr Leu Thr Val Leu Asp Leu Ile Ala Leu Phe Pro Leu Tyr Asp Val
275 280 285 Arg Leu Tyr Pro Lys Glu Val Lys Thr Glu Leu Thr Arg Asp
Val Leu 290 295 300 Thr Asp Pro Ile Val Gly Val Asn Asn Leu Arg Gly
Tyr Gly Thr Thr 305 310
315 320 Phe Ser Asn Ile Glu Asn Tyr Ile Arg Lys Pro His Leu Phe Asp
Tyr 325 330 335 Leu His Arg Ile Gln Phe His Thr Arg Phe Gln Pro Gly
Tyr Tyr Gly 340 345 350 Asn Asp Ser Phe Asn Tyr Trp Ser Gly Asn Tyr
Val Ser Thr Arg Pro 355 360 365 Ser Ile Gly Ser Asn Asp Ile Ile Thr
Ser Pro Phe Tyr Gly Asn Lys 370 375 380 Ser Ser Glu Pro Val Gln Asn
Leu Glu Phe Asn Gly Glu Lys Val Tyr 385 390 395 400 Arg Ala Val Ala
Asn Thr Asn Leu Ala Val Trp Pro Ser Ala Val Tyr 405 410 415 Ser Gly
Val Thr Lys Val Glu Phe Ser Gln Tyr Asn Asp Gln Thr Asp 420 425 430
Glu Ala Ser Thr Gln Thr Tyr Asp Ser Lys Arg Asn Val Gly Ala Val 435
440 445 Ser Trp Asp Ser Ile Asp Gln Leu Pro Pro Glu Thr Thr Asp Glu
Pro 450 455 460 Leu Glu Lys Gly Tyr Ser His Gln Leu Asn Tyr Val Met
Cys Phe Leu 465 470 475 480 Met Gln Gly Ser Arg Gly Thr Ile Pro Val
Leu Thr Trp Thr His Lys 485 490 495 Ser Val Asp Phe Phe Asn Met Ile
Asp Ser Lys Lys Ile Thr Gln Leu 500 505 510 Pro Leu Val Lys Ala Tyr
Lys Leu Gln Ser Gly Ala Ser Val Val Ala 515 520 525 Gly Pro Arg Phe
Thr Gly Gly Asp Ile Ile Gln Cys Thr Glu Asn Gly 530 535 540 Ser Ala
Ala Thr Ile Tyr Val Thr Pro Asp Val Ser Tyr Ser Gln Lys 545 550 555
560 Tyr Arg Ala Arg Ile His Tyr Ala Ser Thr Ser Gln Ile Thr Phe Thr
565 570 575 Leu Ser Leu Asp Gly Ala Pro Phe Asn Gln Tyr Tyr Phe Asp
Lys Thr 580 585 590 Ile Asn Lys Gly Asp Thr Leu Thr Tyr Asn Ser Phe
Asn Leu Ala Ser 595 600 605 Phe Ser Thr Pro Phe Glu Leu Ser Gly Asn
Asn Leu Gln Ile Gly Val 610 615 620 Thr Gly Leu Ser Ala Gly Asp Lys
Val Tyr Ile Asp Lys Ile Glu Phe 625 630 635 640 Ile Pro Val Asn
13631DNAArtificial SequenceChemically synthesized CMS94 primer
136ggcgcgccac catggctagc atgactggtg g 3113720DNAArtificial
SequenceChemically synthesized CMS95 primer 137gcaggaacag
gtgggtgttg 2013820DNAArtificial sequenceChemically synthesized
CMS96 primer 138cctgaacacc atctggccca 2013939DNAArtificial
SequenceChemically synthesized CMS97 primer 139ctggctgctg
gggatgatgt tgttgaagtc gacgctctt 3914021DNAArtificial
SequenceChemically synthesized CMS98 primer 140gagctcttag
gtcacctcgg c 2114139DNAArtificial SequenceChemically synthesized
CMS99 primer 141aagagcgtcg acttcaacaa catcatcccc agcagccag
3914240DNAArtificial SequenceChemically synthesized CMS100 primer
142gaagtaccgc gcccgcatcc gctacgccag caccaccaac 4014340DNAArtificial
SequenceChemically synthesized CMS101 primer 143gttggtggtg
ctggcgtagc ggatgcgggc gcggtacttc 401441966DNAArtificial
SequenceT7-8AF coding sequence 144atggctagca tgactggtgg acagcaaatg
ggtcgcggat ccatgacggc cgacaacaac 60accgaggccc tggacagcag caccaccaag
gacgtgatcc agaagggcat cagcgtggtg 120ggcgacctgc tgggcgtggt
gggcttcccc ttcggcggcg ccctggtgag cttctacacc 180aacttcctga
acaccatctg gcccagcgag gacccctgga aggccttcat ggagcaggtg
240gaggccctga tggaccagaa gatcgccgac tacgccaaga acaaggcact
ggccgagcta 300cagggcctcc agaacaacgt ggaggactat gtgagcgccc
tgagcagctg gcagaagaac 360cccgctgcac cgttccgcaa cccccacagc
cagggccgca tccgcgagct gttcagccag 420gccgagagcc acttccgcaa
cagcatgccc agcttcgcca tcagcggcta cgaggtgctg 480ttcctgacca
cctacgccca ggccgccaac acccacctgt tcctgctgaa ggacgcccaa
540atctacggag aggagtgggg ctacgagaag gaggacatcg ccgagttcta
caagcgccag 600ctgaagctga cccaggagta caccgaccac tgcgtgaagt
ggtacaacgt gggtctagac 660aagctccgcg gcagcagcta cgagagctgg
gtgaacttca accgctaccg ccgcgagatg 720accctgaccg tgctggacct
gatcgccctg ttccccctgt acgacgtgcg cctgtacccc 780aaggaggtga
agaccgagct gacccgcgac gtgctgaccg accccatcgt gggcgtgaac
840aacctgcgcg gctacggcac caccttcagc aacatcgaga actacatccg
caagccccac 900ctgttcgact acctgcaccg catccagttc cacacgcgtt
tccagcccgg ctactacggc 960aacgacagct tcaactactg gagcggcaac
tacgtgagca cccgccccag catcggcagc 1020aacgacatca tcaccagccc
cttctacggc aacaagagca gcgagcccgt gcagaacctt 1080gagttcaacg
gcgagaaggt gtaccgcgcc gtggctaaca ccaacctggc cgtgtggccc
1140tctgcagtgt acagcggcgt gaccaaggtg gagttcagcc agtacaacga
ccagaccgac 1200gaggccagca cccagaccta cgacagcaag cgcaacgtgg
gcgccgtgag ctgggacagc 1260atcgaccagc tgccccccga gaccaccgac
gagcccctgg agaagggcta cagccaccag 1320ctgaactacg tgatgtgctt
cctgatgcag ggcagccgcg gcaccatccc cgtgctgacc 1380tggacccaca
agagcgtcga cttcttcaac atgatcgaca gcaagaagat cacccagctg
1440cccctgacca agagcaccaa cctgggcagc ggcaccagcg tggtgaaggg
ccccggcttc 1500accggcggcg acatcctgcg ccgcaccagc cccggccaga
tcagcaccct gcgcgtgaac 1560atcaccgccc ccctgagcca gcgctaccgc
gtccgcatcc gctacgccag caccaccaac 1620ctgcagttcc acaccagcat
cgacggccgc cccatcaacc agggcaactt cagcgccacc 1680atgagcagcg
gcagcaacct gcagagcggc agcttccgca ccgtgggctt caccaccccc
1740ttcaacttca gcaacggcag cagcgtgttc accctgagcg cccacgtgtt
caacagcggc 1800aacgaggtgt acatcgaccg catcgagttc gtgcccgccg
aggtgacctt cgaggccgag 1860tacgacctgg agagggctca gaaggccgtg
aacgagctgt tcaccagcag caaccagatc 1920ggcctgaaga ccgacgtgac
cgactaccac atcgatcagg tgtagg 1966145654PRTArtificial SequenceT7-8AF
protein 145Met Ala Ser Met Thr Gly Gly Gln Gln Met Gly Arg Gly Ser
Met Thr 1 5 10 15 Ala Asp Asn Asn Thr Glu Ala Leu Asp Ser Ser Thr
Thr Lys Asp Val 20 25 30 Ile Gln Lys Gly Ile Ser Val Val Gly Asp
Leu Leu Gly Val Val Gly 35 40 45 Phe Pro Phe Gly Gly Ala Leu Val
Ser Phe Tyr Thr Asn Phe Leu Asn 50 55 60 Thr Ile Trp Pro Ser Glu
Asp Pro Trp Lys Ala Phe Met Glu Gln Val 65 70 75 80 Glu Ala Leu Met
Asp Gln Lys Ile Ala Asp Tyr Ala Lys Asn Lys Ala 85 90 95 Leu Ala
Glu Leu Gln Gly Leu Gln Asn Asn Val Glu Asp Tyr Val Ser 100 105 110
Ala Leu Ser Ser Trp Gln Lys Asn Pro Ala Ala Pro Phe Arg Asn Pro 115
120 125 His Ser Gln Gly Arg Ile Arg Glu Leu Phe Ser Gln Ala Glu Ser
His 130 135 140 Phe Arg Asn Ser Met Pro Ser Phe Ala Ile Ser Gly Tyr
Glu Val Leu 145 150 155 160 Phe Leu Thr Thr Tyr Ala Gln Ala Ala Asn
Thr His Leu Phe Leu Leu 165 170 175 Lys Asp Ala Gln Ile Tyr Gly Glu
Glu Trp Gly Tyr Glu Lys Glu Asp 180 185 190 Ile Ala Glu Phe Tyr Lys
Arg Gln Leu Lys Leu Thr Gln Glu Tyr Thr 195 200 205 Asp His Cys Val
Lys Trp Tyr Asn Val Gly Leu Asp Lys Leu Arg Gly 210 215 220 Ser Ser
Tyr Glu Ser Trp Val Asn Phe Asn Arg Tyr Arg Arg Glu Met 225 230 235
240 Thr Leu Thr Val Leu Asp Leu Ile Ala Leu Phe Pro Leu Tyr Asp Val
245 250 255 Arg Leu Tyr Pro Lys Glu Val Lys Thr Glu Leu Thr Arg Asp
Val Leu 260 265 270 Thr Asp Pro Ile Val Gly Val Asn Asn Leu Arg Gly
Tyr Gly Thr Thr 275 280 285 Phe Ser Asn Ile Glu Asn Tyr Ile Arg Lys
Pro His Leu Phe Asp Tyr 290 295 300 Leu His Arg Ile Gln Phe His Thr
Arg Phe Gln Pro Gly Tyr Tyr Gly 305 310 315 320 Asn Asp Ser Phe Asn
Tyr Trp Ser Gly Asn Tyr Val Ser Thr Arg Pro 325 330 335 Ser Ile Gly
Ser Asn Asp Ile Ile Thr Ser Pro Phe Tyr Gly Asn Lys 340 345 350 Ser
Ser Glu Pro Val Gln Asn Leu Glu Phe Asn Gly Glu Lys Val Tyr 355 360
365 Arg Ala Val Ala Asn Thr Asn Leu Ala Val Trp Pro Ser Ala Val Tyr
370 375 380 Ser Gly Val Thr Lys Val Glu Phe Ser Gln Tyr Asn Asp Gln
Thr Asp 385 390 395 400 Glu Ala Ser Thr Gln Thr Tyr Asp Ser Lys Arg
Asn Val Gly Ala Val 405 410 415 Ser Trp Asp Ser Ile Asp Gln Leu Pro
Pro Glu Thr Thr Asp Glu Pro 420 425 430 Leu Glu Lys Gly Tyr Ser His
Gln Leu Asn Tyr Val Met Cys Phe Leu 435 440 445 Met Gln Gly Ser Arg
Gly Thr Ile Pro Val Leu Thr Trp Thr His Lys 450 455 460 Ser Val Asp
Phe Phe Asn Met Ile Asp Ser Lys Lys Ile Thr Gln Leu 465 470 475 480
Pro Leu Thr Lys Ser Thr Asn Leu Gly Ser Gly Thr Ser Val Val Lys 485
490 495 Gly Pro Gly Phe Thr Gly Gly Asp Ile Leu Arg Arg Thr Ser Pro
Gly 500 505 510 Gln Ile Ser Thr Leu Arg Val Asn Ile Thr Ala Pro Leu
Ser Gln Arg 515 520 525 Tyr Arg Val Arg Ile Arg Tyr Ala Ser Thr Thr
Asn Leu Gln Phe His 530 535 540 Thr Ser Ile Asp Gly Arg Pro Ile Asn
Gln Gly Asn Phe Ser Ala Thr 545 550 555 560 Met Ser Ser Gly Ser Asn
Leu Gln Ser Gly Ser Phe Arg Thr Val Gly 565 570 575 Phe Thr Thr Pro
Phe Asn Phe Ser Asn Gly Ser Ser Val Phe Thr Leu 580 585 590 Ser Ala
His Val Phe Asn Ser Gly Asn Glu Val Tyr Ile Asp Arg Ile 595 600 605
Glu Phe Val Pro Ala Glu Val Thr Phe Glu Ala Glu Tyr Asp Leu Glu 610
615 620 Arg Ala Gln Lys Ala Val Asn Glu Leu Phe Thr Ser Ser Asn Gln
Ile 625 630 635 640 Gly Leu Lys Thr Asp Val Thr Asp Tyr His Ile Asp
Gln Val 645 650 1461920DNAArtificial Sequence-catG8AF coding
sequence 146atgacggccg acaacaacac cgaggccctg gacagcagca ccaccaagga
cgtgatccag 60aagggcatca gcgtggtggg cgacctgctg ggcgtggtgg gcttcccctt
cggcggcgcc 120ctggtgagct tctacaccaa cttcctgaac accatctggc
ccagcgagga cccctggaag 180gccttcatgg agcaggtgga ggccctgatg
gaccagaaga tcgccgacta cgccaagaac 240aaggcactgg ccgagctaca
gggcctccag aacaacgtgg aggactatgt gagcgccctg 300agcagctggc
agaagaaccc cgtctcgagc cgcaaccccc acagccaggg ccgcatccgc
360gagctgttca gccaggccga gagccacttc cgcaacagca tgcccagctt
cgccatcagc 420ggctacgagg tgctgttcct gaccacctac gcccaggccg
ccaacaccca cctgttcctg 480ctgaaggacg cccaaatcta cggagaggag
tggggctacg agaaggagga catcgccgag 540ttctacaagc gccagctgaa
gctgacccag gagtacaccg accactgcgt gaagtggtac 600aacgtgggtc
tagacaagct ccgcggcagc agctacgaga gctgggtgaa cttcaaccgc
660taccgccgcg agatgaccct gaccgtgctg gacctgatcg ccctgttccc
cctgtacgac 720gtgcgcctgt accccaagga ggtgaagacc gagctgaccc
gcgacgtgct gaccgacccc 780atcgtgggcg tgaacaacct gcgcggctac
ggcaccacct tcagcaacat cgagaactac 840atccgcaagc cccacctgtt
cgactacctg caccgcatcc agttccacac gcgtttccag 900cccggctact
acggcaacga cagcttcaac tactggagcg gcaactacgt gagcacccgc
960cccagcatcg gcagcaacga catcatcacc agccccttct acggcaacaa
gagcagcgag 1020cccgtgcaga accttgagtt caacggcgag aaggtgtacc
gcgccgtggc taacaccaac 1080ctggccgtgt ggccctctgc agtgtacagc
ggcgtgacca aggtggagtt cagccagtac 1140aacgaccaga ccgacgaggc
cagcacccag acctacgaca gcaagcgcaa cgtgggcgcc 1200gtgagctggg
acagcatcga ccagctgccc cccgagacca ccgacgagcc cctggagaag
1260ggctacagcc accagctgaa ctacgtgatg tgcttcctga tgcagggcag
ccgcggcacc 1320atccccgtgc tgacctggac ccacaagagc gtcgacttct
tcaacatgat cgacagcaag 1380aagatcaccc agctgcccct gaccaagagc
accaacctgg gcagcggcac cagcgtggtg 1440aagggccccg gcttcaccgg
cggcgacatc ctgcgccgca ccagccccgg ccagatcagc 1500accctgcgcg
tgaacatcac cgcccccctg agccagcgct accgcgtccg catccgctac
1560gccagcacca ccaacctgca gttccacacc agcatcgacg gccgccccat
caaccagggc 1620aacttcagcg ccaccatgag cagcggcagc aacctgcaga
gcggcagctt ccgcaccgtg 1680ggcttcacca cccccttcaa cttcagcaac
ggcagcagcg tgttcaccct gagcgcccac 1740gtgttcaaca gcggcaacga
ggtgtacatc gaccgcatcg agttcgtgcc cgccgaggtg 1800accttcgagg
ccgagtacga cctggagagg gctcagaagg ccgtgaacga gctgttcacc
1860agcagcaacc agatcggcct gaagaccgac gtgaccgact accacatcga
tcaggtgtag 1920147639PRTArtificial Sequence-catG8AF protein 147Met
Thr Ala Asp Asn Asn Thr Glu Ala Leu Asp Ser Ser Thr Thr Lys 1 5 10
15 Asp Val Ile Gln Lys Gly Ile Ser Val Val Gly Asp Leu Leu Gly Val
20 25 30 Val Gly Phe Pro Phe Gly Gly Ala Leu Val Ser Phe Tyr Thr
Asn Phe 35 40 45 Leu Asn Thr Ile Trp Pro Ser Glu Asp Pro Trp Lys
Ala Phe Met Glu 50 55 60 Gln Val Glu Ala Leu Met Asp Gln Lys Ile
Ala Asp Tyr Ala Lys Asn 65 70 75 80 Lys Ala Leu Ala Glu Leu Gln Gly
Leu Gln Asn Asn Val Glu Asp Tyr 85 90 95 Val Ser Ala Leu Ser Ser
Trp Gln Lys Asn Pro Val Ser Ser Arg Asn 100 105 110 Pro His Ser Gln
Gly Arg Ile Arg Glu Leu Phe Ser Gln Ala Glu Ser 115 120 125 His Phe
Arg Asn Ser Met Pro Ser Phe Ala Ile Ser Gly Tyr Glu Val 130 135 140
Leu Phe Leu Thr Thr Tyr Ala Gln Ala Ala Asn Thr His Leu Phe Leu 145
150 155 160 Leu Lys Asp Ala Gln Ile Tyr Gly Glu Glu Trp Gly Tyr Glu
Lys Glu 165 170 175 Asp Ile Ala Glu Phe Tyr Lys Arg Gln Leu Lys Leu
Thr Gln Glu Tyr 180 185 190 Thr Asp His Cys Val Lys Trp Tyr Asn Val
Gly Leu Asp Lys Leu Arg 195 200 205 Gly Ser Ser Tyr Glu Ser Trp Val
Asn Phe Asn Arg Tyr Arg Arg Glu 210 215 220 Met Thr Leu Thr Val Leu
Asp Leu Ile Ala Leu Phe Pro Leu Tyr Asp 225 230 235 240 Val Arg Leu
Tyr Pro Lys Glu Val Lys Thr Glu Leu Thr Arg Asp Val 245 250 255 Leu
Thr Asp Pro Ile Val Gly Val Asn Asn Leu Arg Gly Tyr Gly Thr 260 265
270 Thr Phe Ser Asn Ile Glu Asn Tyr Ile Arg Lys Pro His Leu Phe Asp
275 280 285 Tyr Leu His Arg Ile Gln Phe His Thr Arg Phe Gln Pro Gly
Tyr Tyr 290 295 300 Gly Asn Asp Ser Phe Asn Tyr Trp Ser Gly Asn Tyr
Val Ser Thr Arg 305 310 315 320 Pro Ser Ile Gly Ser Asn Asp Ile Ile
Thr Ser Pro Phe Tyr Gly Asn 325 330 335 Lys Ser Ser Glu Pro Val Gln
Asn Leu Glu Phe Asn Gly Glu Lys Val 340 345 350 Tyr Arg Ala Val Ala
Asn Thr Asn Leu Ala Val Trp Pro Ser Ala Val 355 360 365 Tyr Ser Gly
Val Thr Lys Val Glu Phe Ser Gln Tyr Asn Asp Gln Thr 370 375 380 Asp
Glu Ala Ser Thr Gln Thr Tyr Asp Ser Lys Arg Asn Val Gly Ala 385 390
395 400 Val Ser Trp Asp Ser Ile Asp Gln Leu Pro Pro Glu Thr Thr Asp
Glu 405 410 415 Pro Leu Glu Lys Gly Tyr Ser His Gln Leu Asn Tyr Val
Met Cys Phe 420 425 430 Leu Met Gln Gly Ser Arg Gly Thr Ile Pro Val
Leu Thr Trp Thr His 435 440 445 Lys Ser Val Asp Phe Phe Asn Met Ile
Asp Ser Lys Lys Ile Thr Gln 450 455 460 Leu Pro Leu Thr Lys Ser Thr
Asn Leu Gly Ser Gly Thr Ser Val Val 465 470 475 480 Lys Gly Pro Gly
Phe Thr Gly Gly Asp Ile Leu Arg Arg Thr Ser Pro 485 490 495 Gly Gln
Ile Ser Thr Leu Arg Val Asn Ile Thr Ala Pro Leu Ser Gln 500 505 510
Arg Tyr Arg Val Arg Ile Arg Tyr Ala Ser Thr Thr Asn Leu Gln Phe 515
520 525 His Thr Ser Ile Asp Gly Arg Pro Ile Asn Gln Gly Asn Phe Ser
Ala 530 535 540 Thr Met Ser Ser Gly Ser Asn Leu Gln Ser Gly Ser Phe
Arg Thr Val 545 550
555 560 Gly Phe Thr Thr Pro Phe Asn Phe Ser Asn Gly Ser Ser Val Phe
Thr 565 570 575 Leu Ser Ala His Val Phe Asn Ser Gly Asn Glu Val Tyr
Ile Asp Arg 580 585 590 Ile Glu Phe Val Pro Ala Glu Val Thr Phe Glu
Ala Glu Tyr Asp Leu 595 600 605 Glu Arg Ala Gln Lys Ala Val Asn Glu
Leu Phe Thr Ser Ser Asn Gln 610 615 620 Ile Gly Leu Lys Thr Asp Val
Thr Asp Tyr His Ile Asp Gln Val 625 630 635 1481809DNAArtificial
Sequence8AFdm3 coding sequence 148atgacggccg acaacaacac cgaggccctg
gacagcagca ccaccaagga cgtgatccag 60aagggcatca gcgtggtggg cgacctgctg
ggcgtggtgg gcttcccctt cggcggcgcc 120ctggtgagct tctacaccaa
cttcctgaac accatctggc ccagcgagga cccctggaag 180gccttcatgg
agcaggtgga ggccctgatg gaccagaaga tcgccgacta cgccaagaac
240aaggcactgg ccgagctaca gggcctccag aacaacgtgg aggactatgt
gagcgccctg 300agcagctggc agaagaaccc cgctgcaccg ttccgcaacc
cccacagcca gggccgcatc 360cgcgagctgt tcagccaggc cgagagccac
ttccgcaaca gcatgcccag cttcgccatc 420agcggctacg aggtgctgtt
cctgaccacc tacgcccagg ccgccaacac ccacctgttc 480ctgctgaagg
acgcccaaat ctacggagag gagtggggct acgagaagga ggacatcgcc
540gagttctaca agcgccagct gaagctgacc caggagtaca ccgaccactg
cgtgaagtgg 600tacaacgtgg gtctagacaa gctccgcggc agcagctacg
agagctgggt gaacttcaac 660cgctaccgcc gcgagatgac cctgaccgtg
ctggacctga tcgccctgtt ccccctgtac 720gacgtgcgcc tgtaccccaa
ggaggtgaag accgagctga cccgcgacgt gctgaccgac 780cccatcgtgg
gcgtgaacaa cctgcgcggc tacggcacca ccttcagcaa catcgagaac
840tacatccgca agccccacct gttcgactac ctgcaccgca tccagttcca
cacgcgtttc 900cagcccggct actacggcaa cgacagcttc aactactgga
gcggcaacta cgtgagcacc 960cgccccagca tcggcagcaa cgacatcatc
accagcccct tctacggcaa caagagcagc 1020gagcccgtgc agaaccttga
gttcaacggc gagaaggtgt accgcgccgt ggctaacacc 1080aacctggccg
tgtggccctc tgcagtgtac agcggcgtga ccaaggtgga gttcagccag
1140tacaacgacc agaccgacga ggccagcacc cagacctacg acagcaagcg
caacgtgggc 1200gccgtgagct gggacagcat cgaccagctg ccccccgaga
ccaccgacga gcccctggag 1260aagggctaca gccaccagct gaactacgtg
atgtgcttcc tgatgcaggg cagccgcggc 1320accatccccg tgctgacctg
gacccacaag agcgtcgact tcaacaacat catccccagc 1380agccagatca
cccagatccc cctgaccaag agcaccaacc tgggcagcgg caccagcgtg
1440gtgaagggcc ccggcttcac cggcggcgac atcctgcgcc gcaccagccc
cggccagatc 1500agcaccctgc gcgtgaacat caccgccccc ctgagccagc
gctaccgcgt ccgcatccgc 1560tacgccagca ccaccaacct gcagttccac
accagcatcg acggccgccc catcaaccag 1620ggcaacttca gcgccaccat
gagcagcggc agcaacctgc agagcggcag cttccgcacc 1680gtgggcttca
ccaccccctt caacttcagc aacggcagca gcgtgttcac cctgagcgcc
1740cacgtgttca acagcggcaa cgaggtgtac atcgaccgca tcgagttcgt
gcccgccgag 1800gtgacctaa 1809149602PRTArtificial Sequence8AFdm3
protein 149Met Thr Ala Asp Asn Asn Thr Glu Ala Leu Asp Ser Ser Thr
Thr Lys 1 5 10 15 Asp Val Ile Gln Lys Gly Ile Ser Val Val Gly Asp
Leu Leu Gly Val 20 25 30 Val Gly Phe Pro Phe Gly Gly Ala Leu Val
Ser Phe Tyr Thr Asn Phe 35 40 45 Leu Asn Thr Ile Trp Pro Ser Glu
Asp Pro Trp Lys Ala Phe Met Glu 50 55 60 Gln Val Glu Ala Leu Met
Asp Gln Lys Ile Ala Asp Tyr Ala Lys Asn 65 70 75 80 Lys Ala Leu Ala
Glu Leu Gln Gly Leu Gln Asn Asn Val Glu Asp Tyr 85 90 95 Val Ser
Ala Leu Ser Ser Trp Gln Lys Asn Pro Ala Ala Pro Phe Arg 100 105 110
Asn Pro His Ser Gln Gly Arg Ile Arg Glu Leu Phe Ser Gln Ala Glu 115
120 125 Ser His Phe Arg Asn Ser Met Pro Ser Phe Ala Ile Ser Gly Tyr
Glu 130 135 140 Val Leu Phe Leu Thr Thr Tyr Ala Gln Ala Ala Asn Thr
His Leu Phe 145 150 155 160 Leu Leu Lys Asp Ala Gln Ile Tyr Gly Glu
Glu Trp Gly Tyr Glu Lys 165 170 175 Glu Asp Ile Ala Glu Phe Tyr Lys
Arg Gln Leu Lys Leu Thr Gln Glu 180 185 190 Tyr Thr Asp His Cys Val
Lys Trp Tyr Asn Val Gly Leu Asp Lys Leu 195 200 205 Arg Gly Ser Ser
Tyr Glu Ser Trp Val Asn Phe Asn Arg Tyr Arg Arg 210 215 220 Glu Met
Thr Leu Thr Val Leu Asp Leu Ile Ala Leu Phe Pro Leu Tyr 225 230 235
240 Asp Val Arg Leu Tyr Pro Lys Glu Val Lys Thr Glu Leu Thr Arg Asp
245 250 255 Val Leu Thr Asp Pro Ile Val Gly Val Asn Asn Leu Arg Gly
Tyr Gly 260 265 270 Thr Thr Phe Ser Asn Ile Glu Asn Tyr Ile Arg Lys
Pro His Leu Phe 275 280 285 Asp Tyr Leu His Arg Ile Gln Phe His Thr
Arg Phe Gln Pro Gly Tyr 290 295 300 Tyr Gly Asn Asp Ser Phe Asn Tyr
Trp Ser Gly Asn Tyr Val Ser Thr 305 310 315 320 Arg Pro Ser Ile Gly
Ser Asn Asp Ile Ile Thr Ser Pro Phe Tyr Gly 325 330 335 Asn Lys Ser
Ser Glu Pro Val Gln Asn Leu Glu Phe Asn Gly Glu Lys 340 345 350 Val
Tyr Arg Ala Val Ala Asn Thr Asn Leu Ala Val Trp Pro Ser Ala 355 360
365 Val Tyr Ser Gly Val Thr Lys Val Glu Phe Ser Gln Tyr Asn Asp Gln
370 375 380 Thr Asp Glu Ala Ser Thr Gln Thr Tyr Asp Ser Lys Arg Asn
Val Gly 385 390 395 400 Ala Val Ser Trp Asp Ser Ile Asp Gln Leu Pro
Pro Glu Thr Thr Asp 405 410 415 Glu Pro Leu Glu Lys Gly Tyr Ser His
Gln Leu Asn Tyr Val Met Cys 420 425 430 Phe Leu Met Gln Gly Ser Arg
Gly Thr Ile Pro Val Leu Thr Trp Thr 435 440 445 His Lys Ser Val Asp
Phe Asn Asn Ile Ile Pro Ser Ser Gln Ile Thr 450 455 460 Gln Ile Pro
Leu Thr Lys Ser Thr Asn Leu Gly Ser Gly Thr Ser Val 465 470 475 480
Val Lys Gly Pro Gly Phe Thr Gly Gly Asp Ile Leu Arg Arg Thr Ser 485
490 495 Pro Gly Gln Ile Ser Thr Leu Arg Val Asn Ile Thr Ala Pro Leu
Ser 500 505 510 Gln Arg Tyr Arg Val Arg Ile Arg Tyr Ala Ser Thr Thr
Asn Leu Gln 515 520 525 Phe His Thr Ser Ile Asp Gly Arg Pro Ile Asn
Gln Gly Asn Phe Ser 530 535 540 Ala Thr Met Ser Ser Gly Ser Asn Leu
Gln Ser Gly Ser Phe Arg Thr 545 550 555 560 Val Gly Phe Thr Thr Pro
Phe Asn Phe Ser Asn Gly Ser Ser Val Phe 565 570 575 Thr Leu Ser Ala
His Val Phe Asn Ser Gly Asn Glu Val Tyr Ile Asp 580 585 590 Arg Ile
Glu Phe Val Pro Ala Glu Val Thr 595 600 1501809DNAArtificial
Sequence8AFlomgdm3 coding sequence 150atgacggccg acaacaacac
cgaggccctg gacagcagca ccaccaagga cgtgatccag 60aagggcatca gcgtggtggg
cgacctgctg ggcgtggtgg gcttcccctt cggcggcgcc 120ctggtgagct
tctacaccaa cttcctgaac accatctggc ccagcgagga cccctggaag
180gccttcatgg agcaggtgga ggccctgatg gaccagaaga tcgccgacta
cgccaagaac 240aaggcactgg ccgagctaca gggcctccag aacaacgtgg
aggactatgt gagcgccctg 300agcagctggc agaagaaccc cgctgcaccg
ttccgcaacc cccacagcca gggccgcatc 360cgcgagctgt tcagccaggc
cgagagccac ttccgcaaca gcatgcccag cttcgccatc 420agcggctacg
aggtgctgtt cctgaccacc tacgcccagg ccgccaacac ccacctgttc
480ctgctgaagg acgcccaaat ctacggagag gagtggggct acgagaagga
ggacatcgcc 540gagttctaca agcgccagct gaagctgacc caggagtaca
ccgaccactg cgtgaagtgg 600tacaacgtgg gtctagacaa gctccgcggc
agcagctacg agagctgggt gaacttcaac 660cgctaccgcc gcgagatgac
cctgaccgtg ctggacctga tcgccctgtt ccccctgtac 720gacgtgcgcc
tgtaccccaa ggaggtgaag accgagctga cccgcgacgt gctgaccgac
780cccatcgtgg gcgtgaacaa cctgcgcggc tacggcacca ccttcagcaa
catcgagaac 840tacatccgca agccccacct gttcgactac ctgcaccgca
tccagttcca cacgcgtttc 900cagcccggct actacggcaa cgacagcttc
aactactgga gcggcaacta cgtgagcacc 960cgccccagca tcggcagcaa
cgacatcatc accagcccct tctacggcaa caagagcagc 1020gagcccgtgc
agaaccttga gttcaacggc gagaaggtgt accgcgccgt ggctaacacc
1080aacctggccg tgtggccctc tgcagtgtac agcggcgtga ccaaggtgga
gttcagccag 1140tacaacgacc agaccgacga ggccagcacc cagacctacg
acagcaagcg caacgtgggc 1200gccgtgagct gggacagcat cgaccagctg
ccccccgaga ccaccgacga gcccctggag 1260aagggctaca gccaccagct
gaactacgtg atgtgcttcc tgatgcaggg cagccgcggc 1320accatccccg
tgctgacctg gacccacaag agcgtcgact tcttcaacat gatcgacagc
1380aagaagatca cccagctgcc cctggtgaag gcctacaagc tccagagcgg
cgccagcgtg 1440gtggcaggcc cccgcttcac cggcggcgac atcatccagt
gcaccgagaa cggcagcgcc 1500gccaccatct acgtgacccc cgacgtgagc
tacagccaga agtaccgcgc ccgcatccgc 1560tacgccagca ccaccaacct
gcagttccac accagcatcg acggccgccc catcaaccag 1620ggcaacttca
gcgccaccat gagcagcggc agcaacctgc agagcggcag cttccgcacc
1680gtgggcttca ccaccccctt caacttcagc aacggcagca gcgtgttcac
cctgagcgcc 1740cacgtgttca acagcggcaa cgaggtgtac atcgaccgca
tcgagttcgt gcccgccgag 1800gtgacctaa 1809151602PRTArtificial
Sequence8AFlongdm3 protein 151Met Thr Ala Asp Asn Asn Thr Glu Ala
Leu Asp Ser Ser Thr Thr Lys 1 5 10 15 Asp Val Ile Gln Lys Gly Ile
Ser Val Val Gly Asp Leu Leu Gly Val 20 25 30 Val Gly Phe Pro Phe
Gly Gly Ala Leu Val Ser Phe Tyr Thr Asn Phe 35 40 45 Leu Asn Thr
Ile Trp Pro Ser Glu Asp Pro Trp Lys Ala Phe Met Glu 50 55 60 Gln
Val Glu Ala Leu Met Asp Gln Lys Ile Ala Asp Tyr Ala Lys Asn 65 70
75 80 Lys Ala Leu Ala Glu Leu Gln Gly Leu Gln Asn Asn Val Glu Asp
Tyr 85 90 95 Val Ser Ala Leu Ser Ser Trp Gln Lys Asn Pro Ala Ala
Pro Phe Arg 100 105 110 Asn Pro His Ser Gln Gly Arg Ile Arg Glu Leu
Phe Ser Gln Ala Glu 115 120 125 Ser His Phe Arg Asn Ser Met Pro Ser
Phe Ala Ile Ser Gly Tyr Glu 130 135 140 Val Leu Phe Leu Thr Thr Tyr
Ala Gln Ala Ala Asn Thr His Leu Phe 145 150 155 160 Leu Leu Lys Asp
Ala Gln Ile Tyr Gly Glu Glu Trp Gly Tyr Glu Lys 165 170 175 Glu Asp
Ile Ala Glu Phe Tyr Lys Arg Gln Leu Lys Leu Thr Gln Glu 180 185 190
Tyr Thr Asp His Cys Val Lys Trp Tyr Asn Val Gly Leu Asp Lys Leu 195
200 205 Arg Gly Ser Ser Tyr Glu Ser Trp Val Asn Phe Asn Arg Tyr Arg
Arg 210 215 220 Glu Met Thr Leu Thr Val Leu Asp Leu Ile Ala Leu Phe
Pro Leu Tyr 225 230 235 240 Asp Val Arg Leu Tyr Pro Lys Glu Val Lys
Thr Glu Leu Thr Arg Asp 245 250 255 Val Leu Thr Asp Pro Ile Val Gly
Val Asn Asn Leu Arg Gly Tyr Gly 260 265 270 Thr Thr Phe Ser Asn Ile
Glu Asn Tyr Ile Arg Lys Pro His Leu Phe 275 280 285 Asp Tyr Leu His
Arg Ile Gln Phe His Thr Arg Phe Gln Pro Gly Tyr 290 295 300 Tyr Gly
Asn Asp Ser Phe Asn Tyr Trp Ser Gly Asn Tyr Val Ser Thr 305 310 315
320 Arg Pro Ser Ile Gly Ser Asn Asp Ile Ile Thr Ser Pro Phe Tyr Gly
325 330 335 Asn Lys Ser Ser Glu Pro Val Gln Asn Leu Glu Phe Asn Gly
Glu Lys 340 345 350 Val Tyr Arg Ala Val Ala Asn Thr Asn Leu Ala Val
Trp Pro Ser Ala 355 360 365 Val Tyr Ser Gly Val Thr Lys Val Glu Phe
Ser Gln Tyr Asn Asp Gln 370 375 380 Thr Asp Glu Ala Ser Thr Gln Thr
Tyr Asp Ser Lys Arg Asn Val Gly 385 390 395 400 Ala Val Ser Trp Asp
Ser Ile Asp Gln Leu Pro Pro Glu Thr Thr Asp 405 410 415 Glu Pro Leu
Glu Lys Gly Tyr Ser His Gln Leu Asn Tyr Val Met Cys 420 425 430 Phe
Leu Met Gln Gly Ser Arg Gly Thr Ile Pro Val Leu Thr Trp Thr 435 440
445 His Lys Ser Val Asp Phe Phe Asn Met Ile Asp Ser Lys Lys Ile Thr
450 455 460 Gln Leu Pro Leu Val Lys Ala Tyr Lys Leu Gln Ser Gly Ala
Ser Val 465 470 475 480 Val Ala Gly Pro Arg Phe Thr Gly Gly Asp Ile
Ile Gln Cys Thr Glu 485 490 495 Asn Gly Ser Ala Ala Thr Ile Tyr Val
Thr Pro Asp Val Ser Tyr Ser 500 505 510 Gln Lys Tyr Arg Ala Arg Ile
Arg Tyr Ala Ser Thr Thr Asn Leu Gln 515 520 525 Phe His Thr Ser Ile
Asp Gly Arg Pro Ile Asn Gln Gly Asn Phe Ser 530 535 540 Ala Thr Met
Ser Ser Gly Ser Asn Leu Gln Ser Gly Ser Phe Arg Thr 545 550 555 560
Val Gly Phe Thr Thr Pro Phe Asn Phe Ser Asn Gly Ser Ser Val Phe 565
570 575 Thr Leu Ser Ala His Val Phe Asn Ser Gly Asn Glu Val Tyr Ile
Asp 580 585 590 Arg Ile Glu Phe Val Pro Ala Glu Val Thr 595 600
1521848DNAArtificial Sequencecap8AFdm3 coding sequence
152atgactagta acggccgcca gtgtgctggt attcgccctt atgacggccg
acaacaacac 60cgaggcctgg acagcagcac caccaaggac gtgatccaga agggcatcag
cgtggtgggc 120gacctgctgg gcgtggtggg cttccccttc ggcggcgccc
tggtgagctt ctacaccaac 180ttcctgaaca ccatctggcc cagcgaggac
ccctggaagg ccttcatgga gcaggtggag 240gccctgatgg accagaagat
cgccgactac gccaagaaca aggcactggc cgagctacag 300ggcctccaga
acaacgtgga ggactatgtg agcgccctga gcagctggca gaagaacccc
360gctgcaccgt tccgcaaccc ccacagccag ggccgcatcc gcgagctgtt
cagccaggcc 420gagagccact tccgcaacag catgcccagc ttcgccatca
gcggctacga ggtgctgttc 480ctgaccacct acgcccaggc cgccaacacc
cacctgttcc tgctgaagga cgcccaaatc 540tacggagagg agtggggcta
cgagaaggag gacatcgccg agttctacaa gcgccagctg 600aagctgaccc
aggagtacac cgaccactgc gtgaagtggt acaacgtggg tctagacaag
660ctccgcggca gcagctacga gagctgggtg aacttcaacc gctaccgccg
cgagatgacc 720ctgaccgtgc tggacctgat cgccctgttc cccctgtacg
acgtgcgcct gtaccccaag 780gaggtgaaga ccgagctgac ccgcgacgtg
ctgaccgacc ccatcgtggg cgtgaacaac 840ctgcgcggct acggcaccac
cttcagcaac atcgagaact acatccgcaa gccccacctg 900ttcgactacc
tgcaccgcat ccagttccac acgcgtttcc agcccggcta ctacggcaac
960gacagcttca actactggag cggcaactac gtgagcaccc gccccagcat
cggcagcaac 1020gacatcatca ccagcccctt ctacggcaac aagagcagcg
agcccgtgca gaaccttgag 1080ttcaacggcg agaaggtgta ccgcgccgtg
gctaacacca acctggccgt gtggccctct 1140gcagtgtaca gcggcgtgac
caaggtggag ttcagccagt acaacgacca gaccgacgag 1200gccagcaccc
agacctacga cagcaagcgc aacgtgggcg ccgtgagctg ggacagcatc
1260gaccagctgc cccccgagac caccgacgag cccctggaga agggctacag
ccaccagctg 1320aactacgtga tgtgcttcct gatgcagggc agccgcggca
ccatccccgt gctgacctgg 1380acccacaaga gcgtcgactt caacaacatc
atccccagca gccagatcac ccagatcccc 1440ctgaccaaga gcaccaacct
gggcagcggc accagcgtgg tgaagggccc cggcttcacc 1500ggcggcgaca
tcctgcgccg caccagcccc ggccagatca gcaccctgcg cgtgaacatc
1560accgcccccc tgagccagcg ctaccgcgtc cgcatccgct acgccagcac
caccaacctg 1620cagttccaca ccagcatcga cggccgcccc atcaaccagg
gcaacttcag cgccaccatg 1680agcagcggca gcaacctgca gagcggcagc
ttccgcaccg tgggcttcac cacccccttc 1740aacttcagca acggcagcag
cgtgttcacc ctgagcgccc acgtgttcaa cagcggcaac 1800gaggtgtaca
tcgaccgcat cgagttcgtg cccgccgagg tgacctag 1848153615PRTArtificial
Sequencecap8AFdm3 protein 153Met Thr Ser Asn Gly Arg Gln Cys Ala
Gly Ile Arg Pro Tyr Asp Gly 1 5 10 15 Arg Gln Gln His Arg Gly Leu
Asp Ser Ser Thr Thr Lys Asp Val Ile 20 25 30 Gln Lys Gly Ile Ser
Val Val Gly Asp Leu Leu Gly Val Val Gly Phe 35 40 45 Pro Phe Gly
Gly Ala Leu Val Ser Phe Tyr Thr Asn Phe Leu Asn Thr 50 55 60 Ile
Trp Pro Ser Glu Asp Pro Trp Lys Ala Phe Met Glu Gln Val Glu 65 70
75 80 Ala Leu Met Asp Gln Lys Ile Ala Asp Tyr Ala Lys Asn Lys Ala
Leu 85 90 95 Ala Glu Leu Gln Gly Leu Gln Asn Asn Val Glu Asp Tyr
Val Ser Ala 100 105 110 Leu Ser Ser Trp Gln Lys Asn Pro Ala Ala Pro
Phe Arg Asn Pro His 115
120 125 Ser Gln Gly Arg Ile Arg Glu Leu Phe Ser Gln Ala Glu Ser His
Phe 130 135 140 Arg Asn Ser Met Pro Ser Phe Ala Ile Ser Gly Tyr Glu
Val Leu Phe 145 150 155 160 Leu Thr Thr Tyr Ala Gln Ala Ala Asn Thr
His Leu Phe Leu Leu Lys 165 170 175 Asp Ala Gln Ile Tyr Gly Glu Glu
Trp Gly Tyr Glu Lys Glu Asp Ile 180 185 190 Ala Glu Phe Tyr Lys Arg
Gln Leu Lys Leu Thr Gln Glu Tyr Thr Asp 195 200 205 His Cys Val Lys
Trp Tyr Asn Val Gly Leu Asp Lys Leu Arg Gly Ser 210 215 220 Ser Tyr
Glu Ser Trp Val Asn Phe Asn Arg Tyr Arg Arg Glu Met Thr 225 230 235
240 Leu Thr Val Leu Asp Leu Ile Ala Leu Phe Pro Leu Tyr Asp Val Arg
245 250 255 Leu Tyr Pro Lys Glu Val Lys Thr Glu Leu Thr Arg Asp Val
Leu Thr 260 265 270 Asp Pro Ile Val Gly Val Asn Asn Leu Arg Gly Tyr
Gly Thr Thr Phe 275 280 285 Ser Asn Ile Glu Asn Tyr Ile Arg Lys Pro
His Leu Phe Asp Tyr Leu 290 295 300 His Arg Ile Gln Phe His Thr Arg
Phe Gln Pro Gly Tyr Tyr Gly Asn 305 310 315 320 Asp Ser Phe Asn Tyr
Trp Ser Gly Asn Tyr Val Ser Thr Arg Pro Ser 325 330 335 Ile Gly Ser
Asn Asp Ile Ile Thr Ser Pro Phe Tyr Gly Asn Lys Ser 340 345 350 Ser
Glu Pro Val Gln Asn Leu Glu Phe Asn Gly Glu Lys Val Tyr Arg 355 360
365 Ala Val Ala Asn Thr Asn Leu Ala Val Trp Pro Ser Ala Val Tyr Ser
370 375 380 Gly Val Thr Lys Val Glu Phe Ser Gln Tyr Asn Asp Gln Thr
Asp Glu 385 390 395 400 Ala Ser Thr Gln Thr Tyr Asp Ser Lys Arg Asn
Val Gly Ala Val Ser 405 410 415 Trp Asp Ser Ile Asp Gln Leu Pro Pro
Glu Thr Thr Asp Glu Pro Leu 420 425 430 Glu Lys Gly Tyr Ser His Gln
Leu Asn Tyr Val Met Cys Phe Leu Met 435 440 445 Gln Gly Ser Arg Gly
Thr Ile Pro Val Leu Thr Trp Thr His Lys Ser 450 455 460 Val Asp Phe
Asn Asn Ile Ile Pro Ser Ser Gln Ile Thr Gln Ile Pro 465 470 475 480
Leu Thr Lys Ser Thr Asn Leu Gly Ser Gly Thr Ser Val Val Lys Gly 485
490 495 Pro Gly Phe Thr Gly Gly Asp Ile Leu Arg Arg Thr Ser Pro Gly
Gln 500 505 510 Ile Ser Thr Leu Arg Val Asn Ile Thr Ala Pro Leu Ser
Gln Arg Tyr 515 520 525 Arg Val Arg Ile Arg Tyr Ala Ser Thr Thr Asn
Leu Gln Phe His Thr 530 535 540 Ser Ile Asp Gly Arg Pro Ile Asn Gln
Gly Asn Phe Ser Ala Thr Met 545 550 555 560 Ser Ser Gly Ser Asn Leu
Gln Ser Gly Ser Phe Arg Thr Val Gly Phe 565 570 575 Thr Thr Pro Phe
Asn Phe Ser Asn Gly Ser Ser Val Phe Thr Leu Ser 580 585 590 Ala His
Val Phe Asn Ser Gly Asn Glu Val Tyr Ile Asp Arg Ile Glu 595 600 605
Phe Val Pro Ala Glu Val Thr 610 615 1541923DNAArtificial
Sequence8AFdm3 T coding sequence 154atgacggccg acaacaacac
cgaggccctg gacagcagca ccaccaagga cgtgatccag 60aagggcatca gcgtggtggg
cgacctgctg ggcgtggtgg gcttcccctt cggcggcgcc 120ctggtgagct
tctacaccaa cttcctgaac accatctggc ccagcgagga cccctggaag
180gccttcatgg agcaggtgga ggccctgatg gaccagaaga tcgccgacta
cgccaagaac 240aaggcactgg ccgagctaca gggcctccag aacaacgtgg
aggactatgt gagcgccctg 300agcagctggc agaagaaccc cgctgcaccg
ttccgcaacc cccacagcca gggccgcatc 360cgcgagctgt tcagccaggc
cgagagccac ttccgcaaca gcatgcccag cttcgccatc 420agcggctacg
aggtgctgtt cctgaccacc tacgcccagg ccgccaacac ccacctgttc
480ctgctgaagg acgcccaaat ctacggagag gagtggggct acgagaagga
ggacatcgcc 540gagttctaca agcgccagct gaagctgacc caggagtaca
ccgaccactg cgtgaagtgg 600tacaacgtgg gtctagacaa gctccgcggc
agcagctacg agagctgggt gaacttcaac 660cgctaccgcc gcgagatgac
cctgaccgtg ctggacctga tcgccctgtt ccccctgtac 720gacgtgcgcc
tgtaccccaa ggaggtgaag accgagctga cccgcgacgt gctgaccgac
780cccatcgtgg gcgtgaacaa cctgcgcggc tacggcacca ccttcagcaa
catcgagaac 840tacatccgca agccccacct gttcgactac ctgcaccgca
tccagttcca cacgcgtttc 900cagcccggct actacggcaa cgacagcttc
aactactgga gcggcaacta cgtgagcacc 960cgccccagca tcggcagcaa
cgacatcatc accagcccct tctacggcaa caagagcagc 1020gagcccgtgc
agaaccttga gttcaacggc gagaaggtgt accgcgccgt ggctaacacc
1080aacctggccg tgtggccctc tgcagtgtac agcggcgtga ccaaggtgga
gttcagccag 1140tacaacgacc agaccgacga ggccagcacc cagacctacg
acagcaagcg caacgtgggc 1200gccgtgagct gggacagcat cgaccagctg
ccccccgaga ccaccgacga gcccctggag 1260aagggctaca gccaccagct
gaactacgtg atgtgcttcc tgatgcaggg cagccgcggc 1320accatccccg
tgctgacctg gacccacaag agcgtcgact tcaacaacat catccccagc
1380agccagatca cccagatccc cctgaccaag agcaccaacc tgggcagcgg
caccagcgtg 1440gtgaagggcc ccggcttcac cggcggcgac atcctgcgcc
gcaccagccc cggccagatc 1500agcaccctgc gcgtgaacat caccgccccc
ctgagccagc gctaccgcgt ccgcatccgc 1560tacgccagca ccaccaacct
gcagttccac accagcatcg acggccgccc catcaaccag 1620ggcaacttca
gcgccaccat gagcagcggc agcaacctgc agagcggcag cttccgcacc
1680gtgggcttca ccaccccctt caacttcagc aacggcagca gcgtgttcac
cctgagcgcc 1740cacgtgttca acagcggcaa cgaggtgtac atcgaccgca
tcgagttcgt gcccgccgag 1800gtgaccttcg aggccgagta cgacctggag
agggctcaga aggccgtgaa cgagctgttc 1860accagcagca accagatcgg
cctgaagacc gacgtgaccg actaccacat cgatcaggtg 1920tag
1923155640PRTArtificial Sequence8AFdm3 T protein 155Met Thr Ala Asp
Asn Asn Thr Glu Ala Leu Asp Ser Ser Thr Thr Lys 1 5 10 15 Asp Val
Ile Gln Lys Gly Ile Ser Val Val Gly Asp Leu Leu Gly Val 20 25 30
Val Gly Phe Pro Phe Gly Gly Ala Leu Val Ser Phe Tyr Thr Asn Phe 35
40 45 Leu Asn Thr Ile Trp Pro Ser Glu Asp Pro Trp Lys Ala Phe Met
Glu 50 55 60 Gln Val Glu Ala Leu Met Asp Gln Lys Ile Ala Asp Tyr
Ala Lys Asn 65 70 75 80 Lys Ala Leu Ala Glu Leu Gln Gly Leu Gln Asn
Asn Val Glu Asp Tyr 85 90 95 Val Ser Ala Leu Ser Ser Trp Gln Lys
Asn Pro Ala Ala Pro Phe Arg 100 105 110 Asn Pro His Ser Gln Gly Arg
Ile Arg Glu Leu Phe Ser Gln Ala Glu 115 120 125 Ser His Phe Arg Asn
Ser Met Pro Ser Phe Ala Ile Ser Gly Tyr Glu 130 135 140 Val Leu Phe
Leu Thr Thr Tyr Ala Gln Ala Ala Asn Thr His Leu Phe 145 150 155 160
Leu Leu Lys Asp Ala Gln Ile Tyr Gly Glu Glu Trp Gly Tyr Glu Lys 165
170 175 Glu Asp Ile Ala Glu Phe Tyr Lys Arg Gln Leu Lys Leu Thr Gln
Glu 180 185 190 Tyr Thr Asp His Cys Val Lys Trp Tyr Asn Val Gly Leu
Asp Lys Leu 195 200 205 Arg Gly Ser Ser Tyr Glu Ser Trp Val Asn Phe
Asn Arg Tyr Arg Arg 210 215 220 Glu Met Thr Leu Thr Val Leu Asp Leu
Ile Ala Leu Phe Pro Leu Tyr 225 230 235 240 Asp Val Arg Leu Tyr Pro
Lys Glu Val Lys Thr Glu Leu Thr Arg Asp 245 250 255 Val Leu Thr Asp
Pro Ile Val Gly Val Asn Asn Leu Arg Gly Tyr Gly 260 265 270 Thr Thr
Phe Ser Asn Ile Glu Asn Tyr Ile Arg Lys Pro His Leu Phe 275 280 285
Asp Tyr Leu His Arg Ile Gln Phe His Thr Arg Phe Gln Pro Gly Tyr 290
295 300 Tyr Gly Asn Asp Ser Phe Asn Tyr Trp Ser Gly Asn Tyr Val Ser
Thr 305 310 315 320 Arg Pro Ser Ile Gly Ser Asn Asp Ile Ile Thr Ser
Pro Phe Tyr Gly 325 330 335 Asn Lys Ser Ser Glu Pro Val Gln Asn Leu
Glu Phe Asn Gly Glu Lys 340 345 350 Val Tyr Arg Ala Val Ala Asn Thr
Asn Leu Ala Val Trp Pro Ser Ala 355 360 365 Val Tyr Ser Gly Val Thr
Lys Val Glu Phe Ser Gln Tyr Asn Asp Gln 370 375 380 Thr Asp Glu Ala
Ser Thr Gln Thr Tyr Asp Ser Lys Arg Asn Val Gly 385 390 395 400 Ala
Val Ser Trp Asp Ser Ile Asp Gln Leu Pro Pro Glu Thr Thr Asp 405 410
415 Glu Pro Leu Glu Lys Gly Tyr Ser His Gln Leu Asn Tyr Val Met Cys
420 425 430 Phe Leu Met Gln Gly Ser Arg Gly Thr Ile Pro Val Leu Thr
Trp Thr 435 440 445 His Lys Ser Val Asp Phe Asn Asn Ile Ile Pro Ser
Ser Gln Ile Thr 450 455 460 Gln Ile Pro Leu Thr Lys Ser Thr Asn Leu
Gly Ser Gly Thr Ser Val 465 470 475 480 Val Lys Gly Pro Gly Phe Thr
Gly Gly Asp Ile Leu Arg Arg Thr Ser 485 490 495 Pro Gly Gln Ile Ser
Thr Leu Arg Val Asn Ile Thr Ala Pro Leu Ser 500 505 510 Gln Arg Tyr
Arg Val Arg Ile Arg Tyr Ala Ser Thr Thr Asn Leu Gln 515 520 525 Phe
His Thr Ser Ile Asp Gly Arg Pro Ile Asn Gln Gly Asn Phe Ser 530 535
540 Ala Thr Met Ser Ser Gly Ser Asn Leu Gln Ser Gly Ser Phe Arg Thr
545 550 555 560 Val Gly Phe Thr Thr Pro Phe Asn Phe Ser Asn Gly Ser
Ser Val Phe 565 570 575 Thr Leu Ser Ala His Val Phe Asn Ser Gly Asn
Glu Val Tyr Ile Asp 580 585 590 Arg Ile Glu Phe Val Pro Ala Glu Val
Thr Phe Glu Ala Glu Tyr Asp 595 600 605 Leu Glu Arg Ala Gln Lys Ala
Val Asn Glu Leu Phe Thr Ser Ser Asn 610 615 620 Gln Ile Gly Leu Lys
Thr Asp Val Thr Asp Tyr His Ile Asp Gln Val 625 630 635 640
1561923DNAArtificial Sequence8AFlongdm3T coding sequence
156atgacggccg acaacaacac cgaggccctg gacagcagca ccaccaagga
cgtgatccag 60aagggcatca gcgtggtggg cgacctgctg ggcgtggtgg gcttcccctt
cggcggcgcc 120ctggtgagct tctacaccaa cttcctgaac accatctggc
ccagcgagga cccctggaag 180gccttcatgg agcaggtgga ggccctgatg
gaccagaaga tcgccgacta cgccaagaac 240aaggcactgg ccgagctaca
gggcctccag aacaacgtgg aggactatgt gagcgccctg 300agcagctggc
agaagaaccc cgctgcaccg ttccgcaacc cccacagcca gggccgcatc
360cgcgagctgt tcagccaggc cgagagccac ttccgcaaca gcatgcccag
cttcgccatc 420agcggctacg aggtgctgtt cctgaccacc tacgcccagg
ccgccaacac ccacctgttc 480ctgctgaagg acgcccaaat ctacggagag
gagtggggct acgagaagga ggacatcgcc 540gagttctaca agcgccagct
gaagctgacc caggagtaca ccgaccactg cgtgaagtgg 600tacaacgtgg
gtctagacaa gctccgcggc agcagctacg agagctgggt gaacttcaac
660cgctaccgcc gcgagatgac cctgaccgtg ctggacctga tcgccctgtt
ccccctgtac 720gacgtgcgcc tgtaccccaa ggaggtgaag accgagctga
cccgcgacgt gctgaccgac 780cccatcgtgg gcgtgaacaa cctgcgcggc
tacggcacca ccttcagcaa catcgagaac 840tacatccgca agccccacct
gttcgactac ctgcaccgca tccagttcca cacgcgtttc 900cagcccggct
actacggcaa cgacagcttc aactactgga gcggcaacta cgtgagcacc
960cgccccagca tcggcagcaa cgacatcatc accagcccct tctacggcaa
caagagcagc 1020gagcccgtgc agaaccttga gttcaacggc gagaaggtgt
accgcgccgt ggctaacacc 1080aacctggccg tgtggccctc tgcagtgtac
agcggcgtga ccaaggtgga gttcagccag 1140tacaacgacc agaccgacga
ggccagcacc cagacctacg acagcaagcg caacgtgggc 1200gccgtgagct
gggacagcat cgaccagctg ccccccgaga ccaccgacga gcccctggag
1260aagggctaca gccaccagct gaactacgtg atgtgcttcc tgatgcaggg
cagccgcggc 1320accatccccg tgctgacctg gacccacaag agcgtcgact
tcttcaacat gatcgacagc 1380aagaagatca cccagctgcc cctggtgaag
gcctacaagc tccagagcgg cgccagcgtg 1440gtggcaggcc cccgcttcac
cggcggcgac atcatccagt gcaccgagaa cggcagcgcc 1500gccaccatct
acgtgacccc cgacgtgagc tacagccaga agtaccgcgc ccgcatccgc
1560tacgccagca ccaccaacct gcagttccac accagcatcg acggccgccc
catcaaccag 1620ggcaacttca gcgccaccat gagcagcggc agcaacctgc
agagcggcag cttccgcacc 1680gtgggcttca ccaccccctt caacttcagc
aacggcagca gcgtgttcac cctgagcgcc 1740cacgtgttca acagcggcaa
cgaggtgtac atcgaccgca tcgagttcgt gcccgccgag 1800gtgaccttcg
aggccgagta cgacctggag agggctcaga aggccgtgaa cgagctgttc
1860accagcagca accagatcgg cctgaagacc gacgtgaccg actaccacat
cgatcaggtg 1920tag 1923157640PRTArtificial Sequence8AFlongdm3T
protein 157Met Thr Ala Asp Asn Asn Thr Glu Ala Leu Asp Ser Ser Thr
Thr Lys 1 5 10 15 Asp Val Ile Gln Lys Gly Ile Ser Val Val Gly Asp
Leu Leu Gly Val 20 25 30 Val Gly Phe Pro Phe Gly Gly Ala Leu Val
Ser Phe Tyr Thr Asn Phe 35 40 45 Leu Asn Thr Ile Trp Pro Ser Glu
Asp Pro Trp Lys Ala Phe Met Glu 50 55 60 Gln Val Glu Ala Leu Met
Asp Gln Lys Ile Ala Asp Tyr Ala Lys Asn 65 70 75 80 Lys Ala Leu Ala
Glu Leu Gln Gly Leu Gln Asn Asn Val Glu Asp Tyr 85 90 95 Val Ser
Ala Leu Ser Ser Trp Gln Lys Asn Pro Ala Ala Pro Phe Arg 100 105 110
Asn Pro His Ser Gln Gly Arg Ile Arg Glu Leu Phe Ser Gln Ala Glu 115
120 125 Ser His Phe Arg Asn Ser Met Pro Ser Phe Ala Ile Ser Gly Tyr
Glu 130 135 140 Val Leu Phe Leu Thr Thr Tyr Ala Gln Ala Ala Asn Thr
His Leu Phe 145 150 155 160 Leu Leu Lys Asp Ala Gln Ile Tyr Gly Glu
Glu Trp Gly Tyr Glu Lys 165 170 175 Glu Asp Ile Ala Glu Phe Tyr Lys
Arg Gln Leu Lys Leu Thr Gln Glu 180 185 190 Tyr Thr Asp His Cys Val
Lys Trp Tyr Asn Val Gly Leu Asp Lys Leu 195 200 205 Arg Gly Ser Ser
Tyr Glu Ser Trp Val Asn Phe Asn Arg Tyr Arg Arg 210 215 220 Glu Met
Thr Leu Thr Val Leu Asp Leu Ile Ala Leu Phe Pro Leu Tyr 225 230 235
240 Asp Val Arg Leu Tyr Pro Lys Glu Val Lys Thr Glu Leu Thr Arg Asp
245 250 255 Val Leu Thr Asp Pro Ile Val Gly Val Asn Asn Leu Arg Gly
Tyr Gly 260 265 270 Thr Thr Phe Ser Asn Ile Glu Asn Tyr Ile Arg Lys
Pro His Leu Phe 275 280 285 Asp Tyr Leu His Arg Ile Gln Phe His Thr
Arg Phe Gln Pro Gly Tyr 290 295 300 Tyr Gly Asn Asp Ser Phe Asn Tyr
Trp Ser Gly Asn Tyr Val Ser Thr 305 310 315 320 Arg Pro Ser Ile Gly
Ser Asn Asp Ile Ile Thr Ser Pro Phe Tyr Gly 325 330 335 Asn Lys Ser
Ser Glu Pro Val Gln Asn Leu Glu Phe Asn Gly Glu Lys 340 345 350 Val
Tyr Arg Ala Val Ala Asn Thr Asn Leu Ala Val Trp Pro Ser Ala 355 360
365 Val Tyr Ser Gly Val Thr Lys Val Glu Phe Ser Gln Tyr Asn Asp Gln
370 375 380 Thr Asp Glu Ala Ser Thr Gln Thr Tyr Asp Ser Lys Arg Asn
Val Gly 385 390 395 400 Ala Val Ser Trp Asp Ser Ile Asp Gln Leu Pro
Pro Glu Thr Thr Asp 405 410 415 Glu Pro Leu Glu Lys Gly Tyr Ser His
Gln Leu Asn Tyr Val Met Cys 420 425 430 Phe Leu Met Gln Gly Ser Arg
Gly Thr Ile Pro Val Leu Thr Trp Thr 435 440 445 His Lys Ser Val Asp
Phe Phe Asn Met Ile Asp Ser Lys Lys Ile Thr 450 455 460 Gln Leu Pro
Leu Val Lys Ala Tyr Lys Leu Gln Ser Gly Ala Ser Val 465 470 475 480
Val Ala Gly Pro Arg Phe Thr Gly Gly Asp Ile Ile Gln Cys Thr Glu 485
490 495 Asn Gly Ser Ala Ala Thr Ile Tyr Val Thr Pro Asp Val Ser Tyr
Ser 500 505 510 Gln Lys Tyr Arg Ala Arg Ile Arg Tyr Ala Ser Thr Thr
Asn Leu Gln 515 520 525 Phe His Thr Ser Ile Asp Gly Arg Pro Ile Asn
Gln Gly Asn
Phe Ser 530 535 540 Ala Thr Met Ser Ser Gly Ser Asn Leu Gln Ser Gly
Ser Phe Arg Thr 545 550 555 560 Val Gly Phe Thr Thr Pro Phe Asn Phe
Ser Asn Gly Ser Ser Val Phe 565 570 575 Thr Leu Ser Ala His Val Phe
Asn Ser Gly Asn Glu Val Tyr Ile Asp 580 585 590 Arg Ile Glu Phe Val
Pro Ala Glu Val Thr Phe Glu Ala Glu Tyr Asp 595 600 605 Leu Glu Arg
Ala Gln Lys Ala Val Asn Glu Leu Phe Thr Ser Ser Asn 610 615 620 Gln
Ile Gly Leu Lys Thr Asp Val Thr Asp Tyr His Ile Asp Gln Val 625 630
635 640 1581962DNAArtificial Sequencecap8AFdm3T coding serquence
158atgactagta acggccgcca gtgtgctggt attcgccctt atgacggccg
acaacaacac 60cgaggcctgg acagcagcac caccaaggac gtgatccaga agggcatcag
cgtggtgggc 120gacctgctgg gcgtggtggg cttccccttc ggcggcgccc
tggtgagctt ctacaccaac 180ttcctgaaca ccatctggcc cagcgaggac
ccctggaagg ccttcatgga gcaggtggag 240gccctgatgg accagaagat
cgccgactac gccaagaaca aggcactggc cgagctacag 300ggcctccaga
acaacgtgga ggactatgtg agcgccctga gcagctggca gaagaacccc
360gctgcaccgt tccgcaaccc ccacagccag ggccgcatcc gcgagctgtt
cagccaggcc 420gagagccact tccgcaacag catgcccagc ttcgccatca
gcggctacga ggtgctgttc 480ctgaccacct acgcccaggc cgccaacacc
cacctgttcc tgctgaagga cgcccaaatc 540tacggagagg agtggggcta
cgagaaggag gacatcgccg agttctacaa gcgccagctg 600aagctgaccc
aggagtacac cgaccactgc gtgaagtggt acaacgtggg tctagacaag
660ctccgcggca gcagctacga gagctgggtg aacttcaacc gctaccgccg
cgagatgacc 720ctgaccgtgc tggacctgat cgccctgttc cccctgtacg
acgtgcgcct gtaccccaag 780gaggtgaaga ccgagctgac ccgcgacgtg
ctgaccgacc ccatcgtggg cgtgaacaac 840ctgcgcggct acggcaccac
cttcagcaac atcgagaact acatccgcaa gccccacctg 900ttcgactacc
tgcaccgcat ccagttccac acgcgtttcc agcccggcta ctacggcaac
960gacagcttca actactggag cggcaactac gtgagcaccc gccccagcat
cggcagcaac 1020gacatcatca ccagcccctt ctacggcaac aagagcagcg
agcccgtgca gaaccttgag 1080ttcaacggcg agaaggtgta ccgcgccgtg
gctaacacca acctggccgt gtggccctct 1140gcagtgtaca gcggcgtgac
caaggtggag ttcagccagt acaacgacca gaccgacgag 1200gccagcaccc
agacctacga cagcaagcgc aacgtgggcg ccgtgagctg ggacagcatc
1260gaccagctgc cccccgagac caccgacgag cccctggaga agggctacag
ccaccagctg 1320aactacgtga tgtgcttcct gatgcagggc agccgcggca
ccatccccgt gctgacctgg 1380acccacaaga gcgtcgactt caacaacatc
atccccagca gccagatcac ccagatcccc 1440ctgaccaaga gcaccaacct
gggcagcggc accagcgtgg tgaagggccc cggcttcacc 1500ggcggcgaca
tcctgcgccg caccagcccc ggccagatca gcaccctgcg cgtgaacatc
1560accgcccccc tgagccagcg ctaccgcgtc cgcatccgct acgccagcac
caccaacctg 1620cagttccaca ccagcatcga cggccgcccc atcaaccagg
gcaacttcag cgccaccatg 1680agcagcggca gcaacctgca gagcggcagc
ttccgcaccg tgggcttcac cacccccttc 1740aacttcagca acggcagcag
cgtgttcacc ctgagcgccc acgtgttcaa cagcggcaac 1800gaggtgtaca
tcgaccgcat cgagttcgtg cccgccgagg tgaccttcga ggccgagtac
1860gacctggaga gggctcagaa ggccgtgaac gagctgttca ccagcagcaa
ccagatcggc 1920ctgaagaccg acgtgaccga ctaccacatc gatcaggtgt ag
1962159653PRTArtificial Sequencecap8AFdm3T protein 159Met Thr Ser
Asn Gly Arg Gln Cys Ala Gly Ile Arg Pro Tyr Asp Gly 1 5 10 15 Arg
Gln Gln His Arg Gly Leu Asp Ser Ser Thr Thr Lys Asp Val Ile 20 25
30 Gln Lys Gly Ile Ser Val Val Gly Asp Leu Leu Gly Val Val Gly Phe
35 40 45 Pro Phe Gly Gly Ala Leu Val Ser Phe Tyr Thr Asn Phe Leu
Asn Thr 50 55 60 Ile Trp Pro Ser Glu Asp Pro Trp Lys Ala Phe Met
Glu Gln Val Glu 65 70 75 80 Ala Leu Met Asp Gln Lys Ile Ala Asp Tyr
Ala Lys Asn Lys Ala Leu 85 90 95 Ala Glu Leu Gln Gly Leu Gln Asn
Asn Val Glu Asp Tyr Val Ser Ala 100 105 110 Leu Ser Ser Trp Gln Lys
Asn Pro Ala Ala Pro Phe Arg Asn Pro His 115 120 125 Ser Gln Gly Arg
Ile Arg Glu Leu Phe Ser Gln Ala Glu Ser His Phe 130 135 140 Arg Asn
Ser Met Pro Ser Phe Ala Ile Ser Gly Tyr Glu Val Leu Phe 145 150 155
160 Leu Thr Thr Tyr Ala Gln Ala Ala Asn Thr His Leu Phe Leu Leu Lys
165 170 175 Asp Ala Gln Ile Tyr Gly Glu Glu Trp Gly Tyr Glu Lys Glu
Asp Ile 180 185 190 Ala Glu Phe Tyr Lys Arg Gln Leu Lys Leu Thr Gln
Glu Tyr Thr Asp 195 200 205 His Cys Val Lys Trp Tyr Asn Val Gly Leu
Asp Lys Leu Arg Gly Ser 210 215 220 Ser Tyr Glu Ser Trp Val Asn Phe
Asn Arg Tyr Arg Arg Glu Met Thr 225 230 235 240 Leu Thr Val Leu Asp
Leu Ile Ala Leu Phe Pro Leu Tyr Asp Val Arg 245 250 255 Leu Tyr Pro
Lys Glu Val Lys Thr Glu Leu Thr Arg Asp Val Leu Thr 260 265 270 Asp
Pro Ile Val Gly Val Asn Asn Leu Arg Gly Tyr Gly Thr Thr Phe 275 280
285 Ser Asn Ile Glu Asn Tyr Ile Arg Lys Pro His Leu Phe Asp Tyr Leu
290 295 300 His Arg Ile Gln Phe His Thr Arg Phe Gln Pro Gly Tyr Tyr
Gly Asn 305 310 315 320 Asp Ser Phe Asn Tyr Trp Ser Gly Asn Tyr Val
Ser Thr Arg Pro Ser 325 330 335 Ile Gly Ser Asn Asp Ile Ile Thr Ser
Pro Phe Tyr Gly Asn Lys Ser 340 345 350 Ser Glu Pro Val Gln Asn Leu
Glu Phe Asn Gly Glu Lys Val Tyr Arg 355 360 365 Ala Val Ala Asn Thr
Asn Leu Ala Val Trp Pro Ser Ala Val Tyr Ser 370 375 380 Gly Val Thr
Lys Val Glu Phe Ser Gln Tyr Asn Asp Gln Thr Asp Glu 385 390 395 400
Ala Ser Thr Gln Thr Tyr Asp Ser Lys Arg Asn Val Gly Ala Val Ser 405
410 415 Trp Asp Ser Ile Asp Gln Leu Pro Pro Glu Thr Thr Asp Glu Pro
Leu 420 425 430 Glu Lys Gly Tyr Ser His Gln Leu Asn Tyr Val Met Cys
Phe Leu Met 435 440 445 Gln Gly Ser Arg Gly Thr Ile Pro Val Leu Thr
Trp Thr His Lys Ser 450 455 460 Val Asp Phe Asn Asn Ile Ile Pro Ser
Ser Gln Ile Thr Gln Ile Pro 465 470 475 480 Leu Thr Lys Ser Thr Asn
Leu Gly Ser Gly Thr Ser Val Val Lys Gly 485 490 495 Pro Gly Phe Thr
Gly Gly Asp Ile Leu Arg Arg Thr Ser Pro Gly Gln 500 505 510 Ile Ser
Thr Leu Arg Val Asn Ile Thr Ala Pro Leu Ser Gln Arg Tyr 515 520 525
Arg Val Arg Ile Arg Tyr Ala Ser Thr Thr Asn Leu Gln Phe His Thr 530
535 540 Ser Ile Asp Gly Arg Pro Ile Asn Gln Gly Asn Phe Ser Ala Thr
Met 545 550 555 560 Ser Ser Gly Ser Asn Leu Gln Ser Gly Ser Phe Arg
Thr Val Gly Phe 565 570 575 Thr Thr Pro Phe Asn Phe Ser Asn Gly Ser
Ser Val Phe Thr Leu Ser 580 585 590 Ala His Val Phe Asn Ser Gly Asn
Glu Val Tyr Ile Asp Arg Ile Glu 595 600 605 Phe Val Pro Ala Glu Val
Thr Phe Glu Ala Glu Tyr Asp Leu Glu Arg 610 615 620 Ala Gln Lys Ala
Val Asn Glu Leu Phe Thr Ser Ser Asn Gln Ile Gly 625 630 635 640 Leu
Lys Thr Asp Val Thr Asp Tyr His Ile Asp Gln Val 645 650
160687PRTArtificial SequenceFR8a+34 protein 160Met Lys Glu Thr Ala
Ala Ala Lys Phe Glu Arg Gln His Met Asp Ser 1 5 10 15 Pro Asp Leu
Gly Thr Leu Val Pro Arg Gly Ser Met Ala Asp Ile Gly 20 25 30 Ser
Thr Met Thr Ser Asn Gly Arg Gln Cys Ala Gly Ile Arg Pro Tyr 35 40
45 Asp Gly Arg Gln Gln His Arg Gly Leu Asp Ser Ser Thr Thr Lys Asp
50 55 60 Val Ile Gln Lys Gly Ile Ser Val Val Gly Asp Leu Leu Gly
Val Val 65 70 75 80 Gly Phe Pro Phe Gly Gly Ala Leu Val Ser Phe Tyr
Thr Asn Phe Leu 85 90 95 Asn Thr Ile Trp Pro Ser Glu Asp Pro Trp
Lys Ala Phe Met Glu Gln 100 105 110 Val Glu Ala Leu Met Asp Gln Lys
Ile Ala Asp Tyr Ala Lys Asn Lys 115 120 125 Ala Leu Ala Glu Leu Gln
Gly Leu Gln Asn Asn Val Glu Asp Tyr Val 130 135 140 Ser Ala Leu Ser
Ser Trp Gln Lys Asn Pro Ala Ala Pro Phe Arg Asn 145 150 155 160 Pro
His Ser Gln Gly Arg Ile Arg Glu Leu Phe Ser Gln Ala Glu Ser 165 170
175 His Phe Arg Asn Ser Met Pro Ser Phe Ala Ile Ser Gly Tyr Glu Val
180 185 190 Leu Phe Leu Thr Thr Tyr Ala Gln Ala Ala Asn Thr His Leu
Phe Leu 195 200 205 Leu Lys Asp Ala Gln Ile Tyr Gly Glu Glu Trp Gly
Tyr Glu Lys Glu 210 215 220 Asp Ile Ala Glu Phe Tyr Lys Arg Gln Leu
Lys Leu Thr Gln Glu Tyr 225 230 235 240 Thr Asp His Cys Val Lys Trp
Tyr Asn Val Gly Leu Asp Lys Leu Arg 245 250 255 Gly Ser Ser Tyr Glu
Ser Trp Val Asn Phe Asn Arg Tyr Arg Arg Glu 260 265 270 Met Thr Leu
Thr Val Leu Asp Leu Ile Ala Leu Phe Pro Leu Tyr Asp 275 280 285 Val
Arg Leu Tyr Pro Lys Glu Val Lys Thr Glu Leu Thr Arg Asp Val 290 295
300 Leu Thr Asp Pro Ile Val Gly Val Asn Asn Leu Arg Gly Tyr Gly Thr
305 310 315 320 Thr Phe Ser Asn Ile Glu Asn Tyr Ile Arg Lys Pro His
Leu Phe Asp 325 330 335 Tyr Leu His Arg Ile Gln Phe His Thr Arg Phe
Gln Pro Gly Tyr Tyr 340 345 350 Gly Asn Asp Ser Phe Asn Tyr Trp Ser
Gly Asn Tyr Val Ser Thr Arg 355 360 365 Pro Ser Ile Gly Ser Asn Asp
Ile Ile Thr Ser Pro Phe Tyr Gly Asn 370 375 380 Lys Ser Ser Glu Pro
Val Gln Asn Leu Glu Phe Asn Gly Glu Lys Val 385 390 395 400 Tyr Arg
Ala Val Ala Asn Thr Asn Leu Ala Val Trp Pro Ser Ala Val 405 410 415
Tyr Ser Gly Val Thr Lys Val Glu Phe Ser Gln Tyr Asn Asp Gln Thr 420
425 430 Asp Glu Ala Ser Thr Gln Thr Tyr Asp Ser Lys Arg Asn Val Gly
Ala 435 440 445 Val Ser Trp Asp Ser Ile Asp Gln Leu Pro Pro Glu Thr
Thr Asp Glu 450 455 460 Pro Leu Glu Lys Gly Tyr Ser His Gln Leu Asn
Tyr Val Met Cys Phe 465 470 475 480 Leu Met Gln Gly Ser Arg Gly Thr
Ile Pro Val Leu Thr Trp Thr His 485 490 495 Lys Ser Val Asp Phe Phe
Asn Met Ile Asp Ser Lys Lys Ile Thr Gln 500 505 510 Leu Pro Leu Thr
Lys Ser Thr Asn Leu Gly Ser Gly Thr Ser Val Val 515 520 525 Lys Gly
Pro Gly Phe Thr Gly Gly Asp Ile Leu Arg Arg Thr Ser Pro 530 535 540
Gly Gln Ile Ser Thr Leu Arg Val Asn Ile Thr Ala Pro Leu Ser Gln 545
550 555 560 Arg Tyr Arg Val Arg Ile Arg Tyr Ala Ser Thr Thr Asn Leu
Gln Phe 565 570 575 His Thr Ser Ile Asp Gly Arg Pro Ile Asn Gln Gly
Asn Phe Ser Ala 580 585 590 Thr Met Ser Ser Gly Ser Asn Leu Gln Ser
Gly Ser Phe Arg Thr Val 595 600 605 Gly Phe Thr Thr Pro Phe Asn Phe
Ser Asn Gly Ser Ser Val Phe Thr 610 615 620 Leu Ser Ala His Val Phe
Asn Ser Gly Asn Glu Val Tyr Ile Asp Arg 625 630 635 640 Ile Glu Phe
Val Pro Ala Glu Val Thr Phe Glu Ala Glu Tyr Asp Leu 645 650 655 Glu
Arg Ala Gln Lys Ala Val Asn Glu Leu Phe Thr Ser Ser Asn Gln 660 665
670 Ile Gly Leu Lys Thr Asp Val Thr Asp Tyr His Ile Asp Gln Val 675
680 685
uspto.report is an independent third-party trademark research tool that is not affiliated, endorsed, or sponsored by the United States Patent and Trademark Office (USPTO) or any other governmental organization. The information provided by uspto.report is based on publicly available data at the time of writing and is intended for informational purposes only.
While we strive to provide accurate and up-to-date information, we do not guarantee the accuracy, completeness, reliability, or suitability of the information displayed on this site. The use of this site is at your own risk. Any reliance you place on such information is therefore strictly at your own risk.
All official trademark data, including owner information, should be verified by visiting the official USPTO website at www.uspto.gov. This site is not intended to replace professional legal advice and should not be used as a substitute for consulting with a legal professional who is knowledgeable about trademark law.