U.S. patent application number 12/244950 was filed with the patent office on 2009-05-07 for peroxisome biogenesis factor protein (pex) disruptions for altering polyunsaturated fatty acids and total lipid content in oleaginous eukaryotic organisms. This patent application is currently assigned to E. I. DU PONT DE NEMOURS AND COMPANY. Invention is credited to SEUNG-PYO HONG, PAMELA L. SHARPE, ZHIXIONG XUE, NARENDRA S. YADAV, QUINN QUN ZHU.
Application Number | 20090117253 12/244950 |
Document ID | / |
Family ID | 40084411 |
Filed Date | 2009-05-07 |
United States Patent Application | 20090117253 |
Kind Code | A1 |
HONG; SEUNG-PYO ; et al. | May 7, 2009 |
Methods of increasing the amount of polyunsaturated fatty acids (PUFAs) in the total lipid fraction and in the oil fraction of PUFA-producing, oleaginous eukaryotes, accomplished by modifying the activity of peroxisome biogenesis factor (Pex) proteins. Disruptions of a chromosomal Pex3 gene, Pex10p gene or Pex16p gene in a PUFA-producing, oleaginous eukaryotic strain resulted in an increased amount of PUFAs, as a percent of total fatty acids and as a percent of dry cell weight, in the total lipid fraction and in the oil fraction of the strain, as compared to the parental strain whose native Pex protein was not disrupted.
Inventors: | HONG; SEUNG-PYO; (HOCKESSIN, DE) ; SHARPE; PAMELA L.; (WILMINGTON, DE) ; XUE; ZHIXIONG; (CHADDS FORD, PA) ; YADAV; NARENDRA S.; (WILMINGTON, DE) ; ZHU; QUINN QUN; (WEST CHESTER, PA) |
Correspondence Address: |
E I DU PONT DE NEMOURS AND COMPANY;LEGAL PATENT RECORDS CENTER BARLEY MILL PLAZA 25/1122B, 4417 LANCASTER PIKE WILMINGTON DE 19805 US |
Assignee: | E. I. DU PONT DE NEMOURS AND
COMPANY WILMINGTON DE |
Family ID: | 40084411 |
Appl. No.: | 12/244950 |
Filed: | October 3, 2008 |
Application Number | Filing Date | Patent Number | ||
---|---|---|---|---|
60977174 | Oct 3, 2007 | |||
60977177 | Oct 3, 2007 | |||
Current U.S. Class: | 426/601 ; 435/254.2; 435/471 |
Current CPC Class: | C12P 7/6472 20130101; A61P 35/00 20180101; A61P 19/02 20180101; A61P 25/28 20180101; A61P 5/24 20180101; C12N 9/1029 20130101; A61P 9/00 20180101; C12P 7/6427 20130101; A61P 29/00 20180101; A61P 25/18 20180101; A61P 9/10 20180101; A61P 9/12 20180101; C12N 9/0083 20130101; A61P 19/10 20180101; A61P 25/24 20180101; C12N 15/815 20130101; A61P 3/10 20180101; A61P 17/02 20180101; A61P 3/06 20180101; A61P 1/16 20180101; A61P 25/00 20180101; A61P 1/00 20180101; A61P 3/00 20180101 |
Class at Publication: | 426/601 ; 435/471; 435/254.2 |
International Class: | A23K 1/16 20060101 A23K001/16; C12N 15/63 20060101 C12N015/63; C12N 1/19 20060101 C12N001/19 |
Sequence CWU 1
1
8611024PRTYarrowia lipolyticaMISC_FEATURE(1)..(1024)YlPex1p;
GenBank Accession No. CAG82178 1Met Thr Ser Lys Ser Asp Tyr Ser Gly
Lys Asp Lys Ile Glu Leu Asp1 5 10 15Pro Val Phe Ala Lys Ser Ile Asp
Leu Leu Pro Asn Thr Gln Val Val 20 25 30Ile Asp Ile Gln Leu Asn Pro
Lys Ile Ala His Thr Ile His Leu Glu 35 40 45Pro Val Thr Val Ala Asp
Trp Glu Ile Val Glu Leu His Ala Ala Tyr 50 55 60Leu Glu Ser Arg Met
Ile Asn Gln Val Arg Ala Val Ser Pro Asn Gln65 70 75 80Pro Val Thr
Val Tyr Pro Ser Ser Thr Thr Ser Ala Thr Leu Lys Val 85 90 95Ile Arg
Ile Glu Pro Asp Leu Gly Ala Ala Gly Phe Ala Lys Leu Ser 100 105
110Pro Asp Ser Glu Val Val Val Ala Pro Lys Gln Arg Lys Lys Glu Glu
115 120 125Lys Gln Val Lys Lys Arg Ser Gly Ser Ala Arg Ser Thr Gly
Ser Gln 130 135 140Lys Arg Lys Gly Gly Arg Gly Pro His Ala Leu Arg
Arg Ala Ile Ser145 150 155 160Glu Asp Phe Asp Gly His Leu Arg Leu
Glu Val Ser Leu Asp Val Ser 165 170 175Gln Leu Pro Pro Glu Phe His
Gln Leu Lys Asn Val Ser Ile Lys Val 180 185 190Ile Thr Pro Pro Asn
Leu Ala Ser Pro Gln Gln Ala Ala Ser Ile Ala 195 200 205Val Glu Glu
Lys Ser Glu Glu Ser Leu Ser Gln Asn Lys Pro Pro Ser 210 215 220Ser
Glu Pro Lys Val Glu Val Pro Pro Asp Ile Ile Asn Pro Ala Ser225 230
235 240Glu Ile Val Ala Thr Leu Val Asn Asp Thr Thr Ser Pro Thr Gly
His 245 250 255Ala Lys Leu Ser Tyr Ala Leu Ala Asp Ala Leu Gly Ile
Pro Ser Ser 260 265 270Val Gly His Val Ile Arg Phe Glu Ser Ala Ser
Lys Pro Leu Ser Gln 275 280 285Lys Pro Gly Ala Leu Val Ile His Arg
Phe Ile Thr Lys Thr Val Gly 290 295 300Ala Ala Glu Gln Lys Ser Leu
Arg Leu Lys Gly Glu Lys Asn Ala Asp305 310 315 320Asp Gly Val Ser
Ala Asp Asp Gln Phe Ser Leu Leu Glu Glu Leu Lys 325 330 335Lys Leu
Gln Met Leu Glu Gly Pro Ile Thr Asn Phe Gln Arg Leu Pro 340 345
350Pro Ile Pro Glu Leu Leu Pro Leu Gly Gly Val Ile Gly Leu Gln Asn
355 360 365Ser Glu Gly Trp Ile Gln Gly Gly Tyr Leu Gly Glu Glu Pro
Ile Pro 370 375 380Phe Val Ser Gly Ser Glu Ile Leu Arg Ser Glu Ser
Ser Leu Ser Pro385 390 395 400Ser Asn Ile Glu Ser Glu Asp Lys Arg
Val Val Gly Leu Asp Asn Met 405 410 415Leu Asn Lys Ile Asn Glu Val
Leu Ser Arg Asp Ser Ile Gly Cys Leu 420 425 430Val Tyr Gly Ser Arg
Gly Ser Gly Lys Ser Ala Val Leu Asn His Ile 435 440 445Lys Lys Glu
Cys Lys Val Ser His Thr His Thr Val Ser Ile Ala Cys 450 455 460Gly
Leu Ile Ala Gln Asp Arg Val Gln Ala Val Arg Glu Ile Leu Thr465 470
475 480Lys Ala Phe Leu Glu Ala Ser Trp Phe Ser Pro Ser Val Leu Phe
Leu 485 490 495Asp Asp Ile Asp Ala Leu Met Pro Ala Glu Val Glu His
Ala Asp Ser 500 505 510Ser Arg Thr Arg Gln Leu Thr Gln Leu Phe Leu
Glu Leu Ala Leu Pro 515 520 525Ile Met Lys Ser Arg His Val Ser Val
Val Ala Ser Ala Gln Ala Lys 530 535 540Glu Ser Leu His Met Asn Leu
Val Thr Gly His Val Phe Glu Glu Leu545 550 555 560Phe His Leu Lys
Ser Pro Asp Lys Glu Ala Arg Leu Ala Ile Leu Ser 565 570 575Glu Ala
Val Lys Leu Met Asp Gln Asn Val Ser Phe Ser Gln Asn Asp 580 585
590Val Leu Glu Ile Ala Ser Gln Val Asp Gly Tyr Leu Pro Gly Asp Leu
595 600 605Trp Thr Leu Ser Glu Arg Ala Gln His Glu Met Ala Leu Arg
Gln Ile 610 615 620Glu Ile Gly Leu Glu Asn Pro Ser Ile Gln Leu Ala
Asp Phe Met Lys625 630 635 640Ala Leu Glu Asp Phe Val Pro Ser Ser
Leu Arg Gly Val Lys Leu Gln 645 650 655Lys Ser Asn Val Lys Trp Asn
Asp Ile Gly Gly Leu Lys Glu Thr Lys 660 665 670Ala Val Leu Leu Glu
Thr Leu Glu Trp Pro Thr Lys Tyr Ala Pro Ile 675 680 685Phe Ala Ser
Cys Pro Leu Arg Leu Arg Ser Gly Leu Leu Leu Tyr Gly 690 695 700Tyr
Pro Gly Cys Gly Lys Thr Tyr Leu Ala Ser Ala Val Ala Ala Gln705 710
715 720Cys Gly Leu Asn Phe Ile Ser Ile Lys Gly Pro Glu Ile Leu Asn
Lys 725 730 735Tyr Ile Gly Ala Ser Glu Gln Ser Val Arg Glu Leu Phe
Glu Arg Ala 740 745 750Gln Ala Ala Lys Pro Cys Ile Leu Phe Phe Asp
Glu Phe Asp Ser Ile 755 760 765Ala Pro Lys Arg Gly His Asp Ser Thr
Gly Val Thr Asp Arg Val Val 770 775 780Asn Gln Met Leu Thr Gln Met
Asp Gly Ala Glu Gly Leu Asp Gly Val785 790 795 800Tyr Val Leu Ala
Ala Thr Ser Arg Pro Asp Leu Ile Asp Pro Ala Leu 805 810 815Leu Arg
Pro Gly Arg Leu Asp Lys Met Leu Ile Cys Asp Leu Pro Ser 820 825
830Tyr Glu Asp Arg Leu Asp Ile Leu Arg Ala Ile Val Asp Gly Lys Met
835 840 845His Leu Asp Gly Glu Val Glu Leu Glu Tyr Val Ala Ser Arg
Thr Asp 850 855 860Gly Phe Ser Gly Ala Asp Leu Gln Ala Val Met Phe
Asn Ala Tyr Leu865 870 875 880Glu Ala Ile His Glu Val Val Asp Val
Ala Asp Asp Thr Ala Ala Asp 885 890 895Thr Pro Ala Leu Glu Asp Lys
Arg Leu Glu Phe Phe Gln Thr Thr Leu 900 905 910Gly Asp Ala Lys Lys
Asp Pro Ala Ala Val Gln Asn Glu Val Met Asn 915 920 925Ala Arg Ala
Ala Val Ala Glu Lys Ala Arg Val Thr Ala Lys Leu Glu 930 935 940Ala
Leu Phe Lys Gly Met Ser Val Gly Val Asp Asn Asp Asp Asp Lys945 950
955 960Pro Arg Lys Lys Ala Val Val Val Ile Lys Pro Gln His Met Asn
Lys 965 970 975Ser Leu Asp Glu Thr Ser Pro Ser Ile Ser Lys Lys Glu
Leu Leu Lys 980 985 990Leu Lys Gly Ile Tyr Ser Gln Phe Val Ser Gly
Arg Ser Gly Asp Met 995 1000 1005Pro Pro Gly Thr Ala Ser Thr Asp
Val Gly Gly Arg Ala Thr Leu 1010 1015 1020Ala2381PRTYarrowia
lipolyticaMISC_FEATURE(1)..(381)YlPex2p; GenBank Accession No.
CAG77647 2Met Ser Ser Val Leu Arg Leu Phe Lys Ile Gly Ala Pro Val
Pro Asn1 5 10 15Val Arg Val His Gln Leu Asp Ala Ser Leu Leu Asp Ala
Glu Leu Val 20 25 30Asp Leu Leu Lys Asn Gln Leu Phe Lys Gly Phe Thr
Asn Phe His Pro 35 40 45Glu Phe Arg Asp Lys Tyr Glu Ser Glu Leu Val
Leu Ala Leu Lys Leu 50 55 60Ile Leu Phe Lys Leu Thr Val Trp Asp His
Ala Ile Thr Tyr Gly Gly65 70 75 80Lys Leu Gln Asn Leu Lys Phe Ile
Asp Ser Arg His Ser Ser Lys Leu 85 90 95Gln Ile Gln Pro Ser Val Ile
Gln Lys Leu Gly Tyr Gly Ile Leu Val 100 105 110Val Gly Gly Gly Tyr
Leu Trp Ser Lys Ile Glu Gly Tyr Leu Leu Ala 115 120 125Arg Ser Glu
Asp Asp Val Ala Thr Asp Gly Thr Ser Val Arg Gly Ala 130 135 140Ser
Ala Ala Arg Gly Ala Leu Lys Val Ala Asn Phe Ala Ser Leu Leu145 150
155 160Tyr Ser Ala Ala Thr Leu Gly Asn Phe Val Ala Phe Leu Tyr Thr
Gly 165 170 175Arg Tyr Ala Thr Val Ile Met Arg Leu Leu Arg Ile Arg
Leu Val Pro 180 185 190Ser Gln Arg Thr Ser Ser Arg Gln Val Ser Tyr
Glu Phe Gln Asn Arg 195 200 205Gln Leu Val Trp Asn Ala Phe Thr Glu
Phe Leu Ile Phe Ile Leu Pro 210 215 220Leu Leu Gln Leu Pro Lys Leu
Lys Arg Arg Ile Glu Arg Lys Leu Gln225 230 235 240Ser Leu Asn Val
Thr Arg Val Gly Asn Val Glu Glu Ala Ser Glu Gly 245 250 255Glu Leu
Ala His Leu Pro Gln Lys Thr Cys Ala Ile Cys Phe Arg Asp 260 265
270Glu Glu Glu Gln Glu Gly Gly Gly Gly Ala Ser His Tyr Ser Thr Asp
275 280 285Val Thr Asn Pro Tyr Gln Ala Asp Cys Gly His Val Tyr Cys
Tyr Val 290 295 300Cys Leu Val Thr Lys Leu Ala Gln Gly Asp Gly Asp
Gly Trp Asn Cys305 310 315 320Tyr Arg Cys Ala Lys Gln Val Gln Lys
Met Lys Pro Trp Val Asp Val 325 330 335Asp Glu Ala Ala Val Val Gly
Ala Ala Glu Met His Glu Lys Val Asp 340 345 350Val Ile Glu His Ala
Glu Asp Asn Glu Gln Glu Glu Glu Glu Phe Asp 355 360 365Asp Asp Asp
Glu Asp Ser Asn Phe Gln Leu Met Lys Asp 370 375 3803431PRTYarrowia
lipolyticaMISC_FEATURE(1)..(431)YlPex3p; GenBank Accession No.
CAG78565 3Met Asp Phe Phe Arg Arg His Gln Lys Lys Val Leu Ala Leu
Val Gly1 5 10 15Val Ala Leu Ser Ser Tyr Leu Phe Ile Asp Tyr Val Lys
Lys Lys Phe 20 25 30Phe Glu Ile Gln Gly Arg Leu Ser Ser Glu Arg Thr
Ala Lys Gln Asn 35 40 45Leu Arg Arg Arg Phe Glu Gln Asn Gln Gln Asp
Ala Asp Phe Thr Ile 50 55 60Met Ala Leu Leu Ser Ser Leu Thr Thr Pro
Val Met Glu Arg Tyr Pro65 70 75 80Val Asp Gln Ile Lys Ala Glu Leu
Gln Ser Lys Arg Arg Pro Thr Asp 85 90 95Arg Val Leu Ala Leu Glu Ser
Ser Thr Ser Ser Ser Ala Thr Ala Gln 100 105 110Thr Val Pro Thr Met
Thr Ser Gly Ala Thr Glu Glu Gly Glu Lys Ser 115 120 125Lys Thr Gln
Leu Trp Gln Asp Leu Lys Arg Thr Thr Ile Ser Arg Ala 130 135 140Phe
Ser Leu Val Tyr Ala Asp Ala Leu Leu Ile Phe Phe Thr Arg Leu145 150
155 160Gln Leu Asn Ile Leu Gly Arg Arg Asn Tyr Val Asn Ser Val Val
Ala 165 170 175Leu Ala Gln Gln Gly Arg Glu Gly Asn Ala Glu Gly Arg
Val Ala Pro 180 185 190Ser Phe Gly Asp Leu Ala Asp Met Gly Tyr Phe
Gly Asp Leu Ser Gly 195 200 205Ser Ser Ser Phe Gly Glu Thr Ile Val
Asp Pro Asp Leu Asp Glu Gln 210 215 220Tyr Leu Thr Phe Ser Trp Trp
Leu Leu Asn Glu Gly Trp Val Ser Leu225 230 235 240Ser Glu Arg Val
Glu Glu Ala Val Arg Arg Val Trp Asp Pro Val Ser 245 250 255Pro Lys
Ala Glu Leu Gly Phe Asp Glu Leu Ser Glu Leu Ile Gly Arg 260 265
270Thr Gln Met Leu Ile Asp Arg Pro Leu Asn Pro Ser Ser Pro Leu Asn
275 280 285Phe Leu Ser Gln Leu Leu Pro Pro Arg Glu Gln Glu Glu Tyr
Val Leu 290 295 300Ala Gln Asn Pro Ser Asp Thr Ala Ala Pro Ile Val
Gly Pro Thr Leu305 310 315 320Arg Arg Leu Leu Asp Glu Thr Ala Asp
Phe Ile Glu Ser Pro Asn Ala 325 330 335Ala Glu Val Ile Glu Arg Leu
Val His Ser Gly Leu Ser Val Phe Met 340 345 350Asp Lys Leu Ala Val
Thr Phe Gly Ala Thr Pro Ala Asp Ser Gly Ser 355 360 365Pro Tyr Pro
Val Val Leu Pro Thr Ala Lys Val Lys Leu Pro Ser Ile 370 375 380Leu
Ala Asn Met Ala Arg Gln Ala Gly Gly Met Ala Gln Gly Ser Pro385 390
395 400Gly Val Glu Asn Glu Tyr Ile Asp Val Met Asn Gln Val Gln Glu
Leu 405 410 415Thr Ser Phe Ser Ala Val Val Tyr Ser Ser Phe Asp Trp
Ala Leu 420 425 4304395PRTYarrowia
lipolyticaMISC_FEATURE(1)..(395)YlPex3Bp; GenBank Accession No.
CAG83356 4Met Leu Gln Ser Leu Asn Arg Asn Lys Lys Arg Leu Ala Val
Ser Thr1 5 10 15Gly Leu Ile Ala Val Ala Tyr Val Val Ile Ser Tyr Thr
Thr Lys Arg 20 25 30Leu Ile Glu Lys Gln Glu Gln Lys Leu Glu Glu Glu
Arg Ala Lys Glu 35 40 45Arg Leu Lys Gln Leu Phe Ala Gln Thr Gln Asn
Glu Ala Ala Phe His 50 55 60Thr Ala Ser Val Leu Pro Gln Leu Cys Glu
Gln Ile Met Glu Phe Val65 70 75 80Ala Val Glu Lys Ile Ala Glu Gln
Leu Gln Asn Met Arg Ala Glu Lys 85 90 95Arg Lys Lys Gln Asn Met Asp
Asp Asp Lys His Ser Val Leu Ser Leu 100 105 110Gly Thr Glu Thr Thr
Ala Ser Met Ala Asp Gly Gln Lys Met Ser Lys 115 120 125Ile Gln Leu
Trp Asp Glu Leu Lys Ile Glu Ser Leu Thr Arg Ile Val 130 135 140Thr
Leu Ile Tyr Cys Val Ser Leu Leu Asn Tyr Leu Ile Arg Leu Gln145 150
155 160Thr Asn Ile Val Gly Arg Lys Arg Tyr Gln Asn Glu Ala Gly Pro
Ala 165 170 175Gly Ala Thr Tyr Asp Met Ser Leu Glu Gln Cys Tyr Thr
Trp Leu Leu 180 185 190Thr Arg Gly Trp Lys Ser Val Val Asp Asn Val
Arg Arg Ser Val Gln 195 200 205Gln Val Phe Thr Gly Val Asn Pro Arg
Gln Asn Leu Ser Leu Asp Glu 210 215 220Phe Ala Thr Leu Leu Lys Arg
Val Gln Thr Leu Val Asn Ser Pro Pro225 230 235 240Tyr Ser Thr Thr
Pro Asn Thr Phe Leu Thr Ser Leu Leu Pro Pro Arg 245 250 255Glu Leu
Glu Gln Leu Arg Leu Glu Lys Glu Lys Gln Ser Leu Ser Pro 260 265
270Asn Tyr Thr Tyr Gly Ser Pro Leu Lys Asp Leu Val Phe Glu Ser Ala
275 280 285Gln His Ile Gln Ser Pro Gln Gly Met Ser Ser Phe Arg Ala
Ile Ile 290 295 300Asp Gln Ser Phe Lys Val Phe Leu Glu Lys Val Asn
Glu Ser Gln Tyr305 310 315 320Val Asn Pro Pro Ser Thr Gly Gly Lys
Arg Ile Ala Val Gly Ala Leu 325 330 335Gln Pro Pro Ile Ile Ser Gly
Gly Pro Lys Lys Val Lys Leu Ala Ser 340 345 350Leu Leu Ser Val Ala
Thr Arg Gln Ser Ser Val Ile Ser His Ala Gln 355 360 365Pro Asn Pro
Tyr Val Asp Ala Ile Asn Ser Val Ala Glu Tyr Asn Gly 370 375 380Leu
Cys Ala Val Ile Tyr Ser Ser Phe Glu Gln385 390 3955153PRTYarrowia
lipolyticaMISC_FEATURE(1)..(153)YlPex4p; GenBank Accession No.
CAG79130 5Met Ala Ser Gln Lys Arg Leu Ile Lys Glu Leu Ala Ala Tyr
Lys Lys1 5 10 15Asp Pro Asn Pro Cys Leu Ala Ser Leu Thr Ala Asp Gly
Asp Ser Leu 20 25 30Tyr Lys Trp Thr Ala Val Met Arg Gly Thr Glu Gly
Thr Ala Tyr Glu 35 40 45Asn Gly Leu Trp Gln Val Glu Ile Asn Ile Pro
Glu Asn Tyr Pro Leu 50 55 60Gln Pro Pro Thr Met Phe Phe Arg Thr Lys
Ile Cys His Pro Asn Ile65 70 75 80His Phe Glu Thr Gly Glu Val Cys
Ile Asp Val Leu Lys Thr Gln Trp 85 90 95Ser Pro Ala Trp Thr Ile Ser
Ser Ala Cys Thr Ala Val Ser Ala Met 100 105 110Leu Ser Leu Pro Glu
Pro Asp Ser Pro Leu Asn Ile Asp Ala Ala Asn 115 120 125Leu Val Arg
Cys Gly Asp Glu Ser Ala Met Glu Gly Leu Val Arg Tyr 130 135 140Tyr
Val Asn Lys Tyr Ala Ser Gly Asn145 1506598PRTYarrowia
lipolyticaMISC_FEATURE(1)..(598)YlPex5p; GenBank Accession No.
CAG78803 6Met Ser Phe Met Arg Gly Gly Ser Glu Cys Ser Thr Gly Arg
Asn Pro1 5 10
15Leu Ser Gln Phe Thr Lys His Thr Ala Glu Asp Arg Ser Leu Gln His
20 25 30Asp Arg Val Ala Gly Pro Ser Gly Gly Arg Val Gly Gly Met Arg
Ser 35 40 45Asn Thr Gly Glu Met Ser Gln Gln Asp Arg Glu Met Met Ala
Arg Phe 50 55 60Gly Ala Ala Gly Pro Glu Gln Ser Ser Phe Asn Tyr Glu
Gln Met Arg65 70 75 80His Glu Leu His Asn Met Gly Ala Gln Gly Gly
Gln Ile Pro Gln Val 85 90 95Pro Ser Gln Gln Gly Ala Ala Asn Gly Gly
Gln Trp Ala Arg Asp Phe 100 105 110Gly Gly Gln Gln Thr Ala Pro Gly
Ala Ala Pro Gln Asp Ala Lys Asn 115 120 125Trp Asn Ala Glu Phe Gln
Arg Gly Gly Ser Pro Ala Glu Ala Met Gln 130 135 140Gln Gln Gly Pro
Gly Pro Met Gln Gly Gly Met Gly Met Gly Gly Met145 150 155 160Pro
Met Tyr Gly Met Ala Arg Pro Met Tyr Ser Gly Met Ser Ala Asn 165 170
175Met Ala Pro Gln Phe Gln Pro Gln Gln Ala Asn Ala Arg Val Val Glu
180 185 190Leu Asp Glu Gln Asn Trp Glu Glu Gln Phe Lys Gln Met Asp
Ser Ala 195 200 205Val Gly Lys Gly Lys Glu Val Glu Glu Gln Thr Ala
Glu Thr Ala Thr 210 215 220Ala Thr Glu Thr Val Thr Glu Thr Glu Thr
Thr Thr Glu Asp Lys Pro225 230 235 240Met Asp Ile Lys Asn Met Asp
Phe Glu Asn Ile Trp Lys Asn Leu Gln 245 250 255Val Asn Val Leu Asp
Asn Met Asp Glu Trp Leu Glu Glu Thr Asn Ser 260 265 270Pro Ala Trp
Glu Arg Asp Phe His Glu Tyr Thr His Asn Arg Pro Glu 275 280 285Phe
Ala Asp Tyr Gln Phe Glu Glu Asn Asn Gln Phe Met Glu His Pro 290 295
300Asp Pro Phe Lys Ile Gly Val Glu Leu Met Glu Thr Gly Gly Arg
Leu305 310 315 320Ser Glu Ala Ala Leu Ala Phe Glu Ala Ala Val Gln
Lys Asn Thr Glu 325 330 335His Ala Glu Ala Trp Gly Arg Leu Gly Ala
Cys Gln Ala Gln Asn Glu 340 345 350Lys Glu Asp Pro Ala Ile Arg Ala
Leu Glu Arg Cys Ile Lys Leu Glu 355 360 365Pro Gly Asn Leu Ser Ala
Leu Met Asn Leu Ser Val Ser Tyr Thr Asn 370 375 380Glu Gly Tyr Glu
Asn Ala Ala Tyr Ala Thr Leu Glu Arg Trp Leu Ala385 390 395 400Thr
Lys Tyr Pro Glu Val Val Asp Gln Ala Arg Asn Gln Glu Pro Arg 405 410
415Leu Gly Asn Glu Asp Lys Phe Gln Leu His Ser Arg Val Thr Glu Leu
420 425 430Phe Ile Arg Ala Ala Gln Leu Ser Pro Asp Gly Ala Asn Ile
Asp Ala 435 440 445Asp Val Gln Val Gly Leu Gly Val Leu Phe Tyr Gly
Asn Glu Glu Tyr 450 455 460Asp Lys Ala Ile Asp Cys Phe Asn Ala Ala
Ile Ala Val Arg Pro Asp465 470 475 480Asp Ala Leu Leu Trp Asn Arg
Leu Gly Ala Thr Leu Ala Asn Ser His 485 490 495Arg Ser Glu Glu Ala
Ile Asp Ala Tyr Tyr Lys Ala Leu Glu Leu Arg 500 505 510Pro Ser Phe
Val Arg Ala Arg Tyr Asn Leu Gly Val Ser Cys Ile Asn 515 520 525Ile
Gly Cys Tyr Lys Glu Ala Ala Gln Tyr Leu Leu Gly Ala Leu Ser 530 535
540Met His Lys Val Glu Gly Val Gln Asp Asp Val Leu Ala Asn Gln
Ser545 550 555 560Thr Asn Leu Tyr Asp Thr Leu Lys Arg Val Phe Leu
Gly Met Asp Arg 565 570 575Arg Asp Leu Val Ala Lys Val Gly Asn Gly
Met Asp Val Asn Gln Phe 580 585 590Arg Asn Glu Phe Glu Phe
59571024PRTYarrowia lipolyticaMISC_FEATURE(1)..(1024)YlPex6p;
GenBank Accession No. CAG82306 7Met Pro Ser Ile Ser His Lys Pro Ile
Thr Ala Lys Leu Val Ala Ala1 5 10 15Pro Asp Ala Thr Lys Leu Glu Leu
Ser Ser Tyr Leu Tyr Gln Gln Leu 20 25 30Phe Ser Asp Lys Pro Ala Glu
Pro Tyr Val Ala Phe Glu Ala Pro Gly 35 40 45Ile Lys Trp Ala Leu Tyr
Pro Ala Ser Glu Asp Arg Ser Leu Pro Gln 50 55 60Tyr Thr Cys Lys Ala
Asp Ile Arg His Val Ala Gly Ser Leu Lys Lys65 70 75 80Phe Met Pro
Val Val Leu Lys Arg Val Asn Pro Val Thr Ile Glu His 85 90 95Ala Ile
Val Thr Val Pro Ala Ser Gln Tyr Glu Thr Leu Asn Thr Pro 100 105
110Glu Gln Val Leu Lys Ala Leu Glu Pro Gln Leu Asp Lys Asp Arg Pro
115 120 125Val Ile Arg Gln Gly Asp Val Leu Leu Asn Gly Cys Arg Val
Arg Leu 130 135 140Cys Glu Pro Val Asn Gln Gly Lys Val Val Lys Gly
Thr Thr Lys Leu145 150 155 160Thr Val Ala Lys Glu Gln Glu Thr Ile
Gln Pro Ala Asp Glu Ala Ala 165 170 175Asp Val Ala Phe Asp Ile Ala
Glu Phe Leu Asp Phe Asp Thr Ser Val 180 185 190Ala Lys Thr Arg Glu
Ser Thr Asn Leu Gln Val Ala Pro Leu Glu Gly 195 200 205Ala Ile Pro
Thr Pro Leu Ser Asp Arg Phe Asp Asp Cys Glu Ser Arg 210 215 220Gly
Phe Val Lys Ser Glu Thr Met Ser Lys Leu Gly Val Phe Ser Gly225 230
235 240Asp Ile Val Ser Ile Lys Thr Lys Asn Gly Ala Glu Arg Val Leu
Arg 245 250 255Leu Phe Ala Tyr Pro Glu Pro Asn Thr Val Lys Tyr Asp
Val Val Tyr 260 265 270Val Ser Pro Ile Leu Tyr His Asn Ile Gly Asp
Lys Glu Ile Glu Val 275 280 285Thr Pro Asn Gly Glu Thr His Lys Ser
Val Gly Glu Ala Leu Asp Ser 290 295 300Val Leu Glu Ala Ala Glu Glu
Val Lys Leu Ala Arg Val Leu Gly Pro305 310 315 320Thr Thr Thr Asp
Arg Thr Phe Gln Thr Ala Tyr His Ala Gly Leu Gln 325 330 335Ala Tyr
Phe Lys Pro Val Lys Arg Ala Val Arg Val Gly Asp Leu Ile 340 345
350Pro Ile Pro Phe Asp Ser Ile Leu Ala Arg Thr Ile Gly Glu Asp Pro
355 360 365Glu Met Ser His Ile Pro Leu Glu Ala Leu Ala Val Lys Pro
Asp Ser 370 375 380Val Ala Trp Phe Gln Val Thr Ser Leu Asn Gly Ser
Glu Asp Pro Ala385 390 395 400Ser Lys Gln Tyr Leu Val Asp Ser Ser
Gln Thr Lys Leu Ile Glu Gly 405 410 415Gly Thr Thr Ser Ser Ala Val
Ile Pro Thr Ser Val Pro Trp Arg Glu 420 425 430Tyr Leu Gly Leu Asp
Thr Leu Pro Lys Phe Gly Ser Glu Phe Ala Tyr 435 440 445Ala Asp Lys
Ile Arg Asn Leu Val Gln Ile Ser Thr Ser Ala Leu Ser 450 455 460His
Ala Lys Leu Asn Thr Ser Val Leu Leu His Ser Ala Lys Arg Gly465 470
475 480Val Gly Lys Ser Thr Val Leu Arg Ser Val Ala Ala Gln Cys Gly
Ile 485 490 495Ser Val Phe Glu Ile Ser Cys Phe Gly Leu Ile Gly Asp
Asn Glu Ala 500 505 510Gln Thr Leu Gly Thr Leu Arg Ala Lys Leu Asp
Arg Ala Tyr Gly Cys 515 520 525Ser Pro Cys Val Val Val Leu Gln His
Leu Glu Ser Ile Ala Lys Lys 530 535 540Ser Asp Gln Asp Gly Lys Asp
Glu Gly Ile Val Ser Lys Leu Val Asp545 550 555 560Val Leu Ala Asp
Tyr Ser Gly His Gly Val Leu Leu Ala Ala Thr Ser 565 570 575Asn Asp
Pro Asp Lys Ile Ser Glu Ala Ile Arg Ser Arg Phe Gln Phe 580 585
590Glu Ile Glu Ile Gly Val Pro Ser Glu Pro Gln Arg Arg Gln Ile Phe
595 600 605Ser His Leu Thr Lys Ser Gly Pro Gly Gly Asp Ser Ile Arg
Asn Ala 610 615 620Pro Ile Ser Leu Arg Ser Asp Val Ser Val Glu Asn
Leu Ala Leu Gln625 630 635 640Ser Ala Gly Leu Thr Pro Pro Asp Leu
Thr Ala Ile Val Gln Thr Thr 645 650 655Arg Leu Arg Ala Ile Asp Arg
Leu Asn Lys Leu Thr Lys Asp Ser Asp 660 665 670Thr Thr Leu Asp Asp
Leu Leu Thr Leu Ser His Gly Thr Leu Gln Leu 675 680 685Thr Pro Ser
Asp Phe Asp Asp Ala Ile Ala Asp Ala Arg Gln Lys Tyr 690 695 700Ser
Asp Ser Ile Gly Ala Pro Arg Ile Pro Asn Val Gly Trp Asp Asp705 710
715 720Val Gly Gly Met Glu Gly Val Lys Lys Asp Ile Leu Asp Thr Ile
Glu 725 730 735Thr Pro Leu Lys Tyr Pro His Trp Phe Ser Asp Gly Val
Lys Lys Arg 740 745 750Ser Gly Ile Leu Phe Tyr Gly Pro Pro Gly Thr
Gly Lys Thr Leu Leu 755 760 765Ala Lys Ala Ile Ala Thr Thr Phe Ser
Leu Asn Phe Phe Ser Val Lys 770 775 780Gly Pro Glu Leu Leu Asn Met
Tyr Ile Gly Glu Ser Glu Ala Asn Val785 790 795 800Arg Arg Val Phe
Gln Lys Ala Arg Asp Ala Lys Pro Cys Val Val Phe 805 810 815Phe Asp
Glu Leu Asp Ser Val Ala Pro Gln Arg Gly Asn Gln Gly Asp 820 825
830Ser Gly Gly Val Met Asp Arg Ile Val Ser Gln Leu Leu Ala Glu Leu
835 840 845Asp Gly Met Ser Thr Ala Gly Gly Glu Gly Val Phe Val Val
Gly Ala 850 855 860Thr Asn Arg Pro Asp Leu Leu Asp Glu Ala Leu Leu
Arg Pro Gly Arg865 870 875 880Phe Asp Lys Met Leu Tyr Leu Gly Ile
Ser Asp Thr His Glu Lys Gln 885 890 895Gln Thr Ile Met Glu Ala Leu
Thr Arg Lys Phe Arg Leu Ala Ala Asp 900 905 910Val Ser Leu Glu Ala
Ile Ser Lys Arg Cys Pro Phe Thr Phe Thr Gly 915 920 925Ala Asp Phe
Tyr Ala Leu Cys Ser Asp Ala Met Leu Asn Ala Met Thr 930 935 940Arg
Thr Ala Asn Glu Val Asp Ala Lys Ile Lys Leu Leu Asn Lys Asn945 950
955 960Arg Glu Glu Ala Gly Glu Glu Pro Val Ser Ile Arg Trp Trp Phe
Asp 965 970 975His Glu Ala Thr Lys Ser Asp Ile Glu Val Glu Val Ala
Gln Gln Asp 980 985 990Phe Glu Lys Ala Lys Asp Glu Leu Ser Pro Ser
Val Ser Ala Glu Glu 995 1000 1005Leu Gln His Tyr Leu Lys Leu Arg
Gln Gln Phe Glu Gly Gly Lys 1010 1015 1020Lys8356PRTYarrowia
lipolyticaMISC_FEATURE(1)..(356)YlPex7p; GenBank Accession No.
CAG78389 8Met Leu Gly Phe Lys Thr Gln Gly Phe Asn Gly Tyr Ala Ala
Asn Tyr1 5 10 15Ser Pro Phe Phe Asn Asp Lys Ile Ala Val Gly Thr Ala
Ala Asn Tyr 20 25 30Gly Leu Val Gly Asn Gly Lys Leu Phe Ile Leu Gly
Ile Ser Pro Glu 35 40 45Gly Arg Met Val Cys Glu Gly Gln Phe Asp Thr
Gln Asp Gly Ile Phe 50 55 60Asp Val Ala Trp Ser Glu Gln His Glu Asn
His Val Ala Thr Ala Cys65 70 75 80Gly Asp Gly Ser Val Lys Leu Phe
Asp Ile Lys Ala Gly Ala Phe Pro 85 90 95Leu Val Ser Phe Lys Glu His
Thr Arg Glu Val Phe Ser Val Asn Trp 100 105 110Asn Met Ala Asn Lys
Ala Leu Phe Cys Thr Ser Ser Trp Asp Ser Thr 115 120 125Ile Lys Ile
Trp Thr Pro Glu Arg Thr Asn Ser Ile Met Thr Leu Gly 130 135 140Gln
Pro Ala Pro Ala Gln Gly Thr Asn Ala Ser Ala His Ile Gly Arg145 150
155 160Gln Thr Ala Pro Asn Gln Ala Ala Ala Gln Glu Cys Ile Tyr Ser
Ala 165 170 175Lys Phe Ser Pro His Thr Asp Ser Ile Ile Ala Ser Ala
His Ser Thr 180 185 190Gly Met Val Lys Val Trp Asp Thr Arg Ala Pro
Gln Pro Leu Gln Gln 195 200 205Gln Phe Ser Thr Gln Gln Thr Glu Ser
Gly Gly Pro Pro Glu Val Leu 210 215 220Ser Leu Asp Trp Asn Lys Tyr
Arg Pro Thr Val Ile Ala Thr Gly Gly225 230 235 240Val Asp Arg Ser
Val Gln Val Tyr Asp Ile Arg Met Thr Gln Pro Ala 245 250 255Ala Asn
Gln Pro Val Gln Pro Leu Ser Leu Ile Leu Gly His Arg Leu 260 265
270Pro Val Arg Gly Val Ser Trp Ser Pro His His Ala Asp Leu Leu Leu
275 280 285Ser Cys Ser Tyr Asp Met Thr Ala Arg Val Trp Arg Asp Ala
Ser Thr 290 295 300Gly Gly Asn Tyr Leu Ala Arg Gln Arg Gly Gly Thr
Glu Val Lys Cys305 310 315 320Met Asp Arg His Thr Glu Phe Val Ile
Gly Gly Asp Trp Ser Leu Trp 325 330 335Gly Asp Pro Gly Trp Ile Thr
Thr Val Gly Trp Asp Gln Met Val Tyr 340 345 350Val Trp His Ala
3559671PRTYarrowia lipolyticaMISC_FEATURE(1)..(671)YlPex8p; GenBank
Accession No. CAG80447 9Met Asn Lys Tyr Leu Val Pro Pro Pro Gln Ala
Asn Arg Thr Val Thr1 5 10 15Asn Leu Asp Leu Leu Ile Asn Asn Leu Arg
Gly Ser Ser Thr Pro Gly 20 25 30Ala Ala Glu Val Asp Thr Arg Asp Ile
Leu Gln Arg Ile Val Phe Ile 35 40 45Leu Pro Thr Ile Lys Asn Pro Leu
Asn Leu Asp Leu Val Ile Lys Glu 50 55 60Ile Ile Asn Ser Pro Arg Leu
Leu Pro Pro Leu Ile Asp Leu His Asp65 70 75 80Tyr Gln Gln Leu Thr
Asp Ala Phe Arg Ala Thr Ile Lys Arg Lys Ala 85 90 95Leu Val Thr Asp
Pro Thr Ile Ser Phe Glu Ala Trp Leu Glu Thr Cys 100 105 110Phe Gln
Val Ile Thr Arg Phe Ala Gly Pro Gly Trp Lys Lys Leu Pro 115 120
125Leu Leu Ala Gly Leu Ile Leu Ala Asp Tyr Asp Ile Ser Ala Asp Gly
130 135 140Pro Thr Leu Glu Arg Lys Pro Gly Phe Pro Ser Lys Leu Lys
His Leu145 150 155 160Leu Lys Arg Glu Phe Val Thr Thr Phe Asp Gln
Cys Leu Ser Ile Asp 165 170 175Thr Arg Asn Arg Ser Asp Ala Thr Lys
Trp Val Pro Val Leu Ala Cys 180 185 190Ile Ser Ile Ala Gln Val Tyr
Ser Leu Leu Gly Asp Val Ala Ile Asn 195 200 205Tyr Arg Arg Phe Leu
Gln Val Gly Leu Asp Leu Ile Phe Ser Asn Tyr 210 215 220Gly Leu Glu
Met Gly Thr Ala Leu Ala Arg Leu His Ala Glu Ser Gly225 230 235
240Gly Asp Ala Thr Thr Ala Gly Gly Leu Ile Gly Lys Lys Leu Lys Glu
245 250 255Pro Val Val Ala Leu Leu Asn Thr Phe Ala His Ile Ala Ser
Ser Cys 260 265 270Ile Val His Val Asp Ile Asp Tyr Ile Asp Arg Ile
Gln Asn Lys Ile 275 280 285Ile Leu Val Cys Glu Asn Gln Ala Glu Thr
Trp Arg Ile Leu Thr Ile 290 295 300Glu Ser Pro Thr Val Met His His
Gln Glu Ser Val Gln Tyr Leu Lys305 310 315 320Trp Glu Leu Phe Thr
Leu Cys Ile Ile Met Gln Gly Ile Ala Asn Met 325 330 335Leu Leu Thr
Gln Lys Met Asn Gln Phe Met Tyr Leu Gln Leu Ala Tyr 340 345 350Lys
Gln Leu Gln Ala Leu His Ser Ile Tyr Phe Ile Val Asp Gln Met 355 360
365Gly Ser Gln Phe Ala Ala Tyr Asp Tyr Val Phe Phe Ser Ala Ile Asp
370 375 380Val Leu Leu Ser Glu Tyr Ala Pro Tyr Ile Lys Asn Arg Gly
Thr Ile385 390 395 400Pro Pro Asn Lys Glu Phe Val Ala Glu Arg Leu
Ala Ala Asn Leu Ala 405 410 415Gly Thr Ser Asn Val Gly Ser His Leu
Pro Ile Asp Arg Ser Arg Val 420 425 430Leu Phe Ala Leu Asn Tyr Tyr
Glu Gln Leu Val Thr Val Cys His Asp 435 440 445Ser Cys Val Glu Thr
Ile Ile Tyr Pro Met Ala Arg Ser Phe Leu Tyr 450 455 460Pro Thr Ser
Asp Ile Gln Gln Leu Lys Pro Leu Val Glu Ala Ala His465
470 475 480Ser Val Ile Leu Ala Gly Leu Ala Val Pro Thr Asn Ala Val
Val Asn 485 490 495Ala Lys Leu Ile Pro Glu Tyr Met Gly Gly Val Leu
Pro Leu Phe Pro 500 505 510Gly Val Phe Ser Trp Asn Gln Phe Val Leu
Ala Ile Gln Ser Ile Val 515 520 525Asn Thr Val Ser Pro Pro Ser Glu
Val Phe Lys Thr Asn Gln Lys Leu 530 535 540Phe Arg Leu Val Leu Asp
Ser Leu Met Lys Lys Cys Arg Asp Thr Pro545 550 555 560Val Gly Ile
Pro Val Pro His Ser Val Thr Val Ser Gln Glu Gln Glu 565 570 575Asp
Ile Pro Pro Thr Gln Arg Ala Val Val Met Leu Ala Leu Ile Asn 580 585
590Ser Leu Pro Tyr Val Asp Ile Arg Ser Phe Glu Leu Trp Leu Gln Glu
595 600 605Thr Trp Asn Met Ile Glu Ala Thr Pro Met Leu Ala Glu Asn
Ala Pro 610 615 620Asn Lys Glu Leu Ala His Ala Glu His Glu Phe Leu
Val Leu Glu Met625 630 635 640Trp Lys Met Ile Ser Gly Asn Ile Asp
Gln Arg Leu Asn Asp Val Ala 645 650 655Ile Arg Trp Trp Tyr Lys Lys
Asn Ala Arg Val His Gly Thr Leu 660 665 67010377PRTYarrowia
lipolyticaMISC_FEATURE(1)..(377)YlPex10p; GenBank Accession No.
CAG81606 10Met Trp Gly Ser Ser His Ala Phe Ala Gly Glu Ser Asp Leu
Thr Leu1 5 10 15Gln Leu His Thr Arg Ser Asn Met Ser Asp Asn Thr Thr
Ile Lys Lys 20 25 30Pro Ile Arg Pro Lys Pro Ile Arg Thr Glu Arg Leu
Pro Tyr Ala Gly 35 40 45Ala Ala Glu Ile Ile Arg Ala Asn Gln Lys Asp
His Tyr Phe Glu Ser 50 55 60Val Leu Glu Gln His Leu Val Thr Phe Leu
Gln Lys Trp Lys Gly Val65 70 75 80Arg Phe Ile His Gln Tyr Lys Glu
Glu Leu Glu Thr Ala Ser Lys Phe 85 90 95Ala Tyr Leu Gly Leu Cys Thr
Leu Val Gly Ser Lys Thr Leu Gly Glu 100 105 110Glu Tyr Thr Asn Leu
Met Tyr Thr Ile Arg Asp Arg Thr Ala Leu Pro 115 120 125Gly Val Val
Arg Arg Phe Gly Tyr Val Leu Ser Asn Thr Leu Phe Pro 130 135 140Tyr
Leu Phe Val Arg Tyr Met Gly Lys Leu Arg Ala Lys Leu Met Arg145 150
155 160Glu Tyr Pro His Leu Val Glu Tyr Asp Glu Asp Glu Pro Val Pro
Ser 165 170 175Pro Glu Thr Trp Lys Glu Arg Val Ile Lys Thr Phe Val
Asn Lys Phe 180 185 190Asp Lys Phe Thr Ala Leu Glu Gly Phe Thr Ala
Ile His Leu Ala Ile 195 200 205Phe Tyr Val Tyr Gly Ser Tyr Tyr Gln
Leu Ser Lys Arg Ile Trp Gly 210 215 220Met Arg Tyr Val Phe Gly His
Arg Leu Asp Lys Asn Glu Pro Arg Ile225 230 235 240Gly Tyr Glu Met
Leu Gly Leu Leu Ile Phe Ala Arg Phe Ala Thr Ser 245 250 255Phe Val
Gln Thr Gly Arg Glu Tyr Leu Gly Ala Leu Leu Glu Lys Ser 260 265
270Val Glu Lys Glu Ala Gly Glu Lys Glu Asp Glu Lys Glu Ala Val Val
275 280 285Pro Lys Lys Lys Ser Ser Ile Pro Phe Ile Glu Asp Thr Glu
Gly Glu 290 295 300Thr Glu Asp Lys Ile Asp Leu Glu Asp Pro Arg Gln
Leu Lys Phe Ile305 310 315 320Pro Glu Ala Ser Arg Ala Cys Thr Leu
Cys Leu Ser Tyr Ile Ser Ala 325 330 335Pro Ala Cys Thr Pro Cys Gly
His Phe Phe Cys Trp Asp Cys Ile Ser 340 345 350Glu Trp Val Arg Glu
Lys Pro Glu Cys Pro Leu Cys Arg Gln Gly Val 355 360 365Arg Glu Gln
Asn Leu Leu Pro Ile Arg 370 37511408PRTYarrowia
lipolyticaMISC_FEATURE(1)..(408)YlPex12p; GenBank Accession No.
CAG81532 11Met Asp Tyr Phe Ser Ser Leu Asn Ala Ser Gln Leu Asp Pro
Asp Val1 5 10 15Pro Thr Leu Phe Glu Leu Leu Ser Ala Lys Gln Leu Glu
Gly Leu Ile 20 25 30Ala Pro Ser Val Arg Tyr Ile Leu Ala Phe Tyr Ala
Gln Arg His Pro 35 40 45Arg Tyr Leu Leu Arg Ile Val Asn Arg Tyr Asp
Glu Leu Tyr Ala Leu 50 55 60Phe Met Gly Leu Val Glu Tyr Tyr Asn Leu
Lys Thr Trp Asn Ala Ser65 70 75 80Phe Thr Glu Lys Phe Tyr Gly Leu
Lys Arg Thr Gln Ile Leu Thr Asn 85 90 95Pro Ala Leu Arg Thr Arg Gln
Ala Val Pro Asp Leu Val Glu Ala Glu 100 105 110Lys Arg Leu Ser Lys
Lys Lys Ile Trp Gly Ser Leu Phe Phe Leu Ile 115 120 125Val Val Pro
Tyr Val Lys Glu Lys Leu Asp Ala Arg Tyr Glu Arg Leu 130 135 140Lys
Gly Arg Tyr Leu Ala Arg Asp Ile Asn Glu Glu Arg Ile Glu Ile145 150
155 160Lys Arg Thr Gly Thr Ala Gln Gln Ile Ala Val Phe Glu Phe Asp
Tyr 165 170 175Trp Leu Leu Lys Leu Tyr Pro Ile Val Thr Met Gly Cys
Thr Thr Ala 180 185 190Thr Leu Ala Phe His Met Leu Phe Leu Phe Ser
Val Thr Arg Ala Tyr 195 200 205Ser Ile Asp Asp Phe Leu Leu Asn Ile
Gln Phe Ser Arg Met Thr Arg 210 215 220Tyr Asp Tyr Gln Met Glu Thr
Gln Arg Asp Ser Arg Asn Ala Ala Asn225 230 235 240Val Ala His Thr
Met Lys Ser Ile Ser Glu Tyr Pro Val Ala Glu Arg 245 250 255Val Met
Leu Leu Leu Thr Thr Lys Ala Gly Ala Asn Ala Met Arg Ser 260 265
270Ala Ala Leu Ser Gly Leu Ser Tyr Val Leu Pro Thr Ser Ile Phe Ala
275 280 285Leu Lys Phe Leu Glu Trp Trp Tyr Ala Ser Asp Phe Ala Arg
Gln Leu 290 295 300Asn Gln Lys Arg Arg Gly Asp Leu Glu Asp Asn Leu
Pro Val Pro Asp305 310 315 320Lys Val Lys Gly Ala Asp Lys Leu Ala
Glu Ser Val Ala Lys Trp Lys 325 330 335Glu Asp Thr Ser Lys Cys Pro
Leu Cys Ser Lys Glu Leu Val Asn Pro 340 345 350Thr Val Ile Glu Ser
Gly Tyr Val Phe Cys Tyr Thr Cys Ile Tyr Arg 355 360 365His Leu Glu
Asp Gly Asp Glu Glu Thr Gly Gly Arg Cys Pro Val Thr 370 375 380Gly
Gln Lys Leu Leu Gly Cys Arg Trp Gln Asp Asp Val Trp Gln Val385 390
395 400Thr Gly Leu Arg Arg Leu Met Val 40512412PRTYarrowia
lipolyticaMISC_FEATURE(1)..(412)YlPex13p; GenBank Accession No.
CAG81789 12Met Ser Val Pro Arg Pro Lys Pro Trp Glu Gly Ala Ser Gly
Ser Ser1 5 10 15Ala Ala Thr Ala Thr Pro Ala Ala Thr Ala Thr Pro Ala
Ser Thr Asp 20 25 30Ala Val Ser Ser Ser Ala Gly Ser Ala Thr Gly Ala
Pro Glu Leu Pro 35 40 45Ser Arg Pro Ser Ala Met Gly Ser Thr Ser Asn
Ala Leu Ser Ser Pro 50 55 60Met Gly Ser Ser Met Asn Ser Gly Tyr Gly
Gly Met Asn Ser Gly Tyr65 70 75 80Gly Gly Met Gly Ser Ser Tyr Gly
Ser Gly Tyr Gly Ser Ser Tyr Gly 85 90 95Met Gly Ser Ser Tyr Gly Ser
Gly Tyr Gly Ser Gly Leu Gly Gly Tyr 100 105 110Gly Ser Tyr Gly Gly
Met Gly Gly Met Gly Gly Met Tyr Gly Ser Arg 115 120 125Tyr Gly Gly
Tyr Gly Ser Tyr Gly Gly Met Gly Gly Tyr Gly Gly Tyr 130 135 140Gly
Gly Met Gly Gly Gly Pro Met Gly Gln Asn Gly Leu Ala Gly Gly145 150
155 160Thr Gln Ala Thr Phe Gln Leu Ile Glu Ser Ile Val Gly Ala Val
Gly 165 170 175Gly Phe Ala Gln Met Leu Glu Ser Thr Tyr Met Ala Thr
Gln Ser Ser 180 185 190Phe Phe Ala Met Val Ser Val Ala Glu Gln Phe
Gly Asn Leu Lys Asn 195 200 205Thr Leu Gly Ser Leu Leu Gly Ile Tyr
Ala Ile Met Arg Trp Ala Arg 210 215 220Arg Leu Val Ala Lys Leu Ser
Gly Gln Pro Val Thr Gly Ala Asn Gly225 230 235 240Ile Thr Pro Ala
Gly Phe Ala Lys Phe Glu Ala Thr Gly Gly Ala Ala 245 250 255Gly Pro
Gly Arg Gly Pro Arg Pro Ser Tyr Lys Pro Leu Leu Phe Phe 260 265
270Leu Thr Ala Val Phe Gly Leu Pro Tyr Leu Leu Gly Arg Leu Ile Lys
275 280 285Ala Leu Ala Ala Lys Gln Glu Gly Met Tyr Asp Glu His Gly
Asn Leu 290 295 300Leu Pro Gly Ala Gln Met Gly Met Gly Gly Pro Gly
Met Glu Gly Gly305 310 315 320Ala Glu Ile Asp Pro Ser Lys Leu Glu
Phe Cys Arg Ala Asn Phe Asp 325 330 335Phe Val Pro Glu Asn Pro Gln
Leu Glu Leu Glu Leu Arg Lys Gly Asp 340 345 350Leu Val Ala Val Leu
Ala Lys Thr Asp Pro Met Gly Asn Pro Ser Gln 355 360 365Trp Trp Arg
Val Arg Thr Arg Asp Gly Arg Ser Gly Tyr Val Pro Ala 370 375 380Asn
Tyr Leu Glu Val Ile Pro Arg Pro Ala Val Glu Ala Pro Lys Lys385 390
395 400Val Glu Glu Ile Gly Ala Ser Ala Val Pro Val Asn 405
41013380PRTYarrowia lipolyticaMISC_FEATURE(1)..(380)YlPex14p;
GenBank Accession No. CAG79323 13Met Ile Pro Ser Cys Leu Ser Thr
Gln His Met Ala Pro Arg Glu Asp1 5 10 15Leu Val Gln Ser Ala Val Ala
Phe Leu Asn Asp Pro Gln Ala Ala Thr 20 25 30Ala Pro Leu Ala Lys Arg
Ile Glu Phe Leu Glu Ser Lys Asp Met Thr 35 40 45Pro Glu Glu Ile Glu
Glu Ala Leu Lys Arg Ala Gly Ser Gly Ser Ala 50 55 60Gln Ser His Pro
Gly Ser Val Val Ser His Gly Gly Ala Ala Pro Thr65 70 75 80Val Pro
Ala Ser Tyr Ala Phe Gln Ser Ala Pro Pro Leu Pro Glu Arg 85 90 95Asp
Trp Lys Asp Val Phe Ile Met Ala Thr Val Thr Val Gly Val Gly 100 105
110Phe Gly Leu Tyr Thr Val Ala Lys Arg Tyr Leu Met Pro Leu Ile Leu
115 120 125Pro Pro Thr Pro Pro Ser Leu Glu Ala Asp Lys Glu Ala Leu
Glu Ala 130 135 140Glu Phe Ala Arg Val Gln Gly Leu Leu Asp Gln Val
Gln Gln Asp Thr145 150 155 160Glu Glu Val Lys Asn Ser Gln Val Glu
Val Ala Lys Arg Val Thr Asp 165 170 175Ala Leu Lys Gly Val Glu Glu
Thr Ile Asp Gln Leu Lys Ser Gln Thr 180 185 190Lys Lys Arg Asp Asp
Glu Met Lys Leu Val Thr Ala Glu Val Glu Arg 195 200 205Ile Arg Asp
Arg Leu Pro Lys Asn Ile Asp Lys Leu Lys Asp Ser Gln 210 215 220Glu
Gln Gly Leu Ala Asp Ile Gln Ser Glu Leu Lys Ser Leu Lys Gln225 230
235 240Leu Leu Ser Thr Arg Thr Ala Ala Ser Ser Gly Pro Lys Leu Pro
Pro 245 250 255Ile Pro Pro Pro Ser Ser Tyr Leu Thr Arg Lys Ala Ser
Pro Ala Val 260 265 270Pro Ala Ala Ala Pro Ala Pro Val Thr Pro Gly
Ser Pro Val His Asn 275 280 285Val Ser Ser Ser Ser Thr Val Pro Ala
Asp Arg Asp Asp Phe Ile Pro 290 295 300Thr Pro Ala Gly Ala Val Pro
Met Ile Pro Gln Pro Ala Ser Met Ser305 310 315 320Ser Ser Ser Thr
Ser Thr Val Pro Asn Ser Ala Ile Ser Ser Ala Pro 325 330 335Ser Pro
Ile Gln Glu Pro Glu Pro Phe Val Pro Glu Pro Gly Asn Ser 340 345
350Ala Val Lys Lys Pro Ala Pro Lys Ala Ser Ile Pro Ala Trp Gln Leu
355 360 365Ala Ala Leu Glu Lys Glu Lys Glu Lys Glu Lys Glu 370 375
38014391PRTYarrowia lipolyticaMISC_FEATURE(1)..(391)YlPex16p;
GenBank Accession No. CAG79622 14Met Thr Asp Lys Leu Val Lys Val
Met Gln Lys Lys Lys Ser Ala Pro1 5 10 15Gln Thr Trp Leu Asp Ser Tyr
Asp Lys Phe Leu Val Arg Asn Ala Ala 20 25 30Ser Ile Gly Ser Ile Glu
Ser Thr Leu Arg Thr Val Ser Tyr Val Leu 35 40 45Pro Gly Arg Phe Asn
Asp Val Glu Ile Ala Thr Glu Thr Leu Tyr Ala 50 55 60Val Leu Asn Val
Leu Gly Leu Tyr His Asp Thr Ile Ile Ala Arg Ala65 70 75 80Val Ala
Ala Ser Pro Asn Ala Ala Ala Val Tyr Arg Pro Ser Pro His 85 90 95Asn
Arg Tyr Thr Asp Trp Phe Ile Lys Asn Arg Lys Gly Tyr Lys Tyr 100 105
110Ala Ser Arg Ala Val Thr Phe Val Lys Phe Gly Glu Leu Val Ala Glu
115 120 125Met Val Ala Lys Lys Asn Gly Gly Glu Met Ala Arg Trp Lys
Cys Ile 130 135 140Ile Gly Ile Glu Gly Ile Lys Ala Gly Leu Arg Ile
Tyr Met Leu Gly145 150 155 160Ser Thr Leu Tyr Gln Pro Leu Cys Thr
Thr Pro Tyr Pro Asp Arg Glu 165 170 175Val Thr Gly Glu Leu Leu Glu
Thr Ile Cys Arg Asp Glu Gly Glu Leu 180 185 190Asp Ile Glu Lys Gly
Leu Met Asp Pro Gln Trp Lys Met Pro Arg Thr 195 200 205Gly Arg Thr
Ile Pro Glu Ile Ala Pro Thr Asn Val Glu Gly Tyr Leu 210 215 220Leu
Thr Lys Val Leu Arg Ser Glu Asp Val Asp Arg Pro Tyr Asn Leu225 230
235 240Leu Ser Arg Leu Asp Asn Trp Gly Val Val Ala Glu Leu Leu Ser
Ile 245 250 255Leu Arg Pro Leu Ile Tyr Ala Cys Leu Leu Phe Arg Gln
His Val Asn 260 265 270Lys Thr Val Pro Ala Ser Thr Lys Ser Lys Phe
Pro Phe Leu Asn Ser 275 280 285Pro Trp Ala Pro Trp Ile Ile Gly Leu
Val Ile Glu Ala Leu Ser Arg 290 295 300Lys Met Met Gly Ser Trp Leu
Leu Arg Gln Arg Gln Ser Gly Lys Thr305 310 315 320Pro Thr Ala Leu
Asp Gln Met Glu Val Lys Gly Arg Thr Asn Leu Leu 325 330 335Gly Trp
Trp Leu Phe Arg Gly Glu Phe Tyr Gln Ala Tyr Thr Arg Pro 340 345
350Leu Leu Tyr Ser Ile Val Ala Arg Leu Glu Lys Ile Pro Gly Leu Gly
355 360 365Leu Phe Gly Ala Leu Ile Ser Asp Tyr Leu Tyr Leu Phe Asp
Arg Tyr 370 375 380Tyr Phe Thr Ala Ser Thr Leu385
39015225PRTYarrowia lipolyticaMISC_FEATURE(1)..(225)YlPex17p;
GenBank Accession No. CAG84025 15Met Ser Ala Phe Pro Glu Pro Ser
Ser Phe Glu Ile Glu Phe Ala Lys1 5 10 15Gln Met Asn Arg Pro Arg Thr
Val Gln Phe Lys Gln Leu Val Ala Val 20 25 30Leu Tyr Ile Phe Gly Gly
Thr Ser Ala Leu Ile Tyr Ile Ile Ser Lys 35 40 45Thr Ile Leu Asn Pro
Leu Phe Glu Glu Leu Thr Phe Ala Arg Ser Glu 50 55 60Tyr Ala Ile His
Ala Arg Arg Leu Met Glu Gln Leu Asn Ala Lys Leu65 70 75 80Ser Ser
Met Ala Ser Tyr Ile Pro Pro Val Arg Ala Leu Gln Gly Gln 85 90 95Arg
Phe Val Asp Ala Gln Thr Gln Thr Glu Asp Glu Glu Gly Glu Asp 100 105
110Ile Pro Asn Pro Ser Leu Gly Lys Ser Ser His Val Ser Phe Gly Glu
115 120 125Ser Pro Met Gln Leu Lys Leu Ala Glu Lys Glu Lys Gln Gln
Lys Leu 130 135 140Ile Asp Asp Ser Val Asp Asn Leu Glu Arg Leu Ala
Asp Ser Leu Lys145 150 155 160His Ala Gly Glu Val Ser Asp Leu Ser
Ala Leu Ser Gly Phe Lys Tyr 165 170 175Gln Val Glu Glu Leu Thr Asn
Tyr Ser Asp Gln Leu Ala Met Ser Gly 180 185 190Tyr Ser Met Met Lys
Ser Gly Leu Pro Gly His Glu Thr Ala Met Ser 195 200 205Glu Thr Lys
Lys Glu Ile Arg Ser Leu Lys Gly Ser Val Leu Ser Val 210 215
220Arg22516324PRTYarrowia
lipolyticaMISC_FEATURE(1)..(324)YlPex19p; GenBank Accession No.
AAK84827 16Met Ser His Glu Glu Asp Leu Asp Asp Leu Asp Asp Phe Leu
Asp Glu1 5 10 15Phe Asp Glu Gln Val Leu Ser Lys Pro Pro Gly Ala Gln
Lys Asp Ala 20 25 30Thr Pro Thr Thr Ser Thr Ala Pro Thr Thr Ala Glu
Ala Lys Pro Asp 35 40 45Ala Thr Lys Lys Ser Thr Glu Thr Ser Gly Thr
Asp Ser Lys Thr Glu 50 55 60Gly Ala Asp Thr Ala Asp Lys Asn Ala Ala
Thr Asp Ser Ala Glu Ala65 70 75 80Gly Ala Glu Lys Val Ser Leu Pro
Asn Leu Glu Asp Gln Leu Ala Gly 85 90 95Leu Lys Met Asp Asp Phe Leu
Lys Asp Ile Glu Ala Asp Pro Glu Ser 100 105 110Lys Ala Gln Phe Glu
Ser Leu Leu Lys Glu Ile Asn Asn Val Thr Ser 115 120 125Ala Thr Ala
Ser Glu Lys Ala Gln Gln Pro Lys Ser Phe Lys Glu Thr 130 135 140Ile
Ser Ala Thr Ala Asp Arg Leu Asn Gln Ser Asn Gln Glu Met Gly145 150
155 160Asp Met Pro Leu Gly Asp Asp Met Leu Ala Gly Leu Met Glu Gln
Leu 165 170 175Ser Gly Ala Gly Gly Phe Gly Glu Gly Gly Glu Gly Asp
Phe Gly Asp 180 185 190Met Leu Gly Gly Ile Met Arg Gln Leu Ala Ser
Lys Glu Val Leu Tyr 195 200 205Gln Pro Leu Lys Glu Met His Asp Asn
Tyr Pro Lys Trp Trp Asp Glu 210 215 220His Gly Ser Lys Val Thr Glu
Glu Lys Glu Arg Asp Arg Leu Lys Leu225 230 235 240Gln Gln Asp Ile
Val Gly Lys Ile Cys Ala Lys Phe Glu Asp Pro Ser 245 250 255Tyr Ser
Asp Asp Ser Glu Ala Asp Arg Ala Val Ile Thr Gln Leu Met 260 265
270Asp Glu Met Gln Glu Thr Gly Ala Pro Pro Asp Glu Ile Met Ser Asn
275 280 285Val Ala Asp Gly Ser Ile Pro Gly Gly Leu Asp Gly Leu Gly
Leu Gly 290 295 300Gly Leu Gly Gly Gly Lys Met Pro Glu Met Pro Glu
Asn Met Pro Glu305 310 315 320Cys Asn Gln Gln17417PRTYarrowia
lipolyticaMISC_FEATURE(1)..(417)YlPex20p; GenBank Accession No.
CAG79226 17Met Ala Ser Cys Gly Pro Ser Asn Ala Leu Gln Asn Leu Ser
Lys His1 5 10 15Ala Ser Ala Asp Arg Ser Leu Gln His Asp Arg Met Ala
Pro Gly Gly 20 25 30Ala Pro Gly Ala Gln Arg Gln Gln Phe Arg Ser Gln
Thr Gln Gly Gly 35 40 45Gln Leu Asn Asn Glu Phe Gln Gln Phe Ala Gln
Ala Gly Pro Ala His 50 55 60Asn Ser Phe Glu Gln Ser Gln Met Gly Pro
His Phe Gly Gln Gln His65 70 75 80Phe Gly Gln Pro His Gln Pro Gln
Met Gly Gln His Ala Pro Met Ala 85 90 95His Gly Gln Gln Ser Asp Trp
Ala Gln Ser Phe Ser Gln Leu Asn Leu 100 105 110Gly Pro Gln Thr Gly
Pro Gln His Thr Gln Gln Ser Asn Trp Gly Gln 115 120 125Asp Phe Met
Arg Gln Ser Pro Gln Ser His Gln Val Gln Pro Gln Met 130 135 140Ala
Asn Gly Val Met Gly Ser Met Ser Gly Met Ser Ser Phe Gly Pro145 150
155 160Met Tyr Ser Asn Ser Gln Leu Met Asn Ser Thr Tyr Gly Leu Gln
Thr 165 170 175Glu His Gln Gln Thr His Lys Thr Glu Thr Lys Ser Ser
Gln Asp Ala 180 185 190Ala Phe Glu Ala Ala Phe Gly Ala Val Glu Glu
Ser Ile Thr Lys Thr 195 200 205Ser Asp Lys Gly Lys Glu Val Glu Lys
Asp Pro Met Glu Gln Thr Tyr 210 215 220Arg Tyr Asp Gln Ala Asp Ala
Leu Asn Arg Gln Ala Glu His Ile Ser225 230 235 240Asp Asn Ile Ser
Arg Glu Glu Val Asp Ile Lys Thr Asp Glu Asn Gly 245 250 255Glu Phe
Ala Ser Ile Ala Arg Gln Ile Ala Ser Ser Leu Glu Glu Ala 260 265
270Asp Lys Ser Lys Phe Glu Lys Ser Thr Phe Met Asn Leu Met Arg Arg
275 280 285Ile Gly Asn His Glu Val Thr Leu Asp Gly Asp Lys Leu Val
Asn Lys 290 295 300Glu Gly Glu Asp Ile Arg Glu Glu Val Arg Asp Glu
Leu Leu Arg Glu305 310 315 320Gly Ala Ser Gln Glu Asn Gly Phe Gln
Ser Glu Ala Gln Gln Thr Ala 325 330 335Pro Leu Pro Val His His Glu
Ala Pro Pro Pro Glu Gln Ile His Pro 340 345 350His Thr Glu Thr Gly
Asp Lys Gln Leu Glu Asp Pro Met Val Tyr Ile 355 360 365Glu Gln Glu
Ala Ala Arg Arg Ala Ala Glu Ser Gly Arg Thr Val Glu 370 375 380Glu
Glu Lys Leu Asn Phe Tyr Ser Pro Phe Glu Tyr Ala Gln Lys Leu385 390
395 400Gly Pro Gln Gly Val Ala Lys Gln Ser Asn Trp Glu Glu Asp Tyr
Asp 405 410 415Phe18195PRTYarrowia
lipolyticaMISC_FEATURE(1)..(195)YlPex22p; GenBank Accession No.
CAG77876 18Val Pro Arg Cys Thr Ser His Pro Cys Asn Leu Thr Leu His
Leu Pro1 5 10 15Val Thr Thr Met Ala Pro Arg Lys Thr Arg Leu Pro Ala
Val Ile Gly 20 25 30Ala Ala Ala Ala Ala Ala Ala Val Ala Tyr Leu Val
Tyr Ser Phe Val 35 40 45Ala Lys Ser Asn Ser Asp Gln Asp Thr Phe Asp
Ser Ser Val Gln Ser 50 55 60Ser Ser Lys Ser Ser Thr Lys Ser Pro Lys
Ser Thr Ala Thr Asn Ser65 70 75 80Lys Ile Thr Val Val Val Ser Gln
Glu Leu Val Gln Ser Gln Leu Val 85 90 95Asp Phe Lys His Leu Met Ser
Val His Pro Asn Leu Val Val Ile Val 100 105 110Pro Pro Met Val Ala
Asn Lys Phe His Arg Ala Leu Lys Ser Ser Val 115 120 125Gly His Asp
His Gly Val Lys Val Ile Arg Cys Asp Thr Asp Val Gly 130 135 140Val
Ile His Val Ile Lys His Ile Arg Pro Asp Leu Ala Leu Ile Ala145 150
155 160Asp Gly Val Gly Asp Asn Ile Gln Gly Glu Ile Lys Arg Phe Val
Gly 165 170 175Ser Ser Glu Ala Leu Ser Gly Asp Val Asn Leu Ala Ala
Glu Arg Leu 180 185 190Thr Gly Leu 19519386PRTYarrowia
lipolyticaMISC_FEATURE(1)..(386)YlPex26p; GenBank Accession No.
NC_006072, antisense translation of nucleotides 117230-118387 19Met
Pro Pro Ala Met Pro Gln Met Thr Thr Ser Thr Leu Leu Thr Asp1 5 10
15Ser Val Thr Ser Ala Val Asn Gln Ala Ala Thr Pro Lys Val Asp Gln
20 25 30Met Tyr Gln Thr Phe Gly Glu Ser Ala Arg Glu Phe Val Asn Lys
Asn 35 40 45Phe Tyr Asn Ser Tyr Glu Leu Ile Arg Pro Phe Phe Asp Glu
Ile Thr 50 55 60Ala Lys Gly Ala Gln Gln Asn Gly Ser Thr Val Leu Asp
Ala Glu Asn65 70 75 80Pro His Asn Ile Pro Leu Ser Leu Trp Ile Lys
Val Trp Ser Leu Tyr 85 90 95Leu Ala Ile Leu Asp Ala Ser Cys Lys Gln
Ala Gly Glu Ala Leu Leu 100 105 110Asn Ser Thr Gly Asp Leu Ser Gly
Ser Asp Ser Gly Glu Trp Asn Gln 115 120 125Thr Arg Lys Leu Leu Ala
Arg Lys Leu Thr Ser Gly Ser Val Trp Asp 130 135 140Glu Leu Val Thr
Ala Ser Gly Gly Thr Gly Asn Ile His Pro Thr Ile145 150 155 160Leu
Ala Leu Leu Ala Ser Leu Ser Ile Arg His Asp Thr Asp Ala Lys 165 170
175Leu Met Ala Asp Asn Leu Glu Lys Phe Ile Val Thr Tyr Asn Asp Asn
180 185 190Gly Ser Asp Asp Val Lys Thr Lys Thr Ala Phe Tyr Lys Val
Leu Asp 195 200 205Leu Tyr Leu Leu Arg Val Leu Pro Asp Leu Gly Gln
Trp Asp Val Ala 210 215 220His Ser Phe Val Asn Asn Thr Asn Leu Phe
Ser His Glu Gln Lys Lys225 230 235 240Glu Met Thr His Lys Leu Asp
Gln Ser Gln Lys His Ala Glu Gln Glu 245 250 255His Lys Arg Leu Leu
Glu Glu Ala Gln Glu Lys Glu Lys Ser Asp Ala 260 265 270Lys Glu Lys
Glu Arg Glu Glu Arg Val Ser Arg Asp Thr Gln Ser Arg 275 280 285Glu
Ile Lys Ser Pro Ile Val Asp Ser Ser Thr Ser Ser Arg Asp Val 290 295
300Thr Arg Asp Thr Thr Arg Glu Leu Ser Lys Ser Ser Arg Gln Pro
Arg305 310 315 320Thr Leu Ser Gln Ile Ile Ser Thr Ser Leu Lys Ser
Gln Phe Asp Gly 325 330 335Asn Ala Ile Phe Arg Thr Leu Ala Leu Ile
Val Ile Val Ser Leu Ser 340 345 350Ala Ala Asn Pro Leu Ile Arg Lys
Arg Val Val Asp Thr Leu Lys Met 355 360 365Leu Trp Ile Lys Ile Leu
Gln Thr Leu Ser Met Gly Phe Lys Val Ser 370 375 380Tyr
Leu385203387DNAYarrowia lipolyticamisc_feature(1)..(3387)GenBank
Accession No. AB036770 20ggtaccatca agggtaaaat caaggctatc
atcaagggcc atatatcgca agtttggggg 60aagataatat gttcatagtg aatcgggttg
tggatttcct catctaacgg cattataact 120agtcctggag ggtctttttt
atggataacc tccatgtacg atgtatccaa gatctccacg 180tactgtgttc
tgtttcctaa gtaataccca acaacctctc caacaaacac ttgggaagat
240gcacttgtgc tgagatgtca agatgttaga gagtagagac agtagcaagc
gtaaaaggcg 300gccgaggcca ccgagagaac agcgtagcag ggcgcgtagt
caccacaggg gacgcagaac 360caaacaaatg acgaagaaga accacaagga
gacgttttca aaggcaatgc aaacgaagag 420ggcaatggaa ggattgagat
tagagaactg gagactggag tggcgttttc ccgatgaacg 480aacaaacacg
cgaagctatg tggaccaaca tacaacacgg actgaaccag gtttttttat
540gattttttta ctggaaatag gtacgtgcca agttggacca tgacactaaa
cgtgtttaat 600tagtaatatt cgtgtaagcg tacattcatt tcaaaggtta
ttctttcacg gcaaagttat 660aattaaatga atgtatatgc agaaaaaaaa
aaaaaaagta ctgtactgga tggagagaat 720attaataaat aattgttacc
caactacatc ttgtcgattg aaagagaccc ctaagacaga 780taggatatct
gcaacccgag gaatgaaccc cccagcaccg gcaccctttc tattaacaaa
840atgccaactg aaatttgaaa agttcaacta aacttatttg acccacaaaa
actcgtcaaa 900agtggcggcg aaagctggca aatgatgaca tccccttgga
accatgatat cctctcggaa 960tcttcgtccc catttgccac atctacttgc
aacgccacat ctgcttacta agcaacccaa 1020atctgcctcg gctcaaaatg
tggggaagtt cacatgcatt cgctggtgaa tctgatctga 1080cactacaact
acacaccagg tccaacatga gcgacaatac gacaatcaaa aagccgatcc
1140gacccaaacc gatccggacg gaacgcctgc cttacgctgg ggccgcagaa
atcatccgag 1200ccaaccagaa agaccactac tttgagtccg tgcttgaaca
gcatctcgtc acgtttctgc 1260agaaatggaa gggagtacga tttatccacc
agtacaagga ggagctggag acggcgtcca 1320agtttgcata tctcggtttg
tgtacgcttg tgggctccaa gactctcgga gaagagtaca 1380ccaatctcat
gtacactatc agagaccgaa cagctctacc gggggtggtg agacggtttg
1440gctacgtgct ttccaacact ctgtttccat acctgtttgt gcgctacatg
ggcaagttgc 1500gcgccaaact gatgcgcgag tatccccatc tggtggagta
cgacgaagat gagcctgtgc 1560ccagcccgga aacatggaag gagcgggtca
tcaagacgtt tgtgaacaag tttgacaagt 1620tcacggcgct ggaggggttt
accgcgatcc acttggcgat tttctacgtc tacggctcgt 1680actaccagct
cagtaagcgg atctggggca tgcgttatgt atttggacac cgactggaca
1740agaatgagcc tcgaatcggt tacgagatgc tcggtctgct gattttcgcc
cggtttgcca 1800cgtcatttgt gcagacggga agagagtacc tcggagcgct
gctggaaaag agcgtggaga 1860aagaggcagg ggagaaggaa gatgaaaagg
aagcggttgt gccgaaaaag aagtcgtcaa 1920ttccgttcat tgaggataca
gaaggggaga cggaagacaa gatcgatctg gaggaccctc 1980gacagctcaa
gttcattcct gaggcgtcca gagcgtgcac tctgtgtctg tcatacatta
2040gtgcgccggc atgtacgcca tgtggacact ttttctgttg ggactgtatt
tccgaatggg 2100tgagagagaa gcccgagtgt cccttgtgtc ggcagggtgt
gagagagcag aacttgttgc 2160ctatcagata atgacgaggt ctggatggaa
ggactagtca gcgagacaca gagcatcagg 2220gaccagacac gaccaattca
atcgacaaca ctgtgctgca tagcagtgca cagaggtcct 2280gggcatgaat
atattttagc attggagata tgagtggtag agcgtataca gtattaattg
2340tggaggtatc tcgtcgcatt gatagagcaa tacagttact gctgaaggga
atgataccga 2400gtatttcggc ccgattcagt tcttgatatc gtcattttgt
ctctattgtc tacttttcag 2460ataacctcaa caaatcttca acaaatctcc
cagtaaacag tcagagatca tatccgagat 2520catatcagat atgtcacgat
ccgagtacaa taatggatat taatctgctt gattttgaat 2580tctgttgcga
ttatgatttc tttgatttcg atatgaacac atacggcgac tcccagacct
2640ttagaagctc cagtttggat tcttagcaat ggttacactc aactatatcc
caagtaatac 2700ttggtaacaa tatgccaagt tagtcattca ttcgttatag
gagttagcaa gtgtttgtca 2760gctaaaaatg gttagtcggt cgattaccac
ttagatcttt tcagcgtgga acttgatggt 2820acgcttgaac cgacacttgg
agtagtcggg gctgttgatg acgtagatga cgtttcgctc 2880agggtgagga
gtgcaatagt agtactcctt ggggccgtct ctcagctcaa aggttccatc
2940ggcggcaatg tcaaagaccg agccctggag cttgtagccg tagtcgccgg
tccagaacaa 3000agcctgcagc tccagatagg cgatgggcat gtcgttaaca
gagaaggtgt tgccctcgcc 3060ctcggtgatg gtgatgggtt cgccgtcggt
ggaggcggtg atcaggtcat cttggtaggt 3120gacgggcaga gattcgaccg
attgggcgtc tgatctggta taggtcagct tgtacttgtc 3180tccgacagcc
gccagagcgg tggtagcgac ggtgatgagg gagatgagtt tcatattggc
3240ggcaagttta gcaaaagatg gcagtgggat tgagggacaa gagtgtttat
atagatatag 3300atacaacaca acgagtctga atgagacaac cgagacaacc
actcccgaag cctcactaat 3360agttactaac ggcatatttc aggtacc
3387211134DNAYarrowia lipolyticaCDS(1)..(1134)Pex10; GenBank
Accession No. AB036770, nucleotides 1038-2171 21atg tgg gga agt tca
cat gca ttc gct ggt gaa tct gat ctg aca cta 48Met Trp Gly Ser Ser
His Ala Phe Ala Gly Glu Ser Asp Leu Thr Leu1 5 10 15caa cta cac acc
agg tcc aac atg agc gac aat acg aca atc aaa aag 96Gln Leu His Thr
Arg Ser Asn Met Ser Asp Asn Thr Thr Ile Lys Lys 20 25 30ccg atc cga
ccc aaa ccg atc cgg acg gaa cgc ctg cct tac gct ggg 144Pro Ile Arg
Pro Lys Pro Ile Arg Thr Glu Arg Leu Pro Tyr Ala Gly 35 40 45gcc gca
gaa atc atc cga gcc aac cag aaa gac cac tac ttt gag tcc 192Ala Ala
Glu Ile Ile Arg Ala Asn Gln Lys Asp His Tyr Phe Glu Ser 50 55 60gtg
ctt gaa cag cat ctc gtc acg ttt ctg cag aaa tgg aag gga gta 240Val
Leu Glu Gln His Leu Val Thr Phe Leu Gln Lys Trp Lys Gly Val65 70 75
80cga ttt atc cac cag tac aag gag gag ctg gag acg gcg tcc aag ttt
288Arg Phe Ile His Gln Tyr Lys Glu Glu Leu Glu Thr Ala Ser Lys Phe
85 90 95gca tat ctc ggt ttg tgt acg ctt gtg ggc tcc aag act ctc gga
gaa 336Ala Tyr Leu Gly Leu Cys Thr Leu Val Gly Ser Lys Thr Leu Gly
Glu 100 105 110gag tac acc aat ctc atg tac act atc aga gac cga aca
gct cta ccg 384Glu Tyr Thr Asn Leu Met Tyr Thr Ile Arg Asp Arg Thr
Ala Leu Pro 115 120 125ggg gtg gtg aga cgg ttt ggc tac gtg ctt tcc
aac act ctg ttt cca 432Gly Val Val Arg Arg Phe Gly Tyr Val Leu Ser
Asn Thr Leu Phe Pro 130 135 140tac ctg ttt gtg cgc tac atg ggc aag
ttg cgc gcc aaa ctg atg cgc 480Tyr Leu Phe Val Arg Tyr Met Gly Lys
Leu Arg Ala Lys Leu Met Arg145 150 155 160gag tat ccc cat ctg gtg
gag tac gac gaa gat gag cct gtg ccc agc 528Glu Tyr Pro His Leu Val
Glu Tyr Asp Glu Asp Glu Pro Val Pro Ser 165 170 175ccg gaa aca tgg
aag gag cgg gtc atc aag acg ttt gtg aac aag ttt 576Pro Glu Thr Trp
Lys Glu Arg Val Ile Lys Thr Phe Val Asn Lys Phe 180 185 190gac aag
ttc acg gcg ctg gag ggg ttt acc gcg atc cac ttg gcg att 624Asp Lys
Phe Thr Ala Leu Glu Gly Phe Thr Ala Ile His Leu Ala Ile 195 200
205ttc tac gtc tac ggc tcg tac tac cag ctc agt aag cgg atc tgg ggc
672Phe Tyr Val Tyr Gly Ser Tyr Tyr Gln Leu Ser Lys Arg Ile Trp Gly
210 215 220atg cgt tat gta ttt gga cac cga ctg gac aag aat gag cct
cga atc 720Met Arg Tyr Val Phe Gly His Arg Leu Asp Lys Asn Glu Pro
Arg Ile225 230 235 240ggt tac gag atg ctc ggt ctg ctg att ttc gcc
cgg ttt gcc acg tca 768Gly Tyr Glu Met Leu Gly Leu Leu Ile Phe Ala
Arg Phe Ala Thr Ser 245 250 255ttt gtg cag acg gga aga gag tac ctc
gga gcg ctg ctg gaa aag agc 816Phe Val Gln Thr Gly Arg Glu Tyr Leu
Gly Ala Leu Leu Glu Lys Ser 260 265 270gtg gag aaa gag gca ggg gag
aag gaa gat gaa aag gaa gcg gtt gtg 864Val Glu Lys Glu Ala Gly Glu
Lys Glu Asp Glu Lys Glu Ala Val Val 275 280 285ccg aaa aag aag tcg
tca att ccg ttc att gag gat aca gaa ggg gag 912Pro Lys Lys Lys Ser
Ser Ile Pro Phe Ile Glu Asp Thr Glu Gly Glu 290 295 300acg gaa gac
aag atc gat ctg gag gac cct cga cag ctc aag ttc att 960Thr Glu Asp
Lys Ile
Asp Leu Glu Asp Pro Arg Gln Leu Lys Phe Ile305 310 315 320cct gag
gcg tcc aga gcg tgc act ctg tgt ctg tca tac att agt gcg 1008Pro Glu
Ala Ser Arg Ala Cys Thr Leu Cys Leu Ser Tyr Ile Ser Ala 325 330
335ccg gca tgt acg cca tgt gga cac ttt ttc tgt tgg gac tgt att tcc
1056Pro Ala Cys Thr Pro Cys Gly His Phe Phe Cys Trp Asp Cys Ile Ser
340 345 350gaa tgg gtg aga gag aag ccc gag tgt ccc ttg tgt cgg cag
ggt gtg 1104Glu Trp Val Arg Glu Lys Pro Glu Cys Pro Leu Cys Arg Gln
Gly Val 355 360 365aga gag cag aac ttg ttg cct atc aga taa 1134Arg
Glu Gln Asn Leu Leu Pro Ile Arg 370 37522377PRTYarrowia lipolytica
22Met Trp Gly Ser Ser His Ala Phe Ala Gly Glu Ser Asp Leu Thr Leu1
5 10 15Gln Leu His Thr Arg Ser Asn Met Ser Asp Asn Thr Thr Ile Lys
Lys 20 25 30Pro Ile Arg Pro Lys Pro Ile Arg Thr Glu Arg Leu Pro Tyr
Ala Gly 35 40 45Ala Ala Glu Ile Ile Arg Ala Asn Gln Lys Asp His Tyr
Phe Glu Ser 50 55 60Val Leu Glu Gln His Leu Val Thr Phe Leu Gln Lys
Trp Lys Gly Val65 70 75 80Arg Phe Ile His Gln Tyr Lys Glu Glu Leu
Glu Thr Ala Ser Lys Phe 85 90 95Ala Tyr Leu Gly Leu Cys Thr Leu Val
Gly Ser Lys Thr Leu Gly Glu 100 105 110Glu Tyr Thr Asn Leu Met Tyr
Thr Ile Arg Asp Arg Thr Ala Leu Pro 115 120 125Gly Val Val Arg Arg
Phe Gly Tyr Val Leu Ser Asn Thr Leu Phe Pro 130 135 140Tyr Leu Phe
Val Arg Tyr Met Gly Lys Leu Arg Ala Lys Leu Met Arg145 150 155
160Glu Tyr Pro His Leu Val Glu Tyr Asp Glu Asp Glu Pro Val Pro Ser
165 170 175Pro Glu Thr Trp Lys Glu Arg Val Ile Lys Thr Phe Val Asn
Lys Phe 180 185 190Asp Lys Phe Thr Ala Leu Glu Gly Phe Thr Ala Ile
His Leu Ala Ile 195 200 205Phe Tyr Val Tyr Gly Ser Tyr Tyr Gln Leu
Ser Lys Arg Ile Trp Gly 210 215 220Met Arg Tyr Val Phe Gly His Arg
Leu Asp Lys Asn Glu Pro Arg Ile225 230 235 240Gly Tyr Glu Met Leu
Gly Leu Leu Ile Phe Ala Arg Phe Ala Thr Ser 245 250 255Phe Val Gln
Thr Gly Arg Glu Tyr Leu Gly Ala Leu Leu Glu Lys Ser 260 265 270Val
Glu Lys Glu Ala Gly Glu Lys Glu Asp Glu Lys Glu Ala Val Val 275 280
285Pro Lys Lys Lys Ser Ser Ile Pro Phe Ile Glu Asp Thr Glu Gly Glu
290 295 300Thr Glu Asp Lys Ile Asp Leu Glu Asp Pro Arg Gln Leu Lys
Phe Ile305 310 315 320Pro Glu Ala Ser Arg Ala Cys Thr Leu Cys Leu
Ser Tyr Ile Ser Ala 325 330 335Pro Ala Cys Thr Pro Cys Gly His Phe
Phe Cys Trp Asp Cys Ile Ser 340 345 350Glu Trp Val Arg Glu Lys Pro
Glu Cys Pro Leu Cys Arg Gln Gly Val 355 360 365Arg Glu Gln Asn Leu
Leu Pro Ile Arg 370 375231065DNAYarrowia
lipolyticaCDS(1)..(1065)YlPEX10; GenBank Accession No. AJ012084,
which corresponds to nucleotides 1107-2171 of GenBank Accession No.
AB036770 23atg agc gac aat acg aca atc aaa aag ccg atc cga ccc aaa
ccg atc 48Met Ser Asp Asn Thr Thr Ile Lys Lys Pro Ile Arg Pro Lys
Pro Ile1 5 10 15cgg acg gaa cgc ctg cct tac gct ggg gcc gca gaa atc
atc cga gcc 96Arg Thr Glu Arg Leu Pro Tyr Ala Gly Ala Ala Glu Ile
Ile Arg Ala 20 25 30aac cag aaa gac cac tac ttt gag tcc gtg ctt gaa
cag cat ctc gtc 144Asn Gln Lys Asp His Tyr Phe Glu Ser Val Leu Glu
Gln His Leu Val 35 40 45acg ttt ctg cag aaa tgg aag gga gta cga ttt
atc cac cag tac aag 192Thr Phe Leu Gln Lys Trp Lys Gly Val Arg Phe
Ile His Gln Tyr Lys 50 55 60gag gag ctg gag acg gcg tcc aag ttt gca
tat ctc ggt ttg tgt acg 240Glu Glu Leu Glu Thr Ala Ser Lys Phe Ala
Tyr Leu Gly Leu Cys Thr65 70 75 80ctt gtg ggc tcc aag act ctc gga
gaa gag tac acc aat ctc atg tac 288Leu Val Gly Ser Lys Thr Leu Gly
Glu Glu Tyr Thr Asn Leu Met Tyr 85 90 95act atc aga gac cga aca gct
cta ccg ggg gtg gtg aga cgg ttt ggc 336Thr Ile Arg Asp Arg Thr Ala
Leu Pro Gly Val Val Arg Arg Phe Gly 100 105 110tac gtg ctt tcc aac
act ctg ttt cca tac ctg ttt gtg cgc tac atg 384Tyr Val Leu Ser Asn
Thr Leu Phe Pro Tyr Leu Phe Val Arg Tyr Met 115 120 125ggc aag ttg
cgc gcc aaa ctg atg cgc gag tat ccc cat ctg gtg gag 432Gly Lys Leu
Arg Ala Lys Leu Met Arg Glu Tyr Pro His Leu Val Glu 130 135 140tac
gac gaa gat gag cct gtg ccc agc ccg gaa aca tgg aag gag cgg 480Tyr
Asp Glu Asp Glu Pro Val Pro Ser Pro Glu Thr Trp Lys Glu Arg145 150
155 160gtc atc aag acg ttt gtg aac aag ttt gac aag ttc acg gcg ctg
gag 528Val Ile Lys Thr Phe Val Asn Lys Phe Asp Lys Phe Thr Ala Leu
Glu 165 170 175ggg ttt acc gcg atc cac ttg gcg att ttc tac gtc tac
ggc tcg tac 576Gly Phe Thr Ala Ile His Leu Ala Ile Phe Tyr Val Tyr
Gly Ser Tyr 180 185 190tac cag ctc agt aag cgg atc tgg ggc atg cgt
tat gta ttt gga cac 624Tyr Gln Leu Ser Lys Arg Ile Trp Gly Met Arg
Tyr Val Phe Gly His 195 200 205cga ctg gac aag aat gag cct cga atc
ggt tac gag atg ctc ggt ctg 672Arg Leu Asp Lys Asn Glu Pro Arg Ile
Gly Tyr Glu Met Leu Gly Leu 210 215 220ctg att ttc gcc cgg ttt gcc
acg tca ttt gtg cag acg gga aga gag 720Leu Ile Phe Ala Arg Phe Ala
Thr Ser Phe Val Gln Thr Gly Arg Glu225 230 235 240tac ctc gga gcg
ctg ctg gaa aag agc gtg gag aaa gag gca ggg gag 768Tyr Leu Gly Ala
Leu Leu Glu Lys Ser Val Glu Lys Glu Ala Gly Glu 245 250 255aag gaa
gat gaa aag gaa gcg gtt gtg ccg aaa aag aag tcg tca att 816Lys Glu
Asp Glu Lys Glu Ala Val Val Pro Lys Lys Lys Ser Ser Ile 260 265
270ccg ttc att gag gat aca gaa ggg gag acg gaa gac aag atc gat ctg
864Pro Phe Ile Glu Asp Thr Glu Gly Glu Thr Glu Asp Lys Ile Asp Leu
275 280 285gag gac cct cga cag ctc aag ttc att cct gag gcg tcc aga
gcg tgc 912Glu Asp Pro Arg Gln Leu Lys Phe Ile Pro Glu Ala Ser Arg
Ala Cys 290 295 300act ctg tgt ctg tca tac att agt gcg ccg gca tgt
acg cca tgt gga 960Thr Leu Cys Leu Ser Tyr Ile Ser Ala Pro Ala Cys
Thr Pro Cys Gly305 310 315 320cac ttt ttc tgt tgg gac tgt att tcc
gaa tgg gtg aga gag aag ccc 1008His Phe Phe Cys Trp Asp Cys Ile Ser
Glu Trp Val Arg Glu Lys Pro 325 330 335gag tgt ccc ttg tgt cgg cag
ggt gtg aga gag cag aac ttg ttg cct 1056Glu Cys Pro Leu Cys Arg Gln
Gly Val Arg Glu Gln Asn Leu Leu Pro 340 345 350atc aga taa 1065Ile
Arg24354PRTYarrowia lipolytica 24Met Ser Asp Asn Thr Thr Ile Lys
Lys Pro Ile Arg Pro Lys Pro Ile1 5 10 15Arg Thr Glu Arg Leu Pro Tyr
Ala Gly Ala Ala Glu Ile Ile Arg Ala 20 25 30Asn Gln Lys Asp His Tyr
Phe Glu Ser Val Leu Glu Gln His Leu Val 35 40 45Thr Phe Leu Gln Lys
Trp Lys Gly Val Arg Phe Ile His Gln Tyr Lys 50 55 60Glu Glu Leu Glu
Thr Ala Ser Lys Phe Ala Tyr Leu Gly Leu Cys Thr65 70 75 80Leu Val
Gly Ser Lys Thr Leu Gly Glu Glu Tyr Thr Asn Leu Met Tyr 85 90 95Thr
Ile Arg Asp Arg Thr Ala Leu Pro Gly Val Val Arg Arg Phe Gly 100 105
110Tyr Val Leu Ser Asn Thr Leu Phe Pro Tyr Leu Phe Val Arg Tyr Met
115 120 125Gly Lys Leu Arg Ala Lys Leu Met Arg Glu Tyr Pro His Leu
Val Glu 130 135 140Tyr Asp Glu Asp Glu Pro Val Pro Ser Pro Glu Thr
Trp Lys Glu Arg145 150 155 160Val Ile Lys Thr Phe Val Asn Lys Phe
Asp Lys Phe Thr Ala Leu Glu 165 170 175Gly Phe Thr Ala Ile His Leu
Ala Ile Phe Tyr Val Tyr Gly Ser Tyr 180 185 190Tyr Gln Leu Ser Lys
Arg Ile Trp Gly Met Arg Tyr Val Phe Gly His 195 200 205Arg Leu Asp
Lys Asn Glu Pro Arg Ile Gly Tyr Glu Met Leu Gly Leu 210 215 220Leu
Ile Phe Ala Arg Phe Ala Thr Ser Phe Val Gln Thr Gly Arg Glu225 230
235 240Tyr Leu Gly Ala Leu Leu Glu Lys Ser Val Glu Lys Glu Ala Gly
Glu 245 250 255Lys Glu Asp Glu Lys Glu Ala Val Val Pro Lys Lys Lys
Ser Ser Ile 260 265 270Pro Phe Ile Glu Asp Thr Glu Gly Glu Thr Glu
Asp Lys Ile Asp Leu 275 280 285Glu Asp Pro Arg Gln Leu Lys Phe Ile
Pro Glu Ala Ser Arg Ala Cys 290 295 300Thr Leu Cys Leu Ser Tyr Ile
Ser Ala Pro Ala Cys Thr Pro Cys Gly305 310 315 320His Phe Phe Cys
Trp Asp Cys Ile Ser Glu Trp Val Arg Glu Lys Pro 325 330 335Glu Cys
Pro Leu Cys Arg Gln Gly Val Arg Glu Gln Asn Leu Leu Pro 340 345
350Ile Arg 2538PRTYarrowia lipolyticamisc_feature(2)..(3)Xaa can be
any naturally occurring amino acid 25Cys Xaa Xaa Cys Xaa Xaa Xaa
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Cys1 5 10 15Xaa His Xaa Xaa Cys Xaa
Xaa Cys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 20 25 30Xaa Xaa Cys Xaa Xaa
Cys 3526345PRTYarrowia lipolytica 26Met Trp Gly Ser Ser His Ala Phe
Ala Gly Glu Ser Asp Leu Thr Leu1 5 10 15Gln Leu His Thr Arg Ser Asn
Met Ser Asp Asn Thr Thr Ile Lys Lys 20 25 30Pro Ile Arg Pro Lys Pro
Ile Arg Thr Glu Arg Leu Pro Tyr Ala Gly 35 40 45Ala Ala Glu Ile Ile
Arg Ala Asn Gln Lys Asp His Tyr Phe Glu Ser 50 55 60Val Leu Glu Gln
His Leu Val Thr Phe Leu Gln Lys Trp Lys Gly Val65 70 75 80Arg Phe
Ile His Gln Tyr Lys Glu Glu Leu Glu Thr Ala Ser Lys Phe 85 90 95Ala
Tyr Leu Gly Leu Cys Thr Leu Val Gly Ser Lys Thr Leu Gly Glu 100 105
110Glu Tyr Thr Asn Leu Met Tyr Thr Ile Arg Asp Arg Thr Ala Leu Pro
115 120 125Gly Val Val Arg Arg Phe Gly Tyr Val Leu Ser Asn Thr Leu
Phe Pro 130 135 140Tyr Leu Phe Val Arg Tyr Met Gly Lys Leu Arg Ala
Lys Leu Met Arg145 150 155 160Glu Tyr Pro His Leu Val Glu Tyr Asp
Glu Asp Glu Pro Val Pro Ser 165 170 175Pro Glu Thr Trp Lys Glu Arg
Val Ile Lys Thr Phe Val Asn Lys Phe 180 185 190Asp Lys Phe Thr Ala
Leu Glu Gly Phe Thr Ala Ile His Leu Ala Ile 195 200 205Phe Tyr Val
Tyr Gly Ser Tyr Tyr Gln Leu Ser Lys Arg Ile Trp Gly 210 215 220Met
Arg Tyr Val Phe Gly His Arg Leu Asp Lys Asn Glu Pro Arg Ile225 230
235 240Gly Tyr Glu Met Leu Gly Leu Leu Ile Phe Ala Arg Phe Ala Thr
Ser 245 250 255Phe Val Gln Thr Gly Arg Glu Tyr Leu Gly Ala Leu Leu
Glu Lys Ser 260 265 270Val Glu Lys Glu Ala Gly Glu Lys Glu Asp Glu
Lys Glu Ala Val Val 275 280 285Pro Lys Lys Lys Ser Ser Ile Pro Phe
Ile Glu Asp Thr Glu Gly Glu 290 295 300Thr Glu Asp Lys Ile Asp Leu
Glu Asp Pro Arg Gln Leu Lys Phe Ile305 310 315 320Pro Glu Ala Ser
Arg Ala Cys Thr Leu Cys Leu Ser Tyr Ile Ser Ala 325 330 335Pro Ala
Cys Thr Pro Cys Gly His Phe 340 345272987DNAYarrowia
lipolyticamisc_featuremutant acetohydroxyacid synthase (AHAS) with
W497L mutation 27ttccctagtc ccagtgtaca cccgccgata tcgcttaccc
tgcagccgga ttaaggttgg 60caatttttca cgtccttgtc tccgcaatta ctcaccgggt
ggtttataag attgcaagcg 120tcttgatttg tctctgtata ctaacatgca
atcgcgactc gcccgacggg ccactaacct 180ggccagaatc tccagatcca
agtattctct tggtctgcga tatgtttcca acacaaaagc 240ccctgctgcc
cagccggcaa ctgctgagtg agtattcctt gccataaacg acccagaacc
300actgtatagt gtttggaagc actagtcaga agaccagcga aaacaggtgg
aaaaaactga 360gacgaaaagc aacgaccaga aatgtaatgt gtggaaaagc
gacacacaca gagcagataa 420agaggtgaca aataacgaca aatgaaatat
cagtatcttc ccacaatcac tacctctcag 480ctgtctgaag gtgcggctga
tatatccatc ccacgtctaa cgtatggagt gtgatagaat 540atgacgacac
aagcatgaga actcgctctc tatccaacca ccgaaacact gtcactacag
600ccgttcttgt tgctccattc gcttttgtga ttccatgcct tctctggtga
ctgacaacat 660tccttccttt tctccagccc tgttgttatc tgctcatgac
ctacggccac tctctatcgc 720atactaacat agacgatccc agcccgctcc
ccacttccag ggcaccgttg gcaagcctcc 780tatcctcaag aaggctgagg
ctgccaacgc tgacatggac gagtccttca tcggaatgtc 840tggaggagag
atcttccacg agatgatgct gcgacacaac gtcgacactg tcttcggtta
900ccccggtgga gccattctcc ccgtctttga cgccattcac aactctgagt
acttcaactt 960tgtgctccct cgacacgagc agggtgccgg ccacatggcc
gagggctacg ctcgagcctc 1020tggtaagccc ggtgtcgttc tcgtcacctc
tggccccggt gccaccaacg tcatcacccc 1080catgcaggac gctctttccg
atggtacccc catggttgtc ttcaccggtc aggtcctgac 1140ctccgttatc
ggcactgacg ccttccagga ggccgatgtt gtcggcatct cccgatcttg
1200caccaagtgg aacgtcatgg tcaagaacgt tgctgagctc ccccgacgaa
tcaacgaggc 1260ctttgagatt gctacttccg gccgacccgg tcccgttctc
gtcgatctgc ccaaggatgt 1320tactgctgcc atcctgcgag agcccatccc
caccaagtcc accattccct cgcattctct 1380gaccaacctc acctctgccg
ccgccaccga gttccagaag caggctatcc agcgagccgc 1440caacctcatc
aaccagtcca agaagcccgt cctttacgtc ggacagggta tccttggctc
1500cgaggagggt cctaagctgc ttaaggagct ggctgagaag gccgagattc
ccgtcaccac 1560tactctgcag ggtcttggtg cctttgacga gcgagacccc
aagtctctgc acatgctcgg 1620tatgcacggt tccggctacg ccaacatggc
catgcagaac gctgactgta tcattgctct 1680cggcgcccga tttgatgacc
gagttaccgg ctccatcccc aagtttgccc ccgaggctcg 1740agccgctgcc
cttgagggtc gaggtggtat tgttcacttt gagatccagg ccaagaacat
1800caacaaggtt gttcaggcca ccgaagccgt tgagggagac gttaccgagt
ctgtccgaca 1860gctcatcccc ctcatcaaca aggtctctgc cgctgagcga
gctccctgga ctgagactat 1920ccagtcctgg aagcagcagt tccccttcct
cttcgaggct gaaggtgagg atggtgttat 1980caagccccag tccgtcattg
ctctgctctc tgacctgaca gagaacaaca aggacaagac 2040catcatcacc
accggtgttg gtcagcatca gatgtggact gcccagcatt tccgatggcg
2100acaccctcga accatgatca cttctggtgg tcttggaact atgggttacg
gcctgcccgc 2160cgctatcggc gccaaggttg cccgacctga ctgcgacgtc
attgacatcg atggtgacgc 2220ttctttcaac atgactctga ccgagctgtc
caccgccgtt cagttcaaca ttggcgtcaa 2280ggctattgtc ctcaacaacg
aggaacaggg tatggtcacc cagctgcagt ctctcttcta 2340cgagaaccga
tactgccaca ctcatcagaa gaaccccgac ttcatgaagc tggccgagtc
2400catgggcatg aagggtatcc gaatcactca cattgaccag ctggaggccg
gtctcaagga 2460gatgctcgca tacaagggcc ctgtgctcgt tgaggttgtt
gtcgacaaga agatccccgt 2520tcttcccatg gttcccgctg gtaaggcttt
gcatgagttc cttgtctacg acgctgacgc 2580cgaggctgct tctcgacccg
atcgactgaa gaatgccccc gcccctcacg tccaccagac 2640cacctttgag
aactaagtgg aaaggaacac aagcaatccg aaccaaaaat aattggggtc
2700ccgtgcccac agagtctagt gcagacctaa aatgaccaca gtaaattata
gctgttatta 2760aacatgagat tttgaccaac aagagcgtag gaatgttatt
agctactact tgtacataca 2820cagcatttgt tttaaataat gttgcctcca
ggggcagtga gatcaggacc cagatccgtg 2880gccagctctc tgacttcaga
ccgcttgtac ttaagcagct cgcaacactg ttgtcgagga 2940ttgaacttgc
catattcgat tttgtggtca tgaatccagc acacctc 29872813066DNAArtificial
SequencePlasmid pZP3-Pa777U 28tctcggtcta ttcttttgat ttataaggga
ttttgccgat ttcggcctat tggttaaaaa 60atgagctgat ttaacaaaaa tttaacgcga
attttaacaa aatattaacg cttacaattt 120cctgatgcgg tattttctcc
ttacgcatct gtgcggtatt tcacaccgca tcaggtggca 180cttttcgggg
aaatgtgcgc ggaaccccta tttgtttatt tttctaaata cattcaaata
240tgtatccgct catgagacaa taaccctgat aaatgcttca ataatattga
aaaaggaaga 300gtatgagtat tcaacatttc cgtgtcgccc ttattccctt
ttttgcggca ttttgccttc 360ctgtttttgc tcacccagaa acgctggtga
aagtaaaaga tgctgaagat cagttgggtg 420cacgagtggg ttacatcgaa
ctggatctca acagcggtaa gatccttgag agttttcgcc 480ccgaagaacg
ttttccaatg atgagcactt ttaaagttct gctatgtggc gcggtattat
540cccgtattga cgccgggcaa gagcaactcg gtcgccgcat acactattct
cagaatgact 600tggttgagta ctcaccagtc acagaaaagc atcttacgga
tggcatgaca gtaagagaat
660tatgcagtgc tgccataacc atgagtgata acactgcggc caacttactt
ctgacaacga 720tcggaggacc gaaggagcta accgcttttt tgcacaacat
gggggatcat gtaactcgcc 780ttgatcgttg ggaaccggag ctgaatgaag
ccataccaaa cgacgagcgt gacaccacga 840tgcctgtagc aatggcaaca
acgttgcgca aactattaac tggcgaacta cttactctag 900cttcccggca
acaattaata gactggatgg aggcggataa agttgcagga ccacttctgc
960gctcggccct tccggctggc tggtttattg ctgataaatc tggagccggt
gagcgtgggt 1020ctcgcggtat cattgcagca ctggggccag atggtaagcc
ctcccgtatc gtagttatct 1080acacgacggg gagtcaggca actatggatg
aacgaaatag acagatcgct gagataggtg 1140cctcactgat taagcattgg
taactgtcag accaagttta ctcatatata ctttagattg 1200atttaaaact
tcatttttaa tttaaaagga tctaggtgaa gatccttttt gataatctca
1260tgaccaaaat cccttaacgt gagttttcgt tccactgagc gtcagacccc
gtagaaaaga 1320tcaaaggatc ttcttgagat cctttttttc tgcgcgtaat
ctgctgcttg caaacaaaaa 1380aaccaccgct accagcggtg gtttgtttgc
cggatcaaga gctaccaact ctttttccga 1440aggtaactgg cttcagcaga
gcgcagatac caaatactgt tcttctagtg tagccgtagt 1500taggccacca
cttcaagaac tctgtagcac cgcctacata cctcgctctg ctaatcctgt
1560taccagtggc tgctgccagt ggcgataagt cgtgtcttac cgggttggac
tcaagacgat 1620agttaccgga taaggcgcag cggtcgggct gaacgggggg
ttcgtgcaca cagcccagct 1680tggagcgaac gacctacacc gaactgagat
acctacagcg tgagctatga gaaagcgcca 1740cgcttcccga agggagaaag
gcggacaggt atccggtaag cggcagggtc ggaacaggag 1800agcgcacgag
ggagcttcca gggggaaacg cctggtatct ttatagtcct gtcgggtttc
1860gccacctctg acttgagcgt cgatttttgt gatgctcgtc aggggggcgg
agcctatgga 1920aaaacgccag caacgcggcc tttttacggt tcctggcctt
ttgctggcct tttgctcaca 1980tgttctttcc tgcgttatcc cctgattctg
tggataaccg tattaccgcc tttgagtgag 2040ctgataccgc tcgccgcagc
cgaacgaccg agcgcagcga gtcagtgagc gaggaagcgg 2100aagagcgccc
aatacgcaaa ccgcctctcc ccgcgcgttg gccgattcat taatgcagct
2160ggcgcgccac caatcacaat tctgaaaagc acatcttgat ctcctcattg
cggggagtcc 2220aacggtggtc ttattccccc gaatttcccg ctcaatctcg
ttccagaccg acccggacac 2280agtgcttaac gccgttccga aactctaccg
cagatatgct ccaacggact gggctgcata 2340gatgtgatcc tcggcttgga
gaaatggata aaagccggcc aaaaaaaaag cggaaaaaag 2400cggaaaaaaa
gagaaaaaaa atcgcaaaat ttgaaaaata gggggaaaag acgcaaaaac
2460gcaaggaggg gggagtatat gacactgata agcaagctca caacggttcc
tcttattttt 2520ttcctcatct tctgcctagg ttcccaaaat cccagatgct
tctctccagt gccaaaagta 2580agtaccccac aggttttcgg ccgaaaattc
cacgtgcagc aacgtcgtgt ggggtgttaa 2640aatgtggggg gggggaacca
ggacaagagg ctcttgtggg agccgaatga gagcacaaag 2700cgggcgggtg
tgataagggc atttttgccc attttccctt ctcctgtctc tccgacggtg
2760atggcgttgt gcgtcctcta tttcttttta tttctttttg ttttatttct
ctgactaccg 2820atttggtttg atttcctcaa ccccacacaa ataagctcgg
gccgaggaat atatatatac 2880acggacacag tcgccctgtg gacaacacgt
cactacctct acgatacaca ccgtacgttg 2940tgtggaagct tgtgagcgga
taacaatttc acacaggaaa cagctatgac catgattacg 3000ccaagctcga
aattaaccct cactaaaggg aacaaaagct ggagctccac cgcggacaca
3060atatctggtc aaatttcagt ttcgttacat ttaaattcct tcacttcaag
ttcattcttc 3120atctgcttct gttttacttt gacaggcaaa tgaagacatg
gtacgacttg atggaggcca 3180agaacgccat ttcaccccga gacaccgaag
tgcctgaaat cctggctgcc cccattgata 3240acatcggaaa ctacggtatt
ccggaaagtg tatatagaac ctttccccag cttgtgtctg 3300tggatatgga
tggtgtaatc ccctttgagt actcgtcttg gcttctctcc gagcagtatg
3360aggctctcta atctagcgca tttaatatct caatgtattt atatatttat
cttctcatgc 3420ggccgcttag ttggctttgg tcttggcagc cttggcctcc
ttgagggtaa acatcttggc 3480atccttgtcg accacgccgt acttggcgta
cataagacca attcggatga aggtgggaat 3540gatgggagaa gccgactttc
gcaccagttc gggaaaggcc tgagcgaagg cagcagtggc 3600ctcgttgagc
ttgtagtgag gaatgatggg aaacagatgg tggatctgat gtgtaccaat
3660gttgtgggac aggttgtcga tgagggctcc gtagcttcgg tccacagagg
acaagttgcc 3720cttgacatag gtccactccg aatcggcgta ccagggagtt
tcctcgtcgt tgtgatggag 3780gaaggtagtg acaaccagca tggtggcgaa
tccaaagaga ggtgcgaagt aatacagagc 3840catggtcttg aggccgtaga
cgtaggtaag gtaggcgtac agaccagcaa aggccacgag 3900agagccgagg
gaaatgatga cggcagacat tcttcgcagg tagagaggct cccagggatt
3960gaagtggttg acctttcggg gaggaaatcc agcaacgagg taggcaaacc
aagccgaacc 4020aagggagatg accatgtgtc gggacagggg atgagagtcg
gcttctcgct gagggtagaa 4080gatctcatcc ttgtcgatgt tgccggtgtt
cttgtgatgg tgtcgatggc tgatcttcca 4140cgactcgtag ggagtcagaa
tgatggagtg aatgagtgtg ccaacagaga agttgagcag 4200gtgggatcgc
gagaaggcac catgtccaca gtcgtgaccg atggtaaaga atccccagaa
4260cacgataccc tggagcagaa tgtagccagt gcaaaggacg gcatcgagca
gtgcaaactc 4320ctgcacgata gcaagggctc gagcatagta cagtccgaga
gcaagggaac cggcaatgcc 4380cagagctcgc acggtatagt agagggacca
gggaacagag gcttcgaagc agtgggcagg 4440cagggatcgc ttgatctcgg
tgagagtagg gaactcgtag ggagcggcaa cggtagagga 4500agccatggtt
gtgaattagg gtggtgagaa tggttggttg tagggaagaa tcaaaggccg
4560gtctcgggat ccgtgggtat atatatatat atatatatat acgatccttc
gttacctccc 4620tgttctcaaa actgtggttt ttcgtttttc gttttttgct
ttttttgatt tttttagggc 4680caactaagct tccagatttc gctaatcacc
tttgtactaa ttacaagaaa ggaagaagct 4740gattagagtt gggcttttta
tgcaactgtg ctactcctta tctctgatat gaaagtgtag 4800acccaatcac
atcatgtcat ttagagttgg taatactggg aggatagata aggcacgaaa
4860acgagccata gcagacatgc tgggtgtagc caagcagaag aaagtagatg
ggagccaatt 4920gacgagcgag ggagctacgc caatccgaca tacgacacgc
tgagatcgtc ttggccgggg 4980ggtacctaca gatgtccaag ggtaagtgct
tgactgtaat tgtatgtctg aggacaaata 5040tgtagtcagc cgtataaagt
cataccaggc accagtgcca tcatcgaacc actaactctc 5100tatgatacat
gcctccggta ttattgtacc atgcgtcgct ttgttacata cgtatcttgc
5160ctttttctct cagaaactcc agactttggc tattggtcga gataagcccg
gaccatagtg 5220agtctttcac actctacatt tctcccttgc tccaactatc
gattgttgtc tactaactat 5280cgtacgataa cttcgtatag catacattat
acgaagttat cgcgtcgacg agtatctgtc 5340tgactcgtca ttgccgcctt
tggagtacga ctccaactat gagtgtgctt ggatcacttt 5400gacgatacat
tcttcgttgg aggctgtggg tctgacagct gcgttttcgg cgcggttggc
5460cgacaacaat atcagctgca acgtcattgc tggctttcat catgatcaca
tttttgtcgg 5520caaaggcgac gcccagagag ccattgacgt tctttctaat
ttggaccgat agccgtatag 5580tccagtctat ctataagttc aactaactcg
taactattac cataacatat acttcactgc 5640cccagataag gttccgataa
aaagttctgc agactaaatt tatttcagtc tcctcttcac 5700caccaaaatg
ccctcctacg aagctcgagc taacgtccac aagtccgcct ttgccgctcg
5760agtgctcaag ctcgtggcag ccaagaaaac caacctgtgt gcttctctgg
atgttaccac 5820caccaaggag ctcattgagc ttgccgataa ggtcggacct
tatgtgtgca tgatcaaaac 5880ccatatcgac atcattgacg acttcaccta
cgccggcact gtgctccccc tcaaggaact 5940tgctcttaag cacggtttct
tcctgttcga ggacagaaag ttcgcagata ttggcaacac 6000tgtcaagcac
cagtaccggt gtcaccgaat cgccgagtgg tccgatatca ccaacgccca
6060cggtgtaccc ggaaccggaa tcattgctgg cctgcgagct ggtgccgagg
aaactgtctc 6120tgaacagaag aaggaggacg tctctgacta cgagaactcc
cagtacaagg agttcctagt 6180cccctctccc aacgagaagc tggccagagg
tctgctcatg ctggccgagc tgtcttgcaa 6240gggctctctg gccactggcg
agtactccaa gcagaccatt gagcttgccc gatccgaccc 6300cgagtttgtg
gttggcttca ttgcccagaa ccgacctaag ggcgactctg aggactggct
6360tattctgacc cccggggtgg gtcttgacga caagggagac gctctcggac
agcagtaccg 6420aactgttgag gatgtcatgt ctaccggaac ggatatcata
attgtcggcc gaggtctgta 6480cggccagaac cgagatccta ttgaggaggc
caagcgatac cagaaggctg gctgggaggc 6540ttaccagaag attaactgtt
agaggttaga ctatggatat gtaatttaac tgtgtatata 6600gagagcgtgc
aagtatggag cgcttgttca gcttgtatga tggtcagacg acctgtctga
6660tcgagtatgt atgatactgc acaacctgtg tatccgcatg atctgtccaa
tggggcatgt 6720tgttgtgttt ctcgatacgg agatgctggg tacagtgcta
atacgttgaa ctacttatac 6780ttatatgagg ctcgaagaaa gctgacttgt
gtatgactta ttctcaacta catccccagt 6840cacaatacca ccactgcact
accactacac caaaaccatg atcaaaccac ccatggactt 6900cctggaggca
gaagaacttg ttatggaaaa gctcaagaga gagatcataa cttcgtatag
6960catacattat acgaagttat cctgcaggta aaggaattca tgctgttcat
cgtggttaat 7020gctgctgtgt gctgtgtgtg tgtgttgttt ggcgctcatt
gttgcgttat gcagcgtaca 7080ccacaatatt ggaagcttat tagcctttct
attttttcgt ttgcaaggct taacaacatt 7140gctgtggaga gggatgggga
tatggaggcc gctggaggga gtcggagagg cgttttggag 7200cggcttggcc
tggcgcccag ctcgcgaaac gcacctagga ccctttggca cgccgaaatg
7260tgccactttt cagtctagta acgccttacc tacgtcattc catgcgtgca
tgtttgcgcc 7320ttttttccct tgcccttgat cgccacacag tacagtgcac
tgtacagtgg aggttttggg 7380ggggtcttag atgggagcta aaagcggcct
agcggtacac tagtgggatt gtatggagtg 7440gcatggagcc taggtggagc
ctgacaggac gcacgaccgg ctagcccgtg acagacgatg 7500ggtggctcct
gttgtccacc gcgtacaaat gtttgggcca aagtcttgtc agccttgctt
7560gcgaacctaa ttcccaattt tgtcacttcg cacccccatt gatcgagccc
taacccctgc 7620ccatcaggca atccaattaa gctcgcattg tctgccttgt
ttagtttggc tcctgcccgt 7680ttcggcgtcc acttgcacaa acacaaacaa
gcattatata taaggctcgt ctctccctcc 7740caaccacact cacttttttg
cccgtcttcc cttgctaaca caaaagtcaa gaacacaaac 7800aaccacccca
acccccttac acacaagaca tatctacagc aatggccatg gcttcttcca
7860ctgttgctgc gccgtacgag ttcccgacgc tgacggagat caagcgctcg
ctgccagcgc 7920actgctttga ggcctcggtc ccgtggtcgc tctactacac
cgtgcgcgcg ctgggcatcg 7980ccggctcgct cgcgctcggc ctctactacg
cgcgcgcgct cgcgatcgtg caggagtttg 8040ccctgctgga tgcggtgctc
tgcacggggt acattctgct gcagggcatc gtattctggg 8100ggttcttcac
catcggccat gactgcggcc acggcgcgtt ctcgcgttcg cacctgctca
8160acttcagcgt cggcacgctc attcactcga tcatcctcac gccgtacgag
tcatggaaga 8220tctcgcaccg ccaccaccac aagaacacgg gcaacatcga
caaggacgag attttctacc 8280cgcagcgcga ggccgactcg cacccactgt
cccgacacat ggtgatctcg ctcggctcgg 8340cctggttcgc gtacctcgtt
gcgggcttcc ctcctcgcaa ggtgaaccac ttcaaccctt 8400gggaaccgtt
gtacctgcgc cgcatgtctg ccgtcatcat ctcactcggc tcgctcgtgg
8460cgttcgcggg cttgtatgcg tatctcacct acgtctatgg ccttaagacc
atggcgctgt 8520actacttcgc ccctctcttt gggttcgcca cgatgctcgt
ggtcactacc tttttgcacc 8580acaatgacga ggaaacgcca tggtacgccg
actcggagtg gacgtacgtc aagggcaacc 8640tctcgtccgt ggaccgctcg
tacggcgcgc tcatcgacaa cctgagccac aacatcggca 8700cgcaccagat
ccaccacctg tttccgatca tcccgcacta caagctgaac gaggcgacgg
8760cagcgttcgc gcaggcgttc ccggagctcg tgcgcaagag cgcgtcgccg
atcatcccga 8820cgttcatccg catcgggctc atgtacgcca agtacggcgt
cgtggacaag gacgccaaga 8880tgtttacgct caaggaggcc aaggccgcca
agaccaaggc caactaggcg gccgcattga 8940tgattggaaa cacacacatg
ggttatatct aggtgagagt tagttggaca gttatatatt 9000aaatcagcta
tgccaacggt aacttcattc atgtcaacga ggaaccagtg actgcaagta
9060atatagaatt tgaccacctt gccattctct tgcactcctt tactatatct
catttatttc 9120ttatatacaa atcacttctt cttcccagca tcgagctcgg
aaacctcatg agcaataaca 9180tcgtggatct cgtcaataga gggctttttg
gactccttgc tgttggccac cttgtccttg 9240ctgtttaaac agtgtacgca
gatctactat agaggaacat ttaaattgcc ccggagaaga 9300cggccaggcc
gcctagatga caaattcaac aactcacagc tgactttctg ccattgccac
9360tagggggggg cctttttata tggccaagcc aagctctcca cgtcggttgg
gctgcaccca 9420acaataaatg ggtagggttg caccaacaaa gggatgggat
ggggggtaga agatacgagg 9480ataacggggc tcaatggcac aaataagaac
gaatactgcc attaagactc gtgatccagc 9540gactgacacc attgcatcat
ctaagggcct caaaactacc tcggaactgc tgcgctgatc 9600tggacaccac
agaggttccg agcactttag gttgcaccaa atgtcccacc aggtgcaggc
9660agaaaacgct ggaacagcgt gtacagtttg tcttaacaaa aagtgagggc
gctgaggtcg 9720agcagggtgg tgtgacttgt tatagccttt agagctgcga
aagcgcgtat ggatttggct 9780catcaggcca gattgagggt ctgtggacac
atgtcatgtt agtgtacttc aatcgccccc 9840tggatatagc cccgacaata
ggccgtggcc tcattttttt gccttccgca catttccatt 9900gctcggtacc
cacaccttgc ttctcctgca cttgccaacc ttaatactgg tttacattga
9960ccaacatctt acaagcgggg ggcttgtcta gggtatatat aaacagtggc
tctcccaatc 10020ggttgccagt ctcttttttc ctttctttcc ccacagattc
gaaatctaaa ctacacatca 10080cagaattccg agccgtgagt atccacgaca
agatcagtgt cgagacgacg cgttttgtgt 10140aatgacacaa tccgaaagtc
gctagcaaca cacactctct acacaaacta acccagctct 10200ggtaccatgg
cttcttccac tgttgctgcg ccgtacgagt tcccgacgct gacggagatc
10260aagcgctcgc tgccagcgca ctgctttgag gcctcggtcc cgtggtcgct
ctactacacc 10320gtgcgcgcgc tgggcatcgc cggctcgctc gcgctcggcc
tctactacgc gcgcgcgctc 10380gcgatcgtgc aggagtttgc cctgctggat
gcggtgctct gcacggggta cattctgctg 10440cagggcatcg tattctgggg
gttcttcacc atcggccatg actgcggcca cggcgcgttc 10500tcgcgttcgc
acctgctcaa cttcagcgtc ggcacgctca ttcactcgat catcctcacg
10560ccgtacgagt catggaagat ctcgcaccgc caccaccaca agaacacggg
caacatcgac 10620aaggacgaga ttttctaccc gcagcgcgag gccgactcgc
acccactgtc ccgacacatg 10680gtgatctcgc tcggctcggc ctggttcgcg
tacctcgttg cgggcttccc tcctcgcaag 10740gtgaaccact tcaacccttg
ggaaccgttg tacctgcgcc gcatgtctgc cgtcatcatc 10800tcactcggct
cgctcgtggc gttcgcgggc ttgtatgcgt atctcaccta cgtctatggc
10860cttaagacca tggcgctgta ctacttcgcc cctctctttg ggttcgccac
gatgctcgtg 10920gtcactacct ttttgcacca caatgacgag gaaacgccat
ggtacgccga ctcggagtgg 10980acgtacgtca agggcaacct ctcgtccgtg
gaccgctcgt acggcgcgct catcgacaac 11040ctgagccaca acatcggcac
gcaccagatc caccacctgt ttccgatcat cccgcactac 11100aagctgaacg
aggcgacggc agcgttcgcg caggcgttcc cggagctcgt gcgcaagagc
11160gcgtcgccga tcatcccgac gttcatccgc atcgggctca tgtacgccaa
gtacggcgtc 11220gtggacaagg acgccaagat gtttacgctc aaggaggcca
aggccgccaa gaccaaggcc 11280aactaggcgg ccgcatggag cgtgtgttct
gagtcgatgt tttctatgga gttgtgagtg 11340ttagtagaca tgatgggttt
atatatgatg aatgaataga tgtgattttg atttgcacga 11400tggaattgag
aactttgtaa acgtacatgg gaatgtatga atgtgggggt tttgtgactg
11460gataactgac ggtcagtgga cgccgttgtt caaatatcca agagatgcga
gaaactttgg 11520gtcaagtgaa catgtcctct ctgttcaagt aaaccatcaa
ctatgggtag tatatttagt 11580aaggacaaga gttgagattc tttggagtcc
tagaaacgta ttttcgcgtt ccaagatcaa 11640attagtagag taatacgggc
acgggaatcc attcatagtc tcaatcctgc aggtgagtta 11700attaagatga
cgacatttgc gagctggacg aggaatagat ggagcgtgtg ttctgagtcg
11760atgttttcta tggagttgtg agtgttagta gacatgatgg gtttatatat
gatgaatgaa 11820tagatgtgat tttgatttgc acgatggaat tgagaacttt
gtaaacgtac atgggaatgt 11880atgaatgtgg gggttttgtg actggataac
tgacggtcag tggacgccgt tgttcaaata 11940tccaagagat gcgagaaact
ttgggtcaag tgaacatgtc ctctctgttc aagtaaacca 12000tcaactatgg
gtagtatatt tagtaaggac aagagttgag attctttgga gtcctagaaa
12060cgtattttcg cgttccaaga tcaaattagt agagtaatac gggcacggga
atccattcat 12120agtctcaatt ttcccatagg tgtgctacaa ggtgttgaga
tgtggtacag taccaccatg 12180attcgaggta aagagcccag aagtcattga
tgaggtcaag aaatacacag atctacagct 12240caatacaatg aatatcttct
ttcatattct tcaggtgaca ccaagggtgt ctattttccc 12300cagaaatgcg
tgaaaaggcg cgtgtgtagc gtggagtatg ggttcggttg gcgtatcctt
12360catatatcga cgaaatagta gggcaagaga tgacaaaaag tatctatatg
tagacagcgt 12420agaatatgga tttgattggt ataaattcat ttattgcgtg
tctcacaaat actctcgata 12480agttggggtt aaactggaga tggaacaatg
tcgatatctc gacgcatgcg acgtcgggcc 12540caattcgccc tatagtgagt
cgtattacaa ttcactggcc gtcgttttac aacgtcgtga 12600ctgggaaaac
cctggcgtta cccaacttaa tcgccttgca gcacatcccc ctttcgccag
12660ctggcgtaat agcgaagagg cccgcaccga tcgcccttcc caacagttgc
gcagcctgaa 12720tggcgaatgg acgcgccctg tagcggcgca ttaagcgcgg
cgggtgtggt ggttacgcgc 12780agcgtgaccg ctacacttgc cagcgcccta
gcgcccgctc ctttcgcttt cttcccttcc 12840tttctcgcca cgttcgccgg
ctttccccgt caagctctaa atcgggggct ccctttaggg 12900ttccgattta
gtgctttacg gcacctcgac cccaaaaaac ttgattaggg tgatggttca
12960cgtagtgggc catcgccctg atagacggtt tttcgccctt tgacgttgga
gtccacgttc 13020tttaatagtg gactcttgtt ccaaactgga acaacactca acccta
13066299570DNAArtificial SequencePlasmid pY117 29ggccgccacc
gcggcccgag attccggcct cttcggccgc caagcgaccc gggtggacgt 60ctagaggtac
ctagcaatta acagatagtt tgccggtgat aattctctta acctcccaca
120ctcctttgac ataacgattt atgtaacgaa actgaaattt gaccagatat
tgtgtccgcg 180gtggagctcc agcttttgtt ccctttagtg agggtttaaa
cgagcttggc gtaatcatgg 240tcatagctgt ttcctgtgtg aaattgttat
ccgctcacaa ttccacacaa cgtacgagcc 300ggaagcataa agtgtaaagc
ctggggtgcc taatgagtga gctaactcac attaattgcg 360ttgcgctcac
tgcccgcttt ccagtcggga aacctgtcgt gccagctgca ttaatgaatc
420ggccaacgcg cggggagagg cggtttgcgt attgggcgct cttccgcttc
ctcgctcact 480gactcgctgc gctcggtcgt tcggctgcgg cgagcggtat
cagctcactc aaaggcggta 540atacggttat ccacagaatc aggggataac
gcaggaaaga acatgtgagc aaaaggccag 600caaaaggcca ggaaccgtaa
aaaggccgcg ttgctggcgt ttttccatag gctccgcccc 660cctgacgagc
atcacaaaaa tcgacgctca agtcagaggt ggcgaaaccc gacaggacta
720taaagatacc aggcgtttcc ccctggaagc tccctcgtgc gctctcctgt
tccgaccctg 780ccgcttaccg gatacctgtc cgcctttctc ccttcgggaa
gcgtggcgct ttctcatagc 840tcacgctgta ggtatctcag ttcggtgtag
gtcgttcgct ccaagctggg ctgtgtgcac 900gaaccccccg ttcagcccga
ccgctgcgcc ttatccggta actatcgtct tgagtccaac 960ccggtaagac
acgacttatc gccactggca gcagccactg gtaacaggat tagcagagcg
1020aggtatgtag gcggtgctac agagttcttg aagtggtggc ctaactacgg
ctacactaga 1080aggacagtat ttggtatctg cgctctgctg aagccagtta
ccttcggaaa aagagttggt 1140agctcttgat ccggcaaaca aaccaccgct
ggtagcggtg gtttttttgt ttgcaagcag 1200cagattacgc gcagaaaaaa
aggatctcaa gaagatcctt tgatcttttc tacggggtct 1260gacgctcagt
ggaacgaaaa ctcacgttaa gggattttgg tcatgagatt atcaaaaagg
1320atcttcacct agatcctttt aaattaaaaa tgaagtttta aatcaatcta
aagtatatat 1380gagtaaactt ggtctgacag ttaccaatgc ttaatcagtg
aggcacctat ctcagcgatc 1440tgtctatttc gttcatccat agttgcctga
ctccccgtcg tgtagataac tacgatacgg 1500gagggcttac catctggccc
cagtgctgca atgataccgc gagacccacg ctcaccggct 1560ccagatttat
cagcaataaa ccagccagcc ggaagggccg agcgcagaag tggtcctgca
1620actttatccg cctccatcca gtctattaat tgttgccggg aagctagagt
aagtagttcg 1680ccagttaata gtttgcgcaa cgttgttgcc attgctacag
gcatcgtggt gtcacgctcg 1740tcgtttggta tggcttcatt cagctccggt
tcccaacgat caaggcgagt tacatgatcc 1800cccatgttgt gcaaaaaagc
ggttagctcc ttcggtcctc cgatcgttgt cagaagtaag 1860ttggccgcag
tgttatcact catggttatg gcagcactgc ataattctct tactgtcatg
1920ccatccgtaa gatgcttttc tgtgactggt gagtactcaa ccaagtcatt
ctgagaatag 1980tgtatgcggc gaccgagttg ctcttgcccg gcgtcaatac
gggataatac cgcgccacat 2040agcagaactt taaaagtgct catcattgga
aaacgttctt cggggcgaaa actctcaagg 2100atcttaccgc tgttgagatc
cagttcgatg taacccactc gtgcacccaa ctgatcttca 2160gcatctttta
ctttcaccag cgtttctggg tgagcaaaaa caggaaggca aaatgccgca
2220aaaaagggaa taagggcgac acggaaatgt tgaatactca tactcttcct
ttttcaatat 2280tattgaagca tttatcaggg ttattgtctc atgagcggat
acatatttga atgtatttag 2340aaaaataaac aaataggggt tccgcgcaca
tttccccgaa aagtgccacc tgacgcgccc 2400tgtagcggcg cattaagcgc
ggcgggtgtg gtggttacgc gcagcgtgac cgctacactt 2460gccagcgccc
tagcgcccgc tcctttcgct ttcttccctt cctttctcgc cacgttcgcc
2520ggctttcccc gtcaagctct aaatcggggg ctccctttag ggttccgatt
tagtgcttta 2580cggcacctcg accccaaaaa
acttgattag ggtgatggtt cacgtagtgg gccatcgccc 2640tgatagacgg
tttttcgccc tttgacgttg gagtccacgt tctttaatag tggactcttg
2700ttccaaactg gaacaacact caaccctatc tcggtctatt cttttgattt
ataagggatt 2760ttgccgattt cggcctattg gttaaaaaat gagctgattt
aacaaaaatt taacgcgaat 2820tttaacaaaa tattaacgct tacaatttcc
attcgccatt caggctgcgc aactgttggg 2880aagggcgatc ggtgcgggcc
tcttcgctat tacgccagct ggcgaaaggg ggatgtgctg 2940caaggcgatt
aagttgggta acgccagggt tttcccagtc acgacgttgt aaaacgacgg
3000ccagtgaatt gtaatacgac tcactatagg gcgaattggg taccgggccc
cccctcgagg 3060tcgatggtgt cgataagctt gatatcgaat tcatgtcaca
caaaccgatc ttcgcctcaa 3120ggaaacctaa ttctacatcc gagagactgc
cgagatccag tctacactga ttaattttcg 3180ggccaataat ttaaaaaaat
cgtgttatat aatattatat gtattatata tatacatcat 3240gatgatactg
acagtcatgt cccattgcta aatagacaga ctccatctgc cgcctccaac
3300tgatgttctc aatatttaag gggtcatctc gcattgttta ataataaaca
gactccatct 3360accgcctcca aatgatgttc tcaaaatata ttgtatgaac
ttatttttat tacttagtat 3420tattagacaa cttacttgct ttatgaaaaa
cacttcctat ttaggaaaca atttataatg 3480gcagttcgtt catttaacaa
tttatgtaga ataaatgtta taaatgcgta tgggaaatct 3540taaatatgga
tagcataaat gatatctgca ttgcctaatt cgaaatcaac agcaacgaaa
3600aaaatccctt gtacaacata aatagtcatc gagaaatatc aactatcaaa
gaacagctat 3660tcacacgtta ctattgagat tattattgga cgagaatcac
acactcaact gtctttctct 3720cttctagaaa tacaggtaca agtatgtact
attctcattg ttcatacttc tagtcatttc 3780atcccacata ttccttggat
ttctctccaa tgaatgacat tctatcttgc aaattcaaca 3840attataataa
gatataccaa agtagcggta tagtggcaat caaaaagctt ctctggtgtg
3900cttctcgtat ttatttttat tctaatgatc cattaaaggt atatatttat
ttcttgttat 3960ataatccttt tgtttattac atgggctgga tacataaagg
tattttgatt taattttttg 4020cttaaattca atcccccctc gttcagtgtc
aactgtaatg gtaggaaatt accatacttt 4080tgaagaagca aaaaaaatga
aagaaaaaaa aaatcgtatt tccaggttag acgttccgca 4140gaatctagaa
tgcggtatgc ggtacattgt tcttcgaacg taaaagttgc gctccctgag
4200atattgtaca tttttgcttt tacaagtaca agtacatcgt acaactatgt
actactgttg 4260atgcatccac aacagtttgt tttgtttttt tttgtttttt
ttttttctaa tgattcatta 4320ccgctatgta tacctacttg tacttgtagt
aagccgggtt attggcgttc aattaatcat 4380agacttatga atctgcacgg
tgtgcgctgc gagttacttt tagcttatgc atgctacttg 4440ggtgtaatat
tgggatctgt tcggaaatca acggatgctc aaccgatttc gacagtaatt
4500aattaattcc ctagtcccag tgtacacccg ccgatatcgc ttaccctgca
gccggattaa 4560ggttggcaat ttttcacgtc cttgtctccg caattactca
ccgggtggtt tataagattg 4620caagcgtctt gatttgtctc tgtatactaa
catgcaatcg cgactcgccc gacgggccac 4680taacctggcc agaatctcca
gatccaagta ttctcttggt ctgcgatatg tttccaacac 4740aaaagcccct
gctgcccagc cggcaactgc tgagtgagta ttccttgcca taaacgaccc
4800agaaccactg tatagtgttt ggaagcacta gtcagaagac cagcgaaaac
aggtggaaaa 4860aactgagacg aaaagcaacg accagaaatg taatgtgtgg
aaaagcgaca cacacagagc 4920agataaagag gtgacaaata acgacaaatg
aaatatcagt atcttcccac aatcactacc 4980tctcagctgt ctgaaggtgc
ggctgatata tccatcccac gtctaacgta tggagtgtga 5040tagaatatga
cgacacaagc atgagaactc gctctctatc caaccaccga aacactgtca
5100ctacagccgt tcttgttgct ccattcgctt ttgtgattcc atgccttctc
tggtgactga 5160caacattcct tccttttctc cagccctgtt gttatctgct
catgacctac ggccactctc 5220tatcgcatac taacatagac gatcccagcc
cgctccccac ttccagggca ccgttggcaa 5280gcctcctatc ctcaagaagg
ctgaggctgc caacgctgac atggacgagt ccttcatcgg 5340aatgtctgga
ggagagatct tccacgagat gatgctgcga cacaacgtcg acactgtctt
5400cggttacccc ggtggagcca ttctccccgt ctttgacgcc attcacaact
ctgagtactt 5460caactttgtg ctccctcgac acgagcaggg tgccggccac
atggccgagg gctacgctcg 5520agcctctggt aagcccggtg tcgttctcgt
cacctctggc cccggtgcca ccaacgtcat 5580cacccccatg caggacgctc
tttccgatgg tacccccatg gttgtcttca ccggtcaggt 5640cctgacctcc
gttatcggca ctgacgcctt ccaggaggcc gatgttgtcg gcatctcccg
5700atcttgcacc aagtggaacg tcatggtcaa gaacgttgct gagctccccc
gacgaatcaa 5760cgaggccttt gagattgcta cttccggccg acccggtccc
gttctcgtcg atctgcccaa 5820ggatgttact gctgccatcc tgcgagagcc
catccccacc aagtccacca ttccctcgca 5880ttctctgacc aacctcacct
ctgccgccgc caccgagttc cagaagcagg ctatccagcg 5940agccgccaac
ctcatcaacc agtccaagaa gcccgtcctt tacgtcggac agggtatcct
6000tggctccgag gagggtccta agctgcttaa ggagctggct gagaaggccg
agattcccgt 6060caccactact ctgcagggtc ttggtgcctt tgacgagcga
gaccccaagt ctctgcacat 6120gctcggtatg cacggttccg gctacgccaa
catggccatg cagaacgctg actgtatcat 6180tgctctcggc gcccgatttg
atgaccgagt taccggctcc atccccaagt ttgcccccga 6240ggctcgagcc
gctgcccttg agggtcgagg tggtattgtt cactttgaga tccaggccaa
6300gaacatcaac aaggttgttc aggccaccga agccgttgag ggagacgtta
ccgagtctgt 6360ccgacagctc atccccctca tcaacaaggt ctctgccgct
gagcgagctc cctggactga 6420gactatccag tcctggaagc agcagttccc
cttcctcttc gaggctgaag gtgaggatgg 6480tgttatcaag ccccagtccg
tcattgctct gctctctgac ctgacagaga acaacaagga 6540caagaccatc
atcaccaccg gtgttggtca gcatcagatg tggactgccc agcatttccg
6600atggcgacac cctcgaacca tgatcacttc tggtggtctt ggaactatgg
gttacggcct 6660gcccgccgct atcggcgcca aggttgcccg acctgactgc
gacgtcattg acatcgatgg 6720tgacgcttct ttcaacatga ctctgaccga
gctgtccacc gccgttcagt tcaacattgg 6780cgtcaaggct attgtcctca
acaacgagga acagggtatg gtcacccagc tgcagtctct 6840cttctacgag
aaccgatact gccacactca tcagaagaac cccgacttca tgaagctggc
6900cgagtccatg ggcatgaagg gtatccgaat cactcacatt gaccagctgg
aggccggtct 6960caaggagatg ctcgcataca agggccctgt gctcgttgag
gttgttgtcg acaagaagat 7020ccccgttctt cccatggttc ccgctggtaa
ggctttgcat gagttccttg tctacgacgc 7080tgacgccgag gctgcttctc
gacccgatcg actgaagaat gcccccgccc ctcacgtcca 7140ccagaccacc
tttgagaact aagtggaaag gaacacaagc aatccgaacc aaaaataatt
7200ggggtcccgt gcccacagag tctagtgcag acctaaaatg accacagtaa
attatagctg 7260ttattaaaca tgagattttg accaacaaga gcgtaggaat
gttattagct actacttgta 7320catacacagc atttgtttta aataatgttg
cctccagggg cagtgagatc aggacccaga 7380tccgtggcca gctctctgac
ttcagaccgc ttgtacttaa gcagctcgca acactgttgt 7440cgaggattga
acttgccata ttcgattttg tggtcatgaa tccagcacac ctcatttaaa
7500tgtagctaac ggtagcaggc gaactactgg tacatacctc ccccggaata
tgtacaggca 7560taatgcgtat ctgtgggaca tgtggtcgtt gcgccattat
gtaagcagcg tgtactcctc 7620tgactgtcca tatggtttgc tccatctcac
cctcatcgtt ttcattgttc acaggcggcc 7680acaaaaaaac tgtcttctct
ccttctctct tcgccttagt ctactcggac cagttttagt 7740ttagcttggc
gccactggat aaatgagacc tcaggccttg tgatgaggag gtcacttatg
7800aagcatgtta ggaggtgctt gtatggatag agaagcaccc aaaataataa
gaataataat 7860aaaacagggg gcgttgtcat ttcatatcgt gttttcacca
tcaatacacc tccaaacaat 7920gcccttcatg tggccagccc caatattgtc
ctgtagttca actctatgca gctcgtatct 7980tattgagcaa gtaaaactct
gtcagccgat attgcccgac ccgcgacaag ggtcaacaag 8040gtggtgtaag
gccttcgcag aagtcaaaac tgtgccaaac aaacatctag agtctctttg
8100gtgtttctcg catatatttw atcggctgtc ttacgtattt gcgcctcggt
accggactaa 8160tttcggatca tccccaatac gctttttctt cgcagctgtc
aacagtgtcc atgatctatc 8220cacctaaatg ggtcatatga ggcgtataat
ttcgtggtgc tgataataat tcccatatat 8280ttgacacaaa acttcccccc
ctagacatac atctcacaat ctcacttctt gtgcttctgt 8340cacacatctc
ctccagctga cttcaactca cacctctgcc ccagttggtc tacagcggta
8400taaggtttct ccgcatagag gtgcaccact cctcccgata cttgtttgtg
tgacttgtgg 8460gtcacgacat atatatctac acacattgcg ccaccctttg
gttcttccag cacaacaaaa 8520acacgacacg ctaaccatgg ccaatttact
gaccgtacac caaaatttgc ctgcattacc 8580ggtcgatgca acgagtgatg
aggttcgcaa gaacctgatg gacatgttca gggatcgcca 8640ggcgttttct
gagcatacct ggaaaatgct tctgtccgtt tgccggtcgt gggcggcatg
8700gtgcaagttg aataaccgga aatggtttcc cgcagaacct gaagatgttc
gcgattatct 8760tctatatctt caggcgcgcg gtctggcagt aaaaactatc
cagcaacatt tgggccagct 8820aaacatgctt catcgtcggt ccgggctgcc
acgaccaagt gacagcaatg ctgtttcact 8880ggttatgcgg cggatccgaa
aagaaaacgt tgatgccggt gaacgtgcaa aacaggctct 8940agcgttcgaa
cgcactgatt tcgaccaggt tcgttcactc atggaaaata gcgatcgctg
9000ccaggatata cgtaatctgg catttctggg gattgcttat aacaccctgt
tacgtatagc 9060cgaaattgcc aggatcaggg ttaaagatat ctcacgtact
gacggtggga gaatgttaat 9120ccatattggc agaacgaaaa cgctggttag
caccgcaggt gtagagaagg cacttagcct 9180gggggtaact aaactggtcg
agcgatggat ttccgtctct ggtgtagctg atgatccgaa 9240taactacctg
ttttgccggg tcagaaaaaa tggtgttgcc gcgccatctg ccaccagcca
9300gctatcaact cgcgccctgg aagggatttt tgaagcaact catcgattga
tttacggcgc 9360taaggatgac tctggtcaga gatacctggc ctggtctgga
cacagtgccc gtgtcggagc 9420cgcgcgagat atggcccgcg ctggagtttc
aataccggag atcatgcaag ctggtggctg 9480gaccaatgta aatattgtca
tgaactatat ccgtaacctg gatagtgaaa caggggcaat 9540ggtgcgcctg
ctggaagatg gcgattaagc 95703015743DNAArtificial SequencePlasmid
pZP2-2988 30ggccgcatgt acatacaaga ttatttatag aaatgaatcg cgatcgaaca
aagagtacga 60gtgtacgagt aggggatgat gataaaagtg gaagaagttc cgcatctttg
gatttatcaa 120cgtgtaggac gatacttcct gtaaaaatgc aatgtcttta
ccataggttc tgctgtagat 180gttattaact accattaaca tgtctacttg
tacagttgca gaccagttgg agtatagaat 240ggtacactta ccaaaaagtg
ttgatggttg taactacgat atataaaact gttgacggga 300tctgtatatt
cggtaagata tattttgtgg ggttttagtg gtgtttaaac agtgtacgca
360gtactataga ggaacaattg ccccggagaa gacggccagg ccgcctagat
gacaaattca 420acaactcaca gctgactttc tgccattgcc actagggggg
ggccttttta tatggccaag 480ccaagctctc cacgtcggtt gggctgcacc
caacaataaa tgggtagggt tgcaccaaca 540aagggatggg atggggggta
gaagatacga ggataacggg gctcaatggc acaaataaga 600acgaatactg
ccattaagac tcgtgatcca gcgactgaca ccattgcatc atctaagggc
660ctcaaaacta cctcggaact gctgcgctga tctggacacc acagaggttc
cgagcacttt 720aggttgcacc aaatgtccca ccaggtgcag gcagaaaacg
ctggaacagc gtgtacagtt 780tgtcttaaca aaaagtgagg gcgctgaggt
cgagcagggt ggtgtgactt gttatagcct 840ttagagctgc gaaagcgcgt
atggatttgg ctcatcaggc cagattgagg gtctgtggac 900acatgtcatg
ttagtgtact tcaatcgccc cctggatata gccccgacaa taggccgtgg
960cctcattttt ttgccttccg cacatttcca ttgctcggta cccacacctt
gcttctcctg 1020cacttgccaa ccttaatact ggtttacatt gaccaacatc
ttacaagcgg ggggcttgtc 1080tagggtatat ataaacagtg gctctcccaa
tcggttgcca gtctcttttt tcctttcttt 1140ccccacagat tcgaaatcta
aactacacat cacaccatgg aggtcgtgaa cgaaatcgtc 1200tccattggcc
aggaggttct tcccaaggtc gactatgctc agctctggtc tgatgcctcg
1260cactgcgagg tgctgtacct ctccatcgcc ttcgtcatcc tgaagttcac
ccttggtcct 1320ctcggaccca agggtcagtc tcgaatgaag tttgtgttca
ccaactacaa cctgctcatg 1380tccatctact cgctgggctc cttcctctct
atggcctacg ccatgtacac cattggtgtc 1440atgtccgaca actgcgagaa
ggctttcgac aacaatgtct tccgaatcac cactcagctg 1500ttctacctca
gcaagttcct cgagtacatt gactccttct atctgcccct catgggcaag
1560cctctgacct ggttgcagtt ctttcaccat ctcggagctc ctatggacat
gtggctgttc 1620tacaactacc gaaacgaagc cgtttggatc tttgtgctgc
tcaacggctt cattcactgg 1680atcatgtacg gctactattg gacccgactg
atcaagctca agttccctat gcccaagtcc 1740ctgattactt ctatgcagat
cattcagttc aacgttggct tctacatcgt ctggaagtac 1800cggaacattc
cctgctaccg acaagatgga atgagaatgt ttggctggtt tttcaactac
1860ttctacgttg gtactgtcct gtgtctgttc ctcaacttct acgtgcagac
ctacatcgtc 1920cgaaagcaca agggagccaa aaagattcag tgagcggccg
caagtgtgga tggggaagtg 1980agtgcccggt tctgtgtgca caattggcaa
tccaagatgg atggattcaa cacagggata 2040tagcgagcta cgtggtggtg
cgaggatata gcaacggata tttatgtttg acacttgaga 2100atgtacgata
caagcactgt ccaagtacaa tactaaacat actgtacata ctcatactcg
2160tacccgggca acggtttcac ttgagtgcag tggctagtgc tcttactcgt
acagtgtgca 2220atactgcgta tcatagtctt tgatgtatat cgtattcatt
catgttagtt gcgtacgggc 2280gtcgttgctt gtgtgatttt tgaggaccca
tccctttggt atataagtat actctggggt 2340taaggttgcc cgtgtagtct
aggttatagt tttcatgtga aataccgaga gccgagggag 2400aataaacggg
ggtatttgga cttgtttttt tcgcggaaaa gcgtcgaatc aaccctgcgg
2460gccttgcacc atgtccacga cgtgtttctc gccccaattc gccccttgca
cgtcaaaatt 2520aggcctccat ctagacccct ccataacatg tgactgtggg
gaaaagtata agggaaacca 2580tgcaaccata gacgacgtga aagacgggga
ggaaccaatg gaggccaaag aaatggggta 2640gcaacagtcc aggagacaga
caaggagaca aggagagggc gcccgaaaga tcggaaaaac 2700aaacatgtcc
aattggggca gtgacggaaa cgacacggac acttcagtac aatggaccga
2760ccatctccaa gccagggtta ttccggtatc accttggccg taacctcccg
ctggtacctg 2820atattgtaca cgttcacatt caatatactt tcagctacaa
taagagaggc tgtttgtcgg 2880gcatgtgtgt ccgtcgtatg gggtgatgtc
cgagggcgaa attcgctaca agcttaactc 2940tggcgcttgt ccagtatgaa
tagacaagtc aagaccagtg gtgccatgat tgacagggag 3000gtacaagact
tcgatactcg agcattactc ggacttgtgg cgattgaaca gacgggcgat
3060cgcttctccc ccgtattgcc ggcgcgccag ctgcattaat gaatcggcca
acgcgcgggg 3120agaggcggtt tgcgtattgg gcgctcttcc gcttcctcgc
tcactgactc gctgcgctcg 3180gtcgttcggc tgcggcgagc ggtatcagct
cactcaaagg cggtaatacg gttatccaca 3240gaatcagggg ataacgcagg
aaagaacatg tgagcaaaag gccagcaaaa ggccaggaac 3300cgtaaaaagg
ccgcgttgct ggcgtttttc cataggctcc gcccccctga cgagcatcac
3360aaaaatcgac gctcaagtca gaggtggcga aacccgacag gactataaag
ataccaggcg 3420tttccccctg gaagctccct cgtgcgctct cctgttccga
ccctgccgct taccggatac 3480ctgtccgcct ttctcccttc gggaagcgtg
gcgctttctc atagctcacg ctgtaggtat 3540ctcagttcgg tgtaggtcgt
tcgctccaag ctgggctgtg tgcacgaacc ccccgttcag 3600cccgaccgct
gcgccttatc cggtaactat cgtcttgagt ccaacccggt aagacacgac
3660ttatcgccac tggcagcagc cactggtaac aggattagca gagcgaggta
tgtaggcggt 3720gctacagagt tcttgaagtg gtggcctaac tacggctaca
ctagaagaac agtatttggt 3780atctgcgctc tgctgaagcc agttaccttc
ggaaaaagag ttggtagctc ttgatccggc 3840aaacaaacca ccgctggtag
cggtggtttt tttgtttgca agcagcagat tacgcgcaga 3900aaaaaaggat
ctcaagaaga tcctttgatc ttttctacgg ggtctgacgc tcagtggaac
3960gaaaactcac gttaagggat tttggtcatg agattatcaa aaaggatctt
cacctagatc 4020cttttaaatt aaaaatgaag ttttaaatca atctaaagta
tatatgagta aacttggtct 4080gacagttacc aatgcttaat cagtgaggca
cctatctcag cgatctgtct atttcgttca 4140tccatagttg cctgactccc
cgtcgtgtag ataactacga tacgggaggg cttaccatct 4200ggccccagtg
ctgcaatgat accgcgagac ccacgctcac cggctccaga tttatcagca
4260ataaaccagc cagccggaag ggccgagcgc agaagtggtc ctgcaacttt
atccgcctcc 4320atccagtcta ttaattgttg ccgggaagct agagtaagta
gttcgccagt taatagtttg 4380cgcaacgttg ttgccattgc tacaggcatc
gtggtgtcac gctcgtcgtt tggtatggct 4440tcattcagct ccggttccca
acgatcaagg cgagttacat gatcccccat gttgtgcaaa 4500aaagcggtta
gctccttcgg tcctccgatc gttgtcagaa gtaagttggc cgcagtgtta
4560tcactcatgg ttatggcagc actgcataat tctcttactg tcatgccatc
cgtaagatgc 4620ttttctgtga ctggtgagta ctcaaccaag tcattctgag
aatagtgtat gcggcgaccg 4680agttgctctt gcccggcgtc aatacgggat
aataccgcgc cacatagcag aactttaaaa 4740gtgctcatca ttggaaaacg
ttcttcgggg cgaaaactct caaggatctt accgctgttg 4800agatccagtt
cgatgtaacc cactcgtgca cccaactgat cttcagcatc ttttactttc
4860accagcgttt ctgggtgagc aaaaacagga aggcaaaatg ccgcaaaaaa
gggaataagg 4920gcgacacgga aatgttgaat actcatactc ttcctttttc
aatattattg aagcatttat 4980cagggttatt gtctcatgag cggatacata
tttgaatgta tttagaaaaa taaacaaata 5040ggggttccgc gcacatttcc
ccgaaaagtg ccacctgatg cggtgtgaaa taccgcacag 5100atgcgtaagg
agaaaatacc gcatcaggaa attgtaagcg ttaatatttt gttaaaattc
5160gcgttaaatt tttgttaaat cagctcattt tttaaccaat aggccgaaat
cggcaaaatc 5220ccttataaat caaaagaata gaccgagata gggttgagtg
ttgttccagt ttggaacaag 5280agtccactat taaagaacgt ggactccaac
gtcaaagggc gaaaaaccgt ctatcagggc 5340gatggcccac tacgtgaacc
atcaccctaa tcaagttttt tggggtcgag gtgccgtaaa 5400gcactaaatc
ggaaccctaa agggagcccc cgatttagag cttgacgggg aaagccggcg
5460aacgtggcga gaaaggaagg gaagaaagcg aaaggagcgg gcgctagggc
gctggcaagt 5520gtagcggtca cgctgcgcgt aaccaccaca cccgccgcgc
ttaatgcgcc gctacagggc 5580gcgtccattc gccattcagg ctgcgcaact
gttgggaagg gcgatcggtg cgggcctctt 5640cgctattacg ccagctggcg
aaagggggat gtgctgcaag gcgattaagt tgggtaacgc 5700cagggttttc
ccagtcacga cgttgtaaaa cgacggccag tgaattgtaa tacgactcac
5760tatagggcga attgggcccg acgtcgcatg cgctgatgac actttggtct
gaaagagatg 5820cattttgaat cccaaacttg cagtgcccaa gtgacataca
tctccgcgtt ttggaaaatg 5880ttcagaaaca gttgattgtg ttggaatggg
gaatggggaa tggaaaaatg actcaagtat 5940caattccaaa aacttctctg
gctggcagta cctactgtcc atactactgc attttctcca 6000gtcaggccac
tctatactcg acgacacagt agtaaaaccc agataatttc gacataaaca
6060agaaaacaga cccaataata tttatatata gtcagccgtt tgtccagttc
agactgtaat 6120agccgaaaaa aaatccaaag tttctattct aggaaaatat
attccaatat ttttaattct 6180taatctcatt tattttattc tagcgaaata
catttcagct acttgagaca tgtgataccc 6240acaaatcgga ttcggactcg
gttgttcaga agagcatatg gcattcgtgc tcgcttgttc 6300acgtattctt
cctgttccat ctcttggccg acaatcacac aaaaatgggg tttttttttt
6360aattctaatg attcattaca gcaaaattga gatatagcag accacgtatt
ccataatcac 6420caaggaagtt cttgggcgtc ttaattaact cacctgcagg
attgagacta tgaatggatt 6480cccgtgcccg tattactcta ctaatttgat
cttggaacgc gaaaatacgt ttctaggact 6540ccaaagaatc tcaactcttg
tccttactaa atatactacc catagttgat ggtttacttg 6600aacagagagg
acatgttcac ttgacccaaa gtttctcgca tctcttggat atttgaacaa
6660cggcgtccac tgaccgtcag ttatccagtc acaaaacccc cacattcata
cattcccatg 6720tacgtttaca aagttctcaa ttccatcgtg caaatcaaaa
tcacatctat tcattcatca 6780tatataaacc catcatgtct actaacactc
acaactccat agaaaacatc gactcagaac 6840acacgctcca tgcggccgct
tactgagcct tggcaccggg ctgcttctcg gccattcgag 6900cgaactggga
caggtatcgg agcaggatga cgagaccttc atggggcaga gggtttcggt
6960aggggaggtt gtgcttctgg cacagctgtt ccacctggta ggaaacggca
gtgaggttgt 7020gtcgaggcag ggtgggccag agatggtgct cgatctggta
gttcaggcct ccaaagaacc 7080agtcagtaat gatgcctcgt cgaatgttca
tggtctcatg gatctgaccc acagagaagc 7140catgtccgtc ccagacggaa
tcaccgatct tctccagagg gtagtggttc atgaagacca 7200cgatggcaat
tccgaagcca ccgacgagct cggaaacaaa gaacaccagc atcgaggtca
7260ggatggaggg cataaagaag aggtggaaca gggtcttgag agtccagtgc
agagcgagtc 7320caatggcctc tttcttgtac tgagatcggt agaactggtt
gtctcggtcc ttgagggatc 7380gaacggtcag cacagactgg aaacaccaga
tgaatcgcag gagaatacag atgaccagga 7440aatagtactg ttggaactga
atgagctttc gggagatggg agaagctcga gtgacatcgt 7500cctcggacca
ggcgagcaga ggcaggttat caatgtcggg atcgtgaccc tgaacgttgg
7560tagcagaatg atgggcgttg tgtctgtcct tccaccaggt cacggagaag
ccctggagtc 7620cgttgccaaa gaccagaccc aggacgttat tccagtttcg
gttcttgaag gtctggtggt 7680ggcagatgtc atgagacagc catcccattt
gctggtagtg cataccgagc acgagagcac 7740caatgaagta caggtggtac
tggaccagca tgaagaaggc aagcacgcca agacccaggg 7800tggtcaagat
cttgtacgag taccagaggg gagaggcgtc aaacatgcca gtggcgatca
7860gctcttctcg gagctttcgg aaatcctcct gagcttcgtt gacggcagcc
tggggaggca 7920gctcggaagc ctggttgatc ttgggcattc gcttgagctt
gtcgaaggct tcctgagagt 7980gcataaccat gaaggcgtca gtagcatctc
gtccctggta
gttctcaatg atttcagctc 8040caccagggtg gaagttcacc caagcggaga
cgtcgtacac ctttccgtcg atgacgaggg 8100gcagagcctg tcgagaagcc
ttcaccatgg ttgtgaatta gggtggtgag aatggttggt 8160tgtagggaag
aatcaaaggc cggtctcggg atccgtgggt atatatatat atatatatat
8220atacgatcct tcgttacctc cctgttctca aaactgtggt ttttcgtttt
tcgttttttg 8280ctttttttga tttttttagg gccaactaag cttccagatt
tcgctaatca cctttgtact 8340aattacaaga aaggaagaag ctgattagag
ttgggctttt tatgcaactg tgctactcct 8400tatctctgat atgaaagtgt
agacccaatc acatcatgtc atttagagtt ggtaatactg 8460ggaggataga
taaggcacga aaacgagcca tagcagacat gctgggtgta gccaagcaga
8520agaaagtaga tgggagccaa ttgacgagcg agggagctac gccaatccga
catacgacac 8580gctgagatcg tcttggccgg ggggtaccta cagatgtcca
agggtaagtg cttgactgta 8640attgtatgtc tgaggacaaa tatgtagtca
gccgtataaa gtcataccag gcaccagtgc 8700catcatcgaa ccactaactc
tctatgatac atgcctccgg tattattgta ccatgcgtcg 8760ctttgttaca
tacgtatctt gcctttttct ctcagaaact ccagactttg gctattggtc
8820gagataagcc cggaccatag tgagtctttc acactctaca tttctccctt
gctccaacta 8880tttaaattcc ttcacttcaa gttcattctt catctgcttc
tgttttactt tgacaggcaa 8940atgaagacat ggtacgactt gatggaggcc
aagaacgcca tttcaccccg agacaccgaa 9000gtgcctgaaa tcctggctgc
ccccattgat aacatcggaa actacggtat tccggaaagt 9060gtatatagaa
cctttcccca gcttgtgtct gtggatatgg atggtgtaat cccctttgag
9120tactcgtctt ggcttctctc cgagcagtat gaggctctct aatctagcgc
atttaatatc 9180tcaatgtatt tatatattta tcttctcatg cggccgctta
ctgagccttg gcaccgggct 9240gcttctcggc cattcgagcg aactgggaca
ggtatcggag caggatgacg agaccttcat 9300ggggcagagg gtttcggtag
gggaggttgt gcttctggca cagctgttcc acctggtagg 9360aaacggcagt
gaggttgtgt cgaggcaggg tgggccagag atggtgctcg atctggtagt
9420tcaggcctcc aaagaaccag tcagtaatga tgcctcgtcg aatgttcatg
gtctcatgga 9480tctgacccac agagaagcca tgtccgtccc agacggaatc
accgatcttc tccagagggt 9540agtggttcat gaagaccacg atggcaattc
cgaagccacc gacgagctcg gaaacaaaga 9600acaccagcat cgaggtcagg
atggagggca taaagaagag gtggaacagg gtcttgagag 9660tccagtgcag
agcgagtcca atggcctctt tcttgtactg agatcggtag aactggttgt
9720ctcggtcctt gagggatcga acggtcagca cagactggaa acaccagatg
aatcgcagga 9780gaatacagat gaccaggaaa tagtactgtt ggaactgaat
gagctttcgg gagatgggag 9840aagctcgagt gacatcgtcc tcggaccagg
cgagcagagg caggttatca atgtcgggat 9900cgtgaccctg aacgttggta
gcagaatgat gggcgttgtg tctgtccttc caccaggtca 9960cggagaagcc
ctggagtccg ttgccaaaga ccagacccag gacgttattc cagtttcggt
10020tcttgaaggt ctggtggtgg cagatgtcat gagacagcca tcccatttgc
tggtagtgca 10080taccgagcac gagagcacca atgaagtaca ggtggtactg
gaccagcatg aagaaggcaa 10140gcacgccaag acccagggtg gtcaagatct
tgtacgagta ccagagggga gaggcgtcaa 10200acatgccagt ggcgatcagc
tcttctcgga gctttcggaa atcctcctga gcttcgttga 10260cggcagcctg
gggaggcagc tcggaagcct ggttgatctt gggcattcgc ttgagcttgt
10320cgaaggcttc ctgagagtgc ataaccatga aggcgtcagt agcatctcgt
ccctggtagt 10380tctcaatgat ttcagctcca ccagggtgga agttcaccca
agcggagacg tcgtacacct 10440ttccgtcgat gacgaggggc agagcctgtc
gagaagcctt caccatgggc aggacctgtg 10500ttagtacatt gtcggggagt
catcaattgg ttcgacaggt tgtcgactgt tagtatgagc 10560tcaattgggc
tctggtgggt cgatgacact tgtcatctgt ttctgttggg tcatgtttcc
10620atcaccttct atggtactca caattcgtcc gattcgcccg aatccgttaa
taccgacttt 10680gatggccatg ttgatgtgtg tttaattcaa gaatgaatat
agagaagaga agaagaaaaa 10740agattcaatt gagccggcga tgcagaccct
tatataaatg ttgccttgga cagacggagc 10800aagcccgccc aaacctacgt
tcggtataat atgttaagct ttttaacaca aaggtttggc 10860ttggggtaac
ctgatgtggt gcaaaagacc gggcgttggc gagccattgc gcgggcgaat
10920ggggccgtga ctcgtctcaa attcgagggc gtgcctcaat tcgtgccccc
gtggcttttt 10980cccgccgttt ccgccccgtt tgcaccactg cagccgcttc
tttggttcgg acaccttgct 11040gcgagctagg tgccttgtgc tacttaaaaa
gtggcctccc aacaccaaca tgacatgagt 11100gcgtgggcca agacacgttg
gcggggtcgc agtcggctca atggcccgga aaaaacgctg 11160ctggagctgg
ttcggacgca gtccgccgcg gcgtatggat atccgcaagg ttccatagcg
11220ccattgccct ccgtcggcgt ctatcccgca acctctaaat agagcgggaa
tataacccaa 11280gcttcttttt tttcctttaa cacgcacacc cccaactatc
atgttgctgc tgctgtttga 11340ctctactctg tggaggggtg ctcccaccca
acccaaccta caggtggatc cggcgctgtg 11400attggctgat aagtctccta
tccggactaa ttctgaccaa tgggacatgc gcgcaggacc 11460caaatgccgc
aattacgtaa ccccaacgaa atgcctaccc ctctttggag cccagcggcc
11520ccaaatcccc ccaagcagcc cggttctacc ggcttccatc tccaagcaca
agcagcccgg 11580aattccttta cctgcaggat aacttcgtat aatgtatgct
atacgaagtt atgatctctc 11640tcttgagctt ttccataaca agttcttctg
cctccaggaa gtccatgggt ggtttgatca 11700tggttttggt gtagtggtag
tgcagtggtg gtattgtgac tggggatgta gttgagaata 11760agtcatacac
aagtcagctt tcttcgagcc tcatataagt ataagtagtt caacgtatta
11820gcactgtacc cagcatctcc gtatcgagaa acacaacaac atgccccatt
ggacagatca 11880tgcggataca caggttgtgc agtatcatac atactcgatc
agacaggtcg tctgaccatc 11940atacaagctg aacaagcgct ccatacttgc
acgctctcta tatacacagt taaattacat 12000atccatagtc taacctctaa
cagttaatct tctggtaagc ctcccagcca gccttctggt 12060atcgcttggc
ctcctcaata ggatctcggt tctggccgta cagacctcgg ccgacaatta
12120tgatatccgt tccggtagac atgacatcct caacagttcg gtactgctgt
ccgagagcgt 12180ctcccttgtc gtcaagaccc accccggggg tcagaataag
ccagtcctca gagtcgccct 12240taggtcggtt ctgggcaatg aagccaacca
caaactcggg gtcggatcgg gcaagctcaa 12300tggtctgctt ggagtactcg
ccagtggcca gagagccctt gcaagacagc tcggccagca 12360tgagcagacc
tctggccagc ttctcgttgg gagaggggac taggaactcc ttgtactggg
12420agttctcgta gtcagagacg tcctccttct tctgttcaga gacagtttcc
tcggcaccag 12480ctcgcaggcc agcaatgatt ccggttccgg gtacaccgtg
ggcgttggtg atatcggacc 12540actcggcgat tcggtgacac cggtactggt
gcttgacagt gttgccaata tctgcgaact 12600ttctgtcctc gaacaggaag
aaaccgtgct taagagcaag ttccttgagg gggagcacag 12660tgccggcgta
ggtgaagtcg tcaatgatgt cgatatgggt tttgatcatg cacacataag
12720gtccgacctt atcggcaagc tcaatgagct ccttggtggt ggtaacatcc
agagaagcac 12780acaggttggt tttcttggct gccacgagct tgagcactcg
agcggcaaag gcggacttgt 12840ggacgttagc tcgagcttcg taggagggca
ttttggtggt gaagaggaga ctgaaataaa 12900tttagtctgc agaacttttt
atcggaacct tatctggggc agtgaagtat atgttatggt 12960aatagttacg
agttagttga acttatagat agactggact atacggctat cggtccaaat
13020tagaaagaac gtcaatggct ctctgggcgt cgcctttgcc gacaaaaatg
tgatcatgat 13080gaaagccagc aatgacgttg cagctgatat tgttgtcggc
caaccgcgcc gaaaacgcag 13140ctgtcagacc cacagcctcc aacgaagaat
gtatcgtcaa agtgatccaa gcacactcat 13200agttggagtc gtactccaaa
ggcggcaatg acgagtcaga cagatactcg tcgacgcgat 13260aacttcgtat
aatgtatgct atacgaagtt atcgtacgat agttagtaga caacaatcga
13320taacgtctcg taccaaccac agattacgac ccattcgcag tcacagttca
ctagggtttg 13380ggttgcatcc gttgagagcg gtttgttttt aaccttctcc
atgtgctcac tcaggttttg 13440ggttcagatc aaatcaaggc gtgaaccact
ttgtttgagg acaaatgtga cacaaccaac 13500cagtgtcagg ggcaagtccg
tgacaaaggg gaagatacaa tgcaattact gacagttaca 13560gactgcctcg
atgccctaac cttgccccaa aataagacaa ctgtcctcgt ttaagcgcaa
13620ccctattcag cgtcacgtca taatagcgtt tggatagcac tagtctatga
ggagcgtttt 13680atgttgcggt gagggcgatt ggtgctcata tgggttcaat
tgaggtggcg gaacgagctt 13740agtcttcaat tgaggtgcga gcgacacaat
tgggtgtcac gtggcctaat tgacctcggg 13800tcgtggagtc cccagttata
cagcaaccac gaggtgcatg ggtaggagac gtcaccagac 13860aatagggttt
tttttggact ggagagggtt gggcaaaagc gctcaacggg ctgtttgggg
13920agctgtgggg gaggaattgg cgatatttgt gaggttaacg gctccgattt
gcgtgttttg 13980tcgctcctgc atctccccat acccatatct tccctcccca
cctctttcca cgataatttt 14040acggatcagc aataaggttc cttctcctag
tttccacgtc catatatatc tatgctgcgt 14100cgtccttttc gtgacatcac
caaaacacat acaacaatgg ctgttactga cgtccttaag 14160cgaaagtccg
gtgtcatcgt cggcgacgat gtccgagccg tgagtatcca cgacaagatc
14220agtgtcgaga cgacgcgttt tgtgtaatga cacaatccga aagtcgctag
caacacacac 14280tctctacaca aactaaccca gctctccatg gcctccacct
cggctctgcc caagcagaac 14340cctgccctcc gacgaaccgt cacttccacc
actgtgaccg actcggagtc tgctgccgtc 14400tctccctccg attctcccag
acactcggcc tcctctacat cgctgtcttc catgtccgag 14460gtggacattg
ccaagcccaa gtccgagtac ggtgtcatgc tggataccta cggcaaccag
14520ttcgaagttc ccgacttcac catcaaggac atctacaacg ctattcccaa
gcactgcttc 14580aagcgatctg ctctcaaggg atacggctac attcttcgag
acattgtcct cctgactacc 14640actttcagca tctggtacaa ctttgtgaca
cccgagtaca ttccctccac tcctgctcga 14700gccggtctgt gggctgtgta
caccgttctt cagggactct tcggtactgg actgtgggtc 14760attgcccacg
agtgtggaca tggtgctttc tccgattccc gaatcatcaa cgacattact
14820ggctgggtgc ttcactcttc cctgcttgtt ccctacttca gctggcaaat
ctcccaccgg 14880aagcatcaca aggccactgg aaacatggag cgagacatgg
tcttcgttcc tcgaacccga 14940gagcagcaag ctactcgact cggcaagatg
acccacgaac tcgcccatct taccgaggaa 15000actcctgctt tcaccctgct
catgcttgtg cttcagcaac tggtcggttg gcccaactat 15060ctcattacca
acgttactgg acacaactac catgagcggc agcgagaggg tcgaggcaag
15120ggaaagcaca acggtcttgg cggtggagtt aaccatttcg atccccgatc
tcctctgtac 15180gagaacagcg acgccaagct catcgtgctc tccgacattg
gcattggtct tatggccacc 15240gctctgtact ttctcgttca gaagttcgga
ttctacaaca tggccatctg gtacttcgtt 15300ccctacttgt gggttaacca
ctggctcgtc gccattacct ttctgcagca cacagatcct 15360actcttcccc
actacaccaa cgacgagtgg aactttgtgc gaggtgccgc tgcaaccatc
15420gaccgagaga tgggcttcat tggacgtcat ctgctccacg gcattatcga
gactcacgtc 15480ctgcatcact acgtctcttc cattcccttc tacaatgcgg
acgaagctac cgaggccatc 15540aaacctatca tgggcaagca ctatcgagct
gatgtccagg acggtcctcg aggattcatt 15600cgagccatgt accgatctgc
acgaatgtgc cagtgggttg aaccctccgc tggtgccgag 15660ggagctggca
agggtgtcct gttctttcga aaccgaaaca atgtgggcac tcctcccgct
15720gtcatcaagc ccgttgccta agc 15743316303DNAArtificial
SequencePlasmid pZKUE3S 31ggccgcaagt gtggatgggg aagtgagtgc
ccggttctgt gtgcacaatt ggcaatccaa 60gatggatgga ttcaacacag ggatatagcg
agctacgtgg tggtgcgagg atatagcaac 120ggatatttat gtttgacact
tgagaatgta cgatacaagc actgtccaag tacaatacta 180aacatactgt
acatactcat actcgtaccc gggcaacggt ttcacttgag tgcagtggct
240agtgctctta ctcgtacagt gtgcaatact gcgtatcata gtctttgatg
tatatcgtat 300tcattcatgt tagttgcgta cgaggaaact gtctctgaac
agaagaagga ggacgtctct 360gactacgaga actcccagta caaggagttc
ctagtcccct ctcccaacga gaagctggcc 420agaggtctgc tcatgctggc
cgagctgtct tgcaagggct ctctggccac tggcgagtac 480tccaagcaga
ccattgagct tgcccgatcc gaccccgagt ttgtggttgg cttcattgcc
540cagaaccgac ctaagggcga ctctgaggac tggcttattc tgacccccgg
ggtgggtctt 600gacgacaagg gagacgctct cggacagcag taccgaactg
ttgaggatgt catgtctacc 660ggaacggata tcataattgt cggccgaggt
ctgtacggcc agaaccgaga tcctattgag 720gaggccaagc gataccagaa
ggctggctgg gaggcttacc agaagattaa ctgttagagg 780ttagactatg
gatatgtaat ttaactgtgt atatagagag cgtgcaagta tggagcgctt
840gttcagcttg tatgatggtc agacgacctg tctgatcgag tatgtatgat
actgcacaac 900ctgtgtatcc gcatgatctg tccaatgggg catgttgttg
tgtttctcga tacggagatg 960ctgggtacag tgctaatacg ttgaactact
tatacttata tgaggctcga agaaagctga 1020cttgtgtatg acttaattaa
tcgagcttgg cgtaatcatg gtcatagctg tttcctgtgt 1080gaaattgtta
tccgctcaca attccacaca acatacgagc cggaagcata aagtgtaaag
1140cctggggtgc ctaatgagtg agctaactca cattaattgc gttgcgctca
ctgcccgctt 1200tccagtcggg aaacctgtcg tgccagctgc attaatgaat
cggccaacgc gcggggagag 1260gcggtttgcg tattgggcgc tcttccgctt
cctcgctcac tgactcgctg cgctcggtcg 1320ttcggctgcg gcgagcggta
tcagctcact caaaggcggt aatacggtta tccacagaat 1380caggggataa
cgcaggaaag aacatgtgag caaaaggcca gcaaaaggcc aggaaccgta
1440aaaaggccgc gttgctggcg tttttccata ggctccgccc ccctgacgag
catcacaaaa 1500atcgacgctc aagtcagagg tggcgaaacc cgacaggact
ataaagatac caggcgtttc 1560cccctggaag ctccctcgtg cgctctcctg
ttccgaccct gccgcttacc ggatacctgt 1620ccgcctttct cccttcggga
agcgtggcgc tttctcatag ctcacgctgt aggtatctca 1680gttcggtgta
ggtcgttcgc tccaagctgg gctgtgtgca cgaacccccc gttcagcccg
1740accgctgcgc cttatccggt aactatcgtc ttgagtccaa cccggtaaga
cacgacttat 1800cgccactggc agcagccact ggtaacagga ttagcagagc
gaggtatgta ggcggtgcta 1860cagagttctt gaagtggtgg cctaactacg
gctacactag aaggacagta tttggtatct 1920gcgctctgct gaagccagtt
accttcggaa aaagagttgg tagctcttga tccggcaaac 1980aaaccaccgc
tggtagcggt ggtttttttg tttgcaagca gcagattacg cgcagaaaaa
2040aaggatctca agaagatcct ttgatctttt ctacggggtc tgacgctcag
tggaacgaaa 2100actcacgtta agggattttg gtcatgagat tatcaaaaag
gatcttcacc tagatccttt 2160taaattaaaa atgaagtttt aaatcaatct
aaagtatata tgagtaaact tggtctgaca 2220gttaccaatg cttaatcagt
gaggcaccta tctcagcgat ctgtctattt cgttcatcca 2280tagttgcctg
actccccgtc gtgtagataa ctacgatacg ggagggctta ccatctggcc
2340ccagtgctgc aatgataccg cgagacccac gctcaccggc tccagattta
tcagcaataa 2400accagccagc cggaagggcc gagcgcagaa gtggtcctgc
aactttatcc gcctccatcc 2460agtctattaa ttgttgccgg gaagctagag
taagtagttc gccagttaat agtttgcgca 2520acgttgttgc cattgctaca
ggcatcgtgg tgtcacgctc gtcgtttggt atggcttcat 2580tcagctccgg
ttcccaacga tcaaggcgag ttacatgatc ccccatgttg tgcaaaaaag
2640cggttagctc cttcggtcct ccgatcgttg tcagaagtaa gttggccgca
gtgttatcac 2700tcatggttat ggcagcactg cataattctc ttactgtcat
gccatccgta agatgctttt 2760ctgtgactgg tgagtactca accaagtcat
tctgagaata gtgtatgcgg cgaccgagtt 2820gctcttgccc ggcgtcaata
cgggataata ccgcgccaca tagcagaact ttaaaagtgc 2880tcatcattgg
aaaacgttct tcggggcgaa aactctcaag gatcttaccg ctgttgagat
2940ccagttcgat gtaacccact cgtgcaccca actgatcttc agcatctttt
actttcacca 3000gcgtttctgg gtgagcaaaa acaggaaggc aaaatgccgc
aaaaaaggga ataagggcga 3060cacggaaatg ttgaatactc atactcttcc
tttttcaata ttattgaagc atttatcagg 3120gttattgtct catgagcgga
tacatatttg aatgtattta gaaaaataaa caaatagggg 3180ttccgcgcac
atttccccga aaagtgccac ctgacgcgcc ctgtagcggc gcattaagcg
3240cggcgggtgt ggtggttacg cgcagcgtga ccgctacact tgccagcgcc
ctagcgcccg 3300ctcctttcgc tttcttccct tcctttctcg ccacgttcgc
cggctttccc cgtcaagctc 3360taaatcgggg gctcccttta gggttccgat
ttagtgcttt acggcacctc gaccccaaaa 3420aacttgatta gggtgatggt
tcacgtagtg ggccatcgcc ctgatagacg gtttttcgcc 3480ctttgacgtt
ggagtccacg ttctttaata gtggactctt gttccaaact ggaacaacac
3540tcaaccctat ctcggtctat tcttttgatt tataagggat tttgccgatt
tcggcctatt 3600ggttaaaaaa tgagctgatt taacaaaaat ttaacgcgaa
ttttaacaaa atattaacgc 3660ttacaatttc cattcgccat tcaggctgcg
caactgttgg gaagggcgat cggtgcgggc 3720ctcttcgcta ttacgccagc
tggcgaaagg gggatgtgct gcaaggcgat taagttgggt 3780aacgccaggg
ttttcccagt cacgacgttg taaaacgacg gccagtgaat tgtaatacga
3840ctcactatag ggcgaattgg gtaccgggcc ccccctcgag gtcgacgagt
atctgtctga 3900ctcgtcattg catgcctttg gagtacgact ccaactatga
gtgtgcttgg atcactttga 3960cgatacattc ttcgttggag gctgtgggtc
tgacagctgc gttttcggcg cggttggccg 4020acaacaatat cagctgcaac
gtcattgctg gctttcatca tgatcacatt tttgtcggca 4080aaggcgacgc
ccagagagcc attgacgttc tttctaattt ggaccgatag ccgtatagtc
4140cagtctatct ataagttcaa ctaactcgta actattacca taacatatac
ttcactgccc 4200cagataaggt tccgataaaa agttctgcag actaaattta
tttcagtctc ctcttcacca 4260ccaaaatgcc ctcctacgaa gctcgagtgc
tcaagctcgt ggcagccaag aaaaccaacc 4320tgtgtgcttc tctggatgtt
accaccacca aggagctcat tgagcttgcc gataaggtcg 4380gaccttatgt
gtgcatgatc aaaacccata tcgacatcat tgacgacttc acctacgccg
4440gcactgtgct ccccctcaag gaacttgctc ttaagcacgg tttcttcctg
ttcgaggaca 4500gaaagttcgc agatattggc aacactgtca agcaccagta
ccggtgtcac cgaatcgccg 4560agtggtccga tatcaccaac gcccacggtg
tttaaacccg gaaccggaat cgataagctt 4620gatatcgaat tcatgctgtt
catcgtggtt aatgctgctg tgtgctgtgt gtgtgtgttg 4680tttggcgctc
attgttgcgt tatgcagcgt acaccacaat attggaagct tattagcctt
4740tctatttttt cgtttgcaag gcttaacaac attgctgtgg agagggatgg
ggatatggag 4800gccgctggag ggagtcggag aggcgttttg gagcggcttg
gcctggcgcc cagctcgcga 4860aacgcaccta ggaccctttg gcacgccgaa
atgtgccact tttcagtcta gtaacgcctt 4920acctacgtca ttccatgcgt
gcatgtttgc gccttttttc ccttgccctt gatcgccaca 4980cagtacagtg
cactgtacag tggaggtttt gggggggtct tagatgggag ctaaaagcgg
5040cctagcggta cactagtggg attgtatgga gtggcatgga gcctaggtgg
agcctgacag 5100gacgcacgac cggctagccc gtgacagacg atgggtggct
cctgttgtcc accgcgtaca 5160aatgtttggg ccaaagtctt gtcagccttg
cttgcgaacc taattcccaa ttttgtcact 5220tcgcaccccc attgatcgag
ccctaacccc tgcccatcag gcaatccaat taagctcgca 5280ttgtctgcct
tgtttagttt ggctcctgcc cgtttcggcg tccacttgca caaacacaaa
5340caagcattat atataaggct cgtctctccc tcccaaccac actcactttt
ttgcccgtct 5400tcccttgcta acacaaaagt caagaacaca aacaaccacc
ccaaccccct tacacacaag 5460acatatctac accatggagt ctggacccat
gcctgctggc attcccttcc ctgagtacta 5520tgacttcttt atggactgga
agactcccct ggccatcgct gccacctaca ctgctgccgt 5580cggtctcttc
aaccccaagg ttggcaaggt ctcccgagtg gttgccaagt cggctaacgc
5640aaagcctgcc gagcgaaccc agtccggagc tgccatgact gccttcgtct
ttgtgcacaa 5700cctcattctg tgtgtctact ctggcatcac cttctactac
atgtttcctg ctatggtcaa 5760gaacttccga acccacacac tgcacgaagc
ctactgcgac acggatcagt ccctctggaa 5820caacgcactt ggctactggg
gttacctctt ctacctgtcc aagttctacg aggtcattga 5880caccatcatc
atcatcctga agggacgacg gtcctcgctg cttcagacct accaccatgc
5940tggagccatg attaccatgt ggtctggcat caactaccaa gccactccca
tttggatctt 6000tgtggtcttc aactccttca ttcacaccat catgtactgt
tactatgcct tcacctctat 6060cggattccat cctcctggca aaaagtacct
gacttcgatg cagattactc agtttctggt 6120cggtatcacc attgccgtgt
cctacctctt cgttcctggc tgcatccgaa cacccggtgc 6180tcagatggct
gtctggatca acgtcggcta cctgtttccc ttgacctatc tgttcgtgga
6240ctttgccaag cgaacctact ccaagcgatc tgccattgcc gctcagaaaa
aggctcagta 6300agc 63033221DNAArtificial SequencePrimer pZP-GW-5-1
32cgacaagatg gaatgagaat g 213322DNAArtificial SequencePrimer
pZP-GW-5-2 33ctggtttttc aactacttct ac 223421DNAArtificial
SequencePrimer pZP-GW-5-3 34gtactgtcct gtgtctgttc c
213522DNAArtificial SequencePrimer pZP-GW-5-4 35ctacatcgtc
cgaaagcaca ag 223624DNAArtificial SequencePrimer pZP-GW-3-1
36ctaccagatc gagcaccatc tctg 243721DNAArtificial SequencePrimer
pZP-GW-3-2 37ctaccaggtg gaacagctgt g 213822DNAArtificial
SequencePrimer pZP-GW-3-3 38tctgccccat gaaggtctcg tc
223922DNAArtificial SequencePrimer pZP-GW-3-4 39cctgtcccag
ttcgctcgaa tg 224044DNAArtificial SequenceGenome Walker adaptor-1
40gtaatacgac tatagggcac gcgtggtcga cggcccgggc tggt
44418DNAArtificial SequenceGenome Walker adaptor-2 41accagccc
84222DNAArtificial SequenceNested adaptor primer 42gtaatacgac
tcactatagg gc 224336DNAArtificial SequencePrimer Per10F1
43gatcaaccat ggggggaagt tcacatgcat tcgctg 364429DNAArtificial
SequencePrimer ZPGW-5-5 44gttatagttt tcatgtgaaa taccgagag
294537DNAArtificial SequencePrimer Per10R 45gatcaagcgg ccgccagacc
tcgtcattat ctgatag 37467222DNAArtificial SequencePlasmid
pFBAIn-MOD-1 46catggatcca ggcctgttaa cggccattac ggcctgcagg
atccgaaaaa acctcccaca 60cctccccctg aacctgaaac ataaaatgaa tgcaattgtt
gttgttaact tgtttattgc 120agcttataat ggttacaaat aaagcaatag
catcacaaat ttcacaaata aagcattttt 180ttcactgcat tctagttgtg
gtttgtccaa actcatcaat gtatcttatc atgtctgcgg 240ccgcaagtgt
ggatggggaa gtgagtgccc ggttctgtgt gcacaattgg caatccaaga
300tggatggatt caacacaggg atatagcgag ctacgtggtg gtgcgaggat
atagcaacgg 360atatttatgt ttgacacttg agaatgtacg atacaagcac
tgtccaagta caatactaaa 420catactgtac atactcatac tcgtacccgg
gcaacggttt cacttgagtg cagtggctag 480tgctcttact cgtacagtgt
gcaatactgc gtatcatagt ctttgatgta tatcgtattc 540attcatgtta
gttgcgtacg agccggaagc ataaagtgta aagcctgggg tgcctaatga
600gtgagctaac tcacattaat tgcgttgcgc tcactgcccg ctttccagtc
gggaaacctg 660tcgtgccagc tgcattaatg aatcggccaa cgcgcgggga
gaggcggttt gcgtattggg 720cgctcttccg cttcctcgct cactgactcg
ctgcgctcgg tcgttcggct gcggcgagcg 780gtatcagctc actcaaaggc
ggtaatacgg ttatccacag aatcagggga taacgcagga 840aagaacatgt
gagcaaaagg ccagcaaaag gccaggaacc gtaaaaaggc cgcgttgctg
900gcgtttttcc ataggctccg cccccctgac gagcatcaca aaaatcgacg
ctcaagtcag 960aggtggcgaa acccgacagg actataaaga taccaggcgt
ttccccctgg aagctccctc 1020gtgcgctctc ctgttccgac cctgccgctt
accggatacc tgtccgcctt tctcccttcg 1080ggaagcgtgg cgctttctca
tagctcacgc tgtaggtatc tcagttcggt gtaggtcgtt 1140cgctccaagc
tgggctgtgt gcacgaaccc cccgttcagc ccgaccgctg cgccttatcc
1200ggtaactatc gtcttgagtc caacccggta agacacgact tatcgccact
ggcagcagcc 1260actggtaaca ggattagcag agcgaggtat gtaggcggtg
ctacagagtt cttgaagtgg 1320tggcctaact acggctacac tagaaggaca
gtatttggta tctgcgctct gctgaagcca 1380gttaccttcg gaaaaagagt
tggtagctct tgatccggca aacaaaccac cgctggtagc 1440ggtggttttt
ttgtttgcaa gcagcagatt acgcgcagaa aaaaaggatc tcaagaagat
1500cctttgatct tttctacggg gtctgacgct cagtggaacg aaaactcacg
ttaagggatt 1560ttggtcatga gattatcaaa aaggatcttc acctagatcc
ttttaaatta aaaatgaagt 1620tttaaatcaa tctaaagtat atatgagtaa
acttggtctg acagttacca atgcttaatc 1680agtgaggcac ctatctcagc
gatctgtcta tttcgttcat ccatagttgc ctgactcccc 1740gtcgtgtaga
taactacgat acgggagggc ttaccatctg gccccagtgc tgcaatgata
1800ccgcgagacc cacgctcacc ggctccagat ttatcagcaa taaaccagcc
agccggaagg 1860gccgagcgca gaagtggtcc tgcaacttta tccgcctcca
tccagtctat taattgttgc 1920cgggaagcta gagtaagtag ttcgccagtt
aatagtttgc gcaacgttgt tgccattgct 1980acaggcatcg tggtgtcacg
ctcgtcgttt ggtatggctt cattcagctc cggttcccaa 2040cgatcaaggc
gagttacatg atcccccatg ttgtgcaaaa aagcggttag ctccttcggt
2100cctccgatcg ttgtcagaag taagttggcc gcagtgttat cactcatggt
tatggcagca 2160ctgcataatt ctcttactgt catgccatcc gtaagatgct
tttctgtgac tggtgagtac 2220tcaaccaagt cattctgaga atagtgtatg
cggcgaccga gttgctcttg cccggcgtca 2280atacgggata ataccgcgcc
acatagcaga actttaaaag tgctcatcat tggaaaacgt 2340tcttcggggc
gaaaactctc aaggatctta ccgctgttga gatccagttc gatgtaaccc
2400actcgtgcac ccaactgatc ttcagcatct tttactttca ccagcgtttc
tgggtgagca 2460aaaacaggaa ggcaaaatgc cgcaaaaaag ggaataaggg
cgacacggaa atgttgaata 2520ctcatactct tcctttttca atattattga
agcatttatc agggttattg tctcatgagc 2580ggatacatat ttgaatgtat
ttagaaaaat aaacaaatag gggttccgcg cacatttccc 2640cgaaaagtgc
cacctgacgc gccctgtagc ggcgcattaa gcgcggcggg tgtggtggtt
2700acgcgcagcg tgaccgctac acttgccagc gccctagcgc ccgctccttt
cgctttcttc 2760ccttcctttc tcgccacgtt cgccggcttt ccccgtcaag
ctctaaatcg ggggctccct 2820ttagggttcc gatttagtgc tttacggcac
ctcgacccca aaaaacttga ttagggtgat 2880ggttcacgta gtgggccatc
gccctgatag acggtttttc gccctttgac gttggagtcc 2940acgttcttta
atagtggact cttgttccaa actggaacaa cactcaaccc tatctcggtc
3000tattcttttg atttataagg gattttgccg atttcggcct attggttaaa
aaatgagctg 3060atttaacaaa aatttaacgc gaattttaac aaaatattaa
cgcttacaat ttccattcgc 3120cattcaggct gcgcaactgt tgggaagggc
gatcggtgcg ggcctcttcg ctattacgcc 3180agctggcgaa agggggatgt
gctgcaaggc gattaagttg ggtaacgcca gggttttccc 3240agtcacgacg
ttgtaaaacg acggccagtg aattgtaata cgactcacta tagggcgaat
3300tgggtaccgg gccccccctc gaggtcgatg gtgtcgataa gcttgatatc
gaattcatgt 3360cacacaaacc gatcttcgcc tcaaggaaac ctaattctac
atccgagaga ctgccgagat 3420ccagtctaca ctgattaatt ttcgggccaa
taatttaaaa aaatcgtgtt atataatatt 3480atatgtatta tatatataca
tcatgatgat actgacagtc atgtcccatt gctaaataga 3540cagactccat
ctgccgcctc caactgatgt tctcaatatt taaggggtca tctcgcattg
3600tttaataata aacagactcc atctaccgcc tccaaatgat gttctcaaaa
tatattgtat 3660gaacttattt ttattactta gtattattag acaacttact
tgctttatga aaaacacttc 3720ctatttagga aacaatttat aatggcagtt
cgttcattta acaatttatg tagaataaat 3780gttataaatg cgtatgggaa
atcttaaata tggatagcat aaatgatatc tgcattgcct 3840aattcgaaat
caacagcaac gaaaaaaatc ccttgtacaa cataaatagt catcgagaaa
3900tatcaactat caaagaacag ctattcacac gttactattg agattattat
tggacgagaa 3960tcacacactc aactgtcttt ctctcttcta gaaatacagg
tacaagtatg tactattctc 4020attgttcata cttctagtca tttcatccca
catattcctt ggatttctct ccaatgaatg 4080acattctatc ttgcaaattc
aacaattata ataagatata ccaaagtagc ggtatagtgg 4140caatcaaaaa
gcttctctgg tgtgcttctc gtatttattt ttattctaat gatccattaa
4200aggtatatat ttatttcttg ttatataatc cttttgttta ttacatgggc
tggatacata 4260aaggtatttt gatttaattt tttgcttaaa ttcaatcccc
cctcgttcag tgtcaactgt 4320aatggtagga aattaccata cttttgaaga
agcaaaaaaa atgaaagaaa aaaaaaatcg 4380tatttccagg ttagacgttc
cgcagaatct agaatgcggt atgcggtaca ttgttcttcg 4440aacgtaaaag
ttgcgctccc tgagatattg tacatttttg cttttacaag tacaagtaca
4500tcgtacaact atgtactact gttgatgcat ccacaacagt ttgttttgtt
tttttttgtt 4560tttttttttt ctaatgattc attaccgcta tgtataccta
cttgtacttg tagtaagccg 4620ggttattggc gttcaattaa tcatagactt
atgaatctgc acggtgtgcg ctgcgagtta 4680cttttagctt atgcatgcta
cttgggtgta atattgggat ctgttcggaa atcaacggat 4740gctcaatcga
tttcgacagt aattaattaa gtcatacaca agtcagcttt cttcgagcct
4800catataagta taagtagttc aacgtattag cactgtaccc agcatctccg
tatcgagaaa 4860cacaacaaca tgccccattg gacagatcat gcggatacac
aggttgtgca gtatcataca 4920tactcgatca gacaggtcgt ctgaccatca
tacaagctga acaagcgctc catacttgca 4980cgctctctat atacacagtt
aaattacata tccatagtct aacctctaac agttaatctt 5040ctggtaagcc
tcccagccag ccttctggta tcgcttggcc tcctcaatag gatctcggtt
5100ctggccgtac agacctcggc cgacaattat gatatccgtt ccggtagaca
tgacatcctc 5160aacagttcgg tactgctgtc cgagagcgtc tcccttgtcg
tcaagaccca ccccgggggt 5220cagaataagc cagtcctcag agtcgccctt
aggtcggttc tgggcaatga agccaaccac 5280aaactcgggg tcggatcggg
caagctcaat ggtctgcttg gagtactcgc cagtggccag 5340agagcccttg
caagacagct cggccagcat gagcagacct ctggccagct tctcgttggg
5400agaggggact aggaactcct tgtactggga gttctcgtag tcagagacgt
cctccttctt 5460ctgttcagag acagtttcct cggcaccagc tcgcaggcca
gcaatgattc cggttccggg 5520tacaccgtgg gcgttggtga tatcggacca
ctcggcgatt cggtgacacc ggtactggtg 5580cttgacagtg ttgccaatat
ctgcgaactt tctgtcctcg aacaggaaga aaccgtgctt 5640aagagcaagt
tccttgaggg ggagcacagt gccggcgtag gtgaagtcgt caatgatgtc
5700gatatgggtt ttgatcatgc acacataagg tccgacctta tcggcaagct
caatgagctc 5760cttggtggtg gtaacatcca gagaagcaca caggttggtt
ttcttggctg ccacgagctt 5820gagcactcga gcggcaaagg cggacttgtg
gacgttagct cgagcttcgt aggagggcat 5880tttggtggtg aagaggagac
tgaaataaat ttagtctgca gaacttttta tcggaacctt 5940atctggggca
gtgaagtata tgttatggta atagttacga gttagttgaa cttatagata
6000gactggacta tacggctatc ggtccaaatt agaaagaacg tcaatggctc
tctgggcgtc 6060gcctttgccg acaaaaatgt gatcatgatg aaagccagca
atgacgttgc agctgatatt 6120gttgtcggcc aaccgcgccg aaaacgcagc
tgtcagaccc acagcctcca acgaagaatg 6180tatcgtcaaa gtgatccaag
cacactcata gttggagtcg tactccaaag gcggcaatga 6240cgagtcagac
agatactcgt cgaaaacagt gtacgcagat ctactataga ggaacattta
6300aattgccccg gagaagacgg ccaggccgcc tagatgacaa attcaacaac
tcacagctga 6360ctttctgcca ttgccactag gggggggcct ttttatatgg
ccaagccaag ctctccacgt 6420cggttgggct gcacccaaca ataaatgggt
agggttgcac caacaaaggg atgggatggg 6480gggtagaaga tacgaggata
acggggctca atggcacaaa taagaacgaa tactgccatt 6540aagactcgtg
atccagcgac tgacaccatt gcatcatcta agggcctcaa aactacctcg
6600gaactgctgc gctgatctgg acaccacaga ggttccgagc actttaggtt
gcaccaaatg 6660tcccaccagg tgcaggcaga aaacgctgga acagcgtgta
cagtttgtct taacaaaaag 6720tgagggcgct gaggtcgagc agggtggtgt
gacttgttat agcctttaga gctgcgaaag 6780cgcgtatgga tttggctcat
caggccagat tgagggtctg tggacacatg tcatgttagt 6840gtacttcaat
cgccccctgg atatagcccc gacaataggc cgtggcctca tttttttgcc
6900ttccgcacat ttccattgct cggtacccac accttgcttc tcctgcactt
gccaacctta 6960atactggttt acattgacca acatcttaca agcggggggc
ttgtctaggg tatatataaa 7020cagtggctct cccaatcggt tgccagtctc
ttttttcctt tctttcccca cagattcgaa 7080atctaaacta cacatcacag
aattccgagc cgtgagtatc cacgacaaga tcagtgtcga 7140gacgacgcgt
tttgtgtaat gacacaatcc gaaagtcgct agcaacacac actctctaca
7200caaactaacc cagctctggt ac 7222478133DNAArtificial
SequencePlasmid pFBAIN-Pex10 47ggccgcaagt gtggatgggg aagtgagtgc
ccggttctgt gtgcacaatt ggcaatccaa 60gatggatgga ttcaacacag ggatatagcg
agctacgtgg tggtgcgagg atatagcaac 120ggatatttat gtttgacact
tgagaatgta cgatacaagc actgtccaag tacaatacta 180aacatactgt
acatactcat actcgtaccc gggcaacggt ttcacttgag tgcagtggct
240agtgctctta ctcgtacagt gtgcaatact gcgtatcata gtctttgatg
tatatcgtat 300tcattcatgt tagttgcgta cgagccggaa gcataaagtg
taaagcctgg ggtgcctaat 360gagtgagcta actcacatta attgcgttgc
gctcactgcc cgctttccag tcgggaaacc 420tgtcgtgcca gctgcattaa
tgaatcggcc aacgcgcggg gagaggcggt ttgcgtattg 480ggcgctcttc
cgcttcctcg ctcactgact cgctgcgctc ggtcgttcgg ctgcggcgag
540cggtatcagc tcactcaaag gcggtaatac ggttatccac agaatcaggg
gataacgcag 600gaaagaacat gtgagcaaaa ggccagcaaa aggccaggaa
ccgtaaaaag gccgcgttgc 660tggcgttttt ccataggctc cgcccccctg
acgagcatca caaaaatcga cgctcaagtc 720agaggtggcg aaacccgaca
ggactataaa gataccaggc gtttccccct ggaagctccc 780tcgtgcgctc
tcctgttccg accctgccgc ttaccggata cctgtccgcc tttctccctt
840cgggaagcgt ggcgctttct catagctcac gctgtaggta tctcagttcg
gtgtaggtcg 900ttcgctccaa gctgggctgt gtgcacgaac cccccgttca
gcccgaccgc tgcgccttat 960ccggtaacta tcgtcttgag tccaacccgg
taagacacga cttatcgcca ctggcagcag 1020ccactggtaa caggattagc
agagcgaggt atgtaggcgg tgctacagag ttcttgaagt 1080ggtggcctaa
ctacggctac actagaagga cagtatttgg tatctgcgct ctgctgaagc
1140cagttacctt cggaaaaaga gttggtagct cttgatccgg caaacaaacc
accgctggta 1200gcggtggttt ttttgtttgc aagcagcaga ttacgcgcag
aaaaaaagga tctcaagaag 1260atcctttgat cttttctacg gggtctgacg
ctcagtggaa cgaaaactca cgttaaggga 1320ttttggtcat gagattatca
aaaaggatct tcacctagat ccttttaaat taaaaatgaa 1380gttttaaatc
aatctaaagt atatatgagt aaacttggtc tgacagttac caatgcttaa
1440tcagtgaggc acctatctca gcgatctgtc tatttcgttc atccatagtt
gcctgactcc 1500ccgtcgtgta gataactacg atacgggagg gcttaccatc
tggccccagt gctgcaatga 1560taccgcgaga cccacgctca ccggctccag
atttatcagc aataaaccag ccagccggaa 1620gggccgagcg cagaagtggt
cctgcaactt tatccgcctc catccagtct attaattgtt 1680gccgggaagc
tagagtaagt agttcgccag ttaatagttt gcgcaacgtt gttgccattg
1740ctacaggcat cgtggtgtca cgctcgtcgt ttggtatggc ttcattcagc
tccggttccc 1800aacgatcaag gcgagttaca tgatccccca tgttgtgcaa
aaaagcggtt agctccttcg 1860gtcctccgat cgttgtcaga agtaagttgg
ccgcagtgtt atcactcatg gttatggcag 1920cactgcataa ttctcttact
gtcatgccat ccgtaagatg cttttctgtg actggtgagt 1980actcaaccaa
gtcattctga gaatagtgta tgcggcgacc gagttgctct tgcccggcgt
2040caatacggga taataccgcg ccacatagca gaactttaaa agtgctcatc
attggaaaac 2100gttcttcggg gcgaaaactc tcaaggatct taccgctgtt
gagatccagt tcgatgtaac 2160ccactcgtgc acccaactga tcttcagcat
cttttacttt caccagcgtt tctgggtgag 2220caaaaacagg aaggcaaaat
gccgcaaaaa agggaataag ggcgacacgg aaatgttgaa 2280tactcatact
cttccttttt caatattatt gaagcattta tcagggttat tgtctcatga
2340gcggatacat atttgaatgt atttagaaaa ataaacaaat aggggttccg
cgcacatttc 2400cccgaaaagt gccacctgac gcgccctgta gcggcgcatt
aagcgcggcg ggtgtggtgg 2460ttacgcgcag cgtgaccgct acacttgcca
gcgccctagc gcccgctcct ttcgctttct 2520tcccttcctt tctcgccacg
ttcgccggct ttccccgtca agctctaaat cgggggctcc 2580ctttagggtt
ccgatttagt gctttacggc acctcgaccc caaaaaactt gattagggtg
2640atggttcacg tagtgggcca tcgccctgat agacggtttt tcgccctttg
acgttggagt 2700ccacgttctt taatagtgga ctcttgttcc aaactggaac
aacactcaac cctatctcgg 2760tctattcttt tgatttataa gggattttgc
cgatttcggc ctattggtta aaaaatgagc 2820tgatttaaca aaaatttaac
gcgaatttta acaaaatatt aacgcttaca atttccattc 2880gccattcagg
ctgcgcaact gttgggaagg gcgatcggtg cgggcctctt cgctattacg
2940ccagctggcg aaagggggat gtgctgcaag gcgattaagt tgggtaacgc
cagggttttc 3000ccagtcacga cgttgtaaaa cgacggccag tgaattgtaa
tacgactcac tatagggcga 3060attgggtacc gggccccccc tcgaggtcga
tggtgtcgat aagcttgata tcgaattcat 3120gtcacacaaa ccgatcttcg
cctcaaggaa acctaattct acatccgaga gactgccgag 3180atccagtcta
cactgattaa ttttcgggcc aataatttaa aaaaatcgtg ttatataata
3240ttatatgtat tatatatata catcatgatg atactgacag tcatgtccca
ttgctaaata 3300gacagactcc atctgccgcc tccaactgat gttctcaata
tttaaggggt catctcgcat 3360tgtttaataa taaacagact ccatctaccg
cctccaaatg atgttctcaa aatatattgt 3420atgaacttat ttttattact
tagtattatt agacaactta cttgctttat gaaaaacact 3480tcctatttag
gaaacaattt ataatggcag ttcgttcatt taacaattta tgtagaataa
3540atgttataaa tgcgtatggg aaatcttaaa tatggatagc ataaatgata
tctgcattgc 3600ctaattcgaa atcaacagca acgaaaaaaa tcccttgtac
aacataaata gtcatcgaga 3660aatatcaact atcaaagaac agctattcac
acgttactat tgagattatt attggacgag 3720aatcacacac tcaactgtct
ttctctcttc tagaaataca ggtacaagta tgtactattc 3780tcattgttca
tacttctagt catttcatcc cacatattcc ttggatttct ctccaatgaa
3840tgacattcta tcttgcaaat tcaacaatta taataagata taccaaagta
gcggtatagt 3900ggcaatcaaa aagcttctct ggtgtgcttc tcgtatttat
ttttattcta atgatccatt 3960aaaggtatat atttatttct tgttatataa
tccttttgtt tattacatgg gctggataca 4020taaaggtatt ttgatttaat
tttttgctta aattcaatcc cccctcgttc agtgtcaact 4080gtaatggtag
gaaattacca tacttttgaa gaagcaaaaa aaatgaaaga aaaaaaaaat
4140cgtatttcca ggttagacgt tccgcagaat ctagaatgcg gtatgcggta
cattgttctt 4200cgaacgtaaa agttgcgctc cctgagatat tgtacatttt
tgcttttaca agtacaagta 4260catcgtacaa ctatgtacta ctgttgatgc
atccacaaca gtttgttttg tttttttttg 4320tttttttttt ttctaatgat
tcattaccgc tatgtatacc tacttgtact tgtagtaagc 4380cgggttattg
gcgttcaatt aatcatagac ttatgaatct gcacggtgtg cgctgcgagt
4440tacttttagc ttatgcatgc tacttgggtg taatattggg atctgttcgg
aaatcaacgg 4500atgctcaatc gatttcgaca gtaattaatt aagtcataca
caagtcagct ttcttcgagc 4560ctcatataag tataagtagt tcaacgtatt
agcactgtac ccagcatctc cgtatcgaga 4620aacacaacaa catgccccat
tggacagatc atgcggatac acaggttgtg cagtatcata 4680catactcgat
cagacaggtc gtctgaccat catacaagct gaacaagcgc tccatacttg
4740cacgctctct atatacacag ttaaattaca tatccatagt ctaacctcta
acagttaatc 4800ttctggtaag cctcccagcc agccttctgg tatcgcttgg
cctcctcaat aggatctcgg 4860ttctggccgt acagacctcg gccgacaatt
atgatatccg ttccggtaga catgacatcc 4920tcaacagttc ggtactgctg
tccgagagcg tctcccttgt cgtcaagacc caccccgggg 4980gtcagaataa
gccagtcctc agagtcgccc ttaggtcggt tctgggcaat gaagccaacc
5040acaaactcgg ggtcggatcg ggcaagctca atggtctgct tggagtactc
gccagtggcc 5100agagagccct tgcaagacag ctcggccagc atgagcagac
ctctggccag cttctcgttg 5160ggagagggga ctaggaactc cttgtactgg
gagttctcgt agtcagagac gtcctccttc 5220ttctgttcag agacagtttc
ctcggcacca gctcgcaggc cagcaatgat tccggttccg 5280ggtacaccgt
gggcgttggt gatatcggac cactcggcga ttcggtgaca ccggtactgg
5340tgcttgacag tgttgccaat atctgcgaac tttctgtcct cgaacaggaa
gaaaccgtgc 5400ttaagagcaa gttccttgag ggggagcaca gtgccggcgt
aggtgaagtc gtcaatgatg 5460tcgatatggg ttttgatcat gcacacataa
ggtccgacct tatcggcaag ctcaatgagc 5520tccttggtgg tggtaacatc
cagagaagca cacaggttgg ttttcttggc tgccacgagc 5580ttgagcactc
gagcggcaaa ggcggacttg tggacgttag ctcgagcttc gtaggagggc
5640attttggtgg tgaagaggag actgaaataa atttagtctg cagaactttt
tatcggaacc 5700ttatctgggg cagtgaagta tatgttatgg taatagttac
gagttagttg aacttataga 5760tagactggac tatacggcta tcggtccaaa
ttagaaagaa cgtcaatggc tctctgggcg 5820tcgcctttgc cgacaaaaat
gtgatcatga tgaaagccag caatgacgtt gcagctgata 5880ttgttgtcgg
ccaaccgcgc cgaaaacgca gctgtcagac ccacagcctc caacgaagaa
5940tgtatcgtca aagtgatcca agcacactca tagttggagt cgtactccaa
aggcggcaat 6000gacgagtcag acagatactc gtcgaaaaca gtgtacgcag
atctactata gaggaacatt 6060taaattgccc cggagaagac ggccaggccg
cctagatgac aaattcaaca actcacagct 6120gactttctgc cattgccact
aggggggggc ctttttatat ggccaagcca agctctccac 6180gtcggttggg
ctgcacccaa caataaatgg gtagggttgc accaacaaag ggatgggatg
6240gggggtagaa gatacgagga taacggggct caatggcaca aataagaacg
aatactgcca 6300ttaagactcg tgatccagcg actgacacca ttgcatcatc
taagggcctc aaaactacct 6360cggaactgct gcgctgatct ggacaccaca
gaggttccga gcactttagg ttgcaccaaa 6420tgtcccacca ggtgcaggca
gaaaacgctg gaacagcgtg tacagtttgt cttaacaaaa 6480agtgagggcg
ctgaggtcga gcagggtggt gtgacttgtt atagccttta gagctgcgaa
6540agcgcgtatg gatttggctc atcaggccag attgagggtc tgtggacaca
tgtcatgtta 6600gtgtacttca atcgccccct ggatatagcc ccgacaatag
gccgtggcct catttttttg 6660ccttccgcac atttccattg ctcggtaccc
acaccttgct tctcctgcac ttgccaacct 6720taatactggt ttacattgac
caacatctta caagcggggg gcttgtctag ggtatatata 6780aacagtggct
ctcccaatcg gttgccagtc tcttttttcc tttctttccc cacagattcg
6840aaatctaaac tacacatcac agaattccga gccgtgagta tccacgacaa
gatcagtgtc 6900gagacgacgc gttttgtgta atgacacaat ccgaaagtcg
ctagcaacac acactctcta 6960cacaaactaa cccagctctg gtaccatggg
gggaagttca catgcattcg ctggtgaatc 7020tgatctgaca ctacaactac
acaccaggtc caacatgagc gacaatacga caatcaaaaa 7080gccgatccga
cccaaaccga tccggacgga acgcctgcct tacgctgggg ccgcagaaat
7140catccgagcc aaccagaaag accactactt tgagtccgtg cttgaacagc
atctcgtcac 7200gtttctgcag aaatggaagg gagtacgatt tatccaccag
tacaaggagg agctggagac 7260ggcgtccaag tttgcatatc tcggtttgtg
tacgcttgtg ggctccaaga ctctcggaga 7320agagtacacc aatctcatgt
acactatcag agaccgaaca gctctaccgg gggtggtgag 7380acggtttggc
tacgtgcttt ccaacactct gtttccatac ctgtttgtgc gctacatggg
7440caagttgcgc gccaaactga tgcgcgagta tccccatctg gtggagtacg
acgaagatga 7500gcctgtgccc agcccggaaa catggaagga gcgggtcatc
aagacgtttg tgaacaagtt 7560tgacaagttc acggcgctgg aggggtttac
cgcgatccac ttggcgattt tctacgtcta 7620cggctcgtac taccagctca
gtaagcggat ctggggcatg cgttatgtat ttggacaccg 7680actggacaag
aatgagcctc gaatcggtta cgagatgctc ggtctgctga ttttcgcccg
7740gtttgccacg tcatttgtgc agacgggaag agagtacctc ggagcgctgc
tggaaaagag 7800cgtggagaaa gaggcagggg agaaggaaga tgaaaaggaa
gcggttgtgc cgaaaaagaa 7860gtcgtcaatt ccgttcattg aggatacaga
aggggagacg gaagacaaga tcgatctgga 7920ggaccctcga cagctcaagt
tcattcctga ggcgtccaga gcgtgcactc tgtgtctgtc 7980atacattagt
gcgccggcat gtacgccatg tggacacttt ttctgttggg actgtatttc
8040cgaatgggtg agagagaagc ccgagtgtcc cttgtgtcgg cagggtgtga
gagagcagaa 8100cttgttgcct atcagataat gacgaggtct ggc
81334835DNAArtificial SequencePrimer PEX10-R-BsiWI 48gatcaacgta
cgcttcagca gtaactgtat tgctc 354935DNAArtificial SequencePrimer
PEX10-F1-SalI 49gatcaagtcg acattgtaac tagtcctgga gggtc
355036DNAArtificial SequencePrimer PEX10-F2-SalI 50gatcaagtcg
acgtcttagc gtcatgtatt ctcaag 36517277DNAArtificial SequencePlasmid
pEXP-MOD1 51catggatcca ggcctgttaa cggccattac ggcctgcagg atccgaaaaa
acctcccaca 60cctccccctg aacctgaaac ataaaatgaa tgcaattgtt gttgttaact
tgtttattgc 120agcttataat ggttacaaat aaagcaatag catcacaaat
ttcacaaata aagcattttt 180ttcactgcat tctagttgtg gtttgtccaa
actcatcaat gtatcttatc atgtctgcgg 240ccgcaagtgt ggatggggaa
gtgagtgccc ggttctgtgt gcacaattgg caatccaaga 300tggatggatt
caacacaggg atatagcgag ctacgtggtg gtgcgaggat atagcaacgg
360atatttatgt ttgacacttg agaatgtacg atacaagcac tgtccaagta
caatactaaa 420catactgtac atactcatac tcgtacccgg gcaacggttt
cacttgagtg cagtggctag 480tgctcttact cgtacagtgt gcaatactgc
gtatcatagt ctttgatgta tatcgtattc 540attcatgtta gttgcgtacg
agccggaagc ataaagtgta aagcctgggg tgcctaatga 600gtgagctaac
tcacattaat tgcgttgcgc tcactgcccg ctttccagtc gggaaacctg
660tcgtgccagc tgcattaatg aatcggccaa cgcgcgggga gaggcggttt
gcgtattggg 720cgctcttccg cttcctcgct cactgactcg ctgcgctcgg
tcgttcggct gcggcgagcg 780gtatcagctc actcaaaggc ggtaatacgg
ttatccacag aatcagggga taacgcagga 840aagaacatgt gagcaaaagg
ccagcaaaag gccaggaacc gtaaaaaggc cgcgttgctg 900gcgtttttcc
ataggctccg cccccctgac gagcatcaca aaaatcgacg ctcaagtcag
960aggtggcgaa acccgacagg actataaaga taccaggcgt ttccccctgg
aagctccctc 1020gtgcgctctc ctgttccgac cctgccgctt accggatacc
tgtccgcctt tctcccttcg 1080ggaagcgtgg cgctttctca tagctcacgc
tgtaggtatc tcagttcggt gtaggtcgtt 1140cgctccaagc tgggctgtgt
gcacgaaccc cccgttcagc ccgaccgctg cgccttatcc 1200ggtaactatc
gtcttgagtc caacccggta agacacgact tatcgccact ggcagcagcc
1260actggtaaca ggattagcag agcgaggtat gtaggcggtg ctacagagtt
cttgaagtgg 1320tggcctaact acggctacac tagaaggaca gtatttggta
tctgcgctct gctgaagcca 1380gttaccttcg gaaaaagagt tggtagctct
tgatccggca aacaaaccac cgctggtagc 1440ggtggttttt ttgtttgcaa
gcagcagatt acgcgcagaa aaaaaggatc tcaagaagat 1500cctttgatct
tttctacggg gtctgacgct cagtggaacg aaaactcacg ttaagggatt
1560ttggtcatga gattatcaaa aaggatcttc acctagatcc ttttaaatta
aaaatgaagt 1620tttaaatcaa tctaaagtat atatgagtaa acttggtctg
acagttacca atgcttaatc 1680agtgaggcac ctatctcagc gatctgtcta
tttcgttcat ccatagttgc ctgactcccc 1740gtcgtgtaga taactacgat
acgggagggc ttaccatctg gccccagtgc tgcaatgata 1800ccgcgagacc
cacgctcacc ggctccagat ttatcagcaa taaaccagcc agccggaagg
1860gccgagcgca gaagtggtcc tgcaacttta tccgcctcca tccagtctat
taattgttgc 1920cgggaagcta gagtaagtag ttcgccagtt aatagtttgc
gcaacgttgt tgccattgct 1980acaggcatcg tggtgtcacg ctcgtcgttt
ggtatggctt cattcagctc cggttcccaa 2040cgatcaaggc gagttacatg
atcccccatg ttgtgcaaaa aagcggttag ctccttcggt 2100cctccgatcg
ttgtcagaag taagttggcc gcagtgttat cactcatggt tatggcagca
2160ctgcataatt ctcttactgt catgccatcc gtaagatgct tttctgtgac
tggtgagtac 2220tcaaccaagt cattctgaga atagtgtatg cggcgaccga
gttgctcttg cccggcgtca 2280atacgggata ataccgcgcc acatagcaga
actttaaaag tgctcatcat tggaaaacgt 2340tcttcggggc gaaaactctc
aaggatctta ccgctgttga gatccagttc gatgtaaccc 2400actcgtgcac
ccaactgatc ttcagcatct tttactttca ccagcgtttc tgggtgagca
2460aaaacaggaa ggcaaaatgc cgcaaaaaag ggaataaggg cgacacggaa
atgttgaata 2520ctcatactct tcctttttca atattattga agcatttatc
agggttattg tctcatgagc 2580ggatacatat ttgaatgtat ttagaaaaat
aaacaaatag gggttccgcg cacatttccc 2640cgaaaagtgc cacctgacgc
gccctgtagc ggcgcattaa gcgcggcggg tgtggtggtt 2700acgcgcagcg
tgaccgctac acttgccagc gccctagcgc ccgctccttt cgctttcttc
2760ccttcctttc tcgccacgtt cgccggcttt ccccgtcaag ctctaaatcg
ggggctccct 2820ttagggttcc gatttagtgc tttacggcac ctcgacccca
aaaaacttga ttagggtgat 2880ggttcacgta gtgggccatc gccctgatag
acggtttttc gccctttgac gttggagtcc 2940acgttcttta atagtggact
cttgttccaa actggaacaa cactcaaccc tatctcggtc 3000tattcttttg
atttataagg gattttgccg atttcggcct attggttaaa aaatgagctg
3060atttaacaaa aatttaacgc gaattttaac aaaatattaa cgcttacaat
ttccattcgc 3120cattcaggct gcgcaactgt tgggaagggc gatcggtgcg
ggcctcttcg ctattacgcc 3180agctggcgaa agggggatgt gctgcaaggc
gattaagttg ggtaacgcca gggttttccc 3240agtcacgacg ttgtaaaacg
acggccagtg aattgtaata cgactcacta tagggcgaat 3300tgggtaccgg
gccccccctc gaggtcgatg gtgtcgataa gcttgatatc gaattcatgt
3360cacacaaacc gatcttcgcc tcaaggaaac ctaattctac atccgagaga
ctgccgagat 3420ccagtctaca ctgattaatt ttcgggccaa taatttaaaa
aaatcgtgtt atataatatt 3480atatgtatta tatatataca tcatgatgat
actgacagtc atgtcccatt gctaaataga 3540cagactccat ctgccgcctc
caactgatgt tctcaatatt taaggggtca tctcgcattg 3600tttaataata
aacagactcc atctaccgcc tccaaatgat gttctcaaaa tatattgtat
3660gaacttattt ttattactta gtattattag acaacttact tgctttatga
aaaacacttc 3720ctatttagga aacaatttat aatggcagtt cgttcattta
acaatttatg tagaataaat 3780gttataaatg cgtatgggaa atcttaaata
tggatagcat aaatgatatc tgcattgcct 3840aattcgaaat caacagcaac
gaaaaaaatc ccttgtacaa cataaatagt catcgagaaa 3900tatcaactat
caaagaacag ctattcacac gttactattg agattattat tggacgagaa
3960tcacacactc aactgtcttt ctctcttcta gaaatacagg tacaagtatg
tactattctc 4020attgttcata cttctagtca tttcatccca catattcctt
ggatttctct ccaatgaatg 4080acattctatc ttgcaaattc aacaattata
ataagatata ccaaagtagc ggtatagtgg 4140caatcaaaaa gcttctctgg
tgtgcttctc gtatttattt ttattctaat gatccattaa 4200aggtatatat
ttatttcttg ttatataatc cttttgttta ttacatgggc tggatacata
4260aaggtatttt gatttaattt tttgcttaaa ttcaatcccc cctcgttcag
tgtcaactgt 4320aatggtagga aattaccata cttttgaaga agcaaaaaaa
atgaaagaaa aaaaaaatcg 4380tatttccagg ttagacgttc cgcagaatct
agaatgcggt atgcggtaca ttgttcttcg 4440aacgtaaaag ttgcgctccc
tgagatattg tacatttttg cttttacaag tacaagtaca 4500tcgtacaact
atgtactact gttgatgcat ccacaacagt ttgttttgtt tttttttgtt
4560tttttttttt ctaatgattc attaccgcta tgtataccta cttgtacttg
tagtaagccg 4620ggttattggc gttcaattaa tcatagactt atgaatctgc
acggtgtgcg ctgcgagtta 4680cttttagctt atgcatgcta cttgggtgta
atattgggat ctgttcggaa atcaacggat 4740gctcaatcga tttcgacagt
aattaattaa gtcatacaca agtcagcttt cttcgagcct 4800catataagta
taagtagttc aacgtattag cactgtaccc agcatctccg tatcgagaaa
4860cacaacaaca tgccccattg gacagatcat gcggatacac aggttgtgca
gtatcataca 4920tactcgatca gacaggtcgt ctgaccatca tacaagctga
acaagcgctc catacttgca 4980cgctctctat atacacagtt aaattacata
tccatagtct aacctctaac agttaatctt 5040ctggtaagcc tcccagccag
ccttctggta tcgcttggcc tcctcaatag gatctcggtt 5100ctggccgtac
agacctcggc cgacaattat gatatccgtt ccggtagaca tgacatcctc
5160aacagttcgg tactgctgtc cgagagcgtc tcccttgtcg tcaagaccca
ccccgggggt 5220cagaataagc cagtcctcag agtcgccctt aggtcggttc
tgggcaatga agccaaccac 5280aaactcgggg tcggatcggg caagctcaat
ggtctgcttg gagtactcgc cagtggccag 5340agagcccttg caagacagct
cggccagcat gagcagacct ctggccagct tctcgttggg 5400agaggggact
aggaactcct tgtactggga gttctcgtag tcagagacgt cctccttctt
5460ctgttcagag acagtttcct cggcaccagc tcgcaggcca gcaatgattc
cggttccggg 5520tacaccgtgg gcgttggtga tatcggacca ctcggcgatt
cggtgacacc ggtactggtg 5580cttgacagtg ttgccaatat ctgcgaactt
tctgtcctcg aacaggaaga aaccgtgctt 5640aagagcaagt tccttgaggg
ggagcacagt gccggcgtag gtgaagtcgt caatgatgtc 5700gatatgggtt
ttgatcatgc acacataagg tccgacctta tcggcaagct caatgagctc
5760cttggtggtg gtaacatcca gagaagcaca caggttggtt ttcttggctg
ccacgagctt 5820gagcactcga gcggcaaagg cggacttgtg gacgttagct
cgagcttcgt aggagggcat 5880tttggtggtg aagaggagac tgaaataaat
ttagtctgca gaacttttta tcggaacctt 5940atctggggca gtgaagtata
tgttatggta atagttacga gttagttgaa cttatagata 6000gactggacta
tacggctatc ggtccaaatt agaaagaacg tcaatggctc tctgggcgtc
6060gcctttgccg acaaaaatgt gatcatgatg aaagccagca atgacgttgc
agctgatatt 6120gttgtcggcc aaccgcgccg aaaacgcagc tgtcagaccc
acagcctcca acgaagaatg 6180tatcgtcaaa gtgatccaag cacactcata
gttggagtcg tactccaaag gcggcaatga 6240cgagtcagac agatactcgt
cgaccgtacg gggagtttgg cgcccgtttt ttcgagcccc 6300acacgtttcg
gtgagtatga gcggcggcag attcgagcgt ttccggtttc cgcggctgga
6360cgagagccca tgatgggggc tcccaccacc agcaatcagg gccctgatta
cacacccacc 6420tgtaatgtca tgctgttcat cgatggttaa tgctgctgtg
tgctgtgtgt gtgtgttgtt 6480tggcgctcat tgttgcgtta tgcagcgtac
accacaatat tggaagctta ttagcctttc 6540tattttttcg tttgcaaggc
ttaacaacat tgctgtggag agggatgggg atatggaggc 6600cgctggaggg
agtcggagag gcgttttgga gcggcttggc ctggcgccca gctcgcgaaa
6660cgcacctagg accctttggc acgccgaaat gtgccacttt tcagtctagt
aacgccttac 6720ctacgtcatt ccatgcgtgc atgtttgcgc cttttttccc
ttgcccttga tcgccacaca 6780gtacagtgca ctgtacagtg gaggttttgg
gggggtctta gatgggagct aaaagcggcc 6840tagcggtaca ctagtgggat
tgtatggagt ggcatggagc ctaggtggag cctgacagga 6900cgcacgaccg
gctagcccgt gacagacgat gggtggctcc tgttgtccac cgcgtacaaa
6960tgtttgggcc aaagtcttgt cagccttgct tgcgaaccta attcccaatt
ttgtcacttc 7020gcacccccat tgatcgagcc ctaacccctg cccatcaggc
aatccaatta agctcgcatt 7080gtctgccttg tttagtttgg ctcctgcccg
tttcggcgtc cacttgcaca aacacaaaca 7140agcattatat ataaggctcg
tctctccctc ccaaccacac tcactttttt gcccgtcttc 7200ccttgctaac
acaaaagtca agaacacaaa caaccacccc aaccccctta cacacaagac
7260atatctacag caatggc 7277527559DNAArtificial SequencePlasmid
pPEX10-1 52gtacgagccg gaagcataaa gtgtaaagcc tggggtgcct aatgagtgag
ctaactcaca 60ttaattgcgt tgcgctcact gcccgctttc cagtcgggaa acctgtcgtg
ccagctgcat 120taatgaatcg gccaacgcgc ggggagaggc ggtttgcgta
ttgggcgctc ttccgcttcc 180tcgctcactg actcgctgcg ctcggtcgtt
cggctgcggc gagcggtatc agctcactca 240aaggcggtaa tacggttatc
cacagaatca ggggataacg caggaaagaa catgtgagca 300aaaggccagc
aaaaggccag gaaccgtaaa aaggccgcgt tgctggcgtt tttccatagg
360ctccgccccc ctgacgagca tcacaaaaat cgacgctcaa gtcagaggtg
gcgaaacccg 420acaggactat aaagatacca ggcgtttccc cctggaagct
ccctcgtgcg ctctcctgtt 480ccgaccctgc cgcttaccgg atacctgtcc
gcctttctcc cttcgggaag cgtggcgctt 540tctcatagct cacgctgtag
gtatctcagt tcggtgtagg tcgttcgctc caagctgggc 600tgtgtgcacg
aaccccccgt tcagcccgac cgctgcgcct tatccggtaa ctatcgtctt
660gagtccaacc cggtaagaca cgacttatcg ccactggcag cagccactgg
taacaggatt 720agcagagcga ggtatgtagg cggtgctaca gagttcttga
agtggtggcc taactacggc 780tacactagaa ggacagtatt tggtatctgc
gctctgctga agccagttac cttcggaaaa 840agagttggta gctcttgatc
cggcaaacaa accaccgctg gtagcggtgg tttttttgtt 900tgcaagcagc
agattacgcg cagaaaaaaa ggatctcaag aagatccttt gatcttttct
960acggggtctg acgctcagtg gaacgaaaac tcacgttaag ggattttggt
catgagatta 1020tcaaaaagga tcttcaccta gatcctttta aattaaaaat
gaagttttaa atcaatctaa 1080agtatatatg agtaaacttg gtctgacagt
taccaatgct taatcagtga ggcacctatc 1140tcagcgatct gtctatttcg
ttcatccata gttgcctgac tccccgtcgt gtagataact 1200acgatacggg
agggcttacc atctggcccc agtgctgcaa tgataccgcg agacccacgc
1260tcaccggctc cagatttatc agcaataaac cagccagccg gaagggccga
gcgcagaagt 1320ggtcctgcaa ctttatccgc ctccatccag tctattaatt
gttgccggga agctagagta 1380agtagttcgc cagttaatag tttgcgcaac
gttgttgcca ttgctacagg catcgtggtg 1440tcacgctcgt cgtttggtat
ggcttcattc agctccggtt cccaacgatc aaggcgagtt 1500acatgatccc
ccatgttgtg caaaaaagcg gttagctcct tcggtcctcc gatcgttgtc
1560agaagtaagt tggccgcagt gttatcactc atggttatgg cagcactgca
taattctctt 1620actgtcatgc catccgtaag atgcttttct gtgactggtg
agtactcaac caagtcattc 1680tgagaatagt gtatgcggcg accgagttgc
tcttgcccgg cgtcaatacg ggataatacc 1740gcgccacata gcagaacttt
aaaagtgctc atcattggaa aacgttcttc ggggcgaaaa 1800ctctcaagga
tcttaccgct gttgagatcc agttcgatgt aacccactcg tgcacccaac
1860tgatcttcag catcttttac tttcaccagc gtttctgggt gagcaaaaac
aggaaggcaa 1920aatgccgcaa aaaagggaat aagggcgaca cggaaatgtt
gaatactcat actcttcctt 1980tttcaatatt attgaagcat ttatcagggt
tattgtctca tgagcggata catatttgaa 2040tgtatttaga aaaataaaca
aataggggtt ccgcgcacat ttccccgaaa agtgccacct 2100gacgcgccct
gtagcggcgc attaagcgcg gcgggtgtgg tggttacgcg cagcgtgacc
2160gctacacttg ccagcgccct agcgcccgct cctttcgctt tcttcccttc
ctttctcgcc 2220acgttcgccg gctttccccg tcaagctcta aatcgggggc
tccctttagg gttccgattt 2280agtgctttac ggcacctcga ccccaaaaaa
cttgattagg gtgatggttc acgtagtggg 2340ccatcgccct gatagacggt
ttttcgccct ttgacgttgg agtccacgtt ctttaatagt 2400ggactcttgt
tccaaactgg aacaacactc aaccctatct cggtctattc ttttgattta
2460taagggattt tgccgatttc ggcctattgg ttaaaaaatg agctgattta
acaaaaattt 2520aacgcgaatt ttaacaaaat attaacgctt acaatttcca
ttcgccattc aggctgcgca 2580actgttggga agggcgatcg gtgcgggcct
cttcgctatt acgccagctg gcgaaagggg 2640gatgtgctgc aaggcgatta
agttgggtaa cgccagggtt ttcccagtca cgacgttgta 2700aaacgacggc
cagtgaattg taatacgact cactataggg cgaattgggt accgggcccc
2760ccctcgaggt cgatggtgtc gataagcttg atatcgaatt catgtcacac
aaaccgatct 2820tcgcctcaag gaaacctaat tctacatccg agagactgcc
gagatccagt ctacactgat 2880taattttcgg gccaataatt taaaaaaatc
gtgttatata atattatatg tattatatat 2940atacatcatg atgatactga
cagtcatgtc ccattgctaa atagacagac tccatctgcc 3000gcctccaact
gatgttctca atatttaagg ggtcatctcg cattgtttaa taataaacag
3060actccatcta ccgcctccaa atgatgttct caaaatatat tgtatgaact
tatttttatt 3120acttagtatt attagacaac ttacttgctt tatgaaaaac
acttcctatt taggaaacaa 3180tttataatgg cagttcgttc atttaacaat
ttatgtagaa taaatgttat aaatgcgtat 3240gggaaatctt aaatatggat
agcataaatg atatctgcat tgcctaattc gaaatcaaca 3300gcaacgaaaa
aaatcccttg tacaacataa atagtcatcg agaaatatca actatcaaag
3360aacagctatt cacacgttac tattgagatt attattggac gagaatcaca
cactcaactg 3420tctttctctc ttctagaaat acaggtacaa gtatgtacta
ttctcattgt tcatacttct 3480agtcatttca tcccacatat tccttggatt
tctctccaat gaatgacatt ctatcttgca 3540aattcaacaa ttataataag
atataccaaa gtagcggtat agtggcaatc aaaaagcttc 3600tctggtgtgc
ttctcgtatt tatttttatt ctaatgatcc attaaaggta tatatttatt
3660tcttgttata taatcctttt gtttattaca tgggctggat acataaaggt
attttgattt 3720aattttttgc ttaaattcaa tcccccctcg ttcagtgtca
actgtaatgg taggaaatta 3780ccatactttt gaagaagcaa aaaaaatgaa
agaaaaaaaa aatcgtattt ccaggttaga 3840cgttccgcag aatctagaat
gcggtatgcg gtacattgtt cttcgaacgt aaaagttgcg 3900ctccctgaga
tattgtacat ttttgctttt acaagtacaa gtacatcgta caactatgta
3960ctactgttga tgcatccaca acagtttgtt ttgttttttt ttgttttttt
tttttctaat 4020gattcattac cgctatgtat acctacttgt acttgtagta
agccgggtta ttggcgttca 4080attaatcata gacttatgaa tctgcacggt
gtgcgctgcg agttactttt agcttatgca 4140tgctacttgg gtgtaatatt
gggatctgtt cggaaatcaa cggatgctca atcgatttcg 4200acagtaatta
attaagtcat acacaagtca gctttcttcg agcctcatat aagtataagt
4260agttcaacgt attagcactg tacccagcat ctccgtatcg agaaacacaa
caacatgccc 4320cattggacag atcatgcgga tacacaggtt gtgcagtatc
atacatactc gatcagacag 4380gtcgtctgac catcatacaa gctgaacaag
cgctccatac ttgcacgctc tctatataca 4440cagttaaatt acatatccat
agtctaacct ctaacagtta atcttctggt aagcctccca 4500gccagccttc
tggtatcgct tggcctcctc aataggatct cggttctggc cgtacagacc
4560tcggccgaca attatgatat ccgttccggt agacatgaca tcctcaacag
ttcggtactg 4620ctgtccgaga gcgtctccct tgtcgtcaag acccaccccg
ggggtcagaa taagccagtc 4680ctcagagtcg cccttaggtc ggttctgggc
aatgaagcca accacaaact cggggtcgga 4740tcgggcaagc tcaatggtct
gcttggagta ctcgccagtg gccagagagc ccttgcaaga 4800cagctcggcc
agcatgagca gacctctggc cagcttctcg ttgggagagg ggactaggaa
4860ctccttgtac tgggagttct cgtagtcaga gacgtcctcc ttcttctgtt
cagagacagt 4920ttcctcggca ccagctcgca ggccagcaat gattccggtt
ccgggtacac cgtgggcgtt 4980ggtgatatcg gaccactcgg cgattcggtg
acaccggtac tggtgcttga cagtgttgcc 5040aatatctgcg aactttctgt
cctcgaacag gaagaaaccg tgcttaagag caagttcctt 5100gagggggagc
acagtgccgg cgtaggtgaa gtcgtcaatg atgtcgatat gggttttgat
5160catgcacaca taaggtccga ccttatcggc aagctcaatg agctccttgg
tggtggtaac 5220atccagagaa gcacacaggt tggttttctt ggctgccacg
agcttgagca ctcgagcggc 5280aaaggcggac ttgtggacgt tagctcgagc
ttcgtaggag ggcattttgg tggtgaagag 5340gagactgaaa taaatttagt
ctgcagaact ttttatcgga accttatctg gggcagtgaa 5400gtatatgtta
tggtaatagt tacgagttag ttgaacttat agatagactg gactatacgg
5460ctatcggtcc aaattagaaa gaacgtcaat ggctctctgg gcgtcgcctt
tgccgacaaa 5520aatgtgatca tgatgaaagc cagcaatgac gttgcagctg
atattgttgt cggccaaccg 5580cgccgaaaac gcagctgtca gacccacagc
ctccaacgaa gaatgtatcg tcaaagtgat 5640ccaagcacac tcatagttgg
agtcgtactc caaaggcggc aatgacgagt cagacagata 5700ctcgtcgaca
ttgtaactag tcctggaggg tcttttttat ggataacctc catgtacgat
5760gtatccaaga tctccacgta ctgtgttctg tttcctaagt aatacccaac
aacctctcca 5820acaaacactt gggaagatgc acttgtgctg agatgtcaag
atgttagtac tgtactggat 5880ggagagaata ttaataaata attgttaccc
aactacatct tgtcgattga aagagatacc 5940cctaagacag ataggatatc
tgcaacccga ggaatgaacc ccccagcacc ggcacccttt 6000ctattaacaa
aatgccaact gaaatttgaa aagttcaact aaacttattt gacccacaaa
6060aactcgtcaa aagtggcggc gaaagctggc aaatgatgac atccccttgg
aactatgata 6120tcccctcgga atcttcgtcc ccatttgcca catctacttg
caacgccacg tctgcttact 6180aagcaaccca aatctgcctc ggctcaaaat
gtggggaagt tcacatgcat tcgctggtga 6240atctgatctg acactacaac
tacacaccag gtccaacatg agcgacaata cgacaatcaa 6300aaagccgatc
cgacccaaac cgatccggac ggaacgcctg ccttacgctg gggccgcaga
6360aatcatccga
gccaaccaga aagaccacta ctttgagtcc gtgcttgaac agcatctcgt
6420cacgtttctg cagaaatgga agggagtacg atttatccac cagtacaagg
aggagctgga 6480gacggcgtcc aagtttgcat atctcggttt gtgtacgctt
gtgggctcca agactctcgg 6540agaagagtac accaatctca tgtacactat
cagagaccga acagctctac cgggggtggt 6600gagacggttt ggctacgtgc
tttccaacac tctgtttcca tacctgtttg tgcgctacat 6660gggcaagttg
cgcgccaaac tgatgcgcga gtatccccat ctggtggagt acgacgaaga
6720tgagcctgtg cccagcccgg aaacatggaa ggagcgggtc atcaagacgt
ttgtgaacaa 6780gtttgacaag ttcacggcgc tggaggggtt taccgcgatc
cacttggcga ttttctacgt 6840ctacggctcg tactaccagc tcagtaagcg
gatctggggc atgcgttatg tatttggaca 6900ccgactggac aagaatgagc
ctcgaatcgg ttacgagatg ctcggtctgc tgattttcgc 6960ccggtttgcc
acgtcatttg tgcagacggg aagagagtac ctcggagcgc tgctggaaaa
7020gagcgtggag aaagaggcag gggagaagga agatgaaaag gaagcggttg
tgccgaaaaa 7080gaagtcgtca attccgttca ttgaggatac agaaggggag
acggaagaca agatcgatct 7140ggaggaccct cgacagctca agttcattcc
tgaggcgtcc agagcgtgca ctctgtgtct 7200gtcatacatt agtgcgccgg
catgtacgcc atgtggacac tttttctgtt gggactgtat 7260ttccgaatgg
gtgagagaga agcccgagtg tcccttgtgt cggcagggtg tgagagagca
7320gaacttgttg cctatcagat aatgacgagg tctggatgga aggactagtc
agcgagacac 7380agagcatcag ggaccagaca cgaccaattc aatcgacaac
actgtgctgc atagcagtgc 7440acagaggtcc tgggcatgaa tatattttag
cattggagat atgagtggta gagcgtatac 7500agtattaatt gtggaggtat
ctcgtcgcat tgatagagca atacagttac tgctgaagc 7559538051DNAArtificial
SequencePlasmid pPEX10-2 53gtacgagccg gaagcataaa gtgtaaagcc
tggggtgcct aatgagtgag ctaactcaca 60ttaattgcgt tgcgctcact gcccgctttc
cagtcgggaa acctgtcgtg ccagctgcat 120taatgaatcg gccaacgcgc
ggggagaggc ggtttgcgta ttgggcgctc ttccgcttcc 180tcgctcactg
actcgctgcg ctcggtcgtt cggctgcggc gagcggtatc agctcactca
240aaggcggtaa tacggttatc cacagaatca ggggataacg caggaaagaa
catgtgagca 300aaaggccagc aaaaggccag gaaccgtaaa aaggccgcgt
tgctggcgtt tttccatagg 360ctccgccccc ctgacgagca tcacaaaaat
cgacgctcaa gtcagaggtg gcgaaacccg 420acaggactat aaagatacca
ggcgtttccc cctggaagct ccctcgtgcg ctctcctgtt 480ccgaccctgc
cgcttaccgg atacctgtcc gcctttctcc cttcgggaag cgtggcgctt
540tctcatagct cacgctgtag gtatctcagt tcggtgtagg tcgttcgctc
caagctgggc 600tgtgtgcacg aaccccccgt tcagcccgac cgctgcgcct
tatccggtaa ctatcgtctt 660gagtccaacc cggtaagaca cgacttatcg
ccactggcag cagccactgg taacaggatt 720agcagagcga ggtatgtagg
cggtgctaca gagttcttga agtggtggcc taactacggc 780tacactagaa
ggacagtatt tggtatctgc gctctgctga agccagttac cttcggaaaa
840agagttggta gctcttgatc cggcaaacaa accaccgctg gtagcggtgg
tttttttgtt 900tgcaagcagc agattacgcg cagaaaaaaa ggatctcaag
aagatccttt gatcttttct 960acggggtctg acgctcagtg gaacgaaaac
tcacgttaag ggattttggt catgagatta 1020tcaaaaagga tcttcaccta
gatcctttta aattaaaaat gaagttttaa atcaatctaa 1080agtatatatg
agtaaacttg gtctgacagt taccaatgct taatcagtga ggcacctatc
1140tcagcgatct gtctatttcg ttcatccata gttgcctgac tccccgtcgt
gtagataact 1200acgatacggg agggcttacc atctggcccc agtgctgcaa
tgataccgcg agacccacgc 1260tcaccggctc cagatttatc agcaataaac
cagccagccg gaagggccga gcgcagaagt 1320ggtcctgcaa ctttatccgc
ctccatccag tctattaatt gttgccggga agctagagta 1380agtagttcgc
cagttaatag tttgcgcaac gttgttgcca ttgctacagg catcgtggtg
1440tcacgctcgt cgtttggtat ggcttcattc agctccggtt cccaacgatc
aaggcgagtt 1500acatgatccc ccatgttgtg caaaaaagcg gttagctcct
tcggtcctcc gatcgttgtc 1560agaagtaagt tggccgcagt gttatcactc
atggttatgg cagcactgca taattctctt 1620actgtcatgc catccgtaag
atgcttttct gtgactggtg agtactcaac caagtcattc 1680tgagaatagt
gtatgcggcg accgagttgc tcttgcccgg cgtcaatacg ggataatacc
1740gcgccacata gcagaacttt aaaagtgctc atcattggaa aacgttcttc
ggggcgaaaa 1800ctctcaagga tcttaccgct gttgagatcc agttcgatgt
aacccactcg tgcacccaac 1860tgatcttcag catcttttac tttcaccagc
gtttctgggt gagcaaaaac aggaaggcaa 1920aatgccgcaa aaaagggaat
aagggcgaca cggaaatgtt gaatactcat actcttcctt 1980tttcaatatt
attgaagcat ttatcagggt tattgtctca tgagcggata catatttgaa
2040tgtatttaga aaaataaaca aataggggtt ccgcgcacat ttccccgaaa
agtgccacct 2100gacgcgccct gtagcggcgc attaagcgcg gcgggtgtgg
tggttacgcg cagcgtgacc 2160gctacacttg ccagcgccct agcgcccgct
cctttcgctt tcttcccttc ctttctcgcc 2220acgttcgccg gctttccccg
tcaagctcta aatcgggggc tccctttagg gttccgattt 2280agtgctttac
ggcacctcga ccccaaaaaa cttgattagg gtgatggttc acgtagtggg
2340ccatcgccct gatagacggt ttttcgccct ttgacgttgg agtccacgtt
ctttaatagt 2400ggactcttgt tccaaactgg aacaacactc aaccctatct
cggtctattc ttttgattta 2460taagggattt tgccgatttc ggcctattgg
ttaaaaaatg agctgattta acaaaaattt 2520aacgcgaatt ttaacaaaat
attaacgctt acaatttcca ttcgccattc aggctgcgca 2580actgttggga
agggcgatcg gtgcgggcct cttcgctatt acgccagctg gcgaaagggg
2640gatgtgctgc aaggcgatta agttgggtaa cgccagggtt ttcccagtca
cgacgttgta 2700aaacgacggc cagtgaattg taatacgact cactataggg
cgaattgggt accgggcccc 2760ccctcgaggt cgatggtgtc gataagcttg
atatcgaatt catgtcacac aaaccgatct 2820tcgcctcaag gaaacctaat
tctacatccg agagactgcc gagatccagt ctacactgat 2880taattttcgg
gccaataatt taaaaaaatc gtgttatata atattatatg tattatatat
2940atacatcatg atgatactga cagtcatgtc ccattgctaa atagacagac
tccatctgcc 3000gcctccaact gatgttctca atatttaagg ggtcatctcg
cattgtttaa taataaacag 3060actccatcta ccgcctccaa atgatgttct
caaaatatat tgtatgaact tatttttatt 3120acttagtatt attagacaac
ttacttgctt tatgaaaaac acttcctatt taggaaacaa 3180tttataatgg
cagttcgttc atttaacaat ttatgtagaa taaatgttat aaatgcgtat
3240gggaaatctt aaatatggat agcataaatg atatctgcat tgcctaattc
gaaatcaaca 3300gcaacgaaaa aaatcccttg tacaacataa atagtcatcg
agaaatatca actatcaaag 3360aacagctatt cacacgttac tattgagatt
attattggac gagaatcaca cactcaactg 3420tctttctctc ttctagaaat
acaggtacaa gtatgtacta ttctcattgt tcatacttct 3480agtcatttca
tcccacatat tccttggatt tctctccaat gaatgacatt ctatcttgca
3540aattcaacaa ttataataag atataccaaa gtagcggtat agtggcaatc
aaaaagcttc 3600tctggtgtgc ttctcgtatt tatttttatt ctaatgatcc
attaaaggta tatatttatt 3660tcttgttata taatcctttt gtttattaca
tgggctggat acataaaggt attttgattt 3720aattttttgc ttaaattcaa
tcccccctcg ttcagtgtca actgtaatgg taggaaatta 3780ccatactttt
gaagaagcaa aaaaaatgaa agaaaaaaaa aatcgtattt ccaggttaga
3840cgttccgcag aatctagaat gcggtatgcg gtacattgtt cttcgaacgt
aaaagttgcg 3900ctccctgaga tattgtacat ttttgctttt acaagtacaa
gtacatcgta caactatgta 3960ctactgttga tgcatccaca acagtttgtt
ttgttttttt ttgttttttt tttttctaat 4020gattcattac cgctatgtat
acctacttgt acttgtagta agccgggtta ttggcgttca 4080attaatcata
gacttatgaa tctgcacggt gtgcgctgcg agttactttt agcttatgca
4140tgctacttgg gtgtaatatt gggatctgtt cggaaatcaa cggatgctca
atcgatttcg 4200acagtaatta attaagtcat acacaagtca gctttcttcg
agcctcatat aagtataagt 4260agttcaacgt attagcactg tacccagcat
ctccgtatcg agaaacacaa caacatgccc 4320cattggacag atcatgcgga
tacacaggtt gtgcagtatc atacatactc gatcagacag 4380gtcgtctgac
catcatacaa gctgaacaag cgctccatac ttgcacgctc tctatataca
4440cagttaaatt acatatccat agtctaacct ctaacagtta atcttctggt
aagcctccca 4500gccagccttc tggtatcgct tggcctcctc aataggatct
cggttctggc cgtacagacc 4560tcggccgaca attatgatat ccgttccggt
agacatgaca tcctcaacag ttcggtactg 4620ctgtccgaga gcgtctccct
tgtcgtcaag acccaccccg ggggtcagaa taagccagtc 4680ctcagagtcg
cccttaggtc ggttctgggc aatgaagcca accacaaact cggggtcgga
4740tcgggcaagc tcaatggtct gcttggagta ctcgccagtg gccagagagc
ccttgcaaga 4800cagctcggcc agcatgagca gacctctggc cagcttctcg
ttgggagagg ggactaggaa 4860ctccttgtac tgggagttct cgtagtcaga
gacgtcctcc ttcttctgtt cagagacagt 4920ttcctcggca ccagctcgca
ggccagcaat gattccggtt ccgggtacac cgtgggcgtt 4980ggtgatatcg
gaccactcgg cgattcggtg acaccggtac tggtgcttga cagtgttgcc
5040aatatctgcg aactttctgt cctcgaacag gaagaaaccg tgcttaagag
caagttcctt 5100gagggggagc acagtgccgg cgtaggtgaa gtcgtcaatg
atgtcgatat gggttttgat 5160catgcacaca taaggtccga ccttatcggc
aagctcaatg agctccttgg tggtggtaac 5220atccagagaa gcacacaggt
tggttttctt ggctgccacg agcttgagca ctcgagcggc 5280aaaggcggac
ttgtggacgt tagctcgagc ttcgtaggag ggcattttgg tggtgaagag
5340gagactgaaa taaatttagt ctgcagaact ttttatcgga accttatctg
gggcagtgaa 5400gtatatgtta tggtaatagt tacgagttag ttgaacttat
agatagactg gactatacgg 5460ctatcggtcc aaattagaaa gaacgtcaat
ggctctctgg gcgtcgcctt tgccgacaaa 5520aatgtgatca tgatgaaagc
cagcaatgac gttgcagctg atattgttgt cggccaaccg 5580cgccgaaaac
gcagctgtca gacccacagc ctccaacgaa gaatgtatcg tcaaagtgat
5640ccaagcacac tcatagttgg agtcgtactc caaaggcggc aatgacgagt
cagacagata 5700ctcgtcgacg tcttagcgtc atgtattctc aagcttagtc
agagagaagg actatggagg 5760agaaggggag aattgagaag ggtatttgaa
gggactttga aggtcgcgtg gaagaggtac 5820ttgaagaggt atttgaaggt
cacgtggaag aggtatttga agatcacgtg gaagaagtac 5880ttgttttaca
gagaatatcg gggtgatttt gacagtggga ttgtctccca agtcctaatc
5940gtttgacatg ggagcagtga aaagtcgggc taaaaaaggg aatatcggaa
atcggaaaga 6000cggaaagaat tactggactc atgtttagta gatctgagca
cttcaaattt gaaaatatct 6060cttcaaacag cagatcggtt ggtcgtggag
gtaccatcaa gggtaaaatc aaggctatca 6120tcaagggcca tatatcgcaa
gtttggggga agataatatg ttcatagtga atcagggttg 6180tggatttcct
catctaacgg cattgtaact agtcctggag ggtctttttt atggataacc
6240tccatgtacg atgtatccaa gatctccacg tactgtgttc tgtttcctaa
gtaataccca 6300acaacctctc caacaaacac ttgggaagat gcacttgtgc
tgagatgtca agatgttagt 6360actgtactgg atggagagaa tattaataaa
taattgttac ccaactacat cttgtcgatt 6420gaaagagata cccctaagac
agataggata tctgcaaccc gaggaatgaa ccccccagca 6480ccggcaccct
ttctattaac aaaatgccaa ctgaaatttg aaaagttcaa ctaaacttat
6540ttgacccaca aaaactcgtc aaaagtggcg gcgaaagctg gcaaatgatg
acatcccctt 6600ggaactatga tatcccctcg gaatcttcgt ccccatttgc
cacatctact tgcaacgcca 6660cgtctgctta ctaagcaacc caaatctgcc
tcggctcaaa atgtggggaa gttcacatgc 6720attcgctggt gaatctgatc
tgacactaca actacacacc aggtccaaca tgagcgacaa 6780tacgacaatc
aaaaagccga tccgacccaa accgatccgg acggaacgcc tgccttacgc
6840tggggccgca gaaatcatcc gagccaacca gaaagaccac tactttgagt
ccgtgcttga 6900acagcatctc gtcacgtttc tgcagaaatg gaagggagta
cgatttatcc accagtacaa 6960ggaggagctg gagacggcgt ccaagtttgc
atatctcggt ttgtgtacgc ttgtgggctc 7020caagactctc ggagaagagt
acaccaatct catgtacact atcagagacc gaacagctct 7080accgggggtg
gtgagacggt ttggctacgt gctttccaac actctgtttc catacctgtt
7140tgtgcgctac atgggcaagt tgcgcgccaa actgatgcgc gagtatcccc
atctggtgga 7200gtacgacgaa gatgagcctg tgcccagccc ggaaacatgg
aaggagcggg tcatcaagac 7260gtttgtgaac aagtttgaca agttcacggc
gctggagggg tttaccgcga tccacttggc 7320gattttctac gtctacggct
cgtactacca gctcagtaag cggatctggg gcatgcgtta 7380tgtatttgga
caccgactgg acaagaatga gcctcgaatc ggttacgaga tgctcggtct
7440gctgattttc gcccggtttg ccacgtcatt tgtgcagacg ggaagagagt
acctcggagc 7500gctgctggaa aagagcgtgg agaaagaggc aggggagaag
gaagatgaaa aggaagcggt 7560tgtgccgaaa aagaagtcgt caattccgtt
cattgaggat acagaagggg agacggaaga 7620caagatcgat ctggaggacc
ctcgacagct caagttcatt cctgaggcgt ccagagcgtg 7680cactctgtgt
ctgtcataca ttagtgcgcc ggcatgtacg ccatgtggac actttttctg
7740ttgggactgt atttccgaat gggtgagaga gaagcccgag tgtcccttgt
gtcggcaggg 7800tgtgagagag cagaacttgt tgcctatcag ataatgacga
ggtctggatg gaaggactag 7860tcagcgagac acagagcatc agggaccaga
cacgaccaat tcaatcgaca acactgtgct 7920gcatagcagt gcacagaggt
cctgggcatg aatatatttt agcattggag atatgagtgg 7980tagagcgtat
acagtattaa ttgtggaggt atctcgtcgc attgatagag caatacagtt
8040actgctgaag c 80515415877DNAArtificial SequencePlasmid
pZKL1-2SP98C 54aaatgatgtc gacgcagtag gatgtcctgc acgggtcttt
ttgtggggtg tggagaaagg 60ggtgcttgga tcgatggaag ccggtagaac cgggctgctt
gtgcttggag atggaagccg 120gtagaaccgg gctgcttggg gggatttggg
gccgctgggc tccaaagagg ggtaggcatt 180tcgttggggt tacgtaattg
cggcatttgg gtcctgcgcg catgtcccat tggtcagaat 240tagtccggat
aggagactta tcagccaatc acagcgccgg atccacctgt aggttgggtt
300gggtgggagc acccctccac agagtagagt caaacagcag cagcaacatg
atagttgggg 360gtgtgcgtgt taaaggaaaa aaaagaagct tgggttatat
tcccgctcta tttagaggtt 420gcgggataga cgccgacgga gggcaatggc
gctatggaac cttgcggata tccatacgcc 480gcggcggact gcgtccgaac
cagctccagc agcgtttttt ccgggccatt gagccgactg 540cgaccccgcc
aacgtgtctt ggcccacgca ctcatgtcat gttggtgttg ggaggccact
600ttttaagtag cacaaggcac ctagctcgca gcaaggtgtc cgaaccaaag
aagcggctgc 660agtggtgcaa acggggcgga aacggcggga aaaagccacg
ggggcacgaa ttgaggcacg 720ccctcgaatt tgagacgagt cacggcccca
ttcgcccgcg caatggctcg ccaacgcccg 780gtcttttgca ccacatcagg
ttaccccaag ccaaaccttt gtgttaaaaa gcttaacata 840ttataccgaa
cgtaggtttg ggcgggcttg ctccgtctgt ccaaggcaac atttatataa
900gggtctgcat cgccggctca attgaatctt ttttcttctt ctcttctcta
tattcattct 960tgaattaaac acacatcaac catgggcgta ttcattaaac
aggagcagct tccggctctc 1020aagaagtaca agtactccgc cgaggatcac
tcgttcatct ccaacaacat tctgcgcccc 1080ttctggcgac agtttgtcaa
aatcttccct ctgtggatgg cccccaacat ggtgactctg 1140ctgggcttct
tctttgtcat tgtgaacttc atcaccatgc tcattgttga tcccacccac
1200gaccgcgagc ctcccagatg ggtctacctc acctacgctc tgggtctgtt
cctttaccag 1260acatttgatg cctgtgacgg atcccatgcc cgacgaactg
gccagagtgg accccttgga 1320gagctgtttg accactgtgt cgacgccatg
aatacctctc tgattctcac ggtggtggtg 1380tccaccaccc atatgggata
taacatgaag ctactgattg tgcagattgc cgctctcgga 1440aacttctacc
tgtcgacctg ggagacctac cataccggaa ctctgtacct ttctggcttc
1500tctggtcctg ttgaaggtat cttgattctg gtggctcttt tcgtcctcac
cttcttcact 1560ggtcccaacg tgtacgctct gaccgtctac gaggctcttc
ccgagtccat cacttcgctg 1620ctgcctgcca gcttcctgga cgtcaccatc
acccagatct acattggatt cggagtgctg 1680ggcatggtgt tcaacatcta
cggcgcctgc ggaaacgtga tcaagtacta caacaacaag 1740ggcaagagcg
ctctccccgc cattctcgga atcgccccct ttggcatctt ctacgtcggc
1800gtctttgcct gggcccatgt tgctcctctg cttctctcca agtacgccat
cgtctatctg 1860tttgccattg gggctgcctt tgccatgcaa gtcggccaga
tgattcttgc ccatctcgtg 1920cttgctccct ttccccactg gaacgtgctg
ctcttcttcc cctttgtggg actggcagtg 1980cactacattg cacccgtgtt
tggctgggac gccgatatcg tgtcggttaa cactctcttc 2040acctgttttg
gcgccaccct ctccatttac gccttctttg tgcttgagat catcgacgag
2100atcaccaact acctcgatat ctggtgtctg cgaatcaagt accctcagga
gaagaagacc 2160gaataagcgg ccgcatggag cgtgtgttct gagtcgatgt
tttctatgga gttgtgagtg 2220ttagtagaca tgatgggttt atatatgatg
aatgaataga tgtgattttg atttgcacga 2280tggaattgag aactttgtaa
acgtacatgg gaatgtatga atgtgggggt tttgtgactg 2340gataactgac
ggtcagtgga cgccgttgtt caaatatcca agagatgcga gaaactttgg
2400gtcaagtgaa catgtcctct ctgttcaagt aaaccatcaa ctatgggtag
tatatttagt 2460aaggacaaga gttgagattc tttggagtcc tagaaacgta
ttttcgcgtt ccaagatcaa 2520attagtagag taatacgggc acgggaatcc
attcatagtc tcaatcctgc aggtgagtta 2580attaatcgag cttggcgtaa
tcatggtcat agctgtttcc tgtgtgaaat tgttatccgc 2640tcacaattcc
acacaacgta cgatagttag tagacaacaa tcagaacatc tccctcctta
2700tataatcaca caggccagaa cgcgctaaac taaagcgctt tggacactat
gttacattgg 2760cattgattga actgaaacca cagtctccct cgcctgaatc
gagcaatgga tgttgtcgga 2820agtcaacttc actagaagag cggttctatg
ccttgtcaag atcatatcat aaactcactc 2880tgtattaccc catctataga
acacttgtta tgaatgggcg gaaacattcc gctatatgca 2940cctttccaca
ctaatgcaaa gatgtgcatc ttcaacgggt agtaagactg gttccgactt
3000ccgttgcatg gagagcaatg acctcgataa tgcgaacatc ccccacatat
acactcttac 3060acaggccaat ataatctgtg catttactaa atatttaagt
ctatgcacct gcttgatgaa 3120aagcggcacg gatggtatca tctagtttcc
gccaatccaa gaaccaactg tgttggcagt 3180ggtgtagccc atggcacaca
gaccaaagat gaaaatacag acatcggcgg ttcgagccgt 3240ggtgcctcga
gcaacaccct tgtaatgcaa aagaggaggg taaatgtaca ccagaggcac
3300acatgcaaac gatccggtga gagcgacgaa ccgatcgaga tcgtcggcac
ctccccatgc 3360aacaaaggcg gtgacaaaca caaggaagaa ccggaaaatg
ttcttctgcc acttgatggt 3420agagttgtac ttgcctgatc gggtgaagag
accattctcg atgattcgga tggcgcgcca 3480gctgcattaa tgaatcggcc
aacgcgcggg gagaggcggt ttgcgtattg ggcgctcttc 3540cgcttcctcg
ctcactgact cgctgcgctc ggtcgttcgg ctgcggcgag cggtatcagc
3600tcactcaaag gcggtaatac ggttatccac agaatcaggg gataacgcag
gaaagaacat 3660gtgagcaaaa ggccagcaaa aggccaggaa ccgtaaaaag
gccgcgttgc tggcgttttt 3720ccataggctc cgcccccctg acgagcatca
caaaaatcga cgctcaagtc agaggtggcg 3780aaacccgaca ggactataaa
gataccaggc gtttccccct ggaagctccc tcgtgcgctc 3840tcctgttccg
accctgccgc ttaccggata cctgtccgcc tttctccctt cgggaagcgt
3900ggcgctttct catagctcac gctgtaggta tctcagttcg gtgtaggtcg
ttcgctccaa 3960gctgggctgt gtgcacgaac cccccgttca gcccgaccgc
tgcgccttat ccggtaacta 4020tcgtcttgag tccaacccgg taagacacga
cttatcgcca ctggcagcag ccactggtaa 4080caggattagc agagcgaggt
atgtaggcgg tgctacagag ttcttgaagt ggtggcctaa 4140ctacggctac
actagaagaa cagtatttgg tatctgcgct ctgctgaagc cagttacctt
4200cggaaaaaga gttggtagct cttgatccgg caaacaaacc accgctggta
gcggtggttt 4260ttttgtttgc aagcagcaga ttacgcgcag aaaaaaagga
tctcaagaag atcctttgat 4320cttttctacg gggtctgacg ctcagtggaa
cgaaaactca cgttaaggga ttttggtcat 4380gagattatca aaaaggatct
tcacctagat ccttttaaat taaaaatgaa gttttaaatc 4440aatctaaagt
atatatgagt aaacttggtc tgacagttac caatgcttaa tcagtgaggc
4500acctatctca gcgatctgtc tatttcgttc atccatagtt gcctgactcc
ccgtcgtgta 4560gataactacg atacgggagg gcttaccatc tggccccagt
gctgcaatga taccgcgaga 4620cccacgctca ccggctccag atttatcagc
aataaaccag ccagccggaa gggccgagcg 4680cagaagtggt cctgcaactt
tatccgcctc catccagtct attaattgtt gccgggaagc 4740tagagtaagt
agttcgccag ttaatagttt gcgcaacgtt gttgccattg ctacaggcat
4800cgtggtgtca cgctcgtcgt ttggtatggc ttcattcagc tccggttccc
aacgatcaag 4860gcgagttaca tgatccccca tgttgtgcaa aaaagcggtt
agctccttcg gtcctccgat 4920cgttgtcaga agtaagttgg ccgcagtgtt
atcactcatg gttatggcag cactgcataa 4980ttctcttact gtcatgccat
ccgtaagatg cttttctgtg actggtgagt actcaaccaa 5040gtcattctga
gaatagtgta tgcggcgacc gagttgctct tgcccggcgt caatacggga
5100taataccgcg ccacatagca gaactttaaa agtgctcatc attggaaaac
gttcttcggg 5160gcgaaaactc tcaaggatct taccgctgtt gagatccagt
tcgatgtaac ccactcgtgc 5220acccaactga tcttcagcat cttttacttt
caccagcgtt tctgggtgag caaaaacagg 5280aaggcaaaat gccgcaaaaa
agggaataag ggcgacacgg aaatgttgaa tactcatact 5340cttccttttt
caatattatt gaagcattta tcagggttat tgtctcatga gcggatacat
5400atttgaatgt atttagaaaa ataaacaaat aggggttccg cgcacatttc
cccgaaaagt 5460gccacctgat gcggtgtgaa ataccgcaca gatgcgtaag
gagaaaatac cgcatcagga 5520aattgtaagc gttaatattt tgttaaaatt
cgcgttaaat ttttgttaaa tcagctcatt 5580ttttaaccaa taggccgaaa
tcggcaaaat cccttataaa tcaaaagaat agaccgagat 5640agggttgagt
gttgttccag tttggaacaa
gagtccacta ttaaagaacg tggactccaa 5700cgtcaaaggg cgaaaaaccg
tctatcaggg cgatggccca ctacgtgaac catcacccta 5760atcaagtttt
ttggggtcga ggtgccgtaa agcactaaat cggaacccta aagggagccc
5820ccgatttaga gcttgacggg gaaagccggc gaacgtggcg agaaaggaag
ggaagaaagc 5880gaaaggagcg ggcgctaggg cgctggcaag tgtagcggtc
acgctgcgcg taaccaccac 5940acccgccgcg cttaatgcgc cgctacaggg
cgcgtccatt cgccattcag gctgcgcaac 6000tgttgggaag ggcgatcggt
gcgggcctct tcgctattac gccagctggc gaaaggggga 6060tgtgctgcaa
ggcgattaag ttgggtaacg ccagggtttt cccagtcacg acgttgtaaa
6120acgacggcca gtgaattgta atacgactca ctatagggcg aattgggccc
gacgtcgcat 6180gcttagaagt gaggattaca agaagcctct ggatatcaat
gatgaacgta ctcagcggct 6240ggtcaagcat ttcgaccgtc gaatcgacga
ggtgttcacc tttgacaagc gagggttccc 6300aattgatcac gttctcgagt
tgttcaaatc ttctctcaac atctctctgc atgaactatc 6360tctgttgacg
aacgtgtcac ccactgttcc tcgaacgccc ttctccgagt ttggtctgaa
6420catcttcgat ctcaaactga cccccgcagt gatcaatagt gccatgccac
tgccgatgcg 6480gtgcgaacat ccctggaggg attctcggag ctctacacaa
tgcagattct gtcgtcgagt 6540actctctacc ttgctcgaat gacttattgt
gctactactg cactcatgct tcgatcatgt 6600gccctactgc accccaaatt
tggtgatctg attgagacag agtaccctct tcagctgatt 6660cagaagatca
tcagcaacat gaatgatgtg gttgaccagg caggctgttg tagtcacgtc
6720cttcacttca agttcattct tcatctgctt ctgttttact ttgacaggca
aatgaagaca 6780tggtacgact tgatggaggc caagaacgcc atttcacccc
gagacaccga agtgcctgaa 6840atcctggctg cccccattga taacatcgga
aactacggta ttccggaaag tgtatataga 6900acctttcccc agcttgtgtc
tgtggatatg gatggtgtaa tccccttaat taactcacct 6960gcaggattga
gactatgaat ggattcccgt gcccgtatta ctctactaat ttgatcttgg
7020aacgcgaaaa tacgtttcta ggactccaaa gaatctcaac tcttgtcctt
actaaatata 7080ctacccatag ttgatggttt acttgaacag agaggacatg
ttcacttgac ccaaagtttc 7140tcgcatctct tggatatttg aacaacggcg
tccactgacc gtcagttatc cagtcacaaa 7200acccccacat tcatacattc
ccatgtacgt ttacaaagtt ctcaattcca tcgtgcaaat 7260caaaatcaca
tctattcatt catcatatat aaacccatca tgtctactaa cactcacaac
7320tccatagaaa acatcgactc agaacacacg ctccatgcgg ccgcttaggc
aacgggcttg 7380atgacagcgg gaggagtgcc cacattgttt cggtttcgaa
agaacaggac acccttgcca 7440gctccctcgg caccagcgga gggttcaacc
cactggcaca ttcgtgcaga tcggtacatg 7500gctcgaatga atcctcgagg
accgtcctgg acatcagctc gatagtgctt gcccatgata 7560ggtttgatgg
cctcggtagc ttcgtccgca ttgtagaagg gaatggaaga gacgtagtga
7620tgcaggacgt gagtctcgat aatgccgtgg agcagatgac gtccaatgaa
gcccatctct 7680cggtcgatgg ttgcagcggc acctcgcaca aagttccact
cgtcgttggt gtagtgggga 7740agagtaggat ctgtgtgctg cagaaaggta
atggcgacga gccagtggtt aacccacaag 7800tagggaacga agtaccagat
ggccatgttg tagaatccga acttctgaac gagaaagtac 7860agagcggtgg
ccataagacc aatgccaatg tcggagagca cgatgagctt ggcgtcgctg
7920ttctcgtaca gaggagatcg gggatcgaaa tggttaactc caccgccaag
accgttgtgc 7980tttcccttgc ctcgaccctc tcgctgccgc tcatggtagt
tgtgtccagt aacgttggta 8040atgagatagt tgggccaacc gaccagttgc
tgaagcacaa gcatgagcag ggtgaaagca 8100ggagtttcct cggtaagatg
ggcgagttcg tgggtcatct tgccgagtcg agtagcttgc 8160tgctctcggg
ttcgaggaac gaagaccatg tctcgctcca tgtttccagt ggccttgtga
8220tgcttccggt gggagatttg ccagctgaag tagggaacaa gcagggaaga
gtgaagcacc 8280cagccagtaa tgtcgttgat gattcgggaa tcggagaaag
caccatgtcc acactcgtgg 8340gcaatgaccc acagtccagt accgaagagt
ccctgaagaa cggtgtacac agcccacaga 8400ccggctcgag caggagtgga
gggaatgtac tcgggtgtca caaagttgta ccagatgctg 8460aaagtggtag
tcaggaggac aatgtctcga agaatgtagc cgtatccctt gagagcagat
8520cgcttgaagc agtgcttggg aatagcgttg tagatgtcct tgatggtgaa
gtcgggaact 8580tcgaactggt tgccgtaggt atccagcatg acaccgtact
cggacttggg cttggcaatg 8640tccacctcgg acatggaaga cagcgatgta
gaggaggccg agtgtctggg agaatcggag 8700ggagagacgg cagcagactc
cgagtcggtc acagtggtgg aagtgacggt tcgtcggagg 8760gcagggttct
gcttgggcag agccgaggtg gaggccatgg ccattgctgt agatatgtct
8820tgtgtgtaag ggggttgggg tggttgtttg tgttcttgac ttttgtgtta
gcaagggaag 8880acgggcaaaa aagtgagtgt ggttgggagg gagagacgag
ccttatatat aatgcttgtt 8940tgtgtttgtg caagtggacg ccgaaacggg
caggagccaa actaaacaag gcagacaatg 9000cgagcttaat tggattgcct
gatgggcagg ggttagggct cgatcaatgg gggtgcgaag 9060tgacaaaatt
gggaattagg ttcgcaagca aggctgacaa gactttggcc caaacatttg
9120tacgcggtgg acaacaggag ccacccatcg tctgtcacgg gctagccggt
cgtgcgtcct 9180gtcaggctcc acctaggctc catgccactc catacaatcc
cactagtgta ccgctaggcc 9240gcttttagct cccatctaag acccccccaa
aacctccact gtacagtgca ctgtactgtg 9300tggcgatcaa gggcaaggga
aaaaaggcgc aaacatgcac gcatggaatg acgtaggtaa 9360ggcgttacta
gactgaaaag tggcacattt cggcgtgcca aagggtccta ggtgcgtttc
9420gcgagctggg cgccaggcca agccgctcca aaacgcctct ccgactccct
ccagcggcct 9480ccatatcccc atccctctcc acagcaatgt tgttaagcct
tgcaaacgaa aaaatagaaa 9540ggctaataag cttccaatat tgtggtgtac
gctgcataac gcaacaatga gcgccaaaca 9600acacacacac acagcacaca
gcagcattaa ccacgatgaa cagcatgaat tcctttacct 9660gcaggataac
ttcgtataat gtatgctata cgaagttatg atctctctct tgagcttttc
9720cataacaagt tcttctgcct ccaggaagtc catgggtggt ttgatcatgg
ttttggtgta 9780gtggtagtgc agtggtggta ttgtgactgg ggatgtagtt
gagaataagt catacacaag 9840tcagctttct tcgagcctca tataagtata
agtagttcaa cgtattagca ctgtacccag 9900catctccgta tcgagaaaca
caacaacatg ccccattgga cagatcatgc ggatacacag 9960gttgtgcagt
atcatacata ctcgatcaga caggtcgtct gaccatcata caagctgaac
10020aagcgctcca tacttgcacg ctctctatat acacagttaa attacatatc
catagtctaa 10080cctctaacag ttaatcttct ggtaagcctc ccagccagcc
ttctggtatc gcttggcctc 10140ctcaatagga tctcggttct ggccgtacag
acctcggccg acaattatga tatccgttcc 10200ggtagacatg acatcctcaa
cagttcggta ctgctgtccg agagcgtctc ccttgtcgtc 10260aagacccacc
ccgggggtca gaataagcca gtcctcagag tcgcccttag gtcggttctg
10320ggcaatgaag ccaaccacaa actcggggtc ggatcgggca agctcaatgg
tctgcttgga 10380gtactcgcca gtggccagag agcccttgca agacagctcg
gccagcatga gcagacctct 10440ggccagcttc tcgttgggag aggggactag
gaactccttg tactgggagt tctcgtagtc 10500agagacgtcc tccttcttct
gttcagagac agtttcctcg gcaccagctc gcaggccagc 10560aatgattccg
gttccgggta caccgtgggc gttggtgata tcggaccact cggcgattcg
10620gtgacaccgg tactggtgct tgacagtgtt gccaatatct gcgaactttc
tgtcctcgaa 10680caggaagaaa ccgtgcttaa gagcaagttc cttgaggggg
agcacagtgc cggcgtaggt 10740gaagtcgtca atgatgtcga tatgggtttt
gatcatgcac acataaggtc cgaccttatc 10800ggcaagctca atgagctcct
tggtggtggt aacatccaga gaagcacaca ggttggtttt 10860cttggctgcc
acgagcttga gcactcgagc ggcaaaggcg gacttgtgga cgttagctcg
10920agcttcgtag gagggcattt tggtggtgaa gaggagactg aaataaattt
agtctgcaga 10980actttttatc ggaaccttat ctggggcagt gaagtatatg
ttatggtaat agttacgagt 11040tagttgaact tatagataga ctggactata
cggctatcgg tccaaattag aaagaacgtc 11100aatggctctc tgggcgtcgc
ctttgccgac aaaaatgtga tcatgatgaa agccagcaat 11160gacgttgcag
ctgatattgt tgtcggccaa ccgcgccgaa aacgcagctg tcagacccac
11220agcctccaac gaagaatgta tcgtcaaagt gatccaagca cactcatagt
tggagtcgta 11280ctccaaaggc ggcaatgacg agtcagacag atactcgtcg
acgcgataac ttcgtataat 11340gtatgctata cgaagttatc gtacgatagt
tagtagacaa caatcgatcg aggaagagga 11400caagcggctg cttcttaagt
ttgtgacatc agtatccaag gcaccattgc aaggattcaa 11460ggctttgaac
ccgtcatttg ccattcgtaa cgctggtaga caggttgatc ggttccctac
11520ggcctccacc tgtgtcaatc ttctcaagct gcctgactat caggacattg
atcaacttcg 11580gaagaaactt ttgtatgcca ttcgatcaca tgctggtttc
gatttgtctt agaggaacgc 11640atatacagta atcatagaga ataaacgata
ttcatttatt aaagtagata gttgaggtag 11700aagttgtaaa gagtgataaa
tagcggccgc tcactgaatc tttttggctc ccttgtgctt 11760tcggacgatg
taggtctgca cgtagaagtt gaggaacaga cacaggacag taccaacgta
11820gaagtagttg aaaaaccagc caaacattct cattccatct tgtcggtagc
agggaatgtt 11880ccggtacttc cagacgatgt agaagccaac gttgaactga
atgatctgca tagaagtaat 11940cagggacttg ggcataggga acttgagctt
gatcagtcgg gtccaatagt agccgtacat 12000gatccagtga atgaagccgt
tgagcagcac aaagatccaa acggcttcgt ttcggtagtt 12060gtagaacagc
cacatgtcca taggagctcc gagatggtga aagaactgca accaggtcag
12120aggcttgccc atgaggggca gatagaagga gtcaatgtac tcgaggaact
tgctgaggta 12180gaacagctga gtggtgattc ggaagacatt gttgtcgaaa
gccttctcgc agttgtcgga 12240catgacacca atggtgtaca tggcgtaggc
catagagagg aaggagccca gcgagtagat 12300ggacatgagc aggttgtagt
tggtgaacac aaacttcatt cgagactgac ccttgggtcc 12360gagaggacca
agggtgaact tcaggatgac gaaggcgatg gagaggtaca gcacctcgca
12420gtgcgaggca tcagaccaga gctgagcata gtcgaccttg ggaagaacct
cctggccaat 12480ggagacgatt tcgttcacga cctccatggt tgtgaattag
ggtggtgaga atggttggtt 12540gtagggaaga atcaaaggcc ggtctcggga
tccgtgggta tatatatata tatatatata 12600tacgatcctt cgttacctcc
ctgttctcaa aactgtggtt tttcgttttt cgttttttgc 12660tttttttgat
ttttttaggg ccaactaagc ttccagattt cgctaatcac ctttgtacta
12720attacaagaa aggaagaagc tgattagagt tgggcttttt atgcaactgt
gctactcctt 12780atctctgata tgaaagtgta gacccaatca catcatgtca
tttagagttg gtaatactgg 12840gaggatagat aaggcacgaa aacgagccat
agcagacatg ctgggtgtag ccaagcagaa 12900gaaagtagat gggagccaat
tgacgagcga gggagctacg ccaatccgac atacgacacg 12960ctgagatcgt
cttggccggg gggtacctac agatgtccaa gggtaagtgc ttgactgtaa
13020ttgtatgtct gaggacaaat atgtagtcag ccgtataaag tcataccagg
caccagtgcc 13080atcatcgaac cactaactct ctatgataca tgcctccggt
attattgtac catgcgtcgc 13140tttgttacat acgtatcttg cctttttctc
tcagaaactc cagactttgg ctattggtcg 13200agataagccc ggaccatagt
gagtctttca cactctgttt aaacaccact aaaaccccac 13260aaaatatatc
ttaccgaata tacagatcta ctatagagga acaattgccc cggagaagac
13320ggccaggccg cctagatgac aaattcaaca actcacagct gactttctgc
cattgccact 13380aggggggggc ctttttatat ggccaagcca agctctccac
gtcggttggg ctgcacccaa 13440caataaatgg gtagggttgc accaacaaag
ggatgggatg gggggtagaa gatacgagga 13500taacggggct caatggcaca
aataagaacg aatactgcca ttaagactcg tgatccagcg 13560actgacacca
ttgcatcatc taagggcctc aaaactacct cggaactgct gcgctgatct
13620ggacaccaca gaggttccga gcactttagg ttgcaccaaa tgtcccacca
ggtgcaggca 13680gaaaacgctg gaacagcgtg tacagtttgt cttaacaaaa
agtgagggcg ctgaggtcga 13740gcagggtggt gtgacttgtt atagccttta
gagctgcgaa agcgcgtatg gatttggctc 13800atcaggccag attgagggtc
tgtggacaca tgtcatgtta gtgtacttca atcgccccct 13860ggatatagcc
ccgacaatag gccgtggcct catttttttg ccttccgcac atttccattg
13920ctcggtaccc acaccttgct tctcctgcac ttgccaacct taatactggt
ttacattgac 13980caacatctta caagcggggg gcttgtctag ggtatatata
aacagtggct ctcccaatcg 14040gttgccagtc tcttttttcc tttctttccc
cacagattcg aaatctaaac tacacatcac 14100acaatgcctg ttactgacgt
ccttaagcga aagtccggtg tcatcgtcgg cgacgatgtc 14160cgagccgtga
gtatccacga caagatcagt gtcgagacga cgcgttttgt gtaatgacac
14220aatccgaaag tcgctagcaa cacacactct ctacacaaac taacccagct
ctccatggtg 14280aaggcttctc gacaggctct gcccctcgtc atcgacggaa
aggtgtacga cgtctccgct 14340tgggtgaact tccaccctgg tggagctgaa
atcattgaga actaccaggg acgagatgct 14400actgacgcct tcatggttat
gcactctcag gaagccttcg acaagctcaa gcgaatgccc 14460aagatcaacc
aggcttccga gctgcctccc caggctgccg tcaacgaagc tcaggaggat
14520ttccgaaagc tccgagaaga gctgatcgcc actggcatgt ttgacgcctc
tcccctctgg 14580tactcgtaca agatcttgac caccctgggt cttggcgtgc
ttgccttctt catgctggtc 14640cagtaccacc tgtacttcat tggtgctctc
gtgctcggta tgcactacca gcaaatggga 14700tggctgtctc atgacatctg
ccaccaccag accttcaaga accgaaactg gaataacgtc 14760ctgggtctgg
tctttggcaa cggactccag ggcttctccg tgacctggtg gaaggacaga
14820cacaacgccc atcattctgc taccaacgtt cagggtcacg atcccgacat
tgataacctg 14880cctctgctcg cctggtccga ggacgatgtc actcgagctt
ctcccatctc ccgaaagctc 14940attcagttcc aacagtacta tttcctggtc
atctgtattc tcctgcgatt catctggtgt 15000ttccagtctg tgctgaccgt
tcgatccctc aaggaccgag acaaccagtt ctaccgatct 15060cagtacaaga
aagaggccat tggactcgct ctgcactgga ctctcaagac cctgttccac
15120ctcttcttta tgccctccat cctgacctcg atgctggtgt tctttgtttc
cgagctcgtc 15180ggtggcttcg gaattgccat cgtggtcttc atgaaccact
accctctgga gaagatcggt 15240gattccgtct gggacggaca tggcttctct
gtgggtcaga tccatgagac catgaacatt 15300cgacgaggca tcattactga
ctggttcttt ggaggcctga actaccagat cgagcaccat 15360ctctggccca
ccctgcctcg acacaacctc actgccgttt cctaccaggt ggaacagctg
15420tgccagaagc acaacctccc ctaccgaaac cctctgcccc atgaaggtct
cgtcatcctg 15480ctccgatacc tgtcccagtt cgctcgaatg gccgagaagc
agcccggtgc caaggctcag 15540taagcggccg catgagaaga taaatatata
aatacattga gatattaaat gcgctagatt 15600agagagcctc atactgctcg
gagagaagcc aagacgagta ctcaaagggg attacaccat 15660ccatatccac
agacacaagc tggggaaagg ttctatatac actttccgga ataccgtagt
15720ttccgatgtt atcaatgggg gcagccagga tttcaggcac ttcggtgtct
cggggtgaaa 15780tggcgttctt ggcctccatc aagtcgtacc atgtcttcat
ttgcctgtca aagtaaaaca 15840gaagcagatg aagaatgaac ttgaagtgaa ggaattt
158775515812DNAArtificial SequencePlasmid pZKL2-5U89GC 55gtacgttatc
atttgaacag tgaaaggcta cagtaacaga agcagttgta aacttcattc 60cgttgattct
gtactacagt accccactac gccgcttccg ctgacactgt tcaacccaaa
120aactacatct gcgtgcgctg tgtaaggcta tcatcagata catactgtag
attctgtaga 180tgcgaacctg cttgtatcat atacatcccc ctccccctga
cctgcacaag caagcaatgt 240gacattgata ttgctgctta tctagtgccg
aggatgtgaa agccgagact caaacatttc 300ttttactctc ttgttcctga
ccagacctgg cggagattac gccagtatga ttcttgcagg 360tctgagacaa
gcctggaaca gccaacattt atttttcgaa gcgagaaaca tgccacaccc
420cggcacgttc agagatgcat atgatttgtt tttcgagtaa cagtaccccc
cccccccccc 480ccaatgaaac cagtattact cacaccatcc tcattcaaag
cgttacactg attacgcgcc 540catcaacgac agcatgaggg gactgctgat
ctgatctaat caaatgacta caaaaatcgc 600aataatgaag agcaaacgac
aaaaaagaaa caggttaacc aatcccgctt caatgtctca 660ccacaatcca
gcactgtttc tcattacctc ctccctctaa tttcagagtt gcatcagggt
720ccttgatggc gcgccagctg cattaatgaa tcggccaacg cgcggggaga
ggcggtttgc 780gtattgggcg ctcttccgct tcctcgctca ctgactcgct
gcgctcggtc gttcggctgc 840ggcgagcggt atcagctcac tcaaaggcgg
taatacggtt atccacagaa tcaggggata 900acgcaggaaa gaacatgtga
gcaaaaggcc agcaaaaggc caggaaccgt aaaaaggccg 960cgttgctggc
gtttttccat aggctccgcc cccctgacga gcatcacaaa aatcgacgct
1020caagtcagag gtggcgaaac ccgacaggac tataaagata ccaggcgttt
ccccctggaa 1080gctccctcgt gcgctctcct gttccgaccc tgccgcttac
cggatacctg tccgcctttc 1140tcccttcggg aagcgtggcg ctttctcata
gctcacgctg taggtatctc agttcggtgt 1200aggtcgttcg ctccaagctg
ggctgtgtgc acgaaccccc cgttcagccc gaccgctgcg 1260ccttatccgg
taactatcgt cttgagtcca acccggtaag acacgactta tcgccactgg
1320cagcagccac tggtaacagg attagcagag cgaggtatgt aggcggtgct
acagagttct 1380tgaagtggtg gcctaactac ggctacacta gaagaacagt
atttggtatc tgcgctctgc 1440tgaagccagt taccttcgga aaaagagttg
gtagctcttg atccggcaaa caaaccaccg 1500ctggtagcgg tggttttttt
gtttgcaagc agcagattac gcgcagaaaa aaaggatctc 1560aagaagatcc
tttgatcttt tctacggggt ctgacgctca gtggaacgaa aactcacgtt
1620aagggatttt ggtcatgaga ttatcaaaaa ggatcttcac ctagatcctt
ttaaattaaa 1680aatgaagttt taaatcaatc taaagtatat atgagtaaac
ttggtctgac agttaccaat 1740gcttaatcag tgaggcacct atctcagcga
tctgtctatt tcgttcatcc atagttgcct 1800gactccccgt cgtgtagata
actacgatac gggagggctt accatctggc cccagtgctg 1860caatgatacc
gcgagaccca cgctcaccgg ctccagattt atcagcaata aaccagccag
1920ccggaagggc cgagcgcaga agtggtcctg caactttatc cgcctccatc
cagtctatta 1980attgttgccg ggaagctaga gtaagtagtt cgccagttaa
tagtttgcgc aacgttgttg 2040ccattgctac aggcatcgtg gtgtcacgct
cgtcgtttgg tatggcttca ttcagctccg 2100gttcccaacg atcaaggcga
gttacatgat cccccatgtt gtgcaaaaaa gcggttagct 2160ccttcggtcc
tccgatcgtt gtcagaagta agttggccgc agtgttatca ctcatggtta
2220tggcagcact gcataattct cttactgtca tgccatccgt aagatgcttt
tctgtgactg 2280gtgagtactc aaccaagtca ttctgagaat agtgtatgcg
gcgaccgagt tgctcttgcc 2340cggcgtcaat acgggataat accgcgccac
atagcagaac tttaaaagtg ctcatcattg 2400gaaaacgttc ttcggggcga
aaactctcaa ggatcttacc gctgttgaga tccagttcga 2460tgtaacccac
tcgtgcaccc aactgatctt cagcatcttt tactttcacc agcgtttctg
2520ggtgagcaaa aacaggaagg caaaatgccg caaaaaaggg aataagggcg
acacggaaat 2580gttgaatact catactcttc ctttttcaat attattgaag
catttatcag ggttattgtc 2640tcatgagcgg atacatattt gaatgtattt
agaaaaataa acaaataggg gttccgcgca 2700catttccccg aaaagtgcca
cctgatgcgg tgtgaaatac cgcacagatg cgtaaggaga 2760aaataccgca
tcaggaaatt gtaagcgtta atattttgtt aaaattcgcg ttaaattttt
2820gttaaatcag ctcatttttt aaccaatagg ccgaaatcgg caaaatccct
tataaatcaa 2880aagaatagac cgagataggg ttgagtgttg ttccagtttg
gaacaagagt ccactattaa 2940agaacgtgga ctccaacgtc aaagggcgaa
aaaccgtcta tcagggcgat ggcccactac 3000gtgaaccatc accctaatca
agttttttgg ggtcgaggtg ccgtaaagca ctaaatcgga 3060accctaaagg
gagcccccga tttagagctt gacggggaaa gccggcgaac gtggcgagaa
3120aggaagggaa gaaagcgaaa ggagcgggcg ctagggcgct ggcaagtgta
gcggtcacgc 3180tgcgcgtaac caccacaccc gccgcgctta atgcgccgct
acagggcgcg tccattcgcc 3240attcaggctg cgcaactgtt gggaagggcg
atcggtgcgg gcctcttcgc tattacgcca 3300gctggcgaaa gggggatgtg
ctgcaaggcg attaagttgg gtaacgccag ggttttccca 3360gtcacgacgt
tgtaaaacga cggccagtga attgtaatac gactcactat agggcgaatt
3420gggcccgacg tcgcatgctg gtttcgattt gtcttagagg aacgcatata
cagtaatcat 3480agagaataaa cgatattcat ttattaaagt agatagttga
ggtagaagtt gtaaagagtg 3540ataaatagct tagataccac agacaccctc
ggtgacgaag tactgcagat ggtttccaat 3600cacattgacc tgctggagca
gagtgttacc ggcagagcac tgtttattgc tctggccctg 3660gcacatgaca
acgttggaga gaggagggtg gatcaggggc cagtcaataa agacctcacc
3720agagcagtgc tggtaaccgt cccagaaggg cacttgaggg acgatatctc
ctcggtgggt 3780gattcggtag agctttcggt ctttggacac cttggagaca
tcggggttct cctggccaaa 3840gaagagttta tcgacccagt tagcaaagcc
agcgttaccg acaatgggct gaccaagagt 3900aacaacgagg ggatcgtggc
cgttaacctt gaggttgatt ccgaacagaa gggctgcagc 3960tcctccgaga
gagtgaccgg tgacagcaat ctggtagtcg ggatactgct caatcacaga
4020gtcgagcttg gggccgatct gattgtaggt gttgttgtag gactggatga
agccattgtg 4080gacaagacag tcatcacaag tagcagtaga agagatgtta
gcagcaagat caaagttaat 4140taactcacct gcaggattga gactatgaat
ggattcccgt gcccgtatta ctctactaat 4200ttgatcttgg aacgcgaaaa
tacgtttcta ggactccaaa gaatctcaac tcttgtcctt 4260actaaatata
ctacccatag ttgatggttt acttgaacag agaggacatg ttcacttgac
4320ccaaagtttc tcgcatctct tggatatttg aacaacggcg tccactgacc
gtcagttatc 4380cagtcacaaa acccccacat tcatacattc ccatgtacgt
ttacaaagtt ctcaattcca 4440tcgtgcaaat caaaatcaca tctattcatt
catcatatat aaacccatca tgtctactaa 4500cactcacaac tccatagaaa
acatcgactc agaacacacg ctccatgcgg ccgcttagga 4560atcctgagcg
tccttgacac agtgaaccac accgactttg tgcatgtact tgagggtgga
4620aatgatgttg cccacaatgg tagggtagaa gacgtaccga actccgtgtc
gttcgcaaca 4680ctctcggaca gcttgctgca cgaagggata gtgccaagac
gacattcgag gaaagaggtg 4740atgctcgatc tggaagttga gaccgccagt
aaagaacatg gcaatgggtc caccgtaggt 4800ggaagaggtc tccacctgag
ctctgtacca gtcgatctga tcggcttcaa cgtccttctc 4860ggagctcttg
accttgcagt tcttgtcggg gattcgctcc gagccatcga agttgtgaga
4920caagatgaaa aagaaggtga ggaaggcacc ggtagcagtg ggcaccagag
gaatggtgat 4980gagcagggag gttccagtga gataccaggg caagaaggcg
gttcgaaaga tgaagaaagc 5040tcgcataacg aatgcaaggg ttcggtaccg
tcgcagaaag ccgttctctc gcatggctgt 5100gacagactcg ggaatggtgt
cgttgtgctg cattcggaag atgtagagag ggttgtacac 5160cagcgaaacg
ccgtaggctc caagcacgag gtacatgtac caggcctgga atcggtgaaa
5220ccactttcga gcagtgttgg cagcagggta gttgtggaac acaaggaatg
gttctgcgga 5280ctcggcatcc aggtcgagac catgctgatt ggtgtaggtg
tgatgtcgca tgatgtgaga 5340ctgcagccag atccatctgg acgatccaat
gacgtcgatg ccgtaggcaa agagagcgtt 5400gacccagggc tttttgctga
tggcaccatg agaggcatcg tgctgaatgg acaggccgat 5460ctgcatgtgc
atgaatccag tcaagagacc ccacagcacc attccggtag tagcccagtg
5520ccactcgcaa aaggcggtga cagcaatgat gccaacggtt cgcagccaga
atccaggtgt 5580ggcataccag ttccgacctt tcatgacctc tcgcatagtt
cgcttgacgt cctgtgcaaa 5640gggagagtcg taggtgtaga caatgtcctt
ggaggttcgg tcgtgcttgc ctcgcacgaa 5700ctgttgaagc agcttcgagt
tctcgggctt gacgtaaggg tgcatggagt agaacagagg 5760agaagcatcg
gaggcaccag aagcgaggat caagtcgcct ccgggatgga ccttggcaag
5820accttccaga tcgtagagaa tgccgtcgat ggcaaccagg tcgggtcgct
cgagcagctg 5880ctcggtagta agggagagag ccatggttgt gaattagggt
ggtgagaatg gttggttgta 5940gggaagaatc aaaggccggt ctcgggatcc
gtgggtatat atatatatat atatatatac 6000gatccttcgt tacctccctg
ttctcaaaac tgtggttttt cgtttttcgt tttttgcttt 6060ttttgatttt
tttagggcca actaagcttc cagatttcgc taatcacctt tgtactaatt
6120acaagaaagg aagaagctga ttagagttgg gctttttatg caactgtgct
actccttatc 6180tctgatatga aagtgtagac ccaatcacat catgtcattt
agagttggta atactgggag 6240gatagataag gcacgaaaac gagccatagc
agacatgctg ggtgtagcca agcagaagaa 6300agtagatggg agccaattga
cgagcgaggg agctacgcca atccgacata cgacacgctg 6360agatcgtctt
ggccgggggg tacctacaga tgtccaaggg taagtgcttg actgtaattg
6420tatgtctgag gacaaatatg tagtcagccg tataaagtca taccaggcac
cagtgccatc 6480atcgaaccac taactctcta tgatacatgc ctccggtatt
attgtaccat gcgtcgcttt 6540gttacatacg tatcttgcct ttttctctca
gaaactccag aattctctct cttgagcttt 6600tccataacaa gttcttctgc
ctccaggaag tccatgggtg gtttgatcat ggttttggtg 6660tagtggtagt
gcagtggtgg tattgtgact ggggatgtag ttgagaataa gtcatacaca
6720agtcagcttt cttcgagcct catataagta taagtagttc aacgtattag
cactgtaccc 6780agcatctccg tatcgagaaa cacaacaaca tgccccattg
gacagatcat gcggatacac 6840aggttgtgca gtatcataca tactcgatca
gacaggtcgt ctgaccatca tacaagctga 6900acaagcgctc catacttgca
cgctctctat atacacagtt aaattacata tccatagtct 6960aacctctaac
agttaatctt ctggtaagcc tcccagccag ccttctggta tcgcttggcc
7020tcctcaatag gatctcggtt ctggccgtac agacctcggc cgacaattat
gatatccgtt 7080ccggtagaca tgacatcctc aacagttcgg tactgctgtc
cgagagcgtc tcccttgtcg 7140tcaagaccca ccccgggggt cagaataagc
cagtcctcag agtcgccctt aggtcggttc 7200tgggcaatga agccaaccac
aaactcgggg tcggatcggg caagctcaat ggtctgcttg 7260gagtactcgc
cagtggccag agagcccttg caagacagct cggccagcat gagcagacct
7320ctggccagct tctcgttggg agaggggact aggaactcct tgtactggga
gttctcgtag 7380tcagagacgt cctccttctt ctgttcagag acagtttcct
cggcaccagc tcgcaggcca 7440gcaatgattc cggttccggg tacaccgtgg
gcgttggtga tatcggacca ctcggcgatt 7500cggtgacacc ggtactggtg
cttgacagtg ttgccaatat ctgcgaactt tctgtcctcg 7560aacaggaaga
aaccgtgctt aagagcaagt tccttgaggg ggagcacagt gccggcgtag
7620gtgaagtcgt caatgatgtc gatatgggtt ttgatcatgc acacataagg
tccgacctta 7680tcggcaagct caatgagctc cttggtggtg gtaacatcca
gagaagcaca caggttggtt 7740ttcttggctg ccacgagctt gagcactcga
gcggcaaagg cggacttgtg gacgttagct 7800cgagcttcgt aggagggcat
tttggtggtg aagaggagac tgaaataaat ttagtctgca 7860gaacttttta
tcggaacctt atctggggca gtgaagtata tgttatggta atagttacga
7920gttagttgaa cttatagata gactggacta tacggctatc ggtccaaatt
agaaagaacg 7980tcaatggctc tctgggcgtc gcctttgccg acaaaaatgt
gatcatgatg aaagccagca 8040atgacgttgc agctgatatt gttgtcggcc
aaccgcgccg aaaacgcagc tgtcagaccc 8100acagcctcca acgaagaatg
tatcgtcaaa gtgatccaag cacactcata gttggagtcg 8160tactccaaag
gcggcaatga cgagtcagac agatactcgt cgaccttttc cttgggaacc
8220accaccgtca gcccttctga ctcacgtatt gtagccaccg acacaggcaa
cagtccgtgg 8280atagcagaat atgtcttgtc ggtccatttc tcaccaactt
taggcgtcaa gtgaatgttg 8340cagaagaagt atgtgccttc attgagaatc
ggtgttgctg atttcaataa agtcttgaga 8400tcagtttggc cagtcatgtt
gtggggggta attggattga gttatcgcct acagtctgta 8460caggtatact
cgctgcccac tttatacttt ttgattccgc tgcacttgaa gcaatgtcgt
8520ttaccaaaag tgagaatgct ccacagaaca caccccaggg tatggttgag
caaaaaataa 8580acactccgat acggggaatc gaaccccggt ctccacggtt
ctcaagaagt attcttgatg 8640agagcgtatc gatcgaggaa gaggacaagc
ggctgcttct taagtttgtg acatcagtat 8700ccaaggcacc attgcaagga
ttcaaggctt tgaacccgtc atttgccatt cgtaacgctg 8760gtagacaggt
tgatcggttc cctacggcct ccacctgtgt caatcttctc aagctgcctg
8820actatcagga cattgatcaa cttcggaaga aacttttgta tgccattcga
tcacatgctg 8880gtttcgattt gtcttagagg aacgcatata cagtaatcat
agagaataaa cgatattcat 8940ttattaaagt agatagttga ggtagaagtt
gtaaagagtg ataaatagcg gccgctcact 9000gaatcttttt ggctcccttg
tgctttcgga cgatgtaggt ctgcacgtag aagttgagga 9060acagacacag
gacagtacca acgtagaagt agttgaaaaa ccagccaaac attctcattc
9120catcttgtcg gtagcaggga atgttccggt acttccagac gatgtagaag
ccaacgttga 9180actgaatgat ctgcatagaa gtaatcaggg acttgggcat
agggaacttg agcttgatca 9240gtcgggtcca atagtagccg tacatgatcc
agtgaatgaa gccgttgagc agcacaaaga 9300tccaaacggc ttcgtttcgg
tagttgtaga acagccacat gtccatagga gctccgagat 9360ggtgaaagaa
ctgcaaccag gtcagaggct tgcccatgag gggcagatag aaggagtcaa
9420tgtactcgag gaacttgctg aggtagaaca gctgagtggt gattcggaag
acattgttgt 9480cgaaagcctt ctcgcagttg tcggacatga caccaatggt
gtacatggcg taggccatag 9540agaggaagga gcccagcgag tagatggaca
tgagcaggtt gtagttggtg aacacaaact 9600tcattcgaga ctgacccttg
ggtccgagag gaccaagggt gaacttcagg atgacgaagg 9660cgatggagag
gtacagcacc tcgcagtgcg aggcatcaga ccagagctga gcatagtcga
9720ccttgggaag aacctcctgg ccaatggaga cgatttcgtt cacgacctcc
atggttgatg 9780tgtgtttaat tcaagaatga atatagagaa gagaagaaga
aaaaagattc aattgagccg 9840gcgatgcaga cccttatata aatgttgcct
tggacagacg gagcaagccc gcccaaacct 9900acgttcggta taatatgtta
agctttttaa cacaaaggtt tggcttgggg taacctgatg 9960tggtgcaaaa
gaccgggcgt tggcgagcca ttgcgcgggc gaatggggcc gtgactcgtc
10020tcaaattcga gggcgtgcct caattcgtgc ccccgtggct ttttcccgcc
gtttccgccc 10080cgtttgcacc actgcagccg cttctttggt tcggacacct
tgctgcgagc taggtgcctt 10140gtgctactta aaaagtggcc tcccaacacc
aacatgacat gagtgcgtgg gccaagacac 10200gttggcgggg tcgcagtcgg
ctcaatggcc cggaaaaaac gctgctggag ctggttcgga 10260cgcagtccgc
cgcggcgtat ggatatccgc aaggttccat agcgccattg ccctccgtcg
10320gcgtctatcc cgcaacctct aaatagagcg ggaatataac ccaagcttct
tttttttcct 10380ttaacacgca cacccccaac tatcatgttg ctgctgctgt
ttgactctac tctgtggagg 10440ggtgctccca cccaacccaa cctacaggtg
gatccggcgc tgtgattggc tgataagtct 10500cctatccgga ctaattctga
ccaatgggac atgcgcgcag gacccaaatg ccgcaattac 10560gtaaccccaa
cgaaatgcct acccctcttt ggagcccagc ggccccaaat ccccccaagc
10620agcccggttc taccggcttc catctccaag cacaagcagc ccggttctac
cggcttccat 10680ctccaagcac ccctttctcc acaccccaca aaaagacccg
tgcaggacat cctactgcgt 10740gtttaaacac cactaaaacc ccacaaaata
tatcttaccg aatatacaga tctactatag 10800aggaacaatt gccccggaga
agacggccag gccgcctaga tgacaaattc aacaactcac 10860agctgacttt
ctgccattgc cactaggggg gggccttttt atatggccaa gccaagctct
10920ccacgtcggt tgggctgcac ccaacaataa atgggtaggg ttgcaccaac
aaagggatgg 10980gatggggggt agaagatacg aggataacgg ggctcaatgg
cacaaataag aacgaatact 11040gccattaaga ctcgtgatcc agcgactgac
accattgcat catctaaggg cctcaaaact 11100acctcggaac tgctgcgctg
atctggacac cacagaggtt ccgagcactt taggttgcac 11160caaatgtccc
accaggtgca ggcagaaaac gctggaacag cgtgtacagt ttgtcttaac
11220aaaaagtgag ggcgctgagg tcgagcaggg tggtgtgact tgttatagcc
tttagagctg 11280cgaaagcgcg tatggatttg gctcatcagg ccagattgag
ggtctgtgga cacatgtcat 11340gttagtgtac ttcaatcgcc ccctggatat
agccccgaca ataggccgtg gcctcatttt 11400tttgccttcc gcacatttcc
attgctcggt acccacacct tgcttctcct gcacttgcca 11460accttaatac
tggtttacat tgaccaacat cttacaagcg gggggcttgt ctagggtata
11520tataaacagt ggctctccca atcggttgcc agtctctttt ttcctttctt
tccccacaga 11580ttcgaaatct aaactacaca tcacacaatg cctgttactg
acgtccttaa gcgaaagtcc 11640ggtgtcatcg tcggcgacga tgtccgagcc
gtgagtatcc acgacaagat cagtgtcgag 11700acgacgcgtt ttgtgtaatg
acacaatccg aaagtcgcta gcaacacaca ctctctacac 11760aaactaaccc
agctctccat ggtgaaggct tctcgacagg ctctgcccct cgtcatcgac
11820ggaaaggtgt acgacgtctc cgcttgggtg aacttccacc ctggtggagc
tgaaatcatt 11880gagaactacc agggacgaga tgctactgac gccttcatgg
ttatgcactc tcaggaagcc 11940ttcgacaagc tcaagcgaat gcccaagatc
aaccaggctt ccgagctgcc tccccaggct 12000gccgtcaacg aagctcagga
ggatttccga aagctccgag aagagctgat cgccactggc 12060atgtttgacg
cctctcccct ctggtactcg tacaagatct tgaccaccct gggtcttggc
12120gtgcttgcct tcttcatgct ggtccagtac cacctgtact tcattggtgc
tctcgtgctc 12180ggtatgcact accagcaaat gggatggctg tctcatgaca
tctgccacca ccagaccttc 12240aagaaccgaa actggaataa cgtcctgggt
ctggtctttg gcaacggact ccagggcttc 12300tccgtgacct ggtggaagga
cagacacaac gcccatcatt ctgctaccaa cgttcagggt 12360cacgatcccg
acattgataa cctgcctctg ctcgcctggt ccgaggacga tgtcactcga
12420gcttctccca tctcccgaaa gctcattcag ttccaacagt actatttcct
ggtcatctgt 12480attctcctgc gattcatctg gtgtttccag tctgtgctga
ccgttcgatc cctcaaggac 12540cgagacaacc agttctaccg atctcagtac
aagaaagagg ccattggact cgctctgcac 12600tggactctca agaccctgtt
ccacctcttc tttatgccct ccatcctgac ctcgatgctg 12660gtgttctttg
tttccgagct cgtcggtggc ttcggaattg ccatcgtggt cttcatgaac
12720cactaccctc tggagaagat cggtgattcc gtctgggacg gacatggctt
ctctgtgggt 12780cagatccatg agaccatgaa cattcgacga ggcatcatta
ctgactggtt ctttggaggc 12840ctgaactacc agatcgagca ccatctctgg
cccaccctgc ctcgacacaa cctcactgcc 12900gtttcctacc aggtggaaca
gctgtgccag aagcacaacc tcccctaccg aaaccctctg 12960ccccatgaag
gtctcgtcat cctgctccga tacctgtccc agttcgctcg aatggccgag
13020aagcagcccg gtgccaaggc tcagtaagcg gccgcatgag aagataaata
tataaataca 13080ttgagatatt aaatgcgcta gattagagag cctcatactg
ctcggagaga agccaagacg 13140agtactcaaa ggggattaca ccatccatat
ccacagacac aagctgggga aaggttctat 13200atacactttc cggaataccg
tagtttccga tgttatcaat gggggcagcc aggatttcag 13260gcacttcggt
gtctcggggt gaaatggcgt tcttggcctc catcaagtcg taccatgtct
13320tcatttgcct gtcaaagtaa aacagaagca gatgaagaat gaacttgaag
tgaaggaatt 13380taaatagttg gagcaaggga gaaatgtaga gtgtgaaaga
ctcactatgg tccgggctta 13440tctcgaccaa tagccaaagt ctggagtttc
tgagagaaaa aggcaagata cgtatgtaac 13500aaagcgacgc atggtacaat
aataccggag gcatgtatca tagagagtta gtggttcgat 13560gatggcactg
gtgcctggta tgactttata cggctgacta catatttgtc ctcagacata
13620caattacagt caagcactta cccttggaca tctgtaggta ccccccggcc
aagacgatct 13680cagcgtgtcg tatgtcggat tggcgtagct ccctcgctcg
tcaattggct cccatctact 13740ttcttctgct tggctacacc cagcatgtct
gctatggctc gttttcgtgc cttatctatc 13800ctcccagtat taccaactct
aaatgacatg atgtgattgg gtctacactt tcatatcaga 13860gataaggagt
agcacagttg cataaaaagc ccaactctaa tcagcttctt cctttcttgt
13920aattagtaca aaggtgatta gcgaaatctg gaagcttagt tggccctaaa
aaaatcaaaa 13980aaagcaaaaa acgaaaaacg aaaaaccaca gttttgagaa
cagggaggta acgaaggatc 14040gtatatatat atatatatat atatacccac
ggatcccgag accggccttt gattcttccc 14100tacaaccaac cattctcacc
accctaattc acaaccatgg gcgtattcat taaacaggag 14160cagcttccgg
ctctcaagaa gtacaagtac tccgccgagg atcactcgtt catctccaac
14220aacattctgc gccccttctg gcgacagttt gtcaaaatct tccctctgtg
gatggccccc 14280aacatggtga ctctgctggg cttcttcttt gtcattgtga
acttcatcac catgctcatt 14340gttgatccca cccacgaccg cgagcctccc
agatgggtct acctcaccta cgctctgggt 14400ctgttccttt accagacatt
tgatgcctgt gacggatccc atgcccgacg aactggccag 14460agtggacccc
ttggagagct gtttgaccac tgtgtcgacg ccatgaatac ctctctgatt
14520ctcacggtgg tggtgtccac cacccatatg ggatataaca tgaagctact
gattgtgcag 14580attgccgctc tcggaaactt ctacctgtcg acctgggaga
cctaccatac cggaactctg 14640tacctttctg gcttctctgg tcctgttgaa
ggtatcttga ttctggtggc tcttttcgtc 14700ctcaccttct tcactggtcc
caacgtgtac gctctgaccg tctacgaggc tcttcccgag 14760tccatcactt
cgctgctgcc tgccagcttc ctggacgtca ccatcaccca gatctacatt
14820ggattcggag tgctgggcat ggtgttcaac atctacggcg cctgcggaaa
cgtgatcaag 14880tactacaaca acaagggcaa gagcgctctc cccgccattc
tcggaatcgc cccctttggc 14940atcttctacg tcggcgtctt tgcctgggcc
catgttgctc ctctgcttct ctccaagtac 15000gccatcgtct atctgtttgc
cattggggct gcctttgcca tgcaagtcgg ccagatgatt 15060cttgcccatc
tcgtgcttgc tccctttccc cactggaacg tgctgctctt cttccccttt
15120gtgggactgg cagtgcacta cattgcaccc gtgtttggct gggacgccga
tatcgtgtcg 15180gttaacactc tcttcacctg ttttggcgcc accctctcca
tttacgcctt ctttgtgctt 15240gagatcatcg acgagatcac caactacctc
gatatctggt gtctgcgaat caagtaccct 15300caggagaaga agaccgaata
agcggccgca tggagcgtgt gttctgagtc gatgttttct 15360atggagttgt
gagtgttagt agacatgatg ggtttatata tgatgaatga atagatgtga
15420ttttgatttg cacgatggaa ttgagaactt tgtaaacgta catgggaatg
tatgaatgtg 15480ggggttttgt gactggataa ctgacggtca gtggacgccg
ttgttcaaat atccaagaga 15540tgcgagaaac tttgggtcaa gtgaacatgt
cctctctgtt caagtaaacc atcaactatg 15600ggtagtatat ttagtaagga
caagagttga gattctttgg agtcctagaa acgtattttc 15660gcgttccaag
atcaaattag tagagtaata cgggcacggg aatccattca tagtctcaat
15720cctgcaggtg agttaattaa tcgagcttgg cgtaatcatg gtcatagctg
tttcctgtgt 15780gaaattgtta tccgctcaca attccacaca ac
15812567966DNAArtificial SequencePlasmid pYPS161 56aaatgtaacg
aaactgaaat ttgaccagat attgtgtccg cggtggagct ccagcttttg 60ttccctttag
tgagggttaa tttcgagctt ggcgtaatca tggtcatagc tgtttcctgt
120gtgaaattgt tatccgctca caagcttcca cacaacgtac gttctggttg
gctcggatga 180tttctgcggc cccagcgtaa ggcaggcgtt ccgtccggat
cggtttgggt cggatcggct 240ttttgattgt cgtattgtcg ctcatgttgg
acctggtgtg tagttgtagt gtcagatcag 300attcaccagc gaatgcatgt
gaacttcccc acattttgag ccgaggcaga tttgggttgc 360ttagtaagca
gacgtggcgt tgcaagtaga tgtggcaaat ggggacgaag attccgaggg
420gatatcatag ttccaagggg atgtcatcat ttgccagctt tcgccgccac
ttttgacgag 480tttttgtggg tcaaataagt ttagttgaac ttttcaaatt
tcagttggca ttttgttaat 540agaaagggtg ccggtgctgg ggggttcatt
cctcgggttg cagatatcct atctgtctta 600ggggtatctc tttcaatcga
caagatgtag ttgggtaaca attatttatt aatattctct 660ccatccagta
cagtactaac atcttgacat ctcagcacaa gtgcatcttc ccaagtgttt
720gttggagagg ttgttgggta ttacttagga aacagaacac agtacgtgga
gatcttggat 780acatcgtaca tggaggttat ccataaaaaa gaccctccag
gactagttac aatgccgtta 840gatgaggaaa tccacaaccc tgattcacta
tgaacatatt atcttccccc aaacttgcga 900tatatggccc ttgatgatag
ccttgatttt acccttgatg gtacctccac gaccaaccga 960tctgctgttt
gaagagatat tttcaaattt gaagtgctca gatctactaa acatgagtcc
1020agtaattctt tccgtctttc cgatttccga tattcccttt tttagcccga
cttttcactg 1080ctcccatgtc aaacgattag gacttgggag acaatcccac
tgtcaaaatc accccgatat 1140tctctgtaaa acaagtactt cttccacgtg
atcttcaaat acctcttcca cgtgaccttc 1200aaatacctct tcaagtacct
cttccacgcg accttcaaag tcccttcaaa tacccttctc 1260aattctcccc
ttctcctcca tagtccttct ctctgactaa gcttgagaat acatgacgct
1320aagacgaaaa cacactagag accctgagag cctgaacatg catccactct
gcagttgcgc 1380acgtgcctac agcaactatc gggtccagtg ctggatctga
cactgcgtct ccctatgaag 1440aaactgataa acagatctgc actcataaca
atgatctgag cgatgaaaac gtgacctcca 1500cagccacaag tcataatcgg
cgcgccagct gcattaatga atcggccaac gcgcggggag 1560aggcggtttg
cgtattgggc gctcttccgc ttcctcgctc actgactcgc tgcgctcggt
1620cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg gtaatacggt
tatccacaga 1680atcaggggat aacgcaggaa agaacatgtg agcaaaaggc
cagcaaaagg ccaggaaccg 1740taaaaaggcc gcgttgctgg cgtttttcca
taggctccgc ccccctgacg agcatcacaa 1800aaatcgacgc tcaagtcaga
ggtggcgaaa cccgacagga ctataaagat accaggcgtt 1860tccccctgga
agctccctcg tgcgctctcc tgttccgacc ctgccgctta ccggatacct
1920gtccgccttt ctcccttcgg gaagcgtggc gctttctcat agctcacgct
gtaggtatct 1980cagttcggtg taggtcgttc gctccaagct gggctgtgtg
cacgaacccc ccgttcagcc 2040cgaccgctgc gccttatccg gtaactatcg
tcttgagtcc aacccggtaa gacacgactt 2100atcgccactg gcagcagcca
ctggtaacag gattagcaga gcgaggtatg taggcggtgc 2160tacagagttc
ttgaagtggt ggcctaacta cggctacact agaagaacag tatttggtat
2220ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt
gatccggcaa 2280acaaaccacc gctggtagcg gtggtttttt tgtttgcaag
cagcagatta cgcgcagaaa 2340aaaaggatct caagaagatc ctttgatctt
ttctacgggg tctgacgctc agtggaacga 2400aaactcacgt taagggattt
tggtcatgag attatcaaaa aggatcttca cctagatcct 2460tttaaattaa
aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa cttggtctga
2520cagttaccaa tgcttaatca gtgaggcacc tatctcagcg atctgtctat
ttcgttcatc 2580catagttgcc tgactccccg tcgtgtagat aactacgata
cgggagggct taccatctgg 2640ccccagtgct gcaatgatac cgcgagaccc
acgctcaccg gctccagatt tatcagcaat 2700aaaccagcca gccggaaggg
ccgagcgcag aagtggtcct gcaactttat ccgcctccat 2760ccagtctatt
aattgttgcc gggaagctag agtaagtagt tcgccagtta atagtttgcg
2820caacgttgtt gccattgcta caggcatcgt ggtgtcacgc tcgtcgtttg
gtatggcttc 2880attcagctcc ggttcccaac gatcaaggcg agttacatga
tcccccatgt tgtgcaaaaa 2940agcggttagc tccttcggtc ctccgatcgt
tgtcagaagt aagttggccg cagtgttatc 3000actcatggtt atggcagcac
tgcataattc tcttactgtc atgccatccg taagatgctt 3060ttctgtgact
ggtgagtact caaccaagtc attctgagaa tagtgtatgc ggcgaccgag
3120ttgctcttgc ccggcgtcaa tacgggataa taccgcgcca catagcagaa
ctttaaaagt 3180gctcatcatt ggaaaacgtt cttcggggcg aaaactctca
aggatcttac cgctgttgag 3240atccagttcg atgtaaccca ctcgtgcacc
caactgatct tcagcatctt ttactttcac 3300cagcgtttct gggtgagcaa
aaacaggaag gcaaaatgcc gcaaaaaagg gaataagggc 3360gacacggaaa
tgttgaatac tcatactctt cctttttcaa tattattgaa gcatttatca
3420gggttattgt ctcatgagcg gatacatatt tgaatgtatt tagaaaaata
aacaaatagg 3480ggttccgcgc acatttcccc gaaaagtgcc acctgatgcg
gtgtgaaata ccgcacagat 3540gcgtaaggag aaaataccgc atcaggaaat
tgtaagcgtt aatattttgt taaaattcgc 3600gttaaatttt tgttaaatca
gctcattttt taaccaatag gccgaaatcg gcaaaatccc 3660ttataaatca
aaagaataga ccgagatagg gttgagtgtt gttccagttt ggaacaagag
3720tccactatta aagaacgtgg actccaacgt caaagggcga aaaaccgtct
atcagggcga 3780tggcccacta cgtgaaccat caccctaatc aagttttttg
gggtcgaggt gccgtaaagc 3840actaaatcgg aaccctaaag ggagcccccg
atttagagct tgacggggaa agccggcgaa 3900cgtggcgaga aaggaaggga
agaaagcgaa aggagcgggc
gctagggcgc tggcaagtgt 3960agcggtcacg ctgcgcgtaa ccaccacacc
cgccgcgctt aatgcgccgc tacagggcgc 4020gtccattcgc cattcaggct
gcgcaactgt tgggaagggc gatcggtgcg ggcctcttcg 4080ctattacgcc
agctggcgaa agggggatgt gctgcaaggc gattaagttg ggtaacgcca
4140gggttttccc agtcacgacg ttgtaaaacg acggccagtg aattgtaata
cgactcacta 4200tagggcgaat tgggcccgac gtcgcatgca actattagtg
aggcttcggg agtggttgtc 4260tcggttgtct cattcagact cgttgtgttg
tatctatatc tatataaaca ctcttgtccc 4320tcaatcccac tgccatcttt
tgctaaactt gccgccaata tgaaactcat ctccctcatc 4380accgtcgcta
ccaccgctct ggcggctgtc ggagacaagt acaagctgac ctataccaga
4440tcagacgccc aatcggtcga atctctgccc gtcacctacc aagatgacct
gatcaccgcc 4500tccaccgacg gcgaacccat caccatcacc gagggcgagg
gcaacacctt ctctgttaac 4560gacatgccca tcgcctatct ggagctgcag
gctttgttct ggaccggcga ctacggctac 4620aagctccagg gctcggtctt
tgacattgcc gccgatggaa cctttgagct gagagacggc 4680cccaaggagt
actactattg cactcctcac cctgagcgaa acgtcatcta cgtcatcaac
4740agccccgact actccaagtg tcggttcaag cgtaccatca agttccacgc
tgaaaagatc 4800taagtggtaa tcgaccgact aaccattttt agctgacaaa
cacttgctaa ctcctataac 4860gaatgaatga ctaacttggc atattgttac
caagtattac ttgggatata gttgagtgta 4920accattgcta agaatccaaa
ctggagcttc taaaggtctg ggagtcgccg tatgtgttca 4980tatcgaaatc
aaagaaatca taatcgcaac agaattcaaa atcaagcaga ttaatatcca
5040ttattgtact cggatcgtga catatctgat atgatctcgg atatgatctc
tgactgttta 5100ctgggagatt tgttgaagat ttgttgaggt tatctgaaaa
gtagacaata gagacaaaat 5160gacgatatca agaactgaat cgggccgaaa
tactcggtat cattcccttc agcagtaact 5220gtattgctct atcaatgcga
cgagatacct ccacaattaa tactgtatac gctctaccac 5280tcatatctcc
aatgctaaaa tatattcatg cccaggacct ctgtgcactg ctatgcagca
5340cagtgttgtc gattgaattg gtcgtgtctg gtccctgatg ctctgtgtct
cgctgactag 5400tccttccatc cagacctcgt cattatctga taggcaacaa
gttctgctct ctcacaccct 5460gccgacacaa gggacactcg ggcttctctc
tcacccattc ggaaatacag tccttaatta 5520agttgcgaca catgtcttga
tagtatcttg aattctctct cttgagcttt tccataacaa 5580gttcttctgc
ctccaggaag tccatgggtg gtttgatcat ggttttggtg tagtggtagt
5640gcagtggtgg tattgtgact ggggatgtag ttgagaataa gtcatacaca
agtcagcttt 5700cttcgagcct catataagta taagtagttc aacgtattag
cactgtaccc agcatctccg 5760tatcgagaaa cacaacaaca tgccccattg
gacagatcat gcggatacac aggttgtgca 5820gtatcataca tactcgatca
gacaggtcgt ctgaccatca tacaagctga acaagcgctc 5880catacttgca
cgctctctat atacacagtt aaattacata tccatagtct aacctctaac
5940agttaatctt ctggtaagcc tcccagccag ccttctggta tcgcttggcc
tcctcaatag 6000gatctcggtt ctggccgtac agacctcggc cgacaattat
gatatccgtt ccggtagaca 6060tgacatcctc aacagttcgg tactgctgtc
cgagagcgtc tcccttgtcg tcaagaccca 6120ccccgggggt cagaataagc
cagtcctcag agtcgccctt aggtcggttc tgggcaatga 6180agccaaccac
aaactcgggg tcggatcggg caagctcaat ggtctgcttg gagtactcgc
6240cagtggccag agagcccttg caagacagct cggccagcat gagcagacct
ctggccagct 6300tctcgttggg agaggggact aggaactcct tgtactggga
gttctcgtag tcagagacgt 6360cctccttctt ctgttcagag acagtttcct
cggcaccagc tcgcaggcca gcaatgattc 6420cggttccggg tacaccgtgg
gcgttggtga tatcggacca ctcggcgatt cggtgacacc 6480ggtactggtg
cttgacagtg ttgccaatat ctgcgaactt tctgtcctcg aacaggaaga
6540aaccgtgctt aagagcaagt tccttgaggg ggagcacagt gccggcgtag
gtgaagtcgt 6600caatgatgtc gatatgggtt ttgatcatgc acacataagg
tccgacctta tcggcaagct 6660caatgagctc cttggtggtg gtaacatcca
gagaagcaca caggttggtt ttcttggctg 6720ccacgagctt gagcactcga
gcggcaaagg cggacttgtg gacgttagct cgagcttcgt 6780aggagggcat
tttggtggtg aagaggagac tgaaataaat ttagtctgca gaacttttta
6840tcggaacctt atctggggca gtgaagtata tgttatggta atagttacga
gttagttgaa 6900cttatagata gactggacta tacggctatc ggtccaaatt
agaaagaacg tcaatggctc 6960tctgggcgtc gcctttgccg acaaaaatgt
gatcatgatg aaagccagca atgacgttgc 7020agctgatatt gttgtcggcc
aaccgcgccg aaaacgcagc tgtcagaccc acagcctcca 7080acgaagaatg
tatcgtcaaa gtgatccaag cacactcata gttggagtcg tactccaaag
7140gcggcaatga cgagtcagac agatactcgt cgaccttttc cttgggaacc
accaccgtca 7200gcccttctga ctcacgtatt gtagccaccg acacaggcaa
cagtccgtgg atagcagaat 7260atgtcttgtc ggtccatttc tcaccaactt
taggcgtcaa gtgaatgttg cagaagaagt 7320atgtgccttc attgagaatc
ggtgttgctg atttcaataa agtcttgaga tcagtttggc 7380cagtcatgtt
gtggggggta attggattga gttatcgcct acagtctgta caggtatact
7440cgctgcccac tttatacttt ttgattccgc tgcacttgaa gcaatgtcgt
ttaccaaaag 7500tgagaatgct ccacagaaca caccccaggg tatggttgag
caaaaaataa acactccgat 7560acggggaatc gaaccccggt ctccacggtt
ctcaagaagt attcttgatg agagcgtatc 7620gatgagccta aaatgaaccc
gagtatatct cataaaattc tcggtgagag gtctgtgact 7680gtcagtacaa
ggtgccttca ttatgccctc aaccttacca tacctcactg aatgtagtgt
7740acctctaaaa atgaaataca gtgccaaaag ccaaggcact gagctcgtct
aacggacttg 7800atatacaacc aattaaaaca aatgaaaaga aatacagttc
tttgtatcat ttgtaacaat 7860taccctgtac aaactaaggt attgaaatcc
cacaatattc ccaaagtcca cccctttcca 7920aattgtcatg cctacaactc
atataccaag cactaaccta ccgttt 79665720DNAArtificial SequencePrimer
Pex-10del1 3'.Forward 57ccaacatgag cgacaatacg 205820DNAArtificial
SequencePrimer Pex-10del2 5'.Reverse 58caagttctgc tctctcacac
20598673DNAArtificial SequencePlasmid pYRH13 59taagcgattg
atgattggaa acacacacat gggttatatc taggtgagag ttagttggac 60agttatatat
taaatcagct atgccaacgg taacttcatt catgtcaacg aggaaccagt
120gactgcaagt aatatagaat ttgaccacct tgccattctc ttgcactcct
ttactatatc 180tcatttattt cttatataca aatcacttct tcttcccagc
atcgagctcg gaaacctcat 240gagcaataac atcgtggatc tcgtcaatag
agggcttttt ggactccttg ctgttggcca 300ccttgtcctt gctgtctggc
tcattctgtt tcaacgcctt tcgcgccaga ccatcaacct 360tgttgagctc
tccgtcagca gcctcgacca gatcatcaaa accagaaccc ttggctcgag
420ttcgggcttc tcgaagcttg tctttagcct cttcataatc gcccttcttg
atagcaatca 480caccgactcc atatgtgcat agagcctggg cctcctcgac
ttccttggtc cgtcggacat 540cgggctcaag agaaggaatg gccttgagaa
cacgcttgta acatgactcg gatcgagcca 600gggcgttatt actgctcgtc
ttcattgtgt ccagaggaat ctcgccgcct gtgtcagctt 660tgatggtggt
gccctcgttc ttttcggcag tgtgaacaat cacctccagc tgttcagaca
720tgaggtagaa catggaggct aggttggctt gggctaacaa cagatctccc
actccacatc 780cggaagcaag catgatctga taagtgattt gcttctctct
gagagcaacg ttggcgaggg 840cgtcagagag gttgtgagtt gtgagcacat
cacgagcagc aataagctcg tctctgaagg 900gcatccaggc gtcgtaattg
ccggaagcac gcagcagacg agcatgagac gcacttttag 960tcagctgggt
catgaactcc cgctcgctct gtgtcggggg cgtgctggcg agtttcagca
1020gatctgtggc ctcggggcac cgtcgacaga cctcttcttg agccagcagg
atctgcagca 1080gtagcgctcg tgataccaca tcatttttct cggttccaga
aatgtgagcg agcttgagag 1140cgatccgcag acctctctgg atcacctggg
gccggacatc ctgggcgatt ttgttattct 1200ggaaggcgtc aacgtaggca
gcacaaatct ccatgtacac gtcgtgggca gcgtccgggt 1260agttgagcat
ctcgtagatc tctgccagtt tgagctggat gcctgtgtat tcgtccgaca
1320agggagacag gccttgggcc tcggcctcca taagtgcctc aatgtaatac
ttgacggcat 1380gcgacgtcgg gcccaattcg ccctatagtg agtcgtatta
caattcactg gccgtcgttt 1440tacaacgtcg tgactgggaa aaccctggcg
ttacccaact taatcgcctt gcagcacatc 1500cccctttcgc cagctggcgt
aatagcgaag aggcccgcac cgatcgccct tcccaacagt 1560tgcgcagcct
gaatggcgaa tggacgcgcc ctgtagcggc gcattaagcg cggcgggtgt
1620ggtggttacg cgcagcgtga ccgctacact tgccagcgcc ctagcgcccg
ctcctttcgc 1680tttcttccct tcctttctcg ccacgttcgc cggctttccc
cgtcaagctc taaatcgggg 1740gctcccttta gggttccgat ttagtgcttt
acggcacctc gaccccaaaa aacttgatta 1800gggtgatggt tcacgtagtg
ggccatcgcc ctgatagacg gtttttcgcc ctttgacgtt 1860ggagtccacg
ttctttaata gtggactctt gttccaaact ggaacaacac tcaaccctat
1920ctcggtctat tcttttgatt tataagggat tttgccgatt tcggcctatt
ggttaaaaaa 1980tgagctgatt taacaaaaat ttaacgcgaa ttttaacaaa
atattaacgc ttacaatttc 2040ctgatgcggt attttctcct tacgcatctg
tgcggtattt cacaccgcat caggtggcac 2100ttttcgggga aatgtgcgcg
gaacccctat ttgtttattt ttctaaatac attcaaatat 2160gtatccgctc
atgagacaat aaccctgata aatgcttcaa taatattgaa aaaggaagag
2220tatgagtatt caacatttcc gtgtcgccct tattcccttt tttgcggcat
tttgccttcc 2280tgtttttgct cacccagaaa cgctggtgaa agtaaaagat
gctgaagatc agttgggtgc 2340acgagtgggt tacatcgaac tggatctcaa
cagcggtaag atccttgaga gttttcgccc 2400cgaagaacgt tttccaatga
tgagcacttt taaagttctg ctatgtggcg cggtattatc 2460ccgtattgac
gccgggcaag agcaactcgg tcgccgcata cactattctc agaatgactt
2520ggttgagtac tcaccagtca cagaaaagca tcttacggat ggcatgacag
taagagaatt 2580atgcagtgct gccataacca tgagtgataa cactgcggcc
aacttacttc tgacaacgat 2640cggaggaccg aaggagctaa ccgctttttt
gcacaacatg ggggatcatg taactcgcct 2700tgatcgttgg gaaccggagc
tgaatgaagc cataccaaac gacgagcgtg acaccacgat 2760gcctgtagca
atggcaacaa cgttgcgcaa actattaact ggcgaactac ttactctagc
2820ttcccggcaa caattaatag actggatgga ggcggataaa gttgcaggac
cacttctgcg 2880ctcggccctt ccggctggct ggtttattgc tgataaatct
ggagccggtg agcgtgggtc 2940tcgcggtatc attgcagcac tggggccaga
tggtaagccc tcccgtatcg tagttatcta 3000cacgacgggg agtcaggcaa
ctatggatga acgaaataga cagatcgctg agataggtgc 3060ctcactgatt
aagcattggt aactgtcaga ccaagtttac tcatatatac tttagattga
3120tttaaaactt catttttaat ttaaaaggat ctaggtgaag atcctttttg
ataatctcat 3180gaccaaaatc ccttaacgtg agttttcgtt ccactgagcg
tcagaccccg tagaaaagat 3240caaaggatct tcttgagatc ctttttttct
gcgcgtaatc tgctgcttgc aaacaaaaaa 3300accaccgcta ccagcggtgg
tttgtttgcc ggatcaagag ctaccaactc tttttccgaa 3360ggtaactggc
ttcagcagag cgcagatacc aaatactgtt cttctagtgt agccgtagtt
3420aggccaccac ttcaagaact ctgtagcacc gcctacatac ctcgctctgc
taatcctgtt 3480accagtggct gctgccagtg gcgataagtc gtgtcttacc
gggttggact caagacgata 3540gttaccggat aaggcgcagc ggtcgggctg
aacggggggt tcgtgcacac agcccagctt 3600ggagcgaacg acctacaccg
aactgagata cctacagcgt gagctatgag aaagcgccac 3660gcttcccgaa
gggagaaagg cggacaggta tccggtaagc ggcagggtcg gaacaggaga
3720gcgcacgagg gagcttccag ggggaaacgc ctggtatctt tatagtcctg
tcgggtttcg 3780ccacctctga cttgagcgtc gatttttgtg atgctcgtca
ggggggcgga gcctatggaa 3840aaacgccagc aacgcggcct ttttacggtt
cctggccttt tgctggcctt ttgctcacat 3900gttctttcct gcgttatccc
ctgattctgt ggataaccgt attaccgcct ttgagtgagc 3960tgataccgct
cgccgcagcc gaacgaccga gcgcagcgag tcagtgagcg aggaagcgga
4020agagcgccca atacgcaaac cgcctctccc cgcgcgttgg ccgattcatt
aatgcagctg 4080gcgcgccggt ttctgtctct cgtcgtgtca cagatggtgt
tgttgttgat gagttcctgg 4140ttgccctgtt tcgcacaagg tggtgcgtga
ggttgtgtgg agaggggctt gaaggagggg 4200ggtcgaggtg caggagcgtc
ccccgagggg ccctaggccg tcacatgacc ggcataatgg 4260tgtggagtcg
ggttttggtt ttcctggcgg gttccacact tgtcaagtct cgtttttcag
4320gctttttttc actcgctctt tttgcacttt ggcatctttt tacctttggt
gcttaccacc 4380tttgtatgca ggaaatctat tgggtttggt gtataggtga
aaaaaaaaaa gccaaaggtg 4440actgtttttt tccgactcgg tcatgttgca
ttttgtgcga tattataagt ggggaacgaa 4500tggaggcgag ctggtgtgat
acgggagctg ctgtttctca cgattctgcc cagccattta 4560tcacgcgcac
gctgacatct tgcacttagt catcaagagc tacagtacga cgagtacata
4620ctagagccaa ccactcctga agtgcttcca tgagttcagt tgagtgctga
accaactctc 4680gacactctcg acagcctgtg aaaaggaatg agtgtgtgga
aagggattca atactggaga 4740agagagggga gagatcgaga gggtgatgtt
acatccccaa gcgtcgtagt ctcgcgttga 4800tgactggaac ggactgttga
acgacgatca acatggtgtg caagctgatg gacagttggg 4860ccaatggttc
agaagcgtta gttgagcttc taacgaccta ctactcgcct gtcaagtgag
4920gtgtgtactt gttcatactc ctactcgtct cactggcgtc tagggttgtg
agcaccgtcg 4980cttatgaaag acgccgtcgc ctatgaaaga caccgtcgct
cattgaagac tagatccata 5040atataaacaa aagagtattt ctctgaatgg
cgacggattg gccagcccca tcgttacaca 5100atttgtccaa aaacaccatc
tctgccgtcc atcgatatct ttcgaaatca tccggaccag 5160acagtagagc
tttgagaacc ccgaaggagg aatactgcag tgaagtgttc tttgaaactc
5220tgactggagt atctccattt ctatatctcc attagtaatc actccaaaca
gatgtcttcc 5280agcttgagtc agccgagacc acggtcacgt atggtgattc
cttcaaacat ataactccat 5340tgacctaaca agacactggc agttgtaaat
acgtaaatac attcttgatg taagttttaa 5400tctgattgga gactcttctg
agtaacacac tctcttccaa gcagtcattt tggccttttt 5460ttcttccaaa
cccgtctcga ttactcatca ggttttatct gagaaccaaa acgtctcaat
5520cattgacata ttgtaccatc aactctgtaa aaacttgaca gatgtgctac
ttgtgtcatt 5580atgaatcgat tttccaaata tccattatca ttatcccatt
tcttccccga tatcacctcc 5640ccatctacca cctccattta ccaaccacca
tgctcagtaa tcagaaactc ctcttcacag 5700accacaattg ccaataattg
accaccaaaa gtcgtaccat gtgtttctcc ggtgaccagg 5760tctcgctttc
acccatttat tccctcaaaa acacccctac agtaatttca gcgcctttcc
5820atcaaactcc atacttgcaa caaaatcaca atggccccct gcctaaacta
cgcccgccca 5880taattgagta tatttgtatg acaatcccgc tcgaaatttg
gcccacttgt tccccgagct 5940ccaaatattc actattcacc ttcacctcgt
gcccaccctg gccccccaat gccccccgtg 6000ctcgtaacgt ctccctcccc
cacaccccac acacgtgaca taaagtgtaa agtgcgagta 6060cccgtacgtt
gtgtggaagc ttgtgagcgg ataacaattt cacacaggaa acagctatga
6120ccatgattac gccaagctcg aaattaaccc tcactaaagg gaacaaaagc
tggagctcca 6180ccgcggacac aatatctggt caaatttcag tttcgttaca
tttaaacggt aggttagtgc 6240ttggtatatg agttgtaggc atgacaattt
ggaaaggggt ggactttggg aatattgtgg 6300gatttcaata ccttagtttg
tacagggtaa ttgttacaaa tgatacaaag aactgtattt 6360cttttcattt
gttttaattg gttgtatatc aagtccgtta gacgagctca gtgccttggc
6420ttttggcact gtatttcatt tttagaggta cactacattc agtgaggtat
ggtaaggttg 6480agggcataat gaaggcacct tgtactgaca gtcacagacc
tctcaccgag aattttatga 6540gatatactcg ggttcatttt aggctcatcg
atacgctctc atcaagaata cttcttgaga 6600accgtggaga ccggggttcg
attccccgta tcggagtgtt tattttttgc tcaaccatac 6660cctggggtgt
gttctgtgga gcattctcac ttttggtaaa cgacattgct tcaagtgcag
6720cggaatcaaa aagtataaag tgggcagcga gtatacctgt acagactgta
ggcgataact 6780caatccaatt accccccaca acatgactgg ccaaactgat
ctcaagactt tattgaaatc 6840agcaacaccg attctcaatg aaggcacata
cttcttctgc aacattcact tgacgcctaa 6900agttggtgag aaatggaccg
acaagacata ttctgctatc cacggactgt tgcctgtgtc 6960ggtggctaca
atacgtgagt cagaagggct gacggtggtg gttcccaagg aaaaggtcga
7020cgagtatctg tctgactcgt cattgccgcc tttggagtac gactccaact
atgagtgtgc 7080ttggatcact ttgacgatac attcttcgtt ggaggctgtg
ggtctgacag ctgcgttttc 7140ggcgcggttg gccgacaaca atatcagctg
caacgtcatt gctggctttc atcatgatca 7200catttttgtc ggcaaaggcg
acgcccagag agccattgac gttctttcta atttggaccg 7260atagccgtat
agtccagtct atctataagt tcaactaact cgtaactatt accataacat
7320atacttcact gccccagata aggttccgat aaaaagttct gcagactaaa
tttatttcag 7380tctcctcttc accaccaaaa tgccctccta cgaagctcga
gctaacgtcc acaagtccgc 7440ctttgccgct cgagtgctca agctcgtggc
agccaagaaa accaacctgt gtgcttctct 7500ggatgttacc accaccaagg
agctcattga gcttgccgat aaggtcggac cttatgtgtg 7560catgatcaaa
acccatatcg acatcattga cgacttcacc tacgccggca ctgtgctccc
7620cctcaaggaa cttgctctta agcacggttt cttcctgttc gaggacagaa
agttcgcaga 7680tattggcaac actgtcaagc accagtaccg gtgtcaccga
atcgccgagt ggtccgatat 7740caccaacgcc cacggtgtac ccggaaccgg
aatcattgct ggcctgcgag ctggtgccga 7800ggaaactgtc tctgaacaga
agaaggagga cgtctctgac tacgagaact cccagtacaa 7860ggagttccta
gtcccctctc ccaacgagaa gctggccaga ggtctgctca tgctggccga
7920gctgtcttgc aagggctctc tggccactgg cgagtactcc aagcagacca
ttgagcttgc 7980ccgatccgac cccgagtttg tggttggctt cattgcccag
aaccgaccta agggcgactc 8040tgaggactgg cttattctga cccccggggt
gggtcttgac gacaagggag acgctctcgg 8100acagcagtac cgaactgttg
aggatgtcat gtctaccgga acggatatca taattgtcgg 8160ccgaggtctg
tacggccaga accgagatcc tattgaggag gccaagcgat accagaaggc
8220tggctgggag gcttaccaga agattaactg ttagaggtta gactatggat
atgtaattta 8280actgtgtata tagagagcgt gcaagtatgg agcgcttgtt
cagcttgtat gatggtcaga 8340cgacctgtct gatcgagtat gtatgatact
gcacaacctg tgtatccgca tgatctgtcc 8400aatggggcat gttgttgtgt
ttctcgatac ggagatgctg ggtacagtgc taatacgttg 8460aactacttat
acttatatga ggctcgaaga aagctgactt gtgtatgact tattctcaac
8520tacatcccca gtcacaatac caccactgca ctaccactac accaaaacca
tgatcaaacc 8580acccatggac ttcctggagg cagaagaact tgttatggaa
aagctcaaga gagagaattc 8640aagatactat caagacatgt gtcgcaactt aat
86736038DNAArtificial SequencePrimer PEX16Fii 60ccaaccagat
caccacccac tacaccttcc aggaaccc 386134DNAArtificial SequencePrimer
PEX16Rii 61ctggtagaac tcgcctcgga acaaccacca tccc
346234DNAArtificial SequencePrimer 3UTR-URA3 62gagagaattc
aagatactat caagacatgt gtcg 346333DNAArtificial SequencePrimer
Pex16-conf 63cacaccttca ccccggaagt cgccaccatt ctg
336420DNAArtificial SequenceReal time PCR primer ef-324F
64cgactgtgcc atcctcatca 206521DNAArtificial SequenceReal time PCR
primer ef-392R 65tgaccgtcct tggagatacc a 216618DNAArtificial
SequenceReal time PCR primer Pex16-741F 66gggagtggtg gccgagtt
186721DNAArtificial SequenceReal time PCR primer Pex16-802R
67ggaaaagcaa gcatgcgtag a 216821DNAArtificial SequenceNucleotide
portion of primer ef-345T 68tgctggtggt gttggtgagt t
216921DNAArtificial SequenceNucleotide portion of TaqMan probe
Pex16-760T 69ctgtccattc tgcgacccct c 21704313DNAArtificial
SequencePlasmid pZKUM 70taatcgagct tggcgtaatc atggtcatag ctgtttcctg
tgtgaaattg ttatccgctc 60acaattccac acaacatacg agccggaagc ataaagtgta
aagcctgggg tgcctaatga 120gtgagctaac tcacattaat tgcgttgcgc
tcactgcccg ctttccagtc gggaaacctg 180tcgtgccagc tgcattaatg
aatcggccaa cgcgcgggga gaggcggttt gcgtattggg 240cgctcttccg
cttcctcgct cactgactcg ctgcgctcgg tcgttcggct gcggcgagcg
300gtatcagctc actcaaaggc ggtaatacgg ttatccacag aatcagggga
taacgcagga 360aagaacatgt gagcaaaagg ccagcaaaag gccaggaacc
gtaaaaaggc cgcgttgctg 420gcgtttttcc ataggctccg cccccctgac
gagcatcaca aaaatcgacg ctcaagtcag 480aggtggcgaa acccgacagg
actataaaga taccaggcgt ttccccctgg aagctccctc 540gtgcgctctc
ctgttccgac cctgccgctt accggatacc tgtccgcctt tctcccttcg
600ggaagcgtgg cgctttctca tagctcacgc tgtaggtatc tcagttcggt
gtaggtcgtt 660cgctccaagc tgggctgtgt gcacgaaccc cccgttcagc
ccgaccgctg cgccttatcc 720ggtaactatc gtcttgagtc caacccggta
agacacgact tatcgccact ggcagcagcc 780actggtaaca ggattagcag
agcgaggtat gtaggcggtg ctacagagtt cttgaagtgg 840tggcctaact
acggctacac tagaaggaca gtatttggta tctgcgctct gctgaagcca
900gttaccttcg gaaaaagagt
tggtagctct tgatccggca aacaaaccac cgctggtagc 960ggtggttttt
ttgtttgcaa gcagcagatt acgcgcagaa aaaaaggatc tcaagaagat
1020cctttgatct tttctacggg gtctgacgct cagtggaacg aaaactcacg
ttaagggatt 1080ttggtcatga gattatcaaa aaggatcttc acctagatcc
ttttaaatta aaaatgaagt 1140tttaaatcaa tctaaagtat atatgagtaa
acttggtctg acagttacca atgcttaatc 1200agtgaggcac ctatctcagc
gatctgtcta tttcgttcat ccatagttgc ctgactcccc 1260gtcgtgtaga
taactacgat acgggagggc ttaccatctg gccccagtgc tgcaatgata
1320ccgcgagacc cacgctcacc ggctccagat ttatcagcaa taaaccagcc
agccggaagg 1380gccgagcgca gaagtggtcc tgcaacttta tccgcctcca
tccagtctat taattgttgc 1440cgggaagcta gagtaagtag ttcgccagtt
aatagtttgc gcaacgttgt tgccattgct 1500acaggcatcg tggtgtcacg
ctcgtcgttt ggtatggctt cattcagctc cggttcccaa 1560cgatcaaggc
gagttacatg atcccccatg ttgtgcaaaa aagcggttag ctccttcggt
1620cctccgatcg ttgtcagaag taagttggcc gcagtgttat cactcatggt
tatggcagca 1680ctgcataatt ctcttactgt catgccatcc gtaagatgct
tttctgtgac tggtgagtac 1740tcaaccaagt cattctgaga atagtgtatg
cggcgaccga gttgctcttg cccggcgtca 1800atacgggata ataccgcgcc
acatagcaga actttaaaag tgctcatcat tggaaaacgt 1860tcttcggggc
gaaaactctc aaggatctta ccgctgttga gatccagttc gatgtaaccc
1920actcgtgcac ccaactgatc ttcagcatct tttactttca ccagcgtttc
tgggtgagca 1980aaaacaggaa ggcaaaatgc cgcaaaaaag ggaataaggg
cgacacggaa atgttgaata 2040ctcatactct tcctttttca atattattga
agcatttatc agggttattg tctcatgagc 2100ggatacatat ttgaatgtat
ttagaaaaat aaacaaatag gggttccgcg cacatttccc 2160cgaaaagtgc
cacctgacgc gccctgtagc ggcgcattaa gcgcggcggg tgtggtggtt
2220acgcgcagcg tgaccgctac acttgccagc gccctagcgc ccgctccttt
cgctttcttc 2280ccttcctttc tcgccacgtt cgccggcttt ccccgtcaag
ctctaaatcg ggggctccct 2340ttagggttcc gatttagtgc tttacggcac
ctcgacccca aaaaacttga ttagggtgat 2400ggttcacgta gtgggccatc
gccctgatag acggtttttc gccctttgac gttggagtcc 2460acgttcttta
atagtggact cttgttccaa actggaacaa cactcaaccc tatctcggtc
2520tattcttttg atttataagg gattttgccg atttcggcct attggttaaa
aaatgagctg 2580atttaacaaa aatttaacgc gaattttaac aaaatattaa
cgcttacaat ttccattcgc 2640cattcaggct gcgcaactgt tgggaagggc
gatcggtgcg ggcctcttcg ctattacgcc 2700agctggcgaa agggggatgt
gctgcaaggc gattaagttg ggtaacgcca gggttttccc 2760agtcacgacg
ttgtaaaacg acggccagtg aattgtaata cgactcacta tagggcgaat
2820tgggtaccgg gccccccctc gaggtcgacg agtatctgtc tgactcgtca
ttgccgcctt 2880tggagtacga ctccaactat gagtgtgctt ggatcacttt
gacgatacat tcttcgttgg 2940aggctgtggg tctgacagct gcgttttcgg
cgcggttggc cgacaacaat atcagctgca 3000acgtcattgc tggctttcat
catgatcaca tttttgtcgg caaaggcgac gcccagagag 3060ccattgacgt
tctttctaat ttggaccgat agccgtatag tccagtctat ctataagttc
3120aactaactcg taactattac cataacatat acttcactgc cccagataag
gttccgataa 3180aaagttctgc agactaaatt tatttcagtc tcctcttcac
caccaaaatg ccctcctacg 3240aagctcgagt gctcaagctc gtggcagcca
agaaaaccaa cctgtgtgct tctctggatg 3300ttaccaccac caaggagctc
attgagcttg ccgataaggt cggaccttat gtgtgcatga 3360tcaaaaccca
tatcgacatc attgacgact tcacctacgc cggcactgtg ctccccctca
3420aggaacttgc tcttaagcac ggtttcttcc tgttcgagga cagaaagttc
gcagatattg 3480gcaacactgt caagcaccag taccggtgtc accgaatcgc
cgagtggtcc gatatcacca 3540acgcccacgg tgtacccgga accggaatcg
attgctggcc tgcgagctgg tgcgtacgag 3600gaaactgtct ctgaacagaa
gaaggaggac gtctctgact acgagaactc ccagtacaag 3660gagttcctag
tcccctctcc caacgagaag ctggccagag gtctgctcat gctggccgag
3720ctgtcttgca agggctctct ggccactggc gagtactcca agcagaccat
tgagcttgcc 3780cgatccgacc ccgagtttgt ggttggcttc attgcccaga
accgacctaa gggcgactct 3840gaggactggc ttattctgac ccccggggtg
ggtcttgacg acaagggaga cgctctcgga 3900cagcagtacc gaactgttga
ggatgtcatg tctaccggaa cggatatcat aattgtcggc 3960cgaggtctgt
acggccagaa ccgagatcct attgaggagg ccaagcgata ccagaaggct
4020ggctgggagg cttaccagaa gattaactgt tagaggttag actatggata
tgtaatttaa 4080ctgtgtatat agagagcgtg caagtatgga gcgcttgttc
agcttgtatg atggtcagac 4140gacctgtctg atcgagtatg tatgatactg
cacaacctgt gtatccgcat gatctgtcca 4200atggggcatg ttgttgtgtt
tctcgatacg gagatgctgg gtacagtgct aatacgttga 4260actacttata
cttatatgag gctcgaagaa agctgacttg tgtatgactt aat
43137115966DNAArtificial SequencePlasmid pZKD2-5U89A2 71gtacgtttca
tgaaggcggg cagaaagtac tcgatggtgg agatgattgc tcggaggtac 60ttgttctgcg
gccagtatct ctcagcaatc aggtgatact cctggacgtc cagagggtag
120tatgtgtgcg tgggctccag atccaccgtc ttgtgcagag ttatggggaa
gtagcggcca 180aagagcttcc agatgaagaa gtttcttgaa ataggcgagt
atcgcttgac cactcctccg 240ttggacgggg agtcgtcttt aacagcgtac
actacatacg caatcacaaa tggccagagc 300agtggaattg cgcagcatag
catgaaaatt gtgaggaaag tgggaatgct gaaaatgtgc 360cagaccagag
agaaggtctc acatcggttg agtaatggtg tcgatagcgg ggcatatcgg
420attcccgcga ttttgggtgc cgtgtcgttt ttgtctcgcg acttgtagta
ttgtgagtcg 480atagtcatag cttttgtttt gtgtgacttg tctgttgcct
gttgttagaa gaaaaagtgg 540gagcttatca gtcacggtcc acgaacgatt
tcgtacttgt acgtaattgg tcgtgagaac 600tgttgcagag ccggtgcttt
tttttgtggc caagtcgaca ggtcgatttc ggcgctgtgc 660gaggttgctg
ggatgtgctg gtttggctgc caaatgtggg gaagatttca acctcggatt
720tgacgtgtgt agaggcgcgc cagctgcatt aatgaatcgg ccaacgcgcg
gggagaggcg 780gtttgcgtat tgggcgctct tccgcttcct cgctcactga
ctcgctgcgc tcggtcgttc 840ggctgcggcg agcggtatca gctcactcaa
aggcggtaat acggttatcc acagaatcag 900gggataacgc aggaaagaac
atgtgagcaa aaggccagca aaaggccagg aaccgtaaaa 960aggccgcgtt
gctggcgttt ttccataggc tccgcccccc tgacgagcat cacaaaaatc
1020gacgctcaag tcagaggtgg cgaaacccga caggactata aagataccag
gcgtttcccc 1080ctggaagctc cctcgtgcgc tctcctgttc cgaccctgcc
gcttaccgga tacctgtccg 1140cctttctccc ttcgggaagc gtggcgcttt
ctcatagctc acgctgtagg tatctcagtt 1200cggtgtaggt cgttcgctcc
aagctgggct gtgtgcacga accccccgtt cagcccgacc 1260gctgcgcctt
atccggtaac tatcgtcttg agtccaaccc ggtaagacac gacttatcgc
1320cactggcagc agccactggt aacaggatta gcagagcgag gtatgtaggc
ggtgctacag 1380agttcttgaa gtggtggcct aactacggct acactagaag
aacagtattt ggtatctgcg 1440ctctgctgaa gccagttacc ttcggaaaaa
gagttggtag ctcttgatcc ggcaaacaaa 1500ccaccgctgg tagcggtggt
ttttttgttt gcaagcagca gattacgcgc agaaaaaaag 1560gatctcaaga
agatcctttg atcttttcta cggggtctga cgctcagtgg aacgaaaact
1620cacgttaagg gattttggtc atgagattat caaaaaggat cttcacctag
atccttttaa 1680attaaaaatg aagttttaaa tcaatctaaa gtatatatga
gtaaacttgg tctgacagtt 1740accaatgctt aatcagtgag gcacctatct
cagcgatctg tctatttcgt tcatccatag 1800ttgcctgact ccccgtcgtg
tagataacta cgatacggga gggcttacca tctggcccca 1860gtgctgcaat
gataccgcga gacccacgct caccggctcc agatttatca gcaataaacc
1920agccagccgg aagggccgag cgcagaagtg gtcctgcaac tttatccgcc
tccatccagt 1980ctattaattg ttgccgggaa gctagagtaa gtagttcgcc
agttaatagt ttgcgcaacg 2040ttgttgccat tgctacaggc atcgtggtgt
cacgctcgtc gtttggtatg gcttcattca 2100gctccggttc ccaacgatca
aggcgagtta catgatcccc catgttgtgc aaaaaagcgg 2160ttagctcctt
cggtcctccg atcgttgtca gaagtaagtt ggccgcagtg ttatcactca
2220tggttatggc agcactgcat aattctctta ctgtcatgcc atccgtaaga
tgcttttctg 2280tgactggtga gtactcaacc aagtcattct gagaatagtg
tatgcggcga ccgagttgct 2340cttgcccggc gtcaatacgg gataataccg
cgccacatag cagaacttta aaagtgctca 2400tcattggaaa acgttcttcg
gggcgaaaac tctcaaggat cttaccgctg ttgagatcca 2460gttcgatgta
acccactcgt gcacccaact gatcttcagc atcttttact ttcaccagcg
2520tttctgggtg agcaaaaaca ggaaggcaaa atgccgcaaa aaagggaata
agggcgacac 2580ggaaatgttg aatactcata ctcttccttt ttcaatatta
ttgaagcatt tatcagggtt 2640attgtctcat gagcggatac atatttgaat
gtatttagaa aaataaacaa ataggggttc 2700cgcgcacatt tccccgaaaa
gtgccacctg atgcggtgtg aaataccgca cagatgcgta 2760aggagaaaat
accgcatcag gaaattgtaa gcgttaatat tttgttaaaa ttcgcgttaa
2820atttttgtta aatcagctca ttttttaacc aataggccga aatcggcaaa
atcccttata 2880aatcaaaaga atagaccgag atagggttga gtgttgttcc
agtttggaac aagagtccac 2940tattaaagaa cgtggactcc aacgtcaaag
ggcgaaaaac cgtctatcag ggcgatggcc 3000cactacgtga accatcaccc
taatcaagtt ttttggggtc gaggtgccgt aaagcactaa 3060atcggaaccc
taaagggagc ccccgattta gagcttgacg gggaaagccg gcgaacgtgg
3120cgagaaagga agggaagaaa gcgaaaggag cgggcgctag ggcgctggca
agtgtagcgg 3180tcacgctgcg cgtaaccacc acacccgccg cgcttaatgc
gccgctacag ggcgcgtcca 3240ttcgccattc aggctgcgca actgttggga
agggcgatcg gtgcgggcct cttcgctatt 3300acgccagctg gcgaaagggg
gatgtgctgc aaggcgatta agttgggtaa cgccagggtt 3360ttcccagtca
cgacgttgta aaacgacggc cagtgaattg taatacgact cactataggg
3420cgaattgggc ccgacgtcgc atgcatcaaa ggaagggtga atccaaggaa
gttcttgaca 3480aactgctgga atcggtacag cttggacgac ttgtcgttgc
taacctggtc atagaggtcg 3540ttctcaccaa aggccatgat gggaacaagg
gcgacatttc cgacctccat accaagtcga 3600acaaaaccct ttcgcttgag
tagcaccagg tccatgacac cgggtctggc cagaagactt 3660tcctgtgctc
caccaacgac aatgcagata gactggtttc gcttgaggag ggccttgcag
3720gacttcttgg agacagaagc gactcccaga ctcatgaggt actctctgta
gagaggcact 3780cggaagttgt tggtgagagt cataagagaa acagggatgc
ccggaaagag cttggaccat 3840ccagctccct cggtggcaat tccaccaaag
gctcccatgc cgataatgcc gtgggggtgg 3900tagccgaaga tgtattttct
gccagtgggc ttgagttttg tgggcgacag ctgtgggtcg 3960ttttcgccaa
tgatctggtt ggcgtaggag ttgagggacc cgttaagaag cgtggaatca
4020gatgcagtgg agccagcaga ggcggacgac aaaggtcgtc ggttagtggt
gccattgttg 4080ccgttgccgt taagttcgga gcccgaggcg tggccgttgg
agccagatga ttctccacgg 4140ctatatctgc tgtcgtggtt aattaactca
cctgcaggat tgagactatg aatggattcc 4200cgtgcccgta ttactctact
aatttgatct tggaacgcga aaatacgttt ctaggactcc 4260aaagaatctc
aactcttgtc cttactaaat atactaccca tagttgatgg tttacttgaa
4320cagagaggac atgttcactt gacccaaagt ttctcgcatc tcttggatat
ttgaacaacg 4380gcgtccactg accgtcagtt atccagtcac aaaaccccca
cattcataca ttcccatgta 4440cgtttacaaa gttctcaatt ccatcgtgca
aatcaaaatc acatctattc attcatcata 4500tataaaccca tcatgtctac
taacactcac aactccatag aaaacatcga ctcagaacac 4560acgctccatg
cggccgctta ggaatcctga gcgtccttga cacagtgaac cacaccgact
4620ttgtgcatgt acttgagggt ggaaatgatg ttgcccacaa tggtagggta
gaagacgtac 4680cgaactccgt gtcgttcgca acactctcgg acagcttgct
gcacgaaggg atagtgccaa 4740gacgacattc gaggaaagag gtgatgctcg
atctggaagt tgagaccgcc agtaaagaac 4800atggcaatgg gtccaccgta
ggtggaagag gtctccacct gagctctgta ccagtcgatc 4860tgatcggctt
caacgtcctt ctcggagctc ttgaccttgc agttcttgtc ggggattcgc
4920tccgagccat cgaagttgtg agacaagatg aaaaagaagg tgaggaaggc
accggtagca 4980gtgggcacca gaggaatggt gatgagcagg gaggttccag
tgagatacca gggcaagaag 5040gcggttcgaa agatgaagaa agctcgcata
acgaatgcaa gggttcggta ccgtcgcaga 5100aagccgttct ctcgcatggc
tgtgacagac tcgggaatgg tgtcgttgtg ctgcattcgg 5160aagatgtaga
gagggttgta caccagcgaa acgccgtagg ctccaagcac gaggtacatg
5220taccaggcct ggaatcggtg aaaccacttt cgagcagtgt tggcagcagg
gtagttgtgg 5280aacacaagga atggttctgc ggactcggca tccaggtcga
gaccatgctg attggtgtag 5340gtgtgatgtc gcatgatgtg agactgcagc
cagatccatc tggacgatcc aatgacgtcg 5400atgccgtagg caaagagagc
gttgacccag ggctttttgc tgatggcacc atgagaggca 5460tcgtgctgaa
tggacaggcc gatctgcatg tgcatgaatc cagtcaagag accccacagc
5520accattccgg tagtagccca gtgccactcg caaaaggcgg tgacagcaat
gatgccaacg 5580gttcgcagcc agaatccagg tgtggcatac cagttccgac
ctttcatgac ctctcgcata 5640gttcgcttga cgtcctgtgc aaagggagag
tcgtaggtgt agacaatgtc cttggaggtt 5700cggtcgtgct tgcctcgcac
gaactgttga agcagcttcg agttctcggg cttgacgtaa 5760gggtgcatgg
agtagaacag aggagaagca tcggaggcac cagaagcgag gatcaagtcg
5820cctccgggat ggaccttggc aagaccttcc agatcgtaga gaatgccgtc
gatggcaacc 5880aggtcgggtc gctcgagcag ctgctcggta gtaagggaga
gagccatggc cattgctgta 5940gatatgtctt gtgtgtaagg gggttggggt
ggttgtttgt gttcttgact tttgtgttag 6000caagggaaga cgggcaaaaa
agtgagtgtg gttgggaggg agagacgagc cttatatata 6060atgcttgttt
gtgtttgtgc aagtggacgc cgaaacgggc aggagccaaa ctaaacaagg
6120cagacaatgc gagcttaatt ggattgcctg atgggcaggg gttagggctc
gatcaatggg 6180ggtgcgaagt gacaaaattg ggaattaggt tcgcaagcaa
ggctgacaag actttggccc 6240aaacatttgt acgcggtgga caacaggagc
cacccatcgt ctgtcacggg ctagccggtc 6300gtgcgtcctg tcaggctcca
cctaggctcc atgccactcc atacaatccc actagtgtac 6360cgctaggccg
cttttagctc ccatctaaga cccccccaaa acctccactg tacagtgcac
6420tgtactgtgt ggcgatcaag ggcaagggaa aaaaggcgca aacatgcacg
catggaatga 6480cgtaggtaag gcgttactag actgaaaagt ggcacatttc
ggcgtgccaa agggtcctag 6540gtgcgtttcg cgagctgggc gccaggccaa
gccgctccaa aacgcctctc cgactccctc 6600cagcggcctc catatcccca
tccctctcca cagcaatgtt gttaagcctt gcaaacgaaa 6660aaatagaaag
gctaataagc ttccaatatt gtggtgtacg ctgcataacg caacaatgag
6720cgccaaacaa cacacacaca cagcacacag cagcattaac cacgatgaac
agcatgaatt 6780ctctctcttg agcttttcca taacaagttc ttctgcctcc
aggaagtcca tgggtggttt 6840gatcatggtt ttggtgtagt ggtagtgcag
tggtggtatt gtgactgggg atgtagttga 6900gaataagtca tacacaagtc
agctttcttc gagcctcata taagtataag tagttcaacg 6960tattagcact
gtacccagca tctccgtatc gagaaacaca acaacatgcc ccattggaca
7020gatcatgcgg atacacaggt tgtgcagtat catacatact cgatcagaca
ggtcgtctga 7080ccatcataca agctgaacaa gcgctccata cttgcacgct
ctctatatac acagttaaat 7140tacatatcca tagtctaacc tctaacagtt
aatcttctgg taagcctccc agccagcctt 7200ctggtatcgc ttggcctcct
caataggatc tcggttctgg ccgtacagac ctcggccgac 7260aattatgata
tccgttccgg tagacatgac atcctcaaca gttcggtact gctgtccgag
7320agcgtctccc ttgtcgtcaa gacccacccc gggggtcaga ataagccagt
cctcagagtc 7380gcccttaggt cggttctggg caatgaagcc aaccacaaac
tcggggtcgg atcgggcaag 7440ctcaatggtc tgcttggagt actcgccagt
ggccagagag cccttgcaag acagctcggc 7500cagcatgagc agacctctgg
ccagcttctc gttgggagag gggactagga actccttgta 7560ctgggagttc
tcgtagtcag agacgtcctc cttcttctgt tcagagacag tttcctcggc
7620accagctcgc aggccagcaa tgattccggt tccgggtaca ccgtgggcgt
tggtgatatc 7680ggaccactcg gcgattcggt gacaccggta ctggtgcttg
acagtgttgc caatatctgc 7740gaactttctg tcctcgaaca ggaagaaacc
gtgcttaaga gcaagttcct tgagggggag 7800cacagtgccg gcgtaggtga
agtcgtcaat gatgtcgata tgggttttga tcatgcacac 7860ataaggtccg
accttatcgg caagctcaat gagctccttg gtggtggtaa catccagaga
7920agcacacagg ttggttttct tggctgccac gagcttgagc actcgagcgg
caaaggcgga 7980cttgtggacg ttagctcgag cttcgtagga gggcattttg
gtggtgaaga ggagactgaa 8040ataaatttag tctgcagaac tttttatcgg
aaccttatct ggggcagtga agtatatgtt 8100atggtaatag ttacgagtta
gttgaactta tagatagact ggactatacg gctatcggtc 8160caaattagaa
agaacgtcaa tggctctctg ggcgtcgcct ttgccgacaa aaatgtgatc
8220atgatgaaag ccagcaatga cgttgcagct gatattgttg tcggccaacc
gcgccgaaaa 8280cgcagctgtc agacccacag cctccaacga agaatgtatc
gtcaaagtga tccaagcaca 8340ctcatagttg gagtcgtact ccaaaggcgg
caatgacgag tcagacagat actcgtcgac 8400cttttccttg ggaaccacca
ccgtcagccc ttctgactca cgtattgtag ccaccgacac 8460aggcaacagt
ccgtggatag cagaatatgt cttgtcggtc catttctcac caactttagg
8520cgtcaagtga atgttgcaga agaagtatgt gccttcattg agaatcggtg
ttgctgattt 8580caataaagtc ttgagatcag tttggccagt catgttgtgg
ggggtaattg gattgagtta 8640tcgcctacag tctgtacagg tatactcgct
gcccacttta tactttttga ttccgctgca 8700cttgaagcaa tgtcgtttac
caaaagtgag aatgctccac agaacacacc ccagggtatg 8760gttgagcaaa
aaataaacac tccgatacgg ggaatcgaac cccggtctcc acggttctca
8820agaagtattc ttgatgagag cgtatcgata gttggagcaa gggagaaatg
tagagtgtga 8880aagactcact atggtccggg cttatctcga ccaatagcca
aagtctggag tttctgagag 8940aaaaaggcaa gatacgtatg taacaaagcg
acgcatggta caataatacc ggaggcatgt 9000atcatagaga gttagtggtt
cgatgatggc actggtgcct ggtatgactt tatacggctg 9060actacatatt
tgtcctcaga catacaatta cagtcaagca cttacccttg gacatctgta
9120ggtacccccc ggccaagacg atctcagcgt gtcgtatgtc ggattggcgt
agctccctcg 9180ctcgtcaatt ggctcccatc tactttcttc tgcttggcta
cacccagcat gtctgctatg 9240gctcgttttc gtgccttatc tatcctccca
gtattaccaa ctctaaatga catgatgtga 9300ttgggtctac actttcatat
cagagataag gagtagcaca gttgcataaa aagcccaact 9360ctaatcagct
tcttcctttc ttgtaattag tacaaaggtg attagcgaaa tctggaagct
9420tagttggccc taaaaaaatc aaaaaaagca aaaaacgaaa aacgaaaaac
cacagttttg 9480agaacaggga ggtaacgaag gatcgtatat atatatatat
atatatatac ccacggatcc 9540cgagaccggc ctttgattct tccctacaac
caaccattct caccacccta attcacaacc 9600atggctgccg tcatcgaggt
ggccaacgag ttcgtcgcta tcactgccga gacccttccc 9660aaggtggact
atcagcgact ctggcgagac atctactcct gcgagctcct gtacttctcc
9720attgctttcg tcatcctcaa gtttaccctt ggcgagctct cggattctgg
caaaaagatt 9780ctgcgagtgc tgttcaagtg gtacaacctc ttcatgtccg
tcttttcgct ggtgtccttc 9840ctctgtatgg gttacgccat ctacaccgtt
ggactgtact ccaacgaatg cgacagagct 9900ttcgacaaca gcttgttccg
atttgccacc aaggtcttct actattccaa gtttctggag 9960tacatcgact
ctttctacct tcccctcatg gccaagcctc tgtcctttct gcagttcttt
10020catcacttgg gagctcctat ggacatgtgg ctcttcgtgc agtactctgg
cgaatccatt 10080tggatctttg tgttcctgaa cggattcatt cactttgtca
tgtacggcta ctattggaca 10140cggctgatga agttcaactt tcccatgccc
aagcagctca ttaccgcaat gcagatcacc 10200cagttcaacg ttggcttcta
cctcgtgtgg tggtacaagg acattccctg ttaccgaaag 10260gatcccatgc
gaatgctggc ctggatcttc aactactggt acgtcggtac cgttcttctg
10320ctcttcatca acttctttgt caagtcctac gtgtttccca agcctaagac
tgccgacaaa 10380aaggtccagt agcggccgca tgtacataca agattattta
tagaaatgaa tcgcgatcga 10440acaaagagta cgagtgtacg agtaggggat
gatgataaaa gtggaagaag ttccgcatct 10500ttggatttat caacgtgtag
gacgatactt cctgtaaaaa tgcaatgtct ttaccatagg 10560ttctgctgta
gatgttatta actaccatta acatgtctac ttgtacagtt gcagaccagt
10620tggagtatag aatggtacac ttaccaaaaa gtgttgatgg ttgtaactac
gatatataaa 10680actgttgacg ggatctgtat attcggtaag atatattttg
tggggtttta gtggtgttta 10740aacaccacta aaaccccaca aaatatatct
taccgaatat acagatctac tatagaggaa 10800caattgcccc ggagaagacg
gccaggccgc ctagatgaca aattcaacaa ctcacagctg 10860actttctgcc
attgccacta ggggggggcc tttttatatg gccaagccaa gctctccacg
10920tcggttgggc tgcacccaac aataaatggg tagggttgca ccaacaaagg
gatgggatgg 10980ggggtagaag atacgaggat aacggggctc aatggcacaa
ataagaacga atactgccat 11040taagactcgt gatccagcga ctgacaccat
tgcatcatct aagggcctca aaactacctc 11100ggaactgctg cgctgatctg
gacaccacag aggttccgag cactttaggt tgcaccaaat 11160gtcccaccag
gtgcaggcag aaaacgctgg aacagcgtgt acagtttgtc ttaacaaaaa
11220gtgagggcgc tgaggtcgag cagggtggtg tgacttgtta tagcctttag
agctgcgaaa 11280gcgcgtatgg atttggctca tcaggccaga ttgagggtct
gtggacacat gtcatgttag 11340tgtacttcaa tcgccccctg gatatagccc
cgacaatagg ccgtggcctc atttttttgc 11400cttccgcaca tttccattgc
tcggtaccca caccttgctt ctcctgcact tgccaacctt 11460aatactggtt
tacattgacc aacatcttac aagcgggggg cttgtctagg gtatatataa
11520acagtggctc tcccaatcgg ttgccagtct cttttttcct ttctttcccc
acagattcga 11580aatctaaact acacatcaca caatgcctgt
tactgacgtc cttaagcgaa agtccggtgt 11640catcgtcggc gacgatgtcc
gagccgtgag tatccacgac aagatcagtg tcgagacgac 11700gcgttttgtg
taatgacaca atccgaaagt cgctagcaac acacactctc tacacaaact
11760aacccagctc tccatggtga aggcttctcg acaggctctg cccctcgtca
tcgacggaaa 11820ggtgtacgac gtctccgctt gggtgaactt ccaccctggt
ggagctgaaa tcattgagaa 11880ctaccaggga cgagatgcta ctgacgcctt
catggttatg cactctcagg aagccttcga 11940caagctcaag cgaatgccca
agatcaacca ggcttccgag ctgcctcccc aggctgccgt 12000caacgaagct
caggaggatt tccgaaagct ccgagaagag ctgatcgcca ctggcatgtt
12060tgacgcctct cccctctggt actcgtacaa gatcttgacc accctgggtc
ttggcgtgct 12120tgccttcttc atgctggtcc agtaccacct gtacttcatt
ggtgctctcg tgctcggtat 12180gcactaccag caaatgggat ggctgtctca
tgacatctgc caccaccaga ccttcaagaa 12240ccgaaactgg aataacgtcc
tgggtctggt ctttggcaac ggactccagg gcttctccgt 12300gacctggtgg
aaggacagac acaacgccca tcattctgct accaacgttc agggtcacga
12360tcccgacatt gataacctgc ctctgctcgc ctggtccgag gacgatgtca
ctcgagcttc 12420tcccatctcc cgaaagctca ttcagttcca acagtactat
ttcctggtca tctgtattct 12480cctgcgattc atctggtgtt tccagtctgt
gctgaccgtt cgatccctca aggaccgaga 12540caaccagttc taccgatctc
agtacaagaa agaggccatt ggactcgctc tgcactggac 12600tctcaagacc
ctgttccacc tcttctttat gccctccatc ctgacctcga tgctggtgtt
12660ctttgtttcc gagctcgtcg gtggcttcgg aattgccatc gtggtcttca
tgaaccacta 12720ccctctggag aagatcggtg attccgtctg ggacggacat
ggcttctctg tgggtcagat 12780ccatgagacc atgaacattc gacgaggcat
cattactgac tggttctttg gaggcctgaa 12840ctaccagatc gagcaccatc
tctggcccac cctgcctcga cacaacctca ctgccgtttc 12900ctaccaggtg
gaacagctgt gccagaagca caacctcccc taccgaaacc ctctgcccca
12960tgaaggtctc gtcatcctgc tccgatacct gtcccagttc gctcgaatgg
ccgagaagca 13020gcccggtgcc aaggctcagt aagcggccgc atgagaagat
aaatatataa atacattgag 13080atattaaatg cgctagatta gagagcctca
tactgctcgg agagaagcca agacgagtac 13140tcaaagggga ttacaccatc
catatccaca gacacaagct ggggaaaggt tctatataca 13200ctttccggaa
taccgtagtt tccgatgtta tcaatggggg cagccaggat ttcaggcact
13260tcggtgtctc ggggtgaaat ggcgttcttg gcctccatca agtcgtacca
tgtcttcatt 13320tgcctgtcaa agtaaaacag aagcagatga agaatgaact
tgaagtgaag gaatttaaat 13380agttggagca agggagaaat gtagagtgtg
aaagactcac tatggtccgg gcttatctcg 13440accaatagcc aaagtctgga
gtttctgaga gaaaaaggca agatacgtat gtaacaaagc 13500gacgcatggt
acaataatac cggaggcatg tatcatagag agttagtggt tcgatgatgg
13560cactggtgcc tggtatgact ttatacggct gactacatat ttgtcctcag
acatacaatt 13620acagtcaagc acttaccctt ggacatctgt aggtaccccc
cggccaagac gatctcagcg 13680tgtcgtatgt cggattggcg tagctccctc
gctcgtcaat tggctcccat ctactttctt 13740ctgcttggct acacccagca
tgtctgctat ggctcgtttt cgtgccttat ctatcctccc 13800agtattacca
actctaaatg acatgatgtg attgggtcta cactttcata tcagagataa
13860ggagtagcac agttgcataa aaagcccaac tctaatcagc ttcttccttt
cttgtaatta 13920gtacaaaggt gattagcgaa atctggaagc ttagttggcc
ctaaaaaaat caaaaaaagc 13980aaaaaacgaa aaacgaaaaa ccacagtttt
gagaacaggg aggtaacgaa ggatcgtata 14040tatatatata tatatatata
cccacggatc ccgagaccgg cctttgattc ttccctacaa 14100ccaaccattc
tcaccaccct aattcacaac catggcctcc acctcggctc tgcccaagca
14160gaaccctgcc ctccgacgaa ccgtcacttc caccactgtg accgactcgg
agtctgctgc 14220cgtctctccc tccgattctc ccagacactc ggcctcctct
acatcgctgt cttccatgtc 14280cgaggtggac attgccaagc ccaagtccga
gtacggtgtc atgctggata cctacggcaa 14340ccagttcgaa gttcccgact
tcaccatcaa ggacatctac aacgctattc ccaagcactg 14400cttcaagcga
tctgctctca agggatacgg ctacattctt cgagacattg tcctcctgac
14460taccactttc agcatctggt acaactttgt gacacccgag tacattccct
ccactcctgc 14520tcgagccggt ctgtgggctg tgtacaccgt tcttcaggga
ctcttcggta ctggactgtg 14580ggtcattgcc cacgagtgtg gacatggtgc
tttctccgat tcccgaatca tcaacgacat 14640tactggctgg gtgcttcact
cttccctgct tgttccctac ttcagctggc aaatctccca 14700ccggaagcat
cacaaggcca ctggaaacat ggagcgagac atggtcttcg ttcctcgaac
14760ccgagagcag caagctactc gactcggcaa gatgacccac gaactcgccc
atcttaccga 14820ggaaactcct gctttcaccc tgctcatgct tgtgcttcag
caactggtcg gttggcccaa 14880ctatctcatt accaacgtta ctggacacaa
ctaccatgag cggcagcgag agggtcgagg 14940caagggaaag cacaacggtc
ttggcggtgg agttaaccat ttcgatcccc gatctcctct 15000gtacgagaac
agcgacgcca agctcatcgt gctctccgac attggcattg gtcttatggc
15060caccgctctg tactttctcg ttcagaagtt cggattctac aacatggcca
tctggtactt 15120cgttccctac ttgtgggtta accactggct cgtcgccatt
acctttctgc agcacacaga 15180tcctactctt ccccactaca ccaacgacga
gtggaacttt gtgcgaggtg ccgctgcaac 15240catcgaccga gagatgggct
tcattggacg tcatctgctc cacggcatta tcgagactca 15300cgtcctgcat
cactacgtct cttccattcc cttctacaat gcggacgaag ctaccgaggc
15360catcaaacct atcatgggca agcactatcg agctgatgtc caggacggtc
ctcgaggatt 15420cattcgagcc atgtaccgat ctgcacgaat gtgccagtgg
gttgaaccct ccgctggtgc 15480cgagggagct ggcaagggtg tcctgttctt
tcgaaaccga aacaatgtgg gcactcctcc 15540cgctgtcatc aagcccgttg
cctaagcggc cgctatttat cactctttac aacttctacc 15600tcaactatct
actttaataa atgaatatcg tttattctct atgattactg tatatgcgtt
15660cctctaagac aaatcgaaac cagcatgtga tcgaatggca tacaaaagtt
tcttccgaag 15720ttgatcaatg tcctgatagt caggcagctt gagaagattg
acacaggtgg aggccgtagg 15780gaaccgatca acctgtctac cagcgttacg
aatggcaaat gacgggttca aagccttgaa 15840tccttgcaat ggtgccttgg
atactgatgt cacaaactta agaagcagcc gcttgtcctc 15900ttcctcgatc
gatggtcata gctgtttcct gtgtgaaatt gttatccgct cacaattcca 15960cacaac
15966722119DNAYarrowia lipolyticaCDS(291)..(1835)DGAT2 opening
reading frame, comprising 2 smaller internal opening reading frames
72aaacgcaccc actgctcgtc ctccttgctc ctcgaaaccg actcctctac acacgtcaaa
60tccgaggttg aaatcttccc cacatttggc agccaaacca gcacatccca gcaacctcgc
120acagcgccga aatcgacctg tcgacttggc cacaaaaaaa agcaccggct
ctgcaacagt 180tctcacgacc aattacgtac aagtacgaaa tcgttcgtgg
accgtgactg ataagctccc 240actttttctt ctaacaacag gcaacagaca
agtcacacaa aacaaaagct atg act 296 Met Thr 1atc gac tca caa tac tac
aag tcg cga gac aaa aac gac acg gca ccc 344Ile Asp Ser Gln Tyr Tyr
Lys Ser Arg Asp Lys Asn Asp Thr Ala Pro 5 10 15aaa atc gcg gga atc
cga tat gcc ccg cta tcg aca cca tta ctc aac 392Lys Ile Ala Gly Ile
Arg Tyr Ala Pro Leu Ser Thr Pro Leu Leu Asn 20 25 30cga tgt gag acc
ttc tct ctg gtc tgg cac att ttc agc att ccc act 440Arg Cys Glu Thr
Phe Ser Leu Val Trp His Ile Phe Ser Ile Pro Thr35 40 45 50ttc ctc
aca att ttc atg cta tgc tgc gca att cca ctg ctc tgg cca 488Phe Leu
Thr Ile Phe Met Leu Cys Cys Ala Ile Pro Leu Leu Trp Pro 55 60 65ttt
gtg att gcg tat gta gtg tac gct gtt aaa gac gac tcc ccg tcc 536Phe
Val Ile Ala Tyr Val Val Tyr Ala Val Lys Asp Asp Ser Pro Ser 70 75
80aac gga gga gtg gtc aag cga tac tcg cct att tca aga aac ttc ttc
584Asn Gly Gly Val Val Lys Arg Tyr Ser Pro Ile Ser Arg Asn Phe Phe
85 90 95atc tgg aag ctc ttt ggc cgc tac ttc ccc ata act ctg cac aag
acg 632Ile Trp Lys Leu Phe Gly Arg Tyr Phe Pro Ile Thr Leu His Lys
Thr 100 105 110gtg gat ctg gag ccc acg cac aca tac tac cct ctg gac
gtc cag gag 680Val Asp Leu Glu Pro Thr His Thr Tyr Tyr Pro Leu Asp
Val Gln Glu115 120 125 130tat cac ctg att gct gag aga tac tgg ccg
cag aac aag tac ctc cga 728Tyr His Leu Ile Ala Glu Arg Tyr Trp Pro
Gln Asn Lys Tyr Leu Arg 135 140 145gca atc atc tcc acc atc gag tac
ttt ctg ccc gcc ttc atg aaa cgg 776Ala Ile Ile Ser Thr Ile Glu Tyr
Phe Leu Pro Ala Phe Met Lys Arg 150 155 160tct ctt tct atc aac gag
cag gag cag cct gcc gag cga gat cct ctc 824Ser Leu Ser Ile Asn Glu
Gln Glu Gln Pro Ala Glu Arg Asp Pro Leu 165 170 175ctg tct ccc gtt
tct ccc agc tct ccg ggt tct caa cct gac aag tgg 872Leu Ser Pro Val
Ser Pro Ser Ser Pro Gly Ser Gln Pro Asp Lys Trp 180 185 190att aac
cac gac agc aga tat agc cgt gga gaa tca tct ggc tcc aac 920Ile Asn
His Asp Ser Arg Tyr Ser Arg Gly Glu Ser Ser Gly Ser Asn195 200 205
210ggc cac gcc tcg ggc tcc gaa ctt aac ggc aac ggc aac aat ggc acc
968Gly His Ala Ser Gly Ser Glu Leu Asn Gly Asn Gly Asn Asn Gly Thr
215 220 225act aac cga cga cct ttg tcg tcc gcc tct gct ggc tcc act
gca tct 1016Thr Asn Arg Arg Pro Leu Ser Ser Ala Ser Ala Gly Ser Thr
Ala Ser 230 235 240gat tcc acg ctt ctt aac ggg tcc ctc aac tcc tac
gcc aac cag atc 1064Asp Ser Thr Leu Leu Asn Gly Ser Leu Asn Ser Tyr
Ala Asn Gln Ile 245 250 255att ggc gaa aac gac cca cag ctg tcg ccc
aca aaa ctc aag ccc act 1112Ile Gly Glu Asn Asp Pro Gln Leu Ser Pro
Thr Lys Leu Lys Pro Thr 260 265 270ggc aga aaa tac atc ttc ggc tac
cac ccc cac ggc att atc ggc atg 1160Gly Arg Lys Tyr Ile Phe Gly Tyr
His Pro His Gly Ile Ile Gly Met275 280 285 290gga gcc ttt ggt gga
att gcc acc gag gga gct gga tgg tcc aag ctc 1208Gly Ala Phe Gly Gly
Ile Ala Thr Glu Gly Ala Gly Trp Ser Lys Leu 295 300 305ttt ccg ggc
atc cct gtt tct ctt atg act ctc acc aac aac ttc cga 1256Phe Pro Gly
Ile Pro Val Ser Leu Met Thr Leu Thr Asn Asn Phe Arg 310 315 320gtg
cct ctc tac aga gag tac ctc atg agt ctg gga gtc gct tct gtc 1304Val
Pro Leu Tyr Arg Glu Tyr Leu Met Ser Leu Gly Val Ala Ser Val 325 330
335tcc aag aag tcc tgc aag gcc ctc ctc aag cga aac cag tct atc tgc
1352Ser Lys Lys Ser Cys Lys Ala Leu Leu Lys Arg Asn Gln Ser Ile Cys
340 345 350att gtc gtt ggt gga gca cag gaa agt ctt ctg gcc aga ccc
ggt gtc 1400Ile Val Val Gly Gly Ala Gln Glu Ser Leu Leu Ala Arg Pro
Gly Val355 360 365 370atg gac ctg gtg cta ctc aag cga aag ggt ttt
gtt cga ctt ggt atg 1448Met Asp Leu Val Leu Leu Lys Arg Lys Gly Phe
Val Arg Leu Gly Met 375 380 385gag gtc gga aat gtc gcc ctt gtt ccc
atc atg gcc ttt ggt gag aac 1496Glu Val Gly Asn Val Ala Leu Val Pro
Ile Met Ala Phe Gly Glu Asn 390 395 400gac ctc tat gac cag gtt agc
aac gac aag tcg tcc aag ctg tac cga 1544Asp Leu Tyr Asp Gln Val Ser
Asn Asp Lys Ser Ser Lys Leu Tyr Arg 405 410 415ttc cag cag ttt gtc
aag aac ttc ctt gga ttc acc ctt cct ttg atg 1592Phe Gln Gln Phe Val
Lys Asn Phe Leu Gly Phe Thr Leu Pro Leu Met 420 425 430cat gcc cga
ggc gtc ttc aac tac gat gtc ggt ctt gtc ccc tac agg 1640His Ala Arg
Gly Val Phe Asn Tyr Asp Val Gly Leu Val Pro Tyr Arg435 440 445
450cga ccc gtc aac att gtg gtt ggt tcc ccc att gac ttg cct tat ctc
1688Arg Pro Val Asn Ile Val Val Gly Ser Pro Ile Asp Leu Pro Tyr Leu
455 460 465cca cac ccc acc gac gaa gaa gtg tcc gaa tac cac gac cga
tac atc 1736Pro His Pro Thr Asp Glu Glu Val Ser Glu Tyr His Asp Arg
Tyr Ile 470 475 480gcc gag ctg cag cga atc tac aac gag cac aag gat
gaa tat ttc atc 1784Ala Glu Leu Gln Arg Ile Tyr Asn Glu His Lys Asp
Glu Tyr Phe Ile 485 490 495gat tgg acc gag gag ggc aaa gga gcc cca
gag ttc cga atg att gag 1832Asp Trp Thr Glu Glu Gly Lys Gly Ala Pro
Glu Phe Arg Met Ile Glu 500 505 510taa ggaaaactgc ctgggttagg
caaatagcta atgagtattt ttttgatggc 1885aaccaaatgt agaaagaaaa
aaaaaaaaaa agaaaaaaaa aagagaatat tatatctatg 1945taattctatt
aaaagctctg ttgagtgagc ggaataaata ctgttgaaga ggggattgtg
2005tagagatctg tttactcaat ggcaaactca tctgggggag atccttccac
tgtgggaagc 2065tcctggatag cctttgcatc ggggttcaag aagaccattg
tgaacagccc ttga 211973514PRTYarrowia lipolytica 73Met Thr Ile Asp
Ser Gln Tyr Tyr Lys Ser Arg Asp Lys Asn Asp Thr1 5 10 15Ala Pro Lys
Ile Ala Gly Ile Arg Tyr Ala Pro Leu Ser Thr Pro Leu 20 25 30Leu Asn
Arg Cys Glu Thr Phe Ser Leu Val Trp His Ile Phe Ser Ile 35 40 45Pro
Thr Phe Leu Thr Ile Phe Met Leu Cys Cys Ala Ile Pro Leu Leu 50 55
60Trp Pro Phe Val Ile Ala Tyr Val Val Tyr Ala Val Lys Asp Asp Ser65
70 75 80Pro Ser Asn Gly Gly Val Val Lys Arg Tyr Ser Pro Ile Ser Arg
Asn 85 90 95Phe Phe Ile Trp Lys Leu Phe Gly Arg Tyr Phe Pro Ile Thr
Leu His 100 105 110Lys Thr Val Asp Leu Glu Pro Thr His Thr Tyr Tyr
Pro Leu Asp Val 115 120 125Gln Glu Tyr His Leu Ile Ala Glu Arg Tyr
Trp Pro Gln Asn Lys Tyr 130 135 140Leu Arg Ala Ile Ile Ser Thr Ile
Glu Tyr Phe Leu Pro Ala Phe Met145 150 155 160Lys Arg Ser Leu Ser
Ile Asn Glu Gln Glu Gln Pro Ala Glu Arg Asp 165 170 175Pro Leu Leu
Ser Pro Val Ser Pro Ser Ser Pro Gly Ser Gln Pro Asp 180 185 190Lys
Trp Ile Asn His Asp Ser Arg Tyr Ser Arg Gly Glu Ser Ser Gly 195 200
205Ser Asn Gly His Ala Ser Gly Ser Glu Leu Asn Gly Asn Gly Asn Asn
210 215 220Gly Thr Thr Asn Arg Arg Pro Leu Ser Ser Ala Ser Ala Gly
Ser Thr225 230 235 240Ala Ser Asp Ser Thr Leu Leu Asn Gly Ser Leu
Asn Ser Tyr Ala Asn 245 250 255Gln Ile Ile Gly Glu Asn Asp Pro Gln
Leu Ser Pro Thr Lys Leu Lys 260 265 270Pro Thr Gly Arg Lys Tyr Ile
Phe Gly Tyr His Pro His Gly Ile Ile 275 280 285Gly Met Gly Ala Phe
Gly Gly Ile Ala Thr Glu Gly Ala Gly Trp Ser 290 295 300Lys Leu Phe
Pro Gly Ile Pro Val Ser Leu Met Thr Leu Thr Asn Asn305 310 315
320Phe Arg Val Pro Leu Tyr Arg Glu Tyr Leu Met Ser Leu Gly Val Ala
325 330 335Ser Val Ser Lys Lys Ser Cys Lys Ala Leu Leu Lys Arg Asn
Gln Ser 340 345 350Ile Cys Ile Val Val Gly Gly Ala Gln Glu Ser Leu
Leu Ala Arg Pro 355 360 365Gly Val Met Asp Leu Val Leu Leu Lys Arg
Lys Gly Phe Val Arg Leu 370 375 380Gly Met Glu Val Gly Asn Val Ala
Leu Val Pro Ile Met Ala Phe Gly385 390 395 400Glu Asn Asp Leu Tyr
Asp Gln Val Ser Asn Asp Lys Ser Ser Lys Leu 405 410 415Tyr Arg Phe
Gln Gln Phe Val Lys Asn Phe Leu Gly Phe Thr Leu Pro 420 425 430Leu
Met His Ala Arg Gly Val Phe Asn Tyr Asp Val Gly Leu Val Pro 435 440
445Tyr Arg Arg Pro Val Asn Ile Val Val Gly Ser Pro Ile Asp Leu Pro
450 455 460Tyr Leu Pro His Pro Thr Asp Glu Glu Val Ser Glu Tyr His
Asp Arg465 470 475 480Tyr Ile Ala Glu Leu Gln Arg Ile Tyr Asn Glu
His Lys Asp Glu Tyr 485 490 495Phe Ile Asp Trp Thr Glu Glu Gly Lys
Gly Ala Pro Glu Phe Arg Met 500 505 510Ile Glu741434DNAFusarium
moniliformeCDS(1)..(1434)synthetic delta-12 desaturase
(codon-optimized for Yarrowia lipolytica) 74atg gcc tcc acc tcg gct
ctg ccc aag cag aac cct gcc ctc cga cga 48Met Ala Ser Thr Ser Ala
Leu Pro Lys Gln Asn Pro Ala Leu Arg Arg1 5 10 15acc gtc act tcc acc
act gtg acc gac tcg gag tct gct gcc gtc tct 96Thr Val Thr Ser Thr
Thr Val Thr Asp Ser Glu Ser Ala Ala Val Ser 20 25 30ccc tcc gat tct
ccc aga cac tcg gcc tcc tct aca tcg ctg tct tcc 144Pro Ser Asp Ser
Pro Arg His Ser Ala Ser Ser Thr Ser Leu Ser Ser 35 40 45atg tcc gag
gtg gac att gcc aag ccc aag tcc gag tac ggt gtc atg 192Met Ser Glu
Val Asp Ile Ala Lys Pro Lys Ser Glu Tyr Gly Val Met 50 55 60ctg gat
acc tac ggc aac cag ttc gaa gtt ccc gac ttc acc atc aag 240Leu Asp
Thr Tyr Gly Asn Gln Phe Glu Val Pro Asp Phe Thr Ile Lys65 70 75
80gac atc tac aac gct att ccc aag cac tgc ttc aag cga tct gct ctc
288Asp Ile Tyr Asn Ala Ile Pro Lys His Cys Phe Lys Arg Ser Ala Leu
85 90 95aag gga tac ggc tac att ctt cga gac att gtc ctc ctg act acc
act 336Lys Gly Tyr Gly Tyr Ile Leu Arg Asp Ile Val Leu Leu Thr Thr
Thr 100 105 110ttc agc atc tgg tac aac ttt gtg aca ccc gag tac att
ccc tcc act 384Phe Ser Ile Trp Tyr Asn Phe Val Thr Pro Glu Tyr Ile
Pro Ser Thr 115 120 125cct gct cga gcc ggt ctg tgg gct gtg tac acc
gtt ctt cag gga ctc 432Pro Ala Arg Ala Gly Leu Trp Ala Val Tyr Thr
Val Leu Gln Gly Leu 130 135 140ttc ggt act gga ctg tgg gtc att gcc
cac gag tgt gga cat ggt gct 480Phe Gly Thr Gly Leu Trp Val Ile Ala
His Glu Cys Gly His Gly Ala145 150 155 160ttc tcc gat tcc
cga atc atc aac gac att act ggc tgg gtg ctt cac 528Phe Ser Asp Ser
Arg Ile Ile Asn Asp Ile Thr Gly Trp Val Leu His 165 170 175tct tcc
ctg ctt gtt ccc tac ttc agc tgg caa atc tcc cac cgg aag 576Ser Ser
Leu Leu Val Pro Tyr Phe Ser Trp Gln Ile Ser His Arg Lys 180 185
190cat cac aag gcc act gga aac atg gag cga gac atg gtc ttc gtt cct
624His His Lys Ala Thr Gly Asn Met Glu Arg Asp Met Val Phe Val Pro
195 200 205cga acc cga gag cag caa gct act cga ctc ggc aag atg acc
cac gaa 672Arg Thr Arg Glu Gln Gln Ala Thr Arg Leu Gly Lys Met Thr
His Glu 210 215 220ctc gcc cat ctt acc gag gaa act cct gct ttc acc
ctg ctc atg ctt 720Leu Ala His Leu Thr Glu Glu Thr Pro Ala Phe Thr
Leu Leu Met Leu225 230 235 240gtg ctt cag caa ctg gtc ggt tgg ccc
aac tat ctc att acc aac gtt 768Val Leu Gln Gln Leu Val Gly Trp Pro
Asn Tyr Leu Ile Thr Asn Val 245 250 255act gga cac aac tac cat gag
cgg cag cga gag ggt cga ggc aag gga 816Thr Gly His Asn Tyr His Glu
Arg Gln Arg Glu Gly Arg Gly Lys Gly 260 265 270aag cac aac ggt ctt
ggc ggt gga gtt aac cat ttc gat ccc cga tct 864Lys His Asn Gly Leu
Gly Gly Gly Val Asn His Phe Asp Pro Arg Ser 275 280 285cct ctg tac
gag aac agc gac gcc aag ctc atc gtg ctc tcc gac att 912Pro Leu Tyr
Glu Asn Ser Asp Ala Lys Leu Ile Val Leu Ser Asp Ile 290 295 300ggc
att ggt ctt atg gcc acc gct ctg tac ttt ctc gtt cag aag ttc 960Gly
Ile Gly Leu Met Ala Thr Ala Leu Tyr Phe Leu Val Gln Lys Phe305 310
315 320gga ttc tac aac atg gcc atc tgg tac ttc gtt ccc tac ttg tgg
gtt 1008Gly Phe Tyr Asn Met Ala Ile Trp Tyr Phe Val Pro Tyr Leu Trp
Val 325 330 335aac cac tgg ctc gtc gcc att acc ttt ctg cag cac aca
gat cct act 1056Asn His Trp Leu Val Ala Ile Thr Phe Leu Gln His Thr
Asp Pro Thr 340 345 350ctt ccc cac tac acc aac gac gag tgg aac ttt
gtg cga ggt gcc gct 1104Leu Pro His Tyr Thr Asn Asp Glu Trp Asn Phe
Val Arg Gly Ala Ala 355 360 365gca acc atc gac cga gag atg ggc ttc
att gga cgt cat ctg ctc cac 1152Ala Thr Ile Asp Arg Glu Met Gly Phe
Ile Gly Arg His Leu Leu His 370 375 380ggc att atc gag act cac gtc
ctg cat cac tac gtc tct tcc att ccc 1200Gly Ile Ile Glu Thr His Val
Leu His His Tyr Val Ser Ser Ile Pro385 390 395 400ttc tac aat gcg
gac gaa gct acc gag gcc atc aaa cct atc atg ggc 1248Phe Tyr Asn Ala
Asp Glu Ala Thr Glu Ala Ile Lys Pro Ile Met Gly 405 410 415aag cac
tat cga gct gat gtc cag gac ggt cct cga gga ttc att cga 1296Lys His
Tyr Arg Ala Asp Val Gln Asp Gly Pro Arg Gly Phe Ile Arg 420 425
430gcc atg tac cga tct gca cga atg tgc cag tgg gtt gaa ccc tcc gct
1344Ala Met Tyr Arg Ser Ala Arg Met Cys Gln Trp Val Glu Pro Ser Ala
435 440 445ggt gcc gag gga gct ggc aag ggt gtc ctg ttc ttt cga aac
cga aac 1392Gly Ala Glu Gly Ala Gly Lys Gly Val Leu Phe Phe Arg Asn
Arg Asn 450 455 460aat gtg ggc act cct ccc gct gtc atc aag ccc gtt
gcc taa 1434Asn Val Gly Thr Pro Pro Ala Val Ile Lys Pro Val Ala465
470 47575477PRTFusarium moniliforme 75Met Ala Ser Thr Ser Ala Leu
Pro Lys Gln Asn Pro Ala Leu Arg Arg1 5 10 15Thr Val Thr Ser Thr Thr
Val Thr Asp Ser Glu Ser Ala Ala Val Ser 20 25 30Pro Ser Asp Ser Pro
Arg His Ser Ala Ser Ser Thr Ser Leu Ser Ser 35 40 45Met Ser Glu Val
Asp Ile Ala Lys Pro Lys Ser Glu Tyr Gly Val Met 50 55 60Leu Asp Thr
Tyr Gly Asn Gln Phe Glu Val Pro Asp Phe Thr Ile Lys65 70 75 80Asp
Ile Tyr Asn Ala Ile Pro Lys His Cys Phe Lys Arg Ser Ala Leu 85 90
95Lys Gly Tyr Gly Tyr Ile Leu Arg Asp Ile Val Leu Leu Thr Thr Thr
100 105 110Phe Ser Ile Trp Tyr Asn Phe Val Thr Pro Glu Tyr Ile Pro
Ser Thr 115 120 125Pro Ala Arg Ala Gly Leu Trp Ala Val Tyr Thr Val
Leu Gln Gly Leu 130 135 140Phe Gly Thr Gly Leu Trp Val Ile Ala His
Glu Cys Gly His Gly Ala145 150 155 160Phe Ser Asp Ser Arg Ile Ile
Asn Asp Ile Thr Gly Trp Val Leu His 165 170 175Ser Ser Leu Leu Val
Pro Tyr Phe Ser Trp Gln Ile Ser His Arg Lys 180 185 190His His Lys
Ala Thr Gly Asn Met Glu Arg Asp Met Val Phe Val Pro 195 200 205Arg
Thr Arg Glu Gln Gln Ala Thr Arg Leu Gly Lys Met Thr His Glu 210 215
220Leu Ala His Leu Thr Glu Glu Thr Pro Ala Phe Thr Leu Leu Met
Leu225 230 235 240Val Leu Gln Gln Leu Val Gly Trp Pro Asn Tyr Leu
Ile Thr Asn Val 245 250 255Thr Gly His Asn Tyr His Glu Arg Gln Arg
Glu Gly Arg Gly Lys Gly 260 265 270Lys His Asn Gly Leu Gly Gly Gly
Val Asn His Phe Asp Pro Arg Ser 275 280 285Pro Leu Tyr Glu Asn Ser
Asp Ala Lys Leu Ile Val Leu Ser Asp Ile 290 295 300Gly Ile Gly Leu
Met Ala Thr Ala Leu Tyr Phe Leu Val Gln Lys Phe305 310 315 320Gly
Phe Tyr Asn Met Ala Ile Trp Tyr Phe Val Pro Tyr Leu Trp Val 325 330
335Asn His Trp Leu Val Ala Ile Thr Phe Leu Gln His Thr Asp Pro Thr
340 345 350Leu Pro His Tyr Thr Asn Asp Glu Trp Asn Phe Val Arg Gly
Ala Ala 355 360 365Ala Thr Ile Asp Arg Glu Met Gly Phe Ile Gly Arg
His Leu Leu His 370 375 380Gly Ile Ile Glu Thr His Val Leu His His
Tyr Val Ser Ser Ile Pro385 390 395 400Phe Tyr Asn Ala Asp Glu Ala
Thr Glu Ala Ile Lys Pro Ile Met Gly 405 410 415Lys His Tyr Arg Ala
Asp Val Gln Asp Gly Pro Arg Gly Phe Ile Arg 420 425 430Ala Met Tyr
Arg Ser Ala Arg Met Cys Gln Trp Val Glu Pro Ser Ala 435 440 445Gly
Ala Glu Gly Ala Gly Lys Gly Val Leu Phe Phe Arg Asn Arg Asn 450 455
460Asn Val Gly Thr Pro Pro Ala Val Ile Lys Pro Val Ala465 470
475761272DNAArtificial Sequencemutant EgD8M delta-8 desaturase
(also designated as "EgD8S-23") 76c atg gtg aag gct tct cga cag gct
ctg ccc ctc gtc atc gac gga aag 49 Met Val Lys Ala Ser Arg Gln Ala
Leu Pro Leu Val Ile Asp Gly Lys 1 5 10 15gtg tac gac gtc tcc gct
tgg gtg aac ttc cac cct ggt gga gct gaa 97Val Tyr Asp Val Ser Ala
Trp Val Asn Phe His Pro Gly Gly Ala Glu 20 25 30atc att gag aac tac
cag gga cga gat gct act gac gcc ttc atg gtt 145Ile Ile Glu Asn Tyr
Gln Gly Arg Asp Ala Thr Asp Ala Phe Met Val 35 40 45atg cac tct cag
gaa gcc ttc gac aag ctc aag cga atg ccc aag atc 193Met His Ser Gln
Glu Ala Phe Asp Lys Leu Lys Arg Met Pro Lys Ile 50 55 60aac cag gct
tcc gag ctg cct ccc cag gct gcc gtc aac gaa gct cag 241Asn Gln Ala
Ser Glu Leu Pro Pro Gln Ala Ala Val Asn Glu Ala Gln65 70 75 80gag
gat ttc cga aag ctc cga gaa gag ctg atc gcc act ggc atg ttt 289Glu
Asp Phe Arg Lys Leu Arg Glu Glu Leu Ile Ala Thr Gly Met Phe 85 90
95gac gcc tct ccc ctc tgg tac tcg tac aag atc ttg acc acc ctg ggt
337Asp Ala Ser Pro Leu Trp Tyr Ser Tyr Lys Ile Leu Thr Thr Leu Gly
100 105 110ctt ggc gtg ctt gcc ttc ttc atg ctg gtc cag tac cac ctg
tac ttc 385Leu Gly Val Leu Ala Phe Phe Met Leu Val Gln Tyr His Leu
Tyr Phe 115 120 125att ggt gct ctc gtg ctc ggt atg cac tac cag caa
atg gga tgg ctg 433Ile Gly Ala Leu Val Leu Gly Met His Tyr Gln Gln
Met Gly Trp Leu 130 135 140tct cat gac atc tgc cac cac cag acc ttc
aag aac cga aac tgg aat 481Ser His Asp Ile Cys His His Gln Thr Phe
Lys Asn Arg Asn Trp Asn145 150 155 160aac gtc ctg ggt ctg gtc ttt
ggc aac gga ctc cag ggc ttc tcc gtg 529Asn Val Leu Gly Leu Val Phe
Gly Asn Gly Leu Gln Gly Phe Ser Val 165 170 175acc tgg tgg aag gac
aga cac aac gcc cat cat tct gct acc aac gtt 577Thr Trp Trp Lys Asp
Arg His Asn Ala His His Ser Ala Thr Asn Val 180 185 190cag ggt cac
gat ccc gac att gat aac ctg cct ctg ctc gcc tgg tcc 625Gln Gly His
Asp Pro Asp Ile Asp Asn Leu Pro Leu Leu Ala Trp Ser 195 200 205gag
gac gat gtc act cga gct tct ccc atc tcc cga aag ctc att cag 673Glu
Asp Asp Val Thr Arg Ala Ser Pro Ile Ser Arg Lys Leu Ile Gln 210 215
220ttc caa cag tac tat ttc ctg gtc atc tgt att ctc ctg cga ttc atc
721Phe Gln Gln Tyr Tyr Phe Leu Val Ile Cys Ile Leu Leu Arg Phe
Ile225 230 235 240tgg tgt ttc cag tct gtg ctg acc gtt cga tcc ctc
aag gac cga gac 769Trp Cys Phe Gln Ser Val Leu Thr Val Arg Ser Leu
Lys Asp Arg Asp 245 250 255aac cag ttc tac cga tct cag tac aag aaa
gag gcc att gga ctc gct 817Asn Gln Phe Tyr Arg Ser Gln Tyr Lys Lys
Glu Ala Ile Gly Leu Ala 260 265 270ctg cac tgg act ctc aag acc ctg
ttc cac ctc ttc ttt atg ccc tcc 865Leu His Trp Thr Leu Lys Thr Leu
Phe His Leu Phe Phe Met Pro Ser 275 280 285atc ctg acc tcg atg ctg
gtg ttc ttt gtt tcc gag ctc gtc ggt ggc 913Ile Leu Thr Ser Met Leu
Val Phe Phe Val Ser Glu Leu Val Gly Gly 290 295 300ttc gga att gcc
atc gtg gtc ttc atg aac cac tac cct ctg gag aag 961Phe Gly Ile Ala
Ile Val Val Phe Met Asn His Tyr Pro Leu Glu Lys305 310 315 320atc
ggt gat tcc gtc tgg gac gga cat ggc ttc tct gtg ggt cag atc 1009Ile
Gly Asp Ser Val Trp Asp Gly His Gly Phe Ser Val Gly Gln Ile 325 330
335cat gag acc atg aac att cga cga ggc atc att act gac tgg ttc ttt
1057His Glu Thr Met Asn Ile Arg Arg Gly Ile Ile Thr Asp Trp Phe Phe
340 345 350gga ggc ctg aac tac cag atc gag cac cat ctc tgg ccc acc
ctg cct 1105Gly Gly Leu Asn Tyr Gln Ile Glu His His Leu Trp Pro Thr
Leu Pro 355 360 365cga cac aac ctc act gcc gtt tcc tac cag gtg gaa
cag ctg tgc cag 1153Arg His Asn Leu Thr Ala Val Ser Tyr Gln Val Glu
Gln Leu Cys Gln 370 375 380aag cac aac ctc ccc tac cga aac cct ctg
ccc cat gaa ggt ctc gtc 1201Lys His Asn Leu Pro Tyr Arg Asn Pro Leu
Pro His Glu Gly Leu Val385 390 395 400atc ctg ctc cga tac ctg tcc
cag ttc gct cga atg gcc gag aag cag 1249Ile Leu Leu Arg Tyr Leu Ser
Gln Phe Ala Arg Met Ala Glu Lys Gln 405 410 415ccc ggt gcc aag gct
cag taa gc 1272Pro Gly Ala Lys Ala Gln 42077422PRTArtificial
SequenceSynthetic Construct 77Met Val Lys Ala Ser Arg Gln Ala Leu
Pro Leu Val Ile Asp Gly Lys1 5 10 15Val Tyr Asp Val Ser Ala Trp Val
Asn Phe His Pro Gly Gly Ala Glu 20 25 30Ile Ile Glu Asn Tyr Gln Gly
Arg Asp Ala Thr Asp Ala Phe Met Val 35 40 45Met His Ser Gln Glu Ala
Phe Asp Lys Leu Lys Arg Met Pro Lys Ile 50 55 60Asn Gln Ala Ser Glu
Leu Pro Pro Gln Ala Ala Val Asn Glu Ala Gln65 70 75 80Glu Asp Phe
Arg Lys Leu Arg Glu Glu Leu Ile Ala Thr Gly Met Phe 85 90 95Asp Ala
Ser Pro Leu Trp Tyr Ser Tyr Lys Ile Leu Thr Thr Leu Gly 100 105
110Leu Gly Val Leu Ala Phe Phe Met Leu Val Gln Tyr His Leu Tyr Phe
115 120 125Ile Gly Ala Leu Val Leu Gly Met His Tyr Gln Gln Met Gly
Trp Leu 130 135 140Ser His Asp Ile Cys His His Gln Thr Phe Lys Asn
Arg Asn Trp Asn145 150 155 160Asn Val Leu Gly Leu Val Phe Gly Asn
Gly Leu Gln Gly Phe Ser Val 165 170 175Thr Trp Trp Lys Asp Arg His
Asn Ala His His Ser Ala Thr Asn Val 180 185 190Gln Gly His Asp Pro
Asp Ile Asp Asn Leu Pro Leu Leu Ala Trp Ser 195 200 205Glu Asp Asp
Val Thr Arg Ala Ser Pro Ile Ser Arg Lys Leu Ile Gln 210 215 220Phe
Gln Gln Tyr Tyr Phe Leu Val Ile Cys Ile Leu Leu Arg Phe Ile225 230
235 240Trp Cys Phe Gln Ser Val Leu Thr Val Arg Ser Leu Lys Asp Arg
Asp 245 250 255Asn Gln Phe Tyr Arg Ser Gln Tyr Lys Lys Glu Ala Ile
Gly Leu Ala 260 265 270Leu His Trp Thr Leu Lys Thr Leu Phe His Leu
Phe Phe Met Pro Ser 275 280 285Ile Leu Thr Ser Met Leu Val Phe Phe
Val Ser Glu Leu Val Gly Gly 290 295 300Phe Gly Ile Ala Ile Val Val
Phe Met Asn His Tyr Pro Leu Glu Lys305 310 315 320Ile Gly Asp Ser
Val Trp Asp Gly His Gly Phe Ser Val Gly Gln Ile 325 330 335His Glu
Thr Met Asn Ile Arg Arg Gly Ile Ile Thr Asp Trp Phe Phe 340 345
350Gly Gly Leu Asn Tyr Gln Ile Glu His His Leu Trp Pro Thr Leu Pro
355 360 365Arg His Asn Leu Thr Ala Val Ser Tyr Gln Val Glu Gln Leu
Cys Gln 370 375 380Lys His Asn Leu Pro Tyr Arg Asn Pro Leu Pro His
Glu Gly Leu Val385 390 395 400Ile Leu Leu Arg Tyr Leu Ser Gln Phe
Ala Arg Met Ala Glu Lys Gln 405 410 415Pro Gly Ala Lys Ala Gln
42078792DNAEutreptiella sp. CCMP389CDS(1)..(792)synthetic delta-9
elongase (codon-optimized for Yarrowia lipolytica) 78atg gct gcc
gtc atc gag gtg gcc aac gag ttc gtc gct atc act gcc 48Met Ala Ala
Val Ile Glu Val Ala Asn Glu Phe Val Ala Ile Thr Ala1 5 10 15gag acc
ctt ccc aag gtg gac tat cag cga ctc tgg cga gac atc tac 96Glu Thr
Leu Pro Lys Val Asp Tyr Gln Arg Leu Trp Arg Asp Ile Tyr 20 25 30tcc
tgc gag ctc ctg tac ttc tcc att gct ttc gtc atc ctc aag ttt 144Ser
Cys Glu Leu Leu Tyr Phe Ser Ile Ala Phe Val Ile Leu Lys Phe 35 40
45acc ctt ggc gag ctc tcg gat tct ggc aaa aag att ctg cga gtg ctg
192Thr Leu Gly Glu Leu Ser Asp Ser Gly Lys Lys Ile Leu Arg Val Leu
50 55 60ttc aag tgg tac aac ctc ttc atg tcc gtc ttt tcg ctg gtg tcc
ttc 240Phe Lys Trp Tyr Asn Leu Phe Met Ser Val Phe Ser Leu Val Ser
Phe65 70 75 80ctc tgt atg ggt tac gcc atc tac acc gtt gga ctg tac
tcc aac gaa 288Leu Cys Met Gly Tyr Ala Ile Tyr Thr Val Gly Leu Tyr
Ser Asn Glu 85 90 95tgc gac aga gct ttc gac aac agc ttg ttc cga ttt
gcc acc aag gtc 336Cys Asp Arg Ala Phe Asp Asn Ser Leu Phe Arg Phe
Ala Thr Lys Val 100 105 110ttc tac tat tcc aag ttt ctg gag tac atc
gac tct ttc tac ctt ccc 384Phe Tyr Tyr Ser Lys Phe Leu Glu Tyr Ile
Asp Ser Phe Tyr Leu Pro 115 120 125ctc atg gcc aag cct ctg tcc ttt
ctg cag ttc ttt cat cac ttg gga 432Leu Met Ala Lys Pro Leu Ser Phe
Leu Gln Phe Phe His His Leu Gly 130 135 140gct cct atg gac atg tgg
ctc ttc gtg cag tac tct ggc gaa tcc att 480Ala Pro Met Asp Met Trp
Leu Phe Val Gln Tyr Ser Gly Glu Ser Ile145 150 155 160tgg atc ttt
gtg ttc ctg aac gga ttc att cac ttt gtc atg tac ggc 528Trp Ile Phe
Val Phe Leu Asn Gly Phe Ile His Phe Val Met Tyr Gly 165 170 175tac
tat tgg aca cgg ctg atg aag ttc aac ttt ccc atg ccc aag cag 576Tyr
Tyr Trp Thr Arg Leu Met Lys Phe Asn Phe Pro Met Pro Lys Gln 180 185
190ctc att acc gca atg cag atc acc cag ttc aac gtt ggc ttc tac ctc
624Leu Ile Thr Ala Met Gln Ile Thr Gln Phe Asn Val Gly Phe Tyr Leu
195 200 205gtg tgg tgg tac aag gac att ccc tgt tac cga aag gat ccc
atg cga 672Val Trp Trp Tyr Lys Asp Ile
Pro Cys Tyr Arg Lys Asp Pro Met Arg 210 215 220atg ctg gcc tgg atc
ttc aac tac tgg tac gtc ggt acc gtt ctt ctg 720Met Leu Ala Trp Ile
Phe Asn Tyr Trp Tyr Val Gly Thr Val Leu Leu225 230 235 240ctc ttc
atc aac ttc ttt gtc aag tcc tac gtg ttt ccc aag cct aag 768Leu Phe
Ile Asn Phe Phe Val Lys Ser Tyr Val Phe Pro Lys Pro Lys 245 250
255act gcc gac aaa aag gtc cag tag 792Thr Ala Asp Lys Lys Val Gln
26079263PRTEutreptiella sp. CCMP389 79Met Ala Ala Val Ile Glu Val
Ala Asn Glu Phe Val Ala Ile Thr Ala1 5 10 15Glu Thr Leu Pro Lys Val
Asp Tyr Gln Arg Leu Trp Arg Asp Ile Tyr 20 25 30Ser Cys Glu Leu Leu
Tyr Phe Ser Ile Ala Phe Val Ile Leu Lys Phe 35 40 45Thr Leu Gly Glu
Leu Ser Asp Ser Gly Lys Lys Ile Leu Arg Val Leu 50 55 60Phe Lys Trp
Tyr Asn Leu Phe Met Ser Val Phe Ser Leu Val Ser Phe65 70 75 80Leu
Cys Met Gly Tyr Ala Ile Tyr Thr Val Gly Leu Tyr Ser Asn Glu 85 90
95Cys Asp Arg Ala Phe Asp Asn Ser Leu Phe Arg Phe Ala Thr Lys Val
100 105 110Phe Tyr Tyr Ser Lys Phe Leu Glu Tyr Ile Asp Ser Phe Tyr
Leu Pro 115 120 125Leu Met Ala Lys Pro Leu Ser Phe Leu Gln Phe Phe
His His Leu Gly 130 135 140Ala Pro Met Asp Met Trp Leu Phe Val Gln
Tyr Ser Gly Glu Ser Ile145 150 155 160Trp Ile Phe Val Phe Leu Asn
Gly Phe Ile His Phe Val Met Tyr Gly 165 170 175Tyr Tyr Trp Thr Arg
Leu Met Lys Phe Asn Phe Pro Met Pro Lys Gln 180 185 190Leu Ile Thr
Ala Met Gln Ile Thr Gln Phe Asn Val Gly Phe Tyr Leu 195 200 205Val
Trp Trp Tyr Lys Asp Ile Pro Cys Tyr Arg Lys Asp Pro Met Arg 210 215
220Met Leu Ala Trp Ile Phe Asn Tyr Trp Tyr Val Gly Thr Val Leu
Leu225 230 235 240Leu Phe Ile Asn Phe Phe Val Lys Ser Tyr Val Phe
Pro Lys Pro Lys 245 250 255Thr Ala Asp Lys Lys Val Gln
260801350DNAEuglena gracilisCDS(1)..(1350)synthetic delta-5
desaturase (codon-optimized for Yarrowia lipolytica) 80atg gct ctc
tcc ctt act acc gag cag ctg ctc gag cga ccc gac ctg 48Met Ala Leu
Ser Leu Thr Thr Glu Gln Leu Leu Glu Arg Pro Asp Leu1 5 10 15gtt gcc
atc gac ggc att ctc tac gat ctg gaa ggt ctt gcc aag gtc 96Val Ala
Ile Asp Gly Ile Leu Tyr Asp Leu Glu Gly Leu Ala Lys Val 20 25 30cat
ccc gga ggc gac ttg atc ctc gct tct ggt gcc tcc gat gct tct 144His
Pro Gly Gly Asp Leu Ile Leu Ala Ser Gly Ala Ser Asp Ala Ser 35 40
45cct ctg ttc tac tcc atg cac cct tac gtc aag ccc gag aac tcg aag
192Pro Leu Phe Tyr Ser Met His Pro Tyr Val Lys Pro Glu Asn Ser Lys
50 55 60ctg ctt caa cag ttc gtg cga ggc aag cac gac cga acc tcc aag
gac 240Leu Leu Gln Gln Phe Val Arg Gly Lys His Asp Arg Thr Ser Lys
Asp65 70 75 80att gtc tac acc tac gac tct ccc ttt gca cag gac gtc
aag cga act 288Ile Val Tyr Thr Tyr Asp Ser Pro Phe Ala Gln Asp Val
Lys Arg Thr 85 90 95atg cga gag gtc atg aaa ggt cgg aac tgg tat gcc
aca cct gga ttc 336Met Arg Glu Val Met Lys Gly Arg Asn Trp Tyr Ala
Thr Pro Gly Phe 100 105 110tgg ctg cga acc gtt ggc atc att gct gtc
acc gcc ttt tgc gag tgg 384Trp Leu Arg Thr Val Gly Ile Ile Ala Val
Thr Ala Phe Cys Glu Trp 115 120 125cac tgg gct act acc gga atg gtg
ctg tgg ggt ctc ttg act gga ttc 432His Trp Ala Thr Thr Gly Met Val
Leu Trp Gly Leu Leu Thr Gly Phe 130 135 140atg cac atg cag atc ggc
ctg tcc att cag cac gat gcc tct cat ggt 480Met His Met Gln Ile Gly
Leu Ser Ile Gln His Asp Ala Ser His Gly145 150 155 160gcc atc agc
aaa aag ccc tgg gtc aac gct ctc ttt gcc tac ggc atc 528Ala Ile Ser
Lys Lys Pro Trp Val Asn Ala Leu Phe Ala Tyr Gly Ile 165 170 175gac
gtc att gga tcg tcc aga tgg atc tgg ctg cag tct cac atc atg 576Asp
Val Ile Gly Ser Ser Arg Trp Ile Trp Leu Gln Ser His Ile Met 180 185
190cga cat cac acc tac acc aat cag cat ggt ctc gac ctg gat gcc gag
624Arg His His Thr Tyr Thr Asn Gln His Gly Leu Asp Leu Asp Ala Glu
195 200 205tcc gca gaa cca ttc ctt gtg ttc cac aac tac cct gct gcc
aac act 672Ser Ala Glu Pro Phe Leu Val Phe His Asn Tyr Pro Ala Ala
Asn Thr 210 215 220gct cga aag tgg ttt cac cga ttc cag gcc tgg tac
atg tac ctc gtg 720Ala Arg Lys Trp Phe His Arg Phe Gln Ala Trp Tyr
Met Tyr Leu Val225 230 235 240ctt gga gcc tac ggc gtt tcg ctg gtg
tac aac cct ctc tac atc ttc 768Leu Gly Ala Tyr Gly Val Ser Leu Val
Tyr Asn Pro Leu Tyr Ile Phe 245 250 255cga atg cag cac aac gac acc
att ccc gag tct gtc aca gcc atg cga 816Arg Met Gln His Asn Asp Thr
Ile Pro Glu Ser Val Thr Ala Met Arg 260 265 270gag aac ggc ttt ctg
cga cgg tac cga acc ctt gca ttc gtt atg cga 864Glu Asn Gly Phe Leu
Arg Arg Tyr Arg Thr Leu Ala Phe Val Met Arg 275 280 285gct ttc ttc
atc ttt cga acc gcc ttc ttg ccc tgg tat ctc act gga 912Ala Phe Phe
Ile Phe Arg Thr Ala Phe Leu Pro Trp Tyr Leu Thr Gly 290 295 300acc
tcc ctg ctc atc acc att cct ctg gtg ccc act gct acc ggt gcc 960Thr
Ser Leu Leu Ile Thr Ile Pro Leu Val Pro Thr Ala Thr Gly Ala305 310
315 320ttc ctc acc ttc ttt ttc atc ttg tct cac aac ttc gat ggc tcg
gag 1008Phe Leu Thr Phe Phe Phe Ile Leu Ser His Asn Phe Asp Gly Ser
Glu 325 330 335cga atc ccc gac aag aac tgc aag gtc aag agc tcc gag
aag gac gtt 1056Arg Ile Pro Asp Lys Asn Cys Lys Val Lys Ser Ser Glu
Lys Asp Val 340 345 350gaa gcc gat cag atc gac tgg tac aga gct cag
gtg gag acc tct tcc 1104Glu Ala Asp Gln Ile Asp Trp Tyr Arg Ala Gln
Val Glu Thr Ser Ser 355 360 365acc tac ggt gga ccc att gcc atg ttc
ttt act ggc ggt ctc aac ttc 1152Thr Tyr Gly Gly Pro Ile Ala Met Phe
Phe Thr Gly Gly Leu Asn Phe 370 375 380cag atc gag cat cac ctc ttt
cct cga atg tcg tct tgg cac tat ccc 1200Gln Ile Glu His His Leu Phe
Pro Arg Met Ser Ser Trp His Tyr Pro385 390 395 400ttc gtg cag caa
gct gtc cga gag tgt tgc gaa cga cac gga gtt cgg 1248Phe Val Gln Gln
Ala Val Arg Glu Cys Cys Glu Arg His Gly Val Arg 405 410 415tac gtc
ttc tac cct acc att gtg ggc aac atc att tcc acc ctc aag 1296Tyr Val
Phe Tyr Pro Thr Ile Val Gly Asn Ile Ile Ser Thr Leu Lys 420 425
430tac atg cac aaa gtc ggt gtg gtt cac tgt gtc aag gac gct cag gat
1344Tyr Met His Lys Val Gly Val Val His Cys Val Lys Asp Ala Gln Asp
435 440 445tcc taa 1350Ser 81449PRTEuglena gracilis 81Met Ala Leu
Ser Leu Thr Thr Glu Gln Leu Leu Glu Arg Pro Asp Leu1 5 10 15Val Ala
Ile Asp Gly Ile Leu Tyr Asp Leu Glu Gly Leu Ala Lys Val 20 25 30His
Pro Gly Gly Asp Leu Ile Leu Ala Ser Gly Ala Ser Asp Ala Ser 35 40
45Pro Leu Phe Tyr Ser Met His Pro Tyr Val Lys Pro Glu Asn Ser Lys
50 55 60Leu Leu Gln Gln Phe Val Arg Gly Lys His Asp Arg Thr Ser Lys
Asp65 70 75 80Ile Val Tyr Thr Tyr Asp Ser Pro Phe Ala Gln Asp Val
Lys Arg Thr 85 90 95Met Arg Glu Val Met Lys Gly Arg Asn Trp Tyr Ala
Thr Pro Gly Phe 100 105 110Trp Leu Arg Thr Val Gly Ile Ile Ala Val
Thr Ala Phe Cys Glu Trp 115 120 125His Trp Ala Thr Thr Gly Met Val
Leu Trp Gly Leu Leu Thr Gly Phe 130 135 140Met His Met Gln Ile Gly
Leu Ser Ile Gln His Asp Ala Ser His Gly145 150 155 160Ala Ile Ser
Lys Lys Pro Trp Val Asn Ala Leu Phe Ala Tyr Gly Ile 165 170 175Asp
Val Ile Gly Ser Ser Arg Trp Ile Trp Leu Gln Ser His Ile Met 180 185
190Arg His His Thr Tyr Thr Asn Gln His Gly Leu Asp Leu Asp Ala Glu
195 200 205Ser Ala Glu Pro Phe Leu Val Phe His Asn Tyr Pro Ala Ala
Asn Thr 210 215 220Ala Arg Lys Trp Phe His Arg Phe Gln Ala Trp Tyr
Met Tyr Leu Val225 230 235 240Leu Gly Ala Tyr Gly Val Ser Leu Val
Tyr Asn Pro Leu Tyr Ile Phe 245 250 255Arg Met Gln His Asn Asp Thr
Ile Pro Glu Ser Val Thr Ala Met Arg 260 265 270Glu Asn Gly Phe Leu
Arg Arg Tyr Arg Thr Leu Ala Phe Val Met Arg 275 280 285Ala Phe Phe
Ile Phe Arg Thr Ala Phe Leu Pro Trp Tyr Leu Thr Gly 290 295 300Thr
Ser Leu Leu Ile Thr Ile Pro Leu Val Pro Thr Ala Thr Gly Ala305 310
315 320Phe Leu Thr Phe Phe Phe Ile Leu Ser His Asn Phe Asp Gly Ser
Glu 325 330 335Arg Ile Pro Asp Lys Asn Cys Lys Val Lys Ser Ser Glu
Lys Asp Val 340 345 350Glu Ala Asp Gln Ile Asp Trp Tyr Arg Ala Gln
Val Glu Thr Ser Ser 355 360 365Thr Tyr Gly Gly Pro Ile Ala Met Phe
Phe Thr Gly Gly Leu Asn Phe 370 375 380Gln Ile Glu His His Leu Phe
Pro Arg Met Ser Ser Trp His Tyr Pro385 390 395 400Phe Val Gln Gln
Ala Val Arg Glu Cys Cys Glu Arg His Gly Val Arg 405 410 415Tyr Val
Phe Tyr Pro Thr Ile Val Gly Asn Ile Ile Ser Thr Leu Lys 420 425
430Tyr Met His Lys Val Gly Val Val His Cys Val Lys Asp Ala Gln Asp
435 440 445Ser 826356DNAArtificial SequencePlasmid pY157
82ttgagaagcc cattgtatat tattaggatc gtagcattat tgtggcaaaa aatattcaag
60tgctcatgtg aattgacacg atcacgtaaa tacctggtga aattgctagt attcgtgatg
120ttctaataca actctgttca atatttccgg cgctctcttg tatacaagag
cacaagacat 180gcaccccaca ttaaccgagg tcaagtgttt atgtatgaaa
agtgacataa atcgtccaaa 240aaaaagtagc acatagttgt atggctgtaa
gttatgtgat tgtcagttct tcggccttcc 300aactcctatg caccgtcttc
aatcatctac ccccgtgccc cacaccccgc actattagag 360tttatcacag
tcagctaaac tgcttgcaca tctacacctc tgactacacc accatggatt
420tcttcagacg gcaccagaaa aaggtgctgg cactggtagg tgtggcgctg
agttcctacc 480tgtttatcga ctatgtgaag aaaaagttct tcgagatcca
gggtcgtttg agctcggagc 540gaaccgctaa acagaatctc cggcgccgat
ttgaacagaa ccagcaggat gcagatttta 600caatcatggc tctgctatcc
agcttgacga caccggtaat ggagcgttac cccgtcgacc 660agatcaaggc
agagttacag agcaagagac gccccacaga ccgggttttg gctctcgaga
720gctccacctc gtcctcagct accgcacaaa ccgtgcccac catgacaagt
ggcgccacag 780aggagggcga gaagttaatt aactttggcc ggcctttacc
tgcaggataa cttcgtataa 840tgtatgctat acgaagttat gaattctctc
tcttgagctt ttccataaca agttcttctg 900cctccaggaa gtccatgggt
ggtttgatca tggttttggt gtagtggtag tgcagtggtg 960gtattgtgac
tggggatgta gttgagaata agtcatacac aagtcagctt tcttcgagcc
1020tcatataagt ataagtagtt caacgtatta gcactgtacc cagcatctcc
gtatcgagaa 1080acacaacaac atgccccatt ggacagatca tgcggataca
caggttgtgc agtatcatac 1140atactcgatc agacaggtcg tctgaccatc
atacaagctg aacaagcgct ccatacttgc 1200acgctctcta tatacacagt
taaattacat atccatagtc taacctctaa cagttaatct 1260tctggtaagc
ctcccagcca gccttctggt atcgcttggc ctcctcaata ggatctcggt
1320tctggccgta cagacctcgg ccgacaatta tgatatccgt tccggtagac
atgacatcct 1380caacagttcg gtactgctgt ccgagagcgt ctcccttgtc
gtcaagaccc accccggggg 1440tcagaataag ccagtcctca gagtcgccct
taggtcggtt ctgggcaatg aagccaacca 1500caaactcggg gtcggatcgg
gcaagctcaa tggtctgctt ggagtactcg ccagtggcca 1560gagagccctt
gcaagacagc tcggccagca tgagcagacc tctggccagc ttctcgttgg
1620gagaggggac taggaactcc ttgtactggg agttctcgta gtcagagacg
tcctccttct 1680tctgttcaga gacagtttcc tcggcaccag ctcgcaggcc
agcaatgatt ccggttccgg 1740gtacaccgtg ggcgttggtg atatcggacc
actcggcgat tcggtgacac cggtactggt 1800gcttgacagt gttgccaata
tctgcgaact ttctgtcctc gaacaggaag aaaccgtgct 1860taagagcaag
ttccttgagg gggagcacag tgccggcgta ggtgaagtcg tcaatgatgt
1920cgatatgggt tttgatcatg cacacataag gtccgacctt atcggcaagc
tcaatgagct 1980ccttggtggt ggtaacatcc agagaagcac acaggttggt
tttcttggct gccacgagct 2040tgagcactcg agcggcaaag gcggacttgt
ggacgttagc tcgagcttcg taggagggca 2100ttttggtggt gaagaggaga
ctgaaataaa tttagtctgc agaacttttt atcggaacct 2160tatctggggc
agtgaagtat atgttatggt aatagttacg agttagttga acttatagat
2220agactggact atacggctat cggtccaaat tagaaagaac gtcaatggct
ctctgggcgt 2280cgcctttgcc gacaaaaatg tgatcatgat gaaagccagc
aatgacgttg cagctgatat 2340tgttgtcggc caaccgcgcc gaaaacgcag
ctgtcagacc cacagcctcc aacgaagaat 2400gtatcgtcaa agtgatccaa
gcacactcat agttggagtc gtactccaaa ggcggcaatg 2460acgagtcaga
cagatactcg tcgactcatc gatataactt cgtataatgt atgctatacg
2520aagttatcct aggtatagat cttgcacttc ttattttctt cacgcgtttg
cagctcaaca 2580ttctaggacg acgaaactac gtcaacagtg ttgtcgctct
ggcgcagcag ggccgagagg 2640gtaatgccga gggtcgagtg gcgccctcgt
ttggtgatct tgcagatatg ggctatttcg 2700gcgacctttc aggctcgtcc
agcttcggag aaactattgt cgatcccgat ctggacgaac 2760agtaccttac
cttttcgtgg tggctgctga acgagggatg ggtgtcgctg agcgagcgag
2820tggaggaagc ggttcgtcga gtgtgggacc ccgtgtcacc caaggccgaa
cttggatttg 2880acgagttgtc ggaactcatt ggacgaacac agatgctcat
tgatcgacct ctcaatccct 2940cgtcgccact caactttctg agccagctgc
tgccaccacg ggagcaggag gagtacgtgc 3000ttgcccagaa ccccagcgat
actgctgccc ccattgtagg acctaccctc cgacggcttc 3060tggacgagac
tgccgacttc atcgagtccc ctaatgccgc agaggtgatt gagcgacttg
3120ttcactccgg tctctctgtg ttcatggaca agctggctgt cacgtttgga
gccacacctg 3180ctgattcggg ttcgccttat cctgtggtgc tgcctactgc
aaaggtcaag ctgccctcca 3240ttcttgccaa catggctcga caggctggag
gcatggccca gggatcgccg ggcgtggaaa 3300acgagtacat tgacgtgatg
aaccaagtgc aggagctgac ctcctttagt gctgtggtct 3360attcatcttt
tgattgggct ctctagaggc tcattcacga aagacacgaa gaacgaagat
3420ggggactgaa tacagcgctc tcatttgtac acaaatgatt tatgacagag
taacttgtac 3480atcatgtaga gcatacatac tgaaggtgtg atctcacggg
atatcttgaa gaccactcgt 3540agctggaggc ataggtagtg ctagtacgga
tacttgcacc gtatccaaca taagtagagg 3600agcctcctag tggctattgg
tacaccgata aagatacaca tacatggcgc gccagctgca 3660ttaatgaatc
ggccaacgcg cggggagagg cggtttgcgt attgggcgct cttccgcttc
3720ctcgctcact gactcgctgc gctcggtcgt tcggctgcgg cgagcggtat
cagctcactc 3780aaaggcggta atacggttat ccacagaatc aggggataac
gcaggaaaga acatgtgagc 3840aaaaggccag caaaaggcca ggaaccgtaa
aaaggccgcg ttgctggcgt ttttccatag 3900gctccgcccc cctgacgagc
atcacaaaaa tcgacgctca agtcagaggt ggcgaaaccc 3960gacaggacta
taaagatacc aggcgtttcc ccctggaagc tccctcgtgc gctctcctgt
4020tccgaccctg ccgcttaccg gatacctgtc cgcctttctc ccttcgggaa
gcgtggcgct 4080ttctcatagc tcacgctgta ggtatctcag ttcggtgtag
gtcgttcgct ccaagctggg 4140ctgtgtgcac gaaccccccg ttcagcccga
ccgctgcgcc ttatccggta actatcgtct 4200tgagtccaac ccggtaagac
acgacttatc gccactggca gcagccactg gtaacaggat 4260tagcagagcg
aggtatgtag gcggtgctac agagttcttg aagtggtggc ctaactacgg
4320ctacactaga agaacagtat ttggtatctg cgctctgctg aagccagtta
ccttcggaaa 4380aagagttggt agctcttgat ccggcaaaca aaccaccgct
ggtagcggtg gtttttttgt 4440ttgcaagcag cagattacgc gcagaaaaaa
aggatctcaa gaagatcctt tgatcttttc 4500tacggggtct gacgctcagt
ggaacgaaaa ctcacgttaa gggattttgg tcatgagatt 4560atcaaaaagg
atcttcacct agatcctttt aaattaaaaa tgaagtttta aatcaatcta
4620aagtatatat gagtaaactt ggtctgacag ttaccaatgc ttaatcagtg
aggcacctat 4680ctcagcgatc tgtctatttc gttcatccat agttgcctga
ctccccgtcg tgtagataac 4740tacgatacgg gagggcttac catctggccc
cagtgctgca atgataccgc gagacccacg 4800ctcaccggct ccagatttat
cagcaataaa ccagccagcc ggaagggccg agcgcagaag 4860tggtcctgca
actttatccg cctccatcca gtctattaat tgttgccggg aagctagagt
4920aagtagttcg ccagttaata gtttgcgcaa cgttgttgcc attgctacag
gcatcgtggt 4980gtcacgctcg tcgtttggta tggcttcatt cagctccggt
tcccaacgat caaggcgagt 5040tacatgatcc cccatgttgt gcaaaaaagc
ggttagctcc ttcggtcctc cgatcgttgt 5100cagaagtaag ttggccgcag
tgttatcact catggttatg gcagcactgc ataattctct 5160tactgtcatg
ccatccgtaa gatgcttttc tgtgactggt gagtactcaa ccaagtcatt
5220ctgagaatag tgtatgcggc gaccgagttg ctcttgcccg gcgtcaatac
gggataatac 5280cgcgccacat agcagaactt taaaagtgct catcattgga
aaacgttctt cggggcgaaa 5340actctcaagg atcttaccgc tgttgagatc
cagttcgatg taacccactc gtgcacccaa 5400ctgatcttca gcatctttta
ctttcaccag cgtttctggg tgagcaaaaa caggaaggca 5460aaatgccgca
aaaaagggaa taagggcgac acggaaatgt tgaatactca tactcttcct
5520ttttcaatat tattgaagca tttatcaggg ttattgtctc atgagcggat
acatatttga 5580atgtatttag aaaaataaac aaataggggt tccgcgcaca
tttccccgaa aagtgccacc
5640tgatgcggtg tgaaataccg cacagatgcg taaggagaaa ataccgcatc
aggaaattgt 5700aagcgttaat attttgttaa aattcgcgtt aaatttttgt
taaatcagct cattttttaa 5760ccaataggcc gaaatcggca aaatccctta
taaatcaaaa gaatagaccg agatagggtt 5820gagtgttgtt ccagtttgga
acaagagtcc actattaaag aacgtggact ccaacgtcaa 5880agggcgaaaa
accgtctatc agggcgatgg cccactacgt gaaccatcac cctaatcaag
5940ttttttgggg tcgaggtgcc gtaaagcact aaatcggaac cctaaaggga
gcccccgatt 6000tagagcttga cggggaaagc cggcgaacgt ggcgagaaag
gaagggaaga aagcgaaagg 6060agcgggcgct agggcgctgg caagtgtagc
ggtcacgctg cgcgtaacca ccacacccgc 6120cgcgcttaat gcgccgctac
agggcgcgtc cattcgccat tcaggctgcg caactgttgg 6180gaagggcgat
cggtgcgggc ctcttcgcta ttacgccagc tggcgaaagg gggatgtgct
6240gcaaggcgat taagttgggt aacgccaggg ttttcccagt cacgacgttg
taaaacgacg 6300gccagtgaat tgtaatacga ctcactatag ggcgaattgg
gcccgacgtc gcatgc 6356835910DNAArtificial SequencePlasmid pY87
83catcaaagga agggtgaatc caaggaagtt cttgacaaac tgctggaatc ggtacagctt
60ggacgacttg tcgttgctaa cctggtcata gaggtcgttc tcaccaaagg ccatgatggg
120aacaagggcg acatttccga cctccatacc aagtcgaaca aaaccctttc
gcttgagtag 180caccaggtcc atgacaccgg gtctggccag aagactttcc
tgtgctccac caacgacaat 240gcagatagac tggtttcgct tgaggagggc
cttgcaggac ttcttggaga cagaagcgac 300tcccagactc atgaggtact
ctctgtagag aggcactcgg aagttgttgg tgagagtcat 360aagagaaaca
gggatgcccg gaaagagctt ggaccatcca gctccctcgg tggcaattcc
420accaaaggct cccatgccga taatgccgtg ggggtggtag ccgaagatgt
attttctgcc 480agtgggcttg agttttgtgg gcgacagctg tgggtcgttt
tcgccaatga tctggttggc 540gtaggagttg agggacccgt taagaagcgt
ggaatcagat gcagtggagc cagcagaggc 600ggacgacaaa ggtcgtcggt
tagtggtgcc attgttgccg ttgccgttaa gttcggagcc 660cgaggcgtgg
ccgttggagc cagatgattc tccacggcta tatctgctgt cgtggttaat
720taactttggc cggcctttac ctgcaggata acttcgtata atgtatgcta
tacgaagtta 780tgaattctct ctcttgagct tttccataac aagttcttct
gcctccagga agtccatggg 840tggtttgatc atggttttgg tgtagtggta
gtgcagtggt ggtattgtga ctggggatgt 900agttgagaat aagtcataca
caagtcagct ttcttcgagc ctcatataag tataagtagt 960tcaacgtatt
agcactgtac ccagcatctc cgtatcgaga aacacaacaa catgccccat
1020tggacagatc atgcggatac acaggttgtg cagtatcata catactcgat
cagacaggtc 1080gtctgaccat catacaagct gaacaagcgc tccatacttg
cacgctctct atatacacag 1140ttaaattaca tatccatagt ctaacctcta
acagttaatc ttctggtaag cctcccagcc 1200agccttctgg tatcgcttgg
cctcctcaat aggatctcgg ttctggccgt acagacctcg 1260gccgacaatt
atgatatccg ttccggtaga catgacatcc tcaacagttc ggtactgctg
1320tccgagagcg tctcccttgt cgtcaagacc caccccgggg gtcagaataa
gccagtcctc 1380agagtcgccc ttaggtcggt tctgggcaat gaagccaacc
acaaactcgg ggtcggatcg 1440ggcaagctca atggtctgct tggagtactc
gccagtggcc agagagccct tgcaagacag 1500ctcggccagc atgagcagac
ctctggccag cttctcgttg ggagagggga ctaggaactc 1560cttgtactgg
gagttctcgt agtcagagac gtcctccttc ttctgttcag agacagtttc
1620ctcggcacca gctcgcaggc cagcaatgat tccggttccg ggtacaccgt
gggcgttggt 1680gatatcggac cactcggcga ttcggtgaca ccggtactgg
tgcttgacag tgttgccaat 1740atctgcgaac tttctgtcct cgaacaggaa
gaaaccgtgc ttaagagcaa gttccttgag 1800ggggagcaca gtgccggcgt
aggtgaagtc gtcaatgatg tcgatatggg ttttgatcat 1860gcacacataa
ggtccgacct tatcggcaag ctcaatgagc tccttggtgg tggtaacatc
1920cagagaagca cacaggttgg ttttcttggc tgccacgagc ttgagcactc
gagcggcaaa 1980ggcggacttg tggacgttag ctcgagcttc gtaggagggc
attttggtgg tgaagaggag 2040actgaaataa atttagtctg cagaactttt
tatcggaacc ttatctgggg cagtgaagta 2100tatgttatgg taatagttac
gagttagttg aacttataga tagactggac tatacggcta 2160tcggtccaaa
ttagaaagaa cgtcaatggc tctctgggcg tcgcctttgc cgacaaaaat
2220gtgatcatga tgaaagccag caatgacgtt gcagctgata ttgttgtcgg
ccaaccgcgc 2280cgaaaacgca gctgtcagac ccacagcctc caacgaagaa
tgtatcgtca aagtgatcca 2340agcacactca tagttggagt cgtactccaa
aggcggcaat gacgagtcag acagatactc 2400gtcgactcat cgatataact
tcgtataatg tatgctatac gaagttatcc taggtataga 2460tctcaccgta
cgtttcatga aggcgggcag aaagtactcg atggtggaga tgattgctcg
2520gaggtacttg ttctgcggcc agtatctctc agcaatcagg tgatactcct
ggacgtccag 2580agggtagtat gtgtgcgtgg gctccagatc caccgtcttg
tgcagagtta tggggaagta 2640gcggccaaag agcttccaga tgaagaagtt
tcttgaaata ggcgagtatc gcttgaccac 2700tcctccgttg gacggggagt
cgtctttaac agcgtacact acatacgcaa tcacaaatgg 2760ccagagcagt
ggaattgcgc agcatagcat gaaaattgtg aggaaagtgg gaatgctgaa
2820aatgtgccag accagagaga aggtctcaca tcggttgagt aatggtgtcg
atagcggggc 2880atatcggatt cccgcgattt tgggtgccgt gtcgtttttg
tctcgcgact tgtagtattg 2940tgagtcgata gtcatagctt ttgttttgtg
tgacttgtct gttgcctgtt gttagaagaa 3000aaagtgggag cttatcagtc
acggtccacg aacgatttcg tacttgtacg taattggtcg 3060tgagaactgt
tgcagagccg gtgctttttt ttgtggccaa gtcgacaggt cgatttcggc
3120gctgtgcgag gttgctggga tgtgctggtt tggctgccaa atgtggggaa
gatttcaacc 3180tcggatttga cgtgtgtaga ggcgcgccag ctgcattaat
gaatcggcca acgcgcgggg 3240agaggcggtt tgcgtattgg gcgctcttcc
gcttcctcgc tcactgactc gctgcgctcg 3300gtcgttcggc tgcggcgagc
ggtatcagct cactcaaagg cggtaatacg gttatccaca 3360gaatcagggg
ataacgcagg aaagaacatg tgagcaaaag gccagcaaaa ggccaggaac
3420cgtaaaaagg ccgcgttgct ggcgtttttc cataggctcc gcccccctga
cgagcatcac 3480aaaaatcgac gctcaagtca gaggtggcga aacccgacag
gactataaag ataccaggcg 3540tttccccctg gaagctccct cgtgcgctct
cctgttccga ccctgccgct taccggatac 3600ctgtccgcct ttctcccttc
gggaagcgtg gcgctttctc atagctcacg ctgtaggtat 3660ctcagttcgg
tgtaggtcgt tcgctccaag ctgggctgtg tgcacgaacc ccccgttcag
3720cccgaccgct gcgccttatc cggtaactat cgtcttgagt ccaacccggt
aagacacgac 3780ttatcgccac tggcagcagc cactggtaac aggattagca
gagcgaggta tgtaggcggt 3840gctacagagt tcttgaagtg gtggcctaac
tacggctaca ctagaagaac agtatttggt 3900atctgcgctc tgctgaagcc
agttaccttc ggaaaaagag ttggtagctc ttgatccggc 3960aaacaaacca
ccgctggtag cggtggtttt tttgtttgca agcagcagat tacgcgcaga
4020aaaaaaggat ctcaagaaga tcctttgatc ttttctacgg ggtctgacgc
tcagtggaac 4080gaaaactcac gttaagggat tttggtcatg agattatcaa
aaaggatctt cacctagatc 4140cttttaaatt aaaaatgaag ttttaaatca
atctaaagta tatatgagta aacttggtct 4200gacagttacc aatgcttaat
cagtgaggca cctatctcag cgatctgtct atttcgttca 4260tccatagttg
cctgactccc cgtcgtgtag ataactacga tacgggaggg cttaccatct
4320ggccccagtg ctgcaatgat accgcgagac ccacgctcac cggctccaga
tttatcagca 4380ataaaccagc cagccggaag ggccgagcgc agaagtggtc
ctgcaacttt atccgcctcc 4440atccagtcta ttaattgttg ccgggaagct
agagtaagta gttcgccagt taatagtttg 4500cgcaacgttg ttgccattgc
tacaggcatc gtggtgtcac gctcgtcgtt tggtatggct 4560tcattcagct
ccggttccca acgatcaagg cgagttacat gatcccccat gttgtgcaaa
4620aaagcggtta gctccttcgg tcctccgatc gttgtcagaa gtaagttggc
cgcagtgtta 4680tcactcatgg ttatggcagc actgcataat tctcttactg
tcatgccatc cgtaagatgc 4740ttttctgtga ctggtgagta ctcaaccaag
tcattctgag aatagtgtat gcggcgaccg 4800agttgctctt gcccggcgtc
aatacgggat aataccgcgc cacatagcag aactttaaaa 4860gtgctcatca
ttggaaaacg ttcttcgggg cgaaaactct caaggatctt accgctgttg
4920agatccagtt cgatgtaacc cactcgtgca cccaactgat cttcagcatc
ttttactttc 4980accagcgttt ctgggtgagc aaaaacagga aggcaaaatg
ccgcaaaaaa gggaataagg 5040gcgacacgga aatgttgaat actcatactc
ttcctttttc aatattattg aagcatttat 5100cagggttatt gtctcatgag
cggatacata tttgaatgta tttagaaaaa taaacaaata 5160ggggttccgc
gcacatttcc ccgaaaagtg ccacctgatg cggtgtgaaa taccgcacag
5220atgcgtaagg agaaaatacc gcatcaggaa attgtaagcg ttaatatttt
gttaaaattc 5280gcgttaaatt tttgttaaat cagctcattt tttaaccaat
aggccgaaat cggcaaaatc 5340ccttataaat caaaagaata gaccgagata
gggttgagtg ttgttccagt ttggaacaag 5400agtccactat taaagaacgt
ggactccaac gtcaaagggc gaaaaaccgt ctatcagggc 5460gatggcccac
tacgtgaacc atcaccctaa tcaagttttt tggggtcgag gtgccgtaaa
5520gcactaaatc ggaaccctaa agggagcccc cgatttagag cttgacgggg
aaagccggcg 5580aacgtggcga gaaaggaagg gaagaaagcg aaaggagcgg
gcgctagggc gctggcaagt 5640gtagcggtca cgctgcgcgt aaccaccaca
cccgccgcgc ttaatgcgcc gctacagggc 5700gcgtccattc gccattcagg
ctgcgcaact gttgggaagg gcgatcggtg cgggcctctt 5760cgctattacg
ccagctggcg aaagggggat gtgctgcaag gcgattaagt tgggtaacgc
5820cagggttttc ccagtcacga cgttgtaaaa cgacggccag tgaattgtaa
tacgactcac 5880tatagggcga attgggcccg acgtcgcatg
59108434DNAEscherichia coli 84ataacttcgt ataatgtatg ctatacgaag ttat
348520DNAArtificial SequencePrimer UP 768 85acccgtgttt cgtctaaaag
208622DNAArtificial SequencePrimer LP 769 86ggtagataca agtggcaata
ac 22
uspto.report is an independent third-party trademark research tool that is not affiliated, endorsed, or sponsored by the United States Patent and Trademark Office (USPTO) or any other governmental organization. The information provided by uspto.report is based on publicly available data at the time of writing and is intended for informational purposes only.
While we strive to provide accurate and up-to-date information, we do not guarantee the accuracy, completeness, reliability, or suitability of the information displayed on this site. The use of this site is at your own risk. Any reliance you place on such information is therefore strictly at your own risk.
All official trademark data, including owner information, should be verified by visiting the official USPTO website at www.uspto.gov. This site is not intended to replace professional legal advice and should not be used as a substitute for consulting with a legal professional who is knowledgeable about trademark law.