Production Of Quinone Derived Compounds In Oleaginous Yeast And Fungi

Bailey; Richard B. ;   et al.

Patent Application Summary

U.S. patent application number 14/105136 was filed with the patent office on 2014-10-23 for production of quinone derived compounds in oleaginous yeast and fungi. This patent application is currently assigned to DSM IP ASSETS b.v.. The applicant listed for this patent is DSM IP ASSETS b.v.. Invention is credited to Richard B. Bailey, Kevin T. Madden, Joshua Trueheart.

Application Number20140315279 14/105136
Document ID /
Family ID38476163
Filed Date2014-10-23

United States Patent Application 20140315279
Kind Code A1
Bailey; Richard B. ;   et al. October 23, 2014

PRODUCTION OF QUINONE DERIVED COMPOUNDS IN OLEAGINOUS YEAST AND FUNGI

Abstract

The present invention provides systems for producing engineered oleaginous yeast or fungi that express quinone derived compounds.


Inventors: Bailey; Richard B.; (South Natick, MA) ; Madden; Kevin T.; (Arlington, MA) ; Trueheart; Joshua; (Concord, MA)
Applicant:
Name City State Country Type

DSM IP ASSETS b.v.

Heerlen

NL
Assignee: DSM IP ASSETS b.v.
Heerlen
NL

Family ID: 38476163
Appl. No.: 14/105136
Filed: December 12, 2013

Related U.S. Patent Documents

Application Number Filing Date Patent Number
12293237 Mar 9, 2009 8633009
PCT/US07/06834 Mar 20, 2007
14105136
60784499 Mar 20, 2006
60848064 Sep 28, 2006

Current U.S. Class: 435/254.21 ; 435/254.11; 435/254.2
Current CPC Class: C12P 7/6463 20130101; C12P 7/66 20130101; C12P 7/6481 20130101; C12N 15/81 20130101; C12N 15/815 20130101; C12N 15/52 20130101
Class at Publication: 435/254.21 ; 435/254.11; 435/254.2
International Class: C12N 15/81 20060101 C12N015/81

Claims



1. A recombinant fungus characterized by: a. the recombinant fungus is oleaginous in that it can accumulate lipid to at least about 20% of its dry cell weight; and b. the recombinant fungus produces at least one quinone derived compound, and can accumulate the produced quinone derived compound to at least about 1% of its dry cell weight; wherein the recombinant fungus comprises at least one modification as compared with a parental fungus, which parental fungus both is not oleaginous and does not accumulate the quinone derived compound to at least about 1% of its dry cell weight, the at least one modification being selected from the group consisting of quinonogenic modifications, oleaginic modifications, and combinations thereof, and wherein the at least one modification alters oleaginicity of the recombinant fungus, confers to the recombinant fungus oleaginy, confers to the recombinant fungus the ability to produce the at least one quinone derived compound to a level at least about 1% of its dry cell weight, or confers to the recombinant fungus the ability to produce at least one quinone derived compound which the parental fungus does not produce.

2. A recombinant fungus characterized by: a. the recombinant fungus is oleaginous in that it can accumulate lipid to at least about 20% of its dry cell weight; and b. the recombinant fungus produces at least one quinone derived compound selected from the group consisting of: a ubiquinone, a vitamin K compound, and a vitamin E compound, and combinations thereof, and can accumulate the produced quinone derived compound to at least about 1% of its dry cell weight; wherein the recombinant fungus comprises at least one modification as compared with a parental fungus, the at least one modification being selected from the group consisting of quinonogenic modifications, oleaginic modifications, and combinations thereof, and wherein the at least one modification alters oleaginicity of the recombinant fungus, confers to the recombinant fungus oleaginy, confers to the recombinant fungus the ability to produce the at least one quinone derived compound to a level at least about 1% of its dry cell weight, or confers to the recombinant fungus the ability to produce at least one quinone derived compound which the parental fungus does not naturally produce.

3-4. (canceled)

5. The recombinant fungus of claim 1 wherein the recombinant fungus is a member of a genus selected from the group consisting of: Saccharomyces, Xanthophyllomyces (Phaffia), and Yarrowia.

6-12. (canceled)

13. The recombinant fungus of claim 5 wherein the recombinant fungus is of the species Yarrowia lipolytica.

14-21. (canceled)

22. The recombinant fungus of claim 1 wherein the quinone derived compound is a ubiquinone.

23-25. (canceled)

26. The recombinant fungus of claim 1 wherein the quinone derived compound is a vitamin K compound.

27-28. (canceled)

29. The recombinant fungus of claim 1 wherein the quinone derived compound is a vitamin E compound.

30-53. (canceled)

54. The recombinant fungus of claim 1 wherein the fungus contains at least one quinonogenic modification.

55-59. (canceled)

60. The recombinant fungus of claim 54, wherein the at least one quinonogenic modification increases expression or activity of a quinonogenic polypeptide.

61-72. (canceled)

73. A recombinant fungus according to claim 1 wherein the fungus accumulates the produced at least one quinone derived compound to a level selected from the group consisting of: above about 1%, above about 2%, above about 3%, above about 5%, and above about 10% of the fungus' dry cell weight.

74-75. (canceled)

76. A strain of Yarrowia lipolytica comprising one or more modifications selected from the group consisting of an oleaginic modification, a quinonogenic modification, and combinations thereof, such that the strain accumulates from 1% to 15% of its dry cell weight as at least one quinone derived compound.

77-329. (canceled)
Description



CROSS REFERENCE TO RELATED APPLICATIONS

[0001] The present application is a divisional of U.S. application Ser. No. 12/293,237, filed Sep. 16, 2008, which is the National Stage under 35 U.S.C. .sctn.371 of International Application No. PCT/US2007/006834, filed Mar. 20, 2007, which claims the benefit under 35 U.S.C. .sctn.119(e) of U.S. provisional patent application Ser. No. 60/784,499, filed Mar. 20, 2006, and claims the benefit under 35U.S.C. .sctn.119(e) of U.S. provisional patent application Ser. No. 60/848,064, filed Sep. 28, 2006; the entire contents of these applications are incorporated herein by reference.

BACKGROUND OF THE INVENTION

[0002] Compounds containing or derived from quinone structures (i.e., "quinone derived compounds") play a variety of important roles in biological systems. Certain vitamins, for example, have quinone moieties (e.g., vitamin K) or are derived from such structures (e.g., vitamin E). Ubiquinones also represent an important class of quinone derived compounds. Ubiquinones play a critical function in the production of cellular ATP, serving as electron carriers in the respiratory chain. Furthermore, ubiquinones are lipophilic antioxidants that are capable of regenerating other antioxidants such as ascorbate and tocopherols.

[0003] Although many quinone derived compounds are naturally produced by the human body, they are often produced at levels lower than are required for optimal cell function. Accordingly, they must be obtained through the diet or by other means. There is a rapidly growing market for nutritional supplements and other products containing quinone derived compounds. Improved systems for enabling cost effective production, isolation, and/or formulation of such compounds are needed to effectively meet the growing demand.

SUMMARY OF THE INVENTION

[0004] The present invention provides improved systems for the biological production of quinone derived compounds. In particular, the present invention provides improved systems for the biological production of ubiquinone(s) (e.g., CoQ10 and/or C.sub.5-9 quinone compounds), vitamin K compounds, and vitamin E compounds.

[0005] In one aspect, the invention encompasses the discovery that it is desirable to produce such quinone derived compounds in oleaginous organisms. The present invention thus provides biological systems able to accumulate one or more quinone derived compounds (e.g., ubiquinones, vitamin K compounds, and/or vitamin E compounds) in lipid bodies. In some embodiments, the biological systems may produce higher levels of quinone derived compounds when the compound(s) is/are sequestered in lipid bodies. Regardless of whether absolute levels are higher; however, compounds that are accumulated within lipid bodies in oleaginous organisms are readily isolatable through isolation of the lipid bodies.

[0006] The present invention therefore provides oleaginous fungi (including, for example, yeast or other unicellular fungi) that produce one or more quinone derived compounds (e.g., ubiquinones, vitamin K compounds, and/or vitamin E compounds). The present invention also provides methods of constructing such yeast and fungi and methods of using such yeast and fungi to produce the quinone derived compounds. The present invention further provides methods of preparing quinone derived compounds, as well as compositions containing them, such as food or feed additives, nutritional supplements, or compositions for nutraceutical, pharmaceutical and/or cosmetic applications. In particular, the present invention provides systems and methods for generating yeast and fungi containing one or more oleaginic and/or quinonogenic modifications that increase the oleaginicity and/or alter their ability to produce one or more quinone derived compounds as compared with otherwise identical organisms that lack the modification(s). In some embodiments, the present invention provides a recombinant fungus. In certain embodiments, the recombinant fungus is oleaginous in that it can accumulate lipid to at least about 20% of its dry cell weight; and produces at least one quinone derived compound, and can accumulate the produced quinone derived compound to at least about 1% of its dry cell weight; wherein the recombinant fungus comprises at least one modification as compared with a parental fungus, which parental fungus both is not oleaginous and does not accumulate the quinone derived compound to at least about 1% of its dry cell weight, the at least one modification being selected from the group consisting of quinonogenic modifications, oleaginic modifications, and combinations thereof, and wherein the at least one modification alters oleaginicity of the recombinant fungus, confers to the recombinant fungus oleaginy, confers to the recombinant fungus the ability to produce the at least one quinone derived compound to a level at least about 1% of its dry cell weight, or confers to the recombinant fungus the ability to produce at least one quinone derived compound which the parental fungus does not produce.

[0007] In other embodiments, the recombinant fungus is oleaginous in that it can accumulate lipid to at least about 20% of its dry cell weight; and the recombinant fungus produces at least one quinone derived compound selected from the group consisting of: a ubiquinone (including, but not limited to, coenzyme Q10 and/or a C.sub.5-9 quinone compound), a vitamin K compound, a vitamin E compound, and combinations thereof, and can accumulate the produced quinone derived compound to at least about 1% of its dry cell weight; wherein the recombinant fungus comprises at least one modification as compared with a parental fungus, the at least one modification being selected from the group consisting of quinonogenic modifications, oleaginic modifications, and combinations thereof, and wherein the at least one modification alters oleaginicity of the recombinant fungus, confers to the recombinant fungus oleaginy, confers to the recombinant fungus the ability to produce the at least one quinone derived compound to a level at least about 1% of its dry cell weight, or confers to the recombinant fungus the ability to produce at least one quinone derived compound which the parental fungus does not naturally produce.

[0008] In some embodiments, the recombinant fungus is oleaginous in that it can accumulate lipid to at least about 20% of its dry cell weight; and the recombinant fungus produces at least one quinone derived compound, and can accumulate the produced quinone derived compound to at least about 1% of its dry cell weight; wherein the recombinant fungus is a member of a genus selected from the group consisting of: Aspergillus, Blakeslea, Botrytis, Candida, Cercospora, Cryptococcus, Cunninghamella, Fusarium (Gibberella), Kluyveromyces, Lipomyces, Mortierella, Mucor, Neurospora, Penicillium, Phycomyces, Pichia (Hansenula), Puccinia, Pythium, Rhodosporidium, Rhodotorula, Saccharomyces, Sclerotium, Trichoderma, Trichosporon, Xanthophyllomyces (Phaffia), and Yarrowia; or is a species selected from the group consisting of: Aspergillus terreus, Aspergillus nidulans, Aspergillus niger, Blakeslea trispora, Botrytis cinerea, Candida japonica, Candida pulcherrima, Candida revkaufi, Candida tropicalis, Candida utilis, Cercospora nicotianae, Cryptococcus curvatus, Cunninghamella echinulata, Cunninghamella elegans, Fusarium fujikuroi (Gibberella zeae), Kluyveromyces lactis, Lipomyces starkeyi, Lipomyces lipoferus, Mortierella alpina, Mortierella ramanniana, Mortierella isabellina, Mortierella vinacea, Mucor circinelloides, Neurospora crassa, Phycomyces blakesleanus, Pichia pastoris, Puccinia distincta, Pythium irregulare, Rhodosporidium toruloides, Rhodotorula glutinis, Rhodotorula graminis, Rhodotorula mucilaginosa, Rhodotorula pinicola, Rhodotorula gracilis, Saccharomyces cerevisiae, Sclerotium rolfsii, Trichoderma reesei, Trichosporon cutaneum, Trichosporon pullans, Xanthophyllomyces dendrorhous (Phaffia rhodozyma), and Yarrowia lipolytica; wherein the recombinant fungus comprises at least one modification as compared with a parental fungus, the at least one modification being selected from the group consisting of quinonogenic modifications, oleaginic modifications, and combinations thereof, and wherein the at least one modification alters oleaginicity of the recombinant fungus, confers to the recombinant fungus oleaginy, confers to the recombinant fungus the ability to produce the at least one quinone derived compound to a level at least about 1% of its dry cell weight, or confers to the recombinant fungus the ability to produce at least one quinone derived compound which the parental fungus does not naturally produce.

[0009] In some embodiments, the recombinant fungus is oleaginous in that it can accumulate lipid to at least about 20% of its dry cell weight; and the recombinant fungus produces at least one quinone derived compound selected from the group consisting of: a ubiquinone (including, but not limited to, coenzyme Q10 and/or a C.sub.5-9 quinone compound), a vitamin K compound, a vitamin E compound, and combinations thereof, and can accumulate the produced quinone derived compound to at least about 1% of its dry cell weight; wherein the recombinant fungus is a member of a genus selected from the group consisting of: Aspergillus, Blakeslea, Botrytis, Candida, Cercospora, Cryptococcus, Cunninghamella, Fusarium (Gibberella), Kluyveromyces, Lipomyces, Mortierella, Mucor, Neurospora, Penicillium, Phycomyces, Pichia (Hansenula), Puccinia, Pythium, Rhodosporidium, Rhodotorula, Saccharomyces, Sclerotium, Trichoderma, Trichosporon, Xanthophyllomyces (Phaffia), and Yarrowia; or is of a species selected from the group consisting of: Aspergillus terreus, Aspergillus nidulans, Aspergillus niger, Blakeslea trispora, Botrytis cinerea, Candida japonica, Candida pulcherrima, Candida revkaufi, Candida tropicalis, Candida utilis, Cercospora nicotianae, Cryptococcus curvatus, Cunninghamella echinulata, Cunninghamella elegans, Fusarium fujikuroi (Gibberella zeae), Kluyveromyces lactis, Lipomyces starkeyi, Lipomyces lipoferus, Mortierella alpina, Mortierella ramanniana, Mortierella isabellina, Mortierella vinacea, Mucor circinelloides, Neurospora crassa, Phycomyces blakesleanus, Pichia pastoris, Puccinia distincta, Pythium irregulare, Rhodosporidium toruloides, Rhodotorula glutinis, Rhodotorula graminis, Rhodotorula mucilaginosa, Rhodotorula pinicola, Rhodotorula gracilis, Saccharomyces cerevisiae, Sclerotium rolfsii, Trichoderma reesei, Trichosporon cutaneum, Trichosporon pullans, Xanthophyllomyces dendrorhous (Phaffia rhodozyma), and Yarrowia lipolytica; wherein the recombinant fungus comprises at least one modification as compared with a parental fungus, the at least one modification being selected from the group consisting of quinonogenic modifications, oleaginic modifications, and combinations thereof, and wherein the at least one modification alters oleaginicity of the recombinant fungus, confers to the recombinant fungus oleaginy, confers to the recombinant fungus the ability to produce the at least one quinone derived compound to a level at least about 1% of its dry cell weight, or confers to the recombinant fungus the ability to produce at least one quinone derived compound which the parental fungus does not naturally produce.

[0010] In some embodiments, the recombinant fungus accumulates the produced at least one quinone derived compound to a level selected from the group consisting of: above about 1%, above about 2%, above about 3%, above about 5%, and above about 10% of the fungus' dry cell weight.

[0011] In some embodiments, the present invention provides an engineered S. cerevisiae strain, comprising one or more quinonogenic modifications, wherein the one or more quinonogenic modifications are selected from the group consisting of: increased expression or activity of a decaprenyl diphosphate synthase polypeptide; increased expression or activity of an 4-hydroxybenzoate polyprenyl polypeptide; increased expression or activity of a GGPP synthase polypeptide; increased expression or activity of an FPP synthase polypeptide; increased expression or activity of an HMG CoA reductase polypeptide; decreased expression or activity of a squalene synthase polypeptide; decreased expression of activity of a prenyldiphosphate synthase polypeptide; increased expression or activity of a DAHP synthase polypeptide; increased expression or activity of a chorismate lyase polypeptide; increased expression or activity of a chorismate synthase polypeptide; increased expression or activity of an ATP-citrate lyase polypeptide; increased expression or activity of an AMP deaminase polypeptide; increased expression or activity of a cytosolic malic enzyme polypeptide; and combinations thereof.

[0012] In some embodiments, the present invention provides an engineered Y. lipolytica strain containing a truncated HMG CoA reductase polypeptide.

[0013] In some embodiments, the present invention provides an engineered Y. lipolytica strain having increased expression or activity of a decaprenyl diphosphate synthase gene. In some embodiments, the present invention provides an engineered Y. lipolytica strain having decreased expression or activity of a squalene synthase polypeptide. In some embodiments, the present invention provides an engineered Y. lipolytica strain having decreased expression or activity of a squalene synthase polypeptide. In some embodiments, the present invention provides an engineered Y. lipolytica strain having increased expression or activity of a 4-hydroxybenzoate polyprenyl polypeptide. In some embodiments, the present invention provides an engineered Y. lipolytica strain having increased expression or activity of a DAHP synthase polypeptide. In some embodiments, the present invention provides an engineered Y. lipolytica strain having increased expression or activity of a chorismate lyase polypeptide. In some embodiments, the present invention provides an engineered Y. lipolytica strain having increased expression or activity of an ATP-citrate lyase polypeptide. In some embodiments, the present invention provides an engineered Y. lipolytica strain having increased expression or activity of a AMP deaminase polypeptide. In some embodiments, the present invention provides an engineered Y. lipolytica strain having increased expression or activity of a cytosolic malic polypeptide. In some embodiments, the present invention provides an engineered Y. lipolytica strain having increased expression or activity of a GGPP synthase polypeptide. In some embodiments, the present invention provides an engineered Y. lipolytica strain having increased expression or activity of an FPP synthase polypeptide. In some embodiments, the present invention provides an engineered Y. lipolytica strain having increased expression or activity of a chorismate synthase polypeptide.

[0014] In some embodiments, the present invention provides a strain of Y. lipolytica comprising one or more modifications selected from the group consisting of an oleaginic modification, a quinonogenic modification, and combinations thereof, such that the strain accumulates from 1% to 15% of its dry cell weight as at least one quinone derived compound.

[0015] In certain embodiments, the present invention provides an engineered Y. lipolytica strain that produces a ubiquinone, the strain containing one or more quinonogenic modifications selected from the group consisting of: increased expression or activity of a Y. lipolytica GGPP synthase polypeptide; expression or activity of a truncated HMG CoA reductase polypeptide; increased expression or activity of a 4-hydroxybenzoate polyprenyl transferase polypeptide; increased expression or activity of a decaprenyl diphosphate synthase polypeptide; increased expression or activity of a DAHP synthase polypeptide; increased expression or activity of a chorismate lyase polypeptide; increased expression or activity of a shikimate pathway polypeptide; increased expression or activity of a chorismate mutase polypeptide; increased expression or activity of a transketolase polypeptide; increased expression or activity of an FPP synthase polypeptide; increased expression or activity of an IPP isomerase polypeptide; increased expression or activity of an HMG-CoA synthase polypeptide; increased expression or activity of a mevalonate kinase polypeptide; increased expression or activity of a phosphomevalonate kinase polypeptide; increased expression or activity of a mevalonate pyrophosphate decarboxylate polypeptide; increased expression or activity of a cytosolic malic enzyme polypeptide; increased expression or activity of a malate dehydrogenase polypeptide; increased expression or activity of an AMP deaminase polypeptide; increased expression or activity of a glucose 6 phosphate dehydrogenase polypeptide; increased expression or activity of a malate dehydrogenase homolog 2 polypeptide; increased expression or activity of a GND1-6-phosphogluconate dehydrogenase polypeptide; increased expression or activity of a isocitrate dehydrogenase polypeptide; increased expression or activity of a IDH2-isocitrate dehydrogenase polypeptide; increased expression or activity of a fructose 1,6 bisphosphatase polypeptide; increased expression or activity of a Erg10-acetoacetyl CoA thiolase polypeptide; increased expression or activity of a ATP citrate lyase subunit 2 polypeptide; increased expression or activity of a ATP citrate lyase subunit 1 polypeptide; decreased expression or activity of a squalene synthase polypeptide; decreased expression or activity of a prenyldiphosphate synthase polypeptide; or decreased expression or activity of a PHB polyprenyltransferase polypeptide; and combinations thereof.

[0016] In certain embodiments, the present invention provides a recombinant fungus characterized in that the fungus accumulates lipid in the form of cytoplasmic oil bodies.

[0017] In some embodiments, the present invention provides a composition comprising: lipid bodies; at least one quinone derived compound; and intact fungal cells.

[0018] In some embodiments, the present invention provides a method of producing a quinone derived compound, the method comprising steps of cultivating a fungus under conditions that allow production of the quinone derived compound; and isolating the produced quinone derived compound.

[0019] In some embodiments, the present invention provides a composition comprising: an oil suspension comprising: lipid bodies; at least one quinone derived compound; and intact fungal cells; and a binder or filler.

[0020] In some embodiments, the present invention provides a composition comprising: an oil suspension comprising: lipid bodies; at least one quinone derived compound; and intact fungal cells; and one or more other agents selected from the group consisting of chelating agents, pigments, salts, surfactants, moisturizers, viscosity modifiers, thickeners, emollients, fragrances, preservatives, and combinations thereof.

[0021] In some embodiments, the present invention provides an isolated quinone derived compound composition, prepared by a method comprising steps of cultivating a fungus under conditions that allow production of a quinone derived compound; and isolating the produced quinone derived compound.

[0022] In some embodiments, the present invention provides a quinone derived compound composition comprising a Y. lipolytica cell containing at least 1% quinone derived compounds by weight.

[0023] In some embodiments, the present invention provides a quinone derived compound comprising Y. lipolytica lipid bodies; and at least one quinone derived compound, wherein the at least one quinone derived compound is present at a level that is at least 1% by weight of the lipid bodies. In some embodiments, the present invention provides a composition comprising a quinone derived compound and one or more additional compounds (e.g., binders, fillers, chelating agents, pigments, salts, surfactants, moisturizers, viscosity modifiers, thickeners, emollients, fragrances, preservatives, etc. and combinations thereof).

[0024] In some embodiments, the present invention provides a feedstuff comprising a quinone derived compound in lipid bodies.

[0025] In some embodiments, the present invention provides methods comprising steps of cultivating the recombinant fungus under conditions that allow production of a quinone derived compound (e.g., ubiquinones, vitamin K compounds, and/or vitamin E compounds) and isolating the produced quinone derived compound (e.g., extracting with an organic or non-organic solvent). In certain embodiments, the recombinant fungus is cultured by growing under conditions of limiting nutrient (e.g., one or more of carbon, nitrogen, phosphate, magnesium, etc.) and/or by controlling at least one environmental parameter (e.g. one or more of nutrients, pH, temperature, pressure, oxygen concentration, timing of feeds, content of feeds, etc.) for at least a portion of the cultivation. In certain embodiments, the recombinant fungus is cultivated at a temperature of 20, 20.5, 21, 21.5, 22, 22.5, 23, 23.5, 24, 24.5, 25, 25.5, 26, 26.5, 27, 27.5, 28, 28.5, 29, 29.5, 30.degree. C. or above, and/or at one or more ranges within these temperatures. In certain embodiments, the recombinant fungus is cultivated at a pH of 4.0, 4.1, 4.2, 4.3, 4.4, 4.5, 4.6, 4.7, 4.8, 4.9, 5.0, 5.1, 5.2, 5.3, 5.4, 5.5, 5.6, 5.7, 5.8, 5.9, 6.0, 6.1, 6.2, 6.3, 6.4, 6.5, 6.6, 6.7, 6.8, 6.9, 7.0, 7.1, 7.2, 7.3, 7.4, 7.5 or above, and/or at one or more ranges within these pH's. In certain embodiments, the temperature, the pH, or both is varied during the culture period. In certain embodiments, the recombinant fungus is cultivated at an oxygen concentration within the range of about 5%, 6%, 7%, 8%, 9%, 10%, 11%, 12%, 13%, 14%, 15%, 16%, 17%, 18%, 19%, 20%, 21%, 22%, 23%, 24%, 25%, 26%, 27%, 28%, 29%, 30%, or more, and/or at one or more ranges within these concentrations. In certain embodiments, the produced quinone derived compound is isolated by crystallization.

[0026] Additional aspects of the present invention will be apparent to those of ordinary skill in the art from the present description, including the appended claims. Various polypeptides are listed in Tables 1-101; as one of ordinary skill in the art will understand, the order in which these polypeptides are listed is not indicative of their importance to the present invention. Various patent and non-patent publications are referenced throughout the present application. Unless otherwise indicated, each of these references is hereby incorporated by reference in its entirety.

BRIEF DESCRIPTION OF THE DRAWING

[0027] FIGS. 1A-1C depict various quinone derived compounds including ubiquinone/Coenzyme Q10 in its various oxidated forms (Panel A); vitamin K (Panel B); and vitamin E (Panel C).

[0028] FIG. 2 depicts how sufficient levels of acetyl-CoA and NADPH may be accumulated in the cytosol of oleaginous organisms to allow for production of significant levels of cytosolic lipids. Enzymes: 1, pyruvate decarboxylase; 2, malate dehydrogenase; 3, malic enzyme; 4, pyruvate dehydrogenase; 5, citrate synthase; 6, ATP-citrate lyase; 7, citrate/malate translocase.

[0029] FIG. 3 illustrates biosynthetic pathways of aromatic amino acids, and the shikimate pathway for production of chorismate. A depiction of how these pathways feed into ubiquinone biosynthesis is depicted. Boxed numerical references are IUBMB Enzyme Nomenclature EC numbers for enzymes catalyzing the relevant reaction.

[0030] FIG. 4 depicts ubiquinone biosynthesis pathways. FIG. 4A shows the mevalonate isoprenoid biosynthesis pathway, which typically operates in eukaryotes, including fungi; as well as the mevalonate-independent isoprenoid biosynthesis pathway, also known as the DXP pathway, which typically operates in bacteria and in the plastids of plants and production of isoprenoid precursors; FIG. 4B depicts production of para-hydroxybenzoate (23) from precursor chorismate or tyrosine, and condensation with isoprene, resulting in formation of polyprenylhydroxybenzoate (24), followed by subsequent oxygenation, decarboxylation, and methylation reactions involved in production of quinine compound, polyprenyl-6-methoxy-1,4-benzoquinol (30). FIG. 4C depicts the final steps of methylation and oxygenation for generation of the ubiquinone CoQ10.

[0031] FIG. 5, illustrates how intermediates in the isoprenoid biosynthesis pathway can be processed into biomolecules, including ubiquinones, carotenoids, sterols, steroids, and vitamins, such as vitamin E or vitamin K.

[0032] FIG. 6 illustrates how intermediates in ubiquinone biosynthesis feed into the biosynthetic pathway, and how some intermediates can be processed into other molecules.

[0033] FIGS. 7A-7C show an alignment of certain representative fungal HMG-CoA reductase polypeptides. As can be seen, these polypeptides show very high identity across the catalytic region, and also have complex membrane spanning domains. In some embodiments of the invention, these membrane-spanning domains are disrupted or are removed, so that, for example, a hyperactive version of the polypeptide may be produced.

[0034] FIG. 8 depicts the steroid biosynthesis pathways, including the isoprenoid biosynthesis pathway, it indicates where certain quinone derived compounds (e.g., a ubiquinone, vitamin E and vitamin K) are produced via geranylgeranyl pyrophosphate.

[0035] FIG. 9 depicts ubiquinone and vitamin K biosynthesis pathways.

[0036] FIG. 10, Panels A-C, depict vitamin E biosynthesis pathways. Panel A shows synthesis of .gamma.- and .alpha.-tocopherols from isopentenyl pyrophosphate and p-hydroxyphenpyruvate through action of geranylgeranylpyrophosphate synthse, geranylgeranyl reductase, p-hydroxyphenpyruvate dioxygenase, prenyl transferase, methyltransferase I, cyclase, and .gamma.-methyltransferase enzyme activities. Panel B depicts the tocopherol biosynthetic pathway in plants. Dashed arrows represent multiple steps. Enzymes are indicated by circled numbers as follows: 1) HGA phytyltransferase; 2) p-hydroxyphenyl pyruvate dioxygenase; 3) HGA dioxygenase; 4) geranylgeranyl diphosphate reductase, 5) geranylgeranyl diphsophate synthase; 6) 1-deoxy-D-xylulose-5-phosphate sunthase; 7) 2-methyl-6-phytyl-1,4-benzoquinol methyltransferase; 8) tocopherol cyclase; and 9) .gamma.-tocopherol methyltransferase. Panel C depicts synthesis of .alpha., .beta., .gamma., and .delta. tocopherols.

[0037] FIGS. 11A-11C are a Table listing certain Y. lipolytica genes representing various polypeptides useful in engineering cells in accordance with the present invention.

[0038] FIG. 12, Panels A-M depict schematic representations of plasmids generated and described in detail in the exemplification

DEFINITIONS

[0039] Aromatic amino acid biosynthesis polypeptide: The term "aromatic amino acid biosynthesis polypeptide" refers to any polypeptide that is involved in the synthesis of aromatic amino acids in yeast and/or bacteria through chorismate and the shikimate pathway. For example, as discussed herein, anthranilate synthase, enzymes of the shikimate pathway, chorismate mutase, chorismate synthase, DAHP synthase, and transketolase are all aromatic amino acid biosynthesis polypeptides. Each of these polypeptides is also a ubiquinone biosynthesis polypeptide or a ubiquinone biosynthesis competitor for purposes of the present invention, as production of chorismate is a precursor in the synthesis of para-hydroxybenzoate for the biosynthesis of a ubiquinone. Representative examples of some of these enzymes are provided in Tables 32-37.

[0040] Aromatic amino acid pathway: The "aromatic amino acid pathway" is understood in the art to refer to a metabolic pathway that produces or utilizes shikimate pathway enzymes and chorismate in the production of phenylalanine, tryptophan or tyrosine. As discussed herein, two different pathways can produce the ubiquinoid precursor para-hydroxybenzoate--the first, the "shikimate pathway" is utilized in prokaryotes and induces conversion of chorismate to para-hydroxybenzoate through the action of chorismate pyruvate lyase; the second is utilized in mammalian systems and induces induction of para-hydroxybenzoate by derivation of tyrosine or phenylalanine. The term "aromatic amino acid pathway" encompasses both of these pathways. Lower eukaryotes such as yeast can utilize either method for production of para-hydroxybenzoate.

[0041] Biosynthesis polypeptide: The term "biosynthesis polypeptide" as used herein (typically in reference to a particular compound or class of compounds), refers to polypeptides involved in the production of the compound or class of compounds. In some embodiments of the invention, biosynthesis polypeptides are synthetic enzymes that catalyze particular steps in a synthesis pathway that ultimately produces a relevant compound. In some embodiments, the term "biosynthesis polypeptide" may also encompass polypeptides that do not themselves catalyze synthetic reactions, but that regulate expression and/or activity of other polypeptides that do so.

[0042] C.sub.5-9 quinone biosynthesis polypeptide: The term "C.sub.5-9 quinone biosynthesis polypeptide" refers to any polypeptide that is involved in the synthesis of a C.sub.5-9 quinone, for example a polyprenyldiphosphate synthase polypeptide. To mention but a few, these include, for example, pentaprenyl, hexaprenyl, heptaprenyl, octaprenyl, and/or solanesyl(nonaprenyl) diphosphate synthase polypeptides (i.e., polypeptides that perform the chemical reactions performed by the pentaprenyl, hexaprenyl, heptaprenyl, octaprenyl, and solanesyl(nonaprenyl) polypeptides, respectively, listed in Tables 61-65 (see also Okada et al., Biochim. Biophys. Acta 1302:217, 1996; Okada et al., J. Bacteriol. 179:5992, 1997). As will be appreciated by those of ordinary skill in the art, in some embodiments of the invention, C.sub.5-9 quinone biosynthesis polypeptides include polypeptides that affect the expression and/or activity of one or more other C.sub.5-9 quinone biosynthesis polypeptides.

[0043] C.sub.5-9 quinone compound: The term "C.sub.5-9 quinone compound", as used herein, refers to a family of ubiquinone compounds having 5-9 isoprenoid units in the side chain.

[0044] C.sub.5-9 quinone production modification: The term "C.sub.5-9 quinone production modification", as used herein, refers to a modification of a host organism that adjusts production of one or more C.sub.5-9 quinones, as described herein. For example, a C.sub.5-9 quinone production modification may increase the production level of one or more C.sub.5-9 quinones, and/or may alter relative production levels of different C.sub.5-9 quinones. In principle, an inventive C.sub.5-9 quinone production modification may be any chemical, physiological, genetic, or other modification that appropriately alters production of one or more C.sub.5-9 quinones in a host organism produced by that organism as compared with the level produced in an otherwise identical organism not subject to the same modification. In most embodiments, however, the C.sub.5-9 quinone production modification will comprise a genetic modification, typically resulting in increased production of one or more selected C.sub.5-9 quinones. In some embodiments, the C.sub.5-9 quinone production modification comprises at least one chemical, physiological, genetic, or other modification; in other embodiments, the C.sub.5-9 quinone production modification comprises more than one chemical, physiological, genetic, or other modification. In certain aspects where more than one modification is utilized, such modifications can comprise any combination of chemical, physiological, genetic, or other modification (e.g., one or more genetic, chemical and/or physiological modification(s)).

[0045] Carotenogenic modification: The term "carotenogenic modification", as used herein, refers to a modification of a host organism that adjusts production of one or more carotenoids, as described herein. For example, a carotenogenic modification may increase the production level of one or more carotenoids, and/or may alter relative production levels of different carotenoids. In principle, an inventive carotenogenic modification may be any chemical, physiological, genetic, or other modification that appropriately alters production of one or more carotenoids in a host organism produced by that organism as compared with the level produced in an otherwise identical organism not subject to the same modification. In most embodiments, however, the carotenogenic modification will comprise a genetic modification, typically resulting in increased production of one or more selected carotenoids. In some embodiments, the carotenogenic modification comprises at least one chemical, physiological, genetic, or other modification; in other embodiments, the carotenogenic modification comprises more than one chemical, physiological, genetic, or other modification. In certain aspects where more than one modification is utilized, such modifications can comprise any combination of chemical, physiological, genetic, or other modification (e.g., one or more genetic, chemical and/or physiological modification(s)). In some embodiments, the selected carotenoid is one or more of astaxanthin, .beta.-carotene, canthaxanthin, lutein, lycopene, phytoene, zeaxanthin, and/or modifications of zeaxanthin or astaxanthin (e.g., glucoside, or other ester of zeaxanthin or astaxanthin). In some embodiments, the selected carotenoid is astaxanthin. In some embodiments, the selected carotenoid is other than .beta.-carotene.

[0046] Carotenogenic polypeptide: The term "carotenogenic polypeptide", as used herein, refers to any polypeptide that is involved in the process of producing carotenoids in a cell, and may include polypeptides that are involved in processes other than carotenoid production but whose activities affect the extent or level of production of one or more carotenoids, for example by scavenging a substrate or reactant utilized by a carotenoid polypeptide that is directly involved in carotenoid production. Carotenogenic polypeptides include isoprenoid biosynthesis polypeptides, carotenoid biosynthesis polypeptides, and isoprenoid biosynthesis competitor polypeptides, as those terms are defined herein. The term also encompasses polypeptides that may affect the extent to which carotenoids are accumulated in lipid bodies.

[0047] Carotenoid: The term "carotenoid" is understood in the art to refer to a structurally diverse class of pigments derived from isoprenoid pathway intermediates. The commitment step in carotenoid biosynthesis is the formation of phytoene from geranylgeranyl pyrophosphate. Carotenoids can be acyclic or cyclic, and may or may not contain oxygen, so that the term carotenoids include both carotenes and xanthophylls. In general, carotenoids are hydrocarbon compounds having a conjugated polyene carbon skeleton formally derived from the five-carbon compound IPP, including triterpenes (C.sub.30 diapocarotenoids) and tetraterpenes (C.sub.40 carotenoids) as well as their oxygenated derivatives and other compounds that are, for example, C.sub.35, C.sub.50, C.sub.60, C.sub.70, C.sub.80 in length or other lengths. Many carotenoids have strong light absorbing properties and may range in length in excess of C.sub.200. C.sub.30 diapocarotenoids typically consist of six isoprenoid units joined in such a manner that the arrangement of isoprenoid units is reversed at the center of the molecule so that the two central methyl groups are in a 1,6-positional relationship and the remaining non-terminal methyl groups are in a 1,5-positional relationship. Such C.sub.30 carotenoids may be formally derived from the acyclic C.sub.30H.sub.42 structure, having a long central chain of conjugated double bonds, by: (i) hydrogenation (ii) dehydrogenation, (iii) cyclization, (iv) oxidation, (v) esterification/glycosylation, or any combination of these processes. C.sub.40 carotenoids typically consist of eight isoprenoid units joined in such a manner that the arrangement of isoprenoid units is reversed at the center of the molecule so that the two central methyl groups are in a 1,6-positional relationship and the remaining non-terminal methyl groups are in a 1,5-positional relationship. Such C.sub.40 carotenoids may be formally derived from the acyclic C.sub.40H.sub.56 structure, having a long central chain of conjugated double bonds, by (i) hydrogenation, (ii) dehydrogenation, (iii) cyclization, (iv) oxidation, (v) esterification/glycosylation, or any combination of these processes. The class of C.sub.40 carotenoids also includes certain compounds that arise from rearrangements of the carbon skeleton, or by the (formal) removal of part of this structure. More than 600 different carotenoids have been identified in nature; carotenoids include but are not limited to: antheraxanthin, adonirubin, adonixanthin, astaxanthin, canthaxanthin, capsorubrin, .beta.-cryptoxanthin, .alpha.-carotene, .beta.-carotene, .beta.,.psi.-carotene, .delta.-carotene, .epsilon.-carotene, echinenone, 3-hydroxyechinenone, 3'-hydroxyechinenone, .gamma.-carotene, .psi.-carotene, 4-keto-.gamma.-carotene, .zeta.-carotene, .alpha.-cryptoxanthin, deoxyflexixanthin, diatoxanthin, 7,8-didehydroastaxanthin, didehydrolycopene, fucoxanthin, fucoxanthinol, isorenieratene, .beta.-isorenieratene, lactucaxanthin, lutein, lycopene, myxobactone, neoxanthin, neurosporene, hydroxyneurosporene, peridinin, phytoene, rhodopin, rhodopin glucoside, 4-keto-rubixanthin, siphonaxanthin, spheroidene, spheroidenone, spirilloxanthin, torulene, 4-keto-torulene, 3-hydroxy-4-keto-torulene, uriolide, uriolide acetate, violaxanthin, zeaxanthin-.beta.-diglucoside, zeaxanthin, and C30 carotenoids. Additionally, carotenoid compounds include derivatives of these molecules, which may include hydroxy-, methoxy-, oxo-, epoxy-, carboxy-, or aldehydic functional groups. Further, included carotenoid compounds include ester (e.g., glycoside ester, fatty acid ester) and sulfate derivatives (e.g., esterified xanthophylls).

[0048] Carotenoid biosynthesis polypeptide: The term "carotenoid biosynthesis polypeptide" refers to any polypeptide that is involved in the synthesis of one or more carotenoids. To mention but a few, these carotenoid biosynthesis polypeptides include, for example, polypeptides of phytoene synthase, phytoene dehydrogenase (or desaturase), lycopene cyclase, carotenoid ketolase, carotenoid hydroxylase, astaxanthin synthase, carotenoid epsilon hydroxylase, lycopene cyclase (beta and epsilon subunits), carotenoid glucosyltransferase, and Acyl CoA:diacyglycerol acyltransferase. In some instances, a single gene may encode a protein with multiple carotenoid biosynthesis polypeptide activities. Representative examples of carotenoid biosynthesis polypeptide sequences are presented in Tables 17-21 and Tables 38-41. As will be appreciated by those of ordinary skill in the art, in some embodiments of the invention, carotenoid biosynthesis polypeptides include polypeptides that affect the expression and/or activity of one or more other carotenoid biosynthesis polypeptides.

[0049] Effective amount. The term "effective amount" is used herein to describe concentrations or amounts of compounds and/or compositions that, when administered to a subject, achieve a desired therapeutic or physiological effect.

[0050] FPP biosynthesis polypeptides: The term "FPP biosynthesis polypeptide" refers to any polypeptide that is involved in the synthesis of farnesyl pyrophosphate. As discussed herein, farnesyl pyrophosphate represents the branchpoint between the sterol biosynthesis pathway and the carotenoid and other biosynthesis pathways. One specific example of an FPP biosynthesis polypeptide is FPP synthase. Representative examples of FPP synthase polypeptide sequences are presented in Table 14. As will be appreciated by those of ordinary skill in the art, in some embodiments of the invention, FPP biosynthesis polypeptides include polypeptides that affect the expression and/or activity of one or more other FPP biosynthesis polypeptides.

[0051] Gene: The term "gene", as used herein, generally refers to a nucleic acid encoding a polypeptide, optionally including certain regulatory elements that may affect expression of one or more gene products (i.e., RNA or protein).

[0052] Heterologous: The term "heterologous", as used herein to refer to genes or polypeptides, refers to a gene or polypeptide that does not naturally occur in the organism in which it is being expressed. It will be understood that, in general, when a heterologous gene or polypeptide is selected for introduction into and/or expression by a host cell, the particular source organism from which the heterologous gene or polypeptide may be selected is not essential to the practice of the present invention. Relevant considerations may include, for example, how closely related the potential source and host organisms are in evolution, or how related the source organism is with other source organisms from which sequences of other relevant polypeptides have been selected. Where a plurality of different heterologous polypeptides are to be introduced into and/or expressed by a host cell, different polypeptides may be from different source organisms, or from the same source organism. To give but one example, in some cases, individual polypeptides may represent individual subunits of a complex protein activity and/or may be required to work in concert with other polypeptides in order to achieve the goals of the present invention. In some embodiments, it will often be desirable for such polypeptides to be from the same source organism, and/or to be sufficiently related to function appropriately when expressed together in a host cell. In some embodiments, such polypeptides may be from different, even unrelated source organisms. It will further be understood that, where a heterologous polypeptide is to be expressed in a host cell, it will often be desirable to utilize nucleic acid sequences encoding the polypeptide that have been adjusted to accommodate codon preferences of the host cell and/or to link the encoding sequences with regulatory elements active in the host cell.

[0053] Host cell: As used herein, the "host cell" is a fungal cell or yeast cell that is manipulated according to the present invention to accumulate lipid and/or to express one or more quinone derived compounds as described herein. A "modified host cell", as used herein, is any host cell which has been modified, engineered, or manipulated in accordance with the present invention as compared with a parental cell. In some embodiments, the modified host cell has at least one quinonogenic and/or at least one oleagenic modification. In some embodiments, the modified host cell containing at least one oleaginic modification and/or one quinonogenic modification further has at least one sterologenic modification and/or or at least one carotenogenic modification. In some embodiments, the parental cell is a naturally occurring parental cell.

[0054] Isolated: The term "isolated", as used herein, means that the isolated entity has been separated from at least one component with which it was previously associated. When most other components have been removed, the isolated entity is "purified" or "concentrated". Isolation and/or purification and/or concentration may be performed using any techniques known in the art including, for example, fractionation, extraction, precipitation, or other separation.

[0055] Isoprenoid biosynthesis polypeptide: The term "isoprenoid biosynthesis polypeptide" refers to any polypeptide that is involved in the synthesis of isoprenoids. For example, as discussed herein, acetoacetyl-CoA thiolase, HMG-CoA synthase, HMG-CoA reductase, mevalonate kinase, phosphomevalonate kinase, mevalonate pyrophosphate decarboxylase, IPP isomerase, FPP synthase, and GGPP synthase, are all involved in the mevalonate pathway for isoprenoid biosynthesis. Each of these proteins is also an isoprenoid biosynthesis polypeptide for purposes of the present invention, and sequences of representative examples of these enzymes are provided in Tables 7-15. As will be appreciated by those of ordinary skill in the art, in some embodiments of the invention, isoprenoid biosynthesis polypeptides include polypeptides that affect the expression and/or activity of one or more other isoprenoid biosynthesis polypeptides (e.g., of one or more enzymes that participate(s) in isoprenoid synthesis). Thus, for instance, transcription factors that regulate expression of isoprenoid biosynthesis enzymes can be isoprenoid biosynthesis polypeptides for purposes of the present invention. To give but a couple of examples, the S. cerevisiae Upc2 and YLR228c genes, and the Y. lipolytica YALI0B00660g gene encode transcription factors that are isoprenoid biosynthesis polypeptides according to certain embodiments of the present invention. For instance, the semidominant upc2-1 point mutation (G888D) exhibits increased sterol levels (Crowley et al., J. Bacteriol 180:4177-4183, 1998). Corresponding YLR228c mutants have been made and tested (Shianna et al., J Bacteriol 183:830, 2001); such mutants may be useful in accordance with the present invention, as may be YALI0B00660g derivatives with corresponding upc2-1 mutation(s).

[0056] Isoprenoid pathway: The "isoprenoid pathway" is understood in the art to refer to a metabolic pathway that either produces or utilizes the five-carbon metabolite isopentyl pyrophosphate (IPP). As discussed herein, two different pathways can produce the common isoprenoid precursor IPP--the "mevalonate pathway" and the "non-mevalonate pathway". The term "isoprenoid pathway" is sufficiently general to encompass both of these types of pathway. Biosynthesis of isoprenoids from IPP occurs by polymerization of several five-carbon isoprene subunits. Isoprenoid metabolites derived from IPP are of varying size and chemical structure, including both cyclic and acyclic molecules. Isoprenoid metabolites include, but are not limited to, monoterpenes, sesquiterpenes, diterpenes, sterols, and polyprenols such as carotenoids.

[0057] Oleaginic modification: The term "oleaginic modification", as used herein, refers to a modification of a host organism that adjusts the desirable oleaginy of that host organism, as described herein. In some cases, the host organism will already be oleaginous in that it will have the ability to accumulate lipid to at least about 20% of its dry cell weight. It may nonetheless be desirable to apply an oleaginic modification to such an organism, in accordance with the present invention, for example to increase (or, in some cases, possibly to decrease) its total lipid accumulation, or to adjust the types or amounts of one or more particular lipids it accumulates (e.g., to increase relative accumulation of triacylglycerol). In other cases, the host organism, may be non-oleaginous (though may contain some enzymatic and/or regulatory components used in other organisms to accumulate lipid), and may require oleaginic modification in order to become oleaginous in accordance with the present invention. The present invention also contemplates application of oleaginic modification to non-oleaginous host strains such that their oleaginicity is increased even though, even after being modified, they may not be oleaginous as defined herein. In principle, the oleaginic modification may be any chemical, physiological, genetic, or other modification that appropriately alters oleaginy of a host organism as compared with an otherwise identical organism not subjected to the oleaginic modification. In most embodiments, however, the oleaginic modification will comprise a genetic modification, typically resulting in increased production and/or activity of one or more oleaginic polypeptides. In some embodiments, the oleaginic modification comprises at least one chemical, physiological, genetic, or other modification; in other embodiments, the oleaginic modification comprises more than one chemical, physiological, genetic, or other modification. In certain aspects where more than one modification is utilized, such modifications can comprise any combination of chemical, physiological, genetic, or other modification (e.g., one or more genetic, chemical and/or physiological modification(s)).

[0058] Oleaginic polypeptide: The term "oleaginic polypeptide", as used herein, refers to any polypeptide that is involved in the process of lipid accumulation in a cell and may include polypeptides that are involved in processes other than lipid biosynthesis but whose activities affect the extent or level of accumulation of one or more lipids, for example by scavenging a substrate or reactant utilized by an oleaginic polypeptide that is directly involved in lipid accumulation. For example, as discussed herein, acetyl-CoA carboxylase, pyruvate decarboxylase, isocitrate dehydrogenase, ATP-citrate lyase, malic enzyme, malate dehydrogenase, and AMP deaminase, among other proteins, are all involved in lipid accumulation in cells. In general, reducing the activity of pyruvate decarboxylase or isocitrate dehydrogenase, and/or increasing the activity of acetyl CoA carboxylase, ATP-citrate lyase, malic enzyme, malate dehydrogenase, and/or AMP deaminase is expected to promote oleaginy. Each of these proteins is an oleaginic peptide for the purposes of the present invention, and sequences of representative examples of these enzymes are provided in Tables 1-6, 69. Other peptides that can be involved in regenerating NADPH may include, for example, 6-phosphogluconate dehydrogenase (gnd); Fructose 1,6 bisphosphatase (fbp); Glucose 6 phosphate dehydrogenase (g6pd); NADH kinase (EC 2.7.1.86); and/or transhydrogenase (EC 1.6.1.1 and 1.6.1.2). Alternative or additional strategies to promote oleaginy may include one or more of the following: (1) increased or heterologous expression of one or more of acyl-CoA:diacylglycerol acyltransferase (e.g., DGA1; YALI0E32769g); phospholipid: diacylglycerol acyltransferase (e.g., LRO1; YALI0E16797g); and acyl-CoA:cholesterol acyltransferase (e.g., ARE genes such as ARE1, ARE2, YALI0F06578g), which are involved in triglyceride synthesis (Kalscheuer et al. Appl Environ Microbiol p. 7119-7125, 2004; Oelkers et al. J Biol Chem 277:8877-8881, 2002; and Sorger et al. J Biol Chem 279:31190-31196, 2004), (2) decreased expression of triglyceride lipases (e.g., TGL3 and/or TGL4; YALI0D17534g and/or YALI0F10010g (Kurat et al. J Biol Chem 281:491-500, 2006); and (3) decreased expression of one or more acyl-coenzyme A oxidase activities, for example encoded by POX genes (e.g. POX1, POX2, POX3, POX4, POX5; YALI0C23859g , YALI0D24750g, YALI0E06567g, YALI0E27654g, YALI0E32835g, YALI0F10857g; see, for example, Mlickova et al. Appl Environ Microbiol 70: 3918-3924, 2004; Binns et al. J Cell Biol 173:719, 2006). Each of these proteins is an oleaginic peptide for the purposes of the present invention, and sequences of representative examples of these enzymes are provided in Tables 70-85 and Tables 100-101.

[0059] Oleaginous: The term "oleaginous", as used herein, refers to ability of an organism to accumulate lipid to at least about 20% of its dry cell weight. In certain embodiments of the invention, oleaginous yeast or fungi accumulate lipid to at least about 25% of their dry cell weight. In other embodiments, inventive oleaginous yeast or fungi accumulate lipid within the range of about 20-45% (e.g., about 20%, 21%, 22%, 23%, 24%, 25%, 26%, 27%, 28%, 29%, 30%, 31%, 32%, 33%, 34%, 35%, 36%, 37%, 38%, 39%, 40%, 41%, 42%, 43%, 44%, 45%, or more) of their dry cell weight. In some embodiments, oleaginous organisms may accumulate lipid to as much as about 70% of their dry cell weight. In some embodiments of the invention, oleaginous organisms may accumulate a large fraction of total lipid accumulation in the form of triacylglycerol. In certain embodiments, the majority of the accumulated lipid is in the form of triacylglycerol. Alternatively or additionally, the lipid may accumulate in the form of intracellular lipid bodies, or oil bodies. In certain embodiments, the present invention utilizes yeast or fungi that are naturally oleaginous. In some aspects, naturally oleaginous organisms are manipulated (e.g., genetically, chemically, or otherwise) so as to further increase the level of accumulated lipid in the organism. For example, for the purposes of the present invention, Yarrowia lipolytica is a naturally oleaginous fungi. In other embodiments, yeast or fungi that are not naturally oleaginous are manipulated (e.g., genetically, chemically, or otherwise) to accumulate lipid as described herein. For example, for the purposes of the present invention, Saccharomyces cerevisiae, Xanthophyllomyces dendrorhous (Phaffia rhodozyma), and Candida utilis are not naturally oleaginous fungi.

[0060] PHB polypeptide or PHB biosynthesis polypeptide: The terms "PHB polypeptide" or "PHB biosynthesis polypeptide" as used herein refers to a polypeptide that is involved in the synthesis of para-hydroxybenzoate from chorismate. In prokaryotes and lower eukaryotes, synthesis of para-hydroxybenzoate occurs by the action of chorismate pyruvate lyase. Biosynthesis of para-hydroxybenzoate from tyrosine or phenylalanine occurs through a five-step process in mammalian cells. Lower eukaryotes such as yeast can utilize either method for production of para-hydroxybenzoate. For example, enzymes of the shikimate pathway, chorismate synthase, DAHP synthase, and transketolase are all PHB biosynthesis polypeptides. Each of these polypeptides is also a ubiquinone biosynthesis polypeptide or a ubiquinone biosynthesis competitor polypeptide for purposes of the present invention. Exemplary PHB polypeptides are provided in Tables 33 and 35-37.

[0061] Polypeptide: The term "polypeptide", as used herein, generally has its art-recognized meaning of a polymer of at least three amino acids. However, the term is also used to refer to specific functional classes of polypeptides, such as, for example, aromatic amino acid biosynthesis polypeptides, biosynthesis polypeptides, C.sub.5-9 quinone biosynthesis polypeptides, carotenogenic polypeptides, carotenoid biosynthesis polypeptides, isoprenoid biosynthesis polypeptides, oleaginic polypeptides, PHB polypeptides, PHB biosynthesis polypeptides, quinone biosynthesis polypeptides, quinonogenic polypeptides, ubiquinone biosynthesis polypeptides, ubiquinone biosynthesis competitor polypeptides, ubiquinogenic polypeptides, Vitamin E biosynthesis polypeptides, Vitamin K biosynthesis polypeptides, etc. For each such class, the present specification provides several examples of known sequences of such polypeptides. Those of ordinary skill in the art will appreciate, however, that the term "polypeptide" is intended to be sufficiently general as to encompass not only polypeptides having the complete sequence recited herein (or in a reference or database specifically mentioned herein), but also to encompass polypeptides that represent functional fragments (i.e., fragments retaining at least one activity) of such complete polypeptides. Moreover, those of ordinary skill in the art understand that protein sequences generally tolerate some substitution without destroying activity. Thus, any polypeptide that retains activity and shares at least about 30-40% overall sequence identity, often greater than about 50%, 60%, 70%, or 80%, and further usually including at least one region of much higher identity, often greater than 90% or even 95%, 96%, 97%, 98%, or 99% in one or more highly conserved regions (e.g., isocitrate dehydrogenase polypeptides often share a conserved AMP-binding motif; HMG-CoA reductase polypeptides typically include a highly conserved catalytic domain [see, for example, FIG. 7]; acetyl coA carboxylase typically has a carboxyl transferase domain; see, for example, Downing et al., Chem. Abs. 93:484, 1980; Gil et al., Cell 41:249, 1985; Jitrapakdee et al. Curr Protein Pept Sci. 4:217, 2003; U.S. Pat. No. 5,349,126, each of which is incorporated herein by reference in its entirety), usually encompassing at least 3-4 and often up to 20 or more amino acids, with another polypeptide of the same class, is encompassed within the relevant term "polypeptide" as used herein. Other regions of similarity and/or identity can be determined by those of ordinary skill in the art by analysis of the sequences of various polypeptides presented in the Tables herein.

[0062] Quinone biosynthesis polypeptide: A "quinone biosynthesis polypeptide", as that term is used herein, refers to any polypeptide involved in the synthesis of one or more quinone derived compound, as described herein. In particular, quinone biosynthesis polypeptides include ubiquinone biosynthesis polypeptides (e.g., biosynthesis polypeptides involved in production of coenzyme Q10 and/or a C.sub.5-9 quinone compound), vitamin K biosynthesis polypeptides, and vitamin E biosynthesis polypeptides.

[0063] Quinone derived compounds: The term "quinone derived compounds" is used herein to refer to certain compounds that either contain a quinone moiety or are derived from a precursor that contains a quinone moiety. In particular, quinone derived compounds according to the present invention are ubiquinones (e.g., coenzyme Q10, C.sub.5-9 quinone compounds, etc.), vitamin E compounds, and/or vitamin K compounds. Structures of representative quinone derived compounds are presented in FIG. 1.

[0064] Quinonogenic modification: The term "quinonogenic modification", as used herein, refers to refers to a modification of a host organism that adjusts production of one or more quinone derived compounds (e.g., ubiquinones, vitamin K compounds, vitamin E compounds, etc.), as described herein. For example, a quinonogenic modification may increase the production level of a particular quinone derived compound, or of a variety of different quinone derived compounds. In some embodiments of the invention, production of a particular quinone derived compound may be increased while production of other quinone derived compounds is decreased. In some embodiments of the invention, production of a plurality of different quinone derived compounds is increased. In principle, an inventive quinonogenic modification may be any chemical, physiological, genetic, or other modification that appropriately alters production of one or more quinone derived compounds in a host organism produced by that organism as compared with the level produced in an otherwise identical organism not subject to the same modification. In most embodiments, however, the quinonogenic modification will comprise a genetic modification, typically resulting in increased production of one or more quinone derived compounds (e.g., ubiquinones, vitamin K compounds, vitamin E compounds). In some embodiments, the quinonogenic modification comprises at least one chemical, physiological, genetic, or other modification; in other embodiments, the quinonogenic modification comprises more than one chemical, physiological, genetic, or other modification. In certain aspects where more than one modification is utilized, such modifications can comprise any combination of chemical, physiological, genetic, or other modification (e.g., one or more genetic, chemical and/or physiological modification(s)).

[0065] Quinonogenic polypeptide: The term "quinonogenic polypeptide", as used herein, refers to any polypeptide whose activity in a cell increases production of one or more quinone derived compounds (e.g., ubiquinones, vitamin K compounds, vitamin E compounds) in that cell. The term encompasses both polypeptides that are directly involved in quinone derived compound synthesis and those whose expression or activity affects the extent or level of production of one or more quinone derived compounds, for example by scavenging a substrate or reactant utilized by a quinone biosynthetic polypeptide that is directly involved in quinone derived compound production. Quinonogenic polypeptides include isoprenoid biosynthesis polypeptides, ubiquinone biosynthesis polypeptides (e.g., polypeptides involved in production of coenzyme Q10 and/or a C.sub.5-9 quinone compound), vitamin E biosynthesis polypeptides, and vitamin K biosynthesis polypeptides. Quinonogenic polypeptides may also include ubiquinogenic polypeptides, etc. The term also encompasses polypeptides that may affect the extent to which one or more quinone derived compounds is accumulated in lipid bodies.

[0066] Small Molecule: In general, a small molecule is understood in the art to be an organic molecule that is less than about 5 kilodaltons (Kd) in size. In some embodiments, the small molecule is less than about 3 Kd, 2 Kd, or 1 Kd. In some embodiments, the small molecule is less than about 800 daltons (D), 600 D, 500 D, 400 D, 300 D, 200 D, or 100 D. In some embodiments, small molecules are non-polymeric. In some embodiments, small molecules are not proteins, peptides, or amino acids. In some embodiments, small molecules are not nucleic acids or nucleotides. In some embodiments, small molecules are not saccharides or polysaccharides.

[0067] Source organism: The term "source organism", as used herein, refers to the organism in which a particular polypeptide sequence can be found in nature. Thus, for example, if one or more heterologous polypeptides is/are being expressed in a host organism, the organism in which the polypeptides are expressed in nature (and/or from which their genes were originally cloned) is referred to as the "source organism". Where multiple heterologous polypeptides are being expressed in a host organism, one or more source organism(s) may be utilized for independent selection of each of the heterologous polypeptide(s). It will be appreciated that any and all organisms that naturally contain relevant polypeptide sequences may be used as source organisms in accordance with the present invention. Representative source organisms include, for example, animal, mammalian, insect, plant, fungal, yeast, algal, bacterial, archaebacterial, cyanobacterial, and protozoal source organisms.

[0068] Sterol biosynthesis polypeptide: The term "sterol biosynthesis polypeptide", as used herein, refers to any polypeptide that is involved in the synthesis of one or more sterol compounds. Thus, sterol biosynthesis polypeptides can include isoprenoid biosynthesis polypeptides to the extent that they are involved in production of isopentyl pyrophosphate. Moreover, the term refers to any polypeptide that acts downstream of farnesyl pyrophosphate and in involved in the production of one or more sterol compounds. For example, sterol biosynthesis polypeptides include squalene synthase, which catalyses conversion of farnesyl pyrophosphate to presqualene pyrophosphate, and further catalyzes conversion of presqualene pyrophosphate to squalene (i.e., enzyme 2.5.1.21 in FIG. 8). In some embodiments of the invention, sterol biosynthesis polypeptides further include one or more polypeptides involved in metabolizing squalene into a vitamin D compound. Thus, sterol biosynthesis polypeptides can include one or more of the 1.14.99.7, 5.4.99.7, 5.4.99.8, 5.3.3.5, 1.14.21.6, 1.14.15.-, 1.14.13.13 enzyme polypeptides depicted in FIG. 8, as well as other enzyme polypeptides involved in the illustrated pathways. Furthermore, sterol biosynthesis polypeptides can include one or more enzyme polypeptides including, for example, C-14 demethylase (ERG9), squalene monooxygenase (ERG1), 2,3-oxidosqualene-lanosterol synthase (ERG7), C-1 demethylase (ERG11), C-14 reductase (ERG24), C-4 methyloxidase (ERG25), C-4 decarboxylase (ERG26), 3-ketoreductase (ERG27), C-24 methyltransferase (ERG6), .DELTA.8-7 isomerase (ERG2), C-5 desaturase (ERG3), C-22 desaturase (ERG5) and/or C-24 reductase (ERG4) polypeptides, and/or other polypeptides involved in producing one or more vitamin D compounds (e.g., vitamin D2, vitamin D3, or a precursor thereof). As will be appreciated by those of ordinary skill in the art, in some embodiments of the invention, sterol biosynthesis polypeptides include polypeptides that affect the expression and/or activity of one or more other sterol biosynthesis polypeptides. Thus, for instance, transcription factors that regulate expression of sterol biosynthesis enzymes can be sterol biosynthesis polypeptides for purposes of the present invention. To give but a couple of examples, the S. cerevisiae Upc2 and YLR228c genes, and the Y. lipolytica YALI0B00660g gene encode transcription factors that are sterol biosynthesis polypeptides according to certain embodiments of the present invention. For instance, the semidominant upc2-1 point mutation (G888D) exhibits increased sterol levels (Crowley et al., J. Bacteriol 180:4177-4183, 1998). Corresponding YLR228c mutants have been made and tested (Shianna et al., J Bacteriol 183:830, 2001); such mutants may be useful in accordance with the present invention, as may be YALI0B00660g derivatives with corresponding upc2-1 mutation(s). Representative examples of certain sterol biosynthesis polypeptide sequences are presented in Table 16 and 86-99.

[0069] Sterologenic modification: The term "sterologenic modification", as used herein, refers to a modification of a host organism that adjusts production of one or more sterol compounds (e.g., squalene, lanosterol, zymosterol, ergosterol, 7-dehydrocholesterol (provitamin D3), vitamin D compound(s), etc.), as described herein. For example, a sterologenic modification may increase the production level of a particular sterol compound, or of a variety of different sterol compounds. In some embodiments of the invention, production of a particular sterol compound may be increased while production of other sterol compounds is decreased. In some embodiments of the invention, production of a plurality of different sterol compounds is increased. In principle, an inventive sterologenic modification may be any chemical, physiological, genetic, or other modification that appropriately alters production of one or more sterol compounds in a host organism produced by that organism as compared with the level produced in an otherwise identical organism not subject to the same modification. In most embodiments, however, the sterologenic modification will comprise a genetic modification, typically resulting in increased production of one or more sterol compounds (e.g., squalene, lanosterol, zymosterol, ergosterol, 7-dehydrocholesterol (provitamin D3) or vitamin D compound(s)). In certain aspects where more than one modification is utilized, such modifications can comprise any combination of chemical, physiological, genetic, or other modification (e.g., one or more genetic modification and chemical or physiological modification).

[0070] Subject: The term "subject" is used throughout the present specification to describe an animal, in most instances a human, to whom inventive compositions are administered.

[0071] Ubiquinone: The term "ubiquinone" is understood in the art to refer to a structural class of quinone derivatives with or without isoprenoid side chains. Ubiquinones are described in the Merck Index, 11th Edition, Merck & Co., Inc. Rahway, N.Y., USA, Abstr. 9751 (1989), which is incorporated herein by reference. A dual nomenclature exists for these compounds and is based upon the length of the terpenoid side chain. Those which contain an isoprene side chain are also referred to by the term coenzymes Q. A benzoquinone of this family is therefore properly referred to as either "Coenzyme Qn," where n is an integer from one to twelve and designates the number of isoprenoid units in the side chain, or alternatively, "ubiquinone (x)" where x designates the total number of carbon atoms in the side chain and is a multiple of five. Typically, n is an integer ranging from 0 to 12, in particular from 1 to 12, and more particularly 6, 7, 8, 9, or 10. For example, the most common ubiquinone in animals has a ten isoprenoid side chain and is referred to as either Coenzyme Q10, ubiquinone, or ubidecarenone. In other organisms (e.g. fungi, bacteria), other ubiquinones, for example C.sub.5 (CoQ5), C.sub.6 (CoQ6), C.sub.7 (CoQ7), C.sub.8 (CoQ8), or C.sub.9 (CoQ9) (collectively C.sub.5-9) quinones are more prevalent than Coenzyme Q10 (CoQ10). As mentioned, ubiquinones may lack an isoprene side chain, and may be selected from alkylubiquinones in which the alkyl group may contain from 1 to 20 and preferably from 1 to 12 carbon atoms, such as, for example, decylubiquinones such as 6-decylubiquinone or 2,3-dimethoxy-5-decyl-1,4-ubiquinone, derivatives thereof, and mixtures thereof. Ubiquinones may exist in reduced (ubiquinol), oxidized or superoxidized states. For example, the oxidation states of the ubiquinone coenzyme Q10 are depicted in FIG. 1A.

[0072] Ubiquinone biosynthesis competitor: The term "ubiquinone biosynthesis competitor", as used herein, refers to an agent whose presence or activity in a cell reduces the level of farnesyl pyrophosphate (FPP), geranylgeranyl diphosphate (GGPP), chorismate, or any combination thereof that is available to enter the ubiquinogenic biosynthesis pathway. The term "ubiquinone biosynthesis competitor" encompasses both polypeptide (e.g. ubiquinone biosynthesis competitor polypeptides) and non-polypeptide (e.g., small molecule) inhibitor agents. Those of ordinary skill in the art will appreciate that certain competitor agents that do not act as inhibitors of ubiquinone biosynthesis generally can nonetheless act as inhibitors of biosynthesis of a particular ubiquinone compound (e.g. CoQ10 or a C.sub.5-9 quinone compound). Particular examples of ubiquinone biosynthesis competitor agents act on isoprenoid intermediates prior to FPP or GGPP, such that less FPP or GGPP is generated (see, for example, FIG. 5, FIG. 6). Squalene synthase (also called squalene synthetase) is a ubiquinone biosynthesis competitor according to the present invention; representative squalene synthase sequences are presented in Table 16. In another example, ubiquinone biosynthesis competitors include aromatic amino acid polypeptide enzymes that act on PHB precursors at or prior to chorismate, such that less chorismate is available for synthesis of para-hydroxybenzoate (see, for example FIG. 3). Anthranilate synthase is one ubiquinone biosynthesis competitor according to the present invention, and chorismate mutase another; representative anthranilate synthase and chorismate mutase polypeptides are presented in Table 32, 32B and Table 34, respectively. Those of ordinary skill in the art, considering the known metabolic pathways relating to ubiquinone production and/or metabolism (see, for example, FIG. 1 and other Figures and references herein) will readily appreciate a variety of other particular ubiquinone biosynthesis competitors, including ubiquinone biosynthesis polypeptides.

[0073] Ubiquinone biosynthesis polypeptide: The term "ubiquinone biosynthesis polypeptide" refers to any polypeptide that is involved in the synthesis of a ubiquinone (e.g., CoQ10 or a C.sub.5-9 quinone compound). To mention but a few, these ubiquinone biosynthesis polypeptides include, for example, polypeptides of polyprenyldiphosphate synthase (e.g. penta-, hexa-, hepta, octa-, nona-, deca-prenyldiphosphate synthase, PHB-polyprenyltransferase, and O-methyltransferase. Representative examples of ubiquinone biosynthesis polypeptide sequences are presented in Tables 23 (including 23b and 23c), 24-31, Table 33 as well as 35-37 (see also above PHB biosynthesis, C.sub.5-9 quinone biosynthesis polypeptides (e.g., Tables 61-65), and isoprenoid biosynthesis polypeptides which are ubiquinone biosynthesis polypeptides).

[0074] Ubiquinogenic modification: The term "ubiquinogenic modification", as used herein, refers to a modification of a host organism that adjusts production of a ubiquinone (e.g., CoQ10), as described herein. For example, a ubiquinogenic modification may increase the production level of a ubiquinone (e.g., CoQ 10 and/or a C.sub.5-9 quinone compound), and/or may alter relative levels of a ubiquinone and/or a ubiquinol. In principle, an inventive ubiquinogenic modification may be any chemical, physiological, genetic, or other modification that appropriately alters production of a ubiquinone (e.g., CoQ10 and/or a C.sub.5-9 quinone compound) in a host organism produced by that organism as compared with the level produced in an otherwise identical organism not subject to the same modification. In most embodiments, however, the ubiquinogenic modification will comprise a genetic modification, typically resulting in increased production of a ubiquinone (e.g., CoQ10 and/or a C.sub.5-9 quinone compound). In some embodiments, the ubiquinogenic modification comprises at least one chemical, physiological, genetic, or other modification; in other embodiments, the ubiquinogenic modification comprises more than one chemical, physiological, genetic, or other modification. In certain aspects where more than one modification is utilized, such modifications can comprise any combination of chemical, physiological, genetic, or other modification (e.g., one or more genetic, chemical and/or physiological modification(s)).

[0075] Ubiquinogenic polypeptide: The term "ubiquinogenic polypeptide," as used herein, refers to any polypeptide that is involved in the process of producing a ubiquinone (e.g., CoQ10 and/or a C.sub.5-9 quinone compound) in a cell, and may include polypeptides that are involved in processes other than ubiquinone production but whose expression or activity affects the extent or level of production of a ubiquinone and/or a ubiquinol, for example by scavenging a substrate or reactant utilized by a ubiquinone biosynthetic polypeptide that is directly involved in ubiquinone production. Ubiquinogenic polypeptides include isoprenoid biosynthesis polypeptides, ubiquinone biosynthesis polypeptides, and ubiquinone biosynthesis competitor polypeptides, as those terms are defined herein. The term also encompasses polypeptides that may affect the extent to which a given ubiquinone is accumulated in lipid bodies.

[0076] Vitamin D biosynthesis polypeptide: The term "vitamin D biosynthesis polypeptide" refers to any polypeptide that is involved in the synthesis of one or more vitamin D compounds. To mention but a few, these include, for example, the 1.14.99.7, 5.4.99.7, 5.4.99.8, 5.3.3.5, and/or 1.14.21.6, polypeptides depicted in FIG. 8. They further can include the hydroxylases that convert vitamin D.sub.3 to calcitriol (e.g., the 1.14.15.- and 1.14.13.13 polypeptides depicted in FIG. 8). As will be appreciated by those of ordinary skill in the art, in some embodiments of the invention, vitamin D biosynthesis polypeptides include polypeptides that affect the expression and/or activity of one or more other vitamin D biosynthesis polypeptides. Particular examples of certain vitamin D biosynthesis polypeptides are presented in Tables 86-99.

[0077] Vitamin E biosynthesis polypeptide: The term "vitamin E biosynthesis polypeptide" refers to any polypeptide that is involved in the synthesis of vitamin K. To mention but a few, these include, for example, tyrA, pds1(hppd), VTE1, HPT1(VTE2), VTE3, VTE4, and/or GGH polypeptides (i.e., polypeptides that perform the chemical reactions performed by tyrA, pds1(hppd), VTE1, HPT1(VTE2), VTE3, VTE4, and/or GGH, respectively). Particular examples of such vitamin E biosynthesis polypeptides are presented in Tables 54-60.

[0078] Vitamin E compound: The term "vitamin E compound", as used herein, refers to members of a family of structurally related compounds that have a 6-chromanol ring, an isoprenoid side chain, and the biologic activity of .alpha.-tocopherol. The term encompasses the eight known naturally occurring vitamin E compounds, the four tocopherols (.alpha., .beta., .gamma., .delta.) and four tocotrienols (.alpha., .beta., .gamma., .delta.), which all contain a hydrophilic chromanol ring and a hydrophobic side chain. The .alpha., .beta., .gamma., and .delta. forms differ from one another in the number of methyl groups on the chromanol ring. Several synthetic vitamin E compounds have also been prepared, and still others are possible (see, for example, Bramley et al., J. Sci Food Agric 80:913, 2000). .alpha.-tocopherol is a potent antioxidant, and is generally considered to be the most active vitamin E compound in humans.

[0079] Vitamin E production modification: The term "Vitamin E production modification", as used herein, refers to a modification of a host organism that adjusts production of Vitamin E. For example, a Vitamin E production modification may increase the production level of one or more vitamin E compounds, and/or may alter relative production levels of different vitamin E compounds. In principle, an inventive vitamin E production modification may be any chemical, physiological, genetic, or other modification that appropriately alters production of one or more vitamin E compounds in a host organism produced by that organism as compared with the level produced in an otherwise identical organism not subject to the same modification. In most embodiments, however, the vitamin E production modification will comprise a genetic modification, typically resulting in increased production of one or more selected vitamin E compounds. In some embodiments, the vitamin E production modification comprises at least one chemical, physiological, genetic, or other modification; in other embodiments, the vitamin E production modification comprises more than one chemical, physiological, genetic, or other modification. In certain aspects where more than one modification is utilized, such modifications can comprise any combination of chemical, physiological, genetic, or other modification (e.g., one or more genetic, chemical and/or physiological modification(s)).

[0080] Vitamin K biosynthesis polypeptide: The term "vitamin K biosynthesis polypeptide" refers to any polypeptide that is involved in the synthesis of vitamin K. To mention but a few, these include, for example, MenF, MenD, MenC, MenE, MenB, MenA, UbiE, and/or MenG polypeptides (i.e., polypeptides that perform the chemical reactions performed by MenF, MenD, MenC, MenE, MenB, MenA, UbiE, and/or MenG, respectively). Particular examples of such vitamin K biosynthesis polypeptides are presented in Tables 46-53.

[0081] Vitamin K compounds: The term "vitamin K compounds", as used herein, refers to members of a family of structurally related compounds that share a common biologic activity. In particular, vitamin K compounds are derivatives of 2-methyl-1,4-naphthoquinone that have coagulation activity. The two natural forms of vitamin K differ in the identity of their side chains at position 3. Vitamin K.sub.1, also known as phylloquinone (based on its presence in plants), has a phytyl side chain in position 3; vitamin K2, also known as menaquinone, has an isoprenyl side chain at position 3. Different forms of menaquinone, having side chains with different numbers of isoprene units (typically 4-13) are found in different types of cells.

[0082] Vitamin K production modification: The term "Vitamin K production modification", as used herein, refers to a modification of a host organism that adjusts production of Vitamin K. For example, a Vitamin K production modification may increase the production level of one or more vitamin K compounds, and/or may alter relative production levels of different vitamin K compounds. In principle, an inventive vitamin K production modification may be any chemical, physiological, genetic, or other modification that appropriately alters production of one or more vitamin K compounds in a host organism produced by that organism as compared with the level produced in an otherwise identical organism not subject to the same modification. In most embodiments, however, the vitamin K production modification will comprise a genetic modification, typically resulting in increased production of one or more selected vitamin K compounds. In some embodiments, the vitamin K production modification comprises at least one chemical, physiological, genetic, or other modification; in other embodiments, the vitamin K production modification comprises more than one chemical, physiological, genetic, or other modification. In certain aspects where more than one modification is utilized, such modifications can comprise any combination of chemical, physiological, genetic, or other modification (e.g., one or more genetic, chemical and/or physiological modification(s)).

Detailed Description of Certain Preferred Embodiments of the Invention

[0083] The present invention embraces the reasoning that quinone derived compound(s) (e.g., ubiquinones, vitamin K compounds, and vitamin E compounds) can effectively be produced in oleaginous yeast and fungi. According to the present invention, strains that both (i) accumulate lipid, often in the form of cytoplasmic oil bodies; and (ii) produce one or more quinone derived compound(s) at a level at least about 1%, of their dry cell weight, are generated through manipulation of host cells (i.e., strains, including, e.g., naturally-occurring strains and strains which have been previously modified). In certain embodiments, strains can accumulate lipid typically to at least about 20% of their dry cell weight. In some embodiments quinone derived compound(s) can be produced in the strains to at least about 2%, at least about 3%, at least about 4%, at least about 5%, at least about 6%, at least about 7%, at least about 8%, at least about 9%, at least about 10%, at least about 11%, at least about 12%, at least about 13%, at least about 14%, at least about 15%, at least about 16%, at least about 17%, at least about 18%, at least about 19%, or at least about 20% of fungus' dry cell weight. Thus, the provided manipulated strains can then be used to produce the quinone derived compound(s). In some embodiments, the compound(s) that partition(s) into the lipid bodies can readily be isolated. In some embodiments the quinone derived compound is Coenzyme Q10 (CoQ10; Ubiquinone 10; Ubiquinone 50). In some embodiments, the quinone derived compound is C.sub.5 (CoQ5), C.sub.6 (CoQ6), C.sub.7 (CoQ7), C.sub.8 (CoQ8), or C.sub.9 (CoQ9) quinone. In some embodiments, the quinone derived compound is a tocopherol or a tocotrienols. In some embodiments, the quinone derived compound is phylloquinone (vitamin K.sub.1), or menaquinone (vitamin K.sub.2).

[0084] In some embodiments, it will be desirable to balance oleaginy and production of quinone derived compound(s) in cells such that, as soon as a minimum desirable level of oleaginy is achieved, substantially all further carbon is diverted into a metabolic pathway that results in production of one or more quinone derived compounds. In some embodiments of the invention, this strategy involves engineering cells to be oleaginous; in other embodiments, it involves engineering cells to accumulate a higher level of lipid, particularly cytoplasmic lipid, than they would do in the absence of such engineering even though the engineered cells may not become "oleaginous" as defined herein. In other embodiments, the extent to which an oleaginous host cell accumulates lipid is actually reduced so that remaining carbon can be utilized in ubiquinone production.

[0085] To give but one example of adjustments that could be made to achieve a desired balance between oleaginy and production of quinone derived compound(s), we note that, while increasing acetyl CoA carboxylase expression (and/or activity) promotes oleaginy, decreasing its expression and/or activity may promote production of quinone derived compound(s). Those of ordinary skill in the art will appreciate that the expression and/or activity of acetyl CoA carboxylase, or of other polypeptides, may be adjusted up or down as desired according to the characteristics of a particular host cell of interest.

[0086] We note that engineered cells and processes of using them as described herein may provide one or more advantages as compared with unmodified cells. Such advantages may include, but are not limited to: increased yield (e.g., quinone derived compound expressed as either % dry cell weight (mg/mg) or parts per million), titer (g quinone derived compound/L), specific productivity (mg quinone derived compound g.sup.-1 biomass hour.sup.-1), and/or volumetric productivity (g quinone derived compound liter.sup.-1 hour.sup.-1)) of the desired quinone derived compound (and/or intermediates thereof), and/or decreased formation of undesirable side products (for example, undesirable intermediates).

[0087] Thus, for example, the specific productivity for one or more quinone derived compounds (e.g., ubiquinones, vitamin K compounds, and/or vitamin E compounds) or total quinone derived compounds may be at or about 0.1, at or about 0.11, at or about 0.12, at or about 0.13, at or about 0.14, at or about 0.15, at or about 0.16, at or about 0.17, at or about 0.18, at or about 0.19, at or about 0.2, at or about 0.21, at or about 0.22, at or about 0.23, at or about 0.24, at or about 0.25, at or about 0.26, at or about 0.27, at or about 0.28, at or about 0.29, at or about 0.3, at or about 0.31, at or about 0.32, at or about 0.33, at or about 0.34, at or about 0.35, at or about 0.36, at or about 0.37, at or about 0.38, at or about 0.39, at or about 0.4, at or about 0.41, at or about 0.42, at or about 0.43, at or about 0.44, at or about 0.45, at or about 0.46, at or about 0.47, at or about 0.48, at or about 0.49, at or about 0.5, at or about 0.51, at or about 0.52, at or about 0.53, at or about 0.54, at or about 0.55, at or about 0.56, at or about 0.57, at or about 0.58, at or about 0.59, at or about 0.6, at or about 0.61, at or about 0.62, at or about 0.63, at or about 0.64, at or about 0.65, at or about 0.66, at or about 0.67, at or about 0.68, at or about 0.69, at or about 0.7, at or about 0.71, at or about 0.72, at or about 0.73, at or about 0.74, at or about 0.75, at or about 0.76, at or about 0.77, at or about 0.78, at or about 0.79, at or about 0.8, at or about 0.81, at or about 0.82, at or about 0.83, at or about 0.84, at or about 0.85, at or about 0.86, at or about 0.87, at or about 0.88, at or about 0.89, at or about 0.9, at or about 0.91, at or about 0.92, at or about 0.93, at or about 0.94, at or about 0.95, at or about 0.96, at or about 0.97, at or about 0.98, at or about 0.99, at or about 1, 1.05, at or about 1.1, at or about 1.15, at or about 1.2, at or about 1.25, at or about 1.3, at or about 1.35, at or about 1.4, at or about 1.45, at or about 1.5, at or about 1.55, at or about 1.6, at or about 1.65, at or about 1.7, at or about 1.75, at or about 1.8, at or about 1.85, at or about 1.9, at or about 1.95, at or about 2 mg g.sup.-1 hour.sup.-1 or more.

[0088] Thus, for example, the volumetric productivity for one or more quinone derived compounds (e.g., ubiquinones, vitamin K compounds, and/or vitamin E compounds) or total quinone derived compounds may be at or about 0.01, at or about 0.011, at or about 0.012, at or about 0.013, at or about 0.014, at or about 0.015, at or about 0.016, at or about 0.017, at or about 0.018, at or about 0.019, at or about 0.02, at or about 0.021, at or about 0.022, at or about 0.023, at or about 0.024, at or about 0.025, at or about 0.026, at or about 0.027, at or about 0.028, at or about 0.029, at or about 0.03, at or about 0.031, at or about 0.032, at or about 0.033, at or about 0.034, at or about 0.035, at or about 0.036, at or about 0.037, at or about 0.038, at or about 0.039, at or about 0.04, at or about 0.041, at or about 0.042, at or about 0.043, at or about 0.044, at or about 0.045, at or about 0.046, at or about 0.047, at or about 0.048, at or about 0.049, at or about 0.05, at or about 0.051, at or about 0.052, at or about 0.053, at or about 0.054, at or about 0.055, at or about 0.056, at or about 0.057, at or about 0.058, at or about 0.059, at or about 0.06, at or about 0.061, at or about 0.062, at or about 0.063, at or about 0.064, at or about 0.065, at or about 0.066, at or about 0.067, at or about 0.068, at or about 0.069, at or about 0.07, at or about 0.071, at or about 0.072, at or about 0.073, at or about 0.074, at or about 0.075, at or about 0.076, at or about 0.077, at or about 0.078, at or about 0.079, at or about 0.08, at or about 0.081, at or about 0.082, at or about 0.083, at or about 0.084, at or about 0.085, at or about 0.086, at or about 0.087, at or about 0.088, at or about 0.089, at or about 0.09, at or about 0.091, at or about 0.092, at or about 0.093, at or about 0.094, at or about 0.095, at or about 0.096, at or about 0.097, at or about 0.098, at or about 0.099, at or about 0.1, 0.105, at or about 0.110, at or about 0.115, at or about 0.120, at or about 0.125, at or about 0.130, at or about 0.135, at or about 0.14, at or about 0.145, at or about 0.15, at or about 0.155, at or about 0.16, at or about 0.165, at or about 0.17, at or about 0.175, at or about 0.18, at or about 0.185, at or about 0.19, at or about 0.195, at or about 0.20 grams liter.sup.-1 hour.sup.-1 or more.

Host Cells

[0089] Those of ordinary skill in the art will readily appreciate that a variety of yeast and fungal strains exist that are naturally oleaginous or that naturally produce one or more quinone derived compounds (e.g., a ubiquinone, a vitamin K compound and/or a vitamin E compound). Any of such strains may be utilized as host strains according to the present invention, and may be engineered or otherwise manipulated to generate inventive oleaginous, quinone derived-compound-producing strains. Alternatively, strains that naturally are neither oleaginous nor quinone derived-compound-producing may be employed. Furthermore, even when a particular strain has a natural capacity for oleaginy or for production of one or more quinone derived compounds, its natural capabilities may be adjusted as described herein for optimal production of one or more particular desired compounds.

[0090] In certain embodiments, engineering or manipulation of a strain results in modification of a type of lipid and/or quinone derived compound produced. For example, a strain may be naturally oleaginous and/or may naturally produce one or more quinone derived compounds (e.g., a ubiquinone, including, e.g., CoQ5, CoQ6, CoQ8, CoQ9, CoQ10; vitamin K, including phylloquinone and/or one or more menaquinones; and/or vitamin E, including one or more tocopherols and/or one or more tocotrienols). However, engineering or modification of the strain may be employed so as to change the type or amount of lipid that is accumulated and/or to adjust production of one or more quinone derived compounds (including, for example, modulating relative amounts of particular quinone derived compounds). In some embodiments, production of CoQ10; phylloquinone; menaquinone; and/or .alpha.-tocopherol production will be optimized

[0091] When selecting a particular yeast or fungal strain for use in accordance with the present invention, it will generally be desirable to select one whose cultivation characteristics are amenable to commercial scale production. For example, it will generally (though not necessarily always) be desirable to avoid filamentous organisms, or organisms with particularly unusual or stringent requirements for growth conditions. In some embodiments of the invention, it will be desirable to utilize edible organisms as host cells, as they may optionally be formulated directly into pharmaceutical compositions, food or feed additives, or into nutritional supplements, as desired. Some embodiments of the invention utilize host cells that are genetically tractable, amenable to molecular genetics (e.g., can be efficiently transformed, especially with established or available vectors; optionally can incorporate and/or integrate multiple genes, for example sequentially; and/or have known genetic sequence; etc), devoid of complex growth requirements (e.g., a necessity for light), mesophilic (e.g., prefer growth temperatures within the range of about 20-32.degree. C.) (e.g., 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32.degree. C.), able to assimilate a variety of carbon and nitrogen sources and/or capable of growing to high cell density. Alternatively or additionally, various embodiments of the invention utilize host cells that grow as single cells rather than, for example, as mycelia.

[0092] In general, when it is desirable to utilize a naturally oleaginous organism in accordance with the present invention, any modifiable and cultivatable oleaginous organism may be employed. In certain embodiments of the invention, yeast or fungi of genera including, but not limited to, Blakeslea, Candida, Cryptococcus, Cunninghamella, Lipomyces, Mortierella, Mucor, Phycomyces, Pythium, Rhodosporidium, Rhodotorula, Thraustochytrium, Trichosporon, and Yarrowia are employed. In certain particular embodiments, organisms of species that include, but are not limited to, Blakeslea trispora, Candida pulcherrima, C. revkaufi, C. tropicalis, Cryptococcus curvatus, Cunninghamella echinulata, C. elegans, C. japonica, Lipomyces starkeyi, L. lipoferus, Mortierella alpina, M. isabellina, M. ramanniana, M. vinacea, Mucor circinelloides, Phycomyces blakesleanus, Pythium irregulare, Rhodosporidium toruloides, Rhodotorula glutinis, R. gracilis, R. graminis, R. mucilaginosa, R. pinicola, Thraustochytrium sp, Trichosporon pullans, T cutaneum, and Yarrowia lipolytica are used

[0093] Of these naturally oleaginous strains, some also naturally produce one or more quinone derived compounds and some do not. In most cases, only low levels (less than about 0.05% dry cell weight) of quinone derived compounds are produced by naturally-occurring oleaginous yeast or fungi.

[0094] In general, any organism that is naturally oleaginous but does not naturally produce one or more particular quinone derived compounds of interest (e.g., does not produce a ubiquinone such as CoQ10, vitamin K compounds, vitamin E compounds be utilized as a host cell in accordance with the present invention. In some embodiments, the organism is a yeast or fungus from a genus such as, but not limited to, Candida, Mortierella, and Yarrowia; in some embodiments, the organism is of a species including, but not limited to, Mortierella alpina and Yarrowia lipolytica. At least some fungal strains are known not to produce vitamin K.

[0095] Comparably, the present invention may utilize any naturally oleaginous organism that does naturally produce one or more quinone derived compounds as a host cell. In general, the present invention may be utilized to increase carbon flow into the isoprenoid pathway organisms that naturally produce a particular quinone derived compounds of interest and/or to shift production of one or more different quinone derived compounds. For example, the present invention may be utilized to shift production from one ubiquinone (e.g., CoQ5, CoQ6, CoQ8, CoQ9) to another (e.g., CoQ10). Introduction of one or more quinonogenic modifications (e.g., one or more ubiquinogenic modifications), such as but not limited to increased expression of one or more endogenous or heterologous quinonogenic polypeptides), in accordance with the present invention, can achieve these goals.

[0096] In certain embodiments of the invention, the utilized oleaginous, quinone derived compound-producing organism is a yeast or fungus, for example of a genus such as, but not limited to, Rhodotorula; in some embodiments, the organism is of a species such as Rhodotorula glutinis.

[0097] When it is desirable to utilize strains that are naturally non-oleaginous as host cells in accordance with the present invention, genera of non-oleaginous yeast or fungi include, but are not limited to, Aspergillus, Botrytis, Cercospora, Fusarium (Gibberella), Kluyveromyces, Neurospora, Penicillium, Pichia (Hansenula), Puccinia, Saccharomyces, Schizosaccharomyces, Sclerotium, Trichoderma, and Xanthophyllomyces (Phaffia); in some embodiments, the organism is of a species including, but not limited to, Candida utilis, Aspergillus nidulans, A. niger, A. terreus, Botrytis cinerea, Cercospora nicotianae, Fusarium fujikuroi (Gibberella zeae), Kluyveromyces lactis, K. lactis, Neurospora crassa, Pichia pastoris, Puccinia distincta, Saccharomyces cerevisiae, Schizosaccharomyces pombe, Sclerotium rolfsii, Trichoderma reesei, and Xanthophyllomyces dendrorhous (Phaffia rhodozyma).

[0098] It will be appreciated that the term "non-oleaginous", as used herein, encompasses both strains that naturally have some ability to accumulate lipid, especially cytoplasmically, but do not do so to a level sufficient to qualify as "oleaginous" as defined herein, as well as strains that do not naturally have any ability to accumulate extra lipid, e.g., extra-membranous lipid. It will further be appreciated that, in some embodiments of the invention, it will be sufficient to increase the natural level of oleaginy of a particular host cell, even if the modified cell does not qualify as oleaginous as defined herein. In some embodiments, the cell will be modified to accumulate at least about 19%, 18%, 17%, 16%, 15%, 14%, 13%, 12%, 11%, 10%, 9%, 8%, 7%, 6%, or 5% dry cell weight as lipid, so long as the accumulation level is more than that observed in the unmodified parental cell.

[0099] As with the naturally oleaginous organisms, some of the naturally non-oleaginous fungi naturally produce one or more quinone derived compounds, whereas others do not. Genera of naturally non-oleaginous fungi that do not naturally produce quinone derived compounds (such as CoQ10 or other ubiquinone, vitamin K compounds, vitamin E compounds) for instance, but that may desirably be used as host cells in accordance with the present invention include, but are not limited to, Aspergillus, Kluyveromyces, Penicillium, Saccharomyces, and Pichia; species include, but are not limited to, Candida utilis, Aspergillus niger, Xanthophyllomyces dendrorhous (Phaffia rhodozyma) and Saccharomyces cerevisiae. Saccharomyces cerevisiae, in particular, is known not to produce vitamin E or vitamin K compounds; other fungi likely also do not produce these compounds. Genera of naturally non-oleaginous fungi that do naturally produce one or more quinone derived compounds (e.g., CoQ10) and that may desirably be used as host cells in accordance with the present invention include, but are not limited to, yeasts of such genera as Schizosaccharomyces and Basidiomycetes; fungi of such genera as Tremella are capable of producing certain quinone derived compounds, and in particular are capable of producing CoQ10.

[0100] As discussed above, any of a variety of organisms may be employed as host cells in accordance with the present invention. In certain embodiments of the invention, host cells will be Y. lipolytica cells. Advantages of Y. lipolytica include, for example, tractable genetics and molecular biology, availability of genomic sequence (see, for example, Sherman et al. Nucleic Acids Res. 32(Database issue):D315-8, 2004), suitability to various cost-effective growth conditions, ability to grow to high cell density. In addition, Y. lipolytica is naturally oleaginous, such that fewer manipulations may be required to generate an oleaginous, ubiquinone-producing (e.g., CoQ10) Y. lipolytica strain than might be required for other organisms. Furthermore, there is already extensive commercial experience with Y. lipolytica.

[0101] Saccharomyces cerevisiae is also a useful host cell in accordance with the present invention, particularly due to its experimental tractability and the extensive experience that researchers have accumulated with the organism. Although cultivation of Saccharomyces under high carbon conditions may result in increased ethanol production, this can generally be managed by process and/or genetic alterations.

[0102] The edible fungus, Candida utilis is also a useful host cell in accordance with the present invention. Molecular biology tools and techniques are available in C. utilis (for example, see Iwakiri et al. (2006) Yeast 23:23-34, Iwakiri et al. (2005) Yeast 2005 22:1079-87, Iwakiri et al. (2005) Yeast 22:1049-60, Rodriquez et al. (1998) Yeast 14:1399-406, Rodriquez et al. (1998) FEMS Microbiol Lett. 165:335-40, and Kondo et al. (1995) J. Bacteriol. 177:7171-7).

[0103] Additional useful hosts include Xanthophyllomyces dendrorhous (Phaffia rhodozyma), which is experimentally tractable and naturally produces isoprenoid compounds.

[0104] Aspergillus niger and Mortierella alpina accumulate large amounts of citric acid and fatty acid, respectively; Mortierella alpina is also oleaginous.

[0105] Neurospora or Gibberella are also useful. They are not naturally oleaginous and may require extensive modification to be used in accordance with the present invention. Neurospora and Gibberella are considered relatively tractable from an experimental standpoint. Both are filamentous fungi, such that production at commercial scales can be a challenge necessary to overcome in utilization of such strains.

[0106] Mucor circinelloides is another available useful species. While its molecular genetics are generally less accessible than are those of some other organisms, it naturally produces isoprenoids and may require less modification than other species available.

[0107] Molecular genetics can be performed in Blakeslea, though significant effort may be required. Furthermore, cost-effective fermentation conditions can be challenging, as, for example, it may be required that the two mating types are mixed. Fungi of the genus Phycomyces are also possible sources which have the potential to pose fermentation process challenges, and these fungi may be less amenable to manipulate than several other potential host organisms.

[0108] Additional useful hosts include strains, such as Schizosaccharomyces pombe, Saitoella complicata, and Sporidiobolus ruineniae, which can produce certain quinone derived compounds including, for example, CoQ10.

[0109] Those of ordinary skill in the art will appreciate that the selection of a particular host cell for use in accordance with the present invention will also affect, for example, the selection of expression sequences utilized with any heterologous polypeptide to be introduced into the cell, and will also influence various aspects of culture conditions, etc. Much is known about the different gene regulatory requirements, protein targeting sequence requirements, and cultivation requirements, of different host cells to be utilized in accordance with the present invention (see, for example, with respect to Yarrowia, Barth et al. FEMS Microbiol Rev. 19:219, 1997; Madzak et al. J Biotechnol. 109:63, 2004; see, for example, with respect to Xanthophyllomyces, Verdoes et al. Appl Environ Microbiol 69: 3728-38, 2003; Visser et al. FEMS Yeast Res 4: 221-31, 2003; Martinez et al. Antonie Van Leeuwenhoek. 73(2):147-53, 1998; Kim et al. Appl Environ Microbiol. 64(5):1947-9, 1998; Wery et al. Gene. 184(1):89-97, 1997; see, for example, with respect to Saccharomyces, Guthrie and Fink Methods in Enzymology 194:1-933, 1991). In certain aspects, for example, targeting sequences of the host cell (or closely related analogs) may be useful to include for directing heterologous proteins to subcellular localization. Thus, such useful targeting sequences can be added to heterologous sequence for proper intracellular localization of activity. In other aspects (e.g., addition of mitochondrial targeting sequences), heterologous targeting sequences may be eliminated or altered in the selected heterologous sequence (e.g., alteration or removal of source organism plant chloroplast targeting sequences).

[0110] To give but a few specific examples, of promoters and/or regulatory sequences that may be employed in expression of polypeptides according to the present invention, useful promoters include, but are not limited to, the Leu2 promoter and variants thereof (see, for example, see U.S. Pat. No. 5,786,212); the EF1alpha protein and ribosomal protein S7 gene promoters (see, for example, PCT Application WO 97/44470); the Gpm (see US20050014270), Xpr2 (see U.S. Pat. No. 4,937,189), Tef1, Gpd1 (see, for example, US Application 2005-0014270A1), Cam1 (YALIOC24420g), YALI0D16467g, Tef4 (YALI0B12562g), Yef3 (YALI0E13277g), Pox2, Yat1 (see, for example US Application 2005-0130280; PCT Application WO 06/052754), Fba1 (see, for example WO05049805), and/or Gpat (see WO06031937) promoters; the sequences represented by SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 10, SEQ ID NO: 11, and SEQ ID NO: 12, subsequences thereof, and hybrid and tandem derivatives thereof (e.g., as disclosed in US Application 2004-0146975); the sequences represented by SEQ ID NO: 1, 2, or 3 including fragments (e.g. by 462-1016 and bp 917-1016 of SEQ ID NO: 1; bp 5-523 of SEQ ID NO:3) and complements thereof (e.g., as disclosed in US patent number http://patft.uspto.gov/netacgi/nph-Parser?Sect1=PTO1& Sect2=HITOFF&d=PALL&p=1&u=%2Fnetahtml%2FPTO%2Fsrchnum.htm&r=1&f=G&1=50&s1- =5952195.PN.&OS=PN/5952195&RS=PN/5952195-h0#h0http://patft.uspto.gov/netac- gi/nph-Parser?Sect1=PTO1& Sect2=HITOFF&d=PALL&p=1&u=%2Fnetahtml%2FPTO%2Fsrchnum.htm&r=1&f=G&1=50&s1- =5952195.PN.&OS=PN/5952195&RS=PN/5952195-h2#h25,952,195); CYP52A2A (see, for example, US Application 2002-0034788); promoter sequences from fungal (e.g., C. tropicalis) catalase, citrate synthase, 3-ketoacyl-CoA thiolase A, citrate synthase, O-acetylhornserine sulphydrylase, protease, carnitine O-acetyltransferase, hydratase-dehydrogenase, epimerase genes; promoter sequences from Pox4 genes (see, for example, US application 2004-0265980); and/or promoter sequences from Met2, Met3, Met6, Met25 and YALI0D12903g genes. Any such promoters can be used in conjunction with endogenous genes and/or heterologous genes for modification of expression patterns of endogenous polypeptides and/or heterologous polypeptides in accordance with the present invention.

[0111] Alternatively or additionally, regulatory sequences useful in accordance with the present invention may include one or more Xpr2 promoter fragments, for example as described in US patent http://patft.uspto.gov/netacgi/nph-Parser?Sect1=PTO1&Sect2=HITOFF&d=PALL&- p=1&u=%2Fnetahtml%2FPTO%2Fsrchnum.htm&r=1&f=G&1=50&s1=6083717.PN.&OS=PN/60- 83717&RS=PN/6083717-h0#h0http://patft.uspto.gov/netacgi/nph-Parser?Sect1=P- TO1& Sect2=HITOFF&d=PALL&p=1&u=%2Fnetahtml%2FPTO%2Fsrchnum.htm&r=1 &f=G&1=50&s1=6083717.PN.&OS=PN/6083717&RS=PN/6083717-h2#h26,083,717 (e.g. SEQ ID NOS: 1-4 also including sequences with 80% or more identity to these SEQ ID NOs) in one or more copies either in single or in tandem. Similarly, exemplary terminator sequences include, but are not limited to, Y. lipolytica Xpr2 (see U.S. Pat. No. 4,937,189) and Pox2 (YALI0F10857g) terminator sequences and those listed in example 11 herein.

Engineering Oleaginy

[0112] All living organisms synthesize lipids for use in their membranes and various other structures. However, most organisms do not accumulate in excess of about 10% of their dry cell weight as total lipid, and most of this lipid generally resides within cellular membranes.

[0113] Significant biochemical work has been done to define the metabolic enzymes necessary to confer oleaginy on microorganisms (primarily for the purpose of engineering single cell oils as commercial sources of arachidonic acid and docosahexaenoic acid; see for example Ratledge Biochimie 86:807, 2004, the entire contents of which are incorporated herein by reference). Although this biochemical work is compelling, there only have been a limited number of reports of de novo oleaginy being established through genetic engineering with the genes encoding the key metabolic enzymes. It should be noted that oleaginous organisms typically accumulate lipid only when grown under conditions of carbon excess and nitrogen limitation. The present invention further establishes that the limitation of other nutrients (e.g. phosphate or magnesium) can also induce lipid accumulation. The present invention establishes, for example, that limitation of nutrients such as phosphate and/or magnesium can induce lipid accumulation, much as is observed under conditions of nitrogen limitation. Under these conditions, the organism readily depletes the limiting nutrient but continues to assimilate the carbon source. The "excess" carbon is channeled into lipid biosynthesis so that lipids (usually triacylglycerols) accumulate in the cytosol, typically in the form of bodies.

[0114] In general, it is thought that, in order to be oleaginous, an organism must produce both acetyl-CoA and NADPH in the cytosol, which can then be utilized by the fatty acid synthase machinery to generate lipids. In at least some oleaginous organisms, acetyl-CoA is generated in the cytosol through the action of ATP-citrate lyase, which catalyzes the reaction:

citrate+CoA+ATP.fwdarw.acetyl-CoA+oxaloacetate+ADP+P.sub.i. (1)

[0115] Of course, in order for ATP-citrate lyase to generate appropriate levels of acetyl-CoA in the cytosol, it must first have an available pool of its substrate citric acid. Citric acid is generated in the mitochondria of all eukaryotic cells through the tricarboxylic acid (TCA) cycle, and can be moved into the cytosol (in exchange for malate) by citrate/malate translocase.

[0116] In most oleaginous organisms, and in some non-oleaginous organisms, the enzyme isocitrate dehydrogenase, which operates as part of the TCA cycle in the mitochondria, is strongly AMP-dependent. Thus, when AMP is depleted from the mitochondria, this enzyme is inactivated. When isocitrate dehydrogenase is inactive, isocitrate accumulates in the mitochondria. This accumulated isocitrate is then equilibrated with citric acid, presumably through the action of aconitase. Therefore, under conditions of low AMP, citrate accumulates in the mitochondria. As noted above, mitochondrial citrate is readily transported into the cytosol.

[0117] AMP depletion, which in oleaginous organisms is believed to initiate the cascade leading to accumulation of citrate (and therefore acetyl-CoA) in the cytoplasm, occurs as a result of the nutrient depletion mentioned above. When oleaginous cells are grown in the presence of excess carbon source but the absence of nitrogen or other nutrient (e.g., phosphate or magnesium), the activity of AMP deaminase, which catalyzes the reaction:

AMP.fwdarw.inosine 5'-monophosphate+NH.sub.3 (2)

is strongly induced. The increased activity of this enzyme depletes cellular AMP in both the cytosol and the mitochondria. Depletion of AMP from the mitochondria is thought to inactivate the AMP-dependent isocitrate dehydrogenase, resulting in accumulation of citrate in the mitochondria and, therefore, the cytosol. This series of events is depicted diagrammatically in FIG. 2.

[0118] As noted above, oleaginy requires both cytosolic acetyl-CoA and cytosolic NADPH. It is believed that, in many oleaginous organisms, appropriate levels of cytosolic NADPH are provided through the action of malic enzyme (Enzyme 3 in FIG. 2). Some oleaginous organisms (e.g., Lipomyces and some Candida) do not appear to have malic enzymes, however, so apparently other enzymes can provide comparable activity, although it is expected that a dedicated source of NADPH is probably required for fatty acid synthesis (see, for example, Wynn et al., Microbiol 145:1911, 1999; Ratledge Adv. Appi. Microbiol. 51:1, 2002, each of which is incorporated herein by reference in its entirety).

[0119] Other activities which can be involved in regenerating NADPH include, for example, 6-phosphogluconate dehydrogenase (gnd); Fructose 1,6 bisphosphatase (fbp); Glucose 6 phosphate dehydrogenase (g6pd); NADH kinase (EC 2.7.1.86); and/or transhydrogenase (EC 1.6.1.1 and 1.6.1.2).

[0120] Gnd is part of the pentose phosphate pathway and catalyses the reaction:

6-phospho-D-gluconate+NADP+.fwdarw.D-ribulose 5-phosphate+CO.sub.2+NADPH

[0121] Fbp is a hydrolase that catalyses the gluconeogenic reaction:

D-fructose 1,6-bisphosphate+H.sub.2O.fwdarw.D-fructose 6-phosphate+phosphate

Fbp redirects carbon flow from glycolysis towards the pentose phosphate pathway. The oxidative portion of the pentose phosphate pathway, which includes glucose 6 phosphate dehydrogenase and 6-phosphogluconate dehydrogenase, enables the regeneration of NADPH.

[0122] G6pd is part of the pentose phosphate pathway and catalyses the reaction:

D-glucose 6-phosphate+NADP.sup.+.fwdarw.D-glucono-1,5-lactone 6-phosphate+NADPH+H.sup.+

[0123] NADH kinase catalyzes the reaction:

ATP+NADH.fwdarw.ADP+NADPH

[0124] Transhydrogenase catalyzes the reaction:

NADPH+NAD.sup.+.revreaction.NADP.sup.++NADH

[0125] Thus, enhancing the expression and/or activity of any of these enzymes can increase NADPH levels and promote anabolic pathways requiring NADPH.

[0126] Alternative or additional strategies to promote oleaginy may include one or more of the following: (1) increased or heterologous expression of one or more of acyl-CoA:diacylglycerol acyltransferase (e.g., DGA1; YALI0E32769g); phospholipid:diacylglycerol acyltransferase (e.g., LRO1; YALI0E16797g); and acyl-CoA:cholesterol acyltransferase (e.g., ARE genes such as ARE1, ARE2, YALI0F06578g), which are involved in triglyceride synthesis (Kalscheuer et al. Appl Environ Microbiol p. 7119-7125, 2004; Oelkers et al. J Biol Chem 277:8877-8881, 2002; and Sorger et al. J Biol Chem 279:31190-31196, 2004), (2) decreased expression of triglyceride lipases (e.g., TGL3 and/or TGL4; YALI0D17534g and/or YALI0F10010g (Kurat et al. J Biol Chem 281:491-500, 2006); and (3) decreased expression of one or more acyl-coenzyme A oxidase activities, for example encoded by POX genes (e.g. POX1, POX2, POX3, POX4, POX5; YALI0C23859g, YALI0D24750g, YALI0E06567g, YALI0E27654g, YALI0E32835g, YALI0F10857g; see for example Mlickova et al. Appl Environ Microbiol 70: 3918-3924, 2004; Binns et al. J Cell Biol 173:719, 2006).

[0127] Thus, according to the present invention, the oleaginy of a host organism may be enhanced by modifying the expression or activity of one or more polypeptides involved in generating cytosolic acetyl-CoA and/or NADPH and/or altering lipid levels through other mechanisms. For example, modification of the expression or activity of one or more of acetyl-CoA carboxylase, pyruvate decarboxylase, isocitrate dehydrogenase, ATP-citrate lyase, malic enzyme, AMP-deaminase, glucose-6-phosphate dehydrogenase, 6-phosphogluconate dehydrogenase, fructose 1,6 bisphosphatase, NADH kinase, transhydrogenase, acyl-CoA: diacylglycerol acyltransferase, phospholipid: diacylglycerol acyltransferase, acyl-CoA:cholesterol acyltransferase, triglyceride lipase, acyl-coenzyme A oxidase can enhance oleaginy in accordance with the present invention. Exemplary polypeptides which can be utilized or derived so as to enhance oleaginy in accordance with the present invention include, but are not limited to those acetyl-CoA carboxylase, pyruvate decarboxylase, isocitrate dehydrogenase, ATP-citrate lyase, malic enzyme, AMP-deaminase, glucose-6-phosphate dehydrogenase, 6-phosphogluconate dehydrogenase, fructose 1,6 bisphosphatase, NADH kinase, transhydrogenase, acyl-CoA: diacylglycerol acyltransferase, phospholipid: diacylglycerol acyltransferase, acyl-CoA:cholesterol acyltransferase, triglyceride lipase, acyl-coenzyme A oxidase polypeptides provided in Tables 1-6 and Tables 69-85 respectively.

[0128] In some embodiments of the invention, where an oleaginous host cell is employed, enzymes and regulatory components relevant to oleaginy are already in place but could be modified, if desired, by for example altering expression or activity of one or more oleaginic polypeptides and/or by introducing one or more heterologous oleaginic polypeptides. In those embodiments of the invention where a non-oleaginous host cell is employed, it is generally expected that at least one or more heterologous oleaginic polypeptides will be introduced.

[0129] The present invention contemplates not only introduction of heterologous oleaginous polypeptides, but also adjustment of expression or activity levels of heterologous or endogenous oleaginic polypeptides, including, for example, alteration of constitutive or inducible expression patterns. In some embodiments of the invention, expression patterns are adjusted such that growth in nutrient-limiting conditions is not required to induce oleaginy. For example, genetic modifications comprising alteration and/or addition of regulatory sequences (e.g., promoter elements, terminator elements) and/or regulatory factors (e.g., polypeptides that modulate transcription, splicing, translation, modification, etc.) may be utilized to confer particular regulation of expression patterns. Such genetic modifications may be utilized in conjunction with endogenous genes (e.g., for regulation of endogenous oleagenic polypeptide(s)); alternatively, such genetic modifications may be included so as to confer regulation of expression of at least one heterologous polypeptide (e.g., oleagenic polypeptide(s)).

[0130] In some embodiments, at least one oleaginic polypeptide is introduced into a host cell. In some embodiments of the invention, a plurality (e.g., two or more) of different oleaginic polypeptides is introduced into the same host cell. In some embodiments, the plurality of oleaginic polypeptides contains polypeptides from the same source organism; in other embodiments, the plurality includes polypeptides independently selected from different source organisms.

[0131] Representative examples of a variety of oleaginic polypeptides that may be introduced into or modified within host cells according to the present invention, include, but are not limited to, those provided in Tables 1-6, and Tables 69-85. As noted above, it is expected that at least some of these polypeptides (e.g., malic enzyme and ATP-citrate lyase) should desirably act in concert, and possibly together with one or more components of fatty acid synthase, such that, in some embodiments of the invention, it will be desirable to utilize two or more oleaginic polypeptides from the same source organism.

[0132] In general, source organisms for oleaginic polypeptides to be used in accordance with the present invention include, but are not limited to, Blakeslea, Candida, Cryptococcus, Cunninghamella, Lipomyces, Mortierella, Mucor, Phycomyces, Pythium, Rhodosporidium, Rhodotorula, Trichosporon, Yarrowia, Aspergillus, Botrytis, Cercospora, Fusarium (Gibberella), Kluyveromyces, Neurospora, Penicillium, Pichia (Hansenula), Puccinia, Saccharomyces, Sclerotium, Trichoderma, and Xanthophyllomyces (Phaffia). In some embodiments, the source species for acetyl CoA carboxylase, ATP-citrate lyase, malic enzyme and/or AMP deaminase polypeptides include, but are not limited to, Aspergillus nidulans, Cryptococcus neoformans, Fusarium fujikuroi, Kluyveromyces lactis, Neurospora crassa, Saccharomyces cerevisiae, Schizosaccharomyces pombe, Ustilago maydis, and Yarrowia lipolytica; in some embodiments, source species for pyruvate decarboxylase or isocitrate dehydrogenase polypeptides include, but are not limited to Neurospora crassa, Xanthophyllomyces dendrorhous (Phaffia rhodozyma), Aspergillus niger, Saccharomyces cerevisiae, Mucor circinelloides, Rhodotorula glutinis, Candida utilis, Mortierella alpina, and Yarrowia lipolytica.

[0133] Aspergillus niger accumulates large amounts of citric acid, whereas Mortierella alpina and Thraustochytrium sp. accumulate large amounts of fatty acid, respectively; Mortierella alpina and Thraustochytrium are also oleaginous. To give but one particular example of a host cell engineered to be oleaginous (or at least to accumulate increased levels of lipid) in accordance with the present invention, S. cerevisiae can be engineered to express one or more oleaginic polypeptides, e.g., from heterologous source organisms.

[0134] In some embodiments, a plurality of different oleaginic polypeptides are expressed, optionally from different source organisms. For instance, in some embodiments, S. cerevisiae cells are engineered to express (and/or to increase expression of) ATP-citrate lyase (e.g., from N. crassa), AMP deaminase (e.g., from S. cerevisiae), and/or malic enzyme (e.g., from M. circinelloides). In other embodiments, Candida utilis and Phaffia rhodozyma can be similarly modified. Modified S. cerevisiae, C. utilis, and P. rhodozyma strains can be further modified as described herein to increase production of one or more quinone derived compounds.

Engineering Production of Quinone Derived Compounds

[0135] The present invention encompasses the recognition that lipid-accumulating systems are useful for the production and/or isolation of certain quinone derived compounds, and particularly of a ubiquinone (e.g., CoQ10 and/or one or more C.sub.5-9 quinone compounds such as CoQ5, CoQ6, CoQ7, CoQ8, CoQ9), one or more vitamin K compounds (e.g., phylloquinone and/or one or more menaquinones), and/or one or more vitamin E compounds (e.g., one or more tocopherols and/or tocotrienols). Without wishing to be bound by theory, the present inventors propose that the higher intracellular membrane content may facilitate increased quinone derived compound production and/or accumulation. The present invention therefore encompasses the discovery that certain quinone derived compounds can desirably be produced in oleaginous yeast and fungi.

[0136] According to the present invention, strains that both (i) accumulate lipid, often in the form of cytoplasmic lipid bodies and typically to at least about 20% of their dry cell weight; and (ii) produce one or more quinone derived compounds at a level at least about 1%, and in some embodiments at least about 3-20%, of their dry cell weight, are generated through manipulation of generally available strains (e.g., naturally-occurring strains and strains which have been previously genetically modified, whether via recombinant DNA techniques or mutagenesis directed modification). These manipulated strains are then used to produce one or more quinone derived compounds, so that compound(s) that partitions into the lipid bodies can readily be isolated.

[0137] In certain embodiments of the invention, host cells are Yarrowia lipolytica cells. Advantages of Y. lipolytica include, for example, tractable genetics and molecular biology, availability of genomic sequence, suitability to various cost-effective growth conditions, and ability to grow to high cell density. In addition, Y. lipolytica is naturally oleaginous, such that fewer manipulations may be required to generate an oleaginous, quinone derived compound-producing Y. lipolytica strain than might be required for other organisms. Furthermore, there is already extensive commercial experience with Y. lipolytica. In certain embodiments, host cells are Saccharomyces cerevisiae cells. In other embodiments, host cells are Candida utilis cells.

[0138] As mentioned, quinone derived compounds are produced from the isoprenoid compound isopentyl pyrophosphate (IPP), via geranylgeranyl pyrophosphate (see, for example, FIG. 8. IPP can be generated through one of two different isoprenoid biosynthesis pathways. The most common isoprenoid biosynthesis pathway, sometimes referred to as the "mevalonate pathway", is generally depicted in FIG. 4A. As shown, acetyl-CoA is converted, via hydroxymethylglutaryl-CoA (HMG-CoA), into mevalonate. Mevalonate is then phosphorylated and converted into the five-carbon compound isopentenyl pyrophosphate (IPP).

[0139] An alternative isoprenoid biosynthesis pathway, that is utilized by some organisms (particularly bacteria) and is sometimes called the "mevalonate-independent pathway", is also depicted in FIG. 4A. This pathway is initiated by the synthesis of 1-deoxy-D-xyloglucose-5-phosphate (DOXP) from pyruvate and glyceraldehyde-3-phosphate. DOXP is then converted, via a series of reactions shown in FIG. 4B, into IPP.

[0140] Various proteins involved in isoprenoid biosynthesis have been identified and characterized in a number of organisms. Moreover, isoprenoids are synthesized in many, if not most, organisms. Thus, various aspects of the isoprenoid biosynthesis pathway are conserved throughout the fungal, bacterial, plant and animal kingdoms. For example, polypeptides corresponding to the acetoacetyl-CoA thiolase, HMG-CoA synthase, HMG-CoA reductase, mevalonate kinase, phosphomevalonate kinase, mevalonate pyrophosphate decarboxylase, IPP isomerase, FPP synthase, and GGPP synthase shown in FIGS. 4-6 and 8 have been identified in and isolated from a wide variety of organisms and cells; representative examples of a wide variety of such polypeptides are provided in Tables 7-15. One or more of the polypeptides selected from those provided in any one of Tables 7-15 may be utilized or derived for use in the methods and compositions in accordance with the present invention.

[0141] Alternatively or additionally, modified mevalonate kinase polypeptides that exhibit decreased feedback inhibition properties (e.g., to farnesyl pyrophosphate (FPP)) may be utilized in accordance with the present invention. Such modified mevalonate kinase polypeptides may be of eukaryotic or prokaryotic origin. For example, modified versions of mevalonate kinase polypeptides from animals (including humans), plants, algae, fungi (including yeast), and/or bacteria may be employed; for instance, modified versions of mevalonate kinase polypeptides disclosed in Table 10 herein may be utilized.

[0142] Particular examples of modified mevalonate kinase polypeptides include "feedback-resistant mevalonate kinases" disclosed in PCT Application WO 06/063,752. Thus, for example, a modified mevalonate kinase polypeptide may include one or more mutation(s) at one or more amino acid position(s) selected from the group consisting of amino acid positions corresponding to positions 17, 47, 93, 94, 132, 167, 169, 204, and 266 of the amino acid sequence of Paracoccus zeaxanthinifaciens mevalonate kinase as shown in SEQ ID NO:1 of PCT Application WO 04/111,214. For example, the modified mevalonate kinase polypeptide may contain one or more substitutions at positions corresponding to one or more of I 17T, G47D, K93E, V94I, R204H and C266S.

[0143] To give but a few specific examples, when a modified mevalonate kinase polypeptide comprises 2 amino acid changes as compared with a parent mevalonate kinase polypeptide, it may comprise changes at positions corresponding to the following positions 132/375,167/169, 17/47 and/or 17/93 of SEQ ID NO:1 of WO 04/111,214 (e.g. P132A/P375R, R167W/K169Q, 117T/G47D or I17T/K93E); when a modified mevalonate kinase polypeptide comprises 3 amino acid changes as compared with a parent mevalonate kinase, it may comprise changes at positions corresponding to the following positions 17/167/169, 17/132/375, 93/132/375, and/or 17/47/93 of SEQ ID NO: 1 of WO/2004/111214 (e.g., I17T/R167W/K169Q, I17T/P132A/P375R, K93E/P132A/P375R, I17T/R167W/K169H, I17T/R167T/K169M, I17T/R167T/K169Y, I17T/R167F/K169Q, I17T/R167I/K169N, I17T/R167H/K169Y, I17T/G47D/K93E or I17T/G47D/K93Q).

[0144] Thus, for example, a modified mevalonate kinase polypeptide may include one or more mutation(s) (particularly substitutions), as compared with a parent mevalonate kinase polypeptide, at one or more amino acid position(s) selected from the group consisting of amino acid positions corresponding to positions 55, 59, 66, 83, 106, 111, 117, 142, 152, 158, 218, 231, 249, 367 and 375 of the amino acid sequence of Saccharomyces cerevisiae mevalonate kinase as shown in SEQ ID NO:1 of PCT application WO 06/063,752. For example, such corresponding substitutions may comprise one or more of P55L, F59S, N66K, C117S, or I152M. A modified mevalonate kinase may comprise a substitution corresponding to F59S substitution. A modified mevalonate kinase polypeptide comprising 2 amino acid changes as compared with its parent mevalonate kinase polypeptide may, for example, comprise changes at positions corresponding to the following positions 55/117, 66/152, 83/249, 111/375 or 106/218 of to SEQ ID NO: 1 of WO 06/063,752 (e.g. P55L/C117S, N66K/I152M, K83E/S249P, H111N/K375N or L106P/S218P). A modified mevalonate kinase may comprise a substitution corresponding to N66K/I152M. A modified mevalonate kinase polypeptide comprising 4 amino acid changes as compared with its parent mevalonate kinase polypeptide may have changes at positions corresponding to one or more of the following positions 42/158/231/367 of SEQ ID NO:1 of WO 06/063,752 (e.g., 1142N/L158S/L2311/T367S).

[0145] According to the present invention, quinone derived compound production in a host organism may be adjusted by modifying the expression or activity of one or more proteins involved in isoprenoid biosynthesis. In some embodiments, such modification involves introduction of one or more heterologous isoprenoid biosynthesis polypeptides into the host cell; alternatively or additionally, modifications may be made to the expression or activity of one or more endogenous or heterologous isoprenoid biosynthesis polypeptides. Given the considerable conservation of components of the isoprenoid biosynthesis polypeptides, it is expected that heterologous isoprenoid biosynthesis polypeptides will often function well even in significantly divergent organisms. Furthermore, should it be desirable to introduce more than one heterologous isoprenoid biosynthesis polypeptide (e.g., more than one version of the same polypeptide and/or more than one different polypeptides), in many cases polypeptides from different source organisms may function well together. In some embodiments of the invention, a plurality of different heterologous isoprenoid biosynthesis polypeptides is introduced into the same host cell. In some embodiments, this plurality contains only polypeptides from the same source organism; in other embodiments the plurality includes polypeptides from different source organisms.

[0146] In certain embodiments of the present invention that utilize heterologous isoprenoid biosynthesis polypeptides, the source organisms include, but are not limited to, fungi of the genera Blakeslea, Candida, Cryptococcus, Cunninghamella, Lipomyces, Mortierella, Mucor, Phycomyces, Pythium, Rhodosporidium, Rhodotorula, Trichosporon, Yarrowia, Aspergillus, Botrytis, Cercospora, Fusarium (Gibberella), Kluyveromyces, Neurospora, Penicillium, Pichia (Hansenula), Puccinia, Saccharomyces, Schizosaccharomyces, Sclerotium, Trichoderms Ustilago, and Xanthophyllomyces (Phaffia). In certain embodiments, the source organisms are of a species including, but not limited to, Cryptococcus neoformans, Fusarium fujikuroi, Kluyverimyces lactis, Neurospora crassa, Saccharomyces cerevisiae, Schizosaccharomyces pombe, Ustilago maydis, and Yarrowia lipolytica.

Ubiquinone-10/Coenzyme Q10

[0147] Ubiquinone-10 or coenzyme Q10 (CoQ10), is synthesized de novo in all mammalian tissues, and serves a critical function in mediating electron transport in the mitochondrial respiratory chain. The reduced form of CoQ10 is also thought to function as an antioxidant and to protect against lipid peroxidation as well as against other free radical-induced oxidative damage. Recent studies have confirmed the role of CoQ10 in certain disease states, and additional roles in certain disorders continue to be elucidated. For example, administration of cholesterol lowering drugs, including HMG-CoA reductase inhibitors (e.g., the statins) to patients can result in diminished CoQ10 levels since cholesterol biosynthesis and CoQ10 biosynthesis share many of the same biosynthetic steps. Additionally, several clinical studies have been conducted to examine whether supplemental CoQ10 administration provides protection against, for example, deterioration in cardiac function (e.g., congestive heart failure), slows deterioration in Parkinson's patients, or protects the heart from damage caused by certain chemotherapeutic drugs.

[0148] While cells are capable of endogenously producing CoQ10, only about half of the levels required for optimum human cell function are produced in the body; the other half is acquired through dietary sources. Also, cellular levels of CoQ10 decrease under stress, with increasing age, and after certain pharmacological treatments, further increasing the need for exogenous CoQ10 to maintain optimum cell function.

[0149] CoQ10 has been used commercially in dietary and nutritional supplements as well as in functional foods, cosmetics and in personal care items. A rapidly growing market exists for CoQ10. The 2005 market for CoQ10 as an ingredient is estimated to be approximate $500 million, which represented approximately 250 metric tonnes of CoQ10. Further market expansion for CoQ10 is expected. Improved systems for enabling cost effective production, isolation, and/or formulation of CoQ10 are needed to effectively meet the growing demand for CoQ10.

[0150] The commitment step in ubiquinone biosynthesis is the formation of para-hydroxybenzoate (PHB) from tyrosine or phenylalanine in mammals or chorismate in bacteria, followed by condensation of PHB and isoprene precursor, resulting in addition of the prenyl group (see FIG. 9). Lower eukaryotes, such as yeast, can synthesize PHB from either tyrosine or chorismate. The 3-decaprenyl-4-hydroxybenzoic acid resulting from the condensation reaction undergoes further modifications, which include hydroxylation, methylation and decarboxylation, in order to form ubiquinone. Ubiquinone biosynthetic enzymes and genes encoding these proteins have been characterized in several organisms. The most extensive analysis has been performed in bacterial systems such as Escherichia coli and Rhodobacter sphaeroides as well as the yeast, Saccharomyces cerevisiae. At least 8 enzymes are required for the synthesis of CoQ10 from PHB and the isoprene precursors.

[0151] Biomanufacturing processes for CoQ10 have been developed or attempted using several genera of bacteria and yeast. Bacterial hosts have included R. sphaeroides, Agrobacterium tumefaciens, Paracoccus denitrificans, Rhodopseudomonas spheroides, and recombinant strains of E. coli, which naturally produces CoQ8. Yeast hosts have included Saitoella complicata, Sporidiobolus ruineniae, Schizosaccharomyces pombe, and several species of Rhodotorula and Candida. In addition, a semi-synthetic process for producing CoQ10 has been developed using solanesol extracted from tobacco leaves as a starting material; this process has economic and environmental disadvantages.

[0152] Despite the early identification of several microorganisms as potential hosts for CoQ10 production, only limited progress has been made towards the development of a cost-effective CoQ10 process using fermentation. Without wishing to be bound by any particular theory, the present inventors note that the hydrophobic isoprenoid side chain on CoQ10 causes it to localize to cell membranes, and offer that enhanced CoQ 10 concentrations in the membranes might be detrimental to cell growth or viability. Efforts to engineer strains with increased production of CoQ10 have focused primarily in prokaryotes such as E. coli. While E. coli is a host with facile molecular genetic tools, this organism's fundamental physiology, may limit its ultimate utility as a CoQ10 production system.

[0153] The present invention encompasses the recognition that lipid-accumulating systems are useful for the production and/or isolation of ubiquinone(s). Indeed, several of the more productive natural CoQ10 strains (e.g., P. denitrificans, R. sphaeroides) display a relatively high intracellular membrane content. Without wishing to be bound by theory, the present inventors propose that the higher intracellular membrane content may facilitate increased CoQ10 production and/or accumulation. The present invention therefore encompasses the discovery that CoQ10 can desirably be produced in oleaginous yeast and fungi. According to the present invention, strains that both (i) accumulate lipid, often in the form of cytoplasmic lipid bodies and typically to at least about 20% of their dry cell weight; and (ii) produce CoQ10 at a level at least about 1%, and in some embodiments at least about 3-20%, of their dry cell weight, are generated through manipulation of generally available strains (e.g., naturally-occurring strains and strains which have been previously genetically modified, whether via recombinant DNA techniques or mutagenesis directed modification). These manipulated strains are then used to produce CoQ10, so that the CoQ10 that partitions into the lipid bodies can readily be isolated.

[0154] In certain embodiments of the invention, host cells are Yarrowia lipolytica cells. Advantages of Y. lipolytica include, for example, tractable genetics and molecular biology, availability of genomic sequence, suitability to various cost-effective growth conditions, and ability to grow to high cell density. In addition, Y. lipolytica is naturally oleaginous, such that fewer manipulations may be required to generate an oleaginous, CoQ10-producing Y. lipolytica strain than might be required for other organisms. Furthermore, there is already extensive commercial experience with Y. lipolytica.

[0155] As mentioned, ubiquinone is formed by the combination of para-hydroxybenzoate and isoprenoid chains produced, for example, via the isoprenoid biosynthesis pathways discussed above. Once IPP is formed according to that pathway, it isomerizes into dimethylallyl pyrophophate (DMAPP). Three sequential condensation reactions with additional molecules of IPP generate the ten-carbon molecule geranyl pyrophosphate (GPP), followed by the fifteen-carbon molecule farnesyl pyrophosphate (FPP), which can be used to form the twenty-carbon compound geranylgeranyl pyrophosphate (GGPP). In many instances, FPP appears to be the predominant substrate used by polyprenyldiphosphate synthases (e.g., Coq1 polypeptides) during ubiquinone biosynthesis. According to the present invention, ubiquinone production in a host organism may be adjusted by modifying the expression or activity of one or more proteins involved in isoprenoid biosynthesis as discussed above.

[0156] As discussed herein, two different pathways can produce the ubiquinoid precursor para-hydroxybenzoate--the first, the "shikimate pathway" is utilized in prokaryotes and yeast, involves synthesis of para-hydroxybenzoate (PHB) through chorismate. Biosynthesis of para-hydroxybenzoate from chorismate occurs by the action of chorismate pyruvate lyase. For example, as discussed herein, enzymes of the shikimate pathway, chorismate synthase, DAHP synthase, and transketolase are involved in this process. Representative examples of these enzymes are provided in Table 33 and Tables 35 through 37. In the second possible pathway, para-hydroxybenzoate is produced by derivation of tyrosine or phenylalanine. Biosynthesis of para-hydroxybenzate from tyrosine or phenylalanine occurs through a five step process in mammalian cells (see, for example FIG. 3).

[0157] Accordingly, ubiquinone (e.g., CoQ10 and/or C.sub.5-9 quinone compounds) production in a host organism may be adjusted by modifying the expression or activity of one or more proteins involved in PHB biosynthesis. In some embodiments, such modification involves introduction of one or more heterologous PHB polypeptides into the host cell; alternatively or additionally, modifications may be made to the expression or activity of one or more endogenous or heterologous additional PHB polypeptide, and/or isoprenoid biosynthesis polypeptides. Given the considerable conservation of components of the PHB biosynthesis polypeptides, it is expected that heterologous PHB biosynthesis polypeptides will often function well even in significantly divergent organisms. Further, given the conservation of the pathways among organism, it is anticipated that heterologous polypeptides throughout the ubiquinone biosynthetic pathway will function together effectively. Furthermore, should it be desirable to introduce more than one heterologous PHB polypeptide and/or isoprenoid biosynthesis polypeptide, in many cases polypeptides from different source organisms will function well together. In some embodiments of the invention, a plurality of different heterologous PHB and/or isoprenoid biosynthesis polypeptides is introduced into the same host cell. In some embodiments, this plurality contains only polypeptides from the same source organism; in other embodiments the plurality includes polypeptides from different source organisms. In still other embodiments, modification of endogenous PHB and/or isoprenoid biosynthesis polypeptides are also utilized, either alone or in combination with heterologous polypeptides as discussed herein.

[0158] As noted herein, the isoprenoid biosynthesis pathway is also involved in the production of non-ubiquinone compounds, such as carotenoids, sterols, steroids, and vitamins, such as vitamin K or vitamin E. Proteins that act on isoprenoid biosynthesis pathway intermediates, and divert them into biosynthesis of non-ubiquinone compounds are therefore indirect inhibitors of ubiquinone biosynthesis (see, for example, FIG. 5, which illustrates points at which isoprenoid intermediates are channeled into other biosynthesis pathways). Such proteins are therefore considered ubiquinone biosynthesis competitor polypeptides. Reductions of the level or activity of such ubiquinone biosynthesis competitor polypeptides are expected to increase ubiquinone (e.g., CoQ10 and/or C.sub.5-9 quinone compounds) production in host cells according to the present invention.

[0159] In some embodiments of the present invention, production or activity of endogenous ubiquinone biosynthesis competitor polypeptides may be reduced or eliminated in host cells. In some embodiments, this reduction or elimination of the activity of a ubiquinone biosynthesis competitor polypeptide can be achieved by treatment of the host organism with small molecule inhibitors of enzymes of the ergosterol biosynthetic pathway. Enzymes of the ergosterol biosynthetic pathway include, for example, squalene synthase (Erg9), squalene epoxidase (Erg1), 2,3-oxidosqualene-lanosterol cyclase (Erg7), cytochrome P450 lanosterol 14.alpha.-demethylase (Erg11), C-14 sterol reductase (Erg24), C-4 sterol methyl oxidase (Erg25), SAM:C-24 sterol methyltransferase (Erg6), C-8 sterol isomerase (Erg2), C-5 sterol desaturase (Erg3), C-22 sterol desaturase (Erg5), and C-24 sterol reductase (Erg4) polypeptides. Each of these enzymes is considered a ubiquinone biosynthesis competitor polypeptide. Regulators of these enzymes may also be considered ubiquinone biosynthesis competitor polypeptides (e.g., the yeast proteins Sut1 (Genbank Accession JC4374 GI:2133159) and Mot3 (Genbank Accession NP.sub.--013786 GI:6323715), which may or may not have homologs in other organisms.

[0160] In some embodiments of the invention, production or activity of endogenous ubiquinone biosynthesis competitor polypeptides may be reduced or eliminated in host cells. In some embodiments, this reduction or elimination of the activity of a ubiquinone biosynthesis competitor polypeptide can be achieved by treatment of the host organism with small molecule inhibitors of enzymes of the ergosterol biosynthetic pathway. Enzymes of the ergosterol biosynthetic pathway include, for example, squalene synthase, squalene epoxidase, 2,3-oxidosqualene-lanosterol cyclase, cytochrome P450 lanosterol 14.alpha.-demethylase, C-14 sterol reductase, C-4 sterol methyl oxidase, SAM:C-24 sterol methyltransferase, C-8 sterol isomerase, C-5 sterol desaturase, C-22 sterol desaturase, and C-24 sterol reductase. Each of these enzymes is considered a ubiquinone biosynthesis competitor polypeptide. Regulators of these enzymes may also be considered ubiquinone biosynthesis competitor polypeptides (e.g., the yeast proteins Sut1 (Genbank Accession JC4374 GI:2133159) and Mot3 (Genbank Accession NP.sub.--013786 GI:6323715), which may or may not have homologs in other organisms).

[0161] Known small molecule inhibitors of some ubiquinone biosynthesis competitor enzymes include, but are not limited to, zaragosic acid (including analogs thereof such as TAN1607A (Biochem Biophys Res Commun 1996 Feb. 15; 219(2):515-520)), RPR 107393 (3-hydroxy-3-[4-(quinolin-6-yl)phenyl]-1-azabicyclo[2-2-2]octane dihydrochloride; J Pharmacol Exp Ther. 1997 May; 281(2):746-52), ER-28448 (5-{N-[2-butenyl-3-(2-methoxyphenyl)]-N-methylamino}-1,1-penthylidenebis(- phosphonic acid) trisodium salt; Journal of Lipid Research, Vol. 41, 1136-1144, July 2000), BMS-188494 (The Journal of Clinical Pharmacology, 1998; 38:1116-1121), TAK-475 (1-[2-[(3R,5S)-1-(3-acetoxy-2,2-dimethylpropyl)-7-chloro-1,2,3,5-tetrahyd- ro-2-oxo-5-(2,3-dimethoxyphenyl)-4,1-benzoxazepine-3-yl]acetyl]piperidin-4- -acetic acid; Eur J. Pharmacol. 2003 Apr. 11; 466(1-2):155-61), YM-53601 ((E)-2-[2-fluoro-2-(quinuclidin-3-ylidene) ethoxy]-9H-carbazole monohydrochloride; Br J. Pharmacol. 2000 September; 131(1):63-70), or squalestatin I that inhibit squalene synthase; terbinafine (e.g., LAMISIL.RTM.), naftifine (NAFTIN.RTM.), S-allylcysteine, garlic, resveratrol, NB-598 (e.g., from Banyu Pharmaceutical Co), and/or green tea phenols that inhibit squalene epoxidase (see, for example, J. Biol Chem 265:18075, 1990; Biochem. Biophys. Res. Commun. 268:767, 2000); various azoles that inhibit cytochrome P450 lanosterol 14.alpha.-demethylase; and fenpropimorph that inhibits the C-14 sterol reductase and the C-8 sterol isomerase. In other embodiments, heterologous ubiquinone biosynthesis competitor polypeptides may be utilized (whether functional or non-functional; in some embodiments, dominant negative mutants are employed).

[0162] One ubiquinone biosynthesis competitor polypeptide useful according to the present invention is squalene synthase which has been identified and characterized from a variety of organisms; representative examples of squalene synthase polypeptide sequences are included in Table 16. In some embodiments of the invention that utilize squalene synthase (or modifications of squalene synthase) source organisms include, but are not limited to, Neurospora crassa, Xanthophyllomyces dendrorhous (Phaffia rhodozyma), Aspergillus niger, Saccharomyces cerevisiae, Mucor circinelloides, Rhotorula glutinis, Candida utilis, Mortierella alpina, and Yarrowia

[0163] Another ubiquinone biosynthesis competitor polypeptide useful according to the present invention is anthranilate synthase, which has also been identified in a variety of organisms; representative anthranilate synthase polypeptides are provided in Table 32 and 32B. In some embodiments of the invention, anthranilate synthase polypeptide, or modifications thereof are utilized and adapted from source organisms including, but not limited to: Kluyveromyces lactis, Candida glabrata, Saccharomyces cerevisiae, Yarrowia lipolytica, Debaryomyces hansenii, Candida albicans, Aspergillus fumigatus, Aspergillus oryzae, Aspergillus nidulans, Ustilago maydis, Neurospora crassa, Schizosaccharomyces pombe, Gibberella zeae, and Cryptococcus neoformans var.

[0164] Similarly, yet another ubiquinone biosynthesis competitor polypeptide useful according to the invention is chorismate mutase, which has also been identified in a variety of organisms; representative chorismate mutase polypeptides are provided in Table 34. In some embodiments of the invention, chorismate mutase polypeptide, or modifications thereof are utilized and adapted from source organisms including, but not limited to: Kluyveromyces lactis, Candida glabrata, Arabidopsis thaliana, Yarrowia lipolytica, Candida albicans, Aspergillus fumigatus, Aspergillus oryzae, Aspergillus nidulans, Ustilago maydis, Neurospora crassa, Schizosaccharomyces pombe, Gibberella zeae, and Pichia pastoris.

[0165] Particularly for embodiments of the present invention directed toward production of CoQ10, it will often be desirable to utilize one or more genes from a natural CoQ 10-producing organism. In general, where multiple heterologous polypeptides are to be expressed, it may be desirable to utilize the same source organism for all, or to utilize closely related source organisms.

[0166] Bacterial ubiquinogenic genes have already been demonstrated to be transferable to other organisms, and are therefore useful in accordance with the present invention (see, for example, Okada et al., FEMS Lett. 431:241-244 (1998)). In some embodiments of this invention, it may be desirable to fused sequences encoding specific targeting signals to bacterial ubiquinogenic genes. For example, in certain embodiments mitochondrial signal sequences are useful in conjunction with, e.g., bacterial ubiquinogenic polypeptides for effective targeting of polypeptides for proper functioning. Mitochondrial signal sequences are known in the art, and include, but are not limited to example, mitochondrial signal sequences provided in Table 22. In other embodiments, it may be desirable to utilize genes from other source organisms such as animals, plants, alga, or microalgae, fungi, yeast, insect, protozoa, and mammals.

[0167] The present invention contemplates not only introduction of heterologous ubiquinogenic polypeptides, but also adjustment of expression or activity levels of heterologous or endogenous ubiquinogenic polypeptides, including, for example, alteration of constitutive or inducible expression patterns so as to increase activity of ubiquinogenic polypeptides. For example, genetic modifications comprising alteration and/or addition of regulatory sequences (e.g., promoter elements, terminator elements) may be utilized to confer particular regulation of expression patterns. Such genetic modifications may be utilized in conjunction with endogenous genes (e.g., for regulation of endogenous ubiquinogenic polypeptide(s)); alternatively, such genetic modifications may be included so as to confer regulation of expression of heterologous polypeptides (e.g., ubiquinogenic polypeptide(s)).

[0168] To give but a few specific examples of strains engineered to produce CoQ10 (and optionally to be oleaginic) according to the present invention, in some embodiments, Yarrowia lipolytica cells are engineered to express a decaprenyl diphosphate synthase polypeptide(s) from a source organism. In some embodiments, the source organism is selected from the group consisting of Silibacter pomeroyi and Loktanella vestfoldensis. As is discussed herein, where the source organism is other than Y. lipolytica, nucleic acid sequences encoding the polypeptide(s) can be established with Y. lipolytica codon preferences and/or targeting sequences (e.g., mitochondrial targeting sequences). In some embodiments, one or more endogenous polyprenyl synthase genes (e.g., nonaprenyl diphosphate synthase) is/are disrupted or inactivated, and a decaprenyl diphosphate synthase gene from a different source organism is introduced. In some embodiments, the different source organism is selected from the group consisting of Silibacter pomeroyi and Loktanella vestfoldensis.

[0169] Alternatively or additionally, in some embodiments of the invention, host cells are engineered to produce CoQ10 by introducing or increasing expression or activity of one or more ubiquinone biosynthesis polypeptides, and in particular of one or more polypeptides involved in converting para-hydroxybenzoic acid to CoQ10 (e.g., via a ubiquinol pathway). To give but one specific example, Yarrowia lipolytica cells are engineered to express a 4-hydroxybenzoate polyprenyltransferase polypeptide from source organism, for example selected from the group consisting of Silibacter pomeroyi and Bos taurus. Sequences encoding the polypeptide(s) can optionally be established with Y. lipolytica codon preferences and/or targeting sequences (e.g., mitochondrial targeting sequences).

[0170] In some embodiments, CoQ10 production in cells, e.g., in Yarrowia lipolytica cells is enhanced by engineering the cells to increase production of para-hydroxybenzoic acid (PHB), for example by increasing expression or activity of one or more PHB polypeptides. In some embodiments, Y. lipolytica cells are engineered to express one or more of 3-deoxy-D-arabino-heptulosonate-7-phosphate (DAHP) and chorismate lyase polypeptides from a source organism. Where the source organism is other than Y. lipolytica, nucleic acid sequences encoding the polypeptide(s) can be established with Y. lipolytica codon preferences and/or targeting sequences (e.g., mitochondrial targeting sequences). In some embodiments, one or both of the DAHP and chorismate lyase polypeptides are from heterologous source organisms, which may be the same or different. In some embodiments, at least one source organism is Erwinia carotovora.

[0171] In some embodiments, host cells (e.g., Yarrowia lipolytica) engineered to increase production of para-hydroxybenzoic acid are further engineered to increase expression and/or activity of one or more ubiquinone biosynthesis polypeptides involved in converting para-hydroxybenzoic acid to a ubiquinone (e.g., via a ubiquinol pathway), and/or of one or more decaprenyl diphosphate synthase. To give but a couple of particular examples, such strains may be engineered to express a 4-hydroxybenzoate polyprenyltransferase polypeptide from source organism, for example selected from the group consisting of Silibacter pomeroyi and Bos taurus and/or to express a decaprenyl diphosphate synthase polypeptide(s) from a source organism, for example selected from the group consisting of Silibacter pomeroyi and Loktanella vestfoldensis.

[0172] Alternatively or additionally, in some specific embodiments, CoQ10 production is enhanced by engineering cells to reduce carbon flow into competing metabolic pathways such as, for example, the sterol pathway. To give but one specific example, in some embodiments of the invention, Yarrowia lipolytica cells are engineered to inactivate (e.g., partially) an endogenous gene (e.g., ERG9) encoding a squalene synthase polypeptide.

[0173] Yet further, in some particular embodiments, CoQ10 production is enhanced by engineering cells by expression of a truncated HMG CoA reductase polypeptide (e.g., from a source organism such as Y. lipolytica).

[0174] In some embodiments of the present invention, host cells are engineered to produce one or more ubiquinones other than CoQ10 in addition or as an alternative to CoQ10. Specifically, in some embodiments of the invention, host cells are engineered to produce one or more C.sub.5-9 quinones.

[0175] That is, the present invention provides engineered host cells, e.g., fungal cells, that produce C.sub.5-9 quinones as a result of the engineering. The present invention therefore provides engineered host cells containing a C.sub.5-9 quinones production modification. Such a modification may comprise, for instance, introduction or activation of one or more C.sub.5-9 quinones biosynthetic polypeptides within a host cell. Exemplary such polypeptides include, for example, pentaprenyl, hexaprenyl, heptaprenyl, octaprenyl, and/or solanesyl(nonaprenyl) biosynthesis polypeptides. Specific examples of each of these can be found, for example, in Tables 61-65.

[0176] To give but a few specific examples, in some embodiments of the present invention, Yarrowia lipolytica cells are engineered to express one or more polyprenyl diphosphate synthase genes (e.g., nonaprenyl diphosphate synthase, octaprenyl diphosphate synthase, heptaprenyl diphosphate synthase, hexaprenyl diphosphate synthase, pentaprenyl disphosphate synthase, etc.). In some embodiments, one or more endogenous polyprenyl synthase genes (e.g., nonaprenyl diphosphate synthase) is/are disrupted or inactivated, and a polyprenyl synthase gene from a different source organism is introduced. In some embodiments, the different source organism is selected from the group consisting of Silibacter pomeroyi and Loktanella vestfoldensis.

Vitamin K

[0177] Vitamin K is a generic term that refers to derivatives of 2-methyl-1,4-naphthoquinone that have coagulation activity. The two natural forms of vitamin K differ in the identity of their side chains at position 3. Vitamin K.sub.1, also known as phylloquinone (based on its presence in plants), has a phytyl side chain in position 3; vitamin K.sub.2, also known as menaquinone, has an isoprenyl side chain at position 3. Different forms of menaquinone, having side chains with different numbers of isoprene units (typically 4-13) are found in different types of cells.

[0178] Vitamin K has been found to be an important vitamin involved in the blood coagulation system, and is utilized as a hemostatic agent. Recently it has been suggested that vitamin K is involved in osteo-metabolism, and vitamin K is expected to be applied to the treatment of osteoporosis. Both phylloquinone and menaquinone have been approved as pharmaceuticals.

[0179] The daily requirement for vitamin K is about 1 ug/kg; an average diet typically contains about 75-150 ug/day. Vitamin K deficiency results in hypoprothrombinemia and defective coagulation. Primary vitamin K deficiency is uncommon in adults because of its general availability in the food chain, and potentially also because it can be produced by microbes inhabiting the human gastrointestinal tract. Vitamin K deficiency is observed in newborns, as breast milk contains very little and the newborn gut is sterile (i.e., not inhabited by vitamin K-producing organisms) for the first several days of life.

[0180] Vitamin K is considered an essential nutrient since mammals do not produce it and it is a dietary requirement. As depicted for example in FIG. 9, vitamin K is produced through precursor molecules chorismate and isoprenoids, similar to ubiquinone production, though biosynthesis proceeds through production of isochorismate, and prenyl addition does not occur until the final or next to last step in synthesis. Details of production of vitamin K, and the relevant enzymes involved are known in the art and can be adapted for use in combination with the provided methods and compositions. See, e.g., Meganathan, Vitamins and Hormones 61: 173-219, 2001.

[0181] The present invention provides engineered host cells, e.g., fungal cells, that produce vitamin K as a result of the engineering. That is, the present invention provides engineered host cells containing a Vitamin K production modification. Such a modification may comprise, for instance, introduction or activation of one or more Vitamin K biosynthetic polypeptides within a host cell. Exemplary such polypeptides include, for example, MenF, MenD, MenC, MenE, MenB, MenA, UbiE, and/or MenG polypeptides. Specific examples of each of these can be found, for example, in Tables 46-53.

Vitamin E

[0182] Vitamin E is a generic term for a family of structurally related compounds that have a 6-chromanol ring, an isoprenoid side chain, and the biologic activity of .alpha.-tocopherol. The term encompasses the eight known naturally occurring vitamin E compounds, the four tocopherols (.alpha., .beta., .gamma., .delta.) and four tocotrienols (.alpha., .beta., .gamma., .delta.), which all contain a hydrophilic chromanol ring and a hydrophobic side chain. The .alpha., .beta., .gamma., and .delta. forms differ from one another in the number of methyl groups on the chromanol ring. Several synthetic vitamin E compounds have also been prepared, and still others are possible (see, for example, Bramley et al., J. Sci Food Agric 80:913, 2000). .alpha.-tocopherol is a potent antioxidant, and is generally considered to be the most active vitamin E compound in humans.

[0183] Vitamin E is commonly included in nutritional supplements and also in skin creams and lotions (based on a reported role in wound healing). Vitamin E deficiency is manifested differently in different species and in different individual cases, but can include reproductive disorders; abnormalities of muscle, liver, bone marrow, and/or brain function; red blood cell hemolysis; defective embryogenesis; exudative diathesis, a disorder of capillary permeability; and/or skeletal muscular dystrophy (with or without cardiomyopathy). Vitamin E is used to treat vitamin E deficiency, and is also suspected to be useful in the treatment of cancer (thanks to its antioxidant properties), cataracts and macular degeneration (particularly age-related), heart disease, neurological disorders (e.g., Alzheimer's Disease, Parkinson's Disease, etc.), immune disorders, inflammatory diseases, among other things.

[0184] The U.S. Dietary Reference Intake (DRI) Recommended Daily Amount (RDA) for a 25-year old male for Vitamin E is 15 mg/day. This is approximately 15 IU/day. Specifically, The natural form of alpha-tocopherol: RRR-alpha-tocopherol maintain 1.5 IU/mg.

[0185] Vitamin E compounds are synthesized by higher plants and cyanobacteria by two pathways: the isoprenoid pathway and the homogentisic acid formation pathway. The overall synthesis is depicted in FIG. 10, Panels A-C. As can be seen, the first step is formation of the homogentisic head group (HGA), which is produced from p-hydroxyphenylpyruvic acid (HPP) by the enzyme p-hydroxyphenylpyruvic acid dioxygenase (HPPDase). This is a complex reaction involving the addition of two oxygen atoms as well as the decarboxylation and rearrangement of the HPP side chain.

[0186] In the next step, HGS is prenylated and decarboxylated to form 2-methyl-6-phytylplastoquinol. This step also represents the commitment step for production of tocopherols and/or tocotrienols, as 2-methyl-6-phytylplastoquinol represents the common intermediate in the synthesis of all tocopherols. The present invention provides host cells that have been engineered to accumulate 2-methyl-6-phytylplastoquinol. In some embodiments, the host cells are (or have been engineered to be) oleaginous or lipid-accumulating. In some embodiments, produced 2-methyl-6-phytylplastoquinol accumulates in lipid bodies within the engineered host cells.

[0187] In the final steps of tocopherol synthesis, methylation and ring cyclization reactions convert the 2-methyl-6-phytylplastoquinol into various tocopherols.

[0188] It is expected in accordance with the present invention that availability of tocopherol precursors and/or intermediates may well affect the rate and/or extent of tocopherol (or other vitamin E compound) production and/or accumulation by and/or within cells. The present invention therefore encompasses engineering host cells to adjust the rate or amount of one or more tocopherol precursors and/or intermediates.

[0189] The present invention provides engineered host cells, e.g., fungal cells, that produce vitamin E as a result of the engineering. The present invention specifically provides engineered host cells, e.g., fungal cells, that produce larger amounts of vitamin E that an otherwise identical non-engineered host cell. That is, the present invention provides engineered host cells containing a Vitamin E production modification. Such a modification may comprise, for instance, introduction or activation of one or more Vitamin E biosynthetic polypeptides within a host cell. Exemplary such polypeptides include, for example, tyrA, pds1(hppd), VTE1, HPT1(VTE2), VTE3, VTE4, and/or GGH polypeptides. Specific examples of each of these can be found, for example, in Tables 54-60.

Production and Isolation of Quinone Derived Compounds

[0190] Accumulation of lipid bodies in oleaginous organisms is generally induced by growing the relevant organism under conditions of carbon excess and nitrogen or other nutrient (e.g., phosphate and/or magnesium) limitation. Specific conditions for inducing such accumulation have previously been established for a number of different oleaginous organisms (see, for example, Wolf (ed.) Nonconventional yeasts in biotechnology Vol. 1, Springer-Verlag, Berlin, Germany, pp. 313-338; Lipids 18(9):623, 1983; Indian J. Exp. Biol. 35(3):313, 1997; J. Ind. Microbiol. Biotechnol. 30(1):75, 2003; Bioresour Technol. 95(3):287, 2004, each of which is incorporated herein by reference in its entirety).

[0191] In general, it will be desirable to cultivate inventive modified host cells under conditions that allow accumulation of at least about 20% of their dry cell weight as lipid. In other embodiments, the inventive modified host cells are grown under conditions that permit accumulation of at least about 15%, 16%, 17%, 18%, 19%, 20%, 21%, 22%, 23%, 24%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, or even 80% of their dry cell weight as lipid. In certain embodiments, the host cells utilized are cells which are naturally oleaginous and induced to produce lipid to the desired levels. In other embodiments, the host cells are cells which naturally produce lipid, but have been engineered to increase production of lipid such that desired levels of lipid production and accumulation are achieved.

[0192] In certain embodiments, the host cells of the invention are not naturally oleaginous, but have been engineered to produce lipid such that desired levels of lipid production are obtained. Those of ordinary skill in the art will appreciate that, in general, growth conditions that are effective for inducing lipid accumulation in a source organism may also be useful for inducing lipid accumulation in a host cell into which a source organism's oleaginic polypeptides have been introduced. Of course, modifications may be required in light of characteristics of the host cell, which modifications are within the skill of those of ordinary skill in the art.

[0193] It will also be appreciated by those of ordinary skill in the art that it will often be desirable to ensure that production of a desired quinone derived compound by an inventive modified host cell occurs at an appropriate time in relation to the induction of oleaginy such that the compound(s) accumulate(s) in the lipid bodies. In some embodiments, it will be desirable to induce production of a particular quinone derived compound (e.g., a ubiquinone, .alpha.-tocopherol, phylloquinone and/or menaquinone) in a host cell which does not naturally the particular compound, such that detectable levels of the compound are produced. In certain embodiments, host cells that do not naturally produce a particular quinone derived compound are capable of producing one or more other quinone derived compounds (e.g., cells that do not produce CoQ10 may produce one or more of CoQ5, CoQ6, CoQ7, CoQ8, CoQ9, etc). In additional embodiments, it will be desirable to increase production levels of a particular quinone derived compound in a host cell which does naturally produce low levels of that compound, such that increased detectable levels of the compound are produced. In certain aspects, the host cells which do naturally produce a particular compound (e.g., CoQ10) also produce additional quinone derived compound(s) (e.g., CoQ6, CoQ8); in other aspects, the cells which naturally produce the particular compound do not produce additional quinone derived compound(s).

[0194] In certain embodiments of the invention, it will be desirable to accumulate one or more quinone derived compounds (I.e., considering the total amount of all produced quinone derived compounds together or considering a particular quinone derived compound) to levels that are greater than at least about 1% of the dry weight of the cells. In some embodiments, the total quinone derived compound accumulation will be to a level at least about 1.5%, at least about 2%, at least about 2.5%, at least about 3%, at least about 3.5%, at least about 4%, at least about 4.5%, at least about 5%, at least about 5.5%, at least about 6%, at least about 6.5%, at least about 7%, at least about 7.5%, at least about 8%, at least about 8.5%, at least about 9%, at least about 9.5%, at least about 10%, at least about 10.5%, at least about 11%, at least about 11.5%, at least about 12%, at least about 12.5%, at least about 13%, at least about 13.5%, at least about 14%, at least about 14.5%, at least about 15%, at least about 15.5%, at least about 16%, at least about 16.5%, at least about 17%, at least about 17.5%, at least about 18%, at least about 18.5%, at least about 19%, at least about 19.5%, at least about 20% or more of the total dry weight of the cells.

[0195] In some embodiments of the invention, a particular quinone derived compound may not accumulate to a level as high as 1% of the total dry weight of the cells; appropriately engineered cells according to the present invention, and any lipid bodies and/or quinone derived compound(s) they produce, remain within the scope of the present invention. Thus, in some embodiments, the cells accumulate a given quinone derived compound to a level below about 1% of the dry weight of the cells. In some embodiments, the quinone derived compound accumulates to a level below about 0.9%, 0.8%, 0.7%, 0.6%, 0.5%, 0.4%, 0.3%, 0.2%, 0.1%, 0.05%, or lower, of the dry cell weight of the cells.

[0196] In some embodiments of the invention, one or more quinone derived compound(s) accumulate both within lipid bodies and elsewhere in the cells. In some embodiments, quinone derived compound (s) accumulate primarily within lipid bodies. In some embodiments, quinone derived compound (s) accumulate substantially exclusively within lipid bodies. In some embodiments, at least about 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95% or more of a desired produced quinone derived compound (s) accumulates in lipid bodies.

[0197] In some embodiments of the invention, modified host cells are engineered to produce one or more quinone derived compound(s) characterized by negligible solubility in water (whether hot or cold) and detectable solubility in one or more oils. In some embodiments, such compounds have a solubility in oil below about 0.2%. In some embodiments, such compounds have a solubility in oil within the range of about <0.001%-0.2%.

[0198] The present invention therefore provides engineered host cells (and methods of making and using them) that contain lipid bodies and that further contain one or more quinone derived compounds accumulated in the lipid bodies, where the compounds are characterized by a negligible solubility in water and a solubility in oil within the range of about <0.001%-0.2%; 0.004%-0.15%; 0.005-0.1%; or 0.005-0.5%. For example, in some embodiments, such compounds have a solubility in oil below about 0.15%, 0.14%, 0.13%, 0.12%, 0.11%, 0.10%. 0.09, 0.08%, 0.07%, 0.06%, 0.05%, 0.04%, 0.03%, 0.02%, 0.01%, 0.05%, or less. In some embodiments, the compounds show such solubility in an oil selected from the group consisting of sesame; soybean; apricot kernel; palm; peanut; safflower; coconut; olive; cocoa butter; palm kernel; shea butter; sunflower; almond; avocado; borage; carnauba; hazel nut; castor; cotton seed; evening primrose; orange roughy; rapeseed; rice bran; walnut; wheat germ; peach kernel; babassu; mango seed; black current seed; jojoba; macadamia nut; sea buckthorn; sasquana; tsubaki; mallow; meadowfoam seed; coffee; emu; mink; grape seed; thistle; tea tree; pumpkin seed; kukui nut; and mixtures thereof.

[0199] Also, it should be noted that, the absolute and/or relative amounts of quinone derived compounds produced in accordance with the present invention can sometimes be altered by adjustment of growth conditions, for example to modulate the isoprenoid biosynthesis pathway and/or one or more downstream pathways. For example, controlling the concentration of dissolved oxygen in a culture during cultivation may regulate relative production levels of CoQ10.

[0200] Inventive modified cells, that have been engineered to produce one or more quinone derived compounds and/or to accumulate lipid (including to be oleaginous), can be cultured under conditions that achieve ubiquinone (e.g. CoQ10 and/or C.sub.5-C.sub.9 quinone compounds) production and/or oleaginy.

[0201] In some embodiments, it will be desirable to control growth conditions in order to maximize production of a particular quinone derived compound or quinone derived compounds and/or to optimize accumulation of the particular quinone derived compound(s) in lipid bodies. In some embodiments it will be desirable to control growth conditions to adjust the relative amounts of different quinone derived compound products produced.

[0202] In some embodiments, it will be desirable to limit accumulation of a particular intermediate, for example ensuring that substantially all of a particular intermediate compound is converted so that accumulation is limited. For example, particularly in situations where a downstream enzyme may be less efficient than an upstream enzyme and it is desirable to limit accumulation of the product of the upstream enzyme (e.g., to avoid its being metabolized via a competitive pathway and/or converted into an undesirable product), it may be desirable to grow cells under conditions that control (e.g., slow) activity of the upstream enzyme so that the downstream enzyme can keep pace.

[0203] Those of ordinary skill in the art will appreciate that any of a variety of growth parameters, including for example amount of a particular nutrient, pH, temperature, pressure, oxygen concentration, timing of feeds, content of feeds, etc can be adjusted as is known in the art to control growth conditions as desired.

[0204] To give but a few examples, in some embodiments, growth and/or metabolism is/are limited by limiting the amount of biomass accumulation. For example, growth and/or metabolism can be limited by growing cells under conditions that are limiting for a selected nutrient. The selected limiting nutrient can then be added in a regulated fashion, as desired. In some embodiments, the limiting nutrient is carbon, nitrogen (e.g., via limiting ammonium or protein), phosphate, magnesium, or combinations thereof. In some embodiments, the limiting nutrient is carbon.

[0205] In some embodiments, use of a limiting nutrient can be utilized to control metabolism of a particular intermediate and/or to adjust relative production of particular quinone derived compounds. In some embodiments, this result can be achieved by controlling metabolism of a particular intermediate as discussed above; in some embodiments, it can be achieved, for example, by limiting progress through the quinone biosynthesis pathway so that a desired quinone derived compound product is not converted to a downstream compound. For example, nutrient limitation (e.g. phosphate limitation) may slow the overall rate of flux through the quinone derived compound biosynthesis pathway and may be utilized to change the ratio of one quinone derived compound versus another.

[0206] In some embodiments, cells are grown in the presence of excess carbon source and limiting nitrogen, phosphate, and/or magnesium to induce oleaginy. In some embodiments cells are grown in the presence of excess carbon source and limiting nitrogen. In some embodiments, the carbon:nitrogen ratio is within the range of about 200:1, 150:1, 125:1, 100:1, 95:1, 90:1, 85:1, 80:1, 75:1, 70:1, 65:1, 60:1, 55:1, 50:1, 45:1, 40:1, 35:1, 30:1, 25:1, 20:1, 15:1, 10:1, or less. Those of ordinary skill in the art are aware of a wide variety of carbon sources, including, for example, glycerol, glucose, galactose, dextrose, any of a variety of oils (e.g., olive, canola, corn, sunflower, soybean, cottonseed, rapeseed, etc., and combinations thereof) that may be utilized in accordance with the present invention. Combinations of such may also be utilized. For example, common carbon source compositions contain oil:glucose in a ratios within the range of about 5:95 to 50:50 (e.g. about 5:95, about 10:90, about 15:85, about 20:80, about 25:75, about 30:70, about 35:65, about 40:60, about 45:55, about 50:50).

[0207] Those of ordinary skill in the art are also aware of a variety of different nitrogen sources (e.g., ammonium sulfate, proline, sodium glutamate, soy acid hydrolysate, yeast extract-peptone, yeast nitrogen base, corn steep liquor, etc, and combinations thereof) that can be utilized in accordance with the present invention.

[0208] In some embodiments, cultures are grown at a selected oxygen concentration (e.g., within a selected range of oxygen concentrations). In some embodiments, oxygen concentration may be varied during culture. In some embodiments, oxygen concentration may be controlled during some periods of culture and not controlled, or controlled at a different point, during others. In some embodiments, oxygen concentration is not controlled. In some embodiments, cultures are grown at an oxygen concentration within the rage of about 5-30%, 5-20%, 10-25%, 10-30%, 15-25%, 15-30%, including at about 5%, 6%, 7%, 8%, 9%, 10%, 11%, 12%, 13%, 14%, 15%, 16%, 17%, 18%, 19%, 20%, 21%, 22%, 23%, 24%, 25%, 26%, 27%, 28%, 29%, 30%, or more. In some embodiments, oxygen concentration is maintained above about 20%, at least for some period of the culture.

[0209] In some embodiments, cells are grown via fed-batch fermentation. In some embodiments, feed is continued until feed exhaustion and/or the feed is controlled to initiate or increase once a certain level of dissolved oxygen is detected in the culture medium (e.g., about 5%, 6%, 7%, 8%, 9%, 10%, 11%, 12%, 13%, 14%, 15%, 16%, 17%, 18%, 19%, 20%, 21%, 22%, 23%, 24%, 25%, 26%, 27%, 28%, 29%, 30%, 31%, 32%, 33%, 34%, 35%, 36%, 37%, 38%, 39%, 40%, or more dissolved oxygen). The feed rate can be modulated to maintain the dissolved oxygen at a specific level (e.g., about 5%, 6%, 7%, 8%, 9%, 10%, 11%, 12%, 13%, 14%, 15%, 16%, 17%, 18%, 19%, 20%, 21%, 22%, 23%, 24%, 25%, 26%, 27%, 28%, 29%, 30%, 31%, 32%, 33%, 34%, 35%, 36%, 37%, 38%, 39%, 40%, or more dissolved oxygen).

[0210] In some embodiments, inventive modified cells are grown in a two-phase feeding protocol in which the first phase is designed to maintain conditions of excess carbon and limiting oxygen, and the second phase results in conditions of excess oxygen and limiting carbon.

[0211] In some embodiments, inventive modified cells are cultivated at constant temperature (e.g., between about 20-40, or 20-30.degree. C., including for example at about 20, 20.5, 21, 21.5, 22, 22.5, 23, 23.5, 24, 24.5, 25, 25.5, 26, 26.5, 27, 27.5, 28, 28.5, 29, 29.5, 30.degree. C. or above) and/or pH (e.g., within a range of about 4-7.5, or 4-6.5, 3.5-7, 3.5-4, 4-4.5, 4.5-5, 5-5.5, 5.5-6, 6-6.5, 6.5-7, etc., including at about 4.0, 4.1, 4.2, 4.3, 4.4, 4.5, 4.6, 4.7, 4.8, 4.9, 5.0, 5.1, 5.2, 5.3, 5.4, 5.5, 5.6, 5.7, 5.8, 5.9, 6.0, 6.1, 6.2, 6.3, 6.4, or 6.5, 6.6, 6.7, 6.8, 6.9, 7.0, 7.1, 7.2, 7.3, 7.4, 7.5 or above); in other embodiments, temperature and/or pH may be varied during the culture period, either gradually or in a stepwise fashion.

[0212] In some embodiments, the temperature at which inventive cells are cultivated is selected so that production of one or more particular carotenoid compound(s) is adjusted (e.g., so that production of one or more particular compound(s) is increased and/or production of one or more other compound(s) is decreased). In some embodiments, the temperature at which inventive cells are cultivated is selected so that the ratio of one quinone derived compound to another, is adjusted. To give but one example, in some embodiments, a temperature is selected to be sufficiently low that levels of one quinone derived compound are reduced and the level of at least one other quinone derived compound(s) is increased.

[0213] In some embodiments, cultures are grown at about pH 5.5 and or at a temperature between about 28-30.degree. C. In some embodiments, it may be desirable to grow inventive modified cells under low pH conditions, in order to minimize growth of other cells. In some embodiments, it will be desirable to grow inventive modified cells under relatively higher temperature conditions in order to slow growth rate and/or increase the ultimate dry cell weight output of quinone derived compounds (e.g. CoQ10, C.sub.5-C.sub.9 quinones, vitamin K compounds, vitamin E compounds).

[0214] One advantage provided by the present invention is that, in addition to allowing the production of high levels of one or more quinone derived compounds, certain embodiments of the present invention allow produced compounds to be readily isolated because they accumulate in the lipid bodies within oleaginous organisms. Methods and systems for isolating lipid bodies have been established for a wide variety of oleaginous organisms (see, for example, U.S. Pat. Nos. 5,164,308; 5,374,657; 5,422,247; 5,550,156; 5,583,019; 6,166,231; 6,541,049; 6,727,373; 6,750,048; and 6,812,001, each of which is incorporated herein by reference in its entirety). In brief, cells are typically recovered from culture, often by spray drying, filtering or centrifugation.

[0215] Of course, it is not essential that lipid bodies be specifically isolated in order to collect quinone derived compounds produced according to the present invention. Any of a variety of approaches can be utilized to isolate and/or purify quinone derived compounds. Many useful extraction and/or purification procedures for lipophilic agents generally, are known in the art (see, for example, EP670306, EP719866, U.S. Pat. No. 4,439,629, U.S. Pat. No. 4,680,314, U.S. Pat. No. 5,310,554, U.S. Pat. No. 5,328,845, U.S. Pat. No. 5,356,810, U.S. Pat. No. 5,422,247, U.S. Pat. No. 5,591,343, U.S. Pat. No. 6,166,231, U.S. Pat. No. 6,750,048, U.S. Pat. No. 6,812,001, U.S. Pat. No. 6,818,239, U.S. Pat. No. 7,015,014, US2003/0054070, US2005/0266132, each of which is incorporated herein by reference).

[0216] In many typical isolation procedures, cells are disrupted (e.g., mechanically, chemically [e.g., by exposure to a mild caustic agent such as a detergent or 0.1 N NaOH, for example at room temperature or at elevated temperature], etc.) to allow access of intracellular quinone derived compound(s) to an extraction solvent, and are then extracted one or more times. Cells may optionally be concentrated (e.g., to at least about 100 g/L or more, including to at least about 120 g/l, 150 g/l, 175 g/L, 200 g/L or more) and/or dried (e.g., with a spray dryer, Blaw Knox double drum dryer, single drum vacuum dryer, etc.), prior to exposure to extraction solvent (and/or prior to disruption or homogenization). Disruption can, of course, be performed prior to and/or during exposure to extraction solvent. After extraction, solvent is typically removed (e.g., by evaporation, for example by application of vacuum, change of temperature, etc.).

[0217] In some instances, cells are homogenized and then subjected to supercritical liquid extraction or solvent extraction. Typical liquids or solvents utilized in such extractions include, for example, organic or non-organic liquids or solvents. To give but a few specific examples, such liquids or solvents may include acetone, carbon dioxide (e.g., supercritical carbon dioxide, acetone, heptane, octane, ethanol), chloroform, ethanol, ethyl acetate, heptane, hexane, hexane:ethyl acetate, isopropanol, methanol, methylene chloride, isopropanol, ethyl acetateoctane, tetrahydrofuran (THF), different types of oils (e.g., soybeen, rapeseed, etc.), and combinations thereof. Particular solvents may be selected, for example, based on their ability to solubilize particular quinone derived compounds or sets of quinone derived compounds (e.g., all quinone derived compounds), and/or based on regulatory or other considerations (e.g., toxicity, cost, ease of handling, ease of removal, ease of disposal, etc.). For example, more polar quinone derived compounds may be extracted more efficiently into extraction solvents with increased polarity. In some embodiments, hexane is used as a solvent. In other embodiments, hexane:ethyl acetate is utilized.

[0218] In some embodiments, combinations of solvents may be utilized. In some embodiments, combinations of a relatively polar solvent (e.g., alcohols, acetone, chloroform, methylene chloride, ethyl acetate, etc.) and a relatively non-polar solvent (e.g., hexane, oils, etc.) are utilized for extraction. Those of ordinary skill in the art will readily appreciate that different ratios of polar to non-polar solvent may be employed as appropriate in a particular situation. Just to give a few examples, common ratios include 1:1, 2:1, 3:1, 3:2, 95:5, 90:10, 85:15, 80:20, 75:25, 70:30, 65:45, 60:40, 55:45, and 50:50. It will be appreciated that solvents or solvent mixtures of different polarities may be more effective at extracting particular quinone derived compounds (e.g., based on their polarities and/or as a function of other attributes of the host cell material from which they are being extracted). Those of ordinary skill in the art are well able to adjust the overall polarity of the extracting solvent, for instance by adjusting the relative amounts of polar and non-polar solvents in a solvent blend, in order to achieve more efficient extraction.

[0219] Extraction may be performed under any of a variety of environmental conditions, including any of a variety of temperatures. For example, extraction may be performed on ice, at room temperature, or at any of a variety of other temperatures. For example, a solvent may be maintained at a selected temperature (e.g., about 4, 25, 28, 30, 37, 68, 70, 75, 80, 85, 90, 95, or 100.degree. C.) in order to improve or adjust extraction of a particular desired quinone derived compound.

[0220] Extraction typically yields a crude oil suspension. In some embodiments, the crude oil suspension contains some intact host cells but is at least about 95% free of intact host cells. In some embodiments, the crude oil suspension is at least about 96%, 97%, 98%, or 99% or more free of intact host cells. In some embodiments, the suspension is substantially free of water-soluble cell components (e.g., nucleic acids, cell wall or storage carbohydrates, etc.). In some embodiments, the suspension contains less than about 5%, 4%, 3%, 2%, or 1% or less water-soluble cell components.

[0221] Extraction conditions that yield a crude oil suspension will enrich for lipophilic components that accumulate in the lipid bodies within oleaginous organisms. In general, the major components of the lipid bodies consist of triacylglycerols, ergosteryl esters, other steryl esters, free ergosterol, phospholipids, and some proteins, which often function in the synthesis or regulation of the levels of other lipid body components. C16 and C18 (e.g. C16:0, C16:1, C18:0, C18:1, and C18:2) are generally the major fatty acids present in lipid bodies, mainly as components of triacylglycerol and steryl esters.

[0222] In some embodiments of the invention, the crude oil suspension contains at least about 2.5% by weight quinone derived compound(s); in some embodiments, the crude oil suspension contains at least about 5% by weight quinone derived compound(s), at least about 10% by weight quinone derived compound(s), at least about 20% by weight quinone derived compound(s), at least about 30% by weight quinone derived compound(s), at least about 40% by weight quinone derived compound(s), or at least about 50% by weight quinone derived compound(s).

[0223] The crude oil suspension may optionally be refined as known in the art. Refined oils may be used directly as feed or food additives. Alternatively or additionally, quinone derived compounds (e.g. CoQ 10) can be isolated from the oil using conventional techniques.

[0224] Given the sensitivity of quinones generally to oxidation, many embodiments of the invention employ oxidative stabilizers (e.g., tocopherols, vitamin C; ethoxyquin; vitamin E, BHT, BHA, TBHQ, etc., or combinations thereof) during and/or after quinone derived compound isolation. When the desired product is the reduced ubiquinol form, then it may be advantageous to further add conventional reducing agents (e.g. sodium dithionite, ascorbic acid). Alternatively or additionally, microencapsulation, for example with proteins, may be employed to add a physical barrier to oxidation and/or to improve handling (see, for example, U.S. Patent Application 2004/0191365).

[0225] Extracted quinone derived compounds may be further isolated and/or purified, for example, by crystallization, washing, recrystallization, and/or other purification strategies. In some embodiments, carotenoid crystals are collected by filtration and/or centrifugation. Isolated or purified quinone derived compounds may be dried and/or formulated for storage, transport, sale, and/or ultimate use. To give but a few specific examples, quinone derived compounds may be prepared as a cold-water dispersible powder, as a suspension of crystals in oil (e.g., vegetable oil, e.g., about 5%-30% w/w), etc.

Uses

[0226] Quinone derived compounds (e.g., ubiquinones, vitamin K compounds, and/or vitamin E compounds) produced according to the present invention can be utilized in any of a variety of applications, for example exploiting biological or nutritional (e.g., metabolic, anti-oxidant, anti-proliferative, etc.) properties.

[0227] For example, according to the present invention, one or more quinone derived compounds may be used in pharmaceutical and/or nutraceutical applications for treatment and/or prevention of disorders such as cardiovascular disorders (including, congestive heart failure, myocardial infarction and cardiac surgery patients, angina, hypertension, etc), metabolic disorders, diabetes, pain, aging, neurodegenerative disorders (e.g., Parkinson's and Huntington's disorders), inflammatory disorders and cancers. Additionally, supplements of certain quinone derived compounds (including, e.g., CoQ10) have been proposed to improve performance (e.g., athletic performance). See, for example, Choi, et al., Appi. Microbiol., Biotechnol, 68: 9-15, 2005; Kawamukai, J. Biosci. Bioeng, 94:511-517, 2002; Ernster and Dallner, Biochim. Biophys. Acta., 1271: 195-204, 1995; U.S. Pat. No. 6,806,076; U.S. Pat. No. 6,867,024; U.S. Pat. No. 6,686,485; U.S. Pat. No. 6,417,233; U.S. Patent Publication No. 2006/0010519; U.S. Patent Publication No. 2004/0034107; U.S. Patent Publication No. 2003/0236239; U.S. Patent Publication No. 2003/0167556; U.S. Patent Publication No. 2002/0058712), nutritional supplement (see for example, U.S. Pat. No. 6,686,485; U.S. Pat. No. 6,080,788, U.S. Patent Publication No. 2004/0082536) food supplements (see, for example, U.S. Patent Publication Nos. 2005/011226 and 2004/0115309, cosmetics (as anti-oxidants and/or as cosmetics, including fragrances; see for example U.S. Patent Publication No. 2005/0084505; U.S. Patent Publication No. 2003/0167556), etc. The contents of each of the foregoing journal and patent publications are hereby incorporated by reference. Quinone derived compound(s) produced herein can also be co-administered with an HMG CoA reductase inhibitor (e.g. a statin such as atorvastatin, simvastatin, rosuvastatin, etc.)

[0228] It will be appreciated that, in some embodiments of the invention, one or more quinone derived compound(s) produced by manipulated host cells as described herein are incorporated into a final product (e.g., food or feed supplement, pharmaceutical, cosmetic, etc.) in the context of the host cell. For example, host cells may be lyophilized, freeze dried, frozen or otherwise inactivated, and then whole cells may be incorporated into or used as the final product. The host cells may also be processed prior to incorporation in the product to, e.g., increase bioavailability (e.g., via lysis). Alternatively or additionally, a final product may incorporate only a portion of a host cell (e.g., fractionated by size, solubility), separated from the whole. For example, in some embodiments of the invention, lipid bodies are isolated from host cells and are incorporated into or used as the final product.

[0229] For instance, inventive compound-containing lipid bodies (e.g., from engineered cells, and particularly from engineered fungal cells) may be substituted for the plant oil bodies described in U.S. Pat. No. 6,599,513 (the entire contents of which are hereby incorporated by reference) and incorporated into emulsions or emulsion formulations, as described thereon. In other embodiments, a ubiquinone itself is isolated and reformulated into a final product.

[0230] Preparations of one or more quinone derived compounds, including formulations, dosing, supplements (e.g., dietary supplements), have been described and are known in the art. For example, see citations relating to pharmaceutical preparations, nutritional supplement, and food additives described above. Additionally, see, for example, U.S. Pat. No. 6,906,106, U.S. Pat. No. 6,867,024; U.S. Pat. No. 6,740,338; U.S. Pat. No. 6,300,377; U.S. Patent Publication No. 2003/0105168, U.S. Patent Publication No. 2004/0014817; U.S. Patent Publication No. 2005/0019268; U.S. Patent Publication No. 2004/0152612; U.S. Patent Publication No. 2003/0167556; U.S. Patent Publication No. 2005/0153406; U.S. Patent Publication No. 2005/0181109; U.S. Patent Publication No. 2006/0010519; International Patent Publication No. WO05069916A2, and International Patent Publication No. WO05092123A1, each of which is incorporated herein by reference.

[0231] The amount of any particular quinone derived compound incorporated into a given product may vary dramatically depending on the product, and the particular compound(s) involved. Amounts may range, for example, from less than 0.01% by weight of the product, to more than 1%, 10%, 20%, 30% or more; in some cases the compound may comprise 100% of the product. Thus, the amount of quinone derived compound incorporated into a given product may be, for example, 0.01%, 0.02%, 0.03%, 0.04%, 0.05%, 0.06%, 0.07%, 0.08%, 0.09%, 0.1%, 0.2%, 0.3%, 0.4%, 0.5%, 0.6%, 0.7%, 0.8%, 0.9%, 1%, 2%, 3%, 4%, 5%, 6%, 7%, 8%, 9%, 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, or 100%.

[0232] In some embodiments of the invention, one or more produced quinone derived compound is incorporated into a component of food or feed (e.g., a food supplement, food additive). Types of food products into which quinone derived compound (s) can be incorporated according to the present invention are not particularly limited, and include beverages such as teas, juices, and liquors; confections such as jellies and biscuits; fat-containing foods and beverages such as dairy products; processed food products such as rice and soft rice (or porridge); infant formulas; or the like. In some embodiments of this aspect of the invention, it may be useful to incorporate the quinone derived compound within bodies of edible lipids as it may facilitate incorporation into certain fat-containing food products.

[0233] Examples of feedstuffs into which one or more quinone derived compounds produced in accordance with the present invention may be incorporated include, for instance, pet foods such as cat foods, dog foods and the like, feeds for aquarium fish or cultured fish, etc., feed for farm-raised animals (including livestock and further including fish raised in aquaculture). Food or feed material into which the quinone derived compound (s) produced in accordance with the present invention is incorporated is preferably palatable to the organism which is the intended recipient. This food or feed material may have any physical properties currently known for a food material (e.g., solid, liquid, soft).

[0234] In some embodiments of the invention, produced quinone derived compound is incorporated into a cosmetic product. Examples of such cosmetics include, for instance, skin cosmetics (e.g., lotions, emulsions, liquids, creams and the like), lipsticks, anti-sunburn cosmetics, makeup cosmetics, fragrances, products for daily use (e.g., toothpastes, mouthwashes, bad breath preventive agents, solid soaps, liquid soaps, shampoos, conditioners), etc.

[0235] In some embodiments, produced quinone derived compound is incorporated into a pharmaceutical. Examples of such pharmaceuticals include, for instance, various types of tablets, capsules, drinkable agents, troches, gargles, etc. In some embodiments, the pharmaceutical is suitable for topical application. Dosage forms are not particularly limited, and include capsulae, oils, granula, granula subtilae, pulveres, tabellae, pilulae, trochisci, or the like. Oils and oil-filled capsules may provide additional advantages both because of their lack of ingredient decomposition during manufacturing, and because inventive compound-containing lipid droplets may be readily incorporated into oil-based formulations.

[0236] Pharmaceuticals according to the present invention may be prepared according to techniques established in the art including, for example, the common procedure as described in the United States Pharmacopoeia, for example.

[0237] In still other embodiments, produced quinone derived compound is incorporated into a nutritional supplement or nutraceutical. Examples of such nutraceuticals, include, for instance, various types of tablets, capsules, drinkable agents, troches, gargles, etc. In some embodiments, the nutraceutical is suitable for topical application. Dosage forms are, as in pharmaceutical products, not particularly limited and include any of the same types of dosages as pharmaceuticals.

[0238] Quinone derived compound(s) produced according to the present invention (whether isolated or in the context of lipid droplets or of cells, e.g., fungal cells) may be incorporated into products as described herein by combination with any of a variety of agents. For instance, such quinone derived compound (s) may be combined with one or more binders or fillers. In some embodiments, inventive products will include one or more chelating agents, pigments, salts, surfactants, moisturizers, viscosity modifiers, thickeners, emollients, fragrances, preservatives, etc., and combinations thereof.

[0239] Useful surfactants include, for example, anionic surfactants such as branched and unbranched alkyl and acyl hydrocarbon compounds, sodium dodecyl sulfate (SDS); sodium lauryl sulfate (SLS); sodium lauryl ether sulfate (SLES); sarconisate; fatty alcohol sulfates, including sodium, potassium, ammonium or triethanolamine salts of C.sub.10 to C.sub.18 saturated or unsaturated forms thereof; ethoxylated fatty alcohol sulfates, including alkyl ether sulfates; alkyl glyceryl ether sulfonate, alpha sulpho fatty acids and esters; fatty acid esters of isethionic acid, including Igepon A; acyl (fatty) N-methyltaurides, including Igepon T; dialkylsulfo succinate esters, including C.sub.8, C.sub.10 and C.sub.12 forms thereof; Miranot BT also referred to as lauroamphocarboxyglycinate and sodium tridecath sulfate; N-acylated amino acids, such as sodium N-lauroyl sarconisate or gluconate; sodium coconut monoglyceride sulfonate; and fatty acid soaps, including sodium, potassium, DEA or TEA soaps.

[0240] Among the cationic surfactants that are useful are monoalkyl trimethyl quartenary salts; dialkyl dimethyl quartenary salts; ethoxylated or propoxylated alkyl quaternary ammonium salts, also referred to in the art as ethoquats and propoquats; cetyl benzylmethylalkyl ammonium chloride; quaternized imidazolines, which are generally prepared by reacting a fat or fatty acid with diethylenetriamine followed by quaternization, and non-fat derived cationic polymers such as the cellulosic polymer, Polymer JR (Union Carbide).

[0241] Further useful cationic surfactants include lauryl trimethyl ammonium chloride; cetyl pyridinium chloride; and alkyltrimethylammonium bromide. Cationic surfactants are particularly useful in the formulation of hair care products, such as shampoos, rinses and conditioners.

[0242] Useful nonionic surfactants include polyethoxylated compounds and polypropoxylated products. Examples of ethoxylated and propoxylated non-ionic surfactants include ethoxylated anhydrohexitol fatty esters, for example Tween 20; mono- and diethanolamides; Steareth-20, also known as Volpo20; polyethylene glycol fatty esters (PEGs), such as PEG-8-stearate, PEG-8 distearate; block co-polymers, which are essentially combinations of hydrophylic polyethoxy chains and lipophilic polypropoxy chains and generically known as Poloaxamers.

[0243] Still other useful non-ionic surfactants include fatty esters of polyglycols or polyhydric alcohols, such as mono and diglyceride esters; mono- and di-ethylene glycol esters; diethylene glycol esters; sorbitol esters also referred to as Spans; sucrose esters; glucose esters; sorbitan monooleate, also referred to as Span80; glyceryl monostearate; and sorbitan monolaurate, Span20 or Arlacel 20.

[0244] Yet other useful nonionic surfactants include polyethylene oxide condensates of alkyl phenols and polyhydroxy fatty acid amide surfactants which may be prepared as for example disclosed in U.S. Pat. No. 2,965,576.

[0245] Examples of amphoteric surfactants which can be used in accordance with the present invention include betaines, which can be prepared by reacting an alkyldimethyl tertiary amine, for example lauryl dimethylamine with chloroacetic acid. Betaines and betaine derivatives include higher alkyl betaine derivatives including coco dimethyl carboxymethyl betaine; sulfopropyl betaine; alkyl amido betaines; and cocoamido propyl betaine. Sulfosultaines which may be used include for example, cocoamidopropyl hydroxy sultaine. Still other amphoteric surfactants include imidazoline derivatives and include the products sold under the trade name "Miranol" described in U.S. Pat. No. 2,528,378 which is incorporated herein by reference in its entirety. Still other amphoterics include phosphates for example, cocamidopropyl PG-dimonium chloride phosphate and alkyldimethyl amine oxides.

[0246] Suitable moisturizers include, for example, polyhydroxy alcohols, including butylene glycol, hexylene glycol, propylene glycol, sorbitol and the like; lactic acid and lactate salts, such as sodium or ammonium salts; C.sub.3 and C.sub.6 diols and triols including hexylene glycol, 1,4 dihydroxyhexane, 1,2,6-hexane triol; aloe vera in any of its forms, for example aloe vera gel; sugars and starches; sugar and starch derivatives, for example alkoxylated glucose; hyaluronic acid; lactamide monoethanolamine; acetamide monoethanolamine; glycolic acid; alpha and beta hydroxy acids (e.g. lactic, glycolic salicylic acid); glycerine; pantheol; urea; vaseline; natural oils; oils and waxes (see: the emollients section herein) and mixtures thereof.)

[0247] Viscosity modifiers that may be used in accordance with the present invention include, for example, cetyl alcohol; glycerol, polyethylene glycol (PEG); PEG-stearate; and/or Keltrol.

[0248] Appropriate thickeners for use in inventive products include, for example, gelling agents such as cellulose and derivatives; Carbopol and derivatives; carob; carregeenans and derivatives; xanthane gum; sclerane gum; long chain alkanolamides; bentone and derivatives; Kaolin USP; Veegum Ultra; Green Clay; Bentonite NFBC; etc.

[0249] Suitable emollients include, for example, natural oils, esters, silicone oils, polyunsaturated fatty acids (PUFAs), lanoline and its derivatives and petrochemicals.

[0250] Natural oils which may be used in accordance with the present invention may be obtained from sesame; soybean; apricot kernel; palm; peanut; safflower; coconut; olive; cocoa butter; palm kernel; shea butter; sunflower; almond; avocado; borage; carnauba; hazel nut; castor; cotton seed; evening primrose; orange roughy; rapeseed; rice bran; walnut; wheat germ; peach kernel; babassu; mango seed; black current seed; jojoba; macadamia nut; sea buckthorn; sasquana; tsubaki; mallow; meadowfoam seed; coffee; emu; mink; grape seed; thistle; tea tree; pumpkin seed; kukui nut; and mixtures thereof.

[0251] Esters which may be used include, for example, C.sub.8-C.sub.30 alklyl esters of C.sub.8-C.sub.30 carboxylic acids; C.sub.1-C.sub.6 diol monoesters and diesters of C.sub.8-C.sub.30 carboxylic acids; C.sub.10-C.sub.20 alcohol monosorbitan esters, C.sub.10-C.sub.20 alcohol sorbitan di- and tri-esters; C.sub.10-C.sub.20 alcohol sucrose mono-, di-, and tri-esters and C.sub.10-C.sub.20 fatty alcohol esters of C.sub.2-C.sub.6 2-hydroxy acids and mixtures thereof. Examples of these materials include isopropyl palmitate; isopropyl myristate; isopropyl isononate; C.sub.12/C.sub.14 benzoate ester (also known as Finesolve); sorbitan palmitate, sorbitan oleate; sucrose palmitate; sucrose oleate; isostearyl lactate; sorbitan laurate; lauryl pyrrolidone carboxylic acid; panthenyl triacetate; and mixtures thereof.

[0252] Further useful emollients include silicone oils, including non-volatile and volatile silicones. Examples of silicone oils that may be used in the compositions of the present invention are dimethicone; cyclomethycone; dimethycone-copolyol; aminofunctional silicones; phenyl modified silicones; alkyl modified silicones; dimethyl and diethyl polysiloxane; mixed C.sub.1-C.sub.30 alkyl polysiloxane; and mixtures thereof. Additionally useful silicones are described in U.S. Pat. No. 5,011,681 to Ciotti et al., incorporated by reference herein.

[0253] A yet further useful group of emollients includes lanoline and lanoline derivatives, for example lanoline esters.

[0254] Petrochemicals which may be used as emollients in the compositions of the present invention include mineral oil; petrolatum; isohexdecane; permethyl 101; isododecanol; C.sub.11-C.sub.12 Isoparrafin, also known as Isopar H.

[0255] Among the waxes which may be included in inventive products are animal waxes such as beeswax; plant waxes such as carnauba wax, candelilla wax, ouricurry wax, Japan wax or waxes from cork fibres or sugar cane. Mineral waxes, for example paraffin wax, lignite wax, microcrystalline waxes or ozokerites and synthetic waxes may also be included.

[0256] Exemplary fragrances for use in inventive products include, for instance, linear and cyclic alkenes (i.e. terpenes); primary, secondary and tertiary alcohols; ethers; esters; ketones; nitrites; and saturated and unsaturated aldehydes; etc.

[0257] Examples of synthetic fragrances that may be used in accordance with the present invention include without limitation acetanisole; acetophenone; acetyl cedrene; methyl nonyl acetaldehyde; musk anbrette; heliotropin; citronellol; sandella; methoxycitranellal; hydroxycitranellal; phenyl ethyl acetate; phenylethylisobutarate; gamma methyl ionone; geraniol; anethole; benzaldehyde; benzyl acetate; benzyl salicate; linalool; cinnamic alcohol; phenyl acetaldehyde; amyl cinnamic aldehyde; caphore; p-tertiairy butyl cyclohexyl acetate; citral; cinnamyl acetate; citral diethyl acetal; coumarin; ethylene brasslate; eugenol; 1-menthol; vanillin; etc.

[0258] Examples of natural fragrances of use herein include without limitation lavandin; heliotropin; sandlewood oil; oak moss; pathouly; ambergris tincture; ambrette seed absolute; angelic root oil; bergamont oil; benzoin Siam resin; buchu leaf oil; cassia oil; cedarwood oil; cassia oil; castoreum; civet absolute; chamomile oil; geranium oil; lemon oil; lavender oil; Ylang Ylang oil; etc.

[0259] A list of generally used fragrance materials can be found in various reference sources, for example, "Perfume and Flavor Chemicals", Vols. I and II; Steffen Arctander Allured Pub. Co. (1994) and "Perfumes: Art, Science and Technology"; Muller, P. M. and Lamparsky, D., Blackie Academic and Professional (1994) both incorporated herein by reference.

[0260] Suitable preservatives include, among others, (e.g., sodium metabisulfite; Glydant Plus; Phenonip; methylparaben; Germall 115; Germaben II; phytic acid; sodium lauryl sulfate (SLS); sodium lauryl ether sulfate (SLES); Neolone; Kathon; Euxyl and combinations thereof), anti-oxidants (e.g., butylated hydroxytoluened (BHT); butylated hydroxyanisol (BHA); ascorbic acid (vitamin C); tocopherol; tocopherol acetate; phytic acid; citric acid; pro-vitamin A.

[0261] In some embodiments, inventive products will comprise an emulsion (e.g., containing inventive lipid bodies), and may include one or more emulsifying agents (e.g., Arlacel, such as Alacel 165; Glucamate; and combinations thereof) and/or emulsion stabilizing agents. In some embodiments, inventive products will include one or more biologically active agents other than the ubiquinone(s). To give but a few examples, inventive cosmetic or pharmaceutical products may include one or more biologically active agents such as, for example, sunscreen actives, anti-wrinkle actives, anti-aging actives, whitening actives, bleaching actives, sunless tanning actives, anti-microbial actives, anti-acne actives, anti-psoriasis actices, anti-eczema actives, antioxidants, anesthetics, vitamins, protein actives, etc.

Engineering Production of Multiple Isoprenoid Compounds

[0262] In certain embodiments of the invention, it may be desirable to generate engineered organisms that accumulate one or more other compounds in addition to the quinone derived compound described herein, and further to accumulate such other compound(s), optionally together with the quinone derived compound(s), in lipid bodies. For example, certain inventive engineered organisms may accumulate quinone derived compound together with at least one other compound derived from an isoprenoid precursor. In some embodiments, the other compound derived from an isoprenoid precursor will be one or more quinone derived compound discussed herein (e.g., a ubiquinone, vitamin K, vitamin E). Alternatively or additionally, in some embodiments the other compound derived from an isoprenoid precursor will be one or more carotenoids. Production of carotenoids in oleaginous organisms is described in U.S. Provisional Application No. 60/663,621, filed Mar. 18, 2005, and is also described in U.S. patent application Ser. No. 11/385,580, entitled Production of Carotenoids in Oleaginous Yeast and Fungi, filed Mar. 20, 2006.

[0263] In some embodiments of the invention, host cells are engineered to produce at least two compounds selected from the group consisting of a ubiquinone (e.g. CoQ10, a C.sub.5-9 quinone), vitamin K, vitamin E, and carotenoids. In some such embodiments, host cells are engineered to produce a least one compound selected from the group consisting of a ubiquinone (e.g. CoQ10, a C.sub.5-9 quinone), vitamin K, vitamin E, and at least one other compound. Such host cells are particularly useful for producing mixtures or combination products that contain a ubiquinone and/or vitamin K. Such host cells are also useful for producing combination products including one or more carotenoids.

[0264] In some embodiments of the invention, host cells are engineered to produce at least one quinone derived compound and at least one carotenoid. Carotenoids, which have an isoprene backbone consisting of 40 carbon atoms, have antioxidant effects as well as use in coloring agents. Carotenoids such as .beta.-carotene, astaxanthin, and cryptoxanthin are believed to possess cancer preventing and immunopotentiating activity.

[0265] The carotenoid biosynthesis pathway branches off from the isoprenoid biosynthesis pathway at the point where GGPP is formed. Up to and including formation of FPP, and potentially GGPP, is as described above for production of a ubiquinone. The commitment step in carotenoid biosynthesis is the formation of phytoene by the head-to-head condensation of two molecules of GGPP, catalyzed by phytoene synthase (often called crtB). A series of dehydrogenation reactions, each of which increases the number of conjugated double bonds by two, converts phytoene into lycopene via neurosporene. The pathway branches at various points, both before and after lycopene production, so that a wide range of carotenoids can be generated. For example, action of a cyclase enzyme on lycopene generates .gamma.-carotene; action of a desaturase instead produces 3,4-didehydrolycopene. .gamma.-carotene is converted to .beta.-carotene through the action of a cyclase. .beta.-carotene can be processed into any of a number of products, including astaxanthin (via echinone, hydroxyechinone, and phoenicoxanthin).

[0266] Carotenoid production in a host organism may be adjusted by modifying the expression or activity of one or more proteins involved in carotenoid biosynthesis. In some embodiments of the invention, it will be desirable to utilize as host cells organisms that naturally produce one or more carotenoids. In some such cases, the focus will be on increasing production of a naturally-produced carotenoid, for example by increasing the level and/or activity of one or more proteins involved in the synthesis of that carotenoid and/or by decreasing the level or activity of one or more proteins involved in a competing biosynthetic pathway. Alternatively or additionally, in some embodiments it will be desirable to generate production of one or more carotenoids not naturally produced by the host cell.

[0267] In some embodiments of the invention, it will be desirable to introduce one or more carotenogenic modifications into a host cell. In certain embodiments the carotenogenic modification may confer expression of one or more heterologous carotenogenic polypeptides into a host cell. As will be apparent to those of ordinary skill in the art, any of a variety of heterologous polypeptides may be employed; selection will consider, for instance, the particular carotenoid whose production is to be enhanced. Still further, selection will consider the complementation and/or ability of the selected polypeptide to function in conjunction with additional oleaginic and/or quinonogenic modifications of a cell such that each of oleaginy, ubiquinone biosynthesis and carotenoid biosynthesis are effectuated to the desired extent.

[0268] Proteins involved in carotenoid biosynthesis include, but are not limited to, phytoene synthase, phytoene dehydrogenase, lycopene cyclase, carotenoid ketolase, carotenoid hydroxylase, astaxanthin synthase (a single multifunctional enzyme found in some source organisms that typically has both ketolase and hydroxylase activities), carotenoid epsilon hydroxylase, lycopene cyclase (beta and epsilon subunits), carotenoid glucosyltransferase, and acyl CoA:diacyglycerol acyltransferase. Representative example sequences for carotenoid biosynthesis polypeptides are provided in Tables 17-21 and Tables 38-41.

[0269] Alternatively or additionally, modified carotenoid ketolase polypeptides that exhibit improved carotenoid production activity may be utilized in accordance with the present invention. For example, carotenoid ketolase polypeptides comprising one more mutations to corresponding to those identified Sphingomonassp. DC18 which exhibited improved astaxanthin production (Tao et al 2006 Metab Eng. 2006 Jun 27) and Paracoccus sp. strain N81106 which exhibited altered carotenoid production (Ye et al. Appl Environ Microbiol 72:5829, 2006).

[0270] In some embodiments of the invention, potential source organisms for carotenoid biosynthesis polypeptides include, but are not limited to, genera of naturally oleaginous or non-oleaginous fungi that naturally produce carotenoids. These include, but are not limited to, Botrytis, Cercospora, Fusarium (Gibberella), Mucor, Neurospora, Phycomyces, Puccina, Rhodotorula, Sclerotium, Trichoderma, and Xanthophyllomyces. Exemplary species include, but are not limited to, Neurospora crassa, Xanthophyllomyces dendrorhous (Phaffia rhodozyma), Mucor circinelloides, and Rhodotorula glutinis. Of course, carotenoids are produced by a wide range of diverse organisms such as plants, algae, yeast, fungi, bacteria, cyanobacteria, etc. Any such organisms may be source organisms for carotenoid biosynthesis polypeptides according to the present invention.

[0271] It will be appreciated that the particular carotenogenic modification to be applied to a host cell in accordance with the present invention will be influenced by which carotenoid(s) is desired to be produced. For example, isoprenoid biosynthesis polypeptides are relevant to the production of most carotenoids. Carotenoid biosynthesis polypeptides are also broadly relevant. Carotenoid ketolase activity is particularly relevant for production of canthaxanthin, as carotenoid hydroxylase activity is for production of lutein and zeaxanthin, among others. Both carotenoid hydroxylase and ketolase activities (and/or astaxanthin synthase) are particularly useful for production of astaxanthin.

[0272] Bacterial carotenogenic genes have already been demonstrated to be transferable to other organisms, and are therefore particularly useful in accordance with the present invention (see, for example, Miura et al., Appi. Environ. Microbiol. 64:1226, 1998). In other embodiments, it may be desirable to utilize genes from other source organisms such as plant, alga, or microalgae; these organisms provide a variety of potential sources for ketolase and hydroxylase polypeptides. Still additional useful source organisms include fungal, yeast, insect, protozoal, and mammalian sources of polypeptides.

[0273] In certain embodiments of the present invention, cells are engineered to produce at least one sterol compound together with the at least one quinone derived compound. The commitment step in sterol biosynthesis is the conversion of farnesyl pyrophosphate into presqualene pyrophosphate. Farnesyl pyrophosphate (FPP) is produced from isopentenyl pyrophosphate (IPP), for example in a process that involves isomerization of IPP into dimethylallyl pyrophosphate (DMAPP), followed by three sequential condensation reactions with additional molecules of IPP generate the ten-carbon molecule geranyl pyrophosphate (GPP), followed by the fifteen-carbon molecule farnesyl pyrophosphate (FPP). FPP can either enter the sterol biosynthesis pathway by conversion into presqualene puyrophosphate, or alternatively can be diverted toward biosynthesis of carotenoids and other compounds (e.g., a ubiquinone, vitamin E, vitamin K, etc.) by conversion into the twenty-carbon compound geranylgeranyl pyrophosphate (GGPP). In many instances, FPP appears to be the predominant substrate used by polyprenyldiphosphate synthases (Coq1 polypeptides) during biosynthesis of ubiquinones.

[0274] Once the sterol biosynthesis pathway has been entered, presqualene pyrophosphate is then converted to squalene by the same enzyme that performed the farnesyl pyrophosphate.fwdarw.presqualene pyrophosphate conversion. Squalene is then converted into a variety of different sterol compounds, including, but not limited to, lanosterol, zymosterol, ergosterol, 7-dehydrocholesterol (provitamin D3), and vitamin D compounds. The vitamin D.sub.2 biosynthetic pathway and the vitamin D.sub.3 biosynthetic pathway share some common reactions, and there can be multiple points at which a vitamin D.sub.2 intermediate can be converted into a vitamin D.sub.3 intermediate (see, for example, FIG. 1).

[0275] In some embodiments, the sterol compound whose production is engineered is selected from the group consisting of squalene, lanosterol, zymosterol, ergosterol, and vitamin D compounds (e.g., 7-dehydrocholesterol (provitamin D3)).

[0276] In some embodiments of the present invention, host cells are engineered to produce squalene in addition to one or more quinone derived compounds. In some embodiments, squalene production is enhanced in a cell by introduction of one or more sterologenic modifications that increases levels of IPP, FPP and/or squalene itself. In some embodiments, squalene production is enhanced in a cell by increasing the level and/or activity of one or more squalene biosynthesis polypeptides (e.g., one or more isoprenoid biosynthesis polypeptides, an FPP synthase polypeptide, and/or a squalene synthase polypeptide). Alternatively or additionally, in some embodiments, squalene production is enhanced in a cell by decreasing the level and/or activity of one or more sterol biosynthesis competitor polypeptides that diverts one or more intermediates away from squalene production and/or that metabolizes squalene itself.

[0277] For example, in some embodiments of the present invention, squalene production in a host cell is increased by introducing or increasing expression and/or activity of one or more squalene synthase polypeptides in the cell. Representative examples of squalene synthase polypeptide sequences are included in Table 16. In some embodiments of the invention that utilize squalene synthase (or modifications of squalene synthase) source organisms include, but are not limited to, Neurospora crassa, Aspergillus niger, Saccharomyces cerevisiae, Mucor circinelloides, Candida utilis, Mortierella alpina, Phaffia rhodozyma, and Yarrowia lipolytica.

[0278] In some embodiments of the invention, squalene production in a host cell is increased by reducing the level or activity of one or more squalene biosynthesis competitor polypeptides, including for example one or more vitamin D biosynthesis polypeptides (e.g., which act to metabolize squalene). For instance, in some embodiments, the level or activity of one or more polypeptides active in the ergosterol biosynthetic pathway (see below) is reduced or eliminated.

[0279] In some embodiments, the sterol compound produced in addition to one or more quinone derived compounds according to the present invention is lanosterol. In some embodiments, lanosterol production is enhanced in a cell by introduction of one or more sterologenic modifications that increases levels of IPP, FPP and/or lanosterol itself. In some embodiments, lanosterol production is enhanced in a cell by increasing the level and/or activity of one or more lanosterol biosynthesis polypeptides (e.g., one or more isoprenoid biosynthesis polypeptides, an FPP synthase polypeptide, squalene synthase polypeptide, squalene epoxidase polypeptide, or 2,3-oxidosqualene-lanosterol cyclase polypeptide). Alternatively or additionally, in some embodiments, lanosterol production is enhanced in a cell by decreasing the level and/or activity of one or more sterol biosynthesis competitor polypeptides that diverts one or more intermediates away from lanosterol production and/or that metabolizes lanosterol itself.

[0280] For example, in some embodiments of the present invention, lanosterol production in a host cell is increased by introducing or increasing expression and/or activity of one or more squalene synthase polypeptide, squalene epoxidase polypeptide, or 2,3-oxidosqualene-lanosterol cyclase polypeptide in the cell. Representative examples of squalene synthase polypeptide, squalene epoxidase polypeptide, and 2,3-oxidosqualene-lanosterol cyclase polypeptide sequences are included in Table 16, 83, and 85. In some embodiments of the invention that utilize squalene synthase polypeptide, squalene epoxidase polypeptide, or 2,3-oxidosqualene-lanosterol cyclase polypeptide (or modifications of these polypeptides) source organisms include, but are not limited to, Neurospora crassa, Aspergillus niger, Saccharomyces cerevisiae, Phaffia rhodozyma, Mucor circinelloides, Candida utilis, Mortierella alpina, and Yarrowia lipolytica.

[0281] In some embodiments of the invention, lanosterol production in a host cell is increased by reducing the level or activity of one or more vitamin D biosynthesis polypeptides (e.g., which act to metabolize squalene). For instance, in some embodiments, the level or activity of one or more polypeptides active in the ergosterol biosynthetic pathway (see below).

[0282] In some embodiments of the present invention, host cells are engineered to produce zymosterol in addition to one or more quinone derived compounds. In some embodiments, zymosterol production is enhanced in a cell by introduction of one or more sterologenic modifications that increases levels of IPP, FPP and/or zymosterol itself. In some embodiments, zymosterol production is enhanced in a cell by increasing the level and/or activity of one or more zymosterol biosynthesis polypeptides (e.g., one or more isoprenoid biosynthesis polypeptides, an FPP synthase polypeptide, squalene synthase polypeptide, squalene epoxidase polypeptide, 2,3-oxidosqualene-lanosterol cyclase polypeptide, cytochrome P450 lanosterol 14.alpha.-demethylase polypeptide, C-14 sterol reductase polypeptide, or C-4 sterol methyl oxidase polypeptide). Alternatively or additionally, in some embodiments, zymosterol production is enhanced in a cell by decreasing the level and/or activity of one or more sterol biosynthesis competitor polypeptides that diverts one or more intermediates away from lanosterol production and/or that metabolizes lanosterol itself.

[0283] For example, in some embodiments of the present invention, zymosterol production in a host cell is increased by introducing or increasing expression and/or activity of one or more squalene synthase polypeptide, squalene epoxidase polypeptide, 2,3-oxidosqualene-lanosterol cyclase polypeptide, cytochrome P450 lanosterol 14.alpha.-demethylase polypeptide, C-14 sterol reductase polypeptide, or C-4 sterol methyl oxidase polypeptide in the cell. Representative examples of squalene synthase polypeptide, squalene epoxidase polypeptide, 2,3-oxidosqualene-lanosterol cyclase polypeptide, cytochrome P450 lanosterol 14.alpha.-demethylase polypeptide, C-14 sterol reductase polypeptide, or C-4 sterol methyl oxidase polypeptide sequences are included in Tables 16, and tables 86-99. In some embodiments of the invention that utilize squalene synthase polypeptide, squalene epoxidase polypeptide, 2,3-oxidosqualene-lanosterol cyclase polypeptide, cytochrome P450 lanosterol 14.alpha.-demethylase polypeptide, C-14 sterol reductase polypeptide, or C-4 sterol methyl oxidase polypeptide (or modifications of these polypeptides) source organisms include, but are not limited to, Neurospora crassa, Aspergillus niger, Saccharomyces cerevisiae, Phaffia rhodozyma, Mucor circinelloides, Candida utilis, Mortierella alpina, and Yarrowia

[0284] In some embodiments of the invention, zymosterol production in a host cell is increased by reducing the level or activity of one or more vitamin D biosynthesis polypeptides (e.g., which act to metabolize squalene). For instance, in some embodiments, the level or activity of one or more polypeptides active in the ergosterol biosynthetic pathway (see below).

[0285] In some embodiments of the present invention, host cells are engineered to produce ergosterol in addition to one or more quinone derived compounds. In some embodiments, ergosterol production is enhanced in a cell by introduction of one or more sterologenic modifications that increases levels of IPP, FPP and/or ergosterol itself. In some embodiments, ergosterol production is enhanced in a cell by increasing the level and/or activity of one or more ergosterol biosynthesis polypeptides (e.g., one or more isoprenoid biosynthesis polypeptides, an FPP synthase polypeptide, squalene synthase polypeptide, squalene epoxidase polypeptide, 2,3-oxidosqualene-lanosterol cyclase polypeptide, cytochrome P450 lanosterol 14.alpha.-demethylase polypeptide, C-14 sterol reductase polypeptide, C-4 sterol methyl oxidase polypeptide, SAM:C-24 sterol methyltransferase polypeptide, C-8 sterol isomerase polypeptide, C-5 sterol desaturase polypeptide, C-22 sterol desaturase polypeptide, or C-24 sterol reductase polypeptide. Alternatively or additionally, in some embodiments, ergosterol production is enhanced in a cell by decreasing the level and/or activity of one or more sterol biosynthesis competitor polypeptides that diverts one or more intermediates away from ergosterol production and/or that metabolizes ergosterol itself.

[0286] For example, in some embodiments of the present invention, ergosterol production in a host cell is increased by introducing or increasing expression and/or activity of one or more squalene synthase polypeptide, squalene epoxidase polypeptide, 2,3-oxidosqualene-lanosterol cyclase polypeptide, cytochrome P450 lanosterol 14.alpha.-demethylase polypeptide, C-14 sterol reductase polypeptide, C-4 sterol methyl oxidase polypeptide, SAM:C-24 sterol methyltransferase polypeptide, C-8 sterol isomerase polypeptide, C-5 sterol desaturase polypeptide, C-22 sterol desaturase polypeptide, or C-24 sterol reductase polypeptide in the cell. Representative examples of squalene synthase polypeptide, squalene epoxidase polypeptide, 2,3-oxidosqualene-lanosterol cyclase polypeptide, cytochrome P450 lanosterol 14.alpha.-demethylase polypeptide, C-14 sterol reductase polypeptide, C-4 sterol methyl oxidase polypeptide, SAM:C-24 sterol methyltransferase polypeptide, C-8 sterol isomerase polypeptide, C-5 sterol desaturase polypeptide, C-22 sterol desaturase polypeptide, or C-24 sterol reductase polypeptide sequences are included in Table 16, and tables 86-99. In some embodiments of the invention that utilize squalene synthase polypeptide, squalene epoxidase polypeptide, 2,3-oxidosqualene-lanosterol cyclase polypeptide, cytochrome P450 lanosterol 14.alpha.-demethylase polypeptide, C-14 sterol reductase polypeptide, C-4 sterol methyl oxidase polypeptide, SAM:C-24 sterol methyltransferase polypeptide, C-8 sterol isomerase polypeptide, C-5 sterol desaturase polypeptide, C-22 sterol desaturase polypeptide, or C-24 sterol reductase polypeptide (or modifications of these polypeptides) source organisms include, but are not limited to, Neurospora crassa, Aspergillus niger, Saccharomyces cerevisiae, Mucor circinelloides, Candida utilis, Mortierella alpina, and Yarrowia lipolytica.

[0287] In some embodiments of the invention, ergosterol production in a host cell is increased by reducing the level or activity of one or more vitamin D biosynthesis polypeptides (e.g., which act to metabolize squalene). For instance, in some embodiments, the level or activity of one or more polypeptides (see below).

[0288] In some embodiments, the sterol compound produced in addition to one or more quinone derived compounds in accordance with the present invention is a vitamin D compound. Vitamin D compounds are a group of steroid compounds including vitamin D.sub.3 (cholecalciferol), vitamin D.sub.2 (ergocalciferol), their provitamins, and certain metabolites (see, for example, FIG. 10A-B). Vitamins D.sub.3 and D.sub.2 can be produced from their respective provitamins (e.g., 7-dehydrocholesterol and ergosterol) by ultraviolet irradiation (e.g., by the action of sunlight). The most biologically active form of vitamin D is 1,25-dihydroxy vitamin D.sub.3, which is also known as calcitriol. Calcitriol is produced by hydroxylation of vitamin D.sub.3 at the 25 position, followed by hydroxylation to generate calcitriol.

[0289] In some embodiments, vitamin D production is enhanced in a cell by introduction of one or more sterologenic modifications that increases levels of IPP, FPP and/or squalene. In some embodiments, vitamin D production is enhanced in a cell by increasing the level and/or activity of one or more vitamin D biosynthesis polypeptides (e.g., one or more isoprenoid biosynthesis polypeptides, an FPP synthase polypeptide, a squalene synthase polypeptide, and/or one or more polypeptides involved in converting squalene into a particular vitamin D compound of interest [e.g., 7-dehydrocholesterol and/or calcitriol]). Alternatively or additionally, in some embodiments, vitamin D production is enhanced in a cell by decreasing the level and/or activity of one or more sterol biosynthesis competitor polypeptides that diverts one or more intermediates away from vitamin D production. In some embodiments of the invention, production of a particular vitamin D compound (e.g., 7-dehydrocholesterol and/or calcitriol) is enhanced by decreasing the level and/or activity of one or more polypeptides that diverts a relevant intermediate toward an alternative vitamin D compound (e.g., ergosterol, vitamin D.sub.2).

[0290] To give but one particular example of a sterologenic modification that can be employed in accordance with the present invention to increase production of one or more vitamin D.sub.3 compounds (e.g., 7-dehydrocholesterol and/or calcitriol), in accordance with some embodiments of the present invention, the level and/or activity of one or more polypeptides that diverts a relevant intermediate toward a vitamin D.sub.2 compound (e.g., ergosterol, vitamin D.sub.2), and away from vitamin D.sub.3 compounds, can be inhibited or destroyed. In some embodiments of the invention, where multiple heterologous polypeptides are to be expressed (e.g., because one or more activities of interest require two or more polypeptide chains and/or because multiple activities of interest are being engineered), it may be desirable to utilize the same source organism for all, or to utilize closely related source organisms; in other embodiments, heterologous polypeptides may be from different source organisms. In some embodiments, two or more versions of a particular heterologous polypeptide, optionally from different source organisms, may be introduced into and/or engineered within, a single host cell.

[0291] Having now generally described the invention, the same will be more readily understood through reference to the following exemplification which is provided by way of illustration, and is not intended to be limiting of the present invention.

EXEMPLIFICATION

[0292] Table 42 describes the Yarrowia lipolytica strains used in the following exemplification:

TABLE-US-00001 TABLE 42 Yarrowia lipolytica strains. NRRL Y-1095 Wild type diploid Derived from strain ATCC76861 MATB ura2-21 lyc1-5 LYS1-5B ATCC76982 MATB ade1 leu2-35 lyc1-5 xpr2 ATCC201249 MATA ura3-302 leu2-270 lys8-11 PEX17-HA MF346 MATA ura2-21 ATCC76861 .times. ATCC201249 MF350 MATB ura2-21 leu2-35 ade1 ATCC76982 .times. MF346 MF358 MATB adel ATCC76982 .times. MF346 MF454 MATB ura3-302 ade1 leu2-270 ATCC201249 .times. MF358

[0293] (The genotypes at LYC1, LYS1, XPR2, and PEX17 were not determined in crosses nor verified for ATCC strains.)

[0294] All basic molecular biology and DNA manipulation procedures described herein are generally performed according to Sambrook et al. or Ausubel et al. (Sambrook J, Fritsch E F, Maniatis T (eds). 1989. Molecular Cloning: A Laboratory Manual. Cold Spring Harbor Laboratory Press: New York; Ausubel F M, Brent R, Kingston R E, Moore D D, Seidman J G, Smith J A, Struhl K (eds). 1998. Current Protocols in Molecular Biology. Wiley: New York). The GPD1 and TEF1 promoters are from Y. lipolytica as is the XPR2 terminator.

[0295] Plasmids were generated for construction of CoQ10 producing strains. The following subparts describe production of plasmids encoding ubiquinogenic polypeptides. General plasmid construction for expression vectors are described in Table 43, and further constructions in the text. All PCR amplifications used NRRL Y-1095 genomic DNA as template unless otherwise specified. The URA5 gene described below is allelic with the ura2-21 auxotrophy above in Table 42.

TABLE-US-00002 TABLE 43 Plasmids Plasmid Backbone Insert Oligos or source pMB4529 pCR2.1 3.4 kb ADE1 PCR product MO4475 & MO4476 pMB4534 pCR2.1 2.1 kb LEU2 PCR product MO4477 & MO4478 pMB4535 pCR2.1 1.2 kb URA5 PCR product MO4471 & MO4472 pMB4589 pMB4535 (KpnI + SpeI) 1.2 kb GPD1 promoter (KpnI + MO4568 & NotI); 0.14 kb XPR2 MO4591; MO4566 & terminator (NotI + SpeI) MO4593 pMB4590 pMB4535 (KpnI + SpeI) 0.4 kb TEF1 promoter (KpnI + MO4571 & NotI); 0.14 kb XPR2 MO4592; MO4566 terminator (NotI + SpeI) & MO4593 pMB4597 pMB4534 (Acc65I + SpeI) GPD1 promoter & XPR2 From pMB4589 terminator (Acc65I + SpeI) pMB4603 pMB4597 (RsrII + MluI) Residual backbone From pMB4590 & TEF1 promoter (RsrII + MluI) pMB4616 pMB4529 (RsrII + SpeI) Residual backbone From pMB4589 & GPD1 promoter & XPR2 terminator (RsrII + SpeI) pMB4629 pMB4616 (RsrII + MluI) Residual backbone From pMB4590 & TEF1 promoter (RsrII + MluI) pMB4631 pMB4603 (KpnI + NheI) 1.2 kb GPD1 promoter MO4568 & MO4659 (KpnI + NheI); pMB4637 pMB4629 (NheI + MluI) 1.5 kb hmg1.sup.trunc ORF (XbaI + MO4657 & MO4658 MluI) pMB4662 pMB4631 (SpeI + XhoI) 1.8 kb URA3 fragment MO4684 & MO4685 (SpeI + BsaI pMB4691 pMB4662 (Acc65I + MluI) 0.4 kb TEF1 promoter From pMB4629 (Acc65I + MluI)

TABLE-US-00003 TABLE 44 Oligonucleotides referenced in Table 43 MO4471 5'CTGGGTGACCTGGAAGCCTT MO4472 5'AAGATCAATCCGTAGAAGTTCAG MO4475 5'AAGCGATTACAATCTTCCTTTGG MO4476 5'CCAGTCCATCAACTCAGTCTCA MO4477 5'GCATTGCTTATTACGAAGACTAC MO4478 5'CCACTGTCCTCCACTACAAACAC MO4566 5'CACAAACTAGTTTGCCACCTACAAGCCAGAT MO4568 5'CACAAGGTACCAATGTGAAAGTGCGCGTGAT MO4571 5'CACAAGGTACCAGAGACCGGGTTGGCGG MO4591 5'CACAAGCGGCCGCGCTAGCATGGGGATCGATCTCTTATAT MO4592 5'CACAAGCGGCCGCGCTAGCGAATGATTCTTATACTCAGAAG MO4593 5'CACAAGCGGCCGCACGCGTGCAATTAACAGATAGTTTGCC MO4659 5'CACAAGCTAGCTGGGGATGCGATCTCTTATATC MO4684 5'CATTCACTAGTGGTGTGTTCTGTGGAGCATTC MO4685 5'CACACGGTCTCATCGAGGTGTAGTGGTAGTGCAGTG MO4657 5'CACACTCTAGACACAAAAATGACCCAGTCTGTGAAGGTGG MO4658 5'CACACACGCGTACACCTATGACCGTATGCAAAT

Example 1

Biosynthesis of CoQ10 in Yarrowia lipolytica

[0296] Decaprenyl diphosphate synthases ("dpdpS") based on the amino acid sequence of those found in Silicibacter pomeroyi and in Loktanella vestfoldensis are expressed in Yarrowia lipolytica, using nucleotide sequences with the appropriate Yarrowia codon bias, and with Yarrowia mitochondrial targeting sequences. Insertion of the expression cassette results in the disruption of the native COQ1 gene, encoding nonaprenyl diphosphate synthase.

[0297] To construct the mitochondrial leader sequence encoding DNA of NADH:ubiquinone oxidoreductases (complex I) ["NUAM"], four oligos were annealed and treated sequentially with Klenow fragment and ligase:

TABLE-US-00004 MO4859: 5'-TCTAGACACAAAAATGCTCTCGAGAAACCTCAGCAAGTTTG MO4860: 5'-GTGGTTGCTGGCCGGATGAGACCGGCTCGAGCAAACTTGCT MO4861: 5'-CAGCAACCACATCCACACACACCCGACTATTCTCTGTCTCC MO4862: 5'-GGATCCCGTCTCGGACAGACGTCGGGCGGAGACAGAG.

[0298] The resulting fragment was subsequently amplified with MO4859 and MO4862 using Pfu polymerase and the product was phosphorylated and ligated to pBluescriptSKII.sup.- cut with EcoRV to create pMB4776. The mitochondrial leader sequence of NUAM is thereby encoded within the resultant XbaI-BsmBI (underline) fragment:

TABLE-US-00005 (NUAMLdr): tctagacacaaaaatgctctcgagaaacctcagcaagtttgctcgag ccggtctcatccggccagcaaccacatccacacacacccgactattc tctgtctccgcccgacgtctgtccgagacgggatcc

[0299] To construct the mitochondrial leader sequence encoding DNA of the native Y. lipolytica Coq1 protein, Y. lipolytica genomic DNA was amplified with MO4857 (5'-CGGATCCCGTCTCGGACATTTCTTGCGACACAATG) and MO4858 (5'-CTCTAGACACAAAAATGCTGAGAGTCGGACGAAT) and the 0.25 kb product was phosphorylated and ligated to pBluescriptSKII.sup.- cut with EcoRV to create pMB4750. The mitochondrial leader sequence of Coq1 is thereby encoded within the resultant XbaI-BsmBI (underline) fragment:

TABLE-US-00006 (Coq1ldr): ctctagacacaaaaatgctgagagtcggacgaattggcaccaagacc ctagccagcagcagcctgcgtttcgtggcaggtgctcggcccaaatc cacgctcaccgaggccgtgctggagaccacagggctgctgaaaacca cgccccaaaaccccgagtggtctggagccgtcaagcaggcatctcgt ctggtggagaccgacactccgatccgagacccgttttccattgtgtc gcaagaaatgtccgagacgggatccg.

[0300] To construct a cassette encoding the decaprenyl diphosphate synthase (dpdpS) from Silicibacter pomeroyi, the following sequence was synthesized de novo:

TABLE-US-00007 (Sp-dpdpS): GGATCCCGTCTCTTGTCTGATGCCAAAGTCTCTACAAAGCCTCACGAGATGCTCGCTGCCACACT CTCTCAGGAAATGGCCGCTGTCAACGCCCTTATTCGAACCCGTATGGCTTCTGAACACGCCCCTA GAATCCCCGAAGTTACAGCCCACCTCGTCGAAGCCGGCGGAAAACGATTGCGACCTATGCTGAC ACTTGCTGCTGCCCGACTGTGTGGATACCAGGGCGAAGATCATGTCAAACTGGCTGCCACTGTT GAATTTATTCACACTGCTACACTTTTGCACGATGATGTCGTGGACGAGTCTGGACAAAGACGTG GACGACCTACTGCTAATCTTCTTTGGGATAACAAGTCCTCTGTTCTTGTGGGCGACTATCTTTTTG CACGATCGTTCCAGCTGATGGTCGAAACCGGTTCCCTTCGAGTGCTTGACATCCTCGCTAACGCT GCCGCGACAATTGCCGAAGGTGAAGTGCTCCAGATGACCGCTGCTTCCGATCTTAGAACTGATG AATCCGTTTACCTTCAGGTCGTTCGAGGTAAAACTGCCGCTCTTTTTTCTGCTGCTACTGAAGTTG GTGGTGTTATTGCCGGAGTCCCCGAAGCCCAAGTTCGAGCACTTTTTGAATACGGTGATGCGCTG GGAATTGCATTTCAGATTGCTGATGACCTCCTCGACTACCAAGGTGATGCTAAGGCCACTGGAA AGAATGTTGGAGATGACTTCAGAGAAAGAAAACTCACATTGCCTGTTATTAAAGCCGTCGCTCA AGCTACTGATGAGGAACGAGCGTTTTGGGTTAGAACTATTGAAAAGGGAAAGCAAGCTGAAGG AGATCTTGAACAGGCTCTCGCTCTCATGGAAAAATACGGAACACTTGCCGCAACCAGAGCCGAT GCGCATGCTTGGGCCGAAAAAGCACGAACCGCCCTCGAACTGTTGCCGAATCACGAAATCAGAA CAATGCTCTCCGACCTCGCCGATTATGTCGTTGCTAGATTGTCTTAAACGCGT.

[0301] To construct a cassette encoding the decaprenyl diphosphate synthase (dpdpS) from Loktanella vestfoldensis, the following sequence was synthesized de novo:

TABLE-US-00008 (Lv-dpdpS): GGATCCCGTCTCTTGTCTCTTGATCACGCTGCCACCAAACCTCATGAACAACTTGCCGCCGCCCT TGCCGACGACCTTACCGCAGTTAACGCTATGATTAGAGACCGAATGGCCAGCGAACATGCCCCC AGAATTCCACAAGTTACCGCACACCTCGTTGAAGCCGGAGGTAAAAGACTTCGTCCAATGCTGA CCCTTGCCGCCGCCCGTATGTGTGGCTACGACGGACCTTACCACATCCACCTTGCAGCCACCGTT GAGTTCATTCACACCGCCACCTTGCTCCATGACGATGTTGTTGATGAGTCCTCCCAACGTCGTGG TCGACCTACCGCTAATCTTCTGTGGGATAATACCTCTTCTGTTTTGGTTGGAGACTACCTCTTTGC ACGATCTTTTCAGTTGATGGTTGAGACCGGATCGCTTCGAGTTCTTGACATCCTGGCCAATGCTT CTGCTACCATCGCCGAAGGTGAGGTTCTGCAACTTACCGCCGCCGCTGACCTTGCAACTACCGA GGACATTTACATCAAAGTTGTTCGAGGAAAGACTGCTGCACTCTTTTCTGCTGCTATGGAAGTTG GTGGAGAAATCGCCGGTCAGGACCCGGCTATTAAACAGGCTCTCTTTGACTACGGCGACGCTCT CGGAATTTCTTTTCAGATTGTTGATGACCTGTTGGATTACGGTGGAACAAAAGCTACAGGAAAG AACGTTGGTGATGACTTCCGAGAAAGAAAGCTCACCCTCCCTGTTATTCGTGCCGTTGCTGCTGC TGATGCTGATGAAAGAGCTTTTTGGGAACGAACAATTCAAAAAGGAAGACAACAAGATGGAGA CCTGGACCATGCAATTGCCCTTCTTCATAGACACGGAACCCTTGAATCCACACGACAAGACGCA ATCCTTTGGGCCGCTAAAGCTAAAGAAGCACTCGGAGTTATCCCGGACCATGCTCTTAAAACCA TGCTCGTTGACCTTGCTGACTACGTTGTTGCTCGACTCACCTAAACGCGT.

[0302] A NUAMldr-Sp-dpdpS fusion capable of being expressed in Y. lipolytica was constructed by ligating the XbaI-BsmBI NUAMldr fragment together with the BsmBI-MluI Sp-dpdpS fragment into pMB4691 cut with NheI and MluI, to create pMB4799.

[0303] A NUAMldr-Lv-dpdpS fusion capable of being expressed in Y. lipolytica was constructed by ligating the XbaI-BsmBI NUAMldr fragment together with the BsmBI-MluI Lv-dpdpS fragment into pMB4691 cut with NheI and MluI, to create pMB4797.

[0304] A Coq1ldr-Sp-dpdpS fusion capable of being expressed in Y. lipolytica was constructed by ligating the XbaI-BsmBI Coq1ldr fragment together with the BsmBI-MluI Sp-dpdpS fragment into pMB4691 cut with NheI and MluI, to create pMB4798.

[0305] A Coq1ldr-Lv-dpdpS fusion capable of being expressed in Y. lipolytica was constructed by ligating the XbaI-BsmBI Coq1ldr fragment together with the BsmBI-MluI Lv-dpdpS fragment into pMB4691 cut with NheI and MluI, to create pMB4796.

[0306] To effect the disruption of the native COQ1 gene and its replacement by the fusions described above, Y. lipolytica genomic DNA was amplified with two pairs of primers:

TABLE-US-00009 5'- CACACGGTACCGGTATAGGCACAAGTGC [MO4848] with 5'- CACTCGGATCCGTACGTCTGTGGTGCTGTGGT [MO4856], and 5'- CACACGGATCCGCTAGCTCTGAGAAACCTCACCATG [MO4850] with 5'- CACACTCTAGATTTTCTTGGCCATGAACGGT [MO4851],

yielding a fragment of approximately 0.7 kb and 0.85 kb in length, respectively. Fragment MO4848/4856 was cleaved with Acc65I and BamHI, fragment MO4850/4851 was cleaved with BamHI and XbaI, and pBluescriptSKII.sup.- was cleaved with XbaI and Acc65I. These three segments were ligated together to yield plasmid pMB4747.

[0307] The 3.6 (or 3.7) kb Acc65I-XbaI fragments from pMB4796, pMB4797, pMB4798, and pMB4799 containing various tef-dpdpS constructs and the URA3 marker were each ligated to pMB4747 cleaved with BsiWI and NheI. The resulting plasmids, pMB4800 (Coq1ldr-Lv-dpdpS), pMB4801 (NUAMldr-Lv-dpdpS), pMB4802 (Coq1ldr-Sp-dpdpS), pMB4803 (NUAMldr-Sp-dpdpS), are cleaved with Acc65I and XbaI and the 5.1 (or 5.3) kb fragment containing various coq1-tef1p-dpdpS-URA3-coq1 sequences is used to transform Y. lipolytica strain MF454 (MATB ura3 leu2 ade1) to uracil prototrophy. Transformants were checked by PCR for confirmation of homologous integration based gene replacement which confirmed the presence of the heterologous dpdpS and the absence of the wild type Y. lypolytica COQ1 allele. The strains were designated MF975 (MATB ura3 leu2 ade1 coq1.DELTA.NUAMldr-Lv-dpdpS-URA3), MFL976 (MATB ura3 leu2 ade1 coq1.DELTA.NUAMldr-Lv-dpdpS-URA3) as duplicate and MF977 (MATB ura3 leu2 ade1 coq1.DELTA.NUAMldr-Sp-dpdpS-URA3). Strains were analyzed for ubiquinone by TLC and HPLC analysis as described in subsequent examples.

Example 2

Increased CoQ10 Production Resulting from Enhanced PHB-Polyprenyltransferase Activity

[0308] 4-hydroxybenzoate polyprenyl transferases ("phbPPt") based on the amino acid sequence of those found in Silicibacter pomeroyi and in Bos taurus are expressed in Yarrowia lipolytica, using nucleotide sequences with the appropriate Yarrowia codon bias, and in the case of the S. pomeroyi sequence, with Yarrowia mitochondrial targeting sequences.

[0309] The following sequences are synthesized de novo:

TABLE-US-00010 G: Yarrowia lipolytica Coq1 signal sequence. + Silicibacter pomeroyi phbPPt 5'- TTCTAGACACAAAAATGCTGCGAGTGGGCCGAATCGGCACCAAGACCCTGGCCTCTTCTTC TCTGCGATTCGTGGCCGGCGCCCGACCCAAGTCTACCCTGACCGAGGCCGTGCTGGAGAC CACCGGCCTGCTGAAGACCACCCCCCAGAACCCCGAGTGGTCTGGCGCCGTGAAGCAGGC CTCTCGACTGGTGGAGACCGACACCCCCATCCGAGACCCCTTCTCTATCGTGTCTCAGGAG ATGCAGGGCCAGGCCCCCACCCCCGACGGCCAGGTGGCCGACGCCGTGACCGGCAACTGGGTG GACATCCACGCCCCCGCCTGGTCTCGACCCTACCTGCGACTGTCTCGAGCCGACCGACCCATCGG CACCTGGCTGCTGCTGATCCCCTGTTGGTGGGGCCTGGCCCTGGCCATGCTGGACGGCCAGGAC GCCCGATGGGGCGACCTGTGGATCGCCCTGGGCTGTGCCATCGGCGCCTTCCTGATGCGAGGCG CCGGCTGTACCTGGAACGACATCACCGACCGAGAGTTCGACGGCCGAGTGGAGCGAACCCGATC TCGACCCATCCCCTCTGGCCAGGTGTCTGTGCGAATGGCCGTGGTGTGGATGATCGCCCAGGCCC TGCTGGCCCTGATGATCCTGCTGACCTTCAACCGAATGGCCATCGCCATGGGCGTGCTGTCTCTG CTGCCCGTGGCCGTGTACCCCTTCGCCAAGCGATTCACCTGGTGGCCCCAGGTGTTCCTGGGCCT GGCCTTCAACTGGGGCGCCCTGCTGGCCTGGACCGCCCACTCTGGCTCTCTGGGCTGGGGCGCCC TGTTCCTGTACCTGGCCGGCATCGCCTGGACCCTGTTCTACGACACCATCTACGCCCACCAGGAC ACCGAGGACGACGCCCTGATCGGCGTGAAGTCTACCGCCCGACTGTTCGGCGCCCAGACCCCCC GATGGATGTCTTACTTCCTGGTGGCCACCGTGTCTCTGATGGGCATCGCCGTGTTCGAGGCCGCC CTGCCCGACGCCTCTATCCTGGCCCTGGTGCTGGCCCTGGCCGGCCCCTGGGCCATGGGCTGGCA CATGGCCTGGCAGCTGCGAGGCCTGGACCTGGACGACAACGGCAAGCTGCTGCAGCTGTTCCGA GTGAACCGAGACACCGGCCTGATCCCCCTGATCTTCTTCGTGATCGCCCTGTTCGCCTAACGCGT H: Bos taurus phpPPt 5'- TTCTAGACACAAAAATGCTGGGCTCTTGTGGCGCCGGCCTGGTGCGAGGCCTGCGAGCCGAGAC CCAGGCCTGGCTGTGGGGCACCCGAGGCCGATCTCTGGCCCTGGTGCACGCCGCCCGAGGCCTG CACGCCGCCAACTGGCAGCCCTCTCCCGGCCAGGGCCCCCGAGGCCGACCCCTGTCTCTGTCTGC CGCCGCCGTGGTGAACTCTGCCCCCCGACCCCTGCAGCCCTACCTGCGACTGATGCGACTGGAC AAGCCCATCGGCACCTGGCTGCTGTACCTGCCCTGTACCTGGTCTATCGGCCTGGCCGCCGACCC CGGCTGTCTGCCCGACTGGTACATGCTGTCTCTGTTCGGCACCGGCGCCGTGCTGATGCGAGGCG CCGGCTGTACCATCAACGACATGTGGGACCGAGACTACGACAAGAAGGTGACCCGAACCGCCTC TCGACCCATCGCCGCCGGCGACATCTCTACCTTCCGATCTTTCGTGTTCCTGGGCGGCCAGCTGA CCCTGGCCCTGGGCGTGCTGCTGTGTCTGAACTACTACTCTATCGCCCTGGGCGCCGCCTCTCTG CTGCTGGTGACCACCTACCCCCTGATGAAGCGAATCACCTACTGGCCCCAGCTGGCCCTGGGCCT GACCTTCAACTGGGGCGCCCTGCTGGGCTGGTCTGCCGTGAAGGGCTCTTGTGACCCCTCTGTGT GTCTGCCCCTGTACTTCTCTGGCATCATGTGGACCCTGATCTACGACACCATCTACGCCCACCAG GACAAGAAGGACGACGCCCTGATCGGCCTGAAGTCTACCGCCCTGCTGTTCCGAGAGGACACCA AGAAGTGGCTGTCTGGCTTCTCTGTGGCCATGCTGGGCGCCCTGTCTCTGGTGGGCGTGAACTCT GGCCAGACCATGCCCTACTACACCGCCCTGGCCGCCGTGGGCGCCCACCTGGCCCACCAGATCT ACACCCTGGACATCAACCGACCCGAGGACTGTTGGGAGAAGTTCACCTCTAACCGAACCATCGG CCTGATCATCTTCCTGGGCATCGTGCTGGGCAACCTGTGTAAGGCCAAGGAGACCGACAAGACC CGAAAGAACATCGAGAACCGAATGGAGAACTAACGCGT

[0310] The above sequences are cleaved with MluI and XbaI, and inserted into NheI- and MluI-cleaved pMB4603, placing the genes under the control of the Y. lipolytica TEF1 promoter, and adjacent to a functional Y. lipolytica LEU2 gene. These plasmids are designated tef-transS (from sequence G) and tef-transB (from sequence H).

[0311] These plasmids can be used to transform Y. lipolytica strain MF454, ML975, MF976, or MF977 to leucine prototrophy by cleaving with enzymes that lie within plasmid backbone sequences to favor nonhomologous integration events. Transformants can be checked by Southern analysis for confirmation of ectopic integration. Production of CoQ10 can be assessed using HPLC or TLC analysis.

Example 3

Extraction of Ubiquinone from Yarrowia Lipolytica Cells

[0312] Shake-flask testing was conducted using 20 ml cultures containing YPD medium (1% yeast extract, 2% peptone, 2% glucose) in 125 ml flasks grown at 30.degree. C. Y. lipolytica cells were harvested from 24 hour cultures, and extractions are performed to determine ubiquinone form and quantity. Entire culture was pelleted and washed with 50 ml H.sub.2O. Pelleted cells may be frozen at -80.degree. C. and stored. 0.5 ml of cubic zirconium beads was added to cell pellets, along with 1 ml ice-cold extraction solvent (a 50/50 v/v mix of hexane and ethyl acetate containing 0.01% butylhydroxytoluene (BHT)). The mixture was then agitated (Mini-BeadBeater-8, BioSpec Products, Inc.) at maximum speed for 5 minutes at 4.degree. C. The mixture was then spun at maximum speed for 2 minutes, and the supernatant collected and deposited in a cold 16 ml glass vial. The remaining cell debris was re-extracted at least three times, without the addition of zirconium beads; all supernatants were pooled in the 16 ml glass vial. A Speed Vac was used to concentrate the supernatant (room temperature in dark), and the samples were stored at -20.degree. C. or -80.degree. C. until immediately before TLC and HPLC analysis. Prior to TLC analysis, the samples were resuspended in 50 .mu.l ice-cold solvent and then transferred to a cold amber vial. In an alternative similar protocol, Y. lipolytica cells can be harvested from 72-96 hour 20 ml cultures grown in 125 ml flasks grown at 30.degree. C. 1.8 ml of culture is placed into a pre-weighed tube with a hole poked in the top. Cells are pelleted and washed with 1 ml H.sub.2O. The washing procedure is repeated and then cells are lyophilized overnight. After drying to completion, the tube is weighed in order to calculate dry cell weight. 1 ml from the same shake flask culture is placed into a screw-cap tube for ubiquinone extraction. Cells are pelleted and washed once with 1 ml H.sub.2O and once with 1 M potassium phosphate solution, pH 7.1. Pelleted cells may be frozen at -80.degree. C. and stored. Cubic zirconium beads addition and extraction is performed as describe above. Following extraction, the glass vial is spun for 5 minutes at 2000 rpm at 4.degree. C. in a Sorvall tabletop centrifuge, and the supernatant is transferred to a new cold 16 ml glass vial. A Speed Vac is used to concentrate the supernatant (room temperature in dark), and the samples are stored at -20.degree. C. or -80.degree. C. until immediately before HPLC analysis. Prior to HPLC analysis, the samples are resuspended in 1 ml ice-cold solvent and then transferred to a cold amber vial. Throughout the extraction protocols, care is taken to avoid contact with oxygen, light, heat, and acids.

Example 4a

Purification of Ubiquinones from Crude Extract by Thin Layer Chromatography (TLC)

[0313] Ubiquinone containing samples from the primary extraction in example 3 were spotted to immobilized silica gel plates next to the primary standards (Coenzyme Q9 (Sigma 275970) and Coenzyme Q10 (Sigma 27595)) and resolved in a 80:40:1 hexanes:diethyl ether:acetic acid solvent mixture. Ubiquinone containing bands were identified by UV shadowing and migration with the purified ubiquinone standard and excised from the TLC plate. Samples were extracted from the silica gel with 50 .mu.l acetone followed by mixing, centrifugation, decanting and drying before HPLC analysis.

Example 4b

Quantification of Ubiquinone Production by HPLC

[0314] Ubiquinone-containing samples are resuspended in ice-cold extraction solvent (a 50/50 v/v mix of hexane and ethyl acetate containing 0.01% butylhydroxytoluene (BHT)). An Alliance 2795 HPLC (Waters) equipped with a Waters XBridge C18 column (3.5 .mu.m, 2.1.times.50 mm) and Thermo Basic 8 guard column (2.1.times.10 mm) is used to resolve ubiquinone at 25.degree. C.; authentic ubiquinone samples are used as standards. Ubiquinone molecules were detected at 275 nm (alternatively, they can also be detected at 405 nm), and retention times include CoQ10 (4.272 min) and CoQ9 (4.164 min) The mobile phases and flow rates are shown in Table 45 (Solvent A=Ethyl Acetate; Solvent B=Water; Solvent C=Methanol; Solvent D=Acetonitrile). The injection volume was 10 .mu.L. The detector was a Waters 996 photodiode array detector. A peak corresponding to Coenzyme Q10 was observed in strains MF975, MF976, and MF977.

[0315] The retention times for additional lipophilic molecules include astaxanthin (1.159 min), zeaxanthin (1.335), .beta.-apo-8'-carotenal (2.86 min), ergosterol (3.11 min), lycopene (3.69 min), 13-Carotene (4.02 min), and phytoene (4.13 min) Astaxanthin, zeaxanthin, .beta.-apo-8'-carotenal, lycopene and .beta.-Carotene are detected at 475 nm, whereas ergosterol and phytoene are detected at 286 nm.

TABLE-US-00011 TABLE 45 Mobile Phases and Flow Rates for Ubiquinone Resolution Time (min) Flow (mL/min) % A % B % C % D Curve 0.50 0.0 20.0 0.0 80.0 3.00 1.00 20.0 0.0 0.0 80.0 6 4.50 1.00 80.0 0.0 20.0 0.0 6 5.50 1.00 0.0 0.0 60.0 40.0 6 6.50 1.00 0.0 0.0 80.0 20.0 6 7.50 1.00 0.0 0.0 100.0 0.0 6 8.50 1.00 0.0 0.0 100.0 0.0 6 9.50 1.00 0.0 20.0 0.0 80.0 6 10.50 0.50 0.0 20.0 0.0 80.0 6

Example 5

Engineering of PHB Biosynthetic Pathway Genes Results in Increased Para-Hydroxybenzoic Acid Production

[0316] Production of para-hydroxybenzoic acid (PHB) is increased by overexpression of a heterologous 3-deoxy-D-arabino-heptulosonate-7-phosphate (DAHP) synthase and a heterologous chorismate lyase (UbiC). Both genes are codon-optimized variants of the Erwinia carotovora coding sequences. The DAHP synthase is based on the amino acid sequence of that found in Erwinia is expressed in Yarrowia lipolytica, using nucleotide sequences with the appropriate Yarrowia codon bias, and containing a P148L substitution. The P 148L substitution can be employed to alleviate tyrosine-mediated allosteric regulation of the Erwinia DAHP synthase. The following sequences are synthesized:

TABLE-US-00012 J: DAHP E. carotovora (optimized) (AroF with P148L substitution) 5'- TTCTAGACACAAAAATGCAGAAGGACTCTCTGAACAACATCAACATCTCTGCCGAGCAGGTGCT GATCACCCCCGACGAGCTGAAGGCCAAGTTCCCCCTGAACGACGTGGAGCAGCGAGACATCGCC CAGGCCCGAGCCACCATCGCCGACATCATCCACGGCCGAGACGACCGACTGCTGATCGTGTGTG GCCCCTGTTCTATCCACGACACCGACGCCGCCCTGGAGTACGCCCGACGACTGCAGCTGCTGGC CGCCGAGCTGAACGACCGACTGTACATCGTGATGCGAGTGTACTTCGAGAAGCCCCGAACCACC GTGGGCTGGAAGGGCCTGATCAACGACCCCTTCATGGACGGCTCTTTCGACGTGGAGTCTGGCC TGCACATCGCCCGAGGCCTGCTGCTGCAGCTGGTGAACATGGGCCTGCCCCTGGCCACCGAGGC CCTGGACCTGAACTCTCCCCAGTACCTGGGCGACCTGTTCTCTTGGTCTGCCATCGGCGCCCGAA CCACCGAGTCTCAGACCCACCGAGAGATGGCCTCTGGCCTGTCTATGCCCGTGGGCTTCAAGAA CGGCACCGACGGCTCTCTGGGCACCGCCATCAACGCCATGCGAGCCGCCGCCATGCCCCACCGA TTCGTGGGCATCAACCAGACCGGCCAGGTGTGTCTGCTGCAGACCCAGGGCAACGGCGACGGCC ACGTGATCCTGCGAGGCGGCAAGACCCCCAACTACTCTGCCCAGGACGTGGCCGAGTGTGAGAA GCAGATGCAGGAGGCCGGCCTGCGACCCGCCCTGATGATCGACTGTTCTCACGGCAACTCTAAC AAGGACTACCGACGACAGCCCCTGGTGGTGGAGTCTGCCATCGAGCAGATCAAGGCCGGCAAC CGATCTATCATCGGCCTGATGCTGGAGTCTCACCTGAACGAGGGCTCTCAGTCTTCTGAGCAGCC CCGATCTGACATGCGATACGGCGTGTCTGTGACCGACGCCTGTATCTCTTGGGAGTCTACCGAGA CCCTGCTGCGATCTGTGCACCAGGACCTGTCTGCCGCCCGAGTGAAGCACTCTGGCGAGTAACG CGT K: Chorismate lyase E. carotovora (optimized) (UbiC) 5'- TTCTAGACACAAAAATGTCTGACGACGCCTCTACCCTGCTGCGAACCATCTCTTGGTTCACCGAG CCCCCCTCTGTGCTGCCCGAGCACATCGGCGACTGGCTGATGGAGACCTCTTCTATGACCCAGCG ACTGGAGAAGTACTGTGCCCAGCTGCGAGTGACCCTGTGTCGAGAGGGCTTCATCACCCCCCAG ATGCTGGGCGAGGAGCGAGACCAGCTGCCCGCCGACGAGCGATACTGGCTGCGAGAGGTGGTG CTGTACGGCGACGACCGACCCTGGCTGTTCGGCCGAACCATCGTGCCCCAGCAGACCCTGGAGG GCTCTGGCGCCGCCCTGACCAAGATCGGCAACCAGCCCCTGGGCCGATACCTGTTCGAGCAGAA GTCTCTGACCCGAGACTACATCCACACCGGCTGTTGTGAGGGCCTGTGGGCCCGACGATCTCGA CTGTGTCTGTCTGGCCACCCCCTGCTGCTGACCGAGCTGTTCCTGCCCGAGTCTCCCGTGTACTA CACCCCCGGCGACGAGGGCTGGCAGGTGATCTAACGCGT

[0317] The above sequences are cleaved with MluI and XbaI. Sequence J is inserted into NheI- and MluI-cleaved pMB4603, and sequence K is inserted into NheI- and MluI-cleaved pMB4691, placing the genes under the control of the Y. lipolytica TEF1 promoter, and adjacent to a functional Y. lipolytica LEU2 (J) or URA3 (K) gene. These plasmids are designated tef-aroF P148L (from sequence J) and tef-ubiC (from sequence K). They can be integrated sequentially, by cleavage of tef-aroF P148L with SspI and of tef-ubiC with SalI, into the genome of Y. lipolytica strain ATCC201249 (MATA ura3 leu2 lys8), and screened for confirmation of homologous integration as well as for increased PHB and ubiquinone production.

Example 6

Production of CoQ10 in Yarrowia lipolytica Strains Harboring Multiple Exogenous Ubiquinogenic pathway genes

[0318] The PHB overproducing strain produced in Example 5 is mated with CoQ10-producing strains described in Example 1 or Example 2, and diploids are selected on minimal medium. Upon meiosis and sporulation, haploid recombinants may be selected that harbor all four heterologous genes: AroF P148L, ubiC, dpdpS, and phbPPt. Genotypes of selected segregants can be confirmed by Southern blotting and polymerase chain reaction (PCR) analysis. Furthermore, increased production of CoQ10 in the selected recombinants is assessed as described in prior examples. In addition, segregants can be chosen that also harbor an ade1 auxotrophy to facilitate further manipulation.

Example 7

Decreasing Carbon Flow to the Competing Sterol Pathway Results in Enhanced Ubiquinone Production

[0319] In order to partially inactivate the ERG9 gene encoding squalene synthase, the neighboring FOL3 gene is disrupted, resulting in a folinic acid requirement. This strain is then transformed with a mutagenized fragment of DNA partially spanning the two genes, and Fol.sup.+ transformants are screened for decreased squalene synthase activity.

[0320] The following oligonucleotides are synthesized:

TABLE-US-00013 5'-CCTTCTAGTCGTACGTAGTCAGC [L]; 5'-CCACTGATCTAGAATCTCTTTCTGG [M].

[0321] They are used to amplify a 2.3 kb fragment from Y. lipolytica genomic DNA spanning most of the FOL3 gene, using Pfu polymerase. The fragment is cleaved with XbaI and phosphorylated, and ligated into pBluescriptSK.sup.- (GenBank accession number X52330) that has been cleaved with KpnI, treated with T4 DNA polymerase (T4pol) in the presence of dNTPs, and subsequently cleaved with XbaI. The resultant plasmid, designated BS-fol3, is then cleaved with Acc65I and EcoRI, treated with T4pol as above, and ligated to the 3.4 kb EcoRV-SpeI ADE1 fragment (treated with T4pol) from pMB4529.

[0322] The resulting plasmid, pBSfol3.DELTA.ade, can be cleaved with BsiWI and XbaI to liberate a 5.5 kb fragment that is used to transform the ade1 CoQ10-overproducing strains described above to adenine prototrophy. Ade.sup.+ transformants are screened for a folinic acid requirement, and for homologous integration by PCR analysis.

[0323] Strains that harbor the resultant fol3.DELTA.ADE1 allele can be transformed with a 3.5 kb DNA fragment generated by mutagenic PCR amplification using the primers:

TABLE-US-00014 5'-GGCTCATTGCGCATGCTAACATCG [N]; 5'-CGACGATGCTATGAGCTTCTAGACG [P],

and Y. lipolytica genomic DNA as template. This fragment contains the N-terminal three-quarters of the FOL3 ORF and the C-terminal nine-tenths of the ERG9 ORF. Fol.sup.+ Ade.sup.- transformants are screened for decreased squalene synthase activity by sensitivity to agents such as zaragozic acid, itraconazole, or fluconazole, or by resistance to reactive oxygen species-generating agents such as antimycin.

[0324] In addition, the above fragment could be cloned, and altered in such a way as to remove the 3'-untranslated region of ERG9. Replacement of the fol3.DELTA.ADE1 disruption by this fragment results in decreased expression of squalene synthase [Schuldiner et al. (2005), Cell 123:507-519][Muhlrad and Parker (1999), RNA 5:1299-1307], which can be confirmed as described above. This approach may also be used in a Fol.sup.+ Ade.sup.- strain, using the ADE1 marker to disrupt the ERG9 3'-UTR.

[0325] Partially defective ERG9 alleles can also be identified in S. cerevisiae using plasmid shuffling techniques [Boeke et al. (1987), Methods Enzymol. 154:164-175], and using drug sensitivities as a phenotype. Defective genes can be transferred to Y. lipolytica using standard molecular genetic techniques.

Example 8

Expression of a Truncated Form of HMG-CoA Reductase to Elevate CoQ10 Production

[0326] In order to further increase CoQ10 production, carbon flow through the isoprenoid pathway is enhanced by introducing a truncated variant of the Y. lipolytica HMG-CoA reductase gene, which also encodes a 77 amino acid leader sequence derived from S. cerevisiae Hmg1.

[0327] The following oligonucleotides are synthesized:

TABLE-US-00015 5'-TTCTAGACACAAAAATGGCTGCAGACCAATTGGTGA [Q]; 5'-CATTAATTCTTCTAAAGGACGTATTTTCTTATC [R]; 5'-GTTCTCTGGACGACCTAGAGG [S]; 5'-CACACACGCGTACACCTATGACCGTATGCAAAT [T].

Q and R are used to amplify a 0.23 kb fragment encoding Met-Ala followed by residues 530 to 604 of the Hmg1 protein of S. cerevisiae, using genomic DNA as template. S and T are used to amplify a 1.4 kb fragment encoding the C-terminal 448 residues of the Hmg1 protein of Y. lipolytica, using genomic DNA as template. These fragments are ligated to the appropriate cloning vector, and the resultant plasmids, designated pQR and pST, are verified by sequencing. The QR fragment is liberated with XbaI and AseI, and the ST fragment is liberated with MaeI and MluI. These fragments are then ligated to the ADE1 tef1p expression vector pMB4629 cut with NheI and MluI.

[0328] The resulting plasmid, pTefHMG, can be cleaved with SnaBI, BbvCI, or Bsu36I to direct integration at the ade1 locus, or with BamHI to direct integration at the HMG1 locus, of the CoQ10-producing strains described above, restoring them to adenine prototrophy. Ade transformants are screened for increased CoQ10 production.

[0329] Alternatively, the native HMG1 gene from Y. lipolytica may be modified through truncation, and without S. cerevisiae sequences, by amplifying Y. lipolytica genomic DNA with primer T above and

TABLE-US-00016 T2 5'-CACACTCTAGACACAAAAATGACCCAGTCTGTGAAGGTGG

[0330] yielding a 1.5 kb fragment that is cleaved with XbaI and MluI and ligated to pMB4629 cut with NheI and MluI to create pMB4637. pMB4637 may be cut with SnaBI, Bsu36I, or BbvCI to direct integration at the ade1 locus, or with BamHI to direct integration at the HMG1 locus, of the CoQ10-producing strains described above, restoring them to adenine prototrophy. Ade transformants are screened for increased CoQ10 production.

Example 9

Constructing an Oleaginous Strain of Saccharomyces cerevisiae

[0331] Genes encoding the two subunits of ATP-citrate lyase from N. crassa, the AMP deaminase from Saccharomyces cerevisiae, and the cytosolic malic enzyme from M. circinelloides are overexpressed in S. cereviseae strains in order to increase the total lipid content. Similar approaches to enhance lipid production could be employed in other host organisms such as Xanthophyllomyces dendrorhous (Phaffia rhodozyma), using the same, homologous, or functionally similar oleaginic polypeptides.

[0332] Qiagen RNAEasy kits (Qiagen, Valencia, Calif.) are used to prepare messenger RNA from lyophilized biomass prepared from cultures of N. crassa. Subsequently, RT-PCR is performed in two reactions containing the mRNA template and either of the following primer pairs.

TABLE-US-00017 acl1: 1fwd: 5' CACACGGATCCTATAatgccttccgcaacgaccg 1rev: 5' CACACACTAGttaaatttggacctcaacacgaccc acl2: 2fwd: 5' CACACGGATCCAATATAAatgtctgcgaagagcatcctcg 2rev: 5' CACACGCATGCttaagcttggaactccaccgcac

[0333] The resulting fragment from the acl1 reaction is cleaved with SpeI and BamHI, and that from the acl2 reaction is cleaved with BamHI and SphI, and both are ligated together into YEp24 that has been digested with NheI and SphI, creating the plasmid "p12". The bi-directional GAL1-10 promoter is amplified from S. cerevisiae genomic DNA using the primers.

TABLE-US-00018 gal10: 5' CACACGGATCCaattttcaaaaattcttactttttttttggatggac gal1: 5' CACACGGATCCttttttctccttgacgttaaagtatagagg,

and the resulting 0.67 kb fragment is cleaved with BamHI and ligated in either orientation to BamHI-digested "p12" to create "p1gal2" and "p2gal1", containing GALL-acl1/GAL10-acl2 and GAL10-acl1/GAL1-acl2, respectively (Genbank accession: acl1: CAB91740.2; acl2: CAB91741.2).

[0334] In order to amplify the S. cereviseae gene encoding AMP deaminase and a promoter suitable for expressing this gene, S. cerevisiae genomic DNA is amplified using two primer pairs in separate reactions:

TABLE-US-00019 AMD1 ORF: AMD1FWD: 5' CACACGAGCTCAAAAatggacaatcaggctacacagag AMD1rev: 5' CACACCCTAGGtcacttttcttcaatggttctcttgaaattg GAL7p: gal7prox: 5' CACACGAGCTCggaatattcaactgtttttttttatcatgttgatg gal7dist: 5' CACACGGAtccttcttgaaaatatgcactctatatcttttag,

The resulting fragment from the AMD1 reaction (2.4 kb) is cleaved with Sad and AvrII, and that from the GAL7 reaction (0.7 kb) is cleaved with BamHI and SphI, and both are ligated together into YEp13 that has been digested with NheI and BamHI, creating the plasmid "pAMPD". This plasmid carries the S. cerevisiae gene, AMD1, encoding AMP deaminase, under the control of the galactose-inducible GAL7 promoter.

[0335] Messenger RNA is prepared from lyophilized biomass of M. circinelloides, as described above, and the mRNA template is used in a RT-PCR reaction with two primers:

TABLE-US-00020 MAEfwd: 5' CACACGCTAGCTACAAAatgttgtcactcaaacgcatagcaac MAErev: 5' CACACGTCGACttaatgatctcggtatacgagaggaac,

and the resulting fragment is cleaved with NheI and SalI, and ligated to XbaI- and XhoI-digested pRS413TEF (Mumberg, D. et al. (1995) Gene, 156:119-122), creating the plasmid "pTEFMAE", which contains sequences encoding the cytosolic NADP.sup.+-dependant malic enzyme from M. circinelloides (E.C. 1.1.1.40; mce gene; Genbank accession: AY209191) under the control of the constitutive TEF1 promoter.

[0336] The plasmids "p1gal2", "pAMPD", and "pTEFMAE" are sequentially transformed into a strain of S. cereviseae to restore prototrophy for uracil ("p1gal2"), leucine ("pAMPD"), and histidine ("pTEFMAE") (Guthrie and Fink Methods in Enzymology 194:1-933, 1991). The resulting transformants are tested for total lipid content following shake flask testing (e.g., 20 ml cultures in 125 ml flasks grown at 30.degree. C. for 72-96 hour cultures) in either synthetic complete (SC) medium lacking uracil, leucine and histidine or in a 2-step fermentation process. In the 2-step process, 1.5 ml of cells from an overnight 2 ml roll tube culture containing SC medium lacking uracil, leucine and histidine are centrifuged, washed in distilled water, and resuspended in 20 ml of a nitrogen-limiting medium suitable for lipid accumulation (30 g/L glucose, 1.5 g/L yeast extract, 0.5 g/L NH.sub.4C1, 7 g/L KH.sub.2PO.sub.4, 5 g/L Na.sub.2HPO.sub.4-12H.sub.2O, 1.5 g/L MgSO.sub.4.7H.sub.2O, 0.08 g/L FeCl.sub.3-6H.sub.2O, 0.01 g/L ZnSO.sub.4.7H.sub.2O, 0.1 g/L CaCl.sub.2-2H.sub.2O, 0.1 mg/L MnSO.sub.4.5H.sub.2O, 0.1 mg/L CuSO.sub.4.5H.sub.2O, 0.1 mg/L Co(NO.sub.3).sub.2-6H.sub.2O; pH 5.5 (J Am Oil Chem Soc 70:891-894 (1993)).

[0337] Intracellular lipid content of the modified and control S. cerevisiae strains is analyzed using the fluorescent probe, Nile Red (J Microbiol Meth (2004) 56:331-338). In brief, cells diluted in buffer are stained with Nile Red, excited at 488 nm, and the fluorescent emission spectra in the wavelength region of 400-700 nm are acquired and compared to the corresponding spectra from cells not stained with Nile Red. To confirm results from the rapid estimation method, the total lipid content is determined by gas chromatographic analysis of the total fatty acids directly transmethylesterified from dried cells, as described (Appl Microbiol Biotechnol. 2002 November; 60(3):275-80). Yeast strains expressing multiple oleaginic polypeptides produce elevated total lipid (for example, in the range of 17% and 25% dry cell weight basis) following growth in YPD and lipid accumulation medium when compared to non-transformed S. cerevisiae strains which may, for example, produce in the range of 6% and 10% total lipid after growth in YPD and lipid accumulation medium.

Example 10

Y. lipolytica Oleaginic and Isoprenoid Biosynthesis Genes

[0338] FIG. 11 is a list of Y. lipolytica genes representing various polypeptides (e.g. oleaginic and isoprenoid biosynthesis peptides) useful in the fungal strains and methods described herein. The Genbank accession number and GI number is given for each polypeptide in addition to oligo pairs which can be used to amplify the coding region for each gene from Y. lipolytica genomic DNA or cDNA. Resulting PCR fragments can be cleaved with restriction enzyme pairs (e.g. depending on what site is present within the oligo sequence, XbaI/MluI or NheI/MluI or XbaI/AscI or NheI/AscI) and inserted into expression vectors (e.g. fungal expression vectors including Y. lipolytica expression vectors such as MB4629 and MB4691 described herein).

[0339] The DNA and proteins they encode of the Y. lipolytica genes represented in FIG. 11 are as follows (intron sequence is underlined):

TABLE-US-00021 YALI0F30481g DNA: atgtcgcaaccccagaacgttggaatcaaagccctcgagatctacgtgccttctcgaattgtcaaccaggctga- gc tcgagaagcacgacggtgtcgctgctggcaagtacaccattggtcttggtcagaccaacatggcctttgtcgac- ga cagagaggacatctattcctttgccctgaccgccgtctctcgactgctcaagaacaacaacatcgaccctgcat- ct attggtcgaatcgaggttggtactgaaacccttctggacaagtccaagtccgtcaagtctgtgctcatgcagct- ct ttggcgagaacagcaacattgagggtgtggacaacgtcaacgcctgctacggaggaaccaacgccctgttcaac- gc tatcaactgggttgagggtcgatcttgggacggccgaaacgccatcgtcgttgccggtgacattgccctctacg- ca aagggcgctgcccgacccaccggaggtgccggctgtgttgccatgctcattggccccgacgctcccctggttct- tg acaacgtccacggatcttacttcgagcatgcctacgatttctacaagcctgatctgacctccgagtacccctat- gt tgatggccactactccctgacctgttacacaaaggccctcgacaaggcctacgctgcctacaacgcccgagccg- ag aaggtcggtctgttcaaggactccgacaagaagggtgctgaccgatttgactactctgccttccacgtgcccac- ct gcaagcttgtcaccaagtcttacgctcgacttctctacaacgactacctcaacgacaagagcctgtacgagggc- ca ggtccccgaggaggttgctgccgtctcctacgatgcctctctcaccgacaagaccgtcgagaagaccttccttg- gt attgccaaggctcagtccgccgagcgaatggctccttctctccagggacccaccaacaccggtaacatgtacac- cg cctctgtgtacgcttctctcatctctctgctgacttttgtccccgctgagcagctgcagggcaagcgaatctct- ct cttctcttacggatctggtcttgcttccactcttttctctctgaccgtcaagggagacatttctcccatcgtca- ag gcctgcgacttcaaggctaagctcgatgaccgatccaccgagactcccgtcgactacgaggctgccaccgatct- cc gagagaaggcccacctcaagaagaactttgagccccagggagacatcaagcacatcaagtctggcgtctactac- ct caccaacatcgatgacatgttccgacgaaagtacgagatcaagcagtag Protein: Msqpqnvgikaleiyvpsrivnqaelekhdgvaagkytiglgqtnmafvddrediysfaltavsrllknnnidp- as igrievgtetlldksksyksylmqlfgensniegvdnvnacyggtnalfnainwvegrswdgrnaivvagdial- ya kgaarptggagcvamligpdaplvldnvhgsyfehaydfykpdltseypyvdghysltcytkaldkayaaynar- ae kvglfkdsdkkgadrfdysafhvptcklvtksyarllyndylndkslyegqvpeevaavsydasltdktvektf- lg iakaqsaermapslqgptntgnmytasvyaslislltfvpaeqlqgkrislfsygsglastlfsltvkgdispi- vk acdfkaklddrstetpvdyeaatdlrekahlkknfepqgdikhiksgvyyltniddmfrrkyeikq YALI0B16038g DNA: atggactacatcatttcggcgccaggcaaagtgattctatttggtgaacatgccgctgtgtttggtaagcctgc- ga ttgcagcagccatcgacttgcgaacatacctgcttgtcgaaaccacaacatccgacaccccgacagtcacgttg- ga gtttccagacatccacttgaacttcaaggtccaggtggacaagctggcatctctcacagcccagaccaaggccg- ac catctcaattggtcgactcccaaaactctggataagcacattttcgacagcttgtctagcttggcgcttctgga- ag aacctgggctcactaaggtccagcaggccgctgttgtgtcgttcttgtacctctacatccacctatgtccccct- tc tgtgtgcgaagattcatcaaactgggtagttcgatcaacgctgcctatcggcgcgggcctgggctcttccgcat- cc atttgtgtctgtttggctgcaggtcttctggttctcaacggccagctgagcattgaccaggcaagagatttcaa- gt ccctgaccgagaagcagctgtctctggtggacgactggtccttcgtcggtgaaatgtgcattcacggcaacccg- tc gggcatcgacaatgctgtggctactcagggaggtgctctgttgttccagcgacctaacaaccgagtccctcttg- tt gacattcccgagatgaagctgctgcttaccaatacgaagcatcctcgatctaccgcagacctggttggtggagt- cg gagttctcactaaagagtttggctccatcatggatcccatcatgacttcagtaggcgagatttccaaccaggcc- at ggagatcatttctagaggcaagaagatggtggaccagtctaaccttgagattgagcagggtatcttgcctcaac- cc acctctgaggatgcctgcaacgtgatggaagatggagctactcttcaaaagttgagagatatcggttcggaaat- gc agcatctagtgagaatcaatcacggcctgcttatcgctatgggtgtttcccacccgaagctcgaaatcattcga- ac tgcctccattgtccacaacctgggtgagaccaagctcactggtgctggaggaggaggttgcgccatcactctag- tc acttctaaagacaagactgcgacccagctggaggaaaatgtcattgctttcacagaggagatggctacccatgg- ct tcgaggtgcacgagactactattggtgccagaggagttggtatgtgcattgaccatccctctctcaagactgtt- ga agccttcaagaaggtggagcgggcggatctcaaaaacatcggtccctggacccattag Protein: mdyiisapgkvilfgehaavfgkpaiaaaidlrtyllvetttsdtptvtlefpdihlnfkvqvdklasltaqtk- ad hlnwstpktldkhifdslsslalleepgltkvqqaavvsflylyihlcppsvcedssnwvvrstlpigaglgss- as icvclaagllvlngqlsidqardfksltekqlslvddwsfygemcihgnpsgidnavatqggallfqrpnnrvp- lv dipemkllltntkhprstadlvggvgvltkefgsimdpimtsvgeisnqameiisrgkkmvdqsnleieqgilp- qp tsedacnvmedgatlqklrdigsemqhlvrinhglliamgvshpkleiirtasivhnlgetkltgaggggcait- lv tskdktatqleenviafteemathgfevhettigargvgmcidhpslktveafkkveradlknigpwth YALI0E06193g DNA: atgaccacctattcggctccgggaaaggccctcctttgcggcggttatttggttattgatccggcgtattcagc- at acgtcgtgggcctctcggcgcgtatttacgcgacagtttcggcttccgaggcctccaccacctctgtccatgtc- gt ctctccgcagtttgacaagggtgaatggacctacaactacacgaacggccagctgacggccatcggacacaacc- ca tttgctcacgcggccgtcaacaccgttctgcattacgttcctcctcgaaacctccacatcaacatcagcatcaa- aa gtgacaacgcgtaccactcgcaaattgacagcacgcagagaggccagtttgcataccacaaaaaggcgatccac- ga ggtgcctaaaacgggcctcggtagctccgctgctcttaccaccgttcttgtggcagctttgctcaagtcatacg- gc attgatcccttgcataacacccacctcgttcacaacctgtcccaggttgcacactgctcggcacagaagaagat- tg ggtctggatttgacgtggcttcggccgtttgtggctctctagtctatagacgtttcccggcggagtccgtgaac- at ggtcattgcagctgaagggacctccgaatacggggctctgttgagaactaccgttaatcaaaagtggaaggtga- ct ctggaaccatccttcttgccgccgggaatcagcctgcttatgggagacgtccagggaggatctgagactccagg- ta tggtggccaaggtgatggcatggcgaaaagcaaagccccgagaagccgagatggtgtggagagatctcaacgct- gc caacatgctcatggtcaagttgttcaacgacctgcgcaagctctctctcactaacaacgaggcctacgaacaac- tt ttggccgaggctgctcctctcaacgctctaaagatgataatgttgcagaaccctctcggagaactagcacgatg- ca ttatcactattcgaaagcatctcaagaagatgacacgggagactggtgctgctattgagccggatgagcagtct- gc attgctcaacaagtgcaacacttatagtggagtcattggaggtgttgtgcctggagcaggaggctacgatgcta- tt tctcttctggtgatcagctctacggtgaacaatgtcaagcgagagagccagggagtccaatggatggagctcaa- gg aggagaacgagggtctgcggctcgagaaggggttcaagtag Protein: mttysapgkallcggylvidpaysayvvglsariyatvsaeseasttsvhvvspqfdkgewtynytngqltaig- hn pfahaavntvlhyvpprnlhinisiksdnayhsqidstqrgqfayhkkaihcvpktglgssaalttvlvaallk- sy gidplhnthlvhnlsqvahcsaqkkigsgfdvasavcgslvyrrfpaesvnmviaaegtseygallrttvnqkw- kv tlepsflppgisllmgdvqggsetpgmvakvmawrkakpreaemvwrdlnaanmlmvklfndlrklsltnneay- eq llaeaaplnalkmimlqnplgelarciitirkhlkkmtretgaaiepdeqsallnkcntysgviggvvpgaggy- da isllvisstvnnvkresqgvqwmelkeeneglrlekgfk YALI0F05632g DNA: atgatccaccaggcctccaccaccgctccggtgaacattgcgacactcaagtactggggcaagcgagaccctgc- tc tcaatctgcccactaacaactccatctcctgtgactttgtcgcaggatgatctgcggaccctcaccacagcctc- gt gttcccctgatttcacccaggacgagctgtggctcaatggcaagcaggaggacgtgagcggcaaacgtctggtt- gc gtgtttccgagagctgcgggctctgcgacacaaaatggaggactccgactcttctctgcctaagctggccgatc- ag aagctcaagatcgtgtccgagaacaacttccccaccgccgctggtctcgcctcatcggctgctggctttgccgc- cc tgatccgagccgttgcaaatctctacgagctccaggagacccccgagcagctgtccattgtggctcgacagggc- tc tggatccgcctgtcgatctctctacggaggctacgtggcatgggaaatgggcaccgagtctgacggaagcgact- cg cgagcggtccagatcgccaccgccgaccactggcccgagatgcgagccgccatcctcgttgtctctgccgacaa- ga aggacacgtcgtccactaccggtatgcaggtgactgtgcacacttctcccctcttcaaggagcgagtcaccact- gt ggttcccgagcggtttgcccagatgaagaagtcgattctggaccgagacttccccacctttgccgagctcacca- tg cgagactcaaaccagttccacgccacctgtctggactcgtatcctcccattttctacctcaacgacgtgtcgcg- ag

cctccattcgggtagttgaggccatcaacaaggctgccggagccaccattgccgcctacacctttgatgctgga- cc caactgtgtcatctactacgaggacaagaacgaggagctggttctgggtgctctcaaggccattctgggccgtg- tg gagggatgggagaagcaccagtctgtggacgccaagaagattgatgttgacgagcggtgggagtccgagctggc- ca acggaattcagcgggtgatccttaccaaggttggaggagatcccgtgaagaccgctgagtcgcttatcaacgag- ga tggttctctgaagaacagcaagtag Protein: mihqasttapvniatlkywgkrdpalnlptnnsisvtlsqddlrtlttascspdftqdelwlngkqedvsgkrl- va cfrelralrhkmedsdsslpkladqklkivsennfptaaglassaagfaaliravanlyelqetpeqlsivarq- gs gsacrslyggyvawemgtesdgsdsravqiatadhwpemraailvvsadkkdtssttgmqvtvhtsplfkervt- tv vperfaqmkksildrdfptfaeltmrdsnqfhatcldsyppifylndvsrasirvveainkaagatiaaytfda- gp ncviyyedkneelvlgalkailgrvegwekhqsvdakkidvderweselangiqrviltkvggdpvktaeslin- ed gslknsk YALI0F04015 DNA: Atgacgacgtcttacagcgacaaaatcaagagtatcagcgtgagctctgtggctcagcagtttcctgaggtggc- gc cgattgcggacgtgtccaaggctagccggcccagcacggagtcgtcggactcgtcggccaagctatttgatggc- ca cgacgaggagcagatcaagctgatggacgagatctgtgtggtgctggactgggacgacaagccgattggcggcg- cg tccaaaaagtgtctgtcatctgatggacaacatcaacgacggactggtgcatcgggccttttccgtgttcatgt- tc aacgaccgcggtgagctgcttctgcagcagcgggcggcggaaaaaatcacctttgccaacatgtggaccaacac- gt gctgctcgcatcctctggcggtgcccagcgagatgggcgggctggatctggagtcccggatccagggcgccaaa- aa cgccgcggtccggaagcttgagcacgagctgggaatcgaccccaaggccgttccggcagacaagttccatttcc- tc acccggatccactacgccgcgccctcctcgggcccctggggcgagcacgagattgactacattctgtttgtccg- gg gcgaccccgagctcaaggtggtggccaacgaggtccgcgataccgtgtgggtgtcgcagcagggactcaaggac- at gatggccgatcccaagctggttttcaccccttggttccggctcatttgtgagcaggcgctgtttccctggtggg- ac cagttggacaatctgcccgcgggcgatgacgagattcggcggtggatcaagtag Protein: mttsysdkiksisvssvaqqfpevapiadvskasrpstessdssaklfdghdeeqiklmdeicvvldwddkpig- ga skkcchlmdnindglvhrafsvfmfndrgelllqqraaekitfanmwtntccshplavpsemggldlesriqga- kn aavrklehelgidpkavpadkfhfltrihyaapssgpwgeheidyilfvrgdpelkvvanevrdtvwvsqqglk- dm madpklvftpwfrliceqalfpwwdqldnlpagddeirrwik YALI0E05753 DNA: atgtccaaggcgaaattcgaaagcgtgttcccccgaatctccgaggagctggtgcagctgctgcgagacgaggg- tc tgccccaggatgccgtgcagtggttttccgactcacttcagtacaactgtgtgggtggaaagctcaaccgaggc- ct gtctgtggtcgacacctaccagctactgaccggcaagaaggagctcgatgacgaggagtactaccgactcgcgc- tg ctcggctggctgattgagctgctgcaggcgtttttcctcgtgtcggacgacattatggatgagtccaagacccg- ac gaggccagccctgctggtacctcaagcccaaggtcggcatgattgccatcaacgatgctttcatgctagagagt- gg catctacattctgcttaagaagcatttccgacaggagaagtactacattgaccttgtcgagctgttccacgaca- tt tcgttcaagaccgagctgggccagctggtggatcttctgactgcccccgaggatgaggttgatctcaaccggtt- ct ctctggacaagcactcctttattgtgcgatacaagactgcttactactccttctacctgcccgttgttctagcc- at gtacgtggccggcattaccaaccccaaggacctgcagcaggccatggatgtgctgatccctctcggagagtact- tc caggtccaggacgactaccttgacaactttggagaccccgagttcattggtaagatcggcaccgacatccagga- ca acaagtgctcctggctcgttaacaaagcccttcagaaggccacccccgagcagcgacagatcctcgaggacaac- ta cggcgtcaaggacaagtccaaggagctcgtcatcaagaaactgtatgatgacatgaagattgagcaggactacc- tt gactacgaggaggaggttgttggcgacatcaagaagaagatcgagcaggttgacgagagccgaggcttcaagaa- gg aggtgctcaacgctttcctcgccaagatttacaagcgacagaagtag Protein: mskakfesvfpriseelvqllrdeglpqdavqwfsdslqyncvggklnrglsvvdtyqlltgkkelddeeyyrl- al lgwliellqafflvsddimdesktrrgqpcwylkpkvgmiaindafmlesgiyillkkhfrqekyyidlvelfh- di sfktelgqlvdlltapedevdlnrfsldkhsfivryktayysfylpvvlamyvagitnpkdlqqamdvliplge- yf qvqddyldnfgdpefigkigtdiqdnkcswlvnkalqkatpeqrqilednygvkdkskelvikklyddmkieqd- yl dyeeevvgdikkkieqvdesrgfkkevlnaflakiykrqk YALI0E18634g DNA: atgttacgactacgaaccatgcgacccacacagaccagcgtcagggcggcgcttgggcccaccgccgcggcccg- aa acatgtcctcctccagcccctccagcttcgaatactcgtcctacgtcaagggcacgcgggaaatcggccaccga- aa ggcgcccacaacccgtctgtcggttgagggccccatctacgtgggcttcgacggcattcgtcttctcaacctgc- cg catctcaacaagggctcgggattccccctcaacgagcgacgggaattcagactcagtggtcttctgccctctgc- cg aagccaccctggaggaacaggtcgaccgagcataccaacaattcaaaaagtgtggcactcccttagccaaaaac- gg gttctgcacctcgctcaagttccaaaacgaggtgctctactacgccctgctgctcaagcacgttaaggaggtct- tc cccatcatctatacaccgactcagggagaagccattgaacagtactcgcggctgttccggcggcccgaaggctg- ct tcctcgacatcaccagtccctacgacgtggaggagcgtctgggagcgtttggagaccatgacgacattgactac- at tgtcgtgactgactccgagggtattctcggaattggagaccaaggagtgggcggtattggtatttccatcgcca- ag ctggctctcatgactctatgtgctggagtcaacccctcacgagtcattcctgtggttctggatacgggaaccaa- ca accaggagctgctgcacgaccccctgtatctcggccgacgaatgccccgagtgcgaggaaagcagtacgacgac- tt catcgacaactttgtgcagtctgcccgaaggctgtatcccaaggcggtgatccatttcgaggactttgggctcg- ct aacgcacacaagatcctcgacaagtatcgaccggagatcccctgcttcaacgacgacatccagggcactggagc- cg tcactttggcctccatcacggccgctctcaaggtgctgggcaaaaatatcacagatactcgaattctcgtgtac- gg agctggttcggccggcatgggtattgctgaacaggtctatgataacctggttgcccagggtctcgacgacaaga- ct gcgcgacaaaacatctttctcatggaccgaccgggtctactgaccaccgcacttaccgacgagcagatgagcga- cg tgcagaagccgtttgccaaggacaaggccaattacgagggagtggacaccaagactctggagcacgtggttgct- gc cgtcaagccccatattctcattggatgttccactcagcccggcgcctttaacgagaaggtcgtcaaggagatgc- tc aaacacacccctcgacccatcattctccctctttccaaccccacacgtcttcatgaggctgtccctgcagatct- gt acaagtggaccgacggcaaggctctggttgccaccggctcgccctttgacccagtcaacggcaaggagacgtct- ga gaacaataactgctttgttttccccggaatcgggctgggagccattctgtctcgatcaaagctcatcaccaaca- cc atgattgctgctgccatcgagtgcctcgccgaacaggcccccattctcaagaaccacgacgagggagtacttcc- cg acgtagctctcatccagatcatttcggcccgggtggccactgccgtggttcttcaggccaaggctgagggccta- gc cactgtcgaggaagagctcaagcccggcaccaaggaacatgtgcagattcccgacaactttgacgagtgtctcg- cc tgggtcgagactcagatgtggcggcccgtctaccggcctctcatccatgtgcgggattacgactag Protein: mlrlrtmrptqtsvraalgptaaarnmsssspssfeyssyvkgtreighrkapttrlsvegpiyvgfdgirlln- lp hlnkgsgfplnerrefrlsgllpsaeatleeqvdrayqqfkkcgtplakngfctslkfqnevlyyalllkhvke- vf piiytptqgeaieqysrlfrrpegcflditspydveerlgafgdhddidyivvtdsegilgigdqgvggigisi- ak lalmtlcagvnpsrvipvvldtgtnnqellhdplylgrrmprvrgkqyddfidnfvqsarrlypkavihfedfg- la nahkildkyrpeipcfnddiqgtgavtlasitaalkvlgknitdtrilvygagsagmgiaeqvydnlvaqgldd- kt arqniflmdrpgllttaltdeqmsdvqkpfakdkanyegvdtktlehvvaavkphiligcstqpgafnekvvke- ml khtprpiilplsnptrlheavpadlykwtdgkalvatgspfdpvngketsennncfvfpgiglgailsrsklit- nt miaaaieclaeqapilknhdegvlpdvaliqiisarvatavvlqakaeglatveeelkpgtkehvqipdnfdec- la wvetqmwrpvyrplihvrdyd YALI0E11495g DNA: atgccgcagcaagcaatggatatcaagggcaaggccaagtctgtgcccatgcccgaagaagacgacctggactc- gc attttgtgggtcccatctctccccgacctcacggagcagacgagattgctggctacgtgggctgcgaagacgac- ga agacgagcttgaagaactgggaatgctgggccgatctgcgtccacccacttctcttacgcggaagaacgccacc- tc

atcgaggttgatgccaagtacagagctcttcatggccatctgcctcatcagcactctcagagtcccgtgtccag- at cttcgtcatttgtgcgggccgaaatgaaccacccccctcccccaccctccagccacacccaccaacagccagag- ga cgatgacgcatcttccactcgatctcgatcgtcgtctcgagcttctggacgcaagttcaacagaaacagaacca- ag tctggatcttcgctgagcaagggtctccagcagctcaacatgaccggatcgctcgaagaagagccctacgagag- cg atgacgatgcccgactatctgcggaagacgacattgtctatgatgctacccagaaagacacctgcaagcccata- tc tcctactctcaaacgcacccgcaccaaggacgacatgaagaacatgtccatcaacgacgtcaaaatcaccacca- cc acagaagatcctcttgtggcccaggagctgtccatgatgttcgaaaaggtgcagtactgccgagacctccgaga- ca agtaccaaaccgtgtcgctacagaaggacggagacaaccccaaggatgacaagacacactggaaaatttacccc- ga gcctccaccaccctcctggcacgagaccgaaaagcgattccgaggctcgtccaaaaaggagcaccaaaagaaag- ac ccgacaatggatgaattcaaattcgaggactgcgaaatccccggacccaacgacatggtcttcaagcgagatcc- ta cctgtgtctatcaggtctatgaggatgaaagctctctcaacgaaaataagccgtttgttgccatcccctcaatc- cg agattactacatggatctggaggatctcattgtggcttcgtctgacggacctgccaagtcttttgctttccgac- ga ctgcaatatctagaagccaagtggaacctctactacctgctcaacgagtacacggagacaaccgagtccaagac- ca acccccatcgagacttttacaacgtacgaaaggtcgacacccacgttcaccactctgcctgcatgaaccagaag- ca tctgctgcgattcatcaaatacaagatgaagaactgccctgatgaagttgtcatccaccgagacggtcgggagc- tg acactctcccaggtgtttgagtcacttaacttgactgcctacgacctgtctatcgatacccttgatatgcatgc- tc acaaggactcgttccatcgatttgacaagttcaacctcaagtacaaccctgtcggtgagtctcgactgcgagaa- at cttcctaaagaccgacaactacatccagggtcgatacctagctgagatcacaaaggaggtgttccaggatctcg- ag aactcgaagtaccagatggcggagtaccgtatttccatctacggtcggtccaaggacgagtgggacaagctggc- tg cctgggtgctggacaacaaactgttttcgcccaatgttcggtggttgatccaggtgcctcgactgtacgacatt- ta caagaaggctggtctggttaacacctttgccgacattgtgcagaacgtctttgagcctcttttcgaggtcacca- ag gatcccagtacccatcccaagctgcacgtgttcctgcagcgagttgtgggctttgactctgtcgatgacgagtc- ga agctggaccgacgtttccaccgaaagttcccaactgcagcatactgggacagcgcacagaaccctccctactcg- ta ctggcagtactatctatacgccaacatggcctccatcaacacctggagacagcgtttgggctataatacttttg- ag ttgcgaccccatgctggagaggctggtgacccagagcatcttctgtgcacttatctggttgctcagggtatcaa- cc acggtattctgttgcgaaaggtgcccttcattcagtacctttactacctggaccagatccccattgccatgtct- cc tgtgtccaacaatgcgctgttcctcacgttcgacaagaaccccttctactcatacttcaagcggggtctcaacg- tg tccttgtcatcggatgatcctctgcagtttgcttacactaaggaggctctgattgaggagtactctgtggctgc- gc tcatttacaagctttccaacgtggatatgtgtgagcttgctcgaaactcggtactgcaatctggctttgagcga- at catcaaggagcattggatcggcgaaaactacgagatccatggccccgagggcaacaccatccagaagacaaacg- tg cccaatgtgcgtctggccttccgagacgagactttgacccacgagcttgctctggtggacaagtacaccaatct- tg aggagtttgagcggctgcatggttaa Protein: mpqqamdikgkaksvpmpeeddldshfvgpisprphgadeiagyvgceddedeleelgmlgrsasthfsyaeer- hl ievdakyralhghlphqhsqspvsrsssfvraemnhpppppsshthqqpedddasstrsrsssrasgrkfnrnr- tk sgsslskglqqlnmtgsleeepyesdddarlsaeddivydatqkdtckpisptlkrtrtkddmknmsindvkit- tt tedplvaqelsmmfekvqycrdlrdkyqtvslqkdgdnpkddkthwkiypeppppswhetekrfrgsskkehqk- kd ptmdefkfedceipgpndmvfkrdptcvyqvyedesslnenkpfvaipsirdyymdledlivassdgpaksfaf- rr lqyleakwnlyyllneytettesktnphrdfynvrkvdthvhhsacmnqkhllrfikykmkncpdevvihrdgr- el tlsqvfeslnitaydlsidtldmhahkdsfhrfdkfnlkynpvgesrlreiflktdnyiqgrylaeitkevfqd- le nskyqmaeyrisiygrskdewdklaawvldnklfspnvrwliqvprlydiykkaglvntfadivqnvfeplfev- tk dpsthpklhvflqrvvgfdsvddeskldrrfhrkfptaaywdsaqnppysywqyylyanmasintwrqrlgynt- fe lrphageagdpehllctylvaqginhgillrkvpfiqylyyldqipiamspvsnnalfltfdknpfysyflagl- nv slssddplqfaytkealieeysvaaliyklsnvdmcelarnsvlqsgferiikehwigenyeihgpegntiqkt- nv pnvrlafrdetlthelalvdkytnleeferlhg YALI0D16753g DNA: atgttccgaacccgagttaccggctccaccctgcgatccttctccacctccgctgcccgacagcacaaggttgt- cg tccttggcgccaacggaggcattggccagcccctgtctctgctgctcaagctcaacaagaacgtgaccgacctc- gg tctgtacgatctgcgaggcgcccccggcgttgctgccgatgtctcccacatccccaccaactccaccgtggccg- gc tactctcccgacaacaacggcattgccgaggccctcaagggcgccaagctggtgctgatccccgccggtgtccc- cc gaaagcccggcatgacccgagacgatctgttcaacaccaacgcctccattgtgcgagacctggccaaggccgtc- gg tgagcacgcccccgacgcctttgtcggagtcattgctaaccccgtcaactccaccgtccccattgtcgccgagg- tg ctcaagtccaagggcaagtacgaccccaagaagctcttcggtgtcaccaccctcgacgtcatccgagccgagcg- at tcgtctcccagctcgagcacaccaaccccaccaaggagtacttccccgttgttggcggccactccggtgtcacc- at tgtccccctcgtgtcccagtccgaccaccccgacattgccggtgaggctcgagacaagcttgtccaccgaatcc- ag tttggcggtgacgaggttgtcaaggccaaggacggtgccggatccgccaccctttccatggcccaggctgccgc- cc gattcgccgactctctcctccgaggtgtcaacggcgagaaggacgttgttgagcccactttcgtcgactctcct- ct gttcaagggtgagggcatcgacttcttctccaccaaggtcactcttggccctaacggtgttgaggagatccacc- cc atcggaaaggtcaacgagtacgaggagaagctcatcgaggctgccaaggccgatctcaagaagaacattgagaa- gg gtgtcaactttgtcaagcagaacccttaa Protein: mfrtrvtgstlrsfstsaarqhkvvvlganggigqplslllklnknvtdlglydlrgapgvaadvshiptnstv- ag yspdnngiaealkgaklvlipagvprkpgmtrddlfntnasivrdlakavgehapdafvgvianpvnstvpiva- ev lkskgkydpkklfgvttldviraerfvsqlehtnptkeyfpvvgghsgvtivplvsqsdhpdiageardklvhr- iq fggdevvkakdgagsatlsmaqaaarfadsllrgvngekdvveptfvdsplfkgegidffstkvtlgpngveei- hp igkvneyeeklieaakadlkkniekgvnfvkqnp YALI0D16247g DNA: atgacacaaacgcacaatctgttttcgccaatcaaagtgggctcttcggagctccagaaccggatcgttctcgc- ac ccttgactcgaaccagagctctgcccggaaacgtgccctcggatcttgccacagagtactacgcacaaagagca- gc atctccaggcactctcctcatcaccgaggccacatacatctcccccggatctgctggagtgcccattccaggag- ac ggaatcgttccgggcatctggagtgacgagcagctcgaagcatggaaaaaggtgttcaaggccgtgcacgaccg- ag gatccaaaatctacgtccagctgtgggacattggacgtgtcgcatggtaccacaagctgcaggaactgggcaac- ta cttccctacaggcccctcagctatccccatgaagggagaggagagcgagcatctcaaggctctgactcactggg- ag atcaagggcaaggtggccctctacgtcaacgctgccaagaacgccattgccgcaggcgctgatggcgtcgagat- cc actcggccaacggctaccttcccgacacatttctgagaagcgcctccaaccaacgaacagacgaatatggagga- ag catcgagaaccgggcccgattctcgctggagattgtcgacgctatcaccgaggccattggagcagacaaaaccg- cc atccgtctgtctccctggtccactttccaggacattgaggtgaatgacaccgagacccccgcacagttcacata- cc tgtttgagcagctgcagaagcgagccgacgagggaaagcagctggcctacgtgcatgtagttgagccccgactg- tt tggtccccccgagccctgggccaccaatgagcctttcagaaaaatttggaagggtaacttcattagagcaggtg- ga tacgatagagagactgctcttgaggatgcagacaagtcagacaacaccctgattgcctttggtcgagacttcat- tg ccaatcctgatctcgtccaacgcctcaagaataacgagcctttggccaagtacgacagaacaaccttctacgtt- cc aggtgccaagggctacactgattaccctgcgtacaagatgtaa Protein: mtqthnlfspikvgsselqnrivlapltrtralpgnvpsdlateyyaqraaspgtlliteatyispgsagvpip- gd givpgiwsdeqleawkkvfkavhdrgskiyvqlwdigrvawyhklqelgnyfptgpsaipmkgeesehlkalth- we ikgkvalyvnaaknaiaagadgveihsangylpdtflrsasnqrtdeyggsienrarfsleivdaiteaigadk- ta irlspwstfqdievndtetpaqftylfeqlqkradegkqlayvhvveprlfgppepwatnepfrkiwkgnfira- gg ydretaledadksdntliafgrdfianpdlvqrlknneplakydrttfyvpgakgytdypaykm YALI0A15972g

DNA: atggaagccaaccccgaagtccagaccgatatcatcacgctgacccggttcattctgcaggaacagaacaaggt- gg gcgcgtcgtccgcaatccccaccggagacttcactctgctgctcaactcgctgcagtttgccttcaagttcatt- gc ccacaacatccgacgatcgaccctggtcaacctgattggcctgtcgggaaccgccaactccaccggcgacgacc- ag aagaagctggacgtgatcggagacgagatcttcatcaacgccatgaaggcctccggtaaggtcaagctggtggt- gt ccgaggagcaggaggacctcattgtgtttgagggcgacggccgatacgccgtggtctgcgaccccatcgacgga- tc ctccaacctcgacgccggcgtctccgtcggcaccattttcggcgtctacaagctccccgagggctcctccggat- cc atcaaggacgtgctccgacccggaaaggagatggttgccgccggctacaccatgtacggtgcctccgccaacct- gg tgctgtccaccggaaacggctgcaacggcttcactctcgatgaccctctgggagagttcatcctgacccacccc- ga tctcaagctccccgatctgcgatccatctactccgtcaacgagggtaactcctccctgtggtccgacaacgtca- ag gactacttcaaggccctcaagttccccgaggacggctccaagccctactcggcccgatacattggctccatggt- cg ccgacgtgcaccgaaccattctctacggaggtatgtttgcctaccccgccgactccaagtccaagaagggcaag- ct ccgacttttgtacgagggtttccccatggcctacatcattgagcaggccggcggtcttgccatcaacgacaacg- gc gagcgaatcctcgatctggtccccaccgagatccacgagcgatccggcgtctggctgggctccaagggcgagat- tg agaaggccaagaagtaccttctgaaatga Protein: meanpevqtdiitltrfilqeqnkvgassaiptgdftlllnslqfafkfiahnirrstlvnliglsgtanstgd- dq kkldvigdeifinamkasgkvklvvseeqedlivfegdgryavvcdpidgssnldagvsvgtifgvyklpegss- gs ikdvlrpgkemvaagytmygasanlvlstgngcngftlddplgefilthpdlklpdlrsiysvnegnsslwsdn- vk dyfkalkfpedgskpysaryigsmvadvhrtilyggmfaypadskskkgklrllyegfpmayiieqagglaind- ng erildlvpteihersgvwlgskgeiekakkyllk YALI0E11099g DNA: atgcgactcactctgccccgacttaacgccgcctacattgtaggagccgcccgaactcctgtcggcaagttcaa- cg gagccctcaagtccgtgtctgccattgacctcggtatcaccgctgccaaggccgctgtccagcgatccaaggtc- cc cgccgaccagattgacgagtttctgtttggccaggtgctgaccgccaactccggccaggcccccgcccgacagg- tg gttatcaagggtggtttccccgagtccgtcgaggccaccaccatcaacaaggtgtgctcttccggcctcaagac- cg tggctctggctgcccaggccatcaaggccggcgaccgaaacgttatcgtggccggtggaatggagtccatgtcc- aa caccccctactactccggtcgaggtcttgttttcggcaaccagaagctcgaggactccatcgtcaaggacggtc- tc tgggacccctacaacaacatccacatgggcaactgctgcgagaacaccaacaagcgagacggcatcacccgaga- gc agcaggacgagtacgccatcgagtcctaccgacgggccaacgagtccatcaagaacggcgccttcaaggatgag- at tgtccccgttgagatcaagacccgaaagggcaccgtgactgtctccgaggacgaggagcccaagggagccaacg- cc gagaagctcaagggcctcaagcctgtctttgacaagcagggctccgtcactgccggtaacgcctcccccatcaa- cg atggtgcttctgccgttgtcgttgcctctggcaccaaggccaaggagctcggtacccccgtgctcgccaagatt- gt ctcttacgcagacgccgccaccgcccccattgactttaccattgctccctactggccattcccgccgccctcaa- ga aggctggccttaccaaggacgacattgccctctgggagatcaacgaggccttctccggtgtcgctctcgccaac- ct catgcgactcggaattgacaagtccaaggtcaacgtcaagggtggagctgttgctctcggccaccccattggtg- cc tccggtaaccgaatctttgtgactttggtcaacgccctcaaggagggcgagtacggagttgccgccatctgcaa- cg gtggaggagcttccaccgccatcgtcatcaagaaggtctcttctgtcgagtag Protein: mrltlprlnaayivgaartpvgkfngalksvsaidlgitaakaavqrskvpadqideflfgqvltansgqapar- qv vikggfpesveattinkvcssglktvalaaqaikagdrnvivaggmesmsntpyysgrglvfgnqkledsivkd- gl wdpynnihmgnccentnkrdgitreqqdeyaiesyrranesikngafkdeivpveiktrkgtvtvsedeepkga- na eklkglkpvfdkqgsvtagnaspindgasavvvasgtkakelgtpvlakivsyadaatapidftiapslaipaa- lk kagltkddialweineafsgvalanlmrlgidkskvnvkggavalghpigasgnrifvtlvnalkegeygvaai- cn gggastaivikkvssve YALI0E34793g DNA: atgtctgccaacgagaacatctcccgattcgacgcccctgtgggcaaggagcaccccgcctacgagctcttcca- ta accacacacgatctttcgtctatggtctccagcctcgagcctgccagggtatgctggacttcgacttcatctgt- aa gcgagagaacccctccgtggccggtgtcatctatcccttcggcggccagttcgtcaccaagatgtactggggca- cc aaggagactcttctccctgtctaccagcaggtcgagaaggccgctgccaagcaccccgaggtcgatgtcgtggt- ca actttgcctcctctcgatccgtctactcctctaccatggagctgctcgagtacccccagttccgaaccatcgcc- at tattgccgagggtgtccccgagcgacgagcccgagagatcctccacaaggcccagaagaagggtgtgaccatca- tt ggtcccgctaccgtcggaggtatcaagcccggttgcttcaaggttggaaacaccggaggtatgatggacaacat- tg tcgcctccaagctctaccgacccggctccgttgcctacgtctccaagtccggaggaatgtccaacgagctgaac- aa cattatctctcacaccaccgacggtgtctacgagggtattgctattggtggtgaccgataccctggtactacct- tc attgaccatatcctgcgatacgaggccgaccccaagtgtaagatcatcgtcctccttggtgaggttggtggtgt- tg aggagtaccgagtcatcgaggctgttaagaacggccagatcaagaagcccatcgtcgcttgggccattggtact- tg tgcctccatgttcaagactgaggttcagttcggccacgccggctccatggccaactccgacctggagactgcca- ag gctaagaacgccgccatgaagtctgctggcttctacgtccccgataccttcgaggacatgcccgaggtccttgc- cg agctctacgagaagatggtcgccaagggcgagctgtctcgaatctctgagcctgaggtccccaagatccccatt- ga ctactcttgggcccaggagcttggtcttatccgaaagcccgctgctttcatctccactatttccgatgaccgag- gc caggagcttctgtacgctggcatgcccatttccgaggttttcaaggaggacattggtatcggcggtgtcatgtc- tc tgctgtggttccgacgacgactccccgactacgcctccaagtttcttgagatggttctcatgcttactgctgac- ca cggtcccgccgtatccggtgccatgaacaccattatcaccacccgagctggtaaggatctcatttcttccctgg- tt gctggtctcctgaccattggtacccgattcggaggtgctcttgacggtgctgccaccgagttcaccactgccta- cg acaagggtctgtccccccgacagttcgttgataccatgcgaaagcagaacaagctgattcctggtattggccat- cg agtcaagtctcgaaacaaccccgatttccgagtcgagcttgtcaaggactttgttaagaagaacttcccctcca- cc cagctgctcgactacgcccttgctgtcgaggaggtcaccacctccaagaaggacaacctgattctgaacgttga- cg gtgctattgctgtttcttttgtcgatctcatgcgatcttgcggtgcctttactgtggaggagactgaggactac- ct caagaacggtgttctcaacggtctgttcgttctcggtcgatccattggtctcattgcccaccatctcgatcaga- ag cgactcaagaccggtctgtaccgacatccttgggacgatatcacctacctggttggccaggaggctatccagaa- ga agcgagtcgagatcagcgccggcgacgtttccaaggccaagactcgatcatag Protein: msanenisrfdapvgkehpayelfhnhtrsfvyglqpracqgmldfdfickrenpsvagviypfggqfvtkmyw- gt ketllpvyqqvekaaakhpevdvvvnfassrsvysstmelleypqfrtiaiiaegvperrareilhkaqkkgvt- ii gpatvggikpgcfkvgntggmmdnivasklyrpgsvayvsksggmsnelnniishttdgvyegiaiggdrypgt- tf idhilryeadpkckiivllgevggveeyrvieavkngqikkpivawaigtcasmfktevqfghagsmansdlet- ak aknaamksagfyvpdtfedmpevlaelyekmvakgelsrisepevpkipidyswaqelglirkpaafistisdd- rg qellyagmpisevfkedigiggvmsllwfrrrlpdyaskflemvlmltadhgpaysgamntiittragkdliss- lv aglltigtrfggaldgaatefttaydkglsprqfvdtmrkqnklipgighrvksrnnpdfrvelvkdfvkknfp- st qlldyalaveevttskkdnlilnvdgaiaysfvdlmrscgaftveetedylkngvlnglfvlgrsigliahhld- qk rlktglyrhpwdditylvgqeaiqkkrveisagdvskaktrs YALI0D24431g DNA: atgtcagcgaaatccattcacgaggccgacggcaaggccctgctcgcacactttctgtccaaggcgcccgtgtg- gg ccgagcagcagcccatcaacacgtttgaaatgggcacacccaagctggcgtctctgacgttcgaggacggcgtg- gc ccccgagcagatcttcgccgccgctgaaaagacctacccctggctgctggagtccggcgccaagtttgtggcca- ag cccgaccagctcatcaagcgacgaggcaaggccggcctgctggtactcaacaagtcgtgggaggagtgcaagcc- ct ggatcgccgagcgggccgccaagcccatcaacgtggagggcattgacggagtgctgcgaacgttcctggtcgag- cc ctttgtgccccacgaccagaagcacgagtactacatcaacatccactccgtgcgagagggcgactggatcctct- tc taccacgagggaggagtcgacgtcggcgacgtggacgccaaggccgccaagatcctcatccccgttgacattga-

ga acgagtacccctccaacgccacgctcaccaaggagctgctggcacacgtgcccgaggaccagcaccagaccctg- ct cgacttcatcaaccggctctacgccgtctacgtcgatctgcagtttacgtatctggagatcaaccccctggtcg- tg atccccaccgcccagggcgtcgaggtccactacctggatcttgccggcaagctcgaccagaccgcagagtttga- gt gcggccccaagtgggctgctgcgcggtcccccgccgctctgggccaggtcgtcaccattgacgccggctccacc- aa ggtgtccatcgacgccggccccgccatggtcttccccgctcctttcggtcgagagctgtccaaggaggaggcgt- ac attgcggagctcgattccaagaccggagcttctctgaagctgactgttctcaatgccaagggccgaatctggac- cc ttgtggctggtggaggagcctccgtcgtctacgccgacgccattgcgtctgccggctttgctgacgagctcgcc- aa ctacggcgagtactctggcgctcccaacgagacccagacctacgagtacgccaaaaccgtactggatctcatga- cc cggggcgacgctcaccccgagggcaaggtactgttcattggcggaggaatcgccaacttcacccaggttggatc- ca ccttcaagggcatcatccgggccttccgggactaccagtcttctctgcacaaccacaaggtgaagatttacgtg- cg acgaggcggtcccaactggcaggagggtctgcggttgatcaagtcggctggcgacgagctgaatctgcccatgg- ag atttacggccccgacatgcacgtgtcgggtattgttcctttggctctgcttggaaagcggcccaagaatgtcaa- gc cttttggcaccggaccttctactgaggcttccactcctctcggagtttaa Protein: Msaksiheadgkallahflskapvwaeqqpintfemgtpklasltfedgvapeqifaaaektypwllesgakfv- ak pdqlikrrgkagllvlnksweeckpwiaeraakpinvegidgvlrtflvepfvphdqkheyyinihsvregdwi- lf yheggvdvgdvdakaakilipvdieneypsnatltkellahvpedqhqtlldfinrlyavyvdlqftyleinpl- vv iptaqgvevhyldlagkldqtaefecgpkwaaarspaalgqvvtidagstkvsidagpamvfpapfgrelskee- ay iaeldsktgaslkltvlnakgriwtlvagggasvvyadaiasagfadelanygeysgapnetqtyeyaktvldl- mt rgdahpegkvlfigggianftqvgstfkgiirafrdyqsslhnhkvkiyvrrggpnwqeglrliksagdelnlp- me iygpdmhvsgivplallgkrpknvkpfgtgpsteastplgv YALI0E14190g DNA: atggttattatgtgtgtgggacctcagcacacgcatcatcccaacacagggtgcagtatatatagacagacgtg- tt ccttcgcaccgttcttcacatatcaaaacactaacaaattcaaaagtgagtatcatggtgggagtcaattgatt- gc tcggggagttgaacaggcaacaatggcatgcacagggccagtgaaggcagactgcagtcgctgcacatggatcg- tg gttctgaggcgttgctatcaaaagggtcaattacctcacgaaacacagctggatgttgtgcaatcgtcaattga- aa aacccgacacaatgcaagatctctttgcgcgcattgccatcgctgttgccatcgctgtcgccatcgccaatgcc- gc tgcggattattatccctaccttgttccccgcttccgcacaaccggcgatgtctttgtatcatgaactctcgaaa- ct aactcagtggttaaagctgtcgttgccggagccgctggtggtattggccagcccctttctcttctcctcaaact- ct ctccttacgtgaccgagcttgctctctctacgatgtcgtcaactcccccggtgttgccgctgacctctcccaca- tc tccaccaaggctaaggtcactggctacctccccaaggatgacggtctcaagaacgctctgaccggcgccaacat- tg tcgttatccccgccggtatcccccgaaagcccggtatgacccgagacgatctgttcaagatcaacgctggtatc- gt ccgagatctcgtcaccggtgtcgcccagtacgcccctgacgcctttgtgctcatcatctccaaccccgtcaact- ct accgtccctattgctgccgaggtcctcaagaagcacaacgtcttcaaccctaagaagctcttcggtgtcaccac- cc ttgacgttgtccgagcccagaccttcaccgccgctgttgttggcgagtctgaccccaccaagctcaacatcccc- gt cgttggtggccactccggagacaccattgtccctctcctgtctctgaccaagcctaaggtcgagatccccgccg- ac aagctcgacgacctcgtcaagcgaatccagtttggtggtgacgaggttgtccaggctaaggacggtcttggatc- cg ctaccctctccatggcccaggctggtttccgatttgccgaggctgtcctcaagggtgccgctggtgagaagggc- at catcgagcccgcctacatctaccttgacggtattgatggcacctccgacatcaagcgagaggtcggtgtcgcct- tc ttctctgtccctgtcgagttcggccctgagggtgccgctaaggcttacaacatccttcccgaggccaacgacta- cg agaagaagcttctcaaggtctccatcgacggtctttacggcaacattgccaagggcgaggagttcattgttaac- cc tcctcctgccaagtaa Protein: vvkavvagaaggigqplslllklspyvtelalydvvnspgvaadlshistkakvtgylpkddglknaltganiv- vi pagiprkpgmtrddlfkinagivrdlvtgvaqyapdafvliisnpvnstvpiaaevlkkhnvfnpkklfgvttl- dv vraqtftaavvgesdptklnipvvgghsgdtivpllsltkpkveipadklddlvkriqfggdevvqakdglgsa- tl smaqagfrfaeavlkgaagekgiiepayiyldgidgtsdikrevgvaffsvpvefgpegaakaynilpeandye- kk llkvsidglygniakgeefivnpppak YALI0E22649g DNA: atgactggcaccttacccaagttcggcgacggaaccaccattgtggttcttggagcctccggcgacctcgctaa- ga agaagaccgtgagtattgaaccagactgaggtcaattgaagagtaggagagtctgagaacattcgacggacctg- at tgtgctctggaccactcaattgactcgttgagagccccaatgggtcttggctagccgagtcgttgacttgttga- ct tgttgagcccagaacccccaacttttgccaccatacaccgccatcaccatgacacccagatgtgcgtgcgtatg- tg agagtcaattgttccgtggcaaggcacagcttattccaccgtgttccttgcacaggtggtctttacgctctccc- ac tctatccgagcaataaaagcggaaaaacagcagcaagtcccaacagacttctgctccgaataaggcgtctagca- ag tgtgcccaaaactcaattcaaaaatgtcagaaacctgatatcaacccgtcttcaaaagctaaccccagttcccc- gc cctcttcggcctttaccgaaacggcctgctgcccaaaaatgttgaaatcatcggctacgcacggtcgaaaatga- ct caggaggagtaccacgagcgaatcagccactacttcaagacccccgacgaccagtccaaggagcaggccaagaa- gt tccttgagaacacctgctacgtccagggcccttacgacggtgccgagggctaccagcgactgaatgaaaagatt- ga ggagtttgagaagaagaagcccgagccccactaccgtcttttctacctggctctgccccccagcgtcttccttg- ag gctgccaacggtctgaagaagtatgtctaccccggcgagggcaaggcccgaatcatcatcgagaagccctttgg- cc acgacctggcctcgtcacgagagctccaggacggccttgctcctctctggaaggagtctgagatcttccgaatc- ga ccactacctcggaaaggagatggtcaagaacctcaacattctgcgatttggcaaccagttcctgtccgccgtgg- gt gacaagaacaccatttccaacgtccagatctccttcaaggagccctttggcactgagggccgaggtggatactt- ca acgacattggaatcatccgagacgttattcagaaccatctgttgcaggttctgtccattctagccatggagcga- cc cgtcactttcggcgccgaggacattcgagatgagaaggtcaaggtgctccgatgtgtcgacattctcaacattg- ac gacgtcattctcggccagtacggcccctctgaagacggaaagaagcccggatacaccgatgacgatggcgttcc- cg atgactcccgagctgtgacctttgctgctctccatctccagatccacaacgacagatgggagggtgttcctttc- at cctccgagccggtaaggctctggacgagggcaaggtcgagatccgagtgcagttccgagacgtgaccaagggcg- tt gtggaccatctgcctcgaaatgagctcgtcatccgaatccagccctccgagtccatctacatgaagatgaactc- ca agctgcctggccttactgccaagaacattgtcaccgacctggatctgacctacaaccgacgatactcggacgtg- cg aatccctgaggcttacgagtctctcattctggactgcctcaagggtgaccacaccaactttgtgcgaaacgacg- ag ctggacatttcctggaagattttcaccgatctgctgcacaagattgacgaggacaagagcattgtgcccgagaa- gt acgcctacggctctcgtggccccgagcgactcaagcagtggctccgagaccgaggctacgtgcgaaacggcacc- ga gctgtaccaatggcctgtcaccaagggctcctcgtga Protein: mtgtlpkfgdgttivvlgasgdlakkktfpalfglyrngllpknveiigyarskmtqeeyherishyfktpddq- sk eqakkflentcyvqgpydgaegyqrlnekieefekkkpephyrlfylalppsvfleaanglkkyvypgegkari- ii ekpfghdlassrelqdglaplwkeseifridhylgkemyknlnilrfgnqflsavwdkntisnvqisfkepfgt- eg rggyfndigiirdviqnhllqvlsilamerpvtfgaedirdekvkvlrcvdilniddvilgqygpsedgkkpgy- td ddgvpddsravtfaalhlqihndrwegvpfilragkaldegkveirvqfrdvtkgvvdhlprnelviriqpses- iy mkmnsklpgltaknivtdldltynrrysdvripeayeslildclkgdhtnfvrndeldiswkiftdllhkided- ks ivpekyaygsrgperlkqwlrdrgyvrngtelyqwpvtkgss YALI0B15598g DNA: atgactgacacttcaaacatcaagtgagtattgccgcacacaattgcaatcaccgccgggctctacctcctcag- ct ctcgacgtcaatgggccagcagccgccatttgaccccaattacactggttgtgtaaaaccctcaaccacaatcg- ct tatgctcaccacagactacgacttaaccaagtcatgtcacaggtcaaagtaaagtcagcgcaacaccccctcaa- tc tcaacacacttttgctaactcaggcctgtcgctgacattgccctcatcggtctcgccgtcatgggccagaacct- ga

tcctcaacatggccgaccacggtaagtatcaattgactcaagacgcaccagcaagatacagagcatacccagca- at cgctcctctgataatcgccattgtaacactacgttggttagattgatctaaggtcgttgctggttccatgcact- tc cacttgctcatatgaagggagtcaaactctattttgatagtgtcctctcccatccccgaaatgtcgcattgttg- ct aacaataggctacgaggttgttgcctacaaccgaaccacctccaaggtcgaccacttcctcgagaacgaggcca- ag ggtgagtatccgtccagctatgctgtttacagccattgaccccaccttcccccacaattgctacgtcaccatta- aa aaacaaaattaccggtatcggcaagctagactttcatgcaacctacgcagggtaacaagttgagtttcagccgt- gc accttacaggaaaaccagtcatacgccgaggcagtgtgaaagcgaaagcacacagcctacggtgattgattgca- tt tttttgacataggagggaaacacgtgacatggcaagtgcccaacacgaatactaacaaacaggaaagtccatta- tt ggtgctcactctatcaaggagctgtgtgctctgctgaagcgaccccgacgaatcattctgctcgttaaggccgg- tg ctgctgtcgattctttcatcgaacagctcctgccctatctcgataagggtgatatcatcattgacggtggtaac- tc ccacttccccgactccaaccgacgatacgaggagcttaacgagaagggaatcctctttgttggttccggtgttt- cc ggcggtgaggagggtgcccgatacggtccctccatcatgcccggtggaaacaaggaggcctggccccacattaa- ga agattttccaggacatctctgctaaggctgatggtgagccctgctgtgactgggtcggtgacgctggtgccggc- ac ctttgtcaagatggttcacaacggtattgagtatggtgacatgcagcttatctgcgaggcttacgacctcatga- ag cgaggtgctggtttcaccaatgaggagattggagacgttttcgccaagtggaacaacggtatcctcgactcctt- cc tcattgagatcacccgagacatcttcaagtacgacgacggctctggaactcctctcgttgagaagatctccgac- ac tgctggccagaagggtactggaaagtggaccgctatcaacgctcttgaccttggtatgcccgtcaccctgatcg- gt gaggccgtcttcgctcgatgcctttctgccctcaagcaggagcgtgtccgagcttccaaggttcttgatggccc- cg agcccgtcaagttcactggtgacaagaaggagtttgtcgaccagctcgagcaggccctttacgcctccaagatc- at ctcttacgcccagggtttcatgcttatccgagaggccgccaagacctacggctgggagctcaacaacgccggta- tt gccctcatgtggcgaggtggttgcatcatccgatccgtcttccttgctgacatcaccaaggcttaccgacagga- cc ccaacctcgagaacctgctgttcaacgacttcttcaagaacgccatctccaaggccaacccctcttggcgagct- ac cgtggccaaggctgtcacctggggtgttcccactcccgcctttgcctcggctctggctttctacgacggttacc- ga tctgccaagctccccgctaacctgctccaggcccagcgagactacttcggcgcccacacctaccagctcctcga- tg gtgatggaaagtggatccacaccaactggaccggccgaggtggtgaggtttcttcttccacttacgatgataa Protein: mtdtsnikpvadialiglavmgqnlilnmadhgyevvaynrttskvdhfleneakgksiigahsikelcallkr- pr riillvkagaavdsfieqllpyldkgdiiidggnshfpdsnrryeelnekgilfvgsgvsggeegarygpsimp- gg nkeawphikkifqdisakadgepccdwygdagaghfvkmvhngieygdmqliceaydlmkrgagftneeigdyf- ak wnngildsflieitrdifkyddgsgtplvekisdtagqkgtgkwtainaldlgmpvtligeavfarclsalkqe- rv raskvldgpepvkftgdkkefvdqleqalyaskiisyaqgfmlireaaktygwelnnagialmwrggciirsvf- la ditkayrqdpnlenllfndffknaiskanpswratvakavtwgvptpafasalafydgyrsaklpanllqaqrd- yf gahtyqlldgdgkwihtnwtgrggevssstyda YALI0D06303g DNA: atgctcaaccttagaaccgcccttcgagctgtgcgacccgtcactctggtgagtatctcggagcccgggacggc- ta ccaacacacaagcaagatgcaacagaaaccggactttttaaatgcggattgcggaaaatttgcatggcggcaac- ga ctcggagaaggagcgggacaattgcaatggcaggatgccattgacgaactgagggtgatgagagaccgggcctc- cg atgacgtggtggtgacgacagcccggctggtgttgccgggactgtctctgaaaagcaatttctctatctccggt- ct caacagactccccttctctagctcaattggcattgtcttcagaaggtgtcttagtggtatccccattgttatct- tc ttttccccaatgtcaatgtcaatgtcaatggctccgacctctttcacattaacacggcgcaaacacagatacca- cg gaaccgactcaaacaaatccaaagagacgcagcggaataattggcatcaacgaacgatttgggatactctggcg- ag aatgccgaaatatttcgcttgtcttgttgtttctcttgagtgagttgtttgtgaagtcgtttggaagaaggttc- cc aatgtcacaaaccataccaactcgttacagccagcttgtaatcccccacctcttcaatacatactaacgcagac- cc gatcctacgccacttccgtggcctctttcaccggccagaagaactccaacggcaagtacactgtgtctctgatt- ga gggagacggtatcggaaccgagatctccaaggctgtcaaggacatctaccatgccgccaaggtccccatcgact- gg gaggttgtcgacgtcacccccactctggtcaacggcaagaccaccatccccgacagcgccattgagtccatcaa- cc gaaacaaggttgccctcaagggtcccctcgccacccccatcggtaagggccacgtttccatgaacctgactctg- cg acgaaccttcaacctgttcgccaacgtccgaccttgcaagtccgtcgtgggctacaagaccccttacgagaacg- tc gacaccctgctcatccgagagaacactgagggtgagtactccggtatcgagcacaccgtcgtccccggtgtcgt- tc agtccatcaagctgatcacccgagaggcttccgagcgagtcatccggtacgcttacgagtacgccctgtcccga- gg catgaagaaggtccttgttgtccacaaggcctctattatgaaggtctccgatggtcttttccttgaggttgctc- ga gagctcgccaaggagtacccctccattgacctttccgtcgagctgatcgacaacacctgtctgcgaatggtcca- gg accccgctctctaccgagatgtcgtcatggtcatgcccaacctttacggtgacattctgtccgatcttgcctcc- gg tcttatcggtggtcttggtctgaccccctccggtaacatgggtgacgaggtctccatcttcgaggccgtccacg- ga tccgctcccgacattgctggcaagggtcttgctaaccccactgctctgctgctctcctccgtgatgatgctgcg- ac acatgggtctcaacgacaacgccaccaacatcgagcaggccgtctttggcaccattgcttccggccccgagaac- cg aaccaaggatcttaagggtaccgccaccacttctcactttgctgagcagattatcaagcgactcaagtag Protein: mlnlrtalravrpvtltrsyatsvasftgqknsngkytvsliegdgigteiskavkdiyhaakvpidwevvdvt- pt lvngkttipdsaiesinrnkvalkgplatpigkghvsmnltlrrtfnlfanvrpcksvvgyktpyenvdtllir- en tegeysgiehtvvpgvvqsiklitreaserviryayeyalsrgmkkvlvvhkasimkvsdglflevarelakey- ps idlsvelidntclrmvqdpalyrdvvmvmpnlygdilsdlasgligglgltpsgnmgdevsifeavhgsapdia- gk glanptalllssvmmlrhmglndnatnieqavfgtiasgpenrtkdlkgtattshfaeqiikrlk

Example 11

Regulatory Sequences

[0340] Sequences which consist of, consist essentially of, or comprise the following regulatory sequences (e.g. promoters and terminator sequences, including functional fragments thereof) may be useful to control expression of endogenous and heterologous genes in recombinant fungi described herein.

TABLE-US-00022 Met2 promoter 5'cctctcactttgtgaatcgtgaaacatgaatcttcaagccaagaatgttaggcaggggaagctttctttcag- ac tttttggaattggtcctcttttggacattattgacgatattattattttttccccgtccaatgttgacccttgt- aa gccattccggttctggagcgcatctcgtctgaaggagtcttcgtgtggctataactacaagcgttgtatggtgg- at cctatgaccgtctatatagggcaacttttgctcttgttcttccccctccttgagggacgtatggcaatggctat- ga caactatcgtagtgagcctctataacccattgaagtacaagtcctccaccttgctgccaaactcgcgagaaaaa- aa gtccaccaactccgccgggaaatactggagaacacctctaagacgtgggcttctgcacctgtgtggcttgggtc- tg ggttttgcgagctctgagccacaacctaaggacggtgtgattgggagataagtagtcgttggttttctaatcgc- ac gtgatatgcaagccacacttataacacaatgaagacaggccgatgaactgcatgtcattgtacaggtgcggaga- gc aagaaactctggggcggaggtgaaagatgagacaaaaagcctcaggtgcaaggtagggagttgatcaacgtcaa- ac acaaataatctaggttgttaggcagctaaacatgtatataactgggctgccaccgagtgttacttgtcattaac- gt cgcattttcgcctacacaaaatttgggttactcgccactacactgctcaaatctttcagctgtgcaacaagatt- ca ggtcacacatagactcgcataaggacccgggtcatctgttattctccactggtaaaccaatagtcctagctgat- tt gggtacagaagctcactttcacatcttttcatcttatctacaaccatc Met3 promoter 5'atctgtgaggagcccctggcgtcactgtcgactgtgccggcatttctgatggtatttccagccccgcagttc- tc gagacccccgaacaaatgtgccacacccttgccaaaatgacgaatacacggcgtcgcggccgggaatcgaactc- tt ggcaccgccacaggagtgaaatttgaaatttgaaatttgaaaaataattcacattttgagtttcaataatatat- cg atgaccctcccaaaagacccaagtcgagacgcaaaaaaacacccagacgacatggatgcggtcacgtgaccgca- aa aaccgccccggaaatccgtttgtgacgtgttcaattccatctctatgtttttctgcggtttctacgatgccgca- at ggtggccaatgtgcgtttcactgccgtagtggctggaacaagccacagggggtcgtcgggccaatcagacggtc- cc tgacatggttctgcgccctaacccgggaactctaacccccgtggtggcgcaatcgctgtcttcatgtgctttat- ct cacgtgacggctggaatctggcagaagacggagtatgtacattttgtcgttggtcacgttatccctaaaacgtg- gt gtttaaactggtcgaatgcttggcccagaacacaagaagaaaaaaacgagacaacttgatcagtttcaacgcca- ca gcaagcttgtcttcactgtggttggtcttctccacgccacaagcaacacgtacatgtcaattacgtcagggtct- tt taagttctgtggcttttgaaccagttataaagaaccaaccacccttttttcaaagctaatcaagacggggaaat- tt tttttttgatatttttcgaca Met6 promoter 5'gatactgcagacggtgcattacttacccgtgtcgactgagagtctacttggtacttggccctgtggctaagc- ag tatttgagcaacaatgcaatgcagttgctgactcggttccagatccccttgccccgatgtgtggaagcgttgtt- tt tggggcaagggcatgtgggggctgcatcatactgtggctggggccgttggaagagccgtcggcagcgagcctga- gt cgcttctcggggccttattccccccgcctctaggtcagcggcggccgaagtgtcgtactcagctcgcctgtaca- gt atgacgtgaccgaatagcctctggaaggttggagaagtacagtgcaaaaaaaagttgcaaaatttcattttagc- gt tcgatccgacgtggcagttggacaatgaatcgatggagacatgatcatgggcagaaatagaaggtaccatgttc- aa tggcagtaccaattgagcaacagacgggtcgacaggcggcgggcacaccatccgccaccacatggcgcaatcgt- ca gtgcagcgattcgtactcggattgcatcatgttgcaccgaaagttggggcccgcacgttggagaggcgaggagc- ca gggttagctttggtggggtcctttgttgtcacgtggcatcagcgaatggcgtcctccaatcagggccgtcagcg- aa gtcggcgtgtgatagtgcgtggggagcgaatagagtttctgggggggggcggcccaaaacgtgaaatccgagta- cg catgtagagtgtaaattgggtgtatagtgacattgtttgactctgaccctgagagtaatatataatgtgtacgt- gt ccccctccgttggtcttctttttttacctttctcctaaccaacacccaaactaatcaatc Met25 promoter 5'aagtcgtattaacataactttccttacatttttttaaagcacgtcactatccacgtgacctagccacgcgat- ac caagtattcatccataatgacacactcatgacgtccggaggacgtcatcatcgtccagtcacgtgccaaggcac- at gactaatcataacaccttatgactagcttctgaatcgctacacagttccaattcgcaaataaactcgaaatgac- ga aatgccataataaaaatgacgaaactcgagattgagagcagcacatgcactgaagtggtggacaaccagcgtat- cc ggagacacgacggatccagcaccatggaagctggccgaaaaagagatccccagcacattgagcaaccaagtcag- ct caattgagtaacatcacacactcagatcgagtctgatggtggtcccatttgttccttcacttgaaaaataattg- aa aataacaataacaataaaaataaaaacaaaataaaaataaaaataaaaataaaaataaaaaaataaaaaaacct- tg ccgcatttagcgtcagccaccccccgcattgacctgagtacgttggattgaccccgatcctgcacgtcgagcgt- gg tcggccaaaaagcgcccgtggctggtgagtcagaaatagcagggttgcaagagagagctgcgcaacgagcaata- aa cggtgtttttttcgcttctgtgctgcttagagtggagagccgaccctcgccatgctcacgtgaccattcacgtg- gt tgcaaactccaccttagtatagccgtgtccctacgctacccattatcgcatcgtactccagccacatttttttg- tt ccccgctaaatccggaaccttatctgggtcacgtgaaattgcaatacgacaggaggttatacttatagagtgag- ac actccacgcaaggtgttgcaagtcaattgacaccacctcacctcagactaacatccaca Pox2 promoter 5'gaatctgcccccacattttataccgcttttgactgtttttctcccccctttcacactctgcttttggctaca- ta aaccccgcaccgtttggaactctgttggtccggggaagccgccgttaggtgtgtcagatggagagcgccagacg- ag cagaaccgagggacagcggatcgggggagggctgtcacgtgacgaagggcactgttgacgtggtgaatgtcgcc- cg ttctcacgtgacccgtctcctctatatgtgtatccgcctctttgtttggttttttttctgcttccccccccccc- cc cccaccccaatcacatgctcagaaagtagacatctgcatcgtcctgcatgccatcccacaagacgaacaagtga- ta ggccgagagccgaggacgaggtggagtgcacaaggggtaggcgaatggtacgattccgccaagtgagactggcg- at cgggagaagggttggtggtcatgggggatagaatttgtacaagtggaaaaaccactacgagtagcggatttgat- ac cacaagtagcagagatatacagcaatggtgggagtgcaagtatcggaatgtactgtacctcctgtactcgtact- cg tacggcactcgtagaaacggggcaatacgggggagaagcgatcgcccgtctgttcaatcgccacaagtccgagt- aa tgctcgagtatcgaagtcttgtacctccctgtcaatcatggcaccactggtcttgacttgtctattcatactgg- ac aagcgccagagttaagcttgtagcgaatttcgccctcggacatcaccccatacgacggacacacatgcccgaca- aa cagcctctcttattgtagctgaaagtatattgaatgtgaacgtgtacaatatcaggtaccagcgggaggttacg- gc caaggtgataccggaataaccctggcttggagatggtcggtccattgtactgaagtgtccgtgtcgtttccgtc- ac tgccccaattggacatgtttgtttttccgatctttcgggcgccctctccttgtctccttgtctgtctcctggac- tg ttgctaccccatttctttggcctccattggttcctccccgtctttcacgtcgtctatggttgcatggtttccct- ta tacttttccccacagtcacatgttatggaggggtctagatggaggcctaattttgacgtgcaaggggcgaattg- gg gcgagaaacacgtcgtggacatggtgcaaggcccgcagggttgattcgacgatttccgcgaaaaaaacaagtcc- aa atacccccgtttattctccctcggctctcggtatttcacatgaaaactataacctagactacacgggcaacctt- aa ccccagagtatacttatataccaaagggatgggtcctcaaaaatcacacaagcaacg Yef3 (YALI0E13277g) promoter 5'cgccattcggttccttccagaccattccagatcaatccacctcttcttatctcaggtgggtgtgctgacatc- ag accccgtagcccttctcccagtggcgaacagcaggcataaaacagggccattgagcagagcaaacaaggtcggt- ga aatcgtcgaaaaagtcggaaaacggttgcaagaaattggagcgtcacctgccaccctccaggctctatataaag- ca ttgccccaattgctaacgcttcatatttacacctttggcaccccagtccatccctccaataaaatgtactacat- gg gacacaacaagagaggatgcgcgcccaaaccctaacctagcacatgcacgatgattctattgtctgtgaaaaaa- tt tttccaccaaaatttccccattgggatgaaaccctaaccgcaaccaaaagtttttaactatcatcttgtacgtc- ac ggtttccgattcttctcttctctttcatcatcatcacttgtgacc Cam1 (YALI0C24420g) promoter 5'aactaccataaagtaccgagaaatataggcaattgtacaaattgtccacctccttcacttacattaccgaac- ca tggccatatcaccaaaataccccgagtgctaaaacacctccctccaaatgttctcttaccttccaccgaaaacc- ga tcttattatcccaacgcttgttgtggcttgacgcgccgcacccgctgggcttgccatttcgataccaatccaag- ag gaaaagctcatgagaaacaatcggaatatcacgagaacggcctggcgaaccaacaggatatttttgaatataat- ta cccctcgaatctagtcatatctatgtctactgtagacttgggcggcatcatgatgtacattattttagcgtctg- ga accctaaagttcacgtacaatcatgtgacaaacgaggctaaaaaatgtcaatttcgtatattagtgttattacg- tg gctcacatttccgaatcatctaccaccccccacctaaaaa YALI0D16467g promoter

5'tttttttaattttcatatttattttcatatttattttcatatttattttcatttatttattcatgtatttat- tt attactttttaagtattttaaactcctcactaaaccgtcgattgcacaatattaaccttcattacacctgcagc- gt ggtttttgtggtcgttagccgaagtcttccaacgtgggtataagtaggaacaattgggccgattttttgagccg- tc taaatctctcgactcaattgatctgctgtcgaaaatccggctctctagctccttttccccgtccgctggagctc- ct cttcattgtgccgtttttccaacatttaactttgccacccaccaccacccccactaccatcacccactcgatct- ct gttcgtgtcaccacgactttgtcttctcacacatactctgtttgtgcaccacacattgcgaa Tef4 (YALI0B12562g) promoter 5'gctacaatagattattggccctattgagcacgctacaattcggtccagtatgtacaacgtctatgcgcacta- ac ggccatacagtgagttacagcacacccaaaagtaaccctgcctgacctgtctgcctgagacaggaagattaact- ct tgtagtgaccgagctcgataagactcaagccacacaatttttttatagccttgcttcaagagtcgccaaaatga- ca ttacacaactccacggaccgtcggttccatgtccacacccttggatgggtaagcgctccacgcacgtaccacgt- gc attgagtttaaccacaaacataggtctgtgtcccagagttaccctgctgcatcagccaagtcttgaaagcaaaa- tt tcttgcacaatttttcctcttcttttcttcactgatcgcagtccaaacacaaaca YALI0D12903g promoter 5'gcgctctgatccacttgtatggctccaagttcagtgtaccaagtagttggtgatgcagggagggatgtctct- at ccaccaataatgaactcatgggcgaaattgtttctgttaaacactccaactgtcgttttaaatctcattctctt- tg catttggactccattcgcttccgttgggccaatataatccatcgtaacgtactttagatggaaatttagttacc- tg ctacttgtctcaacaccccaacaggggctgttcgacagaggtaatagagcgtcaatgggttaataaaaacacac- tg tcgattttcactcattgtctttatgatattacctgttttccgctgttatcaatgccgagcatcgtgttatatct- tc caccccaactacttgcatttacttaactattacctcaactatttacaccccgaattgttacctcccaataagta- ac tttatttcaaccaatgggacgagagcatactgagaacatcgatctatctagtcaatattgcccagaatcgttcg- aa aaaaaacaccaaaaggtttacagcgccattataaatataaattcgttgtcaattcccccgcaatgtctgttgaa- at ctcattttgagaccttccaacattaccactctcccgtctggtcacatgacgtgactgcttcttcccaaaacgaa- ca ctcccaactcttcccccccgtcagtgaaaagtatacatccgacctccaaatcttttcttcactcaac Tef1 (YALI0C09141g) promoter 5'agagacgggttggcggcgtatttgtgtcccaaaaaacagccccaattgccccaattgaccccaaattgaccc- ag tagcgggcccaaccccggcgagagcccccttcaccccacatatcaaacctcccccggttcccacacttgccgtt- aa gggcgtagggtactgcagtaggaatctacgcttgttcagactttgtactagtttctttgtctggccatccgggt- aa cccatgccggacgcaaaatagactactgaaaatttttttgctttgtggttgggactttagccaagggtataaaa- ga ccaccgtccccgaattacctttcctcttcttttctctctctccttgtcaactcacacccgaaatcgttaagcat- tt ccttctgagtataagaatcattc Fba1 (YALI0E26004g) promoter 5'gctgcgctgatctggacaccacagaggttccgagcactttaggttgcaccaaatgtcccaccaggtgcaggc- ag aaaacgctggaacagcgtgtacagtttgtcttagcaaaaagtgaaggcgctgaggtcgagcagggtggtgtgac- tt gttatagcctttagagagcgaaagcgcgtatggatttggctcatcaggccagattgagggtctgtggacacatg- tc atgttagtgtacttcaatcgccccctggatatagccccgacaataggccgtggcctcatttttttgccttccgc- ac atttccattgctcggtacccacaccttgatctcctgcacttgccaaccttaatactggtttacattgaccaaca- tc ttacaagcggggggcttgtctagggtatatataaacagtggctctcccaatcggttgccagtctcttttttcat- tc tttccccacagattcgaaatctaaactacacatc Pox2 terminator: 5'gatgaggaatagacaagcgggtatttattgtatgaataaagattatgtattgattgcaaaaaagtgcatttg- ta gatgtggtttattgtagagagtacggtatgtactgtacgaacattaggagctacttctacaagtagattttctt- aa caagggtgaaatttactaggaagtacatgcatatttcgttagtagaatcacaaaagaaatgtacaagcacgtac- ta cttgtactccacaatgtggagtgggagcaaaaaaattggacgacaccggaatcgaaccggggacctcgcgcatg- ct aagcgcatgtgataaccaactacaccagacgcccaagaactttcttggtgattatggaatacgtggtctgctat- at ctcaattttgctgtaatgaatcattagaattaaaaaaaaaaccccatttttgtgtgattgtcggccaagagatg- ga acaggaagaatacgtgaacaagcgagcacgaatgccatatgctcttctgaacaaccgagtccgaatccgatttg- tg ggtatcacatgtacaagtagagaaatgtatttcgctagaataaaataaatgagattaagaattaaaaatattgg- aa tatattttcctagaatagaaactttggattttttttcggctattacagtagaactggacaaacggctgactata- ta taaatattattgggtctgttttcttgtttatgtcgaaattatctgggttttactactgtgtcgtcgagtataga- gt ggcctgactggagaaaatgcagtagtatggacagtaggtactgccagccagagaagtttttggaattgatactt- ga gtcatttttccattccccattccccattccaacacaatcaactgtttctgaacattttccaaaacgcggagatg- ta tgtcacttggcactgcaagtctcgattcaaaatgcatctctttcagaccaaagtgtcatcagctttgtttggcc- cc aaattaccgcaaatacttgtcgaaattgaagtgcaatacggcctcgtctgccatgaaacctgcctattctcttc- aa attggcgtcaggtttcacgtccagcattcctcgcccagacagagttgctatggttgaatcgtgtactgttaata- ta tgtatgtattatactcgtactacgatatactgttcaatagagtctcttataatcgtacgacgattctgggca

Example 12

Cultures Conditions, Such as Limitation for Nitrogen, Magnesium or Phosphate, can Promote Lipid Accumulation in Y. lipolytica

[0341] 12a. Strains Used to Analyze Lipid Accumulation During Growth Under Various Conditions.

[0342] Strains MF760, MF858, and MF921 were grown under an array of culture conditions, and then harvested cells were extracted and analyzed for total lipid content and levels of specific lipophilic metabolites. FIG. 12 depicts schematic representations of certain plasmids generated and described in this example. Strain MF760 has genotype MATB ura2::URA2/tef-GGS1 ChrA-1635618::tef-carB ura3-302 ade1::?ADE1/tef-HMG1trunc leu2::?LEU2/tef-carRP (questions marks denote presumed loci of chromosomal integration). Strain MF760 harbors four insertion plasmids, pMB4637, pMB4591, pMB4705, and pMB4660, which encode native or heterologous genes required for synthesis of either isoprenoid metabolites in general, or carotenoid metabolites specifically. In all insertion plasmids, except pMB4789, described in this example, the Y. lipolytica TEF 1 promoter and XPR2 terminator were the regulatory sequences used to control expression of genes of interest. Also, in some instances multiple URA3-containing plasmids can be utilized in the same strain, since 5-fluoroorotic acid can be used to select for Ura.sup.- segregants following transformation with a URA3 plasmid. pMB4637 is an ADE1 plasmid that encodes a truncated variant of the Y. lipolytica HMG-CoA reductase. pMB4591 is a URA5 plasmid that encodes the Y. lipolytica geranylgeranylpyrophosphate synthase. pMB4705 is a LEU2 plasmid that encodes the phytoene synthase/lycopene cyclase (CarRP) from Mucor circinelloides. pMB4660 is a URA3 plasmid that encodes a phytoene dehydrogenase from M. circinelloides.

[0343] Strain MF858 has genotype MATB ura2::URA2/tef-GGS1 ChrA-1635618::tef-carB ura3-302::?URA3/tef-plasmid ade1::?ADE1/tef-HMG1trunc leu2::?LEU2/tef-carRP. Strain MF858 harbors the same four plasmids as MF760, and an addition control plasmid (pMB4691), which is a URA3 plasmid that contains regulatory sequences but no gene of interest.

[0344] Strain MF921 has genotype MATB erg9-3'UTR::URA3 ura2::URA2/tef-GGS1 ChrA-1635618::tef-carB ura3-302 ade1::?ADE1/tef-HMG1trunc leu2::?LEU2/tef-carRP. Strain MF921 harbors the same four plasmids as MF760, and an addition URA3 plasmid, pMB4789, which contains sequences for insertion into the 3' UTR of the native ERG9. Insertion into 3' UTR of ERG9 is presumed to result in a hypomorphic mutation to decrease squalene synthase activity.

12b. Lipid Accumulation in Media Containing Various Carbon: Nitrogen Ratios

[0345] Shake flask testing was conducted using carbon to nitrogen (C/N) ratios of 160, 80, 60, 40, 30, 20, and 10 with yeast nitrogen base being the base medium providing vitamins, trace elements and salts. Ammonium sulfate (which contains 21% nitrogen) was used as the nitrogen source and glucose (which contains 40% carbon) was used as the carbon source at a concentration of 30 g/L. The concentrations of ammonium sulfate corresponding to these ratios are: 0.36, 0.71, 0.95, 1.43, 1.91, 2.86, and 4.6 g/L, respectively. Uracil was supplemented at 0.2 mM. As controls, strains were also grown in yeast extract-peptone with 50 g/L of glucose (media in which lipids do not accumulate at high levels) and yeast extract-peptone with 5% olive oil (v/v) (media in which lipids accumulate at high levels). Strain MF760 (10-14 ml of culture) was harvested after 4 days of growth at 30.degree. C., during which time the cultures were shaking at 250 rpm. Following harvesting, cells were washed three times with water, with the exception of the oil-grown cells which were washed three times in 0.5% BSA and one time with water before lipid extractions. Lipids were extracted as described in Folch J, Lees, M, and Stanley, G. H. S. J. Biol. Chem. 226: 497-509, 1957. In brief, cell pellets were resuspended in 6 ml of water. A 1 ml aliquot was transferred to a pre-weighed tube with a hole on the lid, spun down and the cell pellet lyophilized overnight to determine the dry cell weight. The remaining 5 ml were placed in a 15 ml Falcon tube and spun down. Cell pellets were frozen at -20.degree. C. until extractions were performed. Two to three volumes of a Zymolyase solution (2 mg/ml Zymolyase 100T in 1M Sorbitol, 50 mM EDTA and 0.01% .beta.-mercaptoethanol) was added to each cell pellet and placed at 37.degree. C. with constant agitation for 1 hr. Two volumes of cubic zirconia beads were added to each tube and vortexed for 15-20 min. Samples were viewed under a microscope to ensure cell breakage before continuing with extractions. After cell breakage was complete, 6 ml of extraction solvent was added (a 2:1 mix of chloroform and methanol) and mixed. The mixture was spun down for 5 min at 3000 rpm and the organic layer was transferred to a clean tube. NaCl was added to the remaining aqueous layer to make it a 0.29% NaCl solution. 6 ml of extraction solvent was added and mixed, and the mixture was spun down for 5 min. The organic layers were pooled and filtered through a 0.2 .mu.m an filter to get rid of any cell debris. The extract was washed with 0.2 volumes of 0.29% NaCl solution and another 6 ml of extraction solvent added and mixed. Mixtures were spun and the organic layer was placed in a pre-weighed glass vial, the solvent was evaporated under a flow on nitrogen and the vial was weighed again to determine the weight of the lipid extracted. The dry cell weight is used to determine the percentage of lipid per dry cell weight. The lipid accumulation results are in the Table 66 below:

TABLE-US-00023 TABLE 66 Lipid accumulation under various carbon:nitrogen ratio growth conditions C/N Ratio % lipid YNB 160 61 3% Glucose 80 49 60 34 40 17 30 16 20 14 10 15 YEP 5% Glucose 22 5% olive oil 38

Other nitrogen sources tested were proline (12% nitrogen), sodium glutamate (7% nitrogen), soy acid hydrolysate (12% nitrogen), and yeast extract-peptone (26.8% nitrogen). All nitrogen sources tested at C/N ratios of 80 (with glucose as a carbon source), had significantly larger lipid bodies than at C/N ratios of 10 (also with glucose as a carbon source).

[0346] Strains MF858 and MF921 were harvested after 4 days of growth at 30.degree. C. (3% glucose was used as the carbon source). Cells were washed three times with water and lipids extracted as described above. Lipid accumulation data for soy hydrolysate, yeast extract-peptone and yeast nitrogen base, used as a control, are listed in Table 67 below.

TABLE-US-00024 TABLE 67 Lipid accumulation under different carbon and nitrogen conditions with various nitrogen sources % lipid C/N Ratio MF858 MF921 Soy hydrolysate 80 36 36 60 36 35 10 14 15 Yeast Extract- 80 37 37 Peptone 10 15 14 Yeast Nitrogen 80 37 38 Base 10 13 11

12c. Determination of Lipid Levels Under High Carbon and Phosphate or Magnesium Limiting Conditions.

[0347] To test whether other nutrient limitations, under high carbon conditions, will allow for higher lipid accumulation, phosphate or magnesium limiting conditions were tested. For phosphate limiting conditions, yeast nitrogen base medium without phosphate was prepared. Shake flask testing was performed using carbon to phosphate ratios ranging from 5376 down to 42. This range corresponds to 7.8 mg/L up to 1 g/L, respectively, and the latter concentration corresponds to that are commonly used in yeast nitrogen base medium. Glucose, at 30 g/L, was used at the carbon source. Potassium phosphate monobasic (containing 28.7% phosphate) was used as the phosphate source.

[0348] For magnesium limiting conditions, yeast nitrogen base medium without magnesium was prepared. Shake flask testing was conducted using carbon to magnesium ratios ranging from 31360 down to 245. This range corresponds to 0.375 mg/L up to 0.5 g/L, and the latter magnesium concentration corresponds to that commonly used in yeast nitrogen base. Glucose, at 30 g/L, was used as the carbon source. Magnesium sulfate (containing 9.8% magnesium) was used as the magnesium source.

[0349] Strains MF858 and MF921 were harvested after 4 days of growth at 30.degree. C., during which time the cultures were shaking at 250 rpm. Cells were washed three times with water before lipid extraction. Lipids were extracted as described above. Lipid accumulation data is listed in the Table 68 below:

TABLE-US-00025 TABLE 68 Lipid accumulation in phosphate or magnesium limiting growth conditions % Lipid g/L MF858 MF921 phosphate 1 14 14 0.0625 18 20 0.0313 34 41 0.0156 62 63 0.0078 83 76 magnesium 0.5 12 11 0.0313 NA 16 0.0156 NA 25 0.0078 NA 42 0.0039 48 48

[0350] The following tables are referenced throughout the description. Each reference and information designated by each of the Genbank Accession and GI numbers are hereby incorporated by reference in their entirety.

TABLE-US-00026 Lengthy table referenced here US20140315279A1-20141023-T00001 Please refer to the end of the specification for access instructions.

TABLE-US-00027 Lengthy table referenced here US20140315279A1-20141023-T00002 Please refer to the end of the specification for access instructions.

TABLE-US-00028 Lengthy table referenced here US20140315279A1-20141023-T00003 Please refer to the end of the specification for access instructions.

TABLE-US-00029 Lengthy table referenced here US20140315279A1-20141023-T00004 Please refer to the end of the specification for access instructions.

TABLE-US-00030 Lengthy table referenced here US20140315279A1-20141023-T00005 Please refer to the end of the specification for access instructions.

TABLE-US-00031 Lengthy table referenced here US20140315279A1-20141023-T00006 Please refer to the end of the specification for access instructions.

TABLE-US-00032 Lengthy table referenced here US20140315279A1-20141023-T00007 Please refer to the end of the specification for access instructions.

TABLE-US-00033 Lengthy table referenced here US20140315279A1-20141023-T00008 Please refer to the end of the specification for access instructions.

TABLE-US-00034 Lengthy table referenced here US20140315279A1-20141023-T00009 Please refer to the end of the specification for access instructions.

TABLE-US-00035 Lengthy table referenced here US20140315279A1-20141023-T00010 Please refer to the end of the specification for access instructions.

TABLE-US-00036 Lengthy table referenced here US20140315279A1-20141023-T00011 Please refer to the end of the specification for access instructions.

TABLE-US-00037 Lengthy table referenced here US20140315279A1-20141023-T00012 Please refer to the end of the specification for access instructions.

TABLE-US-00038 Lengthy table referenced here US20140315279A1-20141023-T00013 Please refer to the end of the specification for access instructions.

TABLE-US-00039 Lengthy table referenced here US20140315279A1-20141023-T00014 Please refer to the end of the specification for access instructions.

TABLE-US-00040 Lengthy table referenced here US20140315279A1-20141023-T00015 Please refer to the end of the specification for access instructions.

TABLE-US-00041 Lengthy table referenced here US20140315279A1-20141023-T00016 Please refer to the end of the specification for access instructions.

TABLE-US-00042 Lengthy table referenced here US20140315279A1-20141023-T00017 Please refer to the end of the specification for access instructions.

TABLE-US-00043 Lengthy table referenced here US20140315279A1-20141023-T00018 Please refer to the end of the specification for access instructions.

TABLE-US-00044 Lengthy table referenced here US20140315279A1-20141023-T00019 Please refer to the end of the specification for access instructions.

TABLE-US-00045 Lengthy table referenced here US20140315279A1-20141023-T00020 Please refer to the end of the specification for access instructions.

TABLE-US-00046 Lengthy table referenced here US20140315279A1-20141023-T00021 Please refer to the end of the specification for access instructions.

TABLE-US-00047 Lengthy table referenced here US20140315279A1-20141023-T00022 Please refer to the end of the specification for access instructions.

TABLE-US-00048 Lengthy table referenced here US20140315279A1-20141023-T00023 Please refer to the end of the specification for access instructions.

TABLE-US-00049 Lengthy table referenced here US20140315279A1-20141023-T00024 Please refer to the end of the specification for access instructions.

TABLE-US-00050 Lengthy table referenced here US20140315279A1-20141023-T00025 Please refer to the end of the specification for access instructions.

TABLE-US-00051 Lengthy table referenced here US20140315279A1-20141023-T00026 Please refer to the end of the specification for access instructions.

TABLE-US-00052 Lengthy table referenced here US20140315279A1-20141023-T00027 Please refer to the end of the specification for access instructions.

TABLE-US-00053 Lengthy table referenced here US20140315279A1-20141023-T00028 Please refer to the end of the specification for access instructions.

TABLE-US-00054 Lengthy table referenced here US20140315279A1-20141023-T00029 Please refer to the end of the specification for access instructions.

TABLE-US-00055 Lengthy table referenced here US20140315279A1-20141023-T00030 Please refer to the end of the specification for access instructions.

TABLE-US-00056 Lengthy table referenced here US20140315279A1-20141023-T00031 Please refer to the end of the specification for access instructions.

TABLE-US-00057 Lengthy table referenced here US20140315279A1-20141023-T00032 Please refer to the end of the specification for access instructions.

TABLE-US-00058 Lengthy table referenced here US20140315279A1-20141023-T00033 Please refer to the end of the specification for access instructions.

TABLE-US-00059 Lengthy table referenced here US20140315279A1-20141023-T00034 Please refer to the end of the specification for access instructions.

TABLE-US-00060 Lengthy table referenced here US20140315279A1-20141023-T00035 Please refer to the end of the specification for access instructions.

TABLE-US-00061 Lengthy table referenced here US20140315279A1-20141023-T00036 Please refer to the end of the specification for access instructions.

TABLE-US-00062 Lengthy table referenced here US20140315279A1-20141023-T00037 Please refer to the end of the specification for access instructions.

TABLE-US-00063 Lengthy table referenced here US20140315279A1-20141023-T00038 Please refer to the end of the specification for access instructions.

TABLE-US-00064 Lengthy table referenced here US20140315279A1-20141023-T00039 Please refer to the end of the specification for access instructions.

TABLE-US-00065 Lengthy table referenced here US20140315279A1-20141023-T00040 Please refer to the end of the specification for access instructions.

TABLE-US-00066 Lengthy table referenced here US20140315279A1-20141023-T00041 Please refer to the end of the specification for access instructions.

TABLE-US-00067 Lengthy table referenced here US20140315279A1-20141023-T00042 Please refer to the end of the specification for access instructions.

TABLE-US-00068 Lengthy table referenced here US20140315279A1-20141023-T00043 Please refer to the end of the specification for access instructions.

TABLE-US-00069 Lengthy table referenced here US20140315279A1-20141023-T00044 Please refer to the end of the specification for access instructions.

TABLE-US-00070 Lengthy table referenced here US20140315279A1-20141023-T00045 Please refer to the end of the specification for access instructions.

TABLE-US-00071 Lengthy table referenced here US20140315279A1-20141023-T00046 Please refer to the end of the specification for access instructions.

TABLE-US-00072 Lengthy table referenced here US20140315279A1-20141023-T00047 Please refer to the end of the specification for access instructions.

TABLE-US-00073 Lengthy table referenced here US20140315279A1-20141023-T00048 Please refer to the end of the specification for access instructions.

TABLE-US-00074 Lengthy table referenced here US20140315279A1-20141023-T00049 Please refer to the end of the specification for access instructions.

TABLE-US-00075 Lengthy table referenced here US20140315279A1-20141023-T00050 Please refer to the end of the specification for access instructions.

TABLE-US-00076 Lengthy table referenced here US20140315279A1-20141023-T00051 Please refer to the end of the specification for access instructions.

TABLE-US-00077 Lengthy table referenced here US20140315279A1-20141023-T00052 Please refer to the end of the specification for access instructions.

TABLE-US-00078 Lengthy table referenced here US20140315279A1-20141023-T00053 Please refer to the end of the specification for access instructions.

TABLE-US-00079 Lengthy table referenced here US20140315279A1-20141023-T00054 Please refer to the end of the specification for access instructions.

TABLE-US-00080 Lengthy table referenced here US20140315279A1-20141023-T00055 Please refer to the end of the specification for access instructions.

TABLE-US-00081 Lengthy table referenced here US20140315279A1-20141023-T00056 Please refer to the end of the specification for access instructions.

TABLE-US-00082 Lengthy table referenced here US20140315279A1-20141023-T00057 Please refer to the end of the specification for access instructions.

TABLE-US-00083 Lengthy table referenced here US20140315279A1-20141023-T00058 Please refer to the end of the specification for access instructions.

TABLE-US-00084 Lengthy table referenced here US20140315279A1-20141023-T00059 Please refer to the end of the specification for access instructions.

TABLE-US-00085 Lengthy table referenced here US20140315279A1-20141023-T00060 Please refer to the end of the specification for access instructions.

TABLE-US-00086 Lengthy table referenced here US20140315279A1-20141023-T00061 Please refer to the end of the specification for access instructions.

TABLE-US-00087 Lengthy table referenced here US20140315279A1-20141023-T00062 Please refer to the end of the specification for access instructions.

TABLE-US-00088 Lengthy table referenced here US20140315279A1-20141023-T00063 Please refer to the end of the specification for access instructions.

TABLE-US-00089 Lengthy table referenced here US20140315279A1-20141023-T00064 Please refer to the end of the specification for access instructions.

TABLE-US-00090 Lengthy table referenced here US20140315279A1-20141023-T00065 Please refer to the end of the specification for access instructions.

TABLE-US-00091 Lengthy table referenced here US20140315279A1-20141023-T00066 Please refer to the end of the specification for access instructions.

TABLE-US-00092 Lengthy table referenced here US20140315279A1-20141023-T00067 Please refer to the end of the specification for access instructions.

TABLE-US-00093 Lengthy table referenced here US20140315279A1-20141023-T00068 Please refer to the end of the specification for access instructions.

TABLE-US-00094 Lengthy table referenced here US20140315279A1-20141023-T00069 Please refer to the end of the specification for access instructions.

TABLE-US-00095 Lengthy table referenced here US20140315279A1-20141023-T00070 Please refer to the end of the specification for access instructions.

TABLE-US-00096 Lengthy table referenced here US20140315279A1-20141023-T00071 Please refer to the end of the specification for access instructions.

TABLE-US-00097 Lengthy table referenced here US20140315279A1-20141023-T00072 Please refer to the end of the specification for access instructions.

TABLE-US-00098 Lengthy table referenced here US20140315279A1-20141023-T00073 Please refer to the end of the specification for access instructions.

TABLE-US-00099 Lengthy table referenced here US20140315279A1-20141023-T00074 Please refer to the end of the specification for access instructions.

TABLE-US-00100 Lengthy table referenced here US20140315279A1-20141023-T00075 Please refer to the end of the specification for access instructions.

TABLE-US-00101 Lengthy table referenced here US20140315279A1-20141023-T00076 Please refer to the end of the specification for access instructions.

TABLE-US-00102 Lengthy table referenced here US20140315279A1-20141023-T00077 Please refer to the end of the specification for access instructions.

TABLE-US-00103 Lengthy table referenced here US20140315279A1-20141023-T00078 Please refer to the end of the specification for access instructions.

TABLE-US-00104 Lengthy table referenced here US20140315279A1-20141023-T00079 Please refer to the end of the specification for access instructions.

TABLE-US-00105 Lengthy table referenced here US20140315279A1-20141023-T00080 Please refer to the end of the specification for access instructions.

TABLE-US-00106 Lengthy table referenced here US20140315279A1-20141023-T00081 Please refer to the end of the specification for access instructions.

TABLE-US-00107 Lengthy table referenced here US20140315279A1-20141023-T00082 Please refer to the end of the specification for access instructions.

TABLE-US-00108 Lengthy table referenced here US20140315279A1-20141023-T00083 Please refer to the end of the specification for access instructions.

TABLE-US-00109 Lengthy table referenced here US20140315279A1-20141023-T00084 Please refer to the end of the specification for access instructions.

TABLE-US-00110 Lengthy table referenced here US20140315279A1-20141023-T00085 Please refer to the end of the specification for access instructions.

TABLE-US-00111 Lengthy table referenced here US20140315279A1-20141023-T00086 Please refer to the end of the specification for access instructions.

TABLE-US-00112 Lengthy table referenced here US20140315279A1-20141023-T00087 Please refer to the end of the specification for access instructions.

TABLE-US-00113 Lengthy table referenced here US20140315279A1-20141023-T00088 Please refer to the end of the specification for access instructions.

TABLE-US-00114 Lengthy table referenced here US20140315279A1-20141023-T00089 Please refer to the end of the specification for access instructions.

TABLE-US-00115 Lengthy table referenced here US20140315279A1-20141023-T00090 Please refer to the end of the specification for access instructions.

TABLE-US-00116 Lengthy table referenced here US20140315279A1-20141023-T00091 Please refer to the end of the specification for access instructions.

TABLE-US-00117 Lengthy table referenced here US20140315279A1-20141023-T00092 Please refer to the end of the specification for access instructions.

TABLE-US-00118 Lengthy table referenced here US20140315279A1-20141023-T00093 Please refer to the end of the specification for access instructions.

TABLE-US-00119 Lengthy table referenced here US20140315279A1-20141023-T00094 Please refer to the end of the specification for access instructions.

EQUIVALENTS

[0351] Those skilled in the art will recognize, or be able to understand that the foregoing description and examples are illustrative of practicing the provided invention. Those skilled in the art will be able to ascertain using no more than routine experimentation, many variations of the detail presented herein may be made to the specific embodiments of the invention described herein without departing from the spirit and scope of the present invention.

TABLE-US-LTS-00001 LENGTHY TABLES The patent application contains a lengthy table section. A copy of the table is available in electronic form from the USPTO web site (http://seqdata.uspto.gov/?pageRequest=docDetail&DocID=US20140315279A1). An electronic copy of the table will also be available from the USPTO upon request and payment of the fee set forth in 37 CFR 1.19(b)(3).

Sequence CWU 1

1

152120DNAArtificial SequenceDescription of Artificial Sequence Synthetic oligonucleotide 1ctgggtgacc tggaagcctt 20223DNAArtificial SequenceDescription of Artificial Sequence Synthetic oligonucleotide 2aagatcaatc cgtagaagtt cag 23323DNAArtificial SequenceDescription of Artificial Sequence Synthetic oligonucleotide 3aagcgattac aatcttcctt tgg 23422DNAArtificial SequenceDescription of Artificial Sequence Synthetic oligonucleotide 4ccagtccatc aactcagtct ca 22523DNAArtificial SequenceDescription of Artificial Sequence Synthetic oligonucleotide 5gcattgctta ttacgaagac tac 23623DNAArtificial SequenceDescription of Artificial Sequence Synthetic oligonucleotide 6ccactgtcct ccactacaaa cac 23731DNAArtificial SequenceDescription of Artificial Sequence Synthetic oligonucleotide 7cacaaactag tttgccacct acaagccaga t 31831DNAArtificial SequenceDescription of Artificial Sequence Synthetic oligonucleotide 8cacaaggtac caatgtgaaa gtgcgcgtga t 31928DNAArtificial SequenceDescription of Artificial Sequence Synthetic oligonucleotide 9cacaaggtac cagagaccgg gttggcgg 281040DNAArtificial SequenceDescription of Artificial Sequence Synthetic oligonucleotide 10cacaagcggc cgcgctagca tggggatcga tctcttatat 401141DNAArtificial SequenceDescription of Artificial Sequence Synthetic oligonucleotide 11cacaagcggc cgcgctagcg aatgattctt atactcagaa g 411240DNAArtificial SequenceDescription of Artificial Sequence Synthetic oligonucleotide 12cacaagcggc cgcacgcgtg caattaacag atagtttgcc 401333DNAArtificial SequenceDescription of Artificial Sequence Synthetic oligonucleotide 13cacaagctag ctggggatgc gatctcttat atc 331432DNAArtificial SequenceDescription of Artificial Sequence Synthetic oligonucleotide 14cattcactag tggtgtgttc tgtggagcat tc 321536DNAArtificial SequenceDescription of Artificial Sequence Synthetic oligonucleotide 15cacacggtct catcgaggtg tagtggtagt gcagtg 361640DNAArtificial SequenceDescription of Artificial Sequence Synthetic oligonucleotide 16cacactctag acacaaaaat gacccagtct gtgaaggtgg 401733DNAArtificial SequenceDescription of Artificial Sequence Synthetic oligonucleotide 17cacacacgcg tacacctatg accgtatgca aat 331841DNAArtificial SequenceDescription of Artificial Sequence Synthetic oligonucleotide 18tctagacaca aaaatgctct cgagaaacct cagcaagttt g 411941DNAArtificial SequenceDescription of Artificial Sequence Synthetic oligonucleotide 19gtggttgctg gccggatgag accggctcga gcaaacttgc t 412041DNAArtificial SequenceDescription of Artificial Sequence Synthetic oligonucleotide 20cagcaaccac atccacacac acccgactat tctctgtctc c 412137DNAArtificial SequenceDescription of Artificial Sequence Synthetic oligonucleotide 21ggatcccgtc tcggacagac gtcgggcgga gacagag 3722130DNAArtificial SequenceDescription of Artificial Sequence Synthetic polynucleotide 22tctagacaca aaaatgctct cgagaaacct cagcaagttt gctcgagccg gtctcatccg 60gccagcaacc acatccacac acacccgact attctctgtc tccgcccgac gtctgtccga 120gacgggatcc 1302335DNAArtificial SequenceDescription of Artificial Sequence Synthetic oligonucleotide 23cggatcccgt ctcggacatt tcttgcgaca caatg 352434DNAArtificial SequenceDescription of Artificial Sequence Synthetic oligonucleotide 24ctctagacac aaaaatgctg agagtcggac gaat 3425261DNAArtificial SequenceDescription of Artificial Sequence Synthetic polynucleotide 25ctctagacac aaaaatgctg agagtcggac gaattggcac caagacccta gccagcagca 60gcctgcgttt cgtggcaggt gctcggccca aatccacgct caccgaggcc gtgctggaga 120ccacagggct gctgaaaacc acgccccaaa accccgagtg gtctggagcc gtcaagcagg 180catctcgtct ggtggagacc gacactccga tccgagaccc gttttccatt gtgtcgcaag 240aaatgtccga gacgggatcc g 261261020DNAArtificial SequenceDescription of Artificial Sequence Synthetic polynucleotide 26ggatcccgtc tcttgtctga tgccaaagtc tctacaaagc ctcacgagat gctcgctgcc 60acactctctc aggaaatggc cgctgtcaac gcccttattc gaacccgtat ggcttctgaa 120cacgccccta gaatccccga agttacagcc cacctcgtcg aagccggcgg aaaacgattg 180cgacctatgc tgacacttgc tgctgcccga ctgtgtggat accagggcga agatcatgtc 240aaactggctg ccactgttga atttattcac actgctacac ttttgcacga tgatgtcgtg 300gacgagtctg gacaaagacg tggacgacct actgctaatc ttctttggga taacaagtcc 360tctgttcttg tgggcgacta tctttttgca cgatcgttcc agctgatggt cgaaaccggt 420tcccttcgag tgcttgacat cctcgctaac gctgccgcga caattgccga aggtgaagtg 480ctccagatga ccgctgcttc cgatcttaga actgatgaat ccgtttacct tcaggtcgtt 540cgaggtaaaa ctgccgctct tttttctgct gctactgaag ttggtggtgt tattgccgga 600gtccccgaag cccaagttcg agcacttttt gaatacggtg atgcgctggg aattgcattt 660cagattgctg atgacctcct cgactaccaa ggtgatgcta aggccactgg aaagaatgtt 720ggagatgact tcagagaaag aaaactcaca ttgcctgtta ttaaagccgt cgctcaagct 780actgatgagg aacgagcgtt ttgggttaga actattgaaa agggaaagca agctgaagga 840gatcttgaac aggctctcgc tctcatggaa aaatacggaa cacttgccgc aaccagagcc 900gatgcgcatg cttgggccga aaaagcacga accgccctcg aactgttgcc gaatcacgaa 960atcagaacaa tgctctccga cctcgccgat tatgtcgttg ctagattgtc ttaaacgcgt 1020271017DNAArtificial SequenceDescription of Artificial Sequence Synthetic polynucleotide 27ggatcccgtc tcttgtctct tgatcacgct gccaccaaac ctcatgaaca acttgccgcc 60gcccttgccg acgaccttac cgcagttaac gctatgatta gagaccgaat ggccagcgaa 120catgccccca gaattccaca agttaccgca cacctcgttg aagccggagg taaaagactt 180cgtccaatgc tgacccttgc cgccgcccgt atgtgtggct acgacggacc ttaccacatc 240caccttgcag ccaccgttga gttcattcac accgccacct tgctccatga cgatgttgtt 300gatgagtcct cccaacgtcg tggtcgacct accgctaatc ttctgtggga taatacctct 360tctgttttgg ttggagacta cctctttgca cgatcttttc agttgatggt tgagaccgga 420tcgcttcgag ttcttgacat cctggccaat gcttctgcta ccatcgccga aggtgaggtt 480ctgcaactta ccgccgccgc tgaccttgca actaccgagg acatttacat caaagttgtt 540cgaggaaaga ctgctgcact cttttctgct gctatggaag ttggtggaga aatcgccggt 600caggacccgg ctattaaaca ggctctcttt gactacggcg acgctctcgg aatttctttt 660cagattgttg atgacctgtt ggattacggt ggaacaaaag ctacaggaaa gaacgttggt 720gatgacttcc gagaaagaaa gctcaccctc cctgttattc gtgccgttgc tgctgctgat 780gctgatgaaa gagctttttg ggaacgaaca attcaaaaag gaagacaaca agatggagac 840ctggaccatg caattgccct tcttcataga cacggaaccc ttgaatccac acgacaagac 900gcaatccttt gggccgctaa agctaaagaa gcactcggag ttatcccgga ccatgctctt 960aaaaccatgc tcgttgacct tgctgactac gttgttgctc gactcaccta aacgcgt 10172828DNAArtificial SequenceDescription of Artificial Sequence Synthetic primer 28cacacggtac cggtataggc acaagtgc 282932DNAArtificial SequenceDescription of Artificial Sequence Synthetic primer 29cactcggatc cgtacgtctg tggtgctgtg gt 323036DNAArtificial SequenceDescription of Artificial Sequence Synthetic primer 30cacacggatc cgctagctct gagaaacctc accatg 363131DNAArtificial SequenceDescription of Artificial Sequence Synthetic primer 31cacactctag attttcttgg ccatgaacgg t 31321210DNAArtificial SequenceDescription of Artificial Sequence Synthetic polynucleotide 32ttctagacac aaaaatgctg cgagtgggcc gaatcggcac caagaccctg gcctcttctt 60ctctgcgatt cgtggccggc gcccgaccca agtctaccct gaccgaggcc gtgctggaga 120ccaccggcct gctgaagacc accccccaga accccgagtg gtctggcgcc gtgaagcagg 180cctctcgact ggtggagacc gacaccccca tccgagaccc cttctctatc gtgtctcagg 240agatgcaggg ccaggccccc acccccgacg gccaggtggc cgacgccgtg accggcaact 300gggtggacat ccacgccccc gcctggtctc gaccctacct gcgactgtct cgagccgacc 360gacccatcgg cacctggctg ctgctgatcc cctgttggtg gggcctggcc ctggccatgc 420tggacggcca ggacgcccga tggggcgacc tgtggatcgc cctgggctgt gccatcggcg 480ccttcctgat gcgaggcgcc ggctgtacct ggaacgacat caccgaccga gagttcgacg 540gccgagtgga gcgaacccga tctcgaccca tcccctctgg ccaggtgtct gtgcgaatgg 600ccgtggtgtg gatgatcgcc caggccctgc tggccctgat gatcctgctg accttcaacc 660gaatggccat cgccatgggc gtgctgtctc tgctgcccgt ggccgtgtac cccttcgcca 720agcgattcac ctggtggccc caggtgttcc tgggcctggc cttcaactgg ggcgccctgc 780tggcctggac cgcccactct ggctctctgg gctggggcgc cctgttcctg tacctggccg 840gcatcgcctg gaccctgttc tacgacacca tctacgccca ccaggacacc gaggacgacg 900ccctgatcgg cgtgaagtct accgcccgac tgttcggcgc ccagaccccc cgatggatgt 960cttacttcct ggtggccacc gtgtctctga tgggcatcgc cgtgttcgag gccgccctgc 1020ccgacgcctc tatcctggcc ctggtgctgg ccctggccgg cccctgggcc atgggctggc 1080acatggcctg gcagctgcga ggcctggacc tggacgacaa cggcaagctg ctgcagctgt 1140tccgagtgaa ccgagacacc ggcctgatcc ccctgatctt cttcgtgatc gccctgttcg 1200cctaacgcgt 1210331135DNAArtificial SequenceDescription of Artificial Sequence Synthetic polynucleotide 33ttctagacac aaaaatgctg ggctcttgtg gcgccggcct ggtgcgaggc ctgcgagccg 60agacccaggc ctggctgtgg ggcacccgag gccgatctct ggccctggtg cacgccgccc 120gaggcctgca cgccgccaac tggcagccct ctcccggcca gggcccccga ggccgacccc 180tgtctctgtc tgccgccgcc gtggtgaact ctgccccccg acccctgcag ccctacctgc 240gactgatgcg actggacaag cccatcggca cctggctgct gtacctgccc tgtacctggt 300ctatcggcct ggccgccgac cccggctgtc tgcccgactg gtacatgctg tctctgttcg 360gcaccggcgc cgtgctgatg cgaggcgccg gctgtaccat caacgacatg tgggaccgag 420actacgacaa gaaggtgacc cgaaccgcct ctcgacccat cgccgccggc gacatctcta 480ccttccgatc tttcgtgttc ctgggcggcc agctgaccct ggccctgggc gtgctgctgt 540gtctgaacta ctactctatc gccctgggcg ccgcctctct gctgctggtg accacctacc 600ccctgatgaa gcgaatcacc tactggcccc agctggccct gggcctgacc ttcaactggg 660gcgccctgct gggctggtct gccgtgaagg gctcttgtga cccctctgtg tgtctgcccc 720tgtacttctc tggcatcatg tggaccctga tctacgacac catctacgcc caccaggaca 780agaaggacga cgccctgatc ggcctgaagt ctaccgccct gctgttccga gaggacacca 840agaagtggct gtctggcttc tctgtggcca tgctgggcgc cctgtctctg gtgggcgtga 900actctggcca gaccatgccc tactacaccg ccctggccgc cgtgggcgcc cacctggccc 960accagatcta caccctggac atcaaccgac ccgaggactg ttgggagaag ttcacctcta 1020accgaaccat cggcctgatc atcttcctgg gcatcgtgct gggcaacctg tgtaaggcca 1080aggagaccga caagacccga aagaacatcg agaaccgaat ggagaactaa cgcgt 1135341093DNAArtificial SequenceDescription of Artificial Sequence Synthetic polynucleotide 34ttctagacac aaaaatgcag aaggactctc tgaacaacat caacatctct gccgagcagg 60tgctgatcac ccccgacgag ctgaaggcca agttccccct gaacgacgtg gagcagcgag 120acatcgccca ggcccgagcc accatcgccg acatcatcca cggccgagac gaccgactgc 180tgatcgtgtg tggcccctgt tctatccacg acaccgacgc cgccctggag tacgcccgac 240gactgcagct gctggccgcc gagctgaacg accgactgta catcgtgatg cgagtgtact 300tcgagaagcc ccgaaccacc gtgggctgga agggcctgat caacgacccc ttcatggacg 360gctctttcga cgtggagtct ggcctgcaca tcgcccgagg cctgctgctg cagctggtga 420acatgggcct gcccctggcc accgaggccc tggacctgaa ctctccccag tacctgggcg 480acctgttctc ttggtctgcc atcggcgccc gaaccaccga gtctcagacc caccgagaga 540tggcctctgg cctgtctatg cccgtgggct tcaagaacgg caccgacggc tctctgggca 600ccgccatcaa cgccatgcga gccgccgcca tgccccaccg attcgtgggc atcaaccaga 660ccggccaggt gtgtctgctg cagacccagg gcaacggcga cggccacgtg atcctgcgag 720gcggcaagac ccccaactac tctgcccagg acgtggccga gtgtgagaag cagatgcagg 780aggccggcct gcgacccgcc ctgatgatcg actgttctca cggcaactct aacaaggact 840accgacgaca gcccctggtg gtggagtctg ccatcgagca gatcaaggcc ggcaaccgat 900ctatcatcgg cctgatgctg gagtctcacc tgaacgaggg ctctcagtct tctgagcagc 960cccgatctga catgcgatac ggcgtgtctg tgaccgacgc ctgtatctct tgggagtcta 1020ccgagaccct gctgcgatct gtgcaccagg acctgtctgc cgcccgagtg aagcactctg 1080gcgagtaacg cgt 109335553DNAArtificial SequenceDescription of Artificial Sequence Synthetic polynucleotide 35ttctagacac aaaaatgtct gacgacgcct ctaccctgct gcgaaccatc tcttggttca 60ccgagccccc ctctgtgctg cccgagcaca tcggcgactg gctgatggag acctcttcta 120tgacccagcg actggagaag tactgtgccc agctgcgagt gaccctgtgt cgagagggct 180tcatcacccc ccagatgctg ggcgaggagc gagaccagct gcccgccgac gagcgatact 240ggctgcgaga ggtggtgctg tacggcgacg accgaccctg gctgttcggc cgaaccatcg 300tgccccagca gaccctggag ggctctggcg ccgccctgac caagatcggc aaccagcccc 360tgggccgata cctgttcgag cagaagtctc tgacccgaga ctacatccac accggctgtt 420gtgagggcct gtgggcccga cgatctcgac tgtgtctgtc tggccacccc ctgctgctga 480ccgagctgtt cctgcccgag tctcccgtgt actacacccc cggcgacgag ggctggcagg 540tgatctaacg cgt 5533623DNAArtificial SequenceDescription of Artificial Sequence Synthetic oligonucleotide 36ccttctagtc gtacgtagtc agc 233725DNAArtificial SequenceDescription of Artificial Sequence Synthetic oligonucleotide 37ccactgatct agaatctctt tctgg 253824DNAArtificial SequenceDescription of Artificial Sequence Synthetic primer 38ggctcattgc gcatgctaac atcg 243925DNAArtificial SequenceDescription of Artificial Sequence Synthetic primer 39cgacgatgct atgagcttct agacg 254036DNAArtificial SequenceDescription of Artificial Sequence Synthetic oligonucleotide 40ttctagacac aaaaatggct gcagaccaat tggtga 364133DNAArtificial SequenceDescription of Artificial Sequence Synthetic oligonucleotide 41cattaattct tctaaaggac gtattttctt atc 334221DNAArtificial SequenceDescription of Artificial Sequence Synthetic oligonucleotide 42gttctctgga cgacctagag g 214333DNAArtificial SequenceDescription of Artificial Sequence Synthetic oligonucleotide 43cacacacgcg tacacctatg accgtatgca aat 334440DNAArtificial SequenceDescription of Artificial Sequence Synthetic primer 44cacactctag acacaaaaat gacccagtct gtgaaggtgg 404534DNAArtificial SequenceDescription of Artificial Sequence Synthetic primer 45cacacggatc ctataatgcc ttccgcaacg accg 344635DNAArtificial SequenceDescription of Artificial Sequence Synthetic primer 46cacacactag ttaaatttgg acctcaacac gaccc 354740DNAArtificial SequenceDescription of Artificial Sequence Synthetic primer 47cacacggatc caatataaat gtctgcgaag agcatcctcg 404834DNAArtificial SequenceDescription of Artificial Sequence Synthetic primer 48cacacgcatg cttaagcttg gaactccacc gcac 344947DNAArtificial SequenceDescription of Artificial Sequence Synthetic primer 49cacacggatc caattttcaa aaattcttac tttttttttg gatggac 475041DNAArtificial SequenceDescription of Artificial Sequence Synthetic primer 50cacacggatc cttttttctc cttgacgtta aagtatagag g 415138DNAArtificial SequenceDescription of Artificial Sequence Synthetic primer 51cacacgagct caaaaatgga caatcaggct acacagag 385242DNAArtificial SequenceDescription of Artificial Sequence Synthetic primer 52cacaccctag gtcacttttc ttcaatggtt ctcttgaaat tg 425346DNAArtificial SequenceDescription of Artificial Sequence Synthetic primer 53cacacgagct cggaatattc aactgttttt ttttatcatg ttgatg 465442DNAArtificial SequenceDescription of Artificial Sequence Synthetic primer 54cacacggatc cttcttgaaa atatgcactc tatatctttt ag 425543DNAArtificial SequenceDescription of Artificial Sequence Synthetic primer 55cacacgctag ctacaaaatg ttgtcactca aacgcatagc aac 435638DNAArtificial SequenceDescription of Artificial Sequence Synthetic primer 56cacacgtcga cttaatgatc tcggtatacg agaggaac 38571341DNAYarrowia lipolytica 57atgtcgcaac cccagaacgt tggaatcaaa gccctcgaga tctacgtgcc ttctcgaatt 60gtcaaccagg ctgagctcga gaagcacgac ggtgtcgctg ctggcaagta caccattggt 120cttggtcaga ccaacatggc ctttgtcgac gacagagagg acatctattc ctttgccctg 180accgccgtct ctcgactgct caagaacaac aacatcgacc ctgcatctat tggtcgaatc 240gaggttggta ctgaaaccct tctggacaag tccaagtccg tcaagtctgt gctcatgcag 300ctctttggcg agaacagcaa cattgagggt gtggacaacg tcaacgcctg ctacggagga 360accaacgccc tgttcaacgc tatcaactgg gttgagggtc gatcttggga cggccgaaac 420gccatcgtcg ttgccggtga cattgccctc tacgcaaagg gcgctgcccg acccaccgga 480ggtgccggct gtgttgccat gctcattggc cccgacgctc ccctggttct tgacaacgtc 540cacggatctt acttcgagca tgcctacgat ttctacaagc ctgatctgac ctccgagtac 600ccctatgttg atggccacta ctccctgacc tgttacacaa aggccctcga caaggcctac 660gctgcctaca acgcccgagc cgagaaggtc ggtctgttca aggactccga caagaagggt 720gctgaccgat ttgactactc tgccttccac gtgcccacct gcaagcttgt caccaagtct 780tacgctcgac ttctctacaa cgactacctc aacgacaaga gcctgtacga gggccaggtc 840cccgaggagg ttgctgccgt ctcctacgat gcctctctca ccgacaagac cgtcgagaag 900accttccttg gtattgccaa ggctcagtcc gccgagcgaa tggctccttc tctccaggga 960cccaccaaca ccggtaacat gtacaccgcc tctgtgtacg cttctctcat ctctctgctg 1020acttttgtcc ccgctgagca gctgcagggc

aagcgaatct ctctcttctc ttacggatct 1080ggtcttgctt ccactctttt ctctctgacc gtcaagggag acatttctcc catcgtcaag 1140gcctgcgact tcaaggctaa gctcgatgac cgatccaccg agactcccgt cgactacgag 1200gctgccaccg atctccgaga gaaggcccac ctcaagaaga actttgagcc ccagggagac 1260atcaagcaca tcaagtctgg cgtctactac ctcaccaaca tcgatgacat gttccgacga 1320aagtacgaga tcaagcagta g 134158446PRTYarrowia lipolytica 58Met Ser Gln Pro Gln Asn Val Gly Ile Lys Ala Leu Glu Ile Tyr Val 1 5 10 15 Pro Ser Arg Ile Val Asn Gln Ala Glu Leu Glu Lys His Asp Gly Val 20 25 30 Ala Ala Gly Lys Tyr Thr Ile Gly Leu Gly Gln Thr Asn Met Ala Phe 35 40 45 Val Asp Asp Arg Glu Asp Ile Tyr Ser Phe Ala Leu Thr Ala Val Ser 50 55 60 Arg Leu Leu Lys Asn Asn Asn Ile Asp Pro Ala Ser Ile Gly Arg Ile 65 70 75 80 Glu Val Gly Thr Glu Thr Leu Leu Asp Lys Ser Lys Ser Val Lys Ser 85 90 95 Val Leu Met Gln Leu Phe Gly Glu Asn Ser Asn Ile Glu Gly Val Asp 100 105 110 Asn Val Asn Ala Cys Tyr Gly Gly Thr Asn Ala Leu Phe Asn Ala Ile 115 120 125 Asn Trp Val Glu Gly Arg Ser Trp Asp Gly Arg Asn Ala Ile Val Val 130 135 140 Ala Gly Asp Ile Ala Leu Tyr Ala Lys Gly Ala Ala Arg Pro Thr Gly 145 150 155 160 Gly Ala Gly Cys Val Ala Met Leu Ile Gly Pro Asp Ala Pro Leu Val 165 170 175 Leu Asp Asn Val His Gly Ser Tyr Phe Glu His Ala Tyr Asp Phe Tyr 180 185 190 Lys Pro Asp Leu Thr Ser Glu Tyr Pro Tyr Val Asp Gly His Tyr Ser 195 200 205 Leu Thr Cys Tyr Thr Lys Ala Leu Asp Lys Ala Tyr Ala Ala Tyr Asn 210 215 220 Ala Arg Ala Glu Lys Val Gly Leu Phe Lys Asp Ser Asp Lys Lys Gly 225 230 235 240 Ala Asp Arg Phe Asp Tyr Ser Ala Phe His Val Pro Thr Cys Lys Leu 245 250 255 Val Thr Lys Ser Tyr Ala Arg Leu Leu Tyr Asn Asp Tyr Leu Asn Asp 260 265 270 Lys Ser Leu Tyr Glu Gly Gln Val Pro Glu Glu Val Ala Ala Val Ser 275 280 285 Tyr Asp Ala Ser Leu Thr Asp Lys Thr Val Glu Lys Thr Phe Leu Gly 290 295 300 Ile Ala Lys Ala Gln Ser Ala Glu Arg Met Ala Pro Ser Leu Gln Gly 305 310 315 320 Pro Thr Asn Thr Gly Asn Met Tyr Thr Ala Ser Val Tyr Ala Ser Leu 325 330 335 Ile Ser Leu Leu Thr Phe Val Pro Ala Glu Gln Leu Gln Gly Lys Arg 340 345 350 Ile Ser Leu Phe Ser Tyr Gly Ser Gly Leu Ala Ser Thr Leu Phe Ser 355 360 365 Leu Thr Val Lys Gly Asp Ile Ser Pro Ile Val Lys Ala Cys Asp Phe 370 375 380 Lys Ala Lys Leu Asp Asp Arg Ser Thr Glu Thr Pro Val Asp Tyr Glu 385 390 395 400 Ala Ala Thr Asp Leu Arg Glu Lys Ala His Leu Lys Lys Asn Phe Glu 405 410 415 Pro Gln Gly Asp Ile Lys His Ile Lys Ser Gly Val Tyr Tyr Leu Thr 420 425 430 Asn Ile Asp Asp Met Phe Arg Arg Lys Tyr Glu Ile Lys Gln 435 440 445 591350DNAYarrowia lipolytica 59atggactaca tcatttcggc gccaggcaaa gtgattctat ttggtgaaca tgccgctgtg 60tttggtaagc ctgcgattgc agcagccatc gacttgcgaa catacctgct tgtcgaaacc 120acaacatccg acaccccgac agtcacgttg gagtttccag acatccactt gaacttcaag 180gtccaggtgg acaagctggc atctctcaca gcccagacca aggccgacca tctcaattgg 240tcgactccca aaactctgga taagcacatt ttcgacagct tgtctagctt ggcgcttctg 300gaagaacctg ggctcactaa ggtccagcag gccgctgttg tgtcgttctt gtacctctac 360atccacctat gtcccccttc tgtgtgcgaa gattcatcaa actgggtagt tcgatcaacg 420ctgcctatcg gcgcgggcct gggctcttcc gcatccattt gtgtctgttt ggctgcaggt 480cttctggttc tcaacggcca gctgagcatt gaccaggcaa gagatttcaa gtccctgacc 540gagaagcagc tgtctctggt ggacgactgg tccttcgtcg gtgaaatgtg cattcacggc 600aacccgtcgg gcatcgacaa tgctgtggct actcagggag gtgctctgtt gttccagcga 660cctaacaacc gagtccctct tgttgacatt cccgagatga agctgctgct taccaatacg 720aagcatcctc gatctaccgc agacctggtt ggtggagtcg gagttctcac taaagagttt 780ggctccatca tggatcccat catgacttca gtaggcgaga tttccaacca ggccatggag 840atcatttcta gaggcaagaa gatggtggac cagtctaacc ttgagattga gcagggtatc 900ttgcctcaac ccacctctga ggatgcctgc aacgtgatgg aagatggagc tactcttcaa 960aagttgagag atatcggttc ggaaatgcag catctagtga gaatcaatca cggcctgctt 1020atcgctatgg gtgtttccca cccgaagctc gaaatcattc gaactgcctc cattgtccac 1080aacctgggtg agaccaagct cactggtgct ggaggaggag gttgcgccat cactctagtc 1140acttctaaag acaagactgc gacccagctg gaggaaaatg tcattgcttt cacagaggag 1200atggctaccc atggcttcga ggtgcacgag actactattg gtgccagagg agttggtatg 1260tgcattgacc atccctctct caagactgtt gaagccttca agaaggtgga gcgggcggat 1320ctcaaaaaca tcggtccctg gacccattag 135060449PRTYarrowia lipolytica 60Met Asp Tyr Ile Ile Ser Ala Pro Gly Lys Val Ile Leu Phe Gly Glu 1 5 10 15 His Ala Ala Val Phe Gly Lys Pro Ala Ile Ala Ala Ala Ile Asp Leu 20 25 30 Arg Thr Tyr Leu Leu Val Glu Thr Thr Thr Ser Asp Thr Pro Thr Val 35 40 45 Thr Leu Glu Phe Pro Asp Ile His Leu Asn Phe Lys Val Gln Val Asp 50 55 60 Lys Leu Ala Ser Leu Thr Ala Gln Thr Lys Ala Asp His Leu Asn Trp 65 70 75 80 Ser Thr Pro Lys Thr Leu Asp Lys His Ile Phe Asp Ser Leu Ser Ser 85 90 95 Leu Ala Leu Leu Glu Glu Pro Gly Leu Thr Lys Val Gln Gln Ala Ala 100 105 110 Val Val Ser Phe Leu Tyr Leu Tyr Ile His Leu Cys Pro Pro Ser Val 115 120 125 Cys Glu Asp Ser Ser Asn Trp Val Val Arg Ser Thr Leu Pro Ile Gly 130 135 140 Ala Gly Leu Gly Ser Ser Ala Ser Ile Cys Val Cys Leu Ala Ala Gly 145 150 155 160 Leu Leu Val Leu Asn Gly Gln Leu Ser Ile Asp Gln Ala Arg Asp Phe 165 170 175 Lys Ser Leu Thr Glu Lys Gln Leu Ser Leu Val Asp Asp Trp Ser Phe 180 185 190 Val Gly Glu Met Cys Ile His Gly Asn Pro Ser Gly Ile Asp Asn Ala 195 200 205 Val Ala Thr Gln Gly Gly Ala Leu Leu Phe Gln Arg Pro Asn Asn Arg 210 215 220 Val Pro Leu Val Asp Ile Pro Glu Met Lys Leu Leu Leu Thr Asn Thr 225 230 235 240 Lys His Pro Arg Ser Thr Ala Asp Leu Val Gly Gly Val Gly Val Leu 245 250 255 Thr Lys Glu Phe Gly Ser Ile Met Asp Pro Ile Met Thr Ser Val Gly 260 265 270 Glu Ile Ser Asn Gln Ala Met Glu Ile Ile Ser Arg Gly Lys Lys Met 275 280 285 Val Asp Gln Ser Asn Leu Glu Ile Glu Gln Gly Ile Leu Pro Gln Pro 290 295 300 Thr Ser Glu Asp Ala Cys Asn Val Met Glu Asp Gly Ala Thr Leu Gln 305 310 315 320 Lys Leu Arg Asp Ile Gly Ser Glu Met Gln His Leu Val Arg Ile Asn 325 330 335 His Gly Leu Leu Ile Ala Met Gly Val Ser His Pro Lys Leu Glu Ile 340 345 350 Ile Arg Thr Ala Ser Ile Val His Asn Leu Gly Glu Thr Lys Leu Thr 355 360 365 Gly Ala Gly Gly Gly Gly Cys Ala Ile Thr Leu Val Thr Ser Lys Asp 370 375 380 Lys Thr Ala Thr Gln Leu Glu Glu Asn Val Ile Ala Phe Thr Glu Glu 385 390 395 400 Met Ala Thr His Gly Phe Glu Val His Glu Thr Thr Ile Gly Ala Arg 405 410 415 Gly Val Gly Met Cys Ile Asp His Pro Ser Leu Lys Thr Val Glu Ala 420 425 430 Phe Lys Lys Val Glu Arg Ala Asp Leu Lys Asn Ile Gly Pro Trp Thr 435 440 445 His 611257DNAYarrowia lipolytica 61atgaccacct attcggctcc gggaaaggcc ctcctttgcg gcggttattt ggttattgat 60ccggcgtatt cagcatacgt cgtgggcctc tcggcgcgta tttacgcgac agtttcggct 120tccgaggcct ccaccacctc tgtccatgtc gtctctccgc agtttgacaa gggtgaatgg 180acctacaact acacgaacgg ccagctgacg gccatcggac acaacccatt tgctcacgcg 240gccgtcaaca ccgttctgca ttacgttcct cctcgaaacc tccacatcaa catcagcatc 300aaaagtgaca acgcgtacca ctcgcaaatt gacagcacgc agagaggcca gtttgcatac 360cacaaaaagg cgatccacga ggtgcctaaa acgggcctcg gtagctccgc tgctcttacc 420accgttcttg tggcagcttt gctcaagtca tacggcattg atcccttgca taacacccac 480ctcgttcaca acctgtccca ggttgcacac tgctcggcac agaagaagat tgggtctgga 540tttgacgtgg cttcggccgt ttgtggctct ctagtctata gacgtttccc ggcggagtcc 600gtgaacatgg tcattgcagc tgaagggacc tccgaatacg gggctctgtt gagaactacc 660gttaatcaaa agtggaaggt gactctggaa ccatccttct tgccgccggg aatcagcctg 720cttatgggag acgtccaggg aggatctgag actccaggta tggtggccaa ggtgatggca 780tggcgaaaag caaagccccg agaagccgag atggtgtgga gagatctcaa cgctgccaac 840atgctcatgg tcaagttgtt caacgacctg cgcaagctct ctctcactaa caacgaggcc 900tacgaacaac ttttggccga ggctgctcct ctcaacgctc taaagatgat aatgttgcag 960aaccctctcg gagaactagc acgatgcatt atcactattc gaaagcatct caagaagatg 1020acacgggaga ctggtgctgc tattgagccg gatgagcagt ctgcattgct caacaagtgc 1080aacacttata gtggagtcat tggaggtgtt gtgcctggag caggaggcta cgatgctatt 1140tctcttctgg tgatcagctc tacggtgaac aatgtcaagc gagagagcca gggagtccaa 1200tggatggagc tcaaggagga gaacgagggt ctgcggctcg agaaggggtt caagtag 125762418PRTYarrowia lipolytica 62Met Thr Thr Tyr Ser Ala Pro Gly Lys Ala Leu Leu Cys Gly Gly Tyr 1 5 10 15 Leu Val Ile Asp Pro Ala Tyr Ser Ala Tyr Val Val Gly Leu Ser Ala 20 25 30 Arg Ile Tyr Ala Thr Val Ser Ala Ser Glu Ala Ser Thr Thr Ser Val 35 40 45 His Val Val Ser Pro Gln Phe Asp Lys Gly Glu Trp Thr Tyr Asn Tyr 50 55 60 Thr Asn Gly Gln Leu Thr Ala Ile Gly His Asn Pro Phe Ala His Ala 65 70 75 80 Ala Val Asn Thr Val Leu His Tyr Val Pro Pro Arg Asn Leu His Ile 85 90 95 Asn Ile Ser Ile Lys Ser Asp Asn Ala Tyr His Ser Gln Ile Asp Ser 100 105 110 Thr Gln Arg Gly Gln Phe Ala Tyr His Lys Lys Ala Ile His Glu Val 115 120 125 Pro Lys Thr Gly Leu Gly Ser Ser Ala Ala Leu Thr Thr Val Leu Val 130 135 140 Ala Ala Leu Leu Lys Ser Tyr Gly Ile Asp Pro Leu His Asn Thr His 145 150 155 160 Leu Val His Asn Leu Ser Gln Val Ala His Cys Ser Ala Gln Lys Lys 165 170 175 Ile Gly Ser Gly Phe Asp Val Ala Ser Ala Val Cys Gly Ser Leu Val 180 185 190 Tyr Arg Arg Phe Pro Ala Glu Ser Val Asn Met Val Ile Ala Ala Glu 195 200 205 Gly Thr Ser Glu Tyr Gly Ala Leu Leu Arg Thr Thr Val Asn Gln Lys 210 215 220 Trp Lys Val Thr Leu Glu Pro Ser Phe Leu Pro Pro Gly Ile Ser Leu 225 230 235 240 Leu Met Gly Asp Val Gln Gly Gly Ser Glu Thr Pro Gly Met Val Ala 245 250 255 Lys Val Met Ala Trp Arg Lys Ala Lys Pro Arg Glu Ala Glu Met Val 260 265 270 Trp Arg Asp Leu Asn Ala Ala Asn Met Leu Met Val Lys Leu Phe Asn 275 280 285 Asp Leu Arg Lys Leu Ser Leu Thr Asn Asn Glu Ala Tyr Glu Gln Leu 290 295 300 Leu Ala Glu Ala Ala Pro Leu Asn Ala Leu Lys Met Ile Met Leu Gln 305 310 315 320 Asn Pro Leu Gly Glu Leu Ala Arg Cys Ile Ile Thr Ile Arg Lys His 325 330 335 Leu Lys Lys Met Thr Arg Glu Thr Gly Ala Ala Ile Glu Pro Asp Glu 340 345 350 Gln Ser Ala Leu Leu Asn Lys Cys Asn Thr Tyr Ser Gly Val Ile Gly 355 360 365 Gly Val Val Pro Gly Ala Gly Gly Tyr Asp Ala Ile Ser Leu Leu Val 370 375 380 Ile Ser Ser Thr Val Asn Asn Val Lys Arg Glu Ser Gln Gly Val Gln 385 390 395 400 Trp Met Glu Leu Lys Glu Glu Asn Glu Gly Leu Arg Leu Glu Lys Gly 405 410 415 Phe Lys 631164DNAYarrowia lipolytica 63atgatccacc aggcctccac caccgctccg gtgaacattg cgacactcaa gtactggggc 60aagcgagacc ctgctctcaa tctgcccact aacaactcca tctccgtgac tttgtcgcag 120gatgatctgc ggaccctcac cacagcctcg tgttcccctg atttcaccca ggacgagctg 180tggctcaatg gcaagcagga ggacgtgagc ggcaaacgtc tggttgcgtg tttccgagag 240ctgcgggctc tgcgacacaa aatggaggac tccgactctt ctctgcctaa gctggccgat 300cagaagctca agatcgtgtc cgagaacaac ttccccaccg ccgctggtct cgcctcatcg 360gctgctggct ttgccgccct gatccgagcc gttgcaaatc tctacgagct ccaggagacc 420cccgagcagc tgtccattgt ggctcgacag ggctctggat ccgcctgtcg atctctctac 480ggaggctacg tggcatggga aatgggcacc gagtctgacg gaagcgactc gcgagcggtc 540cagatcgcca ccgccgacca ctggcccgag atgcgagccg ccatcctcgt tgtctctgcc 600gacaagaagg acacgtcgtc cactaccggt atgcaggtga ctgtgcacac ttctcccctc 660ttcaaggagc gagtcaccac tgtggttccc gagcggtttg cccagatgaa gaagtcgatt 720ctggaccgag acttccccac ctttgccgag ctcaccatgc gagactcaaa ccagttccac 780gccacctgtc tggactcgta tcctcccatt ttctacctca acgacgtgtc gcgagcctcc 840attcgggtag ttgaggccat caacaaggct gccggagcca ccattgccgc ctacaccttt 900gatgctggac ccaactgtgt catctactac gaggacaaga acgaggagct ggttctgggt 960gctctcaagg ccattctggg ccgtgtggag ggatgggaga agcaccagtc tgtggacgcc 1020aagaagattg atgttgacga gcggtgggag tccgagctgg ccaacggaat tcagcgggtg 1080atccttacca aggttggagg agatcccgtg aagaccgctg agtcgcttat caacgaggat 1140ggttctctga agaacagcaa gtag 116464387PRTYarrowia lipolytica 64Met Ile His Gln Ala Ser Thr Thr Ala Pro Val Asn Ile Ala Thr Leu 1 5 10 15 Lys Tyr Trp Gly Lys Arg Asp Pro Ala Leu Asn Leu Pro Thr Asn Asn 20 25 30 Ser Ile Ser Val Thr Leu Ser Gln Asp Asp Leu Arg Thr Leu Thr Thr 35 40 45 Ala Ser Cys Ser Pro Asp Phe Thr Gln Asp Glu Leu Trp Leu Asn Gly 50 55 60 Lys Gln Glu Asp Val Ser Gly Lys Arg Leu Val Ala Cys Phe Arg Glu 65 70 75 80 Leu Arg Ala Leu Arg His Lys Met Glu Asp Ser Asp Ser Ser Leu Pro 85 90 95 Lys Leu Ala Asp Gln Lys Leu Lys Ile Val Ser Glu Asn Asn Phe Pro 100 105 110 Thr Ala Ala Gly Leu Ala Ser Ser Ala Ala Gly Phe Ala Ala Leu Ile 115 120 125 Arg Ala Val Ala Asn Leu Tyr Glu Leu Gln Glu Thr Pro Glu Gln Leu 130 135 140 Ser Ile Val Ala Arg Gln Gly Ser Gly Ser Ala Cys Arg Ser Leu Tyr 145 150 155 160 Gly Gly Tyr Val Ala Trp Glu Met Gly Thr Glu Ser Asp Gly Ser Asp 165 170 175 Ser Arg Ala Val Gln Ile Ala Thr Ala Asp His Trp Pro Glu Met Arg 180 185 190 Ala Ala Ile Leu Val Val Ser Ala Asp Lys Lys Asp Thr Ser Ser Thr 195 200 205 Thr Gly Met Gln Val Thr Val His Thr Ser Pro Leu Phe Lys Glu Arg 210 215 220 Val Thr Thr Val Val Pro Glu Arg Phe Ala Gln Met Lys Lys Ser Ile 225 230 235 240 Leu Asp Arg Asp Phe Pro Thr Phe Ala Glu Leu Thr Met Arg Asp Ser 245 250 255 Asn Gln Phe His Ala Thr Cys Leu Asp Ser Tyr Pro Pro Ile Phe Tyr 260 265 270 Leu Asn Asp Val Ser Arg Ala Ser Ile Arg Val Val Glu Ala Ile Asn 275 280 285 Lys Ala Ala Gly Ala Thr Ile Ala Ala Tyr Thr Phe Asp Ala Gly Pro 290 295 300 Asn Cys Val Ile Tyr Tyr Glu Asp Lys Asn Glu Glu Leu Val Leu Gly 305 310 315 320 Ala Leu Lys Ala

Ile Leu Gly Arg Val Glu Gly Trp Glu Lys His Gln 325 330 335 Ser Val Asp Ala Lys Lys Ile Asp Val Asp Glu Arg Trp Glu Ser Glu 340 345 350 Leu Ala Asn Gly Ile Gln Arg Val Ile Leu Thr Lys Val Gly Gly Asp 355 360 365 Pro Val Lys Thr Ala Glu Ser Leu Ile Asn Glu Asp Gly Ser Leu Lys 370 375 380 Asn Ser Lys 385 65813DNAYarrowia lipolytica 65atgacgacgt cttacagcga caaaatcaag agtatcagcg tgagctctgt ggctcagcag 60tttcctgagg tggcgccgat tgcggacgtg tccaaggcta gccggcccag cacggagtcg 120tcggactcgt cggccaagct atttgatggc cacgacgagg agcagatcaa gctgatggac 180gagatctgtg tggtgctgga ctgggacgac aagccgattg gcggcgcgtc caaaaagtgc 240tgtcatctga tggacaacat caacgacgga ctggtgcatc gggccttttc cgtgttcatg 300ttcaacgacc gcggtgagct gcttctgcag cagcgggcgg cggaaaaaat cacctttgcc 360aacatgtgga ccaacacgtg ctgctcgcat cctctggcgg tgcccagcga gatgggcggg 420ctggatctgg agtcccggat ccagggcgcc aaaaacgccg cggtccggaa gcttgagcac 480gagctgggaa tcgaccccaa ggccgttccg gcagacaagt tccatttcct cacccggatc 540cactacgccg cgccctcctc gggcccctgg ggcgagcacg agattgacta cattctgttt 600gtccggggcg accccgagct caaggtggtg gccaacgagg tccgcgatac cgtgtgggtg 660tcgcagcagg gactcaagga catgatggcc gatcccaagc tggttttcac cccttggttc 720cggctcattt gtgagcaggc gctgtttccc tggtgggacc agttggacaa tctgcccgcg 780ggcgatgacg agattcggcg gtggatcaag tag 81366270PRTYarrowia lipolytica 66Met Thr Thr Ser Tyr Ser Asp Lys Ile Lys Ser Ile Ser Val Ser Ser 1 5 10 15 Val Ala Gln Gln Phe Pro Glu Val Ala Pro Ile Ala Asp Val Ser Lys 20 25 30 Ala Ser Arg Pro Ser Thr Glu Ser Ser Asp Ser Ser Ala Lys Leu Phe 35 40 45 Asp Gly His Asp Glu Glu Gln Ile Lys Leu Met Asp Glu Ile Cys Val 50 55 60 Val Leu Asp Trp Asp Asp Lys Pro Ile Gly Gly Ala Ser Lys Lys Cys 65 70 75 80 Cys His Leu Met Asp Asn Ile Asn Asp Gly Leu Val His Arg Ala Phe 85 90 95 Ser Val Phe Met Phe Asn Asp Arg Gly Glu Leu Leu Leu Gln Gln Arg 100 105 110 Ala Ala Glu Lys Ile Thr Phe Ala Asn Met Trp Thr Asn Thr Cys Cys 115 120 125 Ser His Pro Leu Ala Val Pro Ser Glu Met Gly Gly Leu Asp Leu Glu 130 135 140 Ser Arg Ile Gln Gly Ala Lys Asn Ala Ala Val Arg Lys Leu Glu His 145 150 155 160 Glu Leu Gly Ile Asp Pro Lys Ala Val Pro Ala Asp Lys Phe His Phe 165 170 175 Leu Thr Arg Ile His Tyr Ala Ala Pro Ser Ser Gly Pro Trp Gly Glu 180 185 190 His Glu Ile Asp Tyr Ile Leu Phe Val Arg Gly Asp Pro Glu Leu Lys 195 200 205 Val Val Ala Asn Glu Val Arg Asp Thr Val Trp Val Ser Gln Gln Gly 210 215 220 Leu Lys Asp Met Met Ala Asp Pro Lys Leu Val Phe Thr Pro Trp Phe 225 230 235 240 Arg Leu Ile Cys Glu Gln Ala Leu Phe Pro Trp Trp Asp Gln Leu Asp 245 250 255 Asn Leu Pro Ala Gly Asp Asp Glu Ile Arg Arg Trp Ile Lys 260 265 270 671035DNAYarrowia lipolytica 67atgtccaagg cgaaattcga aagcgtgttc ccccgaatct ccgaggagct ggtgcagctg 60ctgcgagacg agggtctgcc ccaggatgcc gtgcagtggt tttccgactc acttcagtac 120aactgtgtgg gtggaaagct caaccgaggc ctgtctgtgg tcgacaccta ccagctactg 180accggcaaga aggagctcga tgacgaggag tactaccgac tcgcgctgct cggctggctg 240attgagctgc tgcaggcgtt tttcctcgtg tcggacgaca ttatggatga gtccaagacc 300cgacgaggcc agccctgctg gtacctcaag cccaaggtcg gcatgattgc catcaacgat 360gctttcatgc tagagagtgg catctacatt ctgcttaaga agcatttccg acaggagaag 420tactacattg accttgtcga gctgttccac gacatttcgt tcaagaccga gctgggccag 480ctggtggatc ttctgactgc ccccgaggat gaggttgatc tcaaccggtt ctctctggac 540aagcactcct ttattgtgcg atacaagact gcttactact ccttctacct gcccgttgtt 600ctagccatgt acgtggccgg cattaccaac cccaaggacc tgcagcaggc catggatgtg 660ctgatccctc tcggagagta cttccaggtc caggacgact accttgacaa ctttggagac 720cccgagttca ttggtaagat cggcaccgac atccaggaca acaagtgctc ctggctcgtt 780aacaaagccc ttcagaaggc cacccccgag cagcgacaga tcctcgagga caactacggc 840gtcaaggaca agtccaagga gctcgtcatc aagaaactgt atgatgacat gaagattgag 900caggactacc ttgactacga ggaggaggtt gttggcgaca tcaagaagaa gatcgagcag 960gttgacgaga gccgaggctt caagaaggag gtgctcaacg ctttcctcgc caagatttac 1020aagcgacaga agtag 103568344PRTYarrowia lipolytica 68Met Ser Lys Ala Lys Phe Glu Ser Val Phe Pro Arg Ile Ser Glu Glu 1 5 10 15 Leu Val Gln Leu Leu Arg Asp Glu Gly Leu Pro Gln Asp Ala Val Gln 20 25 30 Trp Phe Ser Asp Ser Leu Gln Tyr Asn Cys Val Gly Gly Lys Leu Asn 35 40 45 Arg Gly Leu Ser Val Val Asp Thr Tyr Gln Leu Leu Thr Gly Lys Lys 50 55 60 Glu Leu Asp Asp Glu Glu Tyr Tyr Arg Leu Ala Leu Leu Gly Trp Leu 65 70 75 80 Ile Glu Leu Leu Gln Ala Phe Phe Leu Val Ser Asp Asp Ile Met Asp 85 90 95 Glu Ser Lys Thr Arg Arg Gly Gln Pro Cys Trp Tyr Leu Lys Pro Lys 100 105 110 Val Gly Met Ile Ala Ile Asn Asp Ala Phe Met Leu Glu Ser Gly Ile 115 120 125 Tyr Ile Leu Leu Lys Lys His Phe Arg Gln Glu Lys Tyr Tyr Ile Asp 130 135 140 Leu Val Glu Leu Phe His Asp Ile Ser Phe Lys Thr Glu Leu Gly Gln 145 150 155 160 Leu Val Asp Leu Leu Thr Ala Pro Glu Asp Glu Val Asp Leu Asn Arg 165 170 175 Phe Ser Leu Asp Lys His Ser Phe Ile Val Arg Tyr Lys Thr Ala Tyr 180 185 190 Tyr Ser Phe Tyr Leu Pro Val Val Leu Ala Met Tyr Val Ala Gly Ile 195 200 205 Thr Asn Pro Lys Asp Leu Gln Gln Ala Met Asp Val Leu Ile Pro Leu 210 215 220 Gly Glu Tyr Phe Gln Val Gln Asp Asp Tyr Leu Asp Asn Phe Gly Asp 225 230 235 240 Pro Glu Phe Ile Gly Lys Ile Gly Thr Asp Ile Gln Asp Asn Lys Cys 245 250 255 Ser Trp Leu Val Asn Lys Ala Leu Gln Lys Ala Thr Pro Glu Gln Arg 260 265 270 Gln Ile Leu Glu Asp Asn Tyr Gly Val Lys Asp Lys Ser Lys Glu Leu 275 280 285 Val Ile Lys Lys Leu Tyr Asp Asp Met Lys Ile Glu Gln Asp Tyr Leu 290 295 300 Asp Tyr Glu Glu Glu Val Val Gly Asp Ile Lys Lys Lys Ile Glu Gln 305 310 315 320 Val Asp Glu Ser Arg Gly Phe Lys Lys Glu Val Leu Asn Ala Phe Leu 325 330 335 Ala Lys Ile Tyr Lys Arg Gln Lys 340 691890DNAYarrowia lipolytica 69atgttacgac tacgaaccat gcgacccaca cagaccagcg tcagggcggc gcttgggccc 60accgccgcgg cccgaaacat gtcctcctcc agcccctcca gcttcgaata ctcgtcctac 120gtcaagggca cgcgggaaat cggccaccga aaggcgccca caacccgtct gtcggttgag 180ggccccatct acgtgggctt cgacggcatt cgtcttctca acctgccgca tctcaacaag 240ggctcgggat tccccctcaa cgagcgacgg gaattcagac tcagtggtct tctgccctct 300gccgaagcca ccctggagga acaggtcgac cgagcatacc aacaattcaa aaagtgtggc 360actcccttag ccaaaaacgg gttctgcacc tcgctcaagt tccaaaacga ggtgctctac 420tacgccctgc tgctcaagca cgttaaggag gtcttcccca tcatctatac accgactcag 480ggagaagcca ttgaacagta ctcgcggctg ttccggcggc ccgaaggctg cttcctcgac 540atcaccagtc cctacgacgt ggaggagcgt ctgggagcgt ttggagacca tgacgacatt 600gactacattg tcgtgactga ctccgagggt attctcggaa ttggagacca aggagtgggc 660ggtattggta tttccatcgc caagctggct ctcatgactc tatgtgctgg agtcaacccc 720tcacgagtca ttcctgtggt tctggatacg ggaaccaaca accaggagct gctgcacgac 780cccctgtatc tcggccgacg aatgccccga gtgcgaggaa agcagtacga cgacttcatc 840gacaactttg tgcagtctgc ccgaaggctg tatcccaagg cggtgatcca tttcgaggac 900tttgggctcg ctaacgcaca caagatcctc gacaagtatc gaccggagat cccctgcttc 960aacgacgaca tccagggcac tggagccgtc actttggcct ccatcacggc cgctctcaag 1020gtgctgggca aaaatatcac agatactcga attctcgtgt acggagctgg ttcggccggc 1080atgggtattg ctgaacaggt ctatgataac ctggttgccc agggtctcga cgacaagact 1140gcgcgacaaa acatctttct catggaccga ccgggtctac tgaccaccgc acttaccgac 1200gagcagatga gcgacgtgca gaagccgttt gccaaggaca aggccaatta cgagggagtg 1260gacaccaaga ctctggagca cgtggttgct gccgtcaagc cccatattct cattggatgt 1320tccactcagc ccggcgcctt taacgagaag gtcgtcaagg agatgctcaa acacacccct 1380cgacccatca ttctccctct ttccaacccc acacgtcttc atgaggctgt ccctgcagat 1440ctgtacaagt ggaccgacgg caaggctctg gttgccaccg gctcgccctt tgacccagtc 1500aacggcaagg agacgtctga gaacaataac tgctttgttt tccccggaat cgggctggga 1560gccattctgt ctcgatcaaa gctcatcacc aacaccatga ttgctgctgc catcgagtgc 1620ctcgccgaac aggcccccat tctcaagaac cacgacgagg gagtacttcc cgacgtagct 1680ctcatccaga tcatttcggc ccgggtggcc actgccgtgg ttcttcaggc caaggctgag 1740ggcctagcca ctgtcgagga agagctcaag cccggcacca aggaacatgt gcagattccc 1800gacaactttg acgagtgtct cgcctgggtc gagactcaga tgtggcggcc cgtctaccgg 1860cctctcatcc atgtgcggga ttacgactag 189070629PRTYarrowia lipolytica 70Met Leu Arg Leu Arg Thr Met Arg Pro Thr Gln Thr Ser Val Arg Ala 1 5 10 15 Ala Leu Gly Pro Thr Ala Ala Ala Arg Asn Met Ser Ser Ser Ser Pro 20 25 30 Ser Ser Phe Glu Tyr Ser Ser Tyr Val Lys Gly Thr Arg Glu Ile Gly 35 40 45 His Arg Lys Ala Pro Thr Thr Arg Leu Ser Val Glu Gly Pro Ile Tyr 50 55 60 Val Gly Phe Asp Gly Ile Arg Leu Leu Asn Leu Pro His Leu Asn Lys 65 70 75 80 Gly Ser Gly Phe Pro Leu Asn Glu Arg Arg Glu Phe Arg Leu Ser Gly 85 90 95 Leu Leu Pro Ser Ala Glu Ala Thr Leu Glu Glu Gln Val Asp Arg Ala 100 105 110 Tyr Gln Gln Phe Lys Lys Cys Gly Thr Pro Leu Ala Lys Asn Gly Phe 115 120 125 Cys Thr Ser Leu Lys Phe Gln Asn Glu Val Leu Tyr Tyr Ala Leu Leu 130 135 140 Leu Lys His Val Lys Glu Val Phe Pro Ile Ile Tyr Thr Pro Thr Gln 145 150 155 160 Gly Glu Ala Ile Glu Gln Tyr Ser Arg Leu Phe Arg Arg Pro Glu Gly 165 170 175 Cys Phe Leu Asp Ile Thr Ser Pro Tyr Asp Val Glu Glu Arg Leu Gly 180 185 190 Ala Phe Gly Asp His Asp Asp Ile Asp Tyr Ile Val Val Thr Asp Ser 195 200 205 Glu Gly Ile Leu Gly Ile Gly Asp Gln Gly Val Gly Gly Ile Gly Ile 210 215 220 Ser Ile Ala Lys Leu Ala Leu Met Thr Leu Cys Ala Gly Val Asn Pro 225 230 235 240 Ser Arg Val Ile Pro Val Val Leu Asp Thr Gly Thr Asn Asn Gln Glu 245 250 255 Leu Leu His Asp Pro Leu Tyr Leu Gly Arg Arg Met Pro Arg Val Arg 260 265 270 Gly Lys Gln Tyr Asp Asp Phe Ile Asp Asn Phe Val Gln Ser Ala Arg 275 280 285 Arg Leu Tyr Pro Lys Ala Val Ile His Phe Glu Asp Phe Gly Leu Ala 290 295 300 Asn Ala His Lys Ile Leu Asp Lys Tyr Arg Pro Glu Ile Pro Cys Phe 305 310 315 320 Asn Asp Asp Ile Gln Gly Thr Gly Ala Val Thr Leu Ala Ser Ile Thr 325 330 335 Ala Ala Leu Lys Val Leu Gly Lys Asn Ile Thr Asp Thr Arg Ile Leu 340 345 350 Val Tyr Gly Ala Gly Ser Ala Gly Met Gly Ile Ala Glu Gln Val Tyr 355 360 365 Asp Asn Leu Val Ala Gln Gly Leu Asp Asp Lys Thr Ala Arg Gln Asn 370 375 380 Ile Phe Leu Met Asp Arg Pro Gly Leu Leu Thr Thr Ala Leu Thr Asp 385 390 395 400 Glu Gln Met Ser Asp Val Gln Lys Pro Phe Ala Lys Asp Lys Ala Asn 405 410 415 Tyr Glu Gly Val Asp Thr Lys Thr Leu Glu His Val Val Ala Ala Val 420 425 430 Lys Pro His Ile Leu Ile Gly Cys Ser Thr Gln Pro Gly Ala Phe Asn 435 440 445 Glu Lys Val Val Lys Glu Met Leu Lys His Thr Pro Arg Pro Ile Ile 450 455 460 Leu Pro Leu Ser Asn Pro Thr Arg Leu His Glu Ala Val Pro Ala Asp 465 470 475 480 Leu Tyr Lys Trp Thr Asp Gly Lys Ala Leu Val Ala Thr Gly Ser Pro 485 490 495 Phe Asp Pro Val Asn Gly Lys Glu Thr Ser Glu Asn Asn Asn Cys Phe 500 505 510 Val Phe Pro Gly Ile Gly Leu Gly Ala Ile Leu Ser Arg Ser Lys Leu 515 520 525 Ile Thr Asn Thr Met Ile Ala Ala Ala Ile Glu Cys Leu Ala Glu Gln 530 535 540 Ala Pro Ile Leu Lys Asn His Asp Glu Gly Val Leu Pro Asp Val Ala 545 550 555 560 Leu Ile Gln Ile Ile Ser Ala Arg Val Ala Thr Ala Val Val Leu Gln 565 570 575 Ala Lys Ala Glu Gly Leu Ala Thr Val Glu Glu Glu Leu Lys Pro Gly 580 585 590 Thr Lys Glu His Val Gln Ile Pro Asp Asn Phe Asp Glu Cys Leu Ala 595 600 605 Trp Val Glu Thr Gln Met Trp Arg Pro Val Tyr Arg Pro Leu Ile His 610 615 620 Val Arg Asp Tyr Asp 625 712610DNAYarrowia lipolytica 71atgccgcagc aagcaatgga tatcaagggc aaggccaagt ctgtgcccat gcccgaagaa 60gacgacctgg actcgcattt tgtgggtccc atctctcccc gacctcacgg agcagacgag 120attgctggct acgtgggctg cgaagacgac gaagacgagc ttgaagaact gggaatgctg 180ggccgatctg cgtccaccca cttctcttac gcggaagaac gccacctcat cgaggttgat 240gccaagtaca gagctcttca tggccatctg cctcatcagc actctcagag tcccgtgtcc 300agatcttcgt catttgtgcg ggccgaaatg aaccaccccc ctcccccacc ctccagccac 360acccaccaac agccagagga cgatgacgca tcttccactc gatctcgatc gtcgtctcga 420gcttctggac gcaagttcaa cagaaacaga accaagtctg gatcttcgct gagcaagggt 480ctccagcagc tcaacatgac cggatcgctc gaagaagagc cctacgagag cgatgacgat 540gcccgactat ctgcggaaga cgacattgtc tatgatgcta cccagaaaga cacctgcaag 600cccatatctc ctactctcaa acgcacccgc accaaggacg acatgaagaa catgtccatc 660aacgacgtca aaatcaccac caccacagaa gatcctcttg tggcccagga gctgtccatg 720atgttcgaaa aggtgcagta ctgccgagac ctccgagaca agtaccaaac cgtgtcgcta 780cagaaggacg gagacaaccc caaggatgac aagacacact ggaaaattta ccccgagcct 840ccaccaccct cctggcacga gaccgaaaag cgattccgag gctcgtccaa aaaggagcac 900caaaagaaag acccgacaat ggatgaattc aaattcgagg actgcgaaat ccccggaccc 960aacgacatgg tcttcaagcg agatcctacc tgtgtctatc aggtctatga ggatgaaagc 1020tctctcaacg aaaataagcc gtttgttgcc atcccctcaa tccgagatta ctacatggat 1080ctggaggatc tcattgtggc ttcgtctgac ggacctgcca agtcttttgc tttccgacga 1140ctgcaatatc tagaagccaa gtggaacctc tactacctgc tcaacgagta cacggagaca 1200accgagtcca agaccaaccc ccatcgagac ttttacaacg tacgaaaggt cgacacccac 1260gttcaccact ctgcctgcat gaaccagaag catctgctgc gattcatcaa atacaagatg 1320aagaactgcc ctgatgaagt tgtcatccac cgagacggtc gggagctgac actctcccag 1380gtgtttgagt cacttaactt gactgcctac gacctgtcta tcgataccct tgatatgcat 1440gctcacaagg actcgttcca tcgatttgac aagttcaacc tcaagtacaa ccctgtcggt 1500gagtctcgac tgcgagaaat cttcctaaag accgacaact acatccaggg tcgataccta 1560gctgagatca caaaggaggt gttccaggat ctcgagaact cgaagtacca gatggcggag 1620taccgtattt ccatctacgg tcggtccaag gacgagtggg acaagctggc tgcctgggtg 1680ctggacaaca aactgttttc gcccaatgtt cggtggttga tccaggtgcc tcgactgtac 1740gacatttaca agaaggctgg tctggttaac acctttgccg acattgtgca gaacgtcttt 1800gagcctcttt tcgaggtcac caaggatccc agtacccatc ccaagctgca cgtgttcctg 1860cagcgagttg tgggctttga ctctgtcgat gacgagtcga agctggaccg acgtttccac 1920cgaaagttcc caactgcagc atactgggac agcgcacaga accctcccta ctcgtactgg 1980cagtactatc tatacgccaa catggcctcc atcaacacct ggagacagcg tttgggctat 2040aatacttttg agttgcgacc ccatgctgga gaggctggtg acccagagca tcttctgtgc 2100acttatctgg ttgctcaggg tatcaaccac ggtattctgt tgcgaaaggt gcccttcatt 2160cagtaccttt actacctgga ccagatcccc attgccatgt ctcctgtgtc caacaatgcg 2220ctgttcctca cgttcgacaa gaaccccttc tactcatact tcaagcgggg tctcaacgtg 2280tccttgtcat cggatgatcc tctgcagttt gcttacacta aggaggctct gattgaggag 2340tactctgtgg ctgcgctcat ttacaagctt tccaacgtgg atatgtgtga gcttgctcga 2400aactcggtac tgcaatctgg ctttgagcga atcatcaagg agcattggat cggcgaaaac 2460tacgagatcc

atggccccga gggcaacacc atccagaaga caaacgtgcc caatgtgcgt 2520ctggccttcc gagacgagac tttgacccac gagcttgctc tggtggacaa gtacaccaat 2580cttgaggagt ttgagcggct gcatggttaa 261072869PRTYarrowia lipolytica 72Met Pro Gln Gln Ala Met Asp Ile Lys Gly Lys Ala Lys Ser Val Pro 1 5 10 15 Met Pro Glu Glu Asp Asp Leu Asp Ser His Phe Val Gly Pro Ile Ser 20 25 30 Pro Arg Pro His Gly Ala Asp Glu Ile Ala Gly Tyr Val Gly Cys Glu 35 40 45 Asp Asp Glu Asp Glu Leu Glu Glu Leu Gly Met Leu Gly Arg Ser Ala 50 55 60 Ser Thr His Phe Ser Tyr Ala Glu Glu Arg His Leu Ile Glu Val Asp 65 70 75 80 Ala Lys Tyr Arg Ala Leu His Gly His Leu Pro His Gln His Ser Gln 85 90 95 Ser Pro Val Ser Arg Ser Ser Ser Phe Val Arg Ala Glu Met Asn His 100 105 110 Pro Pro Pro Pro Pro Ser Ser His Thr His Gln Gln Pro Glu Asp Asp 115 120 125 Asp Ala Ser Ser Thr Arg Ser Arg Ser Ser Ser Arg Ala Ser Gly Arg 130 135 140 Lys Phe Asn Arg Asn Arg Thr Lys Ser Gly Ser Ser Leu Ser Lys Gly 145 150 155 160 Leu Gln Gln Leu Asn Met Thr Gly Ser Leu Glu Glu Glu Pro Tyr Glu 165 170 175 Ser Asp Asp Asp Ala Arg Leu Ser Ala Glu Asp Asp Ile Val Tyr Asp 180 185 190 Ala Thr Gln Lys Asp Thr Cys Lys Pro Ile Ser Pro Thr Leu Lys Arg 195 200 205 Thr Arg Thr Lys Asp Asp Met Lys Asn Met Ser Ile Asn Asp Val Lys 210 215 220 Ile Thr Thr Thr Thr Glu Asp Pro Leu Val Ala Gln Glu Leu Ser Met 225 230 235 240 Met Phe Glu Lys Val Gln Tyr Cys Arg Asp Leu Arg Asp Lys Tyr Gln 245 250 255 Thr Val Ser Leu Gln Lys Asp Gly Asp Asn Pro Lys Asp Asp Lys Thr 260 265 270 His Trp Lys Ile Tyr Pro Glu Pro Pro Pro Pro Ser Trp His Glu Thr 275 280 285 Glu Lys Arg Phe Arg Gly Ser Ser Lys Lys Glu His Gln Lys Lys Asp 290 295 300 Pro Thr Met Asp Glu Phe Lys Phe Glu Asp Cys Glu Ile Pro Gly Pro 305 310 315 320 Asn Asp Met Val Phe Lys Arg Asp Pro Thr Cys Val Tyr Gln Val Tyr 325 330 335 Glu Asp Glu Ser Ser Leu Asn Glu Asn Lys Pro Phe Val Ala Ile Pro 340 345 350 Ser Ile Arg Asp Tyr Tyr Met Asp Leu Glu Asp Leu Ile Val Ala Ser 355 360 365 Ser Asp Gly Pro Ala Lys Ser Phe Ala Phe Arg Arg Leu Gln Tyr Leu 370 375 380 Glu Ala Lys Trp Asn Leu Tyr Tyr Leu Leu Asn Glu Tyr Thr Glu Thr 385 390 395 400 Thr Glu Ser Lys Thr Asn Pro His Arg Asp Phe Tyr Asn Val Arg Lys 405 410 415 Val Asp Thr His Val His His Ser Ala Cys Met Asn Gln Lys His Leu 420 425 430 Leu Arg Phe Ile Lys Tyr Lys Met Lys Asn Cys Pro Asp Glu Val Val 435 440 445 Ile His Arg Asp Gly Arg Glu Leu Thr Leu Ser Gln Val Phe Glu Ser 450 455 460 Leu Asn Leu Thr Ala Tyr Asp Leu Ser Ile Asp Thr Leu Asp Met His 465 470 475 480 Ala His Lys Asp Ser Phe His Arg Phe Asp Lys Phe Asn Leu Lys Tyr 485 490 495 Asn Pro Val Gly Glu Ser Arg Leu Arg Glu Ile Phe Leu Lys Thr Asp 500 505 510 Asn Tyr Ile Gln Gly Arg Tyr Leu Ala Glu Ile Thr Lys Glu Val Phe 515 520 525 Gln Asp Leu Glu Asn Ser Lys Tyr Gln Met Ala Glu Tyr Arg Ile Ser 530 535 540 Ile Tyr Gly Arg Ser Lys Asp Glu Trp Asp Lys Leu Ala Ala Trp Val 545 550 555 560 Leu Asp Asn Lys Leu Phe Ser Pro Asn Val Arg Trp Leu Ile Gln Val 565 570 575 Pro Arg Leu Tyr Asp Ile Tyr Lys Lys Ala Gly Leu Val Asn Thr Phe 580 585 590 Ala Asp Ile Val Gln Asn Val Phe Glu Pro Leu Phe Glu Val Thr Lys 595 600 605 Asp Pro Ser Thr His Pro Lys Leu His Val Phe Leu Gln Arg Val Val 610 615 620 Gly Phe Asp Ser Val Asp Asp Glu Ser Lys Leu Asp Arg Arg Phe His 625 630 635 640 Arg Lys Phe Pro Thr Ala Ala Tyr Trp Asp Ser Ala Gln Asn Pro Pro 645 650 655 Tyr Ser Tyr Trp Gln Tyr Tyr Leu Tyr Ala Asn Met Ala Ser Ile Asn 660 665 670 Thr Trp Arg Gln Arg Leu Gly Tyr Asn Thr Phe Glu Leu Arg Pro His 675 680 685 Ala Gly Glu Ala Gly Asp Pro Glu His Leu Leu Cys Thr Tyr Leu Val 690 695 700 Ala Gln Gly Ile Asn His Gly Ile Leu Leu Arg Lys Val Pro Phe Ile 705 710 715 720 Gln Tyr Leu Tyr Tyr Leu Asp Gln Ile Pro Ile Ala Met Ser Pro Val 725 730 735 Ser Asn Asn Ala Leu Phe Leu Thr Phe Asp Lys Asn Pro Phe Tyr Ser 740 745 750 Tyr Phe Lys Arg Gly Leu Asn Val Ser Leu Ser Ser Asp Asp Pro Leu 755 760 765 Gln Phe Ala Tyr Thr Lys Glu Ala Leu Ile Glu Glu Tyr Ser Val Ala 770 775 780 Ala Leu Ile Tyr Lys Leu Ser Asn Val Asp Met Cys Glu Leu Ala Arg 785 790 795 800 Asn Ser Val Leu Gln Ser Gly Phe Glu Arg Ile Ile Lys Glu His Trp 805 810 815 Ile Gly Glu Asn Tyr Glu Ile His Gly Pro Glu Gly Asn Thr Ile Gln 820 825 830 Lys Thr Asn Val Pro Asn Val Arg Leu Ala Phe Arg Asp Glu Thr Leu 835 840 845 Thr His Glu Leu Ala Leu Val Asp Lys Tyr Thr Asn Leu Glu Glu Phe 850 855 860 Glu Arg Leu His Gly 865 731017DNAYarrowia lipolytica 73atgttccgaa cccgagttac cggctccacc ctgcgatcct tctccacctc cgctgcccga 60cagcacaagg ttgtcgtcct tggcgccaac ggaggcattg gccagcccct gtctctgctg 120ctcaagctca acaagaacgt gaccgacctc ggtctgtacg atctgcgagg cgcccccggc 180gttgctgccg atgtctccca catccccacc aactccaccg tggccggcta ctctcccgac 240aacaacggca ttgccgaggc cctcaagggc gccaagctgg tgctgatccc cgccggtgtc 300ccccgaaagc ccggcatgac ccgagacgat ctgttcaaca ccaacgcctc cattgtgcga 360gacctggcca aggccgtcgg tgagcacgcc cccgacgcct ttgtcggagt cattgctaac 420cccgtcaact ccaccgtccc cattgtcgcc gaggtgctca agtccaaggg caagtacgac 480cccaagaagc tcttcggtgt caccaccctc gacgtcatcc gagccgagcg attcgtctcc 540cagctcgagc acaccaaccc caccaaggag tacttccccg ttgttggcgg ccactccggt 600gtcaccattg tccccctcgt gtcccagtcc gaccaccccg acattgccgg tgaggctcga 660gacaagcttg tccaccgaat ccagtttggc ggtgacgagg ttgtcaaggc caaggacggt 720gccggatccg ccaccctttc catggcccag gctgccgccc gattcgccga ctctctcctc 780cgaggtgtca acggcgagaa ggacgttgtt gagcccactt tcgtcgactc tcctctgttc 840aagggtgagg gcatcgactt cttctccacc aaggtcactc ttggccctaa cggtgttgag 900gagatccacc ccatcggaaa ggtcaacgag tacgaggaga agctcatcga ggctgccaag 960gccgatctca agaagaacat tgagaagggt gtcaactttg tcaagcagaa cccttaa 101774338PRTYarrowia lipolytica 74Met Phe Arg Thr Arg Val Thr Gly Ser Thr Leu Arg Ser Phe Ser Thr 1 5 10 15 Ser Ala Ala Arg Gln His Lys Val Val Val Leu Gly Ala Asn Gly Gly 20 25 30 Ile Gly Gln Pro Leu Ser Leu Leu Leu Lys Leu Asn Lys Asn Val Thr 35 40 45 Asp Leu Gly Leu Tyr Asp Leu Arg Gly Ala Pro Gly Val Ala Ala Asp 50 55 60 Val Ser His Ile Pro Thr Asn Ser Thr Val Ala Gly Tyr Ser Pro Asp 65 70 75 80 Asn Asn Gly Ile Ala Glu Ala Leu Lys Gly Ala Lys Leu Val Leu Ile 85 90 95 Pro Ala Gly Val Pro Arg Lys Pro Gly Met Thr Arg Asp Asp Leu Phe 100 105 110 Asn Thr Asn Ala Ser Ile Val Arg Asp Leu Ala Lys Ala Val Gly Glu 115 120 125 His Ala Pro Asp Ala Phe Val Gly Val Ile Ala Asn Pro Val Asn Ser 130 135 140 Thr Val Pro Ile Val Ala Glu Val Leu Lys Ser Lys Gly Lys Tyr Asp 145 150 155 160 Pro Lys Lys Leu Phe Gly Val Thr Thr Leu Asp Val Ile Arg Ala Glu 165 170 175 Arg Phe Val Ser Gln Leu Glu His Thr Asn Pro Thr Lys Glu Tyr Phe 180 185 190 Pro Val Val Gly Gly His Ser Gly Val Thr Ile Val Pro Leu Val Ser 195 200 205 Gln Ser Asp His Pro Asp Ile Ala Gly Glu Ala Arg Asp Lys Leu Val 210 215 220 His Arg Ile Gln Phe Gly Gly Asp Glu Val Val Lys Ala Lys Asp Gly 225 230 235 240 Ala Gly Ser Ala Thr Leu Ser Met Ala Gln Ala Ala Ala Arg Phe Ala 245 250 255 Asp Ser Leu Leu Arg Gly Val Asn Gly Glu Lys Asp Val Val Glu Pro 260 265 270 Thr Phe Val Asp Ser Pro Leu Phe Lys Gly Glu Gly Ile Asp Phe Phe 275 280 285 Ser Thr Lys Val Thr Leu Gly Pro Asn Gly Val Glu Glu Ile His Pro 290 295 300 Ile Gly Lys Val Asn Glu Tyr Glu Glu Lys Leu Ile Glu Ala Ala Lys 305 310 315 320 Ala Asp Leu Lys Lys Asn Ile Glu Lys Gly Val Asn Phe Val Lys Gln 325 330 335 Asn Pro 751107DNAYarrowia lipolytica 75atgacacaaa cgcacaatct gttttcgcca atcaaagtgg gctcttcgga gctccagaac 60cggatcgttc tcgcaccctt gactcgaacc agagctctgc ccggaaacgt gccctcggat 120cttgccacag agtactacgc acaaagagca gcatctccag gcactctcct catcaccgag 180gccacataca tctcccccgg atctgctgga gtgcccattc caggagacgg aatcgttccg 240ggcatctgga gtgacgagca gctcgaagca tggaaaaagg tgttcaaggc cgtgcacgac 300cgaggatcca aaatctacgt ccagctgtgg gacattggac gtgtcgcatg gtaccacaag 360ctgcaggaac tgggcaacta cttccctaca ggcccctcag ctatccccat gaagggagag 420gagagcgagc atctcaaggc tctgactcac tgggagatca agggcaaggt ggccctctac 480gtcaacgctg ccaagaacgc cattgccgca ggcgctgatg gcgtcgagat ccactcggcc 540aacggctacc ttcccgacac atttctgaga agcgcctcca accaacgaac agacgaatat 600ggaggaagca tcgagaaccg ggcccgattc tcgctggaga ttgtcgacgc tatcaccgag 660gccattggag cagacaaaac cgccatccgt ctgtctccct ggtccacttt ccaggacatt 720gaggtgaatg acaccgagac ccccgcacag ttcacatacc tgtttgagca gctgcagaag 780cgagccgacg agggaaagca gctggcctac gtgcatgtag ttgagccccg actgtttggt 840ccccccgagc cctgggccac caatgagcct ttcagaaaaa tttggaaggg taacttcatt 900agagcaggtg gatacgatag agagactgct cttgaggatg cagacaagtc agacaacacc 960ctgattgcct ttggtcgaga cttcattgcc aatcctgatc tcgtccaacg cctcaagaat 1020aacgagcctt tggccaagta cgacagaaca accttctacg ttccaggtgc caagggctac 1080actgattacc ctgcgtacaa gatgtaa 110776368PRTYarrowia lipolytica 76Met Thr Gln Thr His Asn Leu Phe Ser Pro Ile Lys Val Gly Ser Ser 1 5 10 15 Glu Leu Gln Asn Arg Ile Val Leu Ala Pro Leu Thr Arg Thr Arg Ala 20 25 30 Leu Pro Gly Asn Val Pro Ser Asp Leu Ala Thr Glu Tyr Tyr Ala Gln 35 40 45 Arg Ala Ala Ser Pro Gly Thr Leu Leu Ile Thr Glu Ala Thr Tyr Ile 50 55 60 Ser Pro Gly Ser Ala Gly Val Pro Ile Pro Gly Asp Gly Ile Val Pro 65 70 75 80 Gly Ile Trp Ser Asp Glu Gln Leu Glu Ala Trp Lys Lys Val Phe Lys 85 90 95 Ala Val His Asp Arg Gly Ser Lys Ile Tyr Val Gln Leu Trp Asp Ile 100 105 110 Gly Arg Val Ala Trp Tyr His Lys Leu Gln Glu Leu Gly Asn Tyr Phe 115 120 125 Pro Thr Gly Pro Ser Ala Ile Pro Met Lys Gly Glu Glu Ser Glu His 130 135 140 Leu Lys Ala Leu Thr His Trp Glu Ile Lys Gly Lys Val Ala Leu Tyr 145 150 155 160 Val Asn Ala Ala Lys Asn Ala Ile Ala Ala Gly Ala Asp Gly Val Glu 165 170 175 Ile His Ser Ala Asn Gly Tyr Leu Pro Asp Thr Phe Leu Arg Ser Ala 180 185 190 Ser Asn Gln Arg Thr Asp Glu Tyr Gly Gly Ser Ile Glu Asn Arg Ala 195 200 205 Arg Phe Ser Leu Glu Ile Val Asp Ala Ile Thr Glu Ala Ile Gly Ala 210 215 220 Asp Lys Thr Ala Ile Arg Leu Ser Pro Trp Ser Thr Phe Gln Asp Ile 225 230 235 240 Glu Val Asn Asp Thr Glu Thr Pro Ala Gln Phe Thr Tyr Leu Phe Glu 245 250 255 Gln Leu Gln Lys Arg Ala Asp Glu Gly Lys Gln Leu Ala Tyr Val His 260 265 270 Val Val Glu Pro Arg Leu Phe Gly Pro Pro Glu Pro Trp Ala Thr Asn 275 280 285 Glu Pro Phe Arg Lys Ile Trp Lys Gly Asn Phe Ile Arg Ala Gly Gly 290 295 300 Tyr Asp Arg Glu Thr Ala Leu Glu Asp Ala Asp Lys Ser Asp Asn Thr 305 310 315 320 Leu Ile Ala Phe Gly Arg Asp Phe Ile Ala Asn Pro Asp Leu Val Gln 325 330 335 Arg Leu Lys Asn Asn Glu Pro Leu Ala Lys Tyr Asp Arg Thr Thr Phe 340 345 350 Tyr Val Pro Gly Ala Lys Gly Tyr Thr Asp Tyr Pro Ala Tyr Lys Met 355 360 365 771017DNAYarrowia lipolytica 77atggaagcca accccgaagt ccagaccgat atcatcacgc tgacccggtt cattctgcag 60gaacagaaca aggtgggcgc gtcgtccgca atccccaccg gagacttcac tctgctgctc 120aactcgctgc agtttgcctt caagttcatt gcccacaaca tccgacgatc gaccctggtc 180aacctgattg gcctgtcggg aaccgccaac tccaccggcg acgaccagaa gaagctggac 240gtgatcggag acgagatctt catcaacgcc atgaaggcct ccggtaaggt caagctggtg 300gtgtccgagg agcaggagga cctcattgtg tttgagggcg acggccgata cgccgtggtc 360tgcgacccca tcgacggatc ctccaacctc gacgccggcg tctccgtcgg caccattttc 420ggcgtctaca agctccccga gggctcctcc ggatccatca aggacgtgct ccgacccgga 480aaggagatgg ttgccgccgg ctacaccatg tacggtgcct ccgccaacct ggtgctgtcc 540accggaaacg gctgcaacgg cttcactctc gatgaccctc tgggagagtt catcctgacc 600caccccgatc tcaagctccc cgatctgcga tccatctact ccgtcaacga gggtaactcc 660tccctgtggt ccgacaacgt caaggactac ttcaaggccc tcaagttccc cgaggacggc 720tccaagccct actcggcccg atacattggc tccatggtcg ccgacgtgca ccgaaccatt 780ctctacggag gtatgtttgc ctaccccgcc gactccaagt ccaagaaggg caagctccga 840cttttgtacg agggtttccc catggcctac atcattgagc aggccggcgg tcttgccatc 900aacgacaacg gcgagcgaat cctcgatctg gtccccaccg agatccacga gcgatccggc 960gtctggctgg gctccaaggg cgagattgag aaggccaaga agtaccttct gaaatga 101778338PRTYarrowia lipolytica 78Met Glu Ala Asn Pro Glu Val Gln Thr Asp Ile Ile Thr Leu Thr Arg 1 5 10 15 Phe Ile Leu Gln Glu Gln Asn Lys Val Gly Ala Ser Ser Ala Ile Pro 20 25 30 Thr Gly Asp Phe Thr Leu Leu Leu Asn Ser Leu Gln Phe Ala Phe Lys 35 40 45 Phe Ile Ala His Asn Ile Arg Arg Ser Thr Leu Val Asn Leu Ile Gly 50 55 60 Leu Ser Gly Thr Ala Asn Ser Thr Gly Asp Asp Gln Lys Lys Leu Asp 65 70 75 80 Val Ile Gly Asp Glu Ile Phe Ile Asn Ala Met Lys Ala Ser Gly Lys 85 90 95 Val Lys Leu Val Val Ser Glu Glu Gln Glu Asp Leu Ile Val Phe Glu 100 105 110 Gly Asp Gly Arg Tyr Ala Val Val Cys Asp Pro Ile Asp Gly Ser Ser 115 120 125 Asn Leu Asp Ala Gly Val Ser Val Gly Thr Ile Phe Gly Val Tyr Lys 130 135 140 Leu Pro Glu Gly Ser Ser Gly Ser Ile Lys Asp Val Leu Arg Pro Gly 145 150 155 160 Lys Glu Met Val Ala Ala Gly Tyr Thr Met Tyr Gly Ala Ser Ala Asn 165 170 175 Leu Val Leu Ser Thr Gly Asn Gly Cys Asn Gly Phe Thr Leu Asp Asp 180

185 190 Pro Leu Gly Glu Phe Ile Leu Thr His Pro Asp Leu Lys Leu Pro Asp 195 200 205 Leu Arg Ser Ile Tyr Ser Val Asn Glu Gly Asn Ser Ser Leu Trp Ser 210 215 220 Asp Asn Val Lys Asp Tyr Phe Lys Ala Leu Lys Phe Pro Glu Asp Gly 225 230 235 240 Ser Lys Pro Tyr Ser Ala Arg Tyr Ile Gly Ser Met Val Ala Asp Val 245 250 255 His Arg Thr Ile Leu Tyr Gly Gly Met Phe Ala Tyr Pro Ala Asp Ser 260 265 270 Lys Ser Lys Lys Gly Lys Leu Arg Leu Leu Tyr Glu Gly Phe Pro Met 275 280 285 Ala Tyr Ile Ile Glu Gln Ala Gly Gly Leu Ala Ile Asn Asp Asn Gly 290 295 300 Glu Arg Ile Leu Asp Leu Val Pro Thr Glu Ile His Glu Arg Ser Gly 305 310 315 320 Val Trp Leu Gly Ser Lys Gly Glu Ile Glu Lys Ala Lys Lys Tyr Leu 325 330 335 Leu Lys 791194DNAYarrowia lipolytica 79atgcgactca ctctgccccg acttaacgcc gcctacattg taggagccgc ccgaactcct 60gtcggcaagt tcaacggagc cctcaagtcc gtgtctgcca ttgacctcgg tatcaccgct 120gccaaggccg ctgtccagcg atccaaggtc cccgccgacc agattgacga gtttctgttt 180ggccaggtgc tgaccgccaa ctccggccag gcccccgccc gacaggtggt tatcaagggt 240ggtttccccg agtccgtcga ggccaccacc atcaacaagg tgtgctcttc cggcctcaag 300accgtggctc tggctgccca ggccatcaag gccggcgacc gaaacgttat cgtggccggt 360ggaatggagt ccatgtccaa caccccctac tactccggtc gaggtcttgt tttcggcaac 420cagaagctcg aggactccat cgtcaaggac ggtctctggg acccctacaa caacatccac 480atgggcaact gctgcgagaa caccaacaag cgagacggca tcacccgaga gcagcaggac 540gagtacgcca tcgagtccta ccgacgggcc aacgagtcca tcaagaacgg cgccttcaag 600gatgagattg tccccgttga gatcaagacc cgaaagggca ccgtgactgt ctccgaggac 660gaggagccca agggagccaa cgccgagaag ctcaagggcc tcaagcctgt ctttgacaag 720cagggctccg tcactgccgg taacgcctcc cccatcaacg atggtgcttc tgccgttgtc 780gttgcctctg gcaccaaggc caaggagctc ggtacccccg tgctcgccaa gattgtctct 840tacgcagacg ccgccaccgc ccccattgac tttaccattg ctccctctct ggccattccc 900gccgccctca agaaggctgg ccttaccaag gacgacattg ccctctggga gatcaacgag 960gccttctccg gtgtcgctct cgccaacctc atgcgactcg gaattgacaa gtccaaggtc 1020aacgtcaagg gtggagctgt tgctctcggc caccccattg gtgcctccgg taaccgaatc 1080tttgtgactt tggtcaacgc cctcaaggag ggcgagtacg gagttgccgc catctgcaac 1140ggtggaggag cttccaccgc catcgtcatc aagaaggtct cttctgtcga gtag 119480397PRTYarrowia lipolytica 80Met Arg Leu Thr Leu Pro Arg Leu Asn Ala Ala Tyr Ile Val Gly Ala 1 5 10 15 Ala Arg Thr Pro Val Gly Lys Phe Asn Gly Ala Leu Lys Ser Val Ser 20 25 30 Ala Ile Asp Leu Gly Ile Thr Ala Ala Lys Ala Ala Val Gln Arg Ser 35 40 45 Lys Val Pro Ala Asp Gln Ile Asp Glu Phe Leu Phe Gly Gln Val Leu 50 55 60 Thr Ala Asn Ser Gly Gln Ala Pro Ala Arg Gln Val Val Ile Lys Gly 65 70 75 80 Gly Phe Pro Glu Ser Val Glu Ala Thr Thr Ile Asn Lys Val Cys Ser 85 90 95 Ser Gly Leu Lys Thr Val Ala Leu Ala Ala Gln Ala Ile Lys Ala Gly 100 105 110 Asp Arg Asn Val Ile Val Ala Gly Gly Met Glu Ser Met Ser Asn Thr 115 120 125 Pro Tyr Tyr Ser Gly Arg Gly Leu Val Phe Gly Asn Gln Lys Leu Glu 130 135 140 Asp Ser Ile Val Lys Asp Gly Leu Trp Asp Pro Tyr Asn Asn Ile His 145 150 155 160 Met Gly Asn Cys Cys Glu Asn Thr Asn Lys Arg Asp Gly Ile Thr Arg 165 170 175 Glu Gln Gln Asp Glu Tyr Ala Ile Glu Ser Tyr Arg Arg Ala Asn Glu 180 185 190 Ser Ile Lys Asn Gly Ala Phe Lys Asp Glu Ile Val Pro Val Glu Ile 195 200 205 Lys Thr Arg Lys Gly Thr Val Thr Val Ser Glu Asp Glu Glu Pro Lys 210 215 220 Gly Ala Asn Ala Glu Lys Leu Lys Gly Leu Lys Pro Val Phe Asp Lys 225 230 235 240 Gln Gly Ser Val Thr Ala Gly Asn Ala Ser Pro Ile Asn Asp Gly Ala 245 250 255 Ser Ala Val Val Val Ala Ser Gly Thr Lys Ala Lys Glu Leu Gly Thr 260 265 270 Pro Val Leu Ala Lys Ile Val Ser Tyr Ala Asp Ala Ala Thr Ala Pro 275 280 285 Ile Asp Phe Thr Ile Ala Pro Ser Leu Ala Ile Pro Ala Ala Leu Lys 290 295 300 Lys Ala Gly Leu Thr Lys Asp Asp Ile Ala Leu Trp Glu Ile Asn Glu 305 310 315 320 Ala Phe Ser Gly Val Ala Leu Ala Asn Leu Met Arg Leu Gly Ile Asp 325 330 335 Lys Ser Lys Val Asn Val Lys Gly Gly Ala Val Ala Leu Gly His Pro 340 345 350 Ile Gly Ala Ser Gly Asn Arg Ile Phe Val Thr Leu Val Asn Ala Leu 355 360 365 Lys Glu Gly Glu Tyr Gly Val Ala Ala Ile Cys Asn Gly Gly Gly Ala 370 375 380 Ser Thr Ala Ile Val Ile Lys Lys Val Ser Ser Val Glu 385 390 395 811953DNAYarrowia lipolytica 81atgtctgcca acgagaacat ctcccgattc gacgcccctg tgggcaagga gcaccccgcc 60tacgagctct tccataacca cacacgatct ttcgtctatg gtctccagcc tcgagcctgc 120cagggtatgc tggacttcga cttcatctgt aagcgagaga acccctccgt ggccggtgtc 180atctatccct tcggcggcca gttcgtcacc aagatgtact ggggcaccaa ggagactctt 240ctccctgtct accagcaggt cgagaaggcc gctgccaagc accccgaggt cgatgtcgtg 300gtcaactttg cctcctctcg atccgtctac tcctctacca tggagctgct cgagtacccc 360cagttccgaa ccatcgccat tattgccgag ggtgtccccg agcgacgagc ccgagagatc 420ctccacaagg cccagaagaa gggtgtgacc atcattggtc ccgctaccgt cggaggtatc 480aagcccggtt gcttcaaggt tggaaacacc ggaggtatga tggacaacat tgtcgcctcc 540aagctctacc gacccggctc cgttgcctac gtctccaagt ccggaggaat gtccaacgag 600ctgaacaaca ttatctctca caccaccgac ggtgtctacg agggtattgc tattggtggt 660gaccgatacc ctggtactac cttcattgac catatcctgc gatacgaggc cgaccccaag 720tgtaagatca tcgtcctcct tggtgaggtt ggtggtgttg aggagtaccg agtcatcgag 780gctgttaaga acggccagat caagaagccc atcgtcgctt gggccattgg tacttgtgcc 840tccatgttca agactgaggt tcagttcggc cacgccggct ccatggccaa ctccgacctg 900gagactgcca aggctaagaa cgccgccatg aagtctgctg gcttctacgt ccccgatacc 960ttcgaggaca tgcccgaggt ccttgccgag ctctacgaga agatggtcgc caagggcgag 1020ctgtctcgaa tctctgagcc tgaggtcccc aagatcccca ttgactactc ttgggcccag 1080gagcttggtc ttatccgaaa gcccgctgct ttcatctcca ctatttccga tgaccgaggc 1140caggagcttc tgtacgctgg catgcccatt tccgaggttt tcaaggagga cattggtatc 1200ggcggtgtca tgtctctgct gtggttccga cgacgactcc ccgactacgc ctccaagttt 1260cttgagatgg ttctcatgct tactgctgac cacggtcccg ccgtatccgg tgccatgaac 1320accattatca ccacccgagc tggtaaggat ctcatttctt ccctggttgc tggtctcctg 1380accattggta cccgattcgg aggtgctctt gacggtgctg ccaccgagtt caccactgcc 1440tacgacaagg gtctgtcccc ccgacagttc gttgatacca tgcgaaagca gaacaagctg 1500attcctggta ttggccatcg agtcaagtct cgaaacaacc ccgatttccg agtcgagctt 1560gtcaaggact ttgttaagaa gaacttcccc tccacccagc tgctcgacta cgcccttgct 1620gtcgaggagg tcaccacctc caagaaggac aacctgattc tgaacgttga cggtgctatt 1680gctgtttctt ttgtcgatct catgcgatct tgcggtgcct ttactgtgga ggagactgag 1740gactacctca agaacggtgt tctcaacggt ctgttcgttc tcggtcgatc cattggtctc 1800attgcccacc atctcgatca gaagcgactc aagaccggtc tgtaccgaca tccttgggac 1860gatatcacct acctggttgg ccaggaggct atccagaaga agcgagtcga gatcagcgcc 1920ggcgacgttt ccaaggccaa gactcgatca tag 195382650PRTYarrowia lipolytica 82Met Ser Ala Asn Glu Asn Ile Ser Arg Phe Asp Ala Pro Val Gly Lys 1 5 10 15 Glu His Pro Ala Tyr Glu Leu Phe His Asn His Thr Arg Ser Phe Val 20 25 30 Tyr Gly Leu Gln Pro Arg Ala Cys Gln Gly Met Leu Asp Phe Asp Phe 35 40 45 Ile Cys Lys Arg Glu Asn Pro Ser Val Ala Gly Val Ile Tyr Pro Phe 50 55 60 Gly Gly Gln Phe Val Thr Lys Met Tyr Trp Gly Thr Lys Glu Thr Leu 65 70 75 80 Leu Pro Val Tyr Gln Gln Val Glu Lys Ala Ala Ala Lys His Pro Glu 85 90 95 Val Asp Val Val Val Asn Phe Ala Ser Ser Arg Ser Val Tyr Ser Ser 100 105 110 Thr Met Glu Leu Leu Glu Tyr Pro Gln Phe Arg Thr Ile Ala Ile Ile 115 120 125 Ala Glu Gly Val Pro Glu Arg Arg Ala Arg Glu Ile Leu His Lys Ala 130 135 140 Gln Lys Lys Gly Val Thr Ile Ile Gly Pro Ala Thr Val Gly Gly Ile 145 150 155 160 Lys Pro Gly Cys Phe Lys Val Gly Asn Thr Gly Gly Met Met Asp Asn 165 170 175 Ile Val Ala Ser Lys Leu Tyr Arg Pro Gly Ser Val Ala Tyr Val Ser 180 185 190 Lys Ser Gly Gly Met Ser Asn Glu Leu Asn Asn Ile Ile Ser His Thr 195 200 205 Thr Asp Gly Val Tyr Glu Gly Ile Ala Ile Gly Gly Asp Arg Tyr Pro 210 215 220 Gly Thr Thr Phe Ile Asp His Ile Leu Arg Tyr Glu Ala Asp Pro Lys 225 230 235 240 Cys Lys Ile Ile Val Leu Leu Gly Glu Val Gly Gly Val Glu Glu Tyr 245 250 255 Arg Val Ile Glu Ala Val Lys Asn Gly Gln Ile Lys Lys Pro Ile Val 260 265 270 Ala Trp Ala Ile Gly Thr Cys Ala Ser Met Phe Lys Thr Glu Val Gln 275 280 285 Phe Gly His Ala Gly Ser Met Ala Asn Ser Asp Leu Glu Thr Ala Lys 290 295 300 Ala Lys Asn Ala Ala Met Lys Ser Ala Gly Phe Tyr Val Pro Asp Thr 305 310 315 320 Phe Glu Asp Met Pro Glu Val Leu Ala Glu Leu Tyr Glu Lys Met Val 325 330 335 Ala Lys Gly Glu Leu Ser Arg Ile Ser Glu Pro Glu Val Pro Lys Ile 340 345 350 Pro Ile Asp Tyr Ser Trp Ala Gln Glu Leu Gly Leu Ile Arg Lys Pro 355 360 365 Ala Ala Phe Ile Ser Thr Ile Ser Asp Asp Arg Gly Gln Glu Leu Leu 370 375 380 Tyr Ala Gly Met Pro Ile Ser Glu Val Phe Lys Glu Asp Ile Gly Ile 385 390 395 400 Gly Gly Val Met Ser Leu Leu Trp Phe Arg Arg Arg Leu Pro Asp Tyr 405 410 415 Ala Ser Lys Phe Leu Glu Met Val Leu Met Leu Thr Ala Asp His Gly 420 425 430 Pro Ala Val Ser Gly Ala Met Asn Thr Ile Ile Thr Thr Arg Ala Gly 435 440 445 Lys Asp Leu Ile Ser Ser Leu Val Ala Gly Leu Leu Thr Ile Gly Thr 450 455 460 Arg Phe Gly Gly Ala Leu Asp Gly Ala Ala Thr Glu Phe Thr Thr Ala 465 470 475 480 Tyr Asp Lys Gly Leu Ser Pro Arg Gln Phe Val Asp Thr Met Arg Lys 485 490 495 Gln Asn Lys Leu Ile Pro Gly Ile Gly His Arg Val Lys Ser Arg Asn 500 505 510 Asn Pro Asp Phe Arg Val Glu Leu Val Lys Asp Phe Val Lys Lys Asn 515 520 525 Phe Pro Ser Thr Gln Leu Leu Asp Tyr Ala Leu Ala Val Glu Glu Val 530 535 540 Thr Thr Ser Lys Lys Asp Asn Leu Ile Leu Asn Val Asp Gly Ala Ile 545 550 555 560 Ala Val Ser Phe Val Asp Leu Met Arg Ser Cys Gly Ala Phe Thr Val 565 570 575 Glu Glu Thr Glu Asp Tyr Leu Lys Asn Gly Val Leu Asn Gly Leu Phe 580 585 590 Val Leu Gly Arg Ser Ile Gly Leu Ile Ala His His Leu Asp Gln Lys 595 600 605 Arg Leu Lys Thr Gly Leu Tyr Arg His Pro Trp Asp Asp Ile Thr Tyr 610 615 620 Leu Val Gly Gln Glu Ala Ile Gln Lys Lys Arg Val Glu Ile Ser Ala 625 630 635 640 Gly Asp Val Ser Lys Ala Lys Thr Arg Ser 645 650 831494DNAYarrowia lipolytica 83atgtcagcga aatccattca cgaggccgac ggcaaggccc tgctcgcaca ctttctgtcc 60aaggcgcccg tgtgggccga gcagcagccc atcaacacgt ttgaaatggg cacacccaag 120ctggcgtctc tgacgttcga ggacggcgtg gcccccgagc agatcttcgc cgccgctgaa 180aagacctacc cctggctgct ggagtccggc gccaagtttg tggccaagcc cgaccagctc 240atcaagcgac gaggcaaggc cggcctgctg gtactcaaca agtcgtggga ggagtgcaag 300ccctggatcg ccgagcgggc cgccaagccc atcaacgtgg agggcattga cggagtgctg 360cgaacgttcc tggtcgagcc ctttgtgccc cacgaccaga agcacgagta ctacatcaac 420atccactccg tgcgagaggg cgactggatc ctcttctacc acgagggagg agtcgacgtc 480ggcgacgtgg acgccaaggc cgccaagatc ctcatccccg ttgacattga gaacgagtac 540ccctccaacg ccacgctcac caaggagctg ctggcacacg tgcccgagga ccagcaccag 600accctgctcg acttcatcaa ccggctctac gccgtctacg tcgatctgca gtttacgtat 660ctggagatca accccctggt cgtgatcccc accgcccagg gcgtcgaggt ccactacctg 720gatcttgccg gcaagctcga ccagaccgca gagtttgagt gcggccccaa gtgggctgct 780gcgcggtccc ccgccgctct gggccaggtc gtcaccattg acgccggctc caccaaggtg 840tccatcgacg ccggccccgc catggtcttc cccgctcctt tcggtcgaga gctgtccaag 900gaggaggcgt acattgcgga gctcgattcc aagaccggag cttctctgaa gctgactgtt 960ctcaatgcca agggccgaat ctggaccctt gtggctggtg gaggagcctc cgtcgtctac 1020gccgacgcca ttgcgtctgc cggctttgct gacgagctcg ccaactacgg cgagtactct 1080ggcgctccca acgagaccca gacctacgag tacgccaaaa ccgtactgga tctcatgacc 1140cggggcgacg ctcaccccga gggcaaggta ctgttcattg gcggaggaat cgccaacttc 1200acccaggttg gatccacctt caagggcatc atccgggcct tccgggacta ccagtcttct 1260ctgcacaacc acaaggtgaa gatttacgtg cgacgaggcg gtcccaactg gcaggagggt 1320ctgcggttga tcaagtcggc tggcgacgag ctgaatctgc ccatggagat ttacggcccc 1380gacatgcacg tgtcgggtat tgttcctttg gctctgcttg gaaagcggcc caagaatgtc 1440aagccttttg gcaccggacc ttctactgag gcttccactc ctctcggagt ttaa 149484497PRTYarrowia lipolytica 84Met Ser Ala Lys Ser Ile His Glu Ala Asp Gly Lys Ala Leu Leu Ala 1 5 10 15 His Phe Leu Ser Lys Ala Pro Val Trp Ala Glu Gln Gln Pro Ile Asn 20 25 30 Thr Phe Glu Met Gly Thr Pro Lys Leu Ala Ser Leu Thr Phe Glu Asp 35 40 45 Gly Val Ala Pro Glu Gln Ile Phe Ala Ala Ala Glu Lys Thr Tyr Pro 50 55 60 Trp Leu Leu Glu Ser Gly Ala Lys Phe Val Ala Lys Pro Asp Gln Leu 65 70 75 80 Ile Lys Arg Arg Gly Lys Ala Gly Leu Leu Val Leu Asn Lys Ser Trp 85 90 95 Glu Glu Cys Lys Pro Trp Ile Ala Glu Arg Ala Ala Lys Pro Ile Asn 100 105 110 Val Glu Gly Ile Asp Gly Val Leu Arg Thr Phe Leu Val Glu Pro Phe 115 120 125 Val Pro His Asp Gln Lys His Glu Tyr Tyr Ile Asn Ile His Ser Val 130 135 140 Arg Glu Gly Asp Trp Ile Leu Phe Tyr His Glu Gly Gly Val Asp Val 145 150 155 160 Gly Asp Val Asp Ala Lys Ala Ala Lys Ile Leu Ile Pro Val Asp Ile 165 170 175 Glu Asn Glu Tyr Pro Ser Asn Ala Thr Leu Thr Lys Glu Leu Leu Ala 180 185 190 His Val Pro Glu Asp Gln His Gln Thr Leu Leu Asp Phe Ile Asn Arg 195 200 205 Leu Tyr Ala Val Tyr Val Asp Leu Gln Phe Thr Tyr Leu Glu Ile Asn 210 215 220 Pro Leu Val Val Ile Pro Thr Ala Gln Gly Val Glu Val His Tyr Leu 225 230 235 240 Asp Leu Ala Gly Lys Leu Asp Gln Thr Ala Glu Phe Glu Cys Gly Pro 245 250 255 Lys Trp Ala Ala Ala Arg Ser Pro Ala Ala Leu Gly Gln Val Val Thr 260 265 270 Ile Asp Ala Gly Ser Thr Lys Val Ser Ile Asp Ala Gly Pro Ala Met 275 280 285 Val Phe Pro Ala Pro Phe Gly Arg Glu Leu Ser Lys Glu Glu Ala Tyr 290 295 300 Ile Ala Glu Leu Asp Ser Lys Thr Gly Ala Ser Leu Lys Leu Thr Val 305 310 315 320 Leu Asn Ala Lys Gly Arg Ile Trp Thr Leu Val Ala Gly Gly Gly Ala 325 330 335 Ser Val Val Tyr Ala Asp Ala Ile Ala Ser Ala Gly Phe Ala Asp Glu 340 345 350 Leu Ala Asn Tyr Gly Glu Tyr Ser Gly Ala Pro Asn Glu Thr Gln Thr 355

360 365 Tyr Glu Tyr Ala Lys Thr Val Leu Asp Leu Met Thr Arg Gly Asp Ala 370 375 380 His Pro Glu Gly Lys Val Leu Phe Ile Gly Gly Gly Ile Ala Asn Phe 385 390 395 400 Thr Gln Val Gly Ser Thr Phe Lys Gly Ile Ile Arg Ala Phe Arg Asp 405 410 415 Tyr Gln Ser Ser Leu His Asn His Lys Val Lys Ile Tyr Val Arg Arg 420 425 430 Gly Gly Pro Asn Trp Gln Glu Gly Leu Arg Leu Ile Lys Ser Ala Gly 435 440 445 Asp Glu Leu Asn Leu Pro Met Glu Ile Tyr Gly Pro Asp Met His Val 450 455 460 Ser Gly Ile Val Pro Leu Ala Leu Leu Gly Lys Arg Pro Lys Asn Val 465 470 475 480 Lys Pro Phe Gly Thr Gly Pro Ser Thr Glu Ala Ser Thr Pro Leu Gly 485 490 495 Val 851458DNAYarrowia lipolytica 85atggttatta tgtgtgtggg acctcagcac acgcatcatc ccaacacagg gtgcagtata 60tatagacaga cgtgttcctt cgcaccgttc ttcacatatc aaaacactaa caaattcaaa 120agtgagtatc atggtgggag tcaattgatt gctcggggag ttgaacaggc aacaatggca 180tgcacagggc cagtgaaggc agactgcagt cgctgcacat ggatcgtggt tctgaggcgt 240tgctatcaaa agggtcaatt acctcacgaa acacagctgg atgttgtgca atcgtcaatt 300gaaaaacccg acacaatgca agatctcttt gcgcgcattg ccatcgctgt tgccatcgct 360gtcgccatcg ccaatgccgc tgcggattat tatccctacc ttgttccccg cttccgcaca 420accggcgatg tctttgtatc atgaactctc gaaactaact cagtggttaa agctgtcgtt 480gccggagccg ctggtggtat tggccagccc ctttctcttc tcctcaaact ctctccttac 540gtgaccgagc ttgctctcta cgatgtcgtc aactcccccg gtgttgccgc tgacctctcc 600cacatctcca ccaaggctaa ggtcactggc tacctcccca aggatgacgg tctcaagaac 660gctctgaccg gcgccaacat tgtcgttatc cccgccggta tcccccgaaa gcccggtatg 720acccgagacg atctgttcaa gatcaacgct ggtatcgtcc gagatctcgt caccggtgtc 780gcccagtacg cccctgacgc ctttgtgctc atcatctcca accccgtcaa ctctaccgtc 840cctattgctg ccgaggtcct caagaagcac aacgtcttca accctaagaa gctcttcggt 900gtcaccaccc ttgacgttgt ccgagcccag accttcaccg ccgctgttgt tggcgagtct 960gaccccacca agctcaacat ccccgtcgtt ggtggccact ccggagacac cattgtccct 1020ctcctgtctc tgaccaagcc taaggtcgag atccccgccg acaagctcga cgacctcgtc 1080aagcgaatcc agtttggtgg tgacgaggtt gtccaggcta aggacggtct tggatccgct 1140accctctcca tggcccaggc tggtttccga tttgccgagg ctgtcctcaa gggtgccgct 1200ggtgagaagg gcatcatcga gcccgcctac atctaccttg acggtattga tggcacctcc 1260gacatcaagc gagaggtcgg tgtcgccttc ttctctgtcc ctgtcgagtt cggccctgag 1320ggtgccgcta aggcttacaa catccttccc gaggccaacg actacgagaa gaagcttctc 1380aaggtctcca tcgacggtct ttacggcaac attgccaagg gcgaggagtt cattgttaac 1440cctcctcctg ccaagtaa 145886331PRTYarrowia lipolytica 86Val Val Lys Ala Val Val Ala Gly Ala Ala Gly Gly Ile Gly Gln Pro 1 5 10 15 Leu Ser Leu Leu Leu Lys Leu Ser Pro Tyr Val Thr Glu Leu Ala Leu 20 25 30 Tyr Asp Val Val Asn Ser Pro Gly Val Ala Ala Asp Leu Ser His Ile 35 40 45 Ser Thr Lys Ala Lys Val Thr Gly Tyr Leu Pro Lys Asp Asp Gly Leu 50 55 60 Lys Asn Ala Leu Thr Gly Ala Asn Ile Val Val Ile Pro Ala Gly Ile 65 70 75 80 Pro Arg Lys Pro Gly Met Thr Arg Asp Asp Leu Phe Lys Ile Asn Ala 85 90 95 Gly Ile Val Arg Asp Leu Val Thr Gly Val Ala Gln Tyr Ala Pro Asp 100 105 110 Ala Phe Val Leu Ile Ile Ser Asn Pro Val Asn Ser Thr Val Pro Ile 115 120 125 Ala Ala Glu Val Leu Lys Lys His Asn Val Phe Asn Pro Lys Lys Leu 130 135 140 Phe Gly Val Thr Thr Leu Asp Val Val Arg Ala Gln Thr Phe Thr Ala 145 150 155 160 Ala Val Val Gly Glu Ser Asp Pro Thr Lys Leu Asn Ile Pro Val Val 165 170 175 Gly Gly His Ser Gly Asp Thr Ile Val Pro Leu Leu Ser Leu Thr Lys 180 185 190 Pro Lys Val Glu Ile Pro Ala Asp Lys Leu Asp Asp Leu Val Lys Arg 195 200 205 Ile Gln Phe Gly Gly Asp Glu Val Val Gln Ala Lys Asp Gly Leu Gly 210 215 220 Ser Ala Thr Leu Ser Met Ala Gln Ala Gly Phe Arg Phe Ala Glu Ala 225 230 235 240 Val Leu Lys Gly Ala Ala Gly Glu Lys Gly Ile Ile Glu Pro Ala Tyr 245 250 255 Ile Tyr Leu Asp Gly Ile Asp Gly Thr Ser Asp Ile Lys Arg Glu Val 260 265 270 Gly Val Ala Phe Phe Ser Val Pro Val Glu Phe Gly Pro Glu Gly Ala 275 280 285 Ala Lys Ala Tyr Asn Ile Leu Pro Glu Ala Asn Asp Tyr Glu Lys Lys 290 295 300 Leu Leu Lys Val Ser Ile Asp Gly Leu Tyr Gly Asn Ile Ala Lys Gly 305 310 315 320 Glu Glu Phe Ile Val Asn Pro Pro Pro Ala Lys 325 330 871937DNAYarrowia lipolytica 87atgactggca ccttacccaa gttcggcgac ggaaccacca ttgtggttct tggagcctcc 60ggcgacctcg ctaagaagaa gaccgtgagt attgaaccag actgaggtca attgaagagt 120aggagagtct gagaacattc gacggacctg attgtgctct ggaccactca attgactcgt 180tgagagcccc aatgggtctt ggctagccga gtcgttgact tgttgacttg ttgagcccag 240aacccccaac ttttgccacc atacaccgcc atcaccatga cacccagatg tgcgtgcgta 300tgtgagagtc aattgttccg tggcaaggca cagcttattc caccgtgttc cttgcacagg 360tggtctttac gctctcccac tctatccgag caataaaagc ggaaaaacag cagcaagtcc 420caacagactt ctgctccgaa taaggcgtct agcaagtgtg cccaaaactc aattcaaaaa 480tgtcagaaac ctgatatcaa cccgtcttca aaagctaacc ccagttcccc gccctcttcg 540gcctttaccg aaacggcctg ctgcccaaaa atgttgaaat catcggctac gcacggtcga 600aaatgactca ggaggagtac cacgagcgaa tcagccacta cttcaagacc cccgacgacc 660agtccaagga gcaggccaag aagttccttg agaacacctg ctacgtccag ggcccttacg 720acggtgccga gggctaccag cgactgaatg aaaagattga ggagtttgag aagaagaagc 780ccgagcccca ctaccgtctt ttctacctgg ctctgccccc cagcgtcttc cttgaggctg 840ccaacggtct gaagaagtat gtctaccccg gcgagggcaa ggcccgaatc atcatcgaga 900agccctttgg ccacgacctg gcctcgtcac gagagctcca ggacggcctt gctcctctct 960ggaaggagtc tgagatcttc cgaatcgacc actacctcgg aaaggagatg gtcaagaacc 1020tcaacattct gcgatttggc aaccagttcc tgtccgccgt gtgggacaag aacaccattt 1080ccaacgtcca gatctccttc aaggagccct ttggcactga gggccgaggt ggatacttca 1140acgacattgg aatcatccga gacgttattc agaaccatct gttgcaggtt ctgtccattc 1200tagccatgga gcgacccgtc actttcggcg ccgaggacat tcgagatgag aaggtcaagg 1260tgctccgatg tgtcgacatt ctcaacattg acgacgtcat tctcggccag tacggcccct 1320ctgaagacgg aaagaagccc ggatacaccg atgacgatgg cgttcccgat gactcccgag 1380ctgtgacctt tgctgctctc catctccaga tccacaacga cagatgggag ggtgttcctt 1440tcatcctccg agccggtaag gctctggacg agggcaaggt cgagatccga gtgcagttcc 1500gagacgtgac caagggcgtt gtggaccatc tgcctcgaaa tgagctcgtc atccgaatcc 1560agccctccga gtccatctac atgaagatga actccaagct gcctggcctt actgccaaga 1620acattgtcac cgacctggat ctgacctaca accgacgata ctcggacgtg cgaatccctg 1680aggcttacga gtctctcatt ctggactgcc tcaagggtga ccacaccaac tttgtgcgaa 1740acgacgagct ggacatttcc tggaagattt tcaccgatct gctgcacaag attgacgagg 1800acaagagcat tgtgcccgag aagtacgcct acggctctcg tggccccgag cgactcaagc 1860agtggctccg agaccgaggc tacgtgcgaa acggcaccga gctgtaccaa tggcctgtca 1920ccaagggctc ctcgtga 193788498PRTYarrowia lipolytica 88Met Thr Gly Thr Leu Pro Lys Phe Gly Asp Gly Thr Thr Ile Val Val 1 5 10 15 Leu Gly Ala Ser Gly Asp Leu Ala Lys Lys Lys Thr Phe Pro Ala Leu 20 25 30 Phe Gly Leu Tyr Arg Asn Gly Leu Leu Pro Lys Asn Val Glu Ile Ile 35 40 45 Gly Tyr Ala Arg Ser Lys Met Thr Gln Glu Glu Tyr His Glu Arg Ile 50 55 60 Ser His Tyr Phe Lys Thr Pro Asp Asp Gln Ser Lys Glu Gln Ala Lys 65 70 75 80 Lys Phe Leu Glu Asn Thr Cys Tyr Val Gln Gly Pro Tyr Asp Gly Ala 85 90 95 Glu Gly Tyr Gln Arg Leu Asn Glu Lys Ile Glu Glu Phe Glu Lys Lys 100 105 110 Lys Pro Glu Pro His Tyr Arg Leu Phe Tyr Leu Ala Leu Pro Pro Ser 115 120 125 Val Phe Leu Glu Ala Ala Asn Gly Leu Lys Lys Tyr Val Tyr Pro Gly 130 135 140 Glu Gly Lys Ala Arg Ile Ile Ile Glu Lys Pro Phe Gly His Asp Leu 145 150 155 160 Ala Ser Ser Arg Glu Leu Gln Asp Gly Leu Ala Pro Leu Trp Lys Glu 165 170 175 Ser Glu Ile Phe Arg Ile Asp His Tyr Leu Gly Lys Glu Met Val Lys 180 185 190 Asn Leu Asn Ile Leu Arg Phe Gly Asn Gln Phe Leu Ser Ala Val Trp 195 200 205 Asp Lys Asn Thr Ile Ser Asn Val Gln Ile Ser Phe Lys Glu Pro Phe 210 215 220 Gly Thr Glu Gly Arg Gly Gly Tyr Phe Asn Asp Ile Gly Ile Ile Arg 225 230 235 240 Asp Val Ile Gln Asn His Leu Leu Gln Val Leu Ser Ile Leu Ala Met 245 250 255 Glu Arg Pro Val Thr Phe Gly Ala Glu Asp Ile Arg Asp Glu Lys Val 260 265 270 Lys Val Leu Arg Cys Val Asp Ile Leu Asn Ile Asp Asp Val Ile Leu 275 280 285 Gly Gln Tyr Gly Pro Ser Glu Asp Gly Lys Lys Pro Gly Tyr Thr Asp 290 295 300 Asp Asp Gly Val Pro Asp Asp Ser Arg Ala Val Thr Phe Ala Ala Leu 305 310 315 320 His Leu Gln Ile His Asn Asp Arg Trp Glu Gly Val Pro Phe Ile Leu 325 330 335 Arg Ala Gly Lys Ala Leu Asp Glu Gly Lys Val Glu Ile Arg Val Gln 340 345 350 Phe Arg Asp Val Thr Lys Gly Val Val Asp His Leu Pro Arg Asn Glu 355 360 365 Leu Val Ile Arg Ile Gln Pro Ser Glu Ser Ile Tyr Met Lys Met Asn 370 375 380 Ser Lys Leu Pro Gly Leu Thr Ala Lys Asn Ile Val Thr Asp Leu Asp 385 390 395 400 Leu Thr Tyr Asn Arg Arg Tyr Ser Asp Val Arg Ile Pro Glu Ala Tyr 405 410 415 Glu Ser Leu Ile Leu Asp Cys Leu Lys Gly Asp His Thr Asn Phe Val 420 425 430 Arg Asn Asp Glu Leu Asp Ile Ser Trp Lys Ile Phe Thr Asp Leu Leu 435 440 445 His Lys Ile Asp Glu Asp Lys Ser Ile Val Pro Glu Lys Tyr Ala Tyr 450 455 460 Gly Ser Arg Gly Pro Glu Arg Leu Lys Gln Trp Leu Arg Asp Arg Gly 465 470 475 480 Tyr Val Arg Asn Gly Thr Glu Leu Tyr Gln Trp Pro Val Thr Lys Gly 485 490 495 Ser Ser 892202DNAYarrowia lipolytica 89atgactgaca cttcaaacat caagtgagta ttgccgcaca caattgcaat caccgccggg 60ctctacctcc tcagctctcg acgtcaatgg gccagcagcc gccatttgac cccaattaca 120ctggttgtgt aaaaccctca accacaatcg cttatgctca ccacagacta cgacttaacc 180aagtcatgtc acaggtcaaa gtaaagtcag cgcaacaccc cctcaatctc aacacacttt 240tgctaactca ggcctgtcgc tgacattgcc ctcatcggtc tcgccgtcat gggccagaac 300ctgatcctca acatggccga ccacggtaag tatcaattga ctcaagacgc accagcaaga 360tacagagcat acccagcaat cgctcctctg ataatcgcca ttgtaacact acgttggtta 420gattgatcta aggtcgttgc tggttccatg cacttccact tgctcatatg aagggagtca 480aactctattt tgatagtgtc ctctcccatc cccgaaatgt cgcattgttg ctaacaatag 540gctacgaggt tgttgcctac aaccgaacca cctccaaggt cgaccacttc ctcgagaacg 600aggccaaggg tgagtatccg tccagctatg ctgtttacag ccattgaccc caccttcccc 660cacaattgct acgtcaccat taaaaaacaa aattaccggt atcggcaagc tagactttca 720tgcaacctac gcagggtaac aagttgagtt tcagccgtgc accttacagg aaaaccagtc 780atacgccgag gcagtgtgaa agcgaaagca cacagcctac ggtgattgat tgcatttttt 840tgacatagga gggaaacacg tgacatggca agtgcccaac acgaatacta acaaacagga 900aagtccatta ttggtgctca ctctatcaag gagctgtgtg ctctgctgaa gcgaccccga 960cgaatcattc tgctcgttaa ggccggtgct gctgtcgatt ctttcatcga acagctcctg 1020ccctatctcg ataagggtga tatcatcatt gacggtggta actcccactt ccccgactcc 1080aaccgacgat acgaggagct taacgagaag ggaatcctct ttgttggttc cggtgtttcc 1140ggcggtgagg agggtgcccg atacggtccc tccatcatgc ccggtggaaa caaggaggcc 1200tggccccaca ttaagaagat tttccaggac atctctgcta aggctgatgg tgagccctgc 1260tgtgactggg tcggtgacgc tggtgccggc cactttgtca agatggttca caacggtatt 1320gagtatggtg acatgcagct tatctgcgag gcttacgacc tcatgaagcg aggtgctggt 1380ttcaccaatg aggagattgg agacgttttc gccaagtgga acaacggtat cctcgactcc 1440ttcctcattg agatcacccg agacatcttc aagtacgacg acggctctgg aactcctctc 1500gttgagaaga tctccgacac tgctggccag aagggtactg gaaagtggac cgctatcaac 1560gctcttgacc ttggtatgcc cgtcaccctg atcggtgagg ccgtcttcgc tcgatgcctt 1620tctgccctca agcaggagcg tgtccgagct tccaaggttc ttgatggccc cgagcccgtc 1680aagttcactg gtgacaagaa ggagtttgtc gaccagctcg agcaggccct ttacgcctcc 1740aagatcatct cttacgccca gggtttcatg cttatccgag aggccgccaa gacctacggc 1800tgggagctca acaacgccgg tattgccctc atgtggcgag gtggttgcat catccgatcc 1860gtcttccttg ctgacatcac caaggcttac cgacaggacc ccaacctcga gaacctgctg 1920ttcaacgact tcttcaagaa cgccatctcc aaggccaacc cctcttggcg agctaccgtg 1980gccaaggctg tcacctgggg tgttcccact cccgcctttg cctcggctct ggctttctac 2040gacggttacc gatctgccaa gctccccgct aacctgctcc aggcccagcg agactacttc 2100ggcgcccaca cctaccagct cctcgatggt gatggaaagt ggatccacac caactggacc 2160ggccgaggtg gtgaggtttc ttcttccact tacgatgctt aa 220290489PRTYarrowia lipolytica 90Met Thr Asp Thr Ser Asn Ile Lys Pro Val Ala Asp Ile Ala Leu Ile 1 5 10 15 Gly Leu Ala Val Met Gly Gln Asn Leu Ile Leu Asn Met Ala Asp His 20 25 30 Gly Tyr Glu Val Val Ala Tyr Asn Arg Thr Thr Ser Lys Val Asp His 35 40 45 Phe Leu Glu Asn Glu Ala Lys Gly Lys Ser Ile Ile Gly Ala His Ser 50 55 60 Ile Lys Glu Leu Cys Ala Leu Leu Lys Arg Pro Arg Arg Ile Ile Leu 65 70 75 80 Leu Val Lys Ala Gly Ala Ala Val Asp Ser Phe Ile Glu Gln Leu Leu 85 90 95 Pro Tyr Leu Asp Lys Gly Asp Ile Ile Ile Asp Gly Gly Asn Ser His 100 105 110 Phe Pro Asp Ser Asn Arg Arg Tyr Glu Glu Leu Asn Glu Lys Gly Ile 115 120 125 Leu Phe Val Gly Ser Gly Val Ser Gly Gly Glu Glu Gly Ala Arg Tyr 130 135 140 Gly Pro Ser Ile Met Pro Gly Gly Asn Lys Glu Ala Trp Pro His Ile 145 150 155 160 Lys Lys Ile Phe Gln Asp Ile Ser Ala Lys Ala Asp Gly Glu Pro Cys 165 170 175 Cys Asp Trp Val Gly Asp Ala Gly Ala Gly His Phe Val Lys Met Val 180 185 190 His Asn Gly Ile Glu Tyr Gly Asp Met Gln Leu Ile Cys Glu Ala Tyr 195 200 205 Asp Leu Met Lys Arg Gly Ala Gly Phe Thr Asn Glu Glu Ile Gly Asp 210 215 220 Val Phe Ala Lys Trp Asn Asn Gly Ile Leu Asp Ser Phe Leu Ile Glu 225 230 235 240 Ile Thr Arg Asp Ile Phe Lys Tyr Asp Asp Gly Ser Gly Thr Pro Leu 245 250 255 Val Glu Lys Ile Ser Asp Thr Ala Gly Gln Lys Gly Thr Gly Lys Trp 260 265 270 Thr Ala Ile Asn Ala Leu Asp Leu Gly Met Pro Val Thr Leu Ile Gly 275 280 285 Glu Ala Val Phe Ala Arg Cys Leu Ser Ala Leu Lys Gln Glu Arg Val 290 295 300 Arg Ala Ser Lys Val Leu Asp Gly Pro Glu Pro Val Lys Phe Thr Gly 305 310 315 320 Asp Lys Lys Glu Phe Val Asp Gln Leu Glu Gln Ala Leu Tyr Ala Ser 325 330 335 Lys Ile Ile Ser Tyr Ala Gln Gly Phe Met Leu Ile Arg Glu Ala Ala 340 345 350 Lys Thr Tyr Gly Trp Glu Leu Asn Asn Ala Gly Ile Ala Leu Met Trp 355 360 365 Arg Gly Gly Cys Ile Ile Arg Ser Val Phe Leu Ala Asp Ile Thr Lys 370 375 380 Ala Tyr Arg Gln Asp Pro Asn Leu Glu Asn Leu Leu Phe Asn Asp Phe 385 390 395 400 Phe Lys Asn Ala Ile Ser Lys Ala Asn Pro Ser Trp Arg Ala Thr Val 405 410 415 Ala Lys Ala Val Thr Trp Gly Val Pro Thr Pro Ala Phe Ala Ser Ala 420 425 430 Leu Ala Phe Tyr Asp Gly Tyr Arg Ser Ala Lys Leu Pro Ala Asn Leu 435 440

445 Leu Gln Ala Gln Arg Asp Tyr Phe Gly Ala His Thr Tyr Gln Leu Leu 450 455 460 Asp Gly Asp Gly Lys Trp Ile His Thr Asn Trp Thr Gly Arg Gly Gly 465 470 475 480 Glu Val Ser Ser Ser Thr Tyr Asp Ala 485 911742DNAYarrowia lipolytica 91atgctcaacc ttagaaccgc ccttcgagct gtgcgacccg tcactctggt gagtatctcg 60gagcccggga cggctaccaa cacacaagca agatgcaaca gaaaccggac tttttaaatg 120cggattgcgg aaaatttgca tggcggcaac gactcggaga aggagcggga caattgcaat 180ggcaggatgc cattgacgaa ctgagggtga tgagagaccg ggcctccgat gacgtggtgg 240tgacgacagc ccggctggtg ttgccgggac tgtctctgaa aagcaatttc tctatctccg 300gtctcaacag actccccttc tctagctcaa ttggcattgt cttcagaagg tgtcttagtg 360gtatccccat tgttatcttc ttttccccaa tgtcaatgtc aatgtcaatg gctccgacct 420ctttcacatt aacacggcgc aaacacagat accacggaac cgactcaaac aaatccaaag 480agacgcagcg gaataattgg catcaacgaa cgatttggga tactctggcg agaatgccga 540aatatttcgc ttgtcttgtt gtttctcttg agtgagttgt ttgtgaagtc gtttggaaga 600aggttcccaa tgtcacaaac cataccaact cgttacagcc agcttgtaat cccccacctc 660ttcaatacat actaacgcag acccgatcct acgccacttc cgtggcctct ttcaccggcc 720agaagaactc caacggcaag tacactgtgt ctctgattga gggagacggt atcggaaccg 780agatctccaa ggctgtcaag gacatctacc atgccgccaa ggtccccatc gactgggagg 840ttgtcgacgt cacccccact ctggtcaacg gcaagaccac catccccgac agcgccattg 900agtccatcaa ccgaaacaag gttgccctca agggtcccct cgccaccccc atcggtaagg 960gccacgtttc catgaacctg actctgcgac gaaccttcaa cctgttcgcc aacgtccgac 1020cttgcaagtc cgtcgtgggc tacaagaccc cttacgagaa cgtcgacacc ctgctcatcc 1080gagagaacac tgagggtgag tactccggta tcgagcacac cgtcgtcccc ggtgtcgttc 1140agtccatcaa gctgatcacc cgagaggctt ccgagcgagt catccggtac gcttacgagt 1200acgccctgtc ccgaggcatg aagaaggtcc ttgttgtcca caaggcctct attatgaagg 1260tctccgatgg tcttttcctt gaggttgctc gagagctcgc caaggagtac ccctccattg 1320acctttccgt cgagctgatc gacaacacct gtctgcgaat ggtccaggac cccgctctct 1380accgagatgt cgtcatggtc atgcccaacc tttacggtga cattctgtcc gatcttgcct 1440ccggtcttat cggtggtctt ggtctgaccc cctccggtaa catgggtgac gaggtctcca 1500tcttcgaggc cgtccacgga tccgctcccg acattgctgg caagggtctt gctaacccca 1560ctgctctgct gctctcctcc gtgatgatgc tgcgacacat gggtctcaac gacaacgcca 1620ccaacatcga gcaggccgtc tttggcacca ttgcttccgg ccccgagaac cgaaccaagg 1680atcttaaggg taccgccacc acttctcact ttgctgagca gattatcaag cgactcaagt 1740ag 174292369PRTYarrowia lipolytica 92Met Leu Asn Leu Arg Thr Ala Leu Arg Ala Val Arg Pro Val Thr Leu 1 5 10 15 Thr Arg Ser Tyr Ala Thr Ser Val Ala Ser Phe Thr Gly Gln Lys Asn 20 25 30 Ser Asn Gly Lys Tyr Thr Val Ser Leu Ile Glu Gly Asp Gly Ile Gly 35 40 45 Thr Glu Ile Ser Lys Ala Val Lys Asp Ile Tyr His Ala Ala Lys Val 50 55 60 Pro Ile Asp Trp Glu Val Val Asp Val Thr Pro Thr Leu Val Asn Gly 65 70 75 80 Lys Thr Thr Ile Pro Asp Ser Ala Ile Glu Ser Ile Asn Arg Asn Lys 85 90 95 Val Ala Leu Lys Gly Pro Leu Ala Thr Pro Ile Gly Lys Gly His Val 100 105 110 Ser Met Asn Leu Thr Leu Arg Arg Thr Phe Asn Leu Phe Ala Asn Val 115 120 125 Arg Pro Cys Lys Ser Val Val Gly Tyr Lys Thr Pro Tyr Glu Asn Val 130 135 140 Asp Thr Leu Leu Ile Arg Glu Asn Thr Glu Gly Glu Tyr Ser Gly Ile 145 150 155 160 Glu His Thr Val Val Pro Gly Val Val Gln Ser Ile Lys Leu Ile Thr 165 170 175 Arg Glu Ala Ser Glu Arg Val Ile Arg Tyr Ala Tyr Glu Tyr Ala Leu 180 185 190 Ser Arg Gly Met Lys Lys Val Leu Val Val His Lys Ala Ser Ile Met 195 200 205 Lys Val Ser Asp Gly Leu Phe Leu Glu Val Ala Arg Glu Leu Ala Lys 210 215 220 Glu Tyr Pro Ser Ile Asp Leu Ser Val Glu Leu Ile Asp Asn Thr Cys 225 230 235 240 Leu Arg Met Val Gln Asp Pro Ala Leu Tyr Arg Asp Val Val Met Val 245 250 255 Met Pro Asn Leu Tyr Gly Asp Ile Leu Ser Asp Leu Ala Ser Gly Leu 260 265 270 Ile Gly Gly Leu Gly Leu Thr Pro Ser Gly Asn Met Gly Asp Glu Val 275 280 285 Ser Ile Phe Glu Ala Val His Gly Ser Ala Pro Asp Ile Ala Gly Lys 290 295 300 Gly Leu Ala Asn Pro Thr Ala Leu Leu Leu Ser Ser Val Met Met Leu 305 310 315 320 Arg His Met Gly Leu Asn Asp Asn Ala Thr Asn Ile Glu Gln Ala Val 325 330 335 Phe Gly Thr Ile Ala Ser Gly Pro Glu Asn Arg Thr Lys Asp Leu Lys 340 345 350 Gly Thr Ala Thr Thr Ser His Phe Ala Glu Gln Ile Ile Lys Arg Leu 355 360 365 Lys 93960DNAYarrowia lipolytica 93cctctcactt tgtgaatcgt gaaacatgaa tcttcaagcc aagaatgtta ggcaggggaa 60gctttctttc agactttttg gaattggtcc tcttttggac attattgacg atattattat 120tttttccccg tccaatgttg acccttgtaa gccattccgg ttctggagcg catctcgtct 180gaaggagtct tcgtgtggct ataactacaa gcgttgtatg gtggatccta tgaccgtcta 240tatagggcaa cttttgctct tgttcttccc cctccttgag ggacgtatgg caatggctat 300gacaactatc gtagtgagcc tctataaccc attgaagtac aagtcctcca ccttgctgcc 360aaactcgcga gaaaaaaagt ccaccaactc cgccgggaaa tactggagaa cacctctaag 420acgtgggctt ctgcacctgt gtggcttggg tctgggtttt gcgagctctg agccacaacc 480taaggacggt gtgattggga gataagtagt cgttggtttt ctaatcgcac gtgatatgca 540agccacactt ataacacaat gaagacaggc cgatgaactg catgtcattg tacaggtgcg 600gagagcaaga aactctgggg cggaggtgaa agatgagaca aaaagcctca ggtgcaaggt 660agggagttga tcaacgtcaa acacaaataa tctaggttgt taggcagcta aacatgtata 720taactgggct gccaccgagt gttacttgtc attaacgtcg cattttcgcc tacacaaaat 780ttgggttact cgccactaca ctgctcaaat ctttcagctg tgcaacaagc tttcaggtca 840cacatagact cgcataagga cccgggtcat ctgttattct ccactggtaa accaatagtc 900ctagctgatt tgggtacaga agctcacttt cacatctttt catcttcttc tacaaccatc 96094855DNAYarrowia lipolytica 94atctgtgagg agcccctggc gtcactgtcg actgtgccgg catttctgat ggtatttcca 60gccccgcagt tctcgagacc cccgaacaaa tgtgccacac ccttgccaaa atgacgaata 120cacggcgtcg cggccgggaa tcgaactctt ggcaccgcca caggagtgaa atttgaaatt 180tgaaatttga aaaataattc acattttgag tttcaataat atatcgatga ccctcccaaa 240agacccaagt cgagacgcaa aaaaacaccc agacgacatg gatgcggtca cgtgaccgca 300aaaaccgccc cggaaatccg tttgtgacgt gttcaattcc atctctatgt ttttctgcgg 360tttctacgat gccgcaatgg tggccaatgt gcgtttcact gccgtagtgg ctggaacaag 420ccacaggggg tcgtcgggcc aatcagacgg tccctgacat ggttctgcgc cctaacccgg 480gaactctaac ccccgtggtg gcgcaatcgc tgtcttcatg tgctttatct cacgtgacgg 540ctggaatctg gcagaagacg gagtatgtac attttgtcgt tggtcacgtt atccctaaaa 600cgtggtgttt aaactggtcg aatgcttggc ccagaacaca agaagaaaaa aacgagacaa 660cttgatcagt ttcaacgcca cagcaagctt gtcttcactg tggttggtct tctccacgcc 720acaagcaaca cgtacatgtc aattacgtca gggtctttta agttctgtgg cttttgaacc 780agttataaag aaccaaccac ccttttttca aagctaatca agacggggaa attttttttt 840tgatattttt cgaca 85595897DNAYarrowia lipolytica 95gatactgcag acggtgcatt acttacccgt gtcgactgag agtctacttg gtacttggcc 60ctgtggctaa gcagtatttg agcaacaatg caatgcagtt gctgactcgg ttccagatcc 120ccttgccccg atgtgtggaa gcgttgtttt tggggcaagg gcatgtgggg gctgcatcat 180actgtggctg gggccgttgg aagagccgtc ggcagcgagc ctgagtcgct tctcggggcc 240ttattccccc cgcctctagg tcagcggcgg ccgaagtgtc gtactcagct cgcctgtaca 300gtatgacgtg accgaatagc ctctggaagg ttggagaagt acagtgcaaa aaaaagttgc 360aaaatttcat tttagcgttc gatccgacgt ggcagttgga caatgaatcg atggagacat 420gatcatgggc agaaatagaa ggtctccatg ttcaatggca gtaccaattg agcaacagac 480gggtcgacag gcggcgggca caccatccgc cctccacatg gcgcaatcgt cagtgcagcg 540attcgtactc ggattgcatc atgttgcacc gaaagttggg gcccgcacgt tggagaggcg 600aggagccagg gttagctttg gtggggtcct ttgttgtcac gtggcatcag cgaatggcgt 660cctccaatca gggccgtcag cgaagtcggc gtgtgatagt gcgtggggag cgaatagagt 720ttctgggggg gggcggccca aaacgtgaaa tccgagtacg catgtagagt gtaaattggg 780tgtatagtga cattgtttga ctctgaccct gagagtaata tataatgtgt acgtgtcccc 840ctccgttggt cttctttttt tctcctttct cctaaccaac acccaaacta atcaatc 89796972DNAYarrowia lipolytica 96aagtcgtatt aacataactt tccttacatt tttttaaagc acgtcactat ccacgtgacc 60tagccacgcg ataccaagta ttcatccata atgacacact catgacgtcc ggaggacgtc 120atcatcgtcc agtcacgtgc caaggcacat gactaatcat aacaccttat gactagcttc 180tgaatcgcta cacagttcca attcgcaaat aaactcgaaa tgacgaaatg ccataataaa 240aatgacgaaa ctcgagattg agagcagcac atgcactgaa gtggtggaca accagcgtat 300ccggagacac gacggatcca gcaccatgga agctggccga aaaagagatc cccagcacat 360tgagcaacca agtcagctca attgagtaac atcacacact cagatcgagt ctgatggtgg 420tccccttttg ttccttcact tgaaaaataa ttgaaaataa caataacaat aaaaataaaa 480acaaaataaa aataaaaata aaaataaaaa taaaaaaata aaaaaacctt gccgcattta 540gcgtcagcca ccccccgcat tgacctgagt acgttggatt gaccccgatc ctgcacgtcg 600agcgtggtcg gccaaaaagc gcccgtggct ggtgagtcag aaatagcagg gttgcaagag 660agagctgcgc aacgagcaat aaacggtgtt tttttcgctt ctgtgctgct tagagtggag 720agccgaccct cgccatgctc acgtgaccat tcacgtggtt gcaaactcca ccttagtata 780gccgtgtccc tctcgctacc cattatcgca tcgtactcca gccacatttt tttgttcccc 840gctaaatccg gaaccttatc tgggtcacgt gaaattgcaa tctcgacagg aggttatact 900tatagagtga gacactccac gcaaggtgtt gcaagtcaat tgacaccacc tcacctcaga 960ctaacatcca ca 972971425DNAYarrowia lipolytica 97gaatctgccc ccacatttta tctccgcttt tgactgtttt tctcccccct ttcacactct 60gcttttggct acataaaccc cgcaccgttt ggaactctgt tggtccgggg aagccgccgt 120taggtgtgtc agatggagag cgccagacga gcagaaccga gggacagcgg atcgggggag 180ggctgtcacg tgacgaaggg cactgttgac gtggtgaatg tcgcccgttc tcacgtgacc 240cgtctcctct atatgtgtat ccgcctcttt gtttggtttt ttttctgctt cccccccccc 300ccccccaccc caatcacatg ctcagaaagt agacatctgc atcgtcctgc atgccatccc 360acaagacgaa caagtgatag gccgagagcc gaggacgagg tggagtgcac aaggggtagg 420cgaatggtac gattccgcca agtgagactg gcgatcggga gaagggttgg tggtcatggg 480ggatagaatt tgtacaagtg gaaaaaccac tacgagtagc ggatttgata ccacaagtag 540cagagatata cagcaatggt gggagtgcaa gtatcggaat gtactgtacc tcctgtactc 600gtactcgtac ggcactcgta gaaacggggc aatacggggg agaagcgatc gcccgtctgt 660tcaatcgcca caagtccgag taatgctcga gtatcgaagt cttgtacctc cctgtcaatc 720atggcaccac tggtcttgac ttgtctattc atactggaca agcgccagag ttaagcttgt 780agcgaatttc gccctcggac atcaccccat acgacggaca cacatgcccg acaaacagcc 840tctcttattg tagctgaaag tatattgaat gtgaacgtgt acaatatcag gtaccagcgg 900gaggttacgg ccaaggtgat accggaataa ccctggcttg gagatggtcg gtccattgta 960ctgaagtgtc cgtgtcgttt ccgtcactgc cccaattgga catgtttgtt tttccgatct 1020ttcgggcgcc ctctccttgt ctccttgtct gtctcctgga ctgttgctac cccatttctt 1080tggcctccat tggttcctcc ccgtctttca cgtcgtctat ggttgcatgg tttcccttat 1140acttttcccc acagtcacat gttatggagg ggtctagatg gaggcctaat tttgacgtgc 1200aaggggcgaa ttggggcgag aaacacgtcg tggacatggt gcaaggcccg cagggttgat 1260tcgacgcttt tccgcgaaaa aaacaagtcc aaataccccc gtttattctc cctcggctct 1320cggtatttca catgaaaact ataacctaga ctacacgggc aaccttaacc ccagagtata 1380cttatatacc aaagggatgg gtcctcaaaa atcacacaag caacg 142598500DNAYarrowia lipolytica 98cgccattcgg ttccttccag accattccag atcaatccac ctcttcttat ctcaggtggg 60tgtgctgaca tcagaccccg tagcccttct cccagtggcg aacagcaggc ataaaacagg 120gccattgagc agagcaaaca aggtcggtga aatcgtcgaa aaagtcggaa aacggttgca 180agaaattgga gcgtcacctg ccaccctcca ggctctatat aaagcattgc cccaattgct 240aacgcttcat atttacacct ttggcacccc agtccatccc tccaataaaa tgtactacat 300gggacacaac aagagaggat gcgcgcccaa accctaacct agcacatgca cgatgattct 360ctttgtctgt gaaaaaattt ttccaccaaa atttccccat tgggatgaaa ccctaaccgc 420aaccaaaagt ttttaactat catcttgtac gtcacggttt ccgattcttc tcttctcttt 480catcatcatc acttgtgacc 50099494DNAYarrowia lipolytica 99aactaccata aagtaccgag aaatataggc aattgtacaa attgtccacc tccttcactt 60acattaccga accatggcca tatcaccaaa ataccccgag tgctaaaaca cctccctcca 120aatgttctct taccttccac cgaaaaccga tcttattatc ccaacgcttg ttgtggcttg 180acgcgccgca cccgctgggc ttgccatttc gataccaatc caagaggaaa agctcatgag 240aaacaatcgg aatatcacga gaacggcctg gcgaaccaac aggatatttt tgaatataat 300tacccctcga atctagtcat atctatgtct actgtagact tgggcggcat catgatgtac 360attattttag cgtctggaac cctaaagttc acgtacaatc atgtgacaaa cgaggctaaa 420aaatgtcaat ttcgtatatt agtgttatta cgtggctcac atttccgaat catctaccac 480cccccaccta aaaa 494100440DNAYarrowia lipolytica 100tttttttaat tttcatattt attttcatat ttattttcat atttattttc atttatttat 60tcatgtattt atttattact ttttaagtat tttaaactcc tcactaaacc gtcgattgca 120caatattaac cttcattaca cctgcagcgt ggtttttgtg gtcgttagcc gaagtcttcc 180aacgtgggta taagtaggaa caattgggcc gattttttga gccgtctaaa tctctcgact 240caattgatct gctgtcgaaa atccggctct ctagctcctt ttccccgtcc gctggagctc 300ctcttcattg tgccgttttt ccaacattta actttgccac ccaccaccac ccccactacc 360atcacccact cgatctctgt tcgtgtcacc acgactttgt cttctcacac atactctgtt 420tgtgcaccac acattgcgaa 440101434DNAYarrowia lipolytica 101gctacaatag ctttattggc cctattgagc acgctacaat tcggtccagt atgtacaacg 60tctatgcgca ctaacggcca tacagtgagt tacagcacac ccaaaagtaa ccctgcctga 120cctgtctgcc tgagacagga agattaactc ttgtagtgac cgagctcgat aagactcaag 180ccacacaatt tttttatagc cttgcttcaa gagtcgccaa aatgacatta cacaactcca 240cggaccgtcg gttccatgtc cacacccttg gatgggtaag cgctccacgc acgtaccacg 300tgcattgagt ttaaccacaa acataggtct gtgtcccaga gttaccctgc tgcatcagcc 360aagtcttgaa agcaaaattt cttgcacaat ttttcctctt cttttcttca ctgatcgcag 420tccaaacaca aaca 434102752DNAYarrowia lipolytica 102gcgctctgat ccacttgtat ggctccaagt tcagtgtacc aagtagttgg tgatgcaggg 60agggatgtct ctatccacca ataatgaact catgggcgaa attgtttctg ttaaacactc 120caactgtcgt tttaaatctc attctctttg catttggact ccattcgctt ccgttgggcc 180aatataatcc atcgtaacgt actttagatg gaaatttagt tacctgctac ttgtctcaac 240accccaacag gggctgttcg acagaggtaa tagagcgtca atgggttaat aaaaacacac 300tgtcgatttt cactcattgt ctttatgata ttacctgttt tccgctgtta tcaatgccga 360gcatcgtgtt atatcttcca ccccaactac ttgcatttac ttaactatta cctcaactat 420ttacaccccg aattgttacc tcccaataag taactttatt tcaaccaatg ggacgagagc 480atctctgaga acatcgatct atctctgtca atattgccca gaatcgttcg aaaaaaaaca 540ccaaaaggtt tacagcgcca ttataaatat aaattcgttg tcaattcccc cgcaatgtct 600gttgaaatct cattttgaga ccttccaaca ttaccctctc tcccgtctgg tcacatgacg 660tgactgcttc ttcccaaaac gaacactccc aactcttccc ccccgtcagt gaaaagtata 720catccgacct ccaaatcttt tcttcactca ac 752103402DNAYarrowia lipolytica 103agagacgggt tggcggcgta tttgtgtccc aaaaaacagc cccaattgcc ccaattgacc 60ccaaattgac ccagtagcgg gcccaacccc ggcgagagcc cccttcaccc cacatatcaa 120acctcccccg gttcccacac ttgccgttaa gggcgtaggg tactgcagtc tggaatctac 180gcttgttcag actttgtact agtttctttg tctggccatc cgggtaaccc atgccggacg 240caaaatagac tactgaaaat ttttttgctt tgtggttggg actttagcca agggtataaa 300agaccaccgt ccccgaatta cctttcctct tcttttctct ctctccttgt caactcacac 360ccgaaatcgt taagcatttc cttctgagta taagaatcat tc 402104491DNAYarrowia lipolytica 104gctgcgctga tctggacacc acagaggttc cgagcacttt aggttgcacc aaatgtccca 60ccaggtgcag gcagaaaacg ctggaacagc gtgtacagtt tgtcttagca aaaagtgaag 120gcgctgaggt cgagcagggt ggtgtgactt gttatagcct ttagagctgc gaaagcgcgt 180atggatttgg ctcatcaggc cagattgagg gtctgtggac acatgtcatg ttagtgtact 240tcaatcgccc cctggatata gccccgacaa taggccgtgg cctcattttt ttgccttccg 300cacatttcca ttgctcggta cccacacctt gcttctcctg cacttgccaa ccttaatact 360ggtttacatt gaccaacatc ttacaagcgg ggggcttgtc tagggtatat ataaacagtg 420gctctcccaa tcggttgcca gtctcttttt tcctttcttt ccccacagat tcgaaatcta 480aactacacat c 4911051213DNAYarrowia lipolytica 105gatgaggaat agacaagcgg gtatttattg tatgaataaa gattatgtat tgattgcaaa 60aaagtgcatt tgtagatgtg gtttattgta gagagtacgg tatgtactgt acgaacatta 120ggagctactt ctacaagtag attttcttaa caagggtgaa atttactagg aagtacatgc 180atatttcgtt agtagaatca caaaagaaat gtacaagcac gtactacttg tactccacaa 240tgtggagtgg gagcaaaaaa attggacgac accggaatcg aaccggggac ctcgcgcatg 300ctaagcgcat gtgataacca actacaccag acgcccaaga actttcttgg tgattatgga 360atacgtggtc tgctatatct caattttgct gtaatgaatc attagaatta aaaaaaaaac 420cccatttttg tgtgattgtc ggccaagaga tggaacagga agaatacgtg aacaagcgag 480cacgaatgcc atatgctctt ctgaacaacc gagtccgaat ccgatttgtg ggtatcacat 540gtctcaagta gctgaaatgt atttcgctag aataaaataa atgagattaa gaattaaaaa 600tattggaata tattttccta gaatagaaac tttggatttt ttttcggcta ttacagtctg 660aactggacaa acggctgact atatataaat attattgggt ctgttttctt gtttatgtcg 720aaattatctg ggttttacta ctgtgtcgtc gagtatagag tggcctgact ggagaaaatg 780cagtagtatg gacagtaggt actgccagcc agagaagttt ttggaattga tacttgagtc 840atttttccat tccccattcc ccattccaac acaatcaact gtttctgaac attttccaaa 900acgcggagat gtatgtcact tggcactgca agtctcgatt caaaatgcat ctctttcaga 960ccaaagtgtc atcagctttg tttggcccca aattaccgca aatacttgtc gaaattgaag 1020tgcaatacgg cctcgtctgc catgaaacct gcctattctc ttcaaattgg cgtcaggttt 1080cacgtccagc attcctcgcc cagacagagt tgctatggtt gaatcgtgta

ctgttaatat 1140atgtatgtat tatactcgta ctacgatata ctgttcaata gagtctctta taatcgtacg 1200acgattctgg gca 121310634PRTYarrowia lipolytica 106Met Leu Ser Arg Asn Leu Ser Lys Phe Ala Arg Ala Gly Leu Ile Arg 1 5 10 15 Pro Ala Thr Thr Ser Thr His Thr Arg Leu Phe Ser Val Ser Ala Arg 20 25 30 Arg Leu 10732PRTYarrowia lipolytica 107Met Leu Arg Leu Ile Arg Pro Arg Leu Ala Ala Leu Ala Arg Pro Thr 1 5 10 15 Thr Arg Ala Pro Gln Ala Leu Asn Ala Arg Thr His Ile Val Ser Val 20 25 30 10853PRTSaccharomyces cerevisiae 108Met Phe Gln Arg Ser Gly Ala Ala His His Ile Lys Leu Ile Ser Ser 1 5 10 15 Arg Arg Cys Arg Phe Lys Ser Ser Phe Ala Val Ala Leu Asn Ala Ala 20 25 30 Ser Lys Leu Val Thr Pro Lys Ile Leu Trp Asn Asn Pro Ile Ser Leu 35 40 45 Val Ser Lys Glu Met 50 10977PRTYarrowia lipolytica 109Met Leu Arg Val Gly Arg Ile Gly Thr Lys Thr Leu Ala Ser Ser Ser 1 5 10 15 Leu Arg Phe Val Ala Gly Ala Arg Pro Lys Ser Thr Leu Thr Glu Ala 20 25 30 Val Leu Glu Thr Thr Gly Leu Leu Lys Thr Thr Pro Gln Asn Pro Glu 35 40 45 Trp Ser Gly Ala Val Lys Gln Ala Ser Arg Leu Val Glu Thr Asp Thr 50 55 60 Pro Ile Arg Asp Pro Phe Ser Ile Val Ser Gln Glu Met 65 70 75 1101125PRTAspergillus nidulans 110Met Ala Ser Val Leu Ile Arg Arg Lys Phe Gly Thr Glu Gly Gly Ser 1 5 10 15 Asp Ala Glu Pro Ser Trp Leu Lys Arg Gln Val Thr Gly Cys Leu Gln 20 25 30 Ser Ile Ser Arg Arg Ala Cys Ile His Pro Ile His Thr Ile Val Val 35 40 45 Ile Ala Leu Leu Ala Ser Thr Thr Tyr Val Gly Leu Leu Glu Gly Ser 50 55 60 Leu Phe Asp Ser Phe Arg Asn Ser Asn Asn Val Ala Gly His Val Asp 65 70 75 80 Val Asp Ser Leu Leu Leu Gly Asn Arg Ser Leu Arg Leu Gly Glu Gly 85 90 95 Thr Ser Trp Lys Trp Gln Val Glu Asp Ser Leu Asn Gln Asp Asp Gln 100 105 110 Lys Val Gly Asn Pro Glu Leu Lys Arg Glu Val Asp Gln His Leu Ala 115 120 125 Leu Thr Thr Leu Ile Phe Pro Asp Ser Ile Ser Lys Ser Ala Ser Thr 130 135 140 Ala Pro Ala Ala Asp Ala Leu Pro Val Pro Ala Asn Ala Ser Ala Gln 145 150 155 160 Leu Leu Pro His Thr Pro Asn Leu Phe Ser Pro Phe Ser His Asp Ser 165 170 175 Ser Leu Val Phe Thr Leu Pro Phe Asp Gln Val Pro Gln Phe Leu Arg 180 185 190 Ala Val Gln Glu Leu Pro Asp Pro Thr Leu Glu Asp Asp Glu Gly Glu 195 200 205 Gln Lys Arg Trp Ile Met Arg Ala Thr Arg Gly Pro Val Ser Gly Pro 210 215 220 Asn Gly Thr Ile Ser Ser Trp Leu Ser Asp Ala Trp Ser Ser Phe Val 225 230 235 240 Asp Leu Ile Lys His Ala Glu Thr Ile Asp Ile Ile Ile Met Thr Leu 245 250 255 Gly Tyr Leu Ala Met Tyr Leu Ser Phe Ala Ser Leu Glu Phe Ser Met 260 265 270 Lys Gln Leu Gly Ser Lys Phe Trp Leu Ala Thr Thr Val Leu Phe Ser 275 280 285 Gly Met Phe Ala Phe Leu Phe Gly Leu Leu Val Thr Thr Lys Phe Gly 290 295 300 Val Pro Leu Asn Leu Leu Leu Leu Ser Glu Gly Leu Pro Phe Leu Val 305 310 315 320 Thr Thr Ile Gly Phe Glu Lys Pro Ile Ile Leu Thr Arg Ala Val Leu 325 330 335 Ser Ala Ser Ile Asp Lys Lys Arg Gln Gly Ser Ala Thr Ser Thr Pro 340 345 350 Ser Ser Ile Gln Asp Ser Ile Gln Thr Ala Ile Arg Glu Gln Gly Phe 355 360 365 Glu Ile Ile Arg Asp Tyr Cys Ile Glu Ile Ser Ile Leu Ile Ala Gly 370 375 380 Ala Ala Ser Gly Val Gln Gly Gly Leu Gln Gln Phe Cys Phe Leu Ala 385 390 395 400 Ala Trp Ile Leu Phe Phe Asp Cys Ile Leu Leu Phe Thr Phe Tyr Thr 405 410 415 Thr Ile Leu Cys Ile Lys Leu Glu Ile Thr Arg Ile Arg Arg His Val 420 425 430 Thr Leu Arg Lys Ala Leu Glu Glu Asp Gly Thr Thr Gln Ser Val Ala 435 440 445 Glu Lys Val Ala Ser Ser Asn Asp Trp Phe Gly Ala Gly Ser Asp Asn 450 455 460 Ser Asp Ala Asp Asp Ala Ser Val Phe Gly Arg Lys Ile Lys Ser Asn 465 470 475 480 Asn Val Arg Arg Phe Lys Phe Leu Met Val Gly Gly Phe Val Leu Val 485 490 495 Asn Val Val Asn Met Thr Ala Ile Pro Phe Arg Asn Ser Ser Leu Ser 500 505 510 Pro Leu Cys Asn Val Phe Ser Pro Thr Pro Ile Asp Pro Phe Lys Val 515 520 525 Ala Glu Asn Gly Leu Asp Ala Thr Tyr Val Ser Ala Lys Ser Gln Lys 530 535 540 Leu Glu Thr Leu Val Thr Val Val Pro Pro Ile Lys Val Lys Leu Glu 545 550 555 560 Tyr Pro Ser Val His Tyr Ala Lys Leu Gly Glu Ser Gln Ser Ile Glu 565 570 575 Ile Glu Tyr Thr Asp Gln Leu Leu Asp Ala Val Gly Gly His Val Leu 580 585 590 Asn Gly Val Leu Lys Ser Ile Glu Asp Pro Val Ile Ser Lys Trp Ile 595 600 605 Thr Ala Val Leu Thr Ile Ser Ile Val Leu Asn Gly Tyr Leu Phe Asn 610 615 620 Ala Ala Arg Trp Ser Ile Lys Glu Pro Gln Ala Ala Pro Ala Pro Lys 625 630 635 640 Glu Pro Ala Lys Pro Lys Val Tyr Pro Lys Thr Asp Leu Asn Ala Gly 645 650 655 Pro Lys Arg Ser Met Glu Glu Cys Glu Ala Met Leu Lys Ala Lys Lys 660 665 670 Ala Ala Tyr Leu Ser Asp Glu Leu Leu Ile Glu Leu Ser Leu Ser Gly 675 680 685 Lys Leu Pro Gly Tyr Ala Leu Leu Lys Ser Leu Glu Asn Glu Glu Leu 690 695 700 Met Ser Arg Val Asp Ala Phe Leu Arg Ala Val Lys Leu Arg Arg Ala 705 710 715 720 Val Val Ser Arg Thr Pro Ala Thr Ser Ala Val Thr Ser Ser Leu Glu 725 730 735 Thr Ser Lys Leu Pro Tyr Lys Asp Tyr Asn Tyr Ala Leu Val His Gly 740 745 750 Ala Cys Cys Glu Asn Val Ile Gly Thr Leu Pro Leu Pro Leu Gly Val 755 760 765 Ala Gly Pro Leu Val Thr Asp Gly Gln Ser Tyr Phe Ile Pro Met Ala 770 775 780 Thr Ile Glu Gly Val Leu Val Ala Ser Ala Ser Arg Gly Ala Lys Ala 785 790 795 800 Ile Asn Ala Gly Gly Gly Ala Val Ile Val Leu Thr Gly Asp Gly Met 805 810 815 Thr Arg Gly Pro Cys Val Gly Phe Pro Thr Leu Ala Arg Ala Ala Ala 820 825 830 Ala Lys Val Trp Leu Asp Ser Glu Glu Gly Lys Ser Val Met Thr Ala 835 840 845 Ala Phe Asn Ser Thr Ser Arg Phe Ala Arg Leu Gln His Leu Lys Thr 850 855 860 Ala Leu Ala Gly Thr Tyr Leu Tyr Ile Arg Phe Lys Thr Thr Thr Gly 865 870 875 880 Asp Ala Met Gly Met Asn Met Ile Ser Lys Gly Val Glu Lys Ala Leu 885 890 895 His Val Met Ala Thr Glu Cys Gly Phe Asp Asp Met Ala Thr Ile Ser 900 905 910 Val Ser Gly Asn Phe Cys Thr Asp Lys Lys Ala Ala Ala Leu Asn Trp 915 920 925 Ile Asp Gly Arg Gly Lys Ser Val Val Ala Glu Ala Ile Ile Pro Gly 930 935 940 Asp Val Val Arg Asn Val Leu Lys Ser Asp Val Asp Ala Leu Val Glu 945 950 955 960 Leu Asn Thr Ser Lys Asn Leu Ile Gly Ser Ala Met Ala Gly Ser Leu 965 970 975 Gly Gly Phe Asn Ala His Ala Ser Asn Ile Val Thr Ala Ile Phe Leu 980 985 990 Ala Thr Gly Gln Asp Pro Ala Gln Asn Val Glu Ser Ser Ser Cys Ile 995 1000 1005 Thr Thr Met Lys Asn Thr Asn Gly Asn Leu Gln Thr Ala Val Ser 1010 1015 1020 Met Pro Ser Ile Glu Val Gly Thr Ile Gly Gly Gly Thr Ile Leu 1025 1030 1035 Glu Ala Gln Gly Ala Met Leu Asp Leu Leu Gly Val Arg Gly Ser 1040 1045 1050 His Pro Thr Asn Pro Gly Asp Asn Ala Arg Gln Leu Ala Arg Ile 1055 1060 1065 Val Ala Ala Ala Val Leu Ala Gly Phe Leu Ser Leu Cys Ser Ala 1070 1075 1080 Leu Ala Ala Gly His Leu Val Arg Ala His Met Ala His Asn Arg 1085 1090 1095 Ser Ala Ala Pro Thr Arg Ser Ala Thr Pro Val Ser Ala Ala Val 1100 1105 1110 Gly Ala Thr Arg Gly Leu Ser Met Thr Ser Ser Arg 1115 1120 1125 1111210PRTGibberella zeae 111Met Ala Ser Ile Leu Leu Pro Lys Lys Phe Arg Gly Glu Thr Ala Pro 1 5 10 15 Ala Glu Lys Thr Thr Pro Ser Trp Ala Ser Lys Arg Leu Thr Pro Ile 20 25 30 Ala Gln Phe Ile Ser Arg Leu Ala Cys Ser His Pro Ile His Thr Val 35 40 45 Val Leu Val Ala Val Leu Ala Ser Thr Ser Tyr Val Gly Leu Leu Gln 50 55 60 Glu Ser Phe Phe Ser Thr Asp Leu Pro Thr Val Gly Lys Ala Asp Trp 65 70 75 80 Ser Ser Leu Val Glu Gly Ser Arg Val Leu Arg Ala Gly Pro Glu Thr 85 90 95 Ala Trp Asn Trp Lys Ala Ile Glu Gln Asp Ser Ile Gln His Ala Gly 100 105 110 Ala Asp Ala Asp His Leu Ala Leu Leu Thr Leu Val Phe Pro Asp Thr 115 120 125 His Ser Ala Glu Ser Ser Ser Thr Ala Pro Arg Ser Ser His Val Pro 130 135 140 Val Pro Gln Asn Leu Ser Ile Thr Pro Leu Pro Ser Thr Lys Asn Ser 145 150 155 160 Phe Thr Ala Tyr Ser Gln Asp Ser Ile Leu Ala Tyr Ser Leu Pro Tyr 165 170 175 Ala Glu Gly Pro Asp Val Val Gln Trp Ala Asn Asn Ala Trp Thr Glu 180 185 190 Phe Leu Asp Leu Leu Lys Asn Ala Glu Thr Leu Asp Ile Val Ile Met 195 200 205 Phe Leu Gly Tyr Thr Ala Met His Leu Thr Phe Val Ser Leu Phe Leu 210 215 220 Ser Met Arg Lys Ile Gly Ser Lys Phe Trp Leu Gly Ile Cys Thr Leu 225 230 235 240 Phe Ser Ser Val Phe Ala Phe Leu Phe Gly Leu Ile Val Thr Thr Lys 245 250 255 Leu Gly Val Pro Ile Ser Val Ile Leu Leu Ser Glu Gly Leu Pro Phe 260 265 270 Leu Val Val Thr Ile Gly Phe Glu Lys Asn Ile Val Leu Thr Arg Ala 275 280 285 Val Met Ser His Ala Ile Glu His Arg Arg Gln Ile Gln Asn Ser Lys 290 295 300 Ser Gly Lys Gly Ser Pro Glu Arg Ser Met Gln Asn Val Ile Gln Tyr 305 310 315 320 Ala Val Gln Ser Ala Ile Lys Glu Lys Gly Phe Glu Ile Met Arg Asp 325 330 335 Tyr Ala Ile Glu Ile Val Ile Leu Ala Leu Gly Ala Ala Ser Gly Val 340 345 350 Gln Gly Gly Leu Gln His Phe Cys Phe Leu Ala Ala Trp Thr Leu Phe 355 360 365 Phe Asp Phe Ile Leu Leu Phe Thr Phe Tyr Thr Ala Ile Leu Ser Ile 370 375 380 Lys Leu Glu Ile Asn Arg Ile Lys Arg His Val Asp Met Arg Met Ala 385 390 395 400 Leu Glu Asp Asp Gly Val Ser Arg Arg Val Ala Glu Asn Val Ala Lys 405 410 415 Ser Asp Gly Asp Trp Thr Arg Val Lys Gly Asp Ser Ser Leu Phe Gly 420 425 430 Arg Lys Ser Ser Ser Val Pro Thr Phe Lys Val Leu Met Ile Leu Gly 435 440 445 Phe Ile Phe Val Asn Ile Val Asn Ile Cys Ser Ile Pro Phe Arg Asn 450 455 460 Pro Arg Ser Leu Ser Thr Ile Arg Thr Trp Ala Ser Ser Leu Gly Gly 465 470 475 480 Val Val Ala Pro Leu Ser Val Asp Pro Phe Lys Val Ala Ser Asn Gly 485 490 495 Leu Asp Ala Ile Leu Ala Ala Ala Lys Ser Asn Asn Arg Pro Thr Leu 500 505 510 Val Thr Val Leu Thr Pro Ile Lys Tyr Glu Leu Glu Tyr Pro Ser Ile 515 520 525 His Tyr Ala Leu Gly Ser Ala Ile Asn Gly Asn Asn Ala Glu Tyr Thr 530 535 540 Asp Ala Phe His His His Phe Gln Gly Tyr Gly Val Gly Gly Arg Met 545 550 555 560 Val Gly Gly Ile Leu Lys Ser Leu Glu Asp Pro Val Leu Ser Lys Trp 565 570 575 Ile Val Ile Ala Leu Ala Leu Ser Val Ala Leu Asn Gly Tyr Leu Phe 580 585 590 Asn Val Ala Arg Trp Gly Ile Lys Asp Pro Asn Val Pro Glu His Asn 595 600 605 Ile Asp Arg Asn Glu Leu Ala Arg Ala Gln Gln Phe Asn Asp Thr Gly 610 615 620 Ser Ala Thr Leu Pro Leu Gly Glu Tyr Val Pro Pro Thr Pro Met Arg 625 630 635 640 Thr Glu Pro Ser Thr Pro Ala Ile Thr Asp Asp Glu Ala Glu Gly Leu 645 650 655 Gln Met Thr Lys Ala Arg Ser Asp Lys Leu Pro Asn Arg Pro Asn Glu 660 665 670 Glu Leu Glu Lys Leu Leu Ala Glu Lys Arg Val Lys Glu Met Ser Asp 675 680 685 Glu Glu Leu Val Ser Leu Ser Met Arg Cys Lys Ile Pro Gly Tyr Ala 690 695 700 Leu Leu Lys Thr Leu Gly Asp Phe Thr Arg Ala Val Lys Ile Arg Arg 705 710 715 720 Ser Ile Ile Ala Arg Asn Arg Ala Thr Ser Asp Leu Thr His Ser Leu 725 730 735 Glu Arg Ser Lys Leu Pro Phe Glu Lys Tyr Asn Trp Glu Arg Val Phe 740 745 750 Cys Ala Cys Cys Glu Asn Val Ile Gly Tyr Met Pro Leu Pro Val Gly 755 760 765 Val Ala Gly Arg Leu Val Thr Asp Gly Gln Ser Tyr Phe Ile Pro Met 770 775 780 Ala Thr Thr Glu Gly Val Leu Val Ala Ser Ala Ser Arg Gly Cys Lys 785 790 795 800 Ala Ile Asn Ala Gly Gly Gly Ala Val Thr Val Leu Thr Ala Asp Gly 805 810 815 Met Thr Arg Gly Pro Cys Val Ala Phe Glu Thr Leu Glu Arg Ala Gly 820 825 830 Ala Ala Lys Leu Trp Ile Asp Ser Glu Ala Gly Ser Asp Ile Met Lys 835 840 845 Lys Ala Phe Asn Ser Thr Ser Arg Phe Ala Arg Leu Gln Ser Met Lys 850 855 860 Thr Ala Leu Ala Gly Thr Asn Leu Tyr Ile Arg Phe Lys Thr Thr Thr 865 870 875 880 Gly Asp Ala Met Gly Met Asn Ile Ile Ser Lys Gly Val Glu His Ala 885 890 895 Leu Ser Val Met Ser Asn Glu Ala Gly Phe Asp Asp Met Gln Ile Val 900 905 910 Ser Val Ser Gly Asn Tyr Cys Thr Asp Lys Lys Ala Ala Ala Leu Asn 915 920 925 Trp Ile Asp Gly Arg Gly Lys Gly

Val Val Ala Glu Ala Ile Ile Pro 930 935 940 Gly Asp Val Val Arg Ser Val Leu Lys Ser Asp Val Asp Ala Leu Val 945 950 955 960 Glu Leu Asn Ile Ser Lys Asn Ile Ile Gly Ser Ala Met Ala Gly Ser 965 970 975 Val Gly Gly Phe Asn Ala His Ala Ala Asn Ile Val Ala Ala Ile Phe 980 985 990 Leu Ala Thr Gly Gln Asp Pro Ala Gln Val Val Glu Ser Ala Asn Cys 995 1000 1005 Ile Thr Leu Met Lys Asn Leu Arg Gly Ala Leu Gln Thr Ser Val 1010 1015 1020 Ser Met Pro Ser Leu Glu Val Gly Thr Leu Gly Gly Gly Thr Ile 1025 1030 1035 Leu Glu Pro Gln Ser Ala Met Leu Asp Leu Leu Gly Val Arg Gly 1040 1045 1050 Ser His Pro Thr Asn Pro Gly Asp Asn Ser Arg Arg Leu Ala Arg 1055 1060 1065 Ile Ile Gly Ala Ser Val Leu Ala Gly Glu Leu Ser Leu Cys Ser 1070 1075 1080 Ala Leu Ala Ala Gly His Leu Val Arg Ala His Met Gln His Asn 1085 1090 1095 Arg Ser Ala Ala Pro Ser Arg Ser Thr Thr Pro Ala Pro Met Thr 1100 1105 1110 Pro Val Arg Ser Phe Asp Thr Lys Val Arg Cys Gln Pro Asn Asn 1115 1120 1125 Lys Asp Ile Arg Asn Ile Leu Leu Thr Gln His Pro Ser Lys Pro 1130 1135 1140 Thr Ile Thr Tyr Ser Lys Arg Val Ile Lys Ser Thr Ile His Leu 1145 1150 1155 Asn Pro Leu Ile Leu Ala Leu Phe Asp Asn Ser Val Gln Thr Arg 1160 1165 1170 Asp Val Gln Leu Gly Asp Gln Val Ser Thr Arg Gly Thr Leu Asp 1175 1180 1185 Ala Val Gly Gly Pro Gln Gly Gly Gly Val Ala Ala Gly Gly Val 1190 1195 1200 Ala Arg Arg Val Val Gly Ser 1205 1210 1121173PRTNeurospora crassa 112Met Ile Ala Ser Ser Leu Leu Pro Ser Lys Phe Arg Gly Glu Gln Pro 1 5 10 15 Ala Thr Gln Ala Ala Thr Pro Ser Trp Ile Asn Lys Lys Val Thr Pro 20 25 30 Pro Leu Gln Lys Leu Ser Lys Ile Thr Ser Ser Asn Pro Ile His Thr 35 40 45 Ile Val Ile Val Ala Leu Leu Ala Ser Ser Ser Tyr Ile Gly Leu Leu 50 55 60 Gln Asn Ser Leu Phe Asn Val Thr Arg Ser Val Arg Lys Ala Glu Trp 65 70 75 80 Glu Ser Leu Gln Ala Gly Ser Arg Met Leu Arg Ala Gly Ala Asn Thr 85 90 95 Glu Trp Asn Trp Gln Asn His Asp Pro Glu Ala Pro Val Pro Ala Asn 100 105 110 Ala Asn His Leu Ala Leu Leu Thr Leu Val Phe Pro Asp Thr Ala Glu 115 120 125 Ser Gly Pro Val Val Ala Gln Thr Asn Thr Val Pro Leu Pro Ser Asn 130 135 140 Leu Ser Thr Thr Pro Leu Pro Ser Thr Ala Ile Ser Phe Thr Tyr Ser 145 150 155 160 Gln Asp Ser Ala Leu Ala Phe Ser Leu Pro Tyr Ser Gln Ala Pro Glu 165 170 175 Phe Leu Ala Asn Ala Gln Glu Ile Pro Asn Ala Val Ser Ser Gln Glu 180 185 190 Thr Ile Glu Thr Glu Arg Gly His Glu Lys Lys Met Trp Ile Met Lys 195 200 205 Ala Ala Arg Val Gln Thr Arg Ser Ser Thr Val Lys Trp Val Gln Asn 210 215 220 Ala Trp Val Glu Phe Thr Asp Leu Leu Arg Asn Ala Glu Thr Leu Asp 225 230 235 240 Ile Ile Ile Met Ala Leu Gly Tyr Ile Ser Met His Leu Thr Phe Val 245 250 255 Ser Leu Phe Leu Ser Met Arg Arg Met Gly Ser Asn Phe Trp Leu Ala 260 265 270 Thr Ser Val Ile Phe Ser Ser Ile Phe Ala Phe Leu Phe Gly Leu Leu 275 280 285 Val Thr Thr Lys Leu Gly Val Pro Met Asn Met Val Leu Leu Ser Glu 290 295 300 Gly Leu Pro Phe Leu Val Val Thr Ile Gly Phe Glu Lys Asn Ile Val 305 310 315 320 Leu Thr Arg Ala Val Leu Ser His Ala Ile Asp His Arg Arg Pro Thr 325 330 335 Glu Lys Ser Gly Lys Pro Ser Lys Gln Ala Asp Ser Ala His Ser Ile 340 345 350 Gln Ser Ala Ile Gln Leu Ala Ile Lys Glu Lys Gly Phe Asp Ile Val 355 360 365 Lys Asp Tyr Ala Ile Glu Ala Gly Ile Leu Val Leu Gly Ala Ala Ser 370 375 380 Gly Val Gln Gly Gly Leu Gln Gln Phe Cys Phe Leu Ala Ala Trp Ile 385 390 395 400 Leu Phe Phe Asp Cys Ile Leu Leu Phe Ser Phe Tyr Thr Ala Ile Leu 405 410 415 Cys Ile Lys Leu Phe Ile Asn Arg Ile Lys Arg His Val Gln Met Arg 420 425 430 Lys Ala Leu Glu Glu Asp Gly Val Ser Arg Arg Val Ala Glu Lys Val 435 440 445 Ala Gln Ser Asn Asp Trp Pro Arg Ala Asp Gly Lys Asp Gln Pro Gly 450 455 460 Thr Thr Ile Phe Gly Arg Gln Leu Lys Ser Thr His Ile Pro Lys Phe 465 470 475 480 Lys Val Met Met Val Thr Gly Phe Val Leu Ile Asn Val Leu Asn Leu 485 490 495 Cys Thr Ile Pro Phe Arg Ser Ala Asn Ser Ile Ser Ser Ile Ser Ser 500 505 510 Trp Ala Arg Gly Leu Gly Gly Val Val Thr Pro Pro Pro Val Asp Pro 515 520 525 Phe Lys Val Ala Ser Asn Gly Leu Asp Ile Ile Leu Glu Ala Ala Arg 530 535 540 Ala Asp Gly Arg Glu Thr Thr Val Thr Val Leu Thr Pro Ile Arg Tyr 545 550 555 560 Glu Leu Glu Tyr Pro Ser Thr His Tyr Asp Leu Pro Gln Lys Ser Ala 565 570 575 Glu Val Glu Gly Gly Asp Tyr Ala Asn Leu Gly Gly Tyr Gly Gly Arg 580 585 590 Met Val Gly Ser Ile Leu Lys Ser Leu Glu Asp Pro Thr Leu Ser Lys 595 600 605 Trp Ile Val Val Ala Leu Ala Leu Ser Val Ala Leu Asn Gly Tyr Leu 610 615 620 Phe Asn Ala Ala Arg Trp Gly Ile Lys Asp Pro Asn Val Pro Asp His 625 630 635 640 Pro Ile Asn Pro Lys Glu Leu Asp Glu Ala Gln Lys Phe Asn Asp Thr 645 650 655 Ala Ser Ala Thr Leu Pro Leu Gly Glu Tyr Met Lys Pro Thr Ala Pro 660 665 670 Ser Ser Pro Val Ala Pro Leu Thr Pro Ser Ser Thr Asp Asp Glu Asn 675 680 685 Asp Ala Gln Ala Lys Glu Asn Arg Ala Val Thr Leu Ala Ala Gln Arg 690 695 700 Ala Thr Thr Ile Arg Ser Gln Gly Glu Leu Asp Lys Met Thr Ala Glu 705 710 715 720 Lys Arg Thr His Glu Leu Asn Asp Glu Glu Thr Val His Leu Ser Leu 725 730 735 Lys Gly Lys Ile Pro Gly Tyr Ala Leu Glu Lys Thr Leu Lys Asp Phe 740 745 750 Thr Arg Ala Val Lys Val Arg Arg Ser Ile Ile Ser Arg Thr Lys Ala 755 760 765 Thr Thr Glu Leu Thr Asn Ile Leu Asp Arg Ser Lys Leu Pro Tyr Gln 770 775 780 Asn Val Asn Trp Ala Gln Val His Gly Ala Cys Cys Glu Asn Val Ile 785 790 795 800 Gly Tyr Met Pro Leu Pro Val Gly Val Ala Gly Pro Leu Val Thr Asp 805 810 815 Gly Gln Ser Phe Phe Val Pro Met Ala Thr Thr Glu Gly Val Leu Val 820 825 830 Ala Ser Thr Ser Arg Gly Cys Lys Ala Ile Asn Ser Gly Gly Gly Ala 835 840 845 Val Thr Val Leu Thr Ala Asp Gly Met Thr Arg Gly Pro Cys Val Gln 850 855 860 Phe Glu Thr Leu Glu Arg Ala Gly Ala Ala Lys Leu Trp Leu Asp Ser 865 870 875 880 Glu Lys Gly Gln Ser Ile Met Lys Lys Ala Phe Asn Ser Thr Ser Arg 885 890 895 Phe Ala Arg Leu Glu Thr Met Lys Thr Ala Met Ala Gly Thr Asn Leu 900 905 910 Tyr Ile Arg Phe Lys Ile Thr Thr Gly Asp Ala Met Gly Met Asn Met 915 920 925 Ile Ser Lys Gly Val Glu His Ala Leu Ser Val Met Tyr Asn Glu Gly 930 935 940 Phe Glu Asp Met Asn Ile Val Ser Leu Ser Gly Asn Tyr Cys Thr Asp 945 950 955 960 Lys Lys Ala Ala Ala Ile Asn Val Ile Asp Gly Arg Gly Lys Ser Val 965 970 975 Val Ala Glu Ala Ile Ile Pro Ala Asp Val Val Lys Asn Val Leu Lys 980 985 990 Thr Asp Val Asp Thr Leu Val Glu Leu Asn Val Asn Lys Asn Thr Ile 995 1000 1005 Gly Ser Ala Met Ala Gly Ser Met Gly Gly Phe Asn Ala His Ala 1010 1015 1020 Ala Asn Ile Val Ala Ala Ile Phe Leu Ala Thr Gly Gln Asp Pro 1025 1030 1035 Ala Gln Val Val Glu Ser Ala Asn Cys Ile Thr Leu Met Arg Asn 1040 1045 1050 Leu Arg Gly Asn Leu Gln Ile Ser Val Ser Met Pro Ser Ile Glu 1055 1060 1065 Val Gly Thr Leu Gly Gly Gly Thr Ile Leu Glu Pro Gln Ser Ala 1070 1075 1080 Met Leu Asp Met Leu Gly Val Arg Gly Pro His Pro Thr Asn Pro 1085 1090 1095 Gly Glu Asn Ala Arg Arg Leu Ala Arg Ile Val Ala Ala Ala Val 1100 1105 1110 Leu Ala Gly Glu Leu Ser Leu Cys Ser Ala Leu Ala Ala Gly His 1115 1120 1125 Leu Val Lys Ala His Met Ala His Asn Arg Ser Ala Pro Pro Thr 1130 1135 1140 Arg Thr Ser Thr Pro Ala Pro Ala Ala Ala Ala Gly Leu Thr Met 1145 1150 1155 Thr Ser Ser Asn Pro Asn Ala Ala Ala Val Glu Arg Ser Arg Arg 1160 1165 1170 1131045PRTSaccharomyces cerevisiae 113Met Ser Leu Pro Leu Lys Thr Ile Val His Leu Val Lys Pro Phe Ala 1 5 10 15 Cys Thr Ala Arg Phe Ser Ala Arg Tyr Pro Ile His Val Ile Val Val 20 25 30 Ala Val Leu Leu Ser Ala Ala Ala Tyr Leu Ser Val Thr Gln Ser Tyr 35 40 45 Leu Asn Glu Trp Lys Leu Asp Ser Asn Gln Tyr Ser Thr Tyr Leu Ser 50 55 60 Ile Lys Pro Asp Glu Leu Phe Glu Lys Cys Thr His Tyr Tyr Arg Ser 65 70 75 80 Pro Val Ser Asp Thr Trp Lys Leu Leu Ser Ser Lys Glu Ala Ala Asp 85 90 95 Ile Tyr Thr Pro Phe His Tyr Tyr Leu Ser Thr Ile Ser Phe Gln Ser 100 105 110 Lys Asp Asn Ser Thr Thr Leu Pro Ser Leu Asp Asp Val Ile Tyr Ser 115 120 125 Val Asp His Thr Arg Tyr Leu Leu Ser Glu Glu Pro Lys Ile Pro Thr 130 135 140 Glu Leu Val Ser Glu Asn Gly Thr Lys Trp Arg Leu Arg Asn Asn Ser 145 150 155 160 Asn Phe Ile Leu Asp Leu His Asn Ile Tyr Arg Asn Met Val Lys Gln 165 170 175 Phe Ser Asn Lys Thr Ser Glu Phe Asp Gln Phe Asp Leu Phe Ile Ile 180 185 190 Leu Ala Ala Tyr Leu Thr Leu Phe Tyr Thr Leu Cys Cys Leu Phe Asn 195 200 205 Asp Met Arg Lys Ile Gly Ser Lys Phe Trp Leu Ser Phe Ser Ala Leu 210 215 220 Ser Asn Ser Ala Cys Ala Leu Tyr Leu Ser Leu Tyr Thr Thr His Ser 225 230 235 240 Leu Leu Lys Lys Pro Ala Ser Leu Leu Ser Leu Val Ile Gly Leu Pro 245 250 255 Phe Thr Val Val Ile Ile Gly Glu Lys His Lys Val Arg Leu Ala Ala 260 265 270 Phe Ser Leu Gln Lys Phe His Arg Ile Ser Ile Asp Lys Lys Ile Thr 275 280 285 Val Ser Asn Ile Ile Tyr Glu Ala Met Phe Gln Glu Gly Ala Tyr Leu 290 295 300 Ile Arg Asp Tyr Leu Phe Tyr Ile Ser Ser Phe Ile Gly Cys Ala Ile 305 310 315 320 Tyr Ala Arg His Leu Pro Gly Leu Val Asn Phe Cys Ile Leu Ser Thr 325 330 335 Phe Met Leu Val Phe Asp Leu Leu Leu Ser Ala Thr Phe Tyr Ser Ala 340 345 350 Ile Leu Ser Met Lys Leu Phe Ile Asn Ile Ile His Arg Ser Thr Val 355 360 365 Ile Arg Gln Thr Leu Glu Glu Asp Gly Val Val Pro Thr Thr Ala Asp 370 375 380 Ile Ile Tyr Lys Asp Glu Thr Ala Ser Glu Pro His Phe Leu Arg Ser 385 390 395 400 Asn Val Ala Ile Ile Leu Gly Lys Ala Ser Val Ile Gly Leu Leu Leu 405 410 415 Leu Ile Asn Leu Tyr Val Phe Thr Asp Lys Leu Asn Ala Thr Ile Leu 420 425 430 Asn Thr Val Tyr Phe Asp Ser Thr Ile Tyr Ser Leu Pro Asn Phe Ile 435 440 445 Asn Tyr Lys Asp Ile Gly Asn Leu Ser Asn Gln Val Ile Ile Ser Val 450 455 460 Leu Pro Lys Gln Tyr Tyr Thr Pro Leu Lys Lys Tyr His Gln Ile Glu 465 470 475 480 Asp Ser Val Leu Leu Ile Ile Asp Ser Val Ser Asn Ala Ile Arg Asp 485 490 495 Gln Phe Ile Ser Lys Leu Leu Phe Phe Ala Phe Ala Val Ser Ile Ser 500 505 510 Ile Asn Val Tyr Leu Leu Asn Ala Ala Lys Ile His Thr Gly Tyr Met 515 520 525 Asn Phe Gln Pro Gln Ser Asn Lys Ile Asp Asp Leu Val Val Gln Gln 530 535 540 Lys Ser Ala Thr Ile Glu Phe Ser Glu Thr Arg Ser Met Pro Leu Ala 545 550 555 560 Ser Gly Leu Glu Thr Pro Val Thr Ala Lys Asp Ile Ile Ile Ser Glu 565 570 575 Glu Ile Gln Asn Asn Glu Cys Val Tyr Ala Leu Ser Ser Gln Asp Glu 580 585 590 Pro Ile Arg Pro Leu Ser Asn Leu Val Glu Leu Met Glu Lys Glu Gln 595 600 605 Leu Lys Asn Met Asn Asn Thr Glu Val Ser Asn Leu Val Val Asn Gly 610 615 620 Lys Leu Pro Leu Tyr Ser Leu Glu Lys Lys Leu Glu Asp Thr Leu Arg 625 630 635 640 Ala Val Leu Val Arg Arg Lys Ala Leu Ser Thr Leu Ala Glu Ser Pro 645 650 655 Ile Leu Val Ser Glu Lys Leu Pro Phe Arg Asn Tyr Asp Tyr Asp Arg 660 665 670 Val Phe Gly Ala Cys Cys Glu Asn Val Ile Gly Tyr Met Pro Ile Pro 675 680 685 Val Gly Val Ile Gly Pro Leu Ile Thr Asp Gly Thr Ser Tyr His Ile 690 695 700 Pro Met Ala Thr Thr Glu Gly Cys Leu Val Ala Ser Ala Met Pro Gly 705 710 715 720 Cys Lys Ala Ile Asn Ala Gly Gly Gly Ala Thr Thr Val Leu Thr Lys 725 730 735 Asp Gly Met Thr Arg Gly Pro Val Val Arg Phe Pro Thr Leu Ile Arg 740 745 750 Ser Gly Ala Cys Lys Ile Trp Leu Asp Ser Glu Glu Gly Gln Asn Ser 755 760 765 Ile Lys Lys Ala Phe Asn Ser Thr Ser Arg Phe Ala Arg Leu Gln His 770 775 780 Thr Gln Thr Cys Leu Ala Gly Asp Leu Leu Phe Met Arg Phe Arg Thr 785 790 795 800 Thr Thr Gly Asp Ala Met Gly Met Asn Met Ile Ser Lys Gly Val Glu 805 810 815 Tyr Ser Leu Lys Gln Met Val Glu Glu Tyr Gly Trp Glu Asp Met Glu

820 825 830 Val Val Ser Val Ser Gly Asn Tyr Cys Thr Asp Lys Lys Pro Ala Ala 835 840 845 Ile Asn Trp Ile Glu Gly Arg Gly Lys Ser Val Val Ala Glu Ala Thr 850 855 860 Ile Pro Gly Asp Val Val Lys Ser Val Leu Lys Ser Asp Val Ser Ala 865 870 875 880 Leu Val Glu Leu Asn Ile Ser Lys Asn Ile Val Gly Ser Ala Met Ala 885 890 895 Gly Ser Val Gly Gly Phe Asn Ala His Ala Ala Asn Leu Val Thr Ala 900 905 910 Leu Phe Leu Ala Leu Gly Gln Asp Pro Ala Gln Asn Val Glu Ser Ser 915 920 925 Asn Cys Ile Thr Leu Met Lys Glu Val Asp Gly Asp Leu Arg Ile Ser 930 935 940 Val Ser Met Pro Ser Ile Glu Val Gly Thr Ile Gly Gly Gly Thr Val 945 950 955 960 Leu Glu Pro Gln Gly Ala Met Leu Asp Leu Leu Gly Val Arg Gly Pro 965 970 975 His Pro Thr Glu Pro Gly Ala Asn Ala Arg Gln Leu Ala Arg Ile Ile 980 985 990 Ala Cys Ala Val Leu Ala Gly Glu Leu Ser Leu Cys Ser Ala Leu Ala 995 1000 1005 Ala Gly His Leu Val Gln Ser His Met Thr His Asn Arg Lys Thr 1010 1015 1020 Asn Lys Ala Asn Glu Leu Pro Gln Pro Ser Asn Lys Gly Pro Pro 1025 1030 1035 Cys Lys Thr Ser Ala Leu Leu 1040 1045 1141053PRTSaccharomyces cerevisiae 114Met Pro Pro Leu Phe Lys Gly Leu Lys Gln Met Ala Lys Pro Leu Ala 1 5 10 15 Tyr Val Ser Arg Phe Ser Ala Lys Pro Pro Ile His Ile Ile Leu Phe 20 25 30 Ser Leu Ile Ile Ser Ala Phe Ala Tyr Leu Ser Val Ile Gln Tyr Tyr 35 40 45 Phe Asn Gly Trp Gln Leu Asp Ser Asn Ser Val Phe Glu Thr Ala Pro 50 55 60 Asn Lys Asp Ser Asn Thr Leu Phe Gln Glu Cys Ser His Tyr Tyr Arg 65 70 75 80 Asp Ser Ser Leu Asp Gly Trp Val Ser Ile Thr Ala His Glu Ala Ser 85 90 95 Glu Leu Pro Ala Pro His His Tyr Tyr Leu Leu Asn Leu Asn Phe Ser 100 105 110 Pro Asn Glu Thr Asp Ser Ile Pro Glu Leu Ala Asn Thr Val Phe Glu 115 120 125 Lys Asp Asn Thr Lys Tyr Leu Leu Gln Glu Asp Leu Ser Val Ser Lys 130 135 140 Glu Ile Ser Ser Thr Asp Gly Thr Lys Trp Arg Leu Arg Ser Asp Arg 145 150 155 160 Lys Ser Leu Phe Asp Val Lys Thr Leu Ala Tyr Ser Leu Tyr Asp Val 165 170 175 Phe Ser Glu Asn Val Thr Gln Ala Asp Pro Phe Asp Val Leu Ile Met 180 185 190 Val Thr Ala Tyr Leu Met Met Phe Tyr Thr Ile Phe Gly Leu Phe Asn 195 200 205 Asp Met Arg Lys Thr Gly Ser Asn Phe Trp Leu Ser Ala Ser Thr Val 210 215 220 Val Asn Ser Ala Ser Ser Leu Phe Leu Ala Leu Tyr Val Thr Gln Cys 225 230 235 240 Ile Leu Gly Lys Glu Val Ser Ala Leu Thr Leu Phe Glu Gly Leu Pro 245 250 255 Phe Leu Val Val Val Val Gly Phe Lys His Lys Ile Lys Ile Ala Gln 260 265 270 Tyr Ala Leu Glu Lys Phe Glu Arg Val Gly Leu Ser Lys Arg Ile Thr 275 280 285 Thr Asp Glu Ile Val Phe Glu Ser Val Ser Glu Glu Gly Gly Arg Leu 290 295 300 Ile Gln Asp His Leu Leu Cys Ile Phe Ala Phe Ile Gly Cys Ser Met 305 310 315 320 Tyr Ala His Gln Leu Lys Thr Leu Thr Asn Phe Gly Ile Leu Ser Ala 325 330 335 Phe Ile Leu Ile Phe Glu Leu Ile Leu Thr Pro Thr Phe Tyr Ser Ala 340 345 350 Ile Leu Ala Leu Arg Leu Phe Met Asn Val Ile His Arg Ser Thr Ile 355 360 365 Ile Lys Gln Thr Leu Glu Glu Asp Gly Val Val Pro Ser Thr Ala Arg 370 375 380 Ile Ile Ser Lys Ala Glu Lys Lys Ser Val Ser Ser Phe Leu Asn Leu 385 390 395 400 Ser Val Val Val Ile Thr Met Lys Leu Ser Val Ile Leu Leu Phe Val 405 410 415 Phe Ile Asn Phe Tyr Asn Phe Gly Ala Asn Trp Val Asn Asp Ala Phe 420 425 430 Asn Ser Leu Tyr Phe Asp Lys Glu Arg Val Ser Leu Pro Asp Phe Ile 435 440 445 Thr Ser Asn Ala Ser Glu Asn Phe Lys Glu Gln Ala Ile Val Ser Val 450 455 460 Thr Pro Leu Leu Tyr Tyr Lys Pro Ile Lys Ser Tyr Gln Arg Ile Glu 465 470 475 480 Asp Met Val Leu Leu Leu Leu Arg Asn Val Ser Val Ala Ile Arg Asp 485 490 495 Arg Phe Val Ser Lys Leu Val Leu Ser Ala Leu Val Cys Ser Ala Val 500 505 510 Ile Asn Val Tyr Leu Leu Asn Ala Ala Arg Ile His Thr Ser Tyr Thr 515 520 525 Ala Asp Gln Leu Val Lys Thr Glu Val Thr Lys Lys Ser Phe Thr Ala 530 535 540 Pro Val Gln Lys Ala Ser Thr Pro Val Leu Thr Asn Lys Thr Val Ile 545 550 555 560 Ser Gly Ser Lys Val Lys Ser Leu Ser Ser Ala Gln Ser Ser Ser Ser 565 570 575 Gly Pro Ser Ser Ser Ser Glu Glu Asp Asp Ser Arg Asp Ile Glu Ser 580 585 590 Leu Asp Lys Lys Ile Arg Pro Leu Glu Glu Leu Glu Ala Leu Leu Ser 595 600 605 Ser Gly Asn Thr Lys Gln Leu Lys Asn Lys Glu Val Ala Ala Leu Val 610 615 620 Thr His Gly Lys Leu Pro Leu Tyr Ala Leu Glu Lys Lys Leu Gly Asp 625 630 635 640 Thr Thr Arg Ala Val Ala Val Arg Arg Lys Ala Leu Ser Ile Leu Ala 645 650 655 Glu Ala Pro Val Leu Ala Ser Asp Arg Leu Pro Tyr Lys Asn Tyr Asp 660 665 670 Tyr Asp Arg Val Phe Gly Ala Cys Cys Glu Asn Val Ile Gly Tyr Met 675 680 685 Pro Leu Pro Val Gly Val Ile Gly Pro Leu Val Thr Asp Gly Thr Ser 690 695 700 Tyr His Ile Pro Met Ala Thr Thr Glu Gly Cys Leu Val Ala Ser Ala 705 710 715 720 Met Arg Gly Cys Lys Ala Ile Asn Ala Gly Gly Gly Ala Thr Thr Val 725 730 735 Leu Thr Lys Asp Gly Met Ile Arg Gly Pro Val Val Arg Phe Pro Thr 740 745 750 Leu Lys Arg Ser Gly Ala Cys Lys Ile Trp Leu Asp Ser Glu Glu Gly 755 760 765 Gln Asn Ala Ile Lys Lys Ala Phe Asn Ser Thr Ser Arg Phe Ala Arg 770 775 780 Leu Gln His Ile Gln Thr Cys Leu Ala Gly Asp Leu Leu Phe Met Arg 785 790 795 800 Phe Arg Thr Thr Thr Ser Asp Ala Met Gly Met Asn Met Ile Ser Lys 805 810 815 Gly Val Glu Tyr Ser Leu Lys Gln Met Val Glu Glu Tyr Gly Trp Glu 820 825 830 Asp Met Glu Val Val Ser Val Ser Gly Asn Tyr Cys Thr Asp Lys Lys 835 840 845 Pro Ala Ala Ile Asn Trp Ile Glu Gly Arg Gly Lys Ser Val Val Ala 850 855 860 Glu Ala Thr Ile Pro Gly Asp Val Val Arg Lys Val Leu Lys Ser Asp 865 870 875 880 Val Ser Ala Leu Val Glu Leu Asn Ile Ala Lys Asn Leu Val Gly Ser 885 890 895 Ala Met Ala Gly Ser Val Gly Gly Glu Asn Ala His Ala Ala Asn Leu 900 905 910 Val Thr Ala Val Phe Leu Ala Leu Gly Gln Asp Pro Ala Gln Asn Val 915 920 925 Glu Ser Ser Asn Cys Ile Thr Leu Met Lys Glu Val Asp Gly Asp Leu 930 935 940 Arg Ile Ser Val Ser Met Pro Ser Ile Glu Val Gly Thr Ile Gly Gly 945 950 955 960 Gly Thr Val Leu Glu Pro Gln Gly Ala Met Leu Asp Leu Leu Ser Val 965 970 975 Arg Gly Pro His Ala Thr Ala Pro Gly Thr Asn Ala Arg Gln Leu Ala 980 985 990 Arg Ile Val Ala Cys Ala Val Leu Ala Gly Glu Leu Ser Leu Cys Ala 995 1000 1005 Ala Leu Ala Ala Gly His Leu Val Gln Ser His Met Thr His Asn 1010 1015 1020 Arg Lys Pro Ala Glu Pro Thr Lys Pro Asn Asn Leu Asp Ala Thr 1025 1030 1035 Asp Ile Asn Arg Leu Lys Asp Gly Ser Val Thr Cys Ile Lys Ser 1040 1045 1050 115999PRTYarrowia lipolytica 115Met Leu Gln Ala Ala Ile Gly Lys Ile Val Gly Phe Ala Val Asn Arg 1 5 10 15 Pro Ile His Thr Val Val Leu Thr Ser Ile Val Ala Ser Thr Ala Tyr 20 25 30 Leu Ala Leu Leu Asp Ile Ala Ile Pro Gly Glu Glu Gly Thr Gln Pro 35 40 45 Ile Ser Tyr Tyr His Pro Ala Ala Lys Ser Tyr Asp Asn Pro Ala Asp 50 55 60 Trp Thr His Ile Ala Glu Ala Asp Ile Pro Ser Asp Ala Tyr Arg Leu 65 70 75 80 Ala Phe Ala Gln Ile Arg Val Ser Asp Val Gln Gly Gly Glu Ala Pro 85 90 95 Thr Ile Pro Gly Ala Val Ala Val Ser Asp Leu Asp His Arg Ile Val 100 105 110 Met Asp Tyr Lys Gln Trp Ala Pro Trp Thr Ala Ser Asn Glu Gln Ile 115 120 125 Ala Ser Glu Asn His Ile Trp Lys His Ser Phe Lys Asp His Val Ala 130 135 140 Phe Ser Trp Ile Lys Trp Phe Arg Trp Ala Tyr Leu Arg Leu Ser Thr 145 150 155 160 Leu Ile Gln Gly Ala Asp Asn Phe Asp Ile Ala Val Val Ala Leu Gly 165 170 175 Tyr Leu Ala Met His Tyr Thr Phe Phe Ser Leu Phe Arg Ser Lys Arg 180 185 190 Lys Val Gly Ser His Phe Trp Leu Ala Ser Met Ala Leu Val Ser Ser 195 200 205 Phe Phe Ala Phe Leu Leu Ala Val Val Ala Ser Ser Ser Leu Gly Tyr 210 215 220 Arg Pro Ser Met Ile Thr Met Ser Glu Gly Leu Pro Phe Leu Val Val 225 230 235 240 Ala Ile Gly Phe Asp Arg Lys Val Asn Leu Ala Ser Glu Val Leu Thr 245 250 255 Ser Lys Ser Ser Gln Leu Ala Pro Met Val Gln Val Ile Thr Lys Ile 260 265 270 Ala Ser Lys Ala Leu Phe Glu Tyr Ser Leu Glu Val Ala Ala Leu Phe 275 280 285 Ala Gly Ala Tyr Thr Gly Val Pro Arg Leu Ser Gln Phe Cys Phe Leu 290 295 300 Ser Ala Trp Ile Leu Ile Phe Asp Tyr Met Phe Leu Leu Thr Phe Tyr 305 310 315 320 Ser Ala Val Ile Ala Ile Lys Phe Leu Ile Asn His Ile Lys Phe Asn 325 330 335 Arg Met Ile Gln Asp Ala Leu Lys Glu Asp Gly Val Ser Ala Ala Val 340 345 350 Ala Glu Lys Val Ala Asp Ser Ser Pro Asp Ala Lys Leu Asp Arg Lys 355 360 365 Ser Asp Val Ser Leu Phe Gly Ala Ser Gly Ala Ile Ala Val Phe Lys 370 375 380 Ile Phe Met Val Leu Gly Phe Leu Gly Leu Asn Leu Ile Asn Leu Thr 385 390 395 400 Ala Ile Pro His Leu Gly Lys Ala Ala Ala Ala Ala Gln Ser Val Thr 405 410 415 Pro Ile Thr Leu Ser Pro Glu Leu Leu His Ala Ile Pro Ala Ser Val 420 425 430 Pro Val Val Val Thr Phe Val Pro Ser Val Val Tyr Glu His Ser Gln 435 440 445 Leu Ile Leu Gln Leu Glu Asp Ala Leu Thr Phe Phe Leu Ala Ala Cys 450 455 460 Ser Lys Thr Ile Gly Asp Pro Val Ile Ser Lys Tyr Ile Phe Leu Cys 465 470 475 480 Leu Met Val Ser Thr Ala Leu Asn Val Tyr Leu Phe Gly Ala Thr Arg 485 490 495 Glu Val Val Arg Thr Gln Ser Val Lys Val Val Glu Lys His Val Pro 500 505 510 Ile Val Ile Glu Lys Pro Ser Glu Lys Glu Glu Asp Thr Ser Ser Glu 515 520 525 Asp Ser Ile Glu Leu Thr Val Gly Lys Gln Pro Lys Pro Val Thr Glu 530 535 540 Thr Arg Ser Leu Asp Asp Leu Glu Ala Thr Met Lys Ala Gly Lys Thr 545 550 555 560 Lys Leu Leu Glu Asp His Glu Val Val Lys Leu Ser Leu Glu Gly Lys 565 570 575 Leu Pro Leu Tyr Ala Leu Phe Lys Gln Leu Gly Asp Asn Thr Arg Ala 580 585 590 Val Gly Ile Arg Arg Ser Ile Ile Ser Gln Gln Ser Asn Thr Lys Thr 595 600 605 Leu Glu Thr Ser Lys Leu Pro Tyr Leu His Tyr Asp Tyr Asp Arg Val 610 615 620 Phe Gly Ala Cys Cys Glu Asn Val Ile Gly Tyr Met Pro Leu Pro Val 625 630 635 640 Gly Val Ala Gly Pro Met Asn Thr Asp Gly Lys Asn Tyr His Ile Pro 645 650 655 Met Ala Thr Thr Glu Gly Cys Leu Val Ala Ser Thr Met Arg Gly Cys 660 665 670 Lys Ala Ile Asn Ala Gly Gly Gly Val Thr Thr Val Leu Thr Gln Asp 675 680 685 Gly Met Thr Arg Gly Pro Cys Val Ser Phe Pro Ser Leu Lys Arg Ala 690 695 700 Gly Ala Ala Lys Ile Trp Leu Asp Glu Ser Glu Gly Leu Lys Ser Met 705 710 715 720 Arg Lys Ala Phe Asn Ser Thr Ser Arg Phe Ala Arg Leu Gln Ser Leu 725 730 735 His Ser Thr Leu Ala Gly Asn Leu Leu Phe Ile Arg Phe Arg Thr Thr 740 745 750 Thr Gly Asp Ala Met Gly Met Asn Met Ile Ser Lys Gly Val Glu His 755 760 765 Ser Leu Ala Val Met Val Lys Glu Tyr Gly Phe Pro Leu Met Asp Ile 770 775 780 Val Ser Val Ser Gly Asn Tyr Cys Thr Asp Lys Lys Pro Ala Ala Ile 785 790 795 800 Asn Trp Ile Glu Gly Arg Gly Lys Ser Val Val Ala Glu Ala Thr Ile 805 810 815 Pro Ala His Ile Val Lys Ser Val Leu Lys Ser Glu Val Asp Ala Leu 820 825 830 Val Glu Leu Asn Ile Ser Lys Asn Leu Ile Gly Ser Ala Met Ala Gly 835 840 845 Ser Val Gly Gly Phe Asn Ala His Ala Ala Asn Leu Val Thr Ala Ile 850 855 860 Tyr Leu Ala Thr Gly Gln Asp Pro Ala Gln Asn Val Glu Ser Ser Asn 865 870 875 880 Cys Ile Thr Leu Met Ser Asn Val Asp Gly Asn Leu Leu Ile Ser Val 885 890 895 Ser Met Pro Ser Ile Glu Val Gly Thr Ile Gly Gly Gly Thr Ile Leu 900 905 910 Glu Pro Gln Gly Ala Met Leu Glu Met Leu Gly Val Arg Gly Pro His 915 920 925 Ile Glu Thr Pro Gly Ala Asn Ala Gln Gln Leu Ala Arg Ile Ile Ala 930 935 940 Ser Gly Val Leu Ala Ala Glu Leu Ser Leu Cys Ser Ala Leu Ala Ala 945 950 955 960 Gly His Leu Val Gln Ser His Met Thr His Asn Arg Ser Gln Ala Pro 965 970 975 Thr Pro Ala Lys Gln Ser Gln Ala Asp Leu Gln Arg Leu Gln Asn Gly 980 985 990 Ser Asn Ile Cys Ile Arg Ser 995 1161289PRTArtificial SequenceDescription of Artificial Sequence Synthetic consensus sequence 116Xaa Met Ala Ser Xaa Leu Leu Xaa Xaa Arg Phe Xaa Xaa Glu Xaa

Xaa 1 5 10 15 Xaa Xaa Xaa Xaa Ala Xaa Pro Ser Trp Xaa Xaa Lys Xaa Leu Thr Xaa 20 25 30 Pro Ile Gln Xaa Ile Ser Arg Phe Ala Ala Xaa His Pro Ile His Thr 35 40 45 Ile Val Leu Val Ala Leu Leu Ala Ser Thr Ala Tyr Leu Gly Leu Leu 50 55 60 Gln Xaa Ser Leu Phe Xaa Trp Xaa Leu Xaa Ser Asn Xaa Xaa Xaa Xaa 65 70 75 80 Xaa Xaa Asp Xaa Thr Ser Leu Xaa Xaa Gly Ser Arg Xaa Leu Arg Xaa 85 90 95 Gly Xaa Xaa Thr Xaa Trp Arg Trp Xaa Xaa Ile Asp Xaa Xaa Xaa Ile 100 105 110 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Ala Ala Asp Ala Xaa 115 120 125 His Leu Ala Leu Xaa Thr Leu Val Phe Pro Asp Thr Gln Ser Xaa Glu 130 135 140 Xaa Ala Ser Thr Ile Pro Xaa Ala Xaa Xaa Val Pro Val Pro Xaa Asn 145 150 155 160 Xaa Ser Ile Xaa Leu Leu Pro Xaa Thr Xaa Xaa Ile Phe Thr Xaa Tyr 165 170 175 Ser Gln Asp Ser Ser Leu Xaa Phe Ser Leu Pro Tyr Ser Xaa Xaa Xaa 180 185 190 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 195 200 205 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 210 215 220 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Asp Ile Val Xaa Trp Xaa Xaa 225 230 235 240 Asn Ala Trp Xaa Xaa Phe Ser Asp Leu Ile Lys Asn Ala Asp Thr Phe 245 250 255 Asp Ile Ile Ile Met Xaa Leu Gly Tyr Leu Ala Met His Tyr Thr Phe 260 265 270 Xaa Ser Leu Phe Xaa Ser Met Arg Lys Leu Gly Ser Lys Phe Trp Leu 275 280 285 Ala Thr Ser Xaa Leu Phe Ser Ser Ile Phe Ala Phe Leu Leu Gly Leu 290 295 300 Leu Val Thr Thr Lys Leu Gly Xaa Val Pro Ile Ser Met Leu Leu Leu 305 310 315 320 Ser Glu Gly Leu Pro Phe Leu Val Val Thr Ile Gly Phe Glu Lys Lys 325 330 335 Ile Val Leu Thr Arg Ala Val Leu Ser Xaa Ala Ile Asp Xaa Arg Arg 340 345 350 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Ser Xaa 355 360 365 Xaa Xaa Ser Ile Gln Xaa Ala Ile Gln Xaa Ala Ile Lys Glu Xaa Gly 370 375 380 Phe Glu Ile Ile Arg Asp Tyr Ala Ile Glu Ile Ser Ile Leu Ile Ala 385 390 395 400 Gly Ala Ala Ser Gly Val Gln Gly Gly Xaa Leu Xaa Gln Phe Cys Phe 405 410 415 Leu Ala Ala Trp Ile Leu Phe Phe Asp Xaa Ile Leu Leu Phe Thr Phe 420 425 430 Tyr Ser Ala Ile Leu Ala Ile Lys Leu Glu Ile Asn Arg Ile Lys Arg 435 440 445 His Val Ile Ile Arg Xaa Ala Leu Glu Glu Asp Gly Val Ser Xaa Ser 450 455 460 Val Ala Glu Lys Val Ala Lys Ser Glu Xaa Asp Trp Xaa Xaa Xaa Lys 465 470 475 480 Gly Ser Asp Ser Xaa Phe Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 485 490 495 Lys Ser Xaa Ala Ile Xaa Ile Phe Lys Val Leu Met Ile Leu Gly Phe 500 505 510 Val Leu Ile Asn Leu Val Asn Leu Thr Ala Ile Pro Phe Arg Xaa Ala 515 520 525 Xaa Ser Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Leu Xaa Gly Val 530 535 540 Xaa Ser Pro Xaa Xaa Val Asp Pro Phe Lys Val Ala Xaa Asn Leu Leu 545 550 555 560 Asp Ala Ile Xaa Ala Ala Ala Lys Ser Asn Xaa Arg Glu Thr Leu Val 565 570 575 Thr Val Val Thr Pro Ile Lys Tyr Glu Leu Glu Tyr Pro Ser Ile His 580 585 590 Tyr Xaa Glu Xaa Xaa Xaa Ser Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 595 600 605 Leu Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Gly Xaa Gly Gly Xaa Met Leu 610 615 620 Gly Ser Val Ser Lys Ser Ile Glu Asp Pro Val Ile Ser Lys Trp Ile 625 630 635 640 Val Ile Ala Leu Ala Leu Ser Ile Ala Leu Asn Val Tyr Leu Phe Asn 645 650 655 Ala Ala Arg Trp Xaa Ile Lys Asp Pro Asn Val Xaa Xaa Xaa Xaa Xaa 660 665 670 Glu Val Xaa Glu Leu Xaa Xaa Xaa Gln Xaa Xaa Asn Xaa Xaa Xaa Ser 675 680 685 Ala Xaa Leu Xaa Xaa Xaa Xaa Xaa Ile Xaa Xaa Thr Xaa Xaa Xaa Xaa 690 695 700 Xaa Xaa Xaa Xaa Xaa Thr Pro Ala Xaa Thr Asp Asp Glu Xaa Xaa Ser 705 710 715 720 Xaa Xaa Ser Xaa Xaa Xaa Xaa Val Xaa Lys Ile Xaa Xaa Xaa Xaa Xaa 725 730 735 Xaa Ile Arg Ser Leu Glu Glu Leu Glu Ala Leu Leu Ala Ala Lys Lys 740 745 750 Thr Lys Xaa Leu Xaa Asp Glu Glu Val Val Xaa Leu Ser Leu Xaa Gly 755 760 765 Lys Leu Pro Leu Tyr Ala Leu Glu Lys Thr Leu Gly Xaa Xaa Xaa Xaa 770 775 780 Xaa Xaa Xaa Xaa Xaa Asp Phe Thr Arg Ala Val Lys Ile Arg Arg Ser 785 790 795 800 Ile Ile Ser Arg Xaa Xaa Ala Thr Ser Ala Leu Thr Xaa Ser Leu Glu 805 810 815 Ser Ser Lys Leu Pro Tyr Lys Asn Tyr Asn Tyr Asp Arg Val Phe Gly 820 825 830 Ala Cys Cys Glu Asn Val Ile Gly Tyr Met Pro Leu Pro Val Gly Val 835 840 845 Ala Gly Pro Leu Val Ile Asp Gly Gln Ser Tyr His Ile Pro Met Ala 850 855 860 Thr Thr Glu Gly Val Leu Val Ala Ser Ala Ser Arg Gly Cys Lys Ala 865 870 875 880 Ile Asn Ala Gly Gly Gly Ala Val Thr Val Leu Thr Ala Asp Gly Met 885 890 895 Thr Arg Gly Pro Cys Val Xaa Phe Pro Thr Leu Xaa Arg Ala Gly Ala 900 905 910 Ala Lys Ile Trp Leu Asp Ser Glu Glu Gly Gln Xaa Ser Met Lys Lys 915 920 925 Ala Phe Asn Ser Thr Ser Arg Phe Ala Arg Leu Gln His Ile Lys Thr 930 935 940 Ala Leu Ala Gly Thr Leu Leu Phe Ile Arg Phe Lys Thr Thr Thr Gly 945 950 955 960 Asp Ala Met Gly Met Asn Met Ile Ser Lys Gly Val Glu His Ala Leu 965 970 975 Ser Val Met Val Xaa Glu Tyr Gly Phe Glu Asp Met Glu Ile Val Ser 980 985 990 Val Ser Gly Asn Tyr Cys Thr Asp Lys Lys Pro Ala Ala Ile Asn Trp 995 1000 1005 Ile Asp Gly Arg Gly Lys Ser Val Val Ala Glu Ala Thr Ile Pro 1010 1015 1020 Gly Asp Val Val Lys Ser Val Leu Lys Ser Asp Val Asp Ala Leu 1025 1030 1035 Val Glu Leu Asn Ile Ser Lys Asn Leu Ile Gly Ser Ala Met Ala 1040 1045 1050 Gly Ser Val Gly Gly Phe Asn Ala His Ala Ala Asn Ile Val Thr 1055 1060 1065 Ala Ile Phe Leu Ala Thr Gly Gln Asp Pro Ala Gln Asn Val Glu 1070 1075 1080 Ser Ser Asn Cys Ile Thr Leu Met Lys Asn Val Asp Gly Asn Leu 1085 1090 1095 Gln Ile Ser Val Ser Met Pro Ser Ile Glu Val Gly Thr Ile Gly 1100 1105 1110 Gly Gly Thr Ile Leu Glu Pro Gln Gly Ala Met Leu Asp Leu Leu 1115 1120 1125 Gly Val Arg Gly Pro His Pro Thr Asn Pro Gly Asp Asn Ala Arg 1130 1135 1140 Gln Leu Ala Arg Ile Ile Ala Ala Ala Val Leu Ala Gly Glu Leu 1145 1150 1155 Ser Leu Cys Ser Ala Leu Ala Ala Gly His Leu Val Gln Ala His 1160 1165 1170 Met Thr His Asn Arg Ser Ala Ala Pro Thr Arg Ser Xaa Thr Pro 1175 1180 1185 Xaa Xaa Ala Xaa Xaa Xaa Xaa Xaa Xaa Xaa Thr Xaa Ile Xaa Ser 1190 1195 1200 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 1205 1210 1215 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 1220 1225 1230 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 1235 1240 1245 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 1250 1255 1260 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 1265 1270 1275 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 1280 1285 11734DNAArtificial SequenceDescription of Artificial Sequence Synthetic oligonucleotide 117ctctagacac aaaaatgtcg caaccccaga acgt 3411827DNAArtificial SequenceDescription of Artificial Sequence Synthetic oligonucleotide 118cacgcgtcta ctgcttgatc tcgtact 2711936DNAArtificial SequenceDescription of Artificial Sequence Synthetic oligonucleotide 119cgctagccac aaaaatggac tacatcattt cggcgc 3612027DNAArtificial SequenceDescription of Artificial Sequence Synthetic oligonucleotide 120cacgcgtcta atgggtccag ggaccga 2712135DNAArtificial SequenceDescription of Artificial Sequence Synthetic oligonucleotide 121ctctagacac aaaaatgacc acctattcgg ctccg 3512229DNAArtificial SequenceDescription of Artificial Sequence Synthetic oligonucleotide 122cggcgcgccc tacttgaacc ccttctcga 2912336DNAArtificial SequenceDescription of Artificial Sequence Synthetic oligonucleotide 123ctctagacac aaaaatgatc caccaggcct ccacca 3612427DNAArtificial SequenceDescription of Artificial Sequence Synthetic oligonucleotide 124cacgcgtcta cttgctgttc ttcagag 2712534DNAArtificial SequenceDescription of Artificial Sequence Synthetic oligonucleotide 125ctctagacac aaaaatgacg acgtcttaca gcga 3412627DNAArtificial SequenceDescription of Artificial Sequence Synthetic oligonucleotide 126cacgcgtcta cttgatccac cgccgaa 2712735DNAArtificial SequenceDescription of Artificial Sequence Synthetic oligonucleotide 127ctctagacac aaaaatgtcc aaggcgaaat tcgaa 3512827DNAArtificial SequenceDescription of Artificial Sequence Synthetic oligonucleotide 128cacgcgtcta cttctgtcgc ttgtaaa 2712934DNAArtificial SequenceDescription of Artificial Sequence Synthetic oligonucleotide 129ctctagacac aaaaatgtta cgactacgaa ccat 3413026DNAArtificial SequenceDescription of Artificial Sequence Synthetic oligonucleotide 130cacgcgtcta gtcgtaaccc gcacat 2613134DNAArtificial SequenceDescription of Artificial Sequence Synthetic oligonucleotide 131cgctagccac aaaaatgccg cagcaagcaa tgga 3413227DNAArtificial SequenceDescription of Artificial Sequence Synthetic oligonucleotide 132cacgcgttta accatgcagc cgctcaa 2713334DNAArtificial SequenceDescription of Artificial Sequence Synthetic oligonucleotide 133ctctagacac aaaaatgttc cgaacccgag ttac 3413427DNAArtificial SequenceDescription of Artificial Sequence Synthetic oligonucleotide 134cacgcgttta agggttctgc ttgacaa 2713534DNAArtificial SequenceDescription of Artificial Sequence Synthetic oligonucleotide 135ctctagacac aaaaatgaca caaacgcaca atct 3413627DNAArtificial SequenceDescription of Artificial Sequence Synthetic oligonucleotide 136cacgcgttta catcttgtac gcagggt 2713734DNAArtificial SequenceDescription of Artificial Sequence Synthetic oligonucleotide 137ctctagacac aaaaatggaa gccaaccccg aagt 3413827DNAArtificial SequenceDescription of Artificial Sequence Synthetic oligonucleotide 138cacgcgttca tttcagaagg tacttct 2713932DNAArtificial SequenceDescription of Artificial Sequence Synthetic oligonucleotide 139ctctagacac aaaaatgcga ctcactctgc cc 3214029DNAArtificial SequenceDescription of Artificial Sequence Synthetic oligonucleotide 140cacgcgtcta ctcgacagaa gagaccttc 2914137DNAArtificial SequenceDescription of Artificial Sequence Synthetic oligonucleotide 141ttctagaccc aaaaatgtct gccaacgaga acatctc 3714229DNAArtificial SequenceDescription of Artificial Sequence Synthetic oligonucleotide 142aacgcgtcta tgatcgagtc ttggccttg 2914337DNAArtificial SequenceDescription of Artificial Sequence Synthetic oligonucleotide 143ttctagacac aaaaatgtca gcgaaatcca ttcacga 3714426DNAArtificial SequenceDescription of Artificial Sequence Synthetic oligonucleotide 144cacgcgttaa actccgagag gagtgg 2614534DNAArtificial SequenceDescription of Artificial Sequence Synthetic oligonucleotide 145ctctagacac aaaagtggtt aaagctgtcg ttgc 3414627DNAArtificial SequenceDescription of Artificial Sequence Synthetic oligonucleotide 146cacgcgttta cttggcagga ggagggt 2714734DNAArtificial SequenceDescription of Artificial Sequence Synthetic oligonucleotide 147ctctagacac aaaaatgact ggcaccttac ccaa 3414827DNAArtificial SequenceDescription of Artificial Sequence Synthetic oligonucleotide 148cacgcgttca cgaggagccc ttggtga 2714934DNAArtificial SequenceDescription of Artificial Sequence Synthetic oligonucleotide 149ctctagacac aaaaatgact gacacttcaa acat 3415027DNAArtificial SequenceDescription of Artificial Sequence Synthetic oligonucleotide 150cacgcgttta agcatcgtaa gtggaag 2715134DNAArtificial SequenceDescription of Artificial Sequence Synthetic oligonucleotide 151ctctagacac aaaaatgctc aaccttagaa ccgc 3415227DNAArtificial SequenceDescription of Artificial Sequence Synthetic oligonucleotide 152cacgcgtcta cttgagtcgc ttgataa 27

* * * * *

References


uspto.report is an independent third-party trademark research tool that is not affiliated, endorsed, or sponsored by the United States Patent and Trademark Office (USPTO) or any other governmental organization. The information provided by uspto.report is based on publicly available data at the time of writing and is intended for informational purposes only.

While we strive to provide accurate and up-to-date information, we do not guarantee the accuracy, completeness, reliability, or suitability of the information displayed on this site. The use of this site is at your own risk. Any reliance you place on such information is therefore strictly at your own risk.

All official trademark data, including owner information, should be verified by visiting the official USPTO website at www.uspto.gov. This site is not intended to replace professional legal advice and should not be used as a substitute for consulting with a legal professional who is knowledgeable about trademark law.

© 2024 USPTO.report | Privacy Policy | Resources | RSS Feed of Trademarks | Trademark Filings Twitter Feed