Yeast Production Culture For The Production Of Butanol

NAGARAJAN; VASANTHA ;   et al.

Patent Application Summary

U.S. patent application number 13/162165 was filed with the patent office on 2012-06-14 for yeast production culture for the production of butanol. This patent application is currently assigned to Butamax(TM) Advanced Biofuels LLC. Invention is credited to Michael G. Bramucci, VASANTHA NAGARAJAN.

Application Number20120149080 13/162165
Document ID /
Family ID44627329
Filed Date2012-06-14

United States Patent Application 20120149080
Kind Code A1
NAGARAJAN; VASANTHA ;   et al. June 14, 2012

YEAST PRODUCTION CULTURE FOR THE PRODUCTION OF BUTANOL

Abstract

High cell density cultures of yeast were found to have higher tolerance for butanol in the medium. The high cell density yeast cultures had greater survival and higher glucose utilization than cultures with low cell densities. Production of butanol using yeast in high cell density cultures is thus beneficial for improving butanol production.


Inventors: NAGARAJAN; VASANTHA; (Wilmington, DE) ; Bramucci; Michael G.; (Boothwyn, PA)
Assignee: Butamax(TM) Advanced Biofuels LLC
Wilmington
DE

Family ID: 44627329
Appl. No.: 13/162165
Filed: June 16, 2011

Related U.S. Patent Documents

Application Number Filing Date Patent Number
61355733 Jun 17, 2010

Current U.S. Class: 435/160 ; 435/254.2; 435/255.21; 435/255.4; 435/255.5; 435/255.7
Current CPC Class: C12N 15/01 20130101; C12P 7/16 20130101; C12N 1/16 20130101; C12R 1/645 20130101; C12R 1/865 20130101; Y02E 50/10 20130101
Class at Publication: 435/160 ; 435/255.7; 435/254.2; 435/255.21; 435/255.4; 435/255.5
International Class: C12P 7/16 20060101 C12P007/16; C12N 1/19 20060101 C12N001/19; C12N 1/16 20060101 C12N001/16

Claims



1. A production culture for the fermentative production of butanol comprising: (i) a medium comprising a fermentable carbon substrate; (ii) a culture of yeast cells having a glucose utilization rate of at least about 0.5 gram per gram of dry cell weight per hour; and (iii) butanol at a concentration of at least about 2% (w/v) in the medium.

2. The production culture of claim 1, wherein the cell density is at least about 2.4 gram dry cell weight per liter.

3. The production culture of claim 2, wherein the cell density is at least about 7 gram dry cell weight per liter.

4. The production culture of claim 1, wherein the butanol producing yeast cells have a glucose utilization rate of at least about 1 gram per gram of dry cell weight per hour.

5. The production culture of claim 4, wherein the butanol producing yeast cells are produced by a method comprising: (i) mutagenesis; (ii) exposure to 3% isobutanol; and (iii) repeated freeze-thaw cycles.

6. The production culture of claim 1, wherein the yeast is crabtree positive.

7. The production culture of claim 1, wherein the yeast is a member of a genus selected from the group consisting of Saccharomyces, Schizosaccharomyces, Hansenula, Candida, Kluyveromyces, Yarrowia, Issatchenkia and Pichia.

8. The production culture of claim 1, wherein the yeast comprises an isobutanol pathway or a 1-butanol pathway.

9. The production culture of claim 1, wherein the culture is maintained at the following conditions for about 1 hour to about 200 hours: (i) temperature that is from about 20.degree. C. to about 45.degree. C.; (ii) dissolved oxygen that is maintained at microaerobic conditions to above 3%; (iii) carbon substrates in excess provided by liquefied biomass; (iv) pH that is from about 3 to about 7.5; and (v) butanol removal selected from vacuum application and liquid-liquid extraction.

10. A method for the production of butanol, comprising: (i) preparing the production culture of claim 1, wherein the yeast comprises an isobutanol pathway or a 1-butanol pathway; and (ii) fermenting the yeast under conditions wherein butanol is produced.

11. A production culture for the fermentative production of butanol comprising: (i) a medium comprising a fermentable carbon substrate; (ii) a culture of yeast cells having a cell density of at least about 2.4 gram dry cell weight per liter; and (iii) butanol at a concentration of at least about 2% (w/v) in the medium.

12. The production culture of claim 11, wherein the culture has a glucose utilization rate of at least about 0.5 gram per gram of dry cell weight per hour.

13. The production culture of claim 11, wherein the culture has a glucose utilization rate of at least about 1 gram per gram of dry cell weight per hour

14. The production culture of claim 11, wherein the cell density is at least about 7 grams dry cell weight per liter.

15. The production culture of claim 11, wherein the butanol concentration is at least about 2.5% and the culture has a glucose utilization rate of at least about 0.4 gram per gram of dry cell weight per hour.

16. The production culture of claim 11, wherein the yeast is a member of a genus selected from the group consisting of Saccharomyces, Schizosaccharomyces, Hansenula, Candida, Kluyveromyces, Yarrowia, Issatchenkia and Pichia.

17. The production culture of claim 11, wherein the yeast comprises an isobutanol pathway or 1-butanol pathway.

18. The production culture of claim 11, wherein the culture is maintained at the following conditions for about 1 hour to about 200 hours: (i) temperature that is from about 20.degree. C. to about 37.degree. C.; (ii) dissolved oxygen that is maintained at microaerobic conditions to above 3%; (iii) carbon substrates in excess provided by liquefied biomass; (iv) pH that is from about 3 to about 7.5; and (v) butanol removal selected from vacuum application and liquid-liquid extraction.

19. A method for the production of butanol, comprising: a. preparing the production culture of claim 11, wherein the yeast comprises a butanol biosynthetic pathway selected from the group consisting of an isobutanol pathway, a 1-butanol pathway and a 2-butanol pathway; and b. fermenting the yeast under conditions wherein butanol is produced.

20. A method for increasing the tolerance of a production culture for the fermentative production of butanol, comprising: (i) providing a medium comprising a fermentable carbon substrate; (ii) providing a culture of butanol producing yeast cells having a glucose utilization rate of at least about 0.5 gram per gram of dry cell weight per hour; and (iii) contacting the yeast culture with the fermentable carbon substrate whereby the glucose utilization rate is maintained over a suitable period of time and whereby butanol is produced.

21. A production culture for the fermentative production of butanol comprising: (i) a medium comprising a fermentable carbon substrate; (ii) a culture of yeast cells having a glucose utilization rate of at least about 2.4 grams per gram of dry cell weight per hour; and (iii) butanol at a concentration of at least about 1% (w/v) in the medium.

22. The production culture of claim 21, wherein the cell density is at least about 2.7 grams dry cell weight per liter.
Description



CROSS-REFERENCE TO RELATED APPLICATIONS

[0001] This application claims the benefit of priority of U.S. Provisional Application No. 61/355,733, filed Jun. 17, 2010, which is incorporated herein by reference in its entirety.

REFERENCE TO SEQUENCE LISTING SUBMITTED ELECTRONICALLY

[0002] The content of the electronically submitted sequence listing Name: 20110616_CL4590USNA_SEQLIST.txt; Size: 244,365 bytes; and Date of Creation: Jun. 16, 2011 is incorporated herein by reference in its entirety.

FIELD OF THE INVENTION

[0003] The invention relates to the field of industrial microbiology and the production of butanol. Specifically, production cultures of yeast organisms tolerant to high concentrations of butanol have been developed.

BACKGROUND OF THE INVENTION

[0004] Butanol is an important industrial chemical, useful as a fuel additive, as a feedstock chemical in the plastics industry, and as a foodgrade extractant in the food and flavor industry. Each year 10 to 12 billion pounds of butanol are produced by petrochemical means and the need for this commodity chemical will likely increase.

[0005] Butanol may be made through chemical synthesis or by fermentation. Isobutanol is produced biologically as a by-product of yeast fermentation. It is a component of "fusel oil" that forms as a result of incomplete metabolism of amino acids by this group of fungi. Isobutanol is specifically produced from catabolism of L-valine and the yield is typically very low. Additionally, recombinant microbial production hosts, expressing a 1-butanol biosynthetic pathway (Donaldson et al., U.S. Patent Application Publication No. US20080182308A1), a 2-butanol biosynthetic pathway (Donaldson et al., U.S. Patent Publication Nos. US 20070259410A1 and US 20070292927), and an isobutanol biosynthetic pathway (Maggio-Hall et al., U.S. Patent Publication No. US 20070092957) have been described.

[0006] Biological production of butanols is generally limited by butanol toxicity to the host microorganism used in fermentation for butanol production. Yeasts are typically sensitive to butanol in the medium. Genetic engineering approaches have been used to alter gene expression to increase yeast cell tolerance to butanol. Another approach to improving production is to improve the fermentation process. U.S. Pat. No. 4,765,992 discloses addition of microorganism cell walls before or during fermentation to adsorb substances toxic to yeast which cause cessation of fermentation during alcoholic fermentation. U.S. Pat. No. 4,414,329 discloses using high mineral salts media with at least about 60 to 160 grams per liter of cells in a method to produce single cell protein. U.S. Pat. No. 4,284,724 discloses achieving a 6% to about 20% (dry cell weight) density fermentation by removing fermentation broth, filtering, and recycling the yeast cells to the fermentor.

[0007] There remains a need to develop butanol production cultures of yeast in which effects due to butanol sensitivity are reduced, thereby allowing increased butanol yield.

SUMMARY OF THE INVENTION

[0008] The invention provides cultures for production of butanol that have increased tolerance to butanol due to the presence of a high density of yeast cells.

[0009] Provided herein is a production culture for the fermentative production of butanol comprising: a) a medium comprising a suitable carbon substrate for the metabolism of yeast; b) a culture of butanol producing yeast cells having a glucose utilization rate of at least about 0.5 gram per gram of dry cell weight per hour (g/gdcw/h); and c) butanol at a concentration of at least about 2% (weight/volume) the medium.

[0010] In another embodiment the invention provides a production culture for the fermentative production of butanol comprising: a) a medium comprising a suitable carbon substrate for the metabolism of yeast; b) a culture of butanol producing yeast cells having a cell density of at least about 2.4 gram dry cell weight per liter (gdcw/L); and c) butanol at a concentration of at least about 2% (weight/volume) in the medium.

[0011] In yet another embodiment the invention provides a method for the production of butanol comprising preparing a production culture of the invention wherein the yeast comprises a butanol biosynthetic pathway selected from the group consisting of isobutanol pathway and 1-butanol pathway, and fermenting the yeast under conditions wherein butanol is produced.

[0012] Provided herein is a production culture for the fermentative production of butanol comprising: a) a medium comprising a suitable carbon substrate for the metabolism of yeast; b) a culture of butanol producing yeast cells having a glucose utilization rate of at least about 0.5 gram per gram of dry cell weight per hour; and c) butanol at a concentration of at least about 2% (w/v) in the medium.

[0013] In embodiments, the cell density is at least about 2.4 gram dry cell weight per liter. In embodiments, the cell density is at least about 7 gram dry cell weight per liter. In embodiments, the butanol producing yeast cells have a glucose utilization rate of at least about 1 gram per gram of dry cell weight per hour.

[0014] In embodiments, the butanol producing yeast cells were produced by a method comprising: a) mutagenesis; and b) exposure to 3% isobutanol; and c) repeated freeze-thaw cycles. In embodiments, the freeze-thaw cycle is repeated more than twice. In embodiments, the cells are produced by a method comprising at least 5 freeze-thaw cycles.

[0015] In embodiments, the butanol producing yeast cells were produced by a method comprising: a) growth of the butanol producing yeast cells in a medium containing ethanol; b) concentrating the cells to a density in the range of 30 gdcw/L; c) exposure to 3% isobutanol; and d) repeated freeze-thaw cycles. In embodiments, the freeze-thaw cycle is repeated more than twice. In embodiments, the cells are produced by a method comprising at least 5 freeze-thaw cycles.

[0016] In embodiments, the butanol producing yeast cells were produced by a method comprising: a) growth of the butanol producing yeast cells in a medium containing ethanol; b) serially transferred to a medium containing 0.1% to 2.0% butanol for growth for a minimum of 10 h and maximum of 72 h; c) gradually increasing the butanol concentration in the medium in subsequent passages for increased growth rates of butanol producing cells; and d) pooling the cells with fast growth rate e) exposure to 3% isobutanol; and/or repeated freeze-thaw cycles.

[0017] In embodiments, the yeast is crabtree positive and, in embodiments, the yeast is crabtree negative. In embodiments, the yeast is a member of a genus selected from the group consisting of Saccharomyces, Schizosaccharomyces, Hansenula, Candida, Kluyveromyces, Yarrowia, Issatchenkia and Pichia. In embodiments, the yeast comprises a butanol biosynthetic pathway selected from the group consisting of isobutanol pathway and 1-butanol pathway.

[0018] In embodiments, the culture is maintained at the following conditions for between 1 hour to about 200 hours: a) temperature that is between about 20.degree. C. and 37.degree. C.; b) dissolved oxygen that is maintained between microaerobic conditions to above 3%; c) carbon substrates in excess provided by liquefied biomass; d) pH that is between about 3 and 7.5; and d) butanol removal selected from vacuum application and liquid-liquid extraction.

[0019] In embodiments, the culture has a glucose utilization rate of at least about 0.5 gram per gram of dry cell weight per hour. In embodiments, the cell density is at least about 7 grams dry cell weight per liter. In embodiments, the butanol concentration is at least about 2.5% and the culture has a glucose utilization rate of at least about 0.4 gram per gram of dry cell weight per hour. In embodiments, the yeast comprises a butanol biosynthetic pathway selected from the group consisting of isobutanol pathway, 1-butanol pathway, and 2-butanol pathway.

[0020] In embodiments, the culture comprises PNY0602 or PNY0614. In embodiments, the culture comprises PNY0602 or PNY0614 further comprising a butanol biosynthetic pathway. In embodiments, the butanol biosynthetic pathway is an isobutanol biosynthetic pathway.

BRIEF DESCRIPTION OF THE FIGURES AND SEQUENCE DESCRIPTIONS

[0021] The various embodiments of the invention can be more fully understood from the following detailed description, the figures, and the accompanying sequence descriptions, which form a part of this application.

[0022] FIG. 1 shows four different 2-butanol biosynthetic pathways.

[0023] FIG. 2 shows three different isobutanol biosynthetic pathways.

[0024] FIG. 3 a pathway for 1-butanol biosynthesis.

[0025] FIG. 4 shows a graph of glucose utilization in high cell density yeast cultures in the presence of isobutanol.

[0026] The following biological deposits have been made under the terms of the Budapest Treaty on the International Recognition of the Deposit of Microorganisms for the Purposes of Patent Procedure:

TABLE-US-00001 International Depositor Identification Depository Reference Designation Date of Deposit Saccharomyces cerevisiae PNY0602 Saccharomyces cerevisiae PNY0614

[0027] The following sequences and the sequence listing provided herewith and incorporated by reference herein conform with 37 C.F.R. 1.821-1.825 ("Requirements for Patent Applications Containing Nucleotide Sequences and/or Amino Acid Sequence Disclosures--the Sequence Rules") and are consistent with World Intellectual Property Organization (WIPO) Standard ST.25 (2009) and the sequence listing requirements of the EPO and PCT (Rules 5.2 and 49.5(a-bis), and Section 208 and Annex C of the Administrative Instructions). The symbols and format used for nucleotide and amino acid sequence data comply with the rules set forth in 37 C.F.R. .sctn.1.822.

TABLE-US-00002 TABLE 1 SEQ ID Numbers of Expression Coding Regions and Proteins SEQ SEQ ID NO: ID NO: Nucleic Amino Description acid Acid Klebsiella pneumoniae budB (acetolactate 1 2 synthase) Bacillus subtilis alsS 3 4 (acetolactate synthase) Lactococcus lactis als 5 6 (acetolactate synthase) Als Staphylococcus aureus 7 8 Als Listeria monocytogenes 9 10 Als Streptococcus mutans 11 12 Als Streptococcus thermophilus 13 14 Als Vibrio angustum 15 16 Als Bacillus cereus 17 18 budA, acetolactate decarboxylase from Klebsiella 19 20 pneumoniae ATCC 25955 alsD, acetolactate decarboxylase from Bacillus 21 22 subtilis budA, acetolactate decarboxylase from Klebsiella 23 24 terrigena budC, butanediol dehydrogenase from Klebsiella 25 26 pneumoniae IAM1063 butanediol dehydrogenase from Bacillus cereus 27 28 butB, butanediol dehydrogenase from Lactococcus 29 30 lactis BDH1 butanediol dehydrogenase from 54 55 Saccharomyces cerevisiae RdhtA, B12-indep diol dehydratase from Roseburia 31 32 inulinivorans RdhtB, B12-indep diol dehydratase reactivase from 33 34 Roseburia inulinivorans sadB, butanol dehydrogenase from Achromobacter 35 36 xylosoxidans S. cerevisiae ILV5 37 38 (acetohydroxy acid reductoisomerase) Vibrio cholerae ketol-acid reductoisomerase 39 40 (KARI) Pseudomonas aeruginosa ketol-acid 41 42 reductoisomerase Pseudomonas fluorescens ketol-acid 43 44 reductoisomerase Pf5.IlvC-Z4B8 mutant Pseudomonas fluorescens 45 46 acetohydroxy acid reductoisomerase (codon optimized for S. cerevisiae expression) Lactococcus lactis ilvC 58 59 S. cerevisiae ILV3 47 48 (Dihydroxyacid dehydratase; DHAD) Streptococcus mutans ilvD (DHAD) 49 50 Lactococcus lactis kivD (branched-chain .alpha.-keto 51 52 acid decarboxylase) L. lactis kivD codon optimized for S. cerevisiae 53 52* expression Equus caballus alcohol dehydrogenase codon 56 57 optimized for S. cerevisiae expression Saccharomyces cerevisiae ALD6 -- 60 Saccharomyces cerevisiae YMR226C -- 63 Beijerinkia indica alcohol dehydrogenase -- 74 Anaerostipes caccae KARI variant K9D3 -- 61 Anaerostipes caccae KARI variant K9G9 -- 62 Saccharomyces cerevisiae AFT1 64 65 Saccharomyces cerevisiae AFT2 66 67 Saccharomyces cerevisiae FRA2 68 69 Saccharomyces cerevisiae GRX3 70 71 Saccharomyces cerevisiae CCC1 72 73 *The same amino acid sequence is encoded by SEQ ID NOs: 51 and 53

SEQ ID NOs: 55-59 are hybrid promoter sequences.

[0028] The invention can be more fully understood from the following detailed description which forms a part of this application.

DETAILED DESCRIPTION OF THE INVENTION

[0029] Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. In case of conflict, the present application including the definitions will control. Unless otherwise required by context, singular terms shall include pluralities and plural terms shall include the singular. All publications, patents and other references mentioned herein are incorporated by reference in their entireties for all purposes as if each individual publication or patent application were specifically and individually indicated to be incorporated by reference, unless only specific sections of patents or patent publications are indicated to be incorporated by reference.

[0030] The present invention relates to yeast production cultures that have improved fermentation to produce butanol due to reduced sensitivity to butanol that is present in the culture medium. Butanol includes isobutanol and 1-butanol. In addition, the invention relates to methods of producing butanol using the present cultures. Butanol is useful for replacing fossil fuels, in addition to applications as solvents and/or extractants

[0031] The following definitions and abbreviations are to be use for the interpretation of the claims and the specification.

[0032] As used herein, the terms "comprises," "comprising," "includes," "including," "has," "having," "contains" or "containing," or any other variation thereof, are intended to cover a non-exclusive inclusion. For example, a composition, a mixture, process, method, article, or apparatus that comprises a list of elements is not necessarily limited to only those elements but may include other elements not expressly listed or inherent to such composition, mixture, process, method, article, or apparatus. Further, unless expressly stated to the contrary, "or" refers to an inclusive or and not to an exclusive or. For example, a condition A or B is satisfied by any one of the following: A is true (or present) and B is false (or not present), A is false (or not present) and B is true (or present), and both A and B are true (or present).

[0033] Also, the indefinite articles "a" and "an" preceding an element or component of the invention are intended to be nonrestrictive regarding the number of instances (i.e. occurrences) of the element or component. Therefore "a" or "an" should be read to include one or at least one, and the singular word form of the element or component also includes the plural unless the number is obviously meant to be singular.

[0034] The term "invention" or "present invention" as used herein is a non-limiting term and is not intended to refer to any single embodiment of the particular invention but encompasses all possible embodiments as described in the specification and the claims.

[0035] As used herein, the term "about" modifying the quantity of an ingredient or reactant of the invention employed refers to variation in the numerical quantity that can occur, for example, through typical measuring and liquid handling procedures used for making concentrates or use solutions in the real world; through inadvertent error in these procedures; through differences in the manufacture, source, or purity of the ingredients employed to make the compositions or carry out the methods; and the like. The term "about" also encompasses amounts that differ due to different equilibrium conditions for a composition resulting from a particular initial mixture. Whether or not modified by the term "about", the claims include equivalents to the quantities. In one embodiment, the term "about" means within 10% of the reported numerical value, preferably within 5% of the reported numerical value.

[0036] The term "butanol" as used herein, refers to 1-butanol, isobutanol, or mixtures thereof.

[0037] The term "isobutanol biosynthetic pathway" or "isobutanol pathway" refers to an enzyme pathway to produce isobutanol from pyruvate.

[0038] The term "1-butanol biosynthetic pathway" or "1-butanol pathway" refers to an enzyme pathway to produce 1-butanol from pyruvate.

[0039] The term "low cell density" refers to a cell concentration of less than about 6.times.10.sup.5 cells/ml. For example, a culture with an OD.sub.600 of 0.05 is a low cell density culture, based on the relationship that 1.0 OD.sub.600 corresponds to 10.sup.7 cell/ml.

[0040] The term "high cell density" refers to a cell concentration of greater than about 5.times.10.sup.7 cells/ml. For example, a culture with an OD.sub.600 of 5.0 and about 2.4 grams dry cell weight per liter (gdcw/L) is a high cell density culture, based on the relationship that 1.0 OD.sub.600 corresponds to 10.sup.7 cell/ml and to 0.4 gdcw/L.

[0041] The terms "glucose utilization" and "glucose consumption" refer to the amount of glucose that a cell culture metabolizes under conditions of excess glucose. Glucose utilization rate is measured in a culture of defined cell density in a defined concentration of butanol in culture conditions as described in Example 3 herein with glucose as the carbon substrate. When alternative carbon substrates are present in media used for production, the glucose utilization rate cannot be measured in that culture, but must be measured in a separate culture with glucose as the carbon substrate.

[0042] The term "carbon substrate" or "fermentable carbon substrate" or "suitable carbon substrate" refers to a carbon source capable of being metabolized by cultures of the present invention and particularly include carbon sources selected from the group consisting of monosaccharides, oligosaccharides, and polysaccharides.

[0043] The term "fermentative production" refers to the conversion of a carbon source to a product by the metabolic activity of a microorganism, such as in this case by cultures of the present invention.

[0044] The term "viable" refers to a culture of cells (e.g., yeast cells) capable of multiplying or being cultured under the growth conditions provided herein or in a growth medium containing butanol at a concentration of at least about 2% (w/v). In some embodiments, a viable culture is capable of multiplying or being cultured under such conditions for 24 hours.

[0045] The term "codon-optimized" as it refers to genes or coding regions of nucleic acid molecules for transformation of various hosts, refers to the alteration of codons in the gene or coding regions of the nucleic acid molecules to reflect the typical codon usage of the host organism without altering the polypeptide encoded by the DNA. Such optimization includes replacing at least one, or more than one, or a significant number, of codons with one or more codons that are more frequently used in the genes of that organism.

Improved Butanol Tolerance in High Cell Density Culture

[0046] Disclosed herein is the discovery that when yeast cells are in a high cell density culture, the cells show increased tolerance to butanol as compared to yeast cells in a low cell density culture. High cell density culture for the present purposes refers to a culture with a cell density of at least about 2.4 gram dry cell weight per liter (gdcw/L). Any culture with greater cell density than about 2.4 gdcw/L, such as one with at least about 3.8 gdcw/L or higher, including 24 gdcw/L or higher, is a high cell density culture. High cell density may be greater than about 3 gdcw/L, greater than about 5 gdcw/L, greater than about 7 gdcw/L, greater than about 10 gdcw/L, greater than about 20 gdcw/L, or greater than about 30 gdcw/L. It is envisioned that cell densities as high as about 35-40 gdcw/L may be useful. For yeast, high cell density may also be characterized as a cell concentration of greater than about 5.times.10.sup.7 cells/ml or a measured OD.sub.600 of at least about 5. In some embodiments, the cell density can be any range of cell densities described herein, for example, from about 2.4 gdcw/L to about 40 gdcw/L, from about 2.4 gdcw/L to about 35 gdcw/L, from about 2.4 gdcw/L to about 30 gdcw/L, from about 2.4 gdcw/L to about 20 gdcw/L, from about 2.4 gdcw/L to about 10 gdcw/L, from about 2.4 gcdw/L to about 7 gdcw/L, from about 2.4 gdcw/L to about 5 gdcw/L, from about 3 gdcw/L to about 40 gdcw/L, from about 3 gdcw/L to about 35 gdcw/L, from about 3 gdcw/L to about 30 gdcw/L, from about 3 gdcw/L to about 20 gdcw/L, from about 3 gdcw/L to about 10 gdcw/L, from about 3 gcdw/L to about 7 gdcw/L, from about 3 gdcw/L to about 5 gdcw/L, from about 7 gdcw/L to about 40 gdcw/L, from about 7 gdcw/L to about 35 gdcw/L, from about 7 gdcw/L to about 30 gdcw/L, from about 7 gdcw/L to about 20 gdcw/L, or from about 7 gdcw/L to about 10 gdcw/L. For comparison, low cell density cultures have less than about 6.times.10.sup.5 cells/ml.

[0047] Increased tolerance of yeast to butanol was assessed herein by survival and/or utilization of glucose, which was found to be improved in high cell density cultures with respect to that in cultures of yeast cells at low cell density. Survival was measured by assaying the number of colony forming units (CFU), which is a measure of live cells. Yeast cultures with high cell densities in the presence of 1% butanol were found to have survival rates of at least about 40%, 50%, 60%, 70%, 75%, 80%, and up to 100% as compared to survival rate of a control culture without butanol under conditions assayed herein. Survival rate varies and depends on multiple factors including concentration of butanol, specific butanol isomer, and strain of yeast. In contrast, survival of low cell density cultures, having 100-fold fewer cells, was found in 1% butanol to be at rates up to 13% as compared to survival rate of a control culture without butanol.

[0048] The effect of yeast cell density on tolerance to alcohols was found not to be one response generalized to the presence of alcohols. The effect of cell density on butanol tolerance was different from the effect of cell density in the presence of the most widely produced alcohol, ethanol. In fact, as shown herein (see Example 1), the opposite effect was observed in ethanol, with low cell density cultures showing higher survival than high cell density cultures.

[0049] High cell density cultures having 3.8 gdcw/L and 24 gdcw/L were found herein to utilize glucose in 1.5% isobutanol at a rate of at least about 1 gram per gdcw per hour. In 1% isobutanol the calculated rate was at least about 1.4 gram per gdcw per hour. In 2% isobutanol the rate was at least about 0.5 gram per gdcw per hour, in 2.5% isobutanol the rate was at least about 0.4 gram per gdcw per hour and in 3% isobutanol the rate was at least about 0.2 gram per gdcw per hour.

[0050] Thus, provided herein are butanol production cultures having a high cell density of at least about 2.4 gdcw/L so that higher tolerance to butanol is achieved. High cell density cultures may be at least about 2.4 gdcw/L, 3.8 gdcw/L, or 24 gdcw/L, or higher. Butanol concentration in the production culture is at least about 1%, and may be at least about 1.5%, 2.0%, 2.5%, or 3.0% (w/v). In some embodiments, the butanol concentration is at least about 2.0%. In some embodiments, the butanol concentration is any range of the butanol concentrations disclosed herein, for example, from about 1% to about 3%, from about 1.5% to about 3%, from about 2% to about 3%, from about 1.5% to about 2.5%, or from about 2% to about 3% (w/v). Butanol can be isobutanol or 1-butanol. In the present production culture the glucose utilization rate is at least about 0.2 grams per gram of dry cell weight per hour (g/gdcw/h). The glucose utilization rate may be higher, such as at least about 0.3, 0.4, 0.5, 0.6, 1, 1.5, 2.4, or 3 gram per gdcw per hour. In some embodiments, the butanol concentration is at least about 2% w/v and the glucose utilization rate is at least about 0.5, 0.6, 1, 1.5, 2.4, or 3 gram per gdcw per hour. In some embodiments, the glucose utilization rate is at least about 0.5 gram per gdcw per hour. In some embodiments, the glucose utilization rate can be any range of the glucose utilization rates described herein, for example, from about 0.3 g/gdcw/h to about 3 g/gdcw/h, from about 0.3 g/gdcw/h to about 2.4 g/gdcw/h, from about 0.3 g/gdcw/h to about 1.5 g/gdcw/h, from about 0.3 g/gdcw/h to about 1 g/gdcw/h, from about 0.3 g/gdcw/h to about 0.6 g/gdcw/h, from about 0.3 g/gdcw/h to about 0.5 g/gdcw/h, from about 0.3 g/gdcw/h to about 0.4 g/gdcw/h, from about 0.5 g/gdcw/h to about 3 g/gdcw/h, from about 0.5 g/gdcw/h to about 2.4 g/gdcw/h, from about 0.5 g/gdcw/h to about 1.5 g/gdcw/h, from about 0.5 g/gdcw/h to about 1 g/gdcw/h, from about 0.5 g/gdcw/h to about 0.6 g/gdcw/h, from about 1 g/gdcw/h to about 3 g/gdcw/h, from about 1 g/gdcw/h to about 2.4 g/gdcw/h, or from about 1 g/gdcw/h to about 1.5 g/gdcw/h. In some embodiments, the glucose concentration is at least about 2.0% w/v and the glucose utilization rate is a range from about 0.5 g/gdcw/h to about 3 g/gdcw/h, from about 0.5 g/gdcw/h to about 2.4 g/gdcw/h, from about 0.5 g/gdcw/h to about 1.5 g/gdcw/h, from about 0.5 g/gdcw/h to about 1 g/gdcw/h, from about 0.5 g/gdcw/h to about 0.6 g/gdcw/h, from about 0.5 g/gdcw/h to about 3 g/gdcw/h, from about 0.5 g/gdcw/h to about 2.4 g/gdcw/h, from about 0.5 g/gdcw/h to about 1.5 g/gdcw/h, from about 0.5 g/gdcw/h to about 1 g/gdcw/h, from about 0.5 g/gdcw/h to about 0.6 g/gdcw/h, from about 1 g/gdcw/h to about 3 g/gdcw/h, from about 1 g/gdcw/h to about 2.4 g/gdcw/h, or from about 1 g/gdcw/h to about 1.5 g/gdcw/h. The glucose utilization rate achieved will depend on the concentration of butanol and the specific type of butanol in the culture medium. In general the rate will decrease with increasing butanol concentration. In general, yeast cells have similar response to isobutanol and 1-butanol, with less sensitivity to 2-butanol.

[0051] The glucose utilization rate is typically determined at a temperature of about 30.degree. C. to about 45.degree. C. The glucose utilization rate can also be determined at a temperature of about 30.degree. C. to about 37.degree. C. In some embodiments, cultures provided herein have a glucose utilization rate of greater than about 0.5 gram per gram of dry cell weight per hour at a temperature between about 30.degree. C. to about 45.degree. C. In some embodiments, cultures provided herein have a glucose utilization rate of greater than about 0.5 gram per gram of dry cell weight per hour at a temperature between about 30.degree. C. to about 37.degree. C. In some embodiments, cultures provided herein have a glucose utilization rate of greater than about 0.5 gram per gram of dry cell weight per hour at a temperature between about 30.degree. C. to about 32.degree. C. In some embodiments, the glucose utilization rate is determined for a culture that has been in contact with a medium comprising butanol at a concentration of at least about 2% (w/v) for at least about 6 hours. In some embodiments, cultures provided herein have a glucose utilization rate of greater than about 0.5 gram per gram of dry cell weight per hour for a culture that has been in contact with a medium comprising butanol at a concentration of at least about 2% (w/v) for at least about 6 hours.

[0052] In some embodiments, the cultures described herein are viable cultures.

Preparation of High Cell Density Production Cultures

[0053] The present high cell density butanol production cultures may be prepared by any method that provides a cell density of at least about 2.4 gdcw/L. Cultures with cell densities of, for example, 2.4 gdcw/L, 2.7 gdcw/L, 2.8 gdcw/L, 3.8 gdcw/L or 24 gdcw/L or higher may be prepared as high cell density cultures. In some embodiments, the cell density can be any range of cell densities described herein, for example, from about 2.4 gdcw/L to about 40 gdcw/L, from about 2.4 gdcw/L to about 35 gdcw/L, from about 2.4 gdcw/L to about 30 gdcw/L, from about 2.4 gdcw/L to about 20 gdcw/L, from about 2.4 gdcw/L to about 10 gdcw/L, from about 2.4 gcdw/L to about 7 gdcw/L, from about 2.4 gdcw/L to about 5 gdcw/L, from about 3 gdcw/L to about 40 gdcw/L, from about 3 gdcw/L to about 35 gdcw/L, from about 3 gdcw/L to about 30 gdcw/L, from about 3 gdcw/L to about 20 gdcw/L, from about 3 gdcw/L to about 10 gdcw/L, from about 3 gcdw/L to about 7 gdcw/L, from about 3 gdcw/L to about 5 gdcw/L, from about 7 gdcw/L to about 40 gdcw/L, from about 7 gdcw/L to about 35 gdcw/L, from about 7 gdcw/L to about 30 gdcw/L, from about 7 gdcw/L to about 20 gdcw/L, or from about 7 gdcw/L to about 10 gdcw/L. For example, in one method yeast cells that are capable of producing butanol are grown in an aerated culture which minimizes butanol production. For example, crabtree-positive yeast cells may be grown with high aeration and in low glucose concentration to maximize respiration and cell mass production, as known in the art, rather than butanol production. Typically the glucose concentration is kept to less than about 0.2 g/L. The aerated culture can grow to a high cell density and then be used as the present production culture. Alternatively, yeast cells that are capable of producing butanol may be grown and concentrated to produce a high cell density culture.

[0054] In addition, expression of the butanol biosynthetic pathway may be regulated such that it is minimally active during growth of yeast cells to high cell density. One or more genes of the pathway may be expressed from a promoter that may be controlled by growth conditions or media components to regulate their expression. In this culture cells may grow to high cell density to be used as a production culture or to be used as a seed culture for starting a high cell density production culture.

[0055] Cultures for production of high cell density cultures are grown in media using conditions as described below in sections on Fermentation media and Culture conditions, except that glucose can be limited to about 2 g/L and aerobic conditions can be as described above for maximizing respiratory growth.

Butanol Producing Yeast Cells

[0056] The present high cell density production cultures may be cultures of any yeast that produces butanol. The yeast may be crabtree positive or crabtree negative. Crabtree-positive yeast cells demonstrate the crabtree effect, which is a phenomenon whereby cellular respiration is inhibited when a high concentration of glucose is added to aerobic culture medium. Suitable yeasts include, but are not limited to, Saccharomyces, Schizosaccharomyces, Hansenula, Candida, Kluyveromyces, Yarrowia, Issatchenkia and Pichia. Suitable strains include, but are not limited to, Saccharomyces cerevisiae, Schizosaccharomyces pombe, Kluyveromyces lactis, Kluyveromyces thermotolerans, Candida glabrata, Candida albicans, Pichia stipitis, Issatchenkia orientalis and Yarrowia lipolytica.

[0057] Any of these yeasts that are engineered or otherwise able to produce butanol may be used in the present cultures. A biosynthetic pathway for production of isobutanol, 1-butanol, or 2-butanol is constructed in the yeast cell so that it produces butanol. The pathway genes may include endogenous genes and/or heterologous genes.

[0058] Standard recombinant DNA and molecular cloning techniques for recombinant host cells are well known in the art and are described by Sambrook, J., Fritsch, E. F. and Maniatis, T., Molecular Cloning: A Laboratory Manual, Second Edition, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. (1989) (hereinafter "Maniatis"); and by Silhavy, T. J., Bennan, M. L. and Enquist, L. W., Experiments with Gene Fusions, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. (1984); and by Ausubel, F. M. et al., Current Protocols in Molecular Biology, published by Greene Publishing Assoc. and Wiley-Interscience (1987). Other molecular tools and techniques are known in the art and include splicing by overlapping extension polymerase chain reaction (PCR) (Yu, et al. (2004) Fungal Genet. Biol. 41:973-981), positive selection for mutations at the URA3 locus of Saccharomyces cerevisiae (Boeke, J. D. et al. (1984) Mol. Gen. Genet. 197, 345-346; M A Romanos, et al. Nucleic Acids Res. 1991 January 11; 19(1): 187), the cre-lox site-specific recombination system as well as mutant lox sites and FLP substrate mutations (Sauer, B. (1987) Mol Cell Biol 7: 2087-2096; Senecoff, et al. (1988) Journal of Molecular Biology, Volume 201, Issue 2, Pages 405-421; Albert, et al. (1995) The Plant Journal. Volume 7, Issue 4, pages 649-659), "seamless" gene deletion (Akada, et al. (2006) Yeast; 23(5):399-405), and gap repair methodology (Ma et al., Genetics 58:201-216; 1981).

[0059] Methods for gene expression in yeasts are known in the art as described, for example, in Methods in Enzymology, Volume 194, Guide to Yeast Genetics and Molecular and Cell Biology (Part A, 2004, Christine Guthrie and Gerald R. Fink (Eds.), Elsevier Academic Press, San Diego, Calif.). For example, a chimeric gene for expression may be constructed by operably linking a promoter and terminator to a coding region. Promoters that may used include, for example, constitutive promoters FBA1, TDH3, ADH1, and GPM1, and the inducible promoters GAL1, GAL10, and CUP1. Other yeast promoters include hybrid promoters UAS(PGK1)-FBA1p (SEQ ID NO: 55), UAS(PGK1)-ENO2p (SEQ ID NO: 56), UAS(FBA1)-PDC1p (SEQ ID NO: 57), UAS(PGK1)-PDC1p (SEQ ID NO: 58), and UAS(PGK)-OLE1p (SEQ ID NO: 59). Suitable transcriptional terminators that may be used in a chimeric gene construct for expression include, but are not limited to FBA1t, TDH3t, GPM1t, ERG10t, GAL1t, CYC1t, and ADH1t.

[0060] Suitable promoters, transcriptional terminators, and coding regions may be cloned into E. coli-yeast shuttle vectors, and transformed into yeast cells. These vectors allow for propagation in both E. coli and yeast strains. Typically the vector contains a selectable marker and sequences allowing autonomous replication or chromosomal integration in the desired host. Typically used plasmids in yeast are shuttle vectors pRS423, pRS424, pRS425, and pRS426 (American Type Culture Collection, Rockville, Md.), which contain an E. coli replication origin (e.g., pMB1), a yeast 2.mu. origin of replication, and a marker for nutritional selection. The selection markers for these four vectors are HIS3 (vector pRS423), TRP1 (vector pRS424), LEU2 (vector pRS425) and URA3 (vector pRS426). Construction of expression vectors with a chimeric gene for expression may be performed by either standard molecular cloning techniques in E. coli or by the gap repair recombination method in yeast.

[0061] The gap repair cloning approach takes advantage of the highly efficient homologous recombination in yeast. Typically, a yeast vector DNA is digested (e.g., in its multiple cloning site) to create a "gap" in its sequence. A number of insert DNAs of interest are generated that contain a .gtoreq.21 bp sequence at both the 5' and the 3' ends that sequentially overlap with each other, and with the 5' and 3' terminus of the vector DNA. For example, to construct a yeast expression vector for "Gene X', a yeast promoter and a yeast terminator are selected for the expression cassette. The promoter and terminator are amplified from the yeast genomic DNA, and Gene X is either PCR amplified from its source organism or obtained from a cloning vector comprising Gene X sequence. There is at least a 21 bp overlapping sequence between the 5' end of the linearized vector and the promoter sequence, between the promoter and Gene X, between Gene X and the terminator sequence, and between the terminator and the 3' end of the linearized vector. The "gapped" vector and the insert DNAs are then co-transformed into a yeast strain and plated on the medium containing the appropriate compound mixtures that allow complementation of the nutritional selection markers on the plasmids. The presence of correct insert combinations can be confirmed by PCR mapping using plasmid DNA prepared from the selected cells. The plasmid DNA isolated from yeast (usually low in concentration) can then be transformed into an E. coli strain, e.g. TOP10, followed by mini preps and restriction mapping to further verify the plasmid construct. Finally the construct can be verified by DNA sequence analysis.

[0062] Like the gap repair technique, integration into the yeast genome also takes advantage of the homologous recombination system in yeast. Typically, a cassette containing a coding region plus control elements (promoter and terminator) and auxotrophic marker is PCR-amplified with a high-fidelity DNA polymerase using primers that hybridize to the cassette and contain 40-70 base pairs of sequence homology to the regions 5' and 3' of the genomic area where insertion is desired. The PCR product is then transformed into yeast and plated on medium containing the appropriate compound mixtures that allow selection for the integrated auxotrophic marker. For example, to integrate "Gene X" into chromosomal location "Y", the promoter-coding regionX-terminator construct is PCR amplified from a plasmid DNA construct and joined to an autotrophic marker (such as URA3) by either SOE PCR (Horton et al. (1989) Gene 77:61-68) or by common restriction digests and cloning. The full cassette, containing the promoter-coding regionX-terminator-URA3 region, is PCR amplified with primer sequences that contain 40-70 bp of homology to the regions 5' and 3' of location "Y" on the yeast chromosome. The PCR product is transformed into yeast and selected on growth media lacking uracil. Transformants can be verified either by colony PCR or by direct sequencing of chromosomal DNA.

Butanol Production Pathways

[0063] Suitable pathways for production of butanol are known in the art, and certain suitable pathways are described herein. In some embodiments, the butanol production pathway comprises at least one gene that is heterologous to the host cell. In some embodiments, the butanol biosynthetic pathway comprises more than one gene that is heterologous to the host cell. In some embodiments, the butanol biosynthetic pathway comprises heterologous genes encoding polypeptides corresponding to every step of a biosynthetic pathway. As is known in the art, sequences may be codon optimized for expression in a host cell.

[0064] Genes and polypeptides that can be used for substrate to product conversions described herein as well as methods of identifying such genes and polypeptides, are described herein and/or in the art, for example, for isobutanol, in U.S. Pat. No. 7,851,188.

[0065] Biosynthetic pathways for production of 2-butanol that may be engineered in the present cells are disclosed in the art, for example, in US Patent Application Publications US20070292927A1 and US20070259410A1, which are herein incorporated by reference. A diagram of the disclosed 2-butanol biosynthetic pathways is provided in FIG. 1. For example, the pathway in US20070292927A1 includes the following conversion steps: [0066] pyruvate to acetolactate (FIG. 1 step a) as catalyzed for example by acetolactate synthase; [0067] acetolactate to acetoin (FIG. 1 step b) as catalyzed for example by acetolactate decarboxylase; [0068] acetoin to 2,3-butanediol (FIG. 1 step i) as catalyzed for example by butanediol dehydrogenase; [0069] 2,3-butanediol to 2-butanone (FIG. 1 step j) as catalyzed for example by diol dehydratase or glycerol dehydratase; and [0070] 2-butanone to 2-butanol (FIG. 1 step f) as catalyzed for example by butanol dehydrogenase.

[0071] As disclosed in US Patent Application Publication No. US 2009-0305363, for production of the 2,3-butanediol intermediate in yeast pdc- host cells, acetolactate synthase may be expressed in the cytosol. Acetolactate synthase enzymes, which also may be called acetohydroxy acid synthase, belong to EC 2.2.1.6 (switched from 4.1.3.18 in 2002), are well-known, and they participate in the biosynthetic pathway for the proteinogenic amino acids leucine and valine, as well as in the pathway for fermentative production of 2,3-butanediol from acetoin in a number of organisms. The skilled person will appreciate that polypeptides having acetolactate synthase activity isolated from a variety of sources may be used in the present cells. Acetolactate synthase (Als) enzyme activities that have substrate preference for pyruvate over ketobutyrate are of particular utility, such as those encoded by alsS of Bacillus and budB of Klebsiella (Gollop et al., J. Bacteriol. 172(6):3444-3449 (1990); Holtzclaw et al., J. Bacteriol. 121(3):917-922 (1975)). Als from Bacillus subtilis (DNA: SEQ ID NO:3; protein: SEQ ID NO:4), from Klebsiella pneumoniae (DNA: SEQ ID NO:1; protein: SEQ ID NO:2), and from Lactococcus lactis (DNA: SEQ ID NO:5; protein: SEQ ID NO:6) are provided herein.

[0072] Additional Als coding regions and encoded proteins that may be used include those from Staphylococcus aureus (DNA: SEQ ID NO:7; protein: SEQ ID NO:8), Listeria monocytogenes (DNA: SEQ ID NO:9; protein: SEQ ID NO:10), Streptococcus mutans (DNA: SEQ ID NO:11; protein: SEQ ID NO:12), Streptococcus thermophilus (DNA: SEQ ID NO:13; protein: SEQ ID NO:14), Vibrio angustum (DNA: SEQ ID NO:15; protein: SEQ ID NO:16), and Bacillus cereus (DNA: SEQ ID NO:17; protein: SEQ ID NO:18). Any Als gene that encodes an acetolactate synthase having at least about 80-85%, 85%-90%, 90%-95%, or at least about 96%, 97%, or 98% sequence identity to any of those with SEQ ID NOs:2, 4, 6, 8, 10, 12, 14, 16, or 18 that converts pyruvate to acetolactate may be used. Identities are based on the Clustal W method of alignment using the default parameters of GAP PENALTY=10, GAP LENGTH PENALTY=0.1, and Gonnet 250 series of protein weight matrix.

[0073] Additionally, US Patent Application Publication No. 20090305363 provides a phylogenetic tree depicting acetolactate synthases that are the 100 closest neighbors of the B. subtilis AlsS sequence, any of which may be used. Additional Als sequences that may be used in the present strains may be identified in the literature and in bioinformatics databases as is well known to the skilled person. Identification of coding and/or protein sequences using bioinformatics is typically through BLAST (described above) searching of publicly available databases with known Als encoding sequences or encoded amino acid sequences, such as those provided herein. Identities are based on the Clustal W method of alignment as specified above. Additionally, the sequences described herein or those recited in the art may be used to identify other homologs in nature as described above.

[0074] Cytosolic expression of acetolactate synthase is achieved by transforming with a gene comprising a sequence encoding an acetolactate synthase protein, with no mitochondrial targeting signal sequence. Methods for gene expression in yeasts are known in the art (see for example Methods in Enzymology, Volume 194, Guide to Yeast Genetics and Molecular and Cell Biology (Part A, 2004, Christine Guthrie and Gerald R. Fink (Eds.), Elsevier Academic Press, San Diego, Calif.). Expression using chimeric genes (including coding regions with operably linked promoters and terminators), vectors, cloning methods, and integration methods are as described above.

[0075] Conversion of acetolactate to acetoin is by an acetolactate decarboxylase enzyme, known as EC 4.1.1.5 which is available, for example, from Bacillus subtilis (DNA: SEQ ID NO:21; Protein: SEQ ID NO:22), Klebsiella terrigena (DNA: SEQ ID NO:23, Protein: SEQ ID NO:24) and Klebsiella pneumoniae (DNA: SEQ ID NO:19, protein: SEQ ID NO:20). Any gene that encodes an acetolactate decarboxylase having at least about 80-85%, 85%-90%, 90%-95%, or at least about 96%, 97%, or 98% sequence identity to any of those with SEQ ID NOs:20, 22, or 24 that converts acetolactate to acetoin may be used.

[0076] Conversion of acetoin to 2,3-butanediol is by a butanediol dehydrogenase enzyme, also known as acetoin reductase. Butanediol dehydrogenase enzymes may have specificity for production of (R)- or (S)-stereochemistry in the alcohol product. (S)-specific butanediol dehydrogenases are known as EC 1.1.1.76 and are available, for example, from Klebsiella pneumoniae (DNA: SEQ ID NO:25; protein: SEQ ID NO:26). (R)-specific butanediol dehydrogenases are known as EC 1.1.1.4 and are available, for example, from Bacillus cereus (DNA: SEQ ID NO:27, protein: SEQ ID NO:28), Lactococcus lactis (DNA: SEQ ID NO:29, protein: SEQ ID NO:30), and Saccharomyces cerevisiae (BDH1; DNA: SEQ ID NO:54, protein: SEQ ID NO:55). Any gene that encodes a butanediol dehydrogenase having at least about 80-85%, 85%-90%, 90%-95%, or at least about 98% sequence identity to any of those with SEQ ID NOs:26, 28, 30 or 55 that converts acetoin to 2,3-butanediol may be used.

[0077] Diol dehydratases, also known as butanediol dehydratases, which utilize the cofactor adenosyl cobalamin (vitamin B12) are known as EC 4.2.1.28. Glycerol dehydratases that also utilize the cofactor adenosyl cobalamin are known as EC 4.2.1.30. Diol and glycerol dehydratases have three subunits that are required for activity. Provided in US 20070292927A1 are examples of sequences of the three subunits of many diol and glycerol dehydratases that may be used in a 2-butanone or 2-butanol pathway in the present cells, as well as the preparation and use of a Hidden Markov Model to identify additional diol and dehydratase enzymes that may be used.

[0078] Butanol dehydrogenases are a subset of a broad family of alcohol dehydrogenases and may be NAD.sup.+- or NADP.sup.+-dependent. The NAD-dependent enzymes are known as EC 1.1.1.1 and the NADP-dependent enzymes are known as EC 1.1.1.2. Provided in US 20070292927A1 are examples of sequences of butanol dehydrogenases that may be used in the disclosed 2-butanol biosynthetic pathway in the present cells.

[0079] Described in US Patent Application Publication US 20090155870 A1, which is herein incorporated by reference, are construction of chimeric genes and genetic engineering of yeast for 2-butanol production using the US 20070292927A1 disclosed biosynthetic pathway. Further description for gene construction and expression is above and in the Examples herein.

[0080] The use in this pathway in yeast of the butanediol dehydratase from Roseburia inulinivorans, RdhtA, (protein SEQ ID NO:32, coding region SEQ ID NO:31) is disclosed in commonly owned and co-pending US Patent Application Publication US 20090155870 A1. This enzyme is used in conjunction with the butanediol dehydratase reactivase from Roseburia inulinivorans, RdhtB, (protein SEQ ID NO:34, coding region SEQ ID NO: 33). This butanediol dehydratase is desired in many hosts because it does not require coenzyme B.sub.12. Another B.sub.12-dependent diol dehydratase that may be used is one from Klebsiella pneumoniae, having three subunits: pduC, pduD, and pduE, that is disclosed in WO2009046370.

[0081] Useful for the last step of converting 2-butanone to 2-butanol in all pathways of FIG. 2 is a butanol dehydrogenase isolated from an environmental isolate of a bacterium identified as Achromobacter xylosoxidans that is disclosed in US Patent Application Publication No. 20090269823 (DNA: SEQ ID NO:35, protein SEQ ID NO:36), which is herein incorporated by reference.

[0082] Genes and their expression for other pathways of FIG. 2 are disclosed in US20070259410A1. Additional sequences that may be used to express the disclosed enzyme activities in the present strains may be identified in the literature and in bioinformatics databases as is well known to the skilled person. Identification of coding and/or protein sequences using bioinformatics is typically done using sequence analysis software such as BLAST (BLASTP, BLASTN, BLASTX (Altschul et al., J. Mol. Biol., 215:403-410 (1990)) and searching of publicly available databases with known encoding sequences or encoded amino acid sequences, such as those provided herein. Identities are based on the Clustal W method of alignment (described by Higgins and Sharp, CABIOS. 5:151-153 (1989); Higgins, D. G. et al., Comput. Appl. Biosci. 8:189-191 (1992)) and found in the MegAlign.TM. v6.1 program of the LASERGENE bioinformatics computing suite (DNASTAR Inc.). Default parameters for multiple alignment (GAP PENALTY=10, GAP LENGTH PENALTY=0.2, Delay Divergen Seqs(%)=30, DNA Transition Weight=0.5, Protein Weight Matrix=Gonnet Series, DNA Weight Matrix=IUB).

[0083] Additionally, the sequences described herein or those recited in the art may be used to identify other homologs in nature. For example each of the encoding nucleic acid fragments described herein may be used to isolate genes encoding homologous proteins. Isolation of homologous genes using sequence-dependent protocols is well known in the art. Examples of sequence-dependent protocols include, but are not limited to: 1) methods of nucleic acid hybridization; 2) methods of DNA and RNA amplification, as exemplified by various uses of nucleic acid amplification technologies [e.g., polymerase chain reaction (PCR), Mullis et al., U.S. Pat. No. 4,683,202; ligase chain reaction (LCR), Tabor, S. et al., Proc. Acad. Sci. USA 82:1074 (1985); or strand displacement amplification (SDA), Walker, et al., Proc. Natl. Acad. Sci. U.S.A., 89:392 (1992)]; and 3) methods of library construction and screening by complementation.

[0084] Biosynthetic pathways for production of isobutanol that may be engineered in the present cells are disclosed in the art, for example, in US Patent Application Publication US 20070092957 A1, which is herein incorporated by reference. A diagram of the disclosed isobutanol biosynthetic pathways is provided in FIG. 2. As described in US 20070092957 A1, steps in an example isobutanol biosynthetic pathway include conversion of: [0085] pyruvate to acetolactate (FIG. 2 pathway step a) as catalyzed for example by acetolactate synthase [0086] acetolactate to 2,3-dihydroxyisovalerate (FIG. 2 pathway step b) as catalyzed for example by acetohydroxy acid isomeroreductase, also called ketol-acid reductoisomerase; [0087] 2,3-dihydroxyisovalerate to .alpha.-ketoisovalerate (FIG. 2 pathway step c) as catalyzed for example by acetohydroxy acid dehydratase, also called dihydroxy-acid dehydratase; [0088] .alpha.-ketoisovalerate to isobutyraldehyde (FIG. 2 pathway step d) as catalyzed for example by branched-chain .alpha.-keto acid decarboxylase; and [0089] isobutyraldehyde to isobutanol (FIG. 2 pathway step e) as catalyzed for example by branched-chain alcohol dehydrogenase.

[0090] Acetolactate synthase was described above for the 2,3-butanediol pathway.

[0091] Acetohydroxy acid isomeroreductase, also called ketol-acid reductoisomerase (KARI) may naturally use NADPH (reduced nicotinamide adenine dinucleotide phosphate) as an electron donor. KARIs include those known by the EC number 1.1.1.86. Examples of sequences of KARI enzymes and their coding regions are provided in US20070092957 A1, including ILV5 from Saccharomyces cerevisiae (DNA: SEQ ID NO:37; protein SEQ ID NO:38). Ketol-acid reductoisomerase (KARI) enzymes are described in U.S. Patent Appl. Pub. Nos. 20080261230 A1, 20090163376 A1, 20100197519 A1, and PCT Appl. Pub. No. WO/2011/041415, all herein incorporated by reference. Examples of KARIs disclosed therein are those from Vibrio cholerae (DNA: SEQ ID NO:39; protein SEQ ID NO:40), Pseudomonas aeruginosa PAO1, (DNA: SEQ ID NO:41; protein SEQ ID NO:42), and Pseudomonas fluorescens PF5 (DNA: SEQ ID NO:43; protein SEQ ID NO:44) and Pf5.IlvC-Z4B8 mutant Pseudomonas fluorescens acetohydroxy acid reductoisomerase (DNA: SEQ ID NO:45; protein SEQ ID NO:46). Another KARI is encoded by the ilvC gene of Lactococcus lactis (DNA: SEQ ID NO:58; protein SEQ ID NO:59). KARIs also include Anaerostipes caccae KARI variants "K9G9" and "K9D3" (SEQ ID NOs: 62 and 61, respectively).

[0092] Acetohydroxy acid dehydratases, also called dihydroxy acid dehydratases (DHAD), are known by the EC number 4.2.1.9. Examples of sequences of DHAD enzymes and their coding regions are provided in U.S. Pat. No. 7,851,188 dihydroxyacid dehydratases (DHADs), including ILV3 of Saccharomyces cerevisiae (DNA: SEQ ID NO:47; protein SEQ ID NO:48). Additional [2Fe-2S].sup.2+ DHAD sequences such as the Streptococcus mutans DHAD (DNA: SEQ ID NO:49; protein SEQ ID NO:50) and a method for identifying [2Fe-2S].sup.2+ DHAD enzymes that may be used to obtain additional DHAD sequences that may be used are disclosed in co-pending US Patent Application Publication No. 20100081154, which is herein incorporated by reference.

[0093] Branched-chain .alpha.-keto acid decarboxylases (KivD) are known by the EC number 4.1.1.72. Examples of sequences of branched-chain .alpha.-keto acid decarboxylase enzymes and their coding regions are provided in US20070092957 A1, including Lactococcus lactis KivD (DNA: SEQ ID NO:51; codon optimized for expression in S. cerevisiae SEQ ID NO:53; protein SEQ ID NO:52). Additional branched-chain .alpha.-keto acid decarboxylases include one from Bacillus subtilis with coding sequence optimized for expression in S. cerevisiae (DNA: SEQ ID NO:54; protein SEQ ID NO:55), and others readily identified by one skilled in the art using bioinformatics as described above.

[0094] Branched-chain alcohol dehydrogenases are known by the EC number 1.1.1.265, but may also be classified under other alcohol dehydrogenases (specifically, EC 1.1.1.1 or 1.1.1.2). These enzymes utilize NADH (reduced nicotinamide adenine dinucleotide) and/or NADPH as electron donor and examples of sequences of branched-chain alcohol dehydrogenase enzymes and their coding regions are provided in US20070092957 A1.

[0095] U.S. Patent Appl. Publ. No. 20090269823 A1 describes SadB (DNA: SEQ ID NO:35, protein SEQ ID NO:36), an alcohol dehydrogenase (ADH) from Achromobacter xylosoxidans. Alcohol dehydrogenases also include horse liver ADH (HADH; codon optimized for expression in S. cerevisiae; DNA: SEQ ID NO:56; protein SEQ ID NO:57) and Beijerinkia indica ADH (protein SEQ ID NO: 74) as well as others readily identified by one skilled in the art using bioinformatics as described above.

[0096] Genes that may be used for expression of enzymes for two additional isobutanol pathways are described in US 20070092957 A1. Additional genes that may be used in all three pathways can be identified by one skilled in the art as described above.

[0097] Additionally described in US 20070092957 A1 are construction of chimeric genes and genetic engineering of yeast, exemplified by Saccharomyces cerevisiae, for isobutanol production using the disclosed biosynthetic pathways. Further description for gene construction and expression is above and in the Examples herein.

[0098] A biosynthetic pathway for production of 1-butanol that may be engineered in the present cells is disclosed in co-pending US Patent Application Publication US 20080182308A1, which is herein incorporated by reference. A diagram of the disclosed 1-butanol biosynthetic pathway is provided in FIG. 3. As described in US 20080182308A1, steps in the disclosed 1-butanol biosynthetic pathway include conversion of: [0099] acetyl-CoA to acetoacetyl-CoA (FIG. 3 pathway step a), as catalyzed for example by acetyl-CoA acetyltransferase; [0100] acetoacetyl-CoA to 3-hydroxybutyryl-CoA (FIG. 3 pathway step b), as catalyzed for example by 3-hydroxybutyryl-CoA dehydrogenase; [0101] 3-hydroxybutyryl-CoA to crotonyl-CoA (FIG. 3 pathway step c), as catalyzed for example by crotonase; [0102] crotonyl-CoA to butyryl-CoA (FIG. 3 pathway step d), as catalyzed for example by butyryl-CoA dehydrogenase; [0103] butyryl-CoA to butyraldehyde (FIG. 3 pathway step e), as catalyzed for example by butyraldehyde dehydrogenase; and [0104] butyraldehyde to 1-butanol (FIG. 3 pathway step f), as catalyzed for example by butanol dehydrogenase.

[0105] Genes that may be used for expression of these enzymes are described in US 20080182308A1, and additional genes that may be used can be identified by one skilled in the art as described above. Methods for expression of these genes in yeast are described in US 20080182308A1 as well as herein above.

[0106] In some embodiments, the butanol biosynthetic pathway comprises at least one gene, at least two genes, at least three genes, at least four genes, or at least 5 genes that is/are heterologous to the yeast cell. In embodiments, each substrate to product conversion of a butanol biosynthetic pathway in a recombinant host cell is catalyzed by a heterologous polypeptide. In embodiments, the butanol biosynthetic pathway is an isobutanol biosynthetic pathway and the polypeptide catalyzing the substrate to product conversions of acetolactate to 2,3-dihydroxyisovalerate and/or the polypeptide catalyzing the substrate to product conversion of isobutyraldehyde to isobutanol are capable of utilizing NADH as a cofactor.

[0107] It will be appreciated that host cells comprising a butanol biosynthetic pathway such as an isobutanol biosynthetic pathway as provided herein may further comprise one or more additional modifications. U.S. Appl. Pub. No. 20090305363 (incorporated by reference) discloses increased conversion of pyruvate to acetolactate by engineering yeast for expression of a cytosol-localized acetolactate synthase and substantial elimination of pyruvate decarboxylase activity. Modifications to reduce glycerol-3-phosphate dehydrogenase activity and/or disruption in at least one gene encoding a polypeptide having pyruvate decarboxylase activity or a disruption in at least one gene encoding a regulatory element controlling pyruvate decarboxylase gene expression as described in U.S. Patent Appl. Pub. No. 20090305363 (incorporated herein by reference), modifications to a host cell that provide for increased carbon flux through an Entner-Doudoroff Pathway or reducing equivalents balance as described in U.S. Patent Appl. Pub. No. 20100120105 (incorporated herein by reference). Other modifications include integration of at least one polynucleotide encoding a polypeptide that catalyzes a step in a pyruvate-utilizing biosynthetic pathway. Other modifications include at least one deletion, mutation, and/or substitution in an endogenous polynucleotide encoding a polypeptide having acetolactate reductase activity. In embodiments, the polypeptide having acetolactate reductase activity is YMR226c (SEQ ID NO: 63) of Saccharomyces cerevisae or a homolog thereof. Additional modifications include a deletion, mutation, and/or substitution in an endogenous polynucleotide encoding a polypeptide having aldehyde dehydrogenase and/or aldehyde oxidase activity. In embodiments, the polypeptide having aldehyde dehydrogenase activity is ALD6 (SEQ ID NO: 60) from Saccharomyces cerevisiae or a homolog thereof. A genetic modification which has the effect of reducing glucose repression wherein the yeast production host cell is pdc- is described in U.S. Appl. Publication No. 20110124060, incorporated herein by reference.

[0108] Recombinant host cells may further comprise (a) at least one heterologous polynucleotide encoding a polypeptide having dihydroxy-acid dehydratase activity; and (b)(i) at least one deletion, mutation, and/or substitution in an endogenous gene encoding a polypeptide affecting Fe--S cluster biosynthesis; and/or (ii) at least one heterologous polynucleotide encoding a polypeptide affecting Fe--S cluster biosynthesis. In embodiments, the polypeptide affecting Fe--S cluster biosynthesis is encoded by AFT1 (nucleic acid SEQ ID NO: 64, amino acid SEQ ID NO: 65), AFT2 (SEQ ID NOs: 66 and 67), FRA2 (SEQ ID NOs: 68 and 69), GRX3 (SEQ ID NOs: 70 and 71), or CCC1 (SEQ ID NOs: 72 and 73). In embodiments, the polypeptide affecting Fe--S cluster biosynthesis is constitutive mutant AFT1 L99A, AFT1 L102A, AFT1 C291F, or AFT1 C293F.

Fermentation Media

[0109] High cell density production cultures disclosed herein are maintained in culture medium that supports metabolism for production of butanol. The culture medium can also provide the culture viability for production of butanol. Typically media used for the present invention may contain at least about 2 g/L glucose or an equivalent amount of carbon substrates. Carbon substrates may include but are not limited to monosaccharides such as fructose, oligosaccharides such as lactose maltose, galactose, or sucrose, polysaccharides such as starch or cellulose or mixtures thereof and unpurified mixtures from renewable feedstocks such as cheese whey permeate, cornsteep liquor, sugar beet molasses, and barley malt. Other carbon substrates may include ethanol, lactate, succinate, or glycerol. Hence it is contemplated that the source of carbon in the media may encompass a wide variety of carbon containing substrates. Carbon substrates may also be provided by corn mash, cane juice, molasses, wheat mash, or other forms of biomass that have been liquefied, or treated and saccharified, to release carbon sources therein. Carbon substrates are typically maintained in excess to allow for maximal metabolism.

[0110] In addition to an appropriate carbon source, fermentation media must contain suitable minerals, salts, cofactors, buffers and other components, known to those skilled in the art, suitable for the metabolism of the cultures and promotion of the enzymatic pathway necessary for production of butanol.

[0111] Suitable media include common commercially prepared media such as broth that includes yeast nitrogen base, ammonium sulfate, and dextrose as the carbon/energy source or YPD Medium, a blend of peptone, yeast extract, and dextrose in optimal proportions for growing most Saccharomyces cerevisiae strains. Other defined or synthetic growth media may also be used and are known by one skilled in the art of microbiology or fermentation science.

Culture Conditions

[0112] Typically cultures are maintained under conditions to support a viable butanol producing yeast cell, including a temperature in the range of about 20.degree. C. to about 37.degree. C. in an appropriate medium. Suitable pH ranges for the fermentation are typically between pH 3.0 and pH 7.5, where pH 4.5 to pH 6.5 is in some embodiments the initial condition.

[0113] Fermentations may be performed under aerobic or anaerobic conditions. In some embodiments, dissolved oxygen is maintained between microaerobic conditions to above 3%.

[0114] The amount of butanol in the fermentation medium is typically determined by high performance liquid chromatography (HPLC). However, other art-known methods can be used.

[0115] Cultures may be fermented in batch, fed-batch, or continuous systems. A Fed-Batch system is similar to a typical batch system with the exception that the carbon source substrate is added in increments as the fermentation progresses. Batch and Fed-Batch fermentations are common and well known in the art and examples may be found in Thomas D. Brock in Biotechnology: A Textbook of Industrial Microbiology, Second Edition (1989) Sinauer Associates, Inc., Sunderland, Mass., or Deshpande, Mukund V., Appl. Biochem. Biotechnol., 36:227, (1992), herein incorporated by reference.

[0116] Typical production conditions may include a fed-batch process for a period of time with a switch to a batch mode once all of the carbon source is added. Under commercial conditions, the liquefied mash or feedstock is fed over a period of time and a saccharification enzyme is also added to the fermentor which releases glucose from starch over time. This slow release of glucose over time from starch is controlled by the amount of saccharification enzyme that is added to the fermentor. In the case of cane juice fermentation, the substrate is slowly added over time until all substrates are added after which the fermentation proceeds under batch mode. The fermentation may be run for a period of time that is between about one hour and 200 hours.

Product Isolation from Fermentation Medium

[0117] During production, butanol product may be removed from the fermentation media by processes known in the art including vacuum application and liquid-liquid extraction.

[0118] Products can be isolated from the fermentation medium by methods known to one skilled in the art. For example, bioproduced isobutanol may be isolated from the fermentation medium using methods known in the art for ABE fermentations (see, e.g., Durre, Appl. Microbiol. Biotechnol. 49:639-648 (1998), Groot et al., Process. Biochem. 27:61-75 (1992), and references therein). For example, solids may be removed from the fermentation medium by centrifugation, filtration, decantation, or the like. Then, the isobutanol may be isolated from the fermentation medium using methods such as distillation, azeotropic distillation, liquid-liquid extraction, adsorption, gas stripping, membrane evaporation, pervaporation or vacuum flash fermentation (see e.g., U.S. Pub. No. 20090171129 A1, and International Pub. No. WO2010/151832 A1, both incorporated herein by reference in their entirety). A vacuum may be applied to a portion or the whole of the fermentation broth to remove butanol from the aqueous phase.

[0119] Because butanol forms a low boiling point, azeotropic mixture with water, distillation can be used to separate the mixture up to its azeotropic composition. Distillation may be used in combination with another separation method to obtain separation around the azeotrope. Methods that may be used in combination with distillation to isolate and purify butanol include, but are not limited to, decantation, liquid-liquid extraction, adsorption, and membrane-based techniques. Additionally, butanol may be isolated using azeotropic distillation using an entrainer (see, e.g., Doherty and Malone, Conceptual Design of Distillation Systems, McGraw Hill, New York, 2001).

[0120] The butanol-water mixture forms a heterogeneous azeotrope so that distillation may be used in combination with decantation to isolate and purify the isobutanol. In this method, the isobutanol containing fermentation broth is distilled to near the azeotropic composition. Then, the azeotropic mixture is condensed, and the isobutanol is separated from the fermentation medium by decantation. The decanted aqueous phase may be returned to the first distillation column as reflux. The isobutanol-rich decanted organic phase may be further purified by distillation in a second distillation column.

[0121] The butanol can also be isolated from the fermentation medium using liquid-liquid extraction in combination with distillation. In this method, the isobutanol is extracted from the fermentation broth using liquid-liquid extraction with a suitable solvent. The isobutanol-containing organic phase is then distilled to separate the butanol from the solvent. The amount of an extractant added may be from 5% to 50% of the fermentor volume for use in liquid-liquid extraction (LLE) to remove butanol from the aqueous medium during fermentation.

[0122] Distillation in combination with adsorption can also be used to isolate isobutanol from the fermentation medium. In this method, the fermentation broth containing the isobutanol is distilled to near the azeotropic composition and then the remaining water is removed by use of an adsorbent, such as molecular sieves (Aden et al., Lignocellulosic Biomass to Ethanol Process Design and Economics Utilizing Co-Current Dilute Acid Prehydrolysis and Enzymatic Hydrolysis for Corn Stover, Report NREL/TP-510-32438, National Renewable Energy Laboratory, June 2002).

[0123] Additionally, distillation in combination with pervaporation may be used to isolate and purify the isobutanol from the fermentation medium. In this method, the fermentation broth containing the isobutanol is distilled to near the azeotropic composition, and then the remaining water is removed by pervaporation through a hydrophilic membrane (Guo et al., J. Membr. Sci. 245, 199-210 (2004)).

[0124] In situ product removal (ISPR) (also referred to as extractive fermentation) can be used to remove butanol (or other fermentative alcohol) from the fermentation vessel as it is produced, thereby allowing the microorganism to produce butanol at high yields. One method for ISPR for removing fermentative alcohol that has been described in the art is liquid-liquid extraction. In general, with regard to butanol fermentation, for example, the fermentation medium, which includes the microorganism, is contacted with an organic extractant at a time before the butanol concentration reaches a toxic level. The organic extractant and the fermentation medium form a biphasic mixture. The butanol partitions into the organic extractant phase, decreasing the concentration in the aqueous phase containing the microorganism, thereby limiting the exposure of the microorganism to the inhibitory butanol.

[0125] Liquid-liquid extraction can be performed, for example, according to the processes described in U.S. Patent Appl. Pub. No. 20090305370, the disclosure of which is hereby incorporated in its entirety. U.S. Patent Appl. Pub. No. 20090305370 describes methods for producing and recovering butanol from a fermentation broth using liquid-liquid extraction, the methods comprising the step of contacting the fermentation broth with a water immiscible extractant to form a two-phase mixture comprising an aqueous phase and an organic phase. Typically, the extractant can be an organic extractant selected from the group consisting of saturated, mono-unsaturated, poly-unsaturated (and mixtures thereof) C.sub.12 to C.sub.22 fatty alcohols, C.sub.12 to C.sub.22 fatty acids, esters of C.sub.12 to C.sub.22 fatty acids, C.sub.12 to C.sub.22 fatty aldehydes, and mixtures thereof. The extractant(s) for ISPR can be non-alcohol extractants. The ISPR extractant can be an exogenous organic extractant such as oleyl alcohol, behenyl alcohol, cetyl alcohol, lauryl alcohol, myristyl alcohol, stearyl alcohol, 1-undecanol, oleic acid, lauric acid, myristic acid, stearic acid, methyl myristate, methyl oleate, undecanal, lauric aldehyde, 20-methylundecanal, and mixtures thereof.

[0126] In some embodiments, the alcohol can be esterfied by contacting the alcohol in a fermentation medium with an organic acid (e.g., fatty acids) and a catalyst (e.g. enzyme such as a lipase) capable of esterfiying the alcohol with the organic acid. In such embodiments, the organic acid can serve as an ISPR extractant into which the alcohol esters partition. The organic acid can be supplied to the fermentation vessel and/or derived from the biomass supplying fermentable carbon fed to the fermentation vessel. Lipids present in the feedstock can be catalytically hydrolyzed to organic acid, and the same catalyst (e.g., enzymes) can esterify the organic acid with the alcohol. The catalyst can be supplied to the feedstock prior to fermentation, or can be supplied to the fermentation vessel before or contemporaneously with the supplying of the feedstock. When the catalyst is supplied to the fermentation vessel, alcohol esters can be obtained by hydrolysis of the lipids into organic acid and substantially simultaneous esterification of the organic acid with butanol present in the fermentation vessel. Organic acid and/or native oil not derived from the feedstock can also be fed to the fermentation vessel, with the native oil being hydrolyzed into organic acid. Any organic acid not esterified with the alcohol can serve as part of the ISPR extractant. The extractant containing alcohol esters can be separated from the fermentation medium, and the alcohol can be recovered from the extractant. The extractant can be recycled to the fermentation vessel. Thus, in the case of butanol production, for example, the conversion of the butanol to an ester reduces the free butanol concentration in the fermentation medium, shielding the microorganism from the toxic effect of increasing butanol concentration. In addition, unfractionated grain can be used as feedstock without separation of lipids therein, since the lipids can be catalytically hydrolyzed to organic acid, thereby decreasing the rate of build-up of lipids in the ISPR extractant.

[0127] In situ product removal can be carried out in a batch mode or a continuous mode. In a continuous mode of in situ product removal, product is continually removed from the reactor. In a batchwise mode of in situ product removal, a volume of organic extractant is added to the fermentation vessel and the extractant is not removed during the process. For in situ product removal, the organic extractant can contact the fermentation medium at the start of the fermentation forming a biphasic fermentation medium. Alternatively, the organic extractant can contact the fermentation medium after the microorganism has achieved a desired amount of growth, which can be determined by measuring the optical density of the culture. Further, the organic extractant can contact the fermentation medium at a time at which the product alcohol level in the fermentation medium reaches a preselected level. In the case of butanol production according to some embodiments of the present invention, the organic acid extractant can contact the fermentation medium at a time before the butanol concentration reaches a toxic level, so as to esterify the butanol with the organic acid to produce butanol esters and consequently reduce the concentration of butanol in the fermentation vessel. The ester-containing organic phase can then be removed from the fermentation vessel (and separated from the fermentation broth which constitutes the aqueous phase) after a desired effective titer of the butanol esters is achieved. In some embodiments, the ester-containing organic phase is separated from the aqueous phase after fermentation of the available fermentable sugar in the fermentation vessel is substantially complete.

EXAMPLES

[0128] The present invention is further defined in the following Examples. It should be understood that these Examples, while indicating embodiments of the invention, are given by way of illustration only. From the above discussion and these Examples, one skilled in the art can ascertain the essential characteristics of this invention, and without departing from the spirit and scope thereof, can make various changes and modifications of the invention to adapt it to various uses and conditions.

[0129] The meaning of abbreviations used is as follows: "min" means minute(s), "h" means hour(s), ".mu.L" means microliter(s), "mL" means milliliter(s), "L" means liter(s), "nm" means nanometer(s), "mm" means millimeter(s), "cm" means centimeter(s), ".mu.m" means micrometer(s), "mM" means millimolar, "M" means molar, "mmol" means millimole(s), ".mu.mole" means micromole(s), "g" means gram(s), ".mu.g" means microgram(s), "mg" means milligram(s), "OD.sub.600" means the optical density measured at a wavelength of 600 nm, "CFU" means colony forming unit, "HPLC" means high performance liquid chromatography.

GENERAL METHODS

[0130] Media and growth of yeast are described in Amberg, Burke and Strathern, 2005, Methods in Yeast Genetics, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. Methods are also described in Yeast Protocol (Editor W. Xiao, Humana Press, Totowa, N.J. The optical density at 600 nm (OD.sub.600) was measured using an Ultraspec 3100 spectrophotometer (GE Healthcare Life Sciences, Piscataway, N.J.).

Example 1

24 Hour Survival at Low Cell Density and High Cell Density for Saccharomyces cerevisiae BY4743

[0131] Saccharomyces cerevisiae BY4743 (ATCC 201390) was plated from a freezer vial onto YPD agar (Teknova, Hollister, Calif.; Cat. #Y1000) and incubated overnight at 30.degree. C. Cells from the YPD plate were inoculated into a 25 ml pre-culture of YPD broth (Teknova; Cat. #Y5000) with an initial OD.sub.600 of 0.06 to 0.08. The pre-culture was grown for 7 hours in an aerobic shaker at 30.degree. C. An aliquot of the pre-culture was inoculated into 200 ml of YPD broth, with an initial OD.sub.600 of 0.06 to 0.08, and this culture was incubated with aeration at 30.degree. C. for approximately 17 hours. The optical density of the culture was measured, the culture was centrifuged and the cells were resuspended to yield an OD.sub.600 of 5 in fresh YPD broth. This is referred to as a high cell density (HCD) culture. This culture was diluted 100 fold into fresh YPD broth to give a culture with an OD.sub.600 of 0.05. This is referred to as a low the cell density (LCD) culture. These cultures were incubated at 30.degree. C. with shaking for 30 minutes for an acclimation period. At the end of the 30 minute acclimation period, the OD.sub.600 was measured. 5 ml of low or high cell density culture was dispensed to 15 ml conical tubes which contained different concentrations of the alcohol to be tested. A control tube included no alcohol. The tubes were placed in a rotary drum at 30.degree. C. and incubated for 24 hours. The time of addition of cells was referred to as "time 0". The concentrations tested were: 1-butanol at 1.0% (w/v), 1.5% and 2%; isobutanol at 1.0%, 1.5% and 2.5%; 2-butanol at 2.0%, 3.0% and 3.5%; and ethanol at 2.0%, 4.0%, and 5%. The alcohol concentration of each culture was measured after 24 hours by HPLC which showed that <0.01% of the alcohol was lost during the incubation period.

[0132] Colony forming units (CFUs) were determined on YPD plates at time 0 for the control and at 24 hours for each of the cultures, including the control culture, using standard microbiological methods. Cultures were serially diluted in a microtiter plate (20 .mu.l of the culture and 180 .mu.l of YPD broth) and 10 .mu.l of the various dilutions were spotted onto triplicate agar plates and incubated at 30.degree. C. for 24 and 48 hours. Colonies were counted after 48 hours, and CFUs/ml were calculated. At time 0, the low cell density cultures had about 4.8.times.10.sup.5 CFUs/ml, and the high cell density cultures had about 5.4.times.10.sup.7 CFUs/ml. The percent survival was calculated based on the CFUs/ml of the 24 hour control (0 butanol) HCD or LCD culture, and the results are given in Table 2.

TABLE-US-00003 TABLE 2 Survival of S. cerevisiae after 24 h exposure to variousalcohols Concentration Percent Survival Alcohol (w/v) HCD (%) LCD (%) 1-Butanol 1.0% 72.2 1.3 1.5% 57.3 0.4 2.0% 3.6 0.1 Isobutanol 1% 77.8 0.6 1.5% 45.1 0.1 2.5% <0.001 <0.001 2-Butanol 2% 27.6 0.3 3% 18.9 0.1 3.5% <0.001 <0.001 Ethanol 2% 62.5 100.0 4% 50.2 100.0 5% 38.9 25.3

[0133] The control high cell density culture without alcohol increased from 5.4.times.10.sup.7 cells/ml to 6.5.times.10.sup.7 cells/ml in 24 hours. The control low cell density culture without alcohol increased from 4.8.times.10.sup.5 cells/ml to 2.8.times.10.sup.7 cells/ml in 24 hours. In cultures exposed to 1-butanol, isobutanol, or 2-butanol, the percent survival was significantly higher for the high cell density cultures than for the low cell density cultures (Table 2). In contrast, high cell density cultures were more sensitive to exposure to 2.0% or 4.0% ethanol than the low cell density cultures.

Example 2

24 Hour Survival at Low Cell Density and High Cell Density for Non-Saccharomyces Yeasts

[0134] Non-Saccharomyces yeast strains were tested to determine whether isobutanol tolerance is affected by cell density for these yeasts. Kluyveromyces marxianus PNY0577 (American Type Culture Collection ("ATCC"), Manassas, Va.; ATCC #8554), Kluyveromyces marxians PNY0578 (ATCC #16045), Pichia membranifaciens PNY0572, Pichia anomala PNY0573, Pichia sp. PNY0574, Issatchenkia orientalis PNY0575 and Issatchenkia orientalis PNY0576 were tested. The Pichia and Issatchenkia strains used are wild-type representatives of the designated genera. Other representative strains are commercially available, for example, from ATCC.

[0135] The non-Saccharomyces yeast strains were plated from freezer vials onto YPD agar (Teknova, Hollister, Calif.) and incubated overnight at 30.degree. C. Cells from each strain from the YPD plates were inoculated into a 25 ml pre-culture of YPD broth) (Teknova, Hollister, Calif.) with an initial OD.sub.600 of 0.06 to 0.08 and grown for 7 hours in an aerobic shaker at 30.degree. C. An aliquot of each pre-culture was inoculated into 200 ml YPD broth, to provide an initial OD.sub.600 of 0.06 to 0.08 and the resulting culture was incubated with aeration at 30.degree. C. for approximately 17 hours. The optical density of the culture was measured, the culture was centrifuged and the cells were resuspended to yield an OD.sub.600 of 5 in fresh YPD broth. This is referred to as a high cell density (HCD) culture. This culture was diluted 100 fold into fresh YPD broth to give a culture with an OD.sub.600 of 0.05. This is referred to as a low cell density (LCD) culture. These cultures were incubated at 30.degree. C. with shaking for 30 minutes for an acclimation period. At the end of the 30 minute acclimation period, the OD.sub.600 was measured.

[0136] 5 ml of low or high cell density culture was dispensed to 15 ml conical tubes which contained the concentrations of isobutanol to be tested. The tubes were placed in a rotary drum at 30.degree. C. and incubated for 24 hours. The isobutanol concentration was 1.0% or 2.0%. A control culture for each strain contained no isobutanol. The time of addition of cells was "time 0". The isobutanol concentration of each sample was measured after 24 hours by HPLC which showed that <0.01% of the alcohol was lost during the incubation period. All samples were filtered with Acrodisc CR PTFE 0.2 .mu.m filters (Pall Life Sciences, Port Washington, N.Y.) and analyzed using a Shodex SH1011 column (8 mm ID.times.300 mm length; Showa Denko America, Inc., New York, N.Y.) and a Shodex SH-G as a guard column. The injection volume was 10 .mu.L. The mobile phase was 0.01N sulfuric acid. The column temperature was 50.degree. C. with a mobile phase flow rate of 0.5 mL/min. For detection, a photometric detector at 210 nm and a refractive index detector were used.

[0137] Colony forming units (CFUs) were determined on YPD plates at time 0 for the controls and at 24 hours for all samples, including the control cultures, using standard microbiological methods. Essentially, cells were serially diluted in a microtiter plate (20 .mu.l of the culture and 180 .mu.l of YPD broth) and 10 .mu.l of the various dilutions were spotted onto triplicate agar plates and incubated at 30.degree. C. for 24 and 48 hours. Colonies were counted after 48 hours and CFUs/ml were calculated. The percent survival was calculated based on the CFUs/ml of the 24 hour HCD or LCD control culture for each strain and results are given in Table 3.

[0138] The control high cell density cultures without isobutanol increased from about 1.0.times.10.sup.8 cells/ml to about 1.5.times.10.sup.8 cells/ml in 24 hours. The control low cell density cultures without isobutanol increased from about 1.1.times.10.sup.6 cells/ml to about 1.0.times.10.sup.8 cells/ml in 24 hours. All of the non-Saccharomyces yeast strains had significantly higher levels of survival in HCD cultures than in LCD cultures exposed to 1.0% isobutanol (Table 3). Only one strain (PNY0574) survived exposure to 2.0% isobutanol. This strain also had a significantly higher level of survival in the HCD culture than in the LCD culture exposed to 2.0% isobutanol.

TABLE-US-00004 TABLE 3 Survival of non-Saccharomyces yeasts after 24 h exposure to isobutanol Percent Survival 1% isobutanol 2% isobutanol Strain ID Name HCD LCD HCD LCD PNY0572 Pichia membranifaciens 80.6 2.7 <0.001 <0.001 PNY0573 Pichia anomala 75.6 0.1 <0.001 <0.001 PNY0574 Pichia sp. 100 10 100 0.5 PNY0575 Issatchenkia orientalis 61.3 3.1 <0.001 <0.001 PNY0576 Issatchenkia orientalis 50 12.8 <0.001 <0.001 PNY0577 Kluyveromyces marxianus 39.5 0.1 <0.001 <0.001 PNY0578 Kluyveromyces marxianus 51 <0.001 <0.001 <0.001

Example 3

[0139] Glucose Utilization Rates for Saccharomyces cerevisiae Strains in the Presence of Isobutanol in High Cell Density Culture

[0140] Saccharomyces cerevisiae BY4743 was plated from a freezer vial onto YPD agar (Teknova, Hollister, Calif.; Cat. #Y1000) and incubated overnight at 30.degree. C. Cells from the YPD plate were inoculated into a 25 ml pre-culture of YPD broth (Teknova, Hollister, Calif.; Cat. #Y5000) with an initial OD.sub.600 of 0.06 to 0.08, and this culture was grown for 7 hours in an air shaker at 30.degree. C. An aliquot of the pre-culture was inoculated into 200 ml of YPD broth with an initial OD.sub.600 of 0.06 to 0.08, and this culture was incubated with aeration at 30.degree. C. for approximately 17 hours. The optical density of the culture was determined, the culture was centrifuged and the cells were resuspended in fresh YPD broth to yield an OD.sub.600 of 8. At this point, an aliquot of 10 ml of the culture was used for a dry cell weight determination, which gave a result of 3.89 gdcw/L.

[0141] 20 ml cultures were transferred to flasks that contained 0.0%, 0.5%, 1.0%, 1.25%, 1.5%, 1.75% or 2.0% isobutanol. Time of addition of cells was referred to as 0 hours. At various times, samples (1.5 ml) were withdrawn and centrifuged at 10,000 rpm for 2 minutes in a microfuge (Sorvall Biofuge pico). The supernatant was filtered through a 0.2 .mu.m filter (Pall Life Sciences, Port Washington, N.Y.) Pall GHP Acrodisc 13 mm Syringe Filter with 0.2 .mu.m GHP Membrane), the filtrate was diluted ten-fold and the glucose concentration was determined using a YSI Glucose Analyzer (YSI 2700 Select; YSI, Inc., Yellow Springs, Ohio).

[0142] Glucose consumption rates in the presence of isobutanol were also determined when the cell concentration was 18 gdcw/L or 24 gdcw/L. Cells from the YPD plate were inoculated into a 25 ml pre-culture of YPD broth (Teknova, Hollister, Calif. Cat. #Y5000) with an initial OD.sub.600 of 0.06 to 0.08, and this culture was grown for 7 hours in an air shaker at 30.degree. C. An aliquot of the pre-culture was inoculated into 8 flasks with 300 ml of YPD broth with an initial OD.sub.600 of 0.1 and this culture was incubated with aeration at 30.degree. C. for approximately 17 hours. The optical density of the culture was determined, the culture was centrifuged and the cells were resuspended in fresh YPD broth to yield an OD.sub.600 of 38 (18 gdcw/L). Samples (15 ml) were transferred to flasks containing 2%, 3%, or 4% isobutanol and glucose consumption was followed for 150 min.

[0143] The experiment was repeated except that the starting OD.sub.600 was 49.1 (24 gdcw/L). Samples (15 ml) were transferred to flasks containing 1.5%, 3%, or 4% isobutanol and glucose consumption was followed for 150 min. The results for BY4743 are graphed in FIG. 4.

[0144] In the control culture (without isobutanol) the glucose consumption rate was around 2.17 g/gdcw/h under the experimental conditions tested. At 1.5% isobutanol the glucose consumption rate for both the 3.8 and the 24 gdcw/L cultures was approximately 50% of the control rate (about 1.1 g/gdcw/h). Glucose consumption was observed even in the presence of 2.0% and 2.5% isobutanol, and it ranged from 20 to 25% of the control rate. The actual glucose consumption rate in the control may vary 5% from one experiment to another. However, the relative percentage of isobutanol inhibition were reproducible.

Example 4

Process to Generate High Cell Density Culture

[0145] Saccharomyces cerevisiae BY4743 was plated from a freezer vial onto YPD agar (Teknova, Hollister, Calif.; Cat. #Y1000) and incubated overnight at 30.degree. C. Cells from the YPD plate were inoculated into a 25 ml pre-culture of YPD broth (Teknova, Hollister, Calif.; Cat. #Y5000) with an initial OD.sub.600 of 0.06 to 0.08. The pre-culture was grown for 7 hours in an aerobic shaker at 30.degree. C. An aliquot of the pre-culture was inoculated into 125 ml of YPD broth, with an initial OD.sub.600 of 0.06 to 0.08, and this culture was incubated with aeration at 30.degree. C. for approximately 17 hours.

[0146] A starter culture of 150 ml of YPD was inoculated with 5 ml of the overnight culture with an initial OD.sub.600 of 0.15. This culture was incubated at 30.degree. C. with aeration for two hours, reaching OD.sub.600 of 0.38. At this time (time=0) the cultures were challenged with varying concentrations of isobutanol as given in Table 4 and the OD.sub.600 was assayed every hour for 5 h and at 24 h. Results are given in Table 4.

TABLE-US-00005 TABLE 4 Growth (OD 600) of yeast in varying isobutanol concentrations BY4743 Time Isobutanol Concentration (Hours) 0% 0.5% 0.75% 1% 1.5% 2% 2.5% 0 0.38 0.38 0.38 0.38 0.38 0.38 0.38 1 0.56 0.55 0.54 0.51 0.46 0.43 0.42 2 0.80 0.74 0.68 0.62 0.50 0.46 0.42 3 1.42 1.12 1.00 0.82 0.53 0.47 0.42 4 2.32 1.69 1.36 0.96 0.61 0.49 0.45 5 3.02 2.22 1.80 1.16 0.65 0.50 0.45 24 6.44 5.19 4.97 4.41 1.41 0.90 0.52

Since for this strain, one OD.sub.600 is equivalent to 10.sup.7 cells/ml, S. cerevisiae grew to about 4.times.10.sup.7 cells/ml in the presence of 1% isobutanol after overnight growth.

[0147] The cultures grown in 1% isobutanol, or less (concentration of isobutanol <1%), to OD.sub.600 of 4.41, or greater, are high cell density cultures. These cultures can be used for isobutanol production.

Example 5

Process to Generate Strains with High Glucose Consumption

[0148] S. cerevisiae strains used were PNY0569 CEN.PK122 (MATa MAL2-8c SUC2/MATalpha MAL2-8c SUC2) and PNY0571 (CEN.PK 113-7D MATa MAL2-8c SUC2), obtained from Centraalbureau voor Schimmelcultures, Fungal and Yeast Collection (Netherlands) as well as PNY0602 and PNY0614. PNY0602 and PNY0614 were isolated from a mutagenized culture of PNY0571. PNY0571 (CEN.PK 113-7D) was subjected to 10 rounds of chemical mutagenesis with N-methyl-N'-nitro-N-nitrosoguanidine (NTG) and ethylmethane sulfonate (EMS) using standard methods (Barbour, L., M. Hanna and W. Xiao. 2006. Mutagenesis, p. 121-127. In W. Xiao (ed.), Yeast Protocols, Second Edition. Humana Press, NJ). Cells were grown overnight in 10 ml of YPD at 30.degree. C. with shaking. The overnight culture was centrifuged, and the pellet was resuspended in 50 mM potassium phosphate buffer, pH 7.0. The cells were again centrifuged and resuspended in 10 ml of the same buffer. A portion of the resuspended cells (2.5 ml) was transferred to a plastic 15 ml screw cap centrifuge tube, and the cells were treated with NTG (10 .mu.g/ml final concentration) or EMS (3% w/v) for 40 minutes at 30.degree. C. without shaking. The mutagen was inactivated by addition of an equal volume of filter sterilized 10% (w/v) sodium thiosulfate. The treated cells were centrifuged and resuspended in water two times. The treatment protocol involved repeated cycles of treating yeast cells with one of the mutagens (e.g., NTG), allowing the surviving cells to grow out overnight in YPD with 1% isobutanol, and then treating the overnight culture with the other mutagen (e.g., EMS). After the fifth cycle (i.e., a total of ten treatments with mutagen), cells were screened for isobutanol tolerance. PNY0602 was isolated after prolonged (24 h) exposure to 3.0% isobutanol. PNY0614 was isolated after 5 cycles of repeated freezing and thawing of the mutagenized culture by resuspending mutagenized cells in distilled water and transferring the cells to dry-ice ethanol bath and a 37.degree. C. water bath for 20 minutes each.

[0149] The isolated microorganism associated with ATCC Accession No. ______ is also known herein as PNY0602. The isolated microorganism associated with ATCC Accession No. ______ was deposited under the Budapest Treaty on ______ at the American Type Culture Collection, Patent Depository 10801 University Boulevard, Manassas, Va. 20110-2209. The isolated microorganism associated with ATCC Accession No. ______ is also known herein as PNY0614. The isolated microorganism associated with ATCC Accession No. ______ was deposited under the Budapest Treaty on ______ at the American Type Culture Collection, Patent Depository 10801 University Boulevard, Manassas, Va. 20110-2209.

[0150] Glucose consumption rates in the presence of isobutanol in YPD were also determined when the cell concentration was about 8 O.D. (about 3.9 gdcw/L). Saccharomyces cerevisiae strains PNY0571, PNY0602 and PNY0614 were plated from a freezer vial onto YPD agar (Teknova, Hollister, Calif.; Cat. #Y1000) and incubated overnight at 30.degree. C. Cells from the YPD plate were inoculated into a 25 ml pre-culture of YPD broth (Teknova, Hollister, Calif.; Cat. #Y5000) with an initial OD.sub.600 of 0.06 to 0.08, and this culture was grown for 7 hours in an air shaker at 30.degree. C. An aliquot of the pre-culture was inoculated into 200 ml of YPD broth with an initial OD.sub.600 of 0.06 to 0.08, and this culture was incubated with aeration at 30.degree. C. for approximately 17 hours. The optical density of the culture was determined, the culture was centrifuged and the cells were resuspended in fresh YPD broth to yield an OD.sub.600 of 8. At this point, an aliquot of 10 ml of the culture was used for a dry cell weight determination, which gave dry cell weight in the range of 3.89 gdcw/L.

[0151] 20 ml cultures were transferred to flasks that contained 0.0%, 0.5%, 0.75%, 1.0%, 1.25%, 1.5%, 1.75%, 2.0% and 2.5% isobutanol. Time of addition of cells was referred to as 0 hours. At various times samples (500 to 700 ul) were transferred to a tube containing an equal volume of 10% TCA. Samples were centrifuged and glucose determined in the YSI glucose analyzer (YSI 2700 Select).

[0152] Results are shown in Table 5. These results show that the glucose consumption rates can be improved compared with the parent by mutagenesis and selection.

TABLE-US-00006 TABLE 5 Glucose consumed per gdcw per hour in high cell density cultures with differing amounts of isobutanol. Isobutanol Concentration (%) Strain 0 0.5 0.75 1 1.5 1.75 2 2.5 PNY0571 2.4 2.1 2.0 1.9 1.0 0.8 0.6 0.3 PNY0602 3.1 3.0 2.9 2.7 1.5 1.1 0.9 0.5 PNY0614 3.0 2.9 2.8 2.6 1.4 1.0 0.8 0.4

[0153] Isobutanol production may or may not be coupled to growth. Therefore we measured the glucose consumption rates under non-growing conditions using three different pH buffers.

[0154] Glucose consumption rates were determined in phosphate buffer pH 6.5 as outlined by Diderich, J. A., et al., Microbiology, 1999, 145 p. 3447-54. We also determined the glucose consumption rates at pH 5.25 and pH 4.0 using MES buffer.

[0155] Cells were grown overnight at 30.degree. C. in YPD (125 ml in a 500 ml flask). Cells were centrifuged and resuspended in buffer (0.1M phosphate buffer pH 6.0 or 0.1M MES buffer pH 5.25 or 0.1M MES buffer pH 4.0), washed one time, and then resuspended such that each cell suspension had approximately 16 OD or 160 million cell/ml. 10 ml of these stock cells were transferred to flasks containing 10 ml of the corresponding buffer containing 40 g/L glucose and varying concentrations of isobutanol. At various times samples (500 to 700 ul) were transferred to a tube containing an equal volume of 10% TCA. Samples were centrifuged and glucose determined in the YSI glucose analyzer. Glucose consumption rates were determined by plotting the amount of glucose consumed over time. The results are given in Table 6. Higher glucose consumption rates were observed in the presence of 20 g/L isobutanol at lower pHs than at pH 6.5.

TABLE-US-00007 TABLE 6 Glucose utilization rates (g/gdcw/h) in the presence of 20 g/L isobutanol, at three different pH values. 20 g/L Glucose Utilization (g/gdcw/h) Isobutanol pH 4 pH 5.25 pH 6.5 PNY0569 0.7 0.8 0.6 PNY0602 0.9 1.1 0.7 PNY0614 0.9 1.1 0.7

Example 6

Glucose Utilization Rates for Issatchenkia orientalis Strains in the Presence of Isobutanol in High Cell Density Culture

[0156] Strain Issatchenkia orientalis PNY0660, was derived from ATCC 20381 strain that was previously referred to as Candida acidothermophilium. Strain PNY0660 was plated from a freezer vial onto YPD agar (Teknova, Hollister, Calif.; Cat. #Y1000) and incubated overnight at 30.degree. C. Cells from the YPD plate were inoculated into a 50 ml pre-culture of YPD broth (Teknova, Hollister, Calif.; Cat. #Y5000) with an initial OD.sub.600 of 0.06 to 0.08, and this culture was grown for 5 hours in an air shaker at 30.degree. C. An aliquot of the pre-culture was inoculated into 200 ml of YPD broth with an initial OD.sub.600 of 0.6, and this culture was incubated with aeration at 30.degree. C. for approximately 17 hours. The optical density of the culture was determined, the culture was centrifuged, and the sugar consumption rate was determined in corn test medium. Corn test medium contained 0.2% casamino acids and 2% glucose and 100 mM MES buffer pH 5.25 and per liter contained (i) salts: ammonium sulfate 5.0 g, potassium phosphate monobasic 2.8 g, and magnesium sulfate heptahydrate 0.5 g, (ii) vitamins: biotin (D-) 0.40 mg, Ca D(+) panthotenate 8.00 mg, myo-inositol 200.00 mg, pyridoxol hydrochloride 8.00 mg, p-aminobenzoic acid 1.60 mg, riboflavin 1.60 mg, folic acid 0.02 mg, niacin 30.0 mg, and thiamine 30 mg; and (iii) trace elements: EDTA (Titriplex 1117) 99.38 mg, zinc sulphate heptahydrate 29.81 mg, manganese chloride dehydrate 5.57 mg, cobalt(II)chloride hexahydrate 1.99 mg, copper(II)sulphate pentahydrate 1.99 mg, Di-sodium molybdenum dehydrate 2.65 mg, calcium chloride dehydrate 29.81 mg, iron sulphate heptahydrate 19.88 mg, and boric acid.

[0157] 20 ml cultures (2.7 gdcw/L) in corn test medium were transferred to flasks that contained 0.0%, 0.5%, 1.0%, 1.5%, 1.75% or 2.0% isobutanol. Time of addition of cells was referred to as 0 hours. At various times, samples (1.5 ml) were withdrawn and centrifuged at 10,000 rpm for 2 minutes in a microfuge (Sorvall Biofuge pico). The supernatant was filtered through a 0.2 .mu.m filter (Pall Life Sciences, Port Washington, N.Y.) Pall GHP Acrodisc 13 mm Syringe Filter with 0.2 .mu.m GHP Membrane), the filtrate was diluted ten-fold, and the glucose concentration was determined using a YSI Glucose Analyzer (YSI 2700 Select; YSI, Inc., Yellow Springs, Ohio). The sugar consumption rate that was measured from 0 h to 6 h is shown in Table 7.

TABLE-US-00008 TABLE 7 Glucose utilization rate g/gdcw/h Isobutanol (g/L) 0.0 5.0 10.0 15.0 17.5 20.0 30 C. 2.4 2.4 2.4 1.4 0.9 0.7

[0158] I. orientalis strains have a sugar consumption rate of 0.7 g/gdw/h in the presence of 20 g/l of isobutanol.

[0159] The foregoing description of the invention has been presented for purposes of illustration and description. Furthermore, the description is not intended to limit the invention to the form disclosed herein.

[0160] All of the various aspects, embodiments and options described herein can be combined in any and all variations.

[0161] All publications, patents and patent applications mentioned in this specification are herein incorporated by reference to the same extent as if each individual publication, patent or patent application was specifically and individually indicated to be incorporated by reference.

Sequence CWU 1

1

7911680DNAK. pneumoniae 1atggacaaac agtatccggt acgccagtgg gcgcacggcg ccgatctcgt cgtcagtcag 60ctggaagctc agggagtacg ccaggtgttc ggcatccccg gcgccaaaat cgacaaggtc 120tttgattcac tgctggattc ctccattcgc attattccgg tacgccacga agccaacgcc 180gcatttatgg ccgccgccgt cggacgcatt accggcaaag cgggcgtggc gctggtcacc 240tccggtccgg gctgttccaa cctgatcacc ggcatggcca ccgcgaacag cgaaggcgac 300ccggtggtgg ccctgggcgg cgcggtaaaa cgcgccgata aagcgaagca ggtccaccag 360agtatggata cggtggcgat gttcagcccg gtcaccaaat acgccatcga ggtgacggcg 420ccggatgcgc tggcggaagt ggtctccaac gccttccgcg ccgccgagca gggccggccg 480ggcagcgcgt tcgttagcct gccgcaggat gtggtcgatg gcccggtcag cggcaaagtg 540ctgccggcca gcggggcccc gcagatgggc gccgcgccgg atgatgccat cgaccaggtg 600gcgaagctta tcgcccaggc gaagaacccg atcttcctgc tcggcctgat ggccagccag 660ccggaaaaca gcaaggcgct gcgccgtttg ctggagacca gccatattcc agtcaccagc 720acctatcagg ccgccggagc ggtgaatcag gataacttct ctcgcttcgc cggccgggtt 780gggctgttta acaaccaggc cggggaccgt ctgctgcagc tcgccgacct ggtgatctgc 840atcggctaca gcccggtgga atacgaaccg gcgatgtgga acagcggcaa cgcgacgctg 900gtgcacatcg acgtgctgcc cgcctatgaa gagcgcaact acaccccgga tgtcgagctg 960gtgggcgata tcgccggcac tctcaacaag ctggcgcaaa atatcgatca tcggctggtg 1020ctctccccgc aggcggcgga gatcctccgc gaccgccagc accagcgcga gctgctggac 1080cgccgcggcg cgcagctcaa ccagtttgcc ctgcatcccc tgcgcatcgt tcgcgccatg 1140caggatatcg tcaacagcga cgtcacgttg accgtggaca tgggcagctt ccatatctgg 1200attgcccgct acctgtacac gttccgcgcc cgtcaggtga tgatctccaa cggccagcag 1260accatgggcg tcgccctgcc ctgggctatc ggcgcctggc tggtcaatcc tgagcgcaaa 1320gtggtctccg tctccggcga cggcggcttc ctgcagtcga gcatggagct ggagaccgcc 1380gtccgcctga aagccaacgt gctgcatctt atctgggtcg ataacggcta caacatggtc 1440gctatccagg aagagaaaaa atatcagcgc ctgtccggcg tcgagtttgg gccgatggat 1500tttaaagcct atgccgaatc cttcggcgcg aaagggtttg ccgtggaaag cgccgaggcg 1560ctggagccga ccctgcgcgc ggcgatggac gtcgacggcc cggcggtagt ggccatcccg 1620gtggattatc gcgataaccc gctgctgatg ggccagctgc atctgagtca gattctgtaa 16802559PRTK. pneumoniae 2Met Asp Lys Gln Tyr Pro Val Arg Gln Trp Ala His Gly Ala Asp Leu1 5 10 15Val Val Ser Gln Leu Glu Ala Gln Gly Val Arg Gln Val Phe Gly Ile 20 25 30Pro Gly Ala Lys Ile Asp Lys Val Phe Asp Ser Leu Leu Asp Ser Ser 35 40 45Ile Arg Ile Ile Pro Val Arg His Glu Ala Asn Ala Ala Phe Met Ala 50 55 60Ala Ala Val Gly Arg Ile Thr Gly Lys Ala Gly Val Ala Leu Val Thr65 70 75 80Ser Gly Pro Gly Cys Ser Asn Leu Ile Thr Gly Met Ala Thr Ala Asn 85 90 95Ser Glu Gly Asp Pro Val Val Ala Leu Gly Gly Ala Val Lys Arg Ala 100 105 110Asp Lys Ala Lys Gln Val His Gln Ser Met Asp Thr Val Ala Met Phe 115 120 125Ser Pro Val Thr Lys Tyr Ala Ile Glu Val Thr Ala Pro Asp Ala Leu 130 135 140Ala Glu Val Val Ser Asn Ala Phe Arg Ala Ala Glu Gln Gly Arg Pro145 150 155 160Gly Ser Ala Phe Val Ser Leu Pro Gln Asp Val Val Asp Gly Pro Val 165 170 175Ser Gly Lys Val Leu Pro Ala Ser Gly Ala Pro Gln Met Gly Ala Ala 180 185 190Pro Asp Asp Ala Ile Asp Gln Val Ala Lys Leu Ile Ala Gln Ala Lys 195 200 205Asn Pro Ile Phe Leu Leu Gly Leu Met Ala Ser Gln Pro Glu Asn Ser 210 215 220Lys Ala Leu Arg Arg Leu Leu Glu Thr Ser His Ile Pro Val Thr Ser225 230 235 240Thr Tyr Gln Ala Ala Gly Ala Val Asn Gln Asp Asn Phe Ser Arg Phe 245 250 255Ala Gly Arg Val Gly Leu Phe Asn Asn Gln Ala Gly Asp Arg Leu Leu 260 265 270Gln Leu Ala Asp Leu Val Ile Cys Ile Gly Tyr Ser Pro Val Glu Tyr 275 280 285Glu Pro Ala Met Trp Asn Ser Gly Asn Ala Thr Leu Val His Ile Asp 290 295 300Val Leu Pro Ala Tyr Glu Glu Arg Asn Tyr Thr Pro Asp Val Glu Leu305 310 315 320Val Gly Asp Ile Ala Gly Thr Leu Asn Lys Leu Ala Gln Asn Ile Asp 325 330 335His Arg Leu Val Leu Ser Pro Gln Ala Ala Glu Ile Leu Arg Asp Arg 340 345 350Gln His Gln Arg Glu Leu Leu Asp Arg Arg Gly Ala Gln Leu Asn Gln 355 360 365Phe Ala Leu His Pro Leu Arg Ile Val Arg Ala Met Gln Asp Ile Val 370 375 380Asn Ser Asp Val Thr Leu Thr Val Asp Met Gly Ser Phe His Ile Trp385 390 395 400Ile Ala Arg Tyr Leu Tyr Thr Phe Arg Ala Arg Gln Val Met Ile Ser 405 410 415Asn Gly Gln Gln Thr Met Gly Val Ala Leu Pro Trp Ala Ile Gly Ala 420 425 430Trp Leu Val Asn Pro Glu Arg Lys Val Val Ser Val Ser Gly Asp Gly 435 440 445Gly Phe Leu Gln Ser Ser Met Glu Leu Glu Thr Ala Val Arg Leu Lys 450 455 460Ala Asn Val Leu His Leu Ile Trp Val Asp Asn Gly Tyr Asn Met Val465 470 475 480Ala Ile Gln Glu Glu Lys Lys Tyr Gln Arg Leu Ser Gly Val Glu Phe 485 490 495Gly Pro Met Asp Phe Lys Ala Tyr Ala Glu Ser Phe Gly Ala Lys Gly 500 505 510Phe Ala Val Glu Ser Ala Glu Ala Leu Glu Pro Thr Leu Arg Ala Ala 515 520 525Met Asp Val Asp Gly Pro Ala Val Val Ala Ile Pro Val Asp Tyr Arg 530 535 540Asp Asn Pro Leu Leu Met Gly Gln Leu His Leu Ser Gln Ile Leu545 550 55531716DNABacillus subtilis 3atgttgacaa aagcaacaaa agaacaaaaa tcccttgtga aaaacagagg ggcggagctt 60gttgttgatt gcttagtgga gcaaggtgtc acacatgtat ttggcattcc aggtgcaaaa 120attgatgcgg tatttgacgc tttacaagat aaaggacctg aaattatcgt tgcccggcac 180gaacaaaacg cagcattcat ggcccaagca gtcggccgtt taactggaaa accgggagtc 240gtgttagtca catcaggacc gggtgcctct aacttggcaa caggcctgct gacagcgaac 300actgaaggag accctgtcgt tgcgcttgct ggaaacgtga tccgtgcaga tcgtttaaaa 360cggacacatc aatctttgga taatgcggcg ctattccagc cgattacaaa atacagtgta 420gaagttcaag atgtaaaaaa tataccggaa gctgttacaa atgcatttag gatagcgtca 480gcagggcagg ctggggccgc ttttgtgagc tttccgcaag atgttgtgaa tgaagtcaca 540aatacgaaaa acgtgcgtgc tgttgcagcg ccaaaactcg gtcctgcagc agatgatgca 600atcagtgcgg ccatagcaaa aatccaaaca gcaaaacttc ctgtcgtttt ggtcggcatg 660aaaggcggaa gaccggaagc aattaaagcg gttcgcaagc ttttgaaaaa ggttcagctt 720ccatttgttg aaacatatca agctgccggt accctttcta gagatttaga ggatcaatat 780tttggccgta tcggtttgtt ccgcaaccag cctggcgatt tactgctaga gcaggcagat 840gttgttctga cgatcggcta tgacccgatt gaatatgatc cgaaattctg gaatatcaat 900ggagaccgga caattatcca tttagacgag attatcgctg acattgatca tgcttaccag 960cctgatcttg aattgatcgg tgacattccg tccacgatca atcatatcga acacgatgct 1020gtgaaagtgg aatttgcaga gcgtgagcag aaaatccttt ctgatttaaa acaatatatg 1080catgaaggtg agcaggtgcc tgcagattgg aaatcagaca gagcgcaccc tcttgaaatc 1140gttaaagagt tgcgtaatgc agtcgatgat catgttacag taacttgcga tatcggttcg 1200cacgccattt ggatgtcacg ttatttccgc agctacgagc cgttaacatt aatgatcagt 1260aacggtatgc aaacactcgg cgttgcgctt ccttgggcaa tcggcgcttc attggtgaaa 1320ccgggagaaa aagtggtttc tgtctctggt gacggcggtt tcttattctc agcaatggaa 1380ttagagacag cagttcgact aaaagcacca attgtacaca ttgtatggaa cgacagcaca 1440tatgacatgg ttgcattcca gcaattgaaa aaatataacc gtacatctgc ggtcgatttc 1500ggaaatatcg atatcgtgaa atatgcggaa agcttcggag caactggctt gcgcgtagaa 1560tcaccagacc agctggcaga tgttctgcgt caaggcatga acgctgaagg tcctgtcatc 1620atcgatgtcc cggttgacta cagtgataac attaatttag caagtgacaa gcttccgaaa 1680gaattcgggg aactcatgaa aacgaaagct ctctag 17164571PRTBacillus subtilis 4Met Leu Thr Lys Ala Thr Lys Glu Gln Lys Ser Leu Val Lys Asn Arg1 5 10 15Gly Ala Glu Leu Val Val Asp Cys Leu Val Glu Gln Gly Val Thr His 20 25 30Val Phe Gly Ile Pro Gly Ala Lys Ile Asp Ala Val Phe Asp Ala Leu 35 40 45Gln Asp Lys Gly Pro Glu Ile Ile Val Ala Arg His Glu Gln Asn Ala 50 55 60Ala Phe Met Ala Gln Ala Val Gly Arg Leu Thr Gly Lys Pro Gly Val65 70 75 80Val Leu Val Thr Ser Gly Pro Gly Ala Ser Asn Leu Ala Thr Gly Leu 85 90 95Leu Thr Ala Asn Thr Glu Gly Asp Pro Val Val Ala Leu Ala Gly Asn 100 105 110Val Ile Arg Ala Asp Arg Leu Lys Arg Thr His Gln Ser Leu Asp Asn 115 120 125Ala Ala Leu Phe Gln Pro Ile Thr Lys Tyr Ser Val Glu Val Gln Asp 130 135 140Val Lys Asn Ile Pro Glu Ala Val Thr Asn Ala Phe Arg Ile Ala Ser145 150 155 160Ala Gly Gln Ala Gly Ala Ala Phe Val Ser Phe Pro Gln Asp Val Val 165 170 175Asn Glu Val Thr Asn Thr Lys Asn Val Arg Ala Val Ala Ala Pro Lys 180 185 190Leu Gly Pro Ala Ala Asp Asp Ala Ile Ser Ala Ala Ile Ala Lys Ile 195 200 205Gln Thr Ala Lys Leu Pro Val Val Leu Val Gly Met Lys Gly Gly Arg 210 215 220Pro Glu Ala Ile Lys Ala Val Arg Lys Leu Leu Lys Lys Val Gln Leu225 230 235 240Pro Phe Val Glu Thr Tyr Gln Ala Ala Gly Thr Leu Ser Arg Asp Leu 245 250 255Glu Asp Gln Tyr Phe Gly Arg Ile Gly Leu Phe Arg Asn Gln Pro Gly 260 265 270Asp Leu Leu Leu Glu Gln Ala Asp Val Val Leu Thr Ile Gly Tyr Asp 275 280 285Pro Ile Glu Tyr Asp Pro Lys Phe Trp Asn Ile Asn Gly Asp Arg Thr 290 295 300Ile Ile His Leu Asp Glu Ile Ile Ala Asp Ile Asp His Ala Tyr Gln305 310 315 320Pro Asp Leu Glu Leu Ile Gly Asp Ile Pro Ser Thr Ile Asn His Ile 325 330 335Glu His Asp Ala Val Lys Val Glu Phe Ala Glu Arg Glu Gln Lys Ile 340 345 350Leu Ser Asp Leu Lys Gln Tyr Met His Glu Gly Glu Gln Val Pro Ala 355 360 365Asp Trp Lys Ser Asp Arg Ala His Pro Leu Glu Ile Val Lys Glu Leu 370 375 380Arg Asn Ala Val Asp Asp His Val Thr Val Thr Cys Asp Ile Gly Ser385 390 395 400His Ala Ile Trp Met Ser Arg Tyr Phe Arg Ser Tyr Glu Pro Leu Thr 405 410 415Leu Met Ile Ser Asn Gly Met Gln Thr Leu Gly Val Ala Leu Pro Trp 420 425 430Ala Ile Gly Ala Ser Leu Val Lys Pro Gly Glu Lys Val Val Ser Val 435 440 445Ser Gly Asp Gly Gly Phe Leu Phe Ser Ala Met Glu Leu Glu Thr Ala 450 455 460Val Arg Leu Lys Ala Pro Ile Val His Ile Val Trp Asn Asp Ser Thr465 470 475 480Tyr Asp Met Val Ala Phe Gln Gln Leu Lys Lys Tyr Asn Arg Thr Ser 485 490 495Ala Val Asp Phe Gly Asn Ile Asp Ile Val Lys Tyr Ala Glu Ser Phe 500 505 510Gly Ala Thr Gly Leu Arg Val Glu Ser Pro Asp Gln Leu Ala Asp Val 515 520 525Leu Arg Gln Gly Met Asn Ala Glu Gly Pro Val Ile Ile Asp Val Pro 530 535 540Val Asp Tyr Ser Asp Asn Ile Asn Leu Ala Ser Asp Lys Leu Pro Lys545 550 555 560Glu Phe Gly Glu Leu Met Lys Thr Lys Ala Leu 565 57051665DNALactococcus lactis 5atgtctgaga aacaatttgg ggcgaacttg gttgtcgata gtttgattaa ccataaagtg 60aagtatgtat ttgggattcc aggagcaaaa attgaccggg tttttgattt attagaaaat 120gaagaaggcc ctcaaatggt cgtgactcgt catgagcaag gagctgcttt catggctcaa 180gctgtcggtc gtttaactgg cgaacctggt gtagtagttg ttacgagtgg gcctggtgta 240tcaaaccttg cgactccgct tttgaccgcg acatcagaag gtgatgctat tttggctatc 300ggtggacaag ttaaacgaag tgaccgtctt aaacgtgcgc accaatcaat ggataatgct 360ggaatgatgc aatcagcaac aaaatattca gcagaagttc ttgaccctaa tacactttct 420gaatcaattg ccaacgctta tcgtattgca aaatcaggac atccaggtgc aactttctta 480tcaatccccc aagatgtaac ggatgccgaa gtatcaatca aagccattca accactttca 540gaccctaaaa tggggaatgc ctctattgat gacattaatt atttagcaca agcaattaaa 600aatgctgtat tgccagtaat tttggttgga gctggtgctt cagatgctaa agtcgcttca 660tccttgcgta atctattgac tcatgttaat attcctgtcg ttgaaacatt ccaaggtgca 720ggggttattt cacatgattt agaacatact ttttatggac gtatcggtct tttccgcaat 780caaccaggcg atatgcttct gaaacgttct gaccttgtta ttgctgttgg ttatgaccca 840attgaatatg aagctcgtaa ctggaatgca gaaattgata gtcgaattat cgttattgat 900aatgccattg ctgaaattga tacttactac caaccagagc gtgaattaat tggtgatatc 960gcagcaacat tggataatct tttaccagct gttcgtggct acaaaattcc aaaaggaaca 1020aaagattatc tcgatggcct tcatgaagtt gctgagcaac acgaatttga tactgaaaat 1080actgaagaag gtagaatgca ccctcttgat ttggtcagca ctttccaaga aatcgtcaag 1140gatgatgaaa cagtaaccgt tgacgtaggt tcactctaca tttggatggc acgtcatttc 1200aaatcatacg aaccacgtca tctcctcttc tcaaacggaa tgcaaacact cggagttgca 1260cttccttggg caattacagc cgcattgttg cgcccaggta aaaaagttta ttcacactct 1320ggtgatggag gcttcctttt cacagggcaa gaattggaaa cagctgtacg tttgaatctt 1380ccaatcgttc aaattatctg gaatgacggc cattatgata tggttaaatt ccaagaagaa 1440atgaaatatg gtcgttcagc agccgttgat tttggctatg ttgattacgt aaaatatgct 1500gaagcaatga gagcaaaagg ttaccgtgca cacagcaaag aagaacttgc tgaaattctc 1560aaatcaatcc cagatactac tggaccggtg gtaattgacg ttcctttgga ctattctgat 1620aacattaaat tagcagaaaa attattgcct gaagagtttt attga 16656554PRTLactococcus lactis 6Met Ser Glu Lys Gln Phe Gly Ala Asn Leu Val Val Asp Ser Leu Ile1 5 10 15Asn His Lys Val Lys Tyr Val Phe Gly Ile Pro Gly Ala Lys Ile Asp 20 25 30Arg Val Phe Asp Leu Leu Glu Asn Glu Glu Gly Pro Gln Met Val Val 35 40 45Thr Arg His Glu Gln Gly Ala Ala Phe Met Ala Gln Ala Val Gly Arg 50 55 60Leu Thr Gly Glu Pro Gly Val Val Val Val Thr Ser Gly Pro Gly Val65 70 75 80Ser Asn Leu Ala Thr Pro Leu Leu Thr Ala Thr Ser Glu Gly Asp Ala 85 90 95Ile Leu Ala Ile Gly Gly Gln Val Lys Arg Ser Asp Arg Leu Lys Arg 100 105 110Ala His Gln Ser Met Asp Asn Ala Gly Met Met Gln Ser Ala Thr Lys 115 120 125Tyr Ser Ala Glu Val Leu Asp Pro Asn Thr Leu Ser Glu Ser Ile Ala 130 135 140Asn Ala Tyr Arg Ile Ala Lys Ser Gly His Pro Gly Ala Thr Phe Leu145 150 155 160Ser Ile Pro Gln Asp Val Thr Asp Ala Glu Val Ser Ile Lys Ala Ile 165 170 175Gln Pro Leu Ser Asp Pro Lys Met Gly Asn Ala Ser Ile Asp Asp Ile 180 185 190Asn Tyr Leu Ala Gln Ala Ile Lys Asn Ala Val Leu Pro Val Ile Leu 195 200 205Val Gly Ala Gly Ala Ser Asp Ala Lys Val Ala Ser Ser Leu Arg Asn 210 215 220Leu Leu Thr His Val Asn Ile Pro Val Val Glu Thr Phe Gln Gly Ala225 230 235 240Gly Val Ile Ser His Asp Leu Glu His Thr Phe Tyr Gly Arg Ile Gly 245 250 255Leu Phe Arg Asn Gln Pro Gly Asp Met Leu Leu Lys Arg Ser Asp Leu 260 265 270Val Ile Ala Val Gly Tyr Asp Pro Ile Glu Tyr Glu Ala Arg Asn Trp 275 280 285Asn Ala Glu Ile Asp Ser Arg Ile Ile Val Ile Asp Asn Ala Ile Ala 290 295 300Glu Ile Asp Thr Tyr Tyr Gln Pro Glu Arg Glu Leu Ile Gly Asp Ile305 310 315 320Ala Ala Thr Leu Asp Asn Leu Leu Pro Ala Val Arg Gly Tyr Lys Ile 325 330 335Pro Lys Gly Thr Lys Asp Tyr Leu Asp Gly Leu His Glu Val Ala Glu 340 345 350Gln His Glu Phe Asp Thr Glu Asn Thr Glu Glu Gly Arg Met His Pro 355 360 365Leu Asp Leu Val Ser Thr Phe Gln Glu Ile Val Lys Asp Asp Glu Thr 370 375 380Val Thr Val Asp Val Gly Ser Leu Tyr Ile Trp Met Ala Arg His Phe385 390 395 400Lys Ser Tyr Glu Pro Arg His Leu Leu Phe Ser Asn Gly Met Gln Thr 405 410 415Leu Gly Val Ala Leu Pro Trp Ala Ile Thr Ala Ala Leu Leu Arg Pro 420 425 430Gly Lys Lys Val Tyr Ser His Ser Gly Asp Gly Gly Phe Leu Phe Thr 435 440 445Gly Gln Glu Leu Glu Thr Ala Val Arg Leu Asn Leu Pro Ile Val Gln 450 455 460Ile Ile Trp Asn Asp Gly His Tyr Asp Met Val Lys Phe Gln Glu Glu465 470 475 480Met Lys Tyr Gly Arg Ser Ala Ala Val Asp Phe Gly Tyr Val Asp Tyr

485 490 495Val Lys Tyr Ala Glu Ala Met Arg Ala Lys Gly Tyr Arg Ala His Ser 500 505 510Lys Glu Glu Leu Ala Glu Ile Leu Lys Ser Ile Pro Asp Thr Thr Gly 515 520 525Pro Val Val Ile Asp Val Pro Leu Asp Tyr Ser Asp Asn Ile Lys Leu 530 535 540Ala Glu Lys Leu Leu Pro Glu Glu Phe Tyr545 55071665DNAStaphylococcus aureusCDS(1)..(1665) 7atg act gat aaa aag tac act gca gcc gat atg gtt att gat act ttg 48Met Thr Asp Lys Lys Tyr Thr Ala Ala Asp Met Val Ile Asp Thr Leu1 5 10 15aaa aat aat ggg gta gaa tat gtt ttt ggt att ccg ggt gca aag ata 96Lys Asn Asn Gly Val Glu Tyr Val Phe Gly Ile Pro Gly Ala Lys Ile 20 25 30gac tat cta ttt aat gct tta att gat gat ggt cct gaa ctt att gtc 144Asp Tyr Leu Phe Asn Ala Leu Ile Asp Asp Gly Pro Glu Leu Ile Val 35 40 45act cgt cat gaa caa aat gct gca atg atg gca caa ggt att gga aga 192Thr Arg His Glu Gln Asn Ala Ala Met Met Ala Gln Gly Ile Gly Arg 50 55 60tta aca ggt aaa ccg ggt gta gta ctt gtt aca agt ggc cct ggt gta 240Leu Thr Gly Lys Pro Gly Val Val Leu Val Thr Ser Gly Pro Gly Val65 70 75 80agt aat tta acg act gga cta tta aca gct aca tct gaa ggg gat cct 288Ser Asn Leu Thr Thr Gly Leu Leu Thr Ala Thr Ser Glu Gly Asp Pro 85 90 95gta tta gcg tta ggt ggc caa gtg aaa cgt aat gat tta tta cga tta 336Val Leu Ala Leu Gly Gly Gln Val Lys Arg Asn Asp Leu Leu Arg Leu 100 105 110acg cat caa agt att gat aat gct gcg cta tta aaa tat tca tca aaa 384Thr His Gln Ser Ile Asp Asn Ala Ala Leu Leu Lys Tyr Ser Ser Lys 115 120 125tac agt gaa gaa gta caa gat cct gaa tca tta tca gaa gtt atg aca 432Tyr Ser Glu Glu Val Gln Asp Pro Glu Ser Leu Ser Glu Val Met Thr 130 135 140aat gca att cga att gct act tca gga aaa aat ggc gca agt ttt att 480Asn Ala Ile Arg Ile Ala Thr Ser Gly Lys Asn Gly Ala Ser Phe Ile145 150 155 160agt att ccg caa gac gtt att tct tca cca gtt gaa tct aaa gct ata 528Ser Ile Pro Gln Asp Val Ile Ser Ser Pro Val Glu Ser Lys Ala Ile 165 170 175tca ctt tgc caa aaa cca aat tta gga gta ccg agt gaa caa gat att 576Ser Leu Cys Gln Lys Pro Asn Leu Gly Val Pro Ser Glu Gln Asp Ile 180 185 190aat gat gtc att gaa gcg att aaa aat gca tca ttt cct gtt tta tta 624Asn Asp Val Ile Glu Ala Ile Lys Asn Ala Ser Phe Pro Val Leu Leu 195 200 205gct ggt atg aga agt tca agt gca gaa gaa aca aat gcc att cgc aaa 672Ala Gly Met Arg Ser Ser Ser Ala Glu Glu Thr Asn Ala Ile Arg Lys 210 215 220tta gtt gag cgc acg aat tta cca gtt gta gaa aca ttc caa ggt gca 720Leu Val Glu Arg Thr Asn Leu Pro Val Val Glu Thr Phe Gln Gly Ala225 230 235 240ggt gta att agt cgt gaa tta gaa aat cat ttc ttc ggt cgt gtg ggc 768Gly Val Ile Ser Arg Glu Leu Glu Asn His Phe Phe Gly Arg Val Gly 245 250 255tta ttc cgc aat caa gtt ggt gat gaa tta tta cgt aaa agt gat tta 816Leu Phe Arg Asn Gln Val Gly Asp Glu Leu Leu Arg Lys Ser Asp Leu 260 265 270gtt gtt aca atc ggt tat gat cca att gaa tac gaa gct agt aac tgg 864Val Val Thr Ile Gly Tyr Asp Pro Ile Glu Tyr Glu Ala Ser Asn Trp 275 280 285aat aaa gaa tta gaa aca caa att atc aat att gac gaa gtt caa gct 912Asn Lys Glu Leu Glu Thr Gln Ile Ile Asn Ile Asp Glu Val Gln Ala 290 295 300gaa att act aat tat atg caa ccg aaa aaa gag ttg att ggt aat att 960Glu Ile Thr Asn Tyr Met Gln Pro Lys Lys Glu Leu Ile Gly Asn Ile305 310 315 320gct aaa acg att gaa atg att tct gaa aaa gtg gat gag cca ttt ata 1008Ala Lys Thr Ile Glu Met Ile Ser Glu Lys Val Asp Glu Pro Phe Ile 325 330 335aat caa caa cat tta gac gaa tta gaa caa tta aga aca cat att gat 1056Asn Gln Gln His Leu Asp Glu Leu Glu Gln Leu Arg Thr His Ile Asp 340 345 350gaa gaa act ggt att aaa gcg acg cat gaa gaa gga att cta cat cca 1104Glu Glu Thr Gly Ile Lys Ala Thr His Glu Glu Gly Ile Leu His Pro 355 360 365gtg gaa att att gaa tct atg caa aag gta tta act gat gat act act 1152Val Glu Ile Ile Glu Ser Met Gln Lys Val Leu Thr Asp Asp Thr Thr 370 375 380gta aca gtt gat gtt gga agt cac tat att tgg atg gca cgt aat ttc 1200Val Thr Val Asp Val Gly Ser His Tyr Ile Trp Met Ala Arg Asn Phe385 390 395 400aga agt tac aat cca aga cat tta tta ttt agc aat ggt atg caa acg 1248Arg Ser Tyr Asn Pro Arg His Leu Leu Phe Ser Asn Gly Met Gln Thr 405 410 415ctt ggt gta gca tta ccg tgg gca att tca gct gca ctt gtg cgc cct 1296Leu Gly Val Ala Leu Pro Trp Ala Ile Ser Ala Ala Leu Val Arg Pro 420 425 430aat acg caa gtt gtg tcc gtt gct ggc gat ggt ggc ttt tta ttt tca 1344Asn Thr Gln Val Val Ser Val Ala Gly Asp Gly Gly Phe Leu Phe Ser 435 440 445tca caa gat tta gaa acg gcc gta cgt aaa aat tta aat atc atc cag 1392Ser Gln Asp Leu Glu Thr Ala Val Arg Lys Asn Leu Asn Ile Ile Gln 450 455 460ctt att tgg aat gat gga aaa tat aac atg gtt gaa ttc caa gaa gaa 1440Leu Ile Trp Asn Asp Gly Lys Tyr Asn Met Val Glu Phe Gln Glu Glu465 470 475 480atg aaa tat aaa cgt tcg tca ggt gta gac ttc ggt cct gta gat ttt 1488Met Lys Tyr Lys Arg Ser Ser Gly Val Asp Phe Gly Pro Val Asp Phe 485 490 495gta aaa tat gca gaa tca ttt ggc gcg aaa ggt tta cga gtt act aat 1536Val Lys Tyr Ala Glu Ser Phe Gly Ala Lys Gly Leu Arg Val Thr Asn 500 505 510caa gaa gaa tta gaa gcg gca att aaa gag ggc tat gaa aca gat ggt 1584Gln Glu Glu Leu Glu Ala Ala Ile Lys Glu Gly Tyr Glu Thr Asp Gly 515 520 525cca gta tta att gat ata cct gta aat tac aaa gat aat atc aaa ctt 1632Pro Val Leu Ile Asp Ile Pro Val Asn Tyr Lys Asp Asn Ile Lys Leu 530 535 540tca aca aat atg tta cct gac gta ttt aac taa 1665Ser Thr Asn Met Leu Pro Asp Val Phe Asn545 5508554PRTStaphylococcus aureus 8Met Thr Asp Lys Lys Tyr Thr Ala Ala Asp Met Val Ile Asp Thr Leu1 5 10 15Lys Asn Asn Gly Val Glu Tyr Val Phe Gly Ile Pro Gly Ala Lys Ile 20 25 30Asp Tyr Leu Phe Asn Ala Leu Ile Asp Asp Gly Pro Glu Leu Ile Val 35 40 45Thr Arg His Glu Gln Asn Ala Ala Met Met Ala Gln Gly Ile Gly Arg 50 55 60Leu Thr Gly Lys Pro Gly Val Val Leu Val Thr Ser Gly Pro Gly Val65 70 75 80Ser Asn Leu Thr Thr Gly Leu Leu Thr Ala Thr Ser Glu Gly Asp Pro 85 90 95Val Leu Ala Leu Gly Gly Gln Val Lys Arg Asn Asp Leu Leu Arg Leu 100 105 110Thr His Gln Ser Ile Asp Asn Ala Ala Leu Leu Lys Tyr Ser Ser Lys 115 120 125Tyr Ser Glu Glu Val Gln Asp Pro Glu Ser Leu Ser Glu Val Met Thr 130 135 140Asn Ala Ile Arg Ile Ala Thr Ser Gly Lys Asn Gly Ala Ser Phe Ile145 150 155 160Ser Ile Pro Gln Asp Val Ile Ser Ser Pro Val Glu Ser Lys Ala Ile 165 170 175Ser Leu Cys Gln Lys Pro Asn Leu Gly Val Pro Ser Glu Gln Asp Ile 180 185 190Asn Asp Val Ile Glu Ala Ile Lys Asn Ala Ser Phe Pro Val Leu Leu 195 200 205Ala Gly Met Arg Ser Ser Ser Ala Glu Glu Thr Asn Ala Ile Arg Lys 210 215 220Leu Val Glu Arg Thr Asn Leu Pro Val Val Glu Thr Phe Gln Gly Ala225 230 235 240Gly Val Ile Ser Arg Glu Leu Glu Asn His Phe Phe Gly Arg Val Gly 245 250 255Leu Phe Arg Asn Gln Val Gly Asp Glu Leu Leu Arg Lys Ser Asp Leu 260 265 270Val Val Thr Ile Gly Tyr Asp Pro Ile Glu Tyr Glu Ala Ser Asn Trp 275 280 285Asn Lys Glu Leu Glu Thr Gln Ile Ile Asn Ile Asp Glu Val Gln Ala 290 295 300Glu Ile Thr Asn Tyr Met Gln Pro Lys Lys Glu Leu Ile Gly Asn Ile305 310 315 320Ala Lys Thr Ile Glu Met Ile Ser Glu Lys Val Asp Glu Pro Phe Ile 325 330 335Asn Gln Gln His Leu Asp Glu Leu Glu Gln Leu Arg Thr His Ile Asp 340 345 350Glu Glu Thr Gly Ile Lys Ala Thr His Glu Glu Gly Ile Leu His Pro 355 360 365Val Glu Ile Ile Glu Ser Met Gln Lys Val Leu Thr Asp Asp Thr Thr 370 375 380Val Thr Val Asp Val Gly Ser His Tyr Ile Trp Met Ala Arg Asn Phe385 390 395 400Arg Ser Tyr Asn Pro Arg His Leu Leu Phe Ser Asn Gly Met Gln Thr 405 410 415Leu Gly Val Ala Leu Pro Trp Ala Ile Ser Ala Ala Leu Val Arg Pro 420 425 430Asn Thr Gln Val Val Ser Val Ala Gly Asp Gly Gly Phe Leu Phe Ser 435 440 445Ser Gln Asp Leu Glu Thr Ala Val Arg Lys Asn Leu Asn Ile Ile Gln 450 455 460Leu Ile Trp Asn Asp Gly Lys Tyr Asn Met Val Glu Phe Gln Glu Glu465 470 475 480Met Lys Tyr Lys Arg Ser Ser Gly Val Asp Phe Gly Pro Val Asp Phe 485 490 495Val Lys Tyr Ala Glu Ser Phe Gly Ala Lys Gly Leu Arg Val Thr Asn 500 505 510Gln Glu Glu Leu Glu Ala Ala Ile Lys Glu Gly Tyr Glu Thr Asp Gly 515 520 525Pro Val Leu Ile Asp Ile Pro Val Asn Tyr Lys Asp Asn Ile Lys Leu 530 535 540Ser Thr Asn Met Leu Pro Asp Val Phe Asn545 55091698DNAListeria monocytogenesCDS(1)..(1698) 9atg gcg aaa cta gaa aaa gac caa gaa aaa gta ata aca caa ggg aaa 48Met Ala Lys Leu Glu Lys Asp Gln Glu Lys Val Ile Thr Gln Gly Lys1 5 10 15tca gga gcg gat tta gtt gta gac agc tta att aat caa ggt gtt acg 96Ser Gly Ala Asp Leu Val Val Asp Ser Leu Ile Asn Gln Gly Val Thr 20 25 30cat gta ttc ggg att ccg gga gcg aaa att gat aaa gtt ttt gat gtg 144His Val Phe Gly Ile Pro Gly Ala Lys Ile Asp Lys Val Phe Asp Val 35 40 45atg gaa gaa cgt gga cca gaa tta att gtc agt cgt cat gaa caa aat 192Met Glu Glu Arg Gly Pro Glu Leu Ile Val Ser Arg His Glu Gln Asn 50 55 60gcg gcg ttt atg gct gct gct atc ggt cgt cta acc ggg aaa cct ggt 240Ala Ala Phe Met Ala Ala Ala Ile Gly Arg Leu Thr Gly Lys Pro Gly65 70 75 80gtt gta ctt gta act agt gga cct ggc gca tcg aat ctt gca aca ggg 288Val Val Leu Val Thr Ser Gly Pro Gly Ala Ser Asn Leu Ala Thr Gly 85 90 95ctt gta acc gca act gca gaa gga gat cca gtc gtt gcg att gct ggt 336Leu Val Thr Ala Thr Ala Glu Gly Asp Pro Val Val Ala Ile Ala Gly 100 105 110aac gta aca agg caa gac cgc tta aaa aga acc cac caa tca atg gat 384Asn Val Thr Arg Gln Asp Arg Leu Lys Arg Thr His Gln Ser Met Asp 115 120 125aat gca gca ctt ttc cgt ccg att aca aaa tac agc gaa gaa gta gtt 432Asn Ala Ala Leu Phe Arg Pro Ile Thr Lys Tyr Ser Glu Glu Val Val 130 135 140cac gcc gaa agt att cca gaa gca atc act aac gct ttt cgc tcg gca 480His Ala Glu Ser Ile Pro Glu Ala Ile Thr Asn Ala Phe Arg Ser Ala145 150 155 160aca gaa cca aac caa ggc gct gct ttt gtc agt ttg cca caa gat atc 528Thr Glu Pro Asn Gln Gly Ala Ala Phe Val Ser Leu Pro Gln Asp Ile 165 170 175gtg aac gaa cca aac gta cca gta aaa gcg att cgc cca ctt gct aaa 576Val Asn Glu Pro Asn Val Pro Val Lys Ala Ile Arg Pro Leu Ala Lys 180 185 190cca gaa aat ggt cct gct tcc aaa gaa caa gtt gca aaa ctt gtt aca 624Pro Glu Asn Gly Pro Ala Ser Lys Glu Gln Val Ala Lys Leu Val Thr 195 200 205cgt ttg aaa aaa gcg aaa tta ccg gta ttg cta ttg ggt atg cga gca 672Arg Leu Lys Lys Ala Lys Leu Pro Val Leu Leu Leu Gly Met Arg Ala 210 215 220tct agt cca gaa gta act ggt gca att cgt cgc tta ctc caa aaa aca 720Ser Ser Pro Glu Val Thr Gly Ala Ile Arg Arg Leu Leu Gln Lys Thr225 230 235 240agt atc cca gta gta gaa act ttc caa gca gct ggc gtc att tca cgc 768Ser Ile Pro Val Val Glu Thr Phe Gln Ala Ala Gly Val Ile Ser Arg 245 250 255gac tta gaa gat aac ttc ttt gga cgt gtt ggt ctg ttc cgc aac caa 816Asp Leu Glu Asp Asn Phe Phe Gly Arg Val Gly Leu Phe Arg Asn Gln 260 265 270cca ggg gat att ttg tta aat aaa gct gat tta gtt att aca gtg ggt 864Pro Gly Asp Ile Leu Leu Asn Lys Ala Asp Leu Val Ile Thr Val Gly 275 280 285tat gat cca att gaa tac gat cca aaa gct tgg aat gcc tct ggt gat 912Tyr Asp Pro Ile Glu Tyr Asp Pro Lys Ala Trp Asn Ala Ser Gly Asp 290 295 300aga acg att gtc cat tta gac gac att cgc gct gat att gat cat tat 960Arg Thr Ile Val His Leu Asp Asp Ile Arg Ala Asp Ile Asp His Tyr305 310 315 320tac caa cca gtg aca gag cta gtc gga aac atc gcg ctt act tta gac 1008Tyr Gln Pro Val Thr Glu Leu Val Gly Asn Ile Ala Leu Thr Leu Asp 325 330 335cga gtg aat gcg aaa ttc agc ggt tta gaa tta gcg gaa aaa gaa ctt 1056Arg Val Asn Ala Lys Phe Ser Gly Leu Glu Leu Ala Glu Lys Glu Leu 340 345 350gaa aca tta aaa gaa ctt cat gct caa tta gaa gag cga gat gtt ccg 1104Glu Thr Leu Lys Glu Leu His Ala Gln Leu Glu Glu Arg Asp Val Pro 355 360 365cca gaa agt gat gaa act aac cga gta cat cca ttg tcg gtc att caa 1152Pro Glu Ser Asp Glu Thr Asn Arg Val His Pro Leu Ser Val Ile Gln 370 375 380aca cta cgt tcg gca att gat gac aac gta act gtg aca gtc gac gtt 1200Thr Leu Arg Ser Ala Ile Asp Asp Asn Val Thr Val Thr Val Asp Val385 390 395 400ggt tca cat tat att tgg atg gca cgt cat ttc cgc tcc tat gaa cca 1248Gly Ser His Tyr Ile Trp Met Ala Arg His Phe Arg Ser Tyr Glu Pro 405 410 415cgc cgt ctg ctt ttc agt aac ggt atg caa acg ctt ggt gtt gcg ctt 1296Arg Arg Leu Leu Phe Ser Asn Gly Met Gln Thr Leu Gly Val Ala Leu 420 425 430cct tgg gga att gct gca aca ctt gta cat ccg ggt gaa aaa gtg gtt 1344Pro Trp Gly Ile Ala Ala Thr Leu Val His Pro Gly Glu Lys Val Val 435 440 445tcg att tct ggt gac ggt ggt ttc tta ttt tcc gcg atg gaa tta gaa 1392Ser Ile Ser Gly Asp Gly Gly Phe Leu Phe Ser Ala Met Glu Leu Glu 450 455 460aca gct gtc cgc ttg cgt gcg cca ctt gta cac cta gta tgg aat gac 1440Thr Ala Val Arg Leu Arg Ala Pro Leu Val His Leu Val Trp Asn Asp465 470 475 480gga agc tat gac atg gtt gct ttc caa caa aaa atg aaa tac ggc aaa 1488Gly Ser Tyr Asp Met Val Ala Phe Gln Gln Lys Met Lys Tyr Gly Lys 485 490 495gaa gca gct gtt cgt ttt ggc gat gtt gat atc gta aaa ttt gca gaa 1536Glu Ala Ala Val Arg Phe Gly Asp Val Asp Ile Val Lys Phe Ala Glu 500 505 510agt ttc gga gca aaa ggt ctt cgc gta aca aat cca gca gaa ctt tct 1584Ser Phe Gly Ala Lys Gly Leu Arg Val Thr Asn Pro Ala Glu Leu Ser 515 520 525gat gtg tta aaa gaa gcg ctt gaa aca gaa gga ccc gtc gtt gta gat 1632Asp Val Leu Lys Glu Ala Leu Glu Thr Glu Gly Pro Val Val Val Asp 530 535 540att cca att gat tac cgt gat aac atc aaa ctt ggc gaa act tta cta 1680Ile Pro Ile Asp Tyr Arg Asp Asn Ile Lys Leu Gly Glu Thr Leu Leu545 550 555 560cct gac caa ttt tat taa 1698Pro Asp Gln Phe Tyr 56510565PRTListeria monocytogenes 10Met Ala Lys Leu Glu Lys Asp Gln Glu Lys Val Ile Thr Gln Gly Lys1 5 10 15Ser Gly Ala Asp Leu Val Val Asp Ser Leu Ile Asn Gln Gly Val Thr 20 25 30His Val Phe Gly

Ile Pro Gly Ala Lys Ile Asp Lys Val Phe Asp Val 35 40 45Met Glu Glu Arg Gly Pro Glu Leu Ile Val Ser Arg His Glu Gln Asn 50 55 60Ala Ala Phe Met Ala Ala Ala Ile Gly Arg Leu Thr Gly Lys Pro Gly65 70 75 80Val Val Leu Val Thr Ser Gly Pro Gly Ala Ser Asn Leu Ala Thr Gly 85 90 95Leu Val Thr Ala Thr Ala Glu Gly Asp Pro Val Val Ala Ile Ala Gly 100 105 110Asn Val Thr Arg Gln Asp Arg Leu Lys Arg Thr His Gln Ser Met Asp 115 120 125Asn Ala Ala Leu Phe Arg Pro Ile Thr Lys Tyr Ser Glu Glu Val Val 130 135 140His Ala Glu Ser Ile Pro Glu Ala Ile Thr Asn Ala Phe Arg Ser Ala145 150 155 160Thr Glu Pro Asn Gln Gly Ala Ala Phe Val Ser Leu Pro Gln Asp Ile 165 170 175Val Asn Glu Pro Asn Val Pro Val Lys Ala Ile Arg Pro Leu Ala Lys 180 185 190Pro Glu Asn Gly Pro Ala Ser Lys Glu Gln Val Ala Lys Leu Val Thr 195 200 205Arg Leu Lys Lys Ala Lys Leu Pro Val Leu Leu Leu Gly Met Arg Ala 210 215 220Ser Ser Pro Glu Val Thr Gly Ala Ile Arg Arg Leu Leu Gln Lys Thr225 230 235 240Ser Ile Pro Val Val Glu Thr Phe Gln Ala Ala Gly Val Ile Ser Arg 245 250 255Asp Leu Glu Asp Asn Phe Phe Gly Arg Val Gly Leu Phe Arg Asn Gln 260 265 270Pro Gly Asp Ile Leu Leu Asn Lys Ala Asp Leu Val Ile Thr Val Gly 275 280 285Tyr Asp Pro Ile Glu Tyr Asp Pro Lys Ala Trp Asn Ala Ser Gly Asp 290 295 300Arg Thr Ile Val His Leu Asp Asp Ile Arg Ala Asp Ile Asp His Tyr305 310 315 320Tyr Gln Pro Val Thr Glu Leu Val Gly Asn Ile Ala Leu Thr Leu Asp 325 330 335Arg Val Asn Ala Lys Phe Ser Gly Leu Glu Leu Ala Glu Lys Glu Leu 340 345 350Glu Thr Leu Lys Glu Leu His Ala Gln Leu Glu Glu Arg Asp Val Pro 355 360 365Pro Glu Ser Asp Glu Thr Asn Arg Val His Pro Leu Ser Val Ile Gln 370 375 380Thr Leu Arg Ser Ala Ile Asp Asp Asn Val Thr Val Thr Val Asp Val385 390 395 400Gly Ser His Tyr Ile Trp Met Ala Arg His Phe Arg Ser Tyr Glu Pro 405 410 415Arg Arg Leu Leu Phe Ser Asn Gly Met Gln Thr Leu Gly Val Ala Leu 420 425 430Pro Trp Gly Ile Ala Ala Thr Leu Val His Pro Gly Glu Lys Val Val 435 440 445Ser Ile Ser Gly Asp Gly Gly Phe Leu Phe Ser Ala Met Glu Leu Glu 450 455 460Thr Ala Val Arg Leu Arg Ala Pro Leu Val His Leu Val Trp Asn Asp465 470 475 480Gly Ser Tyr Asp Met Val Ala Phe Gln Gln Lys Met Lys Tyr Gly Lys 485 490 495Glu Ala Ala Val Arg Phe Gly Asp Val Asp Ile Val Lys Phe Ala Glu 500 505 510Ser Phe Gly Ala Lys Gly Leu Arg Val Thr Asn Pro Ala Glu Leu Ser 515 520 525Asp Val Leu Lys Glu Ala Leu Glu Thr Glu Gly Pro Val Val Val Asp 530 535 540Ile Pro Ile Asp Tyr Arg Asp Asn Ile Lys Leu Gly Glu Thr Leu Leu545 550 555 560Pro Asp Gln Phe Tyr 565111680DNAStreptococcus mutansCDS(1)..(1680) 11atg acc gaa ata aat aag gaa ggc tat ggg gct gac ctg att gta gac 48Met Thr Glu Ile Asn Lys Glu Gly Tyr Gly Ala Asp Leu Ile Val Asp1 5 10 15agc ctc att aat cat gat gtc aac tat gtt ttt gga atc cct ggt gca 96Ser Leu Ile Asn His Asp Val Asn Tyr Val Phe Gly Ile Pro Gly Ala 20 25 30aaa att gat cgt gtc ttt gat acc tta gaa gat aag ggg cca gaa ctt 144Lys Ile Asp Arg Val Phe Asp Thr Leu Glu Asp Lys Gly Pro Glu Leu 35 40 45att gta gca cgc cat gag caa aat gct gct ttt atg gct caa gga att 192Ile Val Ala Arg His Glu Gln Asn Ala Ala Phe Met Ala Gln Gly Ile 50 55 60ggc cgt att act ggt gag cct ggt gtt gtg att aca acc agc ggt ccc 240Gly Arg Ile Thr Gly Glu Pro Gly Val Val Ile Thr Thr Ser Gly Pro65 70 75 80ggt gtt tcc aat ctg gtg act ggt ctt gtt act gcg aca gct gag gga 288Gly Val Ser Asn Leu Val Thr Gly Leu Val Thr Ala Thr Ala Glu Gly 85 90 95gat cct gtc ctt gct att ggt ggt cag gtt aaa cgt gct gat ttg ctc 336Asp Pro Val Leu Ala Ile Gly Gly Gln Val Lys Arg Ala Asp Leu Leu 100 105 110aaa cgg gct cac cag tca atg aat aat gtt gct atg ctc gat ccc att 384Lys Arg Ala His Gln Ser Met Asn Asn Val Ala Met Leu Asp Pro Ile 115 120 125acc aaa tat tca gca gaa att cag gat ccc gca aca ctt tca gaa aat 432Thr Lys Tyr Ser Ala Glu Ile Gln Asp Pro Ala Thr Leu Ser Glu Asn 130 135 140att gct aat gcc tat cgt ttg gct aaa gca gga aag ccg gga gct agt 480Ile Ala Asn Ala Tyr Arg Leu Ala Lys Ala Gly Lys Pro Gly Ala Ser145 150 155 160ttc tta tct att cct caa gat ata act gat agt cct gtt act gtc aag 528Phe Leu Ser Ile Pro Gln Asp Ile Thr Asp Ser Pro Val Thr Val Lys 165 170 175gcg att aag ccc ttg aca gat cct aaa cta ggt tca gcg tca gtt gct 576Ala Ile Lys Pro Leu Thr Asp Pro Lys Leu Gly Ser Ala Ser Val Ala 180 185 190gat att aat tat ttg gca cag gcc ata aaa aat gcg gtc ctt cct gtc 624Asp Ile Asn Tyr Leu Ala Gln Ala Ile Lys Asn Ala Val Leu Pro Val 195 200 205tta ctt tta gga aat ggt gcg tca acg gct gca gtt aca gct tct att 672Leu Leu Leu Gly Asn Gly Ala Ser Thr Ala Ala Val Thr Ala Ser Ile 210 215 220cgc cgt ttg tta gga gct gtc aag ctg cca gtc gtt gaa act ttc caa 720Arg Arg Leu Leu Gly Ala Val Lys Leu Pro Val Val Glu Thr Phe Gln225 230 235 240gga gct ggt att gtt tca aga gat tta gaa gag gac act ttt ttt ggt 768Gly Ala Gly Ile Val Ser Arg Asp Leu Glu Glu Asp Thr Phe Phe Gly 245 250 255cgt gtg ggg ctt ttt cgt aat cag ccc gga gat atg ttg ctg aag cgt 816Arg Val Gly Leu Phe Arg Asn Gln Pro Gly Asp Met Leu Leu Lys Arg 260 265 270tct gac tta gtt atc gct att ggc tat gat cct att gaa tat gaa gcg 864Ser Asp Leu Val Ile Ala Ile Gly Tyr Asp Pro Ile Glu Tyr Glu Ala 275 280 285cgc aat tgg aat gct gaa att tcg gct cgc att atc gtt att gat gtt 912Arg Asn Trp Asn Ala Glu Ile Ser Ala Arg Ile Ile Val Ile Asp Val 290 295 300gct cca gct gaa att gat act tat ttc caa cct gaa cgt gaa tta att 960Ala Pro Ala Glu Ile Asp Thr Tyr Phe Gln Pro Glu Arg Glu Leu Ile305 310 315 320ggt gat ata gct gaa aca ctt gat tta ctc cta cct gct att agt ggc 1008Gly Asp Ile Ala Glu Thr Leu Asp Leu Leu Leu Pro Ala Ile Ser Gly 325 330 335tac tca ctt cca aaa ggt tct ctt gac tat ctc aaa ggc ctt cgt gat 1056Tyr Ser Leu Pro Lys Gly Ser Leu Asp Tyr Leu Lys Gly Leu Arg Asp 340 345 350aat gta gta gaa gat gtc aaa ttt gat aag aca gtc aaa tcc ggt ctg 1104Asn Val Val Glu Asp Val Lys Phe Asp Lys Thr Val Lys Ser Gly Leu 355 360 365gtt cat ccg ctt gat gtg att gat gtc ctt caa aag caa acg act gat 1152Val His Pro Leu Asp Val Ile Asp Val Leu Gln Lys Gln Thr Thr Asp 370 375 380gat atg aca gta acg gtt gat gtt ggc agc cat tat att tgg atg gct 1200Asp Met Thr Val Thr Val Asp Val Gly Ser His Tyr Ile Trp Met Ala385 390 395 400cgt tat ttt aaa agc tat gaa gca cgg cac tta ctt ttc tca aat ggt 1248Arg Tyr Phe Lys Ser Tyr Glu Ala Arg His Leu Leu Phe Ser Asn Gly 405 410 415atg caa acc tta ggt gtt gct ttg cct tgg gca att tcg gca gct ctt 1296Met Gln Thr Leu Gly Val Ala Leu Pro Trp Ala Ile Ser Ala Ala Leu 420 425 430gta cgg cca aat gag aag att att tct att tca ggt gat ggt ggt ttc 1344Val Arg Pro Asn Glu Lys Ile Ile Ser Ile Ser Gly Asp Gly Gly Phe 435 440 445ctc ttt tct ggc caa gaa ttg gaa aca gct gtt cgt tta cat tta cca 1392Leu Phe Ser Gly Gln Glu Leu Glu Thr Ala Val Arg Leu His Leu Pro 450 455 460att gtt cat atc att tgg aat gat ggt aaa tat aat atg gtt gaa ttc 1440Ile Val His Ile Ile Trp Asn Asp Gly Lys Tyr Asn Met Val Glu Phe465 470 475 480caa gaa gaa atg aaa tac ggc cgt tca gca ggt gtt gat ttt ggt cct 1488Gln Glu Glu Met Lys Tyr Gly Arg Ser Ala Gly Val Asp Phe Gly Pro 485 490 495gtt gat ttt gtc aag tat gct gat agt ttc ggt gct aaa ggt tac cgt 1536Val Asp Phe Val Lys Tyr Ala Asp Ser Phe Gly Ala Lys Gly Tyr Arg 500 505 510gct gat agt aaa gaa aag ttt gat caa gtt ctt caa aca gca ctc aag 1584Ala Asp Ser Lys Glu Lys Phe Asp Gln Val Leu Gln Thr Ala Leu Lys 515 520 525gaa gct gca aat ggc cca gtt ctc att gat gtt cca atg gac tat aaa 1632Glu Ala Ala Asn Gly Pro Val Leu Ile Asp Val Pro Met Asp Tyr Lys 530 535 540gat aat gta aaa ttg ggt gaa act att ttg cct gat gaa ttc tac taa 1680Asp Asn Val Lys Leu Gly Glu Thr Ile Leu Pro Asp Glu Phe Tyr545 550 55512559PRTStreptococcus mutans 12Met Thr Glu Ile Asn Lys Glu Gly Tyr Gly Ala Asp Leu Ile Val Asp1 5 10 15Ser Leu Ile Asn His Asp Val Asn Tyr Val Phe Gly Ile Pro Gly Ala 20 25 30Lys Ile Asp Arg Val Phe Asp Thr Leu Glu Asp Lys Gly Pro Glu Leu 35 40 45Ile Val Ala Arg His Glu Gln Asn Ala Ala Phe Met Ala Gln Gly Ile 50 55 60Gly Arg Ile Thr Gly Glu Pro Gly Val Val Ile Thr Thr Ser Gly Pro65 70 75 80Gly Val Ser Asn Leu Val Thr Gly Leu Val Thr Ala Thr Ala Glu Gly 85 90 95Asp Pro Val Leu Ala Ile Gly Gly Gln Val Lys Arg Ala Asp Leu Leu 100 105 110Lys Arg Ala His Gln Ser Met Asn Asn Val Ala Met Leu Asp Pro Ile 115 120 125Thr Lys Tyr Ser Ala Glu Ile Gln Asp Pro Ala Thr Leu Ser Glu Asn 130 135 140Ile Ala Asn Ala Tyr Arg Leu Ala Lys Ala Gly Lys Pro Gly Ala Ser145 150 155 160Phe Leu Ser Ile Pro Gln Asp Ile Thr Asp Ser Pro Val Thr Val Lys 165 170 175Ala Ile Lys Pro Leu Thr Asp Pro Lys Leu Gly Ser Ala Ser Val Ala 180 185 190Asp Ile Asn Tyr Leu Ala Gln Ala Ile Lys Asn Ala Val Leu Pro Val 195 200 205Leu Leu Leu Gly Asn Gly Ala Ser Thr Ala Ala Val Thr Ala Ser Ile 210 215 220Arg Arg Leu Leu Gly Ala Val Lys Leu Pro Val Val Glu Thr Phe Gln225 230 235 240Gly Ala Gly Ile Val Ser Arg Asp Leu Glu Glu Asp Thr Phe Phe Gly 245 250 255Arg Val Gly Leu Phe Arg Asn Gln Pro Gly Asp Met Leu Leu Lys Arg 260 265 270Ser Asp Leu Val Ile Ala Ile Gly Tyr Asp Pro Ile Glu Tyr Glu Ala 275 280 285Arg Asn Trp Asn Ala Glu Ile Ser Ala Arg Ile Ile Val Ile Asp Val 290 295 300Ala Pro Ala Glu Ile Asp Thr Tyr Phe Gln Pro Glu Arg Glu Leu Ile305 310 315 320Gly Asp Ile Ala Glu Thr Leu Asp Leu Leu Leu Pro Ala Ile Ser Gly 325 330 335Tyr Ser Leu Pro Lys Gly Ser Leu Asp Tyr Leu Lys Gly Leu Arg Asp 340 345 350Asn Val Val Glu Asp Val Lys Phe Asp Lys Thr Val Lys Ser Gly Leu 355 360 365Val His Pro Leu Asp Val Ile Asp Val Leu Gln Lys Gln Thr Thr Asp 370 375 380Asp Met Thr Val Thr Val Asp Val Gly Ser His Tyr Ile Trp Met Ala385 390 395 400Arg Tyr Phe Lys Ser Tyr Glu Ala Arg His Leu Leu Phe Ser Asn Gly 405 410 415Met Gln Thr Leu Gly Val Ala Leu Pro Trp Ala Ile Ser Ala Ala Leu 420 425 430Val Arg Pro Asn Glu Lys Ile Ile Ser Ile Ser Gly Asp Gly Gly Phe 435 440 445Leu Phe Ser Gly Gln Glu Leu Glu Thr Ala Val Arg Leu His Leu Pro 450 455 460Ile Val His Ile Ile Trp Asn Asp Gly Lys Tyr Asn Met Val Glu Phe465 470 475 480Gln Glu Glu Met Lys Tyr Gly Arg Ser Ala Gly Val Asp Phe Gly Pro 485 490 495Val Asp Phe Val Lys Tyr Ala Asp Ser Phe Gly Ala Lys Gly Tyr Arg 500 505 510Ala Asp Ser Lys Glu Lys Phe Asp Gln Val Leu Gln Thr Ala Leu Lys 515 520 525Glu Ala Ala Asn Gly Pro Val Leu Ile Asp Val Pro Met Asp Tyr Lys 530 535 540Asp Asn Val Lys Leu Gly Glu Thr Ile Leu Pro Asp Glu Phe Tyr545 550 555131683DNAStreptococcus thermophilusCDS(1)..(1683) 13gtg ttc atg tca gaa gaa aag caa ttg tat ggt gca gat tta gtg gtt 48Val Phe Met Ser Glu Glu Lys Gln Leu Tyr Gly Ala Asp Leu Val Val1 5 10 15gat agt ttg atc aac cat gat gtt gag tat gtc ttt ggg att cca ggc 96Asp Ser Leu Ile Asn His Asp Val Glu Tyr Val Phe Gly Ile Pro Gly 20 25 30gca aaa atc gat agg gtt ttt gat acc ttg gaa gat aag gga cct gaa 144Ala Lys Ile Asp Arg Val Phe Asp Thr Leu Glu Asp Lys Gly Pro Glu 35 40 45ttg att gtt gcc cgt cat gag caa aat gct gct ttt atg gct caa ggt 192Leu Ile Val Ala Arg His Glu Gln Asn Ala Ala Phe Met Ala Gln Gly 50 55 60gtt gga cgt att act ggg aaa cca ggt gta gta ttg gta aca tct ggt 240Val Gly Arg Ile Thr Gly Lys Pro Gly Val Val Leu Val Thr Ser Gly65 70 75 80cca ggt gtc tcc aat ttg gct act ggt ttg gta aca gcg acg gat gaa 288Pro Gly Val Ser Asn Leu Ala Thr Gly Leu Val Thr Ala Thr Asp Glu 85 90 95gga gac cct gtt ctt gct att ggt ggt cag gtt aag cgt gca gat ctc 336Gly Asp Pro Val Leu Ala Ile Gly Gly Gln Val Lys Arg Ala Asp Leu 100 105 110ttg aaa cgt gcc cac caa tca atg aat aac gtt gct atg ctt gag cca 384Leu Lys Arg Ala His Gln Ser Met Asn Asn Val Ala Met Leu Glu Pro 115 120 125att acc aaa tat gct gct gaa gta cat gat gct aac acc ctt tct gaa 432Ile Thr Lys Tyr Ala Ala Glu Val His Asp Ala Asn Thr Leu Ser Glu 130 135 140acg gtt gct aat gcc tat cgt cac gct aag tca ggg aaa cca ggt gca 480Thr Val Ala Asn Ala Tyr Arg His Ala Lys Ser Gly Lys Pro Gly Ala145 150 155 160agc ttc att tca att cct caa gac gtg acg gat gct ccg gtc agt gtt 528Ser Phe Ile Ser Ile Pro Gln Asp Val Thr Asp Ala Pro Val Ser Val 165 170 175aag gct att aag cct atg aca gat cca aaa ctt ggt tca gca tct gtt 576Lys Ala Ile Lys Pro Met Thr Asp Pro Lys Leu Gly Ser Ala Ser Val 180 185 190tct gat att aac tat cta gca caa gcc att aaa aat gca gtg ttg cca 624Ser Asp Ile Asn Tyr Leu Ala Gln Ala Ile Lys Asn Ala Val Leu Pro 195 200 205gtc ttt ctt ttg ggg aat ggt gcc tca tca gaa gcc gta act tac tct 672Val Phe Leu Leu Gly Asn Gly Ala Ser Ser Glu Ala Val Thr Tyr Ser 210 215 220att cgc caa att ttg aag cat gtt aaa ttg cca gtt gtt gaa act ttc 720Ile Arg Gln Ile Leu Lys His Val Lys Leu Pro Val Val Glu Thr Phe225 230 235 240caa ggt gcc ggt atc gtg tca cgt gac ctt gaa gaa gat act ttc ttt 768Gln Gly Ala Gly Ile Val Ser Arg Asp Leu Glu Glu Asp Thr Phe Phe 245 250 255ggt cgt gta ggt ctt ttc cgt aac caa ccc gga gac atg ttg ctt aaa 816Gly Arg Val Gly Leu Phe Arg Asn Gln Pro Gly Asp Met Leu Leu Lys 260 265 270aaa tcc gac tta gtt att gcc att ggt tat gat cca atc gaa tat gaa 864Lys Ser Asp Leu Val Ile Ala Ile Gly Tyr Asp Pro Ile Glu Tyr Glu 275 280 285gca cgt aac tgg aat gct gaa att tca gca cgt atc atc gtt att gat 912Ala Arg Asn Trp Asn Ala Glu

Ile Ser Ala Arg Ile Ile Val Ile Asp 290 295 300gtc gag ccg gcc gag gtg gac act tac ttc caa ccg gaa cgt gaa ttg 960Val Glu Pro Ala Glu Val Asp Thr Tyr Phe Gln Pro Glu Arg Glu Leu305 310 315 320att ggt aat gta gaa gcg agc tta gac ttg ctt ttg ccc gct att caa 1008Ile Gly Asn Val Glu Ala Ser Leu Asp Leu Leu Leu Pro Ala Ile Gln 325 330 335ggt tat aaa ttg cct gaa ggt gcg gtt gaa tat ctt aaa ggt ttg aaa 1056Gly Tyr Lys Leu Pro Glu Gly Ala Val Glu Tyr Leu Lys Gly Leu Lys 340 345 350aac aat gtt gtt gag gat gtt aag ttt gac cgt cag cct gat gaa ggt 1104Asn Asn Val Val Glu Asp Val Lys Phe Asp Arg Gln Pro Asp Glu Gly 355 360 365acg gtg cat ccg cta gat ttc atc gaa aat ttg caa gaa cac aca gat 1152Thr Val His Pro Leu Asp Phe Ile Glu Asn Leu Gln Glu His Thr Asp 370 375 380gat gat atg act gtt acg ttt gat gtt ggt agt cac tat att tgg atg 1200Asp Asp Met Thr Val Thr Phe Asp Val Gly Ser His Tyr Ile Trp Met385 390 395 400gca cgt tat ctc aaa tcg tat gaa cca cgt cat ttg ctt ttc tca aat 1248Ala Arg Tyr Leu Lys Ser Tyr Glu Pro Arg His Leu Leu Phe Ser Asn 405 410 415ggg atg caa acg ata ggt att gct att aca tgg gct atc tct gca gca 1296Gly Met Gln Thr Ile Gly Ile Ala Ile Thr Trp Ala Ile Ser Ala Ala 420 425 430ttg gtt cgt cct aag aca aaa gtg att tct gta tct ggt gat ggt ggt 1344Leu Val Arg Pro Lys Thr Lys Val Ile Ser Val Ser Gly Asp Gly Gly 435 440 445ttc ctc ttc tca gca caa gaa ttg gaa aca gca gtt cgt ttg aaa ttg 1392Phe Leu Phe Ser Ala Gln Glu Leu Glu Thr Ala Val Arg Leu Lys Leu 450 455 460cca att gtc cat att atc tgg aac gat ggt cat tac aat atg gtg gaa 1440Pro Ile Val His Ile Ile Trp Asn Asp Gly His Tyr Asn Met Val Glu465 470 475 480ttc cag gaa gaa atg aag tac ggt cgt tca tct ggg gtt gac ttt ggt 1488Phe Gln Glu Glu Met Lys Tyr Gly Arg Ser Ser Gly Val Asp Phe Gly 485 490 495cct gta gat ttt gta aaa tat gct gag agc ttt gga gcc aaa ggt tat 1536Pro Val Asp Phe Val Lys Tyr Ala Glu Ser Phe Gly Ala Lys Gly Tyr 500 505 510cgt gca aca agt aaa gca gcg ttt gct agc ttg ctt caa gag gct ttg 1584Arg Ala Thr Ser Lys Ala Ala Phe Ala Ser Leu Leu Gln Glu Ala Leu 515 520 525act cag gct gta gat gga cca gtc ctt att gat gtt cca att gac tat 1632Thr Gln Ala Val Asp Gly Pro Val Leu Ile Asp Val Pro Ile Asp Tyr 530 535 540aaa gat aac att aaa ctc ggc gaa act att ttg cca gat gaa ttt tac 1680Lys Asp Asn Ile Lys Leu Gly Glu Thr Ile Leu Pro Asp Glu Phe Tyr545 550 555 560taa 168314560PRTStreptococcus thermophilus 14Val Phe Met Ser Glu Glu Lys Gln Leu Tyr Gly Ala Asp Leu Val Val1 5 10 15Asp Ser Leu Ile Asn His Asp Val Glu Tyr Val Phe Gly Ile Pro Gly 20 25 30Ala Lys Ile Asp Arg Val Phe Asp Thr Leu Glu Asp Lys Gly Pro Glu 35 40 45Leu Ile Val Ala Arg His Glu Gln Asn Ala Ala Phe Met Ala Gln Gly 50 55 60Val Gly Arg Ile Thr Gly Lys Pro Gly Val Val Leu Val Thr Ser Gly65 70 75 80Pro Gly Val Ser Asn Leu Ala Thr Gly Leu Val Thr Ala Thr Asp Glu 85 90 95Gly Asp Pro Val Leu Ala Ile Gly Gly Gln Val Lys Arg Ala Asp Leu 100 105 110Leu Lys Arg Ala His Gln Ser Met Asn Asn Val Ala Met Leu Glu Pro 115 120 125Ile Thr Lys Tyr Ala Ala Glu Val His Asp Ala Asn Thr Leu Ser Glu 130 135 140Thr Val Ala Asn Ala Tyr Arg His Ala Lys Ser Gly Lys Pro Gly Ala145 150 155 160Ser Phe Ile Ser Ile Pro Gln Asp Val Thr Asp Ala Pro Val Ser Val 165 170 175Lys Ala Ile Lys Pro Met Thr Asp Pro Lys Leu Gly Ser Ala Ser Val 180 185 190Ser Asp Ile Asn Tyr Leu Ala Gln Ala Ile Lys Asn Ala Val Leu Pro 195 200 205Val Phe Leu Leu Gly Asn Gly Ala Ser Ser Glu Ala Val Thr Tyr Ser 210 215 220Ile Arg Gln Ile Leu Lys His Val Lys Leu Pro Val Val Glu Thr Phe225 230 235 240Gln Gly Ala Gly Ile Val Ser Arg Asp Leu Glu Glu Asp Thr Phe Phe 245 250 255Gly Arg Val Gly Leu Phe Arg Asn Gln Pro Gly Asp Met Leu Leu Lys 260 265 270Lys Ser Asp Leu Val Ile Ala Ile Gly Tyr Asp Pro Ile Glu Tyr Glu 275 280 285Ala Arg Asn Trp Asn Ala Glu Ile Ser Ala Arg Ile Ile Val Ile Asp 290 295 300Val Glu Pro Ala Glu Val Asp Thr Tyr Phe Gln Pro Glu Arg Glu Leu305 310 315 320Ile Gly Asn Val Glu Ala Ser Leu Asp Leu Leu Leu Pro Ala Ile Gln 325 330 335Gly Tyr Lys Leu Pro Glu Gly Ala Val Glu Tyr Leu Lys Gly Leu Lys 340 345 350Asn Asn Val Val Glu Asp Val Lys Phe Asp Arg Gln Pro Asp Glu Gly 355 360 365Thr Val His Pro Leu Asp Phe Ile Glu Asn Leu Gln Glu His Thr Asp 370 375 380Asp Asp Met Thr Val Thr Phe Asp Val Gly Ser His Tyr Ile Trp Met385 390 395 400Ala Arg Tyr Leu Lys Ser Tyr Glu Pro Arg His Leu Leu Phe Ser Asn 405 410 415Gly Met Gln Thr Ile Gly Ile Ala Ile Thr Trp Ala Ile Ser Ala Ala 420 425 430Leu Val Arg Pro Lys Thr Lys Val Ile Ser Val Ser Gly Asp Gly Gly 435 440 445Phe Leu Phe Ser Ala Gln Glu Leu Glu Thr Ala Val Arg Leu Lys Leu 450 455 460Pro Ile Val His Ile Ile Trp Asn Asp Gly His Tyr Asn Met Val Glu465 470 475 480Phe Gln Glu Glu Met Lys Tyr Gly Arg Ser Ser Gly Val Asp Phe Gly 485 490 495Pro Val Asp Phe Val Lys Tyr Ala Glu Ser Phe Gly Ala Lys Gly Tyr 500 505 510Arg Ala Thr Ser Lys Ala Ala Phe Ala Ser Leu Leu Gln Glu Ala Leu 515 520 525Thr Gln Ala Val Asp Gly Pro Val Leu Ile Asp Val Pro Ile Asp Tyr 530 535 540Lys Asp Asn Ile Lys Leu Gly Glu Thr Ile Leu Pro Asp Glu Phe Tyr545 550 555 560151665DNAVibrio angustumCDS(1)..(1665) 15atg tcg gat aaa acc gtc tct ggt gct gaa ctg gtt gtt gaa act tta 48Met Ser Asp Lys Thr Val Ser Gly Ala Glu Leu Val Val Glu Thr Leu1 5 10 15aat gca cat aac gtt cca cac att ttt ggt att cct gga gca aag gtg 96Asn Ala His Asn Val Pro His Ile Phe Gly Ile Pro Gly Ala Lys Val 20 25 30gat gct gtt ttc gat gct gtt tgt gat aac gga cca gaa atc att att 144Asp Ala Val Phe Asp Ala Val Cys Asp Asn Gly Pro Glu Ile Ile Ile 35 40 45tgt cat cat gaa caa aat gca gcg ttt atg gca gca gca act ggg cgt 192Cys His His Glu Gln Asn Ala Ala Phe Met Ala Ala Ala Thr Gly Arg 50 55 60tta acg ggt aaa gca ggc att tgt tta gca acc tct gga cca ggc gca 240Leu Thr Gly Lys Ala Gly Ile Cys Leu Ala Thr Ser Gly Pro Gly Ala65 70 75 80tca aac ctt gtc aca ggc gtt gca aca gcg aat agt gaa ggt gat cct 288Ser Asn Leu Val Thr Gly Val Ala Thr Ala Asn Ser Glu Gly Asp Pro 85 90 95gtg gtt gca ctt gca ggt gct gta cct ctt tct atg tat tct cac aat 336Val Val Ala Leu Ala Gly Ala Val Pro Leu Ser Met Tyr Ser His Asn 100 105 110act cat caa tcc atg gat acc cgt tca ctg ttt act cct atc acc aag 384Thr His Gln Ser Met Asp Thr Arg Ser Leu Phe Thr Pro Ile Thr Lys 115 120 125ttt tca gca gaa gtg atg gat agc agc tcg gta tct gat gtt gta cat 432Phe Ser Ala Glu Val Met Asp Ser Ser Ser Val Ser Asp Val Val His 130 135 140aaa gct ttt cgt att gca gag caa cct acc caa ggt gct agc ttt gtt 480Lys Ala Phe Arg Ile Ala Glu Gln Pro Thr Gln Gly Ala Ser Phe Val145 150 155 160agt cta ccg caa gat att cta act aac cgt att cct tac cag cca gta 528Ser Leu Pro Gln Asp Ile Leu Thr Asn Arg Ile Pro Tyr Gln Pro Val 165 170 175caa cag cct aat cca att ttg ttc ggt ggt gca cac cca caa gct att 576Gln Gln Pro Asn Pro Ile Leu Phe Gly Gly Ala His Pro Gln Ala Ile 180 185 190cgt cag gct gct gat cgc att aat gct gca aaa aat ccg gtg tta tta 624Arg Gln Ala Ala Asp Arg Ile Asn Ala Ala Lys Asn Pro Val Leu Leu 195 200 205ctg ggc atg gat gca agc cag cct ttt gtt gct gat gct att cgc caa 672Leu Gly Met Asp Ala Ser Gln Pro Phe Val Ala Asp Ala Ile Arg Gln 210 215 220cta ctc aaa caa aca cca att gcc gtt gtg aat acg ttt gcc gca gct 720Leu Leu Lys Gln Thr Pro Ile Ala Val Val Asn Thr Phe Ala Ala Ala225 230 235 240ggg gtt att tct cat gat tta tac aac tgc ttt tta ggt cgt gtt ggc 768Gly Val Ile Ser His Asp Leu Tyr Asn Cys Phe Leu Gly Arg Val Gly 245 250 255tta ttt aaa aat caa ccc ggt gat att gca tta aac agt gca gat tta 816Leu Phe Lys Asn Gln Pro Gly Asp Ile Ala Leu Asn Ser Ala Asp Leu 260 265 270atc att acc att ggc tac agc cca att gaa tac gat ccg att ctt tgg 864Ile Ile Thr Ile Gly Tyr Ser Pro Ile Glu Tyr Asp Pro Ile Leu Trp 275 280 285aat aaa gat gca aac aca cca att att cat att ggt tat caa caa gca 912Asn Lys Asp Ala Asn Thr Pro Ile Ile His Ile Gly Tyr Gln Gln Ala 290 295 300gat tta gaa att agc tat aac cct gtt tgt gaa gtt gtg ggt gac tta 960Asp Leu Glu Ile Ser Tyr Asn Pro Val Cys Glu Val Val Gly Asp Leu305 310 315 320gcg gtg tct gtc acg tct att gct tct gaa tta gat aag cga gaa tca 1008Ala Val Ser Val Thr Ser Ile Ala Ser Glu Leu Asp Lys Arg Glu Ser 325 330 335tta gaa aat aac caa caa atc caa tta tta cgc cac gat tta caa cat 1056Leu Glu Asn Asn Gln Gln Ile Gln Leu Leu Arg His Asp Leu Gln His 340 345 350att atg cag atg ggg gta aat aaa acc tca aca aac ggc gtt cac ccg 1104Ile Met Gln Met Gly Val Asn Lys Thr Ser Thr Asn Gly Val His Pro 355 360 365ctt cgt ttt gtt cat gag tta cgt cgc ttt gtt agt gac gac acc act 1152Leu Arg Phe Val His Glu Leu Arg Arg Phe Val Ser Asp Asp Thr Thr 370 375 380gta tgt tgt gat gta ggc tct att tat att tgg atg gca cgt tac ttc 1200Val Cys Cys Asp Val Gly Ser Ile Tyr Ile Trp Met Ala Arg Tyr Phe385 390 395 400cac agc ttt gaa cct cgt cgt tta ttg ttc agc aat ggc caa caa aca 1248His Ser Phe Glu Pro Arg Arg Leu Leu Phe Ser Asn Gly Gln Gln Thr 405 410 415ttg ggc gta gct tta cct tgg gca att gca gct tcc ctt ctt cac cct 1296Leu Gly Val Ala Leu Pro Trp Ala Ile Ala Ala Ser Leu Leu His Pro 420 425 430aat gaa aaa gta att tcc atg tct ggt gat ggt ggc ttc cta ttc tca 1344Asn Glu Lys Val Ile Ser Met Ser Gly Asp Gly Gly Phe Leu Phe Ser 435 440 445tca atg gaa tta gcc acg gcc gtt cgc cat aaa tgt aat atc gtt cac 1392Ser Met Glu Leu Ala Thr Ala Val Arg His Lys Cys Asn Ile Val His 450 455 460ttt gtt tgg aca gat cac agt tat gac atg gtt aag atc caa cag ctt 1440Phe Val Trp Thr Asp His Ser Tyr Asp Met Val Lys Ile Gln Gln Leu465 470 475 480aaa aag tat ggt cga gag agt gcc gtc agc ttt ata ggt cct gat att 1488Lys Lys Tyr Gly Arg Glu Ser Ala Val Ser Phe Ile Gly Pro Asp Ile 485 490 495gtt aag tac gca gaa agc ttc ggc gca cat ggt tta gcg atc aat act 1536Val Lys Tyr Ala Glu Ser Phe Gly Ala His Gly Leu Ala Ile Asn Thr 500 505 510gcc gat gat att gag cct gtt atg cga aaa gct atg agc tta agt ggc 1584Ala Asp Asp Ile Glu Pro Val Met Arg Lys Ala Met Ser Leu Ser Gly 515 520 525cca gta ttg gtc aac gtc aat gtt gat tat agc gat aac agt cgc cta 1632Pro Val Leu Val Asn Val Asn Val Asp Tyr Ser Asp Asn Ser Arg Leu 530 535 540ctt gat caa ctt cat cca tgc caa caa gat taa 1665Leu Asp Gln Leu His Pro Cys Gln Gln Asp545 55016554PRTVibrio angustum 16Met Ser Asp Lys Thr Val Ser Gly Ala Glu Leu Val Val Glu Thr Leu1 5 10 15Asn Ala His Asn Val Pro His Ile Phe Gly Ile Pro Gly Ala Lys Val 20 25 30Asp Ala Val Phe Asp Ala Val Cys Asp Asn Gly Pro Glu Ile Ile Ile 35 40 45Cys His His Glu Gln Asn Ala Ala Phe Met Ala Ala Ala Thr Gly Arg 50 55 60Leu Thr Gly Lys Ala Gly Ile Cys Leu Ala Thr Ser Gly Pro Gly Ala65 70 75 80Ser Asn Leu Val Thr Gly Val Ala Thr Ala Asn Ser Glu Gly Asp Pro 85 90 95Val Val Ala Leu Ala Gly Ala Val Pro Leu Ser Met Tyr Ser His Asn 100 105 110Thr His Gln Ser Met Asp Thr Arg Ser Leu Phe Thr Pro Ile Thr Lys 115 120 125Phe Ser Ala Glu Val Met Asp Ser Ser Ser Val Ser Asp Val Val His 130 135 140Lys Ala Phe Arg Ile Ala Glu Gln Pro Thr Gln Gly Ala Ser Phe Val145 150 155 160Ser Leu Pro Gln Asp Ile Leu Thr Asn Arg Ile Pro Tyr Gln Pro Val 165 170 175Gln Gln Pro Asn Pro Ile Leu Phe Gly Gly Ala His Pro Gln Ala Ile 180 185 190Arg Gln Ala Ala Asp Arg Ile Asn Ala Ala Lys Asn Pro Val Leu Leu 195 200 205Leu Gly Met Asp Ala Ser Gln Pro Phe Val Ala Asp Ala Ile Arg Gln 210 215 220Leu Leu Lys Gln Thr Pro Ile Ala Val Val Asn Thr Phe Ala Ala Ala225 230 235 240Gly Val Ile Ser His Asp Leu Tyr Asn Cys Phe Leu Gly Arg Val Gly 245 250 255Leu Phe Lys Asn Gln Pro Gly Asp Ile Ala Leu Asn Ser Ala Asp Leu 260 265 270Ile Ile Thr Ile Gly Tyr Ser Pro Ile Glu Tyr Asp Pro Ile Leu Trp 275 280 285Asn Lys Asp Ala Asn Thr Pro Ile Ile His Ile Gly Tyr Gln Gln Ala 290 295 300Asp Leu Glu Ile Ser Tyr Asn Pro Val Cys Glu Val Val Gly Asp Leu305 310 315 320Ala Val Ser Val Thr Ser Ile Ala Ser Glu Leu Asp Lys Arg Glu Ser 325 330 335Leu Glu Asn Asn Gln Gln Ile Gln Leu Leu Arg His Asp Leu Gln His 340 345 350Ile Met Gln Met Gly Val Asn Lys Thr Ser Thr Asn Gly Val His Pro 355 360 365Leu Arg Phe Val His Glu Leu Arg Arg Phe Val Ser Asp Asp Thr Thr 370 375 380Val Cys Cys Asp Val Gly Ser Ile Tyr Ile Trp Met Ala Arg Tyr Phe385 390 395 400His Ser Phe Glu Pro Arg Arg Leu Leu Phe Ser Asn Gly Gln Gln Thr 405 410 415Leu Gly Val Ala Leu Pro Trp Ala Ile Ala Ala Ser Leu Leu His Pro 420 425 430Asn Glu Lys Val Ile Ser Met Ser Gly Asp Gly Gly Phe Leu Phe Ser 435 440 445Ser Met Glu Leu Ala Thr Ala Val Arg His Lys Cys Asn Ile Val His 450 455 460Phe Val Trp Thr Asp His Ser Tyr Asp Met Val Lys Ile Gln Gln Leu465 470 475 480Lys Lys Tyr Gly Arg Glu Ser Ala Val Ser Phe Ile Gly Pro Asp Ile 485 490 495Val Lys Tyr Ala Glu Ser Phe Gly Ala His Gly Leu Ala Ile Asn Thr 500 505 510Ala Asp Asp Ile Glu Pro Val Met Arg Lys Ala Met Ser Leu Ser Gly 515 520 525Pro Val Leu Val Asn Val Asn Val Asp Tyr Ser Asp Asn Ser Arg Leu 530 535 540Leu Asp Gln Leu His Pro Cys Gln Gln Asp545 550171689DNABacillus cereus 17ttgagtacag gtgtaaaagc aaacgacgtg aagacaaaaa caaaaggagc agatcttgtt 60gttgattgtt taattaaaca aggtgttaca catgttttcg gtattccagg agcaaagatt 120gactctgtat ttgatgtact

gcaagaaaga ggaccagagt taattgtttg tcgtcatgaa 180caaaatgcag catttatggc agctgctatt ggtagattaa caggaaaacc gggcgtatgt 240cttgtaactt caggaccagg gacatcaaat ttagcgacag gtcttgttac tgcgaatgcg 300gagagtgatc ccgttgttgc tttagctggt gcagttccgc gtacggatcg attaaaacgt 360acacatcaat ctatggataa tgctgcacta ttcgaaccaa tcacaaaata tagcgtagaa 420gtagagcatc ctgataatgt gccagaagca ttatcaaatg cattccgaag tgcgacttct 480acaaatccag gagcaacttt agtaagtttg ccgcaagacg ttatgactgc ggaaacgact 540gtagagtcta tcggtgcgct ttctaagcca cagcttggaa tcgctcccac acatgatatt 600acatatgtag tagataaaat aaaagcagcg aaattaccag ttattttact cggtatgaga 660gcgagcacaa atgaagtgac gaaagccgtt cgtaaattaa ttgcggatac agaacttcct 720gtcgttgaaa catatcaagc ggctggtgcc atttcacgtg agttagaaga tcatttcttc 780ggccgtgttg gactattccg taaccaacca ggtgatattt tactagaaga ggcagatctt 840gttatttcta tcggttatga tccaattgag tatgatccaa agttctggaa taaacttgga 900gacagaacga ttattcatct tgatgaccat caagcagata tagatcatga ttaccaacca 960gagcgtgaat taattggtga tattgcctta acagtaaata gcatcgcaga aaagttaccg 1020aaacttgtgt taagtacgaa atcagaagca gtgttagaac gattacgcgc gaaattatca 1080gaacaagcag aagttccaaa tcgtccttca gaaggtgtta cacatccgct tcaagtgatt 1140cgtacacttc gttctttaat tagtgacgac acaaccgtta catgtgacat cggttcccat 1200tctatttgga tggcgagatg tttccgttct tatgaaccac gtagattatt atttagtaac 1260ggtatgcaga cgttaggtgt tgcacttcct tgggcaattg ctgctacttt agtagaacca 1320ggtaaaaaag tagtttccgt atcaggtgac ggtggtttct tattctcagc gatggagtta 1380gaaacggcgg tacgtttaaa ttctccaatc gtccatcttg tttggagaga cggcacatat 1440gatatggttg cattccaaca aatgatgaaa tacggcagaa catcagctac agagtttggt 1500gatgttgatc ttgttaaata tgcggaaagt ttcggggcgt taggtcttcg tgttaacacg 1560cctgatgaat tagaaggggt attgaaagaa gcactagcag cagacggccc tgtcattatt 1620gatattccaa ttgactatcg tgacaacatt aaattaagcg aaaaattatt accaaaccaa 1680ttaaactaa 168918562PRTBacillus cereus 18Met Ser Thr Gly Val Lys Ala Asn Asp Val Lys Thr Lys Thr Lys Gly1 5 10 15Ala Asp Leu Val Val Asp Cys Leu Ile Lys Gln Gly Val Thr His Val 20 25 30Phe Gly Ile Pro Gly Ala Lys Ile Asp Ser Val Phe Asp Val Leu Gln 35 40 45Glu Arg Gly Pro Glu Leu Ile Val Cys Arg His Glu Gln Asn Ala Ala 50 55 60Phe Met Ala Ala Ala Ile Gly Arg Leu Thr Gly Lys Pro Gly Val Cys65 70 75 80Leu Val Thr Ser Gly Pro Gly Thr Ser Asn Leu Ala Thr Gly Leu Val 85 90 95Thr Ala Asn Ala Glu Ser Asp Pro Val Val Ala Leu Ala Gly Ala Val 100 105 110Pro Arg Thr Asp Arg Leu Lys Arg Thr His Gln Ser Met Asp Asn Ala 115 120 125Ala Leu Phe Glu Pro Ile Thr Lys Tyr Ser Val Glu Val Glu His Pro 130 135 140Asp Asn Val Pro Glu Ala Leu Ser Asn Ala Phe Arg Ser Ala Thr Ser145 150 155 160Thr Asn Pro Gly Ala Thr Leu Val Ser Leu Pro Gln Asp Val Met Thr 165 170 175Ala Glu Thr Thr Val Glu Ser Ile Gly Ala Leu Ser Lys Pro Gln Leu 180 185 190Gly Ile Ala Pro Thr His Asp Ile Thr Tyr Val Val Asp Lys Ile Lys 195 200 205Ala Ala Lys Leu Pro Val Ile Leu Leu Gly Met Arg Ala Ser Thr Asn 210 215 220Glu Val Thr Lys Ala Val Arg Lys Leu Ile Ala Asp Thr Glu Leu Pro225 230 235 240Val Val Glu Thr Tyr Gln Ala Ala Gly Ala Ile Ser Arg Glu Leu Glu 245 250 255Asp His Phe Phe Gly Arg Val Gly Leu Phe Arg Asn Gln Pro Gly Asp 260 265 270Ile Leu Leu Glu Glu Ala Asp Leu Val Ile Ser Ile Gly Tyr Asp Pro 275 280 285Ile Glu Tyr Asp Pro Lys Phe Trp Asn Lys Leu Gly Asp Arg Thr Ile 290 295 300Ile His Leu Asp Asp His Gln Ala Asp Ile Asp His Asp Tyr Gln Pro305 310 315 320Glu Arg Glu Leu Ile Gly Asp Ile Ala Leu Thr Val Asn Ser Ile Ala 325 330 335Glu Lys Leu Pro Lys Leu Val Leu Ser Thr Lys Ser Glu Ala Val Leu 340 345 350Glu Arg Leu Arg Ala Lys Leu Ser Glu Gln Ala Glu Val Pro Asn Arg 355 360 365Pro Ser Glu Gly Val Thr His Pro Leu Gln Val Ile Arg Thr Leu Arg 370 375 380Ser Leu Ile Ser Asp Asp Thr Thr Val Thr Cys Asp Ile Gly Ser His385 390 395 400Ser Ile Trp Met Ala Arg Cys Phe Arg Ser Tyr Glu Pro Arg Arg Leu 405 410 415Leu Phe Ser Asn Gly Met Gln Thr Leu Gly Val Ala Leu Pro Trp Ala 420 425 430Ile Ala Ala Thr Leu Val Glu Pro Gly Lys Lys Val Val Ser Val Ser 435 440 445Gly Asp Gly Gly Phe Leu Phe Ser Ala Met Glu Leu Glu Thr Ala Val 450 455 460Arg Leu Asn Ser Pro Ile Val His Leu Val Trp Arg Asp Gly Thr Tyr465 470 475 480Asp Met Val Ala Phe Gln Gln Met Met Lys Tyr Gly Arg Thr Ser Ala 485 490 495Thr Glu Phe Gly Asp Val Asp Leu Val Lys Tyr Ala Glu Ser Phe Gly 500 505 510Ala Leu Gly Leu Arg Val Asn Thr Pro Asp Glu Leu Glu Gly Val Leu 515 520 525Lys Glu Ala Leu Ala Ala Asp Gly Pro Val Ile Ile Asp Ile Pro Ile 530 535 540Asp Tyr Arg Asp Asn Ile Lys Leu Ser Glu Lys Leu Leu Pro Asn Gln545 550 555 560Leu Asn19780DNAKlebsiella pneumoniae 19atgaatcatt ctgctgaatg cacctgcgaa gagagtctat gcgaaaccct gcgggcgttt 60tccgcgcagc atcccgagag cgtgctctat cagacatcgc tcatgagcgc cctgctgagc 120ggggtttacg aaggcagcac caccatcgcg gacctgctga aacacggcga tttcggcctc 180ggcaccttta atgagctgga cggggagctg atcgccttca gcagtcaggt ctatcagctg 240cgcgccgacg gcagcgcgcg caaagcccag ccggagcaga aaacgccgtt cgcggtgatg 300acctggttcc agccgcagta ccggaaaacc tttgaccatc cggtgagccg ccagcagctg 360cacgaggtga tcgaccagca aatcccctct gacaacctgt tctgcgccct gcgcatcgac 420ggccatttcc gccatgccca tacccgcacc gtgccgcgcc agacgccgcc gtaccgggcg 480atgaccgacg tcctcgacga tcagccggtg ttccgcttta accagcgcga aggggtgctg 540gtcggcttcc ggaccccgca gcatatgcag gggatcaacg tcgccgggta tcacgagcac 600tttattaccg atgaccgcaa aggcggcggt cacctgctgg attaccagct cgaccatggg 660gtgctgacct tcggcgaaat tcacaagctg atgatcgacc tgcccgccga cagcgcgttc 720ctgcaggcta atctgcatcc cgataatctc gatgccgcca tccgttccgt agaaagttaa 78020259PRTKlebsiella pneumoniae 20Met Asn His Ser Ala Glu Cys Thr Cys Glu Glu Ser Leu Cys Glu Thr1 5 10 15Leu Arg Ala Phe Ser Ala Gln His Pro Glu Ser Val Leu Tyr Gln Thr 20 25 30Ser Leu Met Ser Ala Leu Leu Ser Gly Val Tyr Glu Gly Ser Thr Thr 35 40 45Ile Ala Asp Leu Leu Lys His Gly Asp Phe Gly Leu Gly Thr Phe Asn 50 55 60Glu Leu Asp Gly Glu Leu Ile Ala Phe Ser Ser Gln Val Tyr Gln Leu65 70 75 80Arg Ala Asp Gly Ser Ala Arg Lys Ala Gln Pro Glu Gln Lys Thr Pro 85 90 95Phe Ala Val Met Thr Trp Phe Gln Pro Gln Tyr Arg Lys Thr Phe Asp 100 105 110His Pro Val Ser Arg Gln Gln Leu His Glu Val Ile Asp Gln Gln Ile 115 120 125Pro Ser Asp Asn Leu Phe Cys Ala Leu Arg Ile Asp Gly His Phe Arg 130 135 140His Ala His Thr Arg Thr Val Pro Arg Gln Thr Pro Pro Tyr Arg Ala145 150 155 160Met Thr Asp Val Leu Asp Asp Gln Pro Val Phe Arg Phe Asn Gln Arg 165 170 175Glu Gly Val Leu Val Gly Phe Arg Thr Pro Gln His Met Gln Gly Ile 180 185 190Asn Val Ala Gly Tyr His Glu His Phe Ile Thr Asp Asp Arg Lys Gly 195 200 205Gly Gly His Leu Leu Asp Tyr Gln Leu Asp His Gly Val Leu Thr Phe 210 215 220Gly Glu Ile His Lys Leu Met Ile Asp Leu Pro Ala Asp Ser Ala Phe225 230 235 240Leu Gln Ala Asn Leu His Pro Asp Asn Leu Asp Ala Ala Ile Arg Ser 245 250 255Val Glu Ser21768DNABacillus subtilis 21atgaaacgag aaagcaacat tcaagtgctc agccgtggtc aaaaagatca gcctgtgagc 60cagatttatc aagtatcaac aatgacttct ctattagacg gagtatatga cggagatttt 120gaactgtcag agattccgaa atatggagac ttcggtatcg gaacctttaa caagcttgac 180ggagagctga ttgggtttga cggcgaattt taccgtcttc gctcagacgg aaccgcgaca 240ccggtccaaa atggagaccg ttcaccgttc tgttcattta cgttctttac accggacatg 300acgcacaaaa ttgatgcgaa aatgacacgc gaagactttg aaaaagagat caacagcatg 360ctgccaagca gaaacttatt ttatgcaatt cgcattgacg gattgtttaa aaaggtgcag 420acaagaacag tagaacttca agaaaaacct tacgtgccaa tggttgaagc ggtcaaaaca 480cagccgattt tcaacttcga caacgtgaga ggaacgattg taggtttctt gacaccagct 540tatgcaaacg gaatcgccgt ttctggctat cacctgcact tcattgacga aggacgcaat 600tcaggcggac acgtttttga ctatgtgctt gaggattgca cggttacgat ttctcaaaaa 660atgaacatga atctcagact tccgaacaca gcggatttct ttaatgcgaa tctggataac 720cctgattttg cgaaagatat cgaaacaact gaaggaagcc ctgaataa 76822255PRTBacillus subtilis 22Met Lys Arg Glu Ser Asn Ile Gln Val Leu Ser Arg Gly Gln Lys Asp1 5 10 15Gln Pro Val Ser Gln Ile Tyr Gln Val Ser Thr Met Thr Ser Leu Leu 20 25 30Asp Gly Val Tyr Asp Gly Asp Phe Glu Leu Ser Glu Ile Pro Lys Tyr 35 40 45Gly Asp Phe Gly Ile Gly Thr Phe Asn Lys Leu Asp Gly Glu Leu Ile 50 55 60Gly Phe Asp Gly Glu Phe Tyr Arg Leu Arg Ser Asp Gly Thr Ala Thr65 70 75 80Pro Val Gln Asn Gly Asp Arg Ser Pro Phe Cys Ser Phe Thr Phe Phe 85 90 95Thr Pro Asp Met Thr His Lys Ile Asp Ala Lys Met Thr Arg Glu Asp 100 105 110Phe Glu Lys Glu Ile Asn Ser Met Leu Pro Ser Arg Asn Leu Phe Tyr 115 120 125Ala Ile Arg Ile Asp Gly Leu Phe Lys Lys Val Gln Thr Arg Thr Val 130 135 140Glu Leu Gln Glu Lys Pro Tyr Val Pro Met Val Glu Ala Val Lys Thr145 150 155 160Gln Pro Ile Phe Asn Phe Asp Asn Val Arg Gly Thr Ile Val Gly Phe 165 170 175Leu Thr Pro Ala Tyr Ala Asn Gly Ile Ala Val Ser Gly Tyr His Leu 180 185 190His Phe Ile Asp Glu Gly Arg Asn Ser Gly Gly His Val Phe Asp Tyr 195 200 205Val Leu Glu Asp Cys Thr Val Thr Ile Ser Gln Lys Met Asn Met Asn 210 215 220Leu Arg Leu Pro Asn Thr Ala Asp Phe Phe Asn Ala Asn Leu Asp Asn225 230 235 240Pro Asp Phe Ala Lys Asp Ile Glu Thr Thr Glu Gly Ser Pro Glu 245 250 25523780DNAKlebsiella terrigena 23gtgaatcatt atcctgaatg cacctgccag gagagcctgt gcgaaaccgt acgcggcttc 60tccgcccacc accctgatag cgttatctat cagacctctc tgatgagcgc gctgctgagc 120ggggtctatg agggtagcac caccatcgcc gacctgctga cccacggcga cttcggtctc 180ggcaccttta acgaactcga tggcgaactg attgccttta gcagcgaggt ctaccagctg 240cgcgctgacg gcagcgcgcg taaagcccgg gcggatcaaa aaacgccctt cgcggtgatg 300acctggttca gaccgcagta ccgtaaaacc tttgaccacc cggtcagccg ccagcagctg 360cacgacgtta tcgaccagca aatcccctcc gataacctgt tctgcgccct gcatattgat 420ggtcactttc gccacgccca cacccgcacc gtgccgcggc agacgccgcc ctatcgggcg 480atgaccgacg tgctcgatga ccagccggtt ttccgcttca accagcgcaa ggggacgctg 540gtcggctttc gcaccccgca gcatatgcag ggccttaacg ttgccggcta ccacgagcac 600tttattaccg acgatcgcca gggcggcggc catctgctgg actaccagct cgatagcggc 660gtgctgacct tcggcgagat ccacaagctg atgattgacc tcccggccga cagcgctttc 720ctgcaggccg acctgcatcc tgacaatctc gatgccgcta ttcgtgcggt agaaaactaa 78024259PRTKlebsiella terrigena 24Met Asn His Tyr Pro Glu Cys Thr Cys Gln Glu Ser Leu Cys Glu Thr1 5 10 15Val Arg Gly Phe Ser Ala His His Pro Asp Ser Val Ile Tyr Gln Thr 20 25 30Ser Leu Met Ser Ala Leu Leu Ser Gly Val Tyr Glu Gly Ser Thr Thr 35 40 45Ile Ala Asp Leu Leu Thr His Gly Asp Phe Gly Leu Gly Thr Phe Asn 50 55 60Glu Leu Asp Gly Glu Leu Ile Ala Phe Ser Ser Glu Val Tyr Gln Leu65 70 75 80Arg Ala Asp Gly Ser Ala Arg Lys Ala Arg Ala Asp Gln Lys Thr Pro 85 90 95Phe Ala Val Met Thr Trp Phe Arg Pro Gln Tyr Arg Lys Thr Phe Asp 100 105 110His Pro Val Ser Arg Gln Gln Leu His Asp Val Ile Asp Gln Gln Ile 115 120 125Pro Ser Asp Asn Leu Phe Cys Ala Leu His Ile Asp Gly His Phe Arg 130 135 140His Ala His Thr Arg Thr Val Pro Arg Gln Thr Pro Pro Tyr Arg Ala145 150 155 160Met Thr Asp Val Leu Asp Asp Gln Pro Val Phe Arg Phe Asn Gln Arg 165 170 175Lys Gly Thr Leu Val Gly Phe Arg Thr Pro Gln His Met Gln Gly Leu 180 185 190Asn Val Ala Gly Tyr His Glu His Phe Ile Thr Asp Asp Arg Gln Gly 195 200 205Gly Gly His Leu Leu Asp Tyr Gln Leu Asp Ser Gly Val Leu Thr Phe 210 215 220Gly Glu Ile His Lys Leu Met Ile Asp Leu Pro Ala Asp Ser Ala Phe225 230 235 240Leu Gln Ala Asp Leu His Pro Asp Asn Leu Asp Ala Ala Ile Arg Ala 245 250 255Val Glu Asn25771DNAKlebsiella pneumoniae 25atgaaaaaag tcgcacttgt taccggcgcc ggccagggga ttggtaaagc tatcgccctt 60cgtctggtga aggatggatt tgccgtggcc attgccgatt ataacgacgc caccgccaaa 120gcggtcgcct cggaaatcaa ccaggccggc ggacacgccg tggcggtgaa agtggatgtc 180tccgaccgcg atcaggtatt tgccgccgtt gaacaggcgc gcaaaacgct gggcggcttc 240gacgtcatcg tcaataacgc cggtgtggca ccgtctacgc cgatcgagtc cattaccccg 300gagattgtcg acaaagtcta caacatcaac gtcaaagggg tgatctgggg tattcaggcg 360gcggtcgagg cctttaagaa agaggggcac ggcgggaaaa tcatcaacgc ctgttcccag 420gccggccacg tcggcaaccc ggagctggcg gtgtatagct ccagtaaatt cgcggtacgc 480ggcttaaccc agaccgccgc tcgcgacctc gcgccgctgg gcatcacggt caacggctac 540tgcccgggga ttgtcaaaac gccaatgtgg gccgaaattg accgccaggt gtccgaagcc 600gccggtaaac cgctgggcta cggtaccgcc gagttcgcca aacgcatcac tctcggtcgt 660ctgtccgagc cggaagatgt cgccgcctgc gtctcctatc ttgccagccc ggattctgat 720tacatgaccg gtcagtcgtt gctgatcgac ggcgggatgg tatttaacta a 77126256PRTKlebsiella pneumoniae 26Met Lys Lys Val Ala Leu Val Thr Gly Ala Gly Gln Gly Ile Gly Lys1 5 10 15Ala Ile Ala Leu Arg Leu Val Lys Asp Gly Phe Ala Val Ala Ile Ala 20 25 30Asp Tyr Asn Asp Ala Thr Ala Lys Ala Val Ala Ser Glu Ile Asn Gln 35 40 45Ala Gly Gly His Ala Val Ala Val Lys Val Asp Val Ser Asp Arg Asp 50 55 60Gln Val Phe Ala Ala Val Glu Gln Ala Arg Lys Thr Leu Gly Gly Phe65 70 75 80Asp Val Ile Val Asn Asn Ala Gly Val Ala Pro Ser Thr Pro Ile Glu 85 90 95Ser Ile Thr Pro Glu Ile Val Asp Lys Val Tyr Asn Ile Asn Val Lys 100 105 110Gly Val Ile Trp Gly Ile Gln Ala Ala Val Glu Ala Phe Lys Lys Glu 115 120 125Gly His Gly Gly Lys Ile Ile Asn Ala Cys Ser Gln Ala Gly His Val 130 135 140Gly Asn Pro Glu Leu Ala Val Tyr Ser Ser Ser Lys Phe Ala Val Arg145 150 155 160Gly Leu Thr Gln Thr Ala Ala Arg Asp Leu Ala Pro Leu Gly Ile Thr 165 170 175Val Asn Gly Tyr Cys Pro Gly Ile Val Lys Thr Pro Met Trp Ala Glu 180 185 190Ile Asp Arg Gln Val Ser Glu Ala Ala Gly Lys Pro Leu Gly Tyr Gly 195 200 205Thr Ala Glu Phe Ala Lys Arg Ile Thr Leu Gly Arg Leu Ser Glu Pro 210 215 220Glu Asp Val Ala Ala Cys Val Ser Tyr Leu Ala Ser Pro Asp Ser Asp225 230 235 240Tyr Met Thr Gly Gln Ser Leu Leu Ile Asp Gly Gly Met Val Phe Asn 245 250 255271053DNABacillus cereus 27atgaaagcac tactttggca taatcaacgt gatgtacgag tagaagaagt accagaacca 60acagtaaaac caggaacagt gaaaatcaaa gttaaatggt gtggtatttg tgggacagac 120ttgcatgaat atttagcagg gcctattttt attccaacag aagaacatcc attaacacat 180gtgaaagcac ctgttatttt aggtcatgag tttagtggtg aggtaataga gattggtgaa 240ggagttacat ctcataaagt gggagaccgc gttgttgtag agccaattta ttcttgtggt 300aaatgtgaag cttgtaaaca tggacattac aatgtttgtg aacaacttgt tttccacggt 360cttggcggag aaggcggcgg tttctctgaa tatacagtag

taccagaaga tatggttcat 420cacattccag atgaaatgac gtatgaacaa ggtgcgcttg tagaaccagc agcagtagca 480gttcatgcag tacgtcaaag taaattaaaa gaaggggaag ctgtagcggt atttggttgc 540ggtccaattg gacttcttgt tatccaagca gctaaagcag caggagcaac tcctgttatt 600gcagttgaac tttctaaaga acgtcaagag ttagcgaaat tagcaggtgc ggattatgta 660ttaaatccag caactcaaga tgtgttagct gaaattcgta acttaacaaa tggtttaggt 720gtaaatgtta gctttgaagt aacaggtgtt gaagttgtac tacgccaagc gattgaaagt 780acaagcttcg aaggacaaac tgtaattgtt agtgtatggg aaaaagacgc aacaattact 840ccaaataact tagtattaaa agaaaaagaa gttattggta ttttaggata ccgtcacatc 900ttcccagctg ttattaaatt gattagctcc ggtcaaattc aagcagagaa attaattacg 960aaaaaaatta cagtggatca agttgttgaa gaaggatttg aagcacttgt aaaagataaa 1020acacaagtga aaattcttgt ttcacctaaa taa 105328350PRTBacillus cereus 28Met Lys Ala Leu Leu Trp His Asn Gln Arg Asp Val Arg Val Glu Glu1 5 10 15Val Pro Glu Pro Thr Val Lys Pro Gly Thr Val Lys Ile Lys Val Lys 20 25 30Trp Cys Gly Ile Cys Gly Thr Asp Leu His Glu Tyr Leu Ala Gly Pro 35 40 45Ile Phe Ile Pro Thr Glu Glu His Pro Leu Thr His Val Lys Ala Pro 50 55 60Val Ile Leu Gly His Glu Phe Ser Gly Glu Val Ile Glu Ile Gly Glu65 70 75 80Gly Val Thr Ser His Lys Val Gly Asp Arg Val Val Val Glu Pro Ile 85 90 95Tyr Ser Cys Gly Lys Cys Glu Ala Cys Lys His Gly His Tyr Asn Val 100 105 110Cys Glu Gln Leu Val Phe His Gly Leu Gly Gly Glu Gly Gly Gly Phe 115 120 125Ser Glu Tyr Thr Val Val Pro Glu Asp Met Val His His Ile Pro Asp 130 135 140Glu Met Thr Tyr Glu Gln Gly Ala Leu Val Glu Pro Ala Ala Val Ala145 150 155 160Val His Ala Val Arg Gln Ser Lys Leu Lys Glu Gly Glu Ala Val Ala 165 170 175Val Phe Gly Cys Gly Pro Ile Gly Leu Leu Val Ile Gln Ala Ala Lys 180 185 190Ala Ala Gly Ala Thr Pro Val Ile Ala Val Glu Leu Ser Lys Glu Arg 195 200 205Gln Glu Leu Ala Lys Leu Ala Gly Ala Asp Tyr Val Leu Asn Pro Ala 210 215 220Thr Gln Asp Val Leu Ala Glu Ile Arg Asn Leu Thr Asn Gly Leu Gly225 230 235 240Val Asn Val Ser Phe Glu Val Thr Gly Val Glu Val Val Leu Arg Gln 245 250 255Ala Ile Glu Ser Thr Ser Phe Glu Gly Gln Thr Val Ile Val Ser Val 260 265 270Trp Glu Lys Asp Ala Thr Ile Thr Pro Asn Asn Leu Val Leu Lys Glu 275 280 285Lys Glu Val Ile Gly Ile Leu Gly Tyr Arg His Ile Phe Pro Ala Val 290 295 300Ile Lys Leu Ile Ser Ser Gly Gln Ile Gln Ala Glu Lys Leu Ile Thr305 310 315 320Lys Lys Ile Thr Val Asp Gln Val Val Glu Glu Gly Phe Glu Ala Leu 325 330 335Val Lys Asp Lys Thr Gln Val Lys Ile Leu Val Ser Pro Lys 340 345 350291113DNALactococcus lactis 29ttgcctgaaa cgacaaccat cctatataga ggaggcgttt ttatgcgcgc agcacgtttt 60tacgaccgcg gggatatccg cattgatgaa attaatgaac caatagtaaa agctggccaa 120gttggcattg atgtggcttg gtgtggaatt tgtggaacag atctccatga atttttagat 180ggcccaattt tttgtccgtc agcagaacat cctaatccaa ttactggaga agtaccacca 240gtcactcttg gacatgaaat gtctggggtt gtaaatttta taggtgaagg agtaagcgga 300cttaaagtag gtgaccatgt cgttgtcgaa ccttatatcg ttcccgaagg gactgataca 360agtgaaactg gacattataa cctctcagaa ggctcaaact ttattggttt gggcggaaat 420ggtggaggtt tggctgaaaa aatttctgtt gatgaacgtt gggttcacaa aattcctgat 480aacttaccat tggatgaagc tgctctaatt gagccactat cagtcggcta tcacgctgtt 540gaacgagcaa atttaagtga aaagagtacg gtattagttg ttggtgctgg accaattgga 600ctattaactg ctgccgttgc aaaagcgcaa ggacatactg ttatcatcag tgaacctagt 660ggacttcgtc gtaaaaaagc acaagaagca caagttgctg attatttctt caatccaatt 720gaagatgaca ttcaagctaa agttcatgaa attaatgaaa aaggagtgga cgcagccttt 780gaatgtacct ctgtccaacc gggatttgac gcttgtctag atgcgattcg tatgggtgga 840acagttgtca ttgtcgcaat ttggggcaag cctgctagtg ttgatatggc aaaattagta 900atcaaagaag ctaacctttt aggaacgatt gcttataata acactcatcc aaaaacaatt 960gatttagtat caacaggtaa aataaaattg gaccaattca tcacagctaa aatcggtttg 1020gatgatttga ttgataaagg attcgatacg ctgattcatc ataatgaaac agctgttaaa 1080attttagttt caccaactgg taaaggtcta taa 111330370PRTLactococcus lactis 30Met Pro Glu Thr Thr Thr Ile Leu Tyr Arg Gly Gly Val Phe Met Arg1 5 10 15Ala Ala Arg Phe Tyr Asp Arg Gly Asp Ile Arg Ile Asp Glu Ile Asn 20 25 30Glu Pro Ile Val Lys Ala Gly Gln Val Gly Ile Asp Val Ala Trp Cys 35 40 45Gly Ile Cys Gly Thr Asp Leu His Glu Phe Leu Asp Gly Pro Ile Phe 50 55 60Cys Pro Ser Ala Glu His Pro Asn Pro Ile Thr Gly Glu Val Pro Pro65 70 75 80Val Thr Leu Gly His Glu Met Ser Gly Val Val Asn Phe Ile Gly Glu 85 90 95Gly Val Ser Gly Leu Lys Val Gly Asp His Val Val Val Glu Pro Tyr 100 105 110Ile Val Pro Glu Gly Thr Asp Thr Ser Glu Thr Gly His Tyr Asn Leu 115 120 125Ser Glu Gly Ser Asn Phe Ile Gly Leu Gly Gly Asn Gly Gly Gly Leu 130 135 140Ala Glu Lys Ile Ser Val Asp Glu Arg Trp Val His Lys Ile Pro Asp145 150 155 160Asn Leu Pro Leu Asp Glu Ala Ala Leu Ile Glu Pro Leu Ser Val Gly 165 170 175Tyr His Ala Val Glu Arg Ala Asn Leu Ser Glu Lys Ser Thr Val Leu 180 185 190Val Val Gly Ala Gly Pro Ile Gly Leu Leu Thr Ala Ala Val Ala Lys 195 200 205Ala Gln Gly His Thr Val Ile Ile Ser Glu Pro Ser Gly Leu Arg Arg 210 215 220Lys Lys Ala Gln Glu Ala Gln Val Ala Asp Tyr Phe Phe Asn Pro Ile225 230 235 240Glu Asp Asp Ile Gln Ala Lys Val His Glu Ile Asn Glu Lys Gly Val 245 250 255Asp Ala Ala Phe Glu Cys Thr Ser Val Gln Pro Gly Phe Asp Ala Cys 260 265 270Leu Asp Ala Ile Arg Met Gly Gly Thr Val Val Ile Val Ala Ile Trp 275 280 285Gly Lys Pro Ala Ser Val Asp Met Ala Lys Leu Val Ile Lys Glu Ala 290 295 300Asn Leu Leu Gly Thr Ile Ala Tyr Asn Asn Thr His Pro Lys Thr Ile305 310 315 320Asp Leu Val Ser Thr Gly Lys Ile Lys Leu Asp Gln Phe Ile Thr Ala 325 330 335Lys Ile Gly Leu Asp Asp Leu Ile Asp Lys Gly Phe Asp Thr Leu Ile 340 345 350His His Asn Glu Thr Ala Val Lys Ile Leu Val Ser Pro Thr Gly Lys 355 360 365Gly Leu 370312532DNARoseburia inulinivorans 31atgggcaatt acgattcaac accgatagct aaaagtgata ggattaaaag attggttgat 60catttgtatg ctaaaatgcc tgaaattgag gccgctagag cagagctaat tactgaatcc 120tttaaggcca ccgaaggtca acctgttgtt atgagaaagg ctagagcttt tgaacatata 180ctaaagaatt tgccaattat cataagacca gaagaactga ttgttggctc aactacaatt 240gcccctagag gttgccaaac gtatccagaa ttctcatacg agtggttaga ggctgaattt 300gaaactgtcg aaacgcgttc agctgaccca ttttatattt cagaagaaac gaagaaacgt 360ttgctggctg ccgatgctta ttggaaaggt aaaacaacct cagagttggc aacttcatat 420atggccccag aaactctaag agccatgaag cataacttct tcacccctgg aaactacttc 480tacaatggtg tcggtcatgt cacagttcaa tatgaaacag tattagcaat cggcttgaat 540ggagtaaaag agaaggttag gaaagagatg gagaattgtc attttggtga tgccgattat 600agtacaaaga tgtgtttctt ggagagcatt ttaatatcgt gtgatgccgt aatcacttat 660gctaatagat atgccaagat ggccgaggaa atggctgaaa aagaaacaga tgctgcaagg 720aggcaagaac tattaacaat cgccagggtt tgcaaaaacg ttcctgaatt cccagccgaa 780agcttccagg aggcctgcca atccttttgg ttcatacaac aagtgcttca aattgaatcc 840agtggtcatt caatttcccc aggtagattt gatcaatata tgtatcctta ttacgaaaag 900gatttaaagg aaggtagctt aactagggaa tatgctcagg aactgatcga ttgtatctgg 960gttaagttaa atgatctgaa taagtgcagg gatgctgcct ctgctgaggg ctttgcagga 1020tattccttat ttcaaaactt aatcgttggg ggccaaacgg ttcaaggaag ggacgccacc 1080aatgatttga gttttatgtg tatcacggca tctgaacacg tctttttacc gatgccgtcg 1140ttgtctataa gagtttggca tggtagttcc aaagcactgc ttatgagagc agctgaattg 1200actagaaccg gtataggctt acctgcttat tacaatgatg aagtcatcat accagctttg 1260gtgcataggg gtgctactat ggatgaagca agaaattaca acataatagg atgtgtcgaa 1320ccgcaggttc ctggtaaaac tgatggctgg cacgatgcag cattctttaa catgtgcaga 1380cctttggaaa tggtgtttag taatggttat gataacggtg aaattgcatc tatacaaact 1440ggtaacgtag aatcttttca gagttttgat gagtttatgg aagcttacag aaaacaaatg 1500ctatataaca tagaacttat ggtaaatgcc gacaacgcga tagattatgc ccacgcaaag 1560ttggccccat tgccatttga gtcatgtttg gttgatgact gtataaagag aggaatgtcc 1620gctcaggaag gcggcgcaat ctataatttc actggtccac agggctttgg tattgcaaac 1680gttgctgata gcttgtatac gattaagaaa ttggtgttcg aggagaagag aattacgatg 1740ggtgaattaa agaaagcgtt ggaaatgaat tatggtaagg gtttggatgc cacaaccgct 1800ggtgacatcg caatgcaggt cgcgaaggga ctaaaagatg ccggacagga agtgggtccc 1860gacgtgatcg ctaatacaat ccgtcaagtt cttgaaatgg aattaccaga agatgtaaga 1920aagagatatg aagagatcca tgaaatgata cttgagttac caaagtatgg taatgatata 1980gatgaagttg atgaattagc tagagaagca gcttactttt acacaagacc attagaaact 2040tttaagaatc caaggggtgg catgtatcaa gccggccttt atcccgtgtc cgctaatgtg 2100ccactaggcg ctcaaacggg ggccacaccc gatggacgtt tggcgcatac acccgtggcg 2160gatggcgttg gtccgacatc aggcttcgat atatccggac caacagcttc ttgcaattct 2220gtcgccaagt tggatcatgc tatagcctct aatggtacct tatttaatat gaagatgcac 2280ccaaccgcaa tggcaggtga aaagggctta gaatccttca tatcgttgat ccgtggttat 2340ttcgatcaac aaggtatgca catgcaattt aacgtagtag acagggctac actgcttgat 2400gcgcaggccc accctgaaaa gtattcaggc ttaattgtca gagtggcagg ttattctgcc 2460ctttttacca cattgtccaa gtcattacaa gatgatataa tcaaacgtac cgaacaagca 2520gacaatagat ag 253232843PRTRoseburia inulinivorans 32Met Gly Asn Tyr Asp Ser Thr Pro Ile Ala Lys Ser Asp Arg Ile Lys1 5 10 15Arg Leu Val Asp His Leu Tyr Ala Lys Met Pro Glu Ile Glu Ala Ala 20 25 30Arg Ala Glu Leu Ile Thr Glu Ser Phe Lys Ala Thr Glu Gly Gln Pro 35 40 45Val Val Met Arg Lys Ala Arg Ala Phe Glu His Ile Leu Lys Asn Leu 50 55 60Pro Ile Ile Ile Arg Pro Glu Glu Leu Ile Val Gly Ser Thr Thr Ile65 70 75 80Ala Pro Arg Gly Cys Gln Thr Tyr Pro Glu Phe Ser Tyr Glu Trp Leu 85 90 95Glu Ala Glu Phe Glu Thr Val Glu Thr Arg Ser Ala Asp Pro Phe Tyr 100 105 110Ile Ser Glu Glu Thr Lys Lys Arg Leu Leu Ala Ala Asp Ala Tyr Trp 115 120 125Lys Gly Lys Thr Thr Ser Glu Leu Ala Thr Ser Tyr Met Ala Pro Glu 130 135 140Thr Leu Arg Ala Met Lys His Asn Phe Phe Thr Pro Gly Asn Tyr Phe145 150 155 160Tyr Asn Gly Val Gly His Val Thr Val Gln Tyr Glu Thr Val Leu Ala 165 170 175Ile Gly Leu Asn Gly Val Lys Glu Lys Val Arg Lys Glu Met Glu Asn 180 185 190Cys His Phe Gly Asp Ala Asp Tyr Ser Thr Lys Met Cys Phe Leu Glu 195 200 205Ser Ile Leu Ile Ser Cys Asp Ala Val Ile Thr Tyr Ala Asn Arg Tyr 210 215 220Ala Lys Met Ala Glu Glu Met Ala Glu Lys Glu Thr Asp Ala Ala Arg225 230 235 240Arg Gln Glu Leu Leu Thr Ile Ala Arg Val Cys Lys Asn Val Pro Glu 245 250 255Phe Pro Ala Glu Ser Phe Gln Glu Ala Cys Gln Ser Phe Trp Phe Ile 260 265 270Gln Gln Val Leu Gln Ile Glu Ser Ser Gly His Ser Ile Ser Pro Gly 275 280 285Arg Phe Asp Gln Tyr Met Tyr Pro Tyr Tyr Glu Lys Asp Leu Lys Glu 290 295 300Gly Ser Leu Thr Arg Glu Tyr Ala Gln Glu Leu Ile Asp Cys Ile Trp305 310 315 320Val Lys Leu Asn Asp Leu Asn Lys Cys Arg Asp Ala Ala Ser Ala Glu 325 330 335Gly Phe Ala Gly Tyr Ser Leu Phe Gln Asn Leu Ile Val Gly Gly Gln 340 345 350Thr Val Gln Gly Arg Asp Ala Thr Asn Asp Leu Ser Phe Met Cys Ile 355 360 365Thr Ala Ser Glu His Val Phe Leu Pro Met Pro Ser Leu Ser Ile Arg 370 375 380Val Trp His Gly Ser Ser Lys Ala Leu Leu Met Arg Ala Ala Glu Leu385 390 395 400Thr Arg Thr Gly Ile Gly Leu Pro Ala Tyr Tyr Asn Asp Glu Val Ile 405 410 415Ile Pro Ala Leu Val His Arg Gly Ala Thr Met Asp Glu Ala Arg Asn 420 425 430Tyr Asn Ile Ile Gly Cys Val Glu Pro Gln Val Pro Gly Lys Thr Asp 435 440 445Gly Trp His Asp Ala Ala Phe Phe Asn Met Cys Arg Pro Leu Glu Met 450 455 460Val Phe Ser Asn Gly Tyr Asp Asn Gly Glu Ile Ala Ser Ile Gln Thr465 470 475 480Gly Asn Val Glu Ser Phe Gln Ser Phe Asp Glu Phe Met Glu Ala Tyr 485 490 495Arg Lys Gln Met Leu Tyr Asn Ile Glu Leu Met Val Asn Ala Asp Asn 500 505 510Ala Ile Asp Tyr Ala His Ala Lys Leu Ala Pro Leu Pro Phe Glu Ser 515 520 525Cys Leu Val Asp Asp Cys Ile Lys Arg Gly Met Ser Ala Gln Glu Gly 530 535 540Gly Ala Ile Tyr Asn Phe Thr Gly Pro Gln Gly Phe Gly Ile Ala Asn545 550 555 560Val Ala Asp Ser Leu Tyr Thr Ile Lys Lys Leu Val Phe Glu Glu Lys 565 570 575Arg Ile Thr Met Gly Glu Leu Lys Lys Ala Leu Glu Met Asn Tyr Gly 580 585 590Lys Gly Leu Asp Ala Thr Thr Ala Gly Asp Ile Ala Met Gln Val Ala 595 600 605Lys Gly Leu Lys Asp Ala Gly Gln Glu Val Gly Pro Asp Val Ile Ala 610 615 620Asn Thr Ile Arg Gln Val Leu Glu Met Glu Leu Pro Glu Asp Val Arg625 630 635 640Lys Arg Tyr Glu Glu Ile His Glu Met Ile Leu Glu Leu Pro Lys Tyr 645 650 655Gly Asn Asp Ile Asp Glu Val Asp Glu Leu Ala Arg Glu Ala Ala Tyr 660 665 670Phe Tyr Thr Arg Pro Leu Glu Thr Phe Lys Asn Pro Arg Gly Gly Met 675 680 685Tyr Gln Ala Gly Leu Tyr Pro Val Ser Ala Asn Val Pro Leu Gly Ala 690 695 700Gln Thr Gly Ala Thr Pro Asp Gly Arg Leu Ala His Thr Pro Val Ala705 710 715 720Asp Gly Val Gly Pro Thr Ser Gly Phe Asp Ile Ser Gly Pro Thr Ala 725 730 735Ser Cys Asn Ser Val Ala Lys Leu Asp His Ala Ile Ala Ser Asn Gly 740 745 750Thr Leu Phe Asn Met Lys Met His Pro Thr Ala Met Ala Gly Glu Lys 755 760 765Gly Leu Glu Ser Phe Ile Ser Leu Ile Arg Gly Tyr Phe Asp Gln Gln 770 775 780Gly Met His Met Gln Phe Asn Val Val Asp Arg Ala Thr Leu Leu Asp785 790 795 800Ala Gln Ala His Pro Glu Lys Tyr Ser Gly Leu Ile Val Arg Val Ala 805 810 815Gly Tyr Ser Ala Leu Phe Thr Thr Leu Ser Lys Ser Leu Gln Asp Asp 820 825 830Ile Ile Lys Arg Thr Glu Gln Ala Asp Asn Arg 835 84033794DNARoseburia inulinivorans 33atgaaagaat atcttaatac ttcaggtaga atatttgata tccagaggta ttctattcac 60gatggccctg gtgtgcgtac aattgtgttt ctaaaaggtt gtgcccttag atgcagatgg 120tgctgtaatc ctgaaagcca aagcttcgaa gttgaaacaa tgacgattaa tggaaaacct 180aaagtcatgg gtaaagatgt tacagtcgcc gaggttatga agacggtaga aagagacatg 240ccttattacc ttcaatcagg tggtggtatc accttatcgg gtggcgaatg tactttgcaa 300ccagaatttt cccttggcct attgagagct gcaaaggatt tgggcatatc cacggcaata 360gagagcatgg cgtacgcaaa gtacgaagta atagaaactc ttcttccgta tttggatacg 420tatttaatgg acatcaaaca tatgaatcct gagaaacata aagaatacac tggtcatgat 480aacttgagga tgttagaaaa cgccttaaga gtcgcgcatt ctggtcagac cgaactgatc 540atcagagtac ctgtcatccc aggattcaac gcaactgagc aggaactact agatattgca 600aaattcgcag atacactgcc tggagttaga caaatacaca tcttgccata tcataatttt 660ggtcagggta aatacgaagg attgaacagg gactatccga tgggggacac tgagaaaccc 720tctaatgaac agatgaaagc ttttcaagaa atgattcaaa agaacacttc cctacattgc 780caaatcggtg gtta 79434264PRTroseburia inulinivorans 34Met Lys Glu Tyr Leu Asn Thr Ser Gly Arg Ile Phe Asp Ile Gln Arg1 5 10

15Tyr Ser Ile His Asp Gly Pro Gly Val Arg Thr Ile Val Phe Leu Lys 20 25 30Gly Cys Ala Leu Arg Cys Arg Trp Cys Cys Asn Pro Glu Ser Gln Ser 35 40 45Phe Glu Val Glu Thr Met Thr Ile Asn Gly Lys Pro Lys Val Met Gly 50 55 60Lys Asp Val Thr Val Ala Glu Val Met Lys Thr Val Glu Arg Asp Met65 70 75 80Pro Tyr Tyr Leu Gln Ser Gly Gly Gly Ile Thr Leu Ser Gly Gly Glu 85 90 95Cys Thr Leu Gln Pro Glu Phe Ser Leu Gly Leu Leu Arg Ala Ala Lys 100 105 110Asp Leu Gly Ile Ser Thr Ala Ile Glu Ser Met Ala Tyr Ala Lys Tyr 115 120 125Glu Val Ile Glu Thr Leu Leu Pro Tyr Leu Asp Thr Tyr Leu Met Asp 130 135 140Ile Lys His Met Asn Pro Glu Lys His Lys Glu Tyr Thr Gly His Asp145 150 155 160Asn Leu Arg Met Leu Glu Asn Ala Leu Arg Val Ala His Ser Gly Gln 165 170 175Thr Glu Leu Ile Ile Arg Val Pro Val Ile Pro Gly Phe Asn Ala Thr 180 185 190Glu Gln Glu Leu Leu Asp Ile Ala Lys Phe Ala Asp Thr Leu Pro Gly 195 200 205Val Arg Gln Ile His Ile Leu Pro Tyr His Asn Phe Gly Gln Gly Lys 210 215 220Tyr Glu Gly Leu Asn Arg Asp Tyr Pro Met Gly Asp Thr Glu Lys Pro225 230 235 240Ser Asn Glu Gln Met Lys Ala Phe Gln Glu Met Ile Gln Lys Asn Thr 245 250 255Ser Leu His Cys Gln Ile Gly Gly 260351047DNAAchromobacter xylosoxidans 35atgaaagctc tggtttatca cggtgaccac aagatctcgc ttgaagacaa gcccaagccc 60acccttcaaa agcccacgga tgtagtagta cgggttttga agaccacgat ctgcggcacg 120gatctcggca tctacaaagg caagaatcca gaggtcgccg acgggcgcat cctgggccat 180gaaggggtag gcgtcatcga ggaagtgggc gagagtgtca cgcagttcaa gaaaggcgac 240aaggtcctga tttcctgcgt cacttcttgc ggctcgtgcg actactgcaa gaagcagctt 300tactcccatt gccgcgacgg cgggtggatc ctgggttaca tgatcgatgg cgtgcaggcc 360gaatacgtcc gcatcccgca tgccgacaac agcctctaca agatccccca gacaattgac 420gacgaaatcg ccgtcctgct gagcgacatc ctgcccaccg gccacgaaat cggcgtccag 480tatgggaatg tccagccggg cgatgcggtg gctattgtcg gcgcgggccc cgtcggcatg 540tccgtactgt tgaccgccca gttctactcc ccctcgacca tcatcgtgat cgacatggac 600gagaatcgcc tccagctcgc caaggagctc ggggcaacgc acaccatcaa ctccggcacg 660gagaacgttg tcgaagccgt gcataggatt gcggcagagg gagtcgatgt tgcgatcgag 720gcggtgggca taccggcgac ttgggacatc tgccaggaga tcgtcaagcc cggcgcgcac 780atcgccaacg tcggcgtgca tggcgtcaag gttgacttcg agattcagaa gctctggatc 840aagaacctga cgatcaccac gggactggtg aacacgaaca cgacgcccat gctgatgaag 900gtcgcctcga ccgacaagct tccgttgaag aagatgatta cccatcgctt cgagctggcc 960gagatcgagc acgcctatca ggtattcctc aatggcgcca aggagaaggc gatgaagatc 1020atcctctcga acgcaggcgc tgcctga 104736348PRTAchromobacter xylosoxidans 36Met Lys Ala Leu Val Tyr His Gly Asp His Lys Ile Ser Leu Glu Asp1 5 10 15Lys Pro Lys Pro Thr Leu Gln Lys Pro Thr Asp Val Val Val Arg Val 20 25 30Leu Lys Thr Thr Ile Cys Gly Thr Asp Leu Gly Ile Tyr Lys Gly Lys 35 40 45Asn Pro Glu Val Ala Asp Gly Arg Ile Leu Gly His Glu Gly Val Gly 50 55 60Val Ile Glu Glu Val Gly Glu Ser Val Thr Gln Phe Lys Lys Gly Asp65 70 75 80Lys Val Leu Ile Ser Cys Val Thr Ser Cys Gly Ser Cys Asp Tyr Cys 85 90 95Lys Lys Gln Leu Tyr Ser His Cys Arg Asp Gly Gly Trp Ile Leu Gly 100 105 110Tyr Met Ile Asp Gly Val Gln Ala Glu Tyr Val Arg Ile Pro His Ala 115 120 125Asp Asn Ser Leu Tyr Lys Ile Pro Gln Thr Ile Asp Asp Glu Ile Ala 130 135 140Val Leu Leu Ser Asp Ile Leu Pro Thr Gly His Glu Ile Gly Val Gln145 150 155 160Tyr Gly Asn Val Gln Pro Gly Asp Ala Val Ala Ile Val Gly Ala Gly 165 170 175Pro Val Gly Met Ser Val Leu Leu Thr Ala Gln Phe Tyr Ser Pro Ser 180 185 190Thr Ile Ile Val Ile Asp Met Asp Glu Asn Arg Leu Gln Leu Ala Lys 195 200 205Glu Leu Gly Ala Thr His Thr Ile Asn Ser Gly Thr Glu Asn Val Val 210 215 220Glu Ala Val His Arg Ile Ala Ala Glu Gly Val Asp Val Ala Ile Glu225 230 235 240Ala Val Gly Ile Pro Ala Thr Trp Asp Ile Cys Gln Glu Ile Val Lys 245 250 255Pro Gly Ala His Ile Ala Asn Val Gly Val His Gly Val Lys Val Asp 260 265 270Phe Glu Ile Gln Lys Leu Trp Ile Lys Asn Leu Thr Ile Thr Thr Gly 275 280 285Leu Val Asn Thr Asn Thr Thr Pro Met Leu Met Lys Val Ala Ser Thr 290 295 300Asp Lys Leu Pro Leu Lys Lys Met Ile Thr His Arg Phe Glu Leu Ala305 310 315 320Glu Ile Glu His Ala Tyr Gln Val Phe Leu Asn Gly Ala Lys Glu Lys 325 330 335Ala Met Lys Ile Ile Leu Ser Asn Ala Gly Ala Ala 340 345371188DNASaccharomyces cerevisiae 37atgttgagaa ctcaagccgc cagattgatc tgcaactccc gtgtcatcac tgctaagaga 60acctttgctt tggccacccg tgctgctgct tacagcagac cagctgcccg tttcgttaag 120ccaatgatca ctacccgtgg tttgaagcaa atcaacttcg gtggtactgt tgaaaccgtc 180tacgaaagag ctgactggcc aagagaaaag ttgttggact acttcaagaa cgacactttt 240gctttgatcg gttacggttc ccaaggttac ggtcaaggtt tgaacttgag agacaacggt 300ttgaacgtta tcattggtgt ccgtaaagat ggtgcttctt ggaaggctgc catcgaagac 360ggttgggttc caggcaagaa cttgttcact gttgaagatg ctatcaagag aggtagttac 420gttatgaact tgttgtccga tgccgctcaa tcagaaacct ggcctgctat caagccattg 480ttgaccaagg gtaagacttt gtacttctcc cacggtttct ccccagtctt caaggacttg 540actcacgttg aaccaccaaa ggacttagat gttatcttgg ttgctccaaa gggttccggt 600agaactgtca gatctttgtt caaggaaggt cgtggtatta actcttctta cgccgtctgg 660aacgatgtca ccggtaaggc tcacgaaaag gcccaagctt tggccgttgc cattggttcc 720ggttacgttt accaaaccac tttcgaaaga gaagtcaact ctgacttgta cggtgaaaga 780ggttgtttaa tgggtggtat ccacggtatg ttcttggctc aatacgacgt cttgagagaa 840aacggtcact ccccatctga agctttcaac gaaaccgtcg aagaagctac ccaatctcta 900tacccattga tcggtaagta cggtatggat tacatgtacg atgcttgttc caccaccgcc 960agaagaggtg ctttggactg gtacccaatc ttcaagaatg ctttgaagcc tgttttccaa 1020gacttgtacg aatctaccaa gaacggtacc gaaaccaaga gatctttgga attcaactct 1080caacctgact acagagaaaa gctagaaaag gaattagaca ccatcagaaa catggaaatc 1140tggaaggttg gtaaggaagt cagaaagttg agaccagaaa accaataa 118838395PRTSaccharomyces cerevisiae 38Met Leu Arg Thr Gln Ala Ala Arg Leu Ile Cys Asn Ser Arg Val Ile1 5 10 15Thr Ala Lys Arg Thr Phe Ala Leu Ala Thr Arg Ala Ala Ala Tyr Ser 20 25 30Arg Pro Ala Ala Arg Phe Val Lys Pro Met Ile Thr Thr Arg Gly Leu 35 40 45Lys Gln Ile Asn Phe Gly Gly Thr Val Glu Thr Val Tyr Glu Arg Ala 50 55 60Asp Trp Pro Arg Glu Lys Leu Leu Asp Tyr Phe Lys Asn Asp Thr Phe65 70 75 80Ala Leu Ile Gly Tyr Gly Ser Gln Gly Tyr Gly Gln Gly Leu Asn Leu 85 90 95Arg Asp Asn Gly Leu Asn Val Ile Ile Gly Val Arg Lys Asp Gly Ala 100 105 110Ser Trp Lys Ala Ala Ile Glu Asp Gly Trp Val Pro Gly Lys Asn Leu 115 120 125Phe Thr Val Glu Asp Ala Ile Lys Arg Gly Ser Tyr Val Met Asn Leu 130 135 140Leu Ser Asp Ala Ala Gln Ser Glu Thr Trp Pro Ala Ile Lys Pro Leu145 150 155 160Leu Thr Lys Gly Lys Thr Leu Tyr Phe Ser His Gly Phe Ser Pro Val 165 170 175Phe Lys Asp Leu Thr His Val Glu Pro Pro Lys Asp Leu Asp Val Ile 180 185 190Leu Val Ala Pro Lys Gly Ser Gly Arg Thr Val Arg Ser Leu Phe Lys 195 200 205Glu Gly Arg Gly Ile Asn Ser Ser Tyr Ala Val Trp Asn Asp Val Thr 210 215 220Gly Lys Ala His Glu Lys Ala Gln Ala Leu Ala Val Ala Ile Gly Ser225 230 235 240Gly Tyr Val Tyr Gln Thr Thr Phe Glu Arg Glu Val Asn Ser Asp Leu 245 250 255Tyr Gly Glu Arg Gly Cys Leu Met Gly Gly Ile His Gly Met Phe Leu 260 265 270Ala Gln Tyr Asp Val Leu Arg Glu Asn Gly His Ser Pro Ser Glu Ala 275 280 285Phe Asn Glu Thr Val Glu Glu Ala Thr Gln Ser Leu Tyr Pro Leu Ile 290 295 300Gly Lys Tyr Gly Met Asp Tyr Met Tyr Asp Ala Cys Ser Thr Thr Ala305 310 315 320Arg Arg Gly Ala Leu Asp Trp Tyr Pro Ile Phe Lys Asn Ala Leu Lys 325 330 335Pro Val Phe Gln Asp Leu Tyr Glu Ser Thr Lys Asn Gly Thr Glu Thr 340 345 350Lys Arg Ser Leu Glu Phe Asn Ser Gln Pro Asp Tyr Arg Glu Lys Leu 355 360 365Glu Lys Glu Leu Asp Thr Ile Arg Asn Met Glu Ile Trp Lys Val Gly 370 375 380Lys Glu Val Arg Lys Leu Arg Pro Glu Asn Gln385 390 395391485DNAVibrio cholerae 39atggcgaatt atttcaatac gctgaatctg cgtgaacagt tggatcaact tggtcgttgc 60cgttttatgg cgcgagaaga gtttgcaacc gaagctgatt acctaaaagg taagaaagtg 120gtgatcgtag gttgtggggc tcaaggccta aaccaaggcc tcaatatgcg tgattcaggt 180ttggatgttt cttacgctct gcgtcaggct gcgattgatg aacagcgtca gtcatttaag 240aatgccaaga ataatggctt caacgtgggt agttatgaac aactcatccc aaccgcagat 300ttggtgatta acttgacgcc agacaagcag cacaccagtg tggtcaatgc ggtgatgcct 360ctgatgaagc aaggtgctgc cttgggttac tcacacggtt ttaatatcgt tgaagagggc 420atgcagatcc gtaaagacat cacggttgtg atggtggcac caaaatgtcc gggtacggaa 480gttcgtgaag agtataagcg cggtttcggc gttcctactc ttatcgcggt acaccctgaa 540aacgatccac aaggtgaagg ttgggaaatt gctaaagcgt gggctgcggc aacgggtggc 600catcgtgcgg gctgtttagc ttcttctttt gtggcggaag tgaaatccga tttgatgggt 660gagcaaacca ttctctgcgg tatgctgcaa gcgggctcta tcgtttgtta cgagaaaatg 720gttgctgatg gcatcgaccc tggttatgcg ggcaagcttt tgcaatttgg ttgggaaacc 780attaccgaag cactcaagtt tggcggtatt actcatatga tggatcgcct gtctaaccct 840gcaaaaatca aagcgtttga gctgtctgaa gagttgaaag atctgatgcg cccactgtac 900aacaagcata tggatgacat catttctggc cacttctcta gcaccatgat ggcggattgg 960gcgaatgatg ataaagactt attcggctgg cgtgcagaaa ccgctgagac gacctttgaa 1020aactatccaa caaccgacgt aaaaattgct gagcaagaat actttgataa cggtattttg 1080atgattgcca tggtgcgtgc tggggttgag ttggcgtttg aagcgatgac ggcttcaggc 1140atcatcgatg agtcggctta ctatgaatca ctgcacgaac tcccactgat tgccaatacg 1200gtagcgcgta agcgtctgta tgaaatgaac gtggtaatct ctgacactgc tgagtacggt 1260aactatctgt ttgccaatgt ggcggtacca ctattgcgtg aaaagtttat gccgaaagtg 1320ggcactgatg tgattggtaa aggattaggc gtggtctcta atcaagttga taacgcaacg 1380cttatcgaag taaacagcat catccgtaac catccggttg agtatatcgg tgaagagcta 1440cgcggttaca tgaaagacat gaagcgcatc gccgtgggtg attaa 148540494PRTVibrio cholerae 40Met Ala Asn Tyr Phe Asn Thr Leu Asn Leu Arg Glu Gln Leu Asp Gln1 5 10 15Leu Gly Arg Cys Arg Phe Met Ala Arg Glu Glu Phe Ala Thr Glu Ala 20 25 30Asp Tyr Leu Lys Gly Lys Lys Val Val Ile Val Gly Cys Gly Ala Gln 35 40 45Gly Leu Asn Gln Gly Leu Asn Met Arg Asp Ser Gly Leu Asp Val Ser 50 55 60Tyr Ala Leu Arg Gln Ala Ala Ile Asp Glu Gln Arg Gln Ser Phe Lys65 70 75 80Asn Ala Lys Asn Asn Gly Phe Asn Val Gly Ser Tyr Glu Gln Leu Ile 85 90 95Pro Thr Ala Asp Leu Val Ile Asn Leu Thr Pro Asp Lys Gln His Thr 100 105 110Ser Val Val Asn Ala Val Met Pro Leu Met Lys Gln Gly Ala Ala Leu 115 120 125Gly Tyr Ser His Gly Phe Asn Ile Val Glu Glu Gly Met Gln Ile Arg 130 135 140Lys Asp Ile Thr Val Val Met Val Ala Pro Lys Cys Pro Gly Thr Glu145 150 155 160Val Arg Glu Glu Tyr Lys Arg Gly Phe Gly Val Pro Thr Leu Ile Ala 165 170 175Val His Pro Glu Asn Asp Pro Gln Gly Glu Gly Trp Glu Ile Ala Lys 180 185 190Ala Trp Ala Ala Ala Thr Gly Gly His Arg Ala Gly Cys Leu Ala Ser 195 200 205Ser Phe Val Ala Glu Val Lys Ser Asp Leu Met Gly Glu Gln Thr Ile 210 215 220Leu Cys Gly Met Leu Gln Ala Gly Ser Ile Val Cys Tyr Glu Lys Met225 230 235 240Val Ala Asp Gly Ile Asp Pro Gly Tyr Ala Gly Lys Leu Leu Gln Phe 245 250 255Gly Trp Glu Thr Ile Thr Glu Ala Leu Lys Phe Gly Gly Ile Thr His 260 265 270Met Met Asp Arg Leu Ser Asn Pro Ala Lys Ile Lys Ala Phe Glu Leu 275 280 285Ser Glu Glu Leu Lys Asp Leu Met Arg Pro Leu Tyr Asn Lys His Met 290 295 300Asp Asp Ile Ile Ser Gly His Phe Ser Ser Thr Met Met Ala Asp Trp305 310 315 320Ala Asn Asp Asp Lys Asp Leu Phe Gly Trp Arg Ala Glu Thr Ala Glu 325 330 335Thr Thr Phe Glu Asn Tyr Pro Thr Thr Asp Val Lys Ile Ala Glu Gln 340 345 350Glu Tyr Phe Asp Asn Gly Ile Leu Met Ile Ala Met Val Arg Ala Gly 355 360 365Val Glu Leu Ala Phe Glu Ala Met Thr Ala Ser Gly Ile Ile Asp Glu 370 375 380Ser Ala Tyr Tyr Glu Ser Leu His Glu Leu Pro Leu Ile Ala Asn Thr385 390 395 400Val Ala Arg Lys Arg Leu Tyr Glu Met Asn Val Val Ile Ser Asp Thr 405 410 415Ala Glu Tyr Gly Asn Tyr Leu Phe Ala Asn Val Ala Val Pro Leu Leu 420 425 430Arg Glu Lys Phe Met Pro Lys Val Gly Thr Asp Val Ile Gly Lys Gly 435 440 445Leu Gly Val Val Ser Asn Gln Val Asp Asn Ala Thr Leu Ile Glu Val 450 455 460Asn Ser Ile Ile Arg Asn His Pro Val Glu Tyr Ile Gly Glu Glu Leu465 470 475 480Arg Gly Tyr Met Lys Asp Met Lys Arg Ile Ala Val Gly Asp 485 490411014DNAPseudomonas aeruginosa 41atgcgcgttt tctacgataa agactgtgac ctctcgatca tccagggcaa gaaagttgcc 60atcatcggct acggctccca gggccacgcc catgcctgca acctgaagga ctccggcgtc 120gacgtcaccg tgggcctgcg tagcggctcc gccaccgtgg ccaaggccga agcgcacggt 180ctgaaggttg ccgacgtgaa gaccgccgtc gccgcagccg acgtggtcat gatcctcacc 240ccggacgagt tccagggccg cctgtacaag gaagagatcg agccgaacct gaagaagggc 300gccaccctgg ccttcgctca cggcttctcc atccactaca accaggtcgt cccgcgcgcc 360gacctcgacg tgatcatgat cgcgccgaag gcaccgggtc acaccgtgcg ttccgagttc 420gtcaagggcg gtggcatccc tgacctgatc gccatctacc aggacgcttc cggcaacgcc 480aagaacgtcg ccctgtccta cgcctgcggc gtcggcggcg gtcgtaccgg tatcatcgaa 540accaccttca aggacgagac cgaaaccgac ctgttcggtg agcaggccgt tctctgcggt 600ggttgcgtcg agctggtcaa ggccggtttc gaaaccctgg tcgaagccgg ttacgcgccg 660gaaatggcct acttcgagtg cctgcacgag ctgaagctga tcgtcgacct gatgtacgaa 720ggcggcatcg ccaacatgaa ctactccatc tccaacaatg ccgaatacgg tgagtacgta 780accggtccgg aggtgatcaa cgccgagtcc cgtgctgcca tgcgcaacgc cctgaagcgc 840atccaggacg gcgagtacgc gaaaatgttc attaccgaag gtgcggccaa ctacccgtcg 900atgactgcct accgccgcaa caacgccgct cacccgatcg agcagatcgg cgagaagctg 960cgcgcgatga tgccgtggat cgcagccaac aagatcgtcg acaagagcaa gaac 101442338PRTPseudomonas aeruginosa 42Met Arg Val Phe Tyr Asp Lys Asp Cys Asp Leu Ser Ile Ile Gln Gly1 5 10 15Lys Lys Val Ala Ile Ile Gly Tyr Gly Ser Gln Gly His Ala His Ala 20 25 30Cys Asn Leu Lys Asp Ser Gly Val Asp Val Thr Val Gly Leu Arg Ser 35 40 45Gly Ser Ala Thr Val Ala Lys Ala Glu Ala His Gly Leu Lys Val Ala 50 55 60Asp Val Lys Thr Ala Val Ala Ala Ala Asp Val Val Met Ile Leu Thr65 70 75 80Pro Asp Glu Phe Gln Gly Arg Leu Tyr Lys Glu Glu Ile Glu Pro Asn 85 90 95Leu Lys Lys Gly Ala Thr Leu Ala Phe Ala His Gly Phe Ser Ile His 100 105 110Tyr Asn Gln Val Val Pro Arg Ala Asp Leu Asp Val Ile Met Ile Ala 115 120 125Pro Lys Ala Pro Gly His Thr Val Arg Ser Glu Phe Val Lys Gly Gly 130 135 140Gly Ile Pro Asp Leu Ile Ala Ile Tyr Gln Asp Ala Ser Gly Asn Ala145 150 155 160Lys Asn Val Ala Leu Ser Tyr

Ala Cys Gly Val Gly Gly Gly Arg Thr 165 170 175Gly Ile Ile Glu Thr Thr Phe Lys Asp Glu Thr Glu Thr Asp Leu Phe 180 185 190Gly Glu Gln Ala Val Leu Cys Gly Gly Cys Val Glu Leu Val Lys Ala 195 200 205Gly Phe Glu Thr Leu Val Glu Ala Gly Tyr Ala Pro Glu Met Ala Tyr 210 215 220Phe Glu Cys Leu His Glu Leu Lys Leu Ile Val Asp Leu Met Tyr Glu225 230 235 240Gly Gly Ile Ala Asn Met Asn Tyr Ser Ile Ser Asn Asn Ala Glu Tyr 245 250 255Gly Glu Tyr Val Thr Gly Pro Glu Val Ile Asn Ala Glu Ser Arg Ala 260 265 270Ala Met Arg Asn Ala Leu Lys Arg Ile Gln Asp Gly Glu Tyr Ala Lys 275 280 285Met Phe Ile Thr Glu Gly Ala Ala Asn Tyr Pro Ser Met Thr Ala Tyr 290 295 300Arg Arg Asn Asn Ala Ala His Pro Ile Glu Gln Ile Gly Glu Lys Leu305 310 315 320Arg Ala Met Met Pro Trp Ile Ala Ala Asn Lys Ile Val Asp Lys Ser 325 330 335Lys Asn431014DNAPseudomonas fluorescens 43atgaaagttt tctacgataa agactgcgac ctgtcgatca tccaaggtaa gaaagttgcc 60atcatcggct acggttccca gggccacgct caagcatgca acctgaagga ttccggcgta 120gacgtgactg ttggcctgcg taaaggctcg gctaccgttg ccaaggctga agcccacggc 180ttgaaagtga ccgacgttgc tgcagccgtt gccggtgccg acttggtcat gatcctgacc 240ccggacgagt tccagtccca gctgtacaag aacgaaatcg agccgaacat caagaagggc 300gccactctgg ccttctccca cggcttcgcg atccactaca accaggttgt gcctcgtgcc 360gacctcgacg tgatcatgat cgcgccgaag gctccaggcc acaccgtacg ttccgagttc 420gtcaagggcg gtggtattcc tgacctgatc gcgatctacc aggacgcttc cggcaacgcc 480aagaacgttg ccctgtccta cgccgcaggc gtgggcggcg gccgtaccgg catcatcgaa 540accaccttca aggacgagac tgaaaccgac ctgttcggtg agcaggctgt tctgtgtggc 600ggtaccgtcg agctggtcaa agccggtttc gaaaccctgg ttgaagctgg ctacgctcca 660gaaatggcct acttcgagtg cctgcacgaa ctgaagctga tcgttgacct catgtacgaa 720ggcggtatcg ccaacatgaa ctactcgatc tccaacaacg ctgaatacgg cgagtacgtg 780actggtccag aagtcatcaa cgccgaatcc cgtcaggcca tgcgcaatgc tctgaagcgc 840atccaggacg gcgaatacgc gaagatgttc atcagcgaag gcgctaccgg ctacccatcg 900atgaccgcca agcgtcgtaa caacgctgct cacggtatcg aaatcatcgg cgagcaactg 960cgctcgatga tgccttggat cggtgccaac aaaatcgtcg acaaagccaa gaac 101444338PRTPseudomonas fluorescens 44Met Lys Val Phe Tyr Asp Lys Asp Cys Asp Leu Ser Ile Ile Gln Gly1 5 10 15Lys Lys Val Ala Ile Ile Gly Tyr Gly Ser Gln Gly His Ala Gln Ala 20 25 30Cys Asn Leu Lys Asp Ser Gly Val Asp Val Thr Val Gly Leu Arg Lys 35 40 45Gly Ser Ala Thr Val Ala Lys Ala Glu Ala His Gly Leu Lys Val Thr 50 55 60Asp Val Ala Ala Ala Val Ala Gly Ala Asp Leu Val Met Ile Leu Thr65 70 75 80Pro Asp Glu Phe Gln Ser Gln Leu Tyr Lys Asn Glu Ile Glu Pro Asn 85 90 95Ile Lys Lys Gly Ala Thr Leu Ala Phe Ser His Gly Phe Ala Ile His 100 105 110Tyr Asn Gln Val Val Pro Arg Ala Asp Leu Asp Val Ile Met Ile Ala 115 120 125Pro Lys Ala Pro Gly His Thr Val Arg Ser Glu Phe Val Lys Gly Gly 130 135 140Gly Ile Pro Asp Leu Ile Ala Ile Tyr Gln Asp Ala Ser Gly Asn Ala145 150 155 160Lys Asn Val Ala Leu Ser Tyr Ala Ala Gly Val Gly Gly Gly Arg Thr 165 170 175Gly Ile Ile Glu Thr Thr Phe Lys Asp Glu Thr Glu Thr Asp Leu Phe 180 185 190Gly Glu Gln Ala Val Leu Cys Gly Gly Thr Val Glu Leu Val Lys Ala 195 200 205Gly Phe Glu Thr Leu Val Glu Ala Gly Tyr Ala Pro Glu Met Ala Tyr 210 215 220Phe Glu Cys Leu His Glu Leu Lys Leu Ile Val Asp Leu Met Tyr Glu225 230 235 240Gly Gly Ile Ala Asn Met Asn Tyr Ser Ile Ser Asn Asn Ala Glu Tyr 245 250 255Gly Glu Tyr Val Thr Gly Pro Glu Val Ile Asn Ala Glu Ser Arg Gln 260 265 270Ala Met Arg Asn Ala Leu Lys Arg Ile Gln Asp Gly Glu Tyr Ala Lys 275 280 285Met Phe Ile Ser Glu Gly Ala Thr Gly Tyr Pro Ser Met Thr Ala Lys 290 295 300Arg Arg Asn Asn Ala Ala His Gly Ile Glu Ile Ile Gly Glu Gln Leu305 310 315 320Arg Ser Met Met Pro Trp Ile Gly Ala Asn Lys Ile Val Asp Lys Ala 325 330 335Lys Asn451014DNAartificial sequenceMutant of Pseudomonas fluorescens ilvC coding region 45atgaaggtgt tttacgataa agactgcgat ctgagcatca tccagggaaa gaaggttgct 60attataggat atggttccca aggacacgca caagccttga acttgaaaga ttctggggtc 120gacgtgacag taggtctgta taaaggtgct gctgatgcag caaaggctga agcacatggc 180tttaaagtca cagatgttgc agcggctgtt gctggcgctg atttagtcat gattttaatt 240ccagatgaat ttcaatcgca attgtacaaa aatgaaatag aaccaaacat taagaagggc 300gctaccttgg ccttcagtca tggatttgcc attcattaca atcaagtagt ccccagggca 360gatttggacg ttattatgat tgcacctaag gctccggggc atactgttag gagcgaattt 420gttaagggtg gtggtattcc agatttgatc gctatatacc aagacgttag cggaaacgct 480aagaatgtag ctttaagcta cgcagcagga gttggtggcg ggagaacggg tataatagaa 540accactttta aagacgagac tgagacagat ttatttggag aacaagcggt tctgtgcgga 600ggaactgttg aattggttaa agcaggcttt gagacgcttg tcgaagcagg gtacgctccc 660gaaatggcat acttcgaatg tctacatgaa ttgaagttga tagtagactt aatgtatgaa 720ggtggtatag ctaatatgaa ctattccatt tcaaataatg cagaatatgg tgagtatgtc 780accggacctg aagtcattaa cgcagaatca agacaagcca tgagaaatgc cttgaaacgt 840atccaggacg gtgaatacgc taagatgttc ataagtgaag gcgctacggg ttacccgagt 900atgactgcta aaagaagaaa caatgcagca catggtatcg aaattattgg tgaacagtta 960aggtctatga tgccctggat cggtgctaat aagatcgtag acaaggcgaa aaat 101446338PRTartificial sequencemutant of Pseudomonas fluorescens protein 46Met Lys Val Phe Tyr Asp Lys Asp Cys Asp Leu Ser Ile Ile Gln Gly1 5 10 15Lys Lys Val Ala Ile Ile Gly Tyr Gly Ser Gln Gly His Ala Gln Ala 20 25 30Leu Asn Leu Lys Asp Ser Gly Val Asp Val Thr Val Gly Leu Tyr Lys 35 40 45Gly Ala Ala Asp Ala Ala Lys Ala Glu Ala His Gly Phe Lys Val Thr 50 55 60Asp Val Ala Ala Ala Val Ala Gly Ala Asp Leu Val Met Ile Leu Ile65 70 75 80Pro Asp Glu Phe Gln Ser Gln Leu Tyr Lys Asn Glu Ile Glu Pro Asn 85 90 95Ile Lys Lys Gly Ala Thr Leu Ala Phe Ser His Gly Phe Ala Ile His 100 105 110Tyr Asn Gln Val Val Pro Arg Ala Asp Leu Asp Val Ile Met Ile Ala 115 120 125Pro Lys Ala Pro Gly His Thr Val Arg Ser Glu Phe Val Lys Gly Gly 130 135 140Gly Ile Pro Asp Leu Ile Ala Ile Tyr Gln Asp Val Ser Gly Asn Ala145 150 155 160Lys Asn Val Ala Leu Ser Tyr Ala Ala Ala Val Gly Gly Gly Arg Thr 165 170 175Gly Ile Ile Glu Thr Thr Phe Lys Asp Glu Thr Glu Thr Asp Leu Phe 180 185 190Gly Glu Gln Ala Val Leu Cys Gly Gly Thr Val Glu Leu Val Lys Ala 195 200 205Gly Phe Glu Thr Leu Val Glu Ala Gly Tyr Ala Pro Glu Met Ala Tyr 210 215 220Phe Glu Cys Leu His Glu Leu Lys Leu Ile Val Asp Leu Met Tyr Glu225 230 235 240Gly Gly Ile Ala Asn Met Asn Tyr Ser Ile Ser Asn Asn Ala Glu Tyr 245 250 255Gly Glu Tyr Val Thr Gly Pro Glu Val Ile Asn Ala Glu Ser Arg Gln 260 265 270Ala Met Arg Asn Ala Leu Lys Arg Ile Gln Asp Gly Glu Tyr Ala Lys 275 280 285Met Phe Ile Ser Glu Gly Ala Thr Gly Tyr Pro Ser Met Thr Ala Lys 290 295 300Arg Arg Asn Asn Ala Ala His Gly Ile Glu Ile Ile Gly Glu Gln Leu305 310 315 320Arg Ser Met Met Pro Trp Ile Gly Ala Asn Lys Ile Val Asp Lys Ala 325 330 335Lys Asn471758DNASaccharomyces cerevisiae 47atgggcttgt taacgaaagt tgctacatct agacaattct ctacaacgag atgcgttgca 60aagaagctca acaagtactc gtatatcatc actgaaccta agggccaagg tgcgtcccag 120gccatgcttt atgccaccgg tttcaagaag gaagatttca agaagcctca agtcggggtt 180ggttcctgtt ggtggtccgg taacccatgt aacatgcatc tattggactt gaataacaga 240tgttctcaat ccattgaaaa agcgggtttg aaagctatgc agttcaacac catcggtgtt 300tcagacggta tctctatggg tactaaaggt atgagatact cgttacaaag tagagaaatc 360attgcagact cctttgaaac catcatgatg gcacaacact acgatgctaa catcgccatc 420ccatcatgtg acaaaaacat gcccggtgtc atgatggcca tgggtagaca taacagacct 480tccatcatgg tatatggtgg tactatcttg cccggtcatc caacatgtgg ttcttcgaag 540atctctaaaa acatcgatat cgtctctgcg ttccaatcct acggtgaata tatttccaag 600caattcactg aagaagaaag agaagatgtt gtggaacatg catgcccagg tcctggttct 660tgtggtggta tgtatactgc caacacaatg gcttctgccg ctgaagtgct aggtttgacc 720attccaaact cctcttcctt cccagccgtt tccaaggaga agttagctga gtgtgacaac 780attggtgaat acatcaagaa gacaatggaa ttgggtattt tacctcgtga tatcctcaca 840aaagaggctt ttgaaaacgc cattacttat gtcgttgcaa ccggtgggtc cactaatgct 900gttttgcatt tggtggctgt tgctcactct gcgggtgtca agttgtcacc agatgatttc 960caaagaatca gtgatactac accattgatc ggtgacttca aaccttctgg taaatacgtc 1020atggccgatt tgattaacgt tggtggtacc caatctgtga ttaagtatct atatgaaaac 1080aacatgttgc acggtaacac aatgactgtt accggtgaca ctttggcaga acgtgcaaag 1140aaagcaccaa gcctacctga aggacaagag attattaagc cactctccca cccaatcaag 1200gccaacggtc acttgcaaat tctgtacggt tcattggcac caggtggagc tgtgggtaaa 1260attaccggta aggaaggtac ttacttcaag ggtagagcac gtgtgttcga agaggaaggt 1320gcctttattg aagccttgga aagaggtgaa atcaagaagg gtgaaaaaac cgttgttgtt 1380atcagatatg aaggtccaag aggtgcacca ggtatgcctg aaatgctaaa gccttcctct 1440gctctgatgg gttacggttt gggtaaagat gttgcattgt tgactgatgg tagattctct 1500ggtggttctc acgggttctt aatcggccac attgttcccg aagccgctga aggtggtcct 1560atcgggttgg tcagagacgg cgatgagatt atcattgatg ctgataataa caagattgac 1620ctattagtct ctgataagga aatggctcaa cgtaaacaaa gttgggttgc acctccacct 1680cgttacacaa gaggtactct atccaagtat gctaagttgg tttccaacgc ttccaacggt 1740tgtgttttag atgcttga 175848585PRTSaccharomyces cerevisiae 48Met Gly Leu Leu Thr Lys Val Ala Thr Ser Arg Gln Phe Ser Thr Thr1 5 10 15Arg Cys Val Ala Lys Lys Leu Asn Lys Tyr Ser Tyr Ile Ile Thr Glu 20 25 30Pro Lys Gly Gln Gly Ala Ser Gln Ala Met Leu Tyr Ala Thr Gly Phe 35 40 45Lys Lys Glu Asp Phe Lys Lys Pro Gln Val Gly Val Gly Ser Cys Trp 50 55 60Trp Ser Gly Asn Pro Cys Asn Met His Leu Leu Asp Leu Asn Asn Arg65 70 75 80Cys Ser Gln Ser Ile Glu Lys Ala Gly Leu Lys Ala Met Gln Phe Asn 85 90 95Thr Ile Gly Val Ser Asp Gly Ile Ser Met Gly Thr Lys Gly Met Arg 100 105 110Tyr Ser Leu Gln Ser Arg Glu Ile Ile Ala Asp Ser Phe Glu Thr Ile 115 120 125Met Met Ala Gln His Tyr Asp Ala Asn Ile Ala Ile Pro Ser Cys Asp 130 135 140Lys Asn Met Pro Gly Val Met Met Ala Met Gly Arg His Asn Arg Pro145 150 155 160Ser Ile Met Val Tyr Gly Gly Thr Ile Leu Pro Gly His Pro Thr Cys 165 170 175Gly Ser Ser Lys Ile Ser Lys Asn Ile Asp Ile Val Ser Ala Phe Gln 180 185 190Ser Tyr Gly Glu Tyr Ile Ser Lys Gln Phe Thr Glu Glu Glu Arg Glu 195 200 205Asp Val Val Glu His Ala Cys Pro Gly Pro Gly Ser Cys Gly Gly Met 210 215 220Tyr Thr Ala Asn Thr Met Ala Ser Ala Ala Glu Val Leu Gly Leu Thr225 230 235 240Ile Pro Asn Ser Ser Ser Phe Pro Ala Val Ser Lys Glu Lys Leu Ala 245 250 255Glu Cys Asp Asn Ile Gly Glu Tyr Ile Lys Lys Thr Met Glu Leu Gly 260 265 270Ile Leu Pro Arg Asp Ile Leu Thr Lys Glu Ala Phe Glu Asn Ala Ile 275 280 285Thr Tyr Val Val Ala Thr Gly Gly Ser Thr Asn Ala Val Leu His Leu 290 295 300Val Ala Val Ala His Ser Ala Gly Val Lys Leu Ser Pro Asp Asp Phe305 310 315 320Gln Arg Ile Ser Asp Thr Thr Pro Leu Ile Gly Asp Phe Lys Pro Ser 325 330 335Gly Lys Tyr Val Met Ala Asp Leu Ile Asn Val Gly Gly Thr Gln Ser 340 345 350Val Ile Lys Tyr Leu Tyr Glu Asn Asn Met Leu His Gly Asn Thr Met 355 360 365Thr Val Thr Gly Asp Thr Leu Ala Glu Arg Ala Lys Lys Ala Pro Ser 370 375 380Leu Pro Glu Gly Gln Glu Ile Ile Lys Pro Leu Ser His Pro Ile Lys385 390 395 400Ala Asn Gly His Leu Gln Ile Leu Tyr Gly Ser Leu Ala Pro Gly Gly 405 410 415Ala Val Gly Lys Ile Thr Gly Lys Glu Gly Thr Tyr Phe Lys Gly Arg 420 425 430Ala Arg Val Phe Glu Glu Glu Gly Ala Phe Ile Glu Ala Leu Glu Arg 435 440 445Gly Glu Ile Lys Lys Gly Glu Lys Thr Val Val Val Ile Arg Tyr Glu 450 455 460Gly Pro Arg Gly Ala Pro Gly Met Pro Glu Met Leu Lys Pro Ser Ser465 470 475 480Ala Leu Met Gly Tyr Gly Leu Gly Lys Asp Val Ala Leu Leu Thr Asp 485 490 495Gly Arg Phe Ser Gly Gly Ser His Gly Phe Leu Ile Gly His Ile Val 500 505 510Pro Glu Ala Ala Glu Gly Gly Pro Ile Gly Leu Val Arg Asp Gly Asp 515 520 525Glu Ile Ile Ile Asp Ala Asp Asn Asn Lys Ile Asp Leu Leu Val Ser 530 535 540Asp Lys Glu Met Ala Gln Arg Lys Gln Ser Trp Val Ala Pro Pro Pro545 550 555 560Arg Tyr Thr Arg Gly Thr Leu Ser Lys Tyr Ala Lys Leu Val Ser Asn 565 570 575Ala Ser Asn Gly Cys Val Leu Asp Ala 580 585491713DNAStreptococcus mutans 49atgactgaca aaaaaactct taaagactta agaaatcgta gttctgttta cgattcaatg 60gttaaatcac ctaatcgtgc tatgttgcgt gcaactggta tgcaagatga agactttgaa 120aaacctatcg tcggtgtcat ttcaacttgg gctgaaaaca caccttgtaa tatccactta 180catgactttg gtaaactagc caaagtcggt gttaaggaag ctggtgcttg gccagttcag 240ttcggaacaa tcacggtttc tgatggaatc gccatgggaa cccaaggaat gcgtttctcc 300ttgacatctc gtgatattat tgcagattct attgaagcag ccatgggagg tcataatgcg 360gatgcttttg tagccattgg cggttgtgat aaaaacatgc ccggttctgt tatcgctatg 420gctaacatgg atatcccagc catttttgct tacggcggaa caattgcacc tggtaattta 480gacggcaaag atatcgattt agtctctgtc tttgaaggtg tcggccattg gaaccacggc 540gatatgacca aagaagaagt taaagctttg gaatgtaatg cttgtcccgg tcctggaggc 600tgcggtggta tgtatactgc taacacaatg gcgacagcta ttgaagtttt gggacttagc 660cttccgggtt catcttctca cccggctgaa tccgcagaaa agaaagcaga tattgaagaa 720gctggtcgcg ctgttgtcaa aatgctcgaa atgggcttaa aaccttctga cattttaacg 780cgtgaagctt ttgaagatgc tattactgta actatggctc tgggaggttc aaccaactca 840acccttcacc tcttagctat tgcccatgct gctaatgtgg aattgacact tgatgatttc 900aatactttcc aagaaaaagt tcctcatttg gctgatttga aaccttctgg tcaatatgta 960ttccaagacc tttacaaggt cggaggggta ccagcagtta tgaaatatct ccttaaaaat 1020ggcttccttc atggtgaccg tatcacttgt actggcaaaa cagtcgctga aaatttgaag 1080gcttttgatg atttaacacc tggtcaaaag gttattatgc cgcttgaaaa tcctaaacgt 1140gaagatggtc cgctcattat tctccatggt aacttggctc cagacggtgc cgttgccaaa 1200gtttctggtg taaaagtgcg tcgtcatgtc ggtcctgcta aggtctttaa ttctgaagaa 1260gaagccattg aagctgtctt gaatgatgat attgttgatg gtgatgttgt tgtcgtacgt 1320tttgtaggac caaagggcgg tcctggtatg cctgaaatgc tttccctttc atcaatgatt 1380gttggtaaag ggcaaggtga aaaagttgcc cttctgacag atggccgctt ctcaggtggt 1440acttatggtc ttgtcgtggg tcatatcgct cctgaagcac aagatggcgg tccaatcgcc 1500tacctgcaaa caggagacat agtcactatt gaccaagaca ctaaggaatt acactttgat 1560atctccgatg aagagttaaa acatcgtcaa gagaccattg aattgccacc gctctattca 1620cgcggtatcc ttggtaaata tgctcacatc gtttcgtctg cttctagggg agccgtaaca 1680gacttttgga agcctgaaga aactggcaaa aaa 171350571PRTStreptococcus mutans 50Met Thr Asp Lys Lys Thr Leu Lys Asp Leu Arg Asn Arg Ser Ser Val1 5 10 15Tyr Asp Ser Met Val Lys Ser Pro Asn Arg Ala Met Leu Arg Ala Thr 20 25 30Gly Met Gln Asp Glu Asp Phe Glu Lys Pro Ile Val Gly Val Ile Ser 35 40 45Thr Trp Ala Glu Asn Thr Pro Cys Asn Ile His Leu His Asp Phe Gly 50 55 60Lys Leu Ala Lys Val Gly Val Lys Glu Ala Gly Ala Trp

Pro Val Gln65 70 75 80Phe Gly Thr Ile Thr Val Ser Asp Gly Ile Ala Met Gly Thr Gln Gly 85 90 95Met Arg Phe Ser Leu Thr Ser Arg Asp Ile Ile Ala Asp Ser Ile Glu 100 105 110Ala Ala Met Gly Gly His Asn Ala Asp Ala Phe Val Ala Ile Gly Gly 115 120 125Cys Asp Lys Asn Met Pro Gly Ser Val Ile Ala Met Ala Asn Met Asp 130 135 140Ile Pro Ala Ile Phe Ala Tyr Gly Gly Thr Ile Ala Pro Gly Asn Leu145 150 155 160Asp Gly Lys Asp Ile Asp Leu Val Ser Val Phe Glu Gly Val Gly His 165 170 175Trp Asn His Gly Asp Met Thr Lys Glu Glu Val Lys Ala Leu Glu Cys 180 185 190Asn Ala Cys Pro Gly Pro Gly Gly Cys Gly Gly Met Tyr Thr Ala Asn 195 200 205Thr Met Ala Thr Ala Ile Glu Val Leu Gly Leu Ser Leu Pro Gly Ser 210 215 220Ser Ser His Pro Ala Glu Ser Ala Glu Lys Lys Ala Asp Ile Glu Glu225 230 235 240Ala Gly Arg Ala Val Val Lys Met Leu Glu Met Gly Leu Lys Pro Ser 245 250 255Asp Ile Leu Thr Arg Glu Ala Phe Glu Asp Ala Ile Thr Val Thr Met 260 265 270Ala Leu Gly Gly Ser Thr Asn Ser Thr Leu His Leu Leu Ala Ile Ala 275 280 285His Ala Ala Asn Val Glu Leu Thr Leu Asp Asp Phe Asn Thr Phe Gln 290 295 300Glu Lys Val Pro His Leu Ala Asp Leu Lys Pro Ser Gly Gln Tyr Val305 310 315 320Phe Gln Asp Leu Tyr Lys Val Gly Gly Val Pro Ala Val Met Lys Tyr 325 330 335Leu Leu Lys Asn Gly Phe Leu His Gly Asp Arg Ile Thr Cys Thr Gly 340 345 350Lys Thr Val Ala Glu Asn Leu Lys Ala Phe Asp Asp Leu Thr Pro Gly 355 360 365Gln Lys Val Ile Met Pro Leu Glu Asn Pro Lys Arg Glu Asp Gly Pro 370 375 380Leu Ile Ile Leu His Gly Asn Leu Ala Pro Asp Gly Ala Val Ala Lys385 390 395 400Val Ser Gly Val Lys Val Arg Arg His Val Gly Pro Ala Lys Val Phe 405 410 415Asn Ser Glu Glu Glu Ala Ile Glu Ala Val Leu Asn Asp Asp Ile Val 420 425 430Asp Gly Asp Val Val Val Val Arg Phe Val Gly Pro Lys Gly Gly Pro 435 440 445Gly Met Pro Glu Met Leu Ser Leu Ser Ser Met Ile Val Gly Lys Gly 450 455 460Gln Gly Glu Lys Val Ala Leu Leu Thr Asp Gly Arg Phe Ser Gly Gly465 470 475 480Thr Tyr Gly Leu Val Val Gly His Ile Ala Pro Glu Ala Gln Asp Gly 485 490 495Gly Pro Ile Ala Tyr Leu Gln Thr Gly Asp Ile Val Thr Ile Asp Gln 500 505 510Asp Thr Lys Glu Leu His Phe Asp Ile Ser Asp Glu Glu Leu Lys His 515 520 525Arg Gln Glu Thr Ile Glu Leu Pro Pro Leu Tyr Ser Arg Gly Ile Leu 530 535 540Gly Lys Tyr Ala His Ile Val Ser Ser Ala Ser Arg Gly Ala Val Thr545 550 555 560Asp Phe Trp Lys Pro Glu Glu Thr Gly Lys Lys 565 570511647DNALactococcus lactis 51atgtatacag taggagatta cctattagac cgattacacg agttaggaat tgaagaaatt 60tttggagtcc ctggagacta taacttacaa tttttagatc aaattatttc ccacaaggat 120atgaaatggg tcggaaatgc taatgaatta aatgcttcat atatggctga tggctatgct 180cgtactaaaa aagctgccgc atttcttaca acctttggag taggtgaatt gagtgcagtt 240aatggattag caggaagtta cgccgaaaat ttaccagtag tagaaatagt gggatcacct 300acatcaaaag ttcaaaatga aggaaaattt gttcatcata cgctggctga cggtgatttt 360aaacacttta tgaaaatgca cgaacctgtt acagcagctc gaactttact gacagcagaa 420aatgcaaccg ttgaaattga ccgagtactt tctgcactat taaaagaaag aaaacctgtc 480tatatcaact taccagttga tgttgctgct gcaaaagcag agaaaccctc actccctttg 540aaaaaggaaa actcaacttc aaatacaagt gaccaagaaa ttttgaacaa aattcaagaa 600agcttgaaaa atgccaaaaa accaatcgtg attacaggac atgaaataat tagttttggc 660ttagaaaaaa cagtcactca atttatttca aagacaaaac tacctattac gacattaaac 720tttggtaaaa gttcagttga tgaagccctc ccttcatttt taggaatcta taatggtaca 780ctctcagagc ctaatcttaa agaattcgtg gaatcagccg acttcatctt gatgcttgga 840gttaaactca cagactcttc aacaggagcc ttcactcatc atttaaatga aaataaaatg 900atttcactga atatagatga aggaaaaata tttaacgaaa gaatccaaaa ttttgatttt 960gaatccctca tctcctctct cttagaccta agcgaaatag aatacaaagg aaaatatatc 1020gataaaaagc aagaagactt tgttccatca aatgcgcttt tatcacaaga ccgcctatgg 1080caagcagttg aaaacctaac tcaaagcaat gaaacaatcg ttgctgaaca agggacatca 1140ttctttggcg cttcatcaat tttcttaaaa tcaaagagtc attttattgg tcaaccctta 1200tggggatcaa ttggatatac attcccagca gcattaggaa gccaaattgc agataaagaa 1260agcagacacc ttttatttat tggtgatggt tcacttcaac ttacagtgca agaattagga 1320ttagcaatca gagaaaaaat taatccaatt tgctttatta tcaataatga tggttataca 1380gtcgaaagag aaattcatgg accaaatcaa agctacaatg atattccaat gtggaattac 1440tcaaaattac cagaatcgtt tggagcaaca gaagatcgag tagtctcaaa aatcgttaga 1500actgaaaatg aatttgtgtc tgtcatgaaa gaagctcaag cagatccaaa tagaatgtac 1560tggattgagt taattttggc aaaagaaggt gcaccaaaag tactgaaaaa aatgggcaaa 1620ctatttgctg aacaaaataa atcataa 164752548PRTLactococcus lactis 52Met Tyr Thr Val Gly Asp Tyr Leu Leu Asp Arg Leu His Glu Leu Gly1 5 10 15Ile Glu Glu Ile Phe Gly Val Pro Gly Asp Tyr Asn Leu Gln Phe Leu 20 25 30Asp Gln Ile Ile Ser His Lys Asp Met Lys Trp Val Gly Asn Ala Asn 35 40 45Glu Leu Asn Ala Ser Tyr Met Ala Asp Gly Tyr Ala Arg Thr Lys Lys 50 55 60Ala Ala Ala Phe Leu Thr Thr Phe Gly Val Gly Glu Leu Ser Ala Val65 70 75 80Asn Gly Leu Ala Gly Ser Tyr Ala Glu Asn Leu Pro Val Val Glu Ile 85 90 95Val Gly Ser Pro Thr Ser Lys Val Gln Asn Glu Gly Lys Phe Val His 100 105 110His Thr Leu Ala Asp Gly Asp Phe Lys His Phe Met Lys Met His Glu 115 120 125Pro Val Thr Ala Ala Arg Thr Leu Leu Thr Ala Glu Asn Ala Thr Val 130 135 140Glu Ile Asp Arg Val Leu Ser Ala Leu Leu Lys Glu Arg Lys Pro Val145 150 155 160Tyr Ile Asn Leu Pro Val Asp Val Ala Ala Ala Lys Ala Glu Lys Pro 165 170 175Ser Leu Pro Leu Lys Lys Glu Asn Ser Thr Ser Asn Thr Ser Asp Gln 180 185 190Glu Ile Leu Asn Lys Ile Gln Glu Ser Leu Lys Asn Ala Lys Lys Pro 195 200 205Ile Val Ile Thr Gly His Glu Ile Ile Ser Phe Gly Leu Glu Lys Thr 210 215 220Val Thr Gln Phe Ile Ser Lys Thr Lys Leu Pro Ile Thr Thr Leu Asn225 230 235 240Phe Gly Lys Ser Ser Val Asp Glu Ala Leu Pro Ser Phe Leu Gly Ile 245 250 255Tyr Asn Gly Thr Leu Ser Glu Pro Asn Leu Lys Glu Phe Val Glu Ser 260 265 270Ala Asp Phe Ile Leu Met Leu Gly Val Lys Leu Thr Asp Ser Ser Thr 275 280 285Gly Ala Phe Thr His His Leu Asn Glu Asn Lys Met Ile Ser Leu Asn 290 295 300Ile Asp Glu Gly Lys Ile Phe Asn Glu Arg Ile Gln Asn Phe Asp Phe305 310 315 320Glu Ser Leu Ile Ser Ser Leu Leu Asp Leu Ser Glu Ile Glu Tyr Lys 325 330 335Gly Lys Tyr Ile Asp Lys Lys Gln Glu Asp Phe Val Pro Ser Asn Ala 340 345 350Leu Leu Ser Gln Asp Arg Leu Trp Gln Ala Val Glu Asn Leu Thr Gln 355 360 365Ser Asn Glu Thr Ile Val Ala Glu Gln Gly Thr Ser Phe Phe Gly Ala 370 375 380Ser Ser Ile Phe Leu Lys Ser Lys Ser His Phe Ile Gly Gln Pro Leu385 390 395 400Trp Gly Ser Ile Gly Tyr Thr Phe Pro Ala Ala Leu Gly Ser Gln Ile 405 410 415Ala Asp Lys Glu Ser Arg His Leu Leu Phe Ile Gly Asp Gly Ser Leu 420 425 430Gln Leu Thr Val Gln Glu Leu Gly Leu Ala Ile Arg Glu Lys Ile Asn 435 440 445Pro Ile Cys Phe Ile Ile Asn Asn Asp Gly Tyr Thr Val Glu Arg Glu 450 455 460Ile His Gly Pro Asn Gln Ser Tyr Asn Asp Ile Pro Met Trp Asn Tyr465 470 475 480Ser Lys Leu Pro Glu Ser Phe Gly Ala Thr Glu Asp Arg Val Val Ser 485 490 495Lys Ile Val Arg Thr Glu Asn Glu Phe Val Ser Val Met Lys Glu Ala 500 505 510Gln Ala Asp Pro Asn Arg Met Tyr Trp Ile Glu Leu Ile Leu Ala Lys 515 520 525Glu Gly Ala Pro Lys Val Leu Lys Lys Met Gly Lys Leu Phe Ala Glu 530 535 540Gln Asn Lys Ser545531644DNAartificial sequencecodon optimized L. lactis kivD coding region for S. cerevisiae expression 53atgtatacag taggtgacta tctgttggac agattacacg aattaggtat agaagaaata 60ttcggagtac caggtgacta caatttgcaa tttctagatc aaattatttc acacaaagat 120atgaaatggg tgggaaatgc taatgagtta aatgcctcct atatggccga cgggtacgca 180agaacgaaaa aggctgcggc attcttgact acatttggtg ttggcgaatt atccgcagtt 240aatggcttag cgggctccta tgctgagaac ctgcctgttg ttgagatcgt gggatctcct 300acctcgaaag tgcagaacga aggtaagttt gttcaccata cgttggctga tggtgatttc 360aagcacttta tgaagatgca cgaaccggtt actgctgcca ggactttatt gacagccgag 420aatgcaactg ttgaaattga tagagtgttg tctgccttac taaaggaaag aaagccggtt 480tacatcaatt tacctgtaga tgtagctgcc gctaaggctg aaaaaccatc cttgcctctt 540aagaaggaaa attccacgtc gaatacatct gatcaagaga ttctgaacaa aatacaggaa 600agtctgaaga atgccaagaa accaattgta atcacaggcc atgaaattat atcgttcggc 660ctagagaaga ctgttactca gtttatttca aagactaagt tacctattac tactttgaac 720tttggtaaat catctgttga tgaagcattg ccctcatttt tggggattta caacggtact 780ctgtcagagc caaacttgaa ggaatttgtg gaatctgctg attttattct tatgttgggt 840gtaaagctta ccgattctag tacgggtgca tttactcacc atcttaatga aaataaaatg 900atttccttga atatcgatga aggtaaaatt ttcaacgaaa gaatccaaaa tttcgacttc 960gaatccctga tatcatctct tcttgacttg tccgaaattg aatataaagg caagtacata 1020gataaaaagc aagaagattt tgtaccttct aacgcgctgt tgtcacaaga tagactgtgg 1080caagctgtcg aaaatttgac ccaaagtaat gagacgatcg tggctgaaca aggcacttct 1140ttcttcggtg cctcatctat atttctgaaa tcgaaatcac attttattgg tcaacccttg 1200tggggatcta taggatacac tttccccgca gctctaggca gccaaattgc agataaagaa 1260tctagacatt tattgtttat cggagatgga tcattgcaac tgactgtcca agaattagga 1320ctagccatta gagagaagat aaacccaatc tgctttatca ttaataacga tggttacacg 1380gttgagaggg aaattcatgg tccgaaccag agttataatg acattcctat gtggaattac 1440tcaaaactgc cagaaagttt cggggcaacg gaagacagag ttgtgtccaa aattgtgaga 1500acagaaaatg aattcgtatc cgtgatgaaa gaagctcaag cagatccaaa taggatgtat 1560tggatagaac ttattctagc aaaggagggt gcacctaaag ttttgaaaaa gatgggtaag 1620ttatttgcag aacaaaacaa gagc 1644541146DNASaccharomyces cerevisiae 54atgagagctt tggcatattt caagaagggt gatattcact tcactaatga tatccctagg 60ccagaaatcc aaaccgacga tgaggttatt atcgacgtct cttggtgtgg gatttgtggc 120tcggatcttc acgagtactt ggatggtcca atcttcatgc ctaaagatgg agagtgccat 180aaattatcca acgctgcttt acctctggca atgggccatg agatgtcagg aattgtttcc 240aaggttggtc ctaaagtgac aaaggtgaag gttggcgacc acgtggtcgt tgatgctgcc 300agcagttgtg cggacctgca ttgctggcca cactccaaat tttacaattc caaaccatgt 360gatgcttgtc agaggggcag tgaaaatcta tgtacccacg ccggttttgt aggactaggt 420gtgatcagtg gtggctttgc tgaacaagtc gtagtctctc aacatcacat tatcccggtt 480ccaaaggaaa ttcctctaga tgtggctgct ttagttgagc ctctttctgt cacctggcat 540gctgttaaga tttctggttt caaaaaaggc agttcagcct tggttcttgg tgcaggtccc 600attgggttgt gtaccatttt ggtacttaag ggaatggggg ctagtaaaat tgtagtgtct 660gaaattgcag agagaagaat agaaatggcc aagaaactgg gcgttgaggt gttcaatccc 720tccaagcacg gtcataaatc tatagagata ctacgtggtt tgaccaagag ccatgatggg 780tttgattaca gttatgattg ttctggtatt caagttactt tcgaaacctc tttgaaggca 840ttaacattca aggggacagc caccaacatt gcagtttggg gtccaaaacc tgtcccattc 900caaccaatgg atgtgactct ccaagagaaa gttatgactg gttcgatcgg ctatgttgtc 960gaagccttcg aagaagttgt tcgtgccatc cacaacggag acatcgccat ggaagattgt 1020aagcaactaa tcactggtaa gcaaaggatt gaggacggtt gggaaaaggg attccaagag 1080ttgatggatc acaaggaatc caacgttaag attctattga cgcctaacaa tcacggtgaa 1140atgaag 114655382PRTSaccharomyces cerevisiae 55Met Arg Ala Leu Ala Tyr Phe Lys Lys Gly Asp Ile His Phe Thr Asn1 5 10 15Asp Ile Pro Arg Pro Glu Ile Gln Thr Asp Asp Glu Val Ile Ile Asp 20 25 30Val Ser Trp Cys Gly Ile Cys Gly Ser Asp Leu His Glu Tyr Leu Asp 35 40 45Gly Pro Ile Phe Met Pro Lys Asp Gly Glu Cys His Lys Leu Ser Asn 50 55 60Ala Ala Leu Pro Leu Ala Met Gly His Glu Met Ser Gly Ile Val Ser65 70 75 80Lys Val Gly Pro Lys Val Thr Lys Val Lys Val Gly Asp His Val Val 85 90 95Val Asp Ala Ala Ser Ser Cys Ala Asp Leu His Cys Trp Pro His Ser 100 105 110Lys Phe Tyr Asn Ser Lys Pro Cys Asp Ala Cys Gln Arg Gly Ser Glu 115 120 125Asn Leu Cys Thr His Ala Gly Phe Val Gly Leu Gly Val Ile Ser Gly 130 135 140Gly Phe Ala Glu Gln Val Val Val Ser Gln His His Ile Ile Pro Val145 150 155 160Pro Lys Glu Ile Pro Leu Asp Val Ala Ala Leu Val Glu Pro Leu Ser 165 170 175Val Thr Trp His Ala Val Lys Ile Ser Gly Phe Lys Lys Gly Ser Ser 180 185 190Ala Leu Val Leu Gly Ala Gly Pro Ile Gly Leu Cys Thr Ile Leu Val 195 200 205Leu Lys Gly Met Gly Ala Ser Lys Ile Val Val Ser Glu Ile Ala Glu 210 215 220Arg Arg Ile Glu Met Ala Lys Lys Leu Gly Val Glu Val Phe Asn Pro225 230 235 240Ser Lys His Gly His Lys Ser Ile Glu Ile Leu Arg Gly Leu Thr Lys 245 250 255Ser His Asp Gly Phe Asp Tyr Ser Tyr Asp Cys Ser Gly Ile Gln Val 260 265 270Thr Phe Glu Thr Ser Leu Lys Ala Leu Thr Phe Lys Gly Thr Ala Thr 275 280 285Asn Ile Ala Val Trp Gly Pro Lys Pro Val Pro Phe Gln Pro Met Asp 290 295 300Val Thr Leu Gln Glu Lys Val Met Thr Gly Ser Ile Gly Tyr Val Val305 310 315 320Glu Ala Phe Glu Glu Val Val Arg Ala Ile His Asn Gly Asp Ile Ala 325 330 335Met Glu Asp Cys Lys Gln Leu Ile Thr Gly Lys Gln Arg Ile Glu Asp 340 345 350Gly Trp Glu Lys Gly Phe Gln Glu Leu Met Asp His Lys Glu Ser Asn 355 360 365Val Lys Ile Leu Leu Thr Pro Asn Asn His Gly Glu Met Lys 370 375 380561125DNAartificial sequencehorse ADH coding region codon optimized for S. cerevisiae expression 56atgtcaacag ccggtaaagt tattaagtgt aaagcggcag ttttgtggga agagaaaaag 60ccgtttagca tagaagaagt agaagtagcg ccaccaaaag cacacgaggt tagaatcaag 120atggttgcca ccggaatctg tagatccgac gaccatgtgg tgagtggcac tctagttact 180cctttgccag taatcgcggg acacgaggct gccggaatcg ttgaatccat aggtgaaggt 240gttaccactg ttcgtcctgg tgataaagtg atcccactgt tcactcctca atgtggtaag 300tgtagagtct gcaaacatcc tgagggtaat ttctgcctta aaaatgattt gtctatgcct 360agaggtacta tgcaggatgg tacaagcaga tttacatgca gagggaaacc tatacaccat 420ttccttggta cttctacatt ttcccaatac acagtggtgg acgagatatc tgtcgctaaa 480atcgatgcag cttcaccact ggaaaaagtt tgcttgatag ggtgcggatt ttccaccggt 540tacggttccg cagttaaagt tgcaaaggtt acacagggtt cgacttgtgc agtattcggt 600ttaggaggag taggactaag cgttattatg gggtgtaaag ctgcaggcgc agcgaggatt 660ataggtgtag acatcaataa ggacaaattt gcaaaagcta aggaggtcgg ggctactgaa 720tgtgttaacc ctcaagatta taagaaacca atacaagaag tccttactga aatgtcaaac 780ggtggagttg atttctcttt tgaagttata ggccgtcttg atactatggt aactgcgttg 840tcctgctgtc aagaggcata tggagtcagt gtgatcgtag gtgttcctcc tgattcacaa 900aatttgtcga tgaatcctat gctgttgcta agcggtcgta catggaaggg agctatattt 960ggcggtttta agagcaagga tagtgttcca aaacttgttg ccgactttat ggcgaagaag 1020tttgctcttg atcctttaat tacacatgta ttgccattcg agaaaatcaa tgaagggttt 1080gatttgttaa gaagtggtga atctattcgt acaattttaa ctttt 112557375PRTEquus caballus 57Met Ser Thr Ala Gly Lys Val Ile Lys Cys Lys Ala Ala Val Leu Trp1 5 10 15Glu Glu Lys Lys Pro Phe Ser Ile Glu Glu Val Glu Val Ala Pro Pro 20 25 30Lys Ala His Glu Val Arg Ile Lys Met Val Ala Thr Gly Ile Cys Arg 35 40 45Ser Asp Asp His Val Val Ser Gly Thr Leu Val Thr Pro Leu Pro

Val 50 55 60Ile Ala Gly His Glu Ala Ala Gly Ile Val Glu Ser Ile Gly Glu Gly65 70 75 80Val Thr Thr Val Arg Pro Gly Asp Lys Val Ile Pro Leu Phe Thr Pro 85 90 95Gln Cys Gly Lys Cys Arg Val Cys Lys His Pro Glu Gly Asn Phe Cys 100 105 110Leu Lys Asn Asp Leu Ser Met Pro Arg Gly Thr Met Gln Asp Gly Thr 115 120 125Ser Arg Phe Thr Cys Arg Gly Lys Pro Ile His His Phe Leu Gly Thr 130 135 140Ser Thr Phe Ser Gln Tyr Thr Val Val Asp Glu Ile Ser Val Ala Lys145 150 155 160Ile Asp Ala Ala Ser Pro Leu Glu Lys Val Cys Leu Ile Gly Cys Gly 165 170 175Phe Ser Thr Gly Tyr Gly Ser Ala Val Lys Val Ala Lys Val Thr Gln 180 185 190Gly Ser Thr Cys Ala Val Phe Gly Leu Gly Gly Val Gly Leu Ser Val 195 200 205Ile Met Gly Cys Lys Ala Ala Gly Ala Ala Arg Ile Ile Gly Val Asp 210 215 220Ile Asn Lys Asp Lys Phe Ala Lys Ala Lys Glu Val Gly Ala Thr Glu225 230 235 240Cys Val Asn Pro Gln Asp Tyr Lys Lys Pro Ile Gln Glu Val Leu Thr 245 250 255Glu Met Ser Asn Gly Gly Val Asp Phe Ser Phe Glu Val Ile Gly Arg 260 265 270Leu Asp Thr Met Val Thr Ala Leu Ser Cys Cys Gln Glu Ala Tyr Gly 275 280 285Val Ser Val Ile Val Gly Val Pro Pro Asp Ser Gln Asn Leu Ser Met 290 295 300Asn Pro Met Leu Leu Leu Ser Gly Arg Thr Trp Lys Gly Ala Ile Phe305 310 315 320Gly Gly Phe Lys Ser Lys Asp Ser Val Pro Lys Leu Val Ala Asp Phe 325 330 335Met Ala Lys Lys Phe Ala Leu Asp Pro Leu Ile Thr His Val Leu Pro 340 345 350Phe Glu Lys Ile Asn Glu Gly Phe Asp Leu Leu Arg Ser Gly Glu Ser 355 360 365Ile Arg Thr Ile Leu Thr Phe 370 375581020DNALactococcus lactis 58atggcagtta caatgtatta tgaagatgat gtagaagtat cagcacttgc tggaaagcaa 60attgcagtaa tcggttatgg ttcacaagga catgctcacg cacagaattt gcgtgattct 120ggtcacaacg ttatcattgg tgtgcgccac ggaaaatctt ttgataaagc aaaagaagat 180ggctttgaaa catttgaagt aggagaagca gtagctaaag ctgatgttat tatggttttg 240gcaccagatg aacttcaaca atccatttat gaagaggaca tcaaaccaaa cttgaaagca 300ggttcagcac ttggttttgc tcacggattt aatatccatt ttggctatat taaagtacca 360gaagacgttg acgtctttat ggttgcgcct aaggctccag gtcaccttgt ccgtcggact 420tatactgaag gttttggtac accagctttg tttgtttcac accaaaatgc aagtggtcat 480gcgcgtgaaa tcgcaatgga ttgggccaaa ggaattggtt gtgctcgagt gggaattatt 540gaaacaactt ttaaagaaga aacagaagaa gatttgtttg gagaacaagc tgttctatgt 600ggaggtttga cagcacttgt tgaagccggt tttgaaacac tgacagaagc tggatacgct 660ggcgaattgg cttactttga agttttgcac gaaatgaaat tgattgttga cctcatgtat 720gaaggtggtt ttactaaaat gcgtcaatcc atctcaaata ctgctgagtt tggcgattat 780gtgactggtc cacggattat tactgacgaa gttaaaaaga atatgaagct tgttttggct 840gatattcaat ctggaaaatt tgctcaagat ttcgttgatg acttcaaagc ggggcgtcca 900aaattaatag cctatcgcga agctgcaaaa aatcttgaaa ttgaaaaaat tggggcagag 960ctacgtcaag caatgccatt cacacaatct ggtgatgacg atgcctttaa aatctatcag 102059340PRTLactococcus lactis 59Met Ala Val Thr Met Tyr Tyr Glu Asp Asp Val Glu Val Ser Ala Leu1 5 10 15Ala Gly Lys Gln Ile Ala Val Ile Gly Tyr Gly Ser Gln Gly His Ala 20 25 30His Ala Gln Asn Leu Arg Asp Ser Gly His Asn Val Ile Ile Gly Val 35 40 45Arg His Gly Lys Ser Phe Asp Lys Ala Lys Glu Asp Gly Phe Glu Thr 50 55 60Phe Glu Val Gly Glu Ala Val Ala Lys Ala Asp Val Ile Met Val Leu65 70 75 80Ala Pro Asp Glu Leu Gln Gln Ser Ile Tyr Glu Glu Asp Ile Lys Pro 85 90 95Asn Leu Lys Ala Gly Ser Ala Leu Gly Phe Ala His Gly Phe Asn Ile 100 105 110His Phe Gly Tyr Ile Lys Val Pro Glu Asp Val Asp Val Phe Met Val 115 120 125Ala Pro Lys Ala Pro Gly His Leu Val Arg Arg Thr Tyr Thr Glu Gly 130 135 140Phe Gly Thr Pro Ala Leu Phe Val Ser His Gln Asn Ala Ser Gly His145 150 155 160Ala Arg Glu Ile Ala Met Asp Trp Ala Lys Gly Ile Gly Cys Ala Arg 165 170 175Val Gly Ile Ile Glu Thr Thr Phe Lys Glu Glu Thr Glu Glu Asp Leu 180 185 190Phe Gly Glu Gln Ala Val Leu Cys Gly Gly Leu Thr Ala Leu Val Glu 195 200 205Ala Gly Phe Glu Thr Leu Thr Glu Ala Gly Tyr Ala Gly Glu Leu Ala 210 215 220Tyr Phe Glu Val Leu His Glu Met Lys Leu Ile Val Asp Leu Met Tyr225 230 235 240Glu Gly Gly Phe Thr Lys Met Arg Gln Ser Ile Ser Asn Thr Ala Glu 245 250 255Phe Gly Asp Tyr Val Thr Gly Pro Arg Ile Ile Thr Asp Glu Val Lys 260 265 270Lys Asn Met Lys Leu Val Leu Ala Asp Ile Gln Ser Gly Lys Phe Ala 275 280 285Gln Asp Phe Val Asp Asp Phe Lys Ala Gly Arg Pro Lys Leu Ile Ala 290 295 300Tyr Arg Glu Ala Ala Lys Asn Leu Glu Ile Glu Lys Ile Gly Ala Glu305 310 315 320Leu Arg Gln Ala Met Pro Phe Thr Gln Ser Gly Asp Asp Asp Ala Phe 325 330 335Lys Ile Tyr Gln 34060719DNAArtificial sequenceUAS(PGK1)-FBA1 60aattaccgtc gctcgtgatt tgtttgcaaa aagaacaaaa ctgaaaaaac ccagacacgc 60tcgacttcct gtcttcctat tgattgcagc ttccaatttc gtcacacaac aaggtcctgc 120ctacttggct tcacatacgt tgcatacgtc gatatagata ataatgataa tgacagcagg 180attatcgtaa tacgtaatag ttgaaaatct caaaaatgtg tgggtcatta cgtaaataat 240gataggaatg ggattcttct atttttcctt tttccattct agcagccgtc gggaaaacgt 300ggcatcctct ctttcgggct caattggagt cacgctgccg tgagcatcct ctctttccat 360atctaacaac tgagcacgta accaatggaa aagcatgagc ttagcgttgc tccaaaaaag 420tattggatgg ttaataccat ttgtctgttc tcttctgact ttgactcctc aaaaaaaaaa 480aatctacaat caacagatcg cttcaattac gccctcacaa aaactttttt ccttcttctt 540cgcccacgtt aaattttatc cctcatgttg tctaacggat ttctgcactt gatttattat 600aaaaagacaa agacataata cttctctatc aatttcagtt attgttcttc cttgcgttat 660tcttctgttc ttctttttct tttgtcatat ataaccataa ccaagtaata catattcaa 71961636DNAArtificial sequencePromoter UAS(PGK1)-ENO2 61aattaccgtc gctcgtgatt tgtttgcaaa aagaacaaaa ctgaaaaaac ccagacacgc 60tcgacttcct gtcttcctat tgattgcagc ttccaatttc gtcacacaac aaggtcctgc 120ggcgttatgt cactaacgac gtgcaccaac ttgcggaaag tggaatcccg ttccaaaact 180ggcatccact aattgataca tctacacacc gcacgccttt tttctgaagc ccactttcgt 240ggactttgcc atatgcaaaa ttcatgaagt gtgataccaa gtcagcatac acctcactag 300ggtagtttct ttggttgtat tgatcatttg gttcatcgtg gttcattaat tttttttctc 360cattgctttc tggctttgat cttactatca tttggatttt tgtcgaaggt tgtagaattg 420tatgtgacaa gtggcaccaa gcatatataa aaaaaaaaag cattatcttc ctaccagagt 480tgattgttaa aaacgtattt atagcaaacg caattgtaat taattcttat tttgtatctt 540ttcttccctt gtctcaatct tttattttta ttttattttt cttttcttag tttctttcat 600aacaccaagc aactaatact ataacataca ataata 63662979DNAArtificial sequencePromoter UAS(FBA1)-PDC1 62tgacagcagg attatcgtaa tacgtaatag ttgaaaatct caaaaatgtg tgggtcatta 60cgtaaataat gataggaatg ggattcttct atttttcctt tttccattct cgcacgccga 120aatgcatgca agtaacctat tcaaagtaat atctcataca tgtttcatga gggtaacaac 180atgcgactgg gtgagcatat gttccgctga tgtgatgtgc aagataaaca agcaaggcag 240aaactaactt cttcttcatg taataaacac accccgcgtt tatttaccta tctctaaact 300tcaacacctt atatcataac taatatttct tgagataagc acactgcacc cataccttcc 360ttaaaaacgt agcttccagt ttttggtggt tccggcttcc ttcccgattc cgcccgctaa 420acgcatattt ttgttgcctg gtggcatttg caaaatgcat aacctatgca tttaaaagat 480tatgtatgct cttctgactt ttcgtgtgat gaggctcgtg gaaaaaatga ataatttatg 540aatttgagaa caattttgtg ttgttacggt attttactat ggaataatca atcaattgag 600gattttatgc aaatatcgtt tgaatatttt tccgaccctt tgagtacttt tcttcataat 660tgcataatat tgtccgctgc ccctttttct gttagacggt gtcttgatct acttgctatc 720gttcaacacc accttatttt ctaactattt tttttttagc tcatttgaat cagcttatgg 780tgatggcaca tttttgcata aacctagctg tcctcgttga acataggaaa aaaaaatata 840taaacaaggc tctttcactc tccttgcaat cagatttggg tttgttccct ttattttcat 900atttcttgtc atattccttt ctcaattatt attttctact cataacctca cgcaaaataa 960cacagtcaaa tcaatcaaa 97963994DNAArtificial sequenceUAS(PGK1)-PDC1 63aattaccgtc gctcgtgatt tgtttgcaaa aagaacaaaa ctgaaaaaac ccagacacgc 60tcgacttcct gtcttcctat tgattgcagc ttccaatttc gtcacacaac aaggtcctgt 120cgacccgcac gccgaaatgc atgcaagtaa cctattcaaa gtaatatctc atacatgttt 180catgagggta acaacatgcg actgggtgag catatgttcc gctgatgtga tgtgcaagat 240aaacaagcaa ggcagaaact aacttcttct tcatgtaata aacacacccc gcgtttattt 300acctatctct aaacttcaac accttatatc ataactaata tttcttgaga taagcacact 360gcacccatac cttccttaaa aacgtagctt ccagtttttg gtggttccgg cttccttccc 420gattccgccc gctaaacgca tatttttgtt gcctggtggc atttgcaaaa tgcataacct 480atgcatttaa aagattatgt atgctcttct gacttttcgt gtgatgaggc tcgtggaaaa 540aatgaataat ttatgaattt gagaacaatt ttgtgttgtt acggtatttt actatggaat 600aatcaatcaa ttgaggattt tatgcaaata tcgtttgaat atttttccga ccctttgagt 660acttttcttc ataattgcat aatattgtcc gctgcccctt tttctgttag acggtgtctt 720gatctacttg ctatcgttca acaccacctt attttctaac tatttttttt ttagctcatt 780tgaatcagct tatggtgatg gcacattttt gcataaacct agctgtcctc gttgaacata 840ggaaaaaaaa atatataaac aaggctcttt cactctcctt gcaatcagat ttgggtttgt 900tccctttatt ttcatatttc ttgtcatatt cctttctcaa ttattatttt ctactcataa 960cctcacgcaa aataacacag tcaaatcaat caaa 99464731DNAArtificial sequenceUAS(PGK1)-OLE1 64aattaccgtc gctcgtgatt tgtttgcaaa aagaacaaaa ctgaaaaaac ccagacacgc 60tcgacttcct gtcttcctat tgattgcagc ttccaatttc gtcacacaac aaggtcctgt 120cgaccggggc gcccgtgcaa aaactaactc cgagcccggg catgtcccgg gttagcgggc 180ccaacaaagg cgcttatctg gtgggcttcc gtagaagaaa aaaagctgtt gagcgagcta 240tttcgggtat cccagccttc tctgcagacc gccccagttg gcttggctct ggtgctgttc 300gttagcatca catcgcctgt gacaggcaga ggtaataacg gcttaaggtt ctcttcgcat 360agtcggcagc tttctttcgg acgttgaaca ctcaacaaac cttatctagt gcccaaccag 420gtgtgcttct acgagtcttg ctcactcaga cacacctatc cctattgtta cggctatggg 480gatggcacac aaaggtggaa ataatagtag ttaacaatat atgcagcaaa tcatcggctc 540ctggctcatc gagtcttgca aatcagcata tacatatata tatgggggca gatcttgatt 600catttattgt tctatttcca tctttcctac ttctgtttcc gtttatattt tgtattacgt 660agaatagaac atcatagtaa tagatagttg tggtgatcat attataaaca gcactaaaac 720attacaacaa a 73165500PRTSaccharomyces cerevisiae 65Met Thr Lys Leu His Phe Asp Thr Ala Glu Pro Val Lys Ile Thr Leu1 5 10 15Pro Asn Gly Leu Thr Tyr Glu Gln Pro Thr Gly Leu Phe Ile Asn Asn 20 25 30Lys Phe Met Lys Ala Gln Asp Gly Lys Thr Tyr Pro Val Glu Asp Pro 35 40 45Ser Thr Glu Asn Thr Val Cys Glu Val Ser Ser Ala Thr Thr Glu Asp 50 55 60Val Glu Tyr Ala Ile Glu Cys Ala Asp Arg Ala Phe His Asp Thr Glu65 70 75 80Trp Ala Thr Gln Asp Pro Arg Glu Arg Gly Arg Leu Leu Ser Lys Leu 85 90 95Ala Asp Glu Leu Glu Ser Gln Ile Asp Leu Val Ser Ser Ile Glu Ala 100 105 110Leu Asp Asn Gly Lys Thr Leu Ala Leu Ala Arg Gly Asp Val Thr Ile 115 120 125Ala Ile Asn Cys Leu Arg Asp Ala Ala Ala Tyr Ala Asp Lys Val Asn 130 135 140Gly Arg Thr Ile Asn Thr Gly Asp Gly Tyr Met Asn Phe Thr Thr Leu145 150 155 160Glu Pro Ile Gly Val Cys Gly Gln Ile Ile Pro Trp Asn Phe Pro Ile 165 170 175Met Met Leu Ala Trp Lys Ile Ala Pro Ala Leu Ala Met Gly Asn Val 180 185 190Cys Ile Leu Lys Pro Ala Ala Val Thr Pro Leu Asn Ala Leu Tyr Phe 195 200 205Ala Ser Leu Cys Lys Lys Val Gly Ile Pro Ala Gly Val Val Asn Ile 210 215 220Val Pro Gly Pro Gly Arg Thr Val Gly Ala Ala Leu Thr Asn Asp Pro225 230 235 240Arg Ile Arg Lys Leu Ala Phe Thr Gly Ser Thr Glu Val Gly Lys Ser 245 250 255Val Ala Val Asp Ser Ser Glu Ser Asn Leu Lys Lys Ile Thr Leu Glu 260 265 270Leu Gly Gly Lys Ser Ala His Leu Val Phe Asp Asp Ala Asn Ile Lys 275 280 285Lys Thr Leu Pro Asn Leu Val Asn Gly Ile Phe Lys Asn Ala Gly Gln 290 295 300Ile Cys Ser Ser Gly Ser Arg Ile Tyr Val Gln Glu Gly Ile Tyr Asp305 310 315 320Glu Leu Leu Ala Ala Phe Lys Ala Tyr Leu Glu Thr Glu Ile Lys Val 325 330 335Gly Asn Pro Phe Asp Lys Ala Asn Phe Gln Gly Ala Ile Thr Asn Arg 340 345 350Gln Gln Phe Asp Thr Ile Met Asn Tyr Ile Asp Ile Gly Lys Lys Glu 355 360 365Gly Ala Lys Ile Leu Thr Gly Gly Glu Lys Val Gly Asp Lys Gly Tyr 370 375 380Phe Ile Arg Pro Thr Val Phe Tyr Asp Val Asn Glu Asp Met Arg Ile385 390 395 400Val Lys Glu Glu Ile Phe Gly Pro Val Val Thr Val Ala Lys Phe Lys 405 410 415Thr Leu Glu Glu Gly Val Glu Met Ala Asn Ser Ser Glu Phe Gly Leu 420 425 430Gly Ser Gly Ile Glu Thr Glu Ser Leu Ser Thr Gly Leu Lys Val Ala 435 440 445Lys Met Leu Lys Ala Gly Thr Val Trp Ile Asn Thr Tyr Asn Asp Phe 450 455 460Asp Ser Arg Val Pro Phe Gly Gly Val Lys Gln Ser Gly Tyr Gly Arg465 470 475 480Glu Met Gly Glu Glu Val Tyr His Ala Tyr Thr Glu Val Lys Ala Val 485 490 495Arg Ile Lys Leu 50066343PRTArtificial sequenceAnaerostipes caccae KARI variant K9D3 66Met Glu Glu Cys Lys Met Ala Lys Ile Tyr Tyr Gln Glu Asp Cys Asn1 5 10 15Leu Ser Leu Leu Asp Gly Lys Thr Ile Ala Val Ile Gly Tyr Gly Ser 20 25 30Gln Gly His Ala His Ala Leu Asn Ala Lys Glu Ser Gly Cys Asn Val 35 40 45Ile Ile Gly Leu Tyr Glu Gly Ala Lys Asp Trp Lys Arg Ala Glu Glu 50 55 60Gln Gly Phe Glu Val Tyr Thr Ala Ala Glu Ala Ala Lys Lys Ala Asp65 70 75 80Ile Ile Met Ile Leu Ile Asn Asp Glu Lys Gln Ala Thr Met Tyr Lys 85 90 95Asn Asp Ile Glu Pro Asn Leu Glu Ala Gly Asn Met Leu Met Phe Ala 100 105 110His Gly Phe Asn Ile His Phe Gly Cys Ile Val Pro Pro Lys Asp Val 115 120 125Asp Val Thr Met Ile Ala Pro Lys Gly Pro Gly His Thr Val Arg Ser 130 135 140Glu Tyr Glu Glu Gly Lys Gly Val Pro Cys Leu Val Ala Val Glu Gln145 150 155 160Asp Ala Thr Gly Lys Ala Leu Asp Met Ala Leu Ala Tyr Ala Leu Ala 165 170 175Ile Gly Gly Ala Arg Ala Gly Val Leu Glu Thr Thr Phe Arg Thr Glu 180 185 190Thr Glu Thr Asp Leu Phe Gly Glu Gln Ala Val Leu Cys Gly Gly Val 195 200 205Cys Ala Leu Met Gln Ala Gly Phe Glu Thr Leu Val Glu Ala Gly Tyr 210 215 220Asp Pro Arg Asn Ala Tyr Phe Glu Cys Ile His Glu Met Lys Leu Ile225 230 235 240Val Asp Leu Ile Tyr Gln Ser Gly Phe Ser Gly Met Arg Tyr Ser Ile 245 250 255Ser Asn Thr Ala Glu Tyr Gly Asp Tyr Ile Thr Gly Pro Lys Ile Ile 260 265 270Thr Glu Asp Thr Lys Lys Ala Met Lys Lys Ile Leu Ser Asp Ile Gln 275 280 285Asp Gly Thr Phe Ala Lys Asp Phe Leu Val Asp Met Ser Asp Ala Gly 290 295 300Ser Gln Val His Phe Lys Ala Met Arg Lys Leu Ala Ser Glu His Pro305 310 315 320Ala Glu Val Val Gly Glu Glu Ile Arg Ser Leu Tyr Ser Trp Ser Asp 325 330 335Glu Asp Lys Leu Ile Asn Asn 34067343PRTArtificial sequenceAnaerostipes caccae KARI variant K9G9 67Met Glu Glu Cys Lys Met Ala Lys Ile Tyr Tyr Gln Glu Asp Cys Asn1 5 10 15Leu Ser Leu Leu Asp Gly Lys Thr Ile Ala Val Ile Gly Tyr Gly Ser 20 25 30Gln Gly His Ala His Ala Leu Asn Ala Lys Glu Ser Gly Cys Asn Val 35

40 45Ile Ile Gly Leu Tyr Glu Gly Ala Lys Glu Trp Lys Arg Ala Glu Glu 50 55 60Gln Gly Phe Glu Val Tyr Thr Ala Ala Glu Ala Ala Lys Lys Ala Asp65 70 75 80Ile Ile Met Ile Leu Ile Asn Asp Glu Lys Gln Ala Thr Met Tyr Lys 85 90 95Asn Asp Ile Glu Pro Asn Leu Glu Ala Gly Asn Met Leu Met Phe Ala 100 105 110His Gly Phe Asn Ile His Phe Gly Cys Ile Val Pro Pro Lys Asp Val 115 120 125Asp Val Thr Met Ile Ala Pro Lys Gly Pro Gly His Thr Val Arg Ser 130 135 140Glu Tyr Glu Glu Gly Lys Gly Val Pro Cys Leu Val Ala Val Glu Gln145 150 155 160Asp Ala Thr Gly Lys Ala Leu Asp Met Ala Leu Ala Tyr Ala Leu Ala 165 170 175Ile Gly Gly Ala Arg Ala Gly Val Leu Glu Thr Thr Phe Arg Thr Glu 180 185 190Thr Glu Thr Asp Leu Phe Gly Glu Gln Ala Val Leu Cys Gly Gly Val 195 200 205Cys Ala Leu Met Gln Ala Gly Phe Glu Thr Leu Val Glu Ala Gly Tyr 210 215 220Asp Pro Arg Asn Ala Tyr Phe Glu Cys Ile His Glu Met Lys Leu Ile225 230 235 240Val Asp Leu Ile Tyr Gln Ser Gly Phe Ser Gly Met Arg Tyr Ser Ile 245 250 255Ser Asn Thr Ala Glu Tyr Gly Asp Tyr Ile Thr Gly Pro Lys Ile Ile 260 265 270Thr Glu Asp Thr Lys Lys Ala Met Lys Lys Ile Leu Ser Asp Ile Gln 275 280 285Asp Gly Thr Phe Ala Lys Asp Phe Leu Val Asp Met Ser Asp Ala Gly 290 295 300Ser Gln Val His Phe Lys Ala Met Arg Lys Leu Ala Ser Glu His Pro305 310 315 320Ala Glu Val Val Gly Glu Glu Ile Arg Ser Leu Tyr Ser Trp Ser Asp 325 330 335Glu Asp Lys Leu Ile Asn Asn 34068267PRTSaccharomyces cerevisiae 68Met Ser Gln Gly Arg Lys Ala Ala Glu Arg Leu Ala Lys Lys Thr Val1 5 10 15Leu Ile Thr Gly Ala Ser Ala Gly Ile Gly Lys Ala Thr Ala Leu Glu 20 25 30Tyr Leu Glu Ala Ser Asn Gly Asp Met Lys Leu Ile Leu Ala Ala Arg 35 40 45Arg Leu Glu Lys Leu Glu Glu Leu Lys Lys Thr Ile Asp Gln Glu Phe 50 55 60Pro Asn Ala Lys Val His Val Ala Gln Leu Asp Ile Thr Gln Ala Glu65 70 75 80Lys Ile Lys Pro Phe Ile Glu Asn Leu Pro Gln Glu Phe Lys Asp Ile 85 90 95Asp Ile Leu Val Asn Asn Ala Gly Lys Ala Leu Gly Ser Asp Arg Val 100 105 110Gly Gln Ile Ala Thr Glu Asp Ile Gln Asp Val Phe Asp Thr Asn Val 115 120 125Thr Ala Leu Ile Asn Ile Thr Gln Ala Val Leu Pro Ile Phe Gln Ala 130 135 140Lys Asn Ser Gly Asp Ile Val Asn Leu Gly Ser Ile Ala Gly Arg Asp145 150 155 160Ala Tyr Pro Thr Gly Ser Ile Tyr Cys Ala Ser Lys Phe Ala Val Gly 165 170 175Ala Phe Thr Asp Ser Leu Arg Lys Glu Leu Ile Asn Thr Lys Ile Arg 180 185 190Val Ile Leu Ile Ala Pro Gly Leu Val Glu Thr Glu Phe Ser Leu Val 195 200 205Arg Tyr Arg Gly Asn Glu Glu Gln Ala Lys Asn Val Tyr Lys Asp Thr 210 215 220Thr Pro Leu Met Ala Asp Asp Val Ala Asp Leu Ile Val Tyr Ala Thr225 230 235 240Ser Arg Lys Gln Asn Thr Val Ile Ala Asp Thr Leu Ile Phe Pro Thr 245 250 255Asn Gln Ala Ser Pro His His Ile Phe Arg Gly 260 265692073DNASaccharomyces cerevisiae 69atggaaggct tcaatccggc tgacatagaa catgcgtcac cgattaattc atctgacagc 60cattcatcct cctttgtata tgctctaccc aaaagtgcta gtgaatatgt agtcaaccat 120aatgagggtc gtgcaagtgc aagtggaaat ccagccgcag tgccgtctcc cataatgaca 180ctgaatctca aaagcacaca ttccctcaat attgatcagc atgttcatac ctcaacatcg 240ccgacggaaa ctattgggca tattcatcat gtggaaaagc tgaatcaaaa caatttgatt 300catctggatc cagtacccaa ctttgaagat aagtccgata ttaagccttg gttgcaaaag 360attttttatc ctcaaggaat agaacttgtg atagaaaggt cggacgcatt taaagttgtc 420ttcaagtgta aagctgctaa aaggggaagg aacgcgagaa ggaaaagaaa agataagccc 480aaaggacagg accacgaaga cgagaaatcc aagatcaatg atgacgaatt agaatatgcg 540agtccttcta atgccacagt aaccaatggg cctcaaacat cgcccgatca aacatcctcc 600ataaagccaa agaaaaaaag atgtgtatcg aggtttaata actgtccgtt tagagtacga 660gctacttatt cgttaaagag gaaaagatgg agcattgttg taatggacaa taaccattca 720catcagctaa agtttaaccc tgattccgaa gagtacaaaa aattcaaaga aaaattaaga 780aaggataatg acgtagatgc aatcaagaaa ttcgacgaat tggaatacag aactttggcc 840aatttgccca ttccaacagc tacaatcccc tgtgattgtg gtttaacaaa tgaaatacaa 900agtttcaatg tcgtattgcc cactaacagt aatgttactt catcagcatc ctcttcaact 960gtatcgtcca tatcccttga ttcatcgaat gcatctaaaa ggccatgctt accctctgta 1020aataacaccg gtagtatcaa taccaataac gtaaggaaac cgaaaagcca gtgtaagaat 1080aaagacacac tcttaaaaag aaccaccatg cagaactttc tcacaactaa atcaaggctg 1140cgtaagaccg gtacgccaac atcttcgcaa cactcatcta cagcattttc aggatatatt 1200gatgatcctt tcaatttgaa tgaaatcttg ccactgccgg catccgattt caagctaaac 1260actgtaacaa atttgaacga aattgacttt acgaacattt ttaccaaatc gccgcatcca 1320catagcgggt ctacccatcc aagacaagtc ttcgaccaat tggacgattg ttcctctata 1380ctcttctctc cattaactac aaacacgaat aatgaatttg aaggagagtc agatgatttt 1440gttcattctc catatttgaa ctcagaggca gatttcagcc aaattcttag tagtgctccc 1500ccagtccatc atgacccaaa tgaaacacat caggaaaacc aggatattat tgatagattt 1560gctaatagtt cccaagaaca taatgagtat attctacaat atttgacgca ctccgatgct 1620gctaaccaca ataacatcgg cgttccaaac aacaattcac attcgctaaa tactcagcat 1680aacgtttctg atctgggcaa ctcactttta agacaagaag ctttagttgg cagctcttca 1740acaaaaatct tcgacgaatt gaaatttgta caaaatggcc cacacggttc tcaacatcct 1800atagattttc aacatgttga ccatcgtcat ctcagctcta atgaacctca agtacgatca 1860catcaatatg gtccgcaaca gcagccaccg cagcaattgc aatatcacca aaatcagccc 1920cacgacggcc ataaccacga acagcaccaa acagtacaaa aggatatgca aacgcatgaa 1980tcgctagaaa taatgggaaa cacattattg gaagagttca aagacattaa aatggtgaac 2040ggcgagttga agtatgtgaa gccagaagat tag 207370690PRTSaccharomyces cerevisiae 70Met Glu Gly Phe Asn Pro Ala Asp Ile Glu His Ala Ser Pro Ile Asn1 5 10 15Ser Ser Asp Ser His Ser Ser Ser Phe Val Tyr Ala Leu Pro Lys Ser 20 25 30Ala Ser Glu Tyr Val Val Asn His Asn Glu Gly Arg Ala Ser Ala Ser 35 40 45Gly Asn Pro Ala Ala Val Pro Ser Pro Ile Met Thr Leu Asn Leu Lys 50 55 60Ser Thr His Ser Leu Asn Ile Asp Gln His Val His Thr Ser Thr Ser65 70 75 80Pro Thr Glu Thr Ile Gly His Ile His His Val Glu Lys Leu Asn Gln 85 90 95Asn Asn Leu Ile His Leu Asp Pro Val Pro Asn Phe Glu Asp Lys Ser 100 105 110Asp Ile Lys Pro Trp Leu Gln Lys Ile Phe Tyr Pro Gln Gly Ile Glu 115 120 125Leu Val Ile Glu Arg Ser Asp Ala Phe Lys Val Val Phe Lys Cys Lys 130 135 140Ala Ala Lys Arg Gly Arg Asn Ala Arg Arg Lys Arg Lys Asp Lys Pro145 150 155 160Lys Gly Gln Asp His Glu Asp Glu Lys Ser Lys Ile Asn Asp Asp Glu 165 170 175Leu Glu Tyr Ala Ser Pro Ser Asn Ala Thr Val Thr Asn Gly Pro Gln 180 185 190Thr Ser Pro Asp Gln Thr Ser Ser Ile Lys Pro Lys Lys Lys Arg Cys 195 200 205Val Ser Arg Phe Asn Asn Cys Pro Phe Arg Val Arg Ala Thr Tyr Ser 210 215 220Leu Lys Arg Lys Arg Trp Ser Ile Val Val Met Asp Asn Asn His Ser225 230 235 240His Gln Leu Lys Phe Asn Pro Asp Ser Glu Glu Tyr Lys Lys Phe Lys 245 250 255Glu Lys Leu Arg Lys Asp Asn Asp Val Asp Ala Ile Lys Lys Phe Asp 260 265 270Glu Leu Glu Tyr Arg Thr Leu Ala Asn Leu Pro Ile Pro Thr Ala Thr 275 280 285Ile Pro Cys Asp Cys Gly Leu Thr Asn Glu Ile Gln Ser Phe Asn Val 290 295 300Val Leu Pro Thr Asn Ser Asn Val Thr Ser Ser Ala Ser Ser Ser Thr305 310 315 320Val Ser Ser Ile Ser Leu Asp Ser Ser Asn Ala Ser Lys Arg Pro Cys 325 330 335Leu Pro Ser Val Asn Asn Thr Gly Ser Ile Asn Thr Asn Asn Val Arg 340 345 350Lys Pro Lys Ser Gln Cys Lys Asn Lys Asp Thr Leu Leu Lys Arg Thr 355 360 365Thr Met Gln Asn Phe Leu Thr Thr Lys Ser Arg Leu Arg Lys Thr Gly 370 375 380Thr Pro Thr Ser Ser Gln His Ser Ser Thr Ala Phe Ser Gly Tyr Ile385 390 395 400Asp Asp Pro Phe Asn Leu Asn Glu Ile Leu Pro Leu Pro Ala Ser Asp 405 410 415Phe Lys Leu Asn Thr Val Thr Asn Leu Asn Glu Ile Asp Phe Thr Asn 420 425 430Ile Phe Thr Lys Ser Pro His Pro His Ser Gly Ser Thr His Pro Arg 435 440 445Gln Val Phe Asp Gln Leu Asp Asp Cys Ser Ser Ile Leu Phe Ser Pro 450 455 460Leu Thr Thr Asn Thr Asn Asn Glu Phe Glu Gly Glu Ser Asp Asp Phe465 470 475 480Val His Ser Pro Tyr Leu Asn Ser Glu Ala Asp Phe Ser Gln Ile Leu 485 490 495Ser Ser Ala Pro Pro Val His His Asp Pro Asn Glu Thr His Gln Glu 500 505 510Asn Gln Asp Ile Ile Asp Arg Phe Ala Asn Ser Ser Gln Glu His Asn 515 520 525Glu Tyr Ile Leu Gln Tyr Leu Thr His Ser Asp Ala Ala Asn His Asn 530 535 540Asn Ile Gly Val Pro Asn Asn Asn Ser His Ser Leu Asn Thr Gln His545 550 555 560Asn Val Ser Asp Leu Gly Asn Ser Leu Leu Arg Gln Glu Ala Leu Val 565 570 575Gly Ser Ser Ser Thr Lys Ile Phe Asp Glu Leu Lys Phe Val Gln Asn 580 585 590Gly Pro His Gly Ser Gln His Pro Ile Asp Phe Gln His Val Asp His 595 600 605Arg His Leu Ser Ser Asn Glu Pro Gln Val Arg Ser His Gln Tyr Gly 610 615 620Pro Gln Gln Gln Pro Pro Gln Gln Leu Gln Tyr His Gln Asn Gln Pro625 630 635 640His Asp Gly His Asn His Glu Gln His Gln Thr Val Gln Lys Asp Met 645 650 655Gln Thr His Glu Ser Leu Glu Ile Met Gly Asn Thr Leu Leu Glu Glu 660 665 670Phe Lys Asp Ile Lys Met Val Asn Gly Glu Leu Lys Tyr Val Lys Pro 675 680 685Glu Asp 690711251DNASaccharomyces cerevisiae 71atgaaagcaa agtcgatgaa gagcataata tctgtgccaa tatcagtgtc taaaacaggg 60aaaatgaagt taacagcatc tcccgataat ctagcatcta tgatgtctaa agaccaaaat 120aaacttattc atttagaccc agttccgtct tttgaggaca ggcacgagat taaaccgtgg 180ctacagaaaa tattctatcc tcaaggaata gatatagtca tcgaaagatc agatagtagc 240aaagttactt ttaagtgtag gtcagtacga tcgaaagtag ggttgaaccc taaatccaaa 300ggttcctctt caagatccca tgcatgtcca tttagaatta gagcagccta ctctgtgcgg 360ttacagaagt ggaacgtggt tgttatgaat aacatacatt cacatgagct gagatttgat 420ttgattacta aaacggatga ctataaaaag tttaaggaaa atcttcgtca gaagaacgac 480gaaaaagcca tcaaaacatt tgatgaatta gaatataaag caagtctaaa cttaccactc 540gtcacaccaa ttatttcttg tgactgtggg ttaaccaagg aaattgaagc tttcaataac 600atttttctcc cactatcaaa cccccctttg acgtctaaaa aaaatttgct caaaacaaat 660aaaaactccg tttctaaaat aaaatccaga cagatggata attcgaaacc taggccaagg 720ttgaaaacga agttggatgc ggatctacat gataccggat ttttagacaa tttcaaaacg 780cggaattcct gtgtaaagat agagaaagaa gattctttaa cgaacttaaa cgaaattgat 840tttacgaaca tgttttgcaa cgacaatttc atccaaaatt acaaccaagg cctgatggaa 900cttttgacag aacctacacc aggtccttcg tcctcttctt gtatacttcc ttcaacacca 960acgaggccct tgtcacaaag caaaatggat atagccctgt cagaatcaac aacatcttca 1020ccaaatttta tggaaacaga cgctccttat ggagatgaga tcattaaagt ttccaaagat 1080accaaaagca acgcaccaac tgctgataca gatattgcta cgaacttagg caaagagaga 1140aacgaaaatt ttggcatgtt aaactacaac tacgaagcgt tgcttcattt caacgatgag 1200cattttaatg aattaaattc tattgaccca gccttaatat caaaatatta a 125172416PRTSaccharomyces cerevisiae 72Met Lys Ala Lys Ser Met Lys Ser Ile Ile Ser Val Pro Ile Ser Val1 5 10 15Ser Lys Thr Gly Lys Met Lys Leu Thr Ala Ser Pro Asp Asn Leu Ala 20 25 30Ser Met Met Ser Lys Asp Gln Asn Lys Leu Ile His Leu Asp Pro Val 35 40 45Pro Ser Phe Glu Asp Arg His Glu Ile Lys Pro Trp Leu Gln Lys Ile 50 55 60Phe Tyr Pro Gln Gly Ile Asp Ile Val Ile Glu Arg Ser Asp Ser Ser65 70 75 80Lys Val Thr Phe Lys Cys Arg Ser Val Arg Ser Lys Val Gly Leu Asn 85 90 95Pro Lys Ser Lys Gly Ser Ser Ser Arg Ser His Ala Cys Pro Phe Arg 100 105 110Ile Arg Ala Ala Tyr Ser Val Arg Leu Gln Lys Trp Asn Val Val Val 115 120 125Met Asn Asn Ile His Ser His Glu Leu Arg Phe Asp Leu Ile Thr Lys 130 135 140Thr Asp Asp Tyr Lys Lys Phe Lys Glu Asn Leu Arg Gln Lys Asn Asp145 150 155 160Glu Lys Ala Ile Lys Thr Phe Asp Glu Leu Glu Tyr Lys Ala Ser Leu 165 170 175Asn Leu Pro Leu Val Thr Pro Ile Ile Ser Cys Asp Cys Gly Leu Thr 180 185 190Lys Glu Ile Glu Ala Phe Asn Asn Ile Phe Leu Pro Leu Ser Asn Pro 195 200 205Pro Leu Thr Ser Lys Lys Asn Leu Leu Lys Thr Asn Lys Asn Ser Val 210 215 220Ser Lys Ile Lys Ser Arg Gln Met Asp Asn Ser Lys Pro Arg Pro Arg225 230 235 240Leu Lys Thr Lys Leu Asp Ala Asp Leu His Asp Thr Gly Phe Leu Asp 245 250 255Asn Phe Lys Thr Arg Asn Ser Cys Val Lys Ile Glu Lys Glu Asp Ser 260 265 270Leu Thr Asn Leu Asn Glu Ile Asp Phe Thr Asn Met Phe Cys Asn Asp 275 280 285Asn Phe Ile Gln Asn Tyr Asn Gln Gly Leu Met Glu Leu Leu Thr Glu 290 295 300Pro Thr Pro Gly Pro Ser Ser Ser Ser Cys Ile Leu Pro Ser Thr Pro305 310 315 320Thr Arg Pro Leu Ser Gln Ser Lys Met Asp Ile Ala Leu Ser Glu Ser 325 330 335Thr Thr Ser Ser Pro Asn Phe Met Glu Thr Asp Ala Pro Tyr Gly Asp 340 345 350Glu Ile Ile Lys Val Ser Lys Asp Thr Lys Ser Asn Ala Pro Thr Ala 355 360 365Asp Thr Asp Ile Ala Thr Asn Leu Gly Lys Glu Arg Asn Glu Asn Phe 370 375 380Gly Met Leu Asn Tyr Asn Tyr Glu Ala Leu Leu His Phe Asn Asp Glu385 390 395 400His Phe Asn Glu Leu Asn Ser Ile Asp Pro Ala Leu Ile Ser Lys Tyr 405 410 41573363DNASaccharomyces cerevisiae 73atgacaggtg aaagaattga aaaggtgaaa ataaatgacg aatttgcaaa atcacatttc 60ctaacaactc agtggagaga aacaaagcgc caacgacact ataagatgcc cgttactgaa 120cagggactcc gcgagaggat agaatcagcc ataccgcaag tttaccatat cattgtgacg 180gatctttcgt acggttgtgg tcagtcgttt gatattgtgg tggtcagcga cttctttcaa 240ggtaaaagca aactaatgag gagtcgtgca gtgaacaagg ctgtgaaaga agagctgcag 300gagattcatg cctttagctg caagtgctac actgaggagg aatggtctaa gattgtggta 360tga 36374120PRTSaccharomyces cerevisiae 74Met Thr Gly Glu Arg Ile Glu Lys Val Lys Ile Asn Asp Glu Phe Ala1 5 10 15Lys Ser His Phe Leu Thr Thr Gln Trp Arg Glu Thr Lys Arg Gln Arg 20 25 30His Tyr Lys Met Pro Val Thr Glu Gln Gly Leu Arg Glu Arg Ile Glu 35 40 45Ser Ala Ile Pro Gln Val Tyr His Ile Ile Val Thr Asp Leu Ser Tyr 50 55 60Gly Cys Gly Gln Ser Phe Asp Ile Val Val Val Ser Asp Phe Phe Gln65 70 75 80Gly Lys Ser Lys Leu Met Arg Ser Arg Ala Val Asn Lys Ala Val Lys 85 90 95Glu Glu Leu Gln Glu Ile His Ala Phe Ser Cys Lys Cys Tyr Thr Glu 100 105 110Glu Glu Trp Ser Lys Ile Val Val 115 12075858DNASaccharomyces cerevisiae 75atgtgttctt ttcaggttcc atctgcattt tcttttaact acacctcgta ctgttataaa 60cgccaccaag caagatatta cacagcagca aaactttttc aggaaatgcc tgttattgaa 120attaacgatc

aagagcaatt tacttaccta actaccactg cggccggcga caagttaatc 180gtgctttatt tccataccag ttgggcagaa ccatgcaaag cattaaagca ggtttttgag 240gccattagta atgagccttc caattccaac gtctctttct tatccattga tgcggacgaa 300aactcggaaa tttcagaact ttttgaaatc tcagctgttc catattttat cataattcac 360aaagggacaa tcttaaaaga attatccggc gcggatccaa aggagtatgt gtctttatta 420gaagactgca agaactcagt caattccgga tcatcacaaa ctcatactat ggaaaatgca 480aacgtaaatg aggggagtca taatgatgaa gacgatgacg acgaagaaga ggaagaagaa 540actgaggagc aaataaacgc tagattgact aaattggtca atgccgcgcc ggtaatgtta 600tttatgaagg ggagcccctc tgaacctaaa tgcgggtttt cgagacaact tgtgggtatc 660ttgagagaac atcaagtaag atttggcttc tttgatatat taagagacga atctgttaga 720caaaacttga aaaagttttc tgaatggcca actttccctc aactttatat aaatggggag 780tttcaaggcg gtttagacat tatcaaggaa tccttggagg aagaccctga ttttttgcag 840catgctctcc aatcttaa 85876285PRTSaccharomyces cerevisiae 76Met Cys Ser Phe Gln Val Pro Ser Ala Phe Ser Phe Asn Tyr Thr Ser1 5 10 15Tyr Cys Tyr Lys Arg His Gln Ala Arg Tyr Tyr Thr Ala Ala Lys Leu 20 25 30Phe Gln Glu Met Pro Val Ile Glu Ile Asn Asp Gln Glu Gln Phe Thr 35 40 45Tyr Leu Thr Thr Thr Ala Ala Gly Asp Lys Leu Ile Val Leu Tyr Phe 50 55 60His Thr Ser Trp Ala Glu Pro Cys Lys Ala Leu Lys Gln Val Phe Glu65 70 75 80Ala Ile Ser Asn Glu Pro Ser Asn Ser Asn Val Ser Phe Leu Ser Ile 85 90 95Asp Ala Asp Glu Asn Ser Glu Ile Ser Glu Leu Phe Glu Ile Ser Ala 100 105 110Val Pro Tyr Phe Ile Ile Ile His Lys Gly Thr Ile Leu Lys Glu Leu 115 120 125Ser Gly Ala Asp Pro Lys Glu Tyr Val Ser Leu Leu Glu Asp Cys Lys 130 135 140Asn Ser Val Asn Ser Gly Ser Ser Gln Thr His Thr Met Glu Asn Ala145 150 155 160Asn Val Asn Glu Gly Ser His Asn Asp Glu Asp Asp Asp Asp Glu Glu 165 170 175Glu Glu Glu Glu Thr Glu Glu Gln Ile Asn Ala Arg Leu Thr Lys Leu 180 185 190Val Asn Ala Ala Pro Val Met Leu Phe Met Lys Gly Ser Pro Ser Glu 195 200 205Pro Lys Cys Gly Phe Ser Arg Gln Leu Val Gly Ile Leu Arg Glu His 210 215 220Gln Val Arg Phe Gly Phe Phe Asp Ile Leu Arg Asp Glu Ser Val Arg225 230 235 240Gln Asn Leu Lys Lys Phe Ser Glu Trp Pro Thr Phe Pro Gln Leu Tyr 245 250 255Ile Asn Gly Glu Phe Gln Gly Gly Leu Asp Ile Ile Lys Glu Ser Leu 260 265 270Glu Glu Asp Pro Asp Phe Leu Gln His Ala Leu Gln Ser 275 280 28577969DNASaccharomyces cerevisiae 77atgtccattg tagcactaaa gaacgcagtg gtgaccctta tacagaaagc gaaaggtagt 60ggtggaacct cagagttggg ggggtctgaa tcaactccct tgttgagggg tagtaatagc 120aatagttcaa ggcatgataa cttatcctca tctagctcgg atattatcta tggtagaaat 180tcagcgcagg atctagaaaa ctcaccgatg tcagtaggga aagataatag gaatggcgat 240aacggttcgg ataacgaaaa ggcgaaccta gggttcttcc aatcagtaga tcctcgcgta 300attagcgatt tgattatcgg gctaagcgac ggtttaaccg tcccctttgc tctaacagct 360ggtctatctt cacttggtga tgcgaaattg gtcattaccg gtggctttgc tgagttgatc 420tcaggtgcga tttctatggg ccttggtggc tatttgggag cgaagagtga atctgattat 480tatcatgctg aagtaaagaa ggagaaaagg aaattttacg ataactccaa cctaattaac 540agagaaattg aagacattct gttagagatt aaccctaatt tctcggacga aacaatcgtt 600tcgttcatca aagatttaca aagaacacct gagctaatgg ttgacttcat tatcaggtac 660ggtaggggtc tagacgaacc tgccgagaat agagaactaa tcagtgcagt cactatcggt 720ggcggttatc tacttggtgg gctagtacca ctagtgccat atttttttgt ctcagatgtg 780ggcacaggtc tcatttattc tataatagtc atggttgtaa cattattctg gtttggttac 840gtaaagacaa aactatccat gggtagtggc agttcgactt caaagaaagt tacggaaggt 900gttgaaatgg ttgttgtggg tggtgtagca gcaggtgcgg cttggttctt tgttaagtta 960ctgggttaa 96978322PRTSaccharomyces cerevisiae 78Met Ser Ile Val Ala Leu Lys Asn Ala Val Val Thr Leu Ile Gln Lys1 5 10 15Ala Lys Gly Ser Gly Gly Thr Ser Glu Leu Gly Gly Ser Glu Ser Thr 20 25 30Pro Leu Leu Arg Gly Ser Asn Ser Asn Ser Ser Arg His Asp Asn Leu 35 40 45Ser Ser Ser Ser Ser Asp Ile Ile Tyr Gly Arg Asn Ser Ala Gln Asp 50 55 60Leu Glu Asn Ser Pro Met Ser Val Gly Lys Asp Asn Arg Asn Gly Asp65 70 75 80Asn Gly Ser Asp Asn Glu Lys Ala Asn Leu Gly Phe Phe Gln Ser Val 85 90 95Asp Pro Arg Val Ile Ser Asp Leu Ile Ile Gly Leu Ser Asp Gly Leu 100 105 110Thr Val Pro Phe Ala Leu Thr Ala Gly Leu Ser Ser Leu Gly Asp Ala 115 120 125Lys Leu Val Ile Thr Gly Gly Phe Ala Glu Leu Ile Ser Gly Ala Ile 130 135 140Ser Met Gly Leu Gly Gly Tyr Leu Gly Ala Lys Ser Glu Ser Asp Tyr145 150 155 160Tyr His Ala Glu Val Lys Lys Glu Lys Arg Lys Phe Tyr Asp Asn Ser 165 170 175Asn Leu Ile Asn Arg Glu Ile Glu Asp Ile Leu Leu Glu Ile Asn Pro 180 185 190Asn Phe Ser Asp Glu Thr Ile Val Ser Phe Ile Lys Asp Leu Gln Arg 195 200 205Thr Pro Glu Leu Met Val Asp Phe Ile Ile Arg Tyr Gly Arg Gly Leu 210 215 220Asp Glu Pro Ala Glu Asn Arg Glu Leu Ile Ser Ala Val Thr Ile Gly225 230 235 240Gly Gly Tyr Leu Leu Gly Gly Leu Val Pro Leu Val Pro Tyr Phe Phe 245 250 255Val Ser Asp Val Gly Thr Gly Leu Ile Tyr Ser Ile Ile Val Met Val 260 265 270Val Thr Leu Phe Trp Phe Gly Tyr Val Lys Thr Lys Leu Ser Met Gly 275 280 285Ser Gly Ser Ser Thr Ser Lys Lys Val Thr Glu Gly Val Glu Met Val 290 295 300Val Val Gly Gly Val Ala Ala Gly Ala Ala Trp Phe Phe Val Lys Leu305 310 315 320Leu Gly79347PRTBeijerinkia indica 79Met Lys Ala Leu Val Tyr Arg Gly Pro Gly Gln Lys Leu Val Glu Glu1 5 10 15Arg Gln Lys Pro Glu Leu Lys Glu Pro Gly Asp Ala Ile Val Lys Val 20 25 30Thr Lys Thr Thr Ile Cys Gly Thr Asp Leu His Ile Leu Lys Gly Asp 35 40 45Val Ala Thr Cys Lys Pro Gly Arg Val Leu Gly His Glu Gly Val Gly 50 55 60Val Ile Glu Ser Val Gly Ser Gly Val Thr Ala Phe Gln Pro Gly Asp65 70 75 80Arg Val Leu Ile Ser Cys Ile Ser Ser Cys Gly Lys Cys Ser Phe Cys 85 90 95Arg Arg Gly Met Phe Ser His Cys Thr Thr Gly Gly Trp Ile Leu Gly 100 105 110Asn Glu Ile Asp Gly Thr Gln Ala Glu Tyr Val Arg Val Pro His Ala 115 120 125Asp Thr Ser Leu Tyr Arg Ile Pro Ala Gly Ala Asp Glu Glu Ala Leu 130 135 140Val Met Leu Ser Asp Ile Leu Pro Thr Gly Phe Glu Cys Gly Val Leu145 150 155 160Asn Gly Lys Val Ala Pro Gly Ser Ser Val Ala Ile Val Gly Ala Gly 165 170 175Pro Val Gly Leu Ala Ala Leu Leu Thr Ala Gln Phe Tyr Ser Pro Ala 180 185 190Glu Ile Ile Met Ile Asp Leu Asp Asp Asn Arg Leu Gly Leu Ala Lys 195 200 205Gln Phe Gly Ala Thr Arg Thr Val Asn Ser Thr Gly Gly Asn Ala Ala 210 215 220Ala Glu Val Lys Ala Leu Thr Glu Gly Leu Gly Val Asp Thr Ala Ile225 230 235 240Glu Ala Val Gly Ile Pro Ala Thr Phe Glu Leu Cys Gln Asn Ile Val 245 250 255Ala Pro Gly Gly Thr Ile Ala Asn Val Gly Val His Gly Ser Lys Val 260 265 270Asp Leu His Leu Glu Ser Leu Trp Ser His Asn Val Thr Ile Thr Thr 275 280 285Arg Leu Val Asp Thr Ala Thr Thr Pro Met Leu Leu Lys Thr Val Gln 290 295 300Ser His Lys Leu Asp Pro Ser Arg Leu Ile Thr His Arg Phe Ser Leu305 310 315 320Asp Gln Ile Leu Asp Ala Tyr Glu Thr Phe Gly Gln Ala Ala Ser Thr 325 330 335Gln Ala Leu Lys Val Ile Ile Ser Met Glu Ala 340 345

* * * * *


uspto.report is an independent third-party trademark research tool that is not affiliated, endorsed, or sponsored by the United States Patent and Trademark Office (USPTO) or any other governmental organization. The information provided by uspto.report is based on publicly available data at the time of writing and is intended for informational purposes only.

While we strive to provide accurate and up-to-date information, we do not guarantee the accuracy, completeness, reliability, or suitability of the information displayed on this site. The use of this site is at your own risk. Any reliance you place on such information is therefore strictly at your own risk.

All official trademark data, including owner information, should be verified by visiting the official USPTO website at www.uspto.gov. This site is not intended to replace professional legal advice and should not be used as a substitute for consulting with a legal professional who is knowledgeable about trademark law.

© 2024 USPTO.report | Privacy Policy | Resources | RSS Feed of Trademarks | Trademark Filings Twitter Feed