U.S. patent application number 11/315801 was filed with the patent office on 2006-07-06 for hybrid enzymes.
This patent application is currently assigned to Novozymes A/S. Invention is credited to Carsten Andersen, Anders Vikso Nielsen, Henrik Oestdal, Tina Spendler, Allan Svendsen.
Application Number | 20060147581 11/315801 |
Document ID | / |
Family ID | 36640727 |
Filed Date | 2006-07-06 |
United States Patent
Application |
20060147581 |
Kind Code |
A1 |
Svendsen; Allan ; et
al. |
July 6, 2006 |
Hybrid enzymes
Abstract
The present invention relates to a polypeptide and the use
thereof.
Inventors: |
Svendsen; Allan; (Hoersholm,
DK) ; Andersen; Carsten; (Vaerlose, DK) ;
Spendler; Tina; (Maaloev, DK) ; Nielsen; Anders
Vikso; (Slangerup, DK) ; Oestdal; Henrik;
(Virum, DK) |
Correspondence
Address: |
NOVOZYMES NORTH AMERICA, INC.
500 FIFTH AVENUE
SUITE 1600
NEW YORK
NY
10110
US
|
Assignee: |
Novozymes A/S
Bagsvaerd
DK
|
Family ID: |
36640727 |
Appl. No.: |
11/315801 |
Filed: |
December 22, 2005 |
Related U.S. Patent Documents
|
|
|
|
|
|
Application
Number |
Filing Date |
Patent Number |
|
|
60717274 |
Sep 14, 2005 |
|
|
|
60639181 |
Dec 22, 2004 |
|
|
|
Current U.S.
Class: |
426/20 ; 435/204;
435/252.3; 435/471; 435/69.1; 536/23.2 |
Current CPC
Class: |
Y02E 50/17 20130101;
A21D 8/042 20130101; Y02E 50/10 20130101 |
Class at
Publication: |
426/020 ;
435/069.1; 435/204; 435/252.3; 435/471; 536/023.2 |
International
Class: |
A21D 8/02 20060101
A21D008/02; C07H 21/04 20060101 C07H021/04; C12P 21/06 20060101
C12P021/06; C12N 9/32 20060101 C12N009/32; C12N 15/74 20060101
C12N015/74 |
Foreign Application Data
Date |
Code |
Application Number |
Sep 9, 2005 |
DK |
PA 2005 01261 |
Dec 22, 2004 |
DK |
PA 2004 01976 |
Claims
1. polypeptide which polypeptide is a hybrid comprising; a) a first
amino acid sequence having endo-amylase activity and b) a second
amino acid sequence comprising a carbohydrate-binding module.
2. The polypeptide of claim 1, wherein said first amino acid
sequence and/or said second amino is derived from a bacterium
3. The polypeptide according to claim 1, wherein said second amino
acid sequence has at least 60% identity to the amino acid sequence
shown as amino acid residues 485 to 586 in SEQ ID NO:2.
4. The polypeptide according to claim 1, wherein said first amino
acid sequence has at least 60% identity to any amino acid sequence
selected from the group consisting of SEQ ID NO:35, SEQ ID NO:36,
SEQ ID NO:37, SEQ ID NO:38, SEQ ID NO:39, SEQ ID NO:40, SEQ ID
NO:41 and SEQ ID NO:42.
5. The polypeptide according to claim 1, having at least 60%
identity to any amino acid sequence selected from the group
consisting of SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID NO:10,
SEQ ID NO:12, SEQ ID NO:14.
6. The polypeptide according to claim 1, comprising a) the
catalytic domain shown in SEQ ID NO:40 or a homologous catalytic
domain, wherein one or more, or preferably all, of the following
substitutions have been introduced: R118K, D183*, G184*, N195F,
R320K, R458K, N33S, D36N, K37L, E391I, Q394R, K395D, T452Y and
N484P, using the numbering of SEQ ID NO: 40 and b) the CBM shown as
residue 485 to 585 of SEQ ID NO:2.
7. The polypeptide according to claim 1 comprising a) the catalytic
domain shown in SEQ.ID: 37 or a homologous catalytic domain and
comprising one or more, e.g. such as all of the following
alterations: S31A, D32N, 133L, E178*, G179*, N190F, K3891, K392R,
E393D, V508A and b) the CBM having the amino acid sequence shown as
amino acid residues 485 to 586 in SEQ ID NO:2.
8. The polypeptide according to claim 1 comprising a) the catalytic
domain shown in SEQ ID NO:40 or a homologous catalytic domain,
wherein one or more, or preferably all, of the following
substitutions have been introduced: R118K, D183*, G184*, N195F,
R320K, R458K and N484P, using the numbering of SEQ ID NO: and b)
the CBM shown as residue 485 to 585 of SEQ ID NO:2.
9. The polypeptide according to claim 1, wherein said polypeptide
has; a) an EIF1 larger than 1.0 at the test conditions given in the
specification, or b) an EIF2 larger than 1.0 at the test conditions
given in the specification.
10. The polypeptide according to claim 1, wherein said polypeptide
has at least 25% residual activity at 70.degree. C. at the test
conditions given in the specification.
11. The polypeptide according to claim 1, wherein the addition of 2
times the effective dosage of said polypeptide to a dough results
in an ELR of less than 15%.
12. The polypeptide according to claim 1, wherein the addition of 2
times the effective dosage of said polypeptide to a dough results
in an ELR.sub.N of less than 15%.
13. A process for preparing a dough or an edible product made from
a dough, which process comprises adding the polypeptide according
to claim 1 to the dough.
14. The process of claim 13 wherein the edible product is a baked
product.
15. The process of claim 13 wherein the addition of 2 times the
effective dosage of said polypeptide results in an ELR of less than
15%.
16. The process according to claim 13 wherein the addition of 2
times the effective dosage of said polypeptide results in an
ELR.sub.N of less than 15%.
17. The process according to claim 13 wherein the polypeptide gives
a cohesiveness reduction of less than 5% when dosed to give a
dHardness of at least 85 units at the test conditions given in the
specification.
18. The process according to claim 13 wherein the polypeptide when
added together with 300 MANU Novamyl/kg flour gives a cohesiveness
reduction of less than 5% when dosed to give a dHardness of at
least 15 units at the test conditions given in the
specification
19. The process according to claim 13 wherein the polypeptide gives
a cohesiveness reduction of less than 5% when dosed to give a
dMobility of at least 400 units at the test conditions given in the
specification
20. The process according to claim 13 wherein the polypeptide when
added together with 300 MANU Novamyl/kg flour gives a cohesiveness
reduction of less than 5% when dosed to give a dMobility of at
least 1100 units at the test conditions given in the
specification
21-52. (canceled)
Description
CROSS-REFERENCE TO RELATED APPLICATIONS
[0001] This application claims, under 35 U.S.C. 119, priority of
Danish application nos. PA 2004 01976, filed Dec. 22, 2004, PA 2005
01261, filed Sep. 9, 2005, and the benefit of U.S. provisional
application Nos. 60/639,181, filed Dec. 22, 2004 and 60/717,274,
filed Sep. 14, 2005, the contents of which are fully incorporated
herein by reference.
SEQUENCE LISTING
[0002] This application contains a sequence listing. A computer
readable form containing the sequence listing accompanies this
application, and the computer readable form of the sequence listing
is hereby incorporated by reference.
FIELD OF THE INVENTION
[0003] The present invention relates, inter alia, to hybrid enzymes
comprising a carbohydrate binding module and having endo-amylase
activity. The enzymes may be applied in processes comprising starch
modification and/or degradation, or in dough making processes.
BACKGROUND OF THE INVENTION
[0004] Bacterial endo-amylases are used in a large number of
processes, e.g., for liquefaction of starch in processes where
starch is modified, and/or degraded to smaller polymers or monomers
of glucose. The degradation products may used in the industry,
e.g., as maltose and/or fructose syrups or further processed in a
fermentation step to a fermentation product, e.g., ethanol. The
bacterial endo-amylases are used in baking to give additional
softness and a better moistness of the bread crumb. However, the
endo-amylases are easy to overdose which may results in gumminess
and an undesirable loss in elasticity in the baked product. There
is a need for endo-amylases with improved properties for use in
various processes, e.g., within starch processing and baking.
SUMMARY OF THE INVENTION
[0005] The present inventors have now surprisingly discovered that
by addition of a carbohydrate binding module (CBM) to an
endo-amylase the catalytic activity of the endo-amylase can be
modified thereby resulting in an increased baking performance
compared to the wild type enzyme. There is no significant change in
the taste or smell of the baked product. Without being bound by
theory it is suggested that the effect is due to an increased
activity towards raw starch in the dough conferred by the CBM,
and/or a reduced activity towards the heated starch in the baking
bread conferred by the CBM. The endo-amylase with a CBM can be used
as a baking enzyme with less risk of overdosing compared to the
enzyme without a CBM. Such hybrids consisting of a polypeptide
having endo-amylase activity and a carbohydrate binding module,
primarily having affinity for starch like e.g. the CBM20, have the
advantage over existing endo-amylases that by selecting a catalytic
domain with desire properties e.g. the pH profile, the temperature
profile, the oxidation resistance, the calcium stability, the
substrate affinity or the product profile can be combined with a
carbohydrate binding module with stronger or weaker binding
affinities, e.g., specific affinities for amylose, specific
affinities for amylopectin or affinities for specific structure in
the carbohydrate. The hybrid may be used as a baking additive,
e.g., as an anti-staling enzyme.
[0006] The present inventors have further surprisingly discovered
that by adding a carbohydrate-binding module (CBM) to an
endo-amylase the activity and specificity can be altered thereby
increasing the efficacy of various starch degrading processes,
e.g., comprising degradation of raw, e.g., ungelatinized starch as
well as gelatinized starch. Due to the superior hydrolysis activity
of these endo-amylases having a CBM the overall starch conversion
process can be performed without having to gelatinize the starch,
i.e. the endo-amylases having a CBM hydrolyses granular starch in a
raw starch process as well as fully or partially gelatinized starch
in a traditional starch process.
[0007] Accordingly the invention provides in a first aspect a
polypeptide which polypeptide is a hybrid comprising; a first amino
acid sequence having endo-amylase activity and a second amino acid
sequence comprising a carbohydrate-binding module. Preferably said
first amino acid sequence and/or said amino acid second sequence is
derived from a bacterium. The second amino acid sequence has
preferably at least 60% identity to the amino acid sequence shown
as amino acid residues 485 to 586 in SEQ ID NO:2 and/or the first
amino acid sequence has at least 60% identity to the amino acid
sequence shown in SEQ ID NO:35.
[0008] In a second aspect the invention provides a process for
preparing a dough or an edible product made from a dough, which
process comprises adding the polypeptide of the first aspect to a
dough.
[0009] In a third and a fourth aspect the invention provides a
composition comprising the polypeptide of the first aspect, and a
dough- or bread-improving additive in the form of a granulate or
agglomerated powder comprising the polypeptide of the first
aspect.
[0010] In a fifth aspect the invention provides a process for
designing a polypeptide suitable for baking, said process
comprising; providing a first amino acid sequence having
endo-amylase activity, and a second amino acid sequence comprising
a carbohydrate-binding module; wherein said first amino acid
sequence is derived from a bacterium; providing a second amino acid
sequence comprising a carbohydrate-binding module; and constructing
a polypeptide comprising said first amino acid sequence with said
second amino acid sequence.
[0011] In a sixth aspect the invention provides a process for
preparing composition, e.g., a bread improving additive, is
produced in a process comprising the steps of; a) providing a first
amino acid sequence having endo-amylase activity; b) providing a
second amino acid sequence comprising a carbohydrate-binding
module; c) and constructing a polypeptide comprising said first
amino acid sequence and second amino acid sequence; d) providing a
DNA sequence encoding said polypeptide; e) expressing said DNA
sequence in a suitable host cell and recovering said polypeptide;
f) adding said polypeptide to flour or to a granulate or
agglomerated powder.
[0012] In a seventh aspect the invention provides a process for
preparing a dough or an edible product made from a dough, which
process comprises; providing a first amino acid sequence having
endo-amylase activity; providing a second amino acid sequence
comprising a carbohydrate-binding module; and constructing a
polypeptide comprising said first amino acid sequence and second
amino acid sequence; providing a DNA sequence encoding said
polypeptide; expressing said DNA sequence in a suitable host cell
and recovering said polypeptide; and adding said polypeptide to a
dough.
[0013] In a eigth aspect the invention provides a process for
saccharifying starch, wherein a starch is treated with the
polypeptide according to the first aspect.
[0014] In a ninth aspect the invention provides a process
comprising; contacting a starch with a polypeptide comprising a
first amino acid sequence having endo-amylase activity, and a
second amino acid sequence comprising a carbohydrate-binding
module; wherein said first amino acid sequence and/or said second
amino acid sequence is derived from a bacterium; incubating said
starch with said polypeptide for a time and at a temperature
sufficient to achieve conversion of at least 90% w/w of said starch
substrate into fermentable sugars; fermenting to produce a
fermentation product, and optionally recovering the fermentation
product, wherein said polypeptide may be a polypeptide according to
the first aspect
[0015] In an tenth aspect the invention provides a process
comprising; a) contacting a starch substrate with a yeast cell
transformed to express a polypeptide comprising a first amino acid
sequence having endo-amylase activity, and a second amino acid
sequence comprising a carbohydrate-binding module; b) holding said
starch substrate with said yeast for a time and at a temperature
sufficient to achieve conversion of at least 90% w/w of said starch
substrate into fermentable sugars; c) fermenting to produce
ethanol; optionally recovering ethanol; wherein steps a, b, and c
are performed separately or simultaneously and wherein said
polypeptide may be a polypeptide according to the first aspect
[0016] In an eleventh aspect the invention provides a process of
producing ethanol from starch-containing material by fermentation,
said process comprises: a) liquefying said starch-containing
material with a polypeptide comprising a first amino acid sequence
having endo-amylase activity, and a second amino acid sequence
comprising a carbohydrate-binding module; wherein said first amino
acid sequence and/or second amino acid sequence is derived from a
bacterium; b) saccharifying the liquefied mash obtained; c)
fermenting the material obtained in step (b) in the presence of a
fermenting organism.
[0017] In still further aspects the invention provides a DNA
sequence encoding a polypeptide according to the first aspect, a
DNA construct comprising said DNA sequence, a recombinant
expression vector which carries said DNA construct, a host cell
which is transformed with said DNA construct or said vector, said
host cell being a bacterium or a fungal cell, a plant cell, or a
yeast cell.
DETAILED DESCRIPTION OF THE INVENTION
Hybrid Enzymes
[0018] The polypeptide of the invention may be a hybrid enzyme
comprises a first amino acid sequence having endo-amylase activity,
and a second amino acid sequence comprising a carbohydrate-binding
module (CBM). The hybrid may be produced by fusion of a first DNA
sequences encoding a first amino acid sequences and a second DNA
sequences encoding a second amino acid sequences, or the hybrid may
be produced as a completely synthetic gene based on knowledge of
the amino acid sequences of suitable CBMs, linkers and catalytic
domains.
[0019] The terms "hybrid enzyme" (also referred to as "fusion
protein", "hybrid", hybrid polypeptide" or "hybrid protein) is used
herein to characterize the polypeptides of the invention comprising
a first amino acid sequence comprising at least one catalytic
module having endo-amylase activity and a second amino acid
sequence comprising at least one carbohydrate-binding module
wherein the first and the second are derived from different
sources. The term "source" being understood as e.g., but not
limited, to a parent enzyme, or a variant thereof, e.g., an amylase
or glucoamylase, or other catalytic activity comprising a suitable
catalytic module and/or a suitable CBM and/or a suitable linker.
However the CBM may also be derived from a polypeptide having no
catalytic activity. The first and the second amino acid sequence
may be derived from the same bacterial strain, from strains within
the same species, from closely related species or less related
organisms. Preferably the first and the second amino acid sequence
of the hybrids derived from different sources, e.g., from different
enzymes from the same strain and/or species, or e.g., from strains
within different species.
[0020] Enzyme classification numbers (EC numbers) referred to in
the present specification are in accordance with the
Recommendations of the Nomenclature Committee of the International
Union of Biochemistry and Molecular Biology
(http://www.chem.qmw.ac.uk/iubmb/enzyme/).
[0021] Hybrid enzymes as referred to herein include species
comprising an amino acid sequence of an endo-amylase, i.e. an
alpha-amylase (EC 3.2.1.1) which is linked (i.e. covalently bound)
to an amino acid sequence comprising a carbohydrate-binding module
(CBM). The hybrid enzyme is thus an enzyme capable of catalyzing
hydrolysis of starch in an endo-fashion.
[0022] CBM-containing hybrid enzymes, as well as detailed
descriptions of the preparation and purification thereof, are known
in the art [see, e.g., WO 90/00609, WO 94/24158 and WO 95/16782, as
well as Greenwood et al. Biotechnology and Bioengineering 44 (1994)
pp. 1295-1305]. They may, e.g., be prepared by transforming into a
host cell a DNA construct comprising at least a fragment of DNA
encoding the carbohydrate-binding module ligated, with or without a
linker, to a DNA sequence encoding the enzyme of interest, and
growing the transformed host cell to express the fused gene. The
linker may be a bond (i.e. comprising 0 residues), or a short
linking group comprising from about 2 to about 100 carbon atoms, in
particular of from 2 to 40 carbon atoms. However, the linker is
preferably a sequence of 0 amino acid residues (e.g., just a bond)
or it is from about 2 to about 100 amino acid residues, more
preferably of from 2 to 40 amino acid residues, such as from 2 to
15 amino acid residues. Preferably the linker is not sensitive to
or at least has low sensitivity towards hydrolysis by a protease,
which e.g., may be present during production of the hybrid and/or
during the industrial application of the hybrid. The CBM in a
hybrid enzyme of the type in question may be positioned
C-terminally, N-terminally or internally in the hybrid enzyme. In
an embodiment a polypeptide may comprise more than one CBM, e.g.,
two CBMs; one positioned C-terminally, the other N-terminally or
the two CBMs in tandem positioned C-terminally, N-terminally or
internally. However, polypeptides with more than two CBMs are
equally contemplated.
Polypeptide Identity
[0023] The term polypeptide "identity" is understood as the degree
of identity between two sequences indicating a derivation of the
first sequence from the second. The identity may suitably be
determined by means of computer programs known in the art such as
GAP provided in the GCG program package (Program Manual for the
Wisconsin Package, Version 8, August 1994, Genetics Computer Group,
575 Science Drive, Madison, Wis., USA 53711) (Needleman, S. B. and
Wunsch, C. D., (1970), Journal of Molecular Biology, 48, 443-453.
The following settings for amino acid sequence comparison are used:
GAP creation penalty of 3.0 and GAP extension penalty of 0.1. The
relevant part of the amino acid sequence for the identity
determination is the mature polypeptide, i.e. without the signal
peptide.
Carbohydrate-Binding Modules
[0024] A carbohydrate-binding module (CBM), or as often referred
to, a carbohydrate-binding domain (CBD), is a polypeptide amino
acid sequence which binds preferentially to a poly- or
oligosaccharide (carbohydrate), frequently--but not necessarily
exclusively--to a water-insoluble (including crystalline) form
thereof.
[0025] CBMs derived from starch degrading enzymes are often
referred to as starch-binding modules or SBMs (CBMs which may occur
in certain amylolytic enzymes, such as certain glucoamylases, or in
enzymes such as cyclodextrin glucanotransferases, or in
endo-amylases). SBMs are often referred to as SBDs (Starch Binding
Domains). Prefered for the invention are CBMs which are Starch
Binding Modules.
[0026] CBMs are found as integral parts of large polypeptides or
proteins consisting of two or more polypeptide amino acid sequence
regions, especially in hydrolytic enzymes (hydrolases) which
typically comprise a catalytic module containing the active site
for substrate hydrolysis and a carbohydrate-binding module (CBM)
for binding to the carbohydrate substrate in question. Such enzymes
can comprise more than one catalytic module and one, two or three
CBMs, and optionally further comprise one or more polypeptide amino
acid sequence regions linking the CBM(s) with the catalytic
module(s), a region of the latter type usually being denoted a
"linker". CBMs have also been found in algae, e.g., in the red alga
Porphyra purpurea in the form of a non-hydrolytic
polysaccharide-binding protein.
[0027] In proteins/polypeptides in which CBMs occur (e.g., enzymes,
typically hydrolytic enzymes), a CBM may be located at the N or C
terminus or at an internal position.
[0028] That part of a polypeptide or protein (e.g., hydrolytic
enzyme) which constitutes a CBM per se typically consists of more
than about 30 and less than about 250 amino acid residues. The
"Carbohydrate-Binding Module of Family 20" or a CBM-20 module is in
the context of this invention defined as a sequence of
approximately 100 amino acids having at least 45% identity to the
Carbohydrate-Binding Module (CBM) of the polypeptide disclosed in
FIG. 1 by Joergensen et al (1997) in Biotechnol. Lett.
19:1027-1031. The CBM comprises the last 102 amino acids of the
polypeptide, i.e. the subsequence from amino acid 582 to amino acid
683. The numbering of Glycoside Hydrolase Families applied in this
disclosure follows the concept of Coutinho, P. M. & Henrissat,
B. (1999) CAZy--Carbohydrate-Active Enzymes server at URL:
http://afmb.cnrs-mrs.fr/.about.cazy/CAZY/index.html or
alternatively Coutinho, P. M. & Henrissat, B. 1999; The modular
structure of cellulases and other carbohydrate-active enzymes: an
integrated database approach. In "Genetics, Biochemistry and
Ecology of Cellulose Degradation", K. Ohmiya, K. Hayashi, K. Sakka,
Y. Kobayashi, S. Karita and T. Kimura eds., Uni Publishers Co.,
Tokyo, pp. 15-23, and Bourne, Y.& Henrissat, B. 2001; Glycoside
hydrolases and glycosyltransferases: families and functional
modules, Current Opinion in Structural Biology 11:593-600.
[0029] Examples of enzymes which comprise a CBM suitable for use in
the context of the invention are endo-amylases (i.e. alpha-amylases
in EC 3.2.1.1), maltogenic alpha-amylases (EC 3.2.1.133),
glucoamylases (EC 3.2.1.3) or from CGTases (EC 2.4.1.19).
[0030] Preferred for the invention is CBMs of Carbohydrate-Binding
Module Family 20. CBMs of Carbohydrate-Binding Module Family 20
suitable for the invention may be derived from beta-amylases of
Bacillus cereus (SWISSPROT P36924), or from CGTases of Bacillus
circulans (SWISSPROT P43379). Also preferred for the invention is
any CBM having at least 60%, at least 70%, at least 80% or even at
least 90% identity to any of the afore mentioned CBM amino acid
sequences. Further suitable CBMs of Carbohydrate-Binding Module
Family 20 may be found at URL:
http://afmb.cnrs-mrs.fr/.about.cazy/CAZY/index.html).
[0031] Once a nucleotide sequence encoding the substrate-binding
(carbohydrate-binding) region has been identified, either as cDNA
or chromosomal DNA, it may then be manipulated in a variety of ways
to fuse it to a DNA sequence encoding the enzyme of interest. The
DNA fragment encoding the carbohydrate-binding amino acid sequence
and the DNA encoding the enzyme of interest are then ligated with
or without a linker. The resulting ligated DNA may then be
manipulated in a variety of ways to achieve expression.
[0032] CBMs deriving from bacteria will generally be suitable for
use in the context of the invention, however, preferred are CBMs of
bacillus origin, such as a CBM20 from Bacillus flavothermus (Syn.
Anoxybacillus contaminans), preferably from amylase AMY1048 (SEQ ID
NO:2 herein), AMY1039, or AMY1079 (disclosed as respectively SEQ ID
NO1, 2 and 3 in PCT/US2004/023031 [NZ10474]), the Bacillus amylases
disclosed in WO 2002068589 from Diversa, Bacillus sp. TS23 (Korea)
(Lin, L.-L.; Submitted (1-Mar.-1995) to the EMBL/GenBank/DDBJ
databases. Long-Liu Lin, Food Industry Research Institute, Culture
Collection and Research Center, 331 Food Road, Hsinchu, Taiwan 300,
Republic of China).
[0033] In a particular embodiment the CBM sequence has the amino
acid sequence shown as amino acid residues 485 to 586 in SEQ ID
NO:2 or the CBM sequence has an amino acid sequence having at least
60%, at least 70%, at least 80% or even at least 90% identity to
the afore mentioned amino acid sequence.
[0034] In another preferred embodiment the CBM sequence has an
amino acid sequence which differs from the amino acid sequence
shown as amino acid residues 485 to 586 in SEQ ID NO:2 in no more
than 10 positions, no more than 9 positions, no more than 8
positions, no more than 7 positions, no more than 6 positions, no
more than 5 positions, no more than 4 positions, no more than 3
positions, no more than 2 positions, or even no more than 1
position.
Endo-Amylase Sequence
[0035] Endo-amylases which are appropriate as the basis for
CBM/amylase hybrids of the types employed in the context of the
present invention include those of bacterial origin and having
endo-amylase activity. The endo-activity of the amylase may be
determined according to the assay in the "Materials and methods"
section of the present application. Preferred are endo-amylase
derived from Bacillus sp., particularly from B. licheniformis, B.
amyloliquefaciens, B. stearothermophilus or B. flavothermus. The
endo-amylase is preferably an endo-amylase having at least 60%, at
least 70%, at least 80% or even at least 90% identity to the
amylase from Bacillus licheniformis (BLA, SEQ ID NO:8 in
WO2002/010355) shown in SEQ ID NO:35 herein. This includes but are
not limited to the the amylase from B. licheniformis variant LE429
(WO2002/010355) shown in SEQ ID NO:41 herein, the amylase from B.
stearothermophilus (BSG, SEQ ID NO:6 in WO2002/010355) shown in SEQ
ID NO:36 herein, the amylase from B. amyloliquefacience (BAN, SEQ
ID NO:10 in WO2002/010355) shown in SEQ ID NO:37 herein, the
amylase from B. halodurance SP722 (SEQ ID NO:4 in WO2002/010355)
shown in SEQ ID NO:38 herein, SP690 (WO9526397) shown in SEQ ID
NO:39 herein, the amylase from M560 (SEQ ID NO:12 in WO2002/010355)
shown in SEQ ID:40 herein, the amylase from alkaline Bacillus
strains like e.g., SP707 (Tsukamoto et al., Biochemical and
Biophysical Research Communications, 151 (1988), pp. 25-31.), the
amylase KSM-AP1378 (WO9700324/KAO), the amylases KSM-K36 and
KSM-K38 (EP 1,022,334-A/KAO), the amylase SP7-7 (WO0210356/Henkel),
and the amylase AAI-6 (WO0060058), AMRK385
(PCT/DK01/00133)--fragments, variants or truncated forms of above.
The endo-amylase sequence may also be derived from Pseudomonas
saccharophilia, such as from the amylase disclosed as SEQ ID NO:1
in WO 2004111217. Preferably endo-amylase sequence comprises the
amino acid residues 1 to 417 shown in SEQ ID NO:42 herein.
[0036] Preferably the endo-amylase is a wild type enzyme or the
endo-amylase is a variant endo-amylases comprising amino acid
modifications leading to increased activity and/or increased
protein stability at low pH, and/or at high pH, increased stability
towards calcium depletion, and/or increased stability at elevated
temperature. Chemically or genetically modified mutants of such
endo-amylases are included in this connection.
[0037] The B. licheniformis endo-amylase BLA shown in SEQ ID NO:35
is a wild type amylase made up of a catalytic fragment of 483 amino
acid. The catalytic domain can be divided into the central
core-domain harboring the catalytic center and a C domain
c-terminal to the catalytic domain. In Seq. ID 8/NN10062 the
catalytic core domain consist of the first 396 amino acids and the
C domain is defined as the amino acids from 397 to 483
[0038] The variant of the B. licheniformis endo-amylase, LE429
shown in SEQ ID NO:41 consist of a catalytic fragment of 481 amino
acid. The catalytic domain can be divided into the central
core-domain harboring the catalytic center and a C domain
c-terminal to the catalytic domain. In SEQ ID NO:41 the catalytic
core domain consist of the first 394 amino acids and the C domain
is defined as the amino acids from 395 to 481.
[0039] The B. amyloliquefacience endo-amylase, BAN shown in SEQ ID
NO:37 is a wild type amylase made up of a catalytic fragment of 483
amino acid. The catalytic domain can be divided into the central
core-domain harboring the catalytic center and a C domain
c-terminal to the catalytic domain. In SEQ ID NO:37 the catalytic
core domain consist of the first 396 amino acids and the C domain
is defined as the amino acids from 397 to 483.
[0040] The B. stearothermophilus endo-amylase, BSG shown in SEQ ID
NO:36 is a wild type amylase made up of a catalytic fragment of 483
amino acid and in addition a c-terminal extension. The catalytic
domain can further be divided into the central core-domain
harboring the catalytic center and a C domain c-terminal to the
catalytic domain. In SEQ ID NO:36 the catalytic core domain consist
of the first 396 aa, the C domain is defined as the amino acids
from 397 to 483 and the c-terminal extension is defines as amino
acids 484 to 515.
[0041] The B. halodurance endo-amylase SP722 shown in SEQ ID NO:38
is a wild type amylase made up of a catalytic fragment of 485 amino
acid. The core domain can further be divided into the central
AB-domain harboring the catalytic center and a C domain c-terminal
to the catalytic domain. In SEQ ID NO:38 the catalytic core domain
consist of the first 398 amino acids and the C domain is defined as
the amino acids from 399 to 485.
[0042] The alkaline Bacillus endo-amylase, AA560 shown in SEQ ID:40
herein is a wild type amylase made up of a catalytic fragment of
485 amino acid. The core domain can further be divided into the
central AB-domain harboring the catalytic center and a C domain
c-terminal to the catalytic domain. The catalytic core domain
consist of the first 398 amino acids and the C domain is defined as
the amino acids from 399 to 485. The catalytic core domain is
encoded by nucleotide 1-1194 and the C domain is encoded by the
nucleotides 1189-1455.
[0043] In a particular embodiment of the first aspect the
endo-amylase sequence has the amino acid sequence shown in SEQ ID
NO:35, SEQ ID NO:36, SEQ ID NO:37, SEQ ID NO:38, SEQ ID NO:39, SEQ
ID NO:40, SEQ ID NO:41, SEQ ID NO:42 or the endo-amylase sequence
has an amino acid sequence having at least 60%, 65%, 70%, 75%, 80%,
85%, 90%, 95%, 97% or even at least 99% identity to any of the
afore mentioned amino acid sequences.
[0044] In yet another preferred embodiment of the first aspect the
endo-amylase sequence has an amino acid sequence which differs from
any of the amino acid sequence amino acid sequences shown in SEQ ID
NO:35, SEQ ID NO:36, SEQ ID NO:37, SEQ ID NO:38, SEQ ID NO:39, SEQ
ID NO:40, SEQ ID NO:41, SEQ ID NO:42 in no more than 10 positions,
no more than 9 positions, no more than 8 positions, no more than 7
positions, no more than 6 positions, no more than 5 positions, no
more than 4 positions, no more than 3 positions, no more than 2
positions, or even no more than 1 position.
[0045] In a preferred embodiment of the first aspect the
endo-amylase sequence has an amino acid sequence as shown in SEQ
ID:40 (M560), and comprising one or more of the following
alterations R118K, D183*, G184*, N195F, R320K and R458K.
[0046] In another particularly preferred embodiment of the first
aspect the endo-amylase sequence has an amino acid sequence as
shown in SEQ ID:40, and comprising one or more, e.g., such as all,
of the following alterations R118K, D183*, G184*, N195F, R320K,
R458K, N33S, D36N, K37L, E391I, Q394R, K395D, T452Y and N484P.
[0047] In another particularly preferred embodiment of the first
aspect the endo-amylase sequence has an amino acid sequence as
shown in SEQ ID:40, and comprising one or more, e.g., such as all,
of the following alterations R118K, D183*, G184*, N195F, R320K,
R458K and N484P.
[0048] In yet another highly preferred embodiment of the first
aspect the endo-amylase sequence has an amino acid sequence as
shown in SEQ ID NO:37 and comprise one or more, e.g such as all of
the following alterations: S31A, D32N, 133L, E178*, G179*, N190F,
K3891, K392R, E393D, V508A
Preferred Hybrids
[0049] In a particular embodiment the hybrid of the invention has
amino acid sequence shown in SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8,
SEQ ID NO:10, SEQ ID NO:12, SEQ ID NO:14 or the hybrid of the
invention has an amino acid sequence having at least 60%, at least
70%, at least 80% or even at least 90% identity to any of the afore
mentioned amino acid sequences.
[0050] In yet another preferred embodiment the hybrid of the
invention has an amino acid sequence which differs from the amino
acid sequence amino acid sequence shown in SEQ ID NO:4, SEQ ID
NO:6, SEQ ID NO:8, SEQ ID NO:10, SEQ ID NO:12, SEQ ID NO:14 in no
more than 10 positions, no more than 9 positions, no more than 8
positions, no more than 7 positions, no more than 6 positions, no
more than 5 positions, no more than 4 positions, no more than 3
positions, no more than 2 positions, or even no more than 1
position.
[0051] In a preferred embodiment the polypeptide of the invention
comprises a) the catalytic domain shown in SEQ ID NO:40 or a
homologous catalytic domain, and b) the CBM shown as residue 485 to
585 of SEQ ID NO:2, wherein one or more, or preferably all, of the
following substitutions have been introduced: R118K, D183*, G184*,
N195F, R320K, R458K, N33S, D36N, K37L, E391I, Q394R, K395D, T452Y
and N484P, using the numbering of SEQ ID NO: 40.
[0052] In another preferred embodiment the polypeptide of the
invention comprises the catalytic domain shown in SEQ ID NO:40 or a
homologous catalytic domain, and b) the CBM shown as residue 485 to
585 of SEQ ID NO:2, wherein one or more, or preferably all, of the
following substitutions have been introduced: R118K, D183*, G184*,
N195F, R320K, R458K and N484P, using the numbering of SEQ ID NO:
40.
[0053] In yet another preferred embodiment the polypeptide of the
invention comprises the catalytic domain shown in SEQ ID: 37 and
comprise one or more, e.g. such as all of the following
alterations: S31A, D32N, 133L, E178*, G179*, N190F, K3891, K392R,
E393D, V508A and a CBM having the amino acid sequence shown as
amino acid residues 485 to 586 in SEQ ID NO:2.
Stabilization of Hybrids
[0054] A hybrid of the invention may be volatile to proteolytic
attack if the CBM and catalytic domain proteins do not form
sufficiently tight protein-protein interactions. However, the
stability of the hybrid can be improved by introducing
substitutions on the surface of either of the proteins to create a
stable hybrid.
[0055] The present inventors have identified the following amino
acid residues on the surface of bacterial endo-amylases, e.g., such
polypeptides having at least 60% identity to the amylase from
Bacillus licheniformis (SEQ ID NO:8), to be in close contact with
the CBM when comprised in the hybrid of the invention, i.e. within
less than 5.0 .ANG. distance. These residues are suitable targets
for mutations in order to make a stable hybrid: 12, 29, 30, 32, 33,
34, 35, 36, 37, 38, 368, 371, 372, 381, 383, 384, 386, 387, 388,
389, 390, 391, 392, 394, 395, 396, 422, 423, 448, 449, 450, 451,
452, 453, 454, 455, 456, 458, 459, 460, 461, 483, 484, 485 using
the numbering of SEQ ID NO: 40. Preferably the catalytic domain of
the hybrid of the invention comprises one or more substitutions in
positions corresponding to these residues.
[0056] In a preferred embodiment the hybrid of the invention
comprises a) the catalytic domain shown in SE ID NO:40 or a
homologous catalytic domain, and b) the CBM shown as residue 485 to
585 of SEQ ID NO:2, wherein one or more, or preferably all, of the
following substitutions have been introduced: N33S, K35S/A,
D36A/N/S, K37L, E391I, Q394R, K395D, N484A/P using the numbering of
SEQ ID NO: 40.
[0057] On the surface of the CBM protruding towards the catalytic
domain of the hybrid the following residues are found in close
contact with the catalytic domain, i.e. within 5.0 .ANG. distance,
and these residues are suitable targets for mutations in order to
make a stable hybrid: 485, 486, 487, 488, 507, 512, 513, 514, 515,
516, 517, 518, 519, 520, 521, 522, 523, 524, 526, 538, 539, 540,
541, 553, 554, 555, 556, 557, 558, 559 using the numbering of SEQ
ID NO: 2.
Expression Vectors
[0058] The present invention also relates to recombinant expression
vectors which may comprise a DNA sequence encoding the hybrid
enzyme, a promoter, a signal peptide sequence, and transcriptional
and translational stop signals. The various DNA and control
sequences described above may be joined together to produce a
recombinant expression vector which may include one or more
convenient restriction sites to allow for insertion or substitution
of the DNA sequence encoding the polypeptide at such sites.
Alternatively, the DNA sequence of the present invention may be
expressed by inserting the DNA sequence or a DNA construct
comprising the sequence into an appropriate vector for expression.
In creating the expression vector, the coding sequence is located
in the vector so that the coding sequence is operably linked with
the appropriate control sequences for expression, and possibly
secretion.
[0059] The recombinant expression vector may be any vector (e.g., a
plasmid or virus), which can be conveniently subjected to
recombinant DNA procedures and can bring about the expression of
the DNA sequence. The choice of the vector will typically depend on
the compatibility of the vector with the host cell into which the
vector is to be introduced. The vectors may be linear or closed
circular plasmids. The vector may be an autonomously replicating
vector, i.e. a vector which exists as an extrachromosomal entity,
the replication of which is independent of chromosomal replication,
e.g., a plasmid, an extrachromosomal element, a minichromosome, a
cosmid or an artificial chromosome. The vector may contain any
means for assuring self-replication. Alternatively, the vector may
be one which, when introduced into the host cell, is integrated
into the genome and replicated together with the chromosome(s) into
which it has been integrated. The vector system may be a single
vector or plasmid or two or more vectors or plasmids which together
contain the total DNA to be introduced into the genome of the host
cell, or a transposon.
Host Cells
[0060] The host cell of the invention, either comprising a DNA
construct or an expression vector comprising the DNA sequence
encoding the polypeptide of the first aspect, e.g., a hybrid
enzyme, is advantageously used as a host cell in the recombinant
production of the hybrid enzyme, wild type enzyme or a genetically
modified wild type enzyme. The cell may be transformed with an
expression vector. Alternatively, the cell may be transformed with
the DNA construct of the invention encoding the hybrid enzyme or a
genetically modified wild type enzyme, conveniently by integrating
the DNA construct (in one or more copies) in the host chromosome.
Integration of the DNA construct into the host chromosome may be
performed according to conventional methods, e.g., by homologous or
heterologous recombination.
[0061] The host cell may be any appropriate prokaryotic or
eukaryotic cell, e.g., a bacterial cell, a filamentous fungus cell,
a yeast cell, a plant cell or a mammalian cell.
Isolating and Cloning a DNA Sequence Encoding a Parent
Endo-Amylase
[0062] The techniques used to isolate or clone a DNA sequence
encoding the polypeptide of the first aspect, e.g., a hybrid
enzyme, are known in the art and include isolation from genomic
DNA, preparation from cDNA, or a combination thereof. The cloning
of the DNA sequences of the present invention from such genomic DNA
can be effected, e.g., by using the well known polymerase chain
reaction (PCR) or antibody screening of expression libraries to
detect cloned DNA fragments with shared structural features. See,
e.g., Innis et al., 1990, PCR: A Guide to Methods and Application,
Academic Press, New York. Other DNA amplification procedures such
as ligase chain reaction (LCR), ligated activated transcription
(LAT) and DNA sequence-based amplification (NASBA) may be used.
[0063] The DNA sequence encoding a parent endo-amylase may be
isolated from any cell or microorganism producing the endo-amylase
in question, using various methods well known in the art. First, a
genomic DNA and/or cDNA library should be constructed using
chromosomal DNA or messenger RNA from the organism that produces
the endo-amylase to be studied. Then, if the amino acid sequence of
the endo-amylase is known, labeled oligonucleotide probes may be
synthesized and used to identify endo-amylase-encoding clones from
a genomic library prepared from the organism in question.
Alternatively, a labelled oligonucleotide probe containing
sequences homologous to another known endo-amylase gene could be
used as a probe to identify endo-amylase-encoding clones, using
hybridization and washing conditions of very low to very high
stringency.
[0064] Yet another method for identifying endo-amylase-encoding
clones would involve inserting fragments of genomic DNA into an
expression vector, such as a plasmid, transforming
endo-amylase-negative bacteria with the resulting genomic DNA
library, and then plating the transformed bacteria onto agar
containing a substrate for endo-amylase (i.e. maltose), thereby
allowing clones expressing the endo-amylase to be identified.
[0065] Alternatively, the DNA sequence encoding the enzyme may be
prepared synthetically by established standard methods, e.g., the
phosphoroamidite method described S. L. Beaucage and M. H.
Caruthers, (1981), Tetrahedron Letters 22, p. 1859-1869, or the
method described by Matthes et al. (1984), EMBO J. 3, p. 801-805.
In the phosphoroamidite method, oligonucleotides are synthesized,
e.g., in an automatic DNA synthesizer, purified, annealed, ligated
and cloned in appropriate vectors.
[0066] Finally, the DNA sequence may be of mixed genomic and
synthetic origin, mixed synthetic and cDNA origin or mixed genomic
and cDNA origin, prepared by ligating fragments of synthetic,
genomic or cDNA origin (as appropriate, the fragments corresponding
to various parts of the entire DNA sequence), in accordance with
standard techniques. The DNA sequence may also be prepared by
polymerase chain reaction (PCR) using specific primers, for
instance as described in U.S. Pat. No. 4,683,202 or R. K. Saiki et
al. (1988), Science 239, 1988, pp. 487-491.
Isolated DNA Sequence
[0067] The present invention relates, inter alia, to an isolated
DNA sequence comprising a DNA sequence encoding a polypeptide of
the first aspect, e.g., a hybrid enzyme.
[0068] The term "isolated DNA sequence" as used herein refers to a
DNA sequence, which is essentially free of other DNA sequences,
e.g., at least about 20% pure, preferably at least about 40% pure,
more preferably at least about 60% pure, even more preferably at
least about 80% pure, and most preferably at least about 90% pure
as determined by agarose electrophoresis.
[0069] For example, an isolated DNA sequence can be obtained by
standard cloning procedures used in genetic engineering to relocate
the DNA sequence from its natural location to a different site
where it will be reproduced. The cloning procedures may involve
excision and isolation of a desired DNA fragment comprising the DNA
sequence encoding the polypeptide of interest, insertion of the
fragment into a vector molecule, and incorporation of the
recombinant vector into a host cell where multiple copies or clones
of the DNA sequence will be replicated. An isolated DNA sequence
may be manipulated in a variety of ways to provide for expression
of the polypeptide of interest. Manipulation of the DNA sequence
prior to its insertion into a vector may be desirable or necessary
depending on the expression vector. The techniques for modifying
DNA sequences utilizing recombinant DNA methods are well known in
the art.
DNA Construct
[0070] The present invention relates, inter alia, to a DNA
construct comprising a DNA sequence encoding a polypeptide of the
first aspect. "DNA construct" is defined herein as a DNA molecule,
either single- or double-stranded, which is isolated from a
naturally occurring gene or which has been modified to contain
segments of DNA, which are combined and juxtaposed in a manner,
which would not otherwise exist in nature. The term DNA construct
is synonymous with the term expression cassette when the DNA
construct contains all the control sequences required for
expression of a coding sequence of the present invention.
Site-Directed Mutagenesis
[0071] Once a parent endo-amylase-encoding DNA sequence suitable
for use in a polypeptide of the first aspect has been isolated, and
desirable sites for mutation identified, mutations may be
introduced using synthetic oligonucleotides. These oligonucleotides
contain nucleotide sequences flanking the desired mutation sites.
In a specific method, a single-stranded gap of DNA, the
endo-amylase-encoding sequence, is created in a vector carrying the
endo-amylase gene. Then the synthetic nucleotide, bearing the
desired mutation, is annealed to a homologous portion of the
single-stranded DNA. The remaining gap is then filled in with DNA
polymerase I (Klenow fragment) and the construct is ligated using
T4 ligase. A specific example of this method is described in
Morinaga et al. (1984), Biotechnology 2, p. 646-639. U.S. Pat. No.
4,760,025 disclose the introduction of oligonucleotides encoding
multiple mutations by performing minor alterations of the cassette.
However, an even greater variety of mutations can be introduced at
any one time by the Morinaga method, because a multitude of
oligonucleotides, of various lengths, can be introduced.
[0072] Another method for introducing mutations into
endo-amylase-encoding DNA sequences is described in Nelson and
Long, (1989), Analytical Biochemistry 180, p. 147-151. It involves
the 3-step generation of a PCR fragment containing the desired
mutation introduced by using a chemically synthesized DNA strand as
one of the primers in the PCR reactions. From the PCR-generated
fragment, a DNA fragment carrying the mutation may be isolated by
cleavage with restriction endonucleases and reinserted into an
expression plasmid.
Localized Random Mutagenesis
[0073] The random mutagenesis may be advantageously localized to a
part of the parent endo-amylase in question. This may, e.g., be
advantageous when certain regions of the enzyme have been
identified to be of particular importance for a given property of
the enzyme, and when modified are expected to result in a variant
having improved properties. Such regions may normally be identified
when the tertiary structure of the parent enzyme has been
elucidated and related to the function of the enzyme.
[0074] The localized or region-specific, random mutagenesis is
conveniently performed by use of PCR generated mutagenesis
techniques as described above or any other suitable technique known
in the art. Alternatively, the DNA sequence encoding the part of
the DNA sequence to be modified may be isolated, e.g., by insertion
into a suitable vector, and said part may be subsequently subjected
to mutagenesis by use of any of the mutagenesis methods discussed
above.
Expression of the Enzymes in Plants
[0075] A DNA sequence encoding an enzyme of interest, such as a
hybrid enzyme of the present invention, may be transformed and
expressed in transgenic plants as described below.
[0076] The transgenic plant can be dicotyledonous or
monocotyledonous, for short a dicot or a monocot. Examples of
monocot plants are grasses, such as meadow grass (blue grass, Poa),
forage grass such as Festuca, Lolium, temperate grass, such as
Agrostis, and cereals, e.g., wheat, oats, rye, barley, rice,
sorghum and maize (corn).
[0077] Examples of dicot plants are tobacco, legumes, such as
lupins, potato, sugar beet, pea, bean and soybean, and cruciferous
plants (family Brassicaceae), such as cauliflower, oil seed rape
and the closely related model organism Arabidopsis thaliana.
[0078] Examples of plant parts are stem, callus, leaves, root,
fruits, seeds, and tubers as well as the individual tissues
comprising these parts, e.g., epidermis, mesophyll, parenchyme,
vascular tissues, meristems. In the present context, also specific
plant cell compartments, such as chloroplast, apoplast,
mitochondria, vacuole, peroxisomes and cytoplasm are considered to
be a plant part. Furthermore, any plant cell, whatever the tissue
origin, is considered to be a plant part. Likewise, plant parts
such as specific tissues and cells isolated to facilitate the
utilisation of the invention are also considered plant parts e.g.,
embryos, endosperms, aleurone and seeds coats.
[0079] Also included within the scope of the invention are the
progeny of such plants, plant parts and plant cells.
[0080] The transgenic plant or plant cell expressing the enzyme of
interest may be constructed in accordance with methods known in the
art. In short the plant or plant cell is constructed by
incorporating one or more expression constructs encoding the enzyme
of interest into the plant host genome and propagating the
resulting modified plant or plant cell into a transgenic plant or
plant cell.
[0081] Conveniently, the expression construct is a DNA construct
which comprises a gene encoding the enzyme of interest in operable
association with appropriate regulatory sequences required for
expression of the gene in the plant or plant part of choice.
Furthermore, the expression construct may comprise a selectable
marker useful for identifying host cells into which the expression
construct has been integrated and DNA sequences necessary for
introduction of the construct into the plant in question (the
latter depends on the DNA introduction method to be used).
[0082] The choice of regulatory sequences, such as promoter and
terminator sequences and optionally signal or transit sequences is
determined, e.g., on the basis of when, where and how the enzyme is
desired to be expressed. For instance, the expression of the gene
encoding the enzyme of the invention may be constitutive or
inducible, or may be developmental, stage or tissue specific, and
the gene product may be targeted to a specific cell compartment,
tissue or plant part such as seeds or leaves. Regulatory sequences
are, e.g., described by Tague et al, Plant, Phys., 86, 506,
1988.
[0083] For constitutive expression the 35S-CaMV, the maize
ubiquitin 1 and the rice actin 1 promoter may be used (Franck et
al. 1980. Cell 21: 285-294, Christensen AH, Sharrock R A and Quail
1992. Maize polyubiquitin genes: structure, thermal perturbation of
expression and transcript splicing, and promoter activity following
transfer to protoplasts by electroporation. Plant Mo. Biol. 18,
675-689.; Zhang W, McElroy D. and Wu R 1991, Analysis of rice Act1
5' region activity in transgenic rice plants. Plant Cell 3,
1155-1165). Organ-specific promoters may, e.g., be a promoter from
storage sink tissues such as seeds, potato tubers, and fruits
(Edwards & Coruzzi, 1990. Annu. Rev. Genet. 24: 275-303), or
from metabolic sink tissues such as meristems (Ito et al., 1994.
Plant Mol. Biol. 24: 863-878), a seed specific promoter such as the
glutelin, prolamin, globulin or albumin promoter from rice (Wu et
al., Plant and Cell Physiology Vol. 39, No. 8 pp. 885-889 (1998)),
a Vicia faba promoter from the legumin B4 and the unknown seed
protein gene from Vicia faba described by Conrad U. et al, Journal
of Plant Physiology Vol. 152, No. 6 pp. 708-711 (1998), a promoter
from a seed oil body protein (Chen et al., Plant and cell
physiology vol. 39, No. 9 pp. 935-941 (1998), the storage protein
napA promoter from Brassica napus, or any other seed specific
promoter known in the art, e.g., as described in WO 91/14772.
Furthermore, the promoter may be a leaf specific promoter such as
the rbcs promoter from rice or tomato (Kyozuka et al., Plant
Physiology Vol. 102, No. 3 pp. 991-1000 (1993), the chlorella virus
adenine methyltransferase gene promoter (Mitra, A. and Higgins, D
W, Plant Molecular Biology Vol. 26, No. 1 pp. 85-93 (1994), or the
aldP gene promoter from rice (Kagaya et al., Molecular and General
Genetics Vol. 248, No. 6 pp. 668-674 (1995), or a wound inducible
promoter such as the potato pin2 promoter (Xu et al, Plant
Molecular Biology Vol. 22, No. 4 pp. 573-588 (1993). Likewise, the
promoter may inducible by abiotic treatments such as temperature,
drought or alterations in salinity or induced by exogenously
applied substances that activate the promoter e.g., ethanol,
oestrogens, plant hormones like ethylene, abscisic acid and
gibberellic acid and heavy metals.
[0084] A promoter enhancer element may be used to achieve higher
expression of the enzyme in the plant. For instance, the promoter
enhancer element may be an intron which is placed between the
promoter and the nucleotide sequence encoding the enzyme. For
instance, Xu et al. op cit disclose the use of the first intron of
the rice actin 1 gene to enhance expression.
[0085] The selectable marker gene and any other parts of the
expression construct may be chosen from those available in the
art.
[0086] The DNA construct is incorporated into the plant genome
according to conventional techniques known in the art, including
Agrobacterium-mediated transformation, virus-mediated
transformation, micro injection, particle bombardment, biolistic
transformation, and electroporation (Gasser et al, Science, 244,
1293; Potrykus, Bio/Techn. 8, 535, 1990; Shimamoto et al, Nature,
338, 274, 1989).
[0087] Presently, Agrobacterium tumefaciens mediated gene transfer
is the method of choice for generating transgenic dicots (for
review Hooykas & Schilperoort, 1992. Plant Mol. Biol. 19:
15-38), and can also be used for transforming monocots, although
other transformation methods often are used for these plants.
Presently, the method of choice for generating transgenic monocots
supplementing the Agrobacterium approach is particle bombardment
(microscopic gold or tungsten particles coated with the
transforming DNA) of embryonic calli or developing embryos
(Christou, 1992. Plant J. 2: 275-281; Shimamoto, 1994. Curr. Opin.
Biotechnol. 5: 158-162; Vasil et al., 1992. Bio/Technology 10:
667-674). An alternative method for transformation of monocots is
based on protoplast transformation as described by Omirulleh S, et
al., Plant Molecular biology Vol. 21, No. 3 pp. 415-428 (1993).
[0088] Following transformation, the transformants having
incorporated the expression construct are selected and regenerated
into whole plants according to methods well-known in the art. Often
the transformation procedure is designed for the selective
elimination of selection genes either during regeneration or in the
following generations by using e.g., co-transformation with two
separate T-DNA constructs or site specific excision of the
selection gene by a specific recombinase.
Dough-Based Products
[0089] The hybrid enzyme of the present invention may be used for
the preparation of a dough-based edible product such as, bread,
tortillas, cakes, pancakes, biscuits, cookies, pie crusts, more
preferably baked products, such as, bread products.
[0090] The dough used to prepare the dough based product generally
comprises flour, e.g., from grains, such as, wheat flour, corn
flour, rye flour, oat flour, or sorghum flour. The dough is
generally leavened by the addition of a suitable yeast culture,
such as a culture of Saccharomyces cerevisiae (baker's yeast) or a
chemical leavening agent.
[0091] The edible dough based product may preferably be any kind of
baked product prepared from dough, either of a soft or a crisp
character, either of a white, light or dark type. Preferred edible
dough based products include bread (in particular white, wheat,
whole-meal, low-carb, brown, multi-grain, dark and rye bread),
typically in the form of loaves, buns or rolls, and more
preferably, pan bread, hamburger buns, French baguette-type bread,
pita bread, tortillas, cakes, pancakes, biscuits, cookies, pie
crusts, crisp bread, steamed bread, pizza crust and the like.
[0092] The edible dough-based product is made by heating the dough,
e.g., by baking or steaming. Examples are steamed or baked bread
(in particular white, whole-meal or rye bread), typically in the
form of loaves or rolls. The edible dough based product may also be
prepared by frying (e.g., deep frying in hot fat or oil). An
example of such an edible product is a doughnut.
[0093] The hybrid enzyme of the first aspect of the invention
preferably have a high tolerance towards overdosing. The addition
of the polypeptide of the invention, e.g., the polypeptide of the
first aspect, in 2 times, 3 times, preferably 4 times, more
preferably 5 times, most preferably 6 times the effective dosage of
said polypeptide to a dough results in an ELR and/or an ELR.sub.N
of less than 15%, less than 10%, less than 7%, less than 6%, less
than 5%, less than 4% or even less than 3%.
[0094] In a further aspect the polypeptide of the invention has a
residual activity of at least 20%, such as at least 25% or 30%,
preferably at least 35%, more preferably at least 40% and most
preferably at least 50%, at the test conditions given in the
specification.
[0095] The polypeptide of the present invention may further have an
improved exo-to-endo ratio de-fined as IEF1 or IEF2 in the
specification. The IEF1 or IEF2 of the polypeptide may be larger
than 1, such as 1.1 or 1.5, preferably 2 or 2.5 or 3, more
preferably 3.5 or 4, most preferably 5 or 7 or 10.
[0096] In further embodiments the invention provides polypeptides
with characteristics that are of particular interest for baking
purposes, namely a residual activity of at least 25% at 70.degree.
C. at the test conditions given in the specification, an increased
exo-to-endo ratio (IEF), where IEF is larger than 1, and finally a
reduced cohesiveness of less than 5% (at the test conditions given
in the specification) while change in hardness is at least 85 units
(at the test conditions given in the specification) and/or change
mobility of free water is at least 1100 units (at the test
conditions given in the specification).
[0097] For baking purpose the polypeptide of the invention may give
a cohesiveness reduction, when measured at the test conditions
given in the specification, of at least 5%, while dHard-ness, when
measured at the test conditions given in the specification, is at
least 85 units, such as 90 units or 100 units, preferably 150 units
or 200 units, more preferably 250 units or 300 units, most
preferably 400 units or 600 units. In a further embodiment the
polypeptide of the invention may give a cohesiveness reduction,
when measured at the test conditions given in the specification, of
at least 4%, while dHardness, when measured at the test condi-tions
given in the specification, is at least 85 units, such as 90 units
or 100 units, preferably 150 units or 200 units, more preferably
250 units or 300 units, most preferably 400 units or 600 units. In
a still further embodiment the polypeptide of the invention may
give a cohesive-ness reduction, when measured at the test
conditions given in the specification, of at least 2%, while
dHardness, when measured at the test conditions given in the
specification, is at least 85 units, such as 90 units or 100 units,
preferably 150 units or 200 units, more prefera-bly 250 units or
300 units, most preferably 400 units or 600 units. In yet another
embodiment the polypeptide of the invention may give a cohesiveness
reduction, when measured at the test conditions given in the
specification, of at least 1%, while dHardness, when measured at
the test conditions given in the specification, is at least 85
units, such as 90 units or 100 units, preferably 150 units or 200
units, more preferably 250 units or 300 units, most prefera-bly 400
units or 600 units.
[0098] When the polypeptide of the invention is added together with
300 MANU Novamyl.RTM./kg flour it may give a cohesiveness
reduction, when measured at the test conditions given in the
specification, of at least 5%, while dHardness, when measured at
the test conditions given in the specification, is at least 15
units, such as 20 units or 30 units, preferably 40 units or 50
units, more preferably 60 units or 70 units, most preferably 85
units or 100 units. In a further embodiment the polypeptide of the
invention may give a cohesiveness reduction, when measured at the
test conditions given in the specification, of at least 4%, while
dHardness, when measured at the test conditions given in the
specification, is at least 15 units, such as 20 units or 30 units,
preferably 40 units or 50 units, more preferably 60 units or 70
units, most preferably 85 units or 100 units. In a still further
embodiment the polypeptide of the invention may give a cohesiveness
reduction, when measured at the test conditions given in the
specification, of at least 2%, while dHardness, when measured at
the test conditions given in the specification, is at least 15
units, such as 20 units or 30 units, preferably 40 units or 50
units, more preferably 60 units or 70 units, most preferably 85
units or 100 units. In yet another embodiment the polypeptide of
the invention may give a cohesiveness reduction, when measured at
the test conditions given in the specification, of at least 1%,
while dHardness, when measured at the test conditions given in the
specification, is at least 15 units, such as 20 units or 30 units,
preferably 40 units or 50 units, more preferably 60 units or 70
units, most preferably 85 units or 100 units.
[0099] For baking purpose the polypeptide of the invention may give
a cohesiveness reduction, when measured at the test conditions
given in the specification, of at least 5%, while dMobil-ity, when
measured at the test conditions given in the specification, is at
least 300 units, such as 400 units or 500 units, preferably 600
units or 700 units, more preferably 800 units or 900 units, most
preferably 1000 units or 1200 units. In a further embodiment the
polypeptide of the invention may give a cohesiveness reduction,
when measured at the test conditions given in the specification, of
at least 4%, while dMobility, when measured at the test condi-tions
given in the specification, is at least 300 units, such as 400
units or 500 units, prefera-bly 600 units or 700 units, more
preferably 800 units or 900 units, most preferably 1000 units or
1200 units. In a still further embodiment the polypeptide of the
invention may give a cohe-siveness reduction, when measured at the
test conditions given in the specification, of at least 2%, while
dMobility, when measured at the test conditions given in the
specification, is at least 300 units, such as 400 units or 500
units, preferably 600 units or 700 units, more preferably 800 units
or 900 units, most preferably 1000 units or 1200 units. In yet
another embodiment the polypeptide of the invention may give a
cohesiveness reduction, when measured at the test conditions given
in the specification, of at least 1%, while dMobility, when
measured at the test conditions given in the specification, is at
least 300 units, such as 400 units or 500 units, preferably 600
units or 700 units, more preferably 800 units or 900 units, most
preferably 1000 units or 1200 units.
[0100] When the polypeptide of the invention is added together with
300 MANU Novamyl.RTM./kg flour it may give a cohesiveness
reduction, when measured at the test conditions given in the
specification, of at least 5%, while dMobility, when measured at
the test conditions given in the specification, is at least 1000
units, such as 1100 units or 1200 units, preferably 1400 units or
1500 units, more preferably 1800 units or 2000 units, most
preferably 2200 units or 2500 units. In a further embodiment the
polypeptide of the invention may give a cohesive-ness reduction,
when measured at the test conditions given in the specification, of
at least 4%, while dMobility, when measured at the test conditions
given in the specification, is at least 1000 units, such as 1100
units or 1200 units, preferably 1400 units or 1500 units, more
preferably 1800 units or 2000 units, most preferably 2200 units or
2500 units. In a still further embodiment the polypeptide of the
invention may give a cohesiveness reduction, when measured at the
test conditions given in the specification, of at least 2%, while
dMobility, when measured at the test conditions given in the
specification, is at least 1000 units, such as 1100 units or 1200
units, preferably 1400 units or 1500 units, more preferably 1800
units or 2000 units, most preferably 2200 units or 2500 units. In
yet another embodiment the poly-peptide of the invention may give a
cohesiveness reduction, when measured at the test con-ditions given
in the specification, of at least 1%, while dMobility, when
measured at the test conditions given in the specification, is at
least 1000 units, such as 1100 units or 1200 units, preferably 1400
units or 1500 units, more preferably 1800 units or 2000 units, most
preferably 2200 units or 2500 units.
[0101] The above values for cohesiveness reduction, dHardness and
dMobility are particularly rele-vant for bread, in particular for
bread prepared by the sponge and dough method. Similar correlation
between cohesiveness reduction and dHardness and dMobility is
disclosed in Example 7.
[0102] The hybrid enzyme of the present invention may optionally be
used together with one or more additional enzymes and/or
anti-staling agents.
[0103] Anti-staling agents include but are not limited to
emulsifiers, hydrocolloids and enzymatic anti-staling agents. As
used herein, an anti-staling agent refers to a chemical, biological
or enzymatic agent which can retard staling of the dough-based
products, that is, which can reduce the rate deterioration of the
softness of the dough based product during storage. The softness of
dough based products (and the anti-staling effect of the
anti-staling agent) can be evaluated empirically by the skilled
test baker or measured using a texture analyzer (e.g., TAXT2), as
is known in the art.
[0104] Examples of chemical anti-staling agents include polar
lipids, e.g., fatty acids and their monoglyceride esters, such as,
described in U.S. Pat. No. 4,160,848.
[0105] In a preferred embodiment, the anti-staling agent is an
anti-staling enzyme, which is preferably added to the dough prior
to cooking (e.g., baking). Examples of anti-staling enzymes
include, without limitation, endo-amylases, such as the hybrids of
the invention, exo-endo-amylases, such as, e.g., the exo-amylase
described in U.S. Pat. No. 6,667,065 and US 2004/0043109,
pullulanases, glycosyltransferases, amyloglycosidases, branching
enzymes (1,4-alpha-glyucan branching enzyme),
4-alpha-glucanotransferases (dextrin transferase), beta-amylases,
maltogenic alpha-amylases, lipases, phospholipases, galactolipases,
acyltransferases, pectate lyases, xylanases, xyloglucan
endotransglycosylases, proteases, e.g., as described in WO
2003/084331, peptidases and combinations thereof.
[0106] The amylase may be from a fungus, bacterium or plant. It may
be an endo-amylase, e.g., from Bacillus, particularly B.
licheniformis or B. amyloliquefaciens, a beta-amylase, e.g., from
plant (e.g., soy bean) or from microbial sources (e.g., Bacillus),
such as the non-maltogenic Bacillus clausii alpha-amylase disclosed
in WO9950399A2, the Pseudomonas saccharophilia amylase in SEQ ID
NO:1 of WO 2004111217, or a glucoamylase, or a fungal endo-amylase,
e.g., from A. niger or A. oryzae.
[0107] More preferably, the additional enzyme is an anti-staling
enzyme and preferably the anti-staling enzyme is a maltogenic
amylase (EC 3.2.1.133). The maltogenic amylases is added into the
dough in an amount effective to retard the staling of the product,
such as, at least 500 MANU/kg flour, more preferably in an amount
of at least 500 to 1500 MANU/kg flour. A maltogenic amylase may be
obtained from any suitable source, such as derived from a bacteria,
such as Bacillus, preferably B. stearothermophilus, e.g., from
strain NCIB 11837 or a variant thereof made by amino acid
modification (EP 494233 B1, U.S. Pat. No. 6,162,628). The
maltogenic amylase may preferably be added at a dosage of 20 to
2000 MANU/kg flour, preferably 500 to 1000 MANU/kg flour, more
preferably, at least 750 MANU/kg flour, at least 1000 MANU/kg
flour. A preferred maltogenic amylase is Novamyl.RTM. (available
form Novozymes A/S).
[0108] In another preferred embodiment, the anti-staling enzyme is
a xylanase. The xylanase may be obtained from any suitable source,
e.g., from Bacillus, e.g., Bacillus subtilis, as described in WO
2003/010923, WO 2001/066711 or WO 2000/039289, and Aspergillus, in
particular of A. aculeatus, A. niger, A. awamori, or A. tubigensis
or Trichoderma and Thermomyces as described in WO 96/32472, e.g., T
reesei, or from a strain of Humicola, e.g., H. insolens.
Optionally, an additional enzyme may be used together with the
above anti-staling enzymes, such as, a lipolytic enzyme,
particularly phospholipase, galactoilipase and/or triacyl glycerol
lipase activity, e.g., as described in WO 9953769, WO 0032758, WO
0200852 or WO 2002066622. or e.g., a transglutaminase, a cellulytic
enzyme, e.g., a cellulase, an acyltransferase, a protein disulfide
isomerase, a pectinase, a pectate lyase, an oxidoreductase. The
enzyme may be of any origin, including mammalian, plant, and
preferably microbial (bacterial, yeast or fungal) origin and may be
obtained by techniques conventionally used in the art.
[0109] The additional enzyme may also be a lipolytic enzyme,
particularly phospholipase, galactoilipase and/or triacyl glycerol
lipase activity, e.g., as described in WO 9953769, WO 0032758, WO
0200852 or WO 2002066622.
[0110] Further, the additional enzyme may be a second amylase, a
cyclodextrin glucanotransferase, a protease or peptidase, in
particular an exopeptidase, a trans-glutaminase, a lipase, a
phospholipase, a cellulase, a hemicellulase, a glycosyltransferase,
a branching enzyme (1,4-alpha-glucan branching enzyme) or an
oxidoreductase. The additional enzyme may be of mammalian, plant or
microbial (bacterial, yeast or fungal) origin.
[0111] The second amylase may be from a fungus, bacterium or plant.
It may be a maltogenic amylase (EC 3.2.1.133), e.g., from B.
stearothermophilus, an endo-amylase, e.g., from Bacillus,
particularly B. licheniformis or B. amyloliquefaciens, a
beta-amylase, e.g., from plant (e.g., soy bean) or from microbial
sources (e.g., Bacillus), a glucoamylase, e.g., from A. niger, or a
fungal endo-amylase, e.g., from A. oryzae or from Pseudomonas
saccharophilia such as the non-maltogenic alpha-amylase disclosed
in WO9950399A2.
[0112] The hemicellulase may be a pentosanase, e.g., a xylanase
which may be of microbial origin, e.g., derived from a bacterium or
fungus, such as a strain of Aspergillus, in particular of A.
aculeatus, A. niger, A. awamori, or A. tubigensis, from a strain of
Trichoderma, e.g., T. reesei, or from a strain of Humicola, e.g.,
H. insolens.
[0113] The protease may be from Bacillus, e.g., B.
amyloliquefaciens.
[0114] The oxidoreductase may be a glucose oxidase, a carbohydrate
oxidase, a hexose oxidase, a lipoxidase, a peroxidase, or a
laccase.
Dough and/or Bread-Improving Additive
[0115] The hybrid enzyme of the present invention may be provided
as a dough and/or bread improving additive in the form of a
granulate or agglomerated powder. The dough and/or bread improving
additive may preferably have a narrow particle size distribution
with more than 95% (by weight) of the particles in the range from
25 to 500 .mu.m.
[0116] In a preferred embodiment a composition, e.g., a bread
improving additive, is produced in a process comprising the steps
of; a) providing a first amino acid sequence having endo-amylase
activity; b) providing a second amino acid sequence comprising a
carbohydrate-binding module; c) and constructing a polypeptide
comprising said first amino acid sequence and second amino acid
sequence; d) providing a DNA sequence encoding said polypeptide; e)
expressing said DNA sequence in a suitable host cell and recovering
said polypeptide; f) adding said polypeptide to flour or to a
granulate or agglomerated powder.
[0117] Granulates and agglomerated powders may be prepared by
conventional methods, e.g., by spraying the amylase, i.e. the
hybrid enzyme, onto a carrier in a fluid-bed granulator. The
carrier may consist of particulate cores having a suitable particle
size. The carrier may be soluble or insoluble, e.g., a salt (such
as NaCl or sodium sulfate), a sugar (such as sucrose or lactose), a
sugar alcohol (such as sorbitol), starch, rice, corn grits, or
soy.
Starch Processing
[0118] The polypeptide of this invention, i.e. an endo-amylase
having a CBM, possesses valuable properties allowing for a variety
of industrial applications. In particular, enzymes of the first
aspect are applicable as a component in washing, dishwashing and
hard-surface cleaning detergent compositions. Numerous variants are
particularly useful in the production of sweeteners and ethanol
from starch, and/or for textile desizing. One example of producing
ethanol, wherein an endo-amylase of the invention may be used is
disclosed in U.S. Pat. No. 5,231,017 which is hereby incorporated
by reference.
[0119] Further, a process wherein an endo-amylase of the invention
may be used is disclosed in DK patent application PA 2003 01568
(hereby incorporated by reference). Said process comprises
hydrolysing starch into a soluble starch hydrolysate at a
temperature below the initial gelatinization temperature of said
granular starch. Another suitable process is disclosed in
WO2004081193 (hereby incorporated by reference).
[0120] Conditions for conventional starch-conversion processes,
including starch liquefaction and/or saccharification processes are
described in, e.g., U.S. Pat. No. 3,912,590 and in EP patent
publications Nos. 252,730 and 63,909.
[0121] A preferred use is in a fermentation process wherein a
starch substrate is liquefied and/or saccharified in the presence
of the endo-amylase having a CBM to produce glucose and/or maltose,
e.g., for use as sweeteners or suitable for conversion into a
fermentation product by a fermenting organism, preferably a yeast.
Such fermentation processes include a process for producing ethanol
for fuel or drinking ethanol (portable alcohol), a process for
producing a beverage, a process for producing organic compounds,
such as citric acid, itaconic acid, lactic acid, gluconic acid;
ketones; amino acids, such as glutamic acid (sodium
monoglutaminate), but also more complex compounds such as
antibiotics, such as penicillin, tetracyclin; enzymes; vitamins,
such as riboflavin, B12, beta-carotene; hormones, which are
difficult to produce synthetically.
Production of Sweeteners from Starch:
[0122] A "traditional" process for conversion of starch to fructose
syrups normally consists of three consecutive enzymatic processes,
viz. a liquefaction process followed by a saccharification process
and an isomerization process. During the liquefaction process,
starch is degraded to dextrins by an endo-amylase, preferably by an
endo-amylase having a CBM, such as the polypeptide of the invention
at pH values between 5.5 and 6.2 and at temperatures of
95-160.degree. C. for a period of approx. 2 hours. In order to
ensure an optimal enzyme stability under these conditions, 1 mM of
calcium is added (40 ppm free calcium ions).
[0123] After the liquefaction process the dextrins are converted
into dextrose by addition of a g-lucoamylase (e.g., AMG.TM.) and a
debranching enzyme, such as an isoamylase or a pullulanase (e.g.,
Promozyme.TM.). Before this step the pH is reduced to a value below
4.5, maintaining the high temperature (above 95.degree. C.), and
the liquefying endo-amylase activity is denatured. The temperature
is lowered to 60.degree. C., and glucoamylase and debranching
enzyme are added. The sac-charification process proceeds for 24-72
hours.
[0124] After the saccharification process the pH is increased to a
value in the range of 6-8, preferably pH 7.5, and the calcium is
removed by ion exchange. The dextrose syrup is then converted into
high fructose syrup using, e.g., an immmobilized glucoseisomerase
(such as Sweetzyme.TM.).
[0125] In an embodiment of a starch process of the invention,
milled gelatinized whole grain raw material is broken down
(hydrolyzed) into maltodextrins (dextrins) mostly of a DE higher
than 4 using the polypeptide of the first aspect. The raw material
is in one embodiment of the invention milled (whole) grain.
[0126] In an embodiment of the invention, enzymatic liquefaction is
carried out as a three-step hot slurry process. The slurry is
heated to between 60-95.degree. C., preferably 80-85.degree. C.,
and the enzyme(s) is(are) added to initiate liquefaction
(thinning), at least a polypeptide of the first aspect is added.
Then the slurry is jet-cooked at a temperature between
95-140.degree. C., preferably 105-125.degree. C. to complete
gelanitization of the slurry. Then the slurry is cooled to
60-95.degree. C. and more enzyme(s), preferably comprising the
polypeptide of the first aspect, is (are), added to finalize
hydrolysis (secondary liquefaction). The liquefaction process is
carried out at pH 4.5-6.5, in particular at a pH between 5 and 6.
Milled and liquefied whole grains are known as mash. The
polypeptide of the first aspect may be added in effective amounts
well known to the person skilled in the art.
[0127] In an aspect the process may comprise; a) contacting a
starch substrate with a endo-amylase having a CBM, e.g., the
polypeptide of the first aspect; b) incubating said starch
substrate with said polypeptide and a fungal alpha-amylase and/or
or a glucoamylase for a time and at a temperature sufficient to
achieve liquefaction and saccharification of at least 90%, or at
least 92%, at least 94%, at least 95%, at least 96%, at least 97%,
at least 98%, at least 99%, at least 99.5% w/w of said starch
substrate into fermentable sugars; c) fermenting to produce a
fermentation product, d) optionally recovering the fermentation
product.
[0128] In yet another aspect the process comprising liquefaction
and/or hydrolysis of a slurry of gelatinized or granular starch, in
particular liquefaction and/or hydrolysis of granular starch into a
soluble starch hydrolysate at a temperature below the initial
gelatinization temperature of said granular starch. In addition to
being contacted with a polypeptide of the invention, e.g, the
polypeptide of the first aspect, the starch may be contacted with
an enzyme selected from the group consisting of; a fungal
alpha-amylase (EC 3.2.1.1), a beta-amylase (E.C. 3.2.1.2), and a
glucoamylase (E.C.3.2.1.3). In an embodiment further a debranching
enzyme, such as an isoamylase (E.C. 3.2.1.68) or a pullulanases
(E.C. 3.2.1.41) may be added.
[0129] In an embodiment the process is conducted at a temperature
below the initial gelatinization temperature. Preferably the
temperature at which the processes are conducted is at least
30.degree. C., at least 31.degree. C., at least 32.degree. C., at
least 33.degree. C., at least 34.degree. C., at least 35.degree.
C., at least 36.degree. C., at least 37.degree. C., at least
38.degree. C., at least 39.degree. C., at least 40.degree. C., at
least 41.degree. C., at least 42.degree. C., at least 43.degree.
C., at least 44.degree. C., at least 45.degree. C., at least
46.degree. C., at least 47.degree. C., at least 48.degree. C., at
least 49.degree. C., at least 50.degree. C., at least 51.degree.
C., at least 52.degree. C., at least 53.degree. C., at least
54.degree. C., at least 55.degree. C., at least 56.degree. C., at
least 57.degree. C., at least 58.degree. C., at least 59.degree.
C., or preferably at least 60.degree. C. The pH at which the
process is conducted may in be in the range of 3.0 to 7.0,
preferably from 3.5 to 6.0, or more preferably from 4.0-5.0. In a
preferred embodiment the process comprises fermentation, e.g with a
yeast to produce ethanol, e.g., at a temperature around 32.degree.
C., such as from 30 to 35.degree. C. During the fermentation the
ethanol content reaches at least 7%, at least 8%, at least 9%, at
least 10% such as at least 11%, at least 12%, at least 13%, at
least 14%, at least 15% such as at least 16% ethanol (w/w).
[0130] The starch slurry to be used in any of the above aspects may
have 20-55% dry solids granular starch, preferably 25-40% dry
solids granular starch, more preferably 30-35% dry solids granular
starch. After being contacted with the endo-amylase having a CBM,
e.g, the polypeptide of the first aspect at least 85%, at least
86%, at least 87%, at least 88%, at least 89%, at least 90%, at
least 91%, at least 92%, at least 93%, at least 94%, at least 95%,
at least 96%, at least 97%, at least 98%, or preferably at least
99% of the dry solids of the granular starch is converted into a
soluble starch hydrolysate.
[0131] In another preferred embodiment the endo-amylase having a
CBM, e.g, the polypeptide of the first aspect, is used in a process
for liquefaction, saccharification of a gelatinized starch, e.g.,
but not limited to gelatinization by jet cooking. The process may
comprise fermentation to produce a fermentation product, e.g.,
ethanol. Such a process for producing ethanol from
starch-containing material by fermentation comprises: (i)
liquefying said starch-containing material with a endo-amylase
having a CBM, e.g, the polypeptide of the first aspect; (ii)
saccharifying the liquefied mash obtained; (iii) fermenting the
material obtained in step (ii) in the presence of a fermenting
organism. Optionally the process further comprises recovery of the
ethanol. The saccharification and fermentation may be carried out
as a simultaneous saccharification and fermentation process (SSF
process). During the fermentation the ethanol content reaches at
least 7%, at least 8%, at least 9%, at least 10% such as at least
11%, at least 12%, at least 13%, at least 14%, at least 15% such as
at least 16% ethanol.
[0132] The starch to be processed in the processes of the above
aspects may in particular be obtained from tubers, roots, stems,
legumes, cereals or whole grain. More specifically the granular
starch may be obtained from corns, cobs, wheat, barley, rye, milo,
sago, cassaya, tapioca, sorghum, rice, peas, bean, banana or
potatoes. Specially contemplated are both waxy and non-waxy types
of corn and barley.
Compositions of the Invention
[0133] The invention also relates to a composition comprising the
polypeptide of the first aspect. The composition may further
comprise an enzyme selected from the group comprising of; a fungal
alpha-amylase (EC 3.2.1.1), a beta-amylase (E.C. 3.2.1.2), a
glucoamylase (E.C.3.2.1.3) and a pullulanases (E.C. 3.2.1.41). The
glucoamylase may preferably be derived from a strain of Aspergillus
sp., such as Aspergillus niger, or from a strain of Talaromyces sp.
and in particular derived from Talaromyces leycettanus such as the
glucoamylase disclosed in U.S. Pat. No. Re. 32,153, Talaromyces
duponti and/or Talaromyces thermopiles such as the glucoamylases
disclosed in U.S. Pat. No. 4,587,215 and more preferably derived
from Talaromyces emersonii. Most preferably the glucoamylase is
derived from Talaromyces emersonii strain CBS 793.97 and/or having
the sequence disclosed as SEQ ID NO: 7 in WO 99/28448. Further
preferred is a glucoamylase which has an amino acid sequence having
at least 50%, at least 60%, at least 70%, at least 80%, at least
90% or even at least 95% homology to the aforementioned amino acid
sequence. A commercial Talaromyces glucoamylase preparation is
supplied by Novozymes A/S as Spirizyme Fuel.
[0134] Also preferred for a composition comprising the polypeptide
of the first aspect and a glucoamylase are polypeptides having
glucoamylase activity which are derived from a strain of the genus
Trametes, preferably Trametes cingulata. Further preferred is
polypeptides having glucoamylase activity and havering at least
50%, at least 60%, at least 70%, at least 80%, at least 90% or even
at least 95% homology with amino acids for mature polypeptide amino
acids 1 to 575 of SEQ ID NO: 5 in U.S. Patent application
60/650,612.
[0135] Also preferred for a composition comprising the polypeptide
of the first aspect and a glucoamylase are polypeptides having
glucoamylase activity which are derived from a strain of the genus
Pachykytospora, preferably Pachykytospora papyracea. Further
preferred is polypeptides having glucoamylase activity and havering
at least 50%, at least 60%, at least 70%, at least 80%, at least
90% or even at least 95% homology with amino acids for mature
polypeptide amino acids 1 to 556 of SEQ ID NO: 2 in U.S. Patent
application 60/650,612.
[0136] The composition described above may be used for liquefying
and/or saccharifying a gelatinized or a granular starch, as well as
a partly gelatinized starch, e.g. in a production of sweetener, or
a fermentation process, such as for ethanol. A partly gelatinized
starch is a starch which to some extent is gelatinized, i.e.
wherein part of the starch has irreversibly swelled and
gelatininized and part of the starch is still present in a granular
state.
[0137] The composition described above may also comprise an acid
fungal alpha-amylase present in an amount of 0.01 to 10 AFAU/g DS,
preferably 0.1 to 5 AFAU/g DS, more preferably 0.5 to 3 AFAU/AGU,
and most preferably 0.3 to 2 AFAU/g DS. The composition may be
applied in any of the starch processes described above.
Production of Fermentation Products
[0138] From gelatinized starch: In this aspect the present
invention relates to a process for producing a fermentation
product, especially ethanol, from starch-containing material, which
process includes a liquefaction step and separately or
simultaneously performed saccharification and fermentation step(s).
The fermentation product, such as especially ethanol, may
optionally be recovered after fermentation, e.g., by distillation.
Suitable starch-containing starting materials are listed in the
section "Starch-containing materials"-section below. Contemplated
enzymes are listed in the "Enzymes"-section below. The fermentation
is preferably carried out in the presence of yeast, preferably a
strain of Saccharomyces. Suitable fermenting organisms are listed
in the "Fermenting Organisms"-section below.
[0139] A preferred process comprises a) contacting an aqueous
starch slurry with a polypeptide comprising a first amino acid
sequence having alpha-amylase activity and a second amino acid
sequence comprising a carbohydrate-binding module, b) incubating
said starch slurry with said polypeptide, c) fermenting to produce
a fermentation product, and d) optionally recovering the
fermentation product. Preferably the step b) is performed for a
time and at a temperature sufficient to achieve conversion of at
least 90% w/w of said starch substrate into fermentable sugars.
Preferably the first amino acid sequence and/or second amino acid
sequence of said polypeptide is derived from a bacterium. Said
polypeptide may preferably be the hybrid of the first aspect.
[0140] The aqueous slurry may contain from 10-40 wt-%, preferably
25-35 wt-% starch-containing material. The slurry is heated to
above the gelatinization temperature and bacterial and/or acid
fungal alpha-amylase may be added to initiate liquefaction
(thinning). The slurry may in an embodiment be jet-cooked to
further gelatinize the slurry before being subjected to an
alpha-amylase in step (a) of the invention.
[0141] More specifically liquefaction may be carried out as a
three-step hot slurry process. The slurry is heated to between
60-95.degree. C., preferably 80-85.degree. C., and alpha-amylase is
added to initiate liquefaction (thinning). Then the slurry may be
jet-cooked at a temperature between 95-140.degree. C., preferably
105-125.degree. C., for 1-15 minutes, preferably for 3-10 minute,
especially around 5 minutes. The slurry is cooled to 60-95.degree.
C. and more alpha-amylase is added to finalize hydrolysis
(secondary liquefaction). The liquefaction process is usually
carried out at pH 4.5-6.5, in particular at a pH between 5 and 6.
Milled and liquefied whole grains are known as mash.
[0142] The saccharification in step may be carried out using
conditions well know in the art. For instance, a full
saccharification process may lasts up to from about 24 to about 72
hours, however, it is common only to do a pre-saccharification of
typically 40-90 minutes at a temperature between 30-65.degree. C.,
typically about 60.degree. C., followed by complete
saccharification during fermentation in a simultaneous
saccharification and fermentation process (SSF). Saccharification
is typically carried out at temperatures from 30-65.degree. C.,
typically around 60.degree. C., and at a pH between 4 and 5,
normally at about pH 4.5.
[0143] The most widely used process in ethanol production is the
simultaneous saccharification and fermentation (SSF) process, in
which there is no holding stage for the saccharification, meaning
that fermenting organism, such as yeast, and enzyme(s) may be added
together. When doing SSF it is common to introduce a
pre-saccharification step at a temperature above 50.degree. C.,
just prior to the fermentation.
[0144] In accordance with the present invention the fermentation
step (c) includes, without limitation, fermentation processes used
to produce alcohols (e.g., ethanol, methanol, butanol); organic
acids (e.g., citric acid, acetic acid, itaconic acid, lactic acid,
gluconic acid); ketones (e.g., acetone); amino acids (e.g.,
glutamic acid); gases (e.g., H.sub.2 and CO.sub.2); antibiotics
(e.g., penicillin and tetracycline); enzymes; vitamins (e.g.,
riboflavin, B12, beta-carotene); and hormones. Preferred
fermentation processes include alcohol fermentation processes, as
are well known in the art. Preferred fermentation processes are
anaerobic fermentation processes, as are well known in the art.
[0145] From un-gelatinized starch: In this embodiment the invention
relates to processes for producing a fermentation product from
starch-containing material without gelatinization of the
starch-containing material. In one embodiment a polypeptide of the
invention, e.g. the hybrid enzyme of the first aspect, and
optionally a glucoamylase is used during saccharification and
fermentation. According to the invention the desired fermentation
product, such as ethanol, can be produced without liquefying the
aqueous slurry containing the starch-containing material. In one
embodiment a process of the invention includes saccharifying milled
starch-containing material below the initial gelatinization
temperature in the presence of the hybrid enzyme of the first
aspect and a glucoamylase to produce sugars that can be fermented
into the desired fermentation product by a suitable fermenting
organism.
[0146] A preferred process comprises a) contacting an aqueous
granular starch slurry with a polypeptide comprising a first amino
acid sequence having alpha-amylase activity and a second amino acid
sequence comprising a carbohydrate-binding module, b) incubating
said starch slurry with said polypeptide, c) fermenting to produce
a fermentation product, and d) optionally recovering the
fermentation product. Preferably the step b) is performed for a
time and at a temperature sufficient to achieve conversion of at
least 90% w/w of said starch substrate into fermentable sugars.
Preferably the first amino acid sequence and/or second amino acid
sequence of said polypeptide is derived from a bacterium. Said
polypeptide may preferably be the hybrid of the first aspect.
[0147] The term "initial gelatinization temperature" means the
lowest temperature at which gelatinization of the starch commences.
Starch heated in water begins to gelatinize between 50.degree. C.
and 75.degree. C.; the exact temperature of gelatinization depends
on the specific starch, and can readily be determined by the
skilled artisan. Thus, the initial gelatinization temperature may
vary according to the plant species, to the particular variety of
the plant species as well as with the growth conditions. In the
context of this invention the initial gelatinization temperature of
a given starch-containing material is the temperature at which
birefringence is lost in 5% of the starch granules using the method
described by Gorinstein. S. and Lii. C., Starch/Starke, Vol. 44
(12) pp. 461-466 (1992).
[0148] Before step (a) a slurry of starch-containing material, such
as granular starch, having 20-55 wt.-% dry solids, preferably 2540
wt.-% dry solids, more preferably 30-35% dry solids of
starch-containing material may be prepared. The slurry may include
water and/or process waters, such as stillage (backset), scrubber
water, evaporator condensate or distillate, side stripper water
from distillation, or other fermentation product plant process
water. Because the process of the invention is carried out below
the gelatinization temperature and thus no significant viscosity
increase takes place, high levels of stillage may be used if
desired. In an embodiment the aqueous slurry contains from about 1
to about 70 vol.-% stillage, preferably 15-60% vol.-% stillage,
especially from about 30 to 50 vol.-% stillage.
[0149] The milled starch-containing material may be prepared by
milling starch-containing material to a particle size of 0.05 to
3.0 mm, preferably 0.1-0.5 mm. After being subjected to a process
of the invention at least 85%, at least 86%, at least 87%, at least
88%, at least 89%, at least 90%, at least 91%, at least 92%, at
least 93%, at least 94%, at least 95%, at least 96%, at least 97%,
at least 98%, or preferably at least 99% of the dry solids of the
starch-containing material is converted into a soluble starch
hydrolysate.
[0150] The process of the invention is conducted at a temperature
below the initial gelatinization temperature. Preferably the
temperature at which step (a) is carried out is between
30-75.degree. C., preferably between 45-60.degree. C.
[0151] In a preferred embodiment step (a) and step (b) are carried
out as a simultaneous saccharification and fermentation process. In
such preferred embodiment the process is typically carried at a
temperature between 28.degree. C. and 36.degree. C., such as
between 29.degree. C. and 35.degree. C., such as between 30.degree.
C. and 34.degree. C., such as around 32.degree. C. According to the
invention the temperature may be adjusted up or down during
fermentation.
[0152] In an embodiment simultaneous saccharification and
fermentation is carried out so that the sugar level, such as
glucose level, is kept at a low level such as below about 3 wt.-%,
preferably below about 2 wt.-%, more preferred below about 1
wt.-%., even more preferred below about 0.5%, or even more
preferred below about 0.1 wt. %. Such low levels of sugar can be
accomplished by simply employing adjusted quantities of enzyme and
fermenting organism. A skilled person in the art can easily
determine which quantities of enzyme and fermenting organism to
use. The employed quantities of enzyme and fermenting organism may
also be selected to maintain low concentrations of maltose in the
fermentation broth. For instance, the maltose level may be kept
below about 0.5 wt.-% or below about 0.2 wt.-%.
[0153] The process of the invention may be carried out at a pH in
the range between 3 and 7, preferably from 3.5 to 6, or more
preferably from 4 to 5.
Starch-Containing Materials
[0154] Any suitable starch-containing starting material, including
granular starch, may be used according to the present invention.
The starting material is generally selected based on the desired
fermentation product. Examples of starch-containing starting
materials, suitable for use in a process of present invention,
include tubers, roots, stems, whole grains, corns, cobs, wheat,
barley, rye, milo, sago, cassaya, tapioca, sorghum, rice peas,
beans, or cereals, sugar-containing raw materials, such as
molasses, fruit materials, sugar, cane or sugar beet, potatoes, and
cellulose-containing materials, such as wood or plant residues.
Contemplated are both waxy and non-waxy types of corn and
barley.
[0155] The term "granular starch" means raw uncooked starch, i.e.,
starch in its natural form found in cereal, tubers or grains.
Starch is formed within plant cells as tiny granules insoluble in
water. When put in cold water, the starch granules may absorb a
small amount of the liquid and swell. At temperatures up to
50.degree. C. to 75.degree. C. the swelling may be reversible.
However, with higher temperatures an irreversible swelling called
"gelatinization" begins. Granular starch to be processed may be a
highly refined starch quality, preferably at least 90%, at least
95%, at least 97% or at least 99.5% pure or it may be a more crude
starch containing material comprising milled whole grain including
non-starch fractions such as germ residues and fibers. The raw
material, such as whole grain, is milled in order to open up the
structure and allowing for further processing. Two milling
processes are preferred according to the invention: wet and dry
milling. In dry milling whole kernels are milled and used. Wet
milling gives a good separation of germ and meal (starch granules
and protein) and is often applied at locations where the starch
hydrolysate is used in production of syrups. Both dry and wet
milling is well known in the art of starch processing and is
equally contemplated for the process of the invention.
[0156] The starch-containing material is milled in order to expose
more surface area. In an embodiment the particle size is between
0.05 to 3.0 mm, preferably 0.1-0.5 mm, or so that at least 30%,
preferably at least 50%, more preferably at least 70%, even more
preferably at least 90% of the milled starch-containing material
fit through a sieve with a 0.05 to 3.0 mm screen, preferably
0.1-0.5 mm screen.
Fermentation Product
[0157] The term "fermentation product" means a product produced by
a process including a fermentation step using a fermenting
organism. Fermentation products contemplated according to the
invention include alcohols (e.g., ethanol, methanol, butanol);
organic acids (e.g., citric acid, acetic acid, itaconic acid,
lactic acid, gluconic acid); ketones (e.g., acetone); amino acids
(e.g., glutamic acid); gases (e.g., H.sub.2 and CO.sub.2);
antibiotics (e.g., penicillin and tetracycline); enzymes; vitamins
(e.g., riboflavin, B.sub.12, beta-carotene); and hormones. In a
preferred embodiment the fermentation product is ethanol, e.g.,
fuel ethanol; drinking ethanol, i.e., potable neutral spirits; or
industrial ethanol or products used in the consumable alcohol
industry (e.g., beer and wine), dairy industry (e.g., fermented
dairy products), leather industry and tobacco industry. Preferred
beer types comprise ales, stouts, porters, lagers, bitters, malt
liquors, happoushu, high-alcohol beer, low-alcohol beer,
low-calorie beer or light beer. Preferred fermentation processes
used include alcohol fermentation processes, as are well known in
the art. Preferred fermentation processes are anaerobic
fermentation processes, as are well known in the art.
Fermenting Organisms
[0158] "Fermenting organism" refers to any organism, including
bacterial and fungal organisms, suitable for use in a fermentation
process and capable of producing desired a fermentation product.
Especially suitable fermenting organisms are able to ferment, i.e.,
convert, sugars, such as glucose or maltose, directly or indirectly
into the desired fermentation product. Examples of fermenting
organisms include fungal organisms, such as yeast. Preferred yeast
includes strains of Saccharomyces spp., in particular,
Saccharomyces cerevisiae.
[0159] In a preferred embodiment the fermenting organism, e.g. the
yeast, may be transformed with the polypeptide of the first aspect
and applied in a process comprising; a) contacting a starch
substrate with a fermenting organism cell transformed to express a
polypeptide comprising a first amino acid sequence having
alpha-amylase activity and a second amino acid sequence comprising
a carbohydrate-binding module; b) holding said starch substrate
with said yeast for a time and at a temperature sufficient to
achieve conversion of at least 90% w/w of said starch substrate
into fermentable sugars; c) fermenting to produce a fermentation
product, e.g., ethanol, d) optionally recovering the fermentation
product, e.g., ethanol. The steps a, b, and c are performed
separately or simultaneously. In a preferred embodiment the first
amino acid sequence and/or second amino acid sequence of said
polypeptide is derived from a bacterium.
Materials and Methods
[0160] KNU amylolytic activity: The amylolytic activity may be
determined using potato starch as substrate. This method is based
on the break-down of modified potato starch by the enzyme, and the
reaction is followed by mixing samples of the starch/enzyme
solution with an iodine solution. Initially, a blackish-blue colour
is formed, but during the break-down of the starch the blue colour
gets weaker and gradually turns into a reddish-brown, which is
compared to a coloured glass standard.
[0161] One Kilo Novo alfa Amylase Unit (KNU) is defined as the
amount of enzyme which, under standard conditions (i.e. at
37.degree. C.+/-0.05; 0.0003 M Ca.sup.2+; and pH 5.6) dextrinizes
5.26 g starch dry substance Merck Amylum solubile. A folder AF 9/6
describing this analytical method in more detail is available upon
request to Novozymes A/S, Denmark, which folder is hereby included
by reference.
[0162] Endo activity assay: Endo endo-amylase activity may be
determined using the Endo activity assay. 1 mL resuspended Phadebas
substrate (0.25 tablets/mL 50 mM sodium acetate, 1 mM CaCl.sub.2,
adjusted to pH 5.7) is incubated with 25 microL enzyme for 15 min
at 40.degree. C. with agitation. The reaction is stopped by
addition of 0.5 mL 1 M NaOH and the mixture is centrifuged in a
table centrifuge at 14,000 RPM. The absorbance of the supernatant
at 620 nm is measured. The activity is determined by comparing to a
standard with declared activity (BAN 480 L, 480 KNU/g).
[0163] Maltogenic amylase activity: One MANU (Maltogenic Amylase
Novo Unit) may be defined as the amount of enzyme required to
release one micromol of maltose per minute at a concentration of 10
mg of maltotriose (Sigma M 8378) substrate per ml of 0.1 M citrate
buffer, pH 5.0 at 37.degree. C. for 30 minutes (MANU unit further
defined in U.S. Pat. No. 6,162,628, which is hereby incorporated by
reference).
DNA Manipulations
[0164] Unless otherwise stated, DNA manipulations and
transformations were performed using standard methods of molecular
biology as described in Sambrook et al. (1989) Molecular cloning: A
laboratory manual, Cold Spring Harbor lab. Cold Spring Harbor,
N.Y.; Ausubel, F. M. et al. (eds.) "Current protocols in Molecular
Biology", John Wiley and Sons, 1995; Harwood, C. R. and Cutting, S.
M. (eds.).
EXAMPLE 1
Construction of Hybrids Between an Endo-Amylase and the CBM from
AMY1048
[0165] The amylase AMY1048 is a wild type Bacillus amylase made up
of a catalytic fragment of 484 amino acid and in addition a CBM20
fragment of 101 aa. The DNA sequence coding the AMY1048 is included
as SEQ ID NO:1 and the mature AMY1048 sequence is included as SEQ
ID NO:2. In SEQ ID NO:1 the CBM is defined as amino acid residues
485 to 586 which correspond to nucleotides 1540-1845 in SEQ ID
NO:2. The amylase including the CBM can be expressed from a
construction similar to what have been described for other amylases
i.e. e.g., inserted into a vector under the control of a
constitutive active promoter and flanked by the signal sequence
(SEQ ID NO: 15) and the terminator sequence of B. licheniformis
endo-amylase.
[0166] Replacing the catalytic fragment of the AMY1048 endo-amylase
with a catalytic domain of another endo-amylase, thus creating a
hybrid of the CBM from AMY1048 and a new endo-amylase, is made by
amplifying the DNA fragment coding the catalytic domain of the new
amylase by PCR using two oligonucleotides. The sense
oligonucleotide is in it's 5'end identical to the last 20
nucleotide of the DNA sequence coding for the signal sequence prior
the AMY1048 mature sequence and further in it's 3'end is identical
to the first 20 nucleotides of DNA sequence coding the mature part
of the desire amylase DNA. The antisense oligonucleotides are in
it's 5'end identical to the antisense DNA of the first 20
nucleotide of the DNA sequence coding the CBM from AMY1048 and
further in it's 3'end is identical to the antisense of the last 20
nucleotides of the DNA sequence coding the mature part of the
desire amylase DNA.
[0167] Both the amplified amylase DNA and the vector hosting the
AMY1048 amylase, is digested with Sac II and Sca I and the vector
and PCR fragments ligated prior to transferring into Bacillus
subtilis strain SHA273. In the primer sequences below the
recognition sites of the restriction enzymes are indicated by
underscore.
[0168] To construct a hybrid of the B. licheniformis endo-amylase
(SEQ ID NO:35) and the CBM20 from B. flavothermus amylase the
following oligonucleotides were used by the present inventors:
TABLE-US-00001 Sense:
5'-ctcattctgcagccgcggcagcaaatcttaatgggacgct-3'. (P1s SEQ ID NO:19)
Antisence: 5'-atttgggaagtagtacttattctttgaacataaattgaaa-3'. (P1as
SEQ ID NO:20)
[0169] The resulting DNA sequence coding the mature polypeptide and
the amino acid sequence of the mature polypeptide are included as
SEQ ID NO:3 and SEQ ID NO:4 respectively
[0170] To construct a hybrid of the LE429 variant of B.
licheniformis endo-amylase (SEQ ID NO:41) and the CBM20 from B.
flavothermus amylase the following oligonucleotides were used:
TABLE-US-00002 Sense:
5'-ctcattctgcagccgcggcagtaaatggcacgctgatgca-3'. (P2s SEQ ID NO:21)
Antisence: 5'-atttgggaagtagtacttatttttggaacataaattgaaa-3'. (P2as
SEQ ID NO:22)
[0171] The resulting DNA sequence coding the mature polypeptide and
the amino acid sequence of the mature polypeptide are included as
SEQ ID NO:5 and SEQ ID NO:6 respectively To construct a hybrid of
the B. Stearothermophilus endo-amylase (SEQ ID NO:36) and the CBM20
from B. flavothermus amylase the following oligonucleotides were
used: TABLE-US-00003 Sense:
5'-ctcattctgcagccgcggcagcaccgtttaacggctttaa-3'. (P3s SEQ ID NO:23)
Antisence: 5'-atttgggaagtagtacttattttaggaacccaaaccgaaa-3'. (P3as
SEQ ID NO:24)
[0172] The result DNA sequence coding the mature polypeptide and
the amino acid sequence of the mature polypeptide are included as
SEQ ID NO:7 and SEQ ID NO:8 respectively To construct a hybrid of a
variant of the alkaline Bacillus sp. SP722 endo-amylase (SEQ ID
NO:38) and the CBM20 from B. flavothermus amylase the following
oligonucleotides were used: TABLE-US-00004 Sense:
5'ctcattctgcagccgcggcacatcataatgggacaaatgg-3'. (P4s SEQ ID NO:25)
Antisence: 5'-atttgggaagtagtacttatccatttgtcccattatgatg-3'. (P4as
SEQ ID NO:26)
[0173] The resulting DNA sequence coding the mature polypeptide and
the amino acid sequence of the mature polypeptide are included as
SEQ ID NO:9 and SEQ ID NO:10 respectively.
[0174] To construct a hybrid of a variant of the alkaline Bacillus
species AA560 endo-amylase (SEQ ID NO:40) and the CBM20 from B.
flavothermus amylase the following oligonucleotides were used:
TABLE-US-00005 Sense:
5'-ctcattctgcagccgcggcacaccataatggtacgaacgg-3' (P5s SEQ ID NO:27)
Antisence: 5'-atttgggaagtagtacttattttgtttacccaaatagaaa-3' (P5as SEQ
ID NO:28)
The resulting DNA sequence coding the mature polypeptide and the
amino acid sequence of the mature polypeptide are included as SEQ
ID NO:11 and SEQ ID NO:12 respectively.
[0175] To construct a hybrid of a variant of the Bacillus
amyloliquefacience endo-amylase (SEQ ID NO:37) and the CBM20 from
B. flavothermus amylase the following oligonucleotides were used:
TABLE-US-00006 Sense:
5'-ctcattctgcagccgcggcagtaaatggcacgctgatgca-3' (P6s SEQ ID NO:29)
Antisence: 5'-atttgggaagtagtacttatttttggaacataaatggaga-3' (P6as SEQ
ID NO:30)
[0176] The resulting DNA sequence coding the mature polypeptide and
the amino acid sequence of the mature polypeptide are included as
SEQ ID NO:13 and SEQ ID NO:14 respectively.
[0177] The above described hybrid enzymes was expressed by B.
subtilis growing in shake flasks for 72 hours at and secreted into
the supernatant. The presence of hybrid enzyme in the supernatant
was demonstrated by SDS-PAGE.
EXAMPLE 2
Construction of a Hybrid Amylase with Carbohydrate Binding
Domain
[0178] The catalytic fragment of the B. flavothermus endo-amylase,
AMY1048 can further be divided into the central AB-domain harboring
the catalytic center and a C domain c-terminal to the catalytic
domain but prior to the CBM. In SEQ ID NO:2 the catalytic core
domain consist of the first 397 amino acid residues, the C domain
is defined as the amino acid residues from 398 to 484 and the CBM
is defined as amino acid residues 485 to 586. In SEQ ID NO:1 the
signal sequence is encoded by nucleotide 1 to 87, the catalytic
core domain is encoded by nucleotide 88-1278, the C domain is
encoded by the nucleotides 1279-1539, and the CBM is encoded by
nucleotide 1540-1845.
[0179] The amylase including the CBM can be expressed from a vector
construction similar to what have been described in WO0060060A2 in
example 4--i.e. the amylase gene is inserted into a vector under
the control of a amylase promoter and flanked by the signal
sequence and the terminator sequence of B. licheniformis
endo-amylase.
[0180] As an alternative to harboring the gene on a plasmid, the
cassette including the DNA sequence coding for the antibiotic
marker, promoter, signal sequence, the mature protein and the
terminator can be integrated into the genome of the B. subtilis by
homologous in-vivo crossover, by flanked upstream and downstream
genomic DNA with high similarity to a non-essential part of the B.
subtilis DNA. Useful DNA regions could be the pectate lyase or the
endo-amylase loci. In this example the AMY1048 and the hybrid is
inserted into the amylase loci in opposite direction relative to
the original B. subtilis amylase.
[0181] The catalytic core domain of the AMY1048 endo-amylase was
replaced with a catalytic core domain of the Bacillus
starothermophilus (BSG) endo-amylase, thus creating a hybrid of the
C-domain and the CBM from AMY1048 and the catalytic core domain
from the new endo-amylase.
[0182] The DNA fragment coding the catalytic core of the B.
stearothermophilus amylase (SEQ ID NO:36) was amplified by PCR
using two oligonucleotides. The sense oligonucleotides were in it's
5'end identical to the last 20 nucleotide of the DNA sequence (SEQ
ID NO:15) coding for the signal sequence prior the AMY1048 mature
sequence (SEQ ID NO:1) and further in it's 3'end identical to the
first 20 nucleotides of DNA sequence coding the mature part of the
desire amylase DNA. The antisense oligonucleotides were in its
5'end identical to the antisense DNA of the first 20 nucleotide of
the DNA sequence coding the C-domain from AMY1048 and further in
its 3'end was identical to the antisense of the last 20 nucleotides
of the DNA sequence coding the catalytic core of the BSG amylase
DNA.
[0183] To construct a hybrid of the B. stearothermophilus
endo-amylase core domain and C-domain and the CBM20 from B.
flavothermus amylase the following oligonucleotides were used by
the present inventors: TABLE-US-00007 Sense:
5'-ctcattctgcagccgcggcagcaccgtttaacggctttaa-3'. (P7s SEQ ID NO:31)
Antisence: 5'-atatagtcgtgctgtgttccgtaagcataatccctgcgcg-3'. (P7as
SEQ ID NO:32)
[0184] To facilitate genome integration, a 5 kB fragment upstream
from of the signal sequence and into the amylase genome sequence is
made by PCR using the AMY1048 genomic construction as template, and
the inverse primer of the antisense primer and the genome specific
primer: 5'-ctgcatcagggctgcggcatcc-3; (P8 SEQ ID NO:33).
[0185] Another fragment from the termination of the gene and
upstream of the genomic B. subtilis amylase is made by PCR using
the AMY1048 genomic construction as template, and the inverse
primer of the sense primer and the genome specific primer:
5'-ctgcatcagggctgcggcatcc-3'; (P9 SEQ ID NO:34).
[0186] Taking advantages of the 40 bp overlap, the three PCR
fragments were assembled by PCR and the resulting product amplified
in another PCR using the genome specific primers, prior to
transferring into Bacillus subtilis strain SHA273 (described in
WO92/11357 and WO95/10603).
[0187] The resulting DNA sequence coding the mature polypeptide and
the mature polypeptide are included as SEQ ID NO:17 and SEQ ID
NO:18 respectively.
[0188] The hybrid enzyme was expressed by B. subtilis growing in
PS1 media in shake flasks for 72 hours at 37.degree. C. and
secreted into the supernatant. The presence of hybrid enzyme in the
supernatant was demonstrated by SDS-PAGE.
EXAMPLE 3
Determination of Exo-Endo Improvement Factor (EIF)
[0189] EIF is the measure of an increment of the exo/endo ratio
relative to a parent enzyme i.e. EIF=(exo/endo of
variant)/(exo/endo of parent enzyme). An enzyme has an increase in
exo/endo ratio compared to its parent enzyme if EIF>1. EIF may
be based on one of the following methods.
[0190] EIF1 Endo activity assay: The Phadebas Amylase Test
(Pharmacia Diagnostics) is run according to the suppliers
recommenda-tions and the endo units calculated from the supplied
formula where the natural logarithm to the activ-ity equals N,
where N=A+square root [B+C*ln(Abs)]. Abs is the absorbance at 620
nm, A=-13.3235, B=243.3293, and C=26.73797
[0191] Exo activity assay: 50 microL of 50 mM sodium citrate, 5 mM
CaCl.sub.2, pH 6.5 is mixed with 25 microL of enzyme in the same
buffer and 25 microL Betamyl substrate (Betamyl Method, Megazyme)
dissolved according to suppliers recommendations. The assay mix is
incubated for 30 min. at 40.degree. C. and the reaction stopped by
adding 150 microL 4% (w/w) Trizma base
(Tris(hydroxymethyl)-aminomethane). The activity is expressed
directly as the absorbance at 420 nm measured using a microtiter
plate reader.
[0192] EIF2 Endo activity assay: 1 mL resuspended Phadebas
substrate (Pharmacia Diagnostics) (0.25 tablets/mL 50 mM so-dium
acetate, 1 mM CaCl.sub.2, adjusted to pH 5.7) is incubated with 25
microL enzyme for 15 min at 40.degree. C. with agitation. The
reaction is stopped by addition of 0.25 mL 1 M NaOH and the
mix-ture is centrifuged in a table centrifuge at 14,000 RPM. The
absorbance of the supernatant at 620 nm is measured. The activity
is determined by comparing to a standard with declared activity
(BAN 480 L, 480 KNU/g)
[0193] Exo activity assay: 900 microL 3.3% solubilized waxy maize
starch (3.3% starch is boiled in 50 mM sodium acetate, 1 mM
CaCl.sub.2, pH 5.7 for 5 min and cooled to 40.degree. C.) is
incubated with 100 microL enzyme at 40.degree. C. with stirring.
After appropriate reaction time the remaining starch is
precipitated by addition of 450 microL 4.degree. C. 96% ethanol.
The precipitate is immediately removed by centrifugation at 3000 G
for 20 min. The total carbohydrate in the supernatant is determined
by mixing 200 microL supernatant with 50 microL 2% tryptophan and
900 microL 64% sulphuric acid. The mixture is heated for 15 min at
95.degree. C. and the absorbance at 630 nm is measured after
cooling to room tem-perature. The activity is determined by
comparing with the absorbance of glucose standards in the same
assay. One unit is defined as the amount of enzyme that at initial
rates liberates 1 mg oligomeric products (products that are not
precipitated by ethanol) per min.
EXAMPLE 4
Liquefaction and Saccharification with an Endo-Amylase with a
CBM
[0194] This example illustrates the conversion of granular wheat
starch into glucose using a bacterial endo-amylase with a CBM (SEQ
ID NO:4) or the same bacterial endo-amylase without CBM (SEQ ID
NO:35) together with a glucoamylase and an acid fungal amylase. A
slurry with 33% dry solids (DS) granular starch was prepared by
adding 247.5 g of wheat starch under stirring to 502.5 ml of water.
The pH was adjusted with HCl to 4.5. The granular starch slurry was
distributed to 100 ml Erlenmeyer flasks with 75 g in each flask.
The flasks were incubated with magnetic stirring in a 60.degree. C.
water bath. At zero hours the enzyme activities given in table 1
were dosed to the flasks. Samples were withdrawn after 24, 48 and
73 and 94 hours. The enzyme levels used were endo-amylase +/-CBM
100 KNU/kg DS, glucoamylase 200 AGU/kg DS, acid fungal
alpha-amylase 50 AFAU/g DS
[0195] Total dry solids starch was determined using the following
method. The starch was completely hydrolyzed by adding an excess
amount of endo-amylase (300 KNU/Kg dry solids) and placing the
sample in an oil bath at 95.degree. C. for 45 minutes. Subsequently
the samples were cooled to 60.degree. C. and an excess amount of
glucoamylase (600 AGU/kg DS) was added followed by incubation for 2
hours at 60.degree. C.
[0196] Soluble dry solids in the starch hydrolysate were determined
by refractive index measurement on samples after filtering through
a 0.22 microM filter. The sugar profile was determined by HPLC. The
amount of glucose was calculated as DX. The results are shown in
table 2 and 3. TABLE-US-00008 TABLE 2 Soluble dry solids as
percentage of total dry substance at 100 KNU/kg DS endo-amylase
dosage. Enzyme 24 hours 48 hours 73 hours 94 hours Endo-amylase
83.7 87 89.7 90.3 Endo-amylase + CBM 87.2 89.7 91.5 92.3
[0197] TABLE-US-00009 TABLE 3 The DX of the soluble hydrolysate at
100 KNU/kg DS endo-amylase dosage. Enzyme 24 hours 48 hours 73
hours 94 hours Endo-amylase 72.0 82.0 83.8 83.8 Endo-amylase + CBM
76.7 87.0 87.5 87.5
EXAMPLE 5
Effective Dosage
[0198] The "effective dosage" of the amylase in question is defined
as the dosage resulting in a reduction in firmness of more than
10%, e.g., of between 10 and 20%, compared to the firmness of a
bread without enzymes (the control). The reduction in firmness is
measured after storage for 14 days in inert atmosphere at room
temperature.
[0199] Tolerance towards overdosing is measured by using the
Elasticity Loss Ratio=ELR. ELR is measured day 1 after baking or
later, such as day 5, day 10 or as in the example below after 14
days storage and is defined then as follows: ELR
%=(Elasticity.sub.control day 14-Elasticity.sub.amylase day
14.times.100)/Elasticity.sub.control day 14
[0200] In combination with 450 MANU/kg flour Novamyl.RTM. the
tolerance towards overdosing is measured: ELR.sub.N
%=(Elasticity.sub.Novamyl day 14-Elasticity.sub.Novamyl day
14.times.100)/Elasticity.sub.Novamyl day 14
[0201] If the amylase is overdosed the ELR and/or ELR.sub.N will be
>5%.
Baking Process
[0202] Bread are baked according to the sponge & dough
method.
[0203] Sponge, Ingredients as % on Flour Basis TABLE-US-00010 Soya
oil 2.5 SSL 0.38 Yeast 5 Wheat flour 60 Water 62
[0204] Dough, Ingredients as % on flour basis TABLE-US-00011
Ascorbic acid to be optimized for each flour ADA 20 ppm Salt 2
Sirup 7 (dry substance) Water to be optimized for each flour Wheat
flour 40 Calcium propionate + enzymes 0.25
[0205] The sponge ingredients yeast, water, flour, SSL and oil are
mixed at 90 rpm for 1 minutes, 150 rpm for 4 minutes. The sponge is
set for fermentation for 3 hours at 27.degree. C. and 86% RH. The
sponge is added the dough ingredients and mixed to a dough at 90
rpm for 1 minute and at 150 rpm for 14 minutes. The dough is scaled
into pieces of 340 g each and rested for 10 minutes.
[0206] The dough portions are sheeted and molded followed by
fermentation at 55 minutes at 42.degree. C. and 86% RH. The doughs
are baked at 225.degree. C. for 15 minutes. The baked bread are
cooled and stored until analysis.
[0207] Bread is baked with the CBM-hybrid enzyme and with the
corresponding enzyme without a CBM. The effective dose is
determined with and without addition of Novamyl.RTM. at 450 MANU/kg
flour. Firmness and elasticity of a bread are measured by the
TA.XT2 texture analyzer according to AACC method 74-09.
[0208] The effective dosage of the CBM-hybrid enzyme is determined
and a new set of bread is baked with 3 and 5 times the effective
dosage with and without addition of Novamyl.RTM. at 450 MANU/kg
flour.
[0209] The ELR is measured after 14 days of storage, and it is
found that the ELR as well as the ELR.sub.N is less than 5% for the
amylase with CBM dosed 5 times the effective dosage whereas it is
more than 5% for the corresponding enzymes without addition of the
CBM dosed 3 times the effective dose.
EXAMPLE 6
Determination of ELR for Selected Variants
[0210] Example 6 was performed as described in Example 5 except
that a dosage of 500 MANU/kg flour was used.
[0211] Two variants of a hybrid comprising the alkaline Bacillus
species M560 endo-amylase (SEQ ID NO:40) and the CBM20 from the B.
flavothermus amylase (residues 485 to 586 in SEQ ID NO:2) were
used: The variant BE1 comprising the following alterations in the
amylase sequence: R118K, D183*, G184*, N195F, R320K, R458K, N33S,
D36N, K37L, E391I, Q394R, K395D, T452Y and N484P, and the variant
BE2 comprising of the following alterations in the amylase
sequence: R118K, D183*, G184*, N195F, R320K, R458K and N484P.
TABLE-US-00012 TABLE 1 Application of hybrid-amylase (1 mg/kg
flour) without Novamyl Firmness Firmness on reduction in %
Elasticity Treatment day 15 (g) of control day 15 g/g ELR % Control
794 39.9 BE1 382 51 47.0 -17.0 BE2 313 61 46.6 -16.8
[0212] TABLE-US-00013 TABLE 2 Application of hybrid-amylase in
combination with Novamyl Firmness Firmness on reduction in % of
Elasticity ELR Treatment Day 15 (g) control day 15 g/g % Control
706 40.8 BE1 0.5 mg/kg flour 316 55 46.9 -4.5 BE1 1 mg/kg flour 239
66 47.0 -4.9 BE2 0.5 mg/kg flour 315 55 47.0 -4.9 BE2 1 mg/kg flour
225 68 47.5 -6.0 Only Novamyl .RTM. 452 44.8 500 MANU/kg flour
EXAMPLE 7
Batter Cake
[0213] Batter cake dough was prepared with hybrids BE1, BE2, the
Bacillus amylase shown in SEQ ID NO:40 (CD donor homologue) and the
Bacillus amylases SEQ ID NO:2 (CBM donor).
[0214] The dough was made from a commercial batter cake mix "Tegral
Allegro" from Puratos consisting of wheat flour, sugar, baking
powder, emulsifier (mono- and diglycerides of fatty acids). The
cake mix, enzyme (4 mg/kg flour) and water was place in a bowl and
beat with a spatula, Bear AR 5 A-Vari-mixer, at third speed until a
smooth homogeneous mixture was obtained (approximately 2 minutes).
Molds were filled with 300 g dough and baked at 180 C for 45
minutes. The baked cakes were cooled at room temperature for 30
minutes and packed in nitrogen before storage at room temperature
until analysis.
[0215] Mobility of free water was determined using low field NMR as
described by P. L. Chen, Z. Long, R. Ruan and T. P. Labuza, Nuclear
Magnetic Resonance Studies of water Mobility in bread during
Storage. Lebensmittel Wissenschaft und Technologie 30, 178-183
(1997).
[0216] Hardness and cohesiveness was measured according to the
method described in Food Texture and viscosity, 2.sup.nd edition,
Malcolm Bourne, Food Science and Technology, International Series,
Academic Press, page 182-186.
[0217] All data were measured after 14 days. The following results
were obtained: TABLE-US-00014 Treatment Hardness units Cohesiveness
units Mobility units Reference 1485 34 4148 BE1 9.5 KNU/kg 1482 35
4655 flour Amyl1 1702 35 4811 9.5 KNU/kg flour BE3 9.5 KNU/kg 1217
34 4797 flour BAN (SEQ ID 1456 32 4423 NO: 37) 9.5 KNU/kg flour
Based on the above data the following parameters (I)-(III) were
calculated: Cohesiveness reduction
%=(Cohesiveness.sub.Reference-Cohesiveness.sub.amyase).times.100%/COhesiv-
eness.sub.Reference (I)
dHardness=Hardness.sub.Reference-Hardness.sub.Amylase (II)
dMobility=Mobility.sub.Amylase-Mobility.sub.Reference (III)
TABLE-US-00015 Treatment Cohesiveness Reference Reduction %
dHardness units dMobility units BE1 9.5 KNU/kg -3 3 507 flour Amyl1
-3 -217 663 9.5 KNU/kg flour BE3 0 268 649 9.5 KNU/kg flour BAN
(SEQ ID 5.8 20 275 NO: 37) 9.5 KNU/kg flour
Amyl1 is identical to the amylase of SEQ ID NO: 40 with the
following substitutions: R118K, D183*, G184*, N195F, R320K, R458K,
N33S, D36N, K37L, E391I, Q394R, K395D, T452Y and N484P, using the
numbering of SEQ ID NO: 40.
EXAMPLE 8
Sponge and Dough
[0218] Bread were baked according to the sponge & dough method.
Bread were stored at room temperature for 14 days until analysis.
Hardness and cohesiveness was measured according to the method
described in Food Texture and viscosity, 2 edition, Malcolm Bourne,
Food Science and Technology, International Series, Academic Press,
page 182-186, and mobility of free water was determined using low
field NMR as described by P. L. Chen, Z. Long, R. Ruan and T. P.
Labuza, Nuclear Magnetic Resonance Studies of water Mobility in
bread during Storage. Lebensmittel Wissenschaft und Technologie 30,
178-183 (1997). Three amylases were used; the variants BE1 and BE3
and the Bacillus amylase SEQ ID NO:2 (CBM donor). The variant BE3
has a the catalytic domain having the amino acid sequence as shown
in SEQ.ID: 37 and comprise one or more, e.g. such as all of the
following alterations: S31A, D32N, 133L, E178*, G179*, N190F,
K3891, K392R, E393D, V508A and a CBM having the amino acid sequence
shown as amino acid residues 485 to 586 in SEQ ID NO:2.
[0219] All data were measured after 14 days. The following results
were obtained: TABLE-US-00016 Hardness Cohesiveness Mobility
Treatment units units units Reference 400 38 6435 Novamyl 272 48
6234 300 MANU/kg flour BE3 0.05 mg/kg flour + 256 48 7365 Novamyl
300 MANU/kg flour BAN (SEQ ID 207 45 7354 NO: 37) 0.05 mg/kg flour
+ Novamyl 300 MANU/kg flour BE3 0.15 mg/kg flour 223 48 6886 BE1
0.5 mg/kg flour 311 41 7152
Based on the above data the following parameters (I)-(VI) were
calculated: For treatments without Novamyl.RTM. Cohesiveness
reduction
%=(Cohesiveness.sub.Novamyl-Cohesiveness.sub.amyase+Novamyl).times.100%/C-
Ohesiveness.sub.Novamyl (IV)
dHardness=Hardness.sub.Novamyl-Hardness.sub.Amylase+Novamyl (V)
dMobility=Mobility.sub.Amylase+Novamyl-Mobility.sub.Novamyl (VI)
For treatments with Novamyl.RTM. Cohesiveness reduction
%=(Cohesiveness.sub.Reference-Cohesiveness.sub.amyase).times.100%/COhesiv-
eness.sub.Reference (I)
dHardness=Hardness.sub.Reference-Hardness.sub.Amylase (II)
dMobility=Mobility.sub.Amylase-Mobility.sub.Reference (III)
TABLE-US-00017 Cohesiveness dHardness dMobility Treatment reduction
% units units Reference Novamyl 300 MANU/kg flour BE3 0.05 mg/kg
flour + 0 16 1131 Novamyl 300 MANU/kg flour BAN (SEQ ID 6.3 65 1120
NO: 37) 0.05 mg/kg flour + Novamyl 300 MANU/kg flour BE3 0.15 mg/kg
flour -26 177 451 BE1 0.5 mg/kg flour -7.9 89 717
EXAMPLE 9
Determination of Thermostablity
[0220] The thermostability was determined at 60, 65 or 70.degree.
C. for 30 minutes in a 50 mM NaOAc, 1 mM CaCl.sub.2 buffer at pH
5.7. The samples was cooled down and the residual activity was
measured using the Phadebas method as describe in section Materials
and Methods except that the determination took place at 50.degree.
C. The residual activity (R.A.) can be calculated according to the
following equation: R.A. (%)=[Abs (heat treated)-Abs (blank)]/[Abs
(heat treated at 60 C)-Abs (blank)]*100%.
[0221] The following results were obtained:
[0222] Residual activity for Fungamyl, a well-known fungal baking
amylase from A. oryzae, and to hybrid enzymes of the invention.
TABLE-US-00018 Enzyme 60.degree. C. 65.degree. C. 70.degree. C.
Fungamyl 100 4 2 BE1 100 78 67 BE3 100 80 27
[0223]
Sequence CWU 1
1
42 1 1758 DNA Bacillus flavothermus CDS (1)..(1758) 1 gga agt gtg
ccg gta aat ggc aca atg atg caa tat ttc gaa tgg tac 48 Gly Ser Val
Pro Val Asn Gly Thr Met Met Gln Tyr Phe Glu Trp Tyr 1 5 10 15 ctt
cca gac gat gga aca cta tgg acg aaa gta gca aat aac gct caa 96 Leu
Pro Asp Asp Gly Thr Leu Trp Thr Lys Val Ala Asn Asn Ala Gln 20 25
30 tct tta gcg aat ctt ggc att act gcc ctt tgg ctt ccc cct gcc tat
144 Ser Leu Ala Asn Leu Gly Ile Thr Ala Leu Trp Leu Pro Pro Ala Tyr
35 40 45 aaa gga aca agc agc agt gac gtt gga tat ggc gtt tat gat
tta tat 192 Lys Gly Thr Ser Ser Ser Asp Val Gly Tyr Gly Val Tyr Asp
Leu Tyr 50 55 60 gac ctt gga gag ttt aat caa aaa gga act gtc cga
aca aaa tac ggg 240 Asp Leu Gly Glu Phe Asn Gln Lys Gly Thr Val Arg
Thr Lys Tyr Gly 65 70 75 80 aca aaa aca caa tat atc caa gca atc caa
gcg gcg cat aca gca ggg 288 Thr Lys Thr Gln Tyr Ile Gln Ala Ile Gln
Ala Ala His Thr Ala Gly 85 90 95 atg caa gta tat gca gat gtc gtc
ttt aac cat aaa gcc ggt gca gat 336 Met Gln Val Tyr Ala Asp Val Val
Phe Asn His Lys Ala Gly Ala Asp 100 105 110 gga aca gaa cta gtc gat
gca gta gaa gta aat cct tct gac cgc aat 384 Gly Thr Glu Leu Val Asp
Ala Val Glu Val Asn Pro Ser Asp Arg Asn 115 120 125 caa gaa ata tca
gga aca tat caa atc caa gcg tgg aca aaa ttt gat 432 Gln Glu Ile Ser
Gly Thr Tyr Gln Ile Gln Ala Trp Thr Lys Phe Asp 130 135 140 ttt cct
ggt cgt gga aac acc tat tct agt ttt aaa tgg cgt tgg tat 480 Phe Pro
Gly Arg Gly Asn Thr Tyr Ser Ser Phe Lys Trp Arg Trp Tyr 145 150 155
160 cat ttc gat gga acg gac tgg gat gag agt aga aaa cta aat cgt att
528 His Phe Asp Gly Thr Asp Trp Asp Glu Ser Arg Lys Leu Asn Arg Ile
165 170 175 tac aag ttc cgc ggc acg gga aaa gca tgg gat tgg gaa gta
gat aca 576 Tyr Lys Phe Arg Gly Thr Gly Lys Ala Trp Asp Trp Glu Val
Asp Thr 180 185 190 gaa aac ggg aat tat gac tat ctc atg tat gca gat
tta gat atg gat 624 Glu Asn Gly Asn Tyr Asp Tyr Leu Met Tyr Ala Asp
Leu Asp Met Asp 195 200 205 cat cca gag gtt gta tcc gaa cta aaa aat
tgg gga aag tgg tat gta 672 His Pro Glu Val Val Ser Glu Leu Lys Asn
Trp Gly Lys Trp Tyr Val 210 215 220 acc aca acc aat atc gac gga ttc
cgt ctg gat gca gtg aag cat att 720 Thr Thr Thr Asn Ile Asp Gly Phe
Arg Leu Asp Ala Val Lys His Ile 225 230 235 240 aaa tat agc ttt ttc
ccg gac tgg cta tcg tac gta cga acc caa aca 768 Lys Tyr Ser Phe Phe
Pro Asp Trp Leu Ser Tyr Val Arg Thr Gln Thr 245 250 255 caa aag cct
ctt ttt gcc gtt ggg gaa ttt tgg agc tat gac att agc 816 Gln Lys Pro
Leu Phe Ala Val Gly Glu Phe Trp Ser Tyr Asp Ile Ser 260 265 270 aag
ttg cac aac tat att aca aag acg aac ggc tct atg tcc cta ttc 864 Lys
Leu His Asn Tyr Ile Thr Lys Thr Asn Gly Ser Met Ser Leu Phe 275 280
285 gat gcc ccg ctg cat aac aat ttt tat ata gca tcg aaa tca ggc ggt
912 Asp Ala Pro Leu His Asn Asn Phe Tyr Ile Ala Ser Lys Ser Gly Gly
290 295 300 tat ttt gat atg cgc aca tta ctc aac aac aca ttg atg aaa
gat cag 960 Tyr Phe Asp Met Arg Thr Leu Leu Asn Asn Thr Leu Met Lys
Asp Gln 305 310 315 320 cct aca tta gca gtc aca tta gtg gat aat cac
gat act gag cca ggg 1008 Pro Thr Leu Ala Val Thr Leu Val Asp Asn
His Asp Thr Glu Pro Gly 325 330 335 caa tct ctg cag tca tgg gtc gag
cca tgg ttt aaa ccg tta gct tac 1056 Gln Ser Leu Gln Ser Trp Val
Glu Pro Trp Phe Lys Pro Leu Ala Tyr 340 345 350 gca ttt atc ttg acc
cgc caa gaa ggt tat cct tgc gtc ttt tat gga 1104 Ala Phe Ile Leu
Thr Arg Gln Glu Gly Tyr Pro Cys Val Phe Tyr Gly 355 360 365 gat tac
tat ggt att cca aaa tac aac att cct gcg ctg aaa agc aaa 1152 Asp
Tyr Tyr Gly Ile Pro Lys Tyr Asn Ile Pro Ala Leu Lys Ser Lys 370 375
380 ctt gat ccg ctg tta att gcc aga aga gat tat gcc tat gga aca cag
1200 Leu Asp Pro Leu Leu Ile Ala Arg Arg Asp Tyr Ala Tyr Gly Thr
Gln 385 390 395 400 cac gac tat att gac agt gcg gat att atc ggt tgg
acg cgg gaa gga 1248 His Asp Tyr Ile Asp Ser Ala Asp Ile Ile Gly
Trp Thr Arg Glu Gly 405 410 415 gtg gct gaa aaa gca aat tca gga ctg
gct gca ctc att acc gac ggg 1296 Val Ala Glu Lys Ala Asn Ser Gly
Leu Ala Ala Leu Ile Thr Asp Gly 420 425 430 cct ggc gga agc aaa tgg
atg tat gtt gga aaa caa cac gct ggc aaa 1344 Pro Gly Gly Ser Lys
Trp Met Tyr Val Gly Lys Gln His Ala Gly Lys 435 440 445 acg ttt tat
gat tta acc ggc aat cga agt gat aca gtg aca atc aat 1392 Thr Phe
Tyr Asp Leu Thr Gly Asn Arg Ser Asp Thr Val Thr Ile Asn 450 455 460
gct gat gga tgg gga gaa ttt aaa gtc aat gga ggg tct gta tcc ata
1440 Ala Asp Gly Trp Gly Glu Phe Lys Val Asn Gly Gly Ser Val Ser
Ile 465 470 475 480 tgg gtt cca aaa ata agt act act tcc caa ata aca
ttt act gta aat 1488 Trp Val Pro Lys Ile Ser Thr Thr Ser Gln Ile
Thr Phe Thr Val Asn 485 490 495 aac gcc aca acc gtt tgg gga caa aat
gta tac gtt gtc ggg aat att 1536 Asn Ala Thr Thr Val Trp Gly Gln
Asn Val Tyr Val Val Gly Asn Ile 500 505 510 tcg cag ctg ggg aac tgg
gat cca gtc cac gca gtt caa atg acg ccg 1584 Ser Gln Leu Gly Asn
Trp Asp Pro Val His Ala Val Gln Met Thr Pro 515 520 525 tct tct tat
cca aca tgg act gta aca atc cct ctt ctt caa ggg caa 1632 Ser Ser
Tyr Pro Thr Trp Thr Val Thr Ile Pro Leu Leu Gln Gly Gln 530 535 540
aac ata caa ttt aaa ttt atc aaa aaa gat tca gct gga aat gtc att
1680 Asn Ile Gln Phe Lys Phe Ile Lys Lys Asp Ser Ala Gly Asn Val
Ile 545 550 555 560 tgg gaa gat ata tcg aat cga aca tac acc gtc cca
act gct gca tcc 1728 Trp Glu Asp Ile Ser Asn Arg Thr Tyr Thr Val
Pro Thr Ala Ala Ser 565 570 575 gga gca tat aca gcc agc tgg aac gtg
ccc 1758 Gly Ala Tyr Thr Ala Ser Trp Asn Val Pro 580 585 2 586 PRT
Bacillus flavothermus 2 Gly Ser Val Pro Val Asn Gly Thr Met Met Gln
Tyr Phe Glu Trp Tyr 1 5 10 15 Leu Pro Asp Asp Gly Thr Leu Trp Thr
Lys Val Ala Asn Asn Ala Gln 20 25 30 Ser Leu Ala Asn Leu Gly Ile
Thr Ala Leu Trp Leu Pro Pro Ala Tyr 35 40 45 Lys Gly Thr Ser Ser
Ser Asp Val Gly Tyr Gly Val Tyr Asp Leu Tyr 50 55 60 Asp Leu Gly
Glu Phe Asn Gln Lys Gly Thr Val Arg Thr Lys Tyr Gly 65 70 75 80 Thr
Lys Thr Gln Tyr Ile Gln Ala Ile Gln Ala Ala His Thr Ala Gly 85 90
95 Met Gln Val Tyr Ala Asp Val Val Phe Asn His Lys Ala Gly Ala Asp
100 105 110 Gly Thr Glu Leu Val Asp Ala Val Glu Val Asn Pro Ser Asp
Arg Asn 115 120 125 Gln Glu Ile Ser Gly Thr Tyr Gln Ile Gln Ala Trp
Thr Lys Phe Asp 130 135 140 Phe Pro Gly Arg Gly Asn Thr Tyr Ser Ser
Phe Lys Trp Arg Trp Tyr 145 150 155 160 His Phe Asp Gly Thr Asp Trp
Asp Glu Ser Arg Lys Leu Asn Arg Ile 165 170 175 Tyr Lys Phe Arg Gly
Thr Gly Lys Ala Trp Asp Trp Glu Val Asp Thr 180 185 190 Glu Asn Gly
Asn Tyr Asp Tyr Leu Met Tyr Ala Asp Leu Asp Met Asp 195 200 205 His
Pro Glu Val Val Ser Glu Leu Lys Asn Trp Gly Lys Trp Tyr Val 210 215
220 Thr Thr Thr Asn Ile Asp Gly Phe Arg Leu Asp Ala Val Lys His Ile
225 230 235 240 Lys Tyr Ser Phe Phe Pro Asp Trp Leu Ser Tyr Val Arg
Thr Gln Thr 245 250 255 Gln Lys Pro Leu Phe Ala Val Gly Glu Phe Trp
Ser Tyr Asp Ile Ser 260 265 270 Lys Leu His Asn Tyr Ile Thr Lys Thr
Asn Gly Ser Met Ser Leu Phe 275 280 285 Asp Ala Pro Leu His Asn Asn
Phe Tyr Ile Ala Ser Lys Ser Gly Gly 290 295 300 Tyr Phe Asp Met Arg
Thr Leu Leu Asn Asn Thr Leu Met Lys Asp Gln 305 310 315 320 Pro Thr
Leu Ala Val Thr Leu Val Asp Asn His Asp Thr Glu Pro Gly 325 330 335
Gln Ser Leu Gln Ser Trp Val Glu Pro Trp Phe Lys Pro Leu Ala Tyr 340
345 350 Ala Phe Ile Leu Thr Arg Gln Glu Gly Tyr Pro Cys Val Phe Tyr
Gly 355 360 365 Asp Tyr Tyr Gly Ile Pro Lys Tyr Asn Ile Pro Ala Leu
Lys Ser Lys 370 375 380 Leu Asp Pro Leu Leu Ile Ala Arg Arg Asp Tyr
Ala Tyr Gly Thr Gln 385 390 395 400 His Asp Tyr Ile Asp Ser Ala Asp
Ile Ile Gly Trp Thr Arg Glu Gly 405 410 415 Val Ala Glu Lys Ala Asn
Ser Gly Leu Ala Ala Leu Ile Thr Asp Gly 420 425 430 Pro Gly Gly Ser
Lys Trp Met Tyr Val Gly Lys Gln His Ala Gly Lys 435 440 445 Thr Phe
Tyr Asp Leu Thr Gly Asn Arg Ser Asp Thr Val Thr Ile Asn 450 455 460
Ala Asp Gly Trp Gly Glu Phe Lys Val Asn Gly Gly Ser Val Ser Ile 465
470 475 480 Trp Val Pro Lys Ile Ser Thr Thr Ser Gln Ile Thr Phe Thr
Val Asn 485 490 495 Asn Ala Thr Thr Val Trp Gly Gln Asn Val Tyr Val
Val Gly Asn Ile 500 505 510 Ser Gln Leu Gly Asn Trp Asp Pro Val His
Ala Val Gln Met Thr Pro 515 520 525 Ser Ser Tyr Pro Thr Trp Thr Val
Thr Ile Pro Leu Leu Gln Gly Gln 530 535 540 Asn Ile Gln Phe Lys Phe
Ile Lys Lys Asp Ser Ala Gly Asn Val Ile 545 550 555 560 Trp Glu Asp
Ile Ser Asn Arg Thr Tyr Thr Val Pro Thr Ala Ala Ser 565 570 575 Gly
Ala Tyr Thr Ala Ser Trp Asn Val Pro 580 585 3 1755 DNA BLA-CBM
hybrid CDS (1)..(1755) 3 gca aat ctt aat ggg acg ctg atg cag tat
ttt gaa tgg tac atg ccc 48 Ala Asn Leu Asn Gly Thr Leu Met Gln Tyr
Phe Glu Trp Tyr Met Pro 1 5 10 15 aat gac ggc caa cat tgg agg cgt
ttg caa aac gac tcg gca tat ttg 96 Asn Asp Gly Gln His Trp Arg Arg
Leu Gln Asn Asp Ser Ala Tyr Leu 20 25 30 gct gaa cac ggt att act
gcc gtc tgg att ccc ccg gca tat aag gga 144 Ala Glu His Gly Ile Thr
Ala Val Trp Ile Pro Pro Ala Tyr Lys Gly 35 40 45 acg agc caa gcg
gat gtg ggc tac ggt gct tac gac ctt tat gat tta 192 Thr Ser Gln Ala
Asp Val Gly Tyr Gly Ala Tyr Asp Leu Tyr Asp Leu 50 55 60 ggg gag
ttt cat caa aaa ggg acg gtt cgg aca aag tac ggc aca aaa 240 Gly Glu
Phe His Gln Lys Gly Thr Val Arg Thr Lys Tyr Gly Thr Lys 65 70 75 80
gga gag ctg caa tct gcg atc aaa agt ctt cat tcc cgc gac att aac 288
Gly Glu Leu Gln Ser Ala Ile Lys Ser Leu His Ser Arg Asp Ile Asn 85
90 95 gtt tac ggg gat gtg gtc atc aac cac aaa ggc ggc gct gat gcg
acc 336 Val Tyr Gly Asp Val Val Ile Asn His Lys Gly Gly Ala Asp Ala
Thr 100 105 110 gaa gat gta acc gcg gtt gaa gtc gat ccc gct gac cgc
aac cgc gta 384 Glu Asp Val Thr Ala Val Glu Val Asp Pro Ala Asp Arg
Asn Arg Val 115 120 125 att tca gga gaa cac cta att aaa gcc tgg aca
cat ttt cat ttt ccg 432 Ile Ser Gly Glu His Leu Ile Lys Ala Trp Thr
His Phe His Phe Pro 130 135 140 ggg cgc ggc agc aca tac agc gat ttt
aaa tgg cat tgg tac cat ttt 480 Gly Arg Gly Ser Thr Tyr Ser Asp Phe
Lys Trp His Trp Tyr His Phe 145 150 155 160 gac gga acc gat tgg gac
gag tcc cga aag ctg aac cgc atc tat aag 528 Asp Gly Thr Asp Trp Asp
Glu Ser Arg Lys Leu Asn Arg Ile Tyr Lys 165 170 175 ttt caa gga aag
gct tgg gat tgg gaa gtt tcc aat gaa aac ggc aac 576 Phe Gln Gly Lys
Ala Trp Asp Trp Glu Val Ser Asn Glu Asn Gly Asn 180 185 190 tat gat
tat ttg atg tat gcc gac atc gat tat gac cat cct gat gtc 624 Tyr Asp
Tyr Leu Met Tyr Ala Asp Ile Asp Tyr Asp His Pro Asp Val 195 200 205
gca gca gaa att aag aga tgg ggc act tgg tat gcc aat gaa ctg caa 672
Ala Ala Glu Ile Lys Arg Trp Gly Thr Trp Tyr Ala Asn Glu Leu Gln 210
215 220 ttg gac ggt ttc cgt ctt gat gct gtc aaa cac att aaa ttt tct
ttt 720 Leu Asp Gly Phe Arg Leu Asp Ala Val Lys His Ile Lys Phe Ser
Phe 225 230 235 240 ttg cgg gat tgg gtt aat cat gtc agg gaa aaa acg
ggg aag gaa atg 768 Leu Arg Asp Trp Val Asn His Val Arg Glu Lys Thr
Gly Lys Glu Met 245 250 255 ttt acg gta gct gaa tat tgg cag aat gac
ttg ggc gcg ctg gaa aac 816 Phe Thr Val Ala Glu Tyr Trp Gln Asn Asp
Leu Gly Ala Leu Glu Asn 260 265 270 tat ttg aac aaa aca aat ttt aat
cat tca gtg ttt gac gtg ccg ctt 864 Tyr Leu Asn Lys Thr Asn Phe Asn
His Ser Val Phe Asp Val Pro Leu 275 280 285 cat tat cag ttc cat gct
gca tcg aca cag gga ggc ggc tat gat atg 912 His Tyr Gln Phe His Ala
Ala Ser Thr Gln Gly Gly Gly Tyr Asp Met 290 295 300 agg aaa ttg ctg
aac ggt acg gtc gtt tcc aag cat ccg ttg aaa tcg 960 Arg Lys Leu Leu
Asn Gly Thr Val Val Ser Lys His Pro Leu Lys Ser 305 310 315 320 gtt
aca ttt gtc gat aac cat gat aca cag ccg ggg caa tcg ctt gag 1008
Val Thr Phe Val Asp Asn His Asp Thr Gln Pro Gly Gln Ser Leu Glu 325
330 335 tcg act gtc caa aca tgg ttt aag ccg ctt gct tac gct ttt att
ctc 1056 Ser Thr Val Gln Thr Trp Phe Lys Pro Leu Ala Tyr Ala Phe
Ile Leu 340 345 350 aca agg gaa tct gga tac cct cag gtt ttc tac ggg
gat atg tac ggg 1104 Thr Arg Glu Ser Gly Tyr Pro Gln Val Phe Tyr
Gly Asp Met Tyr Gly 355 360 365 acg aaa gga gac tcc cag cgc gaa att
cct gcc ttg aaa cac aaa att 1152 Thr Lys Gly Asp Ser Gln Arg Glu
Ile Pro Ala Leu Lys His Lys Ile 370 375 380 gaa ccg atc tta aaa gcg
aga aaa cag tat gcg tac gga gca cag cat 1200 Glu Pro Ile Leu Lys
Ala Arg Lys Gln Tyr Ala Tyr Gly Ala Gln His 385 390 395 400 gat tat
ttc gac cac cat gac att gtc ggc tgg aca agg gaa ggc gac 1248 Asp
Tyr Phe Asp His His Asp Ile Val Gly Trp Thr Arg Glu Gly Asp 405 410
415 agc tcg gtt gca aat tca ggt ttg gcg gca tta ata aca gac gga ccc
1296 Ser Ser Val Ala Asn Ser Gly Leu Ala Ala Leu Ile Thr Asp Gly
Pro 420 425 430 ggt ggg gca aag cga atg tat gtc ggc cgg caa aac gcc
ggt gag aca 1344 Gly Gly Ala Lys Arg Met Tyr Val Gly Arg Gln Asn
Ala Gly Glu Thr 435 440 445 tgg cat gac att acc gga aac cgt tcg gag
ccg gtt gtc atc aat tcg 1392 Trp His Asp Ile Thr Gly Asn Arg Ser
Glu Pro Val Val Ile Asn Ser 450 455 460 gaa ggc tgg gga gag ttt cac
gta aac ggc ggg tcg gtt tca att tat 1440 Glu Gly Trp Gly Glu Phe
His Val Asn Gly Gly Ser Val Ser Ile Tyr 465 470 475 480 gtt caa aga
ata agt act act tcc caa ata aca ttt act gta aat aac 1488 Val Gln
Arg Ile Ser Thr Thr Ser Gln Ile Thr Phe Thr Val Asn Asn 485 490 495
gcc aca acc gtt tgg gga caa aat gta tac gtt gtc ggg aat att tcg
1536 Ala Thr Thr Val Trp Gly Gln Asn Val Tyr Val Val Gly Asn Ile
Ser 500 505 510 cag ctg ggg aac tgg gat cca gtc cac gca gtt caa atg
acg ccg tct 1584 Gln Leu Gly Asn Trp Asp Pro Val His Ala Val Gln
Met Thr Pro Ser 515 520 525 tct tat cca aca tgg act gta aca atc cct
ctt ctt caa ggg caa aac 1632 Ser Tyr Pro Thr Trp Thr Val Thr Ile
Pro Leu Leu Gln Gly Gln Asn 530 535 540 ata caa ttt aaa ttt atc aaa
aaa gat tca gct gga aat gtc att tgg 1680 Ile Gln Phe Lys Phe Ile
Lys Lys Asp Ser Ala Gly Asn Val Ile Trp 545 550 555
560 gaa gat ata tcg aat cga aca tac acc gtc cca act gct gca tcc gga
1728 Glu Asp Ile Ser Asn Arg Thr Tyr Thr Val Pro Thr Ala Ala Ser
Gly 565 570 575 gca tat aca gcc agc tgg aac gtg ccc 1755 Ala Tyr
Thr Ala Ser Trp Asn Val Pro 580 585 4 585 PRT BLA-CBM hybrid 4 Ala
Asn Leu Asn Gly Thr Leu Met Gln Tyr Phe Glu Trp Tyr Met Pro 1 5 10
15 Asn Asp Gly Gln His Trp Arg Arg Leu Gln Asn Asp Ser Ala Tyr Leu
20 25 30 Ala Glu His Gly Ile Thr Ala Val Trp Ile Pro Pro Ala Tyr
Lys Gly 35 40 45 Thr Ser Gln Ala Asp Val Gly Tyr Gly Ala Tyr Asp
Leu Tyr Asp Leu 50 55 60 Gly Glu Phe His Gln Lys Gly Thr Val Arg
Thr Lys Tyr Gly Thr Lys 65 70 75 80 Gly Glu Leu Gln Ser Ala Ile Lys
Ser Leu His Ser Arg Asp Ile Asn 85 90 95 Val Tyr Gly Asp Val Val
Ile Asn His Lys Gly Gly Ala Asp Ala Thr 100 105 110 Glu Asp Val Thr
Ala Val Glu Val Asp Pro Ala Asp Arg Asn Arg Val 115 120 125 Ile Ser
Gly Glu His Leu Ile Lys Ala Trp Thr His Phe His Phe Pro 130 135 140
Gly Arg Gly Ser Thr Tyr Ser Asp Phe Lys Trp His Trp Tyr His Phe 145
150 155 160 Asp Gly Thr Asp Trp Asp Glu Ser Arg Lys Leu Asn Arg Ile
Tyr Lys 165 170 175 Phe Gln Gly Lys Ala Trp Asp Trp Glu Val Ser Asn
Glu Asn Gly Asn 180 185 190 Tyr Asp Tyr Leu Met Tyr Ala Asp Ile Asp
Tyr Asp His Pro Asp Val 195 200 205 Ala Ala Glu Ile Lys Arg Trp Gly
Thr Trp Tyr Ala Asn Glu Leu Gln 210 215 220 Leu Asp Gly Phe Arg Leu
Asp Ala Val Lys His Ile Lys Phe Ser Phe 225 230 235 240 Leu Arg Asp
Trp Val Asn His Val Arg Glu Lys Thr Gly Lys Glu Met 245 250 255 Phe
Thr Val Ala Glu Tyr Trp Gln Asn Asp Leu Gly Ala Leu Glu Asn 260 265
270 Tyr Leu Asn Lys Thr Asn Phe Asn His Ser Val Phe Asp Val Pro Leu
275 280 285 His Tyr Gln Phe His Ala Ala Ser Thr Gln Gly Gly Gly Tyr
Asp Met 290 295 300 Arg Lys Leu Leu Asn Gly Thr Val Val Ser Lys His
Pro Leu Lys Ser 305 310 315 320 Val Thr Phe Val Asp Asn His Asp Thr
Gln Pro Gly Gln Ser Leu Glu 325 330 335 Ser Thr Val Gln Thr Trp Phe
Lys Pro Leu Ala Tyr Ala Phe Ile Leu 340 345 350 Thr Arg Glu Ser Gly
Tyr Pro Gln Val Phe Tyr Gly Asp Met Tyr Gly 355 360 365 Thr Lys Gly
Asp Ser Gln Arg Glu Ile Pro Ala Leu Lys His Lys Ile 370 375 380 Glu
Pro Ile Leu Lys Ala Arg Lys Gln Tyr Ala Tyr Gly Ala Gln His 385 390
395 400 Asp Tyr Phe Asp His His Asp Ile Val Gly Trp Thr Arg Glu Gly
Asp 405 410 415 Ser Ser Val Ala Asn Ser Gly Leu Ala Ala Leu Ile Thr
Asp Gly Pro 420 425 430 Gly Gly Ala Lys Arg Met Tyr Val Gly Arg Gln
Asn Ala Gly Glu Thr 435 440 445 Trp His Asp Ile Thr Gly Asn Arg Ser
Glu Pro Val Val Ile Asn Ser 450 455 460 Glu Gly Trp Gly Glu Phe His
Val Asn Gly Gly Ser Val Ser Ile Tyr 465 470 475 480 Val Gln Arg Ile
Ser Thr Thr Ser Gln Ile Thr Phe Thr Val Asn Asn 485 490 495 Ala Thr
Thr Val Trp Gly Gln Asn Val Tyr Val Val Gly Asn Ile Ser 500 505 510
Gln Leu Gly Asn Trp Asp Pro Val His Ala Val Gln Met Thr Pro Ser 515
520 525 Ser Tyr Pro Thr Trp Thr Val Thr Ile Pro Leu Leu Gln Gly Gln
Asn 530 535 540 Ile Gln Phe Lys Phe Ile Lys Lys Asp Ser Ala Gly Asn
Val Ile Trp 545 550 555 560 Glu Asp Ile Ser Asn Arg Thr Tyr Thr Val
Pro Thr Ala Ala Ser Gly 565 570 575 Ala Tyr Thr Ala Ser Trp Asn Val
Pro 580 585 5 1749 DNA Bacillus licheniformis-CBM CDS (1)..(1749) 5
gta aat ggc acg ctg atg cag tat ttt gaa tgg tat acg ccg aac gac 48
Val Asn Gly Thr Leu Met Gln Tyr Phe Glu Trp Tyr Thr Pro Asn Asp 1 5
10 15 ggc cag cat tgg aaa cga ttg cag aat gat gcg gaa cat tta tcg
gat 96 Gly Gln His Trp Lys Arg Leu Gln Asn Asp Ala Glu His Leu Ser
Asp 20 25 30 atc ggt att act gcc gtc tgg att ccc ccg gca tat aag
gga acg agc 144 Ile Gly Ile Thr Ala Val Trp Ile Pro Pro Ala Tyr Lys
Gly Thr Ser 35 40 45 caa gcg gat gtg ggc tac ggt gct tac gac ctt
tat gat tta ggg gag 192 Gln Ala Asp Val Gly Tyr Gly Ala Tyr Asp Leu
Tyr Asp Leu Gly Glu 50 55 60 ttt cat caa aaa ggg acg gtt cgg aca
aag tac ggc aca aaa gga gag 240 Phe His Gln Lys Gly Thr Val Arg Thr
Lys Tyr Gly Thr Lys Gly Glu 65 70 75 80 ctg caa tct gcg atc aaa agt
ctt cat tcc cgc gac att aac gtt tac 288 Leu Gln Ser Ala Ile Lys Ser
Leu His Ser Arg Asp Ile Asn Val Tyr 85 90 95 ggg gat gtg gtc atc
aac cac aaa ggc ggc gct gat gcg acc gaa gat 336 Gly Asp Val Val Ile
Asn His Lys Gly Gly Ala Asp Ala Thr Glu Asp 100 105 110 gta acc gcg
gtt gaa gtc gat ccc gct gac cgc aac cgc gta att tca 384 Val Thr Ala
Val Glu Val Asp Pro Ala Asp Arg Asn Arg Val Ile Ser 115 120 125 gga
gaa cac cta att aaa gcc tgg aca cat ttt cat ttt ccg ggg cgc 432 Gly
Glu His Leu Ile Lys Ala Trp Thr His Phe His Phe Pro Gly Arg 130 135
140 ggc agc aca tac agc gat ttt aag tgg tat tgg tac cat ttt gac gga
480 Gly Ser Thr Tyr Ser Asp Phe Lys Trp Tyr Trp Tyr His Phe Asp Gly
145 150 155 160 acc gat tgg gac gag tcc cga aag ctg aac cgc atc tat
aag ttt caa 528 Thr Asp Trp Asp Glu Ser Arg Lys Leu Asn Arg Ile Tyr
Lys Phe Gln 165 170 175 ggg aag act tgg gat tgg gaa gtt tcc aat gaa
ttc ggc aac tat gat 576 Gly Lys Thr Trp Asp Trp Glu Val Ser Asn Glu
Phe Gly Asn Tyr Asp 180 185 190 tat ttg atg tat gcc gac ttt gat tat
gac cat cct gat gtc gta gca 624 Tyr Leu Met Tyr Ala Asp Phe Asp Tyr
Asp His Pro Asp Val Val Ala 195 200 205 gag att aag aga tgg ggc act
tgg tat gcc aat gaa ctg caa ttg gac 672 Glu Ile Lys Arg Trp Gly Thr
Trp Tyr Ala Asn Glu Leu Gln Leu Asp 210 215 220 ggt ttc cgt ctt gat
gct gtc aaa cac att aaa ttt tct ttt ttg cgg 720 Gly Phe Arg Leu Asp
Ala Val Lys His Ile Lys Phe Ser Phe Leu Arg 225 230 235 240 gat tgg
gtt aat cat gtc agg gaa aaa acg ggg aag gaa atg ttt acg 768 Asp Trp
Val Asn His Val Arg Glu Lys Thr Gly Lys Glu Met Phe Thr 245 250 255
gta gct gag tac tgg tcg aat gac ttg ggc gcg ctg gaa aac tat ttg 816
Val Ala Glu Tyr Trp Ser Asn Asp Leu Gly Ala Leu Glu Asn Tyr Leu 260
265 270 aac aaa aca aat ttt aat cat tca gtg ttt gac gtg ccg ctt cat
tat 864 Asn Lys Thr Asn Phe Asn His Ser Val Phe Asp Val Pro Leu His
Tyr 275 280 285 cag ttc cat gct gca tcg aca cag gga ggc ggc tat gat
atg agg aaa 912 Gln Phe His Ala Ala Ser Thr Gln Gly Gly Gly Tyr Asp
Met Arg Lys 290 295 300 ttg ctg aac ggt acg gtc gtt tcc aag cat ccg
ttg aaa tcg gtt aca 960 Leu Leu Asn Gly Thr Val Val Ser Lys His Pro
Leu Lys Ser Val Thr 305 310 315 320 ttt gtc gat aac cat gat aca cag
ccg ggg caa tcg ctt gag tcg act 1008 Phe Val Asp Asn His Asp Thr
Gln Pro Gly Gln Ser Leu Glu Ser Thr 325 330 335 gtc caa aca tgg ttt
aag ccg ctt gct tac gct ttt att ctc aca agg 1056 Val Gln Thr Trp
Phe Lys Pro Leu Ala Tyr Ala Phe Ile Leu Thr Arg 340 345 350 gaa tct
gga tac cct cag gtt ttc tac ggg gat atg tac ggg acg aaa 1104 Glu
Ser Gly Tyr Pro Gln Val Phe Tyr Gly Asp Met Tyr Gly Thr Lys 355 360
365 gga gac tcc cag cgc gaa att cct gcc ttg aaa cac aaa att gaa ccg
1152 Gly Asp Ser Gln Arg Glu Ile Pro Ala Leu Lys His Lys Ile Glu
Pro 370 375 380 atc tta aaa gcg aga aaa cag tat gcg tac gga gca cag
cat gat tat 1200 Ile Leu Lys Ala Arg Lys Gln Tyr Ala Tyr Gly Ala
Gln His Asp Tyr 385 390 395 400 ttc gac cac cat gac att gtc ggc tgg
aca agg gaa ggc gac agc tcg 1248 Phe Asp His His Asp Ile Val Gly
Trp Thr Arg Glu Gly Asp Ser Ser 405 410 415 gtt gca aat tca ggt ttg
gcg gca tta ata aca gac gga ccc ggt ggg 1296 Val Ala Asn Ser Gly
Leu Ala Ala Leu Ile Thr Asp Gly Pro Gly Gly 420 425 430 gca aag cga
atg tat gtc ggc cgg caa aac gcc ggt gag aca tgg cat 1344 Ala Lys
Arg Met Tyr Val Gly Arg Gln Asn Ala Gly Glu Thr Trp His 435 440 445
gac att acc gga aac cgt tcg gag ccg gtt gtc atc aat tcg gaa ggc
1392 Asp Ile Thr Gly Asn Arg Ser Glu Pro Val Val Ile Asn Ser Glu
Gly 450 455 460 tgg gga gag ttt cac gta aac ggc ggg tcg gtt tca att
tat gtt cca 1440 Trp Gly Glu Phe His Val Asn Gly Gly Ser Val Ser
Ile Tyr Val Pro 465 470 475 480 aaa ata agt act act tcc caa ata aca
ttt act gta aat aac gcc aca 1488 Lys Ile Ser Thr Thr Ser Gln Ile
Thr Phe Thr Val Asn Asn Ala Thr 485 490 495 acc gtt tgg gga caa aat
gta tac gtt gtc ggg aat att tcg cag ctg 1536 Thr Val Trp Gly Gln
Asn Val Tyr Val Val Gly Asn Ile Ser Gln Leu 500 505 510 ggg aac tgg
gat cca gtc cac gca gtt caa atg acg ccg tct tct tat 1584 Gly Asn
Trp Asp Pro Val His Ala Val Gln Met Thr Pro Ser Ser Tyr 515 520 525
cca aca tgg act gta aca atc cct ctt ctt caa ggg caa aac ata caa
1632 Pro Thr Trp Thr Val Thr Ile Pro Leu Leu Gln Gly Gln Asn Ile
Gln 530 535 540 ttt aaa ttt atc aaa aaa gat tca gct gga aat gtc att
tgg gaa gat 1680 Phe Lys Phe Ile Lys Lys Asp Ser Ala Gly Asn Val
Ile Trp Glu Asp 545 550 555 560 ata tcg aat cga aca tac acc gtc cca
act gct gca tcc gga gca tat 1728 Ile Ser Asn Arg Thr Tyr Thr Val
Pro Thr Ala Ala Ser Gly Ala Tyr 565 570 575 aca gcc agc tgg aac gtg
ccc 1749 Thr Ala Ser Trp Asn Val Pro 580 6 583 PRT Bacillus
licheniformis-CBM 6 Val Asn Gly Thr Leu Met Gln Tyr Phe Glu Trp Tyr
Thr Pro Asn Asp 1 5 10 15 Gly Gln His Trp Lys Arg Leu Gln Asn Asp
Ala Glu His Leu Ser Asp 20 25 30 Ile Gly Ile Thr Ala Val Trp Ile
Pro Pro Ala Tyr Lys Gly Thr Ser 35 40 45 Gln Ala Asp Val Gly Tyr
Gly Ala Tyr Asp Leu Tyr Asp Leu Gly Glu 50 55 60 Phe His Gln Lys
Gly Thr Val Arg Thr Lys Tyr Gly Thr Lys Gly Glu 65 70 75 80 Leu Gln
Ser Ala Ile Lys Ser Leu His Ser Arg Asp Ile Asn Val Tyr 85 90 95
Gly Asp Val Val Ile Asn His Lys Gly Gly Ala Asp Ala Thr Glu Asp 100
105 110 Val Thr Ala Val Glu Val Asp Pro Ala Asp Arg Asn Arg Val Ile
Ser 115 120 125 Gly Glu His Leu Ile Lys Ala Trp Thr His Phe His Phe
Pro Gly Arg 130 135 140 Gly Ser Thr Tyr Ser Asp Phe Lys Trp Tyr Trp
Tyr His Phe Asp Gly 145 150 155 160 Thr Asp Trp Asp Glu Ser Arg Lys
Leu Asn Arg Ile Tyr Lys Phe Gln 165 170 175 Gly Lys Thr Trp Asp Trp
Glu Val Ser Asn Glu Phe Gly Asn Tyr Asp 180 185 190 Tyr Leu Met Tyr
Ala Asp Phe Asp Tyr Asp His Pro Asp Val Val Ala 195 200 205 Glu Ile
Lys Arg Trp Gly Thr Trp Tyr Ala Asn Glu Leu Gln Leu Asp 210 215 220
Gly Phe Arg Leu Asp Ala Val Lys His Ile Lys Phe Ser Phe Leu Arg 225
230 235 240 Asp Trp Val Asn His Val Arg Glu Lys Thr Gly Lys Glu Met
Phe Thr 245 250 255 Val Ala Glu Tyr Trp Ser Asn Asp Leu Gly Ala Leu
Glu Asn Tyr Leu 260 265 270 Asn Lys Thr Asn Phe Asn His Ser Val Phe
Asp Val Pro Leu His Tyr 275 280 285 Gln Phe His Ala Ala Ser Thr Gln
Gly Gly Gly Tyr Asp Met Arg Lys 290 295 300 Leu Leu Asn Gly Thr Val
Val Ser Lys His Pro Leu Lys Ser Val Thr 305 310 315 320 Phe Val Asp
Asn His Asp Thr Gln Pro Gly Gln Ser Leu Glu Ser Thr 325 330 335 Val
Gln Thr Trp Phe Lys Pro Leu Ala Tyr Ala Phe Ile Leu Thr Arg 340 345
350 Glu Ser Gly Tyr Pro Gln Val Phe Tyr Gly Asp Met Tyr Gly Thr Lys
355 360 365 Gly Asp Ser Gln Arg Glu Ile Pro Ala Leu Lys His Lys Ile
Glu Pro 370 375 380 Ile Leu Lys Ala Arg Lys Gln Tyr Ala Tyr Gly Ala
Gln His Asp Tyr 385 390 395 400 Phe Asp His His Asp Ile Val Gly Trp
Thr Arg Glu Gly Asp Ser Ser 405 410 415 Val Ala Asn Ser Gly Leu Ala
Ala Leu Ile Thr Asp Gly Pro Gly Gly 420 425 430 Ala Lys Arg Met Tyr
Val Gly Arg Gln Asn Ala Gly Glu Thr Trp His 435 440 445 Asp Ile Thr
Gly Asn Arg Ser Glu Pro Val Val Ile Asn Ser Glu Gly 450 455 460 Trp
Gly Glu Phe His Val Asn Gly Gly Ser Val Ser Ile Tyr Val Pro 465 470
475 480 Lys Ile Ser Thr Thr Ser Gln Ile Thr Phe Thr Val Asn Asn Ala
Thr 485 490 495 Thr Val Trp Gly Gln Asn Val Tyr Val Val Gly Asn Ile
Ser Gln Leu 500 505 510 Gly Asn Trp Asp Pro Val His Ala Val Gln Met
Thr Pro Ser Ser Tyr 515 520 525 Pro Thr Trp Thr Val Thr Ile Pro Leu
Leu Gln Gly Gln Asn Ile Gln 530 535 540 Phe Lys Phe Ile Lys Lys Asp
Ser Ala Gly Asn Val Ile Trp Glu Asp 545 550 555 560 Ile Ser Asn Arg
Thr Tyr Thr Val Pro Thr Ala Ala Ser Gly Ala Tyr 565 570 575 Thr Ala
Ser Trp Asn Val Pro 580 7 1755 DNA Bacillus stearothermophilus-CBM
CDS (1)..(1755) 7 gca ccg ttt aac ggc ttt aac ggc acc atg atg cag
tat ttt gaa tgg 48 Ala Pro Phe Asn Gly Phe Asn Gly Thr Met Met Gln
Tyr Phe Glu Trp 1 5 10 15 tac ttg ccg gat gat ggc acg tta tgg acc
aaa gtg gcc aat gaa gcc 96 Tyr Leu Pro Asp Asp Gly Thr Leu Trp Thr
Lys Val Ala Asn Glu Ala 20 25 30 aac aac tta tcc agc ctt ggc atc
acc gct ctt tgg ctg ccg ccc gct 144 Asn Asn Leu Ser Ser Leu Gly Ile
Thr Ala Leu Trp Leu Pro Pro Ala 35 40 45 tac aaa gga aca agc cgc
agc gac gta ggg tac gga gta tac gac ttg 192 Tyr Lys Gly Thr Ser Arg
Ser Asp Val Gly Tyr Gly Val Tyr Asp Leu 50 55 60 tat gac ctc ggc
gaa ttc aat caa aaa ggg acc gtc cgc aca aaa tac 240 Tyr Asp Leu Gly
Glu Phe Asn Gln Lys Gly Thr Val Arg Thr Lys Tyr 65 70 75 80 gga aca
aaa gct caa tat ctt caa gcc att caa gcc gcc cac gcc gct 288 Gly Thr
Lys Ala Gln Tyr Leu Gln Ala Ile Gln Ala Ala His Ala Ala 85 90 95
gga atg caa gtg tac gcc gat gtc gtg ttc gac cat aaa ggc ggc gct 336
Gly Met Gln Val Tyr Ala Asp Val Val Phe Asp His Lys Gly Gly Ala 100
105 110 gac ggc acg gaa tgg gtg gac gcc gtc gaa gtc aat ccg tcc gac
cgc 384 Asp Gly Thr Glu Trp Val Asp Ala Val Glu Val Asn Pro Ser Asp
Arg 115 120 125 aac caa gaa atc tcg ggc acc tat caa atc caa gca tgg
acg aaa ttt 432 Asn Gln Glu Ile Ser Gly Thr Tyr Gln Ile Gln Ala Trp
Thr Lys Phe 130 135 140 gat ttt ccc ggg cgg ggc aac acc tac tcc agc
ttt aag tgg cgc tgg 480 Asp Phe Pro Gly Arg Gly Asn Thr Tyr Ser Ser
Phe Lys Trp Arg Trp 145 150 155 160 tac cat ttt gac ggc gtt gat tgg
gac gaa agc cga
aaa ttg agc cgc 528 Tyr His Phe Asp Gly Val Asp Trp Asp Glu Ser Arg
Lys Leu Ser Arg 165 170 175 att tac aaa ttc cgt ggc aag gct tgg gat
tgg gaa gta gac acg gaa 576 Ile Tyr Lys Phe Arg Gly Lys Ala Trp Asp
Trp Glu Val Asp Thr Glu 180 185 190 ttc gga aac tat gac tac tta atg
tat gcc gac ctt gat atg gat cat 624 Phe Gly Asn Tyr Asp Tyr Leu Met
Tyr Ala Asp Leu Asp Met Asp His 195 200 205 ccc gaa gtc gtg acc gag
ctg aaa aac tgg ggg aaa tgg tat gtc aac 672 Pro Glu Val Val Thr Glu
Leu Lys Asn Trp Gly Lys Trp Tyr Val Asn 210 215 220 aca acg aac att
gat ggg ttc cgg ctt gat gcc gtc aag cat att aag 720 Thr Thr Asn Ile
Asp Gly Phe Arg Leu Asp Ala Val Lys His Ile Lys 225 230 235 240 ttc
agt ttt ttt cct gat tgg ttg tcg tat gtg cgt tct cag act ggc 768 Phe
Ser Phe Phe Pro Asp Trp Leu Ser Tyr Val Arg Ser Gln Thr Gly 245 250
255 aag ccg cta ttt acc gtc ggg gaa tat tgg agc tat gac atc aac aag
816 Lys Pro Leu Phe Thr Val Gly Glu Tyr Trp Ser Tyr Asp Ile Asn Lys
260 265 270 ttg cac aat tac att acg aaa aca gac gga acg atg tct ttg
ttt gat 864 Leu His Asn Tyr Ile Thr Lys Thr Asp Gly Thr Met Ser Leu
Phe Asp 275 280 285 gcc ccg tta cac aac aaa ttt tat acc gct tcc aaa
tca ggg ggc gca 912 Ala Pro Leu His Asn Lys Phe Tyr Thr Ala Ser Lys
Ser Gly Gly Ala 290 295 300 ttt gat atg cgc acg tta atg acc aat act
ctc atg aaa gat caa ccg 960 Phe Asp Met Arg Thr Leu Met Thr Asn Thr
Leu Met Lys Asp Gln Pro 305 310 315 320 aca ttg gcc gtc acc ttc gtt
gat aat cat gac acc gaa ccc ggc caa 1008 Thr Leu Ala Val Thr Phe
Val Asp Asn His Asp Thr Glu Pro Gly Gln 325 330 335 gcg ctg caa tca
tgg gtc gac cca tgg ttc aaa ccg ttg gct tac gcc 1056 Ala Leu Gln
Ser Trp Val Asp Pro Trp Phe Lys Pro Leu Ala Tyr Ala 340 345 350 ttt
att cta act cgg cag gaa gga tac ccg tgc gtc ttt tat ggt gac 1104
Phe Ile Leu Thr Arg Gln Glu Gly Tyr Pro Cys Val Phe Tyr Gly Asp 355
360 365 tat tat ggc att cca caa tat aac att cct tcg ctg aaa agc aaa
atc 1152 Tyr Tyr Gly Ile Pro Gln Tyr Asn Ile Pro Ser Leu Lys Ser
Lys Ile 370 375 380 gat ccg ctc ctc atc gcg cgc agg gat tat gct tac
gga acg caa cat 1200 Asp Pro Leu Leu Ile Ala Arg Arg Asp Tyr Ala
Tyr Gly Thr Gln His 385 390 395 400 gat tat ctt gat cac tcc gac atc
atc ggg tgg aca agg gaa ggg ggc 1248 Asp Tyr Leu Asp His Ser Asp
Ile Ile Gly Trp Thr Arg Glu Gly Gly 405 410 415 act gaa aaa cca gga
tcc gga ctg gcc gca ctg atc acc gat ggg ccg 1296 Thr Glu Lys Pro
Gly Ser Gly Leu Ala Ala Leu Ile Thr Asp Gly Pro 420 425 430 gga gga
agc aaa tgg atg tac gtt ggc aaa caa cac gct gga aaa gtg 1344 Gly
Gly Ser Lys Trp Met Tyr Val Gly Lys Gln His Ala Gly Lys Val 435 440
445 ttc tat gac ctt acc ggc aac cgg agt gac acc gtc acc atc aac agt
1392 Phe Tyr Asp Leu Thr Gly Asn Arg Ser Asp Thr Val Thr Ile Asn
Ser 450 455 460 gat gga tgg ggg gaa ttc aaa gtc aat ggc ggt tcg gtt
tcg gtt tgg 1440 Asp Gly Trp Gly Glu Phe Lys Val Asn Gly Gly Ser
Val Ser Val Trp 465 470 475 480 gtt cct aaa ata agt act act tcc caa
ata aca ttt act gta aat aac 1488 Val Pro Lys Ile Ser Thr Thr Ser
Gln Ile Thr Phe Thr Val Asn Asn 485 490 495 gcc aca acc gtt tgg gga
caa aat gta tac gtt gtc ggg aat att tcg 1536 Ala Thr Thr Val Trp
Gly Gln Asn Val Tyr Val Val Gly Asn Ile Ser 500 505 510 cag ctg ggg
aac tgg gat cca gtc cac gca gtt caa atg acg ccg tct 1584 Gln Leu
Gly Asn Trp Asp Pro Val His Ala Val Gln Met Thr Pro Ser 515 520 525
tct tat cca aca tgg act gta aca atc cct ctt ctt caa ggg caa aac
1632 Ser Tyr Pro Thr Trp Thr Val Thr Ile Pro Leu Leu Gln Gly Gln
Asn 530 535 540 ata caa ttt aaa ttt atc aaa aaa gat tca gct gga aat
gtc att tgg 1680 Ile Gln Phe Lys Phe Ile Lys Lys Asp Ser Ala Gly
Asn Val Ile Trp 545 550 555 560 gaa gat ata tcg aat cga aca tac acc
gtc cca act gct gca tcc gga 1728 Glu Asp Ile Ser Asn Arg Thr Tyr
Thr Val Pro Thr Ala Ala Ser Gly 565 570 575 gca tat aca gcc agc tgg
aac gtg ccc 1755 Ala Tyr Thr Ala Ser Trp Asn Val Pro 580 585 8 585
PRT Bacillus stearothermophilus-CBM 8 Ala Pro Phe Asn Gly Phe Asn
Gly Thr Met Met Gln Tyr Phe Glu Trp 1 5 10 15 Tyr Leu Pro Asp Asp
Gly Thr Leu Trp Thr Lys Val Ala Asn Glu Ala 20 25 30 Asn Asn Leu
Ser Ser Leu Gly Ile Thr Ala Leu Trp Leu Pro Pro Ala 35 40 45 Tyr
Lys Gly Thr Ser Arg Ser Asp Val Gly Tyr Gly Val Tyr Asp Leu 50 55
60 Tyr Asp Leu Gly Glu Phe Asn Gln Lys Gly Thr Val Arg Thr Lys Tyr
65 70 75 80 Gly Thr Lys Ala Gln Tyr Leu Gln Ala Ile Gln Ala Ala His
Ala Ala 85 90 95 Gly Met Gln Val Tyr Ala Asp Val Val Phe Asp His
Lys Gly Gly Ala 100 105 110 Asp Gly Thr Glu Trp Val Asp Ala Val Glu
Val Asn Pro Ser Asp Arg 115 120 125 Asn Gln Glu Ile Ser Gly Thr Tyr
Gln Ile Gln Ala Trp Thr Lys Phe 130 135 140 Asp Phe Pro Gly Arg Gly
Asn Thr Tyr Ser Ser Phe Lys Trp Arg Trp 145 150 155 160 Tyr His Phe
Asp Gly Val Asp Trp Asp Glu Ser Arg Lys Leu Ser Arg 165 170 175 Ile
Tyr Lys Phe Arg Gly Lys Ala Trp Asp Trp Glu Val Asp Thr Glu 180 185
190 Phe Gly Asn Tyr Asp Tyr Leu Met Tyr Ala Asp Leu Asp Met Asp His
195 200 205 Pro Glu Val Val Thr Glu Leu Lys Asn Trp Gly Lys Trp Tyr
Val Asn 210 215 220 Thr Thr Asn Ile Asp Gly Phe Arg Leu Asp Ala Val
Lys His Ile Lys 225 230 235 240 Phe Ser Phe Phe Pro Asp Trp Leu Ser
Tyr Val Arg Ser Gln Thr Gly 245 250 255 Lys Pro Leu Phe Thr Val Gly
Glu Tyr Trp Ser Tyr Asp Ile Asn Lys 260 265 270 Leu His Asn Tyr Ile
Thr Lys Thr Asp Gly Thr Met Ser Leu Phe Asp 275 280 285 Ala Pro Leu
His Asn Lys Phe Tyr Thr Ala Ser Lys Ser Gly Gly Ala 290 295 300 Phe
Asp Met Arg Thr Leu Met Thr Asn Thr Leu Met Lys Asp Gln Pro 305 310
315 320 Thr Leu Ala Val Thr Phe Val Asp Asn His Asp Thr Glu Pro Gly
Gln 325 330 335 Ala Leu Gln Ser Trp Val Asp Pro Trp Phe Lys Pro Leu
Ala Tyr Ala 340 345 350 Phe Ile Leu Thr Arg Gln Glu Gly Tyr Pro Cys
Val Phe Tyr Gly Asp 355 360 365 Tyr Tyr Gly Ile Pro Gln Tyr Asn Ile
Pro Ser Leu Lys Ser Lys Ile 370 375 380 Asp Pro Leu Leu Ile Ala Arg
Arg Asp Tyr Ala Tyr Gly Thr Gln His 385 390 395 400 Asp Tyr Leu Asp
His Ser Asp Ile Ile Gly Trp Thr Arg Glu Gly Gly 405 410 415 Thr Glu
Lys Pro Gly Ser Gly Leu Ala Ala Leu Ile Thr Asp Gly Pro 420 425 430
Gly Gly Ser Lys Trp Met Tyr Val Gly Lys Gln His Ala Gly Lys Val 435
440 445 Phe Tyr Asp Leu Thr Gly Asn Arg Ser Asp Thr Val Thr Ile Asn
Ser 450 455 460 Asp Gly Trp Gly Glu Phe Lys Val Asn Gly Gly Ser Val
Ser Val Trp 465 470 475 480 Val Pro Lys Ile Ser Thr Thr Ser Gln Ile
Thr Phe Thr Val Asn Asn 485 490 495 Ala Thr Thr Val Trp Gly Gln Asn
Val Tyr Val Val Gly Asn Ile Ser 500 505 510 Gln Leu Gly Asn Trp Asp
Pro Val His Ala Val Gln Met Thr Pro Ser 515 520 525 Ser Tyr Pro Thr
Trp Thr Val Thr Ile Pro Leu Leu Gln Gly Gln Asn 530 535 540 Ile Gln
Phe Lys Phe Ile Lys Lys Asp Ser Ala Gly Asn Val Ile Trp 545 550 555
560 Glu Asp Ile Ser Asn Arg Thr Tyr Thr Val Pro Thr Ala Ala Ser Gly
565 570 575 Ala Tyr Thr Ala Ser Trp Asn Val Pro 580 585 9 1755 DNA
JE1-CBM CDS (1)..(1755) 9 cat cat aat ggg aca aat ggg acg atg atg
caa tac ttt gaa tgg cac 48 His His Asn Gly Thr Asn Gly Thr Met Met
Gln Tyr Phe Glu Trp His 1 5 10 15 ttg cct aat gat ggg aat cac tgg
aat aga tta aga gat gat gct agt 96 Leu Pro Asn Asp Gly Asn His Trp
Asn Arg Leu Arg Asp Asp Ala Ser 20 25 30 aat cta aga aat aga ggt
ata acc gct att tgg att ccg ccg gcc tgg 144 Asn Leu Arg Asn Arg Gly
Ile Thr Ala Ile Trp Ile Pro Pro Ala Trp 35 40 45 aaa ggg act tcg
caa aat gat gtg ggg tat gga gcc tat gat ctt tat 192 Lys Gly Thr Ser
Gln Asn Asp Val Gly Tyr Gly Ala Tyr Asp Leu Tyr 50 55 60 gat tta
ggg gaa ttt aat caa aag ggg acg gtt cgt act aag tat ggg 240 Asp Leu
Gly Glu Phe Asn Gln Lys Gly Thr Val Arg Thr Lys Tyr Gly 65 70 75 80
aca cgt agt caa ttg gag tct gcc atc cat gct tta aag aat aat ggc 288
Thr Arg Ser Gln Leu Glu Ser Ala Ile His Ala Leu Lys Asn Asn Gly 85
90 95 gtt caa gtt tat ggg gat gta gtg atg aac cat aaa gga gga gct
gat 336 Val Gln Val Tyr Gly Asp Val Val Met Asn His Lys Gly Gly Ala
Asp 100 105 110 gct aca gaa aac gtt ctt gct gtc gag gtg aat cca aat
aac cgg aat 384 Ala Thr Glu Asn Val Leu Ala Val Glu Val Asn Pro Asn
Asn Arg Asn 115 120 125 caa gaa ata tct ggg gac tac aca att gag gct
tgg act aag ttt gat 432 Gln Glu Ile Ser Gly Asp Tyr Thr Ile Glu Ala
Trp Thr Lys Phe Asp 130 135 140 ttt cca ggg agg ggt aat aca tac tca
gac ttt aaa tgg cgt tgg tat 480 Phe Pro Gly Arg Gly Asn Thr Tyr Ser
Asp Phe Lys Trp Arg Trp Tyr 145 150 155 160 cat ttc gat ggt gta gat
tgg gat caa tca cga caa ttc caa aat cgt 528 His Phe Asp Gly Val Asp
Trp Asp Gln Ser Arg Gln Phe Gln Asn Arg 165 170 175 atc tac aaa ttc
cga ggt aaa gct tgg gat tgg gaa gta gat tcg gaa 576 Ile Tyr Lys Phe
Arg Gly Lys Ala Trp Asp Trp Glu Val Asp Ser Glu 180 185 190 aat gga
aat tat gat tat tta atg tat gca gat gta gat atg gat cat 624 Asn Gly
Asn Tyr Asp Tyr Leu Met Tyr Ala Asp Val Asp Met Asp His 195 200 205
ccg gag gta gta aat gag ctt aga aga tgg gga gaa tgg tat aca aat 672
Pro Glu Val Val Asn Glu Leu Arg Arg Trp Gly Glu Trp Tyr Thr Asn 210
215 220 aca tta aat ctt gat gga ttt agg atc gat gcg gtg aag cat att
aaa 720 Thr Leu Asn Leu Asp Gly Phe Arg Ile Asp Ala Val Lys His Ile
Lys 225 230 235 240 tat agc ttt aca cgt gat tgg ttg acc cat gta aga
aac gca acg gga 768 Tyr Ser Phe Thr Arg Asp Trp Leu Thr His Val Arg
Asn Ala Thr Gly 245 250 255 aaa gaa atg ttt gct gtt gct gaa ttt tgg
aaa aat gat tta ggt gcc 816 Lys Glu Met Phe Ala Val Ala Glu Phe Trp
Lys Asn Asp Leu Gly Ala 260 265 270 ttg gag aac tat tta aat aaa aca
aac tgg aat cat tct gtc ttt gat 864 Leu Glu Asn Tyr Leu Asn Lys Thr
Asn Trp Asn His Ser Val Phe Asp 275 280 285 gtc ccc ctt cat tat aat
ctt tat aac gcg tca aat agt gga ggc aac 912 Val Pro Leu His Tyr Asn
Leu Tyr Asn Ala Ser Asn Ser Gly Gly Asn 290 295 300 tat gac atg gca
aaa ctt ctt aat gga acg gtt gtt caa aag cat cca 960 Tyr Asp Met Ala
Lys Leu Leu Asn Gly Thr Val Val Gln Lys His Pro 305 310 315 320 atg
cat gcc gta act ttt gtg gat aat cac gat tct caa cct ggg gaa 1008
Met His Ala Val Thr Phe Val Asp Asn His Asp Ser Gln Pro Gly Glu 325
330 335 tca tta gaa tca ttt gta caa gaa tgg ttt aag cca ctt gct tat
gcg 1056 Ser Leu Glu Ser Phe Val Gln Glu Trp Phe Lys Pro Leu Ala
Tyr Ala 340 345 350 ctt att tta aca aga gaa caa ggc tat ccc tct gtc
ttc tat ggt gac 1104 Leu Ile Leu Thr Arg Glu Gln Gly Tyr Pro Ser
Val Phe Tyr Gly Asp 355 360 365 tac tat gga att cca aca cat agt gtc
cca gca atg aaa gcc aag att 1152 Tyr Tyr Gly Ile Pro Thr His Ser
Val Pro Ala Met Lys Ala Lys Ile 370 375 380 gat cca atc tta gag gcg
cgt caa aat ttt gca tat gga aca caa cat 1200 Asp Pro Ile Leu Glu
Ala Arg Gln Asn Phe Ala Tyr Gly Thr Gln His 385 390 395 400 gat tat
ttt gac cat cat aat ata atc gga tgg aca cgt gaa gga aat 1248 Asp
Tyr Phe Asp His His Asn Ile Ile Gly Trp Thr Arg Glu Gly Asn 405 410
415 acc acg cat ccc aat tca gga ctt gcg act atc atg tcg gat ggg cca
1296 Thr Thr His Pro Asn Ser Gly Leu Ala Thr Ile Met Ser Asp Gly
Pro 420 425 430 ggg gga gag aaa tgg atg tac gta ggg caa gat aaa gca
ggt caa gtt 1344 Gly Gly Glu Lys Trp Met Tyr Val Gly Gln Asp Lys
Ala Gly Gln Val 435 440 445 tgg cat gac ata act gga aat aaa cca ggc
aca gtt acg atc aat gca 1392 Trp His Asp Ile Thr Gly Asn Lys Pro
Gly Thr Val Thr Ile Asn Ala 450 455 460 gat gga tgg gcc aat ttt tca
gta aat gga gga tct gtt tcc att tgg 1440 Asp Gly Trp Ala Asn Phe
Ser Val Asn Gly Gly Ser Val Ser Ile Trp 465 470 475 480 gtg cca aaa
ata agt act act tcc caa ata aca ttt act gta aat aac 1488 Val Pro
Lys Ile Ser Thr Thr Ser Gln Ile Thr Phe Thr Val Asn Asn 485 490 495
gcc aca acc gtt tgg gga caa aat gta tac gtt gtc ggg aat att tcg
1536 Ala Thr Thr Val Trp Gly Gln Asn Val Tyr Val Val Gly Asn Ile
Ser 500 505 510 cag ctg ggg aac tgg gat cca gtc cac gca gtt caa atg
acg ccg tct 1584 Gln Leu Gly Asn Trp Asp Pro Val His Ala Val Gln
Met Thr Pro Ser 515 520 525 tct tat cca aca tgg act gta aca atc cct
ctt ctt caa ggg caa aac 1632 Ser Tyr Pro Thr Trp Thr Val Thr Ile
Pro Leu Leu Gln Gly Gln Asn 530 535 540 ata caa ttt aaa ttt atc aaa
aaa gat tca gct gga aat gtc att tgg 1680 Ile Gln Phe Lys Phe Ile
Lys Lys Asp Ser Ala Gly Asn Val Ile Trp 545 550 555 560 gaa gat ata
tcg aat cga aca tac acc gtc cca act gct gca tcc gga 1728 Glu Asp
Ile Ser Asn Arg Thr Tyr Thr Val Pro Thr Ala Ala Ser Gly 565 570 575
gca tat aca gcc agc tgg aac gtg ccc 1755 Ala Tyr Thr Ala Ser Trp
Asn Val Pro 580 585 10 585 PRT JE1-CBM 10 His His Asn Gly Thr Asn
Gly Thr Met Met Gln Tyr Phe Glu Trp His 1 5 10 15 Leu Pro Asn Asp
Gly Asn His Trp Asn Arg Leu Arg Asp Asp Ala Ser 20 25 30 Asn Leu
Arg Asn Arg Gly Ile Thr Ala Ile Trp Ile Pro Pro Ala Trp 35 40 45
Lys Gly Thr Ser Gln Asn Asp Val Gly Tyr Gly Ala Tyr Asp Leu Tyr 50
55 60 Asp Leu Gly Glu Phe Asn Gln Lys Gly Thr Val Arg Thr Lys Tyr
Gly 65 70 75 80 Thr Arg Ser Gln Leu Glu Ser Ala Ile His Ala Leu Lys
Asn Asn Gly 85 90 95 Val Gln Val Tyr Gly Asp Val Val Met Asn His
Lys Gly Gly Ala Asp 100 105 110 Ala Thr Glu Asn Val Leu Ala Val Glu
Val Asn Pro Asn Asn Arg Asn 115 120 125 Gln Glu Ile Ser Gly Asp Tyr
Thr Ile Glu Ala Trp Thr Lys Phe Asp 130 135 140 Phe Pro Gly Arg Gly
Asn Thr Tyr Ser Asp Phe Lys Trp Arg Trp Tyr 145 150 155 160 His Phe
Asp Gly Val Asp Trp Asp Gln Ser Arg Gln Phe Gln Asn Arg 165 170 175
Ile Tyr Lys Phe Arg Gly Lys Ala Trp Asp Trp Glu Val Asp Ser Glu 180
185 190 Asn Gly Asn Tyr Asp Tyr Leu Met Tyr Ala Asp Val Asp Met Asp
His 195 200 205 Pro Glu Val Val Asn Glu Leu Arg Arg Trp Gly Glu Trp
Tyr Thr Asn 210
215 220 Thr Leu Asn Leu Asp Gly Phe Arg Ile Asp Ala Val Lys His Ile
Lys 225 230 235 240 Tyr Ser Phe Thr Arg Asp Trp Leu Thr His Val Arg
Asn Ala Thr Gly 245 250 255 Lys Glu Met Phe Ala Val Ala Glu Phe Trp
Lys Asn Asp Leu Gly Ala 260 265 270 Leu Glu Asn Tyr Leu Asn Lys Thr
Asn Trp Asn His Ser Val Phe Asp 275 280 285 Val Pro Leu His Tyr Asn
Leu Tyr Asn Ala Ser Asn Ser Gly Gly Asn 290 295 300 Tyr Asp Met Ala
Lys Leu Leu Asn Gly Thr Val Val Gln Lys His Pro 305 310 315 320 Met
His Ala Val Thr Phe Val Asp Asn His Asp Ser Gln Pro Gly Glu 325 330
335 Ser Leu Glu Ser Phe Val Gln Glu Trp Phe Lys Pro Leu Ala Tyr Ala
340 345 350 Leu Ile Leu Thr Arg Glu Gln Gly Tyr Pro Ser Val Phe Tyr
Gly Asp 355 360 365 Tyr Tyr Gly Ile Pro Thr His Ser Val Pro Ala Met
Lys Ala Lys Ile 370 375 380 Asp Pro Ile Leu Glu Ala Arg Gln Asn Phe
Ala Tyr Gly Thr Gln His 385 390 395 400 Asp Tyr Phe Asp His His Asn
Ile Ile Gly Trp Thr Arg Glu Gly Asn 405 410 415 Thr Thr His Pro Asn
Ser Gly Leu Ala Thr Ile Met Ser Asp Gly Pro 420 425 430 Gly Gly Glu
Lys Trp Met Tyr Val Gly Gln Asp Lys Ala Gly Gln Val 435 440 445 Trp
His Asp Ile Thr Gly Asn Lys Pro Gly Thr Val Thr Ile Asn Ala 450 455
460 Asp Gly Trp Ala Asn Phe Ser Val Asn Gly Gly Ser Val Ser Ile Trp
465 470 475 480 Val Pro Lys Ile Ser Thr Thr Ser Gln Ile Thr Phe Thr
Val Asn Asn 485 490 495 Ala Thr Thr Val Trp Gly Gln Asn Val Tyr Val
Val Gly Asn Ile Ser 500 505 510 Gln Leu Gly Asn Trp Asp Pro Val His
Ala Val Gln Met Thr Pro Ser 515 520 525 Ser Tyr Pro Thr Trp Thr Val
Thr Ile Pro Leu Leu Gln Gly Gln Asn 530 535 540 Ile Gln Phe Lys Phe
Ile Lys Lys Asp Ser Ala Gly Asn Val Ile Trp 545 550 555 560 Glu Asp
Ile Ser Asn Arg Thr Tyr Thr Val Pro Thr Ala Ala Ser Gly 565 570 575
Ala Tyr Thr Ala Ser Trp Asn Val Pro 580 585 11 1755 DNA AX379-CBM
CDS (1)..(1755) 11 cac cat aat ggt acg aac ggc aca atg atg cag tac
ttt gaa tgg tat 48 His His Asn Gly Thr Asn Gly Thr Met Met Gln Tyr
Phe Glu Trp Tyr 1 5 10 15 cta cca aat gac gga aac cat tgg aat aga
tta agg tct gat gca agt 96 Leu Pro Asn Asp Gly Asn His Trp Asn Arg
Leu Arg Ser Asp Ala Ser 20 25 30 aac cta aaa gat aaa ggg atc tca
gcg gtt tgg att cct cct gca tgg 144 Asn Leu Lys Asp Lys Gly Ile Ser
Ala Val Trp Ile Pro Pro Ala Trp 35 40 45 aag ggt gcc tct caa aat
gat gtg ggg tat ggt gct tat gat ctg tat 192 Lys Gly Ala Ser Gln Asn
Asp Val Gly Tyr Gly Ala Tyr Asp Leu Tyr 50 55 60 gat tta gga gaa
ttc aat caa aaa gga acc att cgt aca aaa tat gga 240 Asp Leu Gly Glu
Phe Asn Gln Lys Gly Thr Ile Arg Thr Lys Tyr Gly 65 70 75 80 acg cgc
aat cag tta caa gct gcg gtt aac gcc ttg aaa agt aat gga 288 Thr Arg
Asn Gln Leu Gln Ala Ala Val Asn Ala Leu Lys Ser Asn Gly 85 90 95
att caa gtg tat ggc gat gtt gta atg aat cat aaa ggg gga gca gac 336
Ile Gln Val Tyr Gly Asp Val Val Met Asn His Lys Gly Gly Ala Asp 100
105 110 gct acc gaa atg gtt aaa gca gtc gaa gta aac ccg aat aat aga
aat 384 Ala Thr Glu Met Val Lys Ala Val Glu Val Asn Pro Asn Asn Arg
Asn 115 120 125 caa gaa gtg tcc ggt gaa tat aca att gag gct tgg aca
aag ttt gac 432 Gln Glu Val Ser Gly Glu Tyr Thr Ile Glu Ala Trp Thr
Lys Phe Asp 130 135 140 ttt cca gga cga ggt aat act cat tca aac ttc
aaa tgg aga tgg tat 480 Phe Pro Gly Arg Gly Asn Thr His Ser Asn Phe
Lys Trp Arg Trp Tyr 145 150 155 160 cac ttt gat gga gta gat tgg gat
cag tca cgt aag ctg aac aat cga 528 His Phe Asp Gly Val Asp Trp Asp
Gln Ser Arg Lys Leu Asn Asn Arg 165 170 175 att tat aaa ttc cgc ggt
aaa ggg tgg gat tgg gaa gtc gat aca gaa 576 Ile Tyr Lys Phe Arg Gly
Lys Gly Trp Asp Trp Glu Val Asp Thr Glu 180 185 190 ttc ggt aac tat
gat tac cta atg tat gca gat att gac atg gat cac 624 Phe Gly Asn Tyr
Asp Tyr Leu Met Tyr Ala Asp Ile Asp Met Asp His 195 200 205 cca gag
gta gtg aat gag cta aga aat tgg ggt gtt tgg tat acg aat 672 Pro Glu
Val Val Asn Glu Leu Arg Asn Trp Gly Val Trp Tyr Thr Asn 210 215 220
aca tta ggc ctt gat ggt ttt aga ata gat gca gta aaa cat ata aaa 720
Thr Leu Gly Leu Asp Gly Phe Arg Ile Asp Ala Val Lys His Ile Lys 225
230 235 240 tac agc ttt act cgt gat tgg att aat cat gtt aga agt gca
act ggc 768 Tyr Ser Phe Thr Arg Asp Trp Ile Asn His Val Arg Ser Ala
Thr Gly 245 250 255 aaa aat atg ttt gcg gtt gcg gaa ttt tgg aaa aat
gat tta ggt gct 816 Lys Asn Met Phe Ala Val Ala Glu Phe Trp Lys Asn
Asp Leu Gly Ala 260 265 270 att gaa aac tat tta aac aaa aca aac tgg
aac cat tca gtc ttt gat 864 Ile Glu Asn Tyr Leu Asn Lys Thr Asn Trp
Asn His Ser Val Phe Asp 275 280 285 gtt ccg ctg cac tat aac ctc tat
aat gct tca aaa agc gga ggg aat 912 Val Pro Leu His Tyr Asn Leu Tyr
Asn Ala Ser Lys Ser Gly Gly Asn 290 295 300 tat gat atg agg caa ata
ttt aat ggt aca gtc gtg caa aag cat cca 960 Tyr Asp Met Arg Gln Ile
Phe Asn Gly Thr Val Val Gln Lys His Pro 305 310 315 320 atg cat gct
gtt aca ttt gtt gat aat cat gat tcg caa cct gaa gaa 1008 Met His
Ala Val Thr Phe Val Asp Asn His Asp Ser Gln Pro Glu Glu 325 330 335
gct tta gag tct ttt gtt gaa gaa tgg ttc aaa cca tta gcg tat gct
1056 Ala Leu Glu Ser Phe Val Glu Glu Trp Phe Lys Pro Leu Ala Tyr
Ala 340 345 350 ttg aca tta aca cgt gaa caa ggc tac cct tct gta ttt
tat gga gat 1104 Leu Thr Leu Thr Arg Glu Gln Gly Tyr Pro Ser Val
Phe Tyr Gly Asp 355 360 365 tat tat ggc att cca acg cat ggt gta cca
gcg atg aaa tcg aaa att 1152 Tyr Tyr Gly Ile Pro Thr His Gly Val
Pro Ala Met Lys Ser Lys Ile 370 375 380 gac ccg att cta gaa gcg cgt
caa aag tat gca tat gga aga caa aat 1200 Asp Pro Ile Leu Glu Ala
Arg Gln Lys Tyr Ala Tyr Gly Arg Gln Asn 385 390 395 400 gac tac tta
gac cat cat aat atc atc ggt tgg aca cgt gaa ggg aat 1248 Asp Tyr
Leu Asp His His Asn Ile Ile Gly Trp Thr Arg Glu Gly Asn 405 410 415
aca gca cac ccc aac tcc ggt tta gct act atc atg tcc gat ggg gca
1296 Thr Ala His Pro Asn Ser Gly Leu Ala Thr Ile Met Ser Asp Gly
Ala 420 425 430 gga gga aat aag tgg atg ttt gtt ggg cgt aat aaa gct
ggt caa gtt 1344 Gly Gly Asn Lys Trp Met Phe Val Gly Arg Asn Lys
Ala Gly Gln Val 435 440 445 tgg acc gat atc act gga aat aaa gcc ggt
act gtt acg att aat gct 1392 Trp Thr Asp Ile Thr Gly Asn Lys Ala
Gly Thr Val Thr Ile Asn Ala 450 455 460 gat gga tgg ggt aat ttt tct
gta aat gga gga tca gtt tct att tgg 1440 Asp Gly Trp Gly Asn Phe
Ser Val Asn Gly Gly Ser Val Ser Ile Trp 465 470 475 480 gta aac aaa
ata agt act act tcc caa ata aca ttt act gta aat aac 1488 Val Asn
Lys Ile Ser Thr Thr Ser Gln Ile Thr Phe Thr Val Asn Asn 485 490 495
gcc aca acc gtt tgg gga caa aat gta tac gtt gtc ggg aat att tcg
1536 Ala Thr Thr Val Trp Gly Gln Asn Val Tyr Val Val Gly Asn Ile
Ser 500 505 510 cag ctg ggg aac tgg gat cca gtc cac gca gtt caa atg
acg ccg tct 1584 Gln Leu Gly Asn Trp Asp Pro Val His Ala Val Gln
Met Thr Pro Ser 515 520 525 tct tat cca aca tgg act gta aca atc cct
ctt ctt caa ggg caa aac 1632 Ser Tyr Pro Thr Trp Thr Val Thr Ile
Pro Leu Leu Gln Gly Gln Asn 530 535 540 ata caa ttt aaa ttt atc aaa
aaa gat tca gct gga aat gtc att tgg 1680 Ile Gln Phe Lys Phe Ile
Lys Lys Asp Ser Ala Gly Asn Val Ile Trp 545 550 555 560 gaa gat ata
tcg aat cga aca tac acc gtc cca act gct gca tcc gga 1728 Glu Asp
Ile Ser Asn Arg Thr Tyr Thr Val Pro Thr Ala Ala Ser Gly 565 570 575
gca tat aca gcc agc tgg aac gtg ccc 1755 Ala Tyr Thr Ala Ser Trp
Asn Val Pro 580 585 12 585 PRT AX379-CBM 12 His His Asn Gly Thr Asn
Gly Thr Met Met Gln Tyr Phe Glu Trp Tyr 1 5 10 15 Leu Pro Asn Asp
Gly Asn His Trp Asn Arg Leu Arg Ser Asp Ala Ser 20 25 30 Asn Leu
Lys Asp Lys Gly Ile Ser Ala Val Trp Ile Pro Pro Ala Trp 35 40 45
Lys Gly Ala Ser Gln Asn Asp Val Gly Tyr Gly Ala Tyr Asp Leu Tyr 50
55 60 Asp Leu Gly Glu Phe Asn Gln Lys Gly Thr Ile Arg Thr Lys Tyr
Gly 65 70 75 80 Thr Arg Asn Gln Leu Gln Ala Ala Val Asn Ala Leu Lys
Ser Asn Gly 85 90 95 Ile Gln Val Tyr Gly Asp Val Val Met Asn His
Lys Gly Gly Ala Asp 100 105 110 Ala Thr Glu Met Val Lys Ala Val Glu
Val Asn Pro Asn Asn Arg Asn 115 120 125 Gln Glu Val Ser Gly Glu Tyr
Thr Ile Glu Ala Trp Thr Lys Phe Asp 130 135 140 Phe Pro Gly Arg Gly
Asn Thr His Ser Asn Phe Lys Trp Arg Trp Tyr 145 150 155 160 His Phe
Asp Gly Val Asp Trp Asp Gln Ser Arg Lys Leu Asn Asn Arg 165 170 175
Ile Tyr Lys Phe Arg Gly Lys Gly Trp Asp Trp Glu Val Asp Thr Glu 180
185 190 Phe Gly Asn Tyr Asp Tyr Leu Met Tyr Ala Asp Ile Asp Met Asp
His 195 200 205 Pro Glu Val Val Asn Glu Leu Arg Asn Trp Gly Val Trp
Tyr Thr Asn 210 215 220 Thr Leu Gly Leu Asp Gly Phe Arg Ile Asp Ala
Val Lys His Ile Lys 225 230 235 240 Tyr Ser Phe Thr Arg Asp Trp Ile
Asn His Val Arg Ser Ala Thr Gly 245 250 255 Lys Asn Met Phe Ala Val
Ala Glu Phe Trp Lys Asn Asp Leu Gly Ala 260 265 270 Ile Glu Asn Tyr
Leu Asn Lys Thr Asn Trp Asn His Ser Val Phe Asp 275 280 285 Val Pro
Leu His Tyr Asn Leu Tyr Asn Ala Ser Lys Ser Gly Gly Asn 290 295 300
Tyr Asp Met Arg Gln Ile Phe Asn Gly Thr Val Val Gln Lys His Pro 305
310 315 320 Met His Ala Val Thr Phe Val Asp Asn His Asp Ser Gln Pro
Glu Glu 325 330 335 Ala Leu Glu Ser Phe Val Glu Glu Trp Phe Lys Pro
Leu Ala Tyr Ala 340 345 350 Leu Thr Leu Thr Arg Glu Gln Gly Tyr Pro
Ser Val Phe Tyr Gly Asp 355 360 365 Tyr Tyr Gly Ile Pro Thr His Gly
Val Pro Ala Met Lys Ser Lys Ile 370 375 380 Asp Pro Ile Leu Glu Ala
Arg Gln Lys Tyr Ala Tyr Gly Arg Gln Asn 385 390 395 400 Asp Tyr Leu
Asp His His Asn Ile Ile Gly Trp Thr Arg Glu Gly Asn 405 410 415 Thr
Ala His Pro Asn Ser Gly Leu Ala Thr Ile Met Ser Asp Gly Ala 420 425
430 Gly Gly Asn Lys Trp Met Phe Val Gly Arg Asn Lys Ala Gly Gln Val
435 440 445 Trp Thr Asp Ile Thr Gly Asn Lys Ala Gly Thr Val Thr Ile
Asn Ala 450 455 460 Asp Gly Trp Gly Asn Phe Ser Val Asn Gly Gly Ser
Val Ser Ile Trp 465 470 475 480 Val Asn Lys Ile Ser Thr Thr Ser Gln
Ile Thr Phe Thr Val Asn Asn 485 490 495 Ala Thr Thr Val Trp Gly Gln
Asn Val Tyr Val Val Gly Asn Ile Ser 500 505 510 Gln Leu Gly Asn Trp
Asp Pro Val His Ala Val Gln Met Thr Pro Ser 515 520 525 Ser Tyr Pro
Thr Trp Thr Val Thr Ile Pro Leu Leu Gln Gly Gln Asn 530 535 540 Ile
Gln Phe Lys Phe Ile Lys Lys Asp Ser Ala Gly Asn Val Ile Trp 545 550
555 560 Glu Asp Ile Ser Asn Arg Thr Tyr Thr Val Pro Thr Ala Ala Ser
Gly 565 570 575 Ala Tyr Thr Ala Ser Trp Asn Val Pro 580 585 13 1755
DNA BAN-CBM CDS (1)..(1755) 13 gta aat ggc acg ctg atg cag tat ttt
gaa tgg tat acg ccg aac gac 48 Val Asn Gly Thr Leu Met Gln Tyr Phe
Glu Trp Tyr Thr Pro Asn Asp 1 5 10 15 ggc cag cat tgg aaa cga ttg
cag aat gat gcg gaa cat tta tcg gat 96 Gly Gln His Trp Lys Arg Leu
Gln Asn Asp Ala Glu His Leu Ser Asp 20 25 30 atc gga atc act gcc
gtc tgg att cct ccc gca tac aaa gga ttg agc 144 Ile Gly Ile Thr Ala
Val Trp Ile Pro Pro Ala Tyr Lys Gly Leu Ser 35 40 45 caa tcc gat
aac gga tac gga cct tat gat ttg tat gat tta gga gaa 192 Gln Ser Asp
Asn Gly Tyr Gly Pro Tyr Asp Leu Tyr Asp Leu Gly Glu 50 55 60 ttc
cag caa aaa ggg acg gtc aga acg aaa tac ggc aca aaa tca gag 240 Phe
Gln Gln Lys Gly Thr Val Arg Thr Lys Tyr Gly Thr Lys Ser Glu 65 70
75 80 ctt caa gat gcg atc ggc tca ctg cat tcc cgg aac gtc caa gta
tac 288 Leu Gln Asp Ala Ile Gly Ser Leu His Ser Arg Asn Val Gln Val
Tyr 85 90 95 gga gat gtg gtt ttg aat cat aag gct ggt gct gat gca
aca gaa gat 336 Gly Asp Val Val Leu Asn His Lys Ala Gly Ala Asp Ala
Thr Glu Asp 100 105 110 gta act gcc gtc gaa gtc aat ccg gcc aat aga
aat cag gaa act tcg 384 Val Thr Ala Val Glu Val Asn Pro Ala Asn Arg
Asn Gln Glu Thr Ser 115 120 125 gag gaa tat caa atc aaa gcg tgg acg
gat ttt cgt ttt ccg ggc cgt 432 Glu Glu Tyr Gln Ile Lys Ala Trp Thr
Asp Phe Arg Phe Pro Gly Arg 130 135 140 gga aac acg tac agt gat ttt
aaa tgg cat tgg tat cat ttc gac gga 480 Gly Asn Thr Tyr Ser Asp Phe
Lys Trp His Trp Tyr His Phe Asp Gly 145 150 155 160 gcg gac tgg gat
gaa tcc cgg aag atc agc cgc atc ttt aag ttt cgt 528 Ala Asp Trp Asp
Glu Ser Arg Lys Ile Ser Arg Ile Phe Lys Phe Arg 165 170 175 ggg gaa
gga aaa gcg tgg gat tgg gaa gta tca agt gaa aac ggc aac 576 Gly Glu
Gly Lys Ala Trp Asp Trp Glu Val Ser Ser Glu Asn Gly Asn 180 185 190
tat gac tat tta atg tat gct gat gtt gac tac gac cac cct gat gtc 624
Tyr Asp Tyr Leu Met Tyr Ala Asp Val Asp Tyr Asp His Pro Asp Val 195
200 205 gtg gca gag aca aaa aaa tgg ggt atc tgg tat gcg aat gaa ctg
tca 672 Val Ala Glu Thr Lys Lys Trp Gly Ile Trp Tyr Ala Asn Glu Leu
Ser 210 215 220 tta gac ggc ttc cgt att gat gcc gcc aaa cat att aaa
ttt tca ttt 720 Leu Asp Gly Phe Arg Ile Asp Ala Ala Lys His Ile Lys
Phe Ser Phe 225 230 235 240 ctg cgt gat tgg gtt cag gcg gtc aga cag
gcg acg gga aaa gaa atg 768 Leu Arg Asp Trp Val Gln Ala Val Arg Gln
Ala Thr Gly Lys Glu Met 245 250 255 ttt acg gtt gcg gag tat tgg cag
aat aat gcc ggg aaa ctc gaa aac 816 Phe Thr Val Ala Glu Tyr Trp Gln
Asn Asn Ala Gly Lys Leu Glu Asn 260 265 270 tac ttg aat aaa aca agc
ttt aat caa tcc gtg ttt gat gtt ccg ctt 864 Tyr Leu Asn Lys Thr Ser
Phe Asn Gln Ser Val Phe Asp Val Pro Leu 275 280 285 cat ttc aat tta
cag gcg gct tcc tca caa gga ggc gga tat gat atg 912 His Phe Asn Leu
Gln Ala Ala Ser Ser Gln Gly Gly Gly Tyr Asp Met 290 295 300 agg cgt
ttg ctg gac ggt acc gtt gtg tcc agg cat ccg gaa aag gcg 960 Arg Arg
Leu Leu Asp Gly Thr Val Val Ser Arg His Pro Glu Lys Ala 305 310 315
320 gtt aca ttt gtt gaa aat cat gac aca cag ccg gga cag tca ttg gaa
1008 Val Thr Phe Val Glu Asn His Asp Thr Gln Pro Gly Gln Ser Leu
Glu
325 330 335 tcg aca gtc caa act tgg ttt aaa ccg ctt gca tac gcc ttt
att ttg 1056 Ser Thr Val Gln Thr Trp Phe Lys Pro Leu Ala Tyr Ala
Phe Ile Leu 340 345 350 aca aga gaa tcc ggt tat cct cag gtg ttc tat
ggg gat atg tac ggg 1104 Thr Arg Glu Ser Gly Tyr Pro Gln Val Phe
Tyr Gly Asp Met Tyr Gly 355 360 365 aca aaa ggg aca tcg cca aag gaa
att ccc tca ctg aaa gat aat ata 1152 Thr Lys Gly Thr Ser Pro Lys
Glu Ile Pro Ser Leu Lys Asp Asn Ile 370 375 380 gag ccg att tta aaa
gcg cgt aag gag tac gca tac ggg ccc cag cac 1200 Glu Pro Ile Leu
Lys Ala Arg Lys Glu Tyr Ala Tyr Gly Pro Gln His 385 390 395 400 gat
tat att gac cac ccg gat gtg atc gga tgg acg agg gaa ggt gac 1248
Asp Tyr Ile Asp His Pro Asp Val Ile Gly Trp Thr Arg Glu Gly Asp 405
410 415 agc tcc gcc gcc aaa tca ggt ttg gcc gct tta atc acg gac gga
ccc 1296 Ser Ser Ala Ala Lys Ser Gly Leu Ala Ala Leu Ile Thr Asp
Gly Pro 420 425 430 ggc gga tca aag cgg atg tat gcc ggc ctg aaa aat
gcc ggc gag aca 1344 Gly Gly Ser Lys Arg Met Tyr Ala Gly Leu Lys
Asn Ala Gly Glu Thr 435 440 445 tgg tat gac ata acg ggc aac cgt tca
gat act gta aaa atc gga tct 1392 Trp Tyr Asp Ile Thr Gly Asn Arg
Ser Asp Thr Val Lys Ile Gly Ser 450 455 460 gac ggc tgg gga gag ttt
cat gta aac gat ggg tcc gtc tcc att tat 1440 Asp Gly Trp Gly Glu
Phe His Val Asn Asp Gly Ser Val Ser Ile Tyr 465 470 475 480 gtt cca
aaa ata agt act act tcc caa ata aca ttt act gta aat aac 1488 Val
Pro Lys Ile Ser Thr Thr Ser Gln Ile Thr Phe Thr Val Asn Asn 485 490
495 gcc aca acc gtt tgg gga caa aat gta tac gtt gtc ggg aat att tcg
1536 Ala Thr Thr Val Trp Gly Gln Asn Val Tyr Val Val Gly Asn Ile
Ser 500 505 510 cag ctg ggg aac tgg gat cca gtc cac gca gtt caa atg
acg ccg tct 1584 Gln Leu Gly Asn Trp Asp Pro Val His Ala Val Gln
Met Thr Pro Ser 515 520 525 tct tat cca aca tgg act gta aca atc cct
ctt ctt caa ggg caa aac 1632 Ser Tyr Pro Thr Trp Thr Val Thr Ile
Pro Leu Leu Gln Gly Gln Asn 530 535 540 ata caa ttt aaa ttt atc aaa
aaa gat tca gct gga aat gtc att tgg 1680 Ile Gln Phe Lys Phe Ile
Lys Lys Asp Ser Ala Gly Asn Val Ile Trp 545 550 555 560 gaa gat ata
tcg aat cga aca tac acc gtc cca act gct gca tcc gga 1728 Glu Asp
Ile Ser Asn Arg Thr Tyr Thr Val Pro Thr Ala Ala Ser Gly 565 570 575
gca tat aca gcc agc tgg aac gtg ccc 1755 Ala Tyr Thr Ala Ser Trp
Asn Val Pro 580 585 14 585 PRT BAN-CBM 14 Val Asn Gly Thr Leu Met
Gln Tyr Phe Glu Trp Tyr Thr Pro Asn Asp 1 5 10 15 Gly Gln His Trp
Lys Arg Leu Gln Asn Asp Ala Glu His Leu Ser Asp 20 25 30 Ile Gly
Ile Thr Ala Val Trp Ile Pro Pro Ala Tyr Lys Gly Leu Ser 35 40 45
Gln Ser Asp Asn Gly Tyr Gly Pro Tyr Asp Leu Tyr Asp Leu Gly Glu 50
55 60 Phe Gln Gln Lys Gly Thr Val Arg Thr Lys Tyr Gly Thr Lys Ser
Glu 65 70 75 80 Leu Gln Asp Ala Ile Gly Ser Leu His Ser Arg Asn Val
Gln Val Tyr 85 90 95 Gly Asp Val Val Leu Asn His Lys Ala Gly Ala
Asp Ala Thr Glu Asp 100 105 110 Val Thr Ala Val Glu Val Asn Pro Ala
Asn Arg Asn Gln Glu Thr Ser 115 120 125 Glu Glu Tyr Gln Ile Lys Ala
Trp Thr Asp Phe Arg Phe Pro Gly Arg 130 135 140 Gly Asn Thr Tyr Ser
Asp Phe Lys Trp His Trp Tyr His Phe Asp Gly 145 150 155 160 Ala Asp
Trp Asp Glu Ser Arg Lys Ile Ser Arg Ile Phe Lys Phe Arg 165 170 175
Gly Glu Gly Lys Ala Trp Asp Trp Glu Val Ser Ser Glu Asn Gly Asn 180
185 190 Tyr Asp Tyr Leu Met Tyr Ala Asp Val Asp Tyr Asp His Pro Asp
Val 195 200 205 Val Ala Glu Thr Lys Lys Trp Gly Ile Trp Tyr Ala Asn
Glu Leu Ser 210 215 220 Leu Asp Gly Phe Arg Ile Asp Ala Ala Lys His
Ile Lys Phe Ser Phe 225 230 235 240 Leu Arg Asp Trp Val Gln Ala Val
Arg Gln Ala Thr Gly Lys Glu Met 245 250 255 Phe Thr Val Ala Glu Tyr
Trp Gln Asn Asn Ala Gly Lys Leu Glu Asn 260 265 270 Tyr Leu Asn Lys
Thr Ser Phe Asn Gln Ser Val Phe Asp Val Pro Leu 275 280 285 His Phe
Asn Leu Gln Ala Ala Ser Ser Gln Gly Gly Gly Tyr Asp Met 290 295 300
Arg Arg Leu Leu Asp Gly Thr Val Val Ser Arg His Pro Glu Lys Ala 305
310 315 320 Val Thr Phe Val Glu Asn His Asp Thr Gln Pro Gly Gln Ser
Leu Glu 325 330 335 Ser Thr Val Gln Thr Trp Phe Lys Pro Leu Ala Tyr
Ala Phe Ile Leu 340 345 350 Thr Arg Glu Ser Gly Tyr Pro Gln Val Phe
Tyr Gly Asp Met Tyr Gly 355 360 365 Thr Lys Gly Thr Ser Pro Lys Glu
Ile Pro Ser Leu Lys Asp Asn Ile 370 375 380 Glu Pro Ile Leu Lys Ala
Arg Lys Glu Tyr Ala Tyr Gly Pro Gln His 385 390 395 400 Asp Tyr Ile
Asp His Pro Asp Val Ile Gly Trp Thr Arg Glu Gly Asp 405 410 415 Ser
Ser Ala Ala Lys Ser Gly Leu Ala Ala Leu Ile Thr Asp Gly Pro 420 425
430 Gly Gly Ser Lys Arg Met Tyr Ala Gly Leu Lys Asn Ala Gly Glu Thr
435 440 445 Trp Tyr Asp Ile Thr Gly Asn Arg Ser Asp Thr Val Lys Ile
Gly Ser 450 455 460 Asp Gly Trp Gly Glu Phe His Val Asn Asp Gly Ser
Val Ser Ile Tyr 465 470 475 480 Val Pro Lys Ile Ser Thr Thr Ser Gln
Ile Thr Phe Thr Val Asn Asn 485 490 495 Ala Thr Thr Val Trp Gly Gln
Asn Val Tyr Val Val Gly Asn Ile Ser 500 505 510 Gln Leu Gly Asn Trp
Asp Pro Val His Ala Val Gln Met Thr Pro Ser 515 520 525 Ser Tyr Pro
Thr Trp Thr Val Thr Ile Pro Leu Leu Gln Gly Gln Asn 530 535 540 Ile
Gln Phe Lys Phe Ile Lys Lys Asp Ser Ala Gly Asn Val Ile Trp 545 550
555 560 Glu Asp Ile Ser Asn Arg Thr Tyr Thr Val Pro Thr Ala Ala Ser
Gly 565 570 575 Ala Tyr Thr Ala Ser Trp Asn Val Pro 580 585 15 87
DNA Bacillus licheniformis CDS (1)..(87) 15 atg aaa caa caa aaa cgg
ctt tac gcc cga ttg ctg acg ctg tta ttt 48 Met Lys Gln Gln Lys Arg
Leu Tyr Ala Arg Leu Leu Thr Leu Leu Phe 1 5 10 15 gcg ctc atc ttc
ttg ctg cct cat tct gca gcc gcg gca 87 Ala Leu Ile Phe Leu Leu Pro
His Ser Ala Ala Ala Ala 20 25 16 29 PRT Bacillus licheniformis 16
Met Lys Gln Gln Lys Arg Leu Tyr Ala Arg Leu Leu Thr Leu Leu Phe 1 5
10 15 Ala Leu Ile Phe Leu Leu Pro His Ser Ala Ala Ala Ala 20 25 17
1758 DNA BSG(AB)-CBM CDS (1)..(1758) 17 gca ccg ttt aac ggc ttt aac
ggc acc atg atg cag tat ttt gaa tgg 48 Ala Pro Phe Asn Gly Phe Asn
Gly Thr Met Met Gln Tyr Phe Glu Trp 1 5 10 15 tac ttg ccg gat gat
ggc acg tta tgg acc aaa gtg gcc aat gaa gcc 96 Tyr Leu Pro Asp Asp
Gly Thr Leu Trp Thr Lys Val Ala Asn Glu Ala 20 25 30 aac aac tta
tcc agc ctt ggc atc acc gct ctt tgg ctg ccg ccc gct 144 Asn Asn Leu
Ser Ser Leu Gly Ile Thr Ala Leu Trp Leu Pro Pro Ala 35 40 45 tac
aaa gga aca agc cgc agc gac gta ggg tac gga gta tac gac ttg 192 Tyr
Lys Gly Thr Ser Arg Ser Asp Val Gly Tyr Gly Val Tyr Asp Leu 50 55
60 tat gac ctc ggc gaa ttc aat caa aaa ggg acc gtc cgc aca aaa tac
240 Tyr Asp Leu Gly Glu Phe Asn Gln Lys Gly Thr Val Arg Thr Lys Tyr
65 70 75 80 gga aca aaa gct caa tat ctt caa gcc att caa gcc gcc cac
gcc gct 288 Gly Thr Lys Ala Gln Tyr Leu Gln Ala Ile Gln Ala Ala His
Ala Ala 85 90 95 gga atg caa gtg tac gcc gat gtc gtg ttc gac cat
aaa ggc ggc gct 336 Gly Met Gln Val Tyr Ala Asp Val Val Phe Asp His
Lys Gly Gly Ala 100 105 110 gac ggc acg gaa tgg gtg gac gcc gtc gaa
gtc aat ccg tcc gac cgc 384 Asp Gly Thr Glu Trp Val Asp Ala Val Glu
Val Asn Pro Ser Asp Arg 115 120 125 aac caa gaa atc tcg ggc acc tat
caa atc caa gca tgg acg aaa ttt 432 Asn Gln Glu Ile Ser Gly Thr Tyr
Gln Ile Gln Ala Trp Thr Lys Phe 130 135 140 gat ttt ccc ggg cgg ggc
aac acc tac tcc agc ttt aag tgg cgc tgg 480 Asp Phe Pro Gly Arg Gly
Asn Thr Tyr Ser Ser Phe Lys Trp Arg Trp 145 150 155 160 tac cat ttt
gac ggc gtt gat tgg gac gaa agc cga aaa ttg agc cgc 528 Tyr His Phe
Asp Gly Val Asp Trp Asp Glu Ser Arg Lys Leu Ser Arg 165 170 175 att
tac aaa ttc cgt ggc aag gct tgg gat tgg gaa gta gac acg gaa 576 Ile
Tyr Lys Phe Arg Gly Lys Ala Trp Asp Trp Glu Val Asp Thr Glu 180 185
190 ttc gga aac tat gac tac tta atg tat gcc gac ctt gat atg gat cat
624 Phe Gly Asn Tyr Asp Tyr Leu Met Tyr Ala Asp Leu Asp Met Asp His
195 200 205 ccc gaa gtc gtg acc gag ctg aaa aac tgg ggg aaa tgg tat
gtc aac 672 Pro Glu Val Val Thr Glu Leu Lys Asn Trp Gly Lys Trp Tyr
Val Asn 210 215 220 aca acg aac att gat ggg ttc cgg ctt gat gcc gtc
aag cat att aag 720 Thr Thr Asn Ile Asp Gly Phe Arg Leu Asp Ala Val
Lys His Ile Lys 225 230 235 240 ttc agt ttt ttt cct gat tgg ttg tcg
tat gtg cgt tct cag act ggc 768 Phe Ser Phe Phe Pro Asp Trp Leu Ser
Tyr Val Arg Ser Gln Thr Gly 245 250 255 aag ccg cta ttt acc gtc ggg
gaa tat tgg agc tat gac atc aac aag 816 Lys Pro Leu Phe Thr Val Gly
Glu Tyr Trp Ser Tyr Asp Ile Asn Lys 260 265 270 ttg cac aat tac att
acg aaa aca gac gga acg atg tct ttg ttt gat 864 Leu His Asn Tyr Ile
Thr Lys Thr Asp Gly Thr Met Ser Leu Phe Asp 275 280 285 gcc ccg tta
cac aac aaa ttt tat acc gct tcc aaa tca ggg ggc gca 912 Ala Pro Leu
His Asn Lys Phe Tyr Thr Ala Ser Lys Ser Gly Gly Ala 290 295 300 ttt
gat atg cgc acg tta atg acc aat act ctc atg aaa gat caa ccg 960 Phe
Asp Met Arg Thr Leu Met Thr Asn Thr Leu Met Lys Asp Gln Pro 305 310
315 320 aca ttg gcc gtc acc ttc gtt gat aat cat gac acc gaa ccc ggc
caa 1008 Thr Leu Ala Val Thr Phe Val Asp Asn His Asp Thr Glu Pro
Gly Gln 325 330 335 gcg ctg caa tca tgg gtc gac cca tgg ttc aaa ccg
ttg gct tac gcc 1056 Ala Leu Gln Ser Trp Val Asp Pro Trp Phe Lys
Pro Leu Ala Tyr Ala 340 345 350 ttt att cta act cgg cag gaa gga tac
ccg tgc gtc ttt tat ggt gac 1104 Phe Ile Leu Thr Arg Gln Glu Gly
Tyr Pro Cys Val Phe Tyr Gly Asp 355 360 365 tat tat ggc att cca caa
tat aac att cct tcg ctg aaa agc aaa atc 1152 Tyr Tyr Gly Ile Pro
Gln Tyr Asn Ile Pro Ser Leu Lys Ser Lys Ile 370 375 380 gat ccg ctc
ctc atc gcg cgc agg gat tat gct tac gga aca cag cac 1200 Asp Pro
Leu Leu Ile Ala Arg Arg Asp Tyr Ala Tyr Gly Thr Gln His 385 390 395
400 gac tat att gac agt gcg gat att atc ggt tgg acg cgg gaa gga gtg
1248 Asp Tyr Ile Asp Ser Ala Asp Ile Ile Gly Trp Thr Arg Glu Gly
Val 405 410 415 gct gaa aaa gca aat tca gga ctg gct gca ctc att acc
gac ggg cct 1296 Ala Glu Lys Ala Asn Ser Gly Leu Ala Ala Leu Ile
Thr Asp Gly Pro 420 425 430 ggc gga agc aaa tgg atg tat gtt gga aaa
caa cac gct ggc aaa acg 1344 Gly Gly Ser Lys Trp Met Tyr Val Gly
Lys Gln His Ala Gly Lys Thr 435 440 445 ttt tat gat tta acc ggc aat
cga agt gat aca gtg aca atc aat gct 1392 Phe Tyr Asp Leu Thr Gly
Asn Arg Ser Asp Thr Val Thr Ile Asn Ala 450 455 460 gat gga tgg gga
gaa ttt aaa gtc aat gga ggg tct gta tcc ata tgg 1440 Asp Gly Trp
Gly Glu Phe Lys Val Asn Gly Gly Ser Val Ser Ile Trp 465 470 475 480
gtt cca aaa ata agt act act tcc caa ata aca ttt act gta aat aac
1488 Val Pro Lys Ile Ser Thr Thr Ser Gln Ile Thr Phe Thr Val Asn
Asn 485 490 495 gcc aca acc gtt tgg gga caa aat gta tac gtt gtc ggg
aat att tcg 1536 Ala Thr Thr Val Trp Gly Gln Asn Val Tyr Val Val
Gly Asn Ile Ser 500 505 510 cag ctg ggg aac tgg gat cca gtc cac gca
gtt caa atg acg ccg tct 1584 Gln Leu Gly Asn Trp Asp Pro Val His
Ala Val Gln Met Thr Pro Ser 515 520 525 tct tat cca aca tgg act gta
aca atc cct ctt ctt caa ggg caa aac 1632 Ser Tyr Pro Thr Trp Thr
Val Thr Ile Pro Leu Leu Gln Gly Gln Asn 530 535 540 ata caa ttt aaa
ttt atc aaa aaa gat tca gct gga aat gtc att tgg 1680 Ile Gln Phe
Lys Phe Ile Lys Lys Asp Ser Ala Gly Asn Val Ile Trp 545 550 555 560
gaa gat ata tcg aat cga aca tac acc gtc cca act gct gca tcc gga
1728 Glu Asp Ile Ser Asn Arg Thr Tyr Thr Val Pro Thr Ala Ala Ser
Gly 565 570 575 gca tat aca gcc agc tgg aac gtg ccc tag 1758 Ala
Tyr Thr Ala Ser Trp Asn Val Pro 580 585 18 585 PRT BSG(AB)-CBM 18
Ala Pro Phe Asn Gly Phe Asn Gly Thr Met Met Gln Tyr Phe Glu Trp 1 5
10 15 Tyr Leu Pro Asp Asp Gly Thr Leu Trp Thr Lys Val Ala Asn Glu
Ala 20 25 30 Asn Asn Leu Ser Ser Leu Gly Ile Thr Ala Leu Trp Leu
Pro Pro Ala 35 40 45 Tyr Lys Gly Thr Ser Arg Ser Asp Val Gly Tyr
Gly Val Tyr Asp Leu 50 55 60 Tyr Asp Leu Gly Glu Phe Asn Gln Lys
Gly Thr Val Arg Thr Lys Tyr 65 70 75 80 Gly Thr Lys Ala Gln Tyr Leu
Gln Ala Ile Gln Ala Ala His Ala Ala 85 90 95 Gly Met Gln Val Tyr
Ala Asp Val Val Phe Asp His Lys Gly Gly Ala 100 105 110 Asp Gly Thr
Glu Trp Val Asp Ala Val Glu Val Asn Pro Ser Asp Arg 115 120 125 Asn
Gln Glu Ile Ser Gly Thr Tyr Gln Ile Gln Ala Trp Thr Lys Phe 130 135
140 Asp Phe Pro Gly Arg Gly Asn Thr Tyr Ser Ser Phe Lys Trp Arg Trp
145 150 155 160 Tyr His Phe Asp Gly Val Asp Trp Asp Glu Ser Arg Lys
Leu Ser Arg 165 170 175 Ile Tyr Lys Phe Arg Gly Lys Ala Trp Asp Trp
Glu Val Asp Thr Glu 180 185 190 Phe Gly Asn Tyr Asp Tyr Leu Met Tyr
Ala Asp Leu Asp Met Asp His 195 200 205 Pro Glu Val Val Thr Glu Leu
Lys Asn Trp Gly Lys Trp Tyr Val Asn 210 215 220 Thr Thr Asn Ile Asp
Gly Phe Arg Leu Asp Ala Val Lys His Ile Lys 225 230 235 240 Phe Ser
Phe Phe Pro Asp Trp Leu Ser Tyr Val Arg Ser Gln Thr Gly 245 250 255
Lys Pro Leu Phe Thr Val Gly Glu Tyr Trp Ser Tyr Asp Ile Asn Lys 260
265 270 Leu His Asn Tyr Ile Thr Lys Thr Asp Gly Thr Met Ser Leu Phe
Asp 275 280 285 Ala Pro Leu His Asn Lys Phe Tyr Thr Ala Ser Lys Ser
Gly Gly Ala 290 295 300 Phe Asp Met Arg Thr Leu Met Thr Asn Thr Leu
Met Lys Asp Gln Pro 305 310 315 320 Thr Leu Ala Val Thr Phe Val Asp
Asn His Asp Thr Glu Pro Gly Gln 325 330 335 Ala Leu Gln Ser Trp Val
Asp Pro Trp Phe Lys Pro Leu Ala Tyr Ala 340 345 350 Phe Ile Leu Thr
Arg Gln Glu Gly Tyr Pro Cys Val Phe Tyr Gly Asp 355 360 365 Tyr Tyr
Gly Ile Pro Gln Tyr Asn Ile Pro Ser Leu Lys Ser Lys Ile 370 375 380
Asp Pro Leu Leu Ile Ala Arg Arg Asp Tyr Ala Tyr Gly Thr Gln His 385
390 395 400 Asp Tyr
Ile Asp Ser Ala Asp Ile Ile Gly Trp Thr Arg Glu Gly Val 405 410 415
Ala Glu Lys Ala Asn Ser Gly Leu Ala Ala Leu Ile Thr Asp Gly Pro 420
425 430 Gly Gly Ser Lys Trp Met Tyr Val Gly Lys Gln His Ala Gly Lys
Thr 435 440 445 Phe Tyr Asp Leu Thr Gly Asn Arg Ser Asp Thr Val Thr
Ile Asn Ala 450 455 460 Asp Gly Trp Gly Glu Phe Lys Val Asn Gly Gly
Ser Val Ser Ile Trp 465 470 475 480 Val Pro Lys Ile Ser Thr Thr Ser
Gln Ile Thr Phe Thr Val Asn Asn 485 490 495 Ala Thr Thr Val Trp Gly
Gln Asn Val Tyr Val Val Gly Asn Ile Ser 500 505 510 Gln Leu Gly Asn
Trp Asp Pro Val His Ala Val Gln Met Thr Pro Ser 515 520 525 Ser Tyr
Pro Thr Trp Thr Val Thr Ile Pro Leu Leu Gln Gly Gln Asn 530 535 540
Ile Gln Phe Lys Phe Ile Lys Lys Asp Ser Ala Gly Asn Val Ile Trp 545
550 555 560 Glu Asp Ile Ser Asn Arg Thr Tyr Thr Val Pro Thr Ala Ala
Ser Gly 565 570 575 Ala Tyr Thr Ala Ser Trp Asn Val Pro 580 585 19
40 DNA Artificial Primer 19 ctcattctgc agccgcggca gcaaatctta
atgggacgct 40 20 40 DNA Artificial Primer 20 atttgggaag tagtacttat
tctttgaaca taaattgaaa 40 21 40 DNA Artificial Primer 21 ctcattctgc
agccgcggca gtaaatggca cgctgatgca 40 22 40 DNA Artificial Primer 22
atttgggaag tagtacttat ttttggaaca taaattgaaa 40 23 40 DNA Artificial
Primer 23 ctcattctgc agccgcggca gcaccgttta acggctttaa 40 24 40 DNA
Artificial Primer 24 atttgggaag tagtacttat tttaggaacc caaaccgaaa 40
25 40 DNA Artificial Primer 25 ctcattctgc agccgcggca catcataatg
ggacaaatgg 40 26 40 DNA Artificial Primer 26 atttgggaag tagtacttat
ccatttgtcc cattatgatg 40 27 40 DNA Artificial Primer 27 ctcattctgc
agccgcggca caccataatg gtacgaacgg 40 28 40 DNA Artificial Primer 28
atttgggaag tagtacttat tttgtttacc caaatagaaa 40 29 40 DNA Artificial
Primer 29 ctcattctgc agccgcggca gtaaatggca cgctgatgca 40 30 40 DNA
Artificial Primer 30 atttgggaag tagtacttat ttttggaaca taaatggaga 40
31 40 DNA Artificial Primer 31 ctcattctgc agccgcggca gcaccgttta
acggctttaa 40 32 40 DNA Artificial Primer 32 atatagtcgt gctgtgttcc
gtaagcataa tccctgcgcg 40 33 22 DNA Artificial Primer 33 ctgcatcagg
gctgcggcat cc 22 34 22 DNA Artificial Primer 34 ctgcatcagg
gctgcggcat cc 22 35 483 PRT BLA B. licheniformis mat_peptide
(1)..(483) 35 Ala Asn Leu Asn Gly Thr Leu Met Gln Tyr Phe Glu Trp
Tyr Met Pro 1 5 10 15 Asn Asp Gly Gln His Trp Arg Arg Leu Gln Asn
Asp Ser Ala Tyr Leu 20 25 30 Ala Glu His Gly Ile Thr Ala Val Trp
Ile Pro Pro Ala Tyr Lys Gly 35 40 45 Thr Ser Gln Ala Asp Val Gly
Tyr Gly Ala Tyr Asp Leu Tyr Asp Leu 50 55 60 Gly Glu Phe His Gln
Lys Gly Thr Val Arg Thr Lys Tyr Gly Thr Lys 65 70 75 80 Gly Glu Leu
Gln Ser Ala Ile Lys Ser Leu His Ser Arg Asp Ile Asn 85 90 95 Val
Tyr Gly Asp Val Val Ile Asn His Lys Gly Gly Ala Asp Ala Thr 100 105
110 Glu Asp Val Thr Ala Val Glu Val Asp Pro Ala Asp Arg Asn Arg Val
115 120 125 Ile Ser Gly Glu His Leu Ile Lys Ala Trp Thr His Phe His
Phe Pro 130 135 140 Gly Arg Gly Ser Thr Tyr Ser Asp Phe Lys Trp His
Trp Tyr His Phe 145 150 155 160 Asp Gly Thr Asp Trp Asp Glu Ser Arg
Lys Leu Asn Arg Ile Tyr Lys 165 170 175 Phe Gln Gly Lys Ala Trp Asp
Trp Glu Val Ser Asn Glu Asn Gly Asn 180 185 190 Tyr Asp Tyr Leu Met
Tyr Ala Asp Ile Asp Tyr Asp His Pro Asp Val 195 200 205 Ala Ala Glu
Ile Lys Arg Trp Gly Thr Trp Tyr Ala Asn Glu Leu Gln 210 215 220 Leu
Asp Gly Phe Arg Leu Asp Ala Val Lys His Ile Lys Phe Ser Phe 225 230
235 240 Leu Arg Asp Trp Val Asn His Val Arg Glu Lys Thr Gly Lys Glu
Met 245 250 255 Phe Thr Val Ala Glu Tyr Trp Gln Asn Asp Leu Gly Ala
Leu Glu Asn 260 265 270 Tyr Leu Asn Lys Thr Asn Phe Asn His Ser Val
Phe Asp Val Pro Leu 275 280 285 His Tyr Gln Phe His Ala Ala Ser Thr
Gln Gly Gly Gly Tyr Asp Met 290 295 300 Arg Lys Leu Leu Asn Gly Thr
Val Val Ser Lys His Pro Leu Lys Ser 305 310 315 320 Val Thr Phe Val
Asp Asn His Asp Thr Gln Pro Gly Gln Ser Leu Glu 325 330 335 Ser Thr
Val Gln Thr Trp Phe Lys Pro Leu Ala Tyr Ala Phe Ile Leu 340 345 350
Thr Arg Glu Ser Gly Tyr Pro Gln Val Phe Tyr Gly Asp Met Tyr Gly 355
360 365 Thr Lys Gly Asp Ser Gln Arg Glu Ile Pro Ala Leu Lys His Lys
Ile 370 375 380 Glu Pro Ile Leu Lys Ala Arg Lys Gln Tyr Ala Tyr Gly
Ala Gln His 385 390 395 400 Asp Tyr Phe Asp His His Asp Ile Val Gly
Trp Thr Arg Glu Gly Asp 405 410 415 Ser Ser Val Ala Asn Ser Gly Leu
Ala Ala Leu Ile Thr Asp Gly Pro 420 425 430 Gly Gly Ala Lys Arg Met
Tyr Val Gly Arg Gln Asn Ala Gly Glu Thr 435 440 445 Trp His Asp Ile
Thr Gly Asn Arg Ser Glu Pro Val Val Ile Asn Ser 450 455 460 Glu Gly
Trp Gly Glu Phe His Val Asn Gly Gly Ser Val Ser Ile Tyr 465 470 475
480 Val Gln Arg 36 514 PRT BSG B. stearothermophilus mat_peptide
(1)..(514) 36 Ala Ala Pro Phe Asn Gly Thr Met Met Gln Tyr Phe Glu
Trp Tyr Leu 1 5 10 15 Pro Asp Asp Gly Thr Leu Trp Thr Lys Val Ala
Asn Glu Ala Asn Asn 20 25 30 Leu Ser Ser Leu Gly Ile Thr Ala Leu
Trp Leu Pro Pro Ala Tyr Lys 35 40 45 Gly Thr Ser Arg Ser Asp Val
Gly Tyr Gly Val Tyr Asp Leu Tyr Asp 50 55 60 Leu Gly Glu Phe Asn
Gln Lys Gly Thr Val Arg Thr Lys Tyr Gly Thr 65 70 75 80 Lys Ala Gln
Tyr Leu Gln Ala Ile Gln Ala Ala His Ala Ala Gly Met 85 90 95 Gln
Val Tyr Ala Asp Val Val Phe Asp His Lys Gly Gly Ala Asp Gly 100 105
110 Thr Glu Trp Val Asp Ala Val Glu Val Asn Pro Ser Asp Arg Asn Gln
115 120 125 Glu Ile Ser Gly Thr Tyr Gln Ile Gln Ala Trp Thr Lys Phe
Asp Phe 130 135 140 Pro Gly Arg Gly Asn Thr Tyr Ser Ser Phe Lys Trp
Arg Trp Tyr His 145 150 155 160 Phe Asp Gly Val Asp Trp Asp Glu Ser
Arg Lys Leu Ser Arg Ile Tyr 165 170 175 Lys Phe Arg Gly Ile Gly Lys
Ala Trp Asp Trp Glu Val Asp Thr Glu 180 185 190 Asn Gly Asn Tyr Asp
Tyr Leu Met Tyr Ala Asp Leu Asp Met Asp His 195 200 205 Pro Glu Val
Val Thr Glu Leu Lys Asn Trp Gly Lys Trp Tyr Val Asn 210 215 220 Thr
Thr Asn Ile Asp Gly Phe Arg Leu Asp Ala Val Lys His Ile Lys 225 230
235 240 Phe Ser Phe Phe Pro Asp Trp Leu Ser Tyr Val Arg Ser Gln Thr
Gly 245 250 255 Lys Pro Leu Phe Thr Val Gly Glu Tyr Trp Ser Tyr Asp
Ile Asn Lys 260 265 270 Leu His Asn Tyr Ile Thr Lys Thr Asp Gly Thr
Met Ser Leu Phe Asp 275 280 285 Ala Pro Leu His Asn Lys Phe Tyr Thr
Ala Ser Lys Ser Gly Gly Ala 290 295 300 Phe Asp Met Arg Thr Leu Met
Thr Asn Thr Leu Met Lys Asp Gln Pro 305 310 315 320 Thr Leu Ala Val
Thr Phe Val Asp Asn His Asp Thr Glu Pro Gly Gln 325 330 335 Ala Leu
Gln Ser Trp Val Asp Pro Trp Phe Lys Pro Leu Ala Tyr Ala 340 345 350
Phe Ile Leu Thr Arg Gln Glu Gly Tyr Pro Cys Val Phe Tyr Gly Asp 355
360 365 Tyr Tyr Gly Ile Pro Gln Tyr Asn Ile Pro Ser Leu Lys Ser Lys
Ile 370 375 380 Asp Pro Leu Leu Ile Ala Arg Arg Asp Tyr Ala Tyr Gly
Thr Gln His 385 390 395 400 Asp Tyr Leu Asp His Ser Asp Ile Ile Gly
Trp Thr Arg Glu Gly Gly 405 410 415 Thr Glu Lys Pro Gly Ser Gly Leu
Ala Ala Leu Ile Thr Asp Gly Pro 420 425 430 Gly Gly Ser Lys Trp Met
Tyr Val Gly Lys Gln His Ala Gly Lys Val 435 440 445 Phe Tyr Asp Leu
Thr Gly Asn Arg Ser Asp Thr Val Thr Ile Asn Ser 450 455 460 Asp Gly
Trp Gly Glu Phe Lys Val Asn Gly Gly Ser Val Ser Val Trp 465 470 475
480 Val Pro Arg Lys Thr Thr Val Ser Thr Ile Ala Arg Pro Ile Thr Thr
485 490 495 Arg Pro Trp Thr Gly Glu Phe Val Arg Trp Thr Glu Pro Arg
Leu Val 500 505 510 Ala Trp 37 482 PRT BAN B. amyloliquefacience
mat_peptide (1)..(482) 37 Val Asn Gly Thr Leu Met Gln Tyr Phe Glu
Trp Tyr Thr Pro Asn Asp 1 5 10 15 Gly Gln His Trp Lys Arg Leu Gln
Asn Asp Ala Glu His Leu Ser Asp 20 25 30 Ile Gly Ile Thr Ala Val
Trp Ile Pro Pro Ala Tyr Lys Gly Leu Ser 35 40 45 Gln Ser Asp Asn
Gly Tyr Gly Pro Tyr Asp Leu Tyr Asp Leu Gly Glu 50 55 60 Phe Gln
Gln Lys Gly Thr Val Arg Thr Lys Tyr Gly Thr Lys Ser Glu 65 70 75 80
Leu Gln Asp Ala Ile Gly Ser Leu His Ser Arg Asn Val Gln Val Tyr 85
90 95 Gly Asp Val Val Leu Asn His Lys Ala Gly Ala Asp Ala Thr Glu
Asp 100 105 110 Val Thr Ala Val Glu Val Asn Pro Ala Asn Arg Asn Gln
Glu Thr Ser 115 120 125 Glu Glu Tyr Gln Ile Lys Ala Trp Thr Asp Phe
Arg Phe Pro Gly Arg 130 135 140 Gly Asn Thr Tyr Ser Asp Phe Lys Trp
His Trp Tyr His Phe Asp Gly 145 150 155 160 Ala Asp Trp Asp Glu Ser
Arg Lys Ile Ser Arg Ile Phe Lys Phe Arg 165 170 175 Gly Glu Gly Lys
Ala Trp Asp Trp Glu Val Ser Ser Glu Asn Gly Asn 180 185 190 Tyr Asp
Tyr Leu Met Tyr Ala Asp Val Asp Tyr Asp His Pro Asp Val 195 200 205
Val Ala Glu Thr Lys Lys Trp Gly Ile Trp Tyr Ala Asn Glu Leu Ser 210
215 220 Leu Asp Gly Phe Arg Ile Asp Ala Ala Lys His Ile Lys Phe Ser
Phe 225 230 235 240 Leu Arg Asp Trp Val Gln Ala Val Arg Gln Ala Thr
Gly Lys Glu Met 245 250 255 Phe Thr Val Ala Glu Tyr Trp Gln Asn Asn
Ala Gly Lys Leu Glu Asn 260 265 270 Tyr Leu Asn Lys Thr Ser Phe Asn
Gln Ser Val Phe Asp Val Pro Leu 275 280 285 His Phe Asn Leu Gln Ala
Ala Ser Ser Gln Gly Gly Gly Tyr Asp Met 290 295 300 Arg Arg Leu Leu
Asp Gly Thr Val Val Ser Arg His Pro Glu Lys Ala 305 310 315 320 Val
Thr Phe Val Glu Asn His Asp Thr Gln Pro Gly Gln Ser Leu Glu 325 330
335 Ser Thr Val Gln Thr Trp Phe Lys Pro Leu Ala Tyr Ala Phe Ile Leu
340 345 350 Thr Arg Glu Ser Gly Tyr Pro Gln Val Phe Tyr Gly Asp Met
Tyr Gly 355 360 365 Thr Lys Gly Thr Ser Pro Lys Glu Ile Pro Ser Leu
Lys Asp Asn Ile 370 375 380 Glu Pro Ile Leu Lys Ala Arg Lys Glu Tyr
Ala Tyr Gly Pro Gln His 385 390 395 400 Asp Tyr Ile Asp His Pro Asp
Val Ile Gly Trp Thr Arg Glu Gly Asp 405 410 415 Ser Ser Ala Ala Lys
Ser Gly Leu Ala Ala Leu Ile Thr Asp Gly Pro 420 425 430 Gly Gly Ser
Lys Arg Met Tyr Ala Gly Leu Lys Asn Ala Gly Glu Thr 435 440 445 Trp
Tyr Asp Ile Thr Gly Asn Arg Ser Asp Thr Val Lys Ile Gly Ser 450 455
460 Asp Gly Trp Gly Glu Phe His Val Asn Asp Gly Ser Val Ser Ile Tyr
465 470 475 480 Val Gln 38 485 PRT SP722 Bacillus sp. mat_peptide
(1)..(485) 38 His His Asn Gly Thr Asn Gly Thr Met Met Gln Tyr Phe
Glu Trp His 1 5 10 15 Leu Pro Asn Asp Gly Asn His Trp Asn Arg Leu
Arg Asp Asp Ala Ser 20 25 30 Asn Leu Arg Asn Arg Gly Ile Thr Ala
Ile Trp Ile Pro Pro Ala Trp 35 40 45 Lys Gly Thr Ser Gln Asn Asp
Val Gly Tyr Gly Ala Tyr Asp Leu Tyr 50 55 60 Asp Leu Gly Glu Phe
Asn Gln Lys Gly Thr Val Arg Thr Lys Tyr Gly 65 70 75 80 Thr Arg Ser
Gln Leu Glu Ser Ala Ile His Ala Leu Lys Asn Asn Gly 85 90 95 Val
Gln Val Tyr Gly Asp Val Val Met Asn His Lys Gly Gly Ala Asp 100 105
110 Ala Thr Glu Asn Val Leu Ala Val Glu Val Asn Pro Asn Asn Arg Asn
115 120 125 Gln Glu Ile Ser Gly Asp Tyr Thr Ile Glu Ala Trp Thr Lys
Phe Asp 130 135 140 Phe Pro Gly Arg Gly Asn Thr Tyr Ser Asp Phe Lys
Trp Arg Trp Tyr 145 150 155 160 His Phe Asp Gly Val Asp Trp Asp Gln
Ser Arg Gln Phe Gln Asn Arg 165 170 175 Ile Tyr Lys Phe Arg Gly Asp
Gly Lys Ala Trp Asp Trp Glu Val Asp 180 185 190 Ser Glu Asn Gly Asn
Tyr Asp Tyr Leu Met Tyr Ala Asp Val Asp Met 195 200 205 Asp His Pro
Glu Val Val Asn Glu Leu Arg Arg Trp Gly Glu Trp Tyr 210 215 220 Thr
Asn Thr Leu Asn Leu Asp Gly Phe Arg Ile Asp Ala Val Lys His 225 230
235 240 Ile Lys Tyr Ser Phe Thr Arg Asp Trp Leu Thr His Val Arg Asn
Ala 245 250 255 Thr Gly Lys Glu Met Phe Ala Val Ala Glu Phe Trp Lys
Asn Asp Leu 260 265 270 Gly Ala Leu Glu Asn Tyr Leu Asn Lys Thr Asn
Trp Asn His Ser Val 275 280 285 Phe Asp Val Pro Leu His Tyr Asn Leu
Tyr Asn Ala Ser Asn Ser Gly 290 295 300 Gly Asn Tyr Asp Met Ala Lys
Leu Leu Asn Gly Thr Val Val Gln Lys 305 310 315 320 His Pro Met His
Ala Val Thr Phe Val Asp Asn His Asp Ser Gln Pro 325 330 335 Gly Glu
Ser Leu Glu Ser Phe Val Gln Glu Trp Phe Lys Pro Leu Ala 340 345 350
Tyr Ala Leu Ile Leu Thr Arg Glu Gln Gly Tyr Pro Ser Val Phe Tyr 355
360 365 Gly Asp Tyr Tyr Gly Ile Pro Thr His Ser Val Pro Ala Met Lys
Ala 370 375 380 Lys Ile Asp Pro Ile Leu Glu Ala Arg Gln Asn Phe Ala
Tyr Gly Thr 385 390 395 400 Gln His Asp Tyr Phe Asp His His Asn Ile
Ile Gly Trp Thr Arg Glu 405 410 415 Gly Asn Thr Thr His Pro Asn Ser
Gly Leu Ala Thr Ile Met Ser Asp 420 425 430 Gly Pro Gly Gly Glu Lys
Trp Met Tyr Val Gly Gln Asn Lys Ala Gly 435 440 445 Gln Val Trp His
Asp Ile Thr Gly Asn Lys Pro Gly Thr Val Thr Ile 450 455 460 Asn Ala
Asp Gly Trp Ala Asn Phe Ser Val Asn Gly Gly Ser Val Ser 465 470 475
480 Ile Trp Val Lys Arg 485 39 485 PRT SP690 Bacillus sp.
mat_peptide (1)..(485) 39 His His Asn Gly Thr Asn Gly Thr Met Met
Gln Tyr Phe Glu Trp Tyr 1 5 10 15 Leu Pro Asn Asp Gly Asn His Trp
Asn Arg Leu Arg Asp Asp Ala Ala 20 25
30 Asn Leu Lys Ser Lys Gly Ile Thr Ala Val Trp Ile Pro Pro Ala Trp
35 40 45 Lys Gly Thr Ser Gln Asn Asp Val Gly Tyr Gly Ala Tyr Asp
Leu Tyr 50 55 60 Asp Leu Gly Glu Phe Asn Gln Lys Gly Thr Val Arg
Thr Lys Tyr Gly 65 70 75 80 Thr Arg Asn Gln Leu Gln Ala Ala Val Thr
Ser Leu Lys Asn Asn Gly 85 90 95 Ile Gln Val Tyr Gly Asp Val Val
Met Asn His Lys Gly Gly Ala Asp 100 105 110 Gly Thr Glu Ile Val Asn
Ala Val Glu Val Asn Arg Ser Asn Arg Asn 115 120 125 Gln Glu Thr Ser
Gly Glu Tyr Ala Ile Glu Ala Trp Thr Lys Phe Asp 130 135 140 Phe Pro
Gly Arg Gly Asn Asn His Ser Ser Phe Lys Trp Arg Trp Tyr 145 150 155
160 His Phe Asp Gly Thr Asp Trp Asp Gln Ser Arg Gln Leu Gln Asn Lys
165 170 175 Ile Tyr Lys Phe Arg Gly Thr Gly Lys Ala Trp Asp Trp Glu
Val Asp 180 185 190 Thr Glu Asn Gly Asn Tyr Asp Tyr Leu Met Tyr Ala
Asp Val Asp Met 195 200 205 Asp His Pro Glu Val Ile His Glu Leu Arg
Asn Trp Gly Val Trp Tyr 210 215 220 Thr Asn Thr Leu Asn Leu Asp Gly
Phe Arg Ile Asp Ala Val Lys His 225 230 235 240 Ile Lys Tyr Ser Phe
Thr Arg Asp Trp Leu Thr His Val Arg Asn Thr 245 250 255 Thr Gly Lys
Pro Met Phe Ala Val Ala Glu Phe Trp Lys Asn Asp Leu 260 265 270 Gly
Ala Ile Glu Asn Tyr Leu Asn Lys Thr Ser Trp Asn His Ser Ala 275 280
285 Phe Asp Val Pro Leu His Tyr Asn Leu Tyr Asn Ala Ser Asn Ser Gly
290 295 300 Gly Tyr Tyr Asp Met Arg Asn Ile Leu Asn Gly Ser Val Val
Gln Lys 305 310 315 320 His Pro Thr His Ala Val Thr Phe Val Asp Asn
His Asp Ser Gln Pro 325 330 335 Gly Glu Ala Leu Glu Ser Phe Val Gln
Gln Trp Phe Lys Pro Leu Ala 340 345 350 Tyr Ala Leu Val Leu Thr Arg
Glu Gln Gly Tyr Pro Ser Val Phe Tyr 355 360 365 Gly Asp Tyr Tyr Gly
Ile Pro Thr His Gly Val Pro Ala Met Lys Ser 370 375 380 Lys Ile Asp
Pro Leu Leu Gln Ala Arg Gln Thr Phe Ala Tyr Gly Thr 385 390 395 400
Gln His Asp Tyr Phe Asp His His Asp Ile Ile Gly Trp Thr Arg Glu 405
410 415 Gly Asn Ser Ser His Pro Asn Ser Gly Leu Ala Thr Ile Met Ser
Asp 420 425 430 Gly Pro Gly Gly Asn Lys Trp Met Tyr Val Gly Lys Asn
Lys Ala Gly 435 440 445 Gln Val Trp Arg Asp Ile Thr Gly Asn Arg Thr
Gly Thr Val Thr Ile 450 455 460 Asn Ala Asp Gly Trp Gly Asn Phe Ser
Val Asn Gly Gly Ser Val Ser 465 470 475 480 Val Trp Val Lys Gln 485
40 485 PRT AA560 Bacillus sp. mat_peptide (1)..(485) 40 His His Asn
Gly Thr Asn Gly Thr Met Met Gln Tyr Phe Glu Trp Tyr 1 5 10 15 Leu
Pro Asn Asp Gly Asn His Trp Asn Arg Leu Arg Ser Asp Ala Ser 20 25
30 Asn Leu Lys Asp Lys Gly Ile Ser Ala Val Trp Ile Pro Pro Ala Trp
35 40 45 Lys Gly Ala Ser Gln Asn Asp Val Gly Tyr Gly Ala Tyr Asp
Leu Tyr 50 55 60 Asp Leu Gly Glu Phe Asn Gln Lys Gly Thr Ile Arg
Thr Lys Tyr Gly 65 70 75 80 Thr Arg Asn Gln Leu Gln Ala Ala Val Asn
Ala Leu Lys Ser Asn Gly 85 90 95 Ile Gln Val Tyr Gly Asp Val Val
Met Asn His Lys Gly Gly Ala Asp 100 105 110 Ala Thr Glu Met Val Arg
Ala Val Glu Val Asn Pro Asn Asn Arg Asn 115 120 125 Gln Glu Val Ser
Gly Glu Tyr Thr Ile Glu Ala Trp Thr Lys Phe Asp 130 135 140 Phe Pro
Gly Arg Gly Asn Thr His Ser Asn Phe Lys Trp Arg Trp Tyr 145 150 155
160 His Phe Asp Gly Val Asp Trp Asp Gln Ser Arg Lys Leu Asn Asn Arg
165 170 175 Ile Tyr Lys Phe Arg Gly Asp Gly Lys Gly Trp Asp Trp Glu
Val Asp 180 185 190 Thr Glu Asn Gly Asn Tyr Asp Tyr Leu Met Tyr Ala
Asp Ile Asp Met 195 200 205 Asp His Pro Glu Val Val Asn Glu Leu Arg
Asn Trp Gly Val Trp Tyr 210 215 220 Thr Asn Thr Leu Gly Leu Asp Gly
Phe Arg Ile Asp Ala Val Lys His 225 230 235 240 Ile Lys Tyr Ser Phe
Thr Arg Asp Trp Ile Asn His Val Arg Ser Ala 245 250 255 Thr Gly Lys
Asn Met Phe Ala Val Ala Glu Phe Trp Lys Asn Asp Leu 260 265 270 Gly
Ala Ile Glu Asn Tyr Leu Asn Lys Thr Asn Trp Asn His Ser Val 275 280
285 Phe Asp Val Pro Leu His Tyr Asn Leu Tyr Asn Ala Ser Lys Ser Gly
290 295 300 Gly Asn Tyr Asp Met Arg Gln Ile Phe Asn Gly Thr Val Val
Gln Arg 305 310 315 320 His Pro Met His Ala Val Thr Phe Val Asp Asn
His Asp Ser Gln Pro 325 330 335 Glu Glu Ala Leu Glu Ser Phe Val Glu
Glu Trp Phe Lys Pro Leu Ala 340 345 350 Tyr Ala Leu Thr Leu Thr Arg
Glu Gln Gly Tyr Pro Ser Val Phe Tyr 355 360 365 Gly Asp Tyr Tyr Gly
Ile Pro Thr His Gly Val Pro Ala Met Lys Ser 370 375 380 Lys Ile Asp
Pro Ile Leu Glu Ala Arg Gln Lys Tyr Ala Tyr Gly Arg 385 390 395 400
Gln Asn Asp Tyr Leu Asp His His Asn Ile Ile Gly Trp Thr Arg Glu 405
410 415 Gly Asn Thr Ala His Pro Asn Ser Gly Leu Ala Thr Ile Met Ser
Asp 420 425 430 Gly Ala Gly Gly Asn Lys Trp Met Phe Val Gly Arg Asn
Lys Ala Gly 435 440 445 Gln Val Trp Thr Asp Ile Thr Gly Asn Arg Ala
Gly Thr Val Thr Ile 450 455 460 Asn Ala Asp Gly Trp Gly Asn Phe Ser
Val Asn Gly Gly Ser Val Ser 465 470 475 480 Ile Trp Val Asn Lys 485
41 481 PRT LE429 B. lich. variant mat_peptide (1)..(481) 41 Val Asn
Gly Thr Leu Met Gln Tyr Phe Glu Trp Tyr Thr Pro Asn Asp 1 5 10 15
Gly Gln His Trp Lys Arg Leu Gln Asn Asp Ala Glu His Leu Ser Asp 20
25 30 Ile Gly Ile Thr Ala Val Trp Ile Pro Pro Ala Tyr Lys Gly Thr
Ser 35 40 45 Gln Ala Asp Val Gly Tyr Gly Ala Tyr Asp Leu Tyr Asp
Leu Gly Glu 50 55 60 Phe His Gln Lys Gly Thr Val Arg Thr Lys Tyr
Gly Thr Lys Gly Glu 65 70 75 80 Leu Gln Ser Ala Ile Lys Ser Leu His
Ser Arg Asp Ile Asn Val Tyr 85 90 95 Gly Asp Val Val Ile Asn His
Lys Gly Gly Ala Asp Ala Thr Glu Asp 100 105 110 Val Thr Ala Val Glu
Val Asp Pro Ala Asp Arg Asn Arg Val Ile Ser 115 120 125 Gly Glu His
Leu Ile Lys Ala Trp Thr His Phe His Phe Pro Gly Arg 130 135 140 Gly
Ser Thr Tyr Ser Asp Phe Lys Trp Tyr Trp Tyr His Phe Asp Gly 145 150
155 160 Thr Asp Trp Asp Glu Ser Arg Lys Leu Asn Arg Ile Tyr Lys Phe
Gln 165 170 175 Gly Lys Thr Trp Asp Trp Glu Val Ser Asn Glu Phe Gly
Asn Tyr Asp 180 185 190 Tyr Leu Met Tyr Ala Asp Phe Asp Tyr Asp His
Pro Asp Val Val Ala 195 200 205 Glu Ile Lys Arg Trp Gly Thr Trp Tyr
Ala Asn Glu Leu Gln Leu Asp 210 215 220 Gly Phe Arg Leu Asp Ala Val
Lys His Ile Lys Phe Ser Phe Leu Arg 225 230 235 240 Asp Trp Val Asn
His Val Arg Glu Lys Thr Gly Lys Glu Met Phe Thr 245 250 255 Val Ala
Glu Tyr Trp Ser Asn Asp Leu Gly Ala Leu Glu Asn Tyr Leu 260 265 270
Asn Lys Thr Asn Phe Asn His Ser Val Phe Asp Val Pro Leu His Tyr 275
280 285 Gln Phe His Ala Ala Ser Thr Gln Gly Gly Gly Tyr Asp Met Arg
Lys 290 295 300 Leu Leu Asn Gly Thr Val Val Ser Lys His Pro Leu Lys
Ser Val Thr 305 310 315 320 Phe Val Asp Asn His Asp Thr Gln Pro Gly
Gln Ser Leu Glu Ser Thr 325 330 335 Val Gln Thr Trp Phe Lys Pro Leu
Ala Tyr Ala Phe Ile Leu Thr Arg 340 345 350 Glu Ser Gly Tyr Pro Gln
Val Phe Tyr Gly Asp Met Tyr Gly Thr Lys 355 360 365 Gly Asp Ser Gln
Arg Glu Ile Pro Ala Leu Lys His Lys Ile Glu Pro 370 375 380 Ile Leu
Lys Ala Arg Lys Gln Tyr Ala Tyr Gly Ala Gln His Asp Tyr 385 390 395
400 Phe Asp His His Asp Ile Val Gly Trp Thr Arg Glu Gly Asp Ser Ser
405 410 415 Val Ala Asn Ser Gly Leu Ala Ala Leu Ile Thr Asp Gly Pro
Gly Gly 420 425 430 Ala Lys Arg Met Tyr Val Gly Arg Gln Asn Ala Gly
Glu Thr Trp His 435 440 445 Asp Ile Thr Gly Asn Arg Ser Glu Pro Val
Val Ile Asn Ser Glu Gly 450 455 460 Trp Gly Glu Phe His Val Asn Gly
Gly Ser Val Ser Ile Tyr Val Gln 465 470 475 480 Arg 42 417 PRT
Pseudomonas saccharophila mat_peptide (1)..(417) 42 Asp Gln Ala Gly
Lys Ser Pro Ala Gly Val Arg Tyr His Gly Gly Asp 1 5 10 15 Glu Ile
Ile Leu Gln Gly Phe His Trp Asn Val Val Arg Glu Ala Pro 20 25 30
Asn Asp Trp Tyr Asn Ile Leu Arg Gln Gln Ala Ser Thr Ile Ala Ala 35
40 45 Asp Gly Phe Ser Ala Ile Trp Met Pro Val Pro Trp Arg Asp Phe
Ser 50 55 60 Ser Trp Thr Asp Gly Gly Lys Ser Gly Gly Gly Glu Gly
Tyr Phe Trp 65 70 75 80 His Asp Phe Asn Lys Asn Gly Arg Tyr Gly Ser
Asp Ala Gln Leu Arg 85 90 95 Gln Ala Ala Gly Ala Leu Gly Gly Ala
Gly Val Lys Val Leu Tyr Asp 100 105 110 Val Val Pro Asn His Met Asn
Arg Gly Tyr Pro Asp Lys Glu Ile Asn 115 120 125 Leu Pro Ala Gly Gln
Gly Phe Trp Arg Asn Asp Cys Ala Asp Pro Gly 130 135 140 Asn Tyr Pro
Asn Asp Cys Asp Asp Gly Asp Arg Phe Ile Gly Gly Glu 145 150 155 160
Ser Asp Leu Asn Thr Gly His Pro Gln Ile Tyr Gly Met Phe Arg Asp 165
170 175 Glu Leu Ala Asn Leu Arg Ser Gly Tyr Gly Ala Gly Gly Phe Arg
Phe 180 185 190 Asp Phe Val Arg Gly Tyr Ala Pro Glu Arg Val Asp Ser
Trp Met Ser 195 200 205 Asp Ser Ala Asp Ser Ser Phe Cys Val Gly Glu
Leu Trp Lys Gly Pro 210 215 220 Ser Glu Tyr Pro Ser Trp Asp Trp Arg
Asn Thr Ala Ser Trp Gln Gln 225 230 235 240 Ile Ile Lys Asp Trp Ser
Asp Arg Ala Lys Cys Pro Val Phe Asp Phe 245 250 255 Ala Leu Lys Glu
Arg Met Gln Asn Gly Ser Val Ala Asp Trp Lys His 260 265 270 Gly Leu
Asn Gly Asn Pro Asp Pro Arg Trp Arg Glu Val Ala Val Thr 275 280 285
Phe Val Asp Asn His Asp Thr Gly Tyr Ser Pro Gly Gln Asn Gly Gly 290
295 300 Gln His His Trp Ala Leu Gln Asp Gly Leu Ile Arg Gln Ala Tyr
Ala 305 310 315 320 Tyr Ile Leu Thr Ser Pro Gly Thr Pro Val Val Tyr
Trp Ser His Met 325 330 335 Tyr Asp Trp Gly Tyr Gly Asp Phe Ile Arg
Gln Leu Ile Gln Val Arg 340 345 350 Arg Thr Ala Gly Val Arg Ala Asp
Ser Ala Ile Ser Phe His Ser Gly 355 360 365 Tyr Ser Gly Leu Val Ala
Thr Val Ser Gly Ser Gln Gln Thr Leu Val 370 375 380 Val Ala Leu Asn
Ser Asp Leu Ala Asn Pro Gly Gln Val Ala Ser Gly 385 390 395 400 Ser
Phe Ser Glu Ala Val Asn Ala Ser Asn Gly Gln Val Arg Val Trp 405 410
415 Arg
* * * * *
References