U.S. patent application number 13/080073 was filed with the patent office on 2012-02-02 for novel fungal enzymes.
This patent application is currently assigned to DYADIC INTERNATIONAL, INC.. Invention is credited to Mark Gosink, Alexander Vasilievich Gusakov, Sandra Wilhelmina Agnes Hinz, Zhijie Jiang, Peter J. Punt, Arkady Panteleimonovich Sinitsyn, Jacobs Van der Meij, Jan Cornelis Verdoes, Elena Vlasenko.
Application Number | 20120030838 13/080073 |
Document ID | / |
Family ID | 40305293 |
Filed Date | 2012-02-02 |
United States Patent
Application |
20120030838 |
Kind Code |
A1 |
Gusakov; Alexander Vasilievich ;
et al. |
February 2, 2012 |
NOVEL FUNGAL ENZYMES
Abstract
This invention relates to novel enzymes and novel methods for
producing the same. More specifically this invention relates to a
variety of fungal enzymes. Nucleic acid molecules encoding such
enzymes, compositions, recombinant and genetically modified host
cells, and methods of use are described. The invention also relates
to a method to convert lignocellulosic biomass to fermentable
sugars with enzymes that degrade the lignocellulosic material and
novel combinations of enzymes, including those that provide a
synergistic release of sugars from plant biomass. The invention
also relates to methods to use the novel enzymes and compositions
of such enzymes in a variety of other processes, including washing
of clothing, detergent processes, deinking and biobleaching of
paper and pulp, and treatment of waste streams.
Inventors: |
Gusakov; Alexander Vasilievich;
(Moscow, RU) ; Punt; Peter J.; (Houten, NL)
; Verdoes; Jan Cornelis; (Bennekom, NL) ;
Sinitsyn; Arkady Panteleimonovich; (Moscow, RU) ;
Vlasenko; Elena; (Davis, CA) ; Hinz; Sandra
Wilhelmina Agnes; (Wageningen, NL) ; Gosink;
Mark; (Wellington, FL) ; Jiang; Zhijie; (West
Palm Beach, FL) ; Van der Meij; Jacobs; (Bennekom,
NL) |
Assignee: |
DYADIC INTERNATIONAL, INC.
Jupiter
FL
|
Family ID: |
40305293 |
Appl. No.: |
13/080073 |
Filed: |
April 5, 2011 |
Related U.S. Patent Documents
|
|
|
|
|
|
Application
Number |
Filing Date |
Patent Number |
|
|
11833133 |
Aug 2, 2007 |
7923236 |
|
|
13080073 |
|
|
|
|
Current U.S.
Class: |
800/295 ;
424/94.1; 424/94.6; 424/94.61; 426/2; 426/442; 426/624; 435/100;
435/105; 435/106; 435/136; 435/148; 435/155; 435/162; 435/197;
435/200; 435/205; 435/209; 435/232; 435/252.3; 435/254.11;
435/254.3; 435/254.5; 435/254.6; 435/254.7; 435/254.8; 435/263;
435/267; 435/320.1; 435/41; 435/72; 510/392; 530/387.9; 536/23.2;
536/24.3; 8/401 |
Current CPC
Class: |
C12Y 302/01003 20130101;
C12N 9/2442 20130101; C11D 3/38645 20130101; C12Y 302/01032
20130101; C12P 7/10 20130101; C12Y 302/01014 20130101; C12N 9/2477
20130101; C12Y 302/01004 20130101; A61P 11/06 20180101; A61P 29/00
20180101; C12N 9/248 20130101; C12N 9/2445 20130101; A61P 37/08
20180101; C12N 9/2437 20130101; Y02E 50/17 20130101; C12P 19/14
20130101; C12Y 302/01021 20130101; Y02E 50/16 20130101; C12Y
302/01091 20130101; Y02E 50/10 20130101 |
Class at
Publication: |
800/295 ;
536/23.2; 435/320.1; 435/254.11; 435/254.6; 435/254.3; 435/254.7;
435/254.8; 435/254.5; 435/252.3; 536/24.3; 435/209; 435/200;
435/205; 435/232; 435/197; 530/387.9; 435/72; 435/41; 435/106;
435/148; 435/136; 435/155; 435/162; 435/105; 435/100; 435/263;
424/94.61; 424/94.6; 424/94.1; 435/267; 510/392; 8/401; 426/624;
426/2; 426/442 |
International
Class: |
A01H 5/00 20060101
A01H005/00; C12N 15/63 20060101 C12N015/63; C12N 1/15 20060101
C12N001/15; C12N 1/21 20060101 C12N001/21; C12N 15/56 20060101
C12N015/56; C12N 15/60 20060101 C12N015/60; C12N 15/55 20060101
C12N015/55; C12N 9/42 20060101 C12N009/42; C12N 9/24 20060101
C12N009/24; C12N 9/34 20060101 C12N009/34; C12N 9/88 20060101
C12N009/88; C12N 9/18 20060101 C12N009/18; C07K 16/40 20060101
C07K016/40; C12P 19/00 20060101 C12P019/00; C12P 1/00 20060101
C12P001/00; C12P 13/04 20060101 C12P013/04; C12P 7/26 20060101
C12P007/26; C12P 7/40 20060101 C12P007/40; C12P 7/02 20060101
C12P007/02; C12P 7/14 20060101 C12P007/14; C12P 19/02 20060101
C12P019/02; C12P 19/12 20060101 C12P019/12; C12S 11/00 20060101
C12S011/00; A61K 38/47 20060101 A61K038/47; A61K 38/46 20060101
A61K038/46; A61K 38/51 20060101 A61K038/51; A01N 63/04 20060101
A01N063/04; A61P 29/00 20060101 A61P029/00; A61P 11/06 20060101
A61P011/06; A01P 3/00 20060101 A01P003/00; A01P 7/04 20060101
A01P007/04; C12S 3/00 20060101 C12S003/00; C11D 3/386 20060101
C11D003/386; C09B 67/42 20060101 C09B067/42; A23K 1/06 20060101
A23K001/06; A23K 1/18 20060101 A23K001/18; C07H 21/04 20060101
C07H021/04 |
Claims
1. An isolated nucleic acid molecule comprising a nucleic acid
sequence selected from the group consisting of: a) a nucleic acid
sequence encoding a protein comprising an amino acid sequence
selected from the group consisting of: SEQ ID NO:11, SEQ ID NO:23,
SEQ ID NO:29, SEQ ID NO:32, SEQ ID NO:35, SEQ ID NO:38, SEQ ID
NO:41, SEQ ID NO:44, SEQ ID NO:50, SEQ ID NO:53, SEQ ID NO:56, SEQ
ID NO:59, SEQ ID NO:62, SEQ ID NO:65, SEQ ID NO:68, SEQ ID NO:71,
SEQ ID NO:74, SEQ ID NO:77, SEQ ID NO:80, SEQ ID NO:83, SEQ ID
NO:86, SEQ ID NO:89, SEQ ID NO:92, and SEQ ID NO:94. b) a nucleic
acid sequence encoding a fragment of the protein of (a), wherein
the fragment has a biological activity of the protein of (a); and
c) a nucleic acid sequence encoding an amino acid sequence that is
at least 70% identical to an amino acid sequence of (a) and has a
biological activity of the protein comprising the amino acid
sequence.
2. The isolated nucleic acid molecule of claim 1, wherein said
nucleic acid sequence encodes an amino acid sequence that is at
least 90% identical to the amino acid sequence of (a) and has a
biological activity of the protein comprising the amino acid
sequence.
3. The isolated nucleic acid molecule of claim 1, wherein said
nucleic acid sequence encodes an amino acid sequence that is at
least 95% identical to the amino acid sequence of (a) and has a
biological activity of the protein comprising the amino acid
sequence.
4. The isolated nucleic acid molecule of claim 0, wherein said
nucleic acid sequence encodes an amino acid sequence that is at
least 97% identical to the amino acid sequence of (a) and has a
biological activity of the protein comprising the amino acid
sequence.
5. The isolated nucleic acid molecule of claim 1, wherein said
nucleic acid sequence encodes an amino acid sequence that is at
least 99% identical to the amino acid sequence of (a) and has a
biological activity of the protein comprising the amino acid
sequence.
6. The isolated nucleic acid molecule of claim 1, wherein said
nucleic acid sequence encodes a protein comprising an amino acid
sequence selected from the group consisting of: SEQ ID NO:11, SEQ
ID NO:23, SEQ ID NO:29, SEQ ID NO:32, SEQ ID NO:35, SEQ ID NO:38,
SEQ ID NO:41, SEQ ID NO:44, SEQ ID NO:50, SEQ ID NO:53, SEQ ID
NO:56, SEQ ID NO:59, SEQ ID NO:62, SEQ ID NO:65, SEQ ID NO:68, SEQ
ID NO:71, SEQ ID NO:74, SEQ ID NO:77, SEQ ID NO:80, SEQ ID NO:83,
SEQ ID NO:86, SEQ ID NO:89, SEQ ID NO:92, and SEQ ID NO:94.
7. The isolated nucleic acid molecule of claim 1, wherein said
nucleic acid sequence consists of a nucleic acid sequence selected
from the group consisting of: SEQ ID NO:1, SEQ ID NO:3, SEQ ID
NO:4, SEQ ID NO:6, SEQ ID NO:10, SEQ ID NO:12, SEQ ID NO:22, SEQ ID
NO:24, SEQ ID NO:28, SEQ ID NO:30, SEQ ID NO:31, SEQ ID NO:33, SEQ
ID NO:34, SEQ ID NO:36, SEQ ID NO:37, SEQ ID NO:39, SEQ ID NO:40,
SEQ ID NO:41. SEQ ID NO:42, SEQ ID NO:43, SEQ ID NO:45, SEQ ID
NO:46, SEQ ID NO:48, SEQ ID NO:49, SEQ ID NO:51, SEQ ID NO:52, SEQ
ID NO:54, SEQ ID NO:55, SEQ ID NO:57, SEQ ID NO:58, SEQ ID NO:60,
SEQ ID NO:61, SEQ ID NO:63, SEQ ID NO:64, SEQ ID NO:66, SEQ ID
NO:67, SEQ ID NO:69, SEQ ID NO:70. SEQ ID NO:72, SEQ ID NO:73, SEQ
ID NO:75, SEQ ID NO:76, SEQ ID NO:78, SEQ ID NO:79, SEQ ID NO:90,
SEQ ID NO:91 and SEQ ID NO:93.
8. The isolated nucleic acid molecule of claim 1, wherein said
nucleic acid sequence of (a) encodes a protein comprising the amino
acid sequence of SEQ ID NO:11, wherein the protein has
cellobiohydrolase activity.
9. The isolated nucleic acid molecule of claim 1, wherein said
nucleic acid sequence of (a) encodes a protein comprising an amino
acid sequence selected from the group consisting of: SEQ ID NO:94
and SEQ ID NO:23, wherein the protein has endoglucanase
activity.
10. The isolated nucleic acid molecule of claim 1, wherein said
nucleic acid sequence of (a) encodes a protein comprising an amino
acid sequence selected from the group consisting of: SEQ ID NO 29,
SEQ ID NO:32, SEQ ID NO:35, SEQ ID NO:38 and SEQ ID NO:41, wherein
the protein has xylanase activity.
11. The isolated nucleic acid molecule of claim 1, wherein said
nucleic acid sequence of (a) encodes a protein comprising an amino
acid sequence of SEQ ID NO:44, wherein the protein has
.beta.-glucosidase activity.
12. The isolated nucleic acid molecule of claim 1, wherein said
nucleic acid sequence of (a) encodes a protein comprising an amino
acid sequence selected from the group consisting of: SEQ ID NO:50,
SEQ ID NO:53, SEQ ID NO:62, SEQ ID NO:65, SEQ ID NO:68, SEQ ID
NO:71, SEQ ID NO:74, SEQ ID NO:77, SEQ ID NO:80, SEQ ID NO:83, SEQ
ID NO:86, SEQ ID NO:89, and SEQ ID NO:92, wherein the protein has
hemicellulase activity.
13. The isolated nucleic acid molecule of claim 1, wherein said
nucleic acid sequence of (a) encodes a protein comprising the amino
acid sequence of SEQ ID NO:50, wherein the protein has glucoamylase
activity.
14. The isolated nucleic acid molecule of claim 1, wherein said
nucleic acid sequence of (a) encodes a protein comprising the amino
acid sequence of SEQ ID NO:53, wherein the protein has pectate
lyase activity.
15. The isolated nucleic acid molecule of claim 1, wherein said
nucleic acid sequence of (a) encodes a protein comprising an amino
acid sequence selected from the group consisting of: SEQ ID NO:56
and SEQ ID NO:59, wherein the protein has chitinase activity.
16. The isolated nucleic acid molecule of claim 1, wherein said
nucleic acid sequence of (a) encodes a protein comprising an amino
acid sequence selected from the group consisting of: SEQ ID NO:62,
SEQ ID NO:80, and SEQ ID NO:83, wherein the protein has acetylxylan
esterase activity.
17. The isolated nucleic acid molecule of claim 1, wherein said
nucleic acid sequence of (a) encodes a protein comprising an amino
acid sequence selected from the group consisting of: SEQ ID NO:65,
SEQ ID NO:68, and SEQ ID NO:71, wherein the protein has ferulic
acid esterase activity.
18. The isolated nucleic acid molecule of claim 1, wherein said
nucleic acid sequence of (a) encodes a protein comprising an amino
acid sequence selected from the group consisting of: SEQ ID NO:74
and SEQ ID NO:77, wherein the protein has arabinofuranosidase
activity.
19. The isolated nucleic acid molecule of claim 1, wherein said
nucleic acid sequence of (a) encodes a protein comprising the amino
acid sequence of SEQ ID NO:86, wherein the protein has pectin
methyl esterase activity.
20. The isolated nucleic acid molecule of claim 1, wherein said
nucleic acid sequence of (a) encodes a protein comprising the amino
acid sequence of SEQ ID NO:89, wherein the protein has
endo-arabinase activity.
21. The isolated nucleic acid molecule of claim 1, wherein said
nucleic acid sequence of (a) encodes a protein comprising the amino
acid sequence of SEQ ID NO:92, wherein the protein has
.beta.-xylosidase or .beta.-glucosidase activity.
22. An isolated nucleic acid molecule comprising a nucleic acid
sequence encoding a protein comprising an amino acid sequence
selected from the group consisting of: SEQ ID NO:2 and SEQ ID
NO:5.
23. An isolated nucleic acid molecule comprising a nucleic acid
sequence that is fully complementary to the nucleic acid sequence
of the nucleic acid molecule of any one of claims 1 to 22.
24. An isolated protein comprising an amino acid sequence encoded
by the nucleic acid molecule of any one of claims 1 to 22.
25. An isolated protein comprising an amino acid sequence encoded
by the nucleic acid molecule of claim 15.
26. An isolated fusion protein comprising the isolated protein of
claim 24 fused to a protein comprising an amino acid sequence that
is heterologous to the isolated protein of claim 24.
27. An isolated antibody or antigen binding fragment thereof that
selectively binds to the protein of claim 24.
28. A kit for degrading a lignocellulosic material to fermentable
sugars comprising at least one isolated protein of claim 24.
29. A detergent comprising at least one isolated protein of claim
24.
30. A composition for the degradation of a lignocellulosic material
comprising at least one isolated protein of claim 24.
31. A recombinant nucleic acid molecule comprising the isolated
nucleic acid molecule of any one of claims 1 to 22, operatively
linked to at least one expression control sequence.
32. The recombinant nucleic acid molecule of claim 31, wherein the
recombinant nucleic acid molecule comprises an expression
vector.
33. The recombinant nucleic acid molecule of claim 31, wherein the
recombinant nucleic acid molecule comprises a targeting vector.
34. An isolated host cell transfected with the nucleic acid
molecule of any one of claims 1 to 22.
35. The isolated host cell of claim 34, wherein the host cell is a
fungus.
36. The isolated host cell of claim 34, wherein the host cell is a
filamentous fungus.
37. The isolated host cell of claim 36, wherein the filamentous
fungus is from a genus selected from the group consisting of:
Chrysosporium, Thielavia, Neurospora, Aureobasidium, Filibasidium,
Piromyces, Corynascus, Crvplococcus, Acremonium, Tolypocladium,
Scytalidium, Schizophyllum, Sporotrichum, Penicillium, Gibberella,
Myceliophthora, Mucor, Aspergillus, Fusarium, Humicola, and
Trichoderma, and anamorphs and teleomorphs thereof.
38. The isolated host cell of claim 34, wherein the host cell is a
bacterium.
39. An oligonucleotide consisting essentially of at least 12
consecutive nucleotides of a nucleic acid sequence selected from
the group consisting of: SEQ ID NOM, SEQ ID NO:3, SEQ ID NO:4, SEQ
ID NO:6, SEQ ID NO:10, SEQ ID NO:12, SEQ ID NO:22, SEQ ID NO:24,
SEQ ID NO:28, SEQ ID NO:30, SEQ ID NO:31, SEQ ID NO:33, SEQ ID
NO:34, SEQ ID NO:36, SEQ ID NO:37, SEQ ID NO:39, SEQ ID NO:40, SEQ
ID NO:41, SEQ ID NO:42, SEQ ID NO:43, SEQ ID NO:45, SEQ ID NO:46,
SEQ ID NO:48, SEQ ID NO:49, SEQ ID NO:51, SEQ ID NO:52, SEQ ID
NO:54, SEQ ID NO:55. SEQ ID NO:57, SEQ ID NO:58, SEQ ID NO:60, SEQ
ID NO:61, SEQ ID NO:63, SEQ ID NO:64, SEQ ID NO:66, SEQ ID NO:67,
SEQ ID NO:69, SEQ ID NO:70, SEQ ID NO:72. SEQ ID NO:73, SEQ ID
NO:75, SEQ ID NO:76, SEQ ID NO:78, SEQ ID NO:79, SEQ ID NO:90, SEQ
ID NO:91, and SEQ ID NO:93, or the complement thereof.
40. A kit comprising at least one oligonucleotide of claim 39.
41. A method for producing the protein of claim 24, comprising
culturing a cell that has been transfected with a nucleic acid
molecule comprising a nucleic acid sequence encoding the protein,
and expressing the protein with the transfected cell.
42. The method of claim 41, further comprising recovering the
protein from the cell or from a culture comprising the cell.
43. A genetically modified organism comprising components suitable
for degrading a lignocellulosic material to fermentable sugars,
wherein the organism has been genetically modified to express at
least one protein of claim 24.
44. The genetically modified organism of claim 43, wherein the
genetically modified organism is a microorganism.
45. The genetically modified organism of claim 44, wherein the
microorganism is a filamentous fungus.
46. The genetically modified organism of claim 45, wherein the
filamentous fungus is from a genus selected from the group
consisting of: Chrysosporium, Thielavia, Neurospora, Aureobasidium,
Filibasidium, Piromyces, Corynascus, Cryplococcus, Acremonium,
Tolypocladium, Scytalidium, Schizophyllum, Sporotrichum,
Penicillium, Talaromyces, Gibberella, Myceliophthora, Mucor,
Aspergillus, Fusarium, Humicola, and Trichoderma.
47. The genetically modified organism of claim 45, wherein the
filamentous fungus is selected from the group consisting of:
Trichoderma reesei, Chrysosporium lucknowense, Aspergillus
japonicus, Penicillium canescens, Penicillium solitum, Penicillium
funiculosum, and Talaromyces flavus.
48. The genetically modified organism of claim 43, wherein the
organism has been genetically modified to express at least one
additional enzyme.
49. The genetically modified organism of claim 48, wherein the
additional enzyme is an accessory enzyme selected from the group
consisting of: cellulase, xylanase, ligninase, glucuronidase,
arabinofuranosidase, arabinogalactanase, ferulic acid esterase,
lipase, pectinase, glucomannase, amylase, laminarinase,
xyloglucanase, galactanase, glucoamylase, pectate lyase, chitinase,
exo-.cndot.-D-glucosaminidase, cellobiose dehydrogenase, and
acetylxylan esterase.
50. The genetically modified organism of claim 43, wherein the
genetically modified organism is a plant.
51. A recombinant enzyme isolated from the genetically modified
microorganism of any one of claims 43 to 49.
52. The recombinant enzyme of claim 51, wherein the enzyme has been
subjected to a purification step.
53. A crude fermentation product produced by culturing the
comprising the genetically modified microorganism of any one of
claims 43 to 49, wherein the crude fermentation product contains
the at least one protein of claim 24.
54. A multi-enzyme composition comprising enzymes produced by the
genetically modified organism of any one of claims 43 to 50, and
recovered therefrom.
55. A multi-enzyme composition comprising at least one protein of
claim 24, and at least one additional protein for degrading a
lignocellulosic material or a fragment thereof that has biological
activity.
56. The multi-enzyme composition of claim 55, wherein the at least
one additional protein for degrading a lignocellulosic material to
fermentable sugars is selected from the group consisting of: SEQ ID
NO:8, SEQ ID NO:14, SEQ ID NO:17, SEQ ID NO:20, SEQ ID NO:26 and
SEQ ID NO:47, or a fragment thereof that has biological
activity.
57. The multi-enzyme composition of claim 55, wherein the
composition comprises at least one cellobiohydrolase, at least one
xylanase, at least one endoglucanase, at least one
.cndot.-glucosidase, at least one .cndot.-xylosidase, and at least
one accessory enzyme.
58. The multi-enzyme composition of claim 55, wherein between about
50% and about 70% of the enzymes in the composition are
cellobiohydrolases.
59. The multi-enzyme composition of claim 55, wherein between about
10% and about 30% of the enzymes in the composition are
xylanases.
60. The multi-enzyme composition of claim 55, wherein between about
5% and about 15% of the enzymes in the composition are
endoglucanases.
61. The multi-enzyme composition of claim 55, wherein between about
1% and about 5% of the enzymes in the composition are
.cndot.-glucosidases.
62. The multi-enzyme composition of claim 55, wherein between about
1% and about 3% of the enzymes in the composition are
.cndot.-xylosidases.
63. The multi-enzyme composition of claim 55, wherein the
composition comprises about 60% cellobiohydrolases, about 20%
xylanases, about 10% endoglucanases, about 3% .cndot.-glucosidases,
about 2% .cndot.-xylosidases, and about 5% accessory enzymes.
64. The multi-enzyme composition of claim 57 or claim 63, wherein
the xylanases are selected from the group consisting of:
endoxylanases, exoxylanases, and .cndot.-xylosidases.
65. The multi-enzyme composition of claim 57 or claim 63, wherein
the accessory enzymes include an enzyme selected from the group
consisting of: ligninase, glucuronidase, arabinofuranosidase,
arabinogalactanase, furilic acid esterase, lipase, pectinase,
glucomannase, amylase, laminarinase, xyloglucanase, galactanase,
glucoamylase, pectate lyase, chitinase,
exo-.cndot.-D-glucosaminidase, cellobiose dehydrogenase, and
acetylxylan esterase.
66. The multi-enzyme composition of claim 55, wherein the
composition comprises at least one protein encoded by the nucleic
acid molecule of claim 8, at least one protein encoded by the
nucleic acid molecule of claim 9, at least one protein encoded by
the nucleic acid molecule of claim 10 at least one protein encoded
by the nucleic acid molecule of claim 11, and at least one protein
encoded by the nucleic acid molecule of claim 12.
67. The multi-enzyme composition of claim 55, wherein the
composition comprises: a) at least one first protein comprising an
amino acid sequence that is at least 95% identical to an amino acid
sequence selected from the group consisting of: SEQ ID NO:2, SEQ ID
NO:5, SEQ ID NO:8 and SEQ ID NO:11 and has cellobiohydrolase
activity, or a fragment thereof that has cellobiohydrolase
activity; b) at least one second protein comprising an amino acid
sequence that is at least 95% identical to an amino acid sequence
selected from the group consisting of: SEQ ID NO:14, SEQ ID NO:94,
SEQ ID NO:17, SEQ ID NO:20 and SEQ ID NO:23 and has endoglucanase
activity, or a fragment thereof that has endoglucanase activity;
and c) at least one third protein comprising an amino acid sequence
that is at least 95% identical to an amino acid sequence selected
from the group consisting of: SEQ ID NO:26, SEQ ID NO:29, SEQ ID
NO:32, SEQ ID NO:35, SEQ ID NO:38 and SEQ ID NO:41 and has xylanase
activity, or a fragment thereof that has xylanase activity.
68. The multi-enzyme composition of claim 67, wherein the
composition further comprises a fourth protein comprising an amino
acid sequence that is at least 95% identical to SEQ ID NO:44 and
has .beta.-glucosidase activity, or a fragment thereof that has
.beta.-glucosidase activity.
69. The multi-enzyme composition of claim 68, wherein the
composition further comprises at least one fifth protein comprising
an amino acid sequence that is at least 95% identical to an amino
acid sequence selected from the group consisting of: SEQ ID NO:47,
SEQ ID NO:50, SEQ ID NO:53, SEQ ID NO:56, SEQ ID NO:59, SEQ ID
NO:62, SEQ ID NO:65, SEQ ID NO:68, SEQ ID NO:71, SEQ ID NO:74, SEQ
ID NO:77, SEQ ID NO:80, SEQ ID NO:83, SEQ ID NO:86, SEQ ID NO:89,
and SEQ ID NO:92, and has hemicellulase or chitinase activity, or a
fragment thereof that has hemicellulase or chitinase activity
70. A multi-enzyme composition comprising: a) a first protein
comprising an amino acid sequence that is at least 95% identical to
SEQ ID NO:2 and has cellobiohydrolase activity, or a fragment
thereof that has cellobiohydrolase activity; b) a second protein
comprising an amino acid sequence that is at least 95% identical to
SEQ ID NO:8 and has cellobiohydrolase activity, or a fragment
thereof that has cellobiohydrolase activity; c) a third protein
comprising an amino acid sequence that is at least 95% identical to
SEQ ID NO:94 and has endoglucanase activity, or a fragment thereof
that has endoglucanase activity; d) a fourth protein comprising an
amino acid sequence that is at least 95% identical to SEQ ID NO:29
and has xylanase activity, or a fragment thereof that has xylanase
activity; and e) a fifth protein comprising an amino acid sequence
that is at least 95% identical to SEQ ID NO:44 and has
.beta.-glucosidase activity, or a fragment thereof that has
.beta.-glucosidase activity.
71. The multi-enzyme composition of claim 70, further comprising a
sixth protein that has .beta.-xylosidase activity, or a fragment
thereof that has .beta.-xylosidase activity.
72. The multi-enzyme composition of claim 70, further comprising
one or more proteins selected from the group consisting of: d) a
protein comprising an amino acid sequence that is at least 95%
identical to SEQ ID NO:5 and has cellobiohydrolase activity, or a
fragment thereof that has cellobiohydrolase activity; e) a protein
comprising an amino acid sequence that is at least 95% identical to
SEQ ID NO:20 and has endoglucanase activity, or a fragment thereof
that has endoglucanase activity; f) a protein comprising an amino
acid sequence that is at least 95% identical to SEQ ID NO:26 and
has xylanase activity, or a fragment thereof that has xylanase
activity; and d) a protein comprising an amino acid sequence that
is at least 95% identical to SEQ ID NO:47 and has
arabinogalactanase activity, or a fragment thereof that has
arabinogalactanase activity.
73. The multi-enzyme composition of any one of claims 54 to 72,
wherein the multi-enzyme composition comprises at least one
hemicellulase.
74. The multi-enzyme composition of claim 73, wherein the
hemicellulase is selected from the group consisting of a xylanase,
an arabinofuranosidase, an acetyl xylan esterase, a glucuronidase,
and endo-galactanase, a mannanase, an endo arabinase, an exo
arabinase, an exo-galactanase, a ferulic acid esterase, a
galactomannanase, a xylogluconase, and mixtures thereof.
75. The multi-enzyme composition of claim 74, wherein the xylanase
is selected from the group consisting of endoxylanases,
exoxylanase, and .beta.-xylosidase.
76. The multi-enzyme composition of any one of claims 54 to 72,
wherein the multi-enzyme composition comprises at least one
cellulase.
77. The multi-enzyme composition of any one of claims 54 to 72,
wherein the composition is a crude fermentation product.
78. The multi-enzyme composition of any one of claims 54 to 72,
wherein the composition is a crude fermentation product that has
been subjected to a purification step.
79. The multi-enzyme composition of any one of claims 54 to 72,
further comprising one or more accessory enzymes.
80. The multi-enzyme composition of claim 79, wherein the accessory
enzymes includes at least one enzyme selected from the group
consisting of: cellulase, xylanase, ligninase, glucuronidase,
arabinofuranosidase, arabinogalactanase, furilic acid esterase,
lipase, pectinase, glucomannase, amylase, laminarinase,
xyloglucanase, galactanase, glucoamylase, pectate lyase, chitinase,
exo-.cndot.-D-glucosaminidase, cellobiose dehydrogenase, and
acetylxylan esterase.
81. The multi-enzyme composition of claim 79, wherein the accessory
enzyme is selected from the group consisting of a glucoamylase, a
pectinase, and a ligninase.
82. The multi-enzyme composition of claim 79, wherein the accessory
enzyme is a glucoamylase.
83. The multi-enzyme composition of claim 79, wherein the accessory
enzyme is added as a crude or a semi-purified enzyme mixture.
84. The multi-enzyme composition of claim 79, wherein the accessory
enzyme is produced by culturing at least one organism on a
substrate to produce the enzyme.
85. A method for degrading a lignocellulosic material to
fermentable sugars, comprising contacting the lignocellulosic
material with at least one isolated protein of claim 24.
86. The method of claim 85, wherein the isolated protein is part of
a multi-enzyme composition.
87. A method for degrading a lignocellulosic material to
fermentable sugars, comprising contacting the lignocellulosic
material with at least one multi-enzyme composition of any one of
claims 54 to 84.
88. A method for producing an organic substance, comprising: a)
saccharifying a lignocellulosic material with a multi-enzyme
composition of any one of claims 54 to 84; b) fermenting the
saccharified lignocellulosic material obtained with one or more
fermentating microoganisms; and c) recovering the organic substance
from the fermentation.
89. The method of claim 88, wherein the steps of saccharifying and
fermenting are performed simultaneously.
90. The method of claim 88, wherein the organic substance is an
alcohol, organic acid, ketone, amino acid, or gas.
91. The method of claim 88, wherein the organic substance is an
alcohol.
92. The method of claim 91, wherein the alcohol is ethanol.
93. The method of any one of claims 85 to 92, wherein the
lignocellulosic material is selected from the group consisting of
consisting of herbaceous material, agricultural residue, forestry
residue, municipal solid waste, waste paper, and pulp and paper
mill residue.
94. The method of any one of claims 85 to 92, wherein the
lignocellulosic material is distiller's dried grains or distiller's
dried grains with solubles.
95. The method of any one of claims 85 to 92, wherein the
distiller's dried grains or distiller's dried grains with solubles
is derived from corn.
96. A method for degrading a lignocellulosic material consisting of
distiller's dried grains or distiller's dried grains with solubles
to sugars, the method comprising contacting the distiller's dried
grains or distiller's dried grains with solubles with a
multi-enzyme composition, whereby at least 10% of the fermentable
sugars are liberated, wherein the multi-enzyme composition is the
multi-enzyme composition of any one of claims 54 to 84.
97. The method of claim 96, whereby at least 15% of the sugars are
liberated.
98. The method of claim 96, whereby at least 20% of the sugars are
liberated.
99. The method of claim 96, whereby at least 23% of the sugars are
liberated.
100. The method of claim 96, wherein the distiller's dried grains
or distiller's dried grains with solubles is derived from corn.
101. The method of any one of claims 85 to 100, further comprising
a pretreatment process for pretreating the lignocellulosic
material.
102. The method of claim 101, wherein the pretreatment process is
selected from the group consisting of physical treatment, metal
ion, ultraviolet light, ozone, organosolv treatment, steam
explosion treatment, lime impregnation with steam explosion
treatment, hydrogen peroxide treatment, hydrogen peroxide/ozone
(peroxone) treatment, acid treatment, dilute acid treatment, and
base treatment.
103. The method of claim 101, wherein the pretreatment process is
selected from the group consisting of organosolv, steam explosion,
heat treatment and AFEX.
104. The method of claim 103, wherein the heat treatment comprises
heating the lignocellulosic material to 121.degree. C. for 15
minutes.
105. The method of any one of claims 85 to 104, further comprising
detoxifying the lignocellulosic material.
106. The method of any one of claims 85 to 105, further comprising
recovering the fermentable sugar.
107. The method of claim 106, wherein the sugar is selected from
the group consisting of glucose, xylose, arabinose, galactose,
mannose, rhamnose, sucrose and fructose.
108. The method of any one of claims 85 to 107, further comprising
recovering the contacted lignocellulosic material after the
fermentable sugars are degraded.
109. A feed additive comprising the recovered lignocellulosic
material of claim 108.
110. The feed additive of claim 109, wherein the protein content of
the recovered lignocellulosic material is higher than that of the
starting lignocellulosic material.
111. A method of improving the performance of an animal which
comprises administering to the animal the feed additive of claim
109.
112. A method for improving the nutritional quality of an animal
feed comprising adding the feed additive of claim 109 to an animal
feed.
113. A method for stonewashing a fabric, comprising contacting the
fabric with at least one isolated protein of claim 24.
114. A method for stonewashing a fabric, comprising contacting the
fabric with at least one multi-enzyme composition of any one of
claims 54 to 84.
115. The method of claim 113 or claim 114, wherein the fabric is
denim.
116. A method for enhancing the softness or feel of a fabric or
depilling a fabric, comprising contacting the fabric with at least
one isolated protein of claim 24 or a fragment of at least one
isolated protein of claim 24 comprising a cellulose binding domain
(CBD) of the protein.
117. A method for enhancing the softness or feel of a fabric or
depilling a fabric, comprising contacting the fabric with at least
one multi-enzyme composition of any one of claims 54 to 84.
118. A method for restoring color to or brightening a fabric,
comprising contacting the fabric with at least one isolated protein
of claim 24.
119. A method for restoring color to or brightening a fabric,
comprising contacting the fabric with at least one multi-enzyme
composition of any one of claims 54 to 84.
120. A method of biopolishing, defibrillating, bleaching, dyeing or
desizing a fabric, comprising contacting the fabric with at least
one isolated protein of claim 24.
121. A method of biopolishing, defibrillating, bleaching, dyeing or
desizing a fabric, comprising contacting the fabric with at least
one multi-enzyme composition of any one of claims 54 to 84.
122. A method of deinking or biobleaching paper or pulp, comprising
contacting the paper or pulp with at least one isolated protein of
claim 24.
123. A method of deinking or biobleaching paper or pulp, comprising
contacting the paper or pulp with at least one multi-enzyme
composition of any one of claims 54 to 84.
124. A method for enhancing the cleaning ability of a detergent
composition, comprising adding at least one isolated protein of
claim 24 to the detergent composition.
125. A method for enhancing the cleaning ability of a detergent
composition, comprising adding at least one multi-enzyme
composition of any one of claims 54 to 84 to the detergent
composition.
126. A detergent composition, comprising at least one isolated
protein of claim 24 and at least one surfactant.
127. A detergent composition, comprising at least one multi-enzyme
composition of any one of claims 54 to 84 and at least one
surfactant.
128. A method for inhibiting or reducing fungal growth, comprising
contacting a fungus or an area susceptible to fungal growth with at
least one isolated protein of claim 25.
129. A method for reducing or preventing insect infestation on a
plant, comprising contacting the plant with at least one isolated
protein of claim 25.
130. A method for reducing or preventing allergic inflammation or
asthma, comprising administering at least one isolated protein of
claim 25 to a subject.
131. A lysing enzyme for the generation of protoplasts from fungi,
comprising at least one isolated protein of claim 25.
132. An insecticide composition, comprising at least one isolated
protein of claim 25.
Description
REFERENCE TO SEQUENCE LISTING
[0001] This application contains a Sequence Listing submitted as an
electronic text file named "5671-5_ST25.txt", having a size in
bytes of 366 kb, and created on 2 Aug. 2007 the information
contained in this electronic file is hereby incorporated by
reference in its entirety pursuant to 37 CFR .sctn.1.52(e)(5).
FIELD OF THE INVENTION
[0002] This invention relates to novel enzymes and novel methods
for producing the same. More specifically this invention relates to
enzymes produced by fungi. The invention also relates to a method
to convert lignocellulosic biomass to fermentable sugars with
enzymes that degrade the lignocellulosic material and novel
combinations of enzymes, including those that provide a synergistic
release of sugars from plant biomass. The invention also relates to
methods to use the novel enzymes and compositions of such enzymes
in a variety of other processes, including washing of clothing,
detergent processes, deinking and biobleaching of paper and pulp,
and treatment of waste streams.
BACKGROUND OF THE INVENTION
[0003] Large amounts of carbohydrates in plant biomass provide a
plentiful source of potential energy in the form of sugars (both
five carbon and six carbon sugars) that can be utilized for
numerous industrial and agricultural processes. However, the
enormous energy potential of these carbohydrates is currently
under-utilized because the sugars are locked in complex polymers,
and hence are not readily accessible for fermentation. These
complex polymers are often referred to collectively as
lignocellulose. Sugars generated from, degradation of plant biomass
potentially represent plentiful, economically competitive
feedstocks for fermentation into chemicals, plastics, and fuels,
including ethanol as a substitute for petroleum.
[0004] For example, distillers' dried grains (DDG) are
lignocellulosic byproducts of the corn dry milling process. Milled
whole corn kernels are treated with amylases to liquefy the starch
within the kernels and hydrolyze it to glucose. The glucose so
produced is then fermented in a second step to ethanol. The
residual solids after the ethanol fermentation and distillation are
centrifuged and dried, and the resulting product is DDG, which is
used as an animal feed stock. Although DDG composition can vary, a
typical composition for DDG is: 32% hemicellulose. 22% cellulose,
30% protein, 10% lipids, 4% residual starch, and 4% inorganics. In
theory, the cellulose and hemicellulose fractions, comprising about
54% of the weight of the DDG, can be efficiently hydrolyzed to
fermentable sugars by enzymes; however, it has been found that the
carbohydrates comprising lignocellulosic materials in DDG are more
difficult to digest. To date, the efficiency of hydrolysis of these
(hemi) cellulosic polymers by enzymes is much lower than the
hydrolytic efficiency of starch, due to the more complex and
recalcitrant nature of these substrates. Accordingly, the cost of
producing the requisite enzymes is higher than the cost of
producing amylases for starch hydrolysis.
[0005] Major polysaccharides comprising lignocellulosic materials
include cellulose and hemicelluloses. The enzymatic hydrolysis of
these polysaccharides to soluble sugars (and finally to monomers
such as glucose, xylose and other hexoses and pentoses) is
catalyzed by several enzymes acting in concert. For example,
endo-1,4-.beta.-glucanases (EGs) and exo-cellobiohydrolases (CBHs)
catalyze the hydrolysis of insoluble cellulose to
cellooligosachharides (with cellobiose the main product), while
.beta.-glucosidaes (BGLs) convert the oligosaccharides to glucose.
Similarly, xylanases, together with other enzymes such as
.alpha.-L-arabinofuranosidases, feruloyl and acetylxylan esterases
and .beta.-xylosidases, catalyze the hydrolysis of
hemicelluloses.
[0006] Regardless of the type of cellulosic feedstock, the cost and
hydrolytic efficiency of enzymes are major factors that restrict
the widespread use of biomass bioconversion processes. The
hydrolytic efficiency of a multi-enzyme complex in the process of
lignocellulosic saccharification depends both on properties of the
individual enzymes and the ratio of each enzyme within the
complex.
[0007] Enzymes useful for the hydrolysis of complex polysaccharides
are also highly useful in a variety of industrial textile
applications, as well as industrial paper and pulp applications,
and in the treatment of waste streams. For example, as an
alternative to the use of pumice in the stone washing process,
methods for treating cellulose-containing fabrics for clothing with
hydrolytic enzymes, such as cellulases, are known to improve the
softness or feel of such fabrics. Cellulases are also used in
detergent compositions, either for the purpose of enhancing the
cleaning ability of the composition or as a softening agent.
Cellulases are also used in combination with polymeric agents in
processes for providing a localized variation in the color density
of fibers. Such enzymes can also be used for the saccharification
of lignocellulosic biomass in waste streams, such as municipal
solid waste, for biobleaching of wood pulp, and for deinking of
recycled print paper. As with the hydrolysis of these
polysaccharides in lignocellulosic materials for use as feedstocks
described above, the cost and hydrolytic efficiency of the enzymes
are major factors that control the use of enzymes in these
processes.
[0008] Filamentous fungi are a source of cellulases and
hemicellulases, as well as other enzymes useful in the enzymatic
hydrolysis of major polysaccharides. In particular, strains of
Trichoderma sp., such as T. viride, T. reesei and T.
longibrachiatum, and Penicillium sp., and enzymes derived from
these strains, have previously been used to hydrolyze crystalline
cellulose. However, the costs associated with producing enzymes
from these fungi, as well as the presence of additional,
undesirable enzymes, remains a drawback. It is therefore desirable
to produce inexpensive enzymes and enzyme mixtures that efficiently
degrade cellulose and hemicellulose for use in a variety of
agricultural and industrial applications.
SUMMARY OF THE INVENTION
[0009] The present invention provides isolated nucleic acid
molecules comprising a nucleic acid sequence selected from the
group consisting of: [0010] a) a nucleic acid sequence encoding a
protein comprising an amino acid sequence selected from the group
consisting of: SEQ ID NO:11, SEQ ID NO:23, SEQ ID NO:29, SEQ ID
NO:32, SEQ ID NO:35, SEQ ID NO:38, SEQ ID NO:41, SEQ ID NO:44, SEQ
ID NO:50, SEQ ID NO:53, SEQ ID NO:56, SEQ ID NO:59, SEQ ID NO:62,
SEQ ID NO:65, SEQ ID NO:68, SEQ ID NO:71, SEQ ID NO:74, SEQ ID
NO:77, SEQ ID NO:80, SEQ ID NO:83, SEQ ID NO:86, SEQ ID NO:89, SEQ
ID NO:92, and SEQ ID NO:94. [0011] b) a nucleic acid sequence
encoding a fragment of the protein of (a), wherein the fragment has
a biological activity of the protein of (a); and [0012] c) a
nucleic acid sequence encoding an amino acid sequence that is at
least 70% identical to an amino acid sequence of (a) and has a
biological activity of the protein comprising the amino acid
sequence.
[0013] In some embodiments, the nucleic acid sequence encodes an
amino acid sequence that is at least 90%, at least 95%, at least
97%, or at least 99% identical to the amino acid sequence of (a)
and has a biological activity of the protein comprising the amino
acid sequence.
[0014] In some embodiments, the nucleic acid sequence encodes a
protein comprising an amino acid sequence selected from the group
consisting of: SEQ ID NO:11, SEQ ID NO:23, SEQ ID NO:29, SEQ ID
NO:32, SEQ ID NO:35, SEQ ID NO:38, SEQ ID NO:41, SEQ ID NO:44, SEQ
ID NO:50, SEQ ID NO:53, SEQ ID NO:56, SEQ ID NO:59, SEQ ID NO:62,
SEQ ID NO:65, SEQ ID NO:68, SEQ ID NO:71, SEQ ID NO:74, SEQ ID
NO:77, SEQ ID NO:80, SEQ ID NO:83, SEQ ID NO:86, SEQ ID NO:89, SEQ
ID NO:92, and SEQ ID NO:94.
[0015] In some embodiments, the nucleic acid sequence consists of a
nucleic acid sequence selected from the group consisting of: SEQ ID
NO: 1, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:10, SEQ ID
NO:12, SEQ ID NO:22, SEQ ID NO:24, SEQ ID NO:28, SEQ ID NO:30, SEQ
ID NO:31, SEQ ID NO:33, SEQ ID NO:34, SEQ ID NO:36, SEQ ID NO:37,
SEQ ID NO:39, SEQ ID NO:40, SEQ ID NO:41, SEQ ID NO:42, SEQ ID
NO:43, SEQ ID NO:45, SEQ ID NO:46, SEQ ID NO:48, SEQ ID NO:49, SEQ
ID NO:51, SEQ ID NO:52, SEQ ID NO:54, SEQ ID NO:55, SEQ ID NO:57,
SEQ ID NO:58, SEQ ID NO:60, SEQ ID NO:61, SEQ ID NO:63, SEQ ID
NO:64, SEQ ID NO:66, SEQ ID NO:67, SEQ ID NO:69, SEQ ID NO:70, SEQ
ID NO:72, SEQ ID NO:73, SEQ ID NO:75, SEQ ID NO:76, SEQ ID NO:78.
SEQ ID NO:79, SEQ ID NO:90, SEQ ID NO:91 and SEQ ID NO:93.
[0016] In some embodiments, the nucleic acid sequence encodes a
protein comprising the amino acid sequence of SEQ ID NO:11, wherein
the protein has cellobiohydrolase activity.
[0017] In some embodiments, the nucleic acid sequence encodes a
protein comprising an amino acid sequence selected from the group
consisting of: SEQ ID NO:94 and SEQ ID NO:23, wherein the protein
has endoglucanase activity.
[0018] In some embodiments, the nucleic acid sequence encodes a
protein comprising an amino acid sequence selected from the group
consisting of: SEQ ID NO:29, SEQ ID NO:32, SEQ ID NO:35, SEQ ID
NO:38 and SEQ ID NO:41, wherein the protein has xylanase
activity.
[0019] In some embodiments, the nucleic acid sequence encodes a
protein comprising an amino acid sequence of SEQ ID NO:44, wherein
the protein has .beta.-glucosidase activity.
[0020] In some embodiments, the nucleic acid sequence encodes a
protein comprising an amino acid sequence selected from the group
consisting of: SEQ ID NO:50, SEQ ID NO:53, SEQ ID NO:62, SEQ ID
NO:65, SEQ ID NO:68, SEQ ID NO:71, SEQ ID NO:74, SEQ ID NO:77, SEQ
ID NO:80, SEQ ID NO:83, SEQ ID NO:86, SEQ ID NO:89, and SEQ ID
NO:92, wherein the protein has hemicellulase activity.
[0021] In some embodiments, the nucleic acid sequence encodes a
protein comprising the amino acid sequence of SEQ ID NO:50, wherein
the protein has glucoamylase activity.
[0022] In some embodiments, the nucleic acid sequence encodes a
protein comprising the amino acid sequence of SEQ ID NO:53, wherein
the protein has pectate lyase activity.
[0023] In some embodiments, the nucleic acid sequence encodes a
protein comprising an amino acid sequence selected from the group
consisting of: SEQ ID NO:56 and SEQ ID NO:59, wherein the protein
has chitinase activity.
[0024] in some embodiments, the nucleic acid sequence encodes a
protein comprising an amino acid sequence selected from the group
consisting of: SEQ ID NO:62, SEQ ID NO:80, and SEQ ID NO:83,
wherein the protein has acetylxylan esterase activity.
[0025] In some embodiments, the nucleic acid sequence encodes a
protein comprising an amino acid sequence selected from the group
consisting of: SEQ ID NO:65, SEQ ID NO:68, and SEQ ID NO:71,
wherein the protein has ferulic acid esterase activity.
[0026] In some embodiments, the nucleic acid sequence encodes a
protein comprising an amino acid sequence selected from the group
consisting of: SEQ ID NO:74 and SEQ ID NO:77, wherein the protein
has arabinofuranosidase activity.
[0027] In some embodiments, the nucleic acid sequence encodes a
protein comprising the amino acid sequence of SEQ ID NO:86, wherein
the protein has pectin methyl esterase activity.
[0028] In some embodiments, the nucleic acid sequence of (a)
encodes a protein comprising the amino acid sequence of SEQ ID
NO:89, wherein the protein has endo-arabinase activity.
[0029] In some embodiments, the nucleic acid sequence encodes a
protein comprising the amino acid sequence of SEQ ID NO:92, wherein
the protein has .beta.-xylosidase or .beta.-glucosidase
activity.
[0030] In some embodiments, the nucleic acid sequence encoding a
protein comprising an amino acid sequence selected from the group
consisting of: SEQ ID NO:2 and SEQ ID NO:5.
[0031] In some embodiments, the nucleic acid sequence is fully
complementary to any of the nucleic acid sequences described
above.
[0032] The invention also provides a protein comprising an amino
acid sequence encoded by the nucleic acid molecules of the present
invention.
[0033] In some embodiments, the protein comprises an amino acid
sequence selected from the group consisting of: SEQ ID NO:11, SEQ
ID NO:23, SEQ ID NO:29, SEQ ID NO:32. SEQ ID NO:35, SEQ ID NO:38,
SEQ ID NO:41, SEQ ID NO:44, SEQ ID NO:50, SEQ ID NO:53, SEQ ID
NO:56, SEQ ID NO:59, SEQ ID NO:62, SEQ ID NO:65, SEQ ID NO:68, SEQ
ID NO:71, SEQ ID NO:74, SEQ ID NO:77, SEQ ID NO:80, SEQ ID NO:83,
SEQ ID NO:86, SEQ ID NO:89, SEQ ID NO:92, and SEQ ID NO:94.
[0034] In some embodiments, the protein comprises an amino acid
sequence selected from SEQ ID NO:56 and SEQ ID NO:59, wherein the
protein has chitinase activity.
[0035] The invention also provides an isolated fusion protein
comprising an isolated protein of the present invention fused to a
protein comprising an amino acid sequence that is heterologous to
the isolated protein of the present invention.
[0036] The invention also provides an isolated antibody or antigen
binding fragment thereof that selectively binds to a protein of the
present invention.
[0037] The invention also provides a kit for degrading a
lignocellulosic material to fermentable sugars comprising at least
one isolated protein of the present invention.
[0038] The invention also provides a detergent comprising at least
one isolated protein of the present invention
[0039] The invention also provides a composition for the
degradation of a lignocellulosic material comprising at least one
isolated protein of the present invention.
[0040] The invention also provides a recombinant nucleic acid
molecule comprising an isolated nucleic acid molecule of the
present invention, operatively linked to at least one expression
control sequence.
[0041] In some embodiments, the recombinant nucleic acid molecule
comprises an expression vector. In some embodiments, the
recombinant nucleic acid molecule comprises a targeting vector.
[0042] The invention also provides an isolated host cell
transfected with a nucleic acid molecule of the present
invention.
[0043] In some embodiments, the host cell is a fungus. In some
embodiments, the host cell is a filamentous fungus. In some
embodiments, the filamentous fungus is from a genus selected from
the group consisting of: Chrysosporium, Thielavia, Neurospora,
Aureobasidium, Filibasidium, Piromyces, Corynascus, Crvplococcus,
Acremonium, Tolypocladium, Scytalidium, Schizophyllum,
Sporotrichum, Penicillium, Gibberella. Myceliophthora, Mucor,
Aspergillus, Fusarium, Humicola, and Trichoderma, and anamorphs and
teleomorphs thereof. In some embodiments, the host cell is a
bacterium.
[0044] The invention also provides an oligonucleotide consisting
essentially of at least 12 consecutive nucleotides of a nucleic
acid sequence selected from the group consisting of: SEQ ID NO:1,
SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:10, SEQ ID NO:12,
SEQ ID NO:22, SEQ ID NO:24, SEQ ID NO:28, SEQ ID NO:30. SEQ ID
NO:31, SEQ ID NO:33, SEQ ID NO:34, SEQ ID NO:36, SEQ ID NO:37, SEQ
ID NO:39, SEQ ID NO:40, SEQ ID NO:41, SEQ ID NO:42, SEQ ID NO:43,
SEQ ID NO:45, SEQ ID NO:46, SEQ ID NO:48, SEQ ID NO:49, SEQ ID
NO:51, SEQ ID NO:52, SEQ ID NO:54, SEQ ID NO:55, SEQ ID NO:57. SEQ
ID NO:58, SEQ ID NO:60, SEQ ID NO:61, SEQ ID NO:63, SEQ ID NO:64,
SEQ ID NO:66, SEQ ID NO:67, SEQ ID NO:69, SEQ ID NO:70, SEQ ID
NO:72, SEQ ID NO:73, SEQ ID NO:75, SEQ ID NO:76, SEQ ID NO:78, SEQ
ID NO:79, SEQ ID NO:90, SEQ ID NO:91, and SEQ ID NO:93, or the
complement thereof.
[0045] The invention also provides a kit comprising at least one
oligonucleotide of the present invention.
[0046] The invention also provides a method for producing a protein
of the present invention, comprising culturing a cell that has been
transfected with a nucleic acid molecule comprising a nucleic acid
sequence encoding the protein, and expressing the protein with the
transfected cell.
[0047] In some embodiments, the method further comprises recovering
the protein from the cell or from a culture comprising the
cell.
[0048] The invention also provides a genetically modified organism
comprising components suitable for degrading a lignocellulosic
material to fermentable sugars, wherein the organism has been
genetically modified to express at least one protein of the present
invention.
[0049] In some embodiments, the genetically modified organism is a
microorganism. In some embodiments, the microorganism is a
filamentous fungus.
[0050] In some embodiments, the filamentous fungus is from a genus
selected from the group consisting of: Chrysosporium, Thielavia,
Neurospora, Aureobasidium, Filibasidium, Piromyces, Corynascus,
Cryplococcus, Acremonium, Tolypocladium, Scytalidium,
Schizophyllum, Sporotrichum, Penicillium, Talaromyces, Gibberella,
Myceliophthora, Mucor, Aspergillus, Fusarium, Humicola, and
Trichoderma.
[0051] In some embodiments, the filamentous fungus is selected from
the group consisting of: Trichoderma reesei, Chrysosporium
lucknowense, Aspergillus japonicus, Penicillium canescens,
Penicillium solitum, Penicillium funiculosum, and Talaromyces
flavus.
[0052] In some embodiments, the organism has been genetically
modified to express at least one additional enzyme.
[0053] In some embodiments, the additional enzyme is an accessory
enzyme selected from the group consisting of: cellulase, xylanase,
ligninase, glucuronidase, arabinofuranosidase, arabinogalactanase,
ferulic acid esterase, lipase, pectinase, glucomannase, amylase,
laminarinase, xyloglucanase, galactanase, glucoamylase, pectate
lyase, chitinase, exo-.cndot.-D-glucosaminidase, cellobiose
dehydrogenase, and acetylxylan esterase.
[0054] In some embodiment's, the genetically modified organism is a
plant.
[0055] The invention also provides a recombinant enzyme isolated
from a genetically modified microorganism of the present
invention.
[0056] In some embodiments, the enzyme has been subjected to a
purification step.
[0057] The invention also provides a crude fermentation product
produced by culturing the comprising a genetically modified
microorganism of the present invention, wherein the crude
fermentation product contains at least one protein of the present
invention.
[0058] The invention also provides a multi-enzyme composition
comprising enzymes produced by a genetically modified organism of
the present invention and recovered therefrom.
[0059] The invention also provides a multi-enzyme composition
comprising at least one protein of the present invention and at
least one additional protein for degrading a lignocellulosic
material or a fragment thereof that has biological activity.
[0060] In some embodiments, the at least one additional protein for
degrading a lignocellulosic material to fermentable sugars is
selected from the group consisting of: SEQ ID NO:8, SEQ ID NO:14.
SEQ ID NO:17, SEQ ID NO:20, SEQ ID NO:26 and SEQ ID NO:47, or a
fragment thereof that has biological activity.
[0061] In some embodiments, the multi-enzyme composition comprises
at least one cellobiohydrolase, at least one xylanase, at least one
endoglucanase, at least one .cndot.-glucosidase, at least one
.cndot.-xylosidase, and at least one accessory enzyme.
[0062] In some embodiments, between about 50% and about 70% of the
enzymes in the composition are cellobiohydrolases.
[0063] In some embodiments, between about 10% and about 30% of the
enzymes in the composition are xylanases.
[0064] In some embodiments, between about 5% and about 15% of the
enzymes in the composition are endoglucanases.
[0065] In some embodiments, between about 1% and about 5% of the
enzymes in the composition are .cndot.-glucosidases.
[0066] In some embodiments, between about 1% and about 3% of the
enzymes in the composition are .cndot.-xylosidases.
[0067] In some embodiments, the composition comprises about 60%
cellobiohydrolases, about 20% xylanases, about 10% endoglucanases,
about 3% .cndot.-glucosidases, about 2% .cndot.-xylosidases, and
about 5% accessory enzymes.
[0068] In some embodiments, the xylanases are selected from the
group consisting of: endoxylanases, exoxylanases, and
.cndot.-xylosidases.
[0069] In some embodiments, the accessory enzymes include an enzyme
selected from the group consisting of: ligninase, glucuronidase,
arabinofuranosidase, arabinogalactanase, furilic acid esterase,
lipase, pectinase, glucomannase, amylase, laminarinase,
xyloglucanase, galactanase, glucoamylase, pectate lyase, chitinase,
exo-.cndot.-D-glucosaminidase, cellobiose dehydrogenase, and
acetylxylan esterase.
[0070] In some embodiments, the composition comprises at least one
protein comprising the amino acid sequence of SEQ ID NO:11, wherein
the protein has cellobiohydrolase activity, at least one protein
comprising an amino acid sequence selected from the group
consisting of: SEQ ID NO:94 and SEQ ID NO:23, wherein the protein
has endoglucanase activity, at least one protein comprising an
amino acid sequence selected from the group consisting of: SEQ ID
NO:29, SEQ ID NO:32, SEQ ID NO:35, SEQ ID NO:38 and SEQ ID NO:41,
wherein the protein has xylanase activity, at least one protein
comprising an amino acid sequence of SEQ ID NO:44, wherein the
protein has .cndot.-glucosidase activity, and at least one protein
comprising an amino acid sequence selected from the group
consisting of: SEQ ID NO:50, SEQ ID NO:53, SEQ ID NO:62, SEQ ID
NO:65, SEQ ID NO:68, SEQ ID NO:71, SEQ ID NO:74, SEQ ID NO:77, SEQ
ID NO:80, SEQ ID NO:83, SEQ ID NO:86, SEQ ID NO:89, and SEQ ID
NO:92, wherein the protein has hemicellulase activity.
[0071] In some embodiments, the composition comprises at least one
first protein comprising an amino acid sequence that is at least
95% identical to an amino acid sequence selected from the group
consisting of: SEQ ID NO:2, SEQ ID NO:5, SEQ ID NO:8 and SEQ ID
NO:11 and has cellobiohydrolase activity, or a fragment thereof
that has cellobiohydrolase activity; at least one second protein
comprising an amino acid sequence that is at least 95% identical to
an amino acid sequence selected from the group consisting of: SEQ
ID NO:14, SEQ ID NO:94, SEQ ID NO:17, SEQ ID NO:20 and SEQ ID NO:23
and has endoglucanase activity, or a fragment thereof that has
endoglucanase activity; and at least one third protein comprising
an amino acid sequence that is at least 95% identical to an amino
acid sequence selected from the group consisting of: SEQ ID NO:26.
SEQ ID NO:29, SEQ ID NO:32, SEQ ID NO:35, SEQ ID NO:38 and SEQ ID
NO:41 and has xylanase activity, or a fragment thereof that has
xylanase activity.
[0072] In some embodiments, the composition further comprises a
fourth protein comprising an amino acid sequence that is at least
95% identical to SEQ ID NO:44 and has .beta.-glucosidase activity,
or a fragment thereof that has .beta.-glucosidase activity.
[0073] In some embodiments, the composition further comprises at
least one fifth protein comprising an amino acid sequence that is
at least 95% identical to an amino acid sequence selected from the
group consisting of: SEQ ID NO:47, SEQ ID NO:50, SEQ ID NO:53, SEQ
ID NO:56, SEQ ID NO:59, SEQ ID NO:62, SEQ ID NO:65, SEQ ID NO:68,
SEQ ID NO:71, SEQ ID NO:74, SEQ ID NO:77, SEQ ID NO:80, SEQ ID
NO:83, SEQ ID NO:86, SEQ ID NO:89, and SEQ ID NO:92, and has
hemicellulase or chitinase activity, or a fragment thereof that has
hemicellulase or chitinase activity
[0074] The invention also provides a multi-enzyme composition
comprising a first protein comprising an amino acid sequence that
is at least 95% identical to SEQ ID NO:2 and has cellobiohydrolase
activity, or a fragment thereof that has cellobiohydrolase
activity; a second protein comprising an amino acid sequence that
is at least 95% identical to SEQ ID NO:8 and has cellobiohydrolase
activity, or a fragment thereof that has cellobiohydrolase
activity; a third protein comprising an amino acid sequence that is
at least 95% identical to SEQ ID NO:94 and has endoglucanase
activity, or a fragment thereof that has endoglucanase activity; a
fourth protein comprising an amino acid sequence that is at least
95% identical to SEQ ID NO:29 and has xylanase activity, or a
fragment thereof that has xylanase activity; and a fifth protein
comprising an amino acid sequence that is at least 95% identical to
SEQ ID NO:44 and has .beta.-glucosidase activity, or a fragment
thereof that has .beta.-glucosidase activity.
[0075] In some embodiments, the multi-enzyme composition further
comprises a sixth protein that has .beta.-xylosidase activity, or a
fragment thereof that has .beta.-xylosidase activity.
[0076] In some embodiments, the multi-enzyme composition further
comprises one or more proteins selected from the group consisting
of: [0077] a) a protein comprising an amino acid sequence that is
at least 95% identical to SEQ ID NO:5 and has cellobiohydrolase
activity, or a fragment thereof that has cellobiohydrolase
activity; [0078] b) a protein comprising an amino acid sequence
that is at least 95% identical to SEQ ID NO:20 and has
endoglucanase activity, or a fragment thereof that has
endoglucanase activity; [0079] c) a protein comprising an amino
acid sequence that is at least 95% identical to SEQ ID NO:26 and
has xylanase activity, or a fragment (hereof that has xylanase
activity; and [0080] d) a protein comprising an amino acid sequence
that is at least 95% identical to SEQ ID NO:47 and has
arabinogalactanase activity, or a fragment thereof that has
arabinogalactanase activity.
[0081] In some embodiments, the multi-enzyme composition comprises
at least one hemicellulase.
[0082] In some embodiments, the hemicellulase is selected from the
group consisting of a xylanase, an arabinofuranosidase, an acetyl
xylan esterase, a glucuronidase, and endo-galactanase, a mannanase,
an endo arabinase, an exo arabinase, an exo-galactanase, a ferulic
acid esterase, a galactomannanase, a xylogluconase, and mixtures
thereof.
[0083] In some embodiments, the xylanase is selected from the group
consisting of endoxylanases, exoxylanase, and
.beta.-xylosidase.
[0084] In some embodiments, the multi-enzyme composition comprises
at least one cellulase.
[0085] In some embodiments, the composition is a crude fermentation
product.
[0086] In some embodiments, the composition is a crude fermentation
product that has been subjected to a purification step.
[0087] In some embodiments, the multi-enzyme composition further
comprises one or more accessory enzymes.
[0088] In some embodiments, the accessory enzyme includes at least
one enzyme selected from the group consisting of: cellulase,
xylanase, ligninase, glucuronidase, arabinofuranosidase,
arabinogalactanase, furilic acid esterase, lipase, pectinase,
glucomannase, amylase, laminarinase, xyloglucanase, galactanase,
glucoamylase, pectate lyase, chitinase,
exo-.cndot.-D-glucosaminidase, cellobiose dehydrogenase, and
acetylxylan esterase.
[0089] In some embodiments, the accessory enzyme is selected from
the group consisting of a glucoamylase, a pectinase, and a
ligninase.
[0090] In some embodiments, the accessory enzyme is a
glucoamylase.
[0091] In some embodiments, the accessory enzyme is added as a
crude or a semi-purified enzyme mixture.
[0092] In some embodiments, the accessory enzyme is produced by
culturing at least one organism on a substrate to produce the
enzyme.
[0093] The invention also provides a method for degrading a
lignocellulosic material to fermentable sugars, comprising,
contacting the lignocellulosic material with at least one isolated
protein of the present invention.
[0094] In some embodiments, the isolated protein is part of a
multi-enzyme composition.
[0095] The invention also provides a method for degrading a
lignocellulosic material to fermentable sugars, comprising
contacting the lignocellulosic material with at least one
multi-enzyme composition of the present invention.
[0096] The invention also provides a method for producing an
organic substance, comprising: [0097] a) saccharifying a
lignocellulosic material with a multi-enzyme composition of the
present invention; [0098] b) fermenting the saccharified
lignocellulosic material obtained with one or more fermentating
microoganisms; and [0099] c) recovering the organic substance from
the fermentation.
[0100] In some embodiments, the steps of saccharifying and
fermenting are performed simultaneously.
[0101] In some embodiments, the organic substance is an alcohol,
organic acid, ketone, amino acid, or gas.
[0102] In some embodiments, the organic substance is an alcohol. In
some embodiments, the alcohol is ethanol.
[0103] In some embodiments, the lignocellulosic material is
selected from the group consisting of consisting of herbaceous
material, agricultural residue, forestry residue, municipal solid
waste, waste paper, and pulp and paper mill residue.
[0104] In some embodiments, the lignocellulosic material is
distiller's dried grains or distiller's dried grains with
solubles.
[0105] In some embodiments, the distiller's dried grains or
distiller's dried grains with solubles is derived from corn.
[0106] The invention also provides a method for degrading a
lignocellulosic material consisting of distiller's dried grains or
distiller's dried grains with solubles to sugars, the method
comprising contacting the distiller's dried grains or distiller's
dried grains with solubles with a multi-enzyme composition, whereby
at least 10% of the fermentable sugars are liberated, wherein the
multi-enzyme composition is the multi-enzyme composition of the
present invention.
[0107] In some embodiments, at least 15% of the sugars are
liberated. In some embodiments, at least 20% of the sugars are
liberated. In some embodiments, at least 23% of the sugars are
liberated.
[0108] In some embodiments, the distiller's dried grains or
distiller's dried grains with solubles is derived from corn.
[0109] In some embodiments, the method further comprises a
pretreatment process for pretreating the lignocellulosic
material.
[0110] In some embodiments, the pretreatment process is selected
from the group consisting of physical treatment, metal ion,
ultraviolet light, ozone, organosolv treatment, steam explosion
treatment, lime impregnation with steam explosion treatment,
hydrogen peroxide treatment, hydrogen peroxide/ozone (peroxone)
treatment, acid treatment, dilute acid treatment, and base
treatment.
[0111] In some embodiments, the pretreatment process is selected
from the group consisting of organosolv, steam explosion, heat
treatment and AFEX.
[0112] In some embodiments, the heat treatment comprises heating
the lignocellulosic material to 121.degree. C. for 15 minutes.
[0113] In some embodiments, the method further comprises
detoxifying the lignocellulosic material.
[0114] In some embodiments, the method further comprises recovering
the fermentable sugar.
[0115] In some embodiments, the sugar is selected from the group
consisting of glucose, xylose, arabinose, galactose, mannose,
rhamnose, sucrose and fructose.
[0116] In some embodiments, the method further comprises recovering
the contacted lignocellulosic material after the fermentable sugars
are degraded.
[0117] The invention also provides a feed additive comprising a
recovered lignocellulosic material.
[0118] In some embodiments, the protein content of the recovered
lignocellulosic material is higher than that of the starting
lignocellulosic material.
[0119] The invention also provides a method of improving the
performance of an animal which comprises administering to the
animal a feed additive of the present invention.
[0120] The invention also provides a method for improving the
nutritional quality of an animal feed comprising adding a feed
additive of the present invention to an animal feed.
[0121] The invention also provides a method for stonewashing a
fabric, comprising contacting the fabric with at least one isolated
protein of the present invention.
[0122] The invention also provides a method for stonewashing a
fabric, comprising contacting the fabric with at least one
multi-enzyme composition of the present invention.
[0123] In some embodiments, the fabric is denim.
[0124] The invention also provides a method for enhancing the
softness or feel of a fabric or depilling a fabric, comprising
contacting the fabric with at least one isolated protein of the
present invention or a fragment thereof comprising a cellulose
binding domain (CBD) of the protein.
[0125] The invention also provides a method for enhancing the
softness or feel of a fabric or depilling a fabric, comprising
contacting the fabric with at least one multi-enzyme composition of
the present invention.
[0126] The invention also provides a method for restoring color to
or brightening a fabric, comprising contacting the fabric with at
least one isolated protein of the present invention.
[0127] The invention also provides a method for restoring color to
or brightening a fabric, comprising contacting the fabric with at
least one multi-enzyme composition of the present invention.
[0128] The invention also provides a method of biopolishing,
defibrillating, bleaching, dyeing or desizing a fabric, comprising
contacting the fabric with at least one isolated protein of the
present invention.
[0129] The invention also provides a method of biopolishing,
defibrillating, bleaching, dyeing or desizing a fabric, comprising
contacting the fabric with at least one mull-enzyme composition of
the present invention.
[0130] The invention also provides a method of deinking or
biobleaching paper or pulp, comprising contacting the paper or pulp
with at least one isolated protein of the present invention.
[0131] The invention also provides a method of deinking or
biobleaching paper or pulp, comprising contacting the paper or pulp
with at least one multi-enzyme composition of the present
invention.
[0132] The invention also provides a method for enhancing the
cleaning ability of a detergent composition, comprising adding at
least one isolated protein of the present invention to the
detergent composition.
[0133] The invention also provides a method for enhancing the
cleaning ability of a detergent composition, comprising adding at
least one multi-enzyme composition of the present invention to the
detergent composition.
[0134] The invention also provides a detergent composition,
comprising at least one isolated protein of the present invention
and at least one surfactant.
[0135] The invention also provides a detergent composition,
comprising at least one multi-enzyme composition of the present
invention and at least one surfactant.
[0136] The invention also provides a method for inhibiting or
reducing fungal growth, comprising contacting a fungus or an area
susceptible to fungal growth with at least one isolated protein
comprising an amino acid sequence selected from the group
consisting of: SEQ ID NO:56 and SEQ ID NO:59, wherein the protein
has chitinase activity.
[0137] The invention also provides a method for reducing or
preventing insect infestation on a plant, comprising contacting the
plant with at least one isolated protein comprising an amino acid
sequence selected from the group consisting of SEQ ID NO:56 and SEQ
ID NO:59, wherein the protein has chitinase activity.
[0138] The invention also provides a method for reducing or
preventing allergic inflammation or asthma, comprising
administering at least one isolated protein comprising an amino
acid sequence selected from the group consisting of: SEQ ID NO:56
and SEQ ID NO:59, wherein the protein has chitinase activity.
[0139] The invention also provides a lysing enzyme for the
generation of protoplasts from fungi, comprising at least one
isolated protein comprising an amino acid sequence selected from
the group consisting of: SEQ ID NO:56 and SEQ ID NO:59, wherein the
protein has chitinase activity.
[0140] The invention also provides an insecticide composition,
comprising at least one isolated protein comprising an amino acid
sequence selected from the group consisting of: SEQ ID NO:56 and
SEQ ID NO:59, wherein the protein has chitinase activity.
BRIEF DESCRIPTION OF THE FIGURES
[0141] FIG. 1 shows the specific activity of cellobiohydrolases and
endoglucanases of the present invention toward the indicated
substrates. Specific activities (Units/mg) were determined at pH
5.0 and 50.degree. C.
[0142] FIG. 2 illustrates the enzymatic activity (percentage of
maximum) of cellobiohydrolases of the present invention at varying
pH levels. The activities were determined using an Avicel substrate
at 40.degree. C.
[0143] FIG. 3 illustrates the enzymatic activity (percentage of
maximum) of cellobiohydrolases of the present invention at varying
temperature levels. The activities were determined using an Avicel
substrate at pH 5.0.
[0144] FIG. 4 illustrates the enzymatic activity (percentage of
maximum) of endoglucanases of the present invention at varying pH
levels. The activities were determined using an CMC substrate at
50.degree. C.
[0145] FIG. 5 illustrates the enzymatic activity (percentage of
maximum) of endoglucanases of the present invention at varying
temperature levels. The activities were determined using a CMC
substrate at pH 5.0.
[0146] FIG. 6 shows the specific activity of xylanases of the
present invention toward the indicated substrates. Specific
activites (Units/mg) were determined at pH 5.0 and 50.degree.
C.
[0147] FIG. 7 illustrates the enzymatic activity (percentage of
maximum) of xylanases of the present invention at varying pH
levels. The activities were determined using a soluble xylan
substrate at 40.degree. C.
[0148] FIG. 8 illustrates the enzymatic activity (percentage of
maximum) of xylanases of the present invention at varying
temperature levels. The activities were determined using a soluble
xylan substrate at pH 5.0.
[0149] FIG. 9 illustrates the enzymatic activity (percentage of
maximum) of enzymes of the present invention at varying pH
levels.
[0150] FIG. 10 illustrates the enzymatic activity (percentage of
maximum) of enzymes of the present invention at varying temperature
levels.
[0151] FIG. 11 shows a standard calibration plot for the
Somogyi-Nelson assay used for the measurement of reducing sugars in
enzymatic hydrolyzates.
[0152] FIG. 12 shows a standard calibration plot with glucose
standards from 0.025 g/l to 0.5 g/l for a glucosooxidase-peroxidase
assay used to determine glucose levels.
[0153] FIG. 13 shows the relative activity over time of xylanase at
20-, 10- and 5-fold dilutions in a thermoshock assay.
[0154] FIG. 14 shows the pelletizing stability of xylanases of the
present invention. The percentage of activity after 20 seconds at
80.degree. C. relative to an unheated sample is shown.
[0155] FIG. 15 shows the relative viscosity of water/rye mixes
treated with the indicated enzyme(s). To compare the efficiency of
the preparations, the xylanase activities of the samples (units per
flask) are also indicated.
[0156] FIG. 16 shows a plot of the residual viscosity (%) of
water/rye mixes) after enzymatic treatment.
[0157] FIG. 17 shows a graph of the residual viscosity (%) of rye
meal water extract after a 1-hour incubation with 1 unit of the
indicated xylanase (in vitro feed test).
[0158] FIG. 18 shows the biobleaching ability (A.sub.235) of
xylanases of the present invention on soft wood pulp at pH 7.5 and
50.degree. C. for 2 hours. For each xylanase, the assay was
conducted using 0.01, 0.003, 0.001, 0.0003 mg/ml of enzyme in the
reaction mixture, as indicated.
[0159] FIG. 19 shows the biobleaching ability (A.sub.235) of
xylanases of the present invention on hard wood pulp at pH 7.5 and
50.degree. C. for 2 hours. For each xylanase, the assay was
conducted using 0.01, 0.003, 0.001, 0.0003 mg/ml of enzyme in the
reaction mixture, as indicated.
[0160] FIG. 20 illustrates the bioscouring ability of
cellobiohydrolases and endoglucanases of the present invention. The
reciprocal of the time (l/time; sec.sup.-1) of water drop
adsorption on a textile after treatment at pH 5.0 with 0.6 mg/l of
enzyme followed by heating is shown. The water drop adsorption time
for the untreated control sample is 2400 sec (l/time=0.0004).
[0161] FIG. 21 illustrates a comparison of the pectin content in an
unscoured sample, an alkaline scoured sample, and bioscoured
samples.
[0162] FIG. 22 shows a diagram of cells for the enzymatic and
mechanical stress treatment of fabric. In FIG. 22A, the following
numbers represent the indicated structure: 1: glass vessel, 2:
plastic tube, 3: denim swatch, 4: rubber ring, 5: buckshot balls,
and 6: teflon disk. In FIG. 22B, the following numbers represent
the indicated structure: 1: lid with a thread inside, 2: rubber
ring, 3: cell body (stainless steel), 4: cylindrical cartridge, 5:
stainless steel cylinders, 6: circular denim swatch.
[0163] FIG. 23 illustrates the biopolishing ability of
cellobiohydrolases and endoglucanases of the present invention. The
dye release (D 400 nm) after treatment with 0.2 g/l enzyme at pH
5.0, 50.degree. C. is shown.
[0164] FIG. 24 shows an example histogram generated from the color
intensity analysis of a denim swatch.
[0165] FIG. 25 illustrates the denim-washing (abrasion) ability of
cellobiohydrolases and endoglucanases of the present invention. The
relative units per mg of enzyme determined at 50.degree. C., pH 5.0
is shown.
[0166] FIG. 26 shows a graph of the saccharification ability of
cellobiohydrolases and endoglucanases of the present invention. The
amount of glucose (g/l) produced after the hydrolysis of 5 g/l
Avicel at pH 5.0, 40.degree. C. for 120 hours in the presence of
0.1 mg/ml enzyme and 0.2 U/ml cellobiase is shown.
DETAILED DESCRIPTION OF THE INVENTION
[0167] The present invention relates generally to proteins that
play a role in the degradation of cellulose and hemicellulose and
nucleic acids encoding the same. In particular, the present
invention relates to enzymes isolated from a filamentous fungal
strain denoted herein as C1 (Accession No. VKM F-3500-D), nucleic
acids encoding the enzymes, and methods of producing and using the
enzymes. The invention also provides compositions that include at
least one of the enzymes described herein for uses including, but
not limited to, the hydrolysis of lignocellulose. The invention
stems, in part, from the discovery of a variety of novel cellulases
and hemicellulases produced by the C1 fungus that exhibit high
activity toward cellulose and other components of biomass.
[0168] The present invention also provides methods and compositions
for the conversion of plant biomass to fermentable sugars that can
be converted to useful products. The methods include methods for
degrading lignocellulosic material using enzyme mixtures to
liberate sugars. The compositions of the invention include enzyme
combinations that break down lignocellulose. As used herein the
terms "biomass" or "lignocellulosic material" includes materials
containing cellulose and/or hemicellulose. Generally, these
materials also contain xylan, lignin, protein, and carbohydrates,
such as starch and sugar. Lignocellulose is generally found, for
example, in the stems, leaves, hulls, husks, and cobs of plants or
leaves, branches, and wood of trees. The process of converting a
complex carbohydrate (such as starch or cellulose) into fermentable
sugars is also referred to herein as "saccharification."
Fermentable sugars, as used herein, refers to simple sugars, such
as glucose, xylose, arabinose, galactose, mannose, rhamnose,
sucrose and fructose.
[0169] Biomass can include virgin biomass and/or non-virgin biomass
such as agricultural biomass, commercial organics, construction and
demolition debris, municipal solid waste, waste paper and yard
waste. Common forms of biomass include trees, shrubs and grasses,
wheat, wheat straw, sugar cane bagasse, corn, corn husks, corn
kernel including fiber from kernels, products and by-products from
milling of grains such as corn, wheat and barley (including wet
milling and dry milling) as well as municipal solid waste, waste
paper and yard waste. The biomass can also be, but is not limited
to, herbaceous material, agricultural residues, forestry residues,
municipal solid wastes, waste paper, and pulp and paper mill
residues. "Agricultural biomass" includes branches, bushes, canes,
corn and corn husks, energy crops, forests, fruits, flowers,
grains, grasses, herbaceous crops, leaves, bark, needles, logs,
roots, saplings, short rotation woody crops, shrubs, switch
grasses, trees, vegetables, fruit peels, vines, sugar beet pulp,
wheat midlings, oat hulls, and hard and soft woods (not including
woods with deleterious materials). In addition, agricultural
biomass includes organic waste materials generated from
agricultural processes including farming and forestry activities,
specifically including forestry wood waste. Agricultural biomass
may be any of the aforestated singularly or in any combination or
mixture thereof.
[0170] Biomass high in starch, sugar, or protein such as corn,
grains, fruits and vegetables are usually consumed as food.
Conversely, biomass high in cellulose, hemicellulose and lignin are
not readily digestible and are primarily utilized for wood and
paper products, animal feed, fuel, or are typically disposed.
Generally, the substrate is of high lignocellulose content,
including distillers' dried grains corn stover, corn cobs, rice
straw, hay, sugarcane bagasse, and other agricultural biomass,
switchgrass, forestry wastes, poplar wood chips, pine wood chips,
sawdust, yard waste, and the like, including any combination of
substrate.
[0171] In one embodiment, the lignocellulosic material is
distillers' dried grains (DDG). DDG (also known as dried
distiller's grain, or distiller's spent grain) is spent, dried
grains recovered after alcohol fermentation. The lignocellulosic
material can also be distiller's dried grain with soluble material
recycled back (DDGS). While reference will be made herein to DDG
for convenience and simplicity, it should be understood that both
DDG and DDGS are contemplated as desired lignocellulosic materials.
These are largely considered to be waste products and can be
obtained after the fermentation of the starch derived from any of a
number of grains, including corn, wheat, barley, oats, rice and
rye. In one embodiment the DDG is derived from corn.
[0172] It should be noted that the distiller's grains do not
necessarily have to be dried. Although the grains normally,
currently dried, water and enzymes are added to the DDG substrate
in the present invention. If the saccharification were done on
site, the drying step could be eliminated and enzymes could be
added to the distiller's grains without drying.
[0173] Due in part to the many components that comprise biomass and
lignocellulosic materials, enzymes or a mixture of enzymes capable
of degrading xylan, lignin, protein, and carbohydrates are needed
to achieve saccharification. The present invention includes enzymes
or compositions thereof with, for example, cellobiohydrolase,
endoglucanase, xylanase, .beta.-glucosidase, hemicellulase and
chitinase activities.
[0174] The enzymes of the present invention may also be used for
stone washing cellulosic fabrics such as cotton denim), linen,
hemp, ramie, cupro, lyocell, newcell, rayon and the like. See, for
example, U.S. Pat. No. 6,015,707. The enzymes and compositions of
the present invention are suitable for industrial textile
applications in addition to the stone washing process. For example,
cellulases are used in detergent compositions, either for the
purpose of enhancing the cleaning ability of the composition or as
a softening agent. When so used, the cellulase will degrade a
portion of the cellulosic material, e.g., cotton fabric, in the
wash, which facilitates the cleaning and/or softening of the
fabric. The endoglucanase components of fungal cellulases have also
been used for the purposes of enhancing the cleaning ability of
detergent compositions, for use as a softening agent, and for use
in improving the feel of cotton fabrics, and the like. Enzymes and
compositions of the present invention may also be used in the
treatment of paper pulp (e.g., for improving the drainage or for
de-inking of recycled paper) or for the treatment of wastewater
streams (e.g., to hydrolyze waste material containing cellulose,
hemicellulose and pectins to soluble lower molecular weight
polymers).
[0175] In one aspect, the present invention includes proteins
isolated from, or derived from the knowledge of enzymes from, a
fungus such as C. lucknowense or a mutant or other derivative
thereof, and more particularly, from the fungal strain denoted
herein as C1 (Accession No. VKM F-3500-D). Preferably, the proteins
of the invention possess enzymatic activity. As described in U.S.
Pat. No. 6,015,707 or U.S. Pat. No. 6,573,086, each of which is
incorporated herein by reference for all purposes, a strain called
C1 (Accession No. VKM F-3500-D), was isolated from samples of
forest alkaline soil from Sola Lake, Far East of the Russian
Federation. This strain was deposited at the All-Russian Collection
of Microorganisms of Russian Academy of Sciences (VKM), Bakhurhina
St. 8, Moscow, Russia, 113184, under the terms of the Budapest
Treaty on the International Regulation of the Deposit of
Microorganisms for the Purposes of Patent Procedure on Aug. 29,
1996, as Chrysosporium lucknowense Garg 27K, VKM-F 3500 D. Various
mutant strains of C. lucknowense C1 have been produced and these
strains have also been deposited at the All-Russian Collection of
Microorganisms of Russian Academy of Sciences (VKM), Bakhurhina St.
8, Moscow, Russia, 113184, under the terms of the Budapest Treaty
on the International Regulation of the Deposit of Microorganisms
for the Purposes of Patent Procedure on Sep. 2, 1998. For example,
Strain C1 was mutagenised by subjecting it to ultraviolet light to
generate strain UV13-6 (Accession No. VKM F-3632 D). This strain
was subsequently further mutated with
N-methyl-N'-nitro-N-nitrosoguanidine to generate strain NG7C-19
(Accession No. VKM F-3633 D). This latter strain in turn was
subjected to mutation by ultraviolet light, resulting in strain
UV18-25 (VKM F-3631 D). Strain C1 was classified as a Chrysosporium
lucknowense based on morphological and growth characteristics of
the microorganism, as discussed in detail in U.S. Pat. No.
6,015,707 and U.S. Pat. No. 6,573,086.
[0176] In certain embodiments of the present invention, a protein
of the invention comprises, consists essentially of, or consists of
an amino acid sequence selected from: SEQ ID NO:2, SEQ ID NO:5. SEQ
ID NO:8, SEQ ID NO:11, SEQ ID NO:14, SEQ ID NO:17, SEQ ID NO:20,
SEQ ID NO:23, SEQ ID NO:26, SEQ ID NO:29, SEQ ID NO:32, SEQ ID
NO:35, SEQ ID NO:38, SEQ ID NO:41, SEQ ID NO:44, SEQ ID NO:47. SEQ
ID NO:50, SEQ ID NO:53, SEQ ID NO:56, SEQ ID NO:59, SEQ ID NO:62,
SEQ ID NO:65, SEQ ID NO:68, SEQ ID NO:71, SEQ ID NO:74, SEQ ID
NO:77, SEQ ID NO:80. SEQ ID NO:83, SEQ ID NO:86, SEQ ID NO:89, SEQ
ID NO:92, or SEQ ID NO:94. The present invention also includes
homologues of any of the above sequences, including fragments and
sequences having a given identity to any of the above sequences,
wherein the homologue or fragment has at least one biological
activity of the wild-type protein, as described herein.
[0177] In general, the proteins disclosed herein possess
carbohydrate enzymatic activity, or the ability to degrade
carbohydrate-containing materials. More specifically, the proteins
may possess cellulase activity such as endoglucanase activity
(e.g., 1,4-.beta.-D-glucan-4-glucanohydrolases), exoglucanase
activity (e.g., 1,4-.beta.-D-glucan cellobiohydrolases), and
.beta.-glucosidase activity. The proteins may possess hemicellulase
activity such as endoxylanase activity, exoxylanase activity, or
.beta.-xylosidase activity. The proteins may possess laminarinase,
xyloglucanase, galactanase, glucoamylase, pectate lyase, chitinase,
exo-.beta.-D-glucosaminidase, cellobiose dehydrogenase, acetylxylan
esterase, ligninase, amylase, glucuronidase, ferulic acid esterase,
arabinofuranosidase, pectin methyl esterase, arabinase, lipase,
glucosidase or glucomannanase activities. Physical properties,
biochemical characteristics and substrate specificities of proteins
of the present invention are illustrated below.
[0178] As used herein, "carbohydrase" refers to any protein that
catalyzes the hydrolysis of carbohydrates, "Glycoside hydrolase" or
"glycosidase" refers to a protein that catalyzes the hydrolysis of
the glycosidic bonds between carbohydrates or between a
carbohydrate and a non-carbohydrate residue. Endoglucanases,
cellebiohydrolases, .beta.-glucosidases, .alpha.-glucosidases,
xylanases, .beta.-xylosidases, galactanases,
.alpha.-galactosidases, .beta.-galactosidases, .alpha.-amylases,
glucoamylases, endo-arabinases, arabinofuranosidases, mannanases,
.beta.-mannosidases, pectinases, acetyl xylan esterases, acetyl
mannan esterases, ferulic acid esterases, coumaric acid esterases,
pectin methyl esterases, and chitinases are examples of
glycosidases.
[0179] "Cellulase" refers to a protein that catalyzes the
hydrolysis of 1,4-.beta.-D-glycosidic linkages in cellulose (such
as bacterial cellulose, cotton, filter paper, phosphoric acid
swollen cellulose, Avicel); cellulose derivatives (such as
carboxymethylcellulose and hydroxyethylcellulose); plant
lignocellulosic materials, beta-D-glucans or xyloglucans. Cellulose
is a linear beta-(1-4) glucan consisting of anhydrocellobiose
units. Endoglucanases, cellobiohydrolases, and .beta.-glucosidases
are examples of cellulases. "Endoglucanase" refers to a protein
that catalyzes the hydrolysis of cellulose to oligosaccharide
chains at random locations by means of an endoglucanase activity.
"Cellobiohydrolase" refers to a protein that catalyzes the
hydrolysis of cellulose to cellobiose via an exoglucanase activity,
sequentially releasing molecules of cellobiose from the reducing or
non-reducing ends of cellulose or cellooligosaccharides.
".beta.-glucosidase" refers to an enzyme that catalyzes the
conversion of cellobiose and oligosaccharides to glucose.
[0180] "Hemicellulase" refers to a protein that catalyzes the
hydrolysis of hemicellulose, such as that found, in lignocellulosic
materials. Hemicellulose is a complex polymer, and its composition
often varies widely from organism to organism, and from one tissue
type to another. Hemicelluloses include a variety of compounds,
such as xylans, arabinoxylans, xyloglucans, mannans, glucomannans,
and galactomannans. Hemicellulose can also contain glucan, which is
a general term for beta-linked glucose residues. In general, a main
component of hemicellulose is beta-1,4-linked xylose, a five carbon
sugar. However, this xylose is often branched as beta-1,3 linkages
or beta-1,2 linkages, and can be substituted with linkages to
arabinose, galactose, mannose, glucuronic acid, or by
esterification to acetic acid. The composition, nature of
substitution, and degree of branching of hemicellulose is very
different in dicotyledonous plants (dicots, i.e., plant whose seeds
have two cotyledons or seed leaves such as lima beans, peanuts,
almonds, peas, kidney beans) as compared to monocotyledonous plants
(monocots: i.e., plants having a single cotyledon or seed leaf such
as corn, wheat, rice, grasses, barley). In dicots, hemicellulose is
comprised mainly of xyloglucans that are 1,4-beta-linked glucose
chains with 1,6-alpha-linked xylosyl side chains. In monocots,
including most grain crops, the principal components of
hemicellulose are heteroxylans. These are primarily comprised of
1,4-beta-linked xylose backbone polymers with 1,2- or 1,3-beta
linkages to arabinose, galactose and mannose as well as xylose
modified by ester-linked acetic acids. Also present are branched
beta glucans comprised of 1,3- and 1,4-beta-linked glucosyl chains.
In monocots, cellulose, heteroxylans and beta glucans are present
in roughly equal amounts, each comprising about 15-25% of the dry
matter of cell walls. Hemicellulolytic enzymes, i.e.
hemicellulases, include both endo-acting and exo-acting enzymes,
such as xylanases, .beta.-xylosidases, galactanases,
.alpha.-galactosidases, .beta.-galactosidases, endo-arabinases,
arabinofuranosidases, mannanases. .beta.-mannosidases.
Hemicellulases also include the accessory enzymes, such as
acetylesterases, ferulic acid esterases, and coumaric acid
esterases. Among these, xylanases and acetyl xylan esterases cleave
the xylan and acetyl side chains of xylan and the remaining
xylo-oligomers are unsubstituted and can thus be hydrolysed with
.beta.-xylosidase only. In addition, several less known side
activities have been found in enzyme preparations which hydrolyze
hemicellulose. Accordingly, xylanases, acetylesterases and
.beta.-xylosidases are examples of hemicellulases.
[0181] "Xylanase" specifically refers to an enzyme that hydrolyzes
the .beta.-1,4 bond in the xylan backbone, producing short
xylooligosaccharides.
[0182] .beta.-Mannanases hydrolyze mannan-based hemicelluloses
(mannan, glucomannan, galactomannan) and produce short
.beta.-1,4-mannooligosaccharides.
[0183] "Galactanase" or "arabinogalactan
endo-1,4-.beta.-galactosidase" refers to a protein that catalyzes
the hydrolysis of endo-1,4-.beta.-D-galactosidic linkages in
arabinogalactans.
[0184] "Glucoamylase" refers to a protein that catalyzes the
hydrolysis of terminal 1,4-linked .alpha.-D-glucose residues
successively from non-reducing ends of the chains with the release
of .beta.-D-glucose.
[0185] "Chitinase" or "exo-.beta.-D-chitinase" refers to a protein
that catalyzes the hydrolysis of N-acetyl-.beta.-D-glucosaminide
1,4-O-linkages in chitin and chitodextrins.
[0186] ".alpha.-L-arabinofuranosidase" or "arabinofuranosidase"
refers to a protein that hydrolyzes arabinofuranosyl-containing
hemicelluloses. Some of these enzymes remove arabinofuranoside
residues from O-2 and/or O-3 single substituted xylose residues, as
well as from O-2 and O-3 doubly substituted xylose residues
[0187] "Endo-arabinase" refers to a protein that catalyzes the
hydrolysis of 1,5-.alpha.-arabinofuranosidic linkages in
1,5-arabinans.
[0188] ".beta.-xylosidase" refers to a protein that hydrolyzes
short 1,4-.beta.-D-xylooligomers into xylose.
[0189] "Acetyl xylan esterase" refers to a protein that catalyzes
the removal of the acetyl groups from xylose residues. "Acetyl
mannan esterase" refers to a protein that catalyzes the removal of
the acetyl groups from mannose residues. "Feruloyl esterase" or
"ferulic acid esterase" refers to a protein that hydrolyzes the
ester bond between the arabinose substituent group and ferulic
acid. "Coumaric acid esterase" refers to a protein that hydrolyzes
the ester bond between the arabinose substituent group and coumaric
acid. Acetyl xylan esterases, ferulic acid esterases and pectin
methyl esterases are examples of carbohydrate esterases.
[0190] "Pectate lyase" refers to a protein that catalyzes the
cleavage of 1,4-.alpha.-D-galacturonan in oligosaccharide
substrates.
[0191] Glycosidases (glycoside hydrolases; GH), a large family of
enzymes that includes cellulases and hemicellulases, catalyze the
hydrolysis of glycosidic linkages, predominantly in carbohydrates.
Glycosidases such as the proteins of the present invention may be
assigned to families on the basis of sequence similarities, and
there are now over 100 different such families defined (see the
CAZy (Carbohydrate Active EnZymes database) website, maintained by
the Architecture of Fonction de Macromolecules Biologiques of the
Centre National de la Recherche Scientifique, which describes the
families of structurally-related catalytic and carbohydrate-binding
modules (or functional domains) of enzymes that degrade, modify, or
create glycosidic bonds; Coutinho, P. M. & Henrissat, B. (1999)
Carbohydrate-active enzymes: an integrated database approach. In
"Recent Advances in Carbohydrate Bioengineering", H. J. Gilbert. G.
Davies, B. Henrissat and B. Svensson eds., The Royal Society of
Chemistry, Cambridge, pp. 3-12). Because there is a direct
relationship between the amino acid sequence of a protein and its
folding similarities, such a classification reflects the structural
features of these enzymes and their substrate specificity. Such a
classification system can help to reveal the evolutionary
relationships between these enzymes and provide a convenient tool
to determine information such as a enzyme's activity and function.
Thus, enzymes assigned to a particular family based on sequence
homology with other members of the family are expected to have
similar enzymatic activities and related substrate specifities.
CAZy family classifications also exist for glycosyltransferases
(GT), polysaccharide lyases (PL), and carbohydrate esterases (CE).
Likewise, sequence homology may be used to identify particular
domains within proteins, such as cellulose binding modules (CBMs;
also known as cellulose binding domains (CBDs)). The CAZy
homologies of proteins of the present invention are disclosed
below.
[0192] Proteins of the present invention may also include
homologues and fragments of the proteins disclosed herein. The
protein fragments include, but are not limited to, fragments
comprising a catalytic domain (CD) and/or a cellulose-binding
domain (also known as a cellulose binding module (CBM); both are
referred to herein as CBD). The identity and location of domains
within proteins of the present invention are disclosed in detail
below. The present invention encompasses all combinations of the
disclosed domains. For example, a protein fragment may comprise a
CD of a protein but not a CBD of the protein or a CBD of a protein
but not a CD. Similarly, domains from different proteins may be
combined. Protein fragments comprising a CD, CBD or combinations
thereof for each protein disclosed herein can be readily produced
using standard techniques known in the art. In some embodiments, a
protein fragment comprises a domain of a protein that has at least
one biological activity of the full-length protein. Homologues of
proteins of the invention that have at least one biological
activity of the full-length protein are described in detail below.
As used herein, the phrase "biological activity" of a protein
refers to any function(s) exhibited or performed by the protein
that is ascribed to the naturally occurring form of the protein as
measured or observed in vitro or in vivo. In certain embodiments, a
protein fragment comprises a domain of a protein that has the
catalytic activity of the full-length enzyme. Specific biological
activities of the proteins of the invention, and structures within
the proteins that are responsible for the activities, are described
below.
Cellobiohydrolases
[0193] Certain proteins of the present invention possess
cellobiohydrolase enzymatic activity. For example, the polypeptides
denoted herein as CBH Ia (SEQ ID NO:2), CBH Ib (SEQ ID NO:5), CBH
IIa (SEQ ID NO:8), and CBH IIb (SEQ ID NO:11) possess
cellobiohydrolase activity. These enzymes participate in the
hydrolytic conversion of insoluble cellulose to
cellooligosaccharides, with cellobiose being a primary product of
the enzymatic catalysis.
[0194] The enzyme denoted CBH Ia is encoded by the genomic nucleic
acid sequence represented herein as SEQ ID NO:1 and the cDNA
sequence represented herein as SEQ ID NO:3. The CBH Ia nucleic acid
sequence encodes a 526 amino acid sequence, represented herein as
SEQ ID NO:2. The signal peptide for CBH Ia is located from
positions 1 to about position 17 of SEQ ID NO:2, with the mature
protein spanning from about position 18 to position 526 of SEQ ID
NO:2. Within CBH Ia are two domains: a catalytic domain (CD) and a
cellulose-binding domain (CBD). The amino acid sequence containing
the CD of CBH Ia spans from a starting point of about position 19
of SEQ ID NO:2 to an ending point of about position 453 of SEQ ID
NO:2. The amino acid sequence containing the CBD of CBH Ia spans
from a starting point of about position 490 of SEQ ID NO:2 to an
ending point of about position 519 of SEQ ID NO:2. Based on
homology, CBH Ia can be assigned to CAZy Families GH 7 and CBM 1.
As evidenced below. CBH Ia exhibits cellobiohydrolase activity.
[0195] The enzyme denoted CBH Ib is encoded by the nucleic acid
sequence represented herein as SEQ ID NO:4 and the cDNA sequence
represented herein as SEQ ID NO:6. The CBH Ib nucleic acid sequence
encodes a 450 amino acid sequence, represented herein as SEQ ID
NO:5. The signal peptide for CBH Ia is located from positions 1 to
about position 20 of SEQ ID NO:5, with the mature protein spanning
from about position 21 to position 450 of SEQ ID NO:5. Within CBH
Ia is a catalytic domain (CD). The amino acid sequence containing
the CD of CBH Ib spans from a starting point of about position 22
of SEQ ID NO:5 to an ending point of about position 450 of SEQ ID
NO:5. Based on homology, CBH Ib can be assigned to CAZy Family GH
7. As evidenced below, CBH Ib exhibits cellobiohydrolase
activity.
[0196] The enzyme denoted CBH IIa is encoded by the nucleic acid
sequence represented herein as SEQ ID NO:7 and the cDNA sequence
represented herein as SEQ ID NO:9. The CBH IIa nucleic acid
sequences encodes a 395 amino acid sequence, represented herein as
SEQ ID NO:8. The signal peptide for CBH IIa is located from
positions 1 to about position 17 of SEQ ID NO:8, with the mature
protein spanning from about position 18 to position 395 of SEQ ID
NO:8. Within CBH IIa is a catalytic domain (CD). The amino acid
sequence containing the CD of CBH IIa spans from a starting point
of about position 40 of SEQ ID NO:8 to an ending point of about
position 395 of SEQ ID NO:8. Based on homology, CBH IIa can be
assigned to CAZy Family GH 6. As evidenced below, CBH IIa exhibits
cellobiohydrolase activity.
[0197] The enzyme denoted CBH IIb is encoded by the nucleic acid
sequence represented herein as SEQ ID NO:10 and the cDNA sequence
represented herein as SEQ ID NO:12. The CBH IIb nucleic acid
sequence encodes a 482 amino acid sequence, represented herein as
SEQ ID NO:11. The signal peptide for CBH IIb is located from
positions 1 to about position 17 of SEQ ID NO:11, with the mature
protein spanning from about position 18 to position 482 of SEQ ID
NO:11. Within CBH IIb are two domains: a catalytic domain (CD) and
a cellulose-binding domain (CBD). The amino acid sequence
containing the CBD of CBH lib spans from a starting point of about
position 27 of SEQ ID NO: 11 to an ending point of about position
62 of SEQ ID NO:11. The amino acid sequence containing the CD of
CBH IIb spans from a starting point of about position 125 of SEQ ID
NO:11 to an ending point of about position 482 of SEQ ID NO:11.
Based on homology, CBH IIb can be assigned to CAZy Families GH 6
and CBM 1. As evidenced below, CBH IIb exhibits cellobiohydrolase
activity.
[0198] Physical properties, biochemical characteristics and
substrate specificities of cellobiohydrolases of the present
invention are illustrated in Tables 1 and 2 below. Physical and
biochemical characteristics include molecular weight, isoelectric
point, pH and temperature optima of enzymatic activity, and the
stability of the enzymes after heating to 50.degree. C. for five
hours at pH 5.0 or 7.0. Substrate specificities demonstrating
enzyme activity for the indicated substrates were determined using
standard assays known in the art, such as those disclosed in the
Examples below. FIG. 1 illustrates the specific activity of
cellobiohydrolases of the present invention toward substrates
selected from Table 2.
TABLE-US-00001 TABLE 1 Physical and Biochemical Properties of C1
Cellobiohydrolases SEQ ID MW T .degree. C. Stability Stability
Enzyme NO (kDa) pI pH (50%) (50%) pH 5.0 pH 7.0 CBH Ia 2 65 4.5 5.0
60 99% 96% (3.5-6.5) (45->80) CBH Ia 2 52 4.5 5.0 60 97% 95%
(CD) (CD (3.5-6.5) (45->80) Only) CBH Ib 5 60 3.8 4.7 65 80% 80%
(3.3-6.3) (50->80) CBH 8 43 4.2 5.0 60 98% 70% IIa (3.5-6.5)
(50-72) CBH 11 70 5.6 5.0 60 100% 80% IIb (<3.0-6.5) (50-75) MW
= Molecular Weight, kiloDaltons (kDa) pI = isoelectric point pH
(50%) = pH optimum of enzyme activity (range of at least 50%
activity) T .degree. C. (50%) = Temperature optimum of enzyme
activity (range of at least 50% activity) Stability pH 5.0 =
residual enzyme activity after 5 hour incubation at 50.degree. C.,
pH 5.0 Stability pH 7.0 = residual enzyme activity after 5 hour
incubation at 50.degree. C., pH 7.0
TABLE-US-00002 TABLE 2 Substrate Specificities of C1
Cellobiohydrolases (Activity Towards Specific Substrates
(units/mg)) CBH Ia Substrate CBH Ia (CD) CBH Ib CBH IIa CBH IIb CMC
0.2 0.2 0.3 1.1 0.1 (S--N Assay) CMC 0.01 0.02 0.02 0.6 0.01
(Viscometric) .beta.-glucan <0.1 <0.1 <0.1 2.0 0.2 Avicel
0.21 0.10 0.12 0.08 0.22 Birchwood Xylan 0 0 0 1.4 0.03 Xyloglucan
0 0 0 0 0 Laminarin 0 0 0 0 0 Arabinan 0 0 0 0.1 0 Galactan 0 0 0 0
0 Gaiactomannan 0 0 0 0 0 p-NPh-.beta.-D- 0.021 0.025 0.02 0 0
cellobioside p-NPh-p-D- 0.12 0.11 0.09 0 0 lactoside
[0199] pH and temperature dependencies of cellobiohydrolases of the
present invention are illustrated in Tables 3 and 4 below. pH and
temperature dependencies were determined using the Avicel cellulase
assay described in Example 5 below. pH dependencies were assayed by
varying the pH at 40.degree. C. for one hour, while temperature
dependencies were assayed by varying the temperature at pH 5.0 for
one hour. The temperature/pH that exhibited the highest enzyme
activity was set at 100% in the tables below, with the remaining
values expressed a percentage of this standard. The pH and
temperature dependencies of the cellobiohydrolases are also
illustrated graphically in FIGS. 2 and 3, respectively.
TABLE-US-00003 TABLE 3 pH Dependency of C1 Cellobiohydrolases (%
Activity) CBH Ia pH CBH Ia (CD) CBH Ib CBH IIa CBH IIb 3 23 35 22
21 72 3.55 50 50 48 50.0 87 3.9 58 61 85 68 89 4.3 75 80 95 85 91
4.7 95 95 100 99 95 5.08 100 100 92 100 100 5.6 98 98 75 98 98 6 77
78 58 78 94 6.6 48 49 46 51.6 50 7.1 17.4 20 20 42 30 7.5 2 8 23
10
TABLE-US-00004 TABLE 4 Temperature Dependency of C1
Cellobiohydrolases (% Activity) CBH Ia Temp .degree. C. CBH Ia (CD)
CBH Ib CBH IIa CBH IIb 30 11.0 10 35 20 20 20.0 18 40 39 32 28 30
45 50 48 43 41 50 62 55 25 55 55.6 55 82 67 54 78 73 60 95 82 92
100 89 65 100 100 100 90 100 70 85 92 29 64 76 75 40 75 6 40 36.5
80 15 20 2 20 23
Endoglucanases
[0200] Some proteins of the present invention possess
endo-1,4-.beta.-glucanase enzymatic activity. For example, the
polypeptides EG I (SEQ ID NO:14), EG II (SEQ ID NO:94), EG III (SEQ
ID NO:17), EG V (SEQ ID NO:20), and EG VI (SEQ ID NO:23) possess
endo-1,4-.beta.-glucanase activity. As with the cellobiohydrolases
described above, these endoglucanases also participate in the
hydrolytic conversion of insoluble cellulose to
cellooligosaccharides, with cellobiose being a primary product of
the enzymatic catalysis.
[0201] The enzyme denoted EG I is encoded by the nucleic acid
sequence represented herein as SEQ ID NO:13 and the cDNA sequence
represented herein as SEQ ID NO:15. The EG I nucleic acid sequence
encodes a 456 amino acid sequence, represented herein as SEQ ID
NO:14. The signal peptide for EG I is located from positions 1 to
about position 20 of SEQ ID NO:14, with the mature protein spanning
from about position 21 to position 456 of SEQ ID NO:14. Within EG I
is a catalytic domain (CD). The amino acid sequence containing the
CD of EG I spans from a starting point of about position 22 of SEQ
ID NO:14 to an ending point of about position 420 of SEQ ID NO:14.
Based on homology, EG I can be assigned to CAZy Family GH 7. As
evidenced below, EG I exhibits endoglucanase activity.
[0202] The enzyme denoted EG II is a 389 amino acid sequence,
represented herein as SEQ ID NO:94. The signal peptide for EG II is
located from positions 1 to about position 16 of SEQ ID NO:94, with
the mature protein spanning from about position 17 to position 389
of SEQ ID NO:94. Within EG II are two domains: a catalytic domain
(CD) and a cellulose-binding domain (CBD). The amino acid sequence
containing the CBD of EG II spans from a starting point of about
position 17 of SEQ ID NO:94 to an ending point of about position 52
of SEQ ID NO:94. The amino acid sequence containing the CD of EG II
spans from a starting point of about position 84 of SEQ ID NO:94 to
an ending point of about position 388 of SEQ ID NO:94. As evidenced
below, EG II exhibits endoglucanase activity.
[0203] The enzyme denoted EG III is encoded by the nucleic acid
sequence represented herein as SEQ ID NO:16 and the cDNA sequence
represented herein as SEQ ID NO:18. The EG III nucleic acid
sequence encodes a 247 amino acid sequence, represented herein as
SEQ ID NO:17. The signal peptide for EG III is located, from
positions 1 to about position 15 of SEQ ID NO:17, with the mature
protein spanning from about position 16 to position 247 of SEQ ID
NO:17. Within EG III is a catalytic domain (CD). The amino acid
sequence containing the CD of EG III spans from a starting point of
about position 91 of SEQ ID NO:17 to an ending point of about
position 247 of SEQ ID NO:17. Based on homology, EG III can be
assigned to CAZy Family GH 12. As evidenced below, EG III exhibits
endoglucanase activity.
[0204] The enzyme denoted EG V is encoded by the nucleic acid
sequence represented herein as SEQ ID NO:119 and the cDNA sequence
represented herein as SEQ ID NO:21. The EG V nucleic acid sequence
encodes a 225 amino acid sequence, represented herein as SEQ ID
NO:20. The signal peptide for EG V is located from positions 1 to
about position 18 of SEQ ID NO:20, with the mature protein spanning
from about position 19 to position 225 of SEQ ID NO:20. Within EG V
is a catalytic domain (CD). The amino acid sequence containing the
CD of EG V spans from a starting point of about position 23 of SEQ
ID NO:20 to an ending point of about position 224 of SEQ ID NO:20.
Based on homology, EG V can be assigned to CAZy Family GM 45. As
evidenced below, EG V exhibits endoglucanase activity.
[0205] The enzyme denoted EG VI is encoded by the nucleic acid
sequence represented herein as SEQ ID NO:22 and the cDNA sequence
represented herein as SEQ ID NO:24. The EG VI nucleic acid sequence
encodes a 381 amino acid sequence, represented herein as SEQ ID
NO:23. The signal peptide for EG VI is located from positions 1 to
about position 18 of SEQ ID NO:23, with the mature protein spanning
from about position 19 to position 381 of SEQ ID NO:23. Within EG
VI is a catalytic domain (CD). The amino acid sequence containing
the CD of EG VI spans from a starting point of about position 33 of
SEQ ID NO:23 to an ending point of about position 378 of SEQ ID
NO:23. Based on homology, EG VI can be assigned to CAZy Family GH
6. As evidenced below, EG VI exhibits endoglucanase activity.
[0206] Physical properties, biochemical characteristics and
substrate specificities of endoglucanases of the present invention
are illustrated in Tables 5 and 6 below. Physical and biochemical
characteristics include molecular weight, isoelectric point, pH and
temperature optima of enzymatic activity, and the stability of the
enzymes after heating to 50.degree. C. for five hours at pH 5.0 or
7.0. Substrate specificities demonstrating enzyme activity for the
indicated substrates were determined using standard assays known in
the art, such as those disclosed in the Examples below. FIG. 1
illustrates the specific activity of endoglucanases of the present
invention toward substrates selected from Table 6.
TABLE-US-00005 TABLE 5 Physical and Biochemical Properties of C1
Endoglucanases SEQ MW T .degree. C. Stability Stability Enzyme ID
NO (kDa) pI pH (50%) (50%) pH 5.0 pH 7.0 EG I 14 60 3.7 4.8 60 80%
80% (3.5-7.5) (43-76) EG II 97 51 4.8 4.7 70 93% 93% (3.5-7.5)
(53-77) EG II 97 (CD 44 6.0 5.2 70 90% 90% (CD) Only) (3.5-7.5)
(53-77) EG III 17 28 5.7 5.3 60 92% 74% (3.8-7.0) (30-75) EG V 20
25 4.0 5.5 65 95% 92% (3.5-8.5) (50-75) EG VI 23 47 5.7 6.0 70 60%
75% (4.5->8.5) (40-82) MW = Molecular Weight, kiloDaltons (kDa)
pI = isoelectric point pH (50%) = pH optimum of enzyme activity
(range of at least 50% activity) T .degree. C. (50%) = Temperature
optimum of enzyme activity (range of at least 50% activity)
Stability pH 5.0 = residual enzyme activity after 5 hour incubation
at 50.degree. C., pH 5.0 Stability pH 7.0 = residual enzyme
activity after 5 hour incubation at 50.degree. C., pH 7.0
TABLE-US-00006 TABLE 6 Substrate Specificities of C1 Endoglucanases
(Activity Towards Specific Substrates (units/mg)) Substrate EG I EG
II EG II (CD) EG III EG V EG VI CMC 12 52 59 11 17 14 (S--N Assay)
CMC 5 46 39 19 20 6 (Viscometric) .beta.-glucan 18 75 74 125 5.3 18
Avicel 0.08 0.19 0.06 0.02 0.06 0.07 Birchwood Xylan 0 0.18 0.07
0.2 0 0.08 Xyloglucan 0 0 0 15.6 1.9 0 Laminarin 0 0.1 0.1 0 0 0
Arabinan 0 0 0.3 0 0 0 Galactan 0 0 0.6 0 0 0.1 Galactomannan 0 1.1
1 0 0.1 0 p-NPh-.beta.-D- 0.12 0 0 0 0 0 cellobioside
p-NPh-.beta.-D- 0.27 0 0 0 0 0 lactoside
[0207] pH and temperature dependencies of endoglucanases of the
present invention are illustrated in Tables 7 and 8 below. pH and
temperature dependencies were determined using the CMC cellulase
assay described in Example 3 below, pH dependencies were assayed by
varying the pH at 50.degree. C. for five minutes, while temperature
dependencies were assayed by varying the temperature at pH 5.0 for
five minutes. The temperature/pH that exhibited the highest enzyme
activity was set at 100% in the tables below, with the remaining
values expressed a percentage of this standard. The pH and
temperature dependencies of the endoglucanases are also illustrated
graphically in FIGS. 4 and 5, respectively.
TABLE-US-00007 TABLE 7 pH Dependency of C1 Endoglucanases (%
Activity) pH EG I EG II EG II (CD) EG III EG V EG VI 2.7 9 23 16 0
16.5 8 3.34 30 39 29 10 48 17 4.04 88 87 77 60 81 37 4.777 100 100
95 98 94.5 75 5.36 98 96 100 100 100 97 6.06 87 86 96 89 98 100
6.73 74 75 81 70 88 96 7.38 55 58 64 38 76.5 91 8 33 43 38 15 67 75
8.44 23 32 25 8 52 68
TABLE-US-00008 TABLE 8 Temperature Dependency of C1 Endoglucanases
(% Activity) Temp .degree. C. EG I EG II EG II (CD) EG III EG V EG
VI 30 22.5 19 10.5 55.2 22 24.7 35 30.5 20 12 69.1 26 37.0 40 42.3
24 20.7 80.6 37 52 45 61.1 27 25 86.3 45 68.5 50 83.3 35.4 32 93.2
51 79.8 55 95.5 55 56 97.5 70 89.0 60 100 72 71 100 83.3 96.1 65
93.5 90 92 95.9 100 99 70 77.6 100 100 87.9 88 100 75 57.3 62 70
57.5 60 82.2 80 32.4 31 35 13.4 33 57.5
Xylanases
[0208] Certain proteins of the present invention possess xylanase
enzymatic activity. For example, the polypeptides Xyl 1 (SEQ ID
NO:26), Xyl 2 (SEQ ID NO:29), Xyl 3 (SEQ ID NO:32), Xyl 4 (SEQ ID
NO:35), Xyl 5 (SEQ ID NO:38), and Xyl 6 (SEQ ID NO:41) possess
xylanase activity. These xylanases participate in the hydrolytic
conversion of hemicellulose into xylose.
[0209] The enzyme denoted Xyl 1 is encoded by the nucleic acid
sequence represented herein as SEQ ID NO:25 and the cDNA sequence
represented herein as SEQ ID NO:27. The Xyl 1 nucleic acid sequence
encodes a 384 amino acid sequence, represented herein as SEQ ID
NO:26. The signal peptide for Xyl 1 is located from positions 1 to
about position 17 of SEQ ID NO:26, with the mature protein spanning
from about position 18 to position 384 of SEQ ID NO:26. Within Xyl
1 are two domains: a catalytic domain (CD) and a cellulose-binding
domain (CBD). The amino acid sequence containing the CBD of Xyl 1
spans from a starting point of about position 18 of SEQ ID NO:26 to
an ending point of about position 53 of SEQ ID NO:26. The amino
acid sequence containing the CD of Xyl 1 spans from a starting
point of about position 83 of SEQ ID NO:26 to an ending point of
about position 384 of SEQ ID NO:26. Based on homology, Xyl 1 can be
assigned to CAZy Families GH 10 and CBM 1. As evidenced below, Xyl
1 exhibits xylanase activity.
[0210] The enzyme denoted Xyl 2 is encoded by the nucleic acid
sequence represented herein as SEQ ID NO:28 and the cDNA sequence
represented herein as SEQ ID NO:30. The Xyl 2 nucleic acid sequence
encodes a 221 amino acid sequence, represented herein as SEQ ID
NO:29. The signal peptide for Xyl 2 is located from positions 1 to
about position 18 of SEQ ID NO:29, with the mature protein spanning
from about position 19 to position 221 of SEQ ID NO:29. Within Xyl
2 is a catalytic domain (CD). The amino acid sequence containing
the CD of Xyl 2 spans from a starting point of about position 25 of
SEQ ID NO:29 to an ending point of about position 219 of SEQ ID
NO:29. Based on homology. Xyl 2 can be assigned to CAZy Family GH
11. As evidenced below, Xyl 2 exhibits xylanase activity.
[0211] The enzyme denoted Xyl 3 is encoded by the nucleic acid
sequence represented herein as SEQ ID NO:31 and the cDNA sequence
represented herein as SEQ ID NO:33. The Xyl 3 nucleic acid sequence
encodes a 413 amino acid sequence, represented herein as SEQ ID
NO:32. The signal peptide for Xyl 3 is located from positions 1 to
about position 17 of SEQ ID NO:32, with the mature protein spanning
from about position 18 to position 413 of SEQ ID NO:32. Within Xyl
3 are two domains: a catalytic domain (CD) and a cellulose-binding
domain (CBD). The amino acid sequence containing the CD of Xyl 3
spans from a starting point of about position 18 of SEQ ID NO:32 to
an ending point of about position 336 of SEQ ID NO:32. The amino
acid sequence containing the CBD of Xyl 3 spans from a starting
point of about position 377 of SEQ ID NO:32 to an ending point of
about position 413 of SEQ ID NO:32. Based on homology, Xyl 3 can be
assigned to CAZy Family GH 10. As evidenced below, Xyl 3 exhibits
xylanase activity.
[0212] The enzyme denoted Xyl 4 is encoded by the nucleic acid
sequence represented herein as SEQ ID NO:34 and the cDNA sequence
represented herein as SEQ ID NO:36. The Xyl 4 nucleic acid sequence
encodes a 375 amino acid sequence, represented herein as SEQ ID
NO:35. The signal peptide for Xyl 4 is located from positions 1 to
about position 19 of SEQ ID NO:35, with the mature protein spanning
from about position 20 to position 375 of SEQ ID NO:35. Within Xyl
4 is a catalytic domain (CD). The amino acid sequence containing
the CD of Xyl 4 spans from a starting point of about position 31 of
SEQ ID NO:35 to an ending point of about position 354 of SEQ ID
NO:35. Based on homology, Xyl 4 can be assigned to CAZy Family GH
10. As evidenced below, Xyl 4 exhibits xylanase activity.
[0213] The enzyme denoted Xyl 5 is encoded by the nucleic acid
sequence represented herein as SEQ ID NO:37 and the cDNA sequence
represented herein as SEQ ID NO:39. The Xyl 5 nucleic acid sequence
encodes a 226 amino acid sequence, represented herein as SEQ ID
NO:38. The signal peptide for Xyl 5 is located from positions 1 to
about position 21 of SEQ ID NO:38, with the mature protein spanning
from about position 22 to position 226 of SEQ ID NO:38. Within Xyl
5 is a catalytic domain (CD). The amino acid sequence containing
the CD of Xyl 5 spans from a starting point of about position 42 of
SEQ ID NO:38 to an ending point of about position 223 of SEQ ID
NO:38. Based on homology, Xyl 5 can be assigned to CAZy Families GH
11 and CE 4. As evidenced below, Xyl 5 exhibits xylanase
activity.
[0214] The enzyme denoted Xyl 6 is encoded by the nucleic acid
sequence represented herein as SEQ ID NO:40 and the cDNA sequence
represented herein as SEQ ID NO:42. The Xyl 6 nucleic acid sequence
encodes a 228 amino acid sequence, represented herein as SEQ ID
NO:41. The signal peptide for Xyl 6 is located from positions 1 to
about position 20 of SEQ ID NO:41, with the mature protein spanning
from about position 21 to position 228 of SEQ ID NO:41. Within Xyl
6 is a catalytic domain (CD). The amino acid sequence containing
the CD of Xyl 6 spans from a starting point of about position 37 of
SEQ ID NO:41 to an ending point of about position 226 of SEQ ID
NO:41. Based on homology, Xyl 6 can be assigned to CAZy Family GH
11. As evidenced below, Xyl 6 exhibits xylanase activity.
[0215] Physical properties, biochemical characteristics and
substrate specificities of xylanases of the present invention are
illustrated in Tables 9, 10, 11 and 12 below. Physical and
biochemical characteristics include molecular weight, isoelectric
point, pH and temperature optima of enzymatic activity, and the
stability of the enzymes after heating to 50.degree. C. for five
hours at pH 5.0 or 7.0, or after heating to 80.degree. C. for
twenty seconds. Substrate specificities demonstrating enzyme
activity for the indicated substrates (specific and non-specific)
were determined using standard assays known in the art, such as
those disclosed in the Examples below. The degree of xylan
exhaustive hydrolysis of specific substrates is also indicated.
FIG. 6 illustrates the specific activity of xylanases of the
present invention toward substrates selected from Table 10.
TABLE-US-00009 TABLE 9 Physical and Biochemical Properties of C1
Xylanases SEQ MW Stability Stability Stability Enzyme ID NO (kDa)
pI pH T .degree. C. pH 5.0 pH 7.0 80.degree. C. Xyl 1 26 42 7.8
5.5-7.0 65-70 100% 100% 12% (81%) (85%) Xyl 1 26 (CD Only) 31 8.9
5.5-7.0 65-70 100% 100% 19% (CD) (76%) (88%) Xyl 2 29 24 7.9
5.7-7.5 70 84% 71% 23% (35%) (25%) Xyl 3 32 57 4.4 5.5-6.5 80-85
100% 100% 75% (90%) (92%) Xyl 3 32 (CD Only) 46 4.3 5.5-6.5 80-85
100% 100% 98% (CD) (95%) (97%) Xyl 4 35 38 4.8 5.0 80 100% 100% 82%
(82%) (70%) Xyl 5 38 22 6.7 4.5 65 45% Unstable 68% (35%) at pH
.gtoreq. 7.0 Xyl 6 41 23 8.4 6.0 65-70 90% 95% 60% (12%) (20%) MW =
Molecular Weight, kiloDaltons (kDa) pI = isoelectric point pH = pH
optimum of enzyme activity T .degree. C. = Temperature optimum of
enzyme activity Stability pH 5.0 = residual enzyme activity after 1
hour incubation at 50.degree. C. (60.degree. C.), pH 5.0 Stability
pH 7.0 = residual enzyme activity after 1 hour incubation at
50.degree. C. (60.degree. C.), pH 7.0 Stability 80.degree. C. =
residual enzyme activity after 20 seconds at 80.degree. C.
TABLE-US-00010 TABLE 10 Substrate Specificities of C1 Xylanases
(Activity Towards Specific Substrates (units/mg)) Xyl 1 Xyl 3
Substrate Xyl 1 (CD) Xyl 2 Xyl 3 (CD) Xyl 4 Xyl 5 Xyl 6 Birchwood
65 83 395 39 85 32 300 169 Xylan Beechwood 66 71 329 47 88 36 300
188 Xylan Wheat 83 102 494 52 98 40 143 268 Arabinoxylan Wheat 83
82 198 52 78 24 57 107 Arabinoxylan (Insoluble) Arabinoxylan 96 100
485 63 107 42 320 329 from Oat Spelts Viscometric 1311 1382 2158
807 1210 234 2700 2452 Activity, Wheat Arabinoxylan
TABLE-US-00011 TABLE 11 Degree of Xylan Exhaustive Hydrolysis of
Specific Substrates by C1 Xylanases (%) Xyl 1 Xyl 3 Substrate Xyl 1
(CD) Xyl 2 Xyl 3 (CD) Xyl 4 Xyl 5 Xyl 6 Birchwood 26 24 19 24 23 25
22 21 Xylan Beechwood 24 22 15 22 20 23 18 17 Xylan Wheat 26 28 15
30 29 24 18 17 Arabinoxylan Wheat 12 13 7 14 11 12 9 8 Arabinoxylan
(Insoluble) Arabinoxylan 28 27 19 29 25 26 20 19 from Oat
Spelts
TABLE-US-00012 TABLE 12 Substrate Specificities of C1 Xylanases
(Activity Towards Non-Specific Substrates (units/mg)) Xyl 1 Xyl 3
Substrate Xyl 1 (CD) Xyl 2 Xyl 3 (CD) Xyl 4 Xyl 5 Xyl 6 CMC 0.65 1
0 0.12 0.2 0 0 0 Avicel 0 0 0 0 0 0 0 0 p-NPh-.beta.-D- 0 0 0 0.3
0.6 0.3 0 0 xylopyranoside p-NPh-.beta.-D- 0.36 0.42 0 0.84 1.6 4.0
0 0 cellobioside
[0216] pH and temperature dependencies of xylanases of the present
invention are illustrated in Tables 13 and 14 below. pH and
temperature dependencies were determined using the xylanase assay
(on soluble xylan) described in Example 7 below. pH dependencies
were assayed by varying the pH at 40.degree. C. for 10 minutes,
while temperature dependencies were assayed by varying the
temperature at pH 5.0 for 10 minutes. The temperature/pH that
exhibited the highest enzyme activity was set at 100% in the table
below, with the remaining values expressed a percentage of this
standard. The pH and temperature dependencies of the xylanases are
also illustrated graphically in FIGS. 7 and 8, respectively.
TABLE-US-00013 TABLE 13 pH Dependency of C1 Xylanases (% Activity)
Xyl 1 Xyl 3 pH Xyl 1 (CD) Xyl 3 (CD) pH Xyl 4 pH Xyl 2 pH Xyl 6 pH
Xyl 5 2.5 0.00 0.46 1.83 0.42 3.0 0.37 3.1 0 2.35 0.00 2.44 0.00
3.53 22.00 25.48 39.99 25.32 3.5 36.33 3.8 5.89 3.53 14.25 3 13.65
4.67 70.23 73.56 74.68 71.26 4.0 78.51 4.2 26.66 4.42 45.85 3.5
36.97 4.92 85.63 83.69 84.23 86.62 4.5 94.89 4.8 70.06 4.92 75.55 4
80.39 5.57 100 97.00 97.22 95.00 5.0 100 5.3 90.00 5.57 94.76 4.5
100 6.03 99.59 100 98.33 97.53 5.5 89.22 5.8 97.26 6.03 100 5.1
70.32 6.64 97.33 95.34 100 100 6.0 73.65 6.4 100 6.64 96.92 5.5
45.14 7.1 95.46 90.08 90.43 91.99 6.5 52.16 6.9 99.05 7.1 78.61
5.81 29.11 7.61 90.76 85.16 79.49 79.90 7.0 40.83 7.9 89.62 7.61
60.77 6.75 0.00 8.71 72.12 62.86 61.10 63.44 8.0 19.64 9.5 4.40
8.71 21.99 7.2 0.00
TABLE-US-00014 TABLE 14 Temperature Dependency of C1 Xylanases (%
Activity) Xyl 1 Xyl 3 Temp .degree. C. Xyl 1 (CD) Xyl 2 Xyl 3 (CD)
Xyl 4 Xyl 5 Xyl 6 30 37.5 37.5 8.3 27.3 26.5 18.6 44.8 35.4 40 46.8
46.8 22.7 34.1 31.7 27.4 56.3 57.0 45 63.1 63.1 40.0 30.0 36.4 62.6
50 73.1 73.1 47.2 46.4 36.5 40.3 75.9 76.1 55 82.9 82.9 59.5 63.8
50.5 85.0 60 88.1 88.1 79.0 63.3 67.5 70.3 98.3 87.5 65 98.0 98.0
92.5 76.1 80.4 81.8 100.0 97.2 70 100.0 100.0 100.0 81.5 87.5 86.4
92.2 97.3 75 95.0 95.0 95.2 92.0 88.8 92.6 70.1 71.6 80 63.7 63.7
56.4 100.0 98.3 100.0 30.2 46.2 85 41.0 41.0 100.0 100.0 85.3 31.1
90 21.1 21.1 71.9 69.8 65.6 20.8
Additional Glucosidases
[0217] Certain proteins of the present invention possess
glycosidase enzymatic activity, including .beta.-glucosidase,
galactanase, glucoamylase, endo-arabinase, and .beta.-xylosidase
activities. The polypeptides denoted herein as Bgl3A (SEQ ID
NO:44), Gal53A (SEQ ID NO:47), Gla15A (SEQ ID NO:50), Abn1 (SEQ ID
NO:89) and bxl2 (SEQ ID NO:92) possess glycosidase activity, as
indicated below.
[0218] The enzyme Bgl3A is encoded by the nucleic acid sequence
represented herein as SEQ ID NO:43 and the cDNA sequence
represented herein as SEQ ID NO:45. The Bgl3A nucleic acid sequence
encodes a 871 amino acid sequence, represented herein as SEQ ID
NO:44. The signal peptide for Bgl3A is located from positions 1 to
about position 19 of SEQ ID NO:44, with the mature protein spanning
from about position 20 to position 871 of SEQ ID NO:111. Within
Bgl3A is a catalytic domain (CD). The amino acid sequence
containing the CD of Bgl3A spans from a starting point of about
position 87 of SEQ ID NO:44 to an ending point of about position
643 of SEQ ID NO:44. Based on homology, Bgl3A can be assigned to
CAZy Family GH 3.
[0219] The enzyme Gal53A is encoded by the nucleic acid sequence
represented herein as SEQ ID NO:46 and the cDNA sequence
represented herein as SEQ ID NO:48. The Gal53A nucleic acid
sequence encodes a 350 amino acid sequence, represented herein as
SEQ ID NO:47. The signal peptide for Gal53A is located from
positions 1 to about position 18 of SEQ ID NO:47, with the mature
protein spanning from about position 19 to position 350 of SEQ ID
NO:47. Within Gal53A is a catalytic domain (CD). The amino acid
sequence containing the CD of Gal53A spans from a starting point of
about position 19 of SEQ ID NO:47 to an ending point of about
position 334 of SEQ ID NO:47. Based on homology, Gal53A can be
assigned to CAZy Family GH 53. As evidenced below, Gal53A exhibits
galactanase (arabinogalactan endo-1,4-.beta.-galactosidase)
activity.
[0220] The enzyme Gla15A is encoded by the nucleic acid sequence
represented herein as SEQ ID NO:49 and the cDNA sequence
represented herein as SEQ ID NO:51. The Gla15A nucleic acid
sequence encodes a 628 amino acid sequence, represented herein as
SEQ ID NO:50. The signal peptide for Gla15A is located from
positions 1 to about position 20 of SEQ ID NO:50, with the mature
protein spanning from about position 21 to position 628 of SEQ ID
NO:50. Within Gla15A are two domains: a catalytic domain (CD) and a
starch-binding domain (CBM 20). The amino acid sequence containing
the CD of Gla15A spans from a starting point of about position 28
of SEQ ID NO:50 to an ending point of about position 461 SEQ ID
NO:50. The amino acid sequence containing the starch-binding domain
of Gla15A spans from a starting point of about position 518 of SEQ
ID NO:50 to an ending point of about position 628 of SEQ ID NO:50.
Based on homology, Gla15A can be assigned to CAZy Families GH 15
and CBM 20. As evidenced below, Gla15A exhibits glucoamylase
activity.
[0221] The enzyme Abn1 is encoded by the nucleic acid sequence
represented herein as SEQ ID NO:88 and the cDNA sequence
represented herein as SEQ ID NO:90. The Abn1 nucleic acid sequence
encodes a 321 amino acid sequence, represented herein as SEQ ID
NO:89. The signal peptide for Abn1 is located from positions 1 to
about position 20 of SEQ ID NO:89, with the mature protein spanning
from about position 21 to position 321 of SEQ ID NO:89. Within Abn1
is a catalytic domain (CD). The amino acid sequence containing the
CD of Abn1 spans from a starting point of about position 27 of SEQ
ID NO:89 to an ending point of about position 321 of SEQ ID NO:89.
Based on homology, Abn1 can be assigned to CAZy Family GH 43. As
illustrated below in Example 23, Abn1 exhibits endo-arabinase
activity as measured by the ability to hydrolyze AZCL-arabinan
(debranched). Abn1 also possesses significant homology (about 61%
from amino acids 13 to 321 of Abn1) with a endo-arabinase from
Aspergillus niger (Genbank Accession. No. AAA32682: see also
Flipphi et al., Appl. Microbial. Biotechnol. 40:3.18 (1993)). Based
on this degree of homology, Abn1 is expected to exhibit similar
endo-arabinase enzymatic activity.
[0222] The enzyme Bxl2 is encoded by the nucleic acid sequence
represented herein as SEQ ID NO:91 and the cDNA sequence
represented herein as SEQ ID NO:93. The Bxl2 nucleic acid sequence
encodes a 733 amino acid sequence, represented herein as SEQ ID
NO:92. The signal peptide for Bxl2 is located from positions 1 to
about position 17 of SEQ ID NO:92, with the mature protein spanning
from about position 1.8 to position 733 of SEQ ID NO:92. Within
Bxl2 is a catalytic domain (CD). The amino acid sequence containing
the CD of Bxl2 spans from a starting point of about position 26 of
SEQ ID NO:92 to an ending point of about position 602 of SEQ ID
NO:92. Based on homology, Bxl2 can be assigned to CAZy Family GH 3.
As illustrated below in Example 24, Bxl2 exhibits .beta.-xylosidase
activity. Bxl2 also possesses significant homology (about 70% from
amino acids 28 to 733 of Bxl2) with a .beta.-glucosidase from
Hypocrea jecorina (Genbank Accession No. 1713235A; see also Barnett
et al., Biotechnol. 9:562 (1991)). Based on this degree of
homology, Bxl2 is also expected to exhibit similar
.beta.-glucosidase enzymatic activity.
Pectate Lyases
[0223] Certain proteins of the present invention possess pectate
lyase enzymatic activity. The polypeptide denoted herein as Pel1
(SEQ ID NO:53) possesses pectate lyase activity.
[0224] The enzyme Pel1 is encoded by the nucleic acid sequence
represented herein as SEQ ID NO:52 and the cDNA sequence
represented herein as SEQ ID NO:54. The Pel1 nucleic acid sequence
encodes a 375 amino acid sequence, represented herein as SEQ ID
NO:53. The signal peptide for Pel1 is located from positions 1 to
about position 20 of SEQ ID NO:53, with the mature protein spanning
from about position 21 to position 375 of SEQ ID NO:53. Within Pel1
is a catalytic domain (CD). The amino acid sequence containing the
CD of Pel1 spans from a starting point of about position 21 of SEQ
ID NO:53 to an ending point of about position 375 of SEQ ID NO:53.
As evidenced below, Pel1 exhibits pectate lyase activity.
Chitinases
[0225] Certain proteins of the present invention possess chitinase
enzymatic activity. The polypeptides denoted herein as Chi18A (SEQ
ID NO:56) and Gls2A (SEQ ID NO:59) possess chitinase activity.
[0226] The enzyme Chi 18A is encoded by the nucleic acid sequence
represented herein as SEQ ID NO:55 and the cDNA sequence
represented herein as SEQ ID NO:57. The Chi18A nucleic acid
sequence encodes a 426 amino acid sequence, represented herein as
SEQ ID NO:56. The signal peptide for Chi18A is located from
positions 1 to about position 23 of SEQ ID NO:56, with the mature
protein spanning from about position 24 to position 426 of SEQ ID
NO:56. Within Chi18A is a catalytic domain (CD). The amino acid
sequence containing the CD of Chi 18A spans from a starting point
of about position 42 of SEQ ID NO:56 to an ending point of about
position 386 of SEQ ID NO:56. Based on homology, Chi18A can be
assigned to CAZy Family GH 18. As evidenced below, Chi18A exhibits
chitinase activity.
[0227] The enzyme Gls2A is encoded by the nucleic acid sequence
represented herein as SEQ ID NO:58 and the cDNA sequence
represented herein as SEQ ID NO:60. The Gls2A nucleic acid sequence
encodes a 907 amino acid sequence, represented herein as SEQ ID
NO:59. The signal peptide for Gls2A is located from positions 1 to
about position 20 of SEQ ID NO:59, with the mature protein spanning
from about position 21 to position 907 of SEQ ID NO:59. Within
Gls2A is a catalytic domain (CD). The amino acid sequence
containing the CD of Gls2A spans from a starting point of about
position 210 of SEQ ID NO:59 to an ending point of about position
764 of SEQ ID NO:59. Based on homology, Gls2A can be assigned to
CAZy Family GH 2. As evidenced below, Gls2A exhibits chitinase
(exo-.beta.-D-chitinase) activity.
Carbohydrate Esterases
[0228] Certain proteins of the present invention possess
carbohydrate esterase enzymatic activity, such as acetyl xylan
esterase, ferulic acid esterase and pectin methyl esterase activity
The polypeptides denoted herein as Axe3 (SEQ ID NO:62), FaeA1 (SEQ
ID NO:65), FaeA2 (SEQ ID NO:68), FaeB2 (SEQ ID NO:71), Axe1 (SEQ ID
NO:80), Axe2 (SEQ ID NO:83) and Pme1 (SEQ ID NO:86) possess
carbohydrate esterase activity.
[0229] The enzyme Axe3 is encoded by the nucleic acid sequence
represented herein as SEQ ID NO:6.1 and the cDNA sequence
represented herein as SEQ ID NO:63. The Axe3 nucleic acid sequence
encodes a 313 amino acid sequence, represented herein as SEQ ID
NO:62. The signal peptide for Axe3 is located from positions 1 to
about position 21 of SEQ ID NO:62, with the mature protein spanning
from about position 22 to position 313 of SEQ ID NO:62. Within Axe3
is a catalytic domain (CD). The amino acid sequence containing the
CD of Axe3 spans from a starting point of about position 22 of SEQ
ID NO:62 to an ending point of about position 255 of SEQ ID NO:62.
Based on homology, Axe3 can be assigned to CAZy Family CE 1. As
illustrated below in Example 19, Axe3 exhibits acetyl esterase
enzymatic activity. Axe3 also possesses significant homology (about
69% from amino acids 41 to 312 of Axe3) with an acetyl xylan
esterase from Penicillium purpurogenum (Genbank Accession No.
AAM93261.1; see also Gordillo et al., Mycol. Res. 110:1129 (2006)).
Based on this degree of homology. Axe3 is expected to exhibit
similar acetyl xylan esterase enzymatic activity.
[0230] The enzyme FaeA1 is encoded by the nucleic acid sequence
represented herein as SEQ ID NO:64 and the cDNA sequence
represented herein as SEQ ID NO:66. The FaeA1 nucleic acid sequence
encodes a 279 amino acid sequence, represented herein as SEQ ID
NO:65. The signal peptide for FaeA1 is located from positions 1 to
about position 20 of SEQ ID NO:65, with the mature protein spanning
from about position 21 to position 279 of SEQ ID NO:65. Within
FaeA1 is a catalytic domain (CD). The amino acid sequence
containing the CD of FaeA1 spans from a starting point of about
position 33 of SEQ ID NO:65 to an ending point of about position
216 of SEQ ID NO:65. Based on homology, FaeA1 can be assigned to
CAZy Family CE 1. As illustrated below in Example 20, FaeA1
exhibits ferulic acid esterase enzymatic activity. FaeA1 also
possesses significant homology (about 51% from amino acids 22 to
264 of FaeA1) with a ferulic acid esterase from Aspergillus
clavatus (Genbank Accession No. CAC85738). Based on this degree of
homology, FaeA1 is expected to exhibit similar ferulic acid
esterase enzymatic activity.
[0231] The enzyme FaeA2 is encoded by the nucleic acid sequence
represented herein as SEQ ID NO:67 and the cDNA sequence
represented herein as SEQ ID NO:69. The FaeA2 nucleic acid sequence
encodes a 302 amino acid sequence, represented herein as SEQ ID
NO:68. The signal peptide for FaeA2 is located from positions 1 to
about position 26 of SEQ ID NO:68, with the mature protein spanning
from about position 27 to position 302 of SEQ ID NO:68. As
illustrated below in Example 20, FaeA2 exhibits ferulic acid
esterase enzymatic activity. FaeA2 also possesses significant
homology (about 45% from amino acids 32 to 301 of FaeA2) with a
ferulic acid esterase from Aspergillus clavatus (Genbank Accession
No. CAC85738). Based on this degree of homology. FaeA2 is expected
to exhibit similar ferulic acid esterase enzymatic activity.
[0232] The enzyme FaeB2 is encoded by the nucleic acid sequence
represented herein as SEQ ID NO:70 and the cDNA sequence
represented herein as SEQ ID NO:72. The FaeB2 nucleic acid sequence
encodes a 319 amino acid sequence, represented herein as SEQ ID
NO:71. The signal peptide for FaeB2 is located from positions 1 to
about position 22 of SEQ ID NO:71, with the mature protein spanning
from about position 23 to position 319 of SEQ ID NO:71. Within
FaeB2 is a catalytic domain (CD). The amino acid sequence
containing the CD of FaeB2 spans from a starting point of about
position 63 of SEQ ID NO:71, to an ending point of about position
278 of SEQ ID NO:71. As illustrated below in Example 20, FaeB2
exhibits ferulic acid esterase enzymatic activity. FaeB2 also
possesses significant homology (about 68% from amino acids 46 to
319 of FaeB2) with a type B ferulic acid esterase from Neurospora
crassa (Genbank Accession No. CAC05587). Based on this degree of
homology, FaeB2 is expected to exhibit similar ferulic acid
esterase enzymatic activity.
[0233] The enzyme Axe1 is encoded by the nucleic acid sequence
represented herein as SEQ ID NO:79 and the cDNA sequence
represented herein as SEQ ID NO:81. The Axe1 nucleic acid sequence
encodes a 303 amino acid sequence, represented herein as SEQ ID
NO:80. The signal peptide for Axe1 is located from positions 1 to
about position 21 of SEQ ID NO:80, with the mature protein spanning
from about position 22 to position 303 of SEQ ID NO:80. Within Axe1
are two domains: a catalytic domain (CD) and a cellulose-binding
domain (CBD). The amino acid sequence containing the CD of Axe1
spans from a starting point of about position 37 of SEQ ID NO:80 to
an ending point of about position 246 of SEQ ID NO:80. The amino
acid sequence containing the CBD of Axe1 spans from a starting
point of about position 271 of SEQ ID NO:80 to an ending point of
about position 303 of SEQ ID NO:80. Based on homology, Axe1 can be
assigned to CAZy Families CE 5 and CBM 1. As illustrated below in
Example 19, Axe1 exhibits acetyl esterase enzymatic activity. Axe1
also possesses significant homology (about 57% from amino acids 3
to 303 of Axe1) with a acetyl xylan esterase from Hypocrea jecorina
(Genbank Accession No. CAA93247; see also Margolles-Clark et al.,
Eur. J. Biochem. 237:553 (1996)). Based on this degree of homology,
Axe1 is expected to exhibit similar acetyl xylan esterase enzymatic
activity.
[0234] The enzyme Axe2 is encoded by the nucleic acid sequence
represented herein as SEQ ID NO:82 and the cDNA sequence
represented herein as SEQ ID NO:84. The Axe2 nucleic acid sequence
encodes a 228 amino acid sequence, represented herein as SEQ ID
NO:83. The signal peptide for Axe2 is located from positions 1 to
about position 17 of SEQ ID NO:83, with the mature protein spanning
from about position 18 to position 228 of SEQ ID NO:83. Within Axe2
is a catalytic domain (CD). The amino acid sequence containing the
CD of Axe2 spans from a starting point of about position 26 of SEQ
ID NO:83 to an ending point of about position 242 of SEQ ID NO:83.
Based on homology, Axe2 can be assigned to CAZy Family CE 5. As
illustrated below in Example 19, Axe2 exhibits acetyl esterase
enzymatic activity. Axe2 also possesses significant homology (about
58% from amino acids 7 to 225 of Axe2) with a acetyl xylan esterase
from Penicillium purpurogenum (Genbank. Accession No. 059893; see
also Egana et al., Biotechnol. Appl. Biochem. 24(1):33 (1996)).
Based on this degree of homology, Axe2 is expected to exhibit
similar acetyl xylan esterase enzymatic activity.
[0235] The enzyme Pme1 is encoded by the nucleic acid sequence
represented herein as SEQ ID NO:85 and the cDNA sequence
represented herein as SEQ ID NO:87. The Pme1 nucleic acid sequence
encodes a 331 amino acid sequence, represented herein as SEQ ID
NO:86. The signal peptide for Pme1 is located from positions 1 to
about position 27 of SEQ ID NO:86, with the mature protein spanning
from about position 28 to position 331 of SEQ ID NO:86. Within Pme1
is a catalytic domain (CD). The amino acid sequence containing the
CD of Pme1 spans from a starting point of about position 43 of SEQ
ID NO:86 to an ending point of about position 327 of SEQ ID NO:86.
Based on homology, Pme1 can be assigned to CAZy Family CE 8. As
illustrated below in Example 22, Pme1 exhibits pectin methyl
esterase enzymatic activity on substrates such as methylated pectin
(e.g., the ability to hydrolyze citrus pectin with a degree of
methylation of 65 leading to the formation of unsubstituted
galacturonic acid). Pectin methyl esterases catalyze the hydrolysis
of methylester groups of pectins such as those found in the cell
walls of plants. Pme1 also possesses significant homology (about
55% from amino acids 7 to 331 of Pme1) with a pectin methyl
esterase from Emericella nidulans (Genbank Accession No.
ABF50865.1; see also Bauer et al. Proc. Natl. Acad. Sci. U.S.A.
103:11417 (2006)). Based on this degree of homology, Pme1 is
expected to exhibit similar pectin methyl esterase enzymatic
activity.
Arabinofuranosidases
[0236] Certain proteins of the present invention possess
arabinofuranosidase enzymatic activity. The polypeptides denoted
herein as Abf1 (SEQ ID NO:74) and Abf2 (SEQ ID NO:77) possess
arabinofuranosidase activity.
[0237] The enzyme Abf1 is encoded by the nucleic acid sequence
represented herein as SEQ ID NO:73 and the cDNA sequence
represented herein as SEQ ID NO:75. The Abf1 nucleic acid sequence
encodes a 370 amino acid sequence, represented herein as SEQ ID
NO:74. The signal peptide for Abf1 is located from positions 1 to
about position 22 of SEQ ID NO:74, with the mature protein spanning
from about position 23 to position 370 of SEQ ID NO:74. Within Abf1
are two domains: a catalytic domain (CD) and a cellulose-binding
domain (CBD). The amino acid sequence containing the CD of Abf1
spans from a starting point of about position 24 of SEQ ID NO:74 to
an ending point of about position 294 of SEQ ID NO:74. The amino
acid sequence containing the CBD of Abf1 spans from a starting
point of about position 339 of SEQ ID NO:74 to an ending point of
about position 370 of SEQ ID NO:74. Based on homology, Abf1 can be
assigned to CAZy Families CBM 1, CBM 13 and OH 62. As illustrated
below in Example 21, Abf1 exhibits .alpha.-arabinofuranosidase
enzymatic activity. Abf1 also possesses significant homology (about
70% from amino acids 19 to 322 of Abf1) with a arabinofuranosidase
from Streptomyces thermoviolaceus (Genbank Accession No. JC7820;
see also Tsujibo et al., Biosci. Biotechnol. Biochem. 66:434
(2002)). Based on this degree of homology, Abf1 is expected to
exhibit similar arabinofuranosidase enzymatic activity.
[0238] The enzyme Abf2 is encoded by the nucleic acid sequence
represented herein as SEQ ID NO:76 and the cDNA sequence
represented herein as SEQ ID NO:78. The Abf2 nucleic acid sequence
encodes a 321 amino acid sequence, represented herein as SEQ ID
NO:77. The signal peptide for Abf2 is located from positions 1 to
about position 19 of SEQ ID NO:77, with the mature protein spanning
from about position 20 to position 321 of SEQ ID NO:77. Within Abf2
is a catalytic domain (CD). The amino acid sequence containing the
CD of Abf2 spans from a starting point of about position 22 of SEQ
ID NO:77 to an ending point of about position 291 of SEQ ID NO:77.
Based on homology, Abf2 can be assigned to CAZy Family GH 62. As
illustrated below in Example 21, Abf2 exhibits
.alpha.-arabinofuranosidase enzymatic activity. Abf2 also possesses
significant homology (about 75% from amino acids 21 to 315 of Abf2)
with a arabinofuranosidase from Streptomyces thermoviolaceus
(Genbank Accession No. JC7820; see also Tsujibo et al., Biosci.
Biotechnol. Biochem. 66:434 (2002)). Based on this degree of
homology, Abf2 is expected to exhibit similar arabinofuranosidase
enzymatic activity.
[0239] Physical properties, biochemical characteristics and
substrate specificities of the glycosidases, pectate lyases,
chitinases, carbohydrate esterases, and arabinofuranosidases of the
present invention are illustrated in Tables 15, 16, 17 and 18
below. Physical and biochemical characteristics include molecular
weight, isoelectric point, pH and temperature optima of enzymatic
activity, and the stability of the enzymes after heating to
50.degree. C. for five hours at pH 5.0. Substrate specificities
demonstrating enzyme activity for the indicated substrates
(specific and non-specific) were determined using standard assays
known in the art, such as those disclosed in the Examples
below.
TABLE-US-00015 TABLE 15 Physical and Biochemical Properties of C1
Enzymes SEQ MW T .degree. C. Stability Enzyme ID NO (kDa) pI pH
(50%) (50%) pH 5.0 Bgl3A 44 106 4.8 4.0 60 64% (2.5-7.0) (45-70)
Gal53A 47 30 8.2 4.5 55-60 100% (2.7-6.0) (30-70) Gla15A 50 68 4.3
5.5 70 50% (3.5-7.0) Pel1 53 33 5.3 8.5 60-65 100%** (7.8-8.8)
(30-70) Chi18A 56 43 4.0 7.5 70 76% (5.7-8.7 (60-75) Gls2A 59 80
6.6 5.9 70 100% (5.0-7.0) (55-75) Axe3 62 35 6.5-6.8 FaeA1 65 25-40
3.5-4.0 FaeA2 68 29.1* 4.87* FaeB2 71 29.3* 4.28* Abf1 74 35 7.3
and 8.5-9.0 Abf2 77 33 4.5-5.0 Axe1 80 28* 7.2* Axe2 83 27 6 Pme1
86 32.6* 8.79* Abn1 89 32.8* 5.57* bxl2 95 75.9* 4.8* MW =
Molecular Weight, kiloDaltons (kDa) (*= Predicted) pI = isoelectric
point (*= Predicted) pH (50%) = pH optimum of enzyme activity
(range of at least 50% activity) T .degree. C. (50%) = Temperature
optimum of enzyme activity (range of at least 50% activity)
Stability pH 5.0 = residual enzyme activity after 3 hour incubation
at 50.degree. C., pH 5.0 **pH 8.5, 1 mM CaCl.sub.2
TABLE-US-00016 TABLE 16 Substrate Specificities of C1 Enzymes
(Activity Towards Substrates (units/mg)) Substrate Bgl3A Gal53A
Pel1 CMC (S-N Assay) 1.5 0 0 .beta.-glucan from Barley 18 0 0
Birchwood 0 0 0 GlucuronoXylan Wheat ArabinoXylan 0 0 0 Xyloglucan
from 0 0 0 Tamarind Laminarin (.beta.-1,3; .beta.- 109 0 0 1,6)
Polygalacturonic acid 0 0 70 (RS) Citrus Pectin 0 0 525*
(Viscosimetry, A235) Citrus Pectin 0 0 82* (Etherefication 89%,
A235) Citrus Pectin 0 0 126* (Etherefication 65%, A235) Citrus
Pectin 0 0 133* (Etherefication 26%, A235) Galactan 0 620 0
Arabinan 0 97 0 Debranched Arabinan Branched 0 90 0 Arabinogalactan
0 0 0 Galactomannan 0 0 0 p-NPh-.beta.-D- 0.03 0 0 cellobioside
p-NPh-.beta.-D-lactoside 0 0 0 p-NPh-.beta.-L- 0 0 0
arabinofuranoside p-NPh-.beta.-L- 0 0 0 arabinopyranoside
o-NPh-.beta.-D- 0 0 0 xylopyranoside p-NPh-.beta.-D- 39 0 0
glucopyranoside p-NPh-.beta.-L- 0 0 0 galactopyranoside Cellobiose
(.beta.-1,4) 52 0 0 Laminaribiose (.beta.-1,3) 86 0 0
.alpha.-Sophorose (.beta.-1,2) 50 0 0 Gentibiose (.beta.-1,6) 107 0
0 Maltose 0 0 0 *C1 pectate lyase is a Ca.sup.+2-dependent enzyme,
so activities were measured in 1 mM of CaCl.sub.2.
TABLE-US-00017 TABLE 17 Substrate Specificities of C1 Gla15A
(Activity Towards Substrates (units/mg)) Substrate Glucoamylase
Activity Starch soluble (potato) NS* 150 Amylose soluble (potato),
NS 87 Amylopectin soluble (potato), NS 104 Pullulan (Aureobasidium
pullulans) 11 Starch insoluble (potato) NS 35 Amylose insoluble
(potato), NS 59 Amylopectin insoluble (potato), NS 43 Maltose** 18
Isomaltose** 1.2 Maltotriose** 48 Maltohexaose** 61
pNph-.alpha.-D-glucopiranoside 0
TABLE-US-00018 TABLE 18 Substrate Specificities of C1 Enzymes
(Activity Towards Substrates (units/mg)) Substrate Chi18A Gls2A
Chitosan soluble, 140 kDa 35 28 Chitin Colloid 8 0 CMC 0 0
.beta.-glucan 0 0 Birchwood GlucuronoXylan 0 0
p-NPh-.beta.-D-N-acetylglucosamine 0.1 0 Chitosan, 140 kDa
viscometry 210 0
[0240] pH and temperature dependencies of certain enzymes of the
present invention are illustrated in Tables 19 and 20 below. pH and
temperature dependencies were determined using the standard assays
with the substrate and conditions listed below:
TABLE-US-00019 Enzyme pH Temperature Bgl3A
p-NPh-.beta.-D-glucopyranoside Cellobiose Gla15A Potato Starch,
50.degree. C., 10 minutes Potato Starch, pH 5.0, 10 minutes Chi18A
Colloidal chitin, 50.degree. C., 10 minutes Chitosan (140 kD), pH
5.0, 10 minutes Gls2A Chitosan (140 kD), 50.degree. C., 10 minutes
Chitosan (140 kD), pH 5.0, 10 minutes Gal53A.sup.1 Arabinan
branched, 50.degree. C., 10 minutes Arabinan branched, pH 5.0, 10
minutes Gal53A.sup.2 Galactan, 50.degree. C. 10 minutes Not
Applicable Pel1 A235, PGA + 1 mM CaCl.sub.2 A235, PGA + 1 mM
CaCl.sub.2
[0241] The temperature/pH that exhibited the highest enzyme
activity was set at 100% in the tables below, with the remaining
values expressed a percentage of this standard. The pH and
temperature dependencies of certain enzymes are also illustrated
graphically in FIGS. 9 and 10, respectively.
TABLE-US-00020 TABLE 19 pH Dependency of C1 Enzymes (% Activity) pH
Bgl3A Gla15A Chi18A Gls2a pH Gal 53A.sup.1 pH Gal 53A.sup.2 pH Pel1
2.0 2 2.6 50 37 2.45 39.5 2.9 0.3 3.0 70 43 2 0 2.73 55.6 3.51 26.5
3.53 90 51 2 3 3.5 70.4 4.22 93.9 7.25 2.8 4.0 100 78 2 10.3 4.05
89.3 4.62 97.4 7.75 41.9 4.57 98 90 8 40 4.5 100.0 4.90 97.7 8.05
70.5 5.0 93 97 20 50 4.95 75.6 5.45 100 8.18 88.8 5.5 83 100 30 64
5.5 70.6 6.02 95.8 8.42 100 6.0 72 96 57 100 6.02 48.2 6.51 91.6
8.88 49.6 6.5 58 83 62 65 6.47 34.5 7.03 71.9 8.99 33 7.0 47 50 83
37.8 7.03 19.8 7.50 51.7 9.5 5 7.5 30 25 100 7.5 17.8 8.00 42.2 8
17 1 98 9.04 17.1 9.11 1.7 9 25
TABLE-US-00021 TABLE 20 Temperature Dependency of C1 Enzymes (%
Activity) Temp Gal Temp .degree. C. Bgl3A Gla15A Chi18A Gls2a
53A.sup.1 .degree. C. Pel1 25 10 21 6.9 30 10 20 15 46.6 30 15.2 35
18 21 18 40 36 20 25 22 80.3 39 29.6 45 48 19 36 32 47 51.5 50 65
20 40 43 92.1 55 95 44 42 53 56 82.5 60 100 62 47 60 100 61 100 65
80 88 60 80 97.1 67 94.8 70 48 100 100 100 59.8 72 33.9 75 15 5 53
53 80 12 15 85 90
[0242] As used herein, reference to an isolated protein or
polypeptide in the present invention, including any of the enzymes
disclosed herein, includes full-length proteins, fusion proteins,
or any fragment or homologue of such a protein. More specifically,
an isolated protein, such as an enzyme according to the present
invention, is a protein (including a polypeptide or peptide) that
has been removed from its natural milieu (i.e., that has been
subject to human manipulation) and can include purified proteins,
partially purified proteins, recombinantly produced proteins,
synthetically produced proteins, proteins complexed with lipids,
soluble proteins, and isolated proteins associated with other
proteins, for example. As such, "isolated" does not reflect the
extent to which the protein has been purified. Preferably, an
isolated protein of the present invention is produced
recombinantly. In addition, and by way of example, a "C. lucknowens
protein" or "C. lucknowens enzyme" refers to a protein (generally
including a homologue of a naturally occurring protein) from
Chrysosporium lucknowense or to a protein that has been otherwise
produced from the knowledge of the structure (e.g., sequence) and
perhaps the function of a naturally occurring protein from
Chrysosporium lucknowense. In other words, a C. lucknowens protein
includes any protein that has substantially similar structure and
function of a naturally occurring C. lucknowens protein or that is
a biologically active (i.e., has biological activity) homologue of
a naturally occurring protein from C. lucknowens as described in
detail herein. As such, a C. lucknowens protein can include
purified, partially purified, recombinant, mutated/modified and
synthetic proteins. According to the present invention, the terms
"modification" and "mutation" can be used interchangeably,
particularly with regard to the modifications/mutations to the
amino acid sequence of a C. lucknowens protein (or nucleic acid
sequences) described herein. An isolated protein according to the
present invention can be isolated from its natural source, produced
recombinantly or produced synthetically.
[0243] According to the present invention, the terms "modification"
and "mutation" can be used interchangeably, particularly with
regard to the modifications/mutations to the primary amino acid
sequences of a protein or peptide (or nucleic acid sequences)
described herein. The term "modification" can also be used to
describe, post-translational Modifications to a protein or peptide
including, but not limited to, methylation, farnesylation,
carboxymethylation, geranyl geranylation, glycosylation,
phosphorylation, acetylation, myristoylation, prenylation,
palmitation, and/or amidation. Modifications can also include, for
example, complexing a protein or peptide with another compound.
Such modifications can be considered to be mutations, for example,
if the modification is different than the post-translational
modification that occurs in the natural, wild-type protein or
peptide.
[0244] As used herein, the term "homologue" is used to refer to a
protein or peptide which differs from a naturally occurring protein
or peptide (i.e., the "prototype" or "wild-type" protein) by minor
modifications to the naturally occurring protein or peptide, but
which maintains the basic protein and side chain structure of the
naturally occurring form. Such changes include, but are not limited
to: changes in one or a few amino acid side chains; changes one or
a few amino acids, including deletions (e.g., a truncated version
of the protein or peptide), insertions and/or substitutions;
changes in stereochemistry of one or a few atoms; and/or minor
derivatizations, including but not limited to: methylation,
glycosylation, phosphorylation, acetylation, myristoylation,
prenylation, palmitation, amidation and/or addition of
glycosylphosphatidyl inositol. A homologue can have either
enhanced, decreased, or substantially similar properties as
compared to the naturally occurring protein or peptide. A homologue
can include an agonist of a protein or an antagonist of a
protein.
[0245] Homologues can be the result of natural allelic variation or
natural mutation. A naturally occurring allelic variant of a
nucleic acid encoding a protein is a gene that occurs at
essentially the same locus (or loci) in the genome as the gene
which encodes such protein, but which, due to natural variations
caused by, for example, mutation or recombination, has a similar
but not identical sequence. Allelic variants typically encode
proteins having similar activity to that of the protein encoded by
the gene to which they are being compared. One class of allelic
variants can encode the same protein but have different nucleic
acid sequences due to the degeneracy of the genetic code. Allelic
variants can also comprise alterations in the 5' or 3' untranslated
regions of the gene (e.g., in regulatory control regions). Allelic
variants are well known to those skilled in the art.
[0246] Homologues can be produced using techniques known in the art
for the production of proteins including, but not limited to,
direct modifications to the isolated, naturally occurring protein,
direct protein synthesis, or modifications to the nucleic acid
sequence encoding the protein using, for example, classic or
recombinant DNA techniques to effect random or targeted
mutagenesis.
[0247] Modifications in protein homologues, as compared to the
wild-type protein, either agonize, antagonize, or do not
substantially change, the basic biological activity of the
homologue as compared to the naturally occurring protein.
Modifications of a protein, such as in a homologue, may result in
proteins having the same biological activity as the naturally
occurring protein, or in proteins having decreased or increased
biological activity as compared to the naturally occurring protein.
Modifications which result in a decrease in protein expression or a
decrease in the activity of the protein, can be referred to as
inactivation (complete or partial), down-regulation, or decreased
action of a protein. Similarly, modifications which result in an
increase in protein expression or an increase in the activity of
the protein, can be referred to as amplification, overproduction,
activation, enhancement, up-regulation or increased action of a
protein.
[0248] According to the present invention, an isolated protein,
including a biologically active homologue or fragment thereof, has
at least one characteristic of biological activity of a wild-type,
or naturally occurring, protein. As discussed above, in general,
the biological activity or biological action of a protein refers to
any function(s) exhibited or performed by the protein that is
ascribed to the naturally occurring form of the protein as measured
or observed in vivo (i.e., in the natural physiological environment
of the protein) or in vitro (i.e., under laboratory conditions).
The biological activity of a protein of the present invention can
include an enzyme activity (catalytic activity and/or substrate
binding activity), such as cellulase activity, hemicellulase
activity, .beta.-glucanase activity, .beta.-glucosidase activity,
.alpha.-galactosidase activity, .beta.-galactosidase activity,
xylanase activity or any other activity disclosed herein. Specific
biological activities of the proteins disclosed herein are
described in detail above and in the Examples. Methods of detecting
and measuring the biological activity of a protein of the invention
include, but are not limited to, the assays described in the
Examples section below. Such assays include, but are not limited
to, measurement of enzyme activity (e.g., catalytic activity),
measurement of substrate binding, and the like. It is noted that an
isolated protein of the present invention (including homologues) is
not required to have a biological activity such as catalytic
activity. A protein can be a truncated, mutated or inactive
protein, or lack at least one activity of the wild-type enzyme, for
example. Inactive proteins may be useful in some screening assays,
for example, or for other purposes such as antibody production.
[0249] Methods to measure protein expression levels of a protein
according to the invention include, but are not limited to: western
blotting; immunocytochemistry, flow cytometry or other
immunologic-based assays; assays based on a property of the protein
including but not limited to, ligand binding or interaction with
other protein partners. Binding assays are also well known in the
art. For example, a BIAcore machine can be used to determine the
binding constant of a complex between two proteins. The
dissociation constant for the complex can be determined by
monitoring changes in the refractive index with respect to time as
buffer is passed over the chip (O'Shannessy et al. Anal. Biochem.
212:457-468 (1993); Schuster et al., Nature 365:343-347 (1993)).
Other suitable assays for measuring the binding of one protein to
another include, for example, immunoassays such as enzyme linked
immunoabsorbent assays (ELISA) and radioimmunoassays (RIA), or
determination of binding by monitoring the change in the
spectroscopic or optical properties of the proteins through
fluorescence, UV absorption, circular dichrosim, or nuclear
magnetic resonance (NMR).
[0250] Many of the enzymes and proteins of the present invention
may be desirable targets for modification and use in the processes
described herein. These proteins have been described in terms of
function and amino acid sequence (and nucleic acid sequence
encoding the same) of representative wild-type proteins. In one
embodiment of the invention, homologues of a given protein (which
can include related proteins from other organisms or modified forms
of the given protein) are encompassed for use in the invention.
Homologues of a protein encompassed by the present invention can
comprise, consist essentially of, or consist of, in one embodiment,
an amino acid sequence that is at least about 35% identical, and
more preferably at least about 40% identical, and more preferably
at least about 45% identical, and more preferably at least about
50% identical, and more preferably at least about 55% identical,
and more preferably at least about 60% identical, and more
preferably at least about 65% identical, and more preferably at
least about 70% identical, and more preferably at least about 75%
identical, and more preferably at least about 80% identical, and
more preferably at least about 85% identical, and more preferably
at least about 90% identical, and more preferably at least about
95% identical, and more preferably at least about 96% identical,
and more preferably at least about 97% identical, and more
preferably at least about 98% identical, and more preferably at
least about 99% identical, or any percent identity between 35% and
99%, in whole integers (i.e., 36%, 37%, etc.), to an amino acid
sequence disclosed herein that represents the amino acid sequence
of an enzyme or protein according to the invention (including a
biologically active domain of a full-length protein). Preferably,
the amino acid sequence of the homologue has a biological activity
of the wild-type or reference protein or of a biologically active
domain thereof (e.g., a catalytic domain).
[0251] In one embodiment, a protein of the present invention
comprises, consists essentially of, or consists of an amino acid
sequence that is less than 100% identical to an amino acid sequence
selected from: SEQ ID NO:2. SEQ ID NO:5, SEQ ID NO:8, SEQ ID NO:11.
SEQ ID NO:14, SEQ ID NO:17, SEQ ID NO:20. SEQ ID NO:23, SEQ ID
NO:26. SEQ ID NO:29, SEQ ID NO:32, SEQ ID NO:35, SEQ ID NO:38, SEQ
ID NO:41, SEQ ID NO:44, SEQ ID NO:47, SEQ ID NO:50, SEQ ID NO:53,
SEQ ID NO:56, SEQ ID NO:59, SEQ ID NO:62, SEQ ID NO:65, SEQ ID
NO:68, SEQ ID NO:71. SEQ ID NO:74. SEQ ID NO:77. SEQ ID NO:80, SEQ
ID NO:83, SEQ ID NO:86, SEQ ID NO:89, SEQ ID NO:92, or SEQ ID NO:94
(i.e., a homologue). In another aspect of the invention, a
homologue according to the present invention has an amino acid
sequence that is less than about 99% identical to any of such amino
acid sequences, and in another embodiment, is less than about 98%
identical to any of such amino acid sequences, and in another
embodiment, is less than about 97% identical to any of such amino
acid sequences, and in another embodiment, is less than about 96%
identical to any of such amino acid sequences, and in another
embodiment, is less than about 95% identical to any of such amino
acid sequences, and in another embodiment, is less than about 94%
identical to any of such amino acid sequences, and in another
embodiment, is less than about 93% identical to any of such amino
acid sequences, and in another embodiment, is less than about 92%
identical to any of such amino acid sequences, and in another
embodiment, is less than about 91% identical to any of such amino
acid sequences, and in another embodiment, is less than about 90%
identical to any of such amino acid sequences, and so on, in
increments of whole integers.
[0252] As used herein, unless otherwise specified, reference to a
percent (%) identity refers to an evaluation of homology which is
performed using: (1) a BLAST 2.0 Basic BLAST homology search using
blastp for amino acid searches and blastn for nucleic acid searches
with standard default parameters, wherein the query sequence is
filtered for low complexity regions by default (described in
Altschul, S. F., Madden, T. L., Schaaffer, A. A., Zhang, J., Zhang,
Z., Miller, W. & Lipman, D. J. (1997) "Gapped BLAST and
PSI-BLAST: a new generation of protein database search programs."
Nucleic Acids Res. 25:3389-3402, incorporated herein by reference
in it's entirety); (2) a BLAST 2 alignment (using the parameters
described below); (3) PSI-BLAST with the standard default
parameters (Position, Specific Iterated BLAST; and/or (4)
CAZy-homology determined using standard default parameters from the
Carbohydrate Active EnZymes database (Coutinho, P. M. &
Henrissat, B. (1999) Carbohydrate-active enzymes: an integrated
database approach. In "Recent Advances in Carbohydrate
Bioengineering", H. J. Gilbert, G. Davies, B. Henrissat and B.
Svensson eds., The Royal Society of Chemistry, Cambridge, pp.
3-12).
[0253] It is noted that due to some differences in the standard
parameters between BLAST 2.0 Basic BLAST and BLAST 2, two specific
sequences might be recognized as having significant homology using
the BLAST 2 program, whereas a search performed in BLAST 2.0 Basic
BLAST using one of the sequences as the query sequence may not
identify the second sequence in the top matches. In addition,
PSI-BLAST provides an automated, easy-to-use version of a "profile"
search, which is a sensitive way to look for sequence homologues.
The program first performs a gapped BLAST database search. The
PSI-BLAST program uses the information from any significant
alignments returned to construct a position-specific score matrix,
which replaces the query sequence for the next round of database
searching. Therefore, it is to be understood that percent identity
can be determined by using any one of these programs.
[0254] Two specific sequences can be aligned to one another using
BLAST 2 sequence as described in Tatusova and Madden, (1999),
"Blast 2 sequences--a new tool for comparing protein and nucleotide
sequences", FEMS Microbiol Lett. 174:247-250, incorporated herein
by reference in its entirety. BLAST 2 sequence alignment is
performed in blastp or blastn using the BLAST 2.0 algorithm to
perform a Gapped BLAST search (BLAST 2.0) between the two sequences
allowing for the introduction of gaps (deletions and insertions) in
the resulting alignment. For purposes of clarity herein, a BLAST 2
sequence alignment is performed using the standard default
parameters as follows.
For blastn, using 0 BLOSUM62 matrix: [0255] Reward for match=1
[0256] Penalty for mismatch=-2 [0257] Open gap (5) and extension
gap (2) penalties [0258] gap x_dropoff (50) expect (10) word size
(11) filter (on) For blastp, using 0 BLOSUM62 matrix: [0259] Open
gap (11) and extension gap (1) penalties [0260] gap x_dropoff (50)
expect (10) word size (3) filter (on).
[0261] A protein of the present invention can also include proteins
having an amino acid sequence comprising at least 10 contiguous
amino acid residues of any of the sequences described herein (i.e.,
10 contiguous amino acid residues having 100% identity with 10
contiguous amino acids of SEQ ID NO:2). In other embodiments, a
homologue of a protein amino acid sequence includes amino acid
sequences comprising at least 20, or at least 30, or at least 40,
or at least 50, or at least 75, or at least 100, or at least 125,
or at least 150, or at least 175, or at least 150, or at least 200,
or at least 250, or at least 300, or at least 350 contiguous amino
acid residues of any of the amino acid sequence represented
disclosed herein. Even small fragments of proteins without
biological activity are useful in the present invention, for
example, in the preparation of antibodies against the full-length
protein or in a screening assay (e.g., a binding assay). Fragments
can also be used to construct fusion proteins, for example, where
the fusion protein comprises functional domains from two or more
different proteins (e.g., a CBD from one protein linked to a CD
from another protein). In one embodiment, a homologue has a
measurable or detectable biological activity associated with the
wild-type protein (e.g., enzymatic activity).
[0262] According to the present invention, the term "contiguous" or
"consecutive", with regard to nucleic acid or amino acid sequences
described herein, means to be connected in an unbroken sequence.
For example, for a first sequence to comprise 30 contiguous (or
consecutive) amino acids of a second sequence, means that the first
sequence includes an unbroken sequence of 30 amino acid residues
that is 100% identical to an unbroken sequence of 30 amino acid
residues in the second sequence. Similarly, for a first sequence to
have "1.00% identity" with a second sequence means that the first
sequence exactly matches the second sequence with no gaps between
nucleotides or amino acids.
[0263] In another embodiment, a protein of the present invention,
including a homologue, includes a protein having an amino acid
sequence that is sufficiently similar to a natural amino acid
sequence that a nucleic acid sequence encoding the homologue is
capable of hybridizing under moderate, high or very high stringency
conditions (described below) to (i.e., with) a nucleic acid
molecule encoding the natural protein (i.e., to the complement of
the nucleic acid strand encoding the natural amino acid sequence).
Preferably, a homologue of a protein of the present invention is
encoded by a nucleic acid molecule comprising a nucleic acid
sequence that hybridizes under low, moderate, or high stringency
conditions to the complement of a nucleic acid sequence that
encodes a protein comprising, consisting essentially of, or
consisting of, an amino acid sequence represented by any of: SEQ ID
NO:2, SEQ ID NO:5, SEQ ID NO:8, SEQ ID NO:11, SEQ ID NO:14, SEQ ID
NO:17, SEQ ID NO:20, SEQ ID NO:23, SEQ ID NO:26, SEQ ID NO:29, SEQ
ID NO:32, SEQ ID NO:35, SEQ ID NO:38, SEQ ID NO:41, SEQ ID NO:44,
SEQ ID NO:47, SEQ ID NO:50, SEQ ID NO:53, SEQ ID NO:56, SEQ ID
NO:59, SEQ ID NO:62, SEQ ID NO:65, SEQ ID NO:68, SEQ ID NO:71, SEQ
ID NO:74, SEQ ID NO:77, SEQ ID NO:80, SEQ ID NO:83, SEQ ID NO:86,
SEQ ID NO:89, SEQ ID NO:92, or SEQ ID NO:94. Such hybridization
conditions are described in detail below.
[0264] A nucleic acid sequence complement of nucleic acid sequence
encoding a protein of the present invention refers to the nucleic
acid sequence of the nucleic acid strand that is complementary to
the strand which encodes the protein. It will be appreciated that a
double stranded DNA which encodes a given amino acid sequence
comprises a single strand DNA and its complementary strand having a
sequence that is a complement to the single strand DNA. As such,
nucleic acid molecules of the present invention can be either
double-stranded or single-stranded, and include those nucleic acid
molecules that form stable hybrids under stringent hybridization
conditions with a nucleic acid sequence that encodes an amino acid
sequence such as SEQ ID NO:2, and/or with the complement of the
nucleic acid sequence that encodes an amino acid sequence such as
SEQ ID NO:2. Methods to deduce a complementary sequence are known
to those skilled in the art. It should be noted that since nucleic
acid sequencing technologies are not entirely error-free, the
sequences presented herein, at best, represent apparent sequences
of the proteins of the present invention.
[0265] As used herein, reference to hybridization conditions refers
to standard hybridization conditions under which nucleic acid
molecules are used to identify similar nucleic acid molecules. Such
standard conditions are disclosed, for example, in Sambrook et al.,
Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Labs
Press, 1989. Sambrook et al., ibid., is incorporated by reference
herein in its entirety (see specifically, pages 9.31-9.62). In
addition, formulae to calculate the appropriate hybridization and
wash conditions to achieve hybridization permitting varying degrees
of mismatch of nucleotides are disclosed, for example, in Meinkoth
et al., 1984, Anal. Biochem. 138, 267-284; Meinkoth et al., ibid.,
is incorporated by reference herein in its entirety.
[0266] More particularly, moderate stringency hybridization and
washing conditions, as referred to herein, refer to conditions
which permit isolation of nucleic acid molecules having at least
about 70% nucleic acid sequence identity with the nucleic acid
molecule being used to probe in the hybridization reaction (i.e.,
conditions permitting about 30% or less mismatch of nucleotides).
High stringency hybridization and washing conditions, as referred
to herein, refer to conditions which permit isolation of nucleic
acid molecules having at least about 80% nucleic acid sequence
identity with the nucleic acid molecule being used to probe in the
hybridization reaction (i.e., conditions permitting about 20% or
less mismatch of nucleotides). Very high stringency hybridization
and washing conditions, as referred to herein, refer to conditions
which permit isolation of nucleic acid molecules having at least
about 90% nucleic acid sequence identity with the nucleic acid
molecule being used to probe in the hybridization reaction (i.e.,
conditions permitting about 10% or less mismatch of nucleotides).
As discussed above, one of skill in the art can use the formulae in
Meinkoth et al., ibid. to calculate the appropriate hybridization
and wash conditions to achieve these particular levels of
nucleotide mismatch. Such conditions will vary, depending on
whether DNA:RNA or DNA:DNA hybrids are being formed. Calculated
melting temperatures for DNA:DNA hybrids are 10.degree. C. less
than for DNA:RNA hybrids. In particular embodiments, stringent
hybridization conditions for DNA:DNA hybrids include hybridization
at an ionic strength of 6.times.SSC (0.9 M Na.sup.+) at a
temperature of between about 20.degree. C. and about 35.degree. C.
(lower stringency), more preferably, between about 28.degree. C.
and about 40.degree. C. (more stringent), and even more preferably,
between about 35.degree. C. and about 45.degree. C. (even more
stringent), with appropriate wash conditions. In particular
embodiments, stringent hybridization conditions for DNA:RNA hybrids
include hybridization at an ionic strength of 6.times.SSC (0.9 M
Na.sup.+) at a temperature of between about 30.degree. C. and about
45.degree. C., more preferably, between about 38.degree. C. and
about 50.degree. C., and even more preferably, between about
45.degree. C. and about 55.degree. C., with similarly stringent
wash conditions. These values are based on calculations of a
melting temperature for molecules larger than about 100
nucleotides, 0% formamide and a G+C content of about 40%.
Alternatively, T.sub.m can be calculated empirically as set forth
in Sambrook et al., supra, pages 9.31 to 9.62. In general, the wash
conditions should be as stringent as possible, and should be
appropriate for the chosen hybridization conditions. For example,
hybridization conditions can include a combination of salt and
temperature conditions that are approximately 20-25.degree. C.
below the calculated T.sub.m of a particular hybrid, and wash
conditions typically include a combination of salt and temperature
conditions that are approximately 12-20.degree. C. below the
calculated T.sub.m of the particular hybrid. One example of
hybridization conditions suitable for use with DNA:DNA hybrids
includes a 2-24 hour hybridization in 6.times.SSC (50% formamide)
at about 42.degree. C., followed by washing steps that include one
or more washes at room temperature in about 2.times.SSC, followed
by additional washes at higher temperatures and lower ionic
strength (e.g., at least one wash as about 37.degree. C. in about
0.1.times.-0.5.times.SSC, followed by at least one wash at about
68.degree. C. in about 0.1.times.-0.5.times.SSC).
[0267] The minimum size of a protein and/or homologue of the
present invention is a size sufficient to have biological activity
or, when the protein is not required to have such activity,
sufficient to be useful for another purpose associated with a
protein of the present invention, such as for the production of
antibodies that bind to a naturally occurring protein. In one
embodiment, the protein of the present invention is at least 20
amino acids in length, or at least about 25 amino acids in length,
or at least about 30 amino acids in length, or at least about 40
amino acids in length, or at least about 50 amino acids in length,
or at least about 60 amino acids in length, or at least about 70
amino acids in length, or at least about 80 amino acids in length;
or at least about 90 amino acids in length, or at least about 100
amino acids in length, or at least about 125 amino acids in length,
or at least about 150 amino acids in length, or at least about 175
amino acids in length, or at least about 200 amino acids in length,
or at least about 250 amino acids in length, and so on up to a full
length of each protein, and including any size in between in
increments of one whole integer (one amino acid). There is no
limit, other than a practical limit, on the maximum size of such a
protein in that the protein can include a portion of a protein or a
full-length protein, plus additional sequence (e.g., a fusion
protein sequence), if desired.
[0268] The present invention also includes a fusion protein that
includes a domain of a protein of the present invention (including
a homologue) attached to one or more fusion segments, which are
typically heterologous in sequence to the protein sequence (i.e.,
different than protein sequence). Suitable fusion segments for use
with the present invention include, but are not limited to,
segments that can: enhance a protein's stability; provide other
desirable biological activity; and/or assist with the purification
of the protein (e.g.; by affinity chromatography). A suitable
fusion segment can be a domain of any size that has the desired
function (e.g., imparts increased stability, solubility, action or
biological activity; and/or simplifies purification of a protein).
Fusion segments can be joined to amino and/or carboxyl termini of
the domain of a protein of the present invention and can be
susceptible to cleavage in order to enable straight-forward
recovery of the protein. Fusion proteins are preferably produced by
culturing a recombinant cell transfected with a fusion nucleic acid
molecule that encodes a protein including the fusion segment
attached to either the carboxyl and/or amino terminal end of a
domain of a protein of the present invention. Accordingly, proteins
of the present invention also include expression products of gene
fusions (for example, used to overexpress soluble, active forms of
the recombinant protein), of mutagenized genes (such as genes
having codon modifications to enhance gene transcription and
translation), and of truncated genes (such as genes having membrane
binding domains removed to generate soluble forms of a membrane
protein, or genes having signal sequences removed which are poorly
tolerated in a particular recombinant host).
[0269] In one embodiment of the present invention, any of the amino
acid sequences described herein can be produced with from at least
one, and up to about 20, additional heterologous amino acids
flanking each of the C- and/or N-terminal ends of the specified
amino acid sequence. The resulting protein or polypeptide can be
referred to as "consisting essentially of" the specified amino acid
sequence. According to the present invention, the heterologous
amino acids are a sequence of amino acids that are not naturally
found (i.e., not found in nature, in vivo) flanking the specified
amino acid sequence, or that are not related to the function of the
specified amino acid sequence, or that would not be encoded by the
nucleotides that flank the naturally occurring nucleic acid
sequence encoding the specified amino acid sequence as it occurs in
the gene, if such nucleotides in the naturally occurring sequence
were translated using standard codon usage for the organism from
which the given amino acid sequence is derived.
[0270] The present invention also provides enzyme combinations that
break down lignocellulose material. Such enzyme combinations or
mixtures can include a multi-enzyme composition that contains at
least one protein of the present invention in combination with one
or more additional proteins of the present invention or one or more
enzymes or other proteins from other microorganisms, plants, or
similar organisms. Synergistic enzyme combinations and related
methods are contemplated. The invention includes methods to
identify the optimum ratios and compositions of enzymes with which
to degrade each lignocellulosic material. These methods entail
tests to identify the optimum enzyme composition and ratios for
efficient conversion of any lignocellulosic substrate to its
constituent sugars. The Examples below include assays that may be
used to identify optimum ratios and compositions of enzymes with
which to degrade lignocellulosic materials.
[0271] Any combination of the proteins disclosed herein is suitable
for use in the multi-enzyme compositions of the present invention.
Due to the complex nature of most biomass sources, which can
contain xylan, lignin, protein, and carbohydrates, among other
components, preferred enzyme combinations may contain enzymes with
a range of substrate specificities that work together to degrade
biomass into fermentable sugars in the most efficient manner. One
example of a multi-enzyme complex for lignocellulose
saccharification is a mixture of cellobiohydrolase(s), xylanase(s),
endoglucanase(s), .beta.-glucosidase(s), .beta.-xylosidase(s), and
accessory enzymes. However, it is to be understood that any of the
enzymes described specifically herein can be combined with any one
or more of the enzymes described herein or with any other available
and suitable enzymes, to produce a multi-enzyme composition. The
invention is not restricted or limited to the specific exemplary
combinations listed below.
[0272] In one embodiment, the cellobiohydrolase(s) comprise between
about 30% and about 90% or between about 50% and about 70% of the
enzymes in the composition, and more preferably, between about 55%
and 65%, and more preferably, about 60% of the enzymes in the
composition (including any percentage between 50% and 70% in 0.5%
increments (e.g., 50%, 50.5%, 51%, etc.).
[0273] In one embodiment, the xylanase(s) comprise between about
10% and about 30% of the enzymes in the composition, and more
preferably, between about 15% and about 25%, and more preferably,
about 20% of the enzymes in the composition (including any
percentage between 10% and 30% in 0.5% increments).
[0274] In one embodiment, the endoglucanase(s) comprise between
about 5% and about 15% of the enzymes in the composition, and more
preferably, between about 7% and about 13%, and more preferably,
about 10% of the enzymes in the composition (including any
percentage between 5% and 15% in 0.5% increments).
[0275] In one embodiment, the .beta.-glucosidase(s) comprise
between about 1% and about 5% of the enzymes in the composition,
and preferably between about 2% and 4%, and more preferably, about
3% of the enzymes in the composition (including any percentage
between 1% and 5% in 0.5% increments).
[0276] In one embodiment, the .beta.-xylosidase(s) comprise between
about 1% and about 3% of the enzymes in the composition, and
preferably, between about 1.5% and about 2.5%, and more preferably,
about 2% of the enzymes, in the composition (including any
percentage between 1% and 3% in 0.5% increments.
[0277] In one embodiment, the accessory enzymes comprise between
about 2% and about 8% of the enzymes in the composition, and
preferably, between about 3% and about 7%, and more preferably,
about 5% of the enzymes in the composition (including any
percentage between 2% and 8% in 0.5% increments.
[0278] One particularly preferred example of a multi-enzyme complex
for lignocellulose saccharification is a mixture of about 60%
cellobiohydrolase(s), about 20% xylanase(s), about 10%
endoglucanase(s), about 3% .beta.-glucosidase(s), about 2%
.beta.-xylosidase(s) and about 5% accessory enzyme(s).
[0279] The multi-enzyme composition may comprise at least one
cellobiohydrolase. In some embodiments, the cellobiohydrolase may
have an amino acid sequence selected from SEQ ID NO:2, SEQ ID NO:5,
SEQ ID NO:8 SEQ ID NO:11 or a homologue, fusion protein, or
fragment thereof that has cellobiohydrolase activity. Compositions
comprising at least 2, 3, 4, 5, 6, 7, 8, 9 or 10 or more
cellobiohydrolases are contemplated by the invention.
[0280] The multi-enzyme composition may comprise at least one
endoglucanase. In certain embodiments, the endoglucanase may have
an amino acid sequence selected from SEQ ID NO:14, SEQ ID NO:94,
SEQ ID NO:17, SEQ ID NO:20 or SEQ ID NO:23 or a homologue, fusion
protein, or fragment thereof that has endoglucanase activity.
Compositions comprising at least 2, 3, 4, 5, 6, 7, 8, 9 or 10 or
more endoglucanases are contemplated by the invention.
[0281] The multi-enzyme composition may comprise at least one
xylanase. In some embodiments, the xylanase may have an amino acid
sequence selected from SEQ ID NO:26, SEQ ID NO:29. SEQ ID NO:32,
SEQ ID NO:35, SEQ ID NO:38 or SEQ ID NO:41 or a homologue, fusion
protein, or fragment thereof that has xylanase activity.
Compositions comprising at least 2, 3, 4, 5, 6, 7, 8, 9 or 10 or
more xylanases are contemplated by the invention.
[0282] The multi-enzyme composition may further comprise a
.beta.-glucosidase. In one embodiment, the .beta.-glucosidase may
have an amino acid sequence of SEQ ID NO:44 or a homologue, fusion
protein, or fragment thereof that has .beta.-glucosidase activity.
Compositions comprising at least 2, 3, 4, 5, 6, 7, 8, 9 or 10 or
more .beta.-glucosidases are contemplated by the invention.
[0283] The multi-enzyme composition of may further comprise a
hemicellulase. In certain embodiments, the hemicellulase may have
an amino acid sequence selected from SEQ ID NO:47, SEQ ID NO:50,
SEQ ID NO:53, SEQ ID NO:56, SEQ ID NO:59, SEQ ID NO:62, SEQ ID
NO:65, SEQ ID NO:68, SEQ ID NO:71, SEQ ID NO:74, SEQ ID NO:77, SEQ
ID NO:80, SEQ ID NO:83, SEQ ID NO:86, SEQ ID NO:89, SEQ ID NO:92,
or a homologue, fusion protein, or fragment thereof that has a
hemicellulase activity. Compositions comprising at least 2, 3, 4,
5, 6, 7, 8, 9 or 10 or more hemicellulases are contemplated by the
invention.
[0284] One or more components of a multi-enzyme composition (other
than proteins of the present invention) can be obtained from or
derived from a microbial, plant, or other source or combination
thereof, and will contain enzymes capable of degrading
lignocellulosic material. Examples of enzymes included in the
multi-enzyme compositions of the invention include cellulases,
hemicellulases (such as xylanases, including endoxylanases,
exoxylanase, and .beta.-xylosidase), ligninases, amylases,
glucuronidases, proteases, esterases (including ferulic acid
esterase), lipases, glucosidases (such as .beta.-glucosidase),
glucomannanases, and xylogluconases.
[0285] While the multi-enzyme composition may contain many types of
enzymes, mixtures comprising enzymes that increase or enhance sugar
release from biomass are preferred, including hemicellulases. In
one embodiment, the hemicellulase is selected from a xylanase, an
arabinofuranosidase, an acetyl xylan esterase, a glucuronidase, an
endo-galactanase, a mannanase, an endo-arabinase, an exo-arabinase,
an exo-galactanase, a ferulic acid esterase, a galactomannanase, a
xylogluconase, or mixtures of any of these. In particular, the
enzymes can include glucoamylase, .beta.-xylosidase and/or
.beta.-glucosidase. The enzymes of the multi-enzyme composition can
be provided by a variety of sources. In one embodiment, the enzymes
can be produced by a growing microorganisms or plants which produce
the enzymes naturally or by virtue of being genetically modified to
express the enzyme or enzymes. In another embodiment, at least one
enzyme of the multi-enzyme composition is a commercially available
enzyme.
[0286] In some embodiments, the multi-enzyme compositions comprise
an accessory enzyme. An accessory enzyme is any additional enzyme
capable of hydrolyzing lignocellulose or enhancing or promoting the
hydrolysis of lignocellulose, wherein the accessory enzyme is
typically provided in addition to a core enzyme or core set of
enzymes. An accessory enzyme can have the same or similar function
or a different function as an enzyme or enzymes in the core set of
enzymes. These enzymes have been described elsewhere herein, and
can generally include cellulose's, xylanases, ligninases, amylases,
lipidases, or glucuronidases, for example. Accessory enzymes can
include enzymes that when contacted with biomass in a reaction,
allow for an increase in the activity of enzymes (e.g.,
hemicellulases) in the multi-enzyme composition. An accessory
enzyme or enzyme mix may be composed of enzymes from (1) commercial
suppliers; (2) cloned genes expressing enzymes; (3) complex broth
(such as that resulting from growth of a microbial strain in media,
wherein the strains secrete proteins and enzymes into the media);
(4) cell lysates of strains grown as in (3); and, (5) plant
material expressing enzymes capable of degrading lignocellulose. In
some embodiments, the accessory enzyme is a glucoamylase, a
pectinase, or a ligninase.
[0287] As used herein, a ligninase is an enzyme that can hydrolyze
or break down the structure of lignin polymers, including lignin
peroxidases, manganese peroxidases, laccases, and other enzymes
described in the art known to depolymerize or otherwise break
lignin polymers. Also included are enzymes capable of hydrolyzing
bonds formed between hemicellulosic sugars (notably arabinose) and
lignin.
[0288] In one embodiment, the multi-enzyme composition comprises
the enzyme CBH Ia (SEQ ID NO:2), a biologically active (e.g., has
enzyme activity) homologue thereof, or a fragment comprising,
consisting essentially of, or consisting of the CD thereof. In
another embodiment, the multi-enzyme composition comprises the
enzyme CBH lib (SEQ ID NO:11), a biologically active homologue
thereof, or a fragment comprising, consisting essentially of, or
consisting of the CD thereof. In another embodiment, the
multi-enzyme composition comprises the enzyme EG II (SEQ ID NO:94);
a biologically active homologue thereof, or a fragment comprising,
consisting essentially of, or consisting of the CD thereof. In a
further embodiment, the multi-enzyme composition comprises the
enzyme Xyl 2 (SEQ ID NO:29), a biologically active homologue
thereof, or a fragment comprising, consisting essentially of, or
consisting of the CD thereof. In yet another embodiment, the
multi-enzyme composition comprises the enzyme .beta.-glucosidase
(SEQ ID NO:44), a biologically active homologue thereof, or a
fragment comprising, consisting essentially of, or consisting of
the CD thereof. In certain embodiments, the multi-enzyme
composition comprises any combination of the enzymes listed above.
In other embodiments, the multi-enzyme composition comprises at
least one of the enzymes listed above in combination with a
.beta.-xylosidase enzyme.
[0289] In one embodiment, the multi-enzyme composition comprises
CBH Ia (SEQ ID NO:2), CBH IIb (SEQ ID NO: 11), EG II (SEQ ID
NO:94), the CD of Xyl 2 (SEQ ID NO:29) and the CD of
.beta.-glucosidase (SEQ ID NO:44). This preferred enzyme
combination may further comprise a .beta.-xylosidase enzyme. One of
skill in the art will appreciate, however, that any enzyme in the
combinations described above may be replaced with an enzyme
exhibiting a similar substrate specificity. Further, the optimal
enzyme combination for a particular use (e.g., for degradation of a
biomass derived from a specific source) can be determined by one of
skill in the art using routine experimentation and assays known in
the art, such as those disclosed herein.
[0290] In one embodiment, the multi-enzyme composition comprises
the enzyme CBH Ib (SEQ ID NO:5), a biologically active homologue
thereof, or a fragment comprising, consisting essentially of, or
consisting of the CD thereof. In an additional embodiment, the
multi-enzyme composition comprises the enzyme EG V (SEQ ID NO:20),
a biologically active homologue thereof, or a fragment comprising,
consisting essentially of, or consisting of the CD thereof. In
still another embodiment, the multi-enzyme composition comprises
the enzyme Xyl 1 (SEQ ID NO:26), a biologically active homologue
thereof, or a fragment comprising, consisting essentially of, or
consisting of the CD thereof. In further embodiments, the
multi-enzyme composition comprises the enzyme arabinogalactanase
(SEQ ID NO:47), a biologically active homologue thereof, or a
fragment comprising, consisting essentially of, or consisting of
the CD thereof. In certain embodiments, the multi-enzyme
composition comprises any combination of the enzymes listed above.
In other embodiments, the multi-enzyme composition comprises at
least one of the enzymes listed above in combination with at least
one accessory enzyme such as, for example,
.alpha.-L-arabinofuranosidase.
[0291] In a preferred embodiment, the multi-enzyme composition
comprises the CD of CBH Ib (SEQ ID NO:5), the CD of EG V (SEQ ID
NO:20), the CD of Xyl 1 (SEQ ID NO:26) and the CD of
arabinogalactanase (SEQ ID NO:47). This preferred enzyme
combination may further comprise at least one accessory enzyme. One
of skill in the art will appreciate, however, that any enzyme in
the combinations described above may be replaced with an enzyme
exhibiting a similar substrate specificity. Further, the optimal
enzyme combination for a particular use (e.g., for degradation of a
biomass derived from a specific source) can be determined by one of
skill in the art using routine experimentation and assays known in
the art, such as those disclosed herein.
[0292] The multi-enzyme compositions, in some embodiments, comprise
a biomass comprising microorganisms or a crude fermentation product
of microorganisms. A crude fermentation product refers to the
fermentation broth which has been separated from the microorganism
biomass (by filtration, for example). In general, the
microorganisms are grown in fermentors, optionally centrifuged or
filtered to remove biomass, and optionally concentrated,
formulated, and dried to produce an enzyme(s) or a multi-enzyme
composition that is a crude fermentation product. In other
embodiments, enzyme(s) or multi-enzyme compositions produced by the
microorganism (including a genetically modified microorganism as
described below) are subjected to one or more purification steps,
such as ammonium sulfate precipitation, chromatography and/or
ultrafiltration, which result in a partially purified or purified
enzyme(s). If the microorganism has been genetically modified to
express the enzyme(s), the enzyme(s) will include recombinant
enzymes. If the genetically modified microorganism also naturally
expresses the enzyme(s) or other enzymes useful for lignocellulosic
saccharification, the enzyme(s) may include both naturally
occurring and recombinant enzymes.
[0293] Another embodiment of the present invention relates to a
composition comprising at least about 500 ng, and preferably at
least about 1 .mu.g, and more preferably at least about 5 .mu.g,
and more preferably at least about 10 .mu.g, and more preferably at
least about 25 .mu.g, and more preferably at least about 50 .mu.g,
and more preferably at least about 75 and more preferably at least
about 100 .mu.g, and more preferably at least about 250 .mu.g, and
more preferably at least about 500 .mu.g, and more preferably at
least about 750 .mu.g, and more preferably at least about 1 mg, and
more preferably at least about 5 mg, of an isolated protein
comprising any of the proteins or homologues or fragments thereof
discussed herein. Such a composition of the present invention may
include any carrier with which the protein is associated by virtue
of the protein preparation method, a protein purification method,
or a preparation of the protein for use in any method according to
the present invention. For example, such a carrier can include any
suitable buffer, extract, or medium that is suitable for combining
with the protein of the present invention so that the protein can
be used in any method described herein according to the present
invention.
[0294] In one embodiment of the invention, one or more enzymes of
the invention is bound to a solid support, i.e., an immobilized
enzyme. As used herein, an immobilized enzyme includes immobilized
isolated enzymes, immobilized microbial cells which contain one or
more enzymes of the invention, other stabilized intact cells that
produce one or more enzymes of the invention, and stabilized
cell/membrane homogenates. Stabilized intact cells and stabilized
cell/membrane homogenates include cells and homogenates from
naturally occurring microorganisms expressing the enzymes of the
invention and preferably, from genetically modified microorganisms
as disclosed elsewhere herein. Thus, although methods for
immobilizing enzymes are discussed below, it will be appreciated
that such methods are equally applicable to immobilizing microbial
cells and in such an embodiment, the cells can be lysed, if
desired.
[0295] A variety of methods for immobilizing an enzyme are
disclosed in Industrial Enzymology 2nd Ed., Godfrey, T. and West,
S. Eds., Stockton Press, New York, N.Y., 1996, pp. 267-272;
Immobilized Enzymes, Chibata, I. Ed., Halsted Press, New York,
N.Y., 1978; Enzymes and Immobilized Cells in Biotechnology, Laskin,
A. Ed., Benjamin/Cummings Publishing Co., Inc., Menlo Park, Calif.,
1985; and Applied Biochemistry and Bioengineering, Vol. 4, Chibata,
I. and Winward, Jr., L. Eds, Academic Press, New York, N.Y., 1983,
which are incorporated herein in their entirety.
[0296] Briefly, a solid support refers to any solid organic,
biopolymer or inorganic supports that can form a bond with an
enzyme without significantly effecting the activity of the enzyme.
Exemplary organic solid supports include polymers such as
polystyrene, nylon, phenol-formaldehyde resins, acrylic copolymers
(e.g., polyacrylamide), stabilized intact whole cells, and
stabilized crude whole cell/membrane homogenates. Exemplary
biopolymer supports include cellulose, polydextrans (e.g.,
Sephadex.RTM.), agarose, collagen and chitin. Exemplary inorganic
supports include glass beads (porous and nonporous), stainless
steel, metal oxides (e.g., porous ceramics such as ZrO.sub.2,
TiO.sub.2, Al.sub.2O.sub.3, and NiO) and sand. In one embodiment,
the solid support is selected from the group consisting of
stabilized intact cells and/or crude cell homogenates (e.g.,
produced from the microbial host cells expressing recombinant
enzymes, alone or in combination with natural enzymes). Preparation
of such supports requires a minimum of handling and cost.
Additionally, such supports provide excellent stability of the
enzyme.
[0297] Stabilized intact cells and/or cell/membrane homogenates can
be produced, for example, by using bifunctional crosslinkers (e.g.,
glutaraldehyde) to stabilize cells and cell homogenates. In both
the intact cells and the cell membranes, the cell wall and
membranes act as immobilizing supports. In such a system, integral
membrane proteins are in the "best" lipid membrane environment.
Whether starting with intact cells or homogenates, in this system
the cells are either no longer "alive" or "metabolizing", or
alternatively, are "resting" (i.e., the cells maintain metabolic
potential and active enzyme, but under the culture conditions are
not growing); in either case, the immobilized cells or membranes
serve as biocatalysts.
[0298] An enzyme of the invention can be bound to a solid support
by a variety of methods including adsorption, cross-linking
(including covalent bonding), and entrapment. Adsorption can be
through van del Waal's forces, hydrogen bonding, ionic bonding, or
hydrophobic binding. Exemplary solid supports for adsorption
immobilization include polymeric adsorbents and ion-exchange
resins. Solid supports in a bead form are particularly well-suited.
The particle size of an adsorption solid support can be selected
such that the immobilized enzyme is retained in the reactor by a
mesh filter while the substrate is allowed to flow through the
reactor at a desired rate. With porous particulate supports it is
possible to control the adsorption process to allow enzymes or
cells to be embedded within the cavity of the particle, thus
providing protection without an unacceptable loss of activity.
[0299] Cross-linking of an enzyme to a solid support involves
forming a chemical bond between a solid support and the enzyme. It
will be appreciated that although cross-linking generally involves
linking the enzyme to a solid support using an intermediary
compound, it is also possible to achieve a covalent bonding between
the enzyme and the solid support directly without the use of an
intermediary compound. Cross-linking commonly uses a bifunctional
or multifunctional reagent to activate and attach a carboxyl group,
amino group, sulfur group, hydroxy group or other functional group
of the enzyme to the solid support. The term "activate." refers to
a chemical transformation of a functional group which allows a
formation of a bond at the functional group. Exemplary amino group
activating reagents include water-soluble carbodiimides,
glutaraldehyde, cyanogen bromide, N-hydroxysuccinimide esters,
triazines, cyanuric chloride, and carbonyl diimidazole. Exemplary
carboxyl group activating reagents include water-soluble
carbodiimides, and N-ethyl-5-phenylisoxazolium-3-sulfonate.
Exemplary tyrosyl group activating reagents include diazonium
compounds. And exemplary sulfhydryl group activating reagents
include dithiobis-5,5'-(2-nitrobenzoic acid), and
glutathione-2-pyridyl disulfide. Systems for covalently linking an
enzyme directly to a solid support include Eupergit.RTM., a
polymethacrylate bead support available from Rohm Pharma
(Darmstadt, Germany), kieselguhl (Macrosorbs), available from
Sterling Organics, kaolinite available from English China Clay as
"Biofix" supports, silica gels which can be activated by
silanization, available from W.R. Grace, and high-density alumina,
available from UOP (Des Plains, Ill.).
[0300] Entrapment can also be used to immobilize an enzyme.
Entrapment of an enzyme involves formation of, inter alia, gels
(using organic or biological polymers), vesicles (including
microencapsulation), semipermeable membranes or other matrices.
Exemplary materials used for entrapment of an enzyme include
collagen, gelatin, agar, cellulose triacetate, alginate,
polyacrylamide, polystyrene, polyurethane, epoxy resins,
carrageenan, and egg albumin. Some of the polymers, in particular
cellulose triacetate, can be used to entrap the enzyme as they are
spun into a fiber. Other materials such as polyacrylamide gels can
be polymerized in solution to entrap the enzyme. Still other
materials such as polyglycol oligomers that are functionalized with
polymerizable vinyl end groups can entrap enzymes by forming a
cross-linked polymer with UV light illumination in the presence of
a photosensitizer.
[0301] Further embodiments of the present invention include nucleic
acid molecules that encode a protein of the present invention, as
well as homologues or fragments of such nucleic acid molecules. A
nucleic acid molecule of the present invention includes a nucleic
acid molecule comprising, consisting essentially of, or consisting
of, a nucleic acid sequence encoding any of the isolated proteins
disclosed herein, including a fragment or a homologue of such
proteins, described above. Nucleic acid molecules can include a
nucleic acid sequence that encodes a fragment of a protein that
does not have biological activity, and can also include portions of
a gene or polynucleotide encoding the protein that are not part of
the coding region for the protein (e.g., introns or regulatory
regions of a gene encoding the protein). Nucleic acid molecules can
include a nucleic acid sequence that is useful as a probe or primer
(oligonucleotide sequences).
[0302] In one embodiment, a nucleic molecule of the present
invention includes a nucleic acid molecule comprising, consisting
essentially of, or consisting of, a nucleic acid sequence
represented by SEQ ID NO:1, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:6,
SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:12, SEQ ID NO:13,
SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:18, SEQ ID NO:19, SEQ ID
NO:21, SEQ ID NO:22. SEQ ID NO:24, SEQ ID NO:25, SEQ ID NO:27, SEQ
ID NO:28, SEQ ID NO:30, SEQ ID NO:31, SEQ ID NO:33, SEQ ID NO:34,
SEQ ID NO:36, SEQ ID NO:37, SEQ ID NO:39, SEQ ID NO:40, SEQ ID
NO:41, SEQ ID NO:42, SEQ ID NO:43, SEQ ID NO:45, SEQ ID NO:46, SEQ
ID NO:48, SEQ ID NO:49, SEQ ID NO:51, SEQ ID NO:52, SEQ ID NO:54,
SEQ ID NO:55, SEQ ID NO:57, SEQ ID NO:58, SEQ ID NO:60, SEQ ID
NO:61, SEQ ID NO:63, SEQ ID NO:64, SEQ ID NO:66, SEQ ID NO:67, SEQ
ID NO:69, SEQ ID NO:70, SEQ ID NO:72, SEQ ID NO:73, SEQ ID NO:75,
SEQ ID NO:76, SEQ ID NO:78, SEQ ID NO:79, SEQ ID NO:90, SEQ ID
NO:91, SEQ ID NO:93, or fragments or homologues thereof.
Preferably, the nucleic acid sequence encodes a protein (including
fragments and homologues thereof) useful in the invention, or
encompasses useful oligonucleotides or complementary nucleic acid
sequences.
[0303] In one embodiment, a nucleic molecule of the present
invention includes a nucleic acid molecule comprising, consisting
essentially of, or consisting of, a nucleic acid sequence encoding
an amino acid sequence represented by SEQ ID NO:2, SEQ ID NO:5, SEQ
ID NO:8, SEQ ID NO: 11, SEQ ID NO:14, SEQ ID NO:17, SEQ ID NO:20,
SEQ ID NO:23, SEQ ID NO:26, SEQ ID NO:29, SEQ ID NO:32, SEQ ID
NO:35, SEQ ID NO:38, SEQ ID NO:41, SEQ ID NO:44. SEQ ID NO:47, SEQ
ID NO:50, SEQ ID NO:53, SEQ ID NO:56, SEQ ID NO:59, SEQ ID NO:62,
SEQ ID NO:65, SEQ ID NO:68, SEQ ID NO:71, SEQ ID NO:74, SEQ ID
NO:77, SEQ ID NO:80, SEQ ID NO:83, SEQ ID NO:86, SEQ ID NO:89, SEQ
ID NO:92, SEQ ID NO:94 or fragments or homologues thereof.
Preferably, the nucleic acid sequence encodes a protein (including
fragments and homologues thereof) useful in the invention, or
encompasses useful oligonucleotides or complementary nucleic acid
sequences.
[0304] In one embodiment, such nucleic acid molecules include
isolated nucleic acid molecules that hybridize under moderate
stringency conditions, and more preferably under high stringency
conditions, and even more preferably under very high stringency
conditions, as described above, with the complement of a nucleic
acid sequence encoding a protein of the present invention (i.e.,
including naturally occurring allelic variants encoding a protein
of the present invention). Preferably, an isolated nucleic acid
molecule encoding a protein of the present invention comprises a
nucleic acid sequence that hybridizes under moderate, high, or very
high stringency conditions to the complement of a nucleic acid
sequence that encodes a protein comprising an amino acid sequence
represented by SEQ NO:2, SEQ ID NO:5, SEQ ID NO:8, SEQ ID NO:11,
SEQ ID NO:14, SEQ ID NO:17, SEQ ID NO:20, SEQ ID NO:23, SEQ ID
NO:26, SEQ ID NO:29, SEQ ID NO:32, SEQ ID NO:35. SEQ ID NO:38, SEQ
ID NO:41, SEQ ID NO:44, SEQ ID NO:47, SEQ ID NO:50, SEQ ID NO:53,
SEQ ID NO:56, SEQ ID NO:59, SEQ ID NO:62, SEQ ID NO:65, SEQ ID
NO:68, SEQ ID NO:71, SEQ ID NO:74, SEQ ID NO:77, SEQ ID NO:80, SEQ
ID NO:83, SEQ ID NO:86, SEQ ID NO:89, SEQ ID NO:92, or SEQ ID
NO:94.
[0305] In accordance with the present invention, an isolated
nucleic acid molecule is a nucleic acid molecule (polynucleotide)
that has been removed from its natural milieu (i.e. that has been
subject to human manipulation) and can include DNA, RNA, or
derivatives of either DNA or RNA, including cDNA. As such,
"isolated" does not reflect the extent to which the nucleic acid
molecule has been purified. Although the phrase "nucleic acid
molecule" primarily refers to the physical nucleic acid molecule,
and the phrase "nucleic acid sequence" primarily refers to the
sequence of nucleotides on the nucleic acid molecule, the two
phrases can be used interchangeably, especially with respect to a
nucleic acid molecule, or a nucleic acid sequence, being capable of
encoding a protein. An isolated nucleic acid molecule of the
present invention can be isolated from its natural source or
produced using recombinant DNA technology (e.g., polymerase chain
reaction (PCR) amplification, cloning) or chemical synthesis.
Isolated nucleic acid molecules can include, for example, genes,
natural allelic variants of genes, coding regions or portions
thereof, and coding and/or regulatory regions modified by
nucleotide insertions, deletions, substitutions, and/or inversions
in a manner such that the modifications do not substantially
interfere with the nucleic acid molecule's ability to encode a
protein of the present invention or to form stable hybrids under
stringent conditions with natural gene isolates. An isolated
nucleic acid molecule can include degeneracies. As used herein,
nucleotide degeneracy refers to the phenomenon that one amino acid
can be encoded by different nucleotide codons. Thus, the nucleic
acid sequence of a nucleic acid molecule that encodes a protein of
the present invention can vary due to degeneracies. It is noted
that a nucleic acid molecule of the present invention is not
required to encode a protein having protein activity. A nucleic
acid molecule can encode a truncated, mutated or inactive protein,
for example. In addition, nucleic acid molecules of the invention
are useful as probes and primers for the identification, isolation
and/or purification of other nucleic acid molecules. If the nucleic
acid molecule is an oligonucleotide, such as a probe or primer, the
oligonucleotide preferably ranges from about 5 to about 50 or about
500 nucleotides, more preferably from about 10 to about 40
nucleotides, and most preferably from about 15 to about 40
nucleotides in length.
[0306] According to the present invention, reference to a gene
includes all nucleic acid sequences related to a natural (i.e.
wild-type) gene, such as regulatory regions that control production
of the protein encoded by that gene (such as, but not limited to,
transcription, translation or post-translation control regions) as
well as the coding region itself. In another embodiment, a gene can
be a naturally occurring allelic variant that includes a similar
but not identical sequence to the nucleic acid sequence encoding a
given protein. Allelic variants have been previously described
above. The phrases "nucleic acid molecule" and "gene" can be used
interchangeably when the nucleic acid molecule comprises a gene as
described above.
[0307] Preferably, an isolated nucleic acid molecule of the present
invention is produced using recombinant DNA technology (e.g.,
polymerase chain reaction (PCR) amplification, cloning, etc.) or
chemical synthesis. Isolated nucleic acid molecules include any
nucleic acid molecules and homologues thereof that are part of a
gene described herein and/or that encode a protein described
herein, including, but not limited to, natural allelic variants and
modified nucleic acid molecules (homologues) in which nucleotides
have been inserted, deleted, substituted, and/or inverted in such a
manner that such modifications provide the desired effect on
protein biological activity or on the activity of the nucleic acid
molecule. Allelic variants and protein homologues (e.g., proteins
encoded by nucleic acid homologues) have been discussed in detail
above.
[0308] A nucleic acid molecule homologue (i.e., encoding a
homologue of a protein of the present invention) can be produced
using a number of methods known to those skilled in the art (see,
for example, Sambrook et al.). For example, nucleic acid molecules
can be modified using a variety of techniques including, but not
limited to, by classic mutagenesis and recombinant DNA techniques
(e.g., site-directed mutagenesis, chemical treatment, restriction
enzyme cleavage, ligation of nucleic acid fragments and/or PCR
amplification), or synthesis of oligonucleotide mixtures and
ligation of mixture groups to "build" a mixture of nucleic acid
molecules and combinations thereof. Another method for modifying a
recombinant nucleic acid molecule encoding a protein is gene
shuffling (i.e., molecular breeding) (See, for example, U.S. Pat.
No. 5,605,793 to Stemmer; Minshull and Stemmer; 1999, Curr. Opin.
Chem. Biol. 3:284-290; Stemmer, 1994, P.N.A.S. USA 91:10747-10751,
all of which are incorporated herein by reference in their
entirety). This technique can be used to efficiently introduce
multiple simultaneous changes in the protein. Nucleic acid molecule
homologues can be selected by hybridization with a gene or
polynucleotide, or by screening for the function of a protein
encoded by a nucleic acid molecule (i.e., biological activity).
[0309] The minimum size of a nucleic acid molecule of the present
invention is a size sufficient to encode a protein (including a
fragment or homologue of a full-length protein) having biological
activity, sufficient to encode a protein comprising at least one
epitope which binds to an antibody, or sufficient to form a probe
or oligonucleotide primer that is capable of forming a stable
hybrid with the complementary sequence of a nucleic acid molecule
encoding a natural protein (e.g., under moderate, high, or high
stringency conditions). As such, the size of the nucleic acid
molecule encoding such a protein can be dependent on nucleic acid
composition and percent homology or identity between the nucleic
acid molecule and complementary sequence as well as upon
hybridization conditions per se (e.g., temperature, salt
concentration, and formamide concentration). The minimal, size of a
nucleic acid molecule that is used as an oligonucleotide primer or
as a probe is typically at least about 12 to about 15 nucleotides
in length if the nucleic acid molecules are GC-rich and at least
about IS to about 18 bases in length if they are AT-rich. There is
no limit, other than a practical limit, on the maximal size of a
nucleic acid molecule of the present invention, in that the nucleic
acid molecule can include a portion of a protein encoding sequence,
a nucleic acid sequence encoding a full-length protein (including a
gene), including any length fragment between about 20 nucleotides
and the number of nucleotides that make up the full length cDNA
encoding a protein, in whole integers (e.g., 20, 21, 22, 23, 24, 25
. . . nucleotides), or multiple genes, or portions thereof.
[0310] The phrase "consisting essentially of", when used with
reference to a nucleic acid sequence herein, refers to a nucleic
acid sequence encoding a specified amino acid sequence that can be
flanked by from at least one, and up to as many as about 60,
additional heterologous nucleotides at each of the 5' and/or the 3'
end of the nucleic acid sequence encoding the specified amino acid
sequence. The heterologous nucleotides are not naturally found
(i.e., not found in nature, in vivo) flanking the nucleic acid
sequence encoding the specified amino acid sequence as it occurs in
the natural gene or do not encode a protein that imparts any
additional function to the protein or changes the function of the
protein having the specified amino acid sequence.
[0311] In one embodiment, the polynucleotide probes or primers of
the invention are conjugated to detectable markers. Detectable
labels suitable for use in the present invention include any
composition detectable by spectroscopic, photochemical,
biochemical, immunochemical, electrical, optical or chemical means.
Useful labels in the present invention include biotin for staining
with labeled streptavidin conjugate, magnetic beads (e.g.,
Dynabeads.TM.), fluorescent dyes (e.g., fluorescein, texas red,
rhodamine, green fluorescent protein, and the like), radiolabels
(e.g., .sup.3H, .sup.125I, .sup.35S. .sup.14C, or .sup.32P),
enzymes (e.g., horse radish peroxidase, alkaline phosphatase and
others commonly used in an ELISA), and colorimetric labels such as
colloidal gold or colored glass or plastic (e.g., polystyrene,
polypropylene, latex, etc.) beads. Preferably, the polynucleotide
probes are immobilized on a substrate such as: artificial
membranes, organic supports, biopolymer supports and inorganic
supports.
[0312] One embodiment of the present invention relates to a
recombinant nucleic acid molecule which comprises the isolated
nucleic acid molecule described above which is operatively linked
to at least one expression control sequence. More particularly,
according to the present invention, a recombinant nucleic acid
molecule typically comprises a recombinant vector and any one or
more of the isolated nucleic acid molecules as described herein.
According to the present invention, a recombinant vector is an
engineered (i.e., artificially produced) nucleic acid molecule that
is used as a tool for manipulating a nucleic acid sequence of
choice and/or for introducing such a nucleic acid sequence into a
host cell. The recombinant vector is therefore suitable for use in
cloning, sequencing, and/or otherwise manipulating the nucleic acid
sequence of choice, such as by expressing and/or delivering the
nucleic acid sequence of choice into a host cell to form a
recombinant cell. Such a vector typically contains heterologous
nucleic acid sequences, that is, nucleic acid sequences that are
not naturally found adjacent to nucleic acid sequence to be cloned
or delivered, although the vector can also contain regulatory
nucleic acid sequences (e.g., promoters, untranslated regions)
which are naturally found adjacent to nucleic acid sequences of the
present invention or which are useful for expression of the nucleic
acid molecules of the present invention (discussed in detail
below). The vector can be either RNA or DNA, either prokaryotic or
eukaryotic, and typically is a plasmid. The vector can be
maintained as an extrachromosomal element (e.g., a plasmid) or it
can be integrated into the chromosome of a recombinant host cell,
although it is preferred if the vector remain separate from the
genome for most applications of the invention. The entire vector
can remain in place within a host cell, or under certain
conditions, the plasmid DNA can be deleted, leaving behind the
nucleic acid molecule of the present invention. An integrated
nucleic acid molecule can be under chromosomal promoter control,
under native or plasmid promoter control, or under a combination of
several promoter controls. Single or multiple copies of the nucleic
acid molecule can be integrated into the chromosome. A recombinant
vector of the present invention can contain at least one selectable
marker.
[0313] In one embodiment, a recombinant vector used in a
recombinant nucleic acid molecule of the present invention is an
expression vector. As used herein, the phrase "expression vector"
is used to refer to a vector that is suitable for production of an
encoded product (e.g., a protein of interest, such as an enzyme of
the present invention). In this embodiment, a nucleic acid sequence
encoding the product to be produced (e.g., the protein or homologue
thereof) is inserted into the recombinant vector to produce a
recombinant nucleic acid molecule. The nucleic acid sequence
encoding the protein to be produced is inserted into the vector in
a manner that operatively links the nucleic acid sequence to
regulatory sequences in the vector which enable the transcription
and translation of the nucleic acid sequence within the recombinant
host cell.
[0314] Typically, a recombinant nucleic acid molecule includes at
least one nucleic acid molecule of the present invention
operatively linked to one or more expression control sequences
(e.g., transcription control sequences or translation control
sequences). As used herein, the phrase "recombinant molecule" or
"recombinant nucleic acid molecule" primarily refers to a nucleic
acid molecule or nucleic acid sequence operatively linked to a
transcription control sequence, but can be used interchangeably
with the phrase "nucleic acid molecule", when such nucleic acid
molecule is a recombinant molecule as discussed herein. According
to the present invention, the phrase "operatively linked" refers to
linking a nucleic acid molecule to an expression control sequence
in a manner such that the molecule is able to be expressed when
transfected (i.e., transformed, transduced, transfected, conjugated
or conduced) into a host cell. Transcription control sequences are
sequences which control the initiation, elongation, or termination
of transcription. Particularly important transcription control
sequences are those which control transcription initiation, such as
promoter, enhancer, operator and repressor sequences. Suitable
transcription control sequences include any transcription control
sequence that can function in a host cell or organism into which
the recombinant nucleic acid molecule is to be introduced.
[0315] Recombinant nucleic acid molecules of the present invention
can also contain additional regulatory sequences, such as
translation regulatory sequences, origins of replication, and other
regulatory sequences that are compatible with the recombinant cell.
In one embodiment, a recombinant molecule of the present invention,
including those which are integrated into the host cell chromosome,
also contains secretory signals (i.e., signal segment nucleic acid
sequences) to enable an expressed protein to be secreted from the
cell that produces the protein. Suitable signal segments include a
signal segment that is naturally associated with the protein to be
expressed or any heterologous signal segment capable of directing
the secretion of the protein according to the present invention. In
another embodiment, a recombinant molecule of the present invention
comprises a leader sequence to enable an expressed protein to be
delivered to and inserted into the membrane of a host cell.
Suitable leader sequences include a leader sequence that is
naturally associated with the protein, or any heterologous leader
sequence capable of directing the delivery and insertion of the
protein to the membrane of a cell.
[0316] According to the present invention, the term "transfection"
is generally used to refer to any method by which an exogenous
nucleic acid molecule (i.e., a recombinant nucleic acid molecule)
can be inserted into a cell. The term "transformation" can be used
interchangeably with the term "transfection" when such term is used
to refer to the introduction of nucleic acid molecules into
microbial cells or plants and describes an inherited change due to
the acquisition of exogenous nucleic acids by the microorganism
that is essentially synonymous with the term "transfection."
Transfection techniques include, but are not limited to,
transformation, particle bombardment, electroporation,
microinjection, lipofection, adsorption, infection and protoplast
fusion.
[0317] One or more recombinant molecules of the present invention
can be used to produce an encoded product (e.g., a protein) of the
present invention. In one embodiment, an encoded product is
produced by expressing a nucleic acid molecule as described herein
under conditions effective to produce the protein. A preferred
method to produce an encoded protein is by transfecting a host cell
with one or more recombinant molecules to form a recombinant cell.
Suitable host cells to transfect include, but are not limited to,
any bacterial, fungal (e.g., filamentous fungi or yeast), plant,
insect, or animal cell that can be transfected. Host cells can be
either untransfected cells or cells that are already transfected
with at least one other recombinant nucleic acid molecule.
[0318] Suitable cells (e.g., a host cell or production organism)
include any microorganism (e.g., a bacterium, a protist, an alga, a
fungus, or other microbe), and is preferably a bacterium, a yeast
or a filamentous fungus. Suitable bacterial genera include, but are
not limited to, Escherichia, Bacillus, Lactobacillus, Pseudomonas
and Streptomyces. Suitable bacterial species include, but are not
limited to, Escherichia coli, Bacillus subtilis, Bacillus
licheniformis, Lactobacillus brevis, Pseudomonas aeruginosa and
Streptomyces lividans. Suitable genera of yeast include, but are
not limited to. Saccharomyces, Schizosacclzaromyces, Candida,
Hansenula, Pichia, Kluyveromyces, and Phaffia. Suitable yeast
species include, but are not limited to, Saccharomyces cerevisiae,
Schizosaccharomyces pombe, Candida albicans, Hansenula polymorpha,
Pichia pastoris, P. canadensis, Kluyveromyces marxianus and Phaffia
rhodozyma.
[0319] Suitable fungal genera include, but are not limited to,
Chrysosporium, Thielavia, Neurospora, Aureobasidium, Filibasidium,
Piromyces, Corynascus, Cryplococcus, Acremonium, Tolypocladium,
Scytalidium, Schizophyllum, Sporotrichum, Penicillium, Gibberella,
Myceliophthora, Mucor, Aspergillus, Fusarium, Humicola, and
Trichoderma, and anamorphs and teleomorphs thereof. Suitable fungal
species include, but are not limited to, Aspergillus niger,
Aspergillus nidulans, Aspergillus japonicus, Absidia coerulea,
Rhizopus oryzae, Chrysosporium lucknowense, Neurospora crassa,
Neurospora intermedia, Trichoderma reesei, Penicillium canescens,
Penicillium solitum, Penicillium funiculosum, and Talaromyces
flavus. In one embodiment, the host cell is a fungal cell of the
species Chrysosporium lucknowense. In one embodiment, the host cell
is a fungal cell of Strain C1 (VKM F-3500-D) or a mutant strain
derived therefrom (e.g., UV13-6 (Accession No. VKM F-3632 D);
NG7C-19 (Accession No. VKM F-3633 D); or UV18-25 (VKM F-3631D)).
Host cells can be either untransfected cells or cells that are
already transfected with at least one other recombinant nucleic
acid molecule. Additional embodiments of the present invention
include any of the genetically modified cells described herein.
[0320] In one embodiment, one or more protein(s) expressed by an
isolated nucleic acid molecule of the present invention are
produced by culturing a cell that expresses the protein (i.e., a
recombinant cell or recombinant host cell) under conditions
effective to produce the protein. In some instances, the protein
may be recovered, and in others, the cell may be harvested in
whole, either of which can be used in a composition.
[0321] Microorganisms used in the present invention (including
recombinant host cells or genetically modified microorganisms) are
cultured in an appropriate fermentation medium. An appropriate, or
effective, fermentation medium refers to any medium in which a cell
of the present invention, including a genetically modified
microorganism (described below), when cultured, is capable of
expressing enzymes useful in the present invention and/or of
catalyzing the production of sugars from lignocellulosic biomass.
Such a medium is typically an aqueous medium comprising assimilable
carbon, nitrogen and phosphate sources. Such a medium can also
include appropriate salts, minerals, metals and other nutrients.
Microorganisms and other cells of the present invention can, be
cultured in conventional fermentation bioreactors. The
microorganisms can be cultured by any fermentation process which
includes, but is not limited to, batch, fed-batch, cell recycle,
and continuous fermentation. The fermentation of microorganisms
such as fungi may be carried out in any appropriate reactor, using
methods known to those skilled in the art. For example, the
fermentation may be carried out for a period of 1 to 14 days, or
more preferably between about 3 and 10 days. The temperature of the
medium is typically maintained between about 25 and 50.degree. C.,
and more preferably between 28 and 40.degree. C. The pH of the
fermentation medium is regulated to a pH suitable for growth and
protein production of the particular organism. The fermentor can be
aerated in order to supply the oxygen necessary for fermentation
and to avoid the excessive accumulation of carbon dioxide produced
by fermentation. In addition, the aeration helps to control the
temperature and the moisture of the culture medium. In general the
fungal strains are grown in fermentors, optionally centrifuged or
filtered to remove biomass, and optionally concentrated,
formulated, and dried to produce an enzyme(s) or a multi-enzyme
composition that is a crude fermentation product. Particularly
suitable conditions for culturing filamentous fungi are described,
for example, in U.S. Pat. No. 6,015,707 and U.S. Pat. No.
6,573,086, supra.
[0322] Depending on the vector and host system used for production,
resultant proteins of the present invention may either remain
within the recombinant cell; be secreted into the culture medium;
be secreted into a space between two cellular membranes; or be
retained on the outer surface of a cell membrane. The phrase
"recovering the protein" refers to collecting the whole culture
medium containing the protein and need not imply additional steps
of separation or purification. Proteins produced according to the
present invention can be purified using a variety of standard
protein purification techniques, such as, but not limited to,
affinity chromatography, ion exchange chromatography, filtration,
electrophoresis, hydrophobic interaction chromatography, gel
filtration chromatography, reverse phase chromatography,
concanavalin A chromatography, chromatofocusing and differential
solubilization.
[0323] Proteins of the present invention are preferably retrieved,
obtained, and/or used in "substantially pure" form. As used herein,
"substantially pure" refers to a purity that allows for the
effective use of the protein in any method according to the present
invention. For a protein to be useful in any of the methods
described herein or in any method utilizing enzymes of the types
described herein according to the present invention, it is
substantially free of contaminants, other proteins and/or chemicals
that might interfere or that would interfere with its use in a
method disclosed by the present invention (e.g., that might
interfere with enzyme activity), or that at least would be
undesirable for inclusion with a protein of the present invention
(including homologues) when it is used in a method disclosed by the
present invention (described in detail below). Preferably, a
"substantially pure" protein, as referenced herein, is a protein
that can be produced by any method (i.e., by direct purification
from a natural source, recombinantly, or synthetically), and that
has been purified from other protein components such that the
protein comprises at least about 80% weight/weight of the total
protein in a given composition (e.g., the protein of interest is
about 80% of the protein in a solution/composition/buffer), and
more preferably, at least about 85%, and more preferably at least
about 90%, and more preferably at least about 91%, and more
preferably at least about 92%, and more preferably at least about
93%, and more preferably at least about 94%, and more preferably at
least about 95%, and more preferably at least about 96%, and more
preferably at least about 97%, and more preferably at least about
98%, and more preferably at least about 99%, weight/weight of the
total protein in a given composition.
[0324] It will be appreciated by one skilled in the art that use of
recombinant DNA technologies can improve control of expression of
transfected nucleic acid molecules by manipulating, for example,
the number of copies of the nucleic acid molecules within the host
cell, the efficiency with which those nucleic acid molecules are
transcribed, the efficiency with which the resultant transcripts
are translated, and the efficiency of post-translational
modifications. Additionally, the promoter sequence might be
genetically engineered to improve the level of expression as
compared to the native promoter. Recombinant techniques useful for
controlling the expression of nucleic acid molecules include, but
are not limited to, integration of the nucleic acid molecules into
one or more host cell chromosomes, addition of vector stability
sequences to plasmids, substitutions or modifications of
transcription control signals promoters, operators, enhancers),
substitutions or modifications of translational control signals
(e.g., ribosome binding sites), modification of nucleic acid
molecules to correspond to the codon usage of the host cell, and
deletion of sequences that destabilize transcripts.
[0325] Another aspect of the present invention relates to a
genetically modified microorganism that has been transfected with
one or more nucleic acid molecules of the present invention. As
used herein, a genetically modified microorganism can include a
genetically modified bacterium, yeast, filamentous fungus, or other
microbe. Such a genetically modified microorganism has a genome
which is modified (i.e., mutated or changed) from its normal (i.e.,
wild-type or naturally occurring) form such that the desired result
is achieved (i.e., increased or modified activity and/or production
of at least one an enzyme or a multi-enzyme composition for the
conversion of lignocellulosic material to fermentable sugars).
Genetic modification of a microorganism can be accomplished using
classical strain development and/or molecular genetic techniques.
Such techniques known in the art and are generally disclosed for
microorganisms, for example, in Sambrook et al., 1989, Molecular
Cloning: A Laboratory Manual, Cold Spring Harbor Labs Press or
Molecular Cloning: A Laboratory Manual, third edition (Sambrook and
Russel, 2001), (jointly referred to herein as "Sambrook"). The
references of Sambrook, ibid., are incorporated by reference herein
in its entirety. A genetically modified microorganism can include a
microorganism in which nucleic acid molecules have been inserted,
deleted or modified (i.e., mutated; e.g., by insertion, deletion,
substitution, and/or inversion of nucleotides), in such a manner
that such modifications provide the desired effect within the
microorganism.
[0326] In one embodiment, a genetically modified microorganism can
endogenously contain and express an enzyme or a multi-enzyme
composition for the conversion of lignocellulosic material to
fermentable sugars, and the genetic modification can be a genetic
modification of one or more of such endogenous enzymes, whereby the
modification has some effect on the ability of the microorganism to
convert lignocellulosic material to fermentable sugars (e.g.,
increased expression of the protein by introduction of promoters or
other expression control sequences, or modification of the coding
region by homologous recombination to increase the activity of the
encoded protein).
[0327] In another embodiment, a genetically modified microorganism
can endogenously contain and express an enzyme for the conversion
of lignocellulosic material to fermentable sugars, and the genetic
modification can be an introduction of at least one exogenous
nucleic acid sequence (e.g., a recombinant nucleic acid molecule),
wherein the exogenous nucleic acid sequence encodes at least one
additional enzyme useful for the conversion of lignocellulosic
material to fermentable sugars and/or a protein that improves the
efficiency of the enzyme for the conversion of lignocellulosic
material to fermentable sugars. In this aspect of the invention,
the microorganism can also have at least one modification to a gene
or genes comprising its endogenous enzyme(s) for the conversion of
lignocellulosic material to fermentable sugars.
[0328] In yet another embodiment, the genetically modified
microorganism does not necessarily endogenously (naturally) contain
an enzyme for the conversion of lignocellulosic material to
fermentable sugars, but is genetically modified to introduce at
least one recombinant nucleic acid molecule encoding at least one
enzyme or a multiplicity of enzymes for the conversion of
lignocellulosic material to fermentable sugars. Such a
microorganism can be used in a method of the invention, or as a
production microorganism for crude fermentation products, partially
purified recombinant enzymes, and/or purified recombinant enzymes,
any of which can then be used in a method of the present
invention.
[0329] Once the proteins (enzymes) are expressed in a host cell, a
cell extract that contains the activity to test can be generated.
For example, a lysate from the host cell is produced, and the
supernatant containing the activity is harvested and/or the
activity can be isolated from the lysate. In the case of cells that
secrete enzymes into the culture medium, the culture medium
containing them can be harvested, and/or the activity can be
purified from the culture medium. The extracts/activities prepared
in this way can be tested using assays known in the art.
Accordingly, methods to identify multi-enzyme compositions capable
of degrading lignocellulosic biomass are provided.
[0330] Artificial substrates, or complex mixtures of polymeric
carbohydrates and lignin, or actual lignocellulose can be used in
such tests. One assay that may be used to measure the release of
sugars and oligosaccharides from these complex substrates is the
dinitrosalicylic acid assay (DNS). In this assay, the
lignocellulosic material such as DDG is incubated with enzymes(s)
for various times and reducing sugars are measured.
[0331] The invention also contemplates genetically modified plants
transformed with one or more nucleic acid molecules of the
invention. The plants may be used for production of the enzymes,
and/or as the lignocellulosic material used as a substrate in the
methods of the invention. Methods to generate recombinant plants
are known in the art. For instance, numerous methods for plant
transformation have been developed, including biological and
physical transformation protocols. See, for example, Miki et al.,
"Procedures for Introducing Foreign DNA into Plants" in Methods in
Plant Molecular Biology and Biotechnology, Glick, B. R. and
Thompson, J. E. Eds. (CRC Press, Inc., Boca Raton, 1993) pp. 67-88.
In addition, vectors and in vitro culture methods for plant cell or
tissue transformation and regeneration of plants are available.
See, for example, Gruber et al., "Vectors for Plant Transformation"
in Methods in Plant Molecular Biology and Biotechnology, Glick, B.
R. and Thompson, J. E. Eds. (CRC Press, Inc., Boca Raton, 1993) pp.
89-119.
[0332] The most widely utilized method for introducing an
expression vector into plants is based on the natural
transformation system of Agrobacterium. See, for example, Horsch et
al., Science 227:1229 (1985). A. tumefaciens and A. rhizogenes are
plant pathogenic soil bacteria which genetically transform plant
cells. The Ti and Ri plasmids of A. tumefaciens and A. rhizogenes,
respectively, early genes responsible for genetic transformation of
the plant. See, for example, Kado, C. I., Crit. Rev. Plant. Sci.
10:1 (1991). Descriptions of Agrobacterium vector systems and
methods for Agrobacterium-mediated gene transfer are provided by
numerous references, including Gruber et al., supra, Miki et al.,
supra, Moloney et al., Plant Cell Reports 8:238 (1989), and U.S.
Pat. Nos. 4,940,838 and 5,464.763.
[0333] Another generally applicable method of plant transformation
is microprojectile-mediated transformation wherein DNA is carried
on the surface of microprojectiles. The expression vector is
introduced into plant tissues with a biolistic device that
accelerates the microprojectiles to speeds sufficient to penetrate
plant cell walls and membranes. Sanford et al., Part. Sci. Technol.
5:27 (1987), Sanford, J. C., Trends Biotech. 6:299 (1988), Sanford,
J. C., Physiol. Plant 79:206 (1990), Klein et al., Biotechnology
10:268 (1992).
[0334] Another method for physical delivery of DNA to plants is
sonication of target cells. Zhang et al., Bio/Technology 9:996
(1991). Alternatively, liposome or spheroplast fusion have been
used to introduce expression vectors into plants. Deshayes et al.
EMBO J., 4:2731 (1985). Christou et al., Proc Natl. Acad. Sci. USA
84:3962 (1987). Direct uptake of DNA into protoplasts using
CaCl.sub.2 precipitation, polyvinyl alcohol or poly-L-ornithine
have also been reported. Hain et al., Mol. Gen. Genet. 199:161
(1985) and Draper et al., Plant Cell Physiol. 23:451 (1.982).
Electroporation of protoplasts and whole cells and tissues have
also been described. Donn et al., In Abstracts of VIIth
International Congress on Plant Cell and Tissue Culture IAPTC,
A2-38, p. 53 (1990); D'Halluin et al., Plant Cell 4:1495-1505
(1992) and Spencer et al. Plant Mol. Biol. 24:51-61 (1994).
[0335] Another embodiment of the present invention relates to an
isolated binding agent capable of selectively binding to a protein
of the present invention. Suitable binding agents may be selected
from an antibody, an antigen binding fragment, or a binding
partner. The binding agent selectively binds to an amino acid
sequence selected from SEQ ID NO:2, SEQ ID NO:5, SEQ ID NO:8, SEQ
ID NO:11, SEQ ID NO:14, SEQ ID NO:17, SEQ ID NO:20, SEQ ID NO:23,
SEQ ID NO:26, SEQ ID NO:29. SEQ ID NO:32, SEQ ID NO:35, SEQ ID
NO:38, SEQ ID NO:41, SEQ ID NO:44, SEQ ID NO:47, SEQ ID NO:50, SEQ
ID NO:53, SEQ ID NO:56, SEQ ID NO:59, SEQ ID NO:62, SEQ ID NO:65,
SEQ ID NO:68, SEQ ID NO:71, SEQ ID NO:74, SEQ ID NO:77, SEQ ID
NO:80, SEQ ID NO:83. SEQ ID NO:86, SEQ ID NO:89, SEQ ID NO:92 or
SEQ ID NO:94, including to any fragment of any of the above
sequences comprising at least one antibody binding epitope.
[0336] According to the present invention, the phrase "selectively
binds to" refers to the ability of an antibody, antigen binding
fragment or binding partner of the present invention to
preferentially bind to specified proteins. More specifically, the
phrase "selectively binds" refers to the specific binding of one
protein to another (e.g., an antibody, fragment thereof, or binding
partner to an antigen), wherein the level of binding, as measured
by any standard assay (e.g., an immunoassay), is statistically
significantly higher than the background control for the assay. For
example, when performing an immunoassay, controls typically include
a reaction well/tube that contain antibody or antigen binding
fragment alone (i.e., in the absence of antigen), wherein an amount
of reactivity (e.g., non-specific binding to the well) by the
antibody or antigen binding fragment thereof in the absence of the
antigen is considered to be background. Binding can be measured
using a variety of methods standard in the art including enzyme
immunoassays (e.g., ELISA), immunoblot assays, etc.).
[0337] Antibodies are characterized in that they comprise
immunoglobulin domains and as such, they are members of the
immunoglobulin superfamily of proteins. An antibody of the
invention includes polyclonal and monoclonal antibodies, divalent
and monovalent antibodies, bi- or multi-specific antibodies, serum
containing such antibodies, antibodies that have been purified to
varying degrees, and any functional equivalents of whole
antibodies. Isolated antibodies of the present invention can
include serum containing such antibodies, or antibodies that have
been purified to varying degrees. Whole antibodies of the present
invention can be polyclonal or monoclonal. Alternatively,
functional equivalents of whole antibodies, such as antigen binding
fragments in which one or more antibody domains are truncated or
absent (e.g., Fv, Fab, Fab', or F(ab).sub.2 fragments), as well as
genetically-engineered antibodies or antigen binding fragments
thereof, including single chain antibodies or antibodies that can
bind to more than one epitope (e.g., bi-specific antibodies), or
antibodies that can bind to one or more different antigens (e.g.,
bi- or multi-specific antibodies), may also be employed in the
invention. Methods for the generation and production of antibodies
are well known in the art.
[0338] Monoclonal antibodies may be produced according to the
methodology of Kohler and Milstein (Nature 256:495-497, 1975).
Non-antibody polypeptides, sometimes referred to as binding
partners, are designed to bind specifically to a protein of the
invention. Examples of the design of such polypeptides, which
possess a prescribed ligand specificity are given in Beste et al.
(Proc. Natl. Acad. Sci. 96:1898-1903, 1999), incorporated herein by
reference in its entirety. In one embodiment, a binding agent of
the invention is immobilized on a substrate such as: artificial
membranes, organic supports, biopolymer supports and inorganic
supports such as for use in a screening assay.
[0339] Proteins of the present invention, at least one protein of
the present invention, compositions comprising such protein(s) of
the present invention, and multi-enzyme, compositions (examples of
which are described above) may be used in any method where it is
desirable to hydrolyze glycosidic linkages in lignocellulosic
material, or any other method wherein enzymes of the same or
similar function are useful.
[0340] In one embodiment, the present invention includes the use of
at least one protein of the present invention, compositions
comprising at least one protein of the present invention, or
multi-enzyme compositions in methods for hydrolyzing lignocellulose
and the generation of fermentable sugars therefrom. In one
embodiment, the method comprises contacting the lignocellulosic
material with an effective amount of one or more proteins of the
present invention, composition comprising at least one protein of
the present invention, or a multi-enzyme composition, whereby at
least one fermentable sugar is produced (liberated). The
lignocellulosic material may be partially or completely degraded to
fermentable sugars. Economical levels of degradation at
commercially viable costs are contemplated.
[0341] Typically, the amount of enzyme or enzyme composition
contacted with the lignocellulose will depend upon the amount of
glucan present in the lignocellulose. In some embodiments, the
amount of enzyme or enzyme composition contacted with the
lignocellulose may be from about 0.1 to about 200 mg enzyme or
enzyme composition per gram of glucan; in other embodiments, from
about 3 to about 20 mg enzyme or enzyme composition per gram of
glucan. The invention encompasses the use of any suitable or
sufficient amount of enzyme or enzyme composition between about 0.1
mg and about 200 mg enzyme per gram glucan, in increments of 0.05
mg (i.e., 0.1 mg, 0.15 mg, 0.2 mg . . . 199.9 mg, 199.95 mg, 200
mg).
[0342] In a further embodiment, the invention provides a method for
degrading DDG, preferably, but not limited to, DDG derived from
corn, to sugars. The method comprises contacting the DDG with a
protein of the present invention, a composition comprising at least
one protein of the present invention, or a multi-enzyme
composition. In certain embodiments, at least 10% of fermentable
sugars are liberated. In other embodiment, the at least 15% of the
sugars are liberated, or at least 20% of the sugars are liberated,
or at least 23% of the sugars are liberated, or at least 24% of the
sugars are liberated, or at least 25% of the sugars are liberated,
or at least 26% of the sugars are liberated, or at least 27% of the
sugars are liberated, or at least 28% of the sugars are
liberated.
[0343] In another embodiment, the invention provides a method for
producing fermentable sugars comprising cultivating a genetically
modified microorganism of the present invention in a nutrient
medium comprising a lignocellulosic material, whereby fermentable
sugars are produced.
[0344] Also provided are methods that comprise further contacting
the lignocellulosic material with at least one accessory enzyme.
Accessory enzymes have been described elsewhere herein. The
accessory enzyme or enzymes may be added at the same time, prior
to, or following the addition of a protein of the present
invention, a composition comprising at least one protein of the
present invention, or a multi-enzyme composition, or can be
expressed (endogenously or overexpressed) in a genetically modified
microorganism used in a method of the invention. When added
simultaneously, the protein of the present invention, a composition
comprising at least one protein of the present invention, or a
multi-enzyme composition will be compatible with the accessory
enzymes selected. When the enzymes are added following the
treatment with the protein of the present invention, a composition
comprising at least one protein of the present invention, or a
multi-enzyme composition, the conditions (such as temperature and
pH) may be altered to those optimal for the accessory enzyme
before, during, or after addition of the accessory enzyme. Multiple
rounds enzyme addition are also encompassed. The accessory enzyme
may also be present in the lignocellulosic material itself as a
result of genetically modifying the plant. The nutrient medium used
in a fermentation can also comprise one or more accessory
enzymes.
[0345] In some embodiments, the method comprises a pretreatment
process. In general, a pretreatment process will result in
components of the lignocellulose being more accessible for
downstream applications or so that it is more digestible by enzymes
following treatment in the absence of hydrolysis. The pretreatment
can be a chemical, physical or biological pretreatment. The
lignocellulose may have been previously treated to release some or
all of the sugars, as in the case of DDG. Physical treatments, such
as grinding, boiling, freezing, milling, vacuum infiltration, and
the like may also be used with the methods of the invention. In one
embodiment, the heat treatment comprises heating the
lignocellulosic material to 121.degree. C. for 15 minutes. A
physical treatment such as milling can allow a higher concentration
of lignocellulose to be used in the methods of the invention. A
higher concentration refers to about 20%, up to about 25%, up to
about 30%, up to about 35%, up to about 40%, up to about 45%, or up
to about 50% lignocellulose. The lignocellulose may also be
contacted with a metal ion, ultraviolet light, ozone, and the like.
Additional pretreatment processes are known to those skilled in the
art, and can include, for example, organosolv treatment, steam
explosion treatment, lime impregnation with steam explosion
treatment, hydrogen peroxide treatment, hydrogen peroxide/ozone
(peroxone) treatment, acid treatment, dilute acid treatment, and
base treatment, including ammonia fiber explosion (AFEX)
technology. Details on pretreatment technologies and processes can
be found in Wyman et al., Bioresource Tech. 96:1959 (2005); Wyman
et al., Bioresource Tech. 96:2026 (2005); Hsu, "Pretreatment of
biomass" In Handbook on Bioethanol: Production and Utilization,
Wyman, Taylor and Francis Eds., p. 179-212 (1996); and Mosier et
al., Bioresource Tech. 96:673 (2005).
[0346] In an additional embodiment, the method comprises
detoxifying the lignocellulosic material. Detoxification may be
desirable in the event that inhibitors are present in the
lignocellulosic material. Such inhibitors can be generated by a
pretreatment process, deriving from sugar degradation or are direct
released from the lignocellulose polymer. Detoxifying can include
the reduction of their formation by adjusting sugar extraction
conditions; the use of inhibitor-tolerant or inhibitor-degrading
strains of microorganisms. Detoxifying can also be accomplished by
the addition of ion exchange resins, active charcoal, enzymatic
detoxification using, e.g., laccase, and the like. In some
embodiments, the proteins, compositions or products of the present
invention further comprises detoxifying agents.
[0347] In some embodiments, the methods may be performed one or
more times in whole or in part. That is, one may perform one or
more pretreatments, followed by one or more reactions with a
protein of the present invention, composition or product of the
present invention and/or accessory enzyme. The enzymes may be added
in a single dose, or may be added in a series of small doses.
Further, the entire process may be repeated one or more times as
necessary. Therefore, one or more additional treatments with heat
and enzymes are contemplated.
[0348] The methods described above result in the production of
fermentable sugars. During, or subsequent to the methods described,
the fermentable sugars may be recovered. In the case of a
cultivation of microorganisms, the sugars can be recovered through
a continuous, batch or fed-batch method. The sugars recovered can
be concentrated or purified. Recovery may occur by any method known
in the art, including, but not limited to, washing, gravity flow,
pressure, chromatography, extraction, crystallization (e.g.,
evaporative crystallization), membrane separation, reverse osmosis,
distillation, and filtration. The sugars can be subjected further
processing; e.g., they can also be sterilized, for example, by
filtration.
[0349] In a related embodiment, the invention provides means for
improving quality of lignocellulosic material, including DDG for
animal nutrition. In one embodiment, the treated lignocellulosic
material (e.g., a lignocellulosic material which has been
saccharified) is recovered (e.g., has the soluble sugars removed).
The recovered material can be used as an animal feed additive. It
is believed that the recovered material will have beneficial
properties for animal nutrition, possibly due to a higher protein
content. In some embodiments, the amount of enzyme or enzyme
composition contacted with the lignocellulosic material may be from
about 0.0001% to about 1.0% of the weight of the lignocellulosic
material; in other embodiments, from about 0.005% to about 0.1% of
the weight of the lignocellulosic material. The invention includes
the use of any amount of enzyme or enzyme composition between about
0.0001% and about 1.0%, in increments of 0.0001 (i.e. 0.0001,
0.0002, 0.0003 . . . etc.).
[0350] In an additional embodiment, the invention provides a method
for producing an organic substance, comprising saccharifying a
lignocellulosic material with an effective amount of a protein of
the present invention or a composition comprising at least one
protein of the present invention, fermenting the saccharified
lignocellulosic material obtained with one or more fermentating
microorganisms, and recovering the organic substance from the
fermentation. Sugars released from biomass can be converted to
useful fermentation products including but not limited to amino
acids, vitamins, pharmaceuticals, animal feed supplements,
specialty chemicals, chemical feedstocks, plastics, solvents,
fuels, or other organic polymers, lactic acid, and ethanol,
including fuel ethanol. Specific products that may be produced by
the methods of the invention include, but not limited, to, biofuels
(including ethanol); lactic acid; plastics; specialty chemicals;
organic acids, including citric acid, succinic acid and maleic
acid; solvents; animal feed supplements; pharmaceuticals; vitamins;
amino acids, such as lysine, methionine, tryptophan, threonine, and
aspartic acid; industrial enzymes, such as proteases, cellulases,
amylases, glucanases, lactases, lipases, lyases, oxidoreductases,
and transferases; and chemical feedstocks. The methods of the
invention are also useful to generate feedstocks for fermentation
by fermenting microorganisms. In one embodiment, the method further
comprises the addition of at least one fermenting organism. As used
herein, "fermenting organism" refers to an organism capable of
fermentation, such as bacteria and fungi, including yeast. Such
feedstocks have additional nutritive value above the nutritive
value provided by the liberated sugars.
[0351] Proteins of the present invention and compositions
comprising at least one protein of the present invention are also
useful in a variety of other applications involving the hydrolysis
of glycosidic linkages in lignocellulosic material, such as stone
washing, color brightening, depilling and fabric softening, as well
as other applications well known in the art. Proteins of the
present invention and compositions comprising at least one protein
of the present invention are also readily amenable to use as
additives in detergent and other media used for such applications.
These and other methods of use will readily suggest themselves to
those of skill in the art based on the invention described
herein.
[0352] In one embodiment of this invention, proteins and
compositions of the present invention can be used in stone washing
procedures for fabrics or other textiles. In some embodiments, the
proteins and compositions can be used in stone washing procedures
for denim jeans. By way of example, the method for stone washing
the fabric comprises contacting the fabric with a protein or
composition of the present invention. In an additional embodiment,
the protein or composition of the present invention is included in
a detergent composition, as described below. A preferred pH range
of stone wash applications is between about 5.5 to 7.5, most
preferably at about pH 6 to about 7. One of skill in the art will
know how to regulate the amount or concentration of the protein or
composition produced by this invention based on such factors as the
activity of the enzyme and the wash conditions, including but not
limited to temperature and pH. Examples of these uses can be found
in U.S. Patent Application Publication No. 2003/0157595, the
contents of which are hereby incorporated by reference.
[0353] In yet another embodiment of this invention, the cellulase
compositions of this invention can be used to reduce or eliminate
the harshness associated with a fabric or textile by contacting the
fabric or textile with a protein or composition of the present
invention. In some embodiments, the fabric or textile may be made
from cellulose or cotton. By way of example, a preferred range for
reducing or eliminating the harshness associated with a fabric or
textile is between about pH 8 to about 12, or between about pH 10
to about 11.
[0354] The proteins or compositions of the subject invention can be
used in detergent compositions. In one embodiment, the detergent
composition may comprise at least one protein or composition of the
present invention and one or more surfactants. The detergent
compositions may also include any additional detergent ingredient
known in the art. Detergent ingredients contemplated for use with
the detergent compositions of the subject invention include, but
are not limited to, detergents, buffers, surfactants, bleaching
agents, softeners, solvents, solid forming agents, abrasives,
alkalis, inorganic electrolytes, cellulase activators,
antioxidants, builders, silicates, preservatives, and stabilizers.
The detergent compositions of this invention preferably employ a
surface active agent, i.e., surfactant, including anionic,
non-ionic, and ampholytic surfactants well known for their use in
detergent compositions. In addition to the at least one protein or
composition of the present invention and the surface active agent,
the detergent compositions of this invention can additionally
contain one or more of the following components: the enzymes
amylases, cellulases, proteinase, lipases, oxido-reductases,
peroxidases and other enzymes; cationic surfactants and long-chain
fatty acids; builders; antiredeposition agents; bleaching agents;
bluing agents and fluorescent dyes; caking inhibitors; masking
agents for factors inhibiting the cellulase activity; cellulase
activators; antioxidants; and solubilizers. In addition, perfumes,
preservatives, dyes, and the like can be used, if desired, with the
detergent compositions of this invention. Examples of detergent
compositions employing cellulases are exemplified in U.S. Pat. Nos.
4,435,307; 4,443,355; 4,661,289; 4,479,881; 5,120,463, each of
which is incorporated herein by reference in its entirety for all
purposes.
[0355] When a detergent base used in the present invention is in
the form of a powder, it may be one which is prepared by any known
preparation method including a spray-drying method and/or a
granulation method. The granulation method are the most preferred
because of the non-dusting nature of granules compared to spray dry
products. The detergent base obtained by the spray-drying method is
hollow granules which are obtained by spraying an aqueous slurry of
heat-resistant ingredients, such as surface active agents and
builders, into a hot space. The granules have a size of from about
50 to about 2000 micrometers. After the spray-drying, perfumes,
enzymes, bleaching agents, and/or inorganic alkaline builders may
be added. With a highly dense, granular detergent base obtained by
such as the spray-drying-granulation method, various ingredients
may also be added after the preparation of the base. When the
detergent base is a liquid, it may be either a homogenous solution
or an inhomogeneous solution.
[0356] Other textile applications in which proteins and
compositions of the present invention may be used include, but are
not limited to, garment dyeing applications such as enzymatic
mercerizing of viscose, bio-polishing applications, enzymatic
surface polishing; biowash (washing or washing down treatment of
textile materials), enzymatic microfibrillation, enzymatic
"cottonization" of linen, ramie and hemp; and treatment of Lyocel
or Newcell (i.e., "TENCEL" from Courtauld's), Cupro and other
cellulosic fibers or garments, dye removal from dyed cellulosic
substrates such as dyed cotton (Leisola & Linko--(1976)
Analytical Biochemistry, v. 70, p. 592. Determination Of The
Solubilizing Activity Of A Cellulase Complex With Dyed Substrates;
Blum & Stahl--Enzymic Degradation Of Cellulose Fibers; Reports
of the Shizuoka Prefectural Hamamatsu Textile Industrial Research
institute No. 24 (1985)), as a bleaching agent to make new indigo
dyed denim look old (Fujikawa--Japanese Patent Application Kokai
No. 50-132269), to enhance the bleaching action of bleaching agents
(Suzuki--Great Britain Patent No. 2 094 826), and in a process for
compositions for enzymatic desizing and bleaching of textiles
(Windbichtler et al., U.S. Pat. No. 2,974,001. Another example of
enzymatic desizing using cellulases is provided in Bhatawadekar
(May 1983) Journal of the Textile Association, pages 83-86.
[0357] The amount of enzyme or enzyme composition contacted with a
textile may vary with the particular application. Typically, for
biofinishing and denim washing applications, from about 0.02 wt. %
to about 5 wt. % of an enzyme or enzyme composition may be
contacted with the textile. In some embodiments, from about 0.5 wt.
% to about 2 wt. % of an enzyme or enzyme composition may be
contacted with the textile. For bioscouring, from about 0.1 to
about 10, or from about 0.1 to about 1.0 grams of an enzyme or
enzyme composition per kilogram of textile may be used, including
any amount between about 0.1 grams and about 10 grams, in
increments of 0.1 grams.
[0358] In other embodiments, the proteins or compositions of the
present invention can be used in the saccharification of
lignocellulose biomass from agriculture, forest products, municipal
solid waste, and other sources, for biobleaching of wood pulp, and
for de-inking of recycled print paper all by methods known to one
skilled in the art.
[0359] The amount of enzyme or enzyme composition used for pulp and
paper modification (e.g., biobleaching of wood pulp, de-inking of
paper, or biorefining of pulp for paper making) typically varies
depending upon the stock that is used, the pH and temperature of
the system, and the retention time. In certain embodiments, the
amount of enzyme or enzyme composition contacted with the paper or
pulp may be from about 0.01 to about 50 U; from about 0.1 to about
15 U; or from about 0.1 to about 5 U of enzyme or enzyme
composition per dry gram of fiber, including any amount between
about 0.0.1 and about 50 U, in 0.01 U increments. In other
embodiments, the amount of enzyme or enzyme composition contacted
with the paper or pulp may be from about 1 to about 2000 grams or
from about 100 to about 500 grams enzyme or enzyme composition per
thy ton of pulp, including any amount between about 1 and about
2000 grams, in 1 gram increments.
[0360] Proteins or compositions of the present invention can added
to wastewater to reduce the amount of solids such as sludge or to
increase total biochemical oxygen demand (BOD) and chemical oxygen
demand (COD) removal. For example, proteins or compositions of the
present invention can be used to transform particulate COD to
soluble COD in wastewater produced from grain/fruit/cellulose
industrial processes or to increase the BOD/COD ratio by increasing
waste biodegradability (soluble lower molecular weight polymers in
cellulosic/hemicellulosic wastes are typically more readily
biodegradable than non-soluble material). In biological wastewater
treatment systems, proteins or compositions of the present
invention can also be used to increase waste digestion by aerobic
and/or anaerobic bacteria.
[0361] Chitinases of the present invention (e.g., SEQ ID NOS: 56
and 59) can hydrolyze the .beta.-1,4-glycosidic linkage present in
chitin and thus may be used to degrade chitin-containing materials.
Examples of chitin-containing materials include fungal cell walls,
insect exoskeletons, the eggs of parasitic worms, and crustacean
shells.
[0362] Chitinases may be used to inhibit or reduce fungal growth,
including the treatment of fungal infections such as those caused
by nail fungi. For example, chitinases of the present invention may
be applied to any fungus or area susceptible to fungal growth.
Chitinases may also be used to coat or treat seeds and flower bulbs
to prevent the growth of fungi. Further, chitinases may be added to
fungal cultures to lower culture viscosity by increasing cell wall
degradation. Chitinases may also by used as lysing enzymes for the
generation of protoplasts from fungi (see, e.g., Yano et al.,
Biosci Biotechnol Biochem. 70:1754 (2006).
[0363] Chitinases or compositions containing chitinases may be used
as a biological control agent such as an insecticide (see, e.g.,
Kramer et al., Insect Biochem Mol Biol. 27:887 (1997). Chitinases
of the present invention have been shown to be effective for
controlling white-fly larvae in laboratory tests. Thus, chitinases
may be applied to crops, plants and the like to control insect
infestations.
[0364] Chitin has also been suggested to play a role in inducing
allergic inflammation and asthma (see Reese et al., Nature 447:92
(2007)). Accordingly, chitinases of the present invention may be
administered to a subject to reduce allergic inflammatory responses
induced by chitin or to reduce the symptoms of asthma.
[0365] Exemplary methods according to the invention are presented
below. Examples of the methods described above may also be found in
the following references, all of which are incorporated herein in
their entireties: Trichoderma & Gliocladium. Volume 2. Enzymes,
biological control and commercial applications. Editors: Gary E.
Harman, Christian P. Kubicek. Taylor & Francis Ltd 1998, 393
(in particular, chapters 14, 15 and 16) Helmut Uhlig. Industrial
enzymes and their applications. Translated and updated by Elfriede
M. Linsmaier-Bednar. John Wiley & Sons, Inc 1998, p. 454 (in
particular, chapters 5.1, 5.2, 5.3, 5.4, 5.5, 5.6, 5.7, 5.9, 5.10,
5.11, and 5.13) For saccharification applications: Hahn-Hagerdal,
B., Galbe, M., Gorwa-Grauslund, M. F. Liden, Zacchi, G.
Bio-ethanol--the fuel of tomorrow from the residues of today.
Trends in Biotechnology. 2006, 24 (12), 549-556; Mielenz, J. R.
Ethanol production from biomass: technology and commercialization
status. Current Opinion in Microbiology. 2001, 4, 324-329; Himmel.
M. E., Ruth, M. F., Wyman, C. E. Cellulase for commodity products
from cellulosic biomass. Current Opinion in Biotechnology. 1999,
10, 358-364; Sheehan, J., Himmel, M. Enzymes, energy, and the
environment: a strategic perspective on the U.S. Department of
Energy's Research and Development Activities for Bioethanol.
Biotechnology Progress. 1999, 15, 817-827. For textile processing
applications: Galante, Y. M. Formantici, C. Enzyme applications in
detergency and in manufacturing industries. Current Organic
Chemistry. 2003, 7, 1399-1422. For pulp and paper applications:
Bajpai, P., Bajpai, P. K Deinking with enzymes: a review. TAPPI
Journal. 1998, 81(12), 111-117; Viikari, L., Pere, J., Suurnakki,
A., Oksanen, T., Buchert, J. Use of cellulases in pulp and paper
applications. In: "Carbohydrates from Trichoderma reesei and other
microorganisms. Structure, Biochemistry, Genetics and
Applications." Editors: Mark Claessens, Wim Nerinckx, and Kathleen
Piens. The Royal Society of Chemistry 1998, 245-254. For food and
beverage applications: Roller, S., Dea, I. C. M. Biotechnology in
the production and modification of biopolymers for foods. Critical
Reviews in Biotechnology. 1992, 12(3), 261-277.
[0366] Each publication or reference cited herein is incorporated
herein by reference in its entirety for all purposes.
[0367] The following examples are provided for the purpose of
illustration and are not intended to limit the scope of the present
invention.
EXAMPLES
Example 1
[0368] The following example illustrates the Somogyi-Nelson assay
used to measure beta-glucanase enzymatic activity.
[0369] This assay measures the release of reducing sugars (as
glucose equivalents) by the action of .beta.-glucanase on a soluble
.beta.-glucan. One .beta.-glucanase unit of activity is the amount
of enzyme that liberates 1 micromole of reducing sugars, expressed
as glucose equivalents, in one minute at 50.degree. C. and pH
5.0.
Reagents
[0370] Acetate buffer (0.05 M, pH 5.0) is prepared as follows. 4.1
g of anhydrous sodium acetate or 6.8 g of sodium acetate *3H.sub.2O
is dissolved in distilled water so that the final volume of the
solution is 1000 mL (Solution A). In a separate flask, 3.0 g (2.86
mL) of glacial acetic acid is mixed with distilled water to make a
total volume of 1000 mL (Solution B). The final 0.05 M acetate
buffer, pH. 5.0, is prepared by mixing Solution A with Solution B
until the pH of the resulting solution is equal to 5.0.
[0371] .beta.-glucan from barley (medium viscosity, Megazyme,
P-BGBM) is used as the assay substrate. The substrate is milled in
a mortar before dissolving. 1% w/v stock solution is prepared as
follows. 500 mg of .beta.-glucan is mixed with 50 mL of distilled
water and agitated on a magnetic stirrer for 1 hour. After that,
the suspension is placed into a Bandelin SONOREX TK52 ultrasonic
bath for 10 minutes to destroy undissolved particles and then into
a boiling water bath until the solution becomes clear. Finally, the
solution is stirred on a magnetic stirrer for 30 minutes. The
solution is typically stable for 2 days.
[0372] The Somogyi reagent is prepared as follows. 24 g of
anhydrous sodium carbonate and 1.2 g of sodium-potassium tartrate
tetrahydrate are dissolved in 250 mL of distilled water. A solution
of copper sulfate pentahydrate (4 g in 40 mL of distilled water) is
added on stirring, and then 16 g of sodium bicarbonate is dissolved
to obtain Solution A. In a separate flask, 180 g of sodium sulfate
is dissolved in 500 mL of hot (.about.80.degree. C.) distilled
water and boiled for 5 minutes (Solution B). Solution A is mixed
with Solution B and the final volume is adjusted to 1.000 mL. The
reagent is stable for 2-3 months on storing in a dark glass
vessel.
[0373] The Nelson reagent is prepared as follows. 50 g of ammonium
molybdate is dissolved in 900 mL of hot (.about.60.degree. C.)
distilled water. The solution is cooled to 5-10.degree. C. and 42 g
of concentrated sulfuric acid containing 6 g of sodium arsenate are
carefully added on stirring. The volume of the mixture is adjusted
to 1000 mL. The flask is incubated at 40.degree. C. for 48 hours
and then the solution is filtered if necessary. The reagent is
stable for 2-3 months.
[0374] Using the above reagents, the assay is performed as detailed
below.
Reagent Blank
[0375] 0.5 mL of 0.05 M sodium acetate buffer is mixed in a
graduated glass test tube (1.5 cm.times.1.5 cm) with 0.5 mL of
Somogyi reagent and incubated in a boiling water bath for 40
minutes. Then, the test tube is cooled in an ice or cold water bath
and 0.5 mL of Nelson reagent is added and mixed by shaking
manually. After 10 minutes incubation at room temperature, the
solution volume is adjusted to 5 mL with distilled water.
Enzyme Sample
[0376] 0.25 mL of 1% w/v .beta.-glucan stock solution is mixed with
0.15 mL of 0.05 M acetate buffer, pH 5.0, in graduated glass test
tubes (1.5 cm.times.15 cm) and preheated at 50.degree. C. for 5
minutes. Enzyme sample is suitably diluted by the 0.05 M sodium
acetate buffer (the dilution of the enzyme sample is chosen so that
A.sub.610 should be between 0.05 and 0.4 units of optical density).
0.1 mL of suitably diluted enzyme sample is added to 0.4 mL of the
preheated substrate solution, mixed and incubated at 50.degree. C.
for 10 minutes. After exactly 10 minutes of incubation, 0.5 mL of
Somogyi reagent is added, and test tubes are placed into a boiling
water bath for 40 minutes. Then, the test tubes are cooled in an
ice or cold water bath and 0.5 mL of Nelson reagent is added and
mixed by shaking manually. After 10 minutes incubation at room
temperature, the solution volume is adjusted to 5 mL with distilled
water. Then, the absorbance at 610 nm (A.sub.610) is measured
against the reagent blank as A.sub.S (enzyme sample).
Enzyme Blank
[0377] 0.4 mL of 0.05 M sodium acetate buffer is mixed in a
graduated glass test tube (1.5 cm.times.15 cm) with 0.1 mL of
enzyme sample having the same dilution as above. 0.5 mL of Somogyi
reagent is added and the test tube is incubated in a boiling water
bath for 40 minutes. Then, the test tube is cooled in an ice or
cold water bath and 0.5 mL of Nelson reagent is added and mixed by
shaking manually. After 10 minutes incubation at room temperature,
the solution volume is adjusted to 5 mL with distilled water. Then,
the absorbance at 610 nm (A.sub.610) is measured against the
reagent blank as A.sub.EB (enzyme blank).
Substrate Blank
[0378] 0.25 mL of 1% w/v .beta.-glucan stock solution is mixed with
0.25 mL of 0.05 M acetate buffer, pH 5.0, in a graduated glass test
tube (1.5 cm.times.15 cm), 0.5 mL of Somogyi reagent is added, and
the test tube is placed into a boiling water bath for 40 minutes.
Then, the test tube is cooled in an ice or cold water bath and 0.5
mL of Nelson reagent is added and mixed by shaking manually. After
10 minutes incubation at room temperature, the solution volume is
adjusted to 5 mL with distilled water. Then, the absorbance at 610
nm (A.sub.610) is measured against the reagent blank as A.sub.SB
(substrate blank). The value of A.sub.SB should not exceed 0.05
units of optical density.
[0379] All enzyme samples and enzyme and substrate blanks should be
boiled together.
Standard Calibration Plot
[0380] Stock glucose solution (1 mg/mL) is prepared by dissolving
100 mg of D-glucose in 100 mL of 0.05 M acetate buffer, pH 5.0.
Then, different dilutions of stock glucose solution are prepared as
follows:
TABLE-US-00022 0.05M acetate Stock solution (mL) buffer (mL)
Glucose concentration (mg/mL) 0.05 4.95 0.01 0.10 4.90 0.02 0.15
4.85 0.03 0.20 4.80 0.04 0.25 4.75 0.05 0.30 4.70 0.06 0.35 4.65
0.07 0.40 4.60 0.08 0.45 4.55 0.09 0.50 4.50 0.10
[0381] Samples (0.5 mL) of glucose solution with different
dilutions are placed to graduated glass test tubes (1.5 cm.times.15
cm). 0.5 mL of Somogyi reagent is added and test tubes are placed
into a boiling water bath for 40 minutes. Then, the test tubes are
cooled in an ice or cold water bath and 0.5 mL of Nelson reagent is
added and mixed by shaking manually. After 10 minutes incubation at
room temperature, the solution volume is adjusted to 5 mL with
distilled water. Then, the absorbance at 610 nm (A.sub.610) is
measured against the reagent blank. The following typical A.sub.610
data obtained with different glucose dilutions are given below:
TABLE-US-00023 Glucose, mg/mL 0.01 0.02 0.03 0.04 0.05 0.06 0.07
0.08 0.09 0.10 A.sub.610 0.11 0.20 0.28 0.41 0.48 0.60 0.72 0.79
0.93 1.06
Absorbance (A.sub.610) on the y-axis versus glucose concentration
(G, mg/mL) on the x-axis is plotted to generate a standard
calibration plot such as that shown in the FIG. 11. From FIG. 11,
the inverse of the slop (1/a=0.098) is calculated.
Calculation of Activity
[0382] Activity is calculated as follows: activity
(IU/mL)=1/a*.DELTA.A.sub.610*DF*2.78, where 1/a-inverse of the
slope calculated from the standard calibration plot,
.DELTA.A.sub.610=A.sub.S (enzyme sample)-A.sub.EB (enzyme
blank)-A.sub.SB (substrate blank), DF is the enzyme dilution
factor, and 2.78 is the coefficient that accounts for additional
5-fold enzyme dilution in the reaction mixture (0.1 mL enzyme+0.4
mL substrate), reaction time (10 minutes) and molecular weight of
glucose (0.18016 mg/micromole), i.e. 5/(10*0.18016)=2.78
[0383] An example calculation is supplied below.
TABLE-US-00024 B4 UF concentrate DF .DELTA.A.sub.610 20,000 0.545
40,000 0.316 80,000 0.154 .beta.-glucanase activity (IU/mL) = 0.098
* 0.316 * 40000 * 2.78 = 3444
Example 2
[0384] The following example illustrates the assays used to measure
beta-glucosidase, .alpha.-galactosidase and .beta.-galactosidase
enzymatic activities.
[0385] This assay measures the release of p-nitrophenol by the
action of .beta.-glucosidase on p-nitrophenyl
.beta.-D-glucopyranoside (PNPG). Modifications to the assay for
measuring .alpha.-galactosidase and .beta.-galactosidase enzymatic
activities are discussed below. One .beta.-glucosidase unit of
activity is the amount of enzyme that liberates 1 micromole of
p-nitrophenol in one minute at 40.degree. C. and pH 5.0.
Reagents
[0386] Acetate buffer (0.1 M, pH 5.0) is prepared as follows. 8.2 g
of anhydrous sodium acetate or 13.6 g of sodium acetate *3H.sub.2O
is dissolved in distilled water so that the final volume of the
solution to be 1000 mL (Solution A). In a separate flask, 6.0 g
(5.72 mL) of glacial acetic acid is mixed with distilled water to
make the total volume of 1000 mL (Solution B). The final 0.1 M
acetate buffer, pH 5.0, is prepared by mixing Solution A with
Solution B until the pH of the resulting solution is equal to
5.0.
[0387] PNPG from Sigma (USA) is used as the assay substrate. 6 mg
of PNPG is dissolved in 2 mL of distilled water using magnetic
stirrer to obtain 10 mM stock solution. The solution is stable for
2 days on storage at 4.degree. C.
[0388] The stop reagent (1 M sodium carbonate solution) is prepared
as follows. 10.6 g of anhydrous sodium carbonate is dissolved in 80
mL of distilled water, and the solution volume is adjusted to 100
mL. This reagent is used to terminate the enzymatic reaction.
[0389] Using the above reagents, the assay is performed as detailed
below.
Enzyme Sample
[0390] 0.09 mL of 10 mM PNPG stock solution is mixed with 0.81 mL
of 0.1 M acetate buffer, pH 5.0, and preheated at 40.degree. C. for
5 minutes. The enzyme sample is suitably diluted with the 0.1 M
sodium acetate buffer (the dilution of the enzyme sample is chosen
so that A.sub.400 should be between 0.05 and 0.35 units of optical
density). 0.1 mL of suitably diluted enzyme sample is added to 0.9
mL of the preheated substrate solution, mixed and incubated at
40.degree. C. for 10 minutes. After exactly 10 minutes of
incubation, 0.5 mL of 1 M sodium carbonate solution is added and
then the absorbance at 400 nm (A.sub.400) is measured as A.sub.S
(enzyme sample).
Substrate Blank
[0391] 0.09 mL of 10 mM PNPG stock solution is mixed with 0.91 mL
of 0.1 M acetate buffer, pH 5.0. Then, 0.5 mL of 1 M sodium
carbonate solution is added and the absorbance at 400 nm
(A.sub.400) is measured as A.sub.SB (substrate blank).
Calculation of Activity
[0392] Activity is calculated as follows; activity (1
U/mL)=.DELTA.A.sub.400*DF*0.082, where .DELTA.A.sub.400==A.sub.S
(enzyme sample)-A.sub.SB (substrate blank), DF is the enzyme
dilution factor, and 0.082 is the coefficient obtained as
follows:
0.082 = 10 6 18300 * 10 * ( 0.1 / 0.0015 ) ##EQU00001##
where 10.sup.6 is factor used to convert moles/L in the extinction
coefficient to micromoles/L, 18300 M.sup.-1 cm.sup.-1 is the
extinction coefficient of p-nitrophenol released, 10 minutes is the
reaction time, and 0.1 mL is the volume of enzyme solution added to
0.0015 L of total assay volume.
[0393] An example calculation is supplied below.
TABLE-US-00025 B4 UF concentrate DF .DELTA.A.sub.400 5,000 0.495
10,000 0.268 20,000 0.144 For DF = 10,000: .beta.-glucosidase
activity (IU/mL) = 0.268 * 10000 * 0.082 = 220 For DF = 20,000:
.beta.-glucosidase activity (IU/mL) = 0.144 * 20000 * 0.082 =
236
Finally, .beta.-glucosidase activity=228 IU/mL, as a mean value
between two measurements.
.alpha.- and .beta.-Galactosidase Assays
[0394] The assay procedures for .alpha.- and .beta.-galactosidases
are the same as the .beta.-glucosidase assay described above,
except that p-nitrophenyl .alpha.-D-galactopyranoside and
p-nitrophenyl .beta.-D-galactopyranoside are used as substrates,
respectively, instead of p-nitrophenyl
.beta.-D-glucopyranoside.
Example 3
[0395] The following example illustrates the Somogyi-Nelson assay
used to measure CMCase enzymatic activity.
[0396] This assay measures the release of reducing sugars (as
glucose equivalents) by the action of cellulase on a soluble
cellulosic substrate (CIVIC). One CMCase unit of activity is the
amount of enzyme that liberates 1 micromole of reducing sugars,
expressed as glucose equivalents, in one minute at 50.degree. C.,
pH 5.0, 0.05 M sodium acetate buffer.
Reagents
[0397] Acetate buffer (0.05 M, pH 5.0) is prepared as follows. 4.1
g of anhydrous sodium acetate or 6.8 g of sodium acetate *3H.sub.2O
is dissolved in distilled water so that the final volume of the
solution to be 1000 mL (Solution A). In a separate flask, 3.0 g
(2.86 mL) of glacial acetic acid is mixed with distilled water to
make the total volume of 1000 mL (Solution B). The final 0.05 M
acetate buffer, pH 5.0, is prepared by mixing Solution A with
Solution B until the pH of the resulting solution is equal to
5.0.
[0398] The sodium salt of carboxymethyl cellulose (CMC, medium
viscosity, Sigma, C 4888) is used as the substrate. A 1% w/v stock
solution is prepared in 0.05 M acetate buffer, pH 5.0, by adding
gradually 1.0 g of CMC to 99 mL of the buffer agitated on a
magnetic stirrer. The mixture is stirred until the CMC is
completely dissolved. The solution is stable for 2 days at
4.degree. C.
[0399] The Somogyi reagent is prepared as follows. 24 g of
anhydrous sodium carbonate and 12 g of sodium-potassium tartrate
tetrahydrate are dissolved in 250 mL of distilled water. The
solution of copper sulfate pentahydrate (4 g in 40 mL of distilled
water) is added on stirring, and then 16 g of sodium bicarbonate is
dissolved to obtain Solution A. In a separate flask, 18.0 g of
sodium sulfate is dissolved in 500 mL of hot (.about.80.degree. C.)
distilled water and boiled for 5 min (Solution B). Solution A is
mixed with Solution B and the final volume is adjusted to 1000 mL.
The reagent is stable for 2-3 months on storing in a dark glass
vessel at room temperature.
[0400] The Nelson reagent is prepared as follows. 50 g of ammonium
molybdate is dissolved in 900 mL of hot (.about.60.degree. C.)
distilled water. The solution is cooled to 5-10.degree. C. and 42 g
of concentrated sulfuric acid containing 6 g of sodium arsenate are
carefully added on stirring. The volume of the mixture is adjusted
to 1000 mL. The flask is incubated at 40.degree. C. for 48 hours
and then the solution is filtered if necessary. The reagent is
stable for 2-3 months at room temperature.
[0401] Using the above reagents, the assay is performed as detailed
below.
Reagent Blank
[0402] 0.5 mL of 0.05 M sodium acetate buffer is mixed in a
graduated glass test tube (1.5 cm.times.15 cm) with 0.5 mL of
Somogyi reagent and incubated in a boiling water bath for 40
minutes. Then, the test tube is cooled in an ice or cold water bath
and 0.5 mL of Nelson reagent is added and mixed by shaking
manually. After 10 minutes incubation at room temperature, the
solution volume is adjusted to 51 mL with distilled water.
Enzyme Sample
[0403] 0.25 mL of 1% w/v CMC stock solution is preheated in
graduated glass test tubes (1.5 cm.times.15 cm) at 50.degree. C.
for 5 minutes. Enzyme sample is suitably diluted by the 0.05 M
sodium acetate buffer (the dilution of the enzyme sample is chosen
so that A.sub.610 should be between 0.05 and 0.35 units of optical
density) and preheated at 50.degree. C. for 5 minutes. 0.25 mL of
suitably diluted enzyme sample is added to 0.25 mL of preheated.
CMC stock solution, mixed and incubated at 50.degree. C. for 5
minutes. After exactly 5 minutes of incubation, 0.5 mL of Somogyi
reagent is added, and test tubes are placed into a boiling water
bath for 40 minutes. Then, the test tubes are cooled in an ice or
cold water bath and 0.5 mL of Nelson reagent is added and mixed by
shaking manually. After 10 minutes incubation at room temperature,
1 mL of acetone is added to dissolve CMC precipitated during the
boiling procedure and the solution volume is adjusted to 5 mL with
distilled water. Then, the absorbance at 610 nm (A.sub.610) is
measured against the reagent blank as A.sub.S (enzyme sample).
Enzyme Blank
[0404] 0.25 mL of 0.05 M sodium acetate buffer is mixed in a
graduated glass test tube (1.5 cm.times.1.5 cm) with 0.25 mL of
enzyme sample having the same dilution as above. 0.5 mL of Somogyi
reagent is added and the test tube is incubated in a boiling water
bath for 40 minutes. Then, the test tube is cooled in an ice or
cold water bath and 0.5 mL of Nelson reagent is added and mixed by
shaking manually. After 10 minutes incubation at room temperature,
the solution volume is adjusted to 5 mL with distilled water. Then,
the absorbance at 610 nm (A.sub.610) is measured against the
reagent blank as A.sub.B (enzyme blank).
[0405] All enzyme samples and enzyme blanks should be boiled
together.
Standard Calibration Plot
[0406] A stock glucose solution (1 mg/mL) is prepared by dissolving
100 mg of D-glucose in 100 mL of 0.05 M acetate buffer, pH 5.0.
Then, different dilutions of stock glucose solution are prepared as
follows:
TABLE-US-00026 0.05M acetate Stock solution (mL) buffer (mL)
Glucose concentration (mg/mL) 0.05 4.95 0.01 0.10 4.90 0.02 0.15
4.85 0.03 0.20 4.80 0.04 0.25 4.75 0.05 0.30 4.70 0.06 0.35 4.65
0.07 0.40 4.60 0.08 0.45 4.55 0.09 0.50 4.50 0.10
[0407] Samples (0.5 mL) of glucose solution with different
dilutions are placed to graduated glass test tubes (1.5 cm.times.15
cm). 0.5 mL of Somogyi reagent is added and test tubes are placed
into a boiling water bath for 40 minutes. Then, the test tubes are
cooled in an ice or cold water bath and 0.5 mL of Nelson reagent is
added and mixed by shaking manually. After 10 minutes incubation at
room temperature, the solution volume is adjusted to 5 mL with
distilled water. Then, the absorbance at 610 nm (A.sub.610) is
measured against the reagent blank. The following typical A.sub.610
data obtained with different glucose dilutions are given below:
TABLE-US-00027 Glucose, mg/mL 0.01 0.02 0.03 0.04 0.05 0.06 0.07
0.08 0.09 0.10 A.sub.610 0.11 0.20 0.28 0.41 0.48 0.60 0.72 0.79
0.93 1.06
Absorbance (A.sub.610) on the y-axis versus glucose concentration
(G, mg/mL) on the x-axis is plotted to generate a standard
calibration plot such as that shown in the FIG. 11. From FIG. 11,
the inverse of the slop (1/a=0.098) is calculated.
Calculation of Activity
[0408] Activity is calculated as follows: CMCase
(IU/mL)=1/a*.DELTA.A.sub.610*DF*2.22, where 1/a-inverse of the
slope calculated from the standard calibration plot,
.DELTA.A.sub.610=A.sub.S (enzyme sample)-A.sub.B (enzyme blank).
DF--enzyme dilution factor, and 2.22 is the coefficient that
accounts for additional 2-fold enzyme dilution in the reaction
mixture (0.25 mL enzyme 0.25 mL substrate), reaction time (5
minutes) and molecular weight of glucose (0.1801.6 mg/micromole),
i.e. 2/(5*0.18016)=2.22.
[0409] An example calculation is supplied below.
TABLE-US-00028 ACE sample DF .DELTA.A.sub.610 40,000 0.41 80,000
0.25 CMCase activity (IU/mL) = 0.098*0.25*80000*2.22 = 4351
Example 4
[0410] The following example illustrates the Lowry protein
assay.
[0411] The Lowry procedure is one of the most known and widely used
protein assays, being first described in 1951 (Lowry O H,
Rosebrough N J, Farr A L, Randall R J (1951) J. Biol. Chem. 193:
265-275). Under alkaline conditions, copper complexes with protein.
When Folin's phenol reagent (phospho-molybdic-phosphotungstic
reagent) is added, it binds to the protein. Bound reagent is slowly
reduced and changes color from yellow to blue.
Stock Solutions and Reagents
[0412] Lowry A stock solution is 2% Na.sub.2CO.sub.3 in 0.1 M NaOH.
Lowry B stock solution is 1% CuSO.sub.4.5H.sub.2O in distilled
water. Lowry C stock solution is 2% sodium potassium tartrate
(NaKC.sub.4H.sub.4O.sub.6.4H.sub.2O) in distilled water. Lowry
stock reagent is prepared fresh daily by mixing 49 ml. Lowry A, 0.5
ml Lowry B and 0.5 ml Lowry C. Commercial. Folin's Reagent: Phenol
reagent--2N (Folin--Ciocalteau reagent) is diluted 1:1 in distilled
water before use and stored in a refrigerator.
[0413] Bovine serum albumin (BSA) is used as the assay standard.
When BSA is weighed out before use, the protein preparation used to
prepare the standard should be salt-free to avoid inaccurate
results. BSA is dissolved at a concentration of 1 mg/ml in a buffer
similar to the solution for analysis. A series of standard
dilutions is prepared where the protein (BSA) concentration is
varied in the range of 0.05-0.5 mg/ml.
Assay Procedure
[0414] Using the above reagents, the assay is performed as follows.
100 .mu.l of sample is added to each tube followed by 1.0 ml of
Lowry stock reagent and the mixture is incubate 10 minutes at room
temperature. 100 .mu.l of Folin's reagent is added to each tube and
the tubes are further incubated for 40 minutes at room temperature.
A absorbance at 750 nm is read on a spectrophotometer against a
blank (control) prepared in the same way as the sample except the
buffer is added to the tube instead of the analyzed protein.
Protein concentration is the determined by reading the absorbance
at 750 nm for each sample and plotting the result on a BSA
calibration graph prepared from the standard dilutions discussed
above.
[0415] The Lowry procedure should be carried out with caution since
it is subjected to interference by a wide variety of chemicals.
Among the chemicals reported to interfere with the Lowry procedure
are barbital, CAPS, cesium chloride, citrate, cysteine,
diethanolamine, dithiothreitol, EDTA, EGTA, HEPES, mercaptoethanol,
Nonidet P-40, phenol, polyvinyl pyrrolidone, sodium deoxycholate,
sodium salicylate, thimerosol, Tricine, TRIS and Triton X-100.
Example 5
[0416] The following example illustrates the Somogyi-Nelson assay
used to measure avicelase activity.
[0417] The assay measures the velocity of release of reducing
sugars by the action of cellulases on insoluble Avicel cellulose
(micro crystalline cellulose) when reducing sugars are assayed by
Somogyi-Nelson method as glucose equivalents. One unit of avicelase
activity is equal to the amount of enzyme that liberates 1
micromole of reducing sugars, expressed as glucose equivalents
(when reducing sugars assayed by Somogyi-Nelson method) in one
minute at 40.degree. C., pH 5.0 (0.05 M sodium acetate buffer).
Reagents
[0418] Acetate buffer (0.05 M, pH 5.0) is prepared as follows. 4.1
g of anhydrous sodium acetate or 6.8 g of sodium acetate *3H.sub.2O
is dissolved in distilled water so that the final volume of the
solution to be 1000 mL (Solution A). In a separate flask, 3.0 g
(2.86 mL) of glacial acetic acid is mixed with distilled water to
make the total volume of 1000 mL (Solution B). The final 0.05 M
acetate buffer, pH 5.0, is prepared by mixing Solution A with
Solution B until the pH of the resulting solution is equal to
5.0.
[0419] Avicel PH 105 (Serva, FRG) (or any other available source of
Avicel) is used as the assay substrate. 1% w/v of stock suspension
of Avicel is prepared in 0.05 M acetate buffer, pH 5.0, by adding
50 mg of thy Avicel gradually to a vessel with 5 mL of the buffer
agitated on a magnetic stirrer. After that the obtained Avicel
suspension is mixed on a magnetic stirrer during 1 hour at room
temperature (for swelling of Avicel). 1% w/v stock suspension of
Avicel can be stored in covered vessel for 5-7 days at 4.degree.
C.
[0420] The Somogyi reagent is prepared as follows. 24 g of
anhydrous sodium carbonate and 12 g of sodium-potassium tartrate
*4H.sub.2O are dissolved in 250 mL of distilled water. The solution
of copper sulfate *5H.sub.2O (4 g in 40 mL of distilled water) is
added on stirring, and then 16 g of sodium bicarbonate is dissolved
(Solution A). In a separate flask, 18.0 g of sodium sulfate is
dissolved in 500 mL of hot (-80.degree. C.) distilled water and
boiled for 5 minutes (Solution B). Solution A is mixed with
Solution B and the final volume is adjusted to 1000 mL by distill
water. The reagent is stable for 2-3 months on storing in a dark
glass vessel at room temperature.
[0421] The Nelson reagent is prepared as follows. 50 g of ammonium
molybdate is dissolved in 900 mL of hot (.about.60.degree. C.)
distilled water. The solution is cooled to 5-10.degree. C. and 42
mL of concentrated sulfuric acid containing 6 g of sodium arsenate
are carefully added with stirring. The volume of the mixture is
adjusted to 1000 mL with distilled water. The flask with this
solution is incubated at 40.degree. C. for 48 hours and then the
solution is filtered if necessary. The reagent is stable for 2-3
months at room temperature.
[0422] Using the above reagents, the assay is performed as detailed
below.
Buffer Blank
[0423] 0.2 mL of 0.05 M sodium acetate buffer (pH 5.0) is mixed in
a graduated glass test tube with 0.2 mL of Somogyi reagent,
covered, and incubated in a boiling water bath for 40 minutes.
Then, the test tube is cooled in a cold-water bath and 0.2 mL of
Nelson reagent is added and mixed by shaking manually. After 10
minutes incubation at room temperature, the solution volume is
adjusted to 2 mL with distilled water. The obtained solution is
used as a buffer blank.
Determination of Avicelase Activity
[0424] Avicelase activity is determined by carrying out of the
reaction of Avicel hydrolysis at 40.degree. C. and pH 5.0 for 60
minutes while mixing. A 1% w/v stock suspension of Avicel is mixed
on magnetic stirrer for 10 minutes, then 0.25 mL of this 1% w/v
Avicel stock solution is placed in a graduated glass test tube.
0.15 mL of 0.05 M acetate buffer (pH 5.0) is added to the tube and
the mixture is heated to 40.degree. C. for 10 minutes with
stirring. 0.1 mL of enzyme sample, suitably diluted and preheated
to 40.degree. C. for 10 minutes, is placed into the tube. The
reaction mixture is then incubated for 1 hour at 40.degree. C. with
stirring. After 1 hour, the reaction mixture is centrifuged 2 min
at 12000 rpm, 0.2 mL of supernatant is withdrawn and placed in a
graduated glass test tube. 0.2 mL of Somogyi reagent is added and
the mixture incubated, covered, in a boiling water bath for 40
minutes. Then, the tube is cooled in a cold water bath and 0.2 mL
of Nelson reagent is added and mixed by shaking manually. After 1.0
minutes incubation at room temperature, the solution volume is
adjusted to 2 mL with distilled water. Then, the absorbance at 6.10
nm (A.sub.610) is measured against the buffer blank as A.sub.ES
(enzyme sample). The enzyme sample must be diluted with the 0.05 M
sodium acetate buffer so that A.sub.610 is between 0.5 and 1.5
units of optical density at 610 nm.
Enzyme Blank
[0425] 0.16 mL of 0.05 M sodium acetate buffer (pH 5.0) is mixed in
a graduated glass test tube with 0.04 mL of enzyme sample having
the same dilution rate as above. 0.2 mL of Somogyi reagent is added
to the test tube, and the tube is incubated, covered, in a boiling
water bath for 40 minutes. Then, the test tube is cooled in a cold
water bath, 0.2 mL of Nelson reagent is added and the tube is mixed
by shaking manually. After 10 minutes incubation at room
temperature, the solution volume is adjusted to 2 mL with distilled
water. Then, the absorbance at 610 nm (A.sub.610) is measured
against the buffer blank as A.sub.E (enzyme blank).
Substrate Blank
[0426] A 1% w/v stock suspension of Avicel is mixed using a
magnetic stirrer for 10 minutes, then 0.25 mL of the 1% w/v Avicel
stock solution is placed into a graduated glass test tube. 0.25 mL
of 0.05 M acetate buffer (pH 5.0) is added to the tube and the
mixture is heated to 40.degree. C. for 10 minutes with stirring.
0.1 mL of enzyme sample, suitably diluted and preheated to
40.degree. C. for 10 minutes, is placed into the tube. The reaction
mixture is then incubated for 1 hour at 40.degree. C. with
stirring. After 1 hour, the reaction mixture is centrifuged 2 min
at 1.2000 rpm, 0.2 mL of supernatant is withdrawn and placed in a
graduated glass test tube. 0.2 mL of Somogyi reagent is added and
the mixture incubated, covered, in a boiling water bath for 40
minutes. Then, the tube is cooled in a cold water bath and 0.2 mL
of Nelson reagent is added and mixed by shaking manually. After 10
minutes incubation at room temperature, the solution volume is
adjusted to 2 mL with distilled water. Then, the absorbance at 610
nm (A.sub.610) is measured against the buffer blank as A.sub.S
(substrate blank).
Standard Calibration Plot
[0427] Stock glucose solution (0.5 mg/mL) is prepared by dissolving
10 mg of D-glucose in 20 mL of 0.05 M acetate buffer, pH 5.0. Then,
different dilutions of stock glucose solution are prepared as
follows:
TABLE-US-00029 0.05M acetate Stock solution (mL) buffer (mL)
Glucose concentration (mg/mL) 0.020 0.180 0.02 0.040 0.160 0.04
0.060 0.140 0.06 0.080 0.120 0.08 0.100 0.100 0.10 0.120 0.080 0.12
0.140 0.060 0.14 0.160 0.040 0.16 0.180 0.020 0.18 0.200 0 0.20
Samples (0.2 mL) of glucose solution with different dilutions are
placed to graduated glass test tubes. 0.2 mL of Somogyi reagent is
added and the test tubes are placed into a boiling water bath for
40 minutes. Then, the test tubes are cooled in a cold water bath
and 0.2 mL of Nelson reagent is added to each tube, and the tubes
mixed by shaking manually. After 10 minutes incubation at room
temperature, the solution volume in each tube is adjusted to 2 mL
with distilled water. Then, the absorbance at 610 nm (A.sub.610) is
measured against the buffer blank. The following typical A.sub.610
data obtained with different glucose dilutions are given below:
TABLE-US-00030 Glucose, mg/mL 0.02 0.04 0.06 0.08 0.10 0.12 0.14
0.16 0.18 0.20 A.sub.610 0.02 0.12 0.38 0.50 0.70 0.92 1.10 1.38
1.54 1.67
Absorbance (A.sub.610) on the y-axis versus glucose concentration
(G, mg/mL) on the x-axis is plotted to generate a standard
calibration plot such as that shown in the FIG. 11.
[0428] The dependence of A.sub.610 from G is described by the
equation y=ax+b. The concentration of reducing sugars (C) in an
assayed sample can be calculated from a calibration plot using the
equation C=(.DELTA.A.sub.610-b)/a, where b=-0.2239 and 1/a=0.204.
In other words, C=(.DELTA.A.sub.610+0.223.9)*0.104 (in mg/mL).
Calculation of Activity
[0429] Activity is calculated as follows: Avicelase
(U/mL)=(.DELTA.A.sub.610-b)*(1/a)*0.463*DR, where
.DELTA.A.sub.610=A.sub.ES (enzyme sample)-A.sub.E (enzyme
blank)-A.sub.S (substrate blank), 1/a is 0.204, b is -0.2239, the
coefficients from the glucose calibration plot (see above), DR is
the enzyme dilution rate (dilution rate of enzyme sample before
addition into the reaction mixture), and 0.463 is the coefficient
obtained as 5/(0.180*60), where 5 is the additional dilution of
enzyme sample in the reaction mixture (0.1 mL of enzyme+0.4 mL of
reaction mixture). 0.180 mg/mkmol is the transition coefficient for
glucose from mg to mkmol (molecular weight of glucose is 180) and
60 is the reaction time (60 minutes). The enzyme sample must be
diluted so that .DELTA.A.sub.610 should be between 0.5 and 1.5
units of optical density.
[0430] An example calculation is supplied below. The sample the
dilution rate of 500, A.sub.ES is 1.436, A.sub.E is 0.102, A.sub.S
is 0.070, so .DELTA.A.sub.610=1.436-0.102-0.070=1.264. The
Avicelase activity is: (1.264+0.2239)*0.104*0.463*500=35.8
U/mL.
Example 6
[0431] The following example illustrates glucose determination by a
glucosooxidase-peroxidase assay.
[0432] As a result of .beta.-D-Glucose oxidation by glucose oxidase
(GOD), hydrogen peroxide H.sub.2O.sub.2 and glucono-lactone are
formed. Then, H.sub.2O.sub.2 is used by peroxidase (POD) as an
oxidizing agent for 4-aminoantipyrine, oxidizing in the presence of
phenolic substances. As a result, dyed products accumulate in the
reaction mixture and can be detected at 490 nm
spectrophotometrically. The amount of dyed products corresponds to
glucose concentration as shown in the reaction scheme below:
##STR00001##
[0433] The assay is carried out using a Fotoglucose kit (Impact
Ltd., Russia) with the general conditions: 40.degree. C., pH 7.0,
duration--15 min, A.sub.490, light path 1 cm.
Reagents
[0434] Reagent 1 (R1) is prepared as follows. Two
"Buffer-Substrate" tablets (0.1 mmol/L potassium or ammonium salts
of phosphoric acid, 50 mmol/l 4-aminoantipyrine, 0.75 mmom/L
8-oxyquinolyn) are dissolved in 200 mL of distilled water, filtered
through a cellulose filter, and the pH adjusted to 7.0 if
necessary. The solution can be stored in the dark at 4.degree. C.
and should be made fresh weekly.
[0435] Reagent 2 (R2) is prepared as follows. One "Enzyme" tablet
(2500 units of glucose oxidase and 500 units of peroxidase) is
dissolved in 5 mL of distilled water. The solution can be stored in
the dark at 4.degree. C. and should be made fresh weekly.
[0436] A glucose solution of 10 mmol/L glucose in 0.15% benzoic
acid is used as the calibration standard. The solution is diluted
with distilled water to obtain glucose solutions of 0 to 5 g/L.
[0437] Using the above reagents, the assay is performed as detailed
below.
[0438] R1 and R2 are mixed at a ratio of 40:1 (this solution may be
stored in the dark at 4.degree. C. and should be made fresh weekly)
and 1 mL of the mixture is preheated to 40.degree. C. for 5-10
seconds. 0.1 mL of the sample (or glucose standard) is then added
and the mixture is incubated at 40.degree. C. for 15 minutes. The
mixtures are then placed at room temperature and the A.sub.490
determined on spectrophotometer. Prolongation of the incubation
time to 20 minutes has little or no effect on the results, while
decreasing it to less than 15 minutes can result in a lower optical
density. Thus, 15-20 min of incubation at 40.degree. C. is
preferred for glucose determination.
Calculation of Glucose Concentration
[0439] As shown in Table 21, calibration with standards containing
varying concentrations of glucose reveals an optical density
A.sub.490 in the range from 0.06 to 1.1.5 corresponds to glucose
concentrations from 0.025 g/L to 0.5 g/L. These data are plotted in
FIG. 12, which demonstrates a linear proportion and high
reproducibility at these conditions.
[0440] Using the calibration results, the following formula is
obtained for glucose determination in samples from an A.sub.490
reading:
GLUCOSE, g/L=(A.sub.490sample-A.sub.490blank)-0.0058)/2.3029
TABLE-US-00031 TABLE 21 Calibration of the reagents using standard
glucose solutions (two experiments) Glucose, g/L A.sub.490
A.sub.490 0 0.05 0.05 0.0025 0.05 0.05 0.005 0.06 0.06 0.01 0.07
0.07 0.025 0.11 0.11 0.05 0.17 0.17 0.1 0.29 0.29 0.25 0.65 0.64
0.5 1.20 1.20 1 1.71 1.72 2.5 1.93 1.94 5 2.06 2.06
Example 7
[0441] The following example illustrates the Somogyi-Nelson assay
used to measure xylanase activity.
[0442] This assay measures the release of reducing sugars (as
glucose equivalents) by the action of xylanase on a soluble xylan.
One xylanase unit of activity is the amount of enzyme that
liberates 1 micromole of reducing sugars, expressed as glucose
equivalents, in one minute at 50.degree. C. and pH 5.0.
Reagents
[0443] Acetate buffer (0.05 M, pH 5.0) is prepared as follows. 4.1
g of anhydrous sodium acetate or 6.8 g of sodium acetate *3H.sub.2O
is dissolved in distilled water so that the final volume of the
solution is 1000 mL (Solution A). In a separate flask, 3.0 g (2.86
mL) of glacial acetic acid is mixed with distilled water to make a
total volume of 1000 mL (Solution B). The final 0.05 M acetate
buffer, pH 5.0, is prepared by mixing Solution A with Solution B
until the pH of the resulting solution is equal to 5.0.
[0444] Xylan from birchwood (Sigma, X 0502) is used as the
substrate. A 1% w/v stock solution is prepared by dissolving 500 mg
of xylan in 50 mL of distilled water with agitation on a magnetic
stirrer for 1 hour. The solution is stable for 2 days at 4.degree.
C.
[0445] The Somogyi reagent is prepared as follows. 24 g of
anhydrous sodium carbonate and 12 g of sodium-potassium tartrate
tetrahydrate are dissolved in 250 mL of distilled water. A solution
of copper sulfate pentahydrate (4 g in 40 mL of distilled water) is
added with stirring, and then 16 g of sodium bicarbonate is
dissolved to obtain Solution A. In a separate flask, 180 g of
sodium sulfate is dissolved in 500 mL of hot (.about.80.degree. C.)
distilled water and boiled for 5 min (Solution B). Solution A is
mixed with Solution B and the final volume is adjusted to 1000 mL.
The reagent is stable for 2-3 months on storing in a dark glass
vessel.
[0446] The Nelson reagent is prepared as follows. 50 g of ammonium
molybdate is dissolved in 900 mL of hot (.about.60.degree. C.)
distilled water. The solution is cooled to 5-1.0.degree. C. and 42
g of concentrated sulfuric acid containing 6 g of sodium arsenate
are carefully added with stirring. The volume of the mixture is
adjusted to 1000 mL. The flask is incubated at 40.degree. C. for 48
hours and then the solution is filtered if necessary. The reagent
is stable for 2-3 months.
[0447] Using the above reagents, the assay is performed as detailed
below.
Reagent Blank
[0448] 0.5 mL of 0.05 M sodium acetate buffer is mixed in a
graduated glass test tube (1.5 cm.times.15 cm) with 0.5 mL of
Somogyi reagent and incubated in a boiling water bath for 40
minutes. Then, the test tube is cooled in an ice or cold water bath
and 0.5 mL of Nelson reagent is added and mixed by shaking
manually. After 10 minutes incubation at room temperature, the
solution volume is adjusted to 5 mL with distilled water.
Enzyme Sample
[0449] 0.25 mL of 1% w/v xylan stock solution is mixed with 0.15 mL
of 0.05 M acetate buffer, pH. 5.0, in graduated glass test tubes
(1.5 cm.times.15 cm) and preheated at 50.degree. C. for 5 minutes.
Enzyme sample is suitably diluted by the 0.05 M sodium acetate
buffer (the dilution of the enzyme sample is chosen so that
A.sub.610 should be between 0.95 and 1.05 units of optical
density). 0.1 mL of suitably diluted enzyme sample is added to 0.4
mL of the preheated substrate solution, mixed and incubated at
50.degree. C. for 10 minutes. After exactly 10 minutes of
incubation, 0.5 mL of Somogyi reagent is added, and test tubes are
placed into a boiling water bath for 40 minutes. Then, the test
tubes are cooled in an ice or cold water bath and 0.5 mL of Nelson
reagent is added and mixed by shaking manually. After 10 minutes
incubation at room temperature, the solution volume is adjusted to
5 mL with distilled water. Then, the absorbance at 610 nm
(A.sub.610) is measured against the reagent blank as A.sub.S
(enzyme sample).
Enzyme Blank
[0450] 0.4 mL of 0.05 M sodium acetate buffer is mixed in a
graduated glass test tube (1.5 cm.times.15 cm) with 0.1 mL of
enzyme sample having the same dilution as above. 0.5 mL of Somogyi
reagent is added and the test tube is incubated in a boiling water
bath for 40 minutes. Then, the test tube is cooled in an ice or
cold water bath and 0.5 mL of Nelson reagent is added and mixed by
shaking manually. After 10 minutes incubation at room temperature,
the solution volume is adjusted to 5 mL with distilled water. Then,
the absorbance at 61.0 nm (A.sub.610) is measured against the
reagent blank as A.sub.EB (enzyme blank).
Substrate Blank
[0451] 0.25 mL of 1% w/v xylan stock solution is mixed with 0.25 mL
of 0.05 M acetate buffer, pH 5.0, in a graduated glass test tube
(1.5 cm.times.15 cm), 0.5 mL of Somogyi reagent is added, and the
test tube is placed into a boiling water bath for 40 minutes. Then,
the test tube is cooled in an ice or cold water bath and 0.5 mL of
Nelson reagent is added and mixed by shaking manually. After 10
minutes incubation at room temperature, the solution volume is
adjusted to 5 mL with distilled water. Then, the absorbance at 610
nm (A.sub.610) is measured against the reagent blank as A.sub.SB
(substrate blank). The value of A.sub.SB should not exceed 0.05
units of optical density.
[0452] All enzyme samples and enzyme and substrate blanks should be
boiled together.
Standard Calibration Plot
[0453] Stock glucose solution (1 mg/mL) is prepared by dissolving
100 mg of D-glucose in 100 mL of 0.05 M acetate buffer, pH 5.0.
Then, different dilutions of stock glucose solution are prepared as
follows:
TABLE-US-00032 0.05M acetate Stock solution (mL) buffer (mL)
Glucose concentration (mg/mL) 0.05 4.95 0.01 0.10 4.90 0.02 0.15
4.85 0.03 0.20 4.80 0.04 0.25 4.75 0.05 0.30 4.70 0.06 0.35 4.65
0.07 0.40 4.60 0.08 0.45 4.55 0.09 0.50 4.50 0.10
Samples (0.5 mL) of glucose solution with different dilutions are
placed in graduated glass test tubes (1.5 cm.times.15 cm). 0.5 mL
of Somogyi reagent is added and test tubes are placed into a
boiling water bath for 40 minutes. Then, the test tubes are cooled
in an ice or cold water bath and 0.5 mL of Nelson reagent is added
and mixed by shaking manually. After 10 minutes incubation at room
temperature, the solution volume is adjusted to 5 mL with distilled
water. Then, the absorbance at 610 nm (A.sub.610) is measured
against the reagent blank. The following typical A.sub.610 data
obtained with different glucose dilutions are given below:
TABLE-US-00033 Glucose, mg/mL 0.01 0.02 0.03 0.04 0.05 0.06 0.07
0.08 0.09 0.10 A.sub.610 0.11 0.20 0.28 0.41 0.48 0.60 0.72 0.79
0.93 1.06
Absorbance (A.sub.610) on the y-axis versus glucose concentration
(G, mg/mL) on the x-axis is plotted to demonstrate a standard
calibration plot such as that shown in FIG. 11. From FIG. 11, the
inverse of the slop (1/a=0.098) is calculated.
Calculation of Activity
[0454] Activity is calculated as follows: activity
(IU/mL)=1/a*.DELTA.A.sub.610*DF*2.78, where 1/a is the inverse of
the slope calculated from the standard calibration plot,
.DELTA.A.sub.610 is A.sub.S (enzyme sample)-A.sub.EB (enzyme
blank)-A.sub.SB (substrate blank), DF is the enzyme dilution
factor, and 2.78 is the coefficient that accounts for additional
5-fold enzyme dilution in the reaction mixture (0.1 mL enzyme+0.4
mL substrate), reaction time (10 minutes) and molecular weight of
glucose (0.18016 mg/micromole), i.e. 5/(10*0.18016)=2.78
[0455] An example calculation is supplied below.
TABLE-US-00034 B4 UF concentrate DF .DELTA.A.sub.610 5,000 1.61
10,000 1.04 20,000 0.64 Xylanase activity (IU/mL) = 0.098 * 1.04 *
10000 * 2.78 = 2833
Example 8
[0456] The following example illustrates the assay used to measure
the stability of enzymes of the present invention to
thermoshock.
[0457] The following assay determines the stability of enzymes
after short-term treatment at elevated temperature (e.g.
temperature of pelletization of animal feed). A solution of enzyme
is incubated for approximately two minutes in a glass tube in a
thermostat at 80.degree. C. The gradual drop of enzyme activity is
recorded over time and the resultant curve is used to determine how
long the enzyme is stable (results usually range from 10-60
seconds).
[0458] The use of glass tubes that fit tightly in a regular
thermostat or eppendorf-type tubes help ensure proper thermal
conditions and fast heating of the tube content. The enzyme
dilution used is selected so that after additional dilution in
reaction mixture the optical density is as high as possible
(depends on particular method used for activity determination). For
example, when determining xylanase activity by the Somogyi-Nelson
assay disclosed in Example 7, the enzyme selected should give an
optical density A.sub.640 1.0-1.1 after being diluted 5 times in
reaction mixture.
[0459] For faster equilibration at 80.degree. C., 0.4-0.5 ml water
may be placed in the well of thermostat, prior to fitting in a
glass tube. The temperature is set at 80.3-80.4.degree. C. to
compensate for cooling. 360 .cndot.l of desired buffer is placed in
the tube and heated for 2-3 minutes until it reaches 80.degree. C.
It is advisable to check the temperature in the tube (using a
thermometer) to be sure the temperature has been reached. 40
.cndot.l of enzyme solution is then added and the tube is mixed
rapidly. After 20, 40, 60, 90 and 120 seconds, aliquots are removed
from the tube and immediately transferred in the reaction mixture
for determination of activity (assay components are prepared
beforehand). To determine zero-point activity (without heating),
the same solution (360 .cndot.l buffer+40 .cndot.l enzyme) is
prepared in the separate tube at room temperature. Determined
activities at every time point are then plotted as relative
activity (in %) vs. time, with zero time point (without heating)
representing 100% activity. FIG. 13 demonstrates these data for
stability of xylanase at 20-, 10- and 5-fold dilutions.
[0460] In this protocol, a 10-fold dilution of enzyme is used (9:1
preheated buffer solution to cold enzyme solution). In principle
one can use higher dilution rates (e.g., 1:20), but typically not
lower dilution factors (e.g., 1:5) because mixing of larger volumes
of cold enzyme solution can produce inaccurate results. The plot
below illustrates this principle. The stability of xylanase was
tested using 20-, 10- and 5-fold dilutions, and the results shown
in FIG. 13 demonstrate that dilutions of 10- and 20-fold (and
likely higher) gave consistent results. In contrast, use of a
5-fold dilution generates overestimated values.
[0461] The percentages of enzymatic activity of xylanases of the
present invention after 20 seconds of incubation at 80.degree. C.
(also known as the pelletizing stabilities) are shown below in
Table 22 and in the graph depicted in FIG. 14.
TABLE-US-00035 TABLE 22 Pelletizing Stability of C1 Xylanses Enzyme
% Activity Xyl 1 19 Xyl 1 (CD) 12 Xyl 2 98 Xyl 3 75 Xyl 3 (CD) 82
Xyl 4 23 Xyl 5 60 Xyl 6 68
Example 9
[0462] The following example illustrates the in vitro,
semi-quantitative assay used to measure the feed activity of
enzymes of the present invention.
[0463] The assay was developed to simulate the enzymatic treatment
of feed. The assay detects the dynamics of liquid and sludge
fraction formation from a uniform slurry of milled rye bran by the
action of the enzyme samples in narrow test tubes.
[0464] A mixture of enzyme solution and uniform rye bran suspension
in acetate buffer (pH 5.0) is placed into a long glass tube (e.g.,
2 ml calibrated glass pipettes) and stored at room temperature (or
at 40.degree. C.). The action of enzymes on non-starch rye bran
generates polysaccharides (e.g. arabinoxyan, .beta.-glucan and
others) causing the original uniform rye bran slurry to separate
into a liquid phase (containing the polysaccharides) on the top and
an insoluble sludge on the bottom of. The volume of liquid phase
formed on the top of tubes over the reaction is measured every
10-15 min during 1.5-3 hours (the total assay time depends on
enzymatic activity). The data obtained are plotted as "liquid
formed" (ml) versus time (minutes) and the slope of the line
calculated. The higher the slope of the line on the obtained graph,
the higher the in vitro feed activity of the tested enzyme
sample.
Substrate
[0465] Rye bran is used as a substrate in this example. However,
wheat and barley bran, rye, wheat, barley, and soy meal may also be
used as assay substrates. Partially de-starched rye bran samples
are carefully milled and passed through the fine sieve. The fine
fraction is taken as the substrate. Substrate is accurately weighed
and placed into a dry 20-ml screw cap vial. The sample of substrate
is suspended in acetate buffer to get 30% w/v suspension. The
substrate suspension is mixed vigorously with a magnetic stirrer
for 30-45 minutes at room temperature. The resulting suspension
obtained can be stored at 4.degree. C. in a sealed vial for about 2
days.
Assay Procedure
[0466] The reaction mixture contains the following components:
[0467] 1500 .mu.l of substrate suspension (300 mg/ml) [0468] 750
.mu.l of enzyme solution
[0469] The blank solution contains the same amount of substrate,
but the enzyme solution is substituted with the same volume of
acetate buffer.
[0470] The reaction mixture is subjected to vigorous agitation and
placed in appropriate calibrated glass tubes (e.g., 2 ml glass
pipettes). Glass tubes or pipettes should be filled with the
reaction mixture immediately and placed vertically. The top end of
the tube is left open while the bottom end is sealed with a piece
of Parafilm or by dipping in melted paraffin.
[0471] The concentration of enzyme samples can be measured either
in mg/ml of protein or in xylanase units/ml in the reaction mixture
(or any other enzyme activities, e.g., .beta.-glucanase,
.alpha.-galactosidase, protease, etc.). Protein content or xylanase
(or other activities) of the enzyme samples are measured separately
prior to the experiments. In the experiments, enzyme concentration
was equalized to 1 mg/ml of protein in the reaction mixture, or in
xylanase activity (2 to 3 U/ml).
Materials and Instruments
[0472] Thermostat for 2 ml Eppendorf type test tubes, set at
40.degree. C. [0473] Set of calibrated glass tubes (e.g., 2-ml
glass pipettes) [0474] UV-Vis spectrophotometer [0475] Reagents to
perform Nelson-Somogyi assay [0476] Analytical electronic balances
[0477] Adjustable volume automated pipettes 40-200 .mu.l and
200-1000 .mu.l [0478] Rye bran [0479] 0.05 M acetate buffer, pH
5.0
Data Analysis
[0480] Data obtained is presented on a graph as liquid formed (ml)
versus time (minutes). The higher the slope of the line on the
obtained graph, the higher the in vitro feed activity of the tested
enzyme sample.
Example 10
[0481] The following example illustrates the assay used to measure
the ability of enzymes to reduce the viscosity of feed
materials.
[0482] Enzymatic treatment of barley, rye or wheat can increase the
nutritive value by hydrolysis of viscous polysaccharides
(beta-glucan, arabino xylan) within cereals. Table 23 below
illustrates typical viscous polysaccharide profiles of sample
feedstuffs compared with the viscosity of water/meal mixtures.
TABLE-US-00036 TABLE 23 Viscous Polysachharide Profiles of
Feedstuffs Beta-Glucans, Relative Viscosity Arabino Xylans g/kg of
g/kg of dry of water/meal dry matter matter mixes, U RYE 14.4 7.6
9.71 BARLEY 3.3 24.3 4.21 WHEAT 3.2 5.6 2.25
[0483] As shown in the table, the high viscosity of a water/rye
meal mixture is predominantly due to the presence of arabinoxylans.
Similarly, the viscosity of a water/barley mixture is predominantly
due to the presence of beta-glucans, while the viscosity of
water/wheat mixture close to water. However, cereals such as
barley, rye and wheat often contain specific inhibitors of the
hydrolytic enzymes. Thus, preferred enzymatic preparations that
effectively decrease viscosity should be resistant to cereal
inhibitors and be enzymatically active at low dosage.
Materials and Instruments
[0484] 100 ml glass flasks with caps [0485] Ostwald capillary
viscometer with thermostat at 40.degree. C. [0486] Shaker for meal
incubation at 40.degree. C. [0487] Whole rye, barley, wheat grains
or hulls or their mixtures [0488] Rotary mill and 0.5 mm sieve to
prepare whole meal [0489] Adjustable volume automated pipettes
40-200 and 200-1000 ul [0490] Distilled water or 0.1M sodium
acetate buffer, pH 5.0
Sample Preparation
[0491] To prepare the crude barley, wheat or rye meals, whole
grains are milled on a rotary mill and passed through a 0.5 mm
sieve. 6 g of crude barley or rye meal or 12 g of wheat meal is
added to a 100 ml glass flask. 30 ml of distilled water or buffer
(blank flask) or 30 ml of the enzyme water solution (experimental
flasks) is then added and the flasks are covered. The concentration
of 6 g of meal per 30 ml of water is 200 g per liter, while 12 g
per 30 ml is 400 g/l.
The flasks are shaken well by hand then placed on a shaker at
40.degree. C., 250 rpm.
[0492] After a 1-hour incubation, the mixtures are centrifuged for
5 min at 5000 rpm to separate the insoluble precipitate of the
crude meal. The supernatant is then filtered through a cotton pad.
5 ml of the filtered supernatant is placed into an Ostwald
viscometer at 40.degree. C. and incubated for at least 3 minutes
before the viscosity of the sample is determined. The measurement
of the efflux time of the samples from the viscometer bulb should
be done later than 80 minutes after starting the incubation.
[0493] A dosage of 0.15 kg/t of the meal approximates the dosage
used in real practice. For liquid preparations, a dosage of 0.15
kg/t equals 90 .mu.l of the preparation in a 100-fold dilution per
flask (30 ml of water/meal mix). For dry preparations, a dosage of
0.15 kg/t equals 90 .mu.l of the 10 g/l solution per flask (30 ml
of water/meal mix).
[0494] FIG. 15 shows the relative viscosity of water/rye mixes
treated with the indicated enzyme(s). To compare the efficiency of
the preparations, the xylanase activities of the samples (units per
flask) are also indicated.
[0495] The efficiency of the preparations may also be compared by
plotting the residual viscosity (%) of the water/meal mix after
enzymatic treatment, as shown in FIG. 16.
[0496] Residual viscosity is calculated as follows:
ResV=(Rvi-Rvwater)/(Rvnative-Rvwater).times.100%
The final formula is:
ResV=(Rvi-1)/(Rvnative-1).times.100%
Where ResV is residual viscosity, Rvi is the relative viscosity of
the sample with enzyme, Rvnative is the relative viscosity of the
native water/meal mix without enzyme, and Rvwater is 1.
[0497] Table 24 shows of the residual viscosity (%) of rye meal
water extract after a 1-hour incubation with 1 unit of the
xylanases of the present invention. These data are presented
graphically in FIG. 17.
TABLE-US-00037 TABLE 24 Residual Feed Viscosities of C1 Xylanses
Enzyme % Residual Viscosity Xyl 1 59 Xyl 1 (CD) 59 Xyl 2 27 Xyl 3
25 Xyl 3 (CD) 76 Xyl 4 77 Xyl 5 94 Xyl 6 91
[0498] Table 25 shows of the residual viscosity (%) of barley meal
after a 1-hour incubation with 1 unit of the endoglucanases of the
present invention.
TABLE-US-00038 TABLE 25 Residual Feed Viscosities of C1
Endoglucanases Enzyme % Residual Viscosity EG I 18 EG II 13 EG II
(CD) 14 EG III 10 EG V 18 EG VI 20
Example 11
[0499] The following example illustrates an assay used to measure
the biobleaching ability of enzymes of the present invention.
[0500] Ability of different crude enzyme preparations or purified
enzymes to (bio)bleach cellulosic pulp may be evaluated by the
assay below. The assay monitors the release of chromogenic phenolic
substances from raw cellulosic pulp by the action of enzymes. The
phenolic substances are detected by the absorbance at 237 nm
(A.sub.237) or 235 nm (A.sub.235).
Mini Biobleaching Assay
[0501] Raw (wet) eucalyptus cellulosic pulp with natural pH 7.7
(National Paper Fabric, Uruguay) is placed into 5 ml reaction
tubes, and the tubes are placed into a water bath shaker,
(50.degree. C., 200 rpm). The reaction volume consists 2 ml of
enzyme sample appropriately diluted in appropriate buffer solution
and 2 ml of cellulosic pulp suspension (0.04 g of cellulosic pulp
by dry weight). The concentration of cellulosic pulp is 10 mg/ml.
The biobleaching process is started by the addition of 2 ml of
enzyme solution to the pulp and the assay is conducted for 2 hours.
After that reaction, the mixture is centrifuged and A.sub.237 of
the supernatant is monitored in a spectrophotometer. Since some
phenolic compounds are washed out from the raw cellulosic pulp even
in the absence of enzymes, the blank experiments with buffer
solution only (with no enzymes) are carried out. Some input to
A.sub.237 value could be made by chromogenic substances, existing
in the raw enzyme sample, so another blank experiments were carried
out to estimate this input. All results take into account both of
those blanks as .DELTA.A.sub.237. The higher the .DELTA.A.sub.237
value, the better was biobleaching performance.
[0502] The assay may also be performed using soft or hard wood pulp
at pH 7.5 and 50.degree. C. for 2 hours, with the absorbance
determined at 235 nm. Table 26 and FIG. 18 show the biobleaching
ability (A.sub.235) of xylanases of the present invention on soft
wood pulp under these conditions, while Table 27 and FIG. 19 show
the same biobleaching ability on hard wood pulp. For each xylanase,
the assay was conducted using 0.01, 0.003, 0.001, 0.0003 mg/ml of
enzyme in the reaction mixture, as indicated. A positive control
xylanase preparation (Luminase, Verenium Corp., San Diego, Calif.
USA) was also used for comparison.
TABLE-US-00039 TABLE 26 Biobleaching Ability of C1 Xylanases on
Soft Wood Pulp (A.sub.235) Enzyme Xyl 1 Xyl 3 (mg/ml) (CD) Xyl 1
Xyl 3 (CD) Xyl 4 Xyl 2 Xyl 6 Luminase 0.01 3 1.37 2.28 2.2 1.73
3.66 2.62 3 0.003 2.65 1.015 2.7 1.43 1.1 2.8 2.03 2.445 0.001
2.385 1.07 0.85 1.05 0.455 2.22 1.7 2.14 0.0003 1.45 0.615 0.575
0.31 0.2 1.615 1.4 1
TABLE-US-00040 TABLE 27 Biobleaching Ability of C1 Xylanases on
Hard Wood Pulp (A.sub.235) Enzyme Xyl 1 Xyl 3 (mg/ml) (CD) Xyl 1
Xyl 3 (CD) Xyl 4 Xyl 2 Xyl 6 Luminase 0.01 8.5 2.355 4.2 3.715 1.9
5 6.5 7.585 0.003 6.25 1.82 2.24 2.27 0.82 4.45 4.5 5.6 0.001 5.4
1.37 1.05 1.5 0.5 2.355 3 4.8 0.0003 3.5 0.4 0.61 0.675 0.3 1.5
1.26 2.785
Example 12
[0503] The following example illustrates the Saccharification of
pretreated lignocellulosic feedstock using crude enzyme
preparations.
[0504] The assay involves a protocol for enzymatic saccharification
of pretreated lignocellulosic biomass with a known moisture content
and composition in order to determine the extent of cellulose and
hemicellulose digestibility under concrete conditions of
saccharification of the feedstock with crude cellulase and
hemicellulase enzyme preparations.
Equipment
[0505] INNOVA 40 Thermo Shaker (New Jersey, USA), where agitation
of a slurry in plastic vessels is carried out using 250 rpm
vibrations (each vessel supplied by one 0.5 cm metal ball to
improve mixing of slurry) [0506] Micro centrifuge [0507] pH meter
[0508] Vortex mixer [0509] Analytical balance, sensitive to 0.1 mg
[0510] UV/Vis spectrophotometer [0511] HPLC with refractive index
detector and Silica column with bonded amino phase (4.times.250 mm)
and guard column [0512] Pipettes with variable sample volume
(5-5000 .mu.L)
Reagents and Materials
[0512] [0513] Lignocellulosic feedstock with known moisture content
and composition (cellulose, hemicellulose, lignin, ash,
extractives) [0514] Sodium acetate buffer (0.1 M, pH 5.0)
containing 1 mM of sodium azide to prevent microflora growth [0515]
Enzyme cellulase and hemicellulose preparations from different
fungal strains, either in soluble or in dry form [0516] Reagents
for protein determination (either Lowry or Pierce BCA kit) [0517]
Water, distilled and/or deionized [0518] Ampiox, sodium-salt.
[0519] "Fotoglucose" kit (reagents for glucose determination using
glucose oxidase-peroxidase assay) from impact Ltd. Russia [0520]
Somogyi and Nelson reagents for reducing sugars' assay [0521]
Acetonitrile, HPLC grade [0522] NaOH, acetic acid for pH
adjustments
Assay Procedure
[0523] The procedure described below assumes that the hydrolysis is
carried out at 5% solids. If the process should be carried out at
different substrate concentration, then recalculations should be
done for the initial biomass sample weight (on a thy weight basis)
using the known moisture content in the sample.
[0524] A biomass sample containing 1 g of solids (on a thy weight
basis) is weighed and placed into a 50 ml plastic vessel equipped
with a hermetic lid and one metal ball (0.5 cm in diameter) to
improve mixing (the moisture content is determined in preliminary
experiments by drying the biomass samples at 105.degree. C. until a
constant weight is reached). The weight (volume) of water in the
sample is noted as W.sub.s.
[0525] The volume(s) of enzyme(s) necessary to add to the reaction
mixture is calculated taking into account the protein concentration
(enzyme activity) in a stock solution and the final protein
concentration (enzyme activity) in the reaction mixture. Using
these data, the volume of buffer (W.sub.a) that is necessary for
adjusting the total volume of the reaction system to 20 ml is
calculated, taking into account water contained in the biomass
sample (W.sub.s).
[0526] The calculated amount (W.sub.a) of 0.1 M, pH 5.0 Na-acetate
buffer containing 200 mg/l (4 mM) of sodium azide is added to the
biomass slurry, and the slurry is thoroughly mix in the 50 ml
vessel using a Vortex. The pH is measure and adjusted pH to 5.0
using concentrated acetic acid or NaOH, if necessary. The lid is
closed and the vessels incubated in an INNOVA 40 Thermo Shaker for
15 minutes to preheat the sample to 50.degree. C.
[0527] The calculated amount(s) of enzyme stock solution(s) is
added to the reaction mixture to start the enzymatic reaction. 20
.mu.l of Ampiox solution (100 g/l) is then added to the final
reaction mixture, which is allowed to proceed for 72 hours.
(50.degree. C. and 250 rpm vibrations).
[0528] Aliquots of the reaction slurry (50-100 .mu.L) are typically
taken at 6, 24, 48, 72 or 96 hour time points, using a pipette with
a cut tip. The aliquots are centrifuged at 15,000 rpm in a micro
centrifuge for 3 minutes, and the supernatants are analyzed for
glucose by the glucose oxidase-peroxidase assay (see Example 6) and
for reducing sugars by the Nelson-Somogyi assay (see, for example,
Example 1). Samples (supernatants) are typically analyzed
immediately. In some cases, they are subjected to boiling for 5
minutes, then frozen and analyzed later. The final time point (72
hours) is also analyzed for sugars by HPLC.
[0529] A substrate blank (control) is prepared in the same way as
described above, except that distilled water is added to the vessel
instead of the enzyme(s) stock solution(s). The supernatant
resulting from the substrate blank is analyzed for glucose by the
glucose oxidase-peroxidase assay, reducing sugars by the
Nelson-Somogyi assay and also for sugars by HPLC.
[0530] Reactions are typically carried out in duplicate. The
results of the analysis for sugars are determined as average values
together with standard deviations. Typically, the protein content
in the reaction system (mg protein/g solids) is used as a measure
of enzyme loading. In the experiments with crude enzyme
preparations, the enzyme loading is 5, 2.5 and 1 mg protein/g
solids. In enzymatic reactions may be carried out in the presence
of .beta.-glucosidase preparations (40 Units of
p-NPh-.beta.-glucosidase activity per 1 g solids) in order to
convert all soluble oligosaccharides to glucose.
Calculations
[0531] To calculate the percent digestibility of cellulose
(glucan), the glucose concentration in the final (72 or 96 hour)
supernatant of the reaction mixture is determined (data from the
glucose oxidase-peroxidase assay and/or HPLC assay), subtract the
glucose concentration from the substrate blank (control). The
initial cellulose (glucan) content in the reaction system is also
calculated. For example, if the cellulose content in the pretreated
feedstock is 35% of the total solids (dry matter) and the
hydrolysis is carried out at 100 g solids/L, then the initial
cellulose concentration would be 35 mg/mL.
[0532] The percent digestion of cellulose is calculated as
follows:
Digestion ( % ) = Glucose concentration ( mg / mL ) .times. 100 %
Initial cellulose concentration ( mg / mL ) .times. 1.11
##EQU00002##
The factor 1.11 is a result of hydration of anhydroglucose residues
in cellulose (162 Da per one residue) that yields a glucose
molecule (180 Da).
[0533] To calculate the percent digestibility of hemicellulose
(xylan), the total concentration of pentoses (xylose+arabinose) in
the final (72 or 96 hour) supernatant of the reaction mixture is
determined (data from HPLC assay), subtracting the pentose
(xylose+arabinose) concentration from the substrate blank
(control). The initial hemicellulose (xylan) content in the
reaction system is also calculated. For example, if hemicellulose
content in the pretreated feedstock is 30% of the total solids (dry
matter) and the hydrolysis is carried out at 100 g solids/1, then
the initial hemicellulose concentration would be 30 mg/ml.
[0534] The percent digestion of hemicellulose is calculated as
follows:
Digestion ( % ) = Total pentose concentration ( mg / mL ) .times.
100 % Initial hemicellulose concentration ( mg / mL ) .times. 1.136
##EQU00003##
The factor 1.136 is a result of hydration of anhydropentose (xylose
or arabinose) residue in hemicellulose (132 Da per one residue)
that yields a pentose molecule (150 Da).
REFERENCES
[0535] 1. Berezin I V, Rabinovich M L, Sinitsyn A P (1977) Study of
applicability of quantitative kinetic spectrophotometric method for
glucose determination. Biokhimiya (Moscow) 42:1631-1636. [0536] 2.
Ghose T K (1987) Measurement of cellulase activities. Pure Appl.
Chem. 59:257-268. [0537] 3. Gusakov A V, Salanovich T N, Antonov A
I, Ustinov B. B. Okunev O N, Burlingame R, Emalfarb M, Baez M,
Sinitsyn A P (2007) Design of highly efficient cellulase mixtures
for enzymatic hydrolysis of cellulose. Biotechnol. Bioeng. On-line:
http://www3.interscience.wiley.com/cgi-bin/abstract/114064699/,
DOI: 10.1002/bit.21329 [0538] 4. Lowry O H, Rosebrough N J, Farr A
L, Randall R J (1951) Protein measurement with the Folin phenol
reagent. J. Biol. Chem. 193:265-275. [0539] 5. Sinitsyn A P,
Chemoglazov V M, Gusakov A V (1990) Methods of investigation and
properties of cellulolytic enzymes (in Russian), Biotechnology
Series, v. 25. Moscow: VINITI Press. 220 p. [0540] 6. Somogyi M
(1952) Notes on sugar determination. J. Biol. Chem. 195:19-23.
Example 13
[0541] The following example illustrates the saccharification of
pretreated lignocellulosic feedstock with individual purified
enzymes and enzyme mixtures.
[0542] The assay involves a protocol for the enzymatic
saccharification of pretreated lignocellulosic biomass with known
moisture content and composition in order to determine the extent
of cellulose and hemicellulose digestibility under concrete
conditions of saccharification of the feedstock with individual
purified enzymes (monocomponents) and their compositions (mixes of
monocomponents).
Equipment
[0543] TS-100 Thermo Shaker (Riga, Latvia), where agitation of a
slurry in plastic tubes is carried out using 1200 rpm vibrations
[0544] Micro centrifuge [0545] pH meter [0546] Vortex mixer [0547]
Analytical balance, sensitive to 0.1 mg [0548] UV/Vis
spectrophotometer [0549] HPLC with refractive index detector and
Silica column with bonded amino phase (4.times.250 mm) and guard
column [0550] Pipettes with variable sample volume (5-1000
.mu.L)
Reagents and Materials
[0550] [0551] Lignocellulosic feedstock with known moisture content
and composition (cellulose, hemicellulose, lignin, ash,
extractives) [0552] Sodium acetate buffer (0.1 M, pH 5.0)
containing 1 mM of sodium azide to prevent microflora growth [0553]
Enzyme monocomponents (cellulases, hemicellulases, glycosidases)
isolated from different fungal strains, either in solution or
freeze-dried [0554] Reagents for protein determination (either
Lowry or Pierce BCA kit) [0555] Water, distilled and/or deionized
[0556] "Fotoglucose" kit (reagents for glucose determination using
glucose oxidase-peroxidase assay) from Impact Ltd., Russia [0557]
Somogyi and Nelson reagents for reducing sugars' assay [0558]
Acetonitrile, HPLC grade [0559] NaOH, acetic acid for pH
adjustments.
Assay Procedure
[0560] The procedure described below assumes that the hydrolysis is
carried out at 10% solids. If the process should be carried out at
different substrate concentration, then recalculations should be
done for the initial biomass sample weight (on a dry weight basis)
using the known moisture content in the sample.
[0561] A biomass sample containing 200 mg of solids (on a dry
weight basis) is weighed and placed into a 2 ml plastic tube
equipped with a hermetic lid (the moisture content is determined in
preliminary experiments by drying the biomass samples at
105.degree. C. until a constant weight is reached). The weight
(volume) of water in the sample is noted as W.
[0562] 1 ml of Na-acetate buffer (0.1 M, pH 5.0) containing 1 mM of
sodium azide is added, and the slurry is thoroughly mixed using a
vortex. The pH is measured and adjusted to 5.0 using concentrated
acetic acid or NaOH, if necessary.
[0563] The volume(s) of enzyme(s) that is necessary to add to the 2
ml tube is calculated, taking into account the protein
concentration (enzyme activity) in a stock solution and the final
protein concentration (enzyme activity) in the reaction mixture.
Using these data, the volume of water (buffer), that is necessary
for adjusting the total volume of the reaction system to 2 mL
(W.sub.a) is calculated, taking into account water contained in the
biomass sample (W.sub.s) and the previously added 1 ml of
buffer.
[0564] The calculated amount of the acetate buffer (W.sub.a) is
added to the biomass slurry, the lid closed and the tube incubated
in a TS-100 Thermo Shaker for 5 minutes to preheat the sample to
50.degree. C. The calculated amount(s) of enzyme stock solution(s)
is added to the tube to start the enzymatic reaction. The reaction
is allowed to proceed at 50.degree. C. and 1200 rpm vibrations in a
TS-100 Thermo Shaker for 96 hours.
[0565] Aliquots of the reaction slurry (50-100 .mu.L) are typically
taken at 3 or 6, 24, 48, 72, and 96 hours, using a pipette with a
cut tip. The aliquots are centrifuged at 15,000 rpm in a micro
centrifuge for 3 minutes, and the supernatants are analyzed for
glucose by the glucose oxidase peroxidase assay (see Example 6) and
for reducing sugars by the Nelson-Somogyi assay (see, for example,
Example 1). Samples (supernatants) are typically analyzed
immediately. In some cases, they are boiled for 5 minutes, frozen
and analyzed later. The final time point (96 hours) is also
analyzed for sugars by HPLC.
[0566] A substrate blank (control) is prepared in the same way as
above, except that distilled water is added to the tube instead of
the enzyme(s) stock solution(s). The supernatant resulting from the
substrate blank is analyzed for glucose by the glucose
oxidase-peroxidase assay, reducing sugars by the Nelson-Somogyi
assay and also for sugars by HPLC. Typically, there is no need to
prepare enzyme blanks since purified enzymes (monocomponents)
usually don't contain sugars.
[0567] All reactions are carried out in duplicate, and the results
of the analysis for sugars are given as average values together
with standard deviations. Typically, protein content in the
reaction system (mg protein/g solids) is used as a measure of
enzyme loading. In the case of purified enzymes where the amino
acid sequences are known, the protein concentration is calculated
from the UV absorption at 280 nm using enzyme extinction
coefficients predicted by the ProtParam tool
(http://www.expasy.ch/tools/protparam.html). Otherwise, the protein
concentration is determined using the Lowry (Pierce BCA) assay.
[0568] In the experiments with individual purified enzymes
(monocomponents), the enzyme loading is 2 mg protein/g solids. In
the case of monocomponent cellulases (endoglucanases and
cellobiohydrolases), the enzymatic reactions are carried out in the
presence of purified .beta.-glucosidase (0.5 U/ml cellobiase
activity) in order to convert all soluble oligosaccharides to
glucose.
[0569] In the experiments with mixes of monocomponents, the enzyme
loading is typically 2 mg of total protein per 1 g of solids, when
preliminary screening of mixes is carried out. With selected mixes
of monocomponents, the enzyme (protein) loading may be varied (up
to 5-10 mg protein/g solids).
Calculations
[0570] To calculate the percent digestibility of cellulose
(glucan), the glucose concentration in the final (96 hour)
supernatant of the reaction mixture is calculated (data from the
glucose oxidase-peroxidase assay and/or HPLC assay), subtract the
glucose concentration from the substrate blank (control). The
initial cellulose (glucan) content in the reaction system is also
calculated. For example, if the cellulose content in the pretreated
feedstock was 35% of the total solids (thy matter) and the
hydrolysis was carried out at 100 g solids/1, then the initial
cellulose concentration would be 35 mg/ml.
[0571] The percent digestion of cellulose is calculated as
follows:
Digestion ( % ) = Glucose concentration ( mg / mL ) .times. 100 %
Initial cellulose concentration ( mg / mL ) .times. 1.11
##EQU00004##
The factor 1.11 is a result of hydration of anhydroglucose residue
in cellulose (162 Da per one residue) that yields a glucose
molecule (180 Da).
[0572] To calculate the percent digestibility of hemicellulose
(xylan), the total concentration of pentoses (xylose+arabinose) in
the final (96 hour) supernatant of the reaction mixture is
determined (data from HPLC assay), subtract the pentose
(xylose+arabinose) concentration from the substrate blank
(control). The initial hemicellulose (xylan) content in the
reaction system is also calculated. For example, if the
hemicellulose content in the pretreated feedstock is 30% of the
total solids (dry matter) and the hydrolysis is carried out at 100
g solids/l, then the initial hemicellulose concentration would be
30 mg/ml.
[0573] The percent digestion of hemicellulose is calculated as
follows:
Digestion ( % ) = Total pentose concentration ( mg / mL ) .times.
100 % Initial hemicellulose concentration ( mg / mL ) .times. 1.136
##EQU00005##
The factor 1.136 is a result of hydration of anhydropentose (xylose
or arabinose) residue in hemicellulose (132 Da per one residue)
that yields in pentose molecule (150 Da).
REFERENCES
[0574] 1. Berezin I V, Rabinovich M L, Sinitsyn A P (1977) Study of
applicability of quantitative kinetic spectrophotometric method for
glucose determination. Biokhimiya (Moscow) 42:1631-1636. [0575] 2.
Ghose T K (1987) Measurement of cellulase activities. Pure Appl.
Chem. 59:257-268. [0576] 3. Gusakov A V, Salanovich T N, Antonov A
I, Ustinov B B, Okunev O N, Burlingame R, Emalfarb M. Baez M,
Sinitsyn A P (2007) Design of highly efficient cellulase mixtures
for enzymatic hydrolysis of cellulose. Biotechnol. Bioeng. On-line:
http://www3.interscience.wiley.com/cgi-bin/abstract/114064699/,
DOI: 10.1002/bit.21329 [0577] 4. Lowry O H, Rosebrough N J, Farr A
L. Randall R J (1951) Protein measurement with the Folin phenol
reagent. J. Biol. Chem. 193:265-275. [0578] 5. Sinitsyn A P,
Chemoglazov V M, Gusakov A V (1990) Methods of investigation and
properties of cellulolytic enzymes (in Russian), Biotechnology
Series, v. 25. Moscow: VINITI Press. 220 p. [0579] 6. Somogyi M
(1.952) Notes on sugar determination. J. Biol. Chem. 195:19-23.
Example 14
[0580] The following example illustrates an assay used to measure
the thermal stability of enzymes of the present invention.
[0581] This assay allows the determination of the long-term thermal
stability of an enzyme at 40, 50 and 70.degree. C. and at different
pH values (the preferred values may vary with the area of use of
the enzyme and its pH-profile, which is typically from pH 5.0 or
7.0). A solution of enzyme is incubated from 3-24 hours in a
thermostat. The gradual drop of enzyme activity is recorded. The
resulting data are used to determine enzyme half-life or enzyme
activity after 3 hours of incubation under specified conditions
(temperature and pH).
[0582] An enzyme dilution that gives a high enough optical density
to be determined when the enzyme is diluted (e.g., in the range
0.8-1.2 using the Somogyi-Nelson assay described above) should be
selected. 500-800 .mu.l of this dilution (prepared using preheated
buffer and 10- to 20-fold dilution factors (e.g., 450 .mu.l
preheated buffer+50 .mu.l of enzyme) 0 is placed into a 1.5-2.0 ml
eppendorf tube (capped) in a thermostat set at 40, 50 or 60.degree.
C. Aliquots are removed at 15, 30, 45, 60, 90, 120, 180 minutes and
activity determined simultaneously using any of the enzyme activity
assays disclosed in this application or known in the art. A
zero-point activity is determined by preparing the same enzyme
dilution at room temperature. Relative activity (in %) is plotted
versus time, with the activity of the enzyme at the zero-time point
(without heating) set to 100%.
Example 15
[0583] The following example illustrates methods of bioscouring
textiles using enzymes of the present invention.
[0584] The methods below are primarily intended for testing small
amount of enzymes (mini-testing in the range of 0.5-5 mg of enzyme
sample).
Bioscouring Assays
[0585] Scouring removes hydrophobic waxes and pectic substance from
the surface of raw cotton fabric. The following methods, which
allow the evaluation of the scouring efficiency of pure enzymes and
crude enzyme preparations, are listed below: [0586] 1) Scouring and
evaluating procedure for strips of 2.5.times.16 cm textile strips,
[0587] 2) Micromethod of scouring and evaluating procedure for
circular pieces of textile, diameter 1.4 cm, [0588] 3) Method for
determination (staining) of remaining pectic substances in scoured
textile (qualitative), [0589] 4) Method for determination of
remaining waxes in scoured textile.
Method A. Scouring and Evaluation Procedure for the Strips of
2.5.times.16 cm Textile Strips
[0590] Sample preparation: For all testing, raw weave 100% cotton
fabric is used (density of 142 g/m.sup.2). Cotton fabric is desized
by .alpha.-amylase overnight with surfactant added at room
temperature, then rinsed and dried at room temperature. For 1-2
m.sup.2 fabric, about 5 liters of solution containing 5 g/l textile
.alpha.-amylase preparation and 1 g/l nonionic surfactant for
scouring is needed. The fabric is rinsed thoroughly with tap water
and dried at room temperature overnight. The fabric should not be
ironed. The fabric is then cut into strips (2.5.times.16 cm).
[0591] Scouring procedure: The strips are treated in 100 mL plastic
vessels on an Elpan water bath shaker at 50.degree. C. at 250 rpm
for one hour. Two 16.times.2.5 cm strips of fabric (1.1-1.2 g
total) are loaded in one vessel. The liquor ratio is 20:1, using
0.1 M acetate buffer, pH 5.0.
[0592] In detail, the strips, usually 20 strips (in 10 vessels),
are marked with a textile marker, then dipped into the water to
completely wet each strip. An enzyme solution (20 ml) is poured in
the vessels. Then, two wet strips are rapidly placed into each
vessel using tweezers. The vessels are capped and placed into a
preheated water bath shaker (set at 50.degree. C., 250 rpm,
amplitude 4) for one hour.
[0593] After the treatment, the strips are rinsed with cold tap
water for 2-4 minutes, usually in one bath, and then dried at room
temperature overnight. More intense rinses have been tested (hot
water, sequestrates e.g. EDTA, surfactants) but are not recommended
since they provide less reproducible results.
[0594] Heating: Dry strips are heated for 1 hour at 105.degree. C.,
allowed to remain overnight at room temperature, and then assayed
for wicking height.
[0595] Analysis of wettability: Wettability (wicking height) is
measured according to the Russian textile standard. One end of the
vertical strip of thy fabric is immersed in a water bath. Wicking
height (cm) is measured after 30 minutes. The higher the value of
wicking height--the better the scouring result. A value of 8 cm or
higher (for heated samples) indicates good scouring. All
experiments are conducted in duplicate or triplicate.
[0596] Enzyme dosages for commercial preparations are usually
within the range 5-60 g preparation per kg textile. Preferably, the
preparations should be normalized by protein. The dose of 0.6 mg
protein per 1 ml solution (12 g protein per kg textile) proves to
be suitable when comparing T. reesei preparations and pure enzymes
using this method and micromethod B below.
[0597] When comparing preparations in one dose, 4 repetitions for
each preparation (two vessels with two strips in each) are carried
out. It is often more convenient to run a series of 10-16 vessels,
with 1 blank (buffer treatment without enzyme) and 1-2 vessels with
some standard preparation in each series
[0598] Preferably, an alkaline scoured strip should also be added
in the heating and evaluation procedure. For alkaline scouring, the
strips are treated with a solution of 40 g/l NaOH in a boiling
water bath for 2 hours at a liquor ratio of 5-20. Adding surfactant
is typically not needed for these samples, but low doses of 1 g/l
of EM-3 surfactant or similar products may be beneficial.
[0599] The strips can be analyzed without heating after drying
overnight. However, heating of the strips provides more
reproducible result and leads to a greater difference between well
and poorly scoured strips. Drying and conditioning of the strips
overnight or longer after heating is an important step and should
not be shortened. Results may also be improved by using a special
conditioning chamber with constant humidity, rather than leaving
the samples at room conditions.
Method B. Micromethod of Scouring and Evaluation Procedure for the
Circular Pieces of Textile, Diameter 1.4 cm
[0600] Treatment procedure: In this method, circular pieces
(diameter 1.4 cm), of the same desized fabric from method A are
used. The fabric should be completely wetted in distilled water
before the analysis, with excess water removed using filter
paper.
[0601] A circular piece of textile is placed in a small cylindrical
vial ("penicillin vial", 2.2 cm diameter and 5 cm height)
containing 2.0 ml of enzyme solution in 0.1 M Na-acetate buffer (pH
5.0). The vials are placed into a preheated water bath (50.degree.
C.) on a shaker (250 rpm) and incubated for 1 hour. The fabric
swatch is removed with tweezers, washed 2-3 minutes with cold tap
water, and then dried at room temperature conditions overnight. For
each enzyme sample and control sample (treatment with no enzyme in
buffer), the assay procedure is typically carried out in 2
repetitions (2 pieces).
[0602] Heating: Overnight-dried textile swatches are heated for 1
hour at 105.degree. C., then room conditioned overnight prior to
being assayed for wettability. Heating generally provides better
results, but may be omitted.
[0603] Analysis of wettability: Wettability is measured using a
drop test. A drop of distilled water, 5 ul, is placed on the
surface of fabric and time it takes the drop to adsorb into the
material is measured. For each circular piece, 5 measurements are
taken, placing each new drop in a dry area within the fabric
swatch. The average value for each sample is then calculated.
[0604] Enzyme preparations and pure enzymes are tested at 0.6 mg
protein per ml solution, but the assays are typically reproducible
at doses between 0.6-2.4 mg protein per ml solution This method
usually gives greater deviations than method A, but 10-fold less
enzyme is needed. Thus, method B is usually recommended where
enzyme amounts are limiting.
[0605] Testing at one dose with heat treatment reveals preparations
of 8-10 cm (e.g. classical ACE, BioACE, IndiAGE 44L, Ecostone L
350, Celloviridin) provide comparative results.
[0606] Table 28 and FIG. 20 show the bioscouring ability of
cellobiohydrolases and endoglucanases of the present invention. The
reciprocal of the time (1/time; sec.sup.-1) of water drop
adsorption on a textile after treatment at pH 5.0 with 0.6 mg/l of
enzyme followed by heating is shown. The water drop adsorption time
for the untreated control sample is 2400 sec (1/time=0.0004).
TABLE-US-00041 TABLE 28 Bioscouring Abilities of C1
Cellobiohydrolases and Endoglucanases Enzyme 1/Time (sec.sup.-1)
Blank 0.0004 EG I 0.0006 EG II 0.0006 EG II (CD) 0.0007 EG III
0.0008 EG V 0.0007 EG VI 0.0007 CBH Ia 0.0006 CBH Ia (CD) 0.0006
CBH Ib 0.0006 CBH IIa 0.0006 CBH IIb 0.0006
Qualitative Method for Determination (Staining) of Remaining Pectic
Substances in a Scoured Fabric
[0607] Ruthenium Red Dye (Sigma, R-2751) is used for specifically
staining pectic substances in fabric, by forming a complex with
charged carboxyl groups of pectins.
[0608] Small swatches (2.5.times.6 or 2.5.times.2.5 cm) of scoured
fabric are marked using a textile marker. The scoured samples
remaining after the wettability analyses described above can be
used.
[0609] The following controls are included: [0610] 1) desized
unscoured fabric, [0611] 2) control treated with buffer (without an
enzyme). [0612] 3) alkaline scoured fabric.
[0613] All the samples are dyed in one bath and weighed together.
The volume of the solution is calculated using the following
formula:
V(ml)=m samples (g).times.30.
[0614] The desired volume of 1 g/l solution of the dye is prepared.
The dye solution is stirred and warmed in a boiling water bath for
5 minutes until the dye dissolves completely. The dye solution is
warmed in a glass vessel on a water bath shaker set at 50.degree.
C., 200 rpm. The samples are then placed in the solution and some
surfactant is added if the samples are not wetted. The samples are
dyed for 30 minutes at 50.degree. C. with shaking at about 200 rpm
for even dying.
[0615] Then, the samples are rinsed under tap water followed 3-4
rinses (about 5 minutes each) with hot water (about 70.degree. C.
in a glass vessel). After this, the samples are dried overnight at
room temperature. The samples should not be iron dried as it may
destroy the dye. The samples are generally compared visually,
although quantitative colorimetric evaluation can be used.
[0616] Example of stained strips are presented in FIG. 21. FIG. 21
illustrates a comparison of the pectin content in an unscoured
sample, an alkaline scoured sample, a sample bioscoured by BioPrep
(Novo Nordisk, Denmark) at 20 g preparation per 1 kg fabric, and
samples bioscoured by ACE at 40 g/kg for 1 or 3 hours.
[0617] The unscoured sample and control (treated without enzyme)
are stained at high and equal intensity--that means no pectins are
removed during buffer treatment. Alkaline scoured sample is very
faint stained--100% pectin removed (by definition). All three
enzyme scoured samples are also faint, but little bit more
intensive than alkaline scoured. This means that cellulase scouring
and scouring by BioPrep removes the majority of surface (stainable)
pectins from the primary wall, similar to alkaline scouring.
[0618] Other positively charged dyes can be used for staining
pectins due to the negatively charged polymer in cotton. We have
tested methylene blue, but it resulted in less distinction between
scoured and unscoured fabric.
Method for Determination of Remaining Waxes in Scoured Textiles
[0619] The presence of remaining waxes in scoured textiles is
determined using an extraction procedure. 3-12 g of scoured fabric
is weighed and then extracted with boiling benzene in a Soxlet
apparatus for 4 hours. The extract is evaporated in a rotary
evaporator. Then, a flask containing the evaporated, extracted
waxes is brought to a constant weight at 105.degree. C. (0.751 hour
of incubation time is usually necessary). The weight of extracted
waxes is then determined and the wax content is calculated.
[0620] We compared bioscouring (ACE preparation) and alkaline
scouring. Desized cotton strips were treated, dried at room
conditions overnight and divided into two parts: one part was
evaluated for wicking height, another part were heated to
105.degree. C. for 1 hour, equilibrated at room conditions
overnight and then evaluated for wicking height.
[0621] Wax content was determined by the method described above,
and the results are presented in Table 29 below.
[0622] Bioscouring by ACE and by the alkaline procedure gave
comparable results based on wicking height changes, but bioscouring
removed less wax. Alkaline scouring, however, did not remove all of
the wax.
TABLE-US-00042 TABLE 29 Bioscouring by ACE versus alkaline
scouring. Wicking height, cm Wicking height, cm Type of treatment
Room drying heating 105.degree. C., 1 h Waxes, % Untreated cotton
6.0 0 0.62 .+-. 0.03 Alkaline scouring 13.0 8.4 0.37 .+-. 0.03
Bioscouring (ACE) 12.4 8.6 0.57 .+-. 0.03
Example 16
[0623] The following example illustrates a method of biopolishing
(biofinishing) textiles using enzymes of the present invention.
[0624] The method allows for testing small amount of enzymes
(mini-testing in the range of 0.5-5 mg of enzyme sample). The
method for testing of pure enzymes and crude enzyme preparations is
carried out in specially designed cells (essentially a
micro-washing machine) also used for the mini-denim washing assay
(see FIG. 22B).
[0625] Sample preparation: Circular pieces of textile (diameter of
28 mm) are excised from TIC-460 cotton interlock dyed Red 80
fabric. The samples are marked with a textile marker on the side
that will be treated with an enzyme and mechanical stress. Each
sample is then weighed to an accuracy 1 mg. One piece generally
weighs 160 to 180 mg.
[0626] Treatment procedure: The sample is fixed in the cell and
three metal cylinders are placed into it. When all cells are
assembled, 3 ml of enzyme solution are added to each cell. The
enzyme solutions contain 0.1M Na-acetate buffer, pH 5.0, to control
the pH. The cells are placed into a preheated Elpan water bath
shaker (50.degree. C., 300 rpm, amplitude 4) and the samples are
treated for 1 hour.
[0627] The efficiency of biopolishing is assayed by measuring the
optical density of the reaction solution. Immediately after the
treatment, the solution should be thoroughly mixed, using a
pipette, to resuspend any possible sediment of removed fuzz. A
1-1.5 ml aliquot of reaction solution is then taken from each cell.
The optical density of the solution is then measured on a
spectrophotometer at 400 nm in a 1 ml cuvette (1 cm optical path).
The solution is resuspended and dosed into the cuvette and the
optical density determined rapidly (within 10 seconds), as the
suspension sediments gradually and the optical density changes due
to sedimentation. If the value of A.sub.400 exceeds 1 unit, the
sample is diluted 2-4 times carefully and the optical density
determined once again. If a sample was diluted, the resultant value
should be multiplied by the dilution factor.
[0628] Rinse and drying: The circular pieces of textile should be
rinsed immediately after taking the aliquots of reaction solution,
since the enzymatic reaction will proceed otherwise. The sample is
removed from the cell, and washed briefly in distilled water. All
the samples are then rinsed together thoroughly in distilled water
for 3-5 min, stirring gently and changing water periodically. After
rinsing, the samples are blotted using filter paper and dried at
room temperature for at least 24 hours.
[0629] Measurement of the efficiency of biopolishing by
determination of the weight loss: Each sample is weighed to a 1 mg
accuracy. The resulting values are compared with that of the same
sample before the treatment and weight loss is calculated as
below:
Weight loss (mg)=m before treatment (mg)-m after treatment (mg) %
Weight loss=Weight loss/m before treatment (mg).times.1.00%
[0630] Qualitative evaluation of the surface appearance: The image
of the edge of the folded fabric is taken using suitable device at
a magnification about .times.60. Images of the samples treated by
different enzymes and blank runs are compared visually.
[0631] In detail, the circular piece is folded (for all samples in
a same manner--e.g., so that the surface threads are perpendicular
to the edge line). All samples should be folded so that the treated
surface is up. The samples can be stapled to a piece of paper for
more convenient operation. Images are then taken using a QX3 Intel
Play microscope at .times.60 magnification and recorded in JPEG
format.
[0632] The samples should be run in series up to 8 samples in each.
In each series, there should be a control treatment (without an
enzyme) and a treatment with a standard preparation. For each
preparation under investigation, at least 2 repetitions should be
made.
[0633] Weight determination is influenced by the drying conditions.
Drying of the samples before the treatment and after in
reproducible conditions is recommended (e.g., heating in an oven
and drying in a dessicator). Drying at room conditions can result
in slight deviations between series run on different days.
[0634] The effect of enzymatic treatment along with mechanical
stress in our test is similar to the widely used Launderometer and
other laboratory testing machines, but it has different geometry.
Due to this geometry, the system is amenable to further
miniaturization and require less than 1 mg quantities of an
enzyme.
[0635] Evaluation of the biopolished fabric can be problematic. The
most important (and direct) parameters are pilling resistance and
fuzziness of a treated fabric. Weight loss, color clarification,
and improvement of softness are less direct but valuable
parameters. Greater pilling resistance correlates with greater
weight losses and gives less fuzz, clarification of color and
softer feel. Fuzziness is evaluated qualitatively in our method on
3 levels: good, moderate and poor. Optical density should correlate
directly both to weight loss (weight loss is due mainly to the
removed fuzz making turbid suspension in a solution) and fuzziness
of fabric.
[0636] Table 30 and FIG. 23 illustrate the biopolishing ability of
cellobiohydrolases and endoglucanases of the present invention. The
dye release (D 400 nm) after treatment with 0.2 g/l enzyme at pH
5.0 and 50.degree. C. is shown.
TABLE-US-00043 TABLE 30 Biopolishing Abilities of C1
Cellobiohydrolases and Endoglucanases Enzyme % Dye Release EG I
0.17 EG II 0.2 EG II (CD) 0.1 EG III 0.16 EG V 0.2 EG VI 0.1 CBH Ia
0.1 CBH Ia (CD) 0.2 CBH Ib 0.3 CBH IIa 0.2 CBH IIb 0.1
Example 17
[0637] The following example illustrates methods of biostoning
(abrasion) textiles using enzymes of the present invention.
[0638] The method allow for testing small amount of enzymes
(mini-testing in the range of 0.5-5 mg of enzyme sample). The
method is available in two scales, requiring approximately 10 and 1
mg of protein, respectively (Method A and Method B). Both scales
utilize the same principle and gives comparable results. Method B
requires specially designed cells but proved easier when assaying
pure enzymes.
Method A
[0639] 250 ml vessels (internal diameter 56 mm, height 110 mm) with
Teflon disks at the bottom (thickness 4 mm) are used for the
treatment of denim fabric with cellulase (see FIG. 22A). A swatch
of desized denim fabric (10.times.10 cm) is pulled over the butt of
a plastic tube (diameter 50 mm, height 100 mm) and fixed with a
rubber ring. The tube is inserted into the 250 ml vessel so that
the swatch attaches tightly and uniformly to the Teflon disk at the
bottom of the glass. Ten metal buck-shot balls (diameter 7 mm) are
placed inside the tube to provide mechanical stress to the surface
of fabric, and 30 ml of enzyme solution in the appropriate buffer
is added. The vessel is placed onto a ELPAN water bath shaker type
357 (Poland) at 50.degree. C. The enzymatic treatment of the denim
is carried out for 1 hour at the shaker speed of 300 rpm and
amplitude of vibrations adjusted to 4 units. The swatch is then
removed and washed for 1 minute with running water. An excess of
water on the surface of the swatch is removed by squeezing it
between two sheets of blotting paper, and then the swatch is dried
overnight in the open air at room temperature. In a separate
experiment, a control assay is carried out as described above, but
in the absence of enzyme. For the control, 30 ml of the same buffer
used for enzyme dilution is added to the vessel.
Method B
[0640] This assay procedure uses specially designed cylindrical
cells (external diameter 35 mm, internal diameter 30 mm, height 48
mm) made of stainless steel and equipped with tightened lids (see
FIG. 22B). This method also uses 10-fold less enzyme than Method A.
A circular swatch of denim fabric (diameter of 30 mm) is placed at
the bottom of a cell, and a stainless steel cylindrical cartridge
is inserted to clamp the swatch. Three metal cylinders (7.times.7
mm), made of stainless steel, are placed into the cell to provide
mechanical stress, and 3 ml of enzyme solution in the appropriate
buffer is added. A lid, equipped with a rubber ring inside, is
quickly tightened to seal the cell hermetically, and the cell is
placed in a water bath on a shaker. Then the procedure is carried
under the same conditions as described above for Method A. In the
control assay, afacric swatch is treated under the same conditions
but in the absence of enzyme. To this end, 3 ml of the same buffer
used for enzyme dilution is added to the cell.
[0641] Both assays (Method A or Method B) are carried out at least
in two repetitions for each particular enzyme (usually three
repetitions). Typically, when different cellulase preparations are
compared, the conditions in the reaction system are equalized by
the CMCase activity (1-3 U/ml). Acid cellulase samples are assayed
at pH 5.0 using 0.1 M acetate buffer. Neutral cellulases are
assayed at pH 6.0 using 0.1 M phosphate buffer.
[0642] Analysis of color intensity on denim swatches is carried out
using a Mustek MFS-12000SP (USA) scanner. A circular area on the
swatch, subjected to the combined enzyme and mechanical action as
above, is scanned at 300 dpi resolution. This area is usually more
light than the denim surface near the swatch edges. Each swatch is
scanned twice: first, when the threads on the surface of the fabric
are parallel to the movement of the light source on scanning, and
second, when the threads are directed perpendicular to the movement
of the light source on scanning.
[0643] The JPG images obtained as a result of scanning are analyzed
using Adobe Photoshop software (versions 3.0 or later can be used).
For each particular scan, a histogram is opened. The blue channel
is selected and the percentile at level 32 (P32) is read. This
parameter shows the percentage of image pixels having the intensity
of color higher than at the level 32 of histogram (a total of 256
shades (levels) of blue color as the abscissa axis). The higher is
the P32, the darker is the denim swatch. On the contrary, the
parameter (100-P32, %) shows the percentage of pixels having the
intensity of color lower than at the level 32 of the histogram.
FIG. 24 shows an example histogram generated from the color
intensity analysis of a denim swatch.
[0644] For each particular enzyme tested, an average value of the
P32 and standard deviation are calculated using the data of all
scans for this enzyme. For example, if the assay is performed in
two repetitions, four P32 values are taken into account, since for
each repetition two scans are carried out.
[0645] The difference between the average value of P32 obtained in
the control (without cellulase) and the average value of P32 for
the enzyme is taken as a criterion of the abrasive activity of the
enzyme.
[0646] Table 31 and FIG. 25 illustrate the denim-washing (abrasion)
ability of cellobiohydrolases and endoglucanases of the present
invention using Assay B above. The relative units per mg of enzyme
determined at 50.degree. C., pH 5.0 is shown.
TABLE-US-00044 TABLE 31 Abrasion Abilities of C1 Cellobiohydrolases
and Endoglucanases Enzyme Relative Units per mg EG I 14 EG II 62 EG
II (CD) 54 EG III 25 EG V 65 EG VI 12 CBH Ia 3 CBH Ia (CD) 11 CBH
Ib 10 CBH IIa 22 CBH IIb 15
[0647] Additional examples and discussion concerning methods A and
B are presented in A. Gusakov, A. Sinitsyn, S. Grishutin, D.
Tikhomirov, D. Shook, D. Sheer, M. Emalfarb Microassays to control
the results of cellulase treatment of denim fabrics; Textile
Chemist and Colorist and American Dyestuff Reporter, (2000), V. 32,
N. 5, P. 42, the contents of which are hereby incorporated by
reference in their entirety.
Example 18
[0648] The following example illustrates methods of backstaining
textiles using enzymes of the present invention.
[0649] The methods allow for testing small amount of enzymes
(mini-testing in the range of 0.5-5 mg of enzyme sample).
[0650] Standard Indigo backstaining Method A requires larger enzyme
quantities (typically 2-5 mg of protein for one assay performed
with 3.5.times.3.5 cm swatches in 2-3 repetitions). When
purification of enzyme components is performed on a laboratory
scale and it is necessary to assay the backstaining index (BSI) for
the purified enzyme, much lower quantities of an enzyme are
typically available. In such case, a modification of this method
(Method B) should be used, where smaller swatches of standard
garment (1.4.times.1.4 cm) are used, and the total volume of Indigo
suspension is 2 ml. This minimized microassay procedure requires
ten times lower amounts of protein (0.2-0.5 mg).
Method A
[0651] Indigo reagent preparation: 250 mg of indigo is added to 50
ml of distilled water in a flask and stirred vigorously for 15-20
minutes on a magnetic stirrer. Before taking aliquots for staining
experiments, the suspension is placed into a Bandelin SONOREX TK52
ultrasonic bath for 5 minutes to disrupt large indigo aggregates
and to make the suspension more uniform.
2 ml aliquots are used for staining experiments (see below) under
magnetic stirring in order to provide a uniform indigo suspension
in the reaction mixture.
[0652] Assay procedure: A piece of white fabric (3.5.times.3.5 cm)
is added to 1.8 ml of 0.1 M Na-acetate buffer (pH 5.0) in a 250 ml
glass vessel and incubated at room temperature for 10 minutes
(control). A solution of enzyme preparation in the same amount of
the buffer is used in the enzyme assay procedure. 2 ml of indigo
suspension (5 mg/ml) is then added, and the vessel is placed into a
water bath (50.degree. C.) on a shaker (300 rpm) and agitated for
30 min. The piece of fabric is removed with tweezers and washed
twice for 5 minutes with 50 ml of distilled water (on a shaker at
300 rpm) and then dried at room temperature overnight. For each
enzyme sample and control, the assay procedure is carried out at
least in 3 repetitions.
[0653] Samples of stained fabric are scanned at 300 dpi resolution
on a "Paragon" ("Murtek", USA) scanner. Two images of the front
side and two images of the back side (one direct and one rotated
90.degree.) are obtained for each sample. Images are processed
using an Adobe Photoshop software. A histogram of colour
intensities is calculated for each sample using a blue channel, and
a percentile at level. 120 is used as a criterion of staining. An
average value and standard deviation are calculated for each enzyme
preparation and control.
[0654] T. reesei (ACE) is usually used as a control (internal
standard) for analyzing backstaining properties of a particular
enzyme sample, since ACE is characterized by a high backstaining
index. If possible, a IndiAge Super GX sample should be used as a
second internal standard since it has a low backstaining index.
Method B
[0655] The Indigo stock suspension (5 g/l) is prepared and the
image processing is carried out as described above.
[0656] Assay procedure: A piece of white fabric (1.4.times.1.4 cm)
is placed in a small cylindrical vial ("penicillin vial", 2.2 cm
diameter and 5 cm height) containing 1.8 ml of enzyme solution in
0.1 M Na-acetate buffer (pH 5.0) and incubated at room temperature
for 10 min. 0.2 ml of Indigo suspension is then added, and the vial
is placed into a water bath (50.degree. C.) on a shaker (300 rpm)
and agitated for 30 minutes. The fabric swatch is removed with
tweezers and washed twice for 5 minutes with 5 ml of distilled
water at 50.degree. C. (on a shaker at 300 rpm), then dried at room
temperature overnight. For each enzyme sample and control (internal
standard), the assay procedure is typically carried out in 3
repetitions. In most experiments, 1.5 CMCase units per ml of
reaction system are used. In cases where specific CMCase activity
is very low (or absent), 0.05-0.1 mg of protein per ml of reaction
system is used.
[0657] It is convenient to place three vials (3 repetitions) into a
250 ml vessel containing 15-20 ml of water at the bottom and then
to place the vessel into a water bath on a shaker.
[0658] Additional examples and discussion concerning methods A and
B are presented in A. Gusakov, A. Sinitsyn, S. Grishutin. D.
Tikhomirov, D. Shook, D. Sheer, M. Emalfarb Microassays to control
the results of cellulase treatment of denim fabrics; Textile
Chemist and Colorist and American Dyestuff Reporter, (2000), V. 32,
N. 5, P. 42, the contents of which are hereby incorporated by
reference in their entirety.
Example 19
[0659] The following example illustrates assays used to measure
acetyl esterase enzymatic activity.
Acitivity Towards p-Nitrophenyl Acetate Substrate
[0660] This assay measures the release of p-nitrophenol by the
action of an acetyl esterase on p-nitrophenyl acetate (PNPAc). One
acetyl esterase unit of activity is the amount of enzyme that
liberates 1 micromole of p-nitrophenol in one minute at 37.degree.
C. and pH 7.2.
[0661] Phosphate buffer (0.01 M, pH 7.2) is prepared as follows:
0.124 g of NaH.sub.2PO.sub.4*H.sub.2O and 0.178 g Na.sub.2HPO.sub.4
are dissolved in distilled water so that the final volume of the
solution 500 ml and the pH of the resulting solution is 7.2.
[0662] PNPAc (Fluka Chemie, Switzerland, cat. #46021) is used as
the assay substrate. 3.6 mg of PNPAc is dissolved in 10 ml of 0.01
M phosphate buffer using a magnetic stirrer to obtain a 2 mM stock
solution. The solution is stable for 2 days with storage at
4.degree. C.
[0663] The stop reagent (0.25 M Tris-HCl, pH 8.5) is prepared as
follows: 30.29 g of Tris is dissolved in 900 ml of distilled water
(Solution A). The final 0.25 M Tris-HCl pH 8.5 is prepared by
mixing solution A with 37% HCl until the pH of the resulting
solution is equal to 8.5. The solution volume is then adjusted to
1000 ml. This reagent is used to terminate the enzymatic reaction.
Using the above reagents, the assay is performed as detailed
below.
[0664] For the enzyme sample, 0.10 ml of 2 mM PNPAc stock solution
is mixed with 0.01 ml of an enzyme sample and incubated at
37.degree. C. for 5 minutes (Axe2 and Axe3) or 10 minutes (Axe1):
After exactly 5 or 1.0 minutes of incubation, 0.1 ml of 0.25 M
Tris-HCl solution is added and then the absorbance at 405 nm
(A.sub.405) is measured in microtiter plates as A.sub.S.
[0665] For the substrate blank, 0.10 mL of 2 mM PNPAc stock
solution is mixed with 0.01 ml of 0.0.1 M phosphate buffer, pH 7.2.
0.1 mL of 0.25 M Tris-HCl solution is then added and the absorbance
at 405 nm (A.sub.405) is measured in microtiter plates as
A.sub.SB.
[0666] Activity is calculated as follows:
Activity ( I U / ml ) = .DELTA. A 405 * D F * 21 * 1.33 13.700 * 5
##EQU00006##
where .DELTA.A.sub.405=A.sub.S-A.sub.SB. DF is the enzyme dilution
factor, 21 is the dilution of 10 .mu.l enzyme solution in 210 .mu.l
reaction volume, 1.33 is the conversion factor of microtiter plates
to cuvettes, 13.700 is the extinction coefficient (13700 M.sup.-1
cm.sup.-1 of p-nitrophenol released corrected for mol/L to
.mu.mol/mL), and 5 minutes is the reaction time.
[0667] Using the assay above, acetyl esterase activity of Axe1 was
found to be 0.23 IU/mL (.DELTA.A.sub.405=1.11, DF=1), Axe2 was
found to be 2.80 U/ml ((.DELTA.A.sub.405=0.1714, DF=40), and Axe3
was found to be 39.97 U/ml (.DELTA.A.sub.405=0.4901, DF=200).
Acitivity Towards Arabinoxylan Oligosaccharide Substrates
[0668] This assay measures the release of acetate by the action of
a xylan acetyl esterase on the arabinoxylan oligosaccharides from
Eucalyptus wood.
[0669] 0.01 M sodium hydroxide is prepared as follows: 0.4 g of
sodium hydroxide is dissolved in distilled water so that the final
volume of the solution to be 1000 ml
[0670] Arabinoxylan oligosaccharides from Eucalyptus wood were
prepared by dissolving 5 mg of arabinoxylan oligosaccharides in 1
ml distilled water using a magnetic stirrer as described in Kabel
et al. Carbohdr. Polym. 50:191 (2002).
[0671] For the enzyme sample, 1.0 mL of arabinoxylan
oligosaccharides stock solution is mixed with 0.005 mg of the
enzyme sample and incubated at 35.degree. C. The pH was maintained
at 5.0 by the addition of 0.01 M NaOH. The added volume of 0.01 M
NaOH is proportional to the breakdown of ester linkage and release
of acetate.
[0672] For the substrate blank, 1.0 ml of arabinoxylan
oligosaccharides stock solution is mixed with 0.005 mg of distilled
water and incubated at 35.degree. C. The pH was maintained at 5.0
by addition of 0.01 M NaOH. The added volume of 0.01 M NaOH is
proportional to the breakdown of ester linkage and release of
acetate.
[0673] Using the assay above, both Axe2 and Axe3 were found to
exhibit xylan acetyl esterase activity, as measured by the release
of acetate from eucalyptus wood arabinoxylan oligosaccharides. For
Axe2, 0.7 ml NaOH was used after 200 minutes of incubation. For
Axe3, 0.35 ml was used after 200 minutes.
Example 20
[0674] The following example illustrates assays used to measure
ferulic acid esterase enzymatic activity.
Acitivity Towards P-Nitrophenyl Butyrate Substrate
[0675] This assay measures the release of p-nitrophenol by the
action of a ferulic acid esterase on p-nitrophenyl butyrate
(PNPBu). One acetyl esterase unit of activity is the amount of
enzyme that liberates 1 micromole of p-nitrophenol in one minute at
37.degree. C. and pH 7.2. Phosphate buffer (0.01 M, pH 7.2) is
prepared as follows: 0.124 g of NaH.sub.2PO.sub.4*H.sub.2O and
0.178 g Na.sub.2HPO.sub.4 are dissolved in distilled water so that
the final volume of the solution is 500 ml and the pH of the
resulting solution is equal to 7.2.
[0676] PNPBu (Sigma, USA, cat. #N9876-5G) is used as the assay
substrate. 10 .mu.l of PNPBu is mixed with 25 ml of 0.0.1 M
phosphate buffer using a magnetic stirrer to obtain a 2 mM stock
solution. The solution is stable for 2 days with storage at
4.degree. C.
[0677] The stop reagent (0.25 M Tris-HCl, pH 8.5) is prepared as
follows: 30.29 g of Tris is dissolved in 900 ml of distilled water
(Solution A). The final 0.25 M Tris-HCl pH 8.5 is prepared by
mixing solution A with 37% HCl until the pH of the resulting
solution is equal to 8.5. The solution volume is adjusted to 1000
ml. This reagent is used to terminate the enzymatic reaction. Using
the above reagents, the assay is performed as detailed below.
[0678] For the enzyme sample, 0.10 ml of 2 mM PNPBu stock solution
is mixed with 0.01 ml of the enzyme sample and incubated at
37.degree. C. for 10 minutes. After exactly 10 minutes of
incubation, 0.1 ml of 0.25 M Tris-HCl solution is added and then
the absorbance at 405 nm (A.sub.405) is measured in microtiter
plates as A.sub.S.
[0679] For the substrate blank, 0.10 ml of 2 mM. PNPBu stock
solution is mixed with 0.0.1 ml of 0.01 M phosphate buffer, pH 7.2.
0.1 ml of 0.25 M Tris-HCl solution is then added and the absorbance
at 405 nm (A.sub.405) is measured microtiter plates as
A.sub.SB.
[0680] Activity is calculated as follows:
Activity ( I U / ml ) = .DELTA. A 405 * D F * 21 * 1.33 13.700 * 10
##EQU00007##
where .DELTA.A.sub.405=A.sub.S-A.sub.SB, DF is the enzyme dilution
factor, 21 is the dilution of 10 .mu.l enzyme solution in 210 .mu.l
reaction volume, 1.33 is the conversion factor of microtiter plates
to cuvettes, 13.700 is the extinction coefficient (13700 M.sup.-1
cm.sup.-1 of p-nitrophenol released corrected for mol/l to
.mu.mol/ml), and 10 minutes is the reaction time.
[0681] Using the assay above, the ferulic acid esterase activity of
FaeA1 was found to be 18.5 IU/ml (.DELTA.A.sub.405=0.906, DF=10),
FaeA2 was found to be 0.40 U/ml (.DELTA.A.sub.405=1.976, DF=1) and
FaeB2 was found to be 0.02 U/ml (.DELTA.A.sub.405=0.111, DF=1).
Acitivity Towards p-Nitrophenyl Butyrate Substrate
[0682] The following assay is used to measure the enzymatic
activity of a ferulic acid esterase towards wheat bran (WB)
oligosaccharides by measuring the release of ferulic acid. Wheat
bran oligosaccharides are prepared by degradation of wheat bran
(obtained from Nedalco, The Netherlands) by endo-xylanase III from
A. niger (enzyme collection Laboratory of Food Chemistry,
Wageningen University, The Netherlands). 50 mg of WB is dissolved
in 10 ml of 0.05 M acetate buffer pH 5.0 using a magnetic stirrer.
1.0 ml of WB stock solution is mixed with 0.0075 mg of the enzyme
and incubated at 35.degree. C. for 24 hours. The reaction is
stopped by heating the samples for 10 minutes at 100.degree. C. The
residual material is removed by centrifugation (15 minutes at
11.4000 rpm), and the supernatant is used as the substrate in the
assay detailed below.
[0683] For the enzyme sample, 1.0 ml of wheat bran oligosaccharides
stock solution is mixed with 0.005 mg of the enzyme sample and
incubated at 35.degree. C. for 24 hours. The reaction is stopped by
heating the samples for 10 minutes at 100.degree. C. The release of
ferulic acid is analyzed by measuring the absorbance at 335 nm.
[0684] For the substrate blank, 1.0 ml of wheat bran
oligosaccharides stock solution is mixed with 0.005 mg of 0.05 M
acetate buffer, pH 5.0, and incubated at 35.degree. C. for 24
hours. The reaction is stopped by heating the samples for 10
minutes at 100.degree. C. The release of ferulic acid is analyzed
by measuring the absorbance at 335 nm.
[0685] Using the assay above, FaeA1 was found to release ferulic
acid from wheat bran arabinoxylan oligosaccharides, releasing 75%
of the ferulic acid present in wheat bran. FaeA1 also exhibited
activity towards sugar beet pulp oligosaccharides (20% hydrolysis).
FaeB2 was found to release 80% of the ferulic acid present in wheat
bran. FaeB2 also exhibited activity towards sugar beet pulp
oligosaccharides (16% hydrolysis).
Example 21
[0686] The following example illustrates an assay used to measure
.alpha.-arabinofuranosidase enzymatic activity.
[0687] This assay measures the release of arabinose by the action
of the .alpha.-arabinofuranosidase on wheat arabinoxylan
oligosaccharides (WAX).
[0688] Acetate buffer (0.05 M, pH 5.0) is prepared as follows: 4.1
g of anhydrous sodium acetate or 6.8 g of sodium acetate *3H.sub.2O
is dissolved in distilled water so that the final volume of the
solution to be 1000 ml (Solution A). In a separate flask, 3.0 g
(2.86 mL) of glacial acetic acid is mixed with distilled water to
make the total volume 1000 ml (Solution B). The final 0.05 M
acetate buffer, pH 5.0, is prepared by mixing Solution A with
Solution B until the pH of the resulting solution is equal to
5.0.
[0689] Wheat arabinoxylan oligosaccharides are prepared by
degradation of Wheat arabinoxylan (Megazyme, Bray Ireland, Cat.
#P-WAXYI) by endo-xylanase 1 from A. niger (enzyme collection
Laboratory of Food Chemistry, Wageningen University, The
Netherlands). 50 mg of WAX is dissolved in 10 ml 0.05 M acetate
buffer, pH 5.0, using a magnetic stirrer. 1.0 ml of WAX stock
solution is mixed with 0.0075 mg of the enzyme and incubated at
35.degree. C. for 24 hours. The reaction is stopped by heating the
samples for 10 minutes at 100.degree. C. The residual material is
removed by centrifugation (15 minutes at 14000 rpm), and the
supernatant is used as the substrate in the assay detailed
below.
[0690] For the enzyme sample, 1.0 ml of wheat arabinoxylan
oligosaccharides stock solution is mixed with 0.0075 mg of the
enzyme sample and incubated at 35.degree. C. for 24 hours. The
reaction is stopped by heating the samples for 10 minutes at
100.degree. C. The release of arabinose and formation of new
arabinoxylan oligosaccharides were analyzed by High Performance
Anion Exchange Chromatography.
[0691] For the substrate blank, 1.0 ml of wheat arabinoxylan
oligosaccharides stock solution is mixed with 0.0075 mg of the 0.05
M acetate buffer, pH 5.0, and incubated at 35.degree. C. for 24
hours. The reaction is stopped by heating the samples for 10
minutes at 100.degree. C. The release of arabinose and formation of
new arabinoxylan oligosaccharides were analyzed by High Performance
Anion Exchange Chromatography.
[0692] High Performance Anion Exchange Chromatography analysis is
performed using a Dionex HPLC system equipped with a Dionex
CarboPac PA-1 (2 mm ID.times.250 mm) column in combination with a
CarboPac PA guard column (1 mm ID.times.25 mm) and a Dionex EDet1
PAD-detector (Dionex Co., Sunnyvale, Calif., USA). A flow rate of
0.3 ml/min was used with the following gradient of sodium acetate
in 0.1 M NaOH: 0-5 min, 0-100 mM; 5-45 min. 100-400 mM. Each
elution was followed by a washing step of 5 minutes 1,000 mM sodium
acetate in 0.1 M NaOH and an equilibration step of 15 minutes 0.1 M
NaOH.
[0693] Using the above assay, both Abf1 and Abf2 were found to
release arabinose from wheat arabinoxylan oligosaccharides. Both
enzymes removed arabinose residues, which were monosubstituted to
the xylose backbone. Arabinose residues linked to O2 and O3
position of the xylose residue were released by both Abf1 and
Abf2.
Example 22
[0694] The following example illustrates an assay used to measure
pectin methyl esterase enzymatic activity.
[0695] The following assay is used to measure the enzymatic
activity of a pectin methyl esterase towards highly methylated
citrus pectin (DM 65) in combination with methyl red. The assay
measures the release of methanol by the action of pectin methyl
esterases on methylated citrus pectin. The release of methanol will
acidify a solution (non-esterified galacturonic acid), which makes
the methyl red indicator change color at a pH of less than 4.4.
[0696] Pectin-methyl red solution is prepared as follows: 0.5 g of
methylated citrus pectin (DM 65, Sigma, USA, Cat. #P9436-5G), 2 mg
methyl red (Sigma, USA) and 0.435 g NaCl is dissolved in distilled
water so that the final volume of the solution is 50 ml. The final
pH is adjusted to 5.0 by adding 0.1 M NaOH.
[0697] For the enzyme sample, 0.1 ml of pectin-methyl red stock
solution is mixed with 0.01 ml of the enzyme sample and incubated
at 37.degree. C. overnight. The absorbance at 520 nm (A.sub.520) is
measured as A.sub.S.
[0698] For the substrate blank, 0.1 ml of pectin/methyl red stock
solution is mixed with 0.0.1 ml of distilled water and incubated at
37.degree. C. overnight. The absorbance at 520 nm (A.sub.520) is
measured as A.sub.SB.
[0699] Using the assay above, the .DELTA.A.sub.520 of Pme1 was
found to be 0.401, demonstrating that Pme1 is active towards highly
methylated pectin.
Example 23
[0700] The following example illustrates an assay used to measure
endo-arabinase enzymatic activity.
[0701] The assay is used to measure the enzymatic activity of an
endo-arabinase towards AZCL-arabinan (debranched). This substrate
is insoluble in buffered solutions, but rapidly hydrates to form
gel particles that are hydrolysed by specific endo-hydrolases,
releasing soluble dye-labeled fragments.
[0702] Acetate buffer (0.05 M, pH 5.0) is prepared as follows: 4.1
g of anhydrous sodium acetate or 6.8 g of sodium acetate *3H.sub.2O
is dissolved in distilled water so that the final volume of the
solution is 1000 ml (Solution A). In a separate flask, 3.0 g (2.86
ml) of glacial acetic acid is mixed with distilled water to yield a
total volume of 1000 ml (Solution B). The final 0.05 M acetate
buffer, pH 5.0, is prepared by mixing Solution A with Solution B
until the pH of the resulting solution is equal to 5.0.
[0703] AZCL-arabinan (debranched) from Megazyme (Bray, Ireland,
Cat. #I-AZDAR) is used as the assay substrate. 50 mg of
AZCL-arabinan is dissolved in 10 ml of 0.05 M acetate buffer, pH
5.0, using a magnetic stirrer. 96% Ethanol is used to terminate the
enzymatic reaction.
[0704] For the enzyme sample, 0.2 ml of 5 mg/ml AZCL-arabinan stock
solution is preheated to 40.degree. C. for 1.0 minutes. This
preheated stock solution is mixed with 200 .mu.l of the enzyme
sample (preheated at 40.degree. C. for 10 minutes) and incubated at
40.degree. C. for 10 minutes. After exactly 10 minutes of
incubation, 1.0 ml of 96% ethanol is added and the absorbance at
590 nm (A.sub.590) is then measured as A.sub.S.
[0705] For the substrate blank, 1.2 ml of 5 mg/ml AZCL-arabinan
stock solution is preheated at 40.degree. C. for 10 minutes. This
preheated stock solution is mixed with 200 .mu.l of 0.05 M acetate
buffer pH 5.0 (preheated at 40.degree. C. for 10 minutes) and
incubated at 40.degree. C. for 10 minutes. After exactly 10 minutes
of incubation, 1.0 ml of 96% ethanol is added and the absorbance at
590 nm (A.sub.590) is then measured as A.sub.SB.
[0706] Endo-arabinase activity is determined using a standard curve
produced from an endo-arabinase with known activity towards
AZCL-arabinan. Activity (IU/ml)=.DELTA.A.sub.595*DF, where
.DELTA.A.sub.595=A.sub.S-A.sub.SB and DF is the enzyme dilution
factor.
[0707] Using the above assay, the .DELTA.A.sub.595 was found to be
1.53 with a DF of 1, indicating that Abn1 exhibits endo-arabinase
activity towards AZCL-arabinan.
Example 24
[0708] The following example illustrates an assay used to measure
.beta.-xylosidase enzymatic activity.
[0709] This assay measures the release of p-nitrophenol by the
action of a .beta.-xylosidase on p-nitrophenyl
.beta.-D-xylopyranoside (PNPX). One .beta.-xylosidase unit of
activity is the amount of enzyme that liberates 1 micromole of
p-nitrophenol in one minute at 37.degree. C. and pH 5.0.
[0710] Acetate buffer (0.1 M, pH 5.0) is prepared as follows: 8.2 g
of anhydrous sodium acetate or 13.6 g of sodium acetate *3H.sub.2O
is dissolved in distilled water so that the final volume of the
solution is 1000 ml (Solution A). In a separate flask. 6.0 g (5.72
ml) of glacial acetic acid is mixed with distilled water to make
the total volume of 1000 ml (Solution B). The final 0.1 M acetate
buffer, pH 5.0, is prepared by mixing Solution. A with Solution B
until the pH of the resulting solution is equal to 5.0.
[0711] PNPX (Extrasynthese, France, Cat. #4244) is used as the
assay substrate. 16.5 mg of PNPX is dissolved in 5 ml of distilled
water and 5 ml 0.1 M acetate buffer using a magnetic stirrer to
obtain 2 mM stock solution. The solution is stable for 2 days with
storage at 4.degree. C.
[0712] The stop reagent (0.25 M sodium carbonate solution) is
prepared as follows: 26.5 g of anhydrous sodium carbonate is
dissolved in 800 ml of distilled water, and the solution volume is
adjusted to 1000 ml. This reagent is used to terminate the
enzymatic reaction.
[0713] For the enzyme sample, 0.10 ml of 2 mM PNPX stock solution
is mixed with 0.01 ml of the enzyme sample and incubated at
37.degree. C. for 30 minutes. After exactly 30 minutes of
incubation, 0.1 ml of 0.25 M sodium carbonate solution is added and
the absorbance at 405 nm (A.sub.405) is then measured in microtiter
plates as A.sub.S.
[0714] For the substrate blank, 0.10 ml of 2 mM PNPX stock solution
is mixed with 0.01 ml of 0.05 M acetate buffer, pH 5.0. 0.1 mL of
0.25 M sodium carbonate solution is then added and the absorbance
at 405 nm (A.sub.405) is measured microtiter plates as
A.sub.SB.
[0715] Activity is calculated as follows:
Activity ( I U / ml ) = .DELTA. A 405 * D F * 21 * 1.33 13.700 * 30
##EQU00008##
where .DELTA.A.sub.405=A.sub.S-A.sub.SB, DF is the enzyme dilution
factor, 21 is the dilution of 10 .mu.l enzyme solution in 210 .mu.l
reaction volume, 1.33 is the conversion factor of microtiter plates
to cuvettes, 13.700 is the extinction coefficient 13700 M.sup.-1
cm.sup.-1 of p-nitrophenol released corrected for mol/l to
.mu.mol/ml, and 30 minutes is the reaction time.
[0716] Using the above assay. Bxl2 was found to exhibit a
.beta.-xylosidase activity of 0.09 IU/ml (.DELTA.A.sub.405=1.36,
DF=1).
Example 25
[0717] The following example illustrates an assay used to measure
the saccharification ability of enzymes of the present
invention.
[0718] The hydrolysis of 5 g/l Avicel catalyzed by purified
cellulases was performed for 120 hours at 40.degree. C. and pH 5.0.
Avicel substrate (10 mg) was mixed with 1.0 ml of 0.1 M Na-acetate
buffer, pH 5.0, containing 1 mM of sodium azide to prevent
microbial growth, in a 2 ml vial. The vial was incubated in a
thermostatic water bath equipped with a magnetic stirrer. The
enzyme solution (1 ml), diluted as needed to reach a particular
concentration, was added and the mixture was stirred. Equal protein
concentrations (0.1 mg/ml) were used for all enzyme preparations.
To convert cellooligosaccharides, which are soluble products of
cellulose hydrolysis catalyzed by endoglucanases and
cellobiohydrolases, into a final hydrolysis product (glucose) and
thus simplify the analysis of hydrolyzate, highly purified
beta-glucosidase from A. japonicus (0.2 U/ml; see below) was added
to the reaction mixture. Aliquots of the suspension (0.1 ml) were
collected from the reaction mixture and centrifuged after fixed
time intervals. Glucose concentration in the supernatant was
determined by the glucose oxidase-peroxidase method (see Example
6).
[0719] Beta-glucosidase was purified from A. japonicus culture
filtrates using anion-exchange chromatography on SOURCE Q media (pH
5.5, gradient of NaCl; GE Healthcare, USA) followed by hydrophobic
chromatography on Phenyl SUPEROSE media (GE Healthcare, USA) in a
gradient of ammonium sulfate.
[0720] Table 32 and FIG. 26 show a graph of the saccharification
ability of cellobiohydrolases and endoglucanases of the present
invention. The amount of glucose (g/l) produced after the
hydrolysis of 5 g/l Avicel at pH 5.0, 40.degree. C. for 120 hours
in the presence of 0.1 mg/ml enzyme and 0.2 U/ml beta-glucosidase
is shown.
TABLE-US-00045 TABLE 32 Saccharification Abilities of C1
Cellobiohydrolases and Endoglucanases Enzyme Glucose Yield (g/l) EG
I 1.2 EG II 0.97 EG II (CD) 0.4 EG III 0.42 EG V 0.51 EG VI 0.56
CBH Ia 2.54 CBH Ia (CD) 1.31 CBH Ib 1.25 CBH IIa 1.16 CBH IIb
3.2
[0721] While various embodiments of the present invention have been
described in detail, it is apparent that modifications and
adaptations of those embodiments will occur to those skilled in the
art. It is to be expressly understood, however, that such
modifications and adaptations are within the scope of the present
invention, as set forth in the following claims.
Sequence CWU 1
1
9411836DNAChrysosporium lucknowenseCDS(98)..(506)CDS(574)..(1742)
1atataaattg ggtgattccc agctcttgat gggcgtgtct tctgcctggc agccctcgtc
60ttcagatcaa gcaactgtgt gctgatcctc ttccgcc atg tac gcc aag ttc gcg
115 Met Tyr Ala Lys Phe Ala 1 5acc ctc gcc gcc ctt gtg gct ggc gcc
gct gct cag aac gcc tgc act 163Thr Leu Ala Ala Leu Val Ala Gly Ala
Ala Ala Gln Asn Ala Cys Thr 10 15 20ctg acc gct gag aac cac ccc tcg
ctg acg tgg tcc aag tgc acg tct 211Leu Thr Ala Glu Asn His Pro Ser
Leu Thr Trp Ser Lys Cys Thr Ser 25 30 35ggc ggc agc tgc acc agc gtc
cag ggt tcc atc acc atc gac gcc aac 259Gly Gly Ser Cys Thr Ser Val
Gln Gly Ser Ile Thr Ile Asp Ala Asn 40 45 50tgg cgg tgg act cac cgg
acc gat agc gcc acc aac tgc tac gag ggc 307Trp Arg Trp Thr His Arg
Thr Asp Ser Ala Thr Asn Cys Tyr Glu Gly55 60 65 70aac aag tgg gat
act tcg tac tgc agc gat ggt cct tct tgc gcc tcc 355Asn Lys Trp Asp
Thr Ser Tyr Cys Ser Asp Gly Pro Ser Cys Ala Ser 75 80 85aag tgc tgc
atc gac ggc gct gac tac tcg agc acc tat ggc atc acc 403Lys Cys Cys
Ile Asp Gly Ala Asp Tyr Ser Ser Thr Tyr Gly Ile Thr 90 95 100acg
agc ggt aac tcc ctg aac ctc aag ttc gtc acc aag ggc cag tac 451Thr
Ser Gly Asn Ser Leu Asn Leu Lys Phe Val Thr Lys Gly Gln Tyr 105 110
115tcg acc aac atc ggc tcg cgt acc tac ctg atg gag agc gac acc aag
499Ser Thr Asn Ile Gly Ser Arg Thr Tyr Leu Met Glu Ser Asp Thr Lys
120 125 130tac cag a gtaagttcct ctcgcacccg gccgccggga gatgatggcg
cccagcccgc 556Tyr Gln135tgacgcgaat gacacag tg ttc cag ctc ctc ggc
aac gag ttc acc ttc 605 Met Phe Gln Leu Leu Gly Asn Glu Phe Thr Phe
140 145gat gtc gac gtc tcc aac ctc ggc tgc ggc ctc aat ggc gcc ctc
tac 653Asp Val Asp Val Ser Asn Leu Gly Cys Gly Leu Asn Gly Ala Leu
Tyr 150 155 160ttc gtg tcc atg gat gcc gat ggt ggc atg tcc aag tac
tcg ggc aac 701Phe Val Ser Met Asp Ala Asp Gly Gly Met Ser Lys Tyr
Ser Gly Asn 165 170 175aag gca ggt gcc aag tac ggt acc ggc tac tgt
gat tct cag tgc ccc 749Lys Ala Gly Ala Lys Tyr Gly Thr Gly Tyr Cys
Asp Ser Gln Cys Pro180 185 190 195cgc gac ctc aag ttc atc aac ggc
gag gcc aac gta gag aac tgg cag 797Arg Asp Leu Lys Phe Ile Asn Gly
Glu Ala Asn Val Glu Asn Trp Gln 200 205 210agc tcg acc aac gat gcc
aac gcc ggc acg ggc aag tac ggc agc tgc 845Ser Ser Thr Asn Asp Ala
Asn Ala Gly Thr Gly Lys Tyr Gly Ser Cys 215 220 225tgc tcc gag atg
gac gtc tgg gag gcc aac aac atg gcc gcc gcc ttc 893Cys Ser Glu Met
Asp Val Trp Glu Ala Asn Asn Met Ala Ala Ala Phe 230 235 240act ccc
cac cct tgc acc gtg atc ggc cag tcg cgc tgc gag ggc gac 941Thr Pro
His Pro Cys Thr Val Ile Gly Gln Ser Arg Cys Glu Gly Asp 245 250
255tcg tgc ggc ggt acc tac agc acc gac cgc tat gcc ggc atc tgc gac
989Ser Cys Gly Gly Thr Tyr Ser Thr Asp Arg Tyr Ala Gly Ile Cys
Asp260 265 270 275ccc gac gga tgc gac ttc aac tcg tac cgc cag ggc
aac aag acc ttc 1037Pro Asp Gly Cys Asp Phe Asn Ser Tyr Arg Gln Gly
Asn Lys Thr Phe 280 285 290tac ggc aag ggc atg acg gtc gac acg acc
aag aag atc acg gtc gtc 1085Tyr Gly Lys Gly Met Thr Val Asp Thr Thr
Lys Lys Ile Thr Val Val 295 300 305acc cag ttc ctc aag aac tcg gcc
ggc gag ctc tcc gag atc aag cgg 1133Thr Gln Phe Leu Lys Asn Ser Ala
Gly Glu Leu Ser Glu Ile Lys Arg 310 315 320ttc tac gtc cag aac ggc
aag gtc atc ccc aac tcc gag tcc acc atc 1181Phe Tyr Val Gln Asn Gly
Lys Val Ile Pro Asn Ser Glu Ser Thr Ile 325 330 335ccg ggc gtc gag
ggc aac tcc atc acc cag gac tgg tgc gac cgc cag 1229Pro Gly Val Glu
Gly Asn Ser Ile Thr Gln Asp Trp Cys Asp Arg Gln340 345 350 355aag
gcc gcc ttc ggc gac gtg acc gac ttc cag gac aag ggc ggc atg 1277Lys
Ala Ala Phe Gly Asp Val Thr Asp Phe Gln Asp Lys Gly Gly Met 360 365
370gtc cag atg ggc aag gcc ctc gcg ggg ccc atg gtc ctc gtc atg tcc
1325Val Gln Met Gly Lys Ala Leu Ala Gly Pro Met Val Leu Val Met Ser
375 380 385atc tgg gac gac cac gcc gtc aac atg ctc tgg ctc gac tcc
acc tgg 1373Ile Trp Asp Asp His Ala Val Asn Met Leu Trp Leu Asp Ser
Thr Trp 390 395 400ccc atc gac ggc gcc ggc aag ccg ggc gcc gag cgc
ggt gcc tgc ccc 1421Pro Ile Asp Gly Ala Gly Lys Pro Gly Ala Glu Arg
Gly Ala Cys Pro 405 410 415acc acc tcg ggc gtc ccc gct gag gtc gag
gcc gag gcc ccc aac tcc 1469Thr Thr Ser Gly Val Pro Ala Glu Val Glu
Ala Glu Ala Pro Asn Ser420 425 430 435aac gtc atc ttc tcc aac atc
cgc ttc ggc ccc atc ggc tcc acc gtc 1517Asn Val Ile Phe Ser Asn Ile
Arg Phe Gly Pro Ile Gly Ser Thr Val 440 445 450tcc ggc ctg ccc gac
ggc ggc agc ggc aac ccc aac ccg ccc gtc agc 1565Ser Gly Leu Pro Asp
Gly Gly Ser Gly Asn Pro Asn Pro Pro Val Ser 455 460 465tcg tcc acc
ccg gtc ccc tcc tcg tcc acc aca tcc tcc ggt tcc tcc 1613Ser Ser Thr
Pro Val Pro Ser Ser Ser Thr Thr Ser Ser Gly Ser Ser 470 475 480ggc
ccg act ggc ggc acg ggt gtc gct aag cac tat gag caa tgc gga 1661Gly
Pro Thr Gly Gly Thr Gly Val Ala Lys His Tyr Glu Gln Cys Gly 485 490
495gga atc ggg ttc act ggc cct acc cag tgc gag agc ccc tac act tgc
1709Gly Ile Gly Phe Thr Gly Pro Thr Gln Cys Glu Ser Pro Tyr Thr
Cys500 505 510 515acc aaa ctg aat gac tgg tac tcg cag tgc ctg
taaacgaacc tctctgaagg 1762Thr Lys Leu Asn Asp Trp Tyr Ser Gln Cys
Leu 520 525aggttctgag acacgcgcga ttcttctgta tatagtttta tttttcactc
tggagtgctt 1822cgctccacca gtac 18362526PRTChrysosporium lucknowense
2Met Tyr Ala Lys Phe Ala Thr Leu Ala Ala Leu Val Ala Gly Ala Ala1 5
10 15Ala Gln Asn Ala Cys Thr Leu Thr Ala Glu Asn His Pro Ser Leu
Thr 20 25 30Trp Ser Lys Cys Thr Ser Gly Gly Ser Cys Thr Ser Val Gln
Gly Ser 35 40 45Ile Thr Ile Asp Ala Asn Trp Arg Trp Thr His Arg Thr
Asp Ser Ala 50 55 60Thr Asn Cys Tyr Glu Gly Asn Lys Trp Asp Thr Ser
Tyr Cys Ser Asp65 70 75 80Gly Pro Ser Cys Ala Ser Lys Cys Cys Ile
Asp Gly Ala Asp Tyr Ser 85 90 95Ser Thr Tyr Gly Ile Thr Thr Ser Gly
Asn Ser Leu Asn Leu Lys Phe 100 105 110Val Thr Lys Gly Gln Tyr Ser
Thr Asn Ile Gly Ser Arg Thr Tyr Leu 115 120 125Met Glu Ser Asp Thr
Lys Tyr Gln Met Phe Gln Leu Leu Gly Asn Glu 130 135 140Phe Thr Phe
Asp Val Asp Val Ser Asn Leu Gly Cys Gly Leu Asn Gly145 150 155
160Ala Leu Tyr Phe Val Ser Met Asp Ala Asp Gly Gly Met Ser Lys Tyr
165 170 175Ser Gly Asn Lys Ala Gly Ala Lys Tyr Gly Thr Gly Tyr Cys
Asp Ser 180 185 190Gln Cys Pro Arg Asp Leu Lys Phe Ile Asn Gly Glu
Ala Asn Val Glu 195 200 205Asn Trp Gln Ser Ser Thr Asn Asp Ala Asn
Ala Gly Thr Gly Lys Tyr 210 215 220Gly Ser Cys Cys Ser Glu Met Asp
Val Trp Glu Ala Asn Asn Met Ala225 230 235 240Ala Ala Phe Thr Pro
His Pro Cys Thr Val Ile Gly Gln Ser Arg Cys 245 250 255Glu Gly Asp
Ser Cys Gly Gly Thr Tyr Ser Thr Asp Arg Tyr Ala Gly 260 265 270Ile
Cys Asp Pro Asp Gly Cys Asp Phe Asn Ser Tyr Arg Gln Gly Asn 275 280
285Lys Thr Phe Tyr Gly Lys Gly Met Thr Val Asp Thr Thr Lys Lys Ile
290 295 300Thr Val Val Thr Gln Phe Leu Lys Asn Ser Ala Gly Glu Leu
Ser Glu305 310 315 320Ile Lys Arg Phe Tyr Val Gln Asn Gly Lys Val
Ile Pro Asn Ser Glu 325 330 335Ser Thr Ile Pro Gly Val Glu Gly Asn
Ser Ile Thr Gln Asp Trp Cys 340 345 350Asp Arg Gln Lys Ala Ala Phe
Gly Asp Val Thr Asp Phe Gln Asp Lys 355 360 365Gly Gly Met Val Gln
Met Gly Lys Ala Leu Ala Gly Pro Met Val Leu 370 375 380Val Met Ser
Ile Trp Asp Asp His Ala Val Asn Met Leu Trp Leu Asp385 390 395
400Ser Thr Trp Pro Ile Asp Gly Ala Gly Lys Pro Gly Ala Glu Arg Gly
405 410 415Ala Cys Pro Thr Thr Ser Gly Val Pro Ala Glu Val Glu Ala
Glu Ala 420 425 430Pro Asn Ser Asn Val Ile Phe Ser Asn Ile Arg Phe
Gly Pro Ile Gly 435 440 445Ser Thr Val Ser Gly Leu Pro Asp Gly Gly
Ser Gly Asn Pro Asn Pro 450 455 460Pro Val Ser Ser Ser Thr Pro Val
Pro Ser Ser Ser Thr Thr Ser Ser465 470 475 480Gly Ser Ser Gly Pro
Thr Gly Gly Thr Gly Val Ala Lys His Tyr Glu 485 490 495Gln Cys Gly
Gly Ile Gly Phe Thr Gly Pro Thr Gln Cys Glu Ser Pro 500 505 510Tyr
Thr Cys Thr Lys Leu Asn Asp Trp Tyr Ser Gln Cys Leu 515 520
52531578DNAChrysosporium lucknowense 3atgtacgcca agttcgcgac
cctcgccgcc cttgtggctg gcgccgctgc tcagaacgcc 60tgcactctga ccgctgagaa
ccacccctcg ctgacgtggt ccaagtgcac gtctggcggc 120agctgcacca
gcgtccaggg ttccatcacc atcgacgcca actggcggtg gactcaccgg
180accgatagcg ccaccaactg ctacgagggc aacaagtggg atacttcgta
ctgcagcgat 240ggtccttctt gcgcctccaa gtgctgcatc gacggcgctg
actactcgag cacctatggc 300atcaccacga gcggtaactc cctgaacctc
aagttcgtca ccaagggcca gtactcgacc 360aacatcggct cgcgtaccta
cctgatggag agcgacacca agtaccagat gttccagctc 420ctcggcaacg
agttcacctt cgatgtcgac gtctccaacc tcggctgcgg cctcaatggc
480gccctctact tcgtgtccat ggatgccgat ggtggcatgt ccaagtactc
gggcaacaag 540gcaggtgcca agtacggtac cggctactgt gattctcagt
gcccccgcga cctcaagttc 600atcaacggcg aggccaacgt agagaactgg
cagagctcga ccaacgatgc caacgccggc 660acgggcaagt acggcagctg
ctgctccgag atggacgtct gggaggccaa caacatggcc 720gccgccttca
ctccccaccc ttgcaccgtg atcggccagt cgcgctgcga gggcgactcg
780tgcggcggta cctacagcac cgaccgctat gccggcatct gcgaccccga
cggatgcgac 840ttcaactcgt accgccaggg caacaagacc ttctacggca
agggcatgac ggtcgacacg 900accaagaaga tcacggtcgt cacccagttc
ctcaagaact cggccggcga gctctccgag 960atcaagcggt tctacgtcca
gaacggcaag gtcatcccca actccgagtc caccatcccg 1020ggcgtcgagg
gcaactccat cacccaggac tggtgcgacc gccagaaggc cgccttcggc
1080gacgtgaccg acttccagga caagggcggc atggtccaga tgggcaaggc
cctcgcgggg 1140cccatggtcc tcgtcatgtc catctgggac gaccacgccg
tcaacatgct ctggctcgac 1200tccacctggc ccatcgacgg cgccggcaag
ccgggcgccg agcgcggtgc ctgccccacc 1260acctcgggcg tccccgctga
ggtcgaggcc gaggccccca actccaacgt catcttctcc 1320aacatccgct
tcggccccat cggctccacc gtctccggcc tgcccgacgg cggcagcggc
1380aaccccaacc cgcccgtcag ctcgtccacc ccggtcccct cctcgtccac
cacatcctcc 1440ggttcctccg gcccgactgg cggcacgggt gtcgctaagc
actatgagca atgcggagga 1500atcgggttca ctggccctac ccagtgcgag
agcccctaca cttgcaccaa actgaatgac 1560tggtactcgc agtgcctg
157842029DNAChrysosporium
lucknowenseCDS(108)..(674)CDS(762)..(898)CDS(974)..(1186)CDS(1284)..(1530-
)CDS(1638)..(1768)CDS(1836)..(1890) 4atataaaaag gcaaatgagg
cggcgccttg gacaggtcca ttctcccacc gctcaaccag 60cctccaattc ctcagaagtc
tgttgctctc tcgcagtcgc agtcaag atg aag cag 116 Met Lys Gln 1tac ctc
cag tac ctc gcg gcg acc ctg ccc ctg gtg ggc ctg gcc acg 164Tyr Leu
Gln Tyr Leu Ala Ala Thr Leu Pro Leu Val Gly Leu Ala Thr 5 10 15gcc
cag cag gcg ggt aac ctg cag acc gag act cac ccc aag ctc act 212Ala
Gln Gln Ala Gly Asn Leu Gln Thr Glu Thr His Pro Lys Leu Thr20 25 30
35tgg tcg aag tgc acg gcc ccg gga tcc tgc caa cag gtc aac ggc gag
260Trp Ser Lys Cys Thr Ala Pro Gly Ser Cys Gln Gln Val Asn Gly Glu
40 45 50gtc gtc atc gac tcc aac tgg cgc tgg gtg cac gac gag aac gcg
cag 308Val Val Ile Asp Ser Asn Trp Arg Trp Val His Asp Glu Asn Ala
Gln 55 60 65aac tgc tac gac ggc aac cag tgg acc aac gct tgc agc tct
gcc acc 356Asn Cys Tyr Asp Gly Asn Gln Trp Thr Asn Ala Cys Ser Ser
Ala Thr 70 75 80gac tgc gcc gag aat tgc gcg ctc gag ggt gcc gac tac
cag ggc acc 404Asp Cys Ala Glu Asn Cys Ala Leu Glu Gly Ala Asp Tyr
Gln Gly Thr 85 90 95tat ggc gcc tcg acc agc ggc aat gcc ctg acg ctc
acc ttc gtc act 452Tyr Gly Ala Ser Thr Ser Gly Asn Ala Leu Thr Leu
Thr Phe Val Thr100 105 110 115aag cac gag tac ggc acc aac att ggt
tcg cgc ctc tac ctc atg aac 500Lys His Glu Tyr Gly Thr Asn Ile Gly
Ser Arg Leu Tyr Leu Met Asn 120 125 130ggc gcg aac aag tac cag atg
ttc acc ctc aag ggc aac gag ctg gcc 548Gly Ala Asn Lys Tyr Gln Met
Phe Thr Leu Lys Gly Asn Glu Leu Ala 135 140 145ttc gac gtc gac ctc
tcg gcc gtc gag tgc ggc ctc aac agc gcc ctc 596Phe Asp Val Asp Leu
Ser Ala Val Glu Cys Gly Leu Asn Ser Ala Leu 150 155 160tac ttc gtg
gcc atg gag gag gat ggc ggt gtg tcg agc tac ccg acc 644Tyr Phe Val
Ala Met Glu Glu Asp Gly Gly Val Ser Ser Tyr Pro Thr 165 170 175aac
acg gcc ggt gct aag ttc ggc act ggg gtaagttcaa cgacccgaga 694Asn
Thr Ala Gly Ala Lys Phe Gly Thr Gly180 185cgggtgccct tattatctgc
tgcgaaaacg gacggtcccc ttttgctaac taccctcctc 754caaacag tac tgc gac
gcc caa tgc gca cgc gac ctc aag ttc gtc ggc 803 Tyr Cys Asp Ala Gln
Cys Ala Arg Asp Leu Lys Phe Val Gly 190 195 200ggc aag ggc aac atc
gag ggc tgg aag ccg tcc acc aac gat gcc aat 851Gly Lys Gly Asn Ile
Glu Gly Trp Lys Pro Ser Thr Asn Asp Ala Asn 205 210 215gcc ggt gtc
ggt cct tat ggc ggg tgc tgc gct gag atc gac gtc tg 898Ala Gly Val
Gly Pro Tyr Gly Gly Cys Cys Ala Glu Ile Asp Val Trp220 225 230
235gtaagttttg ttgcctgggc agcaatggta tattagctcg agtggttccc
gtcgttgctg 958accctctctt accag g gag tcg aac aag tat gct ttc gct
ttc acc ccg 1007 Glu Ser Asn Lys Tyr Ala Phe Ala Phe Thr Pro 240
245cac ggt tgc gag aac cct aaa tac cac gtc tgc gag acc acc aac tgc
1055His Gly Cys Glu Asn Pro Lys Tyr His Val Cys Glu Thr Thr Asn Cys
250 255 260ggt ggc acc tac tcc gag gac cgc ttc gct ggt gac tgc gat
gcc aac 1103Gly Gly Thr Tyr Ser Glu Asp Arg Phe Ala Gly Asp Cys Asp
Ala Asn 265 270 275ggc tgc gac tac aac ccc tac cgc atg ggc aac cag
gac ttc tac ggt 1151Gly Cys Asp Tyr Asn Pro Tyr Arg Met Gly Asn Gln
Asp Phe Tyr Gly 280 285 290ccc ggc ttg acg gtc gat acc agc aag aag
ttc ac gtgagtacac 1196Pro Gly Leu Thr Val Asp Thr Ser Lys Lys Phe
Thr295 300 305cgtgcttgaa gccccctccc cccccccccc caaaaaaaaa
aagaaaaaag aagtcaaatg 1256attgatgcta accaaatcaa ataacag c gtc gtc
agc cag ttc gag gag aac 1308 Val Val Ser Gln Phe Glu Glu Asn 310aag
ctc acc cag ttc ttc gtc cag gac ggc aag aag att gag atc ccc 1356Lys
Leu Thr Gln Phe Phe Val Gln Asp Gly Lys Lys Ile Glu Ile Pro315 320
325 330ggc ccc aag gtc gag ggc atc gat gcg gac agc gcc gct atc acc
cct 1404Gly Pro Lys Val Glu Gly Ile Asp Ala Asp Ser Ala Ala Ile Thr
Pro 335 340 345gag ctg tgc agt gcc ctg ttc aag gcc ttc gat gac cgt
gac cgc ttc 1452Glu Leu Cys Ser Ala Leu Phe Lys Ala Phe Asp Asp Arg
Asp Arg Phe 350 355 360tcg gag gtt ggc ggc ttc gat gcc atc aac acg
gcc ctc agc act ccc 1500Ser Glu Val Gly Gly Phe Asp Ala Ile Asn Thr
Ala Leu Ser Thr Pro 365 370 375atg gtc ctc gtc atg tcc atc tgg gat
gat gtacgttacc taaccccccc 1550Met Val Leu Val Met Ser Ile Trp Asp
Asp 380 385cccctttttt tttcccgctt ctctccccga aactgccact acttatatac
gtcccgcgtc 1610catgatgctt accttttctc cttccag cac tac gcc aat atg
ctc
tgg ctc gac 1664 His Tyr Ala Asn Met Leu Trp Leu Asp 390 395tcg agc
tac ccc cct gag aag gct ggc cag cct ggc ggt gac cgt ggc 1712Ser Ser
Tyr Pro Pro Glu Lys Ala Gly Gln Pro Gly Gly Asp Arg Gly 400 405
410ccg tgt cct cag gac tct ggc gtc ccg gcc gac gtt gag gct cag tac
1760Pro Cys Pro Gln Asp Ser Gly Val Pro Ala Asp Val Glu Ala Gln Tyr
415 420 425cct aat gc gtgagtcgaa accgtaaaat gtcgggcaaa aaaaagatcg
1808Pro Asn Ala430ctcaagctaa cgaaataata tgattag c aag gtc atc tgg
tcc aac atc cgc 1860 Lys Val Ile Trp Ser Asn Ile Arg 435 440ttc ggc
ccc atc ggc tcg act gtc aac gtc taaactgcaa cctgaccggg 1910Phe Gly
Pro Ile Gly Ser Thr Val Asn Val 445 450ccctttctct ccacccccac
ccctctcaag ttctctctgg tggagccctc gtgtccttct 1970tttcctaggt
tcgcgaacct ttgagcttgt gtatcgtagg gtcattgtgt acatacaca
20295450PRTChrysosporium lucknowense 5Met Lys Gln Tyr Leu Gln Tyr
Leu Ala Ala Thr Leu Pro Leu Val Gly1 5 10 15Leu Ala Thr Ala Gln Gln
Ala Gly Asn Leu Gln Thr Glu Thr His Pro 20 25 30Lys Leu Thr Trp Ser
Lys Cys Thr Ala Pro Gly Ser Cys Gln Gln Val 35 40 45Asn Gly Glu Val
Val Ile Asp Ser Asn Trp Arg Trp Val His Asp Glu 50 55 60Asn Ala Gln
Asn Cys Tyr Asp Gly Asn Gln Trp Thr Asn Ala Cys Ser65 70 75 80Ser
Ala Thr Asp Cys Ala Glu Asn Cys Ala Leu Glu Gly Ala Asp Tyr 85 90
95Gln Gly Thr Tyr Gly Ala Ser Thr Ser Gly Asn Ala Leu Thr Leu Thr
100 105 110Phe Val Thr Lys His Glu Tyr Gly Thr Asn Ile Gly Ser Arg
Leu Tyr 115 120 125Leu Met Asn Gly Ala Asn Lys Tyr Gln Met Phe Thr
Leu Lys Gly Asn 130 135 140Glu Leu Ala Phe Asp Val Asp Leu Ser Ala
Val Glu Cys Gly Leu Asn145 150 155 160Ser Ala Leu Tyr Phe Val Ala
Met Glu Glu Asp Gly Gly Val Ser Ser 165 170 175Tyr Pro Thr Asn Thr
Ala Gly Ala Lys Phe Gly Thr Gly Tyr Cys Asp 180 185 190Ala Gln Cys
Ala Arg Asp Leu Lys Phe Val Gly Gly Lys Gly Asn Ile 195 200 205Glu
Gly Trp Lys Pro Ser Thr Asn Asp Ala Asn Ala Gly Val Gly Pro 210 215
220Tyr Gly Gly Cys Cys Ala Glu Ile Asp Val Trp Glu Ser Asn Lys
Tyr225 230 235 240Ala Phe Ala Phe Thr Pro His Gly Cys Glu Asn Pro
Lys Tyr His Val 245 250 255Cys Glu Thr Thr Asn Cys Gly Gly Thr Tyr
Ser Glu Asp Arg Phe Ala 260 265 270Gly Asp Cys Asp Ala Asn Gly Cys
Asp Tyr Asn Pro Tyr Arg Met Gly 275 280 285Asn Gln Asp Phe Tyr Gly
Pro Gly Leu Thr Val Asp Thr Ser Lys Lys 290 295 300Phe Thr Val Val
Ser Gln Phe Glu Glu Asn Lys Leu Thr Gln Phe Phe305 310 315 320Val
Gln Asp Gly Lys Lys Ile Glu Ile Pro Gly Pro Lys Val Glu Gly 325 330
335Ile Asp Ala Asp Ser Ala Ala Ile Thr Pro Glu Leu Cys Ser Ala Leu
340 345 350Phe Lys Ala Phe Asp Asp Arg Asp Arg Phe Ser Glu Val Gly
Gly Phe 355 360 365Asp Ala Ile Asn Thr Ala Leu Ser Thr Pro Met Val
Leu Val Met Ser 370 375 380Ile Trp Asp Asp His Tyr Ala Asn Met Leu
Trp Leu Asp Ser Ser Tyr385 390 395 400Pro Pro Glu Lys Ala Gly Gln
Pro Gly Gly Asp Arg Gly Pro Cys Pro 405 410 415Gln Asp Ser Gly Val
Pro Ala Asp Val Glu Ala Gln Tyr Pro Asn Ala 420 425 430Lys Val Ile
Trp Ser Asn Ile Arg Phe Gly Pro Ile Gly Ser Thr Val 435 440 445Asn
Val 45061350DNAChrysosporium lucknowense 6atgaagcagt acctccagta
cctcgcggcg accctgcccc tggtgggcct ggccacggcc 60cagcaggcgg gtaacctgca
gaccgagact caccccaagc tcacttggtc gaagtgcacg 120gccccgggat
cctgccaaca ggtcaacggc gaggtcgtca tcgactccaa ctggcgctgg
180gtgcacgacg agaacgcgca gaactgctac gacggcaacc agtggaccaa
cgcttgcagc 240tctgccaccg actgcgccga gaattgcgcg ctcgagggtg
ccgactacca gggcacctat 300ggcgcctcga ccagcggcaa tgccctgacg
ctcaccttcg tcactaagca cgagtacggc 360accaacattg gttcgcgcct
ctacctcatg aacggcgcga acaagtacca gatgttcacc 420ctcaagggca
acgagctggc cttcgacgtc gacctctcgg ccgtcgagtg cggcctcaac
480agcgccctct acttcgtggc catggaggag gatggcggtg tgtcgagcta
cccgaccaac 540acggccggtg ctaagttcgg cactgggtac tgcgacgccc
aatgcgcacg cgacctcaag 600ttcgtcggcg gcaagggcaa catcgagggc
tggaagccgt ccaccaacga tgccaatgcc 660ggtgtcggtc cttatggcgg
gtgctgcgct gagatcgacg tctgggagtc gaacaagtat 720gctttcgctt
tcaccccgca cggttgcgag aaccctaaat accacgtctg cgagaccacc
780aactgcggtg gcacctactc cgaggaccgc ttcgctggtg actgcgatgc
caacggctgc 840gactacaacc cctaccgcat gggcaaccag gacttctacg
gtcccggctt gacggtcgat 900accagcaaga agttcaccgt cgtcagccag
ttcgaggaga acaagctcac ccagttcttc 960gtccaggacg gcaagaagat
tgagatcccc ggccccaagg tcgagggcat cgatgcggac 1020agcgccgcta
tcacccctga gctgtgcagt gccctgttca aggccttcga tgaccgtgac
1080cgcttctcgg aggttggcgg cttcgatgcc atcaacacgg ccctcagcac
tcccatggtc 1140ctcgtcatgt ccatctggga tgatcactac gccaatatgc
tctggctcga ctcgagctac 1200ccccctgaga aggctggcca gcctggcggt
gaccgtggcc cgtgtcctca ggactctggc 1260gtcccggccg acgttgaggc
tcagtaccct aatgccaagg tcatctggtc caacatccgc 1320ttcggcccca
tcggctcgac tgtcaacgtc 135071592DNAChrysosporium
lucknowenseCDS(78)..(519)CDS(630)..(954)CDS(1048)..(1465)
7tgcttgggaa gaaaggatct ctcgaccatg caccacagcc tagctctaac ccagcttgtc
60gtgtgttgtt gcccagc atg aag ttc gtg cag tcc gcc acc ctg gcg ttc
110 Met Lys Phe Val Gln Ser Ala Thr Leu Ala Phe 1 5 10gcc gcc acg
gcc ctc gct gcg ccc tcg cgc acg act ccc cag aag ccc 158Ala Ala Thr
Ala Leu Ala Ala Pro Ser Arg Thr Thr Pro Gln Lys Pro 15 20 25cgc cag
gcc tcg gcg ggc tgc gcg tcg gcc gtg acg ctc gat gcc agc 206Arg Gln
Ala Ser Ala Gly Cys Ala Ser Ala Val Thr Leu Asp Ala Ser 30 35 40acc
aac gtg ttc cag cag tac acg ctg cac ccc aac aac ttc tac cgt 254Thr
Asn Val Phe Gln Gln Tyr Thr Leu His Pro Asn Asn Phe Tyr Arg 45 50
55gcc gag gtc gag gct gcc gcc gag gcc atc tcc gac tcg gcg ctg gcc
302Ala Glu Val Glu Ala Ala Ala Glu Ala Ile Ser Asp Ser Ala Leu
Ala60 65 70 75gag aag gcc cgc aag gtc gcc gac gtc ggt acc ttc ctg
tgg ctc gac 350Glu Lys Ala Arg Lys Val Ala Asp Val Gly Thr Phe Leu
Trp Leu Asp 80 85 90acc atc gag aac att ggc cgg ctg gag ccc gcg ctc
gag gac gtg ccc 398Thr Ile Glu Asn Ile Gly Arg Leu Glu Pro Ala Leu
Glu Asp Val Pro 95 100 105tgc gag aac atc gtg ggt ctc gtc atc tac
gac ctc ccg ggc cgt gac 446Cys Glu Asn Ile Val Gly Leu Val Ile Tyr
Asp Leu Pro Gly Arg Asp 110 115 120tgc gcg gcc aag gcc tcc aac ggc
gag ctc aag gtc ggc gag ctc gac 494Cys Ala Ala Lys Ala Ser Asn Gly
Glu Leu Lys Val Gly Glu Leu Asp 125 130 135agg tac aag acc gag tac
atc gac a gtgagttaac cctttgtggc 539Arg Tyr Lys Thr Glu Tyr Ile
Asp140 145cccttctttt cccccgagag agcgtctggt tgagtggggt tgtgagagag
aaaatggggc 599gagcttaaag actgacgtgt tggctcgcag ag atc gcc gag atc
ctc aag gcc 652 Lys Ile Ala Glu Ile Leu Lys Ala 150 155cac tcc aac
acg gcc ttc gcc ctc gtc atc gag ccc gac tcg ctc ccc 700His Ser Asn
Thr Ala Phe Ala Leu Val Ile Glu Pro Asp Ser Leu Pro 160 165 170aac
ctg gtc acc aat agc gac ctg cag acg tgc cag cag agc gct tcc 748Asn
Leu Val Thr Asn Ser Asp Leu Gln Thr Cys Gln Gln Ser Ala Ser 175 180
185ggc tac cgc gag ggt gtc gcc tat gcc ctc aag cag ctc aac ctc ccc
796Gly Tyr Arg Glu Gly Val Ala Tyr Ala Leu Lys Gln Leu Asn Leu Pro
190 195 200aac gtg gtc atg tac atc gat gcc ggc cac ggt ggc tgg ctc
ggc tgg 844Asn Val Val Met Tyr Ile Asp Ala Gly His Gly Gly Trp Leu
Gly Trp 205 210 215gac gcc aac ctc aag ccc ggc gcc cag gag ctc gcc
agc gtc tac aag 892Asp Ala Asn Leu Lys Pro Gly Ala Gln Glu Leu Ala
Ser Val Tyr Lys220 225 230 235tct gct ggt tcg ccc tcg caa gtc cgc
ggt atc tcc acc aac gtg gct 940Ser Ala Gly Ser Pro Ser Gln Val Arg
Gly Ile Ser Thr Asn Val Ala 240 245 250ggt tgg aac gcc tg
gtaagacact ctatgtcccc ctcgtcggtc aatggcgagc 994Gly Trp Asn Ala Trp
255ggaatggcgt gaaatgcatg gtgctgacct ttgatctttt ccccctccta tag g gac
1051 Aspcag gag ccc ggt gag ttc tcg gac gcc tcg gat gcc cag tac aac
aag 1099Gln Glu Pro Gly Glu Phe Ser Asp Ala Ser Asp Ala Gln Tyr Asn
Lys 260 265 270tgc cag aac gag aag atc tac atc aac acc ttt ggc gct
gag ctc aag 1147Cys Gln Asn Glu Lys Ile Tyr Ile Asn Thr Phe Gly Ala
Glu Leu Lys 275 280 285tct gcc ggc atg ccc aac cac gcc atc atc gac
act ggc cgc aac ggt 1195Ser Ala Gly Met Pro Asn His Ala Ile Ile Asp
Thr Gly Arg Asn Gly290 295 300 305gtc acc ggt ctc cgc gac gag tgg
ggt gac tgg tgc aac gtc aac ggc 1243Val Thr Gly Leu Arg Asp Glu Trp
Gly Asp Trp Cys Asn Val Asn Gly 310 315 320gcc ggc ttc ggt gtg cgc
ccg act gcc aac act ggc gac gag ctc gcc 1291Ala Gly Phe Gly Val Arg
Pro Thr Ala Asn Thr Gly Asp Glu Leu Ala 325 330 335gac gcc ttc gtg
tgg gtc aag ccc ggt ggc gag tcc gac ggc acc agc 1339Asp Ala Phe Val
Trp Val Lys Pro Gly Gly Glu Ser Asp Gly Thr Ser 340 345 350gac tcg
tcg gcg gcg cgc tac gac agc ttc tgc ggc aag ccc gac gcc 1387Asp Ser
Ser Ala Ala Arg Tyr Asp Ser Phe Cys Gly Lys Pro Asp Ala 355 360
365ttc aag ccc agc ccc gag gcc ggt acc tgg aac cag gcc tac ttc gag
1435Phe Lys Pro Ser Pro Glu Ala Gly Thr Trp Asn Gln Ala Tyr Phe
Glu370 375 380 385atg ctc ctc aag aac gcc aac ccg tcc ttc
taagctcctc gacggcttct 1485Met Leu Leu Lys Asn Ala Asn Pro Ser Phe
390 395tgctgtcagt cgctctgacg gtggtgtgct ggtggtgccc ctgctcctgc
tgctgctgct 1545ccgcggggag gggaggcaac gaaaatgaag tcctgcttca aaacaaa
15928395PRTChrysosporium lucknowense 8Met Lys Phe Val Gln Ser Ala
Thr Leu Ala Phe Ala Ala Thr Ala Leu1 5 10 15Ala Ala Pro Ser Arg Thr
Thr Pro Gln Lys Pro Arg Gln Ala Ser Ala 20 25 30Gly Cys Ala Ser Ala
Val Thr Leu Asp Ala Ser Thr Asn Val Phe Gln 35 40 45Gln Tyr Thr Leu
His Pro Asn Asn Phe Tyr Arg Ala Glu Val Glu Ala 50 55 60Ala Ala Glu
Ala Ile Ser Asp Ser Ala Leu Ala Glu Lys Ala Arg Lys65 70 75 80Val
Ala Asp Val Gly Thr Phe Leu Trp Leu Asp Thr Ile Glu Asn Ile 85 90
95Gly Arg Leu Glu Pro Ala Leu Glu Asp Val Pro Cys Glu Asn Ile Val
100 105 110Gly Leu Val Ile Tyr Asp Leu Pro Gly Arg Asp Cys Ala Ala
Lys Ala 115 120 125Ser Asn Gly Glu Leu Lys Val Gly Glu Leu Asp Arg
Tyr Lys Thr Glu 130 135 140Tyr Ile Asp Lys Ile Ala Glu Ile Leu Lys
Ala His Ser Asn Thr Ala145 150 155 160Phe Ala Leu Val Ile Glu Pro
Asp Ser Leu Pro Asn Leu Val Thr Asn 165 170 175Ser Asp Leu Gln Thr
Cys Gln Gln Ser Ala Ser Gly Tyr Arg Glu Gly 180 185 190Val Ala Tyr
Ala Leu Lys Gln Leu Asn Leu Pro Asn Val Val Met Tyr 195 200 205Ile
Asp Ala Gly His Gly Gly Trp Leu Gly Trp Asp Ala Asn Leu Lys 210 215
220Pro Gly Ala Gln Glu Leu Ala Ser Val Tyr Lys Ser Ala Gly Ser
Pro225 230 235 240Ser Gln Val Arg Gly Ile Ser Thr Asn Val Ala Gly
Trp Asn Ala Trp 245 250 255Asp Gln Glu Pro Gly Glu Phe Ser Asp Ala
Ser Asp Ala Gln Tyr Asn 260 265 270Lys Cys Gln Asn Glu Lys Ile Tyr
Ile Asn Thr Phe Gly Ala Glu Leu 275 280 285Lys Ser Ala Gly Met Pro
Asn His Ala Ile Ile Asp Thr Gly Arg Asn 290 295 300Gly Val Thr Gly
Leu Arg Asp Glu Trp Gly Asp Trp Cys Asn Val Asn305 310 315 320Gly
Ala Gly Phe Gly Val Arg Pro Thr Ala Asn Thr Gly Asp Glu Leu 325 330
335Ala Asp Ala Phe Val Trp Val Lys Pro Gly Gly Glu Ser Asp Gly Thr
340 345 350Ser Asp Ser Ser Ala Ala Arg Tyr Asp Ser Phe Cys Gly Lys
Pro Asp 355 360 365Ala Phe Lys Pro Ser Pro Glu Ala Gly Thr Trp Asn
Gln Ala Tyr Phe 370 375 380Glu Met Leu Leu Lys Asn Ala Asn Pro Ser
Phe385 390 39591185DNAChrysosporium lucknowense 9atgaagttcg
tgcagtccgc caccctggcg ttcgccgcca cggccctcgc tgcgccctcg 60cgcacgactc
cccagaagcc ccgccaggcc tcggcgggct gcgcgtcggc cgtgacgctc
120gatgccagca ccaacgtgtt ccagcagtac acgctgcacc ccaacaactt
ctaccgtgcc 180gaggtcgagg ctgccgccga ggccatctcc gactcggcgc
tggccgagaa ggcccgcaag 240gtcgccgacg tcggtacctt cctgtggctc
gacaccatcg agaacattgg ccggctggag 300cccgcgctcg aggacgtgcc
ctgcgagaac atcgtgggtc tcgtcatcta cgacctcccg 360ggccgtgact
gcgcggccaa ggcctccaac ggcgagctca aggtcggcga gctcgacagg
420tacaagaccg agtacatcga caagatcgcc gagatcctca aggcccactc
caacacggcc 480ttcgccctcg tcatcgagcc cgactcgctc cccaacctgg
tcaccaatag cgacctgcag 540acgtgccagc agagcgcttc cggctaccgc
gagggtgtcg cctatgccct caagcagctc 600aacctcccca acgtggtcat
gtacatcgat gccggccacg gtggctggct cggctgggac 660gccaacctca
agcccggcgc ccaggagctc gccagcgtct acaagtctgc tggttcgccc
720tcgcaagtcc gcggtatctc caccaacgtg gctggttgga acgcctggga
ccaggagccc 780ggtgagttct cggacgcctc ggatgcccag tacaacaagt
gccagaacga gaagatctac 840atcaacacct ttggcgctga gctcaagtct
gccggcatgc ccaaccacgc catcatcgac 900actggccgca acggtgtcac
cggtctccgc gacgagtggg gtgactggtg caacgtcaac 960ggcgccggct
tcggtgtgcg cccgactgcc aacactggcg acgagctcgc cgacgccttc
1020gtgtgggtca agcccggtgg cgagtccgac ggcaccagcg actcgtcggc
ggcgcgctac 1080gacagcttct gcggcaagcc cgacgccttc aagcccagcc
ccgaggccgg tacctggaac 1140caggcctact tcgagatgct cctcaagaac
gccaacccgt ccttc 1185102115DNAChrysosporium
lucknowenseCDS(90)..(181)CDS(278)..(786)CDS(871)..(1455)CDS(1629)..(1888)
10tgtgtggttt ctctccctcg tgccaaacca ctcccacctc ccgccccgag atagttgctt
60gtttcgctcc gtgagaggga cacacacca atg gcc aag aag ctt ttc atc acc
113 Met Ala Lys Lys Leu Phe Ile Thr 1 5gcc gcg ctt gcg gct gcc gtg
ttg gcg gcc ccc gtc att gag gag cgc 161Ala Ala Leu Ala Ala Ala Val
Leu Ala Ala Pro Val Ile Glu Glu Arg 10 15 20cag aac tgc ggc gct gtg
tg gtaagaaagc ccggtccgag tctcccatga 211Gln Asn Cys Gly Ala Val
Trp25 30ttttctcgtc gagtaatggc ataagggcca ccccttcgac tgaccgtgag
aatcgatcaa 271atccag g act caa tgc ggc ggt aac ggg tgg caa ggt ccc
aca tgc tgc 320 Thr Gln Cys Gly Gly Asn Gly Trp Gln Gly Pro Thr Cys
Cys 35 40 45gcc tcg ggc tcg acc tgc gtt gcg cag aac gag tgg tac tct
cag tgc 368Ala Ser Gly Ser Thr Cys Val Ala Gln Asn Glu Trp Tyr Ser
Gln Cys 50 55 60ctg ccc aac agc cag gtg acg agt tcc acc act ccg tcg
tcg act tcc 416Leu Pro Asn Ser Gln Val Thr Ser Ser Thr Thr Pro Ser
Ser Thr Ser 65 70 75acc tcg cag cgc agc acc agc acc tcc agc agc acc
acc agg agc ggc 464Thr Ser Gln Arg Ser Thr Ser Thr Ser Ser Ser Thr
Thr Arg Ser Gly 80 85 90agc tcc tcc tcc tcc tcc acc acg ccc ccg ccc
gtc tcc agc ccc gtg 512Ser Ser Ser Ser Ser Ser Thr Thr Pro Pro Pro
Val Ser Ser Pro Val 95 100 105acc agc att ccc ggc ggt gcg acc tcc
acg gcg agc tac tct ggc aac 560Thr Ser Ile Pro Gly Gly Ala Thr Ser
Thr Ala Ser Tyr Ser Gly Asn110 115 120 125ccc ttc tcg ggc gtc cgg
ctc ttc gcc aac gac tac tac agg tcc gag 608Pro Phe Ser Gly Val Arg
Leu Phe Ala Asn Asp Tyr Tyr Arg Ser Glu 130 135 140gtc cac aat ctc
gcc att cct agc atg act ggt act ctg gcg gcc aag
656Val His Asn Leu Ala Ile Pro Ser Met Thr Gly Thr Leu Ala Ala Lys
145 150 155gct tcc gcc gtc gcc gaa gtc cct agc ttc cag tgg ctc gac
cgg aac 704Ala Ser Ala Val Ala Glu Val Pro Ser Phe Gln Trp Leu Asp
Arg Asn 160 165 170gtc acc atc gac acc ctg atg gtc cag act ctg tcc
cag gtc cgg gct 752Val Thr Ile Asp Thr Leu Met Val Gln Thr Leu Ser
Gln Val Arg Ala 175 180 185ctc aat aag gcc ggt gcc aat cct ccc tat
gct g gtgagttaca 796Leu Asn Lys Ala Gly Ala Asn Pro Pro Tyr Ala190
195 200tggcgacttg ccttctcgtc ccctaccttt cttgacggga tcggttacct
gacctggagg 856caaaacaaca acag cc caa ctc gtc gtc tac gac ctc ccc
gac cgt gac 905 Ala Gln Leu Val Val Tyr Asp Leu Pro Asp Arg Asp 205
210tgt gcc gcc gct gcg tcc aac ggc gag ttt tcg att gca aac ggc ggc
953Cys Ala Ala Ala Ala Ser Asn Gly Glu Phe Ser Ile Ala Asn Gly Gly
215 220 225gcc gcc aac tac agg agc tac atc gac gct atc cgc aag cac
atc att 1001Ala Ala Asn Tyr Arg Ser Tyr Ile Asp Ala Ile Arg Lys His
Ile Ile 230 235 240gag tac tcg gac atc cgg atc atc ctg gtt atc gag
ccc gac tcg atg 1049Glu Tyr Ser Asp Ile Arg Ile Ile Leu Val Ile Glu
Pro Asp Ser Met245 250 255 260gcc aac atg gtg acc aac atg aac gtg
gcc aag tgc agc aac gcc gcg 1097Ala Asn Met Val Thr Asn Met Asn Val
Ala Lys Cys Ser Asn Ala Ala 265 270 275tcg acg tac cac gag ttg acc
gtg tac gcg ctc aag cag ctg aac ctg 1145Ser Thr Tyr His Glu Leu Thr
Val Tyr Ala Leu Lys Gln Leu Asn Leu 280 285 290ccc aac gtc gcc atg
tat ctc gac gcc ggc cac gcc ggc tgg ctc ggc 1193Pro Asn Val Ala Met
Tyr Leu Asp Ala Gly His Ala Gly Trp Leu Gly 295 300 305tgg ccc gcc
aac atc cag ccc gcc gcc gag ctg ttt gcc ggc atc tac 1241Trp Pro Ala
Asn Ile Gln Pro Ala Ala Glu Leu Phe Ala Gly Ile Tyr 310 315 320aat
gat gcc ggc aag ccg gct gcc gtc cgc ggc ctg gcc act aac gtc 1289Asn
Asp Ala Gly Lys Pro Ala Ala Val Arg Gly Leu Ala Thr Asn Val325 330
335 340gcc aac tac aac gcc tgg agc atc gct tcg gcc ccg tcg tac acg
tcg 1337Ala Asn Tyr Asn Ala Trp Ser Ile Ala Ser Ala Pro Ser Tyr Thr
Ser 345 350 355cct aac cct aac tac gac gag aag cac tac atc gag gcc
ttc agc ccg 1385Pro Asn Pro Asn Tyr Asp Glu Lys His Tyr Ile Glu Ala
Phe Ser Pro 360 365 370ctc ttg aac tcg gcc ggc ttc ccc gca cgc ttc
att gtc gac act ggc 1433Leu Leu Asn Ser Ala Gly Phe Pro Ala Arg Phe
Ile Val Asp Thr Gly 375 380 385cgc aac ggc aaa caa cct acc g
gtatgttttt ttttcttttg tctctgtccc 1485Arg Asn Gly Lys Gln Pro Thr
390 395ccccttttct cccccttcag ttggcgtcca caaggtctct tagtcctgct
tcatctgtga 1545ccaacctccc cccccccggc accgcccaca accgtttgac
tctatactct tgggaatggg 1605cgccgaaact gaccgttcca cag gc caa caa cag
tgg ggt gac tgg tgc aat 1657 Gly Gln Gln Gln Trp Gly Asp Trp Cys
Asn 400 405gtc aag ggc acc ggc ttt ggc gtg cgc ccg acg gcc aac acg
ggc cac 1705Val Lys Gly Thr Gly Phe Gly Val Arg Pro Thr Ala Asn Thr
Gly His 410 415 420gag ctg gtc gat gcc ttt gtc tgg gtc aag ccc ggc
ggc gag tcc gac 1753Glu Leu Val Asp Ala Phe Val Trp Val Lys Pro Gly
Gly Glu Ser Asp 425 430 435ggc aca agc gac acc agc gcc gcc cgc tac
gac tac cac tgc ggc ctg 1801Gly Thr Ser Asp Thr Ser Ala Ala Arg Tyr
Asp Tyr His Cys Gly Leu 440 445 450tcc gat gcc ctg cag cct gcc ccc
gag gct gga cag tgg ttc cag gcc 1849Ser Asp Ala Leu Gln Pro Ala Pro
Glu Ala Gly Gln Trp Phe Gln Ala 455 460 465tac ttc gag cag ctg ctc
acc aac gcc aac ccg ccc ttc taaacctcgt 1898Tyr Phe Glu Gln Leu Leu
Thr Asn Ala Asn Pro Pro Phe470 475 480cataaagaga gagagatggc
gggcatgggc ctgattgggt tcattgacca tgcggctctt 1958ctgggggtac
atattttacc tacctaccta taaataaggc ggcctatcgg gctctcgctt
2018cgtttattag gtacttgttc ttgtacatac tttgtttata catacagcag
ttagcatcca 2078ctattcgttt cgacaaagcg gaactttcca gaaaaaa
211511482PRTChrysosporium lucknowense 11Met Ala Lys Lys Leu Phe Ile
Thr Ala Ala Leu Ala Ala Ala Val Leu1 5 10 15Ala Ala Pro Val Ile Glu
Glu Arg Gln Asn Cys Gly Ala Val Trp Thr 20 25 30Gln Cys Gly Gly Asn
Gly Trp Gln Gly Pro Thr Cys Cys Ala Ser Gly 35 40 45Ser Thr Cys Val
Ala Gln Asn Glu Trp Tyr Ser Gln Cys Leu Pro Asn 50 55 60Ser Gln Val
Thr Ser Ser Thr Thr Pro Ser Ser Thr Ser Thr Ser Gln65 70 75 80Arg
Ser Thr Ser Thr Ser Ser Ser Thr Thr Arg Ser Gly Ser Ser Ser 85 90
95Ser Ser Ser Thr Thr Pro Pro Pro Val Ser Ser Pro Val Thr Ser Ile
100 105 110Pro Gly Gly Ala Thr Ser Thr Ala Ser Tyr Ser Gly Asn Pro
Phe Ser 115 120 125Gly Val Arg Leu Phe Ala Asn Asp Tyr Tyr Arg Ser
Glu Val His Asn 130 135 140Leu Ala Ile Pro Ser Met Thr Gly Thr Leu
Ala Ala Lys Ala Ser Ala145 150 155 160Val Ala Glu Val Pro Ser Phe
Gln Trp Leu Asp Arg Asn Val Thr Ile 165 170 175Asp Thr Leu Met Val
Gln Thr Leu Ser Gln Val Arg Ala Leu Asn Lys 180 185 190Ala Gly Ala
Asn Pro Pro Tyr Ala Ala Gln Leu Val Val Tyr Asp Leu 195 200 205Pro
Asp Arg Asp Cys Ala Ala Ala Ala Ser Asn Gly Glu Phe Ser Ile 210 215
220Ala Asn Gly Gly Ala Ala Asn Tyr Arg Ser Tyr Ile Asp Ala Ile
Arg225 230 235 240Lys His Ile Ile Glu Tyr Ser Asp Ile Arg Ile Ile
Leu Val Ile Glu 245 250 255Pro Asp Ser Met Ala Asn Met Val Thr Asn
Met Asn Val Ala Lys Cys 260 265 270Ser Asn Ala Ala Ser Thr Tyr His
Glu Leu Thr Val Tyr Ala Leu Lys 275 280 285Gln Leu Asn Leu Pro Asn
Val Ala Met Tyr Leu Asp Ala Gly His Ala 290 295 300Gly Trp Leu Gly
Trp Pro Ala Asn Ile Gln Pro Ala Ala Glu Leu Phe305 310 315 320Ala
Gly Ile Tyr Asn Asp Ala Gly Lys Pro Ala Ala Val Arg Gly Leu 325 330
335Ala Thr Asn Val Ala Asn Tyr Asn Ala Trp Ser Ile Ala Ser Ala Pro
340 345 350Ser Tyr Thr Ser Pro Asn Pro Asn Tyr Asp Glu Lys His Tyr
Ile Glu 355 360 365Ala Phe Ser Pro Leu Leu Asn Ser Ala Gly Phe Pro
Ala Arg Phe Ile 370 375 380Val Asp Thr Gly Arg Asn Gly Lys Gln Pro
Thr Gly Gln Gln Gln Trp385 390 395 400Gly Asp Trp Cys Asn Val Lys
Gly Thr Gly Phe Gly Val Arg Pro Thr 405 410 415Ala Asn Thr Gly His
Glu Leu Val Asp Ala Phe Val Trp Val Lys Pro 420 425 430Gly Gly Glu
Ser Asp Gly Thr Ser Asp Thr Ser Ala Ala Arg Tyr Asp 435 440 445Tyr
His Cys Gly Leu Ser Asp Ala Leu Gln Pro Ala Pro Glu Ala Gly 450 455
460Gln Trp Phe Gln Ala Tyr Phe Glu Gln Leu Leu Thr Asn Ala Asn
Pro465 470 475 480Pro Phe121446DNAChrysosporium lucknowense
12atggccaaga agcttttcat caccgccgcg cttgcggctg ccgtgttggc ggcccccgtc
60attgaggagc gccagaactg cggcgctgtg tggactcaat gcggcggtaa cgggtggcaa
120ggtcccacat gctgcgcctc gggctcgacc tgcgttgcgc agaacgagtg
gtactctcag 180tgcctgccca acagccaggt gacgagttcc accactccgt
cgtcgacttc cacctcgcag 240cgcagcacca gcacctccag cagcaccacc
aggagcggca gctcctcctc ctcctccacc 300acgcccccgc ccgtctccag
ccccgtgacc agcattcccg gcggtgcgac ctccacggcg 360agctactctg
gcaacccctt ctcgggcgtc cggctcttcg ccaacgacta ctacaggtcc
420gaggtccaca atctcgccat tcctagcatg actggtactc tggcggccaa
ggcttccgcc 480gtcgccgaag tccctagctt ccagtggctc gaccggaacg
tcaccatcga caccctgatg 540gtccagactc tgtcccaggt ccgggctctc
aataaggccg gtgccaatcc tccctatgct 600gcccaactcg tcgtctacga
cctccccgac cgtgactgtg ccgccgctgc gtccaacggc 660gagttttcga
ttgcaaacgg cggcgccgcc aactacagga gctacatcga cgctatccgc
720aagcacatca ttgagtactc ggacatccgg atcatcctgg ttatcgagcc
cgactcgatg 780gccaacatgg tgaccaacat gaacgtggcc aagtgcagca
acgccgcgtc gacgtaccac 840gagttgaccg tgtacgcgct caagcagctg
aacctgccca acgtcgccat gtatctcgac 900gccggccacg ccggctggct
cggctggccc gccaacatcc agcccgccgc cgagctgttt 960gccggcatct
acaatgatgc cggcaagccg gctgccgtcc gcggcctggc cactaacgtc
1020gccaactaca acgcctggag catcgcttcg gccccgtcgt acacgtcgcc
taaccctaac 1080tacgacgaga agcactacat cgaggccttc agcccgctct
tgaactcggc cggcttcccc 1140gcacgcttca ttgtcgacac tggccgcaac
ggcaaacaac ctaccggcca acaacagtgg 1200ggtgactggt gcaatgtcaa
gggcaccggc tttggcgtgc gcccgacggc caacacgggc 1260cacgagctgg
tcgatgcctt tgtctgggtc aagcccggcg gcgagtccga cggcacaagc
1320gacaccagcg ccgcccgcta cgactaccac tgcggcctgt ccgatgccct
gcagcctgcc 1380cccgaggctg gacagtggtt ccaggcctac ttcgagcagc
tgctcaccaa cgccaacccg 1440cccttc 1446132507DNAChrysosporium
lucknowenseCDS(967)..(1572)CDS(1666)..(2427) 13ctatataaag
gtacttacag ataaaatccc caggttgaca gccctttcga gcataagaat 60ccaggtccta
tagtggccgg ctcgaagccc agatttgagc cctattgaga ataactatat
120ggcactggat gaaggattgg cttgaacatc gctacgatgt ttaatcgctg
agtcttacag 180ggcttcgact ggctgttctg gaggcgtggg aggcagtccc
ccccgagttt cttcagtgcc 240tggcacactc aatgcctcga cggctagcaa
aaaccagggg ggagagactg ggtattgata 300cttctgtatt atagaaattt
gtattgggtt ggttgtacct agacttcttg gctacttttc 360tgatttgcgt
tagttgtggg gcatgtatcg cactaggatt aataatgctg gaagaggtgt
420agagatgttg cgtcgcatgg ccaccgagat ccggcctggc gtcgtgcctt
gcagctcgca 480cgatgcgcgc gacgatgcgg gacacccctc ccccctcccc
cctcccccct acattaggag 540gggatccggg gccgctgccc gttttccgtc
ttggtattcc cggctgacac aacggttctc 600cagagtgata accgaggcac
accagtagca gtccacgatg gcacggcagg ggatacattc 660cgcccggaac
aacccaagat ctgggaatcc cagtgacgac cccggggtcc tccggggtcc
720cccgatttcg tctgaaccga cgaactggaa ggaggcgagc cttggaacga
tgggggatat 780gatatatatt aagatgcaac attctctccc tccctcctct
ctctctccct ccccccctct 840ctcctcctct tcccctcctc cccgccgttc
tctctccatg agcctcaatt cttgctccga 900gcgcggtgta tttcccccac
gaggaattga caaaagaaaa agaaaaagac aagactctcg 960agaacg atg ggt cgc
ggc gct gct ttc cta ggc ctc gcc tcg ctc ctc 1008 Met Gly Arg Gly
Ala Ala Phe Leu Gly Leu Ala Ser Leu Leu 1 5 10gtg ggc gcg gcc aag
gcc cag acg ccc ggc gag ggc gag gag gtg cac 1056Val Gly Ala Ala Lys
Ala Gln Thr Pro Gly Glu Gly Glu Glu Val His15 20 25 30ccg cag atc
acg acg tac cgc tgc acc aag gcg gac ggg tgc gag gag 1104Pro Gln Ile
Thr Thr Tyr Arg Cys Thr Lys Ala Asp Gly Cys Glu Glu 35 40 45aag acc
aac tac atc gtg ctg gac gcc cta tcg cac ccg gtc cac cag 1152Lys Thr
Asn Tyr Ile Val Leu Asp Ala Leu Ser His Pro Val His Gln 50 55 60gtc
gac aac ccg tac aac tgc ggc gac tgg ggc cag aag ccc aac gag 1200Val
Asp Asn Pro Tyr Asn Cys Gly Asp Trp Gly Gln Lys Pro Asn Glu 65 70
75acg gcc tgc ccg gac ctc gag tcg tgc gcc agg aac tgc atc atg gac
1248Thr Ala Cys Pro Asp Leu Glu Ser Cys Ala Arg Asn Cys Ile Met Asp
80 85 90ccg gtc tcg gac tac ggc cgg cac ggg gtc tcg acc gac ggc acc
tcg 1296Pro Val Ser Asp Tyr Gly Arg His Gly Val Ser Thr Asp Gly Thr
Ser95 100 105 110ctg cgc ctc aag cag cta gtc ggc ggc aac gtc gtc
agc ccg cgc gtc 1344Leu Arg Leu Lys Gln Leu Val Gly Gly Asn Val Val
Ser Pro Arg Val 115 120 125tac ctg ctc gac gag acc aag gag cgc tac
gag atg ctc aag ctg acc 1392Tyr Leu Leu Asp Glu Thr Lys Glu Arg Tyr
Glu Met Leu Lys Leu Thr 130 135 140ggc aac gag ttc acc ttt gac gtc
gac gcc acc aag ctg ccc tgc ggc 1440Gly Asn Glu Phe Thr Phe Asp Val
Asp Ala Thr Lys Leu Pro Cys Gly 145 150 155atg aac agc gcc ctc tac
ctc tcc gag atg gac gcc acc ggc gcc cgg 1488Met Asn Ser Ala Leu Tyr
Leu Ser Glu Met Asp Ala Thr Gly Ala Arg 160 165 170agc gag ctc aac
ccg ggc ggc gcc acc ttt ggc acc ggc tac tgc gac 1536Ser Glu Leu Asn
Pro Gly Gly Ala Thr Phe Gly Thr Gly Tyr Cys Asp175 180 185 190gcc
cag tgc tac gtc acc ccc ttc atc aac ggc ctc gtacgtattc 1582Ala Gln
Cys Tyr Val Thr Pro Phe Ile Asn Gly Leu 195 200tcctattaat
cttgtttctt ttttcctttt ctttttttga tataatagaa ttaagaagaa
1642ctcggtgggc tgacatgaca cag ggc aac atc gag ggc aag ggc gcg tgc
tgc 1695 Gly Asn Ile Glu Gly Lys Gly Ala Cys Cys 205 210aac gag atg
gat atc tgg gag gcc aac gcg cgg gcg cag cac atc gcg 1743Asn Glu Met
Asp Ile Trp Glu Ala Asn Ala Arg Ala Gln His Ile Ala 215 220 225ccg
cac ccg tgc agc aag gcg ggg ccg tac ctg tgc gag ggc gcc gag 1791Pro
His Pro Cys Ser Lys Ala Gly Pro Tyr Leu Cys Glu Gly Ala Glu 230 235
240tgc gag ttc gac ggc gtg tgc gac aag aac ggc tgc gcc tgg aac ccg
1839Cys Glu Phe Asp Gly Val Cys Asp Lys Asn Gly Cys Ala Trp Asn
Pro245 250 255 260tac cgg gtc aac gtg acg gac tac tac ggc gag ggc
gcc gag ttc agg 1887Tyr Arg Val Asn Val Thr Asp Tyr Tyr Gly Glu Gly
Ala Glu Phe Arg 265 270 275gtg gac acg acc cgg ccc ttc tcg gtc gtc
acg cag ttc cgc gcc ggc 1935Val Asp Thr Thr Arg Pro Phe Ser Val Val
Thr Gln Phe Arg Ala Gly 280 285 290ggc gac gcg ggg ggc ggc aag ctc
gag agc atc tac cgg ctc ttc gtc 1983Gly Asp Ala Gly Gly Gly Lys Leu
Glu Ser Ile Tyr Arg Leu Phe Val 295 300 305cag gac ggc agg gtg atc
gag tcg tac gtc gtc gac aag ccc ggc ctg 2031Gln Asp Gly Arg Val Ile
Glu Ser Tyr Val Val Asp Lys Pro Gly Leu 310 315 320ccc ccg acg gac
cgc atg acg gac gag ttc tgc gcc gcc acc ggc gcc 2079Pro Pro Thr Asp
Arg Met Thr Asp Glu Phe Cys Ala Ala Thr Gly Ala325 330 335 340gcc
cgc ttc acg gag ctc ggc gcc atg gag gcc atg ggc gac gcc ctg 2127Ala
Arg Phe Thr Glu Leu Gly Ala Met Glu Ala Met Gly Asp Ala Leu 345 350
355acg cgc ggc atg gtc ctc gcc ctc agc atc tgg tgg agc gag ggc gac
2175Thr Arg Gly Met Val Leu Ala Leu Ser Ile Trp Trp Ser Glu Gly Asp
360 365 370aac atg aac tgg ctc gac tcg ggc gag gcc ggc ccc tgc gac
ccg gac 2223Asn Met Asn Trp Leu Asp Ser Gly Glu Ala Gly Pro Cys Asp
Pro Asp 375 380 385gag ggc aac ccg tcc aac atc atc cgc gtc cag ccc
gac ccg gag gtc 2271Glu Gly Asn Pro Ser Asn Ile Ile Arg Val Gln Pro
Asp Pro Glu Val 390 395 400gtc ttc agc aac ctg cgc tgg ggc gag atc
ggc tca acc tac gag tcc 2319Val Phe Ser Asn Leu Arg Trp Gly Glu Ile
Gly Ser Thr Tyr Glu Ser405 410 415 420gcc gtc gac ggg ccc gtc ggc
aag ggc aag ggc aag ggc aag ggc aag 2367Ala Val Asp Gly Pro Val Gly
Lys Gly Lys Gly Lys Gly Lys Gly Lys 425 430 435gct ccc gcc ggc gac
ggc aac ggg aag gag aag agc aat ggc aag cgc 2415Ala Pro Ala Gly Asp
Gly Asn Gly Lys Glu Lys Ser Asn Gly Lys Arg 440 445 450ttc agg agg
ttc tgagcaacct tgatattatt tttttctttc tttccttcac 2467Phe Arg Arg Phe
455cagttaatta gttgcctttg attagaaaga gagagagaaa 2507
14456PRTChrysosporium lucknowense 14Met Gly Arg Gly Ala Ala Phe Leu
Gly Leu Ala Ser Leu Leu Val Gly1 5 10 15Ala Ala Lys Ala Gln Thr Pro
Gly Glu Gly Glu Glu Val His Pro Gln 20 25 30Ile Thr Thr Tyr Arg Cys
Thr Lys Ala Asp Gly Cys Glu Glu Lys Thr 35 40 45Asn Tyr Ile Val Leu
Asp Ala Leu Ser His Pro Val His Gln Val Asp 50 55 60Asn Pro Tyr Asn
Cys Gly Asp Trp Gly Gln Lys Pro Asn Glu Thr Ala65 70 75 80Cys Pro
Asp Leu Glu Ser Cys Ala Arg Asn Cys Ile Met Asp Pro Val 85 90 95Ser
Asp Tyr Gly Arg His Gly Val Ser Thr Asp Gly Thr Ser Leu Arg 100 105
110Leu Lys Gln Leu Val Gly Gly Asn Val Val Ser Pro Arg Val Tyr Leu
115 120 125Leu Asp Glu Thr Lys Glu Arg Tyr Glu Met Leu Lys Leu Thr
Gly Asn 130 135 140Glu Phe Thr Phe Asp Val Asp Ala Thr Lys Leu Pro
Cys Gly Met Asn145 150
155 160Ser Ala Leu Tyr Leu Ser Glu Met Asp Ala Thr Gly Ala Arg Ser
Glu 165 170 175Leu Asn Pro Gly Gly Ala Thr Phe Gly Thr Gly Tyr Cys
Asp Ala Gln 180 185 190Cys Tyr Val Thr Pro Phe Ile Asn Gly Leu Gly
Asn Ile Glu Gly Lys 195 200 205Gly Ala Cys Cys Asn Glu Met Asp Ile
Trp Glu Ala Asn Ala Arg Ala 210 215 220Gln His Ile Ala Pro His Pro
Cys Ser Lys Ala Gly Pro Tyr Leu Cys225 230 235 240Glu Gly Ala Glu
Cys Glu Phe Asp Gly Val Cys Asp Lys Asn Gly Cys 245 250 255Ala Trp
Asn Pro Tyr Arg Val Asn Val Thr Asp Tyr Tyr Gly Glu Gly 260 265
270Ala Glu Phe Arg Val Asp Thr Thr Arg Pro Phe Ser Val Val Thr Gln
275 280 285Phe Arg Ala Gly Gly Asp Ala Gly Gly Gly Lys Leu Glu Ser
Ile Tyr 290 295 300Arg Leu Phe Val Gln Asp Gly Arg Val Ile Glu Ser
Tyr Val Val Asp305 310 315 320Lys Pro Gly Leu Pro Pro Thr Asp Arg
Met Thr Asp Glu Phe Cys Ala 325 330 335Ala Thr Gly Ala Ala Arg Phe
Thr Glu Leu Gly Ala Met Glu Ala Met 340 345 350Gly Asp Ala Leu Thr
Arg Gly Met Val Leu Ala Leu Ser Ile Trp Trp 355 360 365Ser Glu Gly
Asp Asn Met Asn Trp Leu Asp Ser Gly Glu Ala Gly Pro 370 375 380Cys
Asp Pro Asp Glu Gly Asn Pro Ser Asn Ile Ile Arg Val Gln Pro385 390
395 400Asp Pro Glu Val Val Phe Ser Asn Leu Arg Trp Gly Glu Ile Gly
Ser 405 410 415Thr Tyr Glu Ser Ala Val Asp Gly Pro Val Gly Lys Gly
Lys Gly Lys 420 425 430Gly Lys Gly Lys Ala Pro Ala Gly Asp Gly Asn
Gly Lys Glu Lys Ser 435 440 445Asn Gly Lys Arg Phe Arg Arg Phe 450
455151368DNAChrysosporium lucknowense 15atgggtcgcg gcgctgcttt
cctaggcctc gcctcgctcc tcgtgggcgc ggccaaggcc 60cagacgcccg gcgagggcga
ggaggtgcac ccgcagatca cgacgtaccg ctgcaccaag 120gcggacgggt
gcgaggagaa gaccaactac atcgtgctgg acgccctatc gcacccggtc
180caccaggtcg acaacccgta caactgcggc gactggggcc agaagcccaa
cgagacggcc 240tgcccggacc tcgagtcgtg cgccaggaac tgcatcatgg
acccggtctc ggactacggc 300cggcacgggg tctcgaccga cggcacctcg
ctgcgcctca agcagctagt cggcggcaac 360gtcgtcagcc cgcgcgtcta
cctgctcgac gagaccaagg agcgctacga gatgctcaag 420ctgaccggca
acgagttcac ctttgacgtc gacgccacca agctgccctg cggcatgaac
480agcgccctct acctctccga gatggacgcc accggcgccc ggagcgagct
caacccgggc 540ggcgccacct ttggcaccgg ctactgcgac gcccagtgct
acgtcacccc cttcatcaac 600ggcctcggca acatcgaggg caagggcgcg
tgctgcaacg agatggatat ctgggaggcc 660aacgcgcggg cgcagcacat
cgcgccgcac ccgtgcagca aggcggggcc gtacctgtgc 720gagggcgccg
agtgcgagtt cgacggcgtg tgcgacaaga acggctgcgc ctggaacccg
780taccgggtca acgtgacgga ctactacggc gagggcgccg agttcagggt
ggacacgacc 840cggcccttct cggtcgtcac gcagttccgc gccggcggcg
acgcgggggg cggcaagctc 900gagagcatct accggctctt cgtccaggac
ggcagggtga tcgagtcgta cgtcgtcgac 960aagcccggcc tgcccccgac
ggaccgcatg acggacgagt tctgcgccgc caccggcgcc 1020gcccgcttca
cggagctcgg cgccatggag gccatgggcg acgccctgac gcgcggcatg
1080gtcctcgccc tcagcatctg gtggagcgag ggcgacaaca tgaactggct
cgactcgggc 1140gaggccggcc cctgcgaccc ggacgagggc aacccgtcca
acatcatccg cgtccagccc 1200gacccggagg tcgtcttcag caacctgcgc
tggggcgaga tcggctcaac ctacgagtcc 1260gccgtcgacg ggcccgtcgg
caagggcaag ggcaagggca agggcaaggc tcccgccggc 1320gacggcaacg
ggaaggagaa gagcaatggc aagcgcttca ggaggttc
1368161305DNAChrysosporium
lucknowenseCDS(103)..(545)CDS(683)..(903)CDS(986)..(1062)
16cgacaaagac ccgtcagtga ttaataataa ttagtagcag tttctttctt tcaagactca
60agaatactcc tttccgccat cgtggcagcg tttagattca tc atg cag ccg ttt
114 Met Gln Pro Phe 1ctg ctc ttg ttc ctc tcg tcg gtc acg gcg gcg
agc ccc ctg acg gcg 162Leu Leu Leu Phe Leu Ser Ser Val Thr Ala Ala
Ser Pro Leu Thr Ala5 10 15 20ctc gac aag cgg cag cag gcg acg ttg
tgc gag cag tac ggc tac tgg 210Leu Asp Lys Arg Gln Gln Ala Thr Leu
Cys Glu Gln Tyr Gly Tyr Trp 25 30 35tcg ggc aac ggt tac gag gtc aac
aac aac aac tgg ggc aag gat tcg 258Ser Gly Asn Gly Tyr Glu Val Asn
Asn Asn Asn Trp Gly Lys Asp Ser 40 45 50gcc tcg ggc ggc cat cag tgc
acc tac gtc gac agc agc agc tcc agc 306Ala Ser Gly Gly His Gln Cys
Thr Tyr Val Asp Ser Ser Ser Ser Ser 55 60 65ggc gtc gcc tgg cac acg
acc tgg cag tgg gaa gga ggc cag aac cag 354Gly Val Ala Trp His Thr
Thr Trp Gln Trp Glu Gly Gly Gln Asn Gln 70 75 80gtc aag agc ttc gcc
aac tgc ggc ctg cag gtg ccc aag ggc agg acc 402Val Lys Ser Phe Ala
Asn Cys Gly Leu Gln Val Pro Lys Gly Arg Thr85 90 95 100atc tcg tcc
atc agc aac ctg cag acc tcc atc tcg tgg tcc tac agc 450Ile Ser Ser
Ile Ser Asn Leu Gln Thr Ser Ile Ser Trp Ser Tyr Ser 105 110 115aac
acc aac atc cgc gcc aac gtg gcc tac gac ctc ttc acc gcg gca 498Asn
Thr Asn Ile Arg Ala Asn Val Ala Tyr Asp Leu Phe Thr Ala Ala 120 125
130gac ccg aac cac gcg acc agc agc ggc gac tac gag ctc atg atc tg
545Asp Pro Asn His Ala Thr Ser Ser Gly Asp Tyr Glu Leu Met Ile Trp
135 140 145gtcagttttt tttttctttt ttcttttctt ctcttttctt ttcttttcct
ttctcctgtt 605ttattttctt atccattgct tcgccctctt tccttaaccc
tgctgactct ctcttcttgt 665caatgatact gtaatag g ctg gcg aga ttc ggc
gac gtc tac ccc atc ggc 716 Leu Ala Arg Phe Gly Asp Val Tyr Pro Ile
Gly 150 155tcg tcc cag ggc cac gtc aac gtg gcc ggc cag gac tgg gag
ctg tgg 764Ser Ser Gln Gly His Val Asn Val Ala Gly Gln Asp Trp Glu
Leu Trp160 165 170 175acg ggc ttc aac ggc aac atg cgg gtc tac agc
ttc gta gcg ccc agc 812Thr Gly Phe Asn Gly Asn Met Arg Val Tyr Ser
Phe Val Ala Pro Ser 180 185 190ccc cgc aac agc ttc agc gcc aac gtc
aag gac ttc ttc aac tat ctc 860Pro Arg Asn Ser Phe Ser Ala Asn Val
Lys Asp Phe Phe Asn Tyr Leu 195 200 205cag tcc aac cag ggc ttc ccg
gcc agc agc caa tac ctt ctc a 903Gln Ser Asn Gln Gly Phe Pro Ala
Ser Ser Gln Tyr Leu Leu 210 215 220gtaaggagac gagatctcga acagcatacc
atatatgcgt gcggtacaag tgcactaacc 963cccttttttt tcccgttcgc ag tc ttc
cag gcg ggc acc gag ccc ttc acc 1014 Ile Phe Gln Ala Gly Thr Glu
Pro Phe Thr 225 230ggc ggc gag acc acc ctt acc gtc aac aac tac tct
gca agg gtt gct 1062Gly Gly Glu Thr Thr Leu Thr Val Asn Asn Tyr Ser
Ala Arg Val Ala 235 240 245taaacaggaa ggccgaggat ggcccccaag
gccgttgcgg gttcacgagc tctcttcttt 1122tcaagtgctg tacatacata
attagcgtac caagtcatag ctgtttgtca gcttcaaact 1182aagtgctcgc
ccacaaaaga ggggggaggg gaaaataaca aattgccgaa cgcagtgata
1242agcttctggg agcgttgaaa gcagtctaca gtaggtggct gtacgaagga
aaagagtgcc 1302tta 130517247PRTChrysosporium lucknowense 17Met Gln
Pro Phe Leu Leu Leu Phe Leu Ser Ser Val Thr Ala Ala Ser1 5 10 15Pro
Leu Thr Ala Leu Asp Lys Arg Gln Gln Ala Thr Leu Cys Glu Gln 20 25
30Tyr Gly Tyr Trp Ser Gly Asn Gly Tyr Glu Val Asn Asn Asn Asn Trp
35 40 45Gly Lys Asp Ser Ala Ser Gly Gly His Gln Cys Thr Tyr Val Asp
Ser 50 55 60Ser Ser Ser Ser Gly Val Ala Trp His Thr Thr Trp Gln Trp
Glu Gly65 70 75 80Gly Gln Asn Gln Val Lys Ser Phe Ala Asn Cys Gly
Leu Gln Val Pro 85 90 95Lys Gly Arg Thr Ile Ser Ser Ile Ser Asn Leu
Gln Thr Ser Ile Ser 100 105 110Trp Ser Tyr Ser Asn Thr Asn Ile Arg
Ala Asn Val Ala Tyr Asp Leu 115 120 125Phe Thr Ala Ala Asp Pro Asn
His Ala Thr Ser Ser Gly Asp Tyr Glu 130 135 140Leu Met Ile Trp Leu
Ala Arg Phe Gly Asp Val Tyr Pro Ile Gly Ser145 150 155 160Ser Gln
Gly His Val Asn Val Ala Gly Gln Asp Trp Glu Leu Trp Thr 165 170
175Gly Phe Asn Gly Asn Met Arg Val Tyr Ser Phe Val Ala Pro Ser Pro
180 185 190Arg Asn Ser Phe Ser Ala Asn Val Lys Asp Phe Phe Asn Tyr
Leu Gln 195 200 205Ser Asn Gln Gly Phe Pro Ala Ser Ser Gln Tyr Leu
Leu Ile Phe Gln 210 215 220Ala Gly Thr Glu Pro Phe Thr Gly Gly Glu
Thr Thr Leu Thr Val Asn225 230 235 240Asn Tyr Ser Ala Arg Val Ala
24518741DNAChrysosporium lucknowense 18atgcagccgt ttctgctctt
gttcctctcg tcggtcacgg cggcgagccc cctgacggcg 60ctcgacaagc ggcagcaggc
gacgttgtgc gagcagtacg gctactggtc gggcaacggt 120tacgaggtca
acaacaacaa ctggggcaag gattcggcct cgggcggcca tcagtgcacc
180tacgtcgaca gcagcagctc cagcggcgtc gcctggcaca cgacctggca
gtgggaagga 240ggccagaacc aggtcaagag cttcgccaac tgcggcctgc
aggtgcccaa gggcaggacc 300atctcgtcca tcagcaacct gcagacctcc
atctcgtggt cctacagcaa caccaacatc 360cgcgccaacg tggcctacga
cctcttcacc gcggcagacc cgaaccacgc gaccagcagc 420ggcgactacg
agctcatgat ctggctggcg agattcggcg acgtctaccc catcggctcg
480tcccagggcc acgtcaacgt ggccggccag gactgggagc tgtggacggg
cttcaacggc 540aacatgcggg tctacagctt cgtagcgccc agcccccgca
acagcttcag cgccaacgtc 600aaggacttct tcaactatct ccagtccaac
cagggcttcc cggccagcag ccaatacctt 660ctcatcttcc aggcgggcac
cgagcccttc accggcggcg agaccaccct taccgtcaac 720aactactctg
caagggttgc t 741191448DNAChrysosporium
lucknowenseCDS(166)..(597)CDS(708)..(738)CDS(846)..(972)CDS(1074)..(1158)
19atatataagt gagagtgttt tttgactgcc ccgggttctg gtagttgaag ggaagttcga
60tgctctctgc tgtcgtcgct ctcgtcgctc tcgtcggcat cctccatccg tccgcctttg
120ataacccgct ccccgactca gtcaagacga cgcatacttg gcacc atg cat ctc
tcc 177 Met His Leu Ser 1gcc acc acc ggg ttc ctc gcc ctc ccg gcc
ctg gcc ctg gcc cag ctc 225Ala Thr Thr Gly Phe Leu Ala Leu Pro Ala
Leu Ala Leu Ala Gln Leu5 10 15 20tcg ggc agc ggc cag acg acc cgg
tac tgg gac tgc tgc aag ccg agc 273Ser Gly Ser Gly Gln Thr Thr Arg
Tyr Trp Asp Cys Cys Lys Pro Ser 25 30 35tgc gcc tgg ccc ggc aag ggc
ccc tcg tct ccg gtg cag gcc tgc gac 321Cys Ala Trp Pro Gly Lys Gly
Pro Ser Ser Pro Val Gln Ala Cys Asp 40 45 50aag aac gac aac ccg ctc
aac gac ggc ggc tcc acc cgg tcc ggc tgc 369Lys Asn Asp Asn Pro Leu
Asn Asp Gly Gly Ser Thr Arg Ser Gly Cys 55 60 65gac gcg ggc ggc agc
gcc tac atg tgc tcc tcc cag agc ccc tgg gcc 417Asp Ala Gly Gly Ser
Ala Tyr Met Cys Ser Ser Gln Ser Pro Trp Ala 70 75 80gtc agc gac gag
ctg tcg tac ggc tgg gcg gcc gtc aag ctc gcc ggc 465Val Ser Asp Glu
Leu Ser Tyr Gly Trp Ala Ala Val Lys Leu Ala Gly85 90 95 100agc tcc
gag tcg cag tgg tgc tgc gcc tgc tac gag ctg acc ttc acc 513Ser Ser
Glu Ser Gln Trp Cys Cys Ala Cys Tyr Glu Leu Thr Phe Thr 105 110
115agc ggg ccg gtc gcg ggc aag aag atg att gtg cag gcg acc aac acc
561Ser Gly Pro Val Ala Gly Lys Lys Met Ile Val Gln Ala Thr Asn Thr
120 125 130ggt ggc gac ctg ggc gac aac cac ttt gac ctg gcc
gtgagttgcc 607Gly Gly Asp Leu Gly Asp Asn His Phe Asp Leu Ala 135
140tccccttctc cccggaccgc tcagattaga tgagattaga ctttgctcgt
aaatcggtcc 667aagattcccg ttgactgacc aacaaacatc atacgggcag atc ccc
ggt ggc ggt 722 Ile Pro Gly Gly Gly 145gtc ggt att ttc aac g
gtaagctggt gcccccggac ccctccccgg acccctcccc 778Val Gly Ile Phe
Asn150cttttcctcc agcgagccga gttgggatcg ccgagatcga gaactcacac
aacttctctc 838tcgacag cc tgc acc gac cag tac ggc gct ccc ccg aac
ggc tgg ggc 886 Ala Cys Thr Asp Gln Tyr Gly Ala Pro Pro Asn Gly Trp
Gly 155 160 165gac cgc tac ggc ggc atc cat tcc aag gaa gag tgc gaa
tcc ttc ccg 934Asp Arg Tyr Gly Gly Ile His Ser Lys Glu Glu Cys Glu
Ser Phe Pro 170 175 180gag gcc ctc aag ccc ggc tgc aac tgg cgc ttc
gac tg gtacgttgct 982Glu Ala Leu Lys Pro Gly Cys Asn Trp Arg Phe
Asp Trp185 190 195ttgacatacc ggaacccaat tcctccaacc cccccccttt
tctcccccaa ctccgggggt 1042agtcggaatg tcgcgactga ccctatttca g g ttc
caa aac gcc gac aac ccg 1095 Phe Gln Asn Ala Asp Asn Pro 200tcg gtc
acc ttc cag gag gtg gcc tgc ccg tcg gag ctc acg tcc aag 1143Ser Val
Thr Phe Gln Glu Val Ala Cys Pro Ser Glu Leu Thr Ser Lys205 210 215
220agc ggc tgc tcc cgt taagagggaa gagagggggc tggaaggacc gaaagattca
1198Ser Gly Cys Ser Arg 225acctctgctc ctgctgggga agctcgggcg
cgagtgtgaa actggtgtaa atattgtggc 1258acacacaagc tactacagtc
cgtctcgccg tccggctaac tagccttgct gcggatctgt 1318ccatcttcgg
tccgaactgt ccgttgctgt tttggctcgg tgcctcatct tctcccaacc
1378tagtcaagaa tgaatcgtga gagaggctga gagagataag atcgacttca
gaaatccagg 1438gttgaaagca 144820225PRTChrysosporium lucknowense
20Met His Leu Ser Ala Thr Thr Gly Phe Leu Ala Leu Pro Ala Leu Ala1
5 10 15Leu Ala Gln Leu Ser Gly Ser Gly Gln Thr Thr Arg Tyr Trp Asp
Cys 20 25 30Cys Lys Pro Ser Cys Ala Trp Pro Gly Lys Gly Pro Ser Ser
Pro Val 35 40 45Gln Ala Cys Asp Lys Asn Asp Asn Pro Leu Asn Asp Gly
Gly Ser Thr 50 55 60Arg Ser Gly Cys Asp Ala Gly Gly Ser Ala Tyr Met
Cys Ser Ser Gln65 70 75 80Ser Pro Trp Ala Val Ser Asp Glu Leu Ser
Tyr Gly Trp Ala Ala Val 85 90 95Lys Leu Ala Gly Ser Ser Glu Ser Gln
Trp Cys Cys Ala Cys Tyr Glu 100 105 110Leu Thr Phe Thr Ser Gly Pro
Val Ala Gly Lys Lys Met Ile Val Gln 115 120 125Ala Thr Asn Thr Gly
Gly Asp Leu Gly Asp Asn His Phe Asp Leu Ala 130 135 140Ile Pro Gly
Gly Gly Val Gly Ile Phe Asn Ala Cys Thr Asp Gln Tyr145 150 155
160Gly Ala Pro Pro Asn Gly Trp Gly Asp Arg Tyr Gly Gly Ile His Ser
165 170 175Lys Glu Glu Cys Glu Ser Phe Pro Glu Ala Leu Lys Pro Gly
Cys Asn 180 185 190Trp Arg Phe Asp Trp Phe Gln Asn Ala Asp Asn Pro
Ser Val Thr Phe 195 200 205Gln Glu Val Ala Cys Pro Ser Glu Leu Thr
Ser Lys Ser Gly Cys Ser 210 215 220Arg22521675DNAChrysosporium
lucknowense 21atgcatctct ccgccaccac cgggttcctc gccctcccgg
ccctggccct ggcccagctc 60tcgggcagcg gccagacgac ccggtactgg gactgctgca
agccgagctg cgcctggccc 120ggcaagggcc cctcgtctcc ggtgcaggcc
tgcgacaaga acgacaaccc gctcaacgac 180ggcggctcca cccggtccgg
ctgcgacgcg ggcggcagcg cctacatgtg ctcctcccag 240agcccctggg
ccgtcagcga cgagctgtcg tacggctggg cggccgtcaa gctcgccggc
300agctccgagt cgcagtggtg ctgcgcctgc tacgagctga ccttcaccag
cgggccggtc 360gcgggcaaga agatgattgt gcaggcgacc aacaccggtg
gcgacctggg cgacaaccac 420tttgacctgg ccatccccgg tggcggtgtc
ggtattttca acgcctgcac cgaccagtac 480ggcgctcccc cgaacggctg
gggcgaccgc tacggcggca tccattccaa ggaagagtgc 540gaatccttcc
cggaggccct caagcccggc tgcaactggc gcttcgactg gttccaaaac
600gccgacaacc cgtcggtcac cttccaggag gtggcctgcc cgtcggagct
cacgtccaag 660agcggctgct cccgt 675221658DNAChrysosporium
lucknowenseCDS(211)..(1139)CDS(1217)..(1430) 22aggaacgggg
aggaggaggg cttggttagg gtcgcgttcg tttggagatt gctgagctct 60gagccttcgg
tccttggatc cctgcggtcc ccggtctcct ctctctctct ctctctctct
120ctctctctct cttcttccca cgctcgttcg acagacgcct ccccttcttc
gctctccttt 180ccctcgcacg tagcacacta atagtgcacc atg cgc gtc tct agt
ttg gtc gcg 234 Met Arg Val Ser Ser Leu Val Ala 1 5gcc ctt gct acc
ggt ggt ctt gtc gcc gcc acg cct aag ccc aag ggg 282Ala Leu Ala Thr
Gly Gly Leu Val Ala Ala Thr Pro Lys Pro Lys Gly 10 15 20tcg tcg ccc
cct ggg gcc gtg gac gcg aac cct ttc aag ggc aag acg 330Ser Ser Pro
Pro Gly Ala Val Asp Ala Asn Pro Phe Lys Gly Lys Thr25 30
35 40cag ttc gtc aac ccg gca tgg gcg gcc aag ctg gaa cag acc aaa
aag 378Gln Phe Val Asn Pro Ala Trp Ala Ala Lys Leu Glu Gln Thr Lys
Lys 45 50 55gcg ttc ctg gcc agg aac gac acc gtc aat gcc gcc aag acg
gag aag 426Ala Phe Leu Ala Arg Asn Asp Thr Val Asn Ala Ala Lys Thr
Glu Lys 60 65 70gtc cag cag acc agc tcg ttc gtc tgg gtc tcg agg atc
gcc gag ctc 474Val Gln Gln Thr Ser Ser Phe Val Trp Val Ser Arg Ile
Ala Glu Leu 75 80 85tcc aac atc gac gac gcc atc gcg gct gcc cgc aag
gcg cag aag aag 522Ser Asn Ile Asp Asp Ala Ile Ala Ala Ala Arg Lys
Ala Gln Lys Lys 90 95 100acg ggc agg agg cag atc gtc ggc ctg gtg
ctc tac aac ctt ccg gac 570Thr Gly Arg Arg Gln Ile Val Gly Leu Val
Leu Tyr Asn Leu Pro Asp105 110 115 120cgc gac tgc agc gcg ggc gag
agc gcg ggc gag ctc agc agc gac aag 618Arg Asp Cys Ser Ala Gly Glu
Ser Ala Gly Glu Leu Ser Ser Asp Lys 125 130 135aac ggg ctc gag atc
tac aag act gag ttc gtc aag ccc ttc gcc gac 666Asn Gly Leu Glu Ile
Tyr Lys Thr Glu Phe Val Lys Pro Phe Ala Asp 140 145 150aag gtg gcg
gcc gca aag gac ctc gac ttc gcc atc gtc ctg gag ccc 714Lys Val Ala
Ala Ala Lys Asp Leu Asp Phe Ala Ile Val Leu Glu Pro 155 160 165gac
tcg ctg gcc aac ctg gtc acc aac ctg ggc atc gag ttc tgc gcc 762Asp
Ser Leu Ala Asn Leu Val Thr Asn Leu Gly Ile Glu Phe Cys Ala 170 175
180aac gcc gcc ccc gtc tac cgc gag ggc atc gcc tat gcc atc tcc agc
810Asn Ala Ala Pro Val Tyr Arg Glu Gly Ile Ala Tyr Ala Ile Ser
Ser185 190 195 200ctt cag cag cca aac gtg cac ttg tac atc gat gct
gcc cac ggc ggc 858Leu Gln Gln Pro Asn Val His Leu Tyr Ile Asp Ala
Ala His Gly Gly 205 210 215tgg ctc ggc tgg gac gac aac ctg ccg ctg
gcc gcc aag gag ttt gcc 906Trp Leu Gly Trp Asp Asp Asn Leu Pro Leu
Ala Ala Lys Glu Phe Ala 220 225 230gag gtg gtc aag ctt gcc ggc gag
ggc aag aag atc cgc ggc ttc gtc 954Glu Val Val Lys Leu Ala Gly Glu
Gly Lys Lys Ile Arg Gly Phe Val 235 240 245acc aac gtg tcc aac tac
aac ccc ttc cac gcc gtc gtg cgc gag aac 1002Thr Asn Val Ser Asn Tyr
Asn Pro Phe His Ala Val Val Arg Glu Asn 250 255 260ttt acc gag tgg
agc aac tcg tgg gac gag tct cac tac gcc tcc tcg 1050Phe Thr Glu Trp
Ser Asn Ser Trp Asp Glu Ser His Tyr Ala Ser Ser265 270 275 280ctc
aca ccg ttc ctc gag aaa gag ggg ctg ccg gca cgc ttc atc gtc 1098Leu
Thr Pro Phe Leu Glu Lys Glu Gly Leu Pro Ala Arg Phe Ile Val 285 290
295gac cag ggt cgc gtt gcc ctc ccg gga gcc cgc aag gag tg 1139Asp
Gln Gly Arg Val Ala Leu Pro Gly Ala Arg Lys Glu Trp 300 305
310gtgagtttcg accagattga ccctcgaccc atgcgaccga gattgctgac
gattgaattg 1199cgtgtcccgt cccccag g ggt gaa tgg tgc aac gtg gca ccc
gcc gga ttt 1250 Gly Glu Trp Cys Asn Val Ala Pro Ala Gly Phe 315
320ggc ccc gcg ccc acg acc agg gtc aac aac acc gtc gtc gat gct ctc
1298Gly Pro Ala Pro Thr Thr Arg Val Asn Asn Thr Val Val Asp Ala Leu
325 330 335gtc tgg gtc aag cct ggc ggc gag agc gac ggc gag tgt ggc
ttg gct 1346Val Trp Val Lys Pro Gly Gly Glu Ser Asp Gly Glu Cys Gly
Leu Ala 340 345 350ggc gcc ccc aag gcc ggc cag tgg ttc gac gag tac
gcc cag atg ctg 1394Gly Ala Pro Lys Ala Gly Gln Trp Phe Asp Glu Tyr
Ala Gln Met Leu 355 360 365gtc gag aat gcc cac ccg tct gtc gtc cac
aag tgg tagataaatt 1440Val Glu Asn Ala His Pro Ser Val Val His Lys
Trp370 375 380ttggagtccg agaagggtcc cagatagact tttgttttaa
aacaaaatgc aaggtgtcga 1500cagatactgg cttaacatta accaagcacc
atgaacatga cttgtcaaca tattgataca 1560ttccgctgct ttcccatacg
tgctctcagg tctcagggat caaatggata ggtcggtaat 1620gcaaaacgat
ccattggata tccagaagag agaaaaaa 165823381PRTChrysosporium
lucknowense 23Met Arg Val Ser Ser Leu Val Ala Ala Leu Ala Thr Gly
Gly Leu Val1 5 10 15Ala Ala Thr Pro Lys Pro Lys Gly Ser Ser Pro Pro
Gly Ala Val Asp 20 25 30Ala Asn Pro Phe Lys Gly Lys Thr Gln Phe Val
Asn Pro Ala Trp Ala 35 40 45Ala Lys Leu Glu Gln Thr Lys Lys Ala Phe
Leu Ala Arg Asn Asp Thr 50 55 60Val Asn Ala Ala Lys Thr Glu Lys Val
Gln Gln Thr Ser Ser Phe Val65 70 75 80Trp Val Ser Arg Ile Ala Glu
Leu Ser Asn Ile Asp Asp Ala Ile Ala 85 90 95Ala Ala Arg Lys Ala Gln
Lys Lys Thr Gly Arg Arg Gln Ile Val Gly 100 105 110Leu Val Leu Tyr
Asn Leu Pro Asp Arg Asp Cys Ser Ala Gly Glu Ser 115 120 125Ala Gly
Glu Leu Ser Ser Asp Lys Asn Gly Leu Glu Ile Tyr Lys Thr 130 135
140Glu Phe Val Lys Pro Phe Ala Asp Lys Val Ala Ala Ala Lys Asp
Leu145 150 155 160Asp Phe Ala Ile Val Leu Glu Pro Asp Ser Leu Ala
Asn Leu Val Thr 165 170 175Asn Leu Gly Ile Glu Phe Cys Ala Asn Ala
Ala Pro Val Tyr Arg Glu 180 185 190Gly Ile Ala Tyr Ala Ile Ser Ser
Leu Gln Gln Pro Asn Val His Leu 195 200 205Tyr Ile Asp Ala Ala His
Gly Gly Trp Leu Gly Trp Asp Asp Asn Leu 210 215 220Pro Leu Ala Ala
Lys Glu Phe Ala Glu Val Val Lys Leu Ala Gly Glu225 230 235 240Gly
Lys Lys Ile Arg Gly Phe Val Thr Asn Val Ser Asn Tyr Asn Pro 245 250
255Phe His Ala Val Val Arg Glu Asn Phe Thr Glu Trp Ser Asn Ser Trp
260 265 270Asp Glu Ser His Tyr Ala Ser Ser Leu Thr Pro Phe Leu Glu
Lys Glu 275 280 285Gly Leu Pro Ala Arg Phe Ile Val Asp Gln Gly Arg
Val Ala Leu Pro 290 295 300Gly Ala Arg Lys Glu Trp Gly Glu Trp Cys
Asn Val Ala Pro Ala Gly305 310 315 320Phe Gly Pro Ala Pro Thr Thr
Arg Val Asn Asn Thr Val Val Asp Ala 325 330 335Leu Val Trp Val Lys
Pro Gly Gly Glu Ser Asp Gly Glu Cys Gly Leu 340 345 350Ala Gly Ala
Pro Lys Ala Gly Gln Trp Phe Asp Glu Tyr Ala Gln Met 355 360 365Leu
Val Glu Asn Ala His Pro Ser Val Val His Lys Trp 370 375
380241143DNAChrysosporium lucknowense 24atgcgcgtct ctagtttggt
cgcggccctt gctaccggtg gtcttgtcgc cgccacgcct 60aagcccaagg ggtcgtcgcc
ccctggggcc gtggacgcga accctttcaa gggcaagacg 120cagttcgtca
acccggcatg ggcggccaag ctggaacaga ccaaaaaggc gttcctggcc
180aggaacgaca ccgtcaatgc cgccaagacg gagaaggtcc agcagaccag
ctcgttcgtc 240tgggtctcga ggatcgccga gctctccaac atcgacgacg
ccatcgcggc tgcccgcaag 300gcgcagaaga agacgggcag gaggcagatc
gtcggcctgg tgctctacaa ccttccggac 360cgcgactgca gcgcgggcga
gagcgcgggc gagctcagca gcgacaagaa cgggctcgag 420atctacaaga
ctgagttcgt caagcccttc gccgacaagg tggcggccgc aaaggacctc
480gacttcgcca tcgtcctgga gcccgactcg ctggccaacc tggtcaccaa
cctgggcatc 540gagttctgcg ccaacgccgc ccccgtctac cgcgagggca
tcgcctatgc catctccagc 600cttcagcagc caaacgtgca cttgtacatc
gatgctgccc acggcggctg gctcggctgg 660gacgacaacc tgccgctggc
cgccaaggag tttgccgagg tggtcaagct tgccggcgag 720ggcaagaaga
tccgcggctt cgtcaccaac gtgtccaact acaacccctt ccacgccgtc
780gtgcgcgaga actttaccga gtggagcaac tcgtgggacg agtctcacta
cgcctcctcg 840ctcacaccgt tcctcgagaa agaggggctg ccggcacgct
tcatcgtcga ccagggtcgc 900gttgccctcc cgggagcccg caaggagtgg
ggtgaatggt gcaacgtggc acccgccgga 960tttggccccg cgcccacgac
cagggtcaac aacaccgtcg tcgatgctct cgtctgggtc 1020aagcctggcg
gcgagagcga cggcgagtgt ggcttggctg gcgcccccaa ggccggccag
1080tggttcgacg agtacgccca gatgctggtc gagaatgccc acccgtctgt
cgtccacaag 1140tgg 1143251683DNAChrysosporium
lucknowenseCDS(76)..(148)CDS(223)..(286)CDS(438)..(700)CDS(780)..(1531)
25atctgccggt tgcccacgtc gccgtcgact gcttgtccgc ttcctacctg cagcctcttt
60cagagaccat caaac atg cgt act ctt acg ttc gtg ctg gca gcc gcc ccg
111 Met Arg Thr Leu Thr Phe Val Leu Ala Ala Ala Pro 1 5 10gtg gct
gtg ctt gcc caa tct cct ctg tgg ggc cag t gtatgtaatt 158Val Ala Val
Leu Ala Gln Ser Pro Leu Trp Gly Gln 15 20gccttactcg gaaaatagtc
accactagag ggacttaagc tcactacttc ctgtttcaca 218atag gc ggc ggt caa
ggc tgg aca ggt ccc acg acc tgc gtt tct ggc 266 Cys Gly Gly Gln Gly
Trp Thr Gly Pro Thr Thr Cys Val Ser Gly 25 30 35gca gta tgc caa ttc
gtc aa gtcagtaact gcttttattt cttttctctc 316Ala Val Cys Gln Phe Val
Asn40 45tgggattacg atttcgtttt gcacttagct tggttctgca tttcattgtt
gtattgttct 376ctttttgtgt gtgagaggtt ttattaccac ctaaaggcca
tttgctaaca aatctcccca 436g t gac tgg tac tcc caa tgc gtg ccc gga
tcg agc aac cct cct acg 483 Asp Trp Tyr Ser Gln Cys Val Pro Gly Ser
Ser Asn Pro Pro Thr 50 55 60ggc acc acc agc agc acc act gga agc acc
ccg gct cct act ggc ggc 531Gly Thr Thr Ser Ser Thr Thr Gly Ser Thr
Pro Ala Pro Thr Gly Gly 65 70 75ggc ggc agc gga acc ggc ctc cac gac
aaa ttc aag gcc aag ggc aag 579Gly Gly Ser Gly Thr Gly Leu His Asp
Lys Phe Lys Ala Lys Gly Lys 80 85 90ctc tac ttc gga acc gag atc gat
cac tac cat ctc aac aac aat gcc 627Leu Tyr Phe Gly Thr Glu Ile Asp
His Tyr His Leu Asn Asn Asn Ala 95 100 105ttg acc aac att gtc aag
aaa gac ttt ggt caa gtc act cac gag aac 675Leu Thr Asn Ile Val Lys
Lys Asp Phe Gly Gln Val Thr His Glu Asn110 115 120 125agc ttg aag
tgg gat gct act gag c gtgagtgacc tctcctcctt 720Ser Leu Lys Trp Asp
Ala Thr Glu 130ctcccgaaca ataatagata attacgagcc ggttcgaggc
tgacattgcg cgattctag 779cg agc cgc aat caa ttc aac ttt gcc aac gcc
gac gcg gtt gtc aac 826Pro Ser Arg Asn Gln Phe Asn Phe Ala Asn Ala
Asp Ala Val Val Asn 135 140 145ttt gcc cag gcc aac ggc aag ctc atc
cgc ggc cac acc ctc ctc tgg 874Phe Ala Gln Ala Asn Gly Lys Leu Ile
Arg Gly His Thr Leu Leu Trp150 155 160 165cac tct cag ctg ccg cag
tgg gtg cag aac atc aac gac cgc aac acc 922His Ser Gln Leu Pro Gln
Trp Val Gln Asn Ile Asn Asp Arg Asn Thr 170 175 180ttg acc cag gtc
atc gag aac cac gtc acc acc ctt gtc act cgc tac 970Leu Thr Gln Val
Ile Glu Asn His Val Thr Thr Leu Val Thr Arg Tyr 185 190 195aag ggc
aag atc ctc cac tgg gac gtc gtt aac gag atc ttt gcc gag 1018Lys Gly
Lys Ile Leu His Trp Asp Val Val Asn Glu Ile Phe Ala Glu 200 205
210gac ggc tcg ctc cgc gac agc gtc ttc agc cgc gtc ctc ggc gag gac
1066Asp Gly Ser Leu Arg Asp Ser Val Phe Ser Arg Val Leu Gly Glu Asp
215 220 225ttt gtc ggc atc gcc ttc cgc gcc gcc cgc gcc gcc gat ccc
aac gcc 1114Phe Val Gly Ile Ala Phe Arg Ala Ala Arg Ala Ala Asp Pro
Asn Ala230 235 240 245aag ctc tac atc aac gac tac aac ctc gac att
gcc aac tac gcc aag 1162Lys Leu Tyr Ile Asn Asp Tyr Asn Leu Asp Ile
Ala Asn Tyr Ala Lys 250 255 260gtg acc cgg ggc atg gtc gag aag gtc
aac aag tgg atc gcc cag ggc 1210Val Thr Arg Gly Met Val Glu Lys Val
Asn Lys Trp Ile Ala Gln Gly 265 270 275atc ccg atc gac ggc atc ggc
acc cag tgc cac ctg gcc ggg ccc ggc 1258Ile Pro Ile Asp Gly Ile Gly
Thr Gln Cys His Leu Ala Gly Pro Gly 280 285 290ggg tgg aac acg gcc
gcc ggc gtc ccc gac gcc ctc aag gcc ctc gcc 1306Gly Trp Asn Thr Ala
Ala Gly Val Pro Asp Ala Leu Lys Ala Leu Ala 295 300 305gcg gcc aac
gtc aag gag atc gcc atc acc gag ctc gac atc gcc ggc 1354Ala Ala Asn
Val Lys Glu Ile Ala Ile Thr Glu Leu Asp Ile Ala Gly310 315 320
325gcc tcc gcc aac gac tac ctc acc gtc atg aac gcc tgc ctc cag gtc
1402Ala Ser Ala Asn Asp Tyr Leu Thr Val Met Asn Ala Cys Leu Gln Val
330 335 340tcc aag tgc gtc ggc atc acc gtc tgg ggc gtc tct gac aag
gac agc 1450Ser Lys Cys Val Gly Ile Thr Val Trp Gly Val Ser Asp Lys
Asp Ser 345 350 355tgg agg tcg agc agc aac ccg ctc ctc ttc gac agc
aac tac cag cca 1498Trp Arg Ser Ser Ser Asn Pro Leu Leu Phe Asp Ser
Asn Tyr Gln Pro 360 365 370aag gcg gca tac aat gct ctg att aat gcc
ttg taagaggagg tatattattt 1551Lys Ala Ala Tyr Asn Ala Leu Ile Asn
Ala Leu 375 380ttagaggcaa tgaagctagg aggaaagagg ggaagtgagg
taattagcta ggacaggcaa 1611atctagcagc aattataagt caacactata
taaaatattc ctataatggc ttgtgcttcg 1671gtgtgcaaaa aa
168326384PRTChrysosporium lucknowense 26Met Arg Thr Leu Thr Phe Val
Leu Ala Ala Ala Pro Val Ala Val Leu1 5 10 15Ala Gln Ser Pro Leu Trp
Gly Gln Cys Gly Gly Gln Gly Trp Thr Gly 20 25 30Pro Thr Thr Cys Val
Ser Gly Ala Val Cys Gln Phe Val Asn Asp Trp 35 40 45Tyr Ser Gln Cys
Val Pro Gly Ser Ser Asn Pro Pro Thr Gly Thr Thr 50 55 60Ser Ser Thr
Thr Gly Ser Thr Pro Ala Pro Thr Gly Gly Gly Gly Ser65 70 75 80Gly
Thr Gly Leu His Asp Lys Phe Lys Ala Lys Gly Lys Leu Tyr Phe 85 90
95Gly Thr Glu Ile Asp His Tyr His Leu Asn Asn Asn Ala Leu Thr Asn
100 105 110Ile Val Lys Lys Asp Phe Gly Gln Val Thr His Glu Asn Ser
Leu Lys 115 120 125Trp Asp Ala Thr Glu Pro Ser Arg Asn Gln Phe Asn
Phe Ala Asn Ala 130 135 140Asp Ala Val Val Asn Phe Ala Gln Ala Asn
Gly Lys Leu Ile Arg Gly145 150 155 160His Thr Leu Leu Trp His Ser
Gln Leu Pro Gln Trp Val Gln Asn Ile 165 170 175Asn Asp Arg Asn Thr
Leu Thr Gln Val Ile Glu Asn His Val Thr Thr 180 185 190Leu Val Thr
Arg Tyr Lys Gly Lys Ile Leu His Trp Asp Val Val Asn 195 200 205Glu
Ile Phe Ala Glu Asp Gly Ser Leu Arg Asp Ser Val Phe Ser Arg 210 215
220Val Leu Gly Glu Asp Phe Val Gly Ile Ala Phe Arg Ala Ala Arg
Ala225 230 235 240Ala Asp Pro Asn Ala Lys Leu Tyr Ile Asn Asp Tyr
Asn Leu Asp Ile 245 250 255Ala Asn Tyr Ala Lys Val Thr Arg Gly Met
Val Glu Lys Val Asn Lys 260 265 270Trp Ile Ala Gln Gly Ile Pro Ile
Asp Gly Ile Gly Thr Gln Cys His 275 280 285Leu Ala Gly Pro Gly Gly
Trp Asn Thr Ala Ala Gly Val Pro Asp Ala 290 295 300Leu Lys Ala Leu
Ala Ala Ala Asn Val Lys Glu Ile Ala Ile Thr Glu305 310 315 320Leu
Asp Ile Ala Gly Ala Ser Ala Asn Asp Tyr Leu Thr Val Met Asn 325 330
335Ala Cys Leu Gln Val Ser Lys Cys Val Gly Ile Thr Val Trp Gly Val
340 345 350Ser Asp Lys Asp Ser Trp Arg Ser Ser Ser Asn Pro Leu Leu
Phe Asp 355 360 365Ser Asn Tyr Gln Pro Lys Ala Ala Tyr Asn Ala Leu
Ile Asn Ala Leu 370 375 380271152DNAChrysosporium lucknowense
27atgcgtactc ttacgttcgt gctggcagcc gccccggtgg ctgtgcttgc ccaatctcct
60ctgtggggcc agtgcggcgg tcaaggctgg acaggtccca cgacctgcgt ttctggcgca
120gtatgccaat tcgtcaatga ctggtactcc caatgcgtgc ccggatcgag
caaccctcct 180acgggcacca ccagcagcac cactggaagc accccggctc
ctactggcgg cggcggcagc 240ggaaccggcc tccacgacaa attcaaggcc
aagggcaagc tctacttcgg aaccgagatc 300gatcactacc atctcaacaa
caatgccttg accaacattg tcaagaaaga ctttggtcaa 360gtcactcacg
agaacagctt gaagtgggat gctactgagc cgagccgcaa tcaattcaac
420tttgccaacg ccgacgcggt tgtcaacttt gcccaggcca acggcaagct
catccgcggc 480cacaccctcc tctggcactc tcagctgccg cagtgggtgc
agaacatcaa cgaccgcaac 540accttgaccc aggtcatcga gaaccacgtc
accacccttg tcactcgcta caagggcaag 600atcctccact gggacgtcgt
taacgagatc tttgccgagg acggctcgct ccgcgacagc 660gtcttcagcc
gcgtcctcgg cgaggacttt gtcggcatcg ccttccgcgc cgcccgcgcc
720gccgatccca acgccaagct ctacatcaac gactacaacc tcgacattgc
caactacgcc 780aaggtgaccc ggggcatggt cgagaaggtc aacaagtgga
tcgcccaggg catcccgatc 840gacggcatcg
gcacccagtg ccacctggcc gggcccggcg ggtggaacac ggccgccggc
900gtccccgacg ccctcaaggc cctcgccgcg gccaacgtca aggagatcgc
catcaccgag 960ctcgacatcg ccggcgcctc cgccaacgac tacctcaccg
tcatgaacgc ctgcctccag 1020gtctccaagt gcgtcggcat caccgtctgg
ggcgtctctg acaaggacag ctggaggtcg 1080agcagcaacc cgctcctctt
cgacagcaac taccagccaa aggcggcata caatgctctg 1140attaatgcct tg
1152281194DNAChrysosporium lucknowenseCDS(76)..(367)CDS(480)..(850)
28atgaatcaga agacgacact gggcaactaa acagcttgca gcagagtttt gtgccttgca
60taggccctcg atatc atg gtc tcg ttc act ctc ctc ctc acg gtc atc gcc
111 Met Val Ser Phe Thr Leu Leu Leu Thr Val Ile Ala 1 5 10gct gcg
gtg acg acg gcc agc cct ctc gag gtg gtc aag cgc ggc atc 159Ala Ala
Val Thr Thr Ala Ser Pro Leu Glu Val Val Lys Arg Gly Ile 15 20 25cag
ccg ggc acg ggc acc cac gag ggg tac ttc tac tcg ttc tgg acc 207Gln
Pro Gly Thr Gly Thr His Glu Gly Tyr Phe Tyr Ser Phe Trp Thr 30 35
40gac ggc cgt ggc tcg gtc gac ttc aac ccc ggg ccc cgc ggc tcg tac
255Asp Gly Arg Gly Ser Val Asp Phe Asn Pro Gly Pro Arg Gly Ser
Tyr45 50 55 60agc gtc acc tgg aac aac gtc aac aac tgg gtt ggc ggc
aag ggc tgg 303Ser Val Thr Trp Asn Asn Val Asn Asn Trp Val Gly Gly
Lys Gly Trp 65 70 75aac ccg ggc ccg ccg cgc aag att gcg tac aac ggc
acc tgg aac aac 351Asn Pro Gly Pro Pro Arg Lys Ile Ala Tyr Asn Gly
Thr Trp Asn Asn 80 85 90tac aac gtg aac agc t gtgcgttgtc ctcctctttc
tccctttcgc ttgttttcct 407Tyr Asn Val Asn Ser 95tgatgattgg
gatccatttt aaaagagaag gaaaaaaaaa acaaaggaaa atagaagata
467actaacgcca ag ct ctg gca gac ctc gcc ctg tac ggc tgg act cgc aac
517 Ser Leu Ala Asp Leu Ala Leu Tyr Gly Trp Thr Arg Asn 100 105
110ccg ctg gtc gag tat tac atc gtg gag gca tac ggc acg tac aac ccc
565Pro Leu Val Glu Tyr Tyr Ile Val Glu Ala Tyr Gly Thr Tyr Asn Pro
115 120 125tcg tcg ggc acg gcg cgg ctg ggc acc atc gag gac gac ggc
ggc gtg 613Ser Ser Gly Thr Ala Arg Leu Gly Thr Ile Glu Asp Asp Gly
Gly Val 130 135 140tac gac atc tac aag acg acg cgg tac aac cag ccg
tcc atc gag ggg 661Tyr Asp Ile Tyr Lys Thr Thr Arg Tyr Asn Gln Pro
Ser Ile Glu Gly 145 150 155acc tcc acc ttc gac cag tac tgg tcc gtc
cgc cgc cag aag cgc gtc 709Thr Ser Thr Phe Asp Gln Tyr Trp Ser Val
Arg Arg Gln Lys Arg Val 160 165 170ggc ggc act atc gac acg ggc aag
cac ttt gac gag tgg aag cgc cag 757Gly Gly Thr Ile Asp Thr Gly Lys
His Phe Asp Glu Trp Lys Arg Gln175 180 185 190ggc aac ctc cag ctc
ggc acc tgg aac tac atg atc atg gcc acc gag 805Gly Asn Leu Gln Leu
Gly Thr Trp Asn Tyr Met Ile Met Ala Thr Glu 195 200 205ggc tac cag
agc tct ggt tcg gcc act atc gag gtc cgg gag gcc 850Gly Tyr Gln Ser
Ser Gly Ser Ala Thr Ile Glu Val Arg Glu Ala 210 215 220taaagaagcc
aggcgccttt cttttgtttt gcaggagggg gtagaggggg ggggggaggg
910aaaacgaaaa gtagcagggt ggttttatgc cggcagccgt gggccattcg
agtgcaacct 970gtatctctct ctctcccaag tctccgggct ccttctcaga
gaacttcaat atgtctgggg 1030acaaaccacc ttgtgaaata caacggtaat
tatctaagtt tgagtgccct atcgtatgct 1090tctgaaaatt tcctgctcct
tgatacaagt cggtttgagc cgagccaatg agactgtgtc 1150gattgataga
ggccctgaag gatcaagcgc gatgcaacaa ttaa 119429221PRTChrysosporium
lucknowense 29Met Val Ser Phe Thr Leu Leu Leu Thr Val Ile Ala Ala
Ala Val Thr1 5 10 15Thr Ala Ser Pro Leu Glu Val Val Lys Arg Gly Ile
Gln Pro Gly Thr 20 25 30Gly Thr His Glu Gly Tyr Phe Tyr Ser Phe Trp
Thr Asp Gly Arg Gly 35 40 45Ser Val Asp Phe Asn Pro Gly Pro Arg Gly
Ser Tyr Ser Val Thr Trp 50 55 60Asn Asn Val Asn Asn Trp Val Gly Gly
Lys Gly Trp Asn Pro Gly Pro65 70 75 80Pro Arg Lys Ile Ala Tyr Asn
Gly Thr Trp Asn Asn Tyr Asn Val Asn 85 90 95Ser Ser Leu Ala Asp Leu
Ala Leu Tyr Gly Trp Thr Arg Asn Pro Leu 100 105 110Val Glu Tyr Tyr
Ile Val Glu Ala Tyr Gly Thr Tyr Asn Pro Ser Ser 115 120 125Gly Thr
Ala Arg Leu Gly Thr Ile Glu Asp Asp Gly Gly Val Tyr Asp 130 135
140Ile Tyr Lys Thr Thr Arg Tyr Asn Gln Pro Ser Ile Glu Gly Thr
Ser145 150 155 160Thr Phe Asp Gln Tyr Trp Ser Val Arg Arg Gln Lys
Arg Val Gly Gly 165 170 175Thr Ile Asp Thr Gly Lys His Phe Asp Glu
Trp Lys Arg Gln Gly Asn 180 185 190Leu Gln Leu Gly Thr Trp Asn Tyr
Met Ile Met Ala Thr Glu Gly Tyr 195 200 205Gln Ser Ser Gly Ser Ala
Thr Ile Glu Val Arg Glu Ala 210 215 22030663DNAChrysosporium
lucknowense 30atggtctcgt tcactctcct cctcacggtc atcgccgctg
cggtgacgac ggccagccct 60ctcgaggtgg tcaagcgcgg catccagccg ggcacgggca
cccacgaggg gtacttctac 120tcgttctgga ccgacggccg tggctcggtc
gacttcaacc ccgggccccg cggctcgtac 180agcgtcacct ggaacaacgt
caacaactgg gttggcggca agggctggaa cccgggcccg 240ccgcgcaaga
ttgcgtacaa cggcacctgg aacaactaca acgtgaacag ctctctggca
300gacctcgccc tgtacggctg gactcgcaac ccgctggtcg agtattacat
cgtggaggca 360tacggcacgt acaacccctc gtcgggcacg gcgcggctgg
gcaccatcga ggacgacggc 420ggcgtgtacg acatctacaa gacgacgcgg
tacaaccagc cgtccatcga ggggacctcc 480accttcgacc agtactggtc
cgtccgccgc cagaagcgcg tcggcggcac tatcgacacg 540ggcaagcact
ttgacgagtg gaagcgccag ggcaacctcc agctcggcac ctggaactac
600atgatcatgg ccaccgaggg ctaccagagc tctggttcgg ccactatcga
ggtccgggag 660gcc 663311598DNAChrysosporium
lucknowenseCDS(69)..(398)CDS(467)..(1375) 31tggctctcca acacccctgc
ccctcagcct cccgctcgtc agtgttgtat cctcttctgg 60ccggtatc atg cac tcc
aaa gct ttc ttg gca gcg ctt ctt gcg cct gcc 110 Met His Ser Lys Ala
Phe Leu Ala Ala Leu Leu Ala Pro Ala 1 5 10gtc tca ggg caa ctg aac
gac ctc gcc gtc agg gct gga ctc aag tac 158Val Ser Gly Gln Leu Asn
Asp Leu Ala Val Arg Ala Gly Leu Lys Tyr15 20 25 30ttt ggt act gct
ctt agc gag agc gtc atc aac agt gat act cgg tat 206Phe Gly Thr Ala
Leu Ser Glu Ser Val Ile Asn Ser Asp Thr Arg Tyr 35 40 45gct gcc atc
ctc agc gac aag agc atg ttc ggc cag ctc gtc ccc gag 254Ala Ala Ile
Leu Ser Asp Lys Ser Met Phe Gly Gln Leu Val Pro Glu 50 55 60aat ggc
atg aag tgg gat gct act gag ccg tcc cgt ggc cag ttc aac 302Asn Gly
Met Lys Trp Asp Ala Thr Glu Pro Ser Arg Gly Gln Phe Asn 65 70 75tac
gcc tcg ggc gac atc acg gcc aac acg gcc aag aag aat ggc cag 350Tyr
Ala Ser Gly Asp Ile Thr Ala Asn Thr Ala Lys Lys Asn Gly Gln 80 85
90ggc atg cgt tgc cac acc atg gtc tgg tac agc cag ctc ccg agc tgg
398Gly Met Arg Cys His Thr Met Val Trp Tyr Ser Gln Leu Pro Ser
Trp95 100 105 110gtgagtgcag cccccctcat cacatccacc agtctgcagt
catactcacg agtataataa 458caccgcag gtc tcc tcg ggc tcg tgg acc agg
gac tcg ctc acc tcg gtc 508 Val Ser Ser Gly Ser Trp Thr Arg Asp Ser
Leu Thr Ser Val 115 120atc gag acg cac atg aac aac gtc atg ggc cac
tac aag ggc caa tgc 556Ile Glu Thr His Met Asn Asn Val Met Gly His
Tyr Lys Gly Gln Cys125 130 135 140tac gcc tgg gat gtc atc aac gag
gcc atc aat gac gac ggc aac tcc 604Tyr Ala Trp Asp Val Ile Asn Glu
Ala Ile Asn Asp Asp Gly Asn Ser 145 150 155tgg cgc gac aac gtc ttt
ctc cgg acc ttt ggg acc gac tac ttc gcc 652Trp Arg Asp Asn Val Phe
Leu Arg Thr Phe Gly Thr Asp Tyr Phe Ala 160 165 170ctg tcc ttc aac
cta gcc aag aag gcc gat ccc gat acc aag ctg tac 700Leu Ser Phe Asn
Leu Ala Lys Lys Ala Asp Pro Asp Thr Lys Leu Tyr 175 180 185tac aac
gac tac aac ctc gag tac aac cag gcc aag acg gac cgc gct 748Tyr Asn
Asp Tyr Asn Leu Glu Tyr Asn Gln Ala Lys Thr Asp Arg Ala 190 195
200gtt gag ctc gtc aag atg gtc cag gcc gcc ggc gcg ccc atc gac ggt
796Val Glu Leu Val Lys Met Val Gln Ala Ala Gly Ala Pro Ile Asp
Gly205 210 215 220gtc ggc ttc cag ggc cac ctc att gtc ggc tcg acc
ccg acg cgc tcg 844Val Gly Phe Gln Gly His Leu Ile Val Gly Ser Thr
Pro Thr Arg Ser 225 230 235cag ctg gcc acc gcc ctc cag cgc ttc acc
gcg ctc ggc ctc gag gtc 892Gln Leu Ala Thr Ala Leu Gln Arg Phe Thr
Ala Leu Gly Leu Glu Val 240 245 250gcc tac acc gag ctc gac atc cgc
cac tcg agc ctg ccg gcc tct tcg 940Ala Tyr Thr Glu Leu Asp Ile Arg
His Ser Ser Leu Pro Ala Ser Ser 255 260 265tcg gcg ctc gcg acc cag
ggc aac gac ttc gcc aac gtg gtc ggc tct 988Ser Ala Leu Ala Thr Gln
Gly Asn Asp Phe Ala Asn Val Val Gly Ser 270 275 280tgc ctc gac acc
gcc ggc tgc gtc ggc gtc acc gtc tgg ggc ttc acc 1036Cys Leu Asp Thr
Ala Gly Cys Val Gly Val Thr Val Trp Gly Phe Thr285 290 295 300gat
gcg cac tcg tgg atc ccg aac acg ttc ccc ggc cag ggc gac gcc 1084Asp
Ala His Ser Trp Ile Pro Asn Thr Phe Pro Gly Gln Gly Asp Ala 305 310
315ctg atc tac gac agc aac tac aac aag aag ccc gcg tgg acc tcg atc
1132Leu Ile Tyr Asp Ser Asn Tyr Asn Lys Lys Pro Ala Trp Thr Ser Ile
320 325 330tcg tcc gtc ctg gcc gcc aag gcc acc ggc gcc ccg ccc gcc
tcg tcc 1180Ser Ser Val Leu Ala Ala Lys Ala Thr Gly Ala Pro Pro Ala
Ser Ser 335 340 345tcc acc acc ctc gtc acc atc acc acc cct ccg ccg
gca tcc acc acc 1228Ser Thr Thr Leu Val Thr Ile Thr Thr Pro Pro Pro
Ala Ser Thr Thr 350 355 360gcc tcc tcc tcc tcc agt gcc acg ccc acg
agc gtc ccg acg cag acg 1276Ala Ser Ser Ser Ser Ser Ala Thr Pro Thr
Ser Val Pro Thr Gln Thr365 370 375 380agg tgg gga cag tgc ggc ggc
atc gga tgg acg ggg ccg acc cag tgc 1324Arg Trp Gly Gln Cys Gly Gly
Ile Gly Trp Thr Gly Pro Thr Gln Cys 385 390 395gag agc cca tgg acc
tgc cag aag ctg aac gac tgg tac tgg cag tgc 1372Glu Ser Pro Trp Thr
Cys Gln Lys Leu Asn Asp Trp Tyr Trp Gln Cys 400 405 410ctg
taagggacga ggtcccgctg agggcgagtc ttctggattg ttgaagttgg
1425Leucagcataatt tggtgaggtt cctagacaga gaccatcggg ggccgatcca
ggaaacactg 1485gctgcatccg accgggattt tctgacttct tctgcatgta
ttgaaggttg tgttgagaac 1545caactgtaaa taagggacca gcagatcaaa
gctggaggtt caagcagaat aaa 159832413PRTChrysosporium lucknowense
32Met His Ser Lys Ala Phe Leu Ala Ala Leu Leu Ala Pro Ala Val Ser1
5 10 15Gly Gln Leu Asn Asp Leu Ala Val Arg Ala Gly Leu Lys Tyr Phe
Gly 20 25 30Thr Ala Leu Ser Glu Ser Val Ile Asn Ser Asp Thr Arg Tyr
Ala Ala 35 40 45Ile Leu Ser Asp Lys Ser Met Phe Gly Gln Leu Val Pro
Glu Asn Gly 50 55 60Met Lys Trp Asp Ala Thr Glu Pro Ser Arg Gly Gln
Phe Asn Tyr Ala65 70 75 80Ser Gly Asp Ile Thr Ala Asn Thr Ala Lys
Lys Asn Gly Gln Gly Met 85 90 95Arg Cys His Thr Met Val Trp Tyr Ser
Gln Leu Pro Ser Trp Val Ser 100 105 110Ser Gly Ser Trp Thr Arg Asp
Ser Leu Thr Ser Val Ile Glu Thr His 115 120 125Met Asn Asn Val Met
Gly His Tyr Lys Gly Gln Cys Tyr Ala Trp Asp 130 135 140Val Ile Asn
Glu Ala Ile Asn Asp Asp Gly Asn Ser Trp Arg Asp Asn145 150 155
160Val Phe Leu Arg Thr Phe Gly Thr Asp Tyr Phe Ala Leu Ser Phe Asn
165 170 175Leu Ala Lys Lys Ala Asp Pro Asp Thr Lys Leu Tyr Tyr Asn
Asp Tyr 180 185 190Asn Leu Glu Tyr Asn Gln Ala Lys Thr Asp Arg Ala
Val Glu Leu Val 195 200 205Lys Met Val Gln Ala Ala Gly Ala Pro Ile
Asp Gly Val Gly Phe Gln 210 215 220Gly His Leu Ile Val Gly Ser Thr
Pro Thr Arg Ser Gln Leu Ala Thr225 230 235 240Ala Leu Gln Arg Phe
Thr Ala Leu Gly Leu Glu Val Ala Tyr Thr Glu 245 250 255Leu Asp Ile
Arg His Ser Ser Leu Pro Ala Ser Ser Ser Ala Leu Ala 260 265 270Thr
Gln Gly Asn Asp Phe Ala Asn Val Val Gly Ser Cys Leu Asp Thr 275 280
285Ala Gly Cys Val Gly Val Thr Val Trp Gly Phe Thr Asp Ala His Ser
290 295 300Trp Ile Pro Asn Thr Phe Pro Gly Gln Gly Asp Ala Leu Ile
Tyr Asp305 310 315 320Ser Asn Tyr Asn Lys Lys Pro Ala Trp Thr Ser
Ile Ser Ser Val Leu 325 330 335Ala Ala Lys Ala Thr Gly Ala Pro Pro
Ala Ser Ser Ser Thr Thr Leu 340 345 350Val Thr Ile Thr Thr Pro Pro
Pro Ala Ser Thr Thr Ala Ser Ser Ser 355 360 365Ser Ser Ala Thr Pro
Thr Ser Val Pro Thr Gln Thr Arg Trp Gly Gln 370 375 380Cys Gly Gly
Ile Gly Trp Thr Gly Pro Thr Gln Cys Glu Ser Pro Trp385 390 395
400Thr Cys Gln Lys Leu Asn Asp Trp Tyr Trp Gln Cys Leu 405
410331239DNAChrysosporium lucknowense 33atgcactcca aagctttctt
ggcagcgctt cttgcgcctg ccgtctcagg gcaactgaac 60gacctcgccg tcagggctgg
actcaagtac tttggtactg ctcttagcga gagcgtcatc 120aacagtgata
ctcggtatgc tgccatcctc agcgacaaga gcatgttcgg ccagctcgtc
180cccgagaatg gcatgaagtg ggatgctact gagccgtccc gtggccagtt
caactacgcc 240tcgggcgaca tcacggccaa cacggccaag aagaatggcc
agggcatgcg ttgccacacc 300atggtctggt acagccagct cccgagctgg
gtctcctcgg gctcgtggac cagggactcg 360ctcacctcgg tcatcgagac
gcacatgaac aacgtcatgg gccactacaa gggccaatgc 420tacgcctggg
atgtcatcaa cgaggccatc aatgacgacg gcaactcctg gcgcgacaac
480gtctttctcc ggacctttgg gaccgactac ttcgccctgt ccttcaacct
agccaagaag 540gccgatcccg ataccaagct gtactacaac gactacaacc
tcgagtacaa ccaggccaag 600acggaccgcg ctgttgagct cgtcaagatg
gtccaggccg ccggcgcgcc catcgacggt 660gtcggcttcc agggccacct
cattgtcggc tcgaccccga cgcgctcgca gctggccacc 720gccctccagc
gcttcaccgc gctcggcctc gaggtcgcct acaccgagct cgacatccgc
780cactcgagcc tgccggcctc ttcgtcggcg ctcgcgaccc agggcaacga
cttcgccaac 840gtggtcggct cttgcctcga caccgccggc tgcgtcggcg
tcaccgtctg gggcttcacc 900gatgcgcact cgtggatccc gaacacgttc
cccggccagg gcgacgccct gatctacgac 960agcaactaca acaagaagcc
cgcgtggacc tcgatctcgt ccgtcctggc cgccaaggcc 1020accggcgccc
cgcccgcctc gtcctccacc accctcgtca ccatcaccac ccctccgccg
1080gcatccacca ccgcctcctc ctcctccagt gccacgccca cgagcgtccc
gacgcagacg 1140aggtggggac agtgcggcgg catcggatgg acggggccga
cccagtgcga gagcccatgg 1200acctgccaga agctgaacga ctggtactgg
cagtgcctg 1239341664DNAChrysosporium
lucknowenseCDS(136)..(393)CDS(496)..(1131)CDS(1256)..(1486)
34gattaaatag ggtcgctctg ctcgccgctc tcttgggttt gctctcgtca agcatcctcc
60cccacctcgc gccgcaagta cggtacaggc ctgcaaccga ggatcgagcc cgtcaaagaa
120cgagtcaaaa tcaag atg cat ctc tcc tcg tct ctc ctc ctc ctc gcc gcc
171 Met His Leu Ser Ser Ser Leu Leu Leu Leu Ala Ala 1 5 10ttg ccc
ctg ggc atc gcc ggc aag ggc aag ggc cac ggc cac ggc ccc 219Leu Pro
Leu Gly Ile Ala Gly Lys Gly Lys Gly His Gly His Gly Pro 15 20 25cat
acc ggg ctc cac acc ctc gcc aag cag gcc ggc ctc aag tac ttc 267His
Thr Gly Leu His Thr Leu Ala Lys Gln Ala Gly Leu Lys Tyr Phe 30 35
40ggc tct gcc acc gac tct ccc ggc cag cgt gag cgc gcc ggc tac gag
315Gly Ser Ala Thr Asp Ser Pro Gly Gln Arg Glu Arg Ala Gly Tyr
Glu45 50 55 60gac aag tac gcc cag tac gac cag atc atg tgg aag tcg
ggc gag ttc 363Asp Lys Tyr Ala Gln Tyr Asp Gln Ile Met Trp Lys Ser
Gly Glu Phe 65 70 75ggc ctg acg acc ccg acc aac ggc caa aag
gttcgacatt agacctcctg 413Gly Leu Thr Thr Pro Thr Asn Gly Gln Lys 80
85cctgcgctcc cttcttaacc ctagacctgc tttcgccccg ggaaggccgc ctcgttgcta
473accagttctc ggctcattct ag tgg ctg ttt act gag ccc gag cgt ggc gtg
525 Trp Leu Phe Thr Glu Pro Glu Arg Gly Val 90 95ttc aac ttc acc
gag
ggt gac atc gtg acg aac ctg gcc cgg aag cac 573Phe Asn Phe Thr Glu
Gly Asp Ile Val Thr Asn Leu Ala Arg Lys His 100 105 110ggt ttc atg
cag cgg tgc cac gcg ctc gtc tgg cac agc cag ctc gcc 621Gly Phe Met
Gln Arg Cys His Ala Leu Val Trp His Ser Gln Leu Ala 115 120 125cct
tgg gtc gag tcg acc gag tgg acg ccc gag gag ctg cgc cag gtc 669Pro
Trp Val Glu Ser Thr Glu Trp Thr Pro Glu Glu Leu Arg Gln Val 130 135
140att gtc aac cac atc acc cac gtg gcc ggc tac tac aag ggc aag tgc
717Ile Val Asn His Ile Thr His Val Ala Gly Tyr Tyr Lys Gly Lys
Cys145 150 155 160tat gcc tgg gac gtc gtc aac gag gcc ctg aac gag
gac ggc acc tac 765Tyr Ala Trp Asp Val Val Asn Glu Ala Leu Asn Glu
Asp Gly Thr Tyr 165 170 175cgc gag tcc gtc ttc tac aag gtg ctc ggc
gag gac tac atc aag ctg 813Arg Glu Ser Val Phe Tyr Lys Val Leu Gly
Glu Asp Tyr Ile Lys Leu 180 185 190gcc ttc gag acg gcc gcc aag gtc
gac ccc cac gcc aag ctc tac tac 861Ala Phe Glu Thr Ala Ala Lys Val
Asp Pro His Ala Lys Leu Tyr Tyr 195 200 205aac gac tac aac ctc gag
tcc ccc agc gcc aag acc gag ggc gcc aag 909Asn Asp Tyr Asn Leu Glu
Ser Pro Ser Ala Lys Thr Glu Gly Ala Lys 210 215 220cgc atc gtc aag
atg ctc aag gac gcc ggc atc cgc atc gac ggc gtc 957Arg Ile Val Lys
Met Leu Lys Asp Ala Gly Ile Arg Ile Asp Gly Val225 230 235 240ggc
ctg cag gcc cac ctc gtc gcc gag agc cac ccg acc ctc gac gag 1005Gly
Leu Gln Ala His Leu Val Ala Glu Ser His Pro Thr Leu Asp Glu 245 250
255cac atc gat gcc atc aag ggc ttc acc gag ctc ggc gtc gag gtc gcc
1053His Ile Asp Ala Ile Lys Gly Phe Thr Glu Leu Gly Val Glu Val Ala
260 265 270ctg acc gag ctc gac atc cgc ctc tcc atc ccg gcc aac gcc
acc aac 1101Leu Thr Glu Leu Asp Ile Arg Leu Ser Ile Pro Ala Asn Ala
Thr Asn 275 280 285ctc gcc cag cag agg gag gcg tac aag aac
gtgagttttt tttattttat 1151Leu Ala Gln Gln Arg Glu Ala Tyr Lys Asn
290 295ttttatcttc cctcttgttc ttccatccaa cccccccccc ttttgacggt
gcccagtcag 1211attgctcttg atcgatgcta acctttgttc cttctctttg atag gtc
gtc ggc gct 1267 Val Val Gly Ala 300tgc gtc cag gtt cgc ggc tgc att
ggc gtg gag atc tgg gac ttc tat 1315Cys Val Gln Val Arg Gly Cys Ile
Gly Val Glu Ile Trp Asp Phe Tyr 305 310 315gac ccc ttc agc tgg gtc
cct gcc acc ttc ccc ggc cag ggc gcc ccc 1363Asp Pro Phe Ser Trp Val
Pro Ala Thr Phe Pro Gly Gln Gly Ala Pro 320 325 330ctg ctc tgg ttc
gag gac ttt tcc aag cac ccc gcc tac gac ggc gtc 1411Leu Leu Trp Phe
Glu Asp Phe Ser Lys His Pro Ala Tyr Asp Gly Val335 340 345 350gtc
gag gcc ctg acc aac agg acc acg ggc ggg tgc aag ggc aag ggc 1459Val
Glu Ala Leu Thr Asn Arg Thr Thr Gly Gly Cys Lys Gly Lys Gly 355 360
365aag ggc aag ggc aag gtt tgg aag gcc taataagtgg tgttgtgcgg
1506Lys Gly Lys Gly Lys Val Trp Lys Ala 370 375gtcgtgtaga
gatttaggga gaggattggg tccttcggtt cttgtatcta attattattc
1566cgtactatta attgcatttc gatctaaagt tagtgttgag cgggtcgccg
cagtcatgcg 1626acatgcatgg cacgctaagc tcggccgagc tctcggga
166435375PRTChrysosporium lucknowense 35Met His Leu Ser Ser Ser Leu
Leu Leu Leu Ala Ala Leu Pro Leu Gly1 5 10 15Ile Ala Gly Lys Gly Lys
Gly His Gly His Gly Pro His Thr Gly Leu 20 25 30His Thr Leu Ala Lys
Gln Ala Gly Leu Lys Tyr Phe Gly Ser Ala Thr 35 40 45Asp Ser Pro Gly
Gln Arg Glu Arg Ala Gly Tyr Glu Asp Lys Tyr Ala 50 55 60Gln Tyr Asp
Gln Ile Met Trp Lys Ser Gly Glu Phe Gly Leu Thr Thr65 70 75 80Pro
Thr Asn Gly Gln Lys Trp Leu Phe Thr Glu Pro Glu Arg Gly Val 85 90
95Phe Asn Phe Thr Glu Gly Asp Ile Val Thr Asn Leu Ala Arg Lys His
100 105 110Gly Phe Met Gln Arg Cys His Ala Leu Val Trp His Ser Gln
Leu Ala 115 120 125Pro Trp Val Glu Ser Thr Glu Trp Thr Pro Glu Glu
Leu Arg Gln Val 130 135 140Ile Val Asn His Ile Thr His Val Ala Gly
Tyr Tyr Lys Gly Lys Cys145 150 155 160Tyr Ala Trp Asp Val Val Asn
Glu Ala Leu Asn Glu Asp Gly Thr Tyr 165 170 175Arg Glu Ser Val Phe
Tyr Lys Val Leu Gly Glu Asp Tyr Ile Lys Leu 180 185 190Ala Phe Glu
Thr Ala Ala Lys Val Asp Pro His Ala Lys Leu Tyr Tyr 195 200 205Asn
Asp Tyr Asn Leu Glu Ser Pro Ser Ala Lys Thr Glu Gly Ala Lys 210 215
220Arg Ile Val Lys Met Leu Lys Asp Ala Gly Ile Arg Ile Asp Gly
Val225 230 235 240Gly Leu Gln Ala His Leu Val Ala Glu Ser His Pro
Thr Leu Asp Glu 245 250 255His Ile Asp Ala Ile Lys Gly Phe Thr Glu
Leu Gly Val Glu Val Ala 260 265 270Leu Thr Glu Leu Asp Ile Arg Leu
Ser Ile Pro Ala Asn Ala Thr Asn 275 280 285Leu Ala Gln Gln Arg Glu
Ala Tyr Lys Asn Val Val Gly Ala Cys Val 290 295 300Gln Val Arg Gly
Cys Ile Gly Val Glu Ile Trp Asp Phe Tyr Asp Pro305 310 315 320Phe
Ser Trp Val Pro Ala Thr Phe Pro Gly Gln Gly Ala Pro Leu Leu 325 330
335Trp Phe Glu Asp Phe Ser Lys His Pro Ala Tyr Asp Gly Val Val Glu
340 345 350Ala Leu Thr Asn Arg Thr Thr Gly Gly Cys Lys Gly Lys Gly
Lys Gly 355 360 365Lys Gly Lys Val Trp Lys Ala 370
375361125DNAChrysosporium lucknowense 36atgcatctct cctcgtctct
cctcctcctc gccgccttgc ccctgggcat cgccggcaag 60ggcaagggcc acggccacgg
cccccatacc gggctccaca ccctcgccaa gcaggccggc 120ctcaagtact
tcggctctgc caccgactct cccggccagc gtgagcgcgc cggctacgag
180gacaagtacg cccagtacga ccagatcatg tggaagtcgg gcgagttcgg
cctgacgacc 240ccgaccaacg gccaaaagtg gctgtttact gagcccgagc
gtggcgtgtt caacttcacc 300gagggtgaca tcgtgacgaa cctggcccgg
aagcacggtt tcatgcagcg gtgccacgcg 360ctcgtctggc acagccagct
cgccccttgg gtcgagtcga ccgagtggac gcccgaggag 420ctgcgccagg
tcattgtcaa ccacatcacc cacgtggccg gctactacaa gggcaagtgc
480tatgcctggg acgtcgtcaa cgaggccctg aacgaggacg gcacctaccg
cgagtccgtc 540ttctacaagg tgctcggcga ggactacatc aagctggcct
tcgagacggc cgccaaggtc 600gacccccacg ccaagctcta ctacaacgac
tacaacctcg agtcccccag cgccaagacc 660gagggcgcca agcgcatcgt
caagatgctc aaggacgccg gcatccgcat cgacggcgtc 720ggcctgcagg
cccacctcgt cgccgagagc cacccgaccc tcgacgagca catcgatgcc
780atcaagggct tcaccgagct cggcgtcgag gtcgccctga ccgagctcga
catccgcctc 840tccatcccgg ccaacgccac caacctcgcc cagcagaggg
aggcgtacaa gaacgtcgtc 900ggcgcttgcg tccaggttcg cggctgcatt
ggcgtggaga tctgggactt ctatgacccc 960ttcagctggg tccctgccac
cttccccggc cagggcgccc ccctgctctg gttcgaggac 1020ttttccaagc
accccgccta cgacggcgtc gtcgaggccc tgaccaacag gaccacgggc
1080gggtgcaagg gcaagggcaa gggcaagggc aaggtttgga aggcc
1125371192DNAChrysosporium
lucknowenseCDS(121)..(386)CDS(531)..(942) 37acataaatag agccactggc
tcccctcaag ggattggcgt tcgaagcatc gtcagaaccg 60caatatacat attagtgtaa
tatcttgcac ctcattagat ccacaatcat catcagcatc 120atg gtt acc ctc act
cgc ctg gcg gtc gcc gcg gcg gcc atg atc tcc 168Met Val Thr Leu Thr
Arg Leu Ala Val Ala Ala Ala Ala Met Ile Ser1 5 10 15agc act ggc ctg
gct gcc ccg acg ccc gaa gct ggc ccc gac ctt ccc 216Ser Thr Gly Leu
Ala Ala Pro Thr Pro Glu Ala Gly Pro Asp Leu Pro 20 25 30gac ttt gag
ctc ggg gtc aac aac ctc gcc cgc cgc gcg ctg gac tac 264Asp Phe Glu
Leu Gly Val Asn Asn Leu Ala Arg Arg Ala Leu Asp Tyr 35 40 45aac cag
aac tac agg acc agc ggc aac gtc aac tac tcg ccc acc gac 312Asn Gln
Asn Tyr Arg Thr Ser Gly Asn Val Asn Tyr Ser Pro Thr Asp 50 55 60aac
ggc tac tcg gtc agc ttc tcc aac gcg gga gat ttt gtc gtc ggg 360Asn
Gly Tyr Ser Val Ser Phe Ser Asn Ala Gly Asp Phe Val Val Gly65 70 75
80aag ggc tgg agg acg gga gcc acc ag gtaagacagc agcgtacaat 406Lys
Gly Trp Arg Thr Gly Ala Thr Arg 85cgtcagggtt attggttata tggttatcca
gacttgagcc gtggatttct tttatttcat 466ttttcctttc ctccactctt
aaacacgctt cgggcagcgt tgacaaacct tctaatatct 526tcag a aac atc acc
ttc tcg gga tcg aca cag cat acc tcg ggc acc 573 Asn Ile Thr Phe Ser
Gly Ser Thr Gln His Thr Ser Gly Thr 90 95 100gtg ctc gtc tcc gtc
tac ggc tgg acc cgg aac ccg ctg atc gag tac 621Val Leu Val Ser Val
Tyr Gly Trp Thr Arg Asn Pro Leu Ile Glu Tyr 105 110 115tac gtg cag
gag tac acg tcc aac ggg gcc ggc tcc gct cag ggc gag 669Tyr Val Gln
Glu Tyr Thr Ser Asn Gly Ala Gly Ser Ala Gln Gly Glu120 125 130
135aag ctg ggc acg gtc gag agc gac ggg ggc acg tac gag atc tgg cgg
717Lys Leu Gly Thr Val Glu Ser Asp Gly Gly Thr Tyr Glu Ile Trp Arg
140 145 150cac cag cag gtc aac cag ccg tcg atc gag ggc acc tcg acc
ttc tgg 765His Gln Gln Val Asn Gln Pro Ser Ile Glu Gly Thr Ser Thr
Phe Trp 155 160 165cag tac atc tcg aac cgc gtg tcc ggc cag cgg ccc
aac ggc ggc acc 813Gln Tyr Ile Ser Asn Arg Val Ser Gly Gln Arg Pro
Asn Gly Gly Thr 170 175 180gtc acc ctc gcc aac cac ttc gcc gcc tgg
cag aag ctc ggc ctg aac 861Val Thr Leu Ala Asn His Phe Ala Ala Trp
Gln Lys Leu Gly Leu Asn 185 190 195ctg ggc cag cac gac tac cag gtc
ctg gcc acc gag ggc tgg ggc aac 909Leu Gly Gln His Asp Tyr Gln Val
Leu Ala Thr Glu Gly Trp Gly Asn200 205 210 215gcc ggc ggc agc tcc
cag tac acc gtc agc ggc tgaccgttgg gtggtggggg 962Ala Gly Gly Ser
Ser Gln Tyr Thr Val Ser Gly 220 225gcaaagagct ggtcttagaa ccatccaacg
atccttacca tgaaagagct tgtgacttag 1022tcgctgttca tgaaaacatg
tctgttttac acagacaaga tttaccaatt gcaatgaagc 1082atacgtcaac
tcgaacgtgt ttgtgtttct cgtgcctctc gtatgaatat gctatgctga
1142tggagtatca ctcttgatgt caaacatgct atattaacga cataactata
119238226PRTChrysosporium lucknowense 38Met Val Thr Leu Thr Arg Leu
Ala Val Ala Ala Ala Ala Met Ile Ser1 5 10 15Ser Thr Gly Leu Ala Ala
Pro Thr Pro Glu Ala Gly Pro Asp Leu Pro 20 25 30Asp Phe Glu Leu Gly
Val Asn Asn Leu Ala Arg Arg Ala Leu Asp Tyr 35 40 45Asn Gln Asn Tyr
Arg Thr Ser Gly Asn Val Asn Tyr Ser Pro Thr Asp 50 55 60Asn Gly Tyr
Ser Val Ser Phe Ser Asn Ala Gly Asp Phe Val Val Gly65 70 75 80Lys
Gly Trp Arg Thr Gly Ala Thr Arg Asn Ile Thr Phe Ser Gly Ser 85 90
95Thr Gln His Thr Ser Gly Thr Val Leu Val Ser Val Tyr Gly Trp Thr
100 105 110Arg Asn Pro Leu Ile Glu Tyr Tyr Val Gln Glu Tyr Thr Ser
Asn Gly 115 120 125Ala Gly Ser Ala Gln Gly Glu Lys Leu Gly Thr Val
Glu Ser Asp Gly 130 135 140Gly Thr Tyr Glu Ile Trp Arg His Gln Gln
Val Asn Gln Pro Ser Ile145 150 155 160Glu Gly Thr Ser Thr Phe Trp
Gln Tyr Ile Ser Asn Arg Val Ser Gly 165 170 175Gln Arg Pro Asn Gly
Gly Thr Val Thr Leu Ala Asn His Phe Ala Ala 180 185 190Trp Gln Lys
Leu Gly Leu Asn Leu Gly Gln His Asp Tyr Gln Val Leu 195 200 205Ala
Thr Glu Gly Trp Gly Asn Ala Gly Gly Ser Ser Gln Tyr Thr Val 210 215
220Ser Gly22539678DNAChrysosporium lucknowense 39atggttaccc
tcactcgcct ggcggtcgcc gcggcggcca tgatctccag cactggcctg 60gctgccccga
cgcccgaagc tggccccgac cttcccgact ttgagctcgg ggtcaacaac
120ctcgcccgcc gcgcgctgga ctacaaccag aactacagga ccagcggcaa
cgtcaactac 180tcgcccaccg acaacggcta ctcggtcagc ttctccaacg
cgggagattt tgtcgtcggg 240aagggctgga ggacgggagc caccagaaac
atcaccttct cgggatcgac acagcatacc 300tcgggcaccg tgctcgtctc
cgtctacggc tggacccgga acccgctgat cgagtactac 360gtgcaggagt
acacgtccaa cggggccggc tccgctcagg gcgagaagct gggcacggtc
420gagagcgacg ggggcacgta cgagatctgg cggcaccagc aggtcaacca
gccgtcgatc 480gagggcacct cgaccttctg gcagtacatc tcgaaccgcg
tgtccggcca gcggcccaac 540ggcggcaccg tcaccctcgc caaccacttc
gccgcctggc agaagctcgg cctgaacctg 600ggccagcacg actaccaggt
cctggccacc gagggctggg gcaacgccgg cggcagctcc 660cagtacaccg tcagcggc
678401194DNAChrysosporium lucknowenseCDS(205)..(485)CDS(584)..(986)
40gtataaatgc atcaaggagc gactgccccc ccatcaataa ccacccggtt gcttgagtct
60ctcgcactcg cggccccttc ttctctgctt cgcacgcatc tcgctgtctc gctgtctcgc
120tgtctcactg tctcgctgtc tcactgtctc gctgtctcac tgtctcgctg
tctcactgtc 180tcactcgtcc atcagagcaa aacc atg gtc tcg ctc aag tcc
ctc ctc ctc 231 Met Val Ser Leu Lys Ser Leu Leu Leu 1 5gcc gcg gcg
gcg acg ttg acg gcg gtg acg gcg cgc ccg ttc gac ttt 279Ala Ala Ala
Ala Thr Leu Thr Ala Val Thr Ala Arg Pro Phe Asp Phe10 15 20 25gac
gac ggc aac tcg acc gag gcg ctg gcc aag cgc cag gtc acg ccc 327Asp
Asp Gly Asn Ser Thr Glu Ala Leu Ala Lys Arg Gln Val Thr Pro 30 35
40aac gcg cag ggc tac cac tcg ggc tac ttc tac tcg tgg tgg tcc gac
375Asn Ala Gln Gly Tyr His Ser Gly Tyr Phe Tyr Ser Trp Trp Ser Asp
45 50 55ggc ggc ggc cag gcc acc ttc acc ctg ctc gag ggc agc cac tac
cag 423Gly Gly Gly Gln Ala Thr Phe Thr Leu Leu Glu Gly Ser His Tyr
Gln 60 65 70gtc aac tgg agg aac acg ggc aac ttt gtc ggt ggc aag ggc
tgg aac 471Val Asn Trp Arg Asn Thr Gly Asn Phe Val Gly Gly Lys Gly
Trp Asn 75 80 85ccg ggt acc ggc cg gtaagttgtg tttcttttct tgaactattg
actactaatt 525Pro Gly Thr Gly Arg90atcgtatctt tccagtcaac ggggttgaaa
ggttggttgg ctgactgccg gatcgcag g 584acc atc aac tac ggc ggc tcg ttc
aac ccg agc ggc aac ggc tac ctg 632Thr Ile Asn Tyr Gly Gly Ser Phe
Asn Pro Ser Gly Asn Gly Tyr Leu95 100 105 110gcc gtc tac ggc tgg
acg cac aac ccg ctg atc gag tac tac gtg gtc 680Ala Val Tyr Gly Trp
Thr His Asn Pro Leu Ile Glu Tyr Tyr Val Val 115 120 125gag tcg tac
ggg acc tac aac ccg ggc agc cag gcc cag tac aag ggc 728Glu Ser Tyr
Gly Thr Tyr Asn Pro Gly Ser Gln Ala Gln Tyr Lys Gly 130 135 140agc
ttc cag agc gac ggc ggc acc tac aac atc tac gtc tcg acc cgc 776Ser
Phe Gln Ser Asp Gly Gly Thr Tyr Asn Ile Tyr Val Ser Thr Arg 145 150
155tac aac gcg ccc tcg atc gag ggc acc cgc acc ttc cag cag tac tgg
824Tyr Asn Ala Pro Ser Ile Glu Gly Thr Arg Thr Phe Gln Gln Tyr Trp
160 165 170tcc atc cgc acc tcc aag cgc gtc ggc ggc tcc gtc acc atg
cag aac 872Ser Ile Arg Thr Ser Lys Arg Val Gly Gly Ser Val Thr Met
Gln Asn175 180 185 190cac ttc aac gcc tgg gcc cag cac ggc atg ccc
ctc ggc tcc cac gac 920His Phe Asn Ala Trp Ala Gln His Gly Met Pro
Leu Gly Ser His Asp 195 200 205tac cag atc gtc gcc acc gag ggc tac
cag agc agc ggc tcc tcc gac 968Tyr Gln Ile Val Ala Thr Glu Gly Tyr
Gln Ser Ser Gly Ser Ser Asp 210 215 220atc tac gtc cag act cac
taggtacctt aggtattcaa gcgatccagt 1016Ile Tyr Val Gln Thr His
225cggtcagaca gactcttcca ggcacgccgc tgtccatcag tgggtagtat
tatcccatga 1076tgggcgccgg cggtgacggg aaaaggggcc attcgtctgc
agtaccttat tttactatta 1136atactgccta tctgccagca taacaatagg
tagcacgcgt gcacttacca ggaattca 119441228PRTChrysosporium
lucknowense 41Met Val Ser Leu Lys Ser Leu Leu Leu Ala Ala Ala Ala
Thr Leu Thr1 5 10 15Ala Val Thr Ala Arg Pro Phe Asp Phe Asp Asp Gly
Asn Ser Thr Glu 20 25 30Ala Leu Ala Lys Arg Gln Val Thr Pro Asn Ala
Gln Gly Tyr His Ser 35 40 45Gly Tyr Phe Tyr Ser Trp Trp Ser Asp Gly
Gly Gly Gln Ala Thr Phe 50 55 60Thr Leu Leu Glu Gly Ser His Tyr Gln
Val Asn Trp Arg Asn Thr Gly65 70 75 80Asn Phe Val Gly Gly Lys Gly
Trp Asn Pro Gly Thr Gly Arg Thr Ile 85 90 95Asn Tyr Gly Gly Ser Phe
Asn Pro Ser Gly Asn Gly Tyr
Leu Ala Val 100 105 110Tyr Gly Trp Thr His Asn Pro Leu Ile Glu Tyr
Tyr Val Val Glu Ser 115 120 125Tyr Gly Thr Tyr Asn Pro Gly Ser Gln
Ala Gln Tyr Lys Gly Ser Phe 130 135 140Gln Ser Asp Gly Gly Thr Tyr
Asn Ile Tyr Val Ser Thr Arg Tyr Asn145 150 155 160Ala Pro Ser Ile
Glu Gly Thr Arg Thr Phe Gln Gln Tyr Trp Ser Ile 165 170 175Arg Thr
Ser Lys Arg Val Gly Gly Ser Val Thr Met Gln Asn His Phe 180 185
190Asn Ala Trp Ala Gln His Gly Met Pro Leu Gly Ser His Asp Tyr Gln
195 200 205Ile Val Ala Thr Glu Gly Tyr Gln Ser Ser Gly Ser Ser Asp
Ile Tyr 210 215 220Val Gln Thr His22542684DNAChrysosporium
lucknowense 42atggtctcgc tcaagtccct cctcctcgcc gcggcggcga
cgttgacggc ggtgacggcg 60cgcccgttcg actttgacga cggcaactcg accgaggcgc
tggccaagcg ccaggtcacg 120cccaacgcgc agggctacca ctcgggctac
ttctactcgt ggtggtccga cggcggcggc 180caggccacct tcaccctgct
cgagggcagc cactaccagg tcaactggag gaacacgggc 240aactttgtcg
gtggcaaggg ctggaacccg ggtaccggcc ggaccatcaa ctacggcggc
300tcgttcaacc cgagcggcaa cggctacctg gccgtctacg gctggacgca
caacccgctg 360atcgagtact acgtggtcga gtcgtacggg acctacaacc
cgggcagcca ggcccagtac 420aagggcagct tccagagcga cggcggcacc
tacaacatct acgtctcgac ccgctacaac 480gcgccctcga tcgagggcac
ccgcaccttc cagcagtact ggtccatccg cacctccaag 540cgcgtcggcg
gctccgtcac catgcagaac cacttcaacg cctgggccca gcacggcatg
600cccctcggct cccacgacta ccagatcgtc gccaccgagg gctaccagag
cagcggctcc 660tccgacatct acgtccagac tcac 684432740DNAChrysosporium
lucknowenseCDS(1)..(230)CDS(294)..(682)CDS(744)..(2737) 43atg caa
ctt cca gcc gca gcc caa tgg ctg ctc acg ctt ccc gcg aaa 48Met Gln
Leu Pro Ala Ala Ala Gln Trp Leu Leu Thr Leu Pro Ala Lys1 5 10 15gcc
tca ctt gct gac aat cat cgt cag gtt cac cag aag ccc ctc gcg 96Ala
Ser Leu Ala Asp Asn His Arg Gln Val His Gln Lys Pro Leu Ala 20 25
30aga tct gaa cct ttt tac ccg tcg cca tgg atg aat ccc aac gcc gac
144Arg Ser Glu Pro Phe Tyr Pro Ser Pro Trp Met Asn Pro Asn Ala Asp
35 40 45ggc tgg gcg gag gcc tat gcc cag gcc aag tcc ttt gtc tcc caa
atg 192Gly Trp Ala Glu Ala Tyr Ala Gln Ala Lys Ser Phe Val Ser Gln
Met 50 55 60act ctg cta gag aag gtc aac ttg acc acg gga gtc gg
gtaagttttg 240Thr Leu Leu Glu Lys Val Asn Leu Thr Thr Gly Val Gly65
70 75tcattttgtc caggtaacat gcaaatggtt ctgctaacaa taacttaccg tag c
tgg 297 Trpggg gct gag cag tgc gtc ggc caa gtg ggc gcg atc cct cgc
ctt gga 345Gly Ala Glu Gln Cys Val Gly Gln Val Gly Ala Ile Pro Arg
Leu Gly 80 85 90ctt cgc agt ctg tgc atg cat gac tcc cct ctc ggc atc
cga gga gcc 393Leu Arg Ser Leu Cys Met His Asp Ser Pro Leu Gly Ile
Arg Gly Ala95 100 105 110gac tac aac tca gcg ttc ccc tct ggc cag
acc gtt gct gct acc tgg 441Asp Tyr Asn Ser Ala Phe Pro Ser Gly Gln
Thr Val Ala Ala Thr Trp 115 120 125gat cgc ggt ctg atg tac cgt cgc
ggc tac gca atg ggc cag gag gcc 489Asp Arg Gly Leu Met Tyr Arg Arg
Gly Tyr Ala Met Gly Gln Glu Ala 130 135 140aaa ggc aag ggc atc aat
gtc ctt ctc gga cca gtc gcc ggc ccc ctt 537Lys Gly Lys Gly Ile Asn
Val Leu Leu Gly Pro Val Ala Gly Pro Leu 145 150 155ggc cgc atg ccc
gag ggc ggt cgt aac tgg gaa ggc ttc gct ccg gat 585Gly Arg Met Pro
Glu Gly Gly Arg Asn Trp Glu Gly Phe Ala Pro Asp 160 165 170ccc gtc
ctt acc ggc atc ggc atg tcc gag acg atc aag ggc att cag 633Pro Val
Leu Thr Gly Ile Gly Met Ser Glu Thr Ile Lys Gly Ile Gln175 180 185
190gat gct ggc gtc atc gct tgt gcg aag cac ttt att gga aac gag cag
g 682Asp Ala Gly Val Ile Ala Cys Ala Lys His Phe Ile Gly Asn Glu
Gln 195 200 205gtgagtagtc aaagacgggc cgtctcggac ccgcggcttc
aagctgctga ctctgctgca 742g ag cac ttc aga cag gtg cca gaa gcc cag
gga tac ggt tac aac atc 790 Glu His Phe Arg Gln Val Pro Glu Ala Gln
Gly Tyr Gly Tyr Asn Ile 210 215 220agc gaa acc ctc tcc tcc aac att
gac gac aag acc atg cac gag ctc 838Ser Glu Thr Leu Ser Ser Asn Ile
Asp Asp Lys Thr Met His Glu Leu 225 230 235tac ctt tgg ccg ttt gcc
gat gcc gtc cgg gcc ggc gtc ggc tct gtc 886Tyr Leu Trp Pro Phe Ala
Asp Ala Val Arg Ala Gly Val Gly Ser Val 240 245 250atg tgc tcg tac
cag cag gtc aac aac tcg tac gcc tgc cag aac tcg 934Met Cys Ser Tyr
Gln Gln Val Asn Asn Ser Tyr Ala Cys Gln Asn Ser255 260 265 270aag
ctg ctg aac gac ctc ctc aag aac gag ctt ggg ttt cag ggc ttc 982Lys
Leu Leu Asn Asp Leu Leu Lys Asn Glu Leu Gly Phe Gln Gly Phe 275 280
285gtc atg agc gac tgg cag gca cag cac act ggc gca gca agc gcc gtg
1030Val Met Ser Asp Trp Gln Ala Gln His Thr Gly Ala Ala Ser Ala Val
290 295 300gct ggt ctc gat atg tcc atg ccg ggc gac acc cag ttc aac
act ggc 1078Ala Gly Leu Asp Met Ser Met Pro Gly Asp Thr Gln Phe Asn
Thr Gly 305 310 315gtc agt ttc tgg ggc gcc aat ctc acc ctc gcc gtc
ctc aac ggc aca 1126Val Ser Phe Trp Gly Ala Asn Leu Thr Leu Ala Val
Leu Asn Gly Thr 320 325 330gtc cct gcc tac cgt ctc gac gac atg gcc
atg cgc atc atg gcc gcc 1174Val Pro Ala Tyr Arg Leu Asp Asp Met Ala
Met Arg Ile Met Ala Ala335 340 345 350ctc ttc aag gtc acc aag acc
acc cac ctg gaa ccc atc aac ttc tcc 1222Leu Phe Lys Val Thr Lys Thr
Thr His Leu Glu Pro Ile Asn Phe Ser 355 360 365ttc tgg acc gac gac
act tat ggc ccg atc cac tgg gcc gcc aag cat 1270Phe Trp Thr Asp Asp
Thr Tyr Gly Pro Ile His Trp Ala Ala Lys His 370 375 380ggc tac cag
aag att aat tcc cac gtt gac gtc cgc gcc gac cac ggc 1318Gly Tyr Gln
Lys Ile Asn Ser His Val Asp Val Arg Ala Asp His Gly 385 390 395aac
ctc atc cgg gag att gcc gcc aag ggt acg gtg ctg ctg aag aat 1366Asn
Leu Ile Arg Glu Ile Ala Ala Lys Gly Thr Val Leu Leu Lys Asn 400 405
410acc ggc tct cta ccc ctg aac aag cca aag ttc gtg gcc gtc atc ggc
1414Thr Gly Ser Leu Pro Leu Asn Lys Pro Lys Phe Val Ala Val Ile
Gly415 420 425 430gag gat gct ggg tcg agc ccc aac ggg ccc aac ggc
tgc agc gac cgc 1462Glu Asp Ala Gly Ser Ser Pro Asn Gly Pro Asn Gly
Cys Ser Asp Arg 435 440 445ggc tgt aac gaa ggc acg ctc gcc atg ggc
tgg gga tcc ggc aca gcc 1510Gly Cys Asn Glu Gly Thr Leu Ala Met Gly
Trp Gly Ser Gly Thr Ala 450 455 460aac tat ccg tac ctc gtt tcc ccc
gac gcc gcg ctc cag gcc cgg gcc 1558Asn Tyr Pro Tyr Leu Val Ser Pro
Asp Ala Ala Leu Gln Ala Arg Ala 465 470 475atc cag gac ggc acg agg
tac gag agc gtc ctg tcc aac tac gcc gag 1606Ile Gln Asp Gly Thr Arg
Tyr Glu Ser Val Leu Ser Asn Tyr Ala Glu 480 485 490gaa aag aca aag
gct ctg gtc tcg cag gcc aat gca acc gcc atc gtc 1654Glu Lys Thr Lys
Ala Leu Val Ser Gln Ala Asn Ala Thr Ala Ile Val495 500 505 510ttc
gtc aat gcc gac tca ggc gag ggc tac atc aac gtg gac ggt aac 1702Phe
Val Asn Ala Asp Ser Gly Glu Gly Tyr Ile Asn Val Asp Gly Asn 515 520
525gag ggc gac cgt aag aac ctg act ctc tgg aac aac ggt gat act ctg
1750Glu Gly Asp Arg Lys Asn Leu Thr Leu Trp Asn Asn Gly Asp Thr Leu
530 535 540gtc aag aac gtc tcg agc tgg tgc agc aac acc atc gtc gtc
atc cac 1798Val Lys Asn Val Ser Ser Trp Cys Ser Asn Thr Ile Val Val
Ile His 545 550 555tcg gtc ggc ccg gtc ctc ctg acc gat tgg tac gac
aac ccc aac atc 1846Ser Val Gly Pro Val Leu Leu Thr Asp Trp Tyr Asp
Asn Pro Asn Ile 560 565 570acg gcc att ctc tgg gct ggt ctt ccg ggc
cag gag tcg ggc aac tcc 1894Thr Ala Ile Leu Trp Ala Gly Leu Pro Gly
Gln Glu Ser Gly Asn Ser575 580 585 590atc acc gac gtg ctt tac ggc
aag gtc aac ccc gcc gcc cgc tcg ccc 1942Ile Thr Asp Val Leu Tyr Gly
Lys Val Asn Pro Ala Ala Arg Ser Pro 595 600 605ttc act tgg ggc aag
acc cgc gaa agc tat ggc gcg gac gtc ctg tac 1990Phe Thr Trp Gly Lys
Thr Arg Glu Ser Tyr Gly Ala Asp Val Leu Tyr 610 615 620aag ccg aat
aat ggc aat ggt gcg ccc caa cag gac ttc acc gag ggc 2038Lys Pro Asn
Asn Gly Asn Gly Ala Pro Gln Gln Asp Phe Thr Glu Gly 625 630 635gtc
ttc atc gac tac cgc tac ttc gac aag gtt gac gat gac tcg gtc 2086Val
Phe Ile Asp Tyr Arg Tyr Phe Asp Lys Val Asp Asp Asp Ser Val 640 645
650atc tac gag ttc ggc cac ggc ctg agc tac acc acc ttc gag tac agc
2134Ile Tyr Glu Phe Gly His Gly Leu Ser Tyr Thr Thr Phe Glu Tyr
Ser655 660 665 670aac atc cgc gtc gtc aag tcc aac gtc agc gag tac
cgg ccc acg acg 2182Asn Ile Arg Val Val Lys Ser Asn Val Ser Glu Tyr
Arg Pro Thr Thr 675 680 685ggc acc acg gcc cag gcc ccg acg ttt ggc
aac ttc tcc acc gac ctc 2230Gly Thr Thr Ala Gln Ala Pro Thr Phe Gly
Asn Phe Ser Thr Asp Leu 690 695 700gag gac tat ctc ttc ccc aag gac
gag ttc ccc tac atc tac cag tac 2278Glu Asp Tyr Leu Phe Pro Lys Asp
Glu Phe Pro Tyr Ile Tyr Gln Tyr 705 710 715atc tac ccg tac ctc aac
acg acc gac ccc cgg agg gcc tcg gcc gat 2326Ile Tyr Pro Tyr Leu Asn
Thr Thr Asp Pro Arg Arg Ala Ser Ala Asp 720 725 730ccc cac tac ggc
cag acc gcc gag gag ttc ctc ccg ccc cac gcc acc 2374Pro His Tyr Gly
Gln Thr Ala Glu Glu Phe Leu Pro Pro His Ala Thr735 740 745 750gat
gac gac ccc cag ccg ctc ctc cgg tcc tcg ggc gga aac tcc ccc 2422Asp
Asp Asp Pro Gln Pro Leu Leu Arg Ser Ser Gly Gly Asn Ser Pro 755 760
765ggc ggc aac cgc cag ctg tac gac att gtc tac aca atc acg gcc gac
2470Gly Gly Asn Arg Gln Leu Tyr Asp Ile Val Tyr Thr Ile Thr Ala Asp
770 775 780atc acg aat acg ggc tcc gtt gta ggc gag gag gta ccg cag
ctc tac 2518Ile Thr Asn Thr Gly Ser Val Val Gly Glu Glu Val Pro Gln
Leu Tyr 785 790 795gtc tcg ctg ggc ggt ccc gag gat ccc aag gtg cag
ctg cgc gac ttt 2566Val Ser Leu Gly Gly Pro Glu Asp Pro Lys Val Gln
Leu Arg Asp Phe 800 805 810gac agg atg cgg atc gaa ccc ggc gag acg
agg cag ttc acc ggc cgc 2614Asp Arg Met Arg Ile Glu Pro Gly Glu Thr
Arg Gln Phe Thr Gly Arg815 820 825 830ctg acg cgc aga gat ctg agc
aac tgg gac gtc acg gtg cag gac tgg 2662Leu Thr Arg Arg Asp Leu Ser
Asn Trp Asp Val Thr Val Gln Asp Trp 835 840 845gtc atc agc agg tat
ccc aag acg gca tat gtt ggg agg agc agc cgg 2710Val Ile Ser Arg Tyr
Pro Lys Thr Ala Tyr Val Gly Arg Ser Ser Arg 850 855 860aag ttg gat
ctc aag att gag ctt cct tga 2740Lys Leu Asp Leu Lys Ile Glu Leu Pro
865 87044871PRTChrysosporium lucknowense 44Met Gln Leu Pro Ala Ala
Ala Gln Trp Leu Leu Thr Leu Pro Ala Lys1 5 10 15Ala Ser Leu Ala Asp
Asn His Arg Gln Val His Gln Lys Pro Leu Ala 20 25 30Arg Ser Glu Pro
Phe Tyr Pro Ser Pro Trp Met Asn Pro Asn Ala Asp 35 40 45Gly Trp Ala
Glu Ala Tyr Ala Gln Ala Lys Ser Phe Val Ser Gln Met 50 55 60Thr Leu
Leu Glu Lys Val Asn Leu Thr Thr Gly Val Gly Trp Gly Ala65 70 75
80Glu Gln Cys Val Gly Gln Val Gly Ala Ile Pro Arg Leu Gly Leu Arg
85 90 95Ser Leu Cys Met His Asp Ser Pro Leu Gly Ile Arg Gly Ala Asp
Tyr 100 105 110Asn Ser Ala Phe Pro Ser Gly Gln Thr Val Ala Ala Thr
Trp Asp Arg 115 120 125Gly Leu Met Tyr Arg Arg Gly Tyr Ala Met Gly
Gln Glu Ala Lys Gly 130 135 140Lys Gly Ile Asn Val Leu Leu Gly Pro
Val Ala Gly Pro Leu Gly Arg145 150 155 160Met Pro Glu Gly Gly Arg
Asn Trp Glu Gly Phe Ala Pro Asp Pro Val 165 170 175Leu Thr Gly Ile
Gly Met Ser Glu Thr Ile Lys Gly Ile Gln Asp Ala 180 185 190Gly Val
Ile Ala Cys Ala Lys His Phe Ile Gly Asn Glu Gln Glu His 195 200
205Phe Arg Gln Val Pro Glu Ala Gln Gly Tyr Gly Tyr Asn Ile Ser Glu
210 215 220Thr Leu Ser Ser Asn Ile Asp Asp Lys Thr Met His Glu Leu
Tyr Leu225 230 235 240Trp Pro Phe Ala Asp Ala Val Arg Ala Gly Val
Gly Ser Val Met Cys 245 250 255Ser Tyr Gln Gln Val Asn Asn Ser Tyr
Ala Cys Gln Asn Ser Lys Leu 260 265 270Leu Asn Asp Leu Leu Lys Asn
Glu Leu Gly Phe Gln Gly Phe Val Met 275 280 285Ser Asp Trp Gln Ala
Gln His Thr Gly Ala Ala Ser Ala Val Ala Gly 290 295 300Leu Asp Met
Ser Met Pro Gly Asp Thr Gln Phe Asn Thr Gly Val Ser305 310 315
320Phe Trp Gly Ala Asn Leu Thr Leu Ala Val Leu Asn Gly Thr Val Pro
325 330 335Ala Tyr Arg Leu Asp Asp Met Ala Met Arg Ile Met Ala Ala
Leu Phe 340 345 350Lys Val Thr Lys Thr Thr His Leu Glu Pro Ile Asn
Phe Ser Phe Trp 355 360 365Thr Asp Asp Thr Tyr Gly Pro Ile His Trp
Ala Ala Lys His Gly Tyr 370 375 380Gln Lys Ile Asn Ser His Val Asp
Val Arg Ala Asp His Gly Asn Leu385 390 395 400Ile Arg Glu Ile Ala
Ala Lys Gly Thr Val Leu Leu Lys Asn Thr Gly 405 410 415Ser Leu Pro
Leu Asn Lys Pro Lys Phe Val Ala Val Ile Gly Glu Asp 420 425 430Ala
Gly Ser Ser Pro Asn Gly Pro Asn Gly Cys Ser Asp Arg Gly Cys 435 440
445Asn Glu Gly Thr Leu Ala Met Gly Trp Gly Ser Gly Thr Ala Asn Tyr
450 455 460Pro Tyr Leu Val Ser Pro Asp Ala Ala Leu Gln Ala Arg Ala
Ile Gln465 470 475 480Asp Gly Thr Arg Tyr Glu Ser Val Leu Ser Asn
Tyr Ala Glu Glu Lys 485 490 495Thr Lys Ala Leu Val Ser Gln Ala Asn
Ala Thr Ala Ile Val Phe Val 500 505 510Asn Ala Asp Ser Gly Glu Gly
Tyr Ile Asn Val Asp Gly Asn Glu Gly 515 520 525Asp Arg Lys Asn Leu
Thr Leu Trp Asn Asn Gly Asp Thr Leu Val Lys 530 535 540Asn Val Ser
Ser Trp Cys Ser Asn Thr Ile Val Val Ile His Ser Val545 550 555
560Gly Pro Val Leu Leu Thr Asp Trp Tyr Asp Asn Pro Asn Ile Thr Ala
565 570 575Ile Leu Trp Ala Gly Leu Pro Gly Gln Glu Ser Gly Asn Ser
Ile Thr 580 585 590Asp Val Leu Tyr Gly Lys Val Asn Pro Ala Ala Arg
Ser Pro Phe Thr 595 600 605Trp Gly Lys Thr Arg Glu Ser Tyr Gly Ala
Asp Val Leu Tyr Lys Pro 610 615 620Asn Asn Gly Asn Gly Ala Pro Gln
Gln Asp Phe Thr Glu Gly Val Phe625 630 635 640Ile Asp Tyr Arg Tyr
Phe Asp Lys Val Asp Asp Asp Ser Val Ile Tyr 645 650 655Glu Phe Gly
His Gly Leu Ser Tyr Thr Thr Phe Glu Tyr Ser Asn Ile 660 665 670Arg
Val Val Lys Ser Asn Val Ser Glu Tyr Arg Pro Thr Thr Gly Thr 675 680
685Thr Ala Gln Ala Pro Thr Phe Gly Asn Phe Ser Thr Asp Leu Glu Asp
690 695 700Tyr Leu Phe Pro Lys Asp Glu Phe Pro Tyr Ile Tyr Gln Tyr
Ile Tyr705 710 715 720Pro Tyr Leu Asn Thr Thr Asp Pro Arg Arg Ala
Ser Ala Asp Pro His 725 730 735Tyr Gly Gln Thr Ala Glu Glu Phe Leu
Pro Pro His Ala Thr Asp Asp 740 745 750Asp Pro Gln Pro Leu Leu Arg
Ser Ser Gly Gly Asn Ser Pro Gly Gly 755 760 765Asn Arg Gln Leu Tyr
Asp Ile Val Tyr Thr Ile Thr
Ala Asp Ile Thr 770 775 780Asn Thr Gly Ser Val Val Gly Glu Glu Val
Pro Gln Leu Tyr Val Ser785 790 795 800Leu Gly Gly Pro Glu Asp Pro
Lys Val Gln Leu Arg Asp Phe Asp Arg 805 810 815Met Arg Ile Glu Pro
Gly Glu Thr Arg Gln Phe Thr Gly Arg Leu Thr 820 825 830Arg Arg Asp
Leu Ser Asn Trp Asp Val Thr Val Gln Asp Trp Val Ile 835 840 845Ser
Arg Tyr Pro Lys Thr Ala Tyr Val Gly Arg Ser Ser Arg Lys Leu 850 855
860Asp Leu Lys Ile Glu Leu Pro865 870452613DNAChrysosporium
lucknowense 45atgcaacttc cagccgcagc ccaatggctg ctcacgcttc
ccgcgaaagc ctcacttgct 60gacaatcatc gtcaggttca ccagaagccc ctcgcgagat
ctgaaccttt ttacccgtcg 120ccatggatga atcccaacgc cgacggctgg
gcggaggcct atgcccaggc caagtccttt 180gtctcccaaa tgactctgct
agagaaggtc aacttgacca cgggagtcgg ctggggggct 240gagcagtgcg
tcggccaagt gggcgcgatc cctcgccttg gacttcgcag tctgtgcatg
300catgactccc ctctcggcat ccgaggagcc gactacaact cagcgttccc
ctctggccag 360accgttgctg ctacctggga tcgcggtctg atgtaccgtc
gcggctacgc aatgggccag 420gaggccaaag gcaagggcat caatgtcctt
ctcggaccag tcgccggccc ccttggccgc 480atgcccgagg gcggtcgtaa
ctgggaaggc ttcgctccgg atcccgtcct taccggcatc 540ggcatgtccg
agacgatcaa gggcattcag gatgctggcg tcatcgcttg tgcgaagcac
600tttattggaa acgagcagga gcacttcaga caggtgccag aagcccaggg
atacggttac 660aacatcagcg aaaccctctc ctccaacatt gacgacaaga
ccatgcacga gctctacctt 720tggccgtttg ccgatgccgt ccgggccggc
gtcggctctg tcatgtgctc gtaccagcag 780gtcaacaact cgtacgcctg
ccagaactcg aagctgctga acgacctcct caagaacgag 840cttgggtttc
agggcttcgt catgagcgac tggcaggcac agcacactgg cgcagcaagc
900gccgtggctg gtctcgatat gtccatgccg ggcgacaccc agttcaacac
tggcgtcagt 960ttctggggcg ccaatctcac cctcgccgtc ctcaacggca
cagtccctgc ctaccgtctc 1020gacgacatgg ccatgcgcat catggccgcc
ctcttcaagg tcaccaagac cacccacctg 1080gaacccatca acttctcctt
ctggaccgac gacacttatg gcccgatcca ctgggccgcc 1140aagcatggct
accagaagat taattcccac gttgacgtcc gcgccgacca cggcaacctc
1200atccgggaga ttgccgccaa gggtacggtg ctgctgaaga ataccggctc
tctacccctg 1260aacaagccaa agttcgtggc cgtcatcggc gaggatgctg
ggtcgagccc caacgggccc 1320aacggctgca gcgaccgcgg ctgtaacgaa
ggcacgctcg ccatgggctg gggatccggc 1380acagccaact atccgtacct
cgtttccccc gacgccgcgc tccaggcccg ggccatccag 1440gacggcacga
ggtacgagag cgtcctgtcc aactacgccg aggaaaagac aaaggctctg
1500gtctcgcagg ccaatgcaac cgccatcgtc ttcgtcaatg ccgactcagg
cgagggctac 1560atcaacgtgg acggtaacga gggcgaccgt aagaacctga
ctctctggaa caacggtgat 1620actctggtca agaacgtctc gagctggtgc
agcaacacca tcgtcgtcat ccactcggtc 1680ggcccggtcc tcctgaccga
ttggtacgac aaccccaaca tcacggccat tctctgggct 1740ggtcttccgg
gccaggagtc gggcaactcc atcaccgacg tgctttacgg caaggtcaac
1800cccgccgccc gctcgccctt cacttggggc aagacccgcg aaagctatgg
cgcggacgtc 1860ctgtacaagc cgaataatgg caatggtgcg ccccaacagg
acttcaccga gggcgtcttc 1920atcgactacc gctacttcga caaggttgac
gatgactcgg tcatctacga gttcggccac 1980ggcctgagct acaccacctt
cgagtacagc aacatccgcg tcgtcaagtc caacgtcagc 2040gagtaccggc
ccacgacggg caccacggcc caggccccga cgtttggcaa cttctccacc
2100gacctcgagg actatctctt ccccaaggac gagttcccct acatctacca
gtacatctac 2160ccgtacctca acacgaccga cccccggagg gcctcggccg
atccccacta cggccagacc 2220gccgaggagt tcctcccgcc ccacgccacc
gatgacgacc cccagccgct cctccggtcc 2280tcgggcggaa actcccccgg
cggcaaccgc cagctgtacg acattgtcta cacaatcacg 2340gccgacatca
cgaatacggg ctccgttgta ggcgaggagg taccgcagct ctacgtctcg
2400ctgggcggtc ccgaggatcc caaggtgcag ctgcgcgact ttgacaggat
gcggatcgaa 2460cccggcgaga cgaggcagtt caccggccgc ctgacgcgca
gagatctgag caactgggac 2520gtcacggtgc aggactgggt catcagcagg
tatcccaaga cggcatatgt tgggaggagc 2580agccggaagt tggatctcaa
gattgagctt cct 2613461226DNAChrysosporium
lucknowenseCDS(89)..(690)CDS(776)..(1223) 46agacagcttg agagacctcc
taatcctaga tctcgaggct caacctgagg agagacggtc 60tgtaattacc cgctgcacga
caccaaag atg atg ctc aca cgc ttc gtg gct 112 Met Met Leu Thr Arg
Phe Val Ala 1 5ggc ctg ctc ggc atc tcc gcc gcg gat gcc gcc ctc acc
tac aga ggc 160Gly Leu Leu Gly Ile Ser Ala Ala Asp Ala Ala Leu Thr
Tyr Arg Gly 10 15 20gtg gat tgg tcc tca gtg gtg gtt gag gaa cgg gcc
ggc gtc tcg tac 208Val Asp Trp Ser Ser Val Val Val Glu Glu Arg Ala
Gly Val Ser Tyr25 30 35 40aag aac acc aac ggg aat gcc caa ccg ctt
gag aac atc ctg gct gcc 256Lys Asn Thr Asn Gly Asn Ala Gln Pro Leu
Glu Asn Ile Leu Ala Ala 45 50 55aat ggc gtc aac acg gtg cgg cag cga
gtc tgg gtt aac ccc gcg gac 304Asn Gly Val Asn Thr Val Arg Gln Arg
Val Trp Val Asn Pro Ala Asp 60 65 70ggc aac tac aac ctc gac tac aac
atc gcg atc gcg aag agg gcg aag 352Gly Asn Tyr Asn Leu Asp Tyr Asn
Ile Ala Ile Ala Lys Arg Ala Lys 75 80 85gct gcc ggg ctt ggc gtg tac
atc gac ttc cac tac agc gac acc tgg 400Ala Ala Gly Leu Gly Val Tyr
Ile Asp Phe His Tyr Ser Asp Thr Trp 90 95 100gcc gat cct gct cat
cag acc atg ccc gct ggg tgg ccg agc gac att 448Ala Asp Pro Ala His
Gln Thr Met Pro Ala Gly Trp Pro Ser Asp Ile105 110 115 120gac aac
ctc tcc tgg aag ctc tac aac tac act ctg gac gcc gcc aac 496Asp Asn
Leu Ser Trp Lys Leu Tyr Asn Tyr Thr Leu Asp Ala Ala Asn 125 130
135aag ctc cag aac gcg ggt atc cag ccc acc atc gtg tcc atc ggt aac
544Lys Leu Gln Asn Ala Gly Ile Gln Pro Thr Ile Val Ser Ile Gly Asn
140 145 150gag atc cgg gcc ggt ctg cta tgg ccc aca ggg aga acc gag
aac tgg 592Glu Ile Arg Ala Gly Leu Leu Trp Pro Thr Gly Arg Thr Glu
Asn Trp 155 160 165gcc aac att gcc cgg ttg ttg cac tcc gct gct tgg
ggt atc aag gac 640Ala Asn Ile Ala Arg Leu Leu His Ser Ala Ala Trp
Gly Ile Lys Asp 170 175 180tcg tcg ctc agc ccg aag cca aag atc atg
atc cac ctc gac aac gga 688Ser Ser Leu Ser Pro Lys Pro Lys Ile Met
Ile His Leu Asp Asn Gly185 190 195 200tg gtgagttata tcatcctttc
cttgtccgat aacctgttcc tcagcctggt 740Trpcgatgtggag agctgatggc
gggggatcca aacag g gac tgg ggt acc cag aat 794 Asp Trp Gly Thr Gln
Asn 205tgg tgg tac acg aat gtc ttg aag cag ggt aca ctt gag ctg tcc
gac 842Trp Trp Tyr Thr Asn Val Leu Lys Gln Gly Thr Leu Glu Leu Ser
Asp 210 215 220ttt gac atg atg ggc gtc tcg ttc tac ccc ttt tac tcg
tcg tcg gca 890Phe Asp Met Met Gly Val Ser Phe Tyr Pro Phe Tyr Ser
Ser Ser Ala 225 230 235acc ttg agc gcc ctg aaa tcg agc ttg gac aac
atg gcc aaa acc tgg 938Thr Leu Ser Ala Leu Lys Ser Ser Leu Asp Asn
Met Ala Lys Thr Trp240 245 250 255aac aag gag att gcc gtg gtc gag
acc aat tgg cca atc tct tgt ccc 986Asn Lys Glu Ile Ala Val Val Glu
Thr Asn Trp Pro Ile Ser Cys Pro 260 265 270aac cca agg tac agt ttc
ccc tcg gac gtc aag aac atc ccc ttc tcg 1034Asn Pro Arg Tyr Ser Phe
Pro Ser Asp Val Lys Asn Ile Pro Phe Ser 275 280 285ccg gaa gga cag
acg acc ttc atc acc aac gtg gcc aac atc gtg tcc 1082Pro Glu Gly Gln
Thr Thr Phe Ile Thr Asn Val Ala Asn Ile Val Ser 290 295 300tcg gta
agc cgc ggc gta ggc ctg ttt tat tgg gaa ccc gct tgg att 1130Ser Val
Ser Arg Gly Val Gly Leu Phe Tyr Trp Glu Pro Ala Trp Ile 305 310
315cac aat gca aac ctg ggc tcg tcg tgc gcc gac aac acc atg ttt tcg
1178His Asn Ala Asn Leu Gly Ser Ser Cys Ala Asp Asn Thr Met Phe
Ser320 325 330 335caa tcc ggg cag gct ttg tcc agc ttg tcc gtt ttc
cag aga atc tga 1226Gln Ser Gly Gln Ala Leu Ser Ser Leu Ser Val Phe
Gln Arg Ile 340 345 35047350PRTChrysosporium lucknowense 47Met Met
Leu Thr Arg Phe Val Ala Gly Leu Leu Gly Ile Ser Ala Ala1 5 10 15Asp
Ala Ala Leu Thr Tyr Arg Gly Val Asp Trp Ser Ser Val Val Val 20 25
30Glu Glu Arg Ala Gly Val Ser Tyr Lys Asn Thr Asn Gly Asn Ala Gln
35 40 45Pro Leu Glu Asn Ile Leu Ala Ala Asn Gly Val Asn Thr Val Arg
Gln 50 55 60Arg Val Trp Val Asn Pro Ala Asp Gly Asn Tyr Asn Leu Asp
Tyr Asn65 70 75 80Ile Ala Ile Ala Lys Arg Ala Lys Ala Ala Gly Leu
Gly Val Tyr Ile 85 90 95Asp Phe His Tyr Ser Asp Thr Trp Ala Asp Pro
Ala His Gln Thr Met 100 105 110Pro Ala Gly Trp Pro Ser Asp Ile Asp
Asn Leu Ser Trp Lys Leu Tyr 115 120 125Asn Tyr Thr Leu Asp Ala Ala
Asn Lys Leu Gln Asn Ala Gly Ile Gln 130 135 140Pro Thr Ile Val Ser
Ile Gly Asn Glu Ile Arg Ala Gly Leu Leu Trp145 150 155 160Pro Thr
Gly Arg Thr Glu Asn Trp Ala Asn Ile Ala Arg Leu Leu His 165 170
175Ser Ala Ala Trp Gly Ile Lys Asp Ser Ser Leu Ser Pro Lys Pro Lys
180 185 190Ile Met Ile His Leu Asp Asn Gly Trp Asp Trp Gly Thr Gln
Asn Trp 195 200 205Trp Tyr Thr Asn Val Leu Lys Gln Gly Thr Leu Glu
Leu Ser Asp Phe 210 215 220Asp Met Met Gly Val Ser Phe Tyr Pro Phe
Tyr Ser Ser Ser Ala Thr225 230 235 240Leu Ser Ala Leu Lys Ser Ser
Leu Asp Asn Met Ala Lys Thr Trp Asn 245 250 255Lys Glu Ile Ala Val
Val Glu Thr Asn Trp Pro Ile Ser Cys Pro Asn 260 265 270Pro Arg Tyr
Ser Phe Pro Ser Asp Val Lys Asn Ile Pro Phe Ser Pro 275 280 285Glu
Gly Gln Thr Thr Phe Ile Thr Asn Val Ala Asn Ile Val Ser Ser 290 295
300Val Ser Arg Gly Val Gly Leu Phe Tyr Trp Glu Pro Ala Trp Ile
His305 310 315 320Asn Ala Asn Leu Gly Ser Ser Cys Ala Asp Asn Thr
Met Phe Ser Gln 325 330 335Ser Gly Gln Ala Leu Ser Ser Leu Ser Val
Phe Gln Arg Ile 340 345 350481050DNAChrysosporium lucknowense
48atgatgctca cacgcttcgt ggctggcctg ctcggcatct ccgccgcgga tgccgccctc
60acctacagag gcgtggattg gtcctcagtg gtggttgagg aacgggccgg cgtctcgtac
120aagaacacca acgggaatgc ccaaccgctt gagaacatcc tggctgccaa
tggcgtcaac 180acggtgcggc agcgagtctg ggttaacccc gcggacggca
actacaacct cgactacaac 240atcgcgatcg cgaagagggc gaaggctgcc
gggcttggcg tgtacatcga cttccactac 300agcgacacct gggccgatcc
tgctcatcag accatgcccg ctgggtggcc gagcgacatt 360gacaacctct
cctggaagct ctacaactac actctggacg ccgccaacaa gctccagaac
420gcgggtatcc agcccaccat cgtgtccatc ggtaacgaga tccgggccgg
tctgctatgg 480cccacaggga gaaccgagaa ctgggccaac attgcccggt
tgttgcactc cgctgcttgg 540ggtatcaagg actcgtcgct cagcccgaag
ccaaagatca tgatccacct cgacaacgga 600tgggactggg gtacccagaa
ttggtggtac acgaatgtct tgaagcaggg tacacttgag 660ctgtccgact
ttgacatgat gggcgtctcg ttctacccct tttactcgtc gtcggcaacc
720ttgagcgccc tgaaatcgag cttggacaac atggccaaaa cctggaacaa
ggagattgcc 780gtggtcgaga ccaattggcc aatctcttgt cccaacccaa
ggtacagttt cccctcggac 840gtcaagaaca tccccttctc gccggaagga
cagacgacct tcatcaccaa cgtggccaac 900atcgtgtcct cggtaagccg
cggcgtaggc ctgttttatt gggaacccgc ttggattcac 960aatgcaaacc
tgggctcgtc gtgcgccgac aacaccatgt tttcgcaatc cgggcaggct
1020ttgtccagct tgtccgtttt ccagagaatc 1050492344DNAChrysosporium
lucknowenseCDS(54)..(276)CDS(348)..(717)CDS(814)..(887)CDS(947)..(1078)CD-
S(1144)..(2228) 49agctccagga ccacctccca tccgtaaacg gatctggcct
cgtcacgccc gcc atg 56 Met 1cac gct ctc tcg tcg ctc gct gtc ctc ggc
gcc tgg gcc gtc cag acg 104His Ala Leu Ser Ser Leu Ala Val Leu Gly
Ala Trp Ala Val Gln Thr 5 10 15gtc ttg ggc cgt ccg gcc acc ctc tca
aag cgg gcc acc gac tcc ttc 152Val Leu Gly Arg Pro Ala Thr Leu Ser
Lys Arg Ala Thr Asp Ser Phe 20 25 30atc gag acc gag acg cct atc gca
tgg gaa aag ctg cgg tgc aac atc 200Ile Glu Thr Glu Thr Pro Ile Ala
Trp Glu Lys Leu Arg Cys Asn Ile 35 40 45ggc gct aac ggc tgt gcg gct
tcc gga gcc gct gcc ggt gtg gtc att 248Gly Ala Asn Gly Cys Ala Ala
Ser Gly Ala Ala Ala Gly Val Val Ile50 55 60 65gcc agc ccg tcc aag
tcg gat cca gac t gtgcgttgac agcggcccct 296Ala Ser Pro Ser Lys Ser
Asp Pro Asp 70accccttatt atcgcaactt cggtcggtgt tgagatgctg
agacgcgaca g ac ttc 352 Tyr Phe 75tac acc tgg act cga gat gcc ggc
ctg gtc ctg acg ggt atc gtg gac 400Tyr Thr Trp Thr Arg Asp Ala Gly
Leu Val Leu Thr Gly Ile Val Asp 80 85 90gcc ctg tcc caa aac tac tcg
gcg gcc ctg cag acc aac att cag gac 448Ala Leu Ser Gln Asn Tyr Ser
Ala Ala Leu Gln Thr Asn Ile Gln Asp 95 100 105tac atc atc gcc cag
gcc aag ctc cag ggt gtt tcg aac ccc tcc ggt 496Tyr Ile Ile Ala Gln
Ala Lys Leu Gln Gly Val Ser Asn Pro Ser Gly 110 115 120agc ctc tcg
gac ggc acc ggt ctt ggc gag ccc aag ttc aat gtc gat 544Ser Leu Ser
Asp Gly Thr Gly Leu Gly Glu Pro Lys Phe Asn Val Asp125 130 135
140ctc acc cag ttc acg ggc gac tgg ggc cgg ccg cag cgc gac ggt ccg
592Leu Thr Gln Phe Thr Gly Asp Trp Gly Arg Pro Gln Arg Asp Gly Pro
145 150 155cct ctc cgg gcc atc gcc ctc atc cgc tac gcc aag tgg ctg
gct tcc 640Pro Leu Arg Ala Ile Ala Leu Ile Arg Tyr Ala Lys Trp Leu
Ala Ser 160 165 170aac ggt tac aag gac acg gcc aac agc gtc gtc tgg
ccc gtc atc aag 688Asn Gly Tyr Lys Asp Thr Ala Asn Ser Val Val Trp
Pro Val Ile Lys 175 180 185aac gac ctg gcc tat gcc gct cag tat tg
gtgagtgccc cttctacttc 737Asn Asp Leu Ala Tyr Ala Ala Gln Tyr Trp
190 195tcgcccttgg gggagtttct tcccgtcctg acgccggagg cacgctaaca
cacgccacac 797acacacgcac atacag g aac gag act ggt ttc gac ctg tgg
gag gag gtt 847 Asn Glu Thr Gly Phe Asp Leu Trp Glu Glu Val 200
205ccc ggc agc tcg ttc ttc acc att gcc agc acg cac cga g gtgggtggtc
897Pro Gly Ser Ser Phe Phe Thr Ile Ala Ser Thr His Arg210 215
220cccgttgtac atatatgggt tgatcaacga attactgacg cagcgacag cc ttg gta
954 Ala Leu Val 225gag gga gct gcc ctc gct gcc cag ctc ggc acc gaa
tgc agc gcc tgc 1002Glu Gly Ala Ala Leu Ala Ala Gln Leu Gly Thr Glu
Cys Ser Ala Cys 230 235 240atc acc gtc gcg ccc caa gtc ctc tgc ttc
cag cag agc ttc tgg aac 1050Ile Thr Val Ala Pro Gln Val Leu Cys Phe
Gln Gln Ser Phe Trp Asn 245 250 255ccg tcg ggc ggt tac gtt gtc tcg
aac a gtaagtctct tctcgtggtg 1098Pro Ser Gly Gly Tyr Val Val Ser Asn
260 265gtgccgctcc cagcctcaat ttccacacaa ctaaacaacg gcaag tc aac ggc
ggc 1154Ile Asn Gly Gly 270aac aac cgg tcc ggc aag gat ctc aac tcg
gtc ctg gcc tcc atc cac 1202Asn Asn Arg Ser Gly Lys Asp Leu Asn Ser
Val Leu Ala Ser Ile His 275 280 285acc ttt gac ccg gcg gtc ggc tgc
gac tcg gtc acc ttc cag ccc tgc 1250Thr Phe Asp Pro Ala Val Gly Cys
Asp Ser Val Thr Phe Gln Pro Cys 290 295 300agc gac aag gcg ctc tcc
aac cac aag gcc tac gtc gac tcc ttc cgc 1298Ser Asp Lys Ala Leu Ser
Asn His Lys Ala Tyr Val Asp Ser Phe Arg 305 310 315agc gtc tac gcc
atc aac tcg ggc att gcc cag ggc aag gcc gtc gcc 1346Ser Val Tyr Ala
Ile Asn Ser Gly Ile Ala Gln Gly Lys Ala Val Ala 320 325 330gtg ggc
cgc tac tcg gag gac gtc tac tac aac ggc aac ccg tgg tac 1394Val Gly
Arg Tyr Ser Glu Asp Val Tyr Tyr Asn Gly Asn Pro Trp Tyr335 340 345
350ctg gcc aac ttc gcg gcc gcc gag cag ctc tac gac gcc gtc ttc gtc
1442Leu Ala Asn Phe Ala Ala Ala Glu Gln Leu Tyr Asp Ala Val Phe Val
355 360 365tgg aag aag cag cag tcc atc gag gtc acc cag ctg tcc ctc
ccc ttc 1490Trp Lys Lys Gln Gln Ser Ile Glu Val Thr Gln Leu Ser Leu
Pro Phe 370 375 380ttc aag gac ctg ctc ccc ggc atc tcc acc ggc acc
tac acc ccg tcg 1538Phe Lys Asp Leu Leu Pro Gly Ile Ser Thr Gly Thr
Tyr Thr Pro Ser 385 390 395tcg tcg acg tac cag cag atc ctc gac gcc
gtc tcg gcc tac gcc gac 1586Ser Ser
Thr Tyr Gln Gln Ile Leu Asp Ala Val Ser Ala Tyr Ala Asp 400 405
410ggc ttc atc gac gtc gcg gcc aag tac acc ccc tcg gac ggc tcc ctg
1634Gly Phe Ile Asp Val Ala Ala Lys Tyr Thr Pro Ser Asp Gly Ser
Leu415 420 425 430gcc gag cag tac acg cgc gac tcg ggc cag ccg atc
tcg gcc aag gac 1682Ala Glu Gln Tyr Thr Arg Asp Ser Gly Gln Pro Ile
Ser Ala Lys Asp 435 440 445ctg acc tgg tcc tac gcc gcc ttc ctc tcg
gcc gcc gac cgc cgc gcg 1730Leu Thr Trp Ser Tyr Ala Ala Phe Leu Ser
Ala Ala Asp Arg Arg Ala 450 455 460ggc atc gtc ccg gcc ggc tgg tcc
gcc gag cac ggc aag acg ctg ccc 1778Gly Ile Val Pro Ala Gly Trp Ser
Ala Glu His Gly Lys Thr Leu Pro 465 470 475ggc tcg tgc tcg gcc gtc
cag gtc gcc ggc acc tac acc cag gcc acc 1826Gly Ser Cys Ser Ala Val
Gln Val Ala Gly Thr Tyr Thr Gln Ala Thr 480 485 490gcc acc tcc ttc
ccg ccc ggc cag acg ccc aac ccg acc agc gac acc 1874Ala Thr Ser Phe
Pro Pro Gly Gln Thr Pro Asn Pro Thr Ser Asp Thr495 500 505 510ccg
gcc ccg ttc ccc acg gcc tgc gcc gac tcc acc cag gtc ttc gtc 1922Pro
Ala Pro Phe Pro Thr Ala Cys Ala Asp Ser Thr Gln Val Phe Val 515 520
525acc ttc cgc gcc gag gtg acc acc cag tgg ggc cag tcg gtc aag gtc
1970Thr Phe Arg Ala Glu Val Thr Thr Gln Trp Gly Gln Ser Val Lys Val
530 535 540gtc ggc agc tcg tcc gag ctc ggc aac tgg gac gtc tcc aag
gcc ccg 2018Val Gly Ser Ser Ser Glu Leu Gly Asn Trp Asp Val Ser Lys
Ala Pro 545 550 555cgc ctg tcc gcg tcg gcc tac acg gcg tcg gac ccg
ctc tgg gcc atc 2066Arg Leu Ser Ala Ser Ala Tyr Thr Ala Ser Asp Pro
Leu Trp Ala Ile 560 565 570acg gtg ccc atg aag gcc ggc cag tcg gtg
cag tac aag ttc gtc aag 2114Thr Val Pro Met Lys Ala Gly Gln Ser Val
Gln Tyr Lys Phe Val Lys575 580 585 590gtc aac ggg gac ggg tcg atc
cag tgg gag tcg gac ccg aac cgc cag 2162Val Asn Gly Asp Gly Ser Ile
Gln Trp Glu Ser Asp Pro Asn Arg Gln 595 600 605ttc acg gtc agc tct
tcc tcc acg gcc agc ggc tgt gcc tgg cag acc 2210Phe Thr Val Ser Ser
Ser Ser Thr Ala Ser Gly Cys Ala Trp Gln Thr 610 615 620atc gag gcg
acc tgg cgg tagatatggg agtactggag caaaagcctc 2258Ile Glu Ala Thr
Trp Arg 625attcggagcc ttcggtggct cttgggtggg ggggggggtg gaaaagaggg
gaggtaatag 2318gatgatcatg agcgtttggg aaaaaa
234450628PRTChrysosporium lucknowense 50Met His Ala Leu Ser Ser Leu
Ala Val Leu Gly Ala Trp Ala Val Gln1 5 10 15Thr Val Leu Gly Arg Pro
Ala Thr Leu Ser Lys Arg Ala Thr Asp Ser 20 25 30Phe Ile Glu Thr Glu
Thr Pro Ile Ala Trp Glu Lys Leu Arg Cys Asn 35 40 45Ile Gly Ala Asn
Gly Cys Ala Ala Ser Gly Ala Ala Ala Gly Val Val 50 55 60Ile Ala Ser
Pro Ser Lys Ser Asp Pro Asp Tyr Phe Tyr Thr Trp Thr65 70 75 80Arg
Asp Ala Gly Leu Val Leu Thr Gly Ile Val Asp Ala Leu Ser Gln 85 90
95Asn Tyr Ser Ala Ala Leu Gln Thr Asn Ile Gln Asp Tyr Ile Ile Ala
100 105 110Gln Ala Lys Leu Gln Gly Val Ser Asn Pro Ser Gly Ser Leu
Ser Asp 115 120 125Gly Thr Gly Leu Gly Glu Pro Lys Phe Asn Val Asp
Leu Thr Gln Phe 130 135 140Thr Gly Asp Trp Gly Arg Pro Gln Arg Asp
Gly Pro Pro Leu Arg Ala145 150 155 160Ile Ala Leu Ile Arg Tyr Ala
Lys Trp Leu Ala Ser Asn Gly Tyr Lys 165 170 175Asp Thr Ala Asn Ser
Val Val Trp Pro Val Ile Lys Asn Asp Leu Ala 180 185 190Tyr Ala Ala
Gln Tyr Trp Asn Glu Thr Gly Phe Asp Leu Trp Glu Glu 195 200 205Val
Pro Gly Ser Ser Phe Phe Thr Ile Ala Ser Thr His Arg Ala Leu 210 215
220Val Glu Gly Ala Ala Leu Ala Ala Gln Leu Gly Thr Glu Cys Ser
Ala225 230 235 240Cys Ile Thr Val Ala Pro Gln Val Leu Cys Phe Gln
Gln Ser Phe Trp 245 250 255Asn Pro Ser Gly Gly Tyr Val Val Ser Asn
Ile Asn Gly Gly Asn Asn 260 265 270Arg Ser Gly Lys Asp Leu Asn Ser
Val Leu Ala Ser Ile His Thr Phe 275 280 285Asp Pro Ala Val Gly Cys
Asp Ser Val Thr Phe Gln Pro Cys Ser Asp 290 295 300Lys Ala Leu Ser
Asn His Lys Ala Tyr Val Asp Ser Phe Arg Ser Val305 310 315 320Tyr
Ala Ile Asn Ser Gly Ile Ala Gln Gly Lys Ala Val Ala Val Gly 325 330
335Arg Tyr Ser Glu Asp Val Tyr Tyr Asn Gly Asn Pro Trp Tyr Leu Ala
340 345 350Asn Phe Ala Ala Ala Glu Gln Leu Tyr Asp Ala Val Phe Val
Trp Lys 355 360 365Lys Gln Gln Ser Ile Glu Val Thr Gln Leu Ser Leu
Pro Phe Phe Lys 370 375 380Asp Leu Leu Pro Gly Ile Ser Thr Gly Thr
Tyr Thr Pro Ser Ser Ser385 390 395 400Thr Tyr Gln Gln Ile Leu Asp
Ala Val Ser Ala Tyr Ala Asp Gly Phe 405 410 415Ile Asp Val Ala Ala
Lys Tyr Thr Pro Ser Asp Gly Ser Leu Ala Glu 420 425 430Gln Tyr Thr
Arg Asp Ser Gly Gln Pro Ile Ser Ala Lys Asp Leu Thr 435 440 445Trp
Ser Tyr Ala Ala Phe Leu Ser Ala Ala Asp Arg Arg Ala Gly Ile 450 455
460Val Pro Ala Gly Trp Ser Ala Glu His Gly Lys Thr Leu Pro Gly
Ser465 470 475 480Cys Ser Ala Val Gln Val Ala Gly Thr Tyr Thr Gln
Ala Thr Ala Thr 485 490 495Ser Phe Pro Pro Gly Gln Thr Pro Asn Pro
Thr Ser Asp Thr Pro Ala 500 505 510Pro Phe Pro Thr Ala Cys Ala Asp
Ser Thr Gln Val Phe Val Thr Phe 515 520 525Arg Ala Glu Val Thr Thr
Gln Trp Gly Gln Ser Val Lys Val Val Gly 530 535 540Ser Ser Ser Glu
Leu Gly Asn Trp Asp Val Ser Lys Ala Pro Arg Leu545 550 555 560Ser
Ala Ser Ala Tyr Thr Ala Ser Asp Pro Leu Trp Ala Ile Thr Val 565 570
575Pro Met Lys Ala Gly Gln Ser Val Gln Tyr Lys Phe Val Lys Val Asn
580 585 590Gly Asp Gly Ser Ile Gln Trp Glu Ser Asp Pro Asn Arg Gln
Phe Thr 595 600 605Val Ser Ser Ser Ser Thr Ala Ser Gly Cys Ala Trp
Gln Thr Ile Glu 610 615 620Ala Thr Trp Arg625511884DNAChrysosporium
lucknowense 51atgcacgctc tctcgtcgct cgctgtcctc ggcgcctggg
ccgtccagac ggtcttgggc 60cgtccggcca ccctctcaaa gcgggccacc gactccttca
tcgagaccga gacgcctatc 120gcatgggaaa agctgcggtg caacatcggc
gctaacggct gtgcggcttc cggagccgct 180gccggtgtgg tcattgccag
cccgtccaag tcggatccag actacttcta cacctggact 240cgagatgccg
gcctggtcct gacgggtatc gtggacgccc tgtcccaaaa ctactcggcg
300gccctgcaga ccaacattca ggactacatc atcgcccagg ccaagctcca
gggtgtttcg 360aacccctccg gtagcctctc ggacggcacc ggtcttggcg
agcccaagtt caatgtcgat 420ctcacccagt tcacgggcga ctggggccgg
ccgcagcgcg acggtccgcc tctccgggcc 480atcgccctca tccgctacgc
caagtggctg gcttccaacg gttacaagga cacggccaac 540agcgtcgtct
ggcccgtcat caagaacgac ctggcctatg ccgctcagta ttggaacgag
600actggtttcg acctgtggga ggaggttccc ggcagctcgt tcttcaccat
tgccagcacg 660caccgagcct tggtagaggg agctgccctc gctgcccagc
tcggcaccga atgcagcgcc 720tgcatcaccg tcgcgcccca agtcctctgc
ttccagcaga gcttctggaa cccgtcgggc 780ggttacgttg tctcgaacat
caacggcggc aacaaccggt ccggcaagga tctcaactcg 840gtcctggcct
ccatccacac ctttgacccg gcggtcggct gcgactcggt caccttccag
900ccctgcagcg acaaggcgct ctccaaccac aaggcctacg tcgactcctt
ccgcagcgtc 960tacgccatca actcgggcat tgcccagggc aaggccgtcg
ccgtgggccg ctactcggag 1020gacgtctact acaacggcaa cccgtggtac
ctggccaact tcgcggccgc cgagcagctc 1080tacgacgccg tcttcgtctg
gaagaagcag cagtccatcg aggtcaccca gctgtccctc 1140cccttcttca
aggacctgct ccccggcatc tccaccggca cctacacccc gtcgtcgtcg
1200acgtaccagc agatcctcga cgccgtctcg gcctacgccg acggcttcat
cgacgtcgcg 1260gccaagtaca ccccctcgga cggctccctg gccgagcagt
acacgcgcga ctcgggccag 1320ccgatctcgg ccaaggacct gacctggtcc
tacgccgcct tcctctcggc cgccgaccgc 1380cgcgcgggca tcgtcccggc
cggctggtcc gccgagcacg gcaagacgct gcccggctcg 1440tgctcggccg
tccaggtcgc cggcacctac acccaggcca ccgccacctc cttcccgccc
1500ggccagacgc ccaacccgac cagcgacacc ccggccccgt tccccacggc
ctgcgccgac 1560tccacccagg tcttcgtcac cttccgcgcc gaggtgacca
cccagtgggg ccagtcggtc 1620aaggtcgtcg gcagctcgtc cgagctcggc
aactgggacg tctccaaggc cccgcgcctg 1680tccgcgtcgg cctacacggc
gtcggacccg ctctgggcca tcacggtgcc catgaaggcc 1740ggccagtcgg
tgcagtacaa gttcgtcaag gtcaacgggg acgggtcgat ccagtgggag
1800tcggacccga accgccagtt cacggtcagc tcttcctcca cggccagcgg
ctgtgcctgg 1860cagaccatcg aggcgacctg gcgg
1884521637DNAChrysosporium
lucknowenseCDS(148)..(864)CDS(1194)..(1601) 52gtataagatt gacgacgatc
cgggtagaca aaccgacttg cagatctttg caacccatat 60ttgtactagc ctccaggttc
atcctgccca ggtctcctgg cctagagttg tatacatttg 120ctttgttaac
ggagcctttt cctcgtc atg aga tcc tcc gcc atc tgg tcc acc 174 Met Arg
Ser Ser Ala Ile Trp Ser Thr 1 5gcc ttc gcg gcg ttg gca cac act gcc
gcc gcc gtc act ggt gct gcc 222Ala Phe Ala Ala Leu Ala His Thr Ala
Ala Ala Val Thr Gly Ala Ala10 15 20 25gag ggc ttc gcc aaa ggg gtc
acg gga ggt ggc aat gcg act ccc gtc 270Glu Gly Phe Ala Lys Gly Val
Thr Gly Gly Gly Asn Ala Thr Pro Val 30 35 40tat ccc aaa acc acg gcc
gag ctg gtc tcg tac ctg ggt gac tcg gag 318Tyr Pro Lys Thr Thr Ala
Glu Leu Val Ser Tyr Leu Gly Asp Ser Glu 45 50 55cct cgt gtc atc atg
ctg gac cgg aca ttc gac ttc acc gga acc gag 366Pro Arg Val Ile Met
Leu Asp Arg Thr Phe Asp Phe Thr Gly Thr Glu 60 65 70ggc acc gcg acc
gag acg ggc tgc gcc ccc tgg ggc acc ggc tcg ggc 414Gly Thr Ala Thr
Glu Thr Gly Cys Ala Pro Trp Gly Thr Gly Ser Gly 75 80 85tgc cag ctg
gcc ctg aac aag aac gac tgg tgc acc cgg gag cag ccg 462Cys Gln Leu
Ala Leu Asn Lys Asn Asp Trp Cys Thr Arg Glu Gln Pro90 95 100 105aac
gcg ccc aaa gtg agc atc acg tac gac aag gcc ggc att ctc ggc 510Asn
Ala Pro Lys Val Ser Ile Thr Tyr Asp Lys Ala Gly Ile Leu Gly 110 115
120atc acc gtc aag tcc aac aag tcc ctc atc ggc gtc ggc aac aag ggc
558Ile Thr Val Lys Ser Asn Lys Ser Leu Ile Gly Val Gly Asn Lys Gly
125 130 135gtc atc aag ggc aaa ggc atc cgc atg gtc agc ggc acc agc
aac gtc 606Val Ile Lys Gly Lys Gly Ile Arg Met Val Ser Gly Thr Ser
Asn Val 140 145 150atc atc cag aac atc cac atc acc aac ctc aac ccg
cag tac gtc tgg 654Ile Ile Gln Asn Ile His Ile Thr Asn Leu Asn Pro
Gln Tyr Val Trp 155 160 165ggc ggc gac gcc atc acg ctc gac aat acc
gag atg atc tgg atc gac 702Gly Gly Asp Ala Ile Thr Leu Asp Asn Thr
Glu Met Ile Trp Ile Asp170 175 180 185cac gtc acc acc tcc ctc atc
ggc cgc cag cac gtc gtc ctg ggt aac 750His Val Thr Thr Ser Leu Ile
Gly Arg Gln His Val Val Leu Gly Asn 190 195 200aac ccg agc ggc cgc
gtc acc atc tcc aac agc aag ttc gac ggc cag 798Asn Pro Ser Gly Arg
Val Thr Ile Ser Asn Ser Lys Phe Asp Gly Gln 205 210 215acc tcc tgg
tcc gcc acc tgc aac acc tac cac tac tgg ggc ctg tac 846Thr Ser Trp
Ser Ala Thr Cys Asn Thr Tyr His Tyr Trp Gly Leu Tyr 220 225 230ttt
aca ggg tcc aac gac gtgagttttc accctccttt tcttcttccc 894Phe Thr Gly
Ser Asn Asp 235agcataagtc ataaatcaaa aaaaagaaaa aaaagaaaaa
agaaaaaaag caaaaagaaa 954aaaaagaaaa aaagaaaaaa aagaaaaaaa
gaaaaaaaag aaaaaaagaa aaaaagataa 1014aaaagaagaa agaaaaaaag
aaaaaaagaa aaaaaagaaa aaaaaaaaag aaaaaaagaa 1074aaaaaaaaga
aaaaaaagaa gaacagaaca aaagaaacga agaaaaagga aaataaaaaa
1134gaatagaata atgcctaaat aggtacagat gctaacggaa ttgcttccac
ccaatccag 1193atg atc acc ttc aag aac aac ctc atc acc cac atg tcg
ggc cgc agc 1241Met Ile Thr Phe Lys Asn Asn Leu Ile Thr His Met Ser
Gly Arg Ser240 245 250 255ccc aag gtg gcg ggc aac acg ctg ctg cac
gcg gtc aac aac tac tgg 1289Pro Lys Val Ala Gly Asn Thr Leu Leu His
Ala Val Asn Asn Tyr Trp 260 265 270tac gac agc aag ggc cac aac ttc
gag ctg gac gcg ggg gcc atg gtg 1337Tyr Asp Ser Lys Gly His Asn Phe
Glu Leu Asp Ala Gly Ala Met Val 275 280 285ctc gtc gag ggc agc gtg
ttc cag aac gtc gag gcg ccg ctc cag tcc 1385Leu Val Glu Gly Ser Val
Phe Gln Asn Val Glu Ala Pro Leu Gln Ser 290 295 300ggc ggc gcc ggc
cgg ctc ttc gcc agc tcc ggc acc gcc tcg gcc tgc 1433Gly Gly Ala Gly
Arg Leu Phe Ala Ser Ser Gly Thr Ala Ser Ala Cys 305 310 315gcc ccg
tac ctc ggc cgc gtg tgc cag gcc aac ggc ttc ggc agc tcc 1481Ala Pro
Tyr Leu Gly Arg Val Cys Gln Ala Asn Gly Phe Gly Ser Ser320 325 330
335ggc tcc ctc tcc ggc tcc gac acc tcc ttc ctg cag tac ttc cag aac
1529Gly Ser Leu Ser Gly Ser Asp Thr Ser Phe Leu Gln Tyr Phe Gln Asn
340 345 350aag aac atc gcc ggc gcg aca gac tac aac acg gcc aag aac
gtc ctc 1577Lys Asn Ile Ala Gly Ala Thr Asp Tyr Asn Thr Ala Lys Asn
Val Leu 355 360 365gac acc gcc gga ttt ggc aca atc tgatcgatcg
tgttgcgttt ccttggaaaa 1631Asp Thr Ala Gly Phe Gly Thr Ile 370
375ggtgga 1637 53375PRTChrysosporium lucknowense 53Met Arg Ser Ser
Ala Ile Trp Ser Thr Ala Phe Ala Ala Leu Ala His1 5 10 15Thr Ala Ala
Ala Val Thr Gly Ala Ala Glu Gly Phe Ala Lys Gly Val 20 25 30Thr Gly
Gly Gly Asn Ala Thr Pro Val Tyr Pro Lys Thr Thr Ala Glu 35 40 45Leu
Val Ser Tyr Leu Gly Asp Ser Glu Pro Arg Val Ile Met Leu Asp 50 55
60Arg Thr Phe Asp Phe Thr Gly Thr Glu Gly Thr Ala Thr Glu Thr Gly65
70 75 80Cys Ala Pro Trp Gly Thr Gly Ser Gly Cys Gln Leu Ala Leu Asn
Lys 85 90 95Asn Asp Trp Cys Thr Arg Glu Gln Pro Asn Ala Pro Lys Val
Ser Ile 100 105 110Thr Tyr Asp Lys Ala Gly Ile Leu Gly Ile Thr Val
Lys Ser Asn Lys 115 120 125Ser Leu Ile Gly Val Gly Asn Lys Gly Val
Ile Lys Gly Lys Gly Ile 130 135 140Arg Met Val Ser Gly Thr Ser Asn
Val Ile Ile Gln Asn Ile His Ile145 150 155 160Thr Asn Leu Asn Pro
Gln Tyr Val Trp Gly Gly Asp Ala Ile Thr Leu 165 170 175Asp Asn Thr
Glu Met Ile Trp Ile Asp His Val Thr Thr Ser Leu Ile 180 185 190Gly
Arg Gln His Val Val Leu Gly Asn Asn Pro Ser Gly Arg Val Thr 195 200
205Ile Ser Asn Ser Lys Phe Asp Gly Gln Thr Ser Trp Ser Ala Thr Cys
210 215 220Asn Thr Tyr His Tyr Trp Gly Leu Tyr Phe Thr Gly Ser Asn
Asp Met225 230 235 240Ile Thr Phe Lys Asn Asn Leu Ile Thr His Met
Ser Gly Arg Ser Pro 245 250 255Lys Val Ala Gly Asn Thr Leu Leu His
Ala Val Asn Asn Tyr Trp Tyr 260 265 270Asp Ser Lys Gly His Asn Phe
Glu Leu Asp Ala Gly Ala Met Val Leu 275 280 285Val Glu Gly Ser Val
Phe Gln Asn Val Glu Ala Pro Leu Gln Ser Gly 290 295 300Gly Ala Gly
Arg Leu Phe Ala Ser Ser Gly Thr Ala Ser Ala Cys Ala305 310 315
320Pro Tyr Leu Gly Arg Val Cys Gln Ala Asn Gly Phe Gly Ser Ser Gly
325 330 335Ser Leu Ser Gly Ser Asp Thr Ser Phe Leu Gln Tyr Phe Gln
Asn Lys 340 345 350Asn Ile Ala Gly Ala Thr Asp Tyr Asn Thr Ala Lys
Asn Val Leu Asp 355 360 365Thr Ala Gly Phe Gly Thr Ile 370
375541125DNAChrysosporium lucknowense 54atgagatcct ccgccatctg
gtccaccgcc ttcgcggcgt tggcacacac tgccgccgcc 60gtcactggtg ctgccgaggg
cttcgccaaa ggggtcacgg gaggtggcaa tgcgactccc 120gtctatccca
aaaccacggc cgagctggtc tcgtacctgg gtgactcgga gcctcgtgtc
180atcatgctgg accggacatt cgacttcacc ggaaccgagg gcaccgcgac
cgagacgggc 240tgcgccccct ggggcaccgg ctcgggctgc cagctggccc
tgaacaagaa cgactggtgc 300acccgggagc agccgaacgc gcccaaagtg
agcatcacgt acgacaaggc cggcattctc 360ggcatcaccg tcaagtccaa
caagtccctc atcggcgtcg gcaacaaggg cgtcatcaag 420ggcaaaggca
tccgcatggt cagcggcacc agcaacgtca tcatccagaa catccacatc
480accaacctca acccgcagta cgtctggggc ggcgacgcca tcacgctcga
caataccgag 540atgatctgga tcgaccacgt caccacctcc ctcatcggcc
gccagcacgt cgtcctgggt 600aacaacccga gcggccgcgt caccatctcc
aacagcaagt tcgacggcca gacctcctgg 660tccgccacct gcaacaccta
ccactactgg ggcctgtact ttacagggtc caacgacatg 720atcaccttca
agaacaacct catcacccac atgtcgggcc gcagccccaa ggtggcgggc
780aacacgctgc tgcacgcggt caacaactac tggtacgaca gcaagggcca
caacttcgag 840ctggacgcgg gggccatggt gctcgtcgag ggcagcgtgt
tccagaacgt cgaggcgccg 900ctccagtccg gcggcgccgg ccggctcttc
gccagctccg gcaccgcctc ggcctgcgcc 960ccgtacctcg gccgcgtgtg
ccaggccaac ggcttcggca gctccggctc cctctccggc 1020tccgacacct
ccttcctgca gtacttccag aacaagaaca tcgccggcgc gacagactac
1080aacacggcca agaacgtcct cgacaccgcc ggatttggca caatc
1125551904DNAChrysosporium
lucknowenseCDS(179)..(330)CDS(443)..(591)CDS(732)..(1708)
55gtataagagc actgtgatcg ttcacaaagc cggcgtactc ctctagagca tctatcatca
60acatcaccag aaaggtcaag accaggtggt tgccatatcc agtcgcaaaa gagccaaaga
120gcgaaggagc acgaaagcac agcccaatca ttccctgctt tgctacttct tctccacc
178atg gtg act ccc tct ctg aag aag gca gct ctg gct gcc ctg tcc ctc
226Met Val Thr Pro Ser Leu Lys Lys Ala Ala Leu Ala Ala Leu Ser Leu1
5 10 15ttc ccg ctc ctc tcc cta gct tcg ccg gtg cca gca aca gcc gaa
gct 274Phe Pro Leu Leu Ser Leu Ala Ser Pro Val Pro Ala Thr Ala Glu
Ala 20 25 30agc gta cag acc cgg cag tca tcg gga tac aag aac atc gtg
tac ttt 322Ser Val Gln Thr Arg Gln Ser Ser Gly Tyr Lys Asn Ile Val
Tyr Phe 35 40 45acc aac tg gtatgtacgc ccggccgtca acccctccgc
cctccgctcc 370Thr Asn Trp 50ctctctccac cgtttccggg ccagacgaac
cactcaagac tttacttaat tactcactgg 430ccaatcgaac ag g ggt att tac ggc
cgc aac tac cag cca gac cag ctg 479 Gly Ile Tyr Gly Arg Asn Tyr Gln
Pro Asp Gln Leu 55 60ccc gcg tct cag ctc acc cac gtg ctc tat tcc
ttt gcc aac atc agg 527Pro Ala Ser Gln Leu Thr His Val Leu Tyr Ser
Phe Ala Asn Ile Arg 65 70 75tcc aac ggc gag gta ttc ctc tcc gac acg
tat gcg gac cta gag aag 575Ser Asn Gly Glu Val Phe Leu Ser Asp Thr
Tyr Ala Asp Leu Glu Lys80 85 90 95cac tat ccc aac gac t gtaagattaa
ttatatccct cttccatcca cccatgtact 631His Tyr Pro Asn Asp
100cccaacccaa caaccagtca tccattttat ccaccccctt tctttgtaat
cttctctctt 691ccccccttct gttccactga cagcacggct cccccggcag ca tgg
aat gac gtc 745 Ser Trp Asn Asp Val 105gga aac aac gtg tac ggc tgc
gtc aag cag ctc ttc ctg ctc aag aag 793Gly Asn Asn Val Tyr Gly Cys
Val Lys Gln Leu Phe Leu Leu Lys Lys 110 115 120gcg aac cgc cag ctc
aag acg ctg ctc tcc atc ggc ggc tgg acc tac 841Ala Asn Arg Gln Leu
Lys Thr Leu Leu Ser Ile Gly Gly Trp Thr Tyr 125 130 135tcg gcc acc
ttc ccc gcg gcg gcc agc aca gcc gag agc cga gcc ctc 889Ser Ala Thr
Phe Pro Ala Ala Ala Ser Thr Ala Glu Ser Arg Ala Leu 140 145 150ttc
gcg tcg tcg gcg gtc cgg ctc ctg gcc gac ctg ggc ttc gac ggc 937Phe
Ala Ser Ser Ala Val Arg Leu Leu Ala Asp Leu Gly Phe Asp Gly 155 160
165ctc gac atc gac tgg gag tac ccg gcc aac gag cag gag gcg gcc aac
985Leu Asp Ile Asp Trp Glu Tyr Pro Ala Asn Glu Gln Glu Ala Ala
Asn170 175 180 185ttc gtc ctc ctg ctc aag gcg gtc cgc tcg gcc ctg
gac gac tac gcg 1033Phe Val Leu Leu Leu Lys Ala Val Arg Ser Ala Leu
Asp Asp Tyr Ala 190 195 200gcc cag cac gcg ccc ggc tac cac ttc ctc
ctc acc atc gcc tcg ccg 1081Ala Gln His Ala Pro Gly Tyr His Phe Leu
Leu Thr Ile Ala Ser Pro 205 210 215gcc ggc ccg agc aac tac ggg cac
ctg ccg ctg cgc gac atc gcc ggc 1129Ala Gly Pro Ser Asn Tyr Gly His
Leu Pro Leu Arg Asp Ile Ala Gly 220 225 230gtg atc gac ttc ttc aac
ttc atg ggc tac gac tac gcc ggc tcg tgg 1177Val Ile Asp Phe Phe Asn
Phe Met Gly Tyr Asp Tyr Ala Gly Ser Trp 235 240 245agc acc gcg gcg
gga cac cag gcc aac ctg tac ccg acc gcc gac gcg 1225Ser Thr Ala Ala
Gly His Gln Ala Asn Leu Tyr Pro Thr Ala Asp Ala250 255 260 265ggc
agg acg ccc ttc tcg acc gac aag gcc ctg tcc gac tac gtc gcc 1273Gly
Arg Thr Pro Phe Ser Thr Asp Lys Ala Leu Ser Asp Tyr Val Ala 270 275
280gcc ggc gtc gac ccg gcc aag atc gtg ctc ggc atg ccc atc tac ggc
1321Ala Gly Val Asp Pro Ala Lys Ile Val Leu Gly Met Pro Ile Tyr Gly
285 290 295cgc tcc ttc gag gcc acc gac ggc ctg ggc aag ccc ttc acc
ggc gtc 1369Arg Ser Phe Glu Ala Thr Asp Gly Leu Gly Lys Pro Phe Thr
Gly Val 300 305 310ggc cag ggc agc tgg gag agc ggc gtg tgg gac tac
aag gtg ctg ccc 1417Gly Gln Gly Ser Trp Glu Ser Gly Val Trp Asp Tyr
Lys Val Leu Pro 315 320 325cgg gcc ggc gcc acc gtc cag tac gac gag
gag gcc ggc gcg acc tac 1465Arg Ala Gly Ala Thr Val Gln Tyr Asp Glu
Glu Ala Gly Ala Thr Tyr330 335 340 345agc tac gac ccg gcc acg cgc
gag ctc atc agc ttc gac acg gtc gac 1513Ser Tyr Asp Pro Ala Thr Arg
Glu Leu Ile Ser Phe Asp Thr Val Asp 350 355 360atg gtc aag aag aag
gtc gac tac gtc aag cag aag ggc ttc gcc ggc 1561Met Val Lys Lys Lys
Val Asp Tyr Val Lys Gln Lys Gly Phe Ala Gly 365 370 375agc atg ttc
tgg gag gcc tcc gcc gac cgc acc ggc gac cag agc ctc 1609Ser Met Phe
Trp Glu Ala Ser Ala Asp Arg Thr Gly Asp Gln Ser Leu 380 385 390atc
ggc gcc agc ttc ggc gcc ctc ggc ggc atc gac cag tcc cag aac 1657Ile
Gly Ala Ser Phe Gly Ala Leu Gly Gly Ile Asp Gln Ser Gln Asn 395 400
405cag ctg tcg tac ccg gac agc aag tat gat aac ctc agg gcc ggc ttc
1705Gln Leu Ser Tyr Pro Asp Ser Lys Tyr Asp Asn Leu Arg Ala Gly
Phe410 415 420 425ccg tgaggcccta tgacctcccg agtcctggtc gagagagggg
agctggagag 1758Proagagagagag agagagagag agagggagcc tgaagccgta
ctgattcggc aaaccatttt 1818gaagcctgga catgtttacc ttcgttttct
cttcttggat ggtttgcatt attattagcc 1878gcttttcttt tttctctcca ttcgtt
190456426PRTChrysosporium lucknowense 56Met Val Thr Pro Ser Leu Lys
Lys Ala Ala Leu Ala Ala Leu Ser Leu1 5 10 15Phe Pro Leu Leu Ser Leu
Ala Ser Pro Val Pro Ala Thr Ala Glu Ala 20 25 30Ser Val Gln Thr Arg
Gln Ser Ser Gly Tyr Lys Asn Ile Val Tyr Phe 35 40 45Thr Asn Trp Gly
Ile Tyr Gly Arg Asn Tyr Gln Pro Asp Gln Leu Pro 50 55 60Ala Ser Gln
Leu Thr His Val Leu Tyr Ser Phe Ala Asn Ile Arg Ser65 70 75 80Asn
Gly Glu Val Phe Leu Ser Asp Thr Tyr Ala Asp Leu Glu Lys His 85 90
95Tyr Pro Asn Asp Ser Trp Asn Asp Val Gly Asn Asn Val Tyr Gly Cys
100 105 110Val Lys Gln Leu Phe Leu Leu Lys Lys Ala Asn Arg Gln Leu
Lys Thr 115 120 125Leu Leu Ser Ile Gly Gly Trp Thr Tyr Ser Ala Thr
Phe Pro Ala Ala 130 135 140Ala Ser Thr Ala Glu Ser Arg Ala Leu Phe
Ala Ser Ser Ala Val Arg145 150 155 160Leu Leu Ala Asp Leu Gly Phe
Asp Gly Leu Asp Ile Asp Trp Glu Tyr 165 170 175Pro Ala Asn Glu Gln
Glu Ala Ala Asn Phe Val Leu Leu Leu Lys Ala 180 185 190Val Arg Ser
Ala Leu Asp Asp Tyr Ala Ala Gln His Ala Pro Gly Tyr 195 200 205His
Phe Leu Leu Thr Ile Ala Ser Pro Ala Gly Pro Ser Asn Tyr Gly 210 215
220His Leu Pro Leu Arg Asp Ile Ala Gly Val Ile Asp Phe Phe Asn
Phe225 230 235 240Met Gly Tyr Asp Tyr Ala Gly Ser Trp Ser Thr Ala
Ala Gly His Gln 245 250 255Ala Asn Leu Tyr Pro Thr Ala Asp Ala Gly
Arg Thr Pro Phe Ser Thr 260 265 270Asp Lys Ala Leu Ser Asp Tyr Val
Ala Ala Gly Val Asp Pro Ala Lys 275 280 285Ile Val Leu Gly Met Pro
Ile Tyr Gly Arg Ser Phe Glu Ala Thr Asp 290 295 300Gly Leu Gly Lys
Pro Phe Thr Gly Val Gly Gln Gly Ser Trp Glu Ser305 310 315 320Gly
Val Trp Asp Tyr Lys Val Leu Pro Arg Ala Gly Ala Thr Val Gln 325 330
335Tyr Asp Glu Glu Ala Gly Ala Thr Tyr Ser Tyr Asp Pro Ala Thr Arg
340 345 350Glu Leu Ile Ser Phe Asp Thr Val Asp Met Val Lys Lys Lys
Val Asp 355 360 365Tyr Val Lys Gln Lys Gly Phe Ala Gly Ser Met Phe
Trp Glu Ala Ser 370 375 380Ala Asp Arg Thr Gly Asp Gln Ser Leu Ile
Gly Ala Ser Phe Gly Ala385 390 395 400Leu Gly Gly Ile Asp Gln Ser
Gln Asn Gln Leu Ser Tyr Pro Asp Ser 405 410 415Lys Tyr Asp Asn Leu
Arg Ala Gly Phe Pro 420 425571278DNAChrysosporium lucknowense
57atggtgactc cctctctgaa gaaggcagct ctggctgccc tgtccctctt cccgctcctc
60tccctagctt cgccggtgcc agcaacagcc gaagctagcg tacagacccg gcagtcatcg
120ggatacaaga acatcgtgta ctttaccaac tggggtattt acggccgcaa
ctaccagcca 180gaccagctgc ccgcgtctca gctcacccac gtgctctatt
cctttgccaa catcaggtcc 240aacggcgagg tattcctctc cgacacgtat
gcggacctag agaagcacta tcccaacgac 300tcatggaatg acgtcggaaa
caacgtgtac ggctgcgtca agcagctctt cctgctcaag 360aaggcgaacc
gccagctcaa gacgctgctc tccatcggcg gctggaccta ctcggccacc
420ttccccgcgg cggccagcac agccgagagc cgagccctct tcgcgtcgtc
ggcggtccgg 480ctcctggccg acctgggctt cgacggcctc gacatcgact
gggagtaccc ggccaacgag 540caggaggcgg ccaacttcgt cctcctgctc
aaggcggtcc gctcggccct ggacgactac 600gcggcccagc acgcgcccgg
ctaccacttc ctcctcacca tcgcctcgcc ggccggcccg 660agcaactacg
ggcacctgcc gctgcgcgac atcgccggcg tgatcgactt cttcaacttc
720atgggctacg actacgccgg ctcgtggagc accgcggcgg gacaccaggc
caacctgtac 780ccgaccgccg acgcgggcag gacgcccttc tcgaccgaca
aggccctgtc cgactacgtc 840gccgccggcg tcgacccggc caagatcgtg
ctcggcatgc ccatctacgg ccgctccttc 900gaggccaccg acggcctggg
caagcccttc accggcgtcg gccagggcag ctgggagagc 960ggcgtgtggg
actacaaggt gctgccccgg gccggcgcca ccgtccagta cgacgaggag
1020gccggcgcga cctacagcta cgacccggcc acgcgcgagc tcatcagctt
cgacacggtc 1080gacatggtca agaagaaggt cgactacgtc aagcagaagg
gcttcgccgg cagcatgttc 1140tgggaggcct ccgccgaccg caccggcgac
cagagcctca tcggcgccag cttcggcgcc 1200ctcggcggca tcgaccagtc
ccagaaccag ctgtcgtacc cggacagcaa gtatgataac 1260ctcagggccg gcttcccg
1278583200DNAChrysosporium lucknowenseCDS(118)..(2838) 58aggcccttcc
cttgtgccta ttaccttacc cgaattgtta attatgtgtc taccaggcac 60ccaggagggc
attgtactca ctcactctcc gagagaacat cgaatccgga aaacaga 117atg cgg gcg
aaa agg gcg gcc att gcc ctt ttc cta gct tgg ccg gcc 165Met Arg Ala
Lys Arg Ala Ala Ile Ala Leu Phe Leu Ala Trp Pro Ala1 5 10 15ata aca
cgg gcc gcg tcc ctc gtc tct ttt ccg gga aac acg gcc gcg 213Ile Thr
Arg Ala Ala Ser Leu Val Ser Phe Pro Gly Asn Thr Ala Ala 20 25 30att
cca tcg tgg gaa ctg cag tct tct gtg gag gcc ggc act gac ctc 261Ile
Pro Ser Trp Glu Leu Gln Ser Ser Val Glu Ala Gly Thr Asp Leu 35 40
45gaa ggt ctt tca caa gtt ggg gtc gat aca aag tca tgg cac cac atc
309Glu Gly Leu Ser Gln Val Gly Val Asp Thr Lys Ser Trp His His Ile
50 55 60aaa aca tgg aga tgt acg ctc atg ggc tgc ctg atc gag gcc ggc
gtc 357Lys Thr Trp Arg Cys Thr Leu Met Gly Cys Leu Ile Glu Ala Gly
Val65 70 75 80tac aat gaa gat caa ctc ttc tac tcg gag aac ctg cgc
aaa gtc aac 405Tyr Asn Glu Asp Gln Leu Phe Tyr Ser Glu Asn Leu Arg
Lys Val Asn 85 90 95gag acg caa ttc ctg gtg cca tgg att tat cgc aac
gag ttc tcc ctg 453Glu Thr Gln Phe Leu Val Pro Trp Ile Tyr Arg Asn
Glu Phe Ser Leu 100 105 110cat ccc ggc ccg ggc agg cac ttc ttc ctg
cag aca cac ggc atc tct 501His Pro Gly Pro Gly Arg His Phe Phe Leu
Gln Thr His Gly Ile Ser 115 120 125tcg cga gcc gac atc ttt ctg aac
ggc aaa cga gtg gcc aat tct tcg 549Ser Arg Ala Asp Ile Phe Leu Asn
Gly Lys Arg Val Ala Asn Ser Ser 130 135 140gag cag gcg ggt tcg tac
gtt ggc cgg att tac gac atc acc aaa cag 597Glu Gln Ala Gly Ser Tyr
Val Gly Arg Ile Tyr Asp Ile Thr Lys Gln145 150 155 160gtt cac aga
gaa aat gcg ctg gca att cag gtg tac ccg acc aac tac 645Val His Arg
Glu Asn Ala Leu Ala Ile Gln Val Tyr Pro Thr Asn Tyr 165 170 175tac
tac gac ctg gcg ctc gga tgg gtg gat tgg aac cca tgg ccg gcc 693Tyr
Tyr Asp Leu Ala Leu Gly Trp Val Asp Trp Asn Pro Trp Pro Ala 180 185
190gac aac ggc acc ggc gtc tgg cgg gac gtc gag atc aag cag acc ggg
741Asp Asn Gly Thr Gly Val Trp Arg Asp Val Glu Ile Lys Gln Thr Gly
195 200 205gcg gtc atg ctg gag ccg ttg cgg gtt gtg acg cac ttg ggg
ccg acg 789Ala Val Met Leu Glu Pro Leu Arg Val Val Thr His Leu Gly
Pro Thr 210 215 220ctc gga gat gag cca gca aac gtg acc ctc aag gcc
caa gca cat aac 837Leu Gly Asp Glu Pro Ala Asn Val Thr Leu Lys Ala
Gln Ala His Asn225 230 235 240ctt gaa aac tac acc gtc acc atc acg
gcc acc ggc cac atc gcc ctg 885Leu Glu Asn Tyr Thr Val Thr Ile Thr
Ala Thr Gly His Ile Ala Leu 245 250 255gac ccc cac tct gat aac caa
caa cac cat cac cac cac cac cct att 933Asp Pro His Ser Asp Asn Gln
Gln His His His His His His Pro Ile 260 265 270acc tgg cac cga acg
tac acg ctc cca ccc ctt tcc acc acc gag ctc 981Thr Trp His Arg Thr
Tyr Thr Leu Pro Pro Leu Ser Thr Thr Glu Leu 275 280 285gcc ctc tcc
gcg acc att gcc gac ccc cag atc tgg tgg ccc cgc caa 1029Ala Leu Ser
Ala Thr Ile Ala Asp Pro Gln Ile Trp Trp Pro Arg Gln 290 295 300tgg
ggt ggt caa ccg ctc tac acc gcc gcc ttg acc gtc acc acg cgc 1077Trp
Gly Gly Gln Pro Leu Tyr Thr Ala Ala Leu Thr Val Thr Thr Arg305 310
315 320ggc acc acc acg ccg tcc gac cgg tcc acg gcc acc ttc ggc ctg
cgc 1125Gly Thr Thr Thr Pro Ser Asp Arg Ser Thr Ala Thr Phe Gly Leu
Arg 325 330 335acc gtc gag gcg agg ctg aac ccg acc tac aac gac acc
acc ttc cac 1173Thr Val Glu Ala Arg Leu Asn Pro Thr Tyr Asn Asp Thr
Thr Phe His 340 345 350gtc aac tcg cgc ccc ttc cag gtc ctg ggc gcc
ggc tac tcc ccc aac 1221Val Asn Ser Arg Pro Phe Gln Val Leu Gly Ala
Gly Tyr Ser Pro Asn 355 360 365gtc ttc ctc cgc gcg ccg cca cgg acc
gcc gcg gcc gac tac acg gcc 1269Val Phe Leu Arg Ala Pro Pro Arg Thr
Ala Ala Ala Asp Tyr Thr Ala 370 375 380acg ctc cag tac gta ctc gac
ctg ggc ctg aac acg atc cgg ctc gag 1317Thr Leu Gln Tyr Val Leu Asp
Leu Gly Leu Asn Thr Ile Arg Leu Glu385 390 395 400ggc aag gac gag
cac tcg gaa ctg tac gcg gcg gcg gac cgg ctg ggc 1365Gly Lys Asp Glu
His Ser Glu Leu Tyr Ala Ala Ala Asp Arg Leu Gly 405 410 415gtc atg
atc ctt gcc ggc tgg gag tgc tgc gac aag tgg gag gcg tgg 1413Val Met
Ile Leu Ala Gly Trp Glu Cys Cys Asp Lys Trp Glu Ala Trp 420 425
430agc tac aac ggc gac ctg gcc atc gac ccg gtg ccg gtc tgg gac gac
1461Ser Tyr Asn Gly Asp Leu Ala Ile Asp Pro Val Pro Val Trp Asp Asp
435 440 445gcc gac tac gcc gtc gcg cgt gcc gcc atg gcc cac gag gcc
cgc atg 1509Ala Asp Tyr Ala Val Ala Arg Ala Ala Met Ala His Glu Ala
Arg Met 450 455 460ctc cag ccc cat ccc agc ctg ctg ggc ttc ctc atc
ggc agc gac ttc 1557Leu Gln Pro His Pro Ser Leu Leu Gly Phe Leu Ile
Gly Ser Asp Phe465 470 475 480tgg ccc gac gac cgc gcc acc gcc gcc
tac ctc gat gcc ctg gaa gcc 1605Trp Pro Asp Asp Arg Ala Thr Ala Ala
Tyr Leu Asp Ala Leu Glu Ala 485 490 495gcc gac tgg gcc gcc ccc gtg
ctg gcc tcg gcc tcg aag cgc ggg tac 1653Ala Asp Trp Ala Ala Pro Val
Leu Ala Ser Ala Ser Lys Arg Gly Tyr
500 505 510ccc gag cgg acc ggt ccg gga ggg ctc aag atg gac ggg ccg
tac gac 1701Pro Glu Arg Thr Gly Pro Gly Gly Leu Lys Met Asp Gly Pro
Tyr Asp 515 520 525tgg gtg cca ccg ggg tac tgg tgg gac acg gag ccg
acc ggg gag agg 1749Trp Val Pro Pro Gly Tyr Trp Trp Asp Thr Glu Pro
Thr Gly Glu Arg 530 535 540ctc gga gcc gcg ttc ggt ttt ggg tcc gag
ctg ggc gcg ggc gtg ggc 1797Leu Gly Ala Ala Phe Gly Phe Gly Ser Glu
Leu Gly Ala Gly Val Gly545 550 555 560acg ccg gag ctg ggg tcg ctg
agg cgg ttc ctg agc gag ggg gac ctg 1845Thr Pro Glu Leu Gly Ser Leu
Arg Arg Phe Leu Ser Glu Gly Asp Leu 565 570 575gag gcc ctg tgg aag
cgg ccg aac gtg agc cta ttc cac atg tcg agg 1893Glu Ala Leu Trp Lys
Arg Pro Asn Val Ser Leu Phe His Met Ser Arg 580 585 590gag acc tcc
caa ttt acg acg agg gtg atc tac aac gcc ggg ctc tgg 1941Glu Thr Ser
Gln Phe Thr Thr Arg Val Ile Tyr Asn Ala Gly Leu Trp 595 600 605aac
cgg tgg ggc gcc ccc gcg agc ctg gaa gac tac ctg atg aag gcg 1989Asn
Arg Trp Gly Ala Pro Ala Ser Leu Glu Asp Tyr Leu Met Lys Ala 610 615
620cag ctg atg gac tac gag gcc acg cgg gct cag ttc gac gcc tac acg
2037Gln Leu Met Asp Tyr Glu Ala Thr Arg Ala Gln Phe Asp Ala Tyr
Thr625 630 635 640gcc atg tgg aac gcc gag agg ccg gcc acg ggc ctg
ata tac tgg atg 2085Ala Met Trp Asn Ala Glu Arg Pro Ala Thr Gly Leu
Ile Tyr Trp Met 645 650 655ctc aac aac gcc tgg ccg agc ctg cac tgg
aac ctg tgg gac tac tac 2133Leu Asn Asn Ala Trp Pro Ser Leu His Trp
Asn Leu Trp Asp Tyr Tyr 660 665 670atg cgt cct gcc ggt agt tac ttt
ggg gca aag gcc ggc agc agg gtg 2181Met Arg Pro Ala Gly Ser Tyr Phe
Gly Ala Lys Ala Gly Ser Arg Val 675 680 685gaa aat gtc gtg ttt gac
tac gtc agg agg gcg gtc tgg ctg gtc aac 2229Glu Asn Val Val Phe Asp
Tyr Val Arg Arg Ala Val Trp Leu Val Asn 690 695 700cgg tcc ctg gac
aag agc ggg acg agg aga gtg cag gtg gac gtc atg 2277Arg Ser Leu Asp
Lys Ser Gly Thr Arg Arg Val Gln Val Asp Val Met705 710 715 720gat
aag cat ggt aaa gtc cta tac aag gac acg acc gtc acc acg act 2325Asp
Lys His Gly Lys Val Leu Tyr Lys Asp Thr Thr Val Thr Thr Thr 725 730
735gtt cca aac acg agc aag gac att ctt tca cta gcg gga cct ctg ggc
2373Val Pro Asn Thr Ser Lys Asp Ile Leu Ser Leu Ala Gly Pro Leu Gly
740 745 750aac atc acc gac gtg gtg ttt ttg cgg ctg gtt ctc tac gat
gca caa 2421Asn Ile Thr Asp Val Val Phe Leu Arg Leu Val Leu Tyr Asp
Ala Gln 755 760 765cgg caa ggg cat gcg ctg agt cga aac gtg tac tgg
gtt gcg aag gag 2469Arg Gln Gly His Ala Leu Ser Arg Asn Val Tyr Trp
Val Ala Lys Glu 770 775 780cca gat gtg ctg gac tgg gac gag tcg gat
tgg tac ttc acg ccg gtg 2517Pro Asp Val Leu Asp Trp Asp Glu Ser Asp
Trp Tyr Phe Thr Pro Val785 790 795 800acc agc tac tcg gac tac act
gca cta aac aac atg gct cag gca gac 2565Thr Ser Tyr Ser Asp Tyr Thr
Ala Leu Asn Asn Met Ala Gln Ala Asp 805 810 815gtc acg gtt cac aat
gtc tgg cac ggc tca ggg gtt gac atg acg ctg 2613Val Thr Val His Asn
Val Trp His Gly Ser Gly Val Asp Met Thr Leu 820 825 830gag aac aaa
tct aag gtg cca gcg ttc ttt gtc agc ttg acg ctg gtg 2661Glu Asn Lys
Ser Lys Val Pro Ala Phe Phe Val Ser Leu Thr Leu Val 835 840 845gat
gta cac ggg aac gag gtc ctg ccg gtg acg tgg gag gac aac tat 2709Asp
Val His Gly Asn Glu Val Leu Pro Val Thr Trp Glu Asp Asn Tyr 850 855
860gtc acc cta tgg ccg ggt gag aaa ttg aca ctt cgg gga agg tca atg
2757Val Thr Leu Trp Pro Gly Glu Lys Leu Thr Leu Arg Gly Arg Ser
Met865 870 875 880ggg gta gat gca aag ccg tgg gag gtc att gtg aga
gga aag aac gtt 2805Gly Val Asp Ala Lys Pro Trp Glu Val Ile Val Arg
Gly Lys Asn Val 885 890 895gaa gca aag aga gtc cgt cac cgg tgg cat
gta tgagagaaga ccaacattac 2858Glu Ala Lys Arg Val Arg His Arg Trp
His Val 900 905atggatagcc ttaatggtat agatcctacc gaatgaaatc
actcattagc gaggaagtac 2918tttgtgagtt gggtgagcat attccctaaa
ttcattctac tgggggttgc attagaggca 2978cctcacgtgg aacaactgtt
aggattcaat ttcgcagcaa gaaccgtagt cccactcgtt 3038ttcaaatatc
catgtgagac atgtgatctg tctgtataac ggggcggcta cttgcccact
3098tcacccaaca cggaagacct aggtttggtg ggccaccata ttgcctttgc
atgactattt 3158cacaccccaa cgtgaaacga tgctctaaaa ccaacgaaaa aa
320059907PRTChrysosporium lucknowense 59Met Arg Ala Lys Arg Ala Ala
Ile Ala Leu Phe Leu Ala Trp Pro Ala1 5 10 15Ile Thr Arg Ala Ala Ser
Leu Val Ser Phe Pro Gly Asn Thr Ala Ala 20 25 30Ile Pro Ser Trp Glu
Leu Gln Ser Ser Val Glu Ala Gly Thr Asp Leu 35 40 45Glu Gly Leu Ser
Gln Val Gly Val Asp Thr Lys Ser Trp His His Ile 50 55 60Lys Thr Trp
Arg Cys Thr Leu Met Gly Cys Leu Ile Glu Ala Gly Val65 70 75 80Tyr
Asn Glu Asp Gln Leu Phe Tyr Ser Glu Asn Leu Arg Lys Val Asn 85 90
95Glu Thr Gln Phe Leu Val Pro Trp Ile Tyr Arg Asn Glu Phe Ser Leu
100 105 110His Pro Gly Pro Gly Arg His Phe Phe Leu Gln Thr His Gly
Ile Ser 115 120 125Ser Arg Ala Asp Ile Phe Leu Asn Gly Lys Arg Val
Ala Asn Ser Ser 130 135 140Glu Gln Ala Gly Ser Tyr Val Gly Arg Ile
Tyr Asp Ile Thr Lys Gln145 150 155 160Val His Arg Glu Asn Ala Leu
Ala Ile Gln Val Tyr Pro Thr Asn Tyr 165 170 175Tyr Tyr Asp Leu Ala
Leu Gly Trp Val Asp Trp Asn Pro Trp Pro Ala 180 185 190Asp Asn Gly
Thr Gly Val Trp Arg Asp Val Glu Ile Lys Gln Thr Gly 195 200 205Ala
Val Met Leu Glu Pro Leu Arg Val Val Thr His Leu Gly Pro Thr 210 215
220Leu Gly Asp Glu Pro Ala Asn Val Thr Leu Lys Ala Gln Ala His
Asn225 230 235 240Leu Glu Asn Tyr Thr Val Thr Ile Thr Ala Thr Gly
His Ile Ala Leu 245 250 255Asp Pro His Ser Asp Asn Gln Gln His His
His His His His Pro Ile 260 265 270Thr Trp His Arg Thr Tyr Thr Leu
Pro Pro Leu Ser Thr Thr Glu Leu 275 280 285Ala Leu Ser Ala Thr Ile
Ala Asp Pro Gln Ile Trp Trp Pro Arg Gln 290 295 300Trp Gly Gly Gln
Pro Leu Tyr Thr Ala Ala Leu Thr Val Thr Thr Arg305 310 315 320Gly
Thr Thr Thr Pro Ser Asp Arg Ser Thr Ala Thr Phe Gly Leu Arg 325 330
335Thr Val Glu Ala Arg Leu Asn Pro Thr Tyr Asn Asp Thr Thr Phe His
340 345 350Val Asn Ser Arg Pro Phe Gln Val Leu Gly Ala Gly Tyr Ser
Pro Asn 355 360 365Val Phe Leu Arg Ala Pro Pro Arg Thr Ala Ala Ala
Asp Tyr Thr Ala 370 375 380Thr Leu Gln Tyr Val Leu Asp Leu Gly Leu
Asn Thr Ile Arg Leu Glu385 390 395 400Gly Lys Asp Glu His Ser Glu
Leu Tyr Ala Ala Ala Asp Arg Leu Gly 405 410 415Val Met Ile Leu Ala
Gly Trp Glu Cys Cys Asp Lys Trp Glu Ala Trp 420 425 430Ser Tyr Asn
Gly Asp Leu Ala Ile Asp Pro Val Pro Val Trp Asp Asp 435 440 445Ala
Asp Tyr Ala Val Ala Arg Ala Ala Met Ala His Glu Ala Arg Met 450 455
460Leu Gln Pro His Pro Ser Leu Leu Gly Phe Leu Ile Gly Ser Asp
Phe465 470 475 480Trp Pro Asp Asp Arg Ala Thr Ala Ala Tyr Leu Asp
Ala Leu Glu Ala 485 490 495Ala Asp Trp Ala Ala Pro Val Leu Ala Ser
Ala Ser Lys Arg Gly Tyr 500 505 510Pro Glu Arg Thr Gly Pro Gly Gly
Leu Lys Met Asp Gly Pro Tyr Asp 515 520 525Trp Val Pro Pro Gly Tyr
Trp Trp Asp Thr Glu Pro Thr Gly Glu Arg 530 535 540Leu Gly Ala Ala
Phe Gly Phe Gly Ser Glu Leu Gly Ala Gly Val Gly545 550 555 560Thr
Pro Glu Leu Gly Ser Leu Arg Arg Phe Leu Ser Glu Gly Asp Leu 565 570
575Glu Ala Leu Trp Lys Arg Pro Asn Val Ser Leu Phe His Met Ser Arg
580 585 590Glu Thr Ser Gln Phe Thr Thr Arg Val Ile Tyr Asn Ala Gly
Leu Trp 595 600 605Asn Arg Trp Gly Ala Pro Ala Ser Leu Glu Asp Tyr
Leu Met Lys Ala 610 615 620Gln Leu Met Asp Tyr Glu Ala Thr Arg Ala
Gln Phe Asp Ala Tyr Thr625 630 635 640Ala Met Trp Asn Ala Glu Arg
Pro Ala Thr Gly Leu Ile Tyr Trp Met 645 650 655Leu Asn Asn Ala Trp
Pro Ser Leu His Trp Asn Leu Trp Asp Tyr Tyr 660 665 670Met Arg Pro
Ala Gly Ser Tyr Phe Gly Ala Lys Ala Gly Ser Arg Val 675 680 685Glu
Asn Val Val Phe Asp Tyr Val Arg Arg Ala Val Trp Leu Val Asn 690 695
700Arg Ser Leu Asp Lys Ser Gly Thr Arg Arg Val Gln Val Asp Val
Met705 710 715 720Asp Lys His Gly Lys Val Leu Tyr Lys Asp Thr Thr
Val Thr Thr Thr 725 730 735Val Pro Asn Thr Ser Lys Asp Ile Leu Ser
Leu Ala Gly Pro Leu Gly 740 745 750Asn Ile Thr Asp Val Val Phe Leu
Arg Leu Val Leu Tyr Asp Ala Gln 755 760 765Arg Gln Gly His Ala Leu
Ser Arg Asn Val Tyr Trp Val Ala Lys Glu 770 775 780Pro Asp Val Leu
Asp Trp Asp Glu Ser Asp Trp Tyr Phe Thr Pro Val785 790 795 800Thr
Ser Tyr Ser Asp Tyr Thr Ala Leu Asn Asn Met Ala Gln Ala Asp 805 810
815Val Thr Val His Asn Val Trp His Gly Ser Gly Val Asp Met Thr Leu
820 825 830Glu Asn Lys Ser Lys Val Pro Ala Phe Phe Val Ser Leu Thr
Leu Val 835 840 845Asp Val His Gly Asn Glu Val Leu Pro Val Thr Trp
Glu Asp Asn Tyr 850 855 860Val Thr Leu Trp Pro Gly Glu Lys Leu Thr
Leu Arg Gly Arg Ser Met865 870 875 880Gly Val Asp Ala Lys Pro Trp
Glu Val Ile Val Arg Gly Lys Asn Val 885 890 895Glu Ala Lys Arg Val
Arg His Arg Trp His Val 900 905602721DNAChrysosporium lucknowense
60atgcgggcga aaagggcggc cattgccctt ttcctagctt ggccggccat aacacgggcc
60gcgtccctcg tctcttttcc gggaaacacg gccgcgattc catcgtggga actgcagtct
120tctgtggagg ccggcactga cctcgaaggt ctttcacaag ttggggtcga
tacaaagtca 180tggcaccaca tcaaaacatg gagatgtacg ctcatgggct
gcctgatcga ggccggcgtc 240tacaatgaag atcaactctt ctactcggag
aacctgcgca aagtcaacga gacgcaattc 300ctggtgccat ggatttatcg
caacgagttc tccctgcatc ccggcccggg caggcacttc 360ttcctgcaga
cacacggcat ctcttcgcga gccgacatct ttctgaacgg caaacgagtg
420gccaattctt cggagcaggc gggttcgtac gttggccgga tttacgacat
caccaaacag 480gttcacagag aaaatgcgct ggcaattcag gtgtacccga
ccaactacta ctacgacctg 540gcgctcggat gggtggattg gaacccatgg
ccggccgaca acggcaccgg cgtctggcgg 600gacgtcgaga tcaagcagac
cggggcggtc atgctggagc cgttgcgggt tgtgacgcac 660ttggggccga
cgctcggaga tgagccagca aacgtgaccc tcaaggccca agcacataac
720cttgaaaact acaccgtcac catcacggcc accggccaca tcgccctgga
cccccactct 780gataaccaac aacaccatca ccaccaccac cctattacct
ggcaccgaac gtacacgctc 840ccaccccttt ccaccaccga gctcgccctc
tccgcgacca ttgccgaccc ccagatctgg 900tggccccgcc aatggggtgg
tcaaccgctc tacaccgccg ccttgaccgt caccacgcgc 960ggcaccacca
cgccgtccga ccggtccacg gccaccttcg gcctgcgcac cgtcgaggcg
1020aggctgaacc cgacctacaa cgacaccacc ttccacgtca actcgcgccc
cttccaggtc 1080ctgggcgccg gctactcccc caacgtcttc ctccgcgcgc
cgccacggac cgccgcggcc 1140gactacacgg ccacgctcca gtacgtactc
gacctgggcc tgaacacgat ccggctcgag 1200ggcaaggacg agcactcgga
actgtacgcg gcggcggacc ggctgggcgt catgatcctt 1260gccggctggg
agtgctgcga caagtgggag gcgtggagct acaacggcga cctggccatc
1320gacccggtgc cggtctggga cgacgccgac tacgccgtcg cgcgtgccgc
catggcccac 1380gaggcccgca tgctccagcc ccatcccagc ctgctgggct
tcctcatcgg cagcgacttc 1440tggcccgacg accgcgccac cgccgcctac
ctcgatgccc tggaagccgc cgactgggcc 1500gcccccgtgc tggcctcggc
ctcgaagcgc gggtaccccg agcggaccgg tccgggaggg 1560ctcaagatgg
acgggccgta cgactgggtg ccaccggggt actggtggga cacggagccg
1620accggggaga ggctcggagc cgcgttcggt tttgggtccg agctgggcgc
gggcgtgggc 1680acgccggagc tggggtcgct gaggcggttc ctgagcgagg
gggacctgga ggccctgtgg 1740aagcggccga acgtgagcct attccacatg
tcgagggaga cctcccaatt tacgacgagg 1800gtgatctaca acgccgggct
ctggaaccgg tggggcgccc ccgcgagcct ggaagactac 1860ctgatgaagg
cgcagctgat ggactacgag gccacgcggg ctcagttcga cgcctacacg
1920gccatgtgga acgccgagag gccggccacg ggcctgatat actggatgct
caacaacgcc 1980tggccgagcc tgcactggaa cctgtgggac tactacatgc
gtcctgccgg tagttacttt 2040ggggcaaagg ccggcagcag ggtggaaaat
gtcgtgtttg actacgtcag gagggcggtc 2100tggctggtca accggtccct
ggacaagagc gggacgagga gagtgcaggt ggacgtcatg 2160gataagcatg
gtaaagtcct atacaaggac acgaccgtca ccacgactgt tccaaacacg
2220agcaaggaca ttctttcact agcgggacct ctgggcaaca tcaccgacgt
ggtgtttttg 2280cggctggttc tctacgatgc acaacggcaa gggcatgcgc
tgagtcgaaa cgtgtactgg 2340gttgcgaagg agccagatgt gctggactgg
gacgagtcgg attggtactt cacgccggtg 2400accagctact cggactacac
tgcactaaac aacatggctc aggcagacgt cacggttcac 2460aatgtctggc
acggctcagg ggttgacatg acgctggaga acaaatctaa ggtgccagcg
2520ttctttgtca gcttgacgct ggtggatgta cacgggaacg aggtcctgcc
ggtgacgtgg 2580gaggacaact atgtcaccct atggccgggt gagaaattga
cacttcgggg aaggtcaatg 2640ggggtagatg caaagccgtg ggaggtcatt
gtgagaggaa agaacgttga agcaaagaga 2700gtccgtcacc ggtggcatgt a
2721611275DNAChrysosporium
lucknowenseCDS(180)..(668)CDS(776)..(1225) 61gtataattct acgtcatccc
ctccgttccc ttcccagctc ccctctccta ctgctgagtg 60atgttaataa tagtacgagt
cgtggaattt gtggatactg atccaggata aagtctcttc 120cattcggccc
gacctcctaa aactcccgac aaagctaaac cgccccgtgt gacgacaag 179atg aag
ctc ctg ggc aaa ctc tcg gcg gca ctc gcc ctc gcg ggc agc 227Met Lys
Leu Leu Gly Lys Leu Ser Ala Ala Leu Ala Leu Ala Gly Ser1 5 10 15agg
ctg gct gcc gcg cac ccg gtc ttc gac gag ctg atg cgg ccg acg 275Arg
Leu Ala Ala Ala His Pro Val Phe Asp Glu Leu Met Arg Pro Thr 20 25
30gcg ccg ctg gtg cgc ccg cgg gcg gcc ctg cag cag gtg acc aac ttt
323Ala Pro Leu Val Arg Pro Arg Ala Ala Leu Gln Gln Val Thr Asn Phe
35 40 45ggc agc aac ccg tcc aac acg aag atg ttc atc tac gtg ccc gac
aag 371Gly Ser Asn Pro Ser Asn Thr Lys Met Phe Ile Tyr Val Pro Asp
Lys 50 55 60ctg gcc ccc aac ccg ccc atc ata gtg gcc atc cac tac tgc
acc ggc 419Leu Ala Pro Asn Pro Pro Ile Ile Val Ala Ile His Tyr Cys
Thr Gly65 70 75 80acc gcc cag gcc tac tac tcg ggc tcc cct tac gcc
cgc ctc gcc gac 467Thr Ala Gln Ala Tyr Tyr Ser Gly Ser Pro Tyr Ala
Arg Leu Ala Asp 85 90 95cag aag ggc ttc atc gtc atc tac ccg gag tcc
ccc tac agc ggc acc 515Gln Lys Gly Phe Ile Val Ile Tyr Pro Glu Ser
Pro Tyr Ser Gly Thr 100 105 110tgt tgg gac gtc tcg tcg cgc gcc gcc
ctg acc cac aac ggc ggc ggc 563Cys Trp Asp Val Ser Ser Arg Ala Ala
Leu Thr His Asn Gly Gly Gly 115 120 125gac agc aac tcg atc gcc aac
atg gtc acc tac acc ctc gaa aag tac 611Asp Ser Asn Ser Ile Ala Asn
Met Val Thr Tyr Thr Leu Glu Lys Tyr 130 135 140aat ggc gac gcc agc
aag gtc ttt gtc acc ggc tcc tcg tcc ggc gcc 659Asn Gly Asp Ala Ser
Lys Val Phe Val Thr Gly Ser Ser Ser Gly Ala145 150 155 160atg atg
acg gtgcgttcta attacccctc ccactccttt tttttttttt 708Met Met
Thrtttgttttcc ttccttttct ttttcccagg agcgtcttgg ttgattgact
gacttgttca 768aatccag aac gtg atg gcc gcc gcg tac ccg gaa ctg ttc
gcg gca gga 817 Asn Val Met Ala Ala Ala Tyr Pro Glu Leu Phe Ala Ala
Gly 165 170 175atc gcc tac tcg ggc gtg ccc gcc ggc tgc ttc tac agc
cag tcc gga 865Ile Ala Tyr Ser Gly Val Pro Ala Gly Cys Phe Tyr Ser
Gln Ser Gly 180 185 190ggc acc aac gcg agg aac agc tcg tgc gcc aac
ggg cag atc aac tcg 913Gly Thr Asn Ala Arg Asn Ser Ser Cys Ala Asn
Gly Gln Ile Asn Ser 195 200 205acg ccc cag gtg tgg
gcc aag atg gtc ttc gac atg tac ccg gaa tac 961Thr Pro Gln Val Trp
Ala Lys Met Val Phe Asp Met Tyr Pro Glu Tyr210 215 220 225gac ggc
ccg cgc ccc aag atg cag atc tac cac ggc tcg gcc gac ggc 1009Asp Gly
Pro Arg Pro Lys Met Gln Ile Tyr His Gly Ser Ala Asp Gly 230 235
240acg ctc aga ccc agc aac tac aac gag acc atc aag cag tgg tgc ggc
1057Thr Leu Arg Pro Ser Asn Tyr Asn Glu Thr Ile Lys Gln Trp Cys Gly
245 250 255gtc ttc ggc ttc gac tac acc cgc ccc gac acc acc cag gcc
aac tcc 1105Val Phe Gly Phe Asp Tyr Thr Arg Pro Asp Thr Thr Gln Ala
Asn Ser 260 265 270ccg cag gcc ggc tac acc acc tac acc tgg ggc gag
cag cag ctc gtc 1153Pro Gln Ala Gly Tyr Thr Thr Tyr Thr Trp Gly Glu
Gln Gln Leu Val 275 280 285ggc atc tac gcc cag ggc gtc gga cac acg
gtc ccc atc cgc ggc agc 1201Gly Ile Tyr Ala Gln Gly Val Gly His Thr
Val Pro Ile Arg Gly Ser290 295 300 305gac gac atg gcc ttc ttt ggc
ctg tgatgatgag aggttcggtg ggctgggaac 1255Asp Asp Met Ala Phe Phe
Gly Leu 310cagagcggaa gtacggttta 127562313PRTChrysosporium
lucknowense 62Met Lys Leu Leu Gly Lys Leu Ser Ala Ala Leu Ala Leu
Ala Gly Ser1 5 10 15Arg Leu Ala Ala Ala His Pro Val Phe Asp Glu Leu
Met Arg Pro Thr 20 25 30Ala Pro Leu Val Arg Pro Arg Ala Ala Leu Gln
Gln Val Thr Asn Phe 35 40 45Gly Ser Asn Pro Ser Asn Thr Lys Met Phe
Ile Tyr Val Pro Asp Lys 50 55 60Leu Ala Pro Asn Pro Pro Ile Ile Val
Ala Ile His Tyr Cys Thr Gly65 70 75 80Thr Ala Gln Ala Tyr Tyr Ser
Gly Ser Pro Tyr Ala Arg Leu Ala Asp 85 90 95Gln Lys Gly Phe Ile Val
Ile Tyr Pro Glu Ser Pro Tyr Ser Gly Thr 100 105 110Cys Trp Asp Val
Ser Ser Arg Ala Ala Leu Thr His Asn Gly Gly Gly 115 120 125Asp Ser
Asn Ser Ile Ala Asn Met Val Thr Tyr Thr Leu Glu Lys Tyr 130 135
140Asn Gly Asp Ala Ser Lys Val Phe Val Thr Gly Ser Ser Ser Gly
Ala145 150 155 160Met Met Thr Asn Val Met Ala Ala Ala Tyr Pro Glu
Leu Phe Ala Ala 165 170 175Gly Ile Ala Tyr Ser Gly Val Pro Ala Gly
Cys Phe Tyr Ser Gln Ser 180 185 190Gly Gly Thr Asn Ala Arg Asn Ser
Ser Cys Ala Asn Gly Gln Ile Asn 195 200 205Ser Thr Pro Gln Val Trp
Ala Lys Met Val Phe Asp Met Tyr Pro Glu 210 215 220Tyr Asp Gly Pro
Arg Pro Lys Met Gln Ile Tyr His Gly Ser Ala Asp225 230 235 240Gly
Thr Leu Arg Pro Ser Asn Tyr Asn Glu Thr Ile Lys Gln Trp Cys 245 250
255Gly Val Phe Gly Phe Asp Tyr Thr Arg Pro Asp Thr Thr Gln Ala Asn
260 265 270Ser Pro Gln Ala Gly Tyr Thr Thr Tyr Thr Trp Gly Glu Gln
Gln Leu 275 280 285Val Gly Ile Tyr Ala Gln Gly Val Gly His Thr Val
Pro Ile Arg Gly 290 295 300Ser Asp Asp Met Ala Phe Phe Gly Leu305
31063939DNAChrysosporium lucknowense 63atgaagctcc tgggcaaact
ctcggcggca ctcgccctcg cgggcagcag gctggctgcc 60gcgcacccgg tcttcgacga
gctgatgcgg ccgacggcgc cgctggtgcg cccgcgggcg 120gccctgcagc
aggtgaccaa ctttggcagc aacccgtcca acacgaagat gttcatctac
180gtgcccgaca agctggcccc caacccgccc atcatagtgg ccatccacta
ctgcaccggc 240accgcccagg cctactactc gggctcccct tacgcccgcc
tcgccgacca gaagggcttc 300atcgtcatct acccggagtc cccctacagc
ggcacctgtt gggacgtctc gtcgcgcgcc 360gccctgaccc acaacggcgg
cggcgacagc aactcgatcg ccaacatggt cacctacacc 420ctcgaaaagt
acaatggcga cgccagcaag gtctttgtca ccggctcctc gtccggcgcc
480atgatgacga acgtgatggc cgccgcgtac ccggaactgt tcgcggcagg
aatcgcctac 540tcgggcgtgc ccgccggctg cttctacagc cagtccggag
gcaccaacgc gaggaacagc 600tcgtgcgcca acgggcagat caactcgacg
ccccaggtgt gggccaagat ggtcttcgac 660atgtacccgg aatacgacgg
cccgcgcccc aagatgcaga tctaccacgg ctcggccgac 720ggcacgctca
gacccagcaa ctacaacgag accatcaagc agtggtgcgg cgtcttcggc
780ttcgactaca cccgccccga caccacccag gccaactccc cgcaggccgg
ctacaccacc 840tacacctggg gcgagcagca gctcgtcggc atctacgccc
agggcgtcgg acacacggtc 900cccatccgcg gcagcgacga catggccttc tttggcctg
939641182DNAChrysosporium
lucknowenseCDS(162)..(621)CDS(793)..(1169) 64atataaataa gccgggaggg
catcaggaca tgacctctac ctctctgtaa tcagacgctg 60gtggcttcct cccacatatc
ggggttaata tttattagct cgtcattgtc tctgcgcgcc 120ttgccgcaaa
acatcacccg tatattgcgt atcagcaaac a atg atc tcg gtt cct 176 Met Ile
Ser Val Pro 1 5gct ctc gct ctg gcc ctt ctg gcc gcc gtc cag gtc gtc
gag tct gcc 224Ala Leu Ala Leu Ala Leu Leu Ala Ala Val Gln Val Val
Glu Ser Ala 10 15 20tcg gct ggc tgt ggc aag gcg ccc cct tcc tcg ggc
acc aag tcg atg 272Ser Ala Gly Cys Gly Lys Ala Pro Pro Ser Ser Gly
Thr Lys Ser Met 25 30 35acg gtc aac ggc aag cag cgc cag tac att ctc
cag ctg ccc aac aac 320Thr Val Asn Gly Lys Gln Arg Gln Tyr Ile Leu
Gln Leu Pro Asn Asn 40 45 50tac gac gcc aac aag gcc cac agg gtg gtg
atc ggg tac cac tgg cgc 368Tyr Asp Ala Asn Lys Ala His Arg Val Val
Ile Gly Tyr His Trp Arg 55 60 65gac gga tcc atg aac gac gtg gcc aac
ggc ggc ttc tac gat ctg cgg 416Asp Gly Ser Met Asn Asp Val Ala Asn
Gly Gly Phe Tyr Asp Leu Arg70 75 80 85tcc cgg gcg ggc gac agc acc
atc ttc gtt gcc ccc aac ggc ctc aat 464Ser Arg Ala Gly Asp Ser Thr
Ile Phe Val Ala Pro Asn Gly Leu Asn 90 95 100gcc gga tgg gcc aac
gtg ggc ggc gag gac atc acc ttt acg gac cag 512Ala Gly Trp Ala Asn
Val Gly Gly Glu Asp Ile Thr Phe Thr Asp Gln 105 110 115atc gta gac
atg ctc aag aac gac ctc tgc gtg gac gag acc cag ttc 560Ile Val Asp
Met Leu Lys Asn Asp Leu Cys Val Asp Glu Thr Gln Phe 120 125 130ttt
gct acg ggc tgg agc tat ggc ggt gcc atg agc cat agc gtg gct 608Phe
Ala Thr Gly Trp Ser Tyr Gly Gly Ala Met Ser His Ser Val Ala 135 140
145tgt tct cgg cca g gtaatattga ttttgtcatc ctgtcttttt tcttcttctt
661Cys Ser Arg Pro150cttctctttc ttccctctgg cattcttctc ttctcttctc
ttctcttctc ctcttttatt 721tccccgtcat ttccccccac aaagaaacac
ctggacaagc caatgacatc cattaatata 781tataaatgca g ac gtc ttc aag gcc
gtc gcg gtc atc gcc ggg gcc cag 830 Asp Val Phe Lys Ala Val Ala Val
Ile Ala Gly Ala Gln 155 160 165ctg tcc ggc tgc gcc ggc ggc acg acg
ccc gtg gcg tac cta ggc atc 878Leu Ser Gly Cys Ala Gly Gly Thr Thr
Pro Val Ala Tyr Leu Gly Ile 170 175 180cac gga gcc gcc gac aac gtc
ctg ccc atc gac ctc ggc cgc cag ctg 926His Gly Ala Ala Asp Asn Val
Leu Pro Ile Asp Leu Gly Arg Gln Leu 185 190 195cgc gac aag tgg ctg
cag acc aac ggc tgc aac tac cag ggc gcc cag 974Arg Asp Lys Trp Leu
Gln Thr Asn Gly Cys Asn Tyr Gln Gly Ala Gln 200 205 210gac ccc gcg
ccg ggc cag cag gcc cac atc aag acc acc tac agc tgc 1022Asp Pro Ala
Pro Gly Gln Gln Ala His Ile Lys Thr Thr Tyr Ser Cys215 220 225
230tcc cgc gcg ccc gtc acc tgg atc ggc cac ggg ggc ggc cac gtc ccc
1070Ser Arg Ala Pro Val Thr Trp Ile Gly His Gly Gly Gly His Val Pro
235 240 245gac ccc acg ggc aac aac ggc gtc aag ttt gcg ccc cag gag
acc tgg 1118Asp Pro Thr Gly Asn Asn Gly Val Lys Phe Ala Pro Gln Glu
Thr Trp 250 255 260gac ttc ttt gat gcc gcc gtc gga gcg gcc ggc gcg
cag agc ccg atg 1166Asp Phe Phe Asp Ala Ala Val Gly Ala Ala Gly Ala
Gln Ser Pro Met 265 270 275aca taagcccgaa gga 1182Thr
65279PRTChrysosporium lucknowense 65Met Ile Ser Val Pro Ala Leu Ala
Leu Ala Leu Leu Ala Ala Val Gln1 5 10 15Val Val Glu Ser Ala Ser Ala
Gly Cys Gly Lys Ala Pro Pro Ser Ser 20 25 30Gly Thr Lys Ser Met Thr
Val Asn Gly Lys Gln Arg Gln Tyr Ile Leu 35 40 45Gln Leu Pro Asn Asn
Tyr Asp Ala Asn Lys Ala His Arg Val Val Ile 50 55 60Gly Tyr His Trp
Arg Asp Gly Ser Met Asn Asp Val Ala Asn Gly Gly65 70 75 80Phe Tyr
Asp Leu Arg Ser Arg Ala Gly Asp Ser Thr Ile Phe Val Ala 85 90 95Pro
Asn Gly Leu Asn Ala Gly Trp Ala Asn Val Gly Gly Glu Asp Ile 100 105
110Thr Phe Thr Asp Gln Ile Val Asp Met Leu Lys Asn Asp Leu Cys Val
115 120 125Asp Glu Thr Gln Phe Phe Ala Thr Gly Trp Ser Tyr Gly Gly
Ala Met 130 135 140Ser His Ser Val Ala Cys Ser Arg Pro Asp Val Phe
Lys Ala Val Ala145 150 155 160Val Ile Ala Gly Ala Gln Leu Ser Gly
Cys Ala Gly Gly Thr Thr Pro 165 170 175Val Ala Tyr Leu Gly Ile His
Gly Ala Ala Asp Asn Val Leu Pro Ile 180 185 190Asp Leu Gly Arg Gln
Leu Arg Asp Lys Trp Leu Gln Thr Asn Gly Cys 195 200 205Asn Tyr Gln
Gly Ala Gln Asp Pro Ala Pro Gly Gln Gln Ala His Ile 210 215 220Lys
Thr Thr Tyr Ser Cys Ser Arg Ala Pro Val Thr Trp Ile Gly His225 230
235 240Gly Gly Gly His Val Pro Asp Pro Thr Gly Asn Asn Gly Val Lys
Phe 245 250 255Ala Pro Gln Glu Thr Trp Asp Phe Phe Asp Ala Ala Val
Gly Ala Ala 260 265 270Gly Ala Gln Ser Pro Met Thr
27566837DNAChrysosporium lucknowense 66atgatctcgg ttcctgctct
cgctctggcc cttctggccg ccgtccaggt cgtcgagtct 60gcctcggctg gctgtggcaa
ggcgccccct tcctcgggca ccaagtcgat gacggtcaac 120ggcaagcagc
gccagtacat tctccagctg cccaacaact acgacgccaa caaggcccac
180agggtggtga tcgggtacca ctggcgcgac ggatccatga acgacgtggc
caacggcggc 240ttctacgatc tgcggtcccg ggcgggcgac agcaccatct
tcgttgcccc caacggcctc 300aatgccggat gggccaacgt gggcggcgag
gacatcacct ttacggacca gatcgtagac 360atgctcaaga acgacctctg
cgtggacgag acccagttct ttgctacggg ctggagctat 420ggcggtgcca
tgagccatag cgtggcttgt tctcggccag acgtcttcaa ggccgtcgcg
480gtcatcgccg gggcccagct gtccggctgc gccggcggca cgacgcccgt
ggcgtaccta 540ggcatccacg gagccgccga caacgtcctg cccatcgacc
tcggccgcca gctgcgcgac 600aagtggctgc agaccaacgg ctgcaactac
cagggcgccc aggaccccgc gccgggccag 660caggcccaca tcaagaccac
ctacagctgc tcccgcgcgc ccgtcacctg gatcggccac 720gggggcggcc
acgtccccga ccccacgggc aacaacggcg tcaagtttgc gccccaggag
780acctgggact tctttgatgc cgccgtcgga gcggccggcg cgcagagccc gatgaca
837671537DNAChrysosporium lucknowenseCDS(394)..(1299) 67ccttccgagt
cccaatgccg agtaattaac ccaaacaaca accccagtca ctttagccca 60ggttttgcgg
cattcgttag agtaccgtta cagccaagcg ttgaaatatg ctagagaaac
120cggaggatgt tccccaggtt ggttctcagc ggagagagca attgcaaccc
tggaagaaaa 180cgtccgtggc acttcggcaa ctattcttat catatagcac
cgtcaatcct tcaccagcat 240gccaaaggac tgtataatat ctttgacata
aacttgggcc accattcttg gaacacgagg 300atgggctgaa ccctttgttt
ccctcttgcc caccatttaa ttacgttcta caaccaatct 360ttggcaacat
agcaaggacc aagagaagca aag atg tcc cgg tac ctt ctc ggt 414 Met Ser
Arg Tyr Leu Leu Gly 1 5cga cta ata gtg gcc aca gtg gcc atc acg gtg
gcc tct ctc cgc ggc 462Arg Leu Ile Val Ala Thr Val Ala Ile Thr Val
Ala Ser Leu Arg Gly 10 15 20gtg acg gcc gcc acg ccg tcg ccg gga tgc
ggg aag acg ccg acg ctg 510Val Thr Ala Ala Thr Pro Ser Pro Gly Cys
Gly Lys Thr Pro Thr Leu 25 30 35atc acg gac ggg tcc gcc acg act ccc
ctg acg ctg acc tcc aac ggc 558Ile Thr Asp Gly Ser Ala Thr Thr Pro
Leu Thr Leu Thr Ser Asn Gly40 45 50 55aag acg cgg cgg ttc tac gtc
aag ctt ccc gac gac tac gac aac agc 606Lys Thr Arg Arg Phe Tyr Val
Lys Leu Pro Asp Asp Tyr Asp Asn Ser 60 65 70cac ccg tac cgg ctc atc
ttc gcg ctg cac gcc ctc ggc ggg acc gcc 654His Pro Tyr Arg Leu Ile
Phe Ala Leu His Ala Leu Gly Gly Thr Ala 75 80 85cag cag gtc acc acc
ggg acg ggc ggc tac ctg ccg tgg tac ggg atc 702Gln Gln Val Thr Thr
Gly Thr Gly Gly Tyr Leu Pro Trp Tyr Gly Ile 90 95 100ccc gac ctc
gcg gcc aac gac acg gtg ggc gcc gtc tac gtg gcg ccc 750Pro Asp Leu
Ala Ala Asn Asp Thr Val Gly Ala Val Tyr Val Ala Pro 105 110 115gac
ggg ctc aac aac ggc tgg gcc aac cag ggg ggc gag gac gtg gcc 798Asp
Gly Leu Asn Asn Gly Trp Ala Asn Gln Gly Gly Glu Asp Val Ala120 125
130 135ttc ctg gag gcc gtg atg gag acg gtg gag cag gac gtg tgc gtc
gac 846Phe Leu Glu Ala Val Met Glu Thr Val Glu Gln Asp Val Cys Val
Asp 140 145 150cgg gac ctg cgc ttc tcg acc ggc ttc agc tac ggc gcc
gcc atg tcg 894Arg Asp Leu Arg Phe Ser Thr Gly Phe Ser Tyr Gly Ala
Ala Met Ser 155 160 165tac acg ctc gcg tgc gcg ctc ggc cgc cgc atc
cgg gcc gtc gcc gtg 942Tyr Thr Leu Ala Cys Ala Leu Gly Arg Arg Ile
Arg Ala Val Ala Val 170 175 180ctc tcg ggc agc ccc gtc atc agc ggc
ggc tgc gcc ggc gcc ggc tcc 990Leu Ser Gly Ser Pro Val Ile Ser Gly
Gly Cys Ala Gly Ala Gly Ser 185 190 195ggc gcc tcc gag ccc gtc gcc
tac tac ggc cag cac ggc atg tcc gac 1038Gly Ala Ser Glu Pro Val Ala
Tyr Tyr Gly Gln His Gly Met Ser Asp200 205 210 215ccg gtc ctc ccc
gtc gcc ggc ggc cgc gag atg cgc gac cac ttc gtc 1086Pro Val Leu Pro
Val Ala Gly Gly Arg Glu Met Arg Asp His Phe Val 220 225 230cgc acc
aac ggc tgc gac gcc ggc cgg ggc ccc ccc cgg gag ccc gcc 1134Arg Thr
Asn Gly Cys Asp Ala Gly Arg Gly Pro Pro Arg Glu Pro Ala 235 240
245cgc ggc agc ggc acc cac gtc aag acg gtc tac gac ggc tgc gac ccg
1182Arg Gly Ser Gly Thr His Val Lys Thr Val Tyr Asp Gly Cys Asp Pro
250 255 260gac tac ccc gtc gtg tgg aac gcg ttc gac ggc gac cac acc
ccg cag 1230Asp Tyr Pro Val Val Trp Asn Ala Phe Asp Gly Asp His Thr
Pro Gln 265 270 275ccc gtc gac cgg ggc gcc acc acc act ttc tcg gcc
gtc gag acc tgg 1278Pro Val Asp Arg Gly Ala Thr Thr Thr Phe Ser Ala
Val Glu Thr Trp280 285 290 295gag ttc ttc tcg cag ttc aag
taattaagcc cgagaggaat cccctgagca 1329Glu Phe Phe Ser Gln Phe Lys
300ttcccttttg gcacaatatg tgaacccgtc tcgcaacttc ggttcggttt
ccgatgatgt 1389taggtatttt acgttccgtt catgaccgct gccctgatat
tcttgatcgt gtttcaatct 1449ggacaagtaa agaacattcg acctttgtgc
ctgactgact tcggggccga tgtgccgcgc 1509ctgatcacca ggttgtaagt aattaatt
153768302PRTChrysosporium lucknowense 68Met Ser Arg Tyr Leu Leu Gly
Arg Leu Ile Val Ala Thr Val Ala Ile1 5 10 15Thr Val Ala Ser Leu Arg
Gly Val Thr Ala Ala Thr Pro Ser Pro Gly 20 25 30Cys Gly Lys Thr Pro
Thr Leu Ile Thr Asp Gly Ser Ala Thr Thr Pro 35 40 45Leu Thr Leu Thr
Ser Asn Gly Lys Thr Arg Arg Phe Tyr Val Lys Leu 50 55 60Pro Asp Asp
Tyr Asp Asn Ser His Pro Tyr Arg Leu Ile Phe Ala Leu65 70 75 80His
Ala Leu Gly Gly Thr Ala Gln Gln Val Thr Thr Gly Thr Gly Gly 85 90
95Tyr Leu Pro Trp Tyr Gly Ile Pro Asp Leu Ala Ala Asn Asp Thr Val
100 105 110Gly Ala Val Tyr Val Ala Pro Asp Gly Leu Asn Asn Gly Trp
Ala Asn 115 120 125Gln Gly Gly Glu Asp Val Ala Phe Leu Glu Ala Val
Met Glu Thr Val 130 135 140Glu Gln Asp Val Cys Val Asp Arg Asp Leu
Arg Phe Ser Thr Gly Phe145 150 155 160Ser Tyr Gly Ala Ala Met Ser
Tyr Thr Leu Ala Cys Ala Leu Gly Arg 165 170 175Arg Ile Arg Ala Val
Ala Val Leu Ser Gly Ser Pro Val Ile Ser Gly 180 185 190Gly Cys Ala
Gly Ala Gly Ser Gly Ala Ser Glu Pro Val Ala Tyr Tyr 195 200 205Gly
Gln His Gly Met Ser Asp Pro Val Leu Pro Val Ala Gly Gly Arg 210 215
220Glu Met Arg Asp His Phe Val Arg Thr Asn Gly Cys Asp Ala Gly
Arg225 230 235 240Gly Pro Pro Arg
Glu Pro Ala Arg Gly Ser Gly Thr His Val Lys Thr 245 250 255Val Tyr
Asp Gly Cys Asp Pro Asp Tyr Pro Val Val Trp Asn Ala Phe 260 265
270Asp Gly Asp His Thr Pro Gln Pro Val Asp Arg Gly Ala Thr Thr Thr
275 280 285Phe Ser Ala Val Glu Thr Trp Glu Phe Phe Ser Gln Phe Lys
290 295 30069906DNAChrysosporium lucknowense 69atgtcccggt
accttctcgg tcgactaata gtggccacag tggccatcac ggtggcctct 60ctccgcggcg
tgacggccgc cacgccgtcg ccgggatgcg ggaagacgcc gacgctgatc
120acggacgggt ccgccacgac tcccctgacg ctgacctcca acggcaagac
gcggcggttc 180tacgtcaagc ttcccgacga ctacgacaac agccacccgt
accggctcat cttcgcgctg 240cacgccctcg gcgggaccgc ccagcaggtc
accaccggga cgggcggcta cctgccgtgg 300tacgggatcc ccgacctcgc
ggccaacgac acggtgggcg ccgtctacgt ggcgcccgac 360gggctcaaca
acggctgggc caaccagggg ggcgaggacg tggccttcct ggaggccgtg
420atggagacgg tggagcagga cgtgtgcgtc gaccgggacc tgcgcttctc
gaccggcttc 480agctacggcg ccgccatgtc gtacacgctc gcgtgcgcgc
tcggccgccg catccgggcc 540gtcgccgtgc tctcgggcag ccccgtcatc
agcggcggct gcgccggcgc cggctccggc 600gcctccgagc ccgtcgccta
ctacggccag cacggcatgt ccgacccggt cctccccgtc 660gccggcggcc
gcgagatgcg cgaccacttc gtccgcacca acggctgcga cgccggccgg
720ggcccccccc gggagcccgc ccgcggcagc ggcacccacg tcaagacggt
ctacgacggc 780tgcgacccgg actaccccgt cgtgtggaac gcgttcgacg
gcgaccacac cccgcagccc 840gtcgaccggg gcgccaccac cactttctcg
gccgtcgaga cctgggagtt cttctcgcag 900ttcaag
906701919DNAChrysosporium
lucknowenseCDS(279)..(340)CDS(541)..(718)CDS(802)..(915)CDS(1028)..(1630)
70agcactgttg aagggctgtt tggaagtttc ttattgtgag agggggctcg agtcacggca
60accaagttcg gcgcctggtc atctctggag ctgaccacgc ctttggccgt cgtcgattcc
120gcgtctcaga gccgagttgg cgggccccgt gcaggttaaa ttgtctgcag
cgcccggcta 180cctggaactc cgagcgatta ccagtttcgg ctcggagcac
gaccttcgcc aaccccagat 240cggccacttc aagggaccga aaccgttgcc gtggcccg
atg ccg tgc aag tcg agc 296 Met Pro Cys Lys Ser Ser 1 5ctc ctc ctc
ctc ctc ctc tcc tcc ttc ttc tcc tcc gag gcc aa 340Leu Leu Leu Leu
Leu Leu Ser Ser Phe Phe Ser Ser Glu Ala Asn 10 15 20gtccgtgaat
acaccaacga cgccgttacg ttgctgatac ttaagtaagg aagggcgcgg
400cgctcgctgc tgctgatgtg aagacgagga gcagacttga acattggtaa
agcagagcac 460tgtttgattg cgctcgtgca ctcgccacca tcatccaatt
aacataccta gcccataatt 520atcggacact ggcgttcgag c acc gcc cac cgc
cca cca tgc gtc tct gcg 571 Thr Ala His Arg Pro Pro Cys Val Ser Ala
25 30atc gct cac caa gat ggc cgc cat ggc cgg cct gcg cgg ggc gcg
tca 619Ile Ala His Gln Asp Gly Arg His Gly Arg Pro Ala Arg Gly Ala
Ser 35 40 45ctg cag cta gtg acc aac ttt ggc gac aat ccg acc ggt ctg
cag aag 667Leu Gln Leu Val Thr Asn Phe Gly Asp Asn Pro Thr Gly Leu
Gln Lys 50 55 60tat gtg tac gtt ccg gac aaa gtt gcg gtc tcg ccc gcc
atc atc gtt 715Tyr Val Tyr Val Pro Asp Lys Val Ala Val Ser Pro Ala
Ile Ile Val 65 70 75gct gtaagtcaac agagcttgat taacattatt atgtgtttga
gttggattga 768Ala80gtggtgatga ccgctaactt tgttgtcgat cag ctg cat ccc
tgt gga gga tcg 822 Leu His Pro Cys Gly Gly Ser 85gct cag gga tgg
tac tcg cag aca cgg ctc ccg tcg tac gcg gac cag 870Ala Gln Gly Trp
Tyr Ser Gln Thr Arg Leu Pro Ser Tyr Ala Asp Gln 90 95 100ctt ggg
ttc atc ctc atc tac gcc ggc acg acc aag atg agc aac 915Leu Gly Phe
Ile Leu Ile Tyr Ala Gly Thr Thr Lys Met Ser Asn 105 110
115gtacgcatag tttttttttt tttttgcttt tttttgcttg ctgtattaca
tagagagtta 975tacccttact aaccataatt attttcttcc tcgatatttt
ctttgccaac ag tgc tgg 1033Cys Trp 120gac gtc cag aac ccg gcc agc
ctg aca cac ggc ggg ggc gac gcc ggg 1081Asp Val Gln Asn Pro Ala Ser
Leu Thr His Gly Gly Gly Asp Ala Gly 125 130 135ggc atc gtc agc atg
gtc aag tac gcg ctg aag cag tac aac ggc gac 1129Gly Ile Val Ser Met
Val Lys Tyr Ala Leu Lys Gln Tyr Asn Gly Asp 140 145 150gcg tcg agg
gtg tac gtc atg ggg ggc tcg tcg ggg gcc atg atg acc 1177Ala Ser Arg
Val Tyr Val Met Gly Gly Ser Ser Gly Ala Met Met Thr 155 160 165aac
gtg ctg gcg ggc tcg tac ccg gac gtg ttc gag gcc ggc gcg gcc 1225Asn
Val Leu Ala Gly Ser Tyr Pro Asp Val Phe Glu Ala Gly Ala Ala 170 175
180ttc tcc ggg gtg gcg cac gcg tgc ttc ctc ggg gcc gac tcg gcg acg
1273Phe Ser Gly Val Ala His Ala Cys Phe Leu Gly Ala Asp Ser Ala
Thr185 190 195 200ccc ttc tcg cct aac cag acg tgc gcg cag ggc cgg
atc cag cgg tcg 1321Pro Phe Ser Pro Asn Gln Thr Cys Ala Gln Gly Arg
Ile Gln Arg Ser 205 210 215gcg cgc gag tgg ggg gac ctg gtg cgc aac
tcg ttc ccg gcc tac gac 1369Ala Arg Glu Trp Gly Asp Leu Val Arg Asn
Ser Phe Pro Ala Tyr Asp 220 225 230ggc cgc cgc ccg cgc atg cag atc
ttc cac ggc aac gcc gac ttc ctc 1417Gly Arg Arg Pro Arg Met Gln Ile
Phe His Gly Asn Ala Asp Phe Leu 235 240 245gtc cat ccc gag tgc gcc
cac cag gcc ctc gcc cag tgg gcc gac gtc 1465Val His Pro Glu Cys Ala
His Gln Ala Leu Ala Gln Trp Ala Asp Val 250 255 260ctc ggc ctc cag
ctg acc cag acc aac aag ggc gtg ccc tcg gcc gag 1513Leu Gly Leu Gln
Leu Thr Gln Thr Asn Lys Gly Val Pro Ser Ala Glu265 270 275 280tac
acc cag gag gtc tac ggc gac ggc acc cag ctg cag ggc ttc ttt 1561Tyr
Thr Gln Glu Val Tyr Gly Asp Gly Thr Gln Leu Gln Gly Phe Phe 285 290
295ggc gac ggc gtc ggc cac atc gcc ccg gtc aac gag ccc gtc atg ctc
1609Gly Asp Gly Val Gly His Ile Ala Pro Val Asn Glu Pro Val Met Leu
300 305 310cgc ttc ttt ggc ctg atg aac tagctcgtac cggatgtact
gtaccgtaaa 1660Arg Phe Phe Gly Leu Met Asn 315tgatccatca tttaatagcg
aatagtacat agttacatac cagtatgtaa ctgacatgca 1720tgtcaagatc
ccgacattgt gctccgagtc aaccttggat gggaggtgat gacagcctgg
1780aggacgaaag ttgatttgac cgtagcatgc tgtctgtgct cttttgaggt
ctgaacgcct 1840cattattacc gcgaacggtg cattgcatta tggggggtcg
ataaacttcc aaggggtata 1900gttatacgcg cgtagaaat
191971319PRTChrysosporium lucknowense 71Met Pro Cys Lys Ser Ser Leu
Leu Leu Leu Leu Leu Ser Ser Phe Phe1 5 10 15Ser Ser Glu Ala Asn Thr
Ala His Arg Pro Pro Cys Val Ser Ala Ile 20 25 30Ala His Gln Asp Gly
Arg His Gly Arg Pro Ala Arg Gly Ala Ser Leu 35 40 45Gln Leu Val Thr
Asn Phe Gly Asp Asn Pro Thr Gly Leu Gln Lys Tyr 50 55 60Val Tyr Val
Pro Asp Lys Val Ala Val Ser Pro Ala Ile Ile Val Ala65 70 75 80Leu
His Pro Cys Gly Gly Ser Ala Gln Gly Trp Tyr Ser Gln Thr Arg 85 90
95Leu Pro Ser Tyr Ala Asp Gln Leu Gly Phe Ile Leu Ile Tyr Ala Gly
100 105 110Thr Thr Lys Met Ser Asn Cys Trp Asp Val Gln Asn Pro Ala
Ser Leu 115 120 125Thr His Gly Gly Gly Asp Ala Gly Gly Ile Val Ser
Met Val Lys Tyr 130 135 140Ala Leu Lys Gln Tyr Asn Gly Asp Ala Ser
Arg Val Tyr Val Met Gly145 150 155 160Gly Ser Ser Gly Ala Met Met
Thr Asn Val Leu Ala Gly Ser Tyr Pro 165 170 175Asp Val Phe Glu Ala
Gly Ala Ala Phe Ser Gly Val Ala His Ala Cys 180 185 190Phe Leu Gly
Ala Asp Ser Ala Thr Pro Phe Ser Pro Asn Gln Thr Cys 195 200 205Ala
Gln Gly Arg Ile Gln Arg Ser Ala Arg Glu Trp Gly Asp Leu Val 210 215
220Arg Asn Ser Phe Pro Ala Tyr Asp Gly Arg Arg Pro Arg Met Gln
Ile225 230 235 240Phe His Gly Asn Ala Asp Phe Leu Val His Pro Glu
Cys Ala His Gln 245 250 255Ala Leu Ala Gln Trp Ala Asp Val Leu Gly
Leu Gln Leu Thr Gln Thr 260 265 270Asn Lys Gly Val Pro Ser Ala Glu
Tyr Thr Gln Glu Val Tyr Gly Asp 275 280 285Gly Thr Gln Leu Gln Gly
Phe Phe Gly Asp Gly Val Gly His Ile Ala 290 295 300Pro Val Asn Glu
Pro Val Met Leu Arg Phe Phe Gly Leu Met Asn305 310
31572957DNAChrysosporium lucknowense 72atgccgtgca agtcgagcct
cctcctcctc ctcctctcct ccttcttctc ctccgaggcc 60aacaccgccc accgcccacc
atgcgtctct gcgatcgctc accaagatgg ccgccatggc 120cggcctgcgc
ggggcgcgtc actgcagcta gtgaccaact ttggcgacaa tccgaccggt
180ctgcagaagt atgtgtacgt tccggacaaa gttgcggtct cgcccgccat
catcgttgct 240ctgcatccct gtggaggatc ggctcaggga tggtactcgc
agacacggct cccgtcgtac 300gcggaccagc ttgggttcat cctcatctac
gccggcacga ccaagatgag caactgctgg 360gacgtccaga acccggccag
cctgacacac ggcgggggcg acgccggggg catcgtcagc 420atggtcaagt
acgcgctgaa gcagtacaac ggcgacgcgt cgagggtgta cgtcatgggg
480ggctcgtcgg gggccatgat gaccaacgtg ctggcgggct cgtacccgga
cgtgttcgag 540gccggcgcgg ccttctccgg ggtggcgcac gcgtgcttcc
tcggggccga ctcggcgacg 600cccttctcgc ctaaccagac gtgcgcgcag
ggccggatcc agcggtcggc gcgcgagtgg 660ggggacctgg tgcgcaactc
gttcccggcc tacgacggcc gccgcccgcg catgcagatc 720ttccacggca
acgccgactt cctcgtccat cccgagtgcg cccaccaggc cctcgcccag
780tgggccgacg tcctcggcct ccagctgacc cagaccaaca agggcgtgcc
ctcggccgag 840tacacccagg aggtctacgg cgacggcacc cagctgcagg
gcttctttgg cgacggcgtc 900ggccacatcg ccccggtcaa cgagcccgtc
atgctccgct tctttggcct gatgaac 957731560DNAChrysosporium
lucknowenseCDS(221)..(1330) 73gtgagttaca catagttata taaaggataa
ccccatcggg ggaaccagtc ctcgctgcat 60atctgcttcc ggcacggatc ttcagttatt
tcgcctcgcg cttggccggc tgctccccct 120gttcccaccg ccgaagtacg
aagtaattaa tagtcattgc gcggacccaa ctgattgtgc 180ctgctccgtg
tgaagcagag aaaaggagga caccgccatc atg cgt ctc acc aag 235 Met Arg
Leu Thr Lys 1 5acg gac ctc agc gtc gcg ctg ctc gcg tcg gcc cta ccc
gca acc cag 283Thr Asp Leu Ser Val Ala Leu Leu Ala Ser Ala Leu Pro
Ala Thr Gln 10 15 20gcg gcg tgc agc ctc ccc tcg tcc tac cgg tgg acg
tcg acg ggc gcg 331Ala Ala Cys Ser Leu Pro Ser Ser Tyr Arg Trp Thr
Ser Thr Gly Ala 25 30 35ttg gct cag ccc aaa tcg ggg tgg gct aac ctc
aag gac ttc acc cac 379Leu Ala Gln Pro Lys Ser Gly Trp Ala Asn Leu
Lys Asp Phe Thr His 40 45 50gtg cct tac aac ggc aag cac ctt gtg tac
ggg agc tac tac tct aac 427Val Pro Tyr Asn Gly Lys His Leu Val Tyr
Gly Ser Tyr Tyr Ser Asn 55 60 65gcc tac ggc tca atg aac ttc ggc ctc
ttc tcg gac tgg agc gag atg 475Ala Tyr Gly Ser Met Asn Phe Gly Leu
Phe Ser Asp Trp Ser Glu Met70 75 80 85ggc tcg gcc agc cag aac cag
atg agc cag gcc gcg gtc gcg ccc acg 523Gly Ser Ala Ser Gln Asn Gln
Met Ser Gln Ala Ala Val Ala Pro Thr 90 95 100ctc ttc tac ttc tcc
cca aag aac atc tgg gtc ctg gcc tac cag tgg 571Leu Phe Tyr Phe Ser
Pro Lys Asn Ile Trp Val Leu Ala Tyr Gln Trp 105 110 115ggt gcc acc
cct ttc tcc tac cgc acc tcg acc gat ccg acc aac ccc 619Gly Ala Thr
Pro Phe Ser Tyr Arg Thr Ser Thr Asp Pro Thr Asn Pro 120 125 130aac
ggc tgg tcc gct ccc cag ccg ctc ttc tcg ggc tcc atc tcc gac 667Asn
Gly Trp Ser Ala Pro Gln Pro Leu Phe Ser Gly Ser Ile Ser Asp 135 140
145tcg gac acg ggc ccc atc gac cag acc ctg atc ggc gac gac aag aac
715Ser Asp Thr Gly Pro Ile Asp Gln Thr Leu Ile Gly Asp Asp Lys
Asn150 155 160 165atg tac ctg ttc ttc gcc ggc gac aac ggc aag atc
tac cgc gcc agc 763Met Tyr Leu Phe Phe Ala Gly Asp Asn Gly Lys Ile
Tyr Arg Ala Ser 170 175 180atg ccg ctc ggc aac ttc ccg ggc aac ttt
ggc tcg gcc gcc acc gtc 811Met Pro Leu Gly Asn Phe Pro Gly Asn Phe
Gly Ser Ala Ala Thr Val 185 190 195gtc ctg agc ggc gcc cgc tac gac
ctg ttc gag gcc gtg cag gta tac 859Val Leu Ser Gly Ala Arg Tyr Asp
Leu Phe Glu Ala Val Gln Val Tyr 200 205 210acg gtc gtc ggc cag agc
tcg ccg acc tat ctg atg atc gtc gag tcc 907Thr Val Val Gly Gln Ser
Ser Pro Thr Tyr Leu Met Ile Val Glu Ser 215 220 225atc ggc gcg cgc
ggg cgc tac ttc cgc tcc tac acg gcc agc cgg ctc 955Ile Gly Ala Arg
Gly Arg Tyr Phe Arg Ser Tyr Thr Ala Ser Arg Leu230 235 240 245gac
ggc gcc tgg acg gcc cag gcg gcc agc gag agc cag ccg ttc gcc 1003Asp
Gly Ala Trp Thr Ala Gln Ala Ala Ser Glu Ser Gln Pro Phe Ala 250 255
260ggc gcg gcc aac agc ggc gcc tcc tgg acc agc gac atc agc cac ggc
1051Gly Ala Ala Asn Ser Gly Ala Ser Trp Thr Ser Asp Ile Ser His Gly
265 270 275gac ctc atc cgc atc gac gcc aac cac acc atg ccc gtc gac
ccc tgc 1099Asp Leu Ile Arg Ile Asp Ala Asn His Thr Met Pro Val Asp
Pro Cys 280 285 290cgc ctc cag ctg ctc tac cag ggc cgc tcc ggc gac
agc gcc gac tac 1147Arg Leu Gln Leu Leu Tyr Gln Gly Arg Ser Gly Asp
Ser Ala Asp Tyr 295 300 305aac tcc ctc ccc tac cgc ccg ggc ctg ctc
acc ctg cag ggc gct tct 1195Asn Ser Leu Pro Tyr Arg Pro Gly Leu Leu
Thr Leu Gln Gly Ala Ser310 315 320 325tcc ggc ggc ggc gac aac ggc
ggc ggc ggc ggc ggc gta acc gtc ccc 1243Ser Gly Gly Gly Asp Asn Gly
Gly Gly Gly Gly Gly Val Thr Val Pro 330 335 340cgg tac ggc cag tgc
ggt ggc cag ggc tac acg ggg ccg aca acg tgc 1291Arg Tyr Gly Gln Cys
Gly Gly Gln Gly Tyr Thr Gly Pro Thr Thr Cys 345 350 355gag agc cca
tac acg tgc aag tac cag aat gat tgt aag tgatgtctta 1340Glu Ser Pro
Tyr Thr Cys Lys Tyr Gln Asn Asp Cys Lys 360 365 370tttaactctg
gttttttctt ttcttccctc tcttgtgaat cctgtgggta gtagccagtc
1400tgaatcaatt tactaattaa caattcttga aagggtactc ccaatgcctc
tgacaagcga 1460cggaataatt aataagtaag taaaagtggg tgagagcggc
cttgaagctg cgggttccct 1520tttttactgc tgagttcaga tgtatgcata
cgtaggtata 156074370PRTChrysosporium lucknowense 74Met Arg Leu Thr
Lys Thr Asp Leu Ser Val Ala Leu Leu Ala Ser Ala1 5 10 15Leu Pro Ala
Thr Gln Ala Ala Cys Ser Leu Pro Ser Ser Tyr Arg Trp 20 25 30Thr Ser
Thr Gly Ala Leu Ala Gln Pro Lys Ser Gly Trp Ala Asn Leu 35 40 45Lys
Asp Phe Thr His Val Pro Tyr Asn Gly Lys His Leu Val Tyr Gly 50 55
60Ser Tyr Tyr Ser Asn Ala Tyr Gly Ser Met Asn Phe Gly Leu Phe Ser65
70 75 80Asp Trp Ser Glu Met Gly Ser Ala Ser Gln Asn Gln Met Ser Gln
Ala 85 90 95Ala Val Ala Pro Thr Leu Phe Tyr Phe Ser Pro Lys Asn Ile
Trp Val 100 105 110Leu Ala Tyr Gln Trp Gly Ala Thr Pro Phe Ser Tyr
Arg Thr Ser Thr 115 120 125Asp Pro Thr Asn Pro Asn Gly Trp Ser Ala
Pro Gln Pro Leu Phe Ser 130 135 140Gly Ser Ile Ser Asp Ser Asp Thr
Gly Pro Ile Asp Gln Thr Leu Ile145 150 155 160Gly Asp Asp Lys Asn
Met Tyr Leu Phe Phe Ala Gly Asp Asn Gly Lys 165 170 175Ile Tyr Arg
Ala Ser Met Pro Leu Gly Asn Phe Pro Gly Asn Phe Gly 180 185 190Ser
Ala Ala Thr Val Val Leu Ser Gly Ala Arg Tyr Asp Leu Phe Glu 195 200
205Ala Val Gln Val Tyr Thr Val Val Gly Gln Ser Ser Pro Thr Tyr Leu
210 215 220Met Ile Val Glu Ser Ile Gly Ala Arg Gly Arg Tyr Phe Arg
Ser Tyr225 230 235 240Thr Ala Ser Arg Leu Asp Gly Ala Trp Thr Ala
Gln Ala Ala Ser Glu 245 250 255Ser Gln Pro Phe Ala Gly Ala Ala Asn
Ser Gly Ala Ser Trp Thr Ser 260 265 270Asp Ile Ser His Gly Asp Leu
Ile Arg Ile Asp Ala Asn His Thr Met 275 280 285Pro Val Asp Pro Cys
Arg Leu Gln Leu Leu Tyr Gln Gly Arg Ser Gly 290 295 300Asp Ser Ala
Asp Tyr Asn Ser Leu Pro Tyr Arg Pro Gly Leu Leu Thr305 310 315
320Leu Gln Gly Ala Ser Ser Gly Gly Gly Asp Asn Gly Gly Gly Gly Gly
325 330 335Gly Val Thr Val Pro Arg Tyr Gly Gln Cys Gly Gly Gln Gly
Tyr Thr 340 345
350Gly Pro Thr Thr Cys Glu Ser Pro Tyr Thr Cys Lys Tyr Gln Asn Asp
355 360 365Cys Lys 370751110DNAChrysosporium lucknowense
75atgcgtctca ccaagacgga cctcagcgtc gcgctgctcg cgtcggccct acccgcaacc
60caggcggcgt gcagcctccc ctcgtcctac cggtggacgt cgacgggcgc gttggctcag
120cccaaatcgg ggtgggctaa cctcaaggac ttcacccacg tgccttacaa
cggcaagcac 180cttgtgtacg ggagctacta ctctaacgcc tacggctcaa
tgaacttcgg cctcttctcg 240gactggagcg agatgggctc ggccagccag
aaccagatga gccaggccgc ggtcgcgccc 300acgctcttct acttctcccc
aaagaacatc tgggtcctgg cctaccagtg gggtgccacc 360cctttctcct
accgcacctc gaccgatccg accaacccca acggctggtc cgctccccag
420ccgctcttct cgggctccat ctccgactcg gacacgggcc ccatcgacca
gaccctgatc 480ggcgacgaca agaacatgta cctgttcttc gccggcgaca
acggcaagat ctaccgcgcc 540agcatgccgc tcggcaactt cccgggcaac
tttggctcgg ccgccaccgt cgtcctgagc 600ggcgcccgct acgacctgtt
cgaggccgtg caggtataca cggtcgtcgg ccagagctcg 660ccgacctatc
tgatgatcgt cgagtccatc ggcgcgcgcg ggcgctactt ccgctcctac
720acggccagcc ggctcgacgg cgcctggacg gcccaggcgg ccagcgagag
ccagccgttc 780gccggcgcgg ccaacagcgg cgcctcctgg accagcgaca
tcagccacgg cgacctcatc 840cgcatcgacg ccaaccacac catgcccgtc
gacccctgcc gcctccagct gctctaccag 900ggccgctccg gcgacagcgc
cgactacaac tccctcccct accgcccggg cctgctcacc 960ctgcagggcg
cttcttccgg cggcggcgac aacggcggcg gcggcggcgg cgtaaccgtc
1020ccccggtacg gccagtgcgg tggccagggc tacacggggc cgacaacgtg
cgagagccca 1080tacacgtgca agtaccagaa tgattgtaag
1110761310DNAChrysosporium lucknowenseCDS(114)..(1076) 76gtatatatca
gcgcgtagac agcccgcctc tctcccgggc gttcatgtac tcgtcgacaa 60cgcctctctg
atctctctat atcgccacgt ggaacaaaca cccccatccc acc atg 116 Met 1agg
cca aca cgc ccc ggt atc gtg ctc ctc gcc acc gcg acc tcg gtt 164Arg
Pro Thr Arg Pro Gly Ile Val Leu Leu Ala Thr Ala Thr Ser Val 5 10
15gcc gga cag tca tgc gag ctc ccc tcc tcg tac cgc tgg aca tca acc
212Ala Gly Gln Ser Cys Glu Leu Pro Ser Ser Tyr Arg Trp Thr Ser Thr
20 25 30ggc gcg ctg gca cag ccg aaa agc ccg tgg gtg tcg ctc aag gac
ttc 260Gly Ala Leu Ala Gln Pro Lys Ser Pro Trp Val Ser Leu Lys Asp
Phe 35 40 45acc gtc gtc ccg tac gac ggc cag cac ctg gtc tac gca acg
aca aac 308Thr Val Val Pro Tyr Asp Gly Gln His Leu Val Tyr Ala Thr
Thr Asn50 55 60 65gac ggg acc aac tgg ggc tcc atg ggc ttc agc ctc
ttc agc gac tgg 356Asp Gly Thr Asn Trp Gly Ser Met Gly Phe Ser Leu
Phe Ser Asp Trp 70 75 80gat gcc atg ggc tcg gcg acg cag acg ggc atg
agc tcc agc gcg gtg 404Asp Ala Met Gly Ser Ala Thr Gln Thr Gly Met
Ser Ser Ser Ala Val 85 90 95gcg ccc acc ctg ttc tac ttc gag ccc aag
aac gtt tgg atc ctc gcc 452Ala Pro Thr Leu Phe Tyr Phe Glu Pro Lys
Asn Val Trp Ile Leu Ala 100 105 110cac cag tgg ggc ccg acg gcc ttc
tcg tac cgg acg tcc aac gat tcc 500His Gln Trp Gly Pro Thr Ala Phe
Ser Tyr Arg Thr Ser Asn Asp Ser 115 120 125acc aac ccc aac agc tgg
tcc cag ccc cag ccc ctc ttc acc ggc acc 548Thr Asn Pro Asn Ser Trp
Ser Gln Pro Gln Pro Leu Phe Thr Gly Thr130 135 140 145atc tcg ggc
tcg agc acc ggc ccc atc gac cag acc ctc atc ggc gac 596Ile Ser Gly
Ser Ser Thr Gly Pro Ile Asp Gln Thr Leu Ile Gly Asp 150 155 160gac
cag aat atg tac ctc ttc ttc tgc ggc gac aac ggc aag atc tac 644Asp
Gln Asn Met Tyr Leu Phe Phe Cys Gly Asp Asn Gly Lys Ile Tyr 165 170
175cgc gcc agc atg ccc atc ggc aac ttc ccc gga aac ttt ggc agc gag
692Arg Ala Ser Met Pro Ile Gly Asn Phe Pro Gly Asn Phe Gly Ser Glu
180 185 190tcg acc gtc gtc atg agc gac agc acc aac aac ctc ttc gag
gcc gtc 740Ser Thr Val Val Met Ser Asp Ser Thr Asn Asn Leu Phe Glu
Ala Val 195 200 205cag gtg tac aag gtg aac ggg agg cag cag tac ctc
atg ctc gtc gag 788Gln Val Tyr Lys Val Asn Gly Arg Gln Gln Tyr Leu
Met Leu Val Glu210 215 220 225gcc atc ggc gcc cag ggc cgc tac ttc
cgc tcc ttc acg gcc acg agg 836Ala Ile Gly Ala Gln Gly Arg Tyr Phe
Arg Ser Phe Thr Ala Thr Arg 230 235 240ctc gac ggc gag tgg acg ccc
aac gcg gtc agc gag gct agt ccc ttc 884Leu Asp Gly Glu Trp Thr Pro
Asn Ala Val Ser Glu Ala Ser Pro Phe 245 250 255gcc ggc aag gcc aac
tcg ggc gcc agc tgg acc aac gac atc agc cac 932Ala Gly Lys Ala Asn
Ser Gly Ala Ser Trp Thr Asn Asp Ile Ser His 260 265 270ggc gag ctc
gtc cgc ctc agc gcc gac cag acc ttc ccc atc gac ccc 980Gly Glu Leu
Val Arg Leu Ser Ala Asp Gln Thr Phe Pro Ile Asp Pro 275 280 285tgc
aac ctg cag ctg ctc tac cag ggc cgc gac ccc agc tcg gga ggc 1028Cys
Asn Leu Gln Leu Leu Tyr Gln Gly Arg Asp Pro Ser Ser Gly Gly290 295
300 305gat tac aac cgc ctg ccc tac cgc ccc ggc gtc ctt acc ctg cag
cgc 1076Asp Tyr Asn Arg Leu Pro Tyr Arg Pro Gly Val Leu Thr Leu Gln
Arg 310 315 320tgatctgata ggctctcttt tcgtactttc atccccaggt
gcgtcaaatg gggtagctgg 1136tcgatggggc gggtcagaga aaccctgggc
acgcggggaa gttcaaatcg agatgctttg 1196ttctatacct cacttttttg
cggttagcag ccctagtctg tcgagggatg tcagagcaat 1256ttctctaaca
atttcgactc actaagggta caacgttcat ctgcaatcgc tgca
131077321PRTChrysosporium lucknowense 77Met Arg Pro Thr Arg Pro Gly
Ile Val Leu Leu Ala Thr Ala Thr Ser1 5 10 15Val Ala Gly Gln Ser Cys
Glu Leu Pro Ser Ser Tyr Arg Trp Thr Ser 20 25 30Thr Gly Ala Leu Ala
Gln Pro Lys Ser Pro Trp Val Ser Leu Lys Asp 35 40 45Phe Thr Val Val
Pro Tyr Asp Gly Gln His Leu Val Tyr Ala Thr Thr 50 55 60Asn Asp Gly
Thr Asn Trp Gly Ser Met Gly Phe Ser Leu Phe Ser Asp65 70 75 80Trp
Asp Ala Met Gly Ser Ala Thr Gln Thr Gly Met Ser Ser Ser Ala 85 90
95Val Ala Pro Thr Leu Phe Tyr Phe Glu Pro Lys Asn Val Trp Ile Leu
100 105 110Ala His Gln Trp Gly Pro Thr Ala Phe Ser Tyr Arg Thr Ser
Asn Asp 115 120 125Ser Thr Asn Pro Asn Ser Trp Ser Gln Pro Gln Pro
Leu Phe Thr Gly 130 135 140Thr Ile Ser Gly Ser Ser Thr Gly Pro Ile
Asp Gln Thr Leu Ile Gly145 150 155 160Asp Asp Gln Asn Met Tyr Leu
Phe Phe Cys Gly Asp Asn Gly Lys Ile 165 170 175Tyr Arg Ala Ser Met
Pro Ile Gly Asn Phe Pro Gly Asn Phe Gly Ser 180 185 190Glu Ser Thr
Val Val Met Ser Asp Ser Thr Asn Asn Leu Phe Glu Ala 195 200 205Val
Gln Val Tyr Lys Val Asn Gly Arg Gln Gln Tyr Leu Met Leu Val 210 215
220Glu Ala Ile Gly Ala Gln Gly Arg Tyr Phe Arg Ser Phe Thr Ala
Thr225 230 235 240Arg Leu Asp Gly Glu Trp Thr Pro Asn Ala Val Ser
Glu Ala Ser Pro 245 250 255Phe Ala Gly Lys Ala Asn Ser Gly Ala Ser
Trp Thr Asn Asp Ile Ser 260 265 270His Gly Glu Leu Val Arg Leu Ser
Ala Asp Gln Thr Phe Pro Ile Asp 275 280 285Pro Cys Asn Leu Gln Leu
Leu Tyr Gln Gly Arg Asp Pro Ser Ser Gly 290 295 300Gly Asp Tyr Asn
Arg Leu Pro Tyr Arg Pro Gly Val Leu Thr Leu Gln305 310 315
320Arg78963DNAChrysosporium lucknowense 78atgaggccaa cacgccccgg
tatcgtgctc ctcgccaccg cgacctcggt tgccggacag 60tcatgcgagc tcccctcctc
gtaccgctgg acatcaaccg gcgcgctggc acagccgaaa 120agcccgtggg
tgtcgctcaa ggacttcacc gtcgtcccgt acgacggcca gcacctggtc
180tacgcaacga caaacgacgg gaccaactgg ggctccatgg gcttcagcct
cttcagcgac 240tgggatgcca tgggctcggc gacgcagacg ggcatgagct
ccagcgcggt ggcgcccacc 300ctgttctact tcgagcccaa gaacgtttgg
atcctcgccc accagtgggg cccgacggcc 360ttctcgtacc ggacgtccaa
cgattccacc aaccccaaca gctggtccca gccccagccc 420ctcttcaccg
gcaccatctc gggctcgagc accggcccca tcgaccagac cctcatcggc
480gacgaccaga atatgtacct cttcttctgc ggcgacaacg gcaagatcta
ccgcgccagc 540atgcccatcg gcaacttccc cggaaacttt ggcagcgagt
cgaccgtcgt catgagcgac 600agcaccaaca acctcttcga ggccgtccag
gtgtacaagg tgaacgggag gcagcagtac 660ctcatgctcg tcgaggccat
cggcgcccag ggccgctact tccgctcctt cacggccacg 720aggctcgacg
gcgagtggac gcccaacgcg gtcagcgagg ctagtccctt cgccggcaag
780gccaactcgg gcgccagctg gaccaacgac atcagccacg gcgagctcgt
ccgcctcagc 840gccgaccaga ccttccccat cgacccctgc aacctgcagc
tgctctacca gggccgcgac 900cccagctcgg gaggcgatta caaccgcctg
ccctaccgcc ccggcgtcct taccctgcag 960cgc 963791422DNAChrysosporium
lucknowenseCDS(68)..(221)CDS(288)..(411)CDS(465)..(570)CDS(648)..(827)CDS-
(890)..(1234) 79aaaccttcta tcaccacctc acttcaactt cactctgccg
tcgttttaaa ccggcaactc 60gcccaga atg tgg cct act ccc aag gtt gct tac
ctc ttg ctg gca gct 109 Met Trp Pro Thr Pro Lys Val Ala Tyr Leu Leu
Leu Ala Ala 1 5 10cac tcg ctg ctc gtc agc ggg act ccc gtc gac gcc
gag gcc gtg gac 157His Ser Leu Leu Val Ser Gly Thr Pro Val Asp Ala
Glu Ala Val Asp15 20 25 30acg gcc atc ctc gcc aag cgc caa tgc ccg
cag atc cat atc ttc gga 205Thr Ala Ile Leu Ala Lys Arg Gln Cys Pro
Gln Ile His Ile Phe Gly 35 40 45gct cga gaa acg aca g gtttgtcgta
ttcttgccag gttactaatg tgcatctgag 261Ala Arg Glu Thr Thr
50gtcgtctagc tgacaatacg ggacag ca cca ccc gga tat gga act gcc ggt
313 Ala Pro Pro Gly Tyr Gly Thr Ala Gly 55 60acc gtc gtc aac ctg
att ctc aat gcg ttc cct ggg gcc acg gcc gaa 361Thr Val Val Asn Leu
Ile Leu Asn Ala Phe Pro Gly Ala Thr Ala Glu 65 70 75gca atc aac tat
ccc gcc tgc ggt gga caa gca tct tgt gga gga gtc 409Ala Ile Asn Tyr
Pro Ala Cys Gly Gly Gln Ala Ser Cys Gly Gly Val 80 85 90ca
gaacgtgatc ccttctattt gcgccatgat atcgtcgact aaccatccta cag g 465
Glntac ggt gac tcg gca agg cag ggc acc gac gcg gtg gct tcg gcg gtg
513Tyr Gly Asp Ser Ala Arg Gln Gly Thr Asp Ala Val Ala Ser Ala Val
95 100 105aac agc ttc aac cag cgg tgt ccc aac act cag atc gtg ctt
gtt ggc 561Asn Ser Phe Asn Gln Arg Cys Pro Asn Thr Gln Ile Val Leu
Val Gly110 115 120 125tac tcc cag gtgcgtgtcg atgagaatga caagacacat
ctgggaaagg 610Tyr Ser Glnacaaagatga gaaggaacgg ggttctaaca aacgcag
ggc ggt caa atc atc gac 665 Gly Gly Gln Ile Ile Asp 130aat gct gtt
tgt ggt ggc cct gac acg ggc tct gga atc acg acc acg 713Asn Ala Val
Cys Gly Gly Pro Asp Thr Gly Ser Gly Ile Thr Thr Thr135 140 145
150act cct ccc atc tcg gct gcg gcc ctg aac cag atc aag gcc gtc atc
761Thr Pro Pro Ile Ser Ala Ala Ala Leu Asn Gln Ile Lys Ala Val Ile
155 160 165gaa atg ggc tct cca cgg ttc gtc gcc ggc ttg tcc tac gac
gtc ggt 809Glu Met Gly Ser Pro Arg Phe Val Ala Gly Leu Ser Tyr Asp
Val Gly 170 175 180acc tgc acg gca caa ggt gtaagtggct tcgtcctctc
tcgaatgagg 857Thr Cys Thr Ala Gln Gly 185tattctctag tttctaacac
acgcattcct ag ttt gct gcg cgc ccg cgt ggc 910 Phe Ala Ala Arg Pro
Arg Gly 190 195tac gtc tgc ggc tcc aac tcg gcg tcc aag atc cag agc
tac tgc gat 958Tyr Val Cys Gly Ser Asn Ser Ala Ser Lys Ile Gln Ser
Tyr Cys Asp 200 205 210tcg acc gac ccc tac tgc tgc acc ggc aac gac
gcc aac agc cac cag 1006Ser Thr Asp Pro Tyr Cys Cys Thr Gly Asn Asp
Ala Asn Ser His Gln 215 220 225cag tat ggc aac aag tac gga cag cag
gcc ctg gct ttt gtc aag gcc 1054Gln Tyr Gly Asn Lys Tyr Gly Gln Gln
Ala Leu Ala Phe Val Lys Ala 230 235 240agg ctc agc ggc agc ggc gga
act ccg acg tcg tcg gcc ggc ggc tcc 1102Arg Leu Ser Gly Ser Gly Gly
Thr Pro Thr Ser Ser Ala Gly Gly Ser 245 250 255gtt ccg acc ggc ggc
aac ggc ggc act tgc agc ccg ctg tac gga cag 1150Val Pro Thr Gly Gly
Asn Gly Gly Thr Cys Ser Pro Leu Tyr Gly Gln260 265 270 275tgc ggt
ggc cag gga tgg acg ggt ccc acg tgc tgc tct cag gga acc 1198Cys Gly
Gly Gln Gly Trp Thr Gly Pro Thr Cys Cys Ser Gln Gly Thr 280 285
290tgc agg gca tcc aac cag tgg tac tcc cag tgc ttg taaatccagg
1244Cys Arg Ala Ser Asn Gln Trp Tyr Ser Gln Cys Leu 295
300gcgtgactcc gtgaggaaca tgggttccaa ggactcgacc tcttcttgta
aatatcttga 1304ttgcggcagt gtgaatacag cccggcacat agcggaattc
ctttctgccc cttctgacta 1364acaccgagag cagcataagt agatatatgc
ttaggaacag accgagccag ataaaaaa 142280303PRTChrysosporium
lucknowense 80Met Trp Pro Thr Pro Lys Val Ala Tyr Leu Leu Leu Ala
Ala His Ser1 5 10 15Leu Leu Val Ser Gly Thr Pro Val Asp Ala Glu Ala
Val Asp Thr Ala 20 25 30Ile Leu Ala Lys Arg Gln Cys Pro Gln Ile His
Ile Phe Gly Ala Arg 35 40 45Glu Thr Thr Ala Pro Pro Gly Tyr Gly Thr
Ala Gly Thr Val Val Asn 50 55 60Leu Ile Leu Asn Ala Phe Pro Gly Ala
Thr Ala Glu Ala Ile Asn Tyr65 70 75 80Pro Ala Cys Gly Gly Gln Ala
Ser Cys Gly Gly Val Gln Tyr Gly Asp 85 90 95Ser Ala Arg Gln Gly Thr
Asp Ala Val Ala Ser Ala Val Asn Ser Phe 100 105 110Asn Gln Arg Cys
Pro Asn Thr Gln Ile Val Leu Val Gly Tyr Ser Gln 115 120 125Gly Gly
Gln Ile Ile Asp Asn Ala Val Cys Gly Gly Pro Asp Thr Gly 130 135
140Ser Gly Ile Thr Thr Thr Thr Pro Pro Ile Ser Ala Ala Ala Leu
Asn145 150 155 160Gln Ile Lys Ala Val Ile Glu Met Gly Ser Pro Arg
Phe Val Ala Gly 165 170 175Leu Ser Tyr Asp Val Gly Thr Cys Thr Ala
Gln Gly Phe Ala Ala Arg 180 185 190Pro Arg Gly Tyr Val Cys Gly Ser
Asn Ser Ala Ser Lys Ile Gln Ser 195 200 205Tyr Cys Asp Ser Thr Asp
Pro Tyr Cys Cys Thr Gly Asn Asp Ala Asn 210 215 220Ser His Gln Gln
Tyr Gly Asn Lys Tyr Gly Gln Gln Ala Leu Ala Phe225 230 235 240Val
Lys Ala Arg Leu Ser Gly Ser Gly Gly Thr Pro Thr Ser Ser Ala 245 250
255Gly Gly Ser Val Pro Thr Gly Gly Asn Gly Gly Thr Cys Ser Pro Leu
260 265 270Tyr Gly Gln Cys Gly Gly Gln Gly Trp Thr Gly Pro Thr Cys
Cys Ser 275 280 285Gln Gly Thr Cys Arg Ala Ser Asn Gln Trp Tyr Ser
Gln Cys Leu 290 295 30081909DNAChrysosporium lucknowense
81atgtggccta ctcccaaggt tgcttacctc ttgctggcag ctcactcgct gctcgtcagc
60gggactcccg tcgacgccga ggccgtggac acggccatcc tcgccaagcg ccaatgcccg
120cagatccata tcttcggagc tcgagaaacg acagcaccac ccggatatgg
aactgccggt 180accgtcgtca acctgattct caatgcgttc cctggggcca
cggccgaagc aatcaactat 240cccgcctgcg gtggacaagc atcttgtgga
ggagtccagt acggtgactc ggcaaggcag 300ggcaccgacg cggtggcttc
ggcggtgaac agcttcaacc agcggtgtcc caacactcag 360atcgtgcttg
ttggctactc ccagggcggt caaatcatcg acaatgctgt ttgtggtggc
420cctgacacgg gctctggaat cacgaccacg actcctccca tctcggctgc
ggccctgaac 480cagatcaagg ccgtcatcga aatgggctct ccacggttcg
tcgccggctt gtcctacgac 540gtcggtacct gcacggcaca aggttttgct
gcgcgcccgc gtggctacgt ctgcggctcc 600aactcggcgt ccaagatcca
gagctactgc gattcgaccg acccctactg ctgcaccggc 660aacgacgcca
acagccacca gcagtatggc aacaagtacg gacagcaggc cctggctttt
720gtcaaggcca ggctcagcgg cagcggcgga actccgacgt cgtcggccgg
cggctccgtt 780ccgaccggcg gcaacggcgg cacttgcagc ccgctgtacg
gacagtgcgg tggccaggga 840tggacgggtc ccacgtgctg ctctcaggga
acctgcaggg catccaacca gtggtactcc 900cagtgcttg
909821440DNAChrysosporium
lucknowenseCDS(282)..(402)CDS(470)..(594)CDS(672)..(929)CDS(987)..(1166)
82cccaaagaga ttaaagcgac cttccggcgt gccattaggc agccttccgc cgaccctttc
60tccgtcttcc ctatttcgaa cacccgtccg tccgggctcc ggggtgtaac atagggtctc
120ggacgagggt catcagacgg tgtcgagtga tgctctggtt tgtaatgtag
ttatataata 180taagaatcca cccgactctt cggtgtactt gtagccgatg
tttgatcagg gcacatctca 240agtccatcta tactcttcaa tcagaataac
atatacacgc c atg aag gtc acc gcc
296 Met Lys Val Thr Ala 1 5gtt gcc gtt ccc gtc ctg gct ggc att gcc
agc gcc aac ccc gtc gac 344Val Ala Val Pro Val Leu Ala Gly Ile Ala
Ser Ala Asn Pro Val Asp 10 15 20gtc gag gcc cgc cag tcg tgc ccc gag
gtc cac gta ttc ggc gcc cgg 392Val Glu Ala Arg Gln Ser Cys Pro Glu
Val His Val Phe Gly Ala Arg 25 30 35gaa act aca g gtaattaagc
ttagggttgg ggttgtctca ggggtaatta 442Glu Thr Thr 40cggctaccta
actaacgtca acatcag ca ccc cct ggc tat ggc aca tcc caa 495 Ala Pro
Pro Gly Tyr Gly Thr Ser Gln 45ggc ctc gtc aac atg gtc gtg cag gcg
tac ccc gga gca aca agc gag 543Gly Leu Val Asn Met Val Val Gln Ala
Tyr Pro Gly Ala Thr Ser Glu50 55 60 65gcc ata aac tac ccc gca tgc
ggc ggt cag gcg tct tgc ggt gga atc 591Ala Ile Asn Tyr Pro Ala Cys
Gly Gly Gln Ala Ser Cys Gly Gly Ile 70 75 80gat gcaagtgacc
ccgaaaatgc cgctgactgt aaatcaggtg catccgttgt 644Asptcctaacgag
taactcgggg cttttag tac aat acc tct gcg aac cag ggc act 698 Tyr Asn
Thr Ser Ala Asn Gln Gly Thr 85 90cag gcg gtt gtc tcc gct gtg acg
agc ttt aac cag cgc tgc cct gac 746Gln Ala Val Val Ser Ala Val Thr
Ser Phe Asn Gln Arg Cys Pro Asp 95 100 105aca aag atc gtc ttg atc
ggc tat tcg cag ggt ggc cag atc atg gac 794Thr Lys Ile Val Leu Ile
Gly Tyr Ser Gln Gly Gly Gln Ile Met Asp 110 115 120aac gcc tac tgc
ggc ggc gcc ggt gcc acc ctc agc ggc agc gcc ctc 842Asn Ala Tyr Cys
Gly Gly Ala Gly Ala Thr Leu Ser Gly Ser Ala Leu 125 130 135aac gcc
gtt aaa gcc acg gtc tgg ttc gga aat ccg cat tat ctg agc 890Asn Ala
Val Lys Ala Thr Val Trp Phe Gly Asn Pro His Tyr Leu Ser140 145 150
155caa tta agt tat cga gtt ggg aca tgc cag gcc gga ggg gtaggttcct
939Gln Leu Ser Tyr Arg Val Gly Thr Cys Gln Ala Gly Gly 160
165ctattgtcga tgtattttcg tgagacacga catgctgacc gagatag ttc gcg gcc
995 Phe Ala Ala 170cgc cct ccc gga ttc caa tgt tcg cct ggt aac cca
gat aac atc aag 1043Arg Pro Pro Gly Phe Gln Cys Ser Pro Gly Asn Pro
Asp Asn Ile Lys 175 180 185tcg tac tgc gat gcc gag gat ccg tac tgc
tgc aat gga aac gat gca 1091Ser Tyr Cys Asp Ala Glu Asp Pro Tyr Cys
Cys Asn Gly Asn Asp Ala 190 195 200aac cac cac caa cag tac gtc acc
atc tac ggc cag cag gcg ctc gcc 1139Asn His His Gln Gln Tyr Val Thr
Ile Tyr Gly Gln Gln Ala Leu Ala 205 210 215ttc atc aag tcg aag ctc
gac gct gct taagcgtagg gagatacgct 1186Phe Ile Lys Ser Lys Leu Asp
Ala Ala220 225gcgttggaaa gggggcgatg gagcacatta ccaagtaaat
acagtgccac gtccgtttgt 1246atgtacacag gtagccaaaa aaacagccat
gctcggatcc atatgtttgc ccctctctct 1306tggagtgacc cgactctgtt
gaggttgttg gtatcgcggt acgagaaagt ggacgaccaa 1366ggcaccaatg
agctgctgct ctttgcaatg gacatttctg ggcgatattt gccgtgaact
1426atcacactcg ccat 144083228PRTChrysosporium lucknowense 83Met Lys
Val Thr Ala Val Ala Val Pro Val Leu Ala Gly Ile Ala Ser1 5 10 15Ala
Asn Pro Val Asp Val Glu Ala Arg Gln Ser Cys Pro Glu Val His 20 25
30Val Phe Gly Ala Arg Glu Thr Thr Ala Pro Pro Gly Tyr Gly Thr Ser
35 40 45Gln Gly Leu Val Asn Met Val Val Gln Ala Tyr Pro Gly Ala Thr
Ser 50 55 60Glu Ala Ile Asn Tyr Pro Ala Cys Gly Gly Gln Ala Ser Cys
Gly Gly65 70 75 80Ile Asp Tyr Asn Thr Ser Ala Asn Gln Gly Thr Gln
Ala Val Val Ser 85 90 95Ala Val Thr Ser Phe Asn Gln Arg Cys Pro Asp
Thr Lys Ile Val Leu 100 105 110Ile Gly Tyr Ser Gln Gly Gly Gln Ile
Met Asp Asn Ala Tyr Cys Gly 115 120 125Gly Ala Gly Ala Thr Leu Ser
Gly Ser Ala Leu Asn Ala Val Lys Ala 130 135 140Thr Val Trp Phe Gly
Asn Pro His Tyr Leu Ser Gln Leu Ser Tyr Arg145 150 155 160Val Gly
Thr Cys Gln Ala Gly Gly Phe Ala Ala Arg Pro Pro Gly Phe 165 170
175Gln Cys Ser Pro Gly Asn Pro Asp Asn Ile Lys Ser Tyr Cys Asp Ala
180 185 190Glu Asp Pro Tyr Cys Cys Asn Gly Asn Asp Ala Asn His His
Gln Gln 195 200 205Tyr Val Thr Ile Tyr Gly Gln Gln Ala Leu Ala Phe
Ile Lys Ser Lys 210 215 220Leu Asp Ala Ala22584684DNAChrysosporium
lucknowense 84atgaaggtca ccgccgttgc cgttcccgtc ctggctggca
ttgccagcgc caaccccgtc 60gacgtcgagg cccgccagtc gtgccccgag gtccacgtat
tcggcgcccg ggaaactaca 120gcaccccctg gctatggcac atcccaaggc
ctcgtcaaca tggtcgtgca ggcgtacccc 180ggagcaacaa gcgaggccat
aaactacccc gcatgcggcg gtcaggcgtc ttgcggtgga 240atcgattaca
atacctctgc gaaccagggc actcaggcgg ttgtctccgc tgtgacgagc
300tttaaccagc gctgccctga cacaaagatc gtcttgatcg gctattcgca
gggtggccag 360atcatggaca acgcctactg cggcggcgcc ggtgccaccc
tcagcggcag cgccctcaac 420gccgttaaag ccacggtctg gttcggaaat
ccgcattatc tgagccaatt aagttatcga 480gttgggacat gccaggccgg
agggttcgcg gcccgccctc ccggattcca atgttcgcct 540ggtaacccag
ataacatcaa gtcgtactgc gatgccgagg atccgtactg ctgcaatgga
600aacgatgcaa accaccacca acagtacgtc accatctacg gccagcaggc
gctcgccttc 660atcaagtcga agctcgacgc tgct 684851401DNAChrysosporium
lucknowenseCDS(85)..(371)CDS(432)..(849)CDS(927)..(1075)CDS(1143)..(1198)-
CDS(1263)..(1345) 85agcttctcag catggacatg ttgaatgcct agcagacgga
aatcccatca gtaatatact 60tagtcttctg gaaacttgct cgtc atg agg cta tcc
ctt tgc agg ttg gca 111 Met Arg Leu Ser Leu Cys Arg Leu Ala 1 5tcg
gcg gtg cta tcc ctg acc ttt gtg ggc gat gct ttg cct ggt gct 159Ser
Ala Val Leu Ser Leu Thr Phe Val Gly Asp Ala Leu Pro Gly Ala10 15 20
25tct gcg cag gcc gcg agc cgg acc act ccg ccg agc gga tgt ctg tca
207Ser Ala Gln Ala Ala Ser Arg Thr Thr Pro Pro Ser Gly Cys Leu Ser
30 35 40gta gga agc ggc agg cag tac agc acc atc acg gaa gcg ata act
gcc 255Val Gly Ser Gly Arg Gln Tyr Ser Thr Ile Thr Glu Ala Ile Thr
Ala 45 50 55ttg ggt tct ggc act tca gct gcc tgc att ttc atc tac ccg
gga acc 303Leu Gly Ser Gly Thr Ser Ala Ala Cys Ile Phe Ile Tyr Pro
Gly Thr 60 65 70tac aac gtt gct gat ggt gtt tcg atc aag tac aag ggc
ccg ctg aca 351Tyr Asn Val Ala Asp Gly Val Ser Ile Lys Tyr Lys Gly
Pro Leu Thr 75 80 85ctg tat ggg tca acc tcg ga gtgagtactt
acataataac tgccaaatgc 401Leu Tyr Gly Ser Thr Ser Asp90 95cgtattcacc
actattaatc accatggcag t acc agc aag cag agc gcc aac 453 Thr Ser Lys
Gln Ser Ala Asn 100cag gtg acc ttt acc agg aac aag ggt tcg gcc gat
gct ggg agc ctg 501Gln Val Thr Phe Thr Arg Asn Lys Gly Ser Ala Asp
Ala Gly Ser Leu 105 110 115gac gcg tct gcc acg ttc aac atc gtg agc
agc aac ttc cgg gcg tac 549Asp Ala Ser Ala Thr Phe Asn Ile Val Ser
Ser Asn Phe Arg Ala Tyr120 125 130 135aac atc aac ttt cgt aac aca
tac ggc acg cag ggc cag gcc gtg gcc 597Asn Ile Asn Phe Arg Asn Thr
Tyr Gly Thr Gln Gly Gln Ala Val Ala 140 145 150gtg gcc gcc aac ggc
gac aag cag gcc tac tat gcg tgc ggt ttc tac 645Val Ala Ala Asn Gly
Asp Lys Gln Ala Tyr Tyr Ala Cys Gly Phe Tyr 155 160 165ggc tac cag
gac acg ctg tac gcc aag agc ggt cgc cag tac tac tcc 693Gly Tyr Gln
Asp Thr Leu Tyr Ala Lys Ser Gly Arg Gln Tyr Tyr Ser 170 175 180aac
tgc tac atc gag ggc gcc gtc gac ttc atc ttt ggc gcc gcg gct 741Asn
Cys Tyr Ile Glu Gly Ala Val Asp Phe Ile Phe Gly Ala Ala Ala 185 190
195gcc tgg ttt ggc gag tgt acc gtt gcc tcc aac ggc ggc ggg tac atc
789Ala Trp Phe Gly Glu Cys Thr Val Ala Ser Asn Gly Gly Gly Tyr
Ile200 205 210 215acc gct aac tct cgc tcc act acg gct gat act acc
tgg tat gct ttt 837Thr Ala Asn Ser Arg Ser Thr Thr Ala Asp Thr Thr
Trp Tyr Ala Phe 220 225 230gat cac agt act gtaagttcac cgtccaaagc
aagtccaacg ctccgtgccc 889Asp His Ser Thr 235acaactatta acgattgtcc
cgcttcgcta ttatcag atc cgg gcg gct gcc gga 944 Ile Arg Ala Ala Ala
Gly 240att agc ttg gcc ggc aaa gtg ttc ctc ggc cga ccg tgg cgc gta
ctt 992Ile Ser Leu Ala Gly Lys Val Phe Leu Gly Arg Pro Trp Arg Val
Leu 245 250 255gcc cgc gtt atc ttc caa aat tcc gag ttg acc gac gtc
gtg cac cct 1040Ala Arg Val Ile Phe Gln Asn Ser Glu Leu Thr Asp Val
Val His Pro 260 265 270gag ggc tgg acc aga atg gct gca gga gct act
cc gtaagtgatt 1085Glu Gly Trp Thr Arg Met Ala Ala Gly Ala Thr Pro
275 280 285ctctatccgc gacttctggt gacccagatg ctgacaaata tggcgaaact
cttgcag c 1143gag ttc cgg gag ttc caa aac act ggt gca gga tct aac
aca tcc cag 1191Glu Phe Arg Glu Phe Gln Asn Thr Gly Ala Gly Ser Asn
Thr Ser Gln 290 295 300aga aag t gtaagtgaat gtatcttcta ctctagaggg
cactgaattg ctaacgctct 1248Arg Lystggtgtgtat gcag gg ctc act ttc cca
acg act gct ggt gtc act aag 1297 Trp Leu Thr Phe Pro Thr Thr Ala
Gly Val Thr Lys 305 310 315acc cag ctc tgg ggc agt gac tgg aag act
tgg ata gat act gca tgg 1345Thr Gln Leu Trp Gly Ser Asp Trp Lys Thr
Trp Ile Asp Thr Ala Trp 320 325 330tgagaggacg tgtccaaggt tggcaatcag
cacctcaaat caacggacat aattaa 140186331PRTChrysosporium lucknowense
86Met Arg Leu Ser Leu Cys Arg Leu Ala Ser Ala Val Leu Ser Leu Thr1
5 10 15Phe Val Gly Asp Ala Leu Pro Gly Ala Ser Ala Gln Ala Ala Ser
Arg 20 25 30Thr Thr Pro Pro Ser Gly Cys Leu Ser Val Gly Ser Gly Arg
Gln Tyr 35 40 45Ser Thr Ile Thr Glu Ala Ile Thr Ala Leu Gly Ser Gly
Thr Ser Ala 50 55 60Ala Cys Ile Phe Ile Tyr Pro Gly Thr Tyr Asn Val
Ala Asp Gly Val65 70 75 80Ser Ile Lys Tyr Lys Gly Pro Leu Thr Leu
Tyr Gly Ser Thr Ser Asp 85 90 95Thr Ser Lys Gln Ser Ala Asn Gln Val
Thr Phe Thr Arg Asn Lys Gly 100 105 110Ser Ala Asp Ala Gly Ser Leu
Asp Ala Ser Ala Thr Phe Asn Ile Val 115 120 125Ser Ser Asn Phe Arg
Ala Tyr Asn Ile Asn Phe Arg Asn Thr Tyr Gly 130 135 140Thr Gln Gly
Gln Ala Val Ala Val Ala Ala Asn Gly Asp Lys Gln Ala145 150 155
160Tyr Tyr Ala Cys Gly Phe Tyr Gly Tyr Gln Asp Thr Leu Tyr Ala Lys
165 170 175Ser Gly Arg Gln Tyr Tyr Ser Asn Cys Tyr Ile Glu Gly Ala
Val Asp 180 185 190Phe Ile Phe Gly Ala Ala Ala Ala Trp Phe Gly Glu
Cys Thr Val Ala 195 200 205Ser Asn Gly Gly Gly Tyr Ile Thr Ala Asn
Ser Arg Ser Thr Thr Ala 210 215 220Asp Thr Thr Trp Tyr Ala Phe Asp
His Ser Thr Ile Arg Ala Ala Ala225 230 235 240Gly Ile Ser Leu Ala
Gly Lys Val Phe Leu Gly Arg Pro Trp Arg Val 245 250 255Leu Ala Arg
Val Ile Phe Gln Asn Ser Glu Leu Thr Asp Val Val His 260 265 270Pro
Glu Gly Trp Thr Arg Met Ala Ala Gly Ala Thr Pro Glu Phe Arg 275 280
285Glu Phe Gln Asn Thr Gly Ala Gly Ser Asn Thr Ser Gln Arg Lys Trp
290 295 300Leu Thr Phe Pro Thr Thr Ala Gly Val Thr Lys Thr Gln Leu
Trp Gly305 310 315 320Ser Asp Trp Lys Thr Trp Ile Asp Thr Ala Trp
325 33087993DNAChrysosporium lucknowense 87atgaggctat ccctttgcag
gttggcatcg gcggtgctat ccctgacctt tgtgggcgat 60gctttgcctg gtgcttctgc
gcaggccgcg agccggacca ctccgccgag cggatgtctg 120tcagtaggaa
gcggcaggca gtacagcacc atcacggaag cgataactgc cttgggttct
180ggcacttcag ctgcctgcat tttcatctac ccgggaacct acaacgttgc
tgatggtgtt 240tcgatcaagt acaagggccc gctgacactg tatgggtcaa
cctcggatac cagcaagcag 300agcgccaacc aggtgacctt taccaggaac
aagggttcgg ccgatgctgg gagcctggac 360gcgtctgcca cgttcaacat
cgtgagcagc aacttccggg cgtacaacat caactttcgt 420aacacatacg
gcacgcaggg ccaggccgtg gccgtggccg ccaacggcga caagcaggcc
480tactatgcgt gcggtttcta cggctaccag gacacgctgt acgccaagag
cggtcgccag 540tactactcca actgctacat cgagggcgcc gtcgacttca
tctttggcgc cgcggctgcc 600tggtttggcg agtgtaccgt tgcctccaac
ggcggcgggt acatcaccgc taactctcgc 660tccactacgg ctgatactac
ctggtatgct tttgatcaca gtactatccg ggcggctgcc 720ggaattagct
tggccggcaa agtgttcctc ggccgaccgt ggcgcgtact tgcccgcgtt
780atcttccaaa attccgagtt gaccgacgtc gtgcaccctg agggctggac
cagaatggct 840gcaggagcta ctcccgagtt ccgggagttc caaaacactg
gtgcaggatc taacacatcc 900cagagaaagt ggctcacttt cccaacgact
gctggtgtca ctaagaccca gctctggggc 960agtgactgga agacttggat
agatactgca tgg 993881504DNAChrysosporium
lucknowenseCDS(296)..(565)CDS(650)..(1240)CDS(1351)..(1452)
88acaccctcac gacaagtcgc aacctcgatt tcccgtgtgc ccggagcttg gcaacgcgtc
60acgacccata gttagtagta tcgtcaaggg ccgtctcgct aaaagtagcg tggtttcaga
120aaccgtttat ttagcgacct taaataatta ttattgatta tctcccgaca
acctccgggc 180ctcgtcggcg ggtttctgtc taactgctgc tgcaaacttc
aggcggctgt ccgtcgtgtt 240cggtcattct caccttgcca gctttggcta
attaccttac cttgctctac caacg atg 298 Met 1aag aga tcc ggc ctt ctg
ggg acg gtg gct ctt ctg ttc ccg gcc ctc 346Lys Arg Ser Gly Leu Leu
Gly Thr Val Ala Leu Leu Phe Pro Ala Leu 5 10 15gtt tct gcc tat gcg
ctc ccg ggc gaa tgt act gga gtc tgc aac aac 394Val Ser Ala Tyr Ala
Leu Pro Gly Glu Cys Thr Gly Val Cys Asn Asn 20 25 30tcc cac gac ccg
tca atc atc cgc cgc gca gat ggg acg tac ttc cgc 442Ser His Asp Pro
Ser Ile Ile Arg Arg Ala Asp Gly Thr Tyr Phe Arg 35 40 45ttc tcg act
ggt ggc aag atc gcc att cac acg gcg ccg agc ctg acc 490Phe Ser Thr
Gly Gly Lys Ile Ala Ile His Thr Ala Pro Ser Leu Thr50 55 60 65ggc
ccg tgg acc tat agg ggc gcc gcc ttg ccg gcc ggg tcc tcc atc 538Gly
Pro Trp Thr Tyr Arg Gly Ala Ala Leu Pro Ala Gly Ser Ser Ile 70 75
80aac ttg gac ggg aac cag gac ctc tgg gtaagaagaa caaggcgttc 585Asn
Leu Asp Gly Asn Gln Asp Leu Trp 85 90acggcccgca gcctggatgt
aatttattta tgggtgatga tgatgttgac gacgagctcg 645ccag gcc ccc gat gtt
gca cag gtc gga gac cag tac tac ctg tac tac 694 Ala Pro Asp Val Ala
Gln Val Gly Asp Gln Tyr Tyr Leu Tyr Tyr 95 100 105tcc gtc agc acc
ttc ggc gtc cag aac tcg gcc atc ggc ata gcc cgc 742Ser Val Ser Thr
Phe Gly Val Gln Asn Ser Ala Ile Gly Ile Ala Arg 110 115 120tca tcg
agc ctg gac gcc ggc acc tgg acc gac ctc ggg tct acc ggt 790Ser Ser
Ser Leu Asp Ala Gly Thr Trp Thr Asp Leu Gly Ser Thr Gly 125 130
135gtc acc tcc agc acg ggc tcg gcg tac aac gcc atc gac ggc aac ctg
838Val Thr Ser Ser Thr Gly Ser Ala Tyr Asn Ala Ile Asp Gly Asn Leu
140 145 150atc aac gag ccg ggg acc agc aac tac ttc ctc acg ttt ggc
agc ttc 886Ile Asn Glu Pro Gly Thr Ser Asn Tyr Phe Leu Thr Phe Gly
Ser Phe 155 160 165tgg aac ggc atc cat cgc gtc cgc atg acg ccc acc
aag acg aac ggc 934Trp Asn Gly Ile His Arg Val Arg Met Thr Pro Thr
Lys Thr Asn Gly170 175 180 185aac gtg tac cag gtc gcc ttt gat ccc
aac gac ccg gcc atg gaa ggc 982Asn Val Tyr Gln Val Ala Phe Asp Pro
Asn Asp Pro Ala Met Glu Gly 190 195 200ccg acc gtc ttc aag tac ggc
gac tac tac tac ctt ttc ttc tcc aag 1030Pro Thr Val Phe Lys Tyr Gly
Asp Tyr Tyr Tyr Leu Phe Phe Ser Lys 205 210 215ggc acg tgc tgc ggc
tac gac cag aac agg ccg ccc gcc gga cag gag 1078Gly Thr Cys Cys Gly
Tyr
Asp Gln Asn Arg Pro Pro Ala Gly Gln Glu 220 225 230tac agg atc atg
gtg tgc agg tcg acc tcg ccc acc ggc ccc ttc gag 1126Tyr Arg Ile Met
Val Cys Arg Ser Thr Ser Pro Thr Gly Pro Phe Glu 235 240 245gac agg
gac ggc aag tcg tgc cgc tcg ggc ggt gga act ctg gtg ctc 1174Asp Arg
Asp Gly Lys Ser Cys Arg Ser Gly Gly Gly Thr Leu Val Leu250 255 260
265ccg tct cac gac tgg gtg tac ggc ccc ggc ggc caa ggt gtc tac cag
1222Pro Ser His Asp Trp Val Tyr Gly Pro Gly Gly Gln Gly Val Tyr Gln
270 275 280gac ccc gaa cac gga ccg gttagttccc cgtcctttct cccgtacggg
1270Asp Pro Glu His Gly Pro 285cctagcattc agcatcgggg atcgagcatc
aagcatcacc ctgcttgaac ccatctcttg 1330ctaacgctct tccttcccag gtc ttg
tac tac cat tat gtt gat act cgc att 1383 Val Leu Tyr Tyr His Tyr
Val Asp Thr Arg Ile 290 295ggc tat gcg gac ggc cag aag aag ttt ggt
tgg aat cgc atc gac ttt 1431Gly Tyr Ala Asp Gly Gln Lys Lys Phe Gly
Trp Asn Arg Ile Asp Phe 300 305 310tca agc ggt tgg ccg gtt gtc
tgagaaaccc gagacgcttg acttgtttat 1482Ser Ser Gly Trp Pro Val Val315
320ttacaggtat taccggtatt aa 150489321PRTChrysosporium lucknowense
89Met Lys Arg Ser Gly Leu Leu Gly Thr Val Ala Leu Leu Phe Pro Ala1
5 10 15Leu Val Ser Ala Tyr Ala Leu Pro Gly Glu Cys Thr Gly Val Cys
Asn 20 25 30Asn Ser His Asp Pro Ser Ile Ile Arg Arg Ala Asp Gly Thr
Tyr Phe 35 40 45Arg Phe Ser Thr Gly Gly Lys Ile Ala Ile His Thr Ala
Pro Ser Leu 50 55 60Thr Gly Pro Trp Thr Tyr Arg Gly Ala Ala Leu Pro
Ala Gly Ser Ser65 70 75 80Ile Asn Leu Asp Gly Asn Gln Asp Leu Trp
Ala Pro Asp Val Ala Gln 85 90 95Val Gly Asp Gln Tyr Tyr Leu Tyr Tyr
Ser Val Ser Thr Phe Gly Val 100 105 110Gln Asn Ser Ala Ile Gly Ile
Ala Arg Ser Ser Ser Leu Asp Ala Gly 115 120 125Thr Trp Thr Asp Leu
Gly Ser Thr Gly Val Thr Ser Ser Thr Gly Ser 130 135 140Ala Tyr Asn
Ala Ile Asp Gly Asn Leu Ile Asn Glu Pro Gly Thr Ser145 150 155
160Asn Tyr Phe Leu Thr Phe Gly Ser Phe Trp Asn Gly Ile His Arg Val
165 170 175Arg Met Thr Pro Thr Lys Thr Asn Gly Asn Val Tyr Gln Val
Ala Phe 180 185 190Asp Pro Asn Asp Pro Ala Met Glu Gly Pro Thr Val
Phe Lys Tyr Gly 195 200 205Asp Tyr Tyr Tyr Leu Phe Phe Ser Lys Gly
Thr Cys Cys Gly Tyr Asp 210 215 220Gln Asn Arg Pro Pro Ala Gly Gln
Glu Tyr Arg Ile Met Val Cys Arg225 230 235 240Ser Thr Ser Pro Thr
Gly Pro Phe Glu Asp Arg Asp Gly Lys Ser Cys 245 250 255Arg Ser Gly
Gly Gly Thr Leu Val Leu Pro Ser His Asp Trp Val Tyr 260 265 270Gly
Pro Gly Gly Gln Gly Val Tyr Gln Asp Pro Glu His Gly Pro Val 275 280
285Leu Tyr Tyr His Tyr Val Asp Thr Arg Ile Gly Tyr Ala Asp Gly Gln
290 295 300Lys Lys Phe Gly Trp Asn Arg Ile Asp Phe Ser Ser Gly Trp
Pro Val305 310 315 320Val90963DNAChrysosporium lucknowense
90atgaagagat ccggccttct ggggacggtg gctcttctgt tcccggccct cgtttctgcc
60tatgcgctcc cgggcgaatg tactggagtc tgcaacaact cccacgaccc gtcaatcatc
120cgccgcgcag atgggacgta cttccgcttc tcgactggtg gcaagatcgc
cattcacacg 180gcgccgagcc tgaccggccc gtggacctat aggggcgccg
ccttgccggc cgggtcctcc 240atcaacttgg acgggaacca ggacctctgg
gcccccgatg ttgcacaggt cggagaccag 300tactacctgt actactccgt
cagcaccttc ggcgtccaga actcggccat cggcatagcc 360cgctcatcga
gcctggacgc cggcacctgg accgacctcg ggtctaccgg tgtcacctcc
420agcacgggct cggcgtacaa cgccatcgac ggcaacctga tcaacgagcc
ggggaccagc 480aactacttcc tcacgtttgg cagcttctgg aacggcatcc
atcgcgtccg catgacgccc 540accaagacga acggcaacgt gtaccaggtc
gcctttgatc ccaacgaccc ggccatggaa 600ggcccgaccg tcttcaagta
cggcgactac tactaccttt tcttctccaa gggcacgtgc 660tgcggctacg
accagaacag gccgcccgcc ggacaggagt acaggatcat ggtgtgcagg
720tcgacctcgc ccaccggccc cttcgaggac agggacggca agtcgtgccg
ctcgggcggt 780ggaactctgg tgctcccgtc tcacgactgg gtgtacggcc
ccggcggcca aggtgtctac 840caggaccccg aacacggacc ggtcttgtac
taccattatg ttgatactcg cattggctat 900gcggacggcc agaagaagtt
tggttggaat cgcatcgact tttcaagcgg ttggccggtt 960gtc
963912451DNAChrysosporium
lucknowenseCDS(57)..(1851)CDS(1954)..(2357) 91aggaccgtca aacgtaaggg
cagctctagc tcaccagata cgagacgtcg cccatc atg 59 Met 1acc ctt caa gcc
ttt gcg ctg ctg gcg gcg gcg gcc ctt gta cgg ggc 107Thr Leu Gln Ala
Phe Ala Leu Leu Ala Ala Ala Ala Leu Val Arg Gly 5 10 15gag acc ccg
acc aag gtc cct cgc gac gct ccg aga gga gct gct gct 155Glu Thr Pro
Thr Lys Val Pro Arg Asp Ala Pro Arg Gly Ala Ala Ala 20 25 30tgg gaa
gct gcc cac tcc tca gca gcc gct gcc ttg gga aag ctg tcc 203Trp Glu
Ala Ala His Ser Ser Ala Ala Ala Ala Leu Gly Lys Leu Ser 35 40 45cag
cag gac aag atc aac atc gtg acg ggc gtc ggc tgg aac aag ggg 251Gln
Gln Asp Lys Ile Asn Ile Val Thr Gly Val Gly Trp Asn Lys Gly50 55 60
65ccc tgc gtg ggc aat act ccc gct atc agc tcc atc aac tac ccg cag
299Pro Cys Val Gly Asn Thr Pro Ala Ile Ser Ser Ile Asn Tyr Pro Gln
70 75 80ctc tgc ctc caa gac gga ccc ctg ggc gtc cgc ttc ggt tcc agc
atc 347Leu Cys Leu Gln Asp Gly Pro Leu Gly Val Arg Phe Gly Ser Ser
Ile 85 90 95acc gct ttc act cca ggc att cag gcc gcg tcg acg tgg gat
gtg gac 395Thr Ala Phe Thr Pro Gly Ile Gln Ala Ala Ser Thr Trp Asp
Val Asp 100 105 110ctg atc cga caa cga ggc gag tac atg ggc gcc gag
ttc aag ggg tgt 443Leu Ile Arg Gln Arg Gly Glu Tyr Met Gly Ala Glu
Phe Lys Gly Cys 115 120 125ggc atc cac gtc cag cta ggc ccc gtg gcc
ggg ccc ttg ggt aag gtt 491Gly Ile His Val Gln Leu Gly Pro Val Ala
Gly Pro Leu Gly Lys Val130 135 140 145ccg caa ggc ggc cgc aac tgg
gag ggc ttt ggc gtg gat ccc tac ctc 539Pro Gln Gly Gly Arg Asn Trp
Glu Gly Phe Gly Val Asp Pro Tyr Leu 150 155 160acc ggc att gcc atg
gcc gag acc atc gag ggg ata cag tca gcg ggc 587Thr Gly Ile Ala Met
Ala Glu Thr Ile Glu Gly Ile Gln Ser Ala Gly 165 170 175gtg caa gcc
acc gcc aag cat tac atc ctc aac gag cag gag ctc aac 635Val Gln Ala
Thr Ala Lys His Tyr Ile Leu Asn Glu Gln Glu Leu Asn 180 185 190cgc
gag acc atg agc agc aat gtc gac gac cgc acc ttg cac gag ctg 683Arg
Glu Thr Met Ser Ser Asn Val Asp Asp Arg Thr Leu His Glu Leu 195 200
205tac ctc tgg cct ttt gca gac gct gtt cac tct aac gtg gcc agc gtc
731Tyr Leu Trp Pro Phe Ala Asp Ala Val His Ser Asn Val Ala Ser
Val210 215 220 225atg tgc agt tac aac aag atc aac ggc acg tgg gcg
tgc gag aac gac 779Met Cys Ser Tyr Asn Lys Ile Asn Gly Thr Trp Ala
Cys Glu Asn Asp 230 235 240cgg gtg ctg aac gtc att ctg aag cag gag
ctt ggg ttt ccg ggc tat 827Arg Val Leu Asn Val Ile Leu Lys Gln Glu
Leu Gly Phe Pro Gly Tyr 245 250 255gtg atg agc gac tgg aac gcg cag
cac tcg acc gac gac gcg gcc aac 875Val Met Ser Asp Trp Asn Ala Gln
His Ser Thr Asp Asp Ala Ala Asn 260 265 270cac ggc atg gac atg acg
atg ccc ggc agc gac ttc aac gga ggg acg 923His Gly Met Asp Met Thr
Met Pro Gly Ser Asp Phe Asn Gly Gly Thr 275 280 285atc ctc tgg ggc
ccg cag ctc gac agc gcc gtc aac agc ggc cgg gtg 971Ile Leu Trp Gly
Pro Gln Leu Asp Ser Ala Val Asn Ser Gly Arg Val290 295 300 305ccc
aag tcc cgc ctc gac gac atg gtc gag cgc atc ctc gcg gcg tgg 1019Pro
Lys Ser Arg Leu Asp Asp Met Val Glu Arg Ile Leu Ala Ala Trp 310 315
320tac ctt ctc ggt cag gac agc aat tac ccg gct atc aac atc ggc gcc
1067Tyr Leu Leu Gly Gln Asp Ser Asn Tyr Pro Ala Ile Asn Ile Gly Ala
325 330 335aac gta cag ggc aac cac aag gag aac gtg cgg gcg gtc gcg
cgc gat 1115Asn Val Gln Gly Asn His Lys Glu Asn Val Arg Ala Val Ala
Arg Asp 340 345 350ggc atc gtg ctc ctc aag aac gac gac ggc atc ctc
ccg ctc aag aag 1163Gly Ile Val Leu Leu Lys Asn Asp Asp Gly Ile Leu
Pro Leu Lys Lys 355 360 365ccc gcc aag ctc gcc ctc atc ggt tcg gcc
gcc gtc gtc aac ccg cag 1211Pro Ala Lys Leu Ala Leu Ile Gly Ser Ala
Ala Val Val Asn Pro Gln370 375 380 385gga ctg aac tcc tgc cag gat
cag ggc tgc aac aaa ggc gcg ctg ggc 1259Gly Leu Asn Ser Cys Gln Asp
Gln Gly Cys Asn Lys Gly Ala Leu Gly 390 395 400atg gga tgg ggg tcc
ggg gcg gtc aac tac ccg tac ttt gtc gcg ccg 1307Met Gly Trp Gly Ser
Gly Ala Val Asn Tyr Pro Tyr Phe Val Ala Pro 405 410 415tac gac gcg
ctc aag gcc cgc gcc caa gag gac ggg acg acg gtc agc 1355Tyr Asp Ala
Leu Lys Ala Arg Ala Gln Glu Asp Gly Thr Thr Val Ser 420 425 430ctg
cac aac tcg gac agc acg agc ggg gtg gcg aac gtg gcc tcg gac 1403Leu
His Asn Ser Asp Ser Thr Ser Gly Val Ala Asn Val Ala Ser Asp 435 440
445gcg gac gcg gcc atc gtg gtc atc acg gcc gac tcg ggc gag ggt tac
1451Ala Asp Ala Ala Ile Val Val Ile Thr Ala Asp Ser Gly Glu Gly
Tyr450 455 460 465atc aca gtc gaa ggc gcc gcc ggg gac cgg ctg aac
ctc gac ccg tgg 1499Ile Thr Val Glu Gly Ala Ala Gly Asp Arg Leu Asn
Leu Asp Pro Trp 470 475 480cac aac ggc aac gag ctg gta aag gcg gtg
gcg gcg gcc aac aag aac 1547His Asn Gly Asn Glu Leu Val Lys Ala Val
Ala Ala Ala Asn Lys Asn 485 490 495acc atc gtc gtc gtg cac agc gtc
ggg ccc atc atc ctc gag acc atc 1595Thr Ile Val Val Val His Ser Val
Gly Pro Ile Ile Leu Glu Thr Ile 500 505 510ctg gca acc gaa ggg gtc
aag gcg att gtg tgg gcc ggc ctg cca agc 1643Leu Ala Thr Glu Gly Val
Lys Ala Ile Val Trp Ala Gly Leu Pro Ser 515 520 525cag gag aat ggt
aac gcg ctg gtg gac atc ctg tac ggt ctg gcc tcg 1691Gln Glu Asn Gly
Asn Ala Leu Val Asp Ile Leu Tyr Gly Leu Ala Ser530 535 540 545ccg
tcg ggc aag ctg gtc tac acc atc gcc aag cga gaa cag gac tac 1739Pro
Ser Gly Lys Leu Val Tyr Thr Ile Ala Lys Arg Glu Gln Asp Tyr 550 555
560ggc act gcg gtc gtt cgc gga gac gac aca ttt ccg gag ggc ctg ttt
1787Gly Thr Ala Val Val Arg Gly Asp Asp Thr Phe Pro Glu Gly Leu Phe
565 570 575gtc gac tac cgg cac ttt gac aag gag aac atc gag ccg cgg
tac gag 1835Val Asp Tyr Arg His Phe Asp Lys Glu Asn Ile Glu Pro Arg
Tyr Glu 580 585 590ttt ggg ttt ggg ctg t gtaagttgtt ctctgtcata
aagagtgtcc ccgtccactc 1891Phe Gly Phe Gly Leu 595cgccatcttt
tttttttttt cagttttacc tcgtcctaac agcaagtgaa acaaactggc 1951ag ca
tac acc aac ttc acg tac gcc gac ctc gaa ctc acc tcg acc 1997 Ser
Tyr Thr Asn Phe Thr Tyr Ala Asp Leu Glu Leu Thr Ser Thr 600 605
610gcc acg gcc ggc cca gcg acg ggc gag acc atc ccc ggc ggc gcg gcc
2045Ala Thr Ala Gly Pro Ala Thr Gly Glu Thr Ile Pro Gly Gly Ala Ala
615 620 625gac ctc tgg gag gag gtg gcc acg gtc acg gcg acc atc acc
aac agc 2093Asp Leu Trp Glu Glu Val Ala Thr Val Thr Ala Thr Ile Thr
Asn Ser630 635 640 645ggc ggc gtg gac ggc gcc gag gtg gcc cag ctg
tac ctg acg ttg ccg 2141Gly Gly Val Asp Gly Ala Glu Val Ala Gln Leu
Tyr Leu Thr Leu Pro 650 655 660tcg tcg gcg ccg gcg acc ccg ccc aag
cag ctc cgc ggc ttc gcc aag 2189Ser Ser Ala Pro Ala Thr Pro Pro Lys
Gln Leu Arg Gly Phe Ala Lys 665 670 675ctc aag ctg gcg gcc ggc gcc
agc ggg acc gcg acg ttt agt ctg cgc 2237Leu Lys Leu Ala Ala Gly Ala
Ser Gly Thr Ala Thr Phe Ser Leu Arg 680 685 690agg cgc gac ctc agc
tac tgg gac acc ggc cgc ggg cag tgg gtg gtg 2285Arg Arg Asp Leu Ser
Tyr Trp Asp Thr Gly Arg Gly Gln Trp Val Val 695 700 705ccg gag ggc
gag ttt ggc gtc tcg gtc ggg gcg agc tcg cgc gat atc 2333Pro Glu Gly
Glu Phe Gly Val Ser Val Gly Ala Ser Ser Arg Asp Ile710 715 720
725cgg ctg acg ggg agc ttc cga gta tgaaaggaaa tatcctgggg cggatagttt
2387Arg Leu Thr Gly Ser Phe Arg Val 730ctgcttcttc gatggagtga
actatatata ggtgcgcccg gacgccagag atatggacaa 2447taac
245192733PRTChrysosporium lucknowense 92Met Thr Leu Gln Ala Phe Ala
Leu Leu Ala Ala Ala Ala Leu Val Arg1 5 10 15Gly Glu Thr Pro Thr Lys
Val Pro Arg Asp Ala Pro Arg Gly Ala Ala 20 25 30Ala Trp Glu Ala Ala
His Ser Ser Ala Ala Ala Ala Leu Gly Lys Leu 35 40 45Ser Gln Gln Asp
Lys Ile Asn Ile Val Thr Gly Val Gly Trp Asn Lys 50 55 60Gly Pro Cys
Val Gly Asn Thr Pro Ala Ile Ser Ser Ile Asn Tyr Pro65 70 75 80Gln
Leu Cys Leu Gln Asp Gly Pro Leu Gly Val Arg Phe Gly Ser Ser 85 90
95Ile Thr Ala Phe Thr Pro Gly Ile Gln Ala Ala Ser Thr Trp Asp Val
100 105 110Asp Leu Ile Arg Gln Arg Gly Glu Tyr Met Gly Ala Glu Phe
Lys Gly 115 120 125Cys Gly Ile His Val Gln Leu Gly Pro Val Ala Gly
Pro Leu Gly Lys 130 135 140Val Pro Gln Gly Gly Arg Asn Trp Glu Gly
Phe Gly Val Asp Pro Tyr145 150 155 160Leu Thr Gly Ile Ala Met Ala
Glu Thr Ile Glu Gly Ile Gln Ser Ala 165 170 175Gly Val Gln Ala Thr
Ala Lys His Tyr Ile Leu Asn Glu Gln Glu Leu 180 185 190Asn Arg Glu
Thr Met Ser Ser Asn Val Asp Asp Arg Thr Leu His Glu 195 200 205Leu
Tyr Leu Trp Pro Phe Ala Asp Ala Val His Ser Asn Val Ala Ser 210 215
220Val Met Cys Ser Tyr Asn Lys Ile Asn Gly Thr Trp Ala Cys Glu
Asn225 230 235 240Asp Arg Val Leu Asn Val Ile Leu Lys Gln Glu Leu
Gly Phe Pro Gly 245 250 255Tyr Val Met Ser Asp Trp Asn Ala Gln His
Ser Thr Asp Asp Ala Ala 260 265 270Asn His Gly Met Asp Met Thr Met
Pro Gly Ser Asp Phe Asn Gly Gly 275 280 285Thr Ile Leu Trp Gly Pro
Gln Leu Asp Ser Ala Val Asn Ser Gly Arg 290 295 300Val Pro Lys Ser
Arg Leu Asp Asp Met Val Glu Arg Ile Leu Ala Ala305 310 315 320Trp
Tyr Leu Leu Gly Gln Asp Ser Asn Tyr Pro Ala Ile Asn Ile Gly 325 330
335Ala Asn Val Gln Gly Asn His Lys Glu Asn Val Arg Ala Val Ala Arg
340 345 350Asp Gly Ile Val Leu Leu Lys Asn Asp Asp Gly Ile Leu Pro
Leu Lys 355 360 365Lys Pro Ala Lys Leu Ala Leu Ile Gly Ser Ala Ala
Val Val Asn Pro 370 375 380Gln Gly Leu Asn Ser Cys Gln Asp Gln Gly
Cys Asn Lys Gly Ala Leu385 390 395 400Gly Met Gly Trp Gly Ser Gly
Ala Val Asn Tyr Pro Tyr Phe Val Ala 405 410 415Pro Tyr Asp Ala Leu
Lys Ala Arg Ala Gln Glu Asp Gly Thr Thr Val 420 425 430Ser Leu His
Asn Ser Asp Ser Thr Ser Gly Val Ala Asn Val Ala Ser 435 440 445Asp
Ala Asp Ala Ala Ile Val Val Ile Thr Ala Asp Ser Gly Glu Gly 450 455
460Tyr Ile Thr Val Glu Gly Ala Ala Gly Asp Arg Leu Asn Leu Asp
Pro465 470 475 480Trp His Asn Gly Asn Glu Leu Val Lys Ala Val Ala
Ala Ala Asn Lys 485 490 495Asn Thr Ile Val Val Val His Ser Val Gly
Pro Ile Ile Leu Glu Thr 500 505 510Ile Leu Ala Thr Glu Gly Val Lys
Ala Ile Val Trp Ala Gly Leu Pro 515 520
525Ser Gln Glu Asn Gly Asn Ala Leu Val Asp Ile Leu Tyr Gly Leu Ala
530 535 540Ser Pro Ser Gly Lys Leu Val Tyr Thr Ile Ala Lys Arg Glu
Gln Asp545 550 555 560Tyr Gly Thr Ala Val Val Arg Gly Asp Asp Thr
Phe Pro Glu Gly Leu 565 570 575Phe Val Asp Tyr Arg His Phe Asp Lys
Glu Asn Ile Glu Pro Arg Tyr 580 585 590Glu Phe Gly Phe Gly Leu Ser
Tyr Thr Asn Phe Thr Tyr Ala Asp Leu 595 600 605Glu Leu Thr Ser Thr
Ala Thr Ala Gly Pro Ala Thr Gly Glu Thr Ile 610 615 620Pro Gly Gly
Ala Ala Asp Leu Trp Glu Glu Val Ala Thr Val Thr Ala625 630 635
640Thr Ile Thr Asn Ser Gly Gly Val Asp Gly Ala Glu Val Ala Gln Leu
645 650 655Tyr Leu Thr Leu Pro Ser Ser Ala Pro Ala Thr Pro Pro Lys
Gln Leu 660 665 670Arg Gly Phe Ala Lys Leu Lys Leu Ala Ala Gly Ala
Ser Gly Thr Ala 675 680 685Thr Phe Ser Leu Arg Arg Arg Asp Leu Ser
Tyr Trp Asp Thr Gly Arg 690 695 700Gly Gln Trp Val Val Pro Glu Gly
Glu Phe Gly Val Ser Val Gly Ala705 710 715 720Ser Ser Arg Asp Ile
Arg Leu Thr Gly Ser Phe Arg Val 725 730932199DNAChrysosporium
lucknowense 93atgacccttc aagcctttgc gctgctggcg gcggcggccc
ttgtacgggg cgagaccccg 60accaaggtcc ctcgcgacgc tccgagagga gctgctgctt
gggaagctgc ccactcctca 120gcagccgctg ccttgggaaa gctgtcccag
caggacaaga tcaacatcgt gacgggcgtc 180ggctggaaca aggggccctg
cgtgggcaat actcccgcta tcagctccat caactacccg 240cagctctgcc
tccaagacgg acccctgggc gtccgcttcg gttccagcat caccgctttc
300actccaggca ttcaggccgc gtcgacgtgg gatgtggacc tgatccgaca
acgaggcgag 360tacatgggcg ccgagttcaa ggggtgtggc atccacgtcc
agctaggccc cgtggccggg 420cccttgggta aggttccgca aggcggccgc
aactgggagg gctttggcgt ggatccctac 480ctcaccggca ttgccatggc
cgagaccatc gaggggatac agtcagcggg cgtgcaagcc 540accgccaagc
attacatcct caacgagcag gagctcaacc gcgagaccat gagcagcaat
600gtcgacgacc gcaccttgca cgagctgtac ctctggcctt ttgcagacgc
tgttcactct 660aacgtggcca gcgtcatgtg cagttacaac aagatcaacg
gcacgtgggc gtgcgagaac 720gaccgggtgc tgaacgtcat tctgaagcag
gagcttgggt ttccgggcta tgtgatgagc 780gactggaacg cgcagcactc
gaccgacgac gcggccaacc acggcatgga catgacgatg 840cccggcagcg
acttcaacgg agggacgatc ctctggggcc cgcagctcga cagcgccgtc
900aacagcggcc gggtgcccaa gtcccgcctc gacgacatgg tcgagcgcat
cctcgcggcg 960tggtaccttc tcggtcagga cagcaattac ccggctatca
acatcggcgc caacgtacag 1020ggcaaccaca aggagaacgt gcgggcggtc
gcgcgcgatg gcatcgtgct cctcaagaac 1080gacgacggca tcctcccgct
caagaagccc gccaagctcg ccctcatcgg ttcggccgcc 1140gtcgtcaacc
cgcagggact gaactcctgc caggatcagg gctgcaacaa aggcgcgctg
1200ggcatgggat gggggtccgg ggcggtcaac tacccgtact ttgtcgcgcc
gtacgacgcg 1260ctcaaggccc gcgcccaaga ggacgggacg acggtcagcc
tgcacaactc ggacagcacg 1320agcggggtgg cgaacgtggc ctcggacgcg
gacgcggcca tcgtggtcat cacggccgac 1380tcgggcgagg gttacatcac
agtcgaaggc gccgccgggg accggctgaa cctcgacccg 1440tggcacaacg
gcaacgagct ggtaaaggcg gtggcggcgg ccaacaagaa caccatcgtc
1500gtcgtgcaca gcgtcgggcc catcatcctc gagaccatcc tggcaaccga
aggggtcaag 1560gcgattgtgt gggccggcct gccaagccag gagaatggta
acgcgctggt ggacatcctg 1620tacggtctgg cctcgccgtc gggcaagctg
gtctacacca tcgccaagcg agaacaggac 1680tacggcactg cggtcgttcg
cggagacgac acatttccgg agggcctgtt tgtcgactac 1740cggcactttg
acaaggagaa catcgagccg cggtacgagt ttgggtttgg gctgtcatac
1800accaacttca cgtacgccga cctcgaactc acctcgaccg ccacggccgg
cccagcgacg 1860ggcgagacca tccccggcgg cgcggccgac ctctgggagg
aggtggccac ggtcacggcg 1920accatcacca acagcggcgg cgtggacggc
gccgaggtgg cccagctgta cctgacgttg 1980ccgtcgtcgg cgccggcgac
cccgcccaag cagctccgcg gcttcgccaa gctcaagctg 2040gcggccggcg
ccagcgggac cgcgacgttt agtctgcgca ggcgcgacct cagctactgg
2100gacaccggcc gcgggcagtg ggtggtgccg gagggcgagt ttggcgtctc
ggtcggggcg 2160agctcgcgcg atatccggct gacggggagc ttccgagta
219994389PRTChrysosporium lucknowense 94Met Lys Ser Ser Ile Leu Ala
Ser Val Phe Ala Thr Gly Ala Val Ala1 5 10 15Gln Ser Gly Pro Trp Gln
Gln Cys Gly Gly Ile Gly Trp Gln Gly Ser 20 25 30Thr Asp Cys Val Ser
Gly Tyr His Cys Val Tyr Gln Asn Asp Trp Tyr 35 40 45Ser Gln Cys Val
Pro Gly Ala Ala Ser Thr Thr Leu Gln Thr Ser Thr 50 55 60Thr Ser Arg
Pro Thr Ala Thr Ser Thr Ala Pro Pro Ser Ser Thr Thr65 70 75 80Ser
Pro Ser Lys Gly Lys Leu Lys Trp Leu Gly Ser Asn Glu Ser Gly 85 90
95Ala Glu Phe Gly Glu Gly Asn Tyr Pro Gly Leu Trp Gly Lys His Phe
100 105 110Ile Phe Pro Ser Thr Ser Ala Ile Gln Thr Leu Ile Asn Asp
Gly Tyr 115 120 125Asn Ile Phe Arg Ile Asp Phe Ser Met Glu Arg Leu
Val Pro Asn Gln 130 135 140Leu Thr Ser Ser Phe Asp Glu Gly Tyr Leu
Arg Asn Leu Thr Glu Val145 150 155 160Val Asn Phe Val Thr Asn Ala
Gly Lys Tyr Ala Val Leu Asp Pro His 165 170 175Asn Tyr Gly Arg Tyr
Tyr Gly Asn Val Ile Thr Asp Thr Asn Ala Phe 180 185 190Arg Thr Phe
Trp Thr Asn Leu Ala Lys Gln Phe Ala Ser Asn Ser Leu 195 200 205Val
Ile Phe Asp Thr Asn Asn Glu Tyr Asn Thr Met Asp Gln Thr Leu 210 215
220Val Leu Asn Leu Asn Gln Ala Ala Ile Asp Gly Ile Arg Ala Ala
Gly225 230 235 240Ala Thr Ser Gln Tyr Ile Phe Val Glu Gly Asn Ala
Trp Ser Gly Ala 245 250 255Trp Ser Trp Asn Thr Thr Asn Thr Asn Met
Ala Ala Leu Thr Asp Pro 260 265 270Gln Asn Lys Ile Val Tyr Glu Met
His Gln Tyr Leu Asp Ser Asp Ser 275 280 285Ser Gly Thr His Ala Glu
Cys Val Ser Ser Asn Ile Gly Ala Gln Arg 290 295 300Val Val Gly Ala
Thr Gln Trp Leu Arg Ala Asn Gly Lys Leu Gly Val305 310 315 320Leu
Gly Glu Phe Ala Gly Gly Ala Asn Ala Val Cys Gln Gln Ala Val 325 330
335Thr Gly Leu Leu Asp His Leu Gln Asp Asn Ser Glu Val Trp Leu Gly
340 345 350Ala Leu Trp Trp Ala Ala Gly Pro Trp Trp Gly Asp Tyr Met
Tyr Ser 355 360 365Phe Glu Pro Pro Ser Gly Thr Gly Tyr Val Asn Tyr
Asn Ser Ile Leu 370 375 380Lys Lys Tyr Leu Pro385
* * * * *
References