U.S. patent application number 15/007622 was filed with the patent office on 2017-04-20 for surface display of whole antibodies in eukaryotes.
This patent application is currently assigned to Merck Sharp & Dohme Corp.. The applicant listed for this patent is Merck Sharp & Dohme Corp.. Invention is credited to Piotr Bobrowicz, Bianka Prinz, Natarajan Sethuraman, Stefan Wildt, Dongxing Zha.
Application Number | 20170107509 15/007622 |
Document ID | / |
Family ID | 41202611 |
Filed Date | 2017-04-20 |
United States Patent
Application |
20170107509 |
Kind Code |
A1 |
Prinz; Bianka ; et
al. |
April 20, 2017 |
SURFACE DISPLAY OF WHOLE ANTIBODIES IN EUKARYOTES
Abstract
Methods for display of recombinant whole immunoglobulins or
immunoglobulin libraries on the surface of eukaryote host cells,
including yeast and filamentous fungi, are described. The methods
are useful for screening libraries of recombinant immunoglobulins
in eukaryote host cells to identify immunoglobulins that are
specific for an antigen of interest.
Inventors: |
Prinz; Bianka; (Lebanon,
NH) ; Sethuraman; Natarajan; (Hanover, NH) ;
Zha; Dongxing; (Houston, TX) ; Wildt; Stefan;
(Somerville, MA) ; Bobrowicz; Piotr; (Hanover,
NH) |
|
Applicant: |
Name |
City |
State |
Country |
Type |
Merck Sharp & Dohme Corp. |
Rahway |
NJ |
US |
|
|
Assignee: |
Merck Sharp & Dohme
Corp.
Rahway
NJ
|
Family ID: |
41202611 |
Appl. No.: |
15/007622 |
Filed: |
January 27, 2016 |
Related U.S. Patent Documents
|
|
|
|
|
|
Application
Number |
Filing Date |
Patent Number |
|
|
13251410 |
Oct 3, 2011 |
9260712 |
|
|
15007622 |
|
|
|
|
12489900 |
Jun 23, 2009 |
8067339 |
|
|
13251410 |
|
|
|
|
Current U.S.
Class: |
1/1 |
Current CPC
Class: |
C07K 16/00 20130101;
G01N 33/6854 20130101; C12N 15/1034 20130101; C07K 16/2887
20130101; C07K 16/32 20130101; C12N 15/1037 20130101; C07K 2319/30
20130101 |
International
Class: |
C12N 15/10 20060101
C12N015/10; C07K 16/32 20060101 C07K016/32; C07K 16/28 20060101
C07K016/28; C07K 16/00 20060101 C07K016/00 |
Claims
1. A method for producing eukaryotic host cells that express an
immunoglobulin of interest, comprising: (a) providing host cells
that include a first nucleic acid molecule encoding a capture
moiety comprising a cell surface anchoring protein fused to a
binding moiety that is capable of specifically binding an
immunoglobulin operably linked to a first regulatable promoter; (b)
transfecting the host cells with a plurality of nucleic acid
molecules encoding a genetically diverse population of heavy and
light chains of an immunoglobulin wherein at least one of the heavy
or light chain encoding nucleic acid molecules is operably linked
to a second regulatable promoter to produce a plurality of
genetically diverse host cells capable of displaying an
immunoglobulin on the surface thereof; (c) inducing expression of
the nucleic acid molecule encoding the capture moiety for a time
sufficient to produce the capture moiety on the surface of the host
cell; and (d) inhibiting expression of the nucleic acid molecule
encoding the capture moiety and inducing expression of the nucleic
acid molecules encoding the immunoglobulins in the host cells to
produce the host cells that express the immunoglobulin of interest
displayed.
2. The method of claim 1 wherein, the method further includes (e)
contacting the host cells with a detection means that specifically
binds to the immunoglobulin of interest displayed on the cell
surface of the host cells; and (f) isolating host cells in which
the detection means is bound to produce the host cells that express
the immunoglobulin of interest.
3. The method of claim 1, wherein the binding moiety binds the Fc
region of the immunoglobulin.
4. The method of claim 1, wherein the binding moiety is selected
from the group consisting of protein A, protein A ZZ domain,
protein G, and protein L.
5. The method of claim 1, wherein the cell surface anchoring
protein is a GPI protein.
6. A method for producing eukaryotic host cells that express an
immunoglobulin of interest, comprising: (a) providing a host cell
that includes a first nucleic acid molecule encoding a capture
moiety comprising a cell surface anchoring protein fused to a
binding moiety that is capable of specifically binding an
immunoglobulin operably linked to a first regulatable promoter; (b)
transfecting the host cells with a one or more nucleic acid
molecules encoding the heavy and light chains of an immunoglobulin
wherein at least one of the heavy or light chain encoding nucleic
acid molecules is operably linked to a second regulatable promoter
to generate a plurality of host cells encoding a variegated
population of mutants of the immunoglobulins; (c) inducing
expression of the capture moiety for a time sufficient to produce
the capture moiety on the surface of the host cells; and (d)
inhibiting expression of the capture moiety and inducing expression
of the variegated population of mutants of the immunoglobulin in
the host cells to produce the host cells that express the
immunoglobulin of interest.
7. The method of claim 6 wherein the method further includes (e)
contacting the host cells with a detection means that binds to the
immunoglobulin of interest to identify host cells that display the
immunoglobulin of interest on the surface thereof; and (f)
isolating the host cells that display the immunoglobulin of
interest on the surface of thereof to produce the host cells that
express the immunoglobulin of interest.
8. The method of claim 6, wherein the binding moiety binds the Fc
region of the immunoglobulin.
9. The method of claim 6, wherein the binding moiety is selected
from the group consisting of protein A, protein A ZZ domain,
protein G, and protein L.
10. The method of claim 1, wherein the cell surface anchoring
protein is a GPI protein.
11. A method of producing eukaryote host cells that produce an
immunoglobulin having a VH domain and a VL domain and having an
antigen binding site with binding specificity for an antigen of
interest, the method comprising: (a) providing a library of
eukaryote host cells displaying on their surface an immunoglobulin
comprising a VH domain and a VL domain, wherein the library is
created by: (i) providing eukaryote host cells that express a
capture moiety comprising a cell surface anchoring protein fused to
a moiety capable of binding to an immunoglobulin wherein expression
of the capture moiety is effected by a first regulatable promoter;
and (ii) transfecting the host cells with a library of nucleic acid
molecules encoding a genetically diverse population of
immunoglobulins, wherein the VH domains of the genetically diverse
population of immunoglobulins are biased for one or more VH gene
families and wherein expression of at least one of the heavy or
light chains of the immunoglobulins is effected by a second
regulatable promoter to produce a plurality of host cells, each
expressing an immunoglobulin; (b) inducing expression of the
capture moiety in the host cells for a time sufficient to produce
the capture moiety on the surface of the host cells; and (c)
inhibiting expression of the capture moiety and inducing expression
of the library of nucleic acid molecule sequences in the host
cells, whereby each host cell displays an immunoglobulin at the
surface thereof to produce the host cells that produce the
immunoglobulin having a VH domain and a VL domain and having the
antigen binding site with binding specificity for the antigen of
interest.
12. The method of claim 11, wherein the immunoglobulin comprises a
synthetic human immunoglobulin VH domain and a synthetic human
immunoglobulin VL domain and wherein the synthetic human
immunoglobulin VH domain and the synthetic human immunoglobulin VL
domain comprise framework regions and hypervariable loops, wherein
the framework regions and first two hypervariable loops of both the
VH domain and VL domain are essentially human germ line, and
wherein the VH domain and VL domain have altered CDR3 loops.
13. The method of claim 12, wherein in addition to having altered
CDR3 loops the human synthetic immunoglobulin VH and VL domains
contain mutations in other CDR loops.
14. The method of claim 12, wherein each human synthetic
immunoglobulin VH domain CDR loop is of random sequence.
15. The method of claim 12, wherein human synthetic immunoglobulin
VH domain CDR loops are of known canonical structures and
incorporate random sequence elements.
16. The method of claim 11 wherein the method further includes (d)
identifying host cells in the plurality of host cells that display
immunoglobulins thereon that has a binding specificity for the
antigen of interest by contacting the plurality of host cells with
the antigen of interest and detecting the host cells that have the
antigen of interest bound to the immunoglobulin displayed thereon
to produce the host cells that produce the immunoglobulin having a
VH domain and a VL domain and having the antigen binding site with
binding specificity for the antigen of interest.
17. The method of claim 11, wherein the antibody is selected from
the group consisting of IgG, IgA, IgM, and IgE.
18. The method of claim 11, wherein the binding moiety binds to the
Fc region of the immunoglobulin.
19. The method of claim 11, wherein the binding moiety is selected
from the group consisting of protein A, protein A ZZ domain,
protein G, and protein L.
20. The method of claim 11, wherein the cell surface anchoring
protein is a GPI protein.
21. An immunoglobulin produced comprising the method of claim 1, 6,
or 11 and derivatives thereof.
22. A eukaryote host cell that expresses the immunoglobulin of
claim 21.
23. A host cell that expresses an immunoglobulin that had been
identified using the method of claim 1, 6, or 11.
24. A eukaryote host cell comprising a nucleic acid molecule
encoding a capture moiety comprising a cell surface anchoring
protein fused to a binding moiety capable of binding an
immunoglobulin operably linked to a regulatable promoter and one or
more nucleic acid molecules encoding the heavy and light chains of
immunoglobulins, wherein at least one of the nucleic acid molecules
encoding the heavy or light chains is operably linked to a second
regulatable promoter.
Description
[0001] This is a divisional of U.S. patent application Ser. No.
13/251,410, filed Oct. 3, 5 2011; which is a divisional of U.S.
patent application Ser. No. 12/489,900, filed Jun. 23, 2009, now
U.S. Pat. No. 8,067,339; which claims the benefit of U.S.
provisional patent application No. 61/208,583, filed Feb. 25, 2009
and U.S. provisional patent application No. 61/134,331, filed Jul.
9, 2008; each of which is herein incorporated by referenced in its
entirety.
BACKGROUND OF THE INVENTION
[0002] (1) Field of the Invention
[0003] The present invention relates to methods for display of
whole immunoglobulins or libraries of immunoglobulins on the
surface of eukaryote host cells, including mammalian, plant, yeast,
and filamentous fungal cells. The methods are useful for screening
libraries of eukaryotic host cells that produce recombinant
immunoglobulins to identify particular immunoglobulins with desired
properties. The methods are particularly useful for screening
immunoglobulin libraries in eukaryote host cells to identify host
cells that express an immunoglobulin of interest at high levels, as
well as host cells that express immunoglobulins that have high
affinity for specific antigens.
[0004] (2) Description of Related Art
[0005] The discovery of monoclonal antibodies has evolved from
hybridoma technology for producing the antibodies to direct
selection of antibodies from human cDNA or synthetic DNA libraries.
This has been driven in part by the desire to engineer improvements
in binding affinity and specificity of the antibodies to improve
efficacy of the antibodies. Thus, combinatorial library screening
and selection methods have become a common tool for altering the
recognition properties of proteins (Ellman et al., Proc. Natl.
Acad. Sci. USA 94: 2779-2782 (1997): Phizicky & Fields,
Microbiol. Rev. 59: 94-123 (1995)). The ability to construct and
screen antibody libraries in vitro promises improved control over
the strength and specificity of antibody-antigen interactions.
[0006] The most widespread technique for constructing and screening
antibody libraries is phage display, whereby the protein of
interest is expressed as a polypeptide fusion to a bacteriophage
coat protein and subsequently screened by binding to immobilized or
soluble biotinylated ligand. Fusions are made most commonly to a
minor coat protein, called the gene III protein (pIII), which is
present in three to five copies at the tip of the phage. A phage
constructed in this way can be considered a compact genetic "unit",
possessing both the phenotype (binding activity of the displayed
antibody) and genotype (the gene coding for that antibody) in one
package. Phage display has been successfully applied to antibodies,
DNA binding proteins, protease inhibitors, short peptides, and
enzymes (Choo & Klug, Curr. Opin. Biotechnol. 6: 431-436
(1995); Hoogenboom, Trends Biotechnol. 15: 62-70 (1997); Ladner,
Trends Biotechnol. 13: 426-430 (1995); Lowman et al., Biochemistry
30: 10832-10838 (1991); Markland et al., Methods Enzymol. 267:
28-51 (1996); Matthews & Wells, Science 260: 1113-1117 (1993);
Wang et al., Methods Enzymol. 267: 52-68 (1996)).
[0007] Antibodies possessing desirable binding properties are
selected by binding to immobilized antigen in a process called
"panning". Phage bearing nonspecific antibodies are removed by
washing, and then the bound phage are eluted and amplified by
infection of E. coli. This approach has been applied to generate
antibodies against many antigens.
[0008] Nevertheless, phage display possesses several shortcomings.
Although panning of antibody phage display libraries is a powerful
technology, it possesses several intrinsic difficulties that limit
its wide-spread successful application. For example, some
eukaryotic secreted proteins and cell surface proteins require
post-translational modifications such as glycosylation or extensive
disulfide isomerization, which are unavailable in bacterial cells.
Furthermore, the nature of phage display precludes quantitative and
direct discrimination of ligand binding parameters. For example,
very high affinity antibodies (Kd.ltoreq.1 nM) are difficult to
isolate by panning, since the elution conditions required to break
a very strong antibody-antigen interaction are generally harsh
enough (e.g., low pH, high salt) to denature the phage particle
sufficiently to render it non-infective.
[0009] Additionally, the requirement for physical immobilization of
an antigen to a solid surface produces many artifactual
difficulties. For example, high antigen surface density introduces
avidity effects which mask true affinity. Also, physical tethering
reduces the translational and rotational entropy of the antigen,
resulting in a smaller DS upon antibody binding and a resultant
overestimate of binding affinity relative to that for soluble
antigen and large effects from variability in mixing and washing
procedures lead to difficulties with reproducibility. Furthermore,
the presence of only one to a few antibodies per phage particle
introduces substantial stochastic variation, and discrimination
between antibodies of similar affinity becomes impossible. For
example, affinity differences of six-fold or greater are often
required for efficient discrimination (Riechmann & Weill,
Biochem. 32: 8848-55 (1993)). Finally, populations can be overtaken
by more rapidly growing wild-type phage. In particular, since pIII
is involved directly in the phage life cycle, the presence of some
antibodies or bound antigens will prevent or retard amplification
of the associated phage.
[0010] Additional bacterial cell surface display methods have been
developed (Francisco, et al., Proc. Natl. Acad. Sci. USA 90:
10444-10448 (1993); Georgiou et al., Nat. Biotechnol. 15: 29-34
(1997)). However, use of a prokaryotic expression system
occasionally introduces unpredictable expression biases (Knappik
& Pluckthun, Prot. Eng. 8: 81-89 (1995); Ulrich et al., Proc.
Natl. Acad. Sci. USA 92: 11907-11911 (1995); Walker & Gilbert,
J. Biol. Chem 269: 28487-28493 (1994)) and bacterial capsular
polysaccharide layers present a diffusion barrier that restricts
such systems to small molecule ligands (Roberts, Annu. Rev.
Microbiol. 50: 285-315 (1996)). E. coli possesses a
lipopolysaccharide layer or capsule that may interfere sterically
with macromolecular binding reactions. In fact, a presumed
physiological function of the bacterial capsule is restriction of
macromolecular diffusion to the cell membrane, in order to shield
the cell from the immune system (DiRienzo et al., Ann. Rev.
Biochem. 47: 481-532, (1978)). Since the periplasm of E. coli has
not evolved as a compartment for the folding and assembly of
antibody fragments, expression of antibodies in E. coli has
typically been very clone dependent, with some clones expressing
well and others not at all. Such variability introduces concerns
about equivalent representation of all possible sequences in an
antibody library expressed on the surface of E. coli. Moreover,
phage display does not allow some important posttranslational
modifications such as glycosylation that can affect specificity or
affinity of the antibody. About a third of circulating monoclonal
antibodies contain one or more N-linked glycans in the variable
regions. In some cases it is believed that these N-glycans in the
variable region may play a significant role in antibody
function.
[0011] The efficient production of monoclonal antibody therapeutics
would be facilitated by the development of alternative test systems
that utilize lower eukaryotic cells, such as yeast cells. The
structural similarities between B-cells displaying antibodies and
yeast cells displaying antibodies provide a closer analogy to in
vivo affinity maturation than is available with filamentous phage.
In particular, because lower eukaryotic cells are able to produce
glycosylated proteins, whereas filamentous phage cannot, monoclonal
antibodies produced in lower eukaryotic host cells are more likely
to exhibit similar activity in humans and other mammals as they do
in test systems which utilize lower eukaryotic host cells.
[0012] Moreover, the ease of growth culture and facility of genetic
manipulation available with yeast will enable large populations to
be mutagenized and screened rapidly. By contrast with conditions in
the mammalian body, the physicochemical conditions of binding and
selection can be altered for a yeast culture within a broad range
of pH, temperature, and ionic strength to provide additional
degrees of freedom in antibody engineering experiments. The
development of yeast surface display system for screening
combinatorial protein libraries has been described.
[0013] U.S. Pat. Nos. 6,300,065 and 6,699,658 describe the
development of a yeast surface display system for screening
combinatorial antibody libraries and a screen based on
antibody-antigen dissociation kinetics. The system relies on
transfecting yeast with vectors that express an antibody or
antibody fragment fused to a yeast cell wall protein, using
mutagenesis to produce a variegated population of mutants of the
antibody or antibody fragment and then screening and selecting
those cells that produce the antibody or antibody fragment with the
desired enhanced phenotypic properties. U.S. Pat. No. 7,132,273
discloses various yeast cell wall anchor proteins and a surface
expression system that uses them to immobilize foreign enzymes or
polypeptides on the cell wall.
[0014] Of interest are Tanino et al, Biotechnol. Prog. 22: 989-993
(2006), which discloses construction of a Pichia pastoris cell
surface display system using Flo1p anchor system; Ren et al.,
Molec. Biotechnol. 35:103-108 (2007), which discloses the display
of adenoregulin in a Pichia pastoris cell surface display system
using the Flo1p anchor system; Mergler et al., Appl. Microbiol.
Biotechnol. 63:418-421 (2004), which discloses display of K. lactis
yellow enzyme fused to the C-terminus half of S. cerevisiae
.alpha.-agglutinin; Jacobs et al., Abstract T23, Pichia Protein
expression Conference, San Diego, Calif. (Oct. 8-11, 2006), which
discloses display of proteins on the surface of Pichia pastoris
using .alpha.-agglutinin; Ryckaert et al., Abstracts BVBMB Meeting,
Vrije Universiteit Brussel, Belgium (Dec. 2, 2005), which discloses
using a yeast display system to identify proteins that bind
particular lectins; U.S. Pat. No. 7,166,423, which discloses a
method for identifying cells based on the product secreted by the
cells by coupling to the cell surface a capture moiety that binds
the secreted product, which can then be identified using a
detection means; U.S. Published Application No. 2004/0219611, which
discloses a biotin-avidin system for attaching protein A or G to
the surface of a cell for identifying cells that express particular
antibodies; U.S. Pat. No. 6,919,183, which discloses a method for
identifying cells that express a particular protein by expressing
in the cell a surface capture moiety and the protein wherein the
capture moiety and the protein form a complex which is displayed on
the surface of the cell; U.S. Pat. No. 6,114,147, which discloses a
method for immobilizing proteins on the surface of a yeast or
fungal using a fusion protein consisting of a binding protein fused
to a cell wall protein which is expressed in the cell.
[0015] The potential applications of engineering antibodies for the
diagnosis and treatment of human disease such as cancer therapy,
tumor imaging, sepsis are far-reaching. For these applications,
antibodies with high affinity (i.e., Kd.ltoreq.10 nM) and high
specificity are highly desirable. Anecdotal evidence, as well as
the a priori considerations discussed previously, suggests that
phage display or bacterial display systems are unlikely to
consistently produce antibodies of sub-nanomolar affinity. Also,
antibodies identified using phage display or bacterial display
systems may not be susceptible to commercial scale production in
eukaryotic cells. To date, no system has been developed which can
accomplish such purpose, and be used.
[0016] Therefore, development of further protein expression systems
based on improved vectors and host cell lines in which effective
protein display facilitates development of genetically enhanced
cells for recombinant production of immunoglobulins is a desirable
objective.
BRIEF SUMMARY OF THE INVENTION
[0017] One of the most powerful applications of the display system
herein is its use in the arena of immunoglobulin engineering. It
has been shown that scFv antigen-binding units can be expressed on
the surface of lower eukaryote host cells with no apparent loss of
binding specificity and affinity (See for example, U.S. Pat. No.
6,300,065). It has also been shown that full-length antibodies can
be captured and bound to the surface of hybridomas and CHO cells,
for example (See U.S. Pat. Nos. 6,919,183 and 7,166,423). While
antibodies and fragments thereof to many diverse antigens have been
successfully isolated using phage display technology, there is
still a need for a robust display system for producing
immunoglobulins in eukaryotic host cells and in particular, lower
eukaryote host cells. It is particularly desirable to have a robust
display system for producing immunoglobulins that have human-like
glycosylation patterns. Genetically engineered eukaryote cells that
produce glycoproteins that have various human-like glycosylation
patterns have been described in U.S. Pat. No. 7,029,872 and for
example in Choi et al., Hamilton, et al., Science 313; 1441 1443
(2006); Wildt and Gerngross, Nature Rev. 3: 119-128 (2005);
Bobrowicz et al., GlycoBiol. 757-766 (2004); Li et al., Nature
Biotechnol. 24: 210-215 (2006); Chiba et al., J. Biol. Chem. 273:
26298-26304 (1998); and, Mara et al., Glycoconjugate J. 16: 99-107
(1999).
[0018] The methods disclosed herein are particularly suited for
this application because it allows presentation of a vast diverse
repertoire of full-sized immunoglobulins having particular
glycosylation patterns on the surface of the cell when the host
cells have been genetically engineered to have altered or modified
glycosylation pathways. In many respects the subject display system
mimics the natural immune system. Antigen-driven stimulation can be
achieved by selecting for high-affinity binders from a display
library of cloned antibody H and L chains. The large number of
chain permutations that occur during recombination of H and L chain
genes in developing B cells can be mimicked by shuffling the cloned
H and L chains as DNA, and protein and through the use of
site-specific recombination (Geoffory et al. Gene 151: 109-113
(1994)). The somatic mutation can also be matched by the
introduction of mutations in the CDR regions of the H and L
chains.
[0019] Immunoglobulins with desired binding specificity or affinity
can be identified using a form of affinity selection known as
"panning" (Parmley & Smith, Gene 73:305-318 (1988)). The
library of immunoglobulins is first incubated with an antigen of
interest followed by the capture of the antigen with the bound
immunoglobulins. The immunoglobulins recovered in this manner can
then be amplified and again gain selected for binding to the
antigen, thus enriching for those immunoglobulins that bind the
antigen of interest. One or more rounds of selection will enable
isolation of antibodies or fragments thereof with the desired
specificity or avidity. Thus, rare host cells expressing a desired
antibody or fragment thereof can easily be selected from greater
than 10.sup.4 different individuals in one experiment. The primary
structure of the binding immunoglobulins is then deduced by
nucleotide sequence of the individual host cell clone. When human
VH and VL regions are employed in the displayed immunoglobulins,
the subject display systems allow selection of human
immunoglobulins without further manipulation of a non-human
immunoglobulins.
[0020] Therefore, in one embodiment, provided is a method for
producing eukaryotic host cells that express an immunoglobulin of
interest, comprising providing host cells that include a first
nucleic acid molecule encoding a capture moiety comprising a cell
surface anchoring protein fused to a binding moiety that is capable
of specifically binding an immunoglobulin operably linked to a
first regulatable promoter; transfecting the host cells with a
plurality of nucleic acid molecules encoding a genetically diverse
population of heavy and light chains of an immunoglobulin wherein
at least one of the heavy or light chain encoding nucleic acid
molecules is operably linked to a second regulatable promoter to
produce a plurality of genetically diverse host cells capable of
displaying an immunoglobulin on the surface thereof inducing
expression of the first nucleic acid molecule encoding the capture
moiety for a time sufficient to produce the capture moiety on the
surface of the host cells; and inhibiting expression of the first
nucleic acid molecule encoding the capture moiety and inducing
expression of the nucleic acid molecules encoding the
immunoglobulins in the host cells to produce the host cells, which
display the immunoglobulin of interest on the surface of the cells.
In further aspects, the method further includes contacting the host
cells with a detection means that specifically binds to the
immunoglobulin of interest displayed on the surface thereof and
isolating host cells in which the detection means is bound to
select the host cells that express the immunoglobulin of
interest.
[0021] In another embodiment, provided is a method for producing
eukaryotic host cells that express an immunoglobulin of interest
comprising providing a host cell that includes a first nucleic acid
molecule encoding a capture moiety comprising a cell surface
anchoring protein fused to a binding moiety that is capable of
specifically binding an immunoglobulin operably linked to a first
regulatable promoter; transfecting the host cell with one or more
second nucleic acid molecules encoding an immunoglobulin wherein
either the molecules encoding the light chain or the heavy chain
are operably linked to a second regulatable promoter, wherein
mutagenesis is used to generate a plurality of host cells encoding
a variegated population of mutants of the immunoglobulin; inducing
expression of the capture moiety for a time sufficient to produce
the capture moiety on the surface of the host cells; inhibiting
expression of the capture moiety and inducing expression of the
variegated population of mutants of the immunoglobulin in the host
cells; contacting the plurality of host cells with a detection
means that binds to the immunoglobulin of interest to identify host
cells in the plurality of host cells that display the
immunoglobulin of interest on the surface thereof. In further
embodiments, the method further includes isolating the host cells
that display the immunoglobulin of interest on the surface of
thereof to produce the host cells expressing the immunoglobulin of
interest.
[0022] In a further embodiment, provided is a method for producing
eukaryotic host cells that express an immunoglobulin of interest,
comprising: providing a host cell that includes a first nucleic
acid molecule encoding a capture moiety comprising a cell surface
anchoring protein fused to a binding moiety that is capable of
specifically binding an immunoglobulin operably linked to a first
regulatable promoter; transfecting the host cells with a one or
more nucleic acid molecules encoding the heavy and light chains of
an immunoglobulin wherein at least one of the heavy or light chain
encoding nucleic acid molecules is operably linked to a second
regulatable promoter to generate a plurality of host cells encoding
a variegated population of mutants of the immunoglobulins; inducing
expression of the capture moiety for a time sufficient to produce
the capture moiety on the surface of the host cells; and inhibiting
expression of the capture moiety and inducing expression of the
variegated population of mutants of the immunoglobulin in the host
cells to produce the host cells. In further embodiments, the method
further includes contacting the host cells with a detection means
that binds to the immunoglobulin of interest to identify host cells
that display the immunoglobulin of interest on the surface thereof;
and isolating the host cells that display the immunoglobulin of
interest on the surface of thereof to produce the host cells that
express the immunoglobulin of interest.
[0023] In a further embodiment, provided is a method for producing
eukaryotic host cells that express an immunoglobulin of interest,
comprising providing host cells that include a first nucleic acid
molecule encoding a capture moiety comprising a cell surface
anchoring protein fused to a binding moiety that is capable of
specifically binding an immunoglobulin operably linked to a first
regulatable promoter; transfecting the host cells with a plurality
of nucleic acid molecules comprising open reading frames (ORFs)
encoding a genetically diverse population of heavy and light chains
of an immunoglobulin wherein at least the ORFs encoding the heavy
chain are operably linked to a second regulatable promoter when the
capture moiety binds the heavy chain or at least the ORFs encoding
the light chain are operably linked to a second regulatable
promoter when the capture moiety binds the light chain to produce a
plurality of genetically diverse host cells capable of displaying
an immunoglobulin on the surface thereof; inducing expression of
the nucleic acid molecule encoding the capture moiety for a time
sufficient to produce the capture moiety on the surface of the host
cell; and inhibiting expression of the nucleic acid molecule
encoding the capture moiety and inducing expression of the nucleic
acid molecules encoding the immunoglobulins in the host cells to
produce the host cells. In further embodiments, the method further
includes contacting the host cells with a detection means that
specifically binds to the immunoglobulin of interest displayed on
the cell surface of the host cells; and isolating host cells in
which the detection means is bound to produce the host cells that
express the immunoglobulin of interest.
[0024] In a further embodiment, provided is a method of producing
eukaryote host cells that produce an immunoglobulin having a VH
domain and a VL domain and having an antigen binding site with
binding specificity for an antigen of interest, the method
comprising (a) providing a library of eukaryote host cells
displaying on their surface an immunoglobulin comprising a VH
domain and a VL domain, wherein the library is created by (i)
providing eukaryote host cells that express a capture moiety
comprising a cell surface anchoring protein fused to a moiety
capable of binding to an immunoglobulin wherein expression of the
capture moiety is effected by a first regulatable promoter; and
(ii) transfecting the host cells with a library of nucleic acid
molecules encoding a genetically diverse population of
immunoglobulins, wherein the VH domains of the genetically diverse
population of immunoglobulins are biased for one or more VH gene
families and wherein expression of at least one of the heavy or
light chains of the immunoglobulins is effected by a second
regulatable promoter to produce a plurality of host cells, each
expressing an immunoglobulin; (b) inducing expression of the
capture moiety in the host cells for a time sufficient to produce
the capture moiety on the surface of the host cells; (c) inhibiting
expression of the capture moiety and inducing expression of the
library of nucleic acid sequences in the host cells, whereby each
host cell displays an immunoglobulin at the surface thereof to
produce the host cells. In further embodiments, the method further
includes (d) identifying host cells in the plurality of host cells
that display immunoglobulins thereon that has a binding specificity
for the antigen of interest by contacting the plurality of host
cells with the antigen of interest and detecting the host cells
that have the antigen of interest bound to the immunoglobulin
displayed thereon to produce the host cells that produce the
immunoglobulin having a VH domain and a VL domain and having the
antigen binding site with binding specificity for the antigen of
interest.
[0025] In a further aspect of the above embodiment, the
immunoglobulin comprises a synthetic human immunoglobulin VH domain
and a synthetic human immunoglobulin VL domain and wherein the
synthetic human immunoglobulin VH domain and the synthetic human
immunoglobulin VL domain comprise framework regions and
hypervariable loops, wherein the framework regions and first two
hypervariable loops of both the VH domain and VL domain are
essentially human germ line, and wherein the VH domain and VL
domain have altered CDR3 loops. In a further aspect of the above
embodiment, in addition to having altered CDR3 loops the human
synthetic immunoglobulin VH and VL domains contain mutations in
other CDR loops. In a further still aspect of the above embodiment,
each human synthetic immunoglobulin VH domain CDR loop is of random
sequence, and in a further still aspect of the above embodiment,
the human synthetic immunoglobulin VH domain CDR loops are of known
canonical structures and incorporate random sequence elements.
[0026] In a further embodiment, provided is a eukaryote host cell
comprising a nucleic acid molecule encoding a capture moiety
comprising a cell surface anchoring protein fused to a binding
moiety capable of binding an immunoglobulin operably linked to a
regulatable promoter and one or more nucleic acid molecules
encoding the heavy and light chains of immunoglobulins, wherein at
least one of the nucleic acid molecules encoding the heavy or light
chains is operably linked to a second regulatable promoter. In
particular embodiments, the nucleic acid molecules encoding both
the heavy and light chains are operably linked to a second
regulatable promoter. In other embodiments, the nucleic acid
molecules encoding the heavy chains are operably linked to a second
regulatable promoter and the nucleic acid molecules encoding the
light chain are operably linked to a third regulatable promoter or
to a constitutive promoter. In other embodiments, the nucleic acid
molecules encoding the light chains are operably linked to a second
regulatable promoter and the nucleic acid molecules encoding the
heavy chain are operably linked to a third regulatable promoter or
to a constitutive promoter. In particular aspects, the heavy and
light chains are encoded by separate open reading frames (ORFs)
wherein each ORF is operably linked to a promoter. In other
aspects, the heavy and light chains are encoded by a single ORF,
which produces a single fusion polypeptide comprising the heavy and
light chains in a tandem orientation, and the ORF is operably
linked to a regulatable promoter. The single polypeptide is
cleavable between the heavy and light chains to produce separate
heavy and light chain proteins, which can then associate to form a
functional antibody molecule.
[0027] In various aspects of any one of the above embodiments or
aspects, the binding moiety that binds the immunoglobulin binds the
Fc region of the immunoglobulin. Examples of such binding moieties
include, but are not limited to those selected from the group
consisting of protein A, protein A ZZ domain, protein G, and
protein L and fragments thereof that retain the ability to bind to
the immunoglobulin. Examples of other binding moieties, include but
are not limited to, Fc receptor (FcR) proteins and
immunoglobulin-binding fragments thereof. The FCR proteins include
members of the Fc gamma receptor (Fc.gamma.R) family, which bind
gamma immunoglobulin (IgG), Fc epsilon receptor (Fc.epsilon.R)
family, which bind epsilon immunoglobulin (IgE), and Fc alpha
receptor (Fc.alpha.R) family, which bind alpha immunoglobulin
(IgA). Particular FcR proteins that bind IgG that can comprise the
binding moiety herein include at least the IgG binding region of
Fc.gamma.RI, Fc.gamma.RIIA, Fc.gamma.RIIB1, Fc.gamma.RIIB2,
Fc.gamma.RIIIA, Fc.gamma.RIIIB or Fc.gamma.Rn (neonatal).
[0028] In further aspects of any one of the above embodiments or
aspects, detection means is an antigen that is capable of being
bound by the immunoglobulin of interest. In particular aspects, the
antigen is conjugated to or labeled with a fluorescent moiety. In
other aspects, the detection means further includes a detection
immunoglobulin that is specific for the immunoglobulin-antigen
complex or is specific for another epitope on the antigen and it is
this detection immunoglobulin that is conjugated to or labeled with
a detection moiety such as a fluorescent moiety.
[0029] In further aspects of any one of the above embodiments or
aspects, the cell surface anchoring protein is a
Glycosylphosphatidylinositol-anchored (GPI) protein. In particular
aspects, the cell surface anchoring protein is selected from the
group consisting of .alpha.-agglutinin, Cwp1p, Cwp2p, Gas1p, Yap3p,
Flo1p, Crh2p, Pir1p, Pir4p, Sed1p, Tip1p, Wpip, Hpwp1p, Als3p, and
Rbt5p. In further aspects, the cell surface anchoring protein is
Sed1p.
[0030] The host cell that can be used includes both lower and high
eukaryote cells. Higher eukaryote cells include mammalian, insect,
and plant cells. In further aspects of any one of the above
embodiments or aspects, the eukaryote is a lower eukaryote. In
further aspects, the host cell is a yeast or filamentous fungi
cell, which in particular aspects is selected from the group
consisting of Pichia pastoris, Pichia finlandica, Pichia
trehalophila, Pichia koclamae, Pichia membranaefaciens, Pichia
minuta (Ogataea minuta, Pichia lindneri), Pichia opuntiae, Pichia
thermotolerans, Pichia salictaria, Pichia guercuum, Pichia pijperi,
Pichia stiptis, Pichia methanolica, Pichia sp., Saccharomyces
cerevisiae, Saccharomyces sp., Hansenula polymorpha, Kluyveromyces
sp., Kluyveromyces lactis, Candida albicans, Aspergillus nidulans,
Aspergillus niger, Aspergillus oryzae, Trichoderma reesei,
Chrysosporium lucknowense, Fusarium sp., Fusarium gramineum,
Fusarium venenatum and Neurospora crassa. In particular aspects,
the eukaryote is a yeast and in further aspects, the yeast is
Pichia pastoris. While the methods herein have been exemplified
using Pichia pastoris as the host cell, the methods herein can be
used in other lower eukaryote or higher eukaryote cells for the
same purposes disclosed herein.
[0031] In further aspects of any one of the aforementioned methods,
O-glycosylation of glycoproteins in the host cell is controlled.
That is, O-glycan occupancy and mannose chain length are reduced.
In lower eukaryote host cells such as yeast, O-glycosylation can be
controlled by deleting the genes encoding one or more protein
O-mannosyltransferases (Dol-P-Man:Protein (Ser/Thr) Mannosyl
Transferase genes) (PMTs) or by growing the host in a medium
containing one or more Pmtp inhibitors. In further aspects, the
host cell includes a deletion of one or more of the genes encoding
PMTs and the host cell is cultivated in a medium that includes one
or more Pmtp inhibitors. Pmtp inhibitors include but are not
limited to a benzylidene thiazolidinedione. Examples of benzylidene
thiazolidinediones that can be used are 5-[[3,4-bis(phenylmethoxy)
phenyl]methylene]-4-oxo-2-thioxo-3-thiazolidineacetic Acid;
5-[[3-(1-Phenylethoxy)-4-(2-phenylethoxy)]phenyl]methylene]-4-oxo-2-thiox-
o-3-thiazolidineacetic Acid; and
5-[[3-(1-Phenyl-2-hydroxy)ethoxy)-4-(2-phenylethoxy)]phenyl]methylene]-4--
oxo-2-thioxo-3-thiazolidineacetic Acid. In further still aspects,
the host cell further includes a nucleic acid that encodes an
alpha-1,2-mannosidase that has a signal peptide that directs it for
secretion.
[0032] In further aspects of any one of the aforementioned methods,
host cells further include lower eukaryote cells (e.g., yeast such
as Pichia pastoris) that are genetically engineered to eliminate
glycoproteins having .alpha.-mannosidase-resistant N-glycans by
deleting or disrupting one or more of the
.beta.-mannosyltransferase genes (e.g., BMT1, BMT2, BMT3, and BMT4)
(See, U.S. Published Patent Application No. 2006/0211085) or
abrogating translation of RNAs encoding one or more of the
.beta.-mannosyltransferases using interfering RNA, antisense RNA,
or the like.
[0033] In further aspects of any one of the methods herein, the
host cells can further include lower eukaryote cells (e.g., yeast
such as Pichia pastoris) that are genetically engineered to
eliminate glycoproteins having phosphomannose residues by deleting
or disrupting one or both of the phosphomannosyl transferase genes
PNO1 and MNN4B (See for example, U.S. Pat. Nos. 7,198,921 and
7,259,007), which in further aspects can also include deleting or
disrupting the MNN4A gene or abrogating translation of RNAs
encoding one or more of the phosphomannosyltransferases using
interfering RNA, antisense RNA, or the like.
[0034] In further still aspects, the host cell has been genetically
modified to produce glycoproteins that have predominantly an
N-glycan selected from the group consisting of complex N-glycans,
hybrid N-glycans, and high mannose N-glycans wherein complex
N-glycans are selected from the group consisting of
Man.sub.3GlcNAc.sub.2, GlcNAC.sub.(1-4)Man.sub.3GlcNAc.sub.2,
Gal.sub.(1-4)GlcNAc.sub.(1-4)Man.sub.3GlcNAc.sub.2, and
NANA.sub.(1-4)Gal.sub.(1-4)Man.sub.3GlcNAc.sub.2; hybrid N-glycans
are selected from the group consisting of Man.sub.5GlcNAc.sub.2,
GlcNAcMan.sub.5GlcNAc.sub.2, GalGlcNAcMan.sub.5GlcNAc.sub.2, and
NANAGalGlcNAcMan.sub.5GlcNAc.sub.2; and high Mannose N-glycans are
selected from the group consisting of Man.sub.6GlcNAc.sub.2,
Man.sub.7GlcNAc.sub.2, Man.sub.8GlcNAc.sub.2, and
Man.sub.9GlcNAc.sub.2.
[0035] In any one of the above embodiments or aspects, the first
regulatable promoter is a promoter that is inducible without
inducing expression of the second regulatable promoter. The second
regulatable promoter is a promoter that is inducible without
inducing the expression of the first regulatable promoter. In
further aspects, the inducer of the second regulatable promoter
inhibits transcription from the first regulatable promoter. In
particular aspects in which the host cells are yeast, the first
regulatable promoter is the GUT1 promoter and the second
regulatable promoter is the GADPH promoter. In other aspects, the
first regulatable promoter is the PCK1 promoter and the second
regulatable promoter is the GADPH promoter.
[0036] In general, in the above embodiments or aspects, the
immunoglobulin will be an IgG molecule and can include IgG1, IgG2,
IgG3, and IgG4 immunoglobulins and subspecies thereof. However, in
particular aspects of the above, the immunoglobulin is selected
from the group consisting of IgA, IgM, IgE, camel heavy chain, and
llama heavy chain.
[0037] The information derived from the host cells and methods
herein can be used to produce affinity matured immunoglobulins,
derivatives of the antibodies, and modified immunoglobulins or the
nucleic acid encoding the desired immunoglobulin can be subcloned
into another host cell for production or affinity maturation of the
immunoglobulin. Therefore, further provided is a host cell that
expresses an immunoglobulin that had been identified using any one
of the aforementioned methods but does not necessarily have to be
the host cell that was used to identify the immunoglobulin. The
host cell can be a prokaryote or eukaryote host cell.
[0038] Further provided is an immunoglobulin produced by any one of
the above embodiments or aspects.
[0039] The following terms, unless otherwise indicated, shall be
understood to have the following meanings:
[0040] As used herein, the terms "N-glycan" and "glycoform" are
used interchangeably and refer to an N-linked oligosaccharide,
e.g., one that is attached by an asparagine-N-acetylglucosamine
linkage to an asparagine residue of a polypeptide. N-linked
glycoproteins contain an N-acetylglucosamine residue linked to the
amide nitrogen of an asparagine residue in the protein. The
predominant sugars found on glycoproteins are glucose, galactose,
mannose, fucose, N-acetylgalactosamine (GalNAc),
N-acetylglucosamine (GlcNAc) and sialic acid (e.g.,
N-acetyl-neuraminic acid (NANA)). The processing of the sugar
groups occurs co-translationally in the lumen of the ER and
continues in the Golgi apparatus for N-linked glycoproteins.
[0041] N-glycans have a common pentasaccharide core of Man3GlcNAc2
("Man" refers to mannose; "Glc" refers to glucose; and "NAc" refers
to N-acetyl; GlcNAc refers to N-acetylglucosamine). N-glycans
differ with respect to the number of branches (antennae) comprising
peripheral sugars (e.g., GlcNAc, galactose, fucose and sialic acid)
that are added to the Man3GlcNAc2 ("Man3") core structure which is
also referred to as the "trimannose core", the "pentasaccharide
core" or the "paucimannose core". N-glycans are classified
according to their branched constituents (e.g., high mannose,
complex or hybrid). A "high mannose" type N-glycan has five or more
mannose residues. A "complex" type N-glycan typically has at least
one GlcNAc attached to the 1,3 mannose arm and at least one GlcNAc
attached to the 1,6 mannose arm of a "trimannose" core. Complex
N-glycans may also have galactose ("Gal") or N-acetylgalactosamine
("GalNAc") residues that are optionally modified with sialic acid
or derivatives (e.g., "NANA" or "NeuAc", where "Neu" refers to
neuraminic acid and "Ac" refers to acetyl). Complex N-glycans may
also have intrachain substitutions comprising "bisecting" GlcNAc
and core fucose ("Fuc"). Complex N-glycans may also have multiple
antennae on the "trimannose core," often referred to as "multiple
antennary glycans." A "hybrid" N-glycan has at least one GlcNAc on
the terminal of the 1,3 mannose arm of the trimannose core and zero
or more mannoses on the 1,6 mannose arm of the trimannose core. The
various N-glycans are also referred to as "glycoforms."
[0042] Abbreviations used herein are of common usage in the art,
see, e.g., abbreviations of sugars, above. Other common
abbreviations include "PNGase", or "glycanase" or "glucosidase"
which all refer to peptide N-glycosidase F (EC 3.2.2.18).
[0043] The term "operably linked" expression control sequences
refers to a linkage in which the expression control sequence is
contiguous with the gene of interest to control the gene of
interest, as well as expression control sequences that act in trans
or at a distance to control the gene of interest.
[0044] The term "expression control sequence" or "regulatory
sequences" are used interchangeably and as used herein refer to
polynucleotide sequences which are necessary to affect the
expression of coding sequences to which they are operably linked.
Expression control sequences are sequences which control the
transcription, post-transcriptional events and translation of
nucleic acid sequences. Expression control sequences include
appropriate transcription initiation, termination, promoter and
enhancer sequences; efficient RNA processing signals such as
splicing and polyadenylation signals; sequences that stabilize
cytoplasmic mRNA; sequences that enhance translation efficiency
(e.g., ribosome binding sites); sequences that enhance protein
stability; and when desired, sequences that enhance protein
secretion. The nature of such control sequences differs depending
upon the host organism; in prokaryotes, such control sequences
generally include promoter, ribosomal binding site, and
transcription termination sequence. The term "control sequences" is
intended to include, at a minimum, all components whose presence is
essential for expression, and can also include additional
components whose presence is advantageous, for example, leader
sequences and fusion partner sequences.
[0045] The term "recombinant host cell" ("expression host cell",
"expression host system", "expression system" or simply "host
cell"), as used herein, is intended to refer to a cell into which a
recombinant vector has been introduced. It should be understood
that such terms are intended to refer not only to the particular
subject cell but to the progeny of such a cell. Because certain
modifications may occur in succeeding generations due to either
mutation or environmental influences, such progeny may not, in
fact, be identical to the parent cell, but are still included
within the scope of the term "host cell" as used herein. A
recombinant host cell may be an isolated cell or cell line grown in
culture or may be a cell which resides in a living tissue or
organism.
[0046] The term "transfect", transfection", "transfecting" and the
like refer to the introduction of a heterologous nucleic acid into
eukaryote cells, both higher and lower eukaryote cells.
Historically, the term "transformation" has been used to describe
the introduction of a nucleic acid into a yeast or fungal cell;
however, herein the term "transfection" is used to refer to the
introduction of a nucleic acid into any eukaryote cell, including
yeast and fungal cells.
[0047] The term "eukaryotic" refers to a nucleated cell or
organism, and includes insect cells, plant cells, mammalian cells,
animal cells and lower eukaryotic cells.
[0048] The term "lower eukaryotic cells" includes yeast and
filamentous fungi. Yeast and filamentous fungi include, but are not
limited to Pichia pastoris, Pichia finlandica, Pichia trehalophila,
Pichia koclamae, Pichia membranaefaciens, Pichia minuta (Ogataea
minuta, Pichia lindneri), Pichia opuntiae, Pichia thermotolerans,
Pichia salictaria, Pichia guercuum, Pichia pijperi, Pichia stiptis,
Pichia methanolica, Pichia sp., Saccharomyces cerevisiae,
Saccharomyces sp., Hansenula polymorpha, Kluyveromyces sp.,
Kluyveromyces lactis, Candida albicans, Aspergillus nidulans,
Aspergillus niger, Aspergillus oryzae, Trichoderma reesei,
Chrysosporium lucknowense, Fusarium sp., Fusarium gramineum,
Fusarium venenatum, Physcomitrella patens and Neurospora crassa.
Pichia sp., any Saccharomyces sp., Hansenula polymorpha, any
Kluyveromyces sp., Candida albicans, any Aspergillus sp.,
Trichoderma reesei, Chrysosporium lucknowense, any Fusarium sp. and
Neurospora crassa.
[0049] As used herein, the terms "antibody," "immunoglobulin,"
"immunoglobulins" and "immunoglobulin molecule" are used
interchangeably. Each immunoglobulin molecule has a unique
structure that allows it to bind its specific antigen, but all
immunoglobulins have the same overall structure as described
herein. The basic immunoglobulin structural unit is known to
comprise a tetramer of subunits. Each tetramer has two identical
pairs of polypeptide chains, each pair having one "light" chain
(about 25 kDa) and one "heavy" chain (about 50-70 kDa). The
amino-terminal portion of each chain includes a variable region of
about 100 to 110 or more amino acids primarily responsible for
antigen recognition. The carboxy-terminal portion of each chain
defines a constant region primarily responsible for effector
function. Light chains are classified as either kappa or lambda.
Heavy chains are classified as gamma, mu, alpha, delta, or epsilon,
and define the antibody's isotype as IgG, IgM, IgA, IgD, and IgE,
respectively.
[0050] The light and heavy chains are subdivided into variable
regions and constant regions (See generally, Fundamental Immunology
(Paul, W., ed., 2nd ed. Raven Press, N.Y., 1989), Ch. 7. The
variable regions of each light/heavy chain pair form the antibody
binding site. Thus, an intact antibody has two binding sites.
Except in bifunctional or bispecific antibodies, the two binding
sites are the same. The chains all exhibit the same general
structure of relatively conserved framework regions (FR) joined by
three hypervariable regions, also called complementarity
determining regions or CDRs. The CDRs from the two chains of each
pair are aligned by the framework regions, enabling binding to a
specific epitope. The terms include naturally occurring forms, as
well as fragments and derivatives. Included within the scope of the
term are classes of immunoglobulins (Igs), namely, IgG, IgA, IgE,
IgM, and IgD. Also included within the scope of the terms are the
subtypes of IgGs, namely, IgG1, IgG2, IgG3, and IgG4. The term is
used in the broadest sense and includes single monoclonal
antibodies (including agonist and antagonist antibodies) as well as
antibody compositions which will bind to multiple epitopes or
antigens. The terms specifically cover monoclonal antibodies
(including full length monoclonal antibodies), polyclonal
antibodies, multispecific antibodies (for example, bispecific
antibodies), and antibody fragments so long as they contain or are
modified to contain at least the portion of the CH2 domain of the
heavy chain immunoglobulin constant region which comprises an
N-linked glycosylation site of the CH2 domain, or a variant
thereof. Included within the terms are molecules comprising only
the Fc region, such as immunoadhesins (U.S. Published Patent
Application No. 20040136986), Fc fusions, and antibody-like
molecules.
[0051] The term "Fc" fragment refers to the `fragment crystallized`
C-terminal region of the antibody containing the CH2 and CH3
domains. The term "Fab" fragment refers to the `fragment antigen
binding` region of the antibody containing the VH, CH1, VL and CL
domains.
[0052] The term "monoclonal antibody" (mAb) as used herein refers
to an antibody obtained from a population of substantially
homogeneous antibodies, i.e., the individual antibodies comprising
the population are identical except for possible naturally
occurring mutations that may be present in minor amounts.
Monoclonal antibodies are highly specific, being directed against a
single antigenic site. Furthermore, in contrast to conventional
(polyclonal) antibody preparations which typically include
different antibodies directed against different determinants
(epitopes), each mAb is directed against a single determinant on
the antigen. In addition to their specificity, monoclonal
antibodies are advantageous in that they can be synthesized by
hybridoma culture, uncontaminated by other immunoglobulins. The
term "monoclonal" indicates the character of the antibody as being
obtained from a substantially homogeneous population of antibodies,
and is not to be construed as requiring production of the antibody
by any particular method. For example, the monoclonal antibodies to
be used in accordance with the present invention may be made by the
hybridoma method first described by Kohler et al., (1975) Nature,
256:495, or may be made by recombinant DNA methods (See, for
example, U.S. Pat. No. 4,816,567 to Cabilly et al.).
[0053] The term "fragments" within the scope of the terms
"antibody" or "immunoglobulin" include those produced by digestion
with various proteases, those produced by chemical cleavage and/or
chemical dissociation and those produced recombinantly, so long as
the fragment remains capable of specific binding to a target
molecule. Among such fragments are Fc, Fab, Fab', Fv, F(ab')2, and
single chain Fv (scFv) fragments. Hereinafter, the term
"immunoglobulin" also includes the term "fragments" as well.
[0054] Immunoglobulins further include immunoglobulins or fragments
that have been modified in sequence but remain capable of specific
binding to a target molecule, including: interspecies chimeric and
humanized antibodies; antibody fusions; heteromeric antibody
complexes and antibody fusions, such as diabodies (bispecific
antibodies), single-chain diabodies, and intrabodies (See, for
example, Intracellular Antibodies: Research and Disease
Applications, (Marasco, ed., Springer-Verlag New York, Inc.,
1998).
[0055] The term "catalytic antibody" refers to immunoglobulin
molecules that are capable of catalyzing a biochemical reaction.
Catalytic antibodies are well known in the art and have been
described in U.S. Pat. Nos. 7,205,136; 4,888,281; 5,037,750 to
Schochetman et al., U.S. Pat. Nos. 5,733,757; 5,985,626; and U.S.
Pat. No. 6,368,839 to Barbas, III et al.
[0056] As used herein, the term "consisting essentially of" will be
understood to imply the inclusion of a stated integer or group of
integers; while excluding modifications or other integers which
would materially affect or alter the stated integer. With respect
to species of N-glycans, the term "consisting essentially of" a
stated N-glycan will be understood to include the N-glycan whether
or not that N-glycan is fucosylated at the N-acetylglucosamine
(GlcNAc) which is directly linked to the asparagine residue of the
glycoprotein.
[0057] As used herein, the term "predominantly" or variations such
as "the predominant" or "which is predominant" will be understood
to mean the glycan species that has the highest mole percent (%) of
total neutral N-glycans after the glycoprotein has been treated
with PNGase and released glycans analyzed by mass spectroscopy, for
example, MALDI-TOF MS or HPLC. In other words, the phrase
"predominantly" is defined as an individual entity, such as a
specific glycoform, is present in greater mole percent than any
other individual entity. For example, if a composition consists of
species A in 40 mole percent, species B in 35 mole percent and
species C in 25 mole percent, the composition comprises
predominantly species A, and species B would be the next most
predominant species. Some host cells may produce compositions
comprising neutral N-glycans and charged N-glycans such as
mannosylphosphate. Therefore, a composition of glycoproteins can
include a plurality of charged and uncharged or neutral N-glycans.
In the present invention, it is within the context of the total
plurality of neutral N-glycans in the composition in which the
predominant N-glycan determined. Thus, as used herein, "predominant
N-glycan" means that of the total plurality of neutral N-glycans in
the composition, the predominant N-glycan is of a particular
structure.
[0058] As used herein, the term "essentially free of" a particular
sugar residue, such as fucose, or galactose and the like, is used
to indicate that the glycoprotein composition is substantially
devoid of N-glycans which contain such residues. Expressed in terms
of purity, essentially free means that the amount of N-glycan
structures containing such sugar residues does not exceed 10%, and
preferably is below 5%, more preferably below 1%, most preferably
below 0.5%, wherein the percentages are by weight or by mole
percent. Thus, substantially all of the N-glycan structures in a
glycoprotein composition according to the present invention are
free of fucose, or galactose, or both.
[0059] As used herein, a glycoprotein composition "lacks" or "is
lacking" a particular sugar residue, such as fucose or galactose,
when no detectable amount of such sugar residue is present on the
N-glycan structures at any time. For example, in preferred
embodiments of the present invention, the glycoprotein compositions
are produced by lower eukaryotic organisms, as defined above,
including yeast (for example, Pichia sp.; Saccharomyces sp.;
Kluyveromyces sp.; Aspergillus sp.), and will "lack fucose,"
because the cells of these organisms do not have the enzymes needed
to produce fucosylated N-glycan structures. Thus, the term
"essentially free of fucose" encompasses the term "lacking fucose."
However, a composition may be "essentially free of fucose" even if
the composition at one time contained fucosylated N-glycan
structures or contains limited, but detectable amounts of
fucosylated N-glycan structures as described above.
[0060] The interaction of antibodies and antibody-antigen complexes
with cells of the immune system and the variety of responses,
including antibody-dependent cell-mediated cytotoxicity (ADCC) and
complement-dependent cytotoxicity (CDC), clearance of
immunocomplexes (phagocytosis), antibody production by B cells and
IgG serum half-life are defined respectively in the following:
Daeron et al., 1997, Annu. Rev. Immunol. 15: 203-234; Ward and
Ghetie, 1995, Therapeutic Immunol. 2:77-94; Cox and Greenberg,
2001, Semin. Immunol. 13: 339-345; Heyman, 2003, Immunol. Lett.
88:157-161; and Ravetch, 1997, Curr. Opin. Immunol. 9: 121-125.
BRIEF DESCRIPTION OF THE DRAWINGS
[0061] FIG. 1 illustrates the general operation of the method using
an embodiment wherein the immunoglobulin (Ig) light and heavy
chains are separately expressed and detection of cells that express
the immunoglobulin of interest is via a labeled antigen.
[0062] FIG. 2 (A-B) illustrates the construction of plasmid vector
pGLY642 (construction steps in A continued onto B).
[0063] FIG. 3 illustrates the construction of plasmid vector
pGLY2233
[0064] FIG. 4 (A-B) illustrates the construction of plasmid vector
pGFI207t (construction steps in A continued onto B).
[0065] FIG. 5 illustrates the construction of plasmid vector
pGLY1162.
[0066] FIG. 6 illustrates the genealogy of some of the yeast
strains used to demonstrate operation of the present invention.
[0067] FIG. 7 shows a map of plasmid vector pGLY2988.
[0068] FIG. 8 shows a map of plasmid vector pGLY3200.
[0069] FIG. 9 shows maps of plasmid vectors pGLY4136 and
pGLY4124.
[0070] FIG. 10 shows maps of plasmid vectors pGLY4116 and
pGLY4137.
[0071] FIG. 11 shows fluorescence microscopy results of strain
yGLY4134 (expresses anti-Her2 antibody), strain yGLY2696 (empty
strain) transfected with pGLY4136 encoding Protein A/SED1 fusion
protein, and strain yGLY4134 (expresses anti-Her2 antibody)
transfected with pGLY4136 encoding Protein A/SED1 fusion protein
incubated with goat anti-human IgG (H+L)-Alexa 488.
[0072] FIG. 12 shows fluorescence microscopy results of strain
yGLY2696 (empty strain) transfected with pGLY4136 encoding the
Protein A/SED1 fusion protein incubated with anti-Her2 antibody.
Goat anti-human IgG (H+L)-Alexa 488 was used for detection of
anti-antibody bound to the Protein A/SED1 fusion protein anchored
to the cell surface.
[0073] FIG. 13 shows fluorescence microscopy results of strain
yGLY2696 (empty strain) transfected with pGLY4136 encoding Protein
A/SED1 fusion protein, strain yGLY3920 (expresses anti-CD20
antibody) transfected with pGLY4136 encoding Protein A/SED1 fusion
protein, and strain yGLY4134 (expresses anti-Her2 antibody)
transfected with pGLY4136 encoding Protein A/SED1 fusion protein
incubated with anti-Her2 antibody. Goat anti-human IgG (H+L)-Alexa
488 was used for detection of anti-antibody bound to the Protein
A/SED1 fusion protein anchored to the cell surface.
[0074] FIG. 14 shows fluorescence microscopy results of strain
yGLY2696 (empty strain) transfected with pGLY4116 encoding the
FcRIII/SED1 fusion protein incubated with anti-Her2 antibody. Goat
anti-human IgG (H+L)-Alexa 488 was used for detection of
anti-antibody bound to the Protein A/SED1 fusion protein anchored
to the cell surface.
[0075] FIG. 15 shows maps of plasmid vectors pGLY439 and
pGLY4144.
[0076] FIG. 16 shows fluorescence microscopy results of strain
yGLY4134 (AOX promoter-anti-Her2 antibody) transfected with
pGLY4136 (AOX promoter-Protein A/SED1 fusion protein), strain
yGLY4134 (AOX promoter-anti-Her2 antibody) transfected with
pGLY4139 (GAPDH promoter-Protein A/SED1 fusion protein), and strain
yGLY5434(GAPDH promoter-anti-Her2 antibody) transfected with
pGLY4139 (GUT1 promoter-Protein A/SED1 fusion protein). Goat
anti-human IgG (H+L)-Alexa 488 was used for detection of
anti-antibody bound to the Protein A/SED1 fusion protein anchored
to the cell surface.
[0077] FIG. 17 illustrates the hypothetical expression of Protein
A/SED1 fusion protein and antibody under the control of different
combinations of promoters.
[0078] FIG. 18 shows fluorescence microscopy results of strains
yGLY5757 (expresses anti-CD20 antibody under control of the GAPDH
promoter) and yGLY5434 (expresses anti-Her2 antibody under control
of the GAPDH promoter), each transfected with pGLY4144 encoding
Protein A/SED1 fusion protein under the control of the GUT1
promoter. Protein A/SED1 fusion protein expression (GUT1 promoter)
was induced first under glycerol conditions; then antibody
expression from the GAPDH promoter was induced under dextrose
conditions, which also inhibits expression of the Protein A/SED1
fusion protein. Goat anti-human IgG (H+L)-Alexa 488 was used for
detection of anti-antibody bound to the Protein A/SED1 fusion
protein anchored to the cell surface.
[0079] FIG. 19 (A-B) shows the results of FACS sorting of the cells
shown in FIG. 18 (A: YGLY5757/pGLY4144; b: YGLY5434/pGLY4144). The
red line represents the negative control without co-expression of
antibody. The blue line represents colonies of anti-Her2 or
anti-CD20 expressing strains.
[0080] FIG. 20 shows a map of plasmid vector pGLY3033.
DETAILED DESCRIPTION OF THE INVENTION
[0081] The present invention provides a protein display system that
is capable of displaying diverse libraries of immunoglobulins on
the surface of a eukaryote host cell. The compositions and methods
are particularly useful for the display of collections of
immunoglobulins in the context of discovery (that is, screening) or
molecular evolution protocols. A salient feature of the method is
that it provides a display system in which a whole, intact
immunoglobulin molecule of interest can be displayed on the surface
of a host cell without having to express the immunoglobulin
molecule of interest either as fusion protein in which it is fused
to a surface anchor protein or other moiety that enables capture of
the immunoglobulin by a capture moiety bound to the cell surface.
Another feature of the method is that it enables screening diverse
libraries of immunoglobulins in host cells for a host cell in the
library that produces an immunoglobulin of interest and then
enables the host cell to be separated from the other host cells in
the library that do not express the immunoglobulin of interest.
Importantly, the isolated host cell can then be used for production
of the immunoglobulin of interest for use in therapeutic or
diagnostic applications. This is an improvement over phage and
yeast display methods wherein a diverse library of scFV or Fab
fragments are screened for a host cell that expresses an scFV or
Fab of interest, which is then used in a series of steps to
construct a mammalian host cell that expresses a whole
immunoglobulin with the characteristics of the scFV or Fab of
interest. These subsequent steps present the risk that the desired
affinity or specificity of an scFV or Fab that has been identified
during the maturation process of converting the scFV or Fab into a
whole immunoglobulin could be abrogated or diminished.
[0082] While current phage-based methods provide substantial
library diversity and have greatly improved the processes for
developing immunoglobulins, a disadvantage is that the prokaryotic
host cells used to construct the libraries do not produce N-linked
glycosylated glycoproteins. Posttranslational modifications such as
glycosylation can affect specificity or affinity of the
immunoglobulin. It is estimated that about 15-20% of circulating
monoclonal antibodies derived entirely in mammalian cells contain
one or more N-linked glycans in the variable regions. (Jefferis,
Biotechnol Progress 21: 11-16 (2005)) In some cases it is believed
that these N-glycans in the variable region may play a significant
role in immunoglobulin function. For example, both positive and
negative influences on antigen binding have been seen in antibody
molecules with variable region N-glycosylation. N-glycosylation
consensus sites added within the CDR2 region of an anti-dextran
antibody were filled with carbohydrates of varying structure and
showed changes in affinity, half-life and tissue targeting in a
site dependent manner (Coloma et al., The Journal of Immunology
162: 2162-2170 (1999)). Therefore, libraries produced and screened
in prokaryotic host cells will tend to be biased against
immunoglobulin species that might have glycosylation in the
variable region. Thus, immunoglobulins that might have particularly
desirable specificity or affinity due in whole or in part to
glycosylation of one or more sites in the variable regions will not
be identified. Conversely, antibodies identified through
prokaryotic screening methods may, when expressed in a eukaryotic
host, have glycosylation structures that unfavorably impact folding
or affinity. The methods and systems herein for the first time
enable libraries of immunoglobulins to be screened wherein the
libraries include populations of immunoglobulins that are
glycosylated in the variable region. This has the potential effect
of increasing the diversity of the library over what would be
expected if the diversity of the library was based solely on
sequence. This improvement is expected to increase the ability to
develop immunoglobulins that have greater specificity or affinity
than current methods permit.
[0083] The methods and systems herein also provide another
advantage over current methods in that eukaryote host cells that
have been genetically engineered to produce glycoproteins that have
predominantly particular N-glycan structures can be used. The
N-glycan structures include any of the N-glycan structures
currently found on human immunoglobulins or N-glycan structures
that lack features not found in glycoproteins from higher
eukaryotes. For example, in the case of yeast, the host cells can
be genetically engineered to produce immunoglobulins wherein the
N-glycans are not hypermannosylated. The host cells can be
genetically engineered to limit the amount of O-glycosylation or to
modify O-glycosylation to resemble O-glycosylation in mammalian
cells.
[0084] A significant advantage of the methods and systems is that
the host cell identified in the library to produce a desired
immunoglobulin can be used without further development or
manipulation of the host cell or the nucleic acid molecule encoding
the immunoglobulin for production of the immunoglobulin. That is,
cultivating the host cells identified herein as expressing the
desired immunoglobulin under conditions that induce expression of
the desired immunoglobulin without inducing expression of the
capture moiety either before, after, or at the same time: the cells
secrete the desired immunoglobulin, which can then be recovered
from the culture medium using methods well known in the art. An
important element is that the immunoglobulin that is produced is a
whole, intact immunoglobulin molecule. This ability to use library
cells to produce whole, intact immunoglobulins is not possible with
the current phage-based or yeast-based systems. In those systems,
the nucleic acid molecules encoding the desired Fab or scFV has to
be further manipulated to construct a nucleic acid molecule that
encodes a whole, intact immunoglobulin, which is then transfected
into a mammalian cell for production of the whole, intact
immunoglobulin. Thus, the methods and systems herein provide
significant improvements to the development and production of
immunoglobulins for therapeutic or diagnostic purposes.
[0085] What is provided then is a method for constructing and
isolating a eukaryotic host cell expressing an immunoglobulin of
interest from a library of host cells expressing a plurality of
immunoglobulins. The method enables the construction and selection
of immunoglobulins with desirable specificity and/or affinity
properties. In general, the method comprises providing a host cell
that comprises a first nucleic acid molecule encoding a capture
moiety comprising a cell surface anchoring protein fused to a
binding moiety that is capable of specifically binding an
immunoglobulin operably linked to a first regulatable promoter. The
host cell can be further genetically engineered to produce
immunoglobulins having particular predominant N-glycan
structures.
[0086] In one aspect, the host cell is propagated in a culture to
provide a multiplicity of host cells, which are then transfected
with a plurality of second nucleic acid molecules, each nucleic
acid molecule encoding the heavy and/or light chains of an
immunoglobulin wherein at least the nucleic acid encoding a heavy
chain is operably linked to a second regulatable promoter when the
capture moiety binds the heavy chain or at least the nucleic acid
encoding a light chain is operably linked to a second regulatable
promoter when the capture moiety binds the light chain. This
produces a plurality of host cells wherein each host cell in the
plurality of host cells capable of displaying an immunoglobulin on
the surface thereof and each host cell in the plurality of host
cells is capable of displaying a particular distinct immunoglobulin
species. In general, the diversity of the host cell population in
the plurality of host cells will depend on the diversity of the
library of nucleic acid molecules that was transfected into the
host cells.
[0087] In another aspect, the host cell is propagated in a culture
to provide a multiplicity of host cells, which are then transfected
with one or more nucleic acid second molecules encoding the heavy
and/or light chains of an immunoglobulin wherein at least the
nucleic acid encoding a heavy chain is operably linked to a second
regulatable promoter when the capture moiety binds the heavy chain
or at least the nucleic acid encoding a light chain is operably
linked to a second regulatable promoter when the capture moiety
binds the light chain to provide a multiplicity of host cells that
are capable of displaying the encoded immunoglobulin on the surface
thereof. Mutagenesis of the multiplicity of host cells is used to
generate a plurality of host cells that encode a variegated
population of mutants of the immunoglobulin. The diversity is
dependent on the mutagenesis method used. Suitable methods for
mutagenesis include but are not limited to cassette mutagenesis,
error-prone PCR, chemical mutagenesis, or shuffling to generate a
refined repertoire of altered sequences that resemble the parent
nucleic acid molecule.
[0088] In further aspects, the host cell is propagated in a culture
to provide a multiplicity of host cells, which are then transfected
with a plurality of second nucleic acid molecules, each nucleic
acid molecule encoding the heavy and/or light chains of an
immunoglobulin wherein at least the nucleic acid encoding a heavy
chain is operably linked to a second regulatable promoter when the
capture moiety binds the heavy chain or at least the nucleic acid
encoding a light chain is operably linked to a second regulatable
promoter when the capture moiety binds the light chain to produce a
plurality of host cells that are capable of displaying an
immunoglobulin on the surface thereof. Mutagenesis is then used to
generate further increase the diversity of the plurality of host
cells that are capable of displaying an immunoglobulin on the
surface thereof.
[0089] In particular embodiments, the nucleic acid molecules
encoding both the heavy and light chains are operably linked to a
second regulatable promoter. In other embodiments, the nucleic acid
molecules encoding at least one of the heavy chains are operably
linked to a second regulatable promoter and the nucleic acid
molecules encoding the light chain are operably linked to a third
regulatable promoter or to a constitutive promoter. In particular
aspects, a plurality of nucleic acids encoding sub-populations of
heavy chains are provided wherein expression of each sub-population
is effected by a second, third, or more regulatable promoter such
that different sub-populations can be expressed at a particular
time while other sub-populations are not expressed at that
time.
[0090] In general, the heavy and light chains are encoded by
separate open reading frames (ORFs) wherein each ORF is operably
linked to a promoter. However, in other aspects, the heavy and
light chains are encoded by a single ORF, which produces a single
fusion polypeptide comprising the heavy and light chains in a
tandem orientation, and the ORF is operably linked to a regulatable
promoter. The single polypeptide is cleavable between the heavy and
light chains to produce separate heavy and light chain proteins,
which can then associate to form a functional antibody molecule.
(See for example, U.S. Published Application No. 2006/0252096).
[0091] In any one of the above aspects, the expression of the first
nucleic acid molecule encoding the capture moiety is induced for a
time sufficient to produce the capture moiety and allow it to be
transported to and then bound to the surface of the host cell such
that the capture moiety is capable of binding immunoglobulin
molecules as they are secreted from the host cell. Expression of
the capture moiety is then reduced or inhibited and expression of
the nucleic acid molecules encoding the heavy and/or light chains
of the immunoglobulins operably linked to the second regulatable
promoter is induced. While expression of both the heavy and light
chains can be induced, in particular aspects, the expression of the
heavy chain is induced and expression of the light chain is
constitutive. In other aspects, when the capture moiety binds the
light chain, expression of the light chain can be regulated and
expression of the heavy chain can be constitutive. Thus, whether it
is the heavy chain or the light chain that is captured determines
whether it is the light chain or the heavy chain whose expression
is regulated.
[0092] Inhibition of expression of the capture moiety can be
effected by no longer providing the inducer than induces expression
of the capture moiety, or by providing an inhibitor of the first
regulatable promoter that inhibits expression of the capture
moiety, or by using an inducer of expression of the immunoglobulins
heavy and/or light chains operably linked to a second or more
inducible promoter that also inhibits expression of the capture
moiety. Inhibition can be complete repression of expression or a
reduction in expression to an amount wherein expression of the
capture moiety is such that it does not interfere with the
processing and transport of the heavy and light chains through the
secretory pathway. The expressed immunoglobulin heavy and/or light
chains are processed and transported to the cell surface via the
host cell secretory pathway where they are captured by the capture
moiety bound to the host cell surface for display. The plurality of
host cells with the expressed immunoglobulins displayed thereon are
then screened using a detection means that will bind to the
immunoglobulin of interest but not to other immunoglobulins to
identify the host cells that display the immunoglobulin of interest
on the surface thereof from those host cells that do not display
the immunoglobulin of interest. Host cells that express and display
the immunoglobulin of interest are separated from the host cells
that do not express and display the immunoglobulin of interest to
produce a population of host cells comprising exclusively or
enriched for the host cells displaying the immunoglobulin of
interest. These separated host cells can be propagated and used to
produce the immunoglobulin of interest in the quantities needed for
the use intended. The nucleic acid encoding the immunoglobulin can
be determined and an expression vector encoding the heavy and light
chains of the immunoglobulin can be constructed and used to
transfect another host cell, which can be a prokaryotic or
eukaryotic host cell.
[0093] Detection and analysis of host cells that express the
immunoglobulin of interest can be achieved by labeling the host
cells with an antigen that is specifically recognized by the
immunoglobulin of interest. In particular aspects, the antigen is
labeled with a detection moiety. In other aspects the antigen is
unlabeled and detection is achieved by using a detection
immunoglobulin that is labeled with a detection moiety and binds an
epitope of the antigen that is not bound by the immunoglobulin of
interest. This enables selection of host cells that produce
immunoglobulins that bind the antigen at an epitope other than the
epitope bound by the detection immunoglobulin. In another aspect,
the detection immunoglobulin is specific for the
immunoglobulin-antigen complex. Regardless of the detection means,
a high occurrence of the label indicates the immunoglobulin of
interest has desirable binding properties and a low occurrence of
the label indicates the immunoglobulin of interest does not have
desirable binding properties.
[0094] Detection moieties that are suitable for labeling are well
known in the art. Examples of detection moieties, include but are
not limited to, fluorescein (FITC), Alexa Fluors such as Alexa Fuor
488 (Invitrogen), green fluorescence protein (GFP),
Carboxyfluorescein succinimidyl ester (CFSE), DyLight Fluors
(Thermo Fisher Scientific), HyLite Fluors (AnaSpec), and
phycoerythrin. Other detection moieties include but are not limited
to, magnetic beads which are coated with the antigen of interest or
immunoglobulins that are specific for the immunoglobulin of
interest or immunoglobulin-antigen complex. In particular aspects,
the magnetic beads are coated with anti-fluorochrome
immunoglobulins specific for the fluorescent label on the labeled
antigen or immunoglobulin. Thus, the host cells are incubated with
the labeled-antigen or immunoglobulin and then incubated with the
magnetic beads specific for the fluorescent label.
[0095] Analysis of the cell population and cell sorting of those
host cells that display the immunoglobulin of interest based upon
the presence of the detection moiety can be accomplished by a
number of techniques known in the art. Cells that display the
immunoglobulin of interest can be analyzed or sorted by, for
example, flow cytometry, magnetic beads, or fluorescence-activated
cell sorting (FACS). These techniques allow the analysis and
sorting according to one or more parameters of the cells. Usually
one or multiple secretion parameters can be analyzed simultaneously
in combination with other measurable parameters of the cell,
including, but not limited to, cell type, cell surface antigens,
DNA content, etc. The data can be analyzed and cells that display
the immunoglobulin of interest can be sorted using any formula or
combination of the measured parameters. Cell sorting and cell
analysis methods are known in the art and are described in, for
example, The Handbook of Experimental Immunology, Volumes 1 to 4,
(D. N. Weir, editor) and Flow Cytometry and Cell Sorting (A.
Radbruch, editor, Springer Verlag, 1992). Cells can also be
analyzed using microscopy techniques including, for example, laser
scanning microscopy, fluorescence microscopy; techniques such as
these may also be used in combination with image analysis systems.
Other methods for cell sorting include, for example, panning and
separation using affinity techniques, including those techniques
using solid supports such as plates, beads, and columns.
[0096] In further aspects, provided is a library method for
identifying and selecting cells that produce an immunoglobulin
having a desired specificity and/or affinity for a particular
antigen. The method comprises providing a library of eukaryote host
cells displaying on their surface an immunoglobulin comprising a VH
domain and a VL domain, wherein the library is created by (i)
providing eukaryote host cells that express a capture moiety
comprising a cell surface anchoring protein fused to a moiety
capable of binding an immunoglobulin wherein expression of the
capture moiety is effected by a first regulatable promoter; and
(ii) transfecting the host cells with a library of nucleic acid
sequences encoding a genetically diverse population of
immunoglobulins, wherein the VH domains of the genetically diverse
population of immunoglobulins are biased for one or more VH gene
families and wherein expression of at least one or more heavy or
light chains is effected by a second regulatable promoter to
produce a plurality of host cells, each host cell in the plurality
of host cells expresses an immunoglobulin species. Expression of
the capture moiety is induced in the plurality of host cells for a
time sufficient to produce the capture moiety on the surface of the
host cells. Then expression of the of the capture moiety while
expression of the library of nucleic acid sequences is induced in
the plurality of host cells to produce a plurality of host cells
wherein each host cell displays an immunoglobulin species at the
surface thereof. Host cells in the plurality of host cells that
display immunoglobulins thereon that has a binding specificity for
the antigen of interest are identified by contacting the plurality
of host cells with the antigen of interest and detecting the host
cells that have the antigen of interest bound to the immunoglobulin
displayed thereon to produce the host cells that produce the
immunoglobulin having a VH domain and a VL domain and having the
antigen binding site with binding specificity for the antigen of
interest. In particular aspects, the immunoglobulin comprises a
synthetic human immunoglobulin VH domain and a synthetic human
immunoglobulin VL domain and further, the synthetic human
immunoglobulin VH domain and the synthetic human immunoglobulin VL
domain comprise framework regions and hypervariable loops, wherein
the framework regions and first two hypervariable loops of both the
VH domain and VL domain are essentially human germ line, and
wherein the VH domain and VL domain have altered CDR3 loops.
[0097] This provides a library of host cells that are capable of
expressing a plurality of immunoglobulin molecules, which can be
captured and displayed on the cell surface for detection by a
detection means that can bind an immunoglobulin specific for a
particular antigen and thereby enable the host cell expressing the
immunoglobulin to be identified from the plurality of host cells in
the library. In general, the detection means will usually use the
antigen that has been labeled with a detection moiety. These host
cells can be isolated from the plurality of host cells by any means
currently used for selection of particular cells in a population of
cells, e.g., FACS sorting.
[0098] Thus, the method comprises at least two components. The
first component is a helper vector that contains an expression
cassette comprising the first nucleic acid molecule that encodes
and expresses a capture moiety that in particular embodiments
comprises a cell surface anchoring protein or cell wall binding
protein that is capable of binding or integrating to the surface of
the host cell fused at its N- or C-terminus to a binding moiety
capable of binding an immunoglobulin. The binding moiety is located
at the end of the cell surface anchoring protein that is exposed to
the extracellular environment such that the binding moiety is
capable of interacting with an immunoglobulin. The immunoglobulin
binding moiety includes the immunoglobulin binding domains from
such molecules as protein A, protein G, protein L, or the like or
an Fc receptor.
[0099] The second component is one or more vectors that contain
expression cassettes that encode and express the heavy and light
chains of an immunoglobulin of interest or libraries of which the
immunoglobulin of interest is to be selected (for example, a
library of vectors expressing immunoglobulins). In particular
aspects, the nucleic acid molecule encoding the immunoglobulin may
include the nucleotide sequences encoding both the heavy and the
light chains of the immunoglobulins, e.g., an immunoglobulin having
a VH domain and a VL domain and having an antigen binding site with
binding specificity for an antigen of interest. In other aspects,
the heavy and light chains are encoded on separate nucleic acid
molecules. In either case, these nucleic acid molecules may further
include when desirable codon optimizations to enhance translation
of the mRNA encoding the immunoglobulins in the host cell chosen.
The nucleic acid molecule may further include when desirable
replacement of endogenous signal peptides with signal peptides that
are appropriate for the host cell chosen.
[0100] In one aspect, the above nucleic acid molecule can comprise
a single expression cassette operably linked to a second
regulatable promoter wherein the open reading frames (ORFs) for the
light and heavy chains are in frame and separated by a nucleic acid
molecule encoding in frame a protease cleavage site that upon
expression produces a fusion protein that is processed
post-translationally with a protease specific for the protease
cleavage site to produce the light and heavy chains of the
immunoglobulin. Examples of these expression cassettes can be found
in for example, U.S. Publication No. 20060252096. In another
aspect, the heavy and light immunoglobulin chains are expressed
from separate expression cassettes wherein the ORF encoding each of
the light and heavy chains is operably linked to a second
regulatable promoter. Examples of these expression cassettes can be
found in for example, U.S. Pat. Nos. 4,816,567 and 4,816,397. In a
further aspect, the heavy and light immunoglobulin chains are
expressed from separate expression cassettes wherein the ORF
encoding the heavy chain is operably linked to a second regulatable
promoter and the ORF encoding the light chain is operably linked to
a constitutive promoter.
[0101] In particular aspects, the encoded immunoglobulin comprises
a synthetic human immunoglobulin VH domain and a synthetic human
immunoglobulin VL domain and wherein the synthetic human
immunoglobulin VH domain and the synthetic human immunoglobulin VL
domain comprise framework regions and hypervariable loops, wherein
the framework regions and first two hypervariable loops of both the
VH domain and VL domain are essentially human germ line, and
wherein the VH domain and VL domain have altered CDR3 loops. In
further still aspects, in addition to having altered CDR3 loops,
the human synthetic immunoglobulin VH and VL domains contain
mutations in other CDR loops. In further aspects, each human
synthetic immunoglobulin VH domain CDR loop is of random sequence.
In further still aspects, the human synthetic immunoglobulin VH
domain CDR loops are of known canonical structures and incorporate
random sequence elements.
[0102] Both of the components can be provided in vectors which
integrate the nucleic acid molecules into the genome of the host
cell by homologous recombination. Homologous recombination can be
double crossover or single crossover homologous recombination.
Roll-in single crossover homologous recombination has been
described in Nett et al., Yeast 22: 295-304 (2005). Each component
can be integrated in the same locus in the genome or in separate
loci in the genome. Alternatively, one or both components can be
transiently expressed in the host cell.
[0103] FIG. 1 illustrates the general operation of the method using
an embodiment wherein the immunoglobulin light and heavy chains are
separately expressed and detection is via a labeled antigen. FIG. 1
shows an expression cassette encoding the capture moiety fusion
protein operably linked to promoter A and expression cassettes
encoding the immunoglobulin (Ig) light and heavy chains, each
operably linked to promoter B. As shown, the host cell is
transfected with the expression cassettes and the transformed cells
grown under conditions that induce expression of the capture moiety
fusion protein via promoter A. The capture moiety fusion protein is
anchored to the cell surface. Then the cells are grown under
conditions that inhibit or reduce expression of the capture moiety
fusion protein but induce expression of the immunoglobulin light
and heavy chains via promoter B. The immunoglobulins are secreted
from the cells and captured by the capture moiety fusion protein
anchored to the cell surface. The cells with the captured
immunoglobulins are then screened for the Ig of interest using a
antigen labeled with a detection moiety. As shown, not all cells
will produce the immunoglobulin of interest. Cells that bind the
labeled antigen are selected and separated from cells that do not
produce the immunoglobulin of interest. This produces cells that
express the immunoglobulin of interest. These cells can be used for
producing the immunoglobulin for use in therapeutic or diagnostic
applications. Alternatively, the cells can undergo mutagenesis that
introduces mutations into the expression cassettes encoding the
immunoglobulins and the cells screened for cells that produce
immunoglobulins with properties that have been modified or altered
from those properties in the immunoglobulin prior to mutagenesis
and which are desired. Cells that express immunoglobulins having
modified or altered but desired properties can be separated from
the other cells and used for producing the immunoglobulin for
therapeutic or diagnostic applications.
[0104] Glycosylphosphatidylinositol-anchored (GPI) proteins provide
a suitable means for tethering the capture moiety to the surface of
the host cell. GPI proteins have been identified and characterized
in a wide range of species from humans to yeast and fungi. Thus, in
particular aspects of the methods disclosed herein, the cell
surface anchoring protein is a GPI protein or fragment thereof that
can anchor to the cell surface. Lower eukaryotic cells have systems
of GPI proteins that are involved in anchoring or tethering
expressed proteins to the cell wall so that they are effectively
displayed on the cell wall of the cell from which they were
expressed. For example, 66 putative GPI proteins have been
identified in Saccharomyces cerevisiae (See, de Groot et al., Yeast
20: 781-796 (2003)). GPI proteins which may be used in the methods
herein include, but are not limited to, Saccharomyces cerevisiae
CWP1, CWP2, SED1, and GAS1; Pichia pastoris SP1 and GAS1; and H.
polymorpha TIP1. Additional GPI proteins may also be useful.
Additional suitable GPI proteins can be identified using the
methods and materials of the invention described and exemplified
herein.
[0105] The selection of the appropriate GPI protein will depend on
the particular recombinant protein to be produced in the host cell
and the particular post-translation modifications to be performed
on the recombinant protein. For example, production of
immunoglobulins with particular glycosylation patterns will entail
the use of recombinant host cells that produce glycoproteins having
particular glycosylation patterns. The GPI protein most suitable in
a system for producing antibodies or fragments thereof that have
predominantly Man.sub.5GlcNAc.sub.2 N-glycosylation many not
necessarily be the GPI protein most suitable in a system for
producing antibodies or thereof having predominantly
Gal.sub.2GlcNAc.sub.2Man.sub.3GlcNAc.sub.2 N-glycosylation. In
addition, the GPI most suitable in a system for producing
immunoglobulins specific for one epitope or antigen may not
necessarily be the most suitable GPI protein in a system for
producing immunoglobulins specific for another epitope or
antigen.
[0106] Therefore, further provided is a library method for
constructing the host cell that is to be used for producing a
particular immunoglobulin. In general, the host cell that is
desired to produce the particular immunoglobulin is selected based
on the desired characteristics that will be imparted to the
particular immunoglobulin produced by the host cell. For example, a
host cell that produces glycoproteins having predominantly
Man.sub.5GlcNAc.sub.2 or Gal.sub.2GlcNAc.sub.2Man.sub.3GlcNAc.sub.2
N-glycosylation is selected and a library of vectors encoding GPI
proteins fused to one or more immunoglobulin capture moieties is
then provided (GPI-IgG capture moiety). A library of host cells is
then constructed wherein each host cell to make up the library is
transfected with one of the vectors in the library of vectors
encoding GPI-IgG capture moiety fusion proteins such that each host
cell species in the library will express one particular GPI-IgG
capture moiety fusion protein. Each host cell species of the
library is then transfected with a vector encoding the desired
particular immunoglobulin. The host cell that results in the best
presentation of the particular immunoglobulin on the surface of the
host cell is selected as the host cell for producing the particular
immunoglobulin.
[0107] In general, the GPI protein used in the methods disclosed
herein is a chimeric protein or fusion protein comprising the GPI
protein fused at its N-terminus to the C-terminus of an
immunoglobulin capture moiety. The N-terminus of the capture moiety
is fused to the C-terminus of a signal sequence or peptide that
enables the GPI-IgG capture moiety fusion protein to be transported
through the secretory pathway to the cell surface where the GPI-IgG
capture moiety fusion protein is secreted and then bound to the
cell surface. In some aspects, the GPI-IgG capture moiety fusion
protein comprises the entire GPI protein and in other aspects, the
GPI-IgG capture moiety fusion protein comprises the portion of the
GPI protein that is capable of binding to the cell surface.
[0108] The immunoglobulin capture moiety can comprise any molecule
that can bind to an immunoglobulins. A multitude of Gram-positive
bacteria species have been isolated that express surface proteins
with affinities for mammalian immunoglobulins through interaction
with their heavy chains. The best known of these immunoglobulin
binding proteins are type 1 Staphylococcus Protein A and type 2
Streptococcus Protein G which have been shown to interact
principally through the C2-C3 interface on the Fc region of human
immunoglobulins. In addition, both have also been shown to interact
weakly to the Fab region, but again through the immunoglobulin
heavy chain.
[0109] Recently, a novel protein from Peptococcusmagnums, Protein
L, has been reported that was found to bind to human, rabbit,
porcine, mouse, and rat immunoglobulins uniquely through
interaction with their light chains. In humans this interaction has
been shown to occur exclusively to the kappa chains. Since both
kappa and lambda light chains are shared between different classes,
Protein L binds strongly to all human classes, in particular to the
multi-subunit IgM, and similarly is expected to bind to all classes
in species that show Protein L light chain binding.
[0110] Examples of other binding moieties, include but are not
limited to, Fc receptor (FcR) proteins and immunoglobulin-binding
fragments thereof. The FCR proteins include members of the Fc gamma
receptor (Fc.gamma.R) family, which bind gamma immunoglobulin
(IgG), Fc epsilon receptor (Fc.epsilon.R) family, which bind
epsilon immunoglobulin (IgE), and Fc alpha receptor (Fc.alpha.R)
family, which bind alpha immunoglobulin (IgA). Particular FcR
proteins that bind IgG and can be used to comprise the capture
moiety disclosed herein include at least the immunoglobulin binding
portion of any one of Fc.gamma.RI, Fc.gamma.RIIA, Fc.gamma.RIIB1,
Fc.gamma.RIIB2, Fc.gamma.RIIIA, Fc.gamma.RIIIB or Fc.gamma.Rn
(neonatal).
[0111] Regulatory sequences which may be used in the practice of
the methods disclosed herein include signal sequences, promoters,
and transcription terminator sequences. It is generally preferred
that the regulatory sequences used be from a species or genus that
is the same as or closely related to that of the host cell or is
operational in the host cell type chosen. Examples of signal
sequences include those of Saccharomyces cerevisiae invertase; the
Aspergillus niger amylase and glucoamylase; human serum albumin;
Kluyveromyces maxianus inulinase; and Pichia pastoris mating factor
and Kar2. Signal sequences shown herein to be useful in yeast and
filamentous fungi include, but are not limited to, the alpha mating
factor presequence and preprosequence from Saccharomyces
cerevisiae; and signal sequences from numerous other species.
[0112] Examples of promoters include promoters from numerous
species, including but not limited to alcohol-regulated promoter,
tetracycline-regulated promoters, steroid-regulated promoters
(e.g., glucocorticoid, estrogen, ecdysone, retinoid, thyroid),
metal-regulated promoters, pathogen-regulated promoters,
temperature-regulated promoters, and light-regulated promoters.
Specific examples of regulatable promoter systems well known in the
art include but are not limited to metal-inducible promoter systems
(e.g., the yeast copper-metallothionein promoter), plant herbicide
safner-activated promoter systems, plant heat-inducible promoter
systems, plant and mammalian steroid-inducible promoter systems,
Cym repressor-promoter system (Krackeler Scientific, Inc. Albany,
N.Y.), RheoSwitch System (New England Biolabs, Beverly Mass.),
benzoate-inducible promoter systems (See WO2004/043885), and
retroviral-inducible promoter systems. Other specific regulatable
promoter systems well-known in the art include the
tetracycline-regulatable systems (See for example, Berens &
Hillen, Eur J Biochem 270: 3109-3121 (2003)), RU 486-inducible
systems, ecdysone-inducible systems, and kanamycin-regulatable
system. Lower eukaryote-specific promoters include but are not
limited to the Saccharomyces cerevisiae TEF-1 promoter, Pichia
pastoris GAPDH promoter, Pichia pastoris GUT1 promoter, PMA-1
promoter, Pichia pastoris PCK-1 promoter, and Pichia pastoris AOX-1
and AOX-2 promoters. For temporal expression of the GPI-IgG capture
moiety and the immunoglobulins, the Pichia pastoris GUT1 promoter
operably linked to the nucleic acid molecule encoding the GPI-IgG
capture moiety and the Pichia pastoris GAPDH promoter operably
linked to the nucleic acid molecule encoding the immunoglobulin are
shown in the examples herein to be useful.
[0113] Examples of transcription terminator sequences include
transcription terminators from numerous species and proteins,
including but not limited to the Saccharomyces cerevisiae
cytochrome C terminator; and Pichia pastoris ALG3 and PMA1
terminators.
[0114] Nucleic acid molecules encoding immunoglobulins can be
obtained from any suitable source including spleen and liver cells
and antigen-stimulated antibody producing cells, obtained from
either in vivo or in vitro sources. Regardless of source, the
cellular VH and VL mRNAs are reverse transcribed into VH and VL
cDNA sequences. Reverse transcription may be performed in a single
step or in an optional combined reverse transcription/PCR procedure
to produce cDNA libraries containing a plurality of
immunoglobulin-encoding DNA molecules. (See, for example, Marks et
al., J. Mol. Biol. 222: 581-596 (1991)). Nucleic acid molecules can
also be synthesized de novo based on sequences in the scientific
literature. Nucleic acid molecules can also be synthesized by
extension of overlapping oligonucleotides spanning a desired
sequence (See, e.g., Caldas et al., Protein Engineering, 13:
353-360 (2000)). Humanized immunoglobulin-encoding cDNA libraries
can be constructed by PCR amplifying the complementary-determining
regions (CDR) from the cDNAs in one or more libraries from any
source and integrating the PCR amplified CDR-encoding nucleic acid
molecules into nucleic acid molecules encoding a human
immunoglobulin framework to produce a cDNA library encoding a
plurality of humanized immunoglobulins (See, for example, U.S. Pat.
Nos. 6,180,370; 6,632,927; and, 6,872,392). Chimeric
immunoglobulin-encoding cDNA libraries can be constructed by PCR
amplifying the variable regions from the cDNAs in the cDNA library
from one species and integrating the nucleic acid molecules
encoding the PCR-amplified variable regions onto nucleic acid
molecules encoding immunoglobulin constant regions from another
species to produce a cDNA library encoding a plurality of chimeric
immunoglobulins (See, for example, U.S. Pat. No. 5,843,708).
Various methods that have been developed for the creation of
diversity within protein libraries, including random mutagenesis
(Daugherty et al., Proc. Natl Acad. Sci. USA, 97: 2029-2034 (2000);
Boder et al., Proc. Natl Acad. Sci. USA, 97: 10701-10705 (2000);
Holler et al., Proc. Natl Acad. Sci. USA, 97: 5387-5392 (2000)), in
vitro DNA shuffling (Stemmer, Nature, 370: 389-391 (1994); Stemmer,
Proc. Natl Acad. Sci. USA, 91: 10747-10751 (1994)), in vivo DNA
shuffling (Swers et al., Nucl. Acid Res. 32: e36 (2004)), and
site-specific recombination (Rehberg et al., J. Biol. Chem., 257:
11497-11502 (1982); Streuli et al., Proc. Natl Acad. Sci. USA, 78:
2848-2852 (1981); Waterhouse et al., (1993) Nucl. Acids Res., 21:
2265-2266 (1993); Sblattero & Bradbury, Nat. Biotechnol., 18:
75-80 (2000)) can be used or adapted to produce the plurality of
host cells disclosed herein that express immunoglobulins and the
capture moiety comprising a cell surface anchoring protein fused to
a binding moiety that is capable of specifically binding an
immunoglobulin.
[0115] Production of active immunoglobulins requires proper folding
of the protein when it is produced and secreted by the cells. In E.
coli, the complexity and large size of an antibody presents an
obstacle to proper folding and assembly of the expressed light and
heavy chain polypeptides, resulting in poor yield of intact
antibody. The presence of effective molecular chaperone proteins
may be required, or may enhance the ability of the cell to produce
and secrete properly folded proteins. The use of molecular
chaperone proteins to improve production of immunoglobulins in
yeast has been disclosed in U.S. Pat. No. 5,772,245; U.S. Pat. Nos.
5,700,678 and 5,874,247; U.S. Application Publication No.
2002/0068325; Toman et al., J. Biol. Chem. 275: 23303-23309 (2000);
Keizer-Gunnink et al., Martix Biol. 19: 29-36 (2000); Vad et al.,
J. Biotechnol. 116: 251-260 (2005); Inana et al., Biotechnol.
Bioengineer. 93: 771-778 (2005); Zhang et al., Biotechnol. Prog.
22: 1090-1095 (2006); Damasceno et al., Appl. Microbiol.
Biotechnol. 74: 381-389 (2006); Huo et al., Protein Express. Purif
54: 234-239 (2007); and copending application Ser. No. 61/066,409,
filed 20 Feb. 2008.
[0116] As used herein, the methods can use host cells from any kind
of cellular system which can be modified to express a capture
moiety comprising a cell surface anchoring protein fused to a
binding moiety capable of binding an immunoglobulin and whole,
intact immunoglobulins. Within the scope of the invention, the term
"cells" means the cultivation of individual cells, tissues, organs,
insect cells, avian cells, reptilian cells, mammalian cells,
hybridoma cells, primary cells, continuous cell lines, stem cells,
plant cells, yeast cells, filamentous fungal cells, and/or
genetically engineered cells, such as recombinant cells expressing
and displaying a glycosylated immunoglobulin.
[0117] In a further embodiment, lower eukaryotes such as yeast or
filamentous fungi are used for expression and display of the
immunoglobulins because they can be economically cultured, give
high yields, and when appropriately modified are capable of
suitable glycosylation. Yeast particularly offers established
genetics allowing for rapid transfections, tested protein
localization strategies and facile gene knock-out techniques.
Suitable vectors have expression control sequences, such as
promoters, including 3-phosphoglycerate kinase or other glycolytic
enzymes, and an origin of replication, termination sequences and
the like as desired.
[0118] Host cells useful in the present invention include Pichia
pastoris, Pichia finlandica, Pichia trehalophila, Pichia koclamae,
Pichia membranaefaciens, Pichia minuta (Ogataea minuta, Pichia
lindneri), Pichia opuntiae, Pichia thermotolerans, Pichia
salictaria, Pichia guercuum, Pichia pijperi, Pichia stiptis, Pichia
methanolica, Pichia sp., Saccharomyces cerevisiae, Saccharomyces
sp., Hansenula polymorpha, Kluyveromyces sp., Kluyveromyces lactis,
Candida albicans, Aspergillus nidulans, Aspergillus niger,
Aspergillus oryzae, Trichoderma reesei, Chrysosporium lucknowense,
Fusarium sp., Fusarium gramineum, Fusarium venenatum and Neurospora
crassa. Various yeasts, such as K. lactis, Pichia pastoris, Pichia
methanolica, and Hansenula polymorpha are particularly suitable for
cell culture because they are able to grow to high cell densities
and secrete large quantities of recombinant protein. Likewise,
filamentous fungi, such as Aspergillus niger, Fusarium sp,
Neurospora crassa and others can be used to produce glycoproteins
of the invention at an industrial scale. In the case of lower
eukaryotes, cells are routinely grown from between about 1.5 to 3
days under conditions that induce expression of the capture moiety.
The induction of immunoglobulin expression while inhibiting
expression of the capture moiety is for about 1 to 2 days.
Afterwards, the cells are analyzed for those cells that display the
immunoglobulin of interest.
[0119] Lower eukaryotes, particularly yeast and filamentous fungi,
can be genetically modified so that they express glycoproteins in
which the glycosylation pattern is human-like or humanized. In this
manner, glycoprotein compositions can be produced in which a
specific desired glycoform is predominant in the composition. Such
can be achieved by eliminating selected endogenous glycosylation
enzymes and/or genetically engineering the host cells and/or
supplying exogenous enzymes to mimic all or part of the mammalian
glycosylation pathway as described in US 2004/0018590. If desired,
additional genetic engineering of the glycosylation can be
performed, such that the glycoprotein can be produced with or
without core fucosylation. Use of lower eukaryotic host cells is
further advantageous in that these cells are able to produce highly
homogenous compositions of glycoprotein, such that the predominant
glycoform of the glycoprotein may be present as greater than thirty
mole percent of the glycoprotein in the composition. In particular
aspects, the predominant glycoform may be present in greater than
forty mole percent, fifty mole percent, sixty mole percent, seventy
mole percent and, most preferably, greater than eighty mole percent
of the glycoprotein present in the composition.
[0120] Lower eukaryotes, particularly yeast, can be genetically
modified so that they express glycoproteins in which the
glycosylation pattern is human-like or humanized. Such can be
achieved by eliminating selected endogenous glycosylation enzymes
and/or supplying exogenous enzymes as described by Gerngross et
al., US20040018590. For example, a host cell can be selected or
engineered to be depleted in 1,6-mannosyl transferase activities,
which would otherwise add mannose residues onto the N-glycan on a
glycoprotein.
[0121] In one embodiment, the host cell further includes an
.alpha.1,2-mannosidase catalytic domain fused to a cellular
targeting signal peptide not normally associated with the catalytic
domain and selected to target the .alpha.1,2-mannosidase activity
to the ER or Golgi apparatus of the host cell. Passage of a
recombinant glycoprotein through the ER or Golgi apparatus of the
host cell produces a recombinant glycoprotein comprising a
Man.sub.5GlcNAc.sub.2 glycoform, for example, a recombinant
glycoprotein composition comprising predominantly a
Man.sub.5GlcNAc.sub.2 glycoform. For example, U.S. Pat. No.
7,029,872 and U.S. Published Patent Application Nos. 2004/0018590
and 2005/0170452 disclose lower eukaryote host cells capable of
producing a glycoprotein comprising a Man.sub.5GlcNAc.sub.2
glycoform.
[0122] In a further embodiment, the immediately preceding host cell
further includes a GlcNAc transferase I (GnT I) catalytic domain
fused to a cellular targeting signal peptide not normally
associated with the catalytic domain and selected to target GlcNAc
transferase I activity to the ER or Golgi apparatus of the host
cell. Passage of the recombinant glycoprotein through the ER or
Golgi apparatus of the host cell produces a recombinant
glycoprotein comprising a GlcNAcMan.sub.5GlcNAc.sub.2 glycoform,
for example a recombinant glycoprotein composition comprising
predominantly a GlcNAcMan.sub.5GlcNAc.sub.2 glycoform. U.S. Pat.
No. 7,029,872 and U.S. Published Patent Application Nos.
2004/0018590 and 2005/0170452 disclose lower eukaryote host cells
capable of producing a glycoprotein comprising a
GlcNAcMan.sub.5GlcNAc.sub.2 glycoform. The glycoprotein produced in
the above cells can be treated in vitro with a hexaminidase to
produce a recombinant glycoprotein comprising a
Man.sub.5GlcNAc.sub.2 glycoform.
[0123] In a further embodiment, the immediately preceding host cell
further includes a mannosidase II catalytic domain fused to a
cellular targeting signal peptide not normally associated with the
catalytic domain and selected to target mannosidase II activity to
the ER or Golgi apparatus of the host cell. Passage of the
recombinant glycoprotein through the ER or Golgi apparatus of the
host cell produces a recombinant glycoprotein comprising a
GlcNAcMan.sub.3GlcNAc.sub.2 glycoform, for example a recombinant
glycoprotein composition comprising predominantly a
GlcNAcMan.sub.3GlcNAc.sub.2 glycoform. U.S. Pat. No. 7,029,872 and
U.S. Published Patent Application No. 2004/0230042 discloses lower
eukaryote host cells that express mannosidase II enzymes and are
capable of producing glycoproteins having predominantly a
GlcNAc.sub.2Man.sub.3GlcNAc.sub.2 glycoform. The glycoprotein
produced in the above cells can be treated in vitro with a
hexaminidase to produce a recombinant glycoprotein comprising a
Man.sub.3GlcNAc.sub.2 glycoform.
[0124] In a further embodiment, the immediately preceding host cell
further includes GlcNAc transferase II (GnT II) catalytic domain
fused to a cellular targeting signal peptide not normally
associated with the catalytic domain and selected to target GlcNAc
transferase II activity to the ER or Golgi apparatus of the host
cell. Passage of the recombinant glycoprotein through the ER or
Golgi apparatus of the host cell produces a recombinant
glycoprotein comprising a GlcNAc.sub.2Man.sub.3GlcNAc.sub.2
glycoform, for example a recombinant glycoprotein composition
comprising predominantly a GlcNAc.sub.2Man.sub.3GlcNAc.sub.2
glycoform. U.S. Pat. No. 7,029,872 and U.S. Published Patent
Application Nos. 2004/0018590 and 2005/0170452 disclose lower
eukaryote host cells capable of producing a glycoprotein comprising
a GlcNAc.sub.2Man.sub.3GlcNAc.sub.2 glycoform. The glycoprotein
produced in the above cells can be treated in vitro with a
hexaminidase to produce a recombinant glycoprotein comprising a
Man.sub.3GlcNAc.sub.2 glycoform.
[0125] In a further embodiment, the immediately preceding host cell
further includes a galactosyltransferase catalytic domain fused to
a cellular targeting signal peptide not normally associated with
the catalytic domain and selected to target galactosyltransferase
activity to the ER or Golgi apparatus of the host cell. Passage of
the recombinant glycoprotein through the ER or Golgi apparatus of
the host cell produces a recombinant glycoprotein comprising a
GalGlcNAc.sub.2Man.sub.3GlcNAc.sub.2 or
Gal.sub.2GlcNAc.sub.2Man.sub.3GlcNAc.sub.2 glycoform, or mixture
thereof for example a recombinant glycoprotein composition
comprising predominantly a GalGlcNAc.sub.2Man.sub.3GlcNAc.sub.2
glycoform or Gal.sub.2GlcNAc.sub.2Man.sub.3GlcNAc.sub.2 glycoform
or mixture thereof. U.S. Pat. No. 7,029,872 and U.S. Published
Patent Application No. 2006/0040353 discloses lower eukaryote host
cells capable of producing a glycoprotein comprising a
Gal.sub.2GlcNAc.sub.2Man.sub.3GlcNAc.sub.2 glycoform. The
glycoprotein produced in the above cells can be treated in vitro
with a galactosidase to produce a recombinant glycoprotein
comprising a GlcNAc.sub.2Man.sub.3GlcNAc.sub.2 glycoform, for
example a recombinant glycoprotein composition comprising
predominantly a GlcNAc.sub.2Man.sub.3GlcNAc.sub.2 glycoform.
[0126] In a further embodiment, the immediately preceding host cell
further includes a sialyltransferase catalytic domain fused to a
cellular targeting signal peptide not normally associated with the
catalytic domain and selected to target sialytransferase activity
to the ER or Golgi apparatus of the host cell. Passage of the
recombinant glycoprotein through the ER or Golgi apparatus of the
host cell produces a recombinant glycoprotein comprising
predominantly a
NANA.sub.2Gal.sub.2GlcNAc.sub.2Man.sub.3GlcNAc.sub.2 glycoform or
NANAGal.sub.2GlcNAc.sub.2Man.sub.3GlcNAc.sub.2 glycoform or mixture
thereof. For lower eukaryote host cells such as yeast and
filamentous fungi, it is useful that the host cell further include
a means for providing CMP-sialic acid for transfer to the N-glycan.
U.S. Published Patent Application No. 2005/0260729 discloses a
method for genetically engineering lower eukaryotes to have a
CMP-sialic acid synthesis pathway and U.S. Published Patent
Application No. 2006/0286637 discloses a method for genetically
engineering lower eukaryotes to produce sialylated glycoproteins.
The glycoprotein produced in the above cells can be treated in
vitro with a neuraminidase to produce a recombinant glycoprotein
comprising predominantly a
Gal.sub.2GlcNAc.sub.2Man.sub.3GlcNAc.sub.2 glycoform or
GalGlcNAc.sub.2Man.sub.3GlcNAc.sub.2 glycoform or mixture
thereof.
[0127] Any one of the preceding host cells can further include one
or more GlcNAc transferase selected from the group consisting of
GnT III, GnT IV, GnT V, GnT VI, and GnT IX to produce glycoproteins
having bisected (GnT III) and/or multiantennary (GnT IV, V, VI, and
IX) N-glycan structures such as disclosed in U.S. Published Patent
Application Nos. 2004/074458 and 2007/0037248.
[0128] In further embodiments, the host cell that produces
glycoproteins that have predominantly GlcNAcMan.sub.5GlcNAc.sub.2
N-glycans further includes a galactosyltransferase catalytic domain
fused to a cellular targeting signal peptide not normally
associated with the catalytic domain and selected to target
Galactosyltransferase activity to the ER or Golgi apparatus of the
host cell. Passage of the recombinant glycoprotein through the ER
or Golgi apparatus of the host cell produces a recombinant
glycoprotein comprising predominantly the
GalGlcNAcMan.sub.5GlcNAc.sub.2 glycoform.
[0129] In a further embodiment, the immediately preceding host cell
that produced glycoproteins that have predominantly the
GalGlcNAcMan.sub.5GlcNAc.sub.2 N-glycans further includes a
sialyltransferase catalytic domain fused to a cellular targeting
signal peptide not normally associated with the catalytic domain
and selected to target sialytransferase activity to the ER or Golgi
apparatus of the host cell. Passage of the recombinant glycoprotein
through the ER or Golgi apparatus of the host cell produces a
recombinant glycoprotein comprising a
NANAGalGlcNAcMan.sub.5GlcNAc.sub.2 glycoform.
[0130] Various of the preceding host cells further include one or
more sugar transporters such as UDP-GlcNAc transporters (for
example, Kluyveromyces lactis and Mus musculus UDP-GlcNAc
transporters), UDP-galactose transporters (for example, Drosophila
melanogaster UDP-galactose transporter), and CMP-sialic acid
transporter (for example, human sialic acid transporter). Because
lower eukaryote host cells such as yeast and filamentous fungi lack
the above transporters, it is preferable that lower eukaryote host
cells such as yeast and filamentous fungi be genetically engineered
to include the above transporters.
[0131] Host cells further include lower eukaryote cells (e.g.,
yeast such as Pichia pastoris) that are genetically engineered to
eliminate glycoproteins having .alpha.-mannosidase-resistant
N-glycans by deleting or disrupting one or more of the
.beta.-mannosyltransferase genes (e.g., BMT1, BMT2, BMT3, and
BMT4)(See, U.S. Published Patent Application No. 2006/0211085) and
glycoproteins having phosphomannose residues by deleting or
disrupting one or both of the phosphomannosyl transferase genes
PNO1 and MNN4B (See for example, U.S. Pat. Nos. 7,198,921 and
7,259,007), which in further aspects can also include deleting or
disrupting the MNN4A gene. Disruption includes disrupting the open
reading frame encoding the particular enzymes or disrupting
expression of the open reading frame or abrogating translation of
RNAs encoding one or more of the .beta.-mannosyltransferases and/or
phosphomannosyltransferases using interfering RNA, antisense RNA,
or the like. The host cells can further include any one of the
aforementioned host cells modified to produce particular N-glycan
structures.
[0132] Host cells further include lower eukaryote cells (e.g.,
yeast such as Pichia pastoris) that are genetically modified to
control O-glycosylation of the glycoprotein by deleting or
disrupting one or more of the protein O-mannosyltransferase
(Dol-P-Man:Protein (Ser/Thr) Mannosyl Transferase genes) (PMTs)
(See U.S. Pat. No. 5,714,377) or grown in the presence of Pmtp
inhibitors and/or an alpha-mannosidase as disclosed in Published
International Application No. WO 2007061631, or both. Disruption
includes disrupting the open reading frame encoding the Pmtp or
disrupting expression of the open reading frame or abrogating
translation of RNAs encoding one or more of the Pmtps using
interfering RNA, antisense RNA, or the like. The host cells can
further include any one of the aforementioned host cells modified
to produce particular N-glycan structures.
[0133] Pmtp inhibitors include but are not limited to a benzylidene
thiazolidinediones. Examples of benzylidene thiazolidinediones that
can be used are 5-[[3,4-bis(phenylmethoxy)
phenyl]methylene]-4-oxo-2-thioxo-3-thiazolidineacetic Acid;
5-[[3-(1-Phenylethoxy)-4-(2-phenylethoxy)]phenyl]methylene]-4-oxo-2-thiox-
o-3-thiazolidineacetic Acid; and
5-[[3-(1-Phenyl-2-hydroxy)ethoxy)-4-(2-phenylethoxy)]phenyl]methylene]-4--
oxo-2-thioxo-3-thiazolidineacetic Acid.
[0134] In particular embodiments, the function or expression of at
least one endogenous PMT gene is reduced, disrupted, or deleted.
For example, in particular embodiments the function or expression
of at least one endogenous PMT gene selected from the group
consisting of the PMT1, PMT2, PMT3, and PMT4 genes is reduced,
disrupted, or deleted; or the host cells are cultivated in the
presence of one or more PMT inhibitors. In further embodiments, the
host cells include one or more PMT gene deletions or disruptions
and the host cells are cultivated in the presence of one or more
Pmtp inhibitors. In particular aspects of these embodiments, the
host cells also express a secreted alpha-1,2-mannosidase.
[0135] PMT deletions or disruptions and/or Pmtp inhibitors control
O-glycosylation by reducing O-glycosylation occupancy, that is by
reducing the total number of O-glycosylation sites on the
glycoprotein that are glycosylated. The further addition of an
alpha-1,2-mannsodase that is secreted by the cell controls
O-glycosylation by reducing the mannose chain length of the
O-glycans that are on the glycoprotein. Thus, combining PMT
deletions or disruptions and/or Pmtp inhibitors with expression of
a secreted alpha-1,2-mannosidase controls O-glycosylation by
reducing occupancy and chain length. In particular circumstances,
the particular combination of PMT deletions or disruptions, Pmtp
inhibitors, and alpha-1,2-mannosidase is determined empirically as
particular heterologous glycoproteins (antibodies, for example) may
be expressed and transported through the Golgi apparatus with
different degrees of efficiency and thus may require a particular
combination of PMT deletions or disruptions, Pmtp inhibitors, and
alpha-1,2-mannosidase. In another aspect, genes encoding one or
more endogenous mannosyltransferase enzymes are deleted. This
deletion(s) can be in combination with providing the secreted
alpha-1,2-mannosidase and/or PMT inhibitors or can be in lieu of
providing the secreted alpha-1,2-mannosidase and/or PMT
inhibitors.
[0136] Thus, the control of O-glycosylation can be useful for
producing particular glycoproteins in the host cells disclosed
herein in better total yield or in yield of properly assembled
glycoprotein. The reduction or elimination of O-glycosylation
appears to have a beneficial effect on the assembly and transport
of whole antibodies as they traverse the secretory pathway and are
transported to the cell surface. Thus, in cells in which
O-glycosylation is controlled, the yield of properly assembled
antibodies fragments is increased over the yield obtained in host
cells in which O-glycosylation is not controlled.
[0137] In addition, O-glycosylation may have an effect on an
antibody's affinity and/or avidity for an antigen. This can be
particularly significant when the ultimate host cell for production
of the antibody is not the same as the host cell that was used for
selecting the antibody. For example, O-glycosylation might
interfere with an antibody's affinity for an antigen, thus an
antibody that might otherwise have high affinity for an antigen
might not be identified because O-glycosylation may interfere with
the ability of the antibody to bind the antigen. In other cases, an
antibody that has high avidity for an antigen might not be
identified because O-glycosylation interferes with the antibody's
avidity for the antigen. In the preceding two cases, an antibody
that might be particularly effective when produced in a mammalian
cell line might not be identified because the host cells for
identifying and selecting the antibody was of another cell type,
for example, a yeast or fungal cell (e.g., a Pichia pastoris host
cell). It is well known that O-glycosylation in yeast can be
significantly different from O-glycosylation in mammalian cells.
This is particularly relevant when comparing wild type yeast
o-glycosylation with mucin-type or dystroglycan type
O-glycosylation in mammals. In particular cases, O-glycosylation
might enhance the antibody's affinity or avidity for an antigen
instead of interfere. This effect is undesirable when the
production host cell is to be different from the host cell used to
identify and select the antibody (for example, identification and
selection is done in yeast and the production host is a mammalian
cell) because in the production host the O-glycosylation will no
longer be of the type that caused the enhanced affinity or avidity
for the antigen. Therefore, controlling O-glycosylation can enable
use of the materials and methods herein to identify and select
antibodies with specificity for a particular antigen based upon
affinity or avidity of the antibody for the antigen without
identification and selection of the antibody being influenced by
the O-glycosylation system of the host cell. Thus, controlling
O-glycosylation further enhances the usefulness of yeast or fungal
host cells to identify and select antibodies that will ultimately
be produced in a mammalian cell line.
[0138] Yield of antibodies can in some situations be improved by
overexpressing nucleic acid molecules encoding mammalian or human
chaperone proteins or replacing the genes encoding one or more
endogenous chaperone proteins with nucleic acid molecules encoding
one or more mammalian or human chaperone proteins. In addition, the
expression of mammalian or human chaperone proteins in the host
cell also appears to control O-glycosylation in the cell. Thus,
further included are the host cells herein wherein the function of
at least one endogenous gene encoding a chaperone protein has been
reduced or eliminated, and a vector encoding at least one mammalian
or human homolog of the chaperone protein is expressed in the host
cell. Also included are host cells in which the endogenous host
cell chaperones and the mammalian or human chaperone proteins are
expressed. In further aspects, the lower eukaryotic host cell is a
yeast or filamentous fungi host cell. Examples of the use of
chaperones of host cells in which human chaperone proteins are
introduced to improve the yield and reduce or control
O-glycosylation of recombinant proteins has been disclosed in U.S.
Provisional Application Nos. 61/066,409 filed Feb. 20, 2008 and
61/188,723 filed Aug. 12, 2008. Like above, further included are
lower eukaryotic host cells wherein, in addition to replacing the
genes encoding one or more of the endogenous chaperone proteins
with nucleic acid molecules encoding one or more mammalian or human
chaperone proteins or overexpressing one or more mammalian or human
chaperone proteins as described above, the function or expression
of at least one endogenous gene encoding a protein
O-mannosyltransferase (PMT) protein is reduced, disrupted, or
deleted. In particular embodiments, the function of at least one
endogenous PMT gene selected from the group consisting of the PMT1,
PMT2, PMT3, and PMT4 genes is reduced, disrupted, or deleted.
[0139] Therefore, the methods disclose herein can use any host cell
that has been genetically modified to produce glycoproteins that
have no N-glycans compositions wherein the predominant N-glycan is
selected from the group consisting of complex N-glycans, hybrid
N-glycans, and high mannose N-glycans wherein complex N-glycans are
selected from the group consisting of Man.sub.3GlcNAc.sub.2,
GlcNAC.sub.(1-4)Man.sub.3GlcNAc.sub.2,
Gal.sub.(1-4)GlcNAc.sub.(1-4)Man.sub.3GlcNAc.sub.2, and
NANA.sub.(1-4)Gal.sub.(1-4)Man.sub.3GlcNAc.sub.2; hybrid N-glycans
are selected from the group consisting of Man.sub.5GlcNAc.sub.2,
GlcNAcMan.sub.5GlcNAc.sub.2, GalGlcNAcMan.sub.5GlcNAc.sub.2, and
NANAGalGlcNAcMan.sub.5GlcNAc.sub.2; and high Mannose N-glycans are
selected from the group consisting of Man.sub.6GlcNAc.sub.2,
Man.sub.7GlcNAc.sub.2, Man.sub.8GlcNAc.sub.2, and
Man.sub.9GlcNAc.sub.2. In particular aspects, the composition of
N-glycans comprises about 39% GlcNAC.sub.2Man.sub.3GlcNAc.sub.2;
40% Gal.sub.1GlcNAC.sub.2Man.sub.3GlcNAc.sub.2; and 6%
Gal.sub.2GlcNAC.sub.2Man.sub.3GlcNAc.sub.2 or about 60%
GlcNAC.sub.2Man.sub.3GlcNAc.sub.2; 17%
Gal.sub.1GlcNAC.sub.2Man.sub.3GlcNAc.sub.2; and 5%
Gal.sub.2GlcNAC.sub.2Man.sub.3GlcNAc.sub.2, or mixtures in
between.
[0140] In the above embodiments in which the yeast cell does not
display 1,6-mannosyl transferase activity (that is, the OCH1 gene
encoding och1p has been disrupted or deleted), the host cell is not
capable of mating. Thus, depending on the efficiency of
transformation, the potential library diversity of light chains and
heavy chains appears to be limited to a heavy chain library of
between about 10.sup.3 to 10.sup.6 diversity and a light chain
library of about 10.sup.3 to 10.sup.6 diversity. However, in a
yeast host cell that is capable of mating, the diversity can be
increased to about 10.sup.6 to 10.sup.12 because the host cells
expressing the heavy chain library can be mated to host cells
expressing the light chain library to produce host cells that
express heavy chain/light chain library. Therefore, in particular
embodiments, the host cell is a yeast cell such as Pichia pastoris
that displays 1,6-mannosyl transferase activities (that is, has an
OCH1 gene encoding a function och1p) but which is modified as
described herein to display antibodies or fragments thereof on the
cell surface. In these embodiments, the host cell can be a host
cell with its native glycosylation pathway.
[0141] Yeast selectable markers that can be used in the present
invention include drug resistance markers and genetic functions
which allow the yeast host cell to synthesize essential cellular
nutrients, e.g. amino acids. Drug resistance markers which are
commonly used in yeast include chloramphenicol, kanamycin,
methotrexate, G418 (geneticin), Zeocin, and the like. Genetic
functions which allow the yeast host cell to synthesize essential
cellular nutrients are used with available yeast strains having
auxotrophic mutations in the corresponding genomic function. Common
yeast selectable markers provide genetic functions for synthesizing
leucine (LEU2), tryptophan (TRP1 and TRP2), proline (PRO1), uracil
(URA3, URA5, URA6), histidine (HIS3), lysine (LYS2), adenine (ADE1
or ADE2), and the like. Other yeast selectable markers include the
ARR3 gene from S. cerevisiae, which confers arsenite resistance to
yeast cells that are grown in the presence of arsenite (Bobrowicz
et al., Yeast, 13:819-828 (1997); Wysocki et al., J. Biol. Chem.
272:30061-30066 (1997)). A number of suitable integration sites
include those enumerated in U.S. Published application No.
2007/0072262 and include homologs to loci known for Saccharomyces
cerevisiae and other yeast or fungi. Methods for integrating
vectors into yeast are well known, for example, see U.S. Pat. No.
7,479,389, WO2007136865, and PCT/US2008/13719. Examples of
insertion sites include, but are not limited to, Pichia ADE genes;
Pichia TRP (including TRP1 through TRP2) genes; Pichia MCA genes;
Pichia CYM genes; Pichia PEP genes; Pichia PRB genes; and Pichia
LEU genes. The Pichia ADE1 and ARG4 genes have been described in
Lin Cereghino et al., Gene 263:159-169 (2001) and U.S. Pat. No.
4,818,700, the HIS3 and TRP1 genes have been described in Comm et
al., Yeast 14:861-867 (1998), HIS4 has been described in GenBank
Accession No. X56180.
[0142] In embodiments that express whole antibodies, the nucleic
acid molecule encoding the antibody or heavy chain fragment thereof
is modified to replace the codon encoding an asparagine residue at
position 297 of the molecule (the glycosylation site) with a codon
encoding any other amino acid residue. Thus, the antibody that is
produced in the host cell is not glycosylated. In this embodiment,
the host cell displaying the heavy chain library is mated to the
host cell displaying the light chain library and the resulting
combinatorial library is screened as taught herein. Because the
antibodies lack N-glycosylation, the non-human yeast N-glycans of
the host cell which might interfere with antibody affinity for a
desired antigen are not present on the recombinant antibodies.
Cells producing antibodies that have desired affinity for an
antigen of interest are selected. The nucleic acid molecules
encoding the heavy and light chains of the antibody thereof are
removed from the cells and the nucleic acid molecule encoding the
heavy chain is modified to reintroduce an asparagine residue at
position 297. This enables appropriate human-like glycosylation at
position 297 of the antibody or fragment thereof when the nucleic
acid molecule encoding the antibody thereof is introduced into a
host cell that has been engineered to make glycoproteins that have
hybrid or complex N-glycans as discussed previously.
[0143] The cell systems used for recombinant expression and display
of the immunoglobulin can also be any higher eukaryote cell,
tissue, organism from the animal kingdom, for example transgenic
goats, transgenic rabbits, CHO cells, insect cells, and human cell
lines. Examples of animal cells include, but are not limited to,
SC-I cells, LLC-MK cells, CV-I cells, CHO cells, COS cells, murine
cells, human cells, HeLa cells, 293 cells, VERO cells, MDBK cells,
MDCK cells, MDOK cells, CRFK cells, RAF cells, TCMK cells, LLC-PK
cells, PK15 cells, WI-38 cells, MRC-5 cells, T-FLY cells, BHK
cells, SP2/0, NSO cells, and derivatives thereof. Insect cells
include cells of Drosophila melanogaster origin. These cells can be
genetically engineered to render the cells capable of making
immunoglobulins that have particular or predominantly particular
N-glycans. For example, U.S. Pat. No. 6,949,372 discloses methods
for making glycoproteins in insect cells that are sialylated.
Yamane-Ohnuki et al. Biotechnol. Bioeng. 87: 614-622 (2004), Kanda
et al., Biotechnol. Bioeng. 94: 680-688 (2006), Kanda et al.,
Glycobiol. 17: 104-118 (2006), and U.S. Pub. Application Nos.
2005/0216958 and 2007/0020260 disclose mammalian cells that are
capable of producing immunoglobulins in which the N-glycans thereon
lack fucose or have reduced fucose.
[0144] In particular embodiments, the higher eukaryote cell,
tissue, organism can also be from the plant kingdom, for example,
wheat, rice, corn, tobacco, and the like. Alternatively, bryophyte
cells can be selected, for example from species of the genera
Physcomitrella, Funaria, Sphagnum, Ceratodon, Marchantia, and
Sphaerocarpos. Exemplary of plant cells is the bryophyte cell of
Physcomitrella patens, which has been disclosed in WO 2004/057002
and WO2008/006554. Expression systems using plant cells can further
manipulated to have altered glycosylation pathways to enable the
cells to produce immunoglobulins that have predominantly particular
N-glycans. For example, the cells can be genetically engineered to
have a dysfunctional or no core fucosyltransferase and/or a
dysfunctional or no xylosyltransferase, and/or a dysfunctional or
no .beta.1,4-galactosyltransferase. Alternatively, the galactose,
fucose and/or xylose can be removed from the immunoglobulin by
treatment with enzymes removing the residues. Any enzyme resulting
in the release of galactose, fucose and/or xylose residues from
N-glycans which are known in the art can be used, for example
.alpha.-galactosidase, .beta.-xylosidase, and .alpha.-fucosidase.
Alternatively an expression system can be used which synthesizes
modified N-glycans which can not be used as substrates by
1,3-fucosyltransferase and/or 1,2-xylosyltransferase, and/or
1,4-galactosyltransferase. Methods for modifying glycosylation
pathways in plant cells has been disclosed in U.S. Published
Application No. 2004/0018590.
[0145] The methods disclosed herein can be adapted for use in
mammalian, insect, and plant cells. The regulatable promoters
selected for regulating expression of the expression cassettes in
mammalian, insect, or plant cells should be selected for
functionality in the cell-type chosen. Examples of suitable
regulatable promoters include but are not limited to the
tetracycline-regulatable promoters (See for example, Berens &
Hillen, Eur. J. Biochem. 270: 3109-3121 (2003)), RU 486-inducible
promoters, ecdysone-inducible promoters, and kanamycin-regulatable
systems. These promoters can replace the promoters exemplified in
the expression cassettes described in the examples. The capture
moiety can be fused to a cell surface anchoring protein suitable
for use in the cell-type chosen. Cell surface anchoring proteins
including GPI proteins are well known for mammalian, insect, and
plant cells. GPI-anchored fusion proteins has been described by
Kennard et al., Methods Biotechnol. Vo. 8: Animal Cell
Biotechnology (Ed. Jenkins. Human Press, Inc., Totowa, N.J.) pp.
187-200 (1999). The genome targeting sequences for integrating the
expression cassettes into the host cell genome for making stable
recombinants can replace the genome targeting and integration
sequences exemplified in the examples. Transfection methods for
making stable and transiently transfected mammalian, insect, plant
host cells are well known in the art. Once the transfected host
cells have been constructed as disclosed herein, the cells can be
screened for expression of the immunoglobulin of interest and
selected as disclosed herein.
[0146] The present invention also encompasses kits containing the
expression and helper vectors of this invention in suitable
packaging. Each kit necessarily comprises the reagents which render
the delivery of vectors into a host cell possible. The selection of
reagents that facilitate delivery of the vectors may vary depending
on the particular transfection or infection method used. The kits
may also contain reagents useful for generating labeled
polynucleotide probes or proteinaceous probes for detection of
exogenous sequences and the protein product. Each reagent can be
supplied in a solid form or dissolved/suspended in a liquid buffer
suitable for inventory storage, and later for exchange or addition
into the reaction medium when the experiment is performed. Suitable
packaging is provided. The kit can optionally provide additional
components that are useful in the procedure. These optional
components include, but are not limited to, buffers, capture
reagents, developing reagents, labels, reacting surfaces, means for
detection, control samples, instructions, and interpretive
information.
[0147] All publications, patents, and other references mentioned
herein are hereby incorporated by reference in their
entireties.
[0148] The following examples are intended to promote a further
understanding of the present invention.
Example 1
[0149] Utility of the invention was demonstrated using Pichia
pastoris as a model. The glycoengineered Pichia pastoris strain
yGLY2696 was the background strain used. In strain yGLY2696, the
gene encoding the endogenous PDI replaced with a nucleic acid
molecule encoding the human PDI and a nucleic acid molecule
encoding the human GRP94 protein inserted into the PEP4 locus. The
strain was further engineered to alter the endogenous glycosylation
pathway to produce glycoproteins that have predominantly
Man.sub.5GlcNAc.sub.2 N-glycans. Strain YGLY2696 has been disclosed
in co-pending Application Ser. No. 61/066,409, filed 20 Feb. 2008.
This strain was shown to be useful for producing immunoglobulins
and for producing immunoglobulins that have reduced
O-glycosylation. Construction of strain yGLY2696 involved the
following steps.
[0150] Construction of expression/integration plasmid vector
pGLY642 comprising an expression cassette encoding the human PDI
protein and nucleic acid molecules to target the plasmid vector to
the Pichia pastoris PDI1 locus for replacement of the gene encoding
the Pichia pastoris PDI1 with a nucleic acid molecule encoding the
human PDI was as follows and is shown in FIG. 8. cDNA encoding the
human PDI1 was amplified by PCR using the primers hPDI/UP1: 5'
AGCGC TGACG CCCCC GAGGA GGAGG ACCAC 3' (SEQ ID NO: 1) and
hPDI/LP-PacI: 5' CCTTA ATTAA TTACA GTTCA TCATG CACAG CTTTC TGATC AT
3' (SEQ ID NO: 2), Pfu turbo DNA polymerase (Stratagene, La Jolla,
Calif.), and a human liver cDNA (BD Bioscience, San Jose, Calif.).
The PCR conditions were 1 cycle of 95.degree. C. for two minutes,
25 cycles of 95.degree. C. for 20 seconds, 58.degree. C. for 30
seconds, and 72.degree. C. for 1.5 minutes, and followed by one
cycle of 72.degree. C. for 10 minutes. The resulting PCR product
was cloned into plasmid vector pCR2.1 to make plasmid vector
pGLY618. The nucleotide and amino acid sequences of the human PDI1
(SEQ ID NOs:39 and 40, respectively) are shown in Table 1.
[0151] The nucleotide and amino acid sequences of the Pichia
pastoris PDI1 (SEQ ID NOs:41 and 42, respectively) are shown in
Table 1. Isolation of nucleic acid molecules comprising the Pichia
pastoris PDI1 5' and 3' regions was performed by PCR amplification
of the regions from Pichia pastoris genomic DNA. The 5' region was
amplified using primers PB248: 5' ATGAA TTCAG GCCAT ATCGG CCATT
GTTTA CTGTG CGCCC ACAGT AG 3' (SEQ ID NO: 3); PB249: 5' ATGTT TAAAC
GTGAG GATTA CTGGT GATGA AAGAC 3' (SEQ ID NO: 4). The 3' region was
amplified using primers PB250: 5' AGACT AGTCT ATTTG GAGAC ATTGA
CGGAT CCAC 3' (SEQ ID NO: 5); PB251: 5' ATCTC GAGAG GCCAT GCAGG
CCAAC CACAA GATGA ATCAA ATTTT G-3' (SEQ ID NO: 6). Pichia pastoris
strain NRRL-11430 genomic DNA was used for PCR amplification. The
PCR conditions were one cycle of 95.degree. C. for two minutes, 25
cycles of 95.degree. C. for 30 seconds, 55.degree. C. for 30
seconds, and 72.degree. C. for 2.5 minutes, and followed by one
cycle of 72.degree. C. for 10 minutes. The resulting PCR fragments,
PpPDI1 (5') and PpPDI1 (3'), were separately cloned into plasmid
vector pCR2.1 to make plasmid vectors pGLY620 and pGLY617,
respectively. To construct pGLY678, DNA fragments PpARG3-5' and
PpARG-3' of integration plasmid vector pGLY24, which targets the
plasmid vector to Pichia pastoris ARG3 locus, were replaced with
DNA fragments PpPDI (5') and PpPDI (3'), respectively, which
targets the plasmid vector pGLY678 to the PDI1 locus and disrupts
expression of the PDI1 locus.
[0152] The nucleic acid molecule encoding the human PDI was then
cloned into plasmid vector pGLY678 to produce plasmid vector
pGLY642 in which the nucleic acid molecule encoding the human PDI
was placed under the control of the Pichia pastoris GAPDH promoter
(PpGAPDH). Expression/integration plasmid vector pGLY642 was
constructed by ligating a nucleic acid molecule encoding the
Saccharomyces cerevisiae alpha mating factor (MF) presequence
signal peptide (Sc.alpha.MFpre-signal peptide) having a NotI
restriction enzyme site at the 5' end and a blunt 3' end and the
expression cassette comprising the nucleic acid molecule encoding
the human PDI released from plasmid vector pGLY618 with AfeI and
PacI to produce a nucleic acid molecule having a blunt 5' end and a
PacI site at the 3' end into plasmid vector pGLY678 digested with
NotI and PacI. The resulting integration/expression plasmid vector
pGLY642 comprises an expression cassette encoding a human
PDI1/Sc.alpha.MFpre-signal peptide fusion protein operably linked
to the Pichia pastoris promoter and nucleic acid molecule sequences
to target the plasmid vector to the Pichia pastoris PDI1 locus for
disruption of the PDI1 locus and integration of the expression
cassette into the PDI1 locus. FIG. 2 illustrates the construction
of plasmid vector pGLY642. The nucleotide and amino acid sequences
of the Sc.alpha.MFpre-signal peptide are shown in SEQ ID NOs:27 and
28, respectively.
[0153] Construction of expression/integration vector pGLY2233
encoding the human GRP94 protein was as follows and is shown in
FIG. 3. The human GRP94 was PCR amplified from human liver cDNA (BD
Bioscience) with the primers hGRP94/UP1: 5'-AGCGC TGACG ATGAA GTTGA
TGTGG ATGGT ACAGT AG-3; (SEQ ID NO: 15); and hGRP94/LP1: 5'-GGCCG
GCCTT ACAAT TCATC ATGTT CAGCT GTAGA TTC 3; (SEQ ID NO: 16). The PCR
conditions were one cycle of 95.degree. C. for two minutes, 25
cycles of 95.degree. C. for 20 seconds, 55.degree. C. for 20
seconds, and 72.degree. C. for 2.5 minutes, and followed by one
cycle of 72.degree. C. for 10 minutes. The PCR product was cloned
into plasmid vector pCR2.1 to make plasmid vector pGLY2216. The
nucleotide and amino acid sequences of the human GRP94 (SEQ ID
NOs:43 and 44, respectively) are shown in Table 1.
[0154] The nucleic acid molecule encoding the human GRP94 was
released from plasmid vector pGLY2216 with AfeI and FseI. The
nucleic acid molecule was then ligated to a nucleic acid molecule
encoding the Sc.alpha.MPpre-signal peptide having NotI and blunt
ends as above and plasmid vector pGLY2231 digested with NotI and
FseI carrying nucleic acid molecules comprising the Pichia pastoris
PEP4 5' and 3' regions (PpPEP4-5' and PpPEP4-3' regions,
respectively) to make plasmid vector pGLY2229. Plasmid vector
pGLY2229 was digested with BglII and NotI and a DNA fragment
containing the PpPDI1 promoter was removed from plasmid vector
pGLY2187 with BglII and NotI and the DNA fragment ligated into
pGLY2229 to make plasmid vector pGLY2233. Plasmid vector pGLY2233
encodes the human GRP94 fusion protein under control of the Pichia
pastoris PDI promoter and includes the 5' and 3' regions of the
Pichia pastoris PEP4 gene to target the plasmid vector to the PEP4
locus of genome for disruption of the PEP4 locus and integration of
the expression cassette into the PEP4 locus. FIG. 3 illustrates the
construction of plasmid vector pGLY2233.
[0155] Construction of plasmid vectors pGLY1162, pGLY1896, and
pGFI207t was as follows. All Trichoderma reesei
.alpha.-1,2-mannosidase expression plasmid vectors were derived
from pGFI165, which encodes the T. reesei .alpha.-1,2-mannosidase
catalytic domain (See published International Application No.
WO2007061631) fused to S. cerevisiae .alpha.MATpre signal peptide
(Sc.alpha.MPpre-signal peptide) herein expression is under the
control of the Pichia pastoris GAP promoter and wherein integration
of the plasmid vectors is targeted to the Pichia pastoris PRO1
locus and selection is using the Pichia pastoris URA5 gene. A map
of plasmid vector pGFI165 is shown in FIG. 4.
[0156] Plasmid vector pGLY1162 was made by replacing the GAP
promoter in pGFI165 with the Pichia pastoris AOX1 (PpAOX1)
promoter. This was accomplished by isolating the PpAOX1 promoter as
an EcoRI (made blunt)-BglII fragment from pGLY2028, and inserting
into pGFI165 that was digested with NotI (made blunt) and BglII.
Integration of the plasmid vector is to the Pichia pastoris PRO1
locus and selection is using the Pichia pastoris URA5 gene. A map
of plasmid vector pGLY1162 is shown in FIG. 5.
[0157] Plasmid vector pGLY1896 contains an expression cassette
encoding the mouse .alpha.-1,2-mannosidase catalytic domain fused
to the S. cerevisiae MNN2 membrane insertion leader peptide fusion
protein (See Choi et al., Proc. Natl. Acad. Sci. USA 100: 5022
(2003)) inserted into plasmid vector pGFI165 (FIG. 5). This was
accomplished by isolating the GAPp-ScMNN2-mouse MNSI expression
cassette from pGLY1433 digested with XhoI (and the ends made blunt)
and PmeI, and inserting the fragment into pGFI165 that digested
with PmeI. Integration of the plasmid vector is to the Pichia
pastoris PRO1 locus and selection is using the Pichia pastoris URA5
gene. A map of plasmid vector pGLY1896 is shown in FIG. 4.
[0158] Plasmid vector pGFI207t is similar to pGLY1896 except that
the URA5 selection marker was replaced with the S. cerevisiae ARR3
(ScARR3) gene, which confers resistance to arsenite. This was
accomplished by isolating the ScARR3 gene from pGFI166 digested
with AscI and the AscI ends made blunt) and BglII, and inserting
the fragment into pGLY1896 that digested with SpeI and the SpeI
ends made blunt and BglII. Integration of the plasmid vector is to
the Pichia pastoris PRO1 locus and selection is using the
Saccharomyces cerevisiae ARR3 gene. A map of plasmid vector
pGFI2007t is shown in FIG. 4. The ARR3 gene from S. cerevisiae
confers arsenite resistance to cells that are grown in the presence
of arsenite (Bobrowicz et al., Yeast, 13:819-828 (1997); Wysocki et
al., J. Biol. Chem. 272:30061-066 (1997)).
[0159] Yeast transfections with the above expression/integration
vectors were as follows. Pichia pastoris strains were grown in 50
mL YPD media (yeast extract (1%), peptone (2%), dextrose (2%))
overnight to an OD of between about 0.2 to 6. After incubation on
ice for 30 minutes, cells were pelleted by centrifugation at
2500-3000 rpm for 5 minutes. Media was removed and the cells washed
three times with ice cold sterile 1 M sorbitol before resuspending
in 0.5 ml ice cold sterile 1 M sorbitol. Ten .mu.L linearized DNA
(5-20 .mu.g) and 100 .mu.L cell suspension was combined in an
electroporation cuvette and incubated for 5 minutes on ice.
Electroporation was in a Bio-Rad GenePulser Xcell following the
preset Pichia pastoris protocol (2 kV, 25 .mu.F, 200.OMEGA.),
immediately followed by the addition of 1 mL YPDS recovery media
(YPD media plus 1 M sorbitol). The transfected cells were allowed
to recover for four hours to overnight at room temperature
(26.degree. C.) before plating the cells on selective media.
[0160] Generation of Cell Lines was as follows and is shown in FIG.
6. The strain yGLY24-1 (ura5.DELTA.::MET1 och1.DELTA.::lacZ
bmt2.DELTA.::lacZ/KlMNN2-2/mnn4L1.DELTA.::lacZ/MmSLC35A3
pno1.DELTA.mnn4 .DELTA.::lacZ met16.DELTA.::lacZ), was constructed
using methods described earlier (See for example, Nett and
Gerngross, Yeast 20:1279 (2003); Choi et al., Proc. Natl. Acad.
Sci. USA 100:5022 (2003); Hamilton et al., Science 301:1244
(2003)). The BMT2 gene has been disclosed in Mille et al., J. Biol.
Chem. 283: 9724-9736 (2008) and U.S. Published Application No.
20060211085. The PNO1 gene has been disclosed in U.S. Pat. No.
7,198,921 and the mnn4L1 gene (also referred to as mnn4b) has been
disclosed in U.S. Pat. No. 7,259,007. The mnn4 refers to mnn4L2 or
mnn4a. In the genotype, KlMNN2-2 is the Kluveromyces lactis GlcNAc
transporter and MmSLC35A3 is the Mus musculus GlcNAc transporter.
The URA5 deletion renders the yGLY24-1 strain auxotrophic for
uracil (See U.S. Published application No. 2004/0229306) and was
used to construct the humanized chaperone strains that follow.
While the various expression cassettes were integrated into
particular loci of the Pichia pastoris genome in the examples
herein, it is understood that the operation of the invention is
independent of the loci used for integration. Loci other than those
disclosed herein can be used for integration of the expression
cassettes. Suitable integration sites include those enumerated in
U.S. Published application No. 20070072262 and include homologs to
loci known for Saccharomyces cerevisiae and other yeast or
fungi.
[0161] Strains yGLY702 and yGLY704 were generated in order to test
the effectiveness of the human PDI1 expressed in Pichia pastoris
cells in the absence of the endogenous Pichia pastoris PDI gene.
Strains yGLY702 and yGLY704 (huPDI) were constructed as follows.
Strain yGLY702 was generated by transfecting yGLY24-1 with plasmid
vector pGLY642 containing the expression cassette encoding the
human PDI under control of the constitutive PpGAPDH promoter.
Plasmid vector pGLY642 also contained an expression cassette
encoding the Pichia pastoris URA5, which rendered strain yGLY702
prototrophic for uracil. The URA5 expression cassette was removed
by counterselecting yGLY702 on 5-FOA plates to produce strain
yGLY704 in which, so that the Pichia pastoris PDI1 gene has been
stably replaced by the human PDI gene and the strain is auxotrophic
for uracil.
[0162] Strain yGLY733 was generated by transfecting with plasmid
vector pGLY1162, which comprises an expression cassette that
encodes the Trichoderma Reesei mannosidase (TrMNS1) operably linked
to the Pichia pastoris AOX1 promoter (PpAOX1-TrMNS1), into the PRO1
locus of yGLY704. This strain has the gene encoding the Pichia
pastoris PD1 replaced with the expression cassette encoding the
human PDI1, has the PpAOX1-TrMNS1 expression cassette integrated
into the PRO1 locus, and is a URA5 auxotroph. The PpAOX1 promoter
allows overexpression when the cells are grown in the presence of
methanol.
[0163] Strain yGLY762 was constructed by integrating expression
cassettes encoding TrMNS1 and mouse mannosidase IA (MuMNS1A), each
operably linked to the Pichia pastoris GAPDH promoter in plasmid
vector pGFI207t into control strain yGLY733 at the 5' PRO1 locus
UTR in Pichia pastoris genome. This strain has the gene encoding
the Pichia pastoris PD1 replaced with the expression cassette
encoding the human PDI1, has the PpGAPDH-TrMNS1 and PpGAPDH-MuMNS1A
expression cassettes integrated into the PRO1 locus, and is a URA5
auxotroph.
[0164] Strain yGLY2677 was generated by counterselecting yGLY762 on
5-FOA plates. This strain has the gene encoding the Pichia pastoris
PD1 replaced with the expression cassette encoding the human PDI1,
has the PpAOX1-TrMNS1 expression cassette integrated into the PRO1
locus, has the PpGAPH-TrMNS1 and PpGAPDH-MuMNS1A expression
cassettes integrated into the PRO1 locus, and is a URA5
prototroph.
[0165] Strains yGLY2696 was generated by integrating plasmid vector
pGLY2233, which encodes the human GRP94 protein, into the PEP4
locus. This strain has the gene encoding the Pichia pastoris PD1
replaced with the expression cassette encoding the human PDI1, has
the PpAOX1-TrMNS1 expression cassette integrated into the PRO1
locus, has the PpGAPDH-TrMNS1 and PpGAPDH-MuMNS1A expression
cassettes integrated into the PRO1 locus, has the human GRP64
integrated into the PEP4 locus, and is a URA5 prototroph. The
genealogy of this chaperone-humanized strain is shown in FIG.
6.
Example 2
[0166] Expression vectors encoding an anti-Her2 antibody and an
anti-CD20 antibody were constructed as follows.
[0167] Expression/integration plasmid vector pGLY2988 contains
expression cassettes encoding the heavy and light chains of an
anti-Her2 antibody. Anti-Her2 heavy (HC) and light (LC) chains
fused at the N-terminus to .alpha.-MAT pre signal peptide were
synthesized by GeneArt AG. The nucleotide and amino acid sequences
for the .alpha.-amylase signal peptide are shown in SEQ ID NOs:27
and 28. Each was synthesized with unique 5' EcoR1 and 3' Fse1
sites. The nucleotide and amino acid sequences of the anti-Her2 HC
are shown in SEQ ID Nos:29 and 30, respectively. The nucleotide and
amino acid sequences of the anti-Her2 LC are shown in SEQ ID Nos:31
and 32, respectively. Both nucleic acid molecule fragments encoding
the HC and LC fusion proteins were separately subcloned using 5'
EcoR1 and 3' Fse1 unique sites into an expression plasmid vector
pGLY2198 (contains the Pichia pastoris TRP2 targeting nucleic acid
molecule and the Zeocin-resistance marker) to form plasmid vector
pGLY2987 and pGLY2338, respectively. The LC expression cassette
encoding the LC fusion protein under the control of the Pichia
pastoris AOX1 promoter and Saccharomyces cerevisiae CYC terminator
was removed from plasmid vector pGLY2338 by digesting with BamHI
and NotI and then cloning the DNA fragment into plasmid vector
pGLY2987 digested with BamH1 and Not1, thus generating the final
expression plasmid vector pGLY2988 (FIG. 7).
[0168] Expression/integration plasmid vector pGLY3200 (map is
identical to pGLY2988 except LC and HC are anti-CD20 with
.alpha.-amylase signal sequences). Anti-CD20 sequences were from
GenMab sequence 2C6 except Light chain (LC) framework sequences
matched those from VKappa 3 germline. Heavy (HC) and LC variable
sequences fused at the N-terminus to the .alpha.-amylase (from
Aspergillus niger) signal peptide were synthesized by GeneArt AG.
The nucleotide and amino acid sequences for the .alpha.-amylase
signal peptide are shown in SEQ ID NOs:33 and 34. Each was
synthesized with unique 5' EcoR1 and 3' KpnI sites which allowed
for the direct cloning of variable regions into expression vectors
containing the IgG1 and V kappa constant regions. The nucleotide
and amino acid sequences of the anti-CD20 HC are shown in SEQ ID
Nos:37 and 38, respectively. The nucleotide and amino acid
sequences of the anti-CD20 LC are shown in SEQ ID Nos:35 and 36,
respectively. Both HC and LC fusion proteins were subcloned into
IgG1 plasmid vector pGLY3184 and VKappa plasmid vector pGLY2600,
respectively, (each plasmid vector contains the Pichia pastoris
TRP2 targeting nucleic acid molecule and Zeocin-resistance marker)
to form plasmid vectors pGLY3192 and pGLY3196, respectively. The LC
expression cassette encoding the LC fusion protein under the
control of the Pichia pastoris AOX1 promoter and Saccharomyces
cerevisiae CYC terminator was removed from plasmid vector pGLY3196
by digesting with BamHI and NotI and then cloning the DNA fragment
into plasmid vector pGLY3192 digested with BamH1 and Not1, thus
generating the final expression plasmid vector pGLY3200 (FIG.
8).
[0169] Transfection of strain yGLY2696 with the above anti-Her2 or
anti-CD20 antibody expression/integration plasmid vectors was
performed essentially as follows. Appropriate Pichia pastoris
strains were grown in 50 mL YPD media (yeast extract (1%), peptone
(2%), dextrose (2%)) overnight to an OD of between about 0.2 to 6.
After incubation on ice for 30 minutes, cells were pelleted by
centrifugation at 2500-3000 rpm for 5 minutes. Media were removed
and the cells washed three times with ice cold sterile 1 M sorbitol
before resuspending in 0.5 mL ice cold sterile 1 M sorbitol. Ten
.mu.L linearized DNA (5-20 .mu.g) and 100 .mu.L cell suspension was
combined in an electroporation cuvette and incubated for 5 minutes
on ice. Electroporation was in a Bio-Rad GenePulser Xcell following
the preset Pichia pastoris protocol (2 kV, 25 .mu.F, 200.OMEGA.),
immediately followed by the addition of 1 mL YPDS recovery media
(YPD media plus 1 M sorbitol). The transfected cells were allowed
to recover for four hours to overnight at room temperature
(26.degree. C.) before plating the cells on selective media. Strain
yGLY2696 transfected with pGLY2988 encoding the anti-HER2 antibody
was designated yGLY4134. Strain yGLY2696 transfected with pGLY3200
encoding the anti-CD20 antibody was designated yGLY3920.
Example 3
[0170] This example describes the construction of plasmids
comprising expression cassettes encoding cell surface anchoring
proteins fused to binding moieties capable of binding an
immunoglobulin, which are suitable for use in Pichia pastoris. The
plasmids comprise a nucleic acid molecule encoding sed1p, a cell
surface anchoring protein that inherently contains an attached
glycophosphotidylinositol (GPI) post-translational modification
that anchors the protein in the cell wall. The nucleic acid
molecule encoding the sed1p was linked in frame to a nucleic acid
molecule encoding an antibody-binding moiety that is capable of
binding whole, intact antibodies.
[0171] Four plasmids were constructed containing antibody binding
moiety/cell surface anchor fusion protein expression cassettes.
Plasmid pGLY4136 encodes the five Fc binding domains of Protein A
fused to the Saccharomyces cerevisiae SED1 (ScSED1) gene followed
by the CYC terminator, all under the control of the AOX promoter
(FIG. 9). Plasmid pGLY4116 encodes the Fc receptor III (FcRIII
(LF)) fused to the ScSED1 gene (FIG. 10). Plasmid pGLY4137 encodes
Fc receptor I (FcRI) fused to the ScSED1 gene (FIG. 10) and plasmid
pGLY4124 (FIG. 9) encodes the ZZ-domain from Protein A fused to the
ScSED1 gene. The ZZ-domain consists of two of the five Fc binding
domains. All four plasmids contain a pUC19 E. coli origin and an
arsenite resistance marker and are integrated into the Pichia
pastoris genome at the URA6 locus.
[0172] Plasmid pGLY3033 comprising an expression cassette encoding
a fusion protein comprising the Saccharomyces cerevisiae SED1 GPI
anchoring protein without its endogenous signal peptide (SED1
fragment) has been described in copending Application Ser. No.
61/067,965 filed Mar. 3, 2008. The SED1 amino acid sequence without
its endogenous signal peptide is shown in SEQ ID NO:60. A nucleic
acid molecule encoding the SED1 fragment was synthesized by GeneArt
AG. The codons encoding the fragment had been optimized for
expression in Pichia pastoris. The nucleotide sequence encoding the
SED1 fragment is shown in SEQ ID NO:61). The Pichia pastoris URA6
locus was chosen as an integrating site for the GPI anchoring
protein expression cassette. The URA6 gene was PCR amplified from
Pichia pastoris genomic DNA and cloned into pCR2.1 TOPO
(Invitrogen, La Jolla, Calif.) to produce plasmid pGLY1849. The
BglII and EcoRI sites within the gene were mutated by silent
mutation for cloning purposes. The TRP2 targeting nucleic acid
molecule of plasmid pGLY2184 was replaced with the Pichia pastoris
URA6 gene from pGLY1849. In addition, the Pichia pastoris ARG1
selection marker was replaced with the Arsenite marker cassette
from plasmid pGFI8. The final plasmid was named pGFI30t and was
used to make plasmid pGLY3033 (FIG. 20), containing an expression
cassette comprising a nucleic acid molecule encoding the SED1
fragment protein fused at its amino terminus to a GR2 coiled-coil
peptide and Aspergillus niger alpha-amylase signal peptide operably
linked to the PpAOX1 promoter. The GR2 coiled coil and signal
peptide encoding fragment can be removed by EcoRI and SalI
digestion and replaced with an antibody capture moiety to make a
fusion protein in which the capture moiety is fused to a cell
surface anchoring protein.
[0173] Plasmid pGLY4136 comprising an expression cassette encoding
the five Fc binding domains of protein A fused to the SED1 fragment
under the control of the AOX1 promoter was constructed as follows.
A nucleic acid molecule fragment encoding the five Fc binding
domains from protein A was synthesized by GeneArt to encode the
five Fc binding domains fused to the Saccharomyces cerevisiae
.alpha.-Mating Factor pre signal sequence at the N-terminus and an
HA and 9.times.HIS Tag sequence at the C-terminus and to have an
EcoRI 5' end and a SalI 3' end. The fragment apre-5.times.BD-Htag
has the nucleotide sequence shown in SEQ ID NO:45. The
apre-5.times.BD-Htag fusion protein has the amino acid sequence
shown in SEQ ID NO:46. The nucleic acid molecule encoding the
apre-5.times.BD-Htag fusion protein was digested with EcoRI and
SalI and the fragment cloned into pGLY3033, which had been digested
with EcoRI and SalI to remove the GR2 coiled coil encoding
fragment. This produced plasmid pGLY4136, which contains operably
linked to the PpAOX1 promoter, the nucleic acid molecule encoding
the apre-5.times.BD-Htag fusion protein linked in-frame to the
nucleic acid molecule encoding the SED1 fragment. The plasmid is an
integration/expression vector that targets the plasmid to the URA6
locus. The fusion protein expressed by this integration/insertion
plasmid is referred to herein as the Protein A/SED1 fusion
protein.
[0174] To put the Protein A/SED1 fusion protein under the control
of the GAPDH promoter, plasmid pGLY4136 was digested with BglII and
EcoRI to release the AOX1 promoter and to insert the Pichia
pastoris GAPDH promoter from pGLY880. This produced plasmid
pGLY4139.
[0175] Plasmid pGLY4124 comprising an expression cassette encoding
the Protein A ZZ domain fused to the SED1 fragment under the
control of the AOX1 promoter was constructed as follows. The
ZZ-domain from GeneArt plasmid 0706208 ZZHAtag was PCR amplified
using the following primers: primer alpha-amy-ProtAZZ/up:
CGGAATTCacgATGGTCGCTTGGTGGTCTTTGTTTCTGTACGGTCTTCAGGTCGCTGCA
CCTGCTTTGGCTTCTGGTGGTGTTACTCCAGCTGCTAACGCTGCTCAACACG (SEQ ID NO:47)
and HA-ProtAZZ-Xho1ZZ/1p:
GCCTCGAGAGCGTAGTCTGGAACATCGTATGGGTAACCACCACCAGCATC (SEQ ID NO:48).
The alpha-amy-ProtAZZ/up primer includes in-frame the coding
sequence for the first 20 amino acids of the Aspergillus niger
.alpha.-amylase signal peptide (underlined). The primers introduce
an EcoRI site at the 5' end of the coding region and a XhoI site at
the 3' end. The nucleic acid sequence of the ZZ-domain as an
EcoRI/XhoI fragment is shown in SEQ ID NO:49. The amino acid
sequence of the ZZ-domain is shown in SEQ ID NO:50. The PCR
conditions were one cycle of 95.degree. C. for 2 minutes, 20 cycles
of 98.degree. C. for 10 seconds, 65.degree. C. for 10 seconds, and
72.degree. C. for 1 minute, and followed by one cycle of 72.degree.
C. for 10 minutes.
[0176] The PCR fragment was cloned into plasmid pCR2.1 TOPO and the
cloned fragment sequenced to confirm the sequence encoded the
Protein A ZZ domain. The ZZ-domain fragment was extracted from the
pCR2.1 TOPO vector by EcoRI and XhoI digest and the EcoRI/XhoI
fragment was cloned into plasmid pGLY3033, which had been digested
with EcoRI and SalI to remove the GR2 coiled coil encoding
fragment. This produced plasmid pGLY4124, which contains operably
linked to the PpAOX1 promoter, the nucleic acid molecule encoding
the Protein A ZZ domain-alpha amylase signal peptide fusion protein
linked in-frame to the nucleic acid molecule encoding the SED1
fragment. The plasmid is an integration/expression vector that
targets the plasmid to the URA6 locus. The fusion protein expressed
by this integration/insertion plasmid is referred to herein as the
ZZ/SED1 fusion protein.
[0177] Plasmid pGLY4116 comprising an expression cassette encoding
the FcRIIIa LF receptor fused to the SED1 fragment under the
control of the AOX1 promoter was constructed as follows. A nucleic
acid molecule encoding the FcRIIIa LF receptor was PCR amplified
from plasmid pGLY3247 (FcRIIIa LF) as an EcoRI/SalI fragment. In
plasmid pGLY3247, the FcRIIIa LF receptor is a fusion protein in
which the endogenous signal peptide had been replaced with the
.alpha.-MFpre-pro. The 5' primer anneals to the sequence encoding
the signal peptide and the 3' primer anneals to the His-tag at the
end of the receptor and omits the stop codon for the receptor. The
5' primer was SEcoapp: AACGGAATTCATGAGATTTCCTTCAATTTTTAC (SEQ ID
NO:51) and the 3' primer was 3HtagSal
CGATGTCGACGTGATGGTGATGGTGGTGATGATGATGACCACC (SEQ ID NO:52). The PCR
conditions were one cycle of 95.degree. C. for 2 minutes, 25 cycles
of 95.degree. C. for 30 seconds, 58.degree. C. for 30 seconds, and
72.degree. C. for 70 seconds, and followed by one cycle of
72.degree. C. for 10 minutes.
[0178] The PCR fragment encoding the receptor fusion protein was
cloned into plasmid pCR2.1 TOPO and the cloned fragment sequenced
to confirm the sequence encoded the receptor. The nucleotide
sequence of the FcRIII(LF) as an EcoRI/SalI fragment is shown in
SEQ ID NO:53. The amino acid sequence of the FcRIII(LF) with
.alpha. MF pre-signal sequence is shown in SEQ ID NO:54.
[0179] Plasmid pCR2.1 TOPO was digested with EcoRI and SalI and the
EcoRI/SalI fragment encoding the receptor was cloned into pGLY3033,
which had been digested with EcoRI and SalI to remove the GR21
coiled coil encoding fragment. This produced plasmid pGLY4116,
which contains operably linked to the PpAOX1 promoter, the nucleic
acid molecule encoding the FcRIIIa LF/.alpha.-MF pre-pro signal
peptide fusion protein linked in-frame to the nucleic acid molecule
encoding the SED1 fragment. The plasmid is an
integration/expression vector that targets the plasmid to the URA6
locus. The fusion protein expressed by this integration/insertion
plasmid is referred to herein as the FcRIIIa fusion protein.
[0180] Plasmid pGLY4137 encoding the FcRI receptor fused to the
SED1 fragment was constructed as follows. A nucleic acid molecule
encoding the FcRI receptor was PCR amplified from plasmid pGLY3248
as an EcoRI/SalI fragment. In plasmid pGLY3248, the FcRI receptor
is a fusion protein in which the endogenous signal peptide had been
replaced with the .alpha.-MFpre-pro. The 5' primer anneals to the
sequence encoding the signal peptide and the 3' primer anneals to
the His-tag at the end of the receptor and omits the stop codon for
the receptor. The 5' primer was SEcoapp:
AACGGAATTCATGAGATTTCCTTCAATTTTTAC (SEQ ID NO:51) and the 3' primer
was 3HtagSal CGATGTCGACGTGATGGTGATGGTGGTGATGATGATGACCACC (SEQ ID
NO:52). The PCR conditions were one cycle of 95.degree. C. for 2
minutes, 25 cycles of 95.degree. C. for 30 seconds, 58.degree. C.
for 30 seconds, and 72.degree. C. for 70 seconds, and followed by
one cycle of 72.degree. C. for 10 minutes.
[0181] The PCR fragment encoding the receptor fusion protein was
cloned into plasmid pCR2.1 TOPO and the cloned fragment sequenced
to confirm the sequence encoded the receptor. The nucleic acid
sequence of the FcRI as an EcoRI/SalI fragment is shown in SEQ ID
NO:55. The amino acid sequence of the FcRI with .alpha. MF
pre-signal sequence is shown in SEQ ID NO:56.
[0182] Plasmid pCR2.1 TOPO was digested with EcoRI and SalI and the
EcoRI/SalI fragment encoding the receptor was cloned into pGLY3033,
which had been digested with EcoRI and SalI to remove the GR21
coiled coil encoding fragment. This produced plasmid pGLY4116,
which contains operably linked to the PpAOX1 promoter, the nucleic
acid molecule encoding the FcRI/.alpha.-MF pre-pro signal peptide
fusion protein linked in-frame to the nucleic acid molecule
encoding the SED1 fragment. The plasmid is an
integration/expression vector that targets the plasmid to the URA6
locus. The fusion protein expressed by this integration/insertion
plasmid is referred to herein as the FcRI fusion protein.
Example 4
[0183] Co-Expression of antibody and antibody binding moiety/cell
surface anchor fusion protein in Pichia pastoris was as
follows.
[0184] Pichia pastoris strains yGLY4134 (expresses anti-HER2
antibody) and yGLY3920 (expresses anti-CD20 antibody) were each
transfected with pGLY4116 (expresses FcRIII receptor/SED fusion
protein), pGLY4136 (expresses Protein A/SED fusion protein),
pGLY4124 (expresses Protein A ZZ domain/SED fusion protein), or
pGLY4137 (expresses FcRI receptor/SED fusion protein). YGLY2696 was
also transfected with each of the above four expression/integration
vectors. For transfection, the strains are grown in 50 mL BMGY
media until the culture reached a density of about OD600=2.0. The
cells are washed three times with 1 M sorbitol and resuspended in 1
mL 1 M sorbitol. About 1 to 2 .mu.g of linearized plasmid are mixed
with the cells. Transfection is performed with a BioRad
electroporation apparatus using the manufacturer's program specific
for electroporation of nucleic acid molecules into Pichia pastoris.
One mL of recovery media is added to the cells, which are then
plated out on YPG (yeast extract:peptone:glycerol medium) with 50
.mu.g/mL arsenite.
[0185] Cell surface labeling was as follows. Strain yGLY4134
(expresses anti-Her2 antibody), strain yGLY4134 transfected with
pGLY4136 (expresses anti-Her2 antibody and Protein A/SED1 fusion
protein, and strain YGLY2696 transfected with pGLY4136 (expresses
Protein A/SED1 fusion protein) were grown in 600 .mu.L BMGY
(buffered minimal glycerol medium-yeast extract, Invitrogen) in a
96 deep well plate or 50 mL BMGY in a 250 mL shake flask for two
days. The cells were collected by centrifugation and the
supernatant was discarded. The cells were induced by incubation in
300 .mu.L or 25 mL BMMY with Pmti-3 inhibitor overnight following
the methods taught in WO2007/061631. Pmti-3 is
3-hydroxy-4-(2-phenylethoxy)benzaldehyde;
3-(1-phenylethoxy)-4-(2-phenylethoxy)-benzaldehyde, which as been
described in U.S. Pat. No. 7,105,554 and Published International
Application No. WO 2007061631. The Pmti-3 inhibitor reduces the
O-glycosylation occupancy, that is the number of total O-glycans on
the antibody molecule. The cell further express a T. reesei
alpha-1,2-mannsodase catalytic domain linked to the Saccharomyces
cerevisiea .alpha.MAT pre signal peptide to control the chain
length of those O-glycans that are on the antibody molecule.
[0186] Induced cells were labeled with goat anti-human heavy and
light chain (H+L) Alexa 488 (Invitrogen, Carlsbad, Calif.)
conjugated antibody and viewed using fluorescence microscopy as
follows. After induction, cells at density of about 0.5-1.0 OD600
were collected by centrifugation in a 1.5-mL tube. The cells were
rinsed twice with 1 mL PBS and 0.5 mL goat anti-human IgG
(H+L)-Alexa 488 (1:500 in 1% BSA in PBS) was added. The tubes were
rotated for one hour at 37.degree. C., centrifuged, and rinsed
3.times. with 1 mL PBS to remove the detection antibody. The cells
were resuspended in about 50-100 .mu.L of PBS and a 10 .mu.L
aliquot viewed with a fluorescence microscope and photographed
(FIG. 2). As expected, both the anti-Her2 antibody expressing
strain yGLY4134 without pGLY4136 encoding the protein A/SED1 fusion
protein and yGLY2696 with pGLY4136 encoding the Protein A/SED1
fusion protein but no anti-Her2 antibody showed no surface
labeling. The weak labeling that was visible on the cells of
yGLY2696 transfected with pGLY4136 might be due to cross reaction
of the goat anti human heavy and light chain (H+L) Alexa 488
conjugated antibody to the expressed Protein A. However, as can
also be seen in FIG. 11, co-expression of the Protein A/SED1 fusion
protein and the anti-Her2 antibody (strain yGLY4134 transfected
with pGLY4136) did not result in displayed antibody on the cell
surface and showed only background labeling. This result suggested
that simultaneously expressing the antibody and Protein A/SED1
protein interfered with display of the antibody on the cell surface
or the Protein A/SED1 protein was not properly anchored to the cell
surface.
Example 5
[0187] This example demonstrates that the Protein A/SED1 fusion
protein is properly anchored to the cell surface and that
co-expressing the anti-Her2 antibody and Protein A/SED1 fusion
protein at the same time interfere with capture and display of the
antibody on the cell surface.
[0188] To test whether the Protein A/SED1 fusion protein itself is
displayed on the cell surface, strain yGLY2696 transfected pGLY4136
encoding the Protein A/SED1 fusion protein was grown and induced as
described in the previous example. At a cell density of about
0.5-1.0 OD600, cells were collected by centrifugation in a 1.5-mL
tube and rinsed twice with 1 mL PBS. Either 10 or 50 ng of
anti-Her2 antibody was added externally to the cells and the cells
incubated for one hour. Afterwards, the cells were washed 3.times.
in 1 ml PBS and labeled with goat anti human H+L as described in
the previous example. The results showed that the anti-Her2
antibody was captured and displayed on the surface of the cells.
This can be seen in FIG. 12, which shows strong cell surface
staining. The results confirm that the Protein A/SED1 fusion
protein is expressed, the expressed fusion protein is properly
inserted into the cell surface, and the fusion protein is able to
capture and display antibodies on the cell surface.
[0189] To determine whether co-expression interfered with display
of the antibody on the cell surface, strain yGLY2696 transfected
with pGLY4136 (empty strain that expresses Protein A/SED1 fusion
protein), strain yGLY4134 transfected with pGLY4136 (strain
expresses anti-Her2 antibody and Protein A/SED1 fusion protein),
and strain yGLY3920 transfected with pGLY4136 (strain expresses
anti-CD20 antibody and Protein A/SED1 fusion protein) were grown
and induced as in the previous example. Cells were incubated with
10 ng externally added anti-Her2 antibody, labeled, and detected as
in the previous example. FIG. 13 illustrates strong cell surface
labeling of the empty strain expressing only the Protein A/SED1
fusion protein (yGLY2696 transfected with pGLY4136), but only weak
staining in the strains when the Protein A/SED1 fusion protein and
the antibody were co-expressed (yGLY4134 transfected with pGLY4136
and yGLY3920 transfected with pGLY4136). Cells expressing the
Protein A/SED1 fusion protein were able to capture externally added
antibody and display it while cells co-expressing antibody and
Protein A/SED1 fusion protein were unable to capture externally
added antibody nor display their own secreted antibody.
[0190] These results suggested that the Protein A/SED1 fusion
protein is not displayed well on the cell surface in an antibody
co-expressing strain. This may be because co-expression of the
Protein A/SED1 fusion protein and the antibody from the strong AOX
promoter under methanol induction may lead to aggregation of the
antibody-Protein A/SED1 fusion protein complex in the ER and
degradation. Alternatively, the antibody-Protein A/SED1 fusion
protein complex produced in the ER may not secrete well because of
its molecular weight or steric hindrance.
Example 6
[0191] Other antibody binding moieties were tested for their
ability to display antibody on the cell surface of P. pastoris.
These include the Fc receptor I (FcRI), the Fc receptor III
(FcRIII) and the Protein A ZZ-domain. Strains yGLY2696 (empty),
yGLY4134 (expresses anti-Her2 antibody) and yGLY3920 (expresses
anti-CD20 antibody) were separately transfected with each of
plasmids pGLY4116 (encodes FcRIII/SED1 fusion protein), pGLY4124
(encodes Protein A ZZ domain/SED1 fusion protein), and pGLY4136
(encodes Protein A/SED1 fusion protein), were grown, induced and
labeled as in Example 4.
[0192] The results for the ZZ-domain were similar to those for
Protein A albeit the staining was somewhat weaker. This suggests
that two Fc binding domains have a lower affinity for the antibody
compared to the intact Protein A, which has five Fc binding
domains.
[0193] Co-expression of the FcRIII/SED1 fusion protein and antibody
resulted in a lack of cell surface staining. Strain yGLY2696
transfected with pGLY4116 (encodes FcRIII/SED1 fusion protein) was
grown and induced as described in Example 4 and the cells were
incubated with 10 or 50 ng externally added anti-Her2 antibody.
Contrary to the results from strains that expressed the Protein
A/SED1 fusion protein, cell surface staining was absent while some
intracellular staining is observed (FIG. 14). The results suggest
that while the FcRIII/SED1 fusion protein may be expressed in the
cell, it did not appear to be secreted.
Example 7
[0194] This example demonstrates that temporal expression of the
Protein A/SED1 fusion protein and the antibody enables proper
expression and capture of the secreted antibody on the cell
surface.
[0195] The above experiments suggested that co-expression of the
antibody binding moiety/cell surface anchor fusion protein and the
antibody together does not allow the anchor to be displayed at the
cell surface. In the above experiments, both the antibody binding
moiety/cell surface anchor fusion protein and antibody were
expressed from nucleic acid molecules operably linked to the strong
AOX inducible promoter. It was hypothesized that inducing
expression of the antibody binding moiety/cell surface anchor
fusion protein first, then after sufficient antibody binding
moiety/cell surface anchor fusion protein had been made and
anchored to the cell surface, inhibiting expression of the antibody
binding moiety/cell surface anchor fusion protein and inducing
expression of the antibody, would enable the antibody that is made
to be captured at the cell surface by the antibody binding
moiety/cell surface anchor fusion protein. Therefore, different
promoters that would allow temporal expression of the nucleic acid
molecules encoding the antibody binding moiety/cell surface anchor
fusion protein and antibody were tested.
[0196] The GUT1 promoter is a promoter that is induced in cells
grown in the presence of glycerol and repressed when the cells are
switched to a medium that lacks glycerol but contains dextrose. PCR
was used to amplify the GUT1 promoter from genomic DNA of Pichia
pastoris as BglII/EcoR1 fragment using primer 5gutBglII ATTGAGATCT
ACCCAATTTA GCAGCCTGCA TTCTC (SEQ ID NO:57) and primer 3gutEcoRI
GTCAGAATTC ATCTGTGGTA TAGTGTGAAA AAGTAG (SEQ ID NO:58). The PCR
fragment was then cloned into the pCR2.1 TOPO vector, and then
sequenced to confirm the sequence. The GUT1 promoter fragment was
extracted from the pCR2.1 TOPO vector by BglII/EcoRI digest and
cloned into pGLY4136 digested with BglII/EcoRI to exchange the AOX1
promoter by the GUT1 promoter. The nucleotide sequence of the GUT1
promoter including the BglI and EcoRI ends is shown in SEQ ID
NO:59.
[0197] The AOX promoter from the Protein A/SED1 fusion protein
plasmid pGLY4136 was replaced either by the PpGAPDH promoter
resulting in plasmid pGLY4139 or the GUT1 promoter producing the
plasmid pGLY4144 (FIG. 15). The PpGAPDH promoter is induced in
dextrose and at about 80% of that level in glycerol, while the GUT1
promoter is induced in glycerol and repressed in dextrose. pGLY4139
was transfected into yGLY4134, expressing anti-Her2 antibody under
control of the AOX promoter. Additionally, pGLY4144 has been
transfected into strain yGLY5434 (yGLY2696 transfected with
pGLY4142), in which anti-Her2 expression is regulated by the GAPDH
promoter.
[0198] Strain yGLY4134 transfected with pGLY4136, in which
expression of the Protein A/SED1 fusion protein and the anti-Her2
antibody are both regulated by the AOX promoter, was grown in 600
.mu.L BMGY (glycerol as carbon source) in a 96 deep well plate or
50 mL BMGY in a 250 mL shake flask for two days. The cells were
collected by centrifugation and the supernatant was discarded. The
cells were induced by incubation overnight in 300 .mu.L or 25 mL
BMMY (methanol as carbon source) with PMTi inhibitor.
[0199] Strain yGLY4134 transfected with pGLY4139, in which
expression of the Protein A/SED1 fusion protein is regulated by the
PpGAPDH promoter and expression of the anti-Her2 antibody regulated
by the AOX promoter, was grown in BMGY (glycerol as carbon source)
and induced in BMMY with PMTi inhibitor (methanol as carbon
source).
[0200] Strain yGLY5434 transfected with pGLY4144, in which
expression of the Protein A/SED1 fusion protein is regulated by the
GUT1 promoter and expression of the anti-Her2 antibody is regulated
by the GAPDH promoter, was grown in BMGY (glycerol as carbon
source) and induced in BMDY with PMTi inhibitor (dextrose as carbon
source). Dextrose inhibits transcription from the GUT1 promoter.
After induction, all three strains were labeled with goat anti
human IgG (H+L)-Alexa 488 as described in Example 1. In general,
growth can be between 1.5 days to 3 days and induction between 1 to
2 days. Strains are usually grown for 2 days and then induced for
another 2 days: afterwards the analysis is done.
[0201] FIG. 16 illustrates the results of cell surface staining of
the above strains. As was shown in Example 5, co-expression of the
Protein A/SED1 fusion protein and anti-Her2 antibody, both under
the strong AOX promoter (yGLY4134 transfected with pGLY4136) does
not show any cell surface labeling. Expression of the Protein
A/SED1 fusion protein under the GAPDH promoter during growth in
glycerol and the expression of anti-Her2 antibody regulated by the
AOX promoter during induction with methanol (yGLY4134 transfected
with pGLY4139) shows some weak but visible cell surface labeling.
In this case the Protein A/SED1 fusion protein is still expressed
at some level during induction of the antibody because the GAPDH
promoter is not completely repressed under methanol induction
conditions. However, expression of the Protein A/SED1 fusion
protein under the GUT1 promoter during growth in glycerol followed
by induction of the anti-Her2 antibody regulated by the GAPDH
promoter during induction in dextrose (YGLY5434 transfected with
pGLY4144) showed strong cell surface labeling. In this case, the
Protein A/SED1 fusion protein was not expressed under antibody
induction conditions because the GUT1 promoter is completely
repressed in dextrose.
[0202] FIG. 17 is a chart that illustrates the expected expression
patterns of Protein A/SED1 fusion protein and antibody under the
control of different combinations of promoters. Expression of the
Protein A/SED1 fusion protein and the antibody under the strong AOX
promoter, which is repressed in the glycerol growth phase and
induced in the methanol induction phase, led to no detectable cell
surface display. Likely, co-expression leads to a Protein A/SED1
fusion protein-antibody complex in the ER, which does not secrete
to the cell surface or is degraded.
[0203] Expression of the Protein A/SED1 fusion protein under the
GAPDH promoter during growth in glycerol and expression of the
antibody under the AOX promoter during induction in methanol
resulted in weak cell surface display. In this case, the Protein
A/SED1 fusion protein is still expressed at some level during
induction of the antibody because the GAPDH promoter is not
repressed completely under methanol induction conditions. This
means that under induction conditions, there might be complex
formation between the Protein A/SED1 fusion protein and the
antibody in the ER, which then clogs the secretory pathway leading
to only a small amount of Protein A/SED1 fusion protein at the cell
surface.
[0204] Expression of the Protein A/SED1 fusion protein under the
GUT1 promoter during growth in glycerol followed by expression of
the antibody under the GAPDH promoter while simultaneously
repressing expression of the Protein A/SED1 fusion protein during
induction of antibody expression with dextrose led to strong cell
surface display. Thus, when the Protein A/SED1 fusion protein is
expressed first and then completely repressed during antibody
induction, the Protein A/SED1 fusion protein is secreted to the
cell wall where it can capture the antibody when it is secreted.
Although the antibody is expressed at some level during Protein
A/SED1 fusion protein growth because the GAPDH promoter is not
repressed under glycerol, the level of expression of the antibody
appears to be low enough to not interfere with the Protein A/SED1
fusion protein secretion.
[0205] To demonstrate that the cell surface display of whole
antibody by Protein A/SED1 fusion protein regulated under the GUT1
promoter is functional for different antibodies, the anti-CD20
antibody expressing strain yGLY5757 was also transfected with
plasmid pGLY4144, which encodes Protein A/SED1 fusion protein whose
expression is regulated the GUT1 promoter. Strain yGLY5757 is
strain yGLY2696 transfected with the plasmid pGLY4078. Plasmid
pGLY4078 encodes the heavy and light chain of the anti-CD20
antibody under the regulation of the GAPDH promoter.
[0206] Strain yGLY5757 expressing the anti-CD20 antibody operably
linked to the GAPDH promoter and transfected with pGLY4144 (encodes
Protein A/SED1 fusion protein under control of the GUT1 promoter)
and strain yGLY5434 expressing the anti-Her2 antibody operably
linked to the GAPDH promoter transfected with pGLY4144 were grown
in glycerol for Protein A/SED1 fusion protein expression followed
by induction in dextrose for antibody expression and secretion as
described for FIG. 6. Strong cell surface staining was observed for
both antibodies (FIG. 18). This demonstrates that temporal
regulation enables different antibodies and not just the anti-Her2
antibodies to be displayed on the yeast surface by an anchored
antibody binding moiety.
[0207] FIG. 19 shows the results of FACS sorting of the samples
from FIG. 8. The anti-Her2 expressing strain yGLY5757 transfected
with pGLY4144, the anti-CD20 expressing strain yGLY5434 transfected
with pGLY4144 and the empty strain yGLY2696 transfected with
pGLY4144 were grown in glycerol and then induced in dextrose. Cells
were labeled with goat anti human IgG (H+L)-Alexa 488 and analyzed
by FACS sorting. As shown in FIG. 9, the empty strain without
antibody expression displayed background fluorescent staining while
for three clones of the anti-CD20 expressing strain, the
fluorescence was shifted to the right showing cell surface
labeling. The same was also seen for the anti-Her2 expressing
strain. One clone of this strain showed no cell surface labeling,
which could be a false positive from a transfection that does not
express the antibody or the anchor. These results demonstrate that
the cells displaying whole antibodies can be sorted using FACS
sorting.
TABLE-US-00001 TABLE 1 BRIEF DESCRIPTION OF THE SEQUENCES SEQ ID
NO: Description Sequence 1 PCR primer
AGCGCTGACGCCCCCGAGGAGGAGGACCAC hPDI/UP1 2 PCR primer
CCTTAATTAATTACAGTTCATCATGCACAGCTTTCTGATCAT hPDI/LP-PacI 3 PCR
primer ATGAATTCAGGC CATATCGGCCATTGTTTACTGTGCGCCCACAGTAG PB248 4 PCR
primer ATGTTTA AACGTGAGGATTACTGGTGATGAAAGAC PB249 5 PCR primer
AGACTAGTCTATTTGGAG ACATTGACGGATCCAC PB250 6 PCR primer
ATCTCGAGAGGCCATGCAGGCCAACCACAAGATGAATCAAATTTTG PB251 7 PCR primer
GGTGAGGTTGAGGTCCCAAGTGACTATCAAGGTC PpPDI/UPi-1 8 PCR primer
GACCTTGATAGTCACTTGGGACCTCAACCTCACC PpPDI/LPi-1 9 PCR primer
CGCCAATGATGAGGATGCCTCTTCAAAGGTTGTG PpPDI/UPi-2 10 PCR primer
CACAACCTTTGAAGAGGCATCCTCATCATTGGCG PpPDI/LPi-2 11 PCR primer
GGCGATTGCATTCGCGAC TGTATC PpPDI-5'/UP 12 PCR primer CCTAGAGAGCGGTGG
CCAAGATG hPDI-3'/LP 13 PCR primer GTGGCCACACCAGGGGGC ATGGAAC
hPDI/UP 14 PCR primer CCTAGAGAGCGGTGG CCAAGATG hPDI-3'/LP 15 PCR
primer AGCGCTGACGATGAAGTTGATGTGGATGGTACA GTAG hGRP94/UP1 16 PCR
primer GGCCGGCCTTACAATTCATCATG TTCAGCTGTAGATTC hGRP94/LP1 17 PCR
primer TGAACCCATCTGTAAATAGAATGC PMT1-KO1 18 PCR primer
GTGTCACCTAAATCGTATGTGCCCATTTACTGGAAGCTGCTAACC PMT1-KO2 19 PCR
primer CTCCCTATAGTGAGTCGTATTCATCATTGTACTTTGGTATATTGG PMT1-KO3 20
PCR primer TATTTGTACCTGCGTCCTGTTTGC PMT1-KO4 21 PCR primer
CACATACGATTTAGGTGACAC PR29 22 PCR primer AATACGACTCACTATAGGGAG PR32
23 PCR primer TGCTCTCCGCGTGCAATAGAAACT PMT4-KO1 24 PCR primer
CTCCCTATAGTGAGTCGTATTCACAGTGTACCATCTTTCATCTCC PMT4-KO2 25 PCR
primer GTGTCACCTAAATCGTATGTGAACCTAACTCTAATTCTTCAAAGC PMT4-KO3 26
PCR primer ACTAGGGTATATAATTCCCAAGGT PMT4-KO4 27 Pre-pro
.alpha.-mating ATG AGA TTC CCA TCC ATC TTC ACT GCT GTT TTG factor
signal TTC GCT GCT TCT TCT GCT TTG GCT peptide (Sc.alpha.MTprepro)
(DNA) 28 Pre-pro .alpha.-mating MRFPSIFTAVLFAASSALA factor signal
peptide (protein) 29 Anti-Her2
GAGGTTCAGTTGGTTGAATCTGGAGGAGGATTGGTTCAACCTGGTGGTTCTTTG Heavy chain
AGATTGTCCTGTGCTGCTTCCGGTTTCAACATCAAGGACACTTACATCCACTGG (VH + IgG1
GTTAGACAAGCTCCAGGAAAGGGATTGGAGTGGGTTGCTAGAATCTACCCAACT constant
AACGGTTACACAAGATACGCTGACTCCGTTAAGGGAAGATTCACTATCTCTGCT region)
(DNA) GACACTTCCAAGAACACTGCTTACTTGCAGATGAACTCCTTGAGAGCTGAGGAT
ACTGCTGTTTACTACTGTTCCAGATGGGGTGGTGATGGTTTCTACGCTATGGAC
TACTGGGGTCAAGGAACTTTGGTTACTGTTTCCTCCGCTTCTACTAAGGGACCA
TCTGTTTTCCCATTGGCTCCATCTTCTAAGTCTACTTCCGGTGGTACTGCTGCT
TTGGGATGTTTGGTTAAAGACTACTTCCCAGAGCCAGTTACTGTTTCTTGGAAC
TCCGGTGCTTTGACTTCTGGTGTTCACACTTTCCCAGCTGTTTTGCAATCTTCC
GGTTTGTACTCTTTGTCCTCCGTTGTTACTGTTCCATCCTCTTCCTTGGGTACT
CAGACTTACATCTGTAACGTTAACCACAAGCCATCCAACACTAAGGTTGACAAG
AAGGTTGAGCCAAAGTCCTGTGACAAGACTCATACTTGTCCACCATGTCCAGCT
CCAGAATTGTTGGGTGGTCCTTCCGTTTTTTTGTTCCCACCAAAGCCAAAGGAC
ACTTTGATGATCTCCAGAACTCCAGAGGTTACATGTGTTGTTGTTGACGTTTCT
CACGAGGACCCAGAGGTTAAGTTCAACTGGTACGTTGACGGTGTTGAAGTTCAC
AACGCTAAGACTAAGCCAAGAGAGGAGCAGTACAACTCCACTTACAGAGTTGTT
TCCGTTTTGACTGTTTTGCACCAGGATTGGTTGAACGGAAAGGAGTACAAGTGT
AAGGTTTCCAACAAGGCTTTGCCAGCTCCAATCGAAAAGACTATCTCCAAGGCT
AAGGGTCAACCAAGAGAGCCACAGGTTTACACTTTGCCACCATCCAGAGATGAG
TTGACTAAGAACCAGGTTTCCTTGACTTGTTTGGTTAAGGGATTCTACCCATCC
GACATTGCTGTTGAATGGGAGTCTAACGGTCAACCAGAGAACAACTACAAGACT
ACTCCACCTGTTTTGGACTCTGACGGTTCCTTTTTCTTGTACTCCAAGTTGACT
GTTGACAAGTCCAGATGGCAACAGGGTAACGTTTTCTCCTGTTCCGTTATGCAT
GAGGCTTTGCACAACCACTACACTCAAAAGTCCTTGTCTTTGTCCCCTGGTAAG TAA 30
Anti-Her2 EVQLVESGGGLVQPGGSLRLSCAASGFNIKDTYIHWVRQAPGKGLEWVARIYPT
Heavy chain NGYTRYADSVKGRFTISADTSKNTAYLQMNSLRAEDTAVYYCSRWGGDGFYAMD
(VH + IgG1 YWGQGTLVTVSSASTKGPSVFPLAPSSKSTSGGTAALGCLVKDYFPEPVTVSWN
constant region)
SGALTSGVHTFPAVLQSSGLYSLSSVVTVPSSSLGTQTYICNVNHKPSNTKVDK (protein)
KVEPKSCDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVS
HEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKC
KVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPS
DIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMH
EALHNHYTQKSLSLSPGK 31 Anti-Her2 light
GACATCCAAATGACTCAATCCCCATCTTCTTTGTCTGCTTCCGTTGGTGACAGA chain (VL +
GTTACTATCACTTGTAGAGCTTCCCAGGACGTTAATACTGCTGTTGCTTGGTAT Kappa
constant CAACAGAAGCCAGGAAAGGCTCCAAAGTTGTTGATCTACTCCGCTTCCTTCTTG
region) (DNA)
TACTCTGGTGTTCCATCCAGATTCTCTGGTTCCAGATCCGGTACTGACTTCACT
TTGACTATCTCCTCCTTGCAACCAGAAGATTTCGCTACTTACTACTGTCAGCAG
CACTACACTACTCCACCAACTTTCGGACAGGGTACTAAGGTTGAGATCAAGAGA
ACTGTTGCTGCTCCATCCGTTTTCATTTTCCCACCATCCGACGAACAGTTGAAG
TCTGGTACAGCTTCCGTTGTTTGTTTGTTGAACAACTTCTACCCAAGAGAGGCT
AAGGTTCAGTGGAAGGTTGACAACGCTTTGCAATCCGGTAACTCCCAAGAATCC
GTTACTGAGCAAGACTCTAAGGACTCCACTTACTCCTTGTCCTCCACTTTGACT
TTGTCCAAGGCTGATTACGAGAAGCACAAGGTTTACGCTTGTGAGGTTACACAT
CAGGGTTTGTCCTCCCCAGTTACTAAGTCCTTCAACAGAGGAGAGTGTTAA 32 Anti-Her2
light DIQMTQSPSSLSASVGDRVTITCRASQDVNTAVAWYQQKPGKAPKLLIYSASFL chain
(VL + Kappa YSGVPSRFSGSRSGTDFTLTISSLQPEDFATYYCQQHYTTPP constant
region) TFGQGTKVEIKRTVA APSVFIFPPSDEQLKSGTASVVCLNNFYPREAKVQWKV
DNALQSGNSQESVTEQDSKDSTYSLSSTLTLSKADYEKHKVYACEVTHQGLSSP VTKSFNRGEC
33 Alpha amylase ATGGTTGCTT GGTGGTCCTT GTTCTTGTAC GGATTGCAAG
TTGCTGCTCC signal peptide AGCTTTGGCT (from Aspergillus niger
.alpha.-amylase) (DNA) 34 Alpha amylase MVAWWSLFLY GLQVAAPALA
signal peptide (from Aspergillus niger .alpha.-amylase) 35
Anti-CD20 GAGATCGTTT TGACACAGTC CCCAGCTACT TTGTCTTTGT CCCCAGGTGA
Light chain AAGAGCTACA TTGTCCTGTA GAGCTTCCCA ATCTGTTTCC TCCTACTTGG
Variable CTTGGTATCA ACAAAAGCCA GGACAGGCTC CAAGATTGTT GATCTACGAC
Region (DNA) GCTTCCAATA GAGCTACTGG TATCCCAGCT AGATTCTCTG GTTCTGGTTC
CGGTACTGAC TTCACTTTGA CTATCTCTTC CTTGGAACCA GAGGACTTCG CTGTTTACTA
CTGTCAGCAG AGATCCAATT GGCCATTGAC TTTCGGTGGT GGTACTAAGG TTGAGATCAA
GCGTACGGTT GCTGCTCCTT CCGTTTTCAT TTTCCCACCA TCCGACGAAC AATTGAAGTC
TGGTACCCAA TTCGCCC 36 Anti-CD20 EIVLTQSPAT LSLSPGERAT LSCRASQSVS
SYLAWYQQKP GQAPRLLIYD Light chain ASNRATGIPA RFSGSGSGTD FTLTISSLEP
EDFAVYYCQQ RSNWPLTFGG Variable Region GTKVEIKRTV
AAPSVFIFPPSDEQLKSGTQFA 37 Anti-CD20 GCTGTTCAGC TGGTTGAATC
TGGTGGTGGA TTGGTTCAAC CTGGTAGATC Heavy chain CTTGAGATTG TCCTGTGCTG
CTTCCGGTTT TACTTTCGGT GACTACACTA Variable Region TGCACTGGGT
TAGACAAGCT CCAGGAAAGG GATTGGAATG GGTTTCCGGT (DNA) ATTTCTTGGA
ACTCCGGTTC CATTGGTTAC GCTGATTCCG TTAAGGGAAG ATTCACTATC TCCAGAGACA
ACGCTAAGAA CTCCTTGTAC TTGCAGATGA ACTCCTTGAG AGCTGAGGAT ACTGCTTTGT
ACTACTGTAC TAAGGACAAC CAATACGGTT CTGGTTCCAC TTACGGATTG GGAGTTTGGG
GACAGGGAAC TTTGGTTACT GTCTCGAGTG CTTCTACTAA GGGACCATCC GTTTTTCCAT
TGGCTCCATC CTCTAAGTCT ACTTCCGGTG GTACCCAATT CGCCC 38 Anti-CD20
AVQLVESGGG LVQPGRSLRL SCAASGFTFG DYTMHWVRQA PGKGLEWVSG Heavy chain
ISWNSGSIGY ADSVKGRFTI SRDNAKNSLY LQMNSLRAED TALYYCTKDN Variable
Region QYGSGSTYGL GVWGQGTLVT VSSASTKGPS VFPLAPSSKS TSGGTQFA 39
human PDI GACGCCCCCGAGGAGGAGGACCACGTCTTGGTGCTGCGGAAAAGCAACTTCGCG
Gene (DNA) GAGGCGCTGGCGGCCCACAAGTACCCGCCGGTGGAGTTCCATGCCCCCTGGTGT
GGCCACTGCAAGGCTCTGGCCCCTGAGTATGCCAAAGCCGCTGGGAAGCTGAAG
GCAGAAGGTTCCGAGATCAGGTTGGCCAAGGTGGACGCCACGGAGGAGTCTGAC
CTAGCCCAGCAGTACGGCGTGCGCGGCTATCCCACCATCAAGTTCTTCAGGAAT
GGAGACACGGCTTCCCCCAAGGAATATACAGCTGGCAGAGAGGCTGATGACATC
GTGAACTGGCTGAAGAAGCGCACGGGCCCGGCTGCCACCACCCTGCCTGACGGC
GCAGCTGCAGAGTCCTTGGTGGAGTCCAGCGAGGTGGCCGTCATCGGCTTCTTC
AAGGACGTGGAGTCGGACTCTGCCAAGCAGTTTTTGCAGGCAGCAGAGGCCATC
GATGACATACCATTTGGGATCACTTCCAACAGTGACGTGTTCTCCAAATACCAG
CTCGACAAAGATGGGGTTGTCCTCTTTAAGAAGTTTGATGAAGGCCGGAACAAC
TTTGAAGGGGAGGTCACCAAGGAGAACCTGCTGGACTTTATCAAACACAACCAG
CTGCCCCTTGTCATCGAGTTCACCGAGCAGACAGCCCCGAAGATTTTTGGAGGT
GAAATCAAGACTCACATCCTGCTGTTCTTGCCCAAGAGTGTGTCTGACTATGAC
GGCAAACTGAGCAACTTCAAAACAGCAGCCGAGAGCTTCAAGGGCAAGATCCTG
TTCATCTTCATCGACAGCGACCACACCGACAACCAGCGCATCCTCGAGTTCTTT
GGCCTGAAGAAGGAAGAGTGCCCGGCCGTGCGCCTCATCACCTTGGAGGAGGAG
ATGACCAAGTACAAGCCCGAATCGGAGGAGCTGACGGCAGAGAGGATCACAGAG
TTCTGCCACCGCTTCCTGGAGGGCAAAATCAAGCCCCACCTGATGAGCCAGGAG
CTGCCGGAGGACTGGGACAAGCAGCCTGTCAAGGTGCTTGTTGGGAAGAACTTT
GAAGACGTGGCTTTTGATGAGAAAAAAAACGTCTTTGTGGAGTTCTATGCCCCA
TGGTGTGGTCACTGCAAACAGTTGGCTCCCATTTGGGATAAACTGGGAGAGACG
TACAAGGACCATGAGAACATCGTCATCGCCAAGATGGACTCGACTGCCAACGAG
GTGGAGGCCGTCAAAGTGCACGGCTTCCCCACACTCGGGTTCTTTCCTGCCAGT
GCCGACAGGACGGTCATTGATTACAACGGGGAACGCACGCTGGATGGTTTTAAG
AAATTCCTAGAGAGCGGTGGCCAAGATGGGGCAGGGGATGTTGACGACCTCGAG
GACCTCGAAGAAGCAGAGGAGCCAGACATGGAGGAAGACGATGACCAGAAAGCT
GTGAAAGATGAACTGTAA 40 human PDI
DAPEEEDHVLVLRKSNFAEALAAHKYPPVEFHAPWCGHCKALAPEYAKAAGKLK Gene
(protein) AEGSEIRLAKVDATEESDLAQQYGVRGYPTIKFFRNGDTASPKEYTAGREADDI
VNWLKKRTGPAATTLPDGAAAESLVESSEVAVIGFFKDVESDSAKQFLQAAEAI
DDIPFGITSNSDVFSKYQLDKDGVVLFKKFDEGRNNFEGEVTKENLLDFIKHNQ
LPLVIEFTEQTAPKIFGGEIKTHILLFLPKSVSDYDGKLSNFKTAAESFKGKIL
FIFIDSDHTDNQRILEFFGLKKEECPAVRLITLEEEMTKYKPESEELTAERITE
FCHRFLEGKIKPHLMSQELPEDWDKQPVKVLVGKNFEDVAFDEKKNVFVEFYAP
WCGHCKQLAPIWDKLGETYKDHENIVIAKMDSTANEVEAVKVHGFPTLGFFPAS
ADRTVIDYNGERTLDGFKKFLESGGQDGAGDVDDLEDLEEAEEPDMEEDDDQKA VHDEL 41
Pichia pastoris
ATGCAATTCAACTGGAATATTAAAACTGTGGCAAGTATTTTGTCCGCTCTCACA PDI1 Gene
CTAGCACAAGCAAGTGATCAGGAGGCTATTGCTCCAGAGGACTCTCATGTCGTC (DNA)
AAATTGACTGAAGCCACTTTTGAGTCTTTCATCACCAGTAATCCTCACGTTTTG
GCAGAGTTTTTTGCCCCTTGGTGTGGTCACTGTAAGAAGTTGGGCCCTGAACTT
GTTTCTGCTGCCGAGATCTTAAAGGACAATGAGCAGGTTAAGATTGCTCAAATT
GATTGTACGGAGGAGAAGGAATTATGTCAAGGCTACGAAATTAAAGGGTATCCT
ACTTTGAAGGTGTTCCATGGTGAGGTTGAGGTCCCAAGTGACTATCAAGGTCAA
AGACAGAGCCAAAGCATTGTCAGCTATATGCTAAAGCAGAGTTTACCCCCTGTC
AGTGAAATCAATGCAACCAAAGATTTAGACGACACAATCGCCGAGGCAAAAGAG
CCCGTGATTGTGCAAGTACTACCGGAAGATGCATCCAACTTGGAATCTAACACC
ACATTTTACGGAGTTGCCGGTACTCTCAGAGAGAAATTCACTTTTGTCTCCACT
AAGTCTACTGATTATGCCAAAAAATACACTAGCGACTCGACTCCTGCCTATTTG
CTTGTCAGACCTGGCGAGGAACCTAGTGTTTACTCTGGTGAGGAGTTAGATGAG
ACTCATTTGGTGCACTGGATTGATATTGAGTCCAAACCTCTATTTGGAGACATT
GACGGATCCACCTTCAAATCATATGCTGAAGCTAACATCCCTTTAGCCTACTAT
TTCTATGAGAACGAAGAACAACGTGCTGCTGCTGCCGATATTATTAAACCTTTT
GCTAAAGAGCAACGTGGCAAAATTAACTTTGTTGGCTTAGATGCCGTTAAATTC
GGTAAGCATGCCAAGAACTTAAACATGGATGAAGAGAAACTCCCTCTATTTGTC
ATTCATGATTTGGTGAGCAACAAGAAGTTTGGAGTTCCTCAAGACCAAGAATTG
ACGAACAAAGATGTGACCGAGCTGATTGAGAAATTCATCGCAGGAGAGGCAGAA
CCAATTGTGAAATCAGAGCCAATTCCAGAAATTCAAGAAGAGAAAGTCTTCAAG
CTAGTCGGAAAGGCCCACGATGAAGTTGTCTTCGATGAATCTAAAGATGTTCTA
GTCAAGTACTACGCCCCTTGGTGTGGTCACTGTAAGAGAATGGCTCCTGCTTAT
GAGGAATTGGCTACTCTTTACGCCAATGATGAGGATGCCTCTTCAAAGGTTGTG
ATTGCAAAACTTGATCACACTTTGAACGATGTCGACAACGTTGATATTCAAGGT
TATCCTACTTTGATCCTTTATCCAGCTGGTGATAAATCCAATCCTCAACTGTAT
GATGGATCTCGTGACCTAGAATCATTGGCTGAGTTTGTAAAGGAGAGAGGAACC
CACAAAGTGGATGCCCTAGCACTCAGACCAGTCGAGGAAGAAAAGGAAGCTGAA
GAAGAAGCTGAAAGTGAGGCAGACGCTCACGACGAGCTTTAA 42 Pichia pastoris
MQFNWNIKTVASILSALTLAQASDQEAIAPEDSHVVKLTEATFESFITSNPHVL PDI1 Gene
AEFFAPWCGHCKKLGPELVSAAEILKDNEQVKIAQIDCTEEKELCQGYEIKGYP (protein)
TLKVFHGEVEVPSDYQGQRQSQSIVSYMLKQSLPPVSEINATKDLDDTIAEAKE
PVIVQVLPEDASNLESNTTFYGVAGTLREKFTFVSTKSTDYAKKYTSDSTPAYL
LVRPGEEPSVYSGEELDETHLVHWIDIESKPLFGDIDGSTFKSYAEANIPLAYY
FYENEEQRAAAADIIKPFAKEQRGKINFVGLDAVKFGKHAKNLNMDEEKLPLFV
IHDLVSNKKFGVPQDQELTNKDVTELIEKFIAGEAEPIVKSEPIPEIQEEKVFK
LVGKAHDEVVFDESKDVLVKYYAPWCGHCKRMAPAYEELATLYANDEDASSKVV
IAKLDHTLNDVDNVDIQGYPTLILYPAGDKSNPQLYDGSRDLESLAEFVKERGT
HKVDALALRPVEEEKEAEEEAESEADAHDEL 43 human GRP94
GATGATGAAGTTGACGTTGACGGTACTGTTGAAGAGGACTTGGGAAAGTCTAGA Gene (DNA)
GAGGGTTCCAGAACTGACGACGAAGTTGTTCAGAGAGAGGAAGAGGCTATTCAG
TTGGACGGATTGAACGCTTCCCAAATCAGAGAGTTGAGAGAGAAGTCCGAGAAG
TTCGCTTTCCAAGCTGAGGTTAACAGAATGATGAAATTGATTATCAACTCCTTG
TACAAGAACAAAGAGATTTTCTTGAGAGAGTTGATCTCTAACGCTTCTGACGCT
TTGGACAAGATCAGATTGATCTCCTTGACTGACGAAAACGCTTTGTCCGGTAAC
GAAGAGTTGACTGTTAAGATCAAGTGTGACAAAGAGAAGAACTTGTTGCACGTT
ACTGACACTGGTGTTGGAATGACTAGAGAAGAGTTGGTTAAGAACTTGGGTACT
ATCGCTAAGTCTGGTACTTCCGAGTTCTTGAACAAGATGACTGAGGCTCAAGAA
GATGGTCAATCCACTTCCGAGTTGATTGGTCAGTTCGGTGTTGGTTTCTACTCC
GCTTTCTTGGTTGCTGACAAGGTTATCGTTACTTCCAAGCACAACAACGACACT
CAACACATTTGGGAATCCGATTCCAACGAGTTCTCCGTTATTGCTGACCCAAGA
GGTAACACTTTGGGTAGAGGTACTACTATCACTTTGGTTTTGAAAGAAGAGGCT
TCCGACTACTTGGAGTTGGACACTATCAAGAACTTGGTTAAGAAGTACTCCCAG
TTCATCAACTTCCCAATCTATGTTTGGTCCTCCAAGACTGAGACTGTTGAGGAA
CCAATGGAAGAAGAAGAGGCTGCTAAAGAAGAGAAAGAGGAATCTGACGACGAG
GCTGCTGTTGAAGAAGAGGAAGAAGAAAAGAAGCCAAAGACTAAGAAGGTTGAA
AAGACTGTTTGGGACTGGGAGCTTATGAACGACATCAAGCCAATTTGGCAGAGA
CCATCCAAAGAGGTTGAGGAGGACGAGTACAAGGCTTTCTACAAGTCCTTCTCC
AAAGAATCCGATGACCCAATGGCTTACATCCACTTCACTGCTGAGGGTGAAGTT
ACTTTCAAGTCCATCTTGTTCGTTCCAACTTCTGCTCCAAGAGGATTGTTCGAC
GAGTACGGTTCTAAGAAGTCCGACTACATCAAACTTTATGTTAGAAGAGTTTTC
ATCACTGACGACTTCCACGATATGATGCCAAAGTACTTGAACTTCGTTAAGGGT
GTTGTTGATTCCGATGACTTGCCATTGAACGTTTCCAGAGAGACTTTGCAGCAG
CACAAGTTGTTGAAGGTTATCAGAAAGAAACTTGTTAGAAAGACTTTGGACATG
ATCAAGAAGATCGCTGACGACAAGTACAACGACACTTTCTGGAAAGAGTTCGGA
ACTAACATCAAGTTGGGTGTTATTGAGGACCACTCCAACAGAACTAGATTGGCT
AAGTTGTTGAGATTCCAGTCCTCTCATCACCCAACTGACATCACTTCCTTGGAC
CAGTACGTTGAGAGAATGAAAGAGAAGCAGGACAAAATCTACTTCATGGCTGGT
TCCTCTAGAAAAGAGGCTGAATCCTCCCCATTCGTTGAGAGATTGTTGAAGAAG
GGTTACGAGGTTATCTACTTGACTGAGCCAGTTGACGAGTACTGTATCCAGGCT
TTGCCAGAGTTTGACGGAAAGAGATTCCAGAACGTTGCTAAAGAGGGTGTTAAG
TTCGACGAATCCGAAAAGACTAAAGAATCCAGAGAGGCTGTTGAGAAAGAGTTC
GAGCCATTGTTGAACTGGATGAAGGACAAGGCTTTGAAGGACAAGATCGAGAAG
GCTGTTGTTTCCCAGAGATTGACTGAATCCCCATGTGCTTTGGTTGCTTCCCAA
TACGGATGGAGTGGTAACATGGAAAGAATCATGAAGGCTCAGGCTTACCAAACT
GGAAAGGACATCTCCACTAACTACTACGCTTCCCAGAAGAAAACTTTCGAGATC
AACCCAAGACACCCATTGATCAGAGACATGTTGAGAAGAATCAAAGAGGACGAG
GACGACAAGACTGTTTTGGATTTGGCTGTTGTTTTGTTCGAGACTGCTACTTTG
AGATCCGGTTACTTGTTGCCAGACACTAAGGCTTACGGTGACAGAATCGAGAGA
ATGTTGAGATTGTCCTTGAACATTGACCCAGACGCTAAGGTTGAAGAAGAACCA
GAAGAAGAGCCAGAGGAAACTGCTGAAGATACTACTGAGGACACTGAACAAGAC
GAGGACGAAGAGATGGATGTTGGTACTGACGAAGAGGAAGAGACAGCAAAGGAA
TCCACTGCTGAACACGACGAGTTGTAA 44 human GRP94
DDEVDVDGTVEEDLGKSREGSRTDDEVVQREEEAIQLDGLNASQIRELREKSEK Gene
(protein) FAFQAEVNRMMKLIINSLYKNKEIFLRELISNASDALDKIRLISLTDENALSGN
EELTVKIKCDKEKNLLHVTDTGVGMTREELVKNLGTIAKSGTSEFLNKMTEAQE
DGQSTSELIGQFGVGFYSAFLVADKVIVTSKHNNDTQHIWESDSNEFSVIADPR
GNTLGRGTTITLVLKEEASDYLELDTIKNLVKKYSQFINFPIYVWSSKTETVEE
PMEEEEAAKEEKEESDDEAAVEEEEEEKKPKTKKVEKTVWDWELMNDIKPIWQR
PSKEVEEDEYKAFYKSFSKESDDPMAYIHFTAEGEVTFKSILFVPTSAPRGLFD
EYGSKKSDYIKLYVRRVFITDDFHDMMPKYLNFVKGVVDSDDLPLNVSRETLQQ
HKLLKVIRKKLVRKTLDMIKKIADDKYNDTFWKEFGTNIKLGVIEDHSNRTRLA
KLLRFQSSHHPTDITSLDQYVERMKEKQDKIYFMAGSSRKEAESSPFVERLLKK
GYEVIYLTEPVDEYCIQALPEFDGKRFQNVAKEGVKFDESEKTKESREAVEKEF
EPLLNWMKDKALKDKIEKAVVSQRLTESPCALVASQYGWSGNMERIMKAQAYQT
GKDISTNYYASQKKTFEINPRHPLIRDMLRRIKEDEDDKTVLDLAVVLFETATL
RSGYLLPDTKAYGDRIERMLRLSLNIDPDAKVEEEPEEEPEETAEDTTEDTEQD
EDEEMDVGTDEEEETAKESTAEHDEL 45 ProteinA
GAATTCGAAACGATGAGATTCCCATCCATCTTCACTGCTGTTTTGTTCGCTGCT fusion
protein TCTTCTGCTTTGGCGGCCGCTAATGCTGCTCAACACGACGAAGCTCAACAGAAC
(apre-5xBD-Htag)
GCTTTCTACCAGGTTTTGAACATGCCAAACTTGAACGCTGACCAGAGGAATGGT as
EcoRI/SalI TTCATCCAGTCCTTGAAGGATGACCCATCTCAATCCGCTAACGTTTTGGGTGAA
fragment, GCTCAGAAGTTGAACGACAGTCAAGCTCCTAAGGCTGATGCTCAACAAAACAAC
including alpha
TTCAACAAGGACCAGCAATCTGCTTTCTACGAAATCTTGAATATGCCTAATTTG MF pre
signal AACGAGGCTCAGAGAAATGGATTCATTCAATCTTTGAAAGACGACCCATCCCAG
sequence TCTACTAATGTTTTGGGAGAGGCTAAGAAACTTAATGAAAGTCAGGCTCCTAAA
(underlined), 5
GCTGACAACAACTTTAACAAAGAGCAGCAGAACGCTTTTTATGAGATTCTTAAC Fc binding
ATGCCTAACTTGAACGAAGAGCAAAGAAACGGTTTTATTCAATCATTGAAGGAC domains, and
a GATCCTTCACAGTCTGCTAACTTGTTGTCCGAGGCTAAAAAGTTGAACGAATCT HA and 9 x
CAGGCTCCTAAGGCTGATAATAAGTTCAACAAAGAACAACAAAATGCTTTCTAC HIS tag at
the GAGATTTTGCACTTGCCAAATTTGAATGAGGAACAGAGAAACGGTTTTATTCAG
C-terminus. TCATTGAAGGATGACCCTTCCCAATCTGCTAATTTGTTGGCTGAAGCTAAGAAA
TTGAACGACGCTCAGGCTCCAAAAGCTGATAACAAATTCAACAAAGAGCAACAG
AACGCTTTCTACGAAATCTTGCATTTGCCAAACTTGACAGAAGAGCAGAGAAAC
GGATTCATTCAGTCTTTGAAGGATGACCCTTCCGTTTCCAAAGAGATTTTGGCT
GAGGCTAAAAAGTTGAATGATGCTCAAGCTCCAAAAGGTGGTGGTTACCCATAC
GATGTTCCAGATTACGCTGGAGGTCATCATCATCACCACCATCACCATCATGGT GGTGTCGAC 46
Protein A MRFPSIFTAVLFAASSALAAANAAQHDEAQQNAFYQVLNMPNLNADQRNGFIQS
fusion protein;
LKDDPSQSANVLGEAQKLNDSQAPKADAQQNNFNKDQQSAFYEILNMPNLNEAQ
alpha-MF-pre-
RNGFIQSLKDDPSQSTNVLGEAKKLNESQAPKADNNFNKEQQNAFYEILNMPNL signal is
NEEQRNGFIQSLKDDPSQSANLLSEAKKLNESQAPKADNKFNKEQQNAFYEILH underlined
LPNLNEEQRNGFIQSLKDDPSQSANLLAEAKKLNDAQAPKADNKFNKEQQNAFY
EILHLPNLTEEQRNGFIQSLKDDPSVSKEILAEAKKLNDAQAPKGGGYPYDVPD
YAGGHHHHHHHHHGGVD 47 alpha-amylase-
CGGAATTCacgatggtcgcttggtggtctttgtttctgtacggtcttcaggtcg ProtAZZ/up:
ctgcacctgctttggctTCTGGTGGTGTTACTCCAGCTGCTAACGCTGCTCAAC ACG 48
HA-ProtAZZ- GCCTCGAGAGCGTAGTCTGGAACATCGTATGGGTAACCACCACCAGCATC
Xho1ZZ/lp: 49 DNA sequence of
GAATTCacgatggtcgcttggtggtctttgtttctgtacggtcttcaggtcgct the
ZZ-domain as gcacctgctttggctTCTGGTGGTGTTACTCCAGCTGCTAACGCTGCTTCTGGT
EcoRI/XhoI GGTGTTACTCCAGCTGCTAACGCTGCTCAACACGATGAAGCTGTTGACAACAAG
fragment: TTCAACAAAGAGCAGCAGAACGCTTTCTACGAGATCTTGCACTTGCCAAACTTG
Alpha-amylase
AACGAAGAGCAAAGAAACGCTTTCATCCAGTCCTTGAAGGATGACCCATCTCAA sequence
TCCGCTAACTTGTTGGCTGAGGCTAAGAAGTTGAACGACGCTCAAGCTCCAAAG underlined
GTCGACAATAAGTTTAACAAAGAACAACAAAATGCCTTCTACGAAATTCTGCAT
CTGCCCAACCTTAACGAGGAACAGAGAAACGCCTTCATTCAGAGTTTGAAGGAC
GATCCTTCCCAGTCTGCTAATTTGCTTGCCGAAGCCAAGAAATTGAATGATGCC
CAGGCTCCAAAAGTTGATGCTGGTGGTGGTTACCCATACGATGTTCCAGACTAC GCTCTCGAG 50
Protein MVAWWSLFLYGLQVAAPALASGGVTPAANAAQHDEAVDNKFNKEQQNAFYEILH
sequence of the
LPNLNEEQRNAFIQSLKDDPSQSANLLAEAKKLNDAQAPKVDNKFNKEQQNAFY ZZ-domain:
EILHLPNLNEEQRNAFIQSLKDDPSQSANLLAEAKKLNDAQAPKVDAGGGYPYD
Alpha-amylase VPDYALE leader is underlined 51 5Ecoapp:
AACGGAATTCATGAGATTTCCTTCAATTTTTAC 52 3HtagSal
CGATGTCGACGTGATGGTGATGGTGGTGATGATGATGACCACC 53 DNA sequence
GAATTCATGAGATTTCCTTCAATTTTTACTGCTGTTTTATTCGCAGCATCCTCC of the
GCATTAGCTGCTCCAGTCAACACTACAACAGAAGATGAAACGGCACAAATTCCG FcRIII(LF)
as GCTGAAGCTGTCATCGGTTACTCAGATTTAGAAGGGGATTTCGATGTTGCTGTT
EcoRI/SalI TTGCCATTTTCCAACAGCACAAATAACGGGTTATTGTTTATAAATACTACTATT
fragment: GCCAGCATTGCTGCTAAAGAAGAAGGGGTATCTCTCGAGAAAAGAGCTGGAATG
AGAACTGAGGACTTGCCAAAGGCTGTTGTTTTCTTGGAGCCACAGTGGTACAGA
GTTTTGGAGAAGGATTCCGTTACTTTGAAGTGTCAGGGAGCTTACTCTCCAGAA
GATAACTCCACTCAGTGGTTCCACAACGAATCCTTGATTTCTTCTCAGGCTTCC
TCCTACTTCATTGACGCTGCTACTGTTGACGATTCCGGTGAGTACAGATGTCAG
ACTAACTTGTCCACTTTGTCCGACCCAGTTCAATTGGAGGTTCACATCGGTTGG
TTGTTGTTGCAAGCTCCAAGATGGGTTTTCAAGGAGGAGGACCCAATTCATTTG
AGATGTCACTCTTGGAAGAACACTGCTTTGCACAAAGTTACTTACTTGCAGAAC
GGAAAGGGTAGAAAGTATTTCCACCACAACTCCGACTTCTACATCCCAAAGGCT
ACTTTGAAGGATTCCGGTTCCTACTTCTGTAGAGGATTGTTCGGTTCCAAGAAC
GTTTCTTCCGAGACTGTTAACATCACTATCACTCAGGGATTGGCTGTTTCCACT
ATCTCTTCCTTCTTCCCACCAGGTTATCAAGGTGGTGGTCATCATCATCACCAC
CATCACCATCACGTCGAC 54 Protein
MRFPSIFTAVLFAASSALAAPVNTTTEDETAQIPAEAVIGYSDLEGDFDVAVLP sequence of
the FSNSTNNGLLFINTTIASIAAKEEGVSLEKRAGMRTEDLPKAVVFLEPQWYRVL
FcRIII(LF) with
EKDSVTLKCQGAYSPEDNSTQWFHNESLISSQASSYFIDAATVDDSGEYRCQTN alpha MF pre
LSTLSDPVQLEVHIGWLLLQAPRWVFKEEDPIHLRCHSWKNTALHKVTYLQNGK signal
sequence GRKYFHHNSDFYIPKATLKDSGSYFCRGLFGSKNVSSETVNITITQGLAVSTIS and
HIS Tag: SFFPPGYQGGGHHHHHHHHHVD 55 DNA sequence
GAATTCATGAGATTTCCTTCAATTTTTACTGCTGTTTTATTCGCAGCATCCTCC of the
FcRIas GCATTAGCTGCTCCAGTCAACACTACAACAGAAGATGAAACGGCACAAATTCCG
EcoRI/SalI GCTGAAGCTGTCATCGGTTACTCAGATTTAGAAGGGGATTTCGATGTTGCTGTT
fragment: TTGCCATTTTCCAACAGCACAAATAACGGGTTATTGTTTATAAATACTACTATT
GCCAGCATTGCTGCTAAAGAAGAAGGGGTATCTCTCGAGAAAAGAGCTGATACT
ACTAAGGCTGTTATCACTTTGCAACCACCATGGGTTTCCGTTTTCCAGGAGGAG
ACTGTTACTTTGCACTGTGAGGTTTTGCATTTGCCTGGTTCCTCTTCCACTCAG
TGGTTCTTGAACGGTACTGCTACTCAAACTTCCACTCCATCCTACAGAATTACT
TCCGCTTCCGTTAACGATTCTGGTGAGTACAGATGTCAGAGAGGATTGTCTGGT
AGATCCGACCCAATTCAGTTGGAGATTCACAGAGGATGGTTGTTGTTGCAGGTT
TCCTCCAGAGTTTTCACTGAGGGTGAACCATTGGCTTTGAGATGTCACGCTTGG
AAGGACAAGTTGGTTTACAACGTTTTGTACTACAGAAACGGAAAGGCTTTCAAG
TTCTTCCACTGGAACTCCAACTTGACTATCTTGAAAACTAACATCTCCCACAAC
GGTACTTACCACTGTTCTGGAATGGGAAAGCACAGATACACTTCCGCTGGTATC
TCCGTTACTGTTAAGGAGTTGTTCCCAGCTCCAGTTTTGAACGCTTCCGTTACT
TCTCCATTGTTGGAGGGAAACTTGGTTACTTTGTCCTGTGAGACTAAATTGTTG
TTGCAAAGACCAGGATTGCAGTTGTACTTCTCCTTCTACATGGGTTCCAAGACT
TTGAGAGGTAGAAACACTTCCTCCGAGTACCAAATCTTGACTGCTAGAAGAGAG
GATTCCGGTTTGTACTGGTGTGAAGCTGCTACTGAGGACGGTAACGTTTTGAAG
AGATCCCCAGAGTTGGAGTTGCAAGTTTTGGGATTGCAATTGCCAACTCCAGGT
GGTGGTCATCATCATCACCACCATCACCATCACGTCGAC 56 Protein
MRFPSIFTAVLFAASSALAAPVNTTTEDETAQIPAEAVIGYSDLEGDFDVAVLP sequence of
the FSNSTNNGLLFINTTIASIAAKEEGVSLEKRADTTKAVITLQPPWVSVFQEETV FcRI
with alpha TLHCEVLHLPGSSSTQWFLNGTATQTSTPSYRITSASVNDSGEYRCQRGLSGRS
MF pre signal
DPIQLEIHRGWLLLQVSSRVFTEGEPLALRCHAWKDKLVYNVLYYRNGKAFKFF sequence and
HWNSNLTILKTNISHNGTYHCSGMGKHRYTSAGISVTVKELFPAPVLNASVTSP HIS Tag:
LLEGNLVTLSCETKLLLQRPGLQLYFSFYMGSKTLRGRNTSSEYQILTARREDS
GLYWCEAATEDGNVLKRSPELELQVLGLQLPTPGGGHHHHHHHHHVD 57 5gutBglII:
ATTGAGATCTACCCAATTTAGCAGCCTGCATTCTC 58 3gutEcoRI:
GTCAGAATTCATCTGTGGTATAGTGTGAAAAAGTAG 59 DNA sequence
AGATCTACCCAATTTAGCAGCCTGCATTCTCTTGATTTTATGGGGGAAACTAAC GUT1
promoter AATAGTGTTGCCTTGATTTTAAGTGGCATTGTTCTTTGAAATCGAAATTGGGGA
TAACGTCATACCGAAAGGTAAACAACTTCGGGGAATTGCCCTGGTTAAACATTT
ATTAAGCGAGATAAATAGGGGATAGCGAGATAGGGGGCGGAGAAGAAGAAGGGT
GTTAAATTGCTGAAATCTCTCAATCTGGAAGAAACGGAATAAATTAACTCCTTC
CTGAGATAATAAGATCCGACTCTGCTATGACCCCACACGGTACTGACCTCGGCA
TACCCCATTGGATCTGGTGCGAAGCAACAGGTCCTGAAACCTTTATCACGTGTA
GTAGATTGACCTTCCAGCAAAAAAAGGCATTATATATTTTGTTGTTGAAGGGGT
GAGGGGAGGTGCAGGTGGTTCTTTTATTCGTCTTGTAGTTAATTTTCCCGGGGT
TGCGGAGCGTCAAAAGTTTGCCCGATCTGATAGCTTGCAAGATGCCACCGCTTA
TCCAACGCACTTCAGAGAGCTTGCCGTAGAAAGAACGTTTTCCTCGTAGTATTC
CAGCACTTCATGGTGAAGTCGCTATTTCACCGAAGGGGGGGTATTAAGGTTGCG
CACCCCCTCCCCACACCCCAGAATCGTTTATTGGCTGGGTTCAATGGCGTTTGA
GTTAGCACATTTTTTCCTTAAACACCCTCCAAACACGGATAAAAATGCATGTGC
ATCCTGAAACTGGTAGAGATGCGTACTCCGTGCTCCGATAATAACAGTGGTGTT
GGGGTTGCTGTTAGCTCACGCACTCCGTTTTTTTTTCAACCAGCAAAATTCGAT
GGGGAGAAACTTGGGGTACTTTGCCGACTCCTCCACCATACTGGTATATAAATA
ATACTCGCCCACTTTTCGTTTGCTGCTTTTATATTTCAAGGACTGAAAAAGACT
CTTCTTCTACTTTTTCACACTATACCACAGATGAATTC 60 S. cerevisiae
VDQFSNSTSASTDVTSSSSISTSSGSVTITSSEAPESDNGTSTAAPTETSTEAP SED1
(without TTAIPTNGTSTEAPTTAIPTNGTSTEAPTDTTTEAPTTALPTNGTSTEAPTDTT
endogenous TEAPTTGLPTNGTTSAFPPTTSLPPSNTTTTPPYNPSTDYTTDYTVVTEYTTYC
leader sequence
PEPTTFTTNGKTYTVTEPTTLTITDCPCTIEKPTTTSTTEYTVVTEYTTYCPEP
TTFTTNGKTYTVTEPTTLTITDCPCTIEKSEAPESSVPVTESKGTTTKETGVTT
KQTTANPSLTVSTVVPVSSSASSHSVVINSNGANVVVPGALGLAGVAMLFL 61 S.
cerevisiae GTCGACCAATTCTCTAACTCTACTTCCGCTTCCTCTACTGACGTTACTTCCTCC
SED1 DNA TCCTCTATTTCTACTTCCTCCGGTTCCGTTACTATTACTTCCTCTGAGGCTCCA
sequence GAATCTGACAACGGTACTTCTACTGCTGCTCCAACTGAAACTTCTACTGAGGCT
CCTACTACTGCTATTCCAACTAACGGAACTTCCACAGAGGCTCCAACAACAGCT
ATCCCTACAAACGGTACATCCACTGAAGCTCCTACTGACACTACTACAGAAGCT
CCAACTACTGCTTTGCCTACTAATGGTACATCAACAGAGGCTCCTACAGATACA
ACAACTGAAGCTCCAACAACTGGATTGCCAACAAACGGTACTACTTCTGCTTTC
CCACCAACTACTTCCTTGCCACCATCCAACACTACTACTACTCCACCATACAAC
CCATCCACTGACTACACTACTGACTACACAGTTGTTACTGAGTACACTACTTAC
TGTCCAGAGCCAACTACTTTCACAACAAACGGAAAGACTTACACTGTTACTGAG
CCTACTACTTTGACTATCACTGACTGTCCATGTACTATCGAGAAGCCAACTACT
ACTTCCACTACAGAGTATACTGTTGTTACAGAATACACAACATATTGTCCTGAG
CCAACAACATTCACTACTAATGGAAAAACATACACAGTTACAGAACCAACTACA
TTGACAATTACAGATTGTCCTTGTACAATTGAGAAGTCCGAGGCTCCTGAATCT
TCTGTTCCAGTTACTGAATCCAAGGGTACTACTACTAAAGAAACTGGTGTTACT
ACTAAGCAGACTACTGCTAACCCATCCTTGACTGTTTCCACTGTTGTTCCAGTT
TCTTCCTCTGCTTCTTCCCACTCCGTTGTTATCAACTCCAACGGTGCTAACGTT
GTTGTTCCTGGTGCTTTGGGATTGGCTGGTGTTGCTATGTTGTTCTTGTAA
[0208] While the present invention is described herein with
reference to illustrated embodiments, it should be understood that
the invention is not limited hereto. Those having ordinary skill in
the art and access to the teachings herein will recognize
additional modifications and embodiments within the scope thereof.
Therefore, the present invention is limited only by the claims
attached herein.
Sequence CWU 1
1
61130DNAArtificial SequencePCR primer hPDI/UP1 1agcgctgacg
cccccgagga ggaggaccac 30242DNAArtificial SequencePCR primer
hPDI/LP-PacI 2ccttaattaa ttacagttca tcatgcacag ctttctgatc at
42347DNAArtificial SequencePCR primer PB248 3atgaattcag gccatatcgg
ccattgttta ctgtgcgccc acagtag 47435DNAArtificial SequencePCR primer
PB249 4atgtttaaac gtgaggatta ctggtgatga aagac 35534DNAArtificial
SequencePCR primer PB250 5agactagtct atttggagac attgacggat ccac
34646DNAArtificial SequencePCR primer PB251 6atctcgagag gccatgcagg
ccaaccacaa gatgaatcaa attttg 46734DNAArtificial SequencePCR primer
PpPDI/UPi-1 7ggtgaggttg aggtcccaag tgactatcaa ggtc
34834DNAArtificial SequencePCR primer PpPDI/LPi-1 8gaccttgata
gtcacttggg acctcaacct cacc 34934DNAArtificial SequencePCR primer
PpPDI/UPi-2 9cgccaatgat gaggatgcct cttcaaaggt tgtg
341034DNAArtificial SequencePCR primer PpPDI/LPi-2 10cacaaccttt
gaagaggcat cctcatcatt ggcg 341124DNAArtificial SequencePCR primer
PpPDI-5'/UP 11ggcgattgca ttcgcgactg tatc 241223DNAArtificial
SequencePCR primer hPDI-3'/LP 12cctagagagc ggtggccaag atg
231325DNAArtificial SequencePCR primer hPDI/UP 13gtggccacac
cagggggcat ggaac 251423DNAArtificial SequencePCR primer hPDI-3'/LP
14cctagagagc ggtggccaag atg 231537DNAArtificial SequencePCR primer
hGRP94/UP1 15agcgctgacg atgaagttga tgtggatggt acagtag
371638DNAArtificial SequencePCR primer hGRP94/LP1 16ggccggcctt
acaattcatc atgttcagct gtagattc 381724DNAArtificial SequencePCR
primer PMT1-KO1 17tgaacccatc tgtaaataga atgc 241845DNAArtificial
SequencePCR primer PMT1-KO2 18gtgtcaccta aatcgtatgt gcccatttac
tggaagctgc taacc 451945DNAArtificial SequencePCR primer PMT1-KO3
19ctccctatag tgagtcgtat tcatcattgt actttggtat attgg
452024DNAArtificial SequencePCR primer PMT1-KO4 20tatttgtacc
tgcgtcctgt ttgc 242121DNAArtificial SequencePCR primer PR29
21cacatacgat ttaggtgaca c 212221DNAArtificial SequencePCR primer
PR32 22aatacgactc actataggga g 212324DNAArtificial SequencePCR
primer PMT4-KO1 23tgctctccgc gtgcaataga aact 242445DNAArtificial
SequencePCR primer PMT4-KO2 24ctccctatag tgagtcgtat tcacagtgta
ccatctttca tctcc 452545DNAArtificial SequencePCR primer PMT4-KO3
25gtgtcaccta aatcgtatgt gaacctaact ctaattcttc aaagc
452624DNAArtificial SequencePCR primer PMT4-KO4 26actagggtat
ataattccca aggt 242757DNAArtificial SequencePre-pro alpha-mating
factor signal peptide (ScaMTprepro) (DNA) 27atgagattcc catccatctt
cactgctgtt ttgttcgctg cttcttctgc tttggct 572819PRTArtificial
SequencePre-pro alpha-mating factor signal peptide (protein) 28Met
Arg Phe Pro Ser Ile Phe Thr Ala Val Leu Phe Ala Ala Ser Ser1 5 10
15 Ala Leu Ala291353DNAArtificial SequenceAnti-Her2 Heavy chain (VH
+ IgG1 constant region) 29gaggttcagt tggttgaatc tggaggagga
ttggttcaac ctggtggttc tttgagattg 60tcctgtgctg cttccggttt caacatcaag
gacacttaca tccactgggt tagacaagct 120ccaggaaagg gattggagtg
ggttgctaga atctacccaa ctaacggtta cacaagatac 180gctgactccg
ttaagggaag attcactatc tctgctgaca cttccaagaa cactgcttac
240ttgcagatga actccttgag agctgaggat actgctgttt actactgttc
cagatggggt 300ggtgatggtt tctacgctat ggactactgg ggtcaaggaa
ctttggttac tgtttcctcc 360gcttctacta agggaccatc tgttttccca
ttggctccat cttctaagtc tacttccggt 420ggtactgctg ctttgggatg
tttggttaaa gactacttcc cagagccagt tactgtttct 480tggaactccg
gtgctttgac ttctggtgtt cacactttcc cagctgtttt gcaatcttcc
540ggtttgtact ctttgtcctc cgttgttact gttccatcct cttccttggg
tactcagact 600tacatctgta acgttaacca caagccatcc aacactaagg
ttgacaagaa ggttgagcca 660aagtcctgtg acaagactca tacttgtcca
ccatgtccag ctccagaatt gttgggtggt 720ccttccgttt ttttgttccc
accaaagcca aaggacactt tgatgatctc cagaactcca 780gaggttacat
gtgttgttgt tgacgtttct cacgaggacc cagaggttaa gttcaactgg
840tacgttgacg gtgttgaagt tcacaacgct aagactaagc caagagagga
gcagtacaac 900tccacttaca gagttgtttc cgttttgact gttttgcacc
aggattggtt gaacggaaag 960gagtacaagt gtaaggtttc caacaaggct
ttgccagctc caatcgaaaa gactatctcc 1020aaggctaagg gtcaaccaag
agagccacag gtttacactt tgccaccatc cagagatgag 1080ttgactaaga
accaggtttc cttgacttgt ttggttaagg gattctaccc atccgacatt
1140gctgttgaat gggagtctaa cggtcaacca gagaacaact acaagactac
tccacctgtt 1200ttggactctg acggttcctt tttcttgtac tccaagttga
ctgttgacaa gtccagatgg 1260caacagggta acgttttctc ctgttccgtt
atgcatgagg ctttgcacaa ccactacact 1320caaaagtcct tgtctttgtc
ccctggtaag taa 135330450PRTArtificial SequenceAnti-Her2 Heavy chain
(VH + IgG1 constant region) 30Glu Val Gln Leu Val Glu Ser Gly Gly
Gly Leu Val Gln Pro Gly Gly1 5 10 15 Ser Leu Arg Leu Ser Cys Ala
Ala Ser Gly Phe Asn Ile Lys Asp Thr 20 25 30 Tyr Ile His Trp Val
Arg Gln Ala Pro Gly Lys Gly Leu Glu Trp Val 35 40 45 Ala Arg Ile
Tyr Pro Thr Asn Gly Tyr Thr Arg Tyr Ala Asp Ser Val 50 55 60 Lys
Gly Arg Phe Thr Ile Ser Ala Asp Thr Ser Lys Asn Thr Ala Tyr65 70 75
80 Leu Gln Met Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr Cys
85 90 95 Ser Arg Trp Gly Gly Asp Gly Phe Tyr Ala Met Asp Tyr Trp
Gly Gln 100 105 110 Gly Thr Leu Val Thr Val Ser Ser Ala Ser Thr Lys
Gly Pro Ser Val 115 120 125 Phe Pro Leu Ala Pro Ser Ser Lys Ser Thr
Ser Gly Gly Thr Ala Ala 130 135 140 Leu Gly Cys Leu Val Lys Asp Tyr
Phe Pro Glu Pro Val Thr Val Ser145 150 155 160 Trp Asn Ser Gly Ala
Leu Thr Ser Gly Val His Thr Phe Pro Ala Val 165 170 175 Leu Gln Ser
Ser Gly Leu Tyr Ser Leu Ser Ser Val Val Thr Val Pro 180 185 190 Ser
Ser Ser Leu Gly Thr Gln Thr Tyr Ile Cys Asn Val Asn His Lys 195 200
205 Pro Ser Asn Thr Lys Val Asp Lys Lys Val Glu Pro Lys Ser Cys Asp
210 215 220 Lys Thr His Thr Cys Pro Pro Cys Pro Ala Pro Glu Leu Leu
Gly Gly225 230 235 240 Pro Ser Val Phe Leu Phe Pro Pro Lys Pro Lys
Asp Thr Leu Met Ile 245 250 255 Ser Arg Thr Pro Glu Val Thr Cys Val
Val Val Asp Val Ser His Glu 260 265 270 Asp Pro Glu Val Lys Phe Asn
Trp Tyr Val Asp Gly Val Glu Val His 275 280 285 Asn Ala Lys Thr Lys
Pro Arg Glu Glu Gln Tyr Asn Ser Thr Tyr Arg 290 295 300 Val Val Ser
Val Leu Thr Val Leu His Gln Asp Trp Leu Asn Gly Lys305 310 315 320
Glu Tyr Lys Cys Lys Val Ser Asn Lys Ala Leu Pro Ala Pro Ile Glu 325
330 335 Lys Thr Ile Ser Lys Ala Lys Gly Gln Pro Arg Glu Pro Gln Val
Tyr 340 345 350 Thr Leu Pro Pro Ser Arg Asp Glu Leu Thr Lys Asn Gln
Val Ser Leu 355 360 365 Thr Cys Leu Val Lys Gly Phe Tyr Pro Ser Asp
Ile Ala Val Glu Trp 370 375 380 Glu Ser Asn Gly Gln Pro Glu Asn Asn
Tyr Lys Thr Thr Pro Pro Val385 390 395 400 Leu Asp Ser Asp Gly Ser
Phe Phe Leu Tyr Ser Lys Leu Thr Val Asp 405 410 415 Lys Ser Arg Trp
Gln Gln Gly Asn Val Phe Ser Cys Ser Val Met His 420 425 430 Glu Ala
Leu His Asn His Tyr Thr Gln Lys Ser Leu Ser Leu Ser Pro 435 440 445
Gly Lys 450 31645DNAArtificial SequenceAnti-Her2 light chain (VL +
Kappa constant region) 31gacatccaaa tgactcaatc cccatcttct
ttgtctgctt ccgttggtga cagagttact 60atcacttgta gagcttccca ggacgttaat
actgctgttg cttggtatca acagaagcca 120ggaaaggctc caaagttgtt
gatctactcc gcttccttct tgtactctgg tgttccatcc 180agattctctg
gttccagatc cggtactgac ttcactttga ctatctcctc cttgcaacca
240gaagatttcg ctacttacta ctgtcagcag cactacacta ctccaccaac
tttcggacag 300ggtactaagg ttgagatcaa gagaactgtt gctgctccat
ccgttttcat tttcccacca 360tccgacgaac agttgaagtc tggtacagct
tccgttgttt gtttgttgaa caacttctac 420ccaagagagg ctaaggttca
gtggaaggtt gacaacgctt tgcaatccgg taactcccaa 480gaatccgtta
ctgagcaaga ctctaaggac tccacttact ccttgtcctc cactttgact
540ttgtccaagg ctgattacga gaagcacaag gtttacgctt gtgaggttac
acatcagggt 600ttgtcctccc cagttactaa gtccttcaac agaggagagt gttaa
64532213PRTArtificial SequenceAnti-Her2 light chain (VL + Kappa
constant region) 32Asp Ile Gln Met Thr Gln Ser Pro Ser Ser Leu Ser
Ala Ser Val Gly1 5 10 15 Asp Arg Val Thr Ile Thr Cys Arg Ala Ser
Gln Asp Val Asn Thr Ala 20 25 30 Val Ala Trp Tyr Gln Gln Lys Pro
Gly Lys Ala Pro Lys Leu Leu Ile 35 40 45 Tyr Ser Ala Ser Phe Leu
Tyr Ser Gly Val Pro Ser Arg Phe Ser Gly 50 55 60 Ser Arg Ser Gly
Thr Asp Phe Thr Leu Thr Ile Ser Ser Leu Gln Pro65 70 75 80 Glu Asp
Phe Ala Thr Tyr Tyr Cys Gln Gln His Tyr Thr Thr Pro Pro 85 90 95
Thr Phe Gly Gln Gly Thr Lys Val Glu Ile Lys Arg Thr Val Ala Ala 100
105 110 Pro Ser Val Phe Ile Phe Pro Pro Ser Asp Glu Gln Leu Lys Ser
Gly 115 120 125 Thr Ala Ser Val Val Cys Leu Asn Asn Phe Tyr Pro Arg
Glu Ala Lys 130 135 140 Val Gln Trp Lys Val Asp Asn Ala Leu Gln Ser
Gly Asn Ser Gln Glu145 150 155 160 Ser Val Thr Glu Gln Asp Ser Lys
Asp Ser Thr Tyr Ser Leu Ser Ser 165 170 175 Thr Leu Thr Leu Ser Lys
Ala Asp Tyr Glu Lys His Lys Val Tyr Ala 180 185 190 Cys Glu Val Thr
His Gln Gly Leu Ser Ser Pro Val Thr Lys Ser Phe 195 200 205 Asn Arg
Gly Glu Cys 210 3360DNAArtificial SequenceAlpha amylase signal
peptide (from Aspergillus niger -amylase) 33atggttgctt ggtggtcctt
gttcttgtac ggattgcaag ttgctgctcc agctttggct 603420PRTArtificial
SequenceAlpha amylase signal peptide (from Aspergillus niger
-amylase) 34Met Val Ala Trp Trp Ser Leu Phe Leu Tyr Gly Leu Gln Val
Ala Ala1 5 10 15 Pro Ala Leu Ala 20 35397DNAArtificial
SequenceAnti-CD20 Light chain Variable Region 35gagatcgttt
tgacacagtc cccagctact ttgtctttgt ccccaggtga aagagctaca 60ttgtcctgta
gagcttccca atctgtttcc tcctacttgg cttggtatca acaaaagcca
120ggacaggctc caagattgtt gatctacgac gcttccaata gagctactgg
tatcccagct 180agattctctg gttctggttc cggtactgac ttcactttga
ctatctcttc cttggaacca 240gaggacttcg ctgtttacta ctgtcagcag
agatccaatt ggccattgac tttcggtggt 300ggtactaagg ttgagatcaa
gcgtacggtt gctgctcctt ccgttttcat tttcccacca 360tccgacgaac
aattgaagtc tggtacccaa ttcgccc 39736132PRTArtificial
SequenceAnti-CD20 Light chain Variable Region 36Glu Ile Val Leu Thr
Gln Ser Pro Ala Thr Leu Ser Leu Ser Pro Gly1 5 10 15 Glu Arg Ala
Thr Leu Ser Cys Arg Ala Ser Gln Ser Val Ser Ser Tyr 20 25 30 Leu
Ala Trp Tyr Gln Gln Lys Pro Gly Gln Ala Pro Arg Leu Leu Ile 35 40
45 Tyr Asp Ala Ser Asn Arg Ala Thr Gly Ile Pro Ala Arg Phe Ser Gly
50 55 60 Ser Gly Ser Gly Thr Asp Phe Thr Leu Thr Ile Ser Ser Leu
Glu Pro65 70 75 80 Glu Asp Phe Ala Val Tyr Tyr Cys Gln Gln Arg Ser
Asn Trp Pro Leu 85 90 95 Thr Phe Gly Gly Gly Thr Lys Val Glu Ile
Lys Arg Thr Val Ala Ala 100 105 110 Pro Ser Val Phe Ile Phe Pro Pro
Ser Asp Glu Gln Leu Lys Ser Gly 115 120 125 Thr Gln Phe Ala 130
37445DNAArtificial SequenceAnti-CD20 Heavy chain Variable Region
37gctgttcagc tggttgaatc tggtggtgga ttggttcaac ctggtagatc cttgagattg
60tcctgtgctg cttccggttt tactttcggt gactacacta tgcactgggt tagacaagct
120ccaggaaagg gattggaatg ggtttccggt atttcttgga actccggttc
cattggttac 180gctgattccg ttaagggaag attcactatc tccagagaca
acgctaagaa ctccttgtac 240ttgcagatga actccttgag agctgaggat
actgctttgt actactgtac taaggacaac 300caatacggtt ctggttccac
ttacggattg ggagtttggg gacagggaac tttggttact 360gtctcgagtg
cttctactaa gggaccatcc gtttttccat tggctccatc ctctaagtct
420acttccggtg gtacccaatt cgccc 44538148PRTArtificial
SequenceAnti-CD20 Heavy chain Variable Region 38Ala Val Gln Leu Val
Glu Ser Gly Gly Gly Leu Val Gln Pro Gly Arg1 5 10 15 Ser Leu Arg
Leu Ser Cys Ala Ala Ser Gly Phe Thr Phe Gly Asp Tyr 20 25 30 Thr
Met His Trp Val Arg Gln Ala Pro Gly Lys Gly Leu Glu Trp Val 35 40
45 Ser Gly Ile Ser Trp Asn Ser Gly Ser Ile Gly Tyr Ala Asp Ser Val
50 55 60 Lys Gly Arg Phe Thr Ile Ser Arg Asp Asn Ala Lys Asn Ser
Leu Tyr65 70 75 80 Leu Gln Met Asn Ser Leu Arg Ala Glu Asp Thr Ala
Leu Tyr Tyr Cys 85 90 95 Thr Lys Asp Asn Gln Tyr Gly Ser Gly Ser
Thr Tyr Gly Leu Gly Val 100 105 110 Trp Gly Gln Gly Thr Leu Val Thr
Val Ser Ser Ala Ser Thr Lys Gly 115 120 125 Pro Ser Val Phe Pro Leu
Ala Pro Ser Ser Lys Ser Thr Ser Gly Gly 130 135 140 Thr Gln Phe
Ala145 391476DNAArtificial SequencePart of human PDI gene
39gacgcccccg aggaggagga ccacgtcttg gtgctgcgga aaagcaactt cgcggaggcg
60ctggcggccc acaagtaccc gccggtggag ttccatgccc cctggtgtgg ccactgcaag
120gctctggccc ctgagtatgc caaagccgct gggaagctga aggcagaagg
ttccgagatc 180aggttggcca aggtggacgc cacggaggag tctgacctag
cccagcagta cggcgtgcgc 240ggctatccca ccatcaagtt cttcaggaat
ggagacacgg cttcccccaa ggaatataca 300gctggcagag aggctgatga
catcgtgaac tggctgaaga agcgcacggg cccggctgcc 360accaccctgc
ctgacggcgc agctgcagag tccttggtgg agtccagcga ggtggccgtc
420atcggcttct tcaaggacgt ggagtcggac tctgccaagc agtttttgca
ggcagcagag 480gccatcgatg acataccatt tgggatcact tccaacagtg
acgtgttctc caaataccag 540ctcgacaaag atggggttgt cctctttaag
aagtttgatg aaggccggaa caactttgaa 600ggggaggtca ccaaggagaa
cctgctggac tttatcaaac acaaccagct gccccttgtc 660atcgagttca
ccgagcagac agccccgaag atttttggag gtgaaatcaa gactcacatc
720ctgctgttct tgcccaagag tgtgtctgac tatgacggca aactgagcaa
cttcaaaaca 780gcagccgaga gcttcaaggg caagatcctg ttcatcttca
tcgacagcga ccacaccgac 840aaccagcgca tcctcgagtt ctttggcctg
aagaaggaag agtgcccggc cgtgcgcctc 900atcaccttgg aggaggagat
gaccaagtac aagcccgaat cggaggagct gacggcagag 960aggatcacag
agttctgcca ccgcttcctg gagggcaaaa tcaagcccca cctgatgagc
1020caggagctgc cggaggactg ggacaagcag cctgtcaagg tgcttgttgg
gaagaacttt 1080gaagacgtgg cttttgatga gaaaaaaaac gtctttgtgg
agttctatgc cccatggtgt 1140ggtcactgca aacagttggc tcccatttgg
gataaactgg gagagacgta caaggaccat 1200gagaacatcg tcatcgccaa
gatggactcg actgccaacg aggtggaggc cgtcaaagtg 1260cacggcttcc
ccacactcgg gttctttcct gccagtgccg acaggacggt cattgattac
1320aacggggaac gcacgctgga tggttttaag aaattcctag agagcggtgg
ccaagatggg 1380gcaggggatg ttgacgacct cgaggacctc gaagaagcag
aggagccaga catggaggaa 1440gacgatgacc agaaagctgt gaaagatgaa ctgtaa
147640491PRTArtificial SequencePart of human PDI gene 40Asp Ala Pro
Glu Glu Glu Asp His Val Leu Val Leu Arg Lys Ser Asn1 5 10 15 Phe
Ala Glu Ala Leu Ala Ala His Lys Tyr Pro Pro Val Glu Phe His 20 25
30 Ala Pro Trp Cys Gly His Cys Lys Ala Leu Ala Pro Glu Tyr Ala Lys
35 40 45 Ala Ala Gly Lys Leu Lys Ala Glu Gly Ser Glu Ile Arg Leu
Ala Lys 50 55 60 Val Asp Ala Thr Glu Glu Ser Asp Leu Ala Gln Gln
Tyr Gly Val Arg65 70
75 80 Gly Tyr Pro Thr Ile Lys Phe Phe Arg Asn Gly Asp Thr Ala Ser
Pro 85 90 95 Lys Glu Tyr Thr Ala Gly Arg Glu Ala Asp Asp Ile Val
Asn Trp Leu 100 105 110 Lys Lys Arg Thr Gly Pro Ala Ala Thr Thr Leu
Pro Asp Gly Ala Ala 115 120 125 Ala Glu Ser Leu Val Glu Ser Ser Glu
Val Ala Val Ile Gly Phe Phe 130 135 140 Lys Asp Val Glu Ser Asp Ser
Ala Lys Gln Phe Leu Gln Ala Ala Glu145 150 155 160 Ala Ile Asp Asp
Ile Pro Phe Gly Ile Thr Ser Asn Ser Asp Val Phe 165 170 175 Ser Lys
Tyr Gln Leu Asp Lys Asp Gly Val Val Leu Phe Lys Lys Phe 180 185 190
Asp Glu Gly Arg Asn Asn Phe Glu Gly Glu Val Thr Lys Glu Asn Leu 195
200 205 Leu Asp Phe Ile Lys His Asn Gln Leu Pro Leu Val Ile Glu Phe
Thr 210 215 220 Glu Gln Thr Ala Pro Lys Ile Phe Gly Gly Glu Ile Lys
Thr His Ile225 230 235 240 Leu Leu Phe Leu Pro Lys Ser Val Ser Asp
Tyr Asp Gly Lys Leu Ser 245 250 255 Asn Phe Lys Thr Ala Ala Glu Ser
Phe Lys Gly Lys Ile Leu Phe Ile 260 265 270 Phe Ile Asp Ser Asp His
Thr Asp Asn Gln Arg Ile Leu Glu Phe Phe 275 280 285 Gly Leu Lys Lys
Glu Glu Cys Pro Ala Val Arg Leu Ile Thr Leu Glu 290 295 300 Glu Glu
Met Thr Lys Tyr Lys Pro Glu Ser Glu Glu Leu Thr Ala Glu305 310 315
320 Arg Ile Thr Glu Phe Cys His Arg Phe Leu Glu Gly Lys Ile Lys Pro
325 330 335 His Leu Met Ser Gln Glu Leu Pro Glu Asp Trp Asp Lys Gln
Pro Val 340 345 350 Lys Val Leu Val Gly Lys Asn Phe Glu Asp Val Ala
Phe Asp Glu Lys 355 360 365 Lys Asn Val Phe Val Glu Phe Tyr Ala Pro
Trp Cys Gly His Cys Lys 370 375 380 Gln Leu Ala Pro Ile Trp Asp Lys
Leu Gly Glu Thr Tyr Lys Asp His385 390 395 400 Glu Asn Ile Val Ile
Ala Lys Met Asp Ser Thr Ala Asn Glu Val Glu 405 410 415 Ala Val Lys
Val His Gly Phe Pro Thr Leu Gly Phe Phe Pro Ala Ser 420 425 430 Ala
Asp Arg Thr Val Ile Asp Tyr Asn Gly Glu Arg Thr Leu Asp Gly 435 440
445 Phe Lys Lys Phe Leu Glu Ser Gly Gly Gln Asp Gly Ala Gly Asp Val
450 455 460 Asp Asp Leu Glu Asp Leu Glu Glu Ala Glu Glu Pro Asp Met
Glu Glu465 470 475 480 Asp Asp Asp Gln Lys Ala Val His Asp Glu Leu
485 490 411554DNAPichia pastoris 41atgcaattca actggaatat taaaactgtg
gcaagtattt tgtccgctct cacactagca 60caagcaagtg atcaggaggc tattgctcca
gaggactctc atgtcgtcaa attgactgaa 120gccacttttg agtctttcat
caccagtaat cctcacgttt tggcagagtt ttttgcccct 180tggtgtggtc
actgtaagaa gttgggccct gaacttgttt ctgctgccga gatcttaaag
240gacaatgagc aggttaagat tgctcaaatt gattgtacgg aggagaagga
attatgtcaa 300ggctacgaaa ttaaagggta tcctactttg aaggtgttcc
atggtgaggt tgaggtccca 360agtgactatc aaggtcaaag acagagccaa
agcattgtca gctatatgct aaagcagagt 420ttaccccctg tcagtgaaat
caatgcaacc aaagatttag acgacacaat cgccgaggca 480aaagagcccg
tgattgtgca agtactaccg gaagatgcat ccaacttgga atctaacacc
540acattttacg gagttgccgg tactctcaga gagaaattca cttttgtctc
cactaagtct 600actgattatg ccaaaaaata cactagcgac tcgactcctg
cctatttgct tgtcagacct 660ggcgaggaac ctagtgttta ctctggtgag
gagttagatg agactcattt ggtgcactgg 720attgatattg agtccaaacc
tctatttgga gacattgacg gatccacctt caaatcatat 780gctgaagcta
acatcccttt agcctactat ttctatgaga acgaagaaca acgtgctgct
840gctgccgata ttattaaacc ttttgctaaa gagcaacgtg gcaaaattaa
ctttgttggc 900ttagatgccg ttaaattcgg taagcatgcc aagaacttaa
acatggatga agagaaactc 960cctctatttg tcattcatga tttggtgagc
aacaagaagt ttggagttcc tcaagaccaa 1020gaattgacga acaaagatgt
gaccgagctg attgagaaat tcatcgcagg agaggcagaa 1080ccaattgtga
aatcagagcc aattccagaa attcaagaag agaaagtctt caagctagtc
1140ggaaaggccc acgatgaagt tgtcttcgat gaatctaaag atgttctagt
caagtactac 1200gccccttggt gtggtcactg taagagaatg gctcctgctt
atgaggaatt ggctactctt 1260tacgccaatg atgaggatgc ctcttcaaag
gttgtgattg caaaacttga tcacactttg 1320aacgatgtcg acaacgttga
tattcaaggt tatcctactt tgatccttta tccagctggt 1380gataaatcca
atcctcaact gtatgatgga tctcgtgacc tagaatcatt ggctgagttt
1440gtaaaggaga gaggaaccca caaagtggat gccctagcac tcagaccagt
cgaggaagaa 1500aaggaagctg aagaagaagc tgaaagtgag gcagacgctc
acgacgagct ttaa 155442517PRTPichia pastoris 42Met Gln Phe Asn Trp
Asn Ile Lys Thr Val Ala Ser Ile Leu Ser Ala1 5 10 15 Leu Thr Leu
Ala Gln Ala Ser Asp Gln Glu Ala Ile Ala Pro Glu Asp 20 25 30 Ser
His Val Val Lys Leu Thr Glu Ala Thr Phe Glu Ser Phe Ile Thr 35 40
45 Ser Asn Pro His Val Leu Ala Glu Phe Phe Ala Pro Trp Cys Gly His
50 55 60 Cys Lys Lys Leu Gly Pro Glu Leu Val Ser Ala Ala Glu Ile
Leu Lys65 70 75 80 Asp Asn Glu Gln Val Lys Ile Ala Gln Ile Asp Cys
Thr Glu Glu Lys 85 90 95 Glu Leu Cys Gln Gly Tyr Glu Ile Lys Gly
Tyr Pro Thr Leu Lys Val 100 105 110 Phe His Gly Glu Val Glu Val Pro
Ser Asp Tyr Gln Gly Gln Arg Gln 115 120 125 Ser Gln Ser Ile Val Ser
Tyr Met Leu Lys Gln Ser Leu Pro Pro Val 130 135 140 Ser Glu Ile Asn
Ala Thr Lys Asp Leu Asp Asp Thr Ile Ala Glu Ala145 150 155 160 Lys
Glu Pro Val Ile Val Gln Val Leu Pro Glu Asp Ala Ser Asn Leu 165 170
175 Glu Ser Asn Thr Thr Phe Tyr Gly Val Ala Gly Thr Leu Arg Glu Lys
180 185 190 Phe Thr Phe Val Ser Thr Lys Ser Thr Asp Tyr Ala Lys Lys
Tyr Thr 195 200 205 Ser Asp Ser Thr Pro Ala Tyr Leu Leu Val Arg Pro
Gly Glu Glu Pro 210 215 220 Ser Val Tyr Ser Gly Glu Glu Leu Asp Glu
Thr His Leu Val His Trp225 230 235 240 Ile Asp Ile Glu Ser Lys Pro
Leu Phe Gly Asp Ile Asp Gly Ser Thr 245 250 255 Phe Lys Ser Tyr Ala
Glu Ala Asn Ile Pro Leu Ala Tyr Tyr Phe Tyr 260 265 270 Glu Asn Glu
Glu Gln Arg Ala Ala Ala Ala Asp Ile Ile Lys Pro Phe 275 280 285 Ala
Lys Glu Gln Arg Gly Lys Ile Asn Phe Val Gly Leu Asp Ala Val 290 295
300 Lys Phe Gly Lys His Ala Lys Asn Leu Asn Met Asp Glu Glu Lys
Leu305 310 315 320 Pro Leu Phe Val Ile His Asp Leu Val Ser Asn Lys
Lys Phe Gly Val 325 330 335 Pro Gln Asp Gln Glu Leu Thr Asn Lys Asp
Val Thr Glu Leu Ile Glu 340 345 350 Lys Phe Ile Ala Gly Glu Ala Glu
Pro Ile Val Lys Ser Glu Pro Ile 355 360 365 Pro Glu Ile Gln Glu Glu
Lys Val Phe Lys Leu Val Gly Lys Ala His 370 375 380 Asp Glu Val Val
Phe Asp Glu Ser Lys Asp Val Leu Val Lys Tyr Tyr385 390 395 400 Ala
Pro Trp Cys Gly His Cys Lys Arg Met Ala Pro Ala Tyr Glu Glu 405 410
415 Leu Ala Thr Leu Tyr Ala Asn Asp Glu Asp Ala Ser Ser Lys Val Val
420 425 430 Ile Ala Lys Leu Asp His Thr Leu Asn Asp Val Asp Asn Val
Asp Ile 435 440 445 Gln Gly Tyr Pro Thr Leu Ile Leu Tyr Pro Ala Gly
Asp Lys Ser Asn 450 455 460 Pro Gln Leu Tyr Asp Gly Ser Arg Asp Leu
Glu Ser Leu Ala Glu Phe465 470 475 480 Val Lys Glu Arg Gly Thr His
Lys Val Asp Ala Leu Ala Leu Arg Pro 485 490 495 Val Glu Glu Glu Lys
Glu Ala Glu Glu Glu Ala Glu Ser Glu Ala Asp 500 505 510 Ala His Asp
Glu Leu 515 432349DNAArtificial SequencePart of human GRP94 Gene
43gatgatgaag ttgacgttga cggtactgtt gaagaggact tgggaaagtc tagagagggt
60tccagaactg acgacgaagt tgttcagaga gaggaagagg ctattcagtt ggacggattg
120aacgcttccc aaatcagaga gttgagagag aagtccgaga agttcgcttt
ccaagctgag 180gttaacagaa tgatgaaatt gattatcaac tccttgtaca
agaacaaaga gattttcttg 240agagagttga tctctaacgc ttctgacgct
ttggacaaga tcagattgat ctccttgact 300gacgaaaacg ctttgtccgg
taacgaagag ttgactgtta agatcaagtg tgacaaagag 360aagaacttgt
tgcacgttac tgacactggt gttggaatga ctagagaaga gttggttaag
420aacttgggta ctatcgctaa gtctggtact tccgagttct tgaacaagat
gactgaggct 480caagaagatg gtcaatccac ttccgagttg attggtcagt
tcggtgttgg tttctactcc 540gctttcttgg ttgctgacaa ggttatcgtt
acttccaagc acaacaacga cactcaacac 600atttgggaat ccgattccaa
cgagttctcc gttattgctg acccaagagg taacactttg 660ggtagaggta
ctactatcac tttggttttg aaagaagagg cttccgacta cttggagttg
720gacactatca agaacttggt taagaagtac tcccagttca tcaacttccc
aatctatgtt 780tggtcctcca agactgagac tgttgaggaa ccaatggaag
aagaagaggc tgctaaagaa 840gagaaagagg aatctgacga cgaggctgct
gttgaagaag aggaagaaga aaagaagcca 900aagactaaga aggttgaaaa
gactgtttgg gactgggagc ttatgaacga catcaagcca 960atttggcaga
gaccatccaa agaggttgag gaggacgagt acaaggcttt ctacaagtcc
1020ttctccaaag aatccgatga cccaatggct tacatccact tcactgctga
gggtgaagtt 1080actttcaagt ccatcttgtt cgttccaact tctgctccaa
gaggattgtt cgacgagtac 1140ggttctaaga agtccgacta catcaaactt
tatgttagaa gagttttcat cactgacgac 1200ttccacgata tgatgccaaa
gtacttgaac ttcgttaagg gtgttgttga ttccgatgac 1260ttgccattga
acgtttccag agagactttg cagcagcaca agttgttgaa ggttatcaga
1320aagaaacttg ttagaaagac tttggacatg atcaagaaga tcgctgacga
caagtacaac 1380gacactttct ggaaagagtt cggaactaac atcaagttgg
gtgttattga ggaccactcc 1440aacagaacta gattggctaa gttgttgaga
ttccagtcct ctcatcaccc aactgacatc 1500acttccttgg accagtacgt
tgagagaatg aaagagaagc aggacaaaat ctacttcatg 1560gctggttcct
ctagaaaaga ggctgaatcc tccccattcg ttgagagatt gttgaagaag
1620ggttacgagg ttatctactt gactgagcca gttgacgagt actgtatcca
ggctttgcca 1680gagtttgacg gaaagagatt ccagaacgtt gctaaagagg
gtgttaagtt cgacgaatcc 1740gaaaagacta aagaatccag agaggctgtt
gagaaagagt tcgagccatt gttgaactgg 1800atgaaggaca aggctttgaa
ggacaagatc gagaaggctg ttgtttccca gagattgact 1860gaatccccat
gtgctttggt tgcttcccaa tacggatgga gtggtaacat ggaaagaatc
1920atgaaggctc aggcttacca aactggaaag gacatctcca ctaactacta
cgcttcccag 1980aagaaaactt tcgagatcaa cccaagacac ccattgatca
gagacatgtt gagaagaatc 2040aaagaggacg aggacgacaa gactgttttg
gatttggctg ttgttttgtt cgagactgct 2100actttgagat ccggttactt
gttgccagac actaaggctt acggtgacag aatcgagaga 2160atgttgagat
tgtccttgaa cattgaccca gacgctaagg ttgaagaaga accagaagaa
2220gagccagagg aaactgctga agatactact gaggacactg aacaagacga
ggacgaagag 2280atggatgttg gtactgacga agaggaagag acagcaaagg
aatccactgc tgaacacgac 2340gagttgtaa 234944782PRTArtificial
SequencePart of human GRP94 Gene 44Asp Asp Glu Val Asp Val Asp Gly
Thr Val Glu Glu Asp Leu Gly Lys1 5 10 15 Ser Arg Glu Gly Ser Arg
Thr Asp Asp Glu Val Val Gln Arg Glu Glu 20 25 30 Glu Ala Ile Gln
Leu Asp Gly Leu Asn Ala Ser Gln Ile Arg Glu Leu 35 40 45 Arg Glu
Lys Ser Glu Lys Phe Ala Phe Gln Ala Glu Val Asn Arg Met 50 55 60
Met Lys Leu Ile Ile Asn Ser Leu Tyr Lys Asn Lys Glu Ile Phe Leu65
70 75 80 Arg Glu Leu Ile Ser Asn Ala Ser Asp Ala Leu Asp Lys Ile
Arg Leu 85 90 95 Ile Ser Leu Thr Asp Glu Asn Ala Leu Ser Gly Asn
Glu Glu Leu Thr 100 105 110 Val Lys Ile Lys Cys Asp Lys Glu Lys Asn
Leu Leu His Val Thr Asp 115 120 125 Thr Gly Val Gly Met Thr Arg Glu
Glu Leu Val Lys Asn Leu Gly Thr 130 135 140 Ile Ala Lys Ser Gly Thr
Ser Glu Phe Leu Asn Lys Met Thr Glu Ala145 150 155 160 Gln Glu Asp
Gly Gln Ser Thr Ser Glu Leu Ile Gly Gln Phe Gly Val 165 170 175 Gly
Phe Tyr Ser Ala Phe Leu Val Ala Asp Lys Val Ile Val Thr Ser 180 185
190 Lys His Asn Asn Asp Thr Gln His Ile Trp Glu Ser Asp Ser Asn Glu
195 200 205 Phe Ser Val Ile Ala Asp Pro Arg Gly Asn Thr Leu Gly Arg
Gly Thr 210 215 220 Thr Ile Thr Leu Val Leu Lys Glu Glu Ala Ser Asp
Tyr Leu Glu Leu225 230 235 240 Asp Thr Ile Lys Asn Leu Val Lys Lys
Tyr Ser Gln Phe Ile Asn Phe 245 250 255 Pro Ile Tyr Val Trp Ser Ser
Lys Thr Glu Thr Val Glu Glu Pro Met 260 265 270 Glu Glu Glu Glu Ala
Ala Lys Glu Glu Lys Glu Glu Ser Asp Asp Glu 275 280 285 Ala Ala Val
Glu Glu Glu Glu Glu Glu Lys Lys Pro Lys Thr Lys Lys 290 295 300 Val
Glu Lys Thr Val Trp Asp Trp Glu Leu Met Asn Asp Ile Lys Pro305 310
315 320 Ile Trp Gln Arg Pro Ser Lys Glu Val Glu Glu Asp Glu Tyr Lys
Ala 325 330 335 Phe Tyr Lys Ser Phe Ser Lys Glu Ser Asp Asp Pro Met
Ala Tyr Ile 340 345 350 His Phe Thr Ala Glu Gly Glu Val Thr Phe Lys
Ser Ile Leu Phe Val 355 360 365 Pro Thr Ser Ala Pro Arg Gly Leu Phe
Asp Glu Tyr Gly Ser Lys Lys 370 375 380 Ser Asp Tyr Ile Lys Leu Tyr
Val Arg Arg Val Phe Ile Thr Asp Asp385 390 395 400 Phe His Asp Met
Met Pro Lys Tyr Leu Asn Phe Val Lys Gly Val Val 405 410 415 Asp Ser
Asp Asp Leu Pro Leu Asn Val Ser Arg Glu Thr Leu Gln Gln 420 425 430
His Lys Leu Leu Lys Val Ile Arg Lys Lys Leu Val Arg Lys Thr Leu 435
440 445 Asp Met Ile Lys Lys Ile Ala Asp Asp Lys Tyr Asn Asp Thr Phe
Trp 450 455 460 Lys Glu Phe Gly Thr Asn Ile Lys Leu Gly Val Ile Glu
Asp His Ser465 470 475 480 Asn Arg Thr Arg Leu Ala Lys Leu Leu Arg
Phe Gln Ser Ser His His 485 490 495 Pro Thr Asp Ile Thr Ser Leu Asp
Gln Tyr Val Glu Arg Met Lys Glu 500 505 510 Lys Gln Asp Lys Ile Tyr
Phe Met Ala Gly Ser Ser Arg Lys Glu Ala 515 520 525 Glu Ser Ser Pro
Phe Val Glu Arg Leu Leu Lys Lys Gly Tyr Glu Val 530 535 540 Ile Tyr
Leu Thr Glu Pro Val Asp Glu Tyr Cys Ile Gln Ala Leu Pro545 550 555
560 Glu Phe Asp Gly Lys Arg Phe Gln Asn Val Ala Lys Glu Gly Val Lys
565 570 575 Phe Asp Glu Ser Glu Lys Thr Lys Glu Ser Arg Glu Ala Val
Glu Lys 580 585 590 Glu Phe Glu Pro Leu Leu Asn Trp Met Lys Asp Lys
Ala Leu Lys Asp 595 600 605 Lys Ile Glu Lys Ala Val Val Ser Gln Arg
Leu Thr Glu Ser Pro Cys 610 615 620 Ala Leu Val Ala Ser Gln Tyr Gly
Trp Ser Gly Asn Met Glu Arg Ile625 630 635 640 Met Lys Ala Gln Ala
Tyr Gln Thr Gly Lys Asp Ile Ser Thr Asn Tyr 645 650 655 Tyr Ala Ser
Gln Lys Lys Thr Phe Glu Ile Asn Pro Arg His Pro Leu 660 665 670 Ile
Arg Asp Met Leu Arg Arg Ile Lys Glu Asp Glu Asp Asp Lys Thr 675 680
685 Val Leu Asp Leu Ala Val Val Leu Phe Glu Thr Ala Thr Leu Arg Ser
690 695 700 Gly Tyr Leu Leu Pro Asp Thr Lys Ala Tyr Gly Asp Arg Ile
Glu Arg705 710 715 720 Met Leu Arg Leu Ser Leu Asn Ile Asp Pro Asp
Ala Lys Val Glu Glu 725 730 735 Glu Pro Glu Glu Glu Pro Glu Glu Thr
Ala Glu Asp Thr Thr Glu Asp 740 745 750 Thr Glu Gln
Asp Glu Asp Glu Glu Met Asp Val Gly Thr Asp Glu Glu 755 760 765 Glu
Glu Thr Ala Lys Glu Ser Thr Ala Glu His Asp Glu Leu 770 775 780
451035DNAArtificial SequenceProteinA fusion protein
(apre-5xBD-Htag) as EcoRI/SalI fragment, including alpha MF pre
signal sequence, 5 Fc binding domains, and a HA and 9 x HIS tag at
the C-terminus 45gaattcgaaa cgatgagatt cccatccatc ttcactgctg
ttttgttcgc tgcttcttct 60gctttggcgg ccgctaatgc tgctcaacac gacgaagctc
aacagaacgc tttctaccag 120gttttgaaca tgccaaactt gaacgctgac
cagaggaatg gtttcatcca gtccttgaag 180gatgacccat ctcaatccgc
taacgttttg ggtgaagctc agaagttgaa cgacagtcaa 240gctcctaagg
ctgatgctca acaaaacaac ttcaacaagg accagcaatc tgctttctac
300gaaatcttga atatgcctaa tttgaacgag gctcagagaa atggattcat
tcaatctttg 360aaagacgacc catcccagtc tactaatgtt ttgggagagg
ctaagaaact taatgaaagt 420caggctccta aagctgacaa caactttaac
aaagagcagc agaacgcttt ttatgagatt 480cttaacatgc ctaacttgaa
cgaagagcaa agaaacggtt ttattcaatc attgaaggac 540gatccttcac
agtctgctaa cttgttgtcc gaggctaaaa agttgaacga atctcaggct
600cctaaggctg ataataagtt caacaaagaa caacaaaatg ctttctacga
gattttgcac 660ttgccaaatt tgaatgagga acagagaaac ggttttattc
agtcattgaa ggatgaccct 720tcccaatctg ctaatttgtt ggctgaagct
aagaaattga acgacgctca ggctccaaaa 780gctgataaca aattcaacaa
agagcaacag aacgctttct acgaaatctt gcatttgcca 840aacttgacag
aagagcagag aaacggattc attcagtctt tgaaggatga cccttccgtt
900tccaaagaga ttttggctga ggctaaaaag ttgaatgatg ctcaagctcc
aaaaggtggt 960ggttacccat acgatgttcc agattacgct ggaggtcatc
atcatcacca ccatcaccat 1020catggtggtg tcgac 103546341PRTArtificial
SequenceProteinA fusion protein (apre-5xBD-Htag) as EcoRI/SalI
fragment, including alpha MF pre signal sequence, 5 Fc binding
domains, and a HA and 9 x HIS tag at the C-terminus 46Met Arg Phe
Pro Ser Ile Phe Thr Ala Val Leu Phe Ala Ala Ser Ser1 5 10 15 Ala
Leu Ala Ala Ala Asn Ala Ala Gln His Asp Glu Ala Gln Gln Asn 20 25
30 Ala Phe Tyr Gln Val Leu Asn Met Pro Asn Leu Asn Ala Asp Gln Arg
35 40 45 Asn Gly Phe Ile Gln Ser Leu Lys Asp Asp Pro Ser Gln Ser
Ala Asn 50 55 60 Val Leu Gly Glu Ala Gln Lys Leu Asn Asp Ser Gln
Ala Pro Lys Ala65 70 75 80 Asp Ala Gln Gln Asn Asn Phe Asn Lys Asp
Gln Gln Ser Ala Phe Tyr 85 90 95 Glu Ile Leu Asn Met Pro Asn Leu
Asn Glu Ala Gln Arg Asn Gly Phe 100 105 110 Ile Gln Ser Leu Lys Asp
Asp Pro Ser Gln Ser Thr Asn Val Leu Gly 115 120 125 Glu Ala Lys Lys
Leu Asn Glu Ser Gln Ala Pro Lys Ala Asp Asn Asn 130 135 140 Phe Asn
Lys Glu Gln Gln Asn Ala Phe Tyr Glu Ile Leu Asn Met Pro145 150 155
160 Asn Leu Asn Glu Glu Gln Arg Asn Gly Phe Ile Gln Ser Leu Lys Asp
165 170 175 Asp Pro Ser Gln Ser Ala Asn Leu Leu Ser Glu Ala Lys Lys
Leu Asn 180 185 190 Glu Ser Gln Ala Pro Lys Ala Asp Asn Lys Phe Asn
Lys Glu Gln Gln 195 200 205 Asn Ala Phe Tyr Glu Ile Leu His Leu Pro
Asn Leu Asn Glu Glu Gln 210 215 220 Arg Asn Gly Phe Ile Gln Ser Leu
Lys Asp Asp Pro Ser Gln Ser Ala225 230 235 240 Asn Leu Leu Ala Glu
Ala Lys Lys Leu Asn Asp Ala Gln Ala Pro Lys 245 250 255 Ala Asp Asn
Lys Phe Asn Lys Glu Gln Gln Asn Ala Phe Tyr Glu Ile 260 265 270 Leu
His Leu Pro Asn Leu Thr Glu Glu Gln Arg Asn Gly Phe Ile Gln 275 280
285 Ser Leu Lys Asp Asp Pro Ser Val Ser Lys Glu Ile Leu Ala Glu Ala
290 295 300 Lys Lys Leu Asn Asp Ala Gln Ala Pro Lys Gly Gly Gly Tyr
Pro Tyr305 310 315 320 Asp Val Pro Asp Tyr Ala Gly Gly His His His
His His His His His 325 330 335 His Gly Gly Val Asp 340
47111DNAArtificial Sequencealpha-amylase-ProtAZZ/up primer
47cggaattcac gatggtcgct tggtggtctt tgtttctgta cggtcttcag gtcgctgcac
60ctgctttggc ttctggtggt gttactccag ctgctaacgc tgctcaacac g
1114850DNAArtificial SequenceHA-ProtAZZ-Xho1ZZ/lp primer
48gcctcgagag cgtagtctgg aacatcgtat gggtaaccac caccagcatc
5049516DNAArtificial SequenceDNA sequence of the ZZ-domain as
EcoRI/XhoI fragment with Alpha-amylase sequence 49gaattcacga
tggtcgcttg gtggtctttg tttctgtacg gtcttcaggt cgctgcacct 60gctttggctt
ctggtggtgt tactccagct gctaacgctg ctcaacacga tgaagctgtt
120gacaacaagt tcaacaaaga gcagcagaac gctttctacg agatcttgca
cttgccaaac 180ttgaacgaag agcaaagaaa cgctttcatc cagtccttga
aggatgaccc atctcaatcc 240gctaacttgt tggctgaggc taagaagttg
aacgacgctc aagctccaaa ggtcgacaat 300aagtttaaca aagaacaaca
aaatgccttc tacgaaattc tgcatctgcc caaccttaac 360gaggaacaga
gaaacgcctt cattcagagt ttgaaggacg atccttccca gtctgctaat
420ttgcttgccg aagccaagaa attgaatgat gcccaggctc caaaagttga
tgctggtggt 480ggttacccat acgatgttcc agactacgct ctcgag
51650169PRTArtificial SequenceProtein sequence of the ZZ-domain
with Alpha-amylase leader 50Met Val Ala Trp Trp Ser Leu Phe Leu Tyr
Gly Leu Gln Val Ala Ala1 5 10 15 Pro Ala Leu Ala Ser Gly Gly Val
Thr Pro Ala Ala Asn Ala Ala Gln 20 25 30 His Asp Glu Ala Val Asp
Asn Lys Phe Asn Lys Glu Gln Gln Asn Ala 35 40 45 Phe Tyr Glu Ile
Leu His Leu Pro Asn Leu Asn Glu Glu Gln Arg Asn 50 55 60 Ala Phe
Ile Gln Ser Leu Lys Asp Asp Pro Ser Gln Ser Ala Asn Leu65 70 75 80
Leu Ala Glu Ala Lys Lys Leu Asn Asp Ala Gln Ala Pro Lys Val Asp 85
90 95 Asn Lys Phe Asn Lys Glu Gln Gln Asn Ala Phe Tyr Glu Ile Leu
His 100 105 110 Leu Pro Asn Leu Asn Glu Glu Gln Arg Asn Ala Phe Ile
Gln Ser Leu 115 120 125 Lys Asp Asp Pro Ser Gln Ser Ala Asn Leu Leu
Ala Glu Ala Lys Lys 130 135 140 Leu Asn Asp Ala Gln Ala Pro Lys Val
Asp Ala Gly Gly Gly Tyr Pro145 150 155 160 Tyr Asp Val Pro Asp Tyr
Ala Leu Glu 165 5133DNAArtificial Sequence5Ecoapp primer
51aacggaattc atgagatttc cttcaatttt tac 335243DNAArtificial
Sequence3HtagSal primer 52cgatgtcgac gtgatggtga tggtggtgat
gatgatgacc acc 4353882DNAArtificial SequenceFcRIII(LF) as
EcoRI/SalI fragment 53gaattcatga gatttccttc aatttttact gctgttttat
tcgcagcatc ctccgcatta 60gctgctccag tcaacactac aacagaagat gaaacggcac
aaattccggc tgaagctgtc 120atcggttact cagatttaga aggggatttc
gatgttgctg ttttgccatt ttccaacagc 180acaaataacg ggttattgtt
tataaatact actattgcca gcattgctgc taaagaagaa 240ggggtatctc
tcgagaaaag agctggaatg agaactgagg acttgccaaa ggctgttgtt
300ttcttggagc cacagtggta cagagttttg gagaaggatt ccgttacttt
gaagtgtcag 360ggagcttact ctccagaaga taactccact cagtggttcc
acaacgaatc cttgatttct 420tctcaggctt cctcctactt cattgacgct
gctactgttg acgattccgg tgagtacaga 480tgtcagacta acttgtccac
tttgtccgac ccagttcaat tggaggttca catcggttgg 540ttgttgttgc
aagctccaag atgggttttc aaggaggagg acccaattca tttgagatgt
600cactcttgga agaacactgc tttgcacaaa gttacttact tgcagaacgg
aaagggtaga 660aagtatttcc accacaactc cgacttctac atcccaaagg
ctactttgaa ggattccggt 720tcctacttct gtagaggatt gttcggttcc
aagaacgttt cttccgagac tgttaacatc 780actatcactc agggattggc
tgtttccact atctcttcct tcttcccacc aggttatcaa 840ggtggtggtc
atcatcatca ccaccatcac catcacgtcg ac 88254292PRTArtificial
SequenceFcRIII(LF) with alpha MF pre signal sequence and HIS Tag
54Met Arg Phe Pro Ser Ile Phe Thr Ala Val Leu Phe Ala Ala Ser Ser1
5 10 15 Ala Leu Ala Ala Pro Val Asn Thr Thr Thr Glu Asp Glu Thr Ala
Gln 20 25 30 Ile Pro Ala Glu Ala Val Ile Gly Tyr Ser Asp Leu Glu
Gly Asp Phe 35 40 45 Asp Val Ala Val Leu Pro Phe Ser Asn Ser Thr
Asn Asn Gly Leu Leu 50 55 60 Phe Ile Asn Thr Thr Ile Ala Ser Ile
Ala Ala Lys Glu Glu Gly Val65 70 75 80 Ser Leu Glu Lys Arg Ala Gly
Met Arg Thr Glu Asp Leu Pro Lys Ala 85 90 95 Val Val Phe Leu Glu
Pro Gln Trp Tyr Arg Val Leu Glu Lys Asp Ser 100 105 110 Val Thr Leu
Lys Cys Gln Gly Ala Tyr Ser Pro Glu Asp Asn Ser Thr 115 120 125 Gln
Trp Phe His Asn Glu Ser Leu Ile Ser Ser Gln Ala Ser Ser Tyr 130 135
140 Phe Ile Asp Ala Ala Thr Val Asp Asp Ser Gly Glu Tyr Arg Cys
Gln145 150 155 160 Thr Asn Leu Ser Thr Leu Ser Asp Pro Val Gln Leu
Glu Val His Ile 165 170 175 Gly Trp Leu Leu Leu Gln Ala Pro Arg Trp
Val Phe Lys Glu Glu Asp 180 185 190 Pro Ile His Leu Arg Cys His Ser
Trp Lys Asn Thr Ala Leu His Lys 195 200 205 Val Thr Tyr Leu Gln Asn
Gly Lys Gly Arg Lys Tyr Phe His His Asn 210 215 220 Ser Asp Phe Tyr
Ile Pro Lys Ala Thr Leu Lys Asp Ser Gly Ser Tyr225 230 235 240 Phe
Cys Arg Gly Leu Phe Gly Ser Lys Asn Val Ser Ser Glu Thr Val 245 250
255 Asn Ile Thr Ile Thr Gln Gly Leu Ala Val Ser Thr Ile Ser Ser Phe
260 265 270 Phe Pro Pro Gly Tyr Gln Gly Gly Gly His His His His His
His His 275 280 285 His His Val Asp 290 551119DNAArtificial
SequenceFcRIas EcoRI/SalI fragment 55gaattcatga gatttccttc
aatttttact gctgttttat tcgcagcatc ctccgcatta 60gctgctccag tcaacactac
aacagaagat gaaacggcac aaattccggc tgaagctgtc 120atcggttact
cagatttaga aggggatttc gatgttgctg ttttgccatt ttccaacagc
180acaaataacg ggttattgtt tataaatact actattgcca gcattgctgc
taaagaagaa 240ggggtatctc tcgagaaaag agctgatact actaaggctg
ttatcacttt gcaaccacca 300tgggtttccg ttttccagga ggagactgtt
actttgcact gtgaggtttt gcatttgcct 360ggttcctctt ccactcagtg
gttcttgaac ggtactgcta ctcaaacttc cactccatcc 420tacagaatta
cttccgcttc cgttaacgat tctggtgagt acagatgtca gagaggattg
480tctggtagat ccgacccaat tcagttggag attcacagag gatggttgtt
gttgcaggtt 540tcctccagag ttttcactga gggtgaacca ttggctttga
gatgtcacgc ttggaaggac 600aagttggttt acaacgtttt gtactacaga
aacggaaagg ctttcaagtt cttccactgg 660aactccaact tgactatctt
gaaaactaac atctcccaca acggtactta ccactgttct 720ggaatgggaa
agcacagata cacttccgct ggtatctccg ttactgttaa ggagttgttc
780ccagctccag ttttgaacgc ttccgttact tctccattgt tggagggaaa
cttggttact 840ttgtcctgtg agactaaatt gttgttgcaa agaccaggat
tgcagttgta cttctccttc 900tacatgggtt ccaagacttt gagaggtaga
aacacttcct ccgagtacca aatcttgact 960gctagaagag aggattccgg
tttgtactgg tgtgaagctg ctactgagga cggtaacgtt 1020ttgaagagat
ccccagagtt ggagttgcaa gttttgggat tgcaattgcc aactccaggt
1080ggtggtcatc atcatcacca ccatcaccat cacgtcgac
111956371PRTArtificial SequenceFcRI with alpha MF pre signal
sequence and HIS Tag 56Met Arg Phe Pro Ser Ile Phe Thr Ala Val Leu
Phe Ala Ala Ser Ser1 5 10 15 Ala Leu Ala Ala Pro Val Asn Thr Thr
Thr Glu Asp Glu Thr Ala Gln 20 25 30 Ile Pro Ala Glu Ala Val Ile
Gly Tyr Ser Asp Leu Glu Gly Asp Phe 35 40 45 Asp Val Ala Val Leu
Pro Phe Ser Asn Ser Thr Asn Asn Gly Leu Leu 50 55 60 Phe Ile Asn
Thr Thr Ile Ala Ser Ile Ala Ala Lys Glu Glu Gly Val65 70 75 80 Ser
Leu Glu Lys Arg Ala Asp Thr Thr Lys Ala Val Ile Thr Leu Gln 85 90
95 Pro Pro Trp Val Ser Val Phe Gln Glu Glu Thr Val Thr Leu His Cys
100 105 110 Glu Val Leu His Leu Pro Gly Ser Ser Ser Thr Gln Trp Phe
Leu Asn 115 120 125 Gly Thr Ala Thr Gln Thr Ser Thr Pro Ser Tyr Arg
Ile Thr Ser Ala 130 135 140 Ser Val Asn Asp Ser Gly Glu Tyr Arg Cys
Gln Arg Gly Leu Ser Gly145 150 155 160 Arg Ser Asp Pro Ile Gln Leu
Glu Ile His Arg Gly Trp Leu Leu Leu 165 170 175 Gln Val Ser Ser Arg
Val Phe Thr Glu Gly Glu Pro Leu Ala Leu Arg 180 185 190 Cys His Ala
Trp Lys Asp Lys Leu Val Tyr Asn Val Leu Tyr Tyr Arg 195 200 205 Asn
Gly Lys Ala Phe Lys Phe Phe His Trp Asn Ser Asn Leu Thr Ile 210 215
220 Leu Lys Thr Asn Ile Ser His Asn Gly Thr Tyr His Cys Ser Gly
Met225 230 235 240 Gly Lys His Arg Tyr Thr Ser Ala Gly Ile Ser Val
Thr Val Lys Glu 245 250 255 Leu Phe Pro Ala Pro Val Leu Asn Ala Ser
Val Thr Ser Pro Leu Leu 260 265 270 Glu Gly Asn Leu Val Thr Leu Ser
Cys Glu Thr Lys Leu Leu Leu Gln 275 280 285 Arg Pro Gly Leu Gln Leu
Tyr Phe Ser Phe Tyr Met Gly Ser Lys Thr 290 295 300 Leu Arg Gly Arg
Asn Thr Ser Ser Glu Tyr Gln Ile Leu Thr Ala Arg305 310 315 320 Arg
Glu Asp Ser Gly Leu Tyr Trp Cys Glu Ala Ala Thr Glu Asp Gly 325 330
335 Asn Val Leu Lys Arg Ser Pro Glu Leu Glu Leu Gln Val Leu Gly Leu
340 345 350 Gln Leu Pro Thr Pro Gly Gly Gly His His His His His His
His His 355 360 365 His Val Asp 370 5735DNAArtificial
Sequence5gutBglII primer 57attgagatct acccaattta gcagcctgca ttctc
355836DNAArtificial Sequence3gutEcoRI primer 58gtcagaattc
atctgtggta tagtgtgaaa aagtag 36591010DNAArtificial SequencePichia
pastoris GUT1 promoter 59agatctaccc aatttagcag cctgcattct
cttgatttta tgggggaaac taacaatagt 60gttgccttga ttttaagtgg cattgttctt
tgaaatcgaa attggggata acgtcatacc 120gaaaggtaaa caacttcggg
gaattgccct ggttaaacat ttattaagcg agataaatag 180gggatagcga
gatagggggc ggagaagaag aagggtgtta aattgctgaa atctctcaat
240ctggaagaaa cggaataaat taactccttc ctgagataat aagatccgac
tctgctatga 300ccccacacgg tactgacctc ggcatacccc attggatctg
gtgcgaagca acaggtcctg 360aaacctttat cacgtgtagt agattgacct
tccagcaaaa aaaggcatta tatattttgt 420tgttgaaggg gtgaggggag
gtgcaggtgg ttcttttatt cgtcttgtag ttaattttcc 480cggggttgcg
gagcgtcaaa agtttgcccg atctgatagc ttgcaagatg ccaccgctta
540tccaacgcac ttcagagagc ttgccgtaga aagaacgttt tcctcgtagt
attccagcac 600ttcatggtga agtcgctatt tcaccgaagg gggggtatta
aggttgcgca ccccctcccc 660acaccccaga atcgtttatt ggctgggttc
aatggcgttt gagttagcac attttttcct 720taaacaccct ccaaacacgg
ataaaaatgc atgtgcatcc tgaaactggt agagatgcgt 780actccgtgct
ccgataataa cagtggtgtt ggggttgctg ttagctcacg cactccgttt
840ttttttcaac cagcaaaatt cgatggggag aaacttgggg tactttgccg
actcctccac 900catactggta tataaataat actcgcccac ttttcgtttg
ctgcttttat atttcaagga 960ctgaaaaaga ctcttcttct actttttcac
actataccac agatgaattc 101060322PRTArtificial SequenceS. cerevisiae
SED1 (without endogenous leader sequence 60Val Asp Gln Phe Ser Asn
Ser Thr Ser Ala Ser Ser Thr Asp Val Thr1 5 10 15 Ser Ser Ser Ser
Ile Ser Thr Ser Ser Gly Ser Val Thr Ile Thr Ser 20 25 30 Ser Glu
Ala Pro Glu Ser Asp Asn Gly Thr Ser Thr Ala Ala Pro Thr 35 40 45
Glu Thr Ser Thr Glu Ala Pro Thr Thr Ala Ile Pro Thr Asn Gly Thr 50
55 60 Ser Thr Glu Ala Pro Thr Thr Ala Ile Pro Thr Asn Gly Thr Ser
Thr65 70 75 80 Glu Ala Pro Thr Asp Thr Thr Thr Glu Ala Pro Thr Thr
Ala Leu Pro 85 90 95 Thr Asn Gly Thr Ser Thr Glu Ala Pro Thr Asp
Thr Thr Thr Glu Ala 100 105 110 Pro Thr Thr Gly Leu Pro Thr Asn Gly
Thr Thr Ser Ala Phe Pro Pro 115 120 125 Thr Thr Ser Leu Pro Pro Ser
Asn Thr Thr Thr Thr Pro Pro Tyr Asn 130 135 140
Pro Ser Thr Asp Tyr Thr Thr Asp Tyr Thr Val Val Thr Glu Tyr Thr145
150 155 160 Thr Tyr Cys Pro Glu Pro Thr Thr Phe Thr Thr Asn Gly Lys
Thr Tyr 165 170 175 Thr Val Thr Glu Pro Thr Thr Leu Thr Ile Thr Asp
Cys Pro Cys Thr 180 185 190 Ile Glu Lys Pro Thr Thr Thr Ser Thr Thr
Glu Tyr Thr Val Val Thr 195 200 205 Glu Tyr Thr Thr Tyr Cys Pro Glu
Pro Thr Thr Phe Thr Thr Asn Gly 210 215 220 Lys Thr Tyr Thr Val Thr
Glu Pro Thr Thr Leu Thr Ile Thr Asp Cys225 230 235 240 Pro Cys Thr
Ile Glu Lys Ser Glu Ala Pro Glu Ser Ser Val Pro Val 245 250 255 Thr
Glu Ser Lys Gly Thr Thr Thr Lys Glu Thr Gly Val Thr Thr Lys 260 265
270 Gln Thr Thr Ala Asn Pro Ser Leu Thr Val Ser Thr Val Val Pro Val
275 280 285 Ser Ser Ser Ala Ser Ser His Ser Val Val Ile Asn Ser Asn
Gly Ala 290 295 300 Asn Val Val Val Pro Gly Ala Leu Gly Leu Ala Gly
Val Ala Met Leu305 310 315 320 Phe Leu61969DNAArtificial SequenceS.
cerevisiae SED1 (without endogenous leader sequence 61gtcgaccaat
tctctaactc tacttccgct tcctctactg acgttacttc ctcctcctct 60atttctactt
cctccggttc cgttactatt acttcctctg aggctccaga atctgacaac
120ggtacttcta ctgctgctcc aactgaaact tctactgagg ctcctactac
tgctattcca 180actaacggaa cttccacaga ggctccaaca acagctatcc
ctacaaacgg tacatccact 240gaagctccta ctgacactac tacagaagct
ccaactactg ctttgcctac taatggtaca 300tcaacagagg ctcctacaga
tacaacaact gaagctccaa caactggatt gccaacaaac 360ggtactactt
ctgctttccc accaactact tccttgccac catccaacac tactactact
420ccaccataca acccatccac tgactacact actgactaca cagttgttac
tgagtacact 480acttactgtc cagagccaac tactttcaca acaaacggaa
agacttacac tgttactgag 540cctactactt tgactatcac tgactgtcca
tgtactatcg agaagccaac tactacttcc 600actacagagt atactgttgt
tacagaatac acaacatatt gtcctgagcc aacaacattc 660actactaatg
gaaaaacata cacagttaca gaaccaacta cattgacaat tacagattgt
720ccttgtacaa ttgagaagtc cgaggctcct gaatcttctg ttccagttac
tgaatccaag 780ggtactacta ctaaagaaac tggtgttact actaagcaga
ctactgctaa cccatccttg 840actgtttcca ctgttgttcc agtttcttcc
tctgcttctt cccactccgt tgttatcaac 900tccaacggtg ctaacgttgt
tgttcctggt gctttgggat tggctggtgt tgctatgttg 960ttcttgtaa 969
* * * * *