U.S. patent number RE42,130 [Application Number 11/986,963] was granted by the patent office on 2011-02-08 for phagemid vectors.
This patent grant is currently assigned to Alexion Pharmaceuticals, Inc.. Invention is credited to Katherine S. Bowdish, Shana Frederickson, Martha Wild.
United States Patent |
RE42,130 |
Bowdish , et al. |
February 8, 2011 |
Phagemid vectors
Abstract
Phagemid vectors containing a sequence of features between a Col
E1 origin and an f1 origin are useful for display of polypeptides
or proteins, including antibody libraries.
Inventors: |
Bowdish; Katherine S. (Del Mar,
CA), Frederickson; Shana (Solana Beach, CA), Wild;
Martha (San Diego, CA) |
Assignee: |
Alexion Pharmaceuticals, Inc.
(Cheshire, CT)
|
Family
ID: |
23102530 |
Appl.
No.: |
11/986,963 |
Filed: |
November 26, 2007 |
Related U.S. Patent Documents
|
|
|
|
|
|
|
Application
Number |
Filing Date |
Patent Number |
Issue Date |
|
|
60287355 |
Apr 27, 2001 |
|
|
|
Reissue of: |
10134188 |
Apr 26, 2002 |
06803230 |
Oct 12, 2004 |
|
|
Current U.S.
Class: |
435/320.1;
536/23.7; 536/23.2; 435/69.1; 435/5; 536/23.4; 536/23.1;
536/24.1 |
Current CPC
Class: |
C12N
15/1037 (20130101); C40B 40/02 (20130101); C12N
15/70 (20130101) |
Current International
Class: |
C12N
15/63 (20060101); C12N 15/65 (20060101); C07H
21/04 (20060101); C12N 15/73 (20060101); C12N
15/70 (20060101); C12N 15/64 (20060101) |
References Cited
[Referenced By]
U.S. Patent Documents
|
|
|
5658727 |
August 1997 |
Barbas et al. |
6113896 |
September 2000 |
Lazarus et al. |
6346394 |
February 2002 |
Mitsuzumi et al. |
|
Foreign Patent Documents
|
|
|
|
|
|
|
WO-95/11317 |
|
Apr 1995 |
|
WO |
|
WO-2000/71694 |
|
Nov 2000 |
|
WO |
|
WO-2004/078937 |
|
Sep 2004 |
|
WO |
|
Other References
Mattanovich et al. Optimization of recombinant gene expression in
Escherichia coli. Annals of the New York Academy of Sciences, vol.
782, pp. 182-190, 1996. cited by examiner .
Alting-Mees, Ma et al., Methods in Enzymology "PBluescriptll:
multifunctional cloning and mapping vectors", vol. 216, pp. 483-495
(1992). cited by other .
Barbas III et al., "Pharge Display: A Laboratory Manual", Cold
Spring Harbor Press, (2001). cited by other .
Genbank Entry Number AF268280.1 (sequence of pComb3H vector)
available on National Center fro Biological Information database
(2000). cited by other .
Genbank Entry Number AF268280.1 (sequence of pComb3X vector)
available on National Center fro Biological Information database
(2000). cited by other .
Krebber, A. et al., Inclusion of an upstream transcriptional
terminator in phage display vector abolishes background expression
of toxic fusions with coat protein g3p Gene, 178 (1996) 71-74.
cited by other .
Lewin, Gene, Chapter 13, pp. 202-216 (1983). cited by other .
Mamoru, W. et al., Protein Expression and Purification,
"Overproduction of D-Aminoaclase from Alcalugenes xylosoxydans
subsp. xylosoxydans A-6 in Escherichia coli and Its Purification",
7: 395-389 (1996). cited by other .
Map of pComb3H vector. printed in 2006. cited by other .
Map of pComb3X vector.printed in 2006. cited by other .
Print-out from Amazon.com website showing publication of Chapter 2
of "Phage Display: A Laboratory Manual", dated Jan. 15, 2001. cited
by other .
Rader, C. et al., Current Opinion in Biotechnology "Phage Display
of Combinatorial Antibody Libraries", 8: 503-508 (1997). cited by
other .
Table of features of pComb3X and pComb3H vectors relative of claim
1 of EP-B-1390489, 2006. cited by other .
Weiss, Ga et al., JMB, "Design and Evolution of Artificial M13 Coat
Proteins", 300: 213-219 (2000). cited by other .
Zimmerman et al., Protein Expression and Purification, "High-Level
Expression of Rat Farnesyl: Protein Transferase in Escherichia Coli
as a Translationally Coupled Heterodimer", 14: 395-402 (1998).
cited by other .
Gao et al., "Making artificial antibodies: A format for phage
display of combinatorial heterodimeric arrays", Proc. National
Acad. Sci. USA, vol. 96, pp. 6025-6030 (1999). cited by other .
Gao et al., "Making chemistry selectable by linking it to
infectivity", Proc. National Acad. Sci. USA, vol. 94, pp.
11777-11782 (1997). cited by other .
Minute of the Oral Proceedings in EP Patent No. B-1390489 (App. No.
02 734 056.1) dated Dec. 17, 2008. cited by other .
Krebber et al., Gene, 1996, vol. 178, pp. 71-74. cited by examiner
.
"Genes", Benjamin Lewis, Ed., John Wiley & Sons, Inc., Chapter
13, "Termination and Antitermination", pp. 202-216. cited by
examiner.
|
Primary Examiner: Dunston; Jennifer
Attorney, Agent or Firm: Ropes & Gray LLP
Parent Case Text
This application claims priority under 35 USC 119(e) to provisional
application 60/287,355, filed Apr. 27, 2001.
Claims
We claim:
.[.1. A phagemid vector comprising: a selectable marker; a ColE1
origin; an f1 origin; and after the ColE1 origin but before the f1
origin, further comprising the following features: a bacterial
transcription terminator; a promoter, a first ribosomal binding
site; a first leader sequence; a first cloning region; a second
ribosomal binding site; a second leader sequence; a second cloning
region for receiving a gene encoding a polypeptide to be displayed;
and a nucleotide sequence encoding a product that enables display
of a polypeptide on the surface of a phagemid particle..].
.[.2. A phagemid vector as in claim 1 wherein at least one of the
first or second ribosomal binding sites comprises Seq. ID No.
13..].
3. .[.A phagemid vector as in claim 1.]. .Iadd.A phagemid vector
comprising: a selectable marker; a ColE1 origin; an f1 origin; and
after the ColE1 origin but before the f1 origin, further comprising
the following features: a bacterial transcription terminator and a
promoter wherein the bacterial transcription terminator is upstream
of the promoter; a first ribosomal binding site; a first leader
sequence; a first cloning region; a second ribosomal binding site;
a second leader sequence; a second cloning region for receiving a
gene encoding a polypeptide to be displayed; and a nucleotide
sequence encoding a product that enables display of a polypeptide
on the surface of a phagemid particle, .Iaddend. wherein at least
one of the first or second leader sequences comprises a sequence
selected from the group consisting of Seq. ID No. 14 and Seq. ID
No. 17.
.[.4. A phagemid vector as in claim 1 wherein the nucleotide
sequence encoding a product encodes a protein selected from the
group consisting of pIII and pVIII..].
.[.5. A phagemid vector as in claim 1 wherein the nucleotide
sequence encoding a product encodes a truncated pIII..].
.[.6. A phagemid vector as in claim 1 wherein the nucleotide
sequence encoding a product encodes a synthetic pIII..].
.[.7. A phagemid vector as in claim 1 wherein the selectable marker
is selected from the group consisting of ampicillin resistance,
chloramphenicol transferase resistance, tetracycline resistance and
kanamycin resistance..].
8. A phagemid vector comprising Seq. ID No. 18.
9. A vector comprising a sequence selected from the group
consisting of Seq. ID Nos. 19, 20 and 21.
Description
BACKGROUND
1. Technical Field
This disclosure relates to cloning vectors. More specifically,
phagemid vectors useful in the cloning and expression of foreign
genetic information are disclosed.
2. Background of Related Art
Plasmids are extrachromosomal genetic elements capable of
autonomous replication within their hosts. Bacterial plasmids range
in size from 1 Kb to 200 Kb or more and encode a variety of useful
properties. Plasmid encoded traits include resistance to
antibiotics, production of antibiotics, degradation of complex
organic molecules, production of bacteriocins, such as colicins,
production of enterotoxins, and production of DNA restriction and
modification enzymes.
Although plasmids have been studied for a number of years in their
own right, particularly in terms of their replication,
transmissibility, structure and evolution, with the advent of
genetic engineering technology the focus of plasmid research has
turned to the use of plasmids as vectors for the cloning and
expression of foreign genetic information. In its application as a
vector, the plasmid should possess one or more of the following
properties. The plasmid DNA should be relatively small but capable
of having relatively large amounts of foreign DNA incorporated into
it. The size of the DNA insert is of concern in vectors based on
bacteriophages where packing the nucleic acid into the phage
particles can determine an upper limit. The plasmid should be under
relaxed replication control. That is, where the replication of the
plasmid molecule is not strictly coupled to the replication of the
host DNA (stringent control), thereby resulting in multiple copies
of plasmid DNA per host cell. The plasmid should express one or
more selectable markers, such as the drug resistance markers,
mentioned above, to permit the identification of host cells which
contain the plasmid and also to provide a positive selection
pressure for the maintenance of the plasmid in the host cell.
Finally the plasmid should contain a single restriction site for
one or more endonucleases in a region of plasmid which is not
essential for plasmid replication. A vector as described above is
useful, for example, for cloning genetic information, by which is
meant integrating a segment of foreign DNA into the vector and
reproducing identical copies of that information by virtue of the
replication of the plasmid DNA.
The next step in the evolution of vector technology was the
construction of so-called expression vectors. These vectors are
characterized by their ability not only to replicate the inserted
foreign genetic information but also to promote the transcription
of the genetic information into mRNA and its subsequent translation
into protein. This expression requires a variety of regulatory
genetic sequences including but not necessarily limited to
promoters, operators, transcription terminators, ribosomal binding
sites and protein synthesis initiation and termination codons.
These expression elements can be provided with the foreign DNA
segment as parts thereof or can be integrated within the vector in
a region adjacent to a restriction site so that when a foreign DNA
segment is introduced into the vector it falls under the control of
those elements to which it is now chemically joined.
Filamentous bacteriophage consist of a circular, single-stranded
DNA molecule surrounded by a cylinder of coat proteins. There are
about 2,700 molecules of the major coat proteins pVIII that
envelope the phage. At one end of the phage particle, there are
approximately five copies of each of gene III and VI proteins (pIII
and pVI) that are involved in host cell binding and in the
termination of the assembly process. The other end contains five
copies of each of pVII and pIX that are required for the initiation
of assembly and for maintenance of virion stability. In recent
years, vectors have been developed and utilized for the display of
foreign peptides and proteins on the surface of filamentous phage
or phagemid particles.
The display of peptides and proteins on the surface of phage or
phagemid particles represents a powerful methodology for selection
of rare members in a complex library and for carrying out molecular
evolution in the laboratory. The ability to construct libraries of
enormous molecular diversity and to select for molecules with
predetermined properties has made this technology applicable to a
wide range of problems. A few of the many applications of such
technology are: i) phage display of natural peptides including,
mapping epitopes of monoclonal and polyclonal antibodies and
generating immunogens; ii) phage display of random peptides,
including mapping epitopes of monoclonal and polyclonal antibodies,
identifying peptide ligands, and mapping substrate sites for
proteases and kinases; and iii) phage display of protein and
protein domains, including directed evolution of proteins,
isolation of antibodies and cDNA expression screening.
Vectors have been developed which incorporate DNA from plasmids and
bacteriophage. These phagemid vectors are derived by modifications
of a plasmid genome containing an origin of replication from a
bacteriophage, (e.g. f1, M13, fd) as well as the plasmid origin of
replication. Phagemids are useful for the expression of foreign
genetic information.
One known phagemid vector is .[.pBluescript.].
.Iadd.PBLUESCRIPT.TM. .Iaddend.II KS+ (pBS II KS+) (Stratagene, La
Jolla, Calif.), which is a useful starting point for the
construction of the present vector because of its small size and
the fact that it contains the colE1 plasmid origin of replication
and the phage f1 origin of replication in the desired orientation.
The plasmid also carries an ampicillin resistance gene.
Vectors which due to their structures provide enhanced
functionality would be desirable.
SUMMARY
Novel plasmid vectors capable of replication and expression of
foreign genetic information in bacteria, such as, for example,
cyanobacterium and E. coli are described herein. These new vectors
contain a specific sequence of features after the ColE1 origin but
before the f1 origin. Specifically, the present phagemid vector
contains, after the ColE1 origin but before the f1 origin, a
bacterial transcription terminator, a promoter, a first ribosomal
binding site, a first leader sequence and a first cloning region, a
second ribosomal binding site, a second leader sequence and a
second cloning region. The second cloning region is adapted to
receive a gene encoding a polypeptide to be displayed and a
nucleotide sequence encoding at least a functional domain of a
display protein.
The vectors described herein are constructed through a series of
steps which convert a starting vector through a series of
intermediate plasmids to the present novel vector which can be used
for display of antibody libraries.
BRIEF DESCRIPTION OF THE DRAWINGS
FIG. 1 schematically illustrates the structure of pBS II KS+, a
useful starting vector for making the novel vectors described
herein;
FIG. 2 is a flow chart illustrating the method of making the novel
vectors described herein;
FIG. 3 schematically illustrates the digestion of the starting
vector and insertion of the promoter;
FIGS. 4A-C show the sequence (Seq. ID No. 19) of intermediate
vector p110-81.6;
FIG. 5 schematically illustrates the insertion of the
terminator;
FIGS. 6A-C show the sequence (Seq. ID No. 20) of intermediate
vector p131-03.7;
FIG. 7 schematically illustrates the insertion of multiple cloning
sites;
FIGS. 8A-C show the sequence (Seq. ID No. 21) of intermediate
vector p131-39.1;
FIG. 9 schematically illustrates the insertion of the nucleotide
sequence encoding the display protein and the two transcriptional
control cassettes;
FIG. 10 is a map of plasmid pAX131; and
FIGS. 11A-D show the nucleic acid sequence (Seq. ID No. 18) of
plasmid pAX131, including the domains corresponding to particular
genes.
FIGS. 12A-G show the nucleic acid sequences of illustrative stuffer
sequences.
FIGS. 13A-C show the nucleic acid sequence of plasmid pAX131
Xba/Not.
DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS
The present novel phagemid vectors are useful for display of
polypeptides such as, for example, antibody libraries. The vectors
described herein can be prepared using any commercially available
vector containing a ColE1 and an f1 origin of replication as the
starting material. Such starting materials are known and are
commercially available. One suitable starting material is the
vector pBS II KS+ which is commercially available from Stratagene
Corp., La Jolla, Calif. (Sec FIG. 1).
FIG. 2 is a flow-chart showing one embodiment of the steps involved
in converting a starting vector into one of the present novel
vectors. Those skilled in the art will readily envision other
schemes for preparing the present vectors. Accordingly, the present
disclosure is not limited to the sequence of steps shown in FIG.
2.
In the first step, the starting vector is digested with restriction
enzymes to remove a substantial portion of the vector between the
ColE1 origin and the f1 origin of replication. Typically, the
portion to be removed from the starting vector includes multiple
cloning sites. Depending on the particular restriction sites
present in the starting vector, suitable methods for digesting the
starting vector are known to and readily selected by those skilled
in the art.
Next, a promoter is inserted downstream of the ColE1 origin of the
digested starting vector. Any promoter recognized by a host cell
can be employed. Suitable promoters include, but are not limited
to, ara, lac and trc promoters. The promoter drives expression of
other sequences inserted into the vector, such as, for example
expression of polypeptides. In particularly useful embodiments, a
promoter sequence generated from the starting vector is employed as
the promoter inserted downstream of the ColE1 origin as described
in more detail below.
In the next step, a bacterial transcription terminator is inserted
downstream of the ColE1 origin, and upstream of the promoter. Any
terminator recognized by a host cell can be employed. Suitable
terminators include, but are not limited to, the t.sub.HP
terminator, the bgIG terminator, and the crp terminator. It should
be noted that bioinformatics analysis has allowed the
identification of over 100 rho-independent transcription
terminators in the E. coli genome, all of which should be suitable
for this purpose (Ermolaeva, et al, J. Mol Biol 301:27-33
(2000)).
In the next step, multiple restriction sites are inserted
downstream of the promoter. The restriction site can be any known
restriction site. Suitable restriction sites for insertion include,
but are not limited to Nhe I, Hind III, Nco I, Xma I, Bgl II, Bst
I, Pvu I, etc. The number of restriction sites inserted is not
critical, provided a sufficient number of restriction sites are
inserted to allow completion of the balance of the steps needed to
create the present novel vectors. Thus as few as 2 to as many as 10
or more restriction sites can be inserted in this step. It should
be understood that if one or more of the restriction sites selected
for insertion is present in the starting vector, it may be
desirable to remove or disable the native restriction site to avoid
unwanted digestion during further processing. The restriction site
can be inserted using any technique known to those skilled in the
art. A particularly preferred combination of restriction sites
inserted in this step is Not I, Sfi I, Spe I, Xho I, Xba I and EcoR
I.
The next step involves inserting a nucleotide sequence encoding a
product that enables display of a polypeptide on the surface of a
phagemid particle. The product encoded can thus be considered at
least a functional domain of a display protein. The display protein
can be any natural or synthetic polypeptide to which a polypeptide
to be displayed can be fused and which can present the polypeptide
to be displayed for screening processes. Suitable display
polypeptides include proteins that can be incorporated into the
coat of a phage particle. As those skilled in the art will
appreciate, filamentous bacteriophage consist of a circular,
single-stranded DNA molecule a surrounded by a cylinder of coat
proteins. There are about 2,700 molecules of the major coat protein
pVIII that encapsidate the phage. At one end of the phage particle,
there are approximately five copies each of gene III and VI
proteins (pIII and pVI) that are involved in host-cell binding and
in the termination of the assembly process. The other end contains
five copies each of pVII and pIX that are required for the
initiation of assembly and for maintenance of virion stability. A
nucleotide sequence encoding any of these coat proteins can be
employed in making the novel vectors herein. Particularly preferred
are nucleotide sequences encoding at least a functional domain of
pIII. The nucleotide sequence encoding at least a functional domain
of pill can be natural or synthetic. The nucleotide sequence
inserted can encode a truncated pIII provided the display function
of the protein is maintained. An example of a synthetic or
artificial coat protein useful herein is that disclosed in Weiss et
al., Mol Biol., 300(1), 213-219 (2000), the disclosure of which is
incorporated herein by reference.
In the next step, two transcriptional control cassettes are
inserted, an upstream transcriptional control cassette and a
downstream transcriptional control cassette. Each of the
transcriptional control cassettes include a ribosomal binding site,
a leader sequence and a cloning site for receiving a gene encoding
a polypeptide to be expressed. Any known ribosomal binding site
(RBS) and leader sequence recognized by the host cell can be
employed. Preferably, the RBS and leader sequence employed is
optimized for expression in E. coli. The cloning site is a region
of the nucleic acid between two restriction sites, typically with a
nonessential region of nucleotide sequence (commonly referred to as
a "stuffer" sequence) positioned therebetween. Alternatively, the
stuffer sequence may contain a non-essential region and a portion
of an antibody constant domain. Suitable stuffer sequences include,
for example, those shown in FIGS. 12A-G.
The downstream transcriptional control cassette is inserted
adjacent to the nucleotide sequence encoding at least the
functional domain of the display protein. In this manner, a fusion
protein will be expressed when a gene encoding a polypeptide to be
displayed is inserted at the cloning site of the downstream
transcriptional control cassette. As those skilled in the art will
appreciate, a suppressible stop codon could be positioned between
the gene encoding the polypeptide to be displayed and the
nucleotide sequence encoding at least a functional domain of the
display protein such that fusion display is obtained in a
suppressing host (as long as the gene is inserted in-frame) and a
secreted protein without the display protein is obtained in a
non-suppressing host.
The upstream transcriptional control cassette is inserted upstream
of the downstream transcriptional control cassette. The upstream
transcriptional control cassette provides a second cloning region
for receiving a second gene encoding a polypeptide that can
dimerize with the polypeptide to be displayed. For example, where
the vector expresses a heavy chain Fd fused to a display protein,
the second gene preferably encodes an antibody light chain. As with
the cloning site of the downstream transcriptional control
cassette, the cloning site of the upstream transcriptional control
cassette is a region of the vector between two restriction sites,
typically with a stuffer positioned therebetween. It should of
course be understood that where a polypeptide other than an
antibody is to be displayed (such as, for example, where monomeric
display of a single polypeptide or protein is intended) a second
gene need not be cloned into the vector at the cloning site of the
upstream transcriptional control cassette. In such cases the second
cloning site can simply remain unused. As those skilled in the art
will also appreciate, where a single chain antibody is encoded by
the gene inserted at the cloning site of the downstream
transcriptional control cassette, there is no need to insert a
second gene into the vector at the cloning site of the upstream
transcriptional control cassette.
Thus, the phagmid vector produced by the process illustrated in
FIG. 2 will contain, after the ColE1 origin but before the f1
origin, a terminator, a promoter, a first ribosomal binding site, a
first leader sequence and a first cloning region, a second
ribosomal binding site, a second leader sequence and, a second
cloning region for receiving a gene encoding a polypeptide to be
displayed and a nucleotide sequence encoding at least a functional
domain of a display protein.
The present vectors also include a selectable marker. Either an
ampicillin resistant or a CAT resistant vector can be produced in
accordance with the present disclosure. The ampicillin or CAT
resistance can be provided by simply choosing a starting vector
having the desired resistance. Alternatively, if the starting
vector is ampicillin resistant to produce a CAT resistant vector,
the ampicillin resistant gene is removed and replaced with the
chloramphenicol transferase gene. Techniques for providing either
ampicillin or CAT resistance in the present vectors will be readily
apparent to those skilled in the art. Other suitable selectable
markers include, but are not limited to, tetracycline or kanamycin
resistance.
The vectors described herein can be transformed into a host cell
using known techniques (e.g., electroporation) and amplified. The
vectors described herein can also be digested and have a first gene
and optionally a second gene ligated therein in accordance with
this disclosure. The vector so engineered can be transformed into a
host cell using known techniques and amplified or to effect
expression of polypeptides and/or proteins encoded thereby to
produce phage particles displaying single polypeptides or dimeric
species. Those skilled in the art will readily envision other uses
for the novel vectors described herein.
The following examples illustrate the present invention without
limiting its scope. The steps involved in constructing the vectors
described herein are discussed in detail in the Examples. Those
skilled in the art possess knowledge of suitable techniques to
accomplish the steps described below without the need for undue
experimentation, such techniques being well known to those skilled
in the art.
EXAMPLE 1
This example illustrates methods and compositions for the
construction of one embodiment of a phagemid vector according to
the present disclosure. The starting phagemid selected for
construction was pBS II KS+ which contains an ampicillin resistant
gene which results in a final vector, pAX131, which is ampicillin
resistant.
Digestion of Starting Vector and Insertion of Promoter
The commercially available vector pBS II KS+ (Stratagene, LaJolla,
Calif.) was digested with Pvu I and Sap I to generate a 2424 bp pBS
II KS+ fragment which lacks the bases at positions 500 to 1037
corresponding to the multiple cloning region. The resulting
fragment contains the Ampicillin resistant gene (AmpR), phage f1
origin, and the Col E1origin. (See FIG. 3.) Next, two mutagenic
primers were used with the pBS II KS+ fragment in a PCR reaction
followed by digestion with EcoR I and Sap I to generate a 209 bp
fragment containing the lac promoter. The primers used were as
follows: 5' AAC CGT ATT ACC GCC TTT GAG TG 3' (SEQ. ID. NO. 1): and
5' CCT GAA TTC AAT TGT TAT CCG CTC ACA ATT CCA C 3' (SEQ. ID. NO.
2).
The 2424 bp fragment and the 209 bp fragment were combined in a
three-way ligation reaction with two overlapping oligonucleotides
which contain a Not I, EcoR I and Pvu I sites to form a first
intermediate plasmid (designated p110-81.6). (See FIG. 3.) The
oligonucleotides used for this reaction were: 5' CGG TAA TGC GGC
CGC TAC ATG 3' (SEQ. ID. NO. 3); and 5' AAT TCA TGT AGC GGC CGC ATT
ACC GAT 3' (SEQ. ID. NO. 4).
The resulting plasmid p110-81.6 was digested and sequenced in the
altered region to identify a clone with the correct incorporation
of the lac promoter, Pvu I, Sap I, EcoR, and Not I sites. The
sequencing of p110-81.6 revealed a nucleic acid change at position
875 within the lac promoter. The published sequence of pBS II KS+
had an adenine at position 875. However, sequencing of p110-81.6
and the original pBS II KS+ revealed a guanine at position 875. The
sequence (Seq. ID No. 19) of intermediate plasmid p110-81.6 is
shown in FIGS. 4A-C.
Insertion of Terminator
A transcription termination sequence was inserted into the first
intermediate plasmid (p110-81.6) upstream of the lac promoter at
the Sap I site. (See FIG. 5.)
Plasmid 110-81.6 was digested with Sap I to create an insertion
point for the oligonucleotides which contained a t.sub.HP
terminator (Nohno et al., Molecular and General Genetics, Vol. 205,
pages 260-269 (1986). The oligonucleotides used in this ligation
were: 5' AGC GTA CCC GAT AAA AGC GGC TTC CTG ACA GGA GGC CGT TTT
GTT TTG CAG CCC ACC T 3'; (SEQ. ID. No. 5); and 5' GCT AGG TGG GCT
GCA AAA CAA AAC GGC CTC CTG TCA GGA AGC CGC TTT TAT CGG GTA C 3'
(SEQ. ID. NO. 6).
The resulting intermediate vector (designated p131-03.7) was
digested and sequenced in the altered region to determine its
identity. The sequence (Seq. ID No. 20) of intermediate vector
p131-03.7 is shown in FIGS. 6A-C.
Insertion of Multiple Restriction Sites
Oligonucleotides containing the Xba I, XhoI, SpeI and Sfi sites
were then inserted into intermediate plasmid p131-03.7.(See FIG.
7.)
Intermediate vector p131-03.7 was digested with EcoR I and Not I
and then gel purified. Then overlapping oligonucleotides containing
the Xba I, Xho I, Spe I and Sfi I sites were ligated into the
p131-03.7 backbone. The oligonucleotides inserted were: 5' AAT TCA
CAT CTA GAT ATC TCG AGT CAA TAC TAG TGG CCA GGC CGG CCA GC 3' (SEQ.
ID. NO. 7); and 5' GGC CGC TGG CCG GCC TGG CCA CTA GTA TTG ACT CGA
GAT ATC TAG ATG TG 3' (SEQ. ID. NO. 8).
The resulting intermediate plasmid (designated p131-39.1) was
sequenced and analyzed to determine its identity. The sequence
(Seq. ID No. 21) of intermediate plasmid p131-39.1 is shown in
FIGS. 8A-C.
Construction of Nucleotide Sequence Encoding Display Protein
Single stranded DNA from phage f1 (ATCC #15766-B2) was used as a
template for the cloning of gene III. (Sec FIG. 9.)
The primers used were: 5' AGT GGC CAG GCC GGC CTT GAA ACT GTT GAA
AGT TGT TTA GCA AA 3' (SEQ. ID. NO. 9) which contains the Sfi I
site, bases to maintain the coding frame and a portion of gene III;
and 5 TCT GCG GCC GCT TAG CTA GCT TAA GAC TCT TTA TTA CGC AGT ATG
TTA GCA 3' (SEQ. ID. NO. 10); which contains the end of gene III in
which an internal ribosome binding site ordinarily used for the
next downstream gene has been removed by changing a silent third
base position in the corresponding codon. This oligonucleotide also
contains a stop codon, Nhe I site for potential use in removal of
the fusion, a second stop codon for use with the fusion, and the
Not I site for cloning. The PCR fragment was digested with Sfi I
and Not I and inserted into p131-39.1 digested with Sfi I and Not I
to create intermediate vector p131-44.2. The integrity of the gene
III region and flanking sequences was confirmed by sequence
analysis. Creation of the Upstream Transcriptional Control
Cassette
Plasmid 131-39.1 was utilized as a shuttle vector for cloning the
oligonucleotides containing the ompA signal peptide coding
sequence. The upstream transcriptional control cassette was
generated within intermediate plasmid 131-39.1 by inserting a pair
of oligonucleotides which contain EcoR I, the ompA signal peptide
leader, followed by a Sac I site, a small stuffer region, and a
ribosome binding site. (See FIG. 9.) The oligonucleotides used
were:
Eco Xba: 5' AAT TCA AGG AGT TAA TTA TGA AAA AAA CCG CGA TTG CGA TTG
CGG TGG CGC TGG CGG GCT TTG CGA CCG TGG CCC AGG CGG CCG AGC TCA TCT
T 3' (SEQ. ID. NO. 11); and Xba Eco: 5' CTA GAA GAT GAG CTC GGC CGC
CTG GGC CAC GGT CGC AAA GCC CGC CAG CGC CAC CGC AAT CGC AAT CGC GGT
TTT TTT CAT AAT TAA CTC CTT G 3' (SEQ. ID. NO. 12). The RBS and
leader sequences included in the upstream transcriptional control
cassette are optimized for use in E. coli. These novel sequences
are: 5' AAG GAG 3' (Seq. ID No.13) for the RBS; and 5' ATG AAA AAA
ACC GCG ATT GCG ATT GCG GTG GCG CTG GCG GGC TTT GCG ACC GTG GCC CAG
GCG GCC 3' (Seq. ID No. 14) for the ompA leader. The resulting
plasmid was sequenced to confirm the identity of the insert and
digested at the EcoRI and Xbal sites to generate a 94 bp fragment
which is the upstream transcriptional control cassette. Creation of
the Downstream Transcriptional Control Cassette
Intermediate plasmid 131-39.1 was utilized as a shuttle vector for
cloning the oligonucleotides containing the pelB signal peptide
coding sequence. The downstream transcriptional control cassette
was generated within intermediate plasmid 131-39.1 by inserting a
pair of oligonucleotides containing the pelB signal peptide, Xba I,
site, and a ribosome binding site. The oligonucleotides used
were:
XbaXho: 5' CTA GAT ATA ATT AAG GAG ATA AAT ATG AAA TAT CTG CTG CCG
ACC GCG GCG GCG GGC CTG CTG CTG CTG GCG GCG CAG CCG GCG ATG GCGC 3'
(SEQ. ID. NO. 15); and XhoXba: 5' TCG AGC GCC ATC GCC GGC TGC GCC
GCC AGC AGC AGC AGG CCC GCC GCC GCG GTC GGC AGC AGA TAT TTC ATA TTT
ATC TCC TTA ATT ATA T 3' (SEQ. ID. NO. 16).
The novel pelB leader sequence was optimized for use in E. coli and
had the sequence 5' TAT GAA ATA TCT GCT GCC GAC CGC GGC GGC GGG CCT
GCT GCT GCT GGC GGC GCA GCC GGC GAT GGC G 3' (Seq. ID No. 17). The
resulting plasmid was sequenced to confirm the identity of the
insert and digested at the Xbal and XhoI sites to generate a 91 bp
fragment which is the downstream transcriptional control cassette.
Construction of pAx131 Vector
The upstream transcriptional control cassette and the downstream
transcriptional control cassette were combined with intermediate
plasmid p131-44.2 digested with EcoRI and XhoI in a 3-way ligation
reaction to produce pAX131 (See FIG. 9). FIG. 10 is a map of the
resulting pAX131 vector. The pAX131 was analyzed to determine its
nucleic acid sequence (SEQ. ID. NO. 18) which is shown in FIGS.
11A-D.
EXAMPLE 2
Insertion of an alternate upstream transcriptional control
cassette
PAX131 vector was digested with Not I restriction enzyme. The
resulting DNA overhangs were then filled in with Klenow fragment
Polymerase to blunt end the DNA followed by ligation. This was
performed to remove the existing Not I site. The Not I deleted
PAX131 vector was digested with EcoR I/Xba I, and ligated with a
duplexed oligo containing EcoR I and Spe I overhangs (Xba I, and
Spe I have compatible ends).
Eco/Spe oligo: 5' AAT TCA AGG AGT TAA TTA TGA AAA AAA CCG CGA TTG
CGA TTG CGG TGG CGC TGG CGG GCT TTG CGA CCG TGG CCC AGG CGG CCT CTA
GAA TCT GCG GCC GCA 3' (SEQ. ID NO. 22) Spe/Eco oligo: 5' CTA GTG
CGG CCG CAG ATT CTA GAG GCC GCC TGG GCC ACG GTC GCA AAG CCC GCC AGC
GCC ACC GCA ATC GCA ATC GCG GTT TTT TTC ATA ATT AAC TCC TTG 3'
(SEQ. ID NO. 23)
The resulting vector (pAX131 Xba/Not) had Xba I, and Not I sites
for cloning of a gene, such as light chains, rather than Sac I and
Xba I. FIGS. 13A-C show the nucleic acid sequence for vector
(pAX131 Xba/Not.
It is contemplated that the present novel vectors can be used in
connection with the production and screening of libraries made in
accordance with conventional phage display technologies. Both
natural and synthetic antibody repertoires have been generated as
phage displayed libraries. Natural antibodies can be cloned from
B-cell mRNA isolated from peripheral blood lymphocytes, bone
marrow, spleen, or other lymphatic tissue of a human or non-human
donor. Donors with an immune response to the antigen(s) of interest
can be used to create immune antibody libraries. Alternatively,
non-immune libraries may be generated from donors by isolating
naive antibody B cell genes. PCR using antibody specific primers on
the 18.sup.st strand cDNA allows the isolation of light chain and
heavy chain antibody fragments which can then be cloned into the
display vector.
Synthetic antibodies or antibody libraries can be made up in part
or entirely with regions of synthetically derived sequence. Library
diversity can be engineered within variable regions, particularly
within CDRs, through the use of degenerate oligonucleotides. For
example, a single Fab gene may be modified at the heavy chain CDR3
position to contain random nucleotide sequences. The random
sequence can be introduced into the heavy chain gene using an
oligonucleotide which contains the degenerate coding region in an
overlap PCR approach. Alternatively, degenerate oligo cassettes can
be cloned into restriction sites that flank the CDR(s) to create
diversity. The resulting library generated by this or other
approaches can then be cloned into a display vector in accordance
with this disclosure.
Upon introduction of the display library into bacteria, phage
particles will be generated that have antibody displayed on the
surface. The resulting collection of phage-displayed antibodies can
be selected for those with the ability to bind to the antigen of
interest using techniques known to those skilled in the art.
Antibodies identified by this system can be used therapeutically,
as diagnostic reagents, or as research tools.
It is contemplated that single and double stranded versions of the
vectors described herein are within the scope of the present
invention. It is well within the purview of those skilled in the
art to prepare either single or double stranded vectors having the
features described herein.
It will be understood that various modifications may be made to the
embodiments described herein. For example, as those skilled in the
art will appreciate, a first gene encoding a fusion protein having
an antibody light chain to be fused to and displayed by pVIII and a
second gene encoding a heavy chain Fd can be inserted into the
vector at the newly created restriction site to provide effective
antibody display. Therefore, the above description should not be
construed as limiting, but merely as exemplifications of preferred
embodiments. Those skilled in the art will envision other
modifications within the scope and spirit of the claims appended
hereto.
SEQUENCE LISTINGS
1
31123DNAartificial sequenceprimer 1 aaccgtatta ccgcctttga gtg 23
234DNAartificial sequenceprimer 2 cctgaattca attgttatcc gctcacaatt
ccac 34 321DNAartificial sequenceoligonucleotide 3 cggtaatgcg
gccgctacat g 21 427DNAartificial sequenceoligonucleotide 4
aattcatgta gcggccgcat taccgat 27 558DNAartificial
sequenceoligonucleotide 5 agcgtacccg ataaaagcgg cttcctgaca
ggaggccgtt ttgttttgca g cccacct 58 658DNAartificial
sequenceoligonucleotide 6 gctaggtggg ctgcaaaaca aaacggcctc
ctgtcaggaa gccgctttta t cgggtac 58 750DNAartificial
sequenceoligonucleotide 7 aattcacatc tagatatctc gagtcaatac
tagtggccag gccggccagc 50 850DNAartificial sequenceoligonucleotide 8
ggccgctggc cggcctggcc actagtattg actcgagata tctagatgtg 50
944DNAartificial sequenceprimer 9 agtggccagg ccggccttga aactgttgaa
agttgtttag caaa 44 1051DNAartificial sequenceprimer 10 tctgcggccg
cttagctagc ttaagactct ttattacgca gtatgttagc a 51 1194DNAartificial
sequenceoligonucleotide 11 aattcaagga gttaattatg aaaaaaaccg
cgattgcgat tgcggtggcg c tggcgggct 60 ttgcgaccgt ggcccaggcg
gccgagctca tctt 94 1294DNAartificial sequenceoligonucleotide 12
ctagaagatg agctcggccg cctgggccac ggtcgcaaag cccgccagcg c caccgcaat
60 cgcaatcgcg gtttttttca taattaactc cttg 94 136DNAartificial
sequenceRBS 13 aaggag 6 1466DNAartificial sequenceompA leader 14
atgaaaaaaa ccgcgattgc gattgcggtg gcgctggcgg gctttgcgac c gtggcccag
60 gcggcc 66 1591DNAartificial sequenceoligonucleotide 15
ctagatataa ttaaggagat aaatatgaaa tatctgctgc cgaccgcggc g gcgggcctg
60 ctgctgctgg cggcgcagcc ggcgatggcg c 91 1691DNAartificial
sequenceoligonucleotide 16 tcgagcgcca tcgccggctg cgccgccagc
agcagcaggc ccgccgccgc g gtcggcagc 60 agatatttca tatttatctc
cttaattata t 91 1767DNAartificial sequencepelB leader 17 tatgaaatat
ctgctgccga ccgcggcggc gggcctgctg ctgctggcgg c gcagccggc 60 gatggcg
67 184154DNAartificial sequenceplasmid vector 18 gtggcacttt
tcggggaaat gtgcgcggaa cccctatttg tttatttttc t aaatacatt 60
caaatatgta tccgctcatg agacaataac cctgataaat gcttcaataa t attgaaaaa
120 ggaagagtat gagtattcaa catttccgtg tcgcccttat tccctttttt g
cggcatttt 180 gccttcctgt ttttgctcac ccagaaacgc tggtgaaagt
aaaagatgct g aagatcagt 240 tgggtgcacg agtgggttac atcgaactgg
atctcaacag cggtaagatc c ttgagagtt 300 ttcgccccga agaacgtttt
ccaatgatga gcacttttaa agttctgcta t gtggcgcgg 360 tattatcccg
tattgacgcc gggcaagagc aactcggtcg ccgcatacac t attctcaga 420
atgacttggt tgagtactca ccagtcacag aaaagcatct tacggatggc a tgacagtaa
480 gagaattatg cagtgctgcc ataaccatga gtgataacac tgcggccaac t
tacttctga 540 caacgatcgg aggaccgaag gagctaaccg cttttttgca
caacatgggg g atcatgtaa 600 ctcgccttga tcgttgggaa ccggagctga
atgaagccat accaaacgac g agcgtgaca 660 ccacgatgcc tgtagcaatg
gcaacaacgt tgcgcaaact attaactggc g aactactta 720 ctctagcttc
ccggcaacaa ttaatagact ggatggaggc ggataaagtt g caggaccac 780
ttctgcgctc ggcccttccg gctggctggt ttattgctga taaatctgga g ccggtgagc
840 gtgggtctcg cggtatcatt gcagcactgg ggccagatgg taagccctcc c
gtatcgtag 900 ttatctacac gacggggagt caggcaacta tggatgaacg
aaatagacag a tcgctgaga 960 taggtgcctc actgattaag cattggtaac
tgtcagacca agtttactca t atatacttt 1020 agattgattt aaaacttcat
ttttaattta aaaggatcta ggtgaagatc c tttttgata 1080 atctcatgac
caaaatccct taacgtgagt tttcgttcca ctgagcgtca g accccgtag 1140
aaaagatcaa aggatcttct tgagatcctt tttttctgcg cgtaatctgc t gcttgcaaa
1200 caaaaaaacc accgctacca gcggtggttt gtttgccgga tcaagagcta c
caactcttt 1260 ttccgaaggt aactggcttc agcagagcgc agataccaaa
tactgtcctt c tagtgtagc 1320 cgtagttagg ccaccacttc aagaactctg
tagcaccgcc tacatacctc g ctctgctaa 1380 tcctgttacc agtggctgct
gccagtggcg ataagtcgtg tcttaccggg t tggactcaa 1440 gacgatagtt
accggataag gcgcagcggt cgggctgaac ggggggttcg t gcacacagc 1500
ccagcttgga gcgaacgacc tacaccgaac tgagatacct acagcgtgag c tatgagaaa
1560 gcgccacgct tcccgaaggg agaaaggcgg acaggtatcc ggtaagcggc a
gggtcggaa 1620 caggagagcg cacgagggag cttccagggg gaaacgcctg
gtatctttat a gtcctgtcg 1680 ggtttcgcca cctctgactt gagcgtcgat
ttttgtgatg ctcgtcaggg g ggcggagcc 1740 tatggaaaaa cgccagcaac
gcggcctttt tacggttcct ggccttttgc t ggccttttg 1800 ctcacatgtt
ctttcctgcg ttatcccctg attctgtgga taaccgtatt a ccgcctttg 1860
agtgagctga taccgctcgc cgcagccgaa cgaccgagcg cagcgagtca g tgagcgagg
1920 aagcgtaccc gataaaagcg gcttcctgac aggaggccgt tttgttttgc a
gcccaccta 1980 gcggaagagc gcccaatacg caaaccgcct ctccccgcgc
gttggccgat t cattaatgc 2040 agctggcacg acaggtttcc cgactggaaa
gcgggcagtg agcgcaacgc a attaatgtg 2100 agttagctca ctcattaggc
accccaggct ttacacttta tgctcccggc t cgtatgttg 2160 tgtggaattg
tgagcggata acaattgaat tcaaggagtt aattatgaaa a aaaccgcga 2220
ttgcgattgc ggtggcgctg gcgggctttg cgaccgtggc ccaggcggcc g agctcatct
2280 tctagatata attaaggaga taaatatgaa atatctgctg ccgaccgcgg c
ggcgggcct 2340 gctgctgctg gcggcgcagc cggcgatggc gctcgagtca
atactagtgg c caggccggc 2400 cttgaaactg ttgaaagttg tttagcaaaa
ccccatacag aaaattcatt t actaacgtc 2460 tggaaagacg acaaaacttt
agatcgttac gctaactatg agggctgtct g tggaatgct 2520 acaggcgttg
tagtttgtac tggtgacgaa actcagtgtt acggtacatg g gttcctatt 2580
gggcttgcta tccctgaaaa tgagggtggt ggctctgagg gtggcggttc t gagggtggc
2640 ggctctgagg gtggcggtac taaacctcct gagtacggtg atacacctat t
ccgggctat 2700 acttatatca accctctcga cggcacttat ccgcctggta
ctgagcaaaa c cccgctaat 2760 cctaatcctt ctcttgagga gtctcagcct
cttaatactt tcatgtttca g aataatagg 2820 ttccgaaata ggcagggggc
attaactgtt tatacgggca ctgttactca a ggcactgac 2880 cccgttaaaa
cttattacca gtacactcct gtatcatcaa aagccatgta t gacgcttac 2940
tggaacggta aattcagaga ctgcgctttc cattctggct ttaatgagga t ccattcgtt
3000 tgtgaatatc aaggccaatc gtctgacctg cctcaacctc ctgttaatgc t
ggcggcggc 3060 tctggtggtg gttctggtgg cggctctgag ggtggtggct
ctgagggtgg c ggttctgag 3120 ggtggcggct ctgagggtgg cggttccggt
ggtggctctg gttccggtga t tttgattat 3180 gaaaagatgg caaacgctaa
taagggggct atgaccgaaa atgccgatga a aacgatgaa 3240 aacgcgctac
agtctgacgc taaaggcaaa cttgattctg tcgctactga t tacggtgct 3300
gctatcgacg gtttcattgg tgacgtttcc ggccttgcta atggtaatgg t gctactggt
3360 gattttgctg gctctaattc ccaaatggct caagtcggtg acggtgataa t
tcaccttta 3420 atgaataatt tccgtcaata tttaccttcc ctccctcaat
cggttgaatg t cgccctttt 3480 gtctttggcg ctggtaaacc atatgaattt
tctattgatt gtgacaaaat a aacttattc 3540 cgtggtgtct ttgcgtttct
tttatatgtt gccaccttta tgtatgtatt t tcgacgttt 3600 gctaacatac
tgcgtaataa agagtcttaa gctagctaag cggccgcatt a ccgatcgcc 3660
cttcccaaca gttgcgcagc ctgaatggcg aatgggacgc gccctgtagc g gcgcattaa
3720 gcgcggcggg tgtggtggtt acgcgcagcg tgaccgctac acttgccagc g
ccctagcgc 3780 ccgctccttt cgctttcttc ccttcctttc tcgccacgtt
cgccggcttt c cccgtcaag 3840 ctctaaatcg ggggctccct ttagggttcc
gatttagtgc tttacggcac c tcgacccca 3900 aaaaacttga ttagggtgat
ggttcacgta gtgggccatc gccctgatag a cggtttttc 3960 gccctttgac
gttggagtcc acgttcttta atagtggact cttgttccaa a ctggaacaa 4020
cactcaaccc tatctcggtc tattcttttg atttataagg gattttgccg a tttcggcct
4080 attggttaaa aaatgagctg atttaacaaa aatttaacgc gaattttaac a
aaatattaa 4140 cgcttacaat ttag 4154 192654DNAartificial
sequenceplasmid vector 19 ctaaattgta agcgttaata ttttgttaaa
attcgcgtta aatttttgtt a aatcagctc 60 attttttaac caataggccg
aaatcggcaa aatcccttat aaatcaaaag a atagaccga 120 gatagggttg
agtgttgttc cagtttggaa caagagtcca ctattaaaga a cgtggactc 180
caacgtcaaa gggcgaaaaa ccgtctatca gggcgatggc ccactacgtg a accatcacc
240 ctaatcaagt tttttggggt cgaggtgccg taaagcacta aatcggaacc c
taaagggag 300 cccccgattt agagcttgac ggggaaagcc ggcgaacgtg
gcgagaaagg a agggaagaa 360 agcgaaagga gcgggcgcta gggcgctggc
aagtgtagcg gtcacgctgc g cgtaaccac 420 cacacccgcc gcgcttaatg
cgccgctaca gggcgcgtcc cattcgccat t caggctgcg 480 caactgttgg
gaagggcgat cggtaatgcg gccgctacat gaattcaatt g ttatccgct 540
cacaattcca cacaacatac gagccggaag cataaagtgt aaagcctggg g tgcctaatg
600 agtgagctaa ctcacattaa ttgcgttgcg ctcactgccc gctttccagt c
gggaaacct 660 gtcgtgccag ctgcattaat gaatcggcca acgcgcgggg
agaggcggtt t gcgtattgg 720 gcgctcttcc gcttcctcgc tcactgactc
gctgcgctcg gtcgttcggc t gcggcgagc 780 ggtatcagct cactcaaagg
cggtaatacg gttatccaca gaatcagggg a taacgcagg 840 aaagaacatg
tgagcaaaag gccagcaaaa ggccaggaac cgtaaaaagg c cgcgttgct 900
ggcgtttttc cataggctcc gcccccctga cgagcatcac aaaaatcgac g ctcaagtca
960 gaggtggcga aacccgacag gactataaag ataccaggcg tttccccctg g
aagctccct 1020 cgtgcgctct cctgttccga ccctgccgct taccggatac
ctgtccgcct t tctcccttc 1080 gggaagcgtg gcgctttctc atagctcacg
ctgtaggtat ctcagttcgg t gtaggtcgt 1140 tcgctccaag ctgggctgtg
tgcacgaacc ccccgttcag cccgaccgct g cgccttatc 1200 cggtaactat
cgtcttgagt ccaacccggt aagacacgac ttatcgccac t ggcagcagc 1260
cactggtaac aggattagca gagcgaggta tgtaggcggt gctacagagt t cttgaagtg
1320 gtggcctaac tacggctaca ctagaaggac agtatttggt atctgcgctc t
gctgaagcc 1380 agttaccttc ggaaaaagag ttggtagctc ttgatccggc
aaacaaacca c cgctggtag 1440 cggtggtttt tttgtttgca agcagcagat
tacgcgcaga aaaaaaggat c tcaagaaga 1500 tcctttgatc ttttctacgg
ggtctgacgc tcagtggaac gaaaactcac g ttaagggat 1560 tttggtcatg
agattatcaa aaaggatctt cacctagatc cttttaaatt a aaaatgaag 1620
ttttaaatca atctaaagta tatatgagta aacttggtct gacagttacc a atgcttaat
1680 cagtgaggca cctatctcag cgatctgtct atttcgttca tccatagttg c
ctgactccc 1740 cgtcgtgtag ataactacga tacgggaggg cttaccatct
ggccccagtg c tgcaatgat 1800 accgcgagac ccacgctcac cggctccaga
tttatcagca ataaaccagc c agccggaag 1860 ggccgagcgc agaagtggtc
ctgcaacttt atccgcctcc atccagtcta t taattgttg 1920 ccgggaagct
agagtaagta gttcgccagt taatagtttg cgcaacgttg t tgccattgc 1980
tacaggcatc gtggtgtcac gctcgtcgtt tggtatggct tcattcagct c cggttccca
2040 acgatcaagg cgagttacat gatcccccat gttgtgcaaa aaagcggtta g
ctccttcgg 2100 tcctccgatc gttgtcagaa gtaagttggc cgcagtgtta
tcactcatgg t tatggcagc 2160 actgcataat tctcttactg tcatgccatc
cgtaagatgc ttttctgtga c tggtgagta 2220 ctcaaccaag tcattctgag
aatagtgtat gcggcgaccg agttgctctt g cccggcgtc 2280 aatacgggat
aataccgcgc cacatagcag aactttaaaa gtgctcatca t tggaaaacg 2340
ttcttcgggg cgaaaactct caaggatctt accgctgttg agatccagtt c gatgtaacc
2400 cactcgtgca cccaactgat cttcagcatc ttttactttc accagcgttt c
tgggtgagc 2460 aaaaacagga aggcaaaatg ccgcaaaaaa gggaataagg
gcgacacgga a atgttgaat 2520 actcatactc ttcctttttc aatattattg
aagcatttat cagggttatt g tctcatgag 2580 cggatacata tttgaatgta
tttagaaaaa taaacaaata ggggttccgc g cacatttcc 2640 ccgaaaagtg ccac
2654 202712DNAartificial sequenceplasmid vector 20 ctaaattgta
agcgttaata ttttgttaaa attcgcgtta aatttttgtt a aatcagctc 60
attttttaac caataggccg aaatcggcaa aatcccttat aaatcaaaag a atagaccga
120 gatagggttg agtgttgttc cagtttggaa caagagtcca ctattaaaga a
cgtggactc 180 caacgtcaaa gggcgaaaaa ccgtctatca gggcgatggc
ccactacgtg a accatcacc 240 ctaatcaagt tttttggggt cgaggtgccg
taaagcacta aatcggaacc c taaagggag 300 cccccgattt agagcttgac
ggggaaagcc ggcgaacgtg gcgagaaagg a agggaagaa 360 agcgaaagga
gcgggcgcta gggcgctggc aagtgtagcg gtcacgctgc g cgtaaccac 420
cacacccgcc gcgcttaatg cgccgctaca gggcgcgtcc cattcgccat t caggctgcg
480 caactgttgg gaagggcgat cggtaatgcg gccgctacat gaattcaatt g
ttatccgct 540 cacaattcca cacaacatac gagccgggag cataaagtgt
aaagcctggg g tgcctaatg 600 agtgagctaa ctcacattaa ttgcgttgcg
ctcactgccc gctttccagt c gggaaacct 660 gtcgtgccag ctgcattaat
gaatcggcca acgcgcgggg agaggcggtt t gcgtattgg 720 gcgctcttcc
gctaggtggg ctgcaaaaca aaacggcctc ctgtcaggaa g ccgctttta 780
tcgggtacgc ttcctcgctc actgactcgc tgcgctcggt cgttcggctg c ggcgagcgg
840 tatcagctca ctcaaaggcg gtaatacggt tatccacaga atcaggggat a
acgcaggaa 900 agaacatgtg agcaaaaggc cagcaaaagg ccaggaaccg
taaaaaggcc g cgttgctgg 960 cgtttttcca taggctccgc ccccctgacg
agcatcacaa aaatcgacgc t caagtcaga 1020 ggtggcgaaa cccgacagga
ctataaagat accaggcgtt tccccctgga a gctccctcg 1080 tgcgctctcc
tgttccgacc ctgccgctta ccggatacct gtccgccttt c tcccttcgg 1140
gaagcgtggc gctttctcat agctcacgct gtaggtatct cagttcggtg t aggtcgttc
1200 gctccaagct gggctgtgtg cacgaacccc ccgttcagcc cgaccgctgc g
ccttatccg 1260 gtaactatcg tcttgagtcc aacccggtaa gacacgactt
atcgccactg g cagcagcca 1320 ctggtaacag gattagcaga gcgaggtatg
taggcggtgc tacagagttc t tgaagtggt 1380 ggcctaacta cggctacact
agaaggacag tatttggtat ctgcgctctg c tgaagccag 1440 ttaccttcgg
aaaaagagtt ggtagctctt gatccggcaa acaaaccacc g ctggtagcg 1500
gtggtttttt tgtttgcaag cagcagatta cgcgcagaaa aaaaggatct c aagaagatc
1560 ctttgatctt ttctacgggg tctgacgctc agtggaacga aaactcacgt t
aagggattt 1620 tggtcatgag attatcaaaa aggatcttca cctagatcct
tttaaattaa a aatgaagtt 1680 ttaaatcaat ctaaagtata tatgagtaaa
cttggtctga cagttaccaa t gcttaatca 1740 gtgaggcacc tatctcagcg
atctgtctat ttcgttcatc catagttgcc t gactccccg 1800 tcgtgtagat
aactacgata cgggagggct taccatctgg ccccagtgct g caatgatac 1860
cgcgagaccc acgctcaccg gctccagatt tatcagcaat aaaccagcca g ccggaaggg
1920 ccgagcgcag aagtggtcct gcaactttat ccgcctccat ccagtctatt a
attgttgcc 1980 gggaagctag agtaagtagt tcgccagtta atagtttgcg
caacgttgtt g ccattgcta 2040 caggcatcgt ggtgtcacgc tcgtcgtttg
gtatggcttc attcagctcc g gttcccaac 2100 gatcaaggcg agttacatga
tcccccatgt tgtgcaaaaa agcggttagc t ccttcggtc 2160 ctccgatcgt
tgtcagaagt aagttggccg cagtgttatc actcatggtt a tggcagcac 2220
tgcataattc tcttactgtc atgccatccg taagatgctt ttctgtgact g gtgagtact
2280 caaccaagtc attctgagaa tagtgtatgc ggcgaccgag ttgctcttgc c
cggcgtcaa 2340 tacgggataa taccgcgcca catagcagaa ctttaaaagt
gctcatcatt g gaaaacgtt 2400 cttcggggcg aaaactctca aggatcttac
cgctgttgag atccagttcg a tgtaaccca 2460 ctcgtgcacc caactgatct
tcagcatctt ttactttcac cagcgtttct g ggtgagcaa 2520 aaacaggaag
gcaaaatgcc gcaaaaaagg gaataagggc gacacggaaa t gttgaatac 2580
tcatactctt cctttttcaa tattattgaa gcatttatca gggttattgt c tcatgagcg
2640 gatacatatt tgaatgtatt tagaaaaata aacaaatagg ggttccgcgc a
catttcccc 2700 gaaaagtgcc ac 2712 212750DNAartificial
sequenceplasmid vector 21 gtggcacttt tcggggaaat gtgcgcggaa
cccctatttg tttatttttc t aaatacatt 60 caaatatgta tccgctcatg
agacaataac cctgataaat gcttcaataa t attgaaaaa 120 ggaagagtat
gagtattcaa catttccgtg tcgcccttat tccctttttt g cggcatttt 180
gccttcctgt ttttgctcac ccagaaacgc tggtgaaagt aaaagatgct g aagatcagt
240 tgggtgcacg agtgggttac atcgaactgg atctcaacag cggtaagatc c
ttgagagtt 300 ttcgccccga agaacgtttt ccaatgatga gcacttttaa
agttctgcta t gtggcgcgg 360 tattatcccg tattgacgcc gggcaagagc
aactcggtcg ccgcatacac t attctcaga 420 atgacttggt tgagtactca
ccagtcacag aaaagcatct tacggatggc a tgacagtaa 480 gagaattatg
cagtgctgcc ataaccatga gtgataacac tgcggccaac t tacttctga 540
caacgatcgg aggaccgaag gagctaaccg cttttttgca caacatgggg g atcatgtaa
600 ctcgccttga tcgttgggaa ccggagctga atgaagccat accaaacgac g
agcgtgaca 660 ccacgatgcc tgtagcaatg gcaacaacgt tgcgcaaact
attaactggc g aactactta 720 ctctagcttc ccggcaacaa ttaatagact
ggatggaggc ggataaagtt g caggaccac 780 ttctgcgctc ggcccttccg
gctggctggt ttattgctga taaatctgga g ccggtgagc 840 gtgggtctcg
cggtatcatt gcagcactgg ggccagatgg taagccctcc c gtatcgtag 900
ttatctacac gacggggagt caggcaacta tggatgaacg aaatagacag a tcgctgaga
960 taggtgcctc actgattaag cattggtaac tgtcagacca agtttactca t
atatacttt 1020 agattgattt aaaacttcat ttttaattta aaaggatcta
ggtgaagatc c tttttgata 1080 atctcatgac caaaatccct taacgtgagt
tttcgttcca ctgagcgtca g accccgtag 1140 aaaagatcaa aggatcttct
tgagatcctt tttttctgcg cgtaatctgc t gcttgcaaa 1200 caaaaaaacc
accgctacca gcggtggttt gtttgccgga tcaagagcta c caactcttt 1260
ttccgaaggt aactggcttc agcagagcgc agataccaaa tactgtcctt c tagtgtagc
1320 cgtagttagg ccaccacttc aagaactctg tagcaccgcc tacatacctc g
ctctgctaa 1380 tcctgttacc agtggctgct gccagtggcg ataagtcgtg
tcttaccggg t tggactcaa 1440 gacgatagtt accggataag gcgcagcggt
cgggctgaac ggggggttcg t gcacacagc 1500 ccagcttgga gcgaacgacc
tacaccgaac tgagatacct acagcgtgag c tatgagaaa 1560 gcgccacgct
tcccgaaggg agaaaggcgg acaggtatcc ggtaagcggc a gggtcggaa 1620
caggagagcg cacgagggag cttccagggg gaaacgcctg gtatctttat a gtcctgtcg
1680 ggtttcgcca cctctgactt gagcgtcgat ttttgtgatg ctcgtcaggg g
ggcggagcc 1740 tatggaaaaa cgccagcaac gcggcctttt tacggttcct
ggccttttgc t ggccttttg 1800 ctcacatgtt ctttcctgcg ttatcccctg
attctgtgga taaccgtatt a ccgcctttg 1860 agtgagctga taccgctcgc
cgcagccgaa cgaccgagcg cagcgagtca g tgagcgagg 1920 aagcgtaccc
gataaaagcg gcttcctgac aggaggccgt tttgttttgc a gcccaccta 1980
gcggaagagc gcccaatacg caaaccgcct ctccccgcgc gttggccgat t cattaatgc
2040
agctggcacg acaggtttcc cgactggaaa gcgggcagtg agcgcaacgc a attaatgtg
2100 agttagctca ctcattaggc accccaggct ttacacttta tgctcccggc t
cgtatgttg 2160 tgtggaattg tgagcggata acaattgaat tcacatctag
atatctcgag t caatactag 2220 tggccaggcc ggccagcggc cgcattaccg
atcgcccttc ccaacagttg c gcagcctga 2280 atggcgaatg ggacgcgccc
tgtagcggcg cattaagcgc ggcgggtgtg g tggttacgc 2340 gcagcgtgac
cgctacactt gccagcgccc tagcgcccgc tcctttcgct t tcttccctt 2400
cctttctcgc cacgttcgcc ggctttcccc gtcaagctct aaatcggggg c tccctttag
2460 ggttccgatt tagtgcttta cggcacctcg accccaaaaa acttgattag g
gtgatggtt 2520 cacgtagtgg gccatcgccc tgatagacgg tttttcgccc
tttgacgttg g agtccacgt 2580 tctttaatag tggactcttg ttccaaactg
gaacaacact caaccctatc t cggtctatt 2640 cttttgattt ataagggatt
ttgccgattt cggcctattg gttaaaaaat g agctgattt 2700 aacaaaaatt
taacgcgaat tttaacaaaa tattaacgct tacaatttag 2750 22102DNAartificial
sequenceoligonucleotide 22 aattcaagga gttaattatg aaaaaaaccg
cgattgcgat tgcggtggcg c tggcgggct 60 ttgcgaccgt ggcccaggcg
gcctctagaa tctgcggccg ca 102 23102DNAartificial
sequenceoligonucleotide 23 ctagtgcggc cgcagattct agaggccgcc
tgggccacgg tcgcaaagcc c gccagcgcc 60 accgcaatcg caatcgcggt
ttttttcata attaactcct tg 102 24565DNAartificial sequencestuffer
sequence 24 tctagataac tgtggctgca ccatctgtct tcatcttccc gccatctgat
g agcagttga 60 aatctggaac tgcctctgtt gtgtgcctgc tgaataactt
ctatcccaga g aggccaaag 120 tacagtggaa ggtggataac gccctccaat
cgggtaactc ccaggagagt g tcacagagc 180 aggacagcaa ggacagcacc
tacagcctca gcagcaccct gacgctgagc a aagcagact 240 acgagaaaca
caaagtctac gcctgcgaag tcacccatca gggcctgagc t ccggaggtg 300
cctcagtcgt gtgcttcttg aacaacttct accccaaaga catcaatgtc a agtggaaga
360 ttgatggcag tgaacgacaa aatggcgtcc tgaacagttg gactgatcag g
acagcaaag 420 acagcaccta cagcatgagc agcaccctca cgttgaccaa
ggacgagtat g aacgacata 480 acagctatac ctgtgaggcc actcacaaga
catcaacttc acccattgtc a agagcttca 540 acaggaatga gtgttaagcg gccgc
565 251131DNAartificial sequencestuffer sequence 25 ctcgagctga
tgagccatgg aagctgtgtc gcctgcacca ggctcccacg g ctcgtggtg 60
cggtgcgctt ctggtgttcg ctgcctacag ccgacacgtc gagcttcgtg c ccctagagt
120 tgcgcgtcac agcagcctcc ggcgctccgc gatatcaccg tgtcatccac a
tcaatgaag 180 tagtgctcct agacgccccc gtggggctgg tggcgcggtt
ggctgacgag a gcggccacg 240 tagtgttgcg ctggctcccg ccgcctgaga
cacccatgac gtctcacatc c gctacgagg 300 tggacgtctc ggccggcaac
ggcgcaggga gcgtacagag ggtggagatc c tggagggcc 360 gcaccgagtg
tgtgctgagc aacctgcggg gccggacgcg ctacaccttc g ccgtccgcg 420
cgcgtatggc tgagccgagc ttcggcggct tctggagcgc ctggtcggag c ctgtgtcgc
480 tgctgacgcc tagcgacctg gaccccctca tcctgacgct ctccctcatc c
tcgtggtca 540 tcctggtgct gctgaccgtg ctcgcgctgc tctcccaccg
ccgggctctg a agcagaaga 600 tctggcctgg catcccgagc ccagagagcg
agtttgaagg cctcttcacc a cccacaagg 660 gtaacttcca gctgtggctg
taccagaatg atggctgcct gtggtggagc c cctgcaccc 720 ccttcacgga
ggacccacct gcttccctgg aagtcctctc agagcgctgc t gggggacga 780
tgcaggcagt ggagccgggg acagatgatg agggcccttt tccccctcgt c tcctgtgag
840 aattccccgt cggatacgag cagcgtggcc gttggctgcc tcgcacagga c
ttccttccc 900 gactccatca ctttctcctg gaaatacaag aacaactctg
acatcagcag c acccggggc 960 ttcccatcag tcctgagagg gggcaagtac
gcagccacct cacaggtgct g ctgccttcc 1020 aaggacgtca tgcagggcac
agacgaacac gtggtgtgca aagtccagca c cccaacggc 1080 aacaaagaaa
agaacgtgcc tcttccagtg attgctgagc tgcctactag t 1131
261121DNAartificial sequencestuffer sequence 26 ctcgagctga
tgagccatgg aagctgtgtc gcctgcacca ggctcccacg g ctcgtggtg 60
cggtgcgctt ctggtgttcg ctgcctacag ccgacacgtc gagcttcgtg c ccctagagt
120 tgcgcgtcac agcagcctcc ggcgctccgc gatatcaccg tgtcatccac a
tcaatgaag 180 tagtgctcct agacgccccc gtggggctgg tggcgcggtt
ggctgacgag a gcggccacg 240 tagtgttgcg ctggctcccg ccgcctgaga
cacccatgac gtctcacatc c gctacgagg 300 tggacgtctc ggccggcaac
ggcgcaggga gcgtacagag ggtggagatc c tggagggcc 360 gcaccgagtg
tgtgctgagc aacctgcggg gccggacgcg ctacaccttc g ccgtccgcg 420
cgcgtatggc tgagccgagc ttcggcggct tctggagcgc ctggtcggag c ctgtgtcgc
480 tgctgacgcc tagcgacctg gaccccctca tcctgacgct ctccctcatc c
tcgtggtca 540 tcctggtgct gctgaccgtg ctcgcgctgc tctcccaccg
ccgggctctg a agcagaaga 600 tctggcctgg catcccgagc ccagagagcg
agtttgaagg cctcttcacc a cccacaagg 660 gtaacttcca gctgtggctg
taccagaatg atggctgcct gtggtggagc c cctgcaccc 720 ccttcacgga
ggacccacct gcttccctgg aagtcctctc agagcgctgc t gggggacga 780
tgcaggcagt ggagccgggg acagatgatg agggcccatc ggtcttcccc c tggcaccct
840 cctccaagag cacctctggc ggcacagcgg ccctgggctg cctggtcaag g
actacttcc 900 ccgaaccggt gacggtgtcg tggaactcag gcgctctgac
cagcggcgtg c acaccttcc 960 cggctgtcct acagtcctca ggactctact
ccctcagcag cgtggtgacc g tgccatcca 1020 gcagcttggg cacccagacc
tacatctgca acgtgaatca caagcccagc a acaccaagg 1080 tggacaagaa
agttgagccc aaatcttgtg acaaaactag t 1121 27337DNAartificial
sequencestuffer sequence 27 tctagataac tgtggctgca ccatctgtct
tcatcttccc gccatctgat g agcagttga 60 aatctggaac tgcctctgtt
gtgtgcctgc tgaataactt ctatcccaga g aggccaaag 120 tacagtggaa
ggtggataac gccctccaat cgggtaactc ccaggagagt g tcacagagc 180
aggacagcaa ggacagcacc tacagcctca gcagcaccct gacgctgagc a aagcagact
240 acgagaaaca caaagtctac gcctgcgaag tcacccatca gggcctgagc t
cgcccgtca 300 caaagagctt caacagggga gagtgttaag cggccgc 337
28509DNAartificial sequencestuffer sequence 28 tctagataac
tgtggctgca ccatctgtct tcatcttccc gccatctgat g agcagttga 60
aatctggaac tgcctctgtt gtgtgcctgc tgaataactt ctatcccaga g aggccaaag
120 tacagtggaa ggtggataac gccctccaat cgggtaactc ccaggagagt g
tcacagagc 180 aggacagcaa ggacagcacc tacagcctca gcagcaccct
gacgctgagc a aagcagact 240 acgagaaaca caaagtctac gcctgcgaag
tcacccatca gggcctgagc t ctgacagtg 300 gcttggaaag cagatagcag
ccccgtcaag gcgggagtgg agaccaccac a ccctccaaa 360 caaagcaaca
acaagtacgc ggccagcagc tatctgagcc tgacgcctga g cagtggaag 420
tcccacagaa gctacagctg ccaggtcacg catgaaggga gcaccgtgga g aagacagtg
480 gcccctacag aatgttcata agcggccgc 509 291059DNAartificial
sequencestuffer sequence 29 ctcgagctga tgagccatgg aagctgtgtc
gcctgcacca ggctcccacg g ctcgtggtg 60 cggtgcgctt ctggtgttcg
ctgcctacag ccgacacgtc gagcttcgtg c ccctagagt 120 tgcgcgtcac
agcagcctcc ggcgctccgc gatatcaccg tgtcatccac a tcaatgaag 180
tagtgctcct agacgccccc gtggggctgg tggcgcggtt ggctgacgag a gcggccacg
240 tagtgttgcg ctggctcccg ccgcctgaga cacccatgac gtctcacatc c
gctacgagg 300 tggacgtctc ggccggcaac ggcgcaggga gcgtacagag
ggtggagatc c tggagggcc 360 gcaccgagtg tgtgctgagc aacctgcggg
gccggacgcg ctacaccttc g ccgtccgcg 420 cgcgtatggc tgagccgagc
ttcggcggct tctggagcgc ctggtcggag c ctgtgtcgc 480 tgctgacgcc
tagcgacctg gaccccctca tcctgacgct ctccctcatc c tcgtggtca 540
tcctggtgct gctgaccgtg ctcgcgctgc tctcccaccg ccgggctctg a agcagaaga
600 tctggcctgg catcccgagc ccagagagcg agtttgaagg cctcttcacc a
cccacaagg 660 gtaacttcca gctgtggctg taccagaatg atggctgcct
gtggtggagc c cctgcaccc 720 ccttcacgga ggacccacct gcttccctgg
aagtcctctc agagcgctgc t gggggacga 780 tgcaggcagt ggagccgggg
acagatgatg agggccctag gatgcctggt c aagggttat 840 ttccctgagc
cagtgacctt gacctggaac tctggatccc tgtccagtgg t gtgcacacc 900
ttcccagctg tcctgcagtc tgacctctac accctcagca gctcagtgac t gtaacctcc
960 agcacctggc ccagccagtc catcacctgc aatgtggccc acccggcaag c
agcaccaag 1020 gtggacaaga aaattgagcc cagagtgccc acaactagt 1059
301056DNAartificial sequencestuffer sequence 30 ctcgagctga
tgagccatgg aagctgtgtc gcctgcacca ggctcccacg g ctcgtggtg 60
cggtgcgctt ctggtgttcg ctgcctacag ccgacacgtc gagcttcgtg c ccctagagt
120 tgcgcgtcac agcagcctcc ggcgctccgc gatatcaccg tgtcatccac a
tcaatgaag 180 tagtgctcct agacgccccc gtggggctgg tggcgcggtt
ggctgacgag a gcggccacg 240 tagtgttgcg ctggctcccg ccgcctgaga
cacccatgac gtctcacatc c gctacgagg 300 tggacgtctc ggccggcaac
ggcgcaggga gcgtacagag ggtggagatc c tggagggcc 360 gcaccgagtg
tgtgctgagc aacctgcggg gccggacgcg ctacaccttc g ccgtccgcg 420
cgcgtatggc tgagccgagc ttcggcggct tctggagcgc ctggtcggag c ctgtgtcgc
480 tgctgacgcc tagcgacctg gaccccctca tcctgacgct ctccctcatc c
tcgtggtca 540 tcctggtgct gctgaccgtg ctcgcgctgc tctcccaccg
ccgggctctg a agcagaaga 600 tctggcctgg catcccgagc ccagagagcg
agtttgaagg cctcttcacc a cccacaagg 660 gtaacttcca gctgtggctg
taccagaatg atggctgcct gtggtggagc c cctgcaccc 720 ccttcacgga
ggacccacct gcttccctgg aagtcctctc agagcgctgc t gggggacga 780
tgcaggcagt ggagccgggg acagatgatg agggccctag gatgcctggt c aagggctat
840 ttccctgagc cagtgacagt gacctggaac tctggatccc tgtccagcgg t
gtgcacacc 900 ttcccagctg tcctgcagtc tgacctctac actctgagca
gctcagtgac t gtcccctcc 960 agcacctggc ccagcgagac cgtcacctgc
aacgttgccc acccggccag c agcaccaag 1020 gtggacaaga aaattgtgcc
cagggattgt actagt 1056 314153DNAartificial sequenceplasmid vector
31 gtggcacttt tcggggaaat gtgcgcggaa cccctatttg tttatttttc t
aaatacatt 60 caaatatgta tccgctcatg agacaataac cctgataaat gcttcaataa
t attgaaaaa 120 ggaagagtat gagtattcaa catttccgtg tcgcccttat
tccctttttt g cggcatttt 180 gccttcctgt ttttgctcac ccagaaacgc
tggtgaaagt aaaagatgct g aagatcagt 240 tgggtgcacg agtgggttac
atcgaactgg atctcaacag cggtaagatc c ttgagagtt 300 ttcgccccga
agaacgtttt ccaatgatga gcacttttaa agttctgcta t gtggcgcgg 360
tattatcccg tattgacgcc gggcaagagc aactcggtcg ccgcatacac t attctcaga
420 atgacttggt tgagtactca ccagtcacag aaaagcatct tacggatggc a
tgacagtaa 480 gagaattatg cagtgctgcc ataaccatga gtgataacac
tgcggccaac t tacttctga 540 caacgatcgg aggaccgaag gagctaaccg
cttttttgca caacatgggg g atcatgtaa 600 ctcgccttga tcgttgggaa
ccggagctga atgaagccat accaaacgac g agcgtgaca 660 ccacgatgcc
tgtagcaatg gcaacaacgt tgcgcaaact attaactggc g aactactta 720
ctctagcttc ccggcaacaa ttaatagact ggatggaggc ggataaagtt g caggaccac
780 ttctgcgctc ggcccttccg gctggctggt ttattgctga taaatctgga g
ccggtgagc 840 gtgggtctcg cggtatcatt gcagcactgg ggccagatgg
taagccctcc c gtatcgtag 900 ttatctacac gacggggagt caggcaacta
tggatgaacg aaatagacag a tcgctgaga 960 taggtgcctc actgattaag
cattggtaac tgtcagacca agtttactca t atatacttt 1020 agattgattt
aaaacttcat ttttaattta aaaggatcta ggtgaagatc c tttttgata 1080
atctcatgac caaaatccct taacgtgagt tttcgttcca ctgagcgtca g accccgtag
1140 aaaagatcaa aggatcttct tgagatcctt tttttctgcg cgtaatctgc t
gcttgcaaa 1200 caaaaaaacc accgctacca gcggtggttt gtttgccgga
tcaagagcta c caactcttt 1260 ttccgaaggt aactggcttc agcagagcgc
agataccaaa tactgtcctt c tagtgtagc 1320 cgtagttagg ccaccacttc
aagaactctg tagcaccgcc tacatacctc g ctctgctaa 1380 tcctgttacc
agtggctgct gccagtggcg ataagtcgtg tcttaccggg t tggactcaa 1440
gacgatagtt accggataag gcgcagcggt cgggctgaac ggggggttcg t gcacacagc
1500 ccagcttgga gcgaacgacc tacaccgaac tgagatacct acagcgtgag c
tatgagaaa 1560 gcgccacgct tcccgaaggg agaaaggcgg acaggtatcc
ggtaagcggc a gggtcggaa 1620 caggagagcg cacgagggag cttccagggg
gaaacgcctg gtatctttat a gtcctgtcg 1680 ggtttcgcca cctctgactt
gagcgtcgat ttttgtgatg ctcgtcaggg g ggcggagcc 1740 tatggaaaaa
cgccagcaac gcggcctttt tacggttcct ggccttttgc t ggccttttg 1800
ctcacatgtt ctttcctgcg ttatcccctg attctgtgga taaccgtatt a ccgcctttg
1860 agtgagctga taccgctcgc cgcagccgaa cgaccgagcg cagcgagtca g
tgagcgagg 1920 aagcgtaccc gataaaagcg gcttcctgac aggaggccgt
tttgttttgc a gcccaccta 1980 gcggaagagc gcccaatacg caaaccgcct
ctccccgcgc gttggccgat t cattaatgc 2040 agctggcacg acaggtttcc
cgactggaaa gcgggcagtg agcgcaacgc a attaatgtg 2100 agttagctca
ctcattaggc accccaggct ttacacttta tgctcccggc t cgtatgttg 2160
tgtggaattg tgagcggata acaattgaat tcaaggagtt aattatgaaa a aaaccgcga
2220 ttgcgattgc ggtggcgctg gcgggctttg cgaccgtggc ccaggcggcc t
ctagaatct 2280 gcggccgcac tagatataat taaggagata aatatgaaat
atctgctgcc g accgcggcg 2340 gcgggcctgc tgctgctggc ggcgcagccg
gcgatggcgc tcgagtcaat a ctagtggcc 2400 aggccggcct tgaaactgtt
gaaagttgtt tagcaaaacc ccatacagaa a attcattta 2460 ctaacgtctg
gaaagacgac aaaactttag atcgttacgc taactatgag g gctgtctgt 2520
ggaatgctac aggcgttgta gtttgtactg gtgacgaaac tcagtgttac g gtacatggg
2580 ttcctattgg gcttgctatc cctgaaaatg agggtggtgg ctctgagggt g
gcggttctg 2640 agggtggcgg ctctgagggt ggcggtacta aacctcctga
gtacggtgat a cacctattc 2700 cgggctatac ttatatcaac cctctcgacg
gcacttatcc gcctggtact g agcaaaacc 2760 ccgctaatcc taatccttct
cttgaggagt ctcagcctct taatactttc a tgtttcaga 2820 ataataggtt
ccgaaatagg cagggggcat taactgttta tacgggcact g ttactcaag 2880
gcactgaccc cgttaaaact tattaccagt acactcctgt atcatcaaaa g ccatgtatg
2940 acgcttactg gaacggtaaa ttcagagact gcgctttcca ttctggcttt a
atgaggatc 3000 cattcgtttg tgaatatcaa ggccaatcgt ctgacctgcc
tcaacctcct g ttaatgctg 3060 gcggcggctc tggtggtggt tctggtggcg
gctctgaggg tggtggctct g agggtggcg 3120 gttctgaggg tggcggctct
gagggtggcg gttccggtgg tggctctggt t ccggtgatt 3180 ttgattatga
aaagatggca aacgctaata agggggctat gaccgaaaat g ccgatgaaa 3240
acgcgctaca gtctgacgct aaaggcaaac ttgattctgt cgctactgat t acggtgctg
3300 ctatcgacgg tttcattggt gacgtttccg gccttgctaa tggtaatggt g
ctactggtg 3360 attttgctgg ctctaattcc caaatggctc aagtcggtga
cggtgataat t cacctttaa 3420 tgaataattt ccgtcaatat ttaccttccc
tccctcaatc ggttgaatgt c gcccttttg 3480 tctttggcgc tggtaaacca
tatgaatttt ctattgattg tgacaaaata a acttattcc 3540 gtggtgtctt
tgcgtttctt ttatatgttg ccacctttat gtatgtattt t cgacgtttg 3600
ctaacatact gcgtaataaa gagtcttaag ctagctaagc ggccgcatta c cgatcgccc
3660 ttcccaacag ttgcgcagcc tgaatggcga atgggacgcg ccctgtagcg g
cgcattaag 3720 cgcggcgggt gtggtggtta cgcgcagcgt gaccgctaca
cttgccagcg c cctagcgcc 3780 cgctcctttc gctttcttcc cttcctttct
cgccacgttc gccggctttc c ccgtcaagc 3840 tctaaatcgg gggctccctt
tagggttccg atttagtgct ttacggcacc t cgaccccaa 3900 aaaacttgat
tagggtgatg gttcacgtag tgggccatcg ccctgataga c ggtttttcg 3960
ccctttgacg ttggagtcca cgttctttaa tagtggactc ttgttccaaa c tggaacaac
4020 actcaaccct atctcggtct attcttttga tttataaggg attttgccga t
ttcggccta 4080 ttggttaaaa aatgagctga tttaacaaaa atttaacgcg
aattttaaca a aatattaac 4140 gcttacaatt tag 4153
* * * * *