U.S. patent application number 13/747740 was filed with the patent office on 2013-05-23 for genes encoding nematode toxins.
This patent application is currently assigned to ATHENIX CORPORATION. The applicant listed for this patent is Athenix Corporation. Invention is credited to Vadim Beilinson, Brian Carr, Julia T. Daum, Cheryl L. Peters, Candace Poutre, Kimberly Sampson, Brian Vande Berg, Sandra Volrath.
Application Number | 20130133107 13/747740 |
Document ID | / |
Family ID | 42062400 |
Filed Date | 2013-05-23 |
United States Patent
Application |
20130133107 |
Kind Code |
A1 |
Peters; Cheryl L. ; et
al. |
May 23, 2013 |
GENES ENCODING NEMATODE TOXINS
Abstract
Compositions and methods for conferring nematicidal activity to
bacteria, plants, plant cells, tissues and seeds are provided.
Compositions including a coding sequence for nematicidal
polypeptides are provided. The coding sequences can be used in DNA
constructs or expression cassettes for transformation and
expression in plants and bacteria. Compositions also include
transformed bacteria, plants, plant cells, tissues, and seeds. In
particular, isolated nematicidal nucleic acid molecules are
provided. Additionally, amino acid sequences corresponding to the
polynucleotides are encompassed. In particular, the present
invention provides for isolated nucleic acid molecules including
nucleotide sequences encoding the amino acid sequence shown in SEQ
ID NO:4, 5, 8, 9, 13, 14, 47, 48, or 49, the nucleotide sequence
set forth in SEQ ID NO:1, 2, 3, 6, 7, 10, 11, 12, 15, 45, or 46, as
well as variants and fragments thereof.
Inventors: |
Peters; Cheryl L.; (Raleigh,
NC) ; Vande Berg; Brian; (Durham, NC) ; Carr;
Brian; (Raleigh, NC) ; Daum; Julia T.; (Apex,
NC) ; Beilinson; Vadim; (Cary, NC) ; Volrath;
Sandra; (Durham, NC) ; Poutre; Candace;
(Moncure, NC) ; Sampson; Kimberly; (Durham,
NC) |
|
Applicant: |
Name |
City |
State |
Country |
Type |
Athenix Corporation; |
Research Triangle Park |
NC |
US |
|
|
Assignee: |
ATHENIX CORPORATION
Research Triangle Park
NC
|
Family ID: |
42062400 |
Appl. No.: |
13/747740 |
Filed: |
January 23, 2013 |
Related U.S. Patent Documents
|
|
|
|
|
|
Application
Number |
Filing Date |
Patent Number |
|
|
12638591 |
Dec 15, 2009 |
8404934 |
|
|
13747740 |
|
|
|
|
61122674 |
Dec 15, 2008 |
|
|
|
61224811 |
Jul 10, 2009 |
|
|
|
Current U.S.
Class: |
800/279 ;
435/252.3; 435/320.1; 435/419; 514/4.6; 530/300; 536/23.74;
800/301 |
Current CPC
Class: |
C12N 9/0059 20130101;
C12N 15/8285 20130101; Y02A 40/146 20180101; C12N 15/8241 20130101;
C12Y 110/03001 20130101; Y02A 40/164 20180101; C12Y 114/18001
20130101; C12N 9/0071 20130101 |
Class at
Publication: |
800/279 ;
536/23.74; 435/320.1; 435/252.3; 435/419; 800/301; 530/300;
514/4.6 |
International
Class: |
C12N 15/82 20060101
C12N015/82 |
Claims
1. An isolated or recombinant nucleic acid molecule comprising a
nucleotide sequence encoding an amino acid sequence having
nematicidal activity, wherein said nucleotide sequence is selected
from the group consisting of: a) a nucleotide sequence that encodes
a polypeptide comprising the amino acid sequence of SEQ ID NO:4 or
5; b) a nucleotide sequence encoding a proteolytic cleavage
fragment of SEQ ID NO:4 or 5, wherein said fragment has nematicidal
activity; c) a nucleotide sequence that encodes a polypeptide
comprising an amino acid sequence having at least 95% sequence
identity to the amino acid sequence of SEQ ID NO:4 or 5; and d) the
nucleotide sequence set forth in SEQ ID NO:1, 2, or 3.
2. The isolated nucleic acid molecule of claim 1, wherein said
nucleotide sequence is a synthetic sequence that has been designed
for expression in a plant.
3. The isolated nucleic acid molecule of claim 2, wherein said
synthetic sequence comprises SEQ ID NO:6.
4. A vector comprising the nucleic acid molecule of claim 1.
5. The vector of claim 4, further comprising a nucleic acid
molecule encoding a heterologous polypeptide.
6. A bacterial host cell that contains the insert of the vector of
claim 4.
7. A transgenic plant cell comprising a nucleotide sequence
encoding a polypeptide selected from the group consisting of: a) a
polypeptide comprising the amino acid sequence of SEQ ID NO:4 or 5;
b) a polypeptide comprising an amino acid sequence having at least
95% sequence identity to the amino acid sequence of SEQ ID NO:4 or
5, wherein said polypeptide has nematicidal activity; c) a
polypeptide that is a proteolytic cleavage fragment of SEQ ID NO:4
or 5, wherein said fragment has nematicidal activity; and d) a
polypeptide that is encoded by SEQ ID NO:1, 2, or 3.
8. A transgenic plant regenerated from the plant cell of claim
7.
9. The transgenic plant of claim 8, wherein said plant is selected
from the group consisting of maize, sorghum, wheat, cabbage,
sunflower, tomato, crucifers, peppers, potato, cotton, rice,
soybean, sugarbeet, sugarcane, tobacco, barley, and oilseed
rape.
10. An isolated polypeptide with nematicidal activity, selected
from the group consisting of: a) a polypeptide comprising the amino
acid sequence of SEQ ID NO:4 or 5; b) a polypeptide comprising an
amino acid sequence having at least 95% sequence identity to the
amino acid sequence of SEQ ID NO:4 or 5, wherein said polypeptide
has nematicidal activity; c) a polypeptide that is a proteolytic
cleavage fragment of SEQ ID NO:4 or 5, wherein said fragment has
nematicidal activity; and d) a polypeptide that is encoded by SEQ
ID NO:1, 2, 3, 6, 7, 10, 12, 15, 45, or 46.
11. The polypeptide of claim 10 further comprising heterologous
amino acid sequence.
12. A composition comprising the polypeptide of claim 10.
13. The composition of claim 12, wherein said composition is
selected from the group consisting of a powder, dust, pellet,
granule, spray, emulsion, colloid, and solution.
14. The composition of claim 12, wherein said composition is
prepared by desiccation, lyophilization, homogenization,
extraction, filtration, centrifugation, sedimentation, or
concentration of a culture of bacterial cells.
15. The composition of claim 12, comprising from about 1% to about
99% by weight of said polypeptide.
16. A method for killing or controlling a nematode pest, comprising
contacting said pest with, or feeding to said pest, a
nematicidally-effective amount of a composition comprising the
polypeptide of claim 10.
17. A method for protecting a plant from a pest, comprising
introducing into said plant or cell thereof at least one expression
vector comprising a nucleotide sequence that encodes a nematicidal
polypeptide, wherein said plant is planted in an area susceptible
to nematode infestation and wherein said nucleotide sequence is
selected from the group consisting of: a) a nucleotide sequence
that encodes a polypeptide comprising the amino acid sequence of
SEQ ID NO:4 or 5; b) a nucleotide sequence encoding a proteolytic
cleavage fragment of SEQ ID NO:4 or 5, wherein said fragment has
nematicidal activity; c) a nucleotide sequence that encodes a
polypeptide comprising an amino acid sequence having at least 95%
sequence identity to the amino acid sequence of SEQ ID NO:4 or 5;
and d) the nucleotide sequence set forth in SEQ ID NO:1, 2, or 3.
Description
CROSS REFERENCE TO RELATED APPLICATION
[0001] This application is a divisional of U.S. patent application
Ser. No. 12/638,591, filed Dec. 15, 2009, which claims the benefit
of U.S. Provisional Application Ser. No. 61/122,674, filed Dec. 15,
2008 and U.S. Provisional Application Ser. No. 61/224,811, filed
Jul. 10, 2009, the contents of which are herein incorporated by
reference in their entirety.
REFERENCE TO SEQUENCE LISTING SUBMITTED ELECTRONICALLY
[0002] The official copy of the sequence listing is submitted
electronically via EFS-Web as an ASCII formatted sequence listing
with a file named "APA060US02DSEQLIST.txt", created on Jan. 14,
2013, and having a size of 165 kilobytes and is filed concurrently
with the specification. The sequence listing contained in this
ASCII formatted document is part of the specification and is herein
incorporated by reference in its entirety.
FIELD OF THE INVENTION
[0003] This invention relates to the field of molecular biology.
Provided are novel genes that encode nematicidal proteins. These
proteins and the nucleic acid sequences that encode them are useful
in preparing nematicidal formulations and in the production of
transgenic pest-resistant plants.
BACKGROUND OF THE INVENTION
[0004] Nematodes (derived from the Greek word for thread) are
active, flexible, elongate, organisms that live on moist surfaces
or in liquid environments, including films of water within soil and
moist tissues within other organisms. While only 20,000 species of
nematode have been identified, it is estimated that 40,000 to 10
million actually exist. Some species of nematodes have evolved to
be very successful parasites of both plants and animals and are
responsible for significant economic losses in agriculture and
livestock and for morbidity and mortality in humans (Whitehead
(1998) Plant Nematode Control. CAB International, New York).
[0005] It is estimated that parasitic nematodes cost the
horticulture and agriculture industries in excess of $78 billion
worldwide a year, based on an estimated average 12% annual loss
spread across all major crops. For example, it is estimated that
nematodes cause soybean losses of approximately $3.2 billion
annually worldwide (Barker et al. (1994) Plant and Soil Nematodes:
Societal Impact and Focus for the Future. The Committee on National
Needs and Priorities in Nematology. Cooperative State Research
Service, US Department of Agriculture and Society of
Nematologists).
[0006] There are a very small array of chemicals available to
control nematodes (Becker (1999) Agricultural Research Magazine
47(3):22-24; U.S. Pat. No. 6,048,714). Nevertheless, the
application of chemical nematicides remains the major means of
nematode control. In general, chemical nematicides are highly toxic
compounds known to cause substantial environmental impact and are
increasingly restricted in the amounts and locations in which then
can be used.
[0007] The macrocyclic lactones (e.g., avermectins and milbemycins)
are chemicals that in principle provide excellent specificity and
efficacy and should allow environmentally safe control of plant
parasitic nematodes. Unfortunately, in practice, these two
nematicidal agents have proven less effective in agricultural
applications against root pathogens. Although certain avermectins
show exquisite activity against plant parasitic nematodes these
chemicals are hampered by poor bioavailability due to their light
sensitivity, degradation by soil microorganisms and tight binding
to soil particles (Lasota & Dybas (1990) Acta Leiden
59(1-2):217-225; Wright & Perry (1998) Musculature and
Neurobiology. In: The Physiology and Biochemistry of Free-Living
and Plant-parasitic Nematodes (eds R. N. Perry & D. J. Wright),
CAB International 1998). Consequently despite years of research and
extensive use against animal parasitic nematodes, mites and insects
(plant and animal applications), macrocyclic lactones (e.g.,
avermectins and milbemycins) have never been commercially developed
to control plant parasitic nematodes in the soil.
SUMMARY OF INVENTION
[0008] Compositions and methods for conferring nematode tolerance
activity to plants, plant cells, tissues and seeds are provided.
Compositions include nucleic acid molecules encoding sequences for
nematicidal polypeptides, vectors comprising those nucleic acid
molecules, and host cells comprising the vectors. Compositions also
include the nematicidal polypeptide sequences and antibodies to
those polypeptides. The nucleotide sequences can be used in DNA
constructs or expression cassettes for transformation and
expression in organisms, including microorganisms and plants. The
nucleotide or amino acid sequences may be synthetic sequences that
have been designed for expression in an organism including, but not
limited to, a microorganism or a plant. Compositions also comprise
transformed bacteria, plants, plant cells, tissues, and seeds.
[0009] In particular, isolated nucleic acid molecules are provided
that encode nematicidal proteins. Additionally, amino acid
sequences corresponding to the nematicidal protein are encompassed.
In particular, the present invention provides for an isolated
nucleic acid molecule comprising a nucleotide sequence encoding the
amino acid sequence shown in SEQ ID NO:4, 5, 8, 9, 13, 14, 47, 48,
or 49, the nucleotide sequence set forth in SEQ ID NO:1, 2, 3, 6,
7, 10, 11, 12, 15, 45, or 46, as well as variants and fragments
thereof. Nucleotide sequences that are complementary to a
nucleotide sequence of the invention, or that hybridize to a
sequence of the invention are also encompassed.
[0010] Methods are provided for producing the polypeptides of the
invention, and for using those polypeptides for controlling or
killing a nematode pest. Methods and kits for detecting the nucleic
acids and polypeptides of the invention in a sample are also
included.
[0011] The compositions and methods of the invention are useful for
the production of organisms with enhanced nematode resistance or
tolerance. These organisms and compositions comprising the
organisms are desirable for agricultural purposes. The compositions
of the invention are useful for identifying and generating plant
populations having improved nematode resistance, as well as in the
identification of Quantitative Trait Loci (QTLs) useful in
marker-assisted breeding of plants having nematode resistance or
tolerance.
DESCRIPTION OF FIGURES
[0012] FIG. 1 shows the enzymatic action of polyphenol
oxidases.
[0013] FIG. 2 shows the AXN-1 precursor protein (SEQ ID NO:13) with
the mass spectroscopy peaks mapped.
[0014] FIGS. 3A and 3B show an alignment of AXN-1 (SEQ ID NO:4)
with polyphenol oxidases from Neurospora crassa (SEQ ID NO:31),
Pyrenophora tritici-repentis (SEQ ID NO:32), Podospora anserina
(SEQ ID NO:33), Lentinula edodes (SEQ ID NO:34), Pycnoporus
sanguineus (SEQ ID NO:35), Pholio nameko (SEQ ID NO:36), Tuber
melanosporum (SEQ ID NO:37), and Aspergillus fumigatus (SEQ ID
NO:38).
[0015] FIG. 4 shows an alignment of AXN-8 (SEQ ID NO:13) with
polyphenol oxidases from Agaricus bisporus (SEQ ID NO:39),
Neurospora crassa (SEQ ID NO:31), and Streptomyces
castaneglobisporus (SEQ ID NO:40). Putative copper binding
histidines are found at amino acid positions 47, 81, 90, 208, 212,
and 235 of SEQ ID NO:13. The protease activation site is located at
position 377 of SEQ ID NO:39 and position 403 of SEQ ID NO:31.
Copper binding histidines are located at amino acid position 58 of
SEQ ID NO:39, position 67 of SEQ ID NO:31, and positions 38, 54,
63, 190, 194, and 216 of SEQ ID NO:40.
[0016] FIG. 5 shows an alignment of AXN-9 (SEQ ID NO:48) with AXN-8
(SEQ ID NO:13).
[0017] FIG. 6 shows a Western blot of soybean hairy root tissue
incubated with anti-AXN-1 antibody. Lane A is root tissue from
transgenic root tissue containing the AXN-1 gene, and Lane B is
from a control line lacking the AXN-1 gene.
DETAILED DESCRIPTION
Overview
[0018] Nematodes cause a substantial loss in agricultural products
including food and industrial crops and have primarily been
combated with chemical compounds having nematicidal activity.
Nematodes are microscopic wormlike animals that feed on roots,
leaves, and stems of more than 2,000 vegetables, fruits, and
ornamental plants. One common type of nematode is the root-knot
nematode, whose feeding causes the characteristic galls on roots.
Other root-feeding nematodes are the cyst- and lesion-type, which
are more host specific. Soybean cyst nematode (SCN) can decrease
the number of nitrogen-fixing nodules on the roots, and may make
the roots more susceptible to attacks by other soil-borne plant
pathogens. Due to the toxicity (and in many cases, poor efficacy)
of existing nematode control methods, it would be desirable to
develop safe and effective alternatives for nematode control.
[0019] The present invention is drawn to compositions and methods
for regulating nematode resistance or tolerance in organisms,
particularly plants or plant cells. By "resistance" is intended
that the nematode is killed upon ingestion or other contact with
the polypeptides of the invention. By "tolerance" is intended an
impairment or reduction in the movement, feeding, reproduction, or
other functions of the nematode. The methods involve transforming
organisms with a nucleotide sequence encoding a nematicidal protein
of the invention. In particular, the nucleotide sequences of the
invention are useful for preparing plants and microorganisms that
possess nematicidal activity. Thus, transformed bacteria, plants,
plant cells, plant tissues and seeds are provided.
[0020] Compositions include nematicidal nucleic acids and proteins
of bacterial, fungal, or plant origin. The nematicidal nucleic acid
sequences described herein encode polyphenol oxidase enzymes.
Polyphenol oxidases are believed to play key physiological roles
both in preventing insects and microorganisms from attacking plants
and as part of the wound response of plants and plant products to
insects, microorganisms and bruising (reviewed in Marshall et al.
(2000) "Enzymatic Browning in Fruits, Vegetables and Seafoods" Food
and Agricultural Organization of the United Nations at
www.fao.org). As fruits and vegetables ripen, their susceptibility
to disease and infestation is increased due to a decline in their
phenolic content. Phenoloxidase enzymes endogenous to fruits and
vegetables catalyze the production of quinones from their phenolic
constituents. Once formed, these quinones undergo polymerization
reactions, leading to the production of melanins, which exhibit
both antibacterial and antifungal activity and assist in keeping
the fruit and/or vegetable physiologically wholesome. However, the
use of polyphenol oxidase activity for nematode control has not
previously been discovered.
[0021] The polyphenol oxidase enzymes encompassed herein include
novel sequences as well as polyphenol oxidase sequences known in
the art. The sequences find use in the construction of expression
vectors for subsequent transformation into organisms of interest,
as probes for the isolation of other homologous (or partially
homologous) genes, and for the generation of altered nematicidal
proteins by methods known in the art, such as domain swapping or
DNA shuffling. The proteins find use in controlling or killing
nematode pest populations and for producing compositions with
nematicidal activity.
[0022] By "nematicidal toxin" or "nematicidal protein" is intended
a toxin that has toxic activity against one or more nematode pests,
including, but not limited to, the nematicidal toxins set forth in
SEQ ID NO:4, 5, 8, 9, 13, 14, 18, 20, 22, 47, 48, or 49, or a
protein that has homology to such a protein. Nematicidal proteins
include amino acid sequences deduced from the full-length
nucleotide sequences disclosed herein, and amino acid sequences
that are shorter than the full-length sequences, either due to the
use of an alternate downstream start site, or due to processing
(e.g., proteolytic cleavage, alternative splicing, and the like)
that produces a shorter protein having nematicidal activity.
Processing may occur in the organism the protein is expressed in,
or in the pest after ingestion of the protein.
Nematode Pests
[0023] The compositions and methods of the present invention are
useful for developing transgenic plants that are tolerant to
nematode pests, particularly plant-parasitic nematodes. Nematode
parasites of plants can inhabit all parts of plants, including
roots, developing flower buds, leaves, and stems. Plant parasites
are classified on the basis of their feeding habits into the broad
categories: migratory ectoparasites, migratory endoparasites, and
sedentary endoparasites. Sedentary endoparasites, which include the
root knot nematodes (Meloidogyne) and cyst nematodes (Globodera and
Heterodera) induce feeding sites and establish long-term infections
within roots that are often very damaging to crops (Whitehead
(1998) Plant Nematode Control. CAB International, New York).
Exemplary plant-parasitic nematodes include, but are not limited
to, Aphelenchoides spp. (Foliar nematodes), Belonolaimus spp. (The
Sting nematode), Bursaphelenchus xylophilus (Pine wilt nematode),
Criconemoides species (Ring Nematode), Ditylenchus destructor
(Potato Rot Nematode), Ditylenchus dipsaci (Stem and bulb
nematode), Globodera pallida (Pale Potato Cyst Nematode), Globodera
rostochiensis (Golden Nematode), Helicotylenchus (Spiral
Nematodes), Heterodera glycines (Soybean cyst nematode, Heterodera
schachtii (Sugar beet cyst nematode), Heterodera zeae (The Corn
Cyst Nematode), Heterodera avenae (cereal cyst nematode),
Hoplolaimus (The Lance Nematode), Meloidogyne spp. (Root-knot
nematodes), Mesocriconema xenoplax (Ring nematode), Nacobbus
aberrans (False root-knot nematode), Paratrichodorus (Stubby-Root
Nematodes), Pratylenchus spp (Lesion nematode), Radopholus similis
(Burrowing nematode), Rotylenchulus spp. (Reniform nematode),
Tylenchorhynchus spp. (Stunt nematodes), Tylenchulus semipenetrans
(The Citrus nematode), and Xiphinema (The Dagger Nematode).
Polyphenol Oxidases
[0024] The nematicidal compositions disclosed herein comprise
polyphenol oxidase nucleic acid and amino acid sequences, as well
as variants and fragments thereof. In various embodiments, the
compositions comprise transgenic plants or pesticidal formulations
expressing or comprising a polyphenol oxidase. The compositions are
useful for controlling or killing plant-parasitic nematodes in an
area susceptible to nematode infestation, particularly
plant-parasitic nematode infestation.
[0025] For the purposes of the present invention, a "polyphenol
oxidase" refers to a class of copper-containing oxidase enzymes
that includes, for example, monophenol monooxidases such as
tyrosinase, diphenol oxidases such as catechol oxidase and laccase,
hemocyanins, and the like. In various embodiments, the polyphenol
oxidase enzymes encompassed herein are members of the type 3 copper
protein family.
[0026] Polyphenol oxidases are enzymes with a dinuclear copper
center, with the copper ions serving to bind a molecular oxygen
atom within the active site of the enzyme to allow catalysis. The
oxidation state of each copper atom influences oxygen binding and
thus oxidase activity at each step. In the case of a monophenol
monooxidase, copper ions in the +2 oxidation state guide the
addition of a hydroxyl group in the ortho-position on an existing
phenol ring. Subsequently, a diphenol oxidase can bind this
diphenol product and oxidize both hydroxyl moieties to generate the
corresponding quinone. The diphenol oxidase activity takes place by
reduction of the copper ions to the +1 state and binding to a
molecular oxygen atom. While some organisms possess only a single
polyphenol oxidase activity (notably plants, which carry out the
diphenol oxidase step), other enzymes perform both the monooxidase
and diphenol oxidase reactions. Several x-ray structures have been
solved for type 3 copper enzymes, and distinct structural motifs
are conserved among the enzymes. Notable is the active site of
these enzymes, in which copper is bound by six or seven histidine
residues and a single cysteine residue is highly conserved. The
structural data also suggests most polyphenol oxidase enzymes have
somewhat relaxed specificity for their substrates, and that the
active site of the enzymes is flexible during catalysis.
[0027] The enzyme seems to be of almost universal distribution in
animals, plants, fungi and bacteria. Primary protein sequences of
polyphenol oxidases from Streptomyces glaucescens (Huber et al.
1985), Streptomyces antibioticus (Bernan et al. 1985) and
Neurospora crassa (Lerch, 1982), tomato (Shahar et al. 1992; Newman
et al. 1993), broad bean (Cary et al. 1992) potato (Hunt et al.
1993), mice (Shibahara et al. 1986) and humans (Kwon et al. 1987;
Giebel et al. 1991) have been determined using cDNA sequencing
techniques. Polyphenol oxidases of closely related plants, such as
tomato and potato, show approximately 91 percent sequence homology,
while those of tomato and fava bean show only 40 percent exact
homology (Wong, 1995).
[0028] Despite low sequence identity amongst polyphenol oxidase
enzymes derived from different species, they all have at their
active site a dinuclear copper center, in which type 3 copper is
bound to histidine residues, and this structure is highly
conserved. Marusek et al. show that a number of important
structural features are conserved in the N-terminal domains of
polyphenol oxidases from various plants and fungi, including a
tyrosine motif which can be considered a landmark indicating the
beginning of the linker region connecting the N- and C-terminal
domains. Sequence alignments and secondary structure predictions
indicate that the C-terminal domains of polyphenol oxidases are
likely to be similar in tertiary structure to that of hemocyanin
(Marusek et al. (2006) J Inorg Biochem. 100(1):108-23, herein
incorporated by reference in its entirety, particularly with
respect to the description of conserved structural features of
polyphenol oxidases).
[0029] The amino acid sequence of a considerable number of PPOs, on
plants, fungi and other organisms derived from cloning of the
enzyme, has now been published and many of the reports and reviews
give such comparative information, e.g. van Gelder et al. (1997)
Phytochemistry 45 :1309-1323; Wichers et al. (2003) Appl.
Microbiol. Biotechnol. 61:336-341; Cho et al. (2003) Proc. Nat.
Acad. Sci. USA 100:10641-10646; Marusek et al. (2006) J Inorg
Biochem. 100(1):108-23; Halaouili et al. (2006) J. Appl. Microbiol.
100:219-232; Hernandez-Romero et al. (2006) FEBS J. 273:257-270;
Nakamura et al. (2000) Biochem. J. 350:537-545; and, Matoba et al.
(2006) J. Biol. Chem. 281:8981-8990, each of which is herein
incorporated by reference in its entirety. Polyphenol oxidase
enzymes have been isolated from mammals, birds, fish, insects,
reptiles, amphibians, fungi and bacteria.
[0030] Polyphenol oxidase exists in certain species as a zymogen or
propolyphenol oxidase form, and proteases are also believed to be
involved in the activation of the propolyphenol oxidase form. These
proteases are thought to be induced by microbial activity, and also
suggests that these enzymes can be activated by a host protease
following an infection or invasion event. Secondary metabolites,
such as glucans, glycoproteins, laminarins, lipopolysaccharides,
etc., produced by organisms may also induce the activation of
propolyphenol oxidase by proteases. These metabolites are also
capable of activating the propolyphenol oxidase even in the absence
of proteolytic activity.
[0031] In various plant species, polyphenol oxidase genes are
encoded within the nucleus and undergo translation within the
cytoplasm. Once formed, propolyphenol oxidase is transported to the
chloroplast where it undergoes proteolytic cleavage, to produce the
active polyphenol oxidase form (Vaughn et al., 1988, Physiol.
Plant., 72: 659-665).
[0032] Monophenol Monooxygenases
[0033] Monophenol monooxygenase (EC 1.14.18.1; CAS number:
9002-10-2) catalyses the hydroxylation of monophenols to
o-diphenols. The enzyme is referred to as tyrosinase in animals,
since L-tyrosine is the major monophenolic substrate. Tyrosine, on
the other hand, which is a monohydroxy phenol, is an important
amino acid. Hydroxylation of tyrosine leads to the formation of
dihydroxyphenylalanine (DOPA).
[0034] In plants, the enzyme is sometimes referred to as cresolase
owing to the ability of the enzyme to utilize the monophenolic
substrate, cresol. Monophenol monooxygenase is also known as
monophenol monooxidase, dopa oxidase, phenol oxidase,
phenoloxidase, phenoloxidase A, phenoloxidase B, and
tyrosinase.
[0035] Crystallographic structure of a Streptomyces derived
tyrosinase in complex with a so called "caddie protein" is
described in Matoba et al (2006) J. Biol. Chem. 281(13):8981-8990,
which is herein incorporated by reference in its entirety.
[0036] Diphenol Oxidases
[0037] Diphenol oxidase (EC 1.10.3.1; CAS number: 9002-10-2) is an
enzyme that catalyses the oxidation of phenols such as catechol.
Diphenol oxidases are also known as catechol oxidase, polyphenol
oxidase, and polyphenoloxidase. Diphenol oxidase carries out the
oxidation of phenols such as catechol, using dioxygen (O2). In the
presence of catechol, benzoquinone is formed. Hydrogens removed
from catechol combine with oxygen to form water.
[0038] Catechol oxidase is a copper-containing enzyme whose
activity is similar to that of tyrosinase, a related class of
copper oxidases.
[0039] Laccase (p-diphenol oxidase, E.C. 1.10.3.2) (DPO) is a type
of copper-containing polyphenol oxidase. It has the unique ability
of oxidizing p-diphenols, thus allowing it to be distinguished from
o-diphenol oxidases such as catechol oxidase. Several phenolic
substrates, including polyphenols, methoxy-substituted phenols,
diamines and a considerable range of other compounds serve as
substrates for laccase (Thurston, 1994, Microbiology, 140: 19-26).
Laccases occur in many phytopathogenic fungi and in certain higher
plants (Mayer and Harel, 1991, Phenoloxidase and their significance
in fruit and vegetables. In P. F. Fx, ed. Food Enzymology, p. 373.
London, Elsevier).
Isolated Nucleic Acid Molecules, and Variants and Fragments
Thereof
[0040] One aspect of the invention pertains to isolated or
recombinant nucleic acid molecules comprising nucleotide sequences
encoding nematicidal proteins and polypeptides or biologically
active portions thereof, as well as nucleic acid molecules
sufficient for use as hybridization probes to identify nucleic acid
molecules encoding proteins with regions of sequence homology. As
used herein, the term "nucleic acid molecule" is intended to
include DNA molecules (e.g., recombinant DNA, cDNA or genomic DNA)
and RNA molecules (e.g., mRNA) and analogs of the DNA or RNA
generated using nucleotide analogs. The nucleic acid molecule can
be single-stranded or double-stranded, but preferably is
double-stranded DNA.
[0041] An "isolated" or "purified" nucleic acid molecule or
protein, or biologically active portion thereof, is substantially
free of other cellular material, or culture medium when produced by
recombinant techniques, or substantially free of chemical
precursors or other chemicals when chemically synthesized.
Preferably, an "isolated" nucleic acid is free of sequences
(preferably protein encoding sequences) that naturally flank the
nucleic acid (i.e., sequences located at the 5' and 3' ends of the
nucleic acid) in the genomic DNA of the organism from which the
nucleic acid is derived. For purposes of the invention, "isolated"
when used to refer to nucleic acid molecules excludes isolated
chromosomes. For example, in various embodiments, the isolated
nucleic acid molecule encoding a nematicidal protein can contain
less than about 5 kb, 4 kb, 3 kb, 2 kb, 1 kb, 0.5 kb, or 0.1 kb of
nucleotide sequences that naturally flank the nucleic acid molecule
in genomic DNA of the cell from which the nucleic acid is derived.
A nematicidal protein that is substantially free of cellular
material includes preparations of protein having less than about
30%, 20%, 10%, or 5% (by dry weight) of non-nematicidal protein
(also referred to herein as a "contaminating protein").
[0042] Nucleotide sequences encoding the proteins of the present
invention include the sequence set forth in SEQ ID NO:1, 2, 3, 6,
7, 10, 11, 12, 15, 16, 17, 19, 21, 45, or 46, and variants,
fragments, and complements thereof. By "complement" is intended a
nucleotide sequence that is sufficiently complementary to a given
nucleotide sequence such that it can hybridize to the given
nucleotide sequence to thereby form a stable duplex. The
corresponding amino acid sequence for the nematicidal protein
encoded by this nucleotide sequence are set forth in SEQ ID NO:4,
5, 8, 9, 13, 14, 18, 20, 22, 47, 48, or 49.
[0043] Nucleic acid molecules that are fragments of these
nucleotide sequences encoding nematicidal proteins are also
encompassed by the present invention. By "fragment" is intended a
portion of the nucleotide sequence encoding a nematicidal protein.
A fragment of a nucleotide sequence may encode a biologically
active portion of a nematicidal protein, or it may be a fragment
that can be used as a hybridization probe or PCR primer using
methods disclosed below. Nucleic acid molecules that are fragments
of a nucleotide sequence encoding a nematicidal protein comprise at
least about 50, 100, 200, 300, 400, 500, 600, 700, 800, 900, 1000,
1100, 1200, 1300, 1350, 1400 contiguous nucleotides, or up to the
number of nucleotides present in a full-length nucleotide sequence
encoding a nematicidal protein disclosed herein, depending upon the
intended use. By "contiguous" nucleotides is intended nucleotide
residues that are immediately adjacent to one another. Fragments of
the nucleotide sequences of the present invention will encode
protein fragments that retain the biological activity of the
nematicidal protein and, hence, retain nematicidal and polyphenol
oxidase activity. By "retains activity" is intended that the
fragment will have at least about 30%, at least about 50%, at least
about 70%, 80%, 90%, 95% or higher of the nematicidal and/or
polyphenol oxidase activity of the reference protein protein.
[0044] Methods for measuring nematode resistance or nematicidal
activity are described in, for example, U.S. Patent Publication
Nos. 20050191714 and 20080153102, as well as in the Experimental
Examples provided herein. Methods for measuring polyphenol oxidase
activity include, for example, detecting the presence of o-quinone
produced in an enzymatic reaction of the polyphenol oxidase on
tyrosine. Polyphenol oxidase oxidizes tyrosine which, in turn, is
oxidized to o-quinone. The latter is accompanied by an increase in
absorbance at 280 nm. The rate of increase is proportional to
enzyme concentration and is linear during a period of 5-10 minutes
after an initial lag. One unit causes a change in absorbance at 280
nm of 0.001 per minute at 25.degree. C., pH 6.5 under the specified
conditions.
[0045] A fragment of a nucleotide sequence encoding a nematicidal
protein that encodes a biologically active portion of a protein of
the invention will encode at least about 15, 25, 30, 50, 75, 100,
125, 150, 175, 200, 250, 300, 350, 400, 450 contiguous amino acids,
or up to the total number of amino acids present in a full-length
nematicidal protein of the invention.
[0046] Preferred nematicidal proteins of the present invention are
encoded by a nucleotide sequence sufficiently identical to the
nucleotide sequence of SEQ ID NO:1, 2, 3, 6, 7, 10, 11, 12, 15, 16,
17, 19, 21, 45, or 46, or a nucleotide sequence encoding an amino
acid sufficiently identical to SEQ ID NO:4, 5, 8, 9, 13, 14, 18,
20, 22, 47, 48, or 49. By "sufficiently identical" is intended an
amino acid or nucleotide sequence that has at least about 60% or
65% sequence identity, about 70% or 75% sequence identity, about
80% or 85% sequence identity, about 90%, 91%, 92%, 93%, 94%, 95%,
96%, 97%, 98%, 99% or greater sequence identity compared to a
reference sequence using one of the alignment programs described
herein using standard parameters. One of skill in the art will
recognize that these values can be appropriately adjusted to
determine corresponding identity of proteins encoded by two
nucleotide sequences by taking into account codon degeneracy, amino
acid similarity, reading frame positioning, and the like.
[0047] To determine the percent identity of two amino acid
sequences or of two nucleic acids, the sequences are aligned for
optimal comparison purposes. The percent identity between the two
sequences is a function of the number of identical positions shared
by the sequences (i.e., percent identity=number of identical
positions/total number of positions (e.g., overlapping
positions).times.100). In one embodiment, the two sequences are the
same length. In another embodiment, the percent identity is
calculated across the entirety of the reference sequence (i.e., the
sequence disclosed herein as any of SEQ ID NO:1-22, and 45-49). The
percent identity between two sequences can be determined using
techniques similar to those described below, with or without
allowing gaps. In calculating percent identity, typically exact
matches are counted.
[0048] The determination of percent identity between two sequences
can be accomplished using a mathematical algorithm. A nonlimiting
example of a mathematical algorithm utilized for the comparison of
two sequences is the algorithm of Karlin and Altschul (1990) Proc.
Natl. Acad. Sci. USA 87:2264, modified as in Karlin and Altschul
(1993) Proc. Natl. Acad. Sci. USA 90:5873-5877. Such an algorithm
is incorporated into the BLASTN and BLASTX programs of Altschul et
al. (1990) J. Mol. Biol. 215:403. BLAST nucleotide searches can be
performed with the BLASTN program, score=100, wordlength=12, to
obtain nucleotide sequences homologous to the nucleic acid
molecules of the invention. BLAST protein searches can be performed
with the BLASTX program, score=50, wordlength=3, to obtain amino
acid sequences homologous to protein molecules of the invention. To
obtain gapped alignments for comparison purposes, Gapped BLAST (in
BLAST 2.0) can be utilized as described in Altschul et al. (1997)
Nucleic Acids Res. 25:3389. Alternatively, PSI-Blast can be used to
perform an iterated search that detects distant relationships
between molecules. See Altschul et al. (1997) supra. When utilizing
BLAST, Gapped BLAST, and PSI-Blast programs, the default parameters
of the respective programs (e.g., BLASTX and BLASTN) can be used.
Alignment may also be performed manually by inspection.
[0049] Another non-limiting example of a mathematical algorithm
utilized for the comparison of sequences is the ClustalW algorithm
(Higgins et al. (1994) Nucleic Acids Res. 22:4673-4680). ClustalW
compares sequences and aligns the entirety of the amino acid or DNA
sequence, and thus can provide data about the sequence conservation
of the entire amino acid sequence. The ClustalW algorithm is used
in several commercially available DNA/amino acid analysis software
packages, such as the ALIGNX module of the Vector NTI Program Suite
(Invitrogen Corporation, Carlsbad, Calif.). After alignment of
amino acid sequences with ClustalW, the percent amino acid identity
can be assessed. A non-limiting example of a software program
useful for analysis of ClustalW alignments is GENEDOC.TM..
GENEDOC.TM. (Karl Nicholas) allows assessment of amino acid (or
DNA) similarity and identity between multiple proteins. Another
non-limiting example of a mathematical algorithm utilized for the
comparison of sequences is the algorithm of Myers and Miller (1988)
CABIOS 4:11-17. Such an algorithm is incorporated into the ALIGN
program (version 2.0), which is part of the GCG Wisconsin Genetics
Software Package, Version 10 (available from Accelrys, Inc., 9685
Scranton Rd., San Diego, Calif., USA). When utilizing the ALIGN
program for comparing amino acid sequences, a PAM120 weight residue
table, a gap length penalty of 12, and a gap penalty of 4 can be
used.
[0050] Unless otherwise stated, GAP Version 10, which uses the
algorithm of Needleman and Wunsch (1970) J. Mol. Biol.
48(3):443-453, will be used to determine sequence identity or
similarity using the following parameters: % identity and %
similarity for a nucleotide sequence using GAP Weight of 50 and
Length Weight of 3, and the nwsgapdna.cmp scoring matrix; %
identity or % similarity for an amino acid sequence using GAP
weight of 8 and length weight of 2, and the BLOSUM62 scoring
program. Equivalent programs may also be used. By "equivalent
program" is intended any sequence comparison program that, for any
two sequences in question, generates an alignment having identical
nucleotide residue matches and an identical percent sequence
identity when compared to the corresponding alignment generated by
GAP Version 10.
[0051] The invention also encompasses variant nucleic acid
molecules. "Variants" of the nematicidal protein encoding
nucleotide sequences include those sequences that encode the
nematicidal proteins disclosed herein but that differ
conservatively because of the degeneracy of the genetic code as
well as those that are sufficiently identical as discussed above.
Naturally occurring allelic variants can be identified with the use
of well-known molecular biology techniques, such as polymerase
chain reaction (PCR) and hybridization techniques as outlined
below. Variant nucleotide sequences also include synthetically
derived nucleotide sequences that have been generated, for example,
by using site-directed mutagenesis but which still encode the
nematicidal proteins disclosed in the present invention as
discussed below. Variant proteins encompassed by the present
invention are biologically active, that is they continue to possess
the desired biological activity of the native protein, that is,
polyphenol oxidase and/or nematicidal activity. By "retains
activity" is intended that the variant will have at least about
30%, at least about 50%, at least about 70%, or at least about 80%
of the nematicidal activity and/or the polyphenol oxidase activity
of the reference protein. One of skill in the art will recognize
that variants may have an increase or decrease in one activity
(e.g., nematicidal or polyphenol oxidase) without affecting, or
only minimally affecting, the other activity. For example, variants
proteins may show improved nematicidal activity relative to the
native protein without concomitant improvements in polyphenol
oxidase activity and vice versa. Unless otherwise specified,
variants proteins will have at least 30% of each activity relative
to the native protein. Methods for measuring these activities are
described elsewhere herein.
[0052] The skilled artisan will further appreciate that changes can
be introduced by mutation of the nucleotide sequences of the
invention thereby leading to changes in the amino acid sequence of
the encoded nematicidal proteins, without altering the biological
activity of the proteins. Thus, variant isolated nucleic acid
molecules can be created by introducing one or more nucleotide
substitutions, additions, or deletions into the corresponding
nucleotide sequence disclosed herein, such that one or more amino
acid substitutions, additions or deletions are introduced into the
encoded protein. Mutations can be introduced by standard
techniques, such as site-directed mutagenesis and PCR-mediated
mutagenesis. Such variant nucleotide sequences are also encompassed
by the present invention.
[0053] For example, conservative amino acid substitutions may be
made at one or more, predicted, nonessential amino acid residues. A
"nonessential" amino acid residue is a residue that can be altered
from the wild-type sequence of a nematicidal protein without
altering the biological activity, whereas an "essential" amino acid
residue is required for biological activity. A "conservative amino
acid substitution" is one in which the amino acid residue is
replaced with an amino acid residue having a similar side chain.
Families of amino acid residues having similar side chains have
been defined in the art. These families include amino acids with
basic side chains (e.g., lysine, arginine, histidine), acidic side
chains (e.g., aspartic acid, glutamic acid), uncharged polar side
chains (e.g., glycine, asparagine, glutamine, serine, threonine,
tyrosine, cysteine), nonpolar side chains (e.g., alanine, valine,
leucine, isoleucine, proline, phenylalanine, methionine,
tryptophan), beta-branched side chains (e.g., threonine, valine,
isoleucine) and aromatic side chains (e.g., tyrosine,
phenylalanine, tryptophan, histidine).
[0054] Amino acid substitutions may be made in nonconserved regions
that retain function. In general, such substitutions would not be
made for conserved amino acid residues, or for amino acid residues
residing within a conserved motif, where such residues are
essential for protein activity. Examples of residues that are
conserved and that may be essential for protein activity include,
for example, residues that are identical between all proteins
contained in an alignment of similar or related toxins to the
sequences of the invention (e.g., residues that are identical
between all proteins contained in the alignment in FIGS. 3 and 4).
Examples of residues that are conserved but that may allow
conservative amino acid substitutions and still retain activity
include, for example, residues that have only conservative
substitutions between all proteins contained in an alignment of
similar or related toxins to the sequences of the invention (e.g.,
residues that have only conservative substitutions between all
proteins contained in the alignment in FIGS. 3, 4, and 5). However,
one of skill in the art would understand that functional variants
may have minor conserved or nonconserved alterations in the
conserved residues.
[0055] Alternatively, variant nucleotide sequences can be made by
introducing mutations randomly along all or part of the coding
sequence, such as by saturation mutagenesis, and the resultant
mutants can be screened for ability to confer nematicidal activity
to identify mutants that retain activity. Following mutagenesis,
the encoded protein can be expressed recombinantly, and the
activity of the protein can be determined using standard assay
techniques.
[0056] Using methods such as PCR, hybridization, and the like
corresponding nematicidal sequences can be identified, such
sequences having substantial identity to the sequences of the
invention. See, for example, Sambrook and Russell (2001) Molecular
Cloning: A Laboratory Manual. (Cold Spring Harbor Laboratory Press,
Cold Spring Harbor, N.Y.) and Innis, et al. (1990) PCR Protocols: A
Guide to Methods and Applications (Academic Press, NY).
Alternatively, polyphenol oxidase sequences can be identified using
any number of polyphenol oxidase sequences known in the art.
[0057] In a hybridization method, all or part of the nematicidal
nucleotide sequence can be used to screen cDNA or genomic
libraries. Methods for construction of such cDNA and genomic
libraries are generally known in the art and are disclosed in
Sambrook and Russell, 2001, supra. The so-called hybridization
probes may be genomic DNA fragments, cDNA fragments, RNA fragments,
or other oligonucleotides, and may be labeled with a detectable
group such as .sup.32P, or any other detectable marker, such as
other radioisotopes, a fluorescent compound, an enzyme, or an
enzyme co-factor. Probes for hybridization can be made by labeling
synthetic oligonucleotides based on the known nematicidal
protein-encoding nucleotide sequence disclosed herein. Degenerate
primers designed on the basis of conserved nucleotides or amino
acid residues in the nucleotide sequence or encoded amino acid
sequence can additionally be used. The probe typically comprises a
region of nucleotide sequence that hybridizes under stringent
conditions to at least about 12, at least about 25, at least about
50, 75, 100, 125, 150, 175, or 200 consecutive nucleotides of
nucleotide sequence encoding a nematicidal protein of the invention
or a fragment or variant thereof. Methods for the preparation of
probes for hybridization are generally known in the art and are
disclosed in Sambrook and Russell, 2001, supra herein incorporated
by reference.
[0058] For example, an entire nematicidal protein sequence
disclosed herein, or one or more portions thereof, may be used as a
probe capable of specifically hybridizing to corresponding
nematicidal protein-like sequences and messenger RNAs. To achieve
specific hybridization under a variety of conditions, such probes
include sequences that are unique and are preferably at least about
10 nucleotides in length, or at least about 20 nucleotides in
length. Such probes may be used to amplify corresponding
nematicidal sequences from a chosen organism by PCR. This technique
may be used to isolate additional coding sequences from a desired
organism or as a diagnostic assay to determine the presence of
coding sequences in an organism. Hybridization techniques include
hybridization screening of plated DNA libraries (either plaques or
colonies; see, for example, Sambrook et al. (1989) Molecular
Cloning: A Laboratory Manual (2d ed., Cold Spring Harbor Laboratory
Press, Cold Spring Harbor, N.Y.).
[0059] Hybridization of such sequences may be carried out under
stringent conditions. By "stringent conditions" or "stringent
hybridization conditions" is intended conditions under which a
probe will hybridize to its target sequence to a detectably greater
degree than to other sequences (e.g., at least 2-fold over
background). Stringent conditions are sequence-dependent and will
be different in different circumstances. By controlling the
stringency of the hybridization and/or washing conditions, target
sequences that are 100% complementary to the probe can be
identified (homologous probing). Alternatively, stringency
conditions can be adjusted to allow some mismatching in sequences
so that lower degrees of similarity are detected (heterologous
probing). Generally, a probe is less than about 1000 nucleotides in
length, preferably less than 500 nucleotides in length.
[0060] Typically, stringent conditions will be those in which the
salt concentration is less than about 1.5 M Na ion, typically about
0.01 to 1.0 M Na ion concentration (or other salts) at pH 7.0 to
8.3 and the temperature is at least about 30.degree. C. for short
probes (e.g., 10 to 50 nucleotides) and at least about 60.degree.
C. for long probes (e.g., greater than 50 nucleotides). Stringent
conditions may also be achieved with the addition of destabilizing
agents such as formamide. Exemplary low stringency conditions
include hybridization with a buffer solution of 30 to 35%
formamide, 1 M NaCl, 1% SDS (sodium dodecyl sulphate) at 37.degree.
C., and a wash in 1.times. to 2.times.SSC (20.times.SSC=3.0 M
NaCl/0.3 M trisodium citrate) at 50 to 55.degree. C. Exemplary
moderate stringency conditions include hybridization in 40 to 45%
formamide, 1.0 M NaCl, 1% SDS at 37.degree. C., and a wash in
0.5.times. to 1.times.SSC at 55 to 60.degree. C. Exemplary high
stringency conditions include hybridization in 50% formamide, 1 M
NaCl, 1% SDS at 37.degree. C., and a wash in 0.1.times.SSC at 60 to
65.degree. C. Optionally, wash buffers may comprise about 0.1% to
about 1% SDS. Duration of hybridization is generally less than
about 24 hours, usually about 4 to about 12 hours.
[0061] Specificity is typically the function of post-hybridization
washes, the critical factors being the ionic strength and
temperature of the final wash solution. For DNA-DNA hybrids, the
T.sub.m can be approximated from the equation of Meinkoth and Wahl
(1984) Anal. Biochem. 138:267-284: T.sub.m=81.5.degree. C.+16.6
(log M)+0.41 (% GC)-0.61 (% form)-500/L; where M is the molarity of
monovalent cations, % GC is the percentage of guanosine and
cytosine nucleotides in the DNA, % form is the percentage of
formamide in the hybridization solution, and L is the length of the
hybrid in base pairs. The T.sub.m is the temperature (under defined
ionic strength and pH) at which 50% of a complementary target
sequence hybridizes to a perfectly matched probe. T.sub.m is
reduced by about 1.degree. C. for each 1% of mismatching; thus,
T.sub.m, hybridization, and/or wash conditions can be adjusted to
hybridize to sequences of the desired identity. For example, if
sequences with >90% identity are sought, the T.sub.m can be
decreased 10.degree. C. Generally, stringent conditions are
selected to be about 5.degree. C. lower than the thermal melting
point (T.sub.m) for the specific sequence and its complement at a
defined ionic strength and pH. However, severely stringent
conditions can utilize a hybridization and/or wash at 1, 2, 3, or
4.degree. C. lower than the thermal melting point (T.sub.m);
moderately stringent conditions can utilize a hybridization and/or
wash at 6, 7, 8, 9, or 10.degree. C. lower than the thermal melting
point (T.sub.m); low stringency conditions can utilize a
hybridization and/or wash at 11, 12, 13, 14, 15, or 20.degree. C.
lower than the thermal melting point (T.sub.m). Using the equation,
hybridization and wash compositions, and desired T.sub.m, those of
ordinary skill will understand that variations in the stringency of
hybridization and/or wash solutions are inherently described. If
the desired degree of mismatching results in a T.sub.m of less than
45.degree. C. (aqueous solution) or 32.degree. C. (formamide
solution), it is preferred to increase the SSC concentration so
that a higher temperature can be used. An extensive guide to the
hybridization of nucleic acids is found in Tijssen (1993)
Laboratory Techniques in Biochemistry and Molecular
Biology-Hybridization with Nucleic Acid Probes, Part I, Chapter 2
(Elsevier, New York); and Ausubel et al., eds. (1995) Current
Protocols in Molecular Biology, Chapter 2 (Greene Publishing and
Wiley-Interscience, New York). See Sambrook et al. (1989) Molecular
Cloning: A Laboratory Manual (2d ed., Cold Spring Harbor Laboratory
Press, Cold Spring Harbor, N.Y.).
Isolated Proteins and Variants and Fragments Thereof
[0062] Nematicidal proteins are also encompassed within the present
invention. By "nematicidal protein" is intended a protein having
the amino acid sequence set forth in SEQ ID NO:5, 8, 14, 18, 20,
22, or 48. Fragments, biologically active portions, and variants
thereof are also provided, and may be used to practice the methods
of the present invention.
[0063] "Fragments" or "biologically active portions" include
polypeptide fragments comprising amino acid sequences sufficiently
identical to the amino acid sequence set forth in SEQ ID NO:4, 5,
8, 13, 14, 18, 20, 22, 47, 48, or 49, and that exhibit polyphenol
oxidase and/or nematicidal activity. In some embodiments, the
biologically active fragments exhibit both polyphenol oxidase and
nematicidal activity. A biologically active portion of a
nematicidal protein can be a polypeptide that is, for example, 10,
25, 50, 100, 150, 200, 250 or more amino acids in length. Such
biologically active portions can be prepared by recombinant
techniques and evaluated for nematicidal and/or polyphenol oxidase
activity. Methods for measuring nematicidal activity and polyphenol
oxidase activity are described elsewhere herein. As used herein, a
fragment comprises at least 8 contiguous amino acids of SEQ ID
NO:4, 5, 8, 13, 14, 18, 20, 22, 47, 48, or 49. The invention
encompasses other fragments, however, such as any fragment in the
protein greater than about 10, 20, 30, 50, 100, 150, 200, 250, or
300 contiguous amino acids.
[0064] By "variants" is intended proteins or polypeptides having an
amino acid sequence that is at least about 60%, 65%, about 70%,
75%, about 80%, 85%, about 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%,
98% or 99% identical to the amino acid sequence of SEQ ID NO: 4, 5,
8, 13, 14, 18, 20, 22, 47, 48, or 49. Variants also include
polypeptides encoded by a nucleic acid molecule that hybridizes to
the nucleic acid molecule of SEQ ID NO:1, 2, 3, 6, 7, 10, 12, 15,
16, 17, 19, 21, 45, or 46, or a complement thereof, under stringent
conditions. Variants include polypeptides that differ in amino acid
sequence due to mutagenesis. Variant proteins encompassed by the
present invention are biologically active, that is they continue to
possess the desired biological activity of the native protein, that
is, retaining nematicidal activity and/or polyphenol oxidase
activity. In some embodiments, the variants exhibit both polyphenol
oxidase and nematicidal activity.
[0065] Bacterial genes, such as the some of the novel genes
disclosed herein, quite often possess multiple methionine
initiation codons in proximity to the start of the open reading
frame. Often, translation initiation at one or more of these start
codons will lead to generation of a functional protein. These start
codons can include ATG codons. However, some bacteria also
recognize the codon GTG as a start codon, and proteins that
initiate translation at GTG codons contain a methionine at the
first amino acid. Furthermore, it is not often determined a priori
which of these codons are used naturally in the bacterium. Thus, it
is understood that use of one of the alternate methionine codons
may also lead to generation of nematicidal proteins. These
nematicidal proteins are encompassed in the present invention and
may be used in the methods of the present invention.
[0066] Antibodies to the polypeptides of the present invention, or
to variants or fragments thereof, are also encompassed. Methods for
producing antibodies are well known in the art (see, for example,
Harlow and Lane (1988) Antibodies: A Laboratory Manual, Cold Spring
Harbor Laboratory, Cold Spring Harbor, N.Y.; U.S. Pat. No.
4,196,265).
Altered or Improved Variants
[0067] It is recognized that DNA sequences of a nematicidal protein
may be altered by various methods, and that these alterations may
result in DNA sequences encoding proteins with amino acid sequences
different than that encoded by a nematicidal protein of the present
invention. This protein may be altered in various ways including
amino acid substitutions, deletions, truncations, and insertions of
one or more amino acids of SEQ ID NO:4, 5, 8, 9, 13, 14, 18, 20,
22, 47, 48, or 49, including up to about 2, about 3, about 4, about
5, about 6, about 7, about 8, about 9, about 10, about 15, about
20, about 25, about 30, about 35, about 40, about 45, about 50,
about 55, about 60, about 65, about 70, about 75, about 80, about
85, about 90, about 100, about 105, about 110, about 115, about
120, about 125, about 130, about 135, about 140, about 145, about
150, about 155, or more amino acid substitutions, deletions or
insertions. Methods for such manipulations are generally known in
the art. For example, amino acid sequence variants of a nematicidal
protein can be prepared by mutations in the DNA. This may also be
accomplished by one of several forms of mutagenesis and/or in
directed evolution. In some aspects, the changes encoded in the
amino acid sequence will not substantially affect the function of
the protein. Such variants will possess the desired nematicidal
activity. However, it is understood that the ability of a
nematicidal protein to confer nematicidal activity may be improved
by the use of such techniques upon the compositions of this
invention. For example, one may express a nematicidal protein in
host cells that exhibit high rates of base misincorporation during
DNA replication, such as XL-1 Red (Stratagene, La Jolla, Calif.).
After propagation in such strains, one can isolate the DNA (for
example by preparing plasmid DNA, or by amplifying by PCR and
cloning the resulting PCR fragment into a vector), culture the
nematicidal protein mutations in a non-mutagenic strain, and
identify mutated genes with nematicidal activity, for example by
performing an assay to test for nematicidal activity. Generally,
the protein is mixed and used in feeding assays. See, for example
Marrone et al. (1985) J. of Economic Entomology 78:290-293. Such
assays can include contacting plants with one or more pests and
determining the plant's ability to survive and/or cause the death
of the pests.
[0068] Alternatively, alterations may be made to the protein
sequence of many proteins at the amino or carboxy terminus without
substantially affecting activity. This can include insertions,
deletions, or alterations introduced by modern molecular methods,
such as PCR, including PCR amplifications that alter or extend the
protein coding sequence by virtue of inclusion of amino acid
encoding sequences in the oligonucleotides utilized in the PCR
amplification. Alternatively, the protein sequences added can
include entire protein-coding sequences, such as those used
commonly in the art to generate protein fusions. Such fusion
proteins are often used to (1) increase expression of a protein of
interest (2) introduce a binding domain, enzymatic activity, or
epitope to facilitate either protein purification, protein
detection, or other experimental uses known in the art (3) target
secretion or translation of a protein to a subcellular organelle,
such as the periplasmic space of Gram-negative bacteria, or the
endoplasmic reticulum of eukaryotic cells, the latter of which
often results in glycosylation of the protein.
[0069] Variant nucleotide and amino acid sequences of the present
invention also encompass sequences derived from mutagenic and
recombinogenic procedures such as DNA shuffling. With such a
procedure, one or more different nematicidal protein coding regions
can be used to create a new nematicidal protein possessing the
desired properties. In this manner, libraries of recombinant
polynucleotides are generated from a population of related sequence
polynucleotides comprising sequence regions that have substantial
sequence identity and can be homologously recombined in vitro or in
vivo. For example, using this approach, sequence motifs encoding a
domain of interest may be shuffled between a nematicidal gene of
the invention and other known nematicidal genes to obtain a new
gene coding for a protein with an improved property of interest,
such as an increased insecticidal activity. Strategies for such DNA
shuffling are known in the art. See, for example, Stemmer (1994)
Proc. Natl. Acad. Sci. USA 91:10747-10751; Stemmer (1994) Nature
370:389-391; Crameri et al. (1997) Nature Biotech. 15:436-438;
Moore et al. (1997) J. Mol. Biol. 272:336-347; Zhang et al. (1997)
Proc. Natl. Acad. Sci. USA 94:4504-4509; Crameri et al. (1998)
Nature 391:288-291; and U.S. Pat. Nos. 5,605,793 and 5,837,458.
[0070] Domain swapping or shuffling is another mechanism for
generating altered nematicidal proteins. Domains may be swapped
between nematicidal proteins, resulting in hybrid or chimeric
toxins with improved nematicidal activity or target spectrum.
Methods for generating recombinant proteins and testing them for
nematicidal activity are well known in the art (see, for example,
Naimov et al. (2001) Appl. Environ. Microbiol. 67:5328-5330; de
Maagd et al. (1996) Appl. Environ. Microbiol. 62:1537-1543; Ge et
al. (1991) J. Biol. Chem. 266:17954-17958; Schnepf et al. (1990) J.
Biol. Chem. 265:20923-20930; Rang et al. 91999) Appl. Environ.
Microbiol. 65:2918-2925).
Protease Cleavage Site Manipulation
[0071] In various embodiments of the present invention, a
nucleotide sequence encoding a cleavage fragment of the full-length
polyphenol oxidase is expressed in the host cell of interest. In
other embodiments, the nucleotide sequences encoding the polyphenol
oxidase sequences are modified to add or remove sequences encoding
proteolytic cleavage sites. For example, some full-length
polyphenol oxidases, such as AXN-1 and AXN-8, are inactive
precursors, which require proteolytic truncation to yield a toxin
that is activate against SCN. For instance, full-length AXN-8
expressed in E. coli is not active against SCN, but when it is
treated with trypsin, a C-terminal portion of the protein is
removed, yielding an active truncated protein. When AXN-8 was
expressed in E. coli in the truncated form, SCN activity was not
seen, suggesting that the entire sequence may be needed in order
for the protein to fold properly when it is synthesized.
Furthermore, while not being bound by any particular theory or
mechanism, it is also possible that an active polyphenol oxidase
may catalyze the production of compounds that could be toxic to the
plant or to animals (other than the pest of interest, e.g.,
nematodes) that feed on the plant. Expression of a full-length
inactive protein would prevent this from occurring until the enzyme
is activated by proteolytic truncation. This activation would only
occur when a nematode infects the plant, and only in the area where
the nematode is located. Once the nematode is killed by the toxin,
no further active polyphenol oxidase will be produced because no
more proteases are being produced by the nematode.
[0072] If an inactive full-length protein is expressed in a plant
for either of the reasons described above, then it must be
proteolytically truncated in order to show toxicity against SCN or
other plant-parasitic nematodes. It is possible that plant
proteases will carry out the activation to at least some extent,
but more complete activation could be achieved if proteases
produced by the nematode are capable of truncating the protein. If
it is desirable to have the polyphenol oxidase remain inactive
until a nematode infects the plant (for example, as a way of
preventing the catalysis of chemical reactions that might produce
compounds toxic to the plant or to non-target organisms), then any
truncation site naturally occurring in the protein that is capable
of being cleaved by plant proteases can be mutated so that it will
no longer be cleaved. In either case, the sequence of the
polyphenol oxidase can be modified (or further modified) such that
it contains a recognition site for nematode proteases at the
appropriate truncation location. This location can be determined by
sequence analysis of active toxin isolated from its natural source,
or by sequence analysis of active toxin produced by treating the
full-length protein with proteases capable of carrying out the
truncation, such as trypsin in the case of AXN-8. The choice of the
protease recognition site will depend on the proteases that are
secreted by the nematode into the plant, or that are present within
the nematode digestive system. This site can be determined by
isolating proteases and determining their substrate specificity, or
by sequencing genes from the nematode or from a cDNA library
prepared from mRNA extracted from the nematode, and determining to
which protease families the genes belong. A secreted protease will
activate the toxin in the plant, while a protease in the nematode's
digestive system would activate the toxin after it is ingested.
[0073] Esophageal gland cells from soybean cyst nematode have been
shown to express a putative cysteine proteinase (Genbank accession
AF345792). This proteinase falls into the Peptidase C13 family,
which consists of asparaginyl cysteine endopeptidases (proteases
that cleave specifically after asparagines residues). In one
example of this invention, a polyphenol oxidase expressed in a
transgenic plant could be rendered activatable by SCN by altering
the sequence of the polyphenol oxidase such that it contains an
asparagine residue at the truncation site that results in an active
enzyme. While not bound by any particular theory or mechanism, this
version of the polyphenol oxidase might give greater activity than
the wild-type enzyme because it would be fully activated in the
presence of SCN. Furthermore, it might remain inactive in the
absence of SCN, thereby avoiding the accumulation of chemical
products of reactions catalyzed by the enzyme. If a recognition
site for plant proteases is present in the protein, it can be
mutated so that only the nematode proteases are capable of carrying
out the truncation. A similar approach can be taken for any target
pest. The truncation site of the polyphenol oxidase can be modified
so that it will be susceptible to truncation by proteases produced
by the target pest.
Vectors
[0074] A polyphenol oxidase sequence of the invention (or any other
polyphenol oxidase sequences known in the art) may be provided in
an expression cassette for expression in a plant of interest. In
various embodiments, the polyphenol oxidase sequence is selected
from any polyphenol oxidase known in the art. In another
embodiment, the polyphenol oxidase is selected from the polyphenol
oxidase derived from Trichoderma reesei, Bacillus thuringiensis,
Glycine Max, Zea maize, Streptomyces castaneoglobisporus,
Neurospora crassa species.
[0075] By "plant expression cassette" is intended a DNA construct
that is capable of resulting in the expression of a protein from an
open reading frame in a plant cell. Typically these contain a
promoter and a coding sequence. Often, such constructs will also
contain a 3' untranslated region. Such constructs may contain a
"signal sequence" or "leader sequence" to facilitate
co-translational or post-translational transport of the peptide to
certain intracellular structures such as the chloroplast (or other
plastid), endoplasmic reticulum, or Golgi apparatus.
[0076] By "signal sequence" is intended a sequence that is known or
suspected to result in cotranslational or post-translational
peptide transport across the cell membrane. In eukaryotes, this
typically involves secretion into the Golgi apparatus, with some
resulting glycosylation. Pesticidal toxins of bacteria are often
synthesized as protoxins, which are protolytically activated in the
gut of the target pest (Chang (1987) Methods Enzymol. 153:507-516).
In some embodiments of the present invention, the signal sequence
is located in the native sequence, or may be derived from a
sequence of the invention. By "leader sequence" is intended any
sequence that when translated, results in an amino acid sequence
sufficient to trigger co-translational transport of the peptide
chain to a subcellular organelle. Thus, this includes leader
sequences targeting transport and/or glycosylation by passage into
the endoplasmic reticulum, passage to vacuoles, plastids including
chloroplasts, mitochondria, and the like.
[0077] By "plant transformation vector" is intended a DNA molecule
that is necessary for efficient transformation of a plant cell.
Such a molecule may consist of one or more plant expression
cassettes, and may be organized into more than one "vector" DNA
molecule. For example, binary vectors are plant transformation
vectors that utilize two non-contiguous DNA vectors to encode all
requisite cis- and trans-acting functions for transformation of
plant cells (Hellens and Mullineaux (2000) Trends in Plant Science
5:446-451). "Vector" refers to a nucleic acid construct designed
for transfer between different host cells. "Expression vector"
refers to a vector that has the ability to incorporate, integrate
and express heterologous DNA sequences or fragments in a foreign
cell. The cassette will include 5' and 3' regulatory sequences
operably linked to a sequence of the invention. By "operably
linked" is intended a functional linkage between a promoter and a
second sequence, wherein the promoter sequence initiates and
mediates transcription of the DNA sequence corresponding to the
second sequence. Generally, operably linked means that the nucleic
acid sequences being linked are contiguous and, where necessary to
join two protein coding regions, contiguous and in the same reading
frame. The cassette may additionally contain at least one
additional gene to be cotransformed into the organism.
Alternatively, the additional gene(s) can be provided on multiple
expression cassettes. In various embodiments, the invention
encompasses host cells comprising the insert of the vectors. By
"insert of the vectors" is intended the DNA sequence comprising the
gene(s) of the invention that is integrated into the host cell
genome.
[0078] "Promoter" refers to a nucleic acid sequence that functions
to direct transcription of a downstream coding sequence. The
promoter together with other transcriptional and translational
regulatory nucleic acid sequences (also termed "control sequences")
are necessary for the expression of a DNA sequence of interest. The
promoters may be constitutive or inducible, or may be functional
only in certain plant parts. In various embodiments, the promoter
is a root-specific promoter (e.g., FaRB7, Vaughan (2006) J. Exp.
Bot. 57:3901-3910). In some embodiments, the promoter is a feeding
site specific promoter (e.g., TobRB7, Opperman (1994) Science
263(5144) 221-223).
[0079] Such an expression cassette is provided with a plurality of
restriction sites for insertion of the nematicidal sequence to be
under the transcriptional regulation of the regulatory regions.
[0080] The expression cassette will include in the 5'-3' direction
of transcription, a transcriptional and translational initiation
region (i.e., a promoter), a DNA sequence of the invention, and a
translational and transcriptional termination region (i.e.,
termination region) functional in plants. The promoter may be
native or analogous, or foreign or heterologous, to the plant host
and/or to the DNA sequence of the invention. Additionally, the
promoter may be the natural sequence or alternatively a synthetic
sequence. Where the promoter is "native" or "homologous" to the
plant host, it is intended that the promoter is found in the native
plant into which the promoter is introduced. Where the promoter is
"foreign" or "heterologous" to the DNA sequence of the invention,
it is intended that the promoter is not the native or naturally
occurring promoter for the operably linked DNA sequence of the
invention.
[0081] The termination region may be native with the
transcriptional initiation region, may be native with the operably
linked DNA sequence of interest, may be native with the plant host,
or may be derived from another source (i.e., foreign or
heterologous to the promoter, the DNA sequence of interest, the
plant host, or any combination thereof). Convenient termination
regions are available from the Ti-plasmid of A. tumefaciens, such
as the octopine synthase and nopaline synthase termination regions.
See also Guerineau et al. (1991) Mol. Gen. Genet. 262:141-144;
Proudfoot (1991) Cell 64:671-674; Sanfacon et al. (1991) Genes Dev.
5:141-149; Mogen et al. (1990) Plant Cell 2:1261-1272; Munroe et
al. (1990) Gene 91:151-158; Ballas et al. (1989) Nucleic Acids Res.
17:7891-7903; and Joshi et al. (1987) Nucleic Acid Res.
15:9627-9639.
[0082] Where appropriate, the gene(s) may be optimized for
increased expression in the transformed host cell. That is, the
genes can be synthesized using host cell-preferred codons for
improved expression, or may be synthesized using codons at a
host-preferred codon usage frequency. Generally, the GC content of
the gene will be increased. See, for example, Campbell and Gowri
(1990) Plant Physiol. 92:1-11 for a discussion of host-preferred
codon usage. Methods are available in the art for synthesizing
plant-preferred genes. See, for example, U.S. Pat. Nos. 5,380,831,
and 5,436,391, and Murray et al. (1989) Nucleic Acids Res.
17:477-498, herein incorporated by reference.
[0083] In one embodiment, the protein is targeted to the
chloroplast for expression. In this manner, where the protein is
not directly inserted into the chloroplast, the expression cassette
will additionally contain a nucleic acid encoding a transit peptide
to direct the protein to the chloroplasts. Such transit peptides
are known in the art. See, for example, Von Heijne et al. (1991)
Plant Mol. Biol. Rep. 9:104-126; Clark et al. (1989) J. Biol. Chem.
264:17544-17550; Della-Cioppa et al. (1987) Plant Physiol.
84:965-968; Romer et al. (1993) Biochem. Biophys. Res. Commun.
196:1414-1421; and Shah et al. (1986) Science 233:478-481.
[0084] The gene to be targeted to the chloroplast may be optimized
for expression in the chloroplast to account for differences in
codon usage between the plant nucleus and this organelle. In this
manner, the nucleic acids of interest may be synthesized using
chloroplast-preferred codons. See, for example, U.S. Pat. No.
5,380,831, herein incorporated by reference.
Plant Transformation
[0085] Methods of the invention involve introducing a nucleotide
construct into a plant. The methods comprise introducing at least
one nucleotide sequence encoding a heterologous polyphenol oxidase
enzyme into at least one plant cell. In various embodiments, the
polyphenol oxidase is derived from a plant. In other embodiments,
the polyphenol oxidase is derived from a non-plant organism (e.g.,
fungal, algal, bacterial, or other non-plant microorganism). The
polyphenol oxidase may be a monophenol oxidase or a diphenol
oxidase. In various embodiments, the polyphenol oxidase is selected
from any of SEQ ID NO:1-22 or 45-49, or any of the polyphenol
oxidases referenced in Table 13.
[0086] By "introducing" is intended to present to the plant the
nucleotide construct in such a manner that the construct gains
access to the interior of a cell of the plant. The methods of the
invention do not require that a particular method for introducing a
nucleotide construct to a plant is used, only that the nucleotide
construct gains access to the interior of at least one cell of the
plant. Methods for introducing nucleotide constructs into plants
are known in the art including, but not limited to, stable
transformation methods, transient transformation methods, and
virus-mediated methods.
[0087] By "plant" is intended whole plants, plant organs (e.g.,
leaves, stems, roots, etc.), seeds, plant cells, propagules,
embryos and progeny of the same. Plant cells can be differentiated
or undifferentiated (e.g. callus, suspension culture cells,
protoplasts, leaf cells, root cells, phloem cells, pollen).
[0088] "Transgenic plants" or "transformed plants" or "stably
transformed" plants or cells or tissues refers to plants that have
incorporated or integrated exogenous nucleic acid sequences or DNA
fragments into the plant cell. These nucleic acid sequences include
those that are exogenous, or not present in the untransformed plant
cell. "Heterologous" refers to the nucleic acid sequences that are
not endogenous to the cell or part of the native genome in which
they are present, and have been added to the cell by infection,
transfection, microinjection, electroporation, microprojection, or
the like.
[0089] Transformation of plant cells can be accomplished by one of
several techniques known in the art. The polyphenol oxidase genes
described herein may be modified to obtain or enhance expression in
plant cells. Typically a construct that expresses such a protein
would contain a promoter to drive transcription of the gene, as
well as a 3' untranslated region to allow transcription termination
and polyadenylation. The organization of such constructs is well
known in the art. In some instances, it may be useful to engineer
the gene such that the resulting peptide is secreted, or otherwise
targeted within the plant cell. For example, the gene can be
engineered to contain a signal peptide to facilitate transfer of
the peptide to the endoplasmic reticulum. It may also be preferable
to engineer the plant expression cassette to contain an intron,
such that mRNA processing of the intron is required for
expression.
[0090] Typically this "plant expression cassette" will be inserted
into a "plant transformation vector". This plant transformation
vector may be comprised of one or more DNA vectors needed for
achieving plant transformation. For example, it is a common
practice in the art to utilize plant transformation vectors that
are comprised of more than one contiguous DNA segment. These
vectors are often referred to in the art as "binary vectors".
Binary vectors as well as vectors with helper plasmids are most
often used for Agrobacterium-mediated transformation, where the
size and complexity of DNA segments needed to achieve efficient
transformation is quite large, and it is advantageous to separate
functions onto separate DNA molecules. Binary vectors typically
contain a plasmid vector that contains the cis-acting sequences
required for T-DNA transfer (such as left border and right border),
a selectable marker that is engineered to be capable of expression
in a plant cell, and a "gene of interest" (a gene engineered to be
capable of expression in a plant cell for which generation of
transgenic plants is desired). Also present on this plasmid vector
are sequences required for bacterial replication. The cis-acting
sequences are arranged in a fashion to allow efficient transfer
into plant cells and expression therein. For example, the
selectable marker gene and the nematicidal gene are located between
the left and right borders. Often a second plasmid vector contains
the trans-acting factors that mediate T-DNA transfer from
Agrobacterium to plant cells. This plasmid often contains the
virulence functions (Vir genes) that allow infection of plant cells
by Agrobacterium, and transfer of DNA by cleavage at border
sequences and vir-mediated DNA transfer, as is understood in the
art (Hellens and Mullineaux (2000) Trends in Plant Science
5:446-451). Several types of Agrobacterium strains (e.g. LBA4404,
GV3101, EHA101, EHA105, etc.) can be used for plant transformation.
The second plasmid vector is not necessary for transforming the
plants by other methods such as microprojection, microinjection,
electroporation, polyethylene glycol, etc.
[0091] In general, plant transformation methods involve
transferring heterologous DNA into target plant cells (e.g.
immature or mature embryos, suspension cultures, undifferentiated
callus, protoplasts, etc.), followed by applying a maximum
threshold level of appropriate selection (depending on the
selectable marker gene) to recover the transformed plant cells from
a group of untransformed cell mass. Explants are typically
transferred to a fresh supply of the same medium and cultured
routinely. Subsequently, the transformed cells are differentiated
into shoots after placing on regeneration medium supplemented with
a maximum threshold level of selecting agent. The shoots are then
transferred to a selective rooting medium for recovering rooted
shoot or plantlet. The transgenic plantlet then grows into a mature
plant and produces fertile seeds (e.g. Hiei et al. (1994) The Plant
Journal 6:271-282; Ishida et al. (1996) Nature Biotechnology
14:745-750). Explants are typically transferred to a fresh supply
of the same medium and cultured routinely. A general description of
the techniques and methods for generating transgenic plants are
found in Ayres and Park (1994) Critical Reviews in Plant Science
13:219-239 and Bommineni and Jauhar (1997) Maydica 42:107-120.
Since the transformed material contains many cells; both
transformed and non-transformed cells are present in any piece of
subjected target callus or tissue or group of cells. The ability to
kill non-transformed cells and allow transformed cells to
proliferate results in transformed plant cultures. Often, the
ability to remove non-transformed cells is a limitation to rapid
recovery of transformed plant cells and successful generation of
transgenic plants.
[0092] Transformation protocols as well as protocols for
introducing nucleotide sequences into plants may vary depending on
the type of plant or plant cell, i.e., monocot or dicot, targeted
for transformation. Generation of transgenic plants may be
performed by one of several methods, including, but not limited to,
microinjection, electroporation, direct gene transfer, introduction
of heterologous DNA by Agrobacterium into plant cells
(Agrobacterium-mediated transformation), bombardment of plant cells
with heterologous foreign DNA adhered to particles, ballistic
particle acceleration, aerosol beam transformation (U.S. Published
Application No. 20010026941; U.S. Pat. No. 4,945,050; International
Publication No. WO 91/00915; U.S. Published Application No.
2002015066), Lec1 transformation, and various other non-particle
direct-mediated methods to transfer DNA.
[0093] Methods for transformation of chloroplasts are known in the
art. See, for example, Svab et al. (1990) Proc. Natl. Acad. Sci.
USA 87:8526-8530; Svab and Maliga (1993) Proc. Natl. Acad. Sci. USA
90:913-917; Svab and Maliga (1993) EMBO J. 12:601-606. The method
relies on particle gun delivery of DNA containing a selectable
marker and targeting of the DNA to the plastid genome through
homologous recombination. Additionally, plastid transformation can
be accomplished by transactivation of a silent plastid-borne
transgene by tissue-preferred expression of a nuclear-encoded and
plastid-directed RNA polymerase. Such a system has been reported in
McBride et al. (1994) Proc. Natl. Acad. Sci. USA 91:7301-7305.
[0094] Following integration of heterologous foreign DNA into plant
cells, one then applies a maximum threshold level of appropriate
selection in the medium to kill the untransformed cells and
separate and proliferate the putatively transformed cells that
survive from this selection treatment by transferring regularly to
a fresh medium. By continuous passage and challenge with
appropriate selection, one identifies and proliferates the cells
that are transformed with the plasmid vector.
[0095] A number of markers have been developed for use with plant
cells, such as resistance to chloramphenicol, the aminoglycoside
G418, hygromycin, or the like. Other genes that encode a product
involved in chloroplast metabolism may also be used as selectable
markers. For example, genes that provide resistance to plant
herbicides such as glyphosate, bromoxynil, or imidazolinone may
find particular use. Such genes have been reported (Stalker et al.
(1985) J. Biol. Chem. 263:6310-6314 (bromoxynil resistance
nitrilase gene); and Sathasivan et al. (1990) Nucl. Acids Res.
18:2188 (AHAS imidazolinone resistance gene). Additionally, the
genes disclosed herein are useful as markers to assess
transformation of bacterial or plant cells. Molecular and
biochemical methods can then be used to confirm the presence of the
integrated heterologous gene of interest into the genome of the
transgenic plant. Methods for detecting the presence of a transgene
in a plant, plant organ (e.g., leaves, stems, roots, etc.), seed,
plant cell, propagule, embryo or progeny of the same are well known
in the art. In one embodiment, the presence of the transgene is
detected by testing for nematicidal activity. In another
embodiment, the presence of the transgene is detected by testing
for polyphenol oxidase activity.
[0096] The cells that have been transformed may be grown into
plants in accordance with conventional ways. See, for example,
McCormick et al. (1986) Plant Cell Reports 5:81-84. These plants
may then be grown, and either pollinated with the same transformed
strain or different strains, and the resulting hybrid having
constitutive expression of the desired phenotypic characteristic
identified. Two or more generations may be grown to ensure that
expression of the desired phenotypic characteristic is stably
maintained and inherited and then seeds harvested to ensure
expression of the desired phenotypic characteristic has been
achieved. In this manner, the present invention provides
transformed seed (also referred to as "transgenic seed") having a
nucleotide construct of the invention, for example, an expression
cassette of the invention, stably incorporated into their
genome.
[0097] The present invention may be used for transformation of any
plant species, including, but not limited to, monocots and dicots.
Examples of plants of interest include, but are not limited to,
corn (maize), sorghum, wheat, sunflower, tomato, crucifers,
peppers, potato, cotton, rice, soybean, sugarbeet, sugarcane,
tobacco, barley, and oilseed rape, Brassica sp., alfalfa, rye,
millet, safflower, peanuts, sweet potato, cassava, coffee, coconut,
pineapple, citrus trees, cocoa, tea, banana, avocado, fig, guava,
mango, olive, papaya, cashew, macadamia, almond, oats, vegetables,
ornamentals, and conifers.
[0098] Vegetables include, but are not limited to, tomatoes,
lettuce, green beans, lima beans, peas, and members of the genus
Curcumis such as cucumber, cantaloupe, and musk melon. Ornamentals
include, but are not limited to, azalea, hydrangea, hibiscus,
roses, tulips, daffodils, petunias, carnation, poinsettia, and
chrysanthemum. Preferably, plants of the present invention are crop
plants (for example, maize, sorghum, wheat, sunflower, tomato,
crucifers, peppers, potato, cotton, rice, soybean, sugarbeet,
sugarcane, tobacco, barley, oilseed rape, etc.).
Evaluation of Plant Transformation
[0099] Following introduction of heterologous foreign DNA into
plant cells, the transformation or integration of heterologous gene
in the plant genome is confirmed by various methods such as
analysis of nucleic acids, proteins and metabolites associated with
the integrated gene.
[0100] PCR analysis is a rapid method to screen transformed cells,
tissue or shoots for the presence of incorporated gene at the
earlier stage before transplanting into the soil (Sambrook and
Russell (2001) Molecular Cloning: A Laboratory Manual. Cold Spring
Harbor Laboratory Press, Cold Spring Harbor, N.Y.). PCR is carried
out using oligonucleotide primers specific to the gene of interest
or Agrobacterium vector background, etc.
[0101] Plant transformation may be confirmed by Southern blot
analysis of genomic DNA (Sambrook and Russell, 2001, supra). In
general, total DNA is extracted from the transformant, digested
with appropriate restriction enzymes, fractionated in an agarose
gel and transferred to a nitrocellulose or nylon membrane. The
membrane or "blot" is then probed with, for example, radiolabeled
.sup.32P target DNA fragment to confirm the integration of
introduced gene into the plant genome according to standard
techniques (Sambrook and Russell, 2001, supra).
[0102] In Northern blot analysis, RNA is isolated from specific
tissues of transformant, fractionated in a formaldehyde agarose
gel, and blotted onto a nylon filter according to standard
procedures that are routinely used in the art (Sambrook and
Russell, 2001, supra). Expression of RNA encoded by the nematicidal
gene is then tested by hybridizing the filter to a radioactive
probe derived from a nematicidal gene, by methods known in the art
(Sambrook and Russell, 2001, supra).
[0103] Western blot, biochemical assays and the like may be carried
out on the transgenic plants to confirm the presence of protein
encoded by the nematicidal gene by standard procedures (Sambrook
and Russell, 2001, supra) using antibodies that bind to one or more
epitopes present on the nematicidal protein.
Methods for Screening for and Developing Plants with Polyphenol
Oxidase Activity
[0104] Various plant species are known to express polyphenol
oxidase. In some instances, expression of polyphenol oxidase has
been shown to be associated with improved agronomic performance.
For example, plants which exhibit comparably high resistance to
climatic stress have been shown to posses relatively higher
polyphenol oxidase levels than susceptible varieties (Thipyapong et
al. (2007) Molecules 12(8):1569-95). However, prior to the present
invention, resistance to nematode infestation has not been
demonstrated in plants having polyphenol oxidase activity.
Identification of plants having optimal polyphenol oxidase levels
provides a hitherto unrecognized opportunity for developing plants
suitable for cultivation in an area susceptible to nematode
infestation. By "optimal polyphenol oxidase activity" is intended a
level of activity sufficient to bring about death to at least one
pest, or to noticeably reduce pest growth, feeding, or normal
physiological development when the plant expressing the polyphenol
oxidase is exposed to a nematode pest.
[0105] Thus, provided herein are methods for screening a plant or
plant variety for polyphenol oxidase activity. For example, root
extracts from different plants (for example, different inbred
lines, or different progeny of a cross) can be tested for
polyphenol oxidase activity using assays known in the art and
described elsewhere herein. Plants expressing polyphenol oxidase
may be tested for nematicidal activity, and the plants showing
optimal activity selected for use in a field susceptible to
nematode infestation, or used for further breeding for
introgression of the nematode-resistance trait into a plant
population. Identification of a polyphenol oxidase having optimal
activity may be associated with the presence of a polyphenol
oxidase, the relative level of expression or activity of a
polyphenol oxidase, or the presence of a particular polymorphism
associated with improved polyphenol oxidase activity and/or
nematode resistance. The polymorphism may be within the polyphenol
oxidase gene itself, or may be within a genetic marker identified
as being associated with or linked to polyphenol oxidase expression
(i.e., within a Quantitative Trait Loci (QTL) associated with
polyphenol oxidase expression).
[0106] The methods of the invention further contemplate screening
of existing QTLs for nematode resistance for the presence of a
polyphenol oxidase gene or polymorphism. Previous studies have
identified large genetic regions linked as QTLs involved in
nematode resistance, and these regions may contain certain
polyphenol oxidases or tyrosinases. QTLs typically contain many
hundreds if not thousands of genes, yet identification of the
causal gene for the associated trait often remains elusive. Thus,
the invention anticipates screening for polyphenol oxidase genes
(or particular polymorphisms thereof) from such regions. These
genetic elements, or genetic markers closely linked to these
polyphenol oxidase genetic elements, can be used in marker-assisted
breeding protocols to develop plants more resistant to nematode
infestation. Methods for screening a genetic region for a gene of
interest are routine in the art, as are methods for marker-assisted
breeding.
Germplasm Mutagenesis
[0107] Further provided are methods for developing plants with
nematode resistance using germplasm mutagenesis. Mutagenesis is
means of creating genetic diversity that does not exist or has not
been found in existing germplasm. Treating somatic embryos, embryos
derived from culturing portions of immature seeds, with mutagenic
agents can be an efficient method of creating mutations because
they are easier to regenerate into whole plants than cell cultures
and easier to handle in large numbers than seeds. Thus, the methods
encompassed herein comprise mutagenizing a plant germplasm and
screening a component of the plant derived therefrom (for example,
root extracts) for polyphenol oxidase activity. Isolates having
optimal polyphenol oxidase activity can be used to develop a plant
population suitable for cultivation in an area susceptible to
nematode infestation.
[0108] Methods for germplasm mutagenesis are generally known in the
art. Gamma rays are the most frequently used mutagen, but new
agents including ion beams and space condition have also been used
in mutation induction and breeding (Chen et al. (2006) Plant
Mutation Reports Volume 1 Number 1 at
www-naweb.iaea.org/nafa/pbg/public/pmr-01-01.pdf). Use of in vitro
cultures for mutation induction, or use of another culture to
rapidly produce homozygous lines from irradiated progenies, has
proven to be very useful in several laboratories.
Methods for Controlling Nematodes in a Field
[0109] Provided herein are methods for controlling nematodes in a
field susceptible to infestation by one or more plant-parasitic
nematode pests. The methods comprise cultivating a plant in an area
susceptible to plant-parasitic nematode infestation, wherein the
plant expresses a heterologous polyphenol oxidase. An "area" or a
"field" susceptible to infestation includes a geographic region or
planting area that has a detectable level of one or more species of
plant-parasitic nematodes. A "detectable level" includes any level
of plant-parasitic nematodes sufficiently high enough to cause
damage in a susceptible plant. Signs of nematode damage include
stunting and yellowing of leaves, and wilting of the plants during
hot periods. However, some nematodes, including soybean cyst
nematode (SCN), can cause significant yield loss without obvious
above-ground symptoms. In this instance, roots infected with
plant-parasitic nematodes will be dwarfed or stunted compared to
the roots of a plant not infected with nematodes. Various other
macroscopic and microscopic detection methods of different types of
nematodes are known in the art, and are typically available via
local agricultural extension services. An area susceptible to
nematode infestation may also include an area that has a detectable
level of plant-parasitic nematodes in the soil.
Use in Pesticidal Control
[0110] General methods for employing strains comprising a
nucleotide sequence of the present invention, or a variant thereof,
in pest control or in engineering other organisms as pesticidal
agents are known in the art. See, for example U.S. Pat. No.
5,039,523 and EP 0480762A2.
[0111] The bacterial or fungal strains containing the nucleotide
sequence(s) of the present invention, or a variant thereof, or the
microorganisms that have been genetically altered to contain a
nematicidal gene and protein may be used for protecting
agricultural crops and products from pests. In one aspect of the
invention, whole, i.e., unlysed, cells of a toxin-producing
organism are treated with reagents that prolong the activity of the
toxin produced in the cell when the cell is applied to the
environment of target pest(s).
[0112] Alternatively, the pesticide is produced by introducing a
nematicidal gene into a cellular host. Expression of the
nematicidal gene results, directly or indirectly, in the
intracellular production and maintenance of the nematode toxin. In
one aspect of this invention, these cells are then treated under
conditions that prolong the activity of the toxin produced in the
cell when the cell is applied to the environment of target pest(s).
The resulting product retains the toxicity of the toxin. These
naturally encapsulated pesticides may then be formulated in
accordance with conventional techniques for application to the
environment hosting a target pest, e.g., soil, water, root, seed
and/or foliage of plants. See, for example EPA 0192319, and the
references cited therein. In various embodiments, the polyphenol
oxidase may be expressed in a bacterial cell and used as a
probiotic to treat the seed of the plant. Alternatively, one may
formulate the cells expressing a gene of this invention such as to
allow application of the resulting material as a pesticide.
[0113] The active ingredients of the present invention are normally
applied in the form of compositions and can be applied to the crop
area or plant to be treated, simultaneously or in succession, with
other compounds. These compounds can be fertilizers, weed killers,
cryoprotectants, surfactants, detergents, pesticidal soaps, dormant
oils, polymers, and/or time-release or biodegradable carrier
formulations that permit long-term dosing of a target area
following a single application of the formulation. The compounds
can be cofactors or other molecules that enhance the activity of
the polyphenol oxidase enzyme. For example, the compound can be
methyl jasmonate, which has been shown to increase the expression
of polyphenol oxidase genes (see, for example, Constable and Ryan
(1998) Plant Mol. Biol. 36(1):55-62), a phenol such as L-DOPA or
tyrosine, or a substrate capable of participating in polyphenol
oxidase-mediated crosslinking (e.g., tyrosine). These compounds can
be provided to the plants before, during, or after (or any
combination thereof) application of the pesticidal composition.
Where the compound is a polypeptide capable of expression in a
plant, the susceptible plant may be transgenic for this
polypeptide.
[0114] These compounds can also be selective herbicides, chemical
insecticides, virucides, microbicides, amoebicides, pesticides,
fungicides, bacteriocides, nematocides, molluscicides or mixtures
of several of these preparations, if desired, together with further
agriculturally acceptable carriers, surfactants or
application-promoting adjuvants customarily employed in the art of
formulation. Suitable carriers and adjuvants can be solid or liquid
and correspond to the substances ordinarily employed in formulation
technology, e.g. natural or regenerated mineral substances,
solvents, dispersants, wetting agents, tackifiers, binders or
fertilizers. Likewise the formulations may be prepared into edible
"baits" or fashioned into pest "traps" to permit feeding or
ingestion by a target pest of the nematicidal formulation.
[0115] Methods of applying an active ingredient of the present
invention or an agrochemical composition of the present invention
that contains at least one of the nematicidal proteins of the
present invention include leaf application, seed coating and soil
application. The number of applications and the rate of application
depend on the intensity of infestation by the corresponding
pest.
[0116] The composition may be formulated as a powder, dust, pellet,
granule, spray, emulsion, colloid, solution, or such like, and may
be prepared by such conventional means as desiccation,
lyophilization, homogenation, extraction, filtration,
centrifugation, sedimentation, or concentration of a culture of
cells comprising the polypeptide. In all such compositions that
contain at least one such nematicidal polypeptide, the polypeptide
may be present in a concentration of from about 1% to about 99% by
weight.
[0117] Nematode pests may be killed or reduced in numbers in a
given area by the methods of the invention, or may be
prophylactically applied to an environmental area to prevent
infestation by a susceptible pest (i.e., nematode). Preferably the
pest ingests, or is contacted with, a nematicidally-effective
amount of the polypeptide. By "nematicidally-effective amount" is
intended an amount of the pesticide that is able to bring about
death to at least one pest, or to noticeably reduce pest growth,
feeding, or normal physiological development. This amount will vary
depending on such factors as, for example, the specific nematode
species to be controlled, the specific environment, location,
plant, crop, or agricultural site to be treated, the environmental
conditions, and the method, rate, concentration, stability, and
quantity of application of the nematicidally-effective polypeptide
composition. The formulations may also vary with respect to
climatic conditions, environmental considerations, and/or frequency
of application and/or severity of pest infestation.
[0118] The nematicidal compositions described may be made by
formulating either the microbial cell (or extract thereof)
expressing the nematicidal gene of the invention, or isolated
protein component with the desired agriculturally-acceptable
carrier. The compositions may be formulated prior to administration
in an appropriate means such as lyophilized, freeze-dried,
desiccated, or in an aqueous carrier, medium or suitable diluent,
such as saline or other buffer. The formulated compositions may be
in the form of a dust or granular material, or a suspension in oil
(vegetable or mineral), or water or oil/water emulsions, or as a
wettable powder, or in combination with any other carrier material
suitable for agricultural application. Suitable agricultural
carriers can be solid or liquid and are well known in the art. The
term "agriculturally-acceptable carrier" covers all adjuvants,
inert components, dispersants, surfactants, tackifiers, binders,
etc. that are ordinarily used in pesticide formulation technology;
these are well known to those skilled in pesticide formulation. The
formulations may be mixed with one or more solid or liquid
adjuvants and prepared by various means, e.g., by homogeneously
mixing, blending and/or grinding the nematicidal composition with
suitable adjuvants using conventional formulation techniques.
Suitable formulations and application methods are described in U.S.
Pat. No. 6,468,523, herein incorporated by reference.
[0119] In various embodiments, the polyphenol oxidase can be used
to treat or prevent the infestation of plants with insects, fungi,
bacteria, mites, ticks, and the like. Insect pests include insects
selected from the orders Coleoptera, Diptera, Hymenoptera,
Lepidoptera, Mallophaga, Homoptera, Hemiptera, Orthroptera,
Thysanoptera, Dermaptera, Isoptera, Anoplura, Siphonaptera,
Trichoptera, etc., particularly Coleoptera, Lepidoptera, and
Diptera.
[0120] The order Coleoptera includes the suborders Adephaga and
Polyphaga. Suborder Adephaga includes the superfamilies Caraboidea
and Gyrinoidea, while suborder Polyphaga includes the superfamilies
Hydrophiloidea, Staphylinoidea, Cantharoidea, Cleroidea,
Elateroidea, Dascilloidea, Dryopoidea, Byrrhoidea, Cucujoidea,
Meloidea, Mordelloidea, Tenebrionoidea, Bostrichoidea,
Scarabaeoidea, Cerambycoidea, Chrysomeloidea, and Curculionoidea.
Superfamily Caraboidea includes the families Cicindelidae,
Carabidae, and Dytiscidae. Superfamily Gyrinoidea includes the
family Gyrimidae. Superfamily Hydrophiloidea includes the family
Hydrophilidae. Superfamily Staphylinoidea includes the families
Silphidae and Staphylimidae. Superfamily Cantharoidea includes the
families Cantharidae and Lampyridae. Superfamily Cleroidea includes
the families Cleridae and Dermestidae. Superfamily Elateroidea
includes the families Elateridae and Buprestidae. Superfamily
Cucujoidea includes the family Coccinellidae. Superfamily Meloidea
includes the family Meloidae. Superfamily Tenebrionoidea includes
the family Tenebrionidae. Superfamily Scarabaeoidea includes the
families Passalidae and Scarabaeidae. Superfamily Cerambycoidea
includes the family Cerambycidae. Superfamily Chrysomeloidea
includes the family Chrysomelidae. Superfamily Curculionoidea
includes the families Curculionidae and Scolytidae.
[0121] The order Diptera includes the Suborders Nematocera,
Brachycera, and Cyclorrhapha. Suborder Nematocera includes the
families Tipulidae, Psychodidae, Culicidae, Ceratopogonidae,
Chironomidae, Simuliidae, Bibionidae, and Cecidomyiidae. Suborder
Brachycera includes the families Stratiomyidae, Tabanidae,
Therevidae, Asilidae, Mydidae, Bombyliidae, and Dolichopodidae.
Suborder Cyclorrhapha includes the Divisions Aschiza and Aschiza.
Division Aschiza includes the families Phoridae, Syrphidae, and
Conopidae. Division Aschiza includes the Sections Acalyptratae and
Calyptratae. Section Acalyptratae includes the families Otitidae,
Tephritidae, Agromyzidae, and Drosophilidae. Section Calyptratae
includes the families Hippoboscidae, Oestridae, Tachimidae,
Anthomyiidae, Muscidae, Calliphoridae, and Sarcophagidae.
[0122] The order Lepidoptera includes the families Papilionidae,
Pieridae, Lycaenidae, Nymphalidae, Danaidae, Satyridae,
Hesperiidae, Sphingidae, Saturniidae, Geometridae, Arctiidae,
Noctuidae, Lymantriidae, Sesiidae, and Tineidae.
[0123] Insect pests of the invention for the major crops include:
Maize: Ostrinia nubilalis, European corn borer; Agrotis ipsilon,
black cutworm; Helicoverpa zea, corn earworm; Spodoptera
frugiperda, fall armyworm; Diatraea grandiosella, southwestern corn
borer; Elasmopalpus lignosellus, lesser cornstalk borer; Diatraea
saccharalis, surgarcane borer; Diabrotica virgifera, western corn
rootworm; Diabrotica longicornis barberi, northern corn rootworm;
Diabrotica undecimpunctata howardi, southern corn rootworm;
Melanotus spp., wireworms; Cyclocephala borealis, northern masked
chafer (white grub); Cyclocephala immaculata, southern masked
chafer (white grub); Popillia japonica, Japanese beetle;
Chaetocnema pulicaria, corn flea beetle; Sphenophorus maidis, maize
billbug; Rhopalosiphum maidis, corn leaf aphid; Anuraphis
maidiradicis, corn root aphid; Blissus leucopterus leucopterus,
chinch bug; Melanoplus femurrubrum, redlegged grasshopper;
Melanoplus sanguinipes, migratory grasshopper; Hylemya platura,
seedcorn maggot; Agromyza parvicornis, corn blot leafminer;
Anaphothrips obscrurus, grass thrips; Solenopsis milesta, thief
ant; Tetranychus urticae, twospotted spider mite; Sorghum: Chilo
partellus, sorghum borer; Spodoptera frugiperda, fall armyworm;
Helicoverpa zea, corn earworm; Elasmopalpus lignosellus, lesser
cornstalk borer; Feltia subterranea, granulate cutworm; Phyllophaga
crinita, white grub; Eleodes, Conoderus, and Aeolus spp.,
wireworms; Oulema melanopus, cereal leaf beetle; Chaetocnema
pulicaria, corn flea beetle; Sphenophorus maidis, maize billbug;
Rhopalosiphum maidis; corn leaf aphid; Sipha flava, yellow
sugarcane aphid; Blissus leucopterus leucopterus, chinch bug;
Contarinia sorghicola, sorghum midge; Tetranychus cinnabarinus,
carmine spider mite; Tetranychus urticae, twospotted spider mite;
Wheat: Pseudaletia unipunctata, army worm; Spodoptera frugiperda,
fall armyworm; Elasmopalpus lignosellus, lesser cornstalk borer;
Agrotis orthogonia, western cutworm; Elasmopalpus lignosellus,
lesser cornstalk borer; Oulema melanopus, cereal leaf beetle;
Hypera punctata, clover leaf weevil; Diabrotica undecimpunctata
howardi, southern corn rootworm; Russian wheat aphid; Schizaphis
graminum, greenbug; Macrosiphum avenae, English grain aphid;
Melanoplus femurrubrum, redlegged grasshopper; Melanoplus
differentialis, differential grasshopper; Melanoplus sanguinipes,
migratory grasshopper; Mayetiola destructor, Hessian fly;
Sitodiplosis mosellana, wheat midge; Meromyza americana, wheat stem
maggot; Hylemya coarctata, wheat bulb fly; Frankliniella fusca,
tobacco thrips; Cephus cinctus, wheat stem sawfly; Aceria tulipae,
wheat curl mite; Sunflower: Suleima helianthana, sunflower bud
moth; Homoeosoma electellum, sunflower moth; zygogramma
exclamationis, sunflower beetle; Bothyrus gibbosus, carrot beetle;
Neolasioptera murtfeldtiana, sunflower seed midge; Cotton:
Heliothis virescens, cotton budworm; Helicoverpa zea, cotton
bollworm; Spodoptera exigua, beet armyworm; Pectinophora
gossypiella, pink bollworm; Anthonomus grandis, boll weevil; Aphis
gossypii, cotton aphid; Pseudatomoscelis seriatus, cotton
fleahopper; Trialeurodes abutilonea, bandedwinged whitefly; Lygus
lineolaris, tarnished plant bug; Melanoplus femurrubrum, redlegged
grasshopper; Melanoplus differentialis, differential grasshopper;
Thrips tabaci, onion thrips; Franklinkiella fusca, tobacco thrips;
Tetranychus cinnabarinus, carmine spider mite; Tetranychus urticae,
twospotted spider mite; Rice: Diatraea saccharalis, sugarcane
borer; Spodoptera frugiperda, fall armyworm; Helicoverpa zea, corn
earworm; Colaspis brunnea, grape colaspis; Lissorhoptrus
oryzophilus, rice water weevil; Sitophilus oryzae, rice weevil;
Nephotettix nigropictus, rice leafhopper; Blissus leucopterus
leucopterus, chinch bug; Acrosternum hilare, green stink bug;
Soybean: Pseudoplusia includens, soybean looper; Anticarsia
gemmatalis, velvetbean caterpillar; Plathypena scabra, green
cloverworm; Ostrinia nubilalis, European corn borer; Agrotis
ipsilon, black cutworm; Spodoptera exigua, beet armyworm; Heliothis
virescens, cotton budworm; Helicoverpa zea, cotton bollworm;
Epilachna varivestis, Mexican bean beetle; Myzus persicae, green
peach aphid; Empoasca fabae, potato leafhopper; Acrosternum hilare,
green stink bug; Melanoplus femurrubrum, redlegged grasshopper;
Melanoplus differentialis, differential grasshopper; Hylemya
platura, seedcorn maggot; Sericothrips variabilis, soybean thrips;
Thrips tabaci, onion thrips; Tetranychus turkestani, strawberry
spider mite; Tetranychus urticae, twospotted spider mite; Barley:
Ostrinia nubilalis, European corn borer; Agrotis ipsilon, black
cutworm; Schizaphis graminum, greenbug; Blissus leucopterus
leucopterus, chinch bug; Acrosternum hilare, green stink bug;
Euschistus servus, brown stink bug; Delia platura, seedcorn maggot;
Mayetiola destructor, Hessian fly; Petrobia latens, brown wheat
mite; Oil Seed Rape: Brevicoryne brassicae, cabbage aphid;
Phyllotreta cruciferae, Flea beetle; Mamestra configurata, Bertha
armyworm; Plutella xylostella, Diamond-back moth; Delia ssp., Root
maggots.
Methods for Increasing Plant Yield
[0124] Methods for increasing plant yield are provided. The methods
comprise introducing into a plant or plant cell a polynucleotide
comprising a nematicidal sequence disclosed herein. Expression of
the nematicidal sequence results in improved resistance to nematode
infestation which, in turn, increases the yield of a transgenic
plant compared to the yield of a plant not expressing a polyphenol
oxidase (when exposed to plant-parasitic nematodes). As defined
herein, the "yield" of the plant refers to the quality and/or
quantity of biomass produced by the plant. By "biomass" is intended
any measured plant product. An increase in biomass production is
any improvement in the yield of the measured plant product.
Increasing plant yield has several commercial applications. For
example, increasing plant leaf biomass may increase the yield of
leafy vegetables for human or animal consumption. Additionally,
increasing leaf biomass can be used to increase production of
plant-derived pharmaceutical or industrial products. An increase in
yield can comprise any statistically significant increase
including, but not limited to, at least a 1% increase, at least a
3% increase, at least a 5% increase, at least a 10% increase, at
least a 20% increase, at least a 30%, at least a 50%, at least a
70%, at least a 100% or a greater increase in yield compared to a
plant not expressing the nematicidal sequence.
Methods for Identifying Quantitative Trait Loci Associated with
Nematode Resistance
[0125] Also provided herein are methods for identifying or
validating markers associated with a quantitative trait loci (QTL)
for nematode resistance or tolerance. The methods comprise
evaluating genetic markers within the genomic region surrounding
one or more polyphenol oxidase genes in a population of plants
showing resistance or tolerance to nematode infestation, and
detecting an association between one or more genetic marker(s) and
the nematode resistance trait. High density genetic maps have been
developed for many species of plants susceptible to nematode
infestation, including maize and soybean plants. Markers from these
maps can be evaluated for the association, and
positively-associated markers can be used in downstream
applications such as marker-assisted breeding. Methods for
evaluating marker:trait associations are known in the art and can
be applied to genomic regions encoding genes having homology to
polyphenol oxidase genes.
[0126] In another embodiment, QTLs that are known or suspected to
be associated with nematode resistance can be evaluated to
determine whether a polyphenol oxidase gene is within or near the
QTL. In this embodiment, regions within or surrounding the QTL can
be sequenced and searched for polyphenol oxidase homologs.
[0127] The following examples are offered by way of illustration
and not by way of limitation.
EXPERIMENTAL
Example 1
Assay for Nematicidal Activity
[0128] Use of biogenic amines to induce feeding and/or movement
from parasitic nematodes has been demonstrated previously for RNAi
uptake experiments (for example, see P. E. Urwin, Catherine J.
Lilley, and Howard J. Atkinson, "Ingestion of Double-Stranded RNA
by Preparasitic Juvenile Cyst Nematodes Leads to RNA Interference"
Molecular Plant Microbe Interaction Vol. 15, No. 8, 2002, pp.
747-752. Also see M J Kimber, S McKinney, S. McMaster T A Day, C C
Flemming and A G Maule (2007) "Flp gene disruption in a parasitic
nematode reveals motor dysfunction and unusual neuronal sensitivity
to RNA interference" The FASEB Journal vol 21 pp 1233-1242)
[0129] Assays of SCN activity provided herein are based on use of
an SCN bioassay that typically contains .about.200 J2 nematodes
(hatched within 2 days of assay) per well in a 96-well half-area
plate. The nematodes are incubated in 20 mM Tris buffer (pH 8.0)
containing 50 mM octopamine, and the following antibiotic and
antifungal components: gentamycin (1.5 ug/ul), nystatin (0.05
ug/ul), Sigma antibiotic-antimycotic (cat #A5955) at 1.times.,
Infuse antimycotic (1/1500 dilution from stock) all in 30 ul final
volume, including the test strain or protein. The assay plate is
incubated at 28 C in a humidified chamber. Scoring of the assay is
facilitated by addition of sodium carbonate, which causes living
nematodes to curl, while dead nematodes remain straight and rigid.
Scoring must be done within .about.10 minutes of the carbonate
addition. Activity on nematodes is scored on the following scale,
and compared with negative control, and positive control
samples.
TABLE-US-00001 TABLE 1 Scoring Convention for SCN Assays SCN
Scoring Convention Score assigned Mortality on SCN (%) 0 0-10% 1
11-20% 2 21-50% 3 51-75% 4 76-95% 5 96-100%
Example 2
Enhancement of Steady-State Levels of Nematode Protein Toxins in
Microbial Strains
[0130] A microbial strain of interest (e.g., a bacteria or fungal
strain) is grown under media conditions that can partially limit
the availability of nutrients to the microbe. For instance, the
availability of carbon or nitrogen can be reduced in the minimal
growth medium.
[0131] The medium is supplemented with components that are useful
to stimulate microbial production of nematode toxins. As one
example, the addition of gelatin to a growth medium can mimic the
gelatinous cuticle found on some nematodes, and thus stimulate the
microbial production of nematode protein toxins. As another
example, the addition of nematodes to the growth medium (such as C.
elegans or soybean cyst nematode) can stimulate the microbial
production of protein toxins. As another example, a nematode
extract can be prepared and added to a microbial growth medium to
stimulate the production of microbial protein toxins. The various
components can also be combined.
[0132] The growth medium (supplemented with a component to
stimulate toxin production) is inoculated with a microbial strain
or strains, and then grown under conditions appropriate for strain
growth. Whole culture or some fraction of the culture (for example:
culture supernatants, protein extracts, solubilized protein
extracts, pellet extracts, etc) are then tested to determine if a
nematode toxin has been produced by the microbial strain under the
growth medium and growth conditions tested.
Example 3
Identification of Nematode Protein Toxin from Arthrobotrys
oligospora
[0133] It is known in the art that nematicidal fungi can be
isolated from soil, in particular from suppressive soils. Several
such fungi were obtained and tested for SCN activity, under a
variety of growth conditions.
[0134] Arthrobotrys oligospora is a nematophagous fungus, and has
been observed previously to have nematicidal activity in soil. This
activity has been associated with nematophagous trapping in the
literature (for example, see Nansen et al., 1988. Vet Parasitol.,
26:329-37). There is no description of nematode protein toxins
production by Arthrobotrys oligospora or related strains.
[0135] To test the ability of an Arthrobotrys oligospora strain to
produce protein toxins, an Arthrobotrys oligospora strain
(ATX21995) was inoculated into Arthrobotrys medium supplemented as
shown below, and incubated at 30.degree. C. with gentle shaking for
7 days. The resulting extracts were tested for ability to kill SCN.
[0136] Arthrobotrys medium (per liter): [0137] 1 g glucose [0138]
0.5 g (NH.sub.4).sub.2SO.sub.4 [0139] 0.5 g MgSO.sub.4 [0140] 2 g
KH2PO.sub.4 [0141] 0.005 g FeSO.sub.4 [0142] Adjust to pH 6.0 with
KOH [0143] Optional: [0144] 0.5 g gelatin [0145] Add C. elegans
harvested from 1 MYOB plate (100 mm plate, first inoculated with E.
coli to generate a lawn as diet for C. elegans) per 50 mL of
medium
TABLE-US-00002 [0145] TABLE 2 Activity of ATX 21995 grown in
various media Activity of ATX 21995 Media Description extract Media
1 Potato dextrose broth No/Low activity Media 2 Arthrobotrys medium
No/Low Activity Media 3 Arthrobotrys medium + gelatin Inconsistent
Activity Media 4 Arthrobotrys medium + gelatin + Consistent, Strong
Activity nematodes
[0146] From each culture medium, a soluble protein extract was
prepared following seven days of growth. At that time, the fungal
biomass was separated from the growth medium using a disposable 0.4
micron filter unit, and this biomass was then ground in a mortar
and pestle in the presence of liquid nitrogen to lyse the cells.
This material was then resuspended in buffer A (50 mM Tris (pH
8.0), 1 mM DTT) and submitted for soybean cyst nematode (SCN)
bioassays.
[0147] Protein extracts were prepared from Arthrobotrys oligospora
(ATX21995) cultures grown in Arthrobotrys medium (+gelatin, +C.
elegans) for seven days. Extracts were prepared by grinding the
fungal biomass in the presence of liquid nitrogen (as described
above) and resuspending the lysed cell material in buffer at pH 6.0
(50 mM MES, 1 mM DTT), pH 8.0 (50 mM Tris, 1 mM DTT) or pH 10.4 (50
mM CAPS, 1 mM DTT). Extracts prepared in this manner assayed for
SCN activity all showed strong activity on SCN.
Example 4
Purification of AXN-1 from ATX21995
[0148] Purifications were carried out using extracts prepared from
ATX 21995 grown in Arthrobotrys Medium contain gelatin and
nematodes. Typically, purifications were carried out at large scale
by growing several 250 mL flasks (approximately 30-60 flasks) with
50 mL of medium in each flask to allow sufficient quantities of
protein to enter the purifications.
[0149] Two different protein purifications were carried out from
cultures of strain ATX21995. The total fungal biomass from these
cultures was lysed (mortar and pestle with liquid nitrogen), and
the protein was fractionated by FPLC using standard purification
methods. These purifications resulted in identification of an
.about.50 kDa protein that correlated with the elution of the SCN
activity.
Example 5
Protein Characterization of 50 kDa Protein from ATX21995
[0150] To clone the gene encoding the .about.50 kDa protein,
approximately 10-15 micrograms of this protein was isolated, and a
small quantity of the sample was electroblotted to a PVDF membrane
by standard methods, stained the membrane with Coomassie dye, and
the band corresponding to the 50 kDa protein excised and subjected
to N-terminal sequencing as known in the art. This protein was
found to yield very small amounts of free amino acids during the
sequencing reactions, which suggested that the N-terminus of the
protein might be chemically modified.
[0151] A gel slice containing the 50 kDa protein was digested
in-gel with trypsin, and the fragments were then separated by HPLC.
Individual peaks were then analyzed by MALDI to identify fragments
suitable for protein sequencing. A total of 5 tryptic fragments
were selected, and subjected to Edman degradation for protein
sequencing (Table 3). Edman degradation sequencing reactions
yielded the following sequences for each of these peaks:
TABLE-US-00003 TABLE 3 N-terminal sequence of tryptic fragments
Peak Primary Sequence Identified by Name Edman Degradation SEQ ID
NO: 20 G-T-W-S-I A A G S R 23 24 D-S-T-G-E-F-N-A-T-L-Y-R 24 29
S-A-P-Y-A-I-T-G-I 25 36 Y-P-D-A-W-F-N-A-Q-S-A-Q-L-R 26 42
F-G-S-S-Y-P-E-L-Q-P 27
Example 6
Cloning of a cDNA that Encodes the 50 kDa Protein from ATX21995
[0152] Total RNA was isolated from ATX21995 cultures grown for 2
days, 4 days and 6 days. This RNA was reverse transcribed to
generate cDNA; this cDNA was subsequently normalized to decrease
the abundance of strongly expressed transcripts. Using this cDNA as
a starting template, several PCR products were generated and
sequenced.
[0153] Degenerate PCR Based Use of cDNA Linker Sequence.
[0154] A number of degenerate oligonucleotides based on the amino
acid sequence of 24 (see Table 3) were designed and tested in
combination with the oligonucleotides that represent the ends of
the cDNA pool. A set of conditions was identified that resulted in
amplification of an 1840 nucleotide PCR product. This PCR product,
as well as several other candidate PCR products, was cloned into a
TOPO vector, and the DNA sequences adjacent to the vector were
determined.
[0155] Degenerate PCR Based Solely on Amino Acid Sequence.
[0156] Degenerate PCR primers were designed based on the amino acid
sequences of fragments 20, 24, and 29, 36, and 42 from Table 3.
This set of degenerate oligonucleotides utilized inosine in several
positions to reduce the degeneracy of the resulting
oligonucleotides. Also, when possible, a set of nested degenerate
PCR primers was designed for each amino acid sequence in Table 3.
This strategy allows use of the "outside" primers (those based on
the more N-terminal amino acids of a sequence in Table 3) in the
first round of PCR, and a second "nested" set of primers (based on
amino acids slightly C-terminal, but overlapping the amino acids
utilized for the "outside" primers).
[0157] A matrix of PCR reactions using these degenerate
oligonucleotides lead to the cloning and sequencing of several
amplification products, which showed DNA homology and overlap with
the 1840 nucleotide clone isolated previously, and together
comprised a complete cDNA open reading frame; suggesting that all
of these partial cDNAs originated from a single gene.
[0158] Cloning of axn-1 cDNA, and Determination of the Genomic
Sequence.
[0159] Based on the DNA sequences of several partial cDNA
sequences, PCR primers were designed to repeatedly amplify and
sequence the cDNA coding region. Several independent cDNAs were
cloned and completely sequenced. In some cases, individual cDNA
clones contained small unspliced introns, consistent with alternate
splicing of the hRNA produced from this gene. For example, two
variants of the 5' untranslated region (UTR) were recovered. These
variants are identical for 42 nt upstream of the start site (and
including the region encoding the N-terminus of the encoded
protein); however they then diverge for another 60-80 nt upstream;
this is likely to presence of an alternately spliced or unspliced
intron in the 5' UTR of one of the cDNAs.
[0160] PCR primers from the cDNA were used to amplify and sequence
eight independent genomic clones from the region encoding the cDNA.
The sequence from this genomic region matches the cDNA sequence
exactly over the length of the cDNA. Thus, the DNA sequence of the
multiple genomic and full cDNA clones confirms the structure of the
cDNA, and its genomic organization.
[0161] This gene encoding the cDNA is designated herein as axn-1,
and the encoded full length protein is designated as AXN-1. The
sequence of the axn-1 cDNA is set forth in SEQ ID NO:2, and the
open reading frame is provided as SEQ ID NO:3; the sequence of the
AXN-1 full length protein is provided as SEQ ID NO:4. The
full-length chromosomal sequence for axn-1 is set forth in SEQ ID
NO:1. The truncated amino acid sequence is set forth in SEQ ID
NO:5. A synthetic DNA sequence encoding the full-length AXN-1 amino
acid sequence is set forth in SEQ ID NO:6.
[0162] In addition to the 5'UTR variants, it is interesting to note
that many cDNAs isolated by these experiments have internal
modifications relative to the sequences described herein. For
example, many clones appear to be internally deleted relative to
the full-length sequence, and others clearly contain unspliced
introns. So, it is likely that this gene, designated herein as
axn-1, is subject to alternate mRNA processing including alternate
mRNA splicing. These alternate mRNAs are likely to be minor
components of the steady-state axn-1 mRNA, and the cDNA
normalization process utilized in the cloning of these cDNA has
likely increased the relative proportion of these variants to the
fully spliced full-length transcript.
Example 7
AXN-1 is Homologous to Monophenol Oxidases
[0163] An alignment of AXN-1 to other polyphenol oxidase sequences
is provided in FIG. 3, and the percent sequence identity of AXN-1
to these sequences is provided in Table 4.
[0164] Another interesting observation was that a section of the
protein encoded by this cDNA contained many repeated amino acids,
especially glutamine (Q), and did not show homology to polyphenol
oxidases or tyrosinases in databases searches.
TABLE-US-00004 TABLE 4 Amino Acid Identity of AXN-1 to other fungal
proteins Percent Identity Organism/Protein to AXN-1
Neurospora_crassa 17% Pyrenophora_tritici 15% Podospora_anserina
20% Lentinula_edodes 17% Pycnoporus_sanguineus 19% Pholio_nameko
18% Tuber_melanosporum 16% Asp_fum_tyrosinase 14%
Example 8
Nematode Toxin from Strain ATX20514
[0165] Bacterial strain ATX20514 was identified from empirical
screening of strains, based on strong toxicity of cultures towards
soybean cyst nematode (SCN) in the standard bioassay format.
[0166] ATX 20514 was grown in C2 medium in 96-well blocks for 3
days at 30.degree. C. Next, the cells in each well were lyzed with
a bead beater, and the lysed cell extract was fed to soybean cyst
nematodes (J2 stage) in the presence of a feeding stimulant
(octopamine). Five days after incubation, the toxicity towards SCN
was scored on the scale of 0 to 5 as shown in Table 1.
[0167] The soluble fraction prepared from ATX20514 in this manner
scored as a "5" when 5 .mu.L of this extract was incorporated into
the SCN bioassay.
[0168] A protein extract was prepared from strain ATX20514 by
growing the strain in 50 mL of C2 medium at 30.degree. C. for 3
days. At that time, cells in the culture were lysed by bead beater
treatment, and the crude lysate was centrifuged at 18,000.times.g
for 15 minutes to pellet the cell debris and insoluble proteins.
The soluble protein extract was recovered as the supernatant
fraction, and then filtered, and this material was then subjected
to multiple treatments followed by testing in an SCN bioassay.
[0169] Heat. An aliquot of the protein extract (100 .mu.L) was
heated at 100.degree. C. for 30 minutes, and tested in an SCN
bioassay. A negative control sample was mock treated alongside, and
likewise tested in SCN bioassay.
[0170] Protease. An aliquot of the protein extract (95 .mu.L) was
proteolytically digested with 5 .mu.L of Pronase (1 mg/mL
final)(Roche) for 3 hours at 37.degree. C.
[0171] Dialysis. An aliquot of the protein extract (100 .mu.L) was
dialyzed against either 20 mM Tris, pH 8.0, ("Buffer A") or 50 mM
sodium phosphate, 150 mM NaCl, pH 7.0.
[0172] Filtration. An aliquot of the protein extract (500 .mu.L)
was placed above a spin filter membrane with a 3000 molecular
weight cutoff (Millipore) and centrifuged at 12,000.times.g until
approximately 400 .mu.L of the total volume had passed through the
filter unit. Additional protein extract was then added above the
spin filter membrane, and the centrifuge step was repeated until
approximately 400 .mu.L of the total volume had again passed
through the filter unit.
[0173] The results of the SCN bioassays are shown in Table 5. These
results support the conclusion that the SCN activity in strain
ATX20514 is due to a protein active against SCN.
TABLE-US-00005 TABLE 5 Characterization of ATX20514 activity SCN
Sample Score Heat Treatment ATX 20514 extract 5 ATX 20514 extract
heat treated 0 Protease Treatment Protease Treatment Protease
treated 0 Control; protease only, no extract 0 Dialysis Dialyzed vs
20 mM Tris, pH 8.0 5 20 mM Tris, pH 8.0 0 ATX20514 extract dialyzed
50 mM sodium phosphate, 150 mM 5 NaCl, pH 7.0 50 mM sodium
phosphate, 150 mM NaCl, pH 7.0 0 Size exclusion filtration ATX20514
extract retentate from spin dialysis 5 ATX20514, filtrate from spin
dialysis 0
Example 9
Purification of a Nematode Protein Toxin from ATX20514
[0174] A four-column purification was carried out, leading to the
identification of a 52 kDa protein band that correlated with SCN
toxicity.
[0175] ATX20514 was grown in 2 liters of C2 medium at 30.degree. C.
for 3 days. The culture was centrifuged, and the pellet was
resuspended in 100 mL of 50 mM Tris (pH 8.0). The cell pellet was
then lysed using a French press, centrifuged at 18,000.times.g for
15 minutes, and the supernatant fraction (i.e., the soluble protein
extract) was forwarded into ammonium sulfate precipitation,
dialysis, and column chromatography purification.
[0176] After 3 steps of column chromatography, the active fractions
were dialyzed against 50 mM Tris (pH 8.0), 1 mM DTT; ("Buffer A"),
loaded onto a Mono Q anion exchange column (1 mL; GE Healthcare)
and washed with the same buffer. Elution was carried out with a 40
column volume gradient from 0 M to 0.2 M NaCl in Buffer A.
Individual fractions were submitted for SCN bioassays, and
fractions 21 through 24 were found to possess the strongest SCN
toxicity. A protein of approximately 52 kDa correlated well with
the SCN toxicity in these fractions.
Example 10
N-Terminal Sequence of the 52 kDa Protein from ATX20514
[0177] Individual purification fractions enriched for the 52 kDa
protein were separated by gel electrophoresis, transferred to PVDF,
and stained with Coomassie Blue. The section of the membrane
containing the 52 kDa protein was excised and subjected to
N-terminal sequencing. The resulting N-terminal sequence was
compiled using the amino acid corresponding to the biggest peak at
each position on the chromatograms.
Example 11
Purification and N-Terminal Sequence of 31 kDa Protein from
ATX20514
[0178] In addition to the activity that correlated with the 52 kDa
protein, a second active peak having SCN activity was eluted from a
cation exchange column. These active fractions were subsequently
loaded onto an anion exchange column to further purify the
activity. Thus, a 31 kDa protein was identified that correlated
with this SCN activity:
[0179] To further characterize the 31 kDa protein, N-terminal
sequencing was performed. This analysis allowed us to compare the
N-terminal protein sequence of the 52 kDa protein to that of the 31
kDa protein. We found the two amino acid sequences to be very
similar, suggesting that the 31 kDa protein is a truncation of the
52 kDa protein:
TABLE-US-00006 TABLE 6 N-terminal sequences of ATX 20514 toxins
Protein Primary Sequence Identified by Size Edman Degradation SEQ
ID NO: 52 kDa STSRQDVAKLGPGWNKVLLNYALAMQALDE 28 31 kDa
STSGQDVAKLGPQWNKVLLNYALAMQALDE 29
Example 12
Cloning of axn-8 from ATX20514
[0180] Using the N-terminal sequence data from the 52 kDa and 31
kDa toxins, the gene encoding these proteins was cloned in several
steps by a degenerate PCR and Tail strategies as known in the art,
leading to the amplification of an approximately 5 kb fragment from
multiple rounds of TAIL PCR. This region contains an open reading
frame encoding an amino acid protein. Herein we designate this gene
as axn-8 and the corresponding protein as AXN-8. The N-terminus of
the predicted AXN-8 protein matches well to the amino acid
sequences of the 52 kDa and 31 kDa proteins. Furthermore, the DNA
sequence downstream of axn-8 contains a second open reading frame
that is likely to be co-expressed with axn-8 in an operon. The DNA
sequence containing regulatory elements is set forth in SEQ ID
NO:11. The open reading frame for axn-8 is set forth in SEQ ID
NO:12, and encodes SEQ ID NO:13. The predicted truncated protein is
set forth in SEQ ID NO:14. The metal binding integral membrane
protein encoded by the downstream ORF is set forth in SEQ ID NO:41.
It is recognized that the truncation site may be at least about 1,
at least about 2, at least about 3, 4, 5, 6, 7, 8, 9, or 10 amino
acids in either direction of the arginine at position 295 of SEQ ID
NO:13. A synthetic DNA sequence encoding SEQ ID NO:13 is set forth
in SEQ ID NO:15.
Example 13
Homology of AXN-8 to Monophenol Oxidases
[0181] A BLAST analysis of AXN-8 shows that it shares homology with
known bacterial monophenol oxidases. This class of enzymes also
includes tyrosinases.
TABLE-US-00007 TABLE 7 Closest homologs of AXN-8 GENBANK % E Enzyme
Source Accession# Homology Score Tyrosinase Delftia YP_001562639
55% 7e-92 acidovorans Tyrosinase Ruegeria YP_166646 54% 6e-86
pomeroyi Tyrosinase Burkholderia ZP_02466656 52% 6e-79
thailandensis Monophenol Agrobacterium YP_002549739 50% 2e-71
oxidase vitis Tyrosinase Rhizobium etli ZP_03501998 47% 2e-60
Tyrosinase Dyadobacter ZP_03898981 44% 1e-56 fermentans
[0182] An alignment of AXN-8 with tyrosinase enzymes (FIG. 4)
reveals that it possesses sequence motifs that are consistent with
these tyrosinases, including the presence of histidine residues
that are likely to be necessary for binding of copper ions by the
enzyme.
Example 14
Dose Response of AXN-8 Activity
[0183] A sample of AXN-8 protein was used to assess the effect of
different protein amounts on SCN. This sample was diluted in
nematode assay buffer, and assays were set up to establish final
AXN-8 protein concentrations up to 25 .mu.g/ml. Nematodes were
incubated, and results scored after five days. Scores are the
average of two to four replicates.
TABLE-US-00008 TABLE 8 Dose Response of AXN-8 [AXN-8] in Assay
(.mu.g/ml) SCN Score 25 4.7 12.5 4 6.25 2.7 3.125 1.7 1.6 0.75 0.8
1 0.4 1 0 1
Example 15
Cloning of AXN-2 from Bacillus thuringiensis Strain ATX25028
[0184] Independently of the purification of AXN-1 and AXN-8, SCN
activity was observed from several Bacillus strains. The discovery
that both AXN-1 and AXN-8 encode proteins with homology to
oxidase/tyrosinase suggested that perhaps the activity in these
Bacillus strains was also due to oxidase/tyrosinase like
activity.
[0185] ATX25028 DNA was prepared as described previously, and DNA
sequence of plasmid preparations was obtained.
[0186] Analysis of the partial DNA sequences obtained from ATX25028
DNA demonstrated existence of a gene encoding an
oxidase/tyrosinase-like enzyme in this strain. The DNA sequence was
utilized to design PCR primers, and the open reading frame of the
full gene was amplified by PCR from genomic DNA or ATX 25028. This
gene is referred to herein as axn-2 (SEQ ID NO:7), and its encoded
protein as AXN-2 (SEQ ID NO:8). Clone pAX5530 contains axn-2
inserted into a modified pRSF-1b vector (Novagen) as a BamHI-AscI
fragment to generate a his-tag containing protein (SEQ ID NO:9).
pAX5531 contains axn-2 inserted into a modified pRSF-1b vector as a
PstI-AscI fragment such that the expressed protein lacks a His-tag.
Lysates derived from E. coli cells expressing AXN-2 protein were
generated and tested for activity on SCN. Clones both containing
and lacking an N-terminal His tag exhibited strong activity on
SCN.
[0187] A synthetic gene encoding AXN-2 is set forth in SEQ ID
NO:10.
TABLE-US-00009 TABLE 9 Activity of AXN-2 clones on SCN SCN Clone
Protein Sample description score Neg control Neg control Unlysed
culture 0 Neg control Neg control Lysed culture 0 Neg control Neg
control Tris extract concentrated 4X 0 pAX5531 AXN-2 Unlysed
culture 5 pAX5531 AXN-2 Lysed culture 4 pAX5531 AXN-2 Tris extract
concentrated 4X 5 pAX5530 AXN-2 5'His tag Unlysed culture 5 pAX5530
AXN-2 5'His tag Lysed culture 5 pAX5530 AXN-2 5'His tag Tris
extract concentrated 4X 5 -- -- LB kan media control 0 -- -- Tris
Buffer buffer control 0
Example 16
Cloning of a Polyphenol Oxidase from the Nematode Active Strain
ATX26455
[0188] ATX26455 was identified as a strain exhibiting strong
activity in a soybean cyst nematode (SCN) assay. ATX 26455 was
grown in a C2 medium in 96-well blocks for 3 days at 30.degree. C.
Cells were then lysed, and the lysed cell extract was fed to
soybean cyst nematodes (J2 stage) as described herein. Five days
after incubation, the toxicity towards SCN was scored as described
herein. Such an assay using 5 .mu.L of lysed extract prepared from
ATX26455 was assigned a score of "5," denoting 96-100% mortality of
this extract on SCN.
Preliminary Biochemical Screening of ATX26455
[0189] A protein extract was prepared from strain ATX26455 by
growing the strain in C2 medium at 30.degree. C. for 3 days. A
soluble protein extract was prepared from the culture and subjected
to the following biochemical characterizations, followed by assay
for activity on SCN: [0190] Heat treatment. An aliquot of the
protein extract was heat-treated to destroy protein activity, and
then tested in an SCN bioassay. A negative control sample was mock
treated alongside the heat treated sample, and likewise tested in
an SCN bioassay. [0191] Protease treatment. An aliquot of the
protein extract was proteolytically digested with a protease (such
as Pronase) and then tested in an SCN bioassay. Negative controls
without protease were also tested in an SCN bioassay. [0192]
Dialysis treatment. An aliquot of the protein extract was dialyzed
to allow small molecules to be removed from the extract, and then
tested in an SCN bioassay. [0193] Filtration testing. An aliquot of
the protein extract was placed above a spin filter membrane with a
3000 molecular weight cutoff and then centrifuged to pass the
extract through the filter unit. The retentate and the filtrate
were then tested in an SCN bioassay.
[0194] The results of the SCN bioassays carried out on each of the
preliminary biochemical samples are shown in Table 10. These
results suggest that the nematode toxin produced by ATX26455 is
conferred by a protein.
TABLE-US-00010 TABLE 10 Activity Tests of ATX 26455 Fractions SCN
Bioassay Sample Score Heat Treatment Extract 5 Extract, heat
treated 0 Protease Treatment Extract, protease treated 0 Protease
only (negative control) 0 Dialysis Treatment Extract, dialyzed 2
Dialysis control 0 Filtration Testing Filter, retained by filter 4
Filter, flow through 0
Purification of Nematode Protein Toxin from ATX26455
[0195] Cells were lysed using a French press, and the lysate was
centrifuged, and the supernatant collected, resulting in a
clarified lysate. The clarified lysate was highly active in an SCN
bioassay. The activity in this clarified lysate was confirmed to be
sensitive to protease digestion. The clarified lysate was further
enriched by the following ammonium sulfate precipitation steps.
[0196] First, the clarified lysate was brought to 13% saturation
with ammonium sulfate, centrifuged, and the pellet discarded. This
procedure was repeated at 25% saturation with ammonium sulfate.
Finally, the supernatant was brought to 50% saturation with
ammonium sulfate, and after centrifugation, the pellet was
recovered and resuspended in buffer, and subjected to dialysis to
remove the residual ammonium sulfate. The resuspended pellet was
then fractionated on an anion exchange column, and the fractions
that showed activity in SCN bioassay were collected and pooled. The
pooled active fractions were further fractionated on a hydrophobic
interaction column. This resulted in the identification of a
protein band migrating at location corresponding to a protein of
approximately 35 kDa. This protein (referred to herein as the "35
kDa protein") correlated well with the SCN toxicity observed during
each step of the purification, and that was highly enriched during
the purification process.
Characterization of the 35 kDa Protein from ATX26455
[0197] The sequence of the N-terminal amino acids of a protein of
interest from ATX 26455 was determined by Edman degradation as
known in the art. A protein fraction containing the protein of
interest was separated by gel electrophoresis, and the proteins in
the resulting gel were transferred to a PVDF membrane. The membrane
was then stained with Coomassie Blue, and the section of the
membrane containing the 35 kDa protein was excised, and subjected
to N-terminal sequencing. The N-terminal sequence of this protein
was determined by this method to be as follows:
N-Terminal Sequence of Protein from Active Fractions of
ATX26455
TABLE-US-00011 (SEQ ID NO: 42)
M-N-T-I-R-Q-D-V-A-T-L-G-S-G-W-D-N-K-V-L-L-N-Y-A-L-
A-M-R-E-L-D-K-L-P-I-T-N.
[0198] Interestingly, this protein sequence revealed sequence
similarities with the AXN-8 protein described herein, suggesting
that the activity of ATX 26455 is in fact, also due to activity of
a homologous, but novel, polyphenol oxidase.
Cloning of Nematode Active Toxin Gene from ATX26455
[0199] The N-terminal protein sequence of the putative toxin was
utilized to a degenerate oligonucleotide primer corresponding to
this sequence. The sequence of that primer is shown here (using the
nomenclature established by the International Union of Pure and
Applied Chemistry):
TABLE-US-00012 (SEQ ID NO: 43) 5' CAR GAY GTI GCI ACI YTI GGI CCI
GGI TGG 3'
[0200] To generate a degenerate oligonucleotide to amplify the
reverse strand of the toxin gene, the DNA sequence of the axn-8
gene was utilized as a template, resulting in generation of a
series of degenerate oligonucleotide primers for testing on
ATX26455. One PCR primer designed by this approach is shown
here:
TABLE-US-00013 (SEQ ID NO: 44) 5' RTG RTG IAG CCA RAA IAT IGG RTC
3'
[0201] PCR reactions using the degenerate primers (SEQ ID NO:43 and
44) resulted in amplification and sequencing of a 711 nucleotide
PCR product. This 711 nt PCR fragment was confirmed to originate
from the DNA region encoding the 35 kDa protein.
[0202] The DNA sequence of the 711 nucleotide PCR product was
utilized to isolate the entire region coding for the 35 kDa protein
by thermal interlaced (TAIL) PCR methods known in the art. This
approach allowed assembly of the sequence of the complete open
reading frame encoding the 35 kDa protein. The axn-9 open reading
frame was amplified by PCR from ATX 26455 and cloned into a
modified prsf1b cloning vector. The insert of the resulting clone
(pAX5597) was sequenced and found to be identical to the sequence
obtained by TAIL.
[0203] The sequence of the DNA fragment is provided as SEQ ID
NO:45. The open reading frame contained within this DNA region is
designated as axn-9 (SEQ ID NO:46), and its corresponding protein
as AXN-9 (SEQ ID NO:47). The predicted truncated protein
corresponds to residue 314 of SEQ ID NO:47. It is recognized that
the truncation site may be at least about 1, at least about 2, at
least about 3, 4, 5, 6, 7, 8, 9, or 10 amino acids in either
direction of the lysine at position 314 of SEQ ID NO:47.
[0204] Inspection of the DNA sequence of the axn-9 open reading
frame shows that there is a GTG codon present at nucleotides 22-24
of axn-9. Given the proximity of this codon to the ATG start site,
and the tendency for some bacterial open reading frames to tolerate
multiple translational start sites, it is possible that translation
from this GTG codon occurs in nature, and that the resulting
protein has similar properties to the full length AXN-9 protein.
Thus, this protein is also provided herein as SEQ ID NO:48 and
designated AXN-9 (GTG).
Example 17
Homology of AXN-9 to AXN-8 and Other Polyphenol Oxidase
Proteins
[0205] ANX-9 is homologous to the AXN-8 protein disclosed herein.
AXN-9 is 68% identical to AXN-8. An alignment of AXN-9 with AXN-8
is provided as FIG. 5. Given that AXN-8 is known to be truncated,
and also given that most polyphenol oxidases are proteolytically
processed, and given the homology between AXN-8 and AXN-9, we can
predict that AXN-9 is likely to be similarly truncated by
proteolysis.
Example 18
Protease-Activated AXN-9 Protein is Active in SCN Bioassay
[0206] The bacterial AXN-9 expression vector described above
(pAX5597) was transformed into BL21*DE3 cells (Invitrogen).
Following IPTG induction, the whole cell culture was centrifuged.
The resulting pellet was resuspended in 1/10.sup.th volume of
buffer (50 mM Tris (pH 8.0), 10 .mu.M CuSO4) and then lysed by
sonication. The lysate was split into 2 aliquots, and 1 aliquot was
treated with freshly prepared trypsin (0.1 mg/mL of lysate) for 2
hours at 37.degree. C. AXN-9 protein treated with trypsin showed
strong activity on SCN, while untreated AXN-9 protein did not show
activity on SCN (Table 11).
TABLE-US-00014 TABLE 11 Activity of AXN-9 clones on SCN SCN Clone
Sample description score Neg control Buffer (50 mM Tris (pH 8.0),
10 .mu.M CuSO4) 0 Neg control Buffer + Trypsin 0 Pos control Buffer
+ Mushroom Tyrosinase 4 pAX5597 AXN-9 protein, untreated 0 pAX5597
AXN-9 protein, trypsin-treated 5
Example 19
Activity of Mushroom Tyrosinase on SCN
[0207] Given the discovery of activity of AXN-1, AXN-8, AXN-9, and
AXN-2 on SCN and the homology of AXN-1, AXN-8, AXN-9, and AXN-2 to
polyphenol oxidase/tyrosinase enzymes, previously identified
tyrosinase enzymes were tested for this property.
[0208] Mushroom tyrosinase (Sigma T3824) was resuspended in buffer
to yield a concentrated solution. Dilutions of this solution were
test on SCN as described above, and the following assay results
were obtained. Though not directly stated by the provider of this
enzyme, the enzyme contained in this "Mushroom tyrosinase" is
likely to have been derived from the white mushroom.
TABLE-US-00015 TABLE 12 Titration of mushroom tyrosinase activity
[protein] in Mushroom Assay (.mu.g/ml) tyrosinase 50 4 25 3.3 12.5
3 6.25 2.7 3.125 2.7 1.6 2 0.8 1 0 1
[0209] The mushroom tyrosinase preparation obtained is demonstrated
to have activity on SCN; at approximately the same relative
concentration of enzyme relative to the tested amounts of AXN-1 and
AXN-8.
Example 20
Possibilities of Other Polyphenol Oxidases Having SCN Activity
[0210] Given the discovery of anti-SCN activity from both fungal
and bacterial proteins that have homology to polyphenol oxidases,
and the observation of activity from mushroom tyrosinase, it is
understood that many known polyphenol oxidases/tyrosinases are
likely to have such activity when tested as described herein; for
example, in an SCN bioassay containing J2 juveniles, 20 mM Tris,
and 50 mM octopamine for 3-7 days at about 20.degree. C., with
shaking in a rotary incubator, contained in a plate such as a
96-well plate.
[0211] For example, Selinheimo et al describe characterization of
fungal and plant tyrosinases, and demonstrate substrate and
activity differences between these broad classes of enzymes,
including the mushroom tyrosinase (fluka) which is likely the same
enzyme described in Example 19 above. Table 2 of Selinheimo et al
shows that such enzymes can have different substrate specificities
toward mono- and polyphenolic compounds. In general it is
understood that the plant enzyme, such as the apple and potato
enzymes of the Selinheimo et al. study, have less activity on
monophenol substrates such as tyrosine than fungal or bacterial
enzymes. Furthermore, FIG. 3 and the text of Selinheimo et al
describe that certain enzymes have the ability to cross-link a
representative protein (casein). In the cited study, each of the
enzymes is capable of crosslinking casein, but the enzymes differ
in the amount of enzyme required of crosslinking. Further, all but
one of these enzymes seemed to have a strong preference and/or
requirement for a monophenol or diphenol in the reaction in order
to achieve crosslinking. The notable exception to this requirement
is the enzyme from T. reesei.
[0212] The T. reesei enzyme (set forth in SEQ ID NO:21 and 22
herein) exhibits substrate and activity parameters that distinguish
it from the other tested enzymes. Notably, the T. reesei enzyme
showed the most efficient crosslinking of casein at the lower of
the two enzyme concentrations tested. Furthermore, and in contrast
to the other tested enzymes, T. reesei had strong activity in the
absence of a monophenol or diphenol in the reaction; although the
addition of such compounds appeared to increase the amount of such
crosslinking. The other enzymes tested appear to require a
monophenol or diphenol for such crosslinking activity. Selinheimo
et al provides further evidence for this property of the T. reesei
enzyme in additional references (Selinheimo et al. (2008) J Agric
Food Chem. 56(9):3118-28 and Selinheimo et al. (2007) J Agric Food
Chem. 55(15):6357-65), each of which is herein incorporated by
reference in its entirety.
[0213] For example, the cDNA with GENBANK accession number AK246031
from Glycine max (SEQ ID NO:16 and 17, encoding SEQ ID NO:18;
Umezawa et al (2008) DNA Res. 15(6):333-46) exhibits characteristic
homologies of plant phenol oxidases.
[0214] By way further of example, the cDNA with GENBANK accession
number AM418385 (SEQ ID NO:19) encoding a T. reesei enzyme (SEQ ID
NO:20) with homology to polyphenol oxidases, (Selinheimo et al.
(2006) FEBS Lett. 273, 4322-4335) is provided as an example of a
polyphenol oxidase that given the inventions herein is likely to
exhibit activity upon SCN.
[0215] Other sequences (according to GENBANK accession numbers)
having homology to the sequences disclosed herein are encompassed
by the present invention. An exemplary (but non-limiting) list is
set forth in Table 13.
TABLE-US-00016 TABLE 13 GENBANK Accession Numbers of Polyphenol
oxidase homologs AB005228.1 AB188749.1 AB277357.1 AF078789.2
AB010101.1 AB188750.1 AB277358.1 AF136926.2 AB011827.1 AB188751.1
AB277359.1 AF183578.1 AB011828.1 AB188752.1 AB280948.1 AF183583.1
AB011829.1 AB188753.1 AB280949.1 AF183588.1 AB011830.1 AB188754.1
AB280950.1 AF183593.1 AB011831.1 AB188755.1 AB353113.1 AF183599.1
AB018244.1 AB188756.1 AB430855.1 AF183604.1 AB022095.1 AB188757.1
AB430856.1 AF183609.1 AB023291.1 AB188758.1 AC007607.6 AF183614.1
AB024278.1 AB188759.1 AC007861.5 AF183619.1 AB024279.1 AB188760.1
AC025271.7 AF183624.1 AB024280.1 AB188761.1 AC084064.6 AF183629.1
AB024281.1 AB188762.1 AC084197.1 AF183634.1 AB027512.1 AB188763.1
AC084321.37 AF183639.1 AB032694.1 AB188764.1 AC084628.1 AF183644.1
AB032695.1 AB188765.1 AC090416.1 AF183649.1 AB032696.1 AB188766.1
AC115007.6 AF183654.1 AB032697.1 AB188767.1 AC116734.14 AF183659.1
AB033993.1 AB188768.1 AC119816.5 AF183664.1 AB038994.1 AB188769.1
AC122194.4 AF183669.1 AB044884.1 AB207236.1 AC122517.2 AF183674.1
AB052940.1 AB207237.1 AC138173.2 AF183679.1 AB056680.1 AB214954.1
AC138230.5 AF183684.1 AB060689.1 AB215107.1 AC157507.2 AF187155.1
AB070938.1 AB215108.1 AC157710.2 AF216388.1 AB070939.1 AB223612.1
AC163891.2 AF237792.1 AB081466.1 AB224151.1 AC166548.2 AF237794.1
AB107880.1 AB225958.1 AC182653.2 AF237797.1 AB107881.1 AB238605.1
AC185364.2 AF237799.1 AB108529.1 AB254132.1 AC208369.1 AF237802.1
AB108530.1 AB254133.1 AC210555.1 AF237804.1 AB108531.1 AB259663.1
AC214595.1 AF237807.1 AB120567.1 AB275646.1 AC215650.1 AF237809.1
AB178936.1 AB275647.1 AC216911.1 AF249161.1 AB178937.1 AB277347.1
AC217034.1 AF249162.1 AB178938.1 AB277348.1 AC232778.1 AF249163.1
AB178939.1 AB277349.1 AE016825.1 AF249164.1 AB178940.1 AB277350.1
AE017195.1 AF249165.1 AB188743.1 AB277351.1 AF001295.1 AF249166.1
AB188744.1 AB277352.1 AF020548.1 AF249167.1 AB188745.1 AB277353.1
AF020786.1 AF249168.1 AB188746.1 AB277354.1 AF039165.1 AF249169.1
AB188747.1 AB277355.1 AF064803.1 AF249170.1 AB188748.1 AB277356.1
AF076781.1 AF249171.2 AF249172.1 AJ006097.1 AK191107.1 AK209365.1
AF249173.1 AJ012048.1 AK191149.1 AK209447.1 AF249174.1 AJ223816.1
AK191393.1 AK209840.1 AF249175.1 AJ245880.1 AK192283.1 AK209886.1
AF249176.1 AJ248285.1 AK192618.1 AK209941.1 AF249177.1 AJ250302.1
AK192643.1 AK210502.1 AF249178.1 AJ252741.1 AK192803.1 AK210623.1
AF249179.1 AJ293806.1 AK192857.1 AK212036.1 AF249180.1 AJ297474.1
AK193779.1 AK212309.1 AF249181.1 AJ297475.1 AK193825.1 AK212393.1
AF249182.1 AJ309175.1 AK195046.1 AK212795.1 AF249183.1 AJ309176.1
AK195075.1 AK213314.1 AF249184.1 AJ334488.1 AK195088.1 AK213631.1
AF249185.1 AJ547813.1 AK195144.1 AK213730.1 AF249186.1 AJ556169.1
AK195566.1 AK214235.1 AF249187.1 AJ564729.1 AK195805.1 AK215621.1
AF249188.1 AJ574915.1 AK196684.1 AK216615.1 AF249189.1 AJ619741.1
AK197855.1 AK216621.1 AF249190.1 AJ635323.1 AK198477.1 AK216757.1
AF249191.1 AJ697805.1 AK198785.1 AK217145.1 AF252540.1 AJ698339.1
AK199363.1 AK217194.1 AF255610.1 AJ698340.1 AK199676.1 AK217421.1
AF261957.1 AJ698341.1 AK201739.1 AK217687.1 AF261958.1 AJ698342.1
AK201880.1 AK217984.1 AF263611.1 AJ786639.1 AK201909.1 AK218840.1
AF269192.1 AJ786640.1 AK202339.1 AK219113.1 AF280808.1 AJ845083.2
AK202954.1 AK219854.1 AF338426.3 AK014619.1 AK202956.1 AK219958.1
AF343911.2 AK027025.1 AK204044.1 AK219983.1 AF350261.1 AK027863.1
AK204101.1 AK220030.1 AF359360.3 AK033040.1 AK204148.1 AK241303.1
AF359361.3 AK108237.1 AK205161.1 AK246031.1 AF363027.1 AK115853.1
AK205196.1 AK247107.1 AF368291.1 AK115906.1 AK205614.1 AK247126.1
AF380300.1 AK116290.1 AK206598.1 AK247410.1 AF391288.1 AK148172.1
AK206970.1 AK293115.1 AF395447.2 AK148332.1 AK207020.1 AK297887.1
AF397401.1 AK148341.1 AK207203.1 AL138753.8 AF397402.1 AK148357.1
AK207218.1 AL139318.9 AF400250.1 AK148370.1 AK207451.1 AL591688.1
AF401231.1 AK148432.1 AK207965.1 AL606526.10 AF445638.2 AK148441.1
AK208194.1 AL606645.2 AF473807.2 AK177534.1 AK208432.1 AL646052.1
AF507945.1 AK190354.1 AK208651.1 AL670884.7 AJ000503.1 AK191069.1
AK208819.1 AL731611.2 AL731637.2 AY149460.1 AY333979.1 AYS12904.1
AL939108.1 AY149880.1 AY333982.1 AYS15264.1 AL939113.1 AY149881.1
AY333984.1 AY822711.1 AL954747.1 AY149882.1 AY333985.1 AY837842.1
AM418385.1 AY162287.1 AY338251.1 AY842859.1 AM420293.1 AY236224.1
AY341747.1 AY844019.1 AM424232.2 AY254101.1 AY341748.1 AY844020.1
AM440949.2 AY266330.1 AY341749.1 AY844021.1 AM442013.1 AY274808.1
AY341750.1 AY844022.1 AM448108.2 AY279540.1 AY341751.1 AY844023.1
AM451548.2 AY283062.1 AY341752.1 AY844024.1 AM467012.2 AY322334.1
AY341753.1 AY844025.1 AM478512.2 AY322335.1 AY341754.1 AY844026.1
AM502246.1 AY322336.1 AY341755.1 AY844027.1 AM746676.1 AY322337.1
AY341756.1 AY844028.1 AM774403.1 AY322338.1 AY341757.1 AY844029.1
AM920430.1 AY322339.1 AY341758.1 AY844030.1 AM920435.1 AY322340.1
AY341759.1 AY844031.1 AM920436.1 AY322341.1 AY341760.1 AY844032.1
AM920437.1 AY322342.1 AY341761.1 AY844033.1 AM949571.1 AY322343.1
AY341762.1 AY844034.1 AM949572.1 AY322344.1 AY341763.1 AY844035.1
AM949573.1 AY322345.1 AY341764.1 AY844036.1 AM949574.1 AY322346.1
AY341765.1 AY844037.1 AM949575.1 AY322347.1 AY370019.1 AY844038.1
AM949576.1 AY322348.1 AY451324.1 AY844039.1 AP000720.4 AY322349.1
AY515506.1 AY844040.1 AP003280.2 AY322350.1 AY526904.1 AY844041.1
AP003290.2 AY322351.1 AY596266.1 AY844042.1 AP008207.1 AY322352.1
AY596267.1 AY844043.1 AP008210.1 AY322353.1 AY596268.1 AY844044.1
AP009294.1 AY322354.1 AY596269.1 AY844045.1 AP009493.1 AY322355.1
AY596270.1 AY844046.1 AP009632.1 AY322356.1 AY659975.1 AY844047.1
AY017302.1 AY322357.1 AY665681.1 AY844048.1 AY017303.1 AY322358.1
AY665682.1 AY844049.1 AY017304.1 AY322359.1 AY675348.1 AY844050.1
AY046527.2 AY322360.1 AY743343.1 AY844051.1 AY052751.3 AY322361.1
AY743344.1 AY844052.1 AY052787.2 AY322362.1 AY743345.1 AY844053.1
AY072037.1 AY322363.1 AY751301.1 AY844054.1 AY072038.1 AY327520.1
AY787659.1 AY844055.1 AY075039.1 AY333967.1 AY804220.1 AY844056.1
AY103683.1 AY333970.1 AY804228.1 AY844057.1 AY123973.1 AY333975.1
AY804236.1 AY844058.1 AY844059.1 AY844104.1 AY844149.1 AY849378.1
AY844060.1 AY844105.1 AY844150.1 AY865623.2 AY844061.1 AY844106.1
AY844151.1 AY865624.1 AY844062.1 AY844107.1 AY844152.1 AY866432.1
AY844063.1 AY844108.1 AY844153.1 AY874457.1 AY844064.1 AY844109.1
AY844154.1 AY874458.1 AY844065.1 AY844110.1 AY844155.1 AY874460.1
AY844066.1 AY844111.1 AY844156.1 AY874462.1 AY844067.1 AY844112.1
AY844157.1 AY874465.1 AY844068.1 AY844113.1 AY844158.1 AY874467.1
AY844069.1 AY844114.1 AY844159.1 AY904721.1 AY844070.1 AY844115.1
AY844160.1 AY959314.1 AY844071.1 AY844116.1 AY844161.1 AY959316.1
AY844072.1 AY844117.1 AY844162.1 AY965743.1 AY844073.1 AY844118.1
AY844163.1 AY965744.1 AY844074.1 AY844119.1 AY844164.1 AY965745.1
AY844075.1 AY844120.1 AY844165.1 AY965746.1 AY844076.1 AY844121.1
AY844166.1 AY971012.1 AY844077.1 AY844122.1 AY844167.1 BA000030.3
AY844078.1 AY844123.1 AY844168.1 BA000035.2 AY844079.1 AY844124.1
AY844169.1 BC021799.1 AY844080.1 AY844125.1 AY844170.1 BC027179.1
AY844081.1 AY844126.1 AY844171.1 BC028311.1 AY844082.1 AY844127.1
AY844172.1 BC052608.1 AY844083.1 AY844128.1 AY844173.1 BC067064.1
AY844084.1 AY844129.1 AY844174.1 BC073623.1 AY844085.1 AY844130.1
AY844175.1 BC076406.1 AY844086.1 AY844131.1 AY844176.1 BC079678.1
AY844087.1 AY844132.1 AY844177.1 BC082330.1 AY844088.1 AY844133.1
AY844178.1 BC097647.1 AY844089.1 AY844134.1 AY844179.1 BC106678.1
AY844090.1 AY844135.1 AY844180.1 BC118918.1 AY844091.1 AY844136.1
AY844181.1 BC118919.1 AY844092.1 AY844137.1 AY844182.1 BC129260.1
AY844093.1 AY844138.1 AY844183.1 BC135907.1 AY844094.1 AY844139.1
AY844184.1 BC155086.1 AY844095.1 AY844140.1 AY844185.1 BC160742.1
AY844096.1 AY844141.1 AY844186.1 BC164222.1 AY844097.1 AY844142.1
AY844187.1 BT009357.1 AY844098.1 AY844143.1 AY844188.1 BT013158.1
AY844099.1 AY844144.1 AY844189.1 BT027443.1 AY844100.1 AY844145.1
AY844190.1 BT031730.1 AY844101.1 AY844146.1 AY844191.1 BT031775.1
AY844102.1 AY844147.1 AY844192.1 BX295539.1 AY844103.1 AY844148.1
AY844193.1 BX571966.1 BX842680.1 CR555306.1 DQ058416.1 DQ282930.1
BX901913.8 CR788249.6 DQ060504.1 DQ282931.1 BX950229.1 CR931725.6
DQ060505.1 DQ282932.1 CP000031.1 CR931761.6 DQ060506.1 DQ282933.1
CP000085.1 CT025512.2 DQ060507.1 DQ282934.1 CP000094.1 CT573213.2
DQ060508.1 DQ282935.1 CP000103.1 CU222605.1 DQ060509.1 DQ282936.1
CP000112.1 CU223051.1 DQ060510.1 DQ282937.1 CP000115.1 CU223259.1
DQ100014.1 DQ282938.1 CP000125.1 CU231766.1 DQ100027.1 DQ282939.1
CP000150.1 CU233197.1 DQ112679.1 DQ282941.1 CP000155.1 CU329671.1
DQ123805.1 DQ282942.1 CP000249.1 CU349019.1 DQ123806.1 DQ282943.1
CP000250.1 CU354092.1 DQ217371.1 DQ282944.1 CP000282.1 CU356179.1
DQ282898.1 DQ282945.1 CP000319.1 CU357691.1 DQ282899.1 DQ282946.1
CP000325.1 CU358302.1 DQ282900.1 DQ282947.1 CP000386.1 CU359628.1
DQ282901.1 DQ282948.1 CP000388.1 CU360132.1 DQ282902.1 DQ282949.1
CP000472.1 CU360325.1 DQ282903.1 DQ282950.1 CP000491.1 CU360391.1
DQ282904.1 DQ282951.1 CP000502.1 CU364068.1 DQ282905.1 DQ282952.1
CP000542.1 CU364414.1 DQ282906.1 DQ282953.1 CP000571.1 CU365186.1
DQ282907.1 DQ282954.1 CP000573.1 CU366348.1 DQ282908.1 DQ282955.1
CP000593.1 CU366591.1 DQ282910.1 DQ282956.1 CP000673.1 CU633872.1
DQ282911.1 DQ282957.1 CP000678.1 CU688450.1 DQ282912.1 DQ282958.1
CP000740.1 CU688451.1 DQ282913.1 DQ282959.1 CP000741.1 CU694472.5
DQ282914.1 DQ282960.1 CP000820.1 D00131.1 DQ282915.1 DQ282961.1
CP000839.1 D00439.1 DQ282916.1 DQ282962.1 CP000854.1 D00440.1
DQ282917.1 DQ282963.1 CP000884.1 D12514.1 DQ282918.1 DQ282964.1
CP000926.1 D17547.1 DQ282919.1 DQ282965.1 CP000930.2 D29686.1
DQ282920.1 DQ282966.1 CP000937.1 D37779.1 DQ282921.1 DQ282967.1
CP000943.1 D37929.1 DQ282922.1 DQ282968.1 CP001037.1 D45385.1
DQ282923.1 DQ282969.1 CP001044.1 D45386.1 DQ282924.1 DQ282970.1
CP001191.1 D63948.1 DQ282925.1 DQ282971.1 CR293496.1 D63949.1
DQ282926.1 DQ282972.1 CR387935.16 D63950.1 DQ282927.1 DQ282973.1
CR388132.9 D87669.1 DQ282928.1 DQ282974.1 CR407683.1 D87670.1
DQ282929.1 DQ282975.1 DQ282976.1 DQ283021.1 DQ347167.1 DQ360052.1
DQ282977.1 DQ283022.1 DQ347168.1 DQ360053.1 DQ282978.1 DQ283023.1
DQ347169.1 DQ360054.1 DQ282979.1 DQ283024.1 DQ347170.1 DQ360055.1
DQ282980.1 DQ283025.1 DQ347171.1 DQ360056.1 DQ282981.1 DQ283026.1
DQ347172.1 DQ360057.1 DQ282982.1 DQ283027.1 DQ347173.1 DQ360058.1
DQ282983.1 DQ283028.1 DQ347174.1 DQ360059.1 DQ282984.1 DQ283029.1
DQ347175.1 DQ360060.1 DQ282985.1 DQ307747.1 DQ347176.1 DQ360061.1
DQ282986.1 DQ307748.1 DQ347177.1 DQ360062.1 DQ282987.1 DQ307749.1
DQ347178.1 DQ360063.1 DQ282988.1 DQ347130.1 DQ347179.1 DQ360064.1
DQ282989.1 DQ347131.1 DQ347180.1 DQ360065.1 DQ282990.1 DQ347132.1
DQ347181.1 DQ360066.1 DQ282991.1 DQ347133.1 DQ347182.1 DQ360067.1
DQ282992.1 DQ347134.1 DQ347183.1 DQ360068.1 DQ282993.1 DQ347138.1
DQ347184.1 DQ388569.1 DQ282994.1 DQ347139.1 DQ347185.1 DQ388570.1
DQ282995.1 DQ347140.1 DQ347187.1 DQ408547.1
DQ282996.1 DQ347141.1 DQ347189.1 DQ408550.1 DQ282997.1 DQ347142.1
DQ347190.1 DQ408551.1 DQ282998.1 DQ347144.1 DQ347191.1 DQ458272.1
DQ282999.1 DQ347145.1 DQ347192.1 DQ458273.1 DQ283000.1 DQ347146.1
DQ347193.1 DQ458274.1 DQ283001.1 DQ347147.1 DQ347194.1 DQ458275.1
DQ283002.1 DQ347148.1 DQ347195.1 DQ458276.1 DQ283003.1 DQ347149.1
DQ347196.1 DQ458277.1 DQ283004.1 DQ347150.1 DQ347197.1 DQ458278.1
DQ283005.1 DQ347151.1 DQ347198.1 DQ458279.1 DQ283006.1 DQ347152.1
DQ356947.1 DQ458280.1 DQ283007.1 DQ347153.1 DQ360038.1 DQ458281.1
DQ283008.1 DQ347154.1 DQ360039.1 DQ458283.1 DQ283009.1 DQ347155.1
DQ360040.1 DQ458284.1 DQ283010.1 DQ347156.1 DQ360041.1 DQ458285.1
DQ283011.1 DQ347157.1 DQ360042.1 DQ458286.1 DQ283012.1 DQ347158.1
DQ360043.1 DQ503140.1 DQ283013.1 DQ347159.1 DQ360044.1 DQ503141.1
DQ283014.1 DQ347160.1 DQ360045.1 DQ503142.1 DQ283015.1 DQ347161.1
DQ360046.1 DQ503143.1 DQ283016.1 DQ347162.1 DQ360047.1 DQ503144.1
DQ283017.1 DQ347163.1 DQ360048.1 DQ503145.1 DQ283018.1 DQ347164.1
DQ360049.1 DQ503146.1 DQ283019.1 DQ347165.1 DQ360050.1 DQ503147.1
DQ283020.1 DQ347166.1 DQ360051.1 DQ503148.1 DQ503149.1 DQ532390.1
DQ851196.1 DQ889705.1 DQ503150.1 DQ532391.1 DQ851197.1 DQ889706.1
DQ503151.1 DQ532392.1 DQ851198.1 DQ889707.1 DQ503152.1 DQ532393.1
DQ851199.1 DQ889708.1 DQ503153.1 DQ532394.1 DQ851200.1 DQ889709.1
DQ503154.1 DQ532395.1 DQ851201.1 DQ889710.1 DQ503155.1 DQ532396.1
DQ851202.1 DQ891466.2 DQ503156.1 DQ532397.1 DQ851203.1 DQ894649.2
DQ503157.1 DQ532398.1 DQ851204.1 DQ902581.1 DQ503158.1 DQ532399.1
DQ851205.1 DQ990911.1 DQ503159.1 DQ532400.1 DQ851206.1 DQ992481.1
DQ503160.1 DQ532401.1 DQ851207.1 DQ992482.1 DQ503161.1 DQ532402.1
DQ851208.1 DQ992483.1 DQ503162.1 DQ532403.1 DQ851209.1 EF070147.1
DQ503163.1 DQ532404.1 DQ851210.1 EF070148.1 DQ503164.1 DQ532405.1
DQ851211.1 EF070149.1 DQ503165.1 DQ532406.1 DQ851212.1 EF070150.1
DQ503166.1 DQ532407.1 DQ851213.1 EF102109.1 DQ503167.1 DQ532408.1
DQ851214.1 EF102110.1 DQ503168.1 DQ532409.1 DQ851215.1 EF102111.1
DQ503169.1 DQ532410.1 DQ851216.1 EF102112.1 DQ503170.1 DQ532411.1
DQ851217.1 EF158427.1 DQ503171.1 DQ532412.1 DQ851218.1 EF158428.1
DQ503172.1 DQ532413.1 DQ851219.1 EF183483.1 DQ503173.1 DQ532414.1
DQ851220.1 EF183484.1 DQ503174.1 DQ532415.1 DQ851221.1 EF363553.1
DQ513312.1 DQ532416.1 DQ851222.1 EF364134.1 DQ513313.1 DQ532417.1
DQ851223.1 EF364135.1 DQ530058.1 DQ532418.1 DQ889677.1 EF364136.1
DQ530059.1 DQ532419.1 DQ889678.1 EF364137.1 DQ532375.1 DQ532420.1
DQ889690.1 EF364138.1 DQ532376.1 DQ532421.1 DQ889691.1 EF364139.1
DQ532377.1 DQ532422.1 DQ889692.1 EF364140.1 DQ532378.1 DQ532423.1
DQ889693.1 EF364141.1 DQ532379.1 DQ532424.1 DQ889694.1 EF364142.1
DQ532380.1 DQ532425.1 DQ889695.1 EF364143.1 DQ532381.1 DQ532426.1
DQ889696.1 EF364144.1 DQ532382.1 DQ532427.1 DQ889697.1 EF364145.1
DQ532383.1 DQ532428.1 DQ889698.1 EF364146.1 DQ532384.1 DQ532429.1
DQ889699.1 EF364147.1 DQ532385.1 DQ532430.1 DQ889700.1 EF364148.1
DQ532386.1 DQ532431.1 DQ889701.1 EF364149.1 DQ532387.1 DQ532432.1
DQ889702.1 EF364150.1 DQ532388.1 DQ838002.1 DQ889703.1 EF364151.1
DQ532389.1 DQ841551.1 DQ889704.1 EF364152.1 EF364153.1 EF364198.1
EF364329.1 EF376135.1 EF364154.1 EF364199.1 EF364330.1 EF376136.1
EF364155.1 EF364200.1 EF364331.1 EF376137.1 EF364156.1 EF364201.1
EF364332.1 EF376138.1 EF364157.1 EF364202.1 EF364333.1 EF376139.1
EF364158.1 EF364203.1 EF364334.1 EF376140.1 EF364159.1 EF364204.1
EF364335.1 EF376141.1 EF364160.1 EF364205.1 EF364336.1 EF376142.1
EF364161.1 EF364206.1 EF364337.1 EF376143.1 EF364162.1 EF364207.1
EF364338.1 EF376145.1 EF364163.1 EF364208.1 EF364339.1 EF376146.1
EF364164.1 EF364209.1 EF364340.1 EF376148.1 EF364165.1 EF364210.1
EF364341.1 EF376149.1 EF364166.1 EF364211.1 EF364342.1 EF376150.1
EF364167.1 EF364212.1 EF364343.1 EF376151.1 EF364168.1 EF364213.1
EF364344.1 EF376152.1 EF364169.1 EF364214.1 EF364345.1 EF376153.1
EF364170.1 EF364215.1 EF364346.1 EF376154.1 EF364171.1 EF364216.1
EF364347.1 EF376155.1 EF364172.1 EF364217.1 EF364348.1 EF376156.1
EF364173.1 EF364218.1 EF364349.1 EF376157.1 EF364174.1 EF364219.1
EF364350.1 EF376158.1 EF364175.1 EF364220.1 EF364351.1 EF376159.1
EF364176.1 EF364221.1 EF364352.1 EF395959.1 EF364177.1 EF364222.1
EF364353.1 EF395960.1 EF364178.1 EF364223.1 EF364354.1 EF395961.1
EF364179.1 EF364224.1 EF364355.1 EF395962.1 EF364180.1 EF364311.1
EF364356.1 EF395963.1 EF364181.1 EF364312.1 EF364357.1 EF395964.1
EF364182.1 EF364313.1 EF364358.1 EF395965.1 EF364183.1 EF364314.1
EF364359.1 EF395966.1 EF364184.1 EF364315.1 EF364360.1 EF395967.1
EF364185.1 EF364316.1 EF364361.1 EF395968.1 EF364186.1 EF364317.1
EF364362.1 EF395969.1 EF364187.1 EF364318.1 EF376122.1 EF395970.1
EF364188.1 EF364319.1 EF376123.1 EF395971.1 EF364189.1 EF364320.1
EF376125.1 EF395972.1 EF364190.1 EF364321.1 EF376126.1 EF395973.1
EF364191.1 EF364322.1 EF376128.1 EF395974.1 EF364192.1 EF364323.1
EF376129.1 EF395975.1 EF364193.1 EF364324.1 EF376130.1 EF395976.1
EF364194.1 EF364325.1 EF376131.1 EF395977.1 EF364195.1 EF364326.1
EF376132.1 EF395978.1 EF364196.1 EF364327.1 EF376133.1 EF395979.1
EF364197.1 EF364328.1 EF376134.1 EF395980.1 EF395981.1 EF493477.1
EF571127.1 EF646495.1 EF395982.1 EF493478.1 EF571128.1 EF646497.1
EF395983.1 EF493479.1 EF571129.1 EF646498.1 EF395984.1 EF493480.1
EF571130.1 EF646499.1 EF395985.1 EF493481.1 EF571131.1 EF646500.1
EF395986.1 EF493482.1 EF571132.1 EF646501.1 EF395987.1 EF493483.1
EF571133.1 EF646502.1 EF395988.1 EF493484.1 EF571134.1 EF646503.1
EF395989.1 EF493485.1 EF571135.1 EF646504.1 EF395990.1 EF493486.1
EF571136.1 EF646506.1 EF395991.1 EF493487.1 EF571137.1 EF646507.1
EF395992.1 EF493488.1 EF571138.1 EF646508.1 EF395993.1 EF493489.1
EF571139.1 EF650014.1 EF405957.1 EF493490.1 EF571140.1 EF650016.1
EF407506.1 EF493491.1 EF571141.1 EF650017.1 EF432113.1 EF493492.1
EF571142.1 EF675246.1 EF445968.1 EF493493.1 EF571143.1 EF675247.1
EF445969.1 EF493494.1 EF571144.1 EF675248.1 EF445970.1 EF493495.1
EF571145.1 EF675249.1 EF445971.1 EF493496.1 EF571146.1 EF675250.1
EF445972.1 EF493497.1 EF571147.1 EF675251.1 EF445973.1 EF493498.1
EF571148.1 EF675252.1 EF445974.1 EF493500.1 EF623826.1 EF675253.1
EF493455.1 EF493501.1 EF635860.1 EF675254.1 EF493456.1 EF493502.1
EF646474.1 EF675255.1 EF493457.1 EF493503.1 EF646475.1 EF675256.1
EF493458.1 EF493504.1 EF646476.1 EF675257.1 EF493459.1 EF493505.1
EF646477.1 EF675258.1 EF493460.1 EF493506.1 EF646478.1 EF675259.1
EF493461.1 EF493507.1 EF646479.1 EF675260.1 EF493462.1 EF493509.1
EF646480.1 EF675261.1 EF493463.1 EF493510.1 EF646481.1 EF675262.1
EF493464.1 EF571114.1 EF646482.1 EF675263.1 EF493465.1 EF571115.1
EF646483.1 EF675264.1 EF493466.1 EF571116.1 EF646484.1 EF675265.1
EF493467.1 EF571117.1 EF646485.1 EF675266.1 EF493468.1 EF571118.1
EF646486.1 EF675267.1 EF493469.1 EF571119.1 EF646487.1 EF675268.1
EF493470.1 EF571120.1 EF646488.1 EF675269.1 EF493471.1 EF571121.1
EF646489.1 EF675270.1 EF493472.1 EF571122.1 EF646490.1 EF675271.1
EF493473.1 EF571123.1 EF646491.1 EF675272.1 EF493474.1 EF571124.1
EF646492.1 EF675273.1 EF493475.1 EF571125.1 EF646493.1 EF675274.1
EF493476.1 EF571126.1 EF646494.1 EF675275.1 EF675276.1 EF675321.1
EF988277.1 EF988327.1 EF675277.1 EF675322.1 EF988278.1 EF988328.1
EF675278.1 EF675323.1 EF988279.1 EF988329.1 EF675279.1 EF675324.1
EF988280.1 EF988330.1 EF675280.1 EF675325.1 EF988281.1 EF988331.1
EF675281.1 EF675326.1 EF988282.1 EF988332.1 EF675282.1 EF675327.1
EF988283.1 EU022001.1 EF675283.1 EF675328.1 EF988284.1 EU028313.1
EF675284.1 EF675329.1 EF988285.1 EU037991.1 EF675285.1 EF675330.1
EF988286.1 EU046599.1 EF675286.1 EF675331.1 EF988287.1 EU046600.1
EF675287.1 EF675332.1 EF988288.1 EU048225.1 EF675288.1 EF675333.1
EF988289.1 EU076756.1 EF675289.1 EF675334.1 EF988290.1 EU076757.1
EF675290.1 EF675335.1 EF988291.1 EU076758.1 EF675291.1 EF675336.1
EF988292.1 EU076759.1 EF675292.1 EF675337.1 EF988293.1 EU076760.1
EF675293.1 EF675338.1 EF988294.1 EU076761.1 EF675294.1 EF675339.1
EF988295.1 EU076762.1 EF675295.1 EF675340.1 EF988296.1 EU076763.1
EF675296.1 EF675341.1 EF988297.1 EU076764.1 EF675297.1 EF675342.1
EF988298.1 EU076765.1 EF675298.1 EF675343.1 EF988299.1 EU076766.1
EF675299.1 EF675344.1 EF988300.1 EU076767.1 EF675300.1 EF675345.1
EF988301.1 EU076768.1 EF675301.1 EF675346.1 EF988302.1 EU076769.1
EF675302.1 EF675347.1 EF988303.1 EU076770.1 EF675303.1 EF675348.1
EF988304.1 EU076771.1 EF675304.1 EF675349.1 EF988305.1 EU076772.1
EF675305.1 EF675350.1 EF988306.1 EU076773.1 EF675306.1 EF675351.1
EF988308.1 EU076774.1 EF675307.1 EF675352.1 EF988309.1 EU076775.1
EF675308.1 EF675353.1 EF988310.1 EU076776.1 EF675309.1 EF675354.1
EF988311.1 EU076777.1 EF675310.1 EF675355.1 EF988312.1 EU076778.1
EF675311.1 EF675356.1 EF988313.1 EU076779.1 EF675312.1 EF675357.1
EF988314.1 EU076780.1 EF675313.1 EF675358.1 EF988315.1 EU076781.1
EF675314.1 EF675359.1 EF988316.1 EU076782.1 EF675315.1 EF675360.1
EF988317.1 EU076783.1 EF675316.1 EF675361.1 EF988318.1 EU076784.1
EF675317.1 EF988271.1 EF988320.1 EU076785.1 EF675318.1 EF988273.1
EF988322.1 EU076786.1 EF675319.1 EF988274.1 EF988325.1 EU076787.1
EF675320.1 EF988276.1 EF988326.1 EU076788.1 EU076789.1 EU215590.1
EU769531.1 M11302.1 EU076790.1 EU215591.1 EU769532.1 M11582.1
EU076791.1 EU215592.1 EU769533.1 M20234.1 EU076792.1 EU215593.1
EU769534.1 M24560.1 EU076793.1 EU215594.1 EU769535.1 M26729.1
EU076794.1 EU215595.1 EU769536.1 M27160.1 EU076795.1 EU215596.1
EU769537.1 M32843.1 EU076796.1 EU215597.1 EU769538.1 M33271.1
EU076797.1 EU215598.1 EU769539.1 M57288.1 EU076798.1 EU215599.1
EU769540.1 M63235.1 EU076799.1 EU215600.1 EU769541.1 M63237.1
EU076800.1 EU215601.1 EU769542.1 M74314.1 EU076801.1 EU215602.1
EU769543.1 M95196.1 EU076802.1 EU215603.1 EU769544.1 M95197.1
EU076803.1 EU215604.1 EU769545.1 NG_008748.1 EU076804.1 EU215605.1
EU769546.1 NM_000372.4 EU076805.1 EU215606.1 EU769547.1 NM_000550.2
EU076806.1 EU215607.1 EU769548.1 NM_001002749.1 EU126854.1
EU215608.1 EU769549.1 NM_001002941.1 EU139474.1 EU215609.1
EU769550.1 NM_001012666.1 EU147298.1 EU215610.1 EU769551.1
NM_001016476.2 EU154993.1 EU215611.1 EU787433.1 NM_001017161.2
EU186763.1 EU215612.1 EU939720.1 NM_001022594.1 EU186764.1
EU215613.1 EU939721.1 NM_001025212.1 EU186765.1 EU275350.1
EU939722.1 NM_001025226.1 EU186766.1 EU330225.1 EU939723.1
NM_001025227.1 EU186768.1 EU371651.1 EU955868.1 NM_001033837.1
EU186769.1 EU371652.1 EU956830.1 NM_001039975.1 EU186770.1
EU371653.1 EU963699.1 NM_001042560.2 EU186771.1 EU371654.1
EU966440.1 NM_001051031.1 EU186772.1 EU371656.1 FJ184078.1
NM_001060465.1 EU186773.1 EU371657.1 FJ210643.1 NM_001060466.1
EU186774.1 EU522120.1 FJ210644.1 NM_001060467.1 EU186775.1
EU523113.1 FJ210645.1 NM_001076816.1 EU186776.1 EU554632.1
FM178478.1 NM_001081840.1 EU186777.1 EU555188.1 FM864217.1
NM_001082077.1 EU186778.1 EU627590.1 FM877576.1 NM_001087023.1
EU186779.1 EU627691.1 FM877577.1 NM_001103048.1 EU186780.1
EU760771.1 J02835.1 NM_001104802.1 EU215584.1 EU760773.1 J03581.1
NM_001106664.1 EU215585.1 EU769526.1 L18967.1 NM_001107535.1
EU215586.1 EU769527.1 L23649.2 NM_001123643.1 EU215587.1 EU769528.1
L29450.1 NM_001123688.1 EU215588.1 EU769529.1 L46685.1
NM_001124219.1 EU215589.1 EU769530.1 L46805.1 NM_001124222.1
NM_001128295.1 NW_001914846.1 XM_001083014.1 XM_001344111.2
NM_001129889.1 NW_001914848.1 XM_001083129.1 XM_001372445.1
NM_001130023.1 NW_001914849.1 XM_001104954.1 XM_001377599.1
NM_001130024.1 NW_001914850.1 XM_001105033.1 XM_001386428.1
NM_001130027.1 NW_001914851.1 XM_001111475.1 XM_001389305.1
NM_001922.3 NW_001914853.1 XM_001118455.1 XM_001389898.1
NM_010024.3 NW_001914855.1 XM_001136041.1 XM_001393577.1
NM_011661.4 NW_001914856.1 XM_001195948.1 XM_001393787.1
NM_031202.2 NW_001914857.1 XM_001208556.1 XM_001395217.1
NM_032866.3 NW_001914859.1 XM_001209390.1 XM_001397251.1
NM_058730.2 NW_002196562.1 XM_001212742.1 XM_001402482.1
NM_059308.5 NW_002196563.1 XM_001218713.1 XM_001403959.1
NM_059654.3 NW_002196567.1 XM_001219433.1 XM_001405108.1
NM_066310.3 NW_002196569.1 XM_001220372.1 XM_001410762.1
NM_067435.3 S40548.1 XM_001220433.1 XM_001410969.1 NM_068586.5
S56788.1 XM_001220654.1 XM_001413345.1 NM_077759.3 S56789.1
XM_001221472.1 XM_001470114.1 NM_077760.3 S69231.1 XM_001221522.1
XM_001474717.1 NM_131013.1 S71755.1 XM_001222188.1 XM_001491619.2
NM_131555.1 S81675.1 XM_001222231.1 XM_001492560.2 NM_174480.3
U01873.1 XM_001223254.1 XM_001499964.1 NM_181001.2 U19270.1
XM_001224265.1 XM_001507059.1 NM_204160.1 U22921.1 XM_001224529.1
XM_001511967.1 NM_204935.1 U22922.1 XM_001226375.1 XM_001512063.1
NM_205045.1 U42219.1 XM_001227018.1 XM_001521874.1 NW_001263857.1
U46014.1 XM_001227695.1 XM_001521941.1 NW_001594031.1 U66807.1
XM_001227852.1 XM_001521969.1 NW_001594096.1 U66808.1
XM_001228418.1 XM_001537500.1 NW_001594210.1 U80928.5
XM_001228654.1 XM_001538844.1 NW_001594218.1 U83274.1
XM_001229971.1 XM_001539679.1 NW_001594271.1 U97407.2
XM_001238383.1 XM_001540372.1 NW_001594359.1 X03687.1
XM_001239849.1 XM_001540446.1 NW_001594360.1 X12782.1
XM_001240419.1 XM_001541520.1 NW_001594468.1 X16073.1
XM_001243417.1 XM_001543595.1 NW_001849580.1 X51420.1
XM_001258740.1 XM_001545601.1 NW_001884663.1 X51455.1
XM_001264009.1 XM_001546170.1 NW_001884666.1 X63349.1
XM_001266670.1 XM_001546443.1 NW_001884668.1 X69526.1
XM_001267633.1 XM_001555260.1 NW_001884670.1 X85113.1
XM_001272229.1 XM_001556657.1 NW_001884672.1 X89382.1
XM_001273481.1 XM_001559424.1 NW_001884674.1 X90869.1
XM_001273821.1 XM_001560118.1 NW_001884677.1 X95703.1
XM_001276415.1 XM_001560753.1 NW_001884682.1 X95705.1
XM_001276725.1 XM_001560913.1 NW_001914832.1 XM_001078239.1
XM_001328999.1 XM_001584514.1 NW_001914843.1 XM_001082890.1
XM_001341424.2 XM_001585797.1 XM_001594867.1 XM_001794545.1
XM_001892395.1 XM_002125300.1 XM_001597332.1 XM_001794653.1
XM_001892420.1 XM_002128389.1 XM_001625411.1 XM_001794931.1
XM_001893247.1 XM_002128413.1 XM_001635746.1 XM_001796832.1
XM_001895139.1 XM_002129643.1 XM_001636194.1 XM_001798236.1
XM_001901072.1 XM_002130461.1 XM_001638262.1 XM_001798742.1
XM_001901073.1 XM_002143212.1 XM_001638766.1 XM_001800083.1
XM_001903268.1 XM_002146447.1 XM_001638830.1 XM_001801793.1
XM_001904803.1 XM_002149565.1 XM_001640169.1 XM_001802391.1
XM_001905139.1 XM_002153381.1 XM_001645672.1 XM_001802709.1
XM_001905238.1 XM_224517.4 XM_001649753.1 XM_001803635.1
XM_001905452.1 XM_361488.1 XM_001665827.1 XM_001803854.1
XM_001905547.1 XM_362982.1 XM_001666225.1 XM_001804507.1
XM_001906049.1 XM_363645.2 XM_001666226.1 XM_001805205.1
XM_001906413.1 XM_364769.1 XM_001667828.1 XM_001818619.1
XM_001906450.1 XM_365447.2 XM_001668104.1 XM_001818856.1
XM_001906989.1 XM_366343.2 XM_001670594.1 XM_001820921.1
XM_001907225.1 XM_366367.1 XM_001678316.1 XM_001821589.1
XM_001909833.1 XM_367508.1 XM_001693100.1 XM_001823590.1
XM_001910225.1 XM_367674.1 XM_001697086.1 XM_001824175.1
XM_001912748.1 XM_369295.2 XM_001701312.1 XM_001824991.1
XM_001912994.1 XM_369550.1 XM_001727078.1 XM_001827481.1
XM_001916811.1 XM_369602.2 XM_001727530.1 XM_001828111.1
XM_001919765.1 XM_382164.1 XM_001728018.1 XM_001828125.1
XM_001930339.1 XM_383524.1 XM_001728211.1 XM_001829397.1
XM_001931225.1 XM_383707.1 XM_001743609.1 XM_001829411.1
XM_001931717.1 XM_383794.1 XM_001750461.1 XM_001829654.1
XM_001931950.1 XM_383977.1 XM_001752106.1 XM_001829655.1
XM_001931983.1 XM_384686.1 XM_001752310.1 XM_001829755.1
XM_001932641.1 XM_385045.1 XM_001755025.1 XM_001829756.1
XM_001932672.1 XM_385804.1 XM_001760085.1 XM_001830172.1
XM_001933667.1 XM_388183.1 XM_001766207.1 XM_001831367.1
XM_001935114.1 XM_389698.1 XM_001766285.1 XM_001831834.1
XM_001938308.1 XM_391282.1 XM_001766501.1 XM_001832157.1
XM_001939194.1 XM_391596.1 XM_001767122.1 XM_001832731.1
XM_001939588.1 XM_391626.1 XM_001770057.1 XM_001832740.1
XM_001941106.1 XM_391693.1 XM_001772163.1 XM_001834494.1
XM_002119109.1 XM_391704.1 XM_001773718.1 XM_001834544.1
XM_002119565.1 XM_396715.2 XM_001777448.1 XM_001835487.1
XM_002119639.1 XM_508687.2 XM_001785193.1 XM_001835984.1
XM_002122507.1 XM_520488.2 XM_001785572.1 XM_001838066.1
XM_002122831.1 XM_531934.2 XM_001785897.1 XM_001838103.1
XM_002123004.1 XM_542639.2 XM_001791723.1 XM_001885099.1
XM_002124302.1 XM_633156.1 XM_001794065.1 XM_001885343.1
XM_002124333.1 XM_633850.1 XM_001794296.1 XM_001885517.1
XM_002125243.1 XM_633851.1 XM_652743.1 XM_746360.1 XM_954050.1
Z12833.1 XM_653830.1 XM_748017.1 XM_954644.1 Z12834.1 XM_655311.1
XM_752184.1 XM_959730.1 Z12835.1 XM_657823.1 XM_756546.1
XM_959839.2 Z12836.1 XM_658954.1 XM_787355.2 XM_969232.2 Z12837.1
XM_658998.1 XM_859433.1 XR_022754.1 Z12838.1 XM_659572.1
XM_859450.1 Y00819.1 Z27411.1 XM_676089.1 XM_870599.3 Y12501.1
Z66559.1 XM_676612.1 XM_952931.2 Y13219.2 Z71261.1 XM_677565.1
XM_953538.2 Z11702.1 Z81568.1 XM_743335.1
Example 21
Additional Monophenol Oxidases
[0216] Given the demonstration of strong SCN activity by enzyme
from bacteria and fungi with homology to monophenol
oxidase/tyrosinases, it is now apparent that many previously
identified enzymes of this class will exhibit activity on SCN.
Example 22
Assays for Nematicidal Activity
[0217] The nucleotide sequences of the invention can be tested for
their ability to produce nematicidal proteins. The ability of a
protein to act as a pesticide upon a nematode pest is often
assessed in a number of ways. One way well known in the art is to
perform a feeding assay. In such a feeding assay, one exposes the
pest to a sample containing either compounds to be tested or
control samples. Often this is performed by placing the material to
be tested, or a suitable dilution of such material, onto a material
that the pest will ingest, such as an artificial diet. The material
to be tested may be composed of a liquid, solid, or slurry. The
material to be tested may be placed upon the surface and then
allowed to dry. Alternatively, the material to be tested may be
mixed with a molten artificial diet, then dispensed into the assay
chamber. The assay chamber may be, for example, a cup, a dish, or a
well of a microtiter plate.
[0218] Other types of assays can include microinjection of the test
material into the mouth, or gut of the pest, as well as development
of transgenic plants, followed by test of the ability of the pest
to feed upon the transgenic plant. Plant testing may involve
isolation of the plant parts normally consumed, for example, small
cages attached to a leaf, or isolation of entire plants in cages
containing insects.
[0219] Other methods and approaches to assay pests are known in the
art, and can be found, for example in Robertson and Preisler, eds.
(1992) Pesticide bioassays with arthropods, CRC, Boca Raton, Fla.
Alternatively, assays are commonly described in the journals
Arthropod Management Tests and Journal of Economic Entomology or by
discussion with members of the Entomological Society of America
(ESA).
Example 23
Synthetic Gene Sequences
[0220] The following genes were designed that encode either the
AXN-1, AXN-2, AXN-8, or AXN-9 amino acid sequences, but utilizing a
different nucleotide sequence.
SEQ ID NO:6 describes a novel AXN-1 encoding nucleotide sequence
SEQ ID NO:10 describes a novel AXN-2 encoding nucleotide sequence
SEQ ID NO:15 describes a novel AXN-8 encoding nucleotide sequence
SEQ ID NO:17 describes a novel nucleotide sequence encoding the
protein predicted from GENBANK accession number AK246031 from
Glycine max. SEQ ID NO:21 describes a novel nucleotide sequence
encoding the protein predicted from genbank accession number the
cDNA with GENBANK accession number AM418385 encoding a T. reesei
enzyme.
Example 24
Vectoring of Genes for Plant Expression
[0221] The coding regions of the invention are connected with
appropriate promoter and terminator sequences for expression in
plants. Such sequences are well known in the art and may include
the rice actin promoter or maize ubiquitin promoter for expression
in monocots, the Arabidopsis UBQ3 promoter or CaMV 35S promoter for
expression in dicots, and the nos or PinII terminators. Techniques
for producing and confirming promoter-gene-terminator constructs
also are well known in the art.
[0222] In one aspect of the invention, synthetic DNA sequences are
designed and generated. These synthetic sequences have altered
nucleotide sequence relative to the parent sequence, but encode
proteins that are essentially identical to the parent amino acid
sequence.
[0223] In another aspect of the invention, modified versions of the
synthetic genes are designed such that the resulting peptide is
targeted to a plant organelle, such as the endoplasmic reticulum or
the apoplast. Peptide sequences known to result in targeting of
fusion proteins to plant organelles are known in the art. For
example, the N-terminal region of the acid phosphatase gene from
the White Lupin Lupinus albus (GENBANK.RTM.ID GI:14276838, Miller
et al. (2001) Plant Physiology 127: 594-606) is known in the art to
result in endoplasmic reticulum targeting of heterologous proteins.
If the resulting fusion protein also contains an endoplasmic
reticulum retention sequence comprising the peptide
N-terminus-lysine-aspartic acid-glutamic acid-leucine (i.e., the
"KDEL" motif, SEQ ID NO:30) at the C-terminus, the fusion protein
will be targeted to the endoplasmic reticulum. If the fusion
protein lacks an endoplasmic reticulum targeting sequence at the
C-terminus, the protein will be targeted to the endoplasmic
reticulum, but will ultimately be sequestered in the apoplast.
[0224] Thus, this gene encodes a fusion protein that contains the
N-terminal thirty-one amino acids of the acid phosphatase gene from
the White Lupin Lupinus albus (GENBANK.RTM. ID GI:14276838, Miller
et al., 2001, supra) fused to the N-terminus of the sequence of the
invention, as well as the KDEL sequence at the C-terminus. Thus,
the resulting protein is predicted to be targeted the plant
endoplasmic reticulum upon expression in a plant cell.
[0225] The plant expression cassettes described above are combined
with an appropriate plant selectable marker to aid in the selection
of transformed cells and tissues, and ligated into plant
transformation vectors. These may include binary vectors from
Agrobacterium-mediated transformation or simple plasmid vectors for
aerosol or biolistic transformation.
Example 25
Vectoring genes for Plant Expression
[0226] The coding region DNA of the genes encompassed herein are
operably connected with appropriate promoter and terminator
sequences for expression in plants. Such sequences are well known
in the art and may include the rice actin promoter or maize
ubiquitin promoter for expression in monocots, the Arabidopsis UBQ3
promoter or CaMV 35S promoter for expression in dicots, and the nos
or PinII terminators. Techniques for producing and confirming
promoter-gene-terminator constructs also are well known in the
art.
[0227] The plant expression cassettes described above are combined
with an appropriate plant selectable marker to aid in the
selections of transformed cells and tissues, and ligated into plant
transformation vectors. These may include binary vectors from
Agrobacterium-mediated transformation or simple plasmid vectors for
aerosol or biolistic transformation.
Example 26
AXN-1 Protein Expression in Soybean Root Tissue
Vector Construction
[0228] Vector pAG6004 was prepared to guide overexpression of the
AXN-1 protein in soybean hairy root tissues. pAG6004 contains the
full-length AXN-1 gene, organized 3' to the UBQ10 promoter
(Arabidopsis thaliana) and 5' to the 35S terminator (cauliflower
mosaic virus), in a manner likely to lead to transcription of the
axn-1 gene from the UBQ10 promoter, and termination of such
transcription by the 35S terminator. Also present in the vector is
a visual marker (yellow fluorescent protein (YFP), under control of
UBQ3 promoter), a replication origin functional in Agrobacterium
species, and a gentamicin resistance gene. The organization of the
vector was confirmed by DNA sequencing of the entire vector, and
then introduced into Agrobacterium rhizogenes strain K599 and
propagated by growth on gentamicin.
Soybean Transformation
[0229] Soybean hairy root cultures were prepared as follows.
Soybean seeds (cultivar Williams 82) were germinated in a growth
chamber (25.degree. C.) for 1 week, at which time the cotyledons
were excised (after removing seed coats). The cotyledons were then
wounded with a scalpel that had been dipped in an overnight A.
rhizogenes culture transformed with pAG6004. The infected
cotyledons were placed abaxial side up on the top of a Whatman
filter paper, submerged in sterile water in a Petri dish and
incubated in a dark growth chamber at 25.degree. C. for 3 to 5
days. Next, individual cotyledons were transferred to and cultured
abaxial side up on MB carb medium (MS salts, B5 vitamins, 3%
sucrose, 500 mg/L of carbenicillin, and solidified with 3 g/L of
Gelrite). Cotyledons were sub-cultured every two weeks on the same
MB carb medium to regenerate hairy roots. Roots expressing yellow
fluorescent protein (YFP) gene associated with AXN-1 gene derived
from pAG6004 were detected under a ZEISS stereo dissecting
microscope (KL 1500 LCD) with filter set (exciting filter 508 nm;
emission filter 524 nm). YFP roots were sub-cultured on the same MB
carb medium every two weeks or as needed.
Detection of AXN-1 Protein in Root Tissue
[0230] Western blot analysis was utilized to identify AXN-1 protein
expression in hairy root tissues. One gram of transgenic and
control tissues that had been grown for approximately 6 weeks were
suspended 2.times.LDS loading dye (Invitrogen) with 2.5 mM
(3-mercaptoethanol, and then homogenized using stainless beads in a
bead beater instrument. The homogenized extracts were separated on
a 4-20% Big-Tris gel, transferred to nitrocellulose, and then
incubated with rabbit serum from rabbits immunized with purified
AXN-1 protein. Following a series of wash steps and incubation with
a secondary antibody (donkey anti-rabbit, conjugated with
horseradish peroxidase, Pierce), the presence of AXN-1 was
visualized by ECL (Pierce). Interestingly, this analysis revealed
that the soybean roots generated a truncated form of the protein
(approximately 50 kDa) rather than the full-length protein (103
kDa). This observation is consistent with post-translational
processing of the AXN-1 protein, and matches the size of the AXN-1
protein that was purified from the host bacterial strain,
ATX21995.
Detection of AXN-8 Protein in Root Tissue
[0231] Western blot analysis was utilized to identify AXN-8 protein
expression in hairy root tissues. One gram of transgenic and
control tissues that had been grown for approximately 6 weeks were
suspended 2.times.LDS loading dye (Invitrogen) with 2.5 mM
.beta.-mercaptoethanol, and then homogenized using stainless beads
in a bead beater instrument. The homogenized extracts were
separated on a 4-20% Big-Tris gel, transferred to nitrocellulose,
and then incubated with rabbit serum from rabbits immunized with
purified AXN-8 protein. Following a series of wash steps and
incubation with a secondary antibody (donkey anti-rabbit,
conjugated with horseradish peroxidase, Pierce), the presence of
AXN-8 was visualized by ECL (Pierce). This analysis revealed that
the soybean roots generated a full-length AXN-8 protein
(approximately 50 kDa in size) that matches the size of the AXN-8
protein that was purified from the host bacterial strain, ATX20514,
as well as additional truncated forms of the protein.
Detection of Phenol Oxidase Enzymatic Activity in Root Tissue
[0232] Several phenol oxidases, including AXN-1, can utilize
tyrosine as a substrate to produce melanin. To determine if the
AXN-1 protein expressed in soybean hairy roots was enzymatically
active, we carried out enzymatic assays with protein extracts from
AXN-1 (pAG6004) and control (pAG5385) root tissues. Each tissue
(approximately 1 gram) was homogenized in liquid nitrogen, and 10
mg of each was suspended in 0.4 mL of buffer (20 mM Tris, pH 8.0).
Each tissue suspension was then added at 1/10.sup.th final volume
to enzyme assays containing the same buffer and 1 mM tyrosine.
Assay reactions were incubated overnight, and a commercial
tyrosinase preparation (Sigma-Aldrich) was used as a positive
control for enzymatic activity. Both the AXN-1 root tissue and
commercial tyrosinase enzyme generated a brown color in the assay
that is consistent with melanin, while control root tissue was
negative. Color formation was dependent on the presence of the
substrate tyrosine. Thus, axn-1 is effectively expressed in soybean
tissue, resulting in active polyphenol oxidase activity.
Example 27
Transformation of Maize Cells with the Nematicidal Genes Described
Herein
[0233] Maize ears are best collected 8-12 days after pollination.
Embryos are isolated from the ears, and those embryos 0.8-1.5 mm in
size are preferred for use in transformation. Embryos are plated
scutellum side-up on a suitable incubation media, such as DN62A5S
media (3.98 g/L N6 Salts; 1 mL/L (of 1000.times. Stock) N6
Vitamins; 800 mg/L L-Asparagine; 100 mg/L Myo-inositol; 1.4 g/L
L-Proline; 100 mg/L Casamino acids; 50 g/L sucrose; 1 mL/L (of 1
mg/mL Stock) 2,4-D). However, media and salts other than DN62A5S
are suitable and are known in the art. Embryos are incubated
overnight at 25.degree. C. in the dark. However, it is not
necessary per se to incubate the embryos overnight.
[0234] The resulting explants are transferred to mesh squares
(30-40 per plate), transferred onto osmotic media for about 30-45
minutes, then transferred to a beaming plate (see, for example, PCT
Publication No. WO/0138514 and U.S. Pat. No. 5,240,842).
[0235] DNA constructs designed to the genes of the invention in
plant cells are accelerated into plant tissue using an aerosol beam
accelerator, using conditions essentially as described in PCT
Publication No. WO/0138514. After beaming, embryos are incubated
for about 30 min on osmotic media, and placed onto incubation media
overnight at 25.degree. C. in the dark. To avoid unduly damaging
beamed explants, they are incubated for at least 24 hours prior to
transfer to recovery media. Embryos are then spread onto recovery
period media, for about 5 days, 25.degree. C. in the dark, then
transferred to a selection media. Explants are incubated in
selection media for up to eight weeks, depending on the nature and
characteristics of the particular selection utilized. After the
selection period, the resulting callus is transferred to embryo
maturation media, until the formation of mature somatic embryos is
observed. The resulting mature somatic embryos are then placed
under low light, and the process of regeneration is initiated by
methods known in the art. The resulting shoots are allowed to root
on rooting media, and the resulting plants are transferred to
nursery pots and propagated as transgenic plants.
Materials
DN62A5S Media
TABLE-US-00017 [0236] Components Per Liter Source Chu's N6 Basal
Salt 3.98 g/L Phytotechnology Labs Mixture (Prod. No. C 416) Chu's
N6 Vitamin 1 mL/L Phytotechnology Labs Solution (Prod. No. C 149)
(of 1000x Stock) L-Asparagine 800 mg/L Phytotechnology Labs
Myo-inositol 100 mg/L Sigma L-Proline 1.4 g/L Phytotechnology Labs
Casamino acids 100 mg/L Fisher Scientific Sucrose 50 g/L
Phytotechnology Labs 2,4-D (Prod. No. D-7299) 1 mL/L Sigma (of 1
mg/mL Stock)
[0237] The pH of the solution is adjusted to pH 5.8 with 1N KOH/1N
KCl, Gelrite (Sigma) is added at a concentration up to 3 g/L, and
the media is autoclaved. After cooling to 50.degree. C., 2 ml/L of
a 5 mg/ml stock solution of silver nitrate (Phytotechnology Labs)
is added.
Example 28
Transformation of the Genes of the Invention in Plant Cells by
Agrobacterium-Mediated Transformation
[0238] Ears are best collected 8-12 days after pollination. Embryos
are isolated from the ears, and those embryos 0.8-1.5 mm in size
are preferred for use in transformation. Embryos are plated
scutellum side-up on a suitable incubation media, and incubated
overnight at 25.degree. C. in the dark. However, it is not
necessary per se to incubate the embryos overnight. Embryos are
contacted with an Agrobacterium strain containing the appropriate
vectors for Ti plasmid mediated transfer for about 5-10 min, and
then plated onto co-cultivation media for about 3 days (25.degree.
C. in the dark). After co-cultivation, explants are transferred to
recovery period media for about five days (at 25.degree. C. in the
dark). Explants are incubated in selection media for up to eight
weeks, depending on the nature and characteristics of the
particular selection utilized. After the selection period, the
resulting callus is transferred to embryo maturation media, until
the formation of mature somatic embryos is observed. The resulting
mature somatic embryos are then placed under low light, and the
process of regeneration is initiated as known in the art.
Example 29
AXN-8 Protein Expression in Maize Leaf Tissue
Vector Construction
[0239] Vector pAG4146 was prepared to guide overexpression of the
AXN-8 protein in maize tissues. pAG4146 contains the full-length
AXN-8 gene, organized 3' to the sugarcane Ubi4 ubiquitin promoter
(Saccharum sp.) and 5' to the 35S terminator (cauliflower mosaic
virus), in a manner likely to lead to transcription of the axn-8
gene from the Ubi promoter, and termination of such transcription
by the 35S terminator. Also present in the vector is a selectable
marker that confers resistance to glyphosate (GRG23ace5, under
control of sugarcane Ubi4 promoter), a replication origin
functional in Agrobacterium species, and a spectinomycin resistance
gene. The organization of the vector was confirmed by DNA
sequencing of the entire vector.
Detection of AXN-8 Protein in Maize Leaf Tissues
[0240] Western blot analysis was utilized to identify AXN-8 protein
expression in both leaf and root tissues. One gram of transgenic
and control tissues were suspended 2.times.LDS loading dye
(Invitrogen) with 2.5 mM .beta.-mercaptoethanol, and then
homogenized using stainless beads in a bead beater instrument. The
homogenized extracts were separated on a 4-20% Big-Tris gel,
transferred to nitrocellulose, and then incubated with rabbit serum
from rabbits immunized with purified AXN-8 protein. Following a
series of wash steps and incubation with a secondary antibody
(donkey anti-rabbit, conjugated with horseradish peroxidase,
Pierce), the presence of AXN-8 was visualized by ECL (Pierce). The
size of the protein detected by Western blot was very similar for
the leaf and root tissue, and is similar to that expected for the
full-length AXN-8 protein (approximately 50 kDa), and matches the
size of the AXN-8 protein that was purified from the host bacterial
strain, ATX20514.
[0241] All publications and patent applications mentioned in the
specification are indicative of the level of skill of those skilled
in the art to which this invention pertains. All publications and
patent applications are herein incorporated by reference to the
same extent as if each individual publication or patent application
was specifically and individually indicated to be incorporated by
reference.
[0242] Although the foregoing invention has been described in some
detail by way of illustration and example for purposes of clarity
of understanding, it will be obvious that certain changes and
modifications may be practiced within the scope of the appended
claims.
Sequence CWU 1
1
4813769DNAArthrobotrys oligospora 1tccgacgacg ttgcagtttc caacatggca
tcagcaccat acgctatcac gggcattcca 60actaccagag cccctgatgg agccctcccg
cttcgtcaag agattgatgc ttggtctgcg 120aacccagcca atgttgacca
ggtgaactta tatctccagg cgcttgctgc tttccaacag 180ttgcctgcga
cagataagct ctcttacttc cagattgctg gttagtccga tcaaccgtta
240cttctcttat ccattgagat tccttggcta attgcgattt tttgttgttg
tagggattca 300tggtatgaat ggtatcaata ttggtatctt ggatagatat
actaatttat cttaaaaaaa 360ggggagcctt ttatcccgtg ggatgagaat
accagtccta atccaagatc taggtggaga 420ggtcagtata ttggtcctat
tactctatat gtttttatat tccaataaac taaccaagtt 480attgaataca
ggatattgta cacatgcatc aatcctcttc ccgtaagcat aaaaacccag
540ggccgaattt cttgtcgcgg accggatatt agtctaactt aagatacaat
agaacatggc 600atcggccgta tctcgctgtc ttcgaggtat ataatttcta
cctaaaggaa aaatctttcg 660tagatagtta acgcttgatt tcgatattta
tagcaaatcc ttcattcgat tatgcagcga 720attgcggcag catatccaga
ccaagagctt cgaacccgat atcagactgc cgcagaagca 780ttccgtattc
catactggga cagtgcacaa cttaaggaac gtgggggcag aagatccttg
840aacgttcctt acctttgcac cttgcctact gttcaagtct tcactcctac
ttccgctgga 900gatactatca ggccttttga aactattgat aatcccttgt
acagctacaa atttgtcacc 960acacaaggaa ttactagttt ccaagaccag
gatggaaatt tctttccagt aatagaaaca 1020tccattctcc aggattcact
acgtggattg tcactaataa gcattcgatt ggtagttcgc 1080aaacgcgatg
ggaacttccc gctatccacc acaatacaat tctcgcgacc ccaccgtttc
1140ttctcagtgg accaatggat tcgttgataa cgactcgatc acggaggcac
tacggaatct 1200gagttctctt ggtgaggacg tttaccgatc attcacgacc
agcaattatg cctggtactc 1260tagcacccaa caatcaaatc ccccagcgcc
caacaggtat gaaacttgaa atgtaaatat 1320actagtaaat accagtctca
ctggctaatc ttatccaaca cttagctacc aatctctcga 1380atcgattcac
aatgaaatcc acggcatcac aggagggggt ggacatatga gctggaatac
1440gtaagtcgta tgaacctgac attaaaatta aaaatctacg ttagctaata
tacctaactt 1500cgcacactca gagtttcatc ttttggtatg gattctacca
attcattgat gcttttttat 1560acttgctaac ttgtattcct cagatcctat
tttctggctc caccactgca acgtggatcg 1620tctgtttgcc atctggcaag
ctatctacgc tgataccggc cgatatcctg atgcttggtt 1680taatgcacaa
tcagcacaac ttcgagacga acgaggaact tggtcgattg ctgcaggttc
1740tcgcgaaaat gctgacactc cactagctcc attccataag gacgacagag
gcagcgtcta 1800caattccaat gacgtccgca attggactag gtttggctct
tcgtaccctg aattgcaacc 1860atggcttcct caataccgag attccactgg
tgaatttaac gcaacgctat atcgtaacga 1920tgttgttgca caggtcaccg
acttgtattc gcgagtcaga aggcgtgtcc agaacactca 1980agttccacga
aatcgccttt ttgctgccac ccagaccggc acccagacat tccaaggcag
2040ttccgctact gcaggcgggt cgtttgcggc cccaccgaca acacaagggc
ccggtcagca 2100gttgcaattt ggtccccctc cttccggcgg gcaacaggcc
ttcgcccctc caccaacagt 2160ccaagcccaa gcccagtctc aaggacaacc
attcaccccg ccaacgacgc tgcccactca 2220gggacagcaa tttacctctc
ctcctcctca aactgctcag ggccaacagt tcccaccccc 2280gccgactcag
cagcaacagt tctcgccgcc gccgactcat cagcagcaat tcgcccctcc
2340tcctacgcag gagcacggac aggcggttac gtcaccacct gcacagacac
aattctcccc 2400tccgccaact caggcattct cgccgccacc gactggtgat
tcccacggac agcagtttac 2460tccacagccg caacagcaat tcactccaca
accgcaacag caacagcaac agcaatttgc 2520gcctccccag caaggaccag
gcggccatac cccacaggga cagcatagct ctccaccacc 2580caagaaaagc
ggcctcagtg gccttatgtc ctctgctaaa ctgcactttg gtgaagccct
2640tactgcaggc cgtgaagccg ctcaaggcca ccagcagcct gtacaacagc
atcaacagcc 2700cactcacact ccaggaaacc ctggcagcag tggtactgct
cttgctacta aatttggtgg 2760tattattgga ggcggtattc atatggccca
agaacgtctt ggttctaaga agcagccggg 2820ccaacctgga acccgtggta
ttgatgacga acctggtcaa gaaggagaat tgagccgtgg 2880attcggtgat
atgagcttgg gccaacaaag tttcggctca ggagagtcgc ttacttacca
2940cgaatacgat gcaaacatcc gatttgagag gtaaactgcg taacaaccaa
agaaacccca 3000agtatcaagc cgctaacctt agaaatatag attcgacctc
ggtggtcgtc cattcacagt 3060ccacatcttc cttggagact tcaacccgga
cccagcaact tggatgtggg acaagaatcg 3120tgtcggtgga atctataact
ttgtcgccgg tgttcagcgt ggagacggaa gcgcttgctc 3180caactgcgaa
actcaatccc aggaccacac tatcgttacg ggtcaggtgt ctctcactaa
3240cgcccttctt gacgacgttg aagactcagc aaatggcttg aatagcctga
ttcccgagga 3300ggttatcccg tatttgcaac gacatctgca ctggcgtatc
actgacgtat gttgatccct 3360cccaaagttc actttatatt gttctcaatt
gttaactaac acgtggggaa attagccgaa 3420tggaagggag atcccacgcc
agagcctcaa taccttaaag atctctgttg ttgaatgttc 3480cgccaccatt
tcaaacaacc ccggcgagct cacccaatat ggggatcaca gagtcttgga
3540catagttact gaaggtcgtc cggctggcaa agcggctggc gatggttact
aaaaaaaatc 3600tagtgaaccc tttcagcata ttgcacgcag attgctgttt
tgtttgtttt atgtagggca 3660ttcgaattcg acgaccctga aatttgcttc
acgagcatta aatcagagag ggaaatagtg 3720aatattaacc gctgggcgag
cgtcttttca tgtttatgta cttaggcag 376923094DNAArthrobotrys oligospora
2tccgacgacg ttgcagtttc caacatggca tcagcaccat acgctatcac gggcattcca
60actaccagag cccctgatgg agccctcccg cttcgtcaag agattgatgc ttggtctgcg
120aacccagcca atgttgacca ggtgaactta tatctccagg cgcttgctgc
tttccaacag 180ttgcctgcga cagataagct ctcttacttc cagattgctg
ggattcatgg ggagcctttt 240atcccgtggg atgagaatac cagtcctaat
ccaagatcta ggtggagagg atattgtaca 300catgcatcaa tcctcttccc
aacatggcat cggccgtatc tcgctgtctt cgagcaaatc 360cttcattcga
ttatgcagcg aattgcggca gcatatccag accaagagct tcgaacccga
420tatcagactg ccgcagaagc attccgtatt ccatactggg acagtgcaca
acttaaggaa 480cgtgggggca gaagatcctt gaacgttcct tacctttgca
ccttgcctac tgttcaagtc 540ttcactccta cttccgctgg agatactatc
aggccttttg aaactattga taatcccttg 600tacagctaca aatttgtcac
cacacaagga attactagtt tccaagacca ggatggaaat 660ttctttccat
tcgcaaacgc gatgggaact tcccgctatc caccacaata caattctcgc
720gaccccaccg tttcttctca gtggaccaat ggattcgttg ataacgactc
gatcacggag 780gcactacgga atctgagttc tcttggtgag gacgtttacc
gatcattcac gaccagcaat 840tatgcctggt actctagcac ccaacaatca
aatcccccag cgcccaacag ctaccaatct 900ctcgaatcga ttcacaatga
aatccacggc atcacaggag ggggtggaca tatgagctgg 960aatacagttt
catcttttga tcctattttc tggctccacc actgcaacgt ggatcgtctg
1020tttgccatct ggcaagctat ctacgctgat accggccgat atcctgatgc
ttggtttaat 1080gcacaatcag cacaacttcg agacgaacga ggaacttggt
cgattgctgc aggttctcgc 1140gaaaatgctg acactccact agctccattc
cataaggacg acagaggcag cgtctacaat 1200tccaatgacg tccgcaattg
gactaggttt ggctcttcgt accctgaatt gcaaccatgg 1260cttcctcaat
accgagattc cactggtgaa tttaacgcaa cgctatatcg taacgatgtt
1320gttgcacagg tcaccgactt gtattcgcga gtcagaaggc gtgtccagaa
cactcaagtt 1380ccacgaaatc gcctttttgc tgccacccag accggcaccc
agacattcca aggcagttcc 1440gctactgcag gcgggtcgtt tgcggcccca
ccgacaacac aagggcccgg tcagcagttg 1500caatttggtc cccctccttc
cggcgggcaa caggccttcg cccctccacc aacagtccaa 1560gcccaagccc
agtctcaagg acaaccattc accccgccaa cgacgctgcc cactcaggga
1620cagcaattta cctctcctcc tcctcaaact gctcagggcc aacagttccc
acccccgccg 1680actcagcagc aacagttctc gccgccgccg actcatcagc
agcaattcgc ccctcctcct 1740acgcaggagc acggacaggc ggttacgtca
ccacctgcac agacacaatt ctcccctccg 1800ccaactcagg cattctcgcc
gccaccgact ggtgattccc acggacagca gtttactcca 1860cagccgcaac
agcaattcac tccacaaccg caacagcaac agcaacagca atttgcgcct
1920ccccagcaag gaccaggcgg ccatacccca cagggacagc atagctctcc
accacccaag 1980aaaagcggcc tcagtggcct tatgtcctct gctaaactgc
actttggtga agcccttact 2040gcaggccgtg aagccgctca aggccaccag
cagcctgtac aacagcatca acagcccact 2100cacactccag gaaaccctgg
cagcagtggt actgctcttg ctactaaatt tggtggtatt 2160attggaggcg
gtattcatat ggcccaagaa cgtcttggtt ctaagaagca gccgggccaa
2220cctggaaccc gtggtattga tgacgaacct ggtcaagaag gagaattgag
ccgtggattc 2280ggtgatatga gcttgggcca acaaagtttc ggctcaggag
agtcgcttac ttaccacgaa 2340tacgatgcaa acatccgatt tgagagattc
gacctcggtg gtcgtccatt cacagtccac 2400atcttccttg gagacttcaa
cccggaccca gcaacttgga tgtgggacaa gaatcgtgtc 2460ggtggaatct
ataactttgt cgccggtgtt cagcgtggag acggaagcgc ttgctccaac
2520tgcgaaactc aatcccagga ccacactatc gttacgggtc aggtgtctct
cactaacgcc 2580cttcttgacg acgttgaaga ctcagcaaat ggcttgaata
gcctgattcc cgaggaggtt 2640atcccgtatt tgcaacgaca tctgcactgg
cgtatcactg acccgaatgg aagggagatc 2700ccacgccaga gcctcaatac
cttaaagatc tctgttgttg aatgttccgc caccatttca 2760aacaaccccg
gcgagctcac ccaatatggg gatcacagag tcttggacat agttactgaa
2820ggtcgtccgg ctggcaaagc ggctggcgat ggttactaaa aaaaatctag
tgaacccttt 2880cagcatattg cacgcagatt gctgttttgt ttgttttatg
tagggcattc gaattcgacg 2940accctgaaat ttgcttcacg agcattaaat
cagagaggga aatagtgaat attaaccgct 3000gggcgagcgt cttttcatgt
ttatgtactt aggcagttgc ctgtttttgc tggaatatat 3060tttaattgag
tcccaaaaaa aaaaaaaaaa aaaa 309432835DNAArthrobotrys
oligosporaCDS(1)...(2835) 3atg gca tca gca cca tac gct atc acg ggc
att cca act acc aga gcc 48Met Ala Ser Ala Pro Tyr Ala Ile Thr Gly
Ile Pro Thr Thr Arg Ala1 5 10 15cct gat gga gcc ctc ccg ctt cgt caa
gag att gat gct tgg tct gcg 96Pro Asp Gly Ala Leu Pro Leu Arg Gln
Glu Ile Asp Ala Trp Ser Ala 20 25 30aac cca gcc aat gtt gac cag gtg
aac tta tat ctc cag gcg ctt gct 144Asn Pro Ala Asn Val Asp Gln Val
Asn Leu Tyr Leu Gln Ala Leu Ala 35 40 45gct ttc caa cag ttg cct gcg
aca gat aag ctc tct tac ttc cag att 192Ala Phe Gln Gln Leu Pro Ala
Thr Asp Lys Leu Ser Tyr Phe Gln Ile 50 55 60gct ggg att cat ggg gag
cct ttt atc ccg tgg gat gag aat acc agt 240Ala Gly Ile His Gly Glu
Pro Phe Ile Pro Trp Asp Glu Asn Thr Ser65 70 75 80cct aat cca aga
tct agg tgg aga gga tat tgt aca cat gca tca atc 288Pro Asn Pro Arg
Ser Arg Trp Arg Gly Tyr Cys Thr His Ala Ser Ile 85 90 95ctc ttc cca
aca tgg cat cgg ccg tat ctc gct gtc ttc gag caa atc 336Leu Phe Pro
Thr Trp His Arg Pro Tyr Leu Ala Val Phe Glu Gln Ile 100 105 110ctt
cat tcg att atg cag cga att gcg gca gca tat cca gac caa gag 384Leu
His Ser Ile Met Gln Arg Ile Ala Ala Ala Tyr Pro Asp Gln Glu 115 120
125ctt cga acc cga tat cag act gcc gca gaa gca ttc cgt att cca tac
432Leu Arg Thr Arg Tyr Gln Thr Ala Ala Glu Ala Phe Arg Ile Pro Tyr
130 135 140tgg gac agt gca caa ctt aag gaa cgt ggg ggc aga aga tcc
ttg aac 480Trp Asp Ser Ala Gln Leu Lys Glu Arg Gly Gly Arg Arg Ser
Leu Asn145 150 155 160gtt cct tac ctt tgc acc ttg cct act gtt caa
gtc ttc act cct act 528Val Pro Tyr Leu Cys Thr Leu Pro Thr Val Gln
Val Phe Thr Pro Thr 165 170 175tcc gct gga gat act atc agg cct ttt
gaa act att gat aat ccc ttg 576Ser Ala Gly Asp Thr Ile Arg Pro Phe
Glu Thr Ile Asp Asn Pro Leu 180 185 190tac agc tac aaa ttt gtc acc
aca caa gga att act agt ttc caa gac 624Tyr Ser Tyr Lys Phe Val Thr
Thr Gln Gly Ile Thr Ser Phe Gln Asp 195 200 205cag gat gga aat ttc
ttt cca ttc gca aac gcg atg gga act tcc cgc 672Gln Asp Gly Asn Phe
Phe Pro Phe Ala Asn Ala Met Gly Thr Ser Arg 210 215 220tat cca cca
caa tac aat tct cgc gac ccc acc gtt tct tct cag tgg 720Tyr Pro Pro
Gln Tyr Asn Ser Arg Asp Pro Thr Val Ser Ser Gln Trp225 230 235
240acc aat gga ttc gtt gat aac gac tcg atc acg gag gca cta cgg aat
768Thr Asn Gly Phe Val Asp Asn Asp Ser Ile Thr Glu Ala Leu Arg Asn
245 250 255ctg agt tct ctt ggt gag gac gtt tac cga tca ttc acg acc
agc aat 816Leu Ser Ser Leu Gly Glu Asp Val Tyr Arg Ser Phe Thr Thr
Ser Asn 260 265 270tat gcc tgg tac tct agc acc caa caa tca aat ccc
cca gcg ccc aac 864Tyr Ala Trp Tyr Ser Ser Thr Gln Gln Ser Asn Pro
Pro Ala Pro Asn 275 280 285agc tac caa tct ctc gaa tcg att cac aat
gaa atc cac ggc atc aca 912Ser Tyr Gln Ser Leu Glu Ser Ile His Asn
Glu Ile His Gly Ile Thr 290 295 300gga ggg ggt gga cat atg agc tgg
aat aca gtt tca tct ttt gat cct 960Gly Gly Gly Gly His Met Ser Trp
Asn Thr Val Ser Ser Phe Asp Pro305 310 315 320att ttc tgg ctc cac
cac tgc aac gtg gat cgt ctg ttt gcc atc tgg 1008Ile Phe Trp Leu His
His Cys Asn Val Asp Arg Leu Phe Ala Ile Trp 325 330 335caa gct atc
tac gct gat acc ggc cga tat cct gat gct tgg ttt aat 1056Gln Ala Ile
Tyr Ala Asp Thr Gly Arg Tyr Pro Asp Ala Trp Phe Asn 340 345 350gca
caa tca gca caa ctt cga gac gaa cga gga act tgg tcg att gct 1104Ala
Gln Ser Ala Gln Leu Arg Asp Glu Arg Gly Thr Trp Ser Ile Ala 355 360
365gca ggt tct cgc gaa aat gct gac act cca cta gct cca ttc cat aag
1152Ala Gly Ser Arg Glu Asn Ala Asp Thr Pro Leu Ala Pro Phe His Lys
370 375 380gac gac aga ggc agc gtc tac aat tcc aat gac gtc cgc aat
tgg act 1200Asp Asp Arg Gly Ser Val Tyr Asn Ser Asn Asp Val Arg Asn
Trp Thr385 390 395 400agg ttt ggc tct tcg tac cct gaa ttg caa cca
tgg ctt cct caa tac 1248Arg Phe Gly Ser Ser Tyr Pro Glu Leu Gln Pro
Trp Leu Pro Gln Tyr 405 410 415cga gat tcc act ggt gaa ttt aac gca
acg cta tat cgt aac gat gtt 1296Arg Asp Ser Thr Gly Glu Phe Asn Ala
Thr Leu Tyr Arg Asn Asp Val 420 425 430gtt gca cag gtc acc gac ttg
tat tcg cga gtc aga agg cgt gtc cag 1344Val Ala Gln Val Thr Asp Leu
Tyr Ser Arg Val Arg Arg Arg Val Gln 435 440 445aac act caa gtt cca
cga aat cgc ctt ttt gct gcc acc cag acc ggc 1392Asn Thr Gln Val Pro
Arg Asn Arg Leu Phe Ala Ala Thr Gln Thr Gly 450 455 460acc cag aca
ttc caa ggc agt tcc gct act gca ggc ggg tcg ttt gcg 1440Thr Gln Thr
Phe Gln Gly Ser Ser Ala Thr Ala Gly Gly Ser Phe Ala465 470 475
480gcc cca ccg aca aca caa ggg ccc ggt cag cag ttg caa ttt ggt ccc
1488Ala Pro Pro Thr Thr Gln Gly Pro Gly Gln Gln Leu Gln Phe Gly Pro
485 490 495cct cct tcc ggc ggg caa cag gcc ttc gcc cct cca cca aca
gtc caa 1536Pro Pro Ser Gly Gly Gln Gln Ala Phe Ala Pro Pro Pro Thr
Val Gln 500 505 510gcc caa gcc cag tct caa gga caa cca ttc acc ccg
cca acg acg ctg 1584Ala Gln Ala Gln Ser Gln Gly Gln Pro Phe Thr Pro
Pro Thr Thr Leu 515 520 525ccc act cag gga cag caa ttt acc tct cct
cct cct caa act gct cag 1632Pro Thr Gln Gly Gln Gln Phe Thr Ser Pro
Pro Pro Gln Thr Ala Gln 530 535 540ggc caa cag ttc cca ccc ccg ccg
act cag cag caa cag ttc tcg ccg 1680Gly Gln Gln Phe Pro Pro Pro Pro
Thr Gln Gln Gln Gln Phe Ser Pro545 550 555 560ccg ccg act cat cag
cag caa ttc gcc cct cct cct acg cag gag cac 1728Pro Pro Thr His Gln
Gln Gln Phe Ala Pro Pro Pro Thr Gln Glu His 565 570 575gga cag gcg
gtt acg tca cca cct gca cag aca caa ttc tcc cct ccg 1776Gly Gln Ala
Val Thr Ser Pro Pro Ala Gln Thr Gln Phe Ser Pro Pro 580 585 590cca
act cag gca ttc tcg ccg cca ccg act ggt gat tcc cac gga cag 1824Pro
Thr Gln Ala Phe Ser Pro Pro Pro Thr Gly Asp Ser His Gly Gln 595 600
605cag ttt act cca cag ccg caa cag caa ttc act cca caa ccg caa cag
1872Gln Phe Thr Pro Gln Pro Gln Gln Gln Phe Thr Pro Gln Pro Gln Gln
610 615 620caa cag caa cag caa ttt gcg cct ccc cag caa gga cca ggc
ggc cat 1920Gln Gln Gln Gln Gln Phe Ala Pro Pro Gln Gln Gly Pro Gly
Gly His625 630 635 640acc cca cag gga cag cat agc tct cca cca ccc
aag aaa agc ggc ctc 1968Thr Pro Gln Gly Gln His Ser Ser Pro Pro Pro
Lys Lys Ser Gly Leu 645 650 655agt ggc ctt atg tcc tct gct aaa ctg
cac ttt ggt gaa gcc ctt act 2016Ser Gly Leu Met Ser Ser Ala Lys Leu
His Phe Gly Glu Ala Leu Thr 660 665 670gca ggc cgt gaa gcc gct caa
ggc cac cag cag cct gta caa cag cat 2064Ala Gly Arg Glu Ala Ala Gln
Gly His Gln Gln Pro Val Gln Gln His 675 680 685caa cag ccc act cac
act cca gga aac cct ggc agc agt ggt act gct 2112Gln Gln Pro Thr His
Thr Pro Gly Asn Pro Gly Ser Ser Gly Thr Ala 690 695 700ctt gct act
aaa ttt ggt ggt att att gga ggc ggt att cat atg gcc 2160Leu Ala Thr
Lys Phe Gly Gly Ile Ile Gly Gly Gly Ile His Met Ala705 710 715
720caa gaa cgt ctt ggt tct aag aag cag ccg ggc caa cct gga acc cgt
2208Gln Glu Arg Leu Gly Ser Lys Lys Gln Pro Gly Gln Pro Gly Thr Arg
725 730 735ggt att gat gac gaa cct ggt caa gaa gga gaa ttg agc cgt
gga ttc 2256Gly Ile Asp Asp Glu Pro Gly Gln Glu Gly Glu Leu Ser Arg
Gly Phe 740 745 750ggt gat atg agc ttg ggc caa caa agt ttc ggc tca
gga gag tcg ctt 2304Gly Asp Met Ser Leu Gly Gln Gln Ser Phe Gly Ser
Gly Glu Ser Leu 755 760 765act tac cac gaa tac gat gca aac atc cga
ttt gag aga ttc gac ctc 2352Thr Tyr His Glu Tyr Asp Ala Asn Ile Arg
Phe Glu Arg Phe Asp Leu 770 775 780ggt ggt cgt cca ttc aca gtc cac
atc ttc ctt gga gac ttc aac ccg 2400Gly Gly Arg Pro Phe Thr Val His
Ile Phe Leu Gly Asp Phe Asn Pro785 790 795 800gac cca gca act tgg
atg tgg gac aag aat cgt gtc ggt gga atc tat 2448Asp Pro Ala Thr Trp
Met Trp Asp Lys Asn Arg Val Gly Gly Ile Tyr 805 810 815aac ttt gtc
gcc ggt gtt cag cgt gga gac gga agc gct tgc tcc aac 2496Asn Phe Val
Ala Gly Val Gln Arg Gly Asp
Gly Ser Ala Cys Ser Asn 820 825 830tgc gaa act caa tcc cag gac cac
act atc gtt acg ggt cag gtg tct 2544Cys Glu Thr Gln Ser Gln Asp His
Thr Ile Val Thr Gly Gln Val Ser 835 840 845ctc act aac gcc ctt ctt
gac gac gtt gaa gac tca gca aat ggc ttg 2592Leu Thr Asn Ala Leu Leu
Asp Asp Val Glu Asp Ser Ala Asn Gly Leu 850 855 860aat agc ctg att
ccc gag gag gtt atc ccg tat ttg caa cga cat ctg 2640Asn Ser Leu Ile
Pro Glu Glu Val Ile Pro Tyr Leu Gln Arg His Leu865 870 875 880cac
tgg cgt atc act gac ccg aat gga agg gag atc cca cgc cag agc 2688His
Trp Arg Ile Thr Asp Pro Asn Gly Arg Glu Ile Pro Arg Gln Ser 885 890
895ctc aat acc tta aag atc tct gtt gtt gaa tgt tcc gcc acc att tca
2736Leu Asn Thr Leu Lys Ile Ser Val Val Glu Cys Ser Ala Thr Ile Ser
900 905 910aac aac ccc ggc gag ctc acc caa tat ggg gat cac aga gtc
ttg gac 2784Asn Asn Pro Gly Glu Leu Thr Gln Tyr Gly Asp His Arg Val
Leu Asp 915 920 925ata gtt act gaa ggt cgt ccg gct ggc aaa gcg gct
ggc gat ggt tac 2832Ile Val Thr Glu Gly Arg Pro Ala Gly Lys Ala Ala
Gly Asp Gly Tyr 930 935 940taa 28354944PRTArthrobotrys oligospora
4Met Ala Ser Ala Pro Tyr Ala Ile Thr Gly Ile Pro Thr Thr Arg Ala1 5
10 15 Pro Asp Gly Ala Leu Pro Leu Arg Gln Glu Ile Asp Ala Trp Ser
Ala 20 25 30 Asn Pro Ala Asn Val Asp Gln Val Asn Leu Tyr Leu Gln
Ala Leu Ala 35 40 45 Ala Phe Gln Gln Leu Pro Ala Thr Asp Lys Leu
Ser Tyr Phe Gln Ile 50 55 60 Ala Gly Ile His Gly Glu Pro Phe Ile
Pro Trp Asp Glu Asn Thr Ser65 70 75 80 Pro Asn Pro Arg Ser Arg Trp
Arg Gly Tyr Cys Thr His Ala Ser Ile 85 90 95 Leu Phe Pro Thr Trp
His Arg Pro Tyr Leu Ala Val Phe Glu Gln Ile 100 105 110 Leu His Ser
Ile Met Gln Arg Ile Ala Ala Ala Tyr Pro Asp Gln Glu 115 120 125 Leu
Arg Thr Arg Tyr Gln Thr Ala Ala Glu Ala Phe Arg Ile Pro Tyr 130 135
140 Trp Asp Ser Ala Gln Leu Lys Glu Arg Gly Gly Arg Arg Ser Leu
Asn145 150 155 160 Val Pro Tyr Leu Cys Thr Leu Pro Thr Val Gln Val
Phe Thr Pro Thr 165 170 175 Ser Ala Gly Asp Thr Ile Arg Pro Phe Glu
Thr Ile Asp Asn Pro Leu 180 185 190 Tyr Ser Tyr Lys Phe Val Thr Thr
Gln Gly Ile Thr Ser Phe Gln Asp 195 200 205 Gln Asp Gly Asn Phe Phe
Pro Phe Ala Asn Ala Met Gly Thr Ser Arg 210 215 220 Tyr Pro Pro Gln
Tyr Asn Ser Arg Asp Pro Thr Val Ser Ser Gln Trp225 230 235 240 Thr
Asn Gly Phe Val Asp Asn Asp Ser Ile Thr Glu Ala Leu Arg Asn 245 250
255 Leu Ser Ser Leu Gly Glu Asp Val Tyr Arg Ser Phe Thr Thr Ser Asn
260 265 270 Tyr Ala Trp Tyr Ser Ser Thr Gln Gln Ser Asn Pro Pro Ala
Pro Asn 275 280 285 Ser Tyr Gln Ser Leu Glu Ser Ile His Asn Glu Ile
His Gly Ile Thr 290 295 300 Gly Gly Gly Gly His Met Ser Trp Asn Thr
Val Ser Ser Phe Asp Pro305 310 315 320 Ile Phe Trp Leu His His Cys
Asn Val Asp Arg Leu Phe Ala Ile Trp 325 330 335 Gln Ala Ile Tyr Ala
Asp Thr Gly Arg Tyr Pro Asp Ala Trp Phe Asn 340 345 350 Ala Gln Ser
Ala Gln Leu Arg Asp Glu Arg Gly Thr Trp Ser Ile Ala 355 360 365 Ala
Gly Ser Arg Glu Asn Ala Asp Thr Pro Leu Ala Pro Phe His Lys 370 375
380 Asp Asp Arg Gly Ser Val Tyr Asn Ser Asn Asp Val Arg Asn Trp
Thr385 390 395 400 Arg Phe Gly Ser Ser Tyr Pro Glu Leu Gln Pro Trp
Leu Pro Gln Tyr 405 410 415 Arg Asp Ser Thr Gly Glu Phe Asn Ala Thr
Leu Tyr Arg Asn Asp Val 420 425 430 Val Ala Gln Val Thr Asp Leu Tyr
Ser Arg Val Arg Arg Arg Val Gln 435 440 445 Asn Thr Gln Val Pro Arg
Asn Arg Leu Phe Ala Ala Thr Gln Thr Gly 450 455 460 Thr Gln Thr Phe
Gln Gly Ser Ser Ala Thr Ala Gly Gly Ser Phe Ala465 470 475 480 Ala
Pro Pro Thr Thr Gln Gly Pro Gly Gln Gln Leu Gln Phe Gly Pro 485 490
495 Pro Pro Ser Gly Gly Gln Gln Ala Phe Ala Pro Pro Pro Thr Val Gln
500 505 510 Ala Gln Ala Gln Ser Gln Gly Gln Pro Phe Thr Pro Pro Thr
Thr Leu 515 520 525 Pro Thr Gln Gly Gln Gln Phe Thr Ser Pro Pro Pro
Gln Thr Ala Gln 530 535 540 Gly Gln Gln Phe Pro Pro Pro Pro Thr Gln
Gln Gln Gln Phe Ser Pro545 550 555 560 Pro Pro Thr His Gln Gln Gln
Phe Ala Pro Pro Pro Thr Gln Glu His 565 570 575 Gly Gln Ala Val Thr
Ser Pro Pro Ala Gln Thr Gln Phe Ser Pro Pro 580 585 590 Pro Thr Gln
Ala Phe Ser Pro Pro Pro Thr Gly Asp Ser His Gly Gln 595 600 605 Gln
Phe Thr Pro Gln Pro Gln Gln Gln Phe Thr Pro Gln Pro Gln Gln 610 615
620 Gln Gln Gln Gln Gln Phe Ala Pro Pro Gln Gln Gly Pro Gly Gly
His625 630 635 640 Thr Pro Gln Gly Gln His Ser Ser Pro Pro Pro Lys
Lys Ser Gly Leu 645 650 655 Ser Gly Leu Met Ser Ser Ala Lys Leu His
Phe Gly Glu Ala Leu Thr 660 665 670 Ala Gly Arg Glu Ala Ala Gln Gly
His Gln Gln Pro Val Gln Gln His 675 680 685 Gln Gln Pro Thr His Thr
Pro Gly Asn Pro Gly Ser Ser Gly Thr Ala 690 695 700 Leu Ala Thr Lys
Phe Gly Gly Ile Ile Gly Gly Gly Ile His Met Ala705 710 715 720 Gln
Glu Arg Leu Gly Ser Lys Lys Gln Pro Gly Gln Pro Gly Thr Arg 725 730
735 Gly Ile Asp Asp Glu Pro Gly Gln Glu Gly Glu Leu Ser Arg Gly Phe
740 745 750 Gly Asp Met Ser Leu Gly Gln Gln Ser Phe Gly Ser Gly Glu
Ser Leu 755 760 765 Thr Tyr His Glu Tyr Asp Ala Asn Ile Arg Phe Glu
Arg Phe Asp Leu 770 775 780 Gly Gly Arg Pro Phe Thr Val His Ile Phe
Leu Gly Asp Phe Asn Pro785 790 795 800 Asp Pro Ala Thr Trp Met Trp
Asp Lys Asn Arg Val Gly Gly Ile Tyr 805 810 815 Asn Phe Val Ala Gly
Val Gln Arg Gly Asp Gly Ser Ala Cys Ser Asn 820 825 830 Cys Glu Thr
Gln Ser Gln Asp His Thr Ile Val Thr Gly Gln Val Ser 835 840 845 Leu
Thr Asn Ala Leu Leu Asp Asp Val Glu Asp Ser Ala Asn Gly Leu 850 855
860 Asn Ser Leu Ile Pro Glu Glu Val Ile Pro Tyr Leu Gln Arg His
Leu865 870 875 880 His Trp Arg Ile Thr Asp Pro Asn Gly Arg Glu Ile
Pro Arg Gln Ser 885 890 895 Leu Asn Thr Leu Lys Ile Ser Val Val Glu
Cys Ser Ala Thr Ile Ser 900 905 910 Asn Asn Pro Gly Glu Leu Thr Gln
Tyr Gly Asp His Arg Val Leu Asp 915 920 925 Ile Val Thr Glu Gly Arg
Pro Ala Gly Lys Ala Ala Gly Asp Gly Tyr 930 935 940
5440PRTArthrobotrys oligospora 5Met Ala Ser Ala Pro Tyr Ala Ile Thr
Gly Ile Pro Thr Thr Arg Ala1 5 10 15 Pro Asp Gly Ala Leu Pro Leu
Arg Gln Glu Ile Asp Ala Trp Ser Ala 20 25 30 Asn Pro Ala Asn Val
Asp Gln Val Asn Leu Tyr Leu Gln Ala Leu Ala 35 40 45 Ala Phe Gln
Gln Leu Pro Ala Thr Asp Lys Leu Ser Tyr Phe Gln Ile 50 55 60 Ala
Gly Ile His Gly Glu Pro Phe Ile Pro Trp Asp Glu Asn Thr Ser65 70 75
80 Pro Asn Pro Arg Ser Arg Trp Arg Gly Tyr Cys Thr His Ala Ser Ile
85 90 95 Leu Phe Pro Thr Trp His Arg Pro Tyr Leu Ala Val Phe Glu
Gln Ile 100 105 110 Leu His Ser Ile Met Gln Arg Ile Ala Ala Ala Tyr
Pro Asp Gln Glu 115 120 125 Leu Arg Thr Arg Tyr Gln Thr Ala Ala Glu
Ala Phe Arg Ile Pro Tyr 130 135 140 Trp Asp Ser Ala Gln Leu Lys Glu
Arg Gly Gly Arg Arg Ser Leu Asn145 150 155 160 Val Pro Tyr Leu Cys
Thr Leu Pro Thr Val Gln Val Phe Thr Pro Thr 165 170 175 Ser Ala Gly
Asp Thr Ile Arg Pro Phe Glu Thr Ile Asp Asn Pro Leu 180 185 190 Tyr
Ser Tyr Lys Phe Val Thr Thr Gln Gly Ile Thr Ser Phe Gln Asp 195 200
205 Gln Asp Gly Asn Phe Phe Pro Phe Ala Asn Ala Met Gly Thr Ser Arg
210 215 220 Tyr Pro Pro Gln Tyr Asn Ser Arg Asp Pro Thr Val Ser Ser
Gln Trp225 230 235 240 Thr Asn Gly Phe Val Asp Asn Asp Ser Ile Thr
Glu Ala Leu Arg Asn 245 250 255 Leu Ser Ser Leu Gly Glu Asp Val Tyr
Arg Ser Phe Thr Thr Ser Asn 260 265 270 Tyr Ala Trp Tyr Ser Ser Thr
Gln Gln Ser Asn Pro Pro Ala Pro Asn 275 280 285 Ser Tyr Gln Ser Leu
Glu Ser Ile His Asn Glu Ile His Gly Ile Thr 290 295 300 Gly Gly Gly
Gly His Met Ser Trp Asn Thr Val Ser Ser Phe Asp Pro305 310 315 320
Ile Phe Trp Leu His His Cys Asn Val Asp Arg Leu Phe Ala Ile Trp 325
330 335 Gln Ala Ile Tyr Ala Asp Thr Gly Arg Tyr Pro Asp Ala Trp Phe
Asn 340 345 350 Ala Gln Ser Ala Gln Leu Arg Asp Glu Arg Gly Thr Trp
Ser Ile Ala 355 360 365 Ala Gly Ser Arg Glu Asn Ala Asp Thr Pro Leu
Ala Pro Phe His Lys 370 375 380 Asp Asp Arg Gly Ser Val Tyr Asn Ser
Asn Asp Val Arg Asn Trp Thr385 390 395 400 Arg Phe Gly Ser Ser Tyr
Pro Glu Leu Gln Pro Trp Leu Pro Gln Tyr 405 410 415 Arg Asp Ser Thr
Gly Glu Phe Asn Ala Thr Leu Tyr Arg Asn Asp Val 420 425 430 Val Ala
Gln Val Thr Asp Leu Tyr 435 440 62835DNAArtificial
SequenceSynthetic sequence encoding AXN-1 6atggcttctg ctccatatgc
tattaccggt attccaacta ctagggctcc agatggtgct 60ttgccactta ggcaagagat
tgatgcttgg agtgctaacc cagctaacgt tgatcaggtg 120aacctttacc
ttcaagctct tgctgctttc caacaacttc cagctaccga taagttgtcc
180tacttccaga ttgctggtat tcatggcgaa ccattcattc catgggatga
gaacacttct 240ccaaacccaa gatctaggtg gagaggatat tgcacccacg
cttctatttt gttcccaacc 300tggcatagac cataccttgc tgtgttcgag
cagattctcc actctatcat gcaaagaatc 360gctgctgctt atccagatca
agagcttagg actagatacc aaactgctgc tgaggctttc 420aggattccat
attgggattc cgctcagttg aaagaaagag gtggaagaag atctctcaac
480gttccatact tgtgcactct tccaactgtt caagttttca ccccaacttc
agctggtgat 540accattagac ccttcgagac tattgataac ccactctact
cctacaagtt cgttactacc 600cagggaatta cctcattcca agatcaggat
ggcaacttct tcccattcgc taacgctatg 660ggaacttcaa gatacccacc
acagtacaac tcaagagatc caaccgtttc ttctcaatgg 720accaacggat
tcgtggataa cgattccatt actgaggctc ttaggaacct ttctagcctt
780ggagaggatg tgtacagatc tttcaccacc tctaactacg cttggtactc
ttctactcag 840caatctaacc cacctgctcc aaactcttac cagtctctgg
agtctattca caacgagatt 900cacggaatta ctggtggagg tggacatatg
tcttggaaca ccgtgtcatc cttcgatcca 960attttctggc ttcatcactg
caacgttgat aggcttttcg ctatttggca agctatctac 1020gctgatactg
gaagatatcc agatgcatgg ttcaacgctc aatctgctca acttagagat
1080gagaggggaa cttggtctat tgctgctgga tcaagagaaa acgctgatac
tccacttgct 1140ccattccata aggatgatag gggttctgtg tacaactcta
acgatgttag gaactggact 1200agattcggat cttcttaccc agaacttcaa
ccatggcttc cacagtatag ggattctacc 1260ggtgagttca acgctactct
ctacaggaac gatgttgttg ctcaggttac cgatctttac 1320tcaagagtga
gaagaagggt tcaaaacact caagttccta ggaacagact tttcgctgct
1380actcaaactg gaactcaaac cttccaagga tcttctgcta ctgctggcgg
atctttcgct 1440gctccaccaa ctactcaagg accaggacaa caacttcaat
tcggaccacc accatctggt 1500ggacaacagg ctttcgcacc accacctact
gttcaagctc aagctcaatc tcagggacaa 1560ccattcactc cacctactac
tttgccaact caaggacaac aattcacatc tccaccacca 1620caaactgctc
aaggtcaaca gttcccaccc ccacctactc aacaacaaca gttctctcca
1680cctcctactc atcagcaaca attcgctcct ccacccacac aagaacatgg
acaagctgtt 1740acttctcctc ctgctcaaac tcaattttct ccacctccaa
cacaggcttt ctcaccacct 1800ccaactggtg attctcatgg acagcaattc
actcctcaac cacaacagca gtttactcca 1860cagccacaac aacagcagca
acagcaattc gctccacctc aacaaggacc aggtggacat 1920actccacaag
gacagcattc ttcaccacca cctaagaagt ctggactttc tggtcttatg
1980tcctctgcta agttgcattt cggagaggct cttactgctg gaagagaagc
tgctcaagga 2040catcaacaac cagttcaaca acatcaacag ccaactcata
ctccaggaaa cccaggatct 2100tctggaactg ctcttgctac taagttcgga
ggaattattg gaggtggaat ccatatggct 2160caagagagac tcggatctaa
gaagcaacca ggacaaccag gtactagggg tattgatgat 2220gaaccaggac
aagagggtga actttcaaga ggattcggag atatgtctct tggtcaacag
2280tctttcggat ctggtgagtc tcttacttac cacgagtacg atgctaacat
tagattcgag 2340agattcgatc ttggaggaag gccattcacc gttcacattt
tcttgggaga cttcaaccca 2400gatccagcta cttggatgtg ggataagaac
agagttggag gcatctataa cttcgttgct 2460ggtgttcaaa ggggagatgg
atctgcttgc tctaactgcg agactcagtc tcaagatcat 2520accattgtga
ccggacaagt ttctcttacc aacgctctcc ttgatgatgt tgaggattct
2580gctaacggac ttaactctct cattcccgag gaagtgattc cataccttca
gaggcatctc 2640cattggagaa ttaccgatcc aaacggaaga gagattccaa
ggcagtctct taacaccctt 2700aagatttctg ttgtggagtg ctctgctacc
atttctaaca accctggtga gttgactcaa 2760tacggtgatc atagggtgtt
ggatattgtg actgaaggta gaccagctgg aaaggctgct 2820ggtgatggct actaa
283571083DNABacillus thuringiensisCDS(1)...(1083) 7atg aga att aga
agg aac caa tcc act ctg agc cat aat gaa cgc cta 48Met Arg Ile Arg
Arg Asn Gln Ser Thr Leu Ser His Asn Glu Arg Leu1 5 10 15gcg ttt act
aat gcg gta tta gaa tta aaa cgt aga cca agt cgt tta 96Ala Phe Thr
Asn Ala Val Leu Glu Leu Lys Arg Arg Pro Ser Arg Leu 20 25 30ccg atg
tca ttg ggt agt aca agt cgt tat gat gat tat gtt tat tgg 144Pro Met
Ser Leu Gly Ser Thr Ser Arg Tyr Asp Asp Tyr Val Tyr Trp 35 40 45cat
tta cag tca atg gaa aat caa aca tcg act aca cca gga tgg gct 192His
Leu Gln Ser Met Glu Asn Gln Thr Ser Thr Thr Pro Gly Trp Ala 50 55
60cat aga ggc cca gca ttt tta cct tgg cat cgt tat tat cta aat caa
240His Arg Gly Pro Ala Phe Leu Pro Trp His Arg Tyr Tyr Leu Asn
Gln65 70 75 80ttt gaa gaa gat tta caa cga att gat cat aca gtt aca
ctt cct tat 288Phe Glu Glu Asp Leu Gln Arg Ile Asp His Thr Val Thr
Leu Pro Tyr 85 90 95tgg gat tgg aca gtt gat aac tca act gat tca tca
gtt cca gga agt 336Trp Asp Trp Thr Val Asp Asn Ser Thr Asp Ser Ser
Val Pro Gly Ser 100 105 110cct tgg act gat gat ttt atg ggc ggt gat
ggt gat cct acc caa gaa 384Pro Trp Thr Asp Asp Phe Met Gly Gly Asp
Gly Asp Pro Thr Gln Glu 115 120 125tat act gtc aca aca ggt ccc ttt
aca ggt gac aat tgg aag tta act 432Tyr Thr Val Thr Thr Gly Pro Phe
Thr Gly Asp Asn Trp Lys Leu Thr 130 135 140ctt ttt gat cat cat gaa
aac gag cct cat aat gct cga tta cgc cgt 480Leu Phe Asp His His Glu
Asn Glu Pro His Asn Ala Arg Leu Arg Arg145 150 155 160cag tta gga
act act tta aat gcc tct gga aat act ata tca atc aat 528Gln Leu Gly
Thr Thr Leu Asn Ala Ser Gly Asn Thr Ile Ser Ile Asn 165 170 175ctt
cca aca gat tca gag gta cag aat tgt tta tta gaa act cca tat 576Leu
Pro Thr Asp Ser Glu Val Gln Asn Cys
Leu Leu Glu Thr Pro Tyr 180 185 190tat gta tct cct tgg cgt gca ggg
caa gat gta aat caa cct gca tta 624Tyr Val Ser Pro Trp Arg Ala Gly
Gln Asp Val Asn Gln Pro Ala Leu 195 200 205aat cca aca aaa cca agt
ttt tgt aat cgt ctt gaa ggt tgg tat gga 672Asn Pro Thr Lys Pro Ser
Phe Cys Asn Arg Leu Glu Gly Trp Tyr Gly 210 215 220gca gga agt att
cat aat aaa gtt cat gta tgg gta gct ggt gct aca 720Ala Gly Ser Ile
His Asn Lys Val His Val Trp Val Ala Gly Ala Thr225 230 235 240gag
ggc tct atg att tgg atg agc tca cca aat gat cct gtc ttt ttc 768Glu
Gly Ser Met Ile Trp Met Ser Ser Pro Asn Asp Pro Val Phe Phe 245 250
255tta cat cat gca aat att gat cgc cta tgg gtc caa tgg cag gcc aat
816Leu His His Ala Asn Ile Asp Arg Leu Trp Val Gln Trp Gln Ala Asn
260 265 270aat cca aat gaa ggg tat cat cct act gga aat ggt aat gaa
gtt gga 864Asn Pro Asn Glu Gly Tyr His Pro Thr Gly Asn Gly Asn Glu
Val Gly 275 280 285cca aca ggt cat aat tta aat gat tca atg aat cct
tgg ggg agg aag 912Pro Thr Gly His Asn Leu Asn Asp Ser Met Asn Pro
Trp Gly Arg Lys 290 295 300gtt act cca aat aat gtc ctt aat cat tat
agt ctt ggt tat act tac 960Val Thr Pro Asn Asn Val Leu Asn His Tyr
Ser Leu Gly Tyr Thr Tyr305 310 315 320gat aca gat tca acc cct ctt
tct gaa atc ttt atg cat aca ttt aat 1008Asp Thr Asp Ser Thr Pro Leu
Ser Glu Ile Phe Met His Thr Phe Asn 325 330 335ctg aaa att cgt aaa
gaa aaa caa atc aaa gat ggt cat ttt ggt tta 1056Leu Lys Ile Arg Lys
Glu Lys Gln Ile Lys Asp Gly His Phe Gly Leu 340 345 350agt caa gaa
gat tta gac aaa ttg taa 1083Ser Gln Glu Asp Leu Asp Lys Leu 355
3608360PRTBacillus thuringiensis 8Met Arg Ile Arg Arg Asn Gln Ser
Thr Leu Ser His Asn Glu Arg Leu1 5 10 15 Ala Phe Thr Asn Ala Val
Leu Glu Leu Lys Arg Arg Pro Ser Arg Leu 20 25 30 Pro Met Ser Leu
Gly Ser Thr Ser Arg Tyr Asp Asp Tyr Val Tyr Trp 35 40 45 His Leu
Gln Ser Met Glu Asn Gln Thr Ser Thr Thr Pro Gly Trp Ala 50 55 60
His Arg Gly Pro Ala Phe Leu Pro Trp His Arg Tyr Tyr Leu Asn Gln65
70 75 80 Phe Glu Glu Asp Leu Gln Arg Ile Asp His Thr Val Thr Leu
Pro Tyr 85 90 95 Trp Asp Trp Thr Val Asp Asn Ser Thr Asp Ser Ser
Val Pro Gly Ser 100 105 110 Pro Trp Thr Asp Asp Phe Met Gly Gly Asp
Gly Asp Pro Thr Gln Glu 115 120 125 Tyr Thr Val Thr Thr Gly Pro Phe
Thr Gly Asp Asn Trp Lys Leu Thr 130 135 140 Leu Phe Asp His His Glu
Asn Glu Pro His Asn Ala Arg Leu Arg Arg145 150 155 160 Gln Leu Gly
Thr Thr Leu Asn Ala Ser Gly Asn Thr Ile Ser Ile Asn 165 170 175 Leu
Pro Thr Asp Ser Glu Val Gln Asn Cys Leu Leu Glu Thr Pro Tyr 180 185
190 Tyr Val Ser Pro Trp Arg Ala Gly Gln Asp Val Asn Gln Pro Ala Leu
195 200 205 Asn Pro Thr Lys Pro Ser Phe Cys Asn Arg Leu Glu Gly Trp
Tyr Gly 210 215 220 Ala Gly Ser Ile His Asn Lys Val His Val Trp Val
Ala Gly Ala Thr225 230 235 240 Glu Gly Ser Met Ile Trp Met Ser Ser
Pro Asn Asp Pro Val Phe Phe 245 250 255 Leu His His Ala Asn Ile Asp
Arg Leu Trp Val Gln Trp Gln Ala Asn 260 265 270 Asn Pro Asn Glu Gly
Tyr His Pro Thr Gly Asn Gly Asn Glu Val Gly 275 280 285 Pro Thr Gly
His Asn Leu Asn Asp Ser Met Asn Pro Trp Gly Arg Lys 290 295 300 Val
Thr Pro Asn Asn Val Leu Asn His Tyr Ser Leu Gly Tyr Thr Tyr305 310
315 320 Asp Thr Asp Ser Thr Pro Leu Ser Glu Ile Phe Met His Thr Phe
Asn 325 330 335 Leu Lys Ile Arg Lys Glu Lys Gln Ile Lys Asp Gly His
Phe Gly Leu 340 345 350 Ser Gln Glu Asp Leu Asp Lys Leu 355 360
9369PRTArtificial SequenceAXN-2 with His tag 9Met Ala His His His
His His His Gly Ser Met Arg Ile Arg Arg Asn1 5 10 15 Gln Ser Thr
Leu Ser His Asn Glu Arg Leu Ala Phe Thr Asn Ala Val 20 25 30 Leu
Glu Leu Lys Arg Arg Pro Ser Arg Leu Pro Met Ser Leu Gly Ser 35 40
45 Thr Ser Arg Tyr Asp Asp Tyr Val Tyr Trp His Leu Gln Ser Met Glu
50 55 60 Asn Gln Thr Ser Thr Thr Pro Gly Trp Ala His Arg Gly Pro
Ala Phe65 70 75 80 Leu Pro Trp His Arg Tyr Tyr Leu Asn Gln Phe Glu
Glu Asp Leu Gln 85 90 95 Arg Ile Asp His Thr Val Thr Leu Pro Tyr
Trp Asp Trp Thr Val Asp 100 105 110 Asn Ser Thr Asp Ser Ser Val Pro
Gly Ser Pro Trp Thr Asp Asp Phe 115 120 125 Met Gly Gly Asp Gly Asp
Pro Thr Gln Glu Tyr Thr Val Thr Thr Gly 130 135 140 Pro Phe Thr Gly
Asp Asn Trp Lys Leu Thr Leu Phe Asp His His Glu145 150 155 160 Asn
Glu Pro His Asn Ala Arg Leu Arg Arg Gln Leu Gly Thr Thr Leu 165 170
175 Asn Ala Ser Gly Asn Thr Ile Ser Ile Asn Leu Pro Thr Asp Ser Glu
180 185 190 Val Gln Asn Cys Leu Leu Glu Thr Pro Tyr Tyr Val Ser Pro
Trp Arg 195 200 205 Ala Gly Gln Asp Val Asn Gln Pro Ala Leu Asn Pro
Thr Lys Pro Ser 210 215 220 Phe Cys Asn Arg Leu Glu Gly Trp Tyr Gly
Ala Gly Ser Ile His Asn225 230 235 240 Lys Val His Val Trp Val Ala
Gly Ala Thr Glu Gly Ser Met Ile Trp 245 250 255 Met Ser Ser Pro Asn
Asp Pro Val Phe Phe Leu His His Ala Asn Ile 260 265 270 Asp Arg Leu
Trp Val Gln Trp Gln Ala Asn Asn Pro Asn Glu Gly Tyr 275 280 285 His
Pro Thr Gly Asn Gly Asn Glu Val Gly Pro Thr Gly His Asn Leu 290 295
300 Asn Asp Ser Met Asn Pro Trp Gly Arg Lys Val Thr Pro Asn Asn
Val305 310 315 320 Leu Asn His Tyr Ser Leu Gly Tyr Thr Tyr Asp Thr
Asp Ser Thr Pro 325 330 335 Leu Ser Glu Ile Phe Met His Thr Phe Asn
Leu Lys Ile Arg Lys Glu 340 345 350 Lys Gln Ile Lys Asp Gly His Phe
Gly Leu Ser Gln Glu Asp Leu Asp 355 360 365 Lys101083DNAArtificial
SequenceSynthetic sequence encoding AXN-2 10atgaggatca gaagaaacca
gtctaccttg tctcataacg agaggcttgc tttcactaac 60gctgtgcttg agcttaagag
aaggccatct aggcttccaa tgtctcttgg atctacctcc 120agatacgatg
attacgtgta ctggcacctt caatctatgg aaaaccagac ttctactact
180ccaggatggg ctcatagagg accagctttt ttgccatggc acaggtatta
tctcaaccag 240ttcgaagagg atcttcagag gattgatcat accgttaccc
ttccatattg ggattggacc 300gtggataact ctaccgattc ttctgttcca
ggatctccat ggactgatga tttcatggga 360ggtgatggtg atccaactca
agagtacact gttactactg gaccattcac tggtgataac 420tggaagctca
cccttttcga tcatcatgag aacgaaccac ataacgctag acttagaagg
480caacttggaa ctacccttaa cgcttccgga aacaccattt ccattaacct
tccaaccgat 540tctgaggttc agaactgcct tcttgagact ccttactacg
tttcaccttg gagagctgga 600caagatgtta accagccagc tcttaaccca
actaagccat ctttctgcaa cagacttgag 660ggatggtatg gtgctggatc
tattcataac aaagtgcatg tttgggtggc aggtgctact 720gaaggatcta
tgatctggat gtcctctcca aacgatccag ttttcttcct tcaccacgct
780aacattgata ggctttgggt tcaatggcaa gctaacaacc caaacgaggg
atatcatcca 840actggaaacg gaaacgaagt tggaccaacc ggacataacc
ttaacgattc catgaaccca 900tggggaagaa aggttacccc aaacaacgtt
cttaaccact actctctcgg atacacttac 960gatactgatt ctaccccact
ctccgagatt ttcatgcaca ccttcaacct caagatcagg 1020aaagagaagc
agattaagga cggacatttc ggactttctc aagaggatct cgacaagctc 1080tga
1083114795DNAPseudomonasmisc_difference(1635)...(1865)putative
transcription regulator 11ggtgctgtct gccggggtga ggccgaactc
tacaaaaccg aactgctgca ggtgctggcg 60gacaagtccg ggctgccccc cgatcaagcc
caggaatatc tgcacaacgc aggctatgac 120ctgaaccgcg ccttgagtgc
tctggaacaa gcgcgcttca ccctgacccg gcgcatcctg 180cgcaaacatc
accaggacaa ggcccgggcc ctggacctga tcgcccagtc catcgaaacc
240gccgaacaat tgccacgcca gtactggctg gccttcgagc ggctggagca
actggccccg 300gcaccgcgct gcttgatggt gctccacgaa tggctggcct
tcgaggactg ggaaggcttc 360gacagtgccc tgcactttca tctgccgcag
gccatcgccc agttcaggca cctgcaactg 420gatgccctgg cggacaccct
ggaacaggcc gatcagcgcc agcgacagct acgggcagcc 480catgccgagc
gcgaaagccc catcgagctg gccgtgcagg tcaatcagga tccgttgttc
540aacgcctgcc gggacagttt cagccaacag cggcttcggc tcgacgagcg
cctgtacgaa 600tgggtggagc gccatataga gcagtttcca gcctgagtgg
cgagttcgcg tacgcgcaca 660aaccgctgct acccaccccc ccgaaactcg
ctaaactgcc gccctcaccg aacacggacc 720tcggatcaag ccctcatgga
attgcactac agcatcaccc ccgcccatat ccaggcctgg 780atcgccgaac
cgctgaagca ggaaatgctc aagcatgatc agcagcaggc ccaggcgatg
840gccaatgttg cccggtggca acgccgcttg gtcggcccct tgatgttcgc
cttgtgcctg 900gtcggcggca tgctggcgct ctatttcccc gagcgacgct
tcaccgcgca gaacgtcatt 960gccatggtgc tgttcgcgtt gatcttcatc
ccgctctggt ggcgtttttc cgggcgctgg 1020atcaagcacc tgcaagcgcg
tattgccgcc aaccacgcca agccccgggc gccgttgcgc 1080gggctgaatc
agcggctgat cgaaaccagg ctgcgggcgc cgctgaaatc cgtcgaaagc
1140acctattgcc tgagcttcga cgaccagggc tttaccctcg acaaagcccg
cagcggcaag 1200agcaccctcg cctgggaaca gatcgcgcgc ctgcaggaaa
caccggactt ctacctcgtg 1260gcgagcgcgg acatggtgcg ccagggcgtg
gcctgtctca ttgccaaaca cagtgacctg 1320atgccggccg aggaatatca
gcaagggctg caagcgtttc tgagccagtg cccagtggcg 1380ccctcagcaa
actgatcaca gtgctagcac tgcagaccga tacgcttcga gcactaccgc
1440tcagcacctg cccgcaaaaa caccttgcag gtaaaacaac gccgactatg
ctgccaaaat 1500cagccctaat ggccattggc cttgagcggg ccgacctccg
gttttgtcgt ccgggcatcg 1560caacgctgat tagcagaaag ctatacacct
gacgacgcaa gcgcgttcgc cgcggtgttt 1620cccaggaggg aatcatgcct
gccgcaggac agtacttgat tgcaatcccg gtctacgacg 1680gtgtcgacct
gctcgacgtc agcgccccct acgaactgtt cagtacaacc cctgcccccc
1740ccttcttcag tggcgatcct tccgtcgcca aaccgccggt ctacacccca
ggcgccggaa 1800gcacctgcaa ttgcgctctc gccgagagca tcaccaatgt
gctcaagcgc acaggccctt 1860cctgagaggc ctgcaggcag gtgcacacag
agtagcgctg tcgccgcgaa atgaagtccg 1920gcctcacccc ggcagacagc
acccaacagg aacggaggga aatctcatga gtacttctcg 1980tcaggatgtc
gcaaaactcg gcccagggtg gaacaaggta ttactgaact acgcactggc
2040catgcaagcg ttggatgaac agccaatagc ggaccgcaat agctggaaat
tcctcggggc 2100catgcatggg tttcatcccc aattatggat caacgagcgc
ctgatcaagt caggtgcccc 2160gattccagcg gatttgacca accataccta
cggcaatcag tgtcagcacg gcagttggta 2220ttttctgtcc tggcaccgtg
cgtatctgtt tgcgttcgag gcaatcgtcg ccgctaaagt 2280gaaggagctg
acgggtgacg actgggcact tccctactgg aactacctcg acagcagcaa
2340cccgcaagcg ctgtacctgc ccgatgcctt tgtcgccaag acattacccg
acggaaagcc 2400aaaccctctg aacaagtacc ctcgccgccc cggaatcaag
gcgatcaagc cggtgcgcgg 2460gttcagcctt gaggcgatgg atgagaatga
cttcatcgta ggtaacggga ccctgggatt 2520tggtgggggg attaccggca
atttcgtcca gttcgacggg gttgccggcg agctggagac 2580caacccgcat
aacacggtcc atggcctcgt cggaggatac atgggcaatg ccttgctcgc
2640aggtcttgat ccgatcttct ggcttcacca ttgcaacatc gatcggctct
gggaggcgtg 2700gatgaacacg ccgggcaaga caatggttcg cgatccgctc
tggctcaatg gcccggcaga 2760ccgcagtttc atcatgccag tacccggaga
caatgcacct ggagtaacgt tcaccagtaa 2820ggacacgttg aaaggcggca
aattttatcg gacctatgat gatttgatca tcggcacggg 2880tgtaacgccg
ggagtgcatg ctgtggcacg cgtcaatatg ggctcgccca gtaaacaaac
2940cgttcagcca atcggcgcca atgccgcggt cgtcaagatc ggcggggcgc
cggtaggtac 3000ccatattgac ctcgaaccga cagccgccgc caacagcatg
gccacgatgg gcgcgacatc 3060gccaggcaag gaagtggccc ggctctatct
ttccctggag tctgtgcgcg gctccgcacc 3120ctcgcctctg ctggatgtgt
acgtcaactt gcccgaaggc gccgacccgg cacttagccc 3180tgaccggtac
gccggtagcc ttacgctttt cggtctcaac gttgcctcgc aaacggacgg
3240cccccatgca ggcagcgggt tgggctatac gatcgacatc accgatctgg
cccagcggct 3300ggcggacgcc ggggactttg atccgaatca tctgcgggtg
accctggtgc ctggagagca 3360gataaccgat gaagaaccta taaccgttga
acgcataagc gtactcaagc gaagcggcat 3420cgtcagttga gggggcggcc
tcatgcggcc aggcctggtt ttacgcagtt tcacttatgc 3480cccttggcca
gtgcttctgg ccacggcggg attcggcctg gccctttcca tttacagcga
3540cgcaagtaca gaaggtcccg cgttttgcgt ggccaccaat gggctatcga
tcttcaccag 3600ctggcccgcg gtgctgcaag cggagctcgc ggtgaacccg
atccaccgta tcctggcggg 3660ttggttgttg atgctgttga ccatgatgcc
gccccttctg gcgatgccac tcatgcatgt 3720gtggcgctcc agcctgccaa
acaggcgaat acgtgcgagc gccggtttcc tgctcggcta 3780ctgcgcgccg
tggatggccg caggtctggt cctgtcggcc ctggcgctgt tgctacagat
3840cactgtcgtg gacaacgccc tggcaatagc cctgctgatc gcgttgcttt
ggagcgcaag 3900cccgtggcac cgtgcggcac tcaatcgcag ccatcaaccg
cggcgaatag gcctgttcgg 3960tcgggccgcc gaccgggatt gcctggtctt
cggcatgacg catggggcct attgtattgg 4020ctcgtgctgg gcctggatgt
tggtgcccgt tgtcagcggc gcctggcaca ttccgatgat 4080gctgttcact
ggcgtcatca tgctggctga acggttcacg cctcctggcc ctgcgcgctg
4140gtgctggccc cggttttttt cacccgctca cctatacacc ctcctcaccc
agcgcaatgc 4200ggagcgtccc catggttagg cgtgtttgtc tggcaatcgg
cgtcagcacg gtgacgcctg 4260tcgcgaaact ggccctggac ttcgcctatc
tggatggcgc cgttttcgcc gctcgcgcaa 4320tgggagaatg ggccctgcgc
tcgggtttcg gcgtcgacaa cgtaagagtc gtcgacgacg 4380gctcaaccga
cggcaaagcg aacccggtca cacgagagcg ggtacagcgg gccgtcgatg
4440aactgttccc ggtgggcgcc gaggtcgttg accagttgat cctgtcattt
tgcgggcatg 4500gactgaccgg ggcgaacttc ggctcgatct tttggctgtt
cagcgactcg ctgcacatga 4560agtaccgcat cgtggtcgat gggttctatg
aggaattact tctgcacggc gtcaaacgca 4620taacgcttat taccgacacc
tgccgcgaag cgccgcagag cctggagctg atgcggcttg 4680atggtgtgcg
cgggatcgtt gtacagggca ctcgcgttga cagcccgaga ttcgaccgcc
4740ttgcatcctg ccaggacgga cagctcggct atatggtcta tgaccctgcc gccgc
4795121464DNAPseudomonasCDS(1)...(1464) 12atg agt act tct cgt cag
gat gtc gca aaa ctc ggc cca ggg tgg aac 48Met Ser Thr Ser Arg Gln
Asp Val Ala Lys Leu Gly Pro Gly Trp Asn1 5 10 15aag gta tta ctg aac
tac gcc ctc gcc atg caa gcg ttg gat gaa cag 96Lys Val Leu Leu Asn
Tyr Ala Leu Ala Met Gln Ala Leu Asp Glu Gln 20 25 30cca ata gcg gac
cgc aat agc tgg aaa ttc ctc ggg gcc atg cat ggg 144Pro Ile Ala Asp
Arg Asn Ser Trp Lys Phe Leu Gly Ala Met His Gly 35 40 45ttt cat ccc
caa tta tgg atc aac gag cgc ctg atc aag tca ggt gcc 192Phe His Pro
Gln Leu Trp Ile Asn Glu Arg Leu Ile Lys Ser Gly Ala 50 55 60ccg att
cca gcg gat ttg acc aac cat acc tac ggc aat cag tgt cag 240Pro Ile
Pro Ala Asp Leu Thr Asn His Thr Tyr Gly Asn Gln Cys Gln65 70 75
80cac ggc agt tgg tat ttt ctg tcc tgg cac cgt gcg tat ctg ttt gcg
288His Gly Ser Trp Tyr Phe Leu Ser Trp His Arg Ala Tyr Leu Phe Ala
85 90 95ttc gag gca atc gtc gcc gct aaa gtg aag gag ctg acg ggt gac
gac 336Phe Glu Ala Ile Val Ala Ala Lys Val Lys Glu Leu Thr Gly Asp
Asp 100 105 110tgg gca ctt ccc tac tgg aac tac ctc gac agc agc aac
ccg caa gcg 384Trp Ala Leu Pro Tyr Trp Asn Tyr Leu Asp Ser Ser Asn
Pro Gln Ala 115 120 125ctg tac ctg ccc gat gcc ttt gtc gcc aag aca
tta ccc gac gga aag 432Leu Tyr Leu Pro Asp Ala Phe Val Ala Lys Thr
Leu Pro Asp Gly Lys 130 135 140cca aac cct ctg aac aag tac cct cgc
cgc ccc gga atc aag gcg atc 480Pro Asn Pro Leu Asn Lys Tyr Pro Arg
Arg Pro Gly Ile Lys Ala Ile145 150 155 160aag ccg gtg cgc ggg ttc
agc ctt gag gcg atg gat gag aat gac ttc 528Lys Pro Val Arg Gly Phe
Ser Leu Glu Ala Met Asp Glu Asn Asp Phe 165 170 175atc gta ggt aac
ggg acc ctg gga ttt ggt ggg ggg att acc ggc aat 576Ile Val Gly Asn
Gly Thr Leu Gly Phe Gly Gly Gly Ile Thr Gly Asn 180 185 190ttc gtc
cag ttc gac ggg gtt gcc ggc gag ctg gag acc aac ccg cat 624Phe Val
Gln Phe Asp Gly Val Ala Gly Glu Leu Glu Thr Asn Pro His 195 200
205aac acg gtc cat ggc ctc gtc gga gga tac atg ggc aat gcc ttg ctc
672Asn Thr Val His Gly Leu Val Gly Gly Tyr Met Gly Asn Ala Leu Leu
210 215 220gca ggt ctt gat ccg atc ttc tgg ctt cac cat tgc aac atc
gat cgg 720Ala Gly Leu Asp Pro Ile Phe Trp Leu His His Cys Asn Ile
Asp Arg225 230 235 240ctc tgg gag gcg tgg atg aac acg ccg ggc aag
aca atg gtt cgc gat
768Leu Trp Glu Ala Trp Met Asn Thr Pro Gly Lys Thr Met Val Arg Asp
245 250 255ccg ctc tgg ctc aat ggc ccg gca gac cgc agt ttc atc atg
cca gta 816Pro Leu Trp Leu Asn Gly Pro Ala Asp Arg Ser Phe Ile Met
Pro Val 260 265 270ccc gga gac aat gca cct gga gta acg ttc acc agt
aag gac acg ttg 864Pro Gly Asp Asn Ala Pro Gly Val Thr Phe Thr Ser
Lys Asp Thr Leu 275 280 285aaa ggc ggc aaa ttt tat cgg acc tat gat
gat ttg atc atc ggc acg 912Lys Gly Gly Lys Phe Tyr Arg Thr Tyr Asp
Asp Leu Ile Ile Gly Thr 290 295 300ggt gta acg ccg gga gtg cat gct
gtg gca cgc gtc aat atg ggc tcg 960Gly Val Thr Pro Gly Val His Ala
Val Ala Arg Val Asn Met Gly Ser305 310 315 320ccc agt aaa caa acc
gtt cag cca atc ggc gcc aat gcc gcg gtc gtc 1008Pro Ser Lys Gln Thr
Val Gln Pro Ile Gly Ala Asn Ala Ala Val Val 325 330 335aag atc ggc
ggg gcg ccg gta ggt acc cat att gac ctc gaa ccg aca 1056Lys Ile Gly
Gly Ala Pro Val Gly Thr His Ile Asp Leu Glu Pro Thr 340 345 350gcc
gcc gcc aac agc atg gcc acg atg ggc gcg aca tcg cca ggc aag 1104Ala
Ala Ala Asn Ser Met Ala Thr Met Gly Ala Thr Ser Pro Gly Lys 355 360
365gaa gtg gcc cgg ctc tat ctt tcc ctg gag tct gtg cgc ggc tcc gca
1152Glu Val Ala Arg Leu Tyr Leu Ser Leu Glu Ser Val Arg Gly Ser Ala
370 375 380ccc tcg cct ctg ctg gat gtg tac gtc aac ttg ccc gaa ggc
gcc gac 1200Pro Ser Pro Leu Leu Asp Val Tyr Val Asn Leu Pro Glu Gly
Ala Asp385 390 395 400ccg gca ctt agc cct gac cgg tac gcc ggt agc
ctt acg ctt ttc ggt 1248Pro Ala Leu Ser Pro Asp Arg Tyr Ala Gly Ser
Leu Thr Leu Phe Gly 405 410 415ctc aac gtt gcc tcg caa acg gac ggc
ccc cat gca ggc agc ggg ttg 1296Leu Asn Val Ala Ser Gln Thr Asp Gly
Pro His Ala Gly Ser Gly Leu 420 425 430ggc tat acg atc gac atc acc
gat ctg gcc cag cgg ctg gcg gac gcc 1344Gly Tyr Thr Ile Asp Ile Thr
Asp Leu Ala Gln Arg Leu Ala Asp Ala 435 440 445ggg gac ttt gat ccg
aat cat ctg cgg gtg acc ctg gtg cct gga gag 1392Gly Asp Phe Asp Pro
Asn His Leu Arg Val Thr Leu Val Pro Gly Glu 450 455 460cag ata acc
gat gaa gaa cct ata acc gtt gaa cgc ata agc gta ctc 1440Gln Ile Thr
Asp Glu Glu Pro Ile Thr Val Glu Arg Ile Ser Val Leu465 470 475
480aag cga agc ggc atc gtc agt tga 1464Lys Arg Ser Gly Ile Val Ser
48513487PRTPseudomonas 13Met Ser Thr Ser Arg Gln Asp Val Ala Lys
Leu Gly Pro Gly Trp Asn1 5 10 15 Lys Val Leu Leu Asn Tyr Ala Leu
Ala Met Gln Ala Leu Asp Glu Gln 20 25 30 Pro Ile Ala Asp Arg Asn
Ser Trp Lys Phe Leu Gly Ala Met His Gly 35 40 45 Phe His Pro Gln
Leu Trp Ile Asn Glu Arg Leu Ile Lys Ser Gly Ala 50 55 60 Pro Ile
Pro Ala Asp Leu Thr Asn His Thr Tyr Gly Asn Gln Cys Gln65 70 75 80
His Gly Ser Trp Tyr Phe Leu Ser Trp His Arg Ala Tyr Leu Phe Ala 85
90 95 Phe Glu Ala Ile Val Ala Ala Lys Val Lys Glu Leu Thr Gly Asp
Asp 100 105 110 Trp Ala Leu Pro Tyr Trp Asn Tyr Leu Asp Ser Ser Asn
Pro Gln Ala 115 120 125 Leu Tyr Leu Pro Asp Ala Phe Val Ala Lys Thr
Leu Pro Asp Gly Lys 130 135 140 Pro Asn Pro Leu Asn Lys Tyr Pro Arg
Arg Pro Gly Ile Lys Ala Ile145 150 155 160 Lys Pro Val Arg Gly Phe
Ser Leu Glu Ala Met Asp Glu Asn Asp Phe 165 170 175 Ile Val Gly Asn
Gly Thr Leu Gly Phe Gly Gly Gly Ile Thr Gly Asn 180 185 190 Phe Val
Gln Phe Asp Gly Val Ala Gly Glu Leu Glu Thr Asn Pro His 195 200 205
Asn Thr Val His Gly Leu Val Gly Gly Tyr Met Gly Asn Ala Leu Leu 210
215 220 Ala Gly Leu Asp Pro Ile Phe Trp Leu His His Cys Asn Ile Asp
Arg225 230 235 240 Leu Trp Glu Ala Trp Met Asn Thr Pro Gly Lys Thr
Met Val Arg Asp 245 250 255 Pro Leu Trp Leu Asn Gly Pro Ala Asp Arg
Ser Phe Ile Met Pro Val 260 265 270 Pro Gly Asp Asn Ala Pro Gly Val
Thr Phe Thr Ser Lys Asp Thr Leu 275 280 285 Lys Gly Gly Lys Phe Tyr
Arg Thr Tyr Asp Asp Leu Ile Ile Gly Thr 290 295 300 Gly Val Thr Pro
Gly Val His Ala Val Ala Arg Val Asn Met Gly Ser305 310 315 320 Pro
Ser Lys Gln Thr Val Gln Pro Ile Gly Ala Asn Ala Ala Val Val 325 330
335 Lys Ile Gly Gly Ala Pro Val Gly Thr His Ile Asp Leu Glu Pro Thr
340 345 350 Ala Ala Ala Asn Ser Met Ala Thr Met Gly Ala Thr Ser Pro
Gly Lys 355 360 365 Glu Val Ala Arg Leu Tyr Leu Ser Leu Glu Ser Val
Arg Gly Ser Ala 370 375 380 Pro Ser Pro Leu Leu Asp Val Tyr Val Asn
Leu Pro Glu Gly Ala Asp385 390 395 400 Pro Ala Leu Ser Pro Asp Arg
Tyr Ala Gly Ser Leu Thr Leu Phe Gly 405 410 415 Leu Asn Val Ala Ser
Gln Thr Asp Gly Pro His Ala Gly Ser Gly Leu 420 425 430 Gly Tyr Thr
Ile Asp Ile Thr Asp Leu Ala Gln Arg Leu Ala Asp Ala 435 440 445 Gly
Asp Phe Asp Pro Asn His Leu Arg Val Thr Leu Val Pro Gly Glu 450 455
460 Gln Ile Thr Asp Glu Glu Pro Ile Thr Val Glu Arg Ile Ser Val
Leu465 470 475 480 Lys Arg Ser Gly Ile Val Ser 485
14295PRTArtificial SequencePredicted processed AXN-8 protein 14Met
Ser Thr Ser Arg Gln Asp Val Ala Lys Leu Gly Pro Gly Trp Asn1 5 10
15 Lys Val Leu Leu Asn Tyr Ala Leu Ala Met Gln Ala Leu Asp Glu Gln
20 25 30 Pro Ile Ala Asp Arg Asn Ser Trp Lys Phe Leu Gly Ala Met
His Gly 35 40 45 Phe His Pro Gln Leu Trp Ile Asn Glu Arg Leu Ile
Lys Ser Gly Ala 50 55 60 Pro Ile Pro Ala Asp Leu Thr Asn His Thr
Tyr Gly Asn Gln Cys Gln65 70 75 80 His Gly Ser Trp Tyr Phe Leu Ser
Trp His Arg Ala Tyr Leu Phe Ala 85 90 95 Phe Glu Ala Ile Val Ala
Ala Lys Val Lys Glu Leu Thr Gly Asp Asp 100 105 110 Trp Ala Leu Pro
Tyr Trp Asn Tyr Leu Asp Ser Ser Asn Pro Gln Ala 115 120 125 Leu Tyr
Leu Pro Asp Ala Phe Val Ala Lys Thr Leu Pro Asp Gly Lys 130 135 140
Pro Asn Pro Leu Asn Lys Tyr Pro Arg Arg Pro Gly Ile Lys Ala Ile145
150 155 160 Lys Pro Val Arg Gly Phe Ser Leu Glu Ala Met Asp Glu Asn
Asp Phe 165 170 175 Ile Val Gly Asn Gly Thr Leu Gly Phe Gly Gly Gly
Ile Thr Gly Asn 180 185 190 Phe Val Gln Phe Asp Gly Val Ala Gly Glu
Leu Glu Thr Asn Pro His 195 200 205 Asn Thr Val His Gly Leu Val Gly
Gly Tyr Met Gly Asn Ala Leu Leu 210 215 220 Ala Gly Leu Asp Pro Ile
Phe Trp Leu His His Cys Asn Ile Asp Arg225 230 235 240 Leu Trp Glu
Ala Trp Met Asn Thr Pro Gly Lys Thr Met Val Arg Asp 245 250 255 Pro
Leu Trp Leu Asn Gly Pro Ala Asp Arg Ser Phe Ile Met Pro Val 260 265
270 Pro Gly Asp Asn Ala Pro Gly Val Thr Phe Thr Ser Lys Asp Thr Leu
275 280 285 Lys Gly Gly Lys Phe Tyr Arg 290 295 151464DNAArtificial
SequenceSynthetic sequence encoding AXN-8 15atgtctacct ctaggcaaga
tgttgctaag ttgggaccag gatggaacaa ggtgttgctt 60aactacgctc ttgctatgca
agctcttgat gagcaaccta tcgctgatag aaactcctgg 120aagttccttg
gagctatgca tggattccat ccacagcttt ggattaacga gaggctcatt
180aagtctggtg ctccaattcc agctgatctt accaaccata cctatggaaa
ccagtgccaa 240catggatctt ggtatttctt gtcttggcac agggcttatc
ttttcgcttt cgaggctatt 300gtggctgcta aggtgaaaga actcaccggt
gatgattggg ctttgccata ctggaactac 360cttgattctt ctaaccctca
ggctctttat cttccagatg ctttcgttgc taagactctc 420ccagatggaa
agccaaaccc acttaacaag tacccaagaa ggccaggtat taaggctatt
480aagccagtga gaggattctc tttggaagct atggatgaga acgatttcat
tgtgggaaac 540ggaactcttg gattcggagg tggaattacc ggaaacttcg
ttcaattcga tggtgttgct 600ggtgaacttg agactaaccc acataacacc
gttcatggac ttgttggagg ttatatggga 660aacgctctcc ttgctggact
tgatccaatt ttctggcttc accactgcaa cattgataga 720ctttgggagg
cttggatgaa cactcctgga aagactatgg tgcgtgatcc actttggctt
780aacggaccag ctgatagatc tttcatcatg ccagtgccag gtgataacgc
tccaggtgtt 840actttcacct ctaaggatac ccttaagggt ggaaagttct
acaggaccta cgatgatctc 900attattggaa ccggtgttac tccaggtgtt
catgctgttg ctagggttaa catgggatct 960ccatctaagc aaaccgttca
gccaattgga gctaacgctg ctgttgttaa gattggaggt 1020gctccagttg
gaacccatat tgatcttgag ccaactgctg ctgctaactc tatggctact
1080atgggagcta cttctccagg aaaagaggtt gcaaggcttt acttgtctct
tgagtctgtt 1140agaggatctg ctccttctcc acttcttgat gtgtacgtga
accttccaga aggtgctgat 1200ccagctttgt ctccagatag atacgctgga
tctcttaccc ttttcggact taacgttgct 1260tctcaaaccg atggaccaca
tgctggatct ggacttggat acaccatcga tattaccgat 1320cttgctcaga
gacttgctga tgctggtgat ttcgatccaa accatcttag ggttaccctt
1380gttccaggtg aacaaatcac tgacgaggaa cctattaccg ttgagaggat
ttctgtgctt 1440aagagatccg gaattgtgtc ctga 1464162205DNAGlycine max
16gatacgcaat ttggagaaag gaagaagata agctagctaa ggcagcaatg ggtaatcctt
60ctaagctttt cccatttttc tttgcattca ttgtgtttct gatgccctta gtttccttat
120cccacaatga cttctctacc tttgccataa aaaccgtttc atacctagtt
tcctttagtg 180aaaatccaaa ccataatggc cacatcacca caagctccaa
tgaaagagac aaatcacgtc 240tttggaggaa agccttcatt ggcttaaaaa
atactcacga gccatcttcg aatatttctc 300gagcaatatc ccttaatgca
agagagtgtt ttcctgtgga gttaccttct gatgcaataa 360cttctacccg
ttgttgtcca cctaggccat ctccttctaa tatcatagat ttcaaagatt
420ttgcttctcc aaacgccacg cttcgagtaa gaaaacctgc tcacatggta
gatgaggagt 480acatagcaaa acttgaaaag ggcattgcac tcatgaaagc
actccctgat gatgacccac 540gtaatttcat tcaacaagca aaggtccatt
gcgcttattg taacggtgcc tatcacctac 600gccatccctt tcaggacaca
aaactcaaca ttcacaggtc ttggtttttc tttccctttc 660atcgttggta
cctttacttc tttgagagaa ctttgggaaa attaattggt gacccaaact
720ttgccctacc cttttggaat tgggattctg tagaagggat gcaaattcca
tcatatttca 780ataaccctaa ttcgtcgctt tatcaccaac tccgaaacca
aaaccacttg cccccacacg 840tggttgatct gaactacaat aaacttgatc
ctaatgatga tacgccttct catcaacaag 900tttcgtataa tctagccttc
atgtacaagc aaatggtgct agcaagtacc aaagaattgt 960tcatgggaag
cccttttcgc cttggcgata accctactcc gggtatgggc tctatagagg
1020ctgctcctca taacactgtt catacatggg tgggtgctgc tgataagcca
caccatgaag 1080acatgggagc attctacaca gcagctagag accccatttt
ctacgctcat cacccgaact 1140cggatcgatt gtgggggcta tggaagacat
tggaaggagg aagaaaggac tatagtgatg 1200atccagattg gttagattct
gagttttact tctatgatga gaatgccaat tttgttcgtg 1260ttaaggtaag
agattgcctt gatactaaaa aattagggta tgtttacgaa gatgttgatc
1320ttccatggct gcgaacgcca cccacatcgc cgaaaagcaa gctactgaga
gaagcgaaga 1380agagtccact tttgagttca aagccaagca aatttccttt
ggttttggat tccataacga 1440gtaccgttgt taagaggccg aagaaattga
gaagcaagga agagaaagaa caagaggagg 1500aggttttggt gatagaaggg
attgagtttg gaagtgataa atatgtaaag tttgatgttc 1560atattgatga
tgatgaagat aatttgagtg aaccggatca gacagagttt gtgggaactt
1620ttgttaattt gttccatgga caaggccata atatcaacac tagttttaag
gtagggatat 1680cgaaagtgct ggagtgttta gaagctgaag aagatgatgt
tgtgctggtt actttggtgc 1740ctaaggttgg gaaaggagat gtcatcatag
gaggcatcaa aattgagttt attccaaagt 1800agaaagatta gttttgttgt
tgtgtgtgca aatttaatac acttattaca ggtttattgt 1860tttatgcttt
taaaaagtac actttctttt ttggtttagc atctcgagct cgtattctca
1920gtggctggat tttgtccaac caactgaaat atgagatgtc gaatttgctt
tggtatagcg 1980atagtagaag aagggaaaga agggaaagag tgtgaaggac
agctgaaatt ttggatgcgg 2040agaagtactc ttctacaagt atagatgagt
gtttttgaag aaaatcaaat aaatcaattt 2100gattttctag aattaatttt
cataataaaa tatgagtctg gtgtaaaaat ttgtatttga 2160tttttttttt
atgtgaaagg tgattttagc aaaaaaaaaa aaaaa 2205171509DNAArtificial
SequenceSynthetic sequence encoding the polyphenol oxidase from
Glycine max 17atg tcc agg gct att tct ctt aac gct aga gag tgc ttc
cca gtt gaa 48Met Ser Arg Ala Ile Ser Leu Asn Ala Arg Glu Cys Phe
Pro Val Glu1 5 10 15ctt cca tct gat gct att act tct act aga tgc tgc
cca cca aga cca 96Leu Pro Ser Asp Ala Ile Thr Ser Thr Arg Cys Cys
Pro Pro Arg Pro 20 25 30tct cca tcc aac atc atc gac ttc aag gat ttc
gct tct cca aac gct 144Ser Pro Ser Asn Ile Ile Asp Phe Lys Asp Phe
Ala Ser Pro Asn Ala 35 40 45act ctt aga gtt aga aag cca gct cat atg
gtg gat gaa gag tac att 192Thr Leu Arg Val Arg Lys Pro Ala His Met
Val Asp Glu Glu Tyr Ile 50 55 60gca aag ctc gag aag gga att gct ctt
atg aag gct ctc cca gat gat 240Ala Lys Leu Glu Lys Gly Ile Ala Leu
Met Lys Ala Leu Pro Asp Asp65 70 75 80gat cct agg aac ttc att cag
cag gct aag gtt cac tgc gct tat tgc 288Asp Pro Arg Asn Phe Ile Gln
Gln Ala Lys Val His Cys Ala Tyr Cys 85 90 95aac ggt gct tac cat ctt
aga cac cca ttc cag gat acc aag ctc aac 336Asn Gly Ala Tyr His Leu
Arg His Pro Phe Gln Asp Thr Lys Leu Asn 100 105 110att cat agg tcc
tgg ttc ttt ttc cca ttc cac cgt tgg tat ctc tat 384Ile His Arg Ser
Trp Phe Phe Phe Pro Phe His Arg Trp Tyr Leu Tyr 115 120 125ttc ttc
gag agg acc ctt gga aag ttg att ggc gat cca aac ttc gct 432Phe Phe
Glu Arg Thr Leu Gly Lys Leu Ile Gly Asp Pro Asn Phe Ala 130 135
140ttg cca ttc tgg aac tgg gat tct gtt gag gga atg caa atc cca tcc
480Leu Pro Phe Trp Asn Trp Asp Ser Val Glu Gly Met Gln Ile Pro
Ser145 150 155 160tac ttc aac aac cca aac tct tca ctt tac cac caa
ctc agg aac cag 528Tyr Phe Asn Asn Pro Asn Ser Ser Leu Tyr His Gln
Leu Arg Asn Gln 165 170 175aac cat ctt cca cca cat gtt gtg gat ctc
aac tac aac aag ctc gat 576Asn His Leu Pro Pro His Val Val Asp Leu
Asn Tyr Asn Lys Leu Asp 180 185 190cca aac gat gat act cca tct cat
cag cag gtg tca tac aac ctt gcc 624Pro Asn Asp Asp Thr Pro Ser His
Gln Gln Val Ser Tyr Asn Leu Ala 195 200 205ttc atg tac aag cag atg
gtt ctt gct tct acc aaa gaa ctc ttc atg 672Phe Met Tyr Lys Gln Met
Val Leu Ala Ser Thr Lys Glu Leu Phe Met 210 215 220gga tct cca ttc
aga ctt gga gat aac cca act cca gga atg gga tct 720Gly Ser Pro Phe
Arg Leu Gly Asp Asn Pro Thr Pro Gly Met Gly Ser225 230 235 240att
gaa gct gct cca cat aac act gtt cat act tgg gtt ggt gct gct 768Ile
Glu Ala Ala Pro His Asn Thr Val His Thr Trp Val Gly Ala Ala 245 250
255gat aag cca cat cat gag gat atg gga gct ttc tat act gct gct agg
816Asp Lys Pro His His Glu Asp Met Gly Ala Phe Tyr Thr Ala Ala Arg
260 265 270gac cca att ttc tac gct cat cac cca aac tct gat aga ctt
tgg gga 864Asp Pro Ile Phe Tyr Ala His His Pro Asn Ser Asp Arg Leu
Trp Gly 275 280 285ctt tgg aaa act ctt gag ggc gga aga aag gat tat
tcc gat gat cca 912Leu Trp Lys Thr Leu Glu Gly Gly Arg Lys Asp Tyr
Ser Asp Asp Pro 290 295 300gat tgg ctt gat tcc gag ttc tac ttc tac
gat gag aac gct aac ttt 960Asp Trp Leu Asp Ser Glu Phe Tyr Phe Tyr
Asp Glu Asn Ala Asn Phe305 310 315 320gtt agg gtg aaa gtg agg gat
tgc ctt gat aca aag aag ctc ggc tac 1008Val Arg Val Lys Val Arg Asp
Cys Leu Asp Thr Lys Lys Leu Gly Tyr 325 330 335gtt tac gaa gat gtg
gat ctt cca tgg ctt aga act cca cca act tct 1056Val Tyr Glu Asp Val
Asp Leu Pro Trp Leu Arg Thr Pro Pro Thr Ser 340 345 350cca aag tct
aag ctc ctt aga gag gct aag aag tct cca ctt ttg tcc 1104Pro Lys Ser
Lys Leu Leu Arg Glu Ala Lys Lys Ser Pro Leu Leu Ser 355 360 365tct
aag
cca tct aag ttc cca ctt gtg ctc gat tct att acc tct acc 1152Ser Lys
Pro Ser Lys Phe Pro Leu Val Leu Asp Ser Ile Thr Ser Thr 370 375
380gtt gtg aag agg cca aag aag ttg agg tcc aaa gaa gag aaa gag caa
1200Val Val Lys Arg Pro Lys Lys Leu Arg Ser Lys Glu Glu Lys Glu
Gln385 390 395 400gag gaa gag gtt ttg gtt att gag gga att gag ttc
ggt tct gac aag 1248Glu Glu Glu Val Leu Val Ile Glu Gly Ile Glu Phe
Gly Ser Asp Lys 405 410 415tac gtt aag ttc gac gtg cac atc gat gat
gat gag gat aac ctt tct 1296Tyr Val Lys Phe Asp Val His Ile Asp Asp
Asp Glu Asp Asn Leu Ser 420 425 430gag cca gat caa act gag ttc gtt
ggt act ttc gtg aac ctt ttc cat 1344Glu Pro Asp Gln Thr Glu Phe Val
Gly Thr Phe Val Asn Leu Phe His 435 440 445gga cag gga cac aac att
aac acc agc ttc aaa gtg gga att tct aag 1392Gly Gln Gly His Asn Ile
Asn Thr Ser Phe Lys Val Gly Ile Ser Lys 450 455 460gtg ttg gag tgc
ctt gaa gct gaa gag gat gat gtt gtg ctt gtt acc 1440Val Leu Glu Cys
Leu Glu Ala Glu Glu Asp Asp Val Val Leu Val Thr465 470 475 480ctt
gtt cca aaa gtg gga aag ggt gat gtg att att gga ggc atc aag 1488Leu
Val Pro Lys Val Gly Lys Gly Asp Val Ile Ile Gly Gly Ile Lys 485 490
495atc gag ttc atc cca aag tga 1509Ile Glu Phe Ile Pro Lys
50018502PRTArtificial SequenceAmino acid sequence encoded by the
synthetic sequence encoding the polyphenol oxidase from Glycine max
18Met Ser Arg Ala Ile Ser Leu Asn Ala Arg Glu Cys Phe Pro Val Glu1
5 10 15 Leu Pro Ser Asp Ala Ile Thr Ser Thr Arg Cys Cys Pro Pro Arg
Pro 20 25 30 Ser Pro Ser Asn Ile Ile Asp Phe Lys Asp Phe Ala Ser
Pro Asn Ala 35 40 45 Thr Leu Arg Val Arg Lys Pro Ala His Met Val
Asp Glu Glu Tyr Ile 50 55 60 Ala Lys Leu Glu Lys Gly Ile Ala Leu
Met Lys Ala Leu Pro Asp Asp65 70 75 80 Asp Pro Arg Asn Phe Ile Gln
Gln Ala Lys Val His Cys Ala Tyr Cys 85 90 95 Asn Gly Ala Tyr His
Leu Arg His Pro Phe Gln Asp Thr Lys Leu Asn 100 105 110 Ile His Arg
Ser Trp Phe Phe Phe Pro Phe His Arg Trp Tyr Leu Tyr 115 120 125 Phe
Phe Glu Arg Thr Leu Gly Lys Leu Ile Gly Asp Pro Asn Phe Ala 130 135
140 Leu Pro Phe Trp Asn Trp Asp Ser Val Glu Gly Met Gln Ile Pro
Ser145 150 155 160 Tyr Phe Asn Asn Pro Asn Ser Ser Leu Tyr His Gln
Leu Arg Asn Gln 165 170 175 Asn His Leu Pro Pro His Val Val Asp Leu
Asn Tyr Asn Lys Leu Asp 180 185 190 Pro Asn Asp Asp Thr Pro Ser His
Gln Gln Val Ser Tyr Asn Leu Ala 195 200 205 Phe Met Tyr Lys Gln Met
Val Leu Ala Ser Thr Lys Glu Leu Phe Met 210 215 220 Gly Ser Pro Phe
Arg Leu Gly Asp Asn Pro Thr Pro Gly Met Gly Ser225 230 235 240 Ile
Glu Ala Ala Pro His Asn Thr Val His Thr Trp Val Gly Ala Ala 245 250
255 Asp Lys Pro His His Glu Asp Met Gly Ala Phe Tyr Thr Ala Ala Arg
260 265 270 Asp Pro Ile Phe Tyr Ala His His Pro Asn Ser Asp Arg Leu
Trp Gly 275 280 285 Leu Trp Lys Thr Leu Glu Gly Gly Arg Lys Asp Tyr
Ser Asp Asp Pro 290 295 300 Asp Trp Leu Asp Ser Glu Phe Tyr Phe Tyr
Asp Glu Asn Ala Asn Phe305 310 315 320 Val Arg Val Lys Val Arg Asp
Cys Leu Asp Thr Lys Lys Leu Gly Tyr 325 330 335 Val Tyr Glu Asp Val
Asp Leu Pro Trp Leu Arg Thr Pro Pro Thr Ser 340 345 350 Pro Lys Ser
Lys Leu Leu Arg Glu Ala Lys Lys Ser Pro Leu Leu Ser 355 360 365 Ser
Lys Pro Ser Lys Phe Pro Leu Val Leu Asp Ser Ile Thr Ser Thr 370 375
380 Val Val Lys Arg Pro Lys Lys Leu Arg Ser Lys Glu Glu Lys Glu
Gln385 390 395 400 Glu Glu Glu Val Leu Val Ile Glu Gly Ile Glu Phe
Gly Ser Asp Lys 405 410 415 Tyr Val Lys Phe Asp Val His Ile Asp Asp
Asp Glu Asp Asn Leu Ser 420 425 430 Glu Pro Asp Gln Thr Glu Phe Val
Gly Thr Phe Val Asn Leu Phe His 435 440 445 Gly Gln Gly His Asn Ile
Asn Thr Ser Phe Lys Val Gly Ile Ser Lys 450 455 460 Val Leu Glu Cys
Leu Glu Ala Glu Glu Asp Asp Val Val Leu Val Thr465 470 475 480 Leu
Val Pro Lys Val Gly Lys Gly Asp Val Ile Ile Gly Gly Ile Lys 485 490
495 Ile Glu Phe Ile Pro Lys 500 192404DNAHypocrea jecorina
19atgctgttgt cagcgtccct ctcggcgttg gccttggcca cagtttcact cgcacagggc
60acgacacaca tccccgtcac cggtgttccc gtctctcctg gtgctgccgt gccgctgaga
120cagaacatca atgacctggc caagtccggg ccgcaatggt gagtgacgcc
ctccttccac 180cacactttac ctcagtcaag agacaagagg gagacaagta
caaagcggat gaaaagaggt 240ggacaagaga gagagagaga gaaagtgtgt
gtgtgtatgt gagagcgaga gagagagaga 300gagacaagag ctattggatg
gaccaggagc cagcatggag aacaggggga gacttgacga 360ttcgaggaga
ggggggctca catgtgcgtg cgaataggga tctctacgtt caggccatgt
420acaacatgtc caagatggac tcccatgacc cgtacagctt cttccagatt
gccggtaaat 480atacatctcg gcctcctgcg aggcgacgtg actctcggag
cttttagtaa caccagctag 540gcatccacgg cgcaccgtac attgagtaca
acaaggccgg agcaaagtcg ggcgatggct 600ggctgggcta ctgccctcac
ggtgtatgtg tttttgtcca tcgaggaggg cgcaagagtt 660tcatggactt
gaactcttcg cccttgttgt gagccggaaa tcatcgtctc tgacagtttc
720attaggagga cctcttcatc agctggcacc gcccctatgt cctgctcttt
gaggtatgat 780ttgaccacgc tggactttga cctcatacaa acatcaactg
acatcgttgc agcaagcctt 840ggtctccgtc gccaagggca tcgccaactc
gtatcccccg tctgtccgcg ccaagtacca 900ggctgccgcc gccagcctgc
gcgcccccta ctgggactgg gccgccgaca gctccgtgcc 960cgccgtcacc
gtcccccaga cgctcaagat caacgtcccc agcggcagca gcaccaagac
1020cgtcgactac accaacccgc tcaagacgta ctacttcccg cgcatgtcct
tgaccggctc 1080gtacggcgag ttcaccggcg gaggcaacga ccacaccgtc
cgctgcgccg cctccaagca 1140gagctatccc gccaccgcca actccaacct
ggctgcccgt ccttacaagt cctggatcgt 1200acgtagtccc cctttccctt
tggaagcttc cccttgagta aagctcgtca ctgacacaga 1260gagcggcccg
cagtacgatg tcctgaccaa ctctcaaaac tttgccgact tcgcctccac
1320cagcggcccc ggcatcaacg ttgagcagat ccacaacgcc atccactggg
acggtgcttg 1380cggctcccag ttcctcgccc ccgactactc cggcttcgac
cccctgttgt aagtcaatcg 1440agacgtcaag agtcatcttg tcaacaaccg
atggcaaacg cagtctgtac tgacgctgca 1500aaatagcttc atgcaccacg
cccaggtcga ccgcatgtgg gccttctggg aggccatcat 1560gccctcgtcg
cccctcttca cggcctcgta caagggccag tcgcgcttca actccaagtc
1620gggcagcacc atcacccccg actcgcccct gcagcccttc taccaggcca
acggcaagtt 1680ccacacgtcc aacacggtca agagcatcca gggcatgggc
tactcgtacc agggcatcga 1740gtactggcaa aagtcccagg cccagatcaa
gtcgagcgtc accaccatca tcaaccagct 1800gtacgggccc aactcgggca
agaagcgcaa cgccccgcgc gacttcttga gcgacattgt 1860caccgacgtc
gagaacctca tcaagacccg ttactttgcc aagatctcgg tcaacgtgac
1920cgaggtgacg gtccgccccg ccgagatcaa cgtctacgtc ggcggccaga
aggccggcag 1980cttgatcgtc atgaagctcc ccgccgaggg cacggtcaac
ggcggcttca ccattgacaa 2040ccccatgcaa agcatcctgc acggtggtct
ccgcaacgcc gtccaggcct ttaccgagga 2100cattgaggtt gagattctct
ctgtaagttt tcccccctct ctccactccc gaccactcac 2160tgtcactatt
tcgactagtc accgtcaaga tgtgtatttg tttgctgacc cccaagcgca
2220gaaggacgga caagccatcc ccctcgagac ggtccccagc ctgtccatcg
acctcgaggt 2280cgccaacgtc accctgccct ccgccctcga ccagctgccc
aagtacggcc agcgctccag 2340gcaccgcgcc aaggccgccc agcgcggaca
ccgctttgcc gttccccata tccctcctct 2400gtaa 240420561PRTHypocrea
jecorina 20Met Leu Leu Ser Ala Ser Leu Ser Ala Leu Ala Leu Ala Thr
Val Ser1 5 10 15 Leu Ala Gln Gly Thr Thr His Ile Pro Val Thr Gly
Val Pro Val Ser 20 25 30 Pro Gly Ala Ala Val Pro Leu Arg Gln Asn
Ile Asn Asp Leu Ala Lys 35 40 45 Ser Gly Pro Gln Trp Asp Leu Tyr
Val Gln Ala Met Tyr Asn Met Ser 50 55 60 Lys Met Asp Ser His Asp
Pro Tyr Ser Phe Phe Gln Ile Ala Gly Ile65 70 75 80 His Gly Ala Pro
Tyr Ile Glu Tyr Asn Lys Ala Gly Ala Lys Ser Gly 85 90 95 Asp Gly
Trp Leu Gly Tyr Cys Pro His Gly Glu Asp Leu Phe Ile Ser 100 105 110
Trp His Arg Pro Tyr Val Leu Leu Phe Glu Gln Ala Leu Val Ser Val 115
120 125 Ala Lys Gly Ile Ala Asn Ser Tyr Pro Pro Ser Val Arg Ala Lys
Tyr 130 135 140 Gln Ala Ala Ala Ala Ser Leu Arg Ala Pro Tyr Trp Asp
Trp Ala Ala145 150 155 160 Asp Ser Ser Val Pro Ala Val Thr Val Pro
Gln Thr Leu Lys Ile Asn 165 170 175 Val Pro Ser Gly Ser Ser Thr Lys
Thr Val Asp Tyr Thr Asn Pro Leu 180 185 190 Lys Thr Tyr Tyr Phe Pro
Arg Met Ser Leu Thr Gly Ser Tyr Gly Glu 195 200 205 Phe Thr Gly Gly
Gly Asn Asp His Thr Val Arg Cys Ala Ala Ser Lys 210 215 220 Gln Ser
Tyr Pro Ala Thr Ala Asn Ser Asn Leu Ala Ala Arg Pro Tyr225 230 235
240 Lys Ser Trp Ile Tyr Asp Val Leu Thr Asn Ser Gln Asn Phe Ala Asp
245 250 255 Phe Ala Ser Thr Ser Gly Pro Gly Ile Asn Val Glu Gln Ile
His Asn 260 265 270 Ala Ile His Trp Asp Gly Ala Cys Gly Ser Gln Phe
Leu Ala Pro Asp 275 280 285 Tyr Ser Gly Phe Asp Pro Leu Phe Phe Met
His His Ala Gln Val Asp 290 295 300 Arg Met Trp Ala Phe Trp Glu Ala
Ile Met Pro Ser Ser Pro Leu Phe305 310 315 320 Thr Ala Ser Tyr Lys
Gly Gln Ser Arg Phe Asn Ser Lys Ser Gly Ser 325 330 335 Thr Ile Thr
Pro Asp Ser Pro Leu Gln Pro Phe Tyr Gln Ala Asn Gly 340 345 350 Lys
Phe His Thr Ser Asn Thr Val Lys Ser Ile Gln Gly Met Gly Tyr 355 360
365 Ser Tyr Gln Gly Ile Glu Tyr Trp Gln Lys Ser Gln Ala Gln Ile Lys
370 375 380 Ser Ser Val Thr Thr Ile Ile Asn Gln Leu Tyr Gly Pro Asn
Ser Gly385 390 395 400 Lys Lys Arg Asn Ala Pro Arg Asp Phe Leu Ser
Asp Ile Val Thr Asp 405 410 415 Val Glu Asn Leu Ile Lys Thr Arg Tyr
Phe Ala Lys Ile Ser Val Asn 420 425 430 Val Thr Glu Val Thr Val Arg
Pro Ala Glu Ile Asn Val Tyr Val Gly 435 440 445 Gly Gln Lys Ala Gly
Ser Leu Ile Val Met Lys Leu Pro Ala Glu Gly 450 455 460 Thr Val Asn
Gly Gly Phe Thr Ile Asp Asn Pro Met Gln Ser Ile Leu465 470 475 480
His Gly Gly Leu Arg Asn Ala Val Gln Ala Phe Thr Glu Asp Ile Glu 485
490 495 Val Glu Ile Leu Ser Lys Asp Gly Gln Ala Ile Pro Leu Glu Thr
Val 500 505 510 Pro Ser Leu Ser Ile Asp Leu Glu Val Ala Asn Val Thr
Leu Pro Ser 515 520 525 Ala Leu Asp Gln Leu Pro Lys Tyr Gly Gln Arg
Ser Arg His Arg Ala 530 535 540 Lys Ala Ala Gln Arg Gly His Arg Phe
Ala Val Pro His Ile Pro Pro545 550 555 560 Leu211686DNAArtificial
SequenceSynthetic sequence encoding the polyphenol oxidase from T.
reesei 21atg ctt ctt tct gct tct ctt tct gct ctt gct ctt gct act
gtt tct 48Met Leu Leu Ser Ala Ser Leu Ser Ala Leu Ala Leu Ala Thr
Val Ser1 5 10 15ctt gct cag gga acc act cat att cca gtt act ggt gtt
cca gtt tct 96Leu Ala Gln Gly Thr Thr His Ile Pro Val Thr Gly Val
Pro Val Ser 20 25 30cca ggt gct gct gtt cca ctt agg cag aac att aac
gat ctt gct aag 144Pro Gly Ala Ala Val Pro Leu Arg Gln Asn Ile Asn
Asp Leu Ala Lys 35 40 45tct gga cca caa tgg gat ctt tac gtt cag gcc
atg tac aac atg tct 192Ser Gly Pro Gln Trp Asp Leu Tyr Val Gln Ala
Met Tyr Asn Met Ser 50 55 60aag atg gat tcc cac gac cca tat tca ttc
ttc cag atc gct ggt att 240Lys Met Asp Ser His Asp Pro Tyr Ser Phe
Phe Gln Ile Ala Gly Ile65 70 75 80cat ggt gct ccc tac att gag tat
aac aag gct ggt gct aag tca ggt 288His Gly Ala Pro Tyr Ile Glu Tyr
Asn Lys Ala Gly Ala Lys Ser Gly 85 90 95gat gga tgg ctt gga tat tgc
cca cat ggt gaa gat ctt ttc att tcc 336Asp Gly Trp Leu Gly Tyr Cys
Pro His Gly Glu Asp Leu Phe Ile Ser 100 105 110tgg cat agg cca tac
gtt ctt ttg ttc gag cag gct ctt gtt tct gtt 384Trp His Arg Pro Tyr
Val Leu Leu Phe Glu Gln Ala Leu Val Ser Val 115 120 125gct aag ggt
atc gct aac tct tat cca cca tct gtt agg gct aag tat 432Ala Lys Gly
Ile Ala Asn Ser Tyr Pro Pro Ser Val Arg Ala Lys Tyr 130 135 140caa
gct gct gct gct tct ctt agg gct cca tat tgg gat tgg gct gct 480Gln
Ala Ala Ala Ala Ser Leu Arg Ala Pro Tyr Trp Asp Trp Ala Ala145 150
155 160gat tct tct gtt cca gct gtt act gtt cca cag acc ctc aag att
aac 528Asp Ser Ser Val Pro Ala Val Thr Val Pro Gln Thr Leu Lys Ile
Asn 165 170 175gtt cca tct gga tct tct acc aag acc gtg gat tac act
aac cca ctc 576Val Pro Ser Gly Ser Ser Thr Lys Thr Val Asp Tyr Thr
Asn Pro Leu 180 185 190aag act tac tat ttc cca agg atg tct ctt act
gga tct tac ggt gag 624Lys Thr Tyr Tyr Phe Pro Arg Met Ser Leu Thr
Gly Ser Tyr Gly Glu 195 200 205ttc act ggt gga gga aac gat cat act
gtt aga tgc gct gct tct aag 672Phe Thr Gly Gly Gly Asn Asp His Thr
Val Arg Cys Ala Ala Ser Lys 210 215 220caa tct tac cca gct act gct
aac tct aac ctt gct gct aga cca tac 720Gln Ser Tyr Pro Ala Thr Ala
Asn Ser Asn Leu Ala Ala Arg Pro Tyr225 230 235 240aag tcc tgg atc
tac gat gtt ctt acc aac tct cag aac ttc gct gat 768Lys Ser Trp Ile
Tyr Asp Val Leu Thr Asn Ser Gln Asn Phe Ala Asp 245 250 255ttc gct
tct act tcc gga cca ggt att aac gtt gag cag atc cac aac 816Phe Ala
Ser Thr Ser Gly Pro Gly Ile Asn Val Glu Gln Ile His Asn 260 265
270gct att cat tgg gat ggt gct tgc gga tct caa ttc ctt gct cca gat
864Ala Ile His Trp Asp Gly Ala Cys Gly Ser Gln Phe Leu Ala Pro Asp
275 280 285tac tct gga ttc gac cca ctt ttc ttc atg cat cat gct caa
gtt gat 912Tyr Ser Gly Phe Asp Pro Leu Phe Phe Met His His Ala Gln
Val Asp 290 295 300agg atg tgg gct ttc tgg gaa gct att atg cca tct
tct cca ctt ttc 960Arg Met Trp Ala Phe Trp Glu Ala Ile Met Pro Ser
Ser Pro Leu Phe305 310 315 320acc gct tca tac aag gga caa tcc agg
ttc aac tct aag tct ggt tct 1008Thr Ala Ser Tyr Lys Gly Gln Ser Arg
Phe Asn Ser Lys Ser Gly Ser 325 330 335acc att act cca gat tct cca
ctt caa cca ttc tac cag gct aac gga 1056Thr Ile Thr Pro Asp Ser Pro
Leu Gln Pro Phe Tyr Gln Ala Asn Gly 340 345 350aag ttc cat acc tct
aac acc gtg aag tct att cag gga atg gga tac 1104Lys Phe His Thr Ser
Asn Thr Val Lys Ser Ile Gln Gly Met Gly Tyr 355 360 365tct tac cag
gga att gag tac tgg caa aag tct cag gct cag att aag 1152Ser Tyr Gln
Gly Ile Glu Tyr Trp Gln Lys Ser Gln Ala Gln Ile Lys 370 375 380tca
tct gtg acc acc att atc aac cag ctt tac gga cca aac tct gga 1200Ser
Ser Val Thr Thr Ile Ile Asn Gln Leu Tyr Gly Pro Asn Ser Gly385 390
395 400aag aag aga aac gct cca agg gat ttc ctt tcc gat att gtg acc
gat 1248Lys Lys Arg Asn Ala Pro Arg Asp Phe Leu Ser Asp Ile Val Thr
Asp 405 410 415gtg gag aac ctt att
aag acc aga tac ttc gct aag att tcc gtt aac 1296Val Glu Asn Leu Ile
Lys Thr Arg Tyr Phe Ala Lys Ile Ser Val Asn 420 425 430gtt acc gaa
gtt act gtt agg cca gct gag att aac gtt tat gtg gga 1344Val Thr Glu
Val Thr Val Arg Pro Ala Glu Ile Asn Val Tyr Val Gly 435 440 445gga
caa aag gct gga tct ctc att gtg atg aag ttg cca gct gag gga 1392Gly
Gln Lys Ala Gly Ser Leu Ile Val Met Lys Leu Pro Ala Glu Gly 450 455
460act gtt aac ggt gga ttc acc att gat aac ccc atg caa tcc att ctt
1440Thr Val Asn Gly Gly Phe Thr Ile Asp Asn Pro Met Gln Ser Ile
Leu465 470 475 480cat ggt gga ctt agg aac gct gtt cag gct ttc act
gag gat att gag 1488His Gly Gly Leu Arg Asn Ala Val Gln Ala Phe Thr
Glu Asp Ile Glu 485 490 495gtg gag att ctc tct aag gat gga cag gct
att cca ctt gag act gtg 1536Val Glu Ile Leu Ser Lys Asp Gly Gln Ala
Ile Pro Leu Glu Thr Val 500 505 510cca tct ctt agc att gat ctt gag
gtt gca aac gtt act ctt cca tct 1584Pro Ser Leu Ser Ile Asp Leu Glu
Val Ala Asn Val Thr Leu Pro Ser 515 520 525gct ctt gat cag ctt cca
aag tac gga caa aga tct aga cat agg gct 1632Ala Leu Asp Gln Leu Pro
Lys Tyr Gly Gln Arg Ser Arg His Arg Ala 530 535 540aag gct gct caa
aga gga cat aga ttc gct gtt cca cac att cca cca 1680Lys Ala Ala Gln
Arg Gly His Arg Phe Ala Val Pro His Ile Pro Pro545 550 555 560ctt
tga 1686Leu 22561PRTArtificial SequenceAmino acid sequence encoded
by the synthetic sequence encoding the polyphenol oxidase from T.
reesei 22Met Leu Leu Ser Ala Ser Leu Ser Ala Leu Ala Leu Ala Thr
Val Ser1 5 10 15 Leu Ala Gln Gly Thr Thr His Ile Pro Val Thr Gly
Val Pro Val Ser 20 25 30 Pro Gly Ala Ala Val Pro Leu Arg Gln Asn
Ile Asn Asp Leu Ala Lys 35 40 45 Ser Gly Pro Gln Trp Asp Leu Tyr
Val Gln Ala Met Tyr Asn Met Ser 50 55 60 Lys Met Asp Ser His Asp
Pro Tyr Ser Phe Phe Gln Ile Ala Gly Ile65 70 75 80 His Gly Ala Pro
Tyr Ile Glu Tyr Asn Lys Ala Gly Ala Lys Ser Gly 85 90 95 Asp Gly
Trp Leu Gly Tyr Cys Pro His Gly Glu Asp Leu Phe Ile Ser 100 105 110
Trp His Arg Pro Tyr Val Leu Leu Phe Glu Gln Ala Leu Val Ser Val 115
120 125 Ala Lys Gly Ile Ala Asn Ser Tyr Pro Pro Ser Val Arg Ala Lys
Tyr 130 135 140 Gln Ala Ala Ala Ala Ser Leu Arg Ala Pro Tyr Trp Asp
Trp Ala Ala145 150 155 160 Asp Ser Ser Val Pro Ala Val Thr Val Pro
Gln Thr Leu Lys Ile Asn 165 170 175 Val Pro Ser Gly Ser Ser Thr Lys
Thr Val Asp Tyr Thr Asn Pro Leu 180 185 190 Lys Thr Tyr Tyr Phe Pro
Arg Met Ser Leu Thr Gly Ser Tyr Gly Glu 195 200 205 Phe Thr Gly Gly
Gly Asn Asp His Thr Val Arg Cys Ala Ala Ser Lys 210 215 220 Gln Ser
Tyr Pro Ala Thr Ala Asn Ser Asn Leu Ala Ala Arg Pro Tyr225 230 235
240 Lys Ser Trp Ile Tyr Asp Val Leu Thr Asn Ser Gln Asn Phe Ala Asp
245 250 255 Phe Ala Ser Thr Ser Gly Pro Gly Ile Asn Val Glu Gln Ile
His Asn 260 265 270 Ala Ile His Trp Asp Gly Ala Cys Gly Ser Gln Phe
Leu Ala Pro Asp 275 280 285 Tyr Ser Gly Phe Asp Pro Leu Phe Phe Met
His His Ala Gln Val Asp 290 295 300 Arg Met Trp Ala Phe Trp Glu Ala
Ile Met Pro Ser Ser Pro Leu Phe305 310 315 320 Thr Ala Ser Tyr Lys
Gly Gln Ser Arg Phe Asn Ser Lys Ser Gly Ser 325 330 335 Thr Ile Thr
Pro Asp Ser Pro Leu Gln Pro Phe Tyr Gln Ala Asn Gly 340 345 350 Lys
Phe His Thr Ser Asn Thr Val Lys Ser Ile Gln Gly Met Gly Tyr 355 360
365 Ser Tyr Gln Gly Ile Glu Tyr Trp Gln Lys Ser Gln Ala Gln Ile Lys
370 375 380 Ser Ser Val Thr Thr Ile Ile Asn Gln Leu Tyr Gly Pro Asn
Ser Gly385 390 395 400 Lys Lys Arg Asn Ala Pro Arg Asp Phe Leu Ser
Asp Ile Val Thr Asp 405 410 415 Val Glu Asn Leu Ile Lys Thr Arg Tyr
Phe Ala Lys Ile Ser Val Asn 420 425 430 Val Thr Glu Val Thr Val Arg
Pro Ala Glu Ile Asn Val Tyr Val Gly 435 440 445 Gly Gln Lys Ala Gly
Ser Leu Ile Val Met Lys Leu Pro Ala Glu Gly 450 455 460 Thr Val Asn
Gly Gly Phe Thr Ile Asp Asn Pro Met Gln Ser Ile Leu465 470 475 480
His Gly Gly Leu Arg Asn Ala Val Gln Ala Phe Thr Glu Asp Ile Glu 485
490 495 Val Glu Ile Leu Ser Lys Asp Gly Gln Ala Ile Pro Leu Glu Thr
Val 500 505 510 Pro Ser Leu Ser Ile Asp Leu Glu Val Ala Asn Val Thr
Leu Pro Ser 515 520 525 Ala Leu Asp Gln Leu Pro Lys Tyr Gly Gln Arg
Ser Arg His Arg Ala 530 535 540 Lys Ala Ala Gln Arg Gly His Arg Phe
Ala Val Pro His Ile Pro Pro545 550 555 560 Leu2310PRTArtificial
SequenceN-terminal sequence from trypsin fragments of the 50 kDa
protein from ATX21995 23Gly Thr Trp Ser Ile Ala Ala Gly Ser Arg1 5
10 2412PRTArtificial SequenceN-terminal sequence from trypsin
fragments of the 50 kDa protein from ATX21995 24Asp Ser Thr Gly Glu
Phe Asn Ala Thr Leu Tyr Arg1 5 10 259PRTArtificial
SequenceN-terminal sequence from trypsin fragments of the 50 kDa
protein from ATX21995 25Ser Ala Pro Tyr Ala Ile Thr Gly Ile1 5
2614PRTArtificial SequenceN-terminal sequence from trypsin
fragments of the 50 kDa protein from ATX21995 26Tyr Pro Asp Ala Trp
Phe Asn Ala Gln Ser Ala Gln Leu Arg1 5 10 2710PRTArtificial
SequenceN-terminal sequence from trypsin fragments of the 50 kDa
protein from ATX21995 27Phe Gly Ser Ser Tyr Pro Glu Leu Gln Pro1 5
10 2830PRTArtificial SequenceN-terminal sequences of ATX20514
toxins 28Ser Thr Ser Arg Gln Asp Val Ala Lys Leu Gly Pro Gly Trp
Asn Lys1 5 10 15 Val Leu Leu Asn Tyr Ala Leu Ala Met Gln Ala Leu
Asp Glu 20 25 30 2930PRTArtificial SequenceN-terminal sequences of
ATX20514 toxins 29Ser Thr Ser Gly Gln Asp Val Ala Lys Leu Gly Pro
Gln Trp Asn Lys1 5 10 15 Val Leu Leu Asn Tyr Ala Leu Ala Met Gln
Ala Leu Asp Glu 20 25 30 304PRTArtificial Sequencetargeting peptide
30Lys Asp Glu Leu1 31685PRTNeurospora crassa 31Met Ser Thr Asp Ile
Lys Phe Ala Ile Thr Gly Val Pro Thr Pro Pro1 5 10 15 Ser Ser Asn
Gly Ala Val Pro Leu Arg Arg Glu Leu Arg Asp Leu Gln 20 25 30 Gln
Asn Tyr Pro Glu Gln Phe Asn Leu Tyr Leu Leu Gly Leu Arg Asp 35 40
45 Phe Gln Gly Leu Asp Glu Ala Lys Leu Asp Ser Tyr Tyr Gln Val Ala
50 55 60 Gly Ile His Gly Met Pro Phe Lys Pro Trp Ala Gly Val Pro
Ser Asp65 70 75 80 Thr Asp Trp Ser Gln Pro Gly Ser Ser Gly Phe Gly
Gly Tyr Cys Thr 85 90 95 His Ser Ser Ile Leu Phe Ile Thr Trp His
Arg Pro Tyr Leu Ala Leu 100 105 110 Tyr Glu Gln Ala Leu Tyr Ala Ser
Val Gln Ala Val Ala Gln Lys Phe 115 120 125 Pro Val Glu Gly Gly Leu
Arg Ala Lys Tyr Val Ala Ala Ala Lys Asp 130 135 140 Phe Arg Ala Pro
Tyr Phe Asp Trp Ala Ser Gln Pro Pro Lys Gly Thr145 150 155 160 Leu
Ala Phe Pro Glu Ser Leu Ser Ser Arg Thr Ile Gln Val Val Asp 165 170
175 Val Asp Gly Lys Thr Lys Ser Ile Asn Asn Pro Leu His Arg Phe Thr
180 185 190 Phe His Pro Val Asn Pro Ser Pro Gly Asp Phe Ser Ala Ala
Trp Ser 195 200 205 Arg Tyr Pro Ser Thr Val Arg Tyr Pro Asn Arg Leu
Thr Gly Ala Ser 210 215 220 Arg Asp Glu Arg Ile Ala Pro Ile Leu Ala
Asn Glu Leu Ala Ser Leu225 230 235 240 Arg Asn Asn Val Ser Leu Leu
Leu Leu Ser Tyr Lys Asp Phe Asp Ala 245 250 255 Phe Ser Tyr Asn Arg
Trp Asp Pro Asn Thr Asn Pro Gly Asp Phe Gly 260 265 270 Ser Leu Glu
Asp Val His Asn Glu Ile His Asp Arg Thr Gly Gly Asn 275 280 285 Gly
His Met Ser Ser Leu Glu Val Ser Ala Phe Asp Pro Leu Phe Trp 290 295
300 Leu His His Val Asn Val Asp Arg Leu Trp Ser Ile Trp Gln Asp
Leu305 310 315 320 Asn Pro Asn Ser Phe Met Thr Pro Arg Pro Ala Pro
Tyr Ser Thr Phe 325 330 335 Val Ala Gln Glu Gly Glu Ser Gln Ser Lys
Ser Thr Pro Leu Glu Pro 340 345 350 Phe Trp Asp Lys Ser Ala Ala Asn
Phe Trp Thr Ser Glu Gln Val Lys 355 360 365 Asp Ser Ile Thr Phe Gly
Tyr Ala Tyr Pro Glu Thr Gln Lys Trp Lys 370 375 380 Tyr Ser Ser Val
Lys Glu Tyr Gln Ala Ala Ile Arg Lys Ser Val Thr385 390 395 400 Ala
Leu Tyr Gly Ser Asn Val Phe Ala Asn Phe Val Glu Asn Val Ala 405 410
415 Asp Arg Thr Pro Ala Leu Lys Lys Pro Gln Ala Thr Gly Glu Glu Ser
420 425 430 Lys Ser Thr Val Ser Ala Ala Ala Ala His Ala Val Glu Leu
Ser Gly 435 440 445 Ala Lys Lys Val Ala Glu Lys Val His Asn Val Phe
Gln His Ala Glu 450 455 460 Glu Lys Ala Gln Lys Pro Val Val Pro Val
Lys Asp Thr Lys Ala Glu465 470 475 480 Ser Ser Thr Ala Ala Gly Met
Met Ile Gly Leu Ser Ile Lys Arg Pro 485 490 495 Ser Lys Leu Thr Ala
Ser Pro Gly Pro Ile Pro Glu Ser Leu Lys Tyr 500 505 510 Leu Ala Pro
Asp Gly Lys Tyr Thr Asp Trp Ile Val Asn Val Arg Ala 515 520 525 Gln
Lys His Gly Leu Gly Gln Ser Phe Arg Val Ile Val Phe Leu Gly 530 535
540 Glu Phe Asn Pro Asp Pro Glu Thr Trp Asp Asp Glu Phe Asn Cys
Val545 550 555 560 Gly Arg Val Ser Val Leu Gly Arg Ser Ala Glu Thr
Gln Cys Gly Lys 565 570 575 Cys Arg Lys Asp Asn Ala Asn Gly Leu Ile
Val Ser Gly Thr Val Pro 580 585 590 Leu Thr Ser Ala Leu Leu Gln Asp
Ile Val Gly Gly Glu Leu Gln Ser 595 600 605 Leu Lys Pro Glu Asp Val
Ile Pro His Leu Arg Ala Asn Leu Lys Trp 610 615 620 Lys Val Ala Leu
Phe Asn Gly Asp Glu Tyr Asn Leu Glu Glu Val Pro625 630 635 640 Asp
Leu Lys Val Ser Val Ala Ser Thr Glu Val Thr Ile Asp Glu Glu 645 650
655 Gly Leu Pro His Tyr Ser Arg Gln Tyr Thr Val Tyr Pro Glu Ile Thr
660 665 670 Glu Gly Lys Pro Cys Gly His Gly Pro Glu Asp His Ile 675
680 685 32579PRTPyrenophora triticirepentis 32Met Val Asn Asp Thr
Gln Ala Phe Gln Gln Gly Ala Leu Ser Asn Ala1 5 10 15 Leu Thr Gly
Asn Val Phe Val Arg Arg Glu Val Arg Asp Leu Gln Ala 20 25 30 Asn
Phe Pro Asp Gln Trp Thr Leu Tyr Ile Leu Ala Leu Asn Lys Leu 35 40
45 His Asn Ala Asn Gln Ser Asp Ala Tyr Ser Phe Tyr Gly Ile Ala Ser
50 55 60 Ile His Gly Arg Pro Phe Gln Thr Trp Gly Asp Ala Pro Gly
Leu Pro65 70 75 80 Tyr Lys Gln Gly Met Thr Gly Tyr Cys Pro His Gly
Asn Glu Leu Phe 85 90 95 Met Gly Trp His Arg Pro Tyr Leu Ala Leu
Phe Glu Gln Val Val Ser 100 105 110 Asp Tyr Val His Asp Ile Ala Thr
Gln Ala Pro Thr Asp Lys Val Glu 115 120 125 Arg Tyr Leu Ala Ala Ala
Asn Glu Phe Arg Ile Pro Tyr Trp Asp Trp 130 135 140 Ala Gln Gly Thr
Asn Ser Gly Pro Val Pro Glu Phe Phe Thr Asn Pro145 150 155 160 Met
Leu Thr Val Thr Asn Thr Asp Gly Val Ser Thr Pro Met Ser Asn 165 170
175 Pro Leu Tyr Ser Tyr Gln Phe Asn Pro Ile Ser Asp Arg Phe Asp Glu
180 185 190 Lys Trp Arg Asn Ile Asn Ala Thr Ile Arg Trp Pro Asn Thr
Asp Asp 195 200 205 Ala Thr Ala His Ser Gln Asn Gly Met Phe Ser Asp
Ala Phe Ala Gly 210 215 220 Gln Ser Val Asn Ile Val Ala Gln Ile Gly
Val Val Phe Arg Ser Ser225 230 235 240 Thr Phe Ser Arg Phe Ser Thr
Thr Leu Glu Asp Pro His Gly Trp Ile 245 250 255 His Gly Ile Ile Gly
Gly Gly Tyr Thr Ala Asp Ala Pro Tyr Lys Gly 260 265 270 His Met Trp
Pro Leu Glu Tyr Ser Ala Phe Glu Pro Leu Phe Met Leu 275 280 285 His
His Ala Asn Val Asp Arg Leu Leu Ala Leu Tyr Gln Ala Ala His 290 295
300 Pro Asp Arg Trp Met Glu Ser Ser Asn Ile Gly Pro His Gly Asn
Val305 310 315 320 Tyr Leu Glu Asp Tyr Gln Glu Val Asn Gly Asp Thr
Ser Leu Leu Pro 325 330 335 Phe Arg Lys Thr Pro Gly Glu Phe Trp Thr
Pro Asn Ala Cys Arg Asn 340 345 350 Thr Thr Val Leu Gly Tyr Ala Tyr
Pro Glu Thr Gln Arg Trp Gln Tyr 355 360 365 Pro Ser Asp Asp Ser Tyr
Gln Asn Ala Val Asn Ser Val Ile Ser Thr 370 375 380 Leu Tyr Gly Gly
Gln Thr Arg Ser Gln Leu Thr Ser Ala Ile Glu Thr385 390 395 400 Gly
Ser Gly Glu Arg Leu Leu Lys Asn Gly Asn Ser Phe Thr Asp Trp 405 410
415 Thr Ile Asn Thr Gln Ala Ile Ala Ser Lys Leu Pro Ser Thr Phe Ile
420 425 430 Val Lys Phe Ser Phe Val Gly Ile Phe Gln Ser Asp Pro Ser
Val Asp 435 440 445 Ala Gly Ser Trp Met Met Leu Met Pro Asp Asn Lys
Gln Asn Met His 450 455 460 Thr Leu Gln Val Arg Thr Glu Ser Glu Lys
Val Leu Tyr Gly Thr Thr465 470 475 480 Ser Ile Thr Ala His Leu Ile
Asp Leu Val Asn Ala Gly Lys Leu Asn 485 490 495 Ser Ile Ser Ser Asp
Asp Val Val Pro Tyr Leu Arg Asp Thr Leu Thr 500 505 510 Trp Asn Ile
Phe Thr Asp Asn Gly Thr Arg Ile Ala Gln Pro Asn Gly 515 520 525 Ala
Leu Thr Val Gln Val Thr Ser Thr Glu Ala Tyr Val Pro Glu Asp 530 535
540 Arg Ser Ala Pro Ile Gln Tyr Ser Glu Asn Ile Thr Glu His Pro
Glu545 550 555 560 Ile Thr Ala Asn Lys Phe Gly Gly Thr Ser Ser Thr
Ser Pro Ala Met 565 570 575 Met Phe Leu33574PRTPodospora anserina
33Met Ser Thr Thr Gly Asn Ile Ala Ile Thr Gly Ile Pro Thr Thr Ala1
5 10 15 Gly Pro Asp Gly Ser Phe Pro Leu Arg Arg Glu
Leu Arg Asp Leu Gln 20 25 30 Arg Asn Tyr Pro Asp His Phe Asn Leu
Leu Val Leu Ala Leu Lys Asp 35 40 45 Phe Gln Ala Leu Asn Glu Ser
Val Gln Thr Ser Tyr Tyr Gln Ile Ala 50 55 60 Gly Ile His Gly Leu
Pro Tyr Lys Pro Trp Asn Asn Val Gly Ser Asn65 70 75 80 Ser Asp Trp
Gln Ser Thr Ser Gly Phe Gly Gly Tyr Cys Thr His Ser 85 90 95 Ser
Ile Leu Phe Leu Thr Trp His Arg Pro Tyr Leu Ala Leu Phe Glu 100 105
110 Gln Ala Leu Tyr Asn Ser Ile Gln Lys Ile Ala Asn Gln Phe Pro Gln
115 120 125 Gly Pro Leu Arg Thr Lys Tyr Val Glu Ala Ala Lys Thr Phe
Arg Met 130 135 140 Pro Tyr Phe Asp Trp Ala Ser Gln Pro Pro Ser Gly
Ser Ser Ala Phe145 150 155 160 Pro Ser Ala Phe Thr Ala Pro Ser Leu
Gln Val Val Asp Val Asp Gly 165 170 175 Lys Thr Lys Ser Thr Ala Asn
Pro Ile Tyr Arg Phe Val Phe His Pro 180 185 190 Val Asn Pro Ser Pro
Gly Asp Phe Pro Arg Gln Trp Ser Arg Phe Pro 195 200 205 Thr Thr Val
Arg Tyr Pro Asn Pro Arg Thr Gly Gln Ser Gln Asp Asn 210 215 220 Arg
Val Ala Pro Ile Leu Ala Asn Glu Leu Ala Ser Leu Arg Thr Asn225 230
235 240 Val Ser Leu Leu Leu Leu Ser Tyr Thr Asn Phe Asp Ala Phe Ser
Phe 245 250 255 Asn Arg Trp Asp Pro Asn Met Thr Pro Gly Glu Phe Gly
Ser Leu Glu 260 265 270 Asp Val His Asn Glu Ile His Asp Arg Thr Gly
Gly Gly Gly His Met 275 280 285 Ser Ser Leu Asp Val Ser Ser Phe Asp
Pro Leu Phe Trp Phe His His 290 295 300 Thr Asn Val Asp Arg Leu Trp
Ala Ile Trp Gln Asp Leu Asn Pro Asp305 310 315 320 Asn Phe Leu Thr
Pro Arg Pro Ala Pro Tyr Ser Thr Phe Asn Ser Thr 325 330 335 Glu Gly
Glu Ser Gln Thr Lys Asp Thr Pro Leu Thr Pro Phe Trp Asp 340 345 350
Lys Ser Ala Thr Lys Phe Trp Thr Ser Glu Glu Ile Lys Asp Thr Thr 355
360 365 Thr Thr Phe Gly Tyr Ala Tyr Pro Glu Thr Gln Glu Trp Lys Tyr
Arg 370 375 380 Thr Gly Ser Glu Tyr Gln Thr Ser Ile Arg Gln Ala Val
Thr Thr Leu385 390 395 400 Tyr Gly Thr Asn Val Phe Ala Asn Phe Ala
Ala Ala Asn Val Gln Ala 405 410 415 Arg Ala Thr Glu His Thr Glu Leu
Ile Lys Ser Leu Ser Leu Ala Ala 420 425 430 Pro Pro Pro Ser Ala Pro
Ile Thr Ala Glu Lys Pro Leu Leu Ile Thr 435 440 445 Gln Glu Met Lys
Ala Ser Pro Ile Pro Glu His Leu Gln His Leu Ala 450 455 460 Pro Asn
Asn Lys Tyr Pro Glu Trp Val Val Asn Ile Arg Ala Gln Lys465 470 475
480 His Gly Leu His Gly Ala Phe Arg Val Ile Val Phe Leu Gly Pro Ile
485 490 495 Asp Glu Ser Asp Pro Asp Ser Trp Gln Thr Glu Phe Asn Thr
Val Gly 500 505 510 Arg Val Ser Val Leu Gly Arg Ser Thr Gln Gly Pro
Thr Thr Thr Lys 515 520 525 Cys Ala Lys Cys Ile Thr Asp Ala Ala Asp
Glu Leu Met Ile Ser Gly 530 535 540 Thr Val Pro Leu Thr Ser Ala Leu
Leu Gln Asp Ile Val Asn Glu Asn545 550 555 560 Thr Ala Ser Ile Ala
Cys Ser Gln Arg Lys Trp Cys Arg Ile 565 570 34618PRTLentinula
elodes 34Met Ser His Tyr Leu Val Thr Gly Ala Thr Gly Gly Ser Thr
Ser Gly1 5 10 15 Ala Ala Ala Pro Asn Arg Leu Glu Ile Asn Asp Phe
Val Lys Gln Glu 20 25 30 Asp Gln Phe Ser Leu Tyr Ile Gln Ala Leu
Gln Tyr Ile Tyr Ser Ser 35 40 45 Lys Ser Gln Asp Asp Ile Asp Ser
Phe Phe Gln Ile Gly Gly Ile His 50 55 60 Gly Leu Pro Tyr Val Pro
Trp Asp Gly Ala Gly Asn Lys Pro Val Asp65 70 75 80 Thr Asp Ala Trp
Glu Gly Tyr Cys Thr His Gly Ser Val Leu Phe Pro 85 90 95 Thr Phe
His Arg Pro Tyr Val Leu Leu Ile Glu Gln Ala Ile Gln Ala 100 105 110
Ala Ala Val Asp Ile Ala Ala Thr Tyr Ile Val Asp Arg Ala Arg Tyr 115
120 125 Gln Asp Ala Ala Leu Asn Leu Arg Gln Pro Tyr Trp Asp Trp Ala
Arg 130 135 140 Asn Pro Val Pro Pro Pro Glu Val Ile Ser Leu Asp Glu
Val Thr Ile145 150 155 160 Val Asn Pro Ser Gly Glu Lys Ile Ser Val
Pro Asn Pro Leu Arg Arg 165 170 175 Tyr Thr Phe His Pro Ile Asp Pro
Ser Phe Pro Glu Pro Tyr Gln Ser 180 185 190 Trp Ser Thr Thr Leu Arg
His Pro Leu Ser Asp Asp Ala Asn Ala Ser 195 200 205 Asp Asn Val Pro
Glu Leu Lys Ala Thr Leu Arg Ser Ala Gly Pro Gln 210 215 220 Leu Lys
Thr Lys Thr Tyr Asn Leu Leu Thr Arg Val His Thr Trp Pro225 230 235
240 Ala Phe Ser Asn His Thr Pro Asp Asp Gly Gly Ser Thr Ser Asn Ser
245 250 255 Leu Glu Gly Ile His Asp Ser Val His Val Asp Val Gly Gly
Asn Gly 260 265 270 Gln Met Ser Asp Pro Ser Val Ala Gly Phe Asp Pro
Ile Phe Phe Met 275 280 285 His His Ala Gln Val Asp Arg Leu Leu Ser
Leu Trp Ser Ala Leu Asn 290 295 300 Pro Arg Val Trp Ile Thr Asp Gly
Pro Ser Gly Asp Gly Thr Trp Thr305 310 315 320 Ile Pro Pro Asp Thr
Val Val Gly Lys Asp Thr Asp Leu Thr Pro Phe 325 330 335 Trp Asn Thr
Gln Ser Ser Tyr Trp Ile Ser Ala Asn Val Thr Asp Thr 340 345 350 Ser
Lys Met Gly Tyr Thr Tyr Pro Glu Phe Asn Asn Leu Asp Met Gly 355 360
365 Asn Glu Val Ala Val Arg Ser Ala Ile Ala Ala Gln Val Asn Lys Leu
370 375 380 Tyr Gly Gly Pro Phe Thr Lys Phe Ala Ala Ala Ile Gln Gln
Pro Ser385 390 395 400 Ser Gln Thr Thr Ala Asp Ala Ser Thr Ile Gly
Asn Val Thr Ser Asp 405 410 415 Ala Ser Ser His Leu Val Asp Ser Lys
Ile Asn Pro Thr Pro Asn Arg 420 425 430 Ser Ile Asp Asp Ala Pro Gln
Val Lys Ile Ala Ser Thr Leu Arg Asn 435 440 445 Asn Glu Gln Lys Glu
Phe Trp Glu Trp Thr Ala Arg Val Gln Val Lys 450 455 460 Lys Tyr Glu
Ile Gly Gly Ser Phe Lys Val Leu Phe Phe Leu Gly Ser465 470 475 480
Val Pro Ser Asp Pro Lys Glu Trp Ala Thr Asp Pro His Phe Val Gly 485
490 495 Ala Phe His Gly Phe Val Asn Ser Ser Ala Glu Arg Cys Ala Asn
Cys 500 505 510 Arg Arg Gln Gln Asp Val Val Leu Glu Gly Phe Val His
Leu Asn Glu 515 520 525 Gly Ile Ala Asn Ile Ser Asn Leu Asn Ser Phe
Asp Pro Ile Val Val 530 535 540 Glu Pro Tyr Leu Lys Glu Asn Leu His
Trp Arg Val Gln Lys Val Ser545 550 555 560 Gly Glu Val Val Asn Leu
Asp Ala Ala Thr Ser Leu Glu Val Val Val 565 570 575 Val Ala Thr Arg
Leu Glu Leu Pro Pro Gly Glu Ile Phe Pro Val Pro 580 585 590 Ala Glu
Thr His His His His His Ile Thr His Gly Arg Pro Gly Gly 595 600 605
Ser Arg His Ser Val Ala Ser Ser Ser Ser 610 615 35618PRTPycnoporus
sanguineus 35Met Ser His Phe Ile Val Thr Gly Pro Val Gly Gly Gln
Thr Glu Gly1 5 10 15 Ala Pro Ala Pro Asn Arg Leu Glu Ile Asn Asp
Phe Val Lys Asn Glu 20 25 30 Glu Phe Phe Ser Leu Tyr Val Gln Ala
Leu Asp Ile Met Tyr Gly Leu 35 40 45 Lys Gln Glu Glu Leu Ile Ser
Phe Phe Gln Ile Gly Gly Ile His Gly 50 55 60 Leu Pro Tyr Val Ala
Trp Ser Asp Ala Gly Ala Asp Asp Pro Ala Glu65 70 75 80 Pro Ser Gly
Tyr Cys Thr His Gly Ser Val Leu Phe Pro Thr Trp His 85 90 95 Arg
Pro Tyr Val Ala Leu Tyr Glu Gln Ile Leu His Lys Tyr Ala Gly 100 105
110 Glu Ile Ala Asp Lys Tyr Thr Val Asp Lys Pro Arg Trp Gln Lys Ala
115 120 125 Ala Ala Asp Leu Arg Gln Pro Phe Trp Asp Trp Ala Lys Asn
Thr Leu 130 135 140 Pro Pro Pro Glu Val Ile Ser Leu Asp Lys Val Thr
Ile Thr Thr Pro145 150 155 160 Asp Gly Gln Arg Thr Gln Val Asp Asn
Pro Leu Arg Arg Tyr Arg Phe 165 170 175 His Pro Ile Asp Pro Ser Phe
Pro Glu Pro Tyr Ser Asn Trp Pro Ala 180 185 190 Thr Leu Arg His Pro
Thr Ser Asp Gly Ser Asp Ala Lys Asp Asn Val 195 200 205 Lys Asp Leu
Thr Thr Thr Leu Lys Ala Asp Gln Pro Asp Ile Thr Thr 210 215 220 Lys
Thr Tyr Asn Leu Leu Thr Arg Val His Thr Trp Pro Ala Phe Ser225 230
235 240 Asn His Thr Pro Gly Asp Gly Gly Ser Ser Ser Asn Ser Leu Glu
Ala 245 250 255 Ile His Asp His Ile His Asp Ser Val Gly Gly Gly Gly
Gln Met Gly 260 265 270 Asp Pro Ser Val Ala Gly Phe Asp Pro Ile Phe
Phe Leu His His Cys 275 280 285 Gln Val Asp Arg Leu Leu Ala Leu Trp
Ser Ala Leu Asn Pro Gly Val 290 295 300 Trp Val Asn Ser Ser Ser Ser
Glu Asp Gly Thr Tyr Thr Ile Pro Pro305 310 315 320 Asp Ser Thr Val
Asp Gln Thr Thr Ala Leu Thr Pro Phe Trp Asp Thr 325 330 335 Gln Ser
Thr Phe Trp Thr Ser Phe Gln Ser Ala Gly Val Ser Pro Ser 340 345 350
Gln Phe Gly Tyr Ser Tyr Pro Glu Phe Asn Gly Leu Asn Leu Gln Asp 355
360 365 Gln Lys Ala Val Lys Asp His Ile Ala Glu Val Val Asn Glu Leu
Tyr 370 375 380 Gly His Arg Met Arg Lys Thr Phe Pro Phe Pro Gln Leu
Gln Ala Val385 390 395 400 Ser Val Ala Lys Gln Gly Asp Ala Val Thr
Pro Ser Val Ala Thr Asp 405 410 415 Ser Val Ser Ser Ser Thr Thr Pro
Ala Glu Asn Pro Ala Ser Arg Glu 420 425 430 Asp Ala Ser Asp Lys Asp
Thr Glu Pro Thr Leu Asn Val Glu Val Ala 435 440 445 Ala Pro Gly Ala
His Leu Thr Ser Thr Lys Tyr Trp Asp Trp Thr Ala 450 455 460 Arg Ile
His Val Lys Lys Tyr Glu Val Gly Gly Ser Phe Ser Val Leu465 470 475
480 Leu Phe Leu Gly Ala Ile Pro Glu Asn Pro Ala Asp Trp Arg Thr Ser
485 490 495 Pro Asn Tyr Val Gly Gly His His Ala Phe Val Asn Ser Ser
Pro Gln 500 505 510 Arg Cys Ala Asn Cys Arg Gly Gln Gly Asp Leu Val
Ile Glu Gly Phe 515 520 525 Val His Leu Asn Glu Ala Ile Ala Arg His
Ala His Leu Asp Ser Phe 530 535 540 Asp Pro Thr Val Val Arg Pro Tyr
Leu Thr Arg Glu Leu His Trp Gly545 550 555 560 Val Met Lys Val Asn
Gly Thr Val Val Pro Leu Gln Asp Val Pro Ser 565 570 575 Leu Glu Val
Val Val Leu Ser Thr Pro Leu Thr Leu Pro Pro Gly Glu 580 585 590 Pro
Phe Pro Val Pro Gly Thr Pro Val Asn His His Asp Ile Thr His 595 600
605 Gly Arg Pro Gly Gly Ser His His Thr His 610 615 36625PRTPholio
nameka 36Met Ser Arg Val Val Ile Thr Gly Val Ser Gly Thr Val Ala
Asn Arg1 5 10 15 Leu Glu Ile Asn Asp Phe Val Lys Asn Asp Lys Phe
Phe Ser Leu Tyr 20 25 30 Ile Gln Ala Leu Gln Val Met Ser Ser Val
Pro Pro Gln Glu Asn Val 35 40 45 Arg Ser Phe Phe Gln Ile Gly Gly
Ile His Gly Leu Pro Tyr Thr Pro 50 55 60 Trp Asp Gly Ile Thr Gly
Asp Gln Pro Phe Asp Pro Asn Thr Gln Trp65 70 75 80 Gly Gly Tyr Cys
Thr His Gly Ser Val Leu Phe Pro Thr Trp His Arg 85 90 95 Pro Tyr
Val Leu Leu Tyr Glu Gln Ile Leu His Lys His Val Gln Asp 100 105 110
Ile Ala Ala Thr Tyr Thr Thr Ser Asp Lys Ala Ala Trp Val Gln Ala 115
120 125 Ala Ala Asn Leu Arg Gln Pro Tyr Trp Asp Trp Ala Ala Asn Ala
Val 130 135 140 Pro Pro Asp Gln Val Ile Ala Ser Lys Lys Val Thr Ile
Thr Gly Ser145 150 155 160 Asn Gly His Lys Val Glu Val Asp Asn Pro
Leu Tyr His Tyr Lys Phe 165 170 175 His Pro Ile Asp Ser Ser Phe Pro
Arg Pro Tyr Ser Glu Trp Pro Thr 180 185 190 Thr Leu Arg Gln Pro Asn
Ser Ser Arg Pro Asn Ala Thr Asp Asn Val 195 200 205 Ala Lys Leu Arg
Asn Val Leu Arg Ala Ser Gln Glu Asn Ile Thr Ser 210 215 220 Asn Thr
Tyr Ser Met Leu Thr Arg Val His Thr Trp Lys Ala Phe Ser225 230 235
240 Asn His Thr Val Gly Asp Gly Gly Ser Thr Ser Asn Ser Leu Glu Ala
245 250 255 Ile His Asp Gly Ile His Val Asp Val Gly Gly Gly Gly His
Met Ala 260 265 270 Asp Pro Ala Val Ala Ala Phe Asp Pro Ile Phe Phe
Leu His His Cys 275 280 285 Asn Val Asp Arg Leu Leu Ser Leu Trp Ala
Ala Ile Asn Pro Gly Val 290 295 300 Trp Val Ser Pro Gly Asp Ser Glu
Asp Gly Thr Phe Ile Leu Pro Pro305 310 315 320 Glu Ala Pro Val Asp
Val Ser Thr Pro Leu Thr Pro Phe Ser Asn Thr 325 330 335 Glu Thr Thr
Phe Trp Ala Ser Gly Gly Ile Thr Asp Thr Thr Lys Leu 340 345 350 Gly
Tyr Thr Tyr Pro Glu Phe Asn Gly Leu Asp Leu Gly Asn Ala Gln 355 360
365 Ala Val Lys Ala Ala Ile Gly Asn Ile Val Asn Arg Leu Tyr Gly Ala
370 375 380 Ser Val Phe Ser Gly Phe Ala Ala Ala Thr Ser Ala Ile Gly
Ala Gly385 390 395 400 Ser Val Ala Ser Leu Ala Ala Asp Val Pro Leu
Glu Lys Ala Pro Ala 405 410 415 Pro Ala Pro Glu Ala Ala Ala Gln Ser
Pro Val Pro Ala Pro Ala His 420 425 430 Val Glu Pro Ala Val Arg Ala
Val Ser Val His Ala Ala Ala Ala Gln 435 440 445 Pro His Ala Glu Pro
Pro Val His Val Ser Ala Gly Gly His Pro Ser 450 455 460 Pro His Gly
Phe Tyr Asp Trp Thr Ala Arg Ile Glu Phe Lys Lys Tyr465 470 475 480
Glu Phe Gly Ser Ser Phe Ser Val Leu Leu Phe Leu Gly Pro Val Pro 485
490 495 Glu Asp Pro Glu Gln Trp Leu Val Ser Pro Asn Phe Val Gly Ala
His 500 505 510 His Ala Phe Val Asn Ser Ala Ala Gly
His Cys Ala Asn Cys Arg Asn 515 520 525 Gln Gly Asn Val Val Val Glu
Gly Phe Val His Leu Thr Lys Tyr Ile 530 535 540 Ser Glu His Ala Gly
Leu Arg Ser Leu Asn Pro Glu Val Val Glu Pro545 550 555 560 Tyr Leu
Thr Asn Glu Leu His Trp Arg Val Leu Lys Ala Asp Gly Ser 565 570 575
Val Gly Gln Leu Glu Ser Leu Glu Val Ser Val Tyr Gly Thr Pro Met 580
585 590 Asn Leu Pro Val Gly Ala Met Phe Pro Val Pro Gly Asn Arg Arg
His 595 600 605 Phe His Gly Ile Thr His Gly Arg Val Gly Gly Ser Arg
His Ala Ile 610 615 620 Val625 37630PRTTuber melanosporum 37Met Thr
Met Lys Thr Tyr Pro Ile Thr Gly Val Ala Ser Gln Ala Pro1 5 10 15
Arg Pro Arg Arg Asn Ile Asn Asp Phe Ala Gln Asp Pro Leu Gln Trp 20
25 30 Asn Leu Phe Leu Gln Ala Leu Ile Asn Leu Gln Ser Gln Gly Glu
Asp 35 40 45 Thr His Ser Pro Leu Gly Tyr Tyr Gln Val Ala Gly Val
His Gly Thr 50 55 60 Pro Tyr Ile Pro Trp Met Glu Lys Ala Asp Ala
Asp Asp Arg Ala Gly65 70 75 80 Asp Tyr Cys Thr His Gly Thr Ala Leu
Phe Ile Thr Trp His Arg Pro 85 90 95 Tyr Leu Leu Leu Phe Glu Gln
Arg Ile Val Glu Glu Ala Leu Thr Ile 100 105 110 Ala Arg Asn Phe Ser
Asp Lys Tyr Arg Ala Glu Tyr Glu Glu Ala Ala 115 120 125 Leu Asn Ile
Arg Ile Pro Tyr Trp Asp Trp Ala Thr Asp Ser Asp Val 130 135 140 Pro
Gln Ser Ile Arg Phe Ala Glu Thr Asp Ile Thr Leu Pro Glu Val145 150
155 160 Gly Ser Asp Ala Pro Pro Val Thr Arg Lys Gly Val Pro Asn Pro
Met 165 170 175 Tyr Ser Tyr Lys Phe Lys Thr Ser Ile Arg Arg Gln Arg
Asp Phe Ser 180 185 190 Ile Val Gly Val Gln Glu Met Val Ala Trp Glu
Glu Thr Lys Arg Cys 195 200 205 Pro Asp Glu Lys Gly Ile Ser His Pro
Glu Ile Val Asp Arg Gln Leu 210 215 220 Arg Ile Pro Thr Val Asn Pro
Thr Ala Gly Ser Ser Phe Arg Asp Pro225 230 235 240 Ile Tyr Lys Leu
Leu Thr Leu Val Gly Ser Tyr Gly Ala Phe Gly Asn 245 250 255 Thr Gly
Trp Gln Thr Gly Arg Pro Gly Pro Asn Asn Ile Ser Leu Glu 260 265 270
His Tyr His Asn Ile Ile His Thr Phe Thr Gly Thr Asn Tyr Ile Glu 275
280 285 Glu Asn Ser Lys Glu Gly His Met Ser Glu Val Gly Val Ser Ala
Phe 290 295 300 Asp Pro Ile Phe Trp Leu His His Cys Asn Val Asp Arg
Leu Tyr Ala305 310 315 320 Ile Trp Gln Ala Ile His Tyr Glu Ala Pro
Phe Glu Asp Gln Ala Thr 325 330 335 Asp Tyr Thr Arg Met Pro Leu Thr
Lys Ala Ile Asp Asp Ala Glu Thr 340 345 350 Thr Leu Arg Pro Phe Tyr
Lys Asp Glu Cys Tyr Asp Val Pro Trp Thr 355 360 365 Ser Ser Met Val
Gln Lys Ser Ser Ala Ala Thr Gly Pro Thr Val Phe 370 375 380 Asp Tyr
Asn Tyr His Tyr Pro Glu Leu Pro Val Asp Leu Ser Gly Pro385 390 395
400 Gly Lys Gln Lys Glu Met Ala Ser His Val Leu Arg Arg Val His Gln
405 410 415 Leu Tyr Gly Pro Pro Thr Asp Glu Ser Leu Val Asp Thr Pro
Lys Val 420 425 430 Pro Asn Ala Leu Leu Pro Pro Ser Arg Ile Val Arg
Asp Gly Met Phe 435 440 445 Arg Arg Glu Trp Leu Ile Phe Leu Arg Val
Arg Lys Tyr Leu Ile Pro 450 455 460 Gly Asn Phe Ile Ile Phe Phe Phe
Leu Gly Glu Pro Gly Asp Asp Pro465 470 475 480 Arg Gln Trp Leu Leu
Ser Glu Asn His Val Gly Ala Val Asn Thr Phe 485 490 495 Lys Ser Ser
Thr Asp Ile Cys Gly Asn Cys Ala Gly Gln Gly Ala Ala 500 505 510 Asp
Gln Leu Phe Ser Gly Gly Val Asp Ile Thr Asn Ala Leu Tyr Asn 515 520
525 Lys Leu Ala Asn Ile Gly Leu Thr Leu Asp Asp Gln Asp Glu Ile Glu
530 535 540 Glu Trp Leu Ala Lys Asn Leu Lys Trp Arg Ile Leu Lys Gln
Asn Asp545 550 555 560 Lys Thr Glu Leu Thr Ser His Glu Ile Leu Glu
Asn Pro Asp Ser Leu 565 570 575 Phe Ile Gly Val Lys Ser Phe Val Leu
Leu Tyr Pro Thr Ser Arg Leu 580 585 590 Pro Ile Asp Gly Gly Glu Phe
Leu Ser Ala Pro Lys Ile Ile Asn Glu 595 600 605 Lys Ile His Phe Gly
Ala Thr Glu Pro His Lys Asn Arg Gly Gly Leu 610 615 620 Gly Ala Gln
Asp Pro Tyr625 630 38616PRTAspergillus fumigatus 38Met Ser Ser Asn
Lys Pro Tyr Val Ile Lys Gly Ile Pro Val Asp Ala1 5 10 15 Gly Gln
Ile Ile Pro Val Arg Arg Asp Ile Asp Glu Trp Tyr Glu Asp 20 25 30
Thr Ser Arg Gln Ser Arg Ile Gln Leu Ser Ile Phe Ile Trp Ala Leu 35
40 45 Arg Glu Phe Gln Ser Ile Asp Tyr Lys Asp Arg Leu Ser Tyr Phe
Gln 50 55 60 Ile Ala Gly Ile His His Phe Pro Leu Ile Thr Trp Asp
Glu Glu Glu65 70 75 80 Pro Pro Val Pro Asn Lys Pro Gly Tyr Cys Val
His Asn Asn Val Thr 85 90 95 Phe Pro Thr Trp His Arg Pro Tyr Met
Leu Leu Phe Glu Gln Arg Leu 100 105 110 Phe Glu Ile Met Glu Thr Thr
Ile Lys Glu Thr Val Pro Glu Ser His 115 120 125 Lys Gln Glu Trp Arg
Asp Ala Ala Arg Gln Trp Arg Leu Pro Tyr Trp 130 135 140 Asp Phe Ala
Lys Thr Ser Gly Pro His Ala Thr Gly Pro Leu Ser Leu145 150 155 160
Pro Val Leu Cys Gly Leu Ala Asn Val Val Ile Leu Asn Pro Ala Asn 165
170 175 Pro Glu Thr Pro Ile Glu Leu Pro Asn Pro Val Tyr Lys Tyr Arg
Ala 180 185 190 Pro Asp Leu Met Gly Asn Leu Asp Lys Pro Phe His Ile
Pro Pro Glu 195 200 205 Arg Ile Asp Pro Asp Lys Asp Asp Tyr Tyr Pro
Trp Asp Lys Cys Gln 210 215 220 Ala Thr Thr Lys Tyr Gly Leu Leu Lys
Asn Asn Pro His Ile Gln Asp225 230 235 240 Ala Gly Gln Asp Val Thr
Lys Ser Asn Leu Ala Leu Asn Glu His Pro 245 250 255 Trp Tyr Arg Pro
Asn Lys Ala Gly Phe Pro Pro Leu Gln Thr Leu Thr 260 265 270 Tyr Glu
Val His Arg Leu Leu Ser Phe Lys Phe Ser Ser Trp Gly Ala 275 280 285
Phe Ala Ser Thr Lys Trp Cys Asn Glu Glu Asn Lys Pro Pro Ala Ser 290
295 300 Gln Gln Thr Arg Asp Ile Leu Ser Leu Glu Tyr Ile His Asn Asn
Val305 310 315 320 His Pro Asp Leu Gln Gly Ala Gly His Met Ser Ser
Val Pro Val Ala 325 330 335 Ala Phe Asp Pro Ile Phe Trp Leu Tyr His
Asn Asn Val Asp Arg Leu 340 345 350 Thr Ala Ile Trp Gln Val Leu Asn
Gln Asp His Trp Phe Asp Glu Pro 355 360 365 His Pro Ser Asp Ala Lys
Pro Asp Asp Pro Leu Lys Pro Phe His Val 370 375 380 Ser Lys Asp Lys
Tyr Phe Thr Ser Asp Asp Ala Arg Phe Trp Arg Lys385 390 395 400 Tyr
Gly Tyr Asp Tyr Asp Ile Val Lys Lys Pro Gly Thr Asn Glu Asp 405 410
415 Arg Ala Pro Glu Glu Val Lys Met Lys Ile Asn Gln Leu Tyr Gly Glu
420 425 430 Pro Ile Ser Arg Leu His Glu Gly Gln Pro Val Glu Tyr Asp
Tyr Val 435 440 445 Ile Asn Val Ile Tyr Asp Arg Tyr Ala Leu Asp Gly
Ile Pro Tyr Thr 450 455 460 Ile Val Phe Tyr Leu His Leu Lys Asp Gly
Ser Tyr Lys Cys Leu Gly465 470 475 480 Gly Val Tyr Thr Phe Ser Thr
Lys Leu Ser Asp Ala Gln Asp Thr Glu 485 490 495 Arg Gly Gly Cys Asp
Asn Cys Arg Glu Gln Lys Lys Ala Gly Val Leu 500 505 510 Ala Ser Ala
Gln Ile Pro Leu Thr Tyr Thr Leu Tyr Glu Arg Gln Glu 515 520 525 Trp
His Asn Leu Gly Lys Leu Leu Pro Val Lys Glu Thr Ala Asp Ile 530 535
540 Ile Arg Gln His Leu Cys Trp Lys Val Val Gly Val Asn Asn Ser
Ile545 550 555 560 Leu Phe Asp Ser Glu Gln Pro Met Arg Gly Asp Pro
Ala Thr Trp Arg 565 570 575 Ser Leu Asp Val Thr Ala Ala Tyr Ser Glu
Ile His Tyr Pro Val Asp 580 585 590 Arg Asn Tyr Lys Tyr Ile Asp Arg
Gly Leu Pro Ala Tyr His Asn Tyr 595 600 605 Leu Pro Ile His Leu Ser
Pro Thr 610 615 39556PRTAgaricus bisporus 39Met Ser Leu Ile Ala Thr
Val Gly Pro Thr Gly Gly Val Lys Asn Arg1 5 10 15 Leu Asn Ile Val
Asp Phe Val Lys Asn Glu Lys Phe Phe Thr Leu Tyr 20 25 30 Val Arg
Ser Leu Glu Leu Leu Gln Ala Lys Glu Gln His Asp Tyr Ser 35 40 45
Ser Phe Phe Gln Leu Ala Gly Ile His Gly Leu Pro Phe Thr Glu Trp 50
55 60 Ala Lys Glu Arg Pro Ser Met Asn Leu Tyr Lys Ala Gly Tyr Cys
Thr65 70 75 80 His Gly Gln Val Leu Phe Pro Thr Trp His Arg Thr Tyr
Leu Ser Val 85 90 95 Leu Glu Gln Ile Leu Gln Gly Ala Ala Ile Glu
Val Ala Lys Lys Phe 100 105 110 Thr Ser Asn Gln Thr Asp Trp Val Gln
Ala Ala Gln Asp Leu Arg Gln 115 120 125 Pro Tyr Trp Asp Trp Gly Phe
Glu Leu Met Pro Pro Asp Glu Val Ile 130 135 140 Lys Asn Glu Glu Val
Asn Ile Thr Asn Tyr Asp Gly Lys Lys Ile Ser145 150 155 160 Val Lys
Asn Pro Ile Leu Arg Tyr His Phe His Pro Ile Asp Pro Ser 165 170 175
Phe Lys Pro Tyr Gly Asp Phe Ala Thr Trp Arg Thr Thr Val Arg Asn 180
185 190 Pro Asp Arg Asn Arg Arg Glu Asp Ile Pro Gly Leu Ile Lys Lys
Met 195 200 205 Arg Leu Glu Glu Gly Gln Ile Arg Glu Lys Thr Tyr Asn
Met Leu Lys 210 215 220 Phe Asn Asp Ala Trp Glu Arg Phe Ser Asn His
Gly Ile Ser Asp Asp225 230 235 240 Gln His Ala Asn Ser Leu Glu Ser
Val His Asp Asp Ile His Val Met 245 250 255 Val Gly Tyr Gly Lys Ile
Glu Gly His Met Asp His Pro Phe Phe Ala 260 265 270 Ala Phe Asp Pro
Ile Phe Trp Leu His His Thr Asn Val Asp Arg Leu 275 280 285 Leu Ser
Leu Trp Lys Ala Ile Asn Pro Asp Val Trp Val Thr Ser Gly 290 295 300
Arg Asn Arg Asp Gly Thr Met Gly Ile Ala Pro Asn Ala Gln Ile Asn305
310 315 320 Ser Glu Thr Pro Leu Glu Pro Phe Tyr Gln Ser Gly Asp Lys
Val Trp 325 330 335 Thr Ser Ala Ser Leu Ala Asp Thr Ala Arg Leu Gly
Tyr Ser Tyr Pro 340 345 350 Asp Phe Asp Lys Leu Val Gly Gly Thr Lys
Glu Leu Ile Arg Asp Ala 355 360 365 Ile Asp Asp Leu Ile Asp Glu Arg
Tyr Gly Ser Lys Pro Ser Ser Gly 370 375 380 Ala Arg Asn Thr Ala Phe
Asp Leu Leu Ala Asp Phe Lys Gly Ile Thr385 390 395 400 Lys Glu His
Lys Glu Asp Leu Lys Met Tyr Asp Trp Thr Ile His Val 405 410 415 Ala
Phe Lys Lys Phe Glu Leu Lys Glu Ser Phe Ser Leu Leu Phe Tyr 420 425
430 Phe Ala Ser Asp Gly Gly Asp Tyr Asp Gln Glu Asn Cys Phe Val Gly
435 440 445 Ser Ile Asn Ala Phe Arg Gly Thr Ala Pro Glu Thr Cys Ala
Asn Cys 450 455 460 Gln Asp Asn Glu Asn Leu Ile Gln Glu Gly Phe Ile
His Leu Asn His465 470 475 480 Tyr Leu Ala Arg Asp Leu Glu Ser Phe
Glu Pro Gln Asp Val His Lys 485 490 495 Phe Leu Lys Glu Lys Gly Leu
Ser Tyr Lys Leu Tyr Ser Arg Gly Asp 500 505 510 Lys Pro Leu Thr Ser
Leu Ser Val Lys Ile Glu Gly Arg Pro Leu His 515 520 525 Leu Pro Pro
Gly Glu His Arg Pro Lys Tyr Asp His Thr Gln Ala Arg 530 535 540 Val
Val Phe Asp Asp Val Ala Val His Val Ile Asn545 550 555
40273PRTStreptomyces castaneglobisporus 40Met Thr Val Arg Lys Asn
Gln Ala Thr Leu Thr Ala Asp Glu Lys Arg1 5 10 15 Arg Phe Val Ala
Ala Val Leu Glu Leu Lys Arg Ser Gly Arg Tyr Asp 20 25 30 Glu Phe
Val Arg Thr His Asn Glu Phe Ile Met Ser Asp Thr Asp Ser 35 40 45
Gly Glu Arg Thr Gly His Arg Ser Pro Ser Phe Leu Pro Trp His Arg 50
55 60 Arg Phe Leu Leu Asp Phe Glu Gln Ala Leu Gln Ser Val Asp Ser
Ser65 70 75 80 Val Thr Leu Pro Tyr Trp Asp Trp Ser Ala Asp Arg Thr
Val Arg Ala 85 90 95 Ser Leu Trp Ala Pro Asp Phe Leu Gly Gly Thr
Gly Arg Ser Thr Asp 100 105 110 Gly Arg Val Met Asp Gly Pro Phe Ala
Ala Phe Thr Gly Asn Trp Pro 115 120 125 Ile Asn Val Arg Val Asp Ser
Arg Thr Tyr Leu Arg Arg Ser Leu Gly 130 135 140 Gly Ser Val Ala Glu
Leu Pro Thr Arg Ala Glu Val Glu Ser Val Leu145 150 155 160 Ala Ile
Ser Ala Tyr Asp Leu Pro Pro Tyr Asn Ser Ala Ser Glu Gly 165 170 175
Phe Arg Asn His Leu Glu Gly Trp Arg Gly Val Asn Leu His Asn Arg 180
185 190 Val His Val Trp Val Gly Gly Gln Met Ala Thr Gly Val Ser Pro
Asn 195 200 205 Asp Pro Val Phe Trp Leu His His Ala Tyr Val Asp Lys
Leu Trp Ala 210 215 220 Glu Trp Gln Arg Arg His Pro Asp Ser Ala Tyr
Val Pro Thr Gly Gly225 230 235 240 Thr Pro Asp Val Val Asp Leu Asn
Glu Thr Met Lys Pro Trp Asn Thr 245 250 255 Val Arg Pro Ala Asp Leu
Leu Asp His Thr Ala Tyr Tyr Thr Phe Asp 260 265 270
Ala41258PRTUnknownisolated from soil 41Met Arg Pro Gly Leu Val Leu
Arg Ser Phe Thr Tyr Ala Pro Trp Pro1 5 10 15 Val Leu Leu Ala Thr
Ala Gly Phe Gly Leu Ala Leu Ser Ile Tyr Ser 20 25 30 Asp Ala Ser
Thr Glu Gly Pro Ala Phe Cys Val Ala Thr Asn Gly Leu 35 40 45 Ser
Ile Phe Thr Ser Trp Pro Ala Val Leu Gln Ala Glu Leu Ala Val 50 55
60 Asn Pro Ile His Arg Ile Leu Ala Gly Trp Leu Leu Met Leu Leu
Thr65 70 75 80 Met Met Pro Pro Leu Leu Ala Met Pro Leu Met His Val
Trp Arg Ser 85 90 95 Ser Leu Pro Asn Arg Arg Ile Arg Ala Ser Ala
Gly Phe Leu Leu Gly
100 105 110 Tyr Cys Ala Pro Trp Met Ala Ala Gly Leu Val Leu Ser Ala
Leu Ala 115 120 125 Leu Leu Leu Gln Ile Thr Val Val Asp Asn Ala Leu
Ala Ile Ala Leu 130 135 140 Leu Ile Ala Leu Leu Trp Ser Ala Ser Pro
Trp His Arg Ala Ala Leu145 150 155 160 Asn Arg Ser His Gln Pro Arg
Arg Ile Gly Leu Phe Gly Arg Ala Ala 165 170 175 Asp Arg Asp Cys Leu
Val Phe Gly Met Thr His Gly Ala Tyr Cys Ile 180 185 190 Gly Ser Cys
Trp Ala Trp Met Leu Val Pro Val Val Ser Gly Ala Trp 195 200 205 His
Ile Pro Met Met Leu Phe Thr Gly Val Ile Met Leu Ala Glu Arg 210 215
220 Phe Thr Pro Pro Gly Pro Ala Arg Trp Cys Trp Pro Arg Phe Phe
Ser225 230 235 240 Pro Ala His Leu Tyr Thr Leu Leu Thr Gln Arg Asn
Ala Glu Arg Pro 245 250 255 His Gly4237PRTArtificial
SequenceN-terminal sequence of protein from active fractions of
strain ATX26455 42Met Asn Thr Ile Arg Gln Asp Val Ala Thr Leu Gly
Ser Gly Trp Asp1 5 10 15 Asn Lys Val Leu Leu Asn Tyr Ala Leu Ala
Met Arg Glu Leu Asp Lys 20 25 30 Leu Pro Ile Thr Asn 35
4330DNAArtificial Sequenceolignonucleotide primer 43cangangtng
cnacnntngg nccnggntgg 304424DNAArtificial Sequenceolignonucleotide
primer 44ntgntgnagc canaanatng gntc 24451971DNAPseudomonas
aurantiaca 45ctgagcatct gggaacacca gcagttgcag cgcctgctgc aggcgttgtg
aacaaaggtt 60ccttccatta cacccacgcc aatcctccgt ccgtccgccc aagccaccgg
aacccgtgtc 120gttcatcggg ataatgggaa tcggccatgg cgtttttgcc
aggcctctat actcattttc 180gacgaggcgc gcaccggcac tgcgggcctc
atgagcgcag tkscgycgwg agacatgaag 240tcgccagcgg caaaggattg
cgaggggtgt ggcgccatac gcgtcacctg gcctgatgct 300gcaaggaagg
tgcattcatg aacacgatcc gacaggatgt ggcaacactc ggctccggat
360gggacaacaa ggtcttgctc aactacgcgc tggccatgcg cgagctggac
aaactaccga 420tcaccaaccg caacagctgg aagttcctcg gcgccatcca
cggcttcgat cggcagttgt 480gggtcgaggt gaatgtcctg ggcgattccg
atccggttcc caaggacctg accaacttta 540cctacggcag ccagtgccag
cacggcagct ggtacttcct gtcctggcac cgcggttacc 600tggcggcctt
cgaggcgatt gtcgcggcca aggtcaagga actgacgggt gacgactggg
660cgctgccgta ctggaactac ctcaatagca aaaacccgga tgcgcggcgg
gccccggagg 720cattcctggc ggacaccctg cctgacggca gccccaaccc
gctgaagaaa taccctcgcc 780ggcagggctt taccacgctg cggccgaact
ccctcgatgc cttcagcctg gcggcaatgc 840aggagaacga tttccaggtc
ggcaatgacg gcagcatcgg cttcggcggc ggggtcaccg 900gcaatttcgc
ccagttcgcc cgctggaccg gcgacctgga gaacaacccg cacaacaccg
960tgcatcgtct gatcggcggg ggcgaaggct tcatggccga cccgtacctc
gccgccctgg 1020acccgatctt ctggttgcac cattgcaacg tcgaccggct
ctgggaggcc tggatgaaca 1080ccccgggcaa gaccatggtc cgcgatccgc
gctggctcga cggtccggcc gaccgccgtt 1140tcatcatgcc gacggtcggt
ggcagtgacc ctggcatgaa attcaccggc cgcgacacat 1200tgaaggatgg
caaattgcat ccgcgctatg ccgacttgag catcggcacg ggcgtgaaac
1260caggagtaga ggccgtgaca cgggtcaaga tgggtgcgcc ggaacaacag
aacatcgaac 1320cgatcggtgc caaccgttcg gtggtcacgg tcggcggcgc
gccggtgcgc acccaggtcg 1380acctcgaccg ccaggccacc agcaccggga
tcgccgcgat gggcgcgacg gacctgggcc 1440agccggtgac ccggctctac
ctggcgctgg aatcggtgcg cggctccgcg ccctcgccgc 1500agcttacggt
gtacatcaac ctgccgaaag acagcgaccc gcagcagcat cccgagtgcc
1560atgccggcag cctgacgctg ttcgggctga acgtcgcctc gcggccagac
ggtggccatg 1620gcggccacgg gctcggctat acgatcgaca tcaccgacct
ggcccagcgg ctgaccgatg 1680ccggcgattt cgatcccgac tatctgcggg
tgaccctggt cccaggcgag caggtatcgg 1740cggataaacc ggtgaccgtg
gagcggatca gcgtgctcaa gcgcagtggt atcgtcagct 1800gagtaacgcc
tcatgcaacc cggaccggtc ttgctcagct tcacccgggc gccctggccg
1860ttgctgttcg cgacggccgg gctgggcctg gccctgtgtc tctacaccgc
cgggcacagc 1920accctgcccg ccttctgcgg ttccgcgcta tccatcgttg
ccagttggcc c 1971461482DNAPseudomonas aurantiaca 46atgaacacga
tccgacagga tgtggcaaca ctcggctccg gatgggacaa caaggtcttg 60ctcaactacg
cgctggccat gcgcgagctg gacaaactac cgatcaccaa ccgcaacagc
120tggaagttcc tcggcgccat ccacggcttc gatcggcagt tgtgggtcga
ggtgaatgtc 180ctgggcgatt ccgatccggt tcccaaggac ctgaccaact
ttacctacgg cagccagtgc 240cagcacggca gctggtactt cctgtcctgg
caccgcggtt acctggcggc cttcgaggcg 300attgtcgcgg ccaaggtcaa
ggaactgacg ggtgacgact gggcgctgcc gtactggaac 360tacctcaata
gcaaaaaccc ggatgcgcgg cgggccccgg aggcattcct ggcggacacc
420ctgcctgacg gcagccccaa cccgctgaag aaataccctc gccggcaggg
ctttaccacg 480ctgcggccga actccctcga tgccttcagc ctggcggcaa
tgcaggagaa cgatttccag 540gtcggcaatg acggcagcat cggcttcggc
ggcggggtca ccggcaattt cgcccagttc 600gcccgctgga ccggcgacct
ggagaacaac ccgcacaaca ccgtgcatcg tctgatcggc 660gggggcgaag
gcttcatggc cgacccgtac ctcgccgccc tggacccgat cttctggttg
720caccattgca acgtcgaccg gctctgggag gcctggatga acaccccggg
caagaccatg 780gtccgcgatc cgcgctggct cgacggtccg gccgaccgcc
gtttcatcat gccgacggtc 840ggtggcagtg accctggcat gaaattcacc
ggccgcgaca cattgaagga tggcaaattg 900catccgcgct atgccgactt
gagcatcggc acgggcgtga aaccaggagt agaggccgtg 960acacgggtca
agatgggtgc gccggaacaa cagaacatcg aaccgatcgg tgccaaccgt
1020tcggtggtca cggtcggcgg cgcgccggtg cgcacccagg tcgacctcga
ccgccaggcc 1080accagcaccg ggatcgccgc gatgggcgcg acggacctgg
gccagccggt gacccggctc 1140tacctggcgc tggaatcggt gcgcggctcc
gcgccctcgc cgcagcttac ggtgtacatc 1200aacctgccga aagacagcga
cccgcagcag catcccgagt gccatgccgg cagcctgacg 1260ctgttcgggc
tgaacgtcgc ctcgcggcca gacggtggcc atggcggcca cgggctcggc
1320tatacgatcg acatcaccga cctggcccag cggctgaccg atgccggcga
tttcgatccc 1380gactatctgc gggtgaccct ggtcccaggc gagcaggtat
cggcggataa accggtgacc 1440gtggagcgga tcagcgtgct caagcgcagt
ggtatcgtca gc 148247494PRTPseudomonas aurantiaca 47Met Asn Thr Ile
Arg Gln Asp Val Ala Thr Leu Gly Ser Gly Trp Asp1 5 10 15 Asn Lys
Val Leu Leu Asn Tyr Ala Leu Ala Met Arg Glu Leu Asp Lys 20 25 30
Leu Pro Ile Thr Asn Arg Asn Ser Trp Lys Phe Leu Gly Ala Ile His 35
40 45 Gly Phe Asp Arg Gln Leu Trp Val Glu Val Asn Val Leu Gly Asp
Ser 50 55 60 Asp Pro Val Pro Lys Asp Leu Thr Asn Phe Thr Tyr Gly
Ser Gln Cys65 70 75 80 Gln His Gly Ser Trp Tyr Phe Leu Ser Trp His
Arg Gly Tyr Leu Ala 85 90 95 Ala Phe Glu Ala Ile Val Ala Ala Lys
Val Lys Glu Leu Thr Gly Asp 100 105 110 Asp Trp Ala Leu Pro Tyr Trp
Asn Tyr Leu Asn Ser Lys Asn Pro Asp 115 120 125 Ala Arg Arg Ala Pro
Glu Ala Phe Leu Ala Asp Thr Leu Pro Asp Gly 130 135 140 Ser Pro Asn
Pro Leu Lys Lys Tyr Pro Arg Arg Gln Gly Phe Thr Thr145 150 155 160
Leu Arg Pro Asn Ser Leu Asp Ala Phe Ser Leu Ala Ala Met Gln Glu 165
170 175 Asn Asp Phe Gln Val Gly Asn Asp Gly Ser Ile Gly Phe Gly Gly
Gly 180 185 190 Val Thr Gly Asn Phe Ala Gln Phe Ala Arg Trp Thr Gly
Asp Leu Glu 195 200 205 Asn Asn Pro His Asn Thr Val His Arg Leu Ile
Gly Gly Gly Glu Gly 210 215 220 Phe Met Ala Asp Pro Tyr Leu Ala Ala
Leu Asp Pro Ile Phe Trp Leu225 230 235 240 His His Cys Asn Val Asp
Arg Leu Trp Glu Ala Trp Met Asn Thr Pro 245 250 255 Gly Lys Thr Met
Val Arg Asp Pro Arg Trp Leu Asp Gly Pro Ala Asp 260 265 270 Arg Arg
Phe Ile Met Pro Thr Val Gly Gly Ser Asp Pro Gly Met Lys 275 280 285
Phe Thr Gly Arg Asp Thr Leu Lys Asp Gly Lys Leu His Pro Arg Tyr 290
295 300 Ala Asp Leu Ser Ile Gly Thr Gly Val Lys Pro Gly Val Glu Ala
Val305 310 315 320 Thr Arg Val Lys Met Gly Ala Pro Glu Gln Gln Asn
Ile Glu Pro Ile 325 330 335 Gly Ala Asn Arg Ser Val Val Thr Val Gly
Gly Ala Pro Val Arg Thr 340 345 350 Gln Val Asp Leu Asp Arg Gln Ala
Thr Ser Thr Gly Ile Ala Ala Met 355 360 365 Gly Ala Thr Asp Leu Gly
Gln Pro Val Thr Arg Leu Tyr Leu Ala Leu 370 375 380 Glu Ser Val Arg
Gly Ser Ala Pro Ser Pro Gln Leu Thr Val Tyr Ile385 390 395 400 Asn
Leu Pro Lys Asp Ser Asp Pro Gln Gln His Pro Glu Cys His Ala 405 410
415 Gly Ser Leu Thr Leu Phe Gly Leu Asn Val Ala Ser Arg Pro Asp Gly
420 425 430 Gly His Gly Gly His Gly Leu Gly Tyr Thr Ile Asp Ile Thr
Asp Leu 435 440 445 Ala Gln Arg Leu Thr Asp Ala Gly Asp Phe Asp Pro
Asp Tyr Leu Arg 450 455 460 Val Thr Leu Val Pro Gly Glu Gln Val Ser
Ala Asp Lys Pro Val Thr465 470 475 480 Val Glu Arg Ile Ser Val Leu
Lys Arg Ser Gly Ile Val Ser 485 490 48487PRTPseudomonas aurantiaca
48Met Ala Thr Leu Gly Ser Gly Trp Asp Asn Lys Val Leu Leu Asn Tyr1
5 10 15 Ala Leu Ala Met Arg Glu Leu Asp Lys Leu Pro Ile Thr Asn Arg
Asn 20 25 30 Ser Trp Lys Phe Leu Gly Ala Ile His Gly Phe Asp Arg
Gln Leu Trp 35 40 45 Val Glu Val Asn Val Leu Gly Asp Ser Asp Pro
Val Pro Lys Asp Leu 50 55 60 Thr Asn Phe Thr Tyr Gly Ser Gln Cys
Gln His Gly Ser Trp Tyr Phe65 70 75 80 Leu Ser Trp His Arg Gly Tyr
Leu Ala Ala Phe Glu Ala Ile Val Ala 85 90 95 Ala Lys Val Lys Glu
Leu Thr Gly Asp Asp Trp Ala Leu Pro Tyr Trp 100 105 110 Asn Tyr Leu
Asn Ser Lys Asn Pro Asp Ala Arg Arg Ala Pro Glu Ala 115 120 125 Phe
Leu Ala Asp Thr Leu Pro Asp Gly Ser Pro Asn Pro Leu Lys Lys 130 135
140 Tyr Pro Arg Arg Gln Gly Phe Thr Thr Leu Arg Pro Asn Ser Leu
Asp145 150 155 160 Ala Phe Ser Leu Ala Ala Met Gln Glu Asn Asp Phe
Gln Val Gly Asn 165 170 175 Asp Gly Ser Ile Gly Phe Gly Gly Gly Val
Thr Gly Asn Phe Ala Gln 180 185 190 Phe Ala Arg Trp Thr Gly Asp Leu
Glu Asn Asn Pro His Asn Thr Val 195 200 205 His Arg Leu Ile Gly Gly
Gly Glu Gly Phe Met Ala Asp Pro Tyr Leu 210 215 220 Ala Ala Leu Asp
Pro Ile Phe Trp Leu His His Cys Asn Val Asp Arg225 230 235 240 Leu
Trp Glu Ala Trp Met Asn Thr Pro Gly Lys Thr Met Val Arg Asp 245 250
255 Pro Arg Trp Leu Asp Gly Pro Ala Asp Arg Arg Phe Ile Met Pro Thr
260 265 270 Val Gly Gly Ser Asp Pro Gly Met Lys Phe Thr Gly Arg Asp
Thr Leu 275 280 285 Lys Asp Gly Lys Leu His Pro Arg Tyr Ala Asp Leu
Ser Ile Gly Thr 290 295 300 Gly Val Lys Pro Gly Val Glu Ala Val Thr
Arg Val Lys Met Gly Ala305 310 315 320 Pro Glu Gln Gln Asn Ile Glu
Pro Ile Gly Ala Asn Arg Ser Val Val 325 330 335 Thr Val Gly Gly Ala
Pro Val Arg Thr Gln Val Asp Leu Asp Arg Gln 340 345 350 Ala Thr Ser
Thr Gly Ile Ala Ala Met Gly Ala Thr Asp Leu Gly Gln 355 360 365 Pro
Val Thr Arg Leu Tyr Leu Ala Leu Glu Ser Val Arg Gly Ser Ala 370 375
380 Pro Ser Pro Gln Leu Thr Val Tyr Ile Asn Leu Pro Lys Asp Ser
Asp385 390 395 400 Pro Gln Gln His Pro Glu Cys His Ala Gly Ser Leu
Thr Leu Phe Gly 405 410 415 Leu Asn Val Ala Ser Arg Pro Asp Gly Gly
His Gly Gly His Gly Leu 420 425 430 Gly Tyr Thr Ile Asp Ile Thr Asp
Leu Ala Gln Arg Leu Thr Asp Ala 435 440 445 Gly Asp Phe Asp Pro Asp
Tyr Leu Arg Val Thr Leu Val Pro Gly Glu 450 455 460 Gln Val Ser Ala
Asp Lys Pro Val Thr Val Glu Arg Ile Ser Val Leu465 470 475 480 Lys
Arg Ser Gly Ile Val Ser 485
* * * * *
References