U.S. patent application number 15/657999 was filed with the patent office on 2017-11-16 for tal-mediated transfer dna insertion.
The applicant listed for this patent is J.R. Simplot Company. Invention is credited to Hui Duan, Caius M. Rommens, J. Troy Weeks.
Application Number | 20170327833 15/657999 |
Document ID | / |
Family ID | 50776498 |
Filed Date | 2017-11-16 |
United States Patent
Application |
20170327833 |
Kind Code |
A1 |
Rommens; Caius M. ; et
al. |
November 16, 2017 |
TAL-MEDIATED TRANSFER DNA INSERTION
Abstract
The invention relates to methods for stably integrating a
desired polynucleotide into a plant genome, comprising transforming
plant material with a first vector comprising nucleotide sequences
encoding TAL proteins designed to recognize a target sequence;
transforming the plant material with a second vector comprising (i)
a marker gene that is not operably linked to a promoter
("promoter-free marker cassette") and which comprises a sequence
homologous to the target sequence; and (ii) a desired
polynucleotide; and identifying transformed plant material in which
the desired polynucleotide is stably integrated.
Inventors: |
Rommens; Caius M.; (Boise,
ID) ; Duan; Hui; (Boise, ID) ; Weeks; J.
Troy; (Boise, ID) |
|
Applicant: |
Name |
City |
State |
Country |
Type |
J.R. Simplot Company |
Boise |
ID |
US |
|
|
Family ID: |
50776498 |
Appl. No.: |
15/657999 |
Filed: |
July 24, 2017 |
Related U.S. Patent Documents
|
|
|
|
|
|
Application
Number |
Filing Date |
Patent Number |
|
|
14084406 |
Nov 19, 2013 |
9756871 |
|
|
15657999 |
|
|
|
|
61790434 |
Mar 15, 2013 |
|
|
|
61728466 |
Nov 20, 2012 |
|
|
|
Current U.S.
Class: |
1/1 |
Current CPC
Class: |
C12N 15/8213 20130101;
C12N 15/8278 20130101; C12N 15/8209 20130101; C12N 15/8245
20130101; C12N 15/8274 20130101; C12N 15/8251 20130101; C12N
15/8282 20130101; C12N 15/825 20130101; A23V 2002/00 20130101; A23L
19/18 20160801 |
International
Class: |
C12N 15/82 20060101
C12N015/82; C12N 15/82 20060101 C12N015/82; C12N 15/82 20060101
C12N015/82; A23L 19/18 20060101 A23L019/18; C12N 15/82 20060101
C12N015/82; C12N 15/82 20060101 C12N015/82; C12N 15/82 20060101
C12N015/82 |
Claims
1. A transformed potato plant comprising in its genome a sequence
exogenous to the untransformed plant, said sequence comprising: (i)
a promoter-free marker cassette; and (ii) a desired polynucleotide;
wherein the promoter-free marker cassette and the desired
polynucleotide are positioned downstream of one of the plant's
genomic endogenous gene promoters, and wherein the promoter-free
marker cassette is expressed by said genomic endogenous gene
promoter.
2. The transformed potato plant of claim 1, wherein the desired
polynucleotide comprises a silencing cassette targeting one or more
genes selected from the group consisting of asparagine synthase 1
(Asn1), polyphenol oxidase (Ppo), and vacuolar invertase (Inv)
genes.
3. The transformed potato plant of claim 2, wherein the desired
polynucleotide further expresses a late blight resistance gene
Vnt1.
4. The transformed potato plant of claim 3, wherein said
transformed potato plant is capable of producing tubers wherein the
plant has a phenotype characterized by one or more of black spot
bruise tolerance, reduced cold-induced sweetening and reduced
asparagine levels in its tubers as a result of the expression of
the desired polynucleotide.
5. A heat-processed product, wherein said heat processed product
comprises cells from the transformed potato plant of claim 4.
6. The heat-processed product of claim 5, wherein the product is a
French fry, chip, crisp, potato, dehydrated potato or baked
potato.
7. The heat-processed product of claim 5, wherein the
heat-processed product has a lower level of acrylamide than a
heat-processed product of an otherwise identical plant lacking the
desired polynucleotide.
8. The transformed potato plant of claim 1, wherein the
promoter-free marker cassette encodes a mutated acetolactate
synthase (ALS) gene, and wherein said mutated ALS gene confers the
plant with resistance to at least one ALS inhibitor selected from
the group consisting of sulfonylureas, imidazolinones,
triazolopyrimidines, pyrimidinyl oxybenzoates, and sulfonylamino
carbonyl triazolinones.
9. The transformed potato plant of claim 8, wherein the mutated ALS
gene encodes for a peptide selected from the group consisting of
SEQ ID Nos: 11 and 13.
10. A heat-processed product, wherein said heat processed product
comprises cells from the tuber grown from the transformed potato
plant of claim 9.
11. A method for stably integrating a desired polynucleotide
downstream of a potato plant's endogenous gene promoter, said
method comprising: (A) transforming potato plant material with a
first vector comprising nucleotide sequences encoding Transcription
Activator-Like Effector Nuclease (TAL) proteins designed to
recognize a target sequence, wherein the target sequence is
downstream of one of the potato plant's endogenous gene promoters;
(B) transforming the potato plant material with a second vector
comprising (i) a marker gene that is not operably linked to a
promoter, referred to as a "promoter-free marker cassette", and
which comprises a sequence homologous to the target sequence, and
(ii) a desired polynucleotide; and (C) identifying transformed
potato plant material in which the desired polynucleotide is stably
integrated downstream of the potato plant's gene promoter.
12. The method of claim 11, wherein the transformed plant material
is exposed to conditions that reflect the presence or absence of
the marker gene in the transformed plant.
13. The method of claim 12, wherein the marker gene is an herbicide
resistance gene and the transformed plant material is exposed to
herbicide.
14. The method of claims 13, wherein the herbicide resistance gene
is a mutated ALS gene, and wherein said mutated ALS gene confers
the potato plant with resistance to at least one ALS inhibitor
selected from the group consisting of sulfonylureas,
imidazolinones, triazolopyrimidines, pyrimidinyl oxybenzoates, and
sulfonylamino carbonyl triazolinones.
15. The method of claim 11, wherein the promoter-free marker
cassette is stably integrated into the potato plant's genome.
16. A method for the targeted insertion of exogenous DNA downstream
of a potato plant's endogenous gene promoter, said method
comprising the steps of (i) transforming isolated potato plant
cells with (A) a first binary vector comprising a promoter-less
cassette comprising (a) a right border sequence linked to (b) a
partial sequence of a target sequence located downstream of the
endogenous gene promoter, said partial sequence being fused to a
mutated ALS gene; (c) a desired nucleotide sequence, wherein the
desired nucleotide sequence is not operably linked to a promoter;
and (d) a terminator sequence; and (B) a second binary vector
comprising (a) a forward expression cassette and a reverse
expression cassette, wherein each expression cassette comprises a
nucleotide sequence encoding a modified TAL operably linked to a
strong constitutive promoter, and a terminator sequence; and (b) a
sequence encoding isopentenyl transferase (ipt), wherein the
modified TAL is designed to bind the target sequence; and (ii)
culturing the transformed potato plant cells under conditions that
promote growth of edited potato plants that express the desired
nucleotide sequence; wherein no vector backbone DNA is permanently
inserted into the edited potato plant's genome.
17. The method of claim 16, wherein the modified TAL comprises (a)
a truncated C-terminal activation domain comprising a Fok1
endonuclease catalytic domain; (b) a codon-optimized target
sequence binding domain comprising 16.5 repeat variable diresidues
corresponding to the endogenous Ubi7 5'-untranslated intron
sequence; and (c) an N-terminal region comprising a SV40 nuclear
localization sequence.
18. The method of claim 16, wherein the desired nucleotide sequence
is a silencing cassette targeting one or more genes selected from
the group consisting of Asn1, Ppo, and Inv genes.
19. The method of claim 16, wherein the first binary vector further
comprises a late blight resistance gene Vnt1 operably linked to its
native promoter and terminator sequences.
Description
[0001] This application claims priority to U.S. provisional
application No. 61/790,434, filed Mar. 15, 2013, and U.S.
provisional application No. 61/728,466, filed Nov. 20, 2012, both
of which are incorporated herein by reference in their
entireties.
SEQUENCE LISTING
[0002] The instant application contains a Sequence Listing which
has been submitted electronically in ASCH format and is hereby
incorporated by reference in its entirety. Said ASCH copy, created
on Jan. 30, 2014, is named 058951-0450_SL.txt and is 100,363 bytes
in size.
FIELD OF THE INVENTION
[0003] The present invention relates to the field of plant
biotechnology and provides methods for targeted transfer DNA
insertion for the production of plants and plant products with
desirable traits.
BACKGROUND OF THE INVENTION
[0004] A plant can be modified through insertion of a DNA segment
into its genome. The added DNA comprises genetic elements
rearranged to produce RNA that either encodes a protein or triggers
the degradation of specific native RNA. The prior art teaches a
variety of sub-optimal methods that result in non-targeted
(unpredictable and random) insertion.
[0005] These is a need in the art for an efficient and reproducible
production of genetically engineered plants and plant products with
desirable traits. The challenges associated with the employment of
transgenic traits are disconcerting, especially because important
quality issues have not effectively been addressed through
conventional breeding.
SUMMARY OF THE INVENTION
[0006] One aspect of the present invention is a method for stably
integrating a desired polynucleotide into a plant genome,
comprising:
(A) transforming plant material with a first vector comprising
sequences encoding TAL proteins designed to recognize a target
sequence; (B) transforming the plant material with a second vector
comprising (i) a marker gene that is not operably linked to a
promoter ("promoter-free marker cassette") and which comprises a
sequence homologous to the target sequence; and (ii) a desired
polynucleotide; and (C) identifying transformed plant material in
which the desired polynucleotide is stably integrated.
[0007] In one embodiment, the transformed plant material is exposed
to conditions that reflect the presence or absence of the marker
gene in the transformed plant. In another embodiment, the marker
gene is a herbicide resistance gene and the transformed plant
material is exposed to herbicide. In one embodiment, the herbicide
resistance gene is the ALS gene. In another embodiment, the
promoter-free marker cassette is stably integrated into the plant
genome.
[0008] In another embodiment, the invention provides a method for
the targeted insertion of exogenous DNA into a plant comprising the
steps of (i) transforming isolated plant cells with (A) a first
binary vector comprising a promoter-less cassette comprising (a) a
right border sequence linked to (b) a partial sequence of the Ubi7
intron 5'-untranslated region; (c) an Ubi7 monomer-encoding
sequence fused to a mutated acetolactate synthase (ALS) gene; (d) a
desired nuelcotide sequence; and (e) a terminator sequence, wherein
the desired nucleotide sequence is not operably linked to a
promoter; and (B) a second binary vector comprising (a) a right
border; (b) a forward expression cassette and a reverse expression
cassette, each comprising a modified TAL effector operably linked
to a strong constitutive promoter, and a terminator sequence; and
(c) a sequence encoding an enzyme involved in cytokinin production,
such as isopentenyl transferase (ipt), wherein the modified TAL
effector is designed to bind the desired nucleotide sequence within
an intron of potato's ubiquitin-7 (Ubi7) gene; and (ii) culturing
the isolated plant cells under conditions that promote growth of
plants that express the desired nucleotide sequence; wherein no
vector backbone DNA is permanently inserted into the plant
genome.
[0009] In a preferred aspect of the invention, the modified TAL
effector comprises (a) a truncated C-terminal activation domain
comprising a Fok1 endonuclease catalytic domain; (b) a
codon-optimized target sequence binding domain comprising 16.5
repeat variable diresidues corresponding to the Ubi7
5'-untranslated intron sequence; and (c) an N-terminal region
comprising a SV40 nuclear localizaton sequence.
[0010] In an additional preferred aspect of the invention, the
desired nucleotide sequence is a silencing cassette targeting one
or more genes selected from the group consisting of asparagine
synthase 1 (Asn1), polyphenol oxidase (Ppo), and vacuolar invertase
(Inv) genes. In an even more preferred aspect of the invention, the
first binary vector further comprises a late blight resistance gene
Vnt1 operably linked to its native promoter and terminator
sequences.
[0011] In a different embodiment, the invention provides a
transformed plant comprising in its genome an endogenous Ubi7
promoter operably linked to a desired exogenous nucleotide sequence
operably linked to an exogenous terminator sequence. In one aspect
of the invention, the expression of one or more genes selected from
the group consisting of asparagine synthase 1 (Asn1), polyphenol
oxidase (Ppo), and vacuolar invertase (Inv) genes in the
transformed plant is down-regulated. In a preferred aspect of the
invention, the plant further expresses a late blight resistance
gene Vnt 1.
[0012] In one embodiment, the transformed plant is a tuber-bearing
plant. In a preferred embodiment, the tuber-bearing plant is a
potato plant. Preferably, the transformed plant has a phenotype
characterized by one or more of late blight resistance, black spot
bruise tolerance, reduced cold-induced sweetening and reduced
asparagine levels in its tubers.
[0013] In yet another embodiment, the invention provides a
heat-processed product of the transformed plant of the invention.
Preferably, the heat-processed product is a French fry, chip,
crisp, potato, dehydrated potato or baked potato. In a preferred
aspect of the invention, the heat-processed product has a lower
level of acrylamide than a heat-processed product of a
non-transformed plant of the same species.
[0014] In a different embodiment, the invention provides a modified
TAL effector designed to bind to a desired sequence comprising (a)
a truncated C-terminal activation domain comprising a catalytic
domain: (b) a codon-optimized target sequence binding domain; and
(c) an N-terminal region comprising a nuclear localization
sequence. In a preferred aspect of the invention, the modified TAL
effector is designed to bind the desired sequence within an intron
of potato's ubiquitin-7 (Ubi7) gene. As such, the modified TAL
effector comprises (a) a catalytic domain in the C-terminal
activation domain comprising a Fok 1 endonuclease; (b) a target
sequence binding domain comprising 16.5 repeat variable diresidues
corresponding to the Ubi7 5'-untranslated intron sequence; and (c)
a SV40 nuclear localization sequence in the N-terminal region.
[0015] In yet another embodiment, the invention provides a binary
vector comprising (a) a right border; (b) a forward expression
cassette and a reverse expression cassette, each comprising a
modified TAL effector according to claim 16 operably linked to a
strong constitutive promoter and a terminator sequence: and (c) a
sequence encoding an enzyme invoked in cytokinine production, such
as isopentenyl transferase (ipt).
[0016] In yet another embodiment, the invention provides a DNA
construct comprising a promoter-less cassette comprising (a) a
right border sequence inked to (b) a partial Ubi7 5'-untranslated
intron sequence: (c) an Ubi7 monomer-encoding sequence fused to a
mutated acetolactate synthase (ALS) gene: (d) a desired nuelcotide
sequence; (e) a terminator sequence; and (f) a left border, wherein
the desired nucleotide sequence is not operably linked to a
promoter. In a preferred aspect of the invention, the desired
nuelcotide sequence in the DNA construct is a silencing cassette
targeting one or more genes selected from the group consisting of
asparagine synthase 1 (Asn1), polyphenol oxidase (Ppo), and
vacuolar invertase (Inv) genes. In an even more preferred aspect,
the DNA construct further comprises a late blight resistance gene
Vnt1 operably linked to its native promoter and terminator
sequences.
[0017] In a different embodiment, the invention provides a kit for
targeted insertion of exogenous DNA into a plant comprising: (A) a
first binary vector comprising a promoter-less cassette comprising
(a) a right border sequence linked to (b) a partial sequence of the
Ubi7 intron 5'-untranslated region; (c) an Ubi7 monomer-encoding
sequence fused to a mutated acetolactate synthase (ALS) gene: (d) a
desired nucleotide sequence; and (e) a terminator sequence, wherein
the desired nucleotide sequence is not operably linked to a
promoter; and B) a second binary vector comprising (a) a right
border; (b) a forward expression cassette and a reverse expression
cassette, each comprising a modified TAL effector operably linked
to a strong constitutive promoter, and a terminator sequence; and
(c) a sequence encoding isopentenyl transferase (ipt). In a
preferred aspect of the invention, the modified TAL effector is
designed to bind the desired nucleotide sequence within an intron
of potato's ubiquitin-7 (Ubi7) gene, and comprises (a) a truncated
C-terminal activation domain comprising a Fok1 endonuclease
catalytic domain; (b) a codon-optimized target sequence binding
domain comprising 16.5 repeat variable diresidues corresponding to
the Ubi7 5'-untranslated intron sequence; and (c) an N-terminal
region comprising a SV40 nuclear localization sequence. In another
preferred aspect of the invention, the desired nucleotide sequence
is a silencing cassette targeting one or more genes selected from
the group consisting of asparagine synthase 1 (Asn1), polyphenol
oxidase (Ppo), and vacuolar invertase (Inv) genes. In an even more
preferred aspect of the invention, the first binary vector further
comprises a late blight resistance gene Vnt1 operably linked to its
native promoter and terminator sequences.
[0018] The foregoing general description and the following detailed
description are exemplary and explanatory and are intended to
provide further explanation of the invention as claimed. Other
objects, advantages and novel features will be readily apparent to
those skilled in the art from the following detailed description of
the invention.
BRIEF DESCRIPTION OF THE DRAWINGS
[0019] This application contains at least one drawing executed in
color.
[0020] FIG. 1 illustrates the transfer DNA organization of the
plasmid pSIM2168. Sequence shown in the bottom panel starts from
the 25 bp unhighlighted right border. The light gray highlighted
sequence is part of the Ubi7 intron, the dark gray highlighted
sequence is the Ubi7 monomer and the remaining unhighlighted
sequence is part of the potato ALS gene coding. The Ubi7In segment
of pSIM 2168 comprises a homologous arm which is homologous to the
endogenous intron sequence selectively cut by TAL. FIG. 1 discloses
SEQ ID NO: 22.
[0021] FIG. 2 illustrates the forward (E3) and reverse (E4) TAL
effector cassettes in the vector pSIM2170.
[0022] FIG. 3 shows the right border testing cassette described in
Example 9.
[0023] FIG. 4 illustrates the DNA organization of the plasmid
pSIM2162 carrying the Ubi7::ALS cassette.
[0024] FIG. 5 shows the organization of the forward (5A) and
reverse (5B) effector proteins (SEQ ID NOS 7 and 9,
respectively).
[0025] FIG. 6 illustrates the organization of the plasmid pSIM216
carrying the target sequence containing the forward and reverse
recognition sites positioned immediately downstream from the start
codon of the GUS reporter gene. FIG. 6 discloses SEQ ID NO: 23.
[0026] FIG. 7 shows GUS staining of Nicothiana benthamiana leaves
following Agrobacterium infiltration. Left panel: infiltration with
target vector pSIM2167 alone. Right panel; infiltration with target
vector pSIM2167 and TAL effector vector pSIM2170.
[0027] FIG. 8 shows the sequence of PCR-amplified target region of
the plasmid pSIM2167 after co-infiltration with the TAL effector
vector pSIM2170. Effector recognition site is gray highlighted.
Modifications on target sequence are small deletion (majority form)
and substitutions (dark gray highlighted). FIG. 8 discloses SEQ ID
NOS 24-37, respectively, in order of appearance.
[0028] FIG. 9 shows the sequences of fragments from targeted
insertion-specific PCR. The first non-highlighted sequence and the
first light gray highlighted sequence are potato genome sequences,
non highlighted sequence: part of Uni7-like promoter; light gray
highlighted sequence; Uni7-like intron. The remaining sequences are
from the pSIM2168 vector. Dark gray sequence: part of Ubi7 intron;
non-highlighted sequence: Ubi7 monomer; light gray highlighted
sequence: part of the ALS coding sequence. FIG. 9 discloses SEQ ID
NOS 38-40, respectively, in order of appearance.
[0029] FIG. 10 shows inter-node explants grown in hormone-free
medium containing timentin and 0.0 mg/l imazamox (left panel) or
2.0 mg/l imazamox (right panel). No fully developed normal shoots
were visible when the inter-node explants were grown in a medium
containing imazamox.
[0030] FIG. 11 shows Ranger Russet control (RR-C) lines and
herbicide-resistant Ranger Russet lines co-transformed with the
pSIM2170 and pSIM2168 plasmids for targeted insertion, challenged
with P. infestans late blight strain US8 BF6 for the development of
disease symptoms, at seven days after infection.
[0031] FIG. 12 depicts the southern blot gels for selected
herbicide-resistant Ranger Russet lines co-transformed with the
pSIM2170 and pSIM2168 plasmids for targeted insertion. Left panel:
invertase probe; right panel: Vnt1 promoter probe. Each additional
band in the transformed lines, as compared to the Ranger Russet
control (RR) lines, indicates a single copy trasgene for lines
RR-36 (36) and RR-39 (39). Transformed lines RR-26 and RR-32 are
not shown.
[0032] FIG. 13 shows uniform silencing of asparagine in potato
tubers. Snowden lines 2, 15, 55 and 83 were transformed with
pSIM2168 and pSM2170.
[0033] FIG. 14 shows uniform silencing of polyphenol oxidase in
potato tuber. Snowden lines 2, 15, 55 and 83 were transformed with
pSIM2168 and pSIM2170.
[0034] FIG. 15 shows uniform silencing of asparagine in potato
tubers. Ranger lines 26, 32, 38 and 39 were transformed with
pSIM2168 and pSIM2170.
[0035] FIG. 16 shows uniform silencing of polyphenol oxidase in
potato tubers. Ranger lines 26, 32, 38 and 39 were transformed with
pSIM2168 and pSIM2170.
[0036] FIG. 17 shows yields of transformed potato lines.
[0037] FIG. 18 shows FMV-CAS9-OCS and 35s-gRNA-Nos cassettes in
pSIM4187. The sequence of gRNA is shown and the 20 bp target
specific sequence is highlighted. FIG. 18 discloses SEQ ID NOS
41-42, respectively, in order of appearance.
[0038] FIG. 19 shows GUS staining of N. benthamiana leaf after
Agrobacterium infiltration, pSIM2167; Target-GUS vector: pSIM4187:
Cas9 and gRNA vector.
[0039] FIG. 20 shows sequence of PCR amplified target region of
plasmid pSIM2167 after co-infiltrated with pSIM4187. Target
specific sequence in gRNA is dark green highlighted. Modifications
on target sequence are small deletion and substitutions. FIG. 20
discloses SEQ ID NOS 43-51, respectively, in order of
appearance.
DETAILED DESCRIPTION OF THE INVENTION
[0040] One aspect of the present invention is the transient
expression of transcription activator-like effector proteins
designed to bind to, and consequently cut, a desired genomic target
locus, thereby facilitating the insertion of a desired
polynucleotide at that particular target locus. Accordingly, the
present invention encompasses the transformation of plant material
with a sector that contains an expression cassette encoding
peptides or proteins that form appropriate TAL dimers that
recognize and cleave a target locus, and a second vector that
comprises one or more desired expression cassettes. Such desired
expression cassettes may encode a particular protein or gene
silencing transcript. In one embodiment, the second vector may
comprise a cassette referred to herein as a "promoter-free"
cassette, which comprises (i) a marker gene or gene that encodes a
desired phenotype, and appropriate other regulatory elements that
would facilitate appropriate expression of that marker gene if it
was operably linked to a promoter; and (ii) a nucleotide region
homologous to the endogenous target locus site destination. Said
homologous nucleotide region can comprise, for example, 10-20,
20-50, or 20-100 nucleotides, that share 100%, or at least 99%, or
at least 98%, or at least 97%, or at least 96%, or at least 95%, or
at least 94%, or at least 93%, or at least 92%, or at least 91%, or
at least 90%, or at least 89%, or at least 88%, or at least 87%, or
at least 86%, or at least 83%, or at least 84%, or at least 83%, or
at least 82%, or at least 81 %. or at least 80%, or at least 79%,
or at least 78%, or at least 77%. or at least 76%. or at least 75%,
or at least 74%, or at least 73%, or at least 72%, or at least 71%,
or at least 70%, or at least 69%, or at least 68%, or at least 67%,
or at least 66%, or at least 65%, or at least 64%, or at least 63%,
or at least 62%, or at least 61%, or at least 60%, or at least 59%,
or at least 58%, or at least 57%, or at least 56%, or at least 55%,
or at least 54%, or at least 53%. or at least 52%, or at least 31%,
or at least 50% nucleotide sequence identity with the corresponding
sequence of the endogenous target locus.
[0041] Thus, in one embodiment, the second vector comprises at
least (i) an expression cassette encoding a desired polynucleotide
(such as one that encodes a protein or untranslatable RNA
transcript, which RNA transcript may comprise a sense, an
antisense, and/or an inverted repeat of a sequence of a target gene
to be downregulated, and (ii) a promoter-free marker cassette that
comprises a marker gene operably linked to a regulatory element
such as a terminator or 3-untranslated region, along with the
homologous target site region.
[0042] The promoter-free marker cassette and the expression
cassettes(s) of the second vector ideally travel together so that
both become integrated into the target locus as a consequence of
TAL-mediated activity (brought about by the other TAL-encoding,
vector). Ideally, the promoter-free cassette and the expression
cassette(s) are integrated into a desired site at the target locus
suitably near, e.g., downstream or upstream as the case may be, of
one or more functional endogenous promoters or endogenous
regulatory elements, such that the endogenous promoter or
regulatory element expresses the marker gene of the promoter-free
marker cassette in the second vector. The appropriate design of the
TAL sequences to recognize such a target sequence downstream or
upstream of an endogenous gene promoter or regulatory element that
initiates gene expression, such as an enhancer element, is
therefore important in helping to ensure that the expression
cassette(s) and promoter-free marker cassette are integrated at a
particularly chosen genomic location time and again between
different transformation events.
[0043] Therefore, the present invention permits site-specific
insertion of a desired polynucleotide such as one the cassettes
disclosed herein and as described elsewhere, which ensures
consistency in the expression or downregulation level of a
particular target gene between different transformation events. For
example, the site-specific insertion of the desired polynucleotide
could function to express de novo or overexpress a target gene. In
some embodiments, the levels of de novo expression or
overexpression of the target gene might vary among different
transformation events for no more than 200%, or no more than 100%,
or no more than 50%, or no more than 30%. Alternatively, the
site-specific insertion of the desired polynucleotide could
function to produce an RNA transcript to downregulate a target
gene. In some embodiments, the levels of downregulation of the
target gene might vary among different transformation events for no
more than 200%, or no more than 100%, or no more than 50%, or no
more than 30%.
[0044] The marker gene is important because if the promoter-free
marker cassette is appropriately integrated, the marker gene will
be expressed by the endogenous regulatory element, and depending on
the type of marker will (a) effectively identity successful
transformants, and (b) give a preliminary indication of the
successful insertion of the co-joined expression cassette(s) at the
desired target location. Thus, if the marker gene is a herbicide
resistant gene, the transformed plant cells may be cultured on the
relevant herbicide and cells that survive reflect those that are
transformed with the herbicide resistance gene at the desired
target locus near a functional endogenous promoter.
[0045] Thus, the ability to routinely insert an expression cassette
at the same genomic locus between different transformation events
is highly desirable and advantageous and cost-effective because
this reduces the magnitude of screens needed to identify
integration events that would otherwise occur randomly in different
genomic environments. See, e.g., Example 14. Those differences in
random integration loci can often disrupt the local genomic
environment detrimentally, knock-out essential genes, or place the
desired expression cassettes in loci that fail to express the
integrated DNA.
[0046] Accordingly, the homologous target site region present in
the promoter-free marker cassette of the second vector is
specifically designed to match up with the endogenous target site
sequence that the TAL protein dimer of the first vector is also
designed to recognize, bind to, and cut. The second vector may
comprises one homologous target site region upstream or downstream
of the polynucleotide sequences to be inserted, or two homologous
target site regions flanking of the polynucleotide sequences to be
inserted. Thus, both the promoter-free marker cassette and the TAL
expression cassette contain sequences unique to the endogenous
genorme target locus, such that the promoter-free marker cassette
and its co-joined desired expression cassettes, is inserted into
the precise target locus site cut by the TALs.
[0047] The present invention is not limited to the insertion of
promoter-free marker cassettes and expression cassettes into a
genomic locus or nearby an endogenous promoter or regulatory
element. Rather, the present invention encompasses the use of the
inventive method to stack cassettes in a modular fashion based upon
the design of TAL sequences and homologous regions that recognize
polynucleotide sequences from prior transformation events. That is,
in one embodiment a plant may have already been stably transformed
with Expression Cassette A that, with or without the use of TAL,
expresses Gene X at a particular or random site in the plant
genome. The present TAL-mediated integration method allows for the
design of TAL sequences that recognize a sequence perhaps
downstream of Gene X in Expression Cassette A, such that the TAL
dimer effectively cleaves the plant genome at that Gene X site. If
the promoter-free marker cassette--or any expression
cassette--comprises a homologous region to that Gene X site, then
it is possible to introduce that cassette immediately downstream of
Gene X. As mentioned, it is not necessary that in all cases the
present invention must utilize a promoter-free marker system for it
may be the case that the gene-of-interest integrated downstream of
the pre-transformed Gene X plant produces a detectable and desired
trait in and of itself. Furthermore, the additional expression
cassette may contain its own promoter or may be promoter-tree such
that the gene-of-interest is expressed from the promoter or
regulatory element of Expression Cassette A.
[0048] Accordingly, in one embodiment, the present invention
encompasses the de novo insertion of a desired expression cassette
into a target locus using the promoter-free marker design to
identity successful and appropriate transformants. In another
embodiment, the present invention encompasses the subsequent
insertion of one or more additional expression cassettes, which may
or may not include a promoter-free marker cassette, downstream or
upstream of a prior integration event. Thus, in the latter
approach, the present invention permits the ability to stack genes
at precise and defined locations within a plant genome by
effectively linking together different expression cassettes even
though this is done via different transformations, using
TAL-mediated site-specific insertion technology described
herein.
[0049] In one embodiment, it is desirable to only transiently
express the TAL proteins such that the only DNA that becomes stably
integrated into the plant genome belongs to the desired expression
cassette(s) and promoter-free marker cassette, if used.
[0050] Accordingly, one aspect of the present invention encompasses
(1) identifying in the genome of a plant a desired target locus
sequence; (2) designing corresponding TAL sequences that
specifically recognize that target locus sequence; and, optionally
(3) assaying the designed TAL sequences in an infiltration assay,
for instance, to test if the corresponding TAL dimer, when formed,
cuts appropriately. TALs that work can then be subcloned into a
transient expression transformation vector, such as shown in FIG.
2. Such steps are described in detail herein. See for instance
Examples 10 and 11.
[0051] A second vector can then be designed comprising one or more
desired expression cassettes along with the promoter-free marker
cassette, and both the TAL vector and the second vector
subsequently transformed into one or two strains of
Agrobacteria.
[0052] Plant material, such as explants, calli, cells, leaves, or
stems, can then be transformed using these Agrobacteria. In one
embodiment, the transformed plant material can be grown into calli
on media that does not contain any selection component. That is,
for the ease of illustration, if the second sector comprises a
herbicide resistance marker gene that is not operably linked to a
promoter, then the transformed plant material would initially be
cultured on media that does not contain herbicide for a certain
period of time. After that period of time, the plant material may
be placed on callus induction media that does contain herbicide.
Those materials that survive can then be placed on shoot induction
media that also contains the same herbicide until shoots develop
and survive for a period of time. The shoots that grow on herbicide
media are therefore likely to contain the stably integrated
herbicide resistance gene in then genomes along with the actual
desired expression cassettes. Those herbicide-resistant shoots or
leaves that grow from those shoots can then be subjected to PCR and
other molecular analyses to determine if they contain the marker
and also the desired expression cassette(s) in the correct and
expected genomic target location. This method is described herein
in detail, see for instance Example 4. When the ALS gene was used
as the marker gene in the promoter-free arrangement, as discussed
herein, 80% of the analyzed transformed shoots/leaves contained the
desired insert stably integrated in the desired genomic locus.
[0053] In one embodiment, the second vector comprises the gene
expression cassette for late blight and a gene silencing cassette
for silencing PPO, ASNI, and invertase, in addition to a
promoter-free ALS herbicide marker gene, as shown in pSIM2168 (FIG.
1). In this case, the ALS gene is not operably linked to a promoter
but it is operably linked to a terminator and includes, upstream, a
sequence homologous to a region of the endogenous plant Ubi7 gene
intron and part of the Ubi1 exon #1. FIG. 2 depicts the
corresponding vector (pSIM2170) that expresses the E4Rep and E3
repeat TAL sequences that are also designed to recognize a
naturally-occurring sequence within the Ubi7 gene intron. Both
pSIM2168 and pSIM2170 are transformed into potato stem explants and
subjected to the method described above and as described in
methodological detail in the Examples provided herein. The results
show that the inventive approach was successful in using
TAL-mediated integration to stably integrate the cassettes of
pSIM2168 into the precise target location desired in the endogenous
Ubi7 gene intron.
[0054] The present inventive methods are not limited to the
introduction of such vectors using transfer-DNAs or Agrobacterium.
It is possible to use particle bombardment, for instance, without
any Agrobacterium or T-DNA components. In one embodiment, for
instance, it is possible to coat particle bombardment particles
with DNA encoding the desired expression cassette(s) and
promoter-free marker cassette, and also coat the same particles
with TAL proteins or TAL protein dimers. In this fashion therefore
a particle may comprise both encoding DNA and TAL proteins, or some
particles may be coated with either the encoding DNA or the TAL
proteins. In any event, plant material can be bombarded with such
coated particles whereupon when the particles enter the plant cell,
the TAL proteins function as intended to cut the genomic sequence
at a desired site and integrate the co-delivered DNA. See for
instance Martin-Ortigosa et al., Adv. Funct. Mater. 32, 3576-3582
(2012), which is incorporated herein by reference, for examples of
how to use particle bombardment to co-deliver proteins and DNA into
plants.
[0055] As used herein, a "desired polynucleotide" is essentially
any polynucleotide or series of DNA sequences within an expression
cassette or gene silencing cassette that the user desires to be
integrated into the plant genome. Accordingly, "desired
polynucleotide" may be used interchangeably with "cassette"
"expression cassette" or "silencing cassette" herein. A desired
polynucleotide in any expression cassette can be operably linked to
any kind or strength of promoter and its expression is not
necessarily therefore dependent on the expression of an endogenous
plant genomic promoter.
[0056] While it is desirable to stably transform plant genomes
according to the present TAL-mediated integration technology,
another embodiment of the inventive methods encompasses the
integration of a desired polynucleotide into any form or sample of
nucleic acid, not
[0057] TALs
[0058] Transcription activator-like (TAL) effectors are plant
pathogenic bacterial proteins that contain modular DNA binding
domains that facilitate site-specific integration of DNAs into a
particularly desired target site, such as in a plant genome. These
domains comprise tandem, polymorphic amino acid repeats that
individually specify contiguous nucleotides in DNA that are useful
for directing the targeted site-specific integration approach.
[0059] A central repeat domain may comprise between 1.5 and 33.5
repeats typically 34 residues in length. An example of a repeat
sequence is:
TABLE-US-00001 (SEQ ID NO: 21)
LTPEQVVAIASHDGGKQALETVQRLLPVLCQAHG.
[0060] The residues at the 12th and 13th positions can be
hypervariable known as the "repeat variable diresidue" or RVD.
There is a relationship between the identity of these two residues
in sequential repeats and sequential DNA bases in the TAL
effector's target site. The code between RVD sequence and target
DNA base can be expressed as:
[0061] NI=A
[0062] HD=C
[0063] NG=T
[0064] NN=R (G or A), and
[0065] NS=N (A, C, G, or T).
[0066] RVD NK can target G, but TAL effector nucleases (TALENs)
that exclusively use NK instead of NN to target G can be less
active.
[0067] Target sites of TAL effectors may include a T flanking the
5'-base targeted by the first repeat perhaps due to a contact
between this T nucleotide and a conserved tryptophan in the region
N-terminal of the central repeat domain.
[0068] See also the following publications which are all
incorporated herein by reference in their entirety: Boch J. Bonas U
(September 2010). "XanthomonasAvrBs3 Family-Type III effectors:
Discovery and Function". Annual Review of Phytopathology 48:
419-36; Voytas D F, Joung J K (December 2009). "Plant science. DNA
binding made easy". Science 326 (5959). 1491-2. Bibcode 2009;
Moscou M J, Bogdanove A J (December 2009). "A simple cipher governs
DNA recognition by TAL effectors". Science 326 (5959); 1501; Boch
J, Scholze H, Schornack S et al. (December 2009); "Breaking the
code of DNA binding specificity of TAL-type III effectors". Science
326(5959); 1509-12: Morbitzer, R; Romer, P.; Boch. J.; Lahaye, T.
(2010). "Regulation of selected genome loci using de
novo-engineered transcription activator-like effector (TALE)-type
transcription factors". Proceedings of the National Academy of
Sciences 107 (50): 21617-21622; Miller, J. C.; Tan S.; Qiao, G.;
Barlow, K. A.; Wang, J.; Xia, D. F.; Meng, X.; Paschon, D. E. et
al. (2010). "A TALE nuclease architecture for efficient genome
editing". Nature Biotechnology 29 (2): 14; Huang. P.; Xiao, A.;
Zhou, M.; Zhu, Z.; Lin, S.; Zhang, B. (2011). "Heritable gene
targeting in zebrafish using customized TALENs". Nature
Biotechnology 29 (8): 699; and Mak, A. N.-S.; Bradley, P.;
Cernadas, R. A.; Bogdanove, A. J.; Stoddard, B. L. (2012). "The
Crystal Structure of TAL Effector PthXol Bound to its DNA Target",
Science. doi:10.1126/science.1216211.
[0069] Markers
[0070] Examples of the categories of marker genes that can be used
as disclosed herein in the promoter-free marker gene cassette
include, but are not limited to, herbicide tolerance, pesticide
tolerance insect resistance, tolerance to stress, enhanced flavor
or stability of the fruit or seed, or the ability to synthesize
useful, non-plant proteins, e.g., medically valuable proteins or
the ability to generate altered concentrations of plant proteins,
and related impacts on the plant, e.g., altered levels of plant
proteins catalyzing production of plant metabolites including
secondary plant metabolites.
[0071] This invention provides methods and kits for the targeted
insertion of desired nucleotide sequences into plants, by inserting
promoter-less desired nucleotide sequences into the intron sequence
of the ubiquitin-7 (Ubi7) gene, such that the expression of
exogenous nucleotide sequences in the plants is driven by the
endogenous Ubi7 promoter. In particular, the inventors were able to
create binary vectors for the transient expression of transcription
activator-like effector nucleases specifically designed to bind
desired nucleotide sequences within the intron sequence of the
Ubi7gene, such that when these vectors are introduced into plant
cells together with binary vectors carrying the targeted
promoter-less nucleotide sequences, the desired nucleotide
sequences are inserted into the intron sequence of the Ubi7 gene
with proper orientation and spacing, and their expression is driven
by the endogenous Ubi7 promoter. The transformed plants
regenerating from the transformed explants thus obtained carry only
the targeted sequences.
[0072] The invention further provides plants transformed by the
methods of the invention, as well as the binary vectors for
transient and permanent transformation.
[0073] The technology strategy of the present invention addresses
the need to efficiently produce genetically engineered plants and
plant products with desirable traits by targeted transformation,
such that the nutritional value and agronomic characteristics of
plant crops, and in particular tuber-bearing plants, such as potato
plants, may be improved. Desirable traits include, but are not
limited to, high tolerance to impact-induced black spot bruise,
reduced formation of the acrylamide precursor asparagine, reduced
accumulation of reducing sugars and reduced accumulation of toxic
Maillard products, including acrylamide. Thus, the present
invention allows the targeted insertion of these desirable traits
into a plant genome by reducing the expression of enzymes, such as
polyphenol oxidase (PPO), which is responsible for black spot
bruise, and asparagine synthetase-1 (Asn-1), which is responsible
for the accumulation of asparagine, a precursor in acrylamide
formation, and by increasing the expression of the late blight
resistance gene Vnt1.
[0074] The present invention uses terms and phrases that are well
known to those practicing the art. Unless defined otherwise, all
technical and scientific terms used herein have the same meaning as
commonly understood by one of ordinary skill in the art to which
this invention belongs. Generally, the nomenclature used herein and
the laboratory procedures in cell culture, molecular genetics, and
nucleic acid chemistry and hybridization described herein are those
well known and commonly employed in the art. Standard techniques
are used for recombinant nucleic acid methods, polynucleotide
synthesis, microbial culture, cell culture, tissue culture,
transformation, transfection, transduction, analytical chemistry,
organic synthetic chemistry, chemical syntheses, chemical analysis,
and pharmaceutical formulation and delivery. Generally, enzymatic
reactions and purification and/or isolation steps are performed
according to the manufacturers' specifications. The techniques and
procedures are generally performed according to conventional
methodology (Molecular Cloning, A Laboratory Manual, 3rd. edition,
edited by Sambrook & Russel Cold Spring Harbor Laboratory
Press, Cold Spring Harbor, N.Y., 2001).
[0075] Agrobacterium or bacterial transformation: as is well known
in the field. Agrobacteria that are used for transforming plant
cells are disarmed and virulent derivatives of, usually,
Agrobacterium tumefaciens or Agrobacterium rhizogenes. Upon
infection of plants, explants, cells, or protoplasts, a single
Agrobacterium strain containing a binary vector comprising a TAL
effector cassette and a binary vector comprising the gene of
interest, or two separate Agrobacterium strains, one containing a
binary vector comprising a TAL effector cassette, and the other
containing a binary vector comprising the gene of interest,
transfer a desired DNA segment from a plasmid vector to the plant
cell nucleus. The vector typically contains a desired
polynucleotide that is located between the borders of a T-DNA.
However, any bacteria capable of transforming a plant cell may be
used, such as, Rhizobium trifolii, Rhizobium leguminosarum,
Phyllobacterium myrsinacearum, SinoRhizobium meliloti, and
MesoRhizobium loti. The present invention is not limited to the use
of bacterial transformation systems. Any organism however that
contains the appropriate cellular machinery and proteins to
accomplish plant cell transformation.
[0076] Angiosperm: vascular plants having seeds enclosed in an
ovary. Angiosperms are seed plants that produce flowers that bear
fruits. Angiosperms are divided into dicotyledonous and
monocotyledonous plant.
[0077] Antibiotic Resistance: ability of a cell to survive in the
presence of an antibiotic. Antibiotic resistance, as used herein,
results from the expression of an antibiotic resistance gene in a
host cell. A cell may have resistance to any antibiotic. Examples
of commonly used antibiotics include kanamycin and hygromycin.
[0078] Dicotyledonous plant (dicot); a flowering plant whose
embryos have two seed halves or cotyledons, branching leaf veins,
and flower parts in multiples of four or five. Examples of dicots
include but are not limited to, potato, sugar beet broccoli,
cassava, sweet potato, pepper, poinsettia, bean, alfalfa, soybean,
and avocado.
[0079] Endogenous: nucleic acid, gene, polynucleotide, DNA, RNA,
mRNA, or cDNA molecule that is isolated either from the genome of a
plant or plant species that is to be transformed or is isolated
from a plant or species that is sexually compatible or interfertile
with the plaint species that is to be transformed, is "native" to,
i.e., indigenous to, the plant species.
[0080] Expression cassette: polynucleotide that may comprise, from
5' to 3', (a) a first promoter, (b) a sequence comprising (i) at
least one copy of a gene or gene fragment, or (ii) at least one
copy of a fragment of the promoter of a gene, and (c) either a
terminator or a second promoter that is positioned in the opposite
orientation as the first promoter.
[0081] Foreign: "foreign," with respect to a nucleic acid, means
that the nucleic acid is derived from non-plant organisms, or
derived from a plant that is not the same species as the plant to
be transformed or is not derived from a plant that is not
interfertile with the plant to be transformed, does not belong to
the species of the target plant. According to the present
invention, foreign DNA or RNA represents nucleic acids that are
naturally occurring in the genetic makeup of fungi, bacteria,
viruses, mammals, fish or birds, but are not naturally occurring in
the plant that is to be transformed. Thus, a foreign nucleic acid
is one that encodes, for instance, a polypeptide that is not
naturally produced by the transformed plant. A foreign nucleic acid
does not have to encode a protein product.
[0082] Gene: A gene is a segment of a DNA molecule that contains
all the information required for synthesis of a product,
polypeptide chain or RNA molecule that includes both coding and
non-coding sequences. A gene can also represent multiple sequences,
each of which may be expressed independently, and may encode
slightly different proteins that display the same functional
activity. For instance, the asparagine synthetase 1 and 2 genes
can, together, be referred to as a gene.
[0083] Genetic element: a "genetic element" is any discreet
nucleotide sequence such as, but not limited to, a promoter, gene,
terminator, intron, enhancer, spacer, 5'-untranslated region,
3'-untranslated region, or recombinase recognition site.
[0084] Genectic modification: stable introduction of DNA into the
genome of certain organisms by applying methods in molecular and
cell biology.
[0085] Gymnosperm: as used herein, refers to a seed plant that
bears seed without ovaries. Examples of gymnosperms include
conifers, cycads, ginkgos, and ephedras.
[0086] Introduction: as used herein, refers to the insertion of a
nucleic acid sequence into a cell, by methods including infection,
transfection, transformation or transduction.
[0087] Monocotyledonous plant (monocot): a flowering plant having
embryos with one cotyledon or seed leaf, parallel leaf veins, and
flower parts in multiples of three. Examples of monocots include,
but are not limited to maize, rice, oat, wheat, barley, and
sorghum.
[0088] Native: nucleic acid, gene, polynucleotide, DNA, RNA, mRNA,
or cDNA molecule that is isolated either from the genome of a plant
or plant species that is to be transformed or is isolated from a
plant or species that is sexually compatible or interfertile with
the plant species that is to be transformed, is "native" to, i.e.,
indigenous to, the plant species.
[0089] Native DNA: any nucleic acid, gene, polynucleotide, DNA,
RNA, mRNA, or cDNA molecule that is isolated either from the genome
of a plant or plant species that is to be transformed or is
isolated from a plant or species that is sexually compatible or
interfertile with the plant species that is to be transformed, is
"native" to, i.e., indigenous to, the plant species. In other
words, a native genetic element represents all genetic material
that is accessible to plant breeders for the improvement of plants
through classical plant breeding. Any variants of a native nucleic
acid also are considered "native" in accordance with the present
invention. For instance, a native DNA may comprise a point mutation
since such point mutations occur naturally. It is also possible to
link two different native DNAs by employing restriction sites
because such sites are ubiquitous in plant genomes.
[0090] Native Nucleic Acid Construct: a polynucleotide comprising
at least one native DNA.
[0091] Operably linked: combining two or more molecules in such a
fashion that in combination they function properly in a plant cell.
For instance, a promoter is operably linked to a structural gene
when the promoter controls transcription of the structural
gene.
[0092] Overexpression: expression of a gene to levels that are
higher than those in control plants.
[0093] P-DNA: a plant-derived transfer-DNA ("P-DNA") border
sequence of the present invention is not identical in nucleotide
sequence to any known bacterium-derived T-DNA border sequence, but
it functions for essentially the same purpose. That is, the P-DNA
can be used to transfer and integrate one polynucleotide into
another. A P-DNA can be inserted into a tumor-inducing plasmid,
such as a Ti-plasmid from Agrobacterium in place of a conventional
T-DNA, and maintained in a bacterium strain, just like conventional
transformation plasmids. The P-DNA can be manipulated so as to
contain a desired polynucleotide, which is destined for integration
into a plant genome via bacteria-mediated plant transformation. The
P-DNA comprises at least one border sequence. See Rommens et al.
2005 Plant Physiology 139: 1338-1349, which is incorporated herein
by reference. In certain embodiments of the invention, the T-DNA is
replaced by the P-DNA.
[0094] Phenotype: phenotype is a distinguishing feature or
characteristic of a plant, which may be altered according to the
present invention by integrating one or more "desired
polynucleotides" and/or screenable/selectable markers into the
genome of at least one plant cell of a transformed plant. The
"desired polynucleotide(s)" and/or markers may confer a change in
the phenotype of a transformed plant, by modifying any one of a
number of genetic, molecular, biochemical, physiological
morphological, or agronomic characteristics or properties of the
transformed plant cell or plant as a whole.
[0095] Plant tissue: a "plant" is any of various photosynthetic,
eukaryotic, multicellular organisms of the kingdom Plantae
characteristically producing embryos, containing chloroplasts, and
having cellulose cell walls. A part of a plant, i.e., a "plant
tissue" may be treated according to the methods of the present
invention to produce a transgenic plant. Many suitable plant
tissues can be transformed according to the present invention and
include, but are not limited to, somatic embryos, pollen, leaves,
stems, calli, stolons, microtubers, and shoots. Thus, the present
invention envisions the transformation of angiosperm and gymnosperm
plants such as wheat, maize, rice, barley, oat, sugar beet, potato,
tomato, alfalfa, cassava, sweet potato, and soybean. According to
the present invention "plant tissue" also encompasses plant cells.
Plant cells include suspension cultures, callus, embryos,
meristematic regions, callus tissue, leaves, roots, shoots,
gametophytes, sporophytes, pollen, seeds and microspores. Plant
tissues may be at various stages of maturity and may be grown in
liquid or solid culture, or in soil or suitable media in pots,
greenhouses or fields. A plant tissue also refers to any clone of
such a plant, seed, progeny, propagule whether generated sexually
or asexually, and descendents of any of these, such as cuttings or
seed. Of particular interest are potato, maize, and wheat.
[0096] Plant transformation and cell culture: broadly refers to the
process by which plant cells are genetically modified and
transferred to an appropriate plant culture medium for maintenance,
further growth, and/or further development. Such methods are well
known to the skilled artisan.
[0097] Processing: the process of producing a food from (1) the
seed of, for instance, wheat, corn, coffee, plant, or cocoa tree,
(2) the tuber of, for instance, potato, or (3) the root of for
instance, sweet potato and yam comprising heating to at least
120.degree. C. Examples of processed foods include bread, breakfast
cereal, pies, cakes, toast, biscuits, cookies, pizza, pretzels,
tortilla, French fries, oven-baked fries, potato chips, hash
browns. roasted coffee, and cocoa.
[0098] Progeny: a "progeny" of the present invention, such as the
progeny of a transgenic plant, is one that is born of, begotten by,
or derived from a plant or the transgenic plant. Thus, a "progeny"
plant, i.e., an "FI" generation plant is an offspring, or a
descendant of the transgenic plant produced by the inventive
methods. A progeny of a transgenic plant may contain in at least
one, some, or all of its cell genomes, the desired polynucleotide
that was integrated into a cell of the parent transgenic plant by
the methods described herein. Thus, the desired polynucleotide is
"transmitted" or "inherited" by the progeny plant. The desired
polynucleotide that is so inherited in the progeny plant may reside
within a T-DNA or P-DNA construct, which also is inherited by the
progeny plant from its parent. The term "progeny" as used herein,
also may be considered to be the offspring or descendants of a
group of plants.
[0099] Promoter: promoter is intended to mean a nucleic acid,
preferably DNA that binds RNA polymerase and/or other transcription
regulatory elements. A promoter is a nucleic acid sequence that
enables a gene with which it is associated to be transcribed. A
regulatory region refers to nucleic acid sequences that influence
anchor promote initiation of transcription. Promoters are typically
considered to include regulatory regions, such as enhancer or
inducer elements.
[0100] Eukaryotic promoters typically lie upstream of the gene to
which they are most immediately associated. Promoters can have
regulatory elements located several kilobases away from their
transcriptional start site, although certain tertiary structural
formations by the transcriptional complex can cause DNA to fold,
which brings those regulatory elements closer to the actual site of
transcription. Many eukaryotic promoters contain a "TATA box"
sequence, typically denoted by the nucleotide sequence, TATAAA.
This element binds a TATA binding protein, which aids formation of
the RNA polymerase transcriptional complex. The TATA box typically
lies within 50 bases of the transcriptional start site.
[0101] Eukaryotic promoters also are characterized by the presence
of certain regulatory sequences that bind transcription factors
involved in the formation of the transcriptional complex. An
example is the E-box denoted by the sequence CACGTG, which binds
transcription factors in the basic-helix-loop-helix family. There
also are regions that are high in GC nucleotide content.
[0102] A polynucleotide may be linked in two different orientations
to the promoter. In one orientation, e.g., "sense", at least the
5'-part of the resultant RNA transcript will share sequence
identity with at least part of at least one target transcript. In
the other orientation designated as "antisense" at least the
5'-part of the predicted transcript will be identical or homologous
to at least part of the inverse complement of at least one target
transcript.
[0103] A plant promoter is a promoter capable of initiating
transcription in plant cells whether or not its origin is a plant
cell. Exemplary plant promoters include, but are not limited to,
those that are obtained from plants, plant viruses, and bacteria
such as Agrobacterium or Rhizobium which comprise genes expressed
in plant cells. Examples of promoters under developmental control
include promoters that preferentially initiate transcription in
certain tissues, such as xylem, leaves, roots, or seeds. Such
promoters are referred to as tissue-preferred promoters. Promoters
which initiate transcription only in certain tissues are referred
to as tissue-specific promoters. A cell type-specific promoter
primarily drives expression in certain cell types in one or more
organs, for example, vascular cells in roots or leaves. An
inducible or repressible promoter is a promoter which is under
environmental control. Examples of environmental conditions that
may effect transcription by inducible promoters include anaerobic
conditions or the presence of light. Tissue specific, tissue
preferred, cell type specific, and inducible promoters constitute
the class of non-constitutive promoters. A constitutive promoter is
a promoter which is active under most environmental conditions, and
in most plant parts.
[0104] Polynucleotide is a nucleotide sequence, comprising a gene
coding sequence or a fragment thereof, (comprising at least 15
consecutive nucleotides, preferably at least 30 consecutive
nucleotides, and more preferably at least 50 consecutive
nucleotides), a promoter, an intron, an enhancer region, a
pulyadenylation site, a translation initiation site, 5' or 3'
untranslated regions, a reporter gene, a selectable marker or the
like. The polynucleotide may comprise single stranded or double
stranded DNA or RNA. The polynucleotide may comprise modified bases
or a modified backbone. The polynucleotide may be genomic, an RNA
transcript (such as an mRNA) or a processed nucleotide sequence
(such as a cDNA). The polynucleotide may comprise a sequence in
either sense or antisense orientations.
[0105] An isolated polynucleotide is a polynucleotide sequence that
is not in its native state, e.g., the polynucleotide is comprised
of a nucleotide sequence not found in nature or the polynucleotide
is separated from nucleotide sequences with which it typically is
in proximity or is next to nucleotide sequences with which it
typically is not in proximity.
[0106] Seed: a "seed" may be regarded as a ripened plant ovule
containing an embryo, and a propagative part of a plant, as a tuber
or spore. Seed may be incubated prior to Agrobacterium-mediated
transformation, in the dark, for instance, to facilitate
germination. Seed also may be sterilized prior to incubation, such
as by brief treatment with bleach. The resultant seedling can then
he exposed to a desired strain of Agrobacterium.
[0107] Selectable/screenable marker: a gene that, if expressed in
plants or plant tissues, makes it possible to distinguish them from
other plants or plant tissues that do not express that gene.
Screening procedures may require assays for expression of proteins
encoded by the screenable marker gene. Examples of selectable
markers include herbicide resistance genes, such as acetolactate
synthase (ALS), the neomycin phosphotransferase (Nptll) gene
encoding kanamycin and geneticin resistance, the hygromycin
phosphotransferase (Hptll) gene encoding resistance to hygromycin,
or other similar genes known in the art.
[0108] Sensory characteristics: panels of professionally trained
individuals can rate food products for sensory characteristics such
as appearance, flavor, aroma, and texture. Thus, the present
invention contemplates improving the sensory characteristics of a
plant product obtained from a plant that has been modified
according to the present invention to manipulate its tuber yield
production.
[0109] Sequence identity: as used herein, "sequence identity" or
"identity" in the context of two nucleic acid or polypeptide
sequences includes reference to the residues in the two sequences
which are the same when aligned for maximum correspondence over a
specified region. A homologous region or sequence as used herein,
therefore describes a sequence that shares some degree of sequence
identity with a target genomic loci. When percentage of sequence
identity is used in reference to proteins it is recognized that
residue positions which are not identical often differ by
conservative amino acid substitutions, where amino acid residues
are substituted for other amino acid residues with similar chemical
properties (e.g. charge or hydrophobicity) and therefore do not
change the functional properties of the molecule. Where sequences
differ in conservative substitutions, the percent sequence identity
may be adjusted upwards to correct for the conservative nature of
the substitution. Sequences which differ by such conservative
substitutions are said to have "sequence similarity" or
"similarity," Means for making this adjustment are well-known to
those of skill in the art. Typically this involves scoring a
conservative substitution as a partial rather than a foil mismatch,
thereby increasing the percentage sequence identity. Thus, for
example, where an identical amino acid is given a score of 1 and a
non conservative substitution is given a score of zero, a
conservative substitution is given a score between zero and 1. The
scoring of conservative substitutions is calculated, e.g.,
according to the algorithm of Meyers and Miller, Computer Applic.
Biol. Sci., 4: 11 17 (1988) e.g., as implemented in the program
PC/GENE (Intelligenetics, Mountain View, Calif., USA).
[0110] As used herein, percentage of sequence identity means the
value determined by comparing two optimally aligned sequences over
a comparison window, wherein the portion of the polynucleotide
sequence in the comparison window may comprise additions or
deletions (i.e., gaps) as compared to the reference sequence (which
does not comprise additions or deletions) for optimal alignment of
the two sequences. The percentage is calculated by determining the
number of positions at which the identical nucleic acid base or
amino acid residue occurs in both sequences to yield the number of
matched positions, dividing the number of matched positions by the
total number of positions in the window of comparison and
multiplying the result by 100 to yield the percentage of sequence
identity.
[0111] "Sequence identity" has an art-recognized meaning and can be
calculated using published techniques. See COMPUTATIONAL MOLECULAR
BIOLOGY, Lesk, ed. (Oxford University Press, 1988), BIOCOMPUTING:
INFORMATICS AND GENOME PROJECTS, Smith, ed. (Academic Press. 1993).
COMPUTER ANALYSIS OF SEQUENCE DATA, PART 1, Griffin & Griffin,
eds., (Humana Press, 1994), SEQUENCE ANALYSIS IN MOLECULAR BIOLOGY,
Von Heinje ed., Academic Press (1987), SEQUENCE ANALYSIS PRIMER,
(Gribskov & Devereus:, eds. (Macmillan Stockton Press, 1991),
and Carillo & Lipton, SIAM J. Applied Math. 48; 1073 (1988).
Methods commonly employed to determine identity or similarity
between two sequences include but are not limited to those
disclosed in GUIDE TO HUGE COMPUTERS, Bishop, ed., (Academic Press,
1994) and Carillo & Lipton, supra. Methods to determine
identity and similarity are codified in computer programs.
Preferred computer program methods to determine identity and
similarity between two sequences include but are not limited to the
GCG program package (Devereux et al., Nucleic Acids Research 12:
387 (1984)), BLASTP, BLASTN, FASTA (Atschul et al., J. Mol. Biol.
215: 403 (1990), and FASTDB (Brutlag et al., Comp. App. Biosci. 6:
237 (1990)).
[0112] Silencing: The unidirectional and unperturbed transcription
of either genes or gene fragments from promoter to terminator can
trigger post-transcriptional silencing of target genes. Initial
expression cassettes for post-transcriptional gene silencing in
plants comprised a single gene fragment positioned in either the
antisense (McCormick et al., U.S. Pat. No. 6,617,496; Shewmaker et
al., U.S. Pat. No. 5,107,065) or sense (van der Krol et al., Plant
Cell 2:291-299, 1990) orientation between regulatory sequences for
transcript initiation and termination. In Arabidopsis, recognition
of the resulting transcripts by RNA-dependent RNA polymerase leads
to the production of double-stranded (ds) RNA. Cleavage of the
dsRNA by Dicer-like (Dcl) proteins such as Dcl4 yields
21-nucleotide (nt) small interfering RNAs (siRNAs). These siRNAs
complex with proteins including members of the Argonaute (Ago)
family to produce RNA-induced silencing complexes (RISCs). The
RISCs then target homologous RNAs for endonucleolytic cleavage.
[0113] More effecting silencing constructs contain both a sense and
antisense component, producing RNA molecules that fold back into
hairpin structures (Waterhouse et al., Proc Natl Acad Sci U S A 95:
13959-13964. 1998). The high dsRNA levels produced by expression of
inverted repeat transgenes were hypothesized to promote the
activity of multiple Dcls. Analyses of combinatorial Dcl knockouts
in Arabidopsis supported this idea, and also identified Dcl4 as one
of the proteins involved in RNA cleavage.
[0114] One component of conventional sense, antisense, and
double-strand (ds) RNA-based gene silencing constructs is the
transcriptional terminator. WO 2006/036739, which is incorporated
in its entirety by reference, shows that this regulatory element
becomes obsolete when gene fragments are positioned between two
oppositely oriented and functionally active promoters. The
resulting convergent transcription triggers gene silencing that is
at least as effective as unidirectional `promoter-to-terminator`
transcription. In addition to short variably-sized and
non-polyadenylated RNAs, terminator-free cassette produced rare
longer transcripts that reach into the flanking promoter.
Replacement of gene fragments by promoter-derived sequences further
increased the extent of gene silencing.
[0115] TAL effectors (TALE) are proteins secreted by Xanthomonas
bacteria characterized by the presence of a DNA binding domain that
contains a repeated highly conserved 33-34 amino acid sequence,
except for the highly variable 12th and 13th amino acids, which
show a strong correlation with specific nucleotide recognition.
These proteins can bind promoter sequences in the host plant and
activate the expression of plant genes. This application makes use
of engineered TAL effectors that are fused to the cleavage domain
of Fok1 endonucleases for the targeted insertion of desirable genes
into plants.
[0116] Tissue: any part of a plant that is used to produce a food.
A tissue can be a tuber of a potato, a root of a sweet potato, or a
seed of a maize plant.
[0117] Transcriptional terminators: The expression DNA constructs
of the present indention typically have a transcriptional
termination region at the opposite end from the transcription
initiation regulatory region. The transcriptional termination
region may be selected, for stability of the mRNA to enhance
expression and or for the addition of polyadenylation tails added
to the gene transcription product. Translation of a nascent
polypeptide undergoes termination when any of the three
chain-termination codons enters the A site on the ribosome.
Translation termination codons are UAA, UAG, and UGA. In the
instant invention, transcription terminators are derived from
either a gene or, more preferably, from a sequence that does not
represent a gene but intergenic DNA. For example, the terminator
sequence from the potato ubiquitin gene may be used.
[0118] Transfer DNA (T-DNA): a transfer DNA is a DNA segment
delineated by either T-DNA borders or P-DNA borders to create a
T-DNA or P-DNA, respectively. A T-DNA is a genetic element that is
well-known as an element capable of integrating a nucleotide
sequence contained within its borders into another genome. In this
respect, a T-DNA is flanked, typically, by two "border" sequences.
A desired polynucleotide of the present invention and a selectable
marker may be positioned between the left border-like sequence and
the right border-like sequence of a T-DNA. The desired
polynucleotide and selectable marker contained within the T-DNA may
be operably linked to a variety of different, plant-specific (i.e.,
native), or foreign nucleic acids, like promoter and terminator
regulatory elements that facilitate its expression, i.e.,
transcription and/or translation of the DNA sequence encoded by the
desired polynucleotide or selectable marker.
[0119] Transformation of plant cells: A process by which a nucleic
acid is stably inserted into the genome of a plant cell.
Transformation may occur under natural or artificial conditions
using various methods well known in the art. Transformation may
rely on any known method for the insertion of nucleic acid
sequences into a prokaryotic or eukaryotic host cell, including
Agrobacterium-mediated transformation protocols such as `refined
transformation` or `precise breeding`, viral infection, whiskers,
electroporation, microinjection, polyethylene glycol-treatment,
heat shock, lipofection and particle bombardment.
[0120] Transgenic plant: a transgenic plant of the present
invention is one that comprises at least one cell genome in which
an exogenous nucleic acid has been stably integrated. According to
the present invention, a transgenic plant is a plant that comprises
only one genetically modified cell and cell genome, or is a plant
that comprises some genetically modified cells, or is a plant in
which all of the cells are genetically modified. A transgenic plant
of the present invention may be one that comprises expression of
the desired polynucleotide, i.e., the exogenous nucleic acid, in
only certain parts of the plant. Thus, a transgenic plant may
contain only genetically modified cells in certain parts of its
structure.
[0121] Variant: a "variant," as used herein, is understood to mean
a nucleotide or amino acid sequence that deviates from the
standard, or given, nucleotide or amino acid sequence of a
particular gene or protein. The terms, "isoform," "isotype," and
"analog" also refer to "variant" forms of a nucleotide or an amino
acid sequence. An amino acid sequence that is altered by the
addition, removal or substitution of one or more ammo acids, or a
change in nucleotide sequence, may be considered a "variant"
sequence. The variant may have "conservative" changes, wherein a
substituted amino acid has similar structural or chemical
properties, e.g., replacement of leucine with isoleucine. A variant
may have "nonconservative" changes, e.g., replacement of a glycine
with a tryptophan. Analogous minor variations may also include
amino acid deletions or insertions, or both. Guidance in
determining which amino acid residues may be substituted, inserted,
or deleted may be found using computer programs well known in the
art such as Vector NTI Suite (InforMax, MD) software. "Variant" may
also refer to a "shuffled gene" such as those described in
Maxygen-assigned patents.
[0122] Although the present application primarily uses TAL to
illustrate the targeted transfer DNA insertion technology, it is
understood that other endonuclease based genome editing enzymes can
also be used, including meganuclease (Epinat et al., Nucleic Acids
Res. 31 (11):2952-2963 (2003)), Zinc finger nuclease (ZFN)
(Porteus, et al., Science 300 (5260):763 (2003); Bogdanove et al.,
Science 333 (6051):1843-6 (2011)), and CRISPR-associated (Cas)
endonuclease (Jinek et al., Science 337(6096):816-21 (2012);
Mussolino et al., Nat Methods 8(9):725-6 (2013)). In this regard,
Example 15 illustrates the successful use of Cas9 for the targeted
transfer DNA insertion technology described herein.
[0123] It is understood that the present invention is not limited
to the particular methodology, protocols, vectors, and reagents,
etc., described herein, as these may vary. It is also to be
understood that the terminology used herein is used tor the purpose
of describing particular embodiments only, and is not intended to
limit the scope of the present invention. It must be noted that as
used herein and in the appended claims, the singular forms "a,"
"an," and "the" include plural reference unless the context clearly
dictates otherwise. Thus, for example, a reference to "a gene" is a
reference to one or more genes and includes equivalents thereof
known to those skilled in the art and so forth. Indeed, one skilled
in the art can use the methods described herein to express any
native gene (known presently or subsequently) in plant host
systems.
[0124] The following examples are set forth as representative of
specific and preferred embodiments of the present invention. These
examples are not to be construed as limiting the scope of the
invention in any manner. It should be understood that many
variations and modifications can be made while remaining within the
spirit and scope of the invention
Additional Embodiments
Embodiment 1
[0125] A method for stably integrating a desired polynucleotide
into a plant genome, comprising:
(A) transforming plant material with a first vector comprising
nucleotide sequences encoding TAL or CRISPR proteins designed to
recognize a target sequence; (B) transforming the plant material
with a second vector comprising (i) a marker gene that is not
operably linked to a promoter ("promoter-free marker cassette") and
which comprises a sequence homologous to the target sequence; and
(ii) a desired polynucleotide; and (C) identifying transformed
plant material in which the desired polynucleotide is stably
integrated.
Embodiment 2
[0126] The method of Embodiment 1, wherein the transformed plant
material is exposed to conditions that reflect the presence or
absence of the marker gene in the transformed plant.
Embodiment 3
[0127] The method of any of Embodiments 1-2, wherein the marker
gene is a herbicide resistance gene and the transformed plant
material is exposed to herbicide.
Embodiment 4
[0128] The method of any of Embodiments 1-3, wherein the herbicide
resistance gene is the ALS gene.
Embodiment 5
[0129] The method of any of Embodiments 1-4, wherein the
promoter-free marker cassette is stably integrated into the plant
genome.
Embodiment 6
[0130] A method for the targeted insertion of exogenous DNA into a
plant comprising the steps of
[0131] (i) transforming isolated plant cells with [0132] (A) a
first binary vector comprising a promoter-less cassette comprising
(a) a right border sequence linked to (b) a partial sequence of the
Ubi7 intron 5'-translated region; (c) an Ubi7 monomer-encoding
sequence fused to a mutated acetolactate synthase (ALS) gene; (d) a
desired nucleotide sequence; and (e) a terminator sequence, wherein
the desired nucleotide sequence is not operably linked to a
promoter; and [0133] (B) a second binary vector comprising (a) a
right border; (b) a forward expression cassette and a reverse
expression cassette, each comprising a modified TAL effector or
Cas9 operably linked to a strong constitutive promoter, and a
terminator sequence; and (c) a sequence encoding isopentenyl
transferase (ipt), wherein the modified TAL effector or Cas9 is
designed to bind the desired nucleotide sequence within an intron
of potato's ubiquitin-7 (Ubi7) gene; and
[0134] (ii) culturing the isolated plant cells under conditions
that promote growth of plants that express the desired nucleotide
sequence; wherein no vector backbone DNA is permanently inserted
into the plant genome.
Embodiment 7
[0135] The method of Embodiment 6, wherein the modified TAL
effector comprises (a) a truncated C-terminal activation domain
comprising a Fok1 endonuclease catalytic domain; (b) a
codon-optimized target sequence binding domain comprising 16.5
repeat variable diresidues corresponding to the Ubi7
5'-untranslated intron sequence; and (c) an N-terminal region
comprising a SV40 nuclear localization sequence.
Embodiment 8
[0136] The method of any of Embodiments 6-7, wherein the desired
nucleotide sequence is a silencing cassette targeting one or more
genes selected from the group consisting of asparagine synthase 1
(Asn1), polyphenol oxidase (Ppo), and vacuolar invertase (Inv)
genes.
Embodiment 9
[0137] The method of any of Embodiments 6-8, wherein the first
binary vector further comprises a late blight resistance gene Vnt1
operably linked to its native promoter and terminator
sequences.
Embodiment 10
[0138] A transformed plant comprising in its genome an endogenous
Ubi7 promoter operably linked to a desired exogenous nucleotide
sequence operably linked to an exogenous terminator sequence.
Embodiment 11
[0139] The transformed plant of Embodiment 10, wherein the
expression of one or more genes selected from the group consisting
of asparagine synthase 1 (Asn1), polyphenol oxidase (Ppo), and
vacular invertase (Inv) genes is down-regulated.
Embodiment 12
[0140] The transformed plant of any of Embodiments 10-11, wherein
the plant further expresses a late blight resistance gene Vnt1.
Embodiment 13
[0141] The transformed plant of any of Embodiments 10-12, wherein
the plant is a tuber-bearing plant.
Embodiment 14
[0142] The transformed plant of any of Embodiments 10-13, wherein
the tuber-bearing plant is a potato plant.
Embodiment 15
[0143] The transformed plant of any of Embodiments 10-14, wherein
the plant has a phenotype characterized by one or more of late
blight resistance, black spot bruise tolerance, reduced
cold-induced sweetening and reduced asparagine levels in its
tubers.
Embodiment 16
[0144] A heat-processed product of the transformed plant of any of
Embodiments 10-15.
Embodiment 17
[0145] The heat-processed product of Embodiment 16, wherein the
product is a French fry, chip, crisp, potato, dehydrated potato or
baked potato.
Embodiment 18
[0146] The heat-processed product of Embodiments 16 or 17, wherein
the heat-processed product has a lower level of acrylamide than a
heat-processed product of a non-transformed plant of the same
species.
Embodiment 19
[0147] A modified TAL effector designed to bind to a desired
sequence comprising (a) a truncated C-terminal activation domain
comprising a catalytic domain; (b) a codon-optimized target
sequence binding domain designed to bind a 5'-untranslated intron
sequence; and (c) an N-terminal region comprising a nuclear
localization sequence.
Embodiment 20
[0148] The modified TAL effector of Embodiment 19, wherein the
modified TAL effector is designed to bind the desired sequence
within an intron of potato's ubiquitin-7 (Ubi7) gene.
Embodiment 21
[0149] The modified TAL effector of Embodiment 19 or 20, wherein
(a) the catalytic domain in the C-terminal activation domain
comprises a Fok1 endonuclease; (b) the target sequence binding
domain comprises 16.5 repeat variable diresidues corresponding to
the Ubi7 5'-untranslated intron sequence; and (c) the nuclear
localization sequence in the N-terminal region is a SV40 nuclear
localization sequence.
Embodiment 22
[0150] A binary vector comprising (a) a right border; (b) a forward
expression cassette and a reverse expression cassette, each
encoding a modified TAL effector according to any of Embodiments
19-21 operably linked to a strong constitutive promoter and a
terminator sequence; and (c) a sequence encoding isopentenyl
transferase (ipt).
Embodiment 23
[0151] A DNA construct comprising a promoter-less cassette
comprising (a) a right border sequence linked to (b) a partial Ubi
7 5'-untranslated intron sequence; (c) an Ubi7 monomer-encoding
sequence fused to a mutated acetolactate synthase (ALS) gene; (d) a
desired nucleotide sequence; (e) a terminator sequence; and (f) a
left border, wherein the desired nucleotide sequence is not
operably linked to a promoter.
Embodiment 24
[0152] The DMA construct of Embodiment 23, wherein the desired
nucleotide sequence is a silencing cassette targeting one or more
genes selected from the group consisting of asparagine synthase 1
(Asn1), polyphenol oxidase (Ppo), and vacuolar invertase (Inv)
genes.
Embodiment 25
[0153] The DNA construct of Embodiment 23 or 24, wherein the DNA
construct further comprises a late blight resistance gene Vnt1
operably linked to its native promoter and terminator
sequences.
Embodiment 26
[0154] A kit for targeted insertion of exogenous DNA into a plant
comprising:
[0155] (A) a first binary vector comprising a promoter-less
cassette comprising (a) a right border sequence linked to (b) a
partial sequence of the Ubi7 intron 5'-untranslated region; (c) an
Ubi7 monomer-encoding sequence fused to a mutated acetolactate
synthase (ALS) gene; (d) a desired nucleotide sequence; and (e) a
terminator sequence, wherein the desired nucleotide sequence is not
operably finked to a promoter; and
[0156] (B) a second binary vector comprising (a) a right border;
(b) a forward expression cassette and a reverse expression
cassette, each comprising a modified TAL effector or Cas9operably
linked to a strong constitutive promoter, and a terminator
sequence; and (e) a sequence encoding isopentenyl transferase
(ipt), wherein the modified TAL effector or Cas9 is designed to
bind the desired nucleotide sequence within an intron of potato's
ubiquitin-7 (Ubi7) gene.
Embodiment 27
[0157] The kit of Embodiment 26, wherein the modified TAL effector
comprises (a) a truncated C-terminal activation domain comprising a
Fok1 endonuclease catalytic domain; (b) a codon-optimized target
sequence binding domain comprising 16.5 repeat variable diresidues
corresponding to the Ubi7 5'-untranslated intron sequence; and (c)
an N-terminal region comprising a SV40 nuclear localization
sequence.
Embodiment 28
[0158] The kit of Embodiment 26 or 27, wherein the desired
nucleotide sequence is a silencing cassette targeting one or more
genes selected from the group consisting of asparagine synthase 1
(Asn1), polyphenol oxidase (Ppo), and vacuolar invertase (Inv)
genes.
Embodiment 29
[0159] The kit of any of Embodiments 26-28, wherein the first
binary vector further comprises a late blight resistance gene Vnt1
operably linked to its native promoter and terminator
sequences.
Embodiment 30
[0160] A method for targeted insertion of a transfer DNA into a
plant genome, comprising: transforming plant material with one or
more vectors, which comprise:
[0161] a first genetic cassette encoding an endonuclease-based
enzyme that selectively introduces a double-stranded DNA break
within a 5' untranslated intron sequence of a targeted gene locus
of the plant genome, and
[0162] a second genetic cassette which (a) does not comprise a
promoter, (b) comprises a desired gene sequence, and (c) comprises
a homologous sequence that mediates homologous recombination-based
repair of the double-stranded DNA break introduced by the
endonuclease-based enzyme, and
[0163] wherein the transformed plant material comprises the desired
gene sequence selectively inserted in the targeted gene locus and
operable linked to a promoter associated with the 5' untranslated
intron sequence.
Embodiment 33
[0164] The method of Embodiment 30, wherein the endonuclease-based
enzyme is TAL effector.
Embodiment 32
[0165] The method of Embodiment 30, wherein the endonuclease-based
enzyme is Cas9.
Embodiment 33
[0166] The method of any of Embodiments 30-32, wherein the targeted
gene locus is potato's ubiquitin-7 (Ubi7) gene.
Embodiment 34
[0167] The method of any of Embodiments 30-33, further comprising
selecting the transformed plant based on a phenotype conferred by
the expression of the desired gene sequence inserted in targeted
gene locus.
Embodiment 35
[0168] A transformed plant obtained by the method of any of
Embodiments 30-34, comprising a modified plant genome which
comprises, form 5' to 3', an endogenous promoter operably linked to
an exogenous gene sequence operably linked to an exogenous
terminator.
Embodiment 36
[0169] A vector suitable for the method of any of Embodiments
30-34, encoding a transfer DNA comprising a right border sequence
linked to a promoter-less genetic cassette, wherein the
promoter-less genetic cassette expresses a protein, of an RNA
transcript when inserted in the targeted gene locus and operably
linked to a promoter associated with the 5' untranslated intron
sequence.
EXAMPLES
Example 1
Method for Targeted Insertion
[0170] A preferred target site for gene insertion is within an
intron positioned in the untranslated 5'-leader region of the
potato's ubiquitin-7 (Ubi7) gene. Potato is tetraploid and contains
four copies of this gene; the copies are identical or
near-identical. The Ubi7 genes are expressed at high levels in a
near-constitutive manner, which suggests that they are located in
regions that promote transcriptional activity. Sequences positioned
within a transfer DNA are therefore expected to be effectively
expressed. Furthermore, insertional inactivation of one of the Ubi7
genes is not expected to cause any quality or agronomic issues
because potato still contains three functionally-active copies of
the gene.
[0171] DNA segments were inserted into the intron sequence of the
ubiquitin-7 gene according to the following steps:
[0172] (1) TAL effectors were designed to bind to sequences within
the intron, which is (a) more than about 25-bp upstream from the
region comprising branch site (consensus=CU(A/G)A(C/U)),
pyrimidine-rich (=AT-rich) sequence, and intron/exon junction
(consensus=CAGG), and (b) more than about 50-bp downstream from the
splice donor site at the exon/intron junction (consensus=AGGT).
[0173] (2) A binary vector was created for transient expression of
the TAL effectors in plant cells. This sector contains (a) a single
right border but no left border; (b) two TAL effector genes
operably linked to strong constitutive promoters: and (c) an
expression cassette for the isopentyl transferase (ipt) gene
involved in cytokinin production. Stable transformation can be
selected against because it would result in integration of the
entire vector and, consequently, produce stunted shoots that
overexpress cytokinins and are unable to produce roots.
[0174] (3) A second binary vector was created for stable
transformation with a transfer DNA comprising genetic elements from
potato delineated by borders: (a) right border; (b) part of the
intron of the Ubi7 promoter, starting from the sequence between
targeted TAL binding sites; (c) Ubi7 monomer-encoding sequence: (d)
modified acetolactate synthase (ALS) gene that is insensitive to as
least one ALS inhibitor selected from the group including
sulfonylureas, imidazolinones, triazolopyrimidines, pyrimidinyl
oxybenzoates, and sulfonylamino carbonyl triazolinones; (e)
terminator of the ubiquitin-3 gene; (f) silencing cassette
targeting the asparagine synthase 1 (Asn1), polyphenol oxidase
(Ppo), and vacuolar invertase (Inv) genes; (g) late blight
resistance gene Vnt1 operably linked to its native promoter and
terminator sequences; and (h) left border. The vector backbone
contains, apart from sequences required for maintenance and
selection in E. coli and A. tumefaciens, an expression cassette or
the ipt gene.
[0175] (4) The two binary vectors were separately introduced into
the A. tumefaciens AGL-1 strain.
[0176] (5) Potato stem explants were co-infected with the two
strains from step (4) and then co-cultivated for two days.
[0177] (6) Explants were transferred to media containing selection
agents that kill Agrobacterium.
[0178] (7) Two weeks after transformation, the explants were again
transferred to media also containing an ALS inhibitor.
[0179] (8) Herbicide resistant shoots arising from the explants
within the next three months were transferred to root-inducing
media and analyzed by PCR for the presence of a junction between
the Ubi7 promoter and the modified ALS gene. At least 80% of
regenerated plants contained such a junction.
[0180] (9) PCR-positive plants were regenerated, propagated, and
evaluated for late blight resistance, reduced asparagine levels in
tubers, black spot bruise tolerance, and reduced cold-induced
sweetening.
[0181] The next examples describe aspects of the method.
Example 2
Imazamox Kill-Curve Essay
[0182] To determine the concentration of imazamox needed to kill
untransformed potato cells, Ranger Russet internode stem explants
were transformed with the binary vector pSIM1331. This vector
contains (a) an expression cassette for the selectable marker gene
nptII inserted between borders and (b) an expression cassette for
the ipt gene in the backbone. The strain used to mediate
transformation was Agrobacterium strain LBA4404, grown to an OD600
of 0.2. Following a 10 minute inoculation period, the explants were
transferred to co-culture medium and placed in a Percival growth
chamber at 24.degree. C. under filtered light for 48 hours.
Inter-node explants were transferred to hormone-free medium (HFM)
containing the antibiotic timentin but lacking imazamox. Petri
plates were place in Percival growth chamber at 24.degree. C. and a
16 h photoperiod.
[0183] After two weeks, the inter-node explants were transferred to
HFM containing timentin and five treatment concentrations (0, 0.5,
1.0, 1.5 & 2.0 mg/l) of the plant selection herbicide imazamox.
Each treatment consisted of 3 replicates with each replicate
containing .about.20 inter-node explants per Petri plate. Petri
plates were placed in Percival growth chamber at 24.degree. C. and
a 16 h photoperiod. Inter-node explants were subcultured every 2
weeks to fresh HFM containing the respective treatment
concentration of imazamox to encourage any regeneration of shoots
and reduce any Agrobacterium over-growth.
[0184] Results indicated that a small number of inter-node explants
in all imazamox treatment concentrations exhibited some Ipt
meristamatic callus growth and primary shoot formation. However, no
fully developed normal shoots arose in any of the imazamox
treatment concentrations. Based upon these results, it was
determined that 2.0 mg/l imazamox is the optimal concentration for
in vitro selection. Optimal concentration is defined as the
concentration of a selective agent that allows cell growth to some
degree but does not allow regeneration of fully developed
shoots.
[0185] The co-culture medium included 0.444 g/l Murashige &
Skoog modified basal medium with Gamborg vitamins (M404;
PhytoTechnology Laboratories), 30 g/l sucrose (S24060; Research
Products International Corp.) and 6.0 g/l agar (S20400; Research
Products International Corp.) and had pH 5.7.
[0186] The hormone-free medium (HFM) included 4.44 g/l Murashige
& Skoog modified basal medium with Gamborg vitamins (404;
PhytoTechnology Laboratories), 30 g/l sucrose (S24060; Research
Products International Corp.), 300 mg/l timentin and 2.0g/l Gelzan
(G024; Caisson), and had pH 5.7
Example 3
Transformation and Regeneration of Potato Plants from Stem Explains
Single Strain Approach
[0187] (1) 3-4-week old in vitro Ranger Russet potato plants
growing on stock medium comprising 2.22 g/l Murashige & Skoog
modified basal medium with Gamborg vitamins (M404; PhytoTechnology
Laboratories), 15 g/l sucrose (S24060; Research Products
International Corp.) and 2.0 g/l Gelzan (G024; Caisson) at pH 5.7,
were used.
[0188] (2) The leaves and node sections were removed and inter-node
stem portions were isolated. The inter-node stem portions were cut
into 3-5 mm explants sections and placed in 15 ml of MS liquid
medium containing 4.44 g/l Murashige & Skoog modified basal
medium with Gamborg vitamins (M404; PhytoTechnology Laboratories),
and 30 g/l sucrose (S24060; Research Products International Corp.)
at pH 5.7.
[0189] (3) Agrobacterium (LBA440) derived from a single colony
containing a binary vector TAL effector cassette and a binary
vector gene-of-interest cassette was grown overnight in Luria Broth
at 28.degree. C. in a shaking incubator. The next day the bacterial
solution was pelleted and resuspended to 0.2 OD600 in MS liquid
medium.
[0190] (4) Stem explants were incubated in the bacterial solution
for 10 minutes at room temperature and blotted dry on sterile
filter paper to remove excels of bacteria.
[0191] (5) The inoculated stem explains were placed on co-culture
medium without selection in a Percival growth chamber for 48 h
under filtered light. The co-culture medium contained 0.444 g/l
Murashige & Skoog modified basal medium with Gamborg vitamins
(M404; PhytoTechnology Laboratories), 30 g/l sucrose (S24060;
Research Products International Corp.) and 6.0 g/l agar (S20400;
Research Products International Corp.) at pH 5.7
[0192] (6) The stem explains were transferred to either callus
induction hormone medium (CIHM) or hormone-free medium (HFM)
containing antibiotics (timentin) and without plant selection.
Petri plates were placed in a Percival growth chamber at 24.degree.
C. with a 16 h photoperiod. The CIHM contained 4.44 g/l Murashige
& Skoog modified basal medium with Gamborg vitamins (M404;
Phytotechnology Laboratories), 30 g/l sucrose(S24060; Research
Products International Corp.), 2.5 mg/l zeatin riboside, 0.1 mg/l
NAA, 300 mg/l timentin and 6.0 g/l agar (S20400; Research Products
International Corp.) at pH 5.7. The HFM contained 4.44 g/l
Murashige & Skoog modified basal medium with Gamborg vitamins
(M404; PhytoTechnology Laboratories), 30 g/l sucrose (S24060;
Research Products International Corp.), 300 mg/l timentin and 2.0
g/l Gelzan (G024; Caisson) at pH 5.7
[0193] (7) After two weeks, the stem explants were transferred to
either callus induction hormone medium (CIHM) or hormone-free
medium (HFM) containing antibiotics (timentin) and plant selection.
Petri plates were placed in a Percival growth chamber at 24.degree.
C. with a 16 h photoperiod. The CIHM contained 4.44 g/l Murashige
& Skoog modified basal medium with Gamborg vitamins (M404;
PhytoTechnology Laboratories), 10 g/l sucrose (S24060; Research
Products International Corp.), 2.5 mg/l zeatin riboside, 0.1 mg/l
NAA, 300 mg/l timentin, 2.0 mg/l imazamox and 6.0 g/l agar (S20400;
Research Products International Corp.) at pH 5.7. The HFM contained
4.44 g/l Murashige & Skoog modified basal medium with Gamborg
vitamins (M404; PhytoTechnology Laboratories), 30 g/l sucrose
(S24060; Research Products International Corp.), 300 mg/l timentin,
2.0 mg/l imazamox and 2.0 g/l Gelzan (OD24; Caisson) at pH 5.7.
[0194] (8) Four weeks post-transformation, the stem explants were
transferred to either Shoot induction hormone medium (SIHM) or
hormone-free medium (HFM) containing antibiotics (timentin) and
plant selection. Petri plates were placed in a Percival growth
chamber at 24.degree. C. with a 16 h photoperiod. Stem explants
were sub-cultured every 2-4 weeks to fresh SIHM or HFM to encourage
full regeneration of shoots. The SIHM contained 4.44 g/l Murashige
& Skoog modified basal medium with Gamborg vitamins (M404;
PhytoTechnology Laboratories), 30 g/l sucrose (S24060: Research
Products International Corp.), 2,5 mg/l zeatin riboside, 0.3 mg/l
GA3, 300 mg/l timentin, 2.0 mg/l imazamox and 6.0 g/l agar (S20400;
Research Products International Corp.) at pH 5.7. The HFM contained
4.44 g/l Murashige & Skoog modified basal medium with Gamborg
vitamins (M404; PhytoTechnology Laboratories), 30 g/l sucrose
(S24060; Research Products International Corp.), 300 mg/l timentin,
2.0 mg/l imazamox and 2.0 g/l Gelzan (G024; Caisson) at pH 5.7.
[0195] (9) Fully developed shoots were propagated for future
testing and analysis.
Example 4
Transformation and Regeneration of Potato Plants from Stem Explants
Double Strain Approach
[0196] (1) 3-4 week-old in vitro Ranger Russet potato plants
growing on stock medium containing 2.22 g/l Murashige & Skoog
modified basal medium with Gamborg vitamins (M404; Phytotechnology
Laboratories), 15 g/l sucrose (S24060; Research Products
International Corp.) and 2.0 g/l Gelzan (G024; Caisson) at pH 5.7
were used.
[0197] The leaves and node sections were removed and inter-node
stem portions were isolated. The inter-node stem portions were cut
into 3-5 mm explants sections and placed in 15 ml of MS liquid
medium containing 4.44 g/l Murashige & Skoog modified basal
medium with Gamborg vitamins (M404; PhytoTechnology Laboratories),
and 30 g/l sucrose (S24060; Research Products International Corp.)
at pH 5.7
[0198] (3) Two separate Agrobacterium strains (LBA4404), each
derived from a single colony, one containing a binary vector
comprising a TAL effector cassette and the other containing a
binary vector comprising a gene-of-interest cassette, were grown
overnight in Luria Broth at 28.degree. C. in a shaking incubator.
The next day, each separate bacterial solution was pelleted and
re-suspended to 0.2 OD600 in MS liquid medium.
[0199] (4) Stem explants were incubated in a single combined
bacterial solution that consisted of equal volumes from each
individual bacterial solution (co-transformation) for 10 minutes at
room temperature and blotted dry on sterile fiber paper to remove
excess of bacteria.
[0200] (5) The inoculated stem explants were placed on co-culture
medium without selection in a Percival growth chamber for 48 h
under filtered light. The co-culture medium contained 0.444 g/l
Murashige & Skoog modified basal medium with Gamborg vitamins
(M404; PhytoTechnology Laboratories), 30 g/l sucrose (S24060;
Research Products International Corp.) and 6.0 g/l agar (S20400;
Research Products International Corp.) at pH 5.7.
[0201] (6) The stem explants were transferred to either callus
induction hormone medium (CIHM) or hormone-free medium (HFM)
containing antibiotics (Timentin) and without plant selection. The
Petri plates were placed in a Percival growth chamber at 24.degree.
C. with a 16 h photoperiod. The CIHM contained 4.44 g/l Murashige
& Skoog modified basal medium with Gamborg vitamins (M404;
PhytoTechnology Laboratories), 30 g/l sucrose (S24060; Research
Products International Corp.), 2.5 mg/l Zeatin Riboside, 0.1 mg/l
NAA, 300 mg/l Timentin and 6.0 g/l agar (S20400; Research Products
International Corp.) at pH 5.7. The HFM contained 4.44 g/l
Murashige & Skoog modified basal medium with Gamborg vitamins
(M404; PhytoTechnology Laboratories), 30 g/l sucrose (S24060;
Research Products International Corp.), 300 mg/l Timentin and 2.0
g/l Gelzan (G024; Caisson) at pH 5.7.
[0202] (7) After two weeks, the stem explants were transferred to
either callus induction hormone medium (CIHM) or hormone-free
medium (HFM) containing antibiotics (Timentin) and plant selection.
The Petri plates were placed in a Percival growth chamber at
24.degree. C. with a 16 h photoperiod. The CIHM contained 4.44 g/l
Murashige & Skoog modified basal medium with Gamborg vitamins
(M404; PhytoTechnology Laboratories), 30 g/l sucrose (S24060;
Research Products International Corp.). 2.5 mg/l Zeatin Riboside,
0.1 mg/l NAA, 3.00 mg/l Timentin, 2.0 mg/l imazamox and 6.0 g/l
agar (S20400; Research Products International Corp.) at pH 5.7. The
HFM contained 4.44 g/l Murashige & Skoog modified basal medium
with Gamborg vitamins (M404; PhytoTechnology Laboratories), 30 g/l
sucrose (S24060; Research Products International Corp.), 300 mg/l
Timentin, 2.0 mg/l imazamox and 2.0 g/l Gelzan (G024; Caisson) at
pH 5.7.
[0203] (8) Four weeks post-transformation, the stem explants were
transferred to either Shoot induction hormone medium (SIHM) or
hormone-free medium (HFM) containing antibiotics (Timentin) and
plant selection. The Petri plates were placed in a Percival growth
chamber at 24.degree. C. with a 16 h photoperiod. Stem explants
were subcultured every 2-4 weeks to fresh SIHM or HFM to encourage
full regeneration of shoots. The SIHM contained 4.44 g/l Murashige
& Skoog modified basal medium with Gamborg vitamins (M404;
PhytoTechnology Laboratories), 30 g/l sucrose (S24060; Research
Products International Corp.), 2.5 mg/l Zeatin Riboside, 0.3 mg/l
GA3, 300 mg/l Timentin, 2.0 mg/l imazamox and 6.0 g/l agar (S20400;
Research Products International Corp.) at pH 5.7. The HFM contained
4.44 g/l Murashige & Skoog modified basal medium with Gamborg
vitamins (M404; PhytoTechnology Laboratories), 30 g/l sucrose
(S24060; Research Products International Corp.), 300 mg/l Timentin,
2.0 mg/l imazamox and 2.0 g/l Gelzan (G024; Caisson) at pH 5.7
[0204] (9) Fully developed shoots were propagated for future
testing and analysis.
Example 5
Target Site Sequence in Potato Ranger, Burbank and Atlantic
Cultivars
[0205] To determine if the target region (5' region of Ubi7
promoter intron) is conserved across different potato cultivars,
primer pair HD175F1 and HD175R1 (SEQ ID NO: 1 and SEQ ID NO: 2)
were designed and used to amplify target region from the potato
varieties Ranger, Burbank, and Atlantic. The amplified fragments
were cloned into pGEMT-easy vector and sequenced. Sequence result
showed that the target regions is identical for all varieties
tested. The ubi7 promoter intron sequence is represented by SEQ ID
NO: 3.
Example 6
Design of TAL Effectors
[0206] A pair of TAL effectors was designed to target the selected
region. Forward and reverse TALE recognition sites are listed as
SEQ ID NO: 4 and SEQ ID NO: 5, respectively. The TALE scaffold was
Hax3, a member of the AvrBs3 family that was identified in
Brassicaceae pathogen X. campestris pv. Armoraciae strain 5. The
modification made on this scaffold included: (a) the C-terminal
activation domain of original Hax3 was truncated; (b) a nuclear
localization sequence from SV0 virus was added at the N terminal of
truncated Hax3 protein; (c) a codon optimization was performed on
original Hax3 DNA sequence; (d) original 11.5 repeat variable
diresidues (RVD) were replaced by 16.5 RVDs corresponding to the
targeting sites; (e) a catalytic domain of Fok1 nuclease was added
at the C-terminal of modified Hax3 scaffold.
Example 7
Vector for DNA Transfer
[0207] The transfer DNA consisted of potato-derived genetic
elements and was delineated by T-DNA-like borders. It included
three cassettes from left border to right border; (a) a late blight
resistant cassette; (b) a tuber-specific silencing cassette
targeting three genes: the ASN1 gene involved in asparagine
formation; the acidic invertase (INV) gene associated with
hydrolysis of sucrose; and the polyphenol oxidase (PPO) gene that
encodes the enzyme oxidizing polyphenols upon impact braise; and
(c) a promoter-less mutated potato acetolactate synthase (ALS) gene
(with W563L AND S642I substitutions) that was hypothesized to
confer resistance to ALS inhibiting herbicides when
over-expressed.
[0208] The transfer DNA was designed to be inserted into the intron
region positioned within the leader of one of potato's four Ubi7
genes, so that the associated Ubi7 promoter would drive expression
of the ALS gene and confer resistance against ALS inhibitor-type
herbicides.
[0209] Since the Ubi7 monomer plays an important role in protein
stabilization, the coding sequence, preceded by part of the intron,
was fused in frame to the ALS gene. Insertion of the transfer DNA
into a binary vector created the plasmid pSIM2168. The organization
of the transfer DNA is illustrated in FIG. 1. The DNA and protein
sequences of wild type and mutated ALS gene are represented by SEQ
ID NOS: 10, 11, 12 and 13. The whole transfer DNA sequence in
pSIM2168 is represented by SEQ ID NO: 14.
Example 8
Vector for TAL Effectors
[0210] Each TAL effector, forward or reverse, was driven by a
constitutive (35s or FMV) promoter and followed by a terminator
(Nos or Ocs), to form two separate plant expression cassettes. The
two cassettes were cloned into a binary vector to form the pSIM2170
as shown in FIG. 2. This binary vector had only one border and
contained an ipt gene expression cassette so that it was possible
to select against stable integration of the effector genes.
Example 9
The Right Border Upstream the Ubi7 Intron 5' Region Supports DNA
Transfer
[0211] Because efficacy of the border as primary cleavage site is
dependent, in part, on flanking DNA sequences, a right border
upstream the Ubi7 intron 5' region was tested for its ability to
support DNA transfer. For this purpose, a DNA fragment comprising
the right border/intron sequence upstream from the Ubi7 monomer and
modified ALS gene was cloned into the binary vector pSIM123-F to
for pSIM2164. Vector pSIM123-F contained an expression cassette for
the selectable marker gene nptII, but lacked the borders needed to
transfer this cassette into plant cells (see FIG. 3). Nevertheless,
infection of explants with an Agrobacterium strain carrying the
pSIM2164 generated the same number of kanamycin resistant shoots
per explant as a positive control (infection with a strain carrying
the nptII gene positioned within T-DNA borders).
[0212] To test the efficiency of the mutated ALS gene in conferring
imazamox resistance to potato, a vector carrying a Ubi7::ALS
cassette (pSIM2162, see FIG. 4) was created. Transformation with
this vector yielded herbicide resistant plants that were confirmed
by PCR to contain the Ubi7::ALS cassette.
Example 10
Vector Design for Transient Transformation in N. benthamiana
[0213] To test the efficiency of the specifically designed TALEs in
vivo, a vector with the target sequence (part of the Ubi7 intron)
was designed. This vector, pSIM2167, was co-transformed with the
vector carrying the effectors into N. benthamiana. As shown in FIG.
6, the target sequence contained the forward and reverse
recognition sites positioned immediately downstream from the start
codon of the GUS reporter gene. A stop codon between the two
recognition sequences and in frame wish the GUS coding sequence
rendered the GUS coding sequence inactive. If the TALEs bind their
designed recognition sites and cleave in the intermediary sequence,
subsequent repair would be expected to occasionally eliminate the
stop codon without altering the reading frame, thus restoring GUS
function. Such events, could be visualized by histochemically
staining the N. benthamiana leaves, about 4 days after
infiltration.
[0214] The target sequence region can also be PCR amplified and
sequenced to identify TALE mediated mutations. However, direct PCR
and cloning of the target sequence would yield an un-modified
target sequence because of the possible low efficiency of
transformation. Therefore, the isolated DNA was first digested with
the AluI enzyme, which cleaves the AGCT restriction site located
between the two TALE recognition sites. After amplification, the
PCR products were again digested with Alul to further enrich the
mutated target sequence for downstream cloning and sequencing
analyses. The entire sequence of FMV-target-GUS-Nos cassette is
represented by SEQ ID N: 15. The PCR primers used for amplifying
the target sequence are represented by SEQ ID NO: 16 and SEQ ID NO:
17.
Example 11
Agrobacterium Transformation and N. benthamiana Infiltration
[0215] The designed sectors were transformed into Agrobacterium
strain AGL1 and tested for vector stability. Four to six days after
infiltration, leaf discs from infiltrated tissue were collected for
GUS staining assay and DNA isolation. Isolated DNA was digested
with the AluI enzyme and used as template for target region
amplification and further cloning and sequencing. As shown in FIG.
7, GUS staining was observed in co-infiltrated tissue (right panel)
but not in the tissue infiltrated by target vector alone (left
panel). Further sequence analyses showed in FIG. 8 also confirmed
that the target sequence was modified by TALEs.
Example 12
Genotyping of Stable Transformants
[0216] Primer pairs HD208 F1 and R1 were designed to genotype
herbicide resistant transformants. The forward primer is located in
the promoter region of Ubi7 gene, and the reverse primer is located
in ALS coding region. The primer pair is targeted-insertion
specific primer because only if the transfer DNA is inserted into
the designed position, the primer pair will amplify a fragment. PCR
analysis of the independent herbicide resistant lines from the
co-transformation of pSIM2170 and pSIM2168 did amplify fragments.
These fragments were cloned and sequenced. As shown in FIG. 9, in
one line, TALE1, the fragment contained part of the transfer DNA
cassette, including the partial Ubi7 intron, the Ubi7 monomer and
part of the ALS coding region, flanked by potato genome sequence.
Sequence blast showed that the flanked potato genome is the
promoter region of an Ubi7 like gene located on chromosome 7 which
also contains very similar recognition sites of the designed TALE.
In another two lines, TALE2 and TALE3, the transfer DNA cassettes
were inserted into the same genomic loci as in TALE1, except that
intron portions of the transfer DNA cassette were largely deleted.
The TALE2 and TALE3 lines were very similar, except that in TALE2
there was a 9 bp deletion in the Ubi7 monomer.
Example 13
Characterization of Stable Transformed Lines for Targeted
Insertion
[0217] The data and results described above indicated that the
targeted insertion of an intended DNA segment was successful.
Herbicide resistant Ranger Russet (RR) lines from the
co-transformation of pSIM2170 and pSIM2168 were propagated and
transferred to soil for following tests/analyses. Specifically, the
transformed lines were tested for resistance to Late blight
diseases challenge, by determining the activity of the enzyme
polyphenol oxidase, and running southern analyses for copy number
of both silencing and Vnt1 cassettes. For diseases assay, plantlets
in soil for three weeks were inoculated with P. infestans late
blight strain US8 BF6 for the development of disease symptom. For
Southern blot analyses, 3 .mu.g DNA isloated from leaf tissues were
digested by HindIII restriction enzyme, run on 0.7% agarose gel,
transferred to positive charged nylon membrane and hybridized with
Dig labeled probes either for invertase fragment in silencing
cassette or for Vnt1 promoter in Vnt1 expression cassette. Four
lines were identified and summarized in Table 1 below. These lines
were late blight resistant (see FIG. 11) and had a single copy for
both cassettes (see FIG. 12). Each extra band in line RR-36 and
RR-39, as compared to RR control lines, indicated the presence of a
single copy of the transgene. (Data for line RR-26 and RR-32 are
not shown).
TABLE-US-00002 TABLE 1 Line Characterization for Targeted
Insertion. Line number Late Blight Invertase Copy No. Vnt1 Copy No.
Ranger control susceptible 0 0 RR-26 resistant 1 1 RR-32 resistant
1 1 RR-36 resistant 1 1 RR-38 resistant 1 1
Example 14
Field Trial Evaluation of Transformed Lines for Targeted
Insertion
[0218] Plantlets of Ranger Russet (RR) and Snowden (SN) that were
co-transformed with pSIM2170 and pSIM2168 were planted in a
replicated field trial . Plant lines were evaluated for trait
efficacy and yield. Snowden lines 2, 15, 55 <& 83 (see FIG.
13) as well as Ranger lines 26, 32, 38 & 39 (see FIG. 15) were
very uniform for the silencing of asparagine in potato tubers. In
addition, the same SN lines (see FIG. 14) and RR lines (see FIG.
16) also had very uniform silencing of polyphenol oxidase (PPO).
These results indicate that the target site allows for uniform and
high expression of the silencing cassette. Yield from the above
mention SN and RR lines (sec FIG. 17) were not significantly
different when compared to the wild type (WT) and empty vector
controls (2162, 2320). This result suggest that the targeted
insertion site does not have a negative impact on yield
potential.
Example 15
CAS9-Mediated Targeted Transfer DNA Insertion
[0219] Besides Transcription activator-like effector nucleases
(TALEN) there are other endonuclease based genome editing enzymes
such as meganuclease (Epinat et al., 2003), Zinc finger nuclease
(ZFN) (Porteus and Baltimore, 2003), (Bogdanove and Voytas, 2011)
and CRISPR-associated (Cas) endonuclease (Jinek et al., 2012;
Mussolino, C. & Cathomen 2013) that could introduce DSB in the
target DNA sequence. Once DSB was generated, plant DNA repair
machinery will either repair the break through a non-homologous end
joining (NHEJ) pathway, which is imprecise and creates mutations to
achieve gene knock out, or through a homologous recombination (HR)
pathway to achieve gene targeting (gene replacement or insertions
(Symington and Gautier 2011). Therefore, the similar strategy used
in our TALEN based DNA integration is transferable into other
nuclease based genome editing tools. As an example, here we show an
engineered CAS9 endonuclease can modify on our Ubi7 intron DNA
target.
[0220] Cas9 genome editing technology uses a small chimeric RNA
which contains a 20 bp target specific sequence and a small RNA
scaffold to guide Cas9 nucleases to cleave the target. Construct
pSIM4187 was designed to contain two expression cassettes (FIG.
18). The first is a CAS9 nuclease expression cassette. Plant
codon-optimized Cas9 was driven by a constitutive FMV promoter and
a nuclear localization sequence from SV40 virus was added at the N
terminal of protein. The DNA and amino acid sequences of engineered
Cas9 are listed as SEQ ID No: 18 and SEQ ID No: 19, respectively.
The second cassette produces guide RNA upon transcription in the
plant cell under the control of a constitutive 35S promoter. The
sequence of guide RNA is listed as SEQ ID NO: 20. The designed
vectors pSIM4187 was transformed into Agrobacterium strain AGL1 and
checked for vector stability. Then agrobacteria containing pSIM4187
were used to co-infiltrate N. benthamina with agrobacteria
containing plasmid pSIM2167, which is the construct containing the
Ubi7 intron target described in previous examples. Two to four days
after infiltration, leaf discs from the infiltrated tissues were
collected for GUS staining assay and DNA isolation. Isolated DNA
was digested with AluI enzyme and used as template for target
region amplification and further cloning and sequencing. As shown
in FIG. 19, GUS staining was observed in co-infiltrated tissue
(right panel) but not in the tissue infiltrated by target vector
alone (left panel). Further sequence analyses showed in FIG. 20
confirmed the target sequence was modified by Cas9. The
modification is very similar to that induced by designed TALEN.
Sequence CWU 1
1
51120DNAArtificial SequenceDescription of Artificial Sequence
Synthetic primer 1tcctaatttt ccccaccaca 20220DNAArtificial
SequenceDescription of Artificial Sequence Synthetic primer
2aacagccgga gaaactcaaa 203568DNASolanum tuberosum 3gttagaaatc
ttctctattt ttggtttttg tctgtttaga ttctcgaatt agctaatcag 60gtgctgttat
agcccttaat tttgagtttt ttttcggttg tcttgatgga aaaggcctaa
120aatttgagtt tttttacgtt ggtttgatgg aaaaggccta caattggagt
tttccccgtt 180gttttgatga aaaagcccct agtttgagat tttttttctg
tcgattcgat tctaaaggtt 240taaaattaga gtttttacat ttgtttgatg
aaaaaggcct taaatttgag tttttccggt 300tgatttgatg aaaaagccct
agaatttgtg ttttttcgtc ggtttgattc tgaaggccta 360aaatttgagt
ttctccggct gttttgatga aaaagcccta aatttgagtt tctccggctg
420ttttgatgaa aaagccctaa atttgagttt tttccccgtg ttttagattg
tttggtttta 480attctcgaat cagctaatca gggagtgtga aaagccctaa
atttgagttt ttttcgttgt 540tctgattgtt gtttttatga atttgcag
568418DNAArtificial SequenceDescription of Artificial Sequence
Synthetic oligonucleotide 4ttttgtctgt ttagattc 18518DNAArtificial
SequenceDescription of Artificial Sequence Synthetic
oligonucleotide 5ttaagggcta taacagca 1863396DNAArtificial
SequenceDescription of Artificial Sequence Synthetic polynucleotide
6atggctccca aaaagaagag aaaggtagaa ccaggatcac ctggtggaca atcacttatg
60gacccaatac gaagcagaac gccatcacca gctagggaac ttctctctgg accacagcct
120gatggagttc agccaactgc agatcgaggt gtttctccgc cagccggtgg
ccctttagat 180ggactcccag caagaagaac aatgtcccgt accagactcc
caagtccccc tgccccgtcg 240ccagcctttt cagctgactc cttctctgat
cttcttaggc aatttgaccc ttctcttttc 300aatacatccc ttttcgattc
acttcctcct ttcggcgcac atcatactga ggcagccacc 360ggcgaatggg
acgaagtcca aagtggttta agggcagctg atgctccacc accgacgatg
420agagtcgctg ttaccgccgc acgtcctcct agagccaagc cagcccctag
aagacgagct 480gcgcaaccct ccgatgcaag ccctgcagct caagtagacc
ttcgaacact aggttactcc 540cagcaacaac aagaaaaaat aaagccaaag
gttagatcaa cagttgcaca acatcacgaa 600gccctagtcg gacacggatt
tacacatgct catatcgtgg ctctttcaca acatcctgca 660gctcttggaa
cagtcgctgt caaatatcag gatatgattg ctgcattgcc agaagctact
720cacgaagcta tcgtcggagt tgggaaacaa tggtcaggcg caagagcatt
agaggcgctt 780ctcaccgtag ctggtgaatt acgaggtcct ccactccaat
tggatactgg gcaattatta 840aaaatcgcta aacgaggtgg agtcactgct
gtcgaagccg ttcatgcatg gcgtaacgct 900ctcacggggg ccccactaaa
ccttacccca caacaagttg tggcaatagc ttctaatggt 960ggtggtaaac
aagcccttga gacggttcaa agacttctac cagttctttg tcaggcacat
1020ggattgaccc cacaacaggt cgtagcaatc gcatctaacg gaggtggtaa
gcaagctctt 1080gaaacggtac aaagattact tcccgtgctt tgtcaagctc
atggactcac tcctcaacaa 1140gtggtcgcta ttgcaagtaa cggtggtgga
aagcaagcac tagaaaccgt ccaacgactc 1200cttcctgttc tctgtcaagc
acatggtttg actcctcagc aggtcgtcgc aattgcatca 1260aacaatggag
gcaaacaagc tttagaaaca gtacaaagac tattgcccgt tctttgccaa
1320gcgcatgggt taactcccga acaagtcgtt gccattgcaa gtaacggagg
aggtaaacaa 1380gctctcgaaa cggttcaagc acttttaccc gttctctgtc
aagcacatgg actcacacct 1440gaacaagtag ttgctatcgc atcgcatgat
ggtggaaaac aagcactgga aactgtacaa 1500agacttttgc cagttttatg
tcaagcgcac ggtcttactc ctcaacaagt tgtcgccatt 1560gcctctaatg
gaggtggaaa acaagctctt gaaactgtcc agagacttct gcccgttcta
1620tgtcaggctc atgggctaac ccctcaacag gttgttgcaa tcgcatctaa
taatggagga 1680aaacaagctt tagaaactgt ccaacgacta ctgcccgttc
tctgccaagc acacggactt 1740acaccacaac aggttgtagc tatagctagc
aatggtggcg gtaaacaggc tttggaaaca 1800gtacagcggc ttctaccagt
cttatgccaa gcccacgggc ttactcctca acaagttgtc 1860gccattgcct
ctaatggagg tggaaaacaa gctcttgaaa ctgtccagag acttctgccc
1920gttctatgtc aggctcatgg gcttactcct gaacaggttg tcgcaatagc
ttcaaacggt 1980ggcggaaaac aagctcttga aacagtgcaa cgtctccttc
ccgtcctctg tcaggctcac 2040ggacttacgc ccgaacaagt tgttgctata
gcttcgaata ttggtggaaa acaagctctc 2100gaaaccgtcc aaaggctcct
cccagtactt tgccaagcac atggattaac ccctgagcaa 2160gtagttgcaa
ttgcctcgaa caatggagga aagcaagcat tagaaactgt tcagagactt
2220ttgcctgtcc tgtgtcaagc ccacggtctt acaccagagc aggttgtcgc
tatagcttct 2280aacattggtg gaaagcaagc tcttgagact gtgcaacgtt
tgcttccagt cctctgtcaa 2340gcacacggac tcactccaca acaggtggtt
gcaattgctt caaatggcgg tggcaaacaa 2400gcattagaga ctgtacagag
actacttcct gttctttgtc aagcacaagg gctcacccct 2460gagcaggtag
tcgctatcgc ctcaaatggt ggcgggaagc aggccctgga gactgttcag
2520agactactgc ccgtcctatg tcaggctcac ggtctaacac cacaacaagt
cgtcgcaatc 2580gctagtcatg acggaggtcg acctgctcta gagtcgatag
tcgcacaact atcacgacct 2640gatcccgctc ttgcagcatt gacaaacgat
catttagtcg cacttgcatg tttaggagga 2700cgaccagcac ttgatgccgt
taagaaagga ctaccgcacg cccctgcatt gattaaaaga 2760acaaacagac
gaatcccgga gagaacttca catcgtgtag ccaagcaact tgtcaaaagt
2820gaactggagg agaagaaatc tgaacttcgt cataaattga aatatgtgcc
tcatgaatat 2880attgaattaa ttgaaattgc cagaaattcc actcaggata
gaattcttga aatgaaggta 2940atggaatttt ttatgaaagt ttatggatat
agaggtaaac atttgggtgg atcaaggaaa 3000ccggacggag caatttatac
tgtcggatct cctattgatt acggtgtgat cgtggatact 3060aaagcttata
gcggaggtta taatctgcca attggccaag cagatgaaat gcaacgatat
3120gtcgaagaaa atcaaacacg aaacaaacat atcaacccta atgaatggtg
gaaagtctat 3180ccatcttctg taacggaatt taagttttta tttgtgagtg
gtcactttaa aggaaactac 3240aaagctcagc ttacacgatt aaatcatatc
actaattgta atggagctgt tcttagtgta 3300gaagagcttt taattggtgg
agaaatgatt aaagccggca cattaacctt agaggaagtg 3360agacggaaat
ttaataacgg cgagataaac ttttga 339671131PRTArtificial
SequenceDescription of Artificial Sequence Synthetic polypeptide
7Met Ala Pro Lys Lys Lys Arg Lys Val Glu Pro Gly Ser Pro Gly Gly 1
5 10 15 Gln Ser Leu Met Asp Pro Ile Arg Ser Arg Thr Pro Ser Pro Ala
Arg 20 25 30 Glu Leu Leu Ser Gly Pro Gln Pro Asp Gly Val Gln Pro
Thr Ala Asp 35 40 45 Arg Gly Val Ser Pro Pro Ala Gly Gly Pro Leu
Asp Gly Leu Pro Ala 50 55 60 Arg Arg Thr Met Ser Arg Thr Arg Leu
Pro Ser Pro Pro Ala Pro Ser 65 70 75 80 Pro Ala Phe Ser Ala Asp Ser
Phe Ser Asp Leu Leu Arg Gln Phe Asp 85 90 95 Pro Ser Leu Phe Asn
Thr Ser Leu Phe Asp Ser Leu Pro Pro Phe Gly 100 105 110 Ala His His
Thr Glu Ala Ala Thr Gly Glu Trp Asp Glu Val Gln Ser 115 120 125 Gly
Leu Arg Ala Ala Asp Ala Pro Pro Pro Thr Met Arg Val Ala Val 130 135
140 Thr Ala Ala Arg Pro Pro Arg Ala Lys Pro Ala Pro Arg Arg Arg Ala
145 150 155 160 Ala Gln Pro Ser Asp Ala Ser Pro Ala Ala Gln Val Asp
Leu Arg Thr 165 170 175 Leu Gly Tyr Ser Gln Gln Gln Gln Glu Lys Ile
Lys Pro Lys Val Arg 180 185 190 Ser Thr Val Ala Gln His His Glu Ala
Leu Val Gly His Gly Phe Thr 195 200 205 His Ala His Ile Val Ala Leu
Ser Gln His Pro Ala Ala Leu Gly Thr 210 215 220 Val Ala Val Lys Tyr
Gln Asp Met Ile Ala Ala Leu Pro Glu Ala Thr 225 230 235 240 His Glu
Ala Ile Val Gly Val Gly Lys Gln Trp Ser Gly Ala Arg Ala 245 250 255
Leu Glu Ala Leu Leu Thr Val Ala Gly Glu Leu Arg Gly Pro Pro Leu 260
265 270 Gln Leu Asp Thr Gly Gln Leu Leu Lys Ile Ala Lys Arg Gly Gly
Val 275 280 285 Thr Ala Val Glu Ala Val His Ala Trp Arg Asn Ala Leu
Thr Gly Ala 290 295 300 Pro Leu Asn Leu Thr Pro Gln Gln Val Val Ala
Ile Ala Ser Asn Gly 305 310 315 320 Gly Gly Lys Gln Ala Leu Glu Thr
Val Gln Arg Leu Leu Pro Val Leu 325 330 335 Cys Gln Ala His Gly Leu
Thr Pro Gln Gln Val Val Ala Ile Ala Ser 340 345 350 Asn Gly Gly Gly
Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro 355 360 365 Val Leu
Cys Gln Ala His Gly Leu Thr Pro Gln Gln Val Val Ala Ile 370 375 380
Ala Ser Asn Gly Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu 385
390 395 400 Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Gln Gln
Val Val 405 410 415 Ala Ile Ala Ser Asn Asn Gly Gly Lys Gln Ala Leu
Glu Thr Val Gln 420 425 430 Arg Leu Leu Pro Val Leu Cys Gln Ala His
Gly Leu Thr Pro Glu Gln 435 440 445 Val Val Ala Ile Ala Ser Asn Gly
Gly Gly Lys Gln Ala Leu Glu Thr 450 455 460 Val Gln Ala Leu Leu Pro
Val Leu Cys Gln Ala His Gly Leu Thr Pro 465 470 475 480 Glu Gln Val
Val Ala Ile Ala Ser His Asp Gly Gly Lys Gln Ala Leu 485 490 495 Glu
Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu 500 505
510 Thr Pro Gln Gln Val Val Ala Ile Ala Ser Asn Gly Gly Gly Lys Gln
515 520 525 Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln
Ala His 530 535 540 Gly Leu Thr Pro Gln Gln Val Val Ala Ile Ala Ser
Asn Asn Gly Gly 545 550 555 560 Lys Gln Ala Leu Glu Thr Val Gln Arg
Leu Leu Pro Val Leu Cys Gln 565 570 575 Ala His Gly Leu Thr Pro Gln
Gln Val Val Ala Ile Ala Ser Asn Gly 580 585 590 Gly Gly Lys Gln Ala
Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu 595 600 605 Cys Gln Ala
His Gly Leu Thr Pro Gln Gln Val Val Ala Ile Ala Ser 610 615 620 Asn
Gly Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro 625 630
635 640 Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala
Ile 645 650 655 Ala Ser Asn Gly Gly Gly Lys Gln Ala Leu Glu Thr Val
Gln Arg Leu 660 665 670 Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr
Pro Glu Gln Val Val 675 680 685 Ala Ile Ala Ser Asn Ile Gly Gly Lys
Gln Ala Leu Glu Thr Val Gln 690 695 700 Arg Leu Leu Pro Val Leu Cys
Gln Ala His Gly Leu Thr Pro Glu Gln 705 710 715 720 Val Val Ala Ile
Ala Ser Asn Asn Gly Gly Lys Gln Ala Leu Glu Thr 725 730 735 Val Gln
Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro 740 745 750
Glu Gln Val Val Ala Ile Ala Ser Asn Ile Gly Gly Lys Gln Ala Leu 755
760 765 Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly
Leu 770 775 780 Thr Pro Gln Gln Val Val Ala Ile Ala Ser Asn Gly Gly
Gly Lys Gln 785 790 795 800 Ala Leu Glu Thr Val Gln Arg Leu Leu Pro
Val Leu Cys Gln Ala Gln 805 810 815 Gly Leu Thr Pro Glu Gln Val Val
Ala Ile Ala Ser Asn Gly Gly Gly 820 825 830 Lys Gln Ala Leu Glu Thr
Val Gln Arg Leu Leu Pro Val Leu Cys Gln 835 840 845 Ala His Gly Leu
Thr Pro Gln Gln Val Val Ala Ile Ala Ser His Asp 850 855 860 Gly Gly
Arg Pro Ala Leu Glu Ser Ile Val Ala Gln Leu Ser Arg Pro 865 870 875
880 Asp Pro Ala Leu Ala Ala Leu Thr Asn Asp His Leu Val Ala Leu Ala
885 890 895 Cys Leu Gly Gly Arg Pro Ala Leu Asp Ala Val Lys Lys Gly
Leu Pro 900 905 910 His Ala Pro Ala Leu Ile Lys Arg Thr Asn Arg Arg
Ile Pro Glu Arg 915 920 925 Thr Ser His Arg Val Ala Lys Gln Leu Val
Lys Ser Glu Leu Glu Glu 930 935 940 Lys Lys Ser Glu Leu Arg His Lys
Leu Lys Tyr Val Pro His Glu Tyr 945 950 955 960 Ile Glu Leu Ile Glu
Ile Ala Arg Asn Ser Thr Gln Asp Arg Ile Leu 965 970 975 Glu Met Lys
Val Met Glu Phe Phe Met Lys Val Tyr Gly Tyr Arg Gly 980 985 990 Lys
His Leu Gly Gly Ser Arg Lys Pro Asp Gly Ala Ile Tyr Thr Val 995
1000 1005 Gly Ser Pro Ile Asp Tyr Gly Val Ile Val Asp Thr Lys Ala
Tyr 1010 1015 1020 Ser Gly Gly Tyr Asn Leu Pro Ile Gly Gln Ala Asp
Glu Met Gln 1025 1030 1035 Arg Tyr Val Glu Glu Asn Gln Thr Arg Asn
Lys His Ile Asn Pro 1040 1045 1050 Asn Glu Trp Trp Lys Val Tyr Pro
Ser Ser Val Thr Glu Phe Lys 1055 1060 1065 Phe Leu Phe Val Ser Gly
His Phe Lys Gly Asn Tyr Lys Ala Gln 1070 1075 1080 Leu Thr Arg Leu
Asn His Ile Thr Asn Cys Asn Gly Ala Val Leu 1085 1090 1095 Ser Val
Glu Glu Leu Leu Ile Gly Gly Glu Met Ile Lys Ala Gly 1100 1105 1110
Thr Leu Thr Leu Glu Glu Val Arg Arg Lys Phe Asn Asn Gly Glu 1115
1120 1125 Ile Asn Phe 1130 83396DNAArtificial SequenceDescription
of Artificial Sequence Synthetic polynucleotide 8atggctccca
aaaagaagag aaaggtagaa ccaggatcac ctggtggaca atcacttatg 60gacccaatac
gaagcagaac gccatcacca gctagggaac ttctctctgg accacagcct
120gatggagttc agccaactgc agatcgaggt gtttctccgc cagccggtgg
ccctttagat 180ggactcccag caagaagaac aatgtcccgt accagactcc
caagtccccc tgccccgtcg 240ccagcctttt cagctgactc cttctctgat
cttcttaggc aatttgaccc ttctcttttc 300aatacatccc ttttcgattc
acttcctcct ttcggcgcac atcatactga ggcagccacc 360ggcgaatggg
acgaagtcca aagtggttta agggcagctg atgctccacc accgacgatg
420agagtcgctg ttaccgccgc acgtcctcct agagccaagc cagcccctag
aagacgagct 480gcgcaaccct ccgatgcaag ccctgcagct caagtagacc
ttcgaacact aggttactcc 540cagcaacaac aagaaaaaat aaagccaaag
gttagatcaa cagttgcaca acatcacgaa 600gccctagtcg gacacggatt
tacacatgct catatcgtgg ctctttcaca acatcctgca 660gctcttggaa
cagtcgctgt caaatatcag gatatgattg ctgcattgcc agaagctact
720cacgaagcta tcgtcggagt tgggaaacaa tggtcaggcg caagagcatt
agaggcgctt 780ctcaccgtag ctggtgaatt acgaggtcct ccactccaat
tggatactgg gcaattatta 840aaaatcgcta aacgaggtgg agtcactgct
gtcgaagccg ttcatgcatg gcgtaacgct 900ctcacggggg ccccactaaa
ccttacccca caacaagttg tggcaatagc ttctaatgga 960ggtggtaaac
aagcccttga gacggttcaa agacttctac cagttctttg tcaggcacat
1020ggattgaccc cacaacaggt cgtagcaatc gcatctaaca ttggtggtaa
gcaagctctt 1080gaaacggtac aaagattact tcccgtgctt tgtcaagctc
atggactcac tcctcaacaa 1140gtggtcgcta ttgcaagtaa tattggtgga
aagcaagcac tagaaaccgt ccaacgactc 1200cttcctgttc tctgtcaagc
acatggtttg actcctcagc aggtcgtcgc aattgcatca 1260aataacggag
gcaaacaagc tttagaaaca gtacaaagac tattgcccgt tctttgccaa
1320gcgcatgggt taactcccga acaagtcgtt gccattgcaa gtaacaatgg
aggtaaacaa 1380gctctcgaaa cggttcaagc acttttaccc gttctctgtc
aagcacatgg actcacacct 1440gaacaagtag ttgctatcgc atcgaataat
ggtggaaaac aagcactgga aactgtacaa 1500agacttttgc cagttttatg
tcaagcgcac ggtcttactc ctcaacaagt tgtcgccatt 1560gcctctcatg
atggtggaaa acaagctctt gaaactgtcc agagacttct gcccgttcta
1620tgtcaggctc atgggctaac ccctcaacag gttgttgcaa tcgcatctaa
tggtggagga 1680aaacaagctt tagaaactgt ccaacgacta ctgcccgttc
tctgccaagc acacggactt 1740acaccacaac aggttgtagc tatagctagc
aatattggcg gtaaacaggc tttggaaaca 1800gtacagcggc ttctaccagt
cttatgccaa gcccacgggc ttactcctca acaagttgtc 1860gccattgcct
ctaacggagg tggaaaacaa gctcttgaaa ctgtccagag acttctgccc
1920gttctatgtc aggctcatgg gcttactcct gaacaggttg tcgcaatagc
ttcaaacatt 1980ggcggaaaac aagctcttga aacagtgcaa cgtctccttc
ccgtcctctg tcaggctcac 2040ggacttacgc ccgaacaagt tgttgctata
gcttcgaata ttggtggaaa acaagctctc 2100gaaaccgtcc aaaggctcct
cccagtactt tgccaagcac atggattaac ccctgagcaa 2160gtagttgcaa
ttgcctcgca cgatggagga aagcaagcat tagaaactgt tcagagactt
2220ttgcctgtcc tgtgtcaagc ccacggtctt acaccagagc aggttgtcgc
tatagcttct 2280aatatcggtg gaaagcaagc tcttgagact gtgcaacgtt
tgcttccagt cctctgtcaa 2340gcacacggac tcactccaca acaggtggtt
gcaattgctt caaataatgg tggcaaacaa 2400gcattagaga ctgtacagag
actacttcct gttctttgtc aagcacaagg gctcacccct 2460gagcaggtag
tcgctatcgc ctcacacgac ggcgggaagc aggccctgga gactgttcag
2520agactactgc ccgtcctatg tcaggctcac ggtctaacac cacaacaagt
cgtcgcaatc 2580gctagtaata ttggaggtcg acctgctcta gagtcgatag
tcgcacaact atcacgacct 2640gatcccgctc ttgcagcatt gacaaacgat
catttagtcg cacttgcatg tttaggagga 2700cgaccagcac ttgatgccgt
taagaaagga ctaccgcacg cccctgcatt gattaaaaga 2760acaaacagac
gaatcccgga gagaacttca catcgtgtag ccaagcaact
tgtcaaaagt 2820gaactggagg agaagaaatc tgaacttcgt cataaattga
aatatgtgcc tcatgaatat 2880attgaattaa ttgaaattgc cagaaattcc
actcaggata gaattcttga aatgaaggta 2940atggaatttt ttatgaaagt
ttatggatat agaggtaaac atttgggtgg atcaaggaaa 3000ccggacggag
caatttatac tgtcggatct cctattgatt acggtgtgat cgtggatact
3060aaagcttata gcggaggtta taatctgcca attggccaag cagatgaaat
gcaacgatat 3120gtcgaagaaa atcaaacacg aaacaaacat atcaacccta
atgaatggtg gaaagtctat 3180ccatcttctg taacggaatt taagttttta
tttgtgagtg gtcactttaa aggaaactac 3240aaagctcagc ttacacgatt
aaatcatatc actaattgta atggagctgt tcttagtgta 3300gaagagcttt
taattggtgg agaaatgatt aaagccggca cattaacctt agaggaagtg
3360agacggaaat ttaataacgg cgagataaac ttttga 339691131PRTArtificial
SequenceDescription of Artificial Sequence Synthetic polypeptide
9Met Ala Pro Lys Lys Lys Arg Lys Val Glu Pro Gly Ser Pro Gly Gly 1
5 10 15 Gln Ser Leu Met Asp Pro Ile Arg Ser Arg Thr Pro Ser Pro Ala
Arg 20 25 30 Glu Leu Leu Ser Gly Pro Gln Pro Asp Gly Val Gln Pro
Thr Ala Asp 35 40 45 Arg Gly Val Ser Pro Pro Ala Gly Gly Pro Leu
Asp Gly Leu Pro Ala 50 55 60 Arg Arg Thr Met Ser Arg Thr Arg Leu
Pro Ser Pro Pro Ala Pro Ser 65 70 75 80 Pro Ala Phe Ser Ala Asp Ser
Phe Ser Asp Leu Leu Arg Gln Phe Asp 85 90 95 Pro Ser Leu Phe Asn
Thr Ser Leu Phe Asp Ser Leu Pro Pro Phe Gly 100 105 110 Ala His His
Thr Glu Ala Ala Thr Gly Glu Trp Asp Glu Val Gln Ser 115 120 125 Gly
Leu Arg Ala Ala Asp Ala Pro Pro Pro Thr Met Arg Val Ala Val 130 135
140 Thr Ala Ala Arg Pro Pro Arg Ala Lys Pro Ala Pro Arg Arg Arg Ala
145 150 155 160 Ala Gln Pro Ser Asp Ala Ser Pro Ala Ala Gln Val Asp
Leu Arg Thr 165 170 175 Leu Gly Tyr Ser Gln Gln Gln Gln Glu Lys Ile
Lys Pro Lys Val Arg 180 185 190 Ser Thr Val Ala Gln His His Glu Ala
Leu Val Gly His Gly Phe Thr 195 200 205 His Ala His Ile Val Ala Leu
Ser Gln His Pro Ala Ala Leu Gly Thr 210 215 220 Val Ala Val Lys Tyr
Gln Asp Met Ile Ala Ala Leu Pro Glu Ala Thr 225 230 235 240 His Glu
Ala Ile Val Gly Val Gly Lys Gln Trp Ser Gly Ala Arg Ala 245 250 255
Leu Glu Ala Leu Leu Thr Val Ala Gly Glu Leu Arg Gly Pro Pro Leu 260
265 270 Gln Leu Asp Thr Gly Gln Leu Leu Lys Ile Ala Lys Arg Gly Gly
Val 275 280 285 Thr Ala Val Glu Ala Val His Ala Trp Arg Asn Ala Leu
Thr Gly Ala 290 295 300 Pro Leu Asn Leu Thr Pro Gln Gln Val Val Ala
Ile Ala Ser Asn Gly 305 310 315 320 Gly Gly Lys Gln Ala Leu Glu Thr
Val Gln Arg Leu Leu Pro Val Leu 325 330 335 Cys Gln Ala His Gly Leu
Thr Pro Gln Gln Val Val Ala Ile Ala Ser 340 345 350 Asn Ile Gly Gly
Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro 355 360 365 Val Leu
Cys Gln Ala His Gly Leu Thr Pro Gln Gln Val Val Ala Ile 370 375 380
Ala Ser Asn Ile Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu 385
390 395 400 Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Gln Gln
Val Val 405 410 415 Ala Ile Ala Ser Asn Asn Gly Gly Lys Gln Ala Leu
Glu Thr Val Gln 420 425 430 Arg Leu Leu Pro Val Leu Cys Gln Ala His
Gly Leu Thr Pro Glu Gln 435 440 445 Val Val Ala Ile Ala Ser Asn Asn
Gly Gly Lys Gln Ala Leu Glu Thr 450 455 460 Val Gln Ala Leu Leu Pro
Val Leu Cys Gln Ala His Gly Leu Thr Pro 465 470 475 480 Glu Gln Val
Val Ala Ile Ala Ser Asn Asn Gly Gly Lys Gln Ala Leu 485 490 495 Glu
Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu 500 505
510 Thr Pro Gln Gln Val Val Ala Ile Ala Ser His Asp Gly Gly Lys Gln
515 520 525 Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln
Ala His 530 535 540 Gly Leu Thr Pro Gln Gln Val Val Ala Ile Ala Ser
Asn Gly Gly Gly 545 550 555 560 Lys Gln Ala Leu Glu Thr Val Gln Arg
Leu Leu Pro Val Leu Cys Gln 565 570 575 Ala His Gly Leu Thr Pro Gln
Gln Val Val Ala Ile Ala Ser Asn Ile 580 585 590 Gly Gly Lys Gln Ala
Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu 595 600 605 Cys Gln Ala
His Gly Leu Thr Pro Gln Gln Val Val Ala Ile Ala Ser 610 615 620 Asn
Gly Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro 625 630
635 640 Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala
Ile 645 650 655 Ala Ser Asn Ile Gly Gly Lys Gln Ala Leu Glu Thr Val
Gln Arg Leu 660 665 670 Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr
Pro Glu Gln Val Val 675 680 685 Ala Ile Ala Ser Asn Ile Gly Gly Lys
Gln Ala Leu Glu Thr Val Gln 690 695 700 Arg Leu Leu Pro Val Leu Cys
Gln Ala His Gly Leu Thr Pro Glu Gln 705 710 715 720 Val Val Ala Ile
Ala Ser His Asp Gly Gly Lys Gln Ala Leu Glu Thr 725 730 735 Val Gln
Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro 740 745 750
Glu Gln Val Val Ala Ile Ala Ser Asn Ile Gly Gly Lys Gln Ala Leu 755
760 765 Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly
Leu 770 775 780 Thr Pro Gln Gln Val Val Ala Ile Ala Ser Asn Asn Gly
Gly Lys Gln 785 790 795 800 Ala Leu Glu Thr Val Gln Arg Leu Leu Pro
Val Leu Cys Gln Ala Gln 805 810 815 Gly Leu Thr Pro Glu Gln Val Val
Ala Ile Ala Ser His Asp Gly Gly 820 825 830 Lys Gln Ala Leu Glu Thr
Val Gln Arg Leu Leu Pro Val Leu Cys Gln 835 840 845 Ala His Gly Leu
Thr Pro Gln Gln Val Val Ala Ile Ala Ser Asn Ile 850 855 860 Gly Gly
Arg Pro Ala Leu Glu Ser Ile Val Ala Gln Leu Ser Arg Pro 865 870 875
880 Asp Pro Ala Leu Ala Ala Leu Thr Asn Asp His Leu Val Ala Leu Ala
885 890 895 Cys Leu Gly Gly Arg Pro Ala Leu Asp Ala Val Lys Lys Gly
Leu Pro 900 905 910 His Ala Pro Ala Leu Ile Lys Arg Thr Asn Arg Arg
Ile Pro Glu Arg 915 920 925 Thr Ser His Arg Val Ala Lys Gln Leu Val
Lys Ser Glu Leu Glu Glu 930 935 940 Lys Lys Ser Glu Leu Arg His Lys
Leu Lys Tyr Val Pro His Glu Tyr 945 950 955 960 Ile Glu Leu Ile Glu
Ile Ala Arg Asn Ser Thr Gln Asp Arg Ile Leu 965 970 975 Glu Met Lys
Val Met Glu Phe Phe Met Lys Val Tyr Gly Tyr Arg Gly 980 985 990 Lys
His Leu Gly Gly Ser Arg Lys Pro Asp Gly Ala Ile Tyr Thr Val 995
1000 1005 Gly Ser Pro Ile Asp Tyr Gly Val Ile Val Asp Thr Lys Ala
Tyr 1010 1015 1020 Ser Gly Gly Tyr Asn Leu Pro Ile Gly Gln Ala Asp
Glu Met Gln 1025 1030 1035 Arg Tyr Val Glu Glu Asn Gln Thr Arg Asn
Lys His Ile Asn Pro 1040 1045 1050 Asn Glu Trp Trp Lys Val Tyr Pro
Ser Ser Val Thr Glu Phe Lys 1055 1060 1065 Phe Leu Phe Val Ser Gly
His Phe Lys Gly Asn Tyr Lys Ala Gln 1070 1075 1080 Leu Thr Arg Leu
Asn His Ile Thr Asn Cys Asn Gly Ala Val Leu 1085 1090 1095 Ser Val
Glu Glu Leu Leu Ile Gly Gly Glu Met Ile Lys Ala Gly 1100 1105 1110
Thr Leu Thr Leu Glu Glu Val Arg Arg Lys Phe Asn Asn Gly Glu 1115
1120 1125 Ile Asn Phe 1130 101980DNASolanum tuberosum 10atggcggctg
ctgcctcacc atctccatgt ttctccaaaa ccctacctcc atcttcctcc 60aaatcttcca
ccattcttcc tagatctacc ttccctttcc acaatcaccc tcaaaaagcc
120tcaccccttc atctcaccca cacccatcat catcgtcgtg gtttcgccgt
ttccaatgtc 180gtcatatcca ctaccaccca taacgacgtt tctgaacctg
aaacattcgt ttcccgtttc 240gcccctgacg aacccagaaa gggttgtgat
gttcttgtgg aggcacttga aagggagggg 300gttacggatg tatttgcgta
cccaggaggt gcttctatgg agattcatca ggctttgaca 360cgttcgaata
ttattcgtaa tgtgctgcca cgtcatgagc aaggtggtgt gtttgctgca
420gagggttacg cacgggcgac tgggttccct ggtgtttgca ttgctacctc
tggtccggga 480gctacgaatc ttgttagtgg tcttgcggat gctttgttgg
atagtattcc gattgttgct 540attacgggtc aagtgccgag gaggatgatt
ggtactgatg cgtttcagga aacgcctatt 600gttgaggtaa cgagatctat
tacgaagcat aattatcttg ttatggatgt agaggatatt 660cctagggttg
ttcgtgaagc gttttttcta gcgaaatcgg gacggcctgg gccggttttg
720attgatgtac ctaaggatat tcagcaacaa ttggtgatac ctaattggga
tcagccaatg 780aggttgcctg gttacatgtc taggttacct aaattgccta
atgagatgct tttggaacaa 840attattaggc tgatttcgga gtcgaagaag
cctgttttgt atgtgggtgg tgggtgtttg 900caatcaagtg aggagctgag
acgatttgtg gagcttacgg gtattcctgt ggcgagtact 960ttgatgggtc
ttggagcttt tccaactggg gatgagcttt cccttcaaat gttgggtatg
1020catgggactg tgtatgctaa ttatgctgtg gatggtagtg atttgttgct
tgcatttggg 1080gtgaggtttg atgatcgagt tactggtaaa ttggaagctt
ttgctagccg agcgaaaatt 1140gtccacattg atattgattc ggctgagatt
ggaaagaaca agcaacctca tgtttccatt 1200tgtgcagata tcaagttggc
attacagggt ttgaattcca tattggaggg taaagaaggt 1260aagctgaagt
tggacttttc tgcttggaga caggagttaa cggaacagaa ggtgaagtac
1320ccattgagtt ttaagacttt tggtgaagcc atccctccac aatatgctat
tcaggttctt 1380gatgagttaa ctaacggaaa tgccattatt agtactggtg
tggggcaaca ccagatgtgg 1440gctgcccaat actataagta caaaaagcca
caccaatggt tgacatctgg tggattagga 1500gcaatgggat ttggtttgcc
tgctgcaata ggtgcggctg ttggaagacc gggtgagatt 1560gtggttgaca
ttgatggtga cgggagtttt atcatgaatg tgcaggagtt agcaacaatt
1620aaggtggaga atctcccagt taagattatg ttgctgaata atcaacactt
gggaatggtg 1680gttcaatggg aggatcgatt ctataaggct aacagagcac
acacttactt gggtgatcct 1740gctaatgagg aagagatctt ccctaatatg
ttgaaattcg cagaggcttg tggcgtacct 1800gctgcaagag tgtcacacag
ggatgatctt agagctgcca ttcaaaagat gttagacact 1860cctgggccat
acttgttgga tgtgattgta cctcatcagg agcacgttct acctatgatt
1920cccagtggcg gtgctttcaa agatgtgatc acagagggtg atgggagacg
ttcatattga 198011659PRTSolanum tuberosum 11Met Ala Ala Ala Ala Ser
Pro Ser Pro Cys Phe Ser Lys Thr Leu Pro 1 5 10 15 Pro Ser Ser Ser
Lys Ser Ser Thr Ile Leu Pro Arg Ser Thr Phe Pro 20 25 30 Phe His
Asn His Pro Gln Lys Ala Ser Pro Leu His Leu Thr His Thr 35 40 45
His His His Arg Arg Gly Phe Ala Val Ser Asn Val Val Ile Ser Thr 50
55 60 Thr Thr His Asn Asp Val Ser Glu Pro Glu Thr Phe Val Ser Arg
Phe 65 70 75 80 Ala Pro Asp Glu Pro Arg Lys Gly Cys Asp Val Leu Val
Glu Ala Leu 85 90 95 Glu Arg Glu Gly Val Thr Asp Val Phe Ala Tyr
Pro Gly Gly Ala Ser 100 105 110 Met Glu Ile His Gln Ala Leu Thr Arg
Ser Asn Ile Ile Arg Asn Val 115 120 125 Leu Pro Arg His Glu Gln Gly
Gly Val Phe Ala Ala Glu Gly Tyr Ala 130 135 140 Arg Ala Thr Gly Phe
Pro Gly Val Cys Ile Ala Thr Ser Gly Pro Gly 145 150 155 160 Ala Thr
Asn Leu Val Ser Gly Leu Ala Asp Ala Leu Leu Asp Ser Ile 165 170 175
Pro Ile Val Ala Ile Thr Gly Gln Val Pro Arg Arg Met Ile Gly Thr 180
185 190 Asp Ala Phe Gln Glu Thr Pro Ile Val Glu Val Thr Arg Ser Ile
Thr 195 200 205 Lys His Asn Tyr Leu Val Met Asp Val Glu Asp Ile Pro
Arg Val Val 210 215 220 Arg Glu Ala Phe Phe Leu Ala Lys Ser Gly Arg
Pro Gly Pro Val Leu 225 230 235 240 Ile Asp Val Pro Lys Asp Ile Gln
Gln Gln Leu Val Ile Pro Asn Trp 245 250 255 Asp Gln Pro Met Arg Leu
Pro Gly Tyr Met Ser Arg Leu Pro Lys Leu 260 265 270 Pro Asn Glu Met
Leu Leu Glu Gln Ile Ile Arg Leu Ile Ser Glu Ser 275 280 285 Lys Lys
Pro Val Leu Tyr Val Gly Gly Gly Cys Leu Gln Ser Ser Glu 290 295 300
Glu Leu Arg Arg Phe Val Glu Leu Thr Gly Ile Pro Val Ala Ser Thr 305
310 315 320 Leu Met Gly Leu Gly Ala Phe Pro Thr Gly Asp Glu Leu Ser
Leu Gln 325 330 335 Met Leu Gly Met His Gly Thr Val Tyr Ala Asn Tyr
Ala Val Asp Gly 340 345 350 Ser Asp Leu Leu Leu Ala Phe Gly Val Arg
Phe Asp Asp Arg Val Thr 355 360 365 Gly Lys Leu Glu Ala Phe Ala Ser
Arg Ala Lys Ile Val His Ile Asp 370 375 380 Ile Asp Ser Ala Glu Ile
Gly Lys Asn Lys Gln Pro His Val Ser Ile 385 390 395 400 Cys Ala Asp
Ile Lys Leu Ala Leu Gln Gly Leu Asn Ser Ile Leu Glu 405 410 415 Gly
Lys Glu Gly Lys Leu Lys Leu Asp Phe Ser Ala Trp Arg Gln Glu 420 425
430 Leu Thr Glu Gln Lys Val Lys Tyr Pro Leu Ser Phe Lys Thr Phe Gly
435 440 445 Glu Ala Ile Pro Pro Gln Tyr Ala Ile Gln Val Leu Asp Glu
Leu Thr 450 455 460 Asn Gly Asn Ala Ile Ile Ser Thr Gly Val Gly Gln
His Gln Met Trp 465 470 475 480 Ala Ala Gln Tyr Tyr Lys Tyr Lys Lys
Pro His Gln Trp Leu Thr Ser 485 490 495 Gly Gly Leu Gly Ala Met Gly
Phe Gly Leu Pro Ala Ala Ile Gly Ala 500 505 510 Ala Val Gly Arg Pro
Gly Glu Ile Val Val Asp Ile Asp Gly Asp Gly 515 520 525 Ser Phe Ile
Met Asn Val Gln Glu Leu Ala Thr Ile Lys Val Glu Asn 530 535 540 Leu
Pro Val Lys Ile Met Leu Leu Asn Asn Gln His Leu Gly Met Val 545 550
555 560 Val Gln Trp Glu Asp Arg Phe Tyr Lys Ala Asn Arg Ala His Thr
Tyr 565 570 575 Leu Gly Asp Pro Ala Asn Glu Glu Glu Ile Phe Pro Asn
Met Leu Lys 580 585 590 Phe Ala Glu Ala Cys Gly Val Pro Ala Ala Arg
Val Ser His Arg Asp 595 600 605 Asp Leu Arg Ala Ala Ile Gln Lys Met
Leu Asp Thr Pro Gly Pro Tyr 610 615 620 Leu Leu Asp Val Ile Val Pro
His Gln Glu His Val Leu Pro Met Ile 625 630 635 640 Pro Ser Gly Gly
Ala Phe Lys Asp Val Ile Thr Glu Gly Asp Gly Arg 645 650 655 Arg Ser
Tyr 121980DNASolanum tuberosum 12atggcggctg ctgcctcacc atctccatgt
ttctccaaaa ccctacctcc atcttcctcc 60aaatcttcca ccattcttcc tagatctacc
ttccctttcc acaatcaccc tcaaaaagcc 120tcaccccttc atctcaccca
cacccatcat catcgtcgtg gtttcgccgt ttccaatgtc 180gtcatatcca
ctaccaccca taacgacgtt tctgaacctg aaacattcgt ttcccgtttc
240gcccctgacg aacccagaaa gggttgtgat gttcttgtgg aggcacttga
aagggagggg 300gttacggatg tatttgcgta cccaggaggt gcttctatgg
agattcatca ggctttgaca 360cgttcgaata ttattcgtaa tgtgctgcca
cgtcatgagc aaggtggtgt gtttgctgca 420gagggttacg cacgggcgac
tgggttccct ggtgtttgca ttgctacctc tggtccggga 480gctacgaatc
ttgttagtgg tcttgcggat gctttgttgg atagtattcc gattgttgct
540attacgggtc aagtgccgag gaggatgatt ggtactgatg
cgtttcagga aacgcctatt 600gttgaggtaa cgagatctat tacgaagcat
aattatcttg ttatggatgt agaggatatt 660cctagggttg ttcgtgaagc
gttttttcta gcgaaatcgg gacggcctgg gccggttttg 720attgatgtac
ctaaggatat tcagcaacaa ttggtgatac ctaattggga tcagccaatg
780aggttgcctg gttacatgtc tagattacct aaattgccta atgagatgct
tttggaacaa 840attattaggc tgatttcgga gtcgaagaag cctgttttgt
atgtgggtgg tgggtgtttg 900caatcaagtg aggagctgag acgatttgtg
gagcttacgg gtattcctgt ggcgagtact 960ttgatgggtc ttggagcttt
tccaactggg gatgagcttt cccttcaaat gttgggtatg 1020catgggactg
tgtatgctaa ttatgctgtg gatggtagtg atttgttgct tgcatttggg
1080gtgaggtttg atgatcgagt tactggtaaa ttggaagctt ttgctagccg
agcgaaaatt 1140gtccacattg atattgattc ggctgagatt ggaaagaaca
agcaacctca tgtttccatt 1200tgtgcagata tcaagttggc attacagggt
ttgaattcca tattggaggg taaagaaggt 1260aagctgaagt tggacttttc
tgcttggaga caggagttaa cggaacagaa ggtgaagtac 1320ccattgagtt
ttaagacttt tggtgaagcc atccctccac aatatgctat tcaggttctt
1380gatgagttaa ctaacggaaa tgccattatt agtactggtg tggggcaaca
ccagatgtgg 1440gctgcccaat actataagta caaaaagcca caccaatggt
tgacatctgg tggattagga 1500gcaatgggat ttggtttgcc tgctgcaata
ggtgcggctg ttggaagacc gggtgagatt 1560gtggttgaca ttgatggtga
cgggagtttt atcatgaatg tgcaggagtt agcaacaatt 1620aaggtggaga
atctcccagt taagattatg ttgctgaata atcaacactt gggaatggtg
1680gttcaactgg aggatcgatt ctataaggct aacagagcac acacttactt
gggtgatcct 1740gctaatgagg aagagatctt ccctaatatg ttgaaattcg
cagaggcttg tggcgtacct 1800gctgcaagag tgtcacacag ggatgatctt
agagctgcca ttcaaaagat gttagacact 1860cctgggccat acttgttgga
tgtgattgta cctcatcagg agcacgttct acctatgatt 1920cccattggcg
gtgctttcaa agatgtgatc acagagggtg atgggagacg ttcatattga
198013659PRTSolanum tuberosum 13Met Ala Ala Ala Ala Ser Pro Ser Pro
Cys Phe Ser Lys Thr Leu Pro 1 5 10 15 Pro Ser Ser Ser Lys Ser Ser
Thr Ile Leu Pro Arg Ser Thr Phe Pro 20 25 30 Phe His Asn His Pro
Gln Lys Ala Ser Pro Leu His Leu Thr His Thr 35 40 45 His His His
Arg Arg Gly Phe Ala Val Ser Asn Val Val Ile Ser Thr 50 55 60 Thr
Thr His Asn Asp Val Ser Glu Pro Glu Thr Phe Val Ser Arg Phe 65 70
75 80 Ala Pro Asp Glu Pro Arg Lys Gly Cys Asp Val Leu Val Glu Ala
Leu 85 90 95 Glu Arg Glu Gly Val Thr Asp Val Phe Ala Tyr Pro Gly
Gly Ala Ser 100 105 110 Met Glu Ile His Gln Ala Leu Thr Arg Ser Asn
Ile Ile Arg Asn Val 115 120 125 Leu Pro Arg His Glu Gln Gly Gly Val
Phe Ala Ala Glu Gly Tyr Ala 130 135 140 Arg Ala Thr Gly Phe Pro Gly
Val Cys Ile Ala Thr Ser Gly Pro Gly 145 150 155 160 Ala Thr Asn Leu
Val Ser Gly Leu Ala Asp Ala Leu Leu Asp Ser Ile 165 170 175 Pro Ile
Val Ala Ile Thr Gly Gln Val Pro Arg Arg Met Ile Gly Thr 180 185 190
Asp Ala Phe Gln Glu Thr Pro Ile Val Glu Val Thr Arg Ser Ile Thr 195
200 205 Lys His Asn Tyr Leu Val Met Asp Val Glu Asp Ile Pro Arg Val
Val 210 215 220 Arg Glu Ala Phe Phe Leu Ala Lys Ser Gly Arg Pro Gly
Pro Val Leu 225 230 235 240 Ile Asp Val Pro Lys Asp Ile Gln Gln Gln
Leu Val Ile Pro Asn Trp 245 250 255 Asp Gln Pro Met Arg Leu Pro Gly
Tyr Met Ser Arg Leu Pro Lys Leu 260 265 270 Pro Asn Glu Met Leu Leu
Glu Gln Ile Ile Arg Leu Ile Ser Glu Ser 275 280 285 Lys Lys Pro Val
Leu Tyr Val Gly Gly Gly Cys Leu Gln Ser Ser Glu 290 295 300 Glu Leu
Arg Arg Phe Val Glu Leu Thr Gly Ile Pro Val Ala Ser Thr 305 310 315
320 Leu Met Gly Leu Gly Ala Phe Pro Thr Gly Asp Glu Leu Ser Leu Gln
325 330 335 Met Leu Gly Met His Gly Thr Val Tyr Ala Asn Tyr Ala Val
Asp Gly 340 345 350 Ser Asp Leu Leu Leu Ala Phe Gly Val Arg Phe Asp
Asp Arg Val Thr 355 360 365 Gly Lys Leu Glu Ala Phe Ala Ser Arg Ala
Lys Ile Val His Ile Asp 370 375 380 Ile Asp Ser Ala Glu Ile Gly Lys
Asn Lys Gln Pro His Val Ser Ile 385 390 395 400 Cys Ala Asp Ile Lys
Leu Ala Leu Gln Gly Leu Asn Ser Ile Leu Glu 405 410 415 Gly Lys Glu
Gly Lys Leu Lys Leu Asp Phe Ser Ala Trp Arg Gln Glu 420 425 430 Leu
Thr Glu Gln Lys Val Lys Tyr Pro Leu Ser Phe Lys Thr Phe Gly 435 440
445 Glu Ala Ile Pro Pro Gln Tyr Ala Ile Gln Val Leu Asp Glu Leu Thr
450 455 460 Asn Gly Asn Ala Ile Ile Ser Thr Gly Val Gly Gln His Gln
Met Trp 465 470 475 480 Ala Ala Gln Tyr Tyr Lys Tyr Lys Lys Pro His
Gln Trp Leu Thr Ser 485 490 495 Gly Gly Leu Gly Ala Met Gly Phe Gly
Leu Pro Ala Ala Ile Gly Ala 500 505 510 Ala Val Gly Arg Pro Gly Glu
Ile Val Val Asp Ile Asp Gly Asp Gly 515 520 525 Ser Phe Ile Met Asn
Val Gln Glu Leu Ala Thr Ile Lys Val Glu Asn 530 535 540 Leu Pro Val
Lys Ile Met Leu Leu Asn Asn Gln His Leu Gly Met Val 545 550 555 560
Val Gln Leu Glu Asp Arg Phe Tyr Lys Ala Asn Arg Ala His Thr Tyr 565
570 575 Leu Gly Asp Pro Ala Asn Glu Glu Glu Ile Phe Pro Asn Met Leu
Lys 580 585 590 Phe Ala Glu Ala Cys Gly Val Pro Ala Ala Arg Val Ser
His Arg Asp 595 600 605 Asp Leu Arg Ala Ala Ile Gln Lys Met Leu Asp
Thr Pro Gly Pro Tyr 610 615 620 Leu Leu Asp Val Ile Val Pro His Gln
Glu His Val Leu Pro Met Ile 625 630 635 640 Pro Ile Gly Gly Ala Phe
Lys Asp Val Ile Thr Glu Gly Asp Gly Arg 645 650 655 Arg Ser Tyr
1413057DNAArtificial SequenceDescription of Artificial Sequence
Synthetic polynucleotide 14tggcaggata tataccggtg taaacgaagt
gtgtgtggtt gatccaaaat ctatcgtacc 60tttagaaagt gtagctatga aggatagtct
cacttatgaa gaactaccta ttgagattct 120tgatcgtcag gtccgaaggt
tgagaaaaat agaagtcgct tcagttacgg ctttgtggag 180gagtaagggt
accagttata caccctacat tctactcgag tcattatgat gatgtctcac
240gaccaaatca aatcaaagtt aaataaatat cgaaccgaac gcccactctg
tatgagtatg 300gcaaaagatt ttgagagaat caagttgcat aaaagcctaa
ttttcatgga acatacaaat 360tgagtctcat aatagcccaa actcacagcc
atgaacccaa attgggtaaa gttttgcaag 420acgttcatca aacagttagg
aaacataaaa tggcgctaga tatataataa atttttttaa 480catatggtgt
gattgatagt tatatactaa agatgtttgc ttagttacgt aattttttca
540aaaaaaaaag gtacattatc aatcatcagt cacaaaatat taaaagttac
tgtttgtttt 600ttaaattcca tgtcgaattt aattgaatga cacttaaatt
gggacgaacg gtgtaatttc 660ttttgactat tctactagta tctatccaca
gcacgtgttg ttcctttctt ctttcgtttt 720tcatttactt gacattatta
ggagacttgg ccctgaactc caactattct aagctgacct 780ttcttttcct
ttaccaatta tcttcttctt tctaatttcg ttttacgcgt agtactgcct
840gaattttctg actttcaacg tttgttattc atgcttgaaa acgaaatacc
agctaacaaa 900agatgaatta ttgtgtttac aagacttggg ccgttgactc
ttactttccc ttcctcatcc 960tcacatttag aaaaaagaaa tttaacgaaa
aattaaagga gatggctgaa attcttctca 1020cagcagtcat caataaatca
atagaaatag ctggaaatgt actctttcaa gaaggtacgc 1080gtttatattg
gttgaaagag gacatcgatt ggctccagag agaaatgaga cacattcgat
1140catatgtaga caatgcaaag gcaaaggaag ttggaggcga ttcaagggtg
aaaaacttat 1200taaaagatat tcaacaactg gcaggtgatg tggaggatct
attagatgag tttcttccaa 1260aaattcaaca atccaataag ttcatttgtt
gccttaagac ggtttctttt gccgatgagt 1320ttgctatgga gattgagaag
ataaaaagaa gagttgctga tattgaccgt gtaaggacaa 1380cttacagcat
cacagataca agtaacaata atgatgattg cattccattg gaccggagaa
1440gattgttcct tcatgctgat gaaacagagg tcatcggtct ggaagatgac
ttcaatacac 1500tacaagccaa attacttgat catgatttgc cttatggagt
tgtttcaata gttggcatgc 1560ccggtttggg aaaaacaact cttgccaaga
aactttatag gcatgtctgt catcaatttg 1620agtgttcggg actggtctat
gtttcacaac agccaagggc gggagaaatc ttacatgaca 1680tagccaaaca
agttggactg acggaagagg aaaggaaaga aaacttggag aacaacctac
1740gatcactctt gaaaataaaa aggtatgtta ttctcttaga tgacatttgg
gatgttgaaa 1800tttgggatga tctaaaactt gtccttcctg aatgtgattc
aaaaattggc agtaggataa 1860ttataacctc tcgaaatagt aatgtaggca
gatacatagg aggggatttc tcaatccacg 1920tgttgcaacc cctagattca
gagaaaagct ttgaactctt taccaagaaa atctttaatt 1980ttgttaatga
taattgggcc aatgcttcac cagacttggt aaatattggt agatgtatag
2040ttgagagatg tggaggtata ccgctagcaa ttgtggtgac tgcaggcatg
ttaagggcaa 2100gaggaagaac agaacatgca tggaacagag tacttgagag
tatggctcat aaaattcaag 2160atggatgtgg taaggtattg gctctgagtt
acaatgattt gcccattgca ttaaggccat 2220gtttcttgta ctttggtctt
taccccgagg accatgaaat tcgtgctttt gatttgacaa 2280atatgtggat
tgctgagaag ctgatagttg taaatactgg caatgggcga gaggctgaaa
2340gtttggcgga tgatgtccta aatgatttgg tttcaagaaa cttgattcaa
gttgccaaaa 2400ggacatatga tggaagaatt tcaagttgtc gcatacatga
cttgttacat agtttgtgtg 2460tggacttggc taaggaaagt aacttctttc
acacggagca caatgcattt ggtgatccta 2520gcaatgttgc tagggtgcga
aggattacat tctactctga tgataatgcc atgaatgagt 2580tcttccattt
aaatcctaag cctatgaagc ttcgttcact tttctgtttc acaaaagacc
2640gttgcatatt ttctcaaatg gctcatctta acttcaaatt attgcaagtg
ttggttgtag 2700tcatgtctca aaagggttat cagcatgtta ctttccccaa
aaaaattggg aacatgagtt 2760gcctacgtta tgtgcgattg gagggggcaa
ttagagtaaa attgccaaat agtattgtca 2820agctcaaatg tctagagacc
ctggatatat ttcatagctc tagtaaactt ccttttggtg 2880tttgggagtc
taaaatattg agacatcttt gttacacaga agaatgttac tgtgtctctt
2940ttgcaagtcc attttgccga atcatgcctc ctaataatct acaaactttg
atgtgggtgg 3000atgataaatt ttgtgaacca agattgttgc accgattgat
aaatttaaga acattgtgta 3060taatggatgt atccggttct accattaaga
tattatcagc attgagccct gtgcctagag 3120cgttggaggt tctgaagctc
agatttttca agaacacgag tgagcaaata aacttgtcgt 3180cccatccaaa
tattgtcgag ttgggtttgg ttggtttctc agcaatgctc ttgaacattg
3240aagcattccc tccaaatctt gtcaagctta atcttgtcgg cttgatggta
gacggtcatc 3300tattggcagt gcttaagaaa ttgcccaaat taaggatact
tatattgctt tggtgcagac 3360atgatgcaga aaaaatggat ctctctggtg
atagctttcc gcaacttgaa gttttgtata 3420ttgaggatgc acaagggttg
tctgaagtaa cgtgcatgga tgatatgagt atgcctaaat 3480tgaaaaagct
atttcttgta caaggcccaa acatttcccc aattagtctc agggtctcgg
3540aacggcttgc aaagttgaga atatcacagg tactataaat aattatttac
gtttaatatc 3600catgattttt ttaaatttgt atttagttca tcaactaaat
attccatgtc taataaattg 3660cagggatgcc tttgaaaatg attctgtgtt
ggagagaatc ttctgatgcc tgttggtatt 3720ataatactaa taataagaga
aaaagtttga ttactgtttc aagttaattg cttgtgattt 3780gtaaaaacaa
attactttta tatttctctt tgttttattt tatgtttatt tatctttaat
3840taatggagta ataaaataaa aatcttattt tcaatagaaa aaagtagacc
ttatttgtgg 3900tgcatgtatg gtatcttttt gaaatttttg atatatttgc
tctttgattc gaatttcttg 3960cttatatgat gatttgcata aatataaaat
attatacaaa tacctatggg ttggaaaata 4020tagaaatatg ccaatcaaat
gtatacaaaa atcattaata gatagaatcg taaaagatat 4080acaaatgaga
aatgcttgac taagaagctt cgtgcaacct ctcacactga gcacaatgca
4140tttggtgatc tcggcactat tgctgttact tgtaagacta cgttccccaa
taagtctttc 4200caaacggctt gcaaagctga gaatatgaaa atctcatagg
ttagtttgct gcgttaatta 4260tttacattta atatgctcga taaggtgatt
ttaaaaaaat ttgtactagt taattcatga 4320actaaatatt tcatttaata
ctccataatt ctgaatatgg aaaataaata atatttaata 4380acaagaataa
aatgataaat tattcattga ttttataaat tggataaata ttattaaata
4440ttcttaaata atataatgaa caagtgaaga tgaacggagg gagtatgaag
cctcttttca 4500aaggggcccc aagtgtctga gacaaccaaa actgaaagtg
ggaaaccaaa ctctaagtca 4560aagactttat atacaaaatg gtataaatat
aattatttaa tttactatcg ggttatcgat 4620taacccgtta agaaaaaact
tcaaaccgtt aagaaccgat aacccgataa caaaaaaaat 4680ctaaatcgtt
atcaaaaccg ctaaactaat aacccaatat tgataaacca ataacttttt
4740ttattcgggt tatcggtttc agttctgttt ggaacaatcc tagtgtccta
attattgttt 4800tgagaaccaa gaaaacaaaa acttacgtcg caaatatttc
agtaaatact tgtatatctc 4860agtgataatt gatttccaac atgtataatt
atcatttacg taataataga tggtttccga 4920aacttacgct tccctttttt
cttttgcagt cgtatggaat aaaagttgga tatggaggca 4980ttcccgggcc
ttcaggtgga agagacggag ctgcttcaca aggagggggt tgttgtactt
5040gaaaatgggc atttattgtt cgcaaaccta tcatgttcct atggttgttt
atttgtagtt 5100tggtgttctt aatatcgagt gttctttagt ttgttccttt
taatgaaagg ataatatctg 5160tgcaaaaata agtaaattcg gtacataaag
acattttttt ttgcattttc tgtttatgga 5220gttgtcaaat gtgaatttat
ttcatagcat gtgagtttcc tctccttttt catgtgccct 5280tgggccttgc
atgtttcttg caccgcagtg tgccagggct gtcggcagat ggacataaat
5340ggcacaccgc tcggctcgtg gaaagagtat ggtcagtttc attgataagt
atttactcgt 5400attcggtgtt tacatcaagt taatatgttc aaacacatgt
gatatcatac atccattagt 5460taagtataaa tgccaacttt ttacttgaat
cgccgaataa atttacttac gtccaatatt 5520tagttttgtg tgtcaaacat
atcatgcact atttgattaa gaataaataa acgatgtgta 5580atttgaaaac
caattagaaa agaagtatga cgggattgat gttctgtgaa atcactggta
5640aattggacgg acgatgaaat ttgatcgtcc atttaagcat agcaacatgg
gtctttagtc 5700atcatcatta tgttataatt attttcttga aacttgatac
accaactttc attgggaaag 5760tgacagcata gtataaacta taatatcaat
tctggcaatt tcgaattatt ccaaatctct 5820tttgtcattt catttcctcc
cctatgtctg caagtaccaa ttatttaagt acaaaaaatc 5880ttgattaaac
aatttatttt ctcactaata atcacattta atcatcaacg gttcatacac
5940gtctgtcact ctttttttat tctctcaagc gcatgtgatc ataccaatta
tttaaataca 6000aaaaatcttg attaaacaat tcagtttctc actaataatc
acatttaatc atcaacggtt 6060catacacatc cgtcactctt tttttattct
ctcaagcgca tgtgatcata ccaattattt 6120aaatacaaaa aatcttgatt
aaacaattca ttttctcact aataatcaca tttaatcatc 6180aacggtttat
acacgtccgc cactcttttt ttattctctc aagcgtatgt gatcatatct
6240aactctcgtg caaacaagtg aaatgacgtt cactaataaa taatcttttg
aatactttgt 6300tcagtttaat ttatttaatt tgataagaat ttttttatta
ttgaattttt attgttttaa 6360attaaaaata agttaaatat atcaaaatat
cttttaattt tatttttgaa aaataacgta 6420gttcaaacaa attaaaattg
agtaactgtt tttcgaaaaa taatgattct aatagtatat 6480tctttttcat
cattagatat tttttttaag ctaagtacaa aagtcatatt tcaatcccca
6540aaatagcctc aatcacaaga aatgcttaaa tccccaaaat accctcaatc
acaagacgtg 6600tgtaccaatc atacctatgg tcctctcgta aattccgaca
aaatcaggtc tataaagtta 6660cccttgatat cagtattata aaactaaaaa
tctcagctgt aattcaagtg caatcacact 6720ctaccacaca ctctctagta
gagagatcag ttgataacaa gcttgttaac ggatccataa 6780ttgtaactga
tttattcttg aataacaact tcaatgaaat caagcaacaa agctgatttc
6840aacataaaaa aacagaacaa gaaaacaaaa acagagcatc atccatcaaa
gtgtaatctc 6900agcagattca atagagacta caagattttg cacttgtaca
taatcatcag tgtcaccggt 6960ataaagcatc atgatctgac catcgggtag
gatggtagcg gacccagtcc agacaccgtt 7020aatatcgtac cattgatcag
gaaccatggc aaaaggcaag tagagccagt ggatcaagtc 7080cttggatacg
gcatggcccc atgtgatatt tccccaaata gctgaatctg gattgtattg
7140ataaaaaaga tgataccatc ccttgtggta caatggacca ttaggatcgt
tcatccaatt 7200tttttgaggt tgaaaatggt aagcagttct ttgccagcta
agcatagcat tggaccacgc 7260ataagaaacg tgactagcat tgacgacatc
tcgaaaagtc ttatcggaga ctccctgaga 7320aacacctctt gacggcggcg
ccggcgaacg ggagttactc tgcaagtccg gtgactggtt 7380gttgaggatc
ggaaagaagg ctacagaaag caaaaggaaa gaggagagga aaatgccgga
7440gatgatttta agggacttcc ggtggccgga atcgggttga tccgggagga
atgtgtaatg 7500ggaggcggag ttttccgggt cataactgga atggtactgc
gtggccatac tcgtgcctaa 7560aatggcgaat aagtagagta taacactaca
tattctccct ctcttccctt tcttgatggg 7620acatcggtga aataaccttc
aaatgaaaaa aagaatgaag aagatatggc ttgatgaaga 7680actctttatc
cagaaatggt actctagctt ctaagcccca cgcggatgta gccttgtttg
7740ctcttaaaca gtcatactgg tgaagcgctt ttatcttgcg acatgtttcc
gtgtggaact 7800cttccttgtt tggagccttg tggaagtaca agtagccacc
aaaaatttcg tcagcacctt 7860cccctgatat gaccatcttc actcctagtg
atttaatctt acgtgacata aggaacatag 7920gagtgctggc tcttattgtt
gttacatcat acgtctcgat atgatatata acatcttcaa 7980tagcatcaat
cccgtcctga acagtaaagt gaaactcgtg gtgaacggtt cctaaaaagt
8040cagcaacttc ttttgcagcc ttgagatctg gtgagccctc gagaacattt
tgaagttttc 8100cctccggggc acttgtactc tagcaagaac ggagggctta
ggagatggta caatcccgct 8160tggttctctg aagcaattcc ttccactcct
tatgacactt tggttctgag gcgtgccttc 8220gaaaatgctg ttatcaaacg
gttgatgact gatgtcccct ttggcgttct gctctcgggg 8280ggacttgatt
cgtctttggt tgcttctgtc actactcgat acttggctgg aacaaaagct
8340gctaagcaat ggggagcaca acttcattcc ttctgtgttg gtctcgaggg
ctcaccagat 8400ctcaaggctg caaaagaagt tgctgacttt ttaggaaccg
ttcaccacga gtttcacttt 8460actgttcagg acgggattga tgctattgaa
gatgttatat atcatatcga gacgtatgat 8520gtaacaacaa taagagccag
cactcctatg ttccttatgt cacgtaagat taaatcacta 8580ggagtgaaga
tggtcatatc aggggaaggt gctgacgaaa tttttggtgg ctacttgtac
8640ttccacaagg ctccaaacaa ggaagagttc cacacggaaa catgtcgcaa
gataaaagcg 8700cttcaccagt atgactgttt aagagcaaac aaggctacat
ccgcgtgggg cttagaagct 8760agagtaccat ttctggataa agagttcttc
atcaagccat atcttcttca ttcttttttt 8820catttgaagg ttatttcacc
gatgtcccat caagaaaggg aagagaggga gaatatgtag 8880tgttatactc
tacttattcg ccattttagg cacgagtatg gccacgcagt accattccag
8940ttatgacccg gaaaactccg cctcccatta cacattcctc ccggatcaac
ccgattccgg 9000ccaccggaag tcccttaaaa tcatctccgg cattttcctc
tcctctttcc ttttgctttc 9060tgtagccttc tttccgatcc tcaacaacca
gtcaccggac ttgcagagta actcccgttc 9120gccggcgccg ccgtcaagag
gtgtttctca gggagtctcc gataagactt ttcgagatgt 9180cgtcaatgct
agtcacgttt cttatgcgtg gtccaatgct atgcttagct ggcaaagaac
9240tgcttaccat tttcaacctc
aaaaaaattg gatgaacgat cctaatggtc cattgtacca 9300caagggatgg
tatcatcttt tttatcaata caatccagat tcagctattt ggggaaatat
9360cacatggggc catgccgtat ccaaggactt gatccactgg ctctacttgc
cttttgccat 9420ggttcctgat caatggtacg atattaacgg tgtctggact
gggtccgcta ccatcctacc 9480cgatggtcag atcatgatgc tttataccgg
tgacactgat gattatgtac aagtgcaaaa 9540tcttgtagtc tctattgaat
ctgctgagat tacactttga tggatgatgc tctgtttttg 9600ttttcttgtt
ctgttttttt atgttgaaat cagctttgtt gcttgatttc attgaagttg
9660ttattcaaga ataaatcagt tacaattata ctagtcccta gacttgtcca
tcttctggat 9720tggccaactt aattaatgta tgaaataaaa ggatgcacac
atagtgacat gctaatcact 9780ataatgtggg catcaaagtt gtgtgttatg
tgtaattact aattatctga ataagagaaa 9840gagatcatcc atatttctta
tcctaaatga atgtcacgtg tctttataat tctttgatga 9900accagatgca
ttttattaac caattccata tacgagctcc ctattttttt actatattat
9960actcaaccca atgagcataa agactgtaaa atctcaaatt cctgagaagc
atatttatcg 10020atcccacaga cttgatagtt ccataatcca tacgctgcag
ccaaattgct agtgtgttga 10080acatttaaca cgtagagaac tagaaaagat
ataaaactaa gattgatatc caaaatagac 10140gagaacaata agcaaaaact
cttagttttg aaataaatca acaatcccga gggttgtcac 10200atacatcaaa
aacgaaaatc catatagcaa aaaaaactct aaattaccgt tcgacaaaaa
10260gagaaaactg ataggacatt tgctaaacat taaaatcaat atgaacgtct
cccatcaccc 10320tctgtgatca catctttgaa agcaccgcca atgggaatca
taggtagaac gtgctcctga 10380tgaggtacaa tcacatccaa caagtatggc
ccaggagtgt ctaacatctt ttgaatggca 10440gctctaagat catccctgtg
tgacactctt gcagcaggta cgccacaagc ctctgcgaat 10500ttcaacatat
tagggaagat ctcttcctca ttagcaggat cacccaagta agtgtgtgct
10560ctgttagcct tatagaatcg atcctccagt tgaaccacca ttcccaagtg
ttgattattc 10620agcaacataa tcttaactgg gagattctcc accttaattg
ttgctaactc ctgcacattc 10680atgataaaac tcccgtcacc atcaatgtca
accacaatct cacccggtct tccaacagcc 10740gcacctattg cagcaggcaa
accaaatccc attgctccta atccaccaga tgtcaaccat 10800tggtgtggct
ttttgtactt atagtattgg gcagcccaca tctggtgttg ccccacacca
10860gtactaataa tggcatttcc gttagttaac tcatcaagaa cctgaatagc
atattgtgga 10920gggatggctt caccaaaagt cttaaaactc aatgggtact
tcaccttctg ttccgttaac 10980tcctgtctcc aagcagaaaa gtccaacttc
agcttacctt ctttaccctc caatatggaa 11040ttcaaaccct gtaatgccaa
cttgatatct gcacaaatgg aaacatgagg ttgcttgttc 11100tttccaatct
cagccgaatc aatatcaatg tggacaattt tcgctcggct agcaaaagct
11160tccaatttac cagtaactcg atcatcaaac ctcaccccaa atgcaagcaa
caaatcacta 11220ccatccacag cataattagc atacacagtc ccatgcatac
ccaacatttg aagggaaagc 11280tcatccccag ttggaaaagc tccaagaccc
atcaaagtac tcgccacagg aatacccgta 11340agctccacaa atcgtctcag
ctcctcactt gattgcaaac acccaccacc cacatacaaa 11400acaggcttct
tcgactccga aatcagccta ataatttgtt ccaaaagcat ctcattaggc
11460aatttaggta atctagacat gtaaccaggc aacctcattg gctgatccca
attaggtatc 11520accaattgtt gctgaatatc cttaggtaca tcaatcaaaa
ccggcccagg ccgtcccgat 11580ttcgctagaa aaaacgcttc acgaacaacc
ctaggaatat cctctacatc cataacaaga 11640taattatgct tcgtaataga
tctcgttacc tcaacaatag gcgtttcctg aaacgcatca 11700gtaccaatca
tcctcctcgg cacttgaccc gtaatagcaa caatcggaat actatccaac
11760aaagcatccg caagaccact aacaagattc gtagctcccg gaccagaggt
agcaatgcaa 11820acaccaggga acccagtcgc ccgtgcgtaa ccctctgcag
caaacacacc accttgctca 11880tgacgtggca gcacattacg aataatattc
gaacgtgtca aagcctgatg aatctccata 11940gaagcacctc ctgggtacgc
aaatacatcc gtaaccccct ccctttcaag tgcctccaca 12000agaacatcac
aaccctttct gggttcgtca ggggcgaaac gggaaacgaa tgtttcaggt
12060tcagaaacgt cgttatgggt ggtagtggat atgacgacat tggaaacggc
gaaaccacga 12120cgatgatgat gggtgtgggt gagatgaagg ggtgaggctt
tttgagggtg attgtggaaa 12180gggaaggtag atctaggaag aatggtggaa
gatttggagg aagatggagg tagggttttg 12240gagaaacatg gagatggtga
ggcagcagcc gccatacctc cacgtagacg gagcaccaaa 12300tggagggtag
actccttctg gatgttgtaa tcagctagag tacgtccgtc ctccaactgc
12360tttccggcga agataagcct ttgctgatcc gggggaattc cttccttatc
ctggatctta 12420gccttaacgt tgtcgattgt atcagaactt tccacctcta
gggtgatagt ctttccggtg 12480agagttttca caaagatctg catctgcaaa
ttcataaaaa caacaatcag aacaacgaaa 12540aaaactcaaa tttagggctt
ttcacactcc ctgattagct gattcgagaa ttaaaaccaa 12600acaatctaaa
acacggggaa aaaactcaaa tttagggctt tttcatcaaa acagccggag
12660aaactcaaat ttagggcttt ttcatcaaaa cagccggaga aactcaaatt
ttaggccttc 12720agaatcaaac cgacgaaaaa acacaaattc tagggctttt
tcatcaaatc aaccggaaaa 12780actcaaattt aaggcctttt tcatcaaaca
aatgtaaaaa ctctaatttt aaacctttag 12840aatcgaatcg acagaaaaaa
aatctcaaac taggggcttt ttcatcaaaa caacggggaa 12900aactccaatt
gtaggccttt tccatcaaac caacgtaaaa aaactcaaat tttaggcctt
12960ttccatcaag acaaccgaaa aaaaactcaa aattaagggc tataacagca
cctgattagc 13020taattcgaga attgacagga tatatggtac tgtaaac
13057153014DNAArtificial SequenceDescription of Artificial Sequence
Synthetic polynucleotide 15atttagcagc attccagatt gggttcaatc
aacaaggtac gagccatatc actttattca 60aattggtatc gccaaaacca agaaggaact
cccatcctca aaggtttgta aggaagaatt 120ctcagtccaa agcctcaaca
aggtcagggt acagagtctc caaaccatta gccaaaagct 180acaggagatc
aatgaagaat cttcaatcaa agtaaactac tgttccagca catgcatcat
240ggtcagtaag tttcagaaaa agacatccac cgaagactta aagttagtgg
gcatctttga 300aagtaatctt gtcaacatcg agcagctggc ttgtggggac
cagacaaaaa aggaatggtg 360cagaattgtt aggcgcacct accaaaagca
tctttgcctt tattgcaaag ataaagcaga 420ttcctctagt acaagtgggg
aacaaaataa cgtggaaaag agctgtcctg acagcccact 480cactaatgcg
tatgacgaac gcagtgacga ccacaaaaga attccctcta tataagaagg
540cattcattcc catttgaagg atcatcagat actcaaccaa tactagtatg
ttaagggcta 600taacagcacc tgattagcta attcgagaat ctaaacagac
aaaaaccaaa gtccgtcctg 660tagaaacccc aacccgtgaa atcaaaaaac
tcgacggcct gtgggcattc agtctggatc 720gcgaaaactg tggaattgat
cagcgttggt gggaaagcgc gttacaagaa agccgggcaa 780ttgctgtgcc
aggcagtttt aacgatcagt tcgccgatgc agatattcgt aattatgcgg
840gcaacgtctg gtatcagcgc gaagtcttta taccgaaagg ttgggcaggc
cagcgtatcg 900tgctgcgttt cgatgcggtc actcattacg gcaaagtgtg
ggtcaataat caggaagtga 960tggagcatca gggcggctat acgccatttg
aagccgatgt cacgccgtat gttattgccg 1020ggaaaagtgt acgtaagttt
ctgcttctac ctttgatata tatataataa ttatcattaa 1080ttagtagtaa
tataatattt caaatatttt tttcaaaata aaagaatgta gtatatagca
1140attgcttttc tgtagtttat aagtgtgtat attttaattt ataacttttc
taatatatga 1200ccaaaatttg ttgatgtgca ggtatcaccg tttgtgtgaa
caacgaactg aactggcaga 1260ctatcccgcc gggaatggtg attaccgacg
aaaacggcaa gaaaaagcag tcttacttcc 1320atgatttctt taactatgcc
ggaatccatc gcagcgtaat gctctacacc acgccgaaca 1380cctgggtgga
cgatatcacc gtggtgacgc atgtcgcgca agactgtaac cacgcgtctg
1440ttgactggca ggtggtggcc aatggtgatg tcagcgttga actgcgtgat
gcggatcaac 1500aggtggttgc aactggacaa ggcactagcg ggactttgca
agtggtgaat ccgcacctct 1560ggcaaccggg tgaaggttat ctctatgaac
tgtgcgtcac agccaaaagc cagacagagt 1620gtgatatcta cccgcttcgc
gtcggcatcc ggtcagtggc agtgaagggc gaacagttcc 1680tgattaacca
caaaccgttc tactttactg gctttggtcg tcatgaagat gcggacttgc
1740gtggcaaagg attcgataac gtgctgatgg tgcacgacca cgcattaatg
gactggattg 1800gggccaactc ctaccgtacc tcgcattacc cttacgctga
agagatgctc gactgggcag 1860atgaacatgg catcgtggtg attgatgaaa
ctgctgctgt cggctttaac ctctctttag 1920gcattggttt cgaagcgggc
aacaagccga aagaactgta cagcgaagag gcagtcaacg 1980gggaaactca
gcaagcgcac ttacaggcga ttaaagagct gatagcgcgt gacaaaaacc
2040acccaagcgt ggtgatgtgg agtattgcca acgaaccgga tacccgtccg
caaggtgcac 2100gggaatattt cgcgccactg gcggaagcaa cgcgtaaact
cgacccgacg cgtccgatca 2160cctgcgtcaa tgtaatgttc tgcgacgctc
acaccgatac catcagcgat ctctttgatg 2220tgctgtgcct gaaccgttat
tacggatggt atgtccaaag cggcgatttg gaaacggcag 2280agaaggtact
ggaaaaagaa cttctggcct ggcaggagaa actgcatcag ccgattatca
2340tcaccgaata cggcgtggat acgttagccg ggctgcactc aatgtacacc
gacatgtgga 2400gtgaagagta tcagtgtgca tggctggata tgtatcaccg
cgtctttgat cgcgtcagcg 2460ccgtcgtcgg tgaacaggta tggaatttcg
ccgattttgc gacctcgcaa ggcatattgc 2520gcgttggcgg taacaagaaa
gggatcttca ctcgcgaccg caaaccgaag tcggcggctt 2580ttctgctgca
aaaacgctgg actggcatga acttcggtga aaaaccgcag cagggaggca
2640aacaatgaaa gcttttgatt ttaatgttta gcaaatgtcc tatcagtttt
ctctttttgt 2700cgaacggtaa tttagagttt tttttgctat atggattttc
gtttttgatg tatgtgacaa 2760ccctcgggat tgttgattta tttcaaaact
aagagttttt gcttattgtt ctcgtctatt 2820ttggatatca atcttagttt
tatatctttt ctagttctct acgtgttaaa tgttcaacac 2880actagcaatt
tggctgcagc gtatggatta tggaactatc aagtctgtgg gatcgataaa
2940tatgcttctc aggaatttga gattttacag tctttatgct cattgggttg
agtataatat 3000agtaaaaaaa tagg 30141620DNAArtificial
SequenceDescription of Artificial Sequence Synthetic primer
16aaggcattca ttcccatttg 201720DNAArtificial SequenceDescription of
Artificial Sequence Synthetic primer 17gacccacact ttgccgtaat
20184164DNAArtificial SequenceDescription of Artificial Sequence
Synthetic polynucleotide 18atggctccta agaagaagag gaaggttgag
ccaggatctc caggtggaca gtctctcatg 60gataagaagt actccatcgg cctcgatatc
ggcactaact ctgttggatg ggctgtgatc 120actgatgagt acaaggtgcc
atccaagaag ttcaaggttc tcggcaacac tgataggcac 180tccatcaaga
agaaccttat cggcgctctc ctcttcgatt ccggtgaaac tgctgaggct
240actaggctta agaggactgc tagaaggcgt tacactaggc gtaagaacag
gatctgctac 300ctccaagaga tcttctccaa cgagatggct aaagtggatg
attcattctt ccacaggctc 360gaagagtcct tcttggtgga agaagataag
aagcacgaga ggcacccaat cttcggcaac 420attgtggatg aagtggctta
ccacgagaag tacccaacta tctaccacct ccgtaagaag 480ctcgttgatt
ccactgataa ggctgatctc aggctcatct accttgctct cgctcacatg
540atcaagttca ggggacactt cctcatcgag ggtgatctca acccagataa
ctccgatgtg 600gataagttgt tcatccagct cgtgcagact tacaaccagc
ttttcgaaga gaacccaatc 660aacgcttccg gtgtggatgc taaggctatt
ctttctgcta ggctctccaa gtccaggcgt 720cttgagaatc ttattgctca
gctcccaggc gagaagaaga acggactttt cggaaacttg 780atcgctctct
cccttggcct cactccaaac ttcaagtcca acttcgatct cgctgaggat
840gcaaagctcc agctctctaa ggatacttac gatgatgatc tcgataacct
cctcgctcag 900atcggagatc agtacgctga tttgttcctc gctgctaaga
acctctccga tgctatcctc 960ctctctgata tcctccgtgt gaacactgag
atcactaagg ctccactctc cgcttccatg 1020attaagaggt acgatgagca
ccaccaggat cttacacttc tcaaggctct tgtgaggcag 1080cagcttcctg
agaagtacaa agagattttc ttcgatcagt ccaagaacgg ctacgctggt
1140tacattgatg gtggcgcttc tcaagaagag ttctacaagt tcatcaagcc
aatcctcgaa 1200aagatggatg gaactgagga actcctcgtg aagctcaaca
gagaggatct ccttaggaag 1260cagaggactt tcgataacgg ctccattcca
caccagattc accttggtga gttgcacgct 1320attctcaggc gtcaagagga
tttctaccca ttcctcaagg ataaccgtga gaagatcgag 1380aagattctta
ctttccgtat cccttactac gtgggaccac ttgctagggg aaattctagg
1440ttcgcttgga tgactaggaa gtccgaagag actatcactc catggaactt
cgaagaggtg 1500gtggataagg gtgctagtgc tcagtctttc atcgagagga
tgactaactt cgataagaac 1560cttccaaacg agaaggtgct cccaaagcac
tctttgctct acgagtactt cactgtgtac 1620aacgagttga ctaaggtgaa
gtacgtgaca gagggcatga ggaagccagc ttttttgtct 1680ggtgagcaga
agaaggctat cgttgatctc ttgttcaaga ctaaccgtaa ggtgacagtg
1740aagcagctca aagaggatta cttcaagaaa atcgagtgct tcgattctgt
tgagatctcc 1800ggcgttgagg ataggttcaa tgcttccctt ggcacatacc
acgatttgct caagatcatt 1860aaggataagg atttcttgga taacgaggaa
aacgaggata ttcttgagga tatcgtgctt 1920actctcactc tcttcgagga
tcgtgagatg attgaggaaa ggctcaagac ttacgctcac 1980cttttcgatg
ataaggtgat gaagcagttg aagaggcgta ggtacactgg atggggaagg
2040ctttctagga agctcatcaa cggcatcagg gataagcagt ccggtaagac
tattctcgat 2100ttcctcaagt ccgatggctt cgcaaaccgt aacttcatgc
agctcatcca cgatgattcc 2160ctcactttta aagaggatat ccagaaggct
caggtttccg gacaaggcga ttctcttcat 2220gagcacattg ctaacctcgc
tggctcccca gctattaaga agggaattct ccagactgtg 2280aaagtggtgg
atgagttggt gaaggtgatg ggaaggcata agccagagaa catcgtgatc
2340gagatggcac gtgagaacca gactactcag aagggccaga agaactccag
ggaaaggatg 2400aagaggatcg aggaaggcat caaagagctt ggctcccaga
tccttaaaga gcacccagtt 2460gagaacactc agctccagaa tgagaagctc
tacctctact acctccagaa cggcagggat 2520atgtatgtgg atcaagagtt
ggatatcaac aggctctccg attatgatgt tgatcacatc 2580gtgccacagt
cattcttgaa ggatgattcc atcgataaca aggtgctcac taggtccgat
2640aagaataggg gcaagtctga taacgtgcca agtgaagagg ttgtgaagaa
aatgaagaac 2700tactggcgtc agcttctcaa cgctaagctc attactcagc
gtaagttcga taacttgaca 2760aaggctgaga ggggaggcct ctctgaattg
gataaggcag gattcatcaa gaggcagctc 2820gtggaaacta ggcagatcac
aaagcacgtg gcacagatcc tcgattccag gatgaacact 2880aagtatgatg
agaacgataa gttgatccgt gaggttaagg tgatcactct caagtctaag
2940ctcgtgtccg attttaggaa ggatttccaa ttctacaagg tgagggaaat
caacaactac 3000caccacgctc acgatgctta ccttaacgct gttgtgggaa
cagctcttat caagaagtat 3060ccaaagttgg agtccgagtt cgtgtacggc
gattacaagg tttacgatgt gaggaagatg 3120atcgctaagt ccgagcaaga
gatcggcaag gctactgcta agtacttttt ctactccaac 3180atcatgaatt
tcttcaagac agagatcaca ctcgctaacg gcgagattag gaagaggcca
3240ctcattgaga ctaacggtga gactggtgag atcgtgtggg ataagggaag
ggatttcgct 3300actgtgcgta aggtgctctc tatgccacag gtgaacattg
tgaaaaagac agaggtgcag 3360actggcggct tctccaaaga gtctattctc
ccaaagagga actccgataa gctcattgct 3420aggaagaagg attgggaccc
aaagaagtac ggcggcttcg atagtcctac tgtggcttac 3480tctgttcttg
tggtggctaa ggttgagaag ggcaagtcaa agaagctcaa gtctgttaag
3540gaattgctcg gcatcactat catggaaagg tcctcattcg agaagaaccc
tattgatttc 3600cttgaggcta agggctacaa agaggttaag aaggatctca
tcatcaagct ccctaagtac 3660tccttgttcg agcttgagaa cggccgtaag
aggatgcttg cttctgctgg tgaactccag 3720aagggaaacg aacttgctct
cccatccaag tacgttaact ttctctacct cgcttcccac 3780tacgagaagt
tgaagggatc accagaggat aacgaacaga agcaactttt cgttgagcag
3840cacaagcact atctcgatga gattatcgag cagatctccg agttctccaa
gcgtgtgatt 3900ctcgctgatg caaacctcga taaggtgttg tccgcttaca
acaagcaccg tgataagcct 3960attcgtgagc aggctgagaa catcatccac
cttttcactc tcactaacct cggtgctcca 4020gctgctttca agtacttcga
tacaacaatc gataggaagc gttacacatc cacaaaagag 4080gtgctcgatg
ctactctcat tcaccagtcc atcactggcc tttacgagac taggatcgat
4140ctttctcagc tcggaggcga ttga 4164191387PRTArtificial
SequenceDescription of Artificial Sequence Synthetic polypeptide
19Met Ala Pro Lys Lys Lys Arg Lys Val Glu Pro Gly Ser Pro Gly Gly 1
5 10 15 Gln Ser Leu Met Asp Lys Lys Tyr Ser Ile Gly Leu Asp Ile Gly
Thr 20 25 30 Asn Ser Val Gly Trp Ala Val Ile Thr Asp Glu Tyr Lys
Val Pro Ser 35 40 45 Lys Lys Phe Lys Val Leu Gly Asn Thr Asp Arg
His Ser Ile Lys Lys 50 55 60 Asn Leu Ile Gly Ala Leu Leu Phe Asp
Ser Gly Glu Thr Ala Glu Ala 65 70 75 80 Thr Arg Leu Lys Arg Thr Ala
Arg Arg Arg Tyr Thr Arg Arg Lys Asn 85 90 95 Arg Ile Cys Tyr Leu
Gln Glu Ile Phe Ser Asn Glu Met Ala Lys Val 100 105 110 Asp Asp Ser
Phe Phe His Arg Leu Glu Glu Ser Phe Leu Val Glu Glu 115 120 125 Asp
Lys Lys His Glu Arg His Pro Ile Phe Gly Asn Ile Val Asp Glu 130 135
140 Val Ala Tyr His Glu Lys Tyr Pro Thr Ile Tyr His Leu Arg Lys Lys
145 150 155 160 Leu Val Asp Ser Thr Asp Lys Ala Asp Leu Arg Leu Ile
Tyr Leu Ala 165 170 175 Leu Ala His Met Ile Lys Phe Arg Gly His Phe
Leu Ile Glu Gly Asp 180 185 190 Leu Asn Pro Asp Asn Ser Asp Val Asp
Lys Leu Phe Ile Gln Leu Val 195 200 205 Gln Thr Tyr Asn Gln Leu Phe
Glu Glu Asn Pro Ile Asn Ala Ser Gly 210 215 220 Val Asp Ala Lys Ala
Ile Leu Ser Ala Arg Leu Ser Lys Ser Arg Arg 225 230 235 240 Leu Glu
Asn Leu Ile Ala Gln Leu Pro Gly Glu Lys Lys Asn Gly Leu 245 250 255
Phe Gly Asn Leu Ile Ala Leu Ser Leu Gly Leu Thr Pro Asn Phe Lys 260
265 270 Ser Asn Phe Asp Leu Ala Glu Asp Ala Lys Leu Gln Leu Ser Lys
Asp 275 280 285 Thr Tyr Asp Asp Asp Leu Asp Asn Leu Leu Ala Gln Ile
Gly Asp Gln 290 295 300 Tyr Ala Asp Leu Phe Leu Ala Ala Lys Asn Leu
Ser Asp Ala Ile Leu 305 310 315 320 Leu Ser Asp Ile Leu Arg Val Asn
Thr Glu Ile Thr Lys Ala Pro Leu 325 330 335 Ser Ala Ser Met Ile Lys
Arg Tyr Asp Glu His His Gln Asp Leu Thr 340 345 350 Leu Leu Lys Ala
Leu Val Arg Gln Gln Leu Pro Glu Lys Tyr Lys Glu 355 360 365 Ile Phe
Phe Asp Gln Ser Lys Asn Gly Tyr Ala Gly Tyr Ile Asp Gly 370 375 380
Gly Ala Ser Gln Glu Glu Phe Tyr Lys Phe Ile Lys Pro Ile Leu Glu 385
390 395 400 Lys Met Asp Gly Thr Glu Glu Leu Leu Val Lys Leu Asn Arg
Glu Asp 405 410 415 Leu Leu Arg Lys Gln Arg Thr Phe Asp Asn Gly Ser
Ile Pro His Gln 420 425 430 Ile His Leu Gly Glu Leu His Ala Ile Leu
Arg Arg Gln Glu Asp Phe 435 440 445 Tyr Pro Phe Leu Lys Asp Asn Arg
Glu Lys Ile Glu Lys Ile Leu Thr 450 455 460 Phe Arg Ile Pro Tyr Tyr
Val Gly Pro Leu Ala Arg Gly Asn Ser Arg 465 470 475 480 Phe Ala Trp
Met Thr Arg Lys Ser Glu Glu Thr Ile Thr Pro Trp Asn 485 490 495 Phe
Glu Glu Val Val Asp Lys Gly Ala Ser Ala Gln Ser Phe Ile Glu 500 505
510 Arg Met Thr Asn Phe Asp Lys Asn Leu Pro Asn Glu Lys Val Leu Pro
515 520 525 Lys His Ser Leu Leu Tyr Glu Tyr Phe Thr Val Tyr
Asn Glu Leu Thr 530 535 540 Lys Val Lys Tyr Val Thr Glu Gly Met Arg
Lys Pro Ala Phe Leu Ser 545 550 555 560 Gly Glu Gln Lys Lys Ala Ile
Val Asp Leu Leu Phe Lys Thr Asn Arg 565 570 575 Lys Val Thr Val Lys
Gln Leu Lys Glu Asp Tyr Phe Lys Lys Ile Glu 580 585 590 Cys Phe Asp
Ser Val Glu Ile Ser Gly Val Glu Asp Arg Phe Asn Ala 595 600 605 Ser
Leu Gly Thr Tyr His Asp Leu Leu Lys Ile Ile Lys Asp Lys Asp 610 615
620 Phe Leu Asp Asn Glu Glu Asn Glu Asp Ile Leu Glu Asp Ile Val Leu
625 630 635 640 Thr Leu Thr Leu Phe Glu Asp Arg Glu Met Ile Glu Glu
Arg Leu Lys 645 650 655 Thr Tyr Ala His Leu Phe Asp Asp Lys Val Met
Lys Gln Leu Lys Arg 660 665 670 Arg Arg Tyr Thr Gly Trp Gly Arg Leu
Ser Arg Lys Leu Ile Asn Gly 675 680 685 Ile Arg Asp Lys Gln Ser Gly
Lys Thr Ile Leu Asp Phe Leu Lys Ser 690 695 700 Asp Gly Phe Ala Asn
Arg Asn Phe Met Gln Leu Ile His Asp Asp Ser 705 710 715 720 Leu Thr
Phe Lys Glu Asp Ile Gln Lys Ala Gln Val Ser Gly Gln Gly 725 730 735
Asp Ser Leu His Glu His Ile Ala Asn Leu Ala Gly Ser Pro Ala Ile 740
745 750 Lys Lys Gly Ile Leu Gln Thr Val Lys Val Val Asp Glu Leu Val
Lys 755 760 765 Val Met Gly Arg His Lys Pro Glu Asn Ile Val Ile Glu
Met Ala Arg 770 775 780 Glu Asn Gln Thr Thr Gln Lys Gly Gln Lys Asn
Ser Arg Glu Arg Met 785 790 795 800 Lys Arg Ile Glu Glu Gly Ile Lys
Glu Leu Gly Ser Gln Ile Leu Lys 805 810 815 Glu His Pro Val Glu Asn
Thr Gln Leu Gln Asn Glu Lys Leu Tyr Leu 820 825 830 Tyr Tyr Leu Gln
Asn Gly Arg Asp Met Tyr Val Asp Gln Glu Leu Asp 835 840 845 Ile Asn
Arg Leu Ser Asp Tyr Asp Val Asp His Ile Val Pro Gln Ser 850 855 860
Phe Leu Lys Asp Asp Ser Ile Asp Asn Lys Val Leu Thr Arg Ser Asp 865
870 875 880 Lys Asn Arg Gly Lys Ser Asp Asn Val Pro Ser Glu Glu Val
Val Lys 885 890 895 Lys Met Lys Asn Tyr Trp Arg Gln Leu Leu Asn Ala
Lys Leu Ile Thr 900 905 910 Gln Arg Lys Phe Asp Asn Leu Thr Lys Ala
Glu Arg Gly Gly Leu Ser 915 920 925 Glu Leu Asp Lys Ala Gly Phe Ile
Lys Arg Gln Leu Val Glu Thr Arg 930 935 940 Gln Ile Thr Lys His Val
Ala Gln Ile Leu Asp Ser Arg Met Asn Thr 945 950 955 960 Lys Tyr Asp
Glu Asn Asp Lys Leu Ile Arg Glu Val Lys Val Ile Thr 965 970 975 Leu
Lys Ser Lys Leu Val Ser Asp Phe Arg Lys Asp Phe Gln Phe Tyr 980 985
990 Lys Val Arg Glu Ile Asn Asn Tyr His His Ala His Asp Ala Tyr Leu
995 1000 1005 Asn Ala Val Val Gly Thr Ala Leu Ile Lys Lys Tyr Pro
Lys Leu 1010 1015 1020 Glu Ser Glu Phe Val Tyr Gly Asp Tyr Lys Val
Tyr Asp Val Arg 1025 1030 1035 Lys Met Ile Ala Lys Ser Glu Gln Glu
Ile Gly Lys Ala Thr Ala 1040 1045 1050 Lys Tyr Phe Phe Tyr Ser Asn
Ile Met Asn Phe Phe Lys Thr Glu 1055 1060 1065 Ile Thr Leu Ala Asn
Gly Glu Ile Arg Lys Arg Pro Leu Ile Glu 1070 1075 1080 Thr Asn Gly
Glu Thr Gly Glu Ile Val Trp Asp Lys Gly Arg Asp 1085 1090 1095 Phe
Ala Thr Val Arg Lys Val Leu Ser Met Pro Gln Val Asn Ile 1100 1105
1110 Val Lys Lys Thr Glu Val Gln Thr Gly Gly Phe Ser Lys Glu Ser
1115 1120 1125 Ile Leu Pro Lys Arg Asn Ser Asp Lys Leu Ile Ala Arg
Lys Lys 1130 1135 1140 Asp Trp Asp Pro Lys Lys Tyr Gly Gly Phe Asp
Ser Pro Thr Val 1145 1150 1155 Ala Tyr Ser Val Leu Val Val Ala Lys
Val Glu Lys Gly Lys Ser 1160 1165 1170 Lys Lys Leu Lys Ser Val Lys
Glu Leu Leu Gly Ile Thr Ile Met 1175 1180 1185 Glu Arg Ser Ser Phe
Glu Lys Asn Pro Ile Asp Phe Leu Glu Ala 1190 1195 1200 Lys Gly Tyr
Lys Glu Val Lys Lys Asp Leu Ile Ile Lys Leu Pro 1205 1210 1215 Lys
Tyr Ser Leu Phe Glu Leu Glu Asn Gly Arg Lys Arg Met Leu 1220 1225
1230 Ala Ser Ala Gly Glu Leu Gln Lys Gly Asn Glu Leu Ala Leu Pro
1235 1240 1245 Ser Lys Tyr Val Asn Phe Leu Tyr Leu Ala Ser His Tyr
Glu Lys 1250 1255 1260 Leu Lys Gly Ser Pro Glu Asp Asn Glu Gln Lys
Gln Leu Phe Val 1265 1270 1275 Glu Gln His Lys His Tyr Leu Asp Glu
Ile Ile Glu Gln Ile Ser 1280 1285 1290 Glu Phe Ser Lys Arg Val Ile
Leu Ala Asp Ala Asn Leu Asp Lys 1295 1300 1305 Val Leu Ser Ala Tyr
Asn Lys His Arg Asp Lys Pro Ile Arg Glu 1310 1315 1320 Gln Ala Glu
Asn Ile Ile His Leu Phe Thr Leu Thr Asn Leu Gly 1325 1330 1335 Ala
Pro Ala Ala Phe Lys Tyr Phe Asp Thr Thr Ile Asp Arg Lys 1340 1345
1350 Arg Tyr Thr Ser Thr Lys Glu Val Leu Asp Ala Thr Leu Ile His
1355 1360 1365 Gln Ser Ile Thr Gly Leu Tyr Glu Thr Arg Ile Asp Leu
Ser Gln 1370 1375 1380 Leu Gly Gly Asp 1385 20103DNAArtificial
SequenceDescription of Artificial Sequence Synthetic polynucleotide
20gattctcgaa ttagctaatc gttttagagc tagaaatagc aagttaaaat aaggctagtc
60cgttatcaac ttgaaaaagt ggcaccgagt cggtgctttt ttt
1032134PRTUnknownDescription of Unknown Central repeat domain
polypeptide 21Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser His Asp
Gly Gly Lys 1 5 10 15 Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro
Val Leu Cys Gln Ala 20 25 30 His Gly 221018DNAArtificial
SequenceDescription of Artificial Sequence Synthetic polynucleotide
22gtttacagta ccatatatcc tgtcaattct cgaattagct aatcaggtgc tgttatagcc
60cttaattttg agtttttttt cggttgtctt gatggaaaag gcctaaaatt tgagtttttt
120tacgttggtt tgatggaaaa ggcctacaat tggagttttc cccgttgttt
tgatgaaaaa 180gcccctagtt tgagattttt tttctgtcga ttcgattcta
aaggtttaaa attagagttt 240ttacatttgt ttgatgaaaa aggccttaaa
tttgagtttt tccggttgat ttgatgaaaa 300agccctagaa tttgtgtttt
ttcgtcggtt tgattctgaa ggcctaaaat ttgagtttct 360ccggctgttt
tgatgaaaaa gccctaaatt tgagtttctc cggctgtttt gatgaaaaag
420ccctaaattt gagttttttc cccgtgtttt agattgtttg gttttaattc
tcgaatcagc 480taatcaggga gtgtgaaaag ccctaaattt gagttttttt
cgttgttctg attgttgttt 540ttatgaattt gcagatgcag atctttgtga
aaactctcac cggaaagact atcaccctag 600aggtggaaag ttctgataca
atcgacaacg ttaaggctaa gatccaggat aaggaaggaa 660ttcccccgga
tcagcaaagg cttatcttcg ccggaaagca gttggaggac ggacgtactc
720tagctgatta caacatccag aaggagtcta ccctccattt ggtgctccgt
ctacgtggag 780gtatggcggc tgctgcctca ccatctccat gtttctccaa
aaccctacct ccatcttcct 840ccaaatcttc caccattctt cctagatcta
ccttcccttt ccacaatcac cctcaaaaag 900cctcacccct tcatctcacc
cacacccatc atcatcgtcg tggtttcgcc gtttccaatg 960tcgtcatatc
cactaccacc cataacgacg tttctgaacc tgaaacattc gtttcccg
10182366DNAArtificial SequenceDescription of Artificial Sequence
Synthetic oligonucleotide 23atgttaaggg ctataacagc acctgattag
ctaattcgag aatctaaaca gacaaaaacc 60aaagtc 662460DNAArtificial
SequenceDescription of Artificial Sequence Synthetic
oligonucleotide 24tttggttttt gtctgtttag attctcgaat tagctaatca
ggtgctgtta tagcccttaa 602560DNAArtificial SequenceDescription of
Artificial Sequence Synthetic oligonucleotide 25tttggttttt
gtctgtttag attctcgaat tagctaatca ggtgctgtta tagcccttaa
602660DNAArtificial SequenceDescription of Artificial Sequence
Synthetic oligonucleotide 26tttggttttt gtctgtttag attctcgaat
tagttaatca ggtgctgtta tagcccttaa 602755DNAArtificial
SequenceDescription of Artificial Sequence Synthetic
oligonucleotide 27tttggttttt gtctgtttag attctcgaat aatcaggtgc
tgttatagcc cttaa 552853DNAArtificial SequenceDescription of
Artificial Sequence Synthetic oligonucleotide 28tttggttttt
gtctgtttag attctcgaaa tcaggtgctg ttatagccct taa 532945DNAArtificial
SequenceDescription of Artificial Sequence Synthetic
oligonucleotide 29tttggttttt gtctgtttag attctcgaac tgttatagcc cttaa
453052DNAArtificial SequenceDescription of Artificial Sequence
Synthetic oligonucleotide 30tttggttttt gtctgtttag attctcgaat
caggtgctgt tatagccctt aa 523156DNAArtificial SequenceDescription of
Artificial Sequence Synthetic oligonucleotide 31tttggttttt
gtctgtttag attctcgaat taatcaggtg ctgttatagc ccttaa
563252DNAArtificial SequenceDescription of Artificial Sequence
Synthetic oligonucleotide 32tttggttttt gtctgcttag attctcgaat
taggtgctgt tatagccctt aa 523353DNAArtificial SequenceDescription of
Artificial Sequence Synthetic oligonucleotide 33tttggttttt
gtctgtttag attctcgaat tcaggtgctg ttatagccct taa 533455DNAArtificial
SequenceDescription of Artificial Sequence Synthetic
oligonucleotide 34tttggttttt gtctgtttag attctcgaat tatcaggtgc
tgttatagcc cttaa 553556DNAArtificial SequenceDescription of
Artificial Sequence Synthetic oligonucleotide 35tttggttttt
gtctgtttag attctcgaat taatcaggtg ctgttatagc ccttaa
563658DNAArtificial SequenceDescription of Artificial Sequence
Synthetic oligonucleotide 36tttggttttt gtctgtttag attctcgaat
tataatcagg tgctgttata gcccttaa 583761DNAArtificial
SequenceDescription of Artificial Sequence Synthetic
oligonucleotide 37tttggttttt gtctgtttag attctcgaat tagcctaatc
aggtgctgtt atagccctta 60a 61381450DNAArtificial SequenceDescription
of Artificial Sequence Synthetic polynucleotide 38ctgatttcta
ttataatttc tattaattgc cttcaaattt ctctttcaag gttagaaatc 60ttctctattt
tttggttttt gtctgtttag attctcgaat tagctaatca ggtgctgtta
120aagccctaaa atttgagttt tttttccgtc gaattgatgc taaaggctta
aaattagagt 180tttttcgtcg gtttgactct gaaggcctaa aatttggggt
tttccgggtg atttgatgat 240aaagccctag aatttgagtt tttttatttg
tcggtttgat gaaaaaggcc ttaaatttaa 300tttttttccc ggttgatttg
atgaaaaagc cctagaattt gtgttttttc gtcggtttga 360ttctaaaggc
ctaaaatttg agtttttccg gttgttttga tgaaaaagcc ctaaaatttg
420agttttttcc ccgtgtttta gattgtttgg ttttaattct tgaatcagat
aatcagggag 480tgtgaaaagc cctaaaattt gagttttttt cgttgttctg
attgttgttt ttatgaattt 540gattctcgaa ttagctaatc aggtgctgtt
atagccctta attttgagtt ttttttcggt 600tgtcttgatg gaaaaggcct
aaaatttgag tttttttacg ttggtttgat ggaaaaggcc 660tacaattgga
gttttccccg ttgttttgat gaaaaagccc ctagtttgag attttttttc
720tgtcgattcg attctaaagg tttaaaatta gagtttttac atttgtttga
tgaaaaaggc 780cttaaatttg agtttttccg gttgatttga tgaaaaagcc
ctagaatttg tgtttttcgt 840cggtttgatt ctgaaggttt gattctgaag
gcctaaaatt tgagtttctc cggctgtttt 900gatgaaaaag ccctaaattt
gagtttctcc ggctgttttg atgaaaaagc cctaaatttg 960agttttttcc
ccgtgtttta gattgtttgg ttttaattct cgaatcagct aatcagggag
1020tgtgaaaagc cctaaatttg agtttttttc gttgttctga ttgttgtttt
tatgaatttg 1080cagatgcaga tctttgtgaa aactctcacc ggaaagacta
tcaccctaga ggtggaaagt 1140tctgatacaa tcgacaacgt taaggctaag
atccaggata aggaaggaat tcccccggat 1200cagcaaaggc ttatcttcgc
cggaaagcag ttggaggacg gacgtactct agctgattac 1260aacatccaga
aggagtctac cctccatttg gtgctccgtc tacgtggagg tatggcggct
1320gctgcctcac catctccatg tttctccaaa accctacctc catcttcctc
caaatcttcc 1380accattcttc ctagatctac cttccctttc cacaatcacc
ctcaaaaagc ctcacccctt 1440catctcaccc 145039891DNAArtificial
SequenceDescription of Artificial Sequence Synthetic polynucleotide
39ttttcatctt ctatctgatt tctattataa tttctattaa ttgccttcaa atttctcttt
60caaggttaga aatcttctct attttttggt ttttgtctgt ttagattctc gaattagcta
120atcaggtgct gttaaagccc taaaatttga gttttttttc cgccgaattg
atgctaaagg 180cttaaaatta gggttttttc gtcggtttga ctctgaaggc
ctaaaatttg gggttttccg 240ggtgatttga tgataaagcc ctagaatttg
agttttttta tttgtcggtt tgatgaaaaa 300ggccttaaat ttaatttttt
tcccggttga tttgatgaaa aagccctaga atttgtgttt 360tttcgtcggt
ttgattctaa aggcctaaaa tttgagtttt tccggttgtt ttgatgaaaa
420agccctaaaa tttgagtttt ttccccgtgt tttagattgt ttggttttaa
ttcttgaatc 480agataatcag ggagtgtgaa aagccctaaa tttgagtttt
tttcgttgtt ctgattgttg 540tttttatgaa tttgcagatg cagatctttg
tgaaaactct caccggaaag actatcaccc 600tagaggtgga aacaatcgac
aacgttaagg ctaagatcca ggataaggaa ggaattcccc 660cggatcagca
aaggcttatc ttcgccggaa agcagttgga ggacggacgt actctagctg
720attacaacat ccagaaggag tctaccctcc atttggtgct ccgtctacgt
ggaggtatgg 780cggctgctgc ctcaccatct ccatgcttct ccaaaaccct
acctccatct tcctccaaat 840cttccaccat tcttcctaga tctaccttcc
ctttccacaa tcaccctcaa a 89140900DNAArtificial SequenceDescription
of Artificial Sequence Synthetic polynucleotide 40ttttcatctt
ctatctgatt tctattataa tttctattaa ttgccttcaa atttctcttt 60caaggttaga
aatcttctct attttttggt ttttgtctgt ttagattctc gaattagcta
120atcaggtgct gttaaagccc taaaatttga gttttttttc cgccgaattg
atgctaaagg 180cttaaaatta gggttttttc gtcggtttga ctctgaaggc
ctaaaatttg gggttttccg 240ggtgatttga tgataaagcc ctagaatttg
agttttttta tttgtcggtt tgatgaaaaa 300ggccttaaat ttaatttttt
tcccggttga tttgatgaaa aagccctaga atttgtgttt 360tttcgtcggt
ttgattctaa aggcctaaaa tttgagtttt tccggttgtt ttgatgaaaa
420agccctaaaa tttgagtttt ttccccgtgt tttagattgt ttggttttaa
ttcttgaatc 480agataatcag ggagtgtgaa aagccctaaa tttgagtttt
tttcgttgtt ctgattgttg 540tttttatgaa tttgcagatg cagatctttg
tgaaaactct caccggaaag actatcaccc 600tagaggtgga aagttctgat
acaatcgaca acgttaaggc taagatccag gataaggaag 660gaattccccc
ggatcagcaa aggcttatct tcgccggaaa gcagttggag gacggacgta
720ctctagctga ttacaacatc cagaaggagt ctaccctcca tttggtgctc
cgtctacgtg 780gaggtatggc ggctgctgcc tcaccatctc catgcttctc
caaaacccta cctccatctt 840cctccaaatc ttccaccatt cttcctagat
ctaccttccc tttccacaat caccctcaaa 9004129DNAArtificial
SequenceDescription of Artificial Sequence Synthetic
oligonucleotide 41gattctcgaa ttagctaatc gttttagag
294221DNAArtificial SequenceDescription of Artificial Sequence
Synthetic oligonucleotide 42caccgagtcg gtgctttttt t
214360DNAArtificial SequenceDescription of Artificial Sequence
Synthetic oligonucleotide 43ttaagggcta taacagcacc tgattagcta
attcgagaat ctaaacagac aaaaaccaaa 604456DNAArtificial
SequenceDescription of Artificial Sequence Synthetic
oligonucleotide 44ttaagggcta taacagcacc tgattaattc gagaatctaa
acagacaaaa accaaa 564555DNAArtificial SequenceDescription of
Artificial Sequence Synthetic oligonucleotide 45ttaagggcta
taacagcacc tgataattcg agaatctaaa cagacaaaaa ccaaa
554653DNAArtificial SequenceDescription of Artificial Sequence
Synthetic oligonucleotide 46ttaagggcta taacagcacc tgatttcgag
aatctaaaca gacaaaaacc aaa 534752DNAArtificial SequenceDescription
of Artificial Sequence Synthetic oligonucleotide 47ttaaggggca
caacagcacc tgattcgaga atctaaacag acaaaaacca aa 524852DNAArtificial
SequenceDescription of Artificial Sequence Synthetic
oligonucleotide 48ttaagggcta taacagcacc tgattcgaga atctaaacag
acaaaaacca aa 524946DNAArtificial SequenceDescription of Artificial
Sequence Synthetic oligonucleotide 49ttaagggcta taacagcatc
gagaatctaa acagacaaaa accaaa 465060DNAArtificial
SequenceDescription of Artificial Sequence Synthetic
oligonucleotide 50ttaagggcta taacagcacc tgattagata attcgagaat
ctaaacagac aaaaaccaaa 605160DNAArtificial SequenceDescription of
Artificial Sequence Synthetic oligonucleotide 51ttaagggaaa
gaacagcaca gcactagata attcgagaat ctaaacagac aaaaaccaaa 60
* * * * *