U.S. patent application number 15/619658 was filed with the patent office on 2017-10-05 for method for producing patchoulol and 7-epi-alpha-selinene.
This patent application is currently assigned to FIRMENICH SA. The applicant listed for this patent is FIRMENICH SA. Invention is credited to Fabienne DEGUERRY, Michel SCHALK.
Application Number | 20170283841 15/619658 |
Document ID | / |
Family ID | 42710599 |
Filed Date | 2017-10-05 |
United States Patent
Application |
20170283841 |
Kind Code |
A1 |
SCHALK; Michel ; et
al. |
October 5, 2017 |
METHOD FOR PRODUCING PATCHOULOL AND 7-EPI-ALPHA-SELINENE
Abstract
The present invention provides a method of producing patchoulol
and 7-epi-.alpha.-selinene, said method comprising contacting at
least one polypeptide with farnesyl phyrophosphate (FPP). In
particular, said method may be carried out in vitro or in vivo to
produce patchoulol and 7-epi-.alpha.-selinene, compounds which can
be useful in the field of perfumery. The present invention also
provides the amino acid sequence of a polypeptide useful in the
method of the invention. A nucleic acid encoding the polypeptide of
the invention and an expression vector containing said nucleic acid
are also part of the present invention. A non-human host organism
or a cell transformed to be used in the method of producing
patchoulol and 7-epi-.alpha.-selinene is also an object of the
present invention.
Inventors: |
SCHALK; Michel;
(Collonges-Sous Saleve, FR) ; DEGUERRY; Fabienne;
(Geneva, CH) |
|
Applicant: |
Name |
City |
State |
Country |
Type |
FIRMENICH SA |
Geneva 8 |
|
CH |
|
|
Assignee: |
FIRMENICH SA
Geneva 8
CH
|
Family ID: |
42710599 |
Appl. No.: |
15/619658 |
Filed: |
June 12, 2017 |
Related U.S. Patent Documents
|
|
|
|
|
|
Application
Number |
Filing Date |
Patent Number |
|
|
14577475 |
Dec 19, 2014 |
9714440 |
|
|
15619658 |
|
|
|
|
13695840 |
Nov 2, 2012 |
8927238 |
|
|
PCT/IB2011/051991 |
May 5, 2011 |
|
|
|
14577475 |
|
|
|
|
61333002 |
May 10, 2010 |
|
|
|
Current U.S.
Class: |
1/1 |
Current CPC
Class: |
C12N 9/88 20130101; C12Y
402/0307 20130101; C12P 15/00 20130101; C12N 15/8243 20130101; C12P
5/002 20130101 |
International
Class: |
C12P 15/00 20060101
C12P015/00; C12P 5/00 20060101 C12P005/00; C12N 9/88 20060101
C12N009/88; C12N 15/82 20060101 C12N015/82 |
Foreign Application Data
Date |
Code |
Application Number |
May 10, 2010 |
EP |
10162433.6 |
Claims
1. A recombinant polypeptide having sesquiterpene synthase activity
and comprising an amino acid sequence having at least 90% sequence
identity to SEQ ID NO: 1, wherein the polypeptide when expressed
produces a mixture of terpenes comprising one or more of patchoulol
and 7-epi-.alpha.-selinene.
2. The polypeptide of claim 1, wherein the polypeptide comprises an
amino acid sequence having at least 95% or 98% sequence identity to
SEQ ID NO: 1.
3. The polypeptide of claim 1, wherein the polypeptide comprises
the amino acid sequence of SEQ ID NO: 1.
4. An isolated recombinant nucleic acid molecule comprising a) a
nucleotide sequence encoding the polypeptide of claim 1; b) a
nucleotide sequence having at least 90% sequence identity to SEQ ID
NO: 2 or the complement thereof.
5. The nucleic acid molecule of claim 4, wherein the nucleic acid
molecule comprises the nucleotide sequence of SEQ ID NO: 2 or the
complement thereof.
6. An expression vector comprising the nucleic acid molecule of
claim 4 or a nucleic acid molecule comprising the nucleotide
sequence of SEQ ID NO: 2 or the complement thereof.
7. The expression vector of claim 6, wherein the vector is in the
form of a viral vector, a bacteriophage or a plasmid.
8. The expression vector of claim 6, wherein the vector further
comprises at least one regulatory sequence operably linked to the
nucleic acid, and optionally at least one selection marker.
9. The expression vector of claim 8, wherein the at least one
regulatory sequence controls transcription, translation initiation
or termination, and wherein the at least one regulatory sequence
comprises a transcriptional promoter, operator or enhancer or an
mRNA ribosomal binding site.
10. A non-human host organism or a host cell transformed to harbor
at least one nucleic acid molecule of claim 4 or a vector
comprising said nucleic acid, so that it heterologously expresses
or over-expresses at least one polypeptide encoded by the nucleic
acid molecule.
11. The non-human host organism of claim 10, wherein the non-human
host organism or host cell is a plant, a microorganism, a
prokaryote, or a fungus.
12. The non-human host organism or host cell of claim 11, wherein
the non-human host organism or host cell is a bacteria or a
yeast.
13. The non-human host organism or host cell of claim 12, wherein
the bacteria is E. coli and the yeast is Saccharomyces
cerevisiae.
14. The host cell of claim 10, wherein the host cell is a plant
cell or a fungal cell.
15. A method for producing at least one polypeptide having
sesquiterpene synthase activity comprising a) culturing a non-human
host organism or a host cell transformed to express or over-express
the polypeptide of claim 1 under conditions that allow for the
production of the polypeptide; and b) optionally isolating the
polypeptide from the non-human host organism or cell cultured in
step a) or from culture media used in culturing the non-human host
organism or a host cell.
16. The method of claim 15, further comprising, prior to step a),
transforming a non-human host organism or host cell with an
expression vector comprising the nucleic acid molecule of claim 4,
so that it harbors said nucleic acid and expresses or
over-expresses a polypeptide encoded by said nucleic acid.
17. A method for preparing a variant polypeptide having
sesquiterpene synthase activity comprising a patchoulol and
7-epi-.alpha.-selinene synthase activity, the method comprising:
(a) selecting a nucleic acid according to claim 4; (b) modifying
the selected nucleic acid to obtain at least one mutant nucleic
acid; (c) transforming host cells or unicellular organisms with the
mutant nucleic acid sequence to express a polypeptide encoded by
the mutant nucleic acid sequence; (d) screening the polypeptide for
at least one modified property; and, (e) optionally, if the
polypeptide has no desired variant patchoulol and
7-epi-.alpha.-selinene synthase activity, repeating the process
steps (a) to (d) until a polypeptide with a desired variant
patchoulol and 7-epi-.alpha.-selinene synthase activity is
obtained; (f) optionally, if a polypeptide having a desired variant
patchoulol and 7-epi-.alpha.-selinene synthase activity was
identified in step d), isolating the corresponding mutant nucleic
acid obtained in step (c).
Description
CROSS-REFERENCE TO RELATED APPLICATIONS
[0001] This application is a divisional of U.S. patent application
Ser. No. 14/577,475 filed on Dec. 19, 2014, which is a continuation
of U.S. patent application Ser. No. 13/695,840 filed Nov. 2, 2012,
now U.S. Pat. No. 8,927,238 issued Jan. 6, 2015, which is a
national stage application under 35 U.S.C. .sctn.371 of
International Patent Application PCT/IB2011/051991 filed on May 5,
2011, which claims benefit of U.S. Provisional Application No.
61/333,002 filed on May 10, 2010 and of European Application No.
10162433.6 filed on May 10, 2010. The entire contents of each of
these applications are hereby incorporated by reference herein in
their entirety.
SUBMISSION OF SEQUENCE LISTING
[0002] The Sequence Listing associated with this application is
filed in electronic format via EFS-Web and hereby incorporated by
reference into the specification in its entirety. The name of the
text file containing the Sequence Listing is
7930USDIV1_SequenceListing. The size of the text file is 12 KB, and
the text file was created on Jun. 9, 2017.
TECHNICAL FIELD
[0003] The present invention provides a method of producing
patchoulol and 7-epi-.alpha.-selinene, said method comprising
contacting at least one polypeptide with farnesyl phyrophosphate
(FPP). In particular, said method may be carried out in vitro or in
vivo to produce patchoulol and 7-epi-.alpha.-selinene, compounds
which can be useful in the field of perfumery. The present
invention also provides the amino acid sequence of a polypeptide
useful in the method of the invention. A nucleic acid encoding the
polypeptide of the invention and an expression vector containing
said nucleic acid are also part of the present invention. A
non-human host organism or a cell transformed to be used in the
method of producing patchoulol and 7-epi-.alpha.-selinene is also
an object of the present invention.
PRIOR ART
[0004] Terpenes are found in most organisms (microorganisms,
animals and plants). These compounds are made up of five carbon
units called isoprene units and are classified by the number of
these units present in their structure. Thus monoterpenes,
sesquiterpenes and diterpenes are terpenes containing 10, 15 and 20
carbon atoms respectively. Sesquiterpenes, for example, are widely
found in the plant kingdom. Many sesquiterpene molecules are known
for their flavor and fragrance properties and their cosmetic,
medicinal and antimicrobial effects. Over 300 sesquiterpene
hydrocarbons and 3000 sesquiterpenoids have been identified and
many new structures are identified each year. Plant extracts
obtained by different means such as steam distillation or solvent
extraction are used as source of terpenes. Terpene molecules are
often used as such, but in some cases chemical reactions are used
to transform the terpenes into other high value molecules.
[0005] Biosynthetic production of terpenes involves enzymes called
terpene synthases. There is virtually an infinity of sesquiterpene
synthases present in the plant kingdom, all using the same
substrate (farnesyl pyrophosphate, FPP) but having different
product profiles. Genes and cDNAs encoding sesquiterpene synthases
have been cloned and the corresponding recombinant enzymes
characterized. The biosynthesis of terpenes in plants and other
organisms has been extensively studied and is not further detailed
in here.
[0006] Generally, the price and availability of plant natural
extracts are dependent on the abundance, oil yield and geographical
origin of the plants. In addition, the availability and quality of
natural extracts is very much dependent on climate and other local
conditions leading to variability from year to year, rendering the
use of such ingredients in high quality perfumery very difficult or
even impossible some years.
[0007] Valerian oil is one of these natural extracts. It is an
aromatic oil, of which some components can advantageously be used
in the field of perfumery and flavors. However, the purification of
these individual constituents from the oil is not feasible at large
scale.
[0008] A plant-independent method for producing the valerian oil
constituents would therefore be very desirable but a cost-effective
chemical synthesis of such compounds is so far not available.
[0009] Patchoulol and 7-epi-.alpha.-selinene are naturally
occurring sesquiterpene molecules, which are useful in the field of
perfumery and flavoring. A biochemical pathway leading to the
synthesis of patchoulol and 7-epi-.alpha.-selinene would therefore
be of great interest.
[0010] Analysis of the composition of Valeriana jatamansi oil
showed that patchoulol was the major constituent of this oil. See
for example Mathela C. S. et al. (2005), Chem. Biodivers. 2(9),
1174-1182 and Bos R. et al. Flay. Fragr. J. 12(2), 123-131.
Nevertheless, this document does not provide or even suggest an
amino acid or nucleotide sequence leading to the production of
patchoulol and 7-epi-.alpha.-selinene.
[0011] A sesquiterpene synthase capable of synthesizing patchoulol
has already been described in WO 2005/052163. A sesquiterpene
synthase capable of synthesizing 7-epi-.alpha.-selinene synthase
has been reported in Lucker et al (2004), Phytochemistry 65,
2649-2659. However, a sesquiterpene synthase capable of
synthesizing both patchoulol and 7-epi-.alpha.-selinene has never
been disclosed in the prior art.
[0012] The percentage of identity between the known sesquiterpene
synthases and the polypeptide of the invention is very low. In
particular, the patchoulol synthase described in WO 2005/052163 is
only 39% identical to the sesquiterpene synthase of the invention.
The closest protein sequence to the sesquiterpene synthase of the
invention are a vetispiradiene synthase from Solanum tuberosus
(NCBI accession No. AAD02269) and a predicted protein from Populus
trichocarpa (NCBI accession no XP_002321642 having both 49%
identity with the present sesquiterpene synthase.
[0013] Despite extensive studies of terpene cyclization, the
isolation and characterization of the terpene synthases is still
difficult, particularly in plants, due to their low abundance,
their often transient expression patterns, and the complexity of
purifying them from the mixtures of resins and phenolic compounds
in tissues where they are expressed.
[0014] It is an objective of the present invention to provide
methods for making patchoulol and 7-epi-.alpha.-selinene in an
economic way, as indicated above. Accordingly, the present
invention has the objective to produce patchoulol and
7-epi-.alpha.-selinene while having little waste, a more energy and
resource efficient process and while reducing dependency on fossil
fuels. It is a further objective to provide enzymes capable of
synthesizing patchoulol and 7-epi-.alpha.-selinene, which are
useful as perfumery and/or aroma ingredients.
ABBREVIATIONS USED
[0015] bp base pair [0016] BSA bovine serum albumin [0017] cDNA
complementary DNA [0018] s DNA deoxyribonucleic acid [0019] dNTP
deoxy nucleotide triphosphate [0020] DTT dithiothreitol [0021] FPP
farnesyl pyrophosphate [0022] GC gaseous chromatograph [0023] IPTG
isopropyl-D-thiogalacto-pyranoside [0024] LB lysogeny broth [0025]
MOPSO 3-(N-morpholino)-2-hydroxypropanesulfonic acid [0026] MS mass
spectrometer [0027] PCR polymerase chain reaction [0028] RMCE
recombinase-mediated cassette exchange [0029] 3'-/5'-RACE 3' and 5'
rapid amplification of cDNA ends [0030] RNA ribonucleic acid [0031]
mRNA messenger ribonucleic acid
DESCRIPTION OF THE INVENTION
[0032] The present invention provides a method to biosynthetically
produce patchoulol and 7-epi-.alpha.-selinene in an economic,
reliable and reproducible way.
[0033] A "sesquiterpene synthase" or a "polypeptide having a
sesquiterpene synthase activity" is intended here as a polypeptide
capable of catalyzing the synthesis of a sesquiterpene molecule or
of a mixture of sesquiterpene molecules from the acyclic terpene
precursor FPP.
[0034] As a "patchoulol and 7-epi-.alpha.-selinene synthase" or as
a "polypeptide having a patchoulol and 7-epi-.alpha.-selinene
synthase activity", we mean here a polypeptide capable of
catalyzing the synthesis of both patchoulol and
7-epi-.alpha.-selinene starting from FPP. Patchoulol and
7-epi-.alpha.-selinene may be the only products or may be part of a
mixture with other sesquiterpenes.
[0035] The ability of a polypeptide to catalyze the synthesis of
particular sesquiterpenes (for example patchoulol and
7-epi-.alpha.-selinene) can be simply confirmed by performing the
enzyme assay as detailed in Example 2.
[0036] According to the present invention, polypeptides are also
meant to include truncated polypeptides provided that they keep
their sesquiterpene synthase activity as defined in any of the
embodiments of the present invention and that they share at least
the defined percentage of identity with the corresponding fragment
of SEQ ID NO: 1.
[0037] As intended herein below, "a nucleotide sequence obtained by
modifying SEQ ID NO:2 or the complement thereof" encompasses any
sequence that has been obtained by changing the sequence of SEQ ID
NO:2 or of the complement thereof using any method known in the
art, for example by introducing any type of mutations such as
deletion, insertion or substitution mutations. Examples of such
methods are cited in the part of the description relative to the
variant polypeptides and the methods to prepare them.
[0038] The percentage of identity between two peptidic or
nucleotidic sequences is a function of the number of amino acids or
nucleotide residues that are identical in the two sequences when an
alignment of these two sequences has been generated. Identical
residues are defined as residues that are the same in the two
sequences in a given position of the alignment. The percentage of
sequence identity, as used herein, is calculated from the optimal
alignment by taking the number of residues identical between two
sequences dividing it by the total number of residues in the
shortest sequence and multiplying by 100. The optimal alignment is
the alignment in which the percentage of identity is the highest
possible. Gaps may be introduced into one or both sequences in one
or more positions of the alignment to obtain the optimal alignment.
These gaps are then taken into account as non-identical residues
for the calculation of the percentage of sequence identity.
[0039] Alignment for the purpose of determining the percentage of
amino acid or nucleic acid sequence identity can be achieved in
various ways using computer programs and for instance publicly
available computer programs available on the worldwide web.
Preferably, the BLAST program (Tatiana et al, FEMS Microbiol Lett.,
1999, 174:247-250, 1999) set to the default parameters, available
from the National Center for Biotechnology Information (NCBI), can
be used to obtain an optimal alignment of peptidic or nucleotidic
sequences and to calculate the percentage of sequence identity.
[0040] One object of the present invention is therefore a method
for producing patchoulol and 7-epi-.alpha.-selinene comprising
[0041] a) contacting FPP with at least one polypeptide having a
patchoulol and 7-epi-.alpha.-selinene synthase activity and
comprising an amino acid sequence at least 50% identical to SEQ ID
NO:1; [0042] b) optionally, isolating the patchoulol and the
7-epi-.alpha.-selinene produced in step a).
[0043] According to a preferred embodiment, the method is a method
for producing patchoulol and 7-epi-.alpha.-selinene as the major
products. According to an even more preferred embodiment,
patchoulol and 7-epi-.alpha.-selinene represent altogether at least
50%, preferably at least 60%, preferably at least 70%, preferably
at least 80%, preferably at least 90% of the product produced by
the method of the invention.
[0044] According to another embodiment of the invention,
7-epi-.alpha.-selinene is the major product and patchoulol is the
second major product. For example, 7-epi-.alpha.-selinene can
represent at least 20%, preferably at least 30%, preferably at
least 34% of the product obtained by the method of the invention.
As another example, patchoulol can represent at least 10%,
preferably at least 15%, preferably at least 16% of the product
obtained by the method of the invention.
[0045] According to a preferred embodiment, the product of the
method of the present invention further comprises at least one
compound selected from the group consisting of germacrene A,
.alpha.-guaiene, .alpha.-humulene, .alpha.-selinene, seychellene,
.alpha.-patchoulene, .alpha.-bulnesene and (E)-nerolidol, these
compounds being defined by the way of their structure as depicted
in FIG. 1. In a more preferred embodiment, the product of the
method of the present invention comprises all of these
compounds.
[0046] The method can be carried out in vitro as well as in vivo,
as will be explained in details further on.
[0047] The polypeptide to be contacted with FPP in vitro can be
obtained by extraction from any organism expressing it, using
standard protein or enzyme extraction technologies. If the host
organism is an unicellular organism or cell releasing the
polypeptide of the invention into the culture medium, the
polypeptide may simply be collected from the culture medium, for
example by centrifugation, optionally followed by washing steps and
re-suspension in suitable buffer solutions. If the organism or cell
accumulates the polypeptide within its cells, the polypeptide may
be obtained by disruption or lysis of the cells and further
extraction of the polypeptide from the cell lysate.
[0048] The polypeptide having a patchoulol and
7-epi-.alpha.-selinene synthase activity, either in an isolated
form or together with other proteins, for example in a crude
protein extract obtained from cultured cells or microorganisms, may
then be suspended in a buffer solution at optimal pH. If adequate,
salts, BSA and other kinds of enzymatic co-factors, may be added in
order to optimize enzyme activity. Appropriate conditions are
described in more details in the Examples further on.
[0049] The precursor FPP may then be added to the suspension or
solution, which is then incubated at optimal temperature, for
example between 15 and 40.degree. C., preferably between 25 and
35.degree. C., more preferably at 30.degree. C. After incubation,
the patchoulol and 7-epi-.alpha.-selinene, and optionally the other
sesquiterpene by-products produced may be isolated from the
incubated solution by standard isolation procedures, such as
solvent extraction and distillation, optionally after removal of
polypeptides from the solution.
[0050] According to another preferred embodiment, the method of any
of the above-described embodiments is carried out in vivo. In this
case, step a) comprises cultivating a non-human host organism or
cell capable of producing FPP and transformed to express at least
one polypeptide comprising an amino acid sequence at least 50%
identical to SEQ ID NO:1 and having a patchoulol and
7-epi-.alpha.-selinene synthase activity, under conditions
conducive to the production of patchoulol and
7-epi-.alpha.-selinene.
[0051] According to a more preferred embodiment, the method further
comprises, prior to step a), transforming a non human organism or
cell capable of producing FPP with at least one nucleic acid
encoding a polypeptide comprising an amino acid sequence at least
50% identical to SEQ ID NO:1 and having a patchoulol and
7-epi-.alpha.-selinene synthase activity, so that said organism
expresses said polypeptide.
[0052] These embodiments of the invention are particularly
advantageous since it is possible to carry out the method in vivo
without previously isolating the polypeptide. The reaction occurs
directly within the organism or cell transformed to express said
polypeptide.
[0053] According to a particular embodiment of the invention, the
at least one nucleic acid encoding the patchoulol and
7-epi-.alpha.-selinene synthase comprises a nucleotide sequence at
least 50%, preferably at least 55%, preferably at least 60%,
preferably at least 65%, preferably at least 70%, preferably at
least 75%, preferably at least 80%, preferably at least 85%,
preferably at least 90%, more preferably at least 95% and even more
preferably at least 98% identical to SEQ ID NO:2 or the complement
thereof. According to an even more preferred embodiment, said
nucleic acid comprises the nucleotide sequence SEQ ID NO:2 or the
complement thereof. In a most preferred embodiment, said nucleic
acid consists of SEQ ID NO:2 or the complement thereof.
[0054] According to a preferred embodiment the at least one nucleic
acid used in any of the above embodiments comprises a nucleotide
sequence that has been obtained by modifying SEQ ID NO:2 or the
complement thereof. According to an even more preferred embodiment,
said at least one nucleic acid consists of a nucleotide sequence
that has been obtained by modifying SEQ ID NO:2 or the complement
thereof.
[0055] According to another embodiment, the at least one nucleic
acid is isolated from Valeriana jatamansi.
[0056] The organism or cell is meant to "express" a polypeptide,
provided that the organism or cell is transformed to harbor a
nucleic acid encoding said polypeptide, this nucleic acid is
transcribed to mRNA and the polypeptide is found in the host
organism or cell. The term "express" encompasses "heterologously
express" and "over-express", the latter referring to levels of
mRNA, polypeptide and/or enzyme activity over and above what is
measured in a non-transformed organism or cell. A more detailed
description of suitable methods to transform a non-human host
organism or cell will be described later on in the part of the
specification that is dedicated to such transformed non-human host
organisms or cells as specific objects of the present invention and
in the examples.
[0057] A particular organism or cell is meant to be "capable of
producing FPP" when it produces FPP naturally or when it does not
produce FPP naturally but is transformed to produce FPP, either
prior to the transformation with a nucleic acid as described herein
or together with said nucleic acid. Organisms or cells transformed
to produce a higher amount of FPP than the naturally occurring
organism or cell are also encompassed by the "organisms or cells
capable of producing FPP". Methods to transform organisms, for
example microorganisms, so that they produce FPP are already known
in the art. Such methods can for example be found in the
literature, for example in the following publications Martin, V.
J., Pitera, D. J., Withers, S. T., Newman, J. D., and Keasling, J.
D. Nat Biotechnol., 2003, 21(7), 796-802 (transformation of E.
coli); Wu, S., Schalk, M., Clark, A., Miles, R. B., Coates, R., and
Chappell, J., Nat Biolechnol., 2006, 24(11), 1441-1447
(transformation of plants); Takahashi, S., Yeo, Y., Greenhagen, B.
T., McMullin, T., Song, L., Maurina-Brunker, J., Rosson, R., Noel,
J., Chappell, J, Biotechnology and Bioengineering, 2007, 97(1),
170-181 (transformation of yeast).
[0058] To carry out the invention in vivo, the host organism or
cell is cultivated under conditions conducive to the production of
patchoulol and 7-epi-.alpha.-selinene. Accordingly, if the host is
a transgenic plant, optimal growth conditions are provided, such as
optimal light, water and nutrient conditions, for example. If the
host is a unicellular organism, conditions conducive to the
production of patchoulol and 7-epi-.alpha.-selinene may comprise
addition of suitable cofactors to the culture medium of the host.
In addition, a culture medium may be selected, so as to maximize
patchoulol and 7-epi-.alpha.-selinene synthesis. Optimal culture
conditions are described in a more detailed manner in the following
examples.
[0059] Non-human host organisms suitable to carry out the method of
the invention in vivo may be any non-human multicellular or
unicellular organisms. In a preferred embodiment, the non-human
host organism used to carry out the invention in vivo is a plant, a
prokaryote or a fungus. Any plant, prokaryote or fungus can be
used. Particularly useful plants are those that naturally produce
high amounts of terpenes. In a more preferred embodiment, the plant
is selected from the family of Solanaceae, Poaceae, Brassicaceae,
Fabaceae, Malvaceae, Asteraceae or Lamiaceae. For example, the
plant is selected from the genera Nicotiana, Solanum, Sorghum,
Arabidopsis, Brassica (rape), Medicago (alfalfa), Gossypium
(cotton), Artemisia, Salvia and Mentha. Preferably, the plant
belongs to the species of Nicotiana tabacum.
[0060] In a more preferred embodiment the non-human host organism
used to carry out the method of the invention in vivo is a
microorganism. Any microorganism can be used but according to an
even more preferred embodiment said microorganism is a bacteria or
yeast. Most preferably, said bacteria is E. coli and said yeast is
Saccharomyces cerevisiae.
[0061] Some of these organisms do not produce FPP naturally. To be
suitable to carry out the method of the invention, these organisms
have to be transformed to produce said precursor. They can be so
transformed either before the modification with the nucleic acid
described according to any of the above embodiments or
simultaneously, as explained above.
[0062] Isolated higher eukaryotic cells can also be used, instead
of complete organisms, as hosts to carry out the method of the
invention in vivo. Suitable eukaryotic cells may be any non-human
cell, but are preferably plant or fungal cells.
[0063] According to a preferred embodiment, the at least one
polypeptide having a patchoulol and 7-epi-.alpha.-selinene synthase
activity used in any of the above-described embodiments or encoded
by the nucleic acid used in any of the above-described embodiments
comprises an amino acid sequence at least 55%, preferably at least
60%, preferably at least 65%, preferably at least 70%, preferably
at least 75%, preferably at least 80%, preferably at least 85%,
preferably at least 90%, more preferably at least 95% and even more
preferably at least 98% identical to SEQ ID NO:1. According to a
more preferred embodiment, said polypeptide comprises the amino
acid sequence SEQ ID NO: 1. In an even more preferred embodiment,
said polypeptide consists of SEQ ID NO: 1.
[0064] According to another preferred embodiment, the at least one
polypeptide having a patchoulol and 7-epi-.alpha.-selinene synthase
activity used in any of the above-described embodiments or encoded
by the nucleic acid used in any of the above-described embodiments
comprises an amino acid sequence that is a variant of SEQ ID NO:1
obtained by genetic engineering. In other terms, said polypeptide
comprises an amino acid sequence encoded by a nucleotide sequence
that has been obtained by modifying SEQ ID NO:2 or the complement
thereof. According to a more preferred embodiment, the at least one
polypeptide having a patchoulol and 7-epi-.alpha.-selinene synthase
activity used in any of the above-described embodiments or encoded
by the nucleic acid used in any of the above-described embodiments
consists of an amino acid sequence that is a variant of SEQ ID NO:1
obtained by genetic engineering, i.e. an amino acid sequence
encoded by a nucleotide sequence that has been obtained by
modifying SEQ ID NO:2 or the complement thereof.
[0065] As used herein, the polypeptide is intended as a polypeptide
or peptide fragment that encompasses the amino acid sequences
identified herein, as well as truncated or variant polypeptides,
provided that they keep their activity as defined above and that
they share at least the defined percentage of identity with the
corresponding fragment of SEQ ID NO: 1.
[0066] Examples of variant polypeptides are naturally occurring
proteins that result from alternate mRNA splicing events or form
proteolytic cleavage of the polypeptides described herein.
Variations attributable to proteolysis include, for example,
differences in the N- or C-termini upon expression in different
types of host cells, due to proteolytic removal of one or more
terminal amino acids from the polypeptides of the invention.
Polypeptides encoded by a nucleic acid obtained by natural or
artificial mutation of a nucleic acid of the invention, as
described thereafter, are also encompassed by the invention.
[0067] Polypeptide variants resulting from a fusion of additional
peptide sequences at the amino and carboxyl terminal ends can also
be used in the methods of the invention. In particular such a
fusion can enhance expression of the polypeptides, be useful in the
purification of the protein or improve the enzymatic activity of
the polypeptide in a desired environment or expression system. Such
additional peptide sequences may be signal peptides, for example.
Accordingly, the present invention encompasses methods using
variant polypeptides, such as those obtained by fusion with other
oligo- or polypeptides and/or those which are linked to signal
peptides. Polypeptides resulting from a fusion with another
functional protein, such as another protein from the terpene
biosynthesis pathway, can also be advantageously be used in the
methods of the invention.
[0068] According to another embodiment, the at least one
polypeptide having a patchoulol and 7-epi-.alpha.-selinene synthase
activity used in any of the above-described embodiments or encoded
by the nucleic acid used in any of the above-described embodiments
is isolated from Valeriana jatamansi.
[0069] An important tool to carry out the method of the invention
is the polypeptide itself. A polypeptide having a patchoulol and
7-epi-.alpha.-selinene synthase activity and comprising an amino
acid sequence at least 50% identical to SEQ ID NO: 1 is therefore
another object of the present invention.
[0070] According to a preferred embodiment, the polypeptide is
capable of producing patchoulol and 7-epi-.alpha.-selinene as the
major products. According to an even more preferred embodiment,
patchoulol and 7-epi-.alpha.-selinene represent altogether at least
50%, preferably at least 60%, preferably at least 70%, preferably
at least 80%, preferably at least 90% of the product produced by
the polypeptide of the invention.
[0071] According to another embodiment of the invention,
7-epi-.alpha.-selinene is the major product and patchoulol is the
second major product. For example, 7-epi-.alpha.-selinene can
represent at least 20%, preferably at least 30%, preferably at
least 34% of the product obtained by the method of the invention.
As another example, patchoulol can represent at least 10%,
preferably at least IS %, preferably at least 16% of the product
obtained by the method of the invention.
[0072] According to another embodiment, the product of the
polypeptide of the present invention further comprises at least one
compound selected from the group consisting of germacrene A,
.alpha.-guaiene, .alpha.-humulene, .alpha.-selinene, seychellene,
.alpha.-patchoulene, .alpha.-bulnesene and (E)-nerolidol, these
compounds being defined by the way of their structure as depicted
in FIG. 1. In a more preferred embodiment, the product of the
method of the present invention comprises all of these
compounds.
[0073] According to a preferred embodiment, the polypeptide
comprises an amino acid sequence at least 55%, preferably at least
60%, preferably at least 65%, preferably at least 70%, preferably
at least 75%, preferably at least 80%, preferably at least 85%,
preferably at least 90%, more preferably at least 95% and even more
preferably at least 98% identical to SEQ ID NO: 1. According to a
more preferred embodiment, the polypeptide comprises the amino acid
sequence SEQ ID NO:1. According to an even more preferred
embodiment, the polypeptide consists of SEQ ID NO:1.
[0074] According to another preferred embodiment, the polypeptide
comprises an amino acid sequence that is a variant of SEQ ID NO:1
obtained by genetic engineering. In other terms, said polypeptide
comprises an amino acid sequence encoded by a nucleotide sequence
that has been obtained by modifying SEQ ID NO:2 or the complement
thereof. According to a more preferred embodiment, the polypeptide
having a patchoulol and 7-epi-.alpha.-selinene synthase activity
consists of an amino acid sequence that is a variant of SEQ ID NO:1
obtained by genetic engineering, i.e. an amino acid sequence
encoded by a nucleotide sequence that has been obtained by
modifying SEQ ID NO:2 or the complement thereof.
[0075] According to another embodiment, the polypeptide is isolated
form Valeriana jatamansi.
[0076] As used herein, the polypeptide is intended as a polypeptide
or peptide fragment that encompasses the amino acid sequences
identified herein, as well as truncated or variant polypeptides,
provided that they keep their activity as defined above and that
they share at least the defined percentage of identity with the
corresponding fragment of SEQ ID NO:1.
[0077] Examples of variant polypeptides are naturally occurring
proteins that result from alternate mRNA splicing events or form
proteolytic cleavage of the polypeptides described herein.
Variations attributable to proteolysis include, for example,
differences in the N- or C-termini upon expression in different
types of host cells, due to proteolytic removal of one or more
terminal amino acids from the polypeptides of the invention.
Polypeptides encoded by a nucleic acid obtained by natural or
artificial mutation of a nucleic acid of the invention, as
described thereafter, are also encompassed by the invention.
[0078] Polypeptide variants resulting from a fusion of additional
peptide sequences at the amino and carboxyl terminal ends are also
encompassed by the polypeptides of the invention. In particular
such a fusion can enhance expression of the polypeptides, be useful
in the purification of the protein or improve the enzymatic
activity of the polypeptide in a desired environment or expression
system. Such additional peptide sequences may be a signal peptide,
for example. Accordingly, the present invention encompasses
variants of the polypeptides of the invention, such as those
obtained by fusion with other oligo- or polypeptides and/or those
which are linked to signal peptides. Polypeptides resulting from a
fusion with another functional protein, such as another protein
from the terpene biosynthesis pathway, are also encompassed by the
polypeptides of the invention.
[0079] As mentioned above, the nucleic acid encoding the
polypeptide of the invention is a useful tool to modify non-human
host organisms or cells intended to be used when the method is
carried out in vivo.
[0080] A nucleic acid encoding a polypeptide according to any of
the above-described embodiments is therefore also an object of the
present invention.
[0081] According to a preferred embodiment, the nucleic acid
comprises a nucleotide sequence at least 50%, preferably at least
55%, preferably at least 60%, preferably at least 65%, preferably
at least 70%, preferably at least 75%, preferably at least 80%,
preferably at least 85%, preferably at least 90%, more preferably
at least 95% and even more preferably at least 98% identical to SEQ
ID NO:2 or the complement thereof. According to a more preferred
embodiment, the nucleic acid comprises the nucleotide sequence SEQ
ID NO:2 or the complement thereof. According to an even more
preferred embodiment, the nucleic acid consists of SEQ ID NO:2 or
the complement thereof.
[0082] According to another embodiment, the nucleic acid is
isolated from Valeriana jatamansi.
[0083] The nucleic acid of the invention can be defined as
including deoxyribonucleotide or ribonucleotide polymers in either
single- or double-stranded form (DNA and/or RNA). The terms
"nucleotide sequence" should also be understood as comprising a
polynucleotide molecule or an oligonucleotide molecule in the form
of a separate fragment or as a component of a larger nucleic acid.
Nucleic acids of the invention also encompass certain isolated
nucleotide sequences including those that are substantially free
from contaminating endogenous material. The nucleic acid of the
invention may be truncated, provided that it encodes a polypeptide
encompassed by the present invention, as described above.
[0084] According to a more preferred embodiment, the at least one
nucleic acid according to any of the above embodiments comprises a
nucleotide sequence that has been obtained by modifying SEQ ID NO:2
or the complement thereof. Preferably said nucleic acid consists of
a nucleotide sequence that has been obtained by modifying SEQ ID
NO:2 or the complement thereof.
[0085] The nucleic acids comprising a sequence obtained by mutation
of SEQ ID NO:2 or the complement thereof are encompassed by the
invention, provided that the sequences they comprise share at least
the defined percentage of identity with the corresponding fragments
of SEQ ID NO:2 or the complement thereof and provided that they
encode a polypeptide having a patchoulol and 7-epi-.alpha.-selinene
synthase activity, as defined in any of the above embodiments.
Mutations may be any kind of mutations of these nucleic acids, such
as point mutations, deletion mutations, insertion mutations and/or
frame shift mutations. A variant nucleic acid may be prepared in
order to adapt its nucleotide sequence to a specific expression
system. For example, bacterial expression systems are known to more
efficiently express polypeptides if amino acids are encoded by a
preferred codon. Due to the degeneracy of the genetic code, wherein
more than one codon can encode the same amino acid, multiple DNA
sequences can code for the same polypeptide, all these DNA
sequences being encompassed by the invention.
[0086] Another important tool for transforming host organisms or
cells suitable to carry out the method of the invention in vivo is
an expression vector comprising a nucleic acid according to any
embodiment of the invention. Such a vector is therefore also an
object of the present invention.
[0087] An "expression vector" as used herein includes any linear or
circular recombinant vector including but not limited to viral
vectors, bacteriophages and plasmids. The skilled person is capable
of selecting a suitable vector according to the expression system.
In one embodiment, the expression vector includes the nucleic acid
of the invention operably linked to at least one regulatory
sequence, which controls the initiation and/or termination of the
transcription and/or translation, such as a transcriptional
promoter, operator or enhancer, or an mRNA ribosomal binding site
and, optionally, including at least one selection marker.
Nucleotide sequences are "operably linked" when the regulatory
sequence functionally relates to the nucleic acid of the
invention.
[0088] The expression vectors of the present invention may be used
in the methods for preparing a genetically transformed host
organism and/or cell, in host organisms and/or cells harboring the
nucleic acids of the invention and in the methods for producing or
making polypeptides having a patchoulol and 7-epi-.alpha.-selinene
synthase activity, as disclosed further below.
[0089] Recombinant non-human host organisms and cells transformed
to harbor at least one nucleic acid of the invention so that it
heterologously expresses or over-expresses at least one polypeptide
of the invention are also very useful tools to carry out the method
of the invention. Such non-human host organisms and cells are
therefore another object of the present invention.
[0090] A nucleic acid according to any of the above-described
embodiments can be used to transform the non-human host organisms
and cells and the expressed polypeptide can be any of the
above-described polypeptides.
[0091] Non-human host organisms of the invention may be any
non-human multicellular or unicellular organisms. In a preferred
embodiment, the non-human host organism is a plant, a prokaryote or
a fungus. Any plant, prokaryote or fungus is suitable to be
transformed according to the present invention. Particularly useful
plants are those that naturally produce high amounts of terpenes.
In a more preferred embodiment, the plant is selected from the
family of Solanaceae, Poaceae, Brassicaceae, Fabaceae, Malvaceae,
Asteraceae or Lamiaceae. For example, the plant is selected from
the genera Nicotiana, Solanum, Sorghum, Arabidopsis, Brassica
(rape), Medicago (alfalfa), Gossypium (cotton), Artemisia, Salvia
and Mentha. Preferably, the plant belongs to the species of
Nicoliana tabacum.
[0092] In a more preferred embodiment the non-human host organism
is a microorganism. Any microorganism is suitable for the present
invention, but according to an even more preferred embodiment said
microorganism is a bacteria or yeast. Most preferably, said
bacteria is E. coli and said yeast is Saccharomyces cerevisiae.
[0093] Isolated higher eukaryotic cells can also be transformed,
instead of complete organisms. As higher eukaryotic cells, we mean
here any non-human eukaryotic cell except yeast cells. Preferred
higher eukaryotic cells are plant cells or fungal cells.
[0094] The term "transformed" refers to the fact that the host was
subjected to genetic engineering to comprise one, two or more
copies of each of the nucleic acids required in any of the
above-described embodiment. Preferably the term "transformed"
relates to hosts heterologously expressing the polypeptides encoded
by the nucleic acid with which they are transformed, as well as
over-expressing said polypeptides. Accordingly, in an embodiment,
the present invention provides a transformed organism, in which the
polypeptides are expressed in higher quantity than in the same
organism not so transformed.
[0095] There are several methods known in the art for the creation
of transgenic host organisms or cells such as plants, fungi,
prokaryotes, or cultures of higher eukaryotic cells. Appropriate
cloning and expression vectors for use with bacterial, fungal,
yeast, plant and mammalian cellular hosts are described, for
example, in Pouwels et al., Cloning Vectors: A Laboratory Manual.
1985, Elsevier, New York and Sambrook et al., Molecular Cloning: A
Laboratory Manual, 2.sup.nd edition, 1989, Cold Spring Harbor
Laboratory Press. Cloning and expression vectors for higher plants
and/or plant cells in particular are available to the skilled
person. See for example Schardl et al. Gene 61: 1-11, 1987.
[0096] Methods for transforming host organisms or cells to harbor
transgenic nucleic acids are familiar to the skilled person. For
the creation of transgenic plants, for example, current methods
include: electroporation of plant protoplasts, liposome-mediated
transformation, agrobacterium-mediated transformation,
polyethylene-glycol-mediated transformation, particle bombardement,
microinjection of plant cells, and transformation using
viruses.
[0097] In one embodiment, transformed DNA is integrated into a
chromosome of a non-human host organism and/or cell such that a
stable recombinant system results. Any chromosomal integration
method known in the art may be used in the practice of the
invention, including but not limited to recombinase-mediated
cassette exchange (RMCE), viral site-specific chromosomal
insertion, adenovirus and pronuclear injection.
[0098] In order to carry out the method for producing patchoulol
and 7-epi-.alpha.-selinene in vitro, as exposed herein above, it is
very advantageous to provide a method of making at least one
polypeptide having a patchoulol and 7-epi-.alpha.-selinene synthase
activity as described in any embodiment of the invention.
Therefore, the invention provides a method for producing at least
one polypeptide according to any embodiment of the invention
comprising [0099] a) culturing a non-human host organism or cell
transformed with the expression vector of the invention, so that it
harbors a nucleic acid according to the invention and expresses or
over-expresses a polypeptide of the invention; [0100] b) isolating
the polypeptide from the non-human host organism or cell cultured
in step a).
[0101] According to a preferred embodiment, said method further
comprises, prior to step a), transforming a non-human host organism
or cell with the expression vector of the invention, so that it
harbors a nucleic acid according to the invention and expresses or
over-expresses the polypeptide of the invention.
[0102] A nucleic acid according to any of the above-described
embodiments can be used.
[0103] Transforming and culturing of the non-human host organism or
cell can be carried out as described above for the method of
producing patchoulol and 7-epi-.alpha.-selinene in vivo. Step b)
may be performed using any technique well known in the art to
isolate a particular polypeptide from an organism or cell.
[0104] A "polypeptide variant" as referred to herein means a
polypeptide having a patchoulol and 7-epi-.alpha.-selinene synthase
activity and being substantially homologous to the polypeptide
according to any of the above embodiments, but having an amino acid
sequence different from that encoded by any of the nucleic acid
sequences of the invention because of one or more deletions,
insertions or substitutions.
[0105] Variants can comprise conservatively substituted sequences,
meaning that a given amino acid residue is replaced by a residue
having similar physiochemical characteristics. Examples of
conservative substitutions include substitution of one aliphatic
residue for another, such as Ile, Val, Leu, or Ala for one another,
or substitutions of one polar residue for another, such as between
Lys and Arg; Glu and Asp; or Gin and Asn. See Zubay, Biochemistry,
1983, Addison-Wesley Pub. Co. The effects of such substitutions can
be calculated using substitution score matrices such a PAM-120,
PAM-200, and PAM-250 as discussed in Altschul, J. Mol. Biol., 1991,
219, 555-565. Other such conservative substitutions, for example
substitutions of entire regions having similar hydrophobicity
characteristics, are well known.
[0106] Naturally occurring peptide variants are also encompassed by
the invention. Examples of such variants are proteins that result
from alternate mRNA splicing events or from proteolytic cleavage of
the polypeptides described herein. Variations attributable to
proteolysis include, for example, differences in the N- or
C-termini upon expression in different types of host cells, due to
proteolytic removal of one or more terminal amino acids from the
polypeptides encoded by the sequences of the invention.
[0107] Variants of the polypeptides of the invention may be used to
attain for example desired enhanced or reduced enzymatic activity,
modified regiochemistry or stereochemistry, or altered substrate
utilization or product distribution, increased affinity for the
substrate, improved specificity for the production of one or more
desired compounds, increased velocity of the enzyme reaction,
higher activity or stability in a specific environment (pH,
temperature, solvent, etc), or improved expression level in a
desired expression system. A variant or site directed mutant may be
made by any method known in the art. Variants and derivatives of
native polypeptides can be obtained by isolating
naturally-occurring variants, or the nucleotide sequence of
variants, of other or same plant lines or species, or by
artificially programming mutations of nucleotide sequences coding
for the polypeptides of the invention. Alterations of the native
amino acid sequence can be accomplished by any of a number of
conventional methods.
[0108] Polypeptide variants resulting from a fusion of additional
peptide sequences at the amino and carboxyl terminal ends of the
polypeptides of the invention can be used to enhance expression of
the polypeptides, be useful in the purification of the protein or
improve the enzymatic activity of the polypeptide in a desired
environment or expression system. Such additional peptide sequences
may be signal peptides, for example. Accordingly, the present
invention encompasses variants of the polypeptides of the
invention, such as those obtained by fusion with other oligo- or
polypeptides and/or those which are linked to signal peptides.
Fusion polypeptides encompassed by the invention also comprise
fusion polypeptides resulting from a fusion of other functional
proteins, such as other proteins from the terpene biosynthesis
pathway.
[0109] Therefore, in an embodiment, the present invention provides
a method for preparing a variant polypeptide having a patchoulol
and 7-epi-.alpha.-selinene synthase activity, as described in any
of the above embodiments, and comprising the steps of: [0110] (a)
selecting a nucleic acid according to any of the embodiments
exposed above; [0111] (b) modifying the selected nucleic acid to
obtain at least one mutant nucleic acid; [0112] (c) transforming
host cells or unicellular organisms with the mutant nucleic acid
sequence to express a polypeptide encoded by the mutant nucleic
acid sequence; [0113] (d) screening the polypeptide for at least
one modified property; and, [0114] (e) optionally, if the
polypeptide has no desired variant patchoulol and
7-epi-.alpha.-selinene synthase activity, repeating the process
steps (a) to (d) until a polypeptide with a desired variant
patchoulol and 7-epi-.alpha.-selinene synthase activity is
obtained; [0115] (f) optionally, if a polypeptide having a desired
variant patchoulol and 7-epi-.alpha.-selinene synthase activity was
identified in step d), isolating the corresponding mutant nucleic
acid obtained in step (c).
[0116] According to a preferred embodiment, the variant polypeptide
prepared is capable of producing patchoulol and
7-epi-.alpha.-selinene as the major products. According to an even
more preferred embodiment, patchoulol and 7-epi-.alpha.-selinene
represent altogether at least 50%, preferably at least 60%,
preferably at least 70%, preferably at least 80%, preferably at
least 90.degree./o of the product produced by the variant
polypeptide prepared.
[0117] According to another embodiment of the invention,
7-epi-.alpha.-selinene is the major product and patchoulol is the
second major product. For example, 7-epi-.alpha.-selinene can
represent at least 20%, preferably at least 30%, preferably at
least 34% of the product obtained with the variant polypeptide
prepared. As another example, patchoulol can represent at least
10%, preferably at least 15%, preferably at least 16% of the
product obtained with the variant polypeptide prepared.
[0118] According to another embodiment, the product of the variant
polypeptide prepared further comprises at least one compound
selected from the group consisting of germacrene A,
.alpha.-guaiene, .alpha.-humulene, .alpha.-selinene, seychellene,
.alpha.-patchoulene, .alpha.-bulnesene and (E)-nerolidol, these
compounds being defined by the way of their structure as depicted
in FIG. 1. In a more preferred embodiment, the product of the
variant polypeptide prepared comprises all of these compounds.
[0119] In step (b), a large number of mutant nucleic acid sequences
may be created, for example by random mutagenesis, site-specific
mutagenesis, or DNA shuffling. The detailed procedures of gene
shuffling are found in Stemmer, DNA shuffling by random
fragmentation and reassembly: in vitro recombination for molecular
evolution. Proc Natl Acad Sci USA., 1994, 91(22): 10747-1075. In
short, DNA shuffling refers to a process of random recombination of
known sequences in vitro, involving at least two nucleic acids
selected for recombination. For example mutations can be introduced
at particular loci by synthesizing oligonucleotides containing a
mutant sequence, flanked by restriction sites enabling ligation to
fragments of the native sequence. Following ligation, the resulting
reconstructed sequence encodes an analog having the desired amino
acid insertion, substitution, or deletion or insertion.
[0120] Alternatively, oligonucleotide-directed site-specific
mutagenesis procedures can be employed to provide an altered gene
wherein predetermined codons can be altered by substitution,
deletion or insertion.
[0121] Accordingly, the polypeptide comprising SEQ ID NO:1 may be
recombined with any other sesquiterpene synthase encoding nucleic
acids, for example isolated from an organism other than Valeriana
jatamansi. Thus, mutant nucleic acids may be obtained and
separated, which may be used for transforming a host cell according
to standard procedures, for example such as disclosed in the
present examples.
[0122] In step (d), the polypeptide obtained in step (c) is
screened for at least one modified property, for example a desired
modified enzymatic activity. Examples of desired enzymatic
activities, for which an expressed polypeptide may be screened,
include enhanced or reduced enzymatic activity, for example as
measured by K.sub.M or V.sub.max value, modified regio-chemistry or
stereochemistry and altered substrate utilization or product
distribution. The screening of enzymatic activity can be performed
according to procedures familiar to the skilled person and those
disclosed in the present examples.
[0123] Step (e) provides for repetition of process steps (a)-(d),
which may preferably be performed in parallel. Accordingly, by
creating a significant number of mutant nucleic acids, many host
cells may be transformed with different mutant nucleic acids at the
same time, allowing for the subsequent screening of an elevated
number of polypeptides. The chances of obtaining a desired variant
polypeptide may thus be increased at the discretion of the skilled
person.
[0124] All the publications mentioned in this application are
incorporated by reference to disclose and describe the methods
and/or materials in connection with which the publications are
cited.
DESCRIPTION OF THE DRAWINGS
[0125] FIG. 1: Total ion chromatograms of GC-MS analysis of the
sesquiterpene products generated by the patchoulol and
7-epi-.alpha.-selinene synthase of the present invention (SEQ ID
NO:1) (A) and profile obtained with a negative control (B). The
peaks marked with numbers were identified as sesquiterpenes:
.beta.-elemene (degradation product of (+)-germacrene A) (1);
.alpha.-guaiene (2); seychellene (3); .alpha.-humulene (4);
.alpha.-patchoulene (5); .alpha.-selinene (6); .alpha.-bulnesene
(7); 7-epi-.alpha.-selinene (8); (E)-nerolidol (9); unidentified
sesquiterpene alcohol (10); patchoulol (11).
[0126] FIG. 2: Mass spectra of the two major products of the
patchoulol and 7-epi-.alpha.-selinene synthase of the invention
(SEQ ID NO:2) (peak 8 and peak 11 in FIG. 1 (A and B respectively))
and comparison to the mass spectra of authentic
7-epi-.alpha.-selinene and patchoulol (C and D respectively).
[0127] FIG. 3: Structure of the sesquiterpenes produced by the
patchoulol and 7-epi-.alpha.-selinene synthase of the invention
(SEQ ID NO:2).
SPECIFIC EMBODIMENTS OF THE INVENTION OR EXAMPLES
[0128] The invention will now be described in further detail by way
of the following Examples.
Example 1
[0129] Isolation of a Sesquitepene Synthase cDNA from Valeriana
Jatamansi Roots
[0130] Valeriana jatamansi (synonym: Valeriana walichii) plants
were obtained from B & T World Seeds (Paguignan, Aigues-Vives,
France). The plants were cultivated in a green house at minimal
temperature of 14.degree. C. Rhizomes, young and mature roots were
collected separately and immediately frozen in liquid nitrogen. The
material was crushed and granddad to a fine powder in liquid
nitrogen using a mortar and pestle. Total RNA was extracted using
the CONCERT.TM. Plant RNA Reagent from Invitrogen following the
manufacturer instruction except for the isopropanol precipitation
which was replaced by a 2M LiCl precipitation. The quality of the
RNA was evaluated on an agarose gel by verifying the integrity of
the ribosomal RNA bands. The mRNA was purified from the total RNA
by oligodT-cellulose affinity chromatography using the
FASTTRACK.RTM. 2.0 mRNA isolation Kit (Invitrogen) following the
manufacturer's instructions.
[0131] Degenerated oligonucleotides were designed to conserved
motifs observed in alignments of the amino acid sequences of plant
sesquiterpene synthases (Deguerry et al, 2006, Arch Biochem
Biophys. 454(2), 123-36). RT-PCR with these sesquiterpene
synthases-specific oligonucleotides were performed using the Qiagen
OneStep RT-PCR Kit and an Eppendorf Mastercycler gradient thermal
cycler. Typical reaction mixtures contain 10 .mu.l SX Qiagen
OneStep RT-PCR buffer, 200 .mu.M each dNTP, 0.4 .mu.M each primer,
2 .mu.l Qiagen OneStep RT-PCR Enzyme Mix, 1 .mu.l RNASIN.RTM.
Ribonuclease Inhibitor (Promega Co.) and 1 .mu.g total RNA in a
final volume of 50 .mu.l. The thermal cycler conditions were: 30
min at 50.degree. C. (reverse transcription); 15 min at 95.degree.
C. (DNA polymerase activation); 35 cycles of 30 sec at 94.degree.
C., 30 sec at 41 to 51.degree. C. and 1 min at 72.degree. C. The
sizes of the PCR products were evaluated on a 1.2% agarose gel. The
bands corresponding to the expected size were excised from the gel,
purified using the QIAQUICK.RTM. Gel Extraction Kit (Qiagen) and
cloned in the pCR.RTM. 2.1-TOPO vector using the TOPO TA cloning
Kit (Invitrogen). Inserted cDNAs were then subject to DNA
sequencing and the sequence compared against the GenBank
non-redundant protein database (NCBI) using the BLASTX algorithm
(Altschul et al 1990, J. Mol. Biol. 215, 403-410). The combination
of the forward primer TpsCF2 (SEQ ID NO:3)
(5'-GGGA(A/T)(A/T)G(A/T)(A/T/G/C)(A/T)(C/T/G)GTTGAA(T/G)(T/G)TTATTTTTGG-3-
') and the reverse primer TpsCR3 (SEQ ID NO:4)
(5'-GT(A/T)(C/G)CGTG(A/T/G/C/)G(A/C/T)GTCGTA(A/C/T)G(T/G)GTCATC-3')
provided a fragment of 81 bp presenting sequence homology with
known plant sesquiterpene synthases.
[0132] A combination of 3' and 5' Rapid Amplification of cDNA Ends
(RACE) was then used to obtain the full-length sequence of the cDNA
corresponding to this fragment. For the 3'RACE, two sens
oligonucleotides were deduced from the 81 bp sequence obtained by
RT-PCR: 20-3R1 (SEQ ID NO:5) and 20-3R2 (SEQ ID NO:6). An
adaptor-ligated double stranded cDNA was prepared using the
MARATHON.TM. cDNA Amplification Kit (Clontech) following the
manufacturer's protocol. This cDNA was prepared from mRNA purified
from V. jatamansi rhizome total RNA. Typical RACE reaction mixtures
contain, in a final volume of 50 .mu.l, 5 .mu.l 10.times.PCR
Reaction Buffer (Clontech), 200 .mu.M each dNTP, 1 .mu.l
ADVANTAGE.RTM. 2 Polymerase Mix, 200 nM each primer and 5 .mu.l of
250 fold diluted cDNA. Amplification was performed on an Eppendorf
Mastercycler gradient thermal cycler. The thermal Cycling
conditions were as follows: 1 min at 94.degree. C., 5 cycles of 5
sec at 94.degree. C. and 3 min at 72.degree. C., 5 cycles of 5 sec
at 94.degree. C. and 3 min at 70.degree. C., 20 cycles of 5 sec at
94.degree. C. and 3 min at 68.degree. C. The amplification products
were evaluated, sub-cloned, and the sequence analyzed as described
above. The 3'-end of the cDNA was obtained after a first round
amplification with the primer 20-3R1 and the AP1 primer (clontech)
and a second round amplification with the primer 20-3R2 and the AP2
primer (Clontech).
[0133] Two reverse primers were deduced from the sequence obtained
by 3'RACE: 20-5R1 (SEQ ID NO:7) and 20-5R2 (SEQ ID NO:8). The
SMART.TM. RACE cDNA amplification Kit (Clontech) was used to
prepare a 5'RACE-ready cDNA from V. jatamansi root total RNA. The
5'RACE was performed in 50 .mu.l of 1.times. advantage 2 PCR buffer
containing 200 .mu.M dNTPs, 5 .mu.l Universal Primer Mix
(Clontech), 2 .mu.M of the first gene specific primer (20-5R1), 2.5
.mu.l 5'RACE-ready cDNA and 1 .mu.l ADVANTAGE.RTM. 2 Polymerase
Mix. A second round of amplification was performed in the same
condition with 1 .mu.l of the first amplification, the nested gene
specific primer (20-5R2) and the Nested Universal primer
(Clontech). The temperature cycling condition and DNA fragment
analysis were as described above for the in 3'RACE.
[0134] The combination of 3' and 5'-RACE allowed the reconstitution
of the full-length sequence of a cDNA which was named ValR20 (SEQ
ID NO:9). The deduced amino acid sequence (SEQ ID NO: 10) showed
homology with plant sesquiterpene synthases and contained typical
terpene synthases amino acid motifs such as the DDxxD motif. The
closest sequences were a vetispiradiene synthase amino acid
sequence from S. tuberosum (49% identity) and a putative
sesquiterpene synthase from V. vinifera (up to 46% identity).
Example 2
[0135] Heterologous Expression and Characterization of ValR20 as a
7-epi-.alpha.-Selinene and Patchoulol Synthase
[0136] Two oligonucleotides, Val-R20-topo-start (SEQ ID NO: 11) and
Val-R20-stop (SEQ ID NO: 12), where designed from the start and
stop regions of ValR20 and used to amplify the full-length sequence
of this cDNA. The Val-R20-topo-start was designed according to the
CHAMPION.TM. pET Directional TOPO.RTM. Expression Kits
(invitrogen). The amplification was performed with the Pfu DNA
polymerase (Promega) from the 5'-RACE-Ready cDNA pool prepared with
the Smart RACE cDNA amplification kit (clontech). The thermal
cycling conditions were as follows: 2 min at 95.degree. C.; 32
cycles of 5 sec at 94.degree. C., 20 sec at 53.degree. C. and 3 min
at 72.degree. C. The PCR products were purified on an agarose gel
and eluted using the QIAQUICK.RTM. Gel Extraction Kit (Qiagen,
Valencia, Calif.). The PCR product was ligated into the pET101
plasmid following the manufacturer protocol (Invitrogen). The
constructs were verified by DNA sequencing. One clone, 504-ValR20
(SEQ ID NO:2), was selected for the subsequent heterologous
expression and enzyme assay experiments. Compared to the sequences
reconstituted from the RACE, the sequence of 504-ValR20 (SEQ ID
NO:2) showed four nucleotide differences leading to three amino
acid changes. The amino acid sequence encoded by 504-ValR20 is
provided in SEQ ID NO: 1.
[0137] The plasmid was transferred into BL21 STAR.TM. (DE3) E. coli
cells (Invitrogen). Single colonies of transformed cells were used
to inoculate 5 mL LB medium. After reaching an OD of 0.3, the 5 mL
cultures were used to inoculate 25 mL of LB medium. The cultures
were incubated at 37.degree. C. until reaching an OD of 0.5 and
they were then transferred to a 20.degree. C. incubator. After 1
hour equilibration, the expression of the proteins was induced by
addition of 1 mM IPTG and the culture were incubated over-night at
20.degree. C. The next day, the cells were collected by
centrifugation, resuspended in 1 mL Extraction Buffer (50 mM MOPSO,
pH 7.4, 1 mM DTT, 10% glycerol) and disrupted by sonication. The
cell debris were sedimented by centrifugation 30 min at 18,000 g
and the supernatant containing the soluble proteins was
recovered.
[0138] The enzymatic assays were performed in Teflon sealed glass
tubes using 250 .mu.L protein extract in a final volume of 5 mL
Extraction Buffer supplemented with 10 mM MgCl.sub.2 and 100 to 250
.mu.M farnesyl pyrophosphate (prepared as described by Keller and
Thompson, J. Chromatogr 645(1), 161-167, 1993). The assays were
overlaid with 3 mL pentane and the tubes incubated overnight at
30.degree. C. The pentane phase was recovered and the medium
extracted with a second volume of pentane. The combined pentane
fractions were concentrated under nitrogen and analyzed by GC-MS on
a Hewlett-Packard 6890N GC-quadrupole mass selective detector
system, equipped with a 0.25 mm inner diameter by 30 m length
DB-IMS (J&W Scientific) capillary column. The oven temperature
was programmed from 50.degree. C. (1 min hold) to 280.degree. C. at
10.degree. C./min. The carrier gas was He at a constant flow of 1
mL/min. The identity of the products was confirmed based on the
concordance of the retention indexes and mass spectra of authentic
standards when available or based on published data (Joulain and
Koenig, 1998). Negative controls were performed with proteins
obtained from E coli transformed with the pET101 plasmid without
insert.
[0139] The recombinant enzyme (SEQ ID NO:1) was found to be a
multiple product sesquiterpene synthase converting
farnesyl-pyrophosphate to at least 11 products. The major product
was 7-epi-.alpha.-selinene which accounted for 34.2% of the total
sesquiterpene mixture. Patchoulol was the second most abound
product of the enzyme (16.4%). Other sesquiterpene produced were
germacrene A (12.2%), .alpha.-guaiene (2.4%), seychellene (4.6%),
.alpha.-humulene (5.3%), .alpha.-patchoulene (2.4%),
.alpha.-selinene (9.6%), .alpha.-bulnesene (6.3%), (E)-nerolidol
(2.5%) and an unidentified sesquiterpene alcohol (4.1%).
Sequence CWU 1
1
111553PRTValeriana jatamansi 1Met Leu Ser Thr Glu Ser Gln Val Phe
Arg Pro Leu Ala Asn Phe Glu 1 5 10 15 Lys Ser Leu Trp Gly Asn Leu
Phe Thr Ser Phe Ser Val Asp Tyr Leu 20 25 30 Thr Lys Lys Thr Asn
Thr Glu Glu His Glu Gly Leu Leu Glu Lys Val 35 40 45 Arg Leu Met
Phe Leu Asp Ala Ser Lys Leu Lys Ile Pro Glu Lys Ile 50 55 60 Asn
Phe Ile Asn Thr Leu Glu Arg Leu Gly Val Ser Tyr His Met Glu 65 70
75 80 Arg Glu Ile Glu Asp Gln Leu His Gln Met Phe Asp Ala His Ser
Lys 85 90 95 Phe Gln Asp Asp Ile Gln Arg Phe Asp Leu Phe Thr Leu
Gly Ile Tyr 100 105 110 Phe Arg Ile Leu Arg Gln His Gly Tyr Asn Ile
Ser Ser Asp Val Phe 115 120 125 Lys Lys Leu Lys Asp Ser Asn Gly Lys
Phe Lys Glu Glu Leu Lys Asp 130 135 140 Asp Val Ile Gly Ile Leu Ser
Leu Tyr Glu Ala Thr His Val Arg Thr 145 150 155 160 His Gly Asp Asp
Ile Leu Asp Glu Ala Phe Ile Tyr Thr Lys Ala Gln 165 170 175 Leu Glu
Ser Met Ser Thr Ala Ser Leu Ser Pro Phe Leu Gly Met Gln 180 185 190
Val Thr His Ala Leu Ile Gln Ser Leu His Lys Gly Ile Pro Arg Ile 195
200 205 Glu Ser Arg Asn Tyr Ile Ser Val Tyr Glu Glu Asp Pro Asn Lys
Asn 210 215 220 Asp Leu Leu Leu Arg Phe Ser Thr Ile Asp Phe Asn Leu
Leu Gln Met 225 230 235 240 Leu His Lys Gln Glu Leu Cys Asp Ala Ser
Arg Trp Trp Asn Glu Met 245 250 255 Glu Phe Glu Thr Lys Leu Ser Tyr
Ala Arg Asp Arg Val Val Glu Gly 260 265 270 Tyr Leu Trp Thr Leu Ser
Ala Tyr Tyr Glu Pro Lys Tyr Ser Leu Ala 275 280 285 Arg Arg Ile Leu
Ile Lys Leu Met Ile Leu Val Ser Leu Thr Asp Asp 290 295 300 Thr Tyr
Asp Ala Tyr Gly Thr Leu Asp Glu Leu Gln Leu Phe Thr Asp 305 310 315
320 Ala Val Glu Arg Leu Asp Glu Gly Ser Ile Asn Gln Leu Pro Asp Tyr
325 330 335 Met Lys Ile Leu Tyr Lys Ala Leu Leu Asp Phe Phe Glu Glu
Thr Glu 340 345 350 Asp Ile Leu Cys Lys His Gly Ile Ile Asn Gly Ser
His Arg Val Asn 355 360 365 Tyr Gly Lys Tyr Val Tyr Lys Glu Ile Val
Asn Cys Tyr Asn Thr Glu 370 375 380 Tyr Lys Trp Phe Asn Lys Arg Tyr
Val Pro Asp Phe Glu Glu Tyr Met 385 390 395 400 Gln Lys Ala Val Val
Thr Ser Gly Asn Asn Leu Leu Ile Thr Trp Ser 405 410 415 Phe Gln Gly
Met Asp Gln Val Ala Ser Ile Lys Ala Phe Glu Trp Val 420 425 430 Lys
Ser His Pro Lys Met Val Val Ser Ser Asn Lys Val Leu Arg Leu 435 440
445 Val Asp Asp Val Met Ser His Glu Glu Glu Asp Glu Arg Gly His Val
450 455 460 Ala Thr Gly Leu Glu Cys Tyr Gln Lys Thr Tyr Gly Gly Asn
Arg Lys 465 470 475 480 Glu Ile Ile Pro Glu Phe Tyr Lys Arg Ile Asp
Asp Ala Trp Lys Asp 485 490 495 Val Asn Glu Glu Phe Leu Lys Pro Asp
Lys Leu Pro Leu Glu Ile Leu 500 505 510 Met Arg Val Ile Asn Leu Thr
Arg Ile Gly Asp Val Val Tyr Lys Tyr 515 520 525 Asp Asp Gly Tyr Thr
His Pro Thr Lys Ala Leu Lys Asp His Ile Ile 530 535 540 Ser Leu Phe
Val Asn Pro Ile Ile Ile 545 550 21662DNAValeriana jatamansi
2atgttaagca ctgagagtca agtttttcgt ccgttggcaa attttgagaa aagtttatgg
60ggaaatcttt tcacctcatt ttctgtggat tatctgacta agaaaacaaa tacagaagaa
120catgaaggat tattagaaaa agtgagactg atgtttttag atgcatccaa
attgaagatt 180ccagaaaaga tcaatttcat aaatacactt gaaaggttag
gtgtatcata tcatatggag 240agagagattg aagatcagct tcatcagatg
tttgatgctc attctaaatt tcaagatgat 300attcaacggt ttgatttgtt
cactttggga atttacttca ggattctcag acaacatggt 360tataatatct
ctagcgatgt tttcaagaag ttgaaagata gcaatggaaa attcaaggaa
420gaactaaaag atgatgtgat tggcattcta agcttgtatg aagctacaca
tgtaagaacc 480cacggtgacg atattctaga tgaagctttc atctatacaa
aagctcaact agaatctatg 540tccaccgcaa gtttaagccc gtttcttggt
atgcaagtta cgcatgcttt gattcagtct 600ctccacaaag ggatcccaag
aatcgagtcg cgcaactata tatctgttta tgaagaagat 660ccaaacaaaa
atgatctatt attgaggttc tcaacgattg atttcaatct gctgcaaatg
720cttcacaagc aagaattgtg tgatgcctca aggtggtgga atgaaatgga
gtttgagacg 780aaactatctt atgcgagaga tcgagtggta gaaggctatt
tatggaccct tagcgcatat 840tacgaaccga aatactcttt ggctcgaaga
atattaatca aattaatgat attggtatct 900cttacggatg atacgtatga
tgcatatggt acgttagatg aacttcaact ctttacggat 960gcagtagaaa
ggttggatga gggttccatc aatcagcttc ctgattacat gaagattctc
1020tataaggctc tgctagattt tttcgaggaa acagaagata tattatgcaa
acatggaata 1080attaatggtt ctcatcgcgt taattatggg aaatatgtgt
ataaagagat tgtgaattgc 1140tacaataccg agtacaaatg gttcaacaaa
agatacgtgc cggattttga agaatatatg 1200cagaaagcag tagtgacttc
aggtaacaat ttgcttataa cgtggtcttt tcaaggaatg 1260gatcaagtcg
caagtatcaa agcgttcgag tgggttaaaa gtcatccgaa aatggtagtc
1320tcgtcgaata aagtcctacg acttgttgac gacgtaatga gccacgagga
agaagatgaa 1380aggggacatg ttgcaacagg ccttgaatgc tatcagaaaa
catatggtgg aaatagaaaa 1440gagatcattc cagaatttta taagaggatt
gatgatgctt ggaaagatgt aaatgaagaa 1500tttttgaaac ccgataaatt
accgctagaa atactaatgc gtgttattaa cctcacgaga 1560attggcgacg
ttgtttacaa gtatgacgac gggtatactc atccaacgaa agcattaaag
1620gatcacatca tatcgttgtt cgtgaatccc ataatcatat ga
1662329DNAArtificial sequencePrimermisc_feature(9)..(9)n is a, c,
g, or t 3gggawwgwnw bgttgaakkt tatttttgg 29426DNAArtificial
sequencePrimermisc_feature(9)..(9)n is a, c, g, or t 4gtwscgtgng
hgtcgtahgk gtcatc 26525DNAArtificial sequencePrimer 5acccttagcg
catattacga accga 25628DNAArtificial sequencePrimer 6cgaaccgaaa
tactctttgg ctcgaaga 28728DNAArtificial sequencePrimer 7ccactcgaac
gctttgatac ttgcgact 28828DNAArtificial sequencePrimer 8cctcatccaa
cctttctact gcatccgt 2891979DNAValeriana jatamansi 9acgcggggga
ctatcatata attaagagat taattaagtt tgtgaaaatg ttaagcactg 60agagtcaagt
ttttcgtccg ttggcaaatt ttgagaaaag tttatgggga aatcttttca
120cctcattttc tgtggattat ctgactaaga aaacaaatac agaagaacat
gaaggattat 180tagaaaaagt gagactgatg tttttagatg catccaaatt
gaagattcca gaaaagatca 240atttcataaa tacacttgaa aggttaggtg
tatcatatca tatggagaga gagattgaag 300atcagcttca tcagatgttt
gatgctcatt ctaaatttca agatgatatt caacggtttg 360atttgttcac
tttgggaatt tacttcagga ttctcagaca acatggttat aatatctcta
420gcgatgtttt caagaagttg aaagatagca atggaaaatt caaggaagaa
ctaaaagatg 480atgtgattgg cattctaagc ttgtatgaag ctacacatgt
aagaacccac ggtgacgata 540ttctagatga agctttcatc tatacaaaag
ctcaactaga atctatgtcc accgcaagtt 600taagcccgtt tcttggtatg
caagttacgc atgctttgat tcagtctctc cacaaaggga 660tcccaagaat
cgagtcgcgc aactatatat ctgtttatga agaagatcca aacaaaaatg
720atctattatt gaggttctca aagattgatt tcaatctgct gcaaatgctt
cacaagcaag 780aattgtgtga tgcctcaagg tggtggaatg aaatggagtt
tgagacgaaa ctatcttatg 840cgagagatcg agtggtagaa ggctatttat
ggacccttag cgcatattac gaaccgaaat 900actctttggc tcgaagaata
ttaatcaaat taatgatatt ggtatctctt acggatgata 960cgtatgatgc
atatggtacg ttagatgaac ttcaactctt tacggatgca gtagaaaggt
1020tggatgaggg ttccatcaat cagcttcctg attacatgaa gattctctat
aaggctctgc 1080tagatttttt cgaggaaaca gaagatatat tatgcaaaca
tggaataatt aatggttctc 1140atcgcgttaa ttatggaaaa tatgtgtata
aagagattgt gaattgctac aataccgagt 1200acaaatggtt caacaaaaga
tacgtgccgg attttgaaga atatatgcag aaagcagtag 1260tgacttcagg
taacaatttg cttataacgt ggtcttttca aggaatggat caagtcgcaa
1320gtatcaaagc gttcgagtgg gttaaaaatc atccgaaaat ggtagtctcg
tcgaataaag 1380tcctacgact tgttgacgac gtaatgagcc acgaggaaga
agatgaaagg ggacatgttg 1440caacaggcct tgaatgctat cagaaaacat
atggtggaaa tagaaaagag atcattccag 1500aattttataa gaggattgat
gatgcttgga aagatgtaaa tgaagaattt ttgaaacccg 1560ataaattacc
gctaaaaata ctaatgcgtg ttattaacct cacgagaatt ggcgacgttg
1620tttacaagta tgacgacggg tatactcatc caacgaaagc attaaaggat
cacatcatat 1680cgttgttcgt gaatcccata atcatatgat tatcatcaat
cgttcgacgt gtgtgcgcgt 1740ggttttggtt aataagtaaa taaggtggac
tgcatcacca tcccaaatct gcatgtttgc 1800ttctgggttt ctttatttgt
gtttgcgtgt ttacttgatt tcttttccgt tgtttgtaat 1860aataccgagt
atttcttttc cgttgtttgt aataatacgg agtatgcatt actctctttt
1920tcacatgcaa ttaataagac gatattattt gtaaaaaaaa aaaaaaaaaa
aaaaaaaaa 19791028DNAArtificial sequencePrimer 10caccatgtta
agcactgaga gtcaagtt 281127DNAArtificial sequencePrimer 11gataatcata
tgattatggg attcacg 27
* * * * *