U.S. patent application number 12/889163 was filed with the patent office on 2011-07-07 for production of fatty acid derivatives.
This patent application is currently assigned to LS9, INC.. Invention is credited to Alfred GAERTNER.
Application Number | 20110162259 12/889163 |
Document ID | / |
Family ID | 43778722 |
Filed Date | 2011-07-07 |
United States Patent
Application |
20110162259 |
Kind Code |
A1 |
GAERTNER; Alfred |
July 7, 2011 |
PRODUCTION OF FATTY ACID DERIVATIVES
Abstract
Methods and compositions for producing fatty acid derivatives,
for example, fatty esters, and commercial fuel compositions
comprising fatty acid derivatives are described.
Inventors: |
GAERTNER; Alfred; (South San
Francisco, CA) |
Assignee: |
LS9, INC.
South San Francisco
CA
|
Family ID: |
43778722 |
Appl. No.: |
12/889163 |
Filed: |
September 23, 2010 |
Related U.S. Patent Documents
|
|
|
|
|
|
Application
Number |
Filing Date |
Patent Number |
|
|
61245943 |
Sep 25, 2009 |
|
|
|
Current U.S.
Class: |
44/385 ; 435/134;
435/243; 435/252.3; 554/1 |
Current CPC
Class: |
C12N 1/36 20130101; C12N
9/001 20130101; C12P 7/649 20130101; C12N 9/1029 20130101; C12Y
203/01075 20130101; Y02E 50/10 20130101; C11C 3/10 20130101; C10L
1/19 20130101; C12N 15/63 20130101; C12Y 103/99003 20130101; C10L
1/026 20130101; Y02E 50/13 20130101 |
Class at
Publication: |
44/385 ; 554/1;
435/134; 435/243; 435/252.3 |
International
Class: |
C10L 1/19 20060101
C10L001/19; C07C 53/126 20060101 C07C053/126; C07C 69/003 20060101
C07C069/003; C12P 7/64 20060101 C12P007/64; C12N 1/00 20060101
C12N001/00; C12N 1/21 20060101 C12N001/21 |
Claims
1. A method of producing a fatty acid derivative, the method
comprising culturing a host cell in the presence of a carbon
source, wherein the host cell is genetically engineered to
overexpress a gene encoding a thioesterase, a gene encoding an
acyl-CoA synthase, and a gene encoding an ester synthase, wherein
the gene encoding the thioesterase and the gene encoding the
acyl-CoA synthase are integrated into the genomic DNA of the host
cell.
2. The method of claim 1, further comprising isolating the fatty
acid derivative.
3. The method of claim 1, further comprising culturing the host
cell in the presence of an alcohol.
4. The method of claim 1, wherein the gene encoding a thioesterase
is tesA, `tesA, fatB, fatB2, fatB3, fatA1, or fatA.
5. The method of claim 1, wherein the gene encoding an acyl-CoA
synthase is fadD, fadK, BH3103, pfl-4354, EAV15023, fadD1, fadD2,
RPC.sub.--4074, fadDD, faa39, the gene that encodes the protein of
GenBank Accession No. ZP.sub.--01644875, or yhfL.
6. The method of claim 1, wherein the gene encoding an ester
synthase is obtained from Acinetobacter sp., Alcanivorax
borkumensis, Alcaligenes eutrophus, Mortierella alpina,
Cryptococcus curvatus, Arabidopsis thaliana, Fundibacter jadensis,
Pseudomonas aeruginosa, Rhodococcus opacus, Marinobacter
hydrocarbonoclastics, Saccharomyces cerevisiae, Homo sapiens, or
Simmondis chinensis.
7. The method of claim 6, wherein the gene encoding an ester
synthase is wax/dgat, a gene encoding a wax synthase, a gene
encoding a bifunctional ester synthase/acyl-CoA:diacylglycerol
acyltransferase, ES9, ES8, DSM 8798, or AtfA2.
8. The method of claim 1, wherein the host cell is genetically
engineered to express, relative to a wild type host cell, a
decreased level of at least one of a gene encoding an acyl-CoA
dehydrogenase, a gene encoding an outer membrane protein receptor,
and a gene encoding a transcriptional regulator of fatty acid
biosynthesis.
9. The method of claim 8, wherein the gene encoding the outer
membrane protein receptor is a gene encoding an outer membrane
ferrichrome transporter.
10. The method of claim 8, wherein the gene encoding the
transcriptional regulator of fatty acid biosynthesis encodes a DNA
transcriptional repressor.
11. A method of producing a fatty acid derivative the method
comprising culturing a host cell in the presence of a carbon
source, wherein the host cell is genetically engineered to
overexpress a gene encoding a thioesterase, a gene encoding an
acyl-CoA synthase, and a gene encoding an ester synthase.
12. The method of claim 11, wherein the host cell is genetically
engineered to express, relative to a wild type host cell, a
decreased level of a gene encoding a pyruvate formate lyase, a
lactate dehydrogenase, or both.
13. The method of claim 1, wherein the host cell is selected from
the group consisting of a mammalian cell, plant cell, insect cell,
yeast cell, fungus cell, filamentous fungi cell, cyanobacterial
cell, and bacterial cell.
14. The method of claim 1, wherein the host cell is an E. coli
cell.
15. A fatty acid derivative produced by the method of claim 1.
16. A fatty ester produced by the method of claim 1.
17. A genetically engineered microorganism comprising at least one
of a gene encoding a thioesterase, a gene encoding an acyl-CoA
synthase, and a gene encoding an ester synthase, wherein the gene
encoding the thioesterase and the gene encoding the acyl-CoA
synthase are integrated into the genomic DNA of the microorganism,
and wherein the microorganism produces an increased level of a
fatty ester relative to a wild-type microorganism.
18. A genetically engineered microorganism comprising an exogenous
control sequence stably incorporated into the genomic DNA of the
microorganism upstream of a gene encoding a thioesterase, and a
gene encoding an acyl-CoA synthase, wherein the microorganism
produces an increased level of a fatty ester relative to a
wild-type microorganism.
19. The genetically engineered microorganism of claim 18, wherein
the exogenous control sequence is a promoter.
20. The genetically engineered microorganism of claim 19, wherein
the microorganism is genetically engineered to express, relative to
a wild type microorganism, a decreased level of at least one of a
gene encoding an acyl-CoA dehydrogenase, a gene encoding an outer
membrane protein receptor, and a gene encoding a transcriptional
regulator of fatty acid biosynthesis.
21. The genetically engineered microorganism of claim 19, wherein
the microorganism is genetically engineered to express, relative to
a wild type microorganism, a decreased level of at least one of a
gene encoding a pyruvate formate lyase, and a gene encoding a
lactate dehydrogenase.
22. The genetically engineered microorganism of claim 20, wherein
the gene encoding the outer membrane protein receptor encodes a
ferrichrome outer membrane transporter.
23. The genetically engineered microorganism of claim 20, wherein
the gene encoding the transcriptional regulator encodes a DNA
binding transcriptional repressor.
24. The genetically engineered microorganism of claim 17, selected
from a Gram-negative or a Gram-positive bacterium.
25. A method of producing a fatty acid derivative comprising
culturing the genetically engineered microorganism of claim 17, in
the presence of an alcohol.
26. A fatty acid derivative produced by the method of claim 25.
27. The fatty acid derivative of claim 26, wherein the fatty acid
derivative is a fatty ester.
28. A biofuel composition comprising the fatty acid derivative of
claim 26.
29. The biofuel composition of claim 28, wherein the composition
has a cloud point of about 5.degree. C. or lower, of about
0.degree. C. or lower, or of about -4.degree. C. or lower.
30. The biofuel composition of claim 28, wherein the composition
has a simulated distillation T90 of about 360.degree. C. or lower,
or of about 358.degree. C. or lower.
31. The biofuel composition of claim 28, wherein the composition
has a sulfur content of about 15 ppm or less, of about 12 ppm or
less, or of about 10.7 ppm or less.
32. The biofuel composition of claim 28, wherein the composition
has a Karl Fischer moisture content of about 0.1 wt. % or less, of
about 0.08 wt. % or less, or of about 0.06 wt. % or less.
33. The biofuel composition of claim 28, wherein the composition
has a total acid number of about 0.50 mg KOH/g or less, of about
0.20 mg KOH/g or less, or of about 0.15 mg KOH/g or less.
34. The biofuel composition of claim 28, wherein the composition
has a combined calcium and magnesium content of about 5 ppm or
less, of about 2 ppm or less, or of about 0.5 ppm or less.
35. The biofuel composition of claim 28, wherein the composition
has a combined sodium and potassium content of about 5 ppm or less,
of about 2.5 ppm or less, or of about 1.6 ppm or less.
36. The biofuel composition of claim 28, wherein the composition
has a specific mass at 20.degree. C. of about 850 to about 900
kg/m.sup.3, of about 860 to about 890 kg/m.sup.3, or of about 870
to about 880 kg/m.sup.3.
37. The biofuel composition of claim 28, wherein the composition
has a kinematic viscosity at 40.degree. C. of about 3 to about 6
mm.sup.2/s, of about 3.2 to about 5 mm.sup.2/s, or of about 3.5 to
about 4 mm.sup.2/s.
38. The biofuel composition of claim 28, wherein the composition
has a water content of about 500 mg/kg or less, of about 480 mg/kg
or less, or of about 450 mg/kg or less.
39. The biofuel composition of claim 28, wherein the composition
has a total contamination level of about 24 mg/kg or less, of about
20 mg/kg or less, of about 10 mg/kg or less, or of about 5 mg/kg or
less.
40. The biofuel composition of claim 28, wherein the composition
has a flash point of about 100.degree. C. or higher, of about
110.degree. C. or higher, or of about 120.degree. C. or higher.
41. The biofuel composition of claim 28, wherein the composition
has an ester content of about 96.5 wt. % or more, or of about 97
wt. % or more.
42. The biofuel composition of claim 28, wherein the composition
has a carbon residue number of about 0.05 wt. % or less.
43. The biofuel composition of claim 28, wherein the composition
has a sulfated ash level of 0.02 wt. % or less, or of 0.01 wt. % or
less.
44. The biofuel composition of claim 28, wherein the composition
has a phosphorus level of about 10 mg/kg or less, of about 5 mg/kg
or less, or of about 1 mg/kg or less.
45. The biofuel composition of claim 28, wherein the composition
has a copper corrosion score of 1.
46. The biofuel composition of claim 28, wherein the composition
has a cold filter plugging point of about 19.degree. C. or lower,
of about 10.degree. C. or lower, or of about 0.degree. C. or
lower.
47. The biofuel composition of claim 28, wherein the composition
has a free glycerol level of about 0.02 wt. % or less.
48. The biofuel composition of claim 28, wherein the composition
has a total glycerol level of about 0.25 wt. % or less, of about
0.15 wt. % or less, of about 0.10 wt. % or less, or of about 0.05
wt. % or less.
49. The biofuel composition of claim 28, wherein the composition
has a monoacylglycerol level of about 0.02 wt. % or less.
50. The biofuel composition of claim 28, wherein the composition
has a methanol or ethanol level of about 0.2 wt. % or less, of
about 0.1 wt. % or less, or of about 0.02 wt. % or less.
51. The biofuel composition of claim 28, wherein the composition
has an iodine number of about 65 g/100 g or less.
52. The biofuel composition of claim 28, wherein the composition
has an oxidation stability at 110.degree. C. of about 6 hours or
longer, of about 9 hours or longer, or of about 11 hours or longer.
Description
CROSS-REFERENCE TO RELATED APPLICATIONS
[0001] This application claims the benefit of U.S. Provisional
Application No. 61/245,943, filed Sep. 25, 2009, the contents of
which are hereby incorporated in their entirety herein.
BACKGROUND OF THE INVENTION
[0002] Petroleum is a limited, natural resource found in the Earth
in liquid, gaseous, or solid forms. Petroleum is primarily composed
of hydrocarbons, which are comprised mainly of carbon and hydrogen.
It also contains significant amounts of other elements, such as,
nitrogen, oxygen, or sulfur, in different forms.
[0003] Petroleum is a valuable resource, but petroleum products are
developed at considerable costs, both financial and environmental.
First, sources of petroleum must be discovered. Petroleum
exploration is an expensive and risky venture. The cost of
exploring deep water wells can exceed $100 million. In addition to
the economic cost, petroleum exploration carries a high
environmental cost. For example, offshore exploration disturbs the
surrounding marine environments.
[0004] After a productive well is discovered, the petroleum must be
extracted from the Earth at great expense. Even under the best
circumstances, only 50% of the petroleum in a well can be
extracted. Petroleum extraction also carries an environmental cost.
For example, petroleum extraction can result in large seepages of
petroleum rising to the surface. Offshore drilling involves
dredging the seabed which disrupts or destroys the surrounding
marine environment.
[0005] After extraction, petroleum must be transported over great
distances from petroleum producing regions to petroleum consuming
regions. In addition to the shipping costs, there is also the
environmental risk of devastating oil spills.
[0006] In its natural form, crude petroleum extracted from the
Earth has few commercial uses. It is a mixture of hydrocarbons
(e.g., paraffins (or alkanes), olefins (or alkenes), alkynes,
napthenes (or cycloalkanes), aliphatic compounds, aromatic
compounds, etc.) of varying length and complexity. In addition,
crude petroleum contains other organic compounds (e.g., organic
compounds containing nitrogen, oxygen, sulfur, etc.) and impurities
(e.g., sulfur, salt, acid, metals, etc.).
[0007] Hence, crude petroleum must be refined and purified before
it can be used commercially. Due to its high energy density and its
easy transportability, most petroleum is refined into fuels, such
as transportation fuels (e.g., gasoline, diesel, aviation fuel,
etc.), heating oil, liquefied petroleum gas, etc.
[0008] Crude petroleum is also a primary source of raw materials
for producing petrochemicals. The two main classes of raw materials
derived from petroleum are short chain olefins (e.g., ethylene and
propylene) and aromatics (e.g., benzene and xylene isomers). These
raw materials are derived from the longer chain hydrocarbons in
crude petroleum by cracking the long chain hydrocarbons at
considerable expense using a variety of methods, such as catalytic
cracking, steam cracking, or catalytic reforming. These raw
materials are used to make petrochemicals, which cannot be directly
refined from crude petroleum, such as monomers, solvents,
detergents, or adhesives.
[0009] One example of a raw material derived from crude petroleum
is ethylene. Ethylene is used to produce petrochemicals such as,
polyethylene, ethanol, ethylene oxide, ethylene glycol, polyester,
glycol ether, ethoxylate, vinyl acetate, 1,2-dichloroethane,
trichloroethylene, tetrachloroethylene, vinyl chloride, and
polyvinyl chloride. Another example of a raw material derived from
crude petroleum is propylene. Propylene is used to produce
isopropyl alcohol, acrylonitrile, polypropylene, propylene oxide,
propylene glycol, glycol ethers, butylene, isobutylene,
1,3-butadiene, synthetic elastomers, polyolefins, alpha-olefins,
fatty alcohols, acrylic acid, acrylic polymers, allyl chloride,
epichlorohydrin, and epoxy resins.
[0010] Petrochemicals can be used to make specialty chemicals, such
as plastics, resins, fibers, elastomers, pharmaceuticals,
lubricants, or gels. Examples of specialty chemicals which can be
produced from petrochemical raw materials are: fatty acids,
hydrocarbons (e.g., long chain hydrocarbons, branched chain
hydrocarbons, saturated hydrocarbons, unsaturated hydrocarbons,
etc.), fatty alcohols, esters, fatty aldehydes, ketones,
lubricants, etc.
[0011] Specialty chemicals have many commercial uses. Fatty acids
are used commercially as surfactants. Surfactants can be found in
detergents and soaps. Fatty acids can also be used as additives in
fuels, lubricating oils, paints, lacquers, candles, salad oils,
shortenings, cosmetics, and emulsifiers. In addition, fatty acids
are used as accelerator activators in rubber products. Fatty acids
can also be used as a feedstock to produce methyl esters, amides,
amines, acid chlorides, anhydrides, ketene dimers, and peroxy acids
and esters.
[0012] Esters have many commercial uses. For example, biodiesel, an
alternative fuel, is comprised of esters (e.g., fatty acid methyl
ester, fatty acid ethyl esters, etc.). Some low molecular weight
esters are volatile with a pleasant odor which makes them useful as
fragrances or flavoring agents. In addition, esters are used as
solvents for lacquers, paints, and varnishes. Furthermore, some
naturally occurring substances, such as waxes, fats, and oils are
comprised of esters. Esters are also used as softening agents in
resins and plastics, plasticizers, flame retardants, and additives
in gasoline and oil. In addition, esters can be used in the
manufacture of polymers, films, textiles, dyes, and
pharmaceuticals.
[0013] In addition, crude petroleum is a source of lubricants.
Lubricants derived petroleum are typically composed of olefins,
particularly polyolefins and alpha-olefins. Lubricants can either
be refined from crude petroleum or manufactured using the raw
materials refined from crude petroleum.
[0014] Obtaining these specialty chemicals from crude petroleum
requires a significant financial investment as well as a great deal
of energy. It is also an inefficient process because frequently the
long chain hydrocarbons in crude petroleum are cracked to produce
smaller monomers. These monomers are then used as the raw material
to manufacture the more complex specialty chemicals.
[0015] In addition to the problems with exploring, extracting,
transporting, and refining petroleum, petroleum is a limited and
dwindling resource. One estimate of current world petroleum
consumption is 30 billion barrels per year. By some estimates, it
is predicted that at current production levels, the world's
petroleum reserves could be depleted before the year 2050.
[0016] Finally, the burning of petroleum based fuels releases
greenhouse gases (e.g., carbon dioxide) and other forms of air
pollution (e.g., carbon monoxide, sulfur dioxide, etc.). As the
world's demand for fuel increases, the emission of greenhouse gases
and other forms of air pollution also increases. The accumulation
of greenhouse gases in the atmosphere leads to an increase in
global warming. Hence, in addition to damaging the environment
locally (e.g., oil spills, dredging of marine environments, etc.),
burning petroleum also damages the environment globally.
[0017] Due to the inherent challenges posed by petroleum, there is
a need for a renewable petroleum source which does not need to be
explored, extracted, transported over long distances, or
substantially refined like petroleum. There is also a need for a
renewable petroleum source that can be produced economically. In
addition, there is a need for a renewable petroleum source that
does not create the type of environmental damage produced by the
petroleum industry and the burning of petroleum based fuels. For
similar reasons, there is also a need for a renewable source of
chemicals that are typically derived from petroleum.
[0018] Renewable energy sources, such as sunlight, water, wind, and
biomass, are a potential alternative to petroleum fuels. Biofuel is
a biodegradable, clean-burning combustible fuel produced from
biomass, and can be made of alkanes and esters. An exemplary
biofuel is biodiesel. Biodiesel can be used in most internal
combustion diesel engines in either a pure form, which is referred
to as "neat" biodiesel, or as a mixture in any concentration with
regular petroleum diesel.
[0019] Biodiesel offers advantages compared to petroleum-based
diesel, including reduced emissions (e.g., carbon monoxide,
sulphur, aromatic hydrocarbons, soot particles) during combustion.
Biodiesel also maintains a balanced carbon dioxide cycle because it
is based on renewable biological materials. Biodiesel is typically
biodegradable, and imparts enhanced safety due to its high flash
point and low flammability. Furthermore, biodiesel provides good
lubrication properties, thereby reducing wear and tear on
engines.
[0020] Current methods of making biodiesel involve
transesterification of triacylglycerides from vegetable oil
feedstocks, such as rapeseed in Europe, soybean in North America,
and palm oil in South East Asia. Industrial-scale biodiesel
production is thus geographically and seasonally restricted to
areas where vegetable oil feedstocks are produced. The
transesterification process leads to a mixture of fatty esters
which can be used as biodiesel. However, glycerin is an undesirable
byproduct of the transesterification process. To be usable as
biodiesel, the fatty esters must be further purified from the
heterogeneous product. This increases costs and the amount of
energy required for fatty ester production and, ultimately,
biodiesel production as well. Furthermore, vegetable oil feedstocks
are inefficient sources of energy because they require extensive
acreage for cultivation. For example, the yield of biodiesel from
rapeseed is only 1300 L/hectare because only the seed oil is used
for biodiesel production, while the rest of the rapeseed biomass is
discarded. Additionally, cultivating some vegetable oil feedsocks,
such as rapeseed and soybean, requires frequent crop rotation to
prevent nutrient depletion of the land.
[0021] Thus there is a need for an economically- and
energy-efficient biofuel, and methods of making biofuels from
renewable energy sources such as biomass.
SUMMARY OF THE INVENTION
[0022] The invention is based, at least in part, on the production
of fatty esters, such as fatty esters, including, for example fatty
acid methyl esters ("FAME") and fatty acid ethyl esters ("FAEE"),
from genetically engineered microorganisms. Accordingly, in one
aspect, the invention features a method of producing a fatty ester.
The method comprises culturing a host cell in the presence of a
carbon source, wherein the host cell is genetically engineered to
express or overexpress a gene encoding a thioesterase, a gene
encoding an acyl-CoA synthase, and a gene encoding an ester
synthase. In some embodiments, the method further comprises
isolating the fatty ester.
[0023] In some embodiments, the fatty ester is present in the
extracellular environment. In some embodiments, the fatty ester is
isolated from the extracellular environment of the host cell. In
some embodiments, the fatty ester is spontaneously secreted,
partially or completely, from the host cell. In alternative
embodiments, the fatty ester is transported into the extracellular
environment, optionally with the aid of one or more suitable
transport proteins. In other embodiments, the fatty ester is
passively transported into the extracellular environment.
[0024] In some embodiments, the method further comprises culturing
the host cell in the presence of an alcohol. In certain
embodiments, the alcohol is methanol, ethanol, propanol, or
butanol. In some embodiments, the alcohol is present at a
concentration of about 1 mL/L to about 100 mL/L. For example, the
alcohol is present at a concentration of about 1 mL/L or more
(e.g., about 1 mL/L or more, about 5 mL/L or more, about 10 mL/L or
more). In alternative embodiments, the alcohol is present at a
concentration of about 100 mL/L or less (e.g., about 100 mL/L or
less, about 90 mL/L or less, about 80 mL/L or less).
[0025] In certain embodiments, the gene encoding a thioesterase is
tesA, `tesA, tesB, fatB, fatB2, fatB3, fatA1, or fatA. In some
embodiments, the gene encoding an acyl-CoA synthase is fadD, fadK,
BH3103, pfl-4354, EAV15023, fadD1, fadD2, RPC.sub.--4074, fadDD35,
fadDD22, faa39, the gene encoding the protein of GenBank Accession
No. ZP.sub.--01644857, or yhfL. In yet other embodiments, the gene
encoding an ester synthase is one encoding an enzyme of enzyme
classification EC 2.3.1.75 or EC 2.3.1.20, one encoding wax/dgat, a
bifunctional ester synthase/acyl-CoA:diacylglycerl acyltransferase
from Simmondsia chinensis, Acinetobacter sp. ADP1, Alcanivorax
borkumensis, Pseudomonas aeruginosa, Fundibacter jadensis,
Arabidopsis thaliana, or Alkaligenes eutrophus, or one encoding
AtfA1, AtfA2, ES9, or ES8, or a variant thereof.
[0026] In other embodiments, the host cell is genetically
engineered to express, relative to a wild type host cell, a
decreased level of at least one of a gene encoding an acyl-CoA
dehydrogenase, a gene encoding an outer membrane protein receptor,
and a gene encoding a transcriptional regulator of fatty acid
biosynthesis. In some embodiments, one or more of a gene encoding
an acyl-CoA dehydrogenase, a gene encoding an outer membrane
protein receptor, and a gene encoding a transcriptional regulator
of fatty acid biosynthesis are functionally deleted.
[0027] In some embodiments, the gene encoding an acyl-CoA
dehydrogenase is fadE. In some embodiments, the gene encoding an
outer membrane protein receptor encodes an outer membrane
ferrichrome transporter, for example, fhuA. Yet in other
embodiments, the gene encoding a transcriptional regulator of fatty
acid biosynthesis encodes a DNA transcription repressor, for
example, fabR.
[0028] In some embodiments, the host cell is genetically engineered
to express, relative to a wild type host cell, an attenuated level
of at least one of a gene encoding a pyruvate formate lyase, a gene
encoding a lactate dehydrogenase, or both. In certain embodiments,
one or more of a gene encoding a pyruvate formate lyase, a gene
encoding a lactate dehydrogenase, or both, are functionally
deleted. In some embodiments, the gene encoding a pyruvate formate
lyase is pflB. In certain embodiments, the gene encoding a lactate
dehydrogenase is ldhA.
[0029] In some embodiments, the host cell is genetically engineered
to express, relative to a wild type host cell, attenuated levels of
one or more or all of a gene encoding an acyl-CoA dehydrogenase, a
gene encoding a ferrichrome transporter, a gene encoding a pyruvate
formate lyase, and a gene encoding a lactate dehydrogenase. In
other embodiments, the host cell is engineered such that one or
more or all of a gene encoding an endogenous acyl-CoA
dehydrogenase, a gene encoding an endogenous ferrichrome
transporter, a gene encoding an endogenous pyruvate formate lyase,
and a gene encoding an endogenous lactate dehydrogenase are
functionally deleted from the host cell.
[0030] In some embodiments, the host cell is cultured in a culture
medium comprising an initial concentration of the carbon source of
about 2 g/L to about 100 g/L. In other embodiments, the culture
medium comprises an initial concentration of about 2 g/L to about
10 g/L of a carbon source, of about 10 g/L to about 20 g/L of a
carbon source, of about 20 g/L to about 30 g/L of a carbon source,
of about 30 g/L to about 40 g/L of a carbon source, or of about 40
g/L to about 50 g/L of a carbon source.
[0031] In some embodiments, the method further includes the step of
monitoring the level of the carbon source in the culture medium. In
some embodiments, the method further includes adding a supplemental
carbon source to the culture medium when the level of the carbon
source in the medium is less than about 0.5 g/L. In some
embodiments, supplemental carbon source is added to the culture
medium when the level of the carbon source in the medium is less
than about 0.4 g/L, less than about 0.3 g/L, less than about 0.2
g/L, or less than about 0.1 g/L.
[0032] In some embodiments, the supplemental carbon source is added
to maintain a carbon source level of about 1 g/L to about 25 g/L.
In some embodiments, the supplemental carbon source is added to
maintain a carbon source level of about 2 g/L or more (e.g., about
2 g/L or more, about 3 g/L or more, about 4 g/L or more). In
certain embodiments, the supplemental carbon source is added to
maintain a carbon source level of about 5 g/L or less (e.g., about
5 g/L or less, about 4 g/L or less, about 3 g/L or less). In some
embodiments, the supplemental carbon source is added to maintain a
carbon source level of about 2 g/L to about 5 g/L, of about 5 g/L
to about 10 g/L, or of about 10 g/L to about 25 g/L. In some
embodiments, the carbon source is glucose.
[0033] In some embodiments, the fatty acid methyl ester is produced
at a concentration of about 1 g/L to about 200 g/L. In some
embodiments, the fatty acid methyl ester is produced at a
concentration of about 1 g/L or more (e.g., about 1 g/L or more,
about 10 g/L or more, about 20 g/L or more, about 50 g/L or more,
about 100 g/L or more). In some embodiments, the fatty acid methyl
ester is produced at a concentration of about 1 g/L to about 170
g/L, of about 1 g/L to about 10 g/L, of about 40 g/L to about 170
g/L, of about 100 g/L to about 170 g/L, of about 10 g/L to about
100 g/L, of about 1 g/L to about 40 g/L, of about 40 g/L to about
100 g/L, or of about 1 g/L to about 100 g/L.
[0034] In some embodiments, the host cell is selected from the
group consisting of a mammalian cell, plant cell, insect cell,
yeast cell, fungus cell, filamentous fungi cell, and bacterial
cell.
[0035] In particular embodiments, the host cell is selected from
the genus Escherichia, Bacillus, Lactobacillus, Rhodococcus,
Synechococcus, Synechoystis, Pseudomonas, Aspergillus, Trichoderma,
Neurospora, Fusarium, Humicola, Rhizomucor, Kluyveromyces, Pichia,
Mucor, Myceliophtora, Penicillium, Phanerochaete, Pleurotus,
Trametes, Chrysosporium, Saccharomyces, Stenotrophamonas,
Schizosaccharomyces, Yarrowia, or Streptomyces.
[0036] In other embodiments, the host cell is a Bacillus lentus
cell, a Bacillus brevis cell, a Bacillus stearothermophilus cell, a
Bacillus licheniformis cell, a Bacillus alkalophilus cell, a
Bacillus coagulans cell, a Bacillus circulans cell, a Bacillus
pumilis cell, a Bacillus thuringiensis cell, a Bacillus clausii
cell, a Bacillus megaterium cell, a Bacillus subtilis cell, or a
Bacillus amyloliquefaciens cell.
[0037] In certain embodiments, the host cell is a Synechococcus sp.
PCC7002, Synechococcus elongatus PCC 7942, Synechoystis sp. PCC
6803, Synechococcus elongatus PCC6301, Prochlorococcus marinus
CCMP1986 (MED4), Anabaena variabilis ATCC29413, Nostoc punctiforme
ATCC29133 (PCC73102), Gloeobacter violaceus ATCC29082 (PCC7421),
Nostoc sp. ATCC27893 (PCC7120), Cyanothece sp. PCC7425 (29141),
Cyanothece sp. ATCC51442, or Synechococcus sp. ATCC27264
(PCC7002).
[0038] In other embodiments, the host cell is a Trichoderma
koningii cell, a Trichoderma viride cell, a Trichoderma reesei
cell, a Trichoderma longibrachiatum cell, an Aspergillus awamori
cell, an Aspergillus fumigates cell, an Aspergillus foetidus cell,
an Aspergillus nidulans cell, an Aspergillus niger cell, an
Aspergillus oryzae cell, a Humicola insolens cell, a Humicola
lanuginose cell, a Rhodococcus opacus cell, a Rhizomucor miehei
cell, or a Mucor michei cell.
[0039] In other embodiments, the host cell is an Actinomycetes
cell. In yet other embodiments, the host cell is a Streptomyces
lividans cell or a Streptomyces murinus cell. In other embodiments,
the host cell is a Saccharomyces cerevisiae cell.
[0040] In yet other embodiments, the host cell is a cell from an
eukaryotic plant, algae, cyanobacterium, green-sulfur bacterium,
green non-sulfur bacterium, purple sulfur bacterium, purple
non-sulfur bacterium, extremophile, yeast, fungus, engineered
organisms thereof, or a synthetic organism. In some embodiments,
the host cell is light dependent or fixes carbon. In some
embodiments, the host cell has autotrophic activity. In some
embodiments, the host cell has photoautotrophic activity, such as
in the presence of light. In certain embodiments, the host cell is
a cell from Arabidopsis thaliana, Panicum virgatums, Miscanthus
giganteus, Zea mays, botryococcuse braunii, Chlamydomonas
reinhardtii, Dunaliela salina, Thermosynechococcus elongatus,
Chlorobium tepidum, Chloroflexus auranticus, Chromatiumm vinosum,
Rhodospirillum rubrum, Rhodobacter capsulatus, Rhodopseudomonas
palusris, Clostridium ljungdahlii, Clostridiuthermocellum, or
Pencillium chrysogenum.
[0041] In certain other embodiments, the host cell is from Pichia
pastories, Saccharomyces cerevisiae, Yarrowia lipolytica,
Schizosaccharomyces pombe, Pseudomonas fluorescens, or Zymomonas
mobilis. In yet further embodiments, the host cell is a cell from
Synechococcus sp. PCC 7002, Synechococcus sp. PCC 7942, or
Synechocystis sp. PCC6803.
[0042] In some embodiments, the host cell is a CHO cell, a COS
cell, a VERO cell, a BHK cell, a HeLa cell, a Cv1 cell, an MDCK
cell, a 293 cell, a 3T3 cell, or a PC12 cell.
[0043] In particular embodiments, the host cell is an E. coli cell.
In some embodiments, the E. coli cell is a strain B, a strain C, a
strain K, or a strain W E. coli cell.
[0044] In another aspect, the invention features a genetically
engineered microorganism capable of producing fatty esters under
conditions that allow product production. In some embodiments, the
genetically engineered microorganism comprises at least one or more
of a gene encoding a thioesterase, a gene encoding an acyl-CoA
synthase, and a gene encoding an ester synthase. In certain
embodiments, the genetically engineered microorganism comprises a
gene encoding a thioesterase, a gene encoding an acyl-CoA synthase,
and a gene encoding an ester synthase. In certain embodiments, two
or more of the genes encoding a thioesterase, a gene encoding an
acyl-CoA synthase, and a gene encoding an ester synthase are linked
in a single operon. In certain other embodiments, all three of the
genes encoding a thioesterase, a gene encoding an acyl-CoA
synthase, and a gene encoding an ester synthase are linked into a
single operon.
[0045] In certain embodiment, the genetically engineered
microorganism comprises an exogenous control sequence stably
incorporated into the genomic DNA of the microorganism upstream of
one or more of at least one of a gene encoding a thioesterase, a
gene encoding an acyl-CoA synthase, and a gene encoding an ester
synthase. In an exemplary embodiment, the microorganism is
engineered such that it comprises an exogenous control sequence
stably incorporated into the genomic DNA of the microorganism up
stream of a gene encoding a thioesterase, an acyl-CoA synthase and
a gene encoding an ester synthase. In certain embodiments, the
microorganism engineered as such produces an increased level of a
fatty ester relative to a wild-type microorganism. In certain
embodiments, the exogenous control sequence is, for example, a
promoter. Exemplary promoters include, without limitation, a
developmentally-regulated, organelle-specific, tissue-specific,
inducible, constitutive, or cell-specific promoter.
[0046] In further embodiments, the microorganism is genetically
engineered to express, relative to a wild type microorganism, a
decreased level of at least one of a gene encoding an acyl-CoA
dehydrogenase, a gene encoding an outer membrane protein receptor,
and a gene encoding a transcriptional regulator of fatty acid
biosynthesis. In certain embodiments, the microorganism is
genetically engineered such that at one or more of a gene encoding
an acyl-CoA dehydrogenase, a gene encoding an outer membrane
protein receptor, and a gene encoding a transcriptional regulator
of fatty acid biosynthesis are functionally deleted. In certain
embodiments, the gene encoding the acyl-CoA dehydrogenase is fadE.
In other embodiments, the gene encoding an outer membrane protein
receptor encodes an outer membrane ferrichrome transporter, for
example, fhuA. In further embodiments, the gene encoding a
transcriptional regulator of fatty acid biosynthesis encodes a DNA
transcription repressor, for example, fabR.
[0047] In some embodiments, the microorganism is genetically
engineered to express, relative to a wild type microorganism, an
attenuated level of a gene encoding a pyruvate formate lyase, a
lactate dehydrogenase, or both. In some embodiments, at least one
of a gene encoding a pyruvate formate lyase and a gene encoding a
lactate dehydrogenase are functionally deleted. In some
embodiments, the gene encoding the pyruvate formate lyase is pflB.
In other embodiments, the gene encoding the lactate dehydrogenase
is ldhA.
[0048] In certain embodiments, the microorganism is genetically
engineered to express attenuated levels of one or more or all of an
acyl-CoA dehydrogenase gene, an outer membrane ferrichrome
transporter gene, a pyruvate formate lyase gene, and a lactate
dehydrogenase gene. In other embodiments, one or more or all of an
endogenous acyl-CoA dehydrogenase gene, an endogenous outer
membrane ferrichrome transporter gene, an endogenous pyruvate
formate lyase gene, and an endogenous lactate dehydrogenase gene
are functionally deleted from the microorganism.
[0049] In certain embodiments, the genetically engineered
microorganism can suitably be selected from a Gram-negative or a
Gram-positive bacterium. In some embodiments, the genetically
engineered microorganism is selected from an E. coli,
mycobacterium, Nocardia sp., Nocardia farcinica, Streptomyces
griseus, Salinispora arenicola, Clavibacter michiganenesis,
Acinetobacter, Alcanivorax, Alcaligenes, Arabidopsis, Fundibacter,
Marinobacter, Mus musculus, Pseudomonas, or Simmodsia, Yarrowia,
Candida, Rhodotorula, Rhodosporidium, Cryptococcus, Trichosporon,
or Lipomyces. In certain embodiments, the genetically engineered
microorganism is selected from an E. coli strain B, strain C,
strain K or strain W. In further embodiments, the genetically
engineered microorganism is selected from Synechococcus sp.
PCC7002, Synechococcus elongatus PCC 7942, Synechoystis sp. PCC
6803, Synechococcus elongatus PCC6301, Prochlorococcus marinus
CCMP1986 (MED4), Anabaena variabilis ATCC29413, Nostoc punctiforme
ATCC29133 (PCC73102), Gloeobacter violaceus ATCC29082 (PCC7421),
Nostoc sp. ATCC27893 (PCC7120), Cyanothece sp. PCC7425 (29141),
Cyanothece sp. ATCC51442, or Synechococcus sp. ATCC27264
(PCC7002).
[0050] In another aspect, the invention features a method of
producing a fatty ester comprising culturing a genetically
engineered microorganism described herein in the presence of a
suitable alcohol substrate. In certain embodiments, the alcohol
substrate is ethanol, methanol, propanol, or butanol. In a
particular embodiment, the alcohol substrate is a methanol. In
further embodiments, the method further comprises culturing the
genetically engineered microorganism under conditions that allows
it to produce the fatty ester product.
[0051] In any of the aspects described herein, the fatty ester is
produced at a yield of about 0.5 g to about 50 g of fatty ester per
100 g of glucose in the culture medium. For example, the fatty
ester is produced at a yield of about 0.5 g of fatty ester per 100
g of glucose or more (e.g., about 0.5 g of fatty ester per 100 g of
glucose or more, about 2 g of fatty ester per 100 g of glucose or
more, about 5 g of fatty ester per 100 g of glucose or more, about
10 g of fatty ester per 100 g of glucose or more, or about 15 g of
fatty ester per 100 g of glucose or more). In particular
embodiments, the fatty ester is produced at a yield of about 0.5 g
to about 40 g of fatty ester per 100 g of glucose, about 0.5 g to
about 30 g of fatty ester per 100 g of glucose, about 0.5 g to
about 20 g of fatty ester per 100 g of glucose, about 0.5 g to
about 18 g of fatty ester per 100 g of glucose, about 2 g to about
16 g of fatty ester per 100 g of glucose, or about 5 g to about 15
g of fatty ester per 100 g of glucose in the culture medium. In
particular embodiments, the fatty ester is produced at a yield of
at least 0.5 g of fatty ester, at least 4 g of fatty ester, at
least 5 g of fatty ester, at least 10 g of fatty ester, at least 20
g of fatty ester, at least 30 g of fatty ester, at least 40 g of
fatty ester, or at least 50 g of fatty ester per 100 g of glucose
in the culture medium. In particular embodiments, the fatty ester
is produced at a yield of no more than 50 g of fatty ester per 100
g of glucose in the culture medium.
[0052] In some embodiments, the fatty ester is produced at a yield
of about 0.5% to about 50% by mass of the glucose in the culture
medium. For example, the fatty ester is produced at a yield of
about 0.5% or more (e.g., about 0.5% or more, about 2% or more,
about 5% or more, or about 10% or more) by mass of the glucose in
the culture medium. In particular embodiments, the fatty ester is
produced at a yield of about 0.5% to about 40%, about 0.5% to about
30%, about 0.5% to about 20%, about 0.5% to about 10%, about 0.5%
to about 5%, or about 0.5% to about 4% by mass of the glucose in
the culture medium. In particular embodiments, the fatty ester is
produced at a yield of at least about 0.5%, at least about 4%, at
least about 5%, at least about 10%, at least about 20%, at least
about 30%, at least about 40%, or at least about 50% by mass of
glucose in the culture medium. In particular embodiments, the fatty
ester is produced at a yield of no more than 50% by mass of glucose
in the culture medium.
[0053] In some embodiments, the fatty ester is produced at a yield
of about 10% to about 95% by mass of carbon in the carbon source in
the culture medium. In particular embodiments, the fatty ester is
produced at a yield of about 15% to about 90%, about 20% to about
80%, or about 30% to about 70% by mass of carbon in the carbon
source in the culture medium. In particular embodiments, the fatty
ester is produced at a yield of at least about 10%, at least about
20%, at least about 30%, at least about 40%, at least about 50%, at
least about 60%, at least about 70%, at least about 80%, at least
about 90%, or at least about 95% by mass of carbon in the carbon
source in the culture medium. In particular embodiments, the fatty
ester is produced at a yield of no more than 95% by mass of carbon
in the carbon source in the culture medium.
[0054] In some embodiments, the fatty ester is a fatty acid ethyl
ester and is produced at a yield of about 0.5 g to about 50 g of
fatty acid ethyl ester per 100 g of glucose in the culture medium.
For example, the fatty acid ethyl ester and is produced at a yield
of about 0.5 g or more (e.g., about 0.5 g or more, about 2 g or
more, about 5 g or more, about 10 g or more, about 15 g or more)
per 100 g of glucose in the culture medium. In particular
embodiments, the fatty acid ethyl ester is produced at a yield of
about 0.5 g to about 40 g of fatty acid ethyl ester per 100 g of
glucose, about 0.5 g to about 30 g of fatty acid ethyl ester per
100 g of glucose, about 0.5 g to about 20 g of fatty acid ethyl
ester per 100 g of glucose, about 0.5 g to about 10 g of fatty acid
ethyl ester per 100 g of glucose, about 0.5 g to about 5 g of fatty
acid ethyl ester per 100 g of glucose, or about 0.5 g to about 4 g
of fatty acid ethyl ester per 100 g of glucose in the culture
medium. In particular embodiments, the fatty acid ethyl ester is
produced at a yield of at least 0.5 g of fatty acid ethyl ester, at
least 4 g of fatty acid ethyl ester, at least 5 g of fatty acid
ethyl ester, at least 10 g of fatty acid ethyl ester, at least 20 g
of fatty acid ethyl ester, at least 30 g of fatty acid ethyl ester,
at least 40 g of fatty acid ethyl ester, or at least 50 g of fatty
acid ethyl ester per 100 g of glucose in the culture medium. In
particular embodiments, the fatty acid ethyl ester is produced at a
yield of no more than 50 g of fatty acid ethyl ester per 100 g of
glucose in the culture medium.
[0055] In some embodiments, the fatty acid ethyl ester is produced
at a yield of about 0.5% to about 50% by mass of the glucose in the
culture medium. For example, the fatty acid ethyl ester is produced
at a yield of about 0.5% or more (e.g., of about 0.5% or more, of
about 1% or more, of about 2% or more of about 5% or more of about
10% or more) by mass of the glucose in the culture medium. In
particular embodiments, the fatty acid ethyl ester is produced at a
yield of about 0.5% to about 40%, about 0.5% to about 30%, about
0.5% to about 20%, about 0.5% to about 10%, about 0.5% to about 5%,
or about 0.5% to about 4% by mass of the glucose in the culture
medium. In particular embodiments, the fatty acid ethyl ester is
produced at a yield of at least about 0.5%, at least about 4%, at
least about 5%, at least about 10%, at least about 20%, at least
about 30%, at least about 40%, or at least about 50% by mass of
glucose in the culture medium. In particular embodiments, the fatty
acid ethyl ester is produced at a yield of no more than 50% by mass
of glucose in the culture medium.
[0056] In some embodiments, the fatty acid ethyl ester is produced
at a yield of about 10% to about 95% by mass of carbon in the
carbon source in the culture medium. For example, the fatty acid
ethyl ester is produced at a yield of about 10% or more (e.g., of
about 10% or more, of about 15% or more, of about 20% or more, of
about 25% or more) by pass of carbon in the carbon source in the
culture medium. In particular embodiments, the fatty acid ethyl
ester is produced at a yield of about 15% to about 90%, about 20%
to about 80%, or about 30% to about 70% by mass of carbon in the
carbon source in the culture medium. In particular embodiments, the
fatty acid ethyl ester is produced at a yield of at least about
10%, at least about 20%, at least about 30%, at least about 40%, at
least about 50%, at least about 60%, at least about 70%, at least
about 80%, at least about 90%, or at least about 95% by mass of
carbon in the carbon source in the culture medium. In particular
embodiments, the fatty acid ethyl ester is produced at a yield of
no more than 95% by mass of carbon in the carbon source in the
culture medium.
[0057] In some embodiments, the fatty ester is a fatty acid methyl
ester and is produced at a yield of about 0.5 g to about 50 g of
fatty acid methyl ester per 100 g of glucose in the culture medium.
For example, the fatty ester is a fatty acid methyl ester and is
produced at a yield of about 0.5 g or more (e.g., about 0.5 g or
more, about 1 g or more, about 2 g or more, about 5 g or more,
about 10 g or more) of fatty acid methyl ester per 100 g of glucose
in the culture medium. In particular embodiments, the fatty acid
methyl ester is produced at a yield of about 0.5 g to about 40 g of
fatty acid methyl ester per 100 g of glucose, about 0.5 g to about
30 g of fatty acid methyl ester per 100 g of glucose, about 0.5 g
to about 20 g of fatty acid methyl ester per 100 g of glucose,
about 0.5 g to about 10 g of fatty acid methyl ester per 100 g of
glucose, about 0.5 g to about 5 g of fatty acid methyl ester per
100 g of glucose, or about 0.5 g to about 4 g of fatty acid methyl
ester per 100 g of glucose in the culture medium. In particular
embodiments, the fatty acid methyl ester is produced at a yield of
at least 0.5 g of fatty acid methyl ester, at least 4 g of fatty
acid methyl ester, at least 5 g of fatty acid methyl ester, at
least 10 g of fatty acid methyl ester, at least 20 g of fatty acid
methyl ester, at least 30 g of fatty acid methyl ester, at least 40
g of fatty acid methyl ester, or at least 50 g of fatty acid methyl
ester per 100 g of glucose in the culture medium. In particular
embodiments, the fatty acid methyl ester is produced at a yield of
no more than 50 g of fatty acid methyl ester per 100 g of glucose
in the culture medium.
[0058] In some embodiments, the fatty acid methyl ester is produced
at a yield of about 0.5% to about 50% by mass of the glucose in the
culture medium. For example, fatty acid methyl ester is produced at
a yield of about 0.5% or more (e.g., about 0.5% or more, about 1%
or more, about 2% or more, about 5% or more, about 10% or more,
about 15% or more) by mass of the glucose in the culture medium. In
particular embodiments, the fatty acid methyl ester is produced at
a yield of about 0.5% to about 40%, about 0.5% to about 30%, about
0.5% to about 20%, about 0.5% to about 10%, about 0.5% to about 5%,
or about 0.5% to about 4% by mass of the glucose in the culture
medium. In particular embodiments, the fatty acid methyl ester is
produced at a yield of at least about 0.5%, at least about 4%, at
least about 5%, at least about 10%, at least about 20%, at least
about 30%, at least about 40%, or at least about 50% by mass of
glucose in the culture medium. In particular embodiments, the fatty
acid methyl ester is produced at a yield of no more than 50% by
mass of glucose in the culture medium.
[0059] In some embodiments, the fatty acid methyl ester is produced
at a yield of about 10% to about 95% by mass of carbon in the
carbon source in the culture medium. For example, the fatty acid
methyl ester is produced at a yield of about 10% or more (e.g.,
about 10% or more, about 20% or more, about 25% or more, about 30%
or more, about 35% or more, about 40% or more) by mass of carbon in
the carbon source in the culture medium. In particular embodiments,
the fatty acid methyl ester is produced at a yield of about 15% to
about 90%, about 20% to about 80%, or about 30% to about 70% by
mass of carbon in the carbon source in the culture medium. In
particular embodiments, the fatty acid methyl ester is produced at
a yield of at least about 10%, at least about 20%, at least about
30%, at least about 40%, at least about 50%, at least about 60%, at
least about 70%, at least about 80%, at least about 90%, or at
least about 95% by mass of carbon in the carbon source in the
culture medium. In particular embodiments, the fatty acid methyl
ester is produced at a yield of no more than 95% by mass of carbon
in the carbon source in the culture medium.
[0060] In another aspect, the invention features a fatty ester,
such as a fatty acid ester, produced by a method described herein.
In some embodiments, the fatty acid ester is a fatty acid methyl
ester. In some embodiments, the fatty acid methyl ester is at least
about 4, 6, 8, 10, 12, 14, 16, or 18 carbons in length.
[0061] In some embodiments, the fatty ester comprises an A side
(i.e., the carbon chain attached to the carboxylate oxygen) and a B
side (i.e., the carbon chain comprising the parent carboxylate). In
some embodiments, the B side of the fatty acid methyl ester is at
least about 4, 6, 8, 10, 12, 14, 16, or 18 carbons in length. In
some embodiments, the B side of the fatty acid methyl ester
includes a straight chain. In other embodiments, the B side of the
fatty acid methyl ester includes a branched chain. In still other
embodiments, the B side of the fatty acid methyl ester comprises at
least one cyclic moiety. In further embodiments, the fatty ester is
selected from methyl dedecanoate, methyl 5-dodecenoate, methyl
tetradecanoate, methyl 7-tetradecenoate, methyl hexadecanoate,
methyl 9-hexadecenoate, methyl octadecanoate, methyl
11-octadecenoate, or combinations thereof. In yet further
embodiments, the fatty ester is selected from ethyl dedecanoate,
ethyl 5-dodecenoate, ethyl tetradecanoate, ethyl 7-tetradecenoate,
ethyl hexadecanoate, ethyl 9-hexadecenoate, ethyl octadecanoate,
ethyl 11-octadecenoate, or combinations thereof.
[0062] In some embodiments, the fatty acid methyl ester is
saturated. In other embodiments, the fatty acid methyl ester is
unsaturated. In other embodiments, the fatty acid methyl ester is
monounsaturated. In certain embodiments, the fatty acid ethyl ester
is saturated. In other embodiments, the fatty acid ethyl ester is
unsaturated. In other embodiments, the fatty acid ethyl ester is
monounsaturated.
[0063] In a further aspect, the invention features a method of
producing a biologically-derived diesel fuel of commercial quality
according to commercial standards (e.g., ASTM or ANP). In some
embodiments, the method comprises fermenting carbohydrates using a
genetically modified microorganism described herein. The process
provides a direct route for producing fatty esters, for example,
fatty acid esters such as fatty acid methyl esters or fatty acid
ethyl esters, without the need of producing oils, which are later
chemically transesterified with the concomitant production of large
quantities of glycerol. The fuel composition thus produced can be
utilized as a diesel fuel alone, or be blended with petroleum
diesel according to customary proportions, resulting in clean
emission profiles and low amounts of impurities and/or undesirable
contaminants.
[0064] In another aspect, the invention features a fuel
composition, including, for example, a diesel composition,
comprising a fatty ester produced by a method or a genetically
engineered microorganism described herein. In certain embodiments,
the fuel composition further comprises one or more suitable fuel
additives.
[0065] The drawings and examples provided herein are intended
solely to illustrate the features of the present invention. They
are not intended to be limiting.
DETAILED DESCRIPTION OF THE INVENTION
[0066] Unless otherwise defined, all technical and scientific terms
used herein have the same meaning as commonly understood by one of
ordinary skill in the art to which this invention belongs. Although
methods and materials similar or equivalent to those described
herein can be used in the practice or testing of the present
invention, suitable methods and materials are described below. All
publications, patent applications, patents, and other references
mentioned herein, including GenBank database sequences, are
incorporated by reference in their entirety. In case of conflict,
the present specification, including definitions, will control. In
addition, the materials, methods, and examples are illustrative
only and not intended to be limiting.
[0067] Other features and advantages of the invention will be
apparent from the following detailed description, and from the
claims.
DEFINITIONS
[0068] The articles "a" and "an" are used herein to refer to one or
to more than one (i.e., to at least one) of the grammatical object
of the article. By way of example, "an element" means one element
or more than one element.
[0069] The term "about" is used herein to mean a value .+-.20% of a
given numerical value. Thus, "about 60%" means a value of between
60.+-.(20% of 60) (i.e., between 48 and 72).
[0070] As used herein, the term "attenuate" means to weaken,
reduce, or diminish. For example, a polypeptide can be attenuated
by modifying the polypeptide to reduce its activity (e.g., by
modifying a nucleotide sequence that encodes the polypeptide).
[0071] As used herein, the term "biocrude" refers to a product
derived from biomass, biomass derivatives, or other biological
sources that, like petroleum crude, can be converted into other
fuels. For example, biocrude can be converted into gasoline,
diesel, jet fuel, or heating oil. Moreover, biocrude, like
petroleum crude, can be converted into other industrially useful
chemicals for use in, for example, pharmaceuticals, cosmetics,
consumer goods, industrial processes, and the like.
[0072] Biocrude may include, for example, hydrocarbons, hydrocarbon
products, fatty acid esters, and/or aliphatic ketones. In a
preferred embodiment, biocrude is comprised of hydrocarbons, for
example aliphatic (e.g., alkanes, alkenes, alkynes) or aromatic
hydrocarbons.
[0073] As used herein, the term "biodiesel" means a biofuel that
can be a substitute of diesel, which is derived from petroleum.
Biodiesel can be used in internal combustion diesel engines in
either a pure form, which is referred to as "neat" biodiesel, or as
a mixture in any concentration with petroleum-based diesel.
[0074] In one embodiment, biodiesel can include esters or
hydrocarbons, such as aldehydes, alkanes, or alkenes.
[0075] As used herein, the term "biofuel" refers to any fuel
derived from biomass, biomass derivatives, or other biological
sources. Biofuels can be substituted for petroleum based fuels. For
example, biofuels are inclusive of transportation fuels (e.g.,
gasoline, diesel, jet fuel, etc.), heating fuels, and
electricity-generating fuels. Biofuels are a renewable energy
source.
[0076] As used herein, the term "biomass" refers to a carbon source
derived from biological material. Biomass can be converted into a
biofuel. One exemplary source of biomass is plant matter. For
example, corn, sugar cane, or switchgrass can be used as biomass.
Another non-limiting example of biomass is animal matter, for
example cow manure. Biomass also includes waste products from
industry, agriculture, forestry, and households. Examples of such
waste products that can be used as biomass are fermentation waste,
straw, lumber, sewage, garbage, and food leftovers. Biomass also
includes sources of carbon, such as carbohydrates (e.g.,
monosaccharides, disaccharides, or polysaccharides).
[0077] As used herein, the phrase "carbon source" refers to a
substrate or compound suitable to be used as a source of carbon for
prokaryotic or simple eukaryotic cell growth. Carbon sources can be
in various forms, including, but not limited to polymers,
carbohydrates, acids, alcohols, aldehydes, ketones, amino acids,
peptides, and gases (e.g., CO and CO.sub.2). These include, for
example, various monosaccharides, such as glucose, fructose,
mannose, and galactose; oligosaccharides, such as
fructo-oligosaccharide and galacto-oligosaccharide; polysaccharides
such as xylose and arabinose; disaccharides, such as sucrose,
maltose, and turanose; cellulosic material, such as methyl
cellulose and sodium carboxymethyl cellulose; saturated or
unsaturated fatty acid esters, such as succinate, lactate, and
acetate; alcohols, such as methanol, ethanol, propanol, or mixtures
thereof. The carbon source can also be a product of photosynthesis,
including, but not limited to, glucose. A preferred carbon source
is biomass. Another preferred carbon source is glucose.
[0078] As used herein, a "cloud point lowering additive" is an
additive added to a composition to decrease or lower the cloud
point of a solution.
[0079] As used herein, the phrase "cloud point of a fluid" means
the temperature at which dissolved solids are no longer completely
soluble. Below this temperature, solids begin precipitating as a
second phase giving the fluid a cloudy appearance. In the petroleum
industry, cloud point refers to the temperature below which a
solidified material or other heavy hydrocarbon crystallizes in a
crude oil, refined oil, or fuel to form a cloudy appearance. The
presence of solidified materials influences the flowing behavior of
the fluid, the tendency of the fluid to clog fuel filters,
injectors, etc., the accumulation of solidified materials on cold
surfaces (e.g., a pipeline or heat exchanger fouling), and the
emulsion characteristics of the fluid with water.
[0080] A nucleotide sequence is "complementary" to another
nucleotide sequence if each of the bases of the two sequences
matches (e.g., is capable of forming Watson Crick base pairs). The
term "complementary strand" is used herein interchangeably with the
term "complement". The complement of a nucleic acid strand can be
the complement of a coding strand or the complement of a non-coding
strand.
[0081] The terms "comprising," "having," "including," and
"containing" are to be construed as open-ended terms (e.g., meaning
"including, but not limited to,") unless otherwise noted.
[0082] As used herein, the term "conditions sufficient to allow
expression" means any conditions that allow a host cell to produce
a desired product, such as a polypeptide, aldehyde, or alkane
described herein. Suitable conditions include, for example,
fermentation conditions. Fermentation conditions can comprise many
parameters, such as temperature ranges, levels of aeration, and
media composition. Each of these conditions, individually and in
combination, allows the host cell to grow. Exemplary culture media
include broths or gels. Generally, the medium includes a carbon
source, such as glucose, fructose, cellulose, or the like, that can
be metabolized by a host cell directly. In addition, enzymes can be
used in the medium to facilitate the mobilization (e.g., the
depolymerization of starch or cellulose to fermentable sugars) and
subsequent metabolism of the carbon source.
[0083] To determine if conditions are sufficient to allow
expression, a host cell can be cultured, for example, for about 4,
8, 12, 24, 36, or 48 hours. During and/or after culturing, samples
can be obtained and analyzed to determine if the conditions allow
expression. For example, the host cells in the sample or the medium
in which the host cells were grown can be tested for the presence
of a desired product. When testing for the presence of a product,
assays, such as, but not limited to, TLC, HPLC, GC/FID, GC/MS,
LC/MS, MS, can be used.
[0084] It is understood that the polypeptides described herein may
have additional conservative or non-essential amino acid
substitutions, which do not have a substantial effect on the
polypeptide functions. Whether or not a particular substitution
will be tolerated (e.g., will not adversely affect desired
biological properties, such as decarboxylase activity) can be
determined as described in Bowie et al., Science (1990) 247:1306
1310. A "conservative amino acid substitution" is one in which the
amino acid residue is replaced with an amino acid residue having a
similar side chain. Families of amino acid residues having similar
side chains have been defined in the art. These families include
amino acids with basic side chains (e.g., lysine, arginine, and
histidine), acidic side chains (e.g., aspartic acid and glutamic
acid), uncharged polar side chains (e.g., glycine, asparagine,
glutamine, serine, threonine, tyrosine, and cysteine), nonpolar
side chains (e.g., alanine, valine, leucine, isoleucine, proline,
phenylalanine, methionine, and tryptophan), beta-branched side
chains (e.g., threonine, valine, and isoleucine), and aromatic side
chains (e.g., tyrosine, phenylalanine, tryptophan, and
histidine).
[0085] As used herein, "conditions that permit product production"
refers to any fermentation conditions that allow a production host
to produce a desired product, such as acyl-CoA or fatty acid
derivatives (e.g., fatty acids, hydrocarbons, fatty alcohols,
waxes, or fatty esters). Fermentation conditions usually comprise
many parameters. Exemplary conditions include, but are not limited
to, temperature ranges, levels of aeration, and media composition.
Each of these conditions, individually and/or in combination,
allows the production host to grow.
[0086] Exemplary media include broths and/or gels. Generally, a
suitable medium includes a carbon source (e.g., glucose, fructose,
cellulose, etc.) that can be metabolized by the microorganism
directly. In addition, enzymes can be used in the medium to
facilitate the mobilization (e.g., the depolymerization of starch
or cellulose to fermentable sugars) and subsequent metabolism of
the carbon source.
[0087] To determine if the fermentation conditions permit product
production, the production host can be cultured for about 4, 8, 12,
24, 36, 48 or 96 hours. During culturing or after culturing,
samples can be obtained and analyzed to determine if the
fermentation conditions have permitted product production. For
example, the production hosts in the sample or the medium in which
the production hosts are grown can be tested for the presence of
the desired product. Exemplary assays, such as TLC, HPLC, GC/FID,
GC/MS, LC/MS, MS, as well as those provided herein, can be used
identify and quantify the presence of a product.
[0088] As used herein, "control element" means a transcriptional
control element. Control elements include promoters and enhancers.
The term "promoter element," "promoter," or "promoter sequence"
refers to a DNA sequence that functions as a switch that activates
the expression of a gene. If the gene is activated, it is said to
be transcribed or participating in transcription. Transcription
involves the synthesis of mRNA from the gene. A promoter,
therefore, serves as a transcriptional regulatory element and also
provides a site for initiation of transcription of the gene into
mRNA. Control elements interact specifically with cellular proteins
involved in transcription (Maniatis et al., Science 236:1237
(1987)).
[0089] As used herein, the term "deletion," or "knockout" means
modifying or inactivating a polynucleotide sequence that encodes a
target protein in order to reduce or eliminate the function of the
target protein. A polynucleotide deletion can be performed by
methods well known in the art (See, e.g., Datsenko et al., Proc.
Nat. Acad. Sci. USA, 97:6640-45 (2000), or International Patent
Application Nos. PCT/US2007/011923 and PCT/US2008/058788).
[0090] As used herein, the term "endogenous" means a polynucleotide
that is in the cell and was not introduced into the cell using
recombinant genetic engineering techniques. For example, a gene
that was present in the cell when the cell was originally isolated
from nature. A polynucleotide is still considered endogenous if the
control sequences, such as a promoter or enhancer sequences which
activate transcription or translation, have been altered through
recombinant techniques.
[0091] As used herein, the term "ester synthase" means a peptide
capable of producing fatty esters. More specifically, an ester
synthase is a peptide which converts a thioester to a fatty ester.
In a preferred embodiment, the ester synthase converts a thioester
(e.g., acyl-CoA) to a fatty ester.
[0092] In an alternate embodiment, an ester synthase uses a
thioester and an alcohol as substrates to produce a fatty ester.
Ester synthases are capable of using short and long chain
thioesters as substrates. In addition, ester synthases are capable
of using short and long chain alcohols as substrates.
[0093] Non-limiting examples of ester synthases are wax synthases,
wax-ester synthases, acyl CoA:alcohol transacylases,
acyltransferases, and fatty acyl-coenzyme A:fatty alcohol
acyltransferases. Exemplary ester synthases are classified in
enzyme classification number EC 2.3.1.75. A number of these
enzymes, as well as other useful enzymes for making the products
described herein, have been disclosed in, for example,
International Patent Application Nos. PCT/US2007/011923 and
PCT/US2008/058788, which are incorporated herein by reference.
[0094] As used herein, the term "fatty acid" means a carboxylic
acid having the formula RCOOH. R represents an aliphatic group,
preferably an alkyl group. R can comprise between about 4 and about
22 carbon atoms. Fatty acids can be saturated, monounsaturated, or
polyunsaturated. In a preferred embodiment, the fatty acid is made
from a fatty acid biosynthetic pathway.
[0095] As used herein, the term "fatty acid biosynthetic pathway"
means a biosynthetic pathway that produces fatty acids. The fatty
acid biosynthetic pathway includes fatty acid enzymes that can be
engineered, as described herein, to produce fatty acids, and in
some embodiments can be expressed with additional enzymes to
produce fatty acids having desired carbon chain
characteristics.
[0096] As used herein, the term "fatty acid degradation enzyme"
means an enzyme involved in the breakdown or conversion of a fatty
acid or fatty acid derivative into another product. A non-limiting
example of a fatty acid degradation enzyme is an acyl-CoA synthase.
A number of these enzymes, as well as other useful enzymes for
making the products described herein, have been disclosed in, for
example, International Patent Application Nos. PCT/US2007/011923
and PCT/US2008/058788, which are incorporated herein by reference.
Additional examples of fatty acid degradation enzymes are described
herein.
[0097] As used herein, the term "fatty acid derivative" means
products made in part from the fatty acid biosynthetic pathway of
the production host organism. "Fatty acid derivative" also includes
products made in part from acyl-ACP or acyl-ACP derivatives. The
fatty acid biosynthetic pathway includes fatty acid synthase
enzymes which can be engineered as described herein to produce
fatty acid derivatives, and in some examples can be expressed with
additional enzymes to produce fatty acid derivatives having desired
carbon chain characteristics. Exemplary fatty acid derivatives
include for example, fatty acids, acyl-CoAs, fatty aldehydes, short
and long chain alcohols, hydrocarbons, fatty alcohols, ketones, and
esters (e.g., waxes, fatty acid esters, or fatty esters).
[0098] As used herein, the term "fatty acid derivative enzymes"
means all enzymes that may be expressed or overexpressed in the
production of fatty acid derivatives. These enzymes are
collectively referred to herein as fatty acid derivative enzymes.
These enzymes may be part of the fatty acid biosynthetic pathway.
Non-limiting examples of fatty acid derivative enzymes include
fatty acid synthases, thioesterases, acyl-CoA synthases, acyl-CoA
reductases, alcohol dehydrogenases, alcohol acyltransferases,
carboxylic acid reductases, fatty alcohol-forming acyl-CoA
reductase, ester synthases, aldehyde biosynthetic polypeptides, and
alkane biosynthetic polypeptides. Fatty acid derivative enzymes
convert a substrate into a fatty acid derivative. In some examples,
the substrate may be a fatty acid derivative which the fatty acid
derivative enzyme converts into a different fatty acid derivative.
A number of these enzymes, as well as other useful enzymes for
making the products described herein, have been disclosed in, for
example, International Patent Application Nos. PCT/US2007/011923
and PCT/US2008/058788, which are incorporated herein by
reference.
[0099] As used herein, "fatty acid enzyme" means any enzyme
involved in fatty acid biosynthesis. Fatty acid enzymes can be
expressed or overexpressed in host cells to produce fatty acids.
Non-limiting examples of fatty acid enzymes include fatty acid
synthases and thioesterases. A number of these enzymes, as well as
other useful enzymes for making the products described herein, have
been disclosed in, for example, International Patent Application
Nos. PCT/US2007/011923 and PCT/US2008/058788, which are
incorporated herein by reference.
[0100] As used herein, the term "fatty ester" means an ester. In a
preferred embodiment, a fatty ester is any ester made from a fatty
acid to produce, for example, a fatty acid ester. In one
embodiment, a fatty ester contains an A side (i.e., the carbon
chain attached to the carboxylate oxygen) and a B side (i.e., the
carbon chain comprising the parent carboxylate). In a preferred
embodiment, when the fatty ester is derived from the fatty acid
biosynthetic pathway, the A side is contributed by an alcohol, and
the B side is contributed by a fatty acid. Any alcohol can be used
to form the A side of the fatty esters. For example, the alcohol
can be derived from the fatty acid biosynthetic pathway.
Alternatively, the alcohol can be produced through non-fatty acid
biosynthetic pathways. Moreover, the alcohol can be provided
exogenously. For example, the alcohol can be supplied in the
fermentation broth in instances where the fatty ester is produced
by an organism that can also produce the fatty acid. Alternatively,
a carboxylic acid, such as a fatty acid or acetic acid, can be
supplied exogenously in instances where the fatty ester is produced
by an organism that can also produce alcohol.
[0101] The carbon chains comprising the A side or B side can be of
any length. In one embodiment, the A side of the ester is at least
about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 12, 14, 16, 18 or 20 carbons
in length. The B side of the ester is at least about 4, 6, 8, 10,
12, 14, 16, 18, 20, 22, 24, or 26 carbons in length. The A side
and/or the B side can be straight or branched chain. The branched
chains may have one or more points of branching. In addition, the
branched chains may include cyclic branches. Furthermore, the A
side and/or B side can be saturated or unsaturated. If unsaturated,
the A side and/or B side can have one or more points of
unsaturation.
[0102] In one embodiment, the fatty ester is produced
biosynthetically. In this embodiment, first the fatty acid is
"activated." Non-limiting examples of "activated" fatty acids are
acyl-CoA, acyl-ACP, and acyl phosphate. Acyl-CoA can be a direct
product of fatty acid biosynthesis or degradation. In addition,
acyl-CoA can be synthesized from a free fatty acid, a CoA, or an
adenosine nucleotide triphosphate (ATP). An example of an enzyme
which produces acyl-CoA is acyl-CoA synthase.
[0103] After the fatty acid is activated, it can be readily
transferred to a recipient nucleophile. Exemplary nucleophiles are
alcohols, thiols, or phosphates.
[0104] In one embodiment, the fatty ester is a wax. The wax can be
derived from a long chain alcohol and a long chain fatty acid. In
another embodiment, the fatty ester can be derived from a fatty
acyl-thioester and an alcohol. In another embodiment, the fatty
ester is a fatty acid thioester, for example fatty acyl Coenzyme A
(CoA). In other embodiments, the fatty ester is a fatty acyl
panthothenate, an acyl carrier protein (ACP), or a fatty phosphate
ester. Fatty esters have many uses. For example, fatty esters can
be used as biofuels, surfactants, or formulated into additives that
provide lubrication and other benefits to fuels and industrial
chemicals.
[0105] As used herein, "fraction of modern carbon" or "f.sub.M" has
the same meaning as defined by National Institute of Standards and
Technology (NIST) Standard Reference Materials (SRMs) 4990B and
4990C, known as oxalic acids standards HOxI and HOxII,
respectively. The fundamental definition relates to 0.95 times the
.sup.14C/.sup.12C isotope ratio HOxI (referenced to AD 1950). This
is roughly equivalent to decay-corrected pre-Industrial Revolution
wood. For the current living biosphere (e.g., plant material),
f.sub.M is approximately 1.1.
[0106] The genes "fhuA" and "tonA", which encode a ferrichrome
outer membrane transporter (GenBank Accession No. NP.sub.--414692),
are used interchangeably herein.
[0107] "Gene knockout", as used herein, refers to a procedure by
which a gene encoding a target protein is modified or inactivated
so to reduce or eliminate the function of the intact protein.
Inactivation of the gene may be performed by general methods such
as mutagenesis by UV irradiation or treatment with
N-methyl-N'-nitro-N-nitrosoguanidine, site-directed mutagenesis,
homologous recombination, insertion-deletion mutagenesis, or
"Red-driven integration" (Datsenko et al., Proc. Natl. Acad. Sci.
USA, 97:6640-45 (2000)). For example, in one embodiment, a
construct is introduced into a host cell, such that it is possible
to select for homologous recombination events in the host cell. One
of skill in the art can readily design a knock-out construct
including both positive and negative selection genes for
efficiently selecting transfected cells that undergo a homologous
recombination event with the construct. The alteration in the host
cell may be obtained, for example, by replacing through a single or
double crossover recombination a wild type DNA sequence by a DNA
sequence containing the alteration. For convenient selection of
transformants, the alteration may, for example, be a DNA sequence
encoding an antibiotic resistance marker or a gene complementing a
possible auxotrophy of the host cell. Mutations include, but are
not limited to, deletion-insertion mutations. An example of such an
alteration includes a gene disruption, i.e., a perturbation of a
gene such that the product that is normally produced from this gene
is not produced in a functional form. This could be due to a
complete deletion, a deletion and insertion of a selective marker,
an insertion of a selective marker, a frameshift mutation, an
in-frame deletion, or a point mutation that leads to premature
termination. In some instances, the entire mRNA for the gene is
absent. In other situations, the amount of mRNA produced
varies.
[0108] Calculations of "homology" between two sequences can be
performed as follows. The sequences are aligned for optimal
comparison purposes (e.g., gaps can be introduced in one or both of
a first and a second amino acid or nucleic acid sequence for
optimal alignment and non-homologous sequences can be disregarded
for comparison purposes). In a preferred embodiment, the length of
a reference sequence that is aligned for comparison purposes is at
least about 30%, preferably at least about 40%, more preferably at
least about 50%, even more preferably at least about 60%, and even
more preferably at least about 70%, at least about 80%, at least
about 90%, or about 100% of the length of the reference sequence.
The amino acid residues or nucleotides at corresponding amino acid
positions or nucleotide positions are then compared. When a
position in the first sequence is occupied by the same amino acid
residue or nucleotide as the corresponding position in the second
sequence, then the molecules are identical at that position (as
used herein, amino acid or nucleic acid "identity" is equivalent to
amino acid or nucleic acid "homology"). The percent identity
between the two sequences is a function of the number of identical
positions shared by the sequences, taking into account the number
of gaps and the length of each gap, which need to be introduced for
optimal alignment of the two sequences.
[0109] The comparison of sequences and determination of percent
homology between two sequences can be accomplished using a
mathematical algorithm. In a preferred embodiment, the percent
homology between two amino acid sequences is determined using the
Needleman and Wunsch, J. Mol. Biol. 48:444 453 (1970), algorithm
that has been incorporated into the GAP program in the GCG software
package, using either a Blossum 62 matrix or a PAM250 matrix, and a
gap weight of 16, 14, 12, 10, 8, 6, or 4 and a length weight of 1,
2, 3, 4, 5, or 6. In yet another preferred embodiment, the percent
homology between two nucleotide sequences is determined using the
GAP program in the GCG software package, using a NWSgapdna. CMP
matrix and a gap weight of about 40, 50, 60, 70, or 80 and a length
weight of about 1, 2, 3, 4, 5, or 6. A particularly preferred set
of parameters (and the one that should be used if the practitioner
is uncertain about which parameters should be applied to determine
if a molecule is within a homology limitation of the claims) are a
Blossum 62 scoring matrix with a gap penalty of 12, a gap extend
penalty of 4, and a frameshift gap penalty of 5.
[0110] Other methods for aligning sequences for comparison are well
known in the art. Various programs and alignment algorithms are
described in, for example, Smith & Waterman, Adv. Appl. Math.
2:482 (1981); Pearson & Lipman, Proc. Natl. Acad. Sci. USA
85:2444 (1988); Higgins & Sharp, Gene 73:237 244 (1988);
Higgins & Sharp, CABIOS 5:151-153 (1989); Corpet et al.,
Nucleic Acids Research 16:10881-10890 (1988); Huang et al., CABIOS
8:155-165, 1992; and Pearson et al., Methods in Molecular Biology
24:307-331, 1994. and Altschul et al., J. Mol. Biol. 215:403-410,
1990.
[0111] As used herein, a "host cell" is a cell used to produce a
product described herein (e.g., an aldehyde or alkane). A host cell
can be modified to express or overexpress selected genes or to have
attenuated expression of selected genes. Non-limiting examples of
host cells include plant, animal, human, bacteria, cyanobacteria,
yeast, or filamentous fungi cells.
[0112] As used herein, the term "hybridizes under low stringency,
medium stringency, high stringency, or very high stringency
conditions" describes conditions for hybridization and washing.
Guidance for performing hybridization reactions can be found, for
example, in Current Protocols in Molecular Biology, John Wiley
& Sons, N.Y. (1989), 6.3.1-6.3.6. Aqueous and nonaqueous
methods are described in that reference and either method can be
used. Specific hybridization conditions referred to herein are as
follows: 1) low stringency hybridization conditions in 6.times.
sodium chloride/sodium citrate (SSC) at about 45.degree. C.,
followed by two washes in 0.2.times.SSC, 0.1% SDS at least at
50.degree. C. (the temperature of the washes can be increased to
55.degree. C. for low stringency conditions); 2) medium stringency
hybridization conditions in 6.times.SSC at about 45.degree. C.,
followed by one or more washes in 0.2.times.SSC, 0.1% SDS at
60.degree. C.; 3) high stringency hybridization conditions in
6.times.SSC at about 45.degree. C., followed by one or more washes
in 0.2.times.SSC, 0.1% SDS at 65.degree. C.; and preferably 4) very
high stringency hybridization conditions are 0.5M sodium phosphate,
7% SDS at 65.degree. C., followed by one or more washes at
0.2.times.SSC, 1% SDS at 65.degree. C. Very high stringency
conditions (4) are the preferred conditions unless otherwise
specified.
[0113] The term "isolated" as used herein with respect to nucleic
acids, such as DNA or RNA, refers to molecules separated from other
DNAs or RNAs, respectively, that are present in the natural source
of the nucleic acid. Moreover, an "isolated nucleic acid" includes
nucleic acid fragments, such as fragments that are not naturally
occurring. The term "isolated" is also used herein to refer to
polypeptides, which are isolated from other cellular proteins, and
encompasses both purified endogenous polypeptides and recombinant
polypeptides. The term "isolated" as used herein also refers to a
nucleic acid or polypeptide that is substantially free of cellular
material, viral material, or culture medium when produced by
recombinant DNA techniques. The term "isolated" as used herein also
refers to a nucleic acid or polypeptide that is substantially free
of chemical precursors or other chemicals when chemically
synthesized.
[0114] As used herein, the "level of expression of a gene in a
cell" refers to the level of mRNA, pre-mRNA nascent transcript(s),
transcript processing intermediates, mature mRNA(s), and/or
degradation products encoded by the gene in the cell.
[0115] As used herein, the term "microorganism" means prokaryotic
and eukaryotic microbial species from the domains Archaea, Bacteria
and Eucarya, the latter including yeast and filamentous fungi,
protozoa, algae, or higher Protista. The term "microbial cell", as
used herein, means a cell from a microorganism.
[0116] As used herein, the term "nucleic acid" refers to a
polynucleotide, such as deoxyribonucleic acid (DNA), and, where
appropriate, ribonucleic acid (RNA). The term also includes analogs
of either RNA or DNA made from nucleotide analogs, and, as
applicable to the embodiment being described, single (sense or
antisense) and double-stranded polynucleotides, ESTs, chromosomes,
cDNAs, mRNAs, and rRNAs. The term "nucleic acid" may be used
interchangeably with "polynucleotide," "DNA," "nucleic acid
molecule," "nucleotide sequence," and/or "gene" unless otherwise
indicated herein or otherwise clearly contradicted by context.
[0117] As used herein, the term "operably linked" means that a
selected nucleotide sequence (e.g., encoding a polypeptide
described herein) is in proximity with a promoter to allow the
promoter to regulate expression of the selected nucleotide
sequence. In addition, the promoter is located upstream of the
selected nucleotide sequence in terms of the direction of
transcription and translation. By "operably linked" is meant that a
nucleotide sequence and a regulatory sequence(s) are connected in
such a way as to permit gene expression when the appropriate
molecules (e.g., transcriptional activator proteins) are bound to
the regulatory sequence(s).
[0118] The term "or" is used herein to mean, and is used
interchangeably with, the term "and/or," unless context clearly
indicates otherwise.
[0119] As used herein, "overexpress" means to express or cause to
be expressed or produced a nucleic acid, polypeptide, or
hydrocarbon in a cell at a greater concentration than is normally
expressed in a corresponding wild-type cell. For example, a
polypeptide can be "overexpressed" in a recombinant host cell when
the polypeptide is present in a greater concentration in the
recombinant host cell compared to its concentration in a
non-recombinant host cell of the same species.
[0120] As used herein, "partition coefficient" or "P," is defined
as the equilibrium concentration of a compound in an organic phase
divided by the concentration at equilibrium in an aqueous phase
(e.g., fermentation broth). In one embodiment of a bi-phasic system
described herein, the organic phase is formed by the aldehyde or
alkane during the production process. However, in some examples, an
organic phase can be provided, such as by providing a layer of
octane, to facilitate product separation. When describing a two
phase system, the partition characteristics of a compound can be
described as logP. For example, a compound with a logP of 1 would
partition 10:1 to the organic phase. A compound with a logP of -1
would partition 1:10 to the organic phase. By choosing an
appropriate fermentation broth and organic phase, an organic fatty
acid derivative or product with a high logP value can separate into
the organic phase even at very low concentrations in the
fermentation vessel.
[0121] As used herein, the term "polypeptide" may be used
interchangeably with "protein," "peptide," and/or "enzyme" unless
otherwise indicated herein or otherwise clearly contradicted by
context.
[0122] As used herein, the term "production host" means a cell used
to produce the products disclosed herein. The production host is
modified to express, overexpress, attenuate or delete expression of
selected polynucleotides. Non-limiting examples of production hosts
include plant, algal, animal, human, bacteria, yeast, and
filamentous fungi cells.
[0123] As used herein, the term "purify," "purified," or
"purification" means the removal or isolation of a molecule from
its environment by, for example, isolation or separation.
"Substantially purified" molecules are at least about 60% free,
preferably at least about 75% free, and more preferably at least
about 90% free from other components with which they are
associated. As used herein, these terms also refer to the removal
of contaminants from a sample. For example, the removal of
contaminants can result in an increase in the percentage of a fatty
acid derivative or product in a sample. For example, when a fatty
acid derivatives or products are produced in a host cell, the fatty
acid derivatives or products can be purified by the removal of host
cell proteins. After purification, the percentage of fatty acid
derivatives or products in the sample is increased.
[0124] The terms "purify," "purified," and "purification" do not
require absolute purity. They are relative terms. Thus, for
example, when the fatty acid derivatives or products are produced
in host cells, a purified fatty acid derivative or product is one
that is substantially separated from other cellular components
(e.g., nucleic acids, polypeptides, lipids, carbohydrates, or other
fatty acid derivatives or products). In another example, a purified
fatty acid derivative or purified product preparation is one in
which the fatty acid derivative or product is substantially free
from contaminants, such as those that might be present following
fermentation. In some embodiments, a fatty acid derivative or
product is purified when at least about 50% by weight of a sample
is composed of the fatty acid derivative or product. In other
embodiments, a fatty acid derivative or product is purified when at
least about 60%, 70%, 80%, 85%, 90%, 92%, 95%, 98%, or 99% or more
by weight of a sample is composed of the fatty acid derivative or
product.
[0125] As used herein, the term "recombinant polypeptide" refers to
a polypeptide that is produced by recombinant DNA techniques,
wherein generally DNA encoding the expressed polypeptide or RNA is
inserted into a suitable expression vector and that is in turn used
to transform a host cell to produce the polypeptide or RNA.
[0126] As used herein, the term "substantially identical" (or
"substantially homologous") is used to refer to a first amino acid
or nucleotide sequence that contains a sufficient number of
identical or equivalent (e.g., with a similar side chain) amino
acid residues (e.g., conserved amino acid substitutions) or
nucleotides to a second amino acid or nucleotide sequence such that
the first and second amino acid or nucleotide sequences have
similar activities.
[0127] As used herein, the term "synthase" means an enzyme which
catalyzes a synthesis process. As used herein, the term synthase
includes synthases, synthetases, and ligases.
[0128] As used herein, the term "transfection" means the
introduction of a nucleic acid (e.g., via an expression vector)
into a recipient cell by nucleic acid-mediated gene transfer.
[0129] As used herein, the term "transformation" refers to a
process in which a cell's genotype is changed as a result of the
cellular uptake of exogenous nucleic acid. This may result in the
transformed cell expressing a recombinant form of a RNA or
polypeptide. In the case of antisense expression from the
transferred gene, the expression of a naturally-occurring form of
the polypeptide is disrupted.
[0130] As used herein, the term "transport protein" means a
polypeptide that facilitates the movement of one or more compounds
in and/or out of a cellular organelle and/or a cell. A number of
these proteins, as well as other useful proteins for making the
products described herein, have been disclosed in, for example,
International Patent Application Nos. PCT/US2007/011923 and
PCT/US2008/058788, which are incorporated herein by reference.
[0131] As used herein, a "variant" of polypeptide X refers to a
polypeptide having the amino acid sequence of polypeptide X in
which one or more amino acid residues is altered. The variant may
have conservative changes or non-conservative changes. Guidance in
determining which amino acid residues may be substituted, inserted,
or deleted without affecting biological activity may be found using
computer programs well known in the art, for example, LASERGENE
software (DNASTAR).
[0132] The term "variant," when used in the context of a
polynucleotide sequence, may encompass a polynucleotide sequence
related to that of a gene or the coding sequence thereof. This
definition may also include, for example, "allelic," "splice,"
"species," or "polymorphic" variants. A splice variant may have
significant identity to a reference polynucleotide, but will
generally have a greater or fewer number of polynucleotides due to
alternative splicing of exons during mRNA processing. The
corresponding polypeptide may possess additional functional domains
or an absence of domains. Species variants are polynucleotide
sequences that vary from one species to another. The resulting
polypeptides generally will have significant amino acid identity
relative to each other. A polymorphic variant is a variation in the
polynucleotide sequence of a particular gene between individuals of
a given species.
[0133] As used herein, the term "vector" refers to a nucleic acid
molecule capable of transporting another nucleic acid to which it
has been linked. One type of useful vector is an episome (i.e., a
nucleic acid capable of extra-chromosomal replication). Useful
vectors are those capable of autonomous replication and/or
expression of nucleic acids to which they are linked. Vectors
capable of directing the expression of genes to which they are
operatively linked are referred to herein as "expression vectors".
In general, expression vectors of utility in recombinant DNA
techniques are often in the form of "plasmids," which refer
generally to circular double stranded DNA loops that, in their
vector form, are not bound to the chromosome. In the present
specification, "plasmid" and "vector" are used interchangeably, as
the plasmid is the most commonly used form of vector. However, also
included are such other forms of expression vectors that serve
equivalent functions and that become known in the art subsequently
hereto.
[0134] As used herein, the term "wax" means a composition comprised
of fatty esters. In a preferred embodiment, the fatty ester in the
wax is comprised of medium to long carbon chains. In addition to
fatty esters, a wax may comprise other components (e.g.,
hydrocarbons, sterol esters, aliphatic aldehydes, alcohols,
ketones, beta-diketones, triacylglycerols, etc.).
[0135] Throughout the specification, a reference may be made using
an abbreviated gene name or polypeptide name, but it is understood
that such an abbreviated gene or polypeptide name represents the
genus of genes or polypeptides. Such gene names include all genes
encoding the same polypeptide and homologous polypeptides having
the same physiological function. Polypeptide names include all
polypeptides that have the same activity (e.g., that catalyze the
same fundamental chemical reaction).
[0136] Unless otherwise indicated, the accession numbers referenced
herein are derived from the NCBI database (National Center for
Biotechnology Information) maintained by the National Institute of
Health, U.S.A. Unless otherwise indicated, the accession numbers
are as provided in the database as of October 2009.
[0137] EC numbers are established by the Nomenclature Committee of
the International Union of Biochemistry and Molecular Biology
(NC-IUBMB) (available at http://www.chem.qmul.ac.uk/iubmb/enzyme/).
The EC numbers referenced herein are derived from the KEGG Ligand
database, maintained by the Kyoto Encyclopedia of Genes and
Genomics, sponsored in part by the University of Tokyo. Unless
otherwise indicated, the EC numbers are as provided in the database
as of October 2009.
[0138] Unless otherwise defined, all technical and scientific terms
used herein have the same meaning as commonly understood by one of
ordinary skill in the art to which this invention belongs.
[0139] Although methods and materials similar or equivalent to
those described herein can be used in the practice or testing of
the present invention, suitable methods and materials are described
below. All methods described herein can be performed in any
suitable order unless otherwise indicated herein or otherwise
clearly contradicted by context.
[0140] Unless otherwise stated, amounts listed in percentage (%)
are in weight percent, based on the total weight of the
composition.
[0141] All publications, patent applications, patents, and other
references mentioned herein are incorporated by reference in their
entirety. In case of conflict, the present specification, including
definitions, will control. In addition, the materials, methods, and
examples are illustrative only and not intended to be limiting.
[0142] Recitation of ranges of values herein are merely intended to
serve as a shorthand method of referring individually to each
separate value falling within the range, unless otherwise indicated
herein, and each separate value is incorporated into the
specification as if it were individually recited herein.
[0143] The use of any and all examples, or exemplary language
(e.g., "such as") provided herein, is intended merely to better
illuminate the invention and does not pose a limitation on the
scope of the invention unless otherwise claimed. No language in the
specification should be construed as indicating any non-claimed
element as essential to the practice of the invention.
[0144] Other features and advantages of the invention will be
apparent from the following detailed description and from the
claims.
Fatty Esters
[0145] This disclosure relates to the production of fatty esters,
such as fatty acid esters including, for example fatty acid methyl
esters ("FAME") and fatty acid ethyl esters ("FAEE"), in host
cells. In particular embodiments, the methods described herein are
used to produce fatty acid methyl esters, which can be used in
biodiesels.
[0146] Fatty esters produced by the methods described herein are
not limited to esters of any particular length or other
characteristics. For example, a microorganism can be genetically
engineered to produce any of the fatty esters described in Knothe,
Fuel Processing Technology 86:1059-1070 (2005), using the teachings
provided herein. Such fatty esters can be characterized, for
example, by cetane number (CN), viscosity, melting point, and heat
of combustion, as described by Knothe.
[0147] Fatty esters that are produced in accordance with the
methods, cells, or microorganisms herein comprise, consist
essentially of, or consist of the following formula: BCOOA, having
an A side and a B side, where the "A side" refers to the carbon
chain attached to the carboxylate oxygen of the ester, and the "B
side" refers to the carbon chain comprising the parent carboxylate
of the ester. The A side is contributed by an alcohol, such as a
fatty alcohol, and the B side is contributed by an acid, such as a
fatty acid. B is an aliphatic group. In some embodiments, B is a
carbon chain. In some embodiments, B comprises a carbon chain that
is at least 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18,
19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, or 30 carbons in
length. A comprises at least one carbon atom. In some embodiments,
A is an aliphatic group. In some embodiments, A is an alkyl group.
In some embodiments, the alkyl group comprises, consists
essentially of, or consists of 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11,
12, 13, 14, 15, 16, 17, 18, 19, or 20 carbon atoms. In some
embodiments, any of the above B groups can be combined with any of
the above A groups. In some embodiments, A comprises, consists
essentially of, or consists of a carbon chain having a number of
carbons selected from the group consisting of 1, 2, 3, 4, and 5
carbon atoms, while B comprises, consists essentially of, or
consists of at least 12, 13, 14, 15, 16, 17, 18, 19, or 20 carbon
atoms.
[0148] In some embodiments, the fatty esters of the invention
comprise a plurality of individual fatty esters. In some
embodiments, the methods described herein permit production of a
plurality of fatty esters of varied length. In some embodiments,
the fatty ester product comprises saturated or unsaturated fatty
esters product(s) having a carbon atom content limited to between 5
and 25 carbon atoms. In other words, the invention provides a
composition comprising C.sub.5-C.sub.25 fatty esters (e.g.,
C.sub.10-C.sub.20 fatty esters, or C.sub.12-C.sub.18 fatty
esters).
[0149] In some embodiments, the fatty esters comprise one or more
fatty esters having a double bond at one or more points in the
carbon chain. Thus, in some embodiments, a 6-, 7-, 8-, 9-, 10-,
11-, 12-, 13-, 14-, 15-, 16-, 17-, 18-, 19-, 20-, 21-, 22-, 23-,
24-, 25-, 26-, 27-, 28-, 29-, or 30-carbon chain can have 1, 2, 3,
4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21,
22, 23, or 24 double bonds, and 1-24 of the aforesaid double bonds
can be located following carbon 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11,
12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28,
or 29. In some embodiments, a 1-, 2-, 3-, 4-, or 5-carbon chain for
A can have 1, 2, 3, or 4 double bonds and 1-4 of the double bonds
can be located following carbon 1, 2, 3, or 4. In some embodiments,
any of the above A groups can be combined with any of the above B
groups.
[0150] In certain preferred embodiments, the B group can have 12,
13, 14, 15, 16, 17, 18 carbon atoms in a chain. In other
embodiments, the A group can have one or two carbon atoms.
[0151] In some preferred embodiments, the B group can have one
double bond at one or more points in the carbon chain. In more
preferred embodiments, the B group can have one double bond at
position 7 of the carbon chain, numbering from the reduced end of
the carbon chain. One of ordinary skill in the art will recognize
that one end of the B group will have a methyl group, and the other
end of the B group will have a carboxyl group (C(.dbd.O)O--). The
end of the B group which is a methyl group is the reduced end of
the carbon chain comprising the B group, thus, the double bond is
at carbon 7 counting from the methyl group terminus of the B group
(e.g., at between carbons 7 and 8 of the B group). The double bond
can have any geometry, thus, the double bond in the B group can be
cis or trans.
[0152] In some embodiments, the fatty esters comprise straight
chain fatty esters. In some embodiments, the fatty esters comprise
branched chain fatty esters. In some embodiments, the fatty esters
comprise cyclic moieties.
[0153] In certain preferred embodiments, the fatty esters can be
selected from the group consisting of methyl dodecanoate, methyl
5-dodecenoate, methyl tetradecanoate, methyl 7-tetradecenoate,
methyl hexadecanoate, methyl 9-hexadecenoate, methyl octadecanoate,
methyl 11-octadecenoate, and combinations thereof.
[0154] In some embodiments, the fatty ester composition comprises
about 5 wt. % or more methyl dedecanoate. In some embodiments, the
fatty ester composition comprises about 25% or more methyl
dedecanoate. In some embodiments, the fatty ester composition
comprises about 5 wt. % to about 25 wt. % methyl dodecanoate.
[0155] In some embodiments, the fatty ester composition comprises
about 10 wt. % or less methyl dodec-7-enoate. In some embodiments,
the fatty ester composition comprises about 0 wt. % to about 10 wt.
% methyl dodec-7-enoate.
[0156] In some embodiments, the fatty ester composition comprises
about 30 wt. % or more methyl tetradecanoate. In some embodiments,
the fatty ester composition comprises about 50 wt. % or less methyl
tetradecanoate. In some embodiments, the fatty ester composition
comprises about 30 wt. % to about 50 wt. % methyl
tetradecanoate.
[0157] In some embodiments, the fatty ester composition comprises
about 10 wt. % or less methyl tetradec-7-enoate. In some
embodiments, the fatty ester composition comprises about 0 wt. % to
about 10 wt. % methyl tetradec-7-enoate.
[0158] In some embodiments, the fatty ester composition comprises
about 15 wt. % or less methyl hexadecanoate. In some embodiments,
the fatty ester composition comprises about 0 wt. % to about 15 wt.
% methyl hexadecanoate.
[0159] In some embodiments, the fatty ester composition comprises
about 10 wt. % or more methyl hexadec-7-enoate. In some
embodiments, the fatty ester composition comprises about 40 wt. %
or less methyl hexadec-7-enoate. In some embodiments, the fatty
ester composition comprises about 10 wt. % to about 40 wt. % methyl
hexadec-7-enoate.
[0160] In some embodiments, the fatty ester composition comprises
about 15 wt. % or less methyl octadec-7-enoate. In some
embodiments, the fatty ester composition comprises about 0 wt. % to
about 15 wt. % methyl octadec-7-enoate.
[0161] These exemplary fatty ester products, having characteristic
features of A side and/or B side, can be prepared and produced
using substrates having similar or the same features. Accordingly,
each step within a biosynthetic pathway that leads to the
production of a fatty acid derivative can be modified to produce or
overproduce a substrate leading to a fatty alcohol and/or a fatty
acid. For example, known genes involved in the fatty acid
biosynthetic pathway or the fatty alcohol pathway can be expressed,
overexpressed, or attenuated in host cells to produce a desired
substrate (see, e.g., WO2008/119082, the disclosure of which is
incorporated by reference herein).
Synthesis of Substrates
[0162] Fatty acid synthase (FAS) is a group of polypeptides that
catalyze the initiation and elongation of acyl chains (Marrakchi et
al., Biochemical Society, 30:1050-1055 (2002)). The acyl carrier
protein (ACP) along with the enzymes in the FAS pathway control the
length, degree of saturation, and branching of the fatty acid
derivatives produced. The fatty acid biosynthetic pathway involves
the precursors acetyl-CoA and malonyl-CoA. The steps in this
pathway are catalyzed by enzymes of the fatty acid biosynthesis
(fab) and acetyl-CoA carboxylase (acc) gene families (see, e.g.,
Heath et al., Prog. Lipid Res. 40(6):467-97 (2001)).
[0163] Host cells can be engineered to express fatty acid
derivative substrates by recombinantly expressing or overexpressing
one or more fatty acid synthase genes, such as acetyl-CoA and/or
malonyl-CoA synthase genes. For example, to increase acetyl-CoA
production, one or more of the following genes can be expressed in
a host cell: pdh (a multienzyme complex comprising aceEF (which
encodes the E1p dehydrogenase component, the E2p dihydrolipoamide
acyltransferase component of the pyruvate and 2-oxoglutarate
dehydrogenase complexes, and lpd), panK, fabH, fabB, fabD, fabG,
acpP, and fabF. Exemplary GenBank accession numbers for these genes
are: pdh (BAB34380, AAC73227, AAC73226), panK (also known as CoA,
AAC76952), aceEF (AAC73227, AAC73226), fabH (AAC74175), fabB
(P0A953), fabD (AAC74176), fabG (AAC74177), acpP (AAC74178), fabF
(AAC74179). Additionally, the expression levels of fadE, gpsA,
ldhA, pflb, adhE, pta, poxB, ackA, and/or ackB can be attenuated or
knocked-out in an engineered host cell by transformation with
conditionally replicative or non-replicative plasmids containing
null or deletion mutations of the corresponding genes or by
substituting promoter or enhancer sequences. Exemplary GenBank
accession numbers for these genes are: fadE (AAC73325), gspA
(AAC76632), ldhA (AAC74462), pflb (AAC73989), adhE (AAC74323), pta
(AAC75357), poxB (AAC73958), ackA (AAC75356), and ackB (BAB81430).
The resulting host cells will have increased acetyl-CoA production
levels when grown in an appropriate environment.
[0164] Malonyl-CoA overexpression can be affected, for example, by
introducing one or more or all subunits of a four-subunit protein
accABCD (e.g., accession number AAC73296, EC 6.4.1.2) into a host
cell. Fatty acids can be further produced in host cells by
introducing into the host cell a DNA sequence encoding a lipase
(e.g., accession numbers CAA89087, CAA98876).
[0165] In addition, inhibiting PlsB can lead to an increase in the
levels of long chain acyl-ACP, which will inhibit early steps in
the pathway (e.g., the expression of accABCD, fabH, or fabI). The
plsB (e.g., accession number AAC77011) D311E mutation can be used
to increase the amount of available fatty acids.
[0166] In addition, a host cell can be engineered to overexpress a
sfa gene (suppressor of fabA, e.g., accession number AAN79592) to
increase production of monounsaturated fatty acids (Rock et al., J.
Bacteriology 178:5382-5387 (1996)).
[0167] The chain length of a fatty acid derivative substrate can be
selected for by modifying the expression of a thioesterase, which
influences the chain length of fatty acids produced. Hence, host
cells can be engineered to express, overexpress, have attenuated
expression, or not to express one or more selected thioesterases to
increase the production of a preferred fatty acid derivative
substrate. For example, C.sub.10 fatty acids can be produced by
expressing a thioesterase that has a preference for producing
C.sub.10 fatty acids and attenuating thioesterases that have a
preference for producing fatty acids other than C.sub.10 fatty
acids (e.g., a thioesterase which prefers to produce C.sub.14 fatty
acids). This would result in a relatively homogeneous population of
fatty acids that have a carbon chain length of, for example, 10. In
other instances, C.sub.14 fatty acids can be produced by
attenuating endogenous thioesterases that produce non-C.sub.14
fatty acids and expressing the thioesterases that use C.sub.14-ACP.
In some situations, C.sub.12 fatty acids can be produced by
expressing thioesterases that use C.sub.12-ACP, while in parallel,
attenuating thioesterases that produce non-C.sub.12 fatty acids.
Acetyl-CoA, malonyl-CoA, and fatty acid overproduction can be
verified using methods known in the art, for example, by using
radioactive precursors, HPLC, or GC-MS subsequent to cell lysis.
Non-limiting examples of thioesterases that can be used in the
methods described herein are listed in Table 1.
TABLE-US-00001 TABLE 1 Thioesterases Accession Number Source
Organism Gene AAC73596 E. coli tesA without leader sequence
AAC73555 E. coli tesB Q41635, AAA34215 Umbellularia california fatB
AAC49269 Cuphea hookeriana fatB2 Q39513; AAC72881 Cuphea hookeriana
fatB3 Q39473, AAC49151 Cinnamonum camphorum fatB CAA85388
Arabidopsis thaliana fatB [M141T]* NP 189147; NP 193041 Arabidopsis
thaliana fatA CAC39106 Bradyrhiizobium japonicum fatA AAC72883
Cuphea hookeriana fatA AAL79361 Helianthus annus fatA1 *Mayer et
al., BMC Plant Biology 7: 1-11 (2007)
[0168] In other instances, fatty esters are produced in a host cell
that contains a naturally occurring mutation that results in an
increased level of fatty acids in the host cell. In some instances,
the host cell is genetically engineered to increase the level of
fatty acids in the host cell relative to a corresponding wild-type
host cell. For example, the host cell can be genetically engineered
to express a reduced or attenuated level of an acyl-CoA synthase
relative to a corresponding wild-type host cell. In a particular
embodiment, the level of expression of one or more genes (e.g., an
acyl-CoA synthase gene) can be eliminated by genetically
engineering a "knock out" host cell, wherein the one or more genes
are deleted.
[0169] Any known acyl-CoA synthase gene can be reduced or knocked
out in a host cell. Non-limiting examples of acyl-CoA synthase
genes include fadD, fadK, BH3103, yhfL, Pfl-4354, EAV15023, fadD1,
fadD2, RPC.sub.--4074, fadDD35, fadDD22, faa3p or the gene encoding
the protein ZP.sub.--01644857. Specific examples of acyl-CoA
synthase genes include fadDD35 from M. tuberculosis H37Rv
[NP.sub.--217021], fadDD22 from M. tuberculosis H37Rv
[NP.sub.--217464], fadD from E. coli [NP.sub.--416319], fadK from
E. coli [YP.sub.--416216], fadD from Acinetobacter sp. ADP1
[YP.sub.--045024], fadD from Haemophilus influenza RdkW20
[NP.sub.--438551], fadD from Rhodopseudomonas palustris Bis B18
[YP.sub.--533919], BH3101 from Bacillus halodurans C-125
[NP.sub.--243969], Pfl-4354 from Pseudomonas fluorescens Pfo-1
[YP.sub.--350082], EAV15023 from Comamonas testosterone KF-1
[ZP.sub.--01520072], yhfL from B. subtilis [NP.sub.--388908], fadD1
from P. aeruginosa PAO1 [NP.sub.--251989], fadD1 from Ralstonia
solanacearum GM1 1000 [NP.sub.--520978], fadD2 from P. aeruginosa
PAO1 [NP.sub.--251990], the gene encoding the protein
ZP.sub.--01644857 from Stenotrophomonas maltophilia R551-3, faa3p
from Saccharomyces cerevisiae [NP.sub.--012257], faa1p from
Saccharomyces cerevisiae [NP.sub.--014962], lcfA from Bacillus
subtilis [CAA99571], or those described in Shockey et al., Plant.
Physiol. 129:1710-1722 (2002); Caviglia et al., J. Biol. Chem.
279:1163-1169 (2004); Knoll et al., J. Biol. Chem. 269(23):16348-56
(1994); Johnson et al., J. Biol. Chem. 269: 18037-18046 (1994); and
Black et al., J. Biol. Chem. 267: 25513-25520 (1992).
Formation of Branched Fatty Esters
[0170] Fatty esters can be produced that contain branch points by
using branched fatty acid derivatives as substrates or precursors.
For example, although E. coli naturally produces straight chain
fatty acids (sFAs), E. coli can be engineered to produce branched
chain fatty acids (brFAs) by introducing and expressing or
overexpressing genes that provide branched precursors in the E.
coli (e.g., bkd, ilv, icm, and fab gene families). Additionally, a
host cell can be engineered to express or overexpress one or more
genes encoding one or more proteins for the elongation of brFAs
(e.g., ACP, FabF, etc.) and/or to delete or attenuate the
corresponding host cell genes that normally lead to sFAs.
[0171] The first step in forming brFAs is the production of the
corresponding .alpha.-keto acids by a branched-chain amino acid
aminotransferase. Host cells may endogenously include genes
encoding such enzymes or such genes can be recombinantly
introduced. E. coli, for example, endogenously expresses such an
enzyme, IlvE (EC 2.6.1.42; GenBank accession YP.sub.--026247). In
some host cells, a heterologous branched-chain amino acid
aminotransferase may not be expressed. However, E. coli IlvE or any
other branched-chain amino acid aminotransferase (e.g., IlvE from
Lactococcus lactis (GenBank accession AAF34406), IlvE from
Pseudomonas putida (GenBank accession NP.sub.--745648), or IlvE
from Streptomyces coelicolor (GenBank accession NP.sub.--629657)),
if not endogenous, can be introduced.
[0172] In another embodiment, the production of .alpha.-keto acids
can be achieved by using the methods described in Atsumi et al.,
Nature 451:86-89 (2008). For example, 2-ketoisovalerate can be
produced by overexpressing the genes encoding IlvI, IlvH, IlvC, or
IlvD. In another example, 2-keto-3-methyl-valerate can be produced
by overexpressing the genes encoding IlvA and IlvI, IlvH (or AlsS
of Bacillus subtilis), IlvC, IlvD, or their corresponding homologs.
In a further embodiment, 2-keto-4-methyl-pentanoate can be produced
by overexpressing the genes encoding IlvI, IlvH, IlvC, IlvD and
LeuA, LeuB, LeuC, LeuD, or their corresponding homologs.
[0173] The second step is the oxidative decarboxylation of the
.alpha.-keto acids to the corresponding branched-chain acyl-CoA.
This reaction can be catalyzed by a branched-chain .alpha.-keto
acid dehydrogenase complex (bkd; EC 1.2.4.4.) (Denoya et al., J.
Bacteria 177:3504 (1995)), which consists of E1a/13
(decarboxylase), E2 (dihydrolipoyl transacylase), and E3
(dihydrolipoyl dehydrogenase) subunits. These branched-chain
.alpha.-keto acid dehydrogenase complexes are similar to pyruvate
and .alpha.-ketoglutarate dehydrogenase complexes. Any
microorganism that possesses brFAs and/or grows on branched-chain
amino acids can be used as a source to isolate bkd genes for
expression in host cells, for example, E. coli. Furthermore, E.
coli has the E3 component as part of its pyruvate dehydrogenase
complex (lpd, EC 1.8.1.4, GenBank accession NP.sub.--414658). Thus,
it may be sufficient to express only the E1.alpha./.beta. and E2
bkd genes. Table 2 lists non-limiting examples of bkd genes from
several microorganisms that can be recombinantly introduced and
expressed in a host cell to provide branched-chain acyl-CoA
precursors.
TABLE-US-00002 TABLE 2 Bkd genes from selected microorganisms
Organism Gene GenBank Accession # Streptomyces coelicolor bkdA1
(E1.alpha.) NP_628006 bkdB1 (E1.beta.) NP_628005 bkdC1 (E2)
NP_628004 Streptomyces coelicolor bkdA2 (E1.alpha.) NP_733618 bkdB2
(E1.beta.) NP_628019 bkdC2 (E2) NP_628018 Streptomyces avermitilis
bkdA (E1a) BAC72074 bkdB (E1b) BAC72075 bkdC (E2) BAC72076
Streptomyces avermitilis bkdF (E1.alpha.) BAC72088 bkdG (E1.beta.)
BAC72089 bkdH (E2) BAC72090 Bacillus subtilis bkdAA (E1.alpha.)
NP_390288 bkdAB (E1.beta.) NP_390288 bkdB (E2) NP_390288
Pseudomonas putida bkdA1 (E1.alpha.) AAA65614 bkdA2 (E1.beta.)
AAA65615 bkdC (E2) AAA65617
[0174] In another example, isobutyryl-CoA can be made in a host
cell, for example in E. coli, through the coexpression of a
crotonyl-CoA reductase (Ccr, EC 1.6.5.5, 1.1.1.1) and
isobutyryl-CoA mutase (large subunit IcmA, EC 5.4.99.2; small
subunit IcmB, EC 5.4.99.2) (Han and Reynolds, J. Bacteriol.
179:5157 (1997)). Crotonyl-CoA is an intermediate in fatty acid
biosynthesis in E. coli and other microorganisms. Non-limiting
examples of ccr and icm genes from selected microorganisms are
listed in Table 3.
TABLE-US-00003 TABLE 3 Ccr and icm genes from selected
microorganisms Organism Gene GenBank Accession # Streptomyces
coelicolor ccr NP_630556 icmA NP_629554 icmB NP_630904 Streptomyces
cinnamonensis ccr AAD53915 icmA AAC08713 icmB AJ246005
[0175] In addition to expression of the bkd genes, the initiation
of brFA biosynthesis utilizes .beta.-ketoacyl-acyl-carrier-protein
synthase III (FabH, EC 2.3.1.41) with specificity for branched
chain acyl-CoAs (Li et al., J. Bacteriol. 187:3795-3799 (2005)).
Non-limiting examples of such FabH enzymes are listed in Table 4.
fabH genes that are involved in fatty acid biosynthesis of any
brFA-containing microorganism can be expressed in a host cell. The
Bkd and FabH enzymes from host cells that do not naturally make
brFA may not support brFA production. Therefore, bkd and fabH can
be expressed recombinantly. Vectors containing the bkd and fabH
genes can be inserted into such a host cell. Similarly, the
endogenous level of Bkd and FabH production may not be sufficient
to produce brFA. In this case, they can be overexpressed.
Additionally, other components of the fatty acid biosynthesis
pathway can be expressed or overexpressed, such as acyl carrier
proteins (ACPs) and .beta.-ketoacyl-acyl-carrier-protein synthase
II (fabF, EC 2.3.1.41) (non-limiting examples of FabH, ACP, and
fabF genes from select microorganisms are listed in Table 4). In
addition to expressing these genes, some genes in the endogenous
fatty acid biosynthesis pathway can be attenuated in the host cell
(e.g., the E. coli genes fabH (GenBank accession #NP.sub.--415609)
and/or fabF (GenBank accession #NP.sub.--415613)).
TABLE-US-00004 TABLE 4 FabH, ACP and fabF genes from selected
microorganisms with brFAs GenBank Organism Gene Accession #
Streptomyces coelicolor fabH1 NP_626634 acp NP_626635 fabF
NP_626636 Streptomyces avermitilis fabH3 NP_823466 fabC3 (acp)
NP_823467 fabF NP_823468 Bacillus subtilis fabH_A NP_389015 fabH_B
NP_388898 acp NP_389474 fabF NP_389016 Stenotrophomonas
SmalDRAFT_0818 (fabH) ZP_01643059 maltophilia SmalDRAFT_0821 (acp)
ZP_01643063 SmalDRAFT_0822 (fabF) ZP_01643064 Legionella
pneumophila fabH YP_123672 acp YP_123675 fabF YP_123676
Formation of Cyclic Fatty Esters
[0176] Cyclic fatty esters can be produced by using cyclic fatty
acid derivatives as substrates. To produce cyclic fatty acid
derivative substrates, genes that provide cyclic precursors (e.g.,
the ans, chc, and plm gene families) can be introduced into the
host cell and expressed to allow initiation of fatty acid
biosynthesis from cyclic precursors. For example, to convert a host
cell, such as E. coli, into one capable of synthesizing w-cyclic
fatty acids (cyFA), a gene that provides the cyclic precursor
cyclohexylcarbonyl-CoA (CHC-CoA) (Cropp et al., Nature Biotech.
18:980-983 (2000)) can be introduced and expressed in the host
cell. Non-limiting examples of genes that provide CHC-CoA in E.
coli include: ansJ, ansK, ansL, chcA, and ansM from the ansatrienin
gene cluster of Streptomyces collinus (Chen et al., Eur. J.
Biochem. 261: 98-107 (1999)) or plmJ, plmK, plmL, chcA, and plmM
from the phoslactomycin B gene cluster of Streptomyces sp. HK803
(Palaniappan et al., J. Biol. Chem. 278:35552-35557 (2003))
together with the chcB gene (Patton et al., Biochem. 39:7595-7604
(2000)) from S. collinus, S. avermitilis, or S. coelicolor (see
Table 5). Furthermore, the genes listed in Table 4, which tend to
have broad substrate specificity, can then be expressed to allow
initiation and elongation of co-cyclic fatty acids. Alternatively,
the homologous genes can be isolated from microorganisms that make
cyFA and expressed in a host cell (e.g., E. coli).
TABLE-US-00005 TABLE 5 Genes for the synthesis of CHC-CoA Organism
Gene GenBank Accession # Streptomyces collinus ansJK U72144* ansL
chcA ansM chcB AF268489 Streptomyces sp. HK803 pmlJK AAQ84158 pmlL
AAQ84159 chcA AAQ84160 pmlM AAQ84161 Streptomyces coelicolor
chcB/caiD NP_629292 Streptomyces avermitilis chcB/caiD NP_629292
*Only chcA is annotated in GenBank entry U72144, ansJKLM are
according to Chen et al. (Eur. J. Biochem. 261: 98-107 (1999)).
[0177] The genes listed in Table 4 (fabH, acp, and fabF) allow
initiation and elongation of co-cyclic fatty acids because they
have broad substrate specificity. If the coexpression of any of
these genes with the genes listed in Table 5 does not yield cyFA,
then fabH, acp, and/or fabF homologs from microorganisms that make
cyFAs (e.g., those listed in Table 6) can be isolated (e.g., by
using degenerate PCR primers or heterologous DNA sequence probes)
and coexpressed.
TABLE-US-00006 TABLE 6 Non-limiting examples of microorganisms that
contain .omega.-cyclic fatty acids Organism Reference
Curtobacterium pusillum ATCC19096 Alicyclobacillus acidoterrestris
ATCC49025 Alicyclobacillus acidocaldarius ATCC27009
Alicyclobacillus cycloheptanicus* Moore, J. Org. Chem. 62: pp.
2173, 1997 *Uses cycloheptylcarbonyl-CoA and not
cyclohexylcarbonyl-CoA as precursor for cyFA biosynthesis.
Fatty Ester Saturation Levels
[0178] The degree of saturation in fatty acids can be controlled by
regulating the degree of saturation of fatty acid intermediates.
For example, the sfa, gns, and fab families of genes can be
expressed, overexpressed, or expressed at reduced levels, to
control the saturation of fatty acids. A number of those genes have
been described in, for example, WO 2008/119082, the disclosure of
which is incorporated by reference. Non-limiting examples of genes
in these gene families include GenBank Accession No: AAN79592,
AAC44390, ABD18647.1, AAC74076.1, BAA16180, AAF98273, AAU39821, or
DDA05501.
[0179] For example, host cells can be engineered to produce
unsaturated fatty acids by engineering the production host to
overexpress fabB or by growing the production host at low
temperatures (e.g., less than 37.degree. C.). FabB has preference
to cis-.delta.3decenoyl-ACP and results in unsaturated fatty acid
production in E. coli. Overexpression of fabB results in the
production of a significant percentage of unsaturated fatty acids
(de Mendoza et al., J. Biol. Chem. 258:2098-2101 (1983)). In some
embodiments, an endogenous fabB gene may be modified such that it
is overexpressed. In some other embodiments, a heterologous fabB
gene may be inserted into and expressed in host cells not naturally
having the gene. These unsaturated fatty acids can then be used as
intermediates in host cells that are engineered to produce fatty
acid derivatives, such as fatty esters.
[0180] In other instances, a repressor of fatty acid biosynthesis,
for example, fabR (GenBank accession NP.sub.--418398), can be
deleted, which will also result in increased unsaturated fatty acid
production in E. coli (Zhang et al., J. Biol. Chem. 277:15558,
(2002)). Similar deletions may be made in other host cells. A
further increase in unsaturated fatty acids may be achieved, for
example, by overexpressing fabM (trans-2, cis-3-decenoyl-ACP
isomerase, GenBank accession DAA05501) and controlled expression of
fabK (trans-2-enoyl-ACP reductase II, GenBank accession
NP.sub.--357969) from Streptococcus pneumoniae (Marrakchi et al.,
J. Biol. Chem. 277: 44809 (2002)), while deleting E. coli fabI
(trans-2-enoyl-ACP reductase, GenBank accession NP.sub.--415804).
In some examples, the endogenous fabF gene can be attenuated, thus
increasing the percentage of palmitoleate (C16:1) produced.
[0181] In yet other examples, host cells can be engineered to
produce saturated fatty acids by reducing the expression of an sfa,
gns, or fab gene.
[0182] For example, a host cell can be engineered to express a
decreased level of fabA and/or fabB. In some instances, the host
cell can be grown in the presence of unsaturated fatty acids. In
other instances, the host cell can be further engineered to express
or overexpress a gene encoding a desaturase enzyme. One
non-limiting example of a desaturase is B. subtilis DesA
(AF037430). Other genes encoding desaturase enzymes are known in
the art and can be used in the host cells and methods described
herein, such as desaturases that use acyl-ACP, such as
hexadecanoyl-ACP or octadecanoyl-ACP.
Ester Synthases
[0183] Fatty esters can be synthesized by acyl-CoA:fatty alcohol
acyltransferase, which conjugates an alcohol to a fatty acyl-CoA
via an ester linkage. Ester synthases and encoding genes are known
from the jojoba plant and the bacterium Acinetobacter sp. strain
ADP1 (formerly Acinetobacter calcoaceticus ADP1). The bacterial
ester synthase is a bifunctional enzyme, exhibiting ester synthase
activity and the ability to form triacylglycerols from
diacylglycerol substrates and fatty acyl-CoAs (acyl-CoA:diglycerol
acyltransferase (DGAT) activity). The gene wax/dgat encodes both
ester synthase and DGAT. See Cheng et al., J. Biol. Chem.
279(36):37798-37807 (2004); Kalscheuer and Steinbuchel, J. Biol.
Chem. 278:8075-8082 (2003).
[0184] Other ester synthases (EC 2.3.1.20, 2.3.1.75) are known in
the art, and exemplary GenBank Accession Numbers include, without
limitation, NP.sub.--190765, AAA16514, AAF19262, AAX48018,
AAO17391, or AAD38041. Methods to identify ester synthase activity
are provided in, e.g., U.S. Pat. No. 7,118,896, which is herein
incorporated by reference in its entirety. Other ester synthases
include bifunctional ester synthase/acyl-CoA:diacylglycerol
acyltransferases, nonlimiting examples of which include the
multienzyme complexes from Simmondsia chinensis, Acinetobacter sp.
strain ADP1 (formerly Acinetobacter calcoaceticus ADP1),
Alcanivorax borkumensis, Pseudomonas aeruginosa, Fundibacter
jadensis, Arabidopsis thaliana, or Alcaligenes eutrophus (later
renamed Ralstonia eutropha). In one embodiment, the fatty acid
elongases, acyl-CoA reductases or wax synthases can be from a
multienzyme complex from Alcaligenes eutrophus (later renamed
Ralstonia eutropha) or other organisms known in the literature to
produce esters, such as wax or fatty esters. Additional sources of
heterologous DNA sequence encoding ester synthesis proteins that
can be used in fatty ester production include, but are not limited
to, Mortierella alpina (e.g., ATCC 32222), Cryptococcus curvatus
(also referred to as Apiotricum curvatum), Alcanivorax jadensis
(for example T9T, DSM 12718, or ATCC 700854), Acinetobacter sp.
HO1-N, (e.g., ATCC 14987), Rhodococcus opacus (e.g., PD630, DSMZ
44193), and the ester synthases from Marinobacter
hydrocarbonoclastics (e.g., DSM 8798). In certain embodiments, the
ester synthase is selected from the group consisting of: AtfA1 (an
ester synthase derived from Alcanivorax borkumensis SK2, GenBank
Accession No. YP.sub.--694462), AtfA2 (another ester synthase
derived from Alcanivorax borkumensis SK2, GenBank Accession No.
YP.sub.--693524), ES9 (an ester synthase from Marinobacter
hydrocarbonoclasticus DSM 8798, GenBank Accession No. ABO21021),
ES8 (another ester synthase derived from Marinobacter
hydrocarbonoclasticus DSM 8798, GenBank Accession No. ABO21020),
and variants thereof. In a particular embodiment, the gene encoding
the ester synthase or a suitable variant is overexpressed.
[0185] Additional ester synthases that can be used are described in
the PCT patent application entitled "Production of Fatty Acid
Derivatives", filed concurrently herewith, under Attorney Docket
No. 2001235.150WO2.
Genetic Engineering of Host Cells to Produce Fatty Esters
[0186] Various host cells can be used to produce fatty esters, as
described herein. A host cell can be any prokaryotic or eukaryotic
cell. For example, a polypeptide described herein can be expressed
in bacterial cells (such as E. coli), insect cells, yeast, or
mammalian cells (such as Chinese hamster ovary cells (CHO) cells,
COS cells, VERO cells, BHK cells, HeLa cells, Cv1 cells, MDCK
cells, 293 cells, 3T3 cells, or PC12 cells). Other exemplary host
cells include cells from the members of the genus Escherichia,
Bacillus, Lactobacillus, Rhodococcus, Pseudomonas, Aspergillus,
Trichoderma, Neurospora, Fusarium, Humicola, Rhizomucor,
Kluyveromyces, Pichia, Mucor, Myceliophtora, Penicillium,
Phanerochaete, Pleurotus, Trametes, Chrysosporium, Saccharomyces,
Schizosaccharomyces, Yarrowia, or Streptomyces. Yet other exemplary
host cells can be a Bacillus lentus cell, a Bacillus brevis cell, a
Bacillus stearothermophilus cell, a Bacillus licheniformis cell, a
Bacillus alkalophilus cell, a Bacillus coagulans cell, a Bacillus
circulans cell, a Bacillus pumilis cell, a Bacillus thuringiensis
cell, a Bacillus clausii cell, a Bacillus megaterium cell, a
Bacillus subtilis cell, a Bacillus amyloliquefaciens cell, a
Trichoderma koningii cell, a Trichoderma viride cell, a Trichoderma
reesei cell, a Trichoderma longibrachiatum cell, an Aspergillus
awamori cell, an Aspergillus fumigates cell, an Aspergillus
foetidus cell, an Aspergillus nidulans cell, an Aspergillus niger
cell, an Aspergillus oryzae cell, a Humicola insolens cell, a
Humicola lanuginose cell, a Rhizomucor miehei cell, a Mucor michei
cell, a Streptomyces lividans cell, a Streptomyces murinus cell, or
an Actinomycetes cell. Other host cells are cyanobacterial host
cells.
[0187] In certain embodiments, the host cell is an Actinomycetes,
Saccharomyces cerevisiae, Candida lipolytica (or Yarrowia
lipolytica), E. coli, Arthrobacter AK 19, Rhodotorula glutinims,
Acintobacter sp. M-1 cell, or a cell from other oleaginous
microorganisms.
[0188] In particular embodiments, the host cell is a cell from an
eukaryotic plant, algae, cyanobacterium, green-sulfur bacterium,
green non-sulfur bacterium, purple sulfur bacterium, purple
non-sulfur bacterium, extremophile, yeast, fungus, engineered
organisms thereof, or a synthetic organism. In some embodiments,
the host cell is light dependent or fixes carbon. In some
embodiments, the host cell has autotrophic activity. In some
embodiments, the host cell has photoautotrophic activity, such as
in the presence of light. In certain embodiments, the host cell is
a cell from Arabidopsis thaliana, Panicum virgatums, Miscanthus
giganteus, Zea mays, botryococcuse braunii, Chlamydomonas
reinhardtii, Dunaliela salina, Thermosynechococcus elongatus,
Chlorobium tepidum, Chloroflexus auranticus, Chromatiumm vinosum,
Rhodospirillum rubrum, Rhodobacter capsulatus, Rhodopseudomonas
palusris, Clostridium ljungdahlii, Clostridiuthermocellum, or
Pencillium chrysogenum. In certain other embodiments, the host cell
is from Pichia pastories, Saccharomyces cerevisiae, Yarrowia
lipolytica, Schizosaccharomyces pombe, Pseudomonas fluorescens, or
Zymomonas mobilis. In yet further embodiments, the host cell is a
cell from Synechococcus sp. PCC 7002, Synechococcus elongatus. PCC
7942, or Synechocystis sp. PCC6803.
[0189] In a preferred embodiment, the host cell is an E. coli cell,
a Saccharomyces cerevisiae cell, or a Bacillus subtilis cell. In a
more preferred embodiment, the host cell is from E. coli strains B,
C, K, or W. Other suitable host cells are known to those skilled in
the art.
[0190] Various methods well known in the art can be used to
genetically engineer host cells to produce fatty esters. The
methods can include the use of vectors, preferably expression
vectors, containing a nucleic acid encoding a fatty acid
biosynthetic polypeptide described herein, polypeptide variant, or
a fragment thereof. Those skilled in the art will appreciate a
variety of viral vectors (for example, retroviral vectors,
lentiviral vectors, adenoviral vectors, and adeno-associated viral
vectors) and non-viral vectors can be used in the methods described
herein.
[0191] The recombinant expression vectors described herein include
a nucleic acid described herein in a form suitable for expression
of the nucleic acid in a host cell. The recombinant expression
vectors can include one or more control sequences, selected on the
basis of the host cell to be used for expression. The control
sequence is operably linked to the nucleic acid sequence to be
expressed. Such control sequences are described, for example, in
Goeddel, Gene Expression Technology: Methods in Enzymology 185,
Academic Press, San Diego, Calif. (1990). Control sequences include
those that direct constitutive expression of a nucleotide sequence
in many types of host cells and those that direct expression of the
nucleotide sequence only in certain host cells (e.g.,
tissue-specific regulatory sequences). It will be appreciated by
those skilled in the art that the design of the expression vector
can depend on such factors as the choice of the host cell to be
transformed, the level of expression of protein desired, etc. The
expression vectors described herein can be introduced into host
cells to produce polypeptides, including fusion polypeptides,
encoded by the nucleic acids as described herein.
[0192] Recombinant expression vectors can be designed for
expression of a fatty acid biosynthetic polypeptide or variant in
prokaryotic or eukaryotic cells (e.g., bacterial cells, such as E.
coli, insect cells (e.g., using baculovirus expression vectors),
yeast cells, or mammalian cells). Suitable host cells are discussed
further in Goeddel, Gene Expression Technology: Methods in
Enzymology 185, Academic Press, San Diego, Calif. (1990).
Alternatively, the recombinant expression vector can be transcribed
and translated in vitro, for example, by using T7 promoter
regulatory sequences and T7 polymerase.
[0193] Expression of polypeptides in prokaryotes, for example, E.
coli, is most often carried out with vectors containing
constitutive or inducible promoters directing the expression of
either fusion or non-fusion polypeptides. Fusion vectors add a
number of amino acids to a polypeptide encoded therein, usually to
the amino terminus of the recombinant polypeptide. Such fusion
vectors typically serve three purposes: (1) to increase expression
of the recombinant polypeptide; (2) to increase the solubility of
the recombinant polypeptide; and (3) to aid in the purification of
the recombinant polypeptide by acting as a ligand in affinity
purification. Often, in fusion expression vectors, a proteolytic
cleavage site is introduced at the junction of the fusion moiety
and the recombinant polypeptide. This enables separation of the
recombinant polypeptide from the fusion moiety after purification
of the fusion polypeptide. Examples of such enzymes, and their
cognate recognition sequences, include Factor Xa, thrombin, and
enterokinase. Exemplary fusion expression vectors include pGEX
(Pharmacia, Piscataway, N.J.; Smith et al., Gene 67:31-40 (1988)),
pMAL (New England Biolabs, Inc., Ipswich, Mass.), and pRITS
(Pharmacia, Piscataway, N.J.), which fuse glutathione S-transferase
(GST), maltose E binding protein, or protein A, respectively, to
the target recombinant polypeptide.
[0194] Examples of inducible, non-fusion E. coli expression vectors
include pTrc (Amann et al., Gene 69:301-315 (1988)) and pET 11d
(Studier et al., Gene Expression Technology Methods in Enzymology
185, Academic Press, San Diego, Calif. 60-89 (1990)). Target gene
expression from the pTrc vector relies on host RNA polymerase
transcription from a hybrid tip-lac fusion promoter. Target gene
expression from the pET 11d vector relies on transcription from a
T7 gn10-lac fusion promoter mediated by a coexpressed viral RNA
polymerase (T7 gn1). This viral polymerase is supplied by host
strains BL21(DE3) or HMS174(DE3) from a resident 2, prophage
harboring a T7 gn1 gene under the transcriptional control of the
lacUV 5 promoter.
[0195] One strategy to maximize recombinant polypeptide expression
is to express the polypeptide in a host cell with an impaired
capacity to proteolytically cleave the recombinant polypeptide
(see, Gottesman, Gene Expression Technology: Methods in Enzymology
185, Academic Press, San Diego, Calif. 119-128 (1990)). Another
strategy is to alter the nucleic acid sequence to be inserted into
an expression vector so that the individual codons for each amino
acid are those preferentially utilized in the host cell (Wada et
al., Nucleic Acids Res. 20:2111-2118 (1992)). Such alteration of
nucleic acid sequences can be carried out by standard DNA synthesis
techniques.
[0196] In another embodiment, the host cell is a yeast cell. In
this embodiment, the expression vector is a yeast expression
vector. Examples of vectors for expression in yeast S. cerevisiae
include pYepSec1 (Baldari et al., EMBO J. 6:229-234 (1987)), pMFa
(Kurjan et al., Cell 30:933-943 (1982)), pJRY88 (Schultz et al.,
Gene 54:113-123 (1987)), pYES2 (Invitrogen Corporation, San Diego,
Calif.), and picZ (Invitrogen Corp, San Diego, Calif.).
[0197] Alternatively, a polypeptide described herein can be
expressed in insect cells using baculovirus expression vectors.
Baculovirus vectors available for expression of proteins in
cultured insect cells (e.g., Sf9 cells) include, for example, the
pAc series (Smith et al., Mol. Cell Biol. 3:2156-2165 (1983)) and
the pVL series (Lucklow et al., Virology 170:31-39 (1989)).
[0198] In yet another embodiment, the nucleic acids described
herein can be expressed in mammalian cells using a mammalian
expression vector. Examples of mammalian expression vectors include
pCDM8 (Seed, Nature 329:840 (1987)) and pMT2PC (Kaufman et al.,
EMBO J. 6:187-195 (1987)). When used in mammalian cells, the
expression vector's control functions can be provided by viral
regulatory elements. For example, commonly used promoters are
derived from polyoma, Adenovirus 2, cytomegalovirus, and Simian
Virus 40. Other suitable expression systems for both prokaryotic
and eukaryotic cells are described in chapters 16 and 17 of
Sambrook et al., eds., Molecular Cloning: A Laboratory Manual. 2nd
ed., Cold Spring Harbor Laboratory, Cold Spring Harbor Laboratory
Press, Cold Spring Harbor, N.Y. (1989).
[0199] Vectors can be introduced into prokaryotic or eukaryotic
cells via conventional transformation or transfection techniques.
As used herein, the terms "transformation" and "transfection" refer
to a variety of art-recognized techniques for introducing foreign
nucleic acid (e.g., DNA) into a host cell, including calcium
phosphate or calcium chloride co-precipitation,
DEAE-dextran-mediated transfection, lipofection, or
electroporation. Suitable methods for transforming or transfecting
host cells can be found in, for example, Sambrook et al. supra.
[0200] For stable transformation of bacterial cells, it is known
that, depending upon the expression vector and transformation
technique used, only a small fraction of cells will take-up and
replicate the expression vector. In order to identify and select
these transformants, a gene that encodes a selectable marker (e.g.,
resistance to antibiotics) can be introduced into the host cells
along with the gene of interest. Selectable markers include those
that confer resistance to drugs, such as ampicillin, kanamycin,
chloramphenicol, carbenicillin, spectinomycin, or tetracycline.
Nucleic acids encoding a selectable marker can be introduced into a
host cell on the same vector as that encoding a polypeptide
described herein or can be introduced on a separate vector. Cells
stably transfected with the introduced nucleic acid can be
identified by drug selection (e.g., cells that have incorporated
the selectable marker gene will survive, while the other cells
die).
[0201] For stable transfection of mammalian cells, it is known
that, depending upon the expression vector and transfection
technique used, only a small fraction of cells may integrate the
foreign DNA into their genome. In order to identify and select
these integrants, a gene that encodes a selectable marker (e.g.,
resistance to antibiotics) can be introduced into the host cells
along with the gene of interest. Preferred selectable markers
include those which confer resistance to drugs, such as G418,
hygromycin, and methotrexate. Nucleic acids encoding a selectable
marker can be introduced into a host cell on the same vector as
that encoding a polypeptide described herein or can be introduced
on a separate vector. Cells stably transfected with the introduced
nucleic acid can be identified by drug selection (e.g., cells that
have incorporated the selectable marker gene will survive, while
the other cells die).
Transport Proteins
[0202] Transport proteins can export polypeptides and organic
compounds (e.g., fatty esters) out of a host cell. Many transport
and efflux proteins serve to excrete a wide variety of compounds
and can be naturally modified to be selective for particular types
of fatty esters.
[0203] Non-limiting examples of suitable transport proteins are
ATP-Binding Cassette (ABC) transport proteins, efflux proteins, and
fatty acid transporter proteins (FATP). Additional non-limiting
examples of suitable transport proteins include the ABC transport
proteins from organisms such as Caenorhabditis elegans, Arabidopsis
thalania, Alkaligenes eutrophus, and Rhodococcus erythropolis.
Exemplary ABC transport proteins that can be used have been
described in, for example, WO 08/119,082, the disclosure of which
is incorporated herein by reference. Exemplary GenBank Accession
numbers include, without limitation, CER5 [GenBank Accession No:
At1g51500, AY734542, At3g21090, or At1g51460], AtMRP5 [GenBank
Accession No: NP.sub.--171908], AmiS2 [GenBank Accession No.
JC5491], or AtPGP1 [GenBank Accession No. NP.sub.--181228]. Host
cells can also be chosen for their endogenous ability to secrete
organic compounds. The efficiency of organic compound production
and secretion into the host cell environment (e.g., culture medium,
fermentation broth) can be expressed as a ratio of intracellular
product to extracellular product. In some examples, the ratio can
be about 5:1, 4:1, 3:1, 2:1, 1:1, 1:2, 1:3, 1:4, or 1:5.
[0204] In certain embodiments, the fatty esters produced by the
host cells herein are present in the extracellular environment. In
some embodiments, the fatty esters are isolated from the
extracellular environment of the host cell. In some other
embodiments, the fatty esters are spontaneously secreted, partially
or completely, from the host cell. In further embodiments, the
fatty esters are transported into the extracellular environment,
optionally with the aid of one or more suitable transport proteins
as described herein. In other embodiments, the fatty esters are
passively transported into the extracellular environment.
Fermentation
[0205] The production and isolation of fatty esters can be enhanced
by employing beneficial fermentation techniques. One method for
maximizing production while reducing costs is increasing the
percentage of the carbon source that is converted to fatty ester
products.
[0206] During normal cellular lifecycles, carbon is used in
cellular functions, such as producing lipids, saccharides,
proteins, organic acids, and nucleic acids. Reducing the amount of
carbon necessary for growth-related activities can increase the
efficiency of carbon source conversion to product. This can be
achieved by, for example, first growing host cells to a desired
density (for example, a density achieved at the peak of the log
phase of growth). At such a point, replication checkpoint genes can
be harnessed to stop the growth of cells. Specifically, quorum
sensing mechanisms (reviewed in Camilli et al., Science 311:1113,
(2006); Venturi, FEMS Microbio. Rev. 30:274-291 (2006); and Reading
et al., FEMS Microbiol. Lett. 254:1-11 (2006)) can be used to
activate checkpoint genes, such as p53, p21, or other checkpoint
genes.
[0207] Genes that can be activated to stop cell replication and
growth in E. coli include umuDC genes. The overexpression of umuDC
genes stops the progression from stationary phase to exponential
growth (Murli et al., J. of Bact. 182:1127 (2000)). UmuC is a DNA
polymerase that can carry out translesion synthesis over non-coding
lesions--the mechanistic basis of most UV and chemical mutagenesis.
The umuDC gene products are involved in the process of translesion
synthesis and also serve as a DNA sequence damage checkpoint. The
umuDC gene products include UmuC, UmuD, umuD', UmuD'.sub.2C,
UmuD'.sub.2, and UmuD.sub.2. Simultaneously, product-producing
genes can be activated, thus minimizing the need for replication
and maintenance pathways to be used while a fatty ester is being
made. Host cells can also be engineered to express umuC and umuD
from E. coli in pBAD24 under the prpBCDE promoter system through de
novo synthesis of this gene with the appropriate end-product
production genes.
[0208] The percentage of input carbons converted to fatty esters
can be a cost driver. The more efficient the process is (i.e., the
higher the percentage of input carbons converted to fatty esters),
the less expensive the process will be. For oxygen-containing
carbon sources (e.g., glucose and other carbohydrate based
sources), the oxygen must be released in the form of carbon
dioxide. For every 2 oxygen atoms released, a carbon atom is also
released leading to a maximal theoretical metabolic efficiency of
approximately 34% (w/w) (for fatty acid derived products). This
figure, however, changes for other organic compounds and carbon
sources. Typical efficiencies in the literature are approximately
less than 5%. Host cells engineered to produce fatty esters can
have greater than about 1, 3, 5, 10, 15, 20, 25, and 30%
efficiency. In one example, host cells can exhibit an efficiency of
about 10% to about 25%. In other examples, such host cells can
exhibit an efficiency of about 25% to about 30%. In other examples,
host cells can exhibit greater than 30% efficiency.
[0209] The host cell can be additionally engineered to express
recombinant cellulosomes, such as those described in PCT
application number PCT/US2007/003736. These cellulosomes can allow
the host cell to use cellulosic material as a carbon source. For
example, the host cell can be additionally engineered to express
invertases (EC 3.2.1.26) so that sucrose can be used as a carbon
source. Similarly, the host cell can be engineered using the
teachings described in U.S. Pat. Nos. 5,000,000; 5,028,539;
5,424,202; 5,482,846; and 5,602,030; so that the host cell can
assimilate carbon efficiently and use cellulosic materials as
carbon sources.
[0210] In one example, the fermentation chamber can enclose a
fermentation that is undergoing a continuous reduction. In this
instance, a stable reductive environment can be created. The
electron balance can be maintained by the release of carbon dioxide
(in gaseous form). Efforts to augment the NAD/H and NADP/H balance
can also facilitate in stabilizing the electron balance. The
availability of intracellular NADPH can also be enhanced by
engineering the host cell to express an NADH:NADPH transhydrogenase
or an NADPH:NADH transhydrogenase. The expression of one or more
NADH:NADPH transhydrogenases converts the NADH produced in
glycolysis to NADPH, which can enhance the production of fatty
acids.
[0211] For small scale production, the engineered host cells can be
grown in batches of, for example, about 100 mL, 500 mL, 1 L, 2 L, 5
L, 10 L or 25 L; fermented and induced to express desired
biosynthetic genes based on the specific genes encoded in the
appropriate plasmids. For large scale production, the engineered
host cells can be grown in batches of about 50 L, 100 L, 1000 L,
10,000 L, 100,000 L, 1,000,000 L or larger; fermented and induced
to express desired biosynthetic genes based on the specific genes
encoded in the appropriate plasmids or incorporated into the host
cell's genome.
[0212] For example, a suitable production host, such as E. coli
cells, harboring plasmids containing the desired biosynthetic genes
or having the biosynthetic genes integrated in its chromosome can
be incubated in a suitable reactor, for example a 1 L reactor, for
20 h at 37.degree. C. in M9 medium supplemented with 2% glucose,
carbenicillin, and chloramphenicol. When the OD.sub.600 of the
culture reaches 0.9, the production host can be induced with IPTG
to activate the engineered gene systems for fatty ester production.
After incubation, the spent media can be extracted and the organic
phase can be examined for the presence of fatty esters using
GC-MS.
[0213] In some instances, after the first hour of induction,
aliquots of no more than about 10% of the total cell volume can be
removed each hour and allowed to sit without agitation to allow the
fatty esters to rise to the surface and undergo a spontaneous phase
separation or precipitation. The fatty ester component can then be
collected, and the aqueous phase returned to the reaction chamber.
The reaction chamber can be operated continuously. When the
OD.sub.600 drops below 0.6, the cells can be replaced with a new
batch grown from a seed culture.
Glucose
[0214] In some instances, the methods disclosed herein are
performed using glucose as a carbon source. In certain instances,
microorganisms are grown in a culture medium containing an initial
glucose concentration of about 2 g/L to about 100 g/L, such as
about 5 g/L to about 20 g/L. In some instances, the glucose
concentration of the culture medium decreases from the initial
glucose concentration as the microorganisms consume the glucose,
and a concentration of about 0 g/L to about 5 g/L glucose is
maintained in the culture medium during the fatty ester production
process. In certain instances, glucose is fed to the microorganisms
in a solution of about 50% to about 65% wt/wt glucose.
[0215] In some instances, the feed rate of glucose is set to match
the cells' growth rate to avoid excess accumulation of glucose
(i.e., >0% glucose) in the fermentor. In other instances, and a
low concentration of excess glucose (e.g., about 2 g/L to about 5
g/L) is maintained.
[0216] In certain instances, fatty esters can be produced from
carbohydrates other than glucose, including but not limited to
fructose, hydrolyzed sucrose, hydrolyzed molasses and glycerol.
Post-Production Processing
[0217] The fatty esters produced during fermentation can be
separated from the fermentation media. Any known technique for
separating fatty esters from aqueous media can be used. One
exemplary separation process is a two phase (bi-phasic) separation
process. This process involves fermenting the genetically
engineered host cells under conditions sufficient to produce a
fatty ester, allowing the fatty ester to collect in an organic
phase, and separating the organic phase from the aqueous
fermentation broth. This method can be practiced in both a batch
and continuous fermentation processes.
[0218] Bi-phasic separation uses the relative immiscibility of
fatty esters to facilitate separation. Immiscible refers to the
relative inability of a compound to dissolve in water and is
defined by the compound's partition coefficient. One of ordinary
skill in the art will appreciate that by choosing a fermentation
broth and organic phase, such that the fatty ester being produced
has a high logP value, the fatty ester can separate into the
organic phase, even at very low concentrations, in the fermentation
vessel.
[0219] The fatty esters produced by the methods described herein
can be relatively immiscible in the fermentation broth, as well as
in the cytoplasm. Therefore, the fatty ester can collect in an
organic phase either intracellularly or extracellularly. The
collection of the products in the organic phase can lessen the
impact of the fatty ester on cellular function and can allow the
host cell to produce more product.
[0220] The methods described herein can result in the production of
homogeneous compounds wherein at least about 60%, 70%, 80%, 90%, or
95% of the fatty esters produced will have carbon chain lengths
that vary by less than about 6 carbons, less than about 4 carbons,
or less than about 2 carbons. These compounds can also be produced
with a relatively uniform degree of saturation. These compounds can
be used directly as fuels, fuel additives, starting materials for
production of other chemical compounds (e.g., polymers,
surfactants, plastics, textiles, solvents, adhesives, etc.), or
personal care additives. These compounds can also be used as
feedstock for subsequent reactions, for example, hydrogenation,
catalytic cracking (e.g., via hydrogenation, pyrolisis, or both),
to make other products.
[0221] In some embodiments, the fatty esters produced using methods
described herein can contain between about 50% and about 90%
carbon; or between about 5% and about 25% hydrogen. In other
embodiments, the fatty esters produced using methods described
herein can contain between about 65% and about 85% carbon; or
between about 10% and about 15% hydrogen.
Bioproducts
[0222] Bioproducts (e.g., the fatty esters produced in accordance
with the present disclosure) comprising biologically produced
organic compounds, and in particular, the fatty esters biologically
produced using the fatty acid biosynthetic pathway herein, have not
been produced from renewable sources and, as such, are new
compositions of matter. These new bioproducts can be distinguished
from organic compounds derived from petrochemical carbon on the
basis of dual carbon-isotopic fingerprinting or .sup.14C dating.
Additionally, the specific source of biosourced carbon (e.g.,
glucose vs. glycerol) can be determined by dual carbon-isotopic
fingerprinting (see, e.g., U.S. Pat. No. 7,169,588, which is herein
incorporated by reference).
[0223] The ability to distinguish bioproducts from petroleum based
organic compounds is beneficial in tracking these materials in
commerce. For example, organic compounds or chemicals comprising
both biologically based and petroleum based carbon isotope profiles
may be distinguished from organic compounds and chemicals made only
of petroleum based materials. Hence, the bioproducts herein can be
followed or tracked in commerce on the basis of their unique carbon
isotope profile.
[0224] Bioproducts can be distinguished from petroleum based
organic compounds by comparing the stable carbon isotope ratio
(.sup.13C/.sup.12C) in each fuel. The .sup.13C/.sup.12C ratio in a
given bioproduct is a consequence of the .sup.13C/.sup.12C ratio in
atmospheric carbon dioxide at the time the carbon dioxide is fixed.
It also reflects the precise metabolic pathway. Regional variations
also occur. Petroleum, C.sub.3 plants (the broadleaf), C.sub.4
plants (the grasses), and marine carbonates all show significant
differences in .sup.13C/.sup.12C and the corresponding
.delta..sup.13C values. Furthermore, lipid matter of C.sub.3 and
C.sub.4 plants analyze differently than materials derived from the
carbohydrate components of the same plants as a consequence of the
metabolic pathway.
[0225] Within the precision of measurement, .sup.13C shows large
variations due to isotopic fractionation effects, the most
significant of which for bioproducts is the photosynthetic
mechanism. The major cause of differences in the carbon isotope
ratio in plants is closely associated with differences in the
pathway of photosynthetic carbon metabolism in the plants,
particularly the reaction occurring during the primary
carboxylation (i.e., the initial fixation of atmospheric CO.sub.2).
Two large classes of vegetation are those that incorporate the
"C.sub.3" (or Calvin-Benson) photosynthetic cycle and those that
incorporate the "C.sub.4" (or Hatch-Slack) photosynthetic
cycle.
[0226] In C.sub.3 plants, the primary CO.sub.2 fixation or
carboxylation reaction involves the enzyme ribulose-1,5-diphosphate
carboxylase, and the first stable product is a 3-carbon compound.
C.sub.3 plants, such as hardwoods and conifers, are dominant in the
temperate climate zones.
[0227] In C.sub.4 plants, an additional carboxylation reaction
involving another enzyme, phosphoenol-pyruvate carboxylase, is the
primary carboxylation reaction. The first stable carbon compound is
a 4-carbon acid that is subsequently decarboxylated. The CO.sub.2
thus released is refixed by the C.sub.3 cycle. Examples of C.sub.4
plants are tropical grasses, corn, and sugar cane.
[0228] Both C.sub.4 and C.sub.3 plants exhibit a range of
.sup.13C/.sup.12C isotopic ratios, but typical values are about -7
to about -13 per mil for C.sub.4 plants and about -19 to about -27
per mil for C.sub.3 plants (see, e.g., Stuiver et al., Radiocarbon
19:355 (1977)). Coal and petroleum fall generally in this latter
range. The .sup.13C measurement scale was originally defined by a
zero set by Pee Dee Belemnite (PDB) limestone, where values are
given in parts per thousand deviations from this material. The
".delta..sup.13C" values are expressed in parts per thousand (per
mil), abbreviated, %0, and are calculated as follows:
.delta..sup.13C(.Salinity.)=[(.sup.13C/.sup.12C).sub.sample-(.sup.13C/.s-
up.12C).sub.standard]/(.sup.13C/.sup.12C).sub.standard.times.1000
[0229] Since the PDB reference material (RM) has been exhausted, a
series of alternative RMs have been developed in cooperation with
the IAEA, USGS, NIST, and other selected international isotope
laboratories. Notations for the per mil deviations from PDB is
.delta..sup.13C. Measurements are made on CO.sub.2 by high
precision stable ratio mass spectrometry (IRMS) on molecular ions
of masses 44, 45, and 46.
[0230] The compositions described herein include bioproducts
produced by any of the methods described herein, including, for
example, the fatty ester products. Specifically, the bioproduct can
have a .delta..sup.13C of about -28 or greater, about -27 or
greater, -20 or greater, -18 or greater, -15 or greater, -13 or
greater, -10 or greater, or -8 or greater. For example, the
bioproduct can have a .delta..sup.13C of about -30 to about -15,
about -27 to about -19, about -25 to about -21, about -15 to about
-5, about -13 to about -7, or about -13 to about -10. In other
instances, the bioproduct can have a .delta..sup.13C of about -10,
-11, -12, or -12.3.
[0231] Bioproducts, including the bioproducts produced in
accordance with the disclosure herein, can also be distinguished
from petroleum based organic compounds by comparing the amount of
.sup.14C in each compound. Because .sup.14C has a nuclear half life
of 5730 years, petroleum based fuels containing "older" carbon can
be distinguished from bioproducts which contain "newer" carbon
(see, e.g., Currie, "Source Apportionment of Atmospheric
Particles", Characterization of Environmental Particles, J. Buffle
and H. P. van Leeuwen, Eds., 1 of Vol. I of the IUPAC Environmental
Analytical Chemistry Series (Lewis Publishers, Inc) 3-74,
(1992)).
[0232] The basic assumption in radiocarbon dating is that the
constancy of .sup.14C concentration in the atmosphere leads to the
constancy of .sup.14C in living organisms. However, because of
atmospheric nuclear testing since 1950 and the burning of fossil
fuel since 1850, .sup.14C has acquired a second, geochemical time
characteristic. Its concentration in atmospheric CO.sub.2, and
hence in the living biosphere, approximately doubled at the peak of
nuclear testing, in the mid-1960s. It has since been gradually
returning to the steady-state cosmogenic (atmospheric) baseline
isotope rate (.sup.14C/.sup.12C) of about 1.2.times.10.sup.-12,
with an approximate relaxation "half-life" of 7-10 years. (This
latter half-life must not be taken literally; rather, one must use
the detailed atmospheric nuclear input/decay function to trace the
variation of atmospheric and biospheric .sup.14C since the onset of
the nuclear age.)
[0233] It is this latter biospheric .sup.14C time characteristic
that holds out the promise of annual dating of recent biospheric
carbon. .sup.14C can be measured by accelerator mass spectrometry
(AMS), with results given in units of "fraction of modern carbon"
(f.sub.M). f.sub.M is defined by National Institute of Standards
and Technology (NIST) Standard Reference Materials (SRMs) 4990B and
4990C. As used herein, "fraction of modern carbon" or "f.sub.M" has
the same meaning as defined by National Institute of Standards and
Technology (NIST) Standard Reference Materials (SRMs) 4990B and
4990C, known as oxalic acids standards HOxI and HOxII,
respectively. The fundamental definition relates to 0.95 times the
.sup.14C/.sup.12C isotope ratio HOxI (referenced to AD 1950). This
is roughly equivalent to decay-corrected pre-Industrial Revolution
wood. For the current living biosphere (plant material), f.sub.M is
approximately 1.1.
[0234] The invention provides a bioproduct comprising one or more
fatty esters, which can have an f.sub.M .sup.14C of at least about
1. For example, the bioproduct of the invention can have an f.sub.M
.sup.14C of at least about 1.01, an f.sub.M .sup.14C of about 1 to
about 1.5, an f.sub.M .sup.14C of about 1.04 to about 1.18, or an
f.sub.M .sup.14C of about 1.111 to about 1.124.
[0235] Another measurement of .sup.14C is known as the percent of
modern carbon (pMC). For an archaeologist or geologist using
.sup.14C dates, AD 1950 equals "zero years old". This also
represents 100 pMC. "Bomb carbon" in the atmosphere reached almost
twice the normal level in 1963 at the peak of thermo-nuclear
weapons. Its distribution within the atmosphere has been
approximated since its appearance, showing values that are greater
than 100 pMC for plants and animals living since AD 1950. It has
gradually decreased over time with today's value being near 107.5
pMC. This means that a fresh biomass material, such as corn, would
give a .sup.14C signature near 107.5 pMC. Petroleum based compounds
will have a pMC value of zero. Combining fossil carbon with present
day carbon will result in a dilution of the present day pMC
content. By presuming 107.5 pMC represents the .sup.14C content of
present day biomass materials and 0 pMC represents the .sup.14C
content of petroleum based products, the measured pMC value for
that material will reflect the proportions of the two component
types. For example, a material derived 100% from present day
soybeans would give a radiocarbon signature near 107.5 pMC. If that
material was diluted 50% with petroleum based products, it would
give a radiocarbon signature of approximately 54 pMC.
[0236] A biologically based carbon content is derived by assigning
"100%" equal to 107.5 pMC and "0%" equal to 0 pMC. For example, a
sample measuring 99 pMC will give an equivalent biologically based
carbon content of 93%. This value is referred to as the mean
biologically based carbon result and assumes all the components
within the analyzed material originated either from present day
biological material or petroleum based material.
[0237] A bioproduct comprising one or more fatty esters as
described herein can have a pMC of at least about 50, 60, 70, 75,
80, 85, 90, 95, 96, 97, 98, 99, or 100. In other instances, a
bioproduct described herein can have a pMC of between about 50 and
about 100; about 60 and about 100; about 70 and about 100; about 80
and about 100; about 85 and about 100; about 87 and about 98; or
about 90 and about 95. In yet other instances, a bioproduct
described herein can have a pMC of about 90, 91, 92, 93, 94, or
94.2.
[0238] Fatty esters produced by the methods described herein can be
used as biofuels. For example, any fatty acid methyl ester
described herein can be used solely or as a component of
biodiesel.
[0239] The invention is further illustrated by the following
examples. The examples are provided for illustrative purposes only.
They are not to be construed as limiting the scope or content of
the invention in any way.
EXAMPLES
Example 1
Production of E. coli MG1655 .DELTA.fadE
[0240] This example describes the construction of a genetically
engineered microorganism wherein the expression of a fatty acid
degradation enzyme is attenuated.
[0241] The fadE gene of E. coli MG1655 was deleted using the Lambda
Red (also known as the Red-Driven Integration) system described in
Datsenko et al., Proc. Natl. Acad. Sci. USA 97: 6640-6645 (2000),
with the following modifications.
[0242] Two primers were used to create the deletion:
TABLE-US-00007 Del-fadE-F: (SEQ ID NO: 1)
5'-AAAAACAGCAACAATGTGAGCTTTGTTGTAATTATATTGTAAACAT
ATTGATTCCGGGGATCCGTCGACC-3' Del-fadE-R: (SEQ ID NO: 2)
5'-AAACGGAGCCTTTCGGCTCCGTTATTCATTTACGCGGCTTCAACTT
TCCTGTAGGCTGGAGCTGCTTC-3'
[0243] The Del-fadE-F and Del-fadE-R primers were used to amplify
the Kanamycin resistance (Km.sup.R) cassette from plasmid pKD13 (as
described in Datsenko et al., supra) by PCR. The PCR product was
then used to transform electrocompetent E. coli MG1655 cells
containing pKD46 (described in Datsenko et al., supra). These cells
had been previously induced with arabinose for 3-4 h. Following a
3-h outgrowth in a Super Optimal Broth with Catabolite repression
(SOC) medium at 37.degree. C., the cells were plated on Luria agar
plates containing 50 .mu.g/mL Kanamycin. Resistant colonies were
identified and isolated after an overnight incubation at 37.degree.
C. Disruption of the fadE gene was confirmed in select colonies
using PCR amplification with primers fadE-L2 and fadE-R1, which
were designed to flank the fadE gene:
TABLE-US-00008 (SEQ ID NO: 3) fadE-L2 5'-CGGGCAGGTGCTATGACCAGGAC-3'
(SEQ ID NO: 4) fadE-R1 5'-CGCGGCGTTGACCGGCAGCCTGG-3'
[0244] After the fadE deletion was confirmed, a single colony was
used to remove the Km.sup.R marker, using the pCP20 plasmid as
described in Datsenko et al., supra. The resulting MG1655 E. coli
strain with the fadE gene deleted and the Km.sup.R marker removed
was named E. coli MG1655 .DELTA.fadE, or E. coli MG1655 D1.
Example 2
Production of E. coli MG1655 .DELTA.fadE .DELTA.fhuA
[0245] This example describes the construction of a genetically
engineered microorganism in which the expression of a fatty acid
degradation enzyme and an outer membrane protein receptor are
attenuated.
[0246] The fhuA (also known as tonA) gene of E. coli MG1655, which
encodes a ferrichrome outer membrane transporter (GenBank Accession
No. NP.sub.--414692), was deleted from strain E. coli MG1655 D1 of
Example 1 using the Lambda Red system described in Datsenko et al.,
supra, but with the following modifications.
[0247] Two primers were used to create the deletion:
TABLE-US-00009 Del-fhuA-F: (SEQ ID NO: 5)
5'-ATCATTCTCGTTTACGTTATCATTCACTTTACATCAGAGATATACC
AATGATTCCGGGGATCCGTCGACC-3'; Del-fhuA-R: (SEQ ID NO: 6)
5'-GCACGGAAATCCGTGCCCCAAAAGAGAAATTAGAAACGGAAGGTT
GCGGTTGTAGGCTGGAGCTGCTTC-3'
[0248] The Del-fhuA-F and Del-fhuA-R primers were used to amplify
the Km.sup.R cassette from plasmid pKD13 by PCR. The PCR product
obtained was used to transform the electrocompetent E. coli MG1655
D1 cells containing pKD46 (see Example 1). These cells had been
previously induced with arabinose for 3-4 h. Following a 3-h
outgrowth in SOC medium at 37.degree. C., the cells were plated on
Luria agar plates containing 50 .mu.g/mL Kanamycin. Kanamycin
resistant colonies were identified and isolated after an overnight
incubation at 37.degree. C. Disruption of the fhuA gene was
confirmed in select colonies by PCR amplification with primers
fhuA-verF and fhuA-verR, which were designed to flank the fhuA
gene.
[0249] Confirmation of the deletion was performed using the
following primers:
TABLE-US-00010 fhuA-verF: 5'-CAACAGCAACCTGCTCAGCAA-3' (SEQ ID NO:
7) fhuA-verR: 5'-AAGCTGGAGCAGCAAAGCGTT-3' (SEQ ID NO: 8)
[0250] After the fhuA deletion was confirmed, a single colony was
used to remove the Km.sup.R marker, using the pCP20 plasmid as
described in Datsenko et al., supra. The resulting MG1655 E. coli
strain having the fadE and fhuA gene deletions was named E. coli
MG1655 .DELTA.fadE .DELTA.fhuA, or E. coli MG1655 DV2.
Example 3
Production of E. coli MG1655 .DELTA.fadE .DELTA.fhuA .DELTA.pflB
.DELTA.ldhA
[0251] This example describes the construction of a genetically
engineered microorganism in which the expression of an acyl-CoA
dehydrogenase, an outer membrane protein receptor, a pyruvate
formate lyase and a lactate dehydrogenase are attenuated.
[0252] The pflB gene of E. coli MG1655, which encodes a pyruvate
formate lyase (GenBank Accession No. AAC73989), was deleted from E.
coli MG1655 DV2 (see, Example 2) using the Lambda Red System
according to Datsenko et al., supra, but with the following
modifications:
[0253] The primers used to create the deletion strain were:
TABLE-US-00011 Del-pflB-F: (SEQ ID NO: 33)
5'-GCCGCAGCCTGATGGACAAAGCGTTCATTATGGTGCTGCCGGT
CGCGATGATTCCGGGGATCCGTCGACC-3' Del-pflB-R: (SEQ ID NO: 34)
5'-ATCTTCAACGGTAACTTCTTTACCGCCATGCGTGTCCCAGGTG
TCTGTAGGCTGGAGCTGCTTCG-3'
[0254] The Del-pflB-F and Del-pflB-R primers were used to amplify
the Kanamycin resistance (Km.sup.R) cassette from plasmid pKD13 by
PCR. The PCR product was then used to transform electrocompetent E.
coli MG1655 DV2 cells (see Example 2).
[0255] In parallel, the ldhA gene of E. coli MG1655, which encodes
a lactate dehydrogenase, specifically an NAD-linked fermentative
D-lactate dehydrogenase (see, e.g., Mat-Jan et al., J. Bacteriol.
171(1):342-8 (1989); Bunch et al., Microbiol. 143(1):187-95 (1997)
(GenBank Accession No. AAC74462) was also deleted from E. coli
MG1655 DV2 (see, Example 2) using the Lambda Red System according
to Datsenko et al., supra, but with the following
modifications.
[0256] Two primers were used to create the deletion:
TABLE-US-00012 Del-ldhA-F: (SEQ ID NO: 35)
5'-CTCCCCTGGAATGCAGGGGAGCGGCAAGATTAAACCAGTTCGT TCG
GGCAGTGTAGGCTGGAGCTGCTTCG-3' Del-ldhA-R: (SEQ ID NO: 36)
5'-TATTTTTAGTAGCTTAAATGTGATTCAACATCACTGGAGAAAG TC
TTATGCATATGAATATCCTCCTTAGTTCC-3'
[0257] The Del-ldhA-F and Del-ldhA-R primers were used to amplify
the chloramphenicol acetyltransferase resistance (Cm.sup.R)
cassette from plasmid pKD3 (see, Datsenko et al., supra) by PCR.
The PCR product was also used to transform electrocompetent E. coli
MG1655 DV2 cells (see, Example 2).
[0258] The E. coli MG1655 DV2 (see Example 2) cells had been
previously induced with arabinose for about 3-4 h. Following a 3-h
outgrowth in SOC medium at 37.degree. C., the cells were plated on
Luria agar plates containing 50 .mu.g/mL of kanamycin and 30
.mu.g/mL chloramphenicol. Colonies that were resistant to both
kanamycin and chloramphenicol were identified and isolated after an
overnight incubation at 37.degree. C. Disruption of the pflB gene
was confirmed using primers flanking the E. coli pflB gene, and
disruption of the ldhA gene was verified using primers flanking the
E. coli ldhA gene.
[0259] Confirmation of the deletion of pflB was performed using the
following primers:
TABLE-US-00013 pflB-verF: 5'-GGACTAAACGTCCTACAAAC-3' (SEQ ID NO:
37) PflB-verR: 5'-TTCATCTGTTTGAGATCGAG-3' (SEQ ID NO: 38)
[0260] Confirmation of the deletion of ldhA gene was performed
using the following primers:
TABLE-US-00014 (SEQ ID NO: 39) ldhA-verF:
5'-CCCGAGCGGTAGCCAGATGCCCGCCAGCG-3' (SEQ ID NO: 40) ldhA-verR:
5'-GCTGCGGGTTAGCGCACATCATACGGGTC-3'
[0261] After the deletions were confirmed, a single colony was used
to remove the Km.sup.R and Cm.sup.R markers in accordance with the
method described by Datsenko et al., supra. The resultant MG1655 E.
coli strain having fadE, fhuA, pflB and ldhA gene deletions was
named E. coli MG1655
.DELTA.fadE_.DELTA.fhuA_.DELTA.pflB_.DELTA.ldhA, or E. coli MG1655
DV4.
Example 4
Production of E. coli MG1655 .DELTA.fadE, .DELTA.fhuA,
lacI.sub.q-Ptrc-`tesA-fadD
[0262] This example describes the construction of a genetically
engineered microorganism in which nucleotide sequences encoding a
thioesterase, and an acyl-CoA synthase are integrated into the
microorganism's chromosome, under the control of a promoter.
[0263] Plasmid pGRG25 (GenBank Accession No. DQ460223) (SEQ ID
NO:9) was purified and subject to restriction digestions by NotI
and AvrII (New England Biolabs, Inc., Ipswich, Mass.). In parallel,
a plasmid pACYC-`tesA-fadD, which contained the lacI.sub.q,
Ptrc-`tesA-fadD cassette was constructed as follows:
[0264] `tesA is a nucleotide sequence comprising a leaderless E.
coli tesA gene (GenBank entry AAC73596, refseq accession U00096.2).
`tesA encodes an E. coli thioesterase (EC 3.1.1.5, 3.1.2.-) in
which the first twenty-five amino acids were deleted and the amino
acid in position 26, alanine, was replaced with methionine. That
methionine then became the first amino acid of `tesA. See Cho et
al., J. Biol. Chem., 270:4216-4219 (1995).
[0265] E. coli fadD (GenBank entry AAC74875; REFSEQ: accession
U00096.2) encodes an acyl-CoA synthase.
Construction of the `tesA Plasmid
[0266] `tesA was amplified from a pETDuet-1-`tesA plasmid
constructed as described below. (see also, e.g., WO 2007/136762 A2,
which is incorporated by reference). The `tesA gene was cloned into
an NdeI/AvrII digested pETDuet-1 plasmid (Novagen, Madison,
Wis.).
Construction of the fadD Plasmid
[0267] The fadD gene was amplified from a pHZ1.61 plasmid
constructed as described below. A fadD gene was cloned into a
pCDFDuet-1 plasmid (Novagen, Madison, Wis.) under the control of a
T7 promoter, generating a pHZ1.61 plasmid containing the following
nucleotide sequence:
TABLE-US-00015 (SEQ ID NO: 10)
GGGGAATTGTGAGCGGATAACAATTCCCCTGTAGAAATAATTTTGTTT
AACTTTAATAAGGAGATATACCATGGTGAAGAAGGTTTGGCTTAACCG
TTATCCCGCGGACGTTCCGACGGAGATCAACCCTGACCGTTATCAATC
TCTGGTAGATATGTTTGAGCAGTCGGTCGCGCGCTACGCCGATCAACC
TGCGTTTGTGAATATGGGGGAGGTAATGACCTTCCGCAAGCTGGAAGA
ACGCAGTCGCGCGTTTGCCGCTTATTTGCAACAAGGGTTGGGGCTGAA
GAAAGGCGATCGCGTTGCGTTGATGATGCCTAATTTATTGCAATATCC
GGTGGCGCTGTTTGGCATTTTGCGTGCCGGGATGATCGTCGTAAACGT
TAACCCGTTGTATACCCCGCGTGAGCTTGAGCATCAGCTTAACGATAG
CGGCGCATCGGCGATTGTTATCGTGTCTAACTTTGCTCACACACTGGA
AAAAGTGGTTGATAAAACCGCCGTTCAGCACGTAATTCTGACCCGTAT
GGGCGATCAGCTATCTACGGCAAAAGGCACGGTAGTCAATTTCGTTGT
TAAATACATCAAGCGTTTGGTGCCGAAATACCATCTGCCAGATGCCAT
TTCATTTCGTAGCGCACTGCATAACGGCTACCGGATGCAGTACGTCAA
ACCCGAACTGGTGCCGGAAGATTTAGCTTTTCTGCAATACACCGGCGG
CACCACTGGTGTGGCGAAAGGCGCGATGCTGACTCACCGCAATATGCT
GGCGAACCTGGAACAGGTTAACGCGACCTATGGTCCGCTGTTGCATCC
GGGCAAAGAGCTGGTGGTGACGGCGCTGCCGCTGTATCACATTTTTGC
CCTGACCATTAACTGCCTGCTGTTTATCGAACTGGGTGGGCAGAACCT
GCTTATCACTAACCCGCGCGATATTCCAGGGTTGGTAAAAGAGTTAGC
GAAATATCCGTTTACCGCTATCACGGGCGTTAACACCTTGTTCAATGC
GTTGCTGAACAATAAAGAGTTCCAGCAGCTGGATTTCTCCAGTCTGCA
TCTTTCCGCAGGCGGAGGGATGCCAGTGCAGCAAGTGGTGGCAGAGCG
TTGGGTGAAACTGACAGGACAGTATCTGCTGGAAGGCTATGGCCTTAC
CGAGTGTGCGCCGCTGGTCAGCGTTAACCCATATGATATTGATTATCA
TAGTGGTAGCATCGGTTTGCCGGTGCCGTCGACGGAAGCCAAACTGGT
GGATGATGATGATAATGAAGTACCACCGGGTCAACCGGGTGAGCTTTG
TGTCAAAGGACCGCAGGTGATGCTGGGTTACTGGCAGCGTCCGGATGC
TACAGATGAGATCATCAAAAATGGCTGGTTACACACCGGCGACATCGC
GGTGATGGATGAAGAAGGGTTCCTGCGCATTGTCGATCGTAAAAAAGA
CATGATTCTGGTTTCCGGTTTTAACGTCTATCCCAACGAGATTGAAGA
TGTCGTCATGCAGCATCCTGGCGTACAGGAAGTCGCGGCTGTTGGCGT
ACCTTCCGGCTCCAGTGGTGAAGCGGTGAAAATCTTCGTAGTGAAAAA
AGATCCATCGCTTACCGAAGAGTCACTGGTGACCTTTTGCCGCCGTCA
GCTCACGGGCTACAAAGTACCGAAGCTGGTGGAGTTTCGTGATGAGTT
ACCGAAATCTAACGTCGGAAAAATTTTGCGACGAGAATTACGTGACGA
AGCGCGCGGCAAAGTGGACAATAAAGCCTGAAAGCTTGCGGCCGCATA
ATGCTTAAGTCGAACAGAAAGTAATCGTATTGTACACGGCCGCATAAT
CGAAATTAATACGACTCACTATAGGGGAATTGTGAGCGGATAACAATT
CCCCATCTTAGTATATTAGTTAAGTATAAGAAGGAGATATACATATGC
GCCCATTACATCCGATTGATTTTATATTCCTGTCACTAGAAAAAAGAC
AACAGCCTATGCATGTAGGTGGTTTATTTTTGTTTCAGATTCCTGATA
ACGCCCCAGACACCTTTATTCAAGATCTGGTGAATGATATCCGGATAT
CAAAATCAATCCCTGTTCCACCATTCAACAATAAACTGAATGGGCTTT
TTTGGGATGAAGATGAAGAGTTTGATTTAGATCATCATTTTCGTCATA
TTGCACTGCCTCATCCTGGTCGTATTCGTGAATTGCTTATTTATATTT
CACAAGAGCACAGTACGCTGCTAGATCGGGCAAAGCCCTTGTGGACCT
GCAATATTATTGAAGGAATTGAAGGCAATCGTTTTGCCATGTACTTCA
AAATTCACCATGCGATGGTCGATGGCGTTGCTGGTATGCGGTTAATTG
AAAAATCACTCTCCCATGATGTAACAGAAAAAAGTATCGTGCCACCTT
GGTGTGTTGAGGGAAAACGTGCAAAGCGCTTAAGAGAACCTAAAACAG
GTAAAATTAAGAAAATCATGTCTGGTATTAAGAGTCAGCTTCAGGCGA
CACCCACAGTCATTCAAGAGCTTTCTCAGACAGTATTTAAAGATATTG
GACGTAATCCTGATCATGTTTCAAGCTTTCAGGCGCCTTGTTCTATTT
TGAATCAGCGTGTGAGCTCATCGCGACGTTTTGCAGCACAGTCTTTTG
ACCTAGATCGTTTTCGTAATATTGCCAAATCGTTGAATGTGACCATTA
ATGATGTTGTACTAGCGGTATGTTCTGGTGCATTACGTGCGTATTTGA
TGAGTCATAATAGTTTGCCTTCAAAACCATTAATTGCCATGGTTCCAG
CCTCTATTCGCAATGACGATTCAGATGTCAGCAACCGTATTACGATGA
TTCTGGCAAATTTGGCAACCCACAAAGATGATCCTTTACAACGTCTTG
AAATTATCCGCCGTAGTGTTCAAAACTCAAAGCAACGCTTCAAACGTA
TGACCAGCGATCAGATTCTAAATTATAGTGCTGTCGTATATGGCCCTG
CAGGACTCAACATAATTTCTGGCATGATGCCAAAACGCCAAGCCTTCA
ATCTGGTTATTTCCAATGTGCCTGGCCCAAGAGAGCCACTTTACTGGA
ATGGTGCCAAACTTGATGCACTCTACCCAGCTTCAATTGTATTAGACG
GTCAAGCATTGAATATTACAATGACCAGTTATTTAGATAAACTTGAAG
TTGGTTTGATTGCATGCCGTAATGCATTGCCAAGAATGCAGAATTTAC
TGACACATTTAGAAGAAGAAATTCAACTATTTGAAGGCGTAATTGCAA
AGCAGGAAGATATTAAAACAGCCAATTAAAAACAATAAACTTGATTTT
TTAATTTATCAGATAAAACTAAAGGGCTAAATTAGCCCTCCTAGGCTG
CTGCCACCGCTGAGCAATAACTAGCATAACCCCTTGGGGCCTCTAAAC
GGGTCTTGAGGGGTTTTTTGCTGAAACCTCAGGCATTTGAGAAGCACA
CGGTCACACTGCTTCCGGTAGTCAATAAACCGGTAAACCAGCAATAGA
CATAAGCGGCTATTTAACGACCCTGCCCTGAACCGACGACCGGGTCAT
CGTGGCCGGATCTTGCGGCCCCTCGGCTTGAACGAATTGTTAGACATT
ATTTGCCGACTACCTTGGTGATCTCGCCTTTCACGTAGTGGACAAATT
CTTCCAACTGATCTGCGCGCGAGGCCAAGCGATCTTCTTCTTGTCCAA
GATAAGCCTGTCTAGCTTCAAGTATGACGGGCTGATACTGGGCCGGCA
GGCGCTCCATTGCCCAGTCGGCAGCGACATCCTTCGGCGCGATTTTGC
CGGTTACTGCGCTGTACCAAATGCGGGACAACGTAAGCACTACATTTC
GCTCATCGCCAGCCCAGTCGGGCGGCGAGTTCCATAGCGTTAAGGTTT
CATTTAGCGCCTCAAATAGATCCTGTTCAGGAACCGGATCAAAGAGTT
CCTCCGCCGCTGGACCTACCAAGGCAACGCTATGTTCTCTTGCTTTTG
TCAGCAAGATAGCCAGATCAATGTCGATCGTGGCTGGCTCGAAGATAC
CTGCAAGAATGTCATTGCGCTGCCATTCTCCAAATTGCAGTTCGCGCT
TAGCTGGATAACGCCACGGAATGATGTCGTCGTGCACAACAATGGTGA
CTTCTACAGCGCGGAGAATCTCGCTCTCTCCAGGGGAAGCCGAAGTTT
CCAAAAGGTCGTTGATCAAAGCTCGCCGCGTTGTTTCATCAAGCCTTA
CGGTCACCGTAACCAGCAAATCAATATCACTGTGTGGCTTCAGGCCGC
CATCCACTGCGGAGCCGTACAAATGTACGGCCAGCAACGTCGGTTCGA
GATGGCGCTCGATGACGCCAACTACCTCTGATAGTTGAGTCGATACTT
CGGCGATCACCGCTTCCCTCATACTCTTCCTTTTTCAATATTATTGAA
GCATTTATCAGGGTTATTGTCTCATGAGCGGATACATATTTGAATGTA
TTTAGAAAAATAAACAAATAGCTAGCTCACTCGGTCGCTACGCTCCGG
GCGTGAGACTGCGGCGGGCGCTGCGGACACATACAAAGTTACCCACAG
ATTCCGTGGATAAGCAGGGGACTAACATGTGAGGCAAAACAGCAGGGC
CGCGCCGGTGGCGTTTTTCCATAGGCTCCGCCCTCCTGCCAGAGTTCA
CATAAACAGACGCTTTTCCGGTGCATCTGTGGGAGCCGTGAGGCTCAA
CCATGAATCTGACAGTACGGGCGAAACCCGACAGGACTTAAAGATCCC
CACCGTTTCCGGCGGGTCGCTCCCTCTTGCGCTCTCCTGTTCCGACCC
TGCCGTTTACCGGATACCTGTTCCGCCTTTCTCCCTTACGGGAAGTGT
GGCGCTTTCTCATAGCTCACACACTGGTATCTCGGCTCGGTGTAGGTC
GTTCGCTCCAAGCTGGGCTGTAAGCAAGAACTCCCCGTTCAGCCCGAC
TGCTGCGCCTTATCCGGTAACTGTTCACTTGAGTCCAACCCGGAAAAG
CACGGTAAAACGCCACTGGCAGCAGCCATTGGTAACTGGGAGTTCGCA
GAGGATTTGTTTAGCTAAACACGCGGTTGCTCTTGAAGTGTGCGCCAA
AGTCCGGCTACACTGGAAGGACAGATTTGGTTGCTGTGCTCTGCGAAA
GCCAGTTACCACGGTTAAGCAGTTCCCCAACTGACTTAACCTTCGATC
AAACCACCTCCCCAGGTGGTTTTTTCGTTTACAGGGCAAAAGATTACG
CGCAGAAAAAAAGGATCTCAAGAAGATCCTTTGATCTTTTCTACTGAA
CCGCTCTAGATTTCAGTGCAATTTATCTCTTCAAATGTAGCACCTGAA
GTCAGCCCCATACGATATAAGTTGTAATTCTCATGTTAGTCATGCCCC
GCGCCCACCGGAAGGAGCTGACTGGGTTGAAGGCTCTCAAGGGCATCG
GTCGAGATCCCGGTGCCTAATGAGTGAGCTAACTTACATTAATTGCGT
TGCGCTCACTGCCCGCTTTCCAGTCGGGAAACCTGTCGTGCCAGCTGC
ATTAATGAATCGGCCAACGCGCGGGGAGAGGCGGTTTGCGTATTGGGC
GCCAGGGTGGTTTTTCTTTTCACCAGTGAGACGGGCAACAGCTGATTG
CCCTTCACCGCCTGGCCCTGAGAGAGTTGCAGCAAGCGGTCCACGCTG
GTTTGCCCCAGCAGGCGAAAATCCTGTTTGATGGTGGTTAACGGCGGG
ATATAACATGAGCTGTCTTCGGTATCGTCGTATCCCACTACCGAGATG
TCCGCACCAACGCGCAGCCCGGACTCGGTAATGGCGCGCATTGCGCCC
AGCGCCATCTGATCGTTGGCAACCAGCATCGCAGTGGGAACGATGCCC
TCATTCAGCATTTGCATGGTTTGTTGAAAACCGGACATGGCACTCCAG
TCGCCTTCCCGTTCCGCTATCGGCTGAATTTGATTGCGAGTGAGATAT
TTATGCCAGCCAGCCAGACGCAGACGCGCCGAGACAGAACTTAATGGG
CCCGCTAACAGCGCGATTTGCTGGTGACCCAATGCGACCAGATGCTCC
ACGCCCAGTCGCGTACCGTCTTCATGGGAGAAAATAATACTGTTGATG
GGTGTCTGGTCAGAGACATCAAGAAATAACGCCGGAACATTAGTGCAG
GCAGCTTCCACAGCAATGGCATCCTGGTCATCCAGCGGATAGTTAATG
ATCAGCCCACTGACGCGTTGCGCGAGAAGATTGTGCACCGCCGCTTTA
CAGGCTTCGACGCCGCTTCGTTCTACCATCGACACCACCACGCTGGCA
CCCAGTTGATCGGCGCGAGATTTAATCGCCGCGACAATTTGCGACGGC
GCGTGCAGGGCCAGACTGGAGGTGGCAACGCCAATCAGCAACGACTGT
TTGCCCGCCAGTTGTTGTGCCACGCGGTTGGGAATGTAATTCAGCTCC
GCCATCGCCGCTTCCACTTTTTCCCGCGTTTTCGCAGAAACGTGGCTG
GCCTGGTTCACCACGCGGGAAACGGTCTGATAAGAGACACCGGCATAC
TCTGCGACATCGTATAACGTTACTGGTTTCACATTCACCACCCTGAAT
TGACTCTCTTCCGGGCGCTATCATGCCATACCGCGAAAGGTTTTGCGC
CATTCGATGGTGTCCGGGATCTCGACGCTCTCCCTTATGCGACTCCTG
CATTAGGAAATTAATACGACTCACTATA
Construction of pACYC-Ptrc Plasmid Containing `tesA and fadD
[0268] A pACYC-Ptrc vector having the following sequence was used
to construct a pACYC-Ptrc-`tesA-fadD plasmid. The nucleotide
sequence of the pACYC-Ptrc vector is as follows:
TABLE-US-00016 (SEQ ID NO: 11)
ACTCACCAGTCACAGAAAAGCATCTTACGGATGGCATGACAGTAAGAG
AATTATGCAGTGCTGCCATAACCATGAGTGATAACACTGCGGCCAACT
TACTTCTGACAACGATCGGAGGACCGAAGGAGCTAACCGCTTTTTTGC
ACAACATGGGGGATCATGTAACTCGCCTTGATCGTTGGGAACCGGAGC
TGAATGAAGCCATACCAAACGACGAGCGTGACACCACGATGCCTGCAG
CAATGGCAACAACGTTGCGCAAACTATTAACTGGCGAACTACTTACTC
TAGCTTCCCGGCAACAATTAATAGACTGGATGGAGGCGGATAAAGTTG
CAGGACCACTTCTGCGCTCGGCCCTTCCGGCTGGCTGGTTTATTGCTG
ATAAATCTGGAGCCGGTGAGCGTGGGTCTCGCGGTATCATTGCAGCAC
TGGGGCCAGATGGTAAGCCCTCCCGTATCGTAGTTATCTACACGACGG
GGAGTCAGGCAACTATGGATGAACGAAATAGACAGATCGCTGAGATAG
GTGCCTCACTGATTAAGCATTGGTAACTGTCAGACCAAGTTTACTCAT
ATATACTTTAGATTGATTTAAAACTTCATTTTTAATTTAAAAGGATCT
AGGTGAAGATCCTTTTTGATAATCTCATGACCAAAATCCCTTAACGTG
AGTTTTCGTTCCACTGAGCGTCAGACCCCTTAATAAGATGATCTTCTT
GAGATCGTTTTGGTCTGCGCGTAATCTCTTGCTCTGAAAACGAAAAAA
CCGCCTTGCAGGGCGGTTTTTCGAAGGTTCTCTGAGCTACCAACTCTT
TGAACCGAGGTAACTGGCTTGGAGGAGCGCAGTCACCAAAACTTGTCC
TTTCAGTTTAGCCTTAACCGGCGCATGACTTCAAGACTAACTCCTCTA
AATCAATTACCAGTGGCTGCTGCCAGTGGTGCTTTTGCATGTCTTTCC
GGGTTGGACTCAAGACGATAGTTACCGGATAAGGCGCAGCGGTCGGAC
TGAACGGGGGGTTCGTGCATACAGTCCAGCTTGGAGCGAACTGCCTAC
CCGGAACTGAGTGTCAGGCGTGGAATGAGACAAACGCGGCCATAACAG
CGGAATGACACCGGTAAACCGAAAGGCAGGAACAGGAGAGCGCACGAG
GGAGCCGCCAGGGGGAAACGCCTGGTATCTTTATAGTCCTGTCGGGTT
TCGCCACCACTGATTTGAGCGTCAGATTTCGTGATGCTTGTCAGGGGG
GCGGAGCCTATGGAAAAACGGCTTTGCCGCGGCCCTCTCACTTCCCTG
TTAAGTATCTTCCTGGCATCTTCCAGGAAATCTCCGCCCCGTTCGTAA
GCCATTTCCGCTCGCCGCAGTCGAACGACCGAGCGTAGCGAGTCAGTG
AGCGAGGAAGCGGAATATATCCTGTATCACATATTCTGCTGACGCACC
GGTGCAGCCTTTTTTCTCCTGCCACATGAAGCACTTCACTGACACCCT
CATCAGTGCCAACATAGTAAGCCAGTATACACTCCGCTAGCGCTGAGG
TCTGCCTCGTGAAGAAGGTGTTGCTGACTCATACCAGGCCTGAATCGC
CCCATCATCCAGCCAGAAAGTGAGGGAGCCACGGTTGATGAGAGCTTT
GTTGTAGGTGGACCAGTTGGTGATTTTGAACTTTTGCTTTGCCACGGA
ACGGTCTGCGTTGTCGGGAAGATGCGTGATCTGATCCTTCAACTCAGC
AAAAGTTCGATTTATTCAACAAAGCCACGTTGTGTCTCAAAATCTCTG
ATGTTACATTGCACAAGATAAAAATATATCATCATGAACAATAAAACT
GTCTGCTTACATAAACAGTAATACAAGGGGTGTTATGAGCCATATTCA
ACGGGAAACGTCTTGCTCGAGGCCGCGATTAAATTCCAACATGGATGC
TGATTTATATGGGTATAAATGGGCTCGCGATAATGTCGGGCAATCAGG
TGCGACAATCTATCGATTGTATGGGAAGCCCGATGCGCCAGAGTTGTT
TCTGAAACATGGCAAAGGTAGCGTTGCCAATGATGTTACAGATGAGAT
GGTCAGACTAAACTGGCTGACGGAATTTATGCCTCTTCCGACCATCAA
GCATTTTATCCGTACTCCTGATGATGCATGGTTACTCACCACTGCGAT
CCCCGGGAAAACAGCATTCCAGGTATTAGAAGAATATCCTGATTCAGG
TGAAAATATTGTTGATGCGCTGGCAGTGTTCCTGCGCCGGTTGCATTC
GATTCCTGTTTGTAATTGTCCTTTTAACAGCGATCGCGTATTTCGTCT
CGCTCAGGCGCAATCACGAATGAATAACGGTTTGGTTGATGCGAGTGA
TTTTGATGACGAGCGTAATGGCTGGCCTGTTGAACAAGTCTGGAAAGA
AATGCATAAGCTTTTGCCATTCTCACCGGATTCAGTCGTCACTCATGG
TGATTTCTCACTTGATAACCTTATTTTTGACGAGGGGAAATTAATAGG
TTGTATTGATGTTGGACGAGTCGGAATCGCAGACCGATACCAGGATCT
TGCCATCCTATGGAACTGCCTCGGTGAGTTTTCTCCTTCATTACAGAA
ACGGCTTTTTCAAAAATATGGTATTGATAATCCTGATATGAATAAATT
GCAGTTTCATTTGATGCTCGATGAGTTTTTCTAATCAGAATTGGTTAA
TTGGTTGTAACACTGGCAGAGCATTACGCTGACTTGACGGGACGGCGG
CTTTGTTGAATAAATCGAACTTTTGCTGAGTTGAAGGATCAGATCACG
CATCTTCCCGACAACGCAGACCGTTCCGTGGCAAAGCAAAAGTTCAAA
ATCACCAACTGGTCCACCTACAACAAAGCTCTCATCAACCGTGGCTCC
CTCACTTTCTGGCTGGATGATGGGGCGATTCAGGCCTGGTATGAGTCA
GCAACACCTTCTTCACGAGGCAGACCTCAGCGCTCAAAGATGCAGGGG
TAAAAGCTAACCGCATCTTTACCGACAAGGCATCCGGCAGTTCAACAG
ATCGGGAAGGGCTGGATTTGCTGAGGATGAAGGTGGAGGAAGGTGATG
TCATTCTGGTGAAGAAGCTCGACCGTCTTGGCCGCGACACCGCCGACA
TGATCCAACTGATAAAAGAGTTTGATGCTCAGGGTGTAGCGGTTCGGT
TTATTGACGACGGGATCAGTACCGACGGTGATATGGGGCAAATGGTGG
TCACCATCCTGTCGGCTGTGGCACAGGCTGAACGCCGGAGGATCCTAG
AGCGCACGAATGAGGGCCGACAGGAAGCAAAGCTGAAAGGAATCAAAT
TTGGCCGCAGGCGTACCGTGGACAGGAACGTCGTGCTGACGCTTCATC
AGAAGGGCACTGGTGCAACGGAAATTGCTCATCAGCTCAGTATTGCCC
GCTCCACGGTTTATAAAATTCTTGAAGACGAAAGGGCCTCGTGATACG
CCTATTTTTATAGGTTAATGTCATGATAATAATGGTTTCTTAGACGTC
TTAATTAATCAGGAGAGCGTTCACCGACAAACAACAGATAAAACGAAA
GGCCCAGTCTTTCGACTGAGCCTTTCGTTTTATTTGATGCCTGGCAGT
TCCCTACTCTCGCATGGGGAGACCCCACACTACCATCGGCGCTACGGC
GTTTCACTTCTGAGTTCGGCATGGGGTCAGGTGGGACCACCGCGCTAC
TGCCGCCAGGCAAATTCTGTTTTATCAGACCGCTTCTGCGTTCTGATT
TAATCTGTATCAGGCTGAAAATCTTCTCTCATCCGCCAAAACAGCCAA
GCTGGAGACCGTTTAAACTCAATGATGATGATGATGATGGTCGACGGC
GCTATTCAGATCCTCTTCTGAGATGAGTTTTTGTTCGGGCCCAAGCTT
CGAATTCCCATATGGTACCAGCTGCAGATCTCGAGCTCGGATCCATGG
TTTATTCCTCCTTATTTAATCGATACATTAATATATACCTCTTTAATT
TTTAATAATAAAGTTAATCGATAATTCCGGTCGAGTGCCCACACAGAT
TGTCTGATAAATTGTTAAAGAGCAGTGCCGCTTCGCTTTTTCTCAGCG
GCGCTGTTTCCTGTGTGAAATTGTTATCCGCTCACAATTCCACACATT
ATACGAGCCGGATGATTAATTGTCAACAGCTCATTTCAGAATATTTGC
CAGAACCGTTATGATGTCGGCGCAAAAAACATTATCCAGAACGGGAGT
GCGCCTTGAGCGACACGAATTATGCAGTGATTTACGACCTGCACAGCC
ATACCACAGCTTCCGATGGCTGCCTGACGCCAGAAGCATTGGTGCACC
GTGCAGTCGATGATAAGCTGTCAAACCAGATCAATTCGCGCTAACTCA
CATTAATTGCGTTGCGCTCACTGCCCGCTTTCCAGTCGGGAAACCTGT
CGTGCCAGCTGCATTAATGAATCGGCCAACGCGCGGGGAGAGGCGGTT
TGCGTATTGGGCGCCAGGGTGGTTTTTCTTTTCACCAGTGAGACGGGC
AACAGCTGATTGCCCTTCACCGCCTGGCCCTGAGAGAGTTGCAGCAAG
CGGTCCACGCTGGTTTGCCCCAGCAGGCGAAAATCCTGTTTGATGGTG
GTTGACGGCGGGATATAACATGAGCTGTCTTCGGTATCGTCGTATCCC
ACTACCGAGATATCCGCACCAACGCGCAGCCCGGACTCGGTAATGGCG
CGCATTGCGCCCAGCGCCATCTGATCGTTGGCAACCAGCATCGCAGTG
GGAACGATGCCCTCATTCAGCATTTGCATGGTTTGTTGAAAACCGGAC
ATGGCACTCCAGTCGCCTTCCCGTTCCGCTATCGGCTGAATTTGATTG
CGAGTGAGATATTTATGCCAGCCAGCCAGACGCAGACGCGCCGAGACA
GAACTTAATGGGCCCGCTAACAGCGCGATTTGCTGGTGACCCAATGCG
ACCAGATGCTCCACGCCCAGTCGCGTACCGTCTTCATGGGAGAAAATA
ATACTGTTGATGGGTGTCTGGTCAGAGACATCAAGAAATAACGCCGGA
ACATTAGTGCAGGCAGCTTCCACAGCAATGGCATCCTGGTCATCCAGC
GGATAGTTAATGATCAGCCCACTGACGCGTTGCGCGAGAAGATTGTGC
ACCGCCGCTTTACAGGCTTCGACGCCGCTTCGTTCTACCATCGACACC
ACCACGCTGGCACCCAGTTGATCGGCGCGAGATTTAATCGCCGCGACA
ATTTGCGACGGCGCGTGCAGGGCCAGACTGGAGGTGGCAACGCCAATC
AGCAACGACTGTTTGCCCGCCAGTTGTTGTGCCACGCGGTTGGGAATG
TAATTCAGCTCCGCCATCGCCGCTTCCACTTTTTCCCGCGTTTTCGCA
GAAACGTGGCTGGCCTGGTTCACCACGCGGGAAACGGTCTGATAAGAG
ACACCGGCATACTCTGCGACATCGTATAACGTTACTGGTTTCACATTC
ACCACCCTGAATTGACTCTCTTCCGGGCGCTATCATGCCATACCGCGA
AAGGTTTTGCACCATTCGATGGTGTCAACGTAAATGCATGCCGCTTCG
CCTTCGCGCGCGAATTGATCTGCTGCCTCGCGCGTTTCGGTGATGACG
GTGAAAACCTCTGACACATGCAGCTCCCGGAGACGGTCACAGCTTGTC
TGTAAGCGGATGCCGGGAGCAGACAAGCCCGTCAGGGCGCGTCAGCGG
GTGTTGGCGGGGCCGGCCTCG
[0269] The `tesA, and fadD genes were amplified using high fidelity
Phusion.TM. polymerase (New England Biolabs, Inc., Ipswich, Mass.),
with the following primers from their respective plasmids,
pETDuet-1-`tesA and pHZ1.61:
TABLE-US-00017 `tesAForward- (SEQ ID NO: 12)
5'-CTCTAGAAATAATTTAACTTTAAGTAGGAGAUAGGTACCCATGGC GGACACGTTATTGAT-3'
`tesAReverse- (SEQ ID NO: 13)
5'-CTTCGAATTCCATTTAAATTATTTCTAGAGTCATTATGAGTCATG ATTTACTAAAGGC-3'
fadDForward- (SEQ ID NO: 14)
5'-CTCTAGAAATAATTTTAGTTAAGTATAAGAAGGAGATATACCATG
GTGAAGAAGGTTTGGCTTAA-3' fadDReverse- (SEQ ID NO: 15)
5'-CTTCGAATTCCATTTAAATTATTTCTAGAGTTATCAGGCTTTATT GTCCAC-3'
Insertion of `tesA into pACYC-Ptrc Plasmid
[0270] Using NcoI and EcoRI sites on both the insert and vector,
the `tesA PCR product amplified from pETDuet-1-`tesA was cloned
into the initial position of pACYC-Ptrc vector (SEQ ID NO:11). A T4
DNA ligase (New England Biolabs, Ipswich, Mass.) was then used to
ligate the pACYC-Ptrc vector and `tesA, producing a
pACYC-Ptrc-`tesA plasmid. Following overnight ligation, the DNA
product was transformed into TOP 10.RTM. One Shot.RTM. cells
(Invitrogen, Carlsbad, Calif.). The insertion of `tesA into the
pACYC-Ptrc vector was confirmed by restriction digestion. An SwaI
restriction site as well as overlapping fragments for In-Fusion.TM.
cloning (Clontech, Mountain View, Calif.) were also created as a
result at the 3'-end of the `tesA insert.
Construction of pACYC-Ptrc-`tesA-fadD
[0271] The pACYC-Ptrc-`tesA plasmid was then subject to an
overnight restriction digestion by SwaI. fadD amplified the plasmid
pHZ1.61 (described above) was cloned downstream from the `tesA gene
using the In-Fusion.TM. PCR Cloning System (Clontech, Menlo Park,
Calif.). The insertion of fadD was verified with restriction
digestion. The insertion of fadD destroys the SwaI site following
the `tesA gene, but recreates a new SwaI site at the 3' end of
fadD.
[0272] The pACYC-Ptrc-`tesA-fadD plasmid was used as a template to
generate a Ptrc-`tesA-fadD cassette. The following primers were
used to obtain the cassette:
TABLE-US-00018 IFF: (SEQ ID NO: 16)
5'-GGGTCAATAGCGGCCGCCAATTCGCGCGCGAAGGCG-3' IFR: (SEQ ID NO: 17)
5'-TGGCGCGCCTCCTAGGGCATTACGCTGACTTGACGGG-3'
[0273] This cassette was subsequently cloned into the NotI and
AvrII restriction sites of pGRG25 (described above) using the
Infusion.TM. PCR cloning system (Clontech, Mountain View, Calif.),
creating the Tn7tesfad plasmid (SEQ ID NO:18), wherein the
lacI.sub.q, Ptrc-`tesA-fadD genes were flanked by the left and
right Tn7 ends.
[0274] The plasmid Tn7tesfad was electroporated into strain E. coli
MG1655 DV2 (Example 2, above) using a protocol described by
McKenzie et al., BMC Microbiology 6:39 (2006). After
electroporation, Ampicillin-resistant cells were selected by growth
in an LB medium containing 0.1% glucose and 100 .mu.g/mL
carbenicillin at 32.degree. C. overnight. This was followed by
selection of plasmids comprising the Tn7-transposition fractions,
using the growth of cells on an LB plus 0.1% arabinose plates
overnight at 32.degree. C. Single colonies were selected and
streaked onto new LB medium plates with and without Ampicillin, and
they were grown overnight at 42.degree. C. to cure of Tn7tesfad
plasmid. Thus, the lacI.sub.q, Ptrc-`tesA-fadD genes were
integrated into the attTn7 site on the E. coli MG1655 chromosome
located between the pstS and glmS genes. Integration of these genes
was confirmed by PCR and sequencing using the following
primers:
TABLE-US-00019 attTn7.A: 5'-GATGCTGGTGGCGAAGCTGT-3' (SEQ ID NO: 19)
attTn7.C: 5'-GTTGCGACGGTGGTACGCATAAC-3' (SEQ ID NO: 20)
[0275] The resulting strain was given the name E. coli MG1655
DAM1.
Example 5
Production of E. coli MG1655 DAM1/pDS57
[0276] A plasmid pDS57 (SEQ ID NO:41) was prepared as described
below.
[0277] An ester synthase gene encoding an ester synthase ES9 from
Marinobacter hydrocarbonoclasticus DSM8789 (GenBank Accession No.
ABO21021) was synthesized by DNA2.0 (Menlo Park, Calif.). The
synthesized gene was then cloned into a pCOLADuet-1 plasmid (EMD
Chemicals, Inc., Gibbstown, N.J.) to form a pHZ1.97-ES9 construct.
The internal BspHI restriction site of the ester synthase gene was
then removed by site-directed mutagenesis, using the QuikChange.TM.
Multi Kit (Stratagene, Carlsbad, Calif.) and the primer:
TABLE-US-00020 (SEQ ID NO: 21) ES9BspF:
5'-CCCAGATCAGTTTTATGATTGCCTCGCTGG-3'
[0278] This primer introduced a silent mutation into the ester
synthase gene. The resulting plasmid was called pDS32.
[0279] pDS32 was then used as a template to amplify the ester
synthase gene using the following primers:
TABLE-US-00021 ES9BspH-Forward: 5'-ATCATGAAACGTCTCGGAAC-3' (SEQ ID
NO: 22) ES9Xho-Reverse: 5'-CCTCGAGTTACTTGCGGGTTCGGGCGCG-3' (SEQ ID
NO: 23)
[0280] The PCR product was subject to restriction digestions with
BspHI and XhoI. This digestion fragment was then ligated into a
pDS23 plasmid (as described below) that had been digested with NcoI
and XhoI, to form a plasmid pDS33.ES9 (SEQ ID NO:24).
Construction of pDS23
[0281] A Pspc promoter (SEQ ID NO:25) was obtained by PCR
amplification, using Phusion.TM. Polymerase (New England Biolabs,
Inc., Ipswich, Mass.) from E. coli MG1655 chromosomal DNA. The
following primers were used:
TABLE-US-00022 PspcIFF: (SEQ ID NO: 26)
5'-AAAGGATGTCGCAAACGCTGTTTCAGTACACTCTCTCAATAC-3' PspcIFR: (SEQ ID
NO: 27) 5'-GAGCTCGGATCCATGGTTTAGTGCTCCGCTAATG-3'
[0282] The PCR fragment was then used to replace the lacI.sub.q and
Ptrc promoter sequences of a plasmid OP-80 (SEQ ID NO:28), which
was constructed as described below:
Construction of Plasmid OP-80:
[0283] A commercial vector pCL1920 (see, Lerner, et al., Nucleic
Acids Res. 18:4631 (1990)), carrying a strong transcriptional
promoter, was used as the starting point. The pCL1920 vector was
digested with AflII and sfoI (New England Biolabs, Ipswich, Mass.).
Three DNA fragments were produced, among which, a 3737-bp fragment
was gel-purified using a gel-purification kit (Qiagen, Inc.,
Valencia, Calif.).
[0284] In parallel, a DNA fragment comprising the Ptrc promoter and
the lacI sequences was obtained from a plasmid pTrcHis2
(Invitrogen, Carlsbad, Calif.) using the following primers:
TABLE-US-00023 (SEQ ID NO: 29) LF302:
5'-ATATGACGTCGGCATCCGCTTACAGACA-3' (SEQ ID NO: 30) LF303:
5'-AATTCTTAAGTCAGGAGAGCGTTCACCGACAA-3'
[0285] These primers also introduced the restriction sites for ZraI
and AflII.
[0286] The PCR product was purified using a PCR-purification kit
(Qiagen, Inc., Valencia, Calif.) and digested with ZraI and AflII.
The digestion product was gel-purified and ligated with the 3737-bp
fragment (described above). The ligation mixture was then
transformed into TOP 10.RTM. chemically competent cells
(Invitrogen, Carlsbad, Calif.). The transformants were selected on
Luria agar plates containing 100 .mu.g/mL spectinomycin during
overnight incubation. Resistant colonies were identified, and
plasmids within these colonies were purified, and verified with
restriction digestion and sequencing. One plasmid produced this way
was retained, and given the name of OP-80 (SEQ ID NO:28).
[0287] The PCR fragment comprising the Pspc promoter (described
above) was cloned into the BseRI and NcoI restriction sites of
OP-80 using the InFusion.TM. Cloning Kit (Clontech, Menlo Park,
Calif.). The resulting plasmid was given the name pDS22. pDS22
still possessed a lacZ gene sequence downstream of the multiple
cloning site. The lacZ sequence was removed with PCR employing the
following primers:
TABLE-US-00024 pCLlacDF: 5'-GAATTCCACCCGCTGACGAGCTTA-3' (SEQ ID NO:
31) pCLEcoR: 5'-CGAATTCCCATATGGTACCAG-3' (SEQ ID NO: 32)
The PCR product was subject to restriction digestion by EcoRI. The
digested product was subsequently self-ligated to form a plasmid
named pDS23, which did not contain lacI.sub.q, lacZ or promoter
Ptrc sequence.
[0288] The plasmid pDS33.ES9 (SEQ ID NO:24) (described above) was
again digested with BspHI and XhoI. After digestion, the fragment
was ligated with an OP-80 plasmid (described above) that had been
previously linearized using NcoI/XhoI restriction digestions.
[0289] The ligation product was transformed into TOP 10.RTM. One
Shot chemically competent cells (Invitrogen, Carlsbad, Calif.).
Cells were then plated on LB plates containing 100 .mu.g/mL
spectinomycin, and incubated overnight at 37.degree. C. After
overnight growth, several colonies were purified and the sequence
of the inserts verified. The plasmid was given the name pDS57 (SEQ
ID NO:41).
[0290] E. coli DAM1 strain was made electrocompetent using standard
methods. The competent cells were then transformed with plasmid
pDS57 and plated on LB plates containing 100 .mu.g/mL of
spectinomycin, and incubated overnight at 37.degree. C. Resistant
colonies were purified and the presence of the pDS57 plasmid was
confirmed using restriction digestion and sequencing. The resulting
construct was given the name E. coli DAM1/pDS57.
Example 6
Production of Biodiesel by Fermentation
[0291] This example demonstrates processes to produce a fatty ester
composition using the genetically modified microorganisms described
herein. A fermentation and recovery process was used to produce
biodiesel of commercial grade quality by fermentation of
carbohydrates. The fermentation process produced a mix of fatty
acid methyl esters (FAME) and fatty acid ethyl esters (FAEE) for
use as a biodiesel using the genetically engineered microorganisms
described in Examples 1-5.
Fermentation
[0292] The E. coli MG1655 DAM1/pDS57 cells (see, Example 4, supra)
was taken from a frozen stock and grown in a defined media
consisting of: 4.54 g/L of K.sub.2HPO.sub.4 trihydrate, 4 g/L of
(NH.sub.4).sub.2SO.sub.4, 0.15 g/L of MgSO.sub.4 heptahydrate, 20
g/L of glucose, 200 mM of Bis-Tris buffer (pH 7.2), 1.25 mL/L of a
first trace mineral solution and 1.25 mL/L of a vitamin solution.
The first trace metals solution was composed of 27 g/L of
FeCl.sub.3.6H.sub.2O, 2 g/L of ZnCl.sub.2.4H.sub.2O, 2 g/L of
CaCl.sub.2.6H.sub.2O, 2 g/L of Na.sub.2MoO.sub.4.2H.sub.2O, 1.9 g/L
of CuSO.sub.4.5H.sub.2O, 0.5 g/L of H.sub.3BO.sub.3, and 100 mL/L
of concentrated HCl. The trace vitamin solution was composed of
0.42 g/L of riboflavin, 5.4 g/L of pantothenic acid, 6 g/L of
niacin, 1.4 g/L of pyridoxine, 0.06 g/L of biotin, and 0.04 g/L of
folic acid.
[0293] 50 mL of cultures of DAM1/pDS57 cells were grown overnight
and subsequently used to inoculate 1 L of fermentation medium
containing: 0.5 g/L (NH.sub.4).sub.2SO.sub.4, 2.0 g/L
KH.sub.2PO.sub.4, 0.15 g/L MgSO.sub.4 heptahydrate, 0.034 g/L
ferric citrate, 2.5 g/L bacto casamino acids, 10 g/L glucose, 1.25
mL/L of a second trace mineral solution and 1.25 mL/L of a vitamin
solution, in a fermentor with temperature, pH, agitation, aeration
and dissolved oxygen control. The second trace mineral solution
contained 2 g/L of ZnCl.sub.2.4H.sub.2O, 2 g/L of
CaCl.sub.2.6H.sub.2O, 2 g/L of Na.sub.2MoO.sub.4.2H.sub.2O, 1.9 g/L
of CuSO.sub.4.5H.sub.2O, 0.5 g/L of H.sub.3BO.sub.3, and 100 mL/L
of concentrated HCl. The trace vitamin solution was as described
above.
[0294] The feed to the fermentor contained 600 g/L glucose, 3.9 g/L
MgSO.sub.4 heptahydrate, 1.6 g/L KH.sub.2PO.sub.4, 2.5 g/L casamino
acids, 0.05 g/L ferric citrate, 20 mL/L of the second trace mineral
solution and 2 mL/L of the trace vitamin solution.
[0295] The preferred conditions for the fermentation were
32.degree. C., pH 6.8 and dissolved oxygen (DO) equal to 30% of
saturation. The pH was maintained by addition of NH.sub.4OH, which
also served as nitrogen source for cell growth. The glucose level
in the culture was monitored using methods known to those skilled
in the art. When the initial glucose was almost consumed, a feed
consisting of 600 g/L glucose, 3.9 g/L MgSO.sub.4 heptahydrate, 1.6
g/L KH.sub.2PO.sub.4, 2.5 g/L casamino acids, 0.05 g/L ferric
citrate, 20 mL/L of the second trace mineral solution and 2 mL/L of
the trace vitamin solution was supplied to the fermentor. The feed
rate was set up to allow for a cell growth rate of 0.3 h.sup.-1, up
to a maximum of 10 g glucose/L/h, at which point it was fixed. This
rate was maintained for the remaining duration of the fermentation
as long as glucose did not accumulate in the fermentor. By avoiding
glucose accumulation, it was possible to reduce or eliminate the
formation of by-products such as acetate, formate and ethanol,
which are commonly produced by E. coli. In the early phases of the
growth, the production of fatty esters was induced by the addition
of 1 mM IPTG and 20 mL/L of a pure methanol or a pure ethanol. The
fermentation was continued for a period of 3 days. Methanol or
ethanol was added several times during the run to replenish what
was consumed by the cells for the production of fatty esters, but
mostly to replace the alcohol lost by evaporation in the off-gas.
The additions were targeted to maintain the concentration of
methanol or ethanol in the fermentation broth between 10 and 30
mL/L, to allow efficient production while avoiding inhibition of
cell growth.
[0296] The progression of the fermentation was followed by
measurements of OD.sub.600 (optical density at 600 nm), glucose
consumption, and ester production.
Analysis
[0297] Glucose consumption throughout the fermentation was analyzed
by High Pressure Liquid Chromatography (HPLC). The HPLC analysis
was performed according to methods commonly used for some sugars
and organic acids in the art, using the following conditions:
Agilent HPLC 1200 Series with Refractive Index detector; Column.
Aminex HPX-87H, 300 mm.times.7.8 mm; column temperature:
350.degree. C.; mobile phase: 0.01M H.sub.2SO.sub.4 (aqueous); flow
rate: 0.6 mL/m; injection volume: 20 .mu.L.
[0298] The production of fatty acid methyl and ethyl esters was
analyzed by gas chromatography with flame ionization detector
(GC-FID). Samples from fermentation broth were extracted with ethyl
acetate in a ratio of 1:1 vol/vol. After strong vortexing, the
samples were centrifuged, and the organic phase was analyzed by gas
chromatography (GC). The analysis conditions were as follows:
instrument: Trace GC Ultra, Thermo Electron Corporation with Flame
ionization detector (FID) detector; column: DB-1 (1% diphenyl
siloxane; 99% dimethyl siloxane) CO1 UFM 1/0.1/5 01 DET from Thermo
Electron Corporation, phase pH 5, FT: 0.4 .mu.m, length 5 m, id:
0.1 mm; inlet conditions: 250.degree. C. splitless, 3.8 m 1/25
split method used depending upon sample concentration with split
flow of 75 mL/m; carrier gas, flow rate: Helium, 3.0 mL/m; block
temperature: 330.degree. C.; oven temperature: 0.5 m hold at
50.degree. C., 100.degree. C./m to 330.degree. C., 0.5 m hold at
330.degree. C.; detector temperature: 300.degree. C.; injection
volume: 2 .mu.L; run time/flow rate: 6.3 m/3.0 mL/m (splitless
method), 3.8 m/1.5 mL/m (split 1/25 method), 3.04 m/1.2 mL/m (split
1/50 method).
Recovery
[0299] After fermentation, the fatty ester composition can be
separated from the fermentation broth by several different methods
well known in the art. After the completion of the fermentation,
the broth was centrifuged to separate a first light phase
containing the methylesters from a first heavy phase consisting of
water, salts and the microbial biomass. The first light phase was
centrifuged a second time to recover the biodiesel and to separate
a second light phase (which consisted of a mixture of ethers) from
a second heavy phase.
[0300] Centrifugation was performed in disk-stacked continuous
centrifuges of pilot scale capacity (fixed centrifugal force about
10,000 g) with flows from about 1 to about 5 LPM. The same
centrifuge was used each time for the first and second steps.
Normal adjustments known in the art to centrifugation configuration
and conditions (gravity ring size, back pressure in outlets, flow)
were undertaken in each case to achieve the most favorable
separation in terms of recovery efficiency and cleanness of the
product. For the first centrifugation step, the fermentation broth
was sent directly from the fermentor to the centrifuge without any
physical or chemical adjustments.
[0301] In instances where it was more difficult to break the
emulsion and obtain clear oil, additional pretreatments were
applied to the light phase to help with the separation during the
second centrifugation step. These treatments consisted of the
following: heating to 60 to 80.degree. C., adjusting the pH to 2.0
to 2.5 with sulfuric acid, and addition of demulsifiers (ARB-8285
(Baker Hughes, Houston, Tex.), less than 1% of the emulsion/light
phase volume). The temperature was held for 1 to 2 h before the
second centrifugation.
Polishing
[0302] The composition obtained from the harvesting step had
characteristics that were very close to the regulatory standards
for biodiesel. The inherent properties of this composition, as well
as the properties related to purity, met the regulatory standards
for biodiesel, including cetane number, kinematic viscosity, flash
point, oxidation stability, copper corrosion, free and total
glycerin, methanol, phosphorous, sulfate, K.sup.+ and Na.sup.+
content. However, the composition obtained from the harvesting step
can, in certain embodiments, be subjected to optional minor further
purification steps to eliminate other impurities.
[0303] Specifically, the following polishing steps were performed
on the clarified composition: (1) a lime wash step; (2) an acid
wash step; (3) a water wash step; and (4) an absorption treatment
step. In the lime wash step, 1 part by volume of a lime slurry (5%
lime slurry w/w) was mixed with 5 parts of raw oil at room
temperature for 5 m. The mixture was then centrifuged at 10,000 g
for 15 m. The lime-washed oil was then mixed with a dilute sulfuric
acid (5% v/v) in the ratio of 1 part oil to 0.75 parts diluted
sulfuric acid. The mixture was centrifuged at 10,000 g for 15 m.
The acid-washed oil was then mixed water in the ratio of 1 part oil
plus 1 part water. The mixture was then centrifuged at 10,000 g for
15 m. The oil was separated, and the water-washed oil was then
dried at 80.degree. C. for 1 h in the rotovap. This dried oil was
heated to 90.degree. C., and added to an absorbent, Magnesol.TM.
D60 (The Dallas Group, Inc., Whitehouse, N.J.) (1% w/v), and mixed
for 1 h. Magnesol.TM. D60 was removed by filtration through 0.2
micron filters.
Results
[0304] The E. coli MG1655 DAM1/pDS57 strain was grown according to
the protocol above, in 2-L and 5-L fermentors. Representative
results obtained are presented in the Table 7. The yield is
expressed as the grams of product obtained per 100 g of carbon
source used.
TABLE-US-00025 TABLE 7 Parameter Methanol Ethanol FAME
concentration (g/L) 44.8 -- FFA concentration (g/L) 0.6 4.9 FAEE
Concentration (g/L) 1.1 8.1 Yield of Fatty Ester on glucose (%)
16.1 5.1 Yield of All Fatty Species on glucose (%) 16.7 8.1
Example 7
Performance Profile
[0305] This example illustrates the performance profile of the
fatty ester composition produced using the genetically modified
microorganism produced and processed in accordance with Example 6.
The fatty ester composition was subject to the analyses as a diesel
fuel, without blending with petroleum diesel. The analyses were
performed by the School of Chemistry, Federal University of Rio De
Janeiro, Brazil. The results of the analysis are set forth in Table
8.
TABLE-US-00026 TABLE 8 ANP No. 7 Properties Methods Results
Standard Aspect Visual Lipid, without LII impurities (24.degree.
C.) Specific mass @20.degree. C. ASTM D4052 875.3 kg/m.sup.3
850-900 kg/m.sup.3 Kinematic Viscosity @40.degree. C. ASTM D445
3.55 mm.sup.2/s 3-6 mm.sup.2/s Water content, max EN 12937 450
mg/kg .ltoreq.500 mg/kg Total contamination EN 12662 3.2 mg/kg
.ltoreq.24 mg/kg Flash point, min ASTM D93 128.degree. C.
.gtoreq.100.degree. C. Ester content, min EN 14103 97.5 wt. %
.gtoreq.96.5 wt. % Carbon residue, max ASTM D4530 N/D .ltoreq.0.05
wt. % Sulfated ash, max ASTM D874 0.01 wt. % .ltoreq.0.02 wt. %
Total sulfur, max EN 20884 15.0 mg/kg .ltoreq.50 mg/kg Na + K, max
EN 14108 1.6 mg/kg .ltoreq.5 mg/kg EN 14109 Ca + Mg, max NBR 15556
0.3 mg/kg .ltoreq.5 mg/kg Phosphorus, max ASTM D4951 0.9 mg/kg
.ltoreq.10 mg/kg Copper corrosion, 3 h @50.degree. C., ASTM D130 1
1 max Cold filter plugging point, max ASTM D6371 -3.degree. C.
.ltoreq.19.degree. C. Acid value, max ASTM D664 0.5 mg KOH/g
.ltoreq.0.5 mg KOH/g Free glycerol, max ASTM D6584 0.02 wt. %
.ltoreq.0.02 wt. % Total glycerol, max ASTM D6584 0.03 wt. %
.ltoreq.0.25 wt. % Monoacylglycerol ASTM D6584 0.02 wt. % Report
Diacylglycerol ASTM D6584 0 wt. % Report Triacylglycerol ASTM D6584
0 wt. % Report Methanol or Ethanol, EN 14110 0.01 wt. % .ltoreq.0.2
wt. % max Iodine value EN 14111 64.92 g/100 g Report Oxidation
stability @100.degree. C., EN 14112 11.0 h .gtoreq.6 h min
[0306] In parallel, a sample obtained from the same process was
provided to Gorge Analytical in Hood River, Oreg. for testing and
the results are listed below in Table 9:
TABLE-US-00027 TABLE 9 Analysis Method Result Pass/Fail Cloud Point
ASTM D2500 -4.degree. C. n/a Simulated ASTM D2887 358.degree. C.
Pass distillation - T90 Sulfur by UVF ASTM D5453 10.7 ppm Pass Karl
Fischer ASTM D6304 0.0542 wt. % n/a Moisture- Coulometric Total
Acid ASTM D664 0.14 mg KOH/g Pass Number Ca + Mg EN 14538 - <2
ppm Pass Ca, Mg Na + K EN 14538 - 2.2 ppm Pass Na, K
OTHER EMBODIMENTS
[0307] It is to be understood that while the invention has been
described in conjunction with the detailed description thereof, the
foregoing description is intended to illustrate and not limit the
scope of the invention, which is defined by the scope of the
appended claims. Other aspects, advantages, and modifications are
within the scope of the following claims.
Sequence CWU 1
1
42170DNAArtificial SequenceDescription of Artificial Sequence
Synthetic primer 1aaaaacagca acaatgtgag ctttgttgta attatattgt
aaacatattg attccgggga 60tccgtcgacc 70268DNAArtificial
SequenceDescription of Artificial Sequence Synthetic primer
2aaacggagcc tttcggctcc gttattcatt tacgcggctt caactttcct gtaggctgga
60gctgcttc 68323DNAArtificial SequenceDescription of Artificial
Sequence Synthetic primer 3cgggcaggtg ctatgaccag gac
23423DNAArtificial SequenceDescription of Artificial Sequence
Synthetic primer 4cgcggcgttg accggcagcc tgg 23570DNAArtificial
SequenceDescription of Artificial Sequence Synthetic primer
5atcattctcg tttacgttat cattcacttt acatcagaga tataccaatg attccgggga
60tccgtcgacc 70669DNAArtificial SequenceDescription of Artificial
Sequence Synthetic primer 6gcacggaaat ccgtgcccca aaagagaaat
tagaaacgga aggttgcggt tgtaggctgg 60agctgcttc 69721DNAArtificial
SequenceDescription of Artificial Sequence Synthetic primer
7caacagcaac ctgctcagca a 21821DNAArtificial SequenceDescription of
Artificial Sequence Synthetic primer 8aagctggagc agcaaagcgt t
2195180DNAArtificial SequenceDescription of Artificial Sequence
Synthetic polynucleotide 9gggtcaatag cggccgccaa ttcgcgcgcg
aaggcgaagc ggcatgcatt tacgttgaca 60ccatcgaatg gtgcaaaacc tttcgcggta
tggcatgata gcgcccggaa gagagtcaat 120tcagggtggt gaatgtgaaa
ccagtaacgt tatacgatgt cgcagagtat gccggtgtct 180cttatcagac
cgtttcccgc gtggtgaacc aggccagcca cgtttctgcg aaaacgcggg
240aaaaagtgga agcggcgatg gcggagctga attacattcc caaccgcgtg
gcacaacaac 300tggcgggcaa acagtcgttg ctgattggcg ttgccacctc
cagtctggcc ctgcacgcgc 360cgtcgcaaat tgtcgcggcg attaaatctc
gcgccgatca actgggtgcc agcgtggtgg 420tgtcgatggt agaacgaagc
ggcgtcgaag cctgtaaagc ggcggtgcac aatcttctcg 480cgcaacgcgt
cagtgggctg atcattaact atccgctgga tgaccaggat gccattgctg
540tggaagctgc ctgcactaat gttccggcgt tatttcttga tgtctctgac
cagacaccca 600tcaacagtat tattttctcc catgaagacg gtacgcgact
gggcgtggag catctggtcg 660cattgggtca ccagcaaatc gcgctgttag
cgggcccatt aagttctgtc tcggcgcgtc 720tgcgtctggc tggctggcat
aaatatctca ctcgcaatca aattcagccg atagcggaac 780gggaaggcga
ctggagtgcc atgtccggtt ttcaacaaac catgcaaatg ctgaatgagg
840gcatcgttcc cactgcgatg ctggttgcca acgatcagat ggcgctgggc
gcaatgcgcg 900ccattaccga gtccgggctg cgcgttggtg cggatatctc
ggtagtggga tacgacgata 960ccgaagacag ctcatgttat atcccgccgt
caaccaccat caaacaggat tttcgcctgc 1020tggggcaaac cagcgtggac
cgcttgctgc aactctctca gggccaggcg gtgaagggca 1080atcagctgtt
gcccgtctca ctggtgaaaa gaaaaaccac cctggcgccc aatacgcaaa
1140ccgcctctcc ccgcgcgttg gccgattcat taatgcagct ggcacgacag
gtttcccgac 1200tggaaagcgg gcagtgagcg caacgcaatt aatgtgagtt
agcgcgaatt gatctggttt 1260gacagcttat catcgactgc acggtgcacc
aatgcttctg gcgtcaggca gccatcggaa 1320gctgtggtat ggctgtgcag
gtcgtaaatc actgcataat tcgtgtcgct caaggcgcac 1380tcccgttctg
gataatgttt tttgcgccga catcataacg gttctggcaa atattctgaa
1440atgagctgtt gacaattaat catccggctc gtataatgtg tggaattgtg
agcggataac 1500aatttcacac aggaaacagc gccgctgaga aaaagcgaag
cggcactgct ctttaacaat 1560ttatcagaca atctgtgtgg gcactcgacc
ggaattatcg attaacttta ttattaaaaa 1620ttaaagaggt atatattaat
gtatcgatta aataaggagg aataaaccat ggcggacacg 1680ttattgattc
tgggtgatag cctgagcgcc gggtatcgaa tgtctgccag cgcggcctgg
1740cctgccttgt tgaatgataa gtggcagagt aaaacgtcgg tagttaatgc
cagcatcagc 1800ggcgacacct cgcaacaagg actggcgcgc cttccggctc
tgctgaaaca gcatcagccg 1860cgttgggtgc tggttgaact gggcggcaat
gacggtttgc gtggttttca gccacagcaa 1920accgagcaaa cgctgcgcca
gattttgcag gatgtcaaag ccgccaacgc tgaaccattg 1980ttaatgcaaa
tacgtctgcc tgcaaactat ggtcgccgtt ataatgaagc ctttagcgcc
2040atttacccca aactcgccaa agagtttgat gttccgctgc tgcccttttt
tatggaagag 2100gtctacctca agccacaatg gatgcaggat gacggtattc
atcccaaccg cgacgcccag 2160ccgtttattg ccgactggat ggcgaagcag
ttgcagcctt tagtaaatca tgactcataa 2220tgactctaga aataatttta
gttaagtata agaaggagat ataccatggt gaagaaggtt 2280tggcttaacc
gttatcccgc ggacgttccg acggagatca accctgaccg ttatcaatct
2340ctggtagata tgtttgagca gtcggtcgcg cgctacgccg atcaacctgc
gtttgtgaat 2400atgggggagg taatgacctt ccgcaagctg gaagaacgca
gtcgcgcgtt tgccgcttat 2460ttgcaacaag ggttggggct gaagaaaggc
gatcgcgttg cgttgatgat gcctaattta 2520ttgcaatatc cggtggcgct
gtttggcatt ttgcgtgccg ggatgatcgt cgtaaacgtt 2580aacccgttgt
ataccccgcg tgagcttgag catcagctta acgatagcgg cgcatcggcg
2640attgttatcg tgtctaactt tgctcacaca ctggaaaaag tggttgataa
aaccgccgtt 2700cagcacgtaa ttctgacccg tatgggcgat cagctatcta
cggcaaaagg cacggtagtc 2760aatttcgttg ttaaatacat caagcgtttg
gtgccgaaat accatctgcc agatgccatt 2820tcatttcgta gcgcactgca
taacggctac cggatgcagt acgtcaaacc cgaactggtg 2880ccggaagatt
tagcttttct gcaatacacc ggcggcacca ctggtgtggc gaaaggcgcg
2940atgctgactc accgcaatat gctggcgaac ctggaacagg ttaacgcgac
ctatggtccg 3000ctgttgcatc cgggcaaaga gctggtggtg acggcgctgc
cgctgtatca catttttgcc 3060ctgaccatta actgcctgct gtttatcgaa
ctgggtgggc agaacctgct tatcactaac 3120ccgcgcgata ttccagggtt
ggtaaaagag ttagcgaaat atccgtttac cgctatcacg 3180ggcgttaaca
ccttgttcaa tgcgttgctg aacaataaag agttccagca gctggatttc
3240tccagtctgc atctttccgc aggcggaggg atgccagtgc agcaagtggt
ggcagagcgt 3300tgggtgaaac tgacaggaca gtatctgctg gaaggctatg
gccttaccga gtgtgcgccg 3360ctggtcagcg ttaacccata tgatattgat
tatcatagtg gtagcatcgg tttgccggtg 3420ccgtcgacgg aagccaaact
ggtggatgat gatgataatg aagtaccacc gggtcaaccg 3480ggtgagcttt
gtgtcaaagg accgcaggtg atgctgggtt actggcagcg tccggatgct
3540acagatgaga tcatcaaaaa tggctggtta cacaccggcg acatcgcggt
gatggatgaa 3600gaagggttcc tgcgcattgt cgatcgtaaa aaagacatga
ttctggtttc cggttttaac 3660gtctatccca acgagattga agatgtcgtc
atgcagcatc ctggcgtaca ggaagtcgcg 3720gctgttggcg taccttccgg
ctccagtggt gaagcggtga aaatcttcgt agtgaaaaaa 3780gatccatcgc
ttaccgaaga gtcactggtg accttttgcc gccgtcagct cacgggctac
3840aaagtaccga agctggtgga gtttcgtgat gagttaccga aatctaacgt
cggaaaaatt 3900ttgcgacgag aattacgtga cgaagcgcgc ggcaaagtgg
acaataaagc ctgataactc 3960tagaaataat ttaaatggaa ttcgaagctt
gggcccgaac aaaaactcat ctcagaagag 4020gatctgaata gcgccgtcga
ccatcatcat catcatcatt gagtttaaac ggtctccagc 4080ttggctgttt
tggcggatga gagaagattt tcagcctgat acagattaaa tcagaacgca
4140gaagcggtct gataaaacag aatttgcctg gcggcagtag cgcggtggtc
ccacctgacc 4200ccatgccgaa ctcagaagtg aaacgccgta gcgccgatgg
tagtgtgggg tctccccatg 4260cgagagtagg gaactgccag gcatcaaata
aaacgaaagg ctcagtcgaa agactgggcc 4320tttcgtttta tctgttgttt
gtcggtgaac gctctcctga ttaattaaga cgtctaagaa 4380accattatta
tcatgacatt aacctataaa aataggcgta tcacgaggcc ctttcgtctt
4440caagaatttt ataaaccgtg gagcgggcaa tactgagctg atgagcaatt
tccgttgcac 4500cagtgccctt ctgatgaagc gtcagcacga cgttcctgtc
cacggtacgc ctgcggccaa 4560atttgattcc tttcagcttt gcttcctgtc
ggccctcatt cgtgcgctct aggatcctcc 4620ggcgttcagc ctgtgccaca
gccgacagga tggtgaccac catttgcccc atatcaccgt 4680cggtactgat
cccgtcgtca ataaaccgaa ccgctacacc ctgagcatca aactctttta
4740tcagttggat catgtcggcg gtgtcgcggc caagacggtc gagcttcttc
accagaatga 4800catcaccttc ctccaccttc atcctcagca aatccagccc
ttcccgatct gttgaactgc 4860cggatgcctt gtcggtaaag atgcggttag
cttttacccc tgcatctttg agcgctgagg 4920tctgcctcgt gaagaaggtg
ttgctgactc ataccaggcc tgaatcgccc catcatccag 4980ccagaaagtg
agggagccac ggttgatgag agctttgttg taggtggacc agttggtgat
5040tttgaacttt tgctttgcca cggaacggtc tgcgttgtcg ggaagatgcg
tgatctgatc 5100cttcaactca gcaaaagttc gatttattca acaaagccgc
cgtcccgtca agtcagcgta 5160atgccctagg aggcgcgcca
5180106700DNAArtificial SequenceDescription of Artificial Sequence
Synthetic polynucleotide 10ggggaattgt gagcggataa caattcccct
gtagaaataa ttttgtttaa ctttaataag 60gagatatacc atggtgaaga aggtttggct
taaccgttat cccgcggacg ttccgacgga 120gatcaaccct gaccgttatc
aatctctggt agatatgttt gagcagtcgg tcgcgcgcta 180cgccgatcaa
cctgcgtttg tgaatatggg ggaggtaatg accttccgca agctggaaga
240acgcagtcgc gcgtttgccg cttatttgca acaagggttg gggctgaaga
aaggcgatcg 300cgttgcgttg atgatgccta atttattgca atatccggtg
gcgctgtttg gcattttgcg 360tgccgggatg atcgtcgtaa acgttaaccc
gttgtatacc ccgcgtgagc ttgagcatca 420gcttaacgat agcggcgcat
cggcgattgt tatcgtgtct aactttgctc acacactgga 480aaaagtggtt
gataaaaccg ccgttcagca cgtaattctg acccgtatgg gcgatcagct
540atctacggca aaaggcacgg tagtcaattt cgttgttaaa tacatcaagc
gtttggtgcc 600gaaataccat ctgccagatg ccatttcatt tcgtagcgca
ctgcataacg gctaccggat 660gcagtacgtc aaacccgaac tggtgccgga
agatttagct tttctgcaat acaccggcgg 720caccactggt gtggcgaaag
gcgcgatgct gactcaccgc aatatgctgg cgaacctgga 780acaggttaac
gcgacctatg gtccgctgtt gcatccgggc aaagagctgg tggtgacggc
840gctgccgctg tatcacattt ttgccctgac cattaactgc ctgctgttta
tcgaactggg 900tgggcagaac ctgcttatca ctaacccgcg cgatattcca
gggttggtaa aagagttagc 960gaaatatccg tttaccgcta tcacgggcgt
taacaccttg ttcaatgcgt tgctgaacaa 1020taaagagttc cagcagctgg
atttctccag tctgcatctt tccgcaggcg gagggatgcc 1080agtgcagcaa
gtggtggcag agcgttgggt gaaactgaca ggacagtatc tgctggaagg
1140ctatggcctt accgagtgtg cgccgctggt cagcgttaac ccatatgata
ttgattatca 1200tagtggtagc atcggtttgc cggtgccgtc gacggaagcc
aaactggtgg atgatgatga 1260taatgaagta ccaccgggtc aaccgggtga
gctttgtgtc aaaggaccgc aggtgatgct 1320gggttactgg cagcgtccgg
atgctacaga tgagatcatc aaaaatggct ggttacacac 1380cggcgacatc
gcggtgatgg atgaagaagg gttcctgcgc attgtcgatc gtaaaaaaga
1440catgattctg gtttccggtt ttaacgtcta tcccaacgag attgaagatg
tcgtcatgca 1500gcatcctggc gtacaggaag tcgcggctgt tggcgtacct
tccggctcca gtggtgaagc 1560ggtgaaaatc ttcgtagtga aaaaagatcc
atcgcttacc gaagagtcac tggtgacctt 1620ttgccgccgt cagctcacgg
gctacaaagt accgaagctg gtggagtttc gtgatgagtt 1680accgaaatct
aacgtcggaa aaattttgcg acgagaatta cgtgacgaag cgcgcggcaa
1740agtggacaat aaagcctgaa agcttgcggc cgcataatgc ttaagtcgaa
cagaaagtaa 1800tcgtattgta cacggccgca taatcgaaat taatacgact
cactataggg gaattgtgag 1860cggataacaa ttccccatct tagtatatta
gttaagtata agaaggagat atacatatgc 1920gcccattaca tccgattgat
tttatattcc tgtcactaga aaaaagacaa cagcctatgc 1980atgtaggtgg
tttatttttg tttcagattc ctgataacgc cccagacacc tttattcaag
2040atctggtgaa tgatatccgg atatcaaaat caatccctgt tccaccattc
aacaataaac 2100tgaatgggct tttttgggat gaagatgaag agtttgattt
agatcatcat tttcgtcata 2160ttgcactgcc tcatcctggt cgtattcgtg
aattgcttat ttatatttca caagagcaca 2220gtacgctgct agatcgggca
aagcccttgt ggacctgcaa tattattgaa ggaattgaag 2280gcaatcgttt
tgccatgtac ttcaaaattc accatgcgat ggtcgatggc gttgctggta
2340tgcggttaat tgaaaaatca ctctcccatg atgtaacaga aaaaagtatc
gtgccacctt 2400ggtgtgttga gggaaaacgt gcaaagcgct taagagaacc
taaaacaggt aaaattaaga 2460aaatcatgtc tggtattaag agtcagcttc
aggcgacacc cacagtcatt caagagcttt 2520ctcagacagt atttaaagat
attggacgta atcctgatca tgtttcaagc tttcaggcgc 2580cttgttctat
tttgaatcag cgtgtgagct catcgcgacg ttttgcagca cagtcttttg
2640acctagatcg ttttcgtaat attgccaaat cgttgaatgt gaccattaat
gatgttgtac 2700tagcggtatg ttctggtgca ttacgtgcgt atttgatgag
tcataatagt ttgccttcaa 2760aaccattaat tgccatggtt ccagcctcta
ttcgcaatga cgattcagat gtcagcaacc 2820gtattacgat gattctggca
aatttggcaa cccacaaaga tgatccttta caacgtcttg 2880aaattatccg
ccgtagtgtt caaaactcaa agcaacgctt caaacgtatg accagcgatc
2940agattctaaa ttatagtgct gtcgtatatg gccctgcagg actcaacata
atttctggca 3000tgatgccaaa acgccaagcc ttcaatctgg ttatttccaa
tgtgcctggc ccaagagagc 3060cactttactg gaatggtgcc aaacttgatg
cactctaccc agcttcaatt gtattagacg 3120gtcaagcatt gaatattaca
atgaccagtt atttagataa acttgaagtt ggtttgattg 3180catgccgtaa
tgcattgcca agaatgcaga atttactgac acatttagaa gaagaaattc
3240aactatttga aggcgtaatt gcaaagcagg aagatattaa aacagccaat
taaaaacaat 3300aaacttgatt ttttaattta tcagataaaa ctaaagggct
aaattagccc tcctaggctg 3360ctgccaccgc tgagcaataa ctagcataac
cccttggggc ctctaaacgg gtcttgaggg 3420gttttttgct gaaacctcag
gcatttgaga agcacacggt cacactgctt ccggtagtca 3480ataaaccggt
aaaccagcaa tagacataag cggctattta acgaccctgc cctgaaccga
3540cgaccgggtc atcgtggccg gatcttgcgg cccctcggct tgaacgaatt
gttagacatt 3600atttgccgac taccttggtg atctcgcctt tcacgtagtg
gacaaattct tccaactgat 3660ctgcgcgcga ggccaagcga tcttcttctt
gtccaagata agcctgtcta gcttcaagta 3720tgacgggctg atactgggcc
ggcaggcgct ccattgccca gtcggcagcg acatccttcg 3780gcgcgatttt
gccggttact gcgctgtacc aaatgcggga caacgtaagc actacatttc
3840gctcatcgcc agcccagtcg ggcggcgagt tccatagcgt taaggtttca
tttagcgcct 3900caaatagatc ctgttcagga accggatcaa agagttcctc
cgccgctgga cctaccaagg 3960caacgctatg ttctcttgct tttgtcagca
agatagccag atcaatgtcg atcgtggctg 4020gctcgaagat acctgcaaga
atgtcattgc gctgccattc tccaaattgc agttcgcgct 4080tagctggata
acgccacgga atgatgtcgt cgtgcacaac aatggtgact tctacagcgc
4140ggagaatctc gctctctcca ggggaagccg aagtttccaa aaggtcgttg
atcaaagctc 4200gccgcgttgt ttcatcaagc cttacggtca ccgtaaccag
caaatcaata tcactgtgtg 4260gcttcaggcc gccatccact gcggagccgt
acaaatgtac ggccagcaac gtcggttcga 4320gatggcgctc gatgacgcca
actacctctg atagttgagt cgatacttcg gcgatcaccg 4380cttccctcat
actcttcctt tttcaatatt attgaagcat ttatcagggt tattgtctca
4440tgagcggata catatttgaa tgtatttaga aaaataaaca aatagctagc
tcactcggtc 4500gctacgctcc gggcgtgaga ctgcggcggg cgctgcggac
acatacaaag ttacccacag 4560attccgtgga taagcagggg actaacatgt
gaggcaaaac agcagggccg cgccggtggc 4620gtttttccat aggctccgcc
ctcctgccag agttcacata aacagacgct tttccggtgc 4680atctgtggga
gccgtgaggc tcaaccatga atctgacagt acgggcgaaa cccgacagga
4740cttaaagatc cccaccgttt ccggcgggtc gctccctctt gcgctctcct
gttccgaccc 4800tgccgtttac cggatacctg ttccgccttt ctcccttacg
ggaagtgtgg cgctttctca 4860tagctcacac actggtatct cggctcggtg
taggtcgttc gctccaagct gggctgtaag 4920caagaactcc ccgttcagcc
cgactgctgc gccttatccg gtaactgttc acttgagtcc 4980aacccggaaa
agcacggtaa aacgccactg gcagcagcca ttggtaactg ggagttcgca
5040gaggatttgt ttagctaaac acgcggttgc tcttgaagtg tgcgccaaag
tccggctaca 5100ctggaaggac agatttggtt gctgtgctct gcgaaagcca
gttaccacgg ttaagcagtt 5160ccccaactga cttaaccttc gatcaaacca
cctccccagg tggttttttc gtttacaggg 5220caaaagatta cgcgcagaaa
aaaaggatct caagaagatc ctttgatctt ttctactgaa 5280ccgctctaga
tttcagtgca atttatctct tcaaatgtag cacctgaagt cagccccata
5340cgatataagt tgtaattctc atgttagtca tgccccgcgc ccaccggaag
gagctgactg 5400ggttgaaggc tctcaagggc atcggtcgag atcccggtgc
ctaatgagtg agctaactta 5460cattaattgc gttgcgctca ctgcccgctt
tccagtcggg aaacctgtcg tgccagctgc 5520attaatgaat cggccaacgc
gcggggagag gcggtttgcg tattgggcgc cagggtggtt 5580tttcttttca
ccagtgagac gggcaacagc tgattgccct tcaccgcctg gccctgagag
5640agttgcagca agcggtccac gctggtttgc cccagcaggc gaaaatcctg
tttgatggtg 5700gttaacggcg ggatataaca tgagctgtct tcggtatcgt
cgtatcccac taccgagatg 5760tccgcaccaa cgcgcagccc ggactcggta
atggcgcgca ttgcgcccag cgccatctga 5820tcgttggcaa ccagcatcgc
agtgggaacg atgccctcat tcagcatttg catggtttgt 5880tgaaaaccgg
acatggcact ccagtcgcct tcccgttccg ctatcggctg aatttgattg
5940cgagtgagat atttatgcca gccagccaga cgcagacgcg ccgagacaga
acttaatggg 6000cccgctaaca gcgcgatttg ctggtgaccc aatgcgacca
gatgctccac gcccagtcgc 6060gtaccgtctt catgggagaa aataatactg
ttgatgggtg tctggtcaga gacatcaaga 6120aataacgccg gaacattagt
gcaggcagct tccacagcaa tggcatcctg gtcatccagc 6180ggatagttaa
tgatcagccc actgacgcgt tgcgcgagaa gattgtgcac cgccgcttta
6240caggcttcga cgccgcttcg ttctaccatc gacaccacca cgctggcacc
cagttgatcg 6300gcgcgagatt taatcgccgc gacaatttgc gacggcgcgt
gcagggccag actggaggtg 6360gcaacgccaa tcagcaacga ctgtttgccc
gccagttgtt gtgccacgcg gttgggaatg 6420taattcagct ccgccatcgc
cgcttccact ttttcccgcg ttttcgcaga aacgtggctg 6480gcctggttca
ccacgcggga aacggtctga taagagacac cggcatactc tgcgacatcg
6540tataacgtta ctggtttcac attcaccacc ctgaattgac tctcttccgg
gcgctatcat 6600gccataccgc gaaaggtttt gcgccattcg atggtgtccg
ggatctcgac gctctccctt 6660atgcgactcc tgcattagga aattaatacg
actcactata 6700115733DNAArtificial SequenceDescription of
Artificial Sequence Synthetic polynucleotide 11actcaccagt
cacagaaaag catcttacgg atggcatgac agtaagagaa ttatgcagtg 60ctgccataac
catgagtgat aacactgcgg ccaacttact tctgacaacg atcggaggac
120cgaaggagct aaccgctttt ttgcacaaca tgggggatca tgtaactcgc
cttgatcgtt 180gggaaccgga gctgaatgaa gccataccaa acgacgagcg
tgacaccacg atgcctgcag 240caatggcaac aacgttgcgc aaactattaa
ctggcgaact acttactcta gcttcccggc 300aacaattaat agactggatg
gaggcggata aagttgcagg accacttctg cgctcggccc 360ttccggctgg
ctggtttatt gctgataaat ctggagccgg tgagcgtggg tctcgcggta
420tcattgcagc actggggcca gatggtaagc cctcccgtat cgtagttatc
tacacgacgg 480ggagtcaggc aactatggat gaacgaaata gacagatcgc
tgagataggt gcctcactga 540ttaagcattg gtaactgtca gaccaagttt
actcatatat actttagatt gatttaaaac 600ttcattttta atttaaaagg
atctaggtga agatcctttt tgataatctc atgaccaaaa 660tcccttaacg
tgagttttcg ttccactgag cgtcagaccc cttaataaga tgatcttctt
720gagatcgttt tggtctgcgc gtaatctctt gctctgaaaa cgaaaaaacc
gccttgcagg 780gcggtttttc gaaggttctc tgagctacca actctttgaa
ccgaggtaac tggcttggag 840gagcgcagtc accaaaactt gtcctttcag
tttagcctta accggcgcat gacttcaaga 900ctaactcctc taaatcaatt
accagtggct gctgccagtg gtgcttttgc atgtctttcc 960gggttggact
caagacgata gttaccggat aaggcgcagc ggtcggactg aacggggggt
1020tcgtgcatac agtccagctt ggagcgaact gcctacccgg aactgagtgt
caggcgtgga 1080atgagacaaa cgcggccata acagcggaat gacaccggta
aaccgaaagg caggaacagg 1140agagcgcacg agggagccgc cagggggaaa
cgcctggtat ctttatagtc ctgtcgggtt 1200tcgccaccac tgatttgagc
gtcagatttc gtgatgcttg tcaggggggc ggagcctatg 1260gaaaaacggc
tttgccgcgg ccctctcact tccctgttaa gtatcttcct ggcatcttcc
1320aggaaatctc cgccccgttc gtaagccatt tccgctcgcc gcagtcgaac
gaccgagcgt 1380agcgagtcag tgagcgagga agcggaatat atcctgtatc
acatattctg ctgacgcacc 1440ggtgcagcct tttttctcct gccacatgaa
gcacttcact gacaccctca tcagtgccaa 1500catagtaagc cagtatacac
tccgctagcg ctgaggtctg cctcgtgaag aaggtgttgc
1560tgactcatac caggcctgaa tcgccccatc atccagccag aaagtgaggg
agccacggtt 1620gatgagagct ttgttgtagg tggaccagtt ggtgattttg
aacttttgct ttgccacgga 1680acggtctgcg ttgtcgggaa gatgcgtgat
ctgatccttc aactcagcaa aagttcgatt 1740tattcaacaa agccacgttg
tgtctcaaaa tctctgatgt tacattgcac aagataaaaa 1800tatatcatca
tgaacaataa aactgtctgc ttacataaac agtaatacaa ggggtgttat
1860gagccatatt caacgggaaa cgtcttgctc gaggccgcga ttaaattcca
acatggatgc 1920tgatttatat gggtataaat gggctcgcga taatgtcggg
caatcaggtg cgacaatcta 1980tcgattgtat gggaagcccg atgcgccaga
gttgtttctg aaacatggca aaggtagcgt 2040tgccaatgat gttacagatg
agatggtcag actaaactgg ctgacggaat ttatgcctct 2100tccgaccatc
aagcatttta tccgtactcc tgatgatgca tggttactca ccactgcgat
2160ccccgggaaa acagcattcc aggtattaga agaatatcct gattcaggtg
aaaatattgt 2220tgatgcgctg gcagtgttcc tgcgccggtt gcattcgatt
cctgtttgta attgtccttt 2280taacagcgat cgcgtatttc gtctcgctca
ggcgcaatca cgaatgaata acggtttggt 2340tgatgcgagt gattttgatg
acgagcgtaa tggctggcct gttgaacaag tctggaaaga 2400aatgcataag
cttttgccat tctcaccgga ttcagtcgtc actcatggtg atttctcact
2460tgataacctt atttttgacg aggggaaatt aataggttgt attgatgttg
gacgagtcgg 2520aatcgcagac cgataccagg atcttgccat cctatggaac
tgcctcggtg agttttctcc 2580ttcattacag aaacggcttt ttcaaaaata
tggtattgat aatcctgata tgaataaatt 2640gcagtttcat ttgatgctcg
atgagttttt ctaatcagaa ttggttaatt ggttgtaaca 2700ctggcagagc
attacgctga cttgacggga cggcggcttt gttgaataaa tcgaactttt
2760gctgagttga aggatcagat cacgcatctt cccgacaacg cagaccgttc
cgtggcaaag 2820caaaagttca aaatcaccaa ctggtccacc tacaacaaag
ctctcatcaa ccgtggctcc 2880ctcactttct ggctggatga tggggcgatt
caggcctggt atgagtcagc aacaccttct 2940tcacgaggca gacctcagcg
ctcaaagatg caggggtaaa agctaaccgc atctttaccg 3000acaaggcatc
cggcagttca acagatcggg aagggctgga tttgctgagg atgaaggtgg
3060aggaaggtga tgtcattctg gtgaagaagc tcgaccgtct tggccgcgac
accgccgaca 3120tgatccaact gataaaagag tttgatgctc agggtgtagc
ggttcggttt attgacgacg 3180ggatcagtac cgacggtgat atggggcaaa
tggtggtcac catcctgtcg gctgtggcac 3240aggctgaacg ccggaggatc
ctagagcgca cgaatgaggg ccgacaggaa gcaaagctga 3300aaggaatcaa
atttggccgc aggcgtaccg tggacaggaa cgtcgtgctg acgcttcatc
3360agaagggcac tggtgcaacg gaaattgctc atcagctcag tattgcccgc
tccacggttt 3420ataaaattct tgaagacgaa agggcctcgt gatacgccta
tttttatagg ttaatgtcat 3480gataataatg gtttcttaga cgtcttaatt
aatcaggaga gcgttcaccg acaaacaaca 3540gataaaacga aaggcccagt
ctttcgactg agcctttcgt tttatttgat gcctggcagt 3600tccctactct
cgcatgggga gaccccacac taccatcggc gctacggcgt ttcacttctg
3660agttcggcat ggggtcaggt gggaccaccg cgctactgcc gccaggcaaa
ttctgtttta 3720tcagaccgct tctgcgttct gatttaatct gtatcaggct
gaaaatcttc tctcatccgc 3780caaaacagcc aagctggaga ccgtttaaac
tcaatgatga tgatgatgat ggtcgacggc 3840gctattcaga tcctcttctg
agatgagttt ttgttcgggc ccaagcttcg aattcccata 3900tggtaccagc
tgcagatctc gagctcggat ccatggttta ttcctcctta tttaatcgat
3960acattaatat atacctcttt aatttttaat aataaagtta atcgataatt
ccggtcgagt 4020gcccacacag attgtctgat aaattgttaa agagcagtgc
cgcttcgctt tttctcagcg 4080gcgctgtttc ctgtgtgaaa ttgttatccg
ctcacaattc cacacattat acgagccgga 4140tgattaattg tcaacagctc
atttcagaat atttgccaga accgttatga tgtcggcgca 4200aaaaacatta
tccagaacgg gagtgcgcct tgagcgacac gaattatgca gtgatttacg
4260acctgcacag ccataccaca gcttccgatg gctgcctgac gccagaagca
ttggtgcacc 4320gtgcagtcga tgataagctg tcaaaccaga tcaattcgcg
ctaactcaca ttaattgcgt 4380tgcgctcact gcccgctttc cagtcgggaa
acctgtcgtg ccagctgcat taatgaatcg 4440gccaacgcgc ggggagaggc
ggtttgcgta ttgggcgcca gggtggtttt tcttttcacc 4500agtgagacgg
gcaacagctg attgcccttc accgcctggc cctgagagag ttgcagcaag
4560cggtccacgc tggtttgccc cagcaggcga aaatcctgtt tgatggtggt
tgacggcggg 4620atataacatg agctgtcttc ggtatcgtcg tatcccacta
ccgagatatc cgcaccaacg 4680cgcagcccgg actcggtaat ggcgcgcatt
gcgcccagcg ccatctgatc gttggcaacc 4740agcatcgcag tgggaacgat
gccctcattc agcatttgca tggtttgttg aaaaccggac 4800atggcactcc
agtcgccttc ccgttccgct atcggctgaa tttgattgcg agtgagatat
4860ttatgccagc cagccagacg cagacgcgcc gagacagaac ttaatgggcc
cgctaacagc 4920gcgatttgct ggtgacccaa tgcgaccaga tgctccacgc
ccagtcgcgt accgtcttca 4980tgggagaaaa taatactgtt gatgggtgtc
tggtcagaga catcaagaaa taacgccgga 5040acattagtgc aggcagcttc
cacagcaatg gcatcctggt catccagcgg atagttaatg 5100atcagcccac
tgacgcgttg cgcgagaaga ttgtgcaccg ccgctttaca ggcttcgacg
5160ccgcttcgtt ctaccatcga caccaccacg ctggcaccca gttgatcggc
gcgagattta 5220atcgccgcga caatttgcga cggcgcgtgc agggccagac
tggaggtggc aacgccaatc 5280agcaacgact gtttgcccgc cagttgttgt
gccacgcggt tgggaatgta attcagctcc 5340gccatcgccg cttccacttt
ttcccgcgtt ttcgcagaaa cgtggctggc ctggttcacc 5400acgcgggaaa
cggtctgata agagacaccg gcatactctg cgacatcgta taacgttact
5460ggtttcacat tcaccaccct gaattgactc tcttccgggc gctatcatgc
cataccgcga 5520aaggttttgc accattcgat ggtgtcaacg taaatgcatg
ccgcttcgcc ttcgcgcgcg 5580aattgatctg ctgcctcgcg cgtttcggtg
atgacggtga aaacctctga cacatgcagc 5640tcccggagac ggtcacagct
tgtctgtaag cggatgccgg gagcagacaa gcccgtcagg 5700gcgcgtcagc
gggtgttggc ggggccggcc tcg 57331260DNAArtificial SequenceDescription
of Artificial Sequence Synthetic primer 12ctctagaaat aatttaactt
taagtaggag auaggtaccc atggcggaca cgttattgat 601358DNAArtificial
SequenceDescription of Artificial Sequence Synthetic primer
13cttcgaattc catttaaatt atttctagag tcattatgag tcatgattta ctaaaggc
581465DNAArtificial SequenceDescription of Artificial Sequence
Synthetic primer 14ctctagaaat aattttagtt aagtataaga aggagatata
ccatggtgaa gaaggtttgg 60cttaa 651551DNAArtificial
SequenceDescription of Artificial Sequence Synthetic primer
15cttcgaattc catttaaatt atttctagag ttatcaggct ttattgtcca c
511636DNAArtificial SequenceDescription of Artificial Sequence
Synthetic primer 16gggtcaatag cggccgccaa ttcgcgcgcg aaggcg
361737DNAArtificial SequenceDescription of Artificial Sequence
Synthetic primer 17tggcgcgcct cctagggcat tacgctgact tgacggg
371817683DNAArtificial SequenceDescription of Artificial Sequence
Synthetic polynucleotide 18ggccacgatg cgtccggcgt agaggatctg
ctcatgtttg acagcttatc atcgatgcat 60aatgtgcctg tcaaatggac gaagcaggga
ttctgcaaac cctatgctac tccgtcaagc 120cgtcaattgt ctgattcgtt
accaattatg acaacttgac ggctacatca ttcacttttt 180cttcacaacc
ggcacggaac tcgctcgggc tggccccggt gcatttttta aatacccgcg
240agaaatagag ttgatcgtca aaaccaacat tgcgaccgac ggtggcgata
ggcatccggg 300tggtgctcaa aagcagcttc gcctggctga tacgttggtc
ctcgcgccag cttaagacgc 360taatccctaa ctgctggcgg aaaagatgtg
acagacgcga cggcgacaag caaacatgct 420gtgcgacgct ggcgatatca
aaattgctgt ctgccaggtg atcgctgatg tactgacaag 480cctcgcgtac
ccgattatcc atcggtggat ggagcgactc gttaatcgct tccatgcgcc
540gcagtaacaa ttgctcaagc agatttatcg ccagcagctc cgaatagcgc
ccttcccctt 600gcccggcgtt aatgatttgc ccaaacaggt cgctgaaatg
cggctggtgc gcttcatccg 660ggcgaaagaa ccccgtattg gcaaatattg
acggccagtt aagccattca tgccagtagg 720cgcgcggacg aaagtaaacc
cactggtgat accattcgcg agcctccgga tgacgaccgt 780agtgatgaat
ctctcctggc gggaacagca aaatatcacc cggtcggcaa acaaattctc
840gtccctgatt tttcaccacc ccctgaccgc gaatggtgag attgagaata
taacctttca 900ttcccagcgg tcggtcgata aaaaaatcga gataaccgtt
ggcctcaatc ggcgttaaac 960ccgccaccag atgggcatta aacgagtatc
ccggcagcag gggatcattt tgcgcttcag 1020ccatactttt catactcccg
ccattcagag aagaaaccaa ttgtccatat tgcatcagac 1080attgccgtca
ctgcgtcttt tactggctct tctcgctaac caaaccggta accccgctta
1140ttaaaagcat tctgtaacaa agcgggacca aagccatgac aaaaacgcgt
aacaaaagtg 1200tctataatca cggcagaaaa gtccacattg attatttgca
cggcgtcaca ctttgctatg 1260ccatagcatt tttatccata agattagcgg
atcctacctg acgcttttta tcgcaactct 1320ctactgtttc tccatacccg
tttttttggg ctagcgaatt cgagctcggt acccaagtct 1380taaactagac
agaatagttg taaactgaaa tcagtccagt tatgctgtga aaaagcatac
1440tggacttttg ttatggctaa agcaaactct tcattttctg aagtgcaaat
tgcccgtcgt 1500attaaagagg ggcgtggcca agggcatggt aaagactata
ttccatggct aacagtacaa 1560gaagttcctt cttcaggtcg ttcccaccgt
atttattctc ataagacggg acgagtccat 1620catttgctat ctgacttaga
gcttgctgtt tttctcagtc ttgagtggga gagcagcgtg 1680ctagatatac
gcgagcagtt ccccttatta cctagtgata ccaggcagat tgcaatagat
1740agtggtatta agcatcctgt tattcgtggt gtagatcagg ttatgtctac
tgatttttta 1800gtggactgca aagatggtcc ttttgagcag tttgctattc
aagtcaaacc tgcagcagcc 1860ttacaagacg agcgtacctt agaaaaacta
gaactagagc gtcgctattg gcagcaaaag 1920caaattcctt ggttcatttt
tactgataaa gaaataaatc ccgtagtaaa agaaaatatt 1980gaatggcttt
attcagtgaa aacagaagaa gtttctgcgg agcttttagc acaactatcc
2040ccattggccc atatcctgca agaaaaagga gatgaaaaca ttatcaatgt
ctgtaagcag 2100gttgatattg cttatgattt ggagttaggc aaaacattga
gtgagatacg agccttaacc 2160gcaaatggtt ttattaagtt caatatttat
aagtctttca gggcaaataa gtgtgcagat 2220ctctgtatta gccaagtagt
gaatatggag gagttgcgct atgtggcaaa ttaatgaggt 2280tgtgctattt
gataatgatc cgtatcgcat tttggctata gaggatggcc aagttgtctg
2340gatgcaaata agcgctgata aaggagttcc acaagctagg gctgagttgt
tgctaatgca 2400gtatttagat gaaggccgct tagttagaac tgatgaccct
tatgtacatc ttgatttaga 2460agagccgtct gtagattctg tcagcttcca
gaagcgcgag gaggattatc gaaaaattct 2520tcctattatt aatagtaagg
atcgtttcga ccctaaagtc agaagcgaac tcgttgagca 2580tgtggtccaa
gaacataagg ttactaaggc tacagtttat aagttgttac gccgttactg
2640gcagcgtggt caaacgccta atgcattaat tcctgactac aaaaacagcg
gtgcaccagg 2700ggaaagacgt tcagcgacag gaacagcaaa gattggccga
gccagagaat atggtaaggg 2760tgaaggaacc aaggtaacgc ccgagattga
acgccttttt aggttgacca tagaaaagca 2820cctgttaaat caaaaaggta
caaagaccac cgttgcctat agacgatttg tggacttgtt 2880tgctcagtat
tttcctcgca ttccccaaga ggattaccca acactacgtc agtttcgtta
2940tttttatgat cgagaatacc ctaaagctca gcgcttaaag tctagagtta
aagcaggggt 3000atataaaaaa gacgtacgac ccttaagtag tacagccact
tctcaggcgt taggccctgg 3060gagtcgttat gagattgatg ccacgattgc
tgatatttat ttagtggatc atcatgatcg 3120ccaaaaaatc ataggaagac
caacgcttta cattgtgatt gatgtgttta gtcggatgat 3180cacgggcttt
tatatcggct ttgaaaatcc gtcttatgtg gtggcgatgc aggcttttgt
3240aaatgcttgc tctgacaaaa cggccatttg tgcccagcat gatattgaga
ttagtagctc 3300agactggccg tgtgtaggtt tgccagatgt gttgctagcg
gaccgtggcg aattaatgag 3360tcatcaggtc gaagccttag tttctagttt
taatgtgcga gtggaaagtg ctccacctag 3420acgtggcgat gctaaaggca
tagtggaaag cacttttaga acactacaag ccgagtttaa 3480gtcctttgca
cctggcattg tagagggcag tcggatcaaa agccatggtg aaacagacta
3540taggttagat gcatctctgt cggtatttga gttcacacaa attattttgc
gtacgatctt 3600attcagaaat aaccatctgg tgatggataa atacgatcga
gatgctgatt ttcctacaga 3660tttaccgtct attcctgtcc agctatggca
atggggtatg cagcatcgta caggtagttt 3720aagggctgtg gagcaagagc
agttgcgagt agcgttactg cctcgccgaa aggtctctat 3780ttcttcattt
ggcgttaatt tgtggggttt gtattactcg gggtcagaga ttctgcgtga
3840gggttggttg cagcggagca ctgatatagc tagacctcaa catttagaag
cggcttatga 3900cccagtgctg gttgatacga tttatttgtt tccgcaagtt
ggcagccgtg tattttggcg 3960ctgtaatctg acggaacgta gtcggcagtt
taaaggtctc tcattttggg aggtttggga 4020tatacaagca caagaaaaac
acaataaagc caatgcgaag caggatgagt taactaaacg 4080cagggagctt
gaggcgttta ttcagcaaac cattcagaaa gcgaataagt taacgcccag
4140tactactgag cccaaatcaa cacgcattaa gcagattaaa actaataaaa
aagaagccgt 4200gacctcggag cgtaaaaaac gtgcggagca tttgaagcca
agctcttcag gtgatgaggc 4260taaagttatt cctttcaacg cagtggaagc
ggatgatcaa gaagattaca gcctacccac 4320atacgtgcct gaattatttc
aggatccacc agaaaaggat gagtcatgag tgctacccgg 4380attcaagcag
tttatcgtga tacgggggta gaggcttatc gtgataatcc ttttatcgag
4440gccttaccac cattacaaga gtcagtgaat agtgctgcat cactgaaatc
ctctttacag 4500cttacttcct ctgacttgca aaagtcccgt gttatcagag
ctcataccat ttgtcgtatt 4560ccagatgact attttcagcc attaggtacg
catttgctac taagtgagcg tatttcggtc 4620atgattcgag gtggctacgt
aggcagaaat cctaaaacag gagatttaca aaagcattta 4680caaaatggtt
atgagcgtgt tcaaacggga gagttggaga catttcgctt tgaggaggca
4740cgatctacgg cacaaagctt attgttaatt ggttgttctg gtagtgggaa
gacgacctct 4800cttcatcgta ttctagccac gtatcctcag gtgatttacc
atcgtgaact caatgtagag 4860caggtggtgt atttgaaaat agactgctcg
cataatggtt cgctaaaaga aatctgcttg 4920aattttttca gagcgttgga
tcgagccttg ggctcgaact atgagcgtcg ttatggctta 4980aaacgtcatg
gtatagaaac catgttggct ttgatgtcgc aaatagccaa tgcacatgct
5040ttagggttgt tggttattga tgaaattcag catttaagcc gctctcgttc
gggtggatct 5100caagagatgc tgaacttttt tgtgacgatg gtgaatatta
ttggcgtacc agtgatgttg 5160attggtaccc ctaaagcacg agagattttt
gaggctgatt tgcggtctgc acgtagaggg 5220gcagggtttg gagctatatt
ctgggatcct atacaacaaa cgcaacgtgg aaagcccaat 5280caagagtgga
tcgcttttac ggataatctt tggcaattac agcttttaca acgcaaagat
5340gcgctgttat cggatgaggt ccgtgatgtg tggtatgagc taagccaagg
agtgatggac 5400attgtagtaa aactttttgt actcgctcag ctccgtgcgc
tagctttagg caatgagcgt 5460attaccgctg gtttattgcg gcaagtgtat
caagatgagt taaagcctgt gcaccccatg 5520ctagaggcat tacgctcggg
tatcccagaa cgcattgctc gttattctga tctagtcgtt 5580cccgagattg
ataaacggtt aatccaactt cagctagata tcgcagcgat acaagaacaa
5640acaccagaag aaaaagccct tcaagagtta gataccgaag atcagcgtca
tttatatctg 5700atgctgaaag aggattacga ttcaagcctg ttaattccca
ctattaaaaa agcgtttagc 5760cagaatccaa cgatgacaag acaaaagtta
ctgcctcttg ttttgcagtg gttgatggaa 5820ggcgaaacgg tagtgtcaga
actagaaaag ccctccaaga gtaaaaaggt ttcggctata 5880aaggtagtca
agcccagcga ctgggatagc ttgcctgata cggatttacg ttatatctat
5940tcacaacgcc aacctgaaaa aaccatgcat gaacggttaa aagggaaagg
ggtaatagtg 6000gatatggcga gcttatttaa acaagcaggt tagccatgag
aaactttcct gttccgtact 6060cgaatgagct gatttatagc actattgcac
gggcaggcgt ttatcaaggg attgttagtc 6120ctaagcagct gttggatgag
gtgtatggca accgcaaggt ggtcgctacc ttaggtctgc 6180cctcgcattt
aggtgtgata gcaagacatc tacatcaaac aggacgttac gctgttcagc
6240agcttattta tgagcatacc ttattccctt tatatgctcc gtttgtaggc
aaggagcgcc 6300gagacgaagc tattcggtta atggagtacc aagcgcaagg
tgcggtgcat ttaatgctag 6360gagtcgctgc ttctagagtt aagagcgata
accgctttag atactgccct gattgcgttg 6420ctcttcagct aaataggtat
ggggaagcct tttggcaacg agattggtat ttgcccgctt 6480tgccatattg
tccaaaacac ggtgctttag tcttctttga tagagctgta gatgatcacc
6540gacatcaatt ttgggctttg ggtcatactg agctgctttc agactacccc
aaagactccc 6600tatctcaatt aacagcacta gctgcttata tagcccctct
gttagatgct ccacgagcgc 6660aagagctttc cccaagcctt gagcagtgga
cgctgtttta tcagcgctta gcgcaggatc 6720tagggctaac caaaagcaag
cacattcgtc atgacttggt ggcggagaga gtgaggcaga 6780cttttagtga
tgaggcacta gagaaactgg atttaaagtt ggcagagaac aaggacacgt
6840gttggctgaa aagtatattc cgtaagcata gaaaagcctt tagttattta
cagcatagta 6900ttgtgtggca agccttattg ccaaaactaa cggttataga
agcgctacag caggcaagtg 6960ctcttactga gcactctata acgacaagac
ctgttagcca gtctgtgcaa cctaactctg 7020aagatttatc tgttaagcat
aaagactggc agcaactagt gcataaatac caaggaatta 7080aggcggcaag
acagtcttta gagggtgggg tgctatacgc ttggctttac cgacatgaca
7140gggattggct agttcactgg aatcaacagc atcaacaaga gcgtctggca
cccgccccta 7200gagttgattg gaaccaaaga gatcgaattg ctgtacgaca
actattaaga atcataaagc 7260gtctagatag tagccttgat cacccaagag
cgacatcgag ctggctgtta aagcaaactc 7320ctaacggaac ctctcttgca
aaaaatctac agaaactgcc tttggtagcg ctttgcttaa 7380agcgttactc
agagagtgtg gaagattatc aaattagacg gattagccaa gcttttatta
7440agcttaaaca ggaagatgtt gagcttaggc gctggcgatt attaagaagt
gcaacgttat 7500ctaaagagcg gataactgag gaagcacaaa gattcttgga
aatggtttat ggggaagagt 7560gagtggttag gctagctaca tttaatgaca
atgtgcaggt tgtacatatt ggtcatttat 7620tccgtaactc gggtcataag
gagtggcgta tttttgtttg gtttaatcca atgcaagaac 7680ggaaatggac
tcgatttact catttgcctt tattaagtcg agctaaggtg gttaacagta
7740caacaaagca aataaataag gcggatcgtg tgattgagtt tgaagcatcg
gatcttcaac 7800gagccaaaat aatcgatttt cctaatctct cgtcctttgc
ttccgtacgc aacaaggatg 7860gagcgcagag ttcatttatt tacgaagctg
aaacaccata tagcaagact cgttatcaca 7920tcccacagtt agagctagct
cggtcattat ttttagggga tcctctagag tcgacctgca 7980ggcatgcaag
cttggctgtt ttggcggatg agagaagatt ttcagcctga tacagattaa
8040atcagaacgc agaagcggtc tgataaaaca gaatttgcct ggcggcagta
gcgcggtggt 8100cccacctgac cccatgccga actcagaagt gaaacgccgt
agcgccgatg gtagtgtggg 8160gtctccccat gcgagagtag ggaactgcca
ggcatcaaat aaaacgaaag gctcagtcga 8220aagactgggc ctttcgtttt
atctgttgtt tgtcggtgaa cgctctcctg agtaggacaa 8280atccgccggg
agcggatttg aacgttgcga agcaacggcc cggagggtgg cgggcaggac
8340gcccgccata aactgccagg catcaaatta agcagaaggc catcctgacg
gatggccttt 8400ttgcgtttct acaaactctt ttgtttattt ttctaaatac
attcaaatat gtatccgctc 8460atgagacaat aaccctgata aatgcttcaa
taatattgaa aaaggaagag tatgagtatt 8520caacatttcc gtgtcgccct
tattcccttt tttgcggcat tttgccttcc tgtttttgct 8580cacccagaaa
cgctggtgaa agtaaaagat gctgaagatc agttgggtgc acgagtgggt
8640tacatcgaac tggatctcaa cagcggtaag atccttgaga gttttcgccc
cgaagaacgt 8700tttccaatga tgagcacttt taaagttctg ctatgtggcg
cggtattatc ccgtgttgac 8760gccgggcaag agcaactcgg tcgccgcata
cactattctc agaatgactt ggttgagtac 8820tcaccagtca cagaaaagca
tcttacggat ggcatgacag taagagaatt atgcagtgct 8880gccataacca
tgagtgataa cactgcggcc aacttacttc tgacaacgat cggaggaccg
8940aaggagctaa ccgctttttt gcacaacatg ggggatcatg taactcgcct
tgatcgttgg 9000gaaccggagc tgaatgaagc cataccaaac gacgagcgtg
acaccacgat gcctgcagca 9060atggcaacaa cgttgcgcaa actattaact
ggcgaactac ttactctagc ttcccggcaa 9120caattaatag actggatgga
ggcggataaa gttgcaggac cacttctgcg ctcggccctt 9180ccggctggct
ggtttattgc tgataaatct ggagccggtg agcgtgggtc tcgcggtatc
9240attgcagcac tggggccaga tggtaagccc tcccgtatcg tagttatcta
cacgacgggg 9300agtcaggcaa ctatggatga acgaaataga cagatcgctg
agataggtgc ctcactgatt 9360aagcattggt aactgtcaga ccaagtttac
tcatatatac tttagattga tttacgcgcc 9420ctgtagcggc gcattaagcg
cggcgggtgt ggtggttacg cgcagcgtga ccgctacact 9480tgccagcgcc
ctagcgcccg ctcctttcgc tttcttccct tcctttctcg ccacgttcgc
9540cgccggccag cctcgcagag caggattccc gttgagcacc gccaggtgcg
aataagggac 9600agtgaagaag gaacacccgc tcgcgggtgg gcctacttca
cctatcctgc ccggcggcat 9660caccggcgcc acaggtgcgg ttgctggcgc
ctatatcgcc gacatcaccg atggggaaga 9720tcgggctcgc cacttcgggc
tcatgagcgc ttgtttcggc gtgggtatgg tggcaggccc 9780cgtggccggg
ggactgttgg gcgccatctc cttgcatgca ccattccttg cggcggcggt
9840gctcaacggc ctcaacctac tactgggctg cttcctaatg caggagtcgc
ataagggaga 9900gcgtcgatcc
ccgacagtaa gacgggtaag cctgttgatg ataccgctgc cttactgggt
9960gcattagcca gtctgaatga cctgtcacgg gataatccga agtggtcaga
ctggaaaatc 10020agagggcagg aactgctgaa cagcaaaaag tcagatagca
ccacatagca gacccgccat 10080aaaacgccct gagaagcccg tgacgggctt
ttcttgtatt atgggtagtt tccttgcatg 10140aatccataaa aggcgcctgt
agtgccattt acccccattc actgccagag ccgtgagcgc 10200agcgaactga
atgtcacgaa aaagacagcg actcaggtgc ctgatggtcg gagacaaaag
10260gaatattcag cgatttgccc gagcttgcga gggtgctact taagccttta
gggttttaag 10320gtctgttttg tagaggagca aacagcgttt gcgacatcct
tttgtaatac tgcggaactg 10380actaaagtag tgagttatac acagggctgg
gatctattct ttttatcttt ttttattctt 10440tctttattct ataaattata
accacttgaa tataaacaaa aaaaacacac aaaggtctag 10500cggaatttac
agagggtcta gcagaattta caagttttcc agcaaaggtc tagcagaatt
10560tacagatacc cacaactcaa aggaaaagga ctagtaatta tcattgacta
gcccatctca 10620attggtatag tgattaaaat cacctagacc aattgagatg
tatgtctgaa ttagttgttt 10680tcaaagcaaa tgaactagcg attagtcgct
atgacttaac ggagcatgaa accaagctaa 10740ttttatgctg tgtggcacta
ctcaacccca cgattgaaaa ccctacaagg aaagaacgga 10800cggtatcgtt
cacttataac caatacgttc agatgatgaa catcagtagg gaaaatgctt
10860atggtgtatt agctaaagca accagagagc tgatgacgag aactgtggaa
atcaggaatc 10920ctttggttaa aggctttgag attttccagt ggacaaacta
tgccaagttc tcaagcgaaa 10980aattagaatt agtttttagt gaagagatat
tgccttatct tttccagtta aaaaaattca 11040taaaatataa tctggaacat
gttaagtctt ttgaaaacaa atactctatg aggatttatg 11100agtggttatt
aaaagaacta acacaaaaga aaactcacaa ggcaaatata gagattagcc
11160ttgatgaatt taagttcatg ttaatgcttg aaaataacta ccatgagttt
aaaaggctta 11220accaatgggt tttgaaacca ataagtaaag atttaaacac
ttacagcaat atgaaattgg 11280tggttgataa gcgaggccgc ccgactgata
cgttgatttt ccaagttgaa ctagatagac 11340aaatggatct cgtaaccgaa
cttgagaaca accagataaa aatgaatggt gacaaaatac 11400caacaaccat
tacatcagat tcctacctac ataacggact aagaaaaaca ctacacgatg
11460ctttaactgc aaaaattcag ctcaccagtt ttgaggcaaa atttttgagt
gacatgcaaa 11520gtaagtatga tctcaatggt tcgttctcat ggctcacgca
aaaacaacga accacactag 11580agaacatact ggctaaatac ggaaggatct
gaggttctta tggctcttgt atctatcagt 11640gaagcatcaa gactaacaaa
caaaagtaga acaactgttc accgttacat atcaaaggga 11700aaactgtcca
tatgcacaga tgaaaacggt gtaaaaaaga tagatacatc agagctttta
11760cgagtttttg gtgcatttaa agctgttcac catgaacaga tcgacaatgt
aacagatgaa 11820cagcatgtaa cacctaatag aacaggtgaa accagtaaaa
caaagcaact agaacatgaa 11880attgaacacc tgagacaact tgttacagct
caacagtcac acatagacag cctgaaacag 11940gcgatgctgc ttatcgaatc
aaagctgccg acaacacggg agccagtgac gcctcccgtg 12000gggaaaaaat
catggcaatt ctggaagaaa tagcgctttc agcctgtggg cggacaaaat
12060agttgggaac tgggaggggt ggaaatggag tttttaagga ttatttaggg
aagagtgaca 12120aaatagatgg gaactgggtg tagcgtcgta agctaatacg
aaaattaaaa atgacaaaat 12180agtttggaac tagatttcac ttatctggtt
ggtcgacact agtattaccc tgttatccct 12240agatttaaat gatatcggat
cctagtaagc cacgttttaa ttaatcagat gggtcaatag 12300cggccgccaa
ttcgcgcgcg aaggcgaagc ggcatgcatt tacgttgaca ccatcgaatg
12360gtgcaaaacc tttcgcggta tggcatgata gcgcccggaa gagagtcaat
tcagggtggt 12420gaatgtgaaa ccagtaacgt tatacgatgt cgcagagtat
gccggtgtct cttatcagac 12480cgtttcccgc gtggtgaacc aggccagcca
cgtttctgcg aaaacgcggg aaaaagtgga 12540agcggcgatg gcggagctga
attacattcc caaccgcgtg gcacaacaac tggcgggcaa 12600acagtcgttg
ctgattggcg ttgccacctc cagtctggcc ctgcacgcgc cgtcgcaaat
12660tgtcgcggcg attaaatctc gcgccgatca actgggtgcc agcgtggtgg
tgtcgatggt 12720agaacgaagc ggcgtcgaag cctgtaaagc ggcggtgcac
aatcttctcg cgcaacgcgt 12780cagtgggctg atcattaact atccgctgga
tgaccaggat gccattgctg tggaagctgc 12840ctgcactaat gttccggcgt
tatttcttga tgtctctgac cagacaccca tcaacagtat 12900tattttctcc
catgaagacg gtacgcgact gggcgtggag catctggtcg cattgggtca
12960ccagcaaatc gcgctgttag cgggcccatt aagttctgtc tcggcgcgtc
tgcgtctggc 13020tggctggcat aaatatctca ctcgcaatca aattcagccg
atagcggaac gggaaggcga 13080ctggagtgcc atgtccggtt ttcaacaaac
catgcaaatg ctgaatgagg gcatcgttcc 13140cactgcgatg ctggttgcca
acgatcagat ggcgctgggc gcaatgcgcg ccattaccga 13200gtccgggctg
cgcgttggtg cggatatctc ggtagtggga tacgacgata ccgaagacag
13260ctcatgttat atcccgccgt caaccaccat caaacaggat tttcgcctgc
tggggcaaac 13320cagcgtggac cgcttgctgc aactctctca gggccaggcg
gtgaagggca atcagctgtt 13380gcccgtctca ctggtgaaaa gaaaaaccac
cctggcgccc aatacgcaaa ccgcctctcc 13440ccgcgcgttg gccgattcat
taatgcagct ggcacgacag gtttcccgac tggaaagcgg 13500gcagtgagcg
caacgcaatt aatgtgagtt agcgcgaatt gatctggttt gacagcttat
13560catcgactgc acggtgcacc aatgcttctg gcgtcaggca gccatcggaa
gctgtggtat 13620ggctgtgcag gtcgtaaatc actgcataat tcgtgtcgct
caaggcgcac tcccgttctg 13680gataatgttt tttgcgccga catcataacg
gttctggcaa atattctgaa atgagctgtt 13740gacaattaat catccggctc
gtataatgtg tggaattgtg agcggataac aatttcacac 13800aggaaacagc
gccgctgaga aaaagcgaag cggcactgct ctttaacaat ttatcagaca
13860atctgtgtgg gcactcgacc ggaattatcg attaacttta ttattaaaaa
ttaaagaggt 13920atatattaat gtatcgatta aataaggagg aataaaccat
ggcggacacg ttattgattc 13980tgggtgatag cctgagcgcc gggtatcgaa
tgtctgccag cgcggcctgg cctgccttgt 14040tgaatgataa gtggcagagt
aaaacgtcgg tagttaatgc cagcatcagc ggcgacacct 14100cgcaacaagg
actggcgcgc cttccggctc tgctgaaaca gcatcagccg cgttgggtgc
14160tggttgaact gggcggcaat gacggtttgc gtggttttca gccacagcaa
accgagcaaa 14220cgctgcgcca gattttgcag gatgtcaaag ccgccaacgc
tgaaccattg ttaatgcaaa 14280tacgtctgcc tgcaaactat ggtcgccgtt
ataatgaagc ctttagcgcc atttacccca 14340aactcgccaa agagtttgat
gttccgctgc tgcccttttt tatggaagag gtctacctca 14400agccacaatg
gatgcaggat gacggtattc atcccaaccg cgacgcccag ccgtttattg
14460ccgactggat ggcgaagcag ttgcagcctt tagtaaatca tgactcataa
tgactctaga 14520aataatttta gttaagtata agaaggagat ataccatggt
gaagaaggtt tggcttaacc 14580gttatcccgc ggacgttccg acggagatca
accctgaccg ttatcaatct ctggtagata 14640tgtttgagca gtcggtcgcg
cgctacgccg atcaacctgc gtttgtgaat atgggggagg 14700taatgacctt
ccgcaagctg gaagaacgca gtcgcgcgtt tgccgcttat ttgcaacaag
14760ggttggggct gaagaaaggc gatcgcgttg cgttgatgat gcctaattta
ttgcaatatc 14820cggtggcgct gtttggcatt ttgcgtgccg ggatgatcgt
cgtaaacgtt aacccgttgt 14880ataccccgcg tgagcttgag catcagctta
acgatagcgg cgcatcggcg attgttatcg 14940tgtctaactt tgctcacaca
ctggaaaaag tggttgataa aaccgccgtt cagcacgtaa 15000ttctgacccg
tatgggcgat cagctatcta cggcaaaagg cacggtagtc aatttcgttg
15060ttaaatacat caagcgtttg gtgccgaaat accatctgcc agatgccatt
tcatttcgta 15120gcgcactgca taacggctac cggatgcagt acgtcaaacc
cgaactggtg ccggaagatt 15180tagcttttct gcaatacacc ggcggcacca
ctggtgtggc gaaaggcgcg atgctgactc 15240accgcaatat gctggcgaac
ctggaacagg ttaacgcgac ctatggtccg ctgttgcatc 15300cgggcaaaga
gctggtggtg acggcgctgc cgctgtatca catttttgcc ctgaccatta
15360actgcctgct gtttatcgaa ctgggtgggc agaacctgct tatcactaac
ccgcgcgata 15420ttccagggtt ggtaaaagag ttagcgaaat atccgtttac
cgctatcacg ggcgttaaca 15480ccttgttcaa tgcgttgctg aacaataaag
agttccagca gctggatttc tccagtctgc 15540atctttccgc aggcggaggg
atgccagtgc agcaagtggt ggcagagcgt tgggtgaaac 15600tgacaggaca
gtatctgctg gaaggctatg gccttaccga gtgtgcgccg ctggtcagcg
15660ttaacccata tgatattgat tatcatagtg gtagcatcgg tttgccggtg
ccgtcgacgg 15720aagccaaact ggtggatgat gatgataatg aagtaccacc
gggtcaaccg ggtgagcttt 15780gtgtcaaagg accgcaggtg atgctgggtt
actggcagcg tccggatgct acagatgaga 15840tcatcaaaaa tggctggtta
cacaccggcg acatcgcggt gatggatgaa gaagggttcc 15900tgcgcattgt
cgatcgtaaa aaagacatga ttctggtttc cggttttaac gtctatccca
15960acgagattga agatgtcgtc atgcagcatc ctggcgtaca ggaagtcgcg
gctgttggcg 16020taccttccgg ctccagtggt gaagcggtga aaatcttcgt
agtgaaaaaa gatccatcgc 16080ttaccgaaga gtcactggtg accttttgcc
gccgtcagct cacgggctac aaagtaccga 16140agctggtgga gtttcgtgat
gagttaccga aatctaacgt cggaaaaatt ttgcgacgag 16200aattacgtga
cgaagcgcgc ggcaaagtgg acaataaagc ctgataactc tagaaataat
16260ttaaatggaa ttcgaagctt gggcccgaac aaaaactcat ctcagaagag
gatctgaata 16320gcgccgtcga ccatcatcat catcatcatt gagtttaaac
ggtctccagc ttggctgttt 16380tggcggatga gagaagattt tcagcctgat
acagattaaa tcagaacgca gaagcggtct 16440gataaaacag aatttgcctg
gcggcagtag cgcggtggtc ccacctgacc ccatgccgaa 16500ctcagaagtg
aaacgccgta gcgccgatgg tagtgtgggg tctccccatg cgagagtagg
16560gaactgccag gcatcaaata aaacgaaagg ctcagtcgaa agactgggcc
tttcgtttta 16620tctgttgttt gtcggtgaac gctctcctga ttaattaaga
cgtctaagaa accattatta 16680tcatgacatt aacctataaa aataggcgta
tcacgaggcc ctttcgtctt caagaatttt 16740ataaaccgtg gagcgggcaa
tactgagctg atgagcaatt tccgttgcac cagtgccctt 16800ctgatgaagc
gtcagcacga cgttcctgtc cacggtacgc ctgcggccaa atttgattcc
16860tttcagcttt gcttcctgtc ggccctcatt cgtgcgctct aggatcctcc
ggcgttcagc 16920ctgtgccaca gccgacagga tggtgaccac catttgcccc
atatcaccgt cggtactgat 16980cccgtcgtca ataaaccgaa ccgctacacc
ctgagcatca aactctttta tcagttggat 17040catgtcggcg gtgtcgcggc
caagacggtc gagcttcttc accagaatga catcaccttc 17100ctccaccttc
atcctcagca aatccagccc ttcccgatct gttgaactgc cggatgcctt
17160gtcggtaaag atgcggttag cttttacccc tgcatctttg agcgctgagg
tctgcctcgt 17220gaagaaggtg ttgctgactc ataccaggcc tgaatcgccc
catcatccag ccagaaagtg 17280agggagccac ggttgatgag agctttgttg
taggtggacc agttggtgat tttgaacttt 17340tgctttgcca cggaacggtc
tgcgttgtcg ggaagatgcg tgatctgatc cttcaactca 17400gcaaaagttc
gatttattca acaaagccgc cgtcccgtca agtcagcgta atgccctagg
17460aggcgcgcca cggccgcgtc gaccccacgc ccctctttaa tacgacgggc
aatttgcact 17520tcagaaaatg aagagtttgc tttagccata acaaaagtcc
agtatgcttt ttcacagcat 17580aactggactg atttcagttt acaactattc
tgtctagttt aagactttat tgtcatagtt 17640tagatctatt ttgttcagtt
taagacttta ttgtccgccc aca 176831920DNAArtificial
SequenceDescription of Artificial Sequence Synthetic primer
19gatgctggtg gcgaagctgt 202023DNAArtificial SequenceDescription of
Artificial Sequence Synthetic primer 20gttgcgacgg tggtacgcat aac
232130DNAArtificial SequenceDescription of Artificial Sequence
Synthetic primer 21cccagatcag ttttatgatt gcctcgctgg
302220DNAArtificial SequenceDescription of Artificial Sequence
Synthetic primer 22atcatgaaac gtctcggaac 202328DNAArtificial
SequenceDescription of Artificial Sequence Synthetic primer
23cctcgagtta cttgcgggtt cgggcgcg 28245199DNAArtificial
SequenceDescription of Artificial Sequence Synthetic polynucleotide
24cactatacca attgagatgg gctagtcaat gataattact agtccttttc ctttgagttg
60tgggtatctg taaattctgc tagacctttg ctggaaaact tgtaaattct gctagaccct
120ctgtaaattc cgctagacct ttgtgtgttt tttttgttta tattcaagtg
gttataattt 180atagaataaa gaaagaataa aaaaagataa aaagaataga
tcccagccct gtgtataact 240cactacttta gtcagttccg cagtattaca
aaaggatgtc gcaaacgctg tttcagtaca 300ctctctcaat acgaataaac
ggctcagaaa tgagccgttt attttttcta cccatatcct 360tgaagcggtg
ttataatgcc gcgccctcga tatggggatt tttaacgacc tgattttcgg
420gtctcagtag tagttgacat tagcggagca ctaaaccatg aaacgtctcg
gaaccctgga 480cgcctcctgg ctggcggttg aatctgaaga caccccgatg
catgtgggta cgcttcagat 540tttctcactg ccggaaggcg caccagaaac
cttcctgcgt gacatggtca ctcgaatgaa 600agaggccggc gatgtggcac
caccctgggg atacaaactg gcctggtctg gtttcctcgg 660gcgcgtgatc
gccccggcct ggaaagtcga taaggatatc gatctggatt atcacgtccg
720gcactcagcc ctgcctcgcc ccggcgggga gcgcgaactg ggtattctgg
tatcccgact 780gcactctaac cccctggatt tttcccgccc tctttgggaa
tgccacgtta ttgaaggcct 840ggagaataac cgttttgccc tttacaccaa
aatgcaccac tcgatgattg acggcatcag 900cggcgtgcga ctgatgcaga
gggtgctcac caccgatccc gaacgctgca atatgccacc 960gccctggacg
gtacgcccac accaacgccg tggtgcaaaa accgacaaag aggccagcgt
1020gcccgcagcg gtttcccagg caatggacgc cctgaagctc caggcagaca
tggcccccag 1080gctgtggcag gccggcaatc gcctggtgca ttcggttcga
cacccggaag acggactgac 1140cgcgcccttc actggaccgg tttcggtgct
caatcaccgg gttaccgcgc agcgacgttt 1200tgccacccag cattatcaac
tggaccggct gaaaaacctg gcccatgctt ccggcggttc 1260cttgaacgac
atcgtgcttt acctgtgtgg caccgcattg cggcgctttc tggctgagca
1320gaacaatctg ccagacaccc cgctgacggc tggtataccg gtgaatatcc
ggccggcaga 1380cgacgagggt acgggcaccc agatcagttt tatgattgcc
tcgctggcca ccgacgaagc 1440tgatccgttg aaccgcctgc aacagatcaa
aacctcgacc cgacgggcca aggagcacct 1500gcagaaactt ccaaaaagtg
ccctgaccca gtacaccatg ctgctgatgt caccctacat 1560tctgcaattg
atgtcaggtc tcggggggag gatgcgacca gtcttcaacg tgaccatttc
1620caacgtgccc ggcccggaag gcacgctgta ttatgaagga gcccggcttg
aggccatgta 1680tccggtatcg ctaatcgctc acggcggcgc cctgaacatc
acctgcctga gctatgccgg 1740atcgctgaat ttcggtttta ccggctgtcg
ggatacgctg ccgagcatgc agaaactggc 1800ggtttatacc ggtgaagctc
tggatgagct ggaatcgctg attctgccac ccaagaagcg 1860cgcccgaacc
cgcaagtaac tcgagatctg cagctggtac catatgggaa ttcacccgct
1920gacgagctta gtaaagccct cgctagattt taatgcggat gttgcgatta
cttcgccaac 1980tattgcgata acaagaaaaa gccagccttt catgatatat
ctcccaattt gtgtagggct 2040tattatgcac gcttaaaaat aataaaagca
gacttgacct gatagtttgg ctgtgagcaa 2100ttatgtgctt agtgcatcta
acgcttgagt taagccgcgc cgcgaagcgg cgtcggcttg 2160aacgaattgt
tagacattat ttgccgacta ccttggtgat ctcgcctttc acgtagtgga
2220caaattcttc caactgatct gcgcgcgagg ccaagcgatc ttcttcttgt
ccaagataag 2280cctgtctagc ttcaagtatg acgggctgat actgggccgg
caggcgctcc attgcccagt 2340cggcagcgac atccttcggc gcgattttgc
cggttactgc gctgtaccaa atgcgggaca 2400acgtaagcac tacatttcgc
tcatcgccag cccagtcggg cggcgagttc catagcgtta 2460aggtttcatt
tagcgcctca aatagatcct gttcaggaac cggatcaaag agttcctccg
2520ccgctggacc taccaaggca acgctatgtt ctcttgcttt tgtcagcaag
atagccagat 2580caatgtcgat cgtggctggc tcgaagatac ctgcaagaat
gtcattgcgc tgccattctc 2640caaattgcag ttcgcgctta gctggataac
gccacggaat gatgtcgtcg tgcacaacaa 2700tggtgacttc tacagcgcgg
agaatctcgc tctctccagg ggaagccgaa gtttccaaaa 2760ggtcgttgat
caaagctcgc cgcgttgttt catcaagcct tacggtcacc gtaaccagca
2820aatcaatatc actgtgtggc ttcaggccgc catccactgc ggagccgtac
aaatgtacgg 2880ccagcaacgt cggttcgaga tggcgctcga tgacgccaac
tacctctgat agttgagtcg 2940atacttcggc gatcaccgct tccctcatga
tgtttaactt tgttttaggg cgactgccct 3000gctgcgtaac atcgttgctg
ctccataaca tcaaacatcg acccacggcg taacgcgctt 3060gctgcttgga
tgcccgaggc atagactgta ccccaaaaaa acagtcataa caagccatga
3120aaaccgccac tgcgccgtta ccaccgctgc gttcggtcaa ggttctggac
cagttgcgtg 3180agcgcatacg ctacttgcat tacagcttac gaaccgaaca
ggcttatgtc cactgggttc 3240gtgccttcat ccgtttccac ggtgtgcgtc
acccggcaac cttgggcagc agcgaagtcg 3300aggcatttct gtcctggctg
gcgaacgagc gcaaggtttc ggtctccacg catcgtcagg 3360cattggcggc
cttgctgttc ttctacggca aggtgctgtg cacggatctg ccctggcttc
3420aggagatcgg aagacctcgg ccgtcgcggc gcttgccggt ggtgctgacc
ccggatgaag 3480tggttcgcat cctcggtttt ctggaaggcg agcatcgttt
gttcgcccag cttctgtatg 3540gaacgggcat gcggatcagt gagggtttgc
aactgcgggt caaggatctg gatttcgatc 3600acggcacgat catcgtgcgg
gagggcaagg gctccaagga tcgggccttg atgttacccg 3660agagcttggc
acccagcctg cgcgagcagg ggaattaatt cccacgggtt ttgctgcccg
3720caaacgggct gttctggtgt tgctagtttg ttatcagaat cgcagatccg
gcttcagccg 3780gtttgccggc tgaaagcgct atttcttcca gaattgccat
gattttttcc ccacgggagg 3840cgtcactggc tcccgtgttg tcggcagctt
tgattcgata agcagcatcg cctgtttcag 3900gctgtctatg tgtgactgtt
gagctgtaac aagttgtctc aggtgttcaa tttcatgttc 3960tagttgcttt
gttttactgg tttcacctgt tctattaggt gttacatgct gttcatctgt
4020tacattgtcg atctgttcat ggtgaacagc tttgaatgca ccaaaaactc
gtaaaagctc 4080tgatgtatct atctttttta caccgttttc atctgtgcat
atggacagtt ttccctttga 4140tatgtaacgg tgaacagttg ttctactttt
gtttgttagt cttgatgctt cactgataga 4200tacaagagcc ataagaacct
cagatccttc cgtatttagc cagtatgttc tctagtgtgg 4260ttcgttgttt
ttgcgtgagc catgagaacg aaccattgag atcatactta ctttgcatgt
4320cactcaaaaa ttttgcctca aaactggtga gctgaatttt tgcagttaaa
gcatcgtgta 4380gtgtttttct tagtccgtta tgtaggtagg aatctgatgt
aatggttgtt ggtattttgt 4440caccattcat ttttatctgg ttgttctcaa
gttcggttac gagatccatt tgtctatcta 4500gttcaacttg gaaaatcaac
gtatcagtcg ggcggcctcg cttatcaacc accaatttca 4560tattgctgta
agtgtttaaa tctttactta ttggtttcaa aacccattgg ttaagccttt
4620taaactcatg gtagttattt tcaagcatta acatgaactt aaattcatca
aggctaatct 4680ctatatttgc cttgtgagtt ttcttttgtg ttagttcttt
taataaccac tcataaatcc 4740tcatagagta tttgttttca aaagacttaa
catgttccag attatatttt atgaattttt 4800ttaactggaa aagataaggc
aatatctctt cactaaaaac taattctaat ttttcgcttg 4860agaacttggc
atagtttgtc cactggaaaa tctcaaagcc tttaaccaaa ggattcctga
4920tttccacagt tctcgtcatc agctctctgg ttgctttagc taatacacca
taagcatttt 4980ccctactgat gttcatcatc tgagcgtatt ggttataagt
gaacgatacc gtccgttctt 5040tccttgtagg gttttcaatc gtggggttga
gtagtgccac acagcataaa attagcttgg 5100tttcatgctc cgttaagtca
tagcgactaa tcgctagttc atttgctttg aaaacaacta 5160attcagacat
acatctcaat tggtctaggt gattttaat 519925169DNAArtificial
SequenceDescription of Artificial Sequence Synthetic polynucleotide
25gctgtttcag tacactctct caatacgaat aaacggctca gaaatgagcc gtttattttt
60tctacccata tccttgaagc ggtgttataa tgccgcgccc tcgatatggg gatttttaac
120gacctgattt tcgggtctca gtagtagttg acattagcgg agcactaaa
1692642DNAArtificial SequenceDescription of Artificial Sequence
Synthetic primer 26aaaggatgtc gcaaacgctg tttcagtaca ctctctcaat ac
422734DNAArtificial SequenceDescription of Artificial Sequence
Synthetic primer 27gagctcggat ccatggttta gtgctccgct aatg
34285903DNAArtificial SequenceDescription of Artificial Sequence
Synthetic polynucleotide 28cactatacca attgagatgg gctagtcaat
gataattact agtccttttc ctttgagttg 60tgggtatctg taaattctgc tagacctttg
ctggaaaact tgtaaattct gctagaccct 120ctgtaaattc cgctagacct
ttgtgtgttt tttttgttta tattcaagtg gttataattt 180atagaataaa
gaaagaataa aaaaagataa aaagaataga tcccagccct gtgtataact
240cactacttta gtcagttccg cagtattaca aaaggatgtc gcaaacgctg
tttgctcctc 300tacaaaacag accttaaaac cctaaaggcg tcggcatccg
cttacagaca agctgtgacc 360gtctccggga gctgcatgtg tcagaggttt
tcaccgtcat caccgaaacg cgcgaggcag 420cagatcaatt cgcgcgcgaa
ggcgaagcgg catgcattta cgttgacacc atcgaatggt 480gcaaaacctt
tcgcggtatg gcatgatagc gcccggaaga gagtcaattc agggtggtga
540atgtgaaacc agtaacgtta tacgatgtcg cagagtatgc cggtgtctct
tatcagaccg 600tttcccgcgt ggtgaaccag gccagccacg tttctgcgaa
aacgcgggaa aaagtggaag 660cggcgatggc ggagctgaat
tacattccca accgcgtggc acaacaactg gcgggcaaac 720agtcgttgct
gattggcgtt gccacctcca gtctggccct gcacgcgccg tcgcaaattg
780tcgcggcgat taaatctcgc gccgatcaac tgggtgccag cgtggtggtg
tcgatggtag 840aacgaagcgg cgtcgaagcc tgtaaagcgg cggtgcacaa
tcttctcgcg caacgcgtca 900gtgggctgat cattaactat ccgctggatg
accaggatgc cattgctgtg gaagctgcct 960gcactaatgt tccggcgtta
tttcttgatg tctctgacca gacacccatc aacagtatta 1020ttttctccca
tgaagacggt acgcgactgg gcgtggagca tctggtcgca ttgggtcacc
1080agcaaatcgc gctgttagcg ggcccattaa gttctgtctc ggcgcgtctg
cgtctggctg 1140gctggcataa atatctcact cgcaatcaaa ttcagccgat
agcggaacgg gaaggcgact 1200ggagtgccat gtccggtttt caacaaacca
tgcaaatgct gaatgagggc atcgttccca 1260ctgcgatgct ggttgccaac
gatcagatgg cgctgggcgc aatgcgcgcc attaccgagt 1320ccgggctgcg
cgttggtgcg gatatctcgg tagtgggata cgacgatacc gaagacagct
1380catgttatat cccgccgtta accaccatca aacaggattt tcgcctgctg
gggcaaacca 1440gcgtggaccg cttgctgcaa ctctctcagg gccaggcggt
gaagggcaat cagctgttgc 1500ccgtctcact ggtgaaaaga aaaaccaccc
tggcgcccaa tacgcaaacc gcctctcccc 1560gcgcgttggc cgattcatta
atgcagctgg cacgacaggt ttcccgactg gaaagcgggc 1620agtgagcgca
acgcaattaa tgtaagttag cgcgaattga tctggtttga cagcttatca
1680tcgactgcac ggtgcaccaa tgcttctggc gtcaggcagc catcggaagc
tgtggtatgg 1740ctgtgcaggt cgtaaatcac tgcataattc gtgtcgctca
aggcgcactc ccgttctgga 1800taatgttttt tgcgccgaca tcataacggt
tctggcaaat attctgaaat gagctgttga 1860caattaatca tccggctcgt
ataatgtgtg gaattgtgag cggataacaa tttcacacag 1920gaaacagcgc
cgctgagaaa aagcgaagcg gcactgctct ttaacaattt atcagacaat
1980ctgtgtgggc actcgaccgg aattatcgat taactttatt attaaaaatt
aaagaggtat 2040atattaatgt atcgattaaa taaggaggaa taaaccatgg
atccgagctc gagatctgca 2100gctggtacca tatgggaatt cgaagcttgg
gcccgaacaa aaactcatct cagaagagga 2160tctgaatagc gccgtcgacc
atcatcatca tcatcattga gtttaaacgg tctccagctt 2220ggctgttttg
gcggatgaga gaagattttc agcctgatac agattaaatc agaacgcaga
2280agcggtctga taaaacagaa tttgcctggc ggcagtagcg cggtggtccc
acctgacccc 2340atgccgaact cagaagtgaa acgccgtagc gccgatggta
gtgtggggtc tccccatgcg 2400agagtaggga actgccaggc atcaaataaa
acgaaaggct cagtcgaaag actgggcctt 2460tcgttttatc tgttgtttgt
cggtgaacgc tctcctgacg cctgatgcgg tattttctcc 2520ttacgcatct
gtgcggtatt tcacaccgca tatggtgcac tctcagtaca atctgctctg
2580atgccgcata gttaagccag ccccgacacc cgccaacacc cgctgacgag
cttagtaaag 2640ccctcgctag attttaatgc ggatgttgcg attacttcgc
caactattgc gataacaaga 2700aaaagccagc ctttcatgat atatctccca
atttgtgtag ggcttattat gcacgcttaa 2760aaataataaa agcagacttg
acctgatagt ttggctgtga gcaattatgt gcttagtgca 2820tctaacgctt
gagttaagcc gcgccgcgaa gcggcgtcgg cttgaacgaa ttgttagaca
2880ttatttgccg actaccttgg tgatctcgcc tttcacgtag tggacaaatt
cttccaactg 2940atctgcgcgc gaggccaagc gatcttcttc ttgtccaaga
taagcctgtc tagcttcaag 3000tatgacgggc tgatactggg ccggcaggcg
ctccattgcc cagtcggcag cgacatcctt 3060cggcgcgatt ttgccggtta
ctgcgctgta ccaaatgcgg gacaacgtaa gcactacatt 3120tcgctcatcg
ccagcccagt cgggcggcga gttccatagc gttaaggttt catttagcgc
3180ctcaaataga tcctgttcag gaaccggatc aaagagttcc tccgccgctg
gacctaccaa 3240ggcaacgcta tgttctcttg cttttgtcag caagatagcc
agatcaatgt cgatcgtggc 3300tggctcgaag atacctgcaa gaatgtcatt
gcgctgccat tctccaaatt gcagttcgcg 3360cttagctgga taacgccacg
gaatgatgtc gtcgtgcaca acaatggtga cttctacagc 3420gcggagaatc
tcgctctctc caggggaagc cgaagtttcc aaaaggtcgt tgatcaaagc
3480tcgccgcgtt gtttcatcaa gccttacggt caccgtaacc agcaaatcaa
tatcactgtg 3540tggcttcagg ccgccatcca ctgcggagcc gtacaaatgt
acggccagca acgtcggttc 3600gagatggcgc tcgatgacgc caactacctc
tgatagttga gtcgatactt cggcgatcac 3660cgcttccctc atgatgttta
actttgtttt agggcgactg ccctgctgcg taacatcgtt 3720gctgctccat
aacatcaaac atcgacccac ggcgtaacgc gcttgctgct tggatgcccg
3780aggcatagac tgtaccccaa aaaaacagtc ataacaagcc atgaaaaccg
ccactgcgcc 3840gttaccaccg ctgcgttcgg tcaaggttct ggaccagttg
cgtgagcgca tacgctactt 3900gcattacagc ttacgaaccg aacaggctta
tgtccactgg gttcgtgcct tcatccgttt 3960ccacggtgtg cgtcacccgg
caaccttggg cagcagcgaa gtcgaggcat ttctgtcctg 4020gctggcgaac
gagcgcaagg tttcggtctc cacgcatcgt caggcattgg cggccttgct
4080gttcttctac ggcaaggtgc tgtgcacgga tctgccctgg cttcaggaga
tcggaagacc 4140tcggccgtcg cggcgcttgc cggtggtgct gaccccggat
gaagtggttc gcatcctcgg 4200ttttctggaa ggcgagcatc gtttgttcgc
ccagcttctg tatggaacgg gcatgcggat 4260cagtgagggt ttgcaactgc
gggtcaagga tctggatttc gatcacggca cgatcatcgt 4320gcgggagggc
aagggctcca aggatcgggc cttgatgtta cccgagagct tggcacccag
4380cctgcgcgag caggggaatt aattcccacg ggttttgctg cccgcaaacg
ggctgttctg 4440gtgttgctag tttgttatca gaatcgcaga tccggcttca
gccggtttgc cggctgaaag 4500cgctatttct tccagaattg ccatgatttt
ttccccacgg gaggcgtcac tggctcccgt 4560gttgtcggca gctttgattc
gataagcagc atcgcctgtt tcaggctgtc tatgtgtgac 4620tgttgagctg
taacaagttg tctcaggtgt tcaatttcat gttctagttg ctttgtttta
4680ctggtttcac ctgttctatt aggtgttaca tgctgttcat ctgttacatt
gtcgatctgt 4740tcatggtgaa cagctttgaa tgcaccaaaa actcgtaaaa
gctctgatgt atctatcttt 4800tttacaccgt tttcatctgt gcatatggac
agttttccct ttgatatgta acggtgaaca 4860gttgttctac ttttgtttgt
tagtcttgat gcttcactga tagatacaag agccataaga 4920acctcagatc
cttccgtatt tagccagtat gttctctagt gtggttcgtt gtttttgcgt
4980gagccatgag aacgaaccat tgagatcata cttactttgc atgtcactca
aaaattttgc 5040ctcaaaactg gtgagctgaa tttttgcagt taaagcatcg
tgtagtgttt ttcttagtcc 5100gttatgtagg taggaatctg atgtaatggt
tgttggtatt ttgtcaccat tcatttttat 5160ctggttgttc tcaagttcgg
ttacgagatc catttgtcta tctagttcaa cttggaaaat 5220caacgtatca
gtcgggcggc ctcgcttatc aaccaccaat ttcatattgc tgtaagtgtt
5280taaatcttta cttattggtt tcaaaaccca ttggttaagc cttttaaact
catggtagtt 5340attttcaagc attaacatga acttaaattc atcaaggcta
atctctatat ttgccttgtg 5400agttttcttt tgtgttagtt cttttaataa
ccactcataa atcctcatag agtatttgtt 5460ttcaaaagac ttaacatgtt
ccagattata ttttatgaat ttttttaact ggaaaagata 5520aggcaatatc
tcttcactaa aaactaattc taatttttcg cttgagaact tggcatagtt
5580tgtccactgg aaaatctcaa agcctttaac caaaggattc ctgatttcca
cagttctcgt 5640catcagctct ctggttgctt tagctaatac accataagca
ttttccctac tgatgttcat 5700catctgagcg tattggttat aagtgaacga
taccgtccgt tctttccttg tagggttttc 5760aatcgtgggg ttgagtagtg
ccacacagca taaaattagc ttggtttcat gctccgttaa 5820gtcatagcga
ctaatcgcta gttcatttgc tttgaaaaca actaattcag acatacatct
5880caattggtct aggtgatttt aat 59032928DNAArtificial
SequenceDescription of Artificial Sequence Synthetic primer
29atatgacgtc ggcatccgct tacagaca 283032DNAArtificial
SequenceDescription of Artificial Sequence Synthetic primer
30aattcttaag tcaggagagc gttcaccgac aa 323124DNAArtificial
SequenceDescription of Artificial Sequence Synthetic primer
31gaattccacc cgctgacgag ctta 243221DNAArtificial
SequenceDescription of Artificial Sequence Synthetic primer
32cgaattccca tatggtacca g 213370DNAArtificial SequenceDescription
of Artificial Sequence Synthetic primer 33gccgcagcct gatggacaaa
gcgttcatta tggtgctgcc ggtcgcgatg attccgggga 60tccgtcgacc
703465DNAArtificial SequenceDescription of Artificial Sequence
Synthetic primer 34atcttcaacg gtaacttctt taccgccatg cgtgtcccag
gtgtctgtag gctggagctg 60cttcg 653571DNAArtificial
SequenceDescription of Artificial Sequence Synthetic primer
35ctcccctgga atgcagggga gcggcaagat taaaccagtt cgttcgggca gtgtaggctg
60gagctgcttc g 713674DNAArtificial SequenceDescription of
Artificial Sequence Synthetic primer 36tatttttagt agcttaaatg
tgattcaaca tcactggaga aagtcttatg catatgaata 60tcctccttag ttcc
743720DNAArtificial SequenceDescription of Artificial Sequence
Synthetic primer 37ggactaaacg tcctacaaac 203820DNAArtificial
SequenceDescription of Artificial Sequence Synthetic primer
38ttcatctgtt tgagatcgag 203929DNAArtificial SequenceDescription of
Artificial Sequence Synthetic primer 39cccgagcggt agccagatgc
ccgccagcg 294029DNAArtificial SequenceDescription of Artificial
Sequence Synthetic primer 40gctgcgggtt agcgcacatc atacgggtc
29417314DNAArtificial SequenceDescription of Artificial Sequence
Synthetic polynucleotide 41cactatacca attgagatgg gctagtcaat
gataattact agtccttttc ctttgagttg 60tgggtatctg taaattctgc tagacctttg
ctggaaaact tgtaaattct gctagaccct 120ctgtaaattc cgctagacct
ttgtgtgttt tttttgttta tattcaagtg gttataattt 180atagaataaa
gaaagaataa aaaaagataa aaagaataga tcccagccct gtgtataact
240cactacttta gtcagttccg cagtattaca aaaggatgtc gcaaacgctg
tttgctcctc 300tacaaaacag accttaaaac cctaaaggcg tcggcatccg
cttacagaca agctgtgacc 360gtctccggga gctgcatgtg tcagaggttt
tcaccgtcat caccgaaacg cgcgaggcag 420cagatcaatt cgcgcgcgaa
ggcgaagcgg catgcattta cgttgacacc atcgaatggt 480gcaaaacctt
tcgcggtatg gcatgatagc gcccggaaga gagtcaattc agggtggtga
540atgtgaaacc agtaacgtta tacgatgtcg cagagtatgc cggtgtctct
tatcagaccg 600tttcccgcgt ggtgaaccag gccagccacg tttctgcgaa
aacgcgggaa aaagtggaag 660cggcgatggc ggagctgaat tacattccca
accgcgtggc acaacaactg gcgggcaaac 720agtcgttgct gattggcgtt
gccacctcca gtctggccct gcacgcgccg tcgcaaattg 780tcgcggcgat
taaatctcgc gccgatcaac tgggtgccag cgtggtggtg tcgatggtag
840aacgaagcgg cgtcgaagcc tgtaaagcgg cggtgcacaa tcttctcgcg
caacgcgtca 900gtgggctgat cattaactat ccgctggatg accaggatgc
cattgctgtg gaagctgcct 960gcactaatgt tccggcgtta tttcttgatg
tctctgacca gacacccatc aacagtatta 1020ttttctccca tgaagacggt
acgcgactgg gcgtggagca tctggtcgca ttgggtcacc 1080agcaaatcgc
gctgttagcg ggcccattaa gttctgtctc ggcgcgtctg cgtctggctg
1140gctggcataa atatctcact cgcaatcaaa ttcagccgat agcggaacgg
gaaggcgact 1200ggagtgccat gtccggtttt caacaaacca tgcaaatgct
gaatgagggc atcgttccca 1260ctgcgatgct ggttgccaac gatcagatgg
cgctgggcgc aatgcgcgcc attaccgagt 1320ccgggctgcg cgttggtgcg
gatatctcgg tagtgggata cgacgatacc gaagacagct 1380catgttatat
cccgccgtta accaccatca aacaggattt tcgcctgctg gggcaaacca
1440gcgtggaccg cttgctgcaa ctctctcagg gccaggcggt gaagggcaat
cagctgttgc 1500ccgtctcact ggtgaaaaga aaaaccaccc tggcgcccaa
tacgcaaacc gcctctcccc 1560gcgcgttggc cgattcatta atgcagctgg
cacgacaggt ttcccgactg gaaagcgggc 1620agtgagcgca acgcaattaa
tgtaagttag cgcgaattga tctggtttga cagcttatca 1680tcgactgcac
ggtgcaccaa tgcttctggc gtcaggcagc catcggaagc tgtggtatgg
1740ctgtgcaggt cgtaaatcac tgcataattc gtgtcgctca aggcgcactc
ccgttctgga 1800taatgttttt tgcgccgaca tcataacggt tctggcaaat
attctgaaat gagctgttga 1860caattaatca tccggctcgt ataatgtgtg
gaattgtgag cggataacaa tttcacacag 1920gaaacagcgc cgctgagaaa
aagcgaagcg gcactgctct ttaacaattt atcagacaat 1980ctgtgtgggc
actcgaccgg aattatcgat taactttatt attaaaaatt aaagaggtat
2040atattaatgt atcgattaaa taaggaggaa taaaccatga aacgtctcgg
aaccctggac 2100gcctcctggc tggcggttga atctgaagac accccgatgc
atgtgggtac gcttcagatt 2160ttctcactgc cggaaggcgc accagaaacc
ttcctgcgtg acatggtcac tcgaatgaaa 2220gaggccggcg atgtggcacc
accctgggga tacaaactgg cctggtctgg tttcctcggg 2280cgcgtgatcg
ccccggcctg gaaagtcgat aaggatatcg atctggatta tcacgtccgg
2340cactcagccc tgcctcgccc cggcggggag cgcgaactgg gtattctggt
atcccgactg 2400cactctaacc ccctggattt ttcccgccct ctttgggaat
gccacgttat tgaaggcctg 2460gagaataacc gttttgccct ttacaccaaa
atgcaccact cgatgattga cggcatcagc 2520ggcgtgcgac tgatgcagag
ggtgctcacc accgatcccg aacgctgcaa tatgccaccg 2580ccctggacgg
tacgcccaca ccaacgccgt ggtgcaaaaa ccgacaaaga ggccagcgtg
2640cccgcagcgg tttcccaggc aatggacgcc ctgaagctcc aggcagacat
ggcccccagg 2700ctgtggcagg ccggcaatcg cctggtgcat tcggttcgac
acccggaaga cggactgacc 2760gcgcccttca ctggaccggt ttcggtgctc
aatcaccggg ttaccgcgca gcgacgtttt 2820gccacccagc attatcaact
ggaccggctg aaaaacctgg cccatgcttc cggcggttcc 2880ttgaacgaca
tcgtgcttta cctgtgtggc accgcattgc ggcgctttct ggctgagcag
2940aacaatctgc cagacacccc gctgacggct ggtataccgg tgaatatccg
gccggcagac 3000gacgagggta cgggcaccca gatcagtttt atgattgcct
cgctggccac cgacgaagct 3060gatccgttga accgcctgca acagatcaaa
acctcgaccc gacgggccaa ggagcacctg 3120cagaaacttc caaaaagtgc
cctgacccag tacaccatgc tgctgatgtc accctacatt 3180ctgcaattga
tgtcaggtct cggggggagg atgcgaccag tcttcaacgt gaccatttcc
3240aacgtgcccg gcccggaagg cacgctgtat tatgaaggag cccggcttga
ggccatgtat 3300ccggtatcgc taatcgctca cggcggcgcc ctgaacatca
cctgcctgag ctatgccgga 3360tcgctgaatt tcggttttac cggctgtcgg
gatacgctgc cgagcatgca gaaactggcg 3420gtttataccg gtgaagctct
ggatgagctg gaatcgctga ttctgccacc caagaagcgc 3480gcccgaaccc
gcaagtaact cgagatctgc agctggtacc atatgggaat tcgaagcttg
3540ggcccgaaca aaaactcatc tcagaagagg atctgaatag cgccgtcgac
catcatcatc 3600atcatcattg agtttaaacg gtctccagct tggctgtttt
ggcggatgag agaagatttt 3660cagcctgata cagattaaat cagaacgcag
aagcggtctg ataaaacaga atttgcctgg 3720cggcagtagc gcggtggtcc
cacctgaccc catgccgaac tcagaagtga aacgccgtag 3780cgccgatggt
agtgtggggt ctccccatgc gagagtaggg aactgccagg catcaaataa
3840aacgaaaggc tcagtcgaaa gactgggcct ttcgttttat ctgttgtttg
tcggtgaacg 3900ctctcctgac gcctgatgcg gtattttctc cttacgcatc
tgtgcggtat ttcacaccgc 3960atatggtgca ctctcagtac aatctgctct
gatgccgcat agttaagcca gccccgacac 4020ccgccaacac ccgctgacga
gcttagtaaa gccctcgcta gattttaatg cggatgttgc 4080gattacttcg
ccaactattg cgataacaag aaaaagccag cctttcatga tatatctccc
4140aatttgtgta gggcttatta tgcacgctta aaaataataa aagcagactt
gacctgatag 4200tttggctgtg agcaattatg tgcttagtgc atctaacgct
tgagttaagc cgcgccgcga 4260agcggcgtcg gcttgaacga attgttagac
attatttgcc gactaccttg gtgatctcgc 4320ctttcacgta gtggacaaat
tcttccaact gatctgcgcg cgaggccaag cgatcttctt 4380cttgtccaag
ataagcctgt ctagcttcaa gtatgacggg ctgatactgg gccggcaggc
4440gctccattgc ccagtcggca gcgacatcct tcggcgcgat tttgccggtt
actgcgctgt 4500accaaatgcg ggacaacgta agcactacat ttcgctcatc
gccagcccag tcgggcggcg 4560agttccatag cgttaaggtt tcatttagcg
cctcaaatag atcctgttca ggaaccggat 4620caaagagttc ctccgccgct
ggacctacca aggcaacgct atgttctctt gcttttgtca 4680gcaagatagc
cagatcaatg tcgatcgtgg ctggctcgaa gatacctgca agaatgtcat
4740tgcgctgcca ttctccaaat tgcagttcgc gcttagctgg ataacgccac
ggaatgatgt 4800cgtcgtgcac aacaatggtg acttctacag cgcggagaat
ctcgctctct ccaggggaag 4860ccgaagtttc caaaaggtcg ttgatcaaag
ctcgccgcgt tgtttcatca agccttacgg 4920tcaccgtaac cagcaaatca
atatcactgt gtggcttcag gccgccatcc actgcggagc 4980cgtacaaatg
tacggccagc aacgtcggtt cgagatggcg ctcgatgacg ccaactacct
5040ctgatagttg agtcgatact tcggcgatca ccgcttccct catgatgttt
aactttgttt 5100tagggcgact gccctgctgc gtaacatcgt tgctgctcca
taacatcaaa catcgaccca 5160cggcgtaacg cgcttgctgc ttggatgccc
gaggcataga ctgtacccca aaaaaacagt 5220cataacaagc catgaaaacc
gccactgcgc cgttaccacc gctgcgttcg gtcaaggttc 5280tggaccagtt
gcgtgagcgc atacgctact tgcattacag cttacgaacc gaacaggctt
5340atgtccactg ggttcgtgcc ttcatccgtt tccacggtgt gcgtcacccg
gcaaccttgg 5400gcagcagcga agtcgaggca tttctgtcct ggctggcgaa
cgagcgcaag gtttcggtct 5460ccacgcatcg tcaggcattg gcggccttgc
tgttcttcta cggcaaggtg ctgtgcacgg 5520atctgccctg gcttcaggag
atcggaagac ctcggccgtc gcggcgcttg ccggtggtgc 5580tgaccccgga
tgaagtggtt cgcatcctcg gttttctgga aggcgagcat cgtttgttcg
5640cccagcttct gtatggaacg ggcatgcgga tcagtgaggg tttgcaactg
cgggtcaagg 5700atctggattt cgatcacggc acgatcatcg tgcgggaggg
caagggctcc aaggatcggg 5760ccttgatgtt acccgagagc ttggcaccca
gcctgcgcga gcaggggaat taattcccac 5820gggttttgct gcccgcaaac
gggctgttct ggtgttgcta gtttgttatc agaatcgcag 5880atccggcttc
agccggtttg ccggctgaaa gcgctatttc ttccagaatt gccatgattt
5940tttccccacg ggaggcgtca ctggctcccg tgttgtcggc agctttgatt
cgataagcag 6000catcgcctgt ttcaggctgt ctatgtgtga ctgttgagct
gtaacaagtt gtctcaggtg 6060ttcaatttca tgttctagtt gctttgtttt
actggtttca cctgttctat taggtgttac 6120atgctgttca tctgttacat
tgtcgatctg ttcatggtga acagctttga atgcaccaaa 6180aactcgtaaa
agctctgatg tatctatctt ttttacaccg ttttcatctg tgcatatgga
6240cagttttccc tttgatatgt aacggtgaac agttgttcta cttttgtttg
ttagtcttga 6300tgcttcactg atagatacaa gagccataag aacctcagat
ccttccgtat ttagccagta 6360tgttctctag tgtggttcgt tgtttttgcg
tgagccatga gaacgaacca ttgagatcat 6420acttactttg catgtcactc
aaaaattttg cctcaaaact ggtgagctga atttttgcag 6480ttaaagcatc
gtgtagtgtt tttcttagtc cgttatgtag gtaggaatct gatgtaatgg
6540ttgttggtat tttgtcacca ttcattttta tctggttgtt ctcaagttcg
gttacgagat 6600ccatttgtct atctagttca acttggaaaa tcaacgtatc
agtcgggcgg cctcgcttat 6660caaccaccaa tttcatattg ctgtaagtgt
ttaaatcttt acttattggt ttcaaaaccc 6720attggttaag ccttttaaac
tcatggtagt tattttcaag cattaacatg aacttaaatt 6780catcaaggct
aatctctata tttgccttgt gagttttctt ttgtgttagt tcttttaata
6840accactcata aatcctcata gagtatttgt tttcaaaaga cttaacatgt
tccagattat 6900attttatgaa tttttttaac tggaaaagat aaggcaatat
ctcttcacta aaaactaatt 6960ctaatttttc gcttgagaac ttggcatagt
ttgtccactg gaaaatctca aagcctttaa 7020ccaaaggatt cctgatttcc
acagttctcg tcatcagctc tctggttgct ttagctaata 7080caccataagc
attttcccta ctgatgttca tcatctgagc gtattggtta taagtgaacg
7140ataccgtccg ttctttcctt gtagggtttt caatcgtggg gttgagtagt
gccacacagc 7200ataaaattag cttggtttca tgctccgtta agtcatagcg
actaatcgct agttcatttg 7260ctttgaaaac aactaattca gacatacatc
tcaattggtc taggtgattt taat 731442787PRTStenotrophomonas
maltophiliaR551-3 42Met Thr Met Gly Val Gly Leu Cys Thr Gly Asp Val
Ser Ile Gly Arg1 5 10 15Gly Gly Gly Arg Ser Arg Arg Gly Trp Val Ser
Glu Phe Leu Pro Val 20 25 30Ala Val Ala Gly Asp Glu Gly Ala Gly Val
Ala Ala Ala Thr Gln Trp 35 40 45Cys Arg Cys Asp Leu Arg Thr Arg Arg
Asn Arg Cys Gly Gly Gly Trp 50 55 60Leu Asp Thr Arg Gly His Cys Leu
His Arg Gly Gly Asp Leu Val Val65 70 75 80Gln Ala Glu Asp Arg
His Gly Val Gly Gln Leu Ala Ser Leu Phe Phe 85 90 95His Arg Thr Cys
Gly Gly Gly Gly Phe Phe His Gln Arg Ser Ile Leu 100 105 110Leu Gly
Gly Phe Val His Leu His His Gly Leu Val Asp Leu Leu Asp 115 120
125Ala Gly Arg Leu Leu Leu Arg Gly Arg Gly Asp Leu Gly His Asp Val
130 135 140Gly His Ala Leu His Arg Gly Asp Asp Leu Gly His Gly Ala
Ala Gly145 150 155 160Leu Val Asp Gln Ala Gly Ala Leu Gly Asp Leu
Ala Asp Arg Val Ile 165 170 175Asp Gln Ala Leu Asp Leu Leu Gly Gly
Gly Gly Arg Ala Leu Gly Glu 180 185 190Gly Ala His Phe Ala Gly Asp
His Gly Lys Ala Thr Ala Leu Phe Ala 195 200 205Gly Thr Cys Gly Phe
His Cys Gly Val Gln Arg Gln Asp Val Gly Leu 210 215 220Glu Gly Asp
Ala Ile Asp Asp Ala Asp Asp Leu Gly Asp Leu Leu Arg225 230 235
240Arg Gly Phe Asp Gly Arg His Gly Val Asp His Leu Ala Asp His Ala
245 250 255Ala Thr Leu Arg Gly His Thr Leu Arg Ala Asp Gly Glu Leu
Val Gly 260 265 270Leu Ala Gly Met Leu Gly Val Leu Ala His Gly Gly
Gly Gln Leu Leu 275 280 285His Arg Gly Arg Gly Phe Phe Gln Ile Gly
Ser Leu Leu Leu Gly Thr 290 295 300Ala Arg Gln Val Ala Val Ala Arg
Arg Asp Leu Thr Ser Gly Glu Gly305 310 315 320Asp Ala Gly Arg Ala
Gly Leu Asp Leu Ala Asp Asp Leu Gly Gln Leu 325 330 335Gly Asp Gly
Gly Val Gly Ile Val Thr His Ala Arg Glu His Ala Leu 340 345 350Val
Ile Ala Met His Ala Cys Gly Gln Val Ala Phe Gly Asp Gly Arg 355 360
365Gln Gln Leu Arg Glu Leu Ala Glu Val Val Ile Ala Asp Arg His His
370 375 380Arg Val Glu Val Phe Gln His Gln Ala Glu Ile Val Val Glu
Ala Leu385 390 395 400Arg Val Ala Thr Thr Ala Glu Val Ala Gly Gly
Gly Gly Thr Gly Gln 405 410 415Leu Leu Asp Leu Gly Val Asp Ala Ala
Lys Val Gly Leu Asp Arg Ile 420 425 430Asp Gly Gly Gly His His Arg
Leu Leu Ala Arg Val Ala Cys Asp Val 435 440 445Thr Ala Glu Val Ala
Asp Cys Val Leu Leu His Asp Leu Gln Tyr Ile 450 455 460Val Gln Gly
Leu His Met Ala Thr Asp Gln Gly Val Gly Phe Leu His465 470 475
480His Gln Pro Val Phe Ala Gly Glu Gly Ala Gly Ile Asp Ala Val Ala
485 490 495Gln Leu Ala Thr Val Val Ala Ala Gly His Phe Ala Leu Ala
Ala Asp 500 505 510His Arg Val Gln Leu Leu Leu His Ala Gly His Arg
Leu Gln Gln Ala 515 520 525Ala Gly Phe Ile Met Arg Leu Arg Ala Asp
Met Ala Val Glu Leu Ala 530 535 540Gly Gly Asp Thr Phe Gly Asp Gly
Gly Gly Leu Leu Gln Arg His Gly545 550 555 560Asp Ala Val Ala Asp
Asp Pro Ala Gln Gly Gln His Asp Gln His Gln 565 570 575His Ala Ala
Gly Glu Gly His Asp Gly Gly Glu His Gln Arg Leu Leu 580 585 590Leu
Asn Val Ile His Val Asp Ala Ala Ala Asp His Pro Val Pro Gly 595 600
605Cys Glu Gln Ala Gly Val Gly His Leu Leu Asp Val Ala Leu Ala Ala
610 615 620Arg Leu Gly Pro Ala Val Val Asp Glu Ala Ala Ala Gly Leu
Gly Arg625 630 635 640Leu Asp Leu Val Val Val Asp Ala His Ala Val
Val Gly Ala Glu Val 645 650 655Leu His Val Leu Ala Asp Gln Val Leu
Ala Glu Arg Val His Gln Gln 660 665 670Ala Ile Ala Gly Val Val Asp
Val Val Val Ile Gly Val Val Leu Gly 675 680 685Ala His His Phe Gln
Arg Leu Gln Gly Ser Gly Leu Gly Ser Leu Leu 690 695 700Ala Glu Leu
Ala Gly Ala Gly Gln Ala Met Val Val Leu Glu Asp Ala705 710 715
720Ile Ala Gln Phe Asp Leu Gly Leu Gln Arg Gly Leu Ala Gly Leu Gly
725 730 735Gln Val Gln Val Leu His Ala Gly Gly Asp His Arg Gln Arg
Asp His 740 745 750Ala Glu Arg Asp Glu Gln Arg Gln Gln Val Glu Leu
Ala Pro Asp Gly 755 760 765Glu Val Ala Glu Val Val Leu Pro Ala Met
Gln Lys Ile His Arg Ala 770 775 780Gly Pro Gly785
* * * * *
References