Elevation Of Oil Levels In Plants Ravanello; Monica P. ; et al. [Foley; Terry J.]

Elevation Of Oil Levels In Plants

Ravanello; Monica P. ; et al.

Patent Application Summary

U.S. patent application number 11/676445 was filed with the patent office on 2007-12-06 for elevation of oil levels in plants. Invention is credited to Terry J. Foley, John R. LeDeaux, Monica P. Ravanello, Thomas J. Savage, Annette E. Wyrick.

Application Number	20070283458 11/676445
Document ID	/
Family ID	33555618
Filed Date	2007-12-06

United States Patent Application	20070283458
Kind Code	A1
Ravanello; Monica P. ; et al.	December 6, 2007

ELEVATION OF OIL LEVELS IN PLANTS

Abstract

This present invention provides a method for increasing oil levels in corn kernel tissue by expression of an HOI001 GBSS allele. The present invention also provides isolated nucleic acid molecules encoding a HOI001 GBSS polypeptide.

Inventors:	Ravanello; Monica P.; (Vacaville, CA) ; Foley; Terry J.; (Williamsburg, IA) ; LeDeaux; John R.; (St. Louis, MO) ; Wyrick; Annette E.; (Vacaville, CA) ; Savage; Thomas J.; (Sacramento, CA)
Correspondence Address:	SONNENSCHEIN NATH & ROSENTHAL LLP P.O. BOX 061080 SOUTH WACKER DRIVE STATION, SEARS TOWER CHICAGO IL 60606 US
Family ID:	33555618
Appl. No.:	11/676445
Filed:	February 19, 2007

Related U.S. Patent Documents


Application Number	Filing Date	Patent Number
10877645	Jun 25, 2004	7179956
11676445	Feb 19, 2007
60483491	Jun 27, 2003

Current U.S. Class:	800/281 ; 426/601; 426/615; 435/419; 435/6.18; 536/23.6; 800/298; 800/320.1
Current CPC Class:	A23D 9/00 20130101; C12N 9/1051 20130101; C12N 15/8247 20130101; A23K 10/30 20160501
Class at Publication:	800/281 ; 426/601; 426/615; 435/419; 435/006; 536/023.6; 800/298; 800/320.1
International Class:	A01H 1/00 20060101 A01H001/00; A01H 5/00 20060101 A01H005/00; A23D 7/00 20060101 A23D007/00; A23K 1/00 20060101 A23K001/00; C07H 21/04 20060101 C07H021/04; C12N 15/82 20060101 C12N015/82; C12Q 1/68 20060101 C12Q001/68

Claims

1. A substantially purified nucleic acid molecule selected from the group consisting of: a) a nucleic acid molecule comprising SEQ ID NO: 1 or the complement thereof; b) a nucleic acid molecule comprising SEQ ID NO: 11 or the complement thereof; and c) a nucleic acid molecule which encodes a polypeptide having at least about 94% amino acid identity with SEQ ID NO: 3.

2. An expression cassette comprising a nucleic acid molecule of claim 1, wherein said nucleic acid molecule is operably linked to a promoter, which is functional in a plant cell.

3. A plant cell comprising the expression cassette of claim 2.

4. A plant comprising the plant cell of claim 3.

5. A plant according to claim 4, wherein the plant is a monocot.

6. A plant according to claim 5, wherein the monocot plant is corn.

7. Seeds obtained from a corn plant, wherein the seeds comprise the nucleic acid molecule of claim 1.

8. An animal feed obtained from the seeds of claim 7.

9. A plant having stably incorporated into it's genome a nucleic acid molecule selected from the group consisting of: a) a nucleic acid molecule comprising SEQ ID NO: 1 or the complement thereof; b) a nucleic acid molecule comprising SEQ ID NO: 11 or the complement thereof; and c) a nucleic acid molecule which encodes a polypeptide having at least about 94% amino acid identity with SEQ ID NO: 3.

10. A plant according to claim 9, wherein the plant is a monocot.

11. The monocot plant of claim 10, wherein the plant is corn.

12. Seeds obtained from the plant of claim 11.

13. Oil obtained from the seeds of claim 12.

14. An animal feed obtained from the seeds of claim 12.

15. A method of producing a plant having increased levels of oil production, wherein the method comprises: (a) transforming a plant with an expression cassette comprising a nucleic acid molecule selected from the group consisting of: i) a nucleic acid molecule comprising SEQ ID NO: 1 or the complement thereof; ii) a nucleic acid molecule comprising SEQ ID NO: 11 or the complement thereof; and iii) a nucleic acid molecule which encodes a polypeptide having at least about 94% amino acid identity with SEQ ID NO: 3; wherein said expression cassette further comprises a promoter region functional in a plant cell, operably linked to said nucleic acid molecule; and (b) growing the transformed plant.

16. The method claim 15, wherein the plant is a monocot.

17. The method of claim 16, wherein the monocot plant is corn.

18. The method of claim 17, wherein the promoter region is an endosperm promoter region.

19. The method of claim 18, wherein the promoter region is the Z27 promoter.

20. A method of selecting corn germplasm, comprising the steps of: a) identifying at least one polymorphism unique to the HOI001 GBSS sequence represented in SEQ ID NO: 1; b) selecting a fragment of SEQ ID NO: 1 containing at least part of one of the identified polymorphisms to be used as a molecular marker; c) assaying corn plants for the presence of the marker; and d) selecting plants that contain the marker.

Description

[0001] This application claims the benefit of the filing date of the Provisional Application U.S. Ser. No. 60/483,491, filed Jun. 27, 2003, which is incorporated herein by reference.

[0002] The present invention relates to the fields of nucleic acid chemistry and agricultural biotechnology. In particular, the present invention is directed at the identification of nucleic acids that encode proteins useful for increasing oil levels in maize plants and creating maize plants that include such nucleic acids.

[0003] Plants are a major source of oils for feed, food, and industrial uses. While tissues of most plant species contain little oil, the cultivation of certain plant types, over many acres, permit large quantities of plant oils to be produced. If the oil content of these plants could be increased, then plant oils could be produced more efficiently. For example, the normal oil content of yellow #2, dent corn is about 4%. If the oil content of corn could be increased to 8% or even 12%, without significantly affecting yield, the same amount of oil could be produced from half or even one-third the number of acres.

[0004] Currently, levels of oil in oilseed crops have increased incrementally by traditional breeding and selection methods. There exist few references to transgenic plants with increased levels of oil. In contrast, increases in the proportions of some strategic fatty acids have been achieved by the introduction or manipulation of various plant fatty acid biosynthesis genes in oilseeds. For instance, Voelker et al., Science, 257:72-74 (1992), demonstrated that expression in Brassicaceae of a medium chain fatty acyl-ACP thioesterase from California Bay, increased the lauric acid (12:0) content. Hitz et al., Proc. 9.sup.th International Cambridge Rapeseed Congress UK, pp 470-472 (1995) increased proportions of oleic acid in Glycine max by co-suppression using a sense construct encoding a plant microsomal FAD-2 (.DELTA.12) desaturase. Although the use of these plant transgenes resulted in an increased production of lauric acid in canola and altered proportions of oleic acid in soy, there was no evidence of increased total fatty acid content, or increased oil yield in these transgenics.

[0005] Certain workers have attempted to increase or modulate the oil content of plants by manipulation of oil biosynthetic pathway genes. For example, U.S. Pat. No. 6,268,550 to Gengenbach et al. provides maize acetyl CoA carboxylase nucleic acids for altering the oil content of plants. Additionally, U.S. Pat. No. 5,925,805 to Ohlrogge et al. provides an Arabidopsis acetyl CoA carboxylase gene that can be used to increase the oil content of plants. However, the synthesis of fatty acids requires the coordinated activity of many enzymes, none of which when solely upregulated has been found to substantially increase oil content.

[0006] A need therefore exists for an improved method to alter the oil content of plants, and in particular to increase the oil content of plants and seeds.

[0007] In addition to oil, starch from maize is also agriculturally and commercially significant. Starch comprises a major component of animal feed and human food. Starch is also used industrially in the production of paper, textiles, plastics, and adhesives, as well as providing the raw material for some bioreactors.

[0008] In higher plants, the starch consists of linear chain and branched chain glucans known as amylose and amylopectin, respectively. Starch with various amounts of amylose and amylopectin are found in different plants. Typically, maize starch contains approximately 25% amylose, the remainder being amylopectin. Amylopectin contains short chains and long chains, the short chains ranging from 5-30 glucose units and the long chains ranging from 30-100 glucose units, or more. The ratio of amylose to amylopectin, as well as the distribution of short to long chains in the amylopectin fraction, affect the physical properties of starch, (e.g., thermal stabilization, retrogradation, and viscosity).

[0009] The WAXY locus of maize determines the amylose content in pollen and in kernel endosperm, (Shure et al., Cell, 35(1):225-233 (1983)), resulting in starch having unique properties. Most mutations in the WAXY locus of maize, which encodes granule bound starch synthase (GBSS), result in an opaque endosperm of smooth, firm non-corneous starch comprising mostly amylopectin and a reduced amount of amylose in the endosperm, pollen and embryo sac ("WAXY phenotype") (see, Okagaki and Wessler, Genetics, 120(4):1137-1143 (1988)). When no functioning GBSS is synthesized in the homozygous WAXY mutant, it also lacks amylose (Echt and Schwartz, Genetics, 99:275-284 (1981)).

[0010] Additionally, classic, recessive WAXY has a small (approximately 0.5% increase) effect on percent oil in the kernel when compared to yellow #2 corn (Pfahler and Linskens, Theoretical and Applied Genetics, 41(1):2-4 (1971)). In comparison, the inbred line HOI001, a dominant WAXY mutant inbred described in U.S. Patent Publication No. 20030172416, herein incorporated by reference, has whole kernel oil concentrations greater than four times that of yellow #2 corn.

SUMMARY OF THE INVENTION

[0011] The present invention describes and provides isolated nucleic acid molecules encoding an HOI001 GBSS polypeptide. In addition, this invention relates to nucleic acid molecules that are complementary to the nucleic acid molecule encoding an HOI001 GBSS polypeptide. In addition, this invention relates to expression cassettes comprising these nucleic acid molecules. In addition, this invention relates to transgenic maize plants containing these expression cassettes. In addition, this invention relates to the seeds of these transgenic maize plants. This invention further relates to the oil and animal feed obtained from the seeds of these transgenic maize plants.

[0012] In another embodiment, the present invention relates to a recombinant DNA construct, associated with increased oil production in plants, comprising a nucleic acid molecule encoding an HOI001 GBSS polypeptide operably linked to a promoter, which is functional in a plant cell.

[0013] The present invention describes and provides a method of increasing oil in a maize plant by expression of an HOI001 GBSS gene. This invention further provides a method of altering the kernel composition in a corn plant by expression of an HOI001 GBSS gene. This invention further describes and provides sequences of an HOI001 GBSS gene from Zea mays. This invention further provides vector constructs for plant transformation and tissue-specific expression of an HOI001 GBSS gene. This invention further provides maize plants transformed with the GBSS gene with higher oil levels when compared to plants with the same or similar genetic background, but not containing the inserted HOI001 GBSS gene. This invention further provides seeds from these maize plants. This invention further provides for kernels from maize plants transformed with the HOI001 GBSS gene containing a higher level of oil when compared to kernels from corn plants with the same or similar genetic background, but not containing the inserted HOI001 GBSS gene. This invention also provides oil and animal feed produced from these seeds and kernels.

[0014] The present invention further provides a method of marker-assisted breeding useful in breeding higher oil levels in maize.

BRIEF DESCRIPTION OF THE FIGURES

[0015] FIG. 1 shows the nucleic acid sequence alignment of the granule bound starch synthase gene isolated from HOI001 (HOI001 GBSS, pMON72506) [SEQ ID NO: 1] compared to the granule bound starch synthase (GBSS) gene from inbred LH59 (pMON72510), and published sequence of the GBSS gene described in Shure et al., supra, (X03935). For additional comparison, the coding sequence for the published GBSS gene is given (CDS22509).

[0016] FIG. 2 shows the alignment of the corresponding predicted amino acid sequences from the GBSS gene isolated from HOI001 (HOI001 GBSS from pMON72506) [SEQ ID NO: 3], and the GBSS gene described in Shure et al., supra, [SEQ ID NO: 4], respectively.

[0017] FIG. 3 shows the alignment of the corresponding predicted amino acid sequences from the Zea mays GBSS gene isolated from inbred LH59 [SEQ ID NO: 10], and the Zea mays granule bound starch synthase gene described in Shure et al., supra, respectively.

[0018] FIG. 4 depicts a plasmid map of pMON72506.

[0019] FIG. 5 depicts a plasmid map of pMON72510.

[0020] FIGS. 6A and 6B graphically depict the difference in oil levels from kernels of plants transformed with pMON72506 containing the GBSS from HOI001 (SEQ ID NO: 1, 6A) and pMON72510 containing the GBSS from LH59 (SEQ ID NO: 8, 6B). Gene positive and gene negative kernels are compared from each event. Only events with statistically significant changes in oil (14 of 29) are shown in 6A.

[0021] FIG. 7 depicts a plasmid map of pMON81464.

[0022] FIG. 8 depicts a plasmid map of pMON68298.

[0023] FIG. 9 depicts a plasmid map of pMON81465.

BRIEF DESCRIPTION OF THE SEQUENCES

[0024] SEQ ID NO: 1 is the nucleic acid sequence of the granule bound starch synthase from HOI001 (HOI001 GBSS from pMON72506).

[0025] SEQ ID NO: 2 is the published nucleic acid sequence of Zea mays GBSS from Shure et al., supra.

[0026] SEQ ID NO: 3 sets forth the predicted amino acid sequence of HOI001 GBSS from pMON72506.

[0027] SEQ ID NO: 4 sets forth the predicted amino acid sequence from the Zea mays GBSS as published by Shure et al., supra.

[0028] SEQ ID NO: 5 is a primer sequence for Primer number 14543.

[0029] SEQ ID NO: 6 is a primer sequence for Primer number 14547.

[0030] SEQ ID NO: 7 sets forth a nucleic acid sequence of a DNA molecule that encodes a GBSS from corn line LH59.

[0031] SEQ ID NO: 8 sets forth the predicted amino acid sequence of GBSS from corn line LH59.

[0032] SEQ ID NO: 9 is a primer sequence for Primer number 20095.

[0033] SEQ ID NO: 10 is a primer sequence for Primer number 20092.

[0034] SEQ ID NO: 11 sets forth the coding region of the GBSS cDNA of HOI001.

DETAILED DESCRIPTION OF THE INVENTION

[0035] The following definitions are provided as an aid to understanding the detailed description of the present invention.

[0036] The phrases "coding sequence," "coding region," "structural sequence," and "structural nucleic acid sequence" refer to a physical structure comprising an orderly arrangement of nucleotides. The nucleotides are arranged in a series of triplets that each form a codon. Each codon encodes a specific amino acid. Thus, the coding sequence, structural sequence, and structural nucleic acid sequence encode a series of amino acids forming a protein, polypeptide, or peptide sequence. The coding sequence, structural sequence, and structural nucleic acid sequence may be contained within a larger nucleic acid molecule, vector, or the like. In addition, the orderly arrangement of nucleotides in these sequences may be depicted in the form of a sequence listing, figure, table, electronic medium, or the like.

[0037] The phrase "codon degeneracy" refers to divergence in the genetic code permitting variation of the nucleotide sequence without affecting the amino acid sequence of an encoded polypeptide. Accordingly, the instant invention relates to any nucleic acid fragment comprising a nucleotide sequence that encodes all or a substantial portion of the amino acid sequences set forth herein. The skilled artisan is well aware of the "codon-bias" exhibited by a specific host cell in usage of nucleotide codons to specify a given amino acid. Therefore, when synthesizing a nucleic acid fragment for improved expression in a host cell, it is desirable to design the nucleic acid fragment such that its frequency of codon usage approaches the frequency of preferred codon usage of the host cell.

[0038] The term "cDNA" refers to a double-stranded DNA that is complementary to and derived from mRNA.

[0039] The phrases "DNA sequence," "nucleic acid sequence," and "nucleic acid molecule" refer to a physical structure comprising an orderly arrangement of nucleotides. The DNA sequence or nucleotide sequence may be contained within a larger nucleotide molecule, vector, or the like. In addition, the orderly arrangement of nucleic acids in these sequences may be depicted in the form of a sequence listing, figure, table, electronic medium, or the like.

[0040] "Expression" refers to the transcription of a gene to produce the corresponding mRNA and translation of this mRNA to produce the corresponding gene product (i.e., a peptide, polypeptide, or protein).

[0041] "Expression of antisense RNA" refers to the transcription of a DNA to produce a first RNA molecule capable of hybridizing to a second RNA molecule, which second RNA molecule encodes a gene product that is desirably down-regulated.

[0042] As used herein, "gene" refers to a nucleic acid fragment that expresses a specific protein, including regulatory sequences preceding (5' non-coding sequences) and following (3' non-coding sequences) the coding sequence. "Native gene" refers to a gene as found in nature with its own regulatory sequences. "Chimeric gene" refers to any gene that is not a native gene, comprising regulatory and coding sequences that are not found together in nature. Accordingly, a chimeric gene may comprise regulatory sequences and coding sequences that are derived from different sources, or regulatory sequences and coding sequences derived from the same source, but arranged in a manner different than that found in nature. "Endogenous gene" refers to a native gene in its natural location in the genome of an organism. An "exogenous gene" or "transgene" refer to a non-native gene that has been introduced into the genome by a transformation procedure.

[0043] "Hemizygous" refers to a diploid individual having only one copy of a particular gene (for example, because a chromosome has been lost). "Homozygous" refers to a gene pair having identical alleles in two homologous chromosomes.

[0044] "Heterologous" refers to the relationship between two or more nucleic acid or protein sequences that are derived from different sources. For example, a promoter is heterologous with respect to a coding sequence if such a combination is not normally found in nature. In addition, a particular sequence may be "heterologous" with respect to a cell or organism into which it is inserted (i.e., does not naturally occur in that particular cell or organism).

[0045] "Homology" refers to the level of similarity between two or more nucleic acid or amino acid sequences in terms of percent of positional identity (i.e., sequence similarity or identity). Homology also refers to the concept of similar functional properties among different nucleic acids or proteins.

[0046] "Hybridization" refers to the ability of a first strand of nucleic acid to join with a second strand via hydrogen bond base pairing when the two nucleic acid strands have sufficient sequence complementarity. As used herein, a nucleic acid molecule is said to be the "complement" of another nucleic acid molecule if they exhibit complete complementarity. As used herein, molecules are said to exhibit "complete complementarity" when every nucleotide of one of the molecules is complementary to a nucleotide of the other. Thus two nucleic acid strands are said to have sufficient complementarity when they can hybridize to one another with sufficient stability to permit them to remain annealed to one another under appropriate conditions.

[0047] The phrases "marker-assisted selection" or "marker-assisted breeding" refer to the use of genetic markers to identify and select plants with superior phenotypic potential. Genetic markers determined previously to be associated with a trait locus or trait loci are used to uncover the genotype at trait loci by virtue of linkage between the marker locus and the trait locus. Plants containing desired trait alleles are chosen based upon their genotypes at linked marker loci.

[0048] The phrase "breeding population" refers to a genetically heterogeneous collection of plants created for the purpose of identifying one or more individuals with desired phenotypic characteristics. The term "phenotype" refers to the observed expression of one or more plant characteristics.

[0049] A "genetic marker" is any morphological, biochemical, or nucleic acid based phenotypic difference which reveals a DNA polymorphism. Examples of genetic markers include but are not limited to RFLPs, RAPDs, allozymes, SSRs, and AFLPs.

[0050] The phrase "marker locus" refers to the genetically defined location of DNA polymorphisms as revealed by a genetic marker. A "trait locus" refers to a genetically defined location for a collection of one or more genes (alleles) which contribute to an observed characteristic.

[0051] The phrase "restriction fragment length polymorphism" (RFLP) refers to a DNA-based genetic marker in which size differences in restriction endonuclease generated DNA fragments are observed via hybridization (Botstein et al., Am. J. Hum. Genet., 32:314-331 (1980)).

[0052] The phrase "random amplified polymorphic DNA" (RAPD) refers to a DNA amplification based genetic marker in which short, sequence arbitrary primers are used and the resulting amplification products are size separated and differences in amplification patterns observed (Williams et al., Nucleic Acids Res., 18:6531-6535 (1990)).

[0053] The phrase "simple sequence repeat" (SSR) refers to a DNA amplification-based genetic marker in which short stretches of tandemly repeated sequence motifs are amplified and the resulting amplification products are size separated and differences in length of the nucleotide repeat are observed (Tautz, Nucleic Acids Res., 112:4127-4138 (1989)).

[0054] The term "AFLP" refers to a DNA amplification-based genetic marker in which restriction endonuclease generated DNA fragments are ligated to short DNA fragments which facilitate the amplification of the restricted DNA fragments (Vos et al., Nucleic Acids Res., 23:4407-4414 (1995)). The amplified fragments are size separated and differences in amplification patterns observed.

[0055] The phrase "operably linked" refers to the functional spatial arrangement of two or more nucleic acid regions or nucleic acid sequences. For example, a promoter region may be positioned relative to a nucleic acid sequence such that transcription of the nucleic acid sequence is directed by the promoter region. Thus, a promoter region is "operably linked" to the nucleic acid sequence.

[0056] The terms "promoter" or "promoter region" refer to a nucleic acid sequence, usually found upstream (5') to a coding sequence that is capable of directing transcription of a nucleic acid sequence into mRNA. The promoter or promoter region typically provides a recognition site for RNA polymerase and the other factors necessary for proper initiation of transcription. As contemplated herein, a promoter or promoter region includes variations of promoters derived by inserting or deleting regulatory regions, subjecting the promoter to random or site-directed mutagenesis, and the like. The activity or strength of a promoter may be measured in terms of the amounts of RNA it produces, or the amount of protein accumulation in a cell or tissue, relative to a second promoter that is similarly measured.

[0057] The phrase "3' non-coding sequences" refers to nucleotide sequences located downstream of a coding sequence and include polyadenylation recognition sequences and other sequences encoding regulatory signals capable of affecting mRNA processing or gene expression. The polyadenylation signal is usually characterized by affecting the addition of polyadenylic acid tracts to the 3' end of the mRNA precursor. The use of different 3' non-coding sequences is exemplified by Ingelbrecht et al., Plant Cell, 1:671-680 (1989).

[0058] "Translation leader sequence" or "5' untranslated region" or "5'-UTR" all refer to a nucleotide sequence located between the promoter sequence of a gene and the coding sequence. The 5'-UTR is present in the fully processed mRNA upstream of the translation start sequence. The 5'-UTR may affect processing of the primary transcript to mRNA, mRNA stability, or translation efficiency. Examples of translation leader sequences have been described (Turner and Foster, Molecular Biotechnology, 3:225(1995)).

[0059] "RNA transcript" refers to the product resulting from RNA polymerase-catalyzed transcription of a DNA sequence. When the RNA transcript is a perfect complementary copy of the DNA sequence, it is referred to as the primary transcript or it may be a RNA sequence derived from posttranscriptional processing of the primary transcript and is referred to as the mature RNA. "Messenger RNA" (mRNA) refers to the RNA that is without introns and that can be translated into polypeptide by the cell. "Sense RNA" refers to an RNA transcript that includes the mRNA and so can be translated into a polypeptide by the cell. "Antisense RNA" refers to an RNA transcript that is complementary to a target mRNA, resulting in specific RNA: RNA duplexes being formed by base pairing between the antisense RNA substrate and the target mRNA.

[0060] "Recombinant vector" refers to any agent by or in which a nucleic acid of interest is amplified, expressed, or stored, such as a plasmid, cosmid, virus, autonomously replicating sequence, phage, or linear single-stranded, circular single-stranded, linear double-stranded, or circular double-stranded DNA or RNA nucleotide sequence. The recombinant vector may be derived from any source and is capable of genomic integration or autonomous replication.

[0061] "Regulatory sequence" refers to a nucleotide sequence located upstream (5'), within, or downstream (3') with respect to a coding sequence. Additionally, introns may have regulatory activity. Transcription and expression of the coding sequence is typically impacted by the presence or absence of the regulatory sequence.

[0062] "Substantially homologous" refers to two sequences that are at least about 90% identical in sequence, as measured by the CLUSTAL W method in the Omiga program, using default parameters (Version 2.0; Accelrys, San Diego, Calif.).

[0063] "Substantially purified" refers to a molecule separated from substantially all other molecules normally associated with it in its native state. More preferably, a substantially purified molecule is the predominant species present in a preparation. A substantially purified molecule may be greater than about 60% free, preferably about 75% free, more preferably about 90% free, and most preferably about 95% free from the other molecules (exclusive of solvent) present in the natural mixture. The phrase "substantially purified" is not intended to encompass molecules present in their native state.

[0064] The term "transformation" refers to the introduction of nucleic acid into a recipient host. The term "host" refers to bacteria cells, fungi, animals or animal cells, plants or seeds, or any plant parts or tissues including plant cells, protoplasts, calli, roots, tubers, seeds, stems, leaves, seedlings, embryos, and pollen.

[0065] As used herein, a "transgenic plant" is a plant having stably introduced into its genome, for example, the nuclear or plastid genomes, a nucleic acid.

[0066] The terms "seeds" and "kernels" are understood to be equivalent in meaning. The term kernel is frequently used in describing the seed of a corn or rice plant. In all plants the seed is the mature ovule consisting of a seed coat, embryo, and in plants of the present invention, an endosperm.

HOI001 GBSS Nucleic Acids

[0067] The present invention provides nucleic acids that encode polypeptides substantially homologous to a granule bound starch synthase isolated from the inbred plant HOI001 (HOI001 GBSS). In one embodiment, these nucleic acid molecules are used in the context of the present invention for increasing the oil content of plant tissues. In one embodiment, the present invention provides an isolated nucleic acid that encodes a HOI001 GBSS protein, which nucleic acid is selected from the group consisting of SEQ ID NO: 1 and complements thereof, and nucleic acids which encode polypeptides having at least about 94% sequence identity with SEQ ID NO: 3. The percent sequence identity of the polypeptides encoded by nucleic acids of this invention is preferably at least about 95%; and most preferably at least about 98%.

[0068] The present invention also provides vectors containing such HOI001 GBSS nucleic acids. As set forth in further detail hereinbelow, preferred nucleic acids include appropriate regulatory elements operably linked thereto that facilitate efficient expression of the inventive nucleic acids in a host, including without limitation bacterial, fungal, or plant hosts. Vectors useful in the context of the present invention can include such regulatory elements.

[0069] The nucleic acids and vectors encompassed by the present invention need not have the exact nucleic acid sequences described herein. Instead, the sequences of these nucleic acids and vectors can vary, so long as the nucleic acid either performs the function for which it is intended or has some other utility, for example, as a nucleic acid probe for complementary nucleic acids. For example, some sequence variability in any part of a HOI001 GBSS nucleic acid is permitted so long as transformation of a plant with the mutant or variant polypeptide or polypeptides result in a phenotype substantially similar to that of HOI001 GBSS. Most preferably, the aforementioned sequence variability results in increased oil accumulation in plant tissues, as compared to plants of the same or similar genotype, but without the transgene.

[0070] Fragment and variant nucleic acids of SEQ ID NO: 1, are also encompassed by the present invention. Nucleic acid fragments encompassed by the present invention are of three general types. First, fragment nucleic acids that are not full length but do perform their intended function are encompassed within the present invention. Second, fragments of nucleic acids identified herein that are useful as hybridization probes, are also included in the invention. And, third, fragments of nucleic acids identified herein can be used in suppression technologies known in the art, such as, for example, anti-sense technology or RNA inhibition (RNAi), which provides for reducing carbon flow in a plant into oil, making more carbon available for protein or starch accumulation, for example. Thus, fragments of a nucleotide sequence, such as SEQ ID NO: 1 may range from at least about 15 nucleotides, about 17 nucleotides, about 18 nucleotides, about 20 nucleotides, about 50 nucleotides, about 100 nucleotides or more. In general, a fragment nucleic acid of the present invention can have any upper size limit so long as it is related in sequence to the nucleic acids of the present invention but does not include the full length.

[0071] As used herein, "variants" have substantially similar or substantially homologous sequences when compared to reference or wild type sequence. For nucleotide sequences that encode proteins, variants also include those sequences that, because of the degeneracy of the genetic code, encode the identical amino acid sequence of the reference protein. Variant nucleic acids also include those that encode polypeptides that do not have amino acid sequences identical to that of the proteins identified herein, but which encode an active protein with conservative changes in the amino acid sequence.

[0072] The present invention is not limited to silent changes in the present nucleotide sequences but also includes variant nucleic acid sequences that conservatively alter the amino acid sequence of a polypeptide of the present invention. Because it is the interactive capacity and nature of a protein that defines that protein's biological functional activity, certain amino acid sequence substitutions can be made in a protein sequence and, of course, its underlying DNA coding sequence and, nevertheless, a protein with like properties can still be obtained. It is thus contemplated by the inventors that various changes may be made in the peptide sequences of the proteins or fragments of the present invention, or corresponding DNA sequences that encode the peptides, without appreciable loss of their biological utility or activity. According to the present invention, then, variant and reference nucleic acids of the present invention may differ in the encoded amino acid sequence by one or more substitutions, additions, insertions, deletions, fusions, and truncations, which may be present in any combination, so long as an active HOI001 GBSS protein is encoded by the variant nucleic acid. Such variant nucleic acids will not encode exactly the same amino acid sequence as the reference nucleic acid, but have conservative sequence changes. Codons capable of coding for such conservative amino acid substitutions are well known in the art.

[0073] Another approach to identifying conservative amino acid substitutions require analysis of the hydropathic index of amino acids may be considered. The importance of the hydropathic amino acid index in conferring interactive biological function on a protein is generally understood in the art (Kyte and Doolittle, J. Mol. Biol., 157:105-132 (1982)). It is accepted that the relative hydropathic character of the amino acid contributes to the secondary structure of the resultant polypeptide, which in turn defines the interaction of the protein with other molecules, for example, enzymes, substrates, receptors, DNA, antibodies, antigens, and the like.

[0074] Each amino acid has been assigned a hydropathic index on the basis of its hydrophobicity and charge characteristics (Kyte and Doolittle, J. Mol. Biol., 157:105-132 (1982)); these are isoleucine (+4.5), valine (+4.2), leucine (+3.8), phenylalanine (+2.8), cysteine/cystine (+2.5), methionine (+1.9), alanine (+1.8), glycine (-0.4), threonine (-0.7), serine (-0.8), tryptophan (-0.9), tyrosine (-1.3), proline (-1.6), histidine (-3.2), glutamate (-3.5), glutamine (-3.5), aspartate (-3.5), asparagine (-3.5), lysine (-3.9), and arginine (-4.5).

[0075] In making such changes, the substitution of amino acids whose hydropathic indices are within .+-.2 is preferred, those that are within .+-.1 are particularly preferred, and those within .+-.0.5 are even more particularly preferred.

[0076] It is also understood in the art that the substitution of like amino acids can be made effectively on the basis of hydrophilicity. U.S. Pat. No. 4,554,101, states that the greatest local average hydrophilicity of a protein, as governed by the hydrophilicity of its adjacent amino acids, correlates with a biological property of the protein.

[0077] As detailed in U.S. Pat. No. 4,554,101, the following hydrophilicity values have been assigned to amino acid residues: arginine (+3.0), lysine (+3.0), aspartate (+3.0.+-.1), glutamate (+3.0.+-.1), serine (+0.3), asparagine (+0.2), glutamine (+0.2), glycine (0), threonine (-0.4), proline (-0.5.+-.1), alanine (-0.5), histidine (-0.5), cysteine (-1.0), methionine (-1.3), valine (-1.5), leucine (-1.8), isoleucine (-1.8), tyrosine (-2.3), phenylalanine (-2.5), and tryptophan (-3.4).

[0078] In making such changes, the substitution of amino acids whose hydrophilicity values are within .+-.2 is preferred, those that are within .+-.1 are particularly preferred, and those within .+-.0.5 are even more particularly preferred.

[0079] Variant nucleic acids with silent and conservative changes can be defined and characterized by the degree of homology to the reference nucleic acid. Preferred variant nucleic acids are substantially homologous to the reference nucleic acids of the present invention. As recognized by one of skill in the art, such substantially similar nucleic acids can hybridize under stringent conditions with the reference nucleic acids identified by SEQ ID NO: 1, herein. These types of substantially homologous nucleic acids are encompassed by this invention.

[0080] Variant nucleic acids can be detected and isolated by standard hybridization procedures. Hybridization to detect or isolate such sequences is generally carried out under "moderately stringent" and preferably under "stringent" conditions. Moderately stringent hybridization conditions and associated moderately stringent and stringent hybridization wash conditions used in the context of nucleic acid hybridization experiments, such as Southern and northern hybridization, are sequence dependent, and are different under different environmental parameters. Longer sequences hybridize specifically at higher temperatures. An extensive guide to the hybridization of nucleic acids is found in Tijssen, Laboratory Techniques in Biochemistry and Molecular Biology-Hybridization with Nucleic Acid Probes, page 1, Chapter 2 "Overview of principles of hybridization and the strategy of nucleic acid probe assays" Elsevier, NY (1993). See also, J. Sambrook et al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Press, NY, pp 9.31-9.58 (1989); J. Sambrook et al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Press, NY (3rd ed. 2001).

[0081] The present invention also provides methods for detection and isolation of derivative or variant nucleic acids encoding the proteins provided herein. The methods involve hybridizing at least a portion of a nucleic acid comprising any part of SEQ ID NO: 1 with respect to HOI001 GBSS-related sequences, to a sample nucleic acid, thereby forming a hybridization complex; and detecting the hybridization complex. The presence of the complex correlates with the presence of a derivative or variant nucleic acid that can be further characterized by nucleic acid sequencing, expression of RNA and/or protein and testing to determine whether the derivative or variant retains the ability to increase oil levels in plant tissue when transformed into that plant. In general, the portion of a nucleic acid comprising any part of SEQ ID NO: 1 used for hybridization is preferably at least about fifteen nucleotides, and hybridization is under hybridization conditions that are sufficiently stringent to permit detection and isolation of substantially homologous nucleic acids; preferably, the hybridization conditions are "moderately stringent", more preferably the hybridization conditions are "stringent", as defined herein and in the context of conventional molecular biological techniques well known in the art.

[0082] Generally, highly stringent hybridization and wash conditions are selected to be about 5.degree. C. lower than the thermal melting point (T.sub.m) for the specific double-stranded sequence at a defined ionic strength and pH. For example, under "highly stringent conditions" or "highly stringent hybridization conditions" a nucleic acid will hybridize to its complement to a detectably greater degree than to other sequences (e.g., at least 2-fold over background). By controlling the stringency of the hybridization and/or the washing conditions, nucleic acids having 100% complementary can be identified and isolated.

[0083] Typically, stringent conditions will be those in which the salt concentration is less than about 1.5 M Na ion, typically about 0.01 to 1.0 M Na ion concentration (or other salts) at pH 7.0 to 8.3 and the temperature is at least about 30.degree. C. for short probes (e.g., 10 to 50 nucleotides) and at least about 60.degree. C. for long probes (e.g., greater than 50 nucleotides). Stringent conditions may also be achieved with the addition of destabilizing agents such as formamide, in which case hybridization temperatures can be decreased. Dextran sulfate and/or Denhardt's solution (50.times.Denhardt's is 5% Ficoll, 5% polyvinylpyrrolidone, 5% BSA) can also be included in the hybridization reactions.

[0084] Exemplary low stringency conditions include hybridization with a buffer solution of 30 to 50% formamide, 5.times.SSC (20.times.SSC is 3M NaCl, 0.3 M trisodium citrate), 50 mM sodium phosphate, pH7, 5 mM EDTA, 0.1% SDS (sodium dodecyl sulfate), 5.times.Denhardt's with 100 .mu.g/ml denatured salmon sperm DNA at 37.degree. C., and a wash in 1.times. to 5.times.SSC (20.times.SSC defined as 3.0 M NaCl and 0.3 M trisodium citrate), 0.1% SDS at 37.degree. C. Exemplary moderate stringency conditions include hybridization in 40 to 50% formamide, 5.times.SSC 50 mM sodium phosphate, pH 7, 5 mM EDTA, 0.1% SDS, 5.times.Denhardt's with 100 .mu.g/ml denatured salmon sperm DNA at 42.degree. C., and a wash in 0.1.times. to 2.times.SSC, 0.1% SDS at 42 to 55.degree. C. Exemplary high stringency conditions include hybridization in 50% formamide, 5.times.SSC, 50 mM sodium phosphate, pH 7.0, 5 mM EDTA, 0.1% SDS, 5.times.Denhardt's with 100 .mu.g/ml denatured salmon sperm DNA at 42.degree. C., and a wash in 0.1.times.SSC, 0.1% SDS at 60 to 65.degree. C.

[0085] In another embodiment of the present invention, the inventive nucleic acids are defined by the percent identity relationship between particular nucleic acids and other members of the class using analytic protocols well known in the art. Such analytic protocols include, but are not limited to: CLUSTAL in the PC/Gene program (available from Intelligenetics, Mountain View, Calif. or in the Omiga program version 2.0 Accelrys Inc., San Diego, Calif.); the ALIGN program (Version 2.0); and GAP, BESTFIT, BLAST, FASTA, and TFASTA in the Wisconsin Genetics Software Package, Version 8 (available from Genetics Computer Group (GCG), 575 Science Drive, Madison, Wis.). Alignments using these programs can be performed using the default parameters. The CLUSTAL program is well described by Higgins et al., Gene, 73:237-244 (1988); Higgins et al., CABIOS, 5:151-153 (1989); Corpet et al., Nucleic Acids Res., 16:10881-10890 (1988); Huang et al., CABIOS, 8:155-165 (1992); and Pearson et al., Meth. Mol. Biol., 24:307-331 (1994). The ALIGN program is based on the algorithm of Meyers and Miller, Computer Applic. Biol. Sci., 4:11-17 (1988). The BLAST programs of Altschul et al., J. Mol. Biol., 215:403 (1990), are based on the algorithm of Karlin and Altschul, Proc. Natl. Acad. Sci. U.S.A., 87:2264-2268 (1990). To obtain gapped alignments for comparison purposes, Gapped BLAST (in BLAST 2.0) can be utilized as described in Altschul et al., Nucleic Acids Res., 25:3389 (1997). Alternatively, PSI-BLAST (in BLAST 2.0) can be used to perform an iterated search that detects distant relationships between molecules. See, Altschul et al., supra. When utilizing BLAST, Gapped BLAST, PSI-BLAST, the default parameters of the respective programs (e.g., BLASTN for nucleotide sequences, BLASTP for proteins) can be used. The BLASTN program (for nucleotide sequences) uses as defaults a word length (W) of 11, an expectation (E) of 10, a cutoff of 100, M=5, N=-4, and a comparison of both strands. For amino acid sequences, the BLASTP program uses as defaults a word length (W) of 3, an expectation (E) of 10, and the BLOSUM62 scoring matrix (see, Henikoff & Henikoff, Proc. Natl. Acad. Sci. U.S.A., 89:10915 (1989)) (see, http://www.ncbi.n1m.nih.gov.). Alignment may also be performed manually by inspection.

[0086] For purposes of the present invention, comparison of nucleotide sequences for determination of percent sequence identity to the nucleic acid sequences disclosed herein is preferably made using the BLASTN program (version 1.4.7 or later) with its default parameters or any equivalent program. By "equivalent program" is intended any sequence comparison program that, for any two sequences in question, generates an alignment having identical nucleotide or amino acid residue matches and an identical percent sequence identity when compared to the corresponding alignment generated by the preferred program.

Expression Vectors and Cassettes

[0087] The expression vectors and cassettes of the present invention include nucleic acids encoding HOI001 GBSS. A transgene comprising a HOI001 GBSS can be subcloned into an expression vector or cassette, and HOI001 GBSS expression can be detected and/or quantified. This method of screening is useful to identify transgenes providing for an expression of a HOI001 GBSS, and expression of a HOI001 GBSS in a transformed plant cell.

[0088] Plasmid vectors that provide for easy selection, amplification, and transformation of the transgene in prokaryotic and eukaryotic cells include, for example, pUC-derived vectors, pSK-derived vectors, pGEM-derived vectors, pSP-derived vectors, pBS-derived vectors, pFastBac (Invitrogen Corporation, Carlsbad, Calif.) for baculovirus expression and pYES2 (Invitrogen) for yeast expression. Additional elements may be present in such vectors, including origins of replication to provide for autonomous replication of the vector, selectable marker genes, preferably encoding antibiotic or herbicide resistance, unique multiple cloning sites providing for multiple sites to insert DNA sequences or genes encoded in the transgene, and sequences that enhance transformation of prokaryotic and eukaryotic cells. One vector that is useful for expression in both plant and prokaryotic cells is the binary Ti plasmid (as disclosed in Schilperoot et al., U.S. Pat. No. 4,940,838), as exemplified by vector pGA582. This binary Ti plasmid vector has been previously characterized by An, Methods in Enzymology, 153:292 (1987). This binary Ti vector can be replicated in prokaryotic bacteria, such as E. coli and Agrobacterium. The Agrobacterium plasmid vectors can also be used to transfer the transgene to plant cells. The binary Ti vectors preferably include the T DNA right and left borders to provide for efficient plant cell transformation, a selectable marker gene, unique multiple cloning sites in the T border regions, the colE1 replication of origin and a wide host range replicon. The binary Ti vectors carrying a transgene of the present invention can be used to transform both prokaryotic and eukaryotic cells, but is preferably used to transform plant cells, (see, Glassman et al., U.S. Pat. No. 5,258,300). Examples of plant expression vectors include the commercial vectors pBI101, pBI101.2, pBI101.3, and pBIN19 (Clontech, Palo Alto, Calif.).

[0089] In general, the expression vectors and cassettes of the present invention contain at least a promoter capable of expressing RNA in a plant cell and a terminator, in addition to a nucleic acid encoding a HOI001 GBSS. Other elements may also be present in the expression cassettes of the present invention. For example, expression cassettes can also contain enhancers, introns, untranslated leader sequences, cloning sites, matrix attachment regions for silencing the effects of chromosomal control elements, and other elements known to one of skill in the art.

[0090] Expression cassettes have promoters that can regulate gene expression. Promoter regions are typically found in the flanking DNA sequence upstream from coding regions in both prokaryotic and eukaryotic cells. A promoter sequence provides for regulation of transcription of the downstream gene sequence and typically includes from about 50 to about 2,000 nucleotide base pairs. Promoter sequences also contain regulatory sequences, such as enhancer sequences that can influence the level of gene expression. Some isolated promoter sequences can provide for gene expression of heterologous genes, that is, a gene different from the native or homologous gene. Promoter sequences are also known to be strong or weak or inducible. A strong promoter provides for a high level of gene expression, whereas a weak promoter provides for a very low level of gene expression. An inducible promoter is a promoter that provides for turning on and off of gene expression in response to an exogenously added agent or to an environmental or developmental stimulus. Promoters can also provide for tissue specific or developmental regulation. An isolated promoter sequence that is a strong promoter for heterologous genes is advantageous because it provides for a sufficient level of gene expression to allow for easy detection and selection of transformed cells and provides for a high level of gene expression when desired. Transcription initiation regions that are preferentially expressed in seed tissue, and that are undetectable in other plant parts, are considered desirable for seed oil modifications in order to minimize any disruptive or adverse effects of the gene product.

[0091] Promoters of the present invention will generally include, but are not limited to, promoters that function in bacteria, plant cells, or plastids. Useful promoters for bacterial expression are the lacZ, T7, T5, or E. coli glg C promoters. Useful promoters for plant cells include wheat high molecular weight glutenin promoter (bp 2647-3895 of Genbank Accession X12928, version X12928.3, originally described in Anderson et al., Nucleic Acids Res., 17:461-462 (1989)), the globulin promoter (see, Belanger and Kriz, Genet., 129:863-872, (1991)), gamma zein Z27 promoter (see, U.S. Ser. No. 08/763,705; also, Lopes et al., Mol Gen Genet., 247:603-613 (1995)), L3 oleosin promoter (U.S. Pat. No. 6,433,252), CaMV 35S promoter (Odell et al., Nature, 313:810 (1985)), the CaMV 19S (Lawton et al., Plant Mol. Biol., 9:31F (1987)), nos (Ebert et al., Proc. Natl. Acad. Sci. U.S.A., 84:5745 (1987)), Adh (Walker et al., Proc. Natl. Acad. Sci. U.S.A., 84:6624 (1987)), sucrose synthase (Yang et al., Proc. Natl. Acad. Sci. U.S.A., 87:4144 (1990)), tubulin, actin (Wang et al., Mol. Cell. Biol., 12:3399 (1992)), cab (Sullivan et al., Mol. Gen. Genet., 215:431 (1989)), PEPCase promoter (Hudspeth et al., Plant Mol. Biol., 12:579 (1989)), or those associated with the R gene complex (Chandler et al., The Plant Cell, 1:1175 (1989)).

[0092] Indeed in a preferred embodiment the promoter used is highly-expressed in the endosperm. Exemplary promoters include those from the zeins which are a group of storage proteins found in maize endosperm. Genomic clones for zein genes have been isolated (Pedersen et al., Cell, 29:1015-1026 (1982) and Russell et al., Transgenic Res., 6(2):157-168 (1997)) and the promoters from these clones, including the 15 kD, 16 kD, 19 kD, 22 kD, and 27 kD genes (Z27, U.S. Ser. No. 08/763,705; also, Reina et al., Nucl. Acids Res., 18:6426 (1990), Lopes et al., Mol. Gen. Genet., 247:603-613 (1995)), can also be used. Other preferred promoters, known to function in maize, and in other plants, include the promoters for the following genes: WAXY (granule bound starch synthase; Shure et al., Cell, 35:225-233 (1983); Russell et al., Transgenic Res., 6(2):157-168 (1997)), Brittle 2 and Shrunken 2 (ADP glucose pryophosphorylase, Anderson et al., Gene, 97:199-205 (1991), Russell et al., Transgenic Res., 6(2):157-168 (1997)), Shrunken 1(sucrose synthase, Yang and Russell, Proc. Natl. Acad. Sci. U.S.A., 87:4144-4148 (1990)), branching enzymes I and II, WAXY promoter from rice (Terada et al., Plant Cell Physiology, 41(7):881-888 (2000)), debranching enzymes, glutelins (Zheng et al., Plant J., 4:357-366 (1993), Russell et al., Transgenic Res., 6(2):157-168 (1997)), and Betl1 (basal endosperm transfer layer; Hueros et al., Plant Physiol., 121:1143-1152 (1999)). Other promoters useful in the practice of the present invention that are known by one of skill in the art are also contemplated by the invention.

[0093] Moreover, transcription enhancers or duplications of enhancers can be used to increase expression from a particular promoter. Examples of such enhancers include, but are not limited to, elements from the CaMV 35S promoter and octopine synthase genes (Last et al., U.S. Pat. No. 5,290,924). As the DNA sequence between the transcription initiation site and the start of the coding sequence, i.e., the untranslated leader sequence, can influence gene expression, one may also wish to employ a particular leader sequence. Any leader sequence available to one of skill in the art may be employed. Preferred leader sequences direct optimum levels of expression of the attached gene, for example, by increasing or maintaining mRNA stability and/or by preventing inappropriate initiation of translation (Joshi, Nucl. Acid Res., 15:6643 (1987)). The choice of such sequences is at the discretion of those of skill in the art. Sequences that are derived from genes that are highly expressed in higher plants, and in soybean, corn, and canola in particular, are contemplated.

[0094] Expression cassettes of the present invention will also include a sequence near the 3' end of the cassette that acts as a signal to terminate transcription from a heterologous nucleic acid and that directs polyadenylation of the resultant mRNA. These are commonly referred to as 3' untranslated regions or 3' UTRs. Some 3' elements that can act as transcription termination signals include the wheat HSP17 3' UTR (bp532-741 of GenBank X13431, version X13431.1, McElvain and Spiker, Nucleic Acids Res., 17:1764 (1989)), those from the nopaline synthase gene of Agrobacterium tumefaciens (Bevan et al., Nucl. Acid Res., 11:369 (1983)), a napin 3' UTR (Kridl et al., Seed Sci Res., 1:209-219 (1991)), a globulin 3' UTR (Belanger and Kriz, Genetics, 129:863-872 (1991)), or one from a zein gene, such as Z27 (Lopes et al., Mol Gen Genet., 247:603-613 (1995)). Other 3' elements known by one of skill in the art also can be used in the vectors of the present invention.

[0095] Regulatory elements, such as Adh intron 1 (Callis et al., Genes Develop., 1:1183 (1987)), a rice actin intron (McElroy et al., Mol. Gen. Genet., 231(1):150-160 (1991)), sucrose synthase intron (Vasil et al., Plant Physiol., 91:5175 (1989)), the maize HSP70 intron (Rochester et al., EMBO J., 5:451-458 (1986)), or TMV omega element (Gallie et al., The Plant Cell, 1:301 (1989)) may further be included where desired. These 3' nontranslated regulatory sequences can be obtained as described in An, Methods in Enzymology, 153:292 (1987) or are already present in plasmids available from commercial sources, such as Clontech, Palo Alto, Calif. The 3' nontranslated regulatory sequences can be operably linked to the 3' terminus of any heterologous nucleic acid to be expressed by the expression cassettes contained within the present vectors. Other such regulatory elements useful in the practice of the present invention are known by one of skill in the art and can also be placed in the vectors of the invention.

[0096] The vectors of the present invention, as well as the coding regions claimed herein, can be optimized for expression in plants by having one or more codons replaced by other codons encoding the same amino acids so that the polypeptide is optimally translated by the translation machinery of the plant species in which the vector is used.

Selectable Markers

[0097] Selectable marker genes or reporter genes are also useful in the present invention. Such genes can impart a distinct phenotype to cells expressing the marker gene and thus allow such transformed cells to be distinguished from cells that do not have the marker. Selectable marker genes confer a trait that one can "select" for by chemical means, i.e., through the use of a selective agent (e.g., a herbicide, antibiotic, or the like). Reporter genes, or screenable genes, confer a trait that one can identify through observation or testing, i.e., by "screening" (e.g., the R-locus trait). Of course, many examples of suitable marker genes are known to the art and can be employed in the practice of the present invention.

[0098] A number of selectable marker genes are known in the art and can be used in the present invention. A preferred selectable marker gene for use in the present invention would include genes that confer resistance to herbicides like glyphosate, such as EPSP (Della-Cioppa et al., Bio/Technology, 5(6):579-84 (1987)). A particularly preferred selectable marker would include a gene that encodes an altered EPSP synthase protein (Hinchee et al., Biotech., 6:915 (1988)). Other possible selectable markers for use in connection with the present invention include, but are not limited to, a neo gene (Potrykus et al., Mol. Gen. Genet., 199:183 (1985)) which codes for kanamycin resistance and can be selected for by applying kanamycin, a kanamycin analog such as geneticin (Sigma Chemical Company, St. Louis, Mo.), and the like; a bar gene that codes for bialaphos resistance; a nitrilase gene, such as bxn from Klebsiella ozaenae, which confers resistance to bromoxynil (Stalker et al., Science, 242:419 (1988)); a mutant acetolactate synthase gene (ALS) that confers resistance to imidazolinone, sulfonylurea or other ALS-inhibiting chemicals (EP 154 204A1 (1985)); a methotrexate-resistant DHFR gene (Thillet et al., J. Biol. Chem., 263:12500 (1988)); a dalapon dehalogenase gene that confers resistance to the herbicide dalapon. Where a mutant EPSP synthase gene is employed, additional benefit may be realized through the incorporation of a suitable plastid transit peptide (CTP).

[0099] Screenable markers that may be employed include, but are not limited to, .beta.-glucuronidase or uidA gene (GUS), which encodes an enzyme for which various chromogenic substrates are known; an R-locus gene, which encodes a product that regulates the production of anthocyanin pigments (red color) in plant tissues (Dellaporta et al., In Chromosome Structure and Function, pp. 263-282 (1988)); a .beta.-lactamase gene (Sutcliffe, Proc. Natl. Acad. Sci. U.S.A., 75:3737 (1978)), which encodes an enzyme for which various chromogenic substrates are known (e.g., PADAC, a chromogenic cephalosporin); a xylE gene (Zukowsky et al., Proc. Natl. Acad. Sci. U.S.A., 80:1101 (1983)) that encodes a catechol dioxygenase that can convert chromogenic catechols; an .alpha.-amylase gene (Ikuta et al., Biotech., 8:241 (1990)); a tyrosinase gene (Katz et al., J. Gen. Microbiol., 129:2703 (1983)) that encodes an enzyme capable of oxidizing tyrosine to DOPA and dopaquinone, which in turn condenses to form the easily detectable compound melanin; a .beta.-galactosidase gene, which encodes an enzyme for which there are chromogenic substrates; a luciferase (lux) gene (Ow et al., Science, 234:856 (1986)), which allows for bioluminescence detection; or an aequorin gene (Prasher et al., Biochem. Biophys. Res. Comm., 126:1259 (1985)), which may be employed in calcium-sensitive bioluminescence detection, or a green fluorescent protein gene (Niedz et al., Plant Cell Reports, 14:403 (1995)). In a preferred embodiment, the screenable marker gene is operably linked to an aleurone-specific promoter as described by Kriz et al., in U.S. Pat. No. 6,307,123.

[0100] In addition to nuclear plant transformation, the present invention also extends to direct transformation of the plastid genome of plants. Hence, targeting of the gene product to an intracellular compartment within plant cells may also be achieved by direct delivery of a gene to the intracellular compartment. In some embodiments, direct transformation of plastid genome may provide additional benefits over nuclear transformation. For example, direct plastid transformation of HOI001 GBSS eliminates the requirement for a plastid targeting peptide and post-translational transport and processing of the pre-protein derived from the corresponding nuclear transformants. Plastid transformation of plants has been described by P. Maliga, Current Opinion in Plant Biology, 5:164-172 (2002), Heifetz, Biochimie, 82:655-666 (2000), Bock, J. Mol. Biol., 312:425-438 (2001), and Daniell et al., Trends in Plant Science, 7:84-91 (2002), and references cited therein.

[0101] After constructing a transgene containing an HOI001 GBSS, the expression vector or cassette can then be introduced into a plant cell. Depending on the type of plant cell, the level of gene expression, and the activity of the enzyme encoded by the gene, introduction of DNA encoding an HOI001 GBSS into the plant cell can lead to increased oil content in plant tissues.

Plant Transformation

[0102] Techniques for transforming a plant cell, a plant tissue, a plant organ, or a plant with a nucleic acid construct, such as a vector are known in the art. Such methods involve plant tissue culture techniques, for example. As used herein, "transforming" refers to the introduction of nucleic acid into a recipient host and the expression therein.

[0103] The plant cell, plant tissue, plant organ, or plant can be contacted with the vector by any suitable means as known in the art. Preferably, a transgenic plant expressing the desired protein is to be produced. Various methods for the introduction of a desired polynucleotide sequence encoding the desired protein into plant cells include, but are not limited to: (1) physical methods such as microinjection (Capecchi, Cell, 22(2):479-488 (1980)), electroporation (Fromm et al., Proc. Nat. Acad. Sci. U.S.A., 82(17):5824-5828 (1985); U.S. Pat. No. 5,384,253), and microprojectile bombardment mediated delivery (Christou et al., Bio/Technology, 9:957 (1991); Fynan et al., Proc. Nat. Acad. Sci. U.S.A., 90(24):11478-11482 (1993)); (2) virus mediated delivery methods (Clapp, Clin. Perinatol., 20(1):155-168 (1993); Lu et al., J. Exp. Med., 178(6):2089-2096 (1993); Eglitis and Anderson, Biotechniques, 6(7):608-614 (1988); and (3) Agrobacterium-mediated transformation methods.

[0104] The most commonly used methods for transformation of plant cells are the Agrobacterium-mediated DNA transfer process (Fraley et al., Proc. Nat. Acad. Sci. U.S.A., 80:4803 (1983)) and the microprojectile bombardment mediated process. Typically, nuclear transformation is desired but where it is desirable to specifically transform plastids, such as chloroplasts or amyloplasts, plant plastids may be transformed utilizing a microprojectile bombardment mediated delivery of the desired polynucleotide for certain plant species such as tobacco, Arabidopsis, potato, and Brassica species.

[0105] Agrobacterium-mediated transformation is achieved through the use of a genetically engineered soil bacterium belonging to the genus Agrobacterium. Several Agrobacterium species mediate the transfer of a specific DNA known as "T-DNA," which can be genetically engineered to carry any desired piece of DNA into many plant species. The major events marking the process of T-DNA mediated pathogenesis are: induction of virulence genes, processing, and transfer of T-DNA. This process is the subject of many reviews (Ream, Ann. Rev. Phytopathol., 27:583-618 (1989); Howard and Citovsky, Bioassays, 12:103-108 (1990); Kado, Crit. Rev. Plant Sci., 10:1-32 (1991); Zambryski, Annual Rev. Plant Physiol. Plant Mol. Biol., 43:465-490 (1992); Gelvin, In Transgenic Plants, Kung and Wu, (eds.), Academic Press, San Diego, Calif., pp. 49-87 (1993); Binns and Howitz, In Bacterial Pathogenesis of Plants and Animals, Dang, (ed.). Berlin: Springer Verlag, pp. 119-138 (1994); Hooykaas and Beijersbergen, Ann. Rev. Phytopathol., 32:157-179 (1994); Lessl and Lanka, Cell, 77:321-324 (1994); Zupan and Zambryski, Annual Rev. Phytopathol., 27:583-618 (1995)).

[0106] Agrobacterium-mediated genetic transformation of plants involves several steps. The first step, in which the virulent Agrobacterium and plant cells are first brought into contact with each other, is generally called "inoculation." The Agrobacterium containing solution is then removed from contact with the explant by draining or aspiration. Following the inoculation, the Agrobacterium and plant cells/tissues are permitted to be grown together for a period of several hours to several days or more under conditions suitable for growth and T-DNA transfer. This step is termed "co-culture." Following co-culture and T-DNA delivery, the plant cells are treated with bactericidal or bacteriostatic agents to kill the Agrobacterium remaining in contact with the explant and/or in the vessel containing the explant. If this is done in the absence of any selective agents to promote preferential growth of transgenic versus non-transgenic plant cells, then this is typically referred to as the "delay" step. If done in the presence of selective pressure favoring transgenic plant cells, then it is referred to as a "selection" step. When a "delay" is used, it is typically followed by one or more "selection" steps. Both the "delay" and "selection" steps typically include bactericidal or bacteriostatic agents to kill any remaining Agrobacterium cells because the growth of Agrobacterium cells is undesirable after the infection (inoculation and co-culture) process.

[0107] A number of wild-type and disarmed strains of Agrobacterium tumefaciens and Agrobacterium rhizogenes harboring Ti or Ri plasmids can be used for gene transfer into plants. The Agrobacterium hosts contain disarmed Ti and Ri plasmids that do not contain the oncogenes that cause tumorigenesis or rhizogenesis, respectfully, which are used as the vectors and contain the genes of interest that are subsequently introduced into plants. Preferred strains would include but are not limited to Agrobacterium tumefaciens strain C58, a nopaline-type strain that is used to mediate the transfer of DNA into a plant cell, octopine-type strains such as LBA4404 or succinamopine-type strains, e.g., EHA101 or EHA105. The nucleic acid molecule, prepared as a DNA composition in vitro, is introduced into a suitable host such as E. coli and mated into the Agrobacterium, or directly transformed into competent Agrobacterium. These techniques are well-known to those of skill in the art.

[0108] The Agrobacterium can be prepared either by inoculating a liquid such as Luria Burtani (LB) media directly from a glycerol stock or streaking the Agrobacterium onto a solidified media from a glycerol stock, allowing the bacteria to grow under the appropriate selective conditions, generally from about 26.degree. C.-30.degree. C., or about 28.degree. C., and taking a single colony or a small loop of Agrobacterium from the plate and inoculating a liquid culture medium containing the selective agents. Those of skill in the art are familiar with procedures for growth and suitable culture conditions for Agrobacterium as well as subsequent inoculation procedures. The density of the Agrobacterium culture used for inoculation and the ratio of Agrobacterium cells to explant can vary from one system to the next, and therefore optimization of these parameters for any transformation method is expected.

[0109] Typically, an Agrobacterium culture is inoculated from a streaked plate or glycerol stock and is grown overnight and the bacterial cells are washed and resuspended in a culture medium suitable for inoculation of the explant.

[0110] With respect to microprojectile bombardment (U.S. Pat. Nos. 5,550,318; 5,538,880; and 5,610,042; and PCT Publication WO 95/06128; each of which is specifically incorporated herein by reference in its entirety), particles are coated with nucleic acids and delivered into cells by a propelling force. Exemplary particles include those comprised of tungsten, platinum, and preferably, gold. It is contemplated that in some instances DNA precipitation onto metal particles would not be necessary for DNA delivery to a recipient cell using microprojectile bombardment. However, it is contemplated that particles may contain DNA rather than be coated with DNA. Hence, it is proposed that DNA-coated particles may increase the level of DNA delivery via particle bombardment but are not, in and of themselves, necessary.

[0111] For the bombardment, cells in suspension are concentrated on filters or solid culture medium. Alternatively, immature embryos or other target cells may be arranged on solid culture medium. The cells to be bombarded are positioned at an appropriate distance below the microprojectile stopping plate.

[0112] An illustrative embodiment of a method for delivering DNA into plant cells by microprojectile bombardment is the Biolistics Particle Delivery System (BioRad, Hercules, Calif.), which can be used to propel particles coated with DNA or cells through a screen, such as a stainless steel or Nytex screen, onto a filter surface covered with monocot plant cells cultured in suspension. The screen disperses the particles so that they are not delivered to the recipient cells in large aggregates. It is believed that a screen intervening between the projectile apparatus and the cells to be bombarded reduces the size of projectile aggregates and may contribute to a higher frequency of transformation by reducing the damage inflicted on the recipient cells by projectiles that are too large.

[0113] For microprojectile bombardment, one will attach (i.e., "coat") DNA to the microprojectiles such that it is delivered to recipient cells in a form suitable for transformation thereof. In this respect, at least some of the transforming DNA must be available to the target cell for transformation to occur, while at the same time during delivery the DNA must be attached to the microprojectile. Therefore, availability of the transforming DNA from the microprojectile may comprise the physical reversal of bonds between transforming DNA and the microprojectile following delivery of the microprojectile to the target cell. This need not be the case, however, as availability to a target cell may occur as a result of breakage of unbound segments of DNA or of other molecules which comprise the physical attachment to the microprojectile. Availability may further occur as a result of breakage of bonds between the transforming DNA and other molecules, which are either directly or indirectly attached to the microprojectile. It further is contemplated that transformation of a target cell may occur by way of direct recombination between the transforming DNA and the genomic DNA of the recipient cell. Therefore, as used herein, a "coated" microprojectile will be one which is capable of being used to transform a target cell, in that the transforming DNA will be delivered to the target cell, yet will be accessible to the target cell such that transformation may occur.

[0114] Any technique for coating microprojectiles, which allows for delivery of transforming DNA to the target cells, may be used. Methods for coating microprojectiles, which have been demonstrated to work well with the present invention, have been specifically disclosed herein. DNA may be bound to microprojectile particles using alternative techniques, however. For example, particles may be coated with streptavidin and DNA end labeled with long chain thiol cleavable biotinylated nucleotide chains. The DNA adheres to the particles due to the streptavidin-biotin interaction, but is released in the cell by reduction of the thiol linkage through reducing agents present in the cell.

[0115] Alternatively, particles may be prepared by functionalizing the surface of a gold oxide particle, providing free amine groups. DNA, having a strong negative charge, binds to the functionalized particles. Furthermore, charged particles may be deposited in controlled arrays on the surface of mylar flyer disks used in the PDS-1000 Biolistics device, thereby facilitating controlled distribution of particles delivered to target tissue.

[0116] As disclosed above, it further is proposed, that the concentration of DNA used to coat microprojectiles may influence the recovery of transformants containing a single copy of the transgene. For example, a lower concentration of DNA may not necessarily change the efficiency of the transformation, but may instead increase the proportion of single copy insertion events. In this regard, approximately 1 ng to 2000 ng of transforming DNA may be used per each 1.8 mg of starting microprojectiles. In other embodiments of the present invention, approximately 2.5 ng to 1000 ng, 2.5 ng to 750 ng, 2.5 ng to 500 ng, 2.5 ng to 250 ng, 2.5 ng to 100 ng, or 2.5 ng to 50 ng of transforming DNA may be used per each 1.8 mg of starting microprojectiles.

[0117] Microprojectile bombardment techniques are widely applicable, and may be used to transform virtually any plant species. Examples of species that have been transformed by microprojectile bombardment include monocot species such as maize (PCT Publication WO 95/06128), barley, wheat (U.S. Pat. No. 5,563,055, specifically incorporated herein by reference in its entirety), rice, oat, rye, sugarcane, and sorghum; as well as a number of dicots including tobacco, soybean (U.S. Pat. No. 5,322,783, specifically incorporated herein by reference in its entirety), sunflower, peanut, cotton, tomato, and legumes in general (U.S. Pat. No. 5,563,055, specifically incorporated herein by reference in its entirety).

[0118] For microprojectile bombardment transformation in accordance with the present invention, both physical and biological parameters may be optimized. Physical factors are those that involve manipulating the DNA/microprojectile precipitate or those that affect the flight and velocity of either the macro- or microprojectiles. Biological factors include all steps involved in manipulation of cells before and immediately after bombardment, such as the osmotic adjustment of target cells to help alleviate the trauma associated with bombardment, the orientation of an immature embryo or other target tissue relative to the particle trajectory, and also the nature of the transforming DNA, such as linearized DNA or intact supercoiled plasmids. It is believed that pre-bombardment manipulations are especially important for successful transformation of immature embryos.

[0119] Accordingly, it is contemplated that one may wish to adjust various bombardment parameters in small scale studies to fully optimize the conditions. One may particularly wish to adjust physical parameters such as DNA concentration, gap distance, flight distance, tissue distance, and helium pressure. It further is contemplated that the grade of helium may affect transformation efficiency. One also may optimize the trauma reduction factors (TRFs) by modifying conditions which influence the physiological state of the recipient cells and which may therefore influence transformation and integration efficiencies. For example, the osmotic state, tissue hydration, and the subculture stage or cell cycle of the recipient cells may be adjusted for optimum transformation.

[0120] Other methods of cell transformation can also be used and include but are not limited to introduction of DNA into plants by direct DNA transfer into pollen (Hess et al., Intern Rev. Cytol., 107:367 (1987); Luo et al., Plant Mol. Biol. Reporter, 6:165 (1988)), by direct injection of DNA into reproductive organs of a plant (Pena et al., Nature, 325:274 (1987)), or by direct injection of DNA into the cells of immature embryos followed by the rehydration of desiccated embryos (Neuhaus et al., Theor. Appl. Genet., 75:30 (1987)).

[0121] The regeneration, development, and cultivation of plants from single plant protoplast transformants or from various transformed explants is well known in the art (Weissbach and Weissbach, In: Methods for Plant Molecular Biology, Academic Press, San Diego, Calif., (1988)). This regeneration and growth process typically includes the steps of selection of transformed cells, culturing those individualized cells through the usual stages of embryonic development through the rooted plantlet stage. The resulting transgenic rooted shoots are thereafter planted in an appropriate plant growth medium such as soil.

[0122] The development or regeneration of plants containing the exogenous gene that encodes a protein of interest is well known in the art. Preferably, the regenerated plants are self-pollinated to provide homozygous transgenic plants. Otherwise, pollen obtained from the regenerated plants is crossed to seed-grown plants of agronomically important lines. Conversely, pollen from plants of these important lines is used to pollinate regenerated plants. A transgenic plant of the present invention containing a desired polypeptide is cultivated using methods well known to one skilled in the art.

[0123] There are a variety of methods for the regeneration of plants from plant tissue. The particular method of regeneration will depend on the starting plant tissue and the particular plant species to be regenerated.

[0124] Assays for gene expression based on the transient expression of cloned nucleic acid constructs have been developed by introducing the nucleic acid molecules into plant cells by polyethylene glycol treatment, electroporation, or particle bombardment (Marcotte et al., Nature, 335:454-457 (1988); Marcotte et al., Plant Cell, 1:523-532 (1989); McCarty et al., Cell, 66:895-905 (1991); Hattori et al., Genes Dev., 6:609-618 (1992); Goff et al., EMBO J., 9:2517-2522 (1990)). Transient expression systems may be used to functionally dissect gene constructs (see generally, Maliga et al., Methods in Plant Molecular Biology, Cold Spring Harbor Press (1995)).

[0125] Any of the nucleic acid molecules of the present invention may be introduced into a plant cell in a permanent or transient manner in combination with other genetic elements such as vectors, promoters, enhancers, etc. Further, any of the nucleic acid molecules of the present invention may be introduced into a plant cell in a manner that allows for expression or overexpression of the protein or fragment thereof encoded by the nucleic acid molecule.

[0126] Transgenic plants may find use in the commercial manufacture of proteins or other molecules, such as oils, where the molecules of interest are extracted or purified from plant parts, seeds, and the like. Cells or tissue from the plants may also be cultured, grown in vitro, or fermented to manufacture such molecules.

[0127] Improvements encoded by the recombinant DNA may be transferred, e.g., from cells of one species to cells of other species, e.g., by protoplast fusion. The transgenic plants may also be used in commercial breeding programs, or may be crossed or bred to plants of related crop species. For example, a nucleic acid of the present invention, operably linked to a promoter, can be introduced into a particular plant variety by crossing, without the need for ever directly transforming a plant of that given variety. Therefore the present invention not only encompasses a plant directly regenerated from cells that have been transformed in accordance with the present invention, but also the progeny of such plants.

[0128] The present invention also provides for a method of stably expressing an HOI001 GBSS of interest in a plant, which includes, contacting the plant cell with a vector of the present invention that has a nucleic acid encoding the HOI001 GBSS of interest, under conditions effective to transfer and integrate the vector into the nuclear genome of the cell. A promoter within the expression cassette can be any of the promoters provided herein, for example, a constitutive promoter, an inducible promoter, a tissue-specific promoter, or a seed specific promoter. Such promoters can provide expression of an encoded HOI001 GBSS at a desired time, or at a desired developmental stage, or in a desired tissue. The vector can also include a selectable marker gene. When using the vector with Agrobacterium tumefaciens, the vector can have an Agrobacterium tumefaciens origin of replication.

Plants

[0129] Plants for use with the vectors of the present invention preferably include monocots, especially oil producing species, most preferably corn (Zea mays). Other species contemplated by the present invention include alfalfa (Medicago sativa), rice (Oryza sativa), barley (Hordeum vulgare), millet (Panicum miliaceum), rye (Secale cereale), wheat (Triticum aestivum), and sorghum (Sorghum bicolor).

[0130] Any of the plants or parts thereof of the present invention may be processed to produce a feed, meal, protein, or oil preparation. A particularly preferred plant part for this purpose is a seed. Methods to produce feed, meal, protein, and oil preparations are known in the art. See, for example, U.S. Pat. Nos. 4,957,748; 5,100,679; 5,219,596; 5,936,069; 6,005,076; 6,146,669; and 6,156,227.

Characterization of Transformed Plants

[0131] To confirm the presence of the transgene in the regenerated plant, a variety of techniques, which are well known in the art, are available. Examples of these techniques include but are not limited to: (a) molecular assays of DNA integration or RNA expression such as Southern or northern blotting, TAQMAN.RTM. technology (Applied Biosystems, Foster City, Calif.) and PCR; (b) biochemical assays detecting the presence of the protein product such as ELISA, western blotting, or by enzymatic function; or (c) chemical analysis of the targeted plant part, such as seed tissue, for qualitative and quantitative determination of oil, protein, or starch.

[0132] The following examples are provided to illustrate the present invention and are not intended to limit the invention in any way.

EXAMPLE 1

[0133] This example describes the isolation and sequencing of the HOI001 GBSS gene from corn line HOI001. HOI001 is an inbred plant derived from MGSC 915E (Maize Genetic Stock Center, Urbana, Ill.), and is more fully described in U.S. Patent Publication Nos. 20030172416 and 20030154524, both of which are incorporated herein by reference.

[0134] Genomic DNA was extracted from corn germ tissue from HOI001, 22 days after pollination, using the following procedure. Between 50-100 mg dissected germ tissue was placed in a Bio101 Multimix tube (Qiagen, Carlsbad, Calif., Cat. No. 657-601) with extraction buffer and glass beads. The extraction buffer consisted of 100 mM Tris-HCl (pH 8.0), 50 mM EDTA, 100 mM NaCl, 5 mM DTT, and 1% SDS. The tissue was then disrupted using the Bio 101 FASTPREP.RTM. machine (Qiagen) with 3 pulses of 20 seconds each. Following a 15 minute incubation at 65.degree. C., 330 .mu.l of 5M potassium acetate was added to each tube. The tubes were then incubated at 0.degree. C. for 20 minutes to precipitate the SDS, followed by centrifugation at 12,000 rpm (Eppendorf Model 54172) for 10 minutes. The supernatant was transferred to a new tube and 100 .mu.l of 5M ammonium acetate (pH 7.0) and 700 .mu.l of isopropanol was added to precipitate the DNA. The tubes were mixed by inversion and centrifuged at 14,000 rpm for 10 minutes. After discarding the supernatant, the pellet was resuspended in 500 .mu.l of 70% ethanol and recovered by centrifugation at 14,000 rpm for 5 minutes. The recovered pellet containing the DNA was resuspended in 50 .mu.l of TE buffer and stored at 4.degree. C.

[0135] The HOI001 GBSS gene was isolated from the extracted genomic DNA using PCR methodology that was adapted from Advantage GC (BD Biosciences Clontech, Palo Alto, Calif.). The following primers were designed based on the published sequence of Zea mays GBSS from Shure et al., Cell, 35(1):225-233 (1983) [SEQ ID NO: 2]: TABLE-US-00001 5' primer (Primer number 14543) [SEQ ID NO: 5] 5'-TCAGCCGTTCGTGTGGCAAGATTCATCTGTTGTCTC-3' 3' primer (Primer number 14547) [SEQ ID NO: 6] 5'-TCAGCGGGATTATTTACTCCACCACTACAGGTCCATTT-3'.

[0136] The following PCR reaction was assembled for a total volume of 50 .mu.l; TABLE-US-00002 37 .mu.l PCR grade water 5 .mu.l 5X Advantage GC PCR buffer 1 .mu.l 50X dNTP Mix (10 mM each) 1 .mu.l 50X Advantage GC Polymerase Mix 2.5 .mu.l primer 14543 2.5 .mu.l primer 14547 1 .mu.l genomic DNA

[0137] The cycle parameters were: 95.degree. C. for 1 minute, 35 cycles of 95.degree. C. for 30 seconds and 68.degree. C. for 3 minutes.

[0138] The PCR products were separated by electrophoresis in agarose and a 4.7 kB fragment containing the gene of interest was observed. Five microliters of the original PCR reaction was utilized as template for additional amplification using the same primers and conditions described above. The 4.7 kB amplification products from independent amplification reactions were isolated by agarose gel electrophoresis, cloned into the PCR 2.1 cloning vector using the TOPO TA cloning kit (Invitrogen), then transformed into an E. coli host. Plasmid DNA was prepared from cultures grown from each colony, and then the inserts from 3 separate plasmid preparations were sequenced. Alignment of these sequences generated a consensus sequence highly homologous but not identical to the published GBSS gene, although no specific insert sequence was equivalent to the consensus. One clone (designated pCGN9480-2) had an insert sequence with the fewest sequence changes relative to the consensus. A clone containing the consensus was then generated by restriction enzyme-mediated excision of non-consensus sequence and religation with fragments containing the consensus sequence, obtained by digestion of the other clones or by PCR amplification from HOI001 genomic DNA. The consensus sequence, including 1.5 kB upstream of the transcription start site and approximately 300 base pairs downstream of the stop codon, is listed as SEQ ID NO: 1.

[0139] The GBSS gene from elite corn inbred line LH59 [SEQ ID NO: 7] was isolated using the procedures and primers described above, and cloned into the binary vector pMON68203. The resulting plasmid containing the LH59 GBSS is named pMON72510 (FIG. 5).

[0140] FIG. 1 shows the nucleic acid sequence alignment of the HOI001 GBSS [SEQ ID NO: 1] compared to the published sequence of Shure et al., supra [SEQ ID NO: 2] and the GBSS from LH59 [SEQ ID NO: 7], using the Omiga software package 2.0, (Accelrys Inc., San Diego, Calif.). The alignment shows there are the following polymorphisms unique to the HOI001 GBSS sequence, that is not found in either the LH59 GBSS sequence or the published sequence of Shure et al., supra:

[0141] 1. Single Nucleotide Polymorphisms: [0142] a. T>C at position 158 [0143] b. G>A at position 337 [0144] c. C>A at position 343 [0145] d. C>A at position 349 [0146] e. G>A at position 441 [0147] f. C>T at position 666 [0148] g. G>C at position 777 [0149] h. T>A at position 878 [0150] i. C>T at position 980 [0151] j. T>A at position 1210 [0152] k. C>T at position 1216 [0153] l. A>T at position 1450 [0154] m. T>C at position 1709 [0155] n. A>G at position 1720 [0156] o. T>A at position 1721 [0157] p. G>C at position 1722 [0158] q. C>T at position 1761 [0159] r. G>A at position 1836 [0160] s. C>T at position 1852 [0161] t. G>A at position 1953 [0162] u. C>T at position 2043 [0163] v. C>T at position 2109 [0164] w. C>G at position 2110 [0165] x. G>C at position 2115 [0166] y. A>T at position 2448 [0167] z. C>T at position 2454 [0168] aa. T>G at position 2609 [0169] bb. A>G at position 2929 [0170] cc. G>T at position 2933 [0171] dd. C>T at position 2946 [0172] ee. G>T at position 3875 [0173] ff. T>A at position 4008 [0174] gg. T>C at position 4018 [0175] hh. T>G at position 4023 [0176] ii. C>A at position 4025 [0177] jj. C>T at position 4169 [0178] kk. A>T at position 4225 [0179] ll. C>A at position 4562

[0180] 2. Insertions: [0181] a. Sequence g at position 632 [0182] b. Sequence atgc at position 1185-1189 [0183] c. Sequence tgcaccagcagc at position 1456-1467 [0184] d. Sequence atgca at position 1746-1750 [0185] e. Sequence catcaca at position 1868-1874 [0186] f. Sequence ct at position 2100-2101 [0187] g. Sequence ccat at position 2488-2491 [0188] h. Sequence tat position 3810-3812

[0189] 3. Deletions: [0190] a. Sequence cgt at position 288-290 [0191] b. Sequence aa at position 704-705 [0192] c. Sequence c at position 882 [0193] d. Sequence atccg at position 1139-1143 [0194] e. Sequence ctctctg at position 1256-1262 [0195] f. Sequence tc at position 1714-1715 [0196] g. Sequence tgcaactgcaaatgca at position 1917-1932 [0197] h. Sequence g or a at position 3790 [0198] i. Sequence cgagccaggggt(t or c)gaaggcgaggagatcgcgccgctcgccaagg agaacgtggccgcgccctgaagagttcggcct at position 4393-4467

[0199] FIG. 2 shows the alignment of the corresponding predicted amino acid sequences from the GBSS gene isolated from HOI001 [SEQ ID NO: 3], and the GBSS gene described in Shure et al., supra [SEQ ID NO: 4], respectively. The results indicate that there is a sequence of additional amino acid residues on the carboxy terminus of the HOI001 GBSS starting at approximately position 1441 and an area of non-alignment in the region of amino acid residue 55-60.

[0200] FIG. 3 shows the alignment of the corresponding predicted amino acid sequences from the Zea mays GBSS gene isolated from inbred LH59 [SEQ ID NO: 8], and the Zea mays granule bound starch synthase gene described in Shure et al., supra, respectively. The results indicate that there is a sequence of additional amino acid residues on the carboxy terminus of the HOI001 GBSS starting at approximately position 1441 and an area of non-alignment in the region of amino acid residue 55-60.

EXAMPLE 2

[0201] This example sets forth the construction of plant transformation vectors containing the sequences of the HOI001 GBSS and the GBSS from inbred line LH59, [SEQ ID NOs: 1 and 7, respectively].

[0202] The HOI001 GBSS sequence was cut from the consensus-corrected version of pMON9480-2 using the restriction enzyme EcoR1. The resulting 4.7 kb fragment was purified following the manufacturer's protocol for the Qiagen miniprep kit (Qiagen, Inc., Valencia, Calif.). The ends of the fragment were blunted following manufacturer's protocol in the Stratagene PCR polishing kit (Stratagene, Inc., La Jolla, Calif.). The fragment was then gel purified using the Qiagen Gel Extraction kit (Qiagen), and cloned into pMON68203, a binary vector for plant transformation. The binary vector, pMON68203, contains left and right borders for T-DNA transfer, a CaMV 35S promoter::nptII::nos 3' UTR plant selectable marker element (described in U.S. Pat. No. 6,255,560), and plant expression cassette sequences which include a 1.1 kb Z27 promoter (bp 19-1117 of Accession #S78780, Lopes et al., Mol. Gen. Genet., 247(5):603-613 (1995)) for endosperm expression, a corn hsp70 intron (base pairs 4-153 of the maize gene for heat shock 70 exon 2, Accession #X03679, Rochester et al., EMBO J., 5:451-458 (1986)), and a nos 3' UTR, (base pairs 2924-2671 of the Agrobacterium tumefaciens strain C58 Ti plasmid, Accession #AE009420, Wood et al., Science, 294:2317-2323 (2001)). The binary vector pMON68203 was digested with Stu1, dephosphorylated by incubating with shrimp alkaline phosphatase (Roche Applied Science, Indianapolis, Ind.) at 37.degree. C. for 60 minutes and ligated with the 4.7 kb gel purified fragment of the HOI001 GBSS, described above. The resulting plasmid was named pMON72506 (FIG. 4).

[0203] The GBSS from corn line LH59, [SEQ ID NO: 7], was similarly cloned into the binary vector pMON68203, to form pMON72510.

EXAMPLE 3

[0204] This example describes the transformation of corn with the HOI001 GBSS and the GBSS from corn line LH59, using the vectors described in Example 2.

[0205] The transformation vectors pMON72506 and pMON72510 were used to transform maize plants using the following procedure.

[0206] Corn plants are grown in a greenhouse under standard practices. Controlled pollinations were made. The ears of the plants are harvested when the resulting hybrid embryos were 1.5 to 2.0 mm in length, usually 10-15 days after pollination. After removing the husks, the kernels on the ears were surface-sterilized by spraying with or soaking in 80% ethanol.

[0207] The Agrobacterium strain ABI, and an Agrobacterium tumefaciens binary vector system were used for the transformations. Plasmids pMON72506 and pMON72510 were transformed into Agrobacterium tumefaciens according to methods well known in the art. Prior to inoculation of corn cells the Agrobacterium cells are grown overnight at room temperature in AB medium (Chilton et al., Proc. Nat. Acad. Sci. U.S.A., 71:3672-3676 (1974)) comprising appropriate antibiotics for plasmid maintenance and 200 .mu.M acetosyringone. Immediately prior to inoculation the Agrobacterium cells were pelleted by centrifugation, and resuspended in either CRN122 medium (2.2 g/L MS (Murashige and Skoog, Physiol. Plant, 15:473-497 (1962)) basal salts, 2 mg/L glycine, 0.5 g/L niacin, 0.5 g/l L-pyridoxine-HCl, 0.1 g/L thiamine, 115 mg/L L-proline, 36 g/L glucose, and 68.5 g/L sucrose, pH 5.4) or CRN347 medium (CRN122 medium except with 0.44 g/L MS salts, 10 g/L glucose, 20 g/L sucrose, and 100 mg/L ascorbic acid) containing 200 .mu.M acetosyringone and 20 .mu.M Ag NO.sub.3.

[0208] The immature maize embryos were excised from individual kernels, immersed in an Agrobacterium suspension, and incubated at room temperature for 5-15 minutes. The Agrobacterium solution is then removed, and the inoculated immature embryos were transferred scutellum-side up from inoculation CRN122 medium to co-cultivation CRN123 medium (CRN122 medium except with 0.5 mg/L additional thiamine-HCl, 20 g/L sucrose, 10 g/L glucose and 3 mg/L 2,4 D) containing 200 .mu.M acetosyringone and 20 .mu.M silver nitrate and incubated at 23.degree. C. for 1 day. Alternatively, excised embryos were cultured for 8-11 days in 211V medium (3.98 g/L Chu N6 salts (Chu, C.C., The N6 medium and its application to anther culture of cereal crops, in Plant Tissue Culture Plant Tissue Culture. Proceedings of the Pekinig Symposium, Boston, Mass. (1981), 43-50), 0.5 mg/L thiamine HCl, 0.5 mg/L nicotinic acid; 1.0 mg/L 2,4 D, 20 g/L sucrose, 0.69 g/L L-proline, 0.91 g/L L-asparagine monohydrate, 1.6 g/L MgCl.sub.2 hexahydrate, 0.1 g/L casein hydrolysate, 0.5 g/L MES, 0.1 g/L myo-inositol, and 16.9 mg/L silver nitrate, pH 5.8 solidified with 2 g/L Gelgro) and calli were inoculated with Agrobacterium CRN347 medium suspensions at 23.degree. C. for 3 days without adding additional media.

[0209] The embryos were then transferred to CRN220 selection medium (4.4 g/L MS salts, 1.3 mg/L nicotinic acid, 0.25 mg/L pyridoxine HCl, 0.25 mg/L thiamine HCl, 0.25 mg/L calcium pantothenate, 30 g/L sucrose, 12 mM proline, 0.05 g/L casamino acids, 500 mg/L carbenicillin, 200 mg/L paromomycin, 2.2 mg/L picloram, 0.5 mg/L 2,4 D and 3.4 mg/L silver nitrate, pH 5.6 solidified with 7 g/L Phytagar), or calli are transferred to CRN344 selection medium (3.98 g/L Chu N6 salts, 1.0 mg/L thiamine HCl, 0.5 mg/L nicotinic acid; 1.0 mg/L 2,4 D, 20 g/L sucrose, 0.69 g/L L-proline, 0.91 g/L L-asparagine monohydrate, 1.6 g/L MgCl.sub.2 hexahydrate, 0.1 g/L casein hydrolysate, 0.5 g/L MES, 0.1 g/L myo-inositol, 500 mg/L carbenicillin, 200 mg/L paromomycin and 16.9 mg/L silver nitrate, pH 5.8 solidified with 6 g/L Phytagar). After 2-3 weeks at 27.degree. C. in the dark, surviving tissues were transferred to the same selection medium and cultured for up to an additional 2 weeks or transferred to regeneration medium as described below.

[0210] Plant regeneration is achieved by transferring the putative transgenic callus from CRN220 to CRN232 medium (CRN220 medium lacking picloram, 2,4-D, and silver nitrate, and containing 3.52 mg/L benzylaminopurine (BAP) and 250 mg/L carbenicillin) or from CRN344 medium to 217A medium (211RTTV lacking silver nitrate, 2,4 D, and paromomycin and containing 3.52 mg/L BAP and 250 mg/L carbenicillin) and incubating for 5-7 days at 27.degree. C. Tissue is then transferred from CRN232 medium to CRN264 medium (4.4 g/L MS salts, 1.3 g/L nicotinic acid, 0.25 mg/L pyridoxine HCl, 0.25 mg/L thiamine HCl, 0.25 mg/L calcium pantothenate, 10 g/L glucose, 20 g/L maltose, 1 mM L-asparagine, 0.1 g/L myo-inositol, 250 mg/L carbenicillin and 100 mg/L paromomycin, pH 5.8 solidified with 6 g/L Phytagar) or from 217A medium to CRN346 medium (4.4 g/L MS salts, MS vitamins, 60 g/L sucrose, 0.05 g/L myo-inositol, 250 mg/L carbenicillin, 75 mg/L paromomycin, pH 5.8 solidified with 6 g/L KOH) in Phytatrays, and incubated in the light at 28.degree. C. until shoots with well-developed roots were produced (typically 2-3 weeks). These developing plantlets were then transferred to soil, hardened off in a growth chamber at 27.degree. C., 80% humidity, and low light intensity for approximately 1 week, and then transferred to a greenhouse and the R0 plants were grown under standard greenhouse conditions. The R0 plants were reciprocally crossed and both immature/developing kernels and mature kernels were collected from each of the resulting plants for subsequent analyses. The results of the analyses are described below in Example 6.

[0211] These developing plantlets were then transferred to soil, hardened off in a growth chamber at 27.degree. C., 80% humidity, and low light intensity for approximately 1 week, and then transferred to a greenhouse. The R0 plants were then grown under standard greenhouse conditions. Fertile R0 plants were crossed to a non-transgenic recurrent inbred, with the R0 plant serving as either the female or male (or occasionally both) in the cross. Both developing and mature F1 kernels were collected and analyzed, from each of the resulting ears as described in Example 4. The results of the analyses are reported below in Example 5.

EXAMPLE 4

[0212] This example provides the analytical procedures to determine oil, protein, and starch levels in kernels from transgenic plants containing the HOI001 GBSS gene or the LH59 GBSS gene.

[0213] Oil Content Analysis: Oil levels (on a mass basis and as a percent of tissue weight) of first generation single corn kernels and dissected germ and endosperm are determined by low-resolution .sup.1H nuclear magnetic resonance (NMR) (Tiwari et al., JAOCS, 51:104-109 (1974); or Rubel, JAOCS, 71:1057-1062 (1994)), whereby NMR relaxation times of single kernel samples are measured, and oil levels are calculated based on regression analysis using a standard curve generated from analysis of corn kernels with varying oil levels as determined gravimetrically following accelerated solvent extraction.

[0214] To compare oil analyses of transgenic and non-transgenic kernels, the presence or absence of the transgene is determined by detection (or lack thereof) of a transgene-specific 517 bp PCR product, using a sequence within the Hsp70 intron as a 5' primer, and a sequence within the HOI001 GBSS gene as a 3' primer: TABLE-US-00003 5' primer (primer number 19056): [SEQ ID NO: 16] 5'-ATCTTGCTCGATGCCTTCTC-3', 3' primer (primer number 18986): [SEQ ID NO: 17] 5'-GCCTTCGCTTGTCGTGGGT-3'.

[0215] Oil levels in advanced generation seed are determined by NIT spectroscopy, whereby NIT spectra of pooled seed samples harvested from individual plants are measured, and oil levels are calculated based on regression analysis using a standard curve generated from analysis of corn kernels with varying oil levels, as determined gravimetrically following accelerated solvent extraction or elemental (% N) analysis, respectively.

[0216] One-way analysis of variance and the Student's T-test are performed to identify significant differences in oil (% kernel weight) between seed from marker positive and marker negative plants.

[0217] Alternatively, oil levels of pooled kernels from single ears are determined by low-resolution .sup.1H nuclear magnetic resonance (NMR) (Tiwari et al., JAOCS, 51:104-109 (1974); or Rubel, JAOCS, 71:1057-1062 (1994)), whereby NNM relaxation times of pools of kernels are measured, and oil levels are calculated based on regression analysis using a standard curve generated from analysis of corn kernels with varying oil levels as determined gravimetrically following accelerated solvent extraction.

[0218] Protein Analyses: For kernel protein analysis, small bulk samples consisting of 50-100 kernels for each treatment are measured using near infrared reflectance spectroscopy (InfraTec model 1221, Teccator, Hogannas Sweden). This procedure is based upon the observation that a linear relation exists between the absorption of near infrared radiation and the quantity of chemical constituents comprised in a typical grain sample. Prior to analyzing unknown samples, spectral data is collected with calibration samples that are subsequently analyzed using a nitrogen combustion analysis technique (Murray, I., and P. C. Williams, 1987, Chemical Principles of Near-infrared Technology, In Near-Infrared Technology in the Agricultural and Food Industries, P. Williams and K. Norris eds.). A multivariate model is developed using the spectral data from the spectrometer and the primary data. In the present case a PLS-1 (Partial Least Squares Regression Type I) multivariate model is constructed using 152 calibration samples. Each unknown sample is scanned on the spectrometer at least 5 times and its protein content predicted with each scan. Each time the sample is scanned it is added back to the sample cuvette to minimize multiplicative scattering effects, which are not correlated to chemical property of interest. The predicted starch is averaged for the multiple scans and then reported for each sample.

[0219] Starch analyses: For kernel starch analysis, small bulk samples consisting of 50-100 kernels for each treatment are measured using near infrared reflectance spectroscopy (InfraTec model 1221, Teccator, Hogannas Sweden). This procedure is based upon the observation that a linear relation exists between the absorption of near infrared radiation and the quantity of chemical constituents comprised in a typical grain sample. Prior to analyzing unknown samples, spectral data is collected with calibration samples that are subsequently analyzed using standard wet chemistry analytical techniques (Murray, I., and P. C. Williams, 1987, Chemical Principles of Near-infrared Technology, In Near-Infrared Technology in the Agricultural and Food Industries, P. Williams and K. Norris eds.). A multivariate model is developed using the spectral data from the spectrometer and the primary data. Each unknown sample is scanned on the spectrometer at least 5 times and its starch content predicted with each scan. Each time the sample is scanned it is added back to the sample cuvette to minimize multiplicative scattering effects, which are not correlated to chemical property of interest. The predicted starch is averaged for the multiple scans and then reported for each sample.

EXAMPLE 5

[0220] This example describes the analysis of kernels from plants transformed with the HOI001 GBSS and the GBSS from LH59.

[0221] Kernels from a total of 54 transgenic events expressing the HOI001 GBSS transgenic allele were analyzed using the procedures set forth in Example 4. Table 1 shows whole kernel oil levels of transgenic (positive) and nontransgenic (negative) F1 kernels from ears of 20 transgenic events analyzed by the single kernel NMR procedure described in Example 4. Only results from events with a statistically significant increase in oil (p<0.05) are shown.

[0222] The results demonstrate that transgenic kernels from ears of 20 of the 54 events had statistically significant increases in whole kernel oil content (% dry weight) relative to nontransgenic kernels on the same ear. No events had a statistically significant decrease in oil. TABLE-US-00004 TABLE 1 Positive Negative Pedigree n Mean n Mean Delta Prob > F ZM_S67336/LH172 12 4.22 12 3.19 1.03 0.0000 ZM_S66829/LH172 8 4.33 16 3.44 0.89 0.0013 ZM_S67359/LH172 12 4.36 12 3.52 0.84 0.0003 ZM_S71593/LH172 14 3.09 10 2.40 0.69 0.0258 ZM_S67345/LH172 8 3.31 15 2.67 0.64 0.0199 ZM_S67335/LH172 9 3.85 15 3.25 0.61 0.0000 ZM_S71577/LH172 4 3.50 20 2.92 0.59 0.0298 ZM_S66804/LH172 10 3.50 14 2.94 0.56 0.0017 ZM_S67348/LH172 6 3.76 16 3.20 0.56 0.0000 ZM_S67351/LH172 11 3.73 13 3.19 0.54 0.0173 ZM_S69437/LH172 9 3.70 15 3.16 0.54 0.0002 ZM_S67331/LH172 13 3.70 11 3.17 0.53 0.0026 ZM_S67330/LH172 12 3.90 12 3.43 0.47 0.0071 LH172/ZM_S71581 17 3.47 7 3.07 0.40 0.0004 ZM_S66805/LH172 11 3.76 13 3.37 0.39 0.0151 LH172/ZM_S69443 11 3.23 13 2.86 0.37 0.0243 LH172/ZM_S67360 11 3.34 13 2.98 0.36 0.0080 LH172/ZM_S67338 17 3.01 7 2.68 0.34 0.0287 ZM_S71569/LH172 11 2.99 12 2.74 0.25 0.0354 LH172/ZM_S66817 14 3.05 9 2.83 0.22 0.0391

[0223] Transgenic kernels from R0 plants pollinated by non-transgenic inbred pollen (for example, pedigree ZM_S67336/LH172, positive) had both a higher frequency of a significant oil increase (15/29 plants analyzed) and a higher average significant oil increase (0.61%) relative to kernels from non-transgenic inbred plants pollinated by transgenic pollen (for example, pedigree LH172/ZM_S66817, negative) from an R0 male parent (5/37 plants analyzed, 0.34% significant oil increase). These results suggest that the greater transgene dosage found in the endosperm of kernels from the R0 plants, due to maternal inheritance effects, may result in a greater increase in oil.

[0224] Similarly, kernels from a total of 15 transgenic events containing the LH59 GBSS transgenic allele were analyzed. None of the kernels from ears of any of the events had statistically significant increases in whole kernel oil content (% dry weight) relative to nontransgenic kernels on the same ear, indicating that the increase in oil was unique to the HOI001 GBSS allele.

EXAMPLE 6

[0225] This example describes the increase in oil levels obtained in transgenic F2 kernels from field-grown plants.

[0226] To ascertain the impact of the HOI001 GBSS gene on kernel oil levels of field-grown plants, 24-48 segregating F1 seed from each of 40 events were planted in a field nursery. Developing plants were screened for the presence of the transgenic cassette by a non-lethal kanamycin resistance assay, whereby an antibiotic solution (0.1% (w/v) kanamycin and 0.1% (w/v) paromomycin) is applied to the leaf surface and scored for the presence (nontransgenic) or absence (transgenic) of necrotic lesions 1 week after antibiotic application. Kernels were isolated from the ears of both transgenic plants and non-transgenic plants, and then were assayed for kernel oil, protein, and starch by Near-Infrared Transmittance Spectroscopy.

[0227] Table 2 shows the mean whole kernel oil levels and the increase in whole kernel oil levels (Delta) in ears from plants containing (positive) and lacking (negative) the transgenic cassette containing the selectable marker and the HOI001 GBSS gene. Oil levels were determined by the NIT procedure described in Example 4, and only events with a statistically significant increase in oil (p<0.05) are shown. TABLE-US-00005 TABLE 2 Positive Negative Event n Mean n Mean Delta Prob > F ZM_S67359 8 4.76 7 3.83 0.93 <.0001 ZM_S71546 5 5.42 3 4.50 0.92 0.0012 ZM_S67354 2 4.85 13 4.02 0.83 <.0001 ZM_S66817 3 4.37 1 3.70 0.67 0.0099 ZM_S71577 3 4.60 8 3.95 0.65 <.0001 ZM_S67343 5 4.42 4 3.78 0.65 0.0142 ZM_S71555 5 5.10 3 4.47 0.63 0.0343 ZM_S71551 9 4.69 6 4.07 0.62 0.041 ZM_S69437 3 4.57 8 3.95 0.62 0.0002 ZM_S66804 7 5.04 7 4.44 0.60 0.0016 ZM_S67338 7 4.30 6 3.73 0.57 0.0068 ZM_S71573 3 4.87 3 4.40 0.47 0.0405 ZM_S67331 4 4.35 12 3.92 0.43 0.0025 ZM_S71594 2 4.35 8 3.95 0.40 0.0037 ZM_S67340 12 4.04 11 3.72 0.32 0.0115 ZM_S66800 1 4.30 8 3.95 0.30 0.0398

[0228] The results show whole kernel oil level was increased in transgenic ears relative to nontransgenic ears (p<0.05) in 16 out of 36 events analyzed.

[0229] Table 3 shows the mean kernel starch levels (%) and the change in kernel starch levels in ears from plants containing (positive) and lacking (negative) the transgenic cassette containing the selectable marker and the HOI001 GBSS gene. Only events with a statistically significant increase in oil (p<0.05) are shown.

[0230] Table 4 shows mean kernel protein levels (%) and the change in kernel protein levels in ears from plants containing (positive) and lacking (negative) the transgenic cassette containing the selectable marker and the HOI001 GBSS gene. Only events with a statistically significant increase in oil (p<0.05) are shown.

[0231] Based on NIT analysis, starch levels in events with increases in oil were lowered slightly (Table 3), and protein levels were mostly unchanged (Table 4). TABLE-US-00006 TABLE 3 Positive Negative Event n Mean n Mean Delta Prob > F ZM_S67359 8 69.15 7 70.94 -1.79 0.0004 ZM_S71546 5 70.10 3 71.33 -1.23 0.0451 ZM_S67354 2 69.50 13 71.12 -1.62 0.0024 ZM_S66817 3 69.73 1 70.00 -0.27 0.5286 ZM_S71577 3 71.47 8 71.40 0.07 0.8702 ZM_S67343 5 69.92 4 71.20 -1.28 0.0187 ZM_S71555 5 70.30 3 71.23 -0.93 0.118 ZM_S71551 9 70.49 6 71.23 -0.74 0.117 ZM_S69437 3 69.77 8 71.40 -1.63 0.0018 ZM_S66804 7 71.19 7 72.27 -1.09 0.0073 ZM_S67338 7 69.81 6 71.25 -1.44 0.0009 ZM_S71573 3 69.77 3 70.67 -0.90 0.1352 ZM_S67331 4 70.10 12 71.23 -1.13 0.0026 ZM_S71594 2 70.75 8 71.40 -0.65 0.1906 ZM_S67340 12 70.82 11 71.35 -0.53 0.0257 ZM_S66800 1 70.50 8 71.40 -0.90 0.16

[0232] TABLE-US-00007 TABLE 4 Positive Negative Event n Mean n Mean Delta Prob > F ZM_S67359 8 11.84 7 11.29 0.55 0.2942 ZM_S71546 5 12.86 3 12.30 0.56 0.1775 ZM_S67354 2 12.55 13 12.07 0.48 0.2885 ZM_S66817 3 12.70 1 14.40 -1.70 0.1336 ZM_S71577 3 9.50 8 12.03 -2.53 0.0005 ZM_S67343 5 11.46 4 11.40 0.06 0.929 ZM_S71555 5 12.58 3 12.43 0.15 0.6995 ZM_S71551 9 12.23 6 11.97 0.27 0.4938 ZM_S69437 3 11.80 8 12.03 -0.23 0.7384 ZM_S66804 7 12.14 7 11.07 1.07 0.0304 ZM_S67338 7 12.26 6 11.87 0.39 0.452 ZM_S71573 3 11.37 3 11.30 0.07 0.8416 ZM_S67331 4 12.18 12 12.23 -0.05 0.9133 ZM_S71594 2 11.80 8 12.03 -0.23 0.6682 ZM_S67340 12 11.44 11 11.28 0.16 0.578 ZM_S66800 1 11.40 8 12.03 -0.63 0.412

EXAMPLE 7

[0233] This example describes the increase in oil levels obtained in transgenic F2 hybrid kernels from field-grown plants.

[0234] To ascertain the impact of the HOI001 GBSS gene on kernel oil levels of hybrid field-grown plants, 24-48 segregating F1 seed from each of 14 events, having sufficient seed, were planted in a field nursery. Developing plants were screened for the presence of the transgenic cassette by the non-lethal Kanamycin resistance assay, described above in Example 6. Pollen from transgenic plants was used to pollinate the stiff-stalk inbred LH244. The segregating F1 transgenic seed generated was then planted and the resultant plants were screened for the presence of the transgene by the non-lethal Kanamycin resistance assay. F2 hybrid kernels were isolated from ears from transgenic plants and non-transgenic plants, and assayed for kernel oil by Nuclear Magnetic Resonance Spectroscopy, as described in Example 4.

[0235] Table 5 shows mean whole kernel oil levels and the increase (Delta) in whole kernel oil levels in ears from hybrid plants containing (positive) and lacking (negative) the transgenic cassette containing the selectable marker and the HOI001 GBSS gene. Oil levels were determined by the bulk set NMR procedure described in Example 4, and only events with a statistically significant increase in oil (p<0.05) are shown. The data indicate that whole kernel oil level was increased in transgenic ears relative to nontransgenic ears (p<0.05) in 9 out of 14 events analyzed. TABLE-US-00008 TABLE 5 Positive Negative Event n Mean n Mean Delta Prob > F ZM_S67354 6 4.43 1 3.00 1.43 0.0431 ZM_S67346 9 3.90 8 3.13 0.78 0.0003 ZM_S71546 9 4.24 5 3.58 0.66 0.0016 ZM_S71556 10 4.12 4 3.48 0.65 0.0044 ZM_S71577 8 3.94 6 3.30 0.64 0.0047 ZM_S71594 10 3.91 7 3.30 0.61 0.0041 ZM_S71573 10 4.02 6 3.42 0.60 0.0001 ZM_S67343 10 3.72 4 3.15 0.57 0.0009 ZM_S67331 8 3.80 3 3.40 0.40 0.0281

EXAMPLE 8

[0236] This example describes the elevation of GBSS activity in corn endosperm tissue expressing the HOI001 GBSS gene.

[0237] Developing ears from F1 plants screened for the presence of the transgenic cassette by the non-lethal Kanamycin resistance assay were harvested and immediately frozen at 24 days post pollination. Segregating F2 kernels were removed from the ear, then dissected into germ and endosperm fractions. Individual dissected kernels were identified as transgenic or nontransgenic by screening for the ability to PCR-amplify a portion of the transgenic cassette from genomic DNA isolated from individual germs using transgene-specific primers, as described in Example 4. For each of six events, approximately 10 endosperms from the corresponding transgenic and nontransgenic kernels were pooled separately.

[0238] Each endosperm pool was ground to a fine powder with a mortar and pestle under liquid nitrogen, and starch granules were isolated in triplicate according to the procedure of Shure et al., Cell, 35(1):225-233 (1983). Granule-bound starch synthase activity was assayed on the isolated granules using the method of Vos-Scheperkeuter et al., Plant Physiol., 82:411-416(1986).

[0239] Table 6 shows the granule-bound starch synthase activity (pmol/min/mg starch) in developing F2 endosperm containing or lacking the HOI001 GBSS transgenic cassette. Values shown are means and standard errors of triplicate assays. The data indicates that starch granules from transgenic kernels generally had elevated GBSS activity, indicating that the effect of the HOI001 allele on oil levels is not a function of reducing overall GBSS activity, but functions by the addition of an activity uniquely encoded by the HOI001 GBSS gene. TABLE-US-00009 TABLE 6 Trans- Non- genic transgenic Event Mean SE mean SE p > F S67338 585 41 455 15 0.0392 S67359 583 20 521 15 0.0665 S71546 635 16 540 23 0.0281 S66804 587 50 454 20 0.0702 S71551 588 9 574 11 0.3712 S71555 486 24 464 12 0.4641

EXAMPLE 9

[0240] This example describes the isolation and sequencing of the coding region of the GBSS cDNA from corn line HOI001.

[0241] mRNA was extracted from developing corn endosperm tissue from HOI001, 22 days after pollination, using a procedure adapted from Opsahl-Ferstad et al., Plant J., 12(10):235-246 (1997). Briefly, developing endosperm from 3 separate kernels was pooled, frozen in liquid nitrogen, and then pulverized with a mortar and pestle. Approximately 50 mg frozen powdered endosperm was extracted with 0.5 mL buffer (0.5 M LiCl, 10 mM EDTA, 5 mM dithiothreitol, 100 mM Tris-HCl, pH 8.0, 1% (w/v) SDS). This aqueous extract was then extracted with phenol:chloroform:isoamyl alcohol (25:24:21), and the organic fraction was discarded. Nucleic acids were precipitated from the aqueous fraction by addition of an equal volume of isopropyl alcohol followed by centrifugation. The resulting supernatant was discarded. The pellet containing the mRNA was washed twice with 70% ethanol, dried, and then resuspended in 50 .mu.L water containing 0.1% (v/v) diethylpyrocarbonate.

[0242] First-strand cDNA was synthesized from the isolated mRNA using the Clontech SMART cDNA synthesis system (BD Biosciences). This first-strand cDNA was used as template to amplify HOI001 GBSS cDNA sequences using primers containing an EcoRV restriction site followed by 18 bp of the predicted translational start site (5' primer) and a Sse83871 restriction site followed by 17 bp of the predicted 3' end up to the translation stop site (3' primer): TABLE-US-00010 5' Primer (primer number 20095): [SEQ ID NO: 9] 5'-GGATATCACCATGGCGGCTCTGGCCACG-3', 3' Primer (primer number 20092): [SEQ ID NO: 10] 5'-GTCCTGCAGGCTACACATACTTGTCCA-3'.

[0243] The resulting 1.8 kB amplification products from independent amplification reactions were isolated by agarose gel electrophoresis, cloned independently into the PCR 2.1 cloning vector using the TOPO TA cloning kit (Invitrogen), and then transformed into an E. coli host. Multiple colonies were isolated from each transformation, plasmid DNA was prepared from cultures grown from each colony, and then the insert in each plasmid preparation was sequenced. Alignment of these sequences generated a consensus sequence containing an open reading frame equivalent to that predicted to be encoded by the HOI001 GBSS gene, although no specific insert sequence was equivalent to the consensus. One clone (designated 7345705-10) had an insert sequence with single base pair deletion relative to the consensus. This clone was used to generate a plasmid (designated pMON81463) containing the consensus sequence by inserting the additional nucleotide using the Quick-Change mutagenesis kit (Stratagene). This sequence, representing the coding region of the HOI001 GBSS cDNA, is listed in SEQ ID NO: 11.

EXAMPLE 10

[0244] This example sets forth the construction of plant transformation vectors containing the HOI001 GBSS cDNA coding region [SEQ ID NO: 11], designed to obtain different levels, timing and spatial patterns of expression, and the subsequent transformation of corn.

[0245] A plant transformation vector containing the HOI001 GBSS coding region driven by a Z27 promoter was constructed. The HOI001 GBSS coding region was isolated from pMON81463 by restriction digest with EcoRV and Sse83871, and cloned into the binary vector pMON71274. This binary vector contains left and right borders for T DNA transfer; a rice Actin promoter::rice Actin intron::CP4::nos 3' UTR, plant selectable marker element; and plant expression cassette sequences which include a 1.1 kb Z27 promoter (bp 19-1117 of Genbank Accession #S78780, Lopes et al., Mol. Gen. Genet., 247(5):603 613 (1995)) for endosperm expression; a corn hsp70 intron (base pairs 4-153 of the maize gene for heat shock 70 exon 2, Genbank Accession #X03679, Rochester et al., EMBO J., 5:451-458 (1986)), and a globulin 3' UTR. The resulting plasmid was named pMON81464 (FIG. 7).

[0246] A second plant expression binary vector containing the wheat high molecular weight glutenin promoter (bp 2647-3895 of Genbank Accession X12928, version X12928.3, originally described in Anderson et al., Nucleic Acids Res., 17:461-462 (1989)) and the corn hsp70 intron, fused to the GBSS coding region, fused to the wheat HSP17n3' UTR (bp532-741 of Gen Bank Accession X13431, version X13431.1, McElvain and Spiker, Nucleic Acids Res., 17:1764 (1989)), was constructed. A sequence containing the wheat high molecular weight glutenin promoter fused to the corn hsp70 intron was amplified from an intermediary vector using 5' and 3' primers containing AscI and NotI restriction sites, respectively: TABLE-US-00011 5' Primer (primer number 21084): [SEQ ID NO: 12] 5'-GGCGCGCCGTCGACGGTATCGATAAGCTTGC-3', 3' Primer (primer number 21085): [SEQ ID NO: 13] 5'-GCGGCCGCCCGCTTGGTATCTGCATTACAATG-3'.

[0247] The amplification product, containing the promoter and intron fragment with the introduced restriction sites, was purified by agarose gel electrophoresis and cloned into pCR2.1 TOPO (Invitrogen) to generate a plasmid vector for E. coli transformation (pOP28). After transformation into an E. coli vector, plasmid DNA was isolated, digested with AscI and NotI, and the purified fragment was cloned into the binary vector pMON71274 to generate a vector (pOP29) containing a cassette with the wheat high molecular weight glutenin promoter fused to the corn HSP70 intron fused to the globulin 3' UTR. The HOI001 GBSS coding region was isolated by digestion of pMON81464 with NotI/Sse83871 and cloned into pOP29 to generate the binary vector pOP31 containing an expression cassette with the wheat high molecular weight glutenin promoter fused to the corn HSP70 intron fused to the HOI001 GBSS coding region fused to the globulin 3' UTR. The promoter/intron/HOI001 GBSS coding region fragment was then isolated from pOP31 by digestion with AscI/Sse83871 and then cloned into the plant binary vector pMON71290 containing a gene of interest cassette with the TR7 3' UTR to generate pOP35, containing an expression cassette with the wheat high molecular weight glutenin promoter fused to the corn HSP70 intron fused to the HOI001 GBSS coding region fused to the TR7 3' UTR. The promoter/intron/HOI001 GBSS coding region fragment was then isolated from pOP35 by digestion with AscI/Sse83871 and then cloned into the plant binary vector pMON67647, containing a gene of interest cassette with the wheat HSP17 3' UTR. The resulting plasmid contained an expression cassette with the wheat high molecular weight glutenin promoter fused to the corn HSP70 intron fused to the HOI001 GBSS coding region fused to the wheat HSP17 3' UTR. This plasmid, was named pMON68298, is shown in FIG. 8.

[0248] A third plant expression binary vector containing the promoter and 5' UTR of the HOI001 GBSS gene fused to the HOI001 GBSS coding region, fused to the corn globulin 3' UTR, was constructed. The HOI001 GBSS promoter and 5' UTR (which also contained the first predicted intron) was isolated by PCR amplification from pMON72506, using a 5' primer that contains the restriction site for PmeI: TABLE-US-00012 5' Primer (primer number 20362): [SEQ ID NO: 14] 5'-GATCGTTTAAACGTTCGTGTGGCAGATTCATC-3', 3' Primer (primer number 20363): [SEQ ID NO: 15] 5'-GACGTGGCCAGAGCCGCCATGCCGATTAATCCACTGCATAG-3'.

[0249] The amplification product, a fragment containing 1125 bp upstream of the predicted HOI001 GBSS translational start site and 20 bp of the predicted coding sequence from pMON72506 (corresponding to bp 17-1162 of SEQ ID NO: 1), was purified by agarose gel electrophoresis and cloned into pCR2.1 TOPO (Invitrogen) to generate pMON81466.

[0250] The HOI001 GBSS coding region was removed from pMON81463 and cloned into the vector pMON81466 to generate pMON81468, containing the HOI001 GBSS promoter/5' UTR fused to the HOI001 GBSS coding region, with 45 bp extraneous polylinker sequence between the promoter/UTR and coding region elements. This extraneous sequence was then deleted by digestion of pMON81468 with MluI to remove a 780 bp fragment spanning the extraneous sequence, then reannealing with the analogous 735 bp fragment (lacking the extraneous sequence), generating pMON81469. This 735 bp fragment was generated by digestion of pMON72506 with MluI and isolating the resulting fragment. This entire promoter/UTR/coding region sequence was then isolated from pMON81469 by digestion with PmeI and Sse83871, and cloned into the binary vector pMON71274 to generate the binary vector pMON81465. This vector contained an expression cassette with the promoter and 5' UTR of the HOI001 GBSS gene fused to the GBSS coding region fused to the corn globulin 3' UTR (FIG. 9).

[0251] These three plant transformation vectors are transformed into an elite corn inbred (LH244) (Corn States Hybrid Serv., LLC, Des Moines, Iowa). Briefly, ears containing immature embryos are harvested approximately 10 days after pollination and kept refrigerated at 4.degree. C. until use (up to 5 days post-harvest). The preferred embryo size for this method of transformation is .about.1.0-2.0 mm. This size is usually achieved 10 days after pollination inside the greenhouse with the growth conditions of an average temperature of 87.degree. F., day length of 14 hours with supplemental lighting supplied by GE 1000 Watt High Pressure Sodium lamps.

[0252] Immature embryos are isolated from surface sterilized ears and directly dropped into the prepared Agrobacterium cell suspension in a 1.5-mL microcentrifuge tube. The isolation lasts continuously for 15 minutes. The tube is then set aside for 5 minutes, resulting in a total inoculation time for individual embryos from 5 to 20 minutes. After the Agrobacterium cell suspension is removed using a fine tipped sterile transfer pipette, the immature embryos are transferred onto a co-culture medium (Table 7). The embryos are then placed on the medium with the scutellum side facing up. The embryos are cultured in a dark incubator (23.degree. C.) for approximately 24 hours.

[0253] The embryos are then transferred onto a modified MS medium (MSW50, Table 7) supplemented with 0.1 or 0.25 mM glyphosate and 250 mg/L carbenicillin to inhibit Agrobacterium in Petri dishes (100 mm.times.25 mm). The cultures are incubated in a dark culture room at 27.degree. C. for 2-3 weeks. All the callus pieces are then transferred individually onto the first regeneration medium (MS/6BA, Table 7) supplemented with the same levels of glyphosate. The cultures are grown on this medium and in a culture room with 16 hours light/8 hours dark photoperiod and 27.degree. C. for 5-7 days. They are then transferred onto the second 15 regeneration medium (MSOD, Table 7) in Petri dishes (100 mm.times.25 mm) for approximately 2 weeks. All the callus pieces with regenerating shoots and living tissue are transferred onto the same medium contained in phytatrays for shoots to grow further prior to being transferred to soil (approximately 2-4 weeks). The regeneration media (MS6BA and MSOD) are all supplemented with 250 mg/L carbenicillin and 0.1 or 0.25 mM glyphosate.

[0254] These developing plantlets are then transferred to soil, hardened off in a growth chamber at 27.degree. C., 80% humidity, and low light intensity for approximately 1 week, and then transferred to a greenhouse and grown under standard greenhouse conditions. The resulting kernels are collected and analyzed as described in Example 4. The results indicate that the different promoters have different impacts on oil accumulation based upon the strength and timing of the expression of the HOI001 GBSS coding region. TABLE-US-00013 TABLE 7 ,Composition of media used in corn transformation. Co-culture Component Media MSW50 MS/6BA MSOD MS salts 2.2 g/L 4.4 g/L 4.4 g/L 4.4 g/L Sucrose 20 g/L 30 g/L 30 g/L Maltose 20 g/L Glucose 10 g/L 10 g/L 1-Proline 115 mg/L 1.38 g/L 1.36 g/L Casamino 500 mg/L 50 mg/L Acids Glycine 2 mg/L 2 mg/L 1-Asparagine 150 mg/L Myo-inositol 100 mg/L 100 mg/L 100 mg/L Nicotinic acid 0.5 mg/L 0.5 mg/L 1.3 mg/L 1.3 mg/L Pyridoxin HCl 0.5 mg/L 0.5 mg/L 0.25 mg/L 0.25 mg/L Thiamine-HCl 0.5 mg/L 0.6 mg/L 0.25 mg/L 0.25 mg/L Ca 0.25 mg/L 0.25 mg/L Pantothenate 2,4-D 3 mg/L 0.5 mg/L Picloram Silver Nitrate 1.7 mg/L BAP 3.5 mg/L

[0255] Co-culture medium was solidified with 5.5 mg/l low EEO agarose. All other media were solidified with 7 g/l Phytagar for NPTII selection and with 3 g/l phytagel for glyphosate selection.

EXAMPLE 11

[0256] This example sets forth the use of the polymorphisms in the HOI001 GBSS gene as molecular markers to accelerate incorporation of HOI001 GBSS sequence polymorphisms into other corn germplasm with the result of increasing oil in the kernel.

[0257] The present invention provides a corn plant with increased kernel oil selected for by use of marker assisted breeding wherein a population of plants are selected for the presence of a polymorphism sequence unique to the HOI001 GBSS gene (SEQ ID NO: 1). Example 1, above, lists polymorphisms unique to the HOI001 GBSS sequence, that is not found in either the LH59 GBSS sequence or the published sequence (Shure et al., supra).

[0258] The selection of plants having the HOI001 GBSS gene for high oil comprises probing genomic DNA of the resulting plants, through the selection process, for the presence of the molecular marker for the HOI001 GBSS gene. The molecular marker is a DNA molecule representing a unique polymorphism in the HOI001 GBSS gene that functions as a probe or primer to a target HOI001 GBSS in a plant genome. The selected polymorphism may or may not be from a coding region of the gene. The plants containing the HOI001 GBSS gene are continued in the breeding and selection process.

Sequence CWU 1

1

17 1 4470 DNA Zea mays 1 aattcgccct ttcagccgtt cgtgtggcag attcatctgt tgtctcgtct cctgtgcttc 60 ctgggtagct tgtgcagtgg agctgacatg gtctgagcag gcttaaaatt tgctcgtaga 120 cgaggagtac cagcacagca cgttgcggat ttctctgcct gtgaagtgca acgtctagga 180 ttgtcacacg ccttggtcgc gtcgatgcgg tggtgagcag agcagcaaca gctgggcgac 240 ccaaagttgg attccgtgtc ttcgtcgtac gtacgcgcgc gccggggaca cgcagagagc 300 ggagagcgag ccgtgcacgg ggaggtggtg tggaagtgaa gccgcgcgcc cggccgcccg 360 cgcccggtgg gcaacccaaa agtacccacg acaagcgaag gcgccaaagc gatccaagct 420 ccggaacgca tcagccacaa gcagccgaga accgaaccgg tgggcgacgc gtcgtgggac 480 ggacgcgggc gacgcttcca aacggggcca cgtacgccgg cgtgtgcgtg tgtgcgtgca 540 gacgacaagc caaggcgagg cagcccccga tcgggaaaag cgtcaagtag gtgcgccggg 600 ctttggcttt gggcgcgagc gctggcgtgc gggtcagtcg ctggtgcgca gtgccggggc 660 gaacgggtat cgtggggggc gcgggcggag gagagcgtgg cgagggccga gagcagcgcg 720 cggccgggtc acgcaacgcg ccccacgtac agcctccccc tccgcgcgcg ctagaaatac 780 cgaggcctgg accgggggcc ccccggcaca tccatccatc gaccgatcga tcgatcgcca 840 cagccaacat cacccgccga ggcgacgcga cagccgccag gaggaaggaa taaactcact 900 gccagccagt gaagggggag aagtgtactg ctccgtcgac cagtgcgcgc accgcccggc 960 agggctgctc atctcgtcga cgaccaggtt ccgttccgtt ccgatccgat cctgtccttg 1020 agtttcgtcc agatcctggc gtgtatctgc atgcgtgttt gatgatccag gttcatcgaa 1080 tctaaatctg tccgtgcaca tgtcttctct ctctctgtct gctatgcagt ggattaatcg 1140 gcatggcggc tctggccacg tcgcagctcg tcgcaacgcg cgccggcctg ggcgtcccgg 1200 acgcgtccac gttccgccgc ggcgccgcgc agggcctgag gggggcccgg gcgtcggcgg 1260 cggcggacac gctcagcatg cggaccagcg cgcgcgcggc gcccaggctc cagctgcacc 1320 agcagcagca gcaggcgcgc cgcggggcca ggttcccgtc gctcgtcgtg tgcgccagcg 1380 ccggcatgaa cgtcgtcttc gtcggcgccg aggtggcgcc gtggagcaag accggcggcc 1440 tcggcgacgt cctcggcggc ctgccgccgg ccatggccgt aagcgcgcgc accgagacat 1500 gcatccgttg gatcgcgtct tcttcgtgct cttgccgcgt gcatgatgca tgtgtttcct 1560 cctggctcgt gtatgtgact gacgtgtgtg ttcgggcatg caatgcatgc aggcgaatgg 1620 gcaccgtgtc atggtcgtct ctccccgcta cgaccagtac aaggacgcct gggacaccag 1680 cgtcgtgtcc gaagtacggc caccgagatc agattcagat cacacatcac agtcacacac 1740 accgtcatat gaacctttct ctgctctgat gcctgcagat caagatggga gacaggtacg 1800 agacggtcag gttcttccac tgctacaagc gcggagtgga ccgcgtgttc gttgaccacc 1860 cactgttcct ggagagggtg agatgagatc tgatcactcg atacgcaatt accaccccat 1920 tgtaagcagt tacagtgagc cttttttttt gcccccgcct ggtcgctggt ttcaggtttg 1980 gggaaagacc gaggagaaga tctacgggcc tgtcgctgga acggactaca gggacaacca 2040 gctgcggttc agcctgctat gccaggtcag gatggcttgc tactacaact tcagatcatc 2100 tgtatgcagc agtatacacc gatgagaaat gcatgctgtt ctgcaggcag cacttgaagc 2160 tccaaggatc ctgagcctca acaacaaccc atacttctcc ggaccatacg gtaagagttg 2220 tagtcttcgt atatatatct gttgagctcg agaatcttca caggaaacgg cccatcagac 2280 ggactgtctt tttatactga ctactgctgc tgctcttcgt ccatccatcc atacaagggg 2340 aggacgtcgt gttcgtctgc aacgactggc acaccggccc tctctcgtgc tacctcaaga 2400 gcaactacca gtcccacggc atctacaggg acgcaaaggt tgccttctcg gaactgaaca 2460 acgccgtttt cgttctccat gctcgtatat acctcatctg gtggtggtgc ttctctgaaa 2520 ctgaaactga aactgactgc atgtctgtct gaccatcttc acgtactacc taccagaccg 2580 ctttctgcat ccacaacatc tcctaccagg gccggttcgc cttctccgac tacccggagc 2640 tgaacctccc cgagagattc aagtcgtcct tcgatttcat cgacgggtct gttttcctgc 2700 gtgcatgtga acattcatga acggtaaccc acaactgctc gcgtcctgct ggttcattat 2760 ctggccttga ttgcattgta gctacgagaa gcccgtggaa ggccggaaga tcaactggat 2820 gaaggccggg atcctcgagg ccgacagggt cctcaccgtc agcccctact acgccgagga 2880 gctcatctcc ggcatcgcca ggggctgcga gctcgacaac atcatgcgcc tcaccggcat 2940 caccggcatc gtcaacggca tggacgtcag cgagtgggac cccagcaggg acaagtacat 3000 cgccgtgaag tacgacgtgt cgacggtgag ctggctagct agctgattct gctgcctggt 3060 cctcctgctc atgctggttc ggttctgacg cggcaagtgt acgtacgtgc gtgcgacggt 3120 ggtgtggtgt ccggttcagg ccgtggaggc caaggcgctg aacaaggagg cgctgcaggc 3180 ggaggtcggg ctcccggtgg accggaacat cccgctggtg gcgttcatcg gcaggctgga 3240 agagcagaag ggccccgacg tcatggcggc cgccatcccg cagctcatgg agatggtgga 3300 ggacgtgcag atcgttctgc tggtacgtgt gcgccggccg ccacccggct actacatgcg 3360 tgtatcgttc gttctactgg aacatgcgtg tgagcaacgc gatggataat gctgcagggc 3420 acgggcaaga agaagttcga gcgcatgctc atgagcgccg aggagaagtt cccaggcaag 3480 gtgcgcgccg tggtcaagtt caacgcggcg ctggcgcacc acatcatggc cggcgccgac 3540 gtgctcgccg tcaccagccg cttcgagccc tgcggcctca tccagctgca ggggatgcga 3600 tacggaacgg tacgagagaa aaaaaaacat cctgaatcta tcctgacgag agggacagag 3660 acagattgat tatgaatgct tcatcgattt gaattgattg atctatgtct cccgctgcga 3720 ctcttgcagc cctgcgcctg cgcgtccacc ggtggactcg tcgacaccat catcgaaggc 3780 aagaccgggt tccacatggg ccgcctcagc gtcgacgtaa gcctacctct gccatgatct 3840 ttcttccttc tgtatgtatg tatgtatgta tgaatcagca ccgccattct tgtttcgtcg 3900 tcctctcttc ccagtgcaac gtcgtggagc cggcggacgt caagaaggtg gccaccacct 3960 tgcagcgcgc catcaaggtg gtcggcacgc cggtgtacga ggagatggtg aggaactgca 4020 tgatccagga tctctcctgg aaggtacgtt cgcccgcccc gccagagcag agcgccaaga 4080 tcgatcgatc gaccgaccac acgtacgcgc ctcgctcttg tcgctgaccg tggtttaatt 4140 tgcgaaatgc gcagggccct gccaagaact gggagaacgt gctgctcagc ctcggggtcg 4200 ccggcggtgc agggcccctg atctcgcgcg tggtgcaaag atgttgggac atcttcttat 4260 atatgctgtt tcgtttatgt gatatggaca agtatgtgta gatgcttgct tgtgctagtg 4320 taatgtagtg tagtggtggc cagtggcaca acctaataag cgcatgaact aattgcttgc 4380 gtgtgtagtt aagtaccgat cggtaatttt atattgcgag taaataaatg gacctgtagt 4440 ggtggagtaa ataatcccgc tgaaagggcg 4470 2 1818 DNA Zea mays 2 atggcggctc tggccacgtc gcagctcgtc gcaacgcgcg ccggcctggg cgtcccggac 60 gcgtccacgt tccgccgcgg cgccgcgcag ggcctgaggg gggcccgggc gtcggcggcg 120 gcggacacgc tcagcatgcg gaccagcgcg cgcgcggcgc ccaggcacca gcagcaggcg 180 cgccgcgggg gcaggttccc gtcgctcgtc gtgtgcgcca gcgccggcat gaacgtcgtc 240 ttcgtcggcg ccgagatggc gccgtggagc aagaccggcg gcctcggcga cgtcctcggc 300 ggcctgccgc cggccatggc cgcgaacggg caccgtgtca tggtcgtctc tccccgctac 360 gaccagtaca aggacgcctg ggacaccagc gtcgtgtccg agatcaagat gggagacggg 420 tacgagacgg tcaggttctt ccactgctac aagcgcggag tggaccgcgt gttcgttgac 480 cacccactgt tcctggagag ggtttgggga aagaccgagg agaagatcta cgggcctgtc 540 gctggaacgg actacaggga caaccagctg cggttcagcc tgctatgcca ggcagcactt 600 gaagctccaa ggatcctgag cctcaacaac aacccatact tctccggacc atacggggag 660 gacgtcgtgt tcgtctgcaa cgactggcac accggccctc tctcgtgcta cctcaagagc 720 aactaccagt cccacggcat ctacagggac gcaaagaccg ctttctgcat ccacaacatc 780 tcctaccagg gccggttcgc cttctccgac tacccggagc tgaacctccc ggagagattc 840 aagtcgtcct tcgatttcat cgacggctac gagaagcccg tggaaggccg gaagatcaac 900 tggatgaagg ccgggatcct cgaggccgac agggtcctca ccgtcagccc ctactacgcc 960 gaggagctca tctccggcat cgccaggggc tgcgagctcg acaacatcat gcgcctcacc 1020 ggcatcaccg gcatcgtcaa cggcatggac gtcagcgagt gggaccccag cagggacaag 1080 tacatcgccg tgaagtacga cgtgtcgacg gccgtggagg ccaaggcgct gaacaaggag 1140 gcgctgcagg cggaggtcgg gctcccggtg gaccggaaca tcccgctggt ggcgttcatc 1200 ggcaggctgg aagagcagaa gggccccgac gtcatggcgg ccgccatccc gcagctcatg 1260 gagatggtgg aggacgtgca gatcgttctg ctgggcacgg gcaagaagaa gttcgagcgc 1320 atgctcatga gcgccgagga gaagttccca ggcaaggtgc gcgccgtggt caagttcaac 1380 gcggcgctgg cgcaccacat catggccggc gccgacgtgc tcgccgtcac cagccgcttc 1440 gagccctgcg gcctcatcca gctgcagggg atgcgatacg gaacgccctg cgcctgcgcg 1500 tccaccggtg gactcgtcga caccatcatc gaaggcaaga ccgggttcca catgggccgc 1560 ctcagcgtcg actgtaacgt cgtggagccg gcggacgtca agaaggtggc caccacattg 1620 cagcgcgcca tcaaggtggt cggcacgccg gcgtacgagg agatggtgag gaactgcatg 1680 atccaggatc tctcctggaa gggccctgcc aagaactggg agaacgtgct gctcagcctc 1740 ggggtcgccg gcggcgagcc aggggtcgaa ggcgaggaga tcgcgccgct cgccaaggag 1800 aacgtggccg cgccctga 1818 3 620 PRT Zea mays 3 Met Ala Ala Leu Ala Thr Ser Gln Leu Val Ala Thr Arg Ala Gly Leu 1 5 10 15 Gly Val Pro Asp Ala Ser Thr Phe Arg Arg Gly Ala Ala Gln Gly Leu 20 25 30 Arg Gly Ala Arg Ala Ser Ala Ala Ala Asp Thr Leu Ser Met Arg Thr 35 40 45 Ser Ala Arg Ala Ala Pro Arg Leu Gln Leu His Gln Gln Gln Gln Gln 50 55 60 Ala Arg Arg Gly Ala Arg Phe Pro Ser Leu Val Val Cys Ala Ser Ala 65 70 75 80 Gly Met Asn Val Val Phe Val Gly Ala Glu Met Ala Pro Trp Ser Lys 85 90 95 Thr Gly Gly Leu Gly Asp Val Leu Gly Gly Leu Pro Pro Ala Met Ala 100 105 110 Ala Asn Gly His Arg Val Met Val Val Ser Pro Arg Tyr Asp Gln Tyr 115 120 125 Lys Asp Ala Trp Asp Thr Ser Val Val Ser Glu Ile Lys Met Gly Asp 130 135 140 Arg Tyr Glu Thr Val Arg Phe Phe His Cys Tyr Lys Arg Gly Val Asp 145 150 155 160 Arg Val Phe Val Asp His Pro Leu Phe Leu Glu Arg Val Trp Gly Lys 165 170 175 Thr Glu Glu Lys Ile Tyr Gly Pro Val Ala Gly Thr Asp Tyr Arg Asp 180 185 190 Asn Gln Leu Arg Phe Ser Leu Leu Cys Gln Ala Ala Leu Glu Ala Pro 195 200 205 Arg Ile Leu Ser Leu Asn Asn Asn Pro Tyr Phe Ser Gly Pro Tyr Gly 210 215 220 Glu Asp Val Val Phe Val Cys Asn Asp Trp His Thr Gly Pro Leu Ser 225 230 235 240 Cys Tyr Leu Lys Ser Asn Tyr Gln Ser His Gly Ile Tyr Arg Asp Ala 245 250 255 Lys Thr Ala Phe Cys Ile His Asn Ile Ser Tyr Gln Gly Arg Phe Ala 260 265 270 Phe Ser Asp Tyr Pro Glu Leu Asn Leu Pro Glu Arg Phe Lys Ser Ser 275 280 285 Phe Asp Phe Ile Asp Gly Tyr Glu Lys Pro Val Glu Gly Arg Lys Ile 290 295 300 Asn Trp Met Lys Ala Gly Ile Leu Glu Ala Asp Arg Val Leu Thr Val 305 310 315 320 Ser Pro Tyr Tyr Ala Glu Glu Leu Ile Ser Gly Ile Ala Arg Gly Cys 325 330 335 Glu Leu Asp Asn Ile Met Arg Leu Thr Gly Ile Thr Gly Ile Val Asn 340 345 350 Gly Met Asp Val Ser Glu Trp Asp Pro Ser Arg Asp Lys Tyr Ile Ala 355 360 365 Val Lys Tyr Asp Val Ser Thr Ala Val Glu Ala Lys Ala Leu Asn Lys 370 375 380 Glu Ala Leu Gln Ala Glu Val Gly Leu Pro Val Asp Arg Asn Ile Pro 385 390 395 400 Leu Val Ala Phe Ile Gly Arg Leu Glu Glu Gln Lys Gly Pro Asp Val 405 410 415 Met Ala Ala Ala Ile Pro Gln Leu Met Glu Met Val Glu Asp Val Gln 420 425 430 Ile Val Leu Leu Gly Thr Gly Lys Lys Lys Phe Glu Arg Met Leu Met 435 440 445 Ser Ala Glu Glu Lys Phe Pro Gly Lys Val Arg Ala Val Val Lys Phe 450 455 460 Asn Ala Ala Leu Ala His His Ile Met Ala Gly Ala Asp Val Leu Ala 465 470 475 480 Val Thr Ser Arg Phe Glu Pro Cys Gly Leu Ile Gln Leu Gln Gly Met 485 490 495 Arg Tyr Gly Thr Pro Cys Ala Cys Ala Ser Thr Gly Gly Leu Val Asp 500 505 510 Thr Ile Ile Glu Gly Lys Thr Gly Phe His Met Gly Arg Leu Ser Val 515 520 525 Asp Cys Asn Val Val Glu Pro Ala Asp Val Lys Lys Val Ala Thr Thr 530 535 540 Leu Gln Arg Ala Ile Lys Val Val Gly Thr Pro Val Tyr Glu Glu Met 545 550 555 560 Val Arg Asn Cys Met Ile Gln Asp Leu Ser Trp Lys Gly Pro Ala Lys 565 570 575 Asn Trp Glu Asn Val Leu Leu Ser Leu Gly Val Ala Gly Gly Ala Gly 580 585 590 Pro Leu Ile Ser Arg Val Val Gln Arg Cys Trp Asp Ile Phe Leu Tyr 595 600 605 Met Leu Phe Arg Leu Cys Asp Met Asp Lys Tyr Val 610 615 620 4 605 PRT Zea mays 4 Met Ala Ala Leu Ala Thr Ser Gln Leu Val Ala Thr Arg Ala Gly Leu 1 5 10 15 Gly Val Pro Asp Ala Ser Thr Phe Arg Arg Gly Ala Ala Gln Gly Leu 20 25 30 Arg Gly Ala Arg Ala Ser Ala Ala Ala Asp Thr Leu Ser Met Arg Thr 35 40 45 Ser Ala Arg Ala Ala Pro Arg His Gln Gln Gln Ala Arg Arg Gly Gly 50 55 60 Arg Phe Pro Ser Leu Val Val Cys Ala Ser Ala Gly Met Asn Val Val 65 70 75 80 Phe Val Gly Ala Glu Met Ala Pro Trp Ser Lys Thr Gly Gly Leu Gly 85 90 95 Asp Val Leu Gly Gly Leu Pro Pro Ala Met Ala Ala Asn Gly His Arg 100 105 110 Val Met Val Val Ser Pro Arg Tyr Asp Gln Tyr Lys Asp Ala Trp Asp 115 120 125 Thr Ser Val Val Ser Glu Ile Lys Met Gly Asp Gly Tyr Glu Thr Val 130 135 140 Arg Phe Phe His Cys Tyr Lys Arg Gly Val Asp Arg Val Phe Val Asp 145 150 155 160 His Pro Leu Phe Leu Glu Arg Val Trp Gly Lys Thr Glu Glu Lys Ile 165 170 175 Tyr Gly Pro Val Ala Gly Thr Asp Tyr Arg Asp Asn Gln Leu Arg Phe 180 185 190 Ser Leu Leu Cys Gln Ala Ala Leu Glu Ala Pro Arg Ile Leu Ser Leu 195 200 205 Asn Asn Asn Pro Tyr Phe Ser Gly Pro Tyr Gly Glu Asp Val Val Phe 210 215 220 Val Cys Asn Asp Trp His Thr Gly Pro Leu Ser Cys Tyr Leu Lys Ser 225 230 235 240 Asn Tyr Gln Ser His Gly Ile Tyr Arg Asp Ala Lys Thr Ala Phe Cys 245 250 255 Ile His Asn Ile Ser Tyr Gln Gly Arg Phe Ala Phe Ser Asp Tyr Pro 260 265 270 Glu Leu Asn Leu Pro Glu Arg Phe Lys Ser Ser Phe Asp Phe Ile Asp 275 280 285 Gly Tyr Glu Lys Pro Val Glu Gly Arg Lys Ile Asn Trp Met Lys Ala 290 295 300 Gly Ile Leu Glu Ala Asp Arg Val Leu Thr Val Ser Pro Tyr Tyr Ala 305 310 315 320 Glu Glu Leu Ile Ser Gly Ile Ala Arg Gly Cys Glu Leu Asp Asn Ile 325 330 335 Met Arg Leu Thr Gly Ile Thr Gly Ile Val Asn Gly Met Asp Val Ser 340 345 350 Glu Trp Asp Pro Ser Arg Asp Lys Tyr Ile Ala Val Lys Tyr Asp Val 355 360 365 Ser Thr Ala Val Glu Ala Lys Ala Leu Asn Lys Glu Ala Leu Gln Ala 370 375 380 Glu Val Gly Leu Pro Val Asp Arg Asn Ile Pro Leu Val Ala Phe Ile 385 390 395 400 Gly Arg Leu Glu Glu Gln Lys Gly Pro Asp Val Met Ala Ala Ala Ile 405 410 415 Pro Gln Leu Met Glu Met Val Glu Asp Val Gln Ile Val Leu Leu Gly 420 425 430 Thr Gly Lys Lys Lys Phe Glu Arg Met Leu Met Ser Ala Glu Glu Lys 435 440 445 Phe Pro Gly Lys Val Arg Ala Val Val Lys Phe Asn Ala Ala Leu Ala 450 455 460 His His Ile Met Ala Gly Ala Asp Val Leu Ala Val Thr Ser Arg Phe 465 470 475 480 Glu Pro Cys Gly Leu Ile Gln Leu Gln Gly Met Arg Tyr Gly Thr Pro 485 490 495 Cys Ala Cys Ala Ser Thr Gly Gly Leu Val Asp Thr Ile Ile Glu Gly 500 505 510 Lys Thr Gly Phe His Met Gly Arg Leu Ser Val Asp Cys Asn Val Val 515 520 525 Glu Pro Ala Asp Val Lys Lys Val Ala Thr Thr Leu Gln Arg Ala Ile 530 535 540 Lys Val Val Gly Thr Pro Ala Tyr Glu Glu Met Val Arg Asn Cys Met 545 550 555 560 Ile Gln Asp Leu Ser Trp Lys Gly Pro Ala Lys Asn Trp Glu Asn Val 565 570 575 Leu Leu Ser Leu Gly Val Ala Gly Gly Glu Pro Gly Val Glu Gly Glu 580 585 590 Glu Ile Ala Pro Leu Ala Lys Glu Asn Val Ala Ala Pro 595 600 605 5 36 DNA Artificial primer sequence 5 tcagccgttc gtgtggcaag attcatctgt tgtctc 36 6 38 DNA Artificial primer sequence 6 tcagcgggat tatttactcc accactacag gtccattt 38 7 4207 DNA Zea mays 7 gttcgtgtgg cagattcatc tgttgtctcg tctcctgtgc ttcctgggta gcttgtgtag 60 tggagctgac atggtctgag caggcttaaa atttgctcgt agacgaggag taccagcaca 120 gcacgttgcg gatttctctg cctgtgaagt gcaacgtcta ggattgtcac acgccttggt 180 cgcgtcgcgt cgatgcggtg gtgagcagag cagcaacagc tgggcggccc aacgttggct 240 tccgtgtctt cgtcgtacgt acgcgcgcgc cggggacacg cagcgagcgg agaacgagcc 300 gtgcacgggg gaggtggtgt gcaagtggag ccgcgcgccc ggccgcccgc gcccggtggg 360 caacccaaaa gtacccacga caagcgaagg cgccaaagcg atccaagctc cggaacgcat 420 cagccacaag cagccgagaa ccgaaccggt gggcgacgcg tcgtgggacg gacgcgggcg 480 acgcttccaa acgggccacg tacgccggcg tgtgcgtgcg tgcgtgcaga cgacaagcca 540 aggcgaggca gcccccgatc gggaaaaaag cgtcaagtag gtgcgccggg ctttggcttt 600 gggcgcgagc gctggcgtgc gggtcagtcg ctggtgcgca gtgccggggg gaacgggtat 660 cgtgggggcg cgggcggagg agagcgtggc gagggccgag agcagcgcgc ggccgggtca 720 cgcaacgcgc cccacgtact gccctccccc tccgcgcgcg ctagaaatac cgaggcctgg 780 accgggggcc ccccggcaca tccatccatc gaccgatcga tcgatcgcca cagccaacac 840 cacccgccga ggcgacgcga cagccgccag gaggaaggaa

taaactcact gccagccagt 900 gaagggggag aagtgtactg ctccgtcgac cagtgcgcgc accgcccggc agggctgctc 960 atctcgtcga cgaccaggtt ccgttccgtt ccgatcctgt ccttgagttt cgtccagata 1020 ctggcgtgta tctgcgtgtt tgatgatcca ggttcttcga acctaaatct gtccgtgcac 1080 atgtcctctc tctctctgtc tctctctgct atgcagtgga ttaatcggca tggcggctct 1140 ggccacgtcg cagctcgtcg caacgcgcgc cggcctgggc gtcccggacg cgtccacgtt 1200 ccgccgcggc gccgcgcagg gcctgagggg ggcccgggcg tcggcggcgg cggacacgct 1260 cagcatgcgg accagcgcgc gcgcggcgcc caggcaccag caccagcagg cgcgccgcgg 1320 ggccaggttc ccgtcgctcg tcgtgtgcgc cagcgccggc atgaacgtcg tcttcgtcgg 1380 cgccgagatg gcgccgtgga gcaagaccgg aggcctcggc gacgtcctcg gcggcctgcc 1440 gccggccatg gccgtaagcg cgcgcaccga gacatgcatc cgttggatcg cgtcttcttc 1500 gtgctcttgc cgcgtgcatg atgcatgtgt ttcctcctgg cttgtgttcg tgtatgtgac 1560 gtgtttgttc gggcatgcat gcaggcgaac gggcaccgtg tcatggtcgt ctctccccgc 1620 tacgaccagt acaaggacgc ctgggacacc agcgtcgtgt ccgaggtacg gccaccgaga 1680 ccagattcag atcacagtca cacacaccgt catgtgaacc tttctctgct ctgatgcctg 1740 caactgcaaa tgcatgcaga tcaagatggg agacgggtac gagacggtca ggttcttcca 1800 ctgctacaag cgcggagtgg accgcgtgtt cgttgaccac ccactgttcc tggagagggt 1860 gagacgagat ctgatcactc gatacgcaat taccacccca ttgtaagcag ttacagtgag 1920 ctttttttcc ccccggcctg gtcgctggtt tcaggtttgg ggaaagaccg aggagaagat 1980 ctacgggcct gtcgctggaa cggactacag ggacaaccag ctgcggttca gcctgctatg 2040 ccaggtcagg atggcttgct actacaactt cagatcatct gtatgcagca gtatacaccg 2100 atgagaaatg catgctgttc tgcaggcagc acttgaagct ccaaggatcc tgagcctcaa 2160 caacaaccca tacttctccg gaccatacgg taagagttgc tgctcttcgt ccatcagacg 2220 gactgtcatt ttacactgac tactgctgct gctcttcgtc catccataca aggggaggac 2280 gtcgtgttcg tctgcaacga ctggcacacc ggccctctct cgtgctacct caagagcaac 2340 taccagtccc acggcatcta cagggacgca aaggttgcct tctctgctga actgaacaac 2400 gccgccttcg ttctccatgc tcgtatatac ctcatctggt ggtggtgctt ctctgaaact 2460 gaaactgaaa ctgactgcat gtctgtctga ccatcttcac gtactaccta ccagaccgct 2520 ttctgcatcc acaacatctc ctaccagggc cggttcgcct tctccgacta cccggagctg 2580 aacctccccg agagattcaa gtcgtccttc gatttcatcg acgggtctgt tttcctgcgt 2640 gcatgtgaac attcatgaac ggtaacccac aactgttcgc gtcctgctgg ttcattatct 2700 gacctggatt gcattgcagc tacgagaagc ccgtggaagg ccggaagatc aactggatga 2760 aggccgggat cctcgaggcc gacagggtcc tcaccgtcag cccctactac gccgaggagc 2820 tcatctccgg catcgccagg ggctgcgagc tcgacaacat catgcgcctc accggcatca 2880 ccggcatcgt caacggcatg gacgtcagcg agtgggaccc cagcagggac aagtacatcg 2940 ccgtgaagta cgacgtgtcg acggtgagct ggctggctag ctgattctgc tgcctggtcc 3000 tcctgctcat gctggttcgg ttctgacgcg gcgagtgtac gtacgtgcgt gcgacggtgg 3060 tgtggtgtcc ggttcaggcc gtggaggcca aggcgctgaa caaggaggcg ctgcaggcgg 3120 aggtcgggct cccggtggac cggaacatcc cgctggtggc gttcatcggc aggctggaag 3180 agcagaaggg ccccgacgtc atggcggccg ccatcccgca gctcatggag atggtggagg 3240 acgtgcagat cgttctgctg gtacgtgtgc gccgcccgcc acccggctac tacatgcgtg 3300 tatcgttcta ctggaacata cgtgtgagca acgcgatgga taatgctgca gggcacgggc 3360 aagaagaagt tcgagcgcat gctcatgagc gccgaggaga agttcccagg caaggtgcgc 3420 gccgtggtca agttcaacgc ggcgctggcg caccacatca tggccggcgc cgacgtgctc 3480 gccgtcacca gccgcttcga gccctgcggc ctcatccagc tgcaggggat gcgatacgga 3540 acggtacgag agagaaaaaa aacatcctga atccctgacg agagggacag agacagattg 3600 attatgaatg cttcatcgat ttgaattgat tgatcgatgt ctcccgctgc gactcttgca 3660 gccctgcgcc tgcgcgtcca ccggtggact cgtcgacacc atcatcgaag gcaagaccgg 3720 gttccacatg ggccgcctca gcgtcgacgt aagcctacct ctgccatgtt ctttcttctt 3780 tctttctgta tgtatgtatg tatgtacgaa tcagcaccgc cattcttgtt tcgtcgtcct 3840 ctcttcccag tgcaacgtcg tggagccggc ggacgtcaag aaggtggcca ccaccttgca 3900 gcgcgccatc aaggtggtcg gcacgccggc gtacgaggag atggtgagga actgcatgat 3960 ccaggatctc tcctggaagg tacgtacgcc cgccccgcca gagcagagcg ccaagatcga 4020 tcgatcgacc gaccacacgt acgcgcctcg ctcttgtcgc tgaccgtggt ttaatttgcg 4080 aaatgcgcag ggccctgcca agaactggga gaacgtgctg ctcagcctcg gggtcgccgg 4140 cggcgagcca ggggttgaag gcgaggagat cgcgccgctc gccaaggaga acgtggccgc 4200 gccctga 4207 8 606 PRT Zea mays 8 Met Ala Ala Leu Ala Thr Ser Gln Leu Val Ala Thr Arg Ala Gly Leu 1 5 10 15 Gly Val Pro Asp Ala Ser Thr Phe Arg Arg Gly Ala Ala Gln Gly Leu 20 25 30 Arg Gly Ala Arg Ala Ser Ala Ala Ala Asp Thr Leu Ser Met Arg Thr 35 40 45 Ser Ala Arg Ala Ala Pro Arg His Gln His Gln Gln Ala Arg Arg Gly 50 55 60 Ala Arg Phe Pro Ser Leu Val Val Cys Ala Ser Ala Gly Met Asn Val 65 70 75 80 Val Phe Val Gly Ala Glu Met Ala Pro Trp Ser Lys Thr Gly Gly Leu 85 90 95 Gly Asp Val Leu Gly Gly Leu Pro Pro Ala Met Ala Ala Asn Gly His 100 105 110 Arg Val Met Val Val Ser Pro Arg Tyr Asp Gln Tyr Lys Asp Ala Trp 115 120 125 Asp Thr Ser Val Val Ser Glu Ile Lys Met Gly Asp Gly Tyr Glu Thr 130 135 140 Val Arg Phe Phe His Cys Tyr Lys Arg Gly Val Asp Arg Val Phe Val 145 150 155 160 Asp His Pro Leu Phe Leu Glu Arg Val Trp Gly Lys Thr Glu Glu Lys 165 170 175 Ile Tyr Gly Pro Val Ala Gly Thr Asp Tyr Arg Asp Asn Gln Leu Arg 180 185 190 Phe Ser Leu Leu Cys Gln Ala Ala Leu Glu Ala Pro Arg Ile Leu Ser 195 200 205 Leu Asn Asn Asn Pro Tyr Phe Ser Gly Pro Tyr Gly Glu Asp Val Val 210 215 220 Phe Val Cys Asn Asp Trp His Thr Gly Pro Leu Ser Cys Tyr Leu Lys 225 230 235 240 Ser Asn Tyr Gln Ser His Gly Ile Tyr Arg Asp Ala Lys Thr Ala Phe 245 250 255 Cys Ile His Asn Ile Ser Tyr Gln Gly Arg Phe Ala Phe Ser Asp Tyr 260 265 270 Pro Glu Leu Asn Leu Pro Glu Arg Phe Lys Ser Ser Phe Asp Phe Ile 275 280 285 Asp Gly Tyr Glu Lys Pro Val Glu Gly Arg Lys Ile Asn Trp Met Lys 290 295 300 Ala Gly Ile Leu Glu Ala Asp Arg Val Leu Thr Val Ser Pro Tyr Tyr 305 310 315 320 Ala Glu Glu Leu Ile Ser Gly Ile Ala Arg Gly Cys Glu Leu Asp Asn 325 330 335 Ile Met Arg Leu Thr Gly Ile Thr Gly Ile Val Asn Gly Met Asp Val 340 345 350 Ser Glu Trp Asp Pro Ser Arg Asp Lys Tyr Ile Ala Val Lys Tyr Asp 355 360 365 Val Ser Thr Ala Val Glu Ala Lys Ala Leu Asn Lys Glu Ala Leu Gln 370 375 380 Ala Glu Val Gly Leu Pro Val Asp Arg Asn Ile Pro Leu Val Ala Phe 385 390 395 400 Ile Gly Arg Leu Glu Glu Gln Lys Gly Pro Asp Val Met Ala Ala Ala 405 410 415 Ile Pro Gln Leu Met Glu Met Val Glu Asp Val Gln Ile Val Leu Leu 420 425 430 Gly Thr Gly Lys Lys Lys Phe Glu Arg Met Leu Met Ser Ala Glu Glu 435 440 445 Lys Phe Pro Gly Lys Val Arg Ala Val Val Lys Phe Asn Ala Ala Leu 450 455 460 Ala His His Ile Met Ala Gly Ala Asp Val Leu Ala Val Thr Ser Arg 465 470 475 480 Phe Glu Pro Cys Gly Leu Ile Gln Leu Gln Gly Met Arg Tyr Gly Thr 485 490 495 Pro Cys Ala Cys Ala Ser Thr Gly Gly Leu Val Asp Thr Ile Ile Glu 500 505 510 Gly Lys Thr Gly Phe His Met Gly Arg Leu Ser Val Asp Cys Asn Val 515 520 525 Val Glu Pro Ala Asp Val Lys Lys Val Ala Thr Thr Leu Gln Arg Ala 530 535 540 Ile Lys Val Val Gly Thr Pro Ala Tyr Glu Glu Met Val Arg Asn Cys 545 550 555 560 Met Ile Gln Asp Leu Ser Trp Lys Gly Pro Ala Lys Asn Trp Glu Asn 565 570 575 Val Leu Leu Ser Leu Gly Val Ala Gly Gly Glu Pro Gly Val Glu Gly 580 585 590 Glu Glu Ile Ala Pro Leu Ala Lys Glu Asn Val Ala Ala Pro 595 600 605 9 28 DNA Artificial primer sequence 9 ggatatcacc atggcggctc tggccacg 28 10 27 DNA Artificial primer sequence 10 gtcctgcagg ctacacatac ttgtcca 27 11 1863 DNA Zea mays 11 atggcggctc tggccacgtc gcagctcgtc gcaacgcgcg ccggcctggg cgtcccggac 60 gcgtccacgt tccgccgcgg cgccgcgcag ggcctgaggg gggcccgggc gtcggcggcg 120 gcggacacgc tcagcatgcg gaccagcgcg cgcgcggcgc ccaggctcca gctgcaccag 180 cagcagcagc aggcgcgccg cggggccagg ttcccgtcgc tcgtcgtgtg cgccagcgcc 240 ggcatgaacg tcgtcttcgt cggcgccgag atggcgccgt ggagcaagac cggcggcctc 300 ggcgacgtcc tcggcggcct gccgccggcc atggccgcga atgggcaccg tgtcatggtc 360 gtctctcccc gctacgacca gtacaaggac gcctgggaca ccagcgtcgt gtccgagatc 420 aagatgggag acaggtacga gacggtcagg ttcttccact gctacaagcg cggagtggac 480 cgcgtgttcg ttgaccaccc actgttcctg gagagggttt ggggaaagac cgaggagaag 540 atctacgggc ctgtcgctgg aacggactac agggacaacc agctgcggtt cagcctgcta 600 tgccaggcag cacttgaagc tccaaggatc ctgagcctca acaacaaccc atacttctcc 660 ggaccatacg gggaggacgt cgtgttcgtc tgcaacgact ggcacaccgg ccctctctcg 720 tgctacctca agagcaacta ccagtcccac ggcatctaca gggacgcaaa gaccgctttc 780 tgcatccaca acatctccta ccagggccgg ttcgccttct ccgactaccc ggagctgaac 840 ctccccgaga gattcaagtc gtccttcgat ttcatcgacg gctacgagaa gcccgtggaa 900 ggccggaaga tcaactggat gaaggccggg atcctcgagg ccgacagggt cctcaccgtc 960 agcccctact acgccgagga gctcatctcc ggcatcgcca ggggctgcga gctcgacaac 1020 atcatgcgcc tcaccggcat caccggcatc gtcaacggca tggacgtcag cgagtgggac 1080 cccagcaggg acaagtacat cgccgtgaag tacgacgtgt cgacggccgt ggaggccaag 1140 gcgctgaaca aggaggcgct gcaggcggag gtcgggctcc cggtggaccg gaacatcccg 1200 ctggtggcgt tcatcggcag gctggaagag cagaagggcc ccgacgtcat ggcggccgcc 1260 atcccgcagc tcatggagat ggtggaggac gtgcagatcg ttctgctggg cacgggcaag 1320 aagaagttcg agcgcatgct catgagcgcc gaggagaagt tcccaggcaa ggtgcgcgcc 1380 gtggtcaagt tcaacgcggc gctggcgcac cacatcatgg ccggcgccga cgtgctcgcc 1440 gtcaccagcc gcttcgagcc ctgcggcctc atccagctgc aggggatgcg atacggaacg 1500 ccctgcgcct gcgcgtccac cggtggactc gtcgacacca tcatcgaagg caagaccggg 1560 ttccacatgg gccgcctcag cgtcgactgc aacgtcgtgg agccggcgga cgtcaagaag 1620 gtggccacca ccttgcagcg cgccatcaag gtggtcggca cgccggtgta cgaggagatg 1680 gtgaggaact gcatgatcca ggatctctcc tggaagggcc ctgccaagaa ctgggagaac 1740 gtgctgctca gcctcggggt cgccggcggt gcagggcccc tgatctcgcg cgtggtgcaa 1800 agatgttggg acatcttctt atatatgctg tttcgtttat gtgatatgga caagtatgtg 1860 tag 1863 12 31 DNA Artificial primer sequence 12 ggcgcgccgt cgacggtatc gataagcttg c 31 13 32 DNA Artificial primer sequence 13 gcggccgccc gcttggtatc tgcattacaa tg 32 14 32 DNA Artificial primer sequence 14 gatcgtttaa acgttcgtgt ggcagattca tc 32 15 41 DNA Artificial primer sequence 15 gacgtggcca gagccgccat gccgattaat ccactgcata g 41 16 20 DNA Artificial primer sequence 16 atcttgctcg atgccttctc 20 17 19 DNA Artificial primer sequence 17 gccttcgctt gtcgtgggt 19

* * * * *

References

ncbi.n1m.nih.gov