Haploid Inducers

BOLDUAN; Christof ;   et al.

Patent Application Summary

U.S. patent application number 16/825490 was filed with the patent office on 2020-07-23 for haploid inducers. This patent application is currently assigned to KWS SAAT SE & CO. KGAA. The applicant listed for this patent is KWS SAAT SE & CO. KGAA. Invention is credited to Christof BOLDUAN, Monika KLOIBER-MAITZ, Markus NIESSEN, Milena OUZUNOVA, Fridtjof WELTMEIER.

Application Number20200229367 16/825490
Document ID /
Family ID54557390
Filed Date2020-07-23

United States Patent Application 20200229367
Kind Code A1
BOLDUAN; Christof ;   et al. July 23, 2020

HAPLOID INDUCERS

Abstract

The present invention relates to the provision of technical means such as nucleic acids which, after transcription or after expression in a plant, are suitable for mediating the property of a haploid inductor or for increasing the induction capability of a haploid inductor, as well as methods and uses for the production and identification of non-transgenic and transgenic plant haploid inductors, as well as the improvement of existing plant haploid inductors.


Inventors: BOLDUAN; Christof; (Einbeck, DE) ; KLOIBER-MAITZ; Monika; (Einbeck, DE) ; NIESSEN; Markus; (Hannover, DE) ; OUZUNOVA; Milena; (Gottingen, DE) ; WELTMEIER; Fridtjof; (Einbeck, DE)
Applicant:
Name City State Country Type

KWS SAAT SE & CO. KGAA

Einbeck

DE
Assignee: KWS SAAT SE & CO. KGAA
Einbeck
DE

Family ID: 54557390
Appl. No.: 16/825490
Filed: March 20, 2020

Related U.S. Patent Documents

Application Number Filing Date Patent Number
15526552 May 12, 2017 10631482
PCT/EP2015/076469 Nov 12, 2015
16825490

Current U.S. Class: 1/1
Current CPC Class: C12N 15/8218 20130101; A01H 1/06 20130101; C12N 9/90 20130101; C12N 9/20 20130101; A01H 6/4684 20180501; C12N 9/16 20130101; C12Q 2600/13 20130101; C12N 9/18 20130101; C12Q 1/6895 20130101; C07K 14/415 20130101; C12N 9/1007 20130101; C12N 15/8287 20130101; A01H 1/08 20130101
International Class: A01H 1/08 20060101 A01H001/08; C07K 14/415 20060101 C07K014/415; A01H 1/06 20060101 A01H001/06; A01H 6/46 20060101 A01H006/46; C12N 9/20 20060101 C12N009/20

Foreign Application Data

Date Code Application Number
Nov 12, 2014 DE 102014016667.8

Claims



1-18. (canceled)

19. A method for production of a plant which is suitable for use as a haploid inductor, the method comprising: mutagenizing plant cells; regenerating plants from the mutagenized plant cells; and identifying the regenerated plant which has at least one mutation in an endogenous DNA sequence which is identical to an isolated nucleic acid comprising a nucleotide sequence, which is (i) selected from the group consisting of SEQ ID Nos: 8, 9, 10 and 11, (ii) complementary to the sequence selected from the group consisting of SEQ ID Nos: 8, 9, 10 and 11, (iii) is at least 80% identical to the sequence selected from the group consisting of SEQ ID Nos: 8, 9, 10 and 11, (iv) hybridizes with a sequence at least 80% identical to the sequence selected from the group consisting of SEQ ID Nos: 8, 9, 10 and 11 under stringent conditions, or has at least one mutation in a regulatory sequence of the endogenous DNA sequence, which mutation produces a change in expression rate of the endogenous DNA sequence in the identified plant, in comparison to a non-mutagenized wild-type plant, or a change in the activity or stability of a protein or polypeptide encoded by the endogenous DNA sequence in the identified plant, in comparison to a non-mutagenized wild-type plant, wherein the at least one mutation causes a property of a haploid inducer to be mediated or the induction capability of a haploid inducer to be increased in the identified plant.

20. The method of claim 19, wherein the change in the expression rate of the endogenous DNA sequence in the identified plant is detected in a pollen or a tissue of a pollen of the identified plant.

21. The method of claim 19, wherein the change in the activity or stability of the protein or the polypeptide encoded by the endogenous DNA sequence is detected in a pollen or a tissue of a pollen of the identified plant.

22. The method of claim 19, wherein the plant comprises the at least one mutation in the regulatory sequence of the endogenous DNA sequence.

23. The method of claim 22, wherein the at least one mutation is the deletion of a nucleotide or the addition of a nucleotide.

24. The method of claim 22, wherein the regulatory sequence of the endogenous DNA sequence is a promoter.

25. The method of claim 24, wherein the promoter comprises a modification in a cis-regulatory element.

26. The method of claim 25, wherein the activity of the promoter is altered as compared to the activity of a promoter without the modification in the cis-regulatory element.

27. The method of claim 26, wherein the activity of the promoter is increased as compared to the activity of a promoter without the modification in the cis-regulatory element.

28. The method of claim 26, wherein the activity of the promoter is reduced as compared to the activity of a promoter without the modification in the cis-regulatory element.

29. The method of claim 19, wherein the at least one mutation is an exchange, addition or deletion of at least one nucleobase in the encoding region of the endogenous DNA sequence, and wherein the exchange, addition or deletion of the at least one nucleobase leads to an amino acid exchange in the encoded protein and produces an alteration in the activity or stability of the protein, in comparison to the wild-type protein.

30. The method of claim 19, wherein the identifying step comprises contacting a sample from the plant with a molecular marker, wherein the molecular marker is a DNA primer or a pair of DNA primers.

31. The method of claim 30, wherein a sample from the plant is contacted with the DNA primer or the pair of DNA primers in a polymerase chain reaction (PCR).

32. The method of claim 31, wherein the PCR is reverse transcription PCR (RT-PCR).

33. The method of claim 29, wherein the molecular marker demonstrates the presence or absence of the at least one mutation.

34. The method of claim 33, wherein the molecular markers are KASPar or TaqMan markers.

35. The method of claim 29, wherein the sample is a pollen sample.

36. The method of claim 31, wherein the DNA primer or the pair of DNA primers comprises a sequence for detection of the at least one mutation in a gene encoding for phospholipase.

37. The method of claim 31, wherein the DNA primer or the pair of DNA primers are specific for a transposon or a target gene.

38. The method of claim 31, wherein the at least one mutation in the endogenous DNA sequence is a point mutation.

39. A plant or plant part produced according to the method of claim 19.

40. A descendant of the plant according to claim 39.
Description



CROSS-REFERENCE TO RELATED APPLICATIONS

[0001] This application is a divisional of Ser. No. 15/526,552, filed on May 12, 2017, which is a U.S. National Phase of International Patent Application No. PCT/EP2015/076469, filed Nov. 12, 2015, which published as International Application No. WO 2016/075255 A1, on May 19, 2016 and claims priority to German Patent Application No. 102014016667.8, filed Nov. 12, 2014, all of which applications are hereby incorporated by reference in their entirety.

SEQUENCE LISTING

[0002] The instant application contains a Sequence Listing which has been submitted in ASCII format via EFS-Web and is hereby incorporated by reference in its entirety. Said ASCII copy, created on Apr. 11, 2017, is named KWS0220PCT_ST25 en.txt, and is 307,875 bytes in size.

FIELD OF THE INVENTION

[0003] The present invention relates to the field of the modification of plants by means of molecular biology methods and marker technology and genetic engineering. It relates to the provision of technical means such as nucleic acids and vectors, as well as methods and uses for the production and identification of non-transgenic and transgenic plant haploid inductors, and the improvement of existing plant haploid inductors.

BACKGROUND OF THE INVENTION

[0004] Typically, in the production of hybrid plants, two breeding lines as parents are crossed with one another, the descendants of which generate, in part, a strongly increased yield relative to the parent lines, due to the known heterosis effect. The breeding lines may be obtained via multiple selfing steps, which, however, takes multiple generations and therefore is connected with an enormous time cost. Modern plant breeding already increasingly transitioned many years ago to generating the breeding lines via haploid induction and the subsequent chromosome duplication in a much shorter amount of time. A technical requirement for this is a functioning haploid induction system, which also simultaneously promises a sufficient efficiency, in order to be economically usable.

[0005] For example, for maize (Zea mays), a maternal in vivo induction system is known in which the plants to be induced are pollinated with pollen of the inductor. Up to 10% of the descendants that are thereby generated then contain only the simple (haploid) chromosome set of the seed parent. A few such inductors are presently available for maize hybrid breeding. However, these are all to be ascribed to the single line "Stock 6," described by Coe, 1959. One example of such a known inductor is the RWS (Rober et al., 2005) line. In the past, multiple QTL studies for the identification of the inductor-relevant loci were conducted on these lines. A main-QTL at chromosome 1 (bin 1.04) in maize was already identified in 1997 by Deimling et al. More precise mapping was performed by Barret et al. 2008 in the range between 66.96 MB and 68.11 MB on chromosome 1, by Prigge et al. 2012 in the range between 62.9 MB and 70.8 MB, and following this by Dong et al. 2013 in the range between 68.18 MB and 68.43 MB which, according to public annotation, contains three genes. All position information refers to the reference genome of B73, Version AGPv02. The functionality of the locus appears to have already been demonstrated on its own by Dong et al. 2014 by achieving an induction rate of 5%. However, an incorrect fine mapping cannot be excluded, since no unambiguous delimitation of the QTL is possible due to the lack of information of flanking markers in the recurrent parent.

[0006] Furthermore, WO 2012/030893 discloses an inductor-relevant locus on chromosome 1 in maize that, however, differs markedly from the preceding locus and is localized in more detail at the telomere. There is no overlap in the genome regions considered.

[0007] Overall, the molecular and development-specific mechanisms, which the in vivo haploid induction in maize lines which resulted from "Stock 6," are largely unknown today. For example, it is conceivable that a fertilization occurs, but it subsequently leads to a chromosome elimination which then allows haploid descendants to emerge. For example, such a mechanism has been described by Ravi & Chan (2010) in a system with the histone protein CenH3. On the other hand, however, the fertilization may also fail, and the development of the haploid egg cells occurs in the triploid endosperm. Without the understanding of the underlying maternal in vivo haploid induction suitability of an inductor genotype derived from "Stock 6" and the knowledge about the responsible genes, a targeted improvement of this maize inductor genotype or the transfer of the induction suitability to non-inductor genotypes, or the targeted mediation of the in vivo haploid induction capability in maize non-inductors, is practically impossible.

[0008] Furthermore, for some cultivated plants, no efficiently (and, therefore, economically) applicable system for the production of haploid and double-haploid plants is known at all--for example, for sorghum, rye, or sunflower.

[0009] There is also a need for the provision of genetic elements such as genes or regulatory elements that are usable in transgenic and/or non-transgenic approaches, in order to enable haploid development, or an improved efficiency in haploid development, via in vivo induction.

SUMMARY OF THE INVENTION

[0010] The present invention was produced before the background of the prior art described in the preceding, wherein it is an object of the present invention to provide means and methods which may be used to produce an in vivo haploid inductor and/or to produce a haploid plant.

[0011] According to the invention, the achievement of the posed object takes place by means of a nucleic acid which, after transcription or expression in a plant, is suitable for mediating the property of a haploid inductor or for increasing the induction capability of a haploid inductor. The nucleic acid according to the invention may be used as a trans-gene. On the other hand, an endogenous DNA sequence in the genome of a plant, or in the genome of a plant haploid inductor, which is identical to one of the nucleic acids according to the invention, may also be modified such that the property of a haploid inductor is mediated, or the induction capability of the haploid inductor is increased, after transcription or expression of the endogenous DNA sequence. The nucleic acid of the present invention is preferably an isolated nucleic acid which is extracted from its natural or original environment (genetic context). A nucleic acid may be double-stranded or single-stranded, and linear or circular. It may thereby be genomic DNA, synthetic DNA, cDNA, or an RNA type (for example, lncRNA, siRNA, or miRNA), wherein the nucleobase uracil occurs in RNA instead of the nucleobase thymine.

[0012] In a preferred embodiment of the present invention, the nucleic acid according to the invention, or an RNA encoded from the nucleic acid, or a protein or polypeptide encoded from the nucleic acid, has an influence on the pollen tube growth in a plant, on the energy metabolism of a pollen of a plant, and/or on the activity of the centromere--preferably, in a generative cell which develops into a pollen, for example.

[0013] The nucleic acid according to the invention may be characterized in that the nucleic acid, or an RNA encoded from the nucleic acid, or a protein or polypeptide encoded from the nucleic acid, is suitable or may be used for accelerating or promoting the pollen tube growth (for example, in a pollen of a plant), in comparison with a pollen of a wild-type plant in which the nucleic acid according to the invention, or an RNA encoded from the nucleic acid, or a protein or polypeptide encoded from the nucleic acid, is used as described in the following. For example, the nucleic acid according to the invention encodes for a protein which is involved in the transport of macromolecules, or affects this transport, in the pollen tube of a pollen of a plant. Belonging to these are, for example, SNAREv proteins which, for example, mediate the transport of pectins or phospholipids, e.g., at the tip of the pollen tube (Kato et al., 2010). Furthermore, enzymes of the phospholipase class--especially, phospholipase A2 or patatin phospholipase--are in the position to promote the growth of the pollen tube (Kim et al., 2011), whereas enzymes of the inositol polyphosphate-5-phosphatase class, such as inositol-1,4,5-triphosphate-5-phosphatase, may inhibit the growth of the pollen tube (Wang et al., 2012). The nucleic acid according to the invention may be used as a transgene for the purpose of accelerated pollen tube growth, wherein it then--for example, by means of an over-expression approach--increases the expression rate of a pollen tube growth-promoting gene or the transcription rate of an RNA such as an lncRNA--which positively regulates (activates) a pollen tube growth-promoting gene or negatively regulates (inhibits) pollen tube growth-inhibiting genes--in a plant or a part thereof, in comparison to a wild-type plant or corresponding part thereof, and/or the expression rate of a pollen tube growth-inhibiting gene is reduced--by means of an RNAi approach or miRNA approach (Fire et al., 1998)--in a plant or a part thereof, in comparison to a wild-type plant or corresponding part thereof. On the other hand, an endogenous DNA sequence in the genome of a plant or in a genome of a plant haploid inductor which is identical to the nucleic acid according to the invention, or a regulatory sequence of the endogenous DNA sequence, may also be modified, e.g., via mutagenization or "genome editing." This modification may increase or reduce the transcription or expression rate of the endogenous DNA sequence, or the activity or stability of the protein or polypeptide encoded by the endogenous DNA sequence, in a plant, in comparison to a non-mutagenized wild-type plant. For example, the expression rate of an endogenous pollen tube growth-promoting gene, or the transcription rate of an endogenous RNA such as an lncRNA which positively regulates (activates) a pollen tube growth-promoting gene or negatively regulates (inhibits) a pollen tube growth-inhibiting gene, may thus be increased in a plant, in comparison to a non-mutagenized wild-type plant or a wild-type plant modified via "genome editing," or the expression rate of an endogenous pollen tube growth-inhibiting gene, or the transcription rate of an RNA such as an lncRNA which negatively regulates (inhibits) a pollen tube growth-promoting gene or positively regulates (activates) a pollen tube growth-inhibiting gene, may thus be reduced in a plant, in comparison to a non-mutagenized wild-type plant or a wild-type plant modified via "genome editing." Moreover, the activity or stability of a pollen tube growth-promoting protein or polypeptide encoded by the endogenous DNA sequence may be increased in a plant, in comparison to a non-mutagenized wild-type plant or a wild-type plant modified via "genome editing," or the activity or stability of a pollen tube growth-inhibiting protein or polypeptide encoded by the endogenous DNA sequence may be reduced in a plant, in comparison to a non-mutagenized wild-type plant or a wild-type plant modified via "genome editing."

[0014] In a further example, the nucleic acid according to the invention may be characterized in that, via the use of the nucleic acid, or of an RNA encoded by the nucleic acid, or of a protein or polypeptide encoded by the nucleic acid, the energy metabolism of a pollen in a plant may be negatively affected in comparison to a wild-type plant. For example, this may take place via a phosphoglycerate mutase, or a mitochondrial transporter or mitochondrial import receptor. For this purpose, the nucleic acid according to the invention may be used as a transgene in an over-expression approach, or in an RNAi approach, or in an miRNA approach (Fire et al., 1998). On the other hand, an endogenous DNA sequence in the genome of a plant or in a genome of a plant haploid inductor which is identical to the nucleic acid according to the invention, or a regulatory sequence of the endogenous DNA sequence, may also be modified, e.g., via mutagenization or "genome editing." This modification may increase or reduce the transcription or expression rate of the endogenous DNA sequence, or the activity or stability of the protein or polypeptide encoded by the endogenous DNA sequence, in the plant, in comparison to a non-mutagenized wild-type plant or wild-type plant modified via "genome editing."

[0015] In another example, the nucleic acid according to the invention may also be characterized in that, via the use of the nucleic acid or of an RNA encoded by the nucleic acid or of a protein or polypeptide encoded by the nucleic acid, the activity of the centromere in a plant is modified--in particular, in the early embryogenesis, and preferably in a generative cell of the plant which develops into a pollen, for example--in comparison to a wild-type plant, which may lead to the elimination of the inductor genome, for example. The activity of the centromere may be modified via chromatin modification of DNA or at the histone level--moreover, also via transcription, RNA interactions, or RNA binding. A change in the activity of the centromere may take place via a methyl transferase such as an RNA methyl transferase, for example. For this purpose, the nucleic acid according to the invention is used as a transgene, wherein it then increases--by means of an overexpression approach--the expression rate of a chromatin-modifying gene or the transcription rate of an RNA (such as an lncRNA), which positively regulates (activates) a chromatin-modifying gene in a plant, in comparison to a wild-type plant. On the other hand, an endogenous DNA sequence in the genome of a plant or in a genome of a plant haploid inductor, which is identical to the nucleic acid according to the invention, or a regulatory sequence of the endogenous DNA sequence, may also be modified, e.g., via mutagenization or "genome editing." This modification may increase or reduce the transcription or expression rate of the endogenous DNA sequence, or the activity or stability of the protein or polypeptide encoded by the endogenous DNA sequence, in a plant, in comparison to the non-mutagenized wild-type plant or wild-type plant modified via "genome editing." The expression rate of an endogenous chromatin-modifying gene or the transcription rate of an endogenous RNA (such as an lncRNA), which positively regulates (activates) a chromatin-modifying gene, may thus also be increased in a plant, in comparison to the non-mutagenized wild-type plant or wild-type plant modified via "genome editing." Moreover, the activity or stability of a chromatin-modified protein encoded by the endogenous DNA sequence may be increased in a plant, in comparison to the non-mutagenized wild-type plant or wild-type plant modified via "genome editing."

[0016] Uses of the nucleic acid according to the invention, or of an RNA encoded by the nucleic acid, or of a protein or polypeptide encoded by the nucleic acid, that are stated in the preceding are not exclusive or limiting, but rather are to be understood only as examples. Numerous additional technical means and methods are known to a person skilled in the art from the prior art, with which he may effect the above-described changes in the expression or transcription rate according to the invention of the nucleic acid or of the identical endogenous DNA sequence, or the above-described changes in the stability and activity of the protein or polypeptide encoded by the nucleic acid according to the invention or the endogenous DNA sequence.

[0017] In an especially preferred embodiment of the present invention, the nucleic acid which, after transcription or expression in a plant, is suitable for mediating the property of a haploid inductor or for increasing the induction capability of a haploid inductor may be a nucleic acid that includes a nucleotide sequence which

(i) is a sequence selected from SEQ ID Nos: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 42, 43, 46, 47, 49, 50, 52, 53, 55, 56, 57, 58, 59, 60, 61, and/or 62, or has a functional fragment of these, or (ii) is complementary to a sequence from (i), or (iii) is at least 80%, 82%, 84%, 86%, 88%--preferably, at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, or, especially preferably, at least 97%, 97.5%, 98%, 98.5%, 99%, or 99.5%--identical to a sequence from (i), or (iv) encodes for a protein with the amino acid sequence selected from SEQ ID Nos: 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 44, 45, 48, 51, 54, 63, 64, and/or 65, or a functional part of the protein, or (v) encodes for a homolog, analog, or ortholog of the protein according to (iv), or a functional part thereof, or (vi) hybridizes with a sequence from (ii) under stringent conditions.

[0018] This nucleic acid may encode for a protein or a functional portion thereof, wherein the protein or the functional portion thereof has the functionality of a SNARE protein--especially, of a SNAREv protein--of a phospholipase--especially, a phospholipase A2 or a patatin phospholipase--a methyl transferase--especially, an RNA methyl transferase or a mitochondrial import receptor (see Table 1). A use of the nucleic acid may take place as described above, i.e., in order to mediate the property of a haploid inductor or to increase the induction capability of a haploid inductor in a plant, is, for example, for transgenically or endogenously increasing the expression rate of the nucleic acid or the activity or stability of the encoded protein or of the encoded part of the protein. Since this nucleic acid, or an RNA encoded by the nucleic acid or a protein or polypeptide encoded by the nucleic acid, has a positive effect on the haploid induction capability of a plant, in the following, a nucleic acid which is defined here is designated as an induction-promoting nucleic acid. Additional methods and uses of the induction-promoting nucleic acid, as well as substances which comprise the induction-promoting nucleic acid, are disclosed further below.

[0019] In a further, especially preferred embodiment of the present invention, the nucleic acid which is suitable--after transcription or after expression in a plant--for mediating the property of a haploid inductor or for increasing the induction capability of a haploid inductor may be a nucleic acid that comprises a nucleotide sequence that

(i) has a sequence selected from SEQ ID Nos: 26, 27, 28, 29, 30, and/or 31, or a functional fragment thereof, or (ii) is complementary to a sequence from (i), or (iii) is at least 80%, 82%, 84%, 86%, 88%--preferably, at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, or, especially preferably, at least 97%, 97.5%, 98%, 98.5%, 99%, or 99.5%--identical to a sequence from (i). (iv) encodes for a protein with the amino acid sequence selected from SEQ ID Nos: 32, 33, and/or 34, or a functional part of the protein, or (v) encodes for a homolog, analog, or ortholog of the protein according to (iv), or a functional part thereof, or (vi) hybridizes with a sequence from (ii) under stringent conditions.

[0020] Such a nucleic acid may encode for a protein or a functional portion thereof, wherein the protein or the functional portion thereof has the functionality of an inositol polyphosphate-5-phosphatase--especially, of an inositol-1,4,5-triphosphate-5-phosphatase--or of a phosphoglycerate mutase (see Table 1). A use of the nucleic acid may take place as described above, i.e., in order to mediate the property of a haploid inductor or to increase the induction capability of a haploid inductor in a plant, is, for example, for transgenically or endogenously reducing the expression rate of the nucleic acid or the activity or stability of the encoded protein or of the encoded part of the protein. Since this nucleic acid, or an RNA encoded by the nucleic acid or a protein or polypeptide encoded by the nucleic acid, has a negative effect on the haploid induction capability of a plant, in the following, a nucleic acid which is defined here is designated as an induction-inhibiting nucleic acid. Additional methods and uses of the induction-inhibiting nucleic acid, as well as substances which comprise the induction-inhibiting nucleic acid, are disclosed further below.

[0021] In another especially preferred embodiment of the present invention, the nucleic acid which--after transcription or expression in a plant--is suitable for mediating the property of a haploid inductor or for increasing the induction capability of a haploid inductor may be a nucleic acid that encodes for an RNA that has a double-stranded portion, wherein at least one strand of the double-stranded portion has a nucleotide sequence which is homologous or identical to at least 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, or 25--preferably, to at least 30, 35, 40, 45, 50, 60, 70, 80, 90, 100, 120, or 140, and, especially preferably, to at least 160, 180, 200, 250, 300, 350, 400, 450, 500, 600, 700, 800, 900, or 1000--successive nucleotides of a coding sequence of a nucleic acid that

(i) has a sequence selected from SEQ ID Nos: 26, 27, 28, 29, 30, and/or 31, or a fragment thereof in a sense or anti-sense orientation, or (ii) is complementary to a sequence from (i), or (iii) is at least 80%, 82%, 84%, 86%, or 88%--preferably, at least 90%, 91%, 92%, 93%, 94%, 95%, or 96%, or, especially preferably, at least 97%, 97.5%, 98%, 98.5%, 99%, or 99.5%--identical to a sequence from (i), or (iv) encodes for a protein with the amino acid sequence selected from SEQ ID Nos: 32, 33, and/or 34, or a part of the protein, or (v) encodes for a homolog, analog, or ortholog of the protein according to (iv), or a part thereof, or (vi) hybridizes with a sequence from (ii) under stringent conditions. In post-transcriptional gene silencing, as described in, for example, the RNAi approach or miRNA approach (Fire et al., 1998), such a nucleic acid may be used to suppress the expression of the induction-inhibiting nucleic acid described above. The dsRNA-encoding nucleic acid may also be a nucleic acid which encodes for a long, non-coding RNA (lncRNA). The lncRNA-encoding nucleic acid then preferably comprises a nucleotide sequence that (a) has a sequence selected from SEQ ID Nos: 35, 36, 37, and/or 38, or a fragment thereof, or (b) is complementary to a sequence from (a), or (c) is at least 80%, 82%, 84%, 86%, or 88%--preferably, at least 90%, 91%, 92%, 93%, 94%, 95%, or 96%, or, especially preferably, at least 97%, 97.5%, 98%, 98.5%, 99%, or 99.5%--identical to a sequence from (a), or (d) encodes for a polypeptide with the amino acid sequence of SEQ ID Nos: 40 or 41, or a part of the polypeptide, or (e) hybridizes with a sequence (b) under stringent conditions. This lncRNA, designated in the following as lncRNA 1, may serve for expression or translation regulation of an inositol polyphosphate-5-phosphatase such as an inositol-1,4,5-triphosphate-5-phosphatase. Furthermore, the lncRNA-encoding nucleic acid may preferably comprises a nucleotide sequence that (w) has a sequence of the SEQ ID No: 39, or a fragment thereof, or (x) is complementary to a sequence from (w), or (y) is at least 80%, 82%, 84%, 86%, or 88%--preferably, at least 90%, 91%, 92%, 93%, 94%, 95%, or 96%, or, especially preferably, at least 97%, 97.5%, 98%, 98.5%, 99%, or 99.5%--identical to a sequence from (w), or (z) hybridizes with a sequence from (x) under stringent conditions. This lncRNA, designated in the following as lncRNA 2, may serve for expression or translation regulation of a phospholipase--especially, of the phospholipase A2 or the patatin phospholipase.

[0022] In a further especially preferred embodiment of the present invention, the nucleic acid which--after transcription or expression in a plant--is suitable for mediating the property of a haploid inductor or for increasing the induction capability of a haploid inductor may be a nucleic acid that encodes for an RNA that has a double-stranded portion, wherein at least one strand of the double-stranded portion has a nucleotide sequence which is homologous or identical to at least 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, or 25--preferably to at least 30, 35, 40, 45, 50, 60, 70, 80, 90, 100, 120, or 140, and, especially preferably, to at least one 160, 180, 200, 250, 300, 350, 400, 450, 500, 600, 700, 800, 900, or 1000--successive nucleotides of an intron sequence of a nucleic acid that

(i) has a sequence selected from SEQ ID Nos: 1, 6, 8, 9, 12, 13, 26, 30, 42, 43, 46, 55, 58, and/or 60, or a fragment thereof in a sense or anti-sense orientation, or (ii) is complementary to a sequence from (i), or (iii) is at least 80%, 82%, 84%, 86%, or 88%--preferably, at least 90%, 91%, 92%, 93%, 94%, 95%, or 96%, or, especially preferably, at least 97%, 97.5%, 98%, 98.5%, 99%, or 99.5%--identical to a sequence from (i), or (iv) encodes for a protein with the amino acid sequence selected from SEQ ID Nos: 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 44, 45, 48, 63, 64, and/or 65, or from SEQ ID Nos: 32, 33, and/or 34, or a part of the protein, or (v) encodes for a homolog, analog, or ortholog of the protein according to (iv), or a part thereof, or (vi) hybridizes with a sequence from (ii) under stringent conditions. In transcriptional gene silencing, as, for example, in the RdDM approach (Shibuya et al., 2009), such a nucleic acid may be used to activate the expression of the induction-inducing nucleic acid described above, or to suppress the expression of the induction-inhibiting nucleic acid described above. The dsRNA-encoding nucleic acid may also be a nucleic acid which encodes for a long, non-coding RNA (lncRNA). The lncRNA-encoding nucleic acid then preferably comprises a nucleotide sequence that (a) has a sequence selected from SEQ ID Nos: 35, 36, 37, and/or 38, or a fragment thereof, or (b) is complementary to a sequence from (a), or (c) is at least 80%, 82%, 84%, 86%, or 88%--preferably, at least 90%, 91%, 92%, 93%, 94%, 95%, or 96%, or, especially preferably, at least 97%, 97.5%, 98%, 98.5%, 99%, or 99.5%--identical to a sequence from (a), or (d) encodes for a polypeptide with the amino acid sequence of SEQ ID Nos: 40 or 41, or a part of the polypeptide, or (e) hybridizes with a sequence from (b) under stringent conditions. This lncRNA, designated in the following as lncRNA 1, may serve for expression or translation regulation of an inositol polyphosphate-5-phosphatase such as an inositol-1,4,5-triphosphate-5-phosphatase. Furthermore, the lncRNA-encoding nucleic acid may preferably comprise a nucleotide sequence that (w) has a sequence of the SEQ ID No: 39, or a fragment thereof, or (x) is complementary to a sequence from (w), or (y) is at least 80%, 82%, 84%, 86%, or 88%--preferably, at least 90%, 91%, 92%, 93%, 94%, 95%, or 96%, or, especially preferably, at least 97%, 97.5%, 98%, 98.5%, 99%, or 99.5%--identical to a sequence from (w), or (z) hybridizes with a sequence from (x) under stringent conditions. This lncRNA, designated in the following as lncRNA 2, may serve for expression or translation regulation of a phospholipase--especially, of the phospholipase A2 or the patatin phospholipase.

TABLE-US-00001 TABLE 1 Sequence index and sequence association of the nucleotide and amino acid sequences. The names of gene families/protein families correspond to the public models. Discrepant functionalities may occur in the inductors due to structural changes. SEQ Gene family/protein ID family identifier// No: derived gene model Sequence type Source 1 GRMZM2G179789 genomic DNA B73 SNAREv 1 (Zea mays) 2 GRMZM2G179789 cDNA B73 SNAREv 1 (Zea mays) 3 GRMZM2G179789 cDNA (Variant 1) RWS SNAREv 1 (Zea mays) 4 GRMZM2G179789 cDNA (Variant 2) RWS SNAREv 1 (Zea mays) 5 GRMZM2G179789 cDNA (partial sequence of RWS SNAREv 1 (Zea mays) nucleotide 1240 to 2321 of SEQ ID No: 3) 6 GRMZM2G412426 genomic DNA B73 SNAREv 2 (Zea mays) 7 GRMZM2G412426 cDNA B73 SNAREv 2 (Zea mays) 8 GRMZM2G471240 genomic DNA, including B73 Patatin phospholipase edges (Zea mays) 9 GRMZM2G471240 genomic DNA, including RWS Patatin phospholipase edges (Zea mays) 10 GRMZM2G471240 cDNA B73 Patatin phospholipase (Zea mays) 11 GRMZM2G471240 cDNA RWS Patatin phospholipase (Zea mays) 12 GRMZM2G347808 genomic DNA, including B73 RNA methyl transferase edges (Zea mays) 13 GRMZM2G347808 genomic DNA, including RWS RNA methyl transferase edges (Zea mays) 14 GRMZM2G347808 cDNA B73 RNA methyl transferase (Zea mays) 15 GRMZM2G347808 cDNA RWS RNA methyl transferase (Zea mays) 16 GRMZM2G179789 Polypeptide (Variant 1) B73 SNAREv 1 (Zea mays) 17 GRMZM2G179789 Polypeptide (Variant 2) B73 SNAREv 1 (Zea mays) 18 GRMZM2G179789 Polypeptide (Variant 1) RWS SNAREv 1 (Zea mays) 19 GRMZM2G179789 Polypeptide (Variant 2) RWS SNAREv 1 (Zea mays) 20 GRMZM2G412426 Polypeptide B73 SNAREv 2 (Zea mays) 21 GRMZM2G471240 Polypeptide (Variant 1) B73 Patatin phospholipase (Zea mays) 22 GRMZM2G471240 Polypeptide (Variant 2) B73 Patatin phospholipase (Zea mays) 23 GRMZM2G471240 Polypeptide RWS Patatin phospholipase (Zea mays) 24 GRMZM2G347808 Polypeptide B73 RNA methyl transferase (Zea mays) 25 GRMZM2G347808 Polypeptide RWS RNA methyl transferase (Zea mays) 26 GRMZM2G106834 genomic DNA B73 Phosphoinositol phosphatase (Zea mays) 27 GRMZM2G106834 cDNA (Variant 1) B73 Phosphoinositol phosphatase (Zea mays) 28 GRMZM2G106834 cDNA (Variant 2) B73 Phosphoinositol phosphatase (Zea mays) 29 GRMZM2G106834 cDNA (Variant 3) B73 Phosphoinositol phosphatase (Zea mays) 30 GRMZM2G062320 genomic DNA B73 Phosphoglycerate mutase (Zea mays) 31 GRMZM2G062320 cDNA B73 Phosphoglycerate mutase (Zea mays) 32 GRMZM2G106834 Polypeptide (Variant 1) B73 Phosphoinositol phosphatase (Zea mays) 33 GRMZM2G106834 Polypeptide (Variant 2) B73 Phosphoinositol phosphatase (Zea mays) 34 GRMZM2G062320 Polypeptide B73 Phosphoglycerate mutase (Zea mays) 35 IncRNA for cDNA (Variant 1) B73 phosphoinositol phosphatase (Zea mays) 36 IncRNA for cDNA (Variant 2) B73 phosphoinositol phosphatase (Zea mays) 37 IncRNA for cDNA (Variant 3) B73 phosphoinositol phosphatase (Zea mays) 38 IncRNA for cDNA RWS phosphoinositol phosphatase (Zea mays) 39 IncRNA for Polypeptide (Variant 1) B73 phospholipase (Zea mays) 40 IncRNA for Polypeptide (Variant 2) B73 phosphoinositol phosphatase (Zea mays) 41 IncRNA for genomic DNA B73 phosphoinositol phosphatase (Zea mays) 42 Mitochondrial genomic DNA RWS import receptor (MITO1) (Zea mays) 43 Mitochondrial genomic DNA RWS import receptor (MITO2) (Zea mays) 44 Mitochondrial Polypeptide RWS import receptor (MITO1) (Zea mays) 45 Mitochondrial Polypeptide RWS import receptor (MITO2) (Zea mays) 46 Phospholipase genomic DNA (Helianthus annuus) 47 Phospholipase cDNA (Helianthus annuus) 48 Phospholipase Polypeptide (Helianthus annuus) 49 Patatin genomic DNA TILLING phospholipase D74N (Zea mays) 50 Patatin cDNA TILLING phospholipase D74N (Zea mays) 51 Patatin Polypeptide TILLING phospholipase D74N (Zea mays) 52 Patatin genomic DNA TILLING phospholipase G78R (Zea mays) 53 Patatin cDNA TILLING phospholipase G78R (Zea mays) 54 Patatin Polypeptide TILLING phospholipase G78R (Zea mays) 55 Snare T1 (Zea mays) genomic RWS 56 Snare T1.t1 (Zea mays) cDNA RWS 57 Snare T1.t2 (Zea mays) cDNA RWS 58 Snare T2 (Zea mays) genomic RWS 59 Snare T2 (Zea mays) cDNA RWS 60 Snare T3 (Zea mays) genomic RWS 61 Snare T3.t1 (Zea mays) cDNA RWS 62 Snare T3.t2 (Zea mays) cDNA RWS 63 Snare T1.t2 (Zea mays) Polypeptide RWS 64 Snare T2 (Zea mays) Polypeptide RWS 65 Snare T3.t1 (Zea mays) Polypeptide RWS

[0023] In a further aspect, the present invention relates to a vector which comprises the nucleic acid according to the invention. The vector may be a plasmid, a cosmid, a phage or an expression vector, a transformation vector, shuttle vector, or cloning vector; it may be double- or single-stranded, linear or circular; or it may transform a prokaryotic or eukaryotic host, either via integration into its genome or extrachromosomally. The nucleic acid according to the invention is preferably operatively linked in a vector with one or more regulatory sequences which allow the transcription, and, optionally, the expression, in a prokaryotic or eukaryotic host cell. A regulatory sequence--preferably, DNA--may be homologous or heterologous to the nucleic acid according to the invention. For example, the nucleic acid is under the control of a suitable promoter or terminator. Suitable promoters may be promoters which are constitutively induced (example: 35S promoter from the "Cauliflower mosaic virus" (Odell et al., 1985); those promoters which are tissue-specific are especially suitable (example: Pollen-specific promoters, Chen et al. (2010), Zhao et al. (2006), or Twell et al. (1991)), or are development-specific (example: blossom-specific promoters). Suitable promoters may also be synthetic or chimeric promoters which do not occur in nature, are composed of multiple elements, and contain a minimal promoter, as well as--upstream of the minimum promoter--at least one cis-regulatory element which serves as a binding location for special transcription factors. Chimeric promoters may be designed according to the desired specifics and are induced or repressed via different factors. Examples of such promoters are found in Gun & Rushton (2005) or Venter (2007). For example, a suitable terminator is the nos-terminator (Depicker et al., 1982).

[0024] In addition to the vectors described above, the present invention also provides a method that includes the insertion of a described vector into a host cell. For example, the vector may be introduced via conjugation, mobilization, biolistic transformation, agrobacteria-mediated transformation, transfection, transduction, vacuum infiltration, or electroporation. Such methods, like the methods for preparation of described vectors, are commonplace to the person skilled in the art (Sambrook et al., 2001).

[0025] In a further aspect, the present invention relates to a host cell which comprises the nucleic acid according to the invention or the vector of the present invention. A host cell in the sense of the invention may be a prokaryotic (for example, bacterial) or eukaryotic cell (for example, a plant cell or a yeast cell). The host cell is preferably an Agrobacterium, such as Agrobacterium tumefaciens or Agrobacterium rhizogenes, or a plant cell which comprises the nucleic acid according to the invention or the vector of the present invention. To the person skilled in the art, both numerous methods (such as conjugation or electroporation) with which he may introduce the nucleic acid according to the invention or the vector of the present invention into an Agrobacterium, and methods such as diverse transformation methods (biolistic transformation, Agrobacterium-mediated transformation) with which he may introduce the nucleic acid according to the invention or the vector of the present invention into a plant cell (Sambrook et al., 2001), are known.

[0026] In a further aspect, the present invention relates to a transgenic plant cell which comprises the nucleic acid according to the invention as a transgene or the vector of the present invention, and relates to a transgenic plant or a part thereof which comprises the transgenic plant cell. For example, such a plant cell or plant is a plant cell or plant which is (preferably, stably) transformed with the nucleic acid according to the invention or with the vector of the present invention. A transgenic plant of the present invention is preferably suitable for use as a haploid inductor. In a preferred embodiment of the transgenic plant, the nucleic acid is operatively linked with one or more regulatory sequences which allow the transcription and, optionally, the expression in the plant cell. A regulatory sequence, preferably DNA, may be homologous or heterologous to the nucleic acid according to the invention. The total structure made up of the nucleic acid according to the invention and the regulatory sequence(s) may then represent the transgene. A part of a plant may be a fertilized or unfertilized seed, an embryo, a pollen, a tissue, an organ, or a plant cell, wherein the fertilized or unfertilized seed, the embryo, or the pollen are generated in the transgenic plant, and the nucleic acid according to the invention is integrated into its genome as a transgene or the vector. The present invention likewise also includes a descendant of the transgenic plant in whose genome the nucleic acid according to the invention is integrated as a transgene or vector, and which is suitable for use as a haploid inductor.

[0027] In another aspect, the present invention relates to a protein or a polypeptide which is encoded by a nucleic acid according to the invention. The protein or polypeptide is preferably suitable for mediating the property of a haploid inductor in a plant, or for increasing the induction capability of a haploid inductor. The protein or polypeptide encoded by the induction-inducing nucleic acid is especially preferred. A protein or a polypeptide of the present invention preferably includes an amino acid sequence selected from SEQ ID Nos: 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 44, 45, 48, 51, 54, 63, 64, and/or 65, or from SEQ ID Nos: 32, 33, and/or 34, or from SEQ ID Nos: 40 and/or 41.

[0028] In a further aspect, the present invention describes a method for the production of a plant which is suitable for use as a haploid inductor. The method may include the following steps:

A) mutagenization of plant cells and subsequent regeneration of plants from the mutagenized plant cells or mutagenization of plants, and B) identification of a plant A) which has at least one mutation in an endogenous DNA sequence which is identical to the nucleic acid according to the invention, or in a regulatory sequence of the endogenous DNA sequence (for example, a promoter, enhancer, terminator, or intron), which mutation produces a change in the transcription or expression rate of the endogenous DNA sequence in the identified plant, in comparison to a non-mutagenized wild-type plant, or a change in the activity or stability of a protein or polypeptide encoded by the endogenous DNA sequence in the identified plant, in comparison to a non-mutagenized wild-type plant, wherein the at least one mutation causes the property of a haploid inductor to be mediated or the induction capability of a haploid inductor to be increased in the identified plant. The change in the transcription rate or expression rate, or the change in the activity or stability, preferably appears at least in a pollen of the identified plant or in a tissue of a pollen of the identified plant.

[0029] The endogenous DNA sequence from step B), or an RNA encoded from the endogenous DNA sequence, or a protein or polypeptide encoded from the DNA sequence, preferably has an influence on the pollen tube growth in a plant, on the energy metabolism of a pollen of a plant, and/or on the activity of the centromere--preferably, in a generative cell which develops into a pollen, for example.

[0030] The endogenous DNA sequence from step B) of the method for the production of a plant which is suitable for use as a haploid inductor especially preferably encodes for a SNAREv protein; an enzyme of the phospholipase class--in particular, phospholipase A2 or the patatin phospholipase; an enzyme of the inositol polyphosphate-5-phosphatase class, such as inositol-1,4,5-triphosphate-5-phosphatase; a phosphoglycerate mutase or methyl transferase--in particular, an RNA methyl transferase, wherein, in the case of the SNARE protein, the phospholipase and the methyl transferase, the transcription rate or expression rate, or the activity or stability, is preferably altered to the effect that it is increased, and wherein, in the case of the inositol polyphosphate-5-phosphatase and the phosphoglycerate mutase, the transcription rate or expression rate, or the activity or stability, is preferably altered to the effect that it is reduced.

[0031] Step B) of the method for the production of a plant is, with very particular preference, the identification of a plant from A) which a) has at least one mutation in an endogenous DNA sequence having a nucleotide sequence that is identical to the induction-inducing nucleic acid or the nucleic acid which encodes the lncRNA 1, or in a regulatory sequence of the endogenous DNA sequence (for example, a promoter, enhancer, terminator, or intron), which at least one mutation effects an increase in the transcription or expression rate of the endogenous DNA sequence or an increase in the activity or stability of a protein or polypeptide encoded by the endogenous DNA sequence; and/or b) has at least one mutation in an endogenous DNA sequence having a nucleotide sequence that is identical to the induction-inhibiting nucleic acid or the nucleic acid which encodes the lncRNA 2, or in a regulatory sequence of the endogenous DNA sequence (for example, a promoter, enhancer, terminator, or intron), which at least one mutation effects a reduction in the transcription or expression rate of the endogenous DNA sequence or a reduction in the activity or stability of a protein or polypeptide encoded by the endogenous DNA sequence, wherein the at least one mutation from a) and/or b) causes the property of a haploid inductor to be mediated or the induction capability of a haploid inductor to be increased in the identified plant. The change in the transcription rate or expression rate, or the change in the activity or stability, preferably appears at least in a pollen of the identified plant or in a tissue of a pollen of the identified plant.

[0032] A mutation means a modification at the DNA level, and thus a change in the genetics and/or epigenetics. For example, an alteration in the genetics may be the exchange of at least one nucleobase in the endogenous DNA sequence or in a regulatory sequence of the endogenous DNA sequence. If such a nucleobase exchange takes place in a promoter, for example, this may lead to an altered activity of the promoter, since, for example, cis-regulator elements are modified such that the affinity of a transcription factor to the mutated cis-regulatory elements is altered in comparison to the wild-type promoter, so that the activity of the promoter with the mutated cis-regulatory elements is increased or reduced, depending upon whether the transcription factor is a repressor or inductor, or whether the affinity of the transcription factor to the mutated cis-regulatory elements is intensified or weakened. If such a nucleobase exchange occurs, e.g., in an encoding region of the endogenous DNA sequence, this may lead to an amino acid exchange in the encoded protein, which may produce an alteration in the activity or stability of the protein, in comparison to the wild-type protein. An additional example of an alteration in the genetics is the deletion of nucleotides in the regulatory sequence and/or of the endogenous DNA sequence, as well as the addition of nucleotides in the regulatory sequence and/or the endogenous DNA sequence. Das & Martienssen (1995) shows an example of the regulation of genes via insertion of nucleotides by transposon mutagenesis in maize. An alteration in the epigenetics may take place via an altered methylation pattern of the DNA.

[0033] It is known to the person skilled in the art how a mutation in the sense of the invention may be achieved via the process of a mutagenization in step A) of the method for production of a plant which is suitable for use as a haploid inductor. The mutagenization in this connection includes both conventional mutagenesis and location-specific mutagenesis or "genome editing." In conventional mutagenesis, modification at the DNA level is not produced in a targeted manner. The plant cell or the plant is exposed to mutagenic conditions, such as TILLING, via UV light exposure or the use of chemical substances (Till et al., 2004). An additional method of random mutagenesis is mutagenesis with the aid of a transposon. The UniformMU project makes a comprehensive collection of mutants freely available. The collection and the method are described in McCarty et al. (2005). Location-specific mutagenesis enables the introduction of modification at the DNA level in a target-oriented manner at predefined locations in the DNA. For example, TALENS (WO 2010/079430, WO 2011/072246), meganucleases (Silva et al., 2011), homing endonucleases (Chevalier 2002), zinc finger nucleases (Lloyd et al., 2005), or a CRISPR/Cas System (Gaj et al., 2013) may be used for this.

[0034] The identification of a plant in step B) may take place with the aid of molecular markers or probes, for example. For example, DNA probes are primers or primer pairs which may be used in a PCR reaction. For example, Tilling mutants may be verified or identified by sequencing the target gene in a Tilling population, or via additional methods that verify the mispairings in the DNA, e.g., melting point analyses or use of mispairing-specific nucleases. For this, the present invention likewise incorporates primers/primer pairs that are usable for this, e.g., primers for phospholipase, phosphoglycerate mutase, methyl transferase and lncRNA for phospholipase. Mutants generated by means of transposons may also be verified by use of transposon-specific primers and target gene-specific primers in PCR, across the entire population and subsequent sequencing of PCR products. Such primers are also encompassed by the present invention. For example, a change in the expression rate in pollen may be determined with RT-PCR; the change in the stability may be determined by examining ubiquitin binding locations and prediction of changes to the tertiary structure, for example. Furthermore, recombinant expression of the wild-type proteins, and the corresponding mutant proteins and subsequent biochemical activity tests, are also suitable. Additional means and methods which may be used to identify a plant in step B) are known to the person skilled in the art from the prior art.

[0035] The present invention also relates to molecular markers, which demonstrate the presence or absence of a mutation in the endogenous DNA sequence, or in a regulatory sequence of the endogenous DNA sequence. For example, such markers are based upon an SNP and are specific to the mutation (examples: KASPar or TaqMan markers).

[0036] The present invention further also relates to a plant which can be or is produced with the preceding method, or a part of this plant, wherein a part of the plant may be a fertilized or unfertilized seed, an embryo, a pollen, a tissue, an organ, or a plant cell, wherein the fertilized or unfertilized seed, the embryo, or the pollen are generated at the transgenic plant, and the at least one mutation is present in its genome. The present invention likewise also includes a descendant of the plant which has the at least one mutation and is suitable for use as a haploid inductor. Two examples of plants which have been produced with the preceding method are plants--preferably, Zea mays or Helianthus annuus--that, in an endogenous DNA sequence, with the nucleic acid, comprehensively (i) has a sequence selected from SEQ ID Nos: 8, 9, and/or 46 or a functional fragment thereof; or (ii) is complementary to a sequence from (i); or (iii) is at least 80% identical to a sequence from (i); or (iv) encodes for a protein having the amino acid sequence selected from SEQ ID Nos: 21, 22, 23, and/or 48, or a functional part of the protein; or (v) encodes for a homolog, analog, or ortholog of the protein according to (iv), or a functional part thereof; or (vi) is identical to a sequence hybridized under stringent conditions from (ii), or has at least one mutation in a regulatory sequence of the endogenous DNA sequence, which produces a change in the transcription or expression rate of the endogenous DNA sequence in the identified plant, in comparison to a non-mutagenized wild-type plant, or a change in the activity or stability of a protein or polypeptide encoded by the endogenous DNA sequence in the identified plant, in comparison to a non-mutagenized wild-type plant, wherein the at least one mutation causes the property of a haploid inductor to be mediated or the induction capability of a haploid inductor to be increased in the identified plant. The mutation is preferably an alteration in the encoding sequence of SEQ ID No: 8 or 9 (for example, a point mutation) which causes an amino acid exchange between the amino acid positions 74 and 78 of SEQ ID No: 21, 22, or 23, or the mutation causes modifications in the encoding sequence of SEQ ID No. 46 which causes an amino acid exchange in [the] corresponding encoding sequence of SEQ ID No: 48. This may here involve mutations according to SEQ ID Nos: 49 through 54. The mutation caused by TILLING in SEQ ID No: 49 causes an amino acid exchange in the encoded amino acid at position 74, wherein the aspartate is replaced by asparagine (D74N); the mutation in SEQ ID No: 52 causes an amino acid exchange in the encoded amino acid at position 78, wherein the glycine is replaced by arginine (G78R).

[0037] Furthermore, the present invention also concerns a method for isolation of a nucleic acid that mediates the property of a haploid inductor or increases the induction capability of a haploid inductor in a plant, including the following steps:

A) production of a plant according to the method described in the preceding, or provision of a plant which can be or is produced with the method described in the preceding; and B) isolation, from the genome of the plant from A), of a nucleic acid which comprises the endogenous DNA sequence having the at least one mutation. The isolation of the nucleic acid in step B) may take place via CTAB extraction or via DNA-binding columns; the verification of the mutation may take place via sequencing or molecular markers such as SNP-based KASPar or TaqMan markers, or, for insertion or deletion mutants, via length polymorphism-based markers.

[0038] The present invention also includes a nucleic acid which was obtained or can be obtained via the method for isolation as described in the preceding, as well as a vector which comprises the isolated nucleic acid.

[0039] In another aspect, the present invention also relates to a method for the production of a transgenic plant which is suitable for use as a haploid inductor. The method may include the following steps:

A) provision of the nucleic acid described above which, after transcription or expression in a plant, is suitable for mediating the property of a haploid inductor or for increasing the induction capability of a haploid inductor; or provision of the isolated nucleic acid described above, which nucleic acid comprises the endogenous DNA sequence having the at least one mutation; or provision of one of the vectors described above, B) transformation--preferably, stable transformation--of plant cells via introduction of the nucleic acid or of the vector from A), C) regeneration of transgenic plants from the transformed plant cells from B), and D) identification of a transgenic plant from C) in which, via a modified expression pattern--preferably, in pollen of the identified plant or in a tissue of a pollen of the identified plant--the property of a haploid inductor is mediated, or the induction capability of a haploid inductor is increased. The method for production of a transgenic plant which is suitable for use as a haploid inductor also includes the provision of two or more of the nucleic acids described above--alternatively, also different embodiments of the nucleic acid according to the invention and, optionally, in one or more vectors--and the transformation of plant cells via the introduction of two or more nucleic acids. Alternatively or additionally, one or more additional nucleic acids which are known to be usable for the generation of a haploid inductor (for example, manipulated cenh3 gene (Ravi & Chan, 2010)) may also be provided and transformed, in addition to the nucleic acid according to the invention.

[0040] The expression pattern is preferably altered to the effect that

(I) the transcription or expression rate of the introduced induction-promoting nucleic acid or introduced nucleic acid which encodes the lncRNA 1 is increased in the identified plant in comparison to a wild-type plant which, for example, was regenerated from an isogenic, untransformed plant cell, and/or (II) the transcription or expression rate of the introduced induction-inhibiting nucleic acid or introduced nucleic acid which encodes the lncRNA 2 is reduced in the identified plant in comparison to a wild-type plant which, for example, was regenerated from an isogenic, untransformed plant cell, and/or (III) due to post-transcriptional gene silencing, the expression rate of an endogenous DNA sequence having a nucleotide sequence that is identical to the induction-inhibiting nucleic acid is reduced--via a double-stranded RNA which is encoded by the introduced nucleic acid which is described above in connection with post-transcriptional gene silencing--in the identified plant in comparison to a wild-type plant which, for example, was regenerated from an isogenic, untransformed plant cell, and/or (IV) due to transcriptional gene silencing, the transcription or expression rate of an endogenous DNA sequence having a nucleotide sequence that is identical to the induction-inducing nucleic acid or introduced nucleic acid which encodes the lncRNA 1 is increased, by a double-stranded RNA which is encoded by the introduced nucleic acid which is described above in detail in connection with transcriptional gene silencing, in comparison to a wild-type plant which, for example, was regenerated from an isogenic, untransformed plant cell; and/or the transcription or expression rate of an endogenous DNA sequence having a nucleotide sequence that is identical to the induction-inhibiting nucleic acid or introduced nucleic acid which encodes for lncRNA 2 is reduced, by a double-stranded RNA which is encoded by the introduced nucleic acid which is described above in detail in connection with transcriptional gene silencing, in comparison to a wild-type plant which, for example, was regenerated from an isogenic, untransformed plant cell. A verification of the transcription rate may take place via qRT-PCR, for example. An altered protein stability may be determined via Western blot, for example.

[0041] The present invention further also relates to a transgenic plant which can be or is produced with this method, or a part of this plant, wherein a part of the plant may be a fertilized or unfertilized seed, an embryo, a pollen, a tissue, an organ, or a plant cell, wherein the fertilized or unfertilized seed, the embryo, or the pollen are generated at the transgenic plant, and the nucleic acid according to the invention is integrated into its genome as a transgene or the vector. The present invention likewise also includes a descendant of the transgenic plant which has the introduced nucleic acid as a transgene and is suitable for use as a haploid inductor.

[0042] In another aspect, the present invention relates to a method for the production of a haploid plant, which method includes the following steps:

A) crossing a non-transgenic or transgenic plant of the present invention which is suitable for use as a haploid inductor with a plant of the same genus--preferably, of the same species, B) selecting a fertilized haploid seed or embryo, and C) generating a haploid plant from the seed or embryo from B).

[0043] The plant which is suitable for use as a haploid inductor is preferably used as a pollen parent and is crossed with a seed elder of the same genus--preferably, of the same species. The plant which is suitable for use as a haploid inductor may also be used as a seed parent and be crossed with a pollen elder of the same genus--preferably, of the same species. Both cross partners in step A)--thus, seed parent and pollen parent--may also be the same individual. The crossing step then represents a selfing.

[0044] The selection of the haploid fertilized seed or embryo may include a step of the verification of the haploidy, and the separation of the haploid fertilized seed or embryo of polyploid fertilized seed or embryo. The verification of the haploidy of a fertilized seed or embryo may take place phenotypically or genotypically, in that, for example, the inductor is provided with an embryo-specific dominant marker that is visible in all diploid descendants, but not in the induced haploid descendants. Furthermore, the ploidy status may be determined via flow cytometry. Moreover, a complete, homozygotic pattern of molecular markers provides an indication of haploid plants. For example, the separation may take place automatically on the basis of data of the verification of the haploidy.

[0045] The present invention further also relates to a haploid, fertilized seed, or embryo which is created upon crossing in step A) of the method for production of a haploid plant, as well as a haploid plant which can be or is produced with this method, or a part of this plant, wherein a part of a plant may be a seed, an embryo, a tissue, an organ, or a plant cell. The present invention likewise also includes a descendant of the plant. Furthermore, the present invention also includes a double-haploid (diploid) plant or a part thereof, wherein the double-haploid (diploid) plant or a part thereof was generated by chromosome duplication of the haploid plant or of the part thereof.

[0046] In a further aspect, the present invention relates to the use of the nucleic acid according to the invention, or of the vector according to the invention, in a plant to mediate the property of a haploid inductor or to increase the induction capability of a haploid inductor, or the use of the nucleic acid according to the invention or of the vector according to the invention to produce a plant or a transgenic plant which is suitable for use as a haploid inductor. Furthermore, the present invention also includes the use of a plant according to the invention as described above, which is suitable for use as a haploid inductor, to produce a haploid, fertilized seed or embryo, or a haploid plant. Preceding explanations regarding subject matters and methods of the present invention are also applicable to the cited uses.

[0047] In another aspect, the present invention also relates to a means for external application to plants. This means is provided for external application to plants and is suitable for mediating the property of a haploid inductor in the plant or for increasing the induction capability of a haploid inductor plant. The application preferably occurs at the point in time of the anther formation, pollen formation, or fertilization. The means comprises RNA that has a double-stranded portion, wherein at least one strand of the double-stranded portion has a nucleotide sequence which is homologous or identical to at least 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, or 25--preferably, to at least 30, 35, 40, 45, 50, 60, 70, 80, 90, 100, 120, or 140, and, especially preferably, to at least 160, 180, 200, 250, 300, 350, 400, 450, 500, 600, 700, 800, 900, or 1000--successive nucleotides of a coding sequence of a nucleic acid that

(i) has a sequence selected from SEQ ID Nos: 26, 27, 28, 29, 30, and/or 31, or a fragment thereof in a sense or anti-sense orientation, or (ii) is complementary to a sequence from (i), or (iii) is at least 80%, 82%, 84%, 86%, or 88%--preferably, at least 90%, 91%, 92%, 93%, 94%, 95%, or 96%, or, especially preferably, at least 97%, 97.5%, 98%, 98.5%, 99%, or 99.5%--identical to a sequence from (i), or (iv) encodes for a protein with the amino acid sequence selected from SEQ ID Nos: 32, 33, and/or 34, or a part of the protein, or (v) encodes for a homolog, analog, or ortholog of the protein according to (iv), or a part thereof, or (vi) hybridizes with a sequence from (ii) under stringent conditions.

[0048] Double-stranded RNA for the production of the means according to the invention may be produced in vitro by means of the methods known to the person skilled in the art. For example, the synthesis of the double-stranded RNA may take place synthetically, wherein the RNA is formed directly in vitro. Starting from a double-stranded DNA, the double-stranded RNA may also be synthesized via the formation of an mRNA transcript, which then forms a hairpin structure, for example. The means may be used as a trigger for a haploid induction in a plant. For example, the means may be used by being sprayed in the form of a spray, or via additional ways of external application that are commonplace to the person skilled in the art, onto the plant tissue, or by spraying or mixing with additional additives before or after the flowering of the plant. For example, additives may be wetting agents, carrier substances, or RNA stabilizers, e.g., liposomes.

[0049] Surprisingly, the inventors have established that it is precisely genes or gene products which have an influence on the pollen tube growth, on the energy metabolism of a pollen and/or on the activity of the centromere--preferably, in a generative cell which develops into a pollen, for example--are especially suited for converting a non-haploid inductor into a haploid inductor. For this, multiple gene families/protein families which are of significant importance could be identified. Their use for generation of haploid inductors has neither been described nor suggested before in the prior art. Because the creation of pollen, and also the fertilization process (including the growth of the pollen tube), follow generally valid principles in mono- and dicotyledon plants, with the technical teaching of the present invention, the person skilled in the art receives the possibility of developing haploid inductors even for cultivated plants for which neither an efficient system of in vivo haploid induction nor other cell culture-based methods for the creation of double-haploid plants had previously existed. For this, using the genetic information which he obtains via the present invention, he may discover homologs, orthologs, or analogs of the described gene products via routine activity, and manipulate them as described here. The technical teaching of the present invention is, however, also suitable for further improving the already existing inductors with regard to their efficiency (i.e., haploid induction rate), and thus for making them economically applicable for the first time. Furthermore, a person skilled in the art may also combine this technical teaching with additional known mechanisms of haploid induction, such as a manipulation of the CENH3 protein (Ravi & Chan, 2010), and thus further increase the efficiency.

[0050] Some of the terms used in this application are explained in detail in the following:

[0051] "B73" is a maize breeding line that is used as a model genotype in maize genetics and was used to create the first maize reference sequence.

[0052] "Mediate the property of a haploid inductor" or the "mediation of the property of a haploid inductor" or a comparable phrase means that, via the use of a nucleic acid according to the invention, a plant is placed in the position for producing fertilized seeds or embryos which have a single (haploid) chromosome set from a crossing with a plant of the same genus--preferably, of the same species--which does not have the property of a haploid inductor. The property of a haploid inductor, specified as an absolute haploid induction rate, means that at least 0.1%, 0.2%, 0.3%, 0.4%, 0.5%, 0.6%, 0.7%, 0.8%, 0.9%, or 1%--preferably, at least 1.5%, 2%, 2.5%, 3%, 3.5%, 4%, 4.5%, or 5%, or, especially preferably, 6%, 7%, 8%, 9%, 10%, 11%, 12%, 13%, 14%, or 15%, or, with very particular preference, at least 20%, 25%, 30%, 35%, 40%, 45%, or 50%--of the fertilized seeds or embryos have a haploid chromosome set.

[0053] "Increase in the expression rate" or "increased expression rate" or "activation of the expression" or a comparable expression means an increase in the expression rate of a nucleotide sequence by more than 10%, 15%, 20%, 25%, or 30%--preferably, by more than 40%, 50%, 60%, 70%, 80%, 90%, or 100%, or, especially preferably, by more than 150%, 200%, 250%, 300%, 500%, or 1000%--in comparison to the specified reference. The increase in the expression rate preferably leads to a change of the phenotype of a plant in which the expression rate is increased. An altered phenotype may be the mediation of the property of a haploid inductor, or the increase in the induction capability of a haploid inductor.

[0054] "Increase in the transcription rate" or "increased transcription rate" or a comparable expression means an increase in the transcription rate of a nucleotide sequence by more than 10%, 15%, 20%, 25%, or 30%--preferably, by more than 40%, 50%, 60%, 70%, 80%, 90%, or 100%, or, especially preferably, by more than 150%, 200%, 250%, 300%, 500%, or 1000%--in comparison to the specified reference. The increase in the transcription rate preferably leads to a change of the phenotype of a plant in which the transcription rate is increased. An altered phenotype may be the mediation of the property of a haploid inductor, or the increase in the induction capability of a haploid inductor.

[0055] A "functional fragment" of a nucleotide sequence means a segment of a nucleotide sequence which has the functionality identical or comparable to the complete nucleotide sequence from which the functional fragment originates. As such, the functional fragment may possess a nucleotide sequence which is identical or homologous to the complete nucleotide sequence over a length of at least 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 92%, 94% 96%, 97%, 98%, or 99%. Furthermore, a "functional fragment" of a nucleotide sequence may also mean a segment of a nucleotide sequence which alters the functionality of the total nucleotide sequence, e.g., in the course of post-transcriptional or transcriptional gene silencing. As such, the functional fragment of a nucleotide sequence may include at least 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, or 25--preferably, at least 30, 35, 40, 45, 50, 60, 70, 80, 90, 100, 120, or 140, or, especially preferably, at least 160, 180, 200, 250, 300, 350, 400, 450, 500, 600, 700, 800, 900, or 1000--successive nucleotides of the complete nucleotide sequence.

[0056] A "functional part" of a protein means a segment of a protein, or a section of the amino acid sequence, that encodes for the protein, wherein the segment may exert functionality identical or comparable to the entire protein in a plant cell. A functional part of a protein has, over a length of at least 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 92%, 94%, 96%, 97%, 98%, or 99%, an identical or--under conservative and semi-conservative amino acid exchanges--similar amino acid sequence to that of the protein from which the functional part originates.

[0057] "Haploid inductor" also means an in vivo haploid inductor.

[0058] The term "heterolog" means that the introduced polynucleotide originates from, for example, a cell or an organism having a different genetic background of the same species or another species, or is homologous to the prokaryotic or eukaryotic host cell, but then is localized in a different genetic environment and thus differs from a possible, naturally present, corresponding polynucleotide. A heterologous polynucleotide may be present in addition to a corresponding endogenous gene.

[0059] In the sense of the invention, what is understood by a "homolog" is a protein of the same phylogenetic origin, what is understood by an "analog" is a protein which exerts the same function, but has a different phylogenetic origin, and what is understood by an "ortholog" is a protein from a different species that exerts the same function.

[0060] What is understood by "hybridizing" or "hybridization" is a process in which a single-stranded nucleic acid molecule is added to a nucleic acid strand that is complementary to the greatest possible extent, i.e., enters into base pairing. Standard methods for hybridization are described in Sambrook et al. 2001, for example. What is preferably understood by this is that at least 60%--more preferably, at least 65%, 70%, 75%, 80%, or 85%, or, especially preferably, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99%--of the bases of the nucleic acid molecule enter into a base pairing with the nucleic acid strand that is complementary to the greatest possible extent. The possibility of such an addition depends upon the stringency of the hybridization conditions. The term "stringency" relates to the hybridization conditions. High stringency is present when a base pairing is made more difficult; low stringency is present if a base pairing is made easier. For example, the stringency of the hybridization conditions depends upon the salt concentration, or ion strength, and the temperature. In general, the stringency may be increased by increasing the temperature and/or decreasing the salt content. What are to be understood by "stringent hybridization conditions" are those conditions given which a hybridization predominantly occurs only between homologous nucleic acid molecules. The term "hybridization conditions" thereby relates not only to the conditions prevailing in the actual addition of the nucleic acids, but also to the conditions prevailing in the following washing steps. Stringent hybridization conditions are, for example, conditions under which, predominantly, only those nucleic acid molecules are hybridized that have at least 70%--preferably, at least 75%, at least 80%, at least 85%, at least 90%, or at least 95%--sequence identity. Stringent hybridization conditions are, for example, hybridization in 4.times.SSC at 65.degree. C., and subsequent repeated washing in 0.1.times.SSC at 65.degree. C. for approximately 1 hour in total. The term "stringent hybridization conditions" that is used here may also mean hybridization at 68.degree. C. in 0.25 M sodium phosphate, pH 7.2, 7% SDS, 1 mM EDTA and 1% BSA for 16 hours, and subsequent washing twice with 2.times.SSC and 0.1% SDS at 68.degree. C. A hybridization preferably occurs under stringent conditions.

[0061] "Increase the induction capability of a haploid inductor" or "the increase in the induction capability of a haploid inductor" means that the haploid induction rate of a plant which has the property of a haploid inductor is increased. The number of fertilized seeds which have a haploid chromosome set and have arisen from a crossing of the haploid inductor with a plant of the same genus (preferably, of the same species) which does not have the property of a haploid inductor may thus be higher by at least 0.1%, 0.2%, 0.3%, 0.4%, 0.5%, 0.6%, 0.7%, 0.8%, 0.9%, or 1%--preferably, at least 1.5%, 2%, 2.5%, 3%, 3.5%, 4%, 4.5%, or 5%, and, especially preferably, at least 6%, 7%, 8%, 9%, 10%, 15%, 20%, 30%, or 50%--than the number of haploid fertilized seeds which is achieved without the use of the nucleic acid in the sense of the present invention, i.e., the haploid induction rate may be increased by at least 0.1%, 0.2%, 0.3%, 0.4%, 0.5%, 0.6%, 0.7%, 0.8%, 0.9%, or 1%--preferably, at least 1.5%, 2%, 2.5%, 3%, 3.5%, 4%, 4.5%, or 5%, and, especially preferably, at least 6%, 7%, 8%, 9%, 10%, 15%, 20%, 30%, or 50%--relative to the previously achieved haploid induction rate.

[0062] "Operatively linked" means connected in a common nucleic acid molecule in such a manner that the connected elements are positioned and oriented relative to one another such that a transcription of the nucleic acid molecule may occur. A DNA which is operatively linked with a promoter is under the transcriptional control of this promoter.

[0063] Plant "organs" mean, for example, leaves, shoots, stem, roots, vegetative buds, meristems, embryos, anthers, ovules, or fruits. Plant "parts" means a combination of multiple organs, e.g., a bloom or a seed, or a part of an organ, e.g., a cross-section from the shoot. Plant "tissues" are, for example, callus tissue, storage tissue, meristematic tissue, leaf tissue, stem tissue, root tissue, plant tumor tissue, or reproductive tissue. For example, what are to be understood by plant "cells" are, for example, isolated cells having a cell wall or aggregates thereof, or protoplasts.

[0064] In the sense of the invention, insofar as not otherwise indicated, a "plant" may be of any species from the dicotyledon, monocotyledon, and gymnosperm plants. Numbering among these are, for example, Hordeum vulgare, Sorghum bicolor, Secale cereale, Triticale, Saccharum officinarium, Zea mays, Setaria italic, Oryza sativa, Oryza minuta, Oryza australiensis, Oryza alta, Triticum aestivum, Triticum durum, Hordeum bulbosum, Brachypodium distachyon, Hordeum marinum, Aegilops tauschii, Beta vulgaris, Helianthus annuus, Daucus glochidiatus, Daucus pusillus, Daucus muricatus, Daucus carota, Eucalyptus grandis, Erythranthe guttata, Genlisea aurea, Gossypium sp., Musa sp., Avena sp., Nicotiana sylvestris, Nicotiana tabacum, Nicotiana tomentosiformis, Solanum lycopersicum, Solanum tuberosum, Coffea canephora, Vitis vinifera, Cucumis sativus, Morus notabilis, Arabidopsis thaliana, Arabidopsis lyrata, Arabidopsis arenosa, Crucihimalaya himalaica, Crucihimalaya wallichii, Cardamine flexuosa, Lepidium virginicum, Capsella bursa-pastoris, Olmarabidopsis pumila, Arabis hirsuta, Brassica napus, Brassica oleracea, Brassica rapa, Brassica juncacea, Brassica nigra, Raphanus sativus, Eruca vesicaria sativa, Citrus sinensis, Jatropha curcas, Glycine max, and Populus trichocarpa. A plant according to the invention is preferably a plant of the genus Zea--especially of the species Zea mays--or sorghum.

[0065] "Reducing the expression rate" or "reduction in the expression rate" or "suppression of the expression," "reduced expression rate," or a comparable phrase means a reduction in the expression rate of a nucleotide sequence by more than 10%, 15%, 20%, 25%, or 30%--preferably, by more than 40%, 45%, 50%, 55%, 60%, or 65%, and, especially preferably, by more than 70%, 75%, 80%, 85%, 90%, 92%, 94%, 96%, or 98%--in comparison to the specified reference. However, it may also mean that the expression rate of a nucleotide sequence is reduced by 100%. The reduction in the expression rate preferably leads to a change of the phenotype of a plant in which the expression rate is reduced. An altered phenotype may be the mediation of the property of a haploid inductor, or the increase in the induction capability of a haploid inductor.

[0066] "Reduction in the transcription rate" or "reduced transcription rate" or a comparable expression means a reduction in the transcription rate of a nucleotide sequence by more than 10%, 15%, 20%, 25%, or 30%--preferably by more than 40%, 45%, 50%, 55%, 60%, or 65%, and, especially preferably, by more than 70%, 75%, 80%, 85%, 90%, 92%, 94%, 96%, or 98%--in comparison to the specified reference. However, it may also mean that the expression rate of a nucleotide sequence is reduced by 100%. The reduction in the transcription rate preferably leads to a change of the phenotype of a plant in which the transcription rate is reduced. An altered phenotype may be the mediation of the property of a haploid inductor, or the increase in the induction capability of a haploid inductor.

[0067] In connection with the present invention, the term "regulatory sequence" relates to a nucleotide sequence which affects the specificity and/or the expression strength, e.g., in that the regulatory sequence mediates a defined tissue specificity. Such a regulatory sequence may be located upstream of the transcription initiation point of a minimal promoter, but also downstream of it, e.g., as in a transcribed, but untranslated, leader sequence or within an intron.

[0068] "Suitable for use as a haploid inductor" means that a plant is in the position to produce fertilized seeds which have a single (haploid) chromosome set from a cross with a plant of the same genus--preferably, of the same species--which does not have the property of a haploid inductor. The use a haploid inductor, specified as an absolute haploid induction rate, means that at least 0.1%, 0.2%, 0.3%, 0.4%, 0.5%, 0.6%, 0.7%, 0.8%, 0.9%, or 1%--preferably, at least 1.5%, 2%, 2.5%, 3%, 3.5%, 4%, 4.5%, or 5%, or, especially preferably, 6%, 7%, 8%, 9%, 10%, 11%, 12%, 13%, 14%, or 15%, or, with very particular preference, at least 20%, 25%, 30%, 35%, 40%, 45%, or 50% --of the fertilized seeds have a haploid chromosome set.

[0069] Designs and embodiments of the present invention are described, by way of example, with regard to the attached figures and sequences.

BRIEF DESCRIPTION OF THE DRAWINGS

[0070] FIG. 1: Genomic arrangement of the identified genes in comparison to B73 (AGPv02):

SNAREv 1 (GRMZM2G179789): increased expression in RWS pollen; SNAREv 2 (GRMZM2G412426): increased expression in RWS pollen; ITP (Inositol-1,4,5-triphosphate-5-phosphatase) (GRMZM2G106834): reduced expression in RWS pollen; PL (Patatin phospholipase) (GRMZM2G471240): polymorphisms in encoding sequence; MITO1 (Mitochondrial import receptor): present only in RWS; MITO2: Homolog to MITO1, but shortened. Present only in RWS; PGM (Phosphoglycerate mutase) (GRMZM2G062320): deleted in RWS; lncRNA: Homolog of PL: deleted in RWS; AC213048: anchor gene for comparison of the sequences; MT (RNA methyl transferase) (GRMZM2G347808): polymorphisms in the regulatory region. The GRMZM names relate to the annotation in AGPv02.

[0071] FIG. 2: RT-PCR of ripe pollen in inductor RWS and three non-inductor controls (NI1, NI2, NI3) across the genes SNAREv 1, RNA methyl transferase, and patatin phospholipase.

[0072] FIG. 3: RNASeq data of RWS pollen, projected onto artificial reference from AGPv02 with regions for SNARE and phospholipase loci substituted by RWS BAC's. (T1: Transcript 1. Homolog to SNARE2, but with altered exon-intron structure; T2: Homolog to SNARE1. Encoding for a protein of 131AA; T3: Homolog to SNARE1/2. RT-PCR fragment from FIG. 2).

QTL ANALYSIS AND IDENTIFICATION OF CANDIDATE GENES

[0073] In the maize haploid inductor RWS, which is to be ascribed to the inductor Stock6 (Coe, 1959), a main-QTL on chromosome 1 (bin 1.04) was identified and finely mapped. Based upon these works, the QTL in RWS should be verified and molecularly analyzed, in order to identify and functionally validate the underlying genes. A QTL mapping population from RWS.times.Control1 (maternal inductor.times.non-inductor) was tested for induction capability. It could thereby be shown that the known QTL is probably also present in the inductor RWS. However, it was further also achieved that a strong allele shift to the benefit of the non-RWS (Control 1) allele was discovered.

[0074] In order to molecularly describe the locus, various sequencing approaches to DNA and at the RNA level were selected. Due to structural differences between inductors and reference genome B73, only a small proportion of classical, reference-based sequencing approaches lead to success. Expanded and complicated bioinformatic analyses had the result that structural differences would then need to be reviewed via other technologies (FIG. 1).

[0075] Within the scope of a sequence capture approach, approximately three megabases around the identified QTL in three Stock6-derived inductors, as well as RWS and five non-inductor controls, were sequenced, and were analyzed on inductor-specific polymorphisms such as presence-absence variations, SNP's, and InDel's. Initially, 16 candidate genes were thereby identified, of which three genes were confirmed via post-sequencing and analysis of expression data: one gene that encodes for an anther-specific patatin phospholipase A2 which has an RWS inductor-specific haplotype; a phosphoglycerate mutase gene which is not present in the inductor RWS; and an RNA methyl transferase gene which has a mutation in a regulatory sequence (FIG. 2).

[0076] BAC banks were also developed for RWS, EMK (an additional inductor derived from Stock6), and Control 1 and screened with probes distributed over the identified QTL. For a target range of approximately 150 kB, which was mentioned by Dong et al. 2013 in inductor UH400 as possibly being inductor-relevant, BAC's of RWS, Control 1, and EMK were extracted and sequenced. The BAC sequences were annotated and compared with comprehensive transcriptome data which were created for RWS, Control 1, EMK, and B73.

[0077] As a result, the deletion in the inductor could here be confirmed. Accordingly, the examined maternal inductors lack a region of 100 kB between 68.26 and 68.36 MB (AGP Version 2 of the B73 reference sequence) on chromosome 1. Furthermore, an inversion in a gene-similar region and a large, repetitive sequence segment that is not comparable to the reference genome of B73 and to Control 1 appears outside of the target region in the inductors.

[0078] In spite of the deletion, the already identified phospholipase is still present in the inductors, but shows the aforementioned haplotype strongly deviating from the controls, and marked genetic variations in the promoter region. As a result of the deletion, the phosphoglycerate mutase that was already identified above is no longer present.

[0079] Furthermore, it is also achieved that a non-coding RNA (lncRNA) is identified in the 100 kb deletion. Like the phospholipase, it is pollen-specifically expressed and, moreover, shows a homology of 82% with the identified phospholipase. The sequence is inherently complementary, i.e., the lncRNA forms a hairpin structure. The very high expression rate, the significant homology with the phospholipase, and the low SNP density that was determined via Sanger sequencing indicate a regulatory function of this lncRNA for the phospholipase. Theoretically, an 88 amino acid-long, truncated version of the phospholipase protein could also be translated from this transcript.

[0080] In order to also be able to measure differences in the expression level of the identified gene from the region, in addition to measuring polymorphisms at the DNA level, RT-PCR and RNASeq experiments were implemented. In addition to RWP (a subline of RWS) as an inductor, three, genetically very different, control lines were used. From these plants, pollen was harvested, anthers without pollen, and embryos from 6-7 days after pollination by selfings or crossings. The phospholipase here showed a slightly increased expression in pollen from RWP. The methyl transferase shows a weak expression in the pollen of RWP and no expression in the pollen of the control. lncRNA is expressed and absent pollen-specifically, as also expected in RWP.

[0081] RNASeq was additionally applied to pollen of the same material in order to further verify the preceding results.

[0082] The transcriptome data (RNA-Seq at Pollen RNA of RWS) was projected on an artificial reference, in which the region of the QTL in B73 was replaced with RWS-BAC's. This analysis shows an expression of the phospholipase in pollen. The exon-intron structure of the gene corresponds to that of B73, but a deletion exists at the 5' end, which leads to a stop codon and therefore to a shortened protein. Furthermore, three additional RWS-specific transcripts were detected above and below the phospholipase. A region having two transcripts is located approximately 60 kb above the phospholipase. The first transcript is non-coding; the second encodes for a 192 amino acid-long protein that shows homologies with the mitochondrial import receptor (MITO1). In B73, this is situated only 15 megabases upstream of the QTL (GRMZM2G174696). Approximately 90 kilobases (kb) below the phospholipase is an additional transcript that in turn shows high homologies with the 192 amino acid-long transcript.

[0083] In order to also receive inductor-specific expression outside of the QTL, the RNASeq data were evaluated genome-wide. Unexpectedly, new candidate genes were identified outside of, but near, the finely-mapped region cited above, which probably could not previously be found due to the technical limitations of the SeqCapture approach. Approximately 400 kb upstream of the identified phospholipase from the finely-mapped region is a gene complex which, in pollen of RWP, is expressed distinctly differently (by at least a factor of 2) in comparison to the controls. This gene complex contains three genes: two genes annotated as SNAREv genes which have a high homology to one another and are over-expressed in RWP, and one gene that is annotated as inositol-1,4,5-trisphosphate-5-phosphatase and whose expression in RWP is reduced. Cloned transcripts of these genes distinctly deviate in part from the public annotation, such that they may also encode for proteins with deviating functions, or also may function as lncRNA's. A BAC made up of RWS could be isolated from this locus, and sequenced. This sequence was integrated into the artificial reference for re-analysis of the RNASeq data in AGPv02 (FIG. 3). In addition to a transposase, two RNA's (T1 (SEQ ID Nos: 55, 56, 57, and 63) and T3 (SEQ ID Nos: 60, 61, 62, and 65)) and an RNA with an ORF of 131 amino acids are expressed in this locus (T2 (SEQ ID Nos: 58, 59, and 64)). Except for the transposase, all transcripts are situated within or between the two SNAREv genes. Although they presumably have no SNARE function themselves, they could be involved in the regulation of homologous genes. The sequence capture data of this region show that there are distinct structural deviations between inductors, controls, and reference genome. The BAC sequencing confirms the absence of both inositol-1,4,5-trisphosphate-5-phosphatase gene at the genomic level in the inductor and the absence of an lncRNA from B73 that shares the transcription start with the inositol-1,4,5-trisphosphate-5-phosphatase, but is read from the counter-strand. The isolation of a cDNA from one of the SNARE genes (GRMZM2G179789) also indicates complex structural changes in the inductors, since one part of the cDNA corresponds to the plus strand and one part corresponds to the minus strand of the reference.

Gene Functionalities

[0084] Overall, seven genes could thus be identified which could be important for the in vivo haploid induction or the in vivo haploid induction capability in maize.

[0085] Among these four genes, which are of particular importance to pollen tube growth:

the two SNAREv genes encoding for proteins which are known to be involved in vesicle transport (literature). In the model plant Arabidopsis thaliana, SNAREv proteins have already been demonstrated at the tip of the pollen tube, where they are involved in the transport of phospholipids and pectins (literature). The over-expression of the SNAREv proteins that was observed in the examined maize inductors would lead to increased pollen tube growth.

[0086] That the phospholipase A2 also distinctly influences the pollen tube growth could be shown in the model plant Nicotiana tabacum. The inhibition of phospholipase A2 accordingly leads to a suppression of the pollen tube growth (Kim et al., 2011). In the examined maize inductors, the absence of the identified lncRNA having significant homology with the phospholipase may lead to a reduction in the expression or translation rate of the phospholipase gene, which would accelerate the growth speed of the pollen tube.

[0087] In a knockout mutant of inositol-polyphosphate-5-phosphatase in Arabidopsis thaliana, it appeared that the pollen tube grows uninhibitedly. In the examined maize inductors, the reduced expression level of the inositol-1,4,5-trisphosphate-5-phosphatase thus may likewise lead to an accelerated pollen tube growth. The identified lncRNA associated with inositol-1,4,5-trisphosphate-5-phosphatase could here have a regulatory effect on the expression rate.

[0088] The examined maize inductors thus show a modified regulation/expression rate of the four genes, in comparison to non-inductors. This disruption should lead to a markedly faster pollen tube growth, which is also promoted by a possibly increased energy metabolism, due to the expression of a mitochondrial transporter or its regulation. This could have the result of a decoupling of the transport of the generative cells in the pollen tube with its growth. As a result, an incomplete or incorrect pollination with subsequent chromosome elimination may occur.

[0089] It is known that active centromeres play a key role in chromosome distribution and are characterized and modified via chromatin modifications at the DNA or histone level--moreover, by transcription, RNA interactions, and RNA binding. A change in the regulation of the methyl transferase gene may influence the activity of the inductor centromere during the early embryogenesis, which ultimately leads to the elimination of the inductor genome in the early seed development stage.

[0090] In the examined inductors, it could be shown that the phosphoglycerate mutase gene is no longer present. The absence of the gene may negatively affect the energy metabolism of the pollen, and therefore have effects on the pollination. Moreover, the energy metabolism may be influenced by the mitochondrial membrane protein.

[0091] Any gene individually, or any combination of the genes, may be responsible for the effect of the haploid induction.

Creation of New In Vivo Haploid Inductors

[0092] In order to develop a new inductor in other crop types or maize non-inductor genotypes, or to increase the induction capability of an inductor genotype, the following is to be performed:

[0093] Identification of the corresponding genes in other crop types or maize non-inductor genotypes: In single-cotyledon plants such as maize, rice, wheat, rye, or barley, the pollen-specific patatin phospholipases are strongly conserved, and, therefore, homologs of these are easy to identify. In contrast to this, regulatory lncRNA's are absent in most single-cotyledon plants. However, in the event that they are present, they may likewise be discovered using significant homologies, just as they also occurred in the examined maize inductors. In double-cotyledon plants, other phospholipase types take on the corresponding tasks in the pollen tube growth. In order to identify these, RNA banks of pollen or pollen tubes are to be created and screened for the specific phospholipase of the present invention. A patatin phospholipase that is strongly expressed in pollen could already be identified via RNASeq of sunflower pollen (SEQ ID Nos: 46-48).

[0094] The SNAREv genes and the methyl transferase gene do not need to be pollen-specific. For example, one of the identified SNAREv genes (SNAREv 1) in maize is also not expressed in a pollen-specific manner. SNAREv 1 is not expressed at all in wild-type pollen. In annotated genomes, homologous genes may be identified via BLASTP and the functional region of a SNAREv protein. In unannotated genomes, RNASeq data would need to be annotated and selected for SNARE genes.

[0095] Homologous inositol-1,4,5-trisphosphate-5-phosphatases and phosphoglycerate mutases must be expressed in pollen, in order to be used as candidate genes. The identification may take place as above, via BLASTP and subsequent RT-PCR in pollen or via annotation of RNASeq data of pollen.

Manipulation of the Candidate Gene:

[0096] Possible inductors or an increased induction capability may be achieved via transgenic expression of the phospholipases and/or SNARE's and/or methyl transferase and/or phosphoglycerate mutases and/or lncRNA's and/or of the mitochondrial import receptor described above. For this, the corresponding genes--including their promoters--are to be cloned from the inductor line RWS. These genes may be cloned in a suitable transformation vector and be transformed in the desired plant.

[0097] The pollen-expressed inositol-1,4,5-trisphosphate-5-phosphatase may be additionally or exclusively reduced in their activity via RNAi, for example. For example, for this, hairpin constructs are to be produced, which then include a suitable promoter and terminator which allow a transcription of the hairpin construct before or at the point in time of the pollen formation. These hairpin constructs would be cloned in a suitable transformation vector and be transformed in the desired plant.

[0098] Alternatively or additionally, plants having mutations (for example, in the identified genes) that stabilize the phospholipase and/or SNARE's and/or methyl transferase, amplify the expression, or increase the activity may be generated via TILLING, transposon mutagenesis or other mutagenesis methods, or "genome editing." Structural analyses of secondary and tertiary structure of the mutated proteins may be helpful for this, which mutated proteins indicate denser structures, for example, and therefore fewer attack points for proteases. Moreover, the regions of the proteins that play a role in ubiquitin interactions may be considered. Mutants in the active center of the gene may be directly tested for their activity. For verification of the functionality of the phospholipase, various Tilling mutants have already been checked for induction capability. The exchange D74N (exchange of aspartate at Position 74 for asparagine) or G78R (exchange of glycine at Position 78 for arginine) lead to a maternal induction rate of 0.2-0.4%. In order to alternatively or additionally manipulate the inositol-1,4,5-triphosphate-5-phosphatase or the phosphoglycerate mutase, one must search for knockout mutants or for additional mutants that reduce the activity of the gene.

[0099] A Stock6-derived inductor may also be improved. This is possible via the above-described transgenic approach and via the introduction of mutations in the identified candidate genes. Insofar as they are expressed in pollen, it would additionally be possible to manipulate additional copies of the genes in the genome via transgenic or non-transgenic approaches.

[0100] Test of the induction capability: There are, for example, the following possibilities for testing the induction capability of a potential inductor:

1. Pollination of a line having a visual recessive marker (for example, glossy (Bordes et al., 1997) or liguleless (Sylvester et al., 1990), for maize). Descendants that express this feature are tested for haploidy via flow cytometry. 2. Pollination of a line that differs genetically from the inductor--optimally, via multiple markers. Use of these markers in order to identify homozygotic plants. These plants are tested for haploidy via flow cytometry.

[0101] Both possibilities were applied to test the induction capability.

REFERENCES

[0102] Barret, P., Brinkmann, M., & Beckert, M. (2008). A major locus expressed in the male gametophyte with incomplete penetrance is responsible for in situ gynogenesis in maize. Theoretical and Applied Genetics, 117(4), 581-594. [0103] Bordes, J., de Vaulx, R. D., Lapierre, A., & Pollacsek, M. (1997). Haplodiploidization of maize (Zea mays L) through induced gynogenesis assisted by glossy markers and its use in breeding. Agronomie, 17(5), 291-297. [0104] Chen, L., Tu, Z., Hussain, J., Cong, L., Yan, Y., Jin, L., . . . & He, G. (2010). Isolation and heterologous transformation analysis of a pollen-specific promoter from wheat (Triticum aestivum L.). Molecular Biology Reports, 37(2), 737-744. [0105] Chevalier, B. S., Kortemme, T., Chadsey, M. S., Baker, D., Monnat Jr, R. J., & Stoddard, B. L. (2002). Design, activity, and structure of a highly specific artificial endonuclease. Molecular Cell, 10(4), 895-905. [0106] Coe, E. H. (1959). A line of maize with high haploid frequency. American Naturalist, 381-382. [0107] Das, L., & Martienssen, R. (1995). Site-selected transposon mutagenesis at the hcf106 locus in maize. The Plant Cell Online, 7(3), 287-294. [0108] Deimling, S., Rober, F. K., Geiger, H. H. (1997). Methodik and Genetik der in-vivo-Haploideninduktion bei Mais. [Methods and genetics of in vivo haploid induction in maize] Presentation Pflanzenzuchtung, 38: 203-224. [0109] Depicker, A., Stachel, S., Dhaese, P., Zambryski, P., & Goodman, H. M. (1981). Nopaline synthase: transcript mapping and DNA sequence. Journal of Molecular and Applied Genetics, 1(6), 561-573. [0110] Dong, X., Xu, X., Li, L., Liu, C., Tian, X., Li, W., & Chen, S. (2014). Marker-assisted selection and evaluation of high oil in vivo haploid inducers in maize. Molecular Breeding, 1-12. [0111] Dong, X., Xu, X., Miao, J., Li, L., Zhang, D., Mi, X., . . . & Chen, S. (2013). Fine mapping of qhirl influencing in vivo haploid induction in maize. Theoretical and Applied Genetics, 126(7), 1713-1720. [0112] Fire, A., Xu, S., Montgomery, M. K., Kostas, S. A., Driver, S. E., & Mello, C. C. (1998). Potent and specific genetic interference by double-stranded RNA in Caenorhabditis elegans. Nature, 391(6669), 806-811. [0113] Gaj, T., Gersbach, C. A., & Barbas III, C. F. (2013). ZFN, TALEN, and CRISPR/Cas-based methods for genome engineering. Trends in Biotechnology, 31(7), 397-405. [0114] Gurr, S. J., & Rushton, P. J. (2005). Engineering plants with increased disease resistance: what are we going to express? Trends in Biotechnology, 23(6), 275-282. [0115] Kato, N., He, H., & Steger, A. P. (2010). A systems model of vesicle trafficking in Arabidopsis pollen tubes. Plant Physiology, 152(2), 590-601. [0116] Kim, H. J., Ok, S. H., Bahn, S. C., Jang, J., Oh, S. A., Park, S. K., . . . & Shin, J. S. (2011). Endoplasmic reticulum--and golgi-localized phospholipase A2 plays critical roles in Arabidopsis pollen development and germination. The Plant Cell Online, 23(1), 94-110. [0117] Lloyd, A., Plaisier, C. L., Carroll, D., & Drews, G. N. (2005). Targeted mutagenesis using zinc-finger nucleases in Arabidopsis. Proceedings of the National Academy of Sciences of the United States of America, 102(6), 2232-2237. [0118] McCarty, D. R., Mark Settles, A., Suzuki, M., Tan, B. C., Latshaw, S., Porch, T., . . . & Curtis Hannah, L. (2005). Steady-state transposon mutagenesis in inbred maize. The Plant Journal, 44(1), 52-61. [0119] Odell, J. T., Nagy, F., & Chua, N. H. (1985). Identification of DNA sequences required for activity of the cauliflower mosaic virus 35S promoter. [0120] Prigge, V., Xu, X., Li, L., Babu, R., Chen, S., Atlin, G. N., & Melchinger, A. E. (2012). New insights into the genetics of in vivo induction of maternal haploids, the backbone of doubled haploid technology in maize. Genetics, 190(2), 781-793. [0121] Ravi, M., & Chan, S. W. (2010). Haploid plants produced by centromere-mediated genome elimination. Nature, 464(7288), 615-618. [0122] Rober, F. K., Gordillo, G. A., & Geiger, H. H. (2005). In vivo haploid induction in maize-performance of new inducers and significance of doubled haploid lines in hybrid breeding. Maydica, 50(3/4), 275. [0123] Sambrook, J., Russell, D. W., & Russell, D. W. (2001). Molecular Cloning: A Laboratory Manual (3-volume set) (Vol. 999). Cold Spring Harbor, New York: Cold Spring Harbor Laboratory Press. [0124] Shibuya, K., Fukushima, S., & Takatsuji, H. (2009). RNA-directed DNA methylation induces transcriptional activation in plants. Proceedings of the National Academy of Sciences, 106(5), 1660-1665. [0125] Silva, G., Poirot, L., Galetto, R., Smith, J., Montoya, G., & Duchateau, P. (2011). Meganucleases and other tools for targeted genome engineering: perspectives and challenges for gene therapy. Current Gene Therapy, 11(1), 11. [0126] Sylvester, A. W., Cande, W. Z., & Freeling, M. (1990). Division and differentiation during normal and liguleless-1 maize leaf development. Development, 110(3), 985-1000. [0127] Till, B. J., Reynolds, S. H., Weil, C., Springer, N., Burtner, C., Young, K., . . . & Henikoff, S. (2004). Discovery of induced point mutations in maize genes by TILLING. BMC Plant Biology, 4(1), 12. [0128] Twell, D., Yamaguchi, J., Wing, R. A., Ushiba, J., & McCormick, S. (1991). Promoter analysis of genes that are coordinately expressed during pollen development reveals pollen-specific enhancer sequences and shared regulatory elements. Genes & Development, 5(3), 496-507. [0129] Venter, M. (2007). Synthetic promoters: genetic control through <i> cis</i> engineering. Trends in Plant Science, 12(3), 118-124. [0130] Wang, Y., Chu, Y. J., & Xue, H. W. (2012). Inositol polyphosphate 5-phosphatase-controlled Ins (1, 4, 5) P3/Ca2+is crucial for maintaining pollen dormancy and regulating early germination of pollen. Development, 139(12), 2221-2233. [0131] Zhao, Y., Zhao, Q., Ao, G., & Yu, J. (2006). Characterization and functional analysis of a pollen-specific gene st901 in Solanum tuberosum. Planta, 224(2), 405-412. [0132] WO/2010/079430 (Bonas et al.) Modular DNA-binding domains and methods of use. [0133] WO/2011/072246 (Regents of the University of Minnesota) TAL effector-mediated DNA modification. [0134] WO 2012/030893 (Monsanto Technology LLC) Molecular markers associated with haploid induction in Zea mays.

Sequence CWU 1

1

65147944DNAZea maysmisc_feature(12578)..(12677)n is a, c, g, or tmisc_feature(22322)..(22421)n is a, c, g, or tmisc_feature(42238)..(42337)n is a, c, g, or t 1atggggagca gtgaggagca tgttttttta gatcccacca gaatatgtgc atccgtgtca 60cttcttgctc atgatctcat tggccgaatg cttaatcgag aggtctcttc aaggcccaat 120gccaaagaag ttctccgtaa gttcaagcac ccttgtaact tgtgctttat atatatgatt 180ctcaatttat cattgacttt tcctaatggc tttcaacaca gggcaccatg ggtcttattc 240tacactgatt gcccgcagaa agctgaattc tctaacatat gggatactaa caaaactgca 300gctcccatga ttcatcggga gatagtcagg tttggttact gtgagtcttc atcttcaaaa 360tcctcaagtg acaactctga agagcgagat gaatgcggta tagttgatgc actggtgaca 420acaataacac aggtgaggat ctcagagccc aagaggagtc ggctgttcag cctacccaac 480gggttgttgc cgccaagcag gaacagtctc cgaacatgaa gatgatgaat ccgtgtgtgg 540ctttctaact tgacctacct agctcccatc cccatgcatg tataaacgac atttggggaa 600tgggtagaaa agcagagatt agggattttc gtttccgtcg gtgcagtttt ggtgttccaa 660tggagttgcg agatgtttat gtgccttagt cttcaatttg ggggttgggg gaaaagtaat 720tttatgtttt tgttttgtgt ctgcagattc ggaagatgga cttggaggca aggagcctac 780agcctagcat taaggctggt ttgcttgcaa agctgaggga gtataaatct gacctcaaca 840acgtcaagag tgagctcaag aggatatttg cgcccaatgc caggcaggct acccgggagg 900agctcctaga gtttggaatg gctgatactc tcgctgtgag ctaatgctag gacttgactg 960tgtctacgag actgctccta acaataaact gaagaaagca aaagaaatca ttcaacgtat 1020tcgccgaaga gaactctaca aggtagtatg atgctttaat tgctcatata caagtgtcat 1080tttgtcatgt cattacacat ggttaggata catacttaag tttctaacgt aggcgtccac 1140acaacggatt ggtgcacggt tctgccgatg tatcccacgc acgtgcatgg aaggaggcag 1200gcacccttcc ccgccgcccc ggatctcgcg ccagcccccg ccctaccccg cctgcccttc 1260cactcttccc ccgccgcccc cggtcaacgt cacgaacccg ggcctcgtgc cgctcgtcgt 1320ggccacactg ttcgacgagc gagtcataga gctgctgagc gtgctcgctg atgcggcggt 1380ggggcgacca ggcaggtggt ccatcggcga agcgccatgg tcgtcgtcgg ggggcacgaa 1440ccaggcggtg tacgcgcgcc gcgcgcccgg ctcttcatcg cctccacccg ctccagcgtc 1500tccaccactt ccttcatcga gggccgactg cttggctcgc tggccaggca gccgagcatt 1560agttgcgccg cttggaacgc ctgcttttgt tgatcgtttg ttttggtctg atttcagtgg 1620gtctatccgc agagaggaag aagcagaagc tctccgagat ccaatccggc gttgaggaag 1680ctgaatcgct ggtaaataga tgtcgcgacg cgttctgttt tggggatccc cttggctaac 1740gggacatacg acatttgggg aatgggtaga aaagcagaga ttagggattt ttcgtttccg 1800tcggtgcagt tttggtgttc caacagagtt gcgagatgtt tatgtgcctt agtcttcaat 1860ttgggggttg ggggaaaagt aattttatgt ttttgttttg tgtctgcaga ttcagaaaat 1920ggacctggag gcaaggagcc tacagcctag cattaaggct ggtttgcttg caaagccgag 1980ggattataaa tctgacctca acaacgtcaa gagtgagctc aagaggatat ctgcgcccaa 2040tgccaggtag gctacccggg aggagctcgt ggagtctaga atggctgata ctctcgcagt 2100gagctaatgc taggacttga ctgtgtctac gagactgctc ctaataataa actgaagaaa 2160gcaaaagaaa tcattcaacg tattcgccga agagaactct acaaggtagt atgatgcttt 2220aattgctcat atacaagtgt cattttgtca tgtcattaca catggttagg atacatactt 2280aagtttctaa cgtaggcatc cacacaatgg attggtgcac ggttctgccg atgtatccca 2340cgcacgcgca tggaaggagg caggcaccct tccctgccgc cccggatctc gcgccagcca 2400tcgccctacc ccgcctgccc ttccactctt ccccctgaaa gtcgcataga gggggggtga 2460atagggcgaa tctgaaattt acaaacttaa gcacaactac aagccgggtt aacgttagaa 2520atataaacga gtccgagaga gagggcgcaa aacaaatcat gagcaaataa agagtgagac 2580acgatgattt gttttaccga ggttcggttc ttgcaaacct actccccgtt gaggtggtca 2640caaagaccgg gtctctttca accctttccc tctctcaaac ggtcacttag accgagtgag 2700cttctcttct caatcaaacg gaacacaaag ttcccgcaag gaccaccaca caattggtgt 2760ctcttgcctt ggttacaatt gagtttgatc acaagaagaa tgagaaagaa aagaagcgat 2820ccaagcgcaa gagctcaaat gaacacaaat gtcgctctct ctagtcacta tttgatttgg 2880agtgattccg gacttgggag aggatttgat cttttggagt gtctagaatt gaatgctata 2940gctcttgtaa tatgttgaag gtgggaaact tggatgccat tgaatgtggg gtggttgggg 3000tatttatagc cccaaaacac caaaaaaggc cgttggaagg ctgctctcgc atggcgcacc 3060ggacagtccg gtgcgccagc cacgtcagca gaccgttggg gttcgaccgt tggagctctg 3120acttgtgggg cctctgggct gtccggtggt gcaccagaca ggtcctgtag gatgtctggt 3180gcgccaactg cacgtgctct gtcctctgcg cgcgcaggcg cgcattaaat gcgttgtagt 3240caaccgttgc gcgcgaagta gccattgctc tgctggcaca ccggacagtc cggtgaatta 3300tagcggagcg ccctctgatt ttcccgaagg tagcgagttc agcttcgagt gccctggtgc 3360accggacact gtccggtgcg ccaaaccagg gtgccttccg ggatgtcttt tgctctcttt 3420gtttgaaccc tttcttggtc tttttattgg cttattgtga acctttgaca cctgtaaaac 3480ttatagacta gagcaaacta gttagtccaa ttatttgtgt tggacaattc aaccaccaaa 3540atcaattagg aaataggtgt gagcctaatt ccctttcaat ctcccccttt ttggtgattg 3600atgccaacac aaaccaaagc aagtatagaa gtgcataatt gaactagttt gcataatgta 3660agtgcaaagg ttacttagaa ttgaaccaat aaatattttc ataagttatg catggattgt 3720ttctttattt tcatcatttt ggaccacgct tgcaccacat gttttgtttt tgcaaatcct 3780tttgtaaata gtcaaaggta aatgaataag attttgagaa gcattttcaa aatttgaaat 3840tttctccccc tgtttcaaat gcttttcctt tgacttaaac aaaactcccc cctcaaaaat 3900cctactcata gtgttcaaga gggttttaag atatcaattt tgaaaatgct actttctccc 3960ccttttgaat ataataagat atcaattgaa aaattcatca ttttaaaacc ttttgaaaat 4020gggtggtggt gcggtccttt tgctttgggc taatactttc tccccctttg gcatgaatcg 4080ccaaaaacga atacttgagt gaaatataag cccctttaac tactttctcc tgctttggcg 4140aacataatat gagtgaagat tataccaaag ttggagagtt gcttgaagcg atggtgaagg 4200atgagttatg gagtggaggt taagcctttg tcttcgccga agattccaat tccctttcaa 4260tacacctatg acttggttga aaatatactt gaaaacacat tagtcatagc acatgaaaga 4320gatatgatca aaggtatatt aatgagctat gtatgcaaga catcaaaaga aattcctaga 4380atcaagaata tttagctcgt gtctaagttt gttcatctag tggcttggta aagatatcag 4440ctaattgttc cttagtgtta atataggcaa tctcgatatc tccctttttt tggtgatccc 4500ttaggaaatg ataccgaatg gctatgtgtt tagtgcggct atgctcaacg ggattatccg 4560ccatgcggat tgcactctca ttatcacata gaagaggaac tttggttaat ttttaaccat 4620agtccctaag ggtttgcctc atccaaagta attgtgcgca acaatggcct gcggcaatat 4680actcggcttc ggcggtagaa agagctacgg aattttgctt ctttgaagcc caagacacca 4740gggaccttcc caagaactgg caagtcctcg atgtactctt tctattaatt ttacaccccg 4800cccaatcggc atccgaataa ccaatcaaaa tcaaaatgtg gatcccgtag gataccaaag 4860cccaaactta ggagtatgaa ctaaatatct caagattcgt tttacggccg taaggtgagc 4920ttccttaggg tcggcttgga atcttgcaca catgcatacg gaaaagcata atatccggtc 4980gagatgcaca taaatagagt aaagagccta tcatcgaccg gtataccttt ttgatcgacg 5040gatttacctc ccgtgtcgag gtcgagatgc ccattggttc ccatgggtgt cttgatgggt 5100ttggcatcct tcatcccata cttgtttaga atgtcttgaa tgtacttcgt ttggctaatg 5160aaggtgccct cttagcgttg cttcacttga aatcacaaga agtacttcaa ctcccccatc 5220atagacatct cgaatttctg tgtcatgatc ctactaaatt cctcacatgt agattcatta 5280gtagacccaa atataatatc atcaacataa atttggcata caaacaaatc attgtcaaga 5340gttttagtaa ataaagtagg atcggccttt ccgactttga aaccattagt gataaggaaa 5400tctctaaggc attcatacca tgctcttggg gcttgcttga gcccataaag cgccttagag 5460agtttatata cgtgattagg gtactcacta tcttcaaagc cgggaggttg ctcaacatag 5520acctcttcct tgattggtcc gttgaggaag gcacttttca cgtccatttg ataaagcttg 5580aagccatggt aagtagcata ggcaagtaat atgcgaattg actcaagcct agctacgggt 5640gcataggttt caccaaaatc caaaccttcg acttgtgaat aacccttggc cacaagtcgg 5700gctttgttcc ttgtcaccac accatgctca tcttgtttgt tgcggaagac ccacttggtt 5760cctacaacat tttggttagg acgtggaact aaatgtcata cctcattcct agtgaagttg 5820ttgagctcct cttgcatcgc caccacccaa tccgaatctt gaagtgcttc ctctaaccta 5880tgtggctcaa tagaggaaac aaaagagtaa tgttcacaaa aatgagcaac ccgagatcta 5940gtagttaccc ccttatgaat gtcgccgagg atggtgtcga cggggtggtc tcgttggatt 6000gcttggtgga ctcttgggtg tggcgggcgt tgctcgtcct ccttgtcttg atcatttgca 6060tctccccctt gatctatgcc gtcatctaga ggtggctcat ttgattgatc ttcttcttca 6120tcaacttgag cttcatcctc attttgagtc ggtggagatg cttgcatgga ggaggacggt 6180tgatcttgtg tatttggagg ctcttcggat tccttaggac acacatcccc aatggacatg 6240ttccttagcg cgatgcatgg agcctcttca tcacctatct catcaagatc aacttgctct 6300acttgagagc cgttagtttc atcaaacaca acgtcacatg aggcttcaac tagtccagtg 6360gacttgttaa agaccctata tgcccttgtg tttgagtcat aaccaagtaa aaagccttct 6420acagtcttag gagcaaattt agattttcta cctcttttaa caagaataaa gcatttgcta 6480ccaaaaactc taaaatatga aatattgggc tttttaccgg tgaggagttc gtatgatgtc 6540ttcttgagga ttcggtgtag atataaccgg ttgatggcgt agcaagcggt gttgactgcc 6600tcggcccaaa accgatccga agttttgtac tcatcaagca tggttcttgc catgtccaat 6660agagttcgat tcttcctctc cactacacca ttttgttgtg gcgtgtaggg tgaagagaac 6720tcatgcttga tgccctcctc ctcaagaaag ccttcgattt gagagttctt gaactccgtc 6780ccgttgtcgc ttctaatttt cttgatcctt aagccgaact cattttgagc ccgtcttaag 6840aatcccttta aggtctcttg ggtttgagat ttttcctgta aaaagaatac ccaagtgaag 6900cgagaataat catccacaat aactagacag tacttactcc cgccgatgct tatgtaagcg 6960atcgggccga atagatccat gtgtaggagc tccagtggcc tgtcggtcgt cattatgttc 7020ttatgcggat gatgagagcc aacttgcttc cctgcttggc atgcgctaca aatcctgtct 7080ttctcaaaat aaacatttgt caatcctaaa atgtgttctc cctttagaag cttgtgaaga 7140ttcttcatcc caacatgtgc aagtcggcga tgccagagcc aacccatgtt agtcttagca 7200attaagcaag tgtcgagttc agctctatcg aaatctacca agtatagctg accctctaac 7260actcccttaa atgctattga atcatcactt cttctaaaga cagtgacacc aacatcagta 7320aaaagacagt tgtagcccat ttgacacaat tgggatacag aaagcaaatt gtaatctaaa 7380gaatctacaa gaaaaacatt ggaaatagaa tggtcaggag atatagcaat tttacccaat 7440cctttgacca aaccttgatt tccatcccca aatgtgatcg ctctttgggg atcttggttt 7500ttctcatatg aggagaacat ctttttctcc ccagtcatgt ggtttgtgca cccgctgtcg 7560agtatccagc ttgagccccc ggatgcataa acctacaaaa taattttagt tcttgatttt 7620aggtacccaa atggttttgg gtcctttggc attagacaca agaactttgg gtacccaaac 7680acaagtcttg gagcccttgt gcttgccccc aacatatttg gcaactacct tgccggattt 7740gttagtcaac acataagatg catcaaaagt tttgaatgaa atgtcatgat catttgatgc 7800actaggagtt ttctttctag gcaacttggc acgggttggt tgcctagagc tagatgtctc 7860acccttatac ataaaagcat aattaggacc agagtgagac ttcctagaat gaattctcct 7920aattttgttc tcgggataac cggcagggta taaaatgtaa ccctcgttat cctgaggcat 7980gggagccttg cccttaacaa agttggacaa tcttttagga ggggcactaa ttttgacatt 8040gtttcccctt tggaagccaa tgccatcttt aatgcccggg cgtctcccat tataaagcat 8100gccacgagca aatttaaatt tctcattttc taagttgtgc tcggcaattt tagcatctag 8160ttttgctata tgatcatttt gttgtttaat taaggtcata tgatcatgaa tagcattaac 8220atcaacatct ctacatctag tacaaataga tacatgctca acagtagatg tagagggttt 8280gcaagaatta agttcaacaa tcttagcatg aagaatatca tttttatccc taagatcgga 8340aattgtagtt ttgcaaacat caaaatcttt agccttagca attaaatttt catttttctg 8400ttctaaggct agcaagagaa atgtttaatt cttcaatcct agcaagcaaa tcatcattat 8460tatctttagg attgggaatt gaaacattac aaacatgtga atcaacctta gcatttaaac 8520tagtattttc atgtctaagg ttgtcaatca tctcatggca agtgcttagc tcactagata 8580gtttttgaca tttttctact tctagggcgt aagcattttt aaccttaaca tgtttcttgt 8640tttccttaat aagacaatcc tcttgggaat ccaaaaggtc atctttttca tgaatagcac 8700taattaattc atttaatttt tccttttgtt ccatgttaag attagcaaaa agggtacgca 8760agttatcctc ctcatcacta gcattttcat cactagaggt ttcatattta gtggaggatc 8820ttgattttac cttcttcctt ttgccgtcct ttgccatgag gcacttgtgg ccgacgttgg 8880ggaagaggag tcccttggtg acggcgatgt tggcggcgtc ctcgtcgtcg gaggagtcgc 8940ttgagctttc gtcggaatcc cactcccgac aaacatgggc atcgccgccc ttcttcttgt 9000agtacttctt cttctccttt cttctcccct tcttgtcgtc gccacggtca ctgtcactag 9060atatgggaca tttagcaata aaatgaccgg gcttaccaca tttgtagcaa accttcttgg 9120agcgggactt gtagtctttc cccctccttt gtttgaggat ttggcggaag ctcttgatga 9180cgagcgccat ttcctcattg tcgagcttgg aggcgtctat tggttgtcga cttggtgtag 9240cctcctcctt cttttcttcc gttgccttga atgcgacggg ttgagcttcg gatgtggtgg 9300gttcgtcaag ctcattgatc tttctcgagc cttcgatcat gcactcaaaa cttacaaaat 9360gcccgataac ttcctcgggg gtcattttag tatatctagg attaccacga attaattgaa 9420cttgagtagg gttaaggaaa ataagtgatc ttagaataac cttaaccacc tcgtggtcgt 9480cccactttat gctcccgagg ttgcgcactt ggttcaccaa ggtcttgagc cggttgtaca 9540tgtgttgtgg ctcctcccct ttgcgaagcc gaaaccgacc gagctccccc tcgatcgttt 9600cccgcttggt gatctttgtg agctcgtctc cctcgtgcgc ggttttgagc acatcccaaa 9660cctccttggc gctcttcaac ccttgaactt tgttatactc ctctctactt aaagaggcga 9720ggagtattgt tgttgcttga gagttgaagt gctcgatttg ggccacctca tcctcatcat 9780aatccttatc ccctacggat ggtacctgtg caccaaactc aacaacatcc catatacttt 9840tgtggagtga ggttagatga aatcgcatta aatcactcca cctagcataa tcttcaccat 9900caaaagttgg tggtttgcct aacgggacgg aaagtaaagg tgcatgttta gaaatgcgag 9960ggtagtgtag gggaatctta ctaaacttct tacgctcttg gcgtttagaa gttacggagg 10020gcgcgtcgga gccggaggtt gatgttgatg aagtgtcggt ctcgtagtag accactttcc 10080tcatcctctt ttgtttgtcc ccactccgat gaggcttgtg ggaagaagat ttttccttct 10140tctctttgtg gtgagaagaa gatttcttct ccttcccttt gttggaggag ctcttcttct 10200tctccctccg tttggtgcgg gactcttcca atgaagtgct ctcgttgctt gtagtgggct 10260tttcgccggt ctccatctcc ttcttggcgt gatctcccga catcacttcg agcggttagg 10320ctctaacgaa gcaccgggct ctgataccaa ttgatagtcg cctagagggg gggtgaatag 10380ggcgaaactg aaatttacaa atataaacac aactacaagc cgggttagcg ttagtaatga 10440agaaacgagt ccgcgagaga gggcgcaaaa caaatcgcaa gcaaatgaag agtgtgacac 10500gtggatttgt tttaccgagg ttcggttctc gcaaacctac tccccgttga ggaggccaca 10560aaggcccggg tctctttcaa cccttccctc tctcaaacgg tccctcggac cgagtgagct 10620tctcttctct aatcaaagtt gggaacaaaa cttcccaaca agggccacca cacaattggt 10680gcctcttgcc ttgattacaa tgggtttttg atcacaagaa caagtgcgaa agaaaagaag 10740caatccaagc gcaagagctc aaaaagaaca cggcaaatct ctctctctaa tcactaaagc 10800cttttgtgga attggagagg atttgatctc ttttggtgtg tctagaattg aatgctagag 10860ctcttgtagt agttgagaag tggaaaactt ggatgcaatg aatggtgggg tggttggggt 10920atttatagcc ccaaccacca aacttgaccg ttggctgggt tgtctgttcg atggcgcacc 10980ggacagtccg gtgcacaccg gacagtccgg tgcccctgcc acgtcatcac tgccgttgga 11040ttctagccgt tgaagcttct gacttgtggg cccgcctggg tgtccggtgc acaccggaca 11100tctactgttc cttgtccggt gtgccggagt gggcgcgcct gacatctgcg cgcgcagagc 11160gcgcattaaa tgcgcggcag agagccgttg gcgcggaaat agccgttgct ctcgagtcgc 11220accggacagt ccggtgcaca ccggacagtc cggtgaatta tagtggacgg gccgatggct 11280tttcccgaag ctggcgagtt cctgaggccg acctcccttg gcgcaccgga cactgtccgg 11340tgtacaccgg acagtccggt gaattatagc ggagtcgcct ctggcaattc tcgaaggggg 11400cgagttggag cttgagtcct ctggtgcacc ggacgctgtc cggtgtacac cggacagtcc 11460ggtgctctca gaccagaggg ccttcggttc ccactatgct cctttgttga atccaaaaac 11520ttggtccttt tattggctga gtgtgaacct tttacacctg tgtaatctat aaacttgtgc 11580aaacttagtt agtccaattg tttgtgttgg gcaattcaac caccaaaatt aattagggac 11640taggtgtaag cctaattccc tttcagtttt cccgggcggt catccataga acaggtcctt 11700acggagaggc actcgagaaa ccgctcgagc ccccttgaag accacaagca caacatcata 11760ataagagaag ggaaaacagc gtatcataga taatctcatc atgttcattg attagagtta 11820agcaatagca taaagctaaa cagtaataat ccaacccaaa taggtgaaca aggacatgga 11880taacaaaagc tagtcaatcc ttaggcataa atgtgtaaag cgggaggtga attaaataat 11940gaataggaca tagataggtc aagggacact tgcctccacc aaccgactgc tgctcagggg 12000cttctcctgc gggttcctcg ggctcttcaa ccggatcgtt ctctatgcga gcgcaaacat 12060acacacatcc acatatttaa taccaaagaa cagtacacca tacaatagaa tgcaataagt 12120aaacagacgt tccacgcggg ctcgcgagta cggttaagag agaaagagga aaagacagtc 12180gagaaacgat cacgttgcat gattataaat tagccactag cttaatggaa ggaaatttaa 12240tgtagacact atgtttagcg taaagtaaag tcatgtttca tgtctaatta ttataagcag 12300gtggagacaa ataaaaggat agccgcgcgg cgagacgcgc gacaaagctc tctaaaacaa 12360attaagaagt taacgactcg tcgcgcgact gagcacgcag cgagacactt cgccttagtt 12420aagaggagac gttaagcgtc gcgcgacgaa gcgcacgacg gcatacgtcg actaaactga 12480gtccaaagtg gaacgtcgcg tcaattccca cgcggcgtta caccttaaac aacctgaaac 12540aaaatgaacg aatcaagcct gatcatccgc cccccccnnn nnnnnnnnnn nnnnnnnnnn 12600nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 12660nnnnnnnnnn nnnnnnnacc gttcaacaaa caccgcccgg ggttccaacc gaccaggctc 12720cagatcgccc gaccgatccc ccgtcccggc ctcgcccccc cccttcttcg ctcgcccgcg 12780ctcgcggtga ttgctcgtta agaagcgtgt tgcgtcgcgc gcttcgccgc gcgacggatt 12840tatctaaaat tcagattcta tcctgtgttg cgtcgtgcgt ttcgtcgcgc gacgatccat 12900tttgtttcag gttgtttaag gtgtaacgcc gcgtgtgtat tcacgcgacg ttccactttg 12960gactcagttt agtcgacgta tgccgtcgtg cgcttcgtcg cgcgacgctt aacgtctcct 13020cttaactaag gcgaagtgtc tcgctgcgtg ctcagtcgcg cgacgagtcg ttaacttctt 13080aatttgtttt agagagcttt gtcgcgcgtc tcgccgcgcg gctatccttt tatttgtctc 13140cacctgctta taataattag atatgaaaca tgactttact ttacgctaaa catagtgtct 13200acattaaatt tccttccatt aagctagtgg ctaatttata atcatgcaac gtgatcgttt 13260ctcgactgtc ttttcctctt tctctcttaa ccgtactcgc gagcccgcgt ggaacgtctg 13320tttacttatt gcattctatt gtatggtgta ctgttctttg gtattaaata tgtggatgtg 13380tgtatgtttg cgctcgcata gagaacgatc cggttgaaga gcccgaggaa cccgcaggag 13440aagcccctga gcagcagtcg gttggtggag gcaagtgtcc cttgacctat ctatgtccta 13500ttcattattt aattcacctc ccgctttaca catttatgcc taaggattga ctagcttttg 13560ttatccatgt ccttgttcac ctatttgggt tggattatta ctgtttagct ttatgctatt 13620gcttaactct aatcaatgaa catgatgaga ttatctatga tacgctgttt tcccttctct 13680tattatgatg ttgtgcttgt ggtcttcaag ggggctcgag cggtttctcg agtgcctctc 13740cgtaaggacc tgttctatgg atgaccgccc gggaaaacag tgcaaccatg agggtggaat 13800ggggtgccct tagctgaata attagaggat ccggggtgta gttcacttag ccgtcgtgcc 13860gtcaatgggg ctcggtgtat gcggctcgct ctgccaagtt tgggttcgcc ccttggggag 13920gagtgcggtg catttaggaa acctaacggg tggctacagt cccggggaat ctttgtaaag 13980gctatgtagt gatgccctgc tgggtcacct tggtagtgat caatggagag tcatgatctc 14040cgggtagaat gggaatcacg gcttgtgggt aaagtgcaca acctctgcag agtgtttgaa 14100aactgatata tcagccgtgc tcacggttat gagcggccaa gggagctcca gtgattagtg 14160gtacttgatc agagatactt tggtacaggt ggttatgaga tcgatgattc tggttatgac 14220tatgatgctg gtaagtggta ctctttccgt ttggaaagga gtacgtttgg gttaataact 14280tgggttaatg ctaaaacttg gctttctatt agtaaataat aatctgacca actaaaagca 14340actgcttgac ttatccccac ataaagctag tccactacag ccaaacagga tacttgctga 14400gtatgttgat gtgtactcac ccttgctcta cacaccaaac ccccccccca tccccaggtt 14460gtcagcattg caaccactgc tcagtcgaag atgaagctgt ggaaggagac ttccaggagt 14520tccaagatta cgatgagttc taggtgtggg ttagcggcaa cccccagtcg gctgcctgtg 14580aaggccgcgg ttatctacgt ttcttttccg cactttgatt tattgtaaga actatatgga 14640cgtctcagac gtatgatgta atcgactatt tcccttagta atactatttt gagcactgtg 14700tgatgatgtc catgttatgt aactgctgtg tacgtgaata actgatcctg gcacgtacat 14760ggttcgcatt cggtttgcct tctaaaaccg ggtgtgacat aagtggtatc aaagccgtgc 14820tgactgtagg accgctaacc tagaatagaa tggtcgctct aaggactata gacctctgtc 14880tctgccttga ctttgatatc

ccttcaaaag ttggtcatac cgaccaaacc tatgttctac 14940tatatattat accttgctga aaatcatgtt ttattccagt ccttcattta cttatgattc 15000attatttgct ggtcatatta attctgttct cacctttttg cttgcgatgt cttttgtaga 15060tggctcgact tagacacact gcacgaaagt cagtcatccc cttcttaccc tcccgccttg 15120ctgagcgtcc gcttcgccgt cccgtggccg gacagtccag ccacttggag agactacacc 15180accgcctgcg tgaggagcag gagcgtcgac gacaggagca gcagagctct tccttctcgc 15240tccaccagga gatagagtct gtgaggagct gctcccctgt gcttcctctg gagccgcccc 15300ctccaccacc actgggcgcc ccagcttctg gagtagctgc tggaggagac ccagacgatg 15360gagatggcga cgacagctcg agccacgaca ccgacttctc tgctaacctt gagccggaag 15420gatgggttac tcgacccatc actcgcgacg ctgctcgcgg gtgtcacttc cacgatgcgc 15480tcgacaccct gctacgtcgg gcatttaacc agcatacttg gtctgtcgag tatcgctgtg 15540tggtctacca gcacagtcgc ggggtctacc cggaccgctg ggaggcaacc tgcttggtgc 15600gctgcccgga gaacagtctc cagggtgcgg aggcctgctc agagcactat tctatctctg 15660agcgggactc agctgaggca gccatgcaag atgctgcacg gcgtgcgctt tcgcactact 15720gctcggtttt cggtggggca gctgacggtc ttgacctgaa gtattacccc cgccgtccat 15780ctggcagcac aggaggcgtg attgtctcac ctgtcggtga gggcaatcct aggttgagca 15840gcacagtcaa cctagccgcc gtgctaaaca cggagctgga ccatgcatta gacgagctga 15900gtagggctcg tgctgagatc gccctgctgc gggctgagcg cgcggaacgt cgtcacctgg 15960atggtggttc ccccgctccc gtcgggactc agcacccgta ccgctcacct cagcgtggac 16020accagtctta tggcaatccc gcctgcaaga ccaagataac tctagaacca tatatcgtta 16080gagttggatc ttgtaattaa tacgaaatat atacatagaa gcttcagtct tagcgttagt 16140ctcggtctta gttagtctta gttaaacagg gtagtttgct atatcctgtg catttatgtt 16200tgtcatgatg aactatgttt ggtttggatc tttgtaatga ttgtcaccag agtgtgggta 16260ccccctgcat tttggtttac ctattatgtt aatagagtta gttatatagt tgggaaacct 16320tttattccac tctcctcttt atctgagaaa ctgtgtggtc tgtgttggag atcagtgaag 16380atgctcatct gttcagtgct gttgaagaat tctattctct tttcttatgc tgcaagattt 16440gccagatcag tcctgatgtg tggttgcatt ctgcagatgt cagagaacag gcgcagagga 16500ggaaggcgtg ctcagcagga gcgagccgct caacaggagg aggtgcccca gcagcagcac 16560ctgccgcccc cgcccccgat gtccatcgag cagatgtttc tgatgcagac tcaggcagtt 16620caagccatcg gtcagactct ggccgccatt cagcagcagc agcagcagca ggccccacct 16680caacctcaga tgcctcagat gcccagagac aagcgtgctg aattcatgag aggtcatcca 16740ccaacgttcg ctcattcttc tgaccctatg gatgctgaag attggctgcg cactgtggag 16800cgggagttgc ataccgctca gtgcgatgac agggagaaag tcctgtatgg tccccgtctg 16860ttgagaggag cagcccagtc atggtgggag tcttacctcg ccacccatgc ccatcctgac 16920gccatcacct gggaagagtt cagaggtagc tttcgtcagt accatgttcc tgcaggtctg 16980atgacagtga agaaggagga gttcctggcc ctcaaggaag ggccattgtc tgtcagtgag 17040taccgagaca ggtttctgca attgtctcgc tatgctcctg aagatgtcaa caccgacgcc 17100aagcgacagt accgtttcct gagaggcttg gttgaccctc tgcagtacca actgatgaat 17160cacaccttcc cgacattcca gcacctgatt gacagagcaa tcatgacaga aggaagcgta 17220aggagatgga agatcgtaag cgcaagatca gtggacccca gcctggaagc agcaatcgtc 17280ctcgtttctc aggcaatcaa cctcagcagt tcaggcagaa ccagcgtcca cctcagcagc 17340agcagcaatt ccaaaggcag tatcctcagc accagtacca gaaccgtcag agcaatcagt 17400caggaggtca gtttcagagg cagaatcagc aagcacctcg tcttcctgcc ccagcaaatc 17460agcagaacag tcaggcagca ccagctcagg ttggaaacag agcatgtttc cactgtggag 17520agcaaggcca ctgggtgatg caatgtccga agaaggcagc ccagcagcag tcaggcccca 17580atgccccagc gaagcagaat gtgcctcagc ctggagcagg caatcgctct cagccgcgct 17640ataatcatgg aaggctgaac cacttggagg ctgaagcagt gcaggagacc cccggcatga 17700tagtaggtat gttcccagtc gactcccata ttgcagaagt gttatttgat actggagcaa 17760cgcattcttt cattactgca tcatgggtag aagcacataa ccttccaatt actaccatgt 17820caacccccat tcaaattgac tcagccggtg gtagaattcg agccgatagc atttgtttga 17880atataagtgt ggaaataagg gggatagcgt ttcccgccaa ccttatagta atgggtactc 17940aggcaataga tgtcatccta gggatgaatt ggctagataa gtatcaggca gttatcagtt 18000gtgataaaag gaccatcaag ttggtgtccc cactaggaga ggaagtggtg accgagttag 18060tcccgcctga gccaaagaaa ggaagttgtt atcagatagc tgttgatagc agtgaagcag 18120acccaatcga gaggatcaag gttgtgtccg agttcccaga tgtgtttcca aaggacttac 18180cgggtatgcc accagagcgg aaagttgagt ttgctataga gcttcttccc ggaaccgccc 18240ctatctttaa gagagcttac agaatatctg gaccagagtt ggttgagctt aagaagcaga 18300ttgatgagct gtcagagaaa ggttacattc ggccaagcac ctcgccttgg gccgcccctg 18360tcctatttgt ggagaagaaa gatggcacca agaggatgtg tatcgattat cgagctttga 18420atgaagtcac gatcaagaac aagtatccct tgcccagaat agaagatttg ttcgaccagt 18480tgagaggagc cagtgtgttc tccaagattg atctgaggtc aggttatcat cagctcagga 18540tccgaccttc ggacattccg aagacggcat tcatttccaa gtatggtttg tatgagttca 18600cagtgatgtc ttttggtttg accaatgcgc cagcgttctt catgaacttg atgaacagtg 18660tattcatgga ttatctcgat aagtttgtgg tggtattcat tgatgacatt ctggtttatt 18720ctcaaagcga agaagagcac gcagatcatt tgaggttggt attgcagaga ttgcgagagc 18780atcagttgta tgcaaagttg agcaagtgtg agttctggat cagtgaggtc ctgttcttgg 18840gtcacataat caacaaagaa ggattggttg tggatccgaa gaaagtggca gacattttga 18900actggaaagc gccaacagat gctagaggaa tcaagagttt cattggaatg gccggatact 18960atcggcgatt cattgaaggg ttttcgaaga tcgcaaaacc aatgacagcg ttgctaggca 19020acaaagttga gttcaagtgg acccagaaat gtcaagaggc ctttgaagcg ctgaaagaga 19080agttgactac agcgcctgtc ctagtcttgc ctgatgtgca caagcccttc tcagtgtatt 19140gtgatgcttg ttacacaggt ttgggatgtg tgttgatgca agagggaaga gttgtggctt 19200actcgtcccg acaactgaag gttcatgaga agaattaccc aatccatgat ctagagttgg 19260cagcagtggt tcacgcactg aagtcatgga ggcactatct gtatggacag aaatgcgatg 19320tttacacaga tcacaagagt ctgaagtaca tattcactca gtcagagttg aacatgaggc 19380aacgaagatg gttagagttg atcaaagatt atgagttgga gattcattac catccaggca 19440aagcaaacgt agtggcagat gctttgagca gaaagagtca agtcaatctg atggtcgctc 19500gtccgatgcc ttatgagttg gccaaagagt ttgacaagtt gagtctcggt tttctgaata 19560attcgcgagg agtcaaagtt gagttggaac ctaccttgga gcgcgaaatc aaagaagcgc 19620agaagaatga tgagaaaatc agcgagatcc ggcgactgat tctagatggc aaaggcaaag 19680aatttcgaga agatgcagaa ggcgtgatat ggttcaaaga ccgcttgtgt gttcctaatg 19740tccagtctat tcgggagttg attctcaagg aagctcatga gacgtcctat tcgattcacc 19800ctggcagtga gaagatgtat caggatctga aaaagaaatt ctggtggtac ggaatgaaga 19860gggagatcgc agagcatgtg gctaggtgcg atagttgccg aagaattaag gcagagcacc 19920agagacctgc tggattgttg caaccattgc agatccctca gtggaaatgg gacgaaatcg 19980gtatggattt catagtcgga ttgcctcgca ctcgagccgg ctacgattcc atctgggtag 20040tagtggaccg cttgaccaag tcagcccact tcatacctgt caagaccaac tacagcagtg 20100cagtattggc agaattgtat atgtctcgga tcatttgtct tcatggtgtg ccaaagaaga 20160tagtgtcaga cagaggaacg cagttcacct ctcatttctg gcagcagttg catgaagctt 20220tgggcacgca tctgaatttc agttcagctt atcatccaca gacagatggc cagaccgaaa 20280ggaccaacca aattcttgaa gatatgttga gagcctgtgc gttgcaagat cagtccggat 20340gggataagag attgccttat gcagagtttt cctgtaacaa cagttaccag gccagcttga 20400agatgtcacc atttcaggcg ctctatggaa ggagttgtag aactccgttg caatgggatc 20460agcctggaga aaagcaagtg tttgggccag acattttgct tgaagccgaa gagaacatca 20520agatggtccg agagaatctg aagatagcgc aatcgaggca gcgaagctat gcagacacaa 20580gaagaagaga gctgagtttc gaagtcggag actttgtcta tctgaaagtg tcaccaatca 20640gaggagtcag aaggttcgga gtgaaaggca agctagcacc ccgctacatt ggtccgtatc 20700agattcttgc aagacgtgga gaagtggcct atcagctcag tttgccagag aatttgtctg 20760ctgtgcatga tgtctttcat gtgtctcagt tgaagaagtg cttgcgtgtg ccagaagagc 20820agttgccagt agaaggtctt gaagtccagg aggacttgac ctacgttgag aagccagtgc 20880aaatccttga ggttgcagac cgagtcacct ggaggaagac catcagaatg tgcaaagtca 20940gatgggatca tcactctgag gaagaagcaa cctgggagcg tgaagatgat ctgatggcca 21000agtaccctga gctctttgct agccaaccct gaatctcgag ggcgagattc ttttaagggg 21060gataggtttg taacgccctg aatttggggg tagaattttt cttcttttct ctcaccaaat 21120tcgggcgtta ctctcttttc tctttccccg tttgctcctt cttcccaatt tcaaaccagt 21180atagcggcag gtgtccgtgt catgtataaa ccaaaaccta agtgtcatgg gtgttgcatc 21240atgccgaagc acatttcttt gtctgatgtt gagtgttcgt ctcgttccgt tccggatttc 21300ggttcgcgat ttaattccgt ttagtggtcc gcgctcgtcg cgggttttcg atccgcgaag 21360tggcccgacc catcccaacc tagtccagcc cagcccagcc ggcccggccc gccccggcct 21420gcgcgcccct ggcgcccaaa cccccccatg cgccccccct cctctctctc tctcatttgg 21480atctcccgcg caacaacctc tcctctccct cttccacctc tctctccccg tggtgcccta 21540ggatttggag acggcgatca ccggattttg gaccccgagg tgagctcccc tcccctcccc 21600ttctcttctc tctctctctc cctcctcttc ttctccccac gcgcgccccc cttctccccc 21660tgctcacgcg cgcccctgcc cgcccccgct caccggcggc gcggcgcccc ccgcccctgc 21720ccggccccgc gcggcggcgc ccgccctccc ccggccgcgg cggcgctcgt ccgccctcac 21780ccccggccgc ggcgcccgcc cctccccctc gcgcgcggag gcggcgcccg ccctcaccac 21840gacccgcagc gaccccgccc cgtccccgcc tctccccccg cgcgcggcgg cgcccgcccc 21900ccgcccctcc ccgcggcggc gctcgcccgc ccgcccctgc ccctccccgc ggcggcgccc 21960gcccgcccct gctcgcgcgc agcgcggccc cggcgcgccc ccggcatggc ccggcgcggc 22020cccggtggcc cctgctcgcc ggcgcgaccc cggcgtggcc cccagccccc ggcgcgtccc 22080cggcgcggcc ccggccggct cggccgcccc tggccggttc aaccgccccc cctggccggt 22140tcaaccgccc tggccccagc tcgcccgccc gttcccccgt cccggcctcg cccgaccgta 22200tcttcgctcg cccgcgctcg cggtgattgc tcgttaatta gcgtgttgcg tcgcacgcta 22260cgccgcgcga cggatttatc taaaattcag attctatcct gtgatacgta gtacaactga 22320cnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 22380nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn ntgggtcggc gctgagatca 22440gcttgattcg tttttggtta tacatgacac ggacacctgc cgctatactg gtttgaaatt 22500gggaagaagg agcaaacggg gaaagagaaa agagagtaac gcccgaattt ggtgagagaa 22560aagaagaaaa aattctaccc ccaaattcag ggcgttacat caggctacaa aggatgccaa 22620tggtattgct gctctctata ttgttcttgt tctaatgtaa aaactacaac acaactcttt 22680acttgatccc agaaattcct tctgcctcaa atggagacaa tgacgagtgg tcataagtac 22740agagattgca gacaaggtaa attttgcaat agaaataact aaccaaccat tagtgcttga 22800aaaaaactgg actggtgact ggggcacgtg gtttcatcaa catttggacc tcaacggtct 22860aatcagtata acttagaagt tggctagctc ttgaaaaaca ctgcatgaca ctaagcattt 22920gtttattttc agctgcttac acccctatga tttcaagtaa ctacttgtct acttgtgata 22980atcacctgaa tatgattatt tgaaatgctt atcatgtctc gtcaattgca tttcttttat 23040gtgtacctga agtctgctct tgcttcctaa tagagttcgt tttttaatac agaaaccact 23100ctgagatagc cacaatatag taaaagtggc agctaaggta ctaaaaacac ccatgcaaat 23160aagaaaaaaa tgaatcttgt attttaattt tgttaaatac ctctatagtt tggcgatata 23220ttatgttacc atcctgctta tagcctgtag gtcattttat atgagccatc aaattgcgat 23280gacagttgcc acaaatccag tttcatatga aggtattagc tgtgtaacaa gctaattgtt 23340gctctctgcc caataagtta ttcaattgga ttagtaggtt gcatccaagg ttattcaatt 23400ggatcagtag gttgcatcca aggtatactg ctgctctctg cccaataagt tattcaattc 23460gatcagtagg ttgcatgttc ccttcatttt attaaaaaat acataataat ataataagta 23520cttgtttgtt ctaaaaataa tacttctgta aatgaggata ttaattttcc ttttggtaat 23580aatgcaggtt gatgatactg aagtcatcag ttttttgctg caaactgaaa taattcctct 23640gtgcttgcga accatggaga tgggtagtga gctatccaaa attgtatgta gctagccaaa 23700tattctcatt caaatatcat aatttatctc ttctgcttaa tactggcaaa ggtgtaatag 23760tttttttagt attgatttgt cacctgaagt ttatcttgtg cactactact ttgccatcat 23820cagttatctc tagaatactc ttgtcctgta ccattttctc tctgataagc ctaaatttgt 23880acaattcata agcctaaaag gtgacttata taatatatac aaggaccctc aagagttgtt 23940tggcaattca gtgactgtcc tgggtcctgt tttggggagc ttctggtagc ttttgcttct 24000ccaaaagaaa agctagaagc tccccccaaa cagagcagct tcttcaagcc ggtaaaagct 24060tcaaaagctg taattatact aaaaacagtg aagctccctc agagcagctt cccagctctc 24120taggagatgc ttttggagaa gctacagttt ccccaaacag ggccctgctc tgttgaaccc 24180cccccttcct gatacatatt tgaatatgag tttatagtgt gtgtgggggt gtaagtaggg 24240gggtaatggg ttctaaattt tatactataa aaattaagga tcggattaga attgagctct 24300atttctattc atttttgaac taaaattaat taagggctca aatgaattat gaagaagcat 24360taggatcatg atccattacc acccctacgt gtaagatgtt ttttggtggt tgtggttgat 24420tttgaatttt aaggccgcat atgtctcatg gaccacacaa gctcatattc atctacattt 24480gtagccgtca ctaacttagc caaatatgca tatgtggcgg ctagcaacag gtccttggtt 24540tcttgggtta tttattctct ttttatcgtg tttgaatgtt ttcgtgttca tttgcataac 24600atcttaggtc tacattagta tatgaattga gatcaaatgt gaattggacc acacaagctc 24660atattcatct acatttgtag tcgtcactaa cttagccaaa tatgcatatg tccgcttctg 24720atttcattgt gtcttttctt caggagtttg gggatcaagg agaggactcc attatcttgt 24780caccgcgact gaaggagatt agtactcctg accgccccac tgccctccgt ttcctaggta 24840cacgcataac agccattggt atgaatacat gttttatacg tgaatggagt tccagtttaa 24900tttaaagatt caagttcact acaacaagat tttacagtac tgagcccatt tgactttcct 24960tgagaaatag tgaaagggaa ttaggcttac acctagttcc taaataattt tggtggttga 25020attgcccaac acaaataatt ggactaacta gtttgctcta gtgtacaagt tatacaggtg 25080ccaaggttca caacaagcca attaaaaaga ccaaagttgg gttcaaaata gagagccaaa 25140ggcatcccga aaggctccct ggtttggcgc accggactgt ccggtggcgc accggacagt 25200gtccggtgca ccaggggacc tcgcgcagaa ctcctcagcc tcgggaattt ttcggagccg 25260ccgcgctata attcaccgga ctgtccggtg tacaccggac agtgtccggt gctccaagaa 25320aacgcggctc cggaacttgg cagcctcagg aaatcagaac ggctgctccg ctataattca 25380ccagacatgt ccggtgtaca ccggactgtc cggtgcaact gcggagcaac ggctacttcg 25440cgccaacggt cacctgcagg cgcattcaat gcgcgccaga agcgcgcaga agtcaggcac 25500acccatgctg gcgcaccgga cactctacag tacatgtccg gtgcgccacc ggacatcaag 25560gcgggcccag aagacagaac tccaacggtc aaattccaac gactttggtg acgtggctgg 25620cgcaccggac tgtccggtgc accatatgac agacagcctc caccaacggt catgtttggt 25680ggttggggct ataaataccc caaccacccc accattcatt gcatccaagt tttccagctt 25740ccaaccacta tataagagct agcattcatt gcaaagcaca ccaaaagaga tcaagtcctc 25800tcccaactcc acacaaagcc ttagtgatta gagagagtga tttgtagtgt tcatttgagc 25860tcttgcgctt ggatcgcttc ttttctttgg cattctttct tgtgatcaaa cactcacttg 25920taattgaggc aagagacacc aattgtgtgg tggtccttgc gggaagtttg attcccaagt 25980gatttgagaa gagaagctca ctcggtccaa gggaccgttt gagagaggga agggttgaaa 26040gagacccggc ctttgtggcc tcctcaacgg ggagtaggtt tgagagaacc gaacctcggt 26100aaaacaaatc cacgcgtctc acttcattat tcgcttgcga tttgttttca cgccctctct 26160cggactcgtt cttatttcta acgctaaccc ggcttgtagt tgtgtttata tttgtaaatt 26220tcagtttcgc cctattcacc ccccctctag gcgactatca attggtatca gagcccggtg 26280cttcattaga gcctaaccgc tcgaagtgat gtcgggagat cacgccaaga aggagatgga 26340gaccggcgaa aagcccacta caagccacgg gagcacttca tcggaagagt cccgcaccaa 26400gaggaaggag aagaaagact cctccaaagg gaaggagaag aagaaggact cctccaaagg 26460aaaggagaag aaatcttctt cacacaaaga aaagaaggag aagtcttcct cccacgagcc 26520gcaacggagt ggggacaaga aaaagaggat gaggaaagtg gtctactacg agaccgattc 26580ttcatcgaca tccacctctg gctccgacgc ggcgtccgtc acttctaaac gccaagagcg 26640taagaagtat agtaagattc ccctacgcta ccctcgcatt tctaaacata cacctttact 26700ttccgtccca ttaggcaaac caccaacttt tgatggtgaa gattatgcta ggtggagtga 26760tttaatgcga tttcatctaa cctcactcca caaaagtata tgggatgttg ttgagtttgg 26820tgcacatgta ccatccgtag gggatgaaga ctatgatgag gatgaggtga cccaaatcga 26880gcacttcaac tcccaagcca caaccatact cctcgcctct ctaagtagag aggaatacaa 26940caaggtgcaa gggttgaaga atgcgaaaga aatttgggat ctactcaaga ccgcgcacga 27000gggtgatgaa ctcaccaaga ttaccaagca ggaaacgatc gagggggagc tcagtcgctt 27060ccgtcttcgc caaggggagg agccacaaga tatgtacaac cggctcaaaa ccttggtgaa 27120ccaagtgcgc aacctcggga gcaagaaatg ggatgaccac gaggtggtta aggttattct 27180tagatcactc atcttcctta accccactca agttcaatta attcgtggta atcctagata 27240tacactaatg acccccgagg aagttattgg gaattttgtg agctttgaat gtatgatcaa 27300gggctcaaag aagatcaacg agcttgatga tccctccacg tccgaagcac aaccggtggc 27360tttcaaggcg acggaggaga agaaggagga gtctacacca agtagacaac caattgacgc 27420ttcaaagctc gacaacgagg agatggcttt aatcatcaaa agctttcgcc aaatcctcaa 27480gcaacggaag gggaaggatt acaaatcccg ttcaaagaaa gtttgctaca agtgtggtaa 27540gcccggtcac tttattgcta aatgtccatt atcaagtgac agtgacaggg ataatgacaa 27600gaagggcaag aggagagaaa agaagaggta ccacaagaag aggggcggtg atgcccacgt 27660atgccgcgag tgggactcca acgagagctc caccgactcc tccgacgacg aggacgtcgc 27720caacatcgcc gacaccaagg gactcctctt ccccaacgtc ggccacaagt gcctcatggc 27780aaaggacggc aaaaacaaga aggataaatc taaatcctcc actagatatg aatcctctag 27840tgatgaaaat gttagtgatg aggaagataa cttgcgatct ctttttgcca acctcaacat 27900gcaacaaaaa gagaaactta atgaattgat tagtgtcatt catgaaaagg atgatctctt 27960ggacacccaa gaggacttcc ttattaaaga aaataagaag catgttaagg ttaaaaatgc 28020ttatgctcta aaagtagaaa aatgtgaaaa attgtctagt gagctaagca cttgccatga 28080gactataaac aaccttagaa atgagaatgc taatttgtta gctaaggttg attctcatat 28140ttgtaatgtt tcaagttcca atcctagaga taataatgat gatttatttg ctaggattaa 28200agatttgaac atttcacttg ctagccttag aaatgaaaat gaaaaattgc ttgctaaggc 28260taaagatttt gatgtttgca atgttactat ttctaacctt agaagtgaaa acgacatatt 28320acatgctaag gttgtagaat taaaatcttg caaacctcct acatctatag ttgagcatgt 28380atctatttgt actagatgta gagatattga tgttgatgct attcatgatc acatgacttt 28440aattaaacaa caaaatgatc atatagcaaa actagatgct aaaattgccg agcataactt 28500agaaaatgaa aaatttaaat ttgctagaag tatgctctat agtgggagac gccctggcat 28560caaggatggc attggcttcc aaaggggaga caatgtcaaa cttaatgccc ctcctaaaag 28620attatctaat tttgtaaagg gcaaggctcc catgcctcag gataacgagg gttacatttt 28680gtaccctgcc ggttatcccg agagcaaaat taggaggatt cactctagga agtctcactc 28740tggccctaac catgctttca tgtacaaggg tgagacatct agctctaggc aaccaaccca 28800tgttaagttg cctaagaaga aaactcctag tgcatcaaat gaacatagca tttcatttaa 28860gacttttgat gcatcttatg ttttgactaa caaatccggc aaagtagttg ccaagtttgt 28920tgggggcaaa cacaagggct ccaagacttg tgtttgggta cccaaagttc ttgtttctaa 28980tgccaaagga cccaaaaccg tttgggtacc taaagtcaag aactaaaatt gttttgtagg 29040tttatgcatc cggaggctca agttggatac tcgacagcgg gtgcacaaac catatgacag 29100gggagaagaa gatgttctcc tcctacgaga aaaaccagga tccccaacga gctatcacat 29160tcggggatgg aaatcaaggt ttggtcaaag gtcttggtaa aatagctata tctcctgacc 29220attctatttc caatgttttt cttgtagatt cattagatta caatttgctt tctgtatctc 29280aattatgcaa aatgggctac aactgtcttt tcactgatat aggtgtcact gtctttagaa 29340gaagtgatga ttcaatagca tttaagggag tgttggaggg tcagctatac ttagtagatt 29400ttgatagagc tgaactcgac acttgcttaa ttgctaagac taacatgggc tggctctggc 29460atcgccgact agcacatgtt gggatgaaga atcttcataa gcttctaaag ggagagcaca 29520ttttaggatt aaccaatgtt cattttgaga atgacagggt ttgtagcgca tgccaggcag 29580gaaagcaagt tggagcccat catccacaca agaacatcat gacgaccgac aggccgcttg 29640agctactcca catggatcta ttcggcccga ttgcttacct aagcatcggc gggagtaagt 29700attgtcttgt gatagtggat gattattctc gcttcacttg ggtgttcttt ttgcaggaaa 29760aatctcaaac ccaagagacc ttaaaaggat tcttgagacg ggctcaaaat gagttcgcct 29820taaggatcaa gaaaataaga agcgacaacg gaacggagtt caagaactct caaattgaag 29880gcttccttga ggaggagggc atcaagcatg agttctcttc tccctacacg tcacaacaaa 29940atggtgtagt agagaggaag

aatcgaactc tattggacat ggcaagaacc atgcttgatg 30000agtacaagac tttggatcgg ttttgggctg aggcggtcaa caccgcctgc tacgccatca 30060accggttata tctacaccga atcctcaaga agacatctta tgaactccta accggtaaaa 30120agcccaatat ttcatatttt agagtctttg gtagcaaatg ttttattctt gttaaaagag 30180gtagaaaatc taaatttgct cctaagactg tagaaggctt tttactagga tatgattcaa 30240acacaagggc atatagagtc tttaacaagt ccactggaca agttgaagtt tcttgtgacg 30300ttgtgtttga tgagactaac ggctctcaag tagagcaagt tgatcttgat gaaataggta 30360atgaagaggc tccatgcatc gcgctaagga acatgtccat tggggatgtg tgtcctaagg 30420aatccgaaga gcctccaaat gcacaagatc aactatcctc ctccacgcaa gcatctccac 30480cgactcaaaa tgaggatgaa gctcaagttg atgaagtaga agatcaagca aatgagacac 30540ctcaagatga cgacaatgat caagggggag atgcaaatga tcaagacaag gaggatgaag 30600agcataggcc gccacaccca agagtccacc aagcaatcca acgagatcac cccgtcgaca 30660ccatcctcgg cgacattcat aagggggtaa ctactagatc tcgtattgca catttttgtg 30720agcattactc ttttgtttcc tctattgagc cacacagggt agaggaagca ctccaagatt 30780cggattgggt ggtggcgatg caagaggagc tcaacaactt cactaggaat gaggtatggc 30840atttagttcc acgtcctaat caaaatgttg taggaaccaa atgggtcttc cgcaacaagc 30900aagatgagca tggtgtggtg acaaggaaca aagctcgact tgtggccaaa ggatactccc 30960aagtcgaagg tttggatttc ggtgaaacct atgcacccgt agctaggctt gagtcaattc 31020gtatattatt ggcctatgat acttaccatg gctttaagct ttatcaaatg gacgtgaaaa 31080gtgccttcct caatggacca atcaaggaag aggtctatgt tgagcaacct cccggctttg 31140aagacagtga gtaccctaac catgtctata agctctctaa ggcgctttat gggctcaagc 31200aagccccaag agcatggtat gaatgcctta gagatttcct tattgctaat ggcttcaaag 31260tcggaaaagc cgatcctaca ctctttacta aaactcttga aaatgacttg tttatatgcc 31320aaatttatgt tgatgatatt atatttggat ctactaacga gtccacttgt gaagagttta 31380gtaggatcat gacacagaaa ttcgagatgt ctatgatggg ggagttgaag tattttctag 31440gattccaagt caagcaactc caagagggca ccttcattag ccaaacaaaa tacactcaag 31500atattctaag caagtttgga atgaaggatg ccaagcccat caagacaccc atgggaacta 31560atgggcatct cgacctcgac acgggaggta agtccgtgga tcaaaagcta taccggtcga 31620tgataggttc tttactctat ttatgtgcat ctcgaccgga cattatgctt tccgtatgca 31680tgtgtgcaag attccaagcc gaccctaagg aagcccacct tacggccgta aaacgaatct 31740tgagatatct ggcttatact cctaagtttg ggctttggta tcctagggga tccacatttg 31800atttgattgg ttattcggat gccgattggg cagggtgcaa aatcaatagg aagagcacat 31860ccgggacttg ccagttcttg ggaagatcct tgggtgtctt gggcttcaaa gaagcaaaat 31920tcggtcgctc tttccaccgc cgaagccgag tatattgccc gcaggccact gttgcgcgca 31980actgctttgg atgaggcaaa ccctgcggga ctatggttac aaactaacca aggtcccttt 32040gctatgtgat aatgagagtg caatcaaaat ggtcgacaat cccgtcgagc atagccgcac 32100taagcacata gccattcggt atcacttttt gagggatcac caacaaaagg gagatatcga 32160gatttcatac attaatacta acgatcaatt agctgatatc tttaccaagc ctcttgatga 32220acaatctttt aacaaactta ggcatgagct caatattctt gattctagga acttcttttg 32280ttaaattgca cacattgttc ttttatatac ctttgatcat atctctttta tatgctatga 32340ctaatgtgtt ttcaagtcta tttcaaacca agtcataggt atattgaaag ggaattggag 32400tcttcggcga agacaaaggc ttccactccg tacctcatcc ttcgccatca cttcaagcaa 32460ctctccgttc tcgggggaga taagcatgag catcaaagaa aaggactttg ggggagaaat 32520gagcccaaag ccaaaggacc ggacttcgtc tttggtataa tcttaactca tttatttatg 32580accaaaaggg aaaatagcac ttcgagggct ctaatgattc cgtttttggc gattcatgcc 32640aaaaaggggg agaaatgagc ccaaagcaaa aggaccgcac caccaccaat ttcaaaaact 32700tagtgttgaa tatttttcaa tttgtatctt attttcaatt ggtatcttat tgtgttcaaa 32760agggggagaa agtagtattt taaaatgata tatcaaaaac cctcttgaat actaagagga 32820ggatctcttt tagggggagt tttgtttaag tcaaaggaaa agcatttgaa acagggggag 32880aaaatttcaa atcttgagaa tgctttgcaa aaatcctatt catttacctt tgactatttg 32940caaaagaact ttgaaaagga tttacaaaat aatttgcaaa aacaaaactc gtggtgcaag 33000cgtggtccaa aatgttatat aaagaaagaa acaatccatg catatcttgt aagtattcat 33060attggctcaa ttccaagcaa cctttacact tacattatgc aaactagttc aattatacac 33120ttctatattt gctttggttt gtgttggcat caatcaccaa aaagggggag attgaaaggg 33180aattaggctt acacctagtc cctaattaat tttggtggtt gaattgccca acacaaacaa 33240ttggactaac taagtttgca caagtttata gattacacag gtgtaaaagg ttcacactca 33300gccaataaaa ggaccaagtt tttggattca acaaaggagc atagtgggaa ccgaaggccc 33360tctggtctga gagcaccgga ctgtccggtg tacaccggac agtgtccggt gcaccagagg 33420actcaagctc caactcgccc ccttcgggaa ttgccagagg cgactccgct ataattcacc 33480ggactgtccg gtgtacaccg gacagtgtcc ggtgcgccaa gggaggtcgg cctcaggaac 33540tcgccagctt cgggaaaagc catcggcccg tccactataa ttcaccggac tgtctggtgt 33600gcaccggact gtccggtgcg actcgagagc aacggctatt tccgcgccaa cggctctctg 33660ccgcgcattt aatgcgcgct ctgcgcgcgc agatgtcagg cgcgcccact ccggcacacc 33720ggacaaggaa cagtagatgt ccggtgtgca ccggacaccc aggcgggccc acaagtcaga 33780agcttcaacg gctagaatcc aacggcagtg atgacgtggc aggggcaccg gactgtccgg 33840tgtgcaccgg actgtccggt gcgtcatcga acagacaacc cagccaacgg tcaagtttgg 33900tggttggggc tataaatacc cccaaccacc ccaccattca ttgcatccaa gttttccact 33960tctcaactac tacaagagct ctagcattca attctagaca caccaaaaga gatcaaatcc 34020tctccaattc cacaaaaggc tttagtgatt agagagagag atttgccgtg ttctttttga 34080gctcttgcgc ttggattgct tcttttcttt cgcacttgtt cttgtgatca aaaacccatt 34140gtaatcaagg caagaggcac caattgtgtg gtggcccttg ttgggaagtt ttgttcccaa 34200ctttgattag agaagagaag ctcactcggt ccgagggacc gtttgagaga gggaagggtt 34260gaaagagacc cggcctttgt ggcctcctca acggggagta ggtttgcgag aaccgaacct 34320cggtaaaaca aatccacgtg tcacactctt catttgcttg cgatttgttt tgcgccctct 34380ctcgcggact cgtttcttca ttactaacgc taacccggct tgtagttgtg tttatatttg 34440taaatttcag tttcgcccta ttcacccccc ctctaggcga ctatcaaaaa cagtgcaacc 34500atgagggtgg aatggggtgc ccttagctga ataattagag gatccggggt gtagttcact 34560tagccatcgt gccgtcaatg gggctcggtg tatgcggctc gctctgccaa gtttgggttc 34620gccccttggg gaggagtgcg gtgcatttag gaaacctaac gggtggctac agtcccgggg 34680aatctttgta aaggctacgt agtgatgccc tgctgggtca ccttggtagt gatcaatgga 34740gagtcatgat ctccgggcag aatgggaatc acggcttgtg ggtaaagtgc acaacctctg 34800cagagtgttt gaaaactgat atatcagccg tgctcacggt tatgagcagc caagggagct 34860ccagtgatta gtggtacttg atcagagata ctttggtaca ggtggttatg agatcgatga 34920ttctggttat gactatgatg ctggtaagtg gtactctttc cgtttggaaa ggagtacgtt 34980tgggttaata acttgggtta atgctaaaac ttggctttct attagtaaat aataatctga 35040ccaactaaaa gcaactgctt gacttatccc cacataaagc tagtccacta cagccaaaca 35100ggatacttgc tgagtatgtt gatgtgtact cacccttgct ctacacacca aacccccccc 35160ccatccccag gttgtcagca ttgcaaccac tgctcagtcg aagatgaagc tgtggaagga 35220gacttccagg agttccaaga ttacgacgag ttctaggtgt gggttagcgg caacccccag 35280tcggctgcct gtgaaggccg cggttatcta cgtttctttt ccgcactttg atttattgta 35340agaactatat ggacgtctca gacgtatgat gtaatcgact atttccctta gtaatactat 35400tttgagcact gtgtgatgat gtccatgtta tgtaactgct gtgtacgtga ataactgatc 35460ctggcacgta catggttcgc attcggtttg ccttctaaaa ccgggtgtga cacctgatta 35520ctctcaagca aagcctatag gtagtttaag aggttgagta caatgagaaa catttcaatc 35580attatttgca aaagaaacat tttgatcata ttaaggaaaa tcataggagt gaaagaaaaa 35640caatgtgtgc aaataactga acctcctgca gctccatcat gctggccaaa gtatgcttcg 35700acgaattcaa aatatcatca tatgcttgct ctatctgctt atcatgattg caaccttgtt 35760cggatgggtt cctggcttcc tgcaacagca cagtagaaaa caaactggag tttagaattc 35820aacttgagac ttcagagttg aaacaaaacc tgtattcaca tatgtagcac cgatgcatat 35880acaaatacct ttatcagaaa catgaagata tgtgaataca ttattttctc aataaaagtc 35940atgatagaat tcatgcaaat ttttttagaa ataaataaat gatgaggcat actacaattc 36000taaaagcagt agcagtgcaa cacgaacgaa cattcaaatt gccccacatt cttgaaactg 36060tgctgctttg ctctcgttcc aaaaaaactc cgctaaagta aaacttggaa aggtctggtt 36120ttgcgtgagc caccaacacc aaccaaactt tacgtcacta tgacagtttc agcctttcgg 36180tcccggcgac agccatggcg gacgcggggg tgacaggggg tgctggccaa gctgggtgag 36240ctgacggagg aggaggcgac gacgctgctg cgcgtggacg ccgagatacg ggcattgtgg 36300cagaagctgg cctacctgca ggcgctcgta cgcggggccg gccgccagcg ccgcgaccgc 36360gcaagcgagc tgctcctgct ctggctacgc gagaccagag aggttgcttt cgcgggtggt 36420tctgccatac atagcggcga tggctcttcc tcccaggact accatatcta cacataacaa 36480tcatgctctc atgaagttgt gatgtaatag gtcacacgat tttaatgtat aagattgtga 36540agagtaaatt aattcaaatg aattcatgga catgggacaa ctatgtgtta aaaacagaat 36600ctcctatgta tcctaaccat gtgtaatgac atgacaaaat gacacttgta tatgagcaat 36660taaagcatca tactaccttg tagagttctc ttcggcgaat acgttgaatg atttcttttg 36720ctttcttcag tttattgtta ggagcagtct cgtagacaca gtcaagtcct agcattagct 36780cactgcgaga gtatcagcca ttccagactc caggagctcc tcccgggtag cctgcctggc 36840aatgggcgca gatatcctct tgagctcact cttaatgcta ggttgtaggc tccttgcttc 36900caggtccatc ttccgaatct gcagacacaa aacaaaaaca taaaattact tttcccccaa 36960cccccaaatt gaagactaag gcacataaac atctcgcaac tccgttggaa caccaaaact 37020gcaccgacgg aaacgaaaaa tccctaatct ctgcttttct acccattccc caaatgtcgt 37080atgtcccgtt agccaagggg atccccaaaa cagaacgtgt cacgacatct atttaccagc 37140gattcagctt cctcaacgcc ggattggatc tcagagagct tctgcttctt cctctctgcg 37200gacagaccca ccgaaatcag accaaaacaa acggtcaaca aaagaaggct ttccaagcgg 37260cgctactgac gctcggctgc ctggccggcg agcctcgcag tcggccctcg atgaaggagg 37320tggtggagac gctggagcgg gtggaggcga tgaagagccg ggcacgcggc gcgcgtacac 37380cgcccggttc ctgccccccc ccccccccga cgacgaccat ggcgcttcgc cgatggccca 37440cttgcctggt cgccccgccg tcgcgtcagc gagcgcgctc agcagctccg tgacccgctc 37500gtcgaaccgc gtggccacgg cgagcggcac gaggcctggg ttcgtgacgt tgaccggggg 37560gcggcggggg tgaaagggaa ttaggctcac acctatttcc taattgattt tggtggttga 37620attgtctaac acaaataatt ggactaacta gtttgctcta gtctataagt tttacaggtg 37680ccaaaggttc ataataagcc aataaaaaga ccaagaaagg gttcaaacaa aaagagcaaa 37740agacatcccg gaaggcaccc tggtctggcg caccggactg tccggtgtgc caccggacag 37800tgtccggtgc accagggcac tcgaagctga actcgctacc ttcgggaaaa tcagagggcg 37860ctccgctata attcaccaga ctgtccggtg aagcaccgga ctgtccggtg tgccagcgga 37920gcaacggcta cttcgcgcgc aacggtcgac tgcaacgcat tcaatgcgcg cctgcgcgcg 37980cagagggcag agcactcaca gttggcgcac cggacagtct acaggacctg tccggtgcac 38040caccggacag cccagaggcc ccacaagtca gagctccaac gatcgaaccc caacgatctg 38100ctgacgtggc tggcgcaccg gactgtccgg tgcgccatgc gaccgcagcc ttccaacggc 38160catttttggt ggtttagggc tataaatacc ccaaccaccc cacattcaat ggcatccaag 38220tttcccacct tcaacacatt acaagagcta taacattcaa ttctagacac tccaaaagat 38280caaatcctct cccaagtccg gaatcactcc aaatcaaata gtgactagag agagcgacat 38340ttgtgttcat ttgagctctt gcgcttggat cgcttctttt ctttctcatt cttcttgtga 38400tcaaactcaa ttgtaaccaa ggcaagagac accaattgtg tggtggtcct tgcaggaact 38460ttgtgttccg tttgattgag aagagaagct cactcggtct aagtgaccgt ttgagagagg 38520gaaagggttg aaagagaccc ggtctttgtg accacctcaa tggggagtag gtttgcaaga 38580accgaacctc ggtaaaacaa atcatcgtgt ctcgctcttt atatttctaa cgttaacccg 38640gcttgtagtt gtgcttaagt ttgtaaattt cagattcgcc ctattcaccc cccctctagg 38700cgactttcaa ttggtatcgg agccggtgct tcattagagc ctaactgctc gaagtgatgt 38760cgggagcatc cgccatgagg gatctcggga ccggcgacaa gaccgcatgc tcgggaagaa 38820ctcactcaag ggagtccgcc cacaagcata aggaggaatc gtcttcctcc atcaagtccc 38880atcggatggg tgacaaaaag aagaagatga ggaaggtggt ctactacgag accgactctt 38940cgtcaccctc cacctccggc tcggaatcgg cctccaccac ttcaaagcgc catgagcgca 39000agaagtatag taagatgccc cttcgctatc ctcgcatttc tagacgcact ccatcactct 39060tcgttccatt aggcaaacca cctatatttg aaggtgaaga ttattctatg tggagtgata 39120aaatgaggca tcacctaacc tcactccaca aaagcatatg ggatattgtt gagtatggag 39180tgcaggtacc aaagaaggga gataaagatt acgactcgga ggaggttgaa caaatccaac 39240atttcaaatc caagtcgaga ggagtataat aaggtgcaag ggttgaagag tgcaaaggat 39300atctgggacg tgctaaagac cgcgcacgaa ggagacgagg taaccaagat caccaagcgg 39360gagacgatcg agggggagct cggtcgcttc cggcttcgcc aaggggagga gccacaagat 39420atgtacaacc ggctcaagac cttggtgaac caagtgcgca acctcgggag caaaaaatgg 39480gatgaccatg aaatggttaa ggttattctt agatcacttg tgttccttaa ccctacgcaa 39540gttcaattaa ttcgtggtaa tcctagatat acactaatga ctcccgagga agtaatagga 39600aactttgtga gctttgagtt gatgatcaaa ggctcaaaga aaattatcga gcacgacggt 39660ccctccacgc ccgaagcaca accggtcgca ttcaaggcga cagaggagaa gaaagaggag 39720tctacatcaa gtagacaacc catcgacgcc tctaagctcg acaacgagaa aatggcgctc 39780atcatcaaga gcttccacca aatcctcaaa caaaggaagg ggaaagatta caagccttgt 39840tccaaaaggg tgtgctacaa gtgtggtaag cccggtcatt tcattgttaa atgtccttta 39900tctagtgata gtgacagggg cgacgacaag aagggcaaga ggagagaaaa gaggaggtat 39960tacaagaaga agggcggcga tgcccatgtg tgccgcgagt gggactccga cgagagttcc 40020tccgactcct catccgacga ggacgccgcc aacatcgccg tcaccaaagg gctcctcttc 40080cccaacgtcg gccacaagtg cctcatggca aaggacggca aaaagaagaa ggtaaaatca 40140aaatcctcca ctaaatatgc atcctctagt gatgaagata atgctagtga tgaggaggat 40200aatttgcgta ccctttttgt caacctaaac atgcaactac aggaaaaact aaatgaatta 40260attagtgcta ttcatgagaa agatgatctc ttggactttc aagaggactt cctaattaag 40320gaaaataaga agcatgttaa ggttaaaaat gcttatgctc tagaagtaga aaaatgtgaa 40380aaattatcta gtgagctaag cacttgccat gatactatta ccatccttag aaataaaaat 40440actaaactaa ttgctaaggt tgattctaat atttgtgatg tttcaattcc caatcttaga 40500gatgataatg ttaatttgct tgctaagatt gaagaattga atgtctctct tgctagcctt 40560agggttgaaa atgaaaaatt gattgctaag gctaaagaat tagatgtttg caatgcttcc 40620atttctgatc ttagaaataa caatgatatt ttacgtgcta agattgttga acttaattct 40680tgcaaaccct ctacatctgc cattgagcat gtcattattt gcactagatg tagagatatt 40740aacattgatg ctattcatga tcatatggct ttaattaaac aacaaaataa tcatatagca 40800aaattagatg ctaaaattgc cgagcatgac ttaaaaaatg aaaaatttaa atttgctaga 40860agcatgctct atagtgggag acgccctggc atcaaggatg gcattggctt ccaaaaggga 40920aacaatgtca aacttaatgc ctctcctaaa agattgtcaa actttgttaa gggcaaggct 40980cccatgcctc aggataatga gggttacatt ttgtaccctg ccggttatcc cgagagcaaa 41040attaggagaa ttcattctag gaagtctcac tctggccata atcatgcttt tatgtataag 41100ggtgagacat ctagctctag gcaatcaacc cgtgcaaaat tgcctaagaa gaaaactcct 41160gctgcatcaa atgatcataa catttcattc aaaacttttg atgcatctta tgttttaact 41220aacaaatccg acaagatagt tgccaagtat gttgggggca aacacaaggg atcaaagact 41280tgtgtttggg tacccaaagt tcttgtatct aatgtcaaag gacccaaaac catttgggta 41340cctaaaatca agaactaaac ttgttttgta ggtttatgca tccgggggcc caagttggat 41400catcgatagc gggtgcacaa accatatgac aggggagaag aaaatgttct cctcctatga 41460gaaaaaccaa gatccccaaa gagcgatcac attcggggat ggaaaccaag gtttggtcaa 41520aggattgggt aaaattgcta tatctcctga ccattccatt tccaatgtgt ttcttgtaga 41580ttctttagat tacaacttgc tttcagtttc gcaattatgt aaaatgggct acaactgtct 41640ttttacagat ataggtgtta ctgtctttag aagaagtgat gattcagtag catttaaggg 41700agtgttagag ggtcagctat acttggtaga ttttgataga gctgaactcg acacttgctt 41760aattgctaag actaacatgg gctggctctg gcatcaccga ctagcacatg ttggaatgaa 41820gaatcttcac aagcttctaa agggagaaca cattttggga ctaacaaatg ttcactttga 41880gaaagatagg atttgtagcg catgtcagac agggaagcaa gttggtactc atcatccaca 41940caagaacatc atgatgactg acaggccact cgagctccta catatggacc tattcggccc 42000gatagcttac ataagcatcg gcgggagtaa gtactatcta gttattgtgg atgattatac 42060tcgcttcact tgggtattct ttttgcagga aaaatctcat acccaagaga ccttaaaggg 42120attcttggga cgggctcaaa atgagttcgg cttaagaatc aaatttgttt taagcgacaa 42180cgggacggag tcaagaatct caaatcgaag gcacgatctc ctagatccgg cccaaaannn 42240nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 42300nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnncgc tgatgaatca gcttgattcg 42360tgacacggga ggtaagtccg tggatcaaaa ggtataccgg taagataata ggctctttac 42420tctatttatg tgcatctcga ccggatatta tgctttccgt atgcatgtgt gcaagattcc 42480aagctgaccc taaggaagct caccttacgg ccgtaaaacg aatcttgaga tatttggctt 42540atactcctaa gtttgggctt tggtatccta ggggatccac atttgatttg attggttatt 42600cggatgctga ttgggcgggg tgtaaaatca atagaaagag cacatcaggg acttgccagt 42660tcttgggaag atccttggtg tcttgggctt caaagaagca aaattcggtc gctctttcca 42720ccgccgaagc cgagtacatt gccgcaggcc attgttgcgc gcaattgctt tggatgaggc 42780aaaccctgcg ggactatggt tacaaattaa cctaagtccc tttgctatgt gataatgaga 42840gtgcaatcaa gatggcggat aatcccgtcg aacatagccg cactaaacac atagccattc 42900ggtatcattt tcttagggat caccaacaaa agggagatat cgagatttct tacattaaca 42960ctaaagatca attagccgat atctttacca agcctcttga tgaacaaacc tttaacaaac 43020ttaggcatga gctcaatatt cttgattcgc gcaatttctt ttgctaaatt gcacacatag 43080ctcatttata tacctttgat catatctctt tcatatgcta tgactaatat gttcttcaag 43140tctatttcaa accaagtcat aggtgtattg aaagggaatt ggagtcttcg gcaaagacaa 43200aggcttccac tccgtaactc atccttcgtc gtcgctctgg gccactctcc atctttgggg 43260gagagagcaa aagacttcgt ctttggtaca atcttaactc atttatttat gaccaaaggg 43320gaagaaagta cttcgagggc tctaatgatt ccgtttttgg cgattcatgc caaaggggga 43380gagagtatga gcccaaagca aacggaccgc accaccacca atttcaaaaa cttagttttt 43440caaagagtat tttcaattgg tatcctattg tgttcaaaag ggggagaaag tagtattttc 43500aaaaatgata tatcaaaacc ctcttgaaca ctaagaggtg gatctcattt agggggagtt 43560ttgtttagtc aaaggaaaag catttgaaac agggggagaa aatttcaaat cttgaaaatg 43620cttcataaaa tcgtattcat ttacctttga ctttttgcaa aagaactttg aaaaggattt 43680acaaaatagt ttgcaaaaac aaaacatgtg gtgcaagtgt ggtccaaaat gataaaaaca 43740aaggaacgat ccatgcatat cttgtaagta tttatattgg ctcaaatcca agcaaccttt 43800gcacttacat tatgcaaact agttcaatta tgcattttat acttgctttg gtttgtgttg 43860gcatcaatca ccaaaaaggg ggagattgaa agggaattag gcttacacct agttcctaaa 43920taattttggt ggttgaattg cccaacacaa ataattggac taactagttt gctctagtgt 43980acaagttata caggtgccaa ggttcacaac aagccaatta aaaagaccaa agttgggttc 44040aaaatagaga gccaaaggca tcccgaaagg ctccctggtt tggcgcaccg gactgtccgg 44100tggcgcaccg gacagtgtcc ggtgcaccag gggacctcgc gcagaactcc tcagcctcgg 44160gaatttttcg gagccgccgc gctataattc accggactgt ccggtgtaca ccggacagtg 44220tccggtgctc caagaaaaca cggctccaga acttggcagc ctcgggaaat cagaacggct 44280gctccgctat aattcaccgg acatgtccgg tgtacaccgg actgtccggt gcaactgcgg 44340agcaacggct acttcgcgcc aacggtcacc tgcaggcgca ttcaatgcgc gccagaagcg 44400cgcagaagtc aggcacgccc atgctggcgc accggacact ctacagtaca tgtccggtgc 44460gccaccggac atcaaggcgg gcccagaagg cagaactcca acggtcaaat tccaacggct 44520ttggtgacat ggctggcgca ccggactgtc cggtgcacca tacgacagac agcctccacc 44580aacggtcatg tttggtggtt ggggctataa ataccccaac caccccacca ttcattgcat 44640ccaagttttc cagcttccaa ccactataca agagctagca ttcattgcaa agcacaccaa 44700aagagatcaa atcctctccc aactccacac aaagccttag tgattagaga gagtgatttg 44760tagtgttcat ttgagctctt gcgcttggat cgcttctttt ctttggcatt ctttcttgtg 44820atcaaacact cacttgtaat tgaggcaaga gacaccaatt gtgtggtggt ccttgcgggg 44880agtttgattc ccaagtgatt tgagaagaga agctcactcg gtccaaggga ccgtttgaga 44940gagggaaggg ttgaaagaga cccggccttt gtggcctcct caacggggag taggtttgag 45000agaaccgaac ctcggtaaaa

caaatccacg cgtctcactt cattattcgc ttgcgatttg 45060ttttcacgcc ctctctcgga ctcgttctta tttctaacgc taacccggct tgtagttgtg 45120tttatatttg taaatttcag tttcgcccta ttcacccccc ctctaggcga ctatcaaata 45180gccagtgctt tttggtctgc gagttcctgc acttggttaa tcaactgtgt cgcttgatct 45240tctacttgtt tgcacgagaa ggtcaaagcc actttcgaag ctattagttc agaacacaca 45300acatctagct aaatacatca ccagtttgaa gtcattgatt gtattcttga tatcatcttt 45360attcttgaat gtcatttgtg ccagttcatt taactcttgt gctgcaaacc aacctgacat 45420cgtcaattca tttaatctct caatctcagt ttcctttttt tgtttcacat tgaagctccc 45480taattgttgc ttctcgatgt gcagtggcct gattagctac aagaagctct tgttccatgg 45540actcgatctc tggaccgcac tatcgttgcc tcagccccta ggtcgtgctg ccctctggcc 45600tcctcatcgt acaattcacc aacatctcca atgtaagtgc agcaggttca gtaatgaact 45660cagaagtggc atcagaatac tccaagagtt ttttgatctt tttgcctgga tatataccaa 45720gggaaatgca ttcaaaactc ctatagatga cgaatcccat ctctccctct tttctcggac 45780acggatcccc aggtccgtct ccgtgcttta ctcatttgtt ttttacaagt tcagatccac 45840ttgcgtactc acacggtgga catctgttat gcacatgtgt aaaccagcat aagtccttac 45900actcgaaaat gcatgtgtta tttagcttga gaataaataa aattattagc aaggagaaaa 45960caaaaaaata ggactaaaca atagagtcac attggtttaa attagtacct agaagtaaaa 46020aaagatgatc taaattagat acatcatacc aaataccata ttactattcc agttaccccg 46080tctactatgc ctagatatca aattcttgaa ggttggcctt ctcattttca gtaatagcct 46140gacgaaagta gagtatgttt gtatgagcaa ttatgctgct cactactgcc ttgcgctata 46200ataggccact actgatttta catgcttttt ctacattaga tagctcacaa acatgctacc 46260tcaaaaaaat gatggcaaag gggagccaca aaatgtcaat tattttgtca agtattagca 46320gttttcttgt gtatgtgatc agactaacac tgcatgtctt tgttttcctg taaaactatg 46380tatgatgaaa ccatggtgtg attgtattgg ctggccttac cctgttttgt tgcaatgcat 46440tcgttgttgt acaggtaata tgttgaaaca caattcattg catatgacaa ttctgttttt 46500tctttctaga atattgacat attgtttgat cattattttc taagcaataa tcatggctat 46560tcttatagta ttgcataata cctttttctt ttcgaaccct agcgcattga ttctttagtg 46620aagtgattat agtgattcca gcgggagagt agggtgggga gcagagggtt gattctggac 46680tgatttcggt ggagattaaa tggggagcag tgaggagcat gtttttttag atcccaccag 46740aatatgtgcg ccattttgct atttggctga ggagtgatgc tcagggagaa tccgttctca 46800ggagctgtgc caaatgtcgc cttaggtttt atgatatgac ctgacttctg tgttaatatt 46860tgttagatct ttattttatt tgaggttaca aaggtggtgt tctcaagcta gaaacaaagt 46920tgtggctagg tcaaaactag atgatgctct tgaaccgtgt cttttgactc tgttacttgt 46980tgcaggtttg atgttcacta atatgttgtt caactttgag caggtcagca attgactggt 47040gttgctggta gcctggcata tctggcacct gaggttctac taggaaatta ctcccaaaag 47100gtagatgtat gggctgccgg ggtgcttctg catgttctgt tgatgggcac tcttccgttc 47160caaggaaaat ctatcgaagc tatctttgat gttataaaga ctgctgaact tgactttcac 47220aatagtcagt gggcatctgt gtcacttctt gcttatgatc tcattggtcg aatgcttaat 47280cgagaggtct cttcaaggcc cgatgccgaa gatgttctcc gtaagttcaa gcacccttgt 47340aacttgtgct ttatatatat atataatata tatatatata tgattctcaa tttatcattg 47400acttttccta atggctttca acacagggca cccatgggtc ttattctaca ctgattgcct 47460gcagaaagct gaattctcta acctatggga tactaacaaa actgcagctc ccatgattca 47520tcgggagata gtcaggtttg gttactgcga gtcttcatct tcaaaatcct caagtgacaa 47580ctctgaagag cgagatgaat gcggtatagt tgatgcactg gcgacaacaa taacacaggt 47640gaggatctca gagcccaaga ggagtcggct gttcagccta cccaacgggt tgttgccgcc 47700aagcaggaac agtctccgaa catgaagatg atgaatccgt gtgtggcttt ctaacttgac 47760ctacctagct cccatcccca tgcatgtata aacgagataa acgagctctg tgattttata 47820gatggaaaat tttcaccgtg gttgatgttt tgcgattgct agctcgctga gcctgcaatc 47880ctctgtaaat atatcattgt tgtcatcatt tttgtacatc gatgacaccg taattgattc 47940gatt 4794421524DNAArtificial SequencecDNA 2atggggagca gtgaggagca tgttttttta gatcccacca gaatatgtgc atccgtgtca 60cttcttgctc atgatctcat tggccgaatg cttaatcgag aggtctcttc aaggcccaat 120gccaaagaag ttctccctcc catgattcat cgggagatag tcaggtttgg ttactgtgag 180tcttcatctt caaaatcctc aagtgacaac tctgaagagc gagatgaatg cggtatagtt 240gatgcactgg tgacaacaat aacacagatt cggaagatgg acttggaggc aaggagccta 300cagcctagca ttaaggctgg tttgcttgca aagctgaggg agtataaatc tgacctcaac 360aacgtcaaga tgggtctatc cgcagagagg aagaagcaga agctctccga gatccaatcc 420ggcgttgagg aagctgaatc gctgattcag aaaatggacc tggaggcaag gagcctacag 480cctagcatta aggctggttt gcttgcaaag ccgagggatt ataaatctga cctcaacaac 540gtcaagagtg agctcaagag gatatctgcg cccaatgcca gtggcctgat tagctacaag 600aagctcttgt tccatggact cgatctctgg accgcactat cgttgcctca gcccctaggt 660cgtgctgccc tctggcctcc tcatcgtaca attcaccaac atctccaatg tcagcaattg 720actggtgttg ctggtagcct ggcatatctg gcacctgagg ttctactagg aaattactcc 780caaaaggtag atgtatgggc tgccggggtg cttctgcatg ttctgttgat gggcactctt 840ccgttccaag gaaaatctat cgaagctatc tttgatgtta taaagactgc tgaacttgac 900tttcacaata gtcagtgggc atctgtgtca cttcttgctt atgatctcat tggtcgaatg 960cttaatcgag aggtctcttc aaggcccgat gccgaagatg ttctccggca cccatgggtc 1020ttattctaca ctgattgcct gcagaaagct gaattctcta acctatggga tactaacaaa 1080actgcagctc ccatgattca tcgggagata gtcaggtttg gttactgcga gtcttcatct 1140tcaaaatcct caagtgacaa ctctgaagag cgagatgaat gcggtatagt tgatgcactg 1200gcgacaacaa taacacaggt gaggatctca gagcccaaga ggagtcggct gttcagccta 1260cccaacgggt tgttgccgcc aagcaggaac agtctccgaa catgaagatg atgaatccgt 1320gtgtggcttt ctaacttgac ctacctagct cccatcccca tgcatgtata aacgagataa 1380acgagctctg tgattttata gatggaaaat tttcaccgtg gttgatgttt tgcgattgct 1440agctcgctga gcctgcaatc ctctgtaaat atatcattgt tgtcatcatt tttgtacatc 1500gatgacaccg taattgattc gatt 152431082DNAArtificial SequencecDNA 3cctgcccttc cattcttccc ccgctgcccc cggtcaacgt cacgaacccg ggcctcgtgc 60cgctcgtcgt ggccacactg ttcgacgagc gagtcacaga gctgctgagc gtgctcgctg 120atgcggcggt ggggcgacca ggcaggtggt ccatcggcga agcgccatgg tcgtcgtcgg 180ggggcacgaa ccaggcggtg tacgcgcgcc gcgcgcccgg ctcttcatcg cctccacccg 240ctccagcgtc tccaccactt ccttcatcga gggccgactg cttggctcgc tggccaggca 300gccgagcatt agttgcgccg cttggaacgc ctgcttttgt tgatcgtttg ttttggtctg 360atttcagtgg gtctatccgc agagaggaag aagcagaagc tctccgagat ccaatccggc 420gttgaggaag ctgaatcgct gattcagaaa atggacctgg aggcaaggag cctacagcct 480agcattaagg ctggtttgct tgcaaagccg agggattata aatctgacct caacaacgtc 540aagagtgagc tcaagaggat atctgcgccc aatgccagat tcggaagatg gacctggaag 600caaggagcct acaacctagc attaagagtg agctcaagag gatatctgcg cccattgcca 660ggcaggctac ccgggaggag ctcctggagt ctggaatggc tgatactctc gcagtgagct 720aatgctagga cttgactgtg tctacgagac tgctcctaac aataaactga agaaagcaaa 780agaaatcatt caacgtattc gccgaagaga actctacaag atatggtagt cctgggagga 840agagccatcg ccgctatgta tggcagaacc acccgcgaaa gcaacctctc tggtctcgcg 900tagccagagc aggagcagct cgcttgcgcg gtcgcggcgc tggcggccgg ccccgcgtac 960gagcgcctgc aggaagccag gaacccatcc gaacaaggtt gcaatcatga taagcagata 1020gagcaagcat atgatgatat tttgaattcg tcgaagcata ctttggccag catgatggag 1080ct 108242321DNAArtificial SequencecDNA 4tgaggagcat gtttttttag atcccaccag aatatgtgca tccgtgtcac ttcttgctca 60tgatctcatt ggccgaatgc ttaatcgaga ggtctcttca aggcccaatg ccaaagaagt 120tctccgtaag ttcaagcacc cttgtaactt gtgctttata tatatgattc tcaatttatc 180attgactttt cctaatggct ttcaacacag ggcaccatgg gtcttattct acactgattg 240cccgcagaaa gctgaattct ctaacatatg ggatactaac aaaactgcag ctcccatgat 300tcatcgggag atagtcaggt ttggttactg tgagtcttca tcttcaaaat cctcaagtga 360caactctgaa gagcgagatg aatgcggtat agttgatgca ctggtgacaa caataacaca 420ggtgaggatc tcagagccca agaggagtcg gctgttcagc ctacccaacg ggttgttgcc 480gccaagcagg aacagtctcc gaacatgaag atgatgaatc cgtgtgtggc tttctaactt 540gacctaccta gctcccatcc ccatgcatgt ataaacgaca tttggggaat gggtagaaaa 600gcagagatta gggattttcg tttccgtcgg tgcagttttg gtgttccaat ggagttgcga 660gatgtttatg tgccttagtc ttcaatttgg gggttggggg aaaagtaatt ttatgttttt 720gttttgtgtc tgcagattcg gaagatggac ttggaggcaa ggagcctaca gcctagcatt 780aaggctggtt tgcttgcaaa gctgagggag tataaatctg acctcaacaa cgtcaagagt 840gagctcaaga ggatatttgc gcccaatgcc aggcaggcta cccgggagga gctcctagag 900tttggaatgg ctgatactct cgctgtgagc taatgctagg acttgactgt gtctacgaga 960ctgctcctaa caataaactg aagaaagcaa aagaaatcat tcaacgtatt cgccgaagag 1020aactctacaa ggtagtatga tgctttaatt gctcatatac aagtgtcatt ttgtcatgtc 1080attacacatg gttaggatac atacttaagt ttctaacgta ggcgtccaca caacggattg 1140gtgcacggtt ctgccgatgt atcccacgca cgtgcatgga aggaggcagg cacccttccc 1200cgccgccccg gatctcgcgc cagcccccgc cctaccccgc ctgcccttcc attcttcccc 1260cgctgccccc ggtcaacgtc acgaacccgg gcctcgtgcc gctcgtcgtg gccacactgt 1320tcgacgagcg agtcacagag ctgctgagcg tgctcgctga tgcggcggtg gggcgaccag 1380gcaggtggtc catcggcgaa gcgccatggt cgtcgtcggg gggcacgaac caggcggtgt 1440acgcgcgccg cgcgcccggc tcttcatcgc ctccacccgc tccagcgtct ccaccacttc 1500cttcatcgag ggccgactgc ttggctcgct ggccaggcag ccgagcatta gttgcgccgc 1560ttggaacgcc tgcttttgtt gatcgtttgt tttggtctga tttcagtggg tctatccgca 1620gagaggaaga agcagaagct ctccgagatc caatccggcg ttgaggaagc tgaatcgctg 1680attcagaaaa tggacctgga ggcaaggagc ctacagccta gcattaaggc tggtttgctt 1740gcaaagccga gggattataa atctgacctc aacaacgtca agagtgagct caagaggata 1800tctgcgccca atgccagatt cggaagatgg acctggaagc aaggagccta caacctagca 1860ttaagagtga gctcaagagg atatctgcgc ccattgccag gcaggctacc cgggaggagc 1920tcctggagtc tggaatggct gatactctcg cagtgagcta atgctaggac ttgactgtgt 1980ctacgagact gctcctaaca ataaactgaa gaaagcaaaa gaaatcattc aacgtattcg 2040ccgaagagaa ctctacaaga tatggtagtc ctgggaggaa gagccatcgc cgctatgtat 2100ggcagaacca cccgcgaaag caacctctct ggtctcgcgt agccagagca ggagcagctc 2160gcttgcgcgg tcgcggcgct ggcggccggc cccgcgtacg agcgcctgca ggaagccagg 2220aacccatccg aacaaggttg caatcatgat aagcagatag agcaagcata tgatgatatt 2280ttgaattcgt cgaagcatac tttggccagc atgatggagc t 232151082DNAArtificial SequencecDNA 5cctgcccttc cattcttccc ccgctgcccc cggtcaacgt cacgaacccg ggcctcgtgc 60cgctcgtcgt ggccacactg ttcgacgagc gagtcacaga gctgctgagc gtgctcgctg 120atgcggcggt ggggcgacca ggcaggtggt ccatcggcga agcgccatgg tcgtcgtcgg 180ggggcacgaa ccaggcggtg tacgcgcgcc gcgcgcccgg ctcttcatcg cctccacccg 240ctccagcgtc tccaccactt ccttcatcga gggccgactg cttggctcgc tggccaggca 300gccgagcatt agttgcgccg cttggaacgc ctgcttttgt tgatcgtttg ttttggtctg 360atttcagtgg gtctatccgc agagaggaag aagcagaagc tctccgagat ccaatccggc 420gttgaggaag ctgaatcgct gattcagaaa atggacctgg aggcaaggag cctacagcct 480agcattaagg ctggtttgct tgcaaagccg agggattata aatctgacct caacaacgtc 540aagagtgagc tcaagaggat atctgcgccc aatgccagat tcggaagatg gacctggaag 600caaggagcct acaacctagc attaagagtg agctcaagag gatatctgcg cccattgcca 660ggcaggctac ccgggaggag ctcctggagt ctggaatggc tgatactctc gcagtgagct 720aatgctagga cttgactgtg tctacgagac tgctcctaac aataaactga agaaagcaaa 780agaaatcatt caacgtattc gccgaagaga actctacaag atatggtagt cctgggagga 840agagccatcg ccgctatgta tggcagaacc acccgcgaaa gcaacctctc tggtctcgcg 900tagccagagc aggagcagct cgcttgcgcg gtcgcggcgc tggcggccgg ccccgcgtac 960gagcgcctgc aggaagccag gaacccatcc gaacaaggtt gcaatcatga taagcagata 1020gagcaagcat atgatgatat tttgaattcg tcgaagcata ctttggccag catgatggag 1080ct 108263646DNAZea mays 6atggcacact ttgatgaact agaggataaa acaacagatt atgttgattt atcggttcaa 60gaatttgctc ttaagcaacc tcaatgtggc atggcttata attactatgg aaatttaagg 120ctttatgtag tagccaataa agctgaattg gcctcttcaa tatttgaaat cgataaggta 180aacaaaggcg gagttaatgc atctatgcca gtgaccactt ccactcctaa ttcgaatcaa 240aattcatgaa ccggttatgg aacaaataga gaatcaagtt tcggtgaggt ataatacgat 300tcccctaacc catggaattt acctagtaaa aatcctgtag ttaatagtgt actagtaact 360tctgtcaccg acttgaataa agctttgaat gagtataaaa atgagatgtc taaatttatt 420gagaatagct tggtgtatag attaagccta gtagaaacac ttataacaag ttgtatgctt 480caattttttt tgattttttt ggaagctact catagttgga gggtaccaaa tttacaaaaa 540aaaattggtg attataatag taaatctacc atagaacatg ttagcttgtt tcttgctctg 600agaggtgaag ctagtagcat gaaaattgaa tgtgcgttat ttttcttttt cacttactgg 660tacaattttt gcatggttta tgttgttgcc ctgcttgttg tattggttca tgggctggtc 720tgtgaaataa tttggcgata gccattttct tttttgagat acattgcttt tgctatatat 780atctagatat ggtgcatatt taaatgcata ataaaaatgt aaaaatctaa aacgtcttat 840aatttaggac agatgaaagt actagatatt agacattttt agtgttttta ttaaaatgga 900atatgtaccg cctttgatgc tacaactttt acttagcttt taaaacacac cattctaaat 960tgtaaaaaaa tattaaaaat gtgttttgca agatgaatat actaaccttt gttatgataa 1020tagttttcat atgttaatgg aacaagctaa aaagtttggc aaagtatagt cctatagctt 1080ccatttcgac tcagagagag tatgttgtat ccactaaccg tgtacacaag atagcccaac 1140taattaatta ttttgtgagc tatcacccaa ccttctgttt atcatggatt catggaaaaa 1200tgtaattgcc atcattacac taaaaactaa aacttatgaa ggagaaccat tgtcttgcta 1260tatatgagat gacaaaattt tccaaagaag agagaagccg gcagaaccca tcctgtttca 1320aatctcttct actacttaag tttctaacgt aggcgtccac aaaacggatt ggtgcacggt 1380tctgccgatg tctcccacac acgcgcatgg aaggaggcag gcacccttcc ccgccgcccc 1440ggatctcgcg ccagccccag ccctaccccg cctgcccttc cattcttccc cagccgcccc 1500ccggtcaacg tcacgaaccc gggcctcgtg ccgttcgccg tggccacgcg gttcgacgag 1560cgggtcacgg agctgctgag cgcgctcgct gacgcggcgg cggggcgacc aggcaggtgg 1620gccatcggcg aagcgccatg gtcgtcgtcg gggggcagga accaggcggt gtacgcgcgc 1680cgcgcgcccg gctcttcatc gcctccaccc gctccagcgt ctccaccacc tccttcatcg 1740agggccgact gcgaggctcg ccggccaggc agccgagcgt cagttgcgcc gcttggaacg 1800cctgcttttg ttgatcgttt gttttggtct gatttcggtg ggtctatccg cagagaggaa 1860gaagcagaag ctctccgaga tccaatccgg cgttgaggaa gctgaatcgc tggtaaatag 1920atgccgcgac acgttctggt ttggggatcc ccttggctaa caggacatac gacatttggg 1980gaatgggtag aaaagcagag attagggatt tttcgtttcc gtcggtgcag ttttggtgtt 2040ccaacggagt tgcgagatgt ttatgtgcct tagtcttcaa tttgggggtt gggggaaaag 2100taattttatg tttttgtttt gtgtctgcag attcagaaaa tggacctgga ggcaaggagc 2160ctacagccta gcattaaggc tagtttgctt gcaaagctga gggagtataa atctgacctc 2220aacaacgtca agagtgagct caagaggata tctgcgccca atgccaggca ggctacccgg 2280gaggagctcc tggagtctgg aatggctgat actctcgcag tgagctaatg ataggacttg 2340actgtgtcta cgagactgct cctaacaata aactgaagaa agcaaaagaa atcattcaac 2400gtattcgccg aagagaactc tacaaggtag tatgatgctt taattgctca tatacaagtg 2460tcattttgtc atgtcattac acatggttag gatacatagg agattctgtt ttttaacaca 2520tagttgtccc atgtccatga attcatttga attaatttac tcttcgcaat cttatacatt 2580aaaatcgtgt tacctattac atcacaactt catgagagca tgcttgttct gtgtagatat 2640ggtagtcctg ggaggaagag ccatcgccgc tatgtatggc agaaccaccc gcgaaagcaa 2700cctctctggt ctcgcatagc cagagcagga gcagctcgct tgcgcggccg cagcgctggc 2760ggtcggcccc gcgtacgagc gcctgcaggt aggccagctt ctgctgcaat gcccgaatct 2820cggcgtccac gcgcagcagc gtcgtcgcct cctcctccgt cagctcaccc agcttggcca 2880gcacccccgt cacccccgcg tccgccatgg ctgtcgccgg gaccgaaagg ctaaaactgt 2940cacaatgacg taaagtttgg ttggtgttgg cggctcacgc aaaaccagac ctttccaagt 3000tttactttag cggagttttt ttggaacgag agcaaagcag cacagtttca agaatgtggg 3060gcaatttgaa tgttcgttcc tgctgcactg ctactgcttt tagaattgta gtatgcttca 3120tcatttattt atttctaaaa aaacttgcat gaattctatc gtgactttta ttgagaaaat 3180aatgtattca cgtatcttca tgtttctgat aaaggtattt gtatatgcat cggtgctaca 3240tatgcgaata caagttttgt ttcaactctg aagtctcaag ttgaattcta aactccagtt 3300tgttttctac tgtgctgctg caggaagcca ggaacccatc cgaacaaggt tgcaatcatg 3360ataagcagat agagcaagca tatgatgata ttttgaattc gtcgaagcat actttggcca 3420gcatgatgga gctgcaggag gttcagttat ttgcacacat tgtttttctt tcactcctat 3480gattttcctc aatatgatca aaatgtttct tttgcaaata atgattgaaa tgtttctcat 3540tgtactcaac ctcttaaact acctataggc tttgcttgag agtaatcagg ctacaaagga 3600tgccaatggt attgctgctc tctatattgt tcttgttcta atgtaa 364673646DNAArtificial SequencecDNA 7atggcacact ttgatgaact agaggataaa acaacagatt atgttgattt atcggttcaa 60gaatttgctc ttaagcaacc tcaatgtggc atggcttata attactatgg aaatttaagg 120ctttatgtag tagccaataa agctgaattg gcctcttcaa tatttgaaat cgataaggta 180aacaaaggcg gagttaatgc atctatgcca gtgaccactt ccactcctaa ttcgaatcaa 240aattcatgaa ccggttatgg aacaaataga gaatcaagtt tcggtgaggt ataatacgat 300tcccctaacc catggaattt acctagtaaa aatcctgtag ttaatagtgt actagtaact 360tctgtcaccg acttgaataa agctttgaat gagtataaaa atgagatgtc taaatttatt 420gagaatagct tggtgtatag attaagccta gtagaaacac ttataacaag ttgtatgctt 480caattttttt tgattttttt ggaagctact catagttgga gggtaccaaa tttacaaaaa 540aaaattggtg attataatag taaatctacc atagaacatg ttagcttgtt tcttgctctg 600agaggtgaag ctagtagcat gaaaattgaa tgtgcgttat ttttcttttt cacttactgg 660tacaattttt gcatggttta tgttgttgcc ctgcttgttg tattggttca tgggctggtc 720tgtgaaataa tttggcgata gccattttct tttttgagat acattgcttt tgctatatat 780atctagatat ggtgcatatt taaatgcata ataaaaatgt aaaaatctaa aacgtcttat 840aatttaggac agatgaaagt actagatatt agacattttt agtgttttta ttaaaatgga 900atatgtaccg cctttgatgc tacaactttt acttagcttt taaaacacac cattctaaat 960tgtaaaaaaa tattaaaaat gtgttttgca agatgaatat actaaccttt gttatgataa 1020tagttttcat atgttaatgg aacaagctaa aaagtttggc aaagtatagt cctatagctt 1080ccatttcgac tcagagagag tatgttgtat ccactaaccg tgtacacaag atagcccaac 1140taattaatta ttttgtgagc tatcacccaa ccttctgttt atcatggatt catggaaaaa 1200tgtaattgcc atcattacac taaaaactaa aacttatgaa ggagaaccat tgtcttgcta 1260tatatgagat gacaaaattt tccaaagaag agagaagccg gcagaaccca tcctgtttca 1320aatctcttct actacttaag tttctaacgt aggcgtccac aaaacggatt ggtgcacggt 1380tctgccgatg tctcccacac acgcgcatgg aaggaggcag gcacccttcc ccgccgcccc 1440ggatctcgcg ccagccccag ccctaccccg cctgcccttc cattcttccc cagccgcccc 1500ccggtcaacg tcacgaaccc gggcctcgtg ccgttcgccg tggccacgcg gttcgacgag 1560cgggtcacgg agctgctgag cgcgctcgct gacgcggcgg cggggcgacc aggcaggtgg 1620gccatcggcg aagcgccatg gtcgtcgtcg gggggcagga accaggcggt gtacgcgcgc 1680cgcgcgcccg gctcttcatc gcctccaccc gctccagcgt ctccaccacc tccttcatcg 1740agggccgact gcgaggctcg ccggccaggc agccgagcgt cagttgcgcc gcttggaacg 1800cctgcttttg ttgatcgttt gttttggtct gatttcggtg ggtctatccg cagagaggaa 1860gaagcagaag ctctccgaga tccaatccgg cgttgaggaa gctgaatcgc tggtaaatag 1920atgccgcgac acgttctggt ttggggatcc ccttggctaa caggacatac gacatttggg 1980gaatgggtag aaaagcagag attagggatt tttcgtttcc gtcggtgcag ttttggtgtt 2040ccaacggagt tgcgagatgt ttatgtgcct

tagtcttcaa tttgggggtt gggggaaaag 2100taattttatg tttttgtttt gtgtctgcag attcagaaaa tggacctgga ggcaaggagc 2160ctacagccta gcattaaggc tagtttgctt gcaaagctga gggagtataa atctgacctc 2220aacaacgtca agagtgagct caagaggata tctgcgccca atgccaggca ggctacccgg 2280gaggagctcc tggagtctgg aatggctgat actctcgcag tgagctaatg ataggacttg 2340actgtgtcta cgagactgct cctaacaata aactgaagaa agcaaaagaa atcattcaac 2400gtattcgccg aagagaactc tacaaggtag tatgatgctt taattgctca tatacaagtg 2460tcattttgtc atgtcattac acatggttag gatacatagg agattctgtt ttttaacaca 2520tagttgtccc atgtccatga attcatttga attaatttac tcttcgcaat cttatacatt 2580aaaatcgtgt tacctattac atcacaactt catgagagca tgcttgttct gtgtagatat 2640ggtagtcctg ggaggaagag ccatcgccgc tatgtatggc agaaccaccc gcgaaagcaa 2700cctctctggt ctcgcatagc cagagcagga gcagctcgct tgcgcggccg cagcgctggc 2760ggtcggcccc gcgtacgagc gcctgcaggt aggccagctt ctgctgcaat gcccgaatct 2820cggcgtccac gcgcagcagc gtcgtcgcct cctcctccgt cagctcaccc agcttggcca 2880gcacccccgt cacccccgcg tccgccatgg ctgtcgccgg gaccgaaagg ctaaaactgt 2940cacaatgacg taaagtttgg ttggtgttgg cggctcacgc aaaaccagac ctttccaagt 3000tttactttag cggagttttt ttggaacgag agcaaagcag cacagtttca agaatgtggg 3060gcaatttgaa tgttcgttcc tgctgcactg ctactgcttt tagaattgta gtatgcttca 3120tcatttattt atttctaaaa aaacttgcat gaattctatc gtgactttta ttgagaaaat 3180aatgtattca cgtatcttca tgtttctgat aaaggtattt gtatatgcat cggtgctaca 3240tatgcgaata caagttttgt ttcaactctg aagtctcaag ttgaattcta aactccagtt 3300tgttttctac tgtgctgctg caggaagcca ggaacccatc cgaacaaggt tgcaatcatg 3360ataagcagat agagcaagca tatgatgata ttttgaattc gtcgaagcat actttggcca 3420gcatgatgga gctgcaggag gttcagttat ttgcacacat tgtttttctt tcactcctat 3480gattttcctc aatatgatca aaatgtttct tttgcaaata atgattgaaa tgtttctcat 3540tgtactcaac ctcttaaact acctataggc tttgcttgag agtaatcagg ctacaaagga 3600tgccaatggt attgctgctc tctatattgt tcttgttcta atgtaa 3646810605DNAZea mays 8aggaatctta aacatgtgga acaggtgctc aacacattta gcaactagtt gttgatgacc 60cataactttg cagccttcat aatgcacaca attgatgcat caattgcata cctcctgtct 120ttgtcaacat tttcaacacc ttttttcttc tcatcaacag gaggcgatgg aatccaaaaa 180gagtgacaac aaaaatatta gtataataac taaactctaa gtctaccaaa cagtgaaaga 240atagtgaaac aggaaatacc ttaattctta tcattatgtt aataaattta aaattgaact 300aaaaaacatc attaggtgag atggatctat ttgtcggttc cctgttaagg atcttgtatt 360tcacacaaga aagtgaatgg agcaaacaca cactacatca tctgacatgt ttagttgtgt 420tgcaatttca aaatatctaa gcctgtcgga tccattgaac aatagtagta atgtggcatg 480tcaaaaaatg tcaaacatgt taatatagca ccattttttg tgatgcagaa atgacccgag 540attacctgat atgtcataac aacgagctca atagggttag catcaaactt cgcacttacg 600ttgcaggttc ccatccatgt cagcttcatg tgcttcgttc ttgtttgatg gaactccttg 660aaaacatcta cacatttcag tgcctcatga gactcgccat agcgagcggg gtggaggatg 720gtgacaattt cggcagcctc gccgggggca gcactgcgac gagcgaggcg ggaggggtgg 780caatctcgtg ggggaggggc caggcacgac cgtccaacca cggcgggcgg agacacggta 840tccatgtgag atatagccac ctcatctagc cttatctcca agattttaaa tcactgatca 900tcataagcga tcagatggag gccagttcac atcgaccgac aggactgata ctaacaggac 960catccacact tgcacatact ataaagatta ataagagatt acataagaac taagtagtga 1020atcagacacc cattgctgac cttgttaatc agcccatcgc caatgaaggt gctgctgatc 1080ttcttgaaca acctgtacaa tgcctcgacc ttgttcacgc tgactgcaag aatgacaagc 1140agaggaaacc caaccaggca ggaaaatgat gaccaacact gaatagaaaa agtaaatgaa 1200cacctccagc tatcaaaatt ttaaagcaat gtgaagtcct caaagaacca ggacaacact 1260catgattttt tataactaag ggaattgttt atcatcaatt cattctaaaa tacaagacaa 1320tcaaaagaac taagcaaagc atgagataca aaaattcaaa gcacatgtat agtgtcttgg 1380taaaaaattt acaagatggt gaatgaattc aactcaggtt gtctacttca gcattagttt 1440gcactgtcca gaaaaagaac aacagcaaga ttggaataat gctatggcca ccagaataaa 1500aggtcagagc tgtcttttaa tgctaatatt gttcatgcca aacatttctt tgttagcttg 1560tgaatttata cttggacact ggactgggcc ttgatcgacg ctggcaatat catgctgaac 1620tctgaaggca ccaaaactgt tagctccttc cctcgtcaat ttgtcaattc aacatgtctg 1680cttcaaaatg gttatgcgta ggttgaagaa aagttgggag tttacaaaat aatacaatgg 1740gatgcctgtt ctatcatcta acttaagcca tgtatcaagg ttgcaagtta cataaaatac 1800gcttatattc tgatggttgg aaccacacat tctacacgtt tcccaaaaca atgaaaaagg 1860tagttgtcga aagatttaag catctaaagt gtccactctc tctgagagca tcaaaataaa 1920gtagtacgtc ttatgtttta aactatttat tgaagtacca aactatacgg ctactaaaga 1980tttatttaga tgagtaaacg aaataattta tggtatataa attaagaagg ggtgattagt 2040catgaaaaat aaaatgtcac aattaccagc agcacgtgat tttctaaata atttaagcat 2100gtgcggtgct cttccagata aaacttaggg gacgaccacc tagttcattg aaagagggga 2160ataaaccaag ctccaacttt caagcttgtc aaggcttgtc attattaatt taaacaggac 2220agccaattct cagacatgat gttccaaact gctaatgaat atataatgct caaaataaac 2280aactaggttc ttaactgtca attacaccca caagatgcac ataattagaa aaggtaaaag 2340agaaggcaaa tggaatacca ggaattatat gactactaaa tcatttattt agataagtag 2400atgaaataat ttatggtaca taatataaga acgggtgatt agttatgaga aataaaaggt 2460caccattacc agcagcatgt gttgttctaa ataatttaag cctgtgtggt attctttgag 2520ataaaacata ggagacgacc acctaattca ttggaagagg ggaacagacg aagctccaac 2580cttcgagctt gtcaaggctt ggcattatta atttaaacag gacagacaat gctcaatctg 2640aactgccatt gtatctacaa tactcaaaat aaacaactag attctgaaca accagattat 2700ttgtactcat tccatgtctc ataaacaagg aaaaaataac aaccagatta tttgtactca 2760ttccatgtct cataaacttt gggcaccatc catccaacac atccaatcta aacacaccaa 2820acgatgggga atggaaagag cagtattcga ttcaacaatg gcaaacaaat atcactgaat 2880tagaccaaga ataaacctaa ttagacaacg acctcccaac catcattcgt caggctgtaa 2940agaagataaa gctgccatgg ggcatggatc aagcagaaca ccagagatga atccaaacac 3000acagaaaatc acgcgcgctg tctacaatga caacaagccc cacatttcat tgcagtacac 3060tgggctacaa aggcacgtac aacaaagagc tagggaaaca ttgcggaggg cacgagagag 3120cagctaactt gacaatatag cagactgagc ttgcactgtt agcaggcgag gaagggaatc 3180atggggacgg agaatggggt ccatgcccgc gaaggagaag gcggacgccg ccacggtggc 3240accggcgcac gcgcacacag ggaacccgca caggcagcca tggatgctgc ctcgccattg 3300cgccggtcgt ctctgccacg ctcctctctc tctcccgctg catcgccgtg gatggggcaa 3360gcagagagca gggactgcga cgatctgggc ggaggactcg ccttggagag cgcggacgca 3420gacgggattc tagggagaga gcgaagacgg ggcgcgcgcg gcgctcgcgc ggcgtggtgg 3480cggcgagatt agcgggggtg gggggagggc ggagccgtgg tgagggtgtg gacgccctcc 3540ttaccctctt aagtagtagt agagatataa tccgttccaa aatatccatc cgttcaattt 3600atatttcgtt tgatcttttt accctaaatt tgattgactc atcttattaa aaaagttcat 3660aactattatt aatctttatt gagatatcat ttagcatata atatacttta agtgtggttt 3720tagatttttt ttaaaaaaaa aaattcgcaa aaattaaatg aaacgaccca atcaaacttg 3780aaaagtaaaa ctaattataa atttgaacgg aaggagtaag aggatgtttg aatgtactag 3840agctaatagt tggttgcttt aaaatttgct agtagaatta gctagctaat aaatatctag 3900ataactatta gctaatttgc taaaacagct aatagttgaa ctattagcta gattgtttgg 3960atgtattcgg ctaattttaa tggctaacta ttagctatag tacaatattc aaacacctcc 4020taattaaaat ggacaaatat ctcttctttt ggtcccttgc gttagatttt tcatatctcc 4080ttatttagta taaaagaatc atcaaaaagt ggacaacccc tagtggaaca ccattttagt 4140agtggttgca tgaaaccttt cgcgcaccag tttctatgtg tcactctaaa aatgggacag 4200catgtacgta gtgcctatat atatacaagt catctatcgt tgcctcctca gttcatcact 4260aatcacactt attgtgccct cgacgagtat ctatagctag ctcattaatc gattcggggg 4320tgtgttgtcg aaggcggcaa tggcgagcta ctcgtcgcgg cgtccatgca atacctgtag 4380cacgaaggcg atggccggga gcgtggtcgg cgagcccgtc gtgctggggc agagggtgac 4440ggtgctgacg gtggacggcg gcggcgtccg gggtctcatc ccgggaacca tcctcgcctt 4500cctggaggcc aggctgcagg agctggacgg accggaggcg aggctggcgg actacttcga 4560ctacatcgcc ggaaccagca ccggcggtct catcaccgcc atgctcaccg cgcccggcaa 4620ggacaagcgg cctctctacg ctgccaagga catcaaccac ttttacatgc agaactgccc 4680gcgcatcttt cctcagaagt gagtccgatg ctgccgccat tgttcttgca tccatccagc 4740atcgtacgta cgtcctctat acatctgcgg atcatcatgt gcgcatgttt gtggcatgca 4800tgcatgcatg tgagcaggag caggcttgcg gccgccatgt ccgcgctgag gaagccaaag 4860tacaacggca agtgcatgcg cagcctgatt aggagcatcc tcggcgagac gagggtaagc 4920gagacgctga ccaacgtcat catccctgcc ttcgacatca ggctgctgca gcctatcatc 4980ttctctacct acgacgtacg tacgtcgtca cgaatgattc atctgtacgt cgtcgcatgc 5040gaatggctgc ctacgtacgc cgtgcgctaa catactcagc tctttcctat ctgctgcgcc 5100aatttgcagg ccaagagcac gcctctgaag aacgctctgc tctcggacgt gtgcattggc 5160acgtccgccg cgccgaccta cctcccggcg cactacttcc agactgaaga cgccaacggc 5220aaggagcgcg aatacaacct catcgacggc ggtgtggcgg ccaacaaccc ggtaactgac 5280tagctaactg gaaaacggac gcacagactc catgtccatg gcggcccaca aggtcgatgc 5340taattgttgc ttatgtatgt cgcccgattg cacatgcgta gacgatggtt gcgatgacgc 5400agatcaccaa aaagatgctt gccagcaagg acaaggccga ggagctgtac ccagtgaagc 5460cgtcgaactg ccgcaggttc ctggtgctgt ccatcgggac ggggtcgacg tccgagcagg 5520gcctctacac ggcgcggcag tgctcccggt ggggtatctg ccggtggctc cgcaacaacg 5580gcatggcccc catcatcgac atcttcatgg cggccagctc ggacctggtg gacatccacg 5640tcgccgcgat gttccagtcg ctccacagcg acggcgacta cctgcgcatc caggacaact 5700cgctccgtgg cgccgcggcc accgtggacg cggcgacgcc ggagaacatg cggacgctcg 5760tcgggatcgg ggagcggatg ctggcacaga gggtgtccag ggtcaacgtg gagacaggga 5820ggtacgaacc ggtgactggc gaaggaagca atgccgatgc cctcggtggg ctcgctaggc 5880agctctccga ggagaggaga acaaggctcg cgcgccgcgt ctctgccatc aacccaagag 5940gctctagatg tgcgtcgtac gatatctaag acaagtggct ttactgtcag tcacatgctt 6000gtaaataagt agactttatt ttaataaaac ataaaaatat atatatgttc ttgaatataa 6060aattgataac caaattaaaa ttcgaaccat cacttataca taattttact ttatttttta 6120taaaacgtga acgggaagga ctaccgtgaa tgactataga accaatcata ctagtataaa 6180atatatgatg acactacggg agagacaaac tttgtctggc gctaaatatt ttgccgagtg 6240tgaattcacg ggcactaggc aaagatcttc tttgccgagt gttacgctgg gcaaagtaag 6300acactaggta aatcagtcat ttgccgagtg tccgccacta ggcaaagcaa aacactggca 6360aatcaaaagt ttacctagtg ccagacacta ggcaaaaaaa aaacgctcgg caaatcggaa 6420gtttccctag tgccagacac tagacaaaga aaaacacttg ataaactagc gtcgtcagct 6480aacaccatcc accaaccgtt aacgttgccg agtatctgac ttcgacactc ggcaaagaag 6540gtctctttgc ctagtgtcgg tctggaacac taggcaaaga ggcactttac ctagtgtcgt 6600attttgacac tcagtaaaat aatttttttt ctttctgctt ccaaactttt tatgatgtgt 6660tcctatagca cctagaacta catgtcaagt tttggtaaaa tttttgaagt ttttgctata 6720tttacttaat ttattttatt taattgaatt tcttttgata attcaaattt gaactcggca 6780aggtaagaag cgagggtagc ctggaaacac actttgccta gtgttacact cggtacagga 6840gcctcccctg cctagtgctg cactcgacaa aagattcgcc tttgcctagc gctgcactcg 6900gcacaggagt cgcctttgcc tagtgctgca ctaggcaaag cctccgttac cgtgccttcc 6960atcgtcatgg aaacttttct tcgccgagtg acgtgtggca ctaggcaaag tttttgccga 7020gtgcccgaga aatggcactc ggcaaggact ctttgccgat cccttcgttg ccgacttctt 7080tttgccgagt gcaacactag gcaaaccatt tgccgagtgt aaaagaggct ttgcctagtg 7140tctgtggcac taggcaaaga agacgagtcc tgtagtgaac ctagtaggcc agtgcgggac 7200cattccaaaa aatacctata aaaataaatt taatattaaa ttaaacatat ggtccacgta 7260ccaagatatt aaactcaaaa gaacaattat tacaatttat cttagctaaa aggccgagaa 7320aaagtatatg ttaaaaagga gtgtgatccc atttttatag ctcgctcggt cgatcgcccg 7380tccactttta ggtaacgagg tggtaccatg taggagtgtt gcgttgcgtg cgacttccta 7440tcatgttggg cttaggtggc ttctcacgac ccaatgatag gcgagaagtg tggaagatga 7500acaaacctac ttgtttcgtg cacgacgcat gtgtttgaac aacgagttag attagaaaaa 7560aaatataatg actttttttt ttgcaaaagt gaggataatg aaaaccagaa aaactggtgc 7620ttcataagag tagagatttg atggtaaata tagtagtaat gcaatggcta tactacacgc 7680gagagtccaa tggcaagccg gtgtgttggg gcgaaggcga agacgctacc cttcgctcca 7740ggcctttgtc aactcgctgc accaacagag gcaagatgac cggcgcggcc cacccttcgt 7800cctcttcact gcaagacgaa ggcctacgac gaagtctctc catcccacgt cctcgcctta 7860cctggaggcc cacgtgggat tcggcccatc gtaacaggcc ccgcacggac aggcgtgtta 7920cgggtttgat ttgtaatagc ttttctgtaa tgacagtttg taaccctccc ttatgggaat 7980attctgggga taatccaggt gtctgagggc ataagcgtcc ttacatcggg acgttgggcg 8040ctcgggcacc tataaatacc cccgtacagt gcccttgaga ggctggatta acatagcaat 8100tgccatctcg agttaaacct tgcttgcatc ctttccactc tcccgttgga tcaacttgcc 8160caagagagct agttccaaca tttggcgccc accgttcgtg ctacgagcaa accacccgcg 8220atggcaccca aaagagctag ttcgaaggca gccccatccg tcgacgaagc ggcgaaggca 8280gcactgctag ctgagaaaaa gggcaaggcc ctcgcagaca acacccacca agaagctggc 8340gaagacgaag cactcagtaa gagacagcgc aacgatcaac acactctcga aggcaccctc 8400cgcacctaca gctccggagg ccaaccacaa gtaccacccc taggcttcgc tccactagag 8460ggcgaggaca caacagagga cggcgaagtc atcggcgtct cagcagaaga acaactacag 8520ttatgggccc tgcgcctcaa gaaccgcaac ctccaaaagc agaaagaaat cctcgaagcc 8580aagcgccaac gcgtctccgc gcaagccaaa gtgcgttaga tgatacgaga cgaggagcag 8640agggcccggg aactagagca agagattgcg ctcatgcaga gcgaaggaca gcatgatcta 8700cagcatggcc cacccctcca gcagcgcgcg ccagctagag atttattcat tccccagcgc 8760gggcccttca tcccacacgc cgcagctttc caaggcatca actaccttga tgagcgaagc 8820cccctggcgc cgcaactcca agtgtcacct tggcccgcca acttcagggc agggagctac 8880cccaagtaca atggcagcac cgacccagca caatacatca tgagctatca agtcgctgtc 8940gcatcatccg gaggggacga cgccacaatg gccaagtcct tcatcatcgc cctcgaaggt 9000ccggccttga cctggtacac caggttgccc ccactgtcca tcgactcctg gcgaagtctc 9060cgggacaagt ttctgcttaa ctttcaaggg taccgcccag acatcgatgc cttggccaag 9120ctgtcactct acaaacaaca agagaaagaa accctacggg agtactaccg caagttcctg 9180gctctcaagt cgcaactgcc ctcggtcgac gaccaaatcg ccatacacta cgccatcagt 9240ggccttcggg ctggcgtcct atacagtcac tgcatcaggt acccacccaa aaacctccaa 9300gagctctatc agttgtttga aaagtacgcc agatccgaag agctccatca gcgcaaggtc 9360gagtctcaaa gaaagcccaa ggaccctccg cagtctagcc aaacatggac aagaccttca 9420cagtcagact ccggtcggga caaccgcagt cagcagcagg tgcataacat tgccaaccag 9480caccccgcca gcgaagcccc tcgccgccaa gattatcccc ccagggccgc ggcaatggca 9540cgcgtggtcg gggctgggga cgggcgcaac agccgtgcag atattactgc ctgttttcac 9600ggcgaagact gcacgcaccc aaccaaggat tgtccggaaa cgaaggccac cagggacagg 9660atgtctcggg cacaacccgc cgacaaccca agagttgtcg cgcacacata ccaacaccac 9720cacccacaac catacaacca cggccccgcc cagcatctac ccaaccacgc atatcaacac 9780caccaggagt tacaagtcat accacctcca cccccgcctc cgcatcaacc aaacatccac 9840caccaaaatc accccaagca ccaaaacagg aagacttcgc tgatcagccg tatcgcggag 9900tcattcacat gatcaccgga ggggtccagc attgactttg acacgaagcg acaaaagagg 9960aatcactacc gaagcatcaa ccacgtcgcc atcaccggct cggtcgtgca aacgaagtgg 10020tcacatgtgc cgctaacctt cgacgccaga gatgttgatc tgcgcagcgc accccacatt 10080gatgccatgg taatcaactg cagtgtggca ggctgggacc tgcacaaagt cctagttgac 10140aacggcagcc aggcggacat catcttcctc catgccttcg accgcatggg catcagccac 10200agccctctca agccttcgaa caatccccta tatggcttcg gcggcaaggg caccttccct 10260gtgggcaaga tagagctacc cctatccttc ggcgtagcac ccaatgcgcg aagcgaatag 10320gtcacctttg acatcattga catggtctat ccctacaacg ccataatggg tcggggctct 10380atcaacaaat ttgaagcggc aatccacgga ttatacctct gcatgaaaat tccgggtcca 10440caatgcgtaa taacggtgta cgggaaccag cagactgcgc ataacattga gagagatttc 10500gttcccgatc aacggaacgt acactgcctt acgacgcagc gcgaagtccc cgaggctacc 10560tgcctagctg ccaacaaaaa tgaaaaggca cagctaaaaa gcaac 10605911001DNAZea mays 9aaatggccga agctattttg gatgaagcca tctctcgact attaaacgaa gctgcggaag 60cagttttaaa agaagaatag ttgttattgt aaaaacattt ggaatgtaat atttgctgaa 120caaagtgtgt aatattttta taatttgaat gtaatatata agctgctcgt aactcaattc 180tttacgatgc atgaaacttt acgtacatac cgtttttgag ccttcggcga aaaaacacct 240tcccttcttt tcatgcttcg tgaagaatat ccatacttcg taaaaacatt atgcttcata 300agcaatagat ctctttttca tattagagtt gatgaagttg tacttgttca aaacttattg 360tgccttggca ctgcttcttc gaaacaatct cgaagatcaa cattgtatcc ccttcttgtg 420ttattgatgc aatatgatgt tatgctatgc aaaatgatgt gatgatgtta tgctatgcaa 480aatgatattt atgtcgaaga tacataaaca ttcccacagt agagcacaca atctttttgc 540cgtttatttt tcggcttcac cgcttatttt tcggtgtatc agcgctgact tttcgctgta 600agcctccctt aggtgcttct tcgcctttta cttcggcggt atttgcgttg actttttgcg 660cttcgcctta tacttcggtg gaatcagcgt ttatttctcg ctgtaagctc tgcattccct 720ttggaacgac ttttgagcag aaaacttacg ctgcgctccc ttagaaatga ctttttgtaa 780cttcggcaaa cttacgctgc gtttcataga acgacttttt tgtagtttcg gagatacttt 840ctgtagccac aagttcttaa gaacgagttt tcatgcttca tcaacttttt gaattccgta 900agtctgtgga gaagatatat tttcactatg acaaaaacaa agctgttaca agaaattgaa 960aacaacaaga aaaacttagg ctttcaatga ttgttcttta ttaaaaagaa aaatgataac 1020taatgcaaga actatttcag aagtaggata tctgttagta gatgtgcttt gactctggca 1080caatactgtt gactgtgcga gcttcggact cctctctgaa gtctcgttgc tgatgagtgt 1140gctggctccc ttctggctgc tggcctcgtt gtattggtgg tggaggtgga agctgttgcc 1200aagatgcctg aggttggctt gccgaagcaa cagaagctgc aggatggtta cccttaccca 1260catactctgg aatgtacggt gagtggtacg aagcagtatg cataacttgc ttcggctggc 1320tctgttgggc tgcagtttct gctatctctt tctgtttctg gatggtgaca tggcacatcc 1380tggtagtatg gcccttgtcc tcaccgcaga atagacaata aattttcctg ggctgatccc 1440caaaccttcc tccgaagccc ctggctcctc tgccccttgg ggctggaggc cgaggataac 1500tctgttgctg ccccgaagcc taggaggaat attgcggcct ctgttgctga cttcccctgt 1560cgtcattctg agtggaatga attgatctga catgtctagg gtgtactctc cctccgaagc 1620ccctggtcat ctcggagaat ctgtaagaag gcccccaaca cctctctttc ccattcgttt 1680aggtgagaaa acacattggg gagcactagg gagttctttc ctagtgaggc gtctgtgatt 1740tgataatcac aagattaagg atttcattag tgcatgtgta gtagcaagtg tgcatccacc 1800ttcctcatta agcttgttta ggataagcca gagtttgtgc cggttactct tgatgttcaa 1860caacaccaag atggcttggt ggtaattaag agcttggtga tctctcagtg gtgctcgtga 1920gagtcccaac tcattgtgta ataaaagatt ataggtgatt caccatgccg gagtggtgaa 1980taatcaaccc gtagagagca ttgagtcctt gaatggatcg atggggggct acacccttgt 2040gtgggtcaag tcagagtttt agcagttctt gcacccatga tctcatcgtg aagcatagat 2100aaatttaaat tcttttgaat tatttatata tgacaacact attcgtcgct ctaggtgact 2160atcacctacc ctaaaatgac ttaacaaatc tttattaatt gttaagtcat tcacattttt 2220gttaatccac tccaaagtca gggtgtttag tgtttttaca tccatgtctc cttagactca 2280cggtgtctct cccagattct ctctcaccct cacctctctc tcactagcca ctagggaacg 2340caacacccat cgatggctct tcgccccatg aaacgttcac acaatcgcaa ttgtcgaggc 2400atgcatggct gggagagcag acatggaggc atacgtgcta gggttgcaca tgggcaagag 2460ggtgggtgtg gctattcaga tatgcatggt gagcaagatg ggtgaggttg tgggcatgat 2520gaggggataa ggaagaataa gatctctttt gttaggctgt ctccagcagc tatcgtatcc 2580cattccctat cgcatcccct attttaaact ttactatgca aacaatgtaa tatatagtgc 2640agattcccta ttttacacaa tgtgttgtag acaaccttgg agctcttgca taaaagctct 2700agttttggct ctagctcctc tgagaaaaca atccccacca tgtttttagg aagaatccct 2760gaagggcacc ccatttggtt ggaaatacat ctcctcctac aggattatgt ttgacttttt

2820tttgcaatgt gggacccaca ggggagagga ggacgagaag gaaccggaga gcctattttt 2880tgggctcctg gcttcgcttg gtttctaggg gcggctcctt cctattttca caaaggagct 2940agtagaggag cctcccattt catgattttt tgaaggatct atttaaggag ccttgaaaga 3000gccctaccaa ggtaggccta gaaataataa aggaggaaaa agagaaggta tcacaacttt 3060tgtctacaac gtgaaaatgt ttggctaaat agataaaaca gtttgaattt tatcgattca 3120attgtttatt gagggcatgt ttgggagggc tttagttcta gcttctttcg cgaaaaatcc 3180agagccctac aaaatgacgt ttggtaaaac gacttcttcc gaaaaacacc caaaaaccca 3240agatatttta tactacgaag gaaaggtcac acatcctagt tagcttcact ggttctagct 3300ccttccaatt ttgcaaaaaa gtcacaaagg ataagccatt ttttcaaatg atttgtgaaa 3360tgcctacgct aaaaagtcta cttttccaaa aaaactagag ctagagccgt ttttggcaag 3420tcagaaccct accaaatagt ccctcagttt aagcaaagtg aggctatact gaagctaaat 3480tatgccaaat tgggcctaca tctccatatt ttcaaccaaa tgctttaggg tttcttgtaa 3540tcgacatgat ttgtttcttc ataaatagta tatggaccgc tccaaaatac tccatccgtt 3600tcaatttata ttacgtttga tctttttacc ctaaatttga tcgactcgtc ttattaaaaa 3660agttcataac tattaataat ctttactgtg atatcattta gcatataata tactttaagt 3720gtagctttga tttttttttt gcaaaaatta aatgaaacga cccaatcaaa cttgataaaa 3780aagtaaaact aattataaat ttggacataa ggagtaggag ggtgtttgaa tacactagag 3840ttaatagtta gttgtcttaa aatttgctag tacaattagc tagctaacaa atatttaggt 3900aactattagc taatttgcta aaaacagcta atagttgaac tattagttga actattagct 3960agactgtttg gatgtattca actaatttta gcagctaact attagttata gtataatatt 4020caaacacctc ctaattaaaa tggacaaata tctattccct tggtcccttg cgttagattt 4080tccatatatc ctcatttagt ataaaaagaa tcatcaaaaa gtggacaacc cctagtggaa 4140caccatttta gtagtggttg catgaaacct ttcgcgcatc agttactatg tgtcactcta 4200aaaatggggc agcatgtacg cagtgcctat atttatacaa ggcatctatc gttgcctcct 4260cagttcatca ctaatcacac ttattgtgcc ctcgacgagt atctagctag ctcattaatc 4320gatcaatcgg ggtgtgcggt cgaaggcggc aatggcgagc tactcgtcgc ggcgtccatg 4380caatacctgt agcacgaagg cgatggccgg gagcgtggtc ggcgagcccg tcgtgctggg 4440gcagagggtg acggtgctga cggtggacgg cggcggcgtc cggggtctca tcccgggaac 4500catcctcgcc ttcctggagg ccaggctgca ggagctggac ggaccggagg cgaggctggc 4560ggactacttc gactacatcg ccggaaccag caccggcggt ctcatcaccg ccatgctcac 4620cgcgcccggc aaggacaagc ggcctctcta cgctgccaag gacatcaact acttttacat 4680ggagaactgc ccgcgcatct tccctcagaa gtgagtccga tgctgccgcc attgttctcg 4740catccatcca gcatcgtacg tcctctatac atctgcggat gatcatttgc gcatgtttgt 4800ggcatgcatg tgagcaggag caggcttgcg gccgccatgt ccgcgctgag gaagccaaag 4860tacaacggca agtgcatgcg cagcctgatt aggagcatcc tcggcgagac gagggtaagc 4920gagacgctga ccaacgtcat catccctgcc ttcgacatca ggctgctgca gcctatcatc 4980ttctctacct acgacgtacg tacgtcgtca cgaatgattc atctgtacgt cgtcgcatgc 5040gaatggctgc ctacgccgtg cgctaacata ctcagctctt tccgatctgc tgcgccaatt 5100tgcaggccaa gagcacgcct ctgaagaacg cgctgctctc ggacgtgtgc attggcacgt 5160ccgccgcgcc gacctacctc ccggcgcact acttccagac tgaagacgcc aacggcaagg 5220agcgcgaata caacctcatc gacggcggtg tggcggccaa caacccggta actgactagc 5280taactgcaaa acgaacgcac agactccatg tccatggcgg cccacaaggt cgatgctaat 5340tgttgcttat gtatgtcgcc cgattgcaca tgcgtagacg atggttgcga tgacgcagat 5400caccaaaaag atgcttgcca gcaaggacaa ggccgaggag ctgtacccag tgaacccgtc 5460gaactgccgc aggttcctgg tgctgtccat cgggacgggg tcgacgtccg agcagggcct 5520ctacacggcg cggcagtgct cccggtgggg catctgccgg tggctccgca acaacggcat 5580ggcccccatc atcgacatct tcatggcggc cagctcggac ctggtggaca tccacgtcgc 5640cgcgatgttc cagtcgctcc acagcgacgg cgactaccta cgcatccagg acaactcgct 5700ccgtggcgcc gcggcaaccg tggacgcggc gacgccggag aacatgcgga cgctcgtcgg 5760gatcggggag cggatgctgg cacagcgggt gtccagggtc aacgtggaga cagggagcga 5820ggtacgaacc ggtgaccgga gaaggaagca atgccgatgc cctcggtggg ctcgctaggc 5880agctctccga ggagaggaga acaaggctcg cgcgccgcgt ctctgccatc aaccccagaa 5940gctctagatg tgcgccctac gatatctaag acaagtggct ttactgtcaa tcacatgctt 6000gtaaataagt agactttatt ttaataaaat ataaatatat atatattctg ataaccaaga 6060ttcgaaccct cacttataca caattttatc ttatttttta taaaatgaga atggaaagga 6120ctaccgtgaa cgactataga accaatcata ctagtttaaa atgctcgtaa gctatgacga 6180acctagtagg ccggtgctgg accattccaa aaaacctata aaaataaatt taatattaaa 6240ttaaacatat ggtctatata tcagatatta aactcaaaag aataattatt ataatttatc 6300ttagctaaaa ggttgagaaa ggtatgcgtt aaaaaagagt tttaacccat ttttatagct 6360tatttgatcg cccgtccact tttagggagc gaggtggtac tatgcagaag tgttgcgctg 6420tgtgcgactt actatcatgt tgggtttagg tggattctca cgacccaatg atagacgaga 6480agtgtgggag atgaacaaac ctacgcattt cgcgtacgac acatgtgttt gaacaacgag 6540ttagattgga aaaaatataa tgaccttttt tgcaaaaatg actacaatga aaaccaggaa 6600aaccggtgct tcataggagt agagatttga cggtaaattg ttacgatcta ctggtatttg 6660ctgcgaggat gtattcgctt ggtgaaaaca gaattacaga gtagcagtag cagggaagac 6720agtagcgaga ggagaagaag aaacttgagg aagaagaaga taaatgtagt tgttacatcc 6780tgccttcgcc gtaggtctca gcgagcatat atcttcaggt cctccattct gggccctgga 6840atctcacatt ggccttacgc tggcgtgttc ctcttctcgg cccaactgta gtcttctctt 6900gaggcccacc agtctccaca ttcctttgtt gctgctatag ctcctcggac acggctgctt 6960ccgcctgctg ctgcacctgg atgtcttctg aagtcgactt gcgtggaggg acagtgctgc 7020cattcccctc ccgataacac gctgcttgtc cccaagcagg cgctcgaggg aacctctgac 7080gaagtggaat caggtcctcc caagttgcca gagatggatg caactcagac cacagaatca 7140accgttgtga tgctccatta ggcccccatc ggcattgtag tactcgttca ggaatttggt 7200ggttgtcaag gcgatcagga agctgtgcca ccaccactaa agaacccact gccttcttga 7260gttgtgaaac atggaacacg ggatggatag tagaagtaac tggaagctcc agcctgtaag 7320caacagatcc caacttagca gcaactggaa atggcccaaa aagcgaaaat ctagcttctg 7380atttgcccga ggtgcaagcg atgactgcac ataaggctgc ataacagcct tctcttgcag 7440ccactctccg aggataggca caggagtgga atccaaaatg tcaatgccaa aatgcttggg 7500tgcataacca tatagcacct caaatggaga cattttcaat gctgaatgcc aactagaatt 7560gtaccaaaat tctgccaagt acaaccagtc aatccatttg tgaggacaag catgcacaaa 7620acatctcaaa aaggtctcaa ggcactgatt aactctcttt gtttgtccat cggattgagg 7680gtgataggaa gaactcatat tcagtgatac accagccagg gtaaacaatg atttccagag 7740ctgactagta aagattttat cgcgatcaga gaccatagca gatggcatac catgcaggcg 7800ataaatgtgt tgcataaagg ccttggccac cactgcagct gtgaaggggt gtttaagggg 7860aatgaaatgt ccaaacttgg agaatttgtc aaccacgaca agaatacaat ttttacctcc 7920ggacacaagc aagccttcaa caaaatccat agtgatcgtt tgccaagctc ctgaaggcac 7980atgaagaggt tggagtaggc ctgggtattt cactctctca ggtttagctt gctgacaagt 8040ggcacatgct gcaatgaact ggatgacaga tttcttcatg ttcggccagg caaacaactg 8100cttcagccga tggtaagcga ctgctatacc cgagtgaccc ccaacagcag aactatggag 8160agcagacaat atagactgtt gaagtgtatg attgttacca acccaaatac ggcctttaaa 8220cttaagcaac ccttcttgaa gagtaaaatg aggaacaaca tcttggtcaa caaccaactt 8280agataacaag gtcttagccg aagggtccaa caaatatcca tccattacca aacgagtcca 8340ctatggtgaa cagactgaga gagcatgtaa tgtgatagca tgttgtcttc tcgacaaggc 8400gtcagcaact ctattttcat gtccatgctt gtatacaatc ttatattgca accccaaaag 8460tttagtaaag acttttgctg ccatggagtg ttaagccatt gctcattcaa atgcaccaaa 8520ctcttttggt cagtataaat aacaaactcc ccatgaagta agtaagctcg ccattgttcc 8580accgcgacca atatggctaa gtactccttc tcataggttg ataagccttg agtcttaaca 8640ccaagaggtt tgctgagaac gctaatggat gaccattctg caaaagtaca gctcccaccc 8700cattcttgca agcatcggtc tcaatagcaa aaggttggtg aaagttggat aatgctaaca 8760ccggggctga gatcacagct tgcttcaagg tattgaagga gatttcttga tcttgagtcc 8820aaacatagaa cacccctttc tttcacagtg catttagagg tttggcaata atagcaaaat 8880gactgacaaa tcgcctataa taacccgcca aaccaaggaa gctccttaac tctttaacat 8940tggagggcac aggccagttc aacacagcat caacctttgc aggatcagta tccactccag 9000cagcactgat cacatgaccc aagtaagcaa tagatgtttg agcaaattta cacttagact 9060tcttgacaaa ccagtggtct ttttggagaa tggtgagaac ttgggccaag tgagatacgt 9120gatcgtcaaa tgacctgctg tagactaaga tgctatcaaa gaagactaca acacacttcc 9180tcaacaaaag ggccaaagaa gagttcatag cgccctgaaa ggtaccaggt gctcccgaca 9240atccaaaagg catgactcaa aactcaaatt gaccatgatg tgtctagaac gctgttttaa 9300actcttccct aggcttcaac ctcacccgat gataccctga agccaagtcc aatgtggtga 9360accaacatgc accatgcaac tcatccatta actgttcaaa gatggggatg gaaaacgggc 9420tttgaccgtt aaagcattca aatagcgata atccacacaa aactgaaaag tgccatcctt 9480ctttctcacc aacagcacag gagaattaaa agatgatgca ctgggtctga taatccccga 9540ctgaaccatt tctgccacct gacgttcaat ttcatctttc ggagctggtg ggtatcgata 9600gggtctgata ttaactgggc tggcaccagc aaccaatggt atactatgat cacaacttct 9660ttctagaggt aaggacatag gtttagcaaa aaccgactga aactggttca acagctgaac 9720aatttcagga ggaagggtag cctcatcagt cagagcaaca gacacttgag atagggaaat 9780ctgaaccaac aattcgtcaa ccggctctac agtttcccct tgcaagagca cttgcacacc 9840atgataaggg atcaacatcc atcgttgttt ccaatgcact tccataggac tgaaagtctc 9900taaccaatca agtcccaata tcacatcaaa ggattgtaat ggcaacacct tcagatcaaa 9960ggaaaaccca tacccctaaa ttgtccattg agcttgagta aacacttggg agcaagtcat 10020aacaccttca ttagccactt taacctggag acaagttggt accaaagtaa tgtcggttaa 10080cctggaaatc atagtgtgac tgataaaaga gtgtgaactg cttgaatcca caagaataac 10140gatttcatac tcctgtatag accctctcaa caaaatggat cctttggccc ttgatcttga 10200aacagcctca gcagatagag ccaagaagag ttgttcctcg tggactgaat cttctgatgg 10260agaagactca ttcagaaatg caggcataag atcccagact tcctgcaggg catgaagttg 10320aactgtagta ttgcagacat gtcccctatg ccatttttca gcacaacggt cacaaagacc 10380acgtgcatga cgataagcac ggagtgcagc cagtttttca tcagtgcttc gatcagtctg 10440catgagacgc tgcttaggta aggatctcgg cgcactggca cttgctgcag cagtagttgg 10500ttgaggaaga ggcagggtcg tcttgtgctg ggatttcgac caaataccac gatcaccctt 10560gcagaattca ctccgcatag gtggagcagc gacctcatcc tgcaaaaaag cgagggaaca 10620ggcaatatcc agatccatcg acctttgaag cataacaaca gcacgcatat catcgcaaag 10680accatccaca aaccgtgtaa caaaatataa cggatcaacc ccagactcat acacagagag 10740ttgatctacg agagaagaaa attgttcaat atattgagtt aaactaccta actatttgat 10800gcgaaataag tggtgcaaca acagctgata ctggtcctta ccaaaccgtt cgttcatcaa 10860ttgacacaag agaggccaag acaaacgggg atgacgaaac ataactgaca gaaaccagca 10920cgctactgta ggcgacaagt gcatagaggc aattttgacc catgaattca gatcaacttg 10980atatatatca aagtaattct t 11001101284DNAArtificial SequencecDNA 10atggcgagct actcgtcgcg gcgtccatgc aatacctgta gcacgaaggc gatggccggg 60agcgtggtcg gcgagcccgt cgtgctgggg cagagggtga cggtgctgac ggtggacggc 120ggcggcgtcc ggggtctcat cccgggaacc atcctcgcct tcctggaggc caggctgcag 180gagctggacg gaccggaggc gaggctggcg gactacttcg actacatcgc cggaaccagc 240accggcggtc tcatcaccgc catgctcacc gcgcccggca aggacaagcg gcctctctac 300gctgccaagg acatcaacca cttttacatg cagaactgcc cgcgcatctt tcctcagaag 360agcaggcttg cggccgccat gtccgcgctg aggaagccaa agtacaacgg caagtgcatg 420cgcagcctga ttaggagcat cctcggcgag acgagggtaa gcgagacgct gaccaacgtc 480atcatccctg ccttcgacat caggctgctg cagcctatca tcttctctac ctacgacgcc 540aagagcacgc ctctgaagaa cgctctgctc tcggacgtgt gcattggcac gtccgccgcg 600ccgacctacc tcccggcgca ctacttccag actgaagacg ccaacggcaa ggagcgcgaa 660tacaacctca tcgacggcgg tgtggcggcc aacaacccga cgatggttgc gatgacgcag 720atcaccaaaa agatgcttgc cagcaaggac aaggccgagg agctgtaccc agtgaagccg 780tcgaactgcc gcaggttcct ggtgctgtcc atcgggacgg ggtcgacgtc cgagcagggc 840ctctacacgg cgcggcagtg ctcccggtgg ggtatctgcc ggtggctccg caacaacggc 900atggccccca tcatcgacat cttcatggcg gccagctcgg acctggtgga catccacgtc 960gccgcgatgt tccagtcgct ccacagcgac ggcgactacc tgcgcatcca ggacaactcg 1020ctccgtggcg ccgcggccac cgtggacgcg gcgacgccgg agaacatgcg gacgctcgtc 1080gggatcgggg agcggatgct ggcacagagg gtgtccaggg tcaacgtgga gacagggagg 1140tacgaaccgg tgactggcga aggaagcaat gccgatgccc tcggtgggct cgctaggcag 1200ctctccgagg agaggagaac aaggctcgcg cgccgcgtct ctgccatcaa cccaagaggc 1260tctagatgtg cgtcgtacga tatc 1284111140DNAArtificial SequencecDNA 11atggcgagct actcgtcgcg gcgtccatgc aatacctgta gcacgaaggc gatggccggg 60agcgtggtcg gcgagcccgt cgtgctgggg cagagggtga cggtgctgac ggtggacggc 120ggcggcgtcc ggggtctcat cccgggaacc atcctcgcct tcctggaggc caggctgcag 180gagctggacg gaccggaggc gaggctggcg gactacttcg actacatcgc cggaaccagc 240accggcggtc tcatcaccgc catgctcacc gcgcccggca aggacaagcg gcctctctac 300gctgccaagg acatcaacta cttttacatg gagaactgcc cgcgcatctt ccctcagaag 360agcaggcttg cggccgccat gtccgcgctg aggaagccaa agtacaacgg caagtgcatg 420cgcagcctga ttaggagcat cctcggcgag acgagggtaa gcgagacgct gaccaacgtc 480atcatccctg ccttcgacat caggctgctg cagcctatca tcttctctac ctacgacgcc 540aagagcacgc ctctgaagaa cgcgctgctc tcggacgtgt gcattggcac gtccgccgcg 600ccgacctacc tcccggcgca ctacttccag actgaagacg ccaacggcaa ggagcgcgaa 660tacaacctca tcgacggcgg tgtggcggcc aacaacccga cgatggttgc gatgacgcag 720atcaccaaaa agatgcttgc cagcaaggac aaggccgagg agctgtaccc agtgaacccg 780tcgaactgcc gcaggttcct ggtgctgtcc atcgggacgg ggtcgacgtc cgagcagggc 840ctctacacgg cgcggcagtg ctcccggtgg ggcatctgcc ggtggctccg caacaacggc 900atggccccca tcatcgacat cttcatggcg gccagctcgg acctggtgga catccacgtc 960gccgcgatgt tccagtcgct ccacagcgac ggcgactacc tacgcatcca ggacaactcg 1020ctccgtggcg ccgcggcaac cgtggacgcg gcgacgccgg agaacatgcg gacgctcgtc 1080gggatcgggg agcggatgct ggcacagcgg gtgtccaggg tcaacgtgga gacagggagc 11401216619DNAZea mays 12atcttttatt ggtttgagtt gaacctatat gcacctgtag aatataatct agagcaaact 60agttagtcca attatttgtg ttgggcattc aaccaccaaa attatttata ggaaaaggtt 120aaaccctatt tccctttcat ccgggccctt gcggcggacc gtccgcgaca ccagggtgag 180ccttggacag gaacactgca aaaacacaag ttaacactac ggatcgtccg atggagaagc 240gagcaccgtc cgagaccaag cacggaccgt ccggcctcag gcgcgaatcg cccggtcgtt 300gaaaaaccag aaaaacccga aggtgacggg ttcggtaaaa tgcattttta gcgtccttgc 360ggatcgtcct gggtgcacgg tcggaccgtc cacgactgct ttatctgaca tttgacgacg 420cattaaaagc tctatagccg ttactcctga ccgttgtgat ttcagtcgtt gatgtgcagg 480ggtacggacc gtccgcggtc ggtagaaaat gagcaacgac taggaagtgg ttggaggcta 540taaatacaag agaaaattcc tggtatgcca tcagatttta tactcatccc ttgtgtgcca 600ctgagtggca tataagtata tttttttgtg tctatgacat gtggggccag tggcatacaa 660ggaatgagta tatttttcag tggcatacag ggaattggcc ctaaatacaa ccccaaccac 720ctccattcaa atgatccaag cactccactc attcacattc aatacaggag ctagcaatac 780attccaagac acactcaaag ctttcaatct ctcaaagtcc cacaatttag acaagtgatc 840attagtgctt agtgacttga gagagtgtga tctatgtgtt atttgtcgct cttgttgctt 900ggctttcaca attgggcttt cttcatctct ttctcaacct tctaagtgaa ttataaagca 960agcaagagac acctaatttt gtggtgatcc ttgtggggtc ttagtgaccc gtgtgattaa 1020gaagaagcac tcgaccggtc taagtgaccg actgagagag ggaaagggtt ggaatagacc 1080cggactttgt ggcctcctta acggggacta ggttctttgg aatcaaacct cggtaaacaa 1140atcgctgtgt ttatttgtgt tgattttcac tcgatttgtt tcccctccct tcctctctct 1200aaaattccct tgctcatatt gttgtgagtt ggctctcaaa gttatctgca ttgattgggc 1260aactacttgc aaggataact atcttccgca ctccgaatta tttctgacat taaccccggg 1320cataatgtgt gttttaagtg tataattttc atgtttcgcc tatttacccc cctctaggcg 1380actttcaaat gttctccttc acttgtgatg tctacaacca taatcagctc aacatttgga 1440ctatcaccct tgaacactta tgttgaactt taaaagttgt gcactaagca cttgtccaac 1500acttaacaca cttgtcagtc ctttaattgg gttgtcatct aaaccaccaa aaaccacaaa 1560gagatctttc accggggtcc gtggttcatg gccgtactgc tcggtctcaa gatttttatt 1620ataaaatcac tagagctcta ctatttatgg ttcggtgtgc catcgaaccg tctccaacgg 1680gtacatccaa cgagcgccag cacaccaacg actagttgac gtggtctacg gtccagaggc 1740tcatcagact tgtcaggagg ctcgtcgact ggtctggtgc cacacccgtc gacttgggat 1800cgaacaagaa tgatggcaag aggacgaagc gcatcaagaa gatcatctac tacgactcct 1860cttacccttc acacaaggac gacgattcca cctcctccaa gaaaaatacg gttaaacaag 1920gttactctaa gacatctttt atttattctc gcattcctta caatttcaat gctcatttgc 1980tttctattca tcttggcaag cctgctcgct ttgatgggag gactattctt ggtggagcca 2040taaaatgcgt agccatattt ttttgctcca ccctagcatt tgggatgtcg tagacaatgt 2100aatgcaattg ctaggtagcg atgataaaaa ttataatact attattgccc aataatctat 2160tcataatagc gcccaagcta aagaggtgcg ctaagggatc ctttttatag cccaaagagg 2220tccataggcg ttgctccttc cttctaaaca tcgatgaaat tgtgttgtct gcgaggacat 2280caaaccgggt ctgtgcacac cttctccggg atctgatttg gtgctttcct taactgattc 2340acagcggacc gatccgatgc accaccggac cgtctgccac atcagctagt cgttggcctt 2400caaactccac cgggaagtag ccgttggagg cggtccggtg cgccattgga ccgtgcaaag 2460tgaagggtcg atgattttta taaataaaat ctcgagacct caaagttcag ctctgtaggc 2520ggtccggtgc acatggacct ggttcgatgc attaggtcgc ctaacactaa gttctatgcc 2580ctgcggactt gatccggtgc accaccagac gagtccgatg aggcctagaa caacccaagt 2640aaggctgttt tgagcctaac ttcttcaaat ccttttggct attcttggga gctttccaac 2700aacttagaca aacataatta gcacatattc caattgatta ggtgtggaga actcaccttt 2760tactttgtcg ttcaccatga tttgcatttt ggcttaatct aagtattcga accacttttc 2820tcacaggata gagttagagt tcaaataaag tgctaaacac atagtattag acacatgcaa 2880cttatctaag taatcaaacc tcatgatttt acccttttgt ccaaagctgc acactttagc 2940cttcatttta gttctttagg atctagtact ttcaaattga cttcaagtgc ttgtgctcgt 3000actcatatca aattagttag tccatgttgt tgtgctaaac acttaatcac taaaacatgt 3060agaaatggtt atctaacaca tttttctttc ataagtaaaa caggagattt atattgtaga 3120tgttattgtt tgttgatgaa atttgaaata agagatataa gagcaactcg aaaagcctag 3180ctaaatcgat ttgtatcggt aaaaatagaa aactgatgat taaaatagga tccaacaaac 3240tctctttgct cctctctatg ctatcctgct cagcatcacg tcgaggtctc tagccatatt 3300tgctgagctc acctgcctcg ccatcttcat tctctcgtgc atcctcaccg tccctgcgcc 3360ttgccgtcgt tgcagctcac tgtcccacgc cctgctgtcg ccatgcctcg ccgaccctgc 3420ggctcacctt accgcgcctc gtcatcgtcg cggctcgccg tctcgtgcct cgccaacccc 3480gctccttgac gtcctcacgg cttacagtcc ccatcgtcct cgtggctcca cgcctcgcca 3540tatttgcggt tcactgtcca cgtgccctgc catcctagag cccgacatcg atatgtcaaa 3600tgagacagga gaaaaagaaa tgaacatgtg attataatca gtgatttgaa tattgataga 3660taagatttga agagtctgtt gtgtcatatc acctttttat gaaactcttt attttttaga 3720gtttttataa aactctaaat ttagctaaaa ttatatctag tcttttagag ttactctaac 3780aagagatgag atagcgagct ggcgagctgc tggagacagc cgagagtaga gatagtagag 3840gagactagaa actccattag gcctagccca gagtcagcta gggatgccgc ccgatccact 3900acgtactgaa agcgatcccg gcccatgaac gctagtgggt tgtaattcgg cttacccaag 3960tacccagccg ttcctccaca cctctgcact acccgaaacc cacgcccaac ggccgtttcc 4020cgcaccccct atccgggaag gaagaaagcc agaactcacc cctgcttcgt ctggcggcgc 4080cgtttcccgc acgcgatgcc gtcgacggcg gaggatttcc cggcgataag gaagctgggg 4140aagctcttcc ggcttaccga agtgtacctc tggtaagtct cctgtctccc ctacccgctt 4200ctcagcgtag ggtttgccgt ttgcgaggag tacgtctcct caaactactc tctcttcctg 4260cagggacgat tcgtatggcg ctggacctca cgatggacag

aagaacgggc gctcggcgga 4320ggctgctctc gtggtacggt ctctgacccg ctttggtgta gctcactctt aagctttctg 4380agttgggggt gcgcttgtgc ttcgactagt agatggctaa tttcgtcggg ctactggtaa 4440tttcttggta tctgcattgt cgagaaagaa ggcccgacgc ataatttcat gcttgcccaa 4500gagtctactt aacacaagga attggttttg tggcgtggtt tgtgcattgc gcccaaactg 4560tagcctgtac aaaatgttaa tcgtcgcgtg cattttaaac aaagttttgt attatacgac 4620aaaataccct cggcacattt gttacagact accgataagt gcatacctat ttctcctagt 4680tctatcagga aataatcctg gacctcgaaa tgacagcctc gtctggctag aaccctacta 4740aacattttga gtgatcactt ttcattactc attttcttga tgaaagcaca ttactgacat 4800ggaagtttgc tacataagac ataacacttc cttgtagtgc tttatttaat tattgaccga 4860tgatcttttt ggaaaattaa gctgtattaa acaattgtag cttcggtgat gattgttgga 4920ttaagcatta gtctgctgca gtcctcttct tgattctgat atgacagtta tttgttgatt 4980aaaataatga tggtttgctt tacacttcga tctcctttga gaggaaaaca tgtgaaggtg 5040tggactagat catgtataga ccaacagcat tatcttatta aaacacttct aaataactta 5100gcaatttcat aaccattttt acacttctga ggaattcatc ttgtcgtgaa agagtcaatt 5160aacttagctg cttagcagac tgtgtcaagc ttattacttg tatgttgtgc cctacaaatt 5220actatgaggt ttataatgta catagcaatt tgacgacctt caacttttca ggactctcat 5280acagataaaa catgcaatga agcatccaat agcactgaca aaggttagat gtatttttct 5340tgtattctag tatcttcctt ggtcaatttt ctttacagag gatgttacaa tgtactctac 5400tttttttgtg tggaaacaac ccactagaga aaaaaaatca cttcatttga gaaatcttaa 5460gaatctgaac tctgaagctc agcatgcttc cacaccacca cttttcaggc cactgtctct 5520tcattggagt aaatgacgct tcttttagag agaaagagag gggggggggt ctgtttataa 5580ttgaatcaag agattttatt ggtcacctga tttctgttgc atgacgtggg acctggatag 5640acttcagatt tgccttagtt gataagttca ccggcactag tgaaagaaaa gtatatggta 5700caggtactct tatgaacagg caaccactta gcaattcagc atctaataga gaaggaccaa 5760gtcttcaact aaagtcacaa tataccttta gtactattag aggggctgac cctccttgtc 5820tagtgcttgt gagatcatag gagatggggc gtggtggtta agattgtgga tcatattcgc 5880caagctccca gggtcaatgt gattgaggga tgtggtatgc tacttgtgaa agggttcaaa 5940aaggccagga tgacattgtt cctacattcc tggaatgggg atgacacccc caggacaagg 6000aaggtggcac agttccacgt tcctggtgac atgttgtgta tcaaatggga ggccaaatca 6060ccacgggatt actctaggaa gggaagatga tgttgataat ttgagtcatt gttgcaacat 6120ctgttcatgg tttcatgcct cattttataa gtcatattgc ccacacataa cattgtaata 6180gtaaaatcaa caccagttat tttacgtttt cccttgtatg tcaaccgatt tcttactgtg 6240tatatgatct gtctatcaat aggccatttc tttgttgaag atttggaatt ggtcaatctc 6300atgggttctt tggggcttcc tgtttcattc agcacaagta aagtggtcag ttgctcacat 6360agtgtgacac catgttactg ttccctaccg gttccattgc ttaattcctt tatgcattgc 6420agaacaagaa cacatgcaac aagggaaaga aaaaaggaag acaagcaccg ctcaaagcag 6480caaacactca aatcaatgat gctgtgagga tatgtatcaa tactgaagat agagaaaatt 6540ctgttgaatc attggatgct atggagcaaa cgcactcatg caatttattt gtgacaccac 6600tgggtcaaaa tgaaccctcc cgtgatgaca ctgacaagag gcttagggaa gacagctctt 6660gtgttgaaga acaagaagag tctggctgta gcaccatcta ctctgctggc aaagcccctg 6720gctgtgatgc taaaaatcat ctcactgaac ttggggcttt tgagctttct gataacttgg 6780ccaactcagc aaaagaagaa tactcaattc aagaaaatca agcttatgaa agtgtgttgc 6840tagattctga agagatgtca aggaatgact gtgttgatga tgaatctaca cattcctgtg 6900ttggcattta tcaggatgaa agagtgtcca caaggggaga tcaaacatct gaagaaactc 6960tatcagtacc ccatgattac aatgatgttg gcagagaagc tagtctaagt ttggcagagc 7020catcatctat tgatgagcat gcacaaagct ctgccaacaa cttttactat gactatggtg 7080aatggagggt tatctgggat ccattctata atcggtatta tttttacaac atccagacac 7140aagagtccac atggtgtcct cctgaaggac tggaggattt tgcatcatat tgtagcccag 7200ataccactaa agagctagct gaactgggat ctcagtgttc aagcatggca ccacaagaga 7260acagtaaaaa ccctagtctc gtcctttgca gttgacatta cgaatagtta tatgcactac 7320gataaaaact ttctacaata tgtaacactt gagcatgtgg caatgggtgt aaacatttaa 7380taataaggta gtgaaatcca ttacacacag tattgaattt tgcactacaa atgctgaagg 7440agaaacctaa attgtcaatg ctttttggtg acattaatta ttgccattga tttcctgctt 7500gtaggtgctt catttatctg tctccaattt actcatatgc tagcttcttg tttgggacta 7560aaggctttgc tgttgtttta gtatgtcaca catttctctt taatctcacc atcacagatc 7620tggctactca tgtcaatcat ttagaagcac aggagcaaga tcactgcatt catgatttat 7680ctgacattcc tgttgaaaag ccaatatatc aaaggtaggg aataccaaac tgtacaatgt 7740tgaacaagtt attgtttttt tttgttaatt ctgttcatct atgcagtatg ataactacct 7800ctgacaaagc acagcacact gaaaataagt acagcgattc aacaactact gtgttagaga 7860tgaaccagga agttgctagc accaaaacga aaaagagagt aaggagatct cgatcgtgta 7920aggcgataat atatggcatc tgctttctag gagtttgttc ctgttacaat tttaggttgc 7980gcatttacac aatagtttct tggtttcttt gagcaaatgc agctttgcat gactgctaca 8040ttgcctactt atgtctaggt aacttttctt tgcaaactgc aaagttatgt ctaggtaact 8100atgccttcta gaaaacctcc ttgttagcta tgtattagtg agacttgcct aatatttatt 8160ttcttgtggt ccgttcttgt gctctttgta catatttgcc aataaccatt ttaattgttc 8220tacagatcat tcatgccaag acatggcagg gaacgtctct aatgacatca tcaagtactg 8280ggctcagcgg tattcacttt tctcactttt tgatagtggt ataaagatgg atgaagtagg 8340gtggttttca gttacgccag agccaattgc aaagcatcat gcatctcgtg tgggtgcggg 8400agtaatgatt gattgtttca caggagttgg tggaaatgcc atccaatttg ccaaaaagta 8460cgtcaatgtt atcttgcaat tgagttatgt gatggtctaa tgtatcattt gcttgaacac 8520ttcctgttta gtagcaactg ttatttttct tatgtcacga gaatgcaatg gctatatcac 8580cttaagcagt atgctatgtc cactgtccag tttaactaag gcatctgctt ccagtaatat 8640gcaaggctct tcttactttt gctgttattt aatatatgga agtgtcctta cggaggtgtt 8700attgtggaca ttttgagcat gttcatcatg tcacttgagt tagtagagcc agccttagtt 8760gtttgcagtg taggtggatt tattttatgt tatcaatgtt tcttctacag tactaagact 8820attgttccac attaactatg tctccttttc caggtgcaag catgtaattg cagttgatat 8880tgatccacaa aagattgatt gcgcgcatca taatgcatcc atttatggag taaatgatca 8940catagatttc attgtaggtg attttataca tatagctcct catctgaagg taatgccttt 9000ttcttggaat tattactttt aagtttctca acacgtcact tctattagct atatgttttt 9060gtagctgttt gcgagagtga atttattgtt gacattgttc tcatttgccc acccatttta 9120ggataggggc ttggtactac aaatatcttg atacttcaag tcctacaaaa agaaatttat 9180gtttcatatt ttttccattt gaacgtcgag attttatggt cccatggagt tctccctatt 9240tttcgatgat gcccatcttt tggcagtacc ttctttgtgt acacaataaa tgggaggata 9300ttttctgcag ggagaaactg ctttcatgtc gcctccttgg ggtggccctg actatgccaa 9360agttgatgtt tatgatatga aaagcatgct tattccttgt gatgggttag ttccttgttt 9420ctattttaag agagtaattt ctttcagttt gcactcactg atgtttactt actttgtgag 9480taaaacgcac cagagatcca ttaaccttta aggaggtgtt atctatgtcc atcaacactc 9540aaactgcatt tttgggttcc taaacttttt aagtgattca ccggagttcc gtaccccttc 9600gtttatattt gtattttgca gaaacctcac tctgttttat ttctccttcg catgtagggg 9660tggtaatgga tcacgattca aacgtttctc cacgattcgt ttgagccctt aattaatttt 9720agtacaaaaa taaatagaaa tagagataga gcctgatcct aatatgattt gatcctcaaa 9780ttttatagcg tagaatttag agcccattac catttaccac ccctattcac atgcctaccc 9840ctctccatct tctggattga atgttccaac ctaatttaca ctcgtagttt ctttgatctg 9900ccaatcaaat ccagagccta attgctataa cattagaacg aacacgccat attaccagaa 9960tactcgatgc agatatggat agaagcgagg cgctaagcgc agccagcctt ggcttcttgc 10020tctgcaggcc gatcagggcg ccagccaaag ccaaccatgc gcgcacgtga ctgcaatgct 10080actctctctt cgcctttgcc atcgtcgtcg caggatgtta cgttgtgctt atgctggctc 10140ccacgagtgc cgccgcccag agcgagctga gcgcacgcag ccactgcttg tggttcacga 10200gcgtgagcat gccctcacca cctgctgctg cgcccttgct gcttgcttgc ttgccggtgg 10260tgtacattat ggacggatta aattgaatgg atttacctgt tccagaaaaa gatctgatcg 10320acgatgggat gctatcttgt atggctccgg atcaatgaag attaatggaa caaccaatcg 10380aaggctcaga gcaggctagt tggtgcccgg aagactctgg ccagaagatg gaaatgggta 10440agcgtgtgaa ggaaaaaaga aatagagggg gatttctaca aaaaacaaac ataatgaaga 10500ggtatggatt tcaggtgaac cacttaaaaa ataaaaaggg catacccagt gccgtaggct 10560tcccgcactg tgcggggtcg tctggggaag ggtatcttta agcgtcaagt cttacccgca 10620taatatgcag aggctggggc tcgaacccgg gacctttcgg ttatagacgg taggctctac 10680cgccgcacca agcccgccct tgaaccattt aaaaaattta ggactcaaaa atacagtttg 10740acagttgatg gacctagatg acacctcctt aaaaaattta atggacctat ggtgcatttt 10800aatcttttgt tgatcgacta tactcaatgt tgaactcttt aggtactctc tttttaaact 10860cggaaccatg atagcttcaa gagtagtgat gtttcttcct cgcaacattg acctaaacca 10920attggcggac atgtccttgt ctgtggatcc cccgtgggca gttgaggtaa gcccattttt 10980gctgattttg tgccaagctg acgtttccta tagatgtcac agtggtctct ctctctgcag 11040gtcgagaaga acttcctcaa cggaaagctg aaagccataa cagcttactt tgaagaacag 11100gatcgttgaa ccaagcatcg gcgctggtga tacaaatcat cttgttagct atgactcacg 11160acaaattttt gtggtgaccc taaacagaac ctttgtgttc ggagacagaa agaagcggtt 11220tatcatcttc accgagcata gataatttat ttgcagagat gagtcattgg tatcatacaa 11280aagcagctca gcttatctca attcacagca agtgaaactg tcgaaggaaa actacaaggc 11340tgacagtcga acgcgtggga gttagcttaa ttttgcctta tgataagcaa gcatgcttcc 11400tggtttattt catacagcta ctagtagttt cagctgcaac agttgtgcgt tggtgtgcgt 11460gtgattctca catatctggt cctgcggatg tgagtgatgc aaatgtatgt gtcatcatcc 11520catgtaaggg tttgtttgtt tgctctcaat ctatgtagat tgagtgggat taagtgagtt 11580taaatctcag acaagtcaaa aaaaaatgtt ttcaatctca tccaatccac atatgatagt 11640aatacccgag taaggcttag atgtaatagt tggaataaga aaaacaagtc agccattttg 11700aagttttgtc cttggagttc tattaaaagg cattactgat aaatctccaa cagatttgca 11760gttgaagcaa catgtgaaac atatttatca tgttaaaaca atttgcctta gtattcgatt 11820atgccatgaa atctgacatt tccttacaca tcccagttta tcattgtcaa ctgtctttag 11880gaatgtattg tatctgctgt ttttacttgt atatgtatgt tattttttgt cgttgtatgt 11940atatgtttta ttataaacat ggccactaag gttgttctat tcgttaaaat aacacagatc 12000tataaacgac taaacaagct tcttgggata aagaatcata tggaggctgg attttcgagg 12060agtctggtgc actgttttgc taatgatcag cccccccccc ccccccctct aaaaataaag 12120aaaatactgg atttcctgtt catttattac attcatatgt aaatgcttct gtccttttct 12180atatctgggc tggacttttt gtgtgctcgt cactcaagtt ggttagtgtg gttaatttta 12240ttatgctccg tgctctttcc taccgaactt ggtctttgtt agtatcatta tcagtcagtt 12300atattttctc ctcttgatgc ttcatctaat ctatttttgc aaagttgtca tgttatgtac 12360tatatgatct tttacaaggt ttttgacttt tcaaattatt gtgtcgtata ttatttgtac 12420tcagattgtg cttacaactt tagtttatct atactttaag gggtgtttgg tttctatggg 12480ctaatttata atcccttcat tttattctat tttcgtacct aaattgtcaa gcacgaaaac 12540gaaaataaag ttttaacttt tatatttagc agtttataca ctaaaataga ataaaataga 12600tgaagtaaaa attagtcctc agaaaccaaa catctcctaa atgtctagta atagtcgcct 12660gaactgtaga gcgcccaaca cgcgccaccc tgatttggtg tcttaaaatg gcatgtgtat 12720ataggtggaa tgggtttgac gagactgtaa ctactttttc ttaattaaat tatagatgga 12780cttaactttt ctatatgcat tttaaatata tttttctata tttttggtgg gctgagttac 12840agtttatgtc aatataaatt acaacagacc gaatctaaaa ttttattata aaatgtatgc 12900accaaccgtt gactaaaaag ataaaatttg gacacctaca ttttagcaag tcacctgcta 12960atatatatct atactaggta agtgtccgtg cgttgcaacg aaaacatata ataatacgat 13020aacttatata taaaatatgt gttatactgt tatgagaaaa agtttcacct gtcctatttt 13080tatcaatatg acaacagagg atcaatataa ggccttggca tggcttcaga agttcagatt 13140aacgaattat ggagcaagag caactatttc tggtgtttca gtagaaagaa tggggatatg 13200tgtttatctc tctcatacat gttacacaat gtgctataga atgacacctc taggccgctg 13260ctacaactac agagaataaa ttatggatca tgggtgccct actaagttag ctacacgtaa 13320aatctggggc gattgacccc ctaatgcttc atcttggaga tctcactaaa atacaccttc 13380cgcacaaacc gagccttgat aaagctcgcg atcttcgtgt cctgatcttt agtgcagaca 13440actgcaaggg aaattattgt gccggtcagc aagagtggaa aatgtcagca gaaacacaag 13500aatggaaaat tatatgatgt gcaagtagga cggcacccta tttaactaaa ggtgtgtttg 13560gttcagtttt ctgaaccaat tcgttgccaa aaaatctaaa atctcacaca aacggtacaa 13620catcagaata gatttttaaa agtttataga tttctcaagt tcaattcaaa atcaccatct 13680accccaaatt tttcagatta ctattattca tgttacaact atcactcttg tagtatctac 13740cattgatagt tgttttaatc aaatatattt tagctttctc acaatcctca gctcgaaaca 13800gattttcatg gctcacagtt ggattcatat tttcataaat ctataggtgt gaaccaaata 13860gaccctaaaa catcatgttg taatgttcca aaaaaatcat cacaaaaaac atgtatgcaa 13920cacacctaag tgcaacaaca ctaacctgca ggagcgttgg cgtaggctgg actcttgtgt 13980gtctgcaatg tggaaacctg cattgtaaat ggttaggtaa atgcttcgct taaaaatggc 14040agtaaatgct tcactaaaaa tgcttccacg tgctcatgat caagtaggtt ttatgttcaa 14100tctgcagttc tacacaagtg cgattcattt tgatactcat ttctatttac tttcagagct 14160cgtgctaact gtcataaagt gcaatgcatc tatgactgcc aattgatatt gtgctcctgc 14220cataaagtgc actgcatcgt gctaactgta cactaatctg tgaggatcta agaccaatat 14280ttgtttacgt tttctccttt agtatcctat aaaaacaaca cgcctagaca acccaaaaaa 14340tgtgtcccaa taggaatatc agattctgac gctggcagca atctcgaatg ataatatttt 14400ttccaaacaa gcgaggtccc taaatagaag cggcaacaga taaaaactaa agataaaaac 14460taaagagtac agatgattgg catcacatcg ggaatgaaat atgcctaaca tatcaatttg 14520catattagat tatttgctga gaacaataac gaaaacatat ttagttgttc atcacaagtt 14580accttagatt ttgctgttca aggtcctttg ggtcttcttt ctgctagaac atacaagggt 14640atttcagatt tgcaaacaag gaaaagcaag aacttcaatg atacatcatt gtaaaaccaa 14700gtttccgatt taaataaaga tgatgcttgc ggtcaacaca ttcacaaatg taaatgtgtg 14760aaatcgttca aacataaggc ttatgtggtc atgctcaggt agtatgtaca gacctaaaaa 14820caaggtatat gacaacagta ccagccacta aacacacatg gttaatcact aaaacaattc 14880tccgattaac caggaaaact atagcagcca ctagaactat acaggtttct accagtaatt 14940gcttcactaa aaaatgcccc catgtgtaaa ttttcaggtg gtttgtacag acataaaaac 15000aagggtatag gactacttct tgtgctaaaa taaaagctgg cactaaacag tgtatccagt 15060tcatcaggaa aacagtttta gtgattaatc actaaaacaa tacccatggt gcaaattatc 15120tgattaacca tgtacatacc agtaattgct tcataaaaaa tgcccccaat ttgctcatgt 15180ttaaggaata tgtatagaac taaaaacaag ggtatatgat aacaatacca gccactaaac 15240acacatggtt aatcactaaa acaattctct gacaactcct ataaaaagat agcaaccacc 15300agaaataaac cgcccacgac atccctaatt gtagtcacta aaaactggta gcagtaattg 15360gttcactaaa aaatgaccat aatcacgaaa actataacag tcaccagaac tatataggtt 15420tcattagtaa ttgcttcact aaaaaatgtc cccatgtgta aattttcatg tcgtatgtat 15480agacctaaaa acaagggtat atgactactt gctatgctaa aacaatagct agcactaaac 15540agtgtattca gttaatcatc caaacaattt tgtgattaat cactaaaaca atacccatgg 15600agcaaattat atgagaataa gatctcgtcg ttcctattgt gaagaatata ctactacctc 15660cagtttcaaa ttacaatttc aaattacaag ttgtttagaa catccacaag gtaattgcga 15720agaatatact aattgctcta gtttcaaatt agaagttgtt tagagaaaag gtgttcctta 15780atagtctagt tttagaagct acatccaact ggtaaacata aattgcagaa accttttatg 15840tggaagcctc cgtcattgag tctgtcccct ttagctgtaa gtagttttct aaatattgtt 15900agtcaggctt agttgtttga gactctgttt ccattcgtga ccatgggaac tgtgaaatgt 15960gtagaagatg ctcatgctca tgcatatgca tcgaattgtt ttgtaaagtc atcttaatgc 16020tcaaacagtt tttttatctg ccccagctgt acactgcttt ctgaattatg tcatttaggc 16080ttagctgtcc gagataattt catttgtgat gaaggtaacc ggagcatctg tccttttgtt 16140ttaaacataa atattttgat agcttaactt gtgcgtcatt ttatcatgta ctaacatggt 16200atatagatgg cacttagcag taacattcct gacttatgtg attgtcctgt tagaatgctt 16260ctgcaatata atgggcttat gctaatctgt ttggaatccc atgatgaata acaattatgg 16320atgttgggca ttttgtattt ttatgatgta ggcttaacat attttcttct ctgctcagcc 16380ctgccggtgg agacctctat ttatagctat aatccagcca tatctagtga actaacagat 16440ctattcctat ctctagtgct tatcccctta acagatttat tttcctatct ctatttcttt 16500gatgttactt gctgcaggtg ctatccccca ctgatgcatt atatgacaat cactcgaaga 16560atcagatgct aattaattgt tttatacttg accattgcta attaagtact ccatttttt 166191317274DNAZea mays 13caaccctttc cctctctcaa acggtcactt agaccgagtg aggcttcttc cttaatctca 60tgggtcactt agaccccgca aggatcacca cacaattggt gtctcttgcc tcgcttacaa 120agcacttgag agtaagaagt gagaaagaaa agaaagccaa gccaagcaaa caagagcaac 180aaagaaacac aaatgatcct ttaacaagtt ctaatgcgct agagttgaat cgagaacttt 240gagtggatcg atcacttgaa ttgtgtcttt gcagtggagt ctattgctct tgtattgaat 300gcaatgtgtt gaatgcttgg atggttagag tggaggtggt tgggggtatt tatagccctc 360aaccaccaaa caactgttgg ggaggggttg ctgtcgatgg gcgcaccgga cagtccggtg 420cgccagcaac gtcacccaac cgttagggtt cgagcgcaga cgactgttgg agctttgtct 480tcttgtgcca ccgaacagtc aggtgccgca ccggacaggc actgttcact gtccggtgcg 540cctctgacgg ctgctctaac ttctgcgcgc actggtcgca cactgtagcg ttcgcaggtg 600tccgttgcag tcgaccattg tgctggaagc cgttgctccg tttggtgcat cggacagtct 660ggtggcacac cggacagtct ggtgaattat agcggagtgc ggcctgagaa acccgaaggt 720ggagagttcg gagttgtacg gtcctggtgc atcggacact gttcggtgcg ccagaccagg 780gcacctttgg tttctttgtt cctttgcttt tgaaccctaa ctttgatctt ttattggttt 840gagttgaacc tatatgcacc tgtagaatat aatctagagc aaactagtta gtccaattat 900ttgtgttggg cattcaacca ccaaaattat ttataggaaa aggttaaacc ctatttccct 960ttcatccggg cccttgcggc ggaccgtccg cgacaccagg gtgagccttg gacaggaaca 1020ctgcaaaaac acaagttaac actacggatc gtccgatgga gaagcgagca ccgtccgaga 1080ccaagcacgg accgtccggc ctcaggcgcg aatcgcccgg tcgttgaaaa accagaaaaa 1140cccgaaggtg acgggttcgg taaaatgcat ttttagcgtc cttgcggatc gtcctgggtg 1200cacggtcgga ccgtccacga ctgctttatc tgacatttga cgacgcatta aaagctctat 1260agccgttact cctgaccgtt gtgatttcag tcgttgatgt gcaggggtac ggaccgtccg 1320cggtcggtag aaaatgagca acgactagga agtggttgga ggctataaat acaaccccaa 1380ccacctccat tcaaatgatc caagcactcc actcattcac attcaataca ggagctagca 1440atacattcca agacacactc aaagctttca atctctcaaa gtcccacaat ttagacaagt 1500gatcattagt gcttagtgac ttgagagagt gtgatctatg tgttatttgt cgctcttgtt 1560gcttggcttt cacaattggg ctttcttcat ctctttctca accttctaag tgaattataa 1620agcaagcaag agacacctaa ttttgtggtg atccttgtgg ggtcttagtg acccgtgtga 1680ttaagaagaa gcactcgacc ggtctaagtg accgactgag agagggaaag ggttggaata 1740gacccggact ttgtggcctc cttaacgggg actaggttct ttggaatcaa acctcggtaa 1800acaaatcgct gtgtttattt gtgttgattt tcactcgatt tgtttcccct cccttcctct 1860ctctaaaatt cccttgctca tattgttgtg agttggctct caaagttatc tgcattgatt 1920gggcaactac ttgcaaggat aactatattc cgcactccga attatttctg acattaaccc 1980cgggcataat gtgtgtttta agtgtataat tttcatgttt cgcctattta cccccctcta 2040ggcgactttc aaatgttctc cttcacttgt gatgtctaca accataatca gctcaacatt 2100tggactatca cccttgaaca cttatgttga actttaaaag ttgtgcacta agcacttgtc 2160caacacttaa cacacttgtc agtcctttaa ttgggttgtc atctaaacca ccaaaaacca 2220caaagagatc tttcaccggg gtccgtggtt catggccgta ctgctcggtc tcaagatttt 2280tattataaaa tcactagagc tctactattt atggttcggt gtgccatcga accgtctcca 2340acgggtacat ccaacgagcg ccagcacacc aacgactagt tgacgtggtc tacggtccag 2400aggctcatca gacttgtcag gaggctcgtc gactggtctg gtgccacacc cgtcgacttg 2460ggatcgaaca agaatgatgg caagaggacg aagcgcatca agaagatcat ctactacgac 2520tcctcttacc cttcacacaa ggacgacgat tccacctcct ccaagaaaaa tacggttaaa 2580caaggttact ctaagacatc ttttatttat tctcgcattc cttacaattt caatgctcat 2640ttgctttcta ttcatcttgg caagcctgct cgctttgatg ggaggactat tcttggtgga 2700gccataaaat gcgtagccat

atttttttgc tccaccctag catttgggat gtcgtagaca 2760atgtaatgca attgctaggt agcgatgata aaaattataa tactattatt gcccaataat 2820ctattcataa tagcgcccaa gctaaagagg tgcgctaagg gatccttttt atagcccaaa 2880gaggtccata ggcgttgctc cttccttcta aacatcgatg aaattgtgtt gtctgcgagg 2940acatcaaacc gggtctgtgc acaccttctc cgggatctga tttggtgctt tccttaactg 3000attcacagcg gaccgatccg atgcaccacc ggaccgtctg ccacatcagc tagtcgttgg 3060ccttcaaact ccaccgggaa gtagccgttg gaggcggtcc ggtgcgccat tggaccgtgc 3120aaagtgaagg gtcgatgatt tttataaata aaatctcgag acctcaaagt tcagctctgt 3180aggcggtccg gtgcacatgg acctggttcg atgcattagg tcgcctaaca ctaagttcta 3240tgccctgcgg acttgatccg gtgcaccacc agacgagtcc gatgaggcct agaacaaccc 3300aagtaaggct gttttgagcc taacttcttc aaatcctttt ggctattctt gggagctttc 3360caacaactta gacaaacata attagcacat attccaattg attaggtgtg gagaactcac 3420cttttacttt gtcgttcacc atgatttgca ttttggctta atctaagtat tcgaaccact 3480tttctcacag gatagagtta gagttcaaat aaagtgctaa acacatagta ttagacacat 3540gcaacttatc taagtaatca aacctcatga ttttaccctt ttgtccaaag ctgcacactt 3600tagccttcat tttagttctt taggatctag tactttcaaa ttgacttcaa gtgcttgtgc 3660tcgtactcat atcaaattag ttagtccatg ttgttgtgct aaacacttaa tcactaaaac 3720atgtagaaat ggttatctaa cacatttttc tttcataagt aaaacaggag atttatattg 3780tagatgttat tgtttgttga tgaaatttga aataagagat ataagagcaa ctcgaaaagc 3840ctagctaaat cgatttgtat cggtaaaaat agaaaactga tgattaaaat aggatccaac 3900aaactctctt tgctcctctc tatgctatcc tgctcagcat cacgtcgagg tctctagcca 3960tatttgctga gctcacctgc ctcgccatct tcattctctc gtgcatcctc accgtccctg 4020cgccttgccg tcgttgcagc tcactgtccc acgccctgct gtcgccatgc ctcgccgacc 4080ctgcggctca ccttaccgcg cctcgtcatc gtcgcggctc gccgtctcgt gcctcgccaa 4140ccccgctcct tgacgtcctc acggcttaca gtccccatcg tcctcgtggc tccacgcctc 4200gccatatttg cggttcactg tccacgtgcc ctgccatcct agagcccgac atcgatatgt 4260caaatgagac aggagaaaaa gaaatgaaca tgtgattata atcagtgatt tgaatattga 4320tagataagat ttgaagagtc tgttgtgtca tatcaccttt ttatgaaact ctttattttt 4380tagagttttt ataaaactct aaatttagct aaaattatat ctagtctttt agagttactc 4440taacaagaga tgagatagcg agctggcgag ctgctggaga cagccgagag tagagatagt 4500agaggagact agaaactcca ttaggcctag cccagagtca gctagggatg ccgcccgatc 4560cactacgtac tgaaagcgat cccggcccat gaacgctagt gggttgtaat tcggcttacc 4620caagtaccca gccgttcctc cacacctctg cactacccga aacccacgcc caacggccgt 4680ttcccgcacc ccctatccgg gaaggaagaa agccagaact cacccctgct tcgtctggcg 4740gcgccgtttc ccgcacgcga tgccgtcgac ggcggaggat ttcccggcga taaggaagct 4800ggggaagctc ttccggctta ccgaagtgta cctctggtaa gtctcctgtc tcccctaccc 4860gcttctcagc gtagggtttg ccgtttgcga ggagtacgtc tcctcaaact actctctctt 4920cctgcaggga cgattcgtat ggcgctggac ctcacgatgg acagaagaac gggcgctcgg 4980cggaggctgc tctcgtggta cggtctctga cccgctttgg tgtagctcac tcttaagctt 5040tctgagttgg gggtgcgctt gtgcttcgac tagtagatgg ctaatttcgt cgggctactg 5100gtaatttctt ggtatctgca ttgtcgagaa agaaggcccg acgcataatt tcatgcttgc 5160ccaagagtct acttaacaca aggaattggt tttgtggcgt ggtttgtgca ttgcgcccaa 5220actgtagcct gtacaaaatg ttaatcgtcg cgtgcatttt aaacaaagtt ttgtattata 5280cgacaaaata ccctcggcac atttgttaca gactaccgat aagtgcatac ctatttctcc 5340tagttctatc aggaaataat cctggacctc gaaatgacag cctcgtctgg ctagaaccct 5400actaaacatt ttgagtgatc acttttcatt actcattttc ttgatgaaag cacattactg 5460acatggaagt ttgctacata agacataaca cttccttgta gtgctttatt taattattga 5520ccgatgatct ttttggaaaa ttaagctgta ttaaacaatt gtagcttcgg tgatgattgt 5580tggattaagc attagtctgc tgcagtcctc ttcttgattc tgatatgaca gttatttgtt 5640gattaaaata atgatggttt gctttacact tcgatctcct ttgagaggaa aacatgtgaa 5700ggtgtggact agatcatgta tagaccaaca gcattatctt attaaaacac ttctaaataa 5760cttagcaatt tcataaccat ttttacactt ctgaggaatt catcttgtcg tgaaagagtc 5820aattaactta gctgcttagc agactgtgtc aagcttatta cttgtatgtt gtgccctaca 5880aattactatg aggtttataa tgtacatagc aatttgacga ccttcaactt ttcaggactc 5940tcatacagat aaaacatgca atgaagcatc caatagcact gacaaaggtt agatgtattt 6000ttcttgtatt ctagtatctt ccttggtcaa ttttctttac agaggatgtt acaatgtact 6060ctactttttt tgtgtggaaa caacccacta gagaaaaaaa atcacttcat ttgagaaatc 6120ttaagaatct gaactctgaa gctcagcatg cttccacacc accacttttc aggccactgt 6180ctcttcattg gagtaaatga cgcttctttt agagagaaag agaggggggg ggggtctgtt 6240tataattgaa tcaagagatt ttattggtca cctgatttct gttgcatgac gtgggacctg 6300gatagacttc agatttgcct tagttgataa gttcaccggc actagtgaaa gaaaagtata 6360tggtacaggt actcttatga acaggcaacc acttagcaat tcagcatcta atagagaagg 6420accaagtctt caactaaagt cacaatatac ctttagtact attagagggg ctgaccctcc 6480ttgtctagtg cttgtgagat cataggagat ggggcgtggt ggttaagatt gtggatcata 6540ttcgccaagc tcccagggtc aatgtgattg agggatgtgg tatgctactt gtgaaagggt 6600tcaaaaaggc caggatgaca ttgttcctac attcctggaa tggggatgac acccccagga 6660caaggaaggt ggcacagttc cacgttcctg gtgacatgtt gtgtatcaaa tgggaggcca 6720aatcaccacg ggattactct aggaagggaa gatgatgttg ataatttgag tcattgttgc 6780aacatctgtt catggtttca tgcctcattt tataagtcat attgcccaca cataacattg 6840taatagtaaa atcaacacca gttattttac gttttccctt gtatgtcaac cgatttctta 6900ctgtgtatat gatctgtcta tcaataggcc atttctttgt tgaagatttg gaattggtca 6960atctcatggg ttctttgggg cttcctgttt cattcagcac aagtaaagtg gtcagttgct 7020cacatagtgt gacaccatgt tactgttccc taccggttcc attgcttaat tcctttatgc 7080attgcagaac aagaacacat gcaacaaggg aaagaaaaaa ggaagacaag caccgctcaa 7140agcagcaaac actcaaatca atgatgctgt gaggatatgt atcaatactg aagatagaga 7200aaattctgtt gaatcattgg atgctatgga gcaaacgcac tcatgcaatt tatttgtgac 7260accactgggt caaaatgaac cctcccgtga tgacactgac aagaggctta gggaagacag 7320ctcttgtgtt gaagaacaag aagagtctgg ctgtagcacc atctactctg ctggcaaagc 7380ccctggctgt gatgctaaaa atcatctcac tgaacttggg gcttttgagc tttctgataa 7440cttggccaac tcagcaaaag aagaatactc aattcaagaa aatcaagctt atgaaagtgt 7500gttgctagat tctgaagaga tgtcaaggaa tgactgtgtt gatgatgaat ctacacattc 7560ctgtgttggc atttatcagg atgaaagagt gtccacaagg ggagatcaaa catctgaaga 7620aactctatca gtaccccatg attacaatga tgttggcaga gaagctagtc taagtttggc 7680agagccatca tctattgatg agcatgcaca aagctctgcc aacaactttt actatgacta 7740tggtgaatgg agggttatct gggatccatt ctataatcgg tattattttt acaacatcca 7800gacacaagag tccacatggt gtcctcctga aggactggag gattttgcat catattgtag 7860cccagatacc actaaagagc tagctgaact gggatctcag tgttcaagca tggcaccaca 7920agagaacagt aaaaacccta gtctcgtcct ttgcagttga cattacgaat agttatatgc 7980actacgataa aaactttcta caatatgtaa cacttgagca tgtggcaatg ggtgtaaaca 8040tttaataata aggtagtgaa atccattaca cacagtattg aattttgcac tacaaatgct 8100gaaggagaaa cctaaattgt caatgctttt tggtgacatt aattattgcc attgatttcc 8160tgcttgtagg tgcttcattt atctgtctcc aatttactca tatgctagct tcttgtttgg 8220gactaaaggc tttgctgttg ttttagtatg tcacacattt ctctttaatc tcaccatcac 8280agatctggct actcatgtca atcatttaga agcacaggag caagatcact gcattcatga 8340tttatctgac attcctgttg aaaagccaat atatcaaagg tagggaatac caaactgtac 8400aatgttgaac aagttattgt tttttttgtt aattctgttc atctatgcag tatgataact 8460acctctgaca aagcacagca cactgaaaat aagtacagcg attcaacaac tactgtgtta 8520gagatgaacc aggaagttgc tagcaccaaa acgaaaaaga gagtaaggag atctcgatcg 8580tgtaaggcga taatatatgg catctgcttt ctaggagttt gttcctgtta caattttagg 8640ttgcgcattt acacaatagt ttcttggttt ctttgagcaa atgcagcttt gcatgactgc 8700tacattgcct acttatgtct aggtaacttt tctttgcaaa ctgcaaagtt atgtctaggt 8760aactatgcct tctagaaaac ctccttgtta gctatgtatt agtgagactt gcctaatatt 8820tattttcttg tggtccgttc ttgtgctctt tgtacatatt tgccaataac cattttaatt 8880gttctacaga tcattcatgc caagacatgg cagggaacgt ctctaatgac atcatcaagt 8940actgggctca gcggtattca cttttctcac tttttgatag tggtataaag atggatgaag 9000tagggtggtt ttcagttacg ccagagccaa ttgcaaagca tcatgcatct cgtgtgggtg 9060cgggagtaat gattgattgt ttcacaggag ttggtggaaa tgccatccaa tttgccaaaa 9120agtacgtcaa tgttatcttg caattgagtt atgtgatggt ctaatgtatc atttgcttga 9180acacttcctg tttagtagca actgttattt ttcttatgtc acgagaatgc aatggctata 9240tcaccttaag cagtatgcta tgtccactgt ccagtttaac taaggcatct gcttccagta 9300atatgcaagg ctcttcttac ttttgctgtt atttaatata tggaagtgtc cttacggagg 9360tgttattgtg gacattttga gcatgttcat catgtcactt gagttagtag agccagcctt 9420agttgtttgc agtgtaggtg gatttatttt atgttatcaa tgtttcttct acagtactaa 9480gactattgtt ccacattaac tatgtctcct tttccaggtg caagcatgta attgcagttg 9540atattgatcc acaaaagatt gattgcgcgc atcataatgc atccatttat ggagtaaatg 9600atcacataga tttcattgta ggtgatttta tacatatagc tcctcatctg aaggtaatgc 9660ctttttcttg gaattattac ttttaagttt ctcaacacgt cacttctatt agctatatgt 9720ttttgtagct gtttgcgaga gtgaatttat tgttgacatt gttctcattt gcccacccat 9780tttaggatag gggcttggta ctacaaatat cttgatactt caagtcctac aaaaagaaat 9840ttatgtttca tattttttcc atttgaacgt cgagatttta tggtcccatg gagttctccc 9900tatttttcga tgatgcccat cttttggcag taccttcttt gtgtacacaa taaatgggag 9960gatattttct gcagggagaa actgctttca tgtcgcctcc ttggggtggc cctgactatg 10020ccaaagttga tgtttatgat atgaaaagca tgcttattcc ttgtgatggg ttagttcctt 10080gtttctattt taagagagta atttctttca gtttgcactc actgatgttt acttactttg 10140tgagtaaaac gcaccagaga tccattaacc tttaaggagg tgttatctat gtccatcaac 10200actcaaactg catttttggg ttcctaaact ttttaagtga ttcaccggag ttccgtaccc 10260cttcgtttat atttgtattt tgcagaaacc tcactctgtt ttatttctcc ttcgcatgta 10320ggggtggtaa tggatcacga ttcaaacgtt tctccacgat tcgtttgagc ccttaattaa 10380ttttagtaca aaaataaata gaaatagaga tagagcctga tcctaatatg atttgatcct 10440caaattttat agcgtagaat ttagagccca ttaccattta ccacccctat tcacatgcct 10500acccctctcc atcttctgga ttgaatgttc caacctaatt tacactcgta gtttctttga 10560tctgccaatc aaatccagag cctaattgct ataacattag aacgaacacg ccatattacc 10620agaatactcg atgcagatat ggatagaagc gaggcgctaa gcgcagccag ccttggcttc 10680ttgctctgca ggccgatcag ggcgccagcc aaagccaacc atgcgcgcac gtgactgcaa 10740tgctactctc tcttcgcctt tgccatcgtc gtcgcaggat gttacgttgt gcttatgctg 10800gctcccacga gtgccgccgc ccagagcgag ctgagcgcac gcagccactg cttgtggttc 10860acgagcgtga gcatgccctc accacctgct gctgcgccct tgctgcttgc ttgcttgccg 10920gtggtgtaca ttatggacgg attaaattga atggatttac ctgttccaga aaaagatctg 10980atcgacgatg ggatgctatc ttgtatggct ccggatcaat gaagattaat ggaacaacca 11040atcgaaggct cagagcaggc tagttggtgc ccggaagact ctggccagaa gatggaaatg 11100ggtaagcgtg tgaaggaaaa aagaaataga gggggatttc tacaaaaaac aaacataatg 11160aagaggtatg gatttcaggt gaaccactta aaaataaaaa gggcataccc agtgccgtag 11220gcttcccgca ctgtgcgggg tcgtctgggg aagggtatct ttaagcgtca agtcttaccc 11280gcataatatg cagaggctgg ggctcgaacc cgggaccttt cggttataga cggtaggctc 11340taccgccgca ccaagcccgc ccttgaacca tttaaaaaat ttaggactca aaaatacagt 11400ttgacagttg atggacctag atgacacctc cttaaaaatt ttaatggacc tatggtgcat 11460tttaatcttt tgttgatcga ctatactcaa tgttgaactc tttaggtact ctctttttaa 11520actcggaacc atgatagctt caagagtagt gatgtttctt cctcgcaaca ttgacctaaa 11580ccaattggcg gacatgtcct tgtctgtgga tcccccgtgg gcagttgagg taagcccatt 11640tttgctgatt ttgtgccaag ctgacgtttc ctatagatgt cacagtggtc tctctctctg 11700caggtcgaga agaacttcct caacggaaag ctgaaagcca taacagctta ctttgaagaa 11760caggatcgtt gaaccaagca tcggcgctgg tgatacaaat catcttgtta gctatgactc 11820acgacaattt tttgtggtga ccctaaacag aacctttgtg ttcggagaca gaaagaagcg 11880gtttatcatc ttcaccgagc atagataatt tatttgcaga gatgagtcat tggtatcata 11940caaaagcagc tcagcttatc tcaattcaca gcaagtgaaa ctgtcgaagg aaaactacaa 12000ggctgacagt cgaacgcgtg ggagttagct taattttgcc ttatgataag caagcatgct 12060tcctggttta tttcatacag ctactagtag tttcagctgc aacagttgtg cgttggtgtg 12120cgtgtgattc tcacatatct ggtcctgcgg atgtgagtga tgcaaatgta tgtgtcatca 12180tcccatgttt gtttgtttgc tctcaatcta tgtagattga gtgggattaa gtgagtttaa 12240atctcagaca agtcaaaaaa aaatgttttc aatctcatcc aatccacata tgatagtaat 12300acccgagtaa ggcttagatg taatagttgg aataagaaaa acaagtcagc cattttgaag 12360ttttgtcctt ggagttctat taaaaggcat tactgataaa tctccaacag atttgcagtt 12420gaagcaacat gtgaaacata tttatcatgt taaaacaatt tgccttagta ttcgattatg 12480ccatgaaatc tgacatttcc ttacacatcc cagtttatca ttgtcaactg tctttaggaa 12540tgtattgtat ctgctgtttt tacttgtata tgtatgttat tttttgtcgt tgtatgtata 12600tgttttatta taaacatggc cactaaggtt gttctattcg ttaaaataac acagatctat 12660aaacgactaa acaagcttct tgggataaag aatcatatgg aggctggatt ttcgaggagt 12720ctggtgcact gttttgctaa tgatcagacc cccccccccc ctctaaaaat aaagaaaata 12780ctggatttcc tgttcattta ttacattcat atgtaaatgc ttctgtcctt ttctatatct 12840gggctggact ttttgtgtgc tcgtcactca agttggttag tgtggttaat tttattatgc 12900tccgtgctct ttcctaccga acttggtctt tgttagtatc attatcagtc agttatattt 12960tctcctcttg atgcttcatc taatctattt ttgcaaagtt gtcatgttat gtactatatg 13020atcttttaca aggtttttga cttttcaaat tattgtgtcg tatattattt gtactcagat 13080tgtgcttaca actttagttt atctatactt taaggggtgt ttggtttcta tgagctaatt 13140tataatccct tcattttatt ctattttcgt acctaaattg tcaagcacga aaacgaaaat 13200aaagttttaa cttttatatt tagcagttta tacactaaaa tagaataaaa tagatgaagt 13260aaaaattagt cctcagaaac caaacatctc ctaaatgtct agtaatagtc gcctgaactg 13320tagagcgccc aacacgcgcc accctgattt ggtgtcttaa aatggcatgt gtatataggt 13380ggaatgggtt tgacgagact gtaactactt tttcttaatt aaattataga tggacttaac 13440ttttctatat gcattttaaa tatatttttc tatatttttg gtgggctgag ttacagttta 13500tgtcaatata aattacaaca gaccgaatct aaaattttat tataaaatgt atgcaccaac 13560cgttgactaa aaagataaaa tttggacacc tacattttag caagtcacct gctaatatat 13620atctatacta ggtaagtgtc cgtgcgttgc aacgaaaaca tataataata cgataactta 13680tatataaaat atgtgttata ctgttatgag aaaaagtttc acctgtccta tttttatcaa 13740tatgacaaca gaggatcaat ataaggcctt ggcatggctt cagaagttca gattaacgaa 13800ttatggagca agagcaacta tttctggtgt ttcagtagaa agaatgggga tatgtgttta 13860tctctctcat acatgttaca caatgtgcta tagaatgaca cctctaggcc gctgctacaa 13920ctacagagaa taaattatgg atcatgggtg ccctactaag ttagctacac gtaaaatctg 13980gggcgattga ccccctaatg cttcatcttg gagatctcac taaaatacac cttccgcaca 14040aaccgagcct tgataaagct cgcgatcttc gtgtcctgat ctttagtgca gacaactgca 14100agggaaatta ttgtgccggt cagcaagagt ggaaaatgtc agcagaaaca caagaatgga 14160aaattatatg atgtgcaagt aggacggcac cctatttaac taaaggtgtg tttggttcag 14220ttttctgaac caattcgttg ccaaaaaatc taaaatctca cacaaacggt acaacatcag 14280aatagatttt taaaagttta tagatttctc aagttcaatt caaaatcacc atctacccca 14340aatttttcag attactatta ttcatgttac aactatcact cttgtagtat ctaccattga 14400tagttgtttt aatcaaatat attttagctt tctcacaatc ctcagctcga aacagatttt 14460catggctcac agttggattc atattttcat aaatctatag gtgtgaacca aatagaccct 14520aaaacatcat gttgtaatgt tccaaaaaaa tcatcacaaa aaacatgtat gcaacacacc 14580taagtgcaac aacactaacc tgcaggagcg ttggcgtagg ctggactctt gtgtgtctgc 14640aatgtggaaa cctgcattgt aaatggttag gtaaatgctt cgcttaaaaa tggcagtaaa 14700tgcttcacta aaaatgcttc cacgtgctca tgatcaagta ggttttatgt tcaatctgca 14760gttctacaca agtgcgattc attttgatac tcatttctat ttactttcag agctcgtgct 14820aactgtcata aagtgcaatg catctatgac tgccaattga tattgtgctc ctgccataaa 14880gtgcactgca tcgtgctaac tgtacactaa tctgtgagga tctaagacca atatttgttt 14940acgttttctc ctttagtatc ctataaaaac aacacgccta gacaacccaa aaaatgtgtc 15000ccaataggaa tatcagattc tgacgctggc agcaatctcg aatgataata ttttttccaa 15060acaagcgagg tccctaaata gaagcggcaa cagataaaaa ctaaagataa aaactaaaga 15120gtacagatga ttggcatcac atcgggaatg aaatatgcct aacatatcaa tttgcatatt 15180agattatttg ctgagaacaa taacgaaaac atatttagtt gttcatcaca agttacctta 15240gattttgctg ttcaaggtcc tttgggtctt ctttctgcta gaacatacaa gggtatttca 15300gatttgcaaa caaggaaaag caagaacttc aatgatacat cattgtaaaa ccaagtttcc 15360gatttaaata aagatgatgc ttgcggtcaa cacattcaca aatgtaaatg tgtgaaatcg 15420ttcaaacata aggcttatgt ggtcatgctc aggtagtatg tacagaccta aaaacaaggt 15480atatgacaac agtaccagcc actaaacaca catggttaat cactaaaaca attctccgat 15540taaccaggaa aactatagca gccactagaa ctatacaggt ttctaccagt aattgcttca 15600ctaaaaaatg cccccatgtg taaattttca ggtggtttgt acagacataa aaacaagggt 15660ataggactac ttcttgtgct aaaataaaag ctggcactaa acagtgtatc cagttcatca 15720ggaaaacagt tttagtgatt aatcactaaa acaataccca tggtgcaaat tatctgatta 15780accatgtaca taccagtaat tgcttcataa aaaatgcccc caatttgctc atgtttaagg 15840aatatgtata gaactaaaaa caagggtata tgataacaat accagccact aaacacacat 15900ggttaatcac taaaacaatt ctctgacaac tcctataaaa agatagcaac caccagaaat 15960aaaccgccca cgacatccct aattgtagtc actaaaaact ggtagcagta attggttcac 16020taaaaaatga ccataatcac gaaaactata acagtcacca gaactatata ggtttcatta 16080gtaattgctt cactaaaaaa tgtccccatg tgtaaatttt catgtcgtat gtatagacct 16140aaaaacaagg gtatatgact acttgctatg ctaaaacaat agctagcact aaacagtgta 16200ttcagttaat catccaaaca attttgtgat taatcactaa aacaataccc atggagcaaa 16260ttatatgaga ataagatctc gtcgttccta ttgtgaagaa tatactacta cctccagttt 16320caaattacaa tttcaaatta caagttgttt agaacatcca caaggtaatt gcgaagaata 16380tactaattgc tctagtttca aattacaagt tgtttagaga aaaggtgttc cttaatagtc 16440tagttttaga agctacatcc aactggtaaa cataaattgc agaaaccttt tatgtggaag 16500cctccgtcat tgagtctgtc ccctttagct gtaagtagtt ttctaaatat tgttagtcag 16560gcttagttgt ttgagactct gtttccattc gtgaccatgg gaactgtgaa atgtgtagaa 16620gatgctcatg ctcatgcata tgcatcgaat tgttttgtaa agtcatctta atgctcaaac 16680agttttttta tctgccccag ctgtacactg ctttctgaat tatgtcattt aggcttagct 16740gtccgagata atttcatttg tgatgaaggt aaccggagca tctgtccttt tgttttaaac 16800ataaatattt tgatagctta acttgtgcgt cattttatca tgtactaaca tggtatatag 16860atggcactta gcagtaacat tcctgactta tgtgattgtc ctgttagaat gcttctgcaa 16920tataatgggc ttatgctaat ctgtttggaa tcccatgatg aataacaatt atggatgttg 16980ggcattttgt atttttatga tgtaggctta acatattttc ttctctgctc agccctgccg 17040gtggagacct ctatttatag ctataatcca gccatatcta gtgaactaac agatctattc 17100ctatctctag tgcttatccc cttaacagat ttattttcct atctctattt ctttgatgtt 17160acttgctgca ggtgctatcc cccactgatg cattatatga caatcactcg aagaatcaga 17220tgctaattaa ttgttttata cttgaccatt gctaattaag tactccattt ttta 17274141767DNAArtificial SequencecDNA 14atgggttctt tggggcttcc tgtttcattc agcacaagta aagtgaacaa gaacacatgc 60aacaagggaa agaaaaaagg aagacaagca ccgctcaaag cagcaaacac tcaaatcaat 120gatgctgtga ggatatgtat caatactgaa gatagagaaa attctgttga atcattggat 180gctatggagc aaacgcactc atgcaattta tttgtgacac cactgggtca aaatgaaccc 240tcccgtgatg acactgacaa gaggcttagg gaagacagct cttgtgttga agaacaagaa 300gagtctggct gtagcaccat ctactctgct ggcaaagccc ctggctgtga tgctaaaaat 360catctcactg aacttggggc ttttgagctt tctgataact tggccaactc agcaaaagaa 420gaatactcaa ttcaagaaaa tcaagcttat gaaagtgtgt tgctagattc

tgaagagatg 480tcaaggaatg actgtgttga tgatgaatct acacattcct gtgttggcat ttatcaggat 540gaaagagtgt ccacaagggg agatcaaaca tctgaagaaa ctctatcagt accccatgat 600tacaatgatg ttggcagaga agctagtcta agtttggcag agccatcatc tattgatgag 660catgcacaaa gctctgccaa caacttttac tatgactatg gtgaatggag ggttatctgg 720gatccattct ataatcggta ttatttttac aacatccaga cacaagagtc cacatggtgt 780cctcctgaag gactggagga ttttgcatca tattgtagcc cagataccac taaagagcta 840gctgaactgg gatctcagtg ttcaagcatg gcaccacaag agaacaatct ggctactcat 900gtcaatcatt tagaagcaca ggagcaagat cactgcattc atgatttatc tgacattcct 960gttgaaaagc caatatatca aagtatgata actacctctg acaaagcaca gcacactgaa 1020aataagtaca gcgattcaac aactactgtg ttagagatga accaggaagt tgctagcacc 1080aaaacgaaaa agagagtaag gagatctcga tcgtatcatt catgccaaga catggcaggg 1140aacgtctcta atgacatcat caagtactgg gctcagcggt attcactttt ctcacttttt 1200gatagtggta taaagatgga tgaagtaggg tggttttcag ttacgccaga gccaattgca 1260aagcatcatg catctcgtgt gggtgcggga gtaatgattg attgtttcac aggagttggt 1320ggaaatgcca tccaatttgc caaaaagtgc aagcatgtaa ttgcagttga tattgatcca 1380caaaagattg attgcgcgca tcataatgca tccatttatg gagtaaatga tcacatagat 1440ttcattgtag gtgattttat acatatagct cctcatctga agggagaaac tgctttcatg 1500tcgcctcctt ggggtggccc tgactatgcc aaagttgatg tttatgatat gaaaagcatg 1560cttattcctt gtgatgggta ctctcttttt aaactcggaa ccatgatagc ttcaagagta 1620gtgatgtttc ttcctcgcaa cattgaccta aaccaattgg cggacatgtc cttgtctgtg 1680gatcccccgt gggcagttga ggtcgagaag aacttcctca acggaaagct gaaagccata 1740acagcttact ttgaagaaca ggatcgt 1767151767DNAArtificial SequencecDNA 15atgggttctt tggggcttcc tgtttcattc agcacaagta aagtgaacaa gaacacatgc 60aacaagggaa agaaaaaagg aagacaagca ccgctcaaag cagcaaacac tcaaatcaat 120gatgctgtga ggatatgtat caatactgaa gatagagaaa attctgttga atcattggat 180gctatggagc aaacgcactc atgcaattta tttgtgacac cactgggtca aaatgaaccc 240tcccgtgatg acactgacaa gaggcttagg gaagacagct cttgtgttga agaacaagaa 300gagtctggct gtagcaccat ctactctgct ggcaaagccc ctggctgtga tgctaaaaat 360catctcactg aacttggggc ttttgagctt tctgataact tggccaactc agcaaaagaa 420gaatactcaa ttcaagaaaa tcaagcttat gaaagtgtgt tgctagattc tgaagagatg 480tcaaggaatg actgtgttga tgatgaatct acacattcct gtgttggcat ttatcaggat 540gaaagagtgt ccacaagggg agatcaaaca tctgaagaaa ctctatcagt accccatgat 600tacaatgatg ttggcagaga agctagtcta agtttggcag agccatcatc tattgatgag 660catgcacaaa gctctgccaa caacttttac tatgactatg gtgaatggag ggttatctgg 720gatccattct ataatcggta ttatttttac aacatccaga cacaagagtc cacatggtgt 780cctcctgaag gactggagga ttttgcatca tattgtagcc cagataccac taaagagcta 840gctgaactgg gatctcagtg ttcaagcatg gcaccacaag agaacaatct ggctactcat 900gtcaatcatt tagaagcaca ggagcaagat cactgcattc atgatttatc tgacattcct 960gttgaaaagc caatatatca aagtatgata actacctctg acaaagcaca gcacactgaa 1020aataagtaca gcgattcaac aactactgtg ttagagatga accaggaagt tgctagcacc 1080aaaacgaaaa agagagtaag gagatctcga tcgtatcatt catgccaaga catggcaggg 1140aacgtctcta atgacatcat caagtactgg gctcagcggt attcactttt ctcacttttt 1200gatagtggta taaagatgga tgaagtaggg tggttttcag ttacgccaga gccaattgca 1260aagcatcatg catctcgtgt gggtgcggga gtaatgattg attgtttcac aggagttggt 1320ggaaatgcca tccaatttgc caaaaagtgc aagcatgtaa ttgcagttga tattgatcca 1380caaaagattg attgcgcgca tcataatgca tccatttatg gagtaaatga tcacatagat 1440ttcattgtag gtgattttat acatatagct cctcatctga agggagaaac tgctttcatg 1500tcgcctcctt ggggtggccc tgactatgcc aaagttgatg tttatgatat gaaaagcatg 1560cttattcctt gtgatgggta ctctcttttt aaactcggaa ccatgatagc ttcaagagta 1620gtgatgtttc ttcctcgcaa cattgaccta aaccaattgg cggacatgtc cttgtctgtg 1680gatcccccgt gggcagttga ggtcgagaag aacttcctca acggaaagct gaaagccata 1740acagcttact ttgaagaaca ggatcgt 176716434PRTZea mays 16Met Gly Ser Ser Glu Glu His Val Phe Leu Asp Pro Thr Arg Ile Cys1 5 10 15Ala Ser Val Ser Leu Leu Ala His Asp Leu Ile Gly Arg Met Leu Asn 20 25 30Arg Glu Val Ser Ser Arg Pro Asn Ala Lys Glu Val Leu Pro Pro Met 35 40 45Ile His Arg Glu Ile Val Arg Phe Gly Tyr Cys Glu Ser Ser Ser Ser 50 55 60Lys Ser Ser Ser Asp Asn Ser Glu Glu Arg Asp Glu Cys Gly Ile Val65 70 75 80Asp Ala Leu Val Thr Thr Ile Thr Gln Ile Arg Lys Met Asp Leu Glu 85 90 95Ala Arg Ser Leu Gln Pro Ser Ile Lys Ala Gly Leu Leu Ala Lys Leu 100 105 110Arg Glu Tyr Lys Ser Asp Leu Asn Asn Val Lys Met Gly Leu Ser Ala 115 120 125Glu Arg Lys Lys Gln Lys Leu Ser Glu Ile Gln Ser Gly Val Glu Glu 130 135 140Ala Glu Ser Leu Ile Gln Lys Met Asp Leu Glu Ala Arg Ser Leu Gln145 150 155 160Pro Ser Ile Lys Ala Gly Leu Leu Ala Lys Pro Arg Asp Tyr Lys Ser 165 170 175Asp Leu Asn Asn Val Lys Ser Glu Leu Lys Arg Ile Ser Ala Pro Asn 180 185 190Ala Ser Gly Leu Ile Ser Tyr Lys Lys Leu Leu Phe His Gly Leu Asp 195 200 205Leu Trp Thr Ala Leu Ser Leu Pro Gln Pro Leu Gly Arg Ala Ala Leu 210 215 220Trp Pro Pro His Arg Thr Ile His Gln His Leu Gln Cys Gln Gln Leu225 230 235 240Thr Gly Val Ala Gly Ser Leu Ala Tyr Leu Ala Pro Glu Val Leu Leu 245 250 255Gly Asn Tyr Ser Gln Lys Val Asp Val Trp Ala Ala Gly Val Leu Leu 260 265 270His Val Leu Leu Met Gly Thr Leu Pro Phe Gln Gly Lys Ser Ile Glu 275 280 285Ala Ile Phe Asp Val Ile Lys Thr Ala Glu Leu Asp Phe His Asn Ser 290 295 300Gln Trp Ala Ser Val Ser Leu Leu Ala Tyr Asp Leu Ile Gly Arg Met305 310 315 320Leu Asn Arg Glu Val Ser Ser Arg Pro Asp Ala Glu Asp Val Leu Arg 325 330 335His Pro Trp Val Leu Phe Tyr Thr Asp Cys Leu Gln Lys Ala Glu Phe 340 345 350Ser Asn Leu Trp Asp Thr Asn Lys Thr Ala Ala Pro Met Ile His Arg 355 360 365Glu Ile Val Arg Phe Gly Tyr Cys Glu Ser Ser Ser Ser Lys Ser Ser 370 375 380Ser Asp Asn Ser Glu Glu Arg Asp Glu Cys Gly Ile Val Asp Ala Leu385 390 395 400Ala Thr Thr Ile Thr Gln Val Arg Ile Ser Glu Pro Lys Arg Ser Arg 405 410 415Leu Phe Ser Leu Pro Asn Gly Leu Leu Pro Pro Ser Arg Asn Ser Leu 420 425 430Arg Thr17143PRTZea mays 17Met Leu Asn Arg Glu Val Ser Ser Arg Pro Asn Ala Lys Glu Val Leu1 5 10 15Arg Lys Phe Lys His Pro Cys Asn Leu Cys Phe Ile Tyr Met Ile Leu 20 25 30Asn Leu Ser Leu Thr Phe Pro Asn Gly Phe Gln His Arg Ala Pro Trp 35 40 45Val Leu Phe Tyr Thr Asp Cys Pro Gln Lys Ala Glu Phe Ser Asn Ile 50 55 60Trp Asp Thr Asn Lys Thr Ala Ala Pro Met Ile His Arg Glu Ile Val65 70 75 80Arg Phe Gly Tyr Cys Glu Ser Ser Ser Ser Lys Ser Ser Ser Asp Asn 85 90 95Ser Glu Glu Arg Asp Glu Cys Gly Ile Val Asp Ala Leu Val Thr Thr 100 105 110Ile Thr Gln Val Arg Ile Ser Glu Pro Lys Arg Ser Arg Leu Phe Ser 115 120 125Leu Pro Asn Gly Leu Leu Pro Pro Ser Arg Asn Ser Leu Arg Thr 130 135 14018162PRTZea mays 18Met Glu Gly Gly Arg His Pro Ser Pro Pro Pro Arg Ile Ser Arg Gln1 5 10 15Pro Pro Pro Tyr Pro Ala Cys Pro Ser Ile Leu Pro Pro Leu Pro Pro 20 25 30Val Asn Val Thr Asn Pro Gly Leu Val Pro Leu Val Val Ala Thr Leu 35 40 45Phe Asp Glu Arg Val Thr Glu Leu Leu Ser Val Leu Ala Asp Ala Ala 50 55 60Val Gly Arg Pro Gly Arg Trp Ser Ile Gly Glu Ala Pro Trp Ser Ser65 70 75 80Ser Gly Gly Thr Asn Gln Ala Val Tyr Ala Arg Arg Ala Pro Gly Ser 85 90 95Ser Ser Pro Pro Pro Ala Pro Ala Ser Pro Pro Leu Pro Ser Ser Arg 100 105 110Ala Asp Cys Leu Ala Arg Trp Pro Gly Ser Arg Ala Leu Val Ala Pro 115 120 125Leu Gly Thr Pro Ala Phe Val Asp Arg Leu Phe Trp Ser Asp Phe Ser 130 135 140Gly Ser Ile Arg Arg Glu Glu Glu Ala Glu Ala Leu Arg Asp Pro Ile145 150 155 160Arg Arg1987PRTZea mays 19Met Asp Leu Glu Ala Arg Ser Leu Gln Pro Ser Ile Lys Ala Gly Leu1 5 10 15Leu Ala Lys Pro Arg Asp Tyr Lys Ser Asp Leu Asn Asn Val Lys Ser 20 25 30Glu Leu Lys Arg Ile Ser Ala Pro Asn Ala Arg Phe Gly Arg Trp Thr 35 40 45Trp Lys Gln Gly Ala Tyr Asn Leu Ala Leu Arg Val Ser Ser Arg Gly 50 55 60Tyr Leu Arg Pro Leu Pro Gly Arg Leu Pro Gly Arg Ser Ser Trp Ser65 70 75 80Leu Glu Trp Leu Ile Leu Ser 8520279PRTZea mays 20Met Ala His Phe Asp Glu Leu Glu Asp Lys Thr Thr Asp Tyr Val Asp1 5 10 15Leu Ser Val Gln Glu Phe Ala Leu Lys Gln Pro Gln Cys Gly Met Ala 20 25 30Tyr Asn Tyr Tyr Gly Asn Leu Arg Leu Tyr Val Val Ala Asn Lys Ala 35 40 45Glu Leu Ala Ser Ser Ile Phe Glu Ile Asp Lys Ala Ser Thr Lys Arg 50 55 60Ile Gly Ala Arg Phe Cys Arg Cys Leu Pro His Thr Arg Met Glu Gly65 70 75 80Gly Arg His Pro Ser Pro Pro Pro Arg Ile Ser Arg Gln Pro Gln Pro 85 90 95Tyr Pro Ala Cys Pro Ser Ile Leu Pro Gln Pro Pro Pro Glu Arg Lys 100 105 110Lys Gln Lys Leu Ser Glu Ile Gln Ser Gly Val Glu Glu Ala Glu Ser 115 120 125Leu Ile Gln Lys Met Asp Leu Glu Ala Arg Ser Leu Gln Pro Ser Ile 130 135 140Lys Ala Ser Leu Leu Ala Lys Leu Arg Glu Tyr Lys Ser Asp Leu Asn145 150 155 160Asn Val Lys Ser Glu Leu Lys Arg Ile Ser Ala Pro Asn Ala Arg Gln 165 170 175Ala Thr Arg Glu Glu Leu Leu Glu Ser Gly Met Ala Asp Thr Leu Ala 180 185 190Pro Glu Gln Glu Gln Leu Ala Cys Ala Ala Ala Ala Leu Ala Val Gly 195 200 205Pro Ala Tyr Glu Arg Leu Gln Glu Ala Arg Asn Pro Ser Glu Gln Gly 210 215 220Cys Asn His Asp Lys Gln Ile Glu Gln Ala Tyr Asp Asp Ile Leu Asn225 230 235 240Ser Ser Lys His Thr Leu Ala Ser Met Met Glu Leu Gln Glu Ala Leu 245 250 255Leu Glu Ser Asn Gln Ala Thr Lys Asp Ala Asn Gly Ile Ala Ala Leu 260 265 270Tyr Ile Val Leu Val Leu Met 27521428PRTZea mays 21Met Ala Ser Tyr Ser Ser Arg Arg Pro Cys Asn Thr Cys Ser Thr Lys1 5 10 15Ala Met Ala Gly Ser Val Val Gly Glu Pro Val Val Leu Gly Gln Arg 20 25 30Val Thr Val Leu Thr Val Asp Gly Gly Gly Val Arg Gly Leu Ile Pro 35 40 45Gly Thr Ile Leu Ala Phe Leu Glu Ala Arg Leu Gln Glu Leu Asp Gly 50 55 60Pro Glu Ala Arg Leu Ala Asp Tyr Phe Asp Tyr Ile Ala Gly Thr Ser65 70 75 80Thr Gly Gly Leu Ile Thr Ala Met Leu Thr Ala Pro Gly Lys Asp Lys 85 90 95Arg Pro Leu Tyr Ala Ala Lys Asp Ile Asn His Phe Tyr Met Gln Asn 100 105 110Cys Pro Arg Ile Phe Pro Gln Lys Ser Arg Leu Ala Ala Ala Met Ser 115 120 125Ala Leu Arg Lys Pro Lys Tyr Asn Gly Lys Cys Met Arg Ser Leu Ile 130 135 140Arg Ser Ile Leu Gly Glu Thr Arg Val Ser Glu Thr Leu Thr Asn Val145 150 155 160Ile Ile Pro Ala Phe Asp Ile Arg Leu Leu Gln Pro Ile Ile Phe Ser 165 170 175Thr Tyr Asp Ala Lys Ser Thr Pro Leu Lys Asn Ala Leu Leu Ser Asp 180 185 190Val Cys Ile Gly Thr Ser Ala Ala Pro Thr Tyr Leu Pro Ala His Tyr 195 200 205Phe Gln Thr Glu Asp Ala Asn Gly Lys Glu Arg Glu Tyr Asn Leu Ile 210 215 220Asp Gly Gly Val Ala Ala Asn Asn Pro Thr Met Val Ala Met Thr Gln225 230 235 240Ile Thr Lys Lys Met Leu Ala Ser Lys Asp Lys Ala Glu Glu Leu Tyr 245 250 255Pro Val Lys Pro Ser Asn Cys Arg Arg Phe Leu Val Leu Ser Ile Gly 260 265 270Thr Gly Ser Thr Ser Glu Gln Gly Leu Tyr Thr Ala Arg Gln Cys Ser 275 280 285Arg Trp Gly Ile Cys Arg Trp Leu Arg Asn Asn Gly Met Ala Pro Ile 290 295 300Ile Asp Ile Phe Met Ala Ala Ser Ser Asp Leu Val Asp Ile His Val305 310 315 320Ala Ala Met Phe Gln Ser Leu His Ser Asp Gly Asp Tyr Leu Arg Ile 325 330 335Gln Asp Asn Ser Leu Arg Gly Ala Ala Ala Thr Val Asp Ala Ala Thr 340 345 350Pro Glu Asn Met Arg Thr Leu Val Gly Ile Gly Glu Arg Met Leu Ala 355 360 365Gln Arg Val Ser Arg Val Asn Val Glu Thr Gly Arg Tyr Glu Pro Val 370 375 380Thr Gly Glu Gly Ser Asn Ala Asp Ala Leu Gly Gly Leu Ala Arg Gln385 390 395 400Leu Ser Glu Glu Arg Arg Thr Arg Leu Ala Arg Arg Val Ser Ala Ile 405 410 415Asn Pro Arg Gly Ser Arg Cys Ala Ser Tyr Asp Ile 420 42522401PRTZea mays 22Met Ala Ser Tyr Ser Ser Arg Arg Pro Cys Asn Thr Cys Ser Thr Lys1 5 10 15Ala Met Ala Gly Ser Val Val Gly Glu Pro Val Val Leu Gly Gln Arg 20 25 30Val Thr Val Leu Thr Val Asp Gly Gly Gly Val Arg Gly Leu Ile Pro 35 40 45Gly Thr Ile Leu Ala Phe Leu Glu Ala Arg Leu Gln Glu Leu Asp Gly 50 55 60Pro Glu Ala Arg Leu Ala Asp Tyr Phe Asp Tyr Ile Ala Gly Thr Ser65 70 75 80Thr Gly Gly Leu Ile Thr Ala Met Leu Thr Ala Pro Gly Lys Asp Lys 85 90 95Arg Pro Leu Tyr Ala Ala Lys Asp Ile Asn His Phe Tyr Met Gln Asn 100 105 110Cys Pro Arg Ile Phe Pro Gln Lys Ser Arg Leu Ala Ala Ala Met Ser 115 120 125Ala Leu Arg Lys Pro Lys Tyr Asn Gly Lys Cys Met Arg Ser Leu Ile 130 135 140Arg Ser Ile Leu Gly Glu Thr Arg Ala Lys Ser Thr Pro Leu Lys Asn145 150 155 160Ala Leu Leu Ser Asp Val Cys Ile Gly Thr Ser Ala Ala Pro Thr Tyr 165 170 175Leu Pro Ala His Tyr Phe Gln Thr Glu Asp Ala Asn Gly Lys Glu Arg 180 185 190Glu Tyr Asn Leu Ile Asp Gly Gly Val Ala Ala Asn Asn Pro Thr Met 195 200 205Val Ala Met Thr Gln Ile Thr Lys Lys Met Leu Ala Ser Lys Asp Lys 210 215 220Ala Glu Glu Leu Tyr Pro Val Lys Pro Ser Asn Cys Arg Arg Phe Leu225 230 235 240Val Leu Ser Ile Gly Thr Gly Ser Thr Ser Glu Gln Gly Leu Tyr Thr 245 250 255Ala Arg Gln Cys Ser Arg Trp Gly Ile Cys Arg Trp Leu Arg Asn Asn 260 265 270Gly Met Ala Pro Ile Ile Asp Ile Phe Met Ala Ala Ser Ser Asp Leu 275 280 285Val Asp Ile His Val Ala Ala Met Phe Gln Ser Leu His Ser Asp Gly 290 295 300Asp Tyr Leu Arg Ile Gln Asp Asn Ser Leu Arg Gly Ala Ala Ala Thr305 310 315 320Val Asp Ala Ala Thr Pro Glu Asn Met Arg Thr Leu Val Gly Ile Gly 325 330 335Glu Arg Met Leu Ala Gln Arg Val Ser Arg Val Asn Val Glu Thr Gly 340 345 350Arg Tyr Glu Pro Val Thr Gly Glu Gly Ser Asn Ala Asp Ala Leu Gly 355 360 365Gly Leu Ala Arg Gln Leu Ser Glu Glu Arg Arg Thr Arg Leu Ala Arg 370 375 380Arg Val Ser Ala Ile Asn Pro Arg Gly Ser Arg Cys Ala Ser Tyr Asp385 390 395 400Ile23380PRTZea mays 23Met Ala Ser Tyr Ser Ser Arg Arg Pro Cys Asn Thr Cys Ser Thr Lys1 5 10

15Ala Met Ala Gly Ser Val Val Gly Glu Pro Val Val Leu Gly Gln Arg 20 25 30Val Thr Val Leu Thr Val Asp Gly Gly Gly Val Arg Gly Leu Ile Pro 35 40 45Gly Thr Ile Leu Ala Phe Leu Glu Ala Arg Leu Gln Glu Leu Asp Gly 50 55 60Pro Glu Ala Arg Leu Ala Asp Tyr Phe Asp Tyr Ile Ala Gly Thr Ser65 70 75 80Thr Gly Gly Leu Ile Thr Ala Met Leu Thr Ala Pro Gly Lys Asp Lys 85 90 95Arg Pro Leu Tyr Ala Ala Lys Asp Ile Asn Tyr Phe Tyr Met Glu Asn 100 105 110Cys Pro Arg Ile Phe Pro Gln Lys Ser Arg Leu Ala Ala Ala Met Ser 115 120 125Ala Leu Arg Lys Pro Lys Tyr Asn Gly Lys Cys Met Arg Ser Leu Ile 130 135 140Arg Ser Ile Leu Gly Glu Thr Arg Val Ser Glu Thr Leu Thr Asn Val145 150 155 160Ile Ile Pro Ala Phe Asp Ile Arg Leu Leu Gln Pro Ile Ile Phe Ser 165 170 175Thr Tyr Asp Ala Lys Ser Thr Pro Leu Lys Asn Ala Leu Leu Ser Asp 180 185 190Val Cys Ile Gly Thr Ser Ala Ala Pro Thr Tyr Leu Pro Ala His Tyr 195 200 205Phe Gln Thr Glu Asp Ala Asn Gly Lys Glu Arg Glu Tyr Asn Leu Ile 210 215 220Asp Gly Gly Val Ala Ala Asn Asn Pro Thr Met Val Ala Met Thr Gln225 230 235 240Ile Thr Lys Lys Met Leu Ala Ser Lys Asp Lys Ala Glu Glu Leu Tyr 245 250 255Pro Val Asn Pro Ser Asn Cys Arg Arg Phe Leu Val Leu Ser Ile Gly 260 265 270Thr Gly Ser Thr Ser Glu Gln Gly Leu Tyr Thr Ala Arg Gln Cys Ser 275 280 285Arg Trp Gly Ile Cys Arg Trp Leu Arg Asn Asn Gly Met Ala Pro Ile 290 295 300Ile Asp Ile Phe Met Ala Ala Ser Ser Asp Leu Val Asp Ile His Val305 310 315 320Ala Ala Met Phe Gln Ser Leu His Ser Asp Gly Asp Tyr Leu Arg Ile 325 330 335Gln Asp Asn Ser Leu Arg Gly Ala Ala Ala Thr Val Asp Ala Ala Thr 340 345 350Pro Glu Asn Met Arg Thr Leu Val Gly Ile Gly Glu Arg Met Leu Ala 355 360 365Gln Arg Val Ser Arg Val Asn Val Glu Thr Gly Ser 370 375 38024589PRTZea mays 24Met Gly Ser Leu Gly Leu Pro Val Ser Phe Ser Thr Ser Lys Val Asn1 5 10 15Lys Asn Thr Cys Asn Lys Gly Lys Lys Lys Gly Arg Gln Ala Pro Leu 20 25 30Lys Ala Ala Asn Thr Gln Ile Asn Asp Ala Val Arg Ile Cys Ile Asn 35 40 45Thr Glu Asp Arg Glu Asn Ser Val Glu Ser Leu Asp Ala Met Glu Gln 50 55 60Thr His Ser Cys Asn Leu Phe Val Thr Pro Leu Gly Gln Asn Glu Pro65 70 75 80Ser Arg Asp Asp Thr Asp Lys Arg Leu Arg Glu Asp Ser Ser Cys Val 85 90 95Glu Glu Gln Glu Glu Ser Gly Cys Ser Thr Ile Tyr Ser Ala Gly Lys 100 105 110Ala Pro Gly Cys Asp Ala Lys Asn His Leu Thr Glu Leu Gly Ala Phe 115 120 125Glu Leu Ser Asp Asn Leu Ala Asn Ser Ala Lys Glu Glu Tyr Ser Ile 130 135 140Gln Glu Asn Gln Ala Tyr Glu Ser Val Leu Leu Asp Ser Glu Glu Met145 150 155 160Ser Arg Asn Asp Cys Val Asp Asp Glu Ser Thr His Ser Cys Val Gly 165 170 175Ile Tyr Gln Asp Glu Arg Val Ser Thr Arg Gly Asp Gln Thr Ser Glu 180 185 190Glu Thr Leu Ser Val Pro His Asp Tyr Asn Asp Val Gly Arg Glu Ala 195 200 205Ser Leu Ser Leu Ala Glu Pro Ser Ser Ile Asp Glu His Ala Gln Ser 210 215 220Ser Ala Asn Asn Phe Tyr Tyr Asp Tyr Gly Glu Trp Arg Val Ile Trp225 230 235 240Asp Pro Phe Tyr Asn Arg Tyr Tyr Phe Tyr Asn Ile Gln Thr Gln Glu 245 250 255Ser Thr Trp Cys Pro Pro Glu Gly Leu Glu Asp Phe Ala Ser Tyr Cys 260 265 270Ser Pro Asp Thr Thr Lys Glu Leu Ala Glu Leu Gly Ser Gln Cys Ser 275 280 285Ser Met Ala Pro Gln Glu Asn Asn Leu Ala Thr His Val Asn His Leu 290 295 300Glu Ala Gln Glu Gln Asp His Cys Ile His Asp Leu Ser Asp Ile Pro305 310 315 320Val Glu Lys Pro Ile Tyr Gln Ser Met Ile Thr Thr Ser Asp Lys Ala 325 330 335Gln His Thr Glu Asn Lys Tyr Ser Asp Ser Thr Thr Thr Val Leu Glu 340 345 350Met Asn Gln Glu Val Ala Ser Thr Lys Thr Lys Lys Arg Val Arg Arg 355 360 365Ser Arg Ser Tyr His Ser Cys Gln Asp Met Ala Gly Asn Val Ser Asn 370 375 380Asp Ile Ile Lys Tyr Trp Ala Gln Arg Tyr Ser Leu Phe Ser Leu Phe385 390 395 400Asp Ser Gly Ile Lys Met Asp Glu Val Gly Trp Phe Ser Val Thr Pro 405 410 415Glu Pro Ile Ala Lys His His Ala Ser Arg Val Gly Ala Gly Val Met 420 425 430Ile Asp Cys Phe Thr Gly Val Gly Gly Asn Ala Ile Gln Phe Ala Lys 435 440 445Lys Cys Lys His Val Ile Ala Val Asp Ile Asp Pro Gln Lys Ile Asp 450 455 460Cys Ala His His Asn Ala Ser Ile Tyr Gly Val Asn Asp His Ile Asp465 470 475 480Phe Ile Val Gly Asp Phe Ile His Ile Ala Pro His Leu Lys Gly Glu 485 490 495Thr Ala Phe Met Ser Pro Pro Trp Gly Gly Pro Asp Tyr Ala Lys Val 500 505 510Asp Val Tyr Asp Met Lys Ser Met Leu Ile Pro Cys Asp Gly Tyr Ser 515 520 525Leu Phe Lys Leu Gly Thr Met Ile Ala Ser Arg Val Val Met Phe Leu 530 535 540Pro Arg Asn Ile Asp Leu Asn Gln Leu Ala Asp Met Ser Leu Ser Val545 550 555 560Asp Pro Pro Trp Ala Val Glu Val Glu Lys Asn Phe Leu Asn Gly Lys 565 570 575Leu Lys Ala Ile Thr Ala Tyr Phe Glu Glu Gln Asp Arg 580 58525589PRTZea mays 25Met Gly Ser Leu Gly Leu Pro Val Ser Phe Ser Thr Ser Lys Val Asn1 5 10 15Lys Asn Thr Cys Asn Lys Gly Lys Lys Lys Gly Arg Gln Ala Pro Leu 20 25 30Lys Ala Ala Asn Thr Gln Ile Asn Asp Ala Val Arg Ile Cys Ile Asn 35 40 45Thr Glu Asp Arg Glu Asn Ser Val Glu Ser Leu Asp Ala Met Glu Gln 50 55 60Thr His Ser Cys Asn Leu Phe Val Thr Pro Leu Gly Gln Asn Glu Pro65 70 75 80Ser Arg Asp Asp Thr Asp Lys Arg Leu Arg Glu Asp Ser Ser Cys Val 85 90 95Glu Glu Gln Glu Glu Ser Gly Cys Ser Thr Ile Tyr Ser Ala Gly Lys 100 105 110Ala Pro Gly Cys Asp Ala Lys Asn His Leu Thr Glu Leu Gly Ala Phe 115 120 125Glu Leu Ser Asp Asn Leu Ala Asn Ser Ala Lys Glu Glu Tyr Ser Ile 130 135 140Gln Glu Asn Gln Ala Tyr Glu Ser Val Leu Leu Asp Ser Glu Glu Met145 150 155 160Ser Arg Asn Asp Cys Val Asp Asp Glu Ser Thr His Ser Cys Val Gly 165 170 175Ile Tyr Gln Asp Glu Arg Val Ser Thr Arg Gly Asp Gln Thr Ser Glu 180 185 190Glu Thr Leu Ser Val Pro His Asp Tyr Asn Asp Val Gly Arg Glu Ala 195 200 205Ser Leu Ser Leu Ala Glu Pro Ser Ser Ile Asp Glu His Ala Gln Ser 210 215 220Ser Ala Asn Asn Phe Tyr Tyr Asp Tyr Gly Glu Trp Arg Val Ile Trp225 230 235 240Asp Pro Phe Tyr Asn Arg Tyr Tyr Phe Tyr Asn Ile Gln Thr Gln Glu 245 250 255Ser Thr Trp Cys Pro Pro Glu Gly Leu Glu Asp Phe Ala Ser Tyr Cys 260 265 270Ser Pro Asp Thr Thr Lys Glu Leu Ala Glu Leu Gly Ser Gln Cys Ser 275 280 285Ser Met Ala Pro Gln Glu Asn Asn Leu Ala Thr His Val Asn His Leu 290 295 300Glu Ala Gln Glu Gln Asp His Cys Ile His Asp Leu Ser Asp Ile Pro305 310 315 320Val Glu Lys Pro Ile Tyr Gln Ser Met Ile Thr Thr Ser Asp Lys Ala 325 330 335Gln His Thr Glu Asn Lys Tyr Ser Asp Ser Thr Thr Thr Val Leu Glu 340 345 350Met Asn Gln Glu Val Ala Ser Thr Lys Thr Lys Lys Arg Val Arg Arg 355 360 365Ser Arg Ser Tyr His Ser Cys Gln Asp Met Ala Gly Asn Val Ser Asn 370 375 380Asp Ile Ile Lys Tyr Trp Ala Gln Arg Tyr Ser Leu Phe Ser Leu Phe385 390 395 400Asp Ser Gly Ile Lys Met Asp Glu Val Gly Trp Phe Ser Val Thr Pro 405 410 415Glu Pro Ile Ala Lys His His Ala Ser Arg Val Gly Ala Gly Val Met 420 425 430Ile Asp Cys Phe Thr Gly Val Gly Gly Asn Ala Ile Gln Phe Ala Lys 435 440 445Lys Cys Lys His Val Ile Ala Val Asp Ile Asp Pro Gln Lys Ile Asp 450 455 460Cys Ala His His Asn Ala Ser Ile Tyr Gly Val Asn Asp His Ile Asp465 470 475 480Phe Ile Val Gly Asp Phe Ile His Ile Ala Pro His Leu Lys Gly Glu 485 490 495Thr Ala Phe Met Ser Pro Pro Trp Gly Gly Pro Asp Tyr Ala Lys Val 500 505 510Asp Val Tyr Asp Met Lys Ser Met Leu Ile Pro Cys Asp Gly Tyr Ser 515 520 525Leu Phe Lys Leu Gly Thr Met Ile Ala Ser Arg Val Val Met Phe Leu 530 535 540Pro Arg Asn Ile Asp Leu Asn Gln Leu Ala Asp Met Ser Leu Ser Val545 550 555 560Asp Pro Pro Trp Ala Val Glu Val Glu Lys Asn Phe Leu Asn Gly Lys 565 570 575Leu Lys Ala Ile Thr Ala Tyr Phe Glu Glu Gln Asp Arg 580 5852613229DNAZea maysmisc_feature(10689)..(10788)n is a, c, g, or t 26ccccgcgcgg ccaaccctct ctaagagggc cctggtcctt ccttttatag tcgtaaggag 60tggatccagg tgtacaacgg gggtgtagca gagtgctacg tgtctagcgg gggagagcta 120gcgccctaag tacatgccga tgtggcagcc ggagagatct tggcacccag cgagtgtgat 180gtcgtggcca tcggaggagc gacggagcct ggcggaggga cagctgttgg agcggttgag 240tccttgctga cgtcctcctg cttccgtaag agagctgaga gccgccgtcg tcacagagct 300tgcggggcgc catcattgcc tatctggcgg agctagccag ataggacacc ggtcttgttc 360tctgcggccc gagtcggctc ggggcagggt gatgatggcg cttcctgttg acgtgactgg 420cctgcgccct aggtcgggcg acgtggaggc tcctccgaag ccgaggtcga gtctgtcttc 480catggtcgag gccgagcccg agcccctggg tcgggcgagg cggaggtcgt tcggcagagg 540ccagggcgga gtccgagccc tggggtcggg cgaagcggag ttcgtcgtct tctggggctg 600agcccgagtc cgagccctgg gtcgggcgga gcggagttcg ccgtcttccg ggactttagc 660ccgagtccga gccctgggtc gggcggagcg gagttcgccg tcttccgggg cttagcccga 720gtccgagccc tgggtcgggc ggagcggagt tcgccgtctt ccgggactta ccccgagtcc 780gagccctggg gtcgggcgga gcttcctatg gtgcctttgg cagggcctga ctgcccgtca 840gtctcactct gtcgagtggc actgcagtcg gagtggcgca ggcggcgctg tccttctgcc 900aggccggtca gtggagcggc gaagtgacgg cggtcacttc ggctctgccg gagggcgtgt 960gtcaggataa aggtgtcagg ccacctttgc gttaaatgct cctgcgattc ggtcggtcgg 1020tgcggcgatt tagtcagggt tgcttcttag cgaaggcaag gcctcgggcg agccggagat 1080gtgtccgccg ttggaggggg gcctcgggcg agacggaaat cctccggggt cggctgccct 1140tgtccgaggc taggctcggg cgaggcgtga tcgagtcgct cgaatggact gatccttgac 1200ttaatcgcac ccatcgggcc tttgcagctt tatgctgatg ggggttacca gctgagaatt 1260aggcgtcttg agggtacccc taattatggt ccccgacaac cacaaacgcc cacgtcgtgc 1320gcgtggaggt aaggctatct gcattcatca tactttaaac tagttgggtg cccgtgcgtt 1380gcaacagata tcatataaat tcatgtgttt tgctacgcga cacgagggat cggtattatt 1440agtagtcatt tttctattgg acctagtgag gtactcgtac gttgctacgg agatcatata 1500aatccatgtg ttttgctacg cgacacgagg gatcggtatt attagaagtc attttctatt 1560ggacctagtg aggtacctgt atgtgcccgt acgttaccac gagagttaaa acctagtata 1620aaacataaat acagaatgac aaacatcatt atgatataca aattcgtgta acaaaatatc 1680actgtagcac taatattcaa taaaatgtag caactcactg ctcactaatg atgtgtccag 1740ggtaagtggg taaagcacgg tagacgtctt cttttgactt agtacaatgc tcgtgtgttg 1800agacggtgca caattattcg ataaaatgtt tgagcacaag tgcacaacga ttacataaaa 1860ttgaaaaata cttatattaa caaagtctaa ttgtctcaaa ttctttttgc caacaaccaa 1920ttcactgtgt tgcgatgata cacaataata tttcatgaat gactcaggca atccacctaa 1980gtgggtaacc caaaagcaca ccaatgtgat gcctacacgt gaacatctaa cctctattac 2040tacaatagtc ttgttggaaa tttacaactc atctagaatt tcaaaatact caaaattatt 2100tagactctct ctaaaccgga gcatacaaaa tagatcaggc agttttaacg gaaacatgta 2160tgactctcac catcctgtgc agcgctgctc gaataccaga aagtcccagc tcagccaata 2220ctgcatcaag ttctgctagt tccttttttt catctccttt ttagacaact gtctttctgt 2280ctctttgggt ggcgcagcta gggcccacaa ctgtcttgat agcaggttcg gctggggcat 2340ctttcggatc atgctttagg ttcatcctca gctccatcat caacctcaaa atcaagatcc 2400ttgtcatcac tctcaatgtc ctgcaacata tatagaagta catctaagaa cactatcaga 2460gaggctaact ataatgacaa tcagccaaac atgctccatc cccctcgaga tgagtaaatt 2520atacttccac ttaatagttt agttttgatg caaaaattat gacaaaaaac aagtagaaca 2580ataaccagag tttgctatga caaacatgca gcacctgtca tgtataacca tttccaaatc 2640acttgcttac ccaagatata gaaaaacttg gaacaagtcc ctatccaagg tacatatttt 2700ttatttttgt ttagcaggtg gtcagctctg aaaattttct gtcaaggtga agtttgcctt 2760atgacttgct catagcaggg tctgctttga actcctccac tgggaatgca atgtcgtgat 2820cccacagacc gatcaaatgg gattcaaatt taggctgtgc tatatcgcgc tctgaatcta 2880gtgataatgc gtctatcctt ggtttcttgg gagaatgttg catgatctct gggtagtcat 2940ctaagcttgg aggtgaatca gaaagtttct gaccctcaga agctttaatt gatccaccaa 3000gcttttgttg ctcctcaatg atcatctgca agtatattcc ttgtgcttta attgttagtt 3060gcagttgtct ttgaacctga aaacataact tatttagaca agcaaaagaa agttggcaag 3120gaaaatttag attgatcttg aagctaaaaa ggactcataa actacaagca ataagacaaa 3180acaggagaac tgccaacaat agattattga caactaagca ttactacaac catgagaaga 3240aaccactgtt aggtgtaacg tgacagttaa taagtgaata ttgagaatgg gccaaaatat 3300cagcttgtag aagtagcccc aactaagtag gacttctggt catagtagga ttgtcggtaa 3360actaaaatga tgtaaaaaac agaaatctct actacttatt aagtaagcaa tagtagtctg 3420cctccctcgt tctgccgttc tgccatccat cgatctggac cgttcatatc gcatcgtgcg 3480tctgcgtctc ccacactcta tctccctcca ccgcacgacc acccaatccc taaccttccc 3540ccacacgttc ctctctctct cgcgttgccc gccccctgcc cctgccgcga tagaatgcct 3600cgcggcctcc accgccgacg cggcctccac cgcttccctt cctccctttg cctccaccgc 3660ccctccccca acgtcgtcgc caccgccgct ctagggcctc gacgcaggca gcttgggaac 3720cgccctcggc ctcggtgtcg cacggcccca cgaggaccga ccttgccgca tggcggttgt 3780ggaggcgtgc gaccccgcaa ggatcgacat ctcgtccaag acagcagccc cctcagagcg 3840gacgacgcag cttatctctg gtggaaggca cgtagccacg cccccactcc ctcctcgtgc 3900tcccgccgac gcctgcctcg cgctcctcgt cgaccgtctc tccctctcct ccgtcgttgc 3960atctcgcctc tcgcgcacgc acaaaggtga gtctcccctc ccccgactca cccctctagg 4020tctctgattc ttctcatcgg cgagcaagga cggaggagct ggcggaggag ctgtcgaata 4080taacctaatt atgtcggtct cgggtgggat ttggctattt tgggcgccat ggcctaggtg 4140agcgtacaga tctggttctg cattttgttt tcttatctgt caatcatgtt ttttcatatt 4200actgcaggtg ggtctgatta aatattacag acatgttttg cttcgaagcc ctacacctgt 4260ccatctaatt ttaggaactg acgattgaca cagctttgtt gtggcgctat ttagtataag 4320ttatagaaat cggatctttg gctctcatct gcttgtagtt gcaacattga ggtagaacct 4380gggtgggtct gattgctcat tgctgtgata tatgtctgga aaagcagagt tatagtttgg 4440aaactagcgc gtagtacatg tcggtttctt taagtaattg cttatgctct gttgttttga 4500tttccagatg ccccaaatgt gctgaactta agctatcgcg tgagaacaca actttctggt 4560aagtggcaag ctcctccgtt tcttatttta ccgcattgga agttggaaca ccatgatagc 4620tctcaggacg aatgagtatc agctaatttt attgttttaa caatagagca taaaacctta 4680tgcatatttt aaaggttcat gtatcattta gttatggcct tatgaggaag cttcatcatc 4740atatctgtag ataccttgat tcgtggggat ggtaattttt tgtattcttg ttcttttgat 4800ttttgaagga aacacatctt ataatacgat agcacccatt ggaagtcctc gcatacagtg 4860cagggtagaa ttatgcttca tgtgttgccc cttcaccacg atatgccaaa ttgaatgtag 4920tttcatcagt tatgctcaat tatggatgtt cagaggcatc caatttcagt gtgtactgca 4980atacttgggt ccacctatag ttgatagatg ttctgtattt tgttttctta taaattagat 5040ttggcttgca ttatattgtt ctcttttgga acagagtcac tctccgacag ctcatcgccg 5100cctgcttcga ccgcctgtac acaaggttta agaagcgctc tgcgaacttc cacggccgcg 5160ccgtgctgct tctagctatt tacctttttc tatgatgcaa atgtttatat gcatactatg 5220ttacttgaga aacattaaag tacttgatgt acctaaacac attttgttta gtgatgtata 5280gtgatatagc attattgtct atattaatat ttatattgct ggtagcttct actttaattg 5340atctcaatgg ggcatttggg tggctagcaa ttcacattga taatttaaaa gtgaatttca 5400ggtgtacatt tgatggcctc cgatatggtg ctgccttcaa ttctctacaa tgcgcgagaa 5460tgctgctcag gagggtatta atggctcaac acagatgacc tcctcggagt catgtttcta 5520attatctaca ctatgattct ccttctgttg ataaaatatt gttttattgt gctgtgagct

5580aatgataaca gtgatggtaa gtaaatatgg tccatgcata ttctcatcat agatggctga 5640aaaactccga gtgctgctac gctaccagag tcttcatgtg catacttact tcaagaactc 5700aaggtacata gttttctcaa cagaagaata tgtatctgtt tgattccagc tgaattgctt 5760actaaactca gtgtgtcact ttaaatgata tgggatgaag ttgggcaaga ccaaagtgaa 5820agtgggagaa tacccgaaga acttcttgtt ggacgaactt ggagaaacca atactaaaac 5880tcagtgtcaa ccgcttgcaa caggcaattt aggtcgatgt gctcgcacgc tgtgtgacca 5940tgtctgagca ctccccaccc acccaatcgc ttcccacgtc atgccgccac gtcgagaatt 6000tgtacacaac tcaggttgcc cattttactc tgttattgaa cctcgctttt ctgtagaccc 6060aggtatgcta gaagtaggta atggtagcat gaccttagca gagtatcgct cacattttag 6120catttgggtt gtcatgaagg taagttttca ctgagattgg accgatgttg tgtcatcata 6180ttttgagtaa agggtttgac cgaagaattt tgaaaaataa aagactgtag ttcattgaag 6240gtgataactg gtatgcaagt attactatta atgattctcc acttactgat attacatttg 6300gataagagga aggagatatc ctaattctaa tgggttaact gctagcagtt ccttatgtac 6360tgttatgctt caggcccctt tattaattgc ctcatgtgga aagagggtgt gatgatcttt 6420tgttgttttt gtagcgagaa gctcaaagaa ggggaagaca tctacatcta tcagtaagtg 6480atccagttta gtagatgaat caccgctatg ggtgtttttc tattcttgaa taggcatgtg 6540gttctattga tggttgtttg atgtttggta tattgtgttc tgactgtagg ggtgacaaat 6600ccttttgcag catggagtgc agagaaaatt tcatggtaga cgagatggaa ggtgagccag 6660agagtatgta taatcagatc aattttagac tttttttcac taaccactta tccatagcac 6720acttgtgttt atttgagata tgtgtcgtca ttttgtgttc atcggctatt gctatgaaga 6780tgaccacagg gttaattctt gtttatgagt tcatcctcta gaatgtattg ggttgtcgcc 6840aagaaaaaac acacgaagac ataaattcaa ataatttctg gtgctgtgga gtattcataa 6900ttgtgtgctt ttgttgaatc cttcatgtac cataattctt actgcttgca agccctttta 6960gaatgtgact ttaggatgta gaatttggtg aaaggagaca agaaattatt ctctaaacta 7020ctatttttat aggcaaggga ggctggaagc accttttttg catgggagct gcatggagag 7080aacttattgg tagaaaacaa actttgacca gattcttgaa tgtgccaagg gaaaaaggta 7140ttaattggga ccttctattt atacaaggga aaaaggtatt ttttatacaa ttgctcttcc 7200ttctcatgta tggcaacttc ctgttttgta gagattgagc gcccttcctc actgtttgaa 7260ctttactcat cctcatgaaa ctgtgatgct acctgtagga gcggtttgtt caaggaggga 7320atgactttta gaccctaata acttgtctac acttaagata aaaattgttg tcattgctat 7380ggttacctcc tttagtgtcg taaactcaga ataatcatag atccctttgt tgcttacaca 7440ttattcatgt tctacaagct attgagattt tgactagccg ctacatttta gtgcaggttt 7500tcataatttc ttacctttat atctacattt tagatttcct cccttttgac actacatttt 7560agccgctcac tcggaagctt tccatccctg ctggagtact cagcccataa aggtaaatgc 7620atgcctttac ctgcctgaaa tgcattgtct ttcttttcaa ccgtgaatgt gaattggatc 7680catcattcat gtgcacacac aaggccggtt tgaatttgga atgcactgtt gttctgcctg 7740tgatccactt ggtggttttt atttgcgttc acatattaaa aaatatatta taccctgatt 7800ctgagtcaaa ctttggttag acctggattt gttgattagt ttccgttgtt gtgatctgtt 7860gaacaaacta taggttaggt gagttatcag attcttagat ctgtcctgta cggttgtgat 7920ctgttgaaca aactataggt taggtgagtt atcagattct tagatctgtc ttgtacgtct 7980gatctctacc atgttacaaa catctgaaag ttaataataa tcactactac attcacacct 8040atcttgtatg gatttgctgt tgaaattgca gaatgcttcc caacttgtgc atttttatta 8100catccccaca acttcatcgt atagtccaga catcctttgt ttgtgtagca aaatagatgt 8160gcaattgttt cattgtaaca atgtcctgta tattaacctc ggcggctctg gtgctttctg 8220caggcttctc attcttattt gcatggcctg taactgctgc acagcgcgct acggctggcg 8280tagcggcagc aactgctggt atgaacagga tccgtccacc ctcattagcg acgatgaagt 8340gacacccaat tctaggcgcg gacgtacgat ggcgccctcg cgcgggatat cgggctgagt 8400gatgagacgc tgctgccatt ggggttccag gcgcgcctga ccgctcccac tcctaggccc 8460ggcatgtctc tccgacgcta cctaccggtg ctggtgaaac gcaaacttaa aatcgtgcat 8520tgcaagatgt tgccttctct ttatgtctcc tctactctac gcctacgagc tttctttgtt 8580tataactttt tctgtggttt cgctttcaac aagctcaagc ggtcaacaga tcaacttaga 8640caaaatgcag atgcagtgga aatattaagg aagaccagat tccctcatgt tcatggagca 8700ggtgaaaaga agtcaccaga gacaattctg gatcatgaga ctatgagagg actttggcaa 8760caaatgttcg gtttggactt ctctgggtgg ttctatgata ctttataatt tcttttggct 8820tatgtgtcat ttcagaagat atgaaaatag ctaagcattg tcaataatat tagcttcctt 8880gtttcttgtt aacctaagat gactgcccta tttcacttgt tttttctagc tgtggtagta 8940ggtcattaga gtagttccct tcaatcgtag cactcagcca tgttgtaaat ggtttacttc 9000tattgtgaat cgtgtttttc ttttttaatg gtggggttag tggaaaatct ggacgtatcc 9060gcgaatggtg gagaagcaca caaaacaaca attatatgca aaaactaaac tatgttttct 9120ttctttctat ggtatatgaa tttagtacct tgataatcac aatgattaga gtctgtttgt 9180atttatatta aaacataact gcttaggagg tgtatgagga ggataacgag atgtgcagtg 9240cgcgcaacag taagcagact accatgactt ggttgctctt caagtgtgca agtgtgtcta 9300tggattgtat ggttctttgg tttttgttgt tcgctaaggt gcaaagtagg aagaaaatac 9360ggccctgacc ctgcaagtgg ggaatgtatc gtttttgctg gaactgaaat atgcggtgtt 9420cttttttatg cactaatatt ttgttgaata ttttgtatac cgttgcaacg cacgggcatc 9480tacctagtaa attaatagtt ttcaataaaa gacctcgagt ttctcatgta gtcgcttctg 9540aacctccatt tacatcttta gtgccttaaa aatcgtgcca atgtttccat atatcaaatc 9600acggtgatca acaattaaac aaaataactc ttaaccaaaa aaatggaagt gtcacgcact 9660catgaactac agatgtcaat agctaacata taaacaggct caagccttcc ttcaaggata 9720ccaggttgtt ctaagcagca tctctatatt tagagaggaa gacttgtctc gattcatctc 9780ttcctgagac cccactcatg tgggtagttc tacctttttt atgaatacag atgcaacatc 9840aacatataaa caggctcatg cctcagatac tactctcaat tggtctactg aaattctatt 9900gcatttatga cgaatgcagt tacagattta aaagtaaagg acgagtacag ctgcatgatg 9960aaaagtaaac agaacggtaa ctgcaaattt acttcagaca tgctgctgct ttgatgagat 10020cggaacaggg ttgttgctca caatctgtgg atttaaagcc tgcaacacat gtcacaattt 10080aatcatgagg ttcactaaga caacattaca caaaagaggt gttggtactg agaacataaa 10140agatcatgtc atccttgtct gaaaggaagt gcattccaac taagaacaaa gatcactgct 10200tctcaatgaa gcaaggctac gacatgatgc atatttagct caaattaccc aagcaagtgt 10260ttattaaaat aaaagcatgc atcttagaca tcagcaatag gttatctaag acgtcaatgc 10320atatttagcc cagttttaga caacaatgca tatttagcta aaatattaaa catgcaatgc 10380ctctttagct caaatttcag accacagtat taaccggtgg tgctacactt gaagaacaag 10440cttatctgga gaagcagccc acatccagtc ctgaaggcaa atcagatgtt tgaacaaaat 10500cagtgcttag ccaaatctag aaaccacaac cactcgaacg ctgactcaga tggtagagcc 10560atcacaccaa tggtgctacc atcagtaaaa taatagaatc aatttgtttt tccttactga 10620tacagtaccc ttcgaataat tgtttataag caattgaact tgctgcccaa tgtattatag 10680tcttggtcnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 10740nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnntt agagacatat 10800agagtagtat agataaggat aatggacagt agtacttgaa gttaatacat aaaacttgca 10860gctaataaac aaacatagta ctattttagt taacatacaa caattttcca gctcttcttt 10920gcacttatag catttttagt acctacacac aacctgaact gaacaccatc aagaacagta 10980ctgcatttgg aagtttgaat aaaaatggtg tccaattaac agttttctaa tattgtaact 11040taattattga atggtgcaat agggtgttca cttgttgaat tggttgataa tcttgtgtgg 11100gctttaaata taacattgtt agtgaatgct aaataattct gcactcttgg aatggtcaag 11160ctgagaatga gccagagatt aactggtgcg aacctgggaa gcctgcaggg atggaggtcc 11220ttcttgggag taaggatggg ttcttgacga agaagattaa gagctatagc acatgccgaa 11280gtgatcctac aaagccacgt gcattgcaat gcctttcacc actgatagca ggtcatgtaa 11340gatacatcat ttaaccatct ggatatgtgt taatgttaga caactaattt ataaaatttt 11400cgattcaggt gggcaatgga attgtttgtt gaatgagagt aatgtttatt attggtaaag 11460tatcacaatt cagtctgatt caggtgggca tggtaaaatg aagtgcatct gccaaaaagt 11520aaggtaagta actaagtatc ttttggatgt ttggaggatc agatgttgtt gcagaatgtt 11580tattgactaa ttgcaagtca aatgaattaa aggggaaaac agataagcac taagattttt 11640atatgaaagc tgtgtgagtc aacctgctaa gagctctaaa tgtcctgtcc cagaagtagt 11700ggcggagcca gcccaaattt gaagcctatc gactcagaaa attcatccat gccaaccaaa 11760caaatagaat aattacctta tctgttgcac gggtgtgaag gagctggaga tgcagatcag 11820gtgtcacaaa ttctataacc aaattgcata atgcataagc aatcaaacaa tacaccccta 11880attctttagg ctaaacgaac tgcactaagc ctatgtttca gatttcaaga ttactattgg 11940attgtgttca tcacaccaat aggttaagta ccaacatgct aatcaataag aacagtaaag 12000gggcattacc tacaacaaat gttacgtcta aatagaacta acttggaggc aatgtaagct 12060ctgaatgtcc gaaacattga aacaaagtct gaactctgaa attgtctccc taatcctcac 12120ctcatcccat ctgtgtcgat ggatggtggg tgcctcgcct gagctgttgt gtggtgccgc 12180ggacgaggtg agcgcttgtg tagaatgaag atggggtcgc agagcgacgt ggccgacggc 12240ggaccgatga tggagagggt actttttact ttcattaaca gatcatcaag caaaataaaa 12300tagtaatgag caattgagcc cagatcacat ttgtaatcat ctagcatgta agcactattt 12360tttaatttga tacccctcat tcagagtttt tgcacaaact tagaactgga ggctgaaagt 12420aagcaaattg acttcatctt ttagtttgat gaagatcaaa tgaatttaaa agcttgagaa 12480gatgaaaaga tctcatatgt atgaccaaga gatgatagca aatggcttca agccaataaa 12540gtcacgagag atgaaagcaa ataggttcac atgcaggata caacagaaag ttatgtatac 12600aggaacaaat gctaatagaa ctatgcattt atcatatttt agaaactgta gcccttgttc 12660atcctacccc caatgagtat tcctatcagt acacatgtag gctagcattt ttttagttgt 12720aatacttgct tagctacaga tcactggcta gctgaaaaaa acctaagtag caacagtgga 12780acagtttgag tatagctgta tgaaaaaggc tatctgaata atagcaatat ttggattcat 12840gttcatttca agaacttcac cattgggaaa actccaacgg tactaccatt tgattgctgc 12900aatgtttgta atgataaaga gatctcactt actcacttgt ggtaccatag aaacctgcaa 12960aaaagaggaa aataatataa attggtgtaa aaattaagga attcatagta caaatatgat 13020cgactttata ggaccaccct tttgttaaat ttttcagtgt caacaggaag gtgtgattca 13080ggcctttgtg gatcaagtca gcacagaacc ttggcttctt cattcggtag aatgatgaca 13140agactaccga atgatgacaa gacttttaaa aatatcctgt tgaaaatgca tttaccatgt 13200cagctgtgtt accaagacgg gtaattact 13229271515DNAArtificial SequencecDNA 27atggcggttg tggaggcgtg cgaccccgca aggatcgaca tctcgtccaa gacagcagcc 60ccctcagagc ggacgacgca gcttatctct ggtggaaggc acgtagccac gcccccactc 120cctcctcgtg ctcccgccga cgcctgcctc gcgctcctcg tcgaccgtct ctccctctcc 180tccgtcgttg catctcgcct ctcgcgcacg cacaaagatg ccccaaatgt gctgaactta 240agctatcgcg tgagaacaca actttctgct tctactttaa ttgatctcaa tggggcattt 300gggtggctag caattcacat tgataattta aaagtgaatt tcaggtgtac atttgatggc 360ctccgatatg gtgctgcctt caattctcta caatgcgcga gaatgctgct caggaggatg 420gctgaaaaac tccgagtgct gctacgctac cagagtcttc atgtgcatac ttacttcaag 480aactcaagcg agaagctcaa agaaggggaa gacatctaca tctatcaggg tgacaaatcc 540ttttgcagca tggagtgcag agaaaatttc atggtagacg agatggaagg caagggaggc 600tggaagcacc ttttttgcat gggagctgca tggagagaac ttattggtag aaaacaaact 660ttgaccagat tcttgaatgt gccaagggaa aaaggtatta attgggacct tctatttata 720caagggaaaa aggcgcggac gtacgatggc gccctcgcgc gggatatcgg gctgagtgat 780gagacgctgc tgccattggg gttccaggcg cgcctgaccg ctcccactcc taggcccggc 840atgtctctcc gacgctacct accggtgctg gtgaaacgca aacttaaaat cgtgcattgc 900aagatgttgc cttctcttta tgtctcctct actctacgcc tacgagcttt ctttgtttat 960aactttttct gtggtttcgc tttcaacaag ctcaagcggt caacagatca acttagacaa 1020aatgcagatg cagtggaaat attaaggaag accagattcc ctcatgttca tggagcaggt 1080gaaaagaagt caccagagac aattctggat catgagacta tgagaggact ttggcaacaa 1140atgttcggtt tggacttctc tgggtggttc tatgatactt tataatttct tttggcttat 1200gtgtcatttc agaagatatg aaaatagcta agcattgtca ataatattag cttccttgtt 1260tcttgttaac ctaagatgac tgccctattt cacttgtttt ttctagctgt ggtagtaggt 1320cattagagta gttcccttca atcgtagcac tcagccatgt tgtaaatggt ttacttctat 1380tgtgaatcgt gtttttcttt tttaatggtg gggttagtgg aaaatctgga cgtatccgcg 1440aatggtggag aagcacacaa aacaacaatt atatgcaaaa actaaactat gttttctttc 1500tttctatggt atatg 1515282229DNAArtificial SequencecDNA 28attgatctca atggggcatt tgggtggcta gcaattcaca ttgataattt aaaagtgaat 60ttcaggtgta catttgatgg cctccgatat ggtgctgcct tcaattctct acaatgcgcg 120agaatgctgc tcaggagggt attaatggct caacacagat gacctcctcg gagtcatgtt 180tctaattatc tacactatga ttctccttct gttgataaaa tattgtttta ttgtgctgtg 240agctaatgat aacagtgatg gtaagtaaat atggtccatg catattctca tcatagatgg 300ctgaaaaact ccgagtgctg ctacgctacc agagtcttca tgtgcatact tacttcaaga 360actcaagttg ggcaagacca aagtgaaagt gggagaatac ccgaagaact tcttgttgga 420cgaacttgga gaaaccaata ctaaaactca gtgtcaaccg cttgcaacag gcaatttagg 480tcgatgtgct cgcacgctgt gtgaccatgt ctgagcactc cccacccacc caatcgcttc 540ccacgtcatg ccgccacgtc gagaatttgt acacaactca ggttgcccat tttactctgt 600tattgaacct cgcttttctg tagacccagg tatgctagaa gtaggtaatg gtagcatgac 660cttagcagag tatcgctcac attttagcat ttgggttgtc atgaaggtaa gttttcactg 720agattggacc gatgttgtgt catcatattt tgagtaaagg gtttgaccga agaattttga 780aaaataaaag actgtagttc attgaaggtg ataactggta tgcaagtatt actattaatg 840attctccact tactgatatt acatttggat aagaggaagg agatatccta attctaatgg 900gttaactgct agcagttcct tatgtactgt tatgcttcag gcccctttat taattgcctc 960atgtggaaag agggtgtgat gatcttttgt tgtttttgta gcgagaagct caaagaaggg 1020gaagacatct acatctatca gggtgacaaa tccttttgca gcatggagtg cagagaaaat 1080ttcatggtag acgagatgga aggcaaggga ggctggaagc accttttttg catgggagct 1140gcatggagag aacttattgg tagaaaacaa actttgacca gattcttgaa tgtgccaagg 1200gaaaaaggta ttaattggga ccttctattt atacaaggga aaaaggtatt ttttatacaa 1260ttgctcttcc ttctcatgta tggcaacttc ctgttttgta gagattgagc gcccttcctc 1320actgtttgaa ctttactcat cctcatgaaa ctgtgatgct acctgtagga gcggtttgtt 1380caaggaggga atgactttta gaccctaata acttgtctac acttaagata aaaattgttg 1440tcattgctat ggttacctcc tttagtgtcg taaactcaga ataatcatag atccctttgt 1500tgcttacaca ttattcatgt tctacaagct attgagattt tgactagccg ctacatttta 1560gtgcaggttt tcataatttc ttacctttat atctacattt tagatttcct cccttttgac 1620actacatttt agccgctcac tcggaagctt tccatccctg ctggagtact cagcccataa 1680aggcttctca ttcttatttg catggcctgt aactgctgca cagcgcgcta cggctggcgt 1740agcggcagca actgctggta tgaacaggat ccgtccaccc tcattagcga cgatgaagtg 1800acacccaatt ctaggcgcgg acgtacgatg gcgccctcgc gcgggatatc gggctgagtg 1860atgagacgct gctgccattg gggttccagg cgcgcctgac cgctcccact cctaggcccg 1920gcatgtctct ccgacgctac ctaccggtgc tggtgaaacg caaacttaaa atcgtgcatt 1980gcaagatgtt gccttctctt tatgtctcct ctactctacg cctacgagct ttctttgttt 2040ataacttttt ctgtggtttc gctttcaaca agctcaagcg gtcaacagat caacttagac 2100aaaatgcaga tgcagtggaa atattaagga agaccagatt ccctcatgtt catggagcag 2160gtgaaaagaa gtcaccagag acaattctgg atcatgagac tatgagagga ctttggcaac 2220aaatgttcg 222929579DNAArtificial SequencecDNA 29aagttttcac tgagattgga ccgatgttgt gtcatcatat tttgagtaaa gggtttgacc 60gaagaatttt gaaaaataaa agactgtagt tcattgaagg tgataactgg tatgcaagta 120ttactattaa tgattctcca cttactgata ttacatttgg ataagaggaa ggagatatcc 180taattctaat gggttaactg ctagcagttc cttatgtact gttatgcttc aggccccttt 240attaattgcc tcatgtggaa agagggtgtg atgatctttt gttgtttttg tagcgagaag 300ctcaaagaag gggaagacat ctacatctat cagggtgaca aatccttttg cagcatggag 360tgcagagaaa atttcatggt agacgagatg gaaggcaagg gaggctggaa gcaccttttt 420tgcatgggag ctgcatggag agaacttatt ggtagaaaac aaactttgac cagattcttg 480aatgtgccaa gggaaaaagg tattaattgg gaccttctat ttatacaagg gaaaaaggta 540ttttttatac aattgctctt ccttctcatg tatggcaac 579302512DNAZea mays 30cccgctacct gttcaccgcg cgccagcgaa acctccgcac gcccactgcc catctgttcc 60ccgtgcgcca gcgaaacatc cgcacgcccg cggcccgcct gttccccgcg catcccgctg 120cacgacttct gctaccgcaa cggccaccca cgcacgcccg cctgttcacc gcgcatcccg 180ctgacctccc cttcacgctc gcacacgctc cgttccccca ccccaccgca atccccgacg 240ctataagagc ggtaaccaac tccatctccc tggtgccacg cattgttgag ttcttaaggt 300gcgtttcgtt gaggacttgt tcatttttgt tggtcatgta ttccatttta ctgctctacc 360attttgtgga ataaagggag gaatgttttc actagaagag ttcatcaatc ttatgttggt 420ttcttggatc agttttgctc tatggctaaa tggtcgaatt gagcctattt cattataaag 480ttagcgagcg aataattgtt cagcctcttc ctagaactca ttaccagtag aatcagttac 540taactgcttt tctttttctt ggattagaat ggctggggct atctctcacc atgcgctagc 600attttcacaa tcccactggt gcagtgcgaa gaactctaga ttcggaaaga ggacgggcaa 660tgctcgcctg gtttatctaa aaggaagatg tggttcaggc agcagaaaac tgggtttgat 720gtgggcctcg agctcgcagt cttctgtcat ggagccgacg cacctaccat ctgatggcaa 780cagcagccac accccaaaaa aatcaagtaa ttttaacgac ctcctatggt ggttatttgt 840ttttaatttg agaaaactat ccatttgaca catttaactt tgggcttctc agaatttggg 900ggcatataat aagatctgct aatctgttat ctctatgtcg ttgtaggtga aagcgctctt 960atattgattt ggcatggtga atccctgtgg aacgagaaaa atctatttcc tggctgcatc 1020gatgtacccc tgacaccgaa gggtgttgag gaggccattg aggcaggtaa aaggatatgc 1080aatatcccaa tcgatgtgat atatacttca tcactgattt gtgctcagat gaccgcaatg 1140cttgccatga tgcagcatcg acgcaagaag gtttgtgtct ttcctttgaa attccagtaa 1200tttcttctag catttgtatg aacttgccgg agaaatcatg ctttgctggt gatatatgta 1260tttatagatc ctagttatca cgcataatga gagtgaacaa gctcacaggt ggagtcagat 1320atacagtgag gagacaatga aacagtccat tcctgtcatc acagcttggc aattgaatga 1380acggatgtaa tactttctcc atactctttg atttgctaat tactccctct gtctcaaaat 1440agtattaatt ttagctcttg atttttatgt ctatattcaa atagatgatg ataaatctag 1500attctagaca caaatataaa acatatacat caagtattat atgaatctat taatttacta 1560agaccaattt taatttggga cagagggagt atacgattat aatagttgtt tgactgtgct 1620tctctttaaa tatcccttga catttctagg tatggtgagc tacaaggcct taacaagcaa 1680gaaactgtag atcgatttgg caaagaacaa gttcatgagt ggcgccgcag ttatgatatt 1740cctccgccaa atggagaaag tctagagaag tgtgctgaga gagctgttgc ttatttcaaa 1800gatcaggcac atctagcaag gccactttac actaattgaa agatacactt tttacttggg 1860ttattggtct tgctgcagta ttggtatgca tgctaaaggt tattcttgaa tcgatgaatt 1920cctctactat gggatgcaga aatgcatgtg cttagttttc tttctattgt gctagctcat 1980atcaaattta taacctgaat tttttattta tgttcgactc taaaaaacag ttttttctag 2040ctcgatttga cctatagtaa tttttccgta atagattatt ccacaacttg tggctggaaa 2100acatgtgatg gttgctgcac atgggaattc acttcgttca attataatgc atctggacaa 2160attaacttct cagaaggtaa ttcactgtcg tttttgtctt tccatcaaaa aggactcggc 2220taaacagaac atgtagcatt atgttaagtt tgggagtgag cctttcgtcc cttcaggtaa 2280taagccttga gctgtctact ggcattccca tgctttacat attcaaagag ggaaagttta 2340ttcgacgtgg gactccagta ggaccttcgg aggccagtgt ttatgcttat accagggtaa 2400gattctttcc cccacatgtt ctaccatagg acgatactcc agtttacaaa ccttatctgt 2460acagaccaaa cgatttgctg agcacattac atttcagaac aaattggcct ag 2512311005DNAArtificial SequencecDNA 31atggctgggg ctatctctca ccatgcgcta gcattttcac aatcccactg gtgcagtgcg 60aagaactcta gattcggaaa gaggacgggc aatgctcgcc tggtttatct aaaaggaaga 120tgtggttcag gcagcagaaa actgggtttg atgtgggcct cgagctcgca gtcttctgtc 180atggagccga cgcacctacc atctgatggc aacagcagcc acaccccaaa aaaatcaagt 240gaaagcgctc ttatattgat ttggcatggt

gaatccctgt ggaacgagaa aaatctattt 300cctggctgca tcgatgtacc cctgacaccg aagggtgttg aggaggccat tgaggcaggt 360aaaaggatat gcaatatccc aatcgatgtg atatatactt catcactgat ttgtgctcag 420atgaccgcaa tgcttgccat gatgcagcat cgacgcaaga agatcctagt tatcacgcat 480aatgagagtg aacaagctca caggtggagt cagatataca gtgaggagac aatgaaacag 540tccattcctg tcatcacagc ttggcaattg aatgaacgga tgtatggtga gctacaaggc 600cttaacaagc aagaaactgt agatcgattt ggcaaagaac aagttcatga gtggcgccgc 660agttatgata ttcctccgcc aaatggagaa agtctagaga agtgtgctga gagagctgtt 720gcttatttca aagatcagat tattccacaa cttgtggctg gaaaacatgt gatggttgct 780gcacatggga attcacttcg ttcaattata atgcatctgg acaaattaac ttctcagaag 840gtaataagcc ttgagctgtc tactggcatt cccatgcttt acatattcaa agagggaaag 900tttattcgac gtgggactcc agtaggacct tcggaggcca gtgtttatgc ttataccagg 960accaaacgat ttgctgagca cattacattt cagaacaaat tggcc 100532394PRTZea mays 32Met Ala Val Val Glu Ala Cys Asp Pro Ala Arg Ile Asp Ile Ser Ser1 5 10 15Lys Thr Ala Ala Pro Ser Glu Arg Thr Thr Gln Leu Ile Ser Gly Gly 20 25 30Arg His Val Ala Thr Pro Pro Leu Pro Pro Arg Ala Pro Ala Asp Ala 35 40 45Cys Leu Ala Leu Leu Val Asp Arg Leu Ser Leu Ser Ser Val Val Ala 50 55 60Ser Arg Leu Ser Arg Thr His Lys Asp Ala Pro Asn Val Leu Asn Leu65 70 75 80Ser Tyr Arg Val Arg Thr Gln Leu Ser Ala Ser Thr Leu Ile Asp Leu 85 90 95Asn Gly Ala Phe Gly Trp Leu Ala Ile His Ile Asp Asn Leu Lys Val 100 105 110Asn Phe Arg Cys Thr Phe Asp Gly Leu Arg Tyr Gly Ala Ala Phe Asn 115 120 125Ser Leu Gln Cys Ala Arg Met Leu Leu Arg Arg Met Ala Glu Lys Leu 130 135 140Arg Val Leu Leu Arg Tyr Gln Ser Leu His Val His Thr Tyr Phe Lys145 150 155 160Asn Ser Ser Glu Lys Leu Lys Glu Gly Glu Asp Ile Tyr Ile Tyr Gln 165 170 175Gly Asp Lys Ser Phe Cys Ser Met Glu Cys Arg Glu Asn Phe Met Val 180 185 190Asp Glu Met Glu Gly Lys Gly Gly Trp Lys His Leu Phe Cys Met Gly 195 200 205Ala Ala Trp Arg Glu Leu Ile Gly Arg Lys Gln Thr Leu Thr Arg Phe 210 215 220Leu Asn Val Pro Arg Glu Lys Gly Ile Asn Trp Asp Leu Leu Phe Ile225 230 235 240Gln Gly Lys Lys Ala Arg Thr Tyr Asp Gly Ala Leu Ala Arg Asp Ile 245 250 255Gly Leu Ser Asp Glu Thr Leu Leu Pro Leu Gly Phe Gln Ala Arg Leu 260 265 270Thr Ala Pro Thr Pro Arg Pro Gly Met Ser Leu Arg Arg Tyr Leu Pro 275 280 285Val Leu Val Lys Arg Lys Leu Lys Ile Val His Cys Lys Met Leu Pro 290 295 300Ser Leu Tyr Val Ser Ser Thr Leu Arg Leu Arg Ala Phe Phe Val Tyr305 310 315 320Asn Phe Phe Cys Gly Phe Ala Phe Asn Lys Leu Lys Arg Ser Thr Asp 325 330 335Gln Leu Arg Gln Asn Ala Asp Ala Val Glu Ile Leu Arg Lys Thr Arg 340 345 350Phe Pro His Val His Gly Ala Gly Glu Lys Lys Ser Pro Glu Thr Ile 355 360 365Leu Asp His Glu Thr Met Arg Gly Leu Trp Gln Gln Met Phe Gly Leu 370 375 380Asp Phe Ser Gly Trp Phe Tyr Asp Thr Leu385 39033128PRTZea mays 33Met Tyr Cys Tyr Ala Ser Gly Pro Phe Ile Asn Cys Leu Met Trp Lys1 5 10 15Glu Gly Val Met Ile Phe Cys Cys Phe Cys Ser Glu Lys Leu Lys Glu 20 25 30Gly Glu Asp Ile Tyr Ile Tyr Gln Gly Asp Lys Ser Phe Cys Ser Met 35 40 45Glu Cys Arg Glu Asn Phe Met Val Asp Glu Met Glu Gly Lys Gly Gly 50 55 60Trp Lys His Leu Phe Cys Met Gly Ala Ala Trp Arg Glu Leu Ile Gly65 70 75 80Arg Lys Gln Thr Leu Thr Arg Phe Leu Asn Val Pro Arg Glu Lys Gly 85 90 95Ile Asn Trp Asp Leu Leu Phe Ile Gln Gly Lys Lys Val Phe Phe Ile 100 105 110Gln Leu Leu Phe Leu Leu Met Tyr Gly Asn Phe Leu Phe Cys Arg Asp 115 120 12534335PRTZea mays 34Met Ala Gly Ala Ile Ser His His Ala Leu Ala Phe Ser Gln Ser His1 5 10 15Trp Cys Ser Ala Lys Asn Ser Arg Phe Gly Lys Arg Thr Gly Asn Ala 20 25 30Arg Leu Val Tyr Leu Lys Gly Arg Cys Gly Ser Gly Ser Arg Lys Leu 35 40 45Gly Leu Met Trp Ala Ser Ser Ser Gln Ser Ser Val Met Glu Pro Thr 50 55 60His Leu Pro Ser Asp Gly Asn Ser Ser His Thr Pro Lys Lys Ser Ser65 70 75 80Glu Ser Ala Leu Ile Leu Ile Trp His Gly Glu Ser Leu Trp Asn Glu 85 90 95Lys Asn Leu Phe Pro Gly Cys Ile Asp Val Pro Leu Thr Pro Lys Gly 100 105 110Val Glu Glu Ala Ile Glu Ala Gly Lys Arg Ile Cys Asn Ile Pro Ile 115 120 125Asp Val Ile Tyr Thr Ser Ser Leu Ile Cys Ala Gln Met Thr Ala Met 130 135 140Leu Ala Met Met Gln His Arg Arg Lys Lys Ile Leu Val Ile Thr His145 150 155 160Asn Glu Ser Glu Gln Ala His Arg Trp Ser Gln Ile Tyr Ser Glu Glu 165 170 175Thr Met Lys Gln Ser Ile Pro Val Ile Thr Ala Trp Gln Leu Asn Glu 180 185 190Arg Met Tyr Gly Glu Leu Gln Gly Leu Asn Lys Gln Glu Thr Val Asp 195 200 205Arg Phe Gly Lys Glu Gln Val His Glu Trp Arg Arg Ser Tyr Asp Ile 210 215 220Pro Pro Pro Asn Gly Glu Ser Leu Glu Lys Cys Ala Glu Arg Ala Val225 230 235 240Ala Tyr Phe Lys Asp Gln Ile Ile Pro Gln Leu Val Ala Gly Lys His 245 250 255Val Met Val Ala Ala His Gly Asn Ser Leu Arg Ser Ile Ile Met His 260 265 270Leu Asp Lys Leu Thr Ser Gln Lys Val Ile Ser Leu Glu Leu Ser Thr 275 280 285Gly Ile Pro Met Leu Tyr Ile Phe Lys Glu Gly Lys Phe Ile Arg Arg 290 295 300Gly Thr Pro Val Gly Pro Ser Glu Ala Ser Val Tyr Ala Tyr Thr Arg305 310 315 320Thr Lys Arg Phe Ala Glu His Ile Thr Phe Gln Asn Lys Leu Ala 325 330 33535637DNAArtificial SequencecDNA 35gctacgtgcc ttccaccaga gataagctgc gtcgtccgct ctgagggggc tgctgtcttg 60gacgagatgt cgatccttgc ggggtcgcac gcctccacaa ccgccatgcg gcaaggttca 120aagacaactg caactaacaa ttaaagcaca aggaatatac ttgcagatga tcattgagga 180gcaacaaaag cttggtggat caattaaagc ttctgaggga cattgagagt gatgacaagg 240atcttgattt tgaggttgat gatggagctg aggatgaacc taaagcatga tccgaaagat 300gccccagccg aacctgctat caagacagtt gtggccctag ctgcgccacc caaagagaca 360gaaagacagt tgtctaaaaa ggagatgaaa aaaaggaact agcagaactt gatgcagtat 420tggctgagct gggactttct ggtattcgag cagcgctgca caggatggtg agagtcatac 480atgtttccgt taaaactgcc tgatctattt tgtatgctcc ggtttagaga gagtctaaat 540aattttgagt attttgaaat tctagatgag ttgtaaattt ccaacaagac tattgtagta 600atagaggtta gatgttcacg tgtaggcatc acattgg 63736691DNAArtificial SequencecDNA 36tatagggaga gcggccgcca gatcttccgg atggctcgag tttttcagca agatgctacg 60tgccttccac cagagataag ctgcgtcgtc cgctctgagg gggctgctgt cttggacgag 120atgtcgatcc ttgcggggtc gcacgcctcc acaaccgcca tgcggcaagg ttcaaagaca 180actgcaacta acaattaaag cacaaggaat atacttgcag atgatcattg aggagcaaca 240aaagcttggt ggatcaatta aagcttctga gggacattga gagtgatgac aaggatcttg 300attttgaggt tgatgatgga gctgaggatg aacctaaagc atgatccgaa agatgcccca 360gccgaacctg ctatcaagac agttgtggcc ctagctgcgc cacccaaaga gacagaaaga 420cagttgtcta aaaaggagat gaaaaaaagg aactagcaga acttgatgca gtattggctg 480agctgggact ttctggtatt cgagcagcgc tgcacaggat ggtgagagtc atacatgttt 540ccgttaaaac tgcctgatct attttgtatg ctccggttta gagagagtct aaataatttt 600gagtattttg aaattctaga tgagttgtaa atttccaaca agactattgt agtaatagag 660gttagatgtt cacgtgtagg catcacattg g 691372146DNAArtificial SequencecDNA 37atgagaagaa tcagagacct agaggggtga gtcgggggag gggagactca cctttgtgcg 60tgcgcgagag gcgagatgca acgacggagg agagggagag acggtcgacg aggagcgcga 120ggcaggcgtc ggcgggagca cgaggaggga gtgggggcgt ggctacgtgc cttccaccag 180agataagctg cgtcgtccgc tctgaggggg ctgctgtctt ggacgagatg tcgatccttg 240cggggtcgca cgcctccaca accgccatgc ggcaaggtcg gtcctcgtgg ggccgtgcga 300caccgaggcc gagggcggtt cccaagctgc ctgcgtcgag gccctagagc ggcggtggcg 360acgacgttgg gggaggggcg gtggaggcaa agggaggaag ggaagcggtg gaggccgcgt 420cggcggtgga ggccgcgagg cattctatcg cggcaggggc agggggcggg caacgcgaga 480gagagaggaa cgtgtggggg aaggttaggg attgggtggt cgtgcggtgg agggagatag 540agtgtgggag acgcagacgc acgatgcgat atgaacggtc cagatcgatg gatggcagaa 600cggcagaacg agggaggcag actactattg cttacttaat aagtagtaga gatttctgtt 660ttttacatca ttttagttta ccgacaatcc tactatgacc agaagtccta cttagttggg 720gctacttcta caagctgata ttttggccca ttctcaatat tcacttatta actgtcacgt 780tacacctaac agtggtttct tctcatggtt gtagtaatgc ttagttgtca ataatctatt 840gttggcagtt ctcctgtttt gtcttattgc ttgtagttta tgagtccttt ttagcttcaa 900gatcaatcta aattttcctt gccaactttc ttttgcttgt ctaaataagt tatgttttca 960ggttcaaaga caactgcaac taacaattaa agcacaagga atatacttgc agatgatcat 1020tgaggagcaa caaaagcttg gtggatcaat taaagcttct gagggtcaga aactttctga 1080ttcacctcca agcttagatg actacccaga gatcatgcaa cattctccca agaaaccaag 1140gatagacgca ttatcactag attcagagcg cgatatagca cagcctaaat ttgaatccca 1200tttgatcggt ctgtgggatc acgacattgc attcccagtg gaggagttca aagcagaccc 1260tgctatgagc aagtcataag gcaaacttca ccttgacaga aaattttcag agctgaccac 1320ctgctaaaca aaaataaaaa atatgtacct tggataggga cttgttccaa gtttttctat 1380atcttgggta agcaagtgat ttggaaatgg ttatacatga caggtgctgc atgtttgtca 1440tagcaaactc tggttattgt tctacttgtt ttttgtcata atttttgcat caaaactaaa 1500ctattaagtg gaagtataat ttactcatct cgagggggat ggagcatgtt tggctgattg 1560tcattatagt tagcctctct gatagtgttc ttagatgtac ttctatatat gttgcaggac 1620attgagagtg atgacaagga tcttgatttt gaggttgatg atggagctga ggatgaacct 1680aaagcatgat ccgaaagatg ccccagccga acctgctatc aagacagttg tgggccctag 1740ctgcgccacc caaagagaca gaaagacagt tgtctaaaaa ggagatgaaa aaaaggaact 1800agcagaactt gatgcagtat tggctgagct gggactttct ggtattcgag cagcgctgca 1860caggatggtg agagtcatac atgtttccgt taaaactgcc tgatctattt tgtatgctcc 1920ggtttagaga gagtctaaat aattttgagt attttgaaat tctagatgag ttgtaaattt 1980ccaacaagac tattgtagta atagaggtta gatgttcacg tgtaggcatc acattggtgt 2040gcttttgggt tacccactta ggtggattgc ctgagtcatt catgaaatat tattgtgtat 2100catcgcaaca cagtgaattg gttgttggca aaaagaattt gagaca 214638637DNAArtificial SequencecDNA 38gctacgtgcc ttccaccaga gataagctgc gtcgtccgct ctgagggggc tgctgtcttg 60gacgagatgt cgatccttgc ggggtcgcac gcctccacaa ccgccatgcg gcaaggttca 120aagacaactg caactaacaa ttaaagcaca aggaatatac ttgcagatga tcattgagga 180gcaacaaaag cttggtggat caattaaagc ttctgaggga cattgagagt gatgacaagg 240atcttgattt tgaggttgat gatggagctg aggatgaacc taaagcatga tccgaaagat 300gccccagccg aacctgctat caagacagtt gtggccctag ctgcgccacc caaagagaca 360gaaagacagt tgtctaaaaa ggagatgaaa aaaaggaact agcagaactt gatgcagtat 420tggctgagct gggactttct ggtattcgag cagcgctgca caggatggtg agagtcatac 480atgtttccgt taaaactgcc tgatctattt tgtatgctcc ggtttagaga gagtctaaat 540agttttgagt attttgaaat tctagatgag ttgtaaattt ccaacaagac tattgtagta 600atagaggtta gatgttcacg tgtaggcatc acattgg 6373955PRTZea mays 39Met Ala Arg Val Phe Gln Gln Asp Ala Thr Cys Leu Pro Pro Glu Ile1 5 10 15Ser Cys Val Val Arg Ser Glu Gly Ala Ala Val Leu Asp Glu Met Ser 20 25 30Ile Leu Ala Gly Ser His Ala Ser Thr Thr Ala Met Arg Gln Gly Ser 35 40 45Lys Thr Thr Ala Thr Asn Asn 50 554056PRTZea mays 40Met Thr Arg Ile Leu Ile Leu Arg Leu Met Met Glu Leu Arg Met Asn1 5 10 15Leu Lys His Asp Pro Lys Asp Ala Pro Ala Glu Pro Ala Ile Lys Thr 20 25 30Val Val Ala Leu Ala Ala Pro Pro Lys Glu Thr Glu Arg Gln Leu Ser 35 40 45Lys Lys Glu Met Lys Lys Arg Asn 50 55411327DNAZea mays 41tattgttgcc tcctcctcat ctcatcacta gtcactcaac cgcaattgat tgaaaattgt 60gttcatcatc tcgttggatc gatcataatt ctttcatttc tggcctcgac aagtatcgag 120ctcattaatc catcaatcca atgtgtgttc tgtcgaaggc gacaatggtg agctacttat 180cgcggcgtcc atttaatggc tgcagcacaa aggcgatgga cgtgatcgtg gtcgacaaga 240ccatcgtgcc ggggggggag gggggtagag ggtgacggtg ctgatgatgg atggcgatgg 300tatccggggt ctcatcccgg aaaccattct tgccttcctc gaggcgaggg tgcaggatct 360ggacaggctg gaggcgaggc tcgcagacta cttcgactac atcgccagga ccggtgggct 420cgtcatcacg ctgctcactt cgcccggcaa ggacaagcgg cctctctacg ttgccaagaa 480catcaaccac ttgttcttgc atccattcac atcgccctaa tcacatcaat gtatagagga 540ctatgatgga tggatgcaag aacaatgacg ccagatggaa ttcacttttg agggaagaca 600tgtgggctgt tctccatgta gaagtggttg atgtccttgg caacgtagag agaccgcttg 660tttttgccgg tcccgatgaa catggcggtg atgatctcat cggtgctatt cctagtgatg 720tagtcgaagt agtccgcgag ccttgccttc gtcccgtcca actcctacgg catggcctcg 780aggaaggtga ggatggttcc cgagatgaga ccccagatgg cgcctctgtc caccgttatc 840actgtcaccc tcggttcaac acgacgggct catcgaccat gctcccggcc atcgccttcg 900tgctgcatgt gttgcatgga cgccttgatg agtagctcac cgttgccgcc tttgaccaag 960cacacaccca attgatcgat taatgaacta gatactcatt gaggccacga atgaaagaat 1020tatcatcaaa ccaacaaatg acggacacaa ttttcaatcg attatggatg agtgtgagta 1080gtgatgagct gaggaagatg caatgataga tcgattgtgt acatatatag gcactgcgta 1140cgtgctgccc ctttttggag tgacaaatag gaactagcgc gcgtattttt gcatacaacc 1200actactaaat aagagatata tgtaaaattt aacgcaaggg atatagggaa gagatatttg 1260tccattgcaa tgtattttga agctgtccac atatactatt tatgaagaaa cggattatgc 1320caagtat 132742714DNAZea mays 42atcattacat aacttatgct atattttccc gagtatgtcc taacatcttc cacagtgttt 60ttatgggctc cttagaagtt ccagcccagg ggcctgaaac tattaaagtt ccaactgctc 120attatgaatt tggtgccaat tttttagatc caaagttaat gctcattgga agggtgataa 180cagatggaag gcttaatgct cgcgtgaaat gtgatttgac agacaatctc acgctgaaag 240taaatgcaca gcttacccaa gaggcacatt actcacaagg aatgtttaac tttgactaca 300aggttgacgt ttctgacaag tcagacgtaa cgagggcgtc cacaccgcgg ctccgccgga 360catcgcaaca atctccccgc cccagctctc ctctccctgc gccgaggcca caatccctgc 420cgccccggct ctcctcgtcc ccaaatcttg cacgcggtcg taatccccgc cgcctcgctc 480tcctcgcccc tagatcgccg cctccactat cgctgatata ccagaccaag caggtagagc 540agaccaagat gtcgctcgag gaggccaagc tggagatggc cacgctgctg cagcagcagg 600cgagcaagtc atgcatggta ctaagtcctg catggtacta atggttgtaa tgtagtgatg 660aaatagctag attaaaataa caaaatttat gtatggctag gatcacaaat agat 71443460DNAZea mays 43ccagcccagg ggcctgaaac tattaaagtt ccaactgctc attatgaatt tggtgccaat 60tttttagatc caaagttaat gctcattgga agggtgataa cagatggaag gcttaatgct 120cgcgtgaaat gtgatttgac agacaatctc acgctgaaag taaatgcaca gcttacccaa 180gaggcacatt actcacaagg aatgtttaac tttgactaca aggacgtaac gagggcgtcc 240acaccgcggc tccgccggac atcgcaacaa tctccccgcc ccagctctcc tctccctgcg 300ccgaggccac aatccctgcc gccccggctc tcctcgtccc caaatcttgc acgcggtcgt 360aatccccgcc gcctcgctct cctcgcccct agatcgccgc ctccactatc gctgatatac 420cagaccaagc aggtagagca gaccaagatg tcgctcgagg 46044192PRTZea mays 44Met Gly Ser Leu Glu Val Pro Ala Gln Gly Pro Glu Thr Ile Lys Val1 5 10 15Pro Thr Ala His Tyr Glu Phe Gly Ala Asn Phe Leu Asp Pro Lys Leu 20 25 30Met Leu Ile Gly Arg Val Ile Thr Asp Gly Arg Leu Asn Ala Arg Val 35 40 45Lys Cys Asp Leu Thr Asp Asn Leu Thr Leu Lys Val Asn Ala Gln Leu 50 55 60Thr Gln Glu Ala His Tyr Ser Gln Gly Met Phe Asn Phe Asp Tyr Lys65 70 75 80Val Asp Val Ser Asp Lys Ser Asp Val Thr Arg Ala Ser Thr Pro Arg 85 90 95Leu Arg Arg Thr Ser Gln Gln Ser Pro Arg Pro Ser Ser Pro Leu Pro 100 105 110Ala Pro Arg Pro Gln Ser Leu Pro Pro Arg Leu Ser Ser Ser Pro Asn 115 120 125Leu Ala Arg Gly Arg Asn Pro Arg Arg Leu Ala Leu Leu Ala Pro Arg 130 135 140Ser Pro Pro Pro Leu Ser Leu Ile Tyr Gln Thr Lys Gln Val Glu Gln145 150 155 160Thr Lys Met Ser Leu Glu Glu Ala Lys Leu Glu Met Ala Thr Leu Leu 165 170 175Gln Gln Gln Ala Ser Lys Ser Cys Met Val Leu Ser Pro Ala Trp Tyr 180 185 19045127PRTZea mays 45Met Leu Ile Gly Arg Val Ile Thr Asp Gly Arg Leu Asn Ala Arg Val1 5 10 15Lys Cys Asp Leu Thr Asp Asn Leu Thr Leu Lys Val Asn Ala Gln Leu 20 25 30Thr Gln Glu Ala His Tyr Ser

Gln Gly Met Phe Asn Phe Asp Tyr Lys 35 40 45Asp Val Thr Arg Ala Ser Thr Pro Arg Leu Arg Arg Thr Ser Gln Gln 50 55 60Ser Pro Arg Pro Ser Ser Pro Leu Pro Ala Pro Arg Pro Gln Ser Leu65 70 75 80Pro Pro Arg Leu Ser Ser Ser Pro Asn Leu Ala Arg Gly Arg Asn Pro 85 90 95Arg Arg Leu Ala Leu Leu Ala Pro Arg Ser Pro Pro Pro Leu Ser Leu 100 105 110Ile Tyr Gln Thr Lys Gln Val Glu Gln Thr Lys Met Ser Leu Glu 115 120 125462979DNAHelianthus annuus 46atgaataaca tcaatcttgt aatagtttcg cttgtaatcg cgattgtagc catccaaccc 60cttgcgcaag agcaaaccga tgtaggtgag gcaaatttcg tcactgttct tagcatcgat 120ggtgggggtg ttcgtggcat tgttcccgcc accttgcttg cttttcttga atccaaaatt 180caggtactcg aacttaaaat gcacatgtgc atcatattac aagctgtaac ttattattga 240aatgtgccgt ctcttcggat aggaaataga tgggccagat gcacgaattg cggattattt 300tgatgtaata gccggaacaa gcacaggagg gctgatgaca actatgcttg cagctcctaa 360tgagaaaaat cgtcccatgt tcgccgcaaa agacattacc aacttctact ttcaacattc 420gcctaggatc ttccctaaaa tagggtaaac tctaactagt ttccggatct ataagatcat 480cattaaatac aagtttcatt ttctttttcg aatcaaatac agacacacat ttgatgaggc 540gcaaccttat ccttctcaaa acgaagcctg cgaaatgggg tattctccta caaagacttt 600tgtaattcat gttctagtgg gtgtttggat gtgcgtttta aaactgatta ttatttacat 660gtagtttctg aagaaaataa aacagttatt caaacacttt ttgttaataa ttctactaga 720aaaaaaaaat ccttgtcagg aaattaatta aaaaaaagtt accatctatt aaagttcttt 780cttactaatc aaaagttttt aaattttatt atcatgttat tataactaaa catacacatc 840caaacactat ctcataccac atgattacac aagtctatta tttgaatatg ctaacttagt 900attttcatat aataagtttt taaaacgcca catccaaaac ccttgattct tattttacat 960tgtgtagcta aaacagtgtt tatacataaa aacaatcagt tatataaatc aaagcattat 1020ttaactaaag taagctcggt tcaaactcga taagagaatt aatatatacg agtcgagttc 1080ctgttgacca gatttcgcta gtgttaagtt tcgagttcaa aattgtatat gaacttgaac 1140ctgtgtatca tcatacttga catttaaacc ataatgttgt tgaataaata aagtgatttt 1200attttgtagt cggaccaaat tcatgaattc ggtagtaacc gtacttggtg aggccaccgg 1260accaaagtat gatggtaaat atcttcgagc catggcaaag atgatgttaa aaaacctcac 1320tattaaagat acgttgacga atgttgtcat acctgctttc gacattaggc ggcttcaacc 1380tgttatcttc tcctctgctc aagtaattaa actcgttttt tatatttata gcagttctct 1440atttaaaatt gattgtgtat cataaaatgg tttctgtttg atacgtttag ggaaaagagg 1500tcgcgtggaa aaatgctttg ctagcagacg tatgcattag taccgcggcg gcaccaacat 1560ttttcccgcc atactatttt gagactagag acgtcgatgg aaccaagcac acttttgatc 1620taatcgatgg cggggtagct gcaaacaatc cggtagttac atttcaacaa tattgagttt 1680gcattttatt tttaggacaa gtagtcacat tagggtgaag ggtgtgttca agctcatccc 1740gaaggtggga gcggtgttcc cactcgtacc tcatggcgca ttttcttctt tgttgcagct 1800ccaattttaa aaagccaccc cgccttttcc attccatagc gccacgtcaa ctgggaaatg 1860gtgttcccac tggtattgga gatttggagg cgctacgcca cctctgtcat cccgaagcca 1920caccctccac ccttaggagt gttatcggtt cagttttcgg tttatacagt ttaaaggttt 1980ttttttggtt gaaaccaaaa accgaactga actgaacgga attcgggtag ttcacaactg 2040aaccaaaaac tgaatccata ttcggttttc tgtttgaccg aataatcatt atttgttatt 2100tgattgtgcg aggttaaata agattgagca aaaattgtaa ttaattggat ttggctaaat 2160gacaatttaa ataaccgtta tttgttatcc gattcaaacc gaacacccaa ctcgatttgg 2220tttaataccg aaccgacaac cggatacgta attcagttcg ctattaaaag gttcgggttt 2280ggtagggttt aacccatttg aaatcgaatc ttcagagaaa gggttcgcta atgtcaccca 2340cataaccaaa gaaatcttgt ttaaatgtta atggcagaca catttggcta tcacacatat 2400aaccaaagaa gcggtgatgg ggaaatacag gttctctggc ccggaggttt tcgacggaag 2460acggatgctt gtgctttcac tcggcactgg tacgcagacg tacaatgact tatatactgc 2520acaaaaggct gcaaaatggg ggttgcttag ttggatcttt accaatggta ctgcgccaat 2580cctccgcatt tttggtgatg ccatgtcaga tatggtcgac atccatgtgt caactatatt 2640ccaatcgttg caagtcgaaa aaaactatct gcgtattcag gtataactaa gaacatataa 2700atataatgtt gtataggtta catgtttagt aacaaggagt ttttttatgg gcaggaagat 2760aacttgaaag gggaagcaac tgcaatggat atttcatcac ctgagaacat gagggcgcta 2820gaggacattg gcaagaaatt gttgaagaaa ccgttgtcga gattggatgt ggagacaggc 2880aagcttgaac cagttaaagg agaaggtacg aatgctgatg cattagcacg tttcgccact 2940ttgctttgtg ccgaacgaaa gcgccgcaat ccagcttaa 2979471302DNAArtificial SequencecDNA 47atgaataaca tcaatcttgt aatagtttcg cttgtaatcg cgattgtagc catccaaccc 60cttgcgcaag agcaaaccga tgtaggtgag gcaaatttcg tcactgttct tagcatcgat 120ggtgggggtg ttcgtggcat tgttcccgcc accttgcttg cttttcttga atccaaaatt 180caggaaatag atgggccaga tgcacgaatt gcggattatt ttgatgtaat agccggaaca 240agcacaggag ggctgatgac aactatgctt gcagctccta atgagaaaaa tcgtcccatg 300ttcgccgcaa aagacattac caacttctac tttcaacatt cgcctaggat cttccctaaa 360ataggacaca catttgatga ggcgcaacct tatccttctc aaaacgaagc ctgcgaaatg 420ggtcggacca aattcatgaa ttcggtagta accgtacttg gtgaggccac cggaccaaag 480tatgatggta aatatcttcg agccatggca aagatgatgt taaaaaacct cactattaaa 540gatacgttga cgaatattgt catacctgct ttcgacatca ggcggcttca acctgttatc 600ttctcctctg ctcaaggaaa agaggtcgcg tggaaaaatg ctttgctagc agacgtatgc 660attagtaccg cggcggcacc aacgtttttc ccgccatact attttgagac tagagatgtc 720gatggaacca agcacacttt tgatctaatc gatggcgggg tagctgcaaa caatccgaca 780catttggcta tcacacatat aaccaaagaa gcggtgatgg ggaaatacag gttctctggc 840ccggaggttt tcgacggcag acggatgctt gtgctttcac tcggcactgg tacgcagacg 900tacaatgact tatacactgc acaaaaggct gcaaaatggg ggttgcttag ttggatcttt 960accaatggta ctgcgccaat cctccgcatt tttggtgatg ccatgtcaga tatggtcgac 1020atccatgtgt caactatatt ccaatcgttg caagtcgaaa aaaactatct gcgtattcag 1080gaagataact tgaaagggga agcaactgca atggatattt catcacccga gaacatgagg 1140gcgctagagg acattggcaa gaaattgttg aagaaaccgt tgtcgagatt ggatgtggag 1200acaggcaagc ttgaaccagt taaaggagaa ggtacgaatg ctgatgcatt agcacgtttc 1260gccactttgc tttgtgccga acgaaagcgc cgcaatccag ct 130248433PRTHelianthus annuus 48Met Asn Asn Ile Asn Leu Val Ile Val Ser Leu Val Ile Ala Ile Val1 5 10 15Ala Ile Gln Pro Leu Ala Gln Glu Gln Thr Asp Val Gly Glu Ala Asn 20 25 30Phe Val Thr Val Leu Ser Ile Asp Gly Gly Gly Val Arg Gly Ile Val 35 40 45Pro Ala Thr Leu Leu Ala Phe Leu Glu Ser Lys Ile Gln Glu Ile Asp 50 55 60Gly Pro Asp Ala Arg Ile Ala Asp Tyr Phe Asp Val Ile Ala Gly Thr65 70 75 80Ser Thr Gly Gly Leu Met Thr Thr Met Leu Ala Ala Pro Asn Glu Lys 85 90 95Asn Arg Pro Met Phe Ala Ala Lys Asp Ile Thr Asn Phe Tyr Phe Gln 100 105 110His Ser Pro Arg Ile Phe Pro Lys Ile Gly His Thr Phe Asp Glu Ala 115 120 125Gln Pro Tyr Pro Ser Gln Asn Glu Ala Cys Glu Met Gly Arg Thr Lys 130 135 140Phe Met Asn Ser Val Val Thr Val Leu Gly Glu Ala Thr Gly Pro Lys145 150 155 160Tyr Asp Gly Lys Tyr Leu Arg Ala Met Ala Lys Met Met Leu Lys Asn 165 170 175Leu Thr Ile Lys Asp Thr Leu Thr Asn Ile Val Ile Pro Ala Phe Asp 180 185 190Ile Arg Arg Leu Gln Pro Val Ile Phe Ser Ser Ala Gln Gly Lys Glu 195 200 205Val Ala Trp Lys Asn Ala Leu Leu Ala Asp Val Cys Ile Ser Thr Ala 210 215 220Ala Ala Pro Thr Phe Phe Pro Pro Tyr Tyr Phe Glu Thr Arg Asp Val225 230 235 240Asp Gly Thr Lys His Thr Phe Asp Leu Ile Asp Gly Gly Val Ala Ala 245 250 255Asn Asn Pro Thr His Leu Ala Ile Thr His Ile Thr Lys Glu Ala Val 260 265 270Met Gly Lys Tyr Arg Phe Ser Gly Pro Glu Val Phe Asp Gly Arg Arg 275 280 285Met Leu Val Leu Ser Leu Gly Thr Gly Thr Gln Thr Tyr Asn Asp Leu 290 295 300Tyr Thr Ala Gln Lys Ala Ala Lys Trp Gly Leu Leu Ser Trp Ile Phe305 310 315 320Thr Asn Gly Thr Ala Pro Ile Leu Arg Ile Phe Gly Asp Ala Met Ser 325 330 335Asp Met Val Asp Ile His Val Ser Thr Ile Phe Gln Ser Leu Gln Val 340 345 350Glu Lys Asn Tyr Leu Arg Ile Gln Glu Asp Asn Leu Lys Gly Glu Ala 355 360 365Thr Ala Met Asp Ile Ser Ser Pro Glu Asn Met Arg Ala Leu Glu Asp 370 375 380Ile Gly Lys Lys Leu Leu Lys Lys Pro Leu Ser Arg Leu Asp Val Glu385 390 395 400Thr Gly Lys Leu Glu Pro Val Lys Gly Glu Gly Thr Asn Ala Asp Ala 405 410 415Leu Ala Arg Phe Ala Thr Leu Leu Cys Ala Glu Arg Lys Arg Arg Asn 420 425 430Pro491795DNAArtificial SequenceTILLING mutant D74N 49agttcatcac taatcacact tattgtgccc tcgacgagta tctatagcta gctcattaat 60cgattcgggg gtgtgttgtc gaaggcggca atggcgagct actcgtcgcg gcgtccatgc 120aatacctgta gcacgaaggc gatggccggg agcgtggtcg gcgagcccgt cgtgctgggg 180cagagggtga cggtgctgac ggtggacggc ggcggcgtcc ggggtctcat cccgggaacc 240atcctcgcct tcctggaggc caggctgcag gagctggacg gaccggaggc gaggctggcg 300gactacttca actacatcgc cggaaccagc accggcggtc tcatcaccgc catgctcacc 360gcgcccggca aggacaagcg gcctctctac gctgccaagg acatcaacca cttttacatg 420cagaactgcc cgcgcatctt tcctcagaag tgagtccgat gctgccgcca ttgttcttgc 480atccatccag catcgtacgt acgtcctcta tacatctgcg gatcatcatg tgcgcatgtt 540tgtggcatgc atgcatgcat gtgagcagga gcaggcttgc ggccgccatg tccgcgctga 600ggaagccaaa gtacaacggc aagtgcatgc gcagcctgat taggagcatc ctcggcgaga 660cgagggtaag cgagacgctg accaacgtca tcatccctgc cttcgacatc aggctgctgc 720agcctatcat cttctctacc tacgacgtac gtacgtcgtc acgaatgatt catctgtacg 780tcgtcgcatg cgaatggctg cctacgtacg ccgtgcgcta acatactcag ctctttccta 840tctgctgcgc caatttgcag gccaagagca cgcctctgaa gaacgctctg ctctcggacg 900tgtgcattgg cacgtccgcc gcgccgacct acctcccggc gcactacttc cagactgaag 960acgccaacgg caaggagcgc gaatacaacc tcatcgacgg cggtgtggcg gccaacaacc 1020cggtaactga ctagctaact ggaaaacgga cgcacagact ccatgtccat ggcggcccac 1080aaggtcgatg ctaattgttg cttatgtatg tcgcccgatt gcacatgcgt agacgatggt 1140tgcgatgacg cagatcacca aaaagatgct tgccagcaag gacaaggccg aggagctgta 1200cccagtgaag ccgtcgaact gccgcaggtt cctggtgctg tccatcggga cggggtcgac 1260gtccgagcag ggcctctaca cggcgcggca gtgctcccgg tggggtatct gccggtggct 1320ccgcaacaac ggcatggccc ccatcatcga catcttcatg gcggccagct cggacctggt 1380ggacatccac gtcgccgcga tgttccagtc gctccacagc gacggcgact acctgcgcat 1440ccaggacaac tcgctccgtg gcgccgcggc caccgtggac gcggcgacgc cggagaacat 1500gcggacgctc gtcgggatcg gggagcggat gctggcacag agggtgtcca gggtcaacgt 1560ggagacaggg aggtacgaac cggtgactgg cgaaggaagc aatgccgatg ccctcggtgg 1620gctcgctagg cagctctccg aggagaggag aacaaggctc gcgcgccgcg tctctgccat 1680caacccaaga ggctctagat gtgcgtcgta cgatatctaa gacaagtggc tttactgtca 1740gtcacatgct tgtaaataag tagactttat tttaataaaa cataaaaata tatat 1795501284DNAArtificial SequenceTILLING mutant D74N 50atggcgagct actcgtcgcg gcgtccatgc aatacctgta gcacgaaggc gatggccggg 60agcgtggtcg gcgagcccgt cgtgctgggg cagagggtga cggtgctgac ggtggacggc 120ggcggcgtcc ggggtctcat cccgggaacc atcctcgcct tcctggaggc caggctgcag 180gagctggacg gaccggaggc gaggctggcg gactacttca actacatcgc cggaaccagc 240accggcggtc tcatcaccgc catgctcacc gcgcccggca aggacaagcg gcctctctac 300gctgccaagg acatcaacca cttttacatg cagaactgcc cgcgcatctt tcctcagaag 360agcaggcttg cggccgccat gtccgcgctg aggaagccaa agtacaacgg caagtgcatg 420cgcagcctga ttaggagcat cctcggcgag acgagggtaa gcgagacgct gaccaacgtc 480atcatccctg ccttcgacat caggctgctg cagcctatca tcttctctac ctacgacgcc 540aagagcacgc ctctgaagaa cgctctgctc tcggacgtgt gcattggcac gtccgccgcg 600ccgacctacc tcccggcgca ctacttccag actgaagacg ccaacggcaa ggagcgcgaa 660tacaacctca tcgacggcgg tgtggcggcc aacaacccga cgatggttgc gatgacgcag 720atcaccaaaa agatgcttgc cagcaaggac aaggccgagg agctgtaccc agtgaagccg 780tcgaactgcc gcaggttcct ggtgctgtcc atcgggacgg ggtcgacgtc cgagcagggc 840ctctacacgg cgcggcagtg ctcccggtgg ggtatctgcc ggtggctccg caacaacggc 900atggccccca tcatcgacat cttcatggcg gccagctcgg acctggtgga catccacgtc 960gccgcgatgt tccagtcgct ccacagcgac ggcgactacc tgcgcatcca ggacaactcg 1020ctccgtggcg ccgcggccac cgtggacgcg gcgacgccgg agaacatgcg gacgctcgtc 1080gggatcgggg agcggatgct ggcacagagg gtgtccaggg tcaacgtgga gacagggagg 1140tacgaaccgg tgactggcga aggaagcaat gccgatgccc tcggtgggct cgctaggcag 1200ctctccgagg agaggagaac aaggctcgcg cgccgcgtct ctgccatcaa cccaagaggc 1260tctagatgtg cgtcgtacga tatc 128451428PRTArtificial SequenceTILLING mutant D74N 51Met Ala Ser Tyr Ser Ser Arg Arg Pro Cys Asn Thr Cys Ser Thr Lys1 5 10 15Ala Met Ala Gly Ser Val Val Gly Glu Pro Val Val Leu Gly Gln Arg 20 25 30Val Thr Val Leu Thr Val Asp Gly Gly Gly Val Arg Gly Leu Ile Pro 35 40 45Gly Thr Ile Leu Ala Phe Leu Glu Ala Arg Leu Gln Glu Leu Asp Gly 50 55 60Pro Glu Ala Arg Leu Ala Asp Tyr Phe Asn Tyr Ile Ala Gly Thr Ser65 70 75 80Thr Gly Gly Leu Ile Thr Ala Met Leu Thr Ala Pro Gly Lys Asp Lys 85 90 95Arg Pro Leu Tyr Ala Ala Lys Asp Ile Asn His Phe Tyr Met Gln Asn 100 105 110Cys Pro Arg Ile Phe Pro Gln Lys Ser Arg Leu Ala Ala Ala Met Ser 115 120 125Ala Leu Arg Lys Pro Lys Tyr Asn Gly Lys Cys Met Arg Ser Leu Ile 130 135 140Arg Ser Ile Leu Gly Glu Thr Arg Val Ser Glu Thr Leu Thr Asn Val145 150 155 160Ile Ile Pro Ala Phe Asp Ile Arg Leu Leu Gln Pro Ile Ile Phe Ser 165 170 175Thr Tyr Asp Ala Lys Ser Thr Pro Leu Lys Asn Ala Leu Leu Ser Asp 180 185 190Val Cys Ile Gly Thr Ser Ala Ala Pro Thr Tyr Leu Pro Ala His Tyr 195 200 205Phe Gln Thr Glu Asp Ala Asn Gly Lys Glu Arg Glu Tyr Asn Leu Ile 210 215 220Asp Gly Gly Val Ala Ala Asn Asn Pro Thr Met Val Ala Met Thr Gln225 230 235 240Ile Thr Lys Lys Met Leu Ala Ser Lys Asp Lys Ala Glu Glu Leu Tyr 245 250 255Pro Val Lys Pro Ser Asn Cys Arg Arg Phe Leu Val Leu Ser Ile Gly 260 265 270Thr Gly Ser Thr Ser Glu Gln Gly Leu Tyr Thr Ala Arg Gln Cys Ser 275 280 285Arg Trp Gly Ile Cys Arg Trp Leu Arg Asn Asn Gly Met Ala Pro Ile 290 295 300Ile Asp Ile Phe Met Ala Ala Ser Ser Asp Leu Val Asp Ile His Val305 310 315 320Ala Ala Met Phe Gln Ser Leu His Ser Asp Gly Asp Tyr Leu Arg Ile 325 330 335Gln Asp Asn Ser Leu Arg Gly Ala Ala Ala Thr Val Asp Ala Ala Thr 340 345 350Pro Glu Asn Met Arg Thr Leu Val Gly Ile Gly Glu Arg Met Leu Ala 355 360 365Gln Arg Val Ser Arg Val Asn Val Glu Thr Gly Arg Tyr Glu Pro Val 370 375 380Thr Gly Glu Gly Ser Asn Ala Asp Ala Leu Gly Gly Leu Ala Arg Gln385 390 395 400Leu Ser Glu Glu Arg Arg Thr Arg Leu Ala Arg Arg Val Ser Ala Ile 405 410 415Asn Pro Arg Gly Ser Arg Cys Ala Ser Tyr Asp Ile 420 425521795DNAArtificial SequenceTILLING mutant G78R 52agttcatcac taatcacact tattgtgccc tcgacgagta tctatagcta gctcattaat 60cgattcgggg gtgtgttgtc gaaggcggca atggcgagct actcgtcgcg gcgtccatgc 120aatacctgta gcacgaaggc gatggccggg agcgtggtcg gcgagcccgt cgtgctgggg 180cagagggtga cggtgctgac ggtggacggc ggcggcgtcc ggggtctcat cccgggaacc 240atcctcgcct tcctggaggc caggctgcag gagctggacg gaccggaggc gaggctggcg 300gactacttcg actacatcgc cagaaccagc accggcggtc tcatcaccgc catgctcacc 360gcgcccggca aggacaagcg gcctctctac gctgccaagg acatcaacca cttttacatg 420cagaactgcc cgcgcatctt tcctcagaag tgagtccgat gctgccgcca ttgttcttgc 480atccatccag catcgtacgt acgtcctcta tacatctgcg gatcatcatg tgcgcatgtt 540tgtggcatgc atgcatgcat gtgagcagga gcaggcttgc ggccgccatg tccgcgctga 600ggaagccaaa gtacaacggc aagtgcatgc gcagcctgat taggagcatc ctcggcgaga 660cgagggtaag cgagacgctg accaacgtca tcatccctgc cttcgacatc aggctgctgc 720agcctatcat cttctctacc tacgacgtac gtacgtcgtc acgaatgatt catctgtacg 780tcgtcgcatg cgaatggctg cctacgtacg ccgtgcgcta acatactcag ctctttccta 840tctgctgcgc caatttgcag gccaagagca cgcctctgaa gaacgctctg ctctcggacg 900tgtgcattgg cacgtccgcc gcgccgacct acctcccggc gcactacttc cagactgaag 960acgccaacgg caaggagcgc gaatacaacc tcatcgacgg cggtgtggcg gccaacaacc 1020cggtaactga ctagctaact ggaaaacgga cgcacagact ccatgtccat ggcggcccac 1080aaggtcgatg ctaattgttg cttatgtatg tcgcccgatt gcacatgcgt agacgatggt 1140tgcgatgacg cagatcacca aaaagatgct tgccagcaag gacaaggccg aggagctgta 1200cccagtgaag ccgtcgaact gccgcaggtt cctggtgctg tccatcggga cggggtcgac 1260gtccgagcag ggcctctaca cggcgcggca gtgctcccgg tggggtatct gccggtggct 1320ccgcaacaac ggcatggccc ccatcatcga catcttcatg gcggccagct cggacctggt 1380ggacatccac gtcgccgcga tgttccagtc gctccacagc gacggcgact acctgcgcat 1440ccaggacaac tcgctccgtg gcgccgcggc caccgtggac gcggcgacgc cggagaacat 1500gcggacgctc gtcgggatcg gggagcggat gctggcacag agggtgtcca gggtcaacgt 1560ggagacaggg

aggtacgaac cggtgactgg cgaaggaagc aatgccgatg ccctcggtgg 1620gctcgctagg cagctctccg aggagaggag aacaaggctc gcgcgccgcg tctctgccat 1680caacccaaga ggctctagat gtgcgtcgta cgatatctaa gacaagtggc tttactgtca 1740gtcacatgct tgtaaataag tagactttat tttaataaaa cataaaaata tatat 1795531284DNAArtificial SequenceTILLING mutant G78R 53atggcgagct actcgtcgcg gcgtccatgc aatacctgta gcacgaaggc gatggccggg 60agcgtggtcg gcgagcccgt cgtgctgggg cagagggtga cggtgctgac ggtggacggc 120ggcggcgtcc ggggtctcat cccgggaacc atcctcgcct tcctggaggc caggctgcag 180gagctggacg gaccggaggc gaggctggcg gactacttcg actacatcgc cagaaccagc 240accggcggtc tcatcaccgc catgctcacc gcgcccggca aggacaagcg gcctctctac 300gctgccaagg acatcaacca cttttacatg cagaactgcc cgcgcatctt tcctcagaag 360agcaggcttg cggccgccat gtccgcgctg aggaagccaa agtacaacgg caagtgcatg 420cgcagcctga ttaggagcat cctcggcgag acgagggtaa gcgagacgct gaccaacgtc 480atcatccctg ccttcgacat caggctgctg cagcctatca tcttctctac ctacgacgcc 540aagagcacgc ctctgaagaa cgctctgctc tcggacgtgt gcattggcac gtccgccgcg 600ccgacctacc tcccggcgca ctacttccag actgaagacg ccaacggcaa ggagcgcgaa 660tacaacctca tcgacggcgg tgtggcggcc aacaacccga cgatggttgc gatgacgcag 720atcaccaaaa agatgcttgc cagcaaggac aaggccgagg agctgtaccc agtgaagccg 780tcgaactgcc gcaggttcct ggtgctgtcc atcgggacgg ggtcgacgtc cgagcagggc 840ctctacacgg cgcggcagtg ctcccggtgg ggtatctgcc ggtggctccg caacaacggc 900atggccccca tcatcgacat cttcatggcg gccagctcgg acctggtgga catccacgtc 960gccgcgatgt tccagtcgct ccacagcgac ggcgactacc tgcgcatcca ggacaactcg 1020ctccgtggcg ccgcggccac cgtggacgcg gcgacgccgg agaacatgcg gacgctcgtc 1080gggatcgggg agcggatgct ggcacagagg gtgtccaggg tcaacgtgga gacagggagg 1140tacgaaccgg tgactggcga aggaagcaat gccgatgccc tcggtgggct cgctaggcag 1200ctctccgagg agaggagaac aaggctcgcg cgccgcgtct ctgccatcaa cccaagaggc 1260tctagatgtg cgtcgtacga tatc 128454428PRTArtificial SequenceTILLING mutant G78R 54Met Ala Ser Tyr Ser Ser Arg Arg Pro Cys Asn Thr Cys Ser Thr Lys1 5 10 15Ala Met Ala Gly Ser Val Val Gly Glu Pro Val Val Leu Gly Gln Arg 20 25 30Val Thr Val Leu Thr Val Asp Gly Gly Gly Val Arg Gly Leu Ile Pro 35 40 45Gly Thr Ile Leu Ala Phe Leu Glu Ala Arg Leu Gln Glu Leu Asp Gly 50 55 60Pro Glu Ala Arg Leu Ala Asp Tyr Phe Asp Tyr Ile Ala Arg Thr Ser65 70 75 80Thr Gly Gly Leu Ile Thr Ala Met Leu Thr Ala Pro Gly Lys Asp Lys 85 90 95Arg Pro Leu Tyr Ala Ala Lys Asp Ile Asn His Phe Tyr Met Gln Asn 100 105 110Cys Pro Arg Ile Phe Pro Gln Lys Ser Arg Leu Ala Ala Ala Met Ser 115 120 125Ala Leu Arg Lys Pro Lys Tyr Asn Gly Lys Cys Met Arg Ser Leu Ile 130 135 140Arg Ser Ile Leu Gly Glu Thr Arg Val Ser Glu Thr Leu Thr Asn Val145 150 155 160Ile Ile Pro Ala Phe Asp Ile Arg Leu Leu Gln Pro Ile Ile Phe Ser 165 170 175Thr Tyr Asp Ala Lys Ser Thr Pro Leu Lys Asn Ala Leu Leu Ser Asp 180 185 190Val Cys Ile Gly Thr Ser Ala Ala Pro Thr Tyr Leu Pro Ala His Tyr 195 200 205Phe Gln Thr Glu Asp Ala Asn Gly Lys Glu Arg Glu Tyr Asn Leu Ile 210 215 220Asp Gly Gly Val Ala Ala Asn Asn Pro Thr Met Val Ala Met Thr Gln225 230 235 240Ile Thr Lys Lys Met Leu Ala Ser Lys Asp Lys Ala Glu Glu Leu Tyr 245 250 255Pro Val Lys Pro Ser Asn Cys Arg Arg Phe Leu Val Leu Ser Ile Gly 260 265 270Thr Gly Ser Thr Ser Glu Gln Gly Leu Tyr Thr Ala Arg Gln Cys Ser 275 280 285Arg Trp Gly Ile Cys Arg Trp Leu Arg Asn Asn Gly Met Ala Pro Ile 290 295 300Ile Asp Ile Phe Met Ala Ala Ser Ser Asp Leu Val Asp Ile His Val305 310 315 320Ala Ala Met Phe Gln Ser Leu His Ser Asp Gly Asp Tyr Leu Arg Ile 325 330 335Gln Asp Asn Ser Leu Arg Gly Ala Ala Ala Thr Val Asp Ala Ala Thr 340 345 350Pro Glu Asn Met Arg Thr Leu Val Gly Ile Gly Glu Arg Met Leu Ala 355 360 365Gln Arg Val Ser Arg Val Asn Val Glu Thr Gly Arg Tyr Glu Pro Val 370 375 380Thr Gly Glu Gly Ser Asn Ala Asp Ala Leu Gly Gly Leu Ala Arg Gln385 390 395 400Leu Ser Glu Glu Arg Arg Thr Arg Leu Ala Arg Arg Val Ser Ala Ile 405 410 415Asn Pro Arg Gly Ser Arg Cys Ala Ser Tyr Asp Ile 420 4255513516DNAZea mays 55tcttgctata tatgagatga caaaattttc caaagaagag agaagccggc agaacccatc 60ctgtttcaaa tctcttctac tacttaagtt tctaacgtag gcgtcgacaa aacggattgg 120tgcacggttc tgccgatgtc tcccacacac gcgcatggaa ggaggcaggc acccttcccc 180gccgccccgg atctcgcgcc agccccagcc ctaccccgcc tgcccttcca ttcttcccca 240gccgcccccc ggtcaacgtc acgaacccgg gcctcgtgcc gttcgccgtg gccacgcggt 300tcgacgagcg ggtcacggag ctgctgagcg cgctcgctga cgcggcggcg gggcgaccag 360gcaggtgggc catcggcgaa gcgccatggt cgtcgtcggg gggcaggaac caggcggtgt 420acgcgcgccg cgcgcccggc tcttcatcgc ctccacccgc tccagcgtct ccaccacctc 480cttcatcgag ggccgactgc gaggctcgcc ggccaggcag ccgagcgtca gttgcgccgc 540ttggaacgcc tgcttttgtt gatcgtttgt tttggtctga tttcggtggg tctatccgca 600gagaggaaga agcagaagct ctccgagatc caatccggcg ttgaggaagc tgaatcgctg 660gtaaatagat gccgcgacac gttctggttt ggggatcccc ttggctaaca ggacatacga 720catttgggga atgggtagaa aagcagagat tagggatttt tcgtttccgt cggtgcagtt 780ttggtgttcc aacggagttg cgagatgttt atgtgcctta gtcttcaatt tgggggttgg 840gggaaaagta attttatgtt tttgttttgt gtctgcagat tcagaaaatg gacctggagg 900caaggagcct acagcctagc attaaggcta gtttgcttgc aaagctgagg gagtataaat 960ctgacctcaa caacgtcaag agtgagctca agaggatatc tgcgcccaat gccaggcagg 1020ctacccggga ggagctcctg gagtctggaa tggctgatac tctcgcagtg agctaatgat 1080aggacttgac tgtgtctacg agactgctcc taacaataaa ctgaagaaag caaaagaaat 1140cattcaacgt attcgccgaa gagaactcta caaggtagta tgatgcttta attgctcata 1200tacaagtgtc attttgtcat gtcattacac atggttagga tacataggag attctgtttt 1260ttaacacata gttgtcccat gtccatgaat tcatttgaat taatttactc ttcgcaatct 1320tatacattaa aatcgtgtta cctattacat cacaacttca tgagagcatg cttgttctgt 1380gtagatatgg tagtcctggg aggaagagcc atcgccgcta tgtatggcag aaccacccgc 1440gaaagcaacc tctctggtct cgcatagcca gagcaggagc agctcgcttg cgcggccgca 1500gcgctggcgg tcggccccgc gtacgagcgc ctgcaggtag gccagcttct gctgcaatgc 1560ccgaatctcg gcgtccacgc gcagcagcgt cgtcgcctcc tcctccgtca gctcacccag 1620cttggccagc acccccgtca cccccgcgtc cgccatggct gtcgccggga ccgaaaggct 1680aaaactgtca caatgacgta aagtttggtt ggtgttggcg gctcacgcaa aaccagacct 1740ttccaagttt tactttagca gagttttttt ggaacgagag caaagcagca cagtttcaag 1800aatgtggggc aatttgaatg ttcgttcctg ctgcactgct actgctttta gaattgtagt 1860atgcttcatc atttatttat ttctaaaaaa acttgcatga attctatcgt gacttttatt 1920gagaaaataa tgtattcacg tatcttcatg tttctgataa aggtatttgt atatgcatcg 1980gtgctacata tgcgaataca agttttgttt caactctgaa gtctcaagtt gaattctaaa 2040ctccagtttg ttttctactg tgctgctgca ggaagccagg aacccatccg aacaaggttg 2100caatcatgat aagcagatag agcaagcata tgatgatatt ttgaattcgt cgaagcatac 2160tttggccagc atgatggagc tgcaggaggt tcagttattt gcacacattg tttttctttc 2220actcctatga ttttcctcaa tatgatcaaa atgtttcttt tgcaaataat gattgaaatg 2280tttctcattg tactcaacct cttaaactac ctataggctt tgcttgagag taatcaggct 2340acaaaggatg ccaatggtat tgctgctctc tatattgttc ttgttctaat gtaaaaacta 2400caacacaact ctttacttga tcccagaaat tccttctgcc tcaaatggag acaatgacga 2460gtggtcagaa gtacagagat tgcagacaag gtaaattttg caatagaaat aactaaccaa 2520ccattagtgc ttgaaaaaaa ctggactggt gactggggca cgtggtttca tcaacatttg 2580gacctcaacg gtctaatcag tataacttag aagttggcta gctcttgaaa aacactgcat 2640gacactaagc atttgtttat tttcagctgc ttgcacccct atgatttcaa gtaactactt 2700gtctacttgt gataatcacc tgaatatgag tatttgaaat gcttatcacg tctcggcaat 2760tgcatttctt ttatgcgtaa ctgaagtctg ctctagcttc ctaatagagt tcatttttta 2820atacagaaac cactttgaga tagccacaat atagtaaaag tggcagctaa ggtactaaaa 2880acacccatgc aaataagaaa aaaaatgaat cttgtatttt aattttgtta aatacctcta 2940tagtttggcg atatattatg ttaccatcct gcttgtagcc tgtaggtcat tttatatgag 3000ccatcaaatt gcgatgacag ttgccacaaa tccagtttca tatgaaggta ttagctgtgt 3060aacaagctaa ctgctgctct ctgcccaata agttattcaa ttggattagt aggttgcatc 3120caaggttatt caattggatc agtaggttgc atccaaggta tactgctgct ctctgcccaa 3180taagttattc aattcgatca gtaggttgcc tgttcccttc attttattaa aaaatacata 3240ataatataat aagtacctgt ttgttctaaa aataatactt ctgtaaatga ggttattaat 3300tttccttttg gtaataatgc aggttgatga tactgaagtc atcagttttt tgttgcaaac 3360tgaaataatt tctctgtgct tgcgaaccat ggagatgggt agtgagctat ccaaaactgt 3420atgtagctag ccatatattc tcattcaaat atcataattt atctcttctg cttaatactg 3480gcaaaggtgt aatagttttt ttagtattga tttgtcacct gaagtttatc ttgtgcacta 3540ctactttgcc atcatcagtt atctctagaa tactcttatc ctgtaccatc ttctctctga 3600taagcctaaa tttgtacaat tcataagcct aaaaggtgac ttatataata tatacaagga 3660ccctcaagag ttgtttggca attcagtgac tgtcctgggt cctgttttgg ggagcttctg 3720gtagcttttg cttctccaaa agaaaagcta gaagctcccc ccaaacagag cagcttcttc 3780aagccggtaa aagcttcaaa agctataatt atactaaaaa cagtgaagct ccctcagagc 3840agcttcccag ctctccagga gatgcttttg gagaagctac agtttcccca aacagggccc 3900tgctctgttg aacccccctt cctgatacat atttgaatat gagtttatag tgtgtgtggg 3960ggggtgtaag taggggggta atgggttcta aattttatac tataaaaatt aaggatcaga 4020ttagaattga gctctatttc tattcatttt tgaactaaaa ttaattaagg gctcaaatga 4080attatgaaga agcattagga tcatgatcca ttaccacccc tacgtgtaag atgttttttg 4140gtggttgtgg ttgattttga attttaaggc cgcatatgtc tcatggacta cacaagctca 4200tattcatcta catttgtagc cgtcactaac ttagccaaat atgcatatgt ggcggctgag 4260agcacctaga gggggggggg gtgaataggt gatcctataa aaacttgaaa cataatgcca 4320caaaacttga ttaggagtta gcacaataaa gccaagtgac tagagaggag ttcttgcaag 4380acacgataac cacacgaaga tcaacacaga tagacacaat ggtttatccc gttgttcggc 4440caagtccaac acttgcctac tccacgttgt ggcgtcccaa cggacgaggg ttgcaatcaa 4500cccctctcaa gcggtccaag gacccacttg aataccacgg tgttttgctt agtttcacta 4560tatcccgctt gcgaggaatc tacacaactt ggagcctctc gcccttacaa tttgatgttc 4620acaaagaagc acgaaagtaa ggctgggatg agcaacgcac acaagacaca aaatcagagc 4680acaacacgca cacaagtcac aacacgagct cacaacacaa cccaaagagt tctctactca 4740aatggagctc tagttgctat cacaaagaat cgaatacgcg gaattggagt cttggtgctt 4800agaaacgctt agagaatgct tggtgtgttc ctccatgcgc ctaggggtcc cttttatagc 4860cccaaggcag ctaggagccg ttgagagcat tccaggaagg caattcttgc cttctgtcgc 4920ctggcgcacc ggacagtccg gtgcaccacc ggacactgtc cggtgcggat ttctttcctt 4980ctttagcgaa gccgaccgtt ggagattcag agccgttggc gcaccggaca gtgtccggtg 5040cacaccggac agtccggtgc ccccttctga ccgttggctc tgccacgcgt cgcacgcgga 5100ttacgcggcc gaccgttggc ccggccgact gttggctcac cggatagtcc ggtgcaccac 5160cggacagtcc ggtgatttat agccgtacgc cgccgacgaa acccgagagc agccagttcg 5220ccagagccag cctggcgcac cagacactgt ccggtgcacc cagactacgc agtcttggct 5280gcacagccaa gtcttttcca aattggtctt tttctgtttc tagcacttag acacattaca 5340ttagtatcca aaacaatgta ctaagtctta gaaacatacc tttactcttg atttgcactt 5400tgtccatcat ttggcataga ttaacacatg accacttgtg ttggcactca atctccaaaa 5460tacttagaaa tggcccaatg gcacatttcc ctttcaatct cccccttttt ggtgatttat 5520gccaacacaa caaaaagcaa ctaaaagaag tgcaacatca atgcaaatga gaacaaaaaa 5580ttgttttgat tcaaatttgg catatttgga tcattctttg ccaccacttg gttttgtttt 5640tgcaaatcaa cctcaatttc ctatctctaa gtcaaacaca cttgttgaaa cataaagaga 5700gttgttccat gagaaattga tcaaagattt caaaaactcc cccttttttc cataatcaaa 5760cattctcccc acaagagacc aacttttgac agaagagaca ataagagaat tttgacaaac 5820caaaaagctc tattctacta ttttcaaaat tctcaagtgg tagctgatcc atttattgct 5880ttggccttat tttctccccc tttggcatca agcaccagaa cgggataaat cttggccctt 5940aaaaccccat tgcctcacca aaatcttcaa ttaagagtaa aaaggcaata agagcatgaa 6000gatgaacttg gagttagtta ctctttcatc ggagtgcagt ggaagtcttt catggtccaa 6060gtccaacatt tcctttcaat ccacctttga gactaaatca agcaaactca agcacacagt 6120tagtctcaag gggtcaagtt gtagcacaac tccccctaaa tatgtgcatt acttgcaaat 6180ggacttgtga ggtccgggga gtgtttgtac aacttgagca ccatacataa acaacaaaat 6240gcataaagga acatgatcaa ggcataaaac acatgtatgc tataaatcaa tccaagttcc 6300gcgaatctaa gacatttagc tcactacgca gcctacaaaa ggtcttctca tctagaggct 6360tggtaaagat atcggctagc tggttctcgg tgctaacatg aaacacttcg atatctccct 6420tttgctggtg atctctcaaa aagtgatgcc ggatgtctat gtgcttagtg cggctgtgtt 6480caacaggatt atccgccatg cggatggcac tctcattatc acataggagt gggactttgc 6540tcagattgta gccaaagtcc ttgagggttt gcctcatcca aagtagttgc gcgcaacact 6600gtcctgcggt aacgtactcg gcctcagcgg tggatagggc aacggaggtt tgtttcttag 6660aattccacga caccagggac cttcctaaga attgacacgt ccccgatgta ctctttctat 6720cgaccttaca tccaacatag tcggagtctg aatatccaat caagtcaaag gtagacccct 6780ttggatacca gatcccgaag caaggcgtag cgactaaata tctaagaatt cgcttcacaa 6840ccactaagtg acactccttt ggatcggatt gaaatctagc acacatgcat acacttagca 6900taatatctgg tctactagca cataaataaa gtaaggaccc tatcatagac cggtatgcct 6960tttgatcaac ggacttacct cctttgttga ggtcggtgtg tccgtcagtt cccattgtag 7020tctttgcggg cttggcgttc ttcatcccaa actgctttag cagatcttgc gtgtacttcg 7080tttgggagat gaaagtgccg tccttgagtt gcttcacttg gaacccaagg aagtagttca 7140actcgcccat catcgacatc tcgaatttct gagtcatcac cctgctaaac tcttcacaag 7200acttttggtt agtagaacca aatattatgt catcgacata aatttggcac acaaaaagat 7260caccatcaca agtcttagtg aataaagttg gatcggcttt cccaaccttg aaagcattag 7320caagtaaaaa gtctctaagg cattcatacc atgctcttgg ggcttgctta agtccataga 7380gggccttaga gagcttacac acgtggtcgg ggtaccgttc atcctcgaag ccagggggtt 7440gctccacgtg cacctcctcc ttgattagcc cgttgaggaa agcactcttc acatccattt 7500ggaacaacct gaaggaatgg tgagcggcat aggctaacaa aatacgaata gactctagcc 7560tagccacagg agcaaaagtc tcctcaaagt ccaaacctgc gacttgggca tagccttttg 7620ccacaagtct cgccttgttc cttgtcacca ccctgtgctc gtcctgtttg ttgcggaaca 7680cccacttggt tcccacaacg ttttgcttgg gacgaggcac cagtgtccaa acttcatttc 7740gcttgaagtt gatgagctct tcctgcatgg ccaacaccca gtccggatct agcaaggcct 7800cttctaccct gaaaggctca atagaagaga caaaagagta atgctcacaa aatttaacta 7860atctagagcg agtagttact cccttgctaa tgtcacccaa aatctggtcg acgggatgat 7920tcctttgaat cgtcgctcga acttgagttg aaggggcttg aggtgcttct tcctccataa 7980catgatcatc ttgtgctccc ccttgatcac atgcctcctc ttgatgaacc tgttcatcgt 8040cttgagttgg gggatgtacc aatgttgagg aagaaggttg atcttgctcc ttttgttcct 8100gtggccgcac atctccaatc gtcatggtgc gtattgcggc cgttggaatg tcttcttcat 8160ctacatcatt aagatcaaca acttgctctc ttggagagcc attagtctca tcaaatacaa 8220cgtcgctaga gacttcaacc aaactcgatg atttgttgaa aaccctatac gcctttgtat 8280ttgagtcata acctaacaaa aacccttcta cagctttggg agcaaactta gaatttctac 8340ctttcttcac tagaatgtag catttactcc caaatacacg aaagtatgaa acattgggtt 8400tgttaccggt taggagctca tacgaagtct tcttgaggag gtgatgaagg tagacccggt 8460ttatggcgtg gcaagccgtg ttcacggctt ccgaccaaaa tcgctcgggg gtcttgaact 8520ctccaagcat cgtcctcgcc atgtcaatga gcgtcctatt tttcctctct accacaccat 8580tttgctgtgg tgtgtaggga gcggagaact cgtgcttgat tccttcctcc tcaaggtact 8640cttctacttg aagattcttg aactccgacc cgttgtcgct ccttatcttc ttcaccttga 8700gctcaaactc attttgagct ctccttagga agcgcttgag ggtcccttgg gtttcagatt 8760tatcctgcaa aaagaatacc caagtgaagc gggaaaaatc atcaactata acaagaccat 8820acttacttcc tcctaagctt aggtaggcga cgggcctgaa gaggtccata tgtagcaact 8880ccaaaggtct tgatgtggtc atcacatttt tggtatgatg agagcttccc acttgtttac 8940ctgcttgaca agctgcataa ggtctatctt tttcgaaagt aacatttgtt agtcctatca 9000cgtgttctcc ctttagaagc ttgtgaaggt tcttcatccc cacatgtgct aagcggcgat 9060gccacagcca gcccatgcta gtcttagcaa ttaagcatgc atctagaccg gcctcctctt 9120ttgcaaaatc aactaagtag agtttgccgt ctaatacacc cttaaaagct aatgaaccat 9180cactccttct aaagacggac acatctacat ttgtgaataa gcaattatat cccatattac 9240aaagctaact cacagacaat aagttatatc caagagactc aactaaaaac acattagaaa 9300tagagtgctc ggatgaaatg gcaatcttcc ctagtccttt gaccttgcct tgattcccgt 9360caccgaatat gattgagtct tgggaatcct tgttcttgac gtaggaggtg aacatcttct 9420tctcccccgt catgtggttt gtgcatccgc tgtcgataat ccagcttgat cccccggatg 9480cataaacctg caaggcaaat taggcttggg tcttaggtac ccaactcttg ttgggtccta 9540caaggttagt acaaatagcc ttagggaccc aaatgcaagt tttatctccc ttgcattttg 9600cccctaattt tctagcaacc accttcttat cctttctaca aatatcaaag gaagcattta 9660aagcatgata aattgtagaa ggttcattac ttgttttcct aggtacatga gcatttctcc 9720taggcacatg atgaatgata tttttcctag ccaaatttct atcatgcata atagaagaac 9780ttgaagcaaa cattgcattt gaatcataag catgtgaaat gacatcattg caacttctat 9840catgatgaac attcctggaa tatctcctat catggtataa gaaagcatgg ttcttttgaa 9900tactatttgc cataggggcc ttccctttct ccttgatgga gataggagcc ttatgacttg 9960tcaagttctt ggcttccctc ttgaagccaa gcccatcctt aattgagggg tgtctaccaa 10020ccgtgtaggc atcccttgca aattttagtt tatcaaaatc atttttgcta gtcttaagtt 10080gagcattaag actagccact tcatctttta gtttagaaat gcaaactagg tgttcactac 10140aagcatcaac attgaaatct ttacacctat tgcaaatcgt aacatgttct tcacgagagg 10200ttaatttact agctatttct aacttagcac tcaaatcatc attaacactt tttaggctag 10260agatagattc atggcatgta gacaattcac atgaaagcat ttcatttctt ttaatttcta 10320aagcaagaga attttgtgct tctacaaact tatcatgttc ttcatacaaa agatcctctt 10380gcttttctaa taatctattc ttatcattca aggcatcaat caactcattg atcttatcaa 10440tcttagttct atctaagccc ttgaacaaac tagcatagtc tatttcatca tcgctagatt 10500catcatcact agaagcataa gtagactttc gagtgtttac cttcttctcc tttgccatta 10560agcatgtgtg atgctcgttg ggggaagagg aacgacttgt tgaaggccga ggcgacgagt 10620ccttcgttgt cggagtcgga cgacgaacaa tccgagtccc actccttgcc aaggtgtgcc 10680tcacccttag ccttcttata agccttcttc ttttccctct tcttctcttg ttcctggtca 10740ctatcattat cgggacaatt agcgataaat gaccaatctt

accacatttg aagcatgagc 10800gcttcccctt cgtcttgttg ggatgctcct tgcgaccctt tagtgctgtc ttgaatcgct 10860tgatgatgag ggccatttct tcttcattaa gcccgaccgc ctcaacttgc gccaccttgc 10920taggtagcgc ctccttgctc ctcgttgctt tgagagccac agtttgaggc tcgtggattg 10980ggccattcaa cgcatcatcg acgtatcttg cctccttgat catcatccgc ccgcttacga 11040actttccaag tatttcttcg ggcgacatct tggtgtacct aggattttca cgaatattgt 11100ttacaagatg tggatcaagg atagtgaagg accttagcat taggcggacg acgtcgtgat 11160ccgtccatcg cgtgcttcca tagcttctta ttttgttgac gagggtcttg agccggttgt 11220acgtttgggt tggctcttct cccctgatca ttgcgaatct cccaagttcg ccctccacca 11280actccatctt ggtgagcatg gtgacatcgt tcccctcatg tgagatcttg agggtgtccc 11340agatctgctt ggcgttatcc aagccgctca ccttatggta ttcatccctg cacaatgatg 11400ctagaagaac agtagtagct tgtgcatttt tgtgaatttg ctcattgata aatatgggac 11460tatcagtact atcaaattgc attccactct ctactatctc ccatatgctt ggatggagag 11520agaacaagtg actacgcatt ttgtgactcc aaaatccgta gtcctctcca tcaaagtgag 11580gaggtttacc aagtggaatg gagagtaaat gagcatttgt actttgcgga atacgagaat 11640aatcaaaaga aaagtttgaa ttgactgttt tctttttctc gtagttgtcg tcgtcctttt 11700gggaagaaga ggactcgtcg ctgtcgtcgt agtagacgat ctccttgatg caccttgttt 11760tcttcttctt cctgtctttt cttttgtggc tcgagcccga gtcagtaggc ttgtcatctt 11820ttggatcatt gacgaaggac tccttctcct tatcattgac caccatcccc ttgcccttag 11880gatccatctc ttcgggcgat tagtccctta cgtgaagaga acgactcaga taccaattga 11940gagcacctag aggggggtga ataggtgatc ctataaaaac ttgaaactta atgccacaaa 12000acttgattag gagttagcac aataaagcca agtgactaga gagttcttgc aagacacgat 12060aaccacacaa agatcaacac agatagacac agtggtttat ctcgtggttc ggccaagtcc 12120aacacttgcc tactccacgt tgtggcgtcc caacggacga gggttgcaat caacccctct 12180caagcggtcc aaggacccac ttgaatacca cagtgttttg cttagtttca ctatatcccg 12240cttgcgagga atctccacaa cttgtagcct ctcgccctta caatttgatg ttcacaaaga 12300agcacgaaag taaggctggg atgagcaacg cacacaagac acaaaatcag agcacaacac 12360gcacacaagt cacaactcga gctcacaaca caacccaaag agttctctac tcaaatggag 12420ctctagttgc tatcacaaag aatcgaatgc gcggaattgg agtcttggtg cttaggaacg 12480cttagagaat gctttgtgtg ttcctccata cgcctagggg tcccttttat agccccaagg 12540cagctaggag ccgttgagag cattccagga aggcaattct tgccttctgt cgcctggcgc 12600accggacagt ccggtgcacc atcggacact gtccggtgca gatttctttc cttttttagc 12660gaagccgacc gtcggagatt cagagccgtt ggcgcactgg acactgtccg atggacaccg 12720gacagtccgg tgcccccttc tgaccgttgg ctctgccacg cgtcgcgcgc ggattacgcg 12780gccgaccgtt ggctcgaccg actgttggct cactgaacag tccggtgcac caccggacag 12840tccggtgatt tatagccgta cgccgccgac gaaacccgag agcagctagt tcgctagagc 12900cagtctggcg caccagacat tgtccggtgc accaccggac agttcggtgc acccagactg 12960cgcagagtct tggctgcaca gccaagtctt ttccaaattg gtcttttcct gtttctagca 13020cttagacaca ttacattagt ctccaaaaca atgtactaag tattagaaac atacctttac 13080tcttgatttg cactttgtcc atcatttggc atagattaac acatgaccac ttgtgttggc 13140actcaatctc caaaatactt agaaatggcc caagggcaca tttccctttc agcggctagc 13200aacaggtcct tggtttcttg ggttatttat tctcttttta tcgtgtttga atgttttcgt 13260gttcatttgc ataacatctt aggtctacat cagtatatga attgagatca aatgtgaatt 13320ggaccacaca agctcatatt catctacatt tgtagtcgtc actaacttag ccaaatatgc 13380atatgtccgc ttctgatttc attgtgtctt ttcttcagga gtttggggat caaggagagg 13440actccattat cttgtcaccg cgactgaagg agattagtac tcctgaccgc cccgctgccc 13500tccgtttcct aggtac 13516561026DNAArtificial SequencecDNA Snare T1 56gcgcccggct cttcatcgcc tccacccgct ccagcgtctc caccacctcc ttcatcgagg 60gccgactgcg aggctcgccg gccaggcagc cgagcgtcag ttgcgccgct tggaacgcct 120gcttttgttg atcgtttgtt ttggtctgat ttcggtgggt ctatccgcag agaggaagaa 180gcagaagctc tccgagatcc aatccggcgt tgaggaagct gaatcgctga ttcagaaaat 240ggacctggag gcaaggagcc tacagcctag cattaaggct agtttgcttg caaagctgag 300ggagtataaa tctgacctca acaacgtcaa gagtgagctc aagaggatat ctgcgcccaa 360tgccaggcag gctacccggg aggagctcct ggagtctgga atggctgata ctctcgcagt 420gagctaatga taggacttga ctgtgtctac gagactgctc ctaacaataa actgaagaaa 480gcaaaagaaa tcattcaacg tattcgccga agagaactct acaagatatg gtagtcctgg 540gaggaagagc catcgccgct atgtatggca gaaccacccg cgaaagcaac ctctctggtc 600tcgcatagcc agagcaggag cagctcgctt gcgcggccgc agcgctggcg gtcggccccg 660cgtacgagcg cctgcaggaa gccaggaacc catccgaaca aggttgcaat catgataagc 720agatagagca agcatatgat gatattttga attcgtcgaa gcatactttg gccagcatga 780tggagctgca ggaggctttg cttgagagta atcaggctac aaaggatgcc aatgaaattc 840cttctgcctc aaatggagac aatgacgagt ggtcagaagt acagagattg cagacaaggt 900aaattttgca atagaaataa ctaaccaacc attagtgctt gaaaaaaact ggactggtga 960ctggggcacg tggtttcatc aacatttgga cctcaacggt ctaatcagta taacttagaa 1020gttggc 102657874DNAArtificial SequencecDNA Snare T1 57gcgcccggct cttcatcgcc tccacccgct ccagcgtctc caccacctcc ttcatcgagg 60gccgactgcg aggctcgccg gccaggcagc cgagcgtcag ttgcgccgct tggaacgcct 120gcttttgttg atcgtttgtt ttggtctgat ttcggtgggt ctatccgcag agaggaagaa 180gcagaagctc tccgagatcc aatccggcgt tgaggaagct gaatcgctga ttcagaaaat 240ggacctggag gcaaggagcc tacagcctag cattaaggct agtttgcttg caaagctgag 300ggagtataaa tctgacctca acaacgtcaa gagtgagctc aagaggatat ctgcgcccaa 360tgccaggcag gctacccggg aggagctcct ggagtctgga atggctgata ctctcgcagt 420gagctaatga taggacttga ctgtgtctac gagactgctc ctaacaataa actgaagaaa 480gcaaaagaaa tcattcaacg tattcgccga agagaactct acaaggaagc caggaaccca 540tccgaacaag gttgcaatca tgataagcag atagagcaag catatgatga tattttgaat 600tcgtcgaagc atactttggc cagcatgatg gagctgcagg aggctttgct tgagagtaat 660caggctacaa aggatgccaa tgaaattcct tctgcctcaa atggagacaa tgacgagtgg 720tcagaagtac agagattgca gacaaggtaa attttgcaat agaaataact aaccaaccat 780tagtgcttga aaaaaactgg actggtgact ggggcacgtg gtttcatcaa catttggacc 840tcaacggtct aatcagtata acttagaagt tggc 87458553DNAZea mays 58cgatgtgcag tggcctgatt agctacaaga agctcttgtt ccatggactc gatctctgga 60ccgcactatc gttgcctcag cccctaggtc atgctgccct ctggcctcct catcgtacaa 120ttcaccaaca tctccaatgt aagtgcagct ggttcagtaa tgaactcaga agtggcatca 180gaatactcca agagtttttt gttctttttg cctggatata taccaaggga aatgcattca 240aaactcctat agatgacgaa tcccatctct ccctcttttc tcggacacgg atccccaggt 300ccgtctccgt gctttactca tttgtttttt acaagttcag atccacttgc gtactcacac 360agtggacatc tgttatgcac atgtgtaaac cagcataaga attaggaatt atgctcattt 420tatctaagaa gtccttacac tcgaaaatgc atgtgttatt tagcttgaga ataaataaaa 480ttattagcaa ggagaaaaaa aataggacta aagaatagag tcacattggt ttaaattagt 540acctagaagc aaa 55359527DNAArtificial SequencecDNA Snare T2 59gcttctcgat gtgcagtggc ctgattagct acaagaagct cttgttccat ggactcgatc 60tctggaccgc actatcgttg cctcagcccc taggtcatgc tgccctctgg cctcctcatc 120gtacaattca ccaacatctc caatgtaagt gcagctggtt cagtaatgaa ctcagaagtg 180gcatcagaat actccaagag ttttttgttc tttttgcctg gatatatacc aagggaaatg 240cattcaaaac tcctatagat gacgaatccc atctctccct cttttctcgg acacggatcc 300ccaggtccgt ctccgtgctt tactcatttg ttttttacaa gttcagatcc acttgcgtac 360tcacacagtg gacatctgtt atgcacatgt gtaaaccagc ataagaatta ggaattatgc 420tcattttatc taagaagtcc ttacactcga aaatgcatgt gttatttagc ttgagaataa 480ataaaattat tagcaaggag aaaaaaaata ggactaaaga atagagt 527609062DNAZea mays 60gttgcgagat gtttatgtgc cttagtcttc aatttggggg ttgggggaaa agtaatttta 60tgtttttgtt ttgtgtctgc agattcggaa gatggacttg gaggcaagga gcctacagcc 120tagcattaag gctggtttgc ttgcaaagct gagggagtat aaatctgacc tcaacaacgt 180caagagtgag ctcaagagga tatttgcgcc caatgccagg caggctaccc gggaggagct 240cctagagttt ggaatggctg atactctcgc tgtgagctaa tgctaggact tgactgtgtc 300tacgagactg ctcctaacaa taaactgaag aaagcaaaag aaatcattca acgtattcgc 360cgaagagaac tctacaaggt agtatgatgc tttaattgct catatacaag tgtcattttg 420tcatgtcatt acacatggtt aggatacata cttaagtttc taacgtaggc gtccacacaa 480cggattggtg cacggttctg ccgatgtatc ccacgcacgt gcatggaagg aggcaggcac 540ccttccccgc cgccccggat ctcgcgccag cccccgccct accccgcctg cccttccact 600cttcccccgc tgcccccggt caacgtcacg aacccgggcc tcgtgccgct cgtcgtggcc 660acactgttcg acgagcgagt cacagagctg ctgagcgtgc tcgctgatgc ggcggtgggg 720cgaccaggca ggtggtccat cggcgaagcg ccatggtcgt cgtcgggggg cacgaaccag 780gcggtgtacg cgcgccgcgc gcccggctct tcatcgcctc cacccgctcc agcgtctcca 840ccacttcctt catcgagggc cgactgcttg gctcgctggc caggcagccg agcattagtt 900gcgccgcttg gaacgcctgc ttttgttgat cgtttgtttt ggtctgattt cagtgggtct 960atccgcagag aggaagaagc agaagctctc cgagatccaa tccggcgttg aggaagctga 1020atcgctggta aatagatgtc gcgacgcgtt ctgttttggg gatccccttg gctaacggga 1080catacgacat ttggggaatg ggtagaaaag cagagattag ggatttttcg tttccgtcgg 1140tgcagttttg gtgttccaac agagttgcga gatgtttatg tgccttagtc ttcaatttgg 1200gggttggggg aaaagtaatt ttatgttttt gttttgtgtc tgcagattca gaaaatggac 1260ctggaggcaa ggagcctaca gcctagcatt aaggctggtt tgcttgcaaa gccgagggat 1320tataaatctg acctcaacaa cgtcaagagt gagctcaaga ggatatctgc gcccaatgcc 1380aggtaggcta cccgggagga gctcgtggag tctagaatgg ctgatactct cgcagtgagc 1440taatgctagg acttgactgt gtctacgaga ctgctcctaa taataaactg aagaaagcaa 1500aagaaatcat tcaacgtatt cgccgaagag aactctacaa ggtagtatga tgctttaatt 1560gctcatatac aagtgtcatt ttgtcatgtc attacacatg gttaggatac atacttaagt 1620ttctaacgta ggcatccaca caatggattg gtgcacggtt ctgccgatgt atcccacgca 1680cgcgcatgga aggaggcagg cacccttccc tgccgccccg gatctcgcgc cagccatcgc 1740cctaccccgc ctgcccttcc actcttcccc ctgaaagtcg cctagagggg gggtgaatag 1800ggcgaatctg aaatttacaa acttaagcac aactacaagc cgggttaacg ttagaaatat 1860aaacgagtcc gagagagagg gcgcaaaaca aatcatgagc aaataaagag tgagacacga 1920tgatttgttt taccgaggtt cggttcttgc aaacctactc cccgttgagg tggtcacaaa 1980gaccgggtct ctttcaaccc tttccctctc tcaaacggtc acttagaccg agtgagcttc 2040tcttctcaat caaacgaaac acaaagttcc cgcaaggacc accacacaat tggtgtctct 2100tgccttggtt acaattgagt ttgatcacaa gaagaatgag aaagaaaaga agcgatccaa 2160gcgcaagagc tcaaatgaac acaaatgtcg ctctctctag tcactatttg atttggagtg 2220attccggact tgggagagga tttgatcttt tggagtgtct agaattgaat gctatagctc 2280ttgtaatatg ttgaaggtgg gaaacttgga tgccattgaa tgtggggtgg ttggggtatt 2340tatagcccca aaacaccaaa aaaggccgtt ggaaggctgc tctcgcatgg cgcaccggac 2400agtccggtgc gccagccacg tcagcagacc gttggggttc gaccgttgga gctctgactt 2460gtggggcctc tgggctgtcc ggtggtgcac cggacaggtc ctgtaggatg tctggtgcgc 2520caactgcacg tgctctgtcc tctgcgcgcg caggcgcgca ttaaatgcgt tgtagtcaac 2580cgttgcgcgc gaagtagcca ttgctctgct ggcacaccgg acagtccggt gaattatagc 2640ggagcgccct ctgattttcc cgaaggtagc gagttcagct tcgagtgccc tggtgcaccg 2700gacactgtcc ggtgcgccaa accagggtgc cttccgggat gtcttttgct ctctttgttt 2760gaaccctttc ttggtctttt tattggctta ttgtgaacct ttgacacctg taaaacttat 2820agactagagc aaactagtta gtccaattat ttgtgttgga caattcaacc accaaaatca 2880attaggaaat aggtgtgagc ctaattccct ttcaatctcc ccctttttgg tgattgatgc 2940caacacaaac caaagcaagt atagaagtgc ataattgaac tagtttgcat aatgtaagtg 3000caaaggttac ttagaattga accaataaat attttcataa gttatgcatg gattgtttct 3060ttattttcat cattttggac cacgcttgca ccacatgttt tgtttttgca aatccttttg 3120taaatagtca aaggtaaatg aataagattt tgagaagcat tttcaaaatt tgaaattttc 3180tccccctgtt tcaaatgctt ttcctttgac ttaaacaaaa ctcccccctc aaaaatccta 3240ctcatagtgt tcaagagggt tttaagatat caattttgaa aatgctactt tctccccctt 3300ttgaatataa taagatatca attgaaaaat tcatcatttt aaaacctttt gaaaatgggt 3360ggtggtgcgg tccttttgct ttgggctaat actttctccc cctttggcat gaatcgccaa 3420aaacgaatac ttgagtgaaa tataagcccc tttaactact ttctcctgct ttggcgaaca 3480taatatgagt gaagattata ccaaagttgg agagttgctt gaagcgacgg tgaaggatga 3540gttatggagt ggaggttaag cctttgtctt cgccgaagat tccaattccc tttcaataca 3600cctatgactt ggtttgaaat atacttgaaa acacattagt catagcacat gaaagagata 3660tgatcaaagg tatattaatg agctatgtat gcaagacatc aaaagaaatt cctagaatca 3720agaatattta gctcgtgtct aagtttgttc atctagtggc ttggtaaaga tatcggctaa 3780ttgttcctta gtgttaatat aggcaatctc gatatctccc tttttttggt gatcccttag 3840gaaatgatac cgaatggcta tgtgtttagt gcggctatgc tcaacgggat tatccgccat 3900gcggattgca ctctcattat cacatagaag aggaactttg gttaattttt aaccatagtc 3960cctaagggtt tgcctcatcc aaagtaattg tgcgcaacaa tggcctgcgg caatatactc 4020ggcttcggcg gtagaaagag ctacggaatt ttgcttcttt gaagcccaag acaccaggga 4080ccttcccaag aactggcaag tccccgatgt actctttcta ttaattttac accccgccca 4140atcggcatcc gaataaccaa tcaaatcaaa tgtggatccc gtaggatacc aaagcccaaa 4200cttaggagta tgaactaaat atctcaagat tcgttttacg gccgtaaggt gagcttcctt 4260agggtcggct tggaatcttg cacacatgca tacggaaagc ataatatccg gtcgagatgc 4320acataaatag agtaaagagc ctatcatcga ccggtatacc ttttgatcga cggatttacc 4380tcccgtgtcg aggtcgagat gcccattggt tcccatgggt gtcttgatgg gtttggcatc 4440cttcatccca tacttgttta gaatgtcttg aatgtacttc gtttggctaa tgaaggtgcc 4500ctcttagcgt tgcttcactt gaaatcacaa gaagtacttc aactccccca tcatagacat 4560ctcgaatttc tgtgtcatga tcctactaaa ttcctcacat gtagattcat tagtagaccc 4620aaatataata tcatcaacat aaatttggca tacaaacaaa tcattgtcaa gagttttagt 4680aaataaagta ggatcggcct ttccgacttt gaaaccatta gtgataagga aatctctaag 4740gcattcatac catgctcttg gggcttgctt gagcccataa agcgccttag agagtttata 4800tacgtgatta gggtactcac tatcttcaaa gccgggaggt tgctcaacat agacctcttc 4860cttgattggt ccgttgagga aggcactttt cacgtccatt tgataaagct tgaagccatg 4920gtaagtagca taggcaagta atatgcgaat tgactcaagc ctagctacgg gtgcataggt 4980ttcaccaaaa tccaaacctt cgacttgtga ataacccttg gccacaagtc gggctttgtt 5040ccttgtcacc acaccatgct catcttgttt gttgcggaag acccacttgg ttcctacaac 5100attttgatta ggacgtggaa ctaagtgaca tacctcattc ctagtgaagt tgttgagctc 5160ctcttgcatc gccaccaccc aatctgaatc ctgtagtgct tcctctaccc tttgtggctc 5220aatagaggaa acaaaagagt aatgttcaca aaaatgagca acacgagatc gagtagttac 5280ccccttttga atatcgccga ggatggtgtt cacggggtga tctcgttgga ttgcttggtg 5340gactcttggg tgtggcggtc ttggttcttc atcctccttg tcttgatcat ttgcatctcc 5400cccttgatta ttgccgtcat cttgaggtgg ctcatcttct tgatcttctc ctttatcatc 5460ttgagcctca tcctcatttt gagttggtgg agatgcttgc gtggaggagg atggttgatc 5520ttgtgcattt ggaggctctt tggattcctt aggacacaca tccccaatgg acatgttcct 5580tagcgcgacg cacggagcct cttcatcacc tatctcatca agatcaactt gctctacttg 5640agagccgtta gtttcatcaa acacaatgtc accagaaact tcaactagtc ccgaggactt 5700gttaaagact ctatatgccc ttgtgtttga atcataccct agtaaaaagc cttctacagc 5760cttaggagca aatttagatt ttctacctct tttaacaaga atgaagcatt tgctaccaaa 5820gactctaaaa tatgaaacat tgggctttta ccggttagga gttcatatga tgtcttcttg 5880aggattcggt gtagatataa ccggttgatg gtgtagcaag cggtgttgac cgcctcgatc 5940caaaaccgat ccgaagtctt gtactcatca agcatggttc ttgccatgtc caatagagtt 6000cgattcttcc tctccactac accattttgt tgtggggtgt agggagaaga gaactcatgc 6060ttgatgccct cctcctcaag gaagccttcg atttgagaat tcttgaactc cgtcccgttg 6120tcgcttctaa tctttttgat tcttaagccg aactcatttt gagcccatct caagaatccc 6180tttaaggtct cttgggtatg agatttttcc tgcaaaaaga atacccaagt gaagcgagta 6240taatcatcca caataactag atagtactta ctcccgccga tgcttatgta agctatcggg 6300ccgaataggt ccatatgtag gagctcgagt ggcctgtcag tcatcatgat gttcttgtgt 6360ggatgatgag taccaacttg cttccctgtc tgacatgcgc tacaaatcct atctttctca 6420aagtgaacat ttgttagtcc caaaatgtgt tctcccttta gaagcttgtg aagattcttc 6480attccaacat gtgctagtcg gtgatgccag agccagccca tgttagtctt agcaattaag 6540caagtgtcga gttcagctct atcaaaatct accaagtata gctgaccctc taacactccc 6600ttaaatgcta ctgaatcatc acttcttcta aagacagtaa cacctatatc tgtaaaaaga 6660cagttgtagc ccattttaca taattgcgaa actgaaagca agttgtaatc taaagaatct 6720acaagaaaca cattggaaat ggaatggtca ggagatatag caattttacc caatcctttg 6780accaaacctt ggtttccatc cccgaatgtg atcgctcttt ggggatcttg gtttttctca 6840taggaggaga acattttctt ctcccctgtc atatggtttg tgcacccgct atcgatgatc 6900caacttgggc ccccggatgc ataaacctac aaaacaagtt tagttcttga ttttaggtac 6960ccaaatggtt ttgggtcctt tgacattaga tacaagaact ttgggtaccc aaacacaagt 7020ctttgatccc ttgtgtttgc ccccaacata cttggcaact atcttgtcgg atttgttagt 7080taaaacataa gatgcatcaa aagttttgaa tgaaatgtta tgatcatttg atgcagcagg 7140agttttcttc ttaggcaatt ttgcacgggt tgattgccta gagctagatg tctcaccctt 7200atacataaaa gcatgattat ggccagagtg agacttccta gaatgaattc tcctaatttt 7260gctctcggga taaccggcag ggtacaaaat gtaaccctca ttatcctgag gcatgggagc 7320cttgccctta acaaagtttg acaatctttt aggagaggca ttaagtttga cattgtttcc 7380cttttggaag ccaatgccat ccttgatgcc agggcgtctc ccactataga gcatgcttct 7440agcaaattta aatttttcat tttttaagtc atgctcggca attttagcat ctaattttgc 7500tatatgatta ttttgttgtt taattaaagc catatgatca tgaatagcat caatgttaat 7560atctctacat ctagtgcaaa taatgacatg ctcaatggca gatgtagagg gtttgcaaga 7620attaagttca acaatcttag cacgtaaaat atcattgtta tttctaagat cagaaatgga 7680agcattgcaa acatctaatt ctttagcctt agcaatcaat ttttcatttt caaccctaag 7740gctagcaaga gagacattca attcttcaat cttagcaagc aaattaacat tatcatctct 7800aagattggga attgaaacat cacaaatatt agaatcaacc ttagcaatta gtttagtatt 7860tttatttcta aggatggtaa tagtatcatg gcaagtgctt agctcactag ataatttttc 7920acatttttct acttctagag cataagcatt tttaacctta acatgcttct tattttcctt 7980aattaggaag tcctcttgaa agtccaagag atcatctttc tcatgaatag cactaattaa 8040ttcatttagt ttttcctgta gttgcatgtt taggttggca aaaagggtac gcaaattatc 8100ctcctcatca ctagcattat cttcatcact agaggatgca tatttagtgg aggattttga 8160ttttaccttc ttctttttgc cgtcctttgc catgaggcac ttgtggccga cgttggggaa 8220gaggagccct ttggtgacgg cgatgttggc ggcgtcctcg tcggatgagg agtcggagga 8280actctcgtcg gagtcccact cgcggcacac atgggcatcg ccgcccttct tcttgtaata 8340cctcctcttt tctctcctct tgcccttctt gtcgtcgccc ctgtcactat cactagataa 8400aggacattta acaatgaaat gaccgggctt accacacttg tagcacaccc ttttggaaca 8460aggcttgtaa tctttcccct tcctttgttt gaggatttgg tggaagctct tgatgatgag 8520cgccattttc tagttgtcga gcttagaggc gtcgatgggt tgtctacttg atgtagactc 8580ctctttcttc tcctctgtcg ccttgaatgc gaccggttgt gcttcgggcg tggagggacc 8640gtcgtgctcg ataattttct ttgagccttt gatcatcaac tcaaagctca caaagtttcc 8700tattacttcc tcgggagtca ttagtgtata tctaggatta ccacgaatta attgaacttg 8760cgtagggtta aggaacacaa gtgatctaag aataacctta accatttcat ggtcatccca 8820ttttttgctc ccgaggttgc gcacttggtt caccaaggtc ttgagccggt tgtacatatc 8880ttgtggctcc tccccttggc gaagccggaa gcgaccgagc tccccctcga tcgtctcccg 8940cttggtgatc ttggttacct cgtctccttc gtgcgcggtc tttagcacgt cccagatatc

9000ctttgcactc ttcaaccctt gcaccttatt atactcctct cgacttggat ttgaaatgtt 9060gg 9062611082DNAArtificial SequencecDNA Snare T3 61cctgcccttc cactcttccc ccgctgcccc cggtcaacgt cacgaacccg ggcctcgtgc 60cgctcgtcgt ggccacactg ttcgacgagc gagtcacaga gctgctgagc gtgctcgctg 120atgcggcggt ggggcgacca ggcaggtggt ccatcggcga agcgccatgg tcgtcgtcgg 180ggggcacgaa ccaggcggtg tacgcgcgcc gcgcgcccgg ctcttcatcg cctccacccg 240ctccagcgtc tccaccactt ccttcatcga gggccgactg cttggctcgc tggccaggca 300gccgagcatt agttgcgccg cttggaacgc ctgcttttgt tgatcgtttg ttttggtctg 360atttcagtgg gtctatccgc agagaggaag aagcagaagc tctccgagat ccaatccggc 420gttgaggaag ctgaatcgct gattcagaaa atggacctgg aggcaaggag cctacagcct 480agcattaagg ctggtttgct tgcaaagccg agggattata aatctgacct caacaacgtc 540aagagtgagc tcaagaggat atctgcgccc aatgccagat tcggaagatg gacctggaag 600caaggagcct acaacctagc attaagagtg agctcaagag gatatctgcg cccattgcca 660ggcaggctac ccgggaggag ctcctggagt ctggaatggc tgatactctc gcagtgagct 720aatgctagga cttgactgtg tctacgagac tgctcctaac aataaactga agaaagcaaa 780agaaatcatt caacgtattc gccgaagaga actctacaag atatggtagt cctgggagga 840agagccatcg ccgctatgta tggcagaacc acccgcgaaa gcaacctctc tggtctcgcg 900tagccagagc aggagcagct cgcttgcgcg gtcgcggcgc tggcggccgg ccccgcgtac 960gagcgcctgc aggaagccag gaacccatcc gaacaaggtt gcaatcatga taagcagata 1020gagcaagcat atgatgatat tttgaattcg tcgaagcata ctttggccag catgatggag 1080ct 1082621154DNAArtificial SequencecDNA Snare T3 62cctgcccttc cactcttccc ccgctgcccc cggtcaacgt cacgaacccg ggcctcgtgc 60cgctcgtcgt ggccacactg ttcgacgagc gagtcacaga gctgctgagc gtgctcgctg 120atgcggcggt ggggcgacca ggcaggtggt ccatcggcga agcgccatgg tcgtcgtcgg 180ggggcacgaa ccaggcggtg tacgcgcgcc gcgcgcccgg ctcttcatcg cctccacccg 240ctccagcgtc tccaccactt ccttcatcga gggccgactg cttggctcgc tggccaggca 300gccgagcatt agttgcgccg cttggaacgc ctgcttttgt tgatcgtttg ttttggtctg 360atttcagtgg gtctatccgc agagaggaag aagcagaagc tctccgagat ccaatccggc 420gttgaggaag ctgaatcgct gattcagaaa atggacctgg aggcaaggag cctacagcct 480agcattaagg ctggtttgct tgcaaagccg agggattata aatctgacct caacaacgtc 540aagagtgagc tcaagaggat atctgcgccc aatgccagac tgctcctaat aataaactga 600agaaagcaaa agaaatcatt caacgtattc gccgaagaga actctacaag attcggaaga 660tggacctgga agcaaggagc ctacaaccta gcattaagag tgagctcaag aggatatctg 720cgcccattgc caggcaggct acccgggagg agctcctgga gtctggaatg gctgatactc 780tcgcagtgag ctaatgctag gacttgactg tgtctacgag actgctccta acaataaact 840gaagaaagca aaagaaatca ttcaacgtat tcgccgaaga gaactctaca agatatggta 900gtcctgggag gaagagccat cgccgctatg tatggcagaa ccacccgcga aagcaacctc 960tctggtctcg cgtagccaga gcaggagcag ctcgcttgcg cggtcgcggc gctggcggcc 1020ggccccgcgt acgagcgcct gcaggaagcc aggaacccat ccgaacaagg ttgcaatcat 1080gataagcaga tagagcaagc atatgatgat attttgaatt cgtcgaagca tactttggcc 1140agcatgatgg agct 115463107PRTZea mays 63Met Ile Gly Leu Asp Cys Val Tyr Glu Thr Ala Pro Asn Asn Lys Leu1 5 10 15Lys Lys Ala Lys Glu Ile Ile Gln Arg Ile Arg Arg Arg Glu Leu Tyr 20 25 30Lys Glu Ala Arg Asn Pro Ser Glu Gln Gly Cys Asn His Asp Lys Gln 35 40 45Ile Glu Gln Ala Tyr Asp Asp Ile Leu Asn Ser Ser Lys His Thr Leu 50 55 60Ala Ser Met Met Glu Leu Gln Glu Ala Leu Leu Glu Ser Asn Gln Ala65 70 75 80Thr Lys Asp Ala Asn Glu Ile Pro Ser Ala Ser Asn Gly Asp Asn Asp 85 90 95Glu Trp Ser Glu Val Gln Arg Leu Gln Thr Arg 100 10564131PRTZea mays 64Met Cys Ser Gly Leu Ile Ser Tyr Lys Lys Leu Leu Phe His Gly Leu1 5 10 15Asp Leu Trp Thr Ala Leu Ser Leu Pro Gln Pro Leu Gly His Ala Ala 20 25 30Leu Trp Pro Pro His Arg Thr Ile His Gln His Leu Gln Cys Lys Cys 35 40 45Ser Trp Phe Ser Asn Glu Leu Arg Ser Gly Ile Arg Ile Leu Gln Glu 50 55 60Phe Phe Val Leu Phe Ala Trp Ile Tyr Thr Lys Gly Asn Ala Phe Lys65 70 75 80Thr Pro Ile Asp Asp Glu Ser His Leu Ser Leu Phe Ser Arg Thr Arg 85 90 95Ile Pro Arg Ser Val Ser Val Leu Tyr Ser Phe Val Phe Tyr Lys Phe 100 105 110Arg Ser Thr Cys Val Leu Thr Gln Trp Thr Ser Val Met His Met Cys 115 120 125Lys Pro Ala 13065162PRTZea mays 65Met Glu Gly Gly Arg His Pro Ser Pro Pro Pro Arg Ile Ser Arg Gln1 5 10 15Pro Pro Pro Tyr Pro Ala Cys Pro Ser Ile Leu Pro Pro Leu Pro Pro 20 25 30Val Asn Val Thr Asn Pro Gly Leu Val Pro Leu Val Val Ala Thr Leu 35 40 45Phe Asp Glu Arg Val Thr Glu Leu Leu Ser Val Leu Ala Asp Ala Ala 50 55 60Val Gly Arg Pro Gly Arg Trp Ser Ile Gly Glu Ala Pro Trp Ser Ser65 70 75 80Ser Gly Gly Thr Asn Gln Ala Val Tyr Ala Arg Arg Ala Pro Gly Ser 85 90 95Ser Ser Pro Pro Pro Ala Pro Ala Ser Pro Pro Leu Pro Ser Ser Arg 100 105 110Ala Asp Cys Leu Ala Arg Trp Pro Gly Ser Arg Ala Leu Val Ala Pro 115 120 125Leu Gly Thr Pro Ala Phe Val Asp Arg Leu Phe Trp Ser Asp Phe Ser 130 135 140Gly Ser Ile Arg Arg Glu Glu Glu Ala Glu Ala Leu Arg Asp Pro Ile145 150 155 160Arg Arg

* * * * *

Patent Diagrams and Documents
D00000
D00001
D00002
D00003
S00001
XML
US20200229367A1 – US 20200229367 A1

uspto.report is an independent third-party trademark research tool that is not affiliated, endorsed, or sponsored by the United States Patent and Trademark Office (USPTO) or any other governmental organization. The information provided by uspto.report is based on publicly available data at the time of writing and is intended for informational purposes only.

While we strive to provide accurate and up-to-date information, we do not guarantee the accuracy, completeness, reliability, or suitability of the information displayed on this site. The use of this site is at your own risk. Any reliance you place on such information is therefore strictly at your own risk.

All official trademark data, including owner information, should be verified by visiting the official USPTO website at www.uspto.gov. This site is not intended to replace professional legal advice and should not be used as a substitute for consulting with a legal professional who is knowledgeable about trademark law.

© 2024 USPTO.report | Privacy Policy | Resources | RSS Feed of Trademarks | Trademark Filings Twitter Feed