U.S. patent application number 17/316793 was filed with the patent office on 2021-05-11 and published on 2021-09-02 for inhibition of bolting and flowering of a Beta vulgaris plant.
The applicant listed for this patent is KWS SAAT SE & Co. KGaA. Invention is credited to Gerrit Cornelis ANGENENT, Rudolf Aart DE MAAGD, Josef KRAUS, Jeroen VAN ARKEL, David WURBS.
Application Number | 20210269817 17/316793 |
Family ID | 1000005586703 |
Filed Date | 2021-05-11 |
United States Patent
Application |
20210269817 |
Kind Code |
A1 |
DE MAAGD; Rudolf Aart; et al. |
September 2, 2021 |
INHIBITION OF BOLTING AND FLOWERING OF A BETA VULGARIS PLANT
Abstract
The present invention provides means for inhibiting the bolting
and flowering of a Beta vulgaris plant, including an isolated
nucleic acid, which can be used to produce a transgenic Beta
vulgaris plant, where bolting and flowering is inhibited after
vernalization. Furthermore, the invention discloses vectors,
transgenic and non-transgenic, non-bolting plants and parts
thereof, and methods for producing such plants.
Inventors: |
DE MAAGD; Rudolf Aart (Wageningen, NL)
VAN ARKEL; Jeroen (Renkum, NL)
ANGENENT; Gerrit Cornelis (Wageningen, NL)
WURBS; David (Einbeck, DE)
KRAUS; Josef (Einbeck, DE) |
Applicant: |
Name | City | State | Country | Type |
KWS SAAT SE & Co. KGaA | Einbeck | | DE | |
Family ID: |
1000005586703 |
Appl. No.: |
17/316793 |
Filed: |
May 11, 2021 |
Related U.S. Patent Documents
Application Number | Filing Date | Patent Number |
15771873 | Apr 27, 2018 | 11034971 |
PCT/EP2016/076090 | Oct 28, 2016 | |
17316793 | | |
Current U.S.
Class: |
1/1 |
Current CPC
Class: |
A01H 5/04 20130101; C12N
15/8265 20130101; C12N 15/827 20130101; C07K 14/415 20130101; C12N
15/8222 20130101; A01H 6/024 20180501; A01H 1/1215 20210101; C12N
15/8267 20130101 |
International
Class: |
C12N 15/82 20060101
C12N015/82; C07K 14/415 20060101 C07K014/415; A01H 5/04 20060101
A01H005/04; A01H 1/00 20060101 A01H001/00; A01H 6/02 20060101
A01H006/02 |
Foreign Application Data
Date | Code | Application Number |
Oct 30, 2015 | EP | 15003108.6 |
Claims
1-15. (canceled)
16. A method for producing a Beta vulgaris plant, where the bolting
and flowering is inhibited after vernalization, comprising the
following steps: (I) mutagenizing one or more parts of a Beta
vulgaris plant and subsequently regenerating one or more Beta
vulgaris plants from one or more parts to yield a regenerated
plant, or mutagenizing one or more Beta vulgaris plants to yield a
mutagenized plant; (II) identifying a regenerated or mutagenized
plant of (I) that exhibits one or more mutations in an endogenous
DNA sequence; (III) generating a Beta vulgaris plant in which the
one or more mutations in the endogenous DNA sequence is homozygous;
wherein the endogenous DNA sequence has a nucleic acid sequence
identical to a sequence that A) exhibits a sequence comprising the
coding sequence of SEQ ID NO: 4, B) comprises a nucleotide sequence
exhibiting at least 98% sequence identity to the coding sequence of
SEQ ID NO: 4; C) is complementary to SEQ ID NO: 4; or D) encodes a
protein comprising an amino acid sequence exhibiting at least 98%
sequence identity to SEQ ID NO: 6.
17. A Beta vulgaris plant, produced by the method of claim 16.
18. A Beta vulgaris plant, wherein bolting and flowering is
inhibited after vernalization, wherein the plant comprises one or
more mutations in an endogenous DNA sequence comprising a nucleic
acid sequence identical to a sequence that a) comprises a sequence
comprising the coding sequence of SEQ ID NO: 4, b) comprises a
nucleotide sequence exhibiting at least 98% sequence identity to
the coding sequence of SEQ ID NO: 4, c) is complementary to SEQ ID
NO: 4, or d) encodes a protein comprising the amino acid sequence
of SEQ ID NO: 6 or encodes a protein comprising an amino acid
sequence exhibiting at least 98% sequence identity to SEQ ID NO: 6,
wherein the one or more mutations cause a reduction of the activity
or stability of the protein or polypeptide encoded by the
endogenous DNA sequence compared to a non-mutagenized wild type
plant.
Description
CROSS-REFERENCE TO RELATED PATENT APPLICATIONS
[0001] This application is a continuation of U.S. application Ser.
No. 15/771,873, filed Apr. 27, 2018, which is a National Stage of
International Application No. PCT/EP2016/076090, filed Oct. 28,
2016, which claims priority of European Patent Application No.
15003108.6, filed Oct. 30, 2015, the contents of each of which are
herein fully incorporated by reference into this application.
SEQUENCE LISTING
[0002] This application contains a Sequence Listing which has been
submitted in ASCII format via EFS-Web and is hereby incorporated by
reference herein in its entirety. The ASCII text file was created
on Apr. 17, 2018, is named SequenceListing_ST25.txt and is 246,431
bytes in size.
[0003] The present invention relates to an isolated nucleic acid
for inhibiting bolting (the first visible sign of reproductive
transition in beets) and flowering of a Beta vulgaris plant, as
well as the use thereof, a method for producing a transgenic Beta
vulgaris plant or a non-transgenic Beta vulgaris plant in which
bolting and flowering is inhibited after vernalization, vectors or
mobile genetic elements, as well as a transgenic or non-transgenic
Beta vulgaris in which bolting and flowering is inhibited after
vernalization, and seeds as well as their parts.
[0004] It is possible to use molecular biological techniques or
mutagenesis techniques to genetically modify crops in order to
change their properties and thus to improve them. One property of
importance in the cultivation and use of biennial plants such as
Beta vulgaris is that bolting and subsequent flowering requires an
induction by a longer period of cold weather, as regularly occurs
in temperate latitudes in winter. This transition from the
vegetative to the generative phase induced by a prolonged period of
low temperature is referred to as vernalization.
[0005] There are several metabolic pathways by which flowering is
controlled. These include inter alia the photoperiodic metabolic
pathway, an autonomous pathway, a gibberellic acid pathway and a
vernalization-dependent pathway. A large number of genes involved
in the regulation of flowering have been identified in recent years
in model plants. In particular the control of the timing of
flowering was extensively explored in the model plant Arabidopsis
(Boss, P K, Bastow R M, Mylne, J S, and Dean, C. (2004) Multiple
pathways in the decision to flower: enabling, promoting, and
resetting, Plant Cell 16 Suppl: 18-31; He, Y. and Amasino, RM
(2005) Role of chromatin modification in flowering-time control,
Trends Plant Sci 10, 30-35; Baeurle, I. and Dean, C. (2006) The
timing of developmental transitions in plants, Cell, 125 (4):
655-664). Primarily using Arabidopsis mutants, many
"early-flowering" or "late-flowering" genes were identified (Gazzani, S.,
Gendall, A R, Lister, C., and Dean, C. (2003) Analysis of the
molecular basis of flowering time variation in Arabidopsis
accessions, Plant Physiol 132: 1107-1114; Geraldo, N., Baurle, I.,
Kidou, S., Hu, X., and Dean, C. (2009), FRIGIDA Delays Flowering
in Arabidopsis via a Mechanism Involving Cotranscriptional Direct
Interaction with the Nuclear Cap-Binding Complex, Plant Physiology,
Jul. 1, 2009; 150 (3): 1611-1618; Michaels S D, Amasino, R M (2001)
Loss of FLOWERING LOCUS C activity eliminates the late-flowering
phenotype of FRIGIDA and autonomous pathway mutations but not
responsiveness to vernalization, Plant Cell 13: 935-942; Yalovsky,
Shaul, et al. "Prenylation of the floral transcription factor
APETALA1 modulates its function." The Plant Cell 12.8 (2000):
1257-1266; Gu, Qing, et al. "The FRUITFULL MADS-box gene mediates
cell differentiation during Arabidopsis fruit development."
Development 125.8 (1998): 1509-1517).
[0006] In Beta vulgaris, only very few genes have so far been
characterized in detail. It has been shown, for
instance, that the gene BvFLC is not a key control gene for flowering or
vernalization in Beta vulgaris (Reeves, P A, He Y, Schmitz R J,
Amasino R M, Panella, L W, Richards C M (2007), Evolutionary
conservation of the FLOWERING LOCUS C-mediated vernalization
response: evidence from the sugar beet (Beta vulgaris), Genetics
176 (1): 295-307; Chia, T. Y. P., Mueller, A., Young, C., and
Mutasa-Goettgens, E. S. (2008), Sugar beet contains a large
CONSTANS-LIKE gene family including a CO homolog that is
independent of the early-bolting (B) gene locus, J Exp Bot 59 (10):
2735-2748). In 2011 Kraus et al. showed that BvVil1 seems to have a
more crucial role in controlling bolting activity after
vernalization (WO 2011/032537).
[0007] Bolting and flowering of Beta vulgaris plants is
undesirable, since in the case of Beta vulgaris it is not the seeds
or fruits, but rather the underground part of the plant, the
storage root, that is used, and the energy stored in the root would
be consumed during the bolting and flowering of the plant.
Moreover, in some plants, which are called "bolters", an unwanted
emergence of shoots occurs in the first year of growing, which is
very disadvantageous for harvesting and processing.
[0008] It is thus the object of the present invention to provide
means to make it possible to inhibit bolting and/or flowering of
Beta vulgaris plants, and even to completely prevent this.
[0009] According to the invention the problem is solved by means of
an isolated nucleic acid for the inhibition of bolting and
flowering of a Beta vulgaris plant, wherein the nucleic acid
comprises at least one nucleotide sequence which a) exhibits a
sequence or partial sequence of SEQ ID NO: 1 or 2, or b) is
complementary to a sequence or partial sequence of a), or c)
exhibits in the antisense direction a sequence or partial sequence
of a) or b), or d) is a homolog to a sequence or partial sequence
of a), or e) is at least 80% or 85%, preferably at least 90%, 95%, 96%,
97%, 98% or 99%, or more preferably at least 99.5%, 99.6%, 99.7%,
99.8% or 99.9% identical to a sequence or partial sequence of a),
or f) encodes a protein or a part of the protein with the amino
acid sequence of SEQ ID NO: 5, or g) encodes a protein with an
amino acid sequence of Beta vulgaris which is a homolog to the
sequence of f), or h) hybridizes under stringent conditions with a
sequence or partial sequence of a), b) or c) and/or at least one
nucleotide sequence which A) exhibits a sequence or partial
sequence of SEQ ID NO: 3 or 4, or B) is complementary to a sequence
or partial sequence of A), or C) exhibits in the antisense
direction a sequence or partial sequence of A) or B), or D) is a
homolog to a sequence or partial sequence of A), or E) is at least 80%
or 85%, preferably at least 90%, 95%, 96%, 97%, 98% or 99%, or more
preferably at least 99.5%, 99.6%, 99.7%, 99.8% or 99.9% identical
to a sequence or partial sequence of A), or F) encodes a protein or
a part of the protein with the amino acid sequence of SEQ ID NO: 6,
or G) encodes a protein with an amino acid sequence of Beta
vulgaris which is a homolog to the sequence of F), or H) hybridizes
under stringent conditions with a sequence or partial sequence of
A), B) or C).
[0010] The inventive nucleic acid can be used, for example by the
RNA interference (RNAi) approach or micro-RNA (miRNA) interference
approach (Fire, A, Xu, S, Montgomery, M, Kostas, S, Driver, S,
Mello, C. (1998). Potent and specific genetic interference by
double-stranded RNA in Caenorhabditis elegans, Nature 391 (6669):
806-811) to inhibit bolting and flowering of Beta vulgaris, and in
particular, if possible, to completely prevent bolting and
flowering, for example by inhibiting genes that encode flowering
inducers such as FT, CO, or VIN3.
[0011] The nucleic acid is characterized especially by the fact
that transgenic or non-transgenic plants, in particular Beta
vulgaris plants, with special characteristics can be produced with
it. They can advantageously be used, for example, for the
following purposes or with the following benefits:
[0012] Production of non-shoot emergent, non-flowering Beta
vulgaris plants
[0013] Production of a Beta vulgaris plant as winter beet
[0014] Production of a Beta vulgaris plant as spring beet
[0015] Increasing the biomass of the Beta vulgaris plant
[0016] Increasing the sugar yield
[0017] Avoiding Beta vulgaris bolters
[0018] Extension of the Beta vulgaris harvesting campaign
[0019] Avoidance of losses in Beta vulgaris storage material
[0020] Utilization of the higher humidity in the fall
[0021] Covering of soil and use of the stored nitrogen
[0022] Protection for beneficial insects in the field
[0023] Beta vulgaris is a biennial plant. After completion of the
winter, and the vernalization resulting therefrom, Beta vulgaris
usually bolts and flowers in the second year. By means of the
inventive nucleic acid, for example, a sequence shown in one of SEQ
ID NOs: 19-23 or another novel sequence or partial sequence
inserted using an RNAi or microRNA-approach, genes can be inhibited
and the effects of vernalization can be inhibited or completely
prevented. Mechanisms and methods for inhibiting or switching off
genes are known to the person of ordinary skill in the art, for
example, under the term "gene silencing", and include, but are not
limited to, the RNAi or micro (mi) RNA processes already
mentioned. In an RNAi approach,
for example, the sequences of SEQ ID NO: 19 to SEQ ID NO: 23 can,
by molecular biology techniques known to the person skilled in the
art, be introduced into a Beta vulgaris cell in the antisense
orientation and expressed there under the control of a suitable
promoter.
[0024] In accordance with the present invention, the bolting and
flowering of the plant can be completely eliminated. Seed of Beta
vulgaris can be sown earlier, which ultimately leads to a longer
growing season and thus leads to a higher biomass and a higher
sugar yield. In combination with cold tolerance, sugar beets, for
example, can be grown as so-called winter beets. If sugar beets
are sown in August, they can already be harvested in the following
spring as spring beets. This allows the farmer an
additional crop rotation. By using the nucleic acid according to
the invention, even in the case of prolonged cold spells on the
field after sowing, there is no longer increased formation of
bolters. Even the normal sugar beet bolters previously observed
without prolonged cold spells can be prevented or at least
significantly reduced. Using the present invention, not only can
bolting and the subsequent flowering of sugar
beet after an initial vernalization, i.e. in the second year,
be inhibited or prevented, but the sugar beet can also be subjected
to further cold periods without vernalization effects being observed.
[0025] Sugar beet is usually cultivated from April to
October/November. Since not all of the harvested sugar beets can be
processed at the same time, they must be stored or placed in
intermediate storage. During storage, for example in piles, large losses in
storage substance (sucrose losses) occur as a result of cleavage
of sucrose into glucose and fructose. By means of the inventive
nucleic acid, particularly when used in an RNAi approach, the
sowing and harvest dates can be varied so that the total harvest
(campaign) can be extended without loss of harvest. It can allow
more sugar beets to be processed for a prolonged period with less
loss of storage material.
[0026] The term "Beta vulgaris" or "Beta vulgaris plant" is
understood to refer to a plant of the species Beta vulgaris, e.g.
Beta vulgaris ssp. vulgaris var. altissima (sugar beet in the narrow
sense), Beta vulgaris ssp. maritima (sea beet), Beta vulgaris ssp.
vulgaris var. vulgaris (Mangold beet), Beta vulgaris ssp. vulgaris
var. conditiva (red beetroot/beet), Beta vulgaris ssp. vulgaris
var. crassa/alba (fodder beet).
[0027] The term "plant" according to the present invention includes
whole plants or parts of such a whole plant. Whole plants
preferably are seed plants, or a crop. "Parts of a plant" are e.g.
shoot vegetative organs/structures, e.g., leaves, stems and tubers;
roots, flowers and floral organs/structures, e.g. bracts, sepals,
petals, stamens, carpels, anthers and ovules; seed, including
embryo, endosperm, and seed coat; fruit and the mature ovary; plant
tissue, e.g. vascular tissue, ground tissue, and the like; and
cells, e.g. guard cells, egg cells, pollen, trichomes and the like;
and progeny of the same.
[0028] An "isolated nucleic acid" is understood to be a nucleic
acid extracted from its natural or original environment. The term
also includes a synthetic manufactured nucleic acid.
[0029] An "inhibition of bolting and flowering" of a Beta vulgaris
plant refers to a reduction in the proportion of bolting and
possibly flower forming Beta vulgaris plants in comparison to a
non-inventively modified Beta vulgaris plant of the same subspecies
or variety in a comparable stage of development, particularly in
the second year after passing through a corresponding cold period,
i.e. after vernalization. In particular, the term encompasses a
reduction of proportion of bolters to not more than 80%, preferably
not more than 70%, 60%, 50%, 40%, 30%, 20% or 10%, more preferably
not more than 5%, 4%, 3%, 2%, 1%, 0.5%, 0.4%, 0.3%, 0.2% or 0.1%
of the percentage of bolting compared to control plants not
according to the invention. "Control plants" are preferably plants
of the same variety, but they are not modified according to the
present invention, and exhibit for example a proportion of bolters
of at most 0.01%. The term "suppression" or "complete suppression"
of bolting and flowering is understood to mean inhibition of at
least 99%, preferably at least 99.5%, more preferably at least
99.8%, or at least 99.9%, that is, a reduction of the proportion of
bolters to not more than 1%, not more than 0.5%, not more than 0.2%
or not more than 0.1%, especially in the second year after
vernalization, compared to a non-inventively modified Beta vulgaris
plant, for example, a bolting percentage of at most 0.01%. The term
"inhibition" or "suppression" of bolting and flowering primarily covers
the inhibition/suppression of bolting, regardless of whether
the plant subsequently flowers or not.
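The definition above compares the proportion of bolters in modified plants with that in control plants. A minimal sketch of that comparison follows; the function names and the plant counts are hypothetical, and the two default cut-offs (80% for "inhibition", 1% for "suppression") are simply the broadest values quoted in the paragraph.

```python
def relative_bolting(bolters_modified, n_modified, bolters_control, n_control):
    """Return the bolter proportion of modified plants relative to controls.

    A value of 0.05 means the modified plants bolt at 5% of the control rate.
    """
    if n_modified <= 0 or n_control <= 0:
        raise ValueError("plant counts must be positive")
    control_rate = bolters_control / n_control
    if control_rate == 0:
        raise ValueError("control plants did not bolt; the ratio is undefined")
    return (bolters_modified / n_modified) / control_rate


def classify_bolting(relative, inhibition_max=0.80, suppression_max=0.01):
    """Map a relative bolting value onto the terms defined in the text."""
    if relative <= suppression_max:
        return "suppression"
    if relative <= inhibition_max:
        return "inhibition"
    return "no inhibition"


if __name__ == "__main__":
    # Hypothetical counts for illustration only, not data from the application.
    ratio = relative_bolting(5, 1000, 950, 1000)
    print(f"{ratio:.4f} -> {classify_bolting(ratio)}")
```

The stricter preferred thresholds listed in the paragraph can be passed through `inhibition_max` and `suppression_max`.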
[0030] The term "transgenic", "transgene" or "heterologous" as used
herein means genetically modified. The term also covers the case
in which a species-specific nucleic acid is introduced into a plant
cell in a form, arrangement or quantity in which the nucleic acid
does not occur naturally in the cell. If the gene, coding sequence
or regulatory element is one normally found in the cell, it
is called "autologous" or "endogenous". A "heterologous" gene,
coding sequence or regulatory element may also be autologous to the
cell but is, however, arranged in an order and/or orientation or in
a genomic position or environment not normally found or occurring
in the cell into which it is introduced.
[0031] The term "homology" refers to identities or similarities in
the nucleotide sequence of two nucleic acid molecules or the amino
acid sequence of two proteins or peptides. The presence of homology
between two nucleic acids or proteins can be detected by comparing
each position in one sequence with the equivalent position in the
other sequence and determining whether identical or similar
residues are present here. Two compared sequences are homologous if
there is a particular minimum level of identical or similar
nucleotides. "Identical" means that when comparing two sequences at
equivalent positions there is the same nucleotide or the same amino
acid. It may be necessary to take into account gaps in sequence to
achieve the best possible alignment comparison of sequences.
Similar nucleotides/amino acids are non-identical nucleotides/amino
acids with the same or equivalent chemical and physical properties.
Exchanging a nucleotide (an amino acid) with a different nucleotide
(another amino acid) with the same or equivalent physical and
chemical properties is called a "conservative exchange." Examples
of chemical and physical properties of an amino acid include, for
example, the hydrophobicity or charge. In the context of nucleic
acids, a nucleotide exchange is also regarded as conservative or
similar when a nucleotide in a codon of a coding sequence is
replaced by another such that, e.g. owing to the
degeneracy of the genetic code, the same amino acid or a similar
amino acid is encoded as by the equivalent codon in the compared
sequence. The person skilled in the art knows which nucleotide or
amino acid exchange is a conservative exchange. To determine the
level of similarity or identity between two nucleic acids, a
minimum length of 60 nucleotides or base pairs is assumed,
preferably a minimum length of 70, 80, 90, 100, 110, 120, 140, 160,
180, 200, 250, 300, 350 or 400 nucleotides or base pairs, more
preferably the full length of the compared nucleic acids, and in
the case of proteins/peptides a minimum length of 20 amino acids is
assumed, preferably a minimum length of 25, 30, 35, 40, 45, 50, 60,
80, 100, 150, 200, 250 or 300 amino acids, and more preferably the
full length of the compared amino acid sequences. The level of
similarity ("positives") or identity of two sequences can be
determined using, for example, the computer program BLAST (Altschul
S. F. et al (1990), Basic Local Alignment Search Tool, J. Mol Biol
215: 403-410; see e.g. http://www.ncbi.nlm.nih.gov/BLAST/) with
standard parameters. The determination of homology depends on the
length of the compared sequences. In the context of the present
invention, two nucleic acid sequences whose
length is at least 100 nucleotides are regarded as homologous if at
least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at
least 95%, at least 97%, at least 98%, or at least 99% of nucleotides are
identical and/or similar ("identities" or "positives" according to
BLAST), and preferably are identical. For a sequence
length of 50-99 nucleotides, homology between sequences is
assumed if there is identity or similarity of at least 80%,
preferably at least 85%, 86%, 87%, 88% or 89%, and for a sequence
length of 15-49 nucleotides if there is identity or similarity of at
least 90%, preferably at least 95%, 96%, 97%, 98% or 99%. In the
case of proteins, homology is assumed if, using the computer
program BLAST with standard parameters and the BLOSUM62
substitution matrix (Henikoff, S., and Henikoff, J. Amino acid
substitution matrices from protein blocks. Proc. Natl. Acad. Sci.
USA 89: 10915-10919, 1992), an identity ("identities") and/or
similarity ("positives"), preferably identity, of at least 25%, at
least 26%, at least 27%, at least 28%, at least 29%, at least 30%,
preferably at least 35%, at least 40%, at least 45%, at least 50%,
at least 55%, at least 60%, at least 65%, at least 70%, at least
75%, at least 80%, at least 85%, at least 90%, at least 95%, at
least 97%, at least 98%, or at least 99% is shown. Preferably the
entire length of the protein/peptide that is compared with
another protein, e.g. the length of 260 amino acids in the case of
SEQ ID NO: 5 or the length of 245 amino acids in the case of SEQ ID
NO: 6, is considered in the determination. The person skilled in the
art is able with his expert knowledge to use readily available
BLAST programs (e.g. BLASTn, BLASTp, BLASTx, tBLASTn or tBLASTx) to
determine the homology in question. In addition, there are other
programs known to the expert which can be used to
determine the homology of two or more sequences to be compared or
partial sequences. Such programs include those that can be found,
for example on the website of the European Bioinformatics Institute
(EMBL) (see, e.g. www.ebi.ac.uk/Tools/similarity.html).
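The length-dependent nucleotide thresholds described above can be illustrated with a short sketch. It is not a substitute for BLAST: it assumes the two sequences have already been aligned to equal length (gaps written as "-"), counts identities only rather than "positives", and uses the alignment length both as the denominator and for selecting the threshold. All function names are hypothetical.

```python
def percent_identity(aligned_a, aligned_b):
    """Percent identical positions in two pre-aligned, equal-length sequences.

    Gap columns ('-') are counted as mismatches.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to the same length")
    if not aligned_a:
        raise ValueError("sequences must not be empty")
    matches = sum(
        1
        for x, y in zip(aligned_a.upper(), aligned_b.upper())
        if x == y and x != "-"
    )
    return 100.0 * matches / len(aligned_a)


def minimum_identity_for_length(length):
    """Broadest nucleotide threshold quoted in the text for a given length."""
    if length >= 100:
        return 70.0
    if length >= 50:
        return 80.0
    if length >= 15:
        return 90.0
    raise ValueError("the text defines no threshold below 15 nucleotides")


def is_homologous(aligned_a, aligned_b):
    """Apply the length-dependent threshold to a pre-aligned sequence pair."""
    identity = percent_identity(aligned_a, aligned_b)
    return identity >= minimum_identity_for_length(len(aligned_a))


if __name__ == "__main__":
    # Toy 20-nucleotide sequences, not taken from the sequence listing.
    reference = "ACGTACGTACGTACGTACGT"
    variant = "TCGTACGTACGTACGTACGT"  # one mismatch at the first position
    print(percent_identity(reference, variant), is_homologous(reference, variant))
```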
[0032] The term "hybridizing" or "hybridization" means a process in
which a single-stranded nucleic acid molecule attaches itself to a
complementary nucleic acid strand, i.e. forms base pairs
with it. Standard procedures for hybridization are described, for
example, in Sambrook et al. (Molecular Cloning. A Laboratory
Manual, Cold Spring Harbor Laboratory Press, 3rd edition 2001).
Preferably this will be understood to mean that at least 50%, more
preferably at least 55%, 60%, 65%, 70%, 75%, 80% or 85%, more
preferably 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% of
the bases of the nucleic acid strand form base pairs with the
complementary nucleic acid strand. The possibility of such binding
depends on the stringency of the hybridization conditions. The term
"stringency" refers to hybridization conditions. Stringency is high
when base pairing is made more difficult and low when
base pairing is facilitated. The stringency of hybridization
conditions depends for example on the salt concentration or ionic
strength and temperature. Generally, the stringency can be
increased by increasing the temperature and/or decreasing the salt
concentration.
"Stringent hybridization conditions" are defined as conditions in
which hybridization occurs predominantly only between homologous
nucleic acid molecules. The term "hybridization conditions" refers
not only to the conditions prevailing during the actual binding of
the nucleic acids, but also to the conditions prevailing in the
subsequent washing steps. Stringent hybridization conditions are, for
example, conditions under which predominantly only those nucleic
acid molecules having at least 70%, preferably at least 75%, at
least 80%, at least 85%, at least 90% or at least 95% sequence
identity hybridize. Less stringent hybridization conditions
include: hybridization in 4×SSC at 37 °C, followed by
repeated washing in 1×SSC at room temperature. Stringent
hybridization conditions include: hybridization in 4×SSC at
65 °C, followed by repeated washing in 0.1×SSC at
65 °C for a total of about 1 hour.
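To make the salt side of these conditions concrete, the sketch below converts the SSC dilutions quoted above into approximate salt concentrations. The 1×SSC composition used here (150 mM sodium chloride, 15 mM trisodium citrate) is the customary laboratory definition and is an assumption, since the text does not spell it out; the dictionary layout and function name are hypothetical.

```python
# Customary 1x SSC composition in mmol/L (assumption; not stated in the text).
SSC_1X_MM = {"NaCl": 150.0, "trisodium citrate": 15.0}

# Conditions as quoted in the text. None marks "room temperature",
# for which the text gives no numeric value.
CONDITIONS = {
    "less stringent": {"hyb_ssc": 4.0, "hyb_temp_c": 37.0,
                       "wash_ssc": 1.0, "wash_temp_c": None},
    "stringent": {"hyb_ssc": 4.0, "hyb_temp_c": 65.0,
                  "wash_ssc": 0.1, "wash_temp_c": 65.0},
}


def ssc_composition(fold):
    """Return the salt concentrations (mmol/L) of an n-fold SSC solution."""
    if fold <= 0:
        raise ValueError("SSC concentration factor must be positive")
    return {salt: mm * fold for salt, mm in SSC_1X_MM.items()}


if __name__ == "__main__":
    for name, cond in CONDITIONS.items():
        wash = ssc_composition(cond["wash_ssc"])
        rounded = {salt: round(mm, 1) for salt, mm in wash.items()}
        print(f"{name} wash: {rounded}")
```

Under this assumption the stringent wash contains one tenth of the salt of the less stringent wash and is carried out at a higher temperature, both of which raise stringency as described above.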
[0033] The term "complementary" refers to the ability of purine and
pyrimidine nucleotides to form base pairs with each other via
bridging hydrogen bonds. Complementary base pairs are, for example,
guanine and cytosine, adenine and thymine and adenine and uracil. A
complementary nucleic acid strand is accordingly a nucleic acid
strand that can, by pairing with complementary bases of another
nucleic acid strand, form a double strand.
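The base-pairing rule stated above can be written as a small check. This is a sketch under simplifying assumptions: both strands are written 5' to 3', they pair antiparallel over their full length without gaps, and only the three pairs named in the text are allowed. The function name is hypothetical.

```python
# Base pairs named in the text: G-C, A-T (DNA) and A-U (RNA).
BASE_PAIRS = {
    ("G", "C"), ("C", "G"),
    ("A", "T"), ("T", "A"),
    ("A", "U"), ("U", "A"),
}


def forms_double_strand(strand, other):
    """Return True if `other` is fully complementary to `strand`.

    Both strands are given 5'->3'; pairing is antiparallel, so `other`
    is read in reverse while walking along `strand`.
    """
    if not strand or len(strand) != len(other):
        return False
    return all(
        (x, y) in BASE_PAIRS
        for x, y in zip(strand.upper(), reversed(other.upper()))
    )


if __name__ == "__main__":
    print(forms_double_strand("ATGC", "GCAT"))  # DNA/DNA pairing
    print(forms_double_strand("ATGC", "GCAU"))  # DNA/RNA pairing
```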
[0034] As used herein, the term "homozygous" means a genetic
condition existing when two alleles reside at a specific locus, but
are positioned individually on corresponding pairs of homologous
chromosomes in the cell and the two alleles are identical or at
least identical for the one or more mutations.
[0035] A "fragment" or a "partial sequence" of a nucleic acid is
here understood to be a contiguous section of the nucleic acid,
i.e. a sequence segment of consecutive nucleotides of the nucleic
acid.
[0036] Fragments can e.g. be used advantageously in a miRNA or RNAi
approach, wherein the sequence can be used, for example, in
anti-sense ("antisense") direction. "Anti-sense direction" or
"antisense orientation" of a nucleic acid sequence, e.g. a DNA
sequence, means here, for example, that a transcription of the DNA
sequence results in an mRNA whose nucleotide sequence is
complementary to a natural (endogenous) mRNA, so that its
translation is hindered or prevented by the attachment of the
complementary RNA. An "anti-sense RNA" or "antisense RNA" is
understood to mean an RNA that is complementary to a particular
mRNA or other specific RNA. "Anti-sense direction" or "antisense
orientation" of an mRNA sequence, therefore, means that the mRNA
has a sequence that is complementary to an mRNA sequence, so that
its translation may be hindered or prevented by attachment. Partial
sequences, which may be advantageously used in the context of the
present invention, for example, in antisense orientation, are for
example nucleic acids having a sequence shown in SEQ ID NO: 10, 11,
12 or 13 which is a segment of the nucleic acid according to SEQ ID
NO: 1 or 2, and/or shown in SEQ ID NO: 14, 15, 16 or 17 which is a
segment of the nucleic acid according to SEQ ID NO: 3 or 4. However,
any other nucleic acids with sequences or partial sequences of SEQ
ID NOs: 1 or 2 and/or of SEQ ID NO: 3 or 4 can be used, for
example, in the anti-sense direction. In addition, two or more
partial sequences may be fused and advantageously used in the
context of the present invention, for example, in antisense
orientation. Examples of fused partial sequences are nucleic acids
having a sequence shown in SEQ ID NO: 19, 20, 21 or 22; each of
these sequences contains a segment of the nucleic acid according to
SEQ ID NO: 1 or 2 as well as a segment of the nucleic acid
according to SEQ ID NO: 3 or 4. In another embodiment two or more
partial sequences may be fused and advantageously used in the
context of the present invention, for example, in antisense
orientation, whereby at least two fused partial sequences are
derived from the same nucleic acid according to SEQ ID NO: 1, 2, 3
or 4. The fused partial sequences can be linked by a spacer
sequence comprising at least 1, at least 2, at least 3, at least 4,
at least 5, at least 6, at least 7, at least 8, at least 9, at
least 10, at least 12, at least 14, at least 16, at least 18, at
least 20, at least 25 or at least 30 nucleotides.
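How partial sequences might be cut out, fused via a spacer and placed in antisense orientation can be sketched as follows. The sequences are short toy strings, not entries from the sequence listing, and the helper names are hypothetical. The sketch also assumes that the whole fused construct, spacer included, is reverse-complemented, which is only one possible reading of "antisense orientation" for a fused construct.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def partial_sequence(seq, start, length):
    """Return a contiguous segment of `seq` (0-based start)."""
    if start < 0 or length <= 0 or start + length > len(seq):
        raise ValueError("segment lies outside the sequence")
    return seq[start:start + length]


def antisense(seq):
    """Return the reverse complement of an upper-case DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def fuse_antisense(fragments, spacer=""):
    """Join fragments with a spacer and return the construct in antisense."""
    if not fragments:
        raise ValueError("at least one fragment is required")
    return antisense(spacer.join(fragments))


if __name__ == "__main__":
    # Toy stand-ins for two different genes (hypothetical sequences).
    gene_one = "AAACCCGGGTTT"
    gene_two = "ATATATGCGC"
    fragment_one = partial_sequence(gene_one, 3, 6)
    fragment_two = partial_sequence(gene_two, 6, 4)
    print(fuse_antisense([fragment_one, fragment_two], spacer="TT"))
```

Real fragments would be far longer; the following paragraph gives the preferred minimum lengths.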
[0037] The partial sequence preferably comprises a nucleic acid
with at least 25, preferably at least 30, 35, 40, 45, 50, 60, 70,
80, 90, or at least 100 consecutive nucleotides, more preferably at
least 150, 184, 200, 212, 250, 300, 350, 400 or 450 consecutive
nucleotides. A part of a protein (see, e.g., letter f) or F)
above) preferably comprises at least 5, preferably at least 10, 15,
20, 25, 30, 40 or 50, more preferably at least 60, 61, 70, 80, 90,
or at least 100 consecutive amino acids of SEQ ID NO: 5 or 6. The
sequence segment of SEQ ID NO: 5 or 6 (see, e.g., letter f) or F)
above) preferably comprises at least 50, 60, 61, 70, 80, 87 or 90,
more preferably at least 100, 105, 120, 150, 200 or 250 consecutive
amino acids of SEQ ID NO: 5 or 6. The necessary or useful length of
the partial sequence of the nucleic acid or protein or the sequence
segment can be selected by the person of ordinary skill in the art
with the aid of his general technical skills and, where
appropriate, by carrying out routine tests of the approach and the
intended effect, without this requiring an inventive step.
[0038] The nucleic acid is preferably at least 85%, preferably at
least 90%, 95%, 96%, 97%, 98% or 99%, more preferably at least
99.5%, 99.6%, 99.7%, 99.8% or 99.9% identical to a sequence or
partial sequence of one of SEQ ID NO: 1-4.
[0039] The nucleic acid according to the invention may include one
of the sequences of SEQ ID NO: 10-17 or 19-22, preferably in the
antisense orientation.
[0040] In a preferred embodiment the inventive nucleic acid for the
inhibition of bolting and flowering of a Beta vulgaris plant as
described above comprises a further nucleic acid comprising a
nucleotide sequence which (i) exhibits a sequence or partial
sequence of SEQ ID NO: 7 or 8, or (ii) is complementary to a
sequence or partial sequence of (i), or (iii) exhibits in the
antisense direction a sequence or partial sequence of (i) or (ii),
or (iv) is a homolog to a sequence or partial sequence of (i), or
(v) is at least 80% or 85%, preferably at least 90%, 95%, 96%, 97%,
98% or 99%, or more preferably at least 99.5%, 99.6%, 99.7%, 99.8%
or 99.9% identical to a sequence or partial sequence of (i), or
(vi) encodes a protein or a part of the protein with the amino acid
sequence of SEQ ID NO: 9, or (vii) encodes a protein with an amino
acid sequence of Beta vulgaris which is a homolog to the sequence
of (vi), or (viii) hybridizes under stringent conditions with a
sequence or partial sequence of (i), (ii) or (iii).
[0041] In a further aspect, the present invention concerns the use
of one or more of the inventive nucleic acids for the inhibition of
bolting and flowering of a Beta vulgaris plant. As already
indicated above, the nucleic acid can be introduced into a Beta
vulgaris plant, for example, in antisense orientation or in the form
of a hairpin construct, thereby causing an inhibition of the genes
responsible for bolting and flowering. Methods which are suitable
for introducing a nucleic acid into a Beta vulgaris cell are known
to the skilled person, and include, for example,
Agrobacterium-mediated transformation (Lindsey, K., and P. Gallois.
"Transformation of sugarbeet (Beta vulgaris) by Agrobacterium
tumefaciens." Journal of experimental botany 41.5 (1990): 529-536).
The introduction of a nucleic acid in antisense orientation into a
plant is only one of the known processes for the inhibition or
suppression of gene activity (gene silencing).
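As an illustration of what "antisense orientation" means at the sequence level, the sketch below computes the reverse complement of a DNA fragment. The function name and the toy fragment are hypothetical and are not taken from the sequence listing; only unambiguous bases (A, C, G, T) are handled.

```python
# Translation table for complementing unambiguous DNA bases
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement, i.e. the fragment read in antisense orientation."""
    return seq.translate(_COMPLEMENT)[::-1]


sense_fragment = "ATGGCC"  # hypothetical toy fragment
print(reverse_complement(sense_fragment))  # prints GGCCAT
```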
[0042] The inventive nucleic acids can also be used advantageously
in the context of other procedures or mechanisms that can cause an
inhibition or suppression of bolting/flowering, e.g. suppression by
co-expression, or as template for the generation of guide RNA
(gRNA) in a CRISPR/Cas system.
[0043] Furthermore, the inventive nucleic acid may also be used as
a probe to identify other factors, genes or gene products, which
can be used to inhibit or suppress flowering and bolting of Beta
vulgaris plants, or may also be used as molecular marker to detect
or to identify in a mutagenized Beta vulgaris plant or a part
thereof one or more mutations in an endogenous DNA sequence or a
regulatory sequence of the endogenous DNA sequence, wherein the
endogenous DNA sequence has a nucleic acid sequence identical to a
sequence which (a) exhibits a sequence or partial sequence of SEQ
ID NO: 1, or (b) is complementary to a sequence or partial sequence
of (a), or (c) exhibits in the antisense direction a sequence or
partial sequence of (a) or (b), or (d) is a homolog to a sequence
or partial sequence of (a), or (e) is at least 80% or 85%, preferably
at least 90%, 95%, 96%, 97%, 98% or 99%, or more preferably at
least 99.5%, 99.6%, 99.7%, 99.8% or 99.9% identical to a sequence
or partial sequence of (a), or (f) encodes a protein or a part of
the protein with the amino acid sequence of SEQ ID NO: 5, or (g)
encodes a protein with an amino acid sequence of Beta vulgaris
which is a homolog to the sequence of (f), or (h) hybridizes under
stringent conditions with a sequence or partial sequence of (a), (b)
or (c), and/or (A) exhibits a sequence or partial sequence of SEQ ID
NO: 3, or (B) is complementary to a sequence or partial sequence of
(A), or (C) exhibits in the antisense direction a sequence or
partial sequence of (A) or (B), or (D) is a homolog to a sequence
or partial sequence of (A), or (E) is at least 80% or 85%, preferably
at least 90%, 95%, 96%, 97%, 98% or 99%, or more preferably at
least 99.5%, 99.6%, 99.7%, 99.8% or 99.9% identical to a sequence
or partial sequence of (A), or (F) encodes a protein or a part of
the protein with the amino acid sequence of SEQ ID NO: 6, or (G)
encodes a protein with an amino acid sequence of Beta vulgaris
which is a homolog to the sequence of (F), or (H) hybridizes under
stringent conditions with a sequence or partial sequence of (A),
(B) or (C). Preferably, the one or more mutations cause a reduced
transcriptional or expressional rate or a reduced transcriptional
or expressional level of the endogenous DNA sequence in the
mutagenized plant compared to a non-mutagenized wildtype plant, or
the mutation causes a reduction of the activity or stability of the
protein or polypeptide encoded by the endogenous DNA sequence
compared to a non-mutagenized wildtype plant. More preferably, at
least one of the one or more mutations is selected from the group
consisting of mutations listed in Table 1.
[0044] In a particular embodiment of the present invention the use
of the one or more nucleic acids as described above includes in
addition to the one or more nucleic acids a further nucleic acid
comprising a nucleotide sequence which (i) exhibits a sequence or
partial sequence of SEQ ID NO: 7 or 8, or (ii) is complementary to
a sequence or partial sequence of (i), or (iii) exhibits in the
antisense direction a sequence or partial sequence of (i) or (ii),
or (iv) is a homolog to a sequence or partial sequence of (i), or
(v) is at least 80% or 85%, preferably at least 90%, 95%, 96%, 97%,
98% or 99%, or more preferably at least 99.5%, 99.6%, 99.7%, 99.8%
or 99.9% identical to a sequence or partial sequence of (i), or
(vi) encodes a protein or a part of the protein with the amino acid
sequence of SEQ ID NO: 9, or (vii) encodes a protein with an amino
acid sequence of Beta vulgaris which is a homolog to the sequence
of (vi), or (viii) hybridizes under stringent conditions with a
sequence or partial sequence of (i), (ii) or (iii). These nucleic
acids may be used for the inhibition of bolting and flowering of a
Beta vulgaris plant.
[0045] In a further aspect the present invention concerns a protein
with an amino acid sequence of SEQ ID NO: 5 or 6, or a protein
having an amino acid sequence that contains a sequence segment of
SEQ ID NO: 5 or 6, that comprises preferably at least 50, 60, 61,
70, 80 or 90, at least 100, 120, 150, 200 or 250 consecutive amino
acids of SEQ ID NO: 5 or 6, or a protein from Beta vulgaris that is
homologous thereto. The protein or any part thereof, or the
corresponding amino acid sequences, may for example be used as
a probe at the amino acid level to identify other factors, genes or
gene products which can be used to inhibit and/or suppress the
flowering and bolting of a Beta vulgaris plant.
[0046] In another aspect the present invention relates to a method
for producing a transgenic Beta vulgaris plant comprising the steps
of (a) transforming a Beta vulgaris cell with one or more inventive
nucleic acids and (b) regenerating a Beta vulgaris plant from the
transformed Beta vulgaris cell. The transformation of a Beta vulgaris
cell can be carried out, for example, using known vectors, e.g. a
Ti-plasmid, and is known to the skilled person (Lindsey and
Gallois, 1990). The inventive nucleic acids are advantageously
placed under the control of a suitable promoter in such a
vector. In a particular embodiment of the present invention the
method for producing a transgenic Beta vulgaris plant, where the
bolting and flowering is inhibited after vernalization, comprises
the following steps of: (I) Transforming a Beta vulgaris cell with
a first nucleic acid as transgene and a second nucleic acid as
transgene, wherein the Beta vulgaris cell is transformed with one
construct comprising the first and the second nucleic acid or with
a construct comprising the first nucleic acid and another construct
comprising the second nucleic acid; and (II) regenerating a Beta
vulgaris plant from the transformed Beta vulgaris cell. Thereby,
the first nucleic acid as transgene comprises a nucleotide sequence
which a) exhibits a sequence or partial sequence of SEQ ID NO: 1 or
2, or b) is complementary to a sequence or partial sequence of a),
or c) exhibits in the antisense direction a sequence or partial
sequence of a) or b), or d) is a homolog to a sequence or partial
sequence of a), or e) is at least 80% or 85%, preferably at least 90%,
95%, 96%, 97%, 98% or 99%, or more preferably at least 99.5%,
99.6%, 99.7%, 99.8% or 99.9% identical to a sequence or partial
sequence of a), or f) encodes a protein or a part of the protein
with the amino acid sequence of SEQ ID NO: 5, or g) encodes a
protein with an amino acid sequence of Beta vulgaris which is a
homolog to the sequence of f), or h) hybridizes under stringent
conditions with a sequence or partial sequence of a), b) or c); and
the second nucleic acid as transgene comprises a nucleotide
sequence which A) exhibits a sequence or partial sequence of SEQ ID
NO: 3 or 4, or B) is complementary to a sequence or partial
sequence of A), or C) exhibits in the antisense direction a
sequence or partial sequence of A) or B), or D) is a homolog to a
sequence or partial sequence of A), or E) is at least 80% or 85%,
preferably at least 90%, 95%, 96%, 97%, 98% or 99%, or more
preferably at least 99.5%, 99.6%, 99.7%, 99.8% or 99.9% identical
to a sequence or partial sequence of A), or F) encodes a protein or
a part of the protein with the amino acid sequence of SEQ ID NO: 6,
or G) encodes a protein with an amino acid sequence of Beta
vulgaris which is a homolog to the sequence of F), or H) hybridizes
under stringent conditions with a sequence or partial sequence of
A), B) or C). In an alternative particular embodiment of the
present invention the method for producing a transgenic Beta
vulgaris plant, where the bolting and flowering is inhibited after
vernalization, comprises the following steps of: (I) Transforming a
first Beta vulgaris cell with a first nucleic acid as transgene as
defined above; (II) transforming a second Beta vulgaris cell with a
second nucleic acid as transgene as defined above; and (III)
regenerating a first Beta vulgaris plant from the transformed first
Beta vulgaris cell and a second Beta vulgaris plant from the
transformed second Beta vulgaris cell; and (IV) crossing the first
Beta vulgaris plant with the second Beta vulgaris plant and
selecting a progeny comprising the first nucleic acid and the
second nucleic acid as transgenes.
[0047] The invention also relates to a vector or a mobile genetic
element that includes one or more inventive nucleic acids. Vectors
and mobile genetic elements are known in the art and include, for
example, plasmids such as the Ti-plasmid. The vector or mobile
genetic element can advantageously contain control/regulatory
elements, e.g. a promoter, enhancer, intronic sequence, or
terminator.
[0048] Furthermore, the invention relates to a transgenic Beta
vulgaris plant including one or more inventive nucleic acids as
transgene, preferably under the control of a suitable promoter, and
which is inhibited in bolting and flowering, as well as seeds
and/or parts of a Beta vulgaris plant transformed with one or more
inventive nucleic acids.
[0049] In a further aspect the present invention relates to a
method for producing a Beta vulgaris plant, preferably a
non-transgenic Beta vulgaris plant, where the bolting and flowering
is inhibited after vernalization. In one embodiment the method for
producing a Beta vulgaris plant comprises the following steps: (I)
Mutagenizing one or more parts of a Beta vulgaris plant and
subsequently regenerating Beta vulgaris plants from the one or more
parts, or mutagenizing one or more Beta vulgaris plants, (II)
identifying a plant of (I) which exhibits one or more mutations in
a first endogenous DNA sequence and/or in a regulatory sequence
thereof and exhibits one or more mutations in a second endogenous
DNA sequence and/or in a regulatory sequence thereof, and
optionally (III) generating a Beta vulgaris plant in which the one
or more mutations in the first endogenous DNA sequence and the one
or more mutations in the second endogenous DNA sequence are
homozygous. The first endogenous DNA sequence has a nucleic acid
sequence identical to a sequence which (a) exhibits a sequence or
partial sequence of SEQ ID NO: 1, or (b) is complementary to a
sequence or partial sequence of (a), or (c) exhibits in the
antisense direction a sequence or partial sequence of (a) or (b),
or (d) is a homolog to a sequence or partial sequence of (a), or
(e) is at least 80% or 85%, preferably at least 90%, 95%, 96%, 97%,
98% or 99%, or more preferably at least 99.5%, 99.6%, 99.7%, 99.8%
or 99.9% identical to a sequence or partial sequence of (a), or (f)
encodes a protein or a part of the protein with the amino acid
sequence of SEQ ID NO: 5, or (g) encodes a protein with an amino
acid sequence of Beta vulgaris which is a homolog to the sequence
of (f), or (h) hybridizes under stringent conditions with a
sequence or partial sequence of (a), (b) or (c), and the second
endogenous DNA sequence has a nucleic acid sequence identical to a
sequence which (A) exhibits a sequence or partial sequence of SEQ
ID NO: 3, or (B) is complementary to a sequence or partial sequence
of (A), or (C) exhibits in the antisense direction a sequence or
partial sequence of (A) or (B), or (D) is a homolog to a sequence
or partial sequence of (A), or (E) is at least 80% or 85%, preferably
at least 90%, 95%, 96%, 97%, 98% or 99%, or more preferably at
least 99.5%, 99.6%, 99.7%, 99.8% or 99.9% identical to a sequence
or partial sequence of (A), or (F) encodes a protein or a part of
the protein with the amino acid sequence of SEQ ID NO: 6, or (G)
encodes a protein with an amino acid sequence of Beta vulgaris
which is a homolog to the sequence of (F), or (H) hybridizes under
stringent conditions with a sequence or partial sequence of (A),
(B) or (C). In another embodiment the method for producing a Beta
vulgaris plant comprises the following steps: (I) Mutagenizing one
or more parts of a Beta vulgaris plant and subsequently
regenerating one or more Beta vulgaris plants from the one or more
parts, or mutagenizing one or more Beta vulgaris plants; (II)
identifying a first plant of (I) which exhibits one or more
mutations in a first endogenous DNA sequence as defined above
and/or in a regulatory sequence thereof and a second plant of (I)
which exhibits one or more mutations in a second endogenous DNA
sequence as defined above and/or in a regulatory sequence thereof;
(III) crossing the first plant with the second plant and selecting
a progeny comprising the one or more mutations in the first
endogenous DNA sequence and/or in a regulatory sequence thereof and
the one or more mutations in the second endogenous DNA sequence
and/or in a regulatory sequence thereof; and optionally (IV)
generating from the progeny of (III) a Beta vulgaris plant in which
the one or more mutations in the first endogenous DNA sequence and
the one or more mutations in the second endogenous DNA sequence are
homozygous.
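The expected frequency of progeny that are homozygous for mutations in both endogenous DNA sequences can be estimated with simple Mendelian arithmetic. The sketch below rests on assumptions that are not stated in the text: a diploid plant, independent (unlinked) segregation of the two loci, and a cross between two parents that are each heterozygous at both loci (or selfing of one such plant, where self-fertile material is available). The function name and allele symbols are hypothetical.

```python
from fractions import Fraction
from itertools import product


def offspring_genotypes(parent1: str, parent2: str) -> dict:
    """Single-locus genotype probabilities for a cross.

    Each parent is given as a two-character string of alleles,
    'm' (mutant) or '+' (wild type); gametes are equally likely.
    """
    probs = {}
    weight = Fraction(1, len(parent1) * len(parent2))
    for g1, g2 in product(parent1, parent2):
        genotype = "".join(sorted(g1 + g2))
        probs[genotype] = probs.get(genotype, Fraction(0)) + weight
    return probs


# Both parents heterozygous at the first and at the second locus (assumption)
first_locus = offspring_genotypes("m+", "m+")
second_locus = offspring_genotypes("m+", "m+")

# Assumed independent segregation: probabilities multiply
double_homozygous = first_locus["mm"] * second_locus["mm"]
print(double_homozygous)  # prints 1/16
```

Under these assumptions about one in sixteen progeny would carry both mutations homozygously; linkage between the two loci would change this ratio.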
[0050] In a preferred embodiment the step of mutagenizing comprises
the steps of: i) Subjecting pollen or seeds of a Beta vulgaris
plant to a sufficient amount of the mutagen ethylmethane sulfonate
(EMS) or other mutagenic chemicals or mutagenic radiation to obtain
M1 plants, ii) optionally allowing sufficient production of M2
plants, and iii) isolating and analysing genomic DNA of M1 and/or
M2 plants.
[0051] Preferably, the one or more mutations cause a reduced
transcriptional or expressional rate or a reduced transcriptional
or expressional level of the endogenous DNA sequence in the
mutagenized plant compared to a non-mutagenized wildtype plant, or
the mutation causes a reduction of the activity or stability of the
protein or polypeptide encoded by the endogenous DNA sequence
compared to a non-mutagenized wildtype plant. More preferably, the
one or more mutations result in a loss of function, i.e. the
expression/transcription of the mutated DNA does not lead to the
synthesis of a functional protein (e.g. functional AP1 protein or
FUL protein, respectively).
[0052] The one or more mutations may cause an alteration of the
amino acid sequence of AP1 or FUL, in particular the one or more
mutations can be a point mutation resulting in at least one amino
acid exchange, the exchange of an amino acid coding codon to a
codon carrying the stop signal of translation (stop codon), or the
change of the start signal of translation (start codon). The
techniques of introducing such mutations via mutagenizing are
well-known to the person skilled in the art. In a preferred
embodiment, wherein the one or more mutations are effected in the
endogenous AP1 gene or FUL gene, the obtained Beta vulgaris plant
is non-transgenic. Preferably, the mutation is effected via
non-transgenic mutagenesis, transposon mutagenesis, in particular
chemical mutagenesis, preferably via EMS (ethylmethane
sulfonate)-induced TILLING or targeted genome editing (e.g.
CRISPR/Cas, TALEN, Zinc Finger nucleases, etc.). By way of example,
Tables 1a and 1b show possible point mutations within the genomic DNA
and cDNA of AP1 and FUL resulting in a nucleotide exchange from
cytosine (c) to thymine (t) and thereby generating a stop codon.
Such mutations can reduce the activity or stability of the
corresponding protein or polypeptide encoded by the endogenous DNA
sequence compared to a non-mutagenized wildtype plant, or can
result in a loss of function of the corresponding protein.
[0053] Additionally, the one or more mutations may cause an
alteration of the amino acid sequence of the AP1 protein or FUL
protein by an insertion or deletion of one or more amino acids,
e.g. through a shift of the open reading frame. The insertion can
be introduced, for instance, by transposon mutagenesis, and the
deletion can be created, for instance, by genomic engineering. Insertion and
deletion can occur in any nucleotide sequence encoding one of the
above described proteins, in a nucleotide sequence of an intron or
in a nucleotide sequence of the 5' untranslated region (UTR) or 3'
UTR of the AP1 gene or FUL gene. The insertion can have a length of
at least 1 nucleotide, at least 2 nucleotides, at least 3
nucleotides, at least 4 nucleotides, at least 5 nucleotides, at
least 6 nucleotides, at least 7 nucleotides, at least 8
nucleotides, at least 9 nucleotides, at least 10 nucleotides, at
least 12 nucleotides, at least 14 nucleotides, at least 16
nucleotides, at least 18 nucleotides, at least 20 nucleotides, at
least 25 nucleotides, at least 30 nucleotides, at least 40
nucleotides, at least 50 nucleotides, at least 75 nucleotides, at
least 100 nucleotides, at least 200 nucleotides, at least 300
nucleotides, or at least 500 nucleotides. If such an insertion or
deletion occurs, for instance, in a regulatory element (e.g. a
promoter), this may reduce the transcriptional or expressional rate
or the transcriptional or expressional level of the
corresponding endogenous DNA sequence in the Beta vulgaris plant
cell.
[0054] As used herein, the term "reduced expressional rate" or
"reduced expressional level" means a reduction of the expressional
rate or of the expressional level of one or more nucleic acid
sequences by more than 25% or 30%, preferably by more than 40%,
45%, 50%, 55%, 60%, or 65%, more preferably by more than 70%, 75%,
80%, 85%, 90%, 92%, 94%, 96% or 98% compared to the given
reference. It may be that the reduction of the expressional rate or
of the expressional level is 100%, e.g. in case of knock-out
mutants or loss of function mutants. Preferably the reduced
expressional rate or expressional level results in an amended
phenotype where the bolting and flowering is inhibited after
vernalization. The term "reduced transcriptional rate" or "reduced
transcriptional level" means a reduction of the transcriptional
rate or of the transcriptional level of one or more nucleic acid
sequences by more than 25% or 30%, preferably by more than 40%,
45%, 50%, 55%, 60%, or 65%, more preferably by more than 70%, 75%,
80%, 85%, 90%, 92%, 94%, 96% or 98% compared to the given
reference. It may be that the reduction of the transcriptional rate
or of the transcriptional level is 100%, e.g. in case of knock-out
mutants or loss of function mutants. Preferably the reduced
transcriptional rate or transcriptional level results in an amended
phenotype where the bolting and flowering is inhibited after
vernalization.
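The percentage thresholds in this definition can be applied to measured values as follows. This is a minimal sketch under assumptions: the function names are hypothetical, the inputs are assumed to be expression or transcript levels measured on the same scale for the reference and the sample (e.g. normalized values), and the example numbers are invented for illustration.

```python
def percent_reduction(reference_level: float, sample_level: float) -> float:
    """Reduction of the sample level relative to the reference, in percent."""
    if reference_level <= 0:
        raise ValueError("reference level must be positive")
    return 100.0 * (reference_level - sample_level) / reference_level


def is_reduced(reference_level: float, sample_level: float, threshold_pct: float = 25.0) -> bool:
    """True if the reduction is more than the threshold (strictly greater, as in 'more than 25%')."""
    return percent_reduction(reference_level, sample_level) > threshold_pct


# Invented example values: reference 100.0, mutant 30.0
print(percent_reduction(100.0, 30.0))        # prints 70.0
print(is_reduced(100.0, 30.0, 65.0))         # prints True
print(percent_reduction(100.0, 0.0))         # prints 100.0 (knock-out case)
```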
TABLE-US-00001 TABLE 1a List of positions in genomic DNA and cDNA
of BvAP1 where a point mutation causes a nucleotide exchange from
cytosine (c) to thymine (t) generating a stop codon.

Gene name   Position in genomic DNA   Position in cDNA
            (SEQ ID NO: 1)            (SEQ ID NO: 2)
BvAP1       52                        52
BvAP1       151                       151
BvAP1       6999                      262
BvAP1       7640                      316
BvAP1       8573                      343
BvAP1       8606                      376
BvAP1       8618                      388
BvAP1       8621                      391
BvAP1       8648                      418
BvAP1       11796                     433
BvAP1       11826                     463
BvAP1       21152                     484
BvAP1       21313                     541
BvAP1       21319                     547
BvAP1       21322                     550
BvAP1       21328                     556
BvAP1       21334                     562
BvAP1       21340                     568
BvAP1       21346                     574
BvAP1       21352                     580
BvAP1       21358                     586
BvAP1       21361                     589
BvAP1       21364                     592
BvAP1       21382                     610
BvAP1       21391                     619
BvAP1       21576                     688
BvAP1       21582                     694
TABLE-US-00002 TABLE 1b List of positions in genomic DNA and cDNA
of BvFUL where a point mutation causes a nucleotide exchange from
cytosine (c) to thymine (t) generating a stop codon.

Gene name   Position in genomic DNA   Position in cDNA
            (SEQ ID NO: 3)            (SEQ ID NO: 4)
BvFUL       19                        19
BvFUL       52                        52
BvFUL       19373                     235
BvFUL       19552                     316
BvFUL       19561                     325
BvFUL       19689                     376
BvFUL       19704                     391
BvFUL       28161                     433
BvFUL       28185                     457
BvFUL       28191                     463
BvFUL       28194                     466
BvFUL       28447                     541
BvFUL       28450                     544
BvFUL       28465                     559
BvFUL       28468                     562
BvFUL       28507                     601
BvFUL       29166                     664
BvFUL       29196                     694
BvFUL       29217                     715
BvFUL       29226                     724
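Positions of the kind listed in Tables 1a and 1b can be derived from a coding sequence by testing every cytosine for whether a C-to-T exchange turns its codon into a stop codon. The sketch below is an illustration only: the function name and the toy sequence are hypothetical, the standard genetic code is assumed, and the sequence is assumed to start in frame at position 1. Only three codons qualify (CAA, CAG and CGA, each mutated at its first base), which is consistent with the tabulated cDNA positions all falling on a first codon position if position 1 is taken as the first base of the start codon.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # standard genetic code, DNA alphabet


def c_to_t_stop_positions(cds: str) -> list:
    """1-based positions of cytosines whose exchange to thymine creates a stop codon.

    The coding sequence is assumed to be in frame starting at position 1;
    trailing bases that do not form a complete codon are ignored.
    """
    cds = cds.upper()
    positions = []
    for start in range(0, len(cds) - len(cds) % 3, 3):
        codon = cds[start:start + 3]
        for offset, base in enumerate(codon):
            if base != "C":
                continue
            mutated = codon[:offset] + "T" + codon[offset + 1:]
            if mutated in STOP_CODONS:
                positions.append(start + offset + 1)
    return positions


# Hypothetical toy coding sequence: ATG CAA GGT CGA TTT CAG TAA
print(c_to_t_stop_positions("ATGCAAGGTCGATTTCAGTAA"))  # prints [4, 10, 16]
```

Mapping such cDNA positions to genomic positions additionally requires the exon coordinates, which are not reproduced here.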
[0055] Thus, the present invention relates to a Beta vulgaris
plant, preferably a non-transgenic Beta vulgaris plant, where the
bolting and flowering is inhibited after vernalization, wherein the
plant exhibits one or more mutations in a first endogenous DNA
sequence and/or in a regulatory sequence thereof and exhibits one
or more mutations in a second endogenous DNA sequence and/or in a
regulatory sequence thereof. The first endogenous DNA sequence has a
nucleic acid sequence identical to a sequence which (a) exhibits a
sequence or partial sequence of SEQ ID NO: 1, or (b) is
complementary to a sequence or partial sequence of (a), or (c)
exhibits in the antisense direction a sequence or partial sequence
of (a) or (b), or (d) is a homolog to a sequence or partial
sequence of (a), or (e) is at least 80% or 85%, preferably at least
90%, 95%, 96%, 97%, 98% or 99%, or more preferably at least 99.5%,
99.6%, 99.7%, 99.8% or 99.9% identical to a sequence or partial
sequence of (a), or (f) encodes a protein or a part of the protein
with the amino acid sequence of SEQ ID NO: 5, or (g) encodes a
protein with an amino acid sequence of Beta vulgaris which is a
homolog to the sequence of (f), or (h) hybridizes under stringent
conditions with a sequence or partial sequence of (a), (b) or (c), and
the second endogenous DNA sequence has a nucleic acid sequence
identical to a sequence which (A) exhibits a sequence or partial
sequence of SEQ ID NO: 3, or (B) is complementary to a sequence or
partial sequence of (A), or (C) exhibits in the antisense direction
a sequence or partial sequence of (A) or (B), or (D) is a homolog
to a sequence or partial sequence of (A), or (E) is at least 80% or
85%, preferably at least 90%, 95%, 96%, 97%, 98% or 99%, or more
preferably at least 99.5%, 99.6%, 99.7%, 99.8% or 99.9% identical
to a sequence or partial sequence of (A), or (F) encodes a protein
or a part of the protein with the amino acid sequence of SEQ ID NO:
6, or (G) encodes a protein with an amino acid sequence of Beta
vulgaris which is a homolog to the sequence of (F), or (H)
hybridizes under stringent conditions with a sequence or partial
sequence of (A), (B) or (C). Preferably, the one or more mutations
in the first and/or the second endogenous DNA sequence are
homozygous.
[0056] Preferably, the one or more mutations cause a reduced
transcriptional or expressional rate or a reduced transcriptional
or expressional level of the endogenous DNA sequence in the
mutagenized plant compared to a non-mutagenized wildtype plant, or
the mutation causes a reduction of the activity or stability of the
protein or polypeptide encoded by the endogenous DNA sequence
compared to a non-mutagenized wildtype plant. More preferably, the
one or more mutations result in a loss of function, i.e. the
expression/transcription of the mutated DNA does not lead to the
synthesis of a functional protein (e.g. functional AP1 protein or
FUL protein, respectively).
[0057] A preferred embodiment of the present invention is a Beta
vulgaris plant or a part thereof as described above, wherein the
one or more mutations cause an alteration of the amino acid
sequence of AP1 or FUL; in particular, the one or more mutations
can be a point mutation resulting
in at least one amino acid exchange, the exchange of an amino acid
coding codon to a codon carrying the stop signal of translation
(stop codon), or the change of the start signal of translation
(start codon). Preferably the point mutation in the AP1 gene (i.e.
first endogenous DNA sequence) is selected from the group
consisting of mutations listed in Table 1a or indicated by SEQ ID
NO: 33 or SEQ ID NO: 37 and/or the point mutation in the FUL gene
(i.e. second endogenous DNA sequence) is selected from the group
consisting of mutations listed in Table 1b or indicated by SEQ ID
NO: 34 or SEQ ID NO: 38. Corresponding positions in allelic
variants of AP1 and FUL are also included. More preferably, the
point mutation in the AP1 gene (i.e. first endogenous DNA sequence)
is a nucleotide exchange from cytosine (c) to thymine (t) at
position 262 of SEQ ID NO: 2 or at position 6999 of SEQ ID NO: 1 or
corresponding position in allelic variants of AP1, and/or the point
mutation in the FUL gene (i.e. second endogenous DNA sequence) is a
nucleotide exchange from cytosine (c) to thymine (t) at position
316 of SEQ ID NO: 4 or at position 19552 of SEQ ID NO: 3 or
corresponding position in allelic variants of FUL. Consequently,
the mutated AP1 gene (i.e. first endogenous DNA sequence) can have
the sequence of SEQ ID NO: 39 leading to a cDNA according to SEQ ID
NO: 35, and/or the mutated FUL gene (i.e. second endogenous DNA
sequence) can have the sequence of SEQ ID NO: 40 leading to a cDNA
according to SEQ ID NO: 36.
[0058] In an additional aspect of the invention the above described
methods for producing a transgenic or non-transgenic Beta vulgaris
plant can be combined. This means, for instance, that only one of
the first and second endogenous DNA sequences has been mutated by
introducing one or more mutations and the expression of the other,
non-mutated DNA sequence is suppressed or silenced using the
transformation approach as described above.
[0059] A further aspect of the invention is a Beta vulgaris plant
or a part thereof produced or producible by any of the methods for
producing a Beta vulgaris plant as described above.
[0060] The invention is described below with reference to exemplary
embodiments and the accompanying figures purely for illustrative
purposes.
BRIEF DESCRIPTION OF THE FIGURES
[0061] FIG. 1: Schematic structure of cloning vector pAB70S-1 35S
Ataap6 RNAi
[0062] FIG. 2: Schematic structure of cloning vector pAB70S-1 35S
Ataap6 FUL AP1 including partial sequences of AP1 and FUL
[0063] FIG. 3: Schematic structure of binary Ti-plasmid pZFN d35S
RNAi FUL-AP1 including partial sequences of AP1 (212 bp) and FUL
(184 bp) used in Agrobacterium-mediated transformation
[0064] FIG. 4: Schematic representation of cloning vector pAB70s-1
d35S Ataap6 RNAi vil1-AP1-FUL LF including partial sequences of
AP1, FUL and VIL1
[0065] FIG. 5: Schematic representation of the binary Ti-plasmid
pZFN d35S RNAi vil1-FUL-AP1 including partial sequences of AP1, FUL
and VIL1 used in Agrobacterium-mediated transformation
[0066] FIG. 6: Alignment of the amino acid sequences of the protein
AP1 from Beta vulgaris (SEQ ID NO: 5) and from Arabidopsis thaliana
(SEQ ID NO: 41) based on the EMBOSS Needle algorithm; BvAP1=Beta
vulgaris AP1, AtAP1=Arabidopsis thaliana AP1
[0067] FIG. 7: Alignment of the amino acid sequence of the protein
FUL from Beta vulgaris (SEQ ID NO: 6) and from Arabidopsis thaliana
(SEQ ID NO: 42) based on the EMBOSS Needle algorithm; BvFUL=Beta
vulgaris FUL, AtFUL=Arabidopsis thaliana FUL
EXAMPLES
[0068] 1. Inhibition of bolting and flowering by RNAi constructs
targeted to AP1 and FUL
[0069] Identification/isolation and characterization/annotation of
complete cDNAs of sugar beet for inhibition of bolting and
flowering:
[0070] By analysis within a specially created proprietary sugar
beet EST database, the 780 base pairs (bp) long cDNA (SEQ ID NO: 2)
of BvAP1 as well as the 735 bp long cDNA (SEQ ID NO: 4) of BvFUL
have been identified. In addition, corresponding genomic DNA
sequences could be identified. An alignment of genomic DNA with
cDNA shows the structures of the entire DNAs. AP1 consists of 8
exons and 7 introns, FUL of 8 exons and 7 introns.
[0071] A comparison of the resulting full-length DNA and the
translated protein sequence shows only low sequence similarity with
Arabidopsis homologs AtAP1 and AtFUL. At the protein level the
identity over the entire sequence length to AtAP1 is 65.6% (see also
FIG. 6) and to AtFUL is 57.3% (see also FIG. 7); at the cDNA level
the identity to AtAP1 is 71% and to AtFUL is 72% (see Table 2).
TABLE-US-00003 TABLE 2 Sequence comparison of BvAP1 and BvFUL with
Arabidopsis thaliana (At)-AP1 and -FUL candidates based on the
protein sequence and cDNA. Results given as sequence identity based
on the EMBOSS Needle algorithm (www.ebi.ac.uk).

                                AtAP1 protein     AtFUL protein     AtAP1   AtFUL
                                (SEQ ID NO: 41)   (SEQ ID NO: 42)   cDNA    cDNA
BvAP1 protein (SEQ ID NO: 5)    65.6%
BvFUL protein (SEQ ID NO: 6)                      57.3%
BvAP1 cDNA (SEQ ID NO: 2)                                           71%
BvFUL cDNA (SEQ ID NO: 4)                                                   72%
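The identities in Table 2 were obtained with EMBOSS Needle, which performs a global (Needleman-Wunsch) alignment. The sketch below illustrates the principle only and will not reproduce the values in Table 2: it uses a simple match/mismatch score and a linear gap penalty, whereas EMBOSS Needle uses substitution matrices and separate gap opening and extension penalties. All function names, scoring values and toy sequences are hypothetical.

```python
def needleman_wunsch(a: str, b: str, match: int = 1, mismatch: int = -1, gap: int = -2):
    """Global alignment with linear gap penalty; returns the two aligned strings."""
    n, m = len(a), len(b)
    # Dynamic programming matrix of best scores for prefixes a[:i], b[:j]
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            diag = score[i - 1][j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            score[i][j] = max(diag, score[i - 1][j] + gap, score[i][j - 1] + gap)

    # Traceback from the bottom-right corner
    out_a, out_b = [], []
    i, j = n, m
    while i > 0 or j > 0:
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + (
            match if a[i - 1] == b[j - 1] else mismatch
        ):
            out_a.append(a[i - 1]); out_b.append(b[j - 1]); i -= 1; j -= 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1]); out_b.append("-"); i -= 1
        else:
            out_a.append("-"); out_b.append(b[j - 1]); j -= 1
    return "".join(reversed(out_a)), "".join(reversed(out_b))


def global_identity(a: str, b: str) -> float:
    """Percent identity over the full length of the global alignment."""
    aln_a, aln_b = needleman_wunsch(a, b)
    matches = sum(1 for x, y in zip(aln_a, aln_b) if x == y)
    return 100.0 * matches / len(aln_a)


# Toy sequences (hypothetical)
print(needleman_wunsch("GATTACA", "GATACA"))
print(round(global_identity("GATTACA", "GATACA"), 1))
```

Dividing by the full alignment length corresponds to identity "over the entire sequence length"; a different denominator would give a different percentage.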
[0072] Production of RNAi constructs targeted to AP1 and FUL and
inhibition of bolting and flowering in sugar beet:
[0073] For the production of RNAi constructs the sequences of SEQ
ID NO: 19 to 22 were synthesized. The sequence of SEQ ID NO: 19
includes a partial sequence of AP1 according to SEQ ID NO: 10 with a
length of 184 bp and a partial sequence of FUL according to SEQ ID
NO: 14 with a length of 212 bp. The sequence of SEQ ID NO: 20
includes a partial sequence of AP1 according to SEQ ID NO: 11 with a
length of 150 bp and a partial sequence of FUL according to SEQ ID
NO: 15 with a length of 150 bp. The sequence of SEQ ID NO: 21
includes a partial sequence of AP1 according to SEQ ID NO: 12 with a
length of 100 bp and a partial sequence of FUL according to SEQ ID
NO: 16 with a length of 100 bp. The sequence of SEQ ID NO: 22
includes a partial sequence of AP1 according to SEQ ID NO: 13 with a
length of 50 bp and a partial sequence of FUL according to SEQ ID
NO: 17 with a length of 50 bp.
[0074] For further processing the sequences were amplified by PCR
using PCR primers with SalI/SmaI restriction sites, such as the
primers according to SEQ ID NO: 25 (forward) and 26 (reverse) for
the amplification of SEQ ID NO: 19. The PCR was performed using 10
ng of genomic sugar beet DNA and a primer concentration of 0.2 µM
at an annealing temperature of 55 °C in a Multicycler
PTC-200 (MJ Research, Watertown, USA).
[0075] The PCR products were each integrated into the vector
pAB70S-1 35S Ataap6 RNAi (FIG. 1). The vector is designed for the
production of "intron-spliced" hairpin structures. The vector
contains the d35S promoter for constitutive expression, the ATAAP6
intron from Arabidopsis thaliana and one polyA terminator (nos-T).
The ATAAP6 intron is flanked by the restriction cleavage sites
XhoI/Ecl136II at the 5' end and by the restriction cleavage sites
SmaI/SalI at the 3' end. This enables the integration of identical
fragments in "sense" and "antisense" orientation, provided these
fragments have the compatible restriction site XhoI or SalI on one
end and are blunt on the other end ("blunt end"). For this purpose
the original PCR products were reamplified with new PCR primers
extended to include these restriction sites. For further use the
PCR fragments were cloned into the TA cloning vector pCR2.1 (TOPO
TA Cloning Kit (Invitrogen, Carlsbad, USA)) and transformed into E.
coli. Blue-white selection enabled the identification of
recombinant plasmids (Sambrook et al. 2001, in Molecular Cloning: A
Laboratory Manual, Cold Spring Harbor Laboratory Press, 3rd
edition, New York). In recombinant clones the expression of
β-galactosidase is suppressed by the insert, which results in white
colonies, because the enzyme substrate added to
the medium is no longer cleaved. After a subsequent sequencing with
M13-fwd/rev-primers, the analysis and the alignment of the sequence
data was performed using the program Vector NTI (Invitrogen,
Carlsbad, USA).
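The end compatibility that the cloning strategy in paragraph [0075] relies on can be illustrated with a short sketch. The recognition sequences and cut positions below are the generally documented ones for the four enzymes named in the text; the table and the helper functions `overhang` and `compatible` are hypothetical illustrations and not part of the disclosed protocol.

```python
# Recognition site and top-strand cut offset for the enzymes named in the text.
# The lookup table is an illustrative helper, not part of the original method.
ENZYMES = {
    "SalI":     ("GTCGAC", 1),  # G^TCGAC, leaves a 5' TCGA overhang
    "XhoI":     ("CTCGAG", 1),  # C^TCGAG, leaves a 5' TCGA overhang
    "SmaI":     ("CCCGGG", 3),  # CCC^GGG, blunt
    "Ecl136II": ("GAGCTC", 3),  # GAG^CTC, blunt
}


def overhang(enzyme):
    """Return the 5' overhang left by a palindromic cutter ('' if blunt).

    Only valid for enzymes that cut at or before the centre of the site,
    which holds for the four enzymes listed above.
    """
    site, cut = ENZYMES[enzyme]
    return site[cut:len(site) - cut]


def compatible(enzyme_a, enzyme_b):
    """Two ends can be ligated if both are blunt or share the same overhang."""
    return overhang(enzyme_a) == overhang(enzyme_b)


if __name__ == "__main__":
    for pair in [("SalI", "XhoI"), ("SmaI", "Ecl136II"), ("SalI", "SmaI")]:
        print(pair, "compatible:", compatible(*pair))
```

Because SalI and XhoI leave the same overhang and SmaI and Ecl136II both cut blunt, a fragment with one SalI or XhoI end and one blunt end can be inserted at either side of the intron, which is what allows the same fragment to be placed once in each orientation.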
[0076] The SalI-SmaI and XhoI-SmaI fragments were each excised from
the TOPO vector with SalI/SmaI or XhoI/SmaI and first ligated in
"sense" orientation into the pRTRNAi vector cut with SalI/SmaI or
XhoI/Ecl136II. Subsequently, the same fragments were ligated a
second time in "antisense" orientation into the compatible
XhoI/Ecl136II or SalI/SmaI sites. The cloning resulted in, for
example, pAB70S-1 35S Ataap6 FUL AP1 (FIG. 2).
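As a rough illustration of the "intron-spliced" hairpin layout described above, the following sketch assembles an inverted repeat from a sense fragment, a spacer intron and the reverse complement of the same fragment. The order sense, intron, antisense and both example sequences are assumptions chosen for illustration; they are not SEQ ID sequences from this application.

```python
# Translation table for complementing DNA bases (upper and lower case).
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def hairpin_insert(sense, intron):
    """Join sense arm, intron spacer and antisense arm into one insert.

    After transcription and splicing of the intron, the two arms can pair
    to form the double-stranded RNA that triggers RNAi.
    """
    return sense + intron + reverse_complement(sense)


if __name__ == "__main__":
    sense = "ATGGCGTTCA"        # placeholder target fragment (hypothetical)
    intron = "GTAAGTTTTTCAG"    # placeholder spacer intron (hypothetical)
    construct = hairpin_insert(sense, intron)
    print(construct)
    print("antisense arm:", reverse_complement(sense))
```

In the constructs of the text the sense arm would be the combined FUL-AP1 fragment and the spacer the ATAAP6 intron; the sketch only shows why inserting the same fragment in both orientations yields a self-complementary transcript.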
[0077] Production of transformation constructs and sugar beet
transformation:
[0078] For plant transformation the binary vector pZFN was used.
The expression cassettes were excised using SfiI and transferred
into the binary vector pZFN to create pZFN d35S RNAi FUL-AP1-212-184
(FIG. 3), pZFN d35S RNAi FUL-AP1-150-150, pZFN d35S RNAi
FUL-AP1-100-100, and pZFN d35S RNAi FUL-AP1-50-50. Each of the
binary vectors was transformed into Agrobacterium tumefaciens strain
GV3101 pMP90 by a direct DNA transformation method (An, G. (1987),
Ti binary vectors for plant transformation and promoter analysis,
Methods Enzymol. 153, 292-305). Recombinant A. tumefaciens clones
were selected using the antibiotic streptomycin (50 mg/l). The
transformation of sugar beet and regeneration were carried out
according to Lindsey et al. (1990) and Lindsey et al. (1991,
"Regeneration and transformation of sugar beet by Agrobacterium
tumefaciens", Plant Tissue Culture Manual B7: 1-13, Kluwer Academic
Publishers).
[0079] The transgenicity of the plants was verified by PCR.
Purpose-designed primers were used to amplify a specific DNA
fragment of the nptII gene. The PCR was performed using 10 ng of
genomic DNA and a primer concentration of 0.2 µM at an annealing
temperature of 55 °C in a Multicycler PTC-200 (MJ Research,
Watertown, USA).
[0080] Verification of the flowering and bolting behavior of the
transformants:
[0081] For each of the RNAi constructs five transgenic sugar beet
lines were regenerated carrying the corresponding binary vector
pZFN d35S RNAi FUL-AP1. The sugar beet plants were grown for
several weeks in sterile culture media, propagated and then rooted
together with non-transgenic isogenic controls. 7-9 plants per line
and control were transferred to the greenhouse. The transgenic
lines were grown in pots and then tested in different vernalization
regimes to determine the bolting and flowering behavior. After an
adjustment period, the plants were subjected to vernalization for
three, four or six months at 8 °C in a cooling chamber (winter
simulation). Subsequently, the transformants, as well as
identically treated non-transgenic control plants, were transferred
back into the greenhouse (25 °C). As early as 10 days after
transfer, the control plants began to grow shoots. After 4 weeks
the control plants began to flower. In contrast, several
transformant lines surprisingly showed no response to the shoot and
flower induction by vernalization. These included lines of each
RNAi construct used.
[0082] Thus, the resulting transformants behaved like
non-vernalized sugar beets. They developed neither shoots nor
flowers. None of the plants showed deviations from the normal
phenotype. The plants were cultivated further; they continued to
develop into normal beets with normal beet bodies.
[0083] These lines were tested again over two winters (2013/2014;
2014/2015) in a greenhouse without temperature control, planted in
soil for optimal root growth. None of the plants bolted or
flowered.
[0084] Surprisingly, using the inventive approach, the effect of
vernalization, namely bolting and flowering, was completely blocked
in sugar beet.
[0085] 2. Inhibition of bolting and flowering by RNAi constructs
targeted to AP1, FUL and VIL1
[0086] For this approach an RNAi construct comprising partial
sequences of AP1 and FUL cDNA was extended by a third partial
sequence of 399 bp based on cDNA of the BvVil1 gene from Beta
vulgaris (WO 2011/032537). VIL1 was chosen due to its involvement
in flower formation.
[0087] PCR product BvVil1 RNAi was amplified using a forward primer
according to SEQ ID NO: 27 and a reverse primer according to SEQ ID
NO: 28. BvVil1 cDNA was used as template. The amplification led to
a VIL1 cDNA fragment according to SEQ ID NO: 29. Additionally, PCR
product BvFUL-AP1 was synthesized and amplified using primers
according to SEQ ID NO: 25 and 26. Vector pZFN as described above
was used as template. The amplification led to a FUL-AP1 cDNA
fragment according to SEQ ID NO: 30.
[0088] Both amplified DNAs were cloned into vector pAB70S-1 35S
Ataap6 RNAi (FIG. 1). PCR product BvVil1 RNAi was cloned using
Ecl136II and XhoI. PCR product BvFUL-AP1 was added using XhoI and
SmaI. The resulting intermediate pAB70s-1 d35S Ataap6 RNAi
vil1-AP1-FUL LF (FIG. 4) was used for another PCR step. The vector
pAB70s-1 d35S Ataap6 RNAi vil1-AP1-FUL LF was used as template for
forward primer according to SEQ ID NO: 31 and reverse primer
according to SEQ ID NO: 32. The PCR product was an RNAi construct
which then was cloned into the vector using SalI and SmaI. The
resulting vector was cut using SfiI and cloned into vector pZFN
resulting in vector pZFN d35S RNAi vil1-FUL-AP1 (FIG. 5).
[0089] Vector pZFN d35S RNAi vil1-FUL-AP1 was used to transform
Agrobacterium tumefaciens GV3101 pMP90, which was subsequently used
to generate transgenic sugar beet lines. Transgenicity was
confirmed by PCR. After regeneration, nine plants of one line were
rooted as described above. After vernalization in the greenhouse
none of the plants bolted or flowered.
[0090] 3. Inhibition of bolting and flowering by knock-out mutants
of BvAP1 and BvFUL
[0091] Mutagenization of sugar beet cells and identification of
BvAP1 and BvFUL mutants:
[0092] A sugar beet mutant population was created by treatment
with different EMS concentrations for different incubation times.
M1 plants were regenerated from the treated cells. Through selfing
of the M1 plants, several thousand M2 plants were obtained.
[0093] These M2 plants were screened for knock-out mutations in the
BvAP1 gene and the BvFUL gene. For that, DNA was extracted from
collected leaf samples and analysed by use of designed
primers. Thereby, point mutations in the genes which introduce
additional stop codons into the coding sequence of the genes could
be identified. One plant showed a point mutation in the AP1 gene
which is a nucleotide exchange from cytosine (c) to thymine (t) at
position 262 of the cDNA (SEQ ID NO: 2) or at position 6999 of the
genomic DNA (SEQ ID NO: 1). A second identified plant contained a
point mutation in the FUL gene which is a nucleotide exchange from
cytosine (c) to thymine (t) at position 316 of the cDNA (SEQ ID NO:
4) or at position 19552 of the genomic DNA (SEQ ID NO: 3).
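The following sketch shows how a C-to-T exchange at a given cDNA position can be checked for creating a premature stop codon. It assumes that position 1 of the cDNA sequence is the A of the start codon, which is not stated in the text; the function names and the toy sequence are hypothetical, and the actual SEQ ID NO: 2 and SEQ ID NO: 4 sequences are not reproduced here.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def codon_position(cdna_pos, cds_start=1):
    """Map a 1-based cDNA position to (codon number, base within codon).

    cds_start is the 1-based position of the A of the ATG; the default
    of 1 is an assumption, not a value taken from the application.
    """
    offset = cdna_pos - cds_start
    return offset // 3 + 1, offset % 3 + 1


def apply_substitution(cds, cdna_pos, alt, cds_start=1):
    """Return (reference codon, mutated codon, creates_stop) for a point mutation."""
    cds = cds.upper()
    codon_number, base = codon_position(cdna_pos, cds_start)
    start = (cds_start - 1) + (codon_number - 1) * 3
    ref_codon = cds[start:start + 3]
    alt_codon = ref_codon[:base - 1] + alt.upper() + ref_codon[base:]
    return ref_codon, alt_codon, alt_codon in STOP_CODONS


if __name__ == "__main__":
    # Positions reported in the text for the AP1 and FUL mutations.
    print(codon_position(262))  # (88, 1)
    print(codon_position(316))  # (106, 1)
    # Toy coding sequence: ATG CAA GGT; C->T at position 4 turns CAA into TAA.
    print(apply_substitution("ATGCAAGGT", 4, "T"))  # ('CAA', 'TAA', True)
```

Under the stated assumption, both reported positions fall on the first base of a codon, where a C-to-T exchange can convert CAA, CAG or CGA into the stop codons TAA, TAG or TGA.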
[0094] Verification of the flowering and bolting behavior of single
mutants:
[0095] Cells of the identified sugar beet mutants were cultured for
several weeks in sterile culture media, propagated and then rooted
together with non-mutated controls. 5 plants per mutant and control
were transferred to the greenhouse. The mutant lines were grown in
pots and then tested in different vernalization regimes to
determine the bolting and flowering behavior. After an adjustment
period, the plants were subjected to vernalization for three months
at 8 °C in a cooling chamber. Subsequently, they were transferred
back into the greenhouse (25 °C). As early as 11 days after
transfer, the control plants as well as the mutant lines began to
grow shoots. After 4 weeks all plants began to develop flowers.
[0096] Verification of the flowering and bolting behavior of double
mutants:
[0097] F1 progeny of a cross of AP1 mutants with FUL mutants were
analyzed to identify plants carrying both the mutated AP1 gene and
the mutated FUL gene. Two such F1 plants were detected and then
selfed to generate an F2 population. Selected plants of the F2
generation were tested in the greenhouse as described above. As
early as 10 days after transfer, the control plants and most of the
selected plants began to grow shoots. However, a few plants
surprisingly showed no response to the shoot and flower induction
by vernalization. These non-bolting plants were all homozygous for
both of the identified point mutations in the AP1 gene and the FUL
gene, except two plants, which showed a heterozygous genotype for
at least one of the mutations.
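The small share of non-bolting F2 plants is in line with simple Mendelian expectation, as the sketch below illustrates. It assumes that the selfed F1 plants were heterozygous at both loci and that the AP1 and FUL mutations segregate independently; neither assumption is stated in the text, and the function names and the plant count are hypothetical.

```python
from fractions import Fraction


def f2_double_homozygous_fraction():
    """Expected F2 fraction homozygous mutant at two unlinked loci.

    Selfing a heterozygote gives 1/4 homozygous mutants per locus; with
    independent segregation the two probabilities multiply.
    """
    per_locus = Fraction(1, 4)
    return per_locus * per_locus


def expected_count(n_plants):
    """Expected number of double-homozygous plants among n_plants F2 plants."""
    return n_plants * f2_double_homozygous_fraction()


if __name__ == "__main__":
    print(f2_double_homozygous_fraction())  # 1/16
    print(expected_count(160))              # 10
```

Under these assumptions only about one in sixteen F2 plants would carry both mutations in the homozygous state, so a large F2 population has to be genotyped to recover a handful of double mutants.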
Sequence CWU 1
1
42121668DNABeta vulgarismisc_feature(9833)..(11385)n is a, c, g, or
tmisc_feature(12467)..(12605)n is a, c, g, or
tmisc_feature(13741)..(14088)n is a, c, g, or
tmisc_feature(16396)..(17194)n is a, c, g, or t 1atggggagag
gaagagtgca gctgaagagg atagagaata agatcaacag acaagtaact 60ttttcaaaga
gaagaagtgg acttgtgaag aaagctcatg aaatttctgt tctttgtgat
120gctgaggttg ctctgatcat tttttctcac cgaggaaaac tctttgagta
ttcttctgat 180tcttcgtaag tatatatata tatatattaa tagtaactac
ttgttttctg ctttctattt 240ttaggtctga tgcatattta atttaggtaa
tattaattcc ttatatctga tccttaattt 300ttttttcttt taccatttca
tttttgtttg ttttgaataa aagaaaattt ccccttcacg 360tgtgtcgaat
aggtcaaaat ttttacttga aggatgttct ctttgattac taaaatagga
420tccaacaatc acctgaaata aaggaagaag atggtgcaaa gtttttactg
tcatacttag 480tatttgataa atattctatg atgaacttgt ataaattagg
aaatagacct aactttcatg 540cacgaaaaca ttattccttc attcaatttt
tttattactt aaggatttac ttttttattg 600atcatatgaa gtagtagtac
ttgtaatcat tcaatttttt tgttggttaa ataggactac 660attttaaaac
aacccaattt taaaattttt tgtgtgaatt tcttcccttt ttaaaaataa
720agtctattat catagcttag agtagctgtg gcaaagctag acgaaataat
acagaaatct 780ggaaaggaaa ttgtactact tacatgaaca cacttattta
ttacttgcat gatatctgcg 840aaaaagttta tagcaaattt ggttaatata
tagcgtagta ctttggatat taatattact 900agtgtacaaa tacttgatcc
aatgggtaat gaaacttatg gaagatttga ccatacatga 960tgatgctaaa
tattaattgt tattgtccag ctttgttttc cctccatcca ttggcatctt
1020catctttaca ttgctactcc actcacttgt caattgtttc gtcctttatg
ttctttattc 1080acatgtgcac catacttcaa tactttcccc ttctttatcc
tcagtttttt tttcttgtca 1140ttttagggtt aatatccaat gaaatctagt
ttgctcgttt tagatctaat tttaattcga 1200tcacaaccat ccatattttt
gtttcttagc ttgacatcta ttctatggat ctgggatctt 1260cggtgtatag
atgttctcgg ttttcagatc aagatcctat tcatagaccc atttattgta
1320aacacttaaa tgtgttctta aaaagttagt ggctcgccaa gtcaactcaa
taacataacc 1380cccacgactt cattacatta cacaatgaaa gattagatgt
atgagtttgt gaagcttata 1440attctatttc aagtaggact aggatgtttt
gtgcaatcag cagctagtag tctttttaat 1500ttaagtcagt cttcattgtg
catcatatat ttttagaaat atatgcaagt ttgaaaccat 1560ttagaacctc
atgacccgcc tgactcacta taaaccggca agagcttaat ttttcacagc
1620tttgtatctt tatgagtagc gctagctagg ggtatgggca tagaaaaaaa
gggtttgggt 1680tagggtctta caagatctta tccgctattt ttatttcata
atctttcaaa atacatgttt 1740aataattcaa aatacatgtt taatactatc
tccatttcac aacatatgca ccaattgcct 1800agctatggtc caacctagtt
ggtttgtagc ttgcattgga tggttaggat gtattggagt 1860tgtttatgtg
caatcaaatt ttaattacgt atcaaaaaaa aaaaaaaaaa aacatatgca
1920ccaatttcca tttggacaca cttattgacc aatttttgac aatatttttc
tcaccatttt 1980gtaagaaaaa tcaaaatcaa gtggaatttt gttaagttta
tctcagtcaa aagattccat 2040acatcgacat tttataattt ttaatcatac
gcaattagaa atatcaatgt ctaaagaagc 2100gtgttggaat acgtgaaaaa
gcaaatgata catgaaacag atgtagtata tagaaaactt 2160aattttgtgt
cactcggatg tatgtgggcg gagccttcct agaaggcgta cccaccttag
2220tggctctgaa tctttgacga cccgttcggt tggtggtgat aatagatggt
aatagtaatg 2280taatttagtc taaatttata aataaatatt aatatcatta
cccatggtaa tacaagttct 2340tcacaaaaca tgtttcattt aaaaattatc
attactacct tttcaagtgg tattggatga 2400taataaaatt ttaggcaggg
aaatgggtat tgggatgaac attaccatgg gtaatgacat 2460gcaatttttg
ttacaagaat acagtataat acattactat tgccaccatg tataaccatt
2520aatcaaatgg accgtgagga tatgatgttg aagaagaagt cttaacctct
acgctattat 2580ttactagggt ctgtaaattt tcctttttta attataattc
ttgtgaaatc ttcttcactg 2640atggtactag cttattagga tgggtttctt
tagtatattg aaggctcttg ttgacagagt 2700ataaaaatat ttttggggtc
gcaaccatca atttaaactt ttgtttgatt ataaaattat 2760tttttgaaca
tcaacaatct acttaaattt ttggttgagt tagttctttg acatggtatc
2820acaaccatca tgacataaag gtctcatatt caaatctcat tcacctctca
tttccaagta 2880gaatatttac ctcaggtatg ggtatgaggg aggcttgtgt
tgcatgagtc aataacggat 2940cttgaccaat aatttaacag gggcgagttg
ataaattaag ttttaatgta aaattttaaa 3000tgatggataa aaacactaat
acacaccaaa atataaatat acttttatta atggttacaa 3060agagcttgta
gctaatgtaa taaatcaaaa tcccaaaggt gcaattttta agaaattatt
3120tccatttatt tatttgacca ttatgaaatc ttcaagaaat tgagtaagtt
tttaagaaat 3180ttaaggtata gttcattaac taaataaact actccagtaa
aaaaaaatta ccaaactgct 3240ttcttaagta aaaaaataaa taaatttata
ttttatgatt gttaggaatg agtgtgagga 3300aataaaaagt actattatag
ttaaataaaa atgaaagttc ttcagagaag aagaatagaa 3360gatagtacaa
tcaatgttaa atatttttct aaattagaca aattgatata aaccaaaaat
3420aaaggggaag aagaaagaaa taagtaaaaa aagaaagaag gaaaagaaaa
aaagaaaaaa 3480gagaagcaag tgaagaaaaa caaagaagtc caaatgtgtg
ttgatgcaag gttcgagctt 3540gcaacattaa gggctcaaac tttcttttac
actttggttc actgccaccg tgcccacagc 3600ttgttatgtg acatgaagtg
tagtttgctt aatttatctt atacagttat gggggaccaa 3660gcctccaccc
gccccttcta taatctgtca gtggttgcat ccacacttta agtccaatag
3720actcttgtct gagaggaggt gatagagtat ataaatattt ttggggcctc
aaccattagc 3780tcaatctttt gattaagttg gttctgtgac acttgtacta
tatactagtt atatatatac 3840tgtaaaacta gtaccacgag aacagtcctt
aatacaaaca acatgccctt aatagaattt 3900tcttagtata cacttaatat
aggttgacta gctttttgcc cttcagtatg cacacacctt 3960ttataatctg
tatcgttgtc tggtagatga taataaacct cagtattggc aatatatgaa
4020atgacataat ggccatgttt ggtgattaga gtttagagtt tagaggttac
agttcagagt 4080ttgtggttag atgattactt ttttgttcag aggatttgac
tgctgattta aataattgtt 4140gtgtaaaggt gtttggtaac acttagctta
ttgtttagag ttttgtactt tttagagcat 4200gtaaaatgac atttatggac
atatgtattt ttttaaaaca aattttagta gtaattatat 4260ggacaaaata
gtcatttgtt ttttctctct ccaaaactct catgaaaaag ctcctctacc
4320cagctttttc aaaagagagt tttgatcaga gttttcggta caaaactctc
tttagtcctc 4380tctctcacca aacacccaaa ttagagtttt tattggtcaa
aactctaaac tctctccaaa 4440cctctaaact ctctctaaaa ctctctcccc
caaacacccc caatttctta gaaaaatttg 4500ttgctccttt ttattgcact
atatttctat ctccaaacat aaagtttctt ttacaaattt 4560tcatttctac
tccataccac ctttatatgg caatataatt tctatgaatt aaaatgttca
4620caagttttga ggtggatttc aagagcatgg acaatatgat catgagactc
tccatacaaa 4680aattaccctt aaattttata atcatacacc aagcggtcgt
taaagtattg gaagtgcttg 4740agtagtttgt gaaaattaac atataataaa
gtgcagatct cccctctagt aagtagtaag 4800aagtagtaag acgatgtccc
tcatttgaga aagagaaaaa cccttatcag tttctcttgt 4860ttctttgact
gaacgcaagt caaatagaag tatgtaacta ggaaatcctt ggagaaatag
4920attttcttta aaactataaa agtataccta tatatatggt aacccacaaa
aatgtatata 4980atctgatcaa tatctaaaca aagtattctt atgttttctt
tcatcttgct tatttcctcc 5040ctttcctttt cttactttaa tttgtttact
ctctttaact tatttctttg cgtatctcat 5100ttcactttac aaggatatat
agttgattat gacagcttaa taaatatatt ttggaactag 5160gatttattgg
ttgtcgttgt tattttaatt tctacactga tcggctagag tttctagaac
5220atagggcttt attgaaacca ttagttaaca aaattgaatg acaatgattc
aatatgatag 5280aatatgtatg tattagttaa tgtttgatta ttgtttgtat
gtatataatc aaagattatt 5340tagtaatact tctatataca tattctatta
gaatcactta gaaagaccca ttgaacaata 5400ataaggatag gcagacaagc
aaacaaaaga aaaataaacc tgttactcct tccatttctt 5460aatgttctac
tcggaattat agatacacac tttgacacaa attagaaaga gagtgtaaaa
5520agtggatcca tattaatatt tttatttttt taaatgagga gagaagtgtg
ggtttattat 5580gtttcaaggg agatagagag cattgaatag tgagagaata
tgtgccaaag ataattaaat 5640cattgtaaat cttttgccaa ataaagaata
aagcatgtga gtaaaacttt aaaaaatggg 5700cgaaaaagga aagttgagta
gaactttaag agacggaagg aatatagaag aggacgtgac 5760agatgggagg
aagatcagac atcttagaag gggaatagtt aaatttgaga tagtctttta
5820attaaggttc tcactaaaga agatataaca gtaggggaaa gctaaaggtt
attcaaaact 5880ttccttccca tcttcatcac ttcatgtctt tactttagag
ctcttaacac ttagcctatg 5940aaattctgaa ctctttgtaa gattagtgat
agataaaaga atcttatcaa tttaatttat 6000aaatacaaca ggattcaata
aaaagatata gagatctata aataaagagc catactgttg 6060tgaactttta
tatctatcaa aacctttgca cattagacgt ggtataacta aatcaggctt
6120atcgaaattt tttaaaattg ttttcattat agccccttta tatttagaag
ttctaagatg 6180attgcataga tagttgatgc accgttctgg tcgacttttt
taaacacttc tttttgataa 6240attttttttt ttgtattcga atcattattt
taggtgtata aagagctgca aatgatctag 6300atgagattga tctcggtttc
atttatatgc taatagtgtg ttagatacac actattaaaa 6360aagtcatatg
acttagagat tattatggaa aagggatagt gcaccgatat taatataatg
6420gaaaatgaca cacgagttgt ccataataac atgtgaaaag tgaactattt
aaaaggtttt 6480tctgacctag tacatacaag gtgcgtaggt ttagctattt
tagtttttta gttttatttt 6540ttaaagtgaa gttagttatt gatctgaaat
catataacat gtacgtaccg tagatataaa 6600aaactaccaa gtatatatca
atttgaaata aacattattt taatatggca aaatcacaat 6660tgttgactag
acctaacact gaagaaaact atgtcatgtt tatcaattat gttgcataca
6720gttaaaaaca aatatgttag agaaatcgtt atttgaaata gaaaagttgc
gcaaaatagt 6780gattaacatc aaaatatgtt cagaaagttt ttataaatat
gtgatcttgc attgtctgtt 6840gactgtcgag gtttatgata atttcccctt
tttccaatgc aaaacttgtt gtgctatttt 6900ctaatgatat attttttcaa
agtatggaga agatcctaga aaggtatgag aggtattctt 6960acgcagaaag
acggctagct tcaaatgatc cagactcaca ggtagtgcat ttatgtaaat
7020atagatatac tcttcatgcc caagaagcct gaatttttta tcccactacg
tactgcaaag 7080ccaagtttaa ttgaataatt gtcctgttta aattatttag
ttttcagtac aataatgtaa 7140tcattagttt gcatgtttaa aaaagaaaag
cacaagttct gatcaagtga aatataaatt 7200gtaacgaaag agccaagcta
gacaattacc tagctaggag ttatttgtta tcgtttttgt 7260ttttaatttc
tagttttttt ttttaaacta gaaaatatag tttcaatctt ttgttatcag
7320ttttcaaaat gacatattta acataaatat gattgatttt aaattcattt
attatatcat 7380atttcatttc aaaataagtg aaacacttgt ctcaaaaagc
tcactctcac ataaatgata 7440aaagtgtttc acttatttta aaacggaaga
attatgactt ttacttttca taaaacgaaa 7500aactgaaata tgacaataat
ctcaaatagc ttggagaaac cagatttcta tatatttccg 7560tgatgaaatc
acttttcatt atacgtaggt aaactggacc tttgacttcg caaaactgaa
7620ggcgaagctt gaacttctac aaaggaatca taggtatgat ggcaatatgt
cataattttt 7680ctattattat ttttgcttcc aaaaccagac catatgtttg
tatatttata tagtgatata 7740ctccatccgt ttcattttaa tctatacatt
tacacttatc aggtatgtca atgcaaaatt 7800ttgaggatat atatctttag
ttttgtattt ataaaaatta taaaaagtac atattaataa 7860aatacatatt
atgatgaatc taacaagatc ccacatgacg atatttccgt ccgcgtatga
7920ataacaaata atggccaaag tgaaatttgt gaatagtgta aaatatcaaa
gtgtaacaat 7980taaaataaaa cggagggagt agtacttgtt tgtcacatac
ttacttattt ttgttctctc 8040cacaatgaaa ctgttctttc taataattaa
aaaaagtgca tatgttgatg atttctctgt 8100cactttaagt ggatattgaa
tagtgataat ggattacttt gtgtataatt gcatttcaca 8160tttgggtcta
attttatacc cttttcgcat atcatgcttt gtgaatagta catatgatgt
8220tcaagaatgt gagaagacat atcatacttt tgatatacct caaacatggg
tgtatactgt 8280atagtgaacg aaagtgttag tgtaatttta tttaggaggt
ttagtggttt gtcctatata 8340taatgctagt agttatacac catagttgtt
gatgagcatc aactggcttt cctaacattt 8400ttttctccat aactttaccc
ttaccttaca ttagattact ctaggattac attctaccta 8460aaatattatt
actcccatca ttttaggtaa atatttttac tttgattttt cgattatttt
8520caagagttta aaataatgat taaaatttac catgatcagg cactacttag
gacaagagct 8580tgactcactt aacatgaagg aacttcagag tttagagcaa
caacttgata ctgctctcaa 8640aaatgttcga tctaggaagg taagaaattt
tacttgtcta ccgtagtttc ataataaatt 8700agtatttggg ctcgggcttt
gccccagatt ggtattgtct tttcaaattt gatatgcatt 8760tttttccatt
tccactaaaa tatattaaga aaattcaaca tttaaaggat acaaatataa
8820taatgtggat acttaaagta tgattaaaat ttggttgaga tggtaattgt
gtcatgtata 8880atagcaagaa gtcacaagtt caaagctcgt tgcaagctaa
atttattttt gttgattgac 8940atgacttatc aacacactgg acaattctaa
tcatctagtg gagtagcata tactagcaat 9000ttatgcacgt gatgtgtgcc
ttactttttt agaatataat ttataacttt tttgagcata 9060aacaaaggta
aaatttgaac attagacata tttttttggg ctagctaaat ttgttgttta
9120aacctatatc acttaaccaa actcctcttt tattatttat tgatttatat
tttatttaaa 9180atttttaaaa ttaaaatgat gagcaataaa agaatgttaa
gtagatttat taagtatttc 9240ttatattttt atcaacaaag tattttgtgt
taattaaatt atttcacttt gttaattgat 9300tgtattttcc tttttaattt
attacttgat tgtgtattga ttgatcaaac ataatttttt 9360tgttaatttt
tttatgctat atttgaattt atttttcttt catctgtttt tggtagagta
9420gttgatttac taaagggtaa ttaaataaat ttattggggg acaccatagc
tcccccctcc 9480cttatataat agagatttgt atagatttat tgtcttcctc
aattattgat taactagtct 9540tctatgcacg cgatgcgtgt gttgattgtt
tgggtctatt cttaatataa atttcatcaa 9600aatataatta tagtagtgtg
atttacaatt attgctatac aaactactgt aatttataaa 9660gttgttagaa
attgagataa aaatttagat gtgaaatttt gtggtcaaat tatatttgta
9720attttttaaa ctgagtaacc gtttttctca tcatgtcaag ttactttgtt
aatgcttatt 9780taatttatta ttggaatttt tgacccatct ttaaattaga
aaaggatata atnnnnnnnn 9840nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 9900nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 9960nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
10020nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 10080nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn 10140nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 10200nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 10260nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
10320nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 10380nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn 10440nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 10500nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 10560nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
10620nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 10680nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn 10740nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 10800nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 10860nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
10920nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 10980nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn 11040nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 11100nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 11160nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
11220nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 11280nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn 11340nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnntatat atatatttag 11400ctaagaaaaa aaagacattt
cattggggga taccataact cccccttagc ttatataata 11460gagatactat
acttcttttc tgatagtgtc aaatttaatt ataaatcttt aacattggcc
11520aattaataat tggacaagaa aaaaatgaga caataataaa taaggcgatc
ttcacagacg 11580tattaacatg atggtaatta aaaatgttaa tcatagatct
ttgtgttatc ttaataatat 11640aaatttacta attagaatgt atcacataaa
gtaagtatta atagcagcat aggataattc 11700ttataatgga gattttatat
ttttttatat aattatatga tttattgttg aaaatattag 11760ttgattttaa
ctggttgttt attcaatgac agaaccaact gatgcacgag tccatttctg
11820aactccagaa gaaggtaata actccatttt ttactctcaa aggtttattg
tttttaactt 11880atttcttcta accttttata tatgagaagg tattgggtta
gacgcgtctg accataatat 11940taggtcggat gactttcagt tggtttcaat
tttatttcag ttggtttcaa tttttgtcca 12000gttggtttca atttttgttc
agttggtttc aatttttttt agctggtttc aatttttgtt 12060cagttggttt
caatattttt tagttgatct tttttatttc agttggatgt cttttaagtt
12120cagttactta tcttattgtt tcatttacgt gttttattgt aactgaaaac
aaaacttaag 12180taaatgaaat aaaataagtt ctaaataaaa gcaacttagg
gcctgttctc cccagcttat 12240tttcagttca gttcaattca gttcagttca
attcaattca tttcagttca gttcagatca 12300gatcagttca gttcagatca
gatcagttct tgacaatact tttactctca catatcacta 12360ttcatttcag
ttcagttcaa ttcaattcag ttcagttcaa ttcagtttag ttcagttcag
12420ttcagttcaa ttcagttgtt ttatgccgaa gagaacaggc ccttagnnnn
nnnnnnnnnn 12480nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn 12540nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 12600nnnnnattca tttcagttca
gttcaattca attcagttca gttcaattca gtttagttca 12660gttcagttca
gttcaattca gttgttttat gccgaagaga acaggccctt agttttcagt
12720tacttatatt atcgtttcag ttagttttct tattctttca tttaacaact
aactaaaata 12780aaaaaaaaaa aactaactga aagcaaaact taattaaatg
caaaaaatta agttctaaat 12840gaaaccacat acgatcgaaa tttcaatcat
ttcaaacatt atggtgtttt cgattctttc 12900aaagaaggca agctgctccc
gctattctac cctctttaga tcacaataaa gctcaggcct 12960cacattcaaa
gtttcctcaa agatggacgt tccaagtatc acatagacac atagtcctct
13020tctccaaacg ctctccttcc tatcttgatg tcattagcaa acttcttgat
ccagacggcg 13080ccaacaaccg caccatgatc tccctctaaa gtactgacgg
cccgtttggt tgttggtcat 13140aaatgatggt aatgggaatg aagttgtgtg
taaatttgtg aaaaatatca ttgtccattc 13200ccatggtaat gctaatttat
cttaatgtgt ccactttcct tctagaattt tcattctcat 13260ccaataccac
cttgtaaggt ggtaatgagt ggtaatgaaa attgcttccc cttggagaca
13320aaaatacaag tttaggagtg agattgattg ctcatggaga aaaaaagtct
ccccatggag 13380atattaaggg tgattcccta ataaaattac acttaaaatt
tattcccatt accgcaattt 13440attaacatct accaaacggg ccgtgaaagt
cttgaaacac atagtcgagt gagtagcttt 13500gaggaaccat ctgtaaaaga
acctgaggga gccaatgtgt gcgtaagtac caacggcgtg 13560ttgtcagtgg
aaaaggtggt gccgtggtgg cactcagtag tgatggagcc gccgtggtgt
13620ttgagtgttg ccaaatacaa aggcggaatt tcgtaatctc taatttcttc
tgtgaaattt 13680ttgggatcag cctgtccgac caaacacatt ggatcaaacg
gtctgaccca atagtttcaa 13740nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 13800nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 13860nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
13920nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 13980nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn 14040nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnga tataaataga 14100gatggaggac aagggtcttg
gttttgtcat gttgtcaaag agttgaacaa tggttttttc 14160gtattgttaa
aaaattaaaa aaccagaggc cttcaatctc ttgatagata tagatagaaa
14220aggaggacaa ggccgttctt ggttttgtta tattgtcaaa gagttgaaca
atgatttttt 14280cgtgttgtca aaaaattaaa caatgaaaaa agatggcggg
tgcttgatct aatagatcgg 14340accatggatt gaaggtcttt taacttattt
tatatatata ttgaacttat ctgaagatta 14400tttaactctt tgaaattgta
ttaacttccg aactttatga acttttttaa ttcttcaaaa 14460cttatctaca
ttttatttga aaaaatattg aagacaaaaa aaccctcagt tggtttaaag
14520ctgcggtaag atagagtgta aatgttattt ttttttatta aatcaagaaa
taaaaagaaa 14580tattaaataa aaagaattaa aaatggaaat gatgacagaa
acttatggct tggaggagca 14640atacttttaa gatagaccta aaccttaaat
aagttaaaat ggaagtaatt tttcagtaga 14700atcttattcc aatctatact
ccgtgtttac tccatgtaat gcacatataa taaaaaaatt 14760agaaattaca
tagtataagg tttgatcctg tgactgtaag tttatatact aacttcttaa
14820ccactagagc aagtgatatt
tagtgttatc attttaaagt ataattttaa caaatgaaat 14880ttttttctta
cccggaacat agctcggacc taataactag ttgaacaatt ataatctgta
14940acttaaaatg atcctaatta ctgtactttc attacctata ataatagaat
cttactatca 15000ttggttcaga aaaaaaaaat cttattaaat gttaaccatt
tatttgtaat tgaaacatac 15060atgcacataa atgtaacttt tagtttatct
taacttaaaa actgagaaaa tgttagttgg 15120aaacttttgt atatatgttt
ggataaacga cgctcaaaag taggggctaa aattttagta 15180gataatataa
gattatactc catctgttct agatagactt ctcattttta attttggcag
15240tattcataaa taaaggaaat ctttcaaaaa aatttccaat atataagaaa
aaaaataatc 15300atgtgcggtt ttgtttgatt cgtctcattg tgtacattag
gaaaattaaa cttatataat 15360ttttactact atgtaattaa agatattaac
gatacaaaat gtgtattgac aaacttatat 15420tggagtaata ggaagtctat
taagggaccg aagaaatatt acgtaaataa atctaataca 15480aactaatata
aattctactc cagacaataa agattctgtc ttatattgcc aagatatagt
15540agctatttat tttatcttaa caaacataaa tgtttctaat gcttaaacat
ggacatgtat 15600tattttgtaa aatattatgt attatccaaa gttacatatt
taaaggaagt tctattgctt 15660gctctctttt agcactgccc aaaaaggtta
aagtaatttt ttttctctgt ttaaaaaaaa 15720aatgcattat atacagataa
tttttgctag tcaataaagc tatccttatg acttatgagt 15780gctacttgac
tagggatgtg ttgtactcaa ttggaggtat acatacacca agattataga
15840gcttttattt tgcctataaa aaatggaagc cggataggat accaaaaaag
ctttgactta 15900aatttgtaat gcataaaaat gatgatacct aacttattag
ccatacttat ctaagcgtac 15960gtcaatttaa atattgtgtt attgattaat
aatgatcctt atatatccat attttgacaa 16020ttaaacggta aattagagag
aaaagtttga gaaaataatt atagcttacg taatgctata 16080atccaaagtg
tctccgcaca agcgtgggac aaaatagtac tttcggagaa gttacaatca
16140acagctaggg agtcttcatt gttcttgaat agaaggatgg aaacaaagtt
caccttcttt 16200tattaaagta ttaaggtttg ttattagctc aatatccaat
actttctctg ctttttatta 16260cttcgtctgt ttcaaattaa atgatttttt
ttttatttta cactattttt aaatttcact 16320tttaccatca tttatgattt
atatgtgaat gaaaacatag ttacgtgtga tcttgttttt 16380tttttttttt
tttgtnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
16440nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 16500nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn 16560nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 16620nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 16680nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
16740nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 16800nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn 16860nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 16920nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 16980nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
17040nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 17100nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn 17160nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnngtacat caaggaaatt tgcattaatg 17220aaaaacggga gtacaaccta
atggtaatac aaccaaacta aacagaaaga agaaacaaca 17280gacagtaaga
aaacctctat aacgcgtaaa aacaatttaa cataaacact aactagaaaa
17340agtccaggcc gaataacatt tgtcttgtgc gatggggagg gaagatgaag
gagagtgaaa 17400atctgttgaa agacacaact ctgacgatct tcatagagag
attgtcgtac aactgctagt 17460agatcctgtc cactaggcac gaaatcccaa
ggtccttgat gagcctcctg ggattctcct 17520gattcgttga acaaagaagt
cactttcgag agtaacttcg gaccctgctg cgagggtctt 17580cgaaaatcag
tcacattaga ctgaaacccg aaaaacaagg tggaggtctt gacgtgcccc
17640aaaacaagtt gggggcgtcc cacaagatgc gttttttagg tgtgatgaca
catgatgtca 17700tcacgagaaa ttggggcgag ttagtttgat gaactacgcc
ccactgacgg atcctaagat 17760ccaaagtgac aatctcgaaa ccagaagacc
gacaaacaac agatctgaaa catacaaaca 17820tgaaaataat gaaagcataa
actgccaccc gacatataga gctccggcaa acaacaccat 17880caagaacttg
caccaaagac ttccttggca tactaaagac actgattcca actaacacta
17940gcgggggacg gggaagggac actcgactac acctaaacct aaccagggga
cggggaaggg 18000gaacttagac taaaccttcg taaaaagggg gggatcgggg
aggggaaaac cttgaccaag 18060gaagctggtt ttaaaaacca cttagccgag
ccaaaaaccg tgggtgggaa gaagaaacag 18120accacaaaca gggggaaccg
ggggatggga actcaccgaa caggggaggg ggagaaatcg 18180cacagactcg
gggaacgcct aaggactggg ggacgaccaa cgaacgaaag gttggggtgg
18240tgcgaaaaca agggaggggg acgcaccgac gaacaaaaaa accgacgaag
aggccgaaaa 18300agcgaaaggc cgacggagat aagattgaaa ggcgacgaaa
aaagaaaaag gaacaaaacg 18360aaagaaaaac gaactcgtcg gagacccgcc
ggagacctac gcggcgccgg atctccggcg 18420agttctaggg ttagagggtt
tgttgtgttt gtttagggag aaggcagagg tttttttttt 18480ttacgtgtga
tcttgttaga tttgtcttaa catgtattct ttaatatact ttttttttta
18540taatttttgc aaatgcaaaa ttagagatat atgtcctcta aattttacat
tcacatacgt 18600gataaataag agtgctacaa ctaatttgaa acggataaag
tatttgaatt gtttttcatt 18660taaaaaagtt cgctatcatt tataatgtta
tatatttgcc aatatgttat ctctttctct 18720ctcttaccag agtttagatc
cagtagagtt agtaaataat tctaccacgt agagttgaac 18780aaatcatagc
cattgatttt caaatcattg gtttatatat tctttcccaa aactcccccc
18840tattttcccc aaaaatcctc cccctcctta tctctttcca taaaatctga
gtcgttgatt 18900ttaaaatata aggtttggat tcaactccac tatgtagagt
tttcatcaaa ctccaccgaa 18960tccgagcccc tcctaccata gtacttcttg
atttccccat atttctttcc tcatcttggt 19020cctcaagcac attttaatat
tatgggtatt aaacaataga gaaagtattt acttatagag 19080aaagtatttt
caatgattcc ctaaattttt ttttgaaaga aagaaaaggg atttcattaa
19140tatttcgcca aacggcactt acaagtcatt tctgaaaaac ataaaattct
aaaagaaata 19200catatcaccc tagaaatgta aacatcgcag atttgactta
attttgcctt aataaaaatc 19260ttcatctgaa gcaatgcaat ctgtgagttc
gctctggttc ggcatacgat ctgcagatgc 19320ggaaattttg ggatgaacgt
actccaatag tcttccataa ttttacagaa gttgtgaaac 19380cctaattctt
catgttgaat ctcgaacttc aaccaatgag aataatttct catacctaaa
19440aacaaaagaa ccatactcac aactcccata ggggagaagg agatttccaa
aacagaaact 19500aaaaacccca taaaagggtt tgagaaaatc tcataaagag
atactaattt attgaacaaa 19560acaagaaaat gaactaaaaa ctgaaaataa
aagggaaaaa ggggcttacc atggatgaaa 19620acatccatgg cagcccccta
attgatgaag aaggggtaag ggaggctagg gttttagaga 19680gagaaaagga
gaggggaggc taggttttaa aaaaaaatat aatgattccc taaatttact
19740tatatatatt taccaagatg acgtgatgtt ttacaaggcc catgattttt
acgcgatcat 19800gaaaaacaca gccaatttga atggagcaaa tatctacgcg
tcattttaga tatttttgta 19860tgggaaagtt ttttttgacc aatgtaatta
ttaagaagca tcggccaccg ggtagataag 19920atgtcactat acatcctttt
ccaaacttaa gtatgcctgt tgaacttttg ttgcgtttgc 19980agattcattt
gaaattatat ttcctcagat cctctacttg taaaagaatg ttccattatt
20040ttcttagttt acatgatatt tacaatagta tttgtctaca ttttgttcat
attacttagt 20100gatcagtgta tacgtcatat attagtttga actttgaaga
catttatttt ctatatactt 20160cctttgtctg ctaaattact ttggaaagct
ttgttttttt tattaatata agaccctttg 20220gagtttggaa atcactatct
aatgaaatat ataattcatc attagaacaa aaatacaaat 20280atcgtactat
cacctatcat gttccttttg gatttcgctt cacaaaaata cattttaaaa
20340aaaaataaaa taacaaatgg tagctaacaa cttattactt ttaaaagttt
gtgtgcaccc 20400taataagtac tcaaagtagt atgtaacaga gagagtataa
tgctaaaata caaactaaat 20460aaacaagaaa gtgtttctca acaataattt
gctgcaggaa ttaggaaaca aagtaaataa 20520attgcatgtt tatcatcaat
acaatttact ggtagttaat tacaaacttc actcatgata 20580attgaaagag
gccactcaat ttcagctagg agttgtttat ttatttattt ttctttcagt
20640taaattttga ctacccacaa aatcttcatc tggacctaat ctgcaatttg
tggattttgg 20700atgaaatttc taacctattt aagtagtctt attgtttaaa
taacccatgc aattaaatta 20760ggttatatgg gggtgattca tttaccaggc
ccaagatttt atctcattct caattattat 20820cgcaacaccc atgaacctaa
gccaacatga cttatttacc aggccagcta gagaagaaca 20880aggttgctga
ttttcttgtc cgtgattgta gaagaaatgt tagaaatcta aatgttgtta
20940gggatttacc cctcccccct actgagtgta tgaacttatt attgacggat
tgttgtaggc 21000ttccaagcca aaactctgat taagttttct tttatgccat
tttaaccaaa aaaaaaaaaa 21060aagctaggaa gctagctcag cgcgctctaa
ttatttcaca tgtgacatgt tttacactta 21120ttcatacttc tatatgcagg
agagggcaat gcaggagcac aataacatcc tgtctaagaa 21180ggtacttgca
cttgaccagt ttgtgtaata ttgtaattta atttcttaga ttttggttgc
21240atgctttgat gacgaatgac gattgacgaa tacattttta tgcagatcaa
ggagagagga 21300aaaaatctag agcaagtgca acagatgcag tggcagaacc
agcaccagca ccagcaccag 21360cagcagccgc caccgccgcc acaaaatcat
caagttcctc ctgatgcatc aaatttcatg 21420ctcccacctc caattccttc
tttgaacacg gggtagttac ttcttcaact taatttcctc 21480tattcaatat
taagttaaga aacagatcac gtgattagtt cgttaatatt gctaattaat
21540aatcatattg ttatatatca tgcattagtg ggtaccaagg acaatttggt
ggagaagtaa 21600ggaggaatga tcttgacctg acgctagaac cgatatactc
atgtcacatg ggatgcttta 21660caacatga 21668

SEQ ID NO: 2
Length: 780
Type: DNA
Organism: Artificial Sequence
Other information: BvAP1 cDNA

atgggaaggg gtagggttga gctgaagagg atagagaata
agatcaacag acaagtaact 60ttttcaaaga gaagaagtgg acttgtgaag aaagctcatg
aaatttctgt tctttgtgat 120gctgaggttg ctctgatcat tttttctcac
cgaggaaaac tctttgagta ttcttctgat 180tcttctatgg agaagatcct
agaaaggtat gagaggtatt cttacgcaga aagacggcta 240gcttcaaatg
atccagactc acaggtaaac tggacctttg acttcgcaaa actgaaggcg
300aagcttgaac ttctacaaag gaatcatagg cactacttag gacaagagct
tgactcgctt 360aacatgaagg aacttcagag tttagagcaa caacttgata
ctgctctaaa aaatgttcga 420tctaggaaga accaactgat gcacgagtcc
atttctgaac tccagaagaa ggagagggca 480atgcaggagc acaataacat
cctgtctaag aagatcaagg agagaggaaa aaatctagag 540caagtgcaac
agatgcagtg gcagaaccag caccagcacc agcaccagca gcagccgcca
600ccgccgccac aaaatcatca agttcctcct gacgcatcaa atttcatgct
cccacctcca 660attccttctt tgaacacggg tgggtaccaa ggacaatttg
gtggagaagt aaggaggaat 720gatcttgacc tgacgctaga accgatatac
tcatgtcaca tgggatgctt tacaacatga 780

SEQ ID NO: 3
Length: 29237
Type: DNA
Organism: Beta vulgaris
Feature: misc_feature (1029)..(1924), n is a, c, g, or t
Feature: misc_feature (20703)..(20722), n is a, c, g, or t

atggggagag
gtagggttca gctcaaaaga attgaaaaca agatcaaccg tcaagtgacc 60ttctccaaac
gtcggattgg attgttgaag aaagcgcacg agatctccat tctctgcgat
120gccgatgtag ctctcatcat cttctccact aaaggcaagc tcttcgagta
tgcttctgat 180acctggtatg tctaatttta taacttcttc ttttgtacat
caataatttt atcatcgact 240caactaaaag cttaagcaga tggttagggt
tctattatta ttgaattacc tcaaatttgt 300catcgactca actaaaagta
gagtatattt catgtagatc aggtgctttt tttgaatata 360ttgtcagttt
tagaactaca aaatgttgaa cacaagtatt tatacgcacg ctgacatgtg
420aattttttaa ttgacaactt tctaaattaa tactctaaat tactaatatg
aagaacgtaa 480tttattattt atcactttca gacaaaggca tgtttgtttt
ttctattatt tttcccatga 540aaattctcac caatatccga ttctgtatgt
taattttagt aatttctaat tttgatgact 600taataaattg taaaaaagta
taaaataaac aaatatccaa aacatctttg ttttcaagag 660aaatatctta
aaaacttttg ttttttaaga gaaatatctt aaaacacttt ttatcatact
720actatgatga tgtataaatc tattcaaaaa aaaaaaaaaa tgatgatgta
taaataattt 780aaagagttaa gtttattaga aattatagat atttatagag
ctgagtaata aaataatact 840ctacagatta tatgtagctg atgtagtgtg
tctgctcctg taagatttcc tttttatctc 900caaaaaaatt gcattgatat
tcgagccttg ccgaccccct tttcctcttc aaccatttga 960taagatccta
tgcactgagt aatctagtat tatatgttag atgttatata ttaataagct
1020aaaattgtnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 1080nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn 1140nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1200nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1260nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
1320nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 1380nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn 1440nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1500nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1560nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
1620nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 1680nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn 1740nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1800nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1860nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
1920nnnngatgat attatgtatc attattatca gtcatttatt attgatagat
aatgttatta 1980tttccataga tcatatataa tcttaccata tttactccct
atgattatat atttctatct 2040attaacatag taatagtgat atactaacaa
cgagtatctt tcaagtaaat aaatcatata 2100tatagggtta gaaggtaaaa
aagacaatat tttatgagac tttatttaca cccttcgtct 2160cataaactcc
tttctatttt ttgtgttcac tcgttttcta aagttttttc tacttctttt
2220actttttatt ggataaatac tttttccgtc tgtatgtgat ctacttatat
tagtatttat 2280agatagatta tactcttaaa gtattacatt cacaaaatca
tgtgaattat taattaaagt 2340aaaattatga gtatggtatt ataacttaaa
cgaagtagtt gtattgttta aaccaataag 2400tataataact tataaaactt
aaaagttgat tctacactta tcatgcactt gtgttttgat 2460gggttaaaaa
ttagctagta cataagtaac ataacataaa cttatctctg tatagtatgt
2520tgaattatta ctttatattt gaaaagaaca aagacacaat aagtccaaaa
gatccgatct 2580tttgattatc aactatgtaa gtgtctttcc taaatgatca
atcaccttaa ttagtatact 2640aaacaggaca attatgacat ataacacttt
tctatttgta caagttaatt aacccattga 2700cattcattca gttagcctac
atttttcaat aggtagtcat catcattctt tttttaacca 2760atttttttac
aataaccatt acctaccaac aacattttca aataatgaac tgaatgatgg
2820attccgtcaa ggaaattgtt cgtggacagt ttgtttacct cccaaatttc
ctttaatctc 2880atgctttctc catcatccaa acaatctaat accaatcgtt
ttctctaatt gcataaaccc 2940taattgttga atcctttaat cttgcctttt
cacattgccc aattccttta gtatgatatt 3000ttttattcga taatccctga
taagtaaatt cactaatcta atttcagatg gattgttgta 3060gagattgggg
aattgaagat tttttgcctt ctttgttctt gtttacaatg aaggatgaag
3120tttcatggaa tattgaagag aaatttgaga aaagaaagaa agtgtgggct
ttactgacga 3180caaaaactgt cattttggtg gtttttttca ctaaggcatg
tttggcacta gcgtttaagg 3240tagcggttag cgatttgaca agatcaaaac
gctacttaag aaaatgatga gtgtttggta 3300agatagtggt tgttgtagca
ggtagcaatt agagtagatt atgagtagcg gttgtggaat 3360gttactacaa
gtaacgtttg agatttagag gtagcagtcc agcaaagaaa cattgtataa
3420tagcataagg tagaattaat taaccaatgt ttttttattt ctttttcctt
ttattgttta 3480ttaataattt tattttatgc caattcaatg tttattttac
aatagcaaaa atgtaattta 3540aatatttatt ttaaacataa taaattttat
aagtatttta gtaaaattgg taatataaat 3600ttttcgaggg ttgaatattt
tcttagatgt atacatttct ttgaatagtt aaaaaatcat 3660atctcctttt
tgtgtaactt ctttgaaaaa taaatcttga atttaactat tgaacgaaca
3720tatgaaatta ttgtagttca tttcatatta taataaatgt taggtattgt
cttttacaat 3780ttcaactact tttggcaaac aattctgatt aaacagctat
tttaatcgct gatcgttgac 3840agcaaccgct aacagctact agaaccgcta
cttttgccaa acatgcctaa accaagtaat 3900atcggggatt ctttattata
taaaaagtta ataatgtgat tatttgaaaa aaaggttgac 3960tttatatgtt
gtttaaagaa aaaaaaacat ttattaatag aaaatatact atataggtct
4020ctatataccc agggcgaaat gaatgtgtat cttttcaaat gagtatgcgt
acatgttctg 4080taaatgcata tttcatatga gcataatgtt tttactatta
ttatgcacat ttgtgtttta 4140atttttcaaa tgagtatgta aggaaaacat
gtattcttgg catgtcagtg ttagtgattt 4200ttgttgttat ataaatgttt
tcgtgatttg tgaatgtggt acaaacattc atgtgccatg 4260gcgtttagca
aaacttttct agctcacatg atgcttcaag ctaattgcaa tgaactaata
4320taagggagag gacattttac gaattagttt tacattgata gtagtttgaa
gaagatagtt 4380taggagatag tttgttggaa tagaagctat gtgttagaat
tagttatagt catcaatttt 4440tgaataagac tcattattat ttcaatcctt
ctacctttta attactagtc caactctcac 4500tctttggtat taaattacac
cattcctacg gactatctaa caactctaac cacggcccat 4560actttttctt
cttaacataa aataatatta cgcttactaa ctactaactc ctatgacatc
4620taccttttca ataaaataac aattgataac tatataacaa cttataatct
aaaagtaagt 4680gtcttgataa gtgtagagta ttgtgggacg aagggagtaa
ttcatagcaa atactatcat 4740agcaatacaa tactaggaga ctaagatgtg
agttttgaac ttcaaaaaaa aatatacgcg 4800acatagttca ccggaagaac
tgcagcacaa caaatgcaaa tggggattaa atgaggagtt 4860cacctacatc
acacacaaga gcgattgagg attttcagat ctggaagaag agcgaaaaat
4920caggcgagag cttcaacttt cggagtttaa tggaagagct aatgatgatt
gaattattca 4980gttggattta tcttttgtaa gatgagggga tgaagttgaa
gattgccata ataacccttg 5040agggcgacac aatggtggtg ggaatggaaa
tatggaccac gaccgattta tgagtggaaa 5100gaattgaagt ctctgataca
atgacgtttt ggtatatcat cggtgctttc atggctgtca 5160gatctaccga
aaggaaaagt agtggtgaaa acaaatgtcg ttcaaaccaa ccacctttga
5220ctttcaatat tgaagtgaca aggaaaactg gcaccttagc tagtgacttg
gtggagtctt 5280tgcgtaaacc ggtggaaaag aagcttatct ttgagtttta
gtgtggtctt ggtgcttgtt 5340taagtggctt gactatttag gaggaaaaaa
atcaattgtg aacaattggt gagtaaccta 5400aaaatcaatt gtgaacaatt
ggtgagtaac ctaaaaatca attgtgaaca attggtgagt 5460aacctttaaa
ccagactcaa gagtgggaaa gaggcggtta gacaagtaga tttagaggaa
5520gaggaggtag aaaatagatt agaatttttt ttgggagttt tcaaatgtag
tggtatcggc 5580tcataacaag gtggtcaaat ggatggagag cttggtcgtt
tgagtagtct catggtggtg 5640attataacta cgttggcgat aaacaagttc
gctttaccaa gaagttgatt ggtggaagag 5700agattttcat aaagaggggc
tcgtatagca acagctttgt gtggtctcag tttcattgtc 5760ggcaaagccg
tgggtggtaa atatggtgat tttggagatt gttatgatgg agggagttat
5820ggtggtttca tgagacatgg atgctatggt ggtgctgtga tcggtggaga
agaggagctc 5880gtgaaaggtc actctgatgg agcgggtgta atggtggtct
aggttttttc ggcatcggag 5940gttgcgtatc tcgtgaaggt tcacatttgt
acaccggtga atactatggt ggtgaactag 6000gatgacagtc aaggtgacca
tagtagaaaa aaaaaacata aaccatgtag cttagatgat 6060ttgaaaaaaa
atcatgtttt tggagaagaa tcttaaatat tatgacagag gcaaacttgt
6120cattgaccaa atagatgaca tagcaattgc acgtgtctcg ttaaaatttg
aaaccataaa 6180aaattcaaat tgcacttcat atgcctttct tttggttgaa
aacttcatat accctaatgc 6240gtcaatatgg ttctttttcc aaaaaaaaaa
gtaaattatt tcggcgttag taaaagcagg 6300tccacctcca taatccattt
tattaagcca actcctctac cctacttttg caacctatca 6360tttcttattt
tctaaaatca tatcaaaaaa acaagtgtga acccaaaact aactatatta
6420tacctaagtc taatttcttc atccaatgtg ttcaacccca tttttcaacc
cttccactca 6480taaacccatc ttctttcact cctaaaactt tcagctcacg
ctcgtatcac ctctttactc 6540acatatcagc ccaaacgatt tcttattgaa
tccactaaat tatgtatatc gattttttca 6600ccaagtactc cgagttttca
aaaaatttac ctggtacccc caagttttca aactacacgg 6660gataccccta
agtttcaaac taatacactc agataccctt aatgactaac gacattaatc
6720gccgttagtc attaacctta attttctaga tttcaaccta attaaccact
aaccctaacc 6780ccaaccctaa ccctaataat aaccctaaac ctaaccctaa
ccacccctcc ccaaccctcc 6840caccacccct gccccccatc ctccactcct
gcgcagccag caggcccccc acccatttga 6900ttttaaggaa gaaacacgta
tcagggaagg gggagaactt agctctgaca gcggcaacgg 6960accaccgacg
agtctggact gtaggacggc agcgtcaagg cgcgagccgg ttgggctact
7020gcagttcaaa gcaaacaggg gaggggaatc gagccgagaa gagagctaag
gaaagagggg 7080tgatgggcgt tgcggagatg gtgcgcatag tggtggtggt
gtgggggagc tgtggtgggg 7140gagaaatcga atgggtgggg gagagagggt
ggggggctgc ggtggggttg gggctggagg 7200gggtagggtg ggtgggtgga
ggggtggtgg ttgggtggta gtgggacaac tctcaaacat 7260gattctttct
caaacatgat tctttctcat atcctttttt ggatttcctt aaaaaaatcc
7320atatccaaat aacgttgacc ggtgagggac aactctttaa cactattctt
tctcaaatct 7380tatgaaatct tcacatttaa ctcgatctcc ctttcaatga
acctaaagat atatattaat 7440agagttgcaa attcctaaac tctagaaaat
tcaaatacaa cataagagtc ctagattctt 7500ccacaagatg tattatatct
ttcaaagttt cccagataaa attagtaatt aggaaactcc 7560ttaacaagga
tacttaaagt tttatctaaa tcttgcataa attgaaatcc aaccataatt
7620atgaataaat aatcataaag aatcctaaca taaataacta gaaaataaga
taataaagaa 7680gcaacaaaag aatctcataa ccaccatttg aatcccgcat
gagaacccaa atagttgttg 7740ttccttataa aaacccacca cctttcttcg
ggtattatga cggtattgga ctatagtatg 7800agacgagatc tcttaatcac
caatcaacta ttgtaaactt gtgagcctga ataatttatt 7860tgagatacaa
ttctaaggtt gtttatgaac gtgtttggta aaattgttat tgataactcg
7920ttcgtggaaa ataaatgcaa aagtcaacat gccaaaaaaa gtgctaaaat
caactttcgg 7980ctttgcttga aatgttaagt tttaagctat ccaagagcca
ttagtcaaaa tctattgaaa 8040gcgtactcaa aaaccattta tcaaacaccc
ctacaaatcc ctttagaaaa caataggagt 8100tgtacaatat aagtattgag
ttataaagtt gatcaagtga tttaggaggt tgttccaaat 8160caatctacaa
gagtttgtat acttataccc cttcgttttt ttaattgtta cacttaggcc
8220ttgtttgaca aatagagttt agcggttaga gtttagattt tgctgttaga
gttttaactt 8280tttgttaaat agatttgact gctgatttga caacttcttc
ttataaatgt gtttggtaat 8340tattaacaga ttgctaaaag cttattaccc
tttatttatg tgaaatgaca tgtatagaca 8400ttttaatcca catgggtatt
attattatta ttattcgagg catacaagtc attgaatata 8460tttttaccta
atcctctaaa gaaaaagctc ctattaggag ctttttcatt tcgagagttt
8520ttattccaga actctcttca aaactctttt ttaccaaaga ataggagctt
tttcatttca 8580agagttttta ctccagaact ctctttaaaa ctctttttta
ccaaacaccc cttttagagt 8640ttttgactag tcaaaactct aaaagtggtc
caaatttctc ttttaactcc aaaactctaa 8700ttgccaaaca cccccttaca
cttttcacgc ataccaatgc aacactttga cgattaacat 8760ctccagtttt
ttatttgtaa aaattataaa gagtgcatat taataagtag ggctgttcaa
8820agtgcggtct ggaccgcacc aaaccgcaac ccaaaccgtt gtttcgcggt
ttggtttggt 8880ttgcggtttt aaaattgcgg tttgggttat gatttcaagc
aaaccgcggt ttgcggtttg 8940ggttgggttt ttatttttgt aaaccaaaac
cgcaccgcaa accgcaatgt tacatttttt 9000ttaaaaaaat aaattaaata
catttatgaa ggtgacatac aattataaaa ttgaaaaaag 9060aagtttgagg
taaaaaactt taacacttat gataaatcat tatatatgtt taattatgaa
9120ttcagcttca tatctatttg gactcttatt aacaattttc ttttaatctt
aggaaacaaa 9180agtaatgtcg cggaggaaat aatggttaaa ccgcaaccca
aaccgcacca aaccgttttg 9240cgcggtttgg gttgggttgg tttgggaaaa
agtgcggtgc ggtttgggtt ggaaaatttt 9300caaaccgtat atttgcggtt
tgggttgggt tacatcccaa accgcacaaa cccaaaccgc 9360gaacacccct
attaataaga tacatattaa ttcgaatttg acaagatcca catgactatg
9420tttttattcg cgtataaacc acaaaagaag gttcaagtaa aatttgtgta
tggtgtaaca 9480tgtcaatcaa agaacggagg taatatttgt caagacactt
tagtcacttc taaattccta 9540taaacaaaga aatatggaag aaaactggtg
atgaaaattg aaaaggtggg tataataaga 9600gagacacaat tctaaaataa
gaaaatatta ataataaaat aataagttac gataggcctc 9660atgtttgaaa
acggaaaaaa taaggagata gttcgtgtaa aaaggaggga gtaaagggta
9720atgcatactt tgtattgcaa gcttagtttt aaaaggcata agacgcaaag
cgcatcgagg 9780cacaagacga aggcgcatgc atctcgtagt tgaggcgtgt
aatgatttta cttcacaacc 9840acctgagcaa cccaatacag aacgaccaca
agaaaaatag aaagaaagga aatgattttg 9900attgatcagc agaaaataca
gagcattcga gaggctcagt ctctcccaag gactacaaga 9960tactactaaa
tttcacaccc ccttcagtcc ccttacaccc ttatttatac tacttctgct
10020ctcctatttt aacggctact gacattctct gagctggcct gctattcctc
tttttgtgct 10080gacatttctg aatattctgt ggtagtggct ccattctcac
atttggacag gtttacccct 10140ccatttcttt cgtgcatacg tcagccacgg
tttggggatt gaattcatta cattaccctg 10200cccccaaaga ctcaccttgt
cctcaaggtg gaaggaaggg aaacgttctt gcaccaaatc 10260tgcatcttcc
cacgtggctt caaaaggtgc taagtccttc catttcagta gcacttcagt
10320ctgcgtatgc ctccctctct gtgtctgacg tacgtccaat aattcttcag
gttccacaac 10380tagttccaag tctgctgcta gttgagttgg tacagtggtt
gctgcctggg catctccaat 10440tgctcgtttg agctgggata cgtgaaatat
aggatgtatt ttactggtgc ctggtaactg 10500gagtttatag gcgaccttgc
ccaccttttg cagaacaggg aatgggccat agaagcgggc 10560tgccagcttc
tcaaatggtc gcttggccaa ggattgttga cggtatggct ggagctttaa
10620gtaaaccaga tcccccactt caaaggactc atcgcgcctt cttgtgtcag
cataggcttt 10680catcttttgc tgggagcgta gtagatgaaa gcgtaaatca
tcgaggatgg catcccgttc 10740ttgcaacact tcctctaagt tatctactgg
cgtttgccct ctgcctactc gccacaagtg 10800tgtgggtcac gcccgtacaa
caccctgaat ggagtcagct tagtagacat gtggggagag 10860gtgttgtgtg
agtattcagc ccaagggagc cactttgccc aagtcttcgg gtgccccgcc
10920acgaaacatc tcagatatgt ctcaagtcct ttgttcacaa tctcagtttg
tccatcggtt 10980tgcgggtgat aggcggtgct tctccttagt gttgtccctt
ggagtcgaaa caactctttc 11040caaaaagtac tcagaaaaat tcgatcccta
tccgaaacga ttgatgccgg aaacccatga 11100agttttacaa cctccctgat
gaaagcttca gccacttgtg agagcactaa aaggatgacg 11160aagcccaatg
aaatgcgcat atttcgataa acggtccacc actactaaga tcgtgtccac
11220ccccttggac aagggcaatc cttctatgaa atcaagagtg atatcctccc
aaacctgagc 11280tggaatggct aagggctgca gtaggcctgc tggcttttgt
tgagagctct tatgctgttg 11340acaaatgcta catcgctgca cgtacaatgc
cacgtgcttc ctcataccta tccaatacca 11400ctcagccgcc aacctaaggt
acgtttttac ttcacctgca tgtcctcctt ctggggaatc 11460atggtaagct
atcatcaact taggaatgat gacggaagtg ttgggaatta ccattcgccc
11520cttataccgc agcttgccat cctccaccgt gaaccccaca agtggtttat
ctccctgcgc 11580cacttcttcc ctgagccttt taaggaacca atcctcctct
acctcttttt ggagctctgc 11640ccagtccact ccttgggttg tgattatggt
ccctagctcc atttcaccta cagtttttct 11700agaaagcgca tccgcaacct
tgttggttgc ccccggtttg tagtgtattt caaagtcaaa 11760cccaattaat
ttgcttaccc atttctgaaa atcagccccc acttcccgtt gttgtgtgat
11820gaaacgcaaa ctttgttgat ccgtatggat cacaaatctt ctccccaaaa
ggtaatgttt 11880ccatttctgg accgcaaggc atatggcaat taattccttc
tcataaacgg acttgtgttg 11940tgctctcggt ccaaggagct tgctgtagaa
tgcaatgggc ctgccctctt gcattaggac 12000tgcccccacc ccatacccag
acgcatccgt ctcaactacg aaaggcttat ggaaatcggg 12060catagcaaga
accggtggct gggtcatagc ttcctttaag tgagagaaag ctgaagtagc
12120tttttcggac cagccaaagg agtccttacg caattgctcg gtaaggggct
gggcaatttg 12180cgcgtattgc ctgataaact tgcgataata cccggtcagc
cctaaaaatc ctcgaagctc 12240cctcaaattc ttgggaactt cccactccac
catggccctt atcttctcca tgtctactgc 12300caccccatgc tgcgaaattt
acatgcccca agtaggccac tgtcttcctc cccaagtcac 12360atttcttctg
gttagcgaac agtttgtgca atgctaacag ctgcaacacc aatcccatgt
12420gtcgtgcgtg gtcctctttg gtcttactgt agaccagaat gtcgtcgaag
aagaccagca 12480caaacttcct cagatatgga cggaaaacgt tattcatgag
tgactgaaaa gttgctgggg 12540cattggtgag cccaaagggc attacgagaa
attcgtaatg tccttcatgg gtgcgaaaag 12600cagtcttatg ggtatcctcc
gggcgaacta aaatttgatg gtatccggcc ttaaggtcga 12660gtttagagaa
gatggtagcg ccatgtaact cgtctagtag ctcatcaatg accggtatcg
12720gatacttatc cggaaccgtc tccttgttca aagcccgata atcgacacaa
aacctccaag 12780aaccatcttt tttcttcacc aataatacgg ggcttgaaaa
tggactagtt gagggcttga 12840tgatgcctgc ctccagcatc tctcggatga
gtctctcgat ttcgtctttt tgaaattgag 12900ggtaacggta tggcctaacc
cccaccggat tactgccttc cttcaacgtg attgcatgct 12960catgccccct
ctttggtggc aggcccaccg gagtatcaaa aacttccgca aactgactaa
13020ttaccttctg taaaaattcg ggtacttctt gtgcctcctt caactccgct
tctccccttt 13080tcccatcatc ctcaatctgg ttgagctcca agagaaaacc
ccctttttct tttcggattg 13140cctttatcat ggctctaagt gagattttag
atcttgctaa ggaagggtcc cctctcaatg 13200tcaccactct gccctccact
tcaaactgca taacctgagt tttccagttg gtaatcactg 13260accccaattt
ctcgagccac tgcactccta atattaaatc tgagttaccc aggccgagag
13320gtaaaaaatc ctctgttact tcgatttccc ccagctttaa agtcacccct
tgacacaccc 13380cagtaccatg gacagcttca ccattcccta aagacactcc
aaatccccct gcatctgaga 13440tgaccaactc aagttcctca acagttaaca
aggaaataaa attgtgagtg gcacccgggt 13500caatcatgac caccacctct
cttcctttaa tttttctagt gattttcatc gttttaggac 13560tcatcaaacc
aatcacagag ttgagagata cctcagtagg aagttccggt ggtggttcgg
13620acggtggtgc acgagctgcg tcgctcacct cttcggtttc ttcctcctcg
tcatgcatca 13680gaatcacgct gatctctttt ctccggcaga tgtggccggc
ggtccactta tcgtcacatt 13740tatagcacaa cccttttgct ctcttctctt
gatattcttt ctcggaaagt cgcttgaact 13800ctacagattt ttttccttgc
cccccagcaa ttggatacgt gttcaaggtg ttggaatttc 13860cccctgggtt
ttgggcccac attttgctgg caggtgggtt gaggctggtc gttggattga
13920aggaagcccc tacactcctt gtcattcccc ctctgttata aatcgagtaa
ggcccattct 13980tagttggccc acttctttta taacccacaa tcctattcct
ttcctcaatt cggcctgcta 14040gttccattgc ttgctctagg tccataggat
tgagtaacct gacctccact ttgatatcct 14100cttccagccc attaatgaac
tgacccatga gtatttcttc tggtactcta ctcaacggtg 14160ccgccttctc
aataaaagtg cgtcgatact catccaccgt ggtggtttgc tttgtggcca
14220accaccgttc ccacaatgaa ccatagtggg ttggtcgaaa ctgacggagg
aggtactcct 14280tcagatctgc ccaccacctt atcggccgcc ttttattctc
ccactggtac cacctgaggg 14340catccccctc tatagacaca accgccgcct
ccagggcttc actgctactc aggccataaa 14400acgaaaaata tcgctcggct
ctaaggatcc acccatccgg atcggaccca tggaaaatgg 14460gcatctctaa
ctttcgatat ttccagttcc ccccggaagt cgaacctcct ggccctccac
14520ctgagccgcc atccccatac gatcggcccc cgagctggaa acctccatat
tcgtcccctt 14580ctcggccccc cagatttatc aggtcaggag ccctccgttc
cggcggtcgg gggtgtccag 14640tgacggtttc gggtgtctcc cgcgagggtt
gtggcaaggt tgctcggatt tcaacctgaa 14700atttcctctg ttcagcacgc
aacccttgaa tcgtcccatc ctgtgtctcc cttgaccggt 14760taatccggtc
ctccagccta acggccaaga cctccatctc ctcgcgactc ttcctcgccg
14820cctcattctg gccttccaag atttgagcgg tcaacgattc ccccatggca
ttaatggcag 14880attccaccgc cctggacacc atggtggcca ctgacccttc
gagggctgcc attctttctt 14940ctaatgaatc caccctctgc acttcgtttc
ttggtgccat ggatcgatgc tctgatacca 15000agtgtaatga ttttacttac
ttcacaacca cctgagcaac ccaatacaga acgaccacaa 15060gaaaaataga
aagaaaggaa atgattttga ttgatcagca gaaaatacag agcattcgag
15120aggctcagtc tctcccaagg actacaagat actactaaat ttcacacccc
cttcagtccc 15180cttacaccct tatttatact acttctgctc tcctatttta
acggctactg acattctctg 15240agctggcctg ctattcctct ttttgtgctg
acatttctga atattctgtg gtagtggctc 15300cattctcaca tttggacagg
tttacccctc catttctttc gtgcatacgt cagccacggt 15360ttggggattg
aattcattac aaggcgcacc tttggtacca agagacttag acatcaaata
15420aagaagcatt gtaatactta gaataaacat tgtttttata ttctaaaaca
catattttct 15480ctaagacacc tagtaatctc atagtggatg tcatctagtt
taggtggtaa ggttttattg 15540agttggtact caagacctca gacttagagg
tggcaaacgg atcatacggg tcgggtgaaa 15600atgagtcggg tcataatcgg
gtcacctttg tgtccaggtt acggtcaggt cgagttcgtt 15660cgggtacgag
ttcatattga gtccatgagg tttcatgtca tatcgggtcg ggttagattg
15720gatttacaat ttcgcaaata aataaaacgc atataatact aaagagagta
aattaaataa 15780ttaacggaca ctagctaaat catatattag tattttatga
tgtattttcc ttaaatttat 15840ttaaaaaata actaatatga caatttttcg
ggccgggttc gggttgtggt catcattatc 15900gggtcaattt agtatcgggt
aggctcgggt tcatgtcata ttcgggtcta ttttaattcg 15960agtcgggtta
tttcggattt aagctctatt tcgggtcagt attttcgatg aagaacgggt
16020ttcggatcgg gtcaccggat acggatctat tttgccgagt cagactgctt
ttcaaaccta 16080ataatctcag tttttccacc tattcagatt tgcctatgat
ctctatttag ataaatgagt 16140acaatattgt gtctatccat gaaaatgaat
atctcacaat gtaaaaggat atctctaaat 16200ttcactaatc actctatctg
ttttgaataa taatattcta ttttattgca tgtagtaaag 16260atcgagtatt
tagtgagatt tggagaaaag aggaggctag agagaaacta ggatttagag
16320aggagaaggg ggctctgtaa cacatacaag atagatactc ttttacacta
acttttcaag 16380atactcaaca tataaaatca gcatcatctt ccaaacaaca
actttaagcc acccatgaat 16440cttaattaga taataaaaca taatcgtgaa
tcatctatcc tttgtttggg gggatcctaa 16500agcaattgag gaaaagcttt
gatgcaaata tcaattgtgt aaaaaagcaa gtattcgttc 16560gtgatgttgc
tatactaggt tattttttgg atccaaaagt cattcctact agaatcattt
16620aggaaaattg tcagtatgaa ttttaaattc aggttataac caaagataat
tgaaaattgt 16680caaacttttc aaataattcc gaaataaaca tgtttgtaac
atggataaac ttttcattgc 16740ttttcaaata attccaaaat aaacatgtta
taacatagat aaccttttca gataattcca 16800aaataaacat gttgtatcat
ggataaactt ttcattgttt ctagtcactt aaaattctaa 16860aaaaatcttt
cctccctact gttactctct ctagcaccaa atctatcaca tgagaaggca
16920gaggttttca aaataaaccg ttacttaatt tggtacttat ttcttgatcg
gtgttcatat 16980catatgagtt cctactctat atctctctac tcttctaaat
ccttgtgtca cttcctgtgt 17040ttcataaata aaaaggagga agtattagtt
ttgaaacgaa aggagtatgg tgcatacatt 17100gatagaaaaa agaagttatt
tgtccttatt tcactcatat aacaacacca aattctgtat 17160tgttatcaca
aaataaaact tggattatct ttgtttcata gcccaaattt agaattagtt
17220tgtcagattt ccaatcatct aattacaata ttagagctag acctaggaca
aaaggtgggt 17280ttggctactt ggtaatagct atgtctagtg ctaggatatg
tcattgtcgt agaaccatgt 17340tatggacatc ttaagaaaca aggttaacct
aattggttgg agatcctact ttcactttta 17400taataaagtt tcgattcttg
cctatttgta aagtagaatt cctaaatttc ccttcactga 17460tatttatctt
aacataaaaa aatgttataa acattgggat tgtatataag tcaaaataaa
17520ttgacaatct tggtaacaac taagttaaca ttaattttat aagtaaatga
ttaatcccaa 17580tataatctct tatttagtaa atgagacaaa cttgtacacc
ttcgtgttag actcgttaat 17640gttcgctaac aattcattca gtagtcaaca
gcattttaaa tttgaaataa gtgttcttgt 17700ggtttttgag agatcaagca
agaaaacatg tctctcccct ttgaccaact aattgggatt 17760aagaatacta
gttttaagat tttaagaatg agttatagtc tttcttagac cgctacaatc
17820cccttgttga tatgaaccag atatattttg tgttcaaata gtagatcaat
gcattgttga 17880taatcctttg ttaatgtact tgttgatctt attttgtact
tttggtagat gcgctatact 17940ttctttcgat tgctcatttt gaactcttaa
ctacatatgt tagtttaagt agatgattta 18000gaattgctat ttcaatcttc
aataagcaat ttaagttgtc aaaccttgtt tcacatcatt 18060agggtgaaag
ttatttggat aaagacctat atctaattca atccaaagca aattagtaat
18120gcggattgga ctcaaactat gtttagattg gattcgaatt gagtttcttt
tctttttctt 18180aaaaaaattg gattttccga tcgaattgag ggtgattaga
tccaaaataa ccgaatagta 18240gataggattt gtgttgtata ttagaattgg
gcttaaggat ttccatttta acaaaaaaac 18300caaatggtcc gactatcaaa
aactataatt tgatagtcat gcctatcgaa aactttattt 18360tcattctcgc
acctaattat gggcttgtat aaattagttc tactatcgaa aactaatttt
18420gatgctcgtg cctaatttaa attttcgaaa aaatgaagtt aagaaaattg
gatatttcgg 18480attggatcca atatatcttg tgaattatta atttggatta
gtttggactc aaattcttat 18540tggattggat tcaaattaaa agattaaaat
tcaaattttg ttcgaatcaa attggagtag 18600gcttaagttt aaatcataca
ccgaactttc accactaccc atcatgctta agcttctaat 18660gtaagagagt
gtttgggagt tgagctcgaa caactaaatt tctaaaagaa ccaagttcaa
18720acaagaaatc taaaagctcg attaaacttg agtcaagctc aaacacctat
attccttatt 18780ggagcttgac tgaagattga acacttattc cttattaagc
tttacgctaa aacattgctc 18840gactcaactc ttctacatcc ctatagttca
aaagaaatag ttgtgggctg tggtgctctt 18900gtagaccaac gcactagttt
aacaaagcta agtgcctgac tgcaattcca tacacattac 18960gatcaccatg
acctagtttc agctcacact ttggaagtct aatttgaact tgttctctac
19020ctccaattca ttgtggggta ggaggcgata gttaagggat caaaatctta
tgatataact 19080tgcataggct atgccactat ataatgcgtc ttgtgtccca
tattagttta atcaaattga 19140aatgttttac catttatatc ttcaattatg
tatggatact aatatttgat ttgacgtttg 19200atatgatatt aaatgtggac
tgttattctt gatgtgcttg agaagctttt ttggggccag 19260ttagaaacta
tattcctttt atggtcctaa ctaggttgtt gttggtgtgt tcccaaataa
19320cagcatggaa aggatactcg agcgctatga aagacactca tatgcagaga
gacaactgac 19380tgctccagat cctggatccc atgtaatcca gctaggcaac
tatcttttct aagcatttaa 19440atcgttgaga tttcaatttt aaatgtgttt
taactgataa ttcatgcatt atatgcttag 19500gtaagtttga ctctggaaca
cgcaaaactt aaggctaggc tggacattct tcagaaaaat 19560caaaggtaat
aagatccaga ccaaatataa tttgtataat aaccacctta tgaggaaaat
19620ttaagatcct tgataatttc aggcattaca tgggagaaga acttgatacc
ttgagtctca 19680aggagcttca gaatttagag catcaaattg acagtgctct
taaacacatc aggtcaaaga 19740aggtagtttc acagttgcat tagatcatct
tatggatcaa ttggatcact tgtttgtatt 19800ttagcgttgc tcaacacggt
cgtctaatat agtgtgcaaa acgacctaca gggcaacacc 19860ttttataggg
ctcgaaaata cgaaaaatta aatgtttgtt ttagtcatat tgttcaaacc
19920caagctttat cttgtcaaaa atattttata atgattattt tttagaatac
attatttaca 19980tttttgcaat ttatgcataa tacttctaag gtccaacttt
ataattgaaa tagaagtcct 20040taaattttaa agacgacctt gaggaaacct
aatttcttct catatataat taaatcaatt 20100attctacaag ttagtagaac
aaatactaca ataacaacaa tattgaagcc ctaatctcag 20160taggattgga
ttgattgtat gaagtcttat tagtggccgt taaatgtttc ttgtaggtca
20220agatgacatg gctcatatag taaggttact tgactaaaag acgaggattt
gtttcgactt 20280agattttaac aagtttccct catttgttaa cacctaagcg
tactaaatca aattctaggt 20340tttactcact caaatttccg atttaggaag
ggcttgagga tagttgtatt atcgtaactg 20400actaatcaaa ggagcctctc
ttagatcagg tttcacttgc caattctaac aacttgtttg 20460gtaaaaggaa
tttggaatga aaagaaagga attgaaaaga aacattctac ttttcaatgt
20520ttcattcaaa aataacattt taagtgatag gaaatggaaa gaagtgaaac
gaaagcctct 20580ttacaaaatt atcatttttc tacccccccc cccccccaaa
aaaaaaaaaa aaaataagta 20640gtaagtagta gaagaagaaa taaataacta
acaagagtag taagttttta cgttttcttt 20700ctnnnnnnnn nnnnnnnnnn
nnattcaaat cagaactgaa tagtcataac cggaagatta 20760gtttctctct
agcgtgacta gggtttgagt aaaaagagaa aacttaaatc aaacatggga
20820tattaaggtt ttttttcctt tcttcagttc ttttctcttc ccaatccttt
cctaaaaatg 20880aaccaaacag gctgcaaggt tttcacttgc ttaacacaag
atttattttt aaaaataatt 20940acactccaaa cttttaagct taaaaccaat
tttaattcaa atcagaactg aatagtcata 21000actggaagat tagtttctct
ctagcgtgac tagggtttga gtgaaaaagt ctagggtttc 21060atgtcattct
tcttgcttcg agtcccttct tgggattgtt gttagccatt atggctaccg
21120aaatcgttat taaatgtcta aatcttagaa ttactgctga agaaaacaac
ttggtgtttc 21180tcgaagatgt tgatgataac tcgcagcacc atacgctcgc
actggcgatt gttggaaagg 21240ttctttcgtc aagaccatac aatttcgagg
cacttaaatg aaccttaaac tagatatggg 21300tgatatccaa aggagcccta
cttcacccta ttgaaaacgg actttttgtg gtacaatttg 21360cgacaattaa
ggaccgatct aaggttctag tcagcagacc atggaccttc gatataaacc
21420ttgttctctt agatgctatt gaagggggta ctcaatcttg acccattgcc
cgttttggac 21480tcgcttgtat aaccttccta tggactgccg atatgagaag
ttcatcaaaa actattgttg 21540gtgtattggg ggaggtattg gaagttgatt
ttgacaggat tgtttgggat aaatctgcaa 21600gagtaaaggt gaagattgac
attacaaaat cgttttgtcg tgtgcagatg atcaagacta 21660acaggggtga
ggctgtgatg atcaatgtta agtatgaaag acttcctaca atttgttatg
21720tgtggaattc tggccatatt gaaagagatt gtgtgaagac ccaggaagaa
gagaaacaag 21780tggagagaca atagggggtc ttggaggcct ctccgcgtag
gggacgatta aagatggtga 21840aagagtcgaa agccttcctt cagtgtgctc
gtacactcca ctttaataac aaggaagaag 21900taaggggtga ggaaccacgg
gattatgtgg agccgagggg ttattgtcgg ctatcttagg 21960gggtaaaact
ttggtggtcc aggagatagt ggacggctct aaggatgcca tcgaggaagt
22020tcgtgctgaa ggtgcaccac tctagccccc ttgtaccctt tgggtaatgc
catgctacct 22080tttacttttg ctgttgggag tgctaatcct actccctccc
accgaaaagt taaaattaaa 22140aacaaggcaa gggttcaggg tgttttgaac
caagttaatg ttgtgggtgt tggggggttg 22200gctaataatg ggggttgtga
gaaaaggata ttccccaacc cgatggtgtt agaaaaagaa 22260aaggggttca
atgaagaggg tttaagatag caaaacgaga ggattgtatg taacctatca 22320gtagggaggt
aactattgag gtggaggtgg gcgagaccca accccgcccg acattatgaa
22380tatcctatgt tgcaactgtt ggggattggg caacccccgg gaagttcgga
tgcttcgtag 22440gtggagcaat agtgctacac tgagttcggt ttttatttct
aaaactatga ttagtggtcg 22500tgatgtggaa agggtgcaaa gcgggtaggg
ttttgattgg gcaattgggg tggatagcgt 22560tggaacttca agagtttggt
gcatttattg gaaagctggg gaagtggact ttactctagt 22620ctctctatca
agtcatcata tttgtgggaa tgtgaagctt gttgatggga aggtatgatg
22680cttagtgagt atttatggtt gggcggatac aattcaaaag tataaaacat
gggagcttat 22740gcaatccttt cactcatatc atgggccgat attgtttggt
tgggacttca atgagatttt 22800gacaatcgga gaaattgaag gagggtccga
aactcaatga agtaacatgc ataattttct 22860agaaacttta gatgacatga
agcttaggga ccttggctat tcgggaactt ggtatacata 22920agagagaggc
tttaagccac ggaagagaat gagggagaaa cttgatcatt ttgttgcatc
22980ttcatcatgg tgtgacttct ttccgaaagc tacagttgag cacttgatgc
gctacaaatc 23040ggaccacact cctattttgg ttcgccttgt aggccatcag
tgaagacata agaagaaaaa 23100gacgtagttt tgttttgaga ctgcttgggt
gcttgaggaa ggttgtgagg cccaatgggt 23160gagtcatggg ccgggtttac
tcgcgaggta tttatcgagc gctttaaagc cgtggaaggt 23220gggttcaaag
caaggagtga tgggtctctt agtaatctgg gcccgcgtgt gagggagatt
23280gaggaggcca ttatagatgg gaggcagcga agcagataag gactatgagg
ctctatgaga 23340ctcctctccc acgaaagtta gacgaggtgt tggacaagca
ggagacgttt tggtttttga 23400ggtctcgtgt gagttagata aaggatggtg
atcgtaatac acaatatttc caccacaaag 23460cttcccaaca caaacgtcgc
aactacatag cggggatgta tgataataaa ggggtgtggc 23520aagataacga
agaggatatt gaagggaata tttcagagta ttaccaaacc tcgttcggtt
23580cgtgctcccc ctctaggaag aacgtcgcgg ttgtccttga ggttgtgagc
ccggtgataa 23640ctgatgatat gaatatggcg gttatgaaat cttacactaa
agatgaggtg tgggaagcac 23700taaaccacat gaagcctaac ggaatgcatg
ccatccttta tagaggttct ggaatacctt 23760ggagatgata ttacatctgt
cattttaggt attattcatg gcacccgacc ccagatgttt 23820ttaacaagac
taatattgtg ctcattccta aagtcaaatc cccaaatctt gtttctgagt
23880ttcgcccgat tagcctctgt gatgttatct ataaacttgc ctcaaaagta
cttgctaaca 23940gattaaaaca ggtttgcctg acattgttta tgataaccag
agtgcatttg tgtccggaag 24000ctatattacg aacaatgctt tgatttctct
tgaattattt gactctatga aaaaatgata 24060cagagctagg aaaggttttg
tgtcgatgaa attggatatg agtaaagcct ataaaagagt 24120tgagtggtgt
tttttcagta gtgtgttgga gaagttggat tttgctgaat catgagtgaa
24180tgttgttatg agatgtgtgt cttttgtgca gtactctttt gtggttaatg
ataatatatg 24240tggagctctg acaccctcaa gggggctttg acagggagac
cctatatccc cgtatttgtt 24300tatacttgtt gcagataccg ttttagctct
tcttagcaag gcattcaaca atgcgtggct 24360atacttgata ttctcaacaa
atatgaggca gcatcaggct agaaaataaa tattgacaag 24420tcaggaatct
ctttcaataa aagatttgac gtattttatg gccatgaaac aagttgagaa
24480gcatcagaaa gacttggtat cccaactttg gctaggagtt cgaaaaaagt
catatttgct 24540gacattcaag agcgaatttg gaagaagctg cacggatgga
gagaaaaact tctcgcgggc 24600ttgaaaagaa actctcttaa aagttgtggt
tcaagcaatt ctaacctatt tggtgggcgt 24660ttacagattc ctaaccagta
ttatccaggc cattcatttg gccatggtaa agttttggtg 24720ggggtcgaaa
agggcccaca attcgatgca tctgggggga tatgtgctca ccaaaatgtt
24780taaggagcct tagctttaaa gacttagggg tgttcaatga acctaaacta
aggaggaatg 24840cgtggcattt gattcctgct ggtgagtccc tttcgggtcg
agtgttctcg gccaagtact 24900attcgaagtc aacctttttg gactcatttc
taggtccggt aggtagcttc tcttggaaga 24960gtatttgggg ggccaaggca
ttagttaagg gtgttttatg gtgcgtaggc aatggcagac 25020aaatcaacat
atggcgtgac tcgcgggtgt tgaatggtga tagtaggttc atccccggag
25080agcgcgtttc aggccttgag gatgtttgtg atctaataga ttttgcacaa
tggagtgcga 25140tgtggacctt gtcacgattg cttcaatgaa gatgatgctc
aagccatttt agtcatacct 25200ctaagtaagc gccttctgaa ggacatggtc
tcttgggctt tcactaagga tgaatttttt 25260ttgtaaaaac aacctatatg
gccggttggt cgaggaattt gaatttgttt cacaaagcat 25320ggctgcaaat
ctggggcctt aacgtgtctc cgaaggtctg ccacttcctt tggcgtttat
25380gctcggtacc cttcctgttc gagctctttt aaaacgacgc cacataactg
atgatgattc 25440atgtcctttg tctaaaggag cccggaaagc atatcacacg
cgttgttcta ttgcccatat 25500gtagccgaag catgggagag tgcgggcctc
acaaattgtt tgcctttgtt tgatggggct 25560ggtatgcttg atgcgtgggg
ggagtgggaa acaatcgatg actagtccct tgtaagactt 25620agcttcttgg
cttatcactt gtggtttagg cgaaataaat gtgtttttga aggggtggtg
25680agagcgaatg agagtgttgt ggaatatgcc actaaagcta ctgttgatca
tggtttgtat 25740agtgcccgca tttatggtgg gtcgaaggct accgcatcca
aaagctcgaa ggtatgggtt 25800ccccctccag cttgtcgtac gatctaaagg
ttgatgcatc agtggggaat ggtggatggg 25860tggggctagg agtaatcgcc
tgaaactaga aaggggaggt gctcgtggct gcaactagga 25920gggtcagagc
ttgtggcccg tggaaatggc tgaagggaag gctctttgtc ttgctcttag
25980gcttgcctcg ctcatacaac ttgcaagaag tgatcgtgga gtttgactgt
caatcttggt 26040gaaccatctc tccaagggtg ctatttactt tgcattttta
agtcaaagct tgaaccttga 26100taaaaaaatc ccgttcgaca tgaaaagtgc
cttgattttg cgggtttggg agtgccttat 26160tgattctggg gtttgatttg
taacaccttt agtaaaaaca tgtaagctaa ctgtaaaacg 26220aacattaatc
aaactaggat atgtaaaatt cctaaatcaa gaagaatttc cacttgtgct
26280gaatttgtcc accttgcatg acacccaata aaagcccatg tctcctagaa
ccccttatgc 26340cgccttattc atcttttctc aagttgagtt ggagtcctct
atggtccact cgacttcttt 26400agcacactct cggtaaaaac ttttaatatt
attttatttt agactccacc atcttgacat 26460ttattccttc ttaaacttgc
ttcacacaaa catctaacac tagaattcta tatagaatag 26520cttgaatctc
tcttaggata accttatagt aaatgcaact acgcctatcc ttaaaccttt
26580ctaagaggag ctttatcgta tttacattcg cttcactttg aaacgtcgct
aagtgtatgt 26640tgcactttcc aaaccatgtg ttagctaaga ccaagttata
tgactgcata acctaaatag 26700tcttcctcga gaaaattcac tagttggatc
ggaagagttt gtgtaaatct atggcggcgc 26760gggactgggc cttcacgatt
tcaagtgttt taatgcagct cttctaggta aacaagtttc 26820ctaatagcgt
ggtgactcaa atattgagga cttgttgtta tactaatgct attcctggcg
26880gcacttaagg ggtgaaatag gtggtacaaa ggagtgcttg atggcgtgtg
ggtagtagtt 26940gaatatatca gtatgatcaa gtccatggat ccctcgtact
tattcgtgca agattatttt 27000tccacgaggc aaagcgagcg agaatcttaa
ggtttatgat catattcatc ttgtacgtgc 27060taagggtaat gtccctttca
ataatgagct atttctcctt ttgagcaaga gcgtatctta 27120agcattcctc
gtagttctcg tctccccaac gatgttttat gttggaatct gaatttggag
27180aaagacggag acttttcgtt cggtctatcg agccattctt ttgagttgga
tggcgagagc 27240gtgatttcat cgtcaatacg ctctaattta tggagtataa
tatggcagga tagtaccttt 27300caacgtgtta agcttttatc tgcattgtcg
acacaaaggg gattgagtaa gcctgtgccg 27360agtatggaac cattgtgtaa
tctttatgcg ttggaggatc aatggagcta cacttcttac 27420gagactttgt
tattggaagg gcttatatgg gatccaacta gggtagtcaa aacattggtc
27480ggggctgcgc tgcaattttg gggacttggc accggcgttt ttggagtagc
tccctcatgt 27540ggaacatagg cttttgatga cgatatacta ggctagatgg
aatataagag agaggtgttt 27600gtttgaggag gaggtttgtg atccctatca
aaccacatgc ataatctcat ggcgtagctt 27660catgtggaac actgacatat
tgcatgtgta gggttaggtg cttgcctagg acaccacatg 27720ggaagagctc
cgttaagctt aatgtggatg gagggtgtgt ggaagggttg ggtgcgtcca
27780ctggagtggt gattaggggg atggatgaaa aagcgttgta gttgcaacat
agaaaggtgg 27840aggactgcga ggaaccgtta aaaaggctat attttatggt
gttcatttgg ttgtggaagt 27900cgatttttga aatatggttg ttggaagtga
ctatcttcac ctcgttgaag caacttcttc 27960aaaagtggaa ggcaaaaata
gcttccatgt tattgttgat gacattgttc atggtagtgg 28020tatgttaaat
acttcgtctt gtagttttgt tcgtagggat gggaataggg tttctcacga
28080actcccccat ggaaatatca taatggctca tgctcatagg taaatgaata
agctaacaaa 28140attgttcatc tttgcagaac caactcatgc atgaatcgat
ttctcagctt cagcgaaagg 28200tagacagctc tagagagagc atctagtctc
aaaaatacca tgagattctg tagtgtcctg 28260acatgtttta tatgacagga
caaagcgtta aaggagcaca acaacttgct atccaagaag 28320gtataagttc
agcaagattg tttagtaaca ttgttaatct tgctgattgc tttgaaacat
28380gtcttgctat ggttaacaat gttgactgaa ccaaaatagg tgaaggagag
ggagaaggtg 28440ctggctcagc aggcagaatt ggatcagcaa aatcatgaca
ataactcatc tggctttgtg 28500atgtctcaag ctttgccctc actgaataca
gggtcagtcc tcaataacct ctaatcattt 28560ttccaagatc caaagtaaac
atggtttcat aatttaatta agattttttt gaaccatgtc 28620tccatacaac
cttactagga ctaatactac taatttaaga ccccaacgat aaacaacaat
28680aattagccat atctggctag caccttttgg acaacacacc acatgagact
cttggccaac 28740ttctttgatt tccttcagtc tgatagatat gaatatcttc
tgaagagctc tttggttcat 28800aattattgat ttagaaaaga attcagcaag
gtgagtcatt tggtaacctt aaggtcatta 28860tgggggtact aaatcaaagt
gaagatatat ttaggtggca tcagaagaga tgatatagat 28920aggttgtatc
ctgtcgatag gttatttgga tatgtatcaa aagtttcttt tataatatat
28980ctatactgat tggttgatgt atcaaatatc cctacagatt gtgaaaaaat
cccctacaga 29040ttgtgaaaat atccctagaa cctgtgatga tataagatgt
gctccgcatg ctttattgaa 29100cataatgtat tcaattcttg aaatgcagag
gaacaagcag cagtgcagtg gaagatgaag 29160caacacaacc accaaatcta
aacagcaact ctgcacaaat accgtcctgg atgcttcaac 29220acatccaaga gcagtaa
292374735DNAArtificial SequenceBvFUL cDNA 4atggggagag gtagggttca
gctcaaaaga attgaaaaca agatcaaccg tcaagtgacc 60ttctccaaac gtcggattgg
attgttgaag aaagcgcacg agatctccat tctctgcgat 120gccgatgtag
ctctcatcat cttctccact aaaggcaagc tcttcgagta tgcttctgat
180acctgcatgg aaaggatact cgagcgctat gaaagacact catatgcaga
gagacaactg 240actgctccag atcctggatc ccatgtaagt ttgactctgg
aacacgcaaa acttaaggct 300aggctggaca ttcttcagaa aaatcaaagg
cattacatgg gagaagaact tgataccttg 360agtctcaagg agcttcagaa
tttagagcat caaattgaca gtgctcttaa acacatcagg 420tcaaagaaga
accaactcat gcatgaatcg atttctcagc ttcagcgaaa ggacaaagcg
480ttaaaggagc acaacaactt gctatccaag aaggtgaagg agagggagaa
ggtgctggct 540cagcaggcag aattggatca gcaaaatcat gacaataact
catctggctt tgtgatgtct 600caagctttgc cctcactgaa tacaggagga
acaagcagca gtgcagtgga agatgaagca 660acacaaccac caaatctaaa
cagcaactct gcacaaatac cgtcctggat gcttcaacac 720atccaagagc agtaa
7355259PRTBeta vulgaris 5Met Gly Arg Gly Arg Val Glu Leu Lys Arg
Ile Glu Asn Lys Ile Asn1 5 10 15Arg Gln Val Thr Phe Ser Lys Arg Arg
Ser Gly Leu Val Lys Lys Ala 20 25 30His Glu Ile Ser Val Leu Cys Asp
Ala Glu Val Ala Leu Ile Ile Phe 35 40 45Ser His Arg Gly Lys Leu Phe
Glu Tyr Ser Ser Asp Ser Ser Met Glu 50 55 60Lys Ile Leu Glu Arg Tyr
Glu Arg Tyr Ser Tyr Ala Glu Arg Arg Leu65 70 75 80Ala Ser Asn Asp
Pro Asp Ser Gln Val Asn Trp Thr Phe Asp Phe Ala 85 90 95Lys Leu Lys
Ala Lys Leu Glu Leu Leu Gln Arg Asn His Arg His Tyr 100 105 110Leu
Gly Gln Glu Leu Asp Ser Leu Asn Met Lys Glu Leu Gln Ser Leu 115 120
125Glu Gln Gln Leu Asp Thr Ala Leu Lys Asn Val Arg Ser Arg Lys Asn
130 135 140Gln Leu Met His Glu Ser Ile Ser Glu Leu Gln Lys Lys Glu
Arg Ala145 150 155 160Met Gln Glu His Asn Asn Ile Leu Ser Lys Lys
Ile Lys Glu Arg Gly 165 170 175Lys Asn Leu Glu Gln Val Gln Gln Met
Gln Trp Gln Asn Gln His Gln 180 185 190His Gln His Gln Gln Gln Pro
Pro Pro Pro Pro Gln Asn His Gln Val 195 200 205Pro Pro Asp Ala Ser
Asn Phe Met Leu Pro Pro Pro Ile Pro Ser Leu 210 215 220Asn Thr Gly
Gly Tyr Gln Gly Gln Phe Gly Gly Glu Val Arg Arg Asn225 230 235
240Asp Leu Asp Leu Thr Leu Glu Pro Ile Tyr Ser Cys His Met Gly Cys
245 250 255Phe Thr Thr6244PRTBeta vulgaris 6Met Gly Arg Gly Arg Val
Gln Leu Lys Arg Ile Glu Asn Lys Ile Asn1 5 10 15Arg Gln Val Thr Phe
Ser Lys Arg Arg Ile Gly Leu Leu Lys Lys Ala 20 25 30His Glu Ile Ser
Ile Leu Cys Asp Ala Asp Val Ala Leu Ile Ile Phe 35 40 45Ser Thr Lys
Gly Lys Leu Phe Glu Tyr Ala Ser Asp Thr Cys Met Glu 50 55 60Arg Ile
Leu Glu Arg Tyr Glu Arg His Ser Tyr Ala Glu Arg Gln Leu65 70 75
80Thr Ala Pro Asp Pro Gly Ser His Val Ser Leu Thr Leu Glu His Ala
85 90 95Lys Leu Lys Ala Arg Leu Asp Ile Leu Gln Lys Asn Gln Arg His
Tyr 100 105 110Met Gly Glu Glu Leu Asp Thr Leu Ser Leu Lys Glu Leu
Gln Asn Leu 115 120 125Glu His Gln Ile Asp Ser Ala Leu Lys His Ile
Arg Ser Lys Lys Asn 130 135 140Gln Leu Met His Glu Ser Ile Ser Gln
Leu Gln Arg Lys Asp Lys Ala145 150 155 160Leu Lys Glu His Asn Asn
Leu Leu Ser Lys Lys Val Lys Glu Arg Glu 165 170 175Lys Val Leu Ala
Gln Gln Ala Glu Leu Asp Gln Gln Asn His Asp Asn 180 185 190Asn Ser
Ser Gly Phe Val Met Ser Gln Ala Leu Pro Ser Leu Asn Thr 195 200
205Gly Gly Thr Ser Ser Ser Ala Val Glu Asp Glu Ala Thr Gln Pro Pro
210 215 220Asn Leu Asn Ser Asn Ser Ala Gln Ile Pro Ser Trp Met Leu
Gln His225 230 235 240Ile Gln Glu Gln72366DNABeta vulgaris
7atgatagaac cgcagctgaa agcatgcaac aaaaatgtga agaatccgga gagcaggaag
60actgcttcca cttcgtacaa ttctgcttct aggaagcaaa gcaggaaggg agaaaatcct
120attcgtgtta ctccgttagg agagcaatct tctgattttg gatgttctag
tacttggata 180tgtaaaaatt ctgcatgtag agctgttctg tctatagatg
atgcgttctg tcggaggtgt 240tcatgctgca tctgtcatca atttgatgat
aataaagacc ctagtctttg gttggtttgt 300gaatccgagt ctgggcaggg
tgattcttgt ggattatcat gccatattga gtgtgcattt 360caacaagaaa
agctgggagt tgtgaacctt gggcaataca tgcatttgga tgggagttac
420tgttgttctt cttgcggcaa agtctctggg atacttgggt cagtacttct
gttttatgta 480gtgagatatt gacctgaagt catgcttgtt ttgatggaag
ataaataatt taaaaaaaat 540gttacatgcc caactattac aaggccagta
gaaaactatg agatatatta attttgatat 600tactgtggag gctcagttga
atttatatgc ttgagtttta ctcactaatg gcaggtgttg 660gaaaaagcaa
ttggctatag ctaaggatgc tcgacgtgtc gatgtgcttt gctatagaat
720atttttgagt tacagactcc tcgagggcac agctaagttt aaggacctcc
acgagattgt 780tgcagaagct aaaacaaagc tggaggcaga ggtgggtcct
atgaacggag actctgtcaa 840gatggccaga ggtattgtta gcaggcttgc
tattgctgca gatgtgcaaa agctctgttc 900gcacgcgatt gataaagcta
atgaatggct cgccaatgtt tctagcatta gttcaaattg 960caaaggttag
aatacataca gcctttattg ttctccattt acttggtgag tattctatta
1020taataaatta ttaatttctt ttggcataat ggtggcagtg gatgcacttc
ctgctgcatg 1080caggtttcta tttgaagaag ttacttcttg ttcacttgct
atagttttga tagatatccc 1140cacaccaatg actgattccg tcaaaggcta
caagctatgg tactgcaaaa gtagacatga 1200gacttttgca agggagccta
catccgtctt tccaagggag aaaagaaaaa tatctgtaaa 1260gaatctcaag
ccttgcaccg agtacacatt cagaatagtt tcctacacag aagttggtga
1320tttaggccac tctgaggcta agtgtttcac caagagtttg gagatcatta
gtaagaaatc 1380caccacagtg ggctgtaaga aggaagatcc ttgtgttgag
aggagctcct cgaatgcaaa 1440ggaacaacat aattcaaatt tggctgcaat
atcttctgga ttcaaggtgc gggaccttgg 1500gaaaatcttg cacctagcat
gggcccaaga acagggttgc cttgaaggtt tctgcagtgc 1560tgatgtagaa
caatgctgtg gagtaactaa atgtgaatct ccaaaagatc accagtcacc
1620tccacctgtt tctcgtgagc ttgacctaaa tgtagtttca gttcctgatt
taaatgaaga 1680ccttacccct cccttagagt cttcaaggga tgaagacaac
ggatgcacgc tagagcgtgc 1740tactgggcct gatgatgatg ctgcttccca
tggtgttgag aagaatgggc ttgggctagc 1800caggtcaaat ggtagtgggc
caagtgatga gtctcaagct tgggctctca tccgaaatgg 1860agatgtgcct
gctgttgatt ccttggcaga gacccgtcgg aagaggtctt caagtgcgaa
1920tgaagaaaca catgactgtg acagcactct gataaatgga tcgccatttc
ggatttcagg 1980cgggcctggt tctctagacg gtaactttga gtattgtgtg
aaggtcatcc ggtggttgga 2040gtgtgagggc tatctaaaac aggaatttag
attgaaatta ttgacttggt ttagcttaag 2100atctactgaa caagagcgtc
gggtagtcag cactttcatt caaactctga tggatgatcc 2160aaagagctta
gcaggacagc tagttgattc ctttggagat ctcatatcca gcaagaggcc
2220caggactagt ttcactagta agtttatgtc tatagttgtt ctttgattga
gaaattttat 2280gtcttctttc aaacatttct attactcttc ttattctgaa
gttttggact aattttgtca 2340tgctaattac aggcattcct tcctaa
236682285DNAArtificial SequenceBvVIL1 cDNA 8atgatagaac cgcagctgaa
agcatgcaac aaaaatgtga agaatccgga gagcaggaag 60actgcttcca cttcgtacaa
ttctgcttct aggaagcaaa gcaggaaggg agaaaatcct 120attcgtgtta
ctccgttagg agagcaatct tctgattttg gatgttctag tacttggata
180tgtaaaaatt ctgcatgtag agctgttctg tctatagatg atgcgttctg
tcggaggtgt 240tcatgctgca tctgtcatca atttgatgat aataaagacc
ctagtctttg gttggtttgt 300gaatccgagt ctgggcaggg tgattcttgt
ggattatcat gccatattga gtgtgcattt 360caacaagaaa agctgggagt
tgtgaacctt gggcaataca tgcatttgga tgggagttac 420tgttgttctt
cttgcggcaa agtctctggg atacttgggt gttggaaaaa gcaattggct
480atagctaagg atgctcgacg tgtcgatgtg ctttgctata gaatattttt
gagttacaga 540ctcctcgagg gcacagctaa gtttaaggac ctccacgaga
ttgttgcaga agctaaaaca 600aagctggagg cagaggtggg tcctatgaac
ggagactctg tcaagatggc cagaggtatt 660gttagcaggc ttgctattgc
tgcagatgtg caaaagctct gttcgcacgc gattgataaa 720gctaatgaat
ggctcgccaa tgtttctagc attagttcaa attgcaaagt ggatgcactt
780cctgctgcat gcaggtttct atttgaagaa gttacttctt gttcacttgc
tatagttttg 840atagatatcc ccacaccaat gactgattcc gtcaaaggct
acaagctatg gtactgcaaa 900agtagacatg agacttttgc aagggagcct
acatccgtct ttccaaggga gaaaagaaaa 960atatctgtaa agaatctcaa
gccttgcacc gagtacacat tcagaatagt ttcctacaca 1020gaagttggtg
atttaggcca ctctgaggct aagtgtttca ccaagagttt ggagatcatt
1080agtaagaaat ccaccacagt gggctgtaag aaggaagatc cttgtgttga
gaggagctcc 1140tcgaatgcaa aggaacaaca taattcaaat ttggctgcaa
tatcttctgg attcaaggtg 1200cgggaccttg ggaaaatctt gcacctagca
tgggcccaag aacagggttg ccttgaaggt 1260ttctgcagtg ctgatgtaga
acaatgctgt ggagtaacta aatgtgaatc tccaaaagat 1320caccagtcac
ctccacctgt ttctcgtgag cttgacctaa atgtagtttc agttcctgat
1380ttaaatgaag accttacccc tcccttagag tcttcaaggg atgaagacaa
cggatgcacg 1440ctagagcgtg ctactgggcc tgatgatgat gctgcttccc
atggtgttga gaagaatggg 1500cttgggctag ccaggtcaaa tggtagtggg
ccaagtgatg agtctcaagc ttgggctctc 1560atccgaaatg gagatgtgcc
tgctgttgat tccttggcag agacccgtcg gaagaggtct 1620tcaagtgcga
atgaagaaac acatgactgt gacagcactc tgataaatgg atcgccattt
1680cggatttcag gcgggcctgg ttctctagac ggtaactttg agtattgtgt gaaggtcatc
1740cggtggttgg agtgtgaggg ctatctaaaa caggaattta gattgaaatt
attgacttgg 1800tttagcttaa gatctactga acaagagcgt cgggtagtca
gcactttcat tcaaactctg 1860atggatgatc caaagagctt agcaggacag
ctagttgatt cctttggaga tctcatatcc 1920agcaagaggc ccaggactag
tttcactagc attccttcct aaataaatct taactaagga 1980cggcacacat
atcttggata caattcagat gtttaggaca caatttttag gaggcagtac
2040ctgattttcc tcgagaaagg gattccatca gtggttaact gcacatttta
gaaggtattt 2100gttagagttt ccttgaccac atttgtagaa agattcacat
tgagacaatc attgttgcct 2160tctcgcattg aaggaaggat atatgcttca
atgaatattt aaattctagt tcaatttact 2220aattaattag tttgttttct
caaaaaaaaa aaaaaaaaaa aaaaagtact agtcgacgcg 2280tggcc
22859653PRTBeta vulgaris 9Met Ile Glu Pro Gln Leu Lys Ala Cys Asn
Lys Asn Val Lys Asn Pro1 5 10 15Glu Ser Arg Lys Thr Ala Ser Thr Ser
Tyr Asn Ser Ala Ser Arg Lys 20 25 30Gln Ser Arg Lys Gly Glu Asn Pro
Ile Arg Val Thr Pro Leu Gly Glu 35 40 45Gln Ser Ser Asp Phe Gly Cys
Ser Ser Thr Trp Ile Cys Lys Asn Ser 50 55 60Ala Cys Arg Ala Val Leu
Ser Ile Asp Asp Ala Phe Cys Arg Arg Cys65 70 75 80Ser Cys Cys Ile
Cys His Gln Phe Asp Asp Asn Lys Asp Pro Ser Leu 85 90 95Trp Leu Val
Cys Glu Ser Glu Ser Gly Gln Gly Asp Ser Cys Gly Leu 100 105 110Ser
Cys His Ile Glu Cys Ala Phe Gln Gln Glu Lys Leu Gly Val Val 115 120
125Asn Leu Gly Gln Tyr Met His Leu Asp Gly Ser Tyr Cys Cys Ser Ser
130 135 140Cys Gly Lys Val Ser Gly Ile Leu Gly Cys Trp Lys Lys Gln
Leu Ala145 150 155 160Ile Ala Lys Asp Ala Arg Arg Val Asp Val Leu
Cys Tyr Arg Ile Phe 165 170 175Leu Ser Tyr Arg Leu Leu Glu Gly Thr
Ala Lys Phe Lys Asp Leu His 180 185 190Glu Ile Val Ala Glu Ala Lys
Thr Lys Leu Glu Ala Glu Val Gly Pro 195 200 205Met Asn Gly Asp Ser
Val Lys Met Ala Arg Gly Ile Val Ser Arg Leu 210 215 220Ala Ile Ala
Ala Asp Val Gln Lys Leu Cys Ser His Ala Ile Asp Lys225 230 235
240Ala Asn Glu Trp Leu Ala Asn Val Ser Ser Ile Ser Ser Asn Cys Lys
245 250 255Val Asp Ala Leu Pro Ala Ala Cys Arg Phe Leu Phe Glu Glu
Val Thr 260 265 270Ser Cys Ser Leu Ala Ile Val Leu Ile Asp Ile Pro
Thr Pro Met Thr 275 280 285Asp Ser Val Lys Gly Tyr Lys Leu Trp Tyr
Cys Lys Ser Arg His Glu 290 295 300Thr Phe Ala Arg Glu Pro Thr Ser
Val Phe Pro Arg Glu Lys Arg Lys305 310 315 320Ile Ser Val Lys Asn
Leu Lys Pro Cys Thr Glu Tyr Thr Phe Arg Ile 325 330 335Val Ser Tyr
Thr Glu Val Gly Asp Leu Gly His Ser Glu Ala Lys Cys 340 345 350Phe
Thr Lys Ser Leu Glu Ile Ile Ser Lys Lys Ser Thr Thr Val Gly 355 360
365Cys Lys Lys Glu Asp Pro Cys Val Glu Arg Ser Ser Ser Asn Ala Lys
370 375 380Glu Gln His Asn Ser Asn Leu Ala Ala Ile Ser Ser Gly Phe
Lys Val385 390 395 400Arg Asp Leu Gly Lys Ile Leu His Leu Ala Trp
Ala Gln Glu Gln Gly 405 410 415Cys Leu Glu Gly Phe Cys Ser Ala Asp
Val Glu Gln Cys Cys Gly Val 420 425 430Thr Lys Cys Glu Ser Pro Lys
Asp His Gln Ser Pro Pro Pro Val Ser 435 440 445Arg Glu Leu Asp Leu
Asn Val Val Ser Val Pro Asp Leu Asn Glu Asp 450 455 460Leu Thr Pro
Pro Leu Glu Ser Ser Arg Asp Glu Asp Asn Gly Cys Thr465 470 475
480Leu Glu Arg Ala Thr Gly Pro Asp Asp Asp Ala Ala Ser His Gly Val
485 490 495Glu Lys Asn Gly Leu Gly Leu Ala Arg Ser Asn Gly Ser Gly
Pro Ser 500 505 510Asp Glu Ser Gln Ala Trp Ala Leu Ile Arg Asn Gly
Asp Val Pro Ala 515 520 525Val Asp Ser Leu Ala Glu Thr Arg Arg Lys
Arg Ser Ser Ser Ala Asn 530 535 540Glu Glu Thr His Asp Cys Asp Ser
Thr Leu Ile Asn Gly Ser Pro Phe545 550 555 560Arg Ile Ser Gly Gly
Pro Gly Ser Leu Asp Gly Asn Phe Glu Tyr Cys 565 570 575Val Lys Val
Ile Arg Trp Leu Glu Cys Glu Gly Tyr Leu Lys Gln Glu 580 585 590Phe
Arg Leu Lys Leu Leu Thr Trp Phe Ser Leu Arg Ser Thr Glu Gln 595 600
605Glu Arg Arg Val Val Ser Thr Phe Ile Gln Thr Leu Met Asp Asp Pro
610 615 620Lys Ser Leu Ala Gly Gln Leu Val Asp Ser Phe Gly Asp Leu
Ile Ser625 630 635 640Ser Lys Arg Pro Arg Thr Ser Phe Thr Ser Ile
Pro Ser 645 65010184DNAArtificial SequenceBvAP1 cDNA 10ctagagcaag
tgcaacagat gcagtggcag aaccagcacc agcaccagca ccagcagcag 60ccgccaccgc
cgccacaaaa tcatcaagtt cctcctgacg catcaaattt catgctccca
120cctccaattc cttctttgaa cacgggtggg taccaaggac aatttggtgg
agaagtaagg 180agga 18411150DNAArtificial SequenceBvAP1 fragment
11ctagagcaag tgcaacagat gcagtggcag aaccagcacc agcaccagca ccagcagcag
60ccgccaccgc cgccacaaaa tcatcaagtt cctcctgacg catcaaattt catgctccca
120cctccaattc cttctttgaa cacgggtggg 15012100DNAArtificial
SequenceBvAP1 fragment 12ctagagcaag tgcaacagat gcagtggcag
aaccagcacc agcaccagca ccagcagcag 60ccgccaccgc cgccacaaaa tcatcaagtt
cctcctgacg 1001350DNAArtificial SequenceBvAP1 fragment 13ctagagcaag
tgcaacagat gcagtggcag aaccagcacc agcaccagca 5014212DNAArtificial
SequenceBvFUL fragment 14agagggagaa ggtgctggct cagcaggcag
aattggatca gcaaaatcat gacaataact 60catctggctt tgtgatgtct caagctttgc
cctcactgaa tacaggagga acaagcagca 120gtgcagtgga agatgaagca
acacaaccac caaatctaaa cagcaactct gcacaaatac 180cgtcctggat
gcttcaacac atccaagagc ag 21215150DNAArtificial SequenceBvFUL
fragment 15tctggctttg tgatgtctca agctttgccc tcactgaata caggaggaac
aagcagcagt 60gcagtggaag atgaagcaac acaaccacca aatctaaaca gcaactctgc
acaaataccg 120tcctggatgc ttcaacacat ccaagagcag
15016100DNAArtificial SequenceBvFUL fragment 16aagcagcagt
gcagtggaag atgaagcaac acaaccacca aatctaaaca gcaactctgc 60acaaataccg
tcctggatgc ttcaacacat ccaagagcag 1001750DNAArtificial SequenceBvFUL
fragment 17gcaactctgc acaaataccg tcctggatgc ttcaacacat ccaagagcag
5018399DNAArtificial SequenceBvVIL1 fragment 18cacagctaag
tttaaggacc tccacgagat tgttgcagaa gctaaaacaa agctggaggc 60agaggtgggt
cctatgaacg gagactctgt caagatggcc agaggtattg ttagcaggct
120tgctattgct gcagatgtgc aaaagctctg ttcgcacgcg attgataaag
ctaatgaatg 180gctcgccaat gtttctagca ttagttcaaa ttgcaaagtg
gatgcacttc ctgctgcatg 240caggtttcta tttgaagaag ttacttcttg
ttcacttgct atagttttga tagatatccc 300cacaccaatg actgattccg
tcaaaggcta caagctatgg tactgcaaaa gtagacatga 360gacttttgca
agggagccta catccgtctt tccaaggga 39919396DNAArtificial
Sequenceconstruct comprising BvFUL fragment and BvAP1 fragment
19agagggagaa ggtgctggct cagcaggcag aattggatca gcaaaatcat gacaataact
60catctggctt tgtgatgtct caagctttgc cctcactgaa tacaggagga acaagcagca
120gtgcagtgga agatgaagca acacaaccac caaatctaaa cagcaactct
gcacaaatac 180cgtcctggat gcttcaacac atccaagagc agctagagca
agtgcaacag atgcagtggc 240agaaccagca ccagcaccag caccagcagc
agccgccacc gccgccacaa aatcatcaag 300ttcctcctga cgcatcaaat
ttcatgctcc cacctccaat tccttctttg aacacgggtg 360ggtaccaagg
acaatttggt ggagaagtaa ggagga 39620300DNAArtificial
Sequenceconstruct comprising BvFUL fragment and BvAP1 fragment
20tctggctttg tgatgtctca agctttgccc tcactgaata caggaggaac aagcagcagt
60gcagtggaag atgaagcaac acaaccacca aatctaaaca gcaactctgc acaaataccg
120tcctggatgc ttcaacacat ccaagagcag ctagagcaag tgcaacagat
gcagtggcag 180aaccagcacc agcaccagca ccagcagcag ccgccaccgc
cgccacaaaa tcatcaagtt 240cctcctgacg catcaaattt catgctccca
cctccaattc cttctttgaa cacgggtggg 30021200DNAArtificial
Sequenceconstruct comprising BvFUL fragment and BvAP1 fragment
21aagcagcagt gcagtggaag atgaagcaac acaaccacca aatctaaaca gcaactctgc
60acaaataccg tcctggatgc ttcaacacat ccaagagcag ctagagcaag tgcaacagat
120gcagtggcag aaccagcacc agcaccagca ccagcagcag ccgccaccgc
cgccacaaaa 180tcatcaagtt cctcctgacg 20022100DNAArtificial
Sequenceconstruct comprising BvFUL fragment and BvAP1 fragment
22gcaactctgc acaaataccg tcctggatgc ttcaacacat ccaagagcag ctagagcaag
60tgcaacagat gcagtggcag aaccagcacc agcaccagca 10023795DNAArtificial
Sequenceconstruct comprising BvFUL fragment, BvAP1 fragment and
BvVIL1 fragment 23agagggagaa ggtgctggct cagcaggcag aattggatca
gcaaaatcat gacaataact 60catctggctt tgtgatgtct caagctttgc cctcactgaa
tacaggagga acaagcagca 120gtgcagtgga agatgaagca acacaaccac
caaatctaaa cagcaactct gcacaaatac 180cgtcctggat gcttcaacac
atccaagagc agctagagca agtgcaacag atgcagtggc 240agaaccagca
ccagcaccag caccagcagc agccgccacc gccgccacaa aatcatcaag
300ttcctcctga cgcatcaaat ttcatgctcc cacctccaat tccttctttg
aacacgggtg 360ggtaccaagg acaatttggt ggagaagtaa ggaggacaca
gctaagttta aggacctcca 420cgagattgtt gcagaagcta aaacaaagct
ggaggcagag gtgggtccta tgaacggaga 480ctctgtcaag atggccagag
gtattgttag caggcttgct attgctgcag atgtgcaaaa 540gctctgttcg
cacgcgattg ataaagctaa tgaatggctc gccaatgttt ctagcattag
600ttcaaattgc aaagtggatg cacttcctgc tgcatgcagg tttctatttg
aagaagttac 660ttcttgttca cttgctatag ttttgataga tatccccaca
ccaatgactg attccgtcaa 720aggctacaag ctatggtact gcaaaagtag
acatgagact tttgcaaggg agcctacatc 780cgtctttcca aggga
79524845DNABeta vulgaris 24ctaggtcaga ttcgctatct atcttcttct
tcttttttgt tggtcaatca ctctgaagaa 60cctttatgac aagtagtatt aagtttcatg
aacttctagt ttaaactcgc tctaaaaata 120ctaaaagatt acatttcaat
atatattcct gtgttatata tacacatgct gttttgtagt 180aaatattatg
tgaagttggt cttaaaattg acatgatgta aatatcacgt cagtttttat
240aaccgattca aattaaacat cagttataac aagcgatgca aatattatgt
gaagttgcat 300cagttgttaa taaatcgatg taaattattt gcatcaacaa
gatacgaatc gtttatttaa 360gtgatgtaaa atttctttac atcaattata
agtaatacaa atattcaata taagtgattt 420aaaatgatat tttttttgta
gtgatcccat gtgaagttat aattcattct ttcagcatca 480tagtctcttg
gagttttctc tttgtcctca ccacacttat tattctcttc ccttttaact
540atcaaaagat accctcccct taactttaag attttaaatt aagaaatcgt
agacacgaaa 600aatctaagaa ctaacaaata ttttgataaa gacagcttct
aaaattcaat atctgaatag 660tatctcttgg aaaatgtcgt gttgtggtcg
gtcacatttc aacactcttg tacaaaagcg 720tcaacttgac ttcatgtgac
agtttttgtt tattagagat gtttagttgt agaattgatg 780atttatgtat
ataacaacga tattctatag atatatattg atgttaaaag ataaacagga 840ggaac
8452529DNAArtificial Sequenceforward primer WAG-AP1FUL 25ctcgagagag
ggagaaggtg ctggctcag 292631DNAArtificial Sequencereverse primer
WAG-AP1FUL 26cccgggtcct ccttacttct ccaccaaatt g 312731DNAArtificial
Sequenceforward primer vil1RNAi-fwd 27gtcgaccaca gctaagttta
aggacctcca c 312829DNAArtificial Sequencereverse primer
vil1RNAi-rev 28ctcgagtccc ttggaaagac ggatgtagg 2929411DNAArtificial
SequenceVIL1 cDNA fragment for cloning of VIL1-AP1-FUL RNAi
construct 29gtcgaccaca gctaagttta aggacctcca cgagattgtt gcagaagcta
aaacaaagct 60ggaggcagag gtgggtccta tgaacggaga ctctgtcaag atggccagag
gtattgttag 120caggcttgct attgctgcag atgtgcaaaa gctctgttcg
cacgcgattg ataaagctaa 180tgaatggctc gccaatgttt ctagcattag
ttcaaattgc aaagtggatg cacttcctgc 240tgcatgcagg tttctatttg
aagaagttac ttcttgttca cttgctatag ttttgataga 300tatccccaca
ccaatgactg attccgtcaa aggctacaag ctatggtact gcaaaagtag
360acatgagact tttgcaaggg agcctacatc cgtctttcca agggactcga g
41130408DNAArtificial Sequenceconstruct comprising BvFUL fragment
and BvAP1 fragment for cloning of VIL1-AP1-FUL RNAi construct
30ctcgagagag ggagaaggtg ctggctcagc aggcagaatt ggatcagcaa aatcatgaca
60ataactcatc tggctttgtg atgtctcaag ctttgccctc actgaataca ggaggaacaa
120gcagcagtgc agtggaagat gaagcaacac aaccaccaaa tctaaacagc
aactctgcac 180aaataccgtc ctggatgctt caacacatcc aagagcagct
agagcaagtg caacagatgc 240agtggcagaa ccagcaccag caccagcacc
agcagcagcc gccaccgccg ccacaaaatc 300atcaagttcc tcctgacgca
tcaaatttca tgctcccacc tccaattcct tctttgaaca 360cgggtgggta
ccaaggacaa tttggtggag aagtaaggag gacccggg 4083125DNAArtificial
Sequenceforward primer for VIL1-AP1-Ful 31gtcgaccaca gctaagttta
aggac 253225DNAArtificial Sequencereverse primer for VIL1-AP1-Ful
32cccgggtcct ccttacttct ccacc 2533780DNAArtificial SequenceBvAP1
cDNA showing sites for mutagenesismutation(1)..(1)any nucleotide
except amutation(2)..(2)any nucleotide except tmutation(3)..(3)any
nucleotide except gmutation(52)..(52)c to tmutation(151)..(151)c to
tmutation(262)..(262)c to tmutation(316)..(316)c to
tmutation(343)..(343)c to tmutation(376)..(376)c to
tmutation(388)..(388)c to tmutation(391)..(391)c to
tmutation(418)..(418)c to tmutation(433)..(433)c to
tmutation(463)..(463)c to tmutation(484)..(484)c to
tmutation(541)..(541)c to tmutation(547)..(547)c to
tmutation(550)..(550)c to tmutation(556)..(556)c to
tmutation(562)..(562)c to tmutation(568)..(568)c to
tmutation(574)..(574)c to tmutation(580)..(580)c to
tmutation(586)..(586)c to tmutation(589)..(589)c to
tmutation(592)..(592)c to tmutation(610)..(610)c to
tmutation(619)..(619)c to tmutation(688)..(688)c to
tmutation(694)..(694)c to t 33bvhggaaggg gtagggttga gctgaagagg
atagagaata agatcaatag acaagtaact 60ttttcaaaga gaagaagtgg acttgtgaag
aaagctcatg aaatttctgt tctttgtgat 120gctgaggttg ctctgatcat
tttttctcac tgaggaaaac tctttgagta ttcttctgat 180tcttctatgg
agaagatcct agaaaggtat gagaggtatt cttacgcaga aagacggcta
240gcttcaaatg atccagactc ataggtaaac tggacctttg acttcgcaaa
actgaaggcg 300aagcttgaac ttctataaag gaatcatagg cactacttag
gataagagct tgactcgctt 360aacatgaagg aactttagag tttagagtaa
taacttgata ctgctctaaa aaatgtttga 420tctaggaaga actaactgat
gcacgagtcc atttctgaac tctagaagaa ggagagggca 480atgtaggagc
acaataacat cctgtctaag aagatcaagg agagaggaaa aaatctagag
540taagtgtaat agatgtagtg gtagaactag cactagcact agcactagta
gtagccgcca 600ccgccgccat aaaatcatta agttcctcct gacgcatcaa
atttcatgct cccacctcca 660attccttctt tgaacacggg tgggtactaa
ggataatttg gtggagaagt aaggaggaat 720gatcttgacc tgacgctaga
accgatatac tcatgtcaca tgggatgctt tacaacatga 78034735DNAArtificial
SequenceBvFUL cDNA showing sites for mutagenesismutation(1)..(1)any
nucleotide except amutation(2)..(2)any nucleotide except
tmutation(3)..(3)any nucleotide except gmutation(19)..(19)c to
tmutation(52)..(52)c to tmutation(235)..(235)c to
tmutation(316)..(316)c to tmutation(325)..(325)c to
tmutation(376)..(376)c to tmutation(391)..(391)c to
tmutation(433)..(433)c to tmutation(457)..(457)c to
tmutation(463)..(463)c to tmutation(466)..(466)c to
tmutation(541)..(541)c to tmutation(544)..(544)c to
tmutation(559)..(559)c to tmutation(562)..(562)c to
tmutation(601)..(601)c to tmutation(664)..(664)c to
tmutation(694)..(694)c to tmutation(715)..(715)c to
tmutation(724)..(724)c to t 34bvhgggagag gtagggttta gctcaaaaga
attgaaaaca agatcaaccg ttaagtgacc 60ttctccaaac gtcggattgg attgttgaag
aaagcgcacg agatctccat tctctgcgat 120gccgatgtag ctctcatcat
cttctccact aaaggcaagc tcttcgagta tgcttctgat 180acctgcatgg
aaaggatact cgagcgctat gaaagacact catatgcaga gagataactg
240actgctccag atcctggatc ccatgtaagt ttgactctgg aacacgcaaa
acttaaggct 300aggctggaca ttctttagaa aaattaaagg cattacatgg
gagaagaact tgataccttg 360agtctcaagg agctttagaa tttagagcat
taaattgaca gtgctcttaa acacatcagg 420tcaaagaaga actaactcat
gcatgaatcg atttcttagc tttagtgaaa ggacaaagcg 480ttaaaggagc
acaacaactt gctatccaag aaggtgaagg agagggagaa ggtgctggct
540tagtaggcag aattggatta gtaaaatcat gacaataact catctggctt
tgtgatgtct 600taagctttgc cctcactgaa tacaggagga acaagcagca
gtgcagtgga agatgaagca 660acataaccac caaatctaaa cagcaactct
gcataaatac cgtcctggat gctttaacac 720atctaagagc agtaa
73535780DNAArtificial SequenceBvAP1 cDNA mutation position
262mutation(262)..(262)c to t 35atgggaaggg gtagggttga gctgaagagg
atagagaata agatcaacag acaagtaact 60ttttcaaaga gaagaagtgg acttgtgaag
aaagctcatg aaatttctgt tctttgtgat 120gctgaggttg ctctgatcat
tttttctcac
cgaggaaaac tctttgagta ttcttctgat 180tcttctatgg agaagatcct
agaaaggtat gagaggtatt cttacgcaga aagacggcta 240gcttcaaatg
atccagactc ataggtaaac tggacctttg acttcgcaaa actgaaggcg
300aagcttgaac ttctacaaag gaatcatagg cactacttag gacaagagct
tgactcgctt 360aacatgaagg aacttcagag tttagagcaa caacttgata
ctgctctaaa aaatgttcga 420tctaggaaga accaactgat gcacgagtcc
atttctgaac tccagaagaa ggagagggca 480atgcaggagc acaataacat
cctgtctaag aagatcaagg agagaggaaa aaatctagag 540caagtgcaac
agatgcagtg gcagaaccag caccagcacc agcaccagca gcagccgcca
600ccgccgccac aaaatcatca agttcctcct gacgcatcaa atttcatgct
cccacctcca 660attccttctt tgaacacggg tgggtaccaa ggacaatttg
gtggagaagt aaggaggaat 720gatcttgacc tgacgctaga accgatatac
tcatgtcaca tgggatgctt tacaacatga 78036735DNAArtificial
SequenceBvFUL cDNA mutation position 316mutation(316)..(316)c to t
36atggggagag gtagggttca gctcaaaaga attgaaaaca agatcaaccg tcaagtgacc
60ttctccaaac gtcggattgg attgttgaag aaagcgcacg agatctccat tctctgcgat
120gccgatgtag ctctcatcat cttctccact aaaggcaagc tcttcgagta
tgcttctgat 180acctgcatgg aaaggatact cgagcgctat gaaagacact
catatgcaga gagacaactg 240actgctccag atcctggatc ccatgtaagt
ttgactctgg aacacgcaaa acttaaggct 300aggctggaca ttctttagaa
aaatcaaagg cattacatgg gagaagaact tgataccttg 360agtctcaagg
agcttcagaa tttagagcat caaattgaca gtgctcttaa acacatcagg
420tcaaagaaga accaactcat gcatgaatcg atttctcagc ttcagcgaaa
ggacaaagcg 480ttaaaggagc acaacaactt gctatccaag aaggtgaagg
agagggagaa ggtgctggct 540cagcaggcag aattggatca gcaaaatcat
gacaataact catctggctt tgtgatgtct 600caagctttgc cctcactgaa
tacaggagga acaagcagca gtgcagtgga agatgaagca 660acacaaccac
caaatctaaa cagcaactct gcacaaatac cgtcctggat gcttcaacac
720atccaagagc agtaa 7353721668DNAArtificial SequenceBvAP1 genomic
DNA showing sites for mutagenesismutation(1)..(1)any nucleotide
except amutation(2)..(2)any nucleotide except tmutation(3)..(3)any
nucleotide except gmutation(52)..(52)c to tmutation(151)..(151)c to
tmutation(6999)..(6999)c to tmutation(7640)..(7640)c to
tmutation(8573)..(8573)c to tmutation(8606)..(8606)c to
tmutation(8618)..(8618)c to tmutation(8621)..(8621)c to
tmutation(8648)..(8648)c to tmisc_feature(9833)..(11385)n is a, c,
g, or tmutation(11796)..(11796)c to tmutation(11826)..(11826)c to
tmisc_feature(12467)..(12605)n is a, c, g, or
tmisc_feature(13741)..(14088)n is a, c, g, or
tmisc_feature(16396)..(17194)n is a, c, g, or
tmutation(21152)..(21152)c to tmutation(21313)..(21313)c to
tmutation(21319)..(21319)c to tmutation(21322)..(21322)c to
tmutation(21328)..(21328)c to tmutation(21334)..(21334)c to
tmutation(21340)..(21340)c to tmutation(21346)..(21346)c to
tmutation(21352)..(21352)c to tmutation(21358)..(21358)c to
tmutation(21361)..(21361)c to tmutation(21364)..(21364)c to
tmutation(21382)..(21382)c to tmutation(21391)..(21391)c to
tmutation(21576)..(21576)c to tmutation(21582)..(21582)c to t
37bvhgggagag gaagagtgca gctgaagagg atagagaata agatcaacag ataagtaact
60ttttcaaaga gaagaagtgg acttgtgaag aaagctcatg aaatttctgt tctttgtgat
120gctgaggttg ctctgatcat tttttctcac tgaggaaaac tctttgagta
ttcttctgat 180tcttcgtaag tatatatata tatatattaa tagtaactac
ttgttttctg ctttctattt 240ttaggtctga tgcatattta atttaggtaa
tattaattcc ttatatctga tccttaattt 300ttttttcttt taccatttca
tttttgtttg ttttgaataa aagaaaattt ccccttcacg 360tgtgtcgaat
aggtcaaaat ttttacttga aggatgttct ctttgattac taaaatagga
420tccaacaatc acctgaaata aaggaagaag atggtgcaaa gtttttactg
tcatacttag 480tatttgataa atattctatg atgaacttgt ataaattagg
aaatagacct aactttcatg 540cacgaaaaca ttattccttc attcaatttt
tttattactt aaggatttac ttttttattg 600atcatatgaa gtagtagtac
ttgtaatcat tcaatttttt tgttggttaa ataggactac 660attttaaaac
aacccaattt taaaattttt tgtgtgaatt tcttcccttt ttaaaaataa
720agtctattat catagcttag agtagctgtg gcaaagctag acgaaataat
acagaaatct 780ggaaaggaaa ttgtactact tacatgaaca cacttattta
ttacttgcat gatatctgcg 840aaaaagttta tagcaaattt ggttaatata
tagcgtagta ctttggatat taatattact 900agtgtacaaa tacttgatcc
aatgggtaat gaaacttatg gaagatttga ccatacatga 960tgatgctaaa
tattaattgt tattgtccag ctttgttttc cctccatcca ttggcatctt
1020catctttaca ttgctactcc actcacttgt caattgtttc gtcctttatg
ttctttattc 1080acatgtgcac catacttcaa tactttcccc ttctttatcc
tcagtttttt tttcttgtca 1140ttttagggtt aatatccaat gaaatctagt
ttgctcgttt tagatctaat tttaattcga 1200tcacaaccat ccatattttt
gtttcttagc ttgacatcta ttctatggat ctgggatctt 1260cggtgtatag
atgttctcgg ttttcagatc aagatcctat tcatagaccc atttattgta
1320aacacttaaa tgtgttctta aaaagttagt ggctcgccaa gtcaactcaa
taacataacc 1380cccacgactt cattacatta cacaatgaaa gattagatgt
atgagtttgt gaagcttata 1440attctatttc aagtaggact aggatgtttt
gtgcaatcag cagctagtag tctttttaat 1500ttaagtcagt cttcattgtg
catcatatat ttttagaaat atatgcaagt ttgaaaccat 1560ttagaacctc
atgacccgcc tgactcacta taaaccggca agagcttaat ttttcacagc
1620tttgtatctt tatgagtagc gctagctagg ggtatgggca tagaaaaaaa
gggtttgggt 1680tagggtctta caagatctta tccgctattt ttatttcata
atctttcaaa atacatgttt 1740aataattcaa aatacatgtt taatactatc
tccatttcac aacatatgca ccaattgcct 1800agctatggtc caacctagtt
ggtttgtagc ttgcattgga tggttaggat gtattggagt 1860tgtttatgtg
caatcaaatt ttaattacgt atcaaaaaaa aaaaaaaaaa aacatatgca
1920ccaatttcca tttggacaca cttattgacc aatttttgac aatatttttc
tcaccatttt 1980gtaagaaaaa tcaaaatcaa gtggaatttt gttaagttta
tctcagtcaa aagattccat 2040acatcgacat tttataattt ttaatcatac
gcaattagaa atatcaatgt ctaaagaagc 2100gtgttggaat acgtgaaaaa
gcaaatgata catgaaacag atgtagtata tagaaaactt 2160aattttgtgt
cactcggatg tatgtgggcg gagccttcct agaaggcgta cccaccttag
2220tggctctgaa tctttgacga cccgttcggt tggtggtgat aatagatggt
aatagtaatg 2280taatttagtc taaatttata aataaatatt aatatcatta
cccatggtaa tacaagttct 2340tcacaaaaca tgtttcattt aaaaattatc
attactacct tttcaagtgg tattggatga 2400taataaaatt ttaggcaggg
aaatgggtat tgggatgaac attaccatgg gtaatgacat 2460gcaatttttg
ttacaagaat acagtataat acattactat tgccaccatg tataaccatt
2520aatcaaatgg accgtgagga tatgatgttg aagaagaagt cttaacctct
acgctattat 2580ttactagggt ctgtaaattt tcctttttta attataattc
ttgtgaaatc ttcttcactg 2640atggtactag cttattagga tgggtttctt
tagtatattg aaggctcttg ttgacagagt 2700ataaaaatat ttttggggtc
gcaaccatca atttaaactt ttgtttgatt ataaaattat 2760tttttgaaca
tcaacaatct acttaaattt ttggttgagt tagttctttg acatggtatc
2820acaaccatca tgacataaag gtctcatatt caaatctcat tcacctctca
tttccaagta 2880gaatatttac ctcaggtatg ggtatgaggg aggcttgtgt
tgcatgagtc aataacggat 2940cttgaccaat aatttaacag gggcgagttg
ataaattaag ttttaatgta aaattttaaa 3000tgatggataa aaacactaat
acacaccaaa atataaatat acttttatta atggttacaa 3060agagcttgta
gctaatgtaa taaatcaaaa tcccaaaggt gcaattttta agaaattatt
3120tccatttatt tatttgacca ttatgaaatc ttcaagaaat tgagtaagtt
tttaagaaat 3180ttaaggtata gttcattaac taaataaact actccagtaa
aaaaaaatta ccaaactgct 3240ttcttaagta aaaaaataaa taaatttata
ttttatgatt gttaggaatg agtgtgagga 3300aataaaaagt actattatag
ttaaataaaa atgaaagttc ttcagagaag aagaatagaa 3360gatagtacaa
tcaatgttaa atatttttct aaattagaca aattgatata aaccaaaaat
3420aaaggggaag aagaaagaaa taagtaaaaa aagaaagaag gaaaagaaaa
aaagaaaaaa 3480gagaagcaag tgaagaaaaa caaagaagtc caaatgtgtg
ttgatgcaag gttcgagctt 3540gcaacattaa gggctcaaac tttcttttac
actttggttc actgccaccg tgcccacagc 3600ttgttatgtg acatgaagtg
tagtttgctt aatttatctt atacagttat gggggaccaa 3660gcctccaccc
gccccttcta taatctgtca gtggttgcat ccacacttta agtccaatag
3720actcttgtct gagaggaggt gatagagtat ataaatattt ttggggcctc
aaccattagc 3780tcaatctttt gattaagttg gttctgtgac acttgtacta
tatactagtt atatatatac 3840tgtaaaacta gtaccacgag aacagtcctt
aatacaaaca acatgccctt aatagaattt 3900tcttagtata cacttaatat
aggttgacta gctttttgcc cttcagtatg cacacacctt 3960ttataatctg
tatcgttgtc tggtagatga taataaacct cagtattggc aatatatgaa
4020atgacataat ggccatgttt ggtgattaga gtttagagtt tagaggttac
agttcagagt 4080ttgtggttag atgattactt ttttgttcag aggatttgac
tgctgattta aataattgtt 4140gtgtaaaggt gtttggtaac acttagctta
ttgtttagag ttttgtactt tttagagcat 4200gtaaaatgac atttatggac
atatgtattt ttttaaaaca aattttagta gtaattatat 4260ggacaaaata
gtcatttgtt ttttctctct ccaaaactct catgaaaaag ctcctctacc
4320cagctttttc aaaagagagt tttgatcaga gttttcggta caaaactctc
tttagtcctc 4380tctctcacca aacacccaaa ttagagtttt tattggtcaa
aactctaaac tctctccaaa 4440cctctaaact ctctctaaaa ctctctcccc
caaacacccc caatttctta gaaaaatttg 4500ttgctccttt ttattgcact
atatttctat ctccaaacat aaagtttctt ttacaaattt 4560tcatttctac
tccataccac ctttatatgg caatataatt tctatgaatt aaaatgttca
4620caagttttga ggtggatttc aagagcatgg acaatatgat catgagactc
tccatacaaa 4680aattaccctt aaattttata atcatacacc aagcggtcgt
taaagtattg gaagtgcttg 4740agtagtttgt gaaaattaac atataataaa
gtgcagatct cccctctagt aagtagtaag 4800aagtagtaag acgatgtccc
tcatttgaga aagagaaaaa cccttatcag tttctcttgt 4860ttctttgact
gaacgcaagt caaatagaag tatgtaacta ggaaatcctt ggagaaatag
4920attttcttta aaactataaa agtataccta tatatatggt aacccacaaa
aatgtatata 4980atctgatcaa tatctaaaca aagtattctt atgttttctt
tcatcttgct tatttcctcc 5040ctttcctttt cttactttaa tttgtttact
ctctttaact tatttctttg cgtatctcat 5100ttcactttac aaggatatat
agttgattat gacagcttaa taaatatatt ttggaactag 5160gatttattgg
ttgtcgttgt tattttaatt tctacactga tcggctagag tttctagaac
5220atagggcttt attgaaacca ttagttaaca aaattgaatg acaatgattc
aatatgatag 5280aatatgtatg tattagttaa tgtttgatta ttgtttgtat
gtatataatc aaagattatt 5340tagtaatact tctatataca tattctatta
gaatcactta gaaagaccca ttgaacaata 5400ataaggatag gcagacaagc
aaacaaaaga aaaataaacc tgttactcct tccatttctt 5460aatgttctac
tcggaattat agatacacac tttgacacaa attagaaaga gagtgtaaaa
5520agtggatcca tattaatatt tttatttttt taaatgagga gagaagtgtg
ggtttattat 5580gtttcaaggg agatagagag cattgaatag tgagagaata
tgtgccaaag ataattaaat 5640cattgtaaat cttttgccaa ataaagaata
aagcatgtga gtaaaacttt aaaaaatggg 5700cgaaaaagga aagttgagta
gaactttaag agacggaagg aatatagaag aggacgtgac 5760agatgggagg
aagatcagac atcttagaag gggaatagtt aaatttgaga tagtctttta
5820attaaggttc tcactaaaga agatataaca gtaggggaaa gctaaaggtt
attcaaaact 5880ttccttccca tcttcatcac ttcatgtctt tactttagag
ctcttaacac ttagcctatg 5940aaattctgaa ctctttgtaa gattagtgat
agataaaaga atcttatcaa tttaatttat 6000aaatacaaca ggattcaata
aaaagatata gagatctata aataaagagc catactgttg 6060tgaactttta
tatctatcaa aacctttgca cattagacgt ggtataacta aatcaggctt
6120atcgaaattt tttaaaattg ttttcattat agccccttta tatttagaag
ttctaagatg 6180attgcataga tagttgatgc accgttctgg tcgacttttt
taaacacttc tttttgataa 6240attttttttt ttgtattcga atcattattt
taggtgtata aagagctgca aatgatctag 6300atgagattga tctcggtttc
atttatatgc taatagtgtg ttagatacac actattaaaa 6360aagtcatatg
acttagagat tattatggaa aagggatagt gcaccgatat taatataatg
6420gaaaatgaca cacgagttgt ccataataac atgtgaaaag tgaactattt
aaaaggtttt 6480tctgacctag tacatacaag gtgcgtaggt ttagctattt
tagtttttta gttttatttt 6540ttaaagtgaa gttagttatt gatctgaaat
catataacat gtacgtaccg tagatataaa 6600aaactaccaa gtatatatca
atttgaaata aacattattt taatatggca aaatcacaat 6660tgttgactag
acctaacact gaagaaaact atgtcatgtt tatcaattat gttgcataca
6720gttaaaaaca aatatgttag agaaatcgtt atttgaaata gaaaagttgc
gcaaaatagt 6780gattaacatc aaaatatgtt cagaaagttt ttataaatat
gtgatcttgc attgtctgtt 6840gactgtcgag gtttatgata atttcccctt
tttccaatgc aaaacttgtt gtgctatttt 6900ctaatgatat attttttcaa
agtatggaga agatcctaga aaggtatgag aggtattctt 6960acgcagaaag
acggctagct tcaaatgatc cagactcata ggtagtgcat ttatgtaaat
7020atagatatac tcttcatgcc caagaagcct gaatttttta tcccactacg
tactgcaaag 7080ccaagtttaa ttgaataatt gtcctgttta aattatttag
ttttcagtac aataatgtaa 7140tcattagttt gcatgtttaa aaaagaaaag
cacaagttct gatcaagtga aatataaatt 7200gtaacgaaag agccaagcta
gacaattacc tagctaggag ttatttgtta tcgtttttgt 7260ttttaatttc
tagttttttt ttttaaacta gaaaatatag tttcaatctt ttgttatcag
7320ttttcaaaat gacatattta acataaatat gattgatttt aaattcattt
attatatcat 7380atttcatttc aaaataagtg aaacacttgt ctcaaaaagc
tcactctcac ataaatgata 7440aaagtgtttc acttatttta aaacggaaga
attatgactt ttacttttca taaaacgaaa 7500aactgaaata tgacaataat
ctcaaatagc ttggagaaac cagatttcta tatatttccg 7560tgatgaaatc
acttttcatt atacgtaggt aaactggacc tttgacttcg caaaactgaa
7620ggcgaagctt gaacttctat aaaggaatca taggtatgat ggcaatatgt
cataattttt 7680ctattattat ttttgcttcc aaaaccagac catatgtttg
tatatttata tagtgatata 7740ctccatccgt ttcattttaa tctatacatt
tacacttatc aggtatgtca atgcaaaatt 7800ttgaggatat atatctttag
ttttgtattt ataaaaatta taaaaagtac atattaataa 7860aatacatatt
atgatgaatc taacaagatc ccacatgacg atatttccgt ccgcgtatga
7920ataacaaata atggccaaag tgaaatttgt gaatagtgta aaatatcaaa
gtgtaacaat 7980taaaataaaa cggagggagt agtacttgtt tgtcacatac
ttacttattt ttgttctctc 8040cacaatgaaa ctgttctttc taataattaa
aaaaagtgca tatgttgatg atttctctgt 8100cactttaagt ggatattgaa
tagtgataat ggattacttt gtgtataatt gcatttcaca 8160tttgggtcta
attttatacc cttttcgcat atcatgcttt gtgaatagta catatgatgt
8220tcaagaatgt gagaagacat atcatacttt tgatatacct caaacatggg
tgtatactgt 8280atagtgaacg aaagtgttag tgtaatttta tttaggaggt
ttagtggttt gtcctatata 8340taatgctagt agttatacac catagttgtt
gatgagcatc aactggcttt cctaacattt 8400ttttctccat aactttaccc
ttaccttaca ttagattact ctaggattac attctaccta 8460aaatattatt
actcccatca ttttaggtaa atatttttac tttgattttt cgattatttt
8520caagagttta aaataatgat taaaatttac catgatcagg cactacttag
gataagagct 8580tgactcactt aacatgaagg aactttagag tttagagtaa
taacttgata ctgctctcaa 8640aaatgtttga tctaggaagg taagaaattt
tacttgtcta ccgtagtttc ataataaatt 8700agtatttggg ctcgggcttt
gccccagatt ggtattgtct tttcaaattt gatatgcatt 8760tttttccatt
tccactaaaa tatattaaga aaattcaaca tttaaaggat acaaatataa
8820taatgtggat acttaaagta tgattaaaat ttggttgaga tggtaattgt
gtcatgtata 8880atagcaagaa gtcacaagtt caaagctcgt tgcaagctaa
atttattttt gttgattgac 8940atgacttatc aacacactgg acaattctaa
tcatctagtg gagtagcata tactagcaat 9000ttatgcacgt gatgtgtgcc
ttactttttt agaatataat ttataacttt tttgagcata 9060aacaaaggta
aaatttgaac attagacata tttttttggg ctagctaaat ttgttgttta
9120aacctatatc acttaaccaa actcctcttt tattatttat tgatttatat
tttatttaaa 9180atttttaaaa ttaaaatgat gagcaataaa agaatgttaa
gtagatttat taagtatttc 9240ttatattttt atcaacaaag tattttgtgt
taattaaatt atttcacttt gttaattgat 9300tgtattttcc tttttaattt
attacttgat tgtgtattga ttgatcaaac ataatttttt 9360tgttaatttt
tttatgctat atttgaattt atttttcttt catctgtttt tggtagagta
9420gttgatttac taaagggtaa ttaaataaat ttattggggg acaccatagc
tcccccctcc 9480cttatataat agagatttgt atagatttat tgtcttcctc
aattattgat taactagtct 9540tctatgcacg cgatgcgtgt gttgattgtt
tgggtctatt cttaatataa atttcatcaa 9600aatataatta tagtagtgtg
atttacaatt attgctatac aaactactgt aatttataaa 9660gttgttagaa
attgagataa aaatttagat gtgaaatttt gtggtcaaat tatatttgta
9720attttttaaa ctgagtaacc gtttttctca tcatgtcaag ttactttgtt
aatgcttatt 9780taatttatta ttggaatttt tgacccatct ttaaattaga
aaaggatata atnnnnnnnn 9840nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 9900nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 9960nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
10020nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 10080nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn 10140nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 10200nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 10260nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
10320nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 10380nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn 10440nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 10500nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 10560nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
10620nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 10680nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn 10740nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 10800nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 10860nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
10920nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 10980nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn 11040nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 11100nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 11160nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
11220nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 11280nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn 11340nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnntatat atatatttag 11400ctaagaaaaa aaagacattt
cattggggga taccataact cccccttagc ttatataata 11460gagatactat
acttcttttc tgatagtgtc aaatttaatt ataaatcttt aacattggcc
11520aattaataat tggacaagaa aaaaatgaga caataataaa taaggcgatc
ttcacagacg 11580tattaacatg atggtaatta aaaatgttaa tcatagatct
ttgtgttatc ttaataatat 11640aaatttacta attagaatgt atcacataaa
gtaagtatta atagcagcat aggataattc 11700ttataatgga gattttatat
ttttttatat aattatatga tttattgttg aaaatattag 11760ttgattttaa
ctggttgttt attcaatgac agaactaact gatgcacgag tccatttctg
11820aactctagaa gaaggtaata actccatttt ttactctcaa aggtttattg
tttttaactt 11880atttcttcta accttttata tatgagaagg tattgggtta
gacgcgtctg accataatat 11940taggtcggat gactttcagt tggtttcaat
tttatttcag ttggtttcaa tttttgtcca 12000gttggtttca atttttgttc
agttggtttc aatttttttt agctggtttc aatttttgtt 12060cagttggttt
caatattttt tagttgatct tttttatttc agttggatgt cttttaagtt
12120cagttactta tcttattgtt tcatttacgt gttttattgt aactgaaaac
aaaacttaag 12180taaatgaaat aaaataagtt ctaaataaaa gcaacttagg
gcctgttctc cccagcttat 12240tttcagttca gttcaattca gttcagttca
attcaattca tttcagttca gttcagatca 12300gatcagttca gttcagatca
gatcagttct tgacaatact tttactctca catatcacta 12360ttcatttcag
ttcagttcaa ttcaattcag
ttcagttcaa ttcagtttag ttcagttcag 12420ttcagttcaa ttcagttgtt
ttatgccgaa gagaacaggc ccttagnnnn nnnnnnnnnn 12480nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
12540nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 12600nnnnnattca tttcagttca gttcaattca attcagttca
gttcaattca gtttagttca 12660gttcagttca gttcaattca gttgttttat
gccgaagaga acaggccctt agttttcagt 12720tacttatatt atcgtttcag
ttagttttct tattctttca tttaacaact aactaaaata 12780aaaaaaaaaa
aactaactga aagcaaaact taattaaatg caaaaaatta agttctaaat
12840gaaaccacat acgatcgaaa tttcaatcat ttcaaacatt atggtgtttt
cgattctttc 12900aaagaaggca agctgctccc gctattctac cctctttaga
tcacaataaa gctcaggcct 12960cacattcaaa gtttcctcaa agatggacgt
tccaagtatc acatagacac atagtcctct 13020tctccaaacg ctctccttcc
tatcttgatg tcattagcaa acttcttgat ccagacggcg 13080ccaacaaccg
caccatgatc tccctctaaa gtactgacgg cccgtttggt tgttggtcat
13140aaatgatggt aatgggaatg aagttgtgtg taaatttgtg aaaaatatca
ttgtccattc 13200ccatggtaat gctaatttat cttaatgtgt ccactttcct
tctagaattt tcattctcat 13260ccaataccac cttgtaaggt ggtaatgagt
ggtaatgaaa attgcttccc cttggagaca 13320aaaatacaag tttaggagtg
agattgattg ctcatggaga aaaaaagtct ccccatggag 13380atattaaggg
tgattcccta ataaaattac acttaaaatt tattcccatt accgcaattt
13440attaacatct accaaacggg ccgtgaaagt cttgaaacac atagtcgagt
gagtagcttt 13500gaggaaccat ctgtaaaaga acctgaggga gccaatgtgt
gcgtaagtac caacggcgtg 13560ttgtcagtgg aaaaggtggt gccgtggtgg
cactcagtag tgatggagcc gccgtggtgt 13620ttgagtgttg ccaaatacaa
aggcggaatt tcgtaatctc taatttcttc tgtgaaattt 13680ttgggatcag
cctgtccgac caaacacatt ggatcaaacg gtctgaccca atagtttcaa
13740nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 13800nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn 13860nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 13920nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 13980nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
14040nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnga
tataaataga 14100gatggaggac aagggtcttg gttttgtcat gttgtcaaag
agttgaacaa tggttttttc 14160gtattgttaa aaaattaaaa aaccagaggc
cttcaatctc ttgatagata tagatagaaa 14220aggaggacaa ggccgttctt
ggttttgtta tattgtcaaa gagttgaaca atgatttttt 14280cgtgttgtca
aaaaattaaa caatgaaaaa agatggcggg tgcttgatct aatagatcgg
14340accatggatt gaaggtcttt taacttattt tatatatata ttgaacttat
ctgaagatta 14400tttaactctt tgaaattgta ttaacttccg aactttatga
acttttttaa ttcttcaaaa 14460cttatctaca ttttatttga aaaaatattg
aagacaaaaa aaccctcagt tggtttaaag 14520ctgcggtaag atagagtgta
aatgttattt ttttttatta aatcaagaaa taaaaagaaa 14580tattaaataa
aaagaattaa aaatggaaat gatgacagaa acttatggct tggaggagca
14640atacttttaa gatagaccta aaccttaaat aagttaaaat ggaagtaatt
tttcagtaga 14700atcttattcc aatctatact ccgtgtttac tccatgtaat
gcacatataa taaaaaaatt 14760agaaattaca tagtataagg tttgatcctg
tgactgtaag tttatatact aacttcttaa 14820ccactagagc aagtgatatt
tagtgttatc attttaaagt ataattttaa caaatgaaat 14880ttttttctta
cccggaacat agctcggacc taataactag ttgaacaatt ataatctgta
14940acttaaaatg atcctaatta ctgtactttc attacctata ataatagaat
cttactatca 15000ttggttcaga aaaaaaaaat cttattaaat gttaaccatt
tatttgtaat tgaaacatac 15060atgcacataa atgtaacttt tagtttatct
taacttaaaa actgagaaaa tgttagttgg 15120aaacttttgt atatatgttt
ggataaacga cgctcaaaag taggggctaa aattttagta 15180gataatataa
gattatactc catctgttct agatagactt ctcattttta attttggcag
15240tattcataaa taaaggaaat ctttcaaaaa aatttccaat atataagaaa
aaaaataatc 15300atgtgcggtt ttgtttgatt cgtctcattg tgtacattag
gaaaattaaa cttatataat 15360ttttactact atgtaattaa agatattaac
gatacaaaat gtgtattgac aaacttatat 15420tggagtaata ggaagtctat
taagggaccg aagaaatatt acgtaaataa atctaataca 15480aactaatata
aattctactc cagacaataa agattctgtc ttatattgcc aagatatagt
15540agctatttat tttatcttaa caaacataaa tgtttctaat gcttaaacat
ggacatgtat 15600tattttgtaa aatattatgt attatccaaa gttacatatt
taaaggaagt tctattgctt 15660gctctctttt agcactgccc aaaaaggtta
aagtaatttt ttttctctgt ttaaaaaaaa 15720aatgcattat atacagataa
tttttgctag tcaataaagc tatccttatg acttatgagt 15780gctacttgac
tagggatgtg ttgtactcaa ttggaggtat acatacacca agattataga
15840gcttttattt tgcctataaa aaatggaagc cggataggat accaaaaaag
ctttgactta 15900aatttgtaat gcataaaaat gatgatacct aacttattag
ccatacttat ctaagcgtac 15960gtcaatttaa atattgtgtt attgattaat
aatgatcctt atatatccat attttgacaa 16020ttaaacggta aattagagag
aaaagtttga gaaaataatt atagcttacg taatgctata 16080atccaaagtg
tctccgcaca agcgtgggac aaaatagtac tttcggagaa gttacaatca
16140acagctaggg agtcttcatt gttcttgaat agaaggatgg aaacaaagtt
caccttcttt 16200tattaaagta ttaaggtttg ttattagctc aatatccaat
actttctctg ctttttatta 16260cttcgtctgt ttcaaattaa atgatttttt
ttttatttta cactattttt aaatttcact 16320tttaccatca tttatgattt
atatgtgaat gaaaacatag ttacgtgtga tcttgttttt 16380tttttttttt
tttgtnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
16440nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 16500nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn 16560nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 16620nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 16680nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
16740nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 16800nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn 16860nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 16920nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 16980nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
17040nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 17100nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn 17160nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnngtacat caaggaaatt tgcattaatg 17220aaaaacggga gtacaaccta
atggtaatac aaccaaacta aacagaaaga agaaacaaca 17280gacagtaaga
aaacctctat aacgcgtaaa aacaatttaa cataaacact aactagaaaa
17340agtccaggcc gaataacatt tgtcttgtgc gatggggagg gaagatgaag
gagagtgaaa 17400atctgttgaa agacacaact ctgacgatct tcatagagag
attgtcgtac aactgctagt 17460agatcctgtc cactaggcac gaaatcccaa
ggtccttgat gagcctcctg ggattctcct 17520gattcgttga acaaagaagt
cactttcgag agtaacttcg gaccctgctg cgagggtctt 17580cgaaaatcag
tcacattaga ctgaaacccg aaaaacaagg tggaggtctt gacgtgcccc
17640aaaacaagtt gggggcgtcc cacaagatgc gttttttagg tgtgatgaca
catgatgtca 17700tcacgagaaa ttggggcgag ttagtttgat gaactacgcc
ccactgacgg atcctaagat 17760ccaaagtgac aatctcgaaa ccagaagacc
gacaaacaac agatctgaaa catacaaaca 17820tgaaaataat gaaagcataa
actgccaccc gacatataga gctccggcaa acaacaccat 17880caagaacttg
caccaaagac ttccttggca tactaaagac actgattcca actaacacta
17940gcgggggacg gggaagggac actcgactac acctaaacct aaccagggga
cggggaaggg 18000gaacttagac taaaccttcg taaaaagggg gggatcgggg
aggggaaaac cttgaccaag 18060gaagctggtt ttaaaaacca cttagccgag
ccaaaaaccg tgggtgggaa gaagaaacag 18120accacaaaca gggggaaccg
ggggatggga actcaccgaa caggggaggg ggagaaatcg 18180cacagactcg
gggaacgcct aaggactggg ggacgaccaa cgaacgaaag gttggggtgg
18240tgcgaaaaca agggaggggg acgcaccgac gaacaaaaaa accgacgaag
aggccgaaaa 18300agcgaaaggc cgacggagat aagattgaaa ggcgacgaaa
aaagaaaaag gaacaaaacg 18360aaagaaaaac gaactcgtcg gagacccgcc
ggagacctac gcggcgccgg atctccggcg 18420agttctaggg ttagagggtt
tgttgtgttt gtttagggag aaggcagagg tttttttttt 18480ttacgtgtga
tcttgttaga tttgtcttaa catgtattct ttaatatact ttttttttta
18540taatttttgc aaatgcaaaa ttagagatat atgtcctcta aattttacat
tcacatacgt 18600gataaataag agtgctacaa ctaatttgaa acggataaag
tatttgaatt gtttttcatt 18660taaaaaagtt cgctatcatt tataatgtta
tatatttgcc aatatgttat ctctttctct 18720ctcttaccag agtttagatc
cagtagagtt agtaaataat tctaccacgt agagttgaac 18780aaatcatagc
cattgatttt caaatcattg gtttatatat tctttcccaa aactcccccc
18840tattttcccc aaaaatcctc cccctcctta tctctttcca taaaatctga
gtcgttgatt 18900ttaaaatata aggtttggat tcaactccac tatgtagagt
tttcatcaaa ctccaccgaa 18960tccgagcccc tcctaccata gtacttcttg
atttccccat atttctttcc tcatcttggt 19020cctcaagcac attttaatat
tatgggtatt aaacaataga gaaagtattt acttatagag 19080aaagtatttt
caatgattcc ctaaattttt ttttgaaaga aagaaaaggg atttcattaa
19140tatttcgcca aacggcactt acaagtcatt tctgaaaaac ataaaattct
aaaagaaata 19200catatcaccc tagaaatgta aacatcgcag atttgactta
attttgcctt aataaaaatc 19260ttcatctgaa gcaatgcaat ctgtgagttc
gctctggttc ggcatacgat ctgcagatgc 19320ggaaattttg ggatgaacgt
actccaatag tcttccataa ttttacagaa gttgtgaaac 19380cctaattctt
catgttgaat ctcgaacttc aaccaatgag aataatttct catacctaaa
19440aacaaaagaa ccatactcac aactcccata ggggagaagg agatttccaa
aacagaaact 19500aaaaacccca taaaagggtt tgagaaaatc tcataaagag
atactaattt attgaacaaa 19560acaagaaaat gaactaaaaa ctgaaaataa
aagggaaaaa ggggcttacc atggatgaaa 19620acatccatgg cagcccccta
attgatgaag aaggggtaag ggaggctagg gttttagaga 19680gagaaaagga
gaggggaggc taggttttaa aaaaaaatat aatgattccc taaatttact
19740tatatatatt taccaagatg acgtgatgtt ttacaaggcc catgattttt
acgcgatcat 19800gaaaaacaca gccaatttga atggagcaaa tatctacgcg
tcattttaga tatttttgta 19860tgggaaagtt ttttttgacc aatgtaatta
ttaagaagca tcggccaccg ggtagataag 19920atgtcactat acatcctttt
ccaaacttaa gtatgcctgt tgaacttttg ttgcgtttgc 19980agattcattt
gaaattatat ttcctcagat cctctacttg taaaagaatg ttccattatt
20040ttcttagttt acatgatatt tacaatagta tttgtctaca ttttgttcat
attacttagt 20100gatcagtgta tacgtcatat attagtttga actttgaaga
catttatttt ctatatactt 20160cctttgtctg ctaaattact ttggaaagct
ttgttttttt tattaatata agaccctttg 20220gagtttggaa atcactatct
aatgaaatat ataattcatc attagaacaa aaatacaaat 20280atcgtactat
cacctatcat gttccttttg gatttcgctt cacaaaaata cattttaaaa
20340aaaaataaaa taacaaatgg tagctaacaa cttattactt ttaaaagttt
gtgtgcaccc 20400taataagtac tcaaagtagt atgtaacaga gagagtataa
tgctaaaata caaactaaat 20460aaacaagaaa gtgtttctca acaataattt
gctgcaggaa ttaggaaaca aagtaaataa 20520attgcatgtt tatcatcaat
acaatttact ggtagttaat tacaaacttc actcatgata 20580attgaaagag
gccactcaat ttcagctagg agttgtttat ttatttattt ttctttcagt
20640taaattttga ctacccacaa aatcttcatc tggacctaat ctgcaatttg
tggattttgg 20700atgaaatttc taacctattt aagtagtctt attgtttaaa
taacccatgc aattaaatta 20760ggttatatgg gggtgattca tttaccaggc
ccaagatttt atctcattct caattattat 20820cgcaacaccc atgaacctaa
gccaacatga cttatttacc aggccagcta gagaagaaca 20880aggttgctga
ttttcttgtc cgtgattgta gaagaaatgt tagaaatcta aatgttgtta
20940gggatttacc cctcccccct actgagtgta tgaacttatt attgacggat
tgttgtaggc 21000ttccaagcca aaactctgat taagttttct tttatgccat
tttaaccaaa aaaaaaaaaa 21060aagctaggaa gctagctcag cgcgctctaa
ttatttcaca tgtgacatgt tttacactta 21120ttcatacttc tatatgcagg
agagggcaat gtaggagcac aataacatcc tgtctaagaa 21180ggtacttgca
cttgaccagt ttgtgtaata ttgtaattta atttcttaga ttttggttgc
21240atgctttgat gacgaatgac gattgacgaa tacattttta tgcagatcaa
ggagagagga 21300aaaaatctag agtaagtgta atagatgtag tggtagaact
agcactagca ctagcactag 21360tagtagccgc caccgccgcc ataaaatcat
taagttcctc ctgatgcatc aaatttcatg 21420ctcccacctc caattccttc
tttgaacacg gggtagttac ttcttcaact taatttcctc 21480tattcaatat
taagttaaga aacagatcac gtgattagtt cgttaatatt gctaattaat
21540aatcatattg ttatatatca tgcattagtg ggtactaagg ataatttggt
ggagaagtaa 21600ggaggaatga tcttgacctg acgctagaac cgatatactc
atgtcacatg ggatgcttta 21660caacatga 21668
38
29237
DNA
Artificial Sequence
BvFUL genomic DNA showing sites for mutagenesis
mutation(1)..(1) any nucleotide except a
mutation(2)..(2) any nucleotide except t
mutation(3)..(3) any nucleotide except g
mutation(19)..(19) c to t
mutation(52)..(52) c to t
misc_feature(1029)..(1924) n is a, c, g, or t
mutation(19373)..(19373) c to t
mutation(19552)..(19552) c to t
mutation(19561)..(19561) c to t
mutation(19689)..(19689) c to t
mutation(19704)..(19704) c to t
misc_feature(20703)..(20722) n is a, c, g, or t
mutation(28161)..(28161) c to t
mutation(28185)..(28185) c to t
mutation(28191)..(28191) c to t
mutation(28194)..(28194) c to t
mutation(28447)..(28447) c to t
mutation(28450)..(28450) c to t
mutation(28465)..(28465) c to t
mutation(28468)..(28468) c to t
mutation(28507)..(28507) c to t
mutation(29166)..(29166) c to t
mutation(29196)..(29196) c to t
mutation(29217)..(29217) c to t
mutation(29226)..(29226) c to t
38
bvhgggagag gtagggttta gctcaaaaga
attgaaaaca agatcaaccg ttaagtgacc 60ttctccaaac gtcggattgg attgttgaag
aaagcgcacg agatctccat tctctgcgat 120gccgatgtag ctctcatcat
cttctccact aaaggcaagc tcttcgagta tgcttctgat 180acctggtatg
tctaatttta taacttcttc ttttgtacat caataatttt atcatcgact
240caactaaaag cttaagcaga tggttagggt tctattatta ttgaattacc
tcaaatttgt 300catcgactca actaaaagta gagtatattt catgtagatc
aggtgctttt tttgaatata 360ttgtcagttt tagaactaca aaatgttgaa
cacaagtatt tatacgcacg ctgacatgtg 420aattttttaa ttgacaactt
tctaaattaa tactctaaat tactaatatg aagaacgtaa 480tttattattt
atcactttca gacaaaggca tgtttgtttt ttctattatt tttcccatga
540aaattctcac caatatccga ttctgtatgt taattttagt aatttctaat
tttgatgact 600taataaattg taaaaaagta taaaataaac aaatatccaa
aacatctttg ttttcaagag 660aaatatctta aaaacttttg ttttttaaga
gaaatatctt aaaacacttt ttatcatact 720actatgatga tgtataaatc
tattcaaaaa aaaaaaaaaa tgatgatgta taaataattt 780aaagagttaa
gtttattaga aattatagat atttatagag ctgagtaata aaataatact
840ctacagatta tatgtagctg atgtagtgtg tctgctcctg taagatttcc
tttttatctc 900caaaaaaatt gcattgatat tcgagccttg ccgaccccct
tttcctcttc aaccatttga 960taagatccta tgcactgagt aatctagtat
tatatgttag atgttatata ttaataagct 1020aaaattgtnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1080nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
1140nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 1200nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn 1260nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1320nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1380nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
1440nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 1500nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn 1560nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1620nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1680nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
1740nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 1800nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn 1860nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1920nnnngatgat attatgtatc
attattatca gtcatttatt attgatagat aatgttatta 1980tttccataga
tcatatataa tcttaccata tttactccct atgattatat atttctatct
2040attaacatag taatagtgat atactaacaa cgagtatctt tcaagtaaat
aaatcatata 2100tatagggtta gaaggtaaaa aagacaatat tttatgagac
tttatttaca cccttcgtct 2160cataaactcc tttctatttt ttgtgttcac
tcgttttcta aagttttttc tacttctttt 2220actttttatt ggataaatac
tttttccgtc tgtatgtgat ctacttatat tagtatttat 2280agatagatta
tactcttaaa gtattacatt cacaaaatca tgtgaattat taattaaagt
2340aaaattatga gtatggtatt ataacttaaa cgaagtagtt gtattgttta
aaccaataag 2400tataataact tataaaactt aaaagttgat tctacactta
tcatgcactt gtgttttgat 2460gggttaaaaa ttagctagta cataagtaac
ataacataaa cttatctctg tatagtatgt 2520tgaattatta ctttatattt
gaaaagaaca aagacacaat aagtccaaaa gatccgatct 2580tttgattatc
aactatgtaa gtgtctttcc taaatgatca atcaccttaa ttagtatact
2640aaacaggaca attatgacat ataacacttt tctatttgta caagttaatt
aacccattga 2700cattcattca gttagcctac atttttcaat aggtagtcat
catcattctt tttttaacca 2760atttttttac aataaccatt acctaccaac
aacattttca aataatgaac tgaatgatgg 2820attccgtcaa ggaaattgtt
cgtggacagt ttgtttacct cccaaatttc ctttaatctc 2880atgctttctc
catcatccaa acaatctaat accaatcgtt ttctctaatt gcataaaccc
2940taattgttga atcctttaat cttgcctttt cacattgccc aattccttta
gtatgatatt 3000ttttattcga taatccctga taagtaaatt cactaatcta
atttcagatg gattgttgta 3060gagattgggg aattgaagat tttttgcctt
ctttgttctt gtttacaatg aaggatgaag 3120tttcatggaa tattgaagag
aaatttgaga aaagaaagaa agtgtgggct ttactgacga 3180caaaaactgt
cattttggtg gtttttttca ctaaggcatg tttggcacta gcgtttaagg
3240tagcggttag cgatttgaca agatcaaaac gctacttaag aaaatgatga
gtgtttggta 3300agatagtggt tgttgtagca ggtagcaatt agagtagatt
atgagtagcg gttgtggaat 3360gttactacaa gtaacgtttg agatttagag
gtagcagtcc agcaaagaaa cattgtataa 3420tagcataagg tagaattaat
taaccaatgt ttttttattt ctttttcctt ttattgttta 3480ttaataattt
tattttatgc caattcaatg tttattttac aatagcaaaa atgtaattta
3540aatatttatt ttaaacataa taaattttat aagtatttta gtaaaattgg
taatataaat 3600ttttcgaggg ttgaatattt tcttagatgt atacatttct
ttgaatagtt aaaaaatcat 3660atctcctttt tgtgtaactt ctttgaaaaa
taaatcttga atttaactat tgaacgaaca 3720tatgaaatta ttgtagttca
tttcatatta taataaatgt taggtattgt cttttacaat 3780ttcaactact
tttggcaaac aattctgatt aaacagctat tttaatcgct gatcgttgac
3840agcaaccgct aacagctact agaaccgcta cttttgccaa acatgcctaa
accaagtaat 3900atcggggatt ctttattata taaaaagtta ataatgtgat
tatttgaaaa aaaggttgac 3960tttatatgtt gtttaaagaa aaaaaaacat
ttattaatag aaaatatact atataggtct 4020ctatataccc agggcgaaat
gaatgtgtat cttttcaaat gagtatgcgt acatgttctg 4080taaatgcata
tttcatatga gcataatgtt tttactatta ttatgcacat ttgtgtttta
4140atttttcaaa tgagtatgta aggaaaacat gtattcttgg catgtcagtg
ttagtgattt 4200ttgttgttat ataaatgttt tcgtgatttg tgaatgtggt
acaaacattc atgtgccatg 4260gcgtttagca aaacttttct agctcacatg
atgcttcaag ctaattgcaa tgaactaata 4320taagggagag gacattttac
gaattagttt tacattgata gtagtttgaa gaagatagtt 4380taggagatag
tttgttggaa tagaagctat gtgttagaat tagttatagt catcaatttt
4440tgaataagac tcattattat ttcaatcctt ctacctttta attactagtc
caactctcac 4500tctttggtat taaattacac cattcctacg gactatctaa
caactctaac cacggcccat 4560actttttctt cttaacataa aataatatta
cgcttactaa ctactaactc ctatgacatc 4620taccttttca ataaaataac
aattgataac tatataacaa cttataatct aaaagtaagt 4680gtcttgataa
gtgtagagta ttgtgggacg aagggagtaa ttcatagcaa atactatcat
4740agcaatacaa tactaggaga ctaagatgtg agttttgaac ttcaaaaaaa
aatatacgcg 4800acatagttca ccggaagaac tgcagcacaa caaatgcaaa
tggggattaa atgaggagtt 4860cacctacatc acacacaaga gcgattgagg
attttcagat ctggaagaag agcgaaaaat
4920caggcgagag cttcaacttt cggagtttaa tggaagagct aatgatgatt
gaattattca 4980gttggattta tcttttgtaa gatgagggga tgaagttgaa
gattgccata ataacccttg 5040agggcgacac aatggtggtg ggaatggaaa
tatggaccac gaccgattta tgagtggaaa 5100gaattgaagt ctctgataca
atgacgtttt ggtatatcat cggtgctttc atggctgtca 5160gatctaccga
aaggaaaagt agtggtgaaa acaaatgtcg ttcaaaccaa ccacctttga
5220ctttcaatat tgaagtgaca aggaaaactg gcaccttagc tagtgacttg
gtggagtctt 5280tgcgtaaacc ggtggaaaag aagcttatct ttgagtttta
gtgtggtctt ggtgcttgtt 5340taagtggctt gactatttag gaggaaaaaa
atcaattgtg aacaattggt gagtaaccta 5400aaaatcaatt gtgaacaatt
ggtgagtaac ctaaaaatca attgtgaaca attggtgagt 5460aacctttaaa
ccagactcaa gagtgggaaa gaggcggtta gacaagtaga tttagaggaa
5520gaggaggtag aaaatagatt agaatttttt ttgggagttt tcaaatgtag
tggtatcggc 5580tcataacaag gtggtcaaat ggatggagag cttggtcgtt
tgagtagtct catggtggtg 5640attataacta cgttggcgat aaacaagttc
gctttaccaa gaagttgatt ggtggaagag 5700agattttcat aaagaggggc
tcgtatagca acagctttgt gtggtctcag tttcattgtc 5760ggcaaagccg
tgggtggtaa atatggtgat tttggagatt gttatgatgg agggagttat
5820ggtggtttca tgagacatgg atgctatggt ggtgctgtga tcggtggaga
agaggagctc 5880gtgaaaggtc actctgatgg agcgggtgta atggtggtct
aggttttttc ggcatcggag 5940gttgcgtatc tcgtgaaggt tcacatttgt
acaccggtga atactatggt ggtgaactag 6000gatgacagtc aaggtgacca
tagtagaaaa aaaaaacata aaccatgtag cttagatgat 6060ttgaaaaaaa
atcatgtttt tggagaagaa tcttaaatat tatgacagag gcaaacttgt
6120cattgaccaa atagatgaca tagcaattgc acgtgtctcg ttaaaatttg
aaaccataaa 6180aaattcaaat tgcacttcat atgcctttct tttggttgaa
aacttcatat accctaatgc 6240gtcaatatgg ttctttttcc aaaaaaaaaa
gtaaattatt tcggcgttag taaaagcagg 6300tccacctcca taatccattt
tattaagcca actcctctac cctacttttg caacctatca 6360tttcttattt
tctaaaatca tatcaaaaaa acaagtgtga acccaaaact aactatatta
6420tacctaagtc taatttcttc atccaatgtg ttcaacccca tttttcaacc
cttccactca 6480taaacccatc ttctttcact cctaaaactt tcagctcacg
ctcgtatcac ctctttactc 6540acatatcagc ccaaacgatt tcttattgaa
tccactaaat tatgtatatc gattttttca 6600ccaagtactc cgagttttca
aaaaatttac ctggtacccc caagttttca aactacacgg 6660gataccccta
agtttcaaac taatacactc agataccctt aatgactaac gacattaatc
6720gccgttagtc attaacctta attttctaga tttcaaccta attaaccact
aaccctaacc 6780ccaaccctaa ccctaataat aaccctaaac ctaaccctaa
ccacccctcc ccaaccctcc 6840caccacccct gccccccatc ctccactcct
gcgcagccag caggcccccc acccatttga 6900ttttaaggaa gaaacacgta
tcagggaagg gggagaactt agctctgaca gcggcaacgg 6960accaccgacg
agtctggact gtaggacggc agcgtcaagg cgcgagccgg ttgggctact
7020gcagttcaaa gcaaacaggg gaggggaatc gagccgagaa gagagctaag
gaaagagggg 7080tgatgggcgt tgcggagatg gtgcgcatag tggtggtggt
gtgggggagc tgtggtgggg 7140gagaaatcga atgggtgggg gagagagggt
ggggggctgc ggtggggttg gggctggagg 7200gggtagggtg ggtgggtgga
ggggtggtgg ttgggtggta gtgggacaac tctcaaacat 7260gattctttct
caaacatgat tctttctcat atcctttttt ggatttcctt aaaaaaatcc
7320atatccaaat aacgttgacc ggtgagggac aactctttaa cactattctt
tctcaaatct 7380tatgaaatct tcacatttaa ctcgatctcc ctttcaatga
acctaaagat atatattaat 7440agagttgcaa attcctaaac tctagaaaat
tcaaatacaa cataagagtc ctagattctt 7500ccacaagatg tattatatct
ttcaaagttt cccagataaa attagtaatt aggaaactcc 7560ttaacaagga
tacttaaagt tttatctaaa tcttgcataa attgaaatcc aaccataatt
7620atgaataaat aatcataaag aatcctaaca taaataacta gaaaataaga
taataaagaa 7680gcaacaaaag aatctcataa ccaccatttg aatcccgcat
gagaacccaa atagttgttg 7740ttccttataa aaacccacca cctttcttcg
ggtattatga cggtattgga ctatagtatg 7800agacgagatc tcttaatcac
caatcaacta ttgtaaactt gtgagcctga ataatttatt 7860tgagatacaa
ttctaaggtt gtttatgaac gtgtttggta aaattgttat tgataactcg
7920ttcgtggaaa ataaatgcaa aagtcaacat gccaaaaaaa gtgctaaaat
caactttcgg 7980ctttgcttga aatgttaagt tttaagctat ccaagagcca
ttagtcaaaa tctattgaaa 8040gcgtactcaa aaaccattta tcaaacaccc
ctacaaatcc ctttagaaaa caataggagt 8100tgtacaatat aagtattgag
ttataaagtt gatcaagtga tttaggaggt tgttccaaat 8160caatctacaa
gagtttgtat acttataccc cttcgttttt ttaattgtta cacttaggcc
8220ttgtttgaca aatagagttt agcggttaga gtttagattt tgctgttaga
gttttaactt 8280tttgttaaat agatttgact gctgatttga caacttcttc
ttataaatgt gtttggtaat 8340tattaacaga ttgctaaaag cttattaccc
tttatttatg tgaaatgaca tgtatagaca 8400ttttaatcca catgggtatt
attattatta ttattcgagg catacaagtc attgaatata 8460tttttaccta
atcctctaaa gaaaaagctc ctattaggag ctttttcatt tcgagagttt
8520ttattccaga actctcttca aaactctttt ttaccaaaga ataggagctt
tttcatttca 8580agagttttta ctccagaact ctctttaaaa ctctttttta
ccaaacaccc cttttagagt 8640ttttgactag tcaaaactct aaaagtggtc
caaatttctc ttttaactcc aaaactctaa 8700ttgccaaaca cccccttaca
cttttcacgc ataccaatgc aacactttga cgattaacat 8760ctccagtttt
ttatttgtaa aaattataaa gagtgcatat taataagtag ggctgttcaa
8820agtgcggtct ggaccgcacc aaaccgcaac ccaaaccgtt gtttcgcggt
ttggtttggt 8880ttgcggtttt aaaattgcgg tttgggttat gatttcaagc
aaaccgcggt ttgcggtttg 8940ggttgggttt ttatttttgt aaaccaaaac
cgcaccgcaa accgcaatgt tacatttttt 9000ttaaaaaaat aaattaaata
catttatgaa ggtgacatac aattataaaa ttgaaaaaag 9060aagtttgagg
taaaaaactt taacacttat gataaatcat tatatatgtt taattatgaa
9120ttcagcttca tatctatttg gactcttatt aacaattttc ttttaatctt
aggaaacaaa 9180agtaatgtcg cggaggaaat aatggttaaa ccgcaaccca
aaccgcacca aaccgttttg 9240cgcggtttgg gttgggttgg tttgggaaaa
agtgcggtgc ggtttgggtt ggaaaatttt 9300caaaccgtat atttgcggtt
tgggttgggt tacatcccaa accgcacaaa cccaaaccgc 9360gaacacccct
attaataaga tacatattaa ttcgaatttg acaagatcca catgactatg
9420tttttattcg cgtataaacc acaaaagaag gttcaagtaa aatttgtgta
tggtgtaaca 9480tgtcaatcaa agaacggagg taatatttgt caagacactt
tagtcacttc taaattccta 9540taaacaaaga aatatggaag aaaactggtg
atgaaaattg aaaaggtggg tataataaga 9600gagacacaat tctaaaataa
gaaaatatta ataataaaat aataagttac gataggcctc 9660atgtttgaaa
acggaaaaaa taaggagata gttcgtgtaa aaaggaggga gtaaagggta
9720atgcatactt tgtattgcaa gcttagtttt aaaaggcata agacgcaaag
cgcatcgagg 9780cacaagacga aggcgcatgc atctcgtagt tgaggcgtgt
aatgatttta cttcacaacc 9840acctgagcaa cccaatacag aacgaccaca
agaaaaatag aaagaaagga aatgattttg 9900attgatcagc agaaaataca
gagcattcga gaggctcagt ctctcccaag gactacaaga 9960tactactaaa
tttcacaccc ccttcagtcc ccttacaccc ttatttatac tacttctgct
10020ctcctatttt aacggctact gacattctct gagctggcct gctattcctc
tttttgtgct 10080gacatttctg aatattctgt ggtagtggct ccattctcac
atttggacag gtttacccct 10140ccatttcttt cgtgcatacg tcagccacgg
tttggggatt gaattcatta cattaccctg 10200cccccaaaga ctcaccttgt
cctcaaggtg gaaggaaggg aaacgttctt gcaccaaatc 10260tgcatcttcc
cacgtggctt caaaaggtgc taagtccttc catttcagta gcacttcagt
10320ctgcgtatgc ctccctctct gtgtctgacg tacgtccaat aattcttcag
gttccacaac 10380tagttccaag tctgctgcta gttgagttgg tacagtggtt
gctgcctggg catctccaat 10440tgctcgtttg agctgggata cgtgaaatat
aggatgtatt ttactggtgc ctggtaactg 10500gagtttatag gcgaccttgc
ccaccttttg cagaacaggg aatgggccat agaagcgggc 10560tgccagcttc
tcaaatggtc gcttggccaa ggattgttga cggtatggct ggagctttaa
10620gtaaaccaga tcccccactt caaaggactc atcgcgcctt cttgtgtcag
cataggcttt 10680catcttttgc tgggagcgta gtagatgaaa gcgtaaatca
tcgaggatgg catcccgttc 10740ttgcaacact tcctctaagt tatctactgg
cgtttgccct ctgcctactc gccacaagtg 10800tgtgggtcac gcccgtacaa
caccctgaat ggagtcagct tagtagacat gtggggagag 10860gtgttgtgtg
agtattcagc ccaagggagc cactttgccc aagtcttcgg gtgccccgcc
10920acgaaacatc tcagatatgt ctcaagtcct ttgttcacaa tctcagtttg
tccatcggtt 10980tgcgggtgat aggcggtgct tctccttagt gttgtccctt
ggagtcgaaa caactctttc 11040caaaaagtac tcagaaaaat tcgatcccta
tccgaaacga ttgatgccgg aaacccatga 11100agttttacaa cctccctgat
gaaagcttca gccacttgtg agagcactaa aaggatgacg 11160aagcccaatg
aaatgcgcat atttcgataa acggtccacc actactaaga tcgtgtccac
11220ccccttggac aagggcaatc cttctatgaa atcaagagtg atatcctccc
aaacctgagc 11280tggaatggct aagggctgca gtaggcctgc tggcttttgt
tgagagctct tatgctgttg 11340acaaatgcta catcgctgca cgtacaatgc
cacgtgcttc ctcataccta tccaatacca 11400ctcagccgcc aacctaaggt
acgtttttac ttcacctgca tgtcctcctt ctggggaatc 11460atggtaagct
atcatcaact taggaatgat gacggaagtg ttgggaatta ccattcgccc
11520cttataccgc agcttgccat cctccaccgt gaaccccaca agtggtttat
ctccctgcgc 11580cacttcttcc ctgagccttt taaggaacca atcctcctct
acctcttttt ggagctctgc 11640ccagtccact ccttgggttg tgattatggt
ccctagctcc atttcaccta cagtttttct 11700agaaagcgca tccgcaacct
tgttggttgc ccccggtttg tagtgtattt caaagtcaaa 11760cccaattaat
ttgcttaccc atttctgaaa atcagccccc acttcccgtt gttgtgtgat
11820gaaacgcaaa ctttgttgat ccgtatggat cacaaatctt ctccccaaaa
ggtaatgttt 11880ccatttctgg accgcaaggc atatggcaat taattccttc
tcataaacgg acttgtgttg 11940tgctctcggt ccaaggagct tgctgtagaa
tgcaatgggc ctgccctctt gcattaggac 12000tgcccccacc ccatacccag
acgcatccgt ctcaactacg aaaggcttat ggaaatcggg 12060catagcaaga
accggtggct gggtcatagc ttcctttaag tgagagaaag ctgaagtagc
12120tttttcggac cagccaaagg agtccttacg caattgctcg gtaaggggct
gggcaatttg 12180cgcgtattgc ctgataaact tgcgataata cccggtcagc
cctaaaaatc ctcgaagctc 12240cctcaaattc ttgggaactt cccactccac
catggccctt atcttctcca tgtctactgc 12300caccccatgc tgcgaaattt
acatgcccca agtaggccac tgtcttcctc cccaagtcac 12360atttcttctg
gttagcgaac agtttgtgca atgctaacag ctgcaacacc aatcccatgt
12420gtcgtgcgtg gtcctctttg gtcttactgt agaccagaat gtcgtcgaag
aagaccagca 12480caaacttcct cagatatgga cggaaaacgt tattcatgag
tgactgaaaa gttgctgggg 12540cattggtgag cccaaagggc attacgagaa
attcgtaatg tccttcatgg gtgcgaaaag 12600cagtcttatg ggtatcctcc
gggcgaacta aaatttgatg gtatccggcc ttaaggtcga 12660gtttagagaa
gatggtagcg ccatgtaact cgtctagtag ctcatcaatg accggtatcg
12720gatacttatc cggaaccgtc tccttgttca aagcccgata atcgacacaa
aacctccaag 12780aaccatcttt tttcttcacc aataatacgg ggcttgaaaa
tggactagtt gagggcttga 12840tgatgcctgc ctccagcatc tctcggatga
gtctctcgat ttcgtctttt tgaaattgag 12900ggtaacggta tggcctaacc
cccaccggat tactgccttc cttcaacgtg attgcatgct 12960catgccccct
ctttggtggc aggcccaccg gagtatcaaa aacttccgca aactgactaa
13020ttaccttctg taaaaattcg ggtacttctt gtgcctcctt caactccgct
tctccccttt 13080tcccatcatc ctcaatctgg ttgagctcca agagaaaacc
ccctttttct tttcggattg 13140cctttatcat ggctctaagt gagattttag
atcttgctaa ggaagggtcc cctctcaatg 13200tcaccactct gccctccact
tcaaactgca taacctgagt tttccagttg gtaatcactg 13260accccaattt
ctcgagccac tgcactccta atattaaatc tgagttaccc aggccgagag
13320gtaaaaaatc ctctgttact tcgatttccc ccagctttaa agtcacccct
tgacacaccc 13380cagtaccatg gacagcttca ccattcccta aagacactcc
aaatccccct gcatctgaga 13440tgaccaactc aagttcctca acagttaaca
aggaaataaa attgtgagtg gcacccgggt 13500caatcatgac caccacctct
cttcctttaa tttttctagt gattttcatc gttttaggac 13560tcatcaaacc
aatcacagag ttgagagata cctcagtagg aagttccggt ggtggttcgg
13620acggtggtgc acgagctgcg tcgctcacct cttcggtttc ttcctcctcg
tcatgcatca 13680gaatcacgct gatctctttt ctccggcaga tgtggccggc
ggtccactta tcgtcacatt 13740tatagcacaa cccttttgct ctcttctctt
gatattcttt ctcggaaagt cgcttgaact 13800ctacagattt ttttccttgc
cccccagcaa ttggatacgt gttcaaggtg ttggaatttc 13860cccctgggtt
ttgggcccac attttgctgg caggtgggtt gaggctggtc gttggattga
13920aggaagcccc tacactcctt gtcattcccc ctctgttata aatcgagtaa
ggcccattct 13980tagttggccc acttctttta taacccacaa tcctattcct
ttcctcaatt cggcctgcta 14040gttccattgc ttgctctagg tccataggat
tgagtaacct gacctccact ttgatatcct 14100cttccagccc attaatgaac
tgacccatga gtatttcttc tggtactcta ctcaacggtg 14160ccgccttctc
aataaaagtg cgtcgatact catccaccgt ggtggtttgc tttgtggcca
14220accaccgttc ccacaatgaa ccatagtggg ttggtcgaaa ctgacggagg
aggtactcct 14280tcagatctgc ccaccacctt atcggccgcc ttttattctc
ccactggtac cacctgaggg 14340catccccctc tatagacaca accgccgcct
ccagggcttc actgctactc aggccataaa 14400acgaaaaata tcgctcggct
ctaaggatcc acccatccgg atcggaccca tggaaaatgg 14460gcatctctaa
ctttcgatat ttccagttcc ccccggaagt cgaacctcct ggccctccac
14520ctgagccgcc atccccatac gatcggcccc cgagctggaa acctccatat
tcgtcccctt 14580ctcggccccc cagatttatc aggtcaggag ccctccgttc
cggcggtcgg gggtgtccag 14640tgacggtttc gggtgtctcc cgcgagggtt
gtggcaaggt tgctcggatt tcaacctgaa 14700atttcctctg ttcagcacgc
aacccttgaa tcgtcccatc ctgtgtctcc cttgaccggt 14760taatccggtc
ctccagccta acggccaaga cctccatctc ctcgcgactc ttcctcgccg
14820cctcattctg gccttccaag atttgagcgg tcaacgattc ccccatggca
ttaatggcag 14880attccaccgc cctggacacc atggtggcca ctgacccttc
gagggctgcc attctttctt 14940ctaatgaatc caccctctgc acttcgtttc
ttggtgccat ggatcgatgc tctgatacca 15000agtgtaatga ttttacttac
ttcacaacca cctgagcaac ccaatacaga acgaccacaa 15060gaaaaataga
aagaaaggaa atgattttga ttgatcagca gaaaatacag agcattcgag
15120aggctcagtc tctcccaagg actacaagat actactaaat ttcacacccc
cttcagtccc 15180cttacaccct tatttatact acttctgctc tcctatttta
acggctactg acattctctg 15240agctggcctg ctattcctct ttttgtgctg
acatttctga atattctgtg gtagtggctc 15300cattctcaca tttggacagg
tttacccctc catttctttc gtgcatacgt cagccacggt 15360ttggggattg
aattcattac aaggcgcacc tttggtacca agagacttag acatcaaata
15420aagaagcatt gtaatactta gaataaacat tgtttttata ttctaaaaca
catattttct 15480ctaagacacc tagtaatctc atagtggatg tcatctagtt
taggtggtaa ggttttattg 15540agttggtact caagacctca gacttagagg
tggcaaacgg atcatacggg tcgggtgaaa 15600atgagtcggg tcataatcgg
gtcacctttg tgtccaggtt acggtcaggt cgagttcgtt 15660cgggtacgag
ttcatattga gtccatgagg tttcatgtca tatcgggtcg ggttagattg
15720gatttacaat ttcgcaaata aataaaacgc atataatact aaagagagta
aattaaataa 15780ttaacggaca ctagctaaat catatattag tattttatga
tgtattttcc ttaaatttat 15840ttaaaaaata actaatatga caatttttcg
ggccgggttc gggttgtggt catcattatc 15900gggtcaattt agtatcgggt
aggctcgggt tcatgtcata ttcgggtcta ttttaattcg 15960agtcgggtta
tttcggattt aagctctatt tcgggtcagt attttcgatg aagaacgggt
16020ttcggatcgg gtcaccggat acggatctat tttgccgagt cagactgctt
ttcaaaccta 16080ataatctcag tttttccacc tattcagatt tgcctatgat
ctctatttag ataaatgagt 16140acaatattgt gtctatccat gaaaatgaat
atctcacaat gtaaaaggat atctctaaat 16200ttcactaatc actctatctg
ttttgaataa taatattcta ttttattgca tgtagtaaag 16260atcgagtatt
tagtgagatt tggagaaaag aggaggctag agagaaacta ggatttagag
16320aggagaaggg ggctctgtaa cacatacaag atagatactc ttttacacta
acttttcaag 16380atactcaaca tataaaatca gcatcatctt ccaaacaaca
actttaagcc acccatgaat 16440cttaattaga taataaaaca taatcgtgaa
tcatctatcc tttgtttggg gggatcctaa 16500agcaattgag gaaaagcttt
gatgcaaata tcaattgtgt aaaaaagcaa gtattcgttc 16560gtgatgttgc
tatactaggt tattttttgg atccaaaagt cattcctact agaatcattt
16620aggaaaattg tcagtatgaa ttttaaattc aggttataac caaagataat
tgaaaattgt 16680caaacttttc aaataattcc gaaataaaca tgtttgtaac
atggataaac ttttcattgc 16740ttttcaaata attccaaaat aaacatgtta
taacatagat aaccttttca gataattcca 16800aaataaacat gttgtatcat
ggataaactt ttcattgttt ctagtcactt aaaattctaa 16860aaaaatcttt
cctccctact gttactctct ctagcaccaa atctatcaca tgagaaggca
16920gaggttttca aaataaaccg ttacttaatt tggtacttat ttcttgatcg
gtgttcatat 16980catatgagtt cctactctat atctctctac tcttctaaat
ccttgtgtca cttcctgtgt 17040ttcataaata aaaaggagga agtattagtt
ttgaaacgaa aggagtatgg tgcatacatt 17100gatagaaaaa agaagttatt
tgtccttatt tcactcatat aacaacacca aattctgtat 17160tgttatcaca
aaataaaact tggattatct ttgtttcata gcccaaattt agaattagtt
17220tgtcagattt ccaatcatct aattacaata ttagagctag acctaggaca
aaaggtgggt 17280ttggctactt ggtaatagct atgtctagtg ctaggatatg
tcattgtcgt agaaccatgt 17340tatggacatc ttaagaaaca aggttaacct
aattggttgg agatcctact ttcactttta 17400taataaagtt tcgattcttg
cctatttgta aagtagaatt cctaaatttc ccttcactga 17460tatttatctt
aacataaaaa aatgttataa acattgggat tgtatataag tcaaaataaa
17520ttgacaatct tggtaacaac taagttaaca ttaattttat aagtaaatga
ttaatcccaa 17580tataatctct tatttagtaa atgagacaaa cttgtacacc
ttcgtgttag actcgttaat 17640gttcgctaac aattcattca gtagtcaaca
gcattttaaa tttgaaataa gtgttcttgt 17700ggtttttgag agatcaagca
agaaaacatg tctctcccct ttgaccaact aattgggatt 17760aagaatacta
gttttaagat tttaagaatg agttatagtc tttcttagac cgctacaatc
17820cccttgttga tatgaaccag atatattttg tgttcaaata gtagatcaat
gcattgttga 17880taatcctttg ttaatgtact tgttgatctt attttgtact
tttggtagat gcgctatact 17940ttctttcgat tgctcatttt gaactcttaa
ctacatatgt tagtttaagt agatgattta 18000gaattgctat ttcaatcttc
aataagcaat ttaagttgtc aaaccttgtt tcacatcatt 18060agggtgaaag
ttatttggat aaagacctat atctaattca atccaaagca aattagtaat
18120gcggattgga ctcaaactat gtttagattg gattcgaatt gagtttcttt
tctttttctt 18180aaaaaaattg gattttccga tcgaattgag ggtgattaga
tccaaaataa ccgaatagta 18240gataggattt gtgttgtata ttagaattgg
gcttaaggat ttccatttta acaaaaaaac 18300caaatggtcc gactatcaaa
aactataatt tgatagtcat gcctatcgaa aactttattt 18360tcattctcgc
acctaattat gggcttgtat aaattagttc tactatcgaa aactaatttt
18420gatgctcgtg cctaatttaa attttcgaaa aaatgaagtt aagaaaattg
gatatttcgg 18480attggatcca atatatcttg tgaattatta atttggatta
gtttggactc aaattcttat 18540tggattggat tcaaattaaa agattaaaat
tcaaattttg ttcgaatcaa attggagtag 18600gcttaagttt aaatcataca
ccgaactttc accactaccc atcatgctta agcttctaat 18660gtaagagagt
gtttgggagt tgagctcgaa caactaaatt tctaaaagaa ccaagttcaa
18720acaagaaatc taaaagctcg attaaacttg agtcaagctc aaacacctat
attccttatt 18780ggagcttgac tgaagattga acacttattc cttattaagc
tttacgctaa aacattgctc 18840gactcaactc ttctacatcc ctatagttca
aaagaaatag ttgtgggctg tggtgctctt 18900gtagaccaac gcactagttt
aacaaagcta agtgcctgac tgcaattcca tacacattac 18960gatcaccatg
acctagtttc agctcacact ttggaagtct aatttgaact tgttctctac
19020ctccaattca ttgtggggta ggaggcgata gttaagggat caaaatctta
tgatataact 19080tgcataggct atgccactat ataatgcgtc ttgtgtccca
tattagttta atcaaattga 19140aatgttttac catttatatc ttcaattatg
tatggatact aatatttgat ttgacgtttg 19200atatgatatt aaatgtggac
tgttattctt gatgtgcttg agaagctttt ttggggccag 19260ttagaaacta
tattcctttt atggtcctaa ctaggttgtt gttggtgtgt tcccaaataa
19320cagcatggaa aggatactcg agcgctatga aagacactca tatgcagaga
gataactgac 19380tgctccagat cctggatccc atgtaatcca gctaggcaac
tatcttttct aagcatttaa 19440atcgttgaga tttcaatttt aaatgtgttt
taactgataa ttcatgcatt atatgcttag 19500gtaagtttga ctctggaaca
cgcaaaactt aaggctaggc tggacattct ttagaaaaat 19560taaaggtaat
aagatccaga ccaaatataa tttgtataat aaccacctta tgaggaaaat
19620ttaagatcct tgataatttc aggcattaca tgggagaaga acttgatacc
ttgagtctca 19680aggagcttta gaatttagag cattaaattg acagtgctct
taaacacatc aggtcaaaga 19740aggtagtttc acagttgcat tagatcatct
tatggatcaa ttggatcact tgtttgtatt 19800ttagcgttgc tcaacacggt
cgtctaatat agtgtgcaaa acgacctaca gggcaacacc 19860ttttataggg
ctcgaaaata cgaaaaatta aatgtttgtt ttagtcatat tgttcaaacc
19920caagctttat cttgtcaaaa atattttata atgattattt tttagaatac
attatttaca
19980tttttgcaat ttatgcataa tacttctaag gtccaacttt ataattgaaa
tagaagtcct 20040taaattttaa agacgacctt gaggaaacct aatttcttct
catatataat taaatcaatt 20100attctacaag ttagtagaac aaatactaca
ataacaacaa tattgaagcc ctaatctcag 20160taggattgga ttgattgtat
gaagtcttat tagtggccgt taaatgtttc ttgtaggtca 20220agatgacatg
gctcatatag taaggttact tgactaaaag acgaggattt gtttcgactt
20280agattttaac aagtttccct catttgttaa cacctaagcg tactaaatca
aattctaggt 20340tttactcact caaatttccg atttaggaag ggcttgagga
tagttgtatt atcgtaactg 20400actaatcaaa ggagcctctc ttagatcagg
tttcacttgc caattctaac aacttgtttg 20460gtaaaaggaa tttggaatga
aaagaaagga attgaaaaga aacattctac ttttcaatgt 20520ttcattcaaa
aataacattt taagtgatag gaaatggaaa gaagtgaaac gaaagcctct
20580ttacaaaatt atcatttttc tacccccccc cccccccaaa aaaaaaaaaa
aaaataagta 20640gtaagtagta gaagaagaaa taaataacta acaagagtag
taagttttta cgttttcttt 20700ctnnnnnnnn nnnnnnnnnn nnattcaaat
cagaactgaa tagtcataac cggaagatta 20760gtttctctct agcgtgacta
gggtttgagt aaaaagagaa aacttaaatc aaacatggga 20820tattaaggtt
ttttttcctt tcttcagttc ttttctcttc ccaatccttt cctaaaaatg
20880aaccaaacag gctgcaaggt tttcacttgc ttaacacaag atttattttt
aaaaataatt 20940acactccaaa cttttaagct taaaaccaat tttaattcaa
atcagaactg aatagtcata 21000actggaagat tagtttctct ctagcgtgac
tagggtttga gtgaaaaagt ctagggtttc 21060atgtcattct tcttgcttcg
agtcccttct tgggattgtt gttagccatt atggctaccg 21120aaatcgttat
taaatgtcta aatcttagaa ttactgctga agaaaacaac ttggtgtttc
21180tcgaagatgt tgatgataac tcgcagcacc atacgctcgc actggcgatt
gttggaaagg 21240ttctttcgtc aagaccatac aatttcgagg cacttaaatg
aaccttaaac tagatatggg 21300tgatatccaa aggagcccta cttcacccta
ttgaaaacgg actttttgtg gtacaatttg 21360cgacaattaa ggaccgatct
aaggttctag tcagcagacc atggaccttc gatataaacc 21420ttgttctctt
agatgctatt gaagggggta ctcaatcttg acccattgcc cgttttggac
21480tcgcttgtat aaccttccta tggactgccg atatgagaag ttcatcaaaa
actattgttg 21540gtgtattggg ggaggtattg gaagttgatt ttgacaggat
tgtttgggat aaatctgcaa 21600gagtaaaggt gaagattgac attacaaaat
cgttttgtcg tgtgcagatg atcaagacta 21660acaggggtga ggctgtgatg
atcaatgtta agtatgaaag acttcctaca atttgttatg 21720tgtggaattc
tggccatatt gaaagagatt gtgtgaagac ccaggaagaa gagaaacaag
21780tggagagaca atagggggtc ttggaggcct ctccgcgtag gggacgatta
aagatggtga 21840aagagtcgaa agccttcctt cagtgtgctc gtacactcca
ctttaataac aaggaagaag 21900taaggggtga ggaaccacgg gattatgtgg
agccgagggg ttattgtcgg ctatcttagg 21960gggtaaaact ttggtggtcc
aggagatagt ggacggctct aaggatgcca tcgaggaagt 22020tcgtgctgaa
ggtgcaccac tctagccccc ttgtaccctt tgggtaatgc catgctacct
22080tttacttttg ctgttgggag tgctaatcct actccctccc accgaaaagt
taaaattaaa 22140aacaaggcaa gggttcaggg tgttttgaac caagttaatg
ttgtgggtgt tggggggttg 22200gctaataatg ggggttgtga gaaaaggata
ttccccaacc cgatggtgtt agaaaaagaa 22260aaggggttca atgaagaggg
tttaagatag caaaacgaga ggattgtatg taacctatca 22320gtagggaggt
aactattgag gtggaggtgg gcgagaccca accccgcccg acattatgaa
22380tatcctatgt tgcaactgtt ggggattggg caacccccgg gaagttcgga
tgcttcgtag 22440gtggagcaat agtgctacac tgagttcggt ttttatttct
aaaactatga ttagtggtcg 22500tgatgtggaa agggtgcaaa gcgggtaggg
ttttgattgg gcaattgggg tggatagcgt 22560tggaacttca agagtttggt
gcatttattg gaaagctggg gaagtggact ttactctagt 22620ctctctatca
agtcatcata tttgtgggaa tgtgaagctt gttgatggga aggtatgatg
22680cttagtgagt atttatggtt gggcggatac aattcaaaag tataaaacat
gggagcttat 22740gcaatccttt cactcatatc atgggccgat attgtttggt
tgggacttca atgagatttt 22800gacaatcgga gaaattgaag gagggtccga
aactcaatga agtaacatgc ataattttct 22860agaaacttta gatgacatga
agcttaggga ccttggctat tcgggaactt ggtatacata 22920agagagaggc
tttaagccac ggaagagaat gagggagaaa cttgatcatt ttgttgcatc
22980ttcatcatgg tgtgacttct ttccgaaagc tacagttgag cacttgatgc
gctacaaatc 23040ggaccacact cctattttgg ttcgccttgt aggccatcag
tgaagacata agaagaaaaa 23100gacgtagttt tgttttgaga ctgcttgggt
gcttgaggaa ggttgtgagg cccaatgggt 23160gagtcatggg ccgggtttac
tcgcgaggta tttatcgagc gctttaaagc cgtggaaggt 23220gggttcaaag
caaggagtga tgggtctctt agtaatctgg gcccgcgtgt gagggagatt
23280gaggaggcca ttatagatgg gaggcagcga agcagataag gactatgagg
ctctatgaga 23340ctcctctccc acgaaagtta gacgaggtgt tggacaagca
ggagacgttt tggtttttga 23400ggtctcgtgt gagttagata aaggatggtg
atcgtaatac acaatatttc caccacaaag 23460cttcccaaca caaacgtcgc
aactacatag cggggatgta tgataataaa ggggtgtggc 23520aagataacga
agaggatatt gaagggaata tttcagagta ttaccaaacc tcgttcggtt
23580cgtgctcccc ctctaggaag aacgtcgcgg ttgtccttga ggttgtgagc
ccggtgataa 23640ctgatgatat gaatatggcg gttatgaaat cttacactaa
agatgaggtg tgggaagcac 23700taaaccacat gaagcctaac ggaatgcatg
ccatccttta tagaggttct ggaatacctt 23760ggagatgata ttacatctgt
cattttaggt attattcatg gcacccgacc ccagatgttt 23820ttaacaagac
taatattgtg ctcattccta aagtcaaatc cccaaatctt gtttctgagt
23880ttcgcccgat tagcctctgt gatgttatct ataaacttgc ctcaaaagta
cttgctaaca 23940gattaaaaca ggtttgcctg acattgttta tgataaccag
agtgcatttg tgtccggaag 24000ctatattacg aacaatgctt tgatttctct
tgaattattt gactctatga aaaaatgata 24060cagagctagg aaaggttttg
tgtcgatgaa attggatatg agtaaagcct ataaaagagt 24120tgagtggtgt
tttttcagta gtgtgttgga gaagttggat tttgctgaat catgagtgaa
24180tgttgttatg agatgtgtgt cttttgtgca gtactctttt gtggttaatg
ataatatatg 24240tggagctctg acaccctcaa gggggctttg acagggagac
cctatatccc cgtatttgtt 24300tatacttgtt gcagataccg ttttagctct
tcttagcaag gcattcaaca atgcgtggct 24360atacttgata ttctcaacaa
atatgaggca gcatcaggct agaaaataaa tattgacaag 24420tcaggaatct
ctttcaataa aagatttgac gtattttatg gccatgaaac aagttgagaa
24480gcatcagaaa gacttggtat cccaactttg gctaggagtt cgaaaaaagt
catatttgct 24540gacattcaag agcgaatttg gaagaagctg cacggatgga
gagaaaaact tctcgcgggc 24600ttgaaaagaa actctcttaa aagttgtggt
tcaagcaatt ctaacctatt tggtgggcgt 24660ttacagattc ctaaccagta
ttatccaggc cattcatttg gccatggtaa agttttggtg 24720ggggtcgaaa
agggcccaca attcgatgca tctgggggga tatgtgctca ccaaaatgtt
24780taaggagcct tagctttaaa gacttagggg tgttcaatga acctaaacta
aggaggaatg 24840cgtggcattt gattcctgct ggtgagtccc tttcgggtcg
agtgttctcg gccaagtact 24900attcgaagtc aacctttttg gactcatttc
taggtccggt aggtagcttc tcttggaaga 24960gtatttgggg ggccaaggca
ttagttaagg gtgttttatg gtgcgtaggc aatggcagac 25020aaatcaacat
atggcgtgac tcgcgggtgt tgaatggtga tagtaggttc atccccggag
25080agcgcgtttc aggccttgag gatgtttgtg atctaataga ttttgcacaa
tggagtgcga 25140tgtggacctt gtcacgattg cttcaatgaa gatgatgctc
aagccatttt agtcatacct 25200ctaagtaagc gccttctgaa ggacatggtc
tcttgggctt tcactaagga tgaatttttt 25260ttgtaaaaac aacctatatg
gccggttggt cgaggaattt gaatttgttt cacaaagcat 25320ggctgcaaat
ctggggcctt aacgtgtctc cgaaggtctg ccacttcctt tggcgtttat
25380gctcggtacc cttcctgttc gagctctttt aaaacgacgc cacataactg
atgatgattc 25440atgtcctttg tctaaaggag cccggaaagc atatcacacg
cgttgttcta ttgcccatat 25500gtagccgaag catgggagag tgcgggcctc
acaaattgtt tgcctttgtt tgatggggct 25560ggtatgcttg atgcgtgggg
ggagtgggaa acaatcgatg actagtccct tgtaagactt 25620agcttcttgg
cttatcactt gtggtttagg cgaaataaat gtgtttttga aggggtggtg
25680agagcgaatg agagtgttgt ggaatatgcc actaaagcta ctgttgatca
tggtttgtat 25740agtgcccgca tttatggtgg gtcgaaggct accgcatcca
aaagctcgaa ggtatgggtt 25800ccccctccag cttgtcgtac gatctaaagg
ttgatgcatc agtggggaat ggtggatggg 25860tggggctagg agtaatcgcc
tgaaactaga aaggggaggt gctcgtggct gcaactagga 25920gggtcagagc
ttgtggcccg tggaaatggc tgaagggaag gctctttgtc ttgctcttag
25980gcttgcctcg ctcatacaac ttgcaagaag tgatcgtgga gtttgactgt
caatcttggt 26040gaaccatctc tccaagggtg ctatttactt tgcattttta
agtcaaagct tgaaccttga 26100taaaaaaatc ccgttcgaca tgaaaagtgc
cttgattttg cgggtttggg agtgccttat 26160tgattctggg gtttgatttg
taacaccttt agtaaaaaca tgtaagctaa ctgtaaaacg 26220aacattaatc
aaactaggat atgtaaaatt cctaaatcaa gaagaatttc cacttgtgct
26280gaatttgtcc accttgcatg acacccaata aaagcccatg tctcctagaa
ccccttatgc 26340cgccttattc atcttttctc aagttgagtt ggagtcctct
atggtccact cgacttcttt 26400agcacactct cggtaaaaac ttttaatatt
attttatttt agactccacc atcttgacat 26460ttattccttc ttaaacttgc
ttcacacaaa catctaacac tagaattcta tatagaatag 26520cttgaatctc
tcttaggata accttatagt aaatgcaact acgcctatcc ttaaaccttt
26580ctaagaggag ctttatcgta tttacattcg cttcactttg aaacgtcgct
aagtgtatgt 26640tgcactttcc aaaccatgtg ttagctaaga ccaagttata
tgactgcata acctaaatag 26700tcttcctcga gaaaattcac tagttggatc
ggaagagttt gtgtaaatct atggcggcgc 26760gggactgggc cttcacgatt
tcaagtgttt taatgcagct cttctaggta aacaagtttc 26820ctaatagcgt
ggtgactcaa atattgagga cttgttgtta tactaatgct attcctggcg
26880gcacttaagg ggtgaaatag gtggtacaaa ggagtgcttg atggcgtgtg
ggtagtagtt 26940gaatatatca gtatgatcaa gtccatggat ccctcgtact
tattcgtgca agattatttt 27000tccacgaggc aaagcgagcg agaatcttaa
ggtttatgat catattcatc ttgtacgtgc 27060taagggtaat gtccctttca
ataatgagct atttctcctt ttgagcaaga gcgtatctta 27120agcattcctc
gtagttctcg tctccccaac gatgttttat gttggaatct gaatttggag
27180aaagacggag acttttcgtt cggtctatcg agccattctt ttgagttgga
tggcgagagc 27240gtgatttcat cgtcaatacg ctctaattta tggagtataa
tatggcagga tagtaccttt 27300caacgtgtta agcttttatc tgcattgtcg
acacaaaggg gattgagtaa gcctgtgccg 27360agtatggaac cattgtgtaa
tctttatgcg ttggaggatc aatggagcta cacttcttac 27420gagactttgt
tattggaagg gcttatatgg gatccaacta gggtagtcaa aacattggtc
27480ggggctgcgc tgcaattttg gggacttggc accggcgttt ttggagtagc
tccctcatgt 27540ggaacatagg cttttgatga cgatatacta ggctagatgg
aatataagag agaggtgttt 27600gtttgaggag gaggtttgtg atccctatca
aaccacatgc ataatctcat ggcgtagctt 27660catgtggaac actgacatat
tgcatgtgta gggttaggtg cttgcctagg acaccacatg 27720ggaagagctc
cgttaagctt aatgtggatg gagggtgtgt ggaagggttg ggtgcgtcca
27780ctggagtggt gattaggggg atggatgaaa aagcgttgta gttgcaacat
agaaaggtgg 27840aggactgcga ggaaccgtta aaaaggctat attttatggt
gttcatttgg ttgtggaagt 27900cgatttttga aatatggttg ttggaagtga
ctatcttcac ctcgttgaag caacttcttc 27960aaaagtggaa ggcaaaaata
gcttccatgt tattgttgat gacattgttc atggtagtgg 28020tatgttaaat
acttcgtctt gtagttttgt tcgtagggat gggaataggg tttctcacga
28080actcccccat ggaaatatca taatggctca tgctcatagg taaatgaata
agctaacaaa 28140attgttcatc tttgcagaac taactcatgc atgaatcgat
ttcttagctt tagtgaaagg 28200tagacagctc tagagagagc atctagtctc
aaaaatacca tgagattctg tagtgtcctg 28260acatgtttta tatgacagga
caaagcgtta aaggagcaca acaacttgct atccaagaag 28320gtataagttc
agcaagattg tttagtaaca ttgttaatct tgctgattgc tttgaaacat
28380gtcttgctat ggttaacaat gttgactgaa ccaaaatagg tgaaggagag
ggagaaggtg 28440ctggcttagt aggcagaatt ggattagtaa aatcatgaca
ataactcatc tggctttgtg 28500atgtcttaag ctttgccctc actgaataca
gggtcagtcc tcaataacct ctaatcattt 28560ttccaagatc caaagtaaac
atggtttcat aatttaatta agattttttt gaaccatgtc 28620tccatacaac
cttactagga ctaatactac taatttaaga ccccaacgat aaacaacaat
28680aattagccat atctggctag caccttttgg acaacacacc acatgagact
cttggccaac 28740ttctttgatt tccttcagtc tgatagatat gaatatcttc
tgaagagctc tttggttcat 28800aattattgat ttagaaaaga attcagcaag
gtgagtcatt tggtaacctt aaggtcatta 28860tgggggtact aaatcaaagt
gaagatatat ttaggtggca tcagaagaga tgatatagat 28920aggttgtatc
ctgtcgatag gttatttgga tatgtatcaa aagtttcttt tataatatat
28980ctatactgat tggttgatgt atcaaatatc cctacagatt gtgaaaaaat
cccctacaga 29040ttgtgaaaat atccctagaa cctgtgatga tataagatgt
gctccgcatg ctttattgaa 29100cataatgtat tcaattcttg aaatgcagag
gaacaagcag cagtgcagtg gaagatgaag 29160caacataacc accaaatcta
aacagcaact ctgcataaat accgtcctgg atgctttaac 29220acatctaaga gcagtaa
29237

39 21668 DNA Artificial Sequence
BvAP1 genomic DNA showing mutagenesis site at position 6999
mutation (6999)..(6999) c to t
misc_feature (9833)..(11385) n is a, c, g, or t
misc_feature (12467)..(12605) n is a, c, g, or t
misc_feature (13741)..(14088) n is a, c, g, or t
misc_feature (16396)..(17194) n is a, c, g, or t
39
atggggagag
gaagagtgca gctgaagagg atagagaata agatcaacag acaagtaact 60ttttcaaaga
gaagaagtgg acttgtgaag aaagctcatg aaatttctgt tctttgtgat
120gctgaggttg ctctgatcat tttttctcac cgaggaaaac tctttgagta
ttcttctgat 180tcttcgtaag tatatatata tatatattaa tagtaactac
ttgttttctg ctttctattt 240ttaggtctga tgcatattta atttaggtaa
tattaattcc ttatatctga tccttaattt 300ttttttcttt taccatttca
tttttgtttg ttttgaataa aagaaaattt ccccttcacg 360tgtgtcgaat
aggtcaaaat ttttacttga aggatgttct ctttgattac taaaatagga
420tccaacaatc acctgaaata aaggaagaag atggtgcaaa gtttttactg
tcatacttag 480tatttgataa atattctatg atgaacttgt ataaattagg
aaatagacct aactttcatg 540cacgaaaaca ttattccttc attcaatttt
tttattactt aaggatttac ttttttattg 600atcatatgaa gtagtagtac
ttgtaatcat tcaatttttt tgttggttaa ataggactac 660attttaaaac
aacccaattt taaaattttt tgtgtgaatt tcttcccttt ttaaaaataa
720agtctattat catagcttag agtagctgtg gcaaagctag acgaaataat
acagaaatct 780ggaaaggaaa ttgtactact tacatgaaca cacttattta
ttacttgcat gatatctgcg 840aaaaagttta tagcaaattt ggttaatata
tagcgtagta ctttggatat taatattact 900agtgtacaaa tacttgatcc
aatgggtaat gaaacttatg gaagatttga ccatacatga 960tgatgctaaa
tattaattgt tattgtccag ctttgttttc cctccatcca ttggcatctt
1020catctttaca ttgctactcc actcacttgt caattgtttc gtcctttatg
ttctttattc 1080acatgtgcac catacttcaa tactttcccc ttctttatcc
tcagtttttt tttcttgtca 1140ttttagggtt aatatccaat gaaatctagt
ttgctcgttt tagatctaat tttaattcga 1200tcacaaccat ccatattttt
gtttcttagc ttgacatcta ttctatggat ctgggatctt 1260cggtgtatag
atgttctcgg ttttcagatc aagatcctat tcatagaccc atttattgta
1320aacacttaaa tgtgttctta aaaagttagt ggctcgccaa gtcaactcaa
taacataacc 1380cccacgactt cattacatta cacaatgaaa gattagatgt
atgagtttgt gaagcttata 1440attctatttc aagtaggact aggatgtttt
gtgcaatcag cagctagtag tctttttaat 1500ttaagtcagt cttcattgtg
catcatatat ttttagaaat atatgcaagt ttgaaaccat 1560ttagaacctc
atgacccgcc tgactcacta taaaccggca agagcttaat ttttcacagc
1620tttgtatctt tatgagtagc gctagctagg ggtatgggca tagaaaaaaa
gggtttgggt 1680tagggtctta caagatctta tccgctattt ttatttcata
atctttcaaa atacatgttt 1740aataattcaa aatacatgtt taatactatc
tccatttcac aacatatgca ccaattgcct 1800agctatggtc caacctagtt
ggtttgtagc ttgcattgga tggttaggat gtattggagt 1860tgtttatgtg
caatcaaatt ttaattacgt atcaaaaaaa aaaaaaaaaa aacatatgca
1920ccaatttcca tttggacaca cttattgacc aatttttgac aatatttttc
tcaccatttt 1980gtaagaaaaa tcaaaatcaa gtggaatttt gttaagttta
tctcagtcaa aagattccat 2040acatcgacat tttataattt ttaatcatac
gcaattagaa atatcaatgt ctaaagaagc 2100gtgttggaat acgtgaaaaa
gcaaatgata catgaaacag atgtagtata tagaaaactt 2160aattttgtgt
cactcggatg tatgtgggcg gagccttcct agaaggcgta cccaccttag
2220tggctctgaa tctttgacga cccgttcggt tggtggtgat aatagatggt
aatagtaatg 2280taatttagtc taaatttata aataaatatt aatatcatta
cccatggtaa tacaagttct 2340tcacaaaaca tgtttcattt aaaaattatc
attactacct tttcaagtgg tattggatga 2400taataaaatt ttaggcaggg
aaatgggtat tgggatgaac attaccatgg gtaatgacat 2460gcaatttttg
ttacaagaat acagtataat acattactat tgccaccatg tataaccatt
2520aatcaaatgg accgtgagga tatgatgttg aagaagaagt cttaacctct
acgctattat 2580ttactagggt ctgtaaattt tcctttttta attataattc
ttgtgaaatc ttcttcactg 2640atggtactag cttattagga tgggtttctt
tagtatattg aaggctcttg ttgacagagt 2700ataaaaatat ttttggggtc
gcaaccatca atttaaactt ttgtttgatt ataaaattat 2760tttttgaaca
tcaacaatct acttaaattt ttggttgagt tagttctttg acatggtatc
2820acaaccatca tgacataaag gtctcatatt caaatctcat tcacctctca
tttccaagta 2880gaatatttac ctcaggtatg ggtatgaggg aggcttgtgt
tgcatgagtc aataacggat 2940cttgaccaat aatttaacag gggcgagttg
ataaattaag ttttaatgta aaattttaaa 3000tgatggataa aaacactaat
acacaccaaa atataaatat acttttatta atggttacaa 3060agagcttgta
gctaatgtaa taaatcaaaa tcccaaaggt gcaattttta agaaattatt
3120tccatttatt tatttgacca ttatgaaatc ttcaagaaat tgagtaagtt
tttaagaaat 3180ttaaggtata gttcattaac taaataaact actccagtaa
aaaaaaatta ccaaactgct 3240ttcttaagta aaaaaataaa taaatttata
ttttatgatt gttaggaatg agtgtgagga 3300aataaaaagt actattatag
ttaaataaaa atgaaagttc ttcagagaag aagaatagaa 3360gatagtacaa
tcaatgttaa atatttttct aaattagaca aattgatata aaccaaaaat
3420aaaggggaag aagaaagaaa taagtaaaaa aagaaagaag gaaaagaaaa
aaagaaaaaa 3480gagaagcaag tgaagaaaaa caaagaagtc caaatgtgtg
ttgatgcaag gttcgagctt 3540gcaacattaa gggctcaaac tttcttttac
actttggttc actgccaccg tgcccacagc 3600ttgttatgtg acatgaagtg
tagtttgctt aatttatctt atacagttat gggggaccaa 3660gcctccaccc
gccccttcta taatctgtca gtggttgcat ccacacttta agtccaatag
3720actcttgtct gagaggaggt gatagagtat ataaatattt ttggggcctc
aaccattagc 3780tcaatctttt gattaagttg gttctgtgac acttgtacta
tatactagtt atatatatac 3840tgtaaaacta gtaccacgag aacagtcctt
aatacaaaca acatgccctt aatagaattt 3900tcttagtata cacttaatat
aggttgacta gctttttgcc cttcagtatg cacacacctt 3960ttataatctg
tatcgttgtc tggtagatga taataaacct cagtattggc aatatatgaa
4020atgacataat ggccatgttt ggtgattaga gtttagagtt tagaggttac
agttcagagt 4080ttgtggttag atgattactt ttttgttcag aggatttgac
tgctgattta aataattgtt 4140gtgtaaaggt gtttggtaac acttagctta
ttgtttagag ttttgtactt tttagagcat 4200gtaaaatgac atttatggac
atatgtattt ttttaaaaca aattttagta gtaattatat 4260ggacaaaata
gtcatttgtt ttttctctct ccaaaactct catgaaaaag ctcctctacc
4320cagctttttc aaaagagagt tttgatcaga gttttcggta caaaactctc
tttagtcctc 4380tctctcacca aacacccaaa ttagagtttt tattggtcaa
aactctaaac tctctccaaa 4440cctctaaact ctctctaaaa ctctctcccc
caaacacccc caatttctta gaaaaatttg 4500ttgctccttt ttattgcact
atatttctat ctccaaacat aaagtttctt ttacaaattt 4560tcatttctac
tccataccac ctttatatgg caatataatt tctatgaatt aaaatgttca
4620caagttttga ggtggatttc aagagcatgg acaatatgat catgagactc
tccatacaaa 4680aattaccctt aaattttata atcatacacc aagcggtcgt
taaagtattg gaagtgcttg 4740agtagtttgt gaaaattaac atataataaa
gtgcagatct cccctctagt aagtagtaag 4800aagtagtaag acgatgtccc
tcatttgaga aagagaaaaa cccttatcag tttctcttgt 4860ttctttgact
gaacgcaagt caaatagaag tatgtaacta ggaaatcctt ggagaaatag
4920attttcttta aaactataaa agtataccta tatatatggt aacccacaaa
aatgtatata 4980atctgatcaa tatctaaaca aagtattctt atgttttctt
tcatcttgct tatttcctcc 5040ctttcctttt cttactttaa tttgtttact
ctctttaact tatttctttg cgtatctcat 5100ttcactttac aaggatatat
agttgattat gacagcttaa taaatatatt ttggaactag 5160gatttattgg
ttgtcgttgt tattttaatt tctacactga tcggctagag tttctagaac
5220atagggcttt attgaaacca ttagttaaca aaattgaatg acaatgattc
aatatgatag 5280aatatgtatg tattagttaa tgtttgatta ttgtttgtat
gtatataatc aaagattatt 5340tagtaatact tctatataca tattctatta
gaatcactta gaaagaccca ttgaacaata 5400ataaggatag gcagacaagc
aaacaaaaga aaaataaacc tgttactcct tccatttctt 5460aatgttctac
tcggaattat agatacacac tttgacacaa attagaaaga gagtgtaaaa
5520agtggatcca tattaatatt tttatttttt taaatgagga gagaagtgtg
ggtttattat 5580gtttcaaggg agatagagag cattgaatag tgagagaata
tgtgccaaag ataattaaat 5640cattgtaaat cttttgccaa ataaagaata
aagcatgtga gtaaaacttt aaaaaatggg 5700cgaaaaagga aagttgagta
gaactttaag agacggaagg aatatagaag aggacgtgac 5760agatgggagg
aagatcagac atcttagaag gggaatagtt aaatttgaga tagtctttta
5820attaaggttc tcactaaaga agatataaca gtaggggaaa gctaaaggtt
attcaaaact 5880ttccttccca tcttcatcac ttcatgtctt tactttagag
ctcttaacac ttagcctatg 5940aaattctgaa ctctttgtaa gattagtgat
agataaaaga atcttatcaa tttaatttat 6000aaatacaaca ggattcaata
aaaagatata gagatctata aataaagagc catactgttg 6060tgaactttta
tatctatcaa aacctttgca cattagacgt ggtataacta aatcaggctt
6120atcgaaattt tttaaaattg ttttcattat agccccttta tatttagaag
ttctaagatg 6180attgcataga tagttgatgc accgttctgg tcgacttttt
taaacacttc tttttgataa 6240attttttttt ttgtattcga atcattattt
taggtgtata aagagctgca aatgatctag 6300atgagattga tctcggtttc
atttatatgc taatagtgtg ttagatacac actattaaaa 6360aagtcatatg
acttagagat tattatggaa aagggatagt gcaccgatat taatataatg
6420gaaaatgaca cacgagttgt ccataataac atgtgaaaag tgaactattt
aaaaggtttt 6480tctgacctag tacatacaag gtgcgtaggt ttagctattt
tagtttttta gttttatttt 6540ttaaagtgaa gttagttatt gatctgaaat
catataacat gtacgtaccg tagatataaa 6600aaactaccaa gtatatatca
atttgaaata aacattattt taatatggca aaatcacaat 6660tgttgactag
acctaacact gaagaaaact atgtcatgtt tatcaattat gttgcataca
6720gttaaaaaca aatatgttag agaaatcgtt atttgaaata gaaaagttgc
gcaaaatagt 6780gattaacatc aaaatatgtt cagaaagttt ttataaatat
gtgatcttgc attgtctgtt 6840gactgtcgag gtttatgata atttcccctt
tttccaatgc aaaacttgtt gtgctatttt 6900ctaatgatat attttttcaa
agtatggaga agatcctaga aaggtatgag aggtattctt 6960acgcagaaag
acggctagct tcaaatgatc cagactcata ggtagtgcat ttatgtaaat
7020atagatatac tcttcatgcc caagaagcct gaatttttta tcccactacg
tactgcaaag 7080ccaagtttaa ttgaataatt gtcctgttta aattatttag
ttttcagtac aataatgtaa 7140tcattagttt gcatgtttaa aaaagaaaag
cacaagttct gatcaagtga aatataaatt 7200gtaacgaaag agccaagcta
gacaattacc tagctaggag ttatttgtta tcgtttttgt 7260ttttaatttc
tagttttttt ttttaaacta gaaaatatag tttcaatctt ttgttatcag
7320ttttcaaaat gacatattta acataaatat gattgatttt aaattcattt
attatatcat 7380atttcatttc aaaataagtg aaacacttgt ctcaaaaagc
tcactctcac ataaatgata 7440aaagtgtttc acttatttta aaacggaaga
attatgactt ttacttttca taaaacgaaa 7500aactgaaata tgacaataat
ctcaaatagc ttggagaaac cagatttcta tatatttccg 7560tgatgaaatc
acttttcatt atacgtaggt aaactggacc tttgacttcg caaaactgaa
7620ggcgaagctt gaacttctac aaaggaatca taggtatgat ggcaatatgt
cataattttt 7680ctattattat ttttgcttcc aaaaccagac catatgtttg
tatatttata tagtgatata 7740ctccatccgt ttcattttaa tctatacatt
tacacttatc aggtatgtca atgcaaaatt 7800ttgaggatat atatctttag
ttttgtattt ataaaaatta taaaaagtac atattaataa 7860aatacatatt
atgatgaatc taacaagatc ccacatgacg atatttccgt ccgcgtatga
7920ataacaaata atggccaaag tgaaatttgt gaatagtgta aaatatcaaa
gtgtaacaat 7980taaaataaaa cggagggagt agtacttgtt tgtcacatac
ttacttattt ttgttctctc 8040cacaatgaaa ctgttctttc taataattaa
aaaaagtgca tatgttgatg atttctctgt 8100cactttaagt ggatattgaa
tagtgataat ggattacttt gtgtataatt gcatttcaca 8160tttgggtcta
attttatacc cttttcgcat atcatgcttt gtgaatagta catatgatgt
8220tcaagaatgt gagaagacat atcatacttt tgatatacct caaacatggg
tgtatactgt 8280atagtgaacg aaagtgttag tgtaatttta tttaggaggt
ttagtggttt gtcctatata 8340taatgctagt agttatacac catagttgtt
gatgagcatc aactggcttt cctaacattt 8400ttttctccat aactttaccc
ttaccttaca ttagattact ctaggattac attctaccta 8460aaatattatt
actcccatca ttttaggtaa atatttttac tttgattttt cgattatttt
8520caagagttta aaataatgat taaaatttac catgatcagg cactacttag
gacaagagct 8580tgactcactt aacatgaagg aacttcagag tttagagcaa
caacttgata ctgctctcaa 8640aaatgttcga tctaggaagg taagaaattt
tacttgtcta ccgtagtttc ataataaatt 8700agtatttggg ctcgggcttt
gccccagatt ggtattgtct tttcaaattt gatatgcatt 8760tttttccatt
tccactaaaa tatattaaga aaattcaaca tttaaaggat acaaatataa
8820taatgtggat acttaaagta tgattaaaat ttggttgaga tggtaattgt
gtcatgtata 8880atagcaagaa gtcacaagtt caaagctcgt tgcaagctaa
atttattttt gttgattgac 8940atgacttatc aacacactgg acaattctaa
tcatctagtg gagtagcata tactagcaat 9000ttatgcacgt gatgtgtgcc
ttactttttt agaatataat ttataacttt tttgagcata 9060aacaaaggta
aaatttgaac attagacata tttttttggg ctagctaaat ttgttgttta
9120aacctatatc acttaaccaa actcctcttt tattatttat tgatttatat
tttatttaaa 9180atttttaaaa ttaaaatgat gagcaataaa agaatgttaa
gtagatttat taagtatttc 9240ttatattttt atcaacaaag tattttgtgt
taattaaatt atttcacttt gttaattgat 9300tgtattttcc tttttaattt
attacttgat tgtgtattga ttgatcaaac ataatttttt 9360tgttaatttt
tttatgctat atttgaattt atttttcttt catctgtttt tggtagagta
9420gttgatttac taaagggtaa ttaaataaat ttattggggg acaccatagc
tcccccctcc 9480cttatataat agagatttgt atagatttat tgtcttcctc
aattattgat taactagtct 9540tctatgcacg cgatgcgtgt gttgattgtt
tgggtctatt cttaatataa atttcatcaa 9600aatataatta tagtagtgtg
atttacaatt attgctatac aaactactgt aatttataaa 9660gttgttagaa
attgagataa aaatttagat gtgaaatttt gtggtcaaat tatatttgta
9720attttttaaa ctgagtaacc gtttttctca tcatgtcaag ttactttgtt
aatgcttatt 9780taatttatta ttggaatttt tgacccatct ttaaattaga
aaaggatata atnnnnnnnn 9840nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 9900nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 9960nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
10020nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 10080nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn 10140nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 10200nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 10260nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
10320nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 10380nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn 10440nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 10500nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 10560nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
10620nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 10680nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn 10740nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 10800nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 10860nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
10920nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 10980nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn 11040nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 11100nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 11160nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
11220nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 11280nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn 11340nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnntatat atatatttag 11400ctaagaaaaa aaagacattt
cattggggga taccataact cccccttagc ttatataata 11460gagatactat
acttcttttc tgatagtgtc aaatttaatt ataaatcttt aacattggcc
11520aattaataat tggacaagaa aaaaatgaga caataataaa taaggcgatc
ttcacagacg 11580tattaacatg atggtaatta aaaatgttaa tcatagatct
ttgtgttatc ttaataatat 11640aaatttacta attagaatgt atcacataaa
gtaagtatta atagcagcat aggataattc 11700ttataatgga gattttatat
ttttttatat aattatatga tttattgttg aaaatattag 11760ttgattttaa
ctggttgttt attcaatgac agaaccaact gatgcacgag tccatttctg
11820aactccagaa gaaggtaata actccatttt ttactctcaa aggtttattg
tttttaactt 11880atttcttcta accttttata tatgagaagg tattgggtta
gacgcgtctg accataatat 11940taggtcggat gactttcagt tggtttcaat
tttatttcag ttggtttcaa tttttgtcca 12000gttggtttca atttttgttc
agttggtttc aatttttttt agctggtttc aatttttgtt 12060cagttggttt
caatattttt tagttgatct tttttatttc agttggatgt cttttaagtt
12120cagttactta tcttattgtt tcatttacgt gttttattgt aactgaaaac
aaaacttaag 12180taaatgaaat aaaataagtt ctaaataaaa gcaacttagg
gcctgttctc cccagcttat 12240tttcagttca gttcaattca gttcagttca
attcaattca tttcagttca gttcagatca 12300gatcagttca gttcagatca
gatcagttct tgacaatact tttactctca catatcacta 12360ttcatttcag
ttcagttcaa ttcaattcag ttcagttcaa ttcagtttag ttcagttcag
12420ttcagttcaa ttcagttgtt ttatgccgaa gagaacaggc ccttagnnnn
nnnnnnnnnn 12480nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn 12540nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 12600nnnnnattca tttcagttca
gttcaattca attcagttca gttcaattca gtttagttca 12660gttcagttca
gttcaattca gttgttttat gccgaagaga acaggccctt agttttcagt
12720tacttatatt atcgtttcag ttagttttct tattctttca tttaacaact
aactaaaata 12780aaaaaaaaaa aactaactga aagcaaaact taattaaatg
caaaaaatta agttctaaat 12840gaaaccacat acgatcgaaa tttcaatcat
ttcaaacatt atggtgtttt cgattctttc 12900aaagaaggca agctgctccc
gctattctac cctctttaga tcacaataaa gctcaggcct 12960cacattcaaa
gtttcctcaa agatggacgt tccaagtatc acatagacac atagtcctct
13020tctccaaacg ctctccttcc tatcttgatg tcattagcaa acttcttgat
ccagacggcg 13080ccaacaaccg caccatgatc tccctctaaa gtactgacgg
cccgtttggt tgttggtcat 13140aaatgatggt aatgggaatg aagttgtgtg
taaatttgtg aaaaatatca ttgtccattc 13200ccatggtaat gctaatttat
cttaatgtgt ccactttcct tctagaattt tcattctcat 13260ccaataccac
cttgtaaggt ggtaatgagt ggtaatgaaa attgcttccc cttggagaca
13320aaaatacaag tttaggagtg agattgattg ctcatggaga aaaaaagtct
ccccatggag 13380atattaaggg tgattcccta ataaaattac acttaaaatt
tattcccatt accgcaattt 13440attaacatct accaaacggg ccgtgaaagt
cttgaaacac atagtcgagt gagtagcttt 13500gaggaaccat ctgtaaaaga
acctgaggga gccaatgtgt gcgtaagtac caacggcgtg 13560ttgtcagtgg
aaaaggtggt gccgtggtgg cactcagtag tgatggagcc gccgtggtgt
13620ttgagtgttg ccaaatacaa aggcggaatt tcgtaatctc taatttcttc
tgtgaaattt 13680ttgggatcag cctgtccgac caaacacatt ggatcaaacg
gtctgaccca atagtttcaa 13740nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 13800nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 13860nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
13920nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 13980nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn 14040nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnga tataaataga 14100gatggaggac aagggtcttg
gttttgtcat gttgtcaaag agttgaacaa tggttttttc 14160gtattgttaa
aaaattaaaa aaccagaggc cttcaatctc ttgatagata tagatagaaa
14220aggaggacaa ggccgttctt ggttttgtta tattgtcaaa gagttgaaca
atgatttttt 14280cgtgttgtca aaaaattaaa caatgaaaaa agatggcggg
tgcttgatct aatagatcgg 14340accatggatt gaaggtcttt taacttattt
tatatatata ttgaacttat ctgaagatta 14400tttaactctt tgaaattgta
ttaacttccg aactttatga acttttttaa ttcttcaaaa 14460cttatctaca
ttttatttga aaaaatattg aagacaaaaa aaccctcagt tggtttaaag
14520ctgcggtaag atagagtgta aatgttattt ttttttatta aatcaagaaa
taaaaagaaa 14580tattaaataa aaagaattaa aaatggaaat gatgacagaa
acttatggct tggaggagca 14640atacttttaa gatagaccta aaccttaaat
aagttaaaat ggaagtaatt tttcagtaga 14700atcttattcc aatctatact
ccgtgtttac tccatgtaat gcacatataa taaaaaaatt 14760agaaattaca
tagtataagg tttgatcctg tgactgtaag tttatatact aacttcttaa
14820ccactagagc aagtgatatt tagtgttatc attttaaagt ataattttaa
caaatgaaat 14880ttttttctta cccggaacat agctcggacc taataactag
ttgaacaatt ataatctgta 14940acttaaaatg atcctaatta ctgtactttc
attacctata ataatagaat cttactatca 15000ttggttcaga aaaaaaaaat
cttattaaat gttaaccatt tatttgtaat tgaaacatac 15060atgcacataa
atgtaacttt tagtttatct taacttaaaa actgagaaaa tgttagttgg
15120aaacttttgt atatatgttt ggataaacga cgctcaaaag taggggctaa
aattttagta 15180gataatataa gattatactc catctgttct agatagactt
ctcattttta attttggcag 15240tattcataaa taaaggaaat ctttcaaaaa
aatttccaat atataagaaa aaaaataatc 15300atgtgcggtt ttgtttgatt
cgtctcattg tgtacattag gaaaattaaa cttatataat 15360ttttactact
atgtaattaa agatattaac gatacaaaat gtgtattgac aaacttatat
15420tggagtaata ggaagtctat taagggaccg aagaaatatt acgtaaataa
atctaataca 15480aactaatata aattctactc cagacaataa agattctgtc
ttatattgcc aagatatagt 15540agctatttat tttatcttaa caaacataaa
tgtttctaat gcttaaacat ggacatgtat 15600tattttgtaa aatattatgt
attatccaaa gttacatatt taaaggaagt tctattgctt 15660gctctctttt
agcactgccc aaaaaggtta aagtaatttt ttttctctgt ttaaaaaaaa
15720aatgcattat atacagataa tttttgctag tcaataaagc tatccttatg
acttatgagt 15780gctacttgac tagggatgtg ttgtactcaa ttggaggtat
acatacacca agattataga 15840gcttttattt tgcctataaa aaatggaagc
cggataggat accaaaaaag ctttgactta 15900aatttgtaat gcataaaaat
gatgatacct aacttattag ccatacttat ctaagcgtac 15960gtcaatttaa
atattgtgtt attgattaat aatgatcctt atatatccat attttgacaa
16020ttaaacggta aattagagag aaaagtttga gaaaataatt atagcttacg
taatgctata 16080atccaaagtg tctccgcaca agcgtgggac aaaatagtac
tttcggagaa gttacaatca 16140acagctaggg agtcttcatt gttcttgaat
agaaggatgg aaacaaagtt caccttcttt 16200tattaaagta ttaaggtttg
ttattagctc aatatccaat actttctctg ctttttatta 16260cttcgtctgt
ttcaaattaa atgatttttt ttttatttta cactattttt aaatttcact
16320tttaccatca tttatgattt atatgtgaat gaaaacatag ttacgtgtga
tcttgttttt 16380tttttttttt tttgtnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn 16440nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 16500nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 16560nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
16620nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 16680nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn 16740nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 16800nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 16860nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
16920nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 16980nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn 17040nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 17100nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 17160nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnngtacat caaggaaatt tgcattaatg
17220aaaaacggga gtacaaccta atggtaatac aaccaaacta aacagaaaga
agaaacaaca 17280gacagtaaga aaacctctat aacgcgtaaa aacaatttaa
cataaacact aactagaaaa 17340agtccaggcc gaataacatt tgtcttgtgc
gatggggagg gaagatgaag gagagtgaaa 17400atctgttgaa agacacaact
ctgacgatct tcatagagag attgtcgtac aactgctagt 17460agatcctgtc
cactaggcac gaaatcccaa ggtccttgat gagcctcctg ggattctcct
17520gattcgttga acaaagaagt cactttcgag agtaacttcg gaccctgctg
cgagggtctt 17580cgaaaatcag tcacattaga ctgaaacccg aaaaacaagg
tggaggtctt gacgtgcccc 17640aaaacaagtt gggggcgtcc cacaagatgc
gttttttagg tgtgatgaca catgatgtca 17700tcacgagaaa ttggggcgag
ttagtttgat gaactacgcc ccactgacgg atcctaagat 17760ccaaagtgac
aatctcgaaa ccagaagacc gacaaacaac agatctgaaa catacaaaca
17820tgaaaataat gaaagcataa actgccaccc gacatataga gctccggcaa
acaacaccat 17880caagaacttg caccaaagac ttccttggca tactaaagac
actgattcca actaacacta 17940gcgggggacg gggaagggac actcgactac
acctaaacct aaccagggga cggggaaggg 18000gaacttagac taaaccttcg
taaaaagggg gggatcgggg aggggaaaac cttgaccaag 18060gaagctggtt
ttaaaaacca cttagccgag ccaaaaaccg tgggtgggaa gaagaaacag
18120accacaaaca gggggaaccg ggggatggga actcaccgaa caggggaggg
ggagaaatcg 18180cacagactcg gggaacgcct aaggactggg ggacgaccaa
cgaacgaaag gttggggtgg 18240tgcgaaaaca agggaggggg acgcaccgac
gaacaaaaaa accgacgaag aggccgaaaa 18300agcgaaaggc cgacggagat
aagattgaaa ggcgacgaaa aaagaaaaag gaacaaaacg 18360aaagaaaaac
gaactcgtcg gagacccgcc ggagacctac gcggcgccgg atctccggcg
18420agttctaggg ttagagggtt tgttgtgttt gtttagggag aaggcagagg
tttttttttt 18480ttacgtgtga tcttgttaga tttgtcttaa catgtattct
ttaatatact ttttttttta 18540taatttttgc aaatgcaaaa ttagagatat
atgtcctcta aattttacat tcacatacgt 18600gataaataag agtgctacaa
ctaatttgaa acggataaag tatttgaatt gtttttcatt 18660taaaaaagtt
cgctatcatt tataatgtta tatatttgcc aatatgttat ctctttctct
18720ctcttaccag agtttagatc cagtagagtt agtaaataat tctaccacgt
agagttgaac 18780aaatcatagc cattgatttt caaatcattg gtttatatat
tctttcccaa aactcccccc 18840tattttcccc aaaaatcctc cccctcctta
tctctttcca taaaatctga gtcgttgatt 18900ttaaaatata aggtttggat
tcaactccac tatgtagagt tttcatcaaa ctccaccgaa 18960tccgagcccc
tcctaccata gtacttcttg atttccccat atttctttcc tcatcttggt
19020cctcaagcac attttaatat tatgggtatt aaacaataga gaaagtattt
acttatagag 19080aaagtatttt caatgattcc ctaaattttt ttttgaaaga
aagaaaaggg atttcattaa 19140tatttcgcca aacggcactt acaagtcatt
tctgaaaaac ataaaattct aaaagaaata 19200catatcaccc tagaaatgta
aacatcgcag atttgactta attttgcctt aataaaaatc 19260ttcatctgaa
gcaatgcaat ctgtgagttc gctctggttc ggcatacgat ctgcagatgc
19320ggaaattttg ggatgaacgt actccaatag tcttccataa ttttacagaa
gttgtgaaac 19380cctaattctt catgttgaat ctcgaacttc aaccaatgag
aataatttct catacctaaa 19440aacaaaagaa ccatactcac aactcccata
ggggagaagg agatttccaa aacagaaact 19500aaaaacccca taaaagggtt
tgagaaaatc tcataaagag atactaattt attgaacaaa 19560acaagaaaat
gaactaaaaa ctgaaaataa aagggaaaaa ggggcttacc atggatgaaa
19620acatccatgg cagcccccta attgatgaag aaggggtaag ggaggctagg
gttttagaga 19680gagaaaagga gaggggaggc taggttttaa aaaaaaatat
aatgattccc taaatttact 19740tatatatatt taccaagatg acgtgatgtt
ttacaaggcc catgattttt acgcgatcat 19800gaaaaacaca gccaatttga
atggagcaaa tatctacgcg tcattttaga tatttttgta 19860tgggaaagtt
ttttttgacc aatgtaatta ttaagaagca tcggccaccg ggtagataag
19920atgtcactat acatcctttt ccaaacttaa gtatgcctgt tgaacttttg
ttgcgtttgc 19980agattcattt gaaattatat ttcctcagat cctctacttg
taaaagaatg ttccattatt 20040ttcttagttt acatgatatt tacaatagta
tttgtctaca ttttgttcat attacttagt 20100gatcagtgta tacgtcatat
attagtttga actttgaaga catttatttt ctatatactt 20160cctttgtctg
ctaaattact ttggaaagct ttgttttttt tattaatata agaccctttg
20220gagtttggaa atcactatct aatgaaatat ataattcatc attagaacaa
aaatacaaat 20280atcgtactat cacctatcat gttccttttg gatttcgctt
cacaaaaata cattttaaaa 20340aaaaataaaa taacaaatgg tagctaacaa
cttattactt ttaaaagttt gtgtgcaccc 20400taataagtac tcaaagtagt
atgtaacaga gagagtataa tgctaaaata caaactaaat 20460aaacaagaaa
gtgtttctca
acaataattt gctgcaggaa ttaggaaaca aagtaaataa 20520attgcatgtt
tatcatcaat acaatttact ggtagttaat tacaaacttc actcatgata
20580attgaaagag gccactcaat ttcagctagg agttgtttat ttatttattt
ttctttcagt 20640taaattttga ctacccacaa aatcttcatc tggacctaat
ctgcaatttg tggattttgg 20700atgaaatttc taacctattt aagtagtctt
attgtttaaa taacccatgc aattaaatta 20760ggttatatgg gggtgattca
tttaccaggc ccaagatttt atctcattct caattattat 20820cgcaacaccc
atgaacctaa gccaacatga cttatttacc aggccagcta gagaagaaca
20880aggttgctga ttttcttgtc cgtgattgta gaagaaatgt tagaaatcta
aatgttgtta 20940gggatttacc cctcccccct actgagtgta tgaacttatt
attgacggat tgttgtaggc 21000ttccaagcca aaactctgat taagttttct
tttatgccat tttaaccaaa aaaaaaaaaa 21060aagctaggaa gctagctcag
cgcgctctaa ttatttcaca tgtgacatgt tttacactta 21120ttcatacttc
tatatgcagg agagggcaat gcaggagcac aataacatcc tgtctaagaa
21180ggtacttgca cttgaccagt ttgtgtaata ttgtaattta atttcttaga
ttttggttgc 21240atgctttgat gacgaatgac gattgacgaa tacattttta
tgcagatcaa ggagagagga 21300aaaaatctag agcaagtgca acagatgcag
tggcagaacc agcaccagca ccagcaccag 21360cagcagccgc caccgccgcc
acaaaatcat caagttcctc ctgatgcatc aaatttcatg 21420ctcccacctc
caattccttc tttgaacacg gggtagttac ttcttcaact taatttcctc
21480tattcaatat taagttaaga aacagatcac gtgattagtt cgttaatatt
gctaattaat 21540aatcatattg ttatatatca tgcattagtg ggtaccaagg
acaatttggt ggagaagtaa 21600ggaggaatga tcttgacctg acgctagaac
cgatatactc atgtcacatg ggatgcttta 21660caacatga
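21668

SEQ ID NO: 40
Length: 29237
Type: DNA
Organism: Artificial Sequence
Other information: BvFUL genomic DNA showing mutagenesis site at position 19552
Feature: misc_feature, location (1029)..(1924), n is a, c, g, or t
Feature: mutation, location (19552)..(19552), c to t
Feature: misc_feature, location (20703)..(20722), n is a, c, g, or t
Sequence 40:
atggggagag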
gtagggttca gctcaaaaga attgaaaaca agatcaaccg tcaagtgacc 60ttctccaaac
gtcggattgg attgttgaag aaagcgcacg agatctccat tctctgcgat
120gccgatgtag ctctcatcat cttctccact aaaggcaagc tcttcgagta
tgcttctgat 180acctggtatg tctaatttta taacttcttc ttttgtacat
caataatttt atcatcgact 240caactaaaag cttaagcaga tggttagggt
tctattatta ttgaattacc tcaaatttgt 300catcgactca actaaaagta
gagtatattt catgtagatc aggtgctttt tttgaatata 360ttgtcagttt
tagaactaca aaatgttgaa cacaagtatt tatacgcacg ctgacatgtg
420aattttttaa ttgacaactt tctaaattaa tactctaaat tactaatatg
aagaacgtaa 480tttattattt atcactttca gacaaaggca tgtttgtttt
ttctattatt tttcccatga 540aaattctcac caatatccga ttctgtatgt
taattttagt aatttctaat tttgatgact 600taataaattg taaaaaagta
taaaataaac aaatatccaa aacatctttg ttttcaagag 660aaatatctta
aaaacttttg ttttttaaga gaaatatctt aaaacacttt ttatcatact
720actatgatga tgtataaatc tattcaaaaa aaaaaaaaaa tgatgatgta
taaataattt 780aaagagttaa gtttattaga aattatagat atttatagag
ctgagtaata aaataatact 840ctacagatta tatgtagctg atgtagtgtg
tctgctcctg taagatttcc tttttatctc 900caaaaaaatt gcattgatat
tcgagccttg ccgaccccct tttcctcttc aaccatttga 960taagatccta
tgcactgagt aatctagtat tatatgttag atgttatata ttaataagct
1020aaaattgtnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 1080nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn 1140nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1200nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1260nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
1320nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 1380nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn 1440nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1500nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1560nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
1620nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 1680nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn 1740nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1800nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1860nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
1920nnnngatgat attatgtatc attattatca gtcatttatt attgatagat
aatgttatta 1980tttccataga tcatatataa tcttaccata tttactccct
atgattatat atttctatct 2040attaacatag taatagtgat atactaacaa
cgagtatctt tcaagtaaat aaatcatata 2100tatagggtta gaaggtaaaa
aagacaatat tttatgagac tttatttaca cccttcgtct 2160cataaactcc
tttctatttt ttgtgttcac tcgttttcta aagttttttc tacttctttt
2220actttttatt ggataaatac tttttccgtc tgtatgtgat ctacttatat
tagtatttat 2280agatagatta tactcttaaa gtattacatt cacaaaatca
tgtgaattat taattaaagt 2340aaaattatga gtatggtatt ataacttaaa
cgaagtagtt gtattgttta aaccaataag 2400tataataact tataaaactt
aaaagttgat tctacactta tcatgcactt gtgttttgat 2460gggttaaaaa
ttagctagta cataagtaac ataacataaa cttatctctg tatagtatgt
2520tgaattatta ctttatattt gaaaagaaca aagacacaat aagtccaaaa
gatccgatct 2580tttgattatc aactatgtaa gtgtctttcc taaatgatca
atcaccttaa ttagtatact 2640aaacaggaca attatgacat ataacacttt
tctatttgta caagttaatt aacccattga 2700cattcattca gttagcctac
atttttcaat aggtagtcat catcattctt tttttaacca 2760atttttttac
aataaccatt acctaccaac aacattttca aataatgaac tgaatgatgg
2820attccgtcaa ggaaattgtt cgtggacagt ttgtttacct cccaaatttc
ctttaatctc 2880atgctttctc catcatccaa acaatctaat accaatcgtt
ttctctaatt gcataaaccc 2940taattgttga atcctttaat cttgcctttt
cacattgccc aattccttta gtatgatatt 3000ttttattcga taatccctga
taagtaaatt cactaatcta atttcagatg gattgttgta 3060gagattgggg
aattgaagat tttttgcctt ctttgttctt gtttacaatg aaggatgaag
3120tttcatggaa tattgaagag aaatttgaga aaagaaagaa agtgtgggct
ttactgacga 3180caaaaactgt cattttggtg gtttttttca ctaaggcatg
tttggcacta gcgtttaagg 3240tagcggttag cgatttgaca agatcaaaac
gctacttaag aaaatgatga gtgtttggta 3300agatagtggt tgttgtagca
ggtagcaatt agagtagatt atgagtagcg gttgtggaat 3360gttactacaa
gtaacgtttg agatttagag gtagcagtcc agcaaagaaa cattgtataa
3420tagcataagg tagaattaat taaccaatgt ttttttattt ctttttcctt
ttattgttta 3480ttaataattt tattttatgc caattcaatg tttattttac
aatagcaaaa atgtaattta 3540aatatttatt ttaaacataa taaattttat
aagtatttta gtaaaattgg taatataaat 3600ttttcgaggg ttgaatattt
tcttagatgt atacatttct ttgaatagtt aaaaaatcat 3660atctcctttt
tgtgtaactt ctttgaaaaa taaatcttga atttaactat tgaacgaaca
3720tatgaaatta ttgtagttca tttcatatta taataaatgt taggtattgt
cttttacaat 3780ttcaactact tttggcaaac aattctgatt aaacagctat
tttaatcgct gatcgttgac 3840agcaaccgct aacagctact agaaccgcta
cttttgccaa acatgcctaa accaagtaat 3900atcggggatt ctttattata
taaaaagtta ataatgtgat tatttgaaaa aaaggttgac 3960tttatatgtt
gtttaaagaa aaaaaaacat ttattaatag aaaatatact atataggtct
4020ctatataccc agggcgaaat gaatgtgtat cttttcaaat gagtatgcgt
acatgttctg 4080taaatgcata tttcatatga gcataatgtt tttactatta
ttatgcacat ttgtgtttta 4140atttttcaaa tgagtatgta aggaaaacat
gtattcttgg catgtcagtg ttagtgattt 4200ttgttgttat ataaatgttt
tcgtgatttg tgaatgtggt acaaacattc atgtgccatg 4260gcgtttagca
aaacttttct agctcacatg atgcttcaag ctaattgcaa tgaactaata
4320taagggagag gacattttac gaattagttt tacattgata gtagtttgaa
gaagatagtt 4380taggagatag tttgttggaa tagaagctat gtgttagaat
tagttatagt catcaatttt 4440tgaataagac tcattattat ttcaatcctt
ctacctttta attactagtc caactctcac 4500tctttggtat taaattacac
cattcctacg gactatctaa caactctaac cacggcccat 4560actttttctt
cttaacataa aataatatta cgcttactaa ctactaactc ctatgacatc
4620taccttttca ataaaataac aattgataac tatataacaa cttataatct
aaaagtaagt 4680gtcttgataa gtgtagagta ttgtgggacg aagggagtaa
ttcatagcaa atactatcat 4740agcaatacaa tactaggaga ctaagatgtg
agttttgaac ttcaaaaaaa aatatacgcg 4800acatagttca ccggaagaac
tgcagcacaa caaatgcaaa tggggattaa atgaggagtt 4860cacctacatc
acacacaaga gcgattgagg attttcagat ctggaagaag agcgaaaaat
4920caggcgagag cttcaacttt cggagtttaa tggaagagct aatgatgatt
gaattattca 4980gttggattta tcttttgtaa gatgagggga tgaagttgaa
gattgccata ataacccttg 5040agggcgacac aatggtggtg ggaatggaaa
tatggaccac gaccgattta tgagtggaaa 5100gaattgaagt ctctgataca
atgacgtttt ggtatatcat cggtgctttc atggctgtca 5160gatctaccga
aaggaaaagt agtggtgaaa acaaatgtcg ttcaaaccaa ccacctttga
5220ctttcaatat tgaagtgaca aggaaaactg gcaccttagc tagtgacttg
gtggagtctt 5280tgcgtaaacc ggtggaaaag aagcttatct ttgagtttta
gtgtggtctt ggtgcttgtt 5340taagtggctt gactatttag gaggaaaaaa
atcaattgtg aacaattggt gagtaaccta 5400aaaatcaatt gtgaacaatt
ggtgagtaac ctaaaaatca attgtgaaca attggtgagt 5460aacctttaaa
ccagactcaa gagtgggaaa gaggcggtta gacaagtaga tttagaggaa
5520gaggaggtag aaaatagatt agaatttttt ttgggagttt tcaaatgtag
tggtatcggc 5580tcataacaag gtggtcaaat ggatggagag cttggtcgtt
tgagtagtct catggtggtg 5640attataacta cgttggcgat aaacaagttc
gctttaccaa gaagttgatt ggtggaagag 5700agattttcat aaagaggggc
tcgtatagca acagctttgt gtggtctcag tttcattgtc 5760ggcaaagccg
tgggtggtaa atatggtgat tttggagatt gttatgatgg agggagttat
5820ggtggtttca tgagacatgg atgctatggt ggtgctgtga tcggtggaga
agaggagctc 5880gtgaaaggtc actctgatgg agcgggtgta atggtggtct
aggttttttc ggcatcggag 5940gttgcgtatc tcgtgaaggt tcacatttgt
acaccggtga atactatggt ggtgaactag 6000gatgacagtc aaggtgacca
tagtagaaaa aaaaaacata aaccatgtag cttagatgat 6060ttgaaaaaaa
atcatgtttt tggagaagaa tcttaaatat tatgacagag gcaaacttgt
6120cattgaccaa atagatgaca tagcaattgc acgtgtctcg ttaaaatttg
aaaccataaa 6180aaattcaaat tgcacttcat atgcctttct tttggttgaa
aacttcatat accctaatgc 6240gtcaatatgg ttctttttcc aaaaaaaaaa
gtaaattatt tcggcgttag taaaagcagg 6300tccacctcca taatccattt
tattaagcca actcctctac cctacttttg caacctatca 6360tttcttattt
tctaaaatca tatcaaaaaa acaagtgtga acccaaaact aactatatta
6420tacctaagtc taatttcttc atccaatgtg ttcaacccca tttttcaacc
cttccactca 6480taaacccatc ttctttcact cctaaaactt tcagctcacg
ctcgtatcac ctctttactc 6540acatatcagc ccaaacgatt tcttattgaa
tccactaaat tatgtatatc gattttttca 6600ccaagtactc cgagttttca
aaaaatttac ctggtacccc caagttttca aactacacgg 6660gataccccta
agtttcaaac taatacactc agataccctt aatgactaac gacattaatc
6720gccgttagtc attaacctta attttctaga tttcaaccta attaaccact
aaccctaacc 6780ccaaccctaa ccctaataat aaccctaaac ctaaccctaa
ccacccctcc ccaaccctcc 6840caccacccct gccccccatc ctccactcct
gcgcagccag caggcccccc acccatttga 6900ttttaaggaa gaaacacgta
tcagggaagg gggagaactt agctctgaca gcggcaacgg 6960accaccgacg
agtctggact gtaggacggc agcgtcaagg cgcgagccgg ttgggctact
7020gcagttcaaa gcaaacaggg gaggggaatc gagccgagaa gagagctaag
gaaagagggg 7080tgatgggcgt tgcggagatg gtgcgcatag tggtggtggt
gtgggggagc tgtggtgggg 7140gagaaatcga atgggtgggg gagagagggt
ggggggctgc ggtggggttg gggctggagg 7200gggtagggtg ggtgggtgga
ggggtggtgg ttgggtggta gtgggacaac tctcaaacat 7260gattctttct
caaacatgat tctttctcat atcctttttt ggatttcctt aaaaaaatcc
7320atatccaaat aacgttgacc ggtgagggac aactctttaa cactattctt
tctcaaatct 7380tatgaaatct tcacatttaa ctcgatctcc ctttcaatga
acctaaagat atatattaat 7440agagttgcaa attcctaaac tctagaaaat
tcaaatacaa cataagagtc ctagattctt 7500ccacaagatg tattatatct
ttcaaagttt cccagataaa attagtaatt aggaaactcc 7560ttaacaagga
tacttaaagt tttatctaaa tcttgcataa attgaaatcc aaccataatt
7620atgaataaat aatcataaag aatcctaaca taaataacta gaaaataaga
taataaagaa 7680gcaacaaaag aatctcataa ccaccatttg aatcccgcat
gagaacccaa atagttgttg 7740ttccttataa aaacccacca cctttcttcg
ggtattatga cggtattgga ctatagtatg 7800agacgagatc tcttaatcac
caatcaacta ttgtaaactt gtgagcctga ataatttatt 7860tgagatacaa
ttctaaggtt gtttatgaac gtgtttggta aaattgttat tgataactcg
7920ttcgtggaaa ataaatgcaa aagtcaacat gccaaaaaaa gtgctaaaat
caactttcgg 7980ctttgcttga aatgttaagt tttaagctat ccaagagcca
ttagtcaaaa tctattgaaa 8040gcgtactcaa aaaccattta tcaaacaccc
ctacaaatcc ctttagaaaa caataggagt 8100tgtacaatat aagtattgag
ttataaagtt gatcaagtga tttaggaggt tgttccaaat 8160caatctacaa
gagtttgtat acttataccc cttcgttttt ttaattgtta cacttaggcc
8220ttgtttgaca aatagagttt agcggttaga gtttagattt tgctgttaga
gttttaactt 8280tttgttaaat agatttgact gctgatttga caacttcttc
ttataaatgt gtttggtaat 8340tattaacaga ttgctaaaag cttattaccc
tttatttatg tgaaatgaca tgtatagaca 8400ttttaatcca catgggtatt
attattatta ttattcgagg catacaagtc attgaatata 8460tttttaccta
atcctctaaa gaaaaagctc ctattaggag ctttttcatt tcgagagttt
8520ttattccaga actctcttca aaactctttt ttaccaaaga ataggagctt
tttcatttca 8580agagttttta ctccagaact ctctttaaaa ctctttttta
ccaaacaccc cttttagagt 8640ttttgactag tcaaaactct aaaagtggtc
caaatttctc ttttaactcc aaaactctaa 8700ttgccaaaca cccccttaca
cttttcacgc ataccaatgc aacactttga cgattaacat 8760ctccagtttt
ttatttgtaa aaattataaa gagtgcatat taataagtag ggctgttcaa
8820agtgcggtct ggaccgcacc aaaccgcaac ccaaaccgtt gtttcgcggt
ttggtttggt 8880ttgcggtttt aaaattgcgg tttgggttat gatttcaagc
aaaccgcggt ttgcggtttg 8940ggttgggttt ttatttttgt aaaccaaaac
cgcaccgcaa accgcaatgt tacatttttt 9000ttaaaaaaat aaattaaata
catttatgaa ggtgacatac aattataaaa ttgaaaaaag 9060aagtttgagg
taaaaaactt taacacttat gataaatcat tatatatgtt taattatgaa
9120ttcagcttca tatctatttg gactcttatt aacaattttc ttttaatctt
aggaaacaaa 9180agtaatgtcg cggaggaaat aatggttaaa ccgcaaccca
aaccgcacca aaccgttttg 9240cgcggtttgg gttgggttgg tttgggaaaa
agtgcggtgc ggtttgggtt ggaaaatttt 9300caaaccgtat atttgcggtt
tgggttgggt tacatcccaa accgcacaaa cccaaaccgc 9360gaacacccct
attaataaga tacatattaa ttcgaatttg acaagatcca catgactatg
9420tttttattcg cgtataaacc acaaaagaag gttcaagtaa aatttgtgta
tggtgtaaca 9480tgtcaatcaa agaacggagg taatatttgt caagacactt
tagtcacttc taaattccta 9540taaacaaaga aatatggaag aaaactggtg
atgaaaattg aaaaggtggg tataataaga 9600gagacacaat tctaaaataa
gaaaatatta ataataaaat aataagttac gataggcctc 9660atgtttgaaa
acggaaaaaa taaggagata gttcgtgtaa aaaggaggga gtaaagggta
9720atgcatactt tgtattgcaa gcttagtttt aaaaggcata agacgcaaag
cgcatcgagg 9780cacaagacga aggcgcatgc atctcgtagt tgaggcgtgt
aatgatttta cttcacaacc 9840acctgagcaa cccaatacag aacgaccaca
agaaaaatag aaagaaagga aatgattttg 9900attgatcagc agaaaataca
gagcattcga gaggctcagt ctctcccaag gactacaaga 9960tactactaaa
tttcacaccc ccttcagtcc ccttacaccc ttatttatac tacttctgct
10020ctcctatttt aacggctact gacattctct gagctggcct gctattcctc
tttttgtgct 10080gacatttctg aatattctgt ggtagtggct ccattctcac
atttggacag gtttacccct 10140ccatttcttt cgtgcatacg tcagccacgg
tttggggatt gaattcatta cattaccctg 10200cccccaaaga ctcaccttgt
cctcaaggtg gaaggaaggg aaacgttctt gcaccaaatc 10260tgcatcttcc
cacgtggctt caaaaggtgc taagtccttc catttcagta gcacttcagt
10320ctgcgtatgc ctccctctct gtgtctgacg tacgtccaat aattcttcag
gttccacaac 10380tagttccaag tctgctgcta gttgagttgg tacagtggtt
gctgcctggg catctccaat 10440tgctcgtttg agctgggata cgtgaaatat
aggatgtatt ttactggtgc ctggtaactg 10500gagtttatag gcgaccttgc
ccaccttttg cagaacaggg aatgggccat agaagcgggc 10560tgccagcttc
tcaaatggtc gcttggccaa ggattgttga cggtatggct ggagctttaa
10620gtaaaccaga tcccccactt caaaggactc atcgcgcctt cttgtgtcag
cataggcttt 10680catcttttgc tgggagcgta gtagatgaaa gcgtaaatca
tcgaggatgg catcccgttc 10740ttgcaacact tcctctaagt tatctactgg
cgtttgccct ctgcctactc gccacaagtg 10800tgtgggtcac gcccgtacaa
caccctgaat ggagtcagct tagtagacat gtggggagag 10860gtgttgtgtg
agtattcagc ccaagggagc cactttgccc aagtcttcgg gtgccccgcc
10920acgaaacatc tcagatatgt ctcaagtcct ttgttcacaa tctcagtttg
tccatcggtt 10980tgcgggtgat aggcggtgct tctccttagt gttgtccctt
ggagtcgaaa caactctttc 11040caaaaagtac tcagaaaaat tcgatcccta
tccgaaacga ttgatgccgg aaacccatga 11100agttttacaa cctccctgat
gaaagcttca gccacttgtg agagcactaa aaggatgacg 11160aagcccaatg
aaatgcgcat atttcgataa acggtccacc actactaaga tcgtgtccac
11220ccccttggac aagggcaatc cttctatgaa atcaagagtg atatcctccc
aaacctgagc 11280tggaatggct aagggctgca gtaggcctgc tggcttttgt
tgagagctct tatgctgttg 11340acaaatgcta catcgctgca cgtacaatgc
cacgtgcttc ctcataccta tccaatacca 11400ctcagccgcc aacctaaggt
acgtttttac ttcacctgca tgtcctcctt ctggggaatc 11460atggtaagct
atcatcaact taggaatgat gacggaagtg ttgggaatta ccattcgccc
11520cttataccgc agcttgccat cctccaccgt gaaccccaca agtggtttat
ctccctgcgc 11580cacttcttcc ctgagccttt taaggaacca atcctcctct
acctcttttt ggagctctgc 11640ccagtccact ccttgggttg tgattatggt
ccctagctcc atttcaccta cagtttttct 11700agaaagcgca tccgcaacct
tgttggttgc ccccggtttg tagtgtattt caaagtcaaa 11760cccaattaat
ttgcttaccc atttctgaaa atcagccccc acttcccgtt gttgtgtgat
11820gaaacgcaaa ctttgttgat ccgtatggat cacaaatctt ctccccaaaa
ggtaatgttt 11880ccatttctgg accgcaaggc atatggcaat taattccttc
tcataaacgg acttgtgttg 11940tgctctcggt ccaaggagct tgctgtagaa
tgcaatgggc ctgccctctt gcattaggac 12000tgcccccacc ccatacccag
acgcatccgt ctcaactacg aaaggcttat ggaaatcggg 12060catagcaaga
accggtggct gggtcatagc ttcctttaag tgagagaaag ctgaagtagc
12120tttttcggac cagccaaagg agtccttacg caattgctcg gtaaggggct
gggcaatttg 12180cgcgtattgc ctgataaact tgcgataata cccggtcagc
cctaaaaatc ctcgaagctc 12240cctcaaattc ttgggaactt cccactccac
catggccctt atcttctcca tgtctactgc 12300caccccatgc tgcgaaattt
acatgcccca agtaggccac tgtcttcctc cccaagtcac 12360atttcttctg
gttagcgaac agtttgtgca atgctaacag ctgcaacacc aatcccatgt
12420gtcgtgcgtg gtcctctttg gtcttactgt agaccagaat gtcgtcgaag
aagaccagca 12480caaacttcct cagatatgga cggaaaacgt tattcatgag
tgactgaaaa gttgctgggg 12540cattggtgag cccaaagggc attacgagaa
attcgtaatg tccttcatgg gtgcgaaaag 12600cagtcttatg ggtatcctcc
gggcgaacta aaatttgatg gtatccggcc ttaaggtcga 12660gtttagagaa
gatggtagcg ccatgtaact cgtctagtag ctcatcaatg accggtatcg
12720gatacttatc cggaaccgtc tccttgttca aagcccgata atcgacacaa
aacctccaag 12780aaccatcttt tttcttcacc aataatacgg ggcttgaaaa
tggactagtt gagggcttga 12840tgatgcctgc ctccagcatc tctcggatga
gtctctcgat ttcgtctttt tgaaattgag 12900ggtaacggta tggcctaacc
cccaccggat tactgccttc cttcaacgtg attgcatgct 12960catgccccct
ctttggtggc aggcccaccg gagtatcaaa aacttccgca aactgactaa
13020ttaccttctg taaaaattcg ggtacttctt gtgcctcctt caactccgct
tctccccttt 13080tcccatcatc ctcaatctgg ttgagctcca agagaaaacc
ccctttttct tttcggattg 13140cctttatcat ggctctaagt gagattttag
atcttgctaa ggaagggtcc cctctcaatg 13200tcaccactct gccctccact
tcaaactgca taacctgagt tttccagttg gtaatcactg 13260accccaattt
ctcgagccac tgcactccta atattaaatc tgagttaccc aggccgagag
13320gtaaaaaatc ctctgttact tcgatttccc ccagctttaa agtcacccct
tgacacaccc 13380cagtaccatg gacagcttca ccattcccta aagacactcc
aaatccccct gcatctgaga 13440tgaccaactc aagttcctca acagttaaca
aggaaataaa attgtgagtg gcacccgggt 13500caatcatgac caccacctct
cttcctttaa tttttctagt gattttcatc gttttaggac 13560tcatcaaacc
aatcacagag ttgagagata cctcagtagg aagttccggt ggtggttcgg
13620acggtggtgc acgagctgcg tcgctcacct cttcggtttc ttcctcctcg
tcatgcatca 13680gaatcacgct gatctctttt ctccggcaga tgtggccggc
ggtccactta tcgtcacatt 13740tatagcacaa cccttttgct ctcttctctt
gatattcttt ctcggaaagt cgcttgaact 13800ctacagattt ttttccttgc
cccccagcaa ttggatacgt gttcaaggtg ttggaatttc 13860cccctgggtt
ttgggcccac attttgctgg caggtgggtt gaggctggtc gttggattga
13920aggaagcccc tacactcctt gtcattcccc ctctgttata aatcgagtaa
ggcccattct 13980tagttggccc acttctttta taacccacaa tcctattcct
ttcctcaatt cggcctgcta 14040gttccattgc ttgctctagg tccataggat
tgagtaacct gacctccact ttgatatcct 14100cttccagccc attaatgaac
tgacccatga gtatttcttc tggtactcta ctcaacggtg 14160ccgccttctc
aataaaagtg cgtcgatact catccaccgt ggtggtttgc tttgtggcca
14220accaccgttc ccacaatgaa ccatagtggg ttggtcgaaa ctgacggagg
aggtactcct 14280tcagatctgc ccaccacctt atcggccgcc ttttattctc
ccactggtac cacctgaggg 14340catccccctc tatagacaca accgccgcct
ccagggcttc actgctactc aggccataaa 14400acgaaaaata tcgctcggct
ctaaggatcc acccatccgg atcggaccca tggaaaatgg 14460gcatctctaa
ctttcgatat ttccagttcc ccccggaagt cgaacctcct ggccctccac
14520ctgagccgcc atccccatac gatcggcccc cgagctggaa acctccatat
tcgtcccctt 14580ctcggccccc cagatttatc aggtcaggag ccctccgttc
cggcggtcgg gggtgtccag 14640tgacggtttc gggtgtctcc cgcgagggtt
gtggcaaggt tgctcggatt tcaacctgaa 14700atttcctctg ttcagcacgc
aacccttgaa tcgtcccatc ctgtgtctcc cttgaccggt 14760taatccggtc
ctccagccta acggccaaga cctccatctc ctcgcgactc ttcctcgccg
14820cctcattctg gccttccaag atttgagcgg tcaacgattc ccccatggca
ttaatggcag 14880attccaccgc cctggacacc atggtggcca ctgacccttc
gagggctgcc attctttctt 14940ctaatgaatc caccctctgc acttcgtttc
ttggtgccat ggatcgatgc tctgatacca 15000agtgtaatga ttttacttac
ttcacaacca cctgagcaac ccaatacaga acgaccacaa 15060gaaaaataga
aagaaaggaa atgattttga ttgatcagca gaaaatacag agcattcgag
15120aggctcagtc tctcccaagg actacaagat actactaaat ttcacacccc
cttcagtccc 15180cttacaccct tatttatact acttctgctc tcctatttta
acggctactg acattctctg 15240agctggcctg ctattcctct ttttgtgctg
acatttctga atattctgtg gtagtggctc 15300cattctcaca tttggacagg
tttacccctc catttctttc gtgcatacgt cagccacggt 15360ttggggattg
aattcattac aaggcgcacc tttggtacca agagacttag acatcaaata
15420aagaagcatt gtaatactta gaataaacat tgtttttata ttctaaaaca
catattttct 15480ctaagacacc tagtaatctc atagtggatg tcatctagtt
taggtggtaa ggttttattg 15540agttggtact caagacctca gacttagagg
tggcaaacgg atcatacggg tcgggtgaaa 15600atgagtcggg tcataatcgg
gtcacctttg tgtccaggtt acggtcaggt cgagttcgtt 15660cgggtacgag
ttcatattga gtccatgagg tttcatgtca tatcgggtcg ggttagattg
15720gatttacaat ttcgcaaata aataaaacgc atataatact aaagagagta
aattaaataa 15780ttaacggaca ctagctaaat catatattag tattttatga
tgtattttcc ttaaatttat 15840ttaaaaaata actaatatga caatttttcg
ggccgggttc gggttgtggt catcattatc 15900gggtcaattt agtatcgggt
aggctcgggt tcatgtcata ttcgggtcta ttttaattcg 15960agtcgggtta
tttcggattt aagctctatt tcgggtcagt attttcgatg aagaacgggt
16020ttcggatcgg gtcaccggat acggatctat tttgccgagt cagactgctt
ttcaaaccta 16080ataatctcag tttttccacc tattcagatt tgcctatgat
ctctatttag ataaatgagt 16140acaatattgt gtctatccat gaaaatgaat
atctcacaat gtaaaaggat atctctaaat 16200ttcactaatc actctatctg
ttttgaataa taatattcta ttttattgca tgtagtaaag 16260atcgagtatt
tagtgagatt tggagaaaag aggaggctag agagaaacta ggatttagag
16320aggagaaggg ggctctgtaa cacatacaag atagatactc ttttacacta
acttttcaag 16380atactcaaca tataaaatca gcatcatctt ccaaacaaca
actttaagcc acccatgaat 16440cttaattaga taataaaaca taatcgtgaa
tcatctatcc tttgtttggg gggatcctaa 16500agcaattgag gaaaagcttt
gatgcaaata tcaattgtgt aaaaaagcaa gtattcgttc 16560gtgatgttgc
tatactaggt tattttttgg atccaaaagt cattcctact agaatcattt
16620aggaaaattg tcagtatgaa ttttaaattc aggttataac caaagataat
tgaaaattgt 16680caaacttttc aaataattcc gaaataaaca tgtttgtaac
atggataaac ttttcattgc 16740ttttcaaata attccaaaat aaacatgtta
taacatagat aaccttttca gataattcca 16800aaataaacat gttgtatcat
ggataaactt ttcattgttt ctagtcactt aaaattctaa 16860aaaaatcttt
cctccctact gttactctct ctagcaccaa atctatcaca tgagaaggca
16920gaggttttca aaataaaccg ttacttaatt tggtacttat ttcttgatcg
gtgttcatat 16980catatgagtt cctactctat atctctctac tcttctaaat
ccttgtgtca cttcctgtgt 17040ttcataaata aaaaggagga agtattagtt
ttgaaacgaa aggagtatgg tgcatacatt 17100gatagaaaaa agaagttatt
tgtccttatt tcactcatat aacaacacca aattctgtat 17160tgttatcaca
aaataaaact tggattatct ttgtttcata gcccaaattt agaattagtt
17220tgtcagattt ccaatcatct aattacaata ttagagctag acctaggaca
aaaggtgggt 17280ttggctactt ggtaatagct atgtctagtg ctaggatatg
tcattgtcgt agaaccatgt 17340tatggacatc ttaagaaaca aggttaacct
aattggttgg agatcctact ttcactttta 17400taataaagtt tcgattcttg
cctatttgta aagtagaatt cctaaatttc ccttcactga 17460tatttatctt
aacataaaaa aatgttataa acattgggat tgtatataag tcaaaataaa
17520ttgacaatct tggtaacaac taagttaaca ttaattttat aagtaaatga
ttaatcccaa 17580tataatctct tatttagtaa atgagacaaa cttgtacacc
ttcgtgttag actcgttaat 17640gttcgctaac aattcattca gtagtcaaca
gcattttaaa tttgaaataa gtgttcttgt 17700ggtttttgag agatcaagca
agaaaacatg tctctcccct ttgaccaact aattgggatt 17760aagaatacta
gttttaagat tttaagaatg agttatagtc tttcttagac cgctacaatc
17820cccttgttga tatgaaccag atatattttg tgttcaaata gtagatcaat
gcattgttga 17880taatcctttg ttaatgtact tgttgatctt attttgtact
tttggtagat gcgctatact 17940ttctttcgat tgctcatttt gaactcttaa
ctacatatgt tagtttaagt agatgattta 18000gaattgctat ttcaatcttc
aataagcaat ttaagttgtc aaaccttgtt tcacatcatt 18060agggtgaaag
ttatttggat aaagacctat atctaattca atccaaagca aattagtaat
18120gcggattgga ctcaaactat gtttagattg gattcgaatt gagtttcttt
tctttttctt 18180aaaaaaattg gattttccga tcgaattgag ggtgattaga
tccaaaataa ccgaatagta 18240gataggattt gtgttgtata ttagaattgg
gcttaaggat ttccatttta acaaaaaaac 18300caaatggtcc gactatcaaa
aactataatt tgatagtcat gcctatcgaa aactttattt 18360tcattctcgc
acctaattat gggcttgtat aaattagttc tactatcgaa aactaatttt
18420gatgctcgtg cctaatttaa attttcgaaa aaatgaagtt aagaaaattg
gatatttcgg 18480attggatcca atatatcttg tgaattatta atttggatta
gtttggactc aaattcttat 18540tggattggat tcaaattaaa agattaaaat
tcaaattttg ttcgaatcaa attggagtag 18600gcttaagttt aaatcataca
ccgaactttc accactaccc atcatgctta agcttctaat 18660gtaagagagt
gtttgggagt tgagctcgaa caactaaatt tctaaaagaa ccaagttcaa
18720acaagaaatc taaaagctcg attaaacttg agtcaagctc aaacacctat
attccttatt 18780ggagcttgac tgaagattga acacttattc cttattaagc
tttacgctaa aacattgctc 18840gactcaactc ttctacatcc ctatagttca
aaagaaatag ttgtgggctg tggtgctctt 18900gtagaccaac gcactagttt
aacaaagcta agtgcctgac tgcaattcca tacacattac 18960gatcaccatg
acctagtttc agctcacact ttggaagtct aatttgaact tgttctctac
19020ctccaattca ttgtggggta ggaggcgata gttaagggat caaaatctta
tgatataact 19080tgcataggct atgccactat ataatgcgtc ttgtgtccca
tattagttta atcaaattga 19140aatgttttac catttatatc ttcaattatg
tatggatact aatatttgat ttgacgtttg 19200atatgatatt aaatgtggac
tgttattctt gatgtgcttg agaagctttt ttggggccag 19260ttagaaacta
tattcctttt atggtcctaa ctaggttgtt gttggtgtgt tcccaaataa
19320cagcatggaa aggatactcg agcgctatga aagacactca tatgcagaga
gacaactgac 19380tgctccagat cctggatccc atgtaatcca gctaggcaac
tatcttttct aagcatttaa 19440atcgttgaga tttcaatttt aaatgtgttt
taactgataa ttcatgcatt atatgcttag 19500gtaagtttga ctctggaaca
cgcaaaactt aaggctaggc tggacattct ttagaaaaat 19560caaaggtaat
aagatccaga ccaaatataa tttgtataat aaccacctta tgaggaaaat
19620ttaagatcct tgataatttc aggcattaca tgggagaaga acttgatacc
ttgagtctca 19680aggagcttca gaatttagag catcaaattg acagtgctct
taaacacatc aggtcaaaga 19740aggtagtttc acagttgcat tagatcatct
tatggatcaa ttggatcact tgtttgtatt 19800ttagcgttgc tcaacacggt
cgtctaatat agtgtgcaaa acgacctaca gggcaacacc 19860ttttataggg
ctcgaaaata cgaaaaatta aatgtttgtt ttagtcatat tgttcaaacc
19920caagctttat cttgtcaaaa atattttata atgattattt tttagaatac
attatttaca 19980tttttgcaat ttatgcataa tacttctaag gtccaacttt
ataattgaaa tagaagtcct 20040taaattttaa agacgacctt gaggaaacct
aatttcttct catatataat taaatcaatt 20100attctacaag ttagtagaac
aaatactaca ataacaacaa tattgaagcc ctaatctcag 20160taggattgga
ttgattgtat gaagtcttat tagtggccgt taaatgtttc ttgtaggtca
20220agatgacatg gctcatatag taaggttact tgactaaaag acgaggattt
gtttcgactt 20280agattttaac aagtttccct catttgttaa cacctaagcg
tactaaatca aattctaggt 20340tttactcact caaatttccg atttaggaag
ggcttgagga tagttgtatt atcgtaactg 20400actaatcaaa ggagcctctc
ttagatcagg tttcacttgc caattctaac aacttgtttg 20460gtaaaaggaa
tttggaatga aaagaaagga attgaaaaga aacattctac ttttcaatgt
20520ttcattcaaa aataacattt taagtgatag gaaatggaaa gaagtgaaac
gaaagcctct 20580ttacaaaatt atcatttttc tacccccccc cccccccaaa
aaaaaaaaaa aaaataagta 20640gtaagtagta gaagaagaaa taaataacta
acaagagtag taagttttta cgttttcttt 20700ctnnnnnnnn nnnnnnnnnn
nnattcaaat cagaactgaa tagtcataac cggaagatta 20760gtttctctct
agcgtgacta gggtttgagt aaaaagagaa aacttaaatc aaacatggga
20820tattaaggtt ttttttcctt tcttcagttc ttttctcttc ccaatccttt
cctaaaaatg 20880aaccaaacag gctgcaaggt tttcacttgc ttaacacaag
atttattttt aaaaataatt 20940acactccaaa cttttaagct taaaaccaat
tttaattcaa atcagaactg aatagtcata 21000actggaagat tagtttctct
ctagcgtgac tagggtttga gtgaaaaagt ctagggtttc 21060atgtcattct
tcttgcttcg agtcccttct tgggattgtt gttagccatt atggctaccg
21120aaatcgttat taaatgtcta aatcttagaa ttactgctga agaaaacaac
ttggtgtttc 21180tcgaagatgt tgatgataac tcgcagcacc atacgctcgc
actggcgatt gttggaaagg 21240ttctttcgtc aagaccatac aatttcgagg
cacttaaatg aaccttaaac tagatatggg 21300tgatatccaa aggagcccta
cttcacccta ttgaaaacgg actttttgtg gtacaatttg 21360cgacaattaa
ggaccgatct aaggttctag tcagcagacc atggaccttc gatataaacc
21420ttgttctctt agatgctatt gaagggggta ctcaatcttg acccattgcc
cgttttggac 21480tcgcttgtat aaccttccta tggactgccg atatgagaag
ttcatcaaaa actattgttg 21540gtgtattggg ggaggtattg gaagttgatt
ttgacaggat tgtttgggat aaatctgcaa 21600gagtaaaggt gaagattgac
attacaaaat cgttttgtcg tgtgcagatg atcaagacta 21660acaggggtga
ggctgtgatg atcaatgtta agtatgaaag acttcctaca atttgttatg
21720tgtggaattc tggccatatt gaaagagatt gtgtgaagac ccaggaagaa
gagaaacaag 21780tggagagaca atagggggtc ttggaggcct ctccgcgtag
gggacgatta aagatggtga 21840aagagtcgaa agccttcctt cagtgtgctc
gtacactcca ctttaataac aaggaagaag 21900taaggggtga ggaaccacgg
gattatgtgg agccgagggg ttattgtcgg ctatcttagg 21960gggtaaaact
ttggtggtcc aggagatagt ggacggctct aaggatgcca tcgaggaagt
22020tcgtgctgaa ggtgcaccac tctagccccc ttgtaccctt tgggtaatgc
catgctacct 22080tttacttttg ctgttgggag tgctaatcct actccctccc
accgaaaagt taaaattaaa 22140aacaaggcaa gggttcaggg tgttttgaac
caagttaatg ttgtgggtgt tggggggttg 22200gctaataatg ggggttgtga
gaaaaggata ttccccaacc cgatggtgtt agaaaaagaa 22260aaggggttca
atgaagaggg tttaagatag caaaacgaga ggattgtatg taacctatca
22320gtagggaggt aactattgag gtggaggtgg gcgagaccca accccgcccg
acattatgaa 22380tatcctatgt tgcaactgtt ggggattggg caacccccgg
gaagttcgga tgcttcgtag 22440gtggagcaat agtgctacac tgagttcggt
ttttatttct aaaactatga ttagtggtcg 22500tgatgtggaa agggtgcaaa
gcgggtaggg ttttgattgg gcaattgggg tggatagcgt 22560tggaacttca
agagtttggt gcatttattg gaaagctggg gaagtggact ttactctagt
22620ctctctatca agtcatcata tttgtgggaa tgtgaagctt gttgatggga
aggtatgatg 22680cttagtgagt atttatggtt gggcggatac aattcaaaag
tataaaacat gggagcttat 22740gcaatccttt cactcatatc atgggccgat
attgtttggt tgggacttca atgagatttt 22800gacaatcgga gaaattgaag
gagggtccga aactcaatga agtaacatgc ataattttct 22860agaaacttta
gatgacatga agcttaggga ccttggctat tcgggaactt ggtatacata
22920agagagaggc tttaagccac ggaagagaat gagggagaaa cttgatcatt
ttgttgcatc 22980ttcatcatgg tgtgacttct ttccgaaagc tacagttgag
cacttgatgc gctacaaatc 23040ggaccacact cctattttgg ttcgccttgt
aggccatcag tgaagacata agaagaaaaa 23100gacgtagttt tgttttgaga
ctgcttgggt gcttgaggaa ggttgtgagg cccaatgggt 23160gagtcatggg
ccgggtttac tcgcgaggta tttatcgagc gctttaaagc cgtggaaggt
23220gggttcaaag caaggagtga tgggtctctt agtaatctgg gcccgcgtgt
gagggagatt 23280gaggaggcca ttatagatgg gaggcagcga agcagataag
gactatgagg ctctatgaga 23340ctcctctccc acgaaagtta gacgaggtgt
tggacaagca ggagacgttt tggtttttga 23400ggtctcgtgt gagttagata
aaggatggtg atcgtaatac acaatatttc caccacaaag 23460cttcccaaca
caaacgtcgc aactacatag cggggatgta tgataataaa ggggtgtggc
23520aagataacga agaggatatt gaagggaata tttcagagta ttaccaaacc
tcgttcggtt 23580cgtgctcccc ctctaggaag aacgtcgcgg ttgtccttga
ggttgtgagc ccggtgataa 23640ctgatgatat gaatatggcg gttatgaaat
cttacactaa agatgaggtg tgggaagcac 23700taaaccacat gaagcctaac
ggaatgcatg ccatccttta tagaggttct ggaatacctt 23760ggagatgata
ttacatctgt cattttaggt attattcatg gcacccgacc ccagatgttt
23820ttaacaagac taatattgtg ctcattccta aagtcaaatc cccaaatctt
gtttctgagt 23880ttcgcccgat tagcctctgt gatgttatct ataaacttgc
ctcaaaagta cttgctaaca 23940gattaaaaca ggtttgcctg acattgttta
tgataaccag agtgcatttg tgtccggaag 24000ctatattacg aacaatgctt
tgatttctct tgaattattt gactctatga aaaaatgata 24060cagagctagg
aaaggttttg tgtcgatgaa attggatatg agtaaagcct ataaaagagt
24120tgagtggtgt tttttcagta gtgtgttgga gaagttggat tttgctgaat
catgagtgaa 24180tgttgttatg agatgtgtgt cttttgtgca gtactctttt
gtggttaatg ataatatatg 24240tggagctctg acaccctcaa gggggctttg
acagggagac cctatatccc cgtatttgtt 24300tatacttgtt gcagataccg
ttttagctct tcttagcaag gcattcaaca atgcgtggct 24360atacttgata
ttctcaacaa atatgaggca gcatcaggct agaaaataaa tattgacaag
24420tcaggaatct ctttcaataa aagatttgac gtattttatg gccatgaaac
aagttgagaa 24480gcatcagaaa gacttggtat cccaactttg gctaggagtt
cgaaaaaagt catatttgct 24540gacattcaag agcgaatttg gaagaagctg
cacggatgga gagaaaaact tctcgcgggc 24600ttgaaaagaa actctcttaa
aagttgtggt tcaagcaatt ctaacctatt tggtgggcgt 24660ttacagattc
ctaaccagta ttatccaggc cattcatttg gccatggtaa agttttggtg
24720ggggtcgaaa agggcccaca attcgatgca tctgggggga tatgtgctca
ccaaaatgtt 24780taaggagcct tagctttaaa gacttagggg tgttcaatga
acctaaacta aggaggaatg 24840cgtggcattt gattcctgct ggtgagtccc
tttcgggtcg agtgttctcg gccaagtact 24900attcgaagtc aacctttttg
gactcatttc taggtccggt aggtagcttc tcttggaaga 24960gtatttgggg
ggccaaggca ttagttaagg gtgttttatg gtgcgtaggc aatggcagac
25020aaatcaacat atggcgtgac tcgcgggtgt tgaatggtga tagtaggttc
atccccggag 25080agcgcgtttc aggccttgag gatgtttgtg atctaataga
ttttgcacaa tggagtgcga 25140tgtggacctt gtcacgattg cttcaatgaa
gatgatgctc aagccatttt agtcatacct 25200ctaagtaagc gccttctgaa
ggacatggtc tcttgggctt tcactaagga tgaatttttt 25260ttgtaaaaac
aacctatatg gccggttggt cgaggaattt gaatttgttt cacaaagcat
25320ggctgcaaat ctggggcctt aacgtgtctc cgaaggtctg ccacttcctt
tggcgtttat 25380gctcggtacc cttcctgttc gagctctttt aaaacgacgc
cacataactg atgatgattc 25440atgtcctttg tctaaaggag cccggaaagc
atatcacacg cgttgttcta ttgcccatat 25500gtagccgaag catgggagag
tgcgggcctc acaaattgtt tgcctttgtt tgatggggct 25560ggtatgcttg
atgcgtgggg ggagtgggaa acaatcgatg actagtccct tgtaagactt
25620agcttcttgg cttatcactt gtggtttagg cgaaataaat gtgtttttga
aggggtggtg 25680agagcgaatg agagtgttgt ggaatatgcc actaaagcta
ctgttgatca tggtttgtat 25740agtgcccgca tttatggtgg gtcgaaggct
accgcatcca aaagctcgaa ggtatgggtt 25800ccccctccag cttgtcgtac
gatctaaagg ttgatgcatc agtggggaat ggtggatggg 25860tggggctagg
agtaatcgcc tgaaactaga aaggggaggt gctcgtggct gcaactagga
25920gggtcagagc ttgtggcccg tggaaatggc tgaagggaag gctctttgtc
ttgctcttag 25980gcttgcctcg ctcatacaac ttgcaagaag tgatcgtgga
gtttgactgt caatcttggt 26040gaaccatctc tccaagggtg ctatttactt
tgcattttta agtcaaagct tgaaccttga 26100taaaaaaatc ccgttcgaca
tgaaaagtgc cttgattttg cgggtttggg agtgccttat 26160tgattctggg
gtttgatttg taacaccttt agtaaaaaca tgtaagctaa ctgtaaaacg
26220aacattaatc aaactaggat atgtaaaatt cctaaatcaa gaagaatttc
cacttgtgct 26280gaatttgtcc accttgcatg acacccaata aaagcccatg
tctcctagaa ccccttatgc 26340cgccttattc atcttttctc aagttgagtt
ggagtcctct atggtccact cgacttcttt 26400agcacactct cggtaaaaac
ttttaatatt attttatttt agactccacc atcttgacat 26460ttattccttc
ttaaacttgc ttcacacaaa catctaacac tagaattcta tatagaatag
26520cttgaatctc tcttaggata accttatagt aaatgcaact acgcctatcc
ttaaaccttt 26580ctaagaggag ctttatcgta tttacattcg cttcactttg
aaacgtcgct aagtgtatgt 26640tgcactttcc aaaccatgtg ttagctaaga
ccaagttata tgactgcata acctaaatag 26700tcttcctcga gaaaattcac
tagttggatc ggaagagttt gtgtaaatct atggcggcgc 26760gggactgggc
cttcacgatt tcaagtgttt taatgcagct cttctaggta aacaagtttc
26820ctaatagcgt ggtgactcaa atattgagga cttgttgtta tactaatgct
attcctggcg 26880gcacttaagg ggtgaaatag gtggtacaaa ggagtgcttg
atggcgtgtg ggtagtagtt 26940gaatatatca gtatgatcaa gtccatggat
ccctcgtact tattcgtgca agattatttt 27000tccacgaggc aaagcgagcg
agaatcttaa ggtttatgat catattcatc ttgtacgtgc 27060taagggtaat
gtccctttca ataatgagct atttctcctt ttgagcaaga gcgtatctta
27120agcattcctc gtagttctcg tctccccaac gatgttttat gttggaatct
gaatttggag 27180aaagacggag acttttcgtt cggtctatcg agccattctt
ttgagttgga tggcgagagc 27240gtgatttcat cgtcaatacg ctctaattta
tggagtataa tatggcagga tagtaccttt 27300caacgtgtta agcttttatc
tgcattgtcg acacaaaggg gattgagtaa gcctgtgccg 27360agtatggaac
cattgtgtaa tctttatgcg ttggaggatc aatggagcta cacttcttac
27420gagactttgt tattggaagg gcttatatgg gatccaacta gggtagtcaa
aacattggtc 27480ggggctgcgc tgcaattttg gggacttggc accggcgttt
ttggagtagc tccctcatgt 27540ggaacatagg cttttgatga cgatatacta
ggctagatgg aatataagag agaggtgttt 27600gtttgaggag gaggtttgtg
atccctatca aaccacatgc ataatctcat ggcgtagctt 27660catgtggaac
actgacatat tgcatgtgta gggttaggtg cttgcctagg acaccacatg
27720ggaagagctc cgttaagctt aatgtggatg gagggtgtgt ggaagggttg
ggtgcgtcca 27780ctggagtggt gattaggggg atggatgaaa aagcgttgta
gttgcaacat agaaaggtgg 27840aggactgcga ggaaccgtta aaaaggctat
attttatggt gttcatttgg ttgtggaagt 27900cgatttttga aatatggttg
ttggaagtga ctatcttcac ctcgttgaag caacttcttc 27960aaaagtggaa
ggcaaaaata gcttccatgt tattgttgat gacattgttc atggtagtgg
28020tatgttaaat acttcgtctt gtagttttgt tcgtagggat gggaataggg
tttctcacga 28080actcccccat ggaaatatca taatggctca tgctcatagg
taaatgaata agctaacaaa 28140attgttcatc tttgcagaac caactcatgc
atgaatcgat ttctcagctt cagcgaaagg 28200tagacagctc tagagagagc
atctagtctc aaaaatacca tgagattctg tagtgtcctg 28260acatgtttta
tatgacagga caaagcgtta aaggagcaca acaacttgct atccaagaag
28320gtataagttc agcaagattg tttagtaaca ttgttaatct tgctgattgc
tttgaaacat 28380gtcttgctat ggttaacaat gttgactgaa ccaaaatagg
tgaaggagag ggagaaggtg 28440ctggctcagc aggcagaatt ggatcagcaa
aatcatgaca ataactcatc tggctttgtg 28500atgtctcaag ctttgccctc
actgaataca gggtcagtcc tcaataacct ctaatcattt 28560ttccaagatc
caaagtaaac atggtttcat aatttaatta agattttttt gaaccatgtc
28620tccatacaac cttactagga ctaatactac taatttaaga ccccaacgat
aaacaacaat
28680aattagccat atctggctag caccttttgg acaacacacc acatgagact
cttggccaac 28740ttctttgatt tccttcagtc tgatagatat gaatatcttc
tgaagagctc tttggttcat 28800aattattgat ttagaaaaga attcagcaag
gtgagtcatt tggtaacctt aaggtcatta 28860tgggggtact aaatcaaagt
gaagatatat ttaggtggca tcagaagaga tgatatagat 28920aggttgtatc
ctgtcgatag gttatttgga tatgtatcaa aagtttcttt tataatatat
28980ctatactgat tggttgatgt atcaaatatc cctacagatt gtgaaaaaat
cccctacaga 29040ttgtgaaaat atccctagaa cctgtgatga tataagatgt
gctccgcatg ctttattgaa 29100cataatgtat tcaattcttg aaatgcagag
gaacaagcag cagtgcagtg gaagatgaag 29160caacacaacc accaaatcta
aacagcaact ctgcacaaat accgtcctgg atgcttcaac 29220acatccaaga gcagtaa
29237

SEQ ID NO: 41
LENGTH: 256
TYPE: PRT
ORGANISM: Arabidopsis thaliana
SEQUENCE: 41
Met Gly Arg Gly Arg Val Gln Leu Lys Arg Ile Glu Asn Lys Ile Asn
1               5                   10                  15
Arg Gln Val Thr Phe Ser Lys Arg Arg Ala Gly Leu Leu Lys Lys Ala
            20                  25                  30
His Glu Ile Ser Val Leu Cys Asp Ala Glu Val Ala Leu Val Val Phe
        35                  40                  45
Ser His Lys Gly Lys Leu Phe Glu Tyr Ser Thr Asp Ser Cys Met Glu
    50                  55                  60
Lys Ile Leu Glu Arg Tyr Glu Arg Tyr Ser Tyr Ala Glu Arg Gln Leu
65                  70                  75                  80
Ile Ala Pro Glu Ser Asp Val Asn Thr Asn Trp Ser Met Glu Tyr Asn
                85                  90                  95
Arg Leu Lys Ala Lys Ile Glu Leu Leu Glu Arg Asn Gln Arg His Tyr
            100                 105                 110
Leu Gly Glu Asp Leu Gln Ala Met Ser Pro Lys Glu Leu Gln Asn Leu
        115                 120                 125
Glu Gln Gln Leu Asp Thr Ala Leu Lys His Ile Arg Thr Arg Lys Asn
    130                 135                 140
Gln Leu Met Tyr Glu Ser Ile Asn Glu Leu Gln Lys Lys Glu Lys Ala
145                 150                 155                 160
Ile Gln Glu Gln Asn Ser Met Leu Ser Lys Gln Ile Lys Glu Arg Glu
                165                 170                 175
Lys Ile Leu Arg Ala Gln Gln Glu Gln Trp Asp Gln Gln Asn Gln Gly
            180                 185                 190
His Asn Met Pro Pro Pro Leu Pro Pro Gln Gln His Gln Ile Gln His
        195                 200                 205
Pro Tyr Met Leu Ser His Gln Pro Ser Pro Phe Leu Asn Met Gly Gly
    210                 215                 220
Leu Tyr Gln Glu Asp Asp Pro Met Ala Met Arg Arg Asn Asp Leu Glu
225                 230                 235                 240
Leu Thr Leu Glu Pro Val Tyr Asn Cys Asn Leu Gly Cys Phe Ala Ala
                245                 250                 255

SEQ ID NO: 42
LENGTH: 242
TYPE: PRT
ORGANISM: Arabidopsis thaliana
SEQUENCE: 42
Met Gly Arg Gly Arg Val Gln Leu Lys Arg Ile Glu Asn Lys Ile Asn
1               5                   10                  15
Arg Gln Val Thr Phe Ser Lys Arg Arg Ser Gly Leu Leu Lys Lys Ala
            20                  25                  30
His Glu Ile Ser Val Leu Cys Asp Ala Glu Val Ala Leu Ile Val Phe
        35                  40                  45
Ser Ser Lys Gly Lys Leu Phe Glu Tyr Ser Thr Asp Ser Cys Met Glu
    50                  55                  60
Arg Ile Leu Glu Arg Tyr Asp Arg Tyr Leu Tyr Ser Asp Lys Gln Leu
65                  70                  75                  80
Val Gly Arg Asp Val Ser Gln Ser Glu Asn Trp Val Leu Glu His Ala
                85                  90                  95
Lys Leu Lys Ala Arg Val Glu Val Leu Glu Lys Asn Lys Arg Asn Phe
            100                 105                 110
Met Gly Glu Asp Leu Asp Ser Leu Ser Leu Lys Glu Leu Gln Ser Leu
        115                 120                 125
Glu His Gln Leu Asp Ala Ala Ile Lys Ser Ile Arg Ser Arg Lys Asn
    130                 135                 140
Gln Ala Met Phe Glu Ser Ile Ser Ala Leu Gln Lys Lys Asp Lys Ala
145                 150                 155                 160
Leu Gln Asp His Asn Asn Ser Leu Leu Lys Lys Ile Lys Glu Arg Glu
                165                 170                 175
Lys Lys Thr Gly Gln Gln Glu Gly Gln Leu Val Gln Cys Ser Asn Ser
            180                 185                 190
Ser Ser Val Leu Leu Pro Gln Tyr Cys Val Thr Ser Ser Arg Asp Gly
        195                 200                 205
Phe Val Glu Arg Val Gly Gly Glu Asn Gly Gly Ala Ser Ser Leu Thr
    210                 215                 220
Glu Pro Asn Ser Leu Leu Pro Ala Trp Met Leu Arg Pro Thr Thr Thr
225                 230                 235                 240
Asn Glu
* * * * *
References