U.S. patent application number 11/791856 was filed with the patent office on 2009-08-20 for generation of replication competent viruses for therapeutic use.
This patent application is currently assigned to BAYER SCHERING PHARMA AKTIENGESELLSCHAFT. Invention is credited to Terry Hermiston, Fang Jin, Peter Kretschmer.
Application Number | 20090208924 11/791856 |
Document ID | / |
Family ID | 36261056 |
Filed Date | 2009-08-20 |
United States Patent
Application |
20090208924 |
Kind Code |
A1 |
Hermiston; Terry ; et
al. |
August 20, 2009 |
Generation of Replication Competent Viruses for Therapeutic Use
Abstract
The present invention relates to the generation or
replication-competent viruses having therapeutic utility. The
replication-competent viruses of the invention can express proteins
useful in the treatment of disease.
Inventors: |
Hermiston; Terry; (Corte
Madera, CA) ; Jin; Fang; (Danville, CA) ;
Kretschmer; Peter; (San Francisco, CA) |
Correspondence
Address: |
BANNER & WITCOFF, LTD.
1100 13th STREET, N.W., SUITE 1200
WASHINGTON
DC
20005-4051
US
|
Assignee: |
BAYER SCHERING PHARMA
AKTIENGESELLSCHAFT
Berlin
DE
|
Family ID: |
36261056 |
Appl. No.: |
11/791856 |
Filed: |
November 28, 2005 |
PCT Filed: |
November 28, 2005 |
PCT NO: |
PCT/US05/42924 |
371 Date: |
July 7, 2008 |
Related U.S. Patent Documents
|
|
|
|
|
|
Application
Number |
Filing Date |
Patent Number |
|
|
60632671 |
Dec 1, 2004 |
|
|
|
Current U.S.
Class: |
435/5 ;
435/235.1 |
Current CPC
Class: |
C12N 2710/10361
20130101; A61K 35/13 20130101; C07K 14/005 20130101; C12N 2800/90
20130101; C12N 2830/42 20130101; C12Q 1/701 20130101; C12N
2770/20022 20130101; C12N 2710/10321 20130101; C12N 2710/10343
20130101; C12N 7/00 20130101; C12Q 1/6897 20130101; A61K 48/0091
20130101; C12N 15/86 20130101; A61K 38/45 20130101 |
Class at
Publication: |
435/5 ;
435/235.1 |
International
Class: |
C12Q 1/70 20060101
C12Q001/70; C12N 7/00 20060101 C12N007/00 |
Claims
1. A method of identifying functional insertion sites within the
genome of a replication competent target virus, said method
comprising the steps of: (a) mixing the genomic DNA of said target
virus with a donor DNA comprising a transposon under conditions
that allow transposition, wherein said transposon comprises at
least one genetic element of interest and inserts into said viral
genome in a non-biased manner; and (b) isolating replication
competent viruses from said step (a) which express said genetic
element of interest.
2. The method of claim 1, wherein said isolation step comprises
assaying for expression of said genetic element of interest in
animal cells transfected with said replication competent
viruses.
3. The method of claim 1, wherein said target virus is an animal
virus.
4. The method of claim 3, wherein said animal virus is
oncolytic.
5. The method of claim 4, wherein said animal virus is adenovirus,
VSV, NDV, HSV, or vaccinia virus.
6. The method of claim 5, wherein said adenovirus is Ad5.
7. The method of claim 1, wherein said genetic element is a gene
which encodes a reporter molecule, a therapeutic protein or an RNA
molecule.
8. The method of claim 7, wherein said reporter molecule is GFP,
LacZ, renilla luciferase or firefly luciferase.
9. The method of claim 8, wherein said therapeutic protein is an
immunomodulatory protein, an antibody, a symporter or a pro-drug
converting enzyme.
10. The method of claim 1, wherein said transposon further
comprises an expression element, wherein said element is located
upstream from, and is operably linked to, said genetic element of
interest.
11. The method of claim 10, wherein said expression element is a
eukaryotic promoter.
12. The method of claim 11, wherein said eukaryotic promoter is a
CMV, SV40, thymidine kinase, ubiquitin or actin promoter.
13. The method of claim 12, wherein said expression element is a
eukaryotic splice acceptor sequence.
14. The method of claim 13, wherein said splice acceptor sequence
is SEQ ID NO: 1, SEQ ID NO: 2 or SEQ ID NO: 3.
15. The method of claim 1, wherein said transposon is a transposon
which employs an ATP-utilizing regulatory proteins.
16. The method of claim 15, wherein said transposon is a Tn-7 based
transposon.
17. The method of claim 1, wherein said transposon further
comprises a selectable gene.
18. The method of claim 17, wherein said selectable gene is an
antibiotic resistance gene, a methotrexate resistance gene, or a
G418 resistance gene.
19. The method of claim 1, wherein said donor DNA is a plasmid.
20. The method of claim 19, wherein said donor DNA further
comprises an origin of replication.
21. The method of claim 1, wherein said target virus genomic DNA is
present in a plasmid.
22. The method of claim 21, wherein said target virus genomic DNA
is flanked by restriction enzyme sites.
23. The method of claim 22, wherein said restriction enzyme sites
are identical.
24. A method of identifying functional insertion sites within the
genome of a replication competent target virus, said method
comprising the steps of: (a) mixing a plasmid comprising the
genomic DNA of said target virus with a plasmid donor DNA
comprising a transposon and an origin of replication under
conditions that allow transposition, wherein said transposon
comprises at least one genetic element of interest operably linked
to an expression element and a selectable gene and inserts into
said target viral genome in a non-biased manner; and (b) isolating
replication competent viruses from said step (a) which express said
genetic element of interest.
25. The method of claim 24, wherein said isolation step (b)
comprises assaying for expression of said genetic element of
interest in animal cells transfected with said replication
competent viruses
26. The method of claim 24, wherein said target virus genomic DNA
is an animal virus.
27. The method of claim 26, wherein said animal virus is
oncolytic.
28. The method of claim 27, wherein said animal virus is an
adenovirus, VSV, NDV, HSV, or vaccinia virus.
29. The method of claim 28, wherein said adenovirus is Ad5.
30. The method of claim 24, wherein said genetic element is a gene
which encodes a reporter molecule, a therapeutic protein or an RNA
molecule.
31. The method of claim 30, wherein said reporter gene encodes GFP,
LacZ, renilla luciferase or firefly luciferase.
32. The method of claim 31, wherein said therapeutic protein is an
immunomodulatory protein, an antibody, a symporter or a pro-drug
converting enzyme.
33. The method of claim 24, wherein said expression element is a
eukaryotic promoter or a eukaryotic splice acceptor sequence.
34. The method of claim 33, wherein said eukaryotic promoter is a
CMV, SV40, thymidine kinase, ubiquitin or actin promoter.
35. The method of claim 34, wherein said eukaryotic promoter is the
early SV40 promoter.
36. The method of claim 33, wherein said splice acceptor sequence
is SEQ ID NO: 1, SEQ ID NO: 2 or SEQ ID NO: 3.
37. The method of claim 24, wherein said transposon is transposon
which employs an ATP-utilizing regulatory protein.
38. The method of claim 37, wherein said transposon is a Tn-7-based
transposon.
39. The method of claim 24, wherein said selectable gene is an
antibiotic resistance gene or a drug resistance gene
40. The method of claim 24, wherein said target virus genomic DNA
is flanked by restriction enzyme sites.
41. The method of claim 40, wherein said restriction enzyme sites
are identical.
42. A method of identifying functional insertion sites within the
genome of a replication competent Ad5 virus, said method comprising
the steps of: (a) mixing a plasmid comprising the genomic DNA of
said Ad5 virus with a plasmid donor DNA comprising a Tn-7 based
transposon and an R6K origin of replication under conditions that
allow transposition, wherein said transposon comprises a gene
encoding GFP operably linked to an expression element and a
chloramphenicol resistance gene and wherein said transposon inserts
into said target Ad5 genome in a non-biased manner; and (b)
isolating replication competent Ad5 viruses which express GFP from
said step (a).
43. The method of claim 42, wherein said isolation step (b)
comprises the steps of: (a') transforming bacterial cells with the
mixture of said step (a); (b') isolating replication competent Ad5
viruses from said transformed cells; (c') transfecting eukaryotic
cells with said Ad5 viruses; and (d') identifying plaques derived
from plating of said cells from said step (c').
44. The method of claim 42, wherein said expression element is a
eukaryotic promoter or a eukaryotic splice acceptor sequence.
45. The method of claim 44, wherein said eukaryotic promoter is a
CMV, SV40, thymidine kinase, ubiquitin or actin promoter.
46. The method of claim 45, wherein said eukaryotic promoter is an
early SV40 promoter.
47. The method of claim 44, wherein said splice acceptor sequence
is SEQ ID NO: 1, SEQ ID NO: 2 or SEQ ID NO: 3.
48. A replication competent virus containing a functional insertion
site identified by the method of claim 1, claim 24, or claim
42.
49. A method of generating a replication competent virus comprising
at least one genetic element of interest inserted into a functional
insertion site within the genome of said virus, said method
comprising the steps of: (a) mixing the genomic DNA of a target
replication competent virus with a donor DNA comprising a
transposon, under conditions that allow transposition, wherein said
transposon comprises a genetic element of interest and inserts into
said viral genome in a non-biased manner; and (b) identifying
product replication competent viruses formed in said step (a) which
express said genetic element.
50. The method of claim 49, wherein said target virus is an animal
virus.
51. The method of claim 50, wherein said animal virus is
oncolytic.
52. The method of claim 51, wherein said animal virus is
adenovirus, VSV, NDV, HSV, or vaccinia virus.
53. The method of claim 52, wherein said adenovirus is Ad5.
54. The method of claim 49, wherein said genetic element is a gene
which encodes a reporter molecule, a therapeutic protein or an RNA
molecule.
55. The method of claim 54, wherein said reporter molecule is GFP,
LacZ, renilla luciferase or firefly luciferase.
56. The method of claim 54, wherein said therapeutic protein is an
immunomodulatory protein, an antibody, a symporter or a pro-drug
converting enzyme.
57. The method of claim 49, wherein said transposon further
comprises an expression element, wherein said element is located
upstream from, and is operably linked to, said genetic element of
interest.
58. The method of claim 57, wherein said expression element is a
eukaryotic promoter.
59. The method of claim 58, wherein said eukaryotic promoter is a
CMV, SV40, thymidine kinase, ubiquitin or actin promoter.
60. The method of claim 57, wherein said expression element is a
eukaryotic splice acceptor sequence.
61. The method of claim 60, wherein said splice acceptor sequence
is SEQ ID NO: 1, SEQ ID NO: 2 or SEQ ID NO: 3.
62. The method of claim 49, wherein said transposon is a transposon
which employs an ATP-utilizing regulatory proteins.
63. The method of claim 62, wherein said transposon is a Tn-7 based
transposon.
64. The method of claim 49, wherein said transposon further
comprises a selectable gene.
65. The method of claim 64, wherein said selectable gene is an
antibiotic resistance gene, a methotrexate resistance gene, or a
G418 resistance gene.
66. The method of claim 49, wherein said donor DNA is a
plasmid.
67. The method of claim 66, wherein said donor DNA further
comprises an origin of replication.
68. The method of claim 49, wherein said target virus genomic DNA
is present in a plasmid.
69. The method of claim 68, wherein said target virus genomic DNA
is flanked by restriction enzyme sites.
70. The method of claim 69, wherein said restriction enzyme sites
are identical.
Description
[0001] This application claims the benefit of U.S. Provisional
Application Ser. No. 60/632,671, filed on Dec. 1, 2004, which is
incorporated herein by reference.
FIELD OF THE INVENTION
[0002] The present invention relates to the generation of
replication competent viruses having therapeutic applications.
BACKGROUND OF THE INVENTION
[0003] The goal of gene therapy is to deliver genetic material of
therapeutic value to a target tissue in a safe and efficient
manner. Safety is often related to how much damage is done to the
normal tissues of the patient during treatment. Efficiency can be
looked at as a ratio of desired result, e.g. reduction of tumor
load, to acceptable dosage level, where the parameters which
contribute to making a dosage level "acceptable" can include issues
of injection volume, frequency, etc. Therefore, any improvements
that result in an increase in the selectivity and efficiency of
gene therapy are clearly desirable.
[0004] Viral vectors derived from adenoviruses, have been the most
studied delivery agents for this type of therapy (Jolly, D. (1994)
Cancer Gene Therapy 1:51-64). Replication-defective vectors are
limited in their usefulness due to their ability to only kill tumor
cells that have been directly infected with virus. Viral vectors
developed from oncolytic (i.e. replication-competent) viruses are
more attractive choices for cancer therapeutics because they not
only selectively target tumor cells, but they can, through
replication within the infected tumor cells, amplify and spread the
input dose of infective virus throughout the tumor cell mass.
[0005] The potential use of vectors derived from oncolytic viruses,
such as adenovirus, in gene therapy can be further increased by
"arming" the viruses with therapeutic transgenes, i.e. engineering
them to contain therapeutic proteins or other molecules whose in
vivo expression can impact tumor survival (Hermiston, T. (2000) J.
Clin. Inv. 105:1169-1172). The combination of viral replication
within tumor cells and the activity of the therapeutic molecule
expressed within the cells can provide a synergistic assault on a
tumor.
[0006] The incorporation of therapeutic transgenes into an
oncolytic virus is a complex process. The insertion event needs to
occur in a site that maintains the replication competence of the
viral agent, which is complicated as viruses maximize their coding
capacity by generating highly complex transcription units
controlled by multiple promoters and alternative splicing
(Akusjarui and Stevenin (2003 Curr. Top. Microbiol. Immunol.
272:253-286). Consequently, the choice of insertion sites for
therapeutic genes has been limited primarily to regions known to be
non-essential for viral DNA replication in vitro (Hawkins et al.
(2001) Gene Ther. 8:1123-1131; Kurihara et al. (2002) J. Clin.
Invest 106:763-771) or by the replacement of a deleted region of
the viral genome to create the oncolytic virus (Freytag et al.
(1998) Human Gene Ther. 9:1323-1333; Lee et al. (2001) Cancer Gene
Ther. 8:397-404). While these approaches allow for therapeutic gene
insertion and expression, they are dependent upon a high level of
understanding of the viral biology (i.e. sites non-essential for
viral replication), a known viral genome sequence (for use in
genetic engineering or utilization of endogenous restriction enzyme
sites) and the presence of molecular biology systems for genomic
manipulations that may not be currently available for non-Ad5-based
systems.
[0007] In view of the above, there is a need for a method for
generating replication competent viruses which contain genetic
elements, e.g., a gene which encodes a therapeutic protein or RNA,
positioned within the viral genome such that expression of the
genetic element occurs. Of particular utility would be a method
that identifies functional insertion sites within replication
competent viruses whose genomic structure has not yet been
elucidated.
SUMMARY OF THE INVENTION
[0008] The present invention provides a novel method for
identifying functional insertion sites within the genome of a
replication competent virus and for generating replication
competent viruses which comprise a genetic element of interest or
an expression cassette inserted in one of the identified insertion
sites.
[0009] In particular, the method of the present invention comprises
the steps of [0010] (a) mixing the genomic DNA of a replication
competent target virus with a donor DNA comprising a transposon
under conditions that allow transposition, wherein said transposon
comprises at least one genetic element of interest and inserts into
said target viral genome in a non-biased manner; and [0011] (b)
isolating replication competent product viruses from said step (a)
which express said genetic element of interest.
[0012] In one embodiment of the present invention, the replication
competent target virus is an animal virus, preferably an oncolytic
animal virus. In a preferred embodiment, the animal virus is an
adenovirus, VSV, NDV, HSV or vaccinia virus. Particularly preferred
are adenoviruses belonging to groups B and C. A particularly
preferred Group C adenovirus is Ad5.
[0013] In one embodiment of the present invention, the genomic DNA
of the replication competent target virus is present within a
plasmid. In a preferred embodiment, the genomic DNA of the
replication competent target virus within the plasmid is flanked by
restriction enzyme sites. Particularly preferred is an embodiment
in which the restriction enzyme sites flanking the genomic DNA of
the replication competent target virus within the plasmid are
identical.
[0014] In one embodiment of the invention, the donor DNA comprising
the transposon is a plasmid and further comprises an origin of
replication. A preferred origin of replication is the R6K origin of
replication.
[0015] In one embodiment of the present invention, the transposon
comprises a genetic element of interest that is a gene that encodes
a reporter protein, where the reporter protein includes, but is not
limited to, green fluorescing protein (GFP), LacZ, renilla
luciferase or firefly luciferase. Particularly preferred is a gene
encoding GFP.
[0016] In another embodiment of the present invention, the genetic
element of interest is a gene that encodes a therapeutic protein,
where the therapeutic protein can be, but is not limited to, an
immunomodulatory protein, an antibody, a symporter, a pro-drug
converting enzyme, a fusogenic glycoprotein, or fragments thereof.
Preferred immunomodulatory proteins include molecules identified as
cytokines or chemokines.
[0017] In another embodiment of the present invention, the genetic
element of interest encodes a therapeutically useful RNA molecule
including, but not limited to, shRNA or antisense RNA.
[0018] In one embodiment of the present invention, the transposon
further comprises an expression element located upstream from, and
operably linked to, the genetic element of interest, such that the
expression element and the genetic element of interest comprise an
"expression cassette". In a preferred embodiment, the expression
element is a eukaryotic promoter. Preferred promoters include, but
are not limited to, traditional Pol II promoters (e.g. E2F or
hTERT), Pol III promoters (e.g. U6 or H1) or viral promoters (e.g.
CMV, SV40, HSV TK or Ad MLP). A particularly preferred promoter is
the early SV40 promoter.
[0019] In another embodiment of the present invention, the
expression element is a eukaryotic splice acceptor sequence.
Particularly preferred splice acceptor sequences include SEQ ID NO:
1, SEQ ID NO: 2 or SEQ ID NO: 3. Particularly preferred is the
splice acceptor sequence of SEQ ID NO: 1.
[0020] In another embodiment of the present invention, the
transposon further comprises restriction enzyme sites unique to the
transposon, i.e. not present in the genomic DNA of the replication
competent target virus, which are located close to the ends of the
transposon. In a preferred embodiment, the restriction enzyme sites
are identical. A particularly preferred restriction enzyme site is
a Pmel site.
[0021] In one embodiment of the present invention, the transposon
is one that employs an ATP-dependent utilizing regulatory protein
in the transposition process. Particularly preferred is a Tn-7
based transposon.
[0022] In another embodiment of the present invention, the
transposon further comprises a selectable/identifiable gene, such
as a gene that confers drug resistance or one that provides a
mechanism for visual identification of cells. Preferred genes that
confer drug resistance include genes that confer antibiotic
resistance. Particularly preferred is a gene that confers
chloramphenicol resistance. Among the genes that allow for visual
identification of cells are those encoding GFP, renilla luciferase,
or firefly luciferase.
[0023] The present invention also provides for replication
competent viruses made by the methods described herein and for the
use of these viruses as vectors for the delivery of therapeutic
molecules during gene therapy.
BRIEF DESCRIPTION OF THE FIGURES
[0024] The foregoing and other objects of the present invention,
the various features thereof, as well as the invention itself may
be more fully understood from the following description, when read
together with the accompanying drawings.
[0025] FIG. 1. Schematic of transposon-based method to identify
novel insertion sites within the genome of a replication competent
virus. A donor plasmid containing both an R6K origin of replication
and a transposon is mixed with a plasmid (pCJ51) containing the Ad5
genome (with the E3 region deleted) flanked by two Pacl restriction
enzyme sites. The transposon contains a reporter gene, GFP,
operably linked to an early SV40 promoter, i.e. an SV40/GFP
expression cassette, and also contains a chloramphenicol resistance
gene (CM.sup.R). Following in vitro transposition (see Example 3),
E. coli DH10B cells are transformed with the transposition mixture,
and the transformed cells plated on chloramphenicol-containing
plates. The donor plasmid, because of its R6K origin of
replication, cannot grow in DH10B cells. Plating on
chloramphenicol-containing media selects for those cells containing
Ad5 plasmids into which a transposon has been inserted.
Chloramphenicol resistant colonies are pooled and plasmid DNA
extracted. The extracted DNA is digested with Pacl to release
linearized Ad5 genomic DNAs containing transposons inserted into
various locations within the Ad5 genome. The isolated linearized
Ad5 genomic DNA is transfected into a cell line of choice and
individual green plaques, each containing a replication competent
product virus, are picked for further characterization (e.g. insert
site location, potency, etc.)
[0026] FIG. 2. Distribution of transposon insertions within the Ad5
genome Plasmid DNA containing inserted transposons, isolated from
60 chloramphenicol-resistant colonies, was digested with Bgl II and
the resulting DNA fragments were analysed on 1% agarose gels. The
Bgl II restriction fragments of the Ad5 containing plasmid, pCJ51,
range in size from 8.6 kb (which includes the entire plasmid
portion) to 0.27 kb. The transposon contains a Bgl II restriction
site within 32 bp of the right end of the transposon. Therefore,
insertion into any Bgl II fragment will result in loss of that
fragment and the appearance of two new fragments, the sum of which
will total the fragment plus 3.1 kb (the size of the transposon).
The fragment sizes will vary depending upon the site of insertion
within the fragment, and on the orientation of the inserted
transposon. The percentage of insertions within each fragment is
shown on the y-axis, while the continuous line indicates the
percentage predicted if insertion is non-biased (see Example
4).
[0027] FIG. 3. Insertion sites of SV40/GFP containing transposons
within the Ad 5 genome The outlined area illustrates the
transcription and translation map of Ad5. The early mRNAs are
designated E. Late mRNAs are designated L. Black ovals circle the
designated "E1" and "L5 to E4" regions in the translation map of
Ad5 and are enlarged in the map below, which indicates the site of
insertion (vertical arrow) of a transposon comprising an SV40/GFP
expression cassette within the Ad5 genome in each of a number of
product viruses (see Example 4). Horizontal arrows above each
product virus name indicate the transcriptional direction of
SV40/GFP expression in that virus. The open dotted line represents
the remaining Ad5 genome between E1B and L5.
[0028] FIG. 4. Potency of Ad5-viral isolates containing
transposons. MTT assays were performed in HT29 cells comparing the
potency of the original Ad5 virus (AdCJ51) and a number of
transposon-containing viruses, as described below. The
transposon-containing viruses, Ad5/PL11 and Ad5/PL29, contain the
SV40/GFP expression cassette inserted within the Ad5 genome, with
GFP expression oriented leftwards or rightwards in the Ad5 genome,
respectively (see arrows in FIG. 3, Example 4). In other viruses
tested, the original expression cassette (SV40/GFP) present in the
Ad5/PL11 and Ad5/PL29 viruses was replaced by a CMV/Luc, TK/RL or
CMV/LacZ expression cassette as described in Examples 5 to 7.
[0029] Panel A: the original virus, AdCJ51 (.largecircle.); and two
product viruses, Ad5/PL11/SV40/GFP (.quadrature.) and
Ad5/PL29/SV40/GFP(.DELTA.), in which the SV40/GFP expression
cassette had been determined to have been inserted in opposite
orientations within the viral genome (see FIG. 3);
[0030] Panel B: Ad5/PL11/SV40/GFP (.quadrature.); and
Ad5/PL11/TK/RL (.largecircle.); Ad5/PL11/CMV/Luc (.diamond-solid.)
and Ad5/PL11/CMV/LacZ (.diamond-solid.), where the expression
cassette replacements have the same orientation within the Ad 5
genome as the original SV40/GFP expression cassette; and
[0031] Panel C: Ad5/PL29/SV40/GFP (.DELTA. .fwdarw.);
Ad5/PL29/CMV/Luc (.diamond. .fwdarw., .rarw.)); Ad5/PL29/TK/RL
(.box-solid..fwdarw., .tangle-solidup. .rarw.)) and
Ad5/PL29/CMV/LacZ (.quadrature. .fwdarw.), where the expression
cassette replacements are oriented in both directions, as indicated
by the arrows.
[0032] FIG. 5. Insertion Sites of Splice Acceptor/GFP transposons
within the Ad5 genome The outlined area illustrates the
transcription and translation map of Ad5. The early mRNAs are
designated E. Late mRNAs are designated L. Black ovals circle the
designated "E1" and "L5 to E4" regions of the
transcription/translation map of Ad5 and are enlarged in the map
below, which indicates the site of insertion (vertical arrow) of
these transposons. Horizontal arrows above the product virus name
indicate the transcriptional direction of the GFP expression in
that viral construct.
[0033] FIG. 6: Cytotoxicity of ColoAd1/PL30/TK/TK adenovirus with
and without ganciclovir (GCV). MTS assays to measure potency were
performed using the ColoAd1/PL30/TK/TK virus, which was derived
from the ColoAd1 virus (SEQ ID NO: 22) as described in Example 9
and contains an expression cassette comprising a thymidine kinase
gene, operably linked to a thymidine kinase promoter. Potency of
this viral construct was examined in both a normal cell line
(HUVEC; FIG. 6A) and in a lung cancer cell line (A549; FIG. 6B).
Cytotoxicity in each cell line was determined with and without the
addition of 20 uM ganciclovir, 24 hrs post infection (see Example
9).
DETAILED DESCRIPTION OF THE INVENTION
[0034] All publications, including patents and patent applications,
mentioned in this specification are herein incorporated by
reference to the same extent as if each individual publication was
specifically and individually indicated to be incorporated by
reference in its entirety.
[0035] Technical and scientific terms used herein have the meanings
commonly understood by one of ordinary skill in the art to which
the present invention pertains, unless otherwise defined. Reference
is made herein to various methodologies known to those of ordinary
skill in the art. Publications and other materials setting forth
such known methodologies to which reference is made are
incorporated herein by reference in their entireties as though set
forth in full. Standard reference works setting forth the general
principles of recombinant DNA technology include Sambrook, J., et
al. (1989) Molecular Cloning, A Laboratory Manual, 2d Ed., Cold
Spring Harbor Laboratory Press, Planview, N.Y.; McPherson, M. J.,
Ed. (1991) Directed Mutagenesis: A Practical Approach, IRL Press,
Oxford; Jones, J. (1992) Amino Acid and Peptide Synthesis, Oxford
Science Publications, Oxford; Austen, B. M. and Westwood, O. M. R.
(1991) Protein Targeting and Secretion, IRL Press, Oxford. Any
suitable materials and/or methods known to those of ordinary skill
in the art can be utilized in carrying out the present invention;
however, preferred materials and/or methods are described.
Materials, reagents and the like to which reference is made in the
following description and examples are obtainable from commercial
sources, unless otherwise noted.
Definitions
[0036] Unless defined otherwise, all technical and scientific terms
used herein have the same meaning as commonly understood by one of
ordinary skill in the art to which this invention belongs.
Generally, the nomenclature used herein and the laboratory
procedures described below are those well known and commonly
employed in the art.
[0037] As used herein, the term "replication competent virus"
refers to a virus which contains all the information within it's
genome to allow it to replicate within a cell: i.e. it either
produces the protein or induces the necessary host proteins.
[0038] As used herein, the term "target virus" refers to a
replication competent virus capable of undergoing transposition, as
contemplated by the invention. While the target virus will usually
be a virus which has not previously undergone transposition, it is
possible to use as a target virus a replication competent virus
which has already undergone a round of transposition.
[0039] As used herein, the term "product virus" refers to the
replication competent virus produced when a "target virus" has
undergone transposition by the method of the invention.
[0040] As used herein, a "functional insertion site" within a
replication competent virus refers to a site within that virus
where, when a genetic element, operably linked to an expression
element (i.e. a eukaryotic promoter or eukaryotic splice acceptor
sequence) is inserted into that site, expression can occur.
[0041] As used herein, the term "genetic element of interest"
refers to a nucleic acid sequence that is introduced into a
transposon and that encodes a protein or RNA molecule whose
expression is useful in the present invention.
[0042] A genetic element of interest includes genes that encode
reporter proteins, i.e. proteins that are easily assayed or
identified, e.g. green fluorescent protein (GFP), renilla
luciferase, or firefly luciferase. A genetic element of interest,
as used herein, also encompasses any gene encoding a protein,
protein fragment or peptide (or modifications thereof) for which a
therapeutic purpose can be envisioned. Examples include, but are
not limited to, genes encoding immunomodulatory proteins,
antibodies, symporter, fusogenic glycoproteins or pro-drug
converting enzymes.
[0043] The term "genetic element of interest" also includes genes
encoding RNA molecules, such as antisense RNA or shRNA, whose in
vivo expression can result in modulation of cellular properties
(e.g. cell growth, chemotherapeutic sensitization) by altering the
expression levels of a targeted cellular protein (Karkare et al.
Appl. Biochem. Biotechnol. (2004) 119:1-12).
[0044] As used herein, "useful" means that a protein or RNA
expressed from a product virus would be useful in the evaluation or
identification of the replication competent product viruses of the
invention (e.g. the product of a reporter gene) or useful in
therapeutic applications (e.g. the product of a gene encoding a
therapeutic protein or an RNA).
[0045] As used herein, a "therapeutic protein" refers to a protein,
protein fragment, or peptide that would be expected to provide some
therapeutic benefit to an organism when expressed in vivo.
[0046] As used herein, the term "expression element" refers to a
nucleic acid sequence located upstream from, and operably linked
to, a genetic element of interest, and that facilitates expression
of the genetic element of interest within the product virus. The
expression element can be a promoter, e.g. a traditional POL II
promoter; e.g. E2F or hTERT (Wu et al. Trends in Mol. Med. (2003)
9:421-429, a POL III promoter (e.g. U6) or a viral promoter (e.g.
SV40, CMV, TK or MLP). Alternatively, an expression element can be
a eukaryotic splice acceptor sequence, e.g. SEQ ID NO: 1, SEQ ID
NO: 2, or SEQ ID NO: 3.
[0047] As used herein, the term "expression cassette" refers to a
nucleic acid sequence that contains both a genetic element of
interest and an expression element upstream from, and operably
linked to, the genetic element, so that when the expression
cassette is inserted in the proper orientation within the viral
genome, the protein or RNA encoded by the genetic element is
expressed. As used herein, an expression cassette comprising an
SV40 promoter and a GFP reporter gene is represented as
"SV40/GFP".
[0048] As used herein, the term "genomic DNA of the replication
competent target virus" refers to the viral genomic DNA into which
a transposon is inserted by the method of the invention.
[0049] As used herein, the term "product virus" refers to a genomic
DNA of the replication competent target virus of the invention
which has had a transposon inserted into it by the method of the
invention.
[0050] As used herein, the term "non-biased" refers to the
insertion of a transposon within a viral genome where the choice of
insertion site is not substantially influenced by or dependent on
particular sequences within the viral genome and is, therefore,
considered random.
[0051] As used herein, the term "optimally expresses" or "optimal
expression" refers to the characterization of a product virus
comprising a genetic element of interest, when compared to the
other isolated product viruses comprising that genetic element. The
most straightforward comparative measurement of expression of a
genetic element from a product virus would be measurement of the
level of expression of the gene product encoded by that genetic
element, i.e. that product virus demonstrating the highest level of
expression of the protein encoded by the genetic element of
interest would be ranked as that showing "optimal expression".
However, also contemplated within the scope of the invention is
measurement of "optimal expression" of a gene product by a product
virus under varying conditions such as level of tissue-specific
gene expression, expression in certain microenvironments (e.g.
hypoxic, glucose deprived) or other measurements which would be
indicative of relative in vitro or in vivo potency, etc.
[0052] As used herein, the term "adenovirus", refers to any of the
50+ human adenoviral serotypes currently known, or isolated in the
future. See, for example, Strauss, "Adenovirus infections in
humans," in The Adenoviruses, Ginsberg, ed., Plenum Press, New
York, N.Y., pp. 451-596 (1984). These serotypes are classified in
the subgroups A-F (see, Shenk, "Adenoviridae: The Viruses and Their
Replication," in Fields Virology, Vol. 2, Fourth Edition, Knipe,
ea., Lippincott Williams & Wilkins, pp. 2265-2267 (2001)
[0053] As used herein, the term "potency" refers to the lytic
potential of a virus and represents its ability to replicate, lyse,
and spread. For the purposes of the instant invention, potency is a
value which compares the cytolytic activity of a given product
virus of the invention to that of Ad5 in the same cell line, i.e.
potency=IC.sub.50 of X/IC.sub.50 of Ad5, where X is the particular
product virus being examined and wherein the potency of Ad5 is
given a value of 1.
[0054] As used herein, the term "oncolytic virus" refers to a virus
that preferentially kills cancer cells as compared with normal
cells.
[0055] As used herein, the term "therapeutic index" or "therapeutic
window" refers to a number indicating the oncolytic potential of a
given virus and is determined by dividing the potency of the virus
in a cancer cell line by the potency of the same virus in a normal
(i.e. non-cancerous) cell line.
[0056] As used herein, the term "modified" refers to a molecule
with a nucleotide or amino acid sequence differing from a
naturally-occurring, e.g. a wild-type nucleotide or amino acid
sequence. A modified molecule retains the function or activity of a
wild-type molecule; i.e. a modified product virus may retain its
oncolytic activity or therapeutic benefit. Modifications include
mutations to nucleic acids as described below.
[0057] As used herein, "mutation" with reference to a
polynucleotide or polypeptide, refers to a naturally-occurring,
synthetic, recombinant, or chemical change or difference to the
primary, secondary, or tertiary structure of a polynucleotide or
polypeptide, as compared to a reference polynucleotide or
polypeptide, respectively (e.g., as compared to a wild-type
polynucleotide or polypeptide). Mutations include such changes as,
for example, deletions, insertions, or substitutions.
Polynucleotides and polypeptides having such mutations can be
isolated or generated using methods well known in the art.
[0058] As used herein, the term "therapeutically effective dose" or
"effective amount" refers to that amount of product virus that
ameliorates the symptoms or conditions of a disease. A dose is
considered a therapeutically effective dose in the treatment of
cancer or its metastasis when tumor or metastatic growth is slowed
or stopped, or the tumor or metastasis is found to shrink in size,
so as to lead to an extension in life-span for the subject. For
other disease states, appropriate endpoints for defining a
therapeutically effective dose would be within the knowledge of one
skilled in the art.
Generation of Viruses of the Invention
[0059] The present invention provides a novel method for
identifying functional insertion sites within the genome of a
replication competent target virus and for generating replication
competent viruses ("product viruses") which comprise a genetic
element of interest or an expression cassette inserted in one of
the identified insertion sites. The product viruses of the
invention can express the inserted genetic element following
infection of cells in vitro and in vivo.
[0060] The method for identifying functional insertion sites within
the replication competent viruses of the invention comprises mixing
the genomic DNA of a replication competent target virus with a
donor DNA under conditions that allow formation of replication
competent product viruses comprising said transposon, where the
donor DNA comprises a transposon capable of inserting into the
genome of the target virus in a non-biased manner, and where the
transposon comprises a genetic element of interest or an expression
cassette.
[0061] The replication competent product viruses can be evaluated
to determine their utility, using assays that directly measure
protein expression or that measure an activity of the inserted
genetic element. A comparision of such values within a group of
isolated replication competent viruses, containing insertions at
different positions within the viral genome, allows selection of
the particular product virus best suited for a given therapeutic
situation.
[0062] Genomic DNA of the Replication Competent Target Virus.
[0063] The genomic DNA of the replication competent target virus of
the present invention can be that of any desired virus, preferably
an animal virus (i.e. a virus that can replicate in an animal or an
animal cell) and is most preferably an oncolytic animal virus
including, but not limited to, adenovirus, VSV, HSV or vaccinia
virus. An important advantage of the present invention is that it
allows one to use the genomic DNA of a virus for which specific
prior knowledge regarding viral genomic organization or sequence is
not available. A preferred genomic viral DNA for use in the present
invention is that of an adenovirus, and particularly preferred is
an adenovirus from Groups B or C. A particularly preferred
adenovirus within Group C is Ad5.
[0064] Wild-type viruses, as well as viral derivatives containing
deletions of non-essential polynucleotides within the viral genome,
e.g. the E3 region in adenovirus (Berkner and Sharp (1983) Nucleic
Acid Res. 11:6003-6020) are contemplated to be useful in the
present invention. Use of replication competent viruses which have
a reduced viral genome, is preferred, as the total amount of DNA
which can be packaged into a virus is limited by size constraints
(for example, see Bett et al. (1993) J. Virol. 10:5911-5921).
Therefore, use of a smaller starting viral genome (either naturally
occurring or artificially made) allows insertion of transposons
that contain larger genetic elements of interest or expression
cassettes and also allows for the possible insertion of more than
one transposon within the genomic DNA of the replication competent
target virus. Also useful are viral derivatives which have been
identified as having an increased potency toward particular cell
types, such as ColoAd1 (SEQ ID NO: 22), which has been shown to
have increased potency toward colon tumor cells.
[0065] Genomic DNA of the replication competent target viruses
useful in the present invention can be in a linear form or it can
be present in a plasmid and be linearized when desired, i.e. to
allow more efficient transfection to occur (Berkner and Sharp
(1983) Nucleic Acid Res. 11:6003-6020). In a preferred embodiment,
the genomic DNA of the replication competent target virus is
present in a plasmid, which facilitates initial selection and
amplification of product viruses. Where the genomic DNA of the
replication competent target virus is present in a plasmid, the
viral DNA should be easily excisable from the plasmid. In one
embodiment, restriction enzyme sites flank the viral DNA within the
plasmid. In a preferred embodiment, the restriction enzyme sites
are identical. In a particularly preferred embodiment, the genomic
DNA of the replication competent target virus is flanked by Pacl
sites (see FIG. 1). Pacl, however, is only one choice of
restriction enzyme site and determination of other useful
restriction enzyme sites is well within the knowledge of one
skilled in the art.
[0066] Donor DNA
[0067] The donor DNA of the present invention, which comprises a
transposon, can be in a linear form, such as a phage, or can be a
plasmid.
[0068] A transposon useful in the present invention has the ability
to insert itself into a viral genome in a non-biased manner, i.e.
the choice of insertion site is not influenced by or dependent on
particular sequences within the viral genome. Transposons that use
ATP-utilizing regulatory proteins are contemplated for use in the
invention, provided that appropriate mutations in the ATP-utilizing
protein have been made such that insertion is non-biased (see U.S.
Ser. No. 10/024,809). Examples of such transposons are Tn5090/Tn420
and Tn7. Particularly preferred is a Tn7-based transposon (Biery et
al. (2000) Nucleic Acid Res. 28:1067-1077).
[0069] A transposon useful in the present invention comprises a
genetic element of interest.
[0070] In one embodiment, the genetic element of interest is a
gene, often referred to as a "reporter gene", that encodes a
detectable gene product, i.e. a product whose expression can be
easily assayed or identified. For instance, the reporter gene may
encode a gene product that, by enzymatic activity, gives rise to a
detection signal based on color, fluorescence, or luminescence.
Many reporter genes are known to those of skill in the art and
others may be identified or synthesized by methods known to those
of skill in the art. Useful reporter genes include, but are not
limited to, green fluorescing protein (GFP), LacZ, renilla
luciferase or firefly luciferase. A particularly preferred reporter
gene is GFP. Assays for measuring these proteins are well known in
the art.
[0071] In another embodiment of the present invention, the genetic
element of interest is a gene that encodes a therapeutic protein. A
therapeutic protein, as used herein, refers to a protein, protein
fragment or peptide, or a modification thereof, that would be
expected to provide some therapeutic benefit to an organism when
expressed in vivo. Therapeutic proteins contemplated in the present
invention include, but are not limited to, proteins which are
immunomodulatory proteins, antibodies, symporters, fusogenic
glycoproteins or pro-drug converting enzymes, and encompass any
protein, protein fragment or polypeptide whose expression within a
cell would be expected, by one of skill in the art, to provide a
therapeutic benefit (e.g. a cytotoxic effect, growth inhibitory
effect etc.).
[0072] In one embodiment, the therapeutic protein of the present
invention can be a pro-drug activator, such as cytosine deaminase
(see, U.S. Pat. Nos. 5,631,236; 5,358,866; and 5,677,178). In
another embodiment, the therapeutic protein can be a known inducer
of cell-death, e.g. apoptin or adenoviral death protein, or a
fusion protein, e.g. fusogenic membrane glycoprotein (Danen-Van
Oorschot et al. (1997) Proc. Nat. Acad. Sci. 94:5843-5847;
Tollefson et al. (1996) J. Virol. 70:2296-2306; Fu et al. (2003)
Mol. Therapy 7: 48-754, 2003; Ahmed et al. (2003) Gene Therapy
10:1663-1671; Galanis et al. (2001) Human Gene Therapy 12(7):
811-821).
[0073] Alternatively, a therapeutic protein of the invention can be
a member of a symporter family (e.g. sodium/iodide symporter, NIS)
that would enable a therapeutic molecule to be more effectively
targeted to the tumor cell.
[0074] Other therapeutic proteins, or fragments thereof, useful in
the present invention, include those that encode immunomodulatory
proteins, such as cytokines or chemokines. Examples include
interleukin 2, U.S. Pat. Nos. 4,738,927 or 5,641,665; interleukin
7, U.S. Pat. Nos. 4,965,195 or 5,328,988; and interleukin 12, U.S.
Pat. No. 5,457,038; tumor necrosis factor alpha, U.S. Pat. Nos.
4,677,063 or 5,773,582; interferon gamma, U.S. Pat. Nos. 4,727,138
or 4,762,791; or GM CSF, U.S. Pat. Nos. 5,393,870 or 5,391,485,
Mackensen et al. (1997) Cytokine Growth Factor Rev. 8:119-128).
Additional immunomodulatory proteins include macrophage
inflammatory proteins, including MCP-3. Monocyte chemotactic
protein (MIP-3 alpha) may also be used.
[0075] In another embodiment, the genetic element of interest
encodes a protein whose expression is known to enhance the ability
of an oncolytic virus to eradicate the tumor, although not having
any direct impact on the tumor itself. These genes include encoding
proteins that (1) compromise MHC class I presentation (Hewitt et
al. (2003) Immunology 110: 163-169), (2) block complement activity,
(3) inhibit IFNs and IFN-induced mechanisms, (4) enhance NK cell
based killing (Orange et al., (2002) Nature Immunol 3:1006-1012;
Mireille et al. (2002) Immunogenetics 54: 527-542; Alcami (2003)
Nature Rev. Immunol. 3: 36-50; (5) down regulate the immune
response (e.g. IL-10, TGF-Beta, Khong and Restifo (2002) Nature
Immunol. 3: 999-1005; 2002) and (6) act to breakdown the
extracellular matrix and enhance spread of the virus within the
tumor (e.g. metalloproteinases) (Bosman and Stamenkovic (2003) J.
Pathol. 2000: 423-428; Visse and Nagase (2003) Circulation Res. 92:
827-839).
[0076] In another embodiment of the invention, the genetic element
of interest encodes a protein that can provide a safety mechanism,
such that expression of the protein can be used to abort viral
infection when necessary, e.g. the Herpes Simples Virus (HSV)
thymidine kinase, which, when expressed in the presence of
ganciclovir (GCV), converts GCV into a triphosphate which is toxic
to both cellular and viral replication (see Example 9; FIGS. 6A and
B).
[0077] In a different embodiment, the genetic element of interest
comprising the transposon encodes a therapeutically useful RNA
molecule, i.e. shRNA (Dorsett and Tuschl (2004) Nature Rev. Drug
Disc. 3:318-329) or antisense RNA, which hybridizes to a coding
mRNA nucleic acid sequence, e.g., a cancer protein sequence.
Expression of these types of molecules within a tumor cell can
provide a therapeutic benefit by reducing the translation and/or
stability of the targeted mRNA. In mammalian cells, short, e.g., 21
nucleotide, double stranded small interfering RNAs (shRNA) have
been shown to be effective at inducing an RNAi response (Elbashir,
et al. (2001) Nature 411:494-498). This mechanism may be used to
down-regulate expression levels of identified genes, e.g. for
treatment of or validation of relevance to disease. Antisense
polynucleotides can comprise naturally-occurring nucleotides, or
synthetic species formed from naturally-occurring subunits or their
close homologs (see Isis Pharmaceuticals, Carlsbad, Calif.;
Sequitor, Inc., Natick, Mass.). RNA molecules of interest, include
but are not limited to, shRNAs or antisense RNA molecules which
hybridize to the nucleotide sequences encoding cell survival
proteins overexpressed in cancer cells (e.g. survivin and XIAP
(Ling and Li (2004) Biotechniques 36:450-460; McManus et al. (2004)
Oncogene 23:8105-8117) or multi-drug resistance genes (e.g. Stege
et al. (2004) Cancer Gene Therapy 11:699-706).
[0078] In another embodiment, the genetic element of interest is a
chimeric gene consisting of a gene that encodes a protein that
traverses cell membranes, for example, VP22 or TAT, fused to a gene
that encodes a protein that is toxic to tumor but not normal cells
(Snyder and Dowdy (2004) Pharm. Res. 21:389-393).
[0079] The transposon of the invention can further comprise an
expression element, located upstream from, and operably linked to,
the genetic element of interest described above. The nucleic acid
sequence comprising both the genetic element of interest and the
expression element is referred to as an "expression cassette". An
expression cassette comprising an SV40 promoter and a GFP reporter
gene, for example, is represented as "SV40/GFP".
[0080] In one embodiment of the present invention, the expression
element is a eukaryotic promoter, capable of driving expression of
the genetic element of interest when the transposon is inserted in
the proper orientation within the viral genome. Possible promoters
include, but are not limited to, e.g. a traditional POL II
eukaryotic promoter (e.g. E2F or hTERT (Wu et al. Trends in Mol.
Med. (2003) 9:421-429)), a POL III promoter (e.g. U6) or a viral
promoter (e.g. SV40, CMV, TK or MLP). A particularly preferred
promoter is the SV40 promoter.
[0081] In another embodiment, the expression element is a
eukaryotic splice acceptor sequence (Senapahty et al. (1990)
Methods Enzymol. 183:252-278). When the transposon, comprising the
splice acceptor sequence upstream from, and operably linked to, a
genetic element of interest, is appropriately inserted into the
viral genome, transcription of the genetic element can occur.
Preferred splice acceptor sequences are the sequences SEQ ID NO: 1
(5'-TGCTAATCTTCCTTTCTCTCTTCAGG-3'), SEQ ID NO: 2
(5'-TTTCTCTCTTCAGG-3') or SEQ ID NO: 3 (5'-CAGG-3'). A particularly
preferred sequence is SEQ ID NO: 1.
[0082] In another embodiment of the present invention, the
transposon further comprises restriction enzyme sites located close
to the ends of the transposon and unique to the transposon, i.e.
not present in the genomic DNA of the replication competent target
virus. In a preferred embodiment, the restriction enzyme sites are
identical. A particularly preferred restriction enzyme site is a
Pmel site. The presence of these restriction enzyme sites allows
facile replacement of the genetic element of interest or the
expression cassette present in the originally isolated product
virus (see Examples 5 and 6) and thereby permits use of one
well-characterized product virus as a backbone for the creation of
a series of viral vectors. Each viral vector, although derived from
one product virus, can be armed with a different expression
cassette and thus be used to target treatment of different
diseases.
[0083] Also contemplated within the scope of the invention are
further steps within the method of the invention that increase the
efficiency of identification/isolation of product viruses produced
by the method of the invention. For example, in one embodiment of
the invention, product viruses isolated from bacteria are subjected
to two rounds of infection using different eukaryotic cells, so as
to increase the number of viral DNA colonies produced (see Example
4). Such additional steps are useful, but are not necessary, to the
practice of the invention.
[0084] As mentioned above, the donor DNA comprising the transposon
may be linear or may be a plasmid. Where the donor DNA is a
plasmid, the donor DNA may further comprise an origin of
replication, for example, the R6K origin of replication. A properly
chosen origin of replication insures that donor plasmids will not
be able to multiply in a bacterial cell chosen for amplification of
product viruses. Choice of an appropriate origin of replication is
well within the knowledge of one skilled in the art.
[0085] The transposon can further comprise a marker or selectable
gene that is useful in the efficient isolation and/or
identification of product viruses. Such a gene often provides a
selective growth advantage; e.g., the gene may enhance cell
viability, relieve a nutritional requirement, and/or provide
resistance to a drug. Any desired marker or selectable gene can be
used, including, but not limited to, genes conferring antibiotic
resistance (e.g. resistance to chloramphenicol, tetracycline,
ampicillin, kanamycin) or drug resistance (e.g. resistance to
methotrexate or G418). The preferred selectable gene may depend on
the organism being used to isolate or amplify the product viruses,
e.g. antibiotic resistance when using bacterial cells, methotrexate
resistance when using eukaryotic cells. Alternatively, a marker or
reporter gene may encode a gene product that, by enzymatic
activity, gives rise to a detection signal based on color,
fluorescence, or luminescence. Many reporter genes are known to
those of skill in the art and others may be identified or
synthesized by methods known to those of skill in the art. Useful
reporter genes include, but are not limited to, green fluorescing
protein (GFP), LacZ, renilla luciferase or firefly luciferase. A
particularly preferred reporter gene is GFP. Assays for measuring
these proteins are well known in the art. The choice of an
appropriate selection/marker gene is well within the knowledge of
one skilled in the art.
[0086] In a preferred embodiment of the invention (see Example 3;
FIG. 1), the donor DNA is a plasmid comprising an R6K origin of
replication and a transposon, where the transposon comprises a
chloramphenicol resistance gene, as well as a GFP expression
cassette, SV40/GFP, i.e. a GFP gene with an SV40 promoter operably
linked to it.
[0087] Identification and Evaluation of Product Viruses.
[0088] Replication competent product viruses comprising a
transposon inserted into a functional site within the viral genome
generated by the method of the invention are identified and
evaluated by measurement of expression of the genetic element of
interest present within the inserted transposon.
[0089] Expression can be assayed in a variety of ways. If the
genetic element of interest is a reporter gene, the product encoded
by the reporter gene is usually detected by an intrinsic activity
associated with that product. For example, the reporter gene can be
chosen from firefly luciferase (deWet et al. (1987) Proc. Natl.
Acad. Sci. 1:4154-4158), bacterial luciferase (Baldwin et al.
(1984) Biochemistry 23:3663-3667) or alkaline phosphatase (Toh et
al. (1989) Eur. J. Biochem. 182:231-238). Assays for these proteins
are well known in the art and many kits to perform these assays are
commercially available (e.g. Dual-Luciferase Reporter Assay Kit,
Promega, Madison, Wis.). In one preferred embodiment, use of a
reporter gene encoding GFP permits identification of product
viruses as green-fluorescing plaques.
[0090] Other methods for measurement of gene expression that can be
used include, but are not limited to, Southern blotting, Northern
blotting to quantitate mRNA transcription (Thomas et al. (1980)
Proc. Natl. Acad. Sci. USA 77:5201-5205), DNA analysis by dot
blotting, or in situ hybridization, using an appropriately labeled
probe, based on the sequences of the inserted genetic element.
Alternatively, gene expression may be measured by immunological
methods, such as immunohistochemical staining of cells or tissue
sections, to directly quantitate the expression of a gene product
using appropriately specific antibodies. Antibodies useful for
immunohistochemical staining and/or assay of cell supernatants may
be either monoclonal or polyclonal.
[0091] When the genetic element of interest encodes a therapeutic
protein, assays that directly measure protein expression or assays
which measure an activity of the expressed protein can be used to
identify product viruses of potential interest. In general, assay
methods that can be used to determine levels of an expressed
protein in a sample derived from a host are well-known to those of
skill in the art and include such assay methods as
radioimmunoassays (RIA), competitive-binding assays, western Blot
analysis and enzyme-linked immunoabsorbant assays (ELISA),
fluorescent activated cell sorting (FACS), and surface plasmon
resonance. Among these, ELISAs frequently are preferred. An ELISA
assay requires having an antibody specific to the gene product of
interest, preferably a monoclonal antibody. In addition, a reporter
antibody generally is prepared which binds to the monoclonal
antibody or is directly conjugated to the antibody. The reporter
antibody is attached to a detectable reagent such as a radioactive,
fluorescent or enzymatic reagent, e.g. horseradish peroxidase
enzyme.
[0092] To carry out an ELISA, a sample is removed from a host and
incubated on a solid support, e.g. a polystyrene dish that binds
the polypeptides in the sample. Any free polypeptide binding sites
on the dish are then covered by incubating with a non-specific
protein such as bovine serum albumin. Next, the monoclonal antibody
is incubated in the dish during which time the monoclonal
antibodies attach to any product polypeptides attached to a solid
support. Unbound monoclonal antibody is separated from bound
antibody by washing with buffer. The reporter antibody linked to
horseradish peroxidase is placed in the dish resulting in binding
of the reporter antibody to any monoclonal antibody bound to the
gene product of interest. Unattached reporter antibody is then
washed out. Reagents for peroxidase activity, including a
calorimetric substrate are then added to the dish. Immobilized
peroxidase, linked to the product protein through the primary and
secondary antibodies, produces a colored reaction product. The
amount of color developed in a given time period indicates the
amount of product polypeptide present in the sample. Quantitative
results typically are obtained by reference to a standard
curve.
[0093] A competition assay may be employed wherein antibodies
specific to the product protein are attached to a solid support and
labeled product protein and a sample derived from the host are
passed over the solid support and the amount of label detected
attached to the solid support correlated to a quantity of product
protein in the sample.
[0094] These and other assays are described, among other places, in
Hampton et al. (Serological Methods, a Laboratory Manual, APS
Press, St Paul, Minn., 1990) and Maddox et al. (J. Exp. Med.
158:12111, 1983).
[0095] Expression of shRNA or antisense RNA expression can be
measured through measurement of the expression of, or activity of,
the target message; i.e. a reduction in either of these
measurements would indicate production of the desired RNA
molecule.
[0096] Replacement of Genetic Element of Interest Within a Product
Virus
[0097] One advantage of the present invention is the potential to
use an isolated and characterized product virus of the invention as
a gene therapy vector for more than one therapeutic indication.
This is accomplished by incorporating the ability to easily
exchange the genetic element of interest or expression cassette
present within the transposon of the originally isolated product
virus with a different genetic element of interest or expression
cassette. Transposons useful in the present invention, therefore,
further comprise unique enzyme restriction sites; i.e. sites not
present in the starting viral genome, located close to the ends of
the transposon. Presence of these restriction enzyme sites allows
facile replacement of the original genetic element of interest or
expression cassette present in an isolated product virus with a
different genetic element of interest or expression cassette. In a
preferred embodiment, the restriction enzymes sites are blunt-ended
restriction enzyme sites, allowing insertion of a replacement
genetic element of interest or expression cassette in both possible
orientations within the product viral genome. It has been shown
that expression levels from therapeutic genes placed within the
replicating virus can vary depending on the viral sequences
flanking the gene and on the orientation of the gene within the
viral genome (Schneider et al. (1989) J. Gen. Virol. 70: (Pt 2)
417-427; Mittal et al. (1995) Virology 210:226-230) so it is useful
to be able to generate product viruses containing genetic elements
(or expression cassettes) inserted in both directions for
evaluation.
[0098] In a particularly preferred embodiment, the restriction
enzyme sites are identical. A particularly preferred restriction
enzyme site is the Pmel restriction enzyme site; however, choice of
enzyme restriction sites is well within the knowledge of one
skilled in the art (see Examples 5 and 6).
[0099] The replacement genetic element of interest, whether alone
or within an expression cassette can be any genetic element;
however, there is a limitation on the size of the insert which can
be used and still allow proper viral packaging (see, for example,
Bett et al (1993) J. Virol. 10:5911-5921).
[0100] Table 1 contains the results of an experiment showing
replacement of an SV40/GFP expression cassette present within two
isolated product viruses, identified as PL29 and PL11 (see Example
5). The SV40/GFP expression cassette is replaced with other
expression cassettes, i.e. TK/RL, CMV/LUC or CMV/LacZ (see Examples
6), allowing isolation of new replication competent product viruses
containing the expression cassettes oriented in both directions
within the PL29 and PL11 genomes. FIG. 4 demonstrates the
difference in potency among the various replication competent
product viruses generated by these substitutions.
[0101] Therapeutic Activity Measurements
[0102] Replication competent product viruses of the invention are
evaluated for their therapeutic utility using in vitro or in vivo
systems designed to model a particular disease or condition.
Elegant models now exist for some of the major cancer indications,
e.g. prostate cancer (Russell and Voeks (2003) Methods Mol. Med.
81:89-112; Powell et al. (2003) Current Drug Targets 4:263-279),
bone metastasis (Rosol et al. (2004) Cancer Treat. Rep. 118:47-81),
angiogenesis (Mg et al. (2000) J. Neurooncol. 50:89-98; Hanahan et
al. (1996) Eur. J. Cancer 32A:2386-2393) and these can be
complemented, in some cases, by spontaneous tumor models in dogs
and cats (Hansen and Khanna (2004) Eur. J. Cancer 40: 858-880).
[0103] Utility in the treatment of malignancy can be examined by
measurement of the lytic potential of a product virus in tumor
cells derived from tissues of interest as therapeutic targets.
Tumor cell lines useful for testing such viruses may include, but
are not limited to, colon cell lines, including but not limited to,
DLD-1, HCT116, HT29, LS1034 and SW48 cell lines; prostate cell
lines, including but not limited to, DU145 and PC-3 cell lines;
pancreatic cell lines, including but not limited to, the Panc-1
cell line; breast tumor cell lines, including but not limited to,
the MDA231 cell line and ovarian cell lines, including but not
limited to, the OVCAR-3 cell line. Hemopoietic cell lines include,
but are not limited to, the Raji and Daudi B-lymphoid cells, K562
erythroblastoid cells, U937 myeloid cells, and HSB2 T-lymphoid
cells. Any other tumor cell lines that are available can be used in
evaluating and identifying replication competent viruses of the
invention for the treatment of neoplasia.
[0104] The cytolytic activity of the replication competent viruses
of the invention can be determined in representative tumor cell
lines and the data converted to a measurement of potency, with an
adenovirus belonging to subgroup C, preferably Ad5, being used as a
standard (i.e. given a potency of 1). A preferred method for
determining cytolytic activity is an MTT assay (see Example 7, FIG.
4).
[0105] The therapeutic index of a replication competent virus of
the invention in a particular tumor cell line can be calculated by
comparison of the potency of the given virus in a tumor cell line
with the potency of that same virus in a non-cancerous cell line.
Preferred non-cancerous cell lines are SAEC cells, which are
epithelial in origin, and HuVec cells, which are endothelial in
origin. These two cell types represent normal cells from which
organs and vasculature, respectively, are derived, and are
representative of likely sites of toxicity during viral therapy,
depending on the mode of delivery of the virus. However, practice
of the invention is not limited to the use of these cells, and
other non-cancerous cell lines (e.g. B cells, T cells, macrophages,
monocytes, fibroblasts) may also be used.
[0106] The replication competent viruses of the invention can be
further evaluated for their ability to target neoplastic cell
growth (i.e. cancer) by their capacity to reduce tumorigenesis or
neoplastic cell burden in nude mice harboring a transplant of
neoplastic cells, as compared to untreated mice harboring an
equivalent neoplastic cell burden.
[0107] Evaluation of the replication competent viruses of the
invention can also be performed using primary human tumor explants
(Lam et al. (2003) Cancer Gene Therapy; Grill et al. (2003) Mol.
Therapy 6:609-614), which provide test conditions present in tumors
that cannot normally be produced using the tumor xenograft
studies.
Therapeutic Utility
[0108] The replication competent viruses generated by the method of
the invention are useful in gene therapy.
[0109] Pharmaceutical Compositions and Administration
[0110] The present invention also relates to pharmaceutical
compositions that comprise the replication competent viruses of the
invention, formulated for therapeutic administration to a patient.
For therapeutic use, a sterile composition containing a
pharmacologically effective dosage of virus is administered to a
human patient or veterinary non-human patient for treatment, for
example, of a neoplastic condition. Generally, the composition will
comprise about 10.sup.11 or more adenovirus particles in an aqueous
suspension. A pharmaceutically acceptable carrier or excipient is
often employed in such sterile compositions. A variety of aqueous
solutions can be used, e.g. water, buffered water, 0.4% saline,
0.3%-glycine and the like. These solutions are sterile and
generally free of particulate matter other than the desired
adenoviral vector. The compositions may contain pharmaceutically
acceptable auxiliary substances as required to approximate
physiological conditions such as pH adjusting and buffering agents,
toxicity adjusting agents and the like, e.g. sodium acetate, sodium
chloride, potassium chloride, calcium chloride, sodium lactate,
etc. Excipients that enhance infection of cells by adenovirus may
be included. (see U.S. Pat. No. 6,392,069)
[0111] Viruses of the invention may also be delivered to neoplastic
cells by liposome or immunoliposome delivery; such delivery may be
selectively targeted to neoplastic cells on the basis of a cell
surface property present on the neoplastic cell population (e.g.,
the presence of a cell surface protein which binds an
immunoglobulin in an immunoliposome). Typically, an aqueous
suspension containing the virions is encapsulated in liposomes or
immunoliposomes. For example, a suspension of adenoviral virions
can be encapsulated in micelles to form immunoliposomes by
conventional methods (U.S. Pat. No. 5,043,164, U.S. Pat. No.
4,957,735, U.S. Pat. No. 4,925,661; Connor and Huang, (1985) J.
Cell Biol. 101: 581; Lasic D. D. (1992) Nature 355: 279; Novel Drug
Delivery (eds. Prescott and Nimmo, Wiley, New York-, 1989); Reddy
et al. (1992) J. Immunol. 148:1585). Immunoliposomes comprising an
antibody that binds specifically to a cancer cell antigen (e.g.,
CALLA, CEA) present on the cancer cells of the individual may be
used to target virions to those cells (Fisher (2001) Gene Therapy
8:341-348).
[0112] Viral Therapy
[0113] The viruses of the invention, or pharmaceutical compositions
thereof, can be administered for therapeutic treatment of
neoplastic disease or cancer. In therapeutic applications,
compositions are administered to a patient already affected by the
particular neoplastic disease, in an amount sufficient to cure or
at least partially arrest the condition and its complications. An
amount adequate to accomplish this is defined as a "therapeutically
effective dose" or "efficacious dose". Amounts effective for this
use will depend upon the severity of the condition, the general
state of the patient, and the route of administration.
[0114] For example, but not by way of limitation, a human patient
or non-human mammal having a solid or haemotologic neoplastic
disease, (e.g. pancreatic, colon, ovarian, lung, or breast
carcinoma, leukemia or multiple myeloma) may be treated by
administering a therapeutically effective dosage of an appropriate
adenovirus of the invention, i.e. one which has been shown to have
an improved therapeutic index for that tissue type. A preferred
chimeric adenovirus for the treatment of colon cancer would be the
adenovirus ColoAd1 (SEQ ID NO: 22). Suspensions of infectious
adenovirus particles may be delivered to neoplastic tissue by
various routes, including intravenous, intraperitoneal,
intramuscular, subdermal, and topical. An adenovirus suspension
containing about 10.sup.3 to 10.sup.12 or more virion particles per
ml may be administered by infusion (e.g., into the peritoneal
cavity for treating ovarian cancer, into the portal vein for
treating hepatocarcinoma or liver metastases from other non-hepatic
primary tumors) or other suitable route, including direct injection
into a tumor mass (e.g., a breast tumor), enema (e.g., colon
cancer), or catheter (e.g., bladder cancer). Other routes of
administration may be suitable for carcinomas of other origins,
i.e. inhalation as a mist (e.g., for pulmonary delivery to treat
bronchogenic carcinoma, small-cell lung carcinoma, non-small cell
lung carcinoma, lung adenocarcinoma, or laryngeal cancer) or direct
application to a tumor site (e.g., bronchogenic carcinoma,
nasopharyngeal carcinoma, laryngeal carcinoma, cervical
carcinoma).
[0115] Viral therapy using the replication competent viruses of the
instant invention may be combined with other antineoplastic
protocols, such as conventional chemotherapy or x-ray therapy to
treat particular cancers. Treatment can be concurrent or
sequential. A preferred chemotherapeutic agent is cisplatin, and
the preferred dose may be chosen by the practitioner based on the
nature of the cancer to be treated, and other factors routinely
considered in administering cisplatin. Preferably, cisplatin will
be administered intravenously at a dose of 50-120 mg/m.sup.2 over
3-6 hours. More preferably it is administered intravenously at a
dose of 80 mg/m.sup.2over 4 hours. A second preferred
chemotherapeutic agent is 5-fluorouracil, which is often
administered in combination with cisplatin. The preferred dose of
5-fluorouracil is 800-1200 mg/m.sup.2 per day for 5 consecutive
days.
[0116] Viral therapy using the replication competent viruses of the
instant invention as vectors may also be combined with other genes
known to be useful in viral based therapy. See U.S. Pat. No.
5,648,478.
[0117] Kits
[0118] The invention further relates to pharmaceutical packs and
kits comprising one or more containers filled with one or more of
the ingredients of the aforementioned compositions of the
invention. Associated with such container(s) can be a notice in the
form prescribed by a governmental agency regulating the
manufacture, use or sale of pharmaceuticals or biological products,
reflecting approval by the agency of the manufacture, use or sale
of the product for human administration.
[0119] The present invention is further described by the following
examples, which are illustrative of specific embodiments of the
invention, and various uses thereof. These exemplifications, which
illustrating certain specific aspects of the invention, do not
portray the limitations or circumscribe the scope of the disclosed
invention.
[0120] Unless otherwise indicated, the practice of the present
invention employs conventional techniques of cell culture,
molecular biology, microbiology, recombinant DNA manipulation,
immunology science, which are within the skill of the art. Such
techniques are explained fully in the literature. See, e.g. Cell
Biology: a Laboratory Handbook: J. Celis (Ed). Academic Press. N.Y.
(1996); Graham, F. L. and Prevec, L. Adenovirus-based expression
vectors and recombinant vaccines. In: Vaccines: New Approaches to
Immunological Problems. R. W. Ellis (ed) Butterworth. Pp 363-390;
Graham and Prevec Manipulation of adenovirus vectors. In: Methods
in Molecular Biology, Vol. 7: Gene Transfer and Expression
Techniques. E. J. Murray and J. M. Walker (eds) Humana Press Inc.,
Clifton, N.J. pp 109-128, 1991; Sambrook et al. (1989), Molecular
Cloning, A Laboratory Manual, 2nd ed., Cold Spring Harbor
Laboratory Press; Sambrook et al. (1989), and Ausubel et al.
(1995), Short Protocols in Molecular Biology, John Wiley and
Sons.
EXAMPLES
[0121] Standard techniques are used for recombinant nucleic acid
methods, polynucleotide synthesis, and microbial culture and
transformation (e.g., electroporation, lipofection). Generally,
enzymatic reactions and purification steps are performed according
to the manufacturer's specifications. The techniques and procedures
are generally performed according to conventional methods in the
art and various general references (see generally, Sambrook et al.,
Molecular Cloning: A Laboratory Manual, 2nd. edition (1989) Cold
Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y.) which are
provided throughout this document. The nomenclature used herein and
the laboratory procedures in analytical chemistry, organic
synthetic chemistry, and pharmaceutical formulation and delivery,
and treatment of patients. Methods for the construction of
adenoviral mutants are generally known in the art. See, Mittal, S.
K., Virus Res., 1993, vol: 28, pages 67-90; and Hermiston, T. et
al., Methods in Molecular Medicine: Adenovirus Methods and
Protocols, W. S. M. Wold, ed, Humana Press, 1999.
[0122] The following examples are offered by way of illustration
and are not intended to limit the invention in any way.
Example 1
Construction of Donor Plasmids
[0123] A. Donor plasmid containing a transposon with an SV40/GFP
expression cassette. The oligonucleotides GER82
(5'-AAATGTGGCCGGCCACTGATTCCACGTAGTGGTCAGGTA-3'; SEQ ID NO: 4) and
GER83 (5'-CTAGTACCTGACCACTACGTGGAATCAGTGGCCGGCCACATTT-3'; SEQ ID
NO: 5) were annealed and cloned into the Swal/Spel digested pGPS4
plasmid (New England Biolabs, Beverly, Mass., USA), adding Fsel and
Dralll restriction sites to pGPS4 in the order
Swal-Fsel-Dralll-Spel, producing the plasmid pGER57. Annealing of
oligonucleotides GER72 (5'-GGAATTGGCCGGCCAT ATCCGC-3'; SEQ ID NO:
6) and GER73 (5'-GGATATGGCCGGCCAATTCCGC-3'; SEQ ID NO: 7) and
ligation to a Sacll digested plasmid pQBI25-fPA (Q-Biogene,
Carlsbad, Calif., USA) created plasmid pGER54, which contains a
unique Fsel site immediately 5' of the sgGFP coding sequence. The
SV40 promoter/enhancer was PCR amplified from phRL-SV40 (Promega,
Madison, Wis., USA) with oligonucleotides GER94
(5'-CATGGATGGCCGGCCGCTGTGGAATGTGTGTC A G-3'; SEQ ID NO: 8) and
GER95 (5'-TCAGTAGCTAGCCATGGTGGCTAAGAGCTGTAATTGAACTGG-3'; SEQ ID NO:
9) and restriction enzyme digested to create a fragment with the
following features: Fsel-SV40 promoter/enhancer-small IVS-Nhel.
This fragment was cloned into the Fsel/Nhel digested pGER54 plasmid
to create plasmid pGER59, which contains a transcription unit
bounded by Fsel and Dralll in which the sgGFP gene is driven by the
SV40 promoter/enhancer. This transcription unit was then cloned
into the Fsel/Dralll digested pGER57 to result in a plasmid,
pGER98, which contains a chloramphenicol gene and SV40-sgGFP
transcription unit within the Tn7 transposon.
[0124] Plasmids pGPS4, pGER57 and pGER98 have the R6K origin of
replication and are unable to grow in ordinary lab strains of E.
coli such as DH5a which are pir- (Kolter et al. (1978) Cell
15:1199-1208 (1978); Metcalf et al. (1994) Gene 138:1-7). For
transformation and growth of these plasmids, a desired amount of
DNA or ligation mixture was added to 20 ul electrocompetent E. coli
TransforMax EC100D.TM. pir+ cells (Epicentre, Madison, Wis., USA)
in a chilled microcentrifuge tube. The cell/DNA mixture was then
transferred to a chilled cuvette and electroporated at the settings
recommended by the manufacturer. Immediately after electroporation,
the cell/DNA mixture was transferred to 1 ml of SOC medium
(Invitrogen, Carlsbad, Calif., USA) and incubated at 37.degree. C.
for 1 hour with shaking. The desired number of cells was plated on
CM.sup.R LB agar plates for growth of colonies.
[0125] B. Donor plasmids containing transposons with splice
acceptor/GFP expression cassettes. Plasmid pGER54 containing sgGFP
(see above, this Example) was digested with Fsel and Nhel, and
ligated to two sets of oligos, GER88
(5'-CCTTTCTCTCTTCAGGCCGCCATGG-3'; SEQ ID NO: 10) and GER99
(5'-GTTCTGGATCCGTGAGTCAACAGGAAAGTTCC-3'; SEQ ID NO: 11), which when
annealed give a Fsel-SA-Nhel fragment, and GER100
(5'-CCTGCTAATCTTCCTTTCTCTCTTCAGGCCGCCATGG-3'; SEQ ID NO: 12) and
GER101 (5'-CTAGCCATGGCGGCCTGAAGAGAGAAAGGAAGATTAGCAGGCCGG-3'; SEQ ID
NO: 13), which when annealed give an Fsel-BPS-Nhel fragment,
resulting in plasmids pGER99 and pGER100, respectively. These
latter two plasmids were digested with Fsel and Dralll and ligated
to Fsel/Dralll digested pGER57 (see above, this Example) to result
in plasmids pGER111, containing a transposon with the splice
acceptor sequence, SEQ ID NO: 1 and pGER112, containing a
transposon with the branch-point splice acceptor sequence, SEQ ID
NO: 2.
Example 2
Generation of a Starting Plasmid (pCJ51) Comprising an Ad5
Genome
[0126] The left end of Ad5 was introduced into pAdEasy (Stratagene,
La Jolla, Calif., USA) by homologous recombination. The
Scal-BstZ17I fragment of pTG3602 (Chartier et al, 1996) was
co-transfected along with Clal-linearized pAdEasy into BJ5183
bacteria (Stratagene, La Jolla, Calif., USA). The resulting
recombinant plasmid was named pCJ38. The Sall fragment of pAdEasy
corresponding to Ad5 nucleotides 9841-16746 was sub-cloned into the
Sall site of pBluescript-KS+ (Stratagene, La Jolla, Calif. 92037,
USA). The Pmel site of the resulting pCJ36 was then mutated using
the CJ23-23r pair of oligonucleotides with annealed double stranded
sequence as follows: 5'-CCGGCGGCAGAAGATCCC
CTCGTTGCACA[GC*TTAAAC]AGCGAGGAGGAGCGCATTTTGCGCTA-3 (SEQ ID NO: 14).
The Pmel site is bracketed and the asterisk designates the T to C
point mutation that destroys the Pmel site without disturbing the
IIIa ORF. Annealed oCJ23-23r was co-transfected along with
Pmel-linearized pCJ36 into BJ5183 and pCJ39 was obtained by
homologous recombination. Finally, the Pmel-Sall fragment of pCJ39
was re-introduced into pCJ38 by homologous recombination into the
BJ5183 bacteria after linearization of the latter plasmid with
Pmel. The plasmid containing the Pacl-flanked, E3-deleted, and
Pmel.sup.- Ad5 genome was named pCJ51.
Example 3
Transposition of the sgGFP Transposon From pGER98 to pCJ51
[0127] The in vitro transposition components of the GPS.TM.-LS
Linker Scanning System from New England Biolabs (Beverly, Mass.,
USA) were utilized as described in the kit, as follows. The
transposon donor plasmid (pGER98, 0.04 ug) was mixed with TnsABC (1
ul) and the target Ad plasmid (pCJ51, 0.16 ug) and incubated for 10
minutes at 37.degree. C. Start solution was added, incubated for 1
hour at 37.degree. C., and heated at 75.degree. C. for 10 minutes,
for a final volume of 20 ul. One ul of this reaction mixture
resulted in approximately 5000 chloramphenicol resistant colonies
following electroporation to DH10B electrocompetent cells
(Invitrogen, Carlsbad, Calif., USA).
[0128] Transposition of the splice acceptor transposons were
performed in the same way.
Example 4
Identification of Transposon Insertion Sites Within the Ad5
Genome
[0129] Chloramphenicol resistant colonies (see above) were pooled
(about 10,000 colonies per pool) and plasmid DNA was extracted
using Qiagen (Valencia, Calif., USA) endo-free plasmid extraction
kits. Five ug of Pacl-digested plasmid DNA was transfected into one
10 cm dish of HEK 293 cells by the calcium phosphate method
(Invitrogen, Carlsbad, Calif., USA). After overnight incubation, 15
ml of 2% agar (2% agar in 2% FBS DMEM plus penicillin/Streptomycin)
was then added to the cells. Four or five days later, another 10 ml
of 2% agar was added. After 10 days to 3 weeks incubation, green
plaques were isolated, and serial dilutions (10.sup.-3 to
10.sup.-9) of these plaques were used to infect A549 cells plated
in 6-well plates. After 30 minutes, 2 ml of 2% agar were added to
the cells. Three to four days later another layer of agar was
added, and green plaques from the highest dilutions were picked and
amplified (6 from 60 plaques). Viral DNA was extracted from the
cells using the HIRT method (Hirt B. (1967) J. Mol. Biol.
26:265-369) as follows. A549 cells were plated in 10 cm dishes and
infected with 100 ul to 200 ul of viral CPE stock after overnight
culture. After 16 to 48 hours the cells were lysed with 1 ml of
HIRT extraction buffer (0.6% SDS, 0.5 nM EDTA and 1 nM Tris pH 7.5)
and rocked slowly in a shaker for 15 minutes, after which 250 ul of
5M NaCl was added followed by storage overnight at 4.degree. C. The
cell lysate was then microcentrifuged for 30 minutes at 4.degree.
C. followed by proteinase K digestion (final concentration: 500
ug/ml) for 2-3 hours at 37.degree. C. After
1.times.phenol-chloroform and 1.times.chloroform extraction of the
cell lysate, viral DNA was precipitated by 2 volumes of EtOH and
1/10 volume of 3M NaOAc pH 5.2. The DNA precipitate was then washed
with 70% alcohol and resuspended in 100 ul of TE buffer.
[0130] The viral DNA was initially digested with Bgl II to identify
the Bgl II fragment containing the inserted transposon (see FIG.
2). In order to define the exact insertion site, PCR primers
capable of amplifying approximately 3 kb of the Ad5 genome in
overlapping segments within and/or flanking the identified Bgl II
fragments were used to amplify the inserted transposons flanked by
Ad5 DNA. The PCR fragments were sequenced using primers originating
within the transposon and oriented toward the Ad5 DNA at each end
of the transposon. For all Ad-transposon genomes examined in this
manner, the two sequences were found to indicate the identical
position and orientation of the transposon insertion for any given
Ad isolate. Results are shown in FIG. 3.
[0131] Identification of the insertion sites for transposons
comprising the splice acceptor sequences SEQ ID NO: 1 (SA) or SEQ
ID NO: 2(BPS) were done in the same way (see FIG. 5).
Example 5
Replacement of Genetic Elements of Interest Within Ad5/PL11 and
Ad5/PL29
[0132] Two plasmids, Ad5/PL11 and Ad5/PL29, which contain
replication competent Ad5 viruses containing transposons comprising
the SV40/GFP expression cassette (see Example 4) were used to
examine the facility with which the original SV40/GFP expression
cassette could be replaced with different expression cassettes.
A. Expression cassette construction. Three reporter gene expression
cassettes, each of which contain a reporter gene under the control
of a promoter, were constructed as follows. The CMV/LacZ expression
cassette (LacZ gene driven by the CMV promoter) was PCR amplified
with primers (5'-AG CTGTTTAAACCGATGTACGGGCCAG-3' (SEQ ID NO: 15)
and 5'-TGACGTTTAAACTAGAAGGCACAGTCGAGGC-3' (SEQ ID NO: 16)) using
pcDNA3.1/LacZ/hygro (Invitrogen, Carlsbad, Calif., USA) as
template. The TK/RL (Renilla luciferase gene driven by TK promoter)
expression cassette was excised from the phRL/TK vector (Promega,
Madison, Wis., USA) with Bgl II and BamHI restriction endonucleases
and then blunt-ended with Klenow fragment. To construct the CMV/Luc
expression cassette (firefly luciferase gene driven by the CMV
promoter), two oligonucleotides (5'-CTAGCGAATTCGGTACCACGCGTCTC
GAGGCCAC-3' (SEQ ID NO: 17) and
5'-CATGGTGGCCTCGAGACGCGTGGTACCGAATTCG-3' (SEQ ID NO: 18)) were
annealed and inserted into the pGL3Basic plasmid (Promega, Madison,
Wis., USA) digested with Nhe I/Nco I, resulting in plasmid
pGL3BasicI that contains a convenient multiple cloning site (Nhe I,
EcoR I, Kpn I, Mlu I, Xho I and Nco I) at the 5' end of the firefly
luciferase gene. The firefly luciferase gene was excised with Nhe I
and Xba I from pGL3BasicI and cloned into Nhe I/Xba I digested
phRL-CMV (Promega) to result in the pCMV/Luc plasmid. The CMV/Luc
cassette was then excised from this plasmid with Bgl II and BamH I
and blunt-ended for cloning into the Pme I sites of Ad5/PL11 and
Ad5/PL29. B. Expression cassette replacement. The blunt-ended
expression cassettes (CM/LacZ, TK/RL or CMV/Luc) were cloned into
the transposon insertion sites within the Ad5/PL11 and Ad5/PL29
viral DNA, using the Pmel restriction sites present near both ends
of the inserted transposon. HIRT extracted or CsCl purifed
Ad5/pPL11 or Ad5/PL29 DNA was digested with Pme I to remove all but
15 bp of the 3.1 kb transposon. The three reporter expression
cassettes were each blunt-end ligated to Pmel-digested viral DNA
using T4 DNA ligase (rapid ligation kit from Epicentre, Madison,
Wis., USA). The mass ratio of the expression cassette fragment to
viral DNA was 1:10. The ligation mixture was transfected into HEK
293 cells using the calcium phosphate method. After 10 days to two
weeks, non-green viral plaques were picked and used to infect A549
cells to test for reporter gene expression. LacZ expression was
detected by the In Situ .beta.-Gal Staining Kit (Stratagene, La
Jolla, Calif., USA). Expression of firefly or renilla luciferase
was measured by the Dual-Luciferase reporter assay kit (Promega,
Madison, Wis., USA). Reporter-gene expression positive plaques were
then subjected to a second round of plaque assays in A549 cells.
Plaques were picked from the highest dilution wells and reporter
gene expression was verified.
Example 6
Identification of Insertion Orientations of Reporter Expression
Cassettes
[0133] Since Pmel is a blunt-end cutter, the blunt-ended reporter
expression cassettes can be inserted in both orientations. The
orientation was identified by conducting two separate PCR reactions
with the Ad/expression cassette viral DNA as the template. A primer
common to both reactions was a reverse primer from the given
expression cassette (not shown). The second primer for each
reaction was one of the two primers from the set of Ad primers that
originally identified the insertion site (see above under
"Identification of transposon insertion sites on Ad5 genome"). Only
one of the PCR fragments can result in a PCR fragment, which
therefore defines the direction of orientation of insertion of the
expression cassette in the Ad genome. The results are shown in
Table 1.
TABLE-US-00001 TABLE 1 Virus Total # Positive Orientation name
clones clones .fwdarw. .rarw. .fwdarw. 12 9 5 4 PL29/TK/RL 22 15 13
2 CMV/Luc 27 5 5* 0 CMVLacZ .rarw. 12 4 0 4 PL11/TK/RL 2 2 0 2
CMV/Luc 13 2 0 2 CMV/LacZ
Example 7
Viral Propagation and Potency Measurement
[0134] Human embryonic kidney cell line HEK 293 cells and human
epithelial lung carcinoma A549 cells were obtained from ATCC. Both
cell lines were maintained in DMEM with 10% FBS and 0.1 nM
nonessential amino acid (NEAA). For the viral infection, DMEM with
2% FBS, 0.1 nM NEAA and 1 % penicillin/streptomycin was used. Viral
propagation was performed as described and viruses were quantitated
using a TMAE column (Shabram et al. (1997) Human Gene Therapy
8:453) or using the Adeno-X.TM. Rapid Titer Kit (Clontech, Palo
Alto, Calif., USA. The viruses ability to successfully replicate,
lyse and spread was measured by using a modification of the MTT
assay (Shen et al. 2003 J. Virology 77:2640-2650). The MTS assay
(Promega, CellTiter 96.RTM. Aqueous Non-Radioactive Cell
Proliferation Assay) was used in place of the MTT assay since
conversion of MTS by cells into aqueous, soluble formazan reduces
time and eliminates the use of a volatile organic solvent
associated with the MTT assay.
[0135] To perform the assay, cells were seeded at a defined density
for each tumor cell line that generated a confluent monolayer
within 24 hr. These densely seeded cells were allowed to grow for 2
additional days prior to exposure to the test virus(es). Infections
of both tumor and primary normal cells were carried out in
quadruplicate with serial three fold dilutions of the viruses
starting at a particle per cell ratio of 100 and ending at a
particle per cell ratio of 0.005. Infected cells were incubated at
37.degree. C. and the MTS assay was performed at the time points
indicated for the individual primary cells or tumor cell lines.
Mock-infected cells served as negative controls and established the
100% survival point for the given assay.
Example 8
RACE Analysis of the Splice Acceptor Clones
[0136] To determine the identity of the internal viral promoters,
total RNA extracted from cells infected with selected viral clones
containing insertions of splice acceptor transposons were used as
templates for PCR amplification using primers appropriate for RACE
analysis. The GFP gene specific primer is
5'-GGCCATGGAACAGGCAGTTTGCCAGTAGTGC-3' (SEQ ID NO: 19). The 10
Universal Primer A Mix (UPM) used was from the BD SMART.TM. cDNA
Amplification Kit from BD Biosciences (Palo Alto, Calif.; Catalog #
634914), which included primers
5'-CTAATACGACTCACTATAGGGCAAGCAGTG-3' (SEQ ID NO: 20) and
5'-TAATACGACTCACTATAGGGC-3' (SEQ ID NO: 21). Amplified DNA
fragments were isolated from agarose gels, cloned into the
appropriate vector from the kit and submitted for DNA sequencing.
The resulting sequence of the RACE fragment was compared to the
known sequence of Ad5 in order to determine those regions of the
adenovirus and transposon DNA on the RACE fragments, enabling
identification of the promoter and splicing used to create mRNA
containing the GFP gene.
Example 9
Generation of ColoAd1/PL30/TK/TK and its In Vitro Cytotoxicity in
the Presence and Absence of Ganciclovir (GCV)
[0137] ColoAd1 (SEQ ID NO: 22) was introduced into plasmid
pL30/TK/TK by the methods given in Example 2. This plasmid was then
used to produce replication competent product viruses comprising a
transposon containing the expression cassette SV40/GFP. A
replication competent product virus, ColoAd1/PL30, with a potency
equivalent to parental ColoAd1, was identified and the original
expression cassette replaced with a TK/TK expression cassette, i.e.
a thymidine kinase gene driven by a thymidine kinase promoter, in a
manner similar to that describe in Example 5.
[0138] Human umbilical vein endothelial cells (HuVEC) and human
epithelial lung carcinoma (A549) cells were obtained from ATCC.
A549 cells were maintained in DMEM with 10% FBS and HuVEC cells
were maintained in EGM (includes Basal medium and SingleQuots). For
viral infection, A549 cells were grown in DMEM with 2% FBS, 0.1 nM
NEAA and 1% penicillin/streptomycin and HuVEC cells were grown in
EGM medium. Viruses were quantitated as previously described.
[0139] To determine biological activity of the TK protein, the in
vitro cytotoxicity of the virus was assayed with and without
addition of gancyclovir (GCV), a compound that is converted by
thymidine kinase into a triphosphate analog capable of interfering
with both cellular and viral replication. Cells were seeded at a
density for each cell line that generated a confluent monolayer
within 24 hr. These densely seeded cells were allowed to grow for 2
additional days prior to exposure to the ColoAd1/PL30/TK/TK virus.
Infections of both cell lines were carried out in quadruplicate
with serial three-fold dilutions of the viruses starting at a
particle per cell ratio of 100 and ending at a particle per cell
ratio of 0.0152. For each cell type, infections were with or
without GCV (20 uM, added at 24 hours post infection). Infected
cells were incubated at 37.degree. C. for 8 days post infection and
cell viability was measured by the MTS assay. Mock-infected cells
with or without GCV addition established the 100% survival point
for the given assay.
[0140] In HuVec cells (see FIG. 6A), there is no difference in cell
killing with or without GCV addition at the lower viral dose range
(0.0152 vp/cell to 1.23 vp/cell), but at a higher viral dose range
(3.7 vp/cell to 100 vp/cell), addition of GCV prevents cell killing
when compared to infection alone. This is explained by the fact
that at higher viral doses (11, 33 and 100 vp/cell) in the absence
of GVC, the level of initial infection is high enough to result in
significant killing of cells due to rounds of viral infection and
bursting. In the presence of GCV, the thymidine kinase expressed
from the TK promoter within the ColoAd1/PL30/TK/TK virus is
inhibited and results in death of the infectious centers, which
prevents the viral infection and bursting rounds needed for a
significant cell kill. In A549 cells, addition of GCV significantly
protects against cell killing at all doses of viral infection. The
significant killing at all vp/cell (as compared to the HuVEC cells)
is due to the fact that the virus replicates much faster in the
A549 cells than in HuVec cells, allowing a much greater number of
rounds of infection and bursting in the absence of GCV.
[0141] All patents and publications and patents mentioned in the
above specification are herein incorporated by reference. While the
present invention has been described with reference to the specific
embodiments thereof, it should be understood by those skilled in
the art that various changes may be made and equivalents may be
substituted without departing from the true spirit and scope of the
invention. In addition, many modifications may be made to adapt a
particular situation, material, composition of matter, process,
process step or steps, to the objective, spirit and scope of the
present invention. All such modifications are intended to be within
the scope of the claims appended hereto.
Sequence CWU 1
1
22126DNAHomo sapiens 1tgctaatctt cctttctctc ttcagg 26214DNAhomo
sapien 2tttctctctt cagg 1434DNAhomo sapien 3cagg
4439DNAartificialoligo 4aaatgtggcc ggccactgat tccacgtagt ggtcaggta
39543DNAartificialoligo 5ctagtacctg accactacgt ggaatcagtg
gccggccaca ttt 43622DNAartificialoligo 6ggaattggcc ggccatatcc gc
22722DNAartificialoligo 7ggatatggcc ggccaattcc gc
22834DNAartificialoligo 8catggatggc cggccgctgt ggaatgtgtg tcag
34942DNAartificialoligo 9tcagtagcta gccatggtgg ctaagagctg
taattgaact gg 421025DNAartificialoligo 10cctttctctc ttcaggccgc
catgg 251132DNAartificialoligo 11gttctggatc cgtgagtcaa caggaaagtt
cc 321237DNAartificialoligo 12cctgctaatc ttcctttctc tcttcaggcc
gccatgg 371345DNAartificialoligo 13ctagccatgg cggcctgaag agagaaagga
agattagcag gccgg 451463DNAartificialoligo 14ccggcggcag aagatcccct
cgttgcacag cttaaacagc gaggaggagc gcattttgcg 60cta
631527DNAartificialprimer 15agctgtttaa accgatgtac gggccag
271631DNAartificialprimer 16tgacgtttaa actagaaggc acagtcgagg c
311734DNAartificialprimer 17ctagcgaatt cggtaccacg cgtctcgagg ccac
341834DNAartificialprimer 18catggtggcc tcgagacgcg tggtaccgaa ttcg
341931DNAartificialprimer 19ggccatggaa caggcagttt gccagtagtg c
312045DNAartificialprimer 20ctaatacgac tcactatagg gcaagcagtg
gtatcaacgc agagt 452121DNAartificialprimer 21taatacgact cactataggg
c 212232325DNAAdenovirus 22ctatctatat aatatacctt atagatggaa
tggtgccaat atgtaaatga ggtgatttta 60aaaagtgtgg atcgtgtggt gattggctgt
ggggttaacg gctaaaaggg gcggtgcgac 120cgtgggaaaa tgacgttttg
tgggggtgga gtttttttgc aagttgtcgc gggaaatgtg 180acgcataaaa
aggctttttt ctcacggaac tacttagttt tcccacggta tttaacagga
240aatgaggtag ttttgaccgg atgcaagtga aaattgttga ttttcgcgcg
aaaactgaat 300gaggaagtgt ttttctgaat aatgtggtat ttatggcagg
gtggagtatt tgttcagggc 360caggtagact ttgacccatt acgtggaggt
ttcgattacc gtgtttttta cctgaatttc 420cgcgtaccgt gtcaaagtct
tctgttttta cgtaggtgtc agctgatcgc tagggtattt 480atacctcagg
gtttgtgtca agaggccact cttgagtgcc agcgagaaga gttttctcct
540ctgcgccggc agtttaataa taaaaaaatg agagatttgc gatttctgcc
tcaggaaata 600atctctgctg agactggaaa tgaaatattg gagcttgtgg
tgcacgccct gatgggagac 660gatccggagc cacctgtgca gctttttgag
cctcctacgc ttcaggaact gtatgattta 720gaggtagagg gatcggagga
ttctaatgag gaagctgtaa atggcttttt taccgattct 780atgcttttag
ctgctaatga agggttagaa ttagatccgc ctttggacac ttttgatact
840ccaggggtaa ttgtggaaag cggtacaggt gtaagaaaat tacctgattt
gagttccgtg 900gactgtgatt tgcactgcta tgaagacggg tttcctccga
gtgatgagga ggaccatgaa 960aaggagcagt ccatgcagac tgcagcgggt
gagggagtga aggctgccaa tgttggtttt 1020cagttggatt gcccggagct
tcctggacat ggctgtaagt cttgtgaatt tcacaggaaa 1080aatactggag
taaaggaact gttatgttcg ctttgttata tgagaacgca ctgccacttt
1140atttacagta agtgtgttta agttaaaatt taaaggaata tgctgttttt
cacatgtata 1200ttgagtgtga gttttgtgct tcttattata ggtcctgtgt
ctgatgctga tgaatcacca 1260tctcctgatt ctactacctc acctcctgag
attcaagcac ctgttcctgt ggacgtgcgc 1320aagcccattc ctgtgaagct
taagcctggg aaacgtccag cagtggaaaa acttgaggac 1380ttgttacagg
gtggggacgg acctttggac ttgagtacac ggaaacgtcc aagacaataa
1440gtgttccata tccgtgttta cttaaggtga cgtcaatatt tgtgtgacag
tgcaatgtaa 1500taaaaatatg ttaactgttc actggttttt attgcttttt
gggcggggac tcaggtatat 1560aagtagaagc agacctgtgt ggttagctca
taggagctgg ctttcatcca tggaggtttg 1620ggccattttg gaagacctta
ggaagactag gcaactgtta gagaacgctt cggacggagt 1680ctccggtttt
tggagattct ggttcgctag tgaattagct agggtagttt ttaggataaa
1740acaggactat aaacaagaat ttgaaaagtt gttggtagat tgcccaggac
tttttgaagc 1800tcttaatttg ggccatcagg ttcactttaa agaaaaagtt
ttatcagttt tagacttttc 1860aaccccaggt agaactgctg ctgctgtggc
ttttcttact tttatattag ataaatggat 1920cccgcagact catttcagca
ggggatacgt tttggatttc atagccacag cattgtggag 1980aacatggaag
gttcgcaaga tgaggacaat cttaggttac tggccagtgc agcctttggg
2040tgtagcggga atcctgaggc atccaccggt catgccagcg gttctggagg
aggaacagca 2100agaggacaac ccgagagccg gcctggaccc tccagtggag
gaggcggagt agctgacttg 2160tctcctgaac tgcaacgggt gcttactgga
tctacgtcca ctggacggga taggggcgtt 2220aagagggaga gggcatctag
tggtactgat gctagatctg agttggcttt aagtttaatg 2280agtcgcagac
gtcctgaaac catttggtgg catgaggttc agaaagaggg aagggatgaa
2340gtttctgtat tgcaggagaa atattcactg gaacaggtga aaacatgttg
gttggagcct 2400gaggatgatt gggaggtggc cattaaaaat tatgccaaga
tagctttgag gcctgataaa 2460cagtataaga ttactagacg gattaatatc
cggaatgctt gttacatatc tggaaatggg 2520gctgaggtgg taatagatac
tcaagacaag gcagttatta gatgctgcat gatggatatg 2580tggcctgggg
tagtcggtat ggaagcagta acttttgtaa atgttaagtt taggggagat
2640ggttataatg gaatagtgtt tatggccaat accaaactta tattgcatgg
ttgtagcttt 2700tttggtttca acaatacctg tgtagatgcc tggggacagg
ttagtgtacg gggatgtagt 2760ttctatgcgt gttggattgc cacagctggc
agaaccaaga gtcaattgtc tctgaagaaa 2820tgcatatttc aaagatgtaa
cctgggcatt ctgaatgaag gcgaagcaag ggtccgccac 2880tgcgcttcta
cagatactgg atgttttatt ttgattaagg gaaatgccag cgtaaagcat
2940aacatgattt gcggtgcttc cgatgagagg ccttatcaaa tgctcacttg
tgctggtggg 3000cattgtaata tgctggctac tgtgcatatt gtttcccatc
aacgcaaaaa atggcctgtt 3060tttgatcaca atgtgatgac gaagtgtacc
atgcatgcag gtgggcgtag aggaatgttt 3120atgccttacc agtgtaacat
gaatcatgtg aaagtgttgt tggaaccaga tgccttttcc 3180agaatgagcc
taacaggaat ttttgacatg aacatgcaaa tctggaagat cctgaggtat
3240gatgatacga gatcgagggt acgcgcatgc gaatgcggag gcaagcatgc
caggttccag 3300ccggtgtgtg tagatgtgac tgaagatctc agaccggatc
atttggttat tgcccgcact 3360ggagcagagt tcggatccag tggagaagaa
actgactaag gtgagtattg ggaaaacttt 3420ggggtgggat tttcagatgg
acagattgag taaaaatttg ttttttctgt cttgcagctg 3480tcatgagtgg
aaacgcttct tttaaggggg gagtcttcag cccttatctg acagggcgtc
3540tcccatcctg ggcaggagtt cgtcagaatg ttatgggatc tactgtggat
ggaagacccg 3600tccaacccgc caattcttca acgctgacct atgctacttt
aagttcttca cctttggacg 3660cagctgcagc tgccgccgcc gcttctgttg
ccgctaacac tgtgcttgga atgggttact 3720atggaagcat catggctaat
tccacttcct ctaataaccc ttctaccctg actcaggaca 3780agttacttgt
ccttttggcc cagctggagg ctttgaccca acgtctgggt gaactttctc
3840agcaggtggt cgagttgcga gtacaaactg agtctgctgt cggcacggca
aagtctaaat 3900aaaaaaatcc cagaatcaat gaataaataa acaagcttgt
tgttgattta aaatcaagtg 3960tttttatttc atttttcgcg cacggtatgc
cctagaccac cgatctctat cattgagaac 4020tcggtggatt ttttccagga
tcctatagag gtgggattga atgtttagat acatgggcat 4080taggccgtct
ttggggtgga gatagctcca ttgaagggat tcatgctccg gggtagtgtt
4140gtaaatcacc cagtcataac aaggtcgcag tgcatggtgt tgcacaatat
cttttagaag 4200taggctgatt gccacagata agcccttggt gtaggtgttt
acaaaccggt tgagctggga 4260tgggtgcatt cggggtgaaa ttatgtgcat
tttggattgg atttttaagt tggcaatatt 4320gccgccaaga tcccgtcttg
ggttcatgtt atgaaggacc accaagacgg tgtatccggt 4380acatttagga
aatttatcgt gcagcttgga tggaaaagcg tggaaaaatt tggagacacc
4440cttgtgtcct ccaagatttt ccatgcactc atccatgata atagcaatgg
ggccgtgggc 4500agcggcgcgg gcaaacacgt tccgtgggtc tgacacatca
tagttatgtt cctgagttaa 4560atcatcataa gccattttaa tgaatttggg
gcggagagta ccagattggg gtatgaatgt 4620tccttcgggc cccggagcat
agttcccctc acagatttgc atttcccaag ctttcagttc 4680cgagggtgga
atcatgtcca cctggggggc tatgaaaaac accgtttctg gggcgggggt
4740gattaattgt gatgatagca aatttctgag caattgagat ttgccacatc
cggtggggcc 4800ataaatgatt ccgattacgg gttgcaggtg gtagtttagg
gaacggcaac tgccgtcttc 4860tcgaagcaag ggggccacct cgttcatcat
ttcccttaca tgcatatttt cccgcaccaa 4920atccattagg aggcgctctc
ctcctagtga tagaagttct tgtagtgagg aaaagttttt 4980cagcggtttc
agaccgtcag ccatgggcat tttggagaga gtttgctgca aaagttctag
5040tctgttccac agttcagtga tgtgttctat ggcatctcga tccagcagac
ctcctcgttt 5100cgcgggtttg gacggctcct ggaatagggt atgagacgat
gggcgtccag cgctgccagg 5160gttcggtcct tccagggtct cagtgttcga
gtcagggttg tttccgtcac agtgaagggg 5220tgtgcgcctg cttgggcgct
tgccagggtg cgcttcagac tcatcctgct ggtcgaaaac 5280ttctgtcgct
tggcgccctg tatgtcggcc aagtagcagt ttaccatgag ttcgtagttg
5340agcgcctcgg ctgcgtggcc tttggcgcgg agcttacctt tggaagtttt
cttgcatacc 5400gggcagtata ggcatttcag cgcatacaac ttgggcgcaa
ggaaaacgga ttctggggag 5460tatgcatctg cgccgcagga ggcgcaaaca
gtttcacatt ccaccagcca ggttaaatcc 5520ggttcattgg ggtcaaaaac
aagttttccg ccatattttt tgatgcgttt cttacctttg 5580gtctccatga
gttcgtgtcc tcgttgagtg acaaacaggc tgtccgtgtc cccgtagact
5640gattttacag gcctcttctc cagtggagtg cctcggtctt cttcgtacag
gaactctgac 5700cactctgata caaaggcgcg cgtccaggcc agcacaaagg
aggctatgtg ggaggggtag 5760cgatcgttgt caaccagggg gtccaccttt
tccaaagtat gcaaacacat gtcaccctct 5820tcaacatcca ggaatgtgat
tggcttgtag gtgtatttca cgtgacctgg ggtccccgct 5880gggggggtat
aaaagggggc ggttctttgc tcttcctcac tgtcttccgg atcgctgtcc
5940aggaacgtca gctgttgggg taggtattcc ctctcgaagg cgggcatgac
ctctgcactc 6000aggttgtcag tttctaagaa cgaggaggat ttgatattga
cagtgccggt tgagatgcct 6060ttcatgaggt tttcgtccat ctggtcagaa
aacacaattt ttttattgtc aagtttggtg 6120gcaaatgatc catacagggc
gttggataaa agtttggcaa tggatcgcat ggtttggttc 6180ttttccttgt
ccgcgcgctc tttggcggcg atgttgagtt ggacatactc gcgtgccagg
6240cacttccatt cggggaagat agttgttaat tcatctggca cgattctcac
ttgccaccct 6300cgattatgca aggtaattaa atccacactg gtggccacct
cgcctcgaag gggttcattg 6360gtccaacaga gcctacctcc tttcctagaa
cagaaagggg gaagtgggtc tagcataagt 6420tcatcgggag ggtctgcatc
catggtaaag attcccggaa gtaaatcctt atcaaaatag 6480ctgatgggag
tggggtcatc taaggccatt tgccattctc gagctgccag tgcgcgctca
6540tatgggttaa ggggactgcc ccatggcatg ggatgggtga gtgcagaggc
atacatgcca 6600cagatgtcat agacgtagat gggatcctca aagatgccta
tgtaggttgg atagcatcgc 6660ccccctctga tacttgctcg cacatagtca
tatagttcat gtgatggcgc tagcagcccc 6720ggacccaagt tggtgcgatt
gggtttttct gttctgtaga cgatctggcg aaagatggcg 6780tgagaattgg
aagagatggt gggtctttga aaaatgttga aatgggcatg aggtagacct
6840acagagtctc tgacaaagtg ggcataagat tcttgaagct tggttaccag
ttcggcggtg 6900acaagtacgt ctagggcgca gtagtcaagt gtttcttgaa
tgatgtcata acctggttgg 6960tttttctttt cccacagttc gcggttgaga
aggtattctt cgcgatcctt ccagtactct 7020tctagcggaa acccgtcttt
gtctgcacgg taagatccta gcatgtagaa ctgattaact 7080gccttgtaag
ggcagcagcc cttctctacg ggtagagagt atgcttgagc agcttttcgt
7140agcgaagcgt gagtaagggc aaaggtgtct ctgaccatga ctttgaggaa
ttggtatttg 7200aagtcgatgt cgtcacaggc tccctgttcc cagagttgga
agtctacccg tttcttgtag 7260gcggggttgg gcaaagcgaa agtaacatca
ttgaagagaa tcttgccggc cctgggcatg 7320aaattgcgag tgatgcgaaa
aggctgtggt acttccgctc ggttattgat aacctgggca 7380gctaggacga
tctcgtcgaa accgttgatg ttgtgtccta cgatgtataa ttctatgaaa
7440cgcggcgtgc ctctgacgtg aggtagctta ctgagctcat caaaggttag
gtctgtgggg 7500tcagataagg cgtagtgttc gagagcccat tcgtgcaggt
gaggattcgc tttaaggaag 7560gaggaccaga ggtccactgc cagtgctgtt
tgtaactggt cccggtactg acgaaaatgc 7620cgtccgactg ccattttttc
tggggtgacg caatagaagg tttgggggtc ctgccgccag 7680cgatcccact
tgagttttat ggcgaggtca taggcgatgt tgacgagccg ctggtctcca
7740gagagtttca tgaccagcat gaaggggatt agctgcttgc caaaggaccc
catccaggtg 7800taggtttcca catcgtaggt gagaaagagc ctttctgtgc
gaggatgaga gccaatcggg 7860aagaactgga tctcctgcca ccagttggag
gaatggctgt tgatgtgatg gaagtagaac 7920tccctgcgac gcgccgagca
ttcatgcttg tgcttgtaca gacggccgca gtagtcgcag 7980cgttgcacgg
gttgtatctc gtgaatgagt tgtacctggc ttcccttgac gagaaatttc
8040agtgggaagc cgaggcctgg cgattgtatc tcgtgcttta ctatgttgtc
tgcatcggcc 8100tgttcatctt ctgtctcgat ggtggtcatg ctgacgagcc
ctcgcgggag gcaagtccag 8160acctcggcgc ggcaggggcg gagctcgagg
acgagagcgc gcaggctgga gctgtccagg 8220gtcctgagac gctgcggact
caggttagta ggcagtgtca ggagattaac ttgcatgatc 8280ttttggaggg
cgtgcgggag gttcagatag tacttgatct caacgggtcc gttggtggag
8340atgtcgatgg cttgcagggt tccgtgtccc ttgggcgcta ccaccgtgcc
cttgtttttc 8400attttggacg gcggtggctc tgttgcttct tgcatgttta
gaagcggtgt cgagggcgcg 8460caccgggcgg caggggcggc tcgggacccg
gcggcatggc tggcagtggt acgtcggcgc 8520cgcgcgcggg taggttctgg
tactgcgccc tgagaagact cgcatgcgcg acgacgcggc 8580ggttgacatc
ctggatctga cgcctctggg tgaaagctac cggccccgtg agcttgaacc
8640tgaaagagag ttcaacagaa tcaatctcgg tatcgttgac ggcggcttgc
ctaaggattt 8700cttgcacgtc accagagttg tcctggtagg cgatctccgc
catgaactgc tcgatctctt 8760cctcttgaag atctccgcgg cccgctctct
cgacggtggc cgcgaggtcg ttggagatgc 8820gcccaatgag ttgagagaat
gcattcatgc ccgcctcgtt ccagacgcgg ctgtagacca 8880cggcccccac
gggatctctc gcgcgcatga ccacctgggc gaggttgagc tccacgtggc
8940gggtgaagac cgcatagttg cataggcgct ggaaaaggta gttgagtgtg
gtggcgatgt 9000gctcggtgac gaagaaatac atgatccatc gtctcagcgg
catctcgctg acatcgccca 9060gagcttccaa gcgctccatg gcctcgtaga
agtccacggc aaaattaaaa aactgggagt 9120ttcgcgcgga cacggtcaac
tcctcttcca gaagacggat aagttcggcg atggtggtgc 9180gcacctcgcg
ctcgaaagcc cctgggattt cttcctcaat ctcttcttct tccactaaca
9240tctcttcctc ttcaggtggg gctgcaggag gagggggaac gcggcgacgc
cggcggcgca 9300cgggcagacg gtcgatgaat ctttcaatga cctctccgcg
gcggcggcgc atggtttcag 9360tgacggcgcg gccgttctcg cgcggtcgca
gagtaaaaac accgccgcgc atctccttaa 9420agtggtgact gggaggttct
ccgtttggga gggagagggc gctgattata cattttatta 9480attggcccgt
agggactgca cgcagagatc tgatcgtgtc aagatccacg ggatctgaaa
9540acctttcgac gaaagcgtct aaccagtcac agtcacaagg taggctgagt
acggcttctt 9600gtgggcgggg gtggttatgt gttcggtctg ggtcttctgt
ttcttcttca tctcgggaag 9660gtgagacgat gctgctggtg atgaaattaa
agtaggcagt tctaagacgg cggatggtgg 9720cgaggagcac caggtctttg
ggtccggctt gctggatacg caggcgattg gccattcccc 9780aagcattatc
ctgacatcta gcaagatctt tgtagtagtc ttgcatgagc cgttctacgg
9840gcacttcttc ctcacccgtt ctgccatgca tacgtgtgag tccaaatccg
cgcattggtt 9900gtaccagtgc caagtcagct acgactcttt cggcgaggat
ggcttgctgt acttgggtaa 9960gggtggcttg aaagtcatca aaatccacaa
agcggtggta agctcctgta ttaatggtgt 10020aagcacagtt ggccatgact
gaccagttaa ctgtctggtg accagggcgc acgagctcgg 10080tgtatttaag
gcgcgaatag gcgcgggtgt caaagatgta atcgttgcag gtgcgcacca
10140gatactggta ccctataaga aaatgcggcg gtggttggcg gtagagaggc
catcgttctg 10200tagctggagc gccaggggcg aggtcttcca acataaggcg
gtgatagccg tagatgtacc 10260tggacatcca ggtgattcct gcggcggtag
tagaagcccg aggaaactcg cgtacgcggt 10320tccaaatgtt gcgtagcggc
atgaagtagt tcattgtagg cacggtttga ccagtgaggc 10380gcgcgcagtc
attgatgctc tatagacacg gagaaaatga aagcgttcag cgactcgact
10440ccgtagcctg gaggaacgtg aacgggttgg gtcgcggtgt accccggttc
gagacttgta 10500ctcgagccgg ccggagccgc ggctaacgtg gtattggcac
tcccgtctcg acccagccta 10560caaaaatcca ggatacggaa tcgagtcgtt
ttgctggttt ccgaatggca gggaagtgag 10620tcctattttt tttttttgcc
gctcagatgc atcccgtgct gcgacagatg cgcccccaac 10680aacagccccc
ctcgcagcag cagcagcagc aatcacaaaa ggctgtccct gcaactactg
10740caactgccgc cgtgagcggt gcgggacagc ccgcctatga tctggacttg
gaagagggcg 10800aaggactggc acgtctaggt gcgccttcac ccgagcggca
tccgcgagtt caactgaaaa 10860aagattctcg cgaggcgtat gtgccccaac
agaacctatt tagagacaga agcggcgagg 10920agccggagga gatgcgagct
tcccgcttta acgcgggtcg tgagctgcgt cacggtttgg 10980accgaagacg
agtgttgcgg gacgaggatt tcgaagttga tgaaatgaca gggatcagtc
11040ctgccagggc acacgtggct gcagccaacc ttgtatcggc ttacgagcag
acagtaaagg 11100aagagcgtaa cttccaaaag tcttttaata atcatgtgcg
aaccctgatt gcccgcgaag 11160aagttaccct tggtttgatg catttgtggg
atttgatgga agctatcatt cagaacccta 11220ctagcaaacc tctgaccgcc
cagctgtttc tggtggtgca acacagcaga gacaatgagg 11280ctttcagaga
ggcgctgctg aacatcaccg aacccgaggg gagatggttg tatgatctta
11340tcaacattct acagagtatc atagtgcagg agcggagcct gggcctggcc
gagaaggtgg 11400ctgccatcaa ttactcggtt ttgagcttgg gaaaatatta
cgctcgcaaa atctacaaga 11460ctccatacgt tcccatagac aaggaggtga
agatagatgg gttctacatg cgcatgacgc 11520tcaaggtctt gaccctgagc
gatgatcttg gggtgtatcg caatgacaga atgcatcgcg 11580cggttagcgc
cagcaggagg cgcgagttaa gcgacaggga actgatgcac agtttgcaaa
11640gagctctgac tggagctgga accgagggtg agaattactt cgacatggga
gctgacttgc 11700agtggcagcc tagtcgcagg gctctgagcg ccgcgacggc
aggatgtgag cttccttaca 11760tagaagaggc ggatgaaggc gaggaggaag
agggcgagta cttggaagac tgatggcaca 11820acccgtgttt tttgctagat
ggaacagcaa gcaccggatc ccgcaatgcg ggcggcgctg 11880cagagccagc
cgtccggcat taactcctcg gacgattgga cccaggccat gcaacgtatc
11940atggcgttga cgactcgcaa ccccgaagcc tttagacagc aaccccaggc
caaccgtcta 12000tcggccatca tggaagctgt agtgccttcc cgctctaatc
ccactcatga gaaggtcctg 12060gccatcgtga acgcgttggt ggagaacaaa
gctattcgtc cagatgaggc cggactggta 12120tacaacgctc tcttagaacg
cgtggctcgc tacaacagta gcaatgtgca aaccaatttg 12180gaccgtatga
taacagatgt acgcgaagcc gtgtctcagc gcgaaaggtt ccagcgtgat
12240gccaacctgg gttcgctggt ggcgttaaat gctttcttga gtactcagcc
tgctaatgtg 12300ccgcgtggtc aacaggatta tactaacttt ttaagtgctt
tgagactgat ggtatcagaa 12360gtacctcaga gcgaagtgta tcagtccggt
cctgattact tctttcagac tagcagacag 12420ggcttgcaga cggtaaatct
gagccaagct tttaaaaacc tttaaaggtt tgtggggagt 12480gcatgccccg
gtaggagaaa gagcaaccgt gtctagcttg ttaactccga actcccgcct
12540attattactg ttggtagctc ctttcaccga cagcggtagc atcgaccgta
attcctattt 12600gggttaccta ctaaacctgt atcgcgaagc catagggcaa
agtcaggtgg acgagcagac 12660ctatcaagaa attacccaag tcagtcgcgc
tttgggacag gaagacactg gcagtttgga 12720agccactctg aacttcttgc
ttaccaatcg gtctcaaaag atccctcctc aatatgctct 12780tactgcggag
gaggagagga tccttagata tgtgcagcag agcgtgggat tgtttctgat
12840gcaagagggg gcaactccga ctgcagcact ggacatgaca gcgcgaaata
tggagcccag 12900catgtatgcc agtaaccgac ctttcattaa caaactgctg
gactacttgc acagagctgc 12960cgctatgaac tctgattatt tcaccaatgc
catcttaaac ccgcactggc tgcccccacc 13020tggtttctac acgggcgaat
atgacatgcc cgaccctaat gacggatttc tgtgggacga 13080cgtggacagc
gatgtttttt cacctctttc tgatcatcgc acgtggaaaa aggaaggcgg
13140cgatagaatg cattcttctg catcgctgtc cggggtcatg ggtgctaccg
cggctgagcc 13200cgagtctgca agtccttttc ctagtctacc cttttctcta
cacagtgtac gtagcagcga 13260agtgggtaga ataagtcgcc cgagtttaat
gggcgaagag
gagtatctaa acgattcctt 13320gctcagaccg gcaagagaaa aaaatttccc
aaacaatgga atagaaagtt tggtggataa 13380aatgagtaga tggaagactt
atgctcagga tcacagagac gagcctggga tcatggggat 13440tacaagtaga
gcgagccgta gacgccagcg ccatgacaga cagaggggtc ttgtgtggga
13500cgatgaggat tcggccgatg atagcagcgt gctggacttg ggtgggagag
gaaggggcaa 13560cccgtttgct catttgcgcc ctcgcttggg tggtatgttg
taaaaaaaaa taaaaaaaaa 13620actcaccaag gccatggcga cgagcgtacg
ttcgttcttc tttattatct gtgtctagta 13680taatgaggcg agtcgtgcta
ggcggagcgg tggtgtatcc ggagggtcct cctccttcgt 13740acgagagcgt
gatgcagcag cagcaggcga cggcggtgat gcaatcccca ctggaggctc
13800cctttgtgcc tccgcgatac ctggcaccta cggagggcag aaacagcatt
cgttattcgg 13860aactggcacc tcagtacgat accaccaggt tgtatctggt
ggacaacaag tcggcggaca 13920ttgcttctct gaactatcag aatgaccaca
gcaacttctt gaccacggtg gtgcaaaaca 13980atgactttac ccctacggaa
gccagcaccc agaccattaa ctttgatgaa cgatcgcggt 14040ggggcggtca
gctaaagacc atcatgcata ctaacatgcc aaacgtgaac gagtatatgt
14100ttagtaacaa gttcaaagcg cgtgtgatgg tgtccagaaa acctcccgac
ggtgctgcag 14160ttggggatac ttatgatcac aagcaggata ttttgaaata
tgagtggttc gagtttactt 14220tgccagaagg caacttttca gttactatga
ctattgattt gatgaacaat gccatcatag 14280ataattactt gaaagtgggt
agacagaatg gagtgcttga aagtgacatt ggtgttaagt 14340tcgacaccag
gaacttcaag ctgggatggg atcccgaaac caagttgatc atgcctggag
14400tgtatacgta tgaagccttc catcctgaca ttgtcttact gcctggctgc
ggagtggatt 14460ttaccgagag tcgtttgagc aaccttcttg gtatcagaaa
aaaacagcca tttcaagagg 14520gttttaagat tttgtatgaa gatttagaag
gtggtaatat tccggccctc ttggatgtag 14580atgcctatga gaacagtaag
aaagaacaaa aagccaaaat agaagctgct acagctgctg 14640cagaagctaa
ggcaaacata gttgccagcg actctacaag ggttgctaac gctggagagg
14700tcagaggaga caattttgcg ccaacacctg ttccgactgc agaatcatta
ttggccgatg 14760tgtctgaagg aacggacgtg aaactcacta ttcaacctgt
agaaaaagat agtaagaata 14820gaagctataa tgtgttggaa gacaaaatca
acacagccta tcgcagttgg tatctttcgt 14880acaattatgg cgatcccgaa
aaaggagtgc gttcctggac attgctcacc acctcagatg 14940tcacctgcgg
agcagagcag gtctactggt cgcttccaga catgatgaag gatcctgtca
15000ctttccgctc cactagacaa gtcagtaact accctgtggt gggtgcagag
cttatgcccg 15060tcttctcaaa gagcttctac aacgaacaag ctgtgtactc
ccagcagctc cgccagtcca 15120cctcgcttac gcacgtcttc aaccgctttc
ctgagaacca gattttaatc cgtccgccgg 15180cgcccaccat taccaccgtc
agtgaaaacg ttcctgctct cacagatcac gggaccctgc 15240cgttgcgcag
cagtatccgg ggagtccaac gtgtgaccgt tactgacgcc agacgccgca
15300cctgtcccta cgtgtacaag gcactgggca tagtcgcacc gcgcgtcctt
tcaagccgca 15360ctttctaaaa aaaaaaaaaa tgtccattct tatctcgccc
agtaataaca ccggttgggg 15420tctgcgcgct ccaagcaaga tgtacggagg
cgcacgcaaa cgttctaccc aacatcctgt 15480ccgtgttcgc ggacattttc
gcgctccatg gggcgccctc aagggccgca ctcgcgttcg 15540aaccaccgtc
gatgatgtaa tcgatcaggt ggttgccgac gcccgtaatt atactcctac
15600tgcgcctaca tctactgtgg atgcagttat tgacagtgta gtggctgacg
ctcgcaacta 15660tgctcgacgt aagagccggc gaaggcgcat tgccagacgc
caccgagcta ccactgccat 15720gcgagccgca agagctctgc tacgaagagc
tagacgcgtg gggcgaagag ccatgcttag 15780ggcggccaga cgtgcagctt
cgggcgccag cgccggcagg tcccgcaggc aagcagccgc 15840tgtcgcagcg
gcgactattg ccgacatggc ccaatcgcga agaggcaatg tatactgggt
15900gcgtgacgct gccaccggtc aacgtgtacc cgtgcgcacc cgtccccctc
gcacttagaa 15960gatactgagc agtctccgat gttgtgtccc agcggcgagg
atgtccaagc gcaaatacaa 16020ggaagaaatg ctgcaggtta tcgcacctga
agtctacggc caaccgttga aggatgaaaa 16080aaaaccccgc aaaatcaagc
gggttaaaaa ggacaaaaaa gaagaggaag atggcgatga 16140tgggctggcg
gagtttgtgc gcgagtttgc cccacggcga cgcgtgcaat ggcgtgggcg
16200caaagttcga catgtgttga gacctggaac ttcggtggtc tttacacccg
gcgagcgttc 16260aagcgctact tttaagcgtt cctatgatga ggtgtacggg
gatgatgata ttcttgagca 16320ggcggctgac cgattaggcg agtttgctta
tggcaagcgt agtagaataa cttccaagga 16380tgagacagtg tcgataccct
tggatcatgg aaatcccacc cctagtctta aaccggtcac 16440tttgcagcaa
gtgttacccg taactccgcg aacaggtgtt aaacgcgaag gtgaagattt
16500gtatcccact atgcaactga tggtacccaa acgccagaag ttggaggacg
ttttggagaa 16560agtaaaagtg gatccagata ttcaacctga ggttaaagtg
agacccatta agcaggtagc 16620gcctggtctg ggggtacaaa ctgtagacat
taagattccc actgaaagta tggaagtgca 16680aactgaaccc gcaaagccta
ctgccacctc cactgaagtg caaacggatc catggatgcc 16740catgcctatt
acaactgacg ccgccggtcc cactcgaaga tcccgacgaa agtacggtcc
16800agcaagtctg ttgatgccca attatgttgt acacccatct attattccta
ctcctggtta 16860ccgaggcact cgctactatc gcagccgaaa cagtacctcc
cgccgtcgcc gcaagacacc 16920tgcaaatcgc agtcgtcgcc gtagacgcac
aagcaaaccg actcccggcg ccctggtgcg 16980gcaagtgtac cgcaatggta
gtgcggaacc tttgacactg ccgcgtgcgc gttaccatcc 17040gagtatcatc
acttaatcaa tgttgccgct gcctccttgc agatatggcc ctcacttgtc
17100gccttcgcgt tcccatcact ggttaccgag gaagaaactc gcgccgtaga
agagggatgt 17160tgggacgcgg aatgcgacgc tacaggcgac ggcgtgctat
ccgcaagcaa ttgcggggtg 17220gttttttacc agccttaatt ccaattatcg
ctgctgcaat tggcgcgata ccaggcatag 17280cttccgtggc ggttcaggcc
tcgcaacgac attgacattg gaaaaaaacg tataaataaa 17340aaaaaaaaaa
tacaatggac tctgacactc ctggtcctgt gactatgttt tcttagagat
17400ggaagacatc aatttttcat ccttggctcc gcgacacggc acgaagccgt
acatgggcac 17460ctggagcgac atcggcacga gccaactgaa cgggggcgcc
ttcaattgga gcagtatctg 17520gagcgggctt aaaaattttg gctcaaccat
aaaaacatac gggaacaaag cttggaacag 17580cagtacagga caggcgctta
gaaataaact taaagaccag aacttccaac aaaaagtagt 17640cgatgggata
gcttccggca tcaatggagt ggtagatttg gctaaccagg ctgtgcagaa
17700aaagataaac agtcgtttgg acccgccgcc agcaacccca ggtgaaatgc
aagtggagga 17760agaaattcct ccgccagaaa aacgaggcga caagcgtccg
cgtcccgatt tggaagagac 17820gctggtgacg cgcgtagatg aaccgccttc
ttatgaggaa gcaacgaagc ttggaatgcc 17880caccactaga ccgatagccc
caatggccac cggggtgatg aaaccttctc agttgcatcg 17940acccgtcacc
ttggatttgc cccctccccc tgctgctact gctgtacccg cttctaagcc
18000tgtcgctgcc ccgaaaccag tcgccgtagc caggtcacgt cccgggggcg
ctcctcgtcc 18060aaatgcgcac tggcaaaata ctctgaacag catcgtgggt
ctaggcgtgc aaagtgtaaa 18120acgccgtcgc tgcttttaat taaatatgga
gtagcgctta acttgcctat ctgtgtatat 18180gtgtcattac acgccgtcac
agcagcagag gaaaaaagga agaggtcgtg cgtcgacgct 18240gagttacttt
caagatggcc accccatcga tgctgcccca atgggcatac atgcacatcg
18300ccggacagga tgcttcggag tacctgagtc cgggtctggt gcagttcgcc
cgcgccacag 18360acacctactt caatctggga aataagttta gaaatcccac
cgtagcgccg acccacgatg 18420tgaccaccga ccgtagccag cggctcatgt
tgcgcttcgt gcccgttgac cgggaggaca 18480atacatactc ttacaaagtg
cggtacaccc tggccgtggg cgacaacaga gtgctggata 18540tggccagcac
gttctttgac attaggggtg tgttggacag aggtcccagt ttcaaaccct
18600attctggtac ggcttacaac tccctggctc ctaaaggcgc tccaaataca
tctcagtgga 18660ttgcagaagg tgtaaaaaat acaactggtg aggaacacgt
aacagaagag gaaaccaata 18720ctactactta cacttttggc aatgctcctg
taaaagctga agctgaaatt acaaaagaag 18780gactcccagt aggtttggaa
gtttcagatg aagaaagtaa accgatttat gctgataaaa 18840catatcagcc
agaacctcag ctgggagatg aaacttggac tgaccttgat ggaaaaaccg
18900aaaagtatgg aggcagggct ctcaaacccg atactaagat gaaaccatgc
tacgggtcct 18960ttgccaaacc tactaatgtg aaaggcggtc aggcaaaaca
aaaaacaacg gagcagccaa 19020atcagaaagt cgaatatgat atcgacatgg
agttttttga tgcggcatcg cagaaaacaa 19080acttaagtcc taaaattgtc
atgtatgcag aaaatgtaaa tttggaaact ccagacactc 19140atgtagtgta
caaacctgga acagaagaca caagttccga agctaatttg ggacaacaat
19200ctatgcccaa cagacccaac tacattggct tcagagataa ctttattgga
cttatgtact 19260ataacagtac tggtaacatg ggggtgctgg ctggtcaagc
gtctcagtta aatgcagtgg 19320ttgacttgca ggacagaaac acagaacttt
cttaccaact cttgcttgac tctctgggcg 19380acagaaccag atactttagc
atgtggaatc aggctgtgga cagttatgat cctgatgtac 19440gtgttattga
aaatcatggt gtggaagatg aacttcccaa ctactgtttt ccactggacg
19500gcataggtgt tccaacaacc agttacaaat caatagttcc aaatggagac
aatgcgccta 19560attggaagga acctgaagta aatggaacaa gtgagatcgg
acagggtaat ttgtttgcca 19620tggaaattaa ccttcaagcc aatctatggc
gaagtttcct ttattccaat gtggctctat 19680atctcccaga ctcgtacaaa
tacaccccgt ccaatgtcac tcttccagaa aacaaaaaca 19740cctacgacta
catgaacggg cgggtggtgc cgccatctct agtagacacc tatgtgaaca
19800ttggtgccag gtggtctctg gatgccatgg acaatgtcaa cccattcaac
caccaccgta 19860acgctggctt gcgttaccga tccatgcttc tgggtaacgg
acgttatgtg cctttccaca 19920tacaagtgcc tcaaaaattc ttcgctgtta
aaaacctgct gcttctccca ggctcctaca 19980cttatgagtg gaactttagg
aaggatgtga acatggttct acagagttcc ctcggtaacg 20040acctgcgggt
agatggcgcc agcatcagtt tcacgagcat caacctctat gctacttttt
20100tccccatggc tcacaacacc gcttccaccc ttgaagccat gctgcggaat
gacaccaatg 20160atcagtcatt caacgactac ctatctgcag ctaacatgct
ctaccccatt cctgccaatg 20220caaccaatat tcccatttcc attccttctc
gcaactgggc ggctttcaga ggctggtcat 20280ttaccagact gaaaaccaaa
gaaactccct ctttggggtc tggatttgac ccctactttg 20340tctattctgg
ttctattccc tacctggatg gtaccttcta cctgaaccac acttttaaga
20400aggtttccat catgtttgac tcttcagtga gctggcctgg aaatgacagg
ttactatctc 20460ctaacgaatt tgaaataaag cgcactgtgg atggcgaagg
ctacaacgta gcccaatgca 20520acatgaccaa agactggttc ttggtacaga
tgctcgccaa ctacaacatc ggctatcagg 20580gcttctacat tccagaagga
tacaaagatc gcatgtattc atttttcaga aacttccagc 20640ccatgagcag
gcaggtggtt gatgaggtca attacaaaga cttcaaggcc gtcgccatac
20700cctaccaaca caacaactct ggctttgtgg gttacatggc tccgaccatg
cgccaaggtc 20760aaccctatcc cgctaactat ccctatccac tcattggaac
aactgccgta aatagtgtta 20820cgcagaaaaa gttcttgtgt gacagaacca
tgtggcgcat accgttctcg agcaacttca 20880tgtctatggg ggcccttaca
gacttgggac agaatatgct ctatgccaac tcagctcatg 20940ctctggacat
gacctttgag gtggatccca tggatgagcc caccctgctt tatcttctct
21000tcgaagtttt cgacgtggtc agagtgcatc agccacaccg cggcatcatc
gaggcagtct 21060acctgcgtac accgttctcg gccggtaacg ctaccacgta
agaagcttct tgcttcttgc 21120aaatagcagc tgcaaccatg gcctgcggat
cccaaaacgg ctccagcgag caagagctca 21180gagccattgt ccaagacctg
ggttgcggac cctatttttt gggaacctac gataagcgct 21240tcccggggtt
catggccccc gataagctcg cctgtgccat tgtaaatacg gccggacgtg
21300agacgggggg agagcactgg ttggctttcg gttggaaccc acgttctaac
acctgctacc 21360tttttgatcc ttttggattc tcggatgatc gtctcaaaca
gatttaccag tttgaatatg 21420agggtctcct gcgccgcagc gctcttgcta
ccaaggaccg ctgtattacg ctggaaaaat 21480ctacccagac cgtgcagggt
ccccgttctg ccgcctgcgg acttttctgc tgcatgttcc 21540ttcacgcctt
tgtgcactgg cctgaccgtc ccatggacgg aaaccccacc atgaaattgc
21600taactggagt gccaaacaac atgcttcatt ctcctaaagt ccagcccacc
ctgtgtgaca 21660atcaaaaagc actctaccat tttcttaata cccattcgcc
ttattttcgc tcccatcgta 21720cacacatcga aagggccact gcgttcgacc
gtatggatgt tcaataatga ctcatgtaaa 21780caacgtgttc aataaacatc
actttatttt tttacatgta tcaaggctct gcattactta 21840tttatttaca
agtcgaatgg gttctgacga gaatcagaat gacccgcagg cagtgatacg
21900ttgcggaact gatacttggg ttgccacttg aattcgggaa tcaccaactt
gggaaccggt 21960atatcgggca ggatgtcact ccacagcttt ctggtcagct
gcaaagctcc aagcaggtca 22020ggagccgaaa tcttgaaatc acaattagga
ccagtgcttt gagcgcgaga gttgcggtac 22080accggattgc agcactgaaa
caccatcagc gacggatgtc tcacgcttgc cagcacggtg 22140ggatctgcaa
tcatgcccac atccagatct tcagcattgg caatgctgaa cggggtcatc
22200ttgcaggtct gcctacccat ggcgggcacc caattaggct tgtggttgca
atcgcagtgc 22260agggggatca gtatcatctt ggcctgatcc tgtctgattc
ctggatacac ggctctcatg 22320aaagcatcat attgcttgaa agcctgctgg
gctttactac cctcggtata aaacatcccg 22380caggacctgc tcgaaaactg
gttagctgca cagccggcat cattcacaca gcagcgggcg 22440tcattgttag
ctatttgcac cacacttctg ccccagcggt tttgggtgat tttggttcgc
22500tcgggattct cctttaaggc tcgttgtccg ttctcgctgg ccacatccat
ctcgataatc 22560tgctccttct gaatcataat attgccatgc aggcacttca
gcttgccctc ataatcattg 22620cagccatgag gccacaacgc acagcctgta
cattcccaat tatggtgggc gatctgagaa 22680aaagaatgta tcattccctg
cagaaatctt cccatcatcg tgctcagtgt cttgtgacta 22740gtgaaagtta
actggatgcc tcggtgctcc tcgtttacgt actggtgaca gatgcgcttg
22800tattgttcgt gttgctcagg cattagttta aaagaggttc taagttcgtt
atccagcctg 22860tacttctcca tcagcagaca catcacttcc atgcctttct
cccaagcaga caccaggggc 22920aagctaatcg gattcttaac agtgcaggca
gcagctcctt tagccagagg gtcatcttta 22980gcgatcttct caatgcttct
tttgccatcc ttctcaacga tgcgcacggg cgggtagctg 23040aaacccactg
ctacaagttg cgcctcttct ctttcttctt cgctgtcttg actgatgtct
23100tgcatgggga tatgtttggt cttccttggc ttctttttgg ggggtatcgg
aggaggagga 23160ctgtcgctcc gttccggaga cagggaggat tgtgacgttt
cgctcaccat taccaactga 23220ctgtcggtag aagaacctga ccccacacgg
cgacaggtgt ttctcttcgg gggcagaggt 23280ggaggcgatt gcgaagggct
gcggtccgac ctggaaggcg gatgactggc agaacccctt 23340ccgcgttcgg
gggtgtgctc cctgtggcgg tcgcttaact gatttccttc gcggctggcc
23400attgtgttct cctaggcaga gaaacaacag acatggaaac tcagccattg
ctgtcaacat 23460cgccacgagt gccatcacat ctcgtcctca gcgacgagga
aaaggagcag agcttaagca 23520ttccaccgcc cagtcctgcc accacctcta
ccctagaaga taaggaggtc gacgcatctc 23580atgacatgca gaataaaaaa
gcgaaagagt ctgagacaga catcgagcaa gacccgggct 23640atgtgacacc
ggtggaacac gaggaagagt tgaaacgctt tctagagaga gaggatgaaa
23700actgcccaaa acaacgagca gataactatc accaagatgc tggaaatagg
gatcagaaca 23760ccgactacct catagggctt gacggggaag acgcgctcct
taaacatcta gcaagacagt 23820cgctcatagt caaggatgca ttattggaca
gaactgaagt gcccatcagt gtggaagagc 23880tcagccgcgc ctacgagctt
aacctctttt cacctcgtac tccccccaaa cgtcagccaa 23940acggcacctg
cgagccaaat cctcgcttaa acttttatcc agcttttgct gtgccagaag
24000tactggctac ctatcacatc ttttttaaaa atcaaaaaat tccagtctcc
tgccgcgcta 24060atcgcacccg cgccgatgcc ctactcaatc tgggacctgg
ttcacgctta cctgatatag 24120cttccttgga agaggttcca aagatcttcg
agggtctggg caataatgag actcgggccg 24180caaatgctct gcaaaaggga
gaaaatggca tggatgagca tcacagcgtt ctggtggaat 24240tggaaggcga
taatgccaga ctcgcagtac tcaagcgaag catcgaggtc acacacttcg
24300catatcccgc tgtcaacctg ccccctaaag tcatgacggc ggtcatggac
cagttactca 24360ttaagcgcgc aagtcccctt tcagaagaca tgcatgaccc
agatgcctgt gatgagggta 24420aaccagtggt cagtgatgag cagctaaccc
gatggctggg caccgactct cccagggatt 24480tggaagagcg tcgcaagctt
atgatggccg tggtgctggt taccgtagaa ctagagtgtc 24540tccgacgttt
ctttaccgat tcagaaacct tgcgcaaact cgaagagaat ctgcactaca
24600cttttagaca cggctttgtg cggcaggcat gcaagatatc taacgtggaa
ctcaccaacc 24660tggtttccta catgggtatt ctgcatgaga atcgcctagg
acaaagcgtg ctgcacagca 24720ccctgaaggg ggaagcccgc cgtgattaca
tccgcgattg tgtctatctg tacctgtgcc 24780acacgtggca aaccggcatg
ggtgtatggc agcaatgttt agaagaacag aacttgaaag 24840agcttgacaa
gctcttacag aaatctctta aggttctgtg gacagggttc gacgagcgca
24900ccgtcgcttc cgacctggca gacctcatct tcccagagcg tctcagggtt
actttgcgaa 24960acggattgcc tgactttatg agccagagca tgcttaacaa
ttttcgctct ttcatcctgg 25020aacgctccgg tatcctgccc gccacctgct
gcgcactgcc ctccgacttt gtgcctctca 25080cctaccgcga gtgccccccg
ccgctatgga gtcactgcta cctgttccgt ctggccaact 25140atctctccta
ccactcggat gtgatcgagg atgtgagcgg agacggcttg ctggagtgtc
25200actgccgctg caatctgtgc acgccccacc ggtccctagc ttgcaacccc
cagttgatga 25260gcgaaaccca gataataggc acctttgaat tgcaaggccc
cagcagccaa ggcgatgggt 25320cttctcctgg gcaaagttta aaactgaccc
cgggactgtg gacctccgcc tacttgcgca 25380agtttgctcc ggaagattac
cacccctatg aaatcaagtt ctatgaggac caatcacagc 25440ctccaaaggc
cgaactttcg gcctgcgtca tcacccaggg ggcaattctg gcccaattgc
25500aagccatcca aaaatcccgc caagaatttc tactgaaaaa gggtaagggg
gtctaccttg 25560acccccagac cggcgaggaa ctcaacacaa ggttccctca
ggatgtccca acgacgagaa 25620aacaagaagt tgaaggtgca gccgccgccc
ccagaagata tggaggaaga ttgggacagt 25680caggcagagg aggcggagga
ggacagtctg gaggacagtc tggaggaaga cagtttggag 25740gaggaaaacg
aggaggcaga ggaggtggaa gaagtaaccg ccgacaaaca gttatcctcg
25800gctgcggaga caagcaacag cgctaccatc tccgctccga gtcgaggaac
ccggcggcgt 25860cccagcagta gatgggacga gaccggacgc ttcccgaacc
caaccagcgc ttccaagacc 25920ggtaagaagg atcggcaggg atacaagtcc
tggcgggggc ataagaatgc catcatctcc 25980tgcttgcatg agtgcggggg
caacatatcc ttcacgcggc gctacttgct attccaccat 26040ggggtgaact
ttccgcgcaa tgttttgcat tactaccgtc acctccacag cccctactat
26100agccagcaaa tcccggcagt ctcgacagat aaagacagcg gcggcgacct
ccaacagaaa 26160accagcagcg gcagttagaa aatacacaac aagtgcagca
acaggaggat taaagattac 26220agccaacgag ccagcgcaaa cccgagagtt
aagaaatcgg atctttccaa ccctgtatgc 26280catcttccag cagagtcggg
gtcaagagca ggaactgaaa ataaaaaacc gatctctgcg 26340ttcgctcacc
agaagttgtt tgtatcacaa gagcgaagat caacttcagc gcactctcga
26400ggacgccgag gctctcttca acaagtactg cgcgctgact cttaaagagt
aggcagcgac 26460cgcgcttatt caaaaaaggc gggaattaca tcatcctcga
catgagtaaa gaaattccca 26520cgccttacat gtggagttat caaccccaaa
tgggattggc ggcaggcgcc tcccaggact 26580actccacccg catgaattgg
ctcagcgccg ggccttctat gatttctcga gttaatgata 26640tacgcgccta
ccgaaaccaa atacttttgg aacagtcagc tcttaccacc acgccccgcc
26700aacaccttaa tcccagaaat tggcccgccg ccctagtgta ccaggaaagt
cccgctccca 26760ccactgtatt acttcctcga gacgcccagg ccgaagtcca
aatgactaat gcaggtgcgc 26820agttagctgg cggctccacc ctatgtcgtc
acaggcctcg gcataatata aaacgcctga 26880tgatcagagg ccgaggtatc
cagctcaacg acgagtcggt gagctctccg cttggtctac 26940gaccagacgg
aatctttcag attgccggct gcgggagatc ttccttcacc cctcgtcagg
27000ctgttctgac tttggaaagt tcgtcttcgc aaccccgctc gggcggaatc
gggaccgttc 27060aatttgtgga ggagtttact ccctctgtct acttcaaccc
cttctccgga tctcctgggc 27120attacccgga cgagttcata ccgaacttcg
acgcgattag cgagtcagtg gacggctacg 27180attgatgtct ggtgacgcgg
ctgagctatc tcggctgcga catctagacc actgccgccg 27240ctttcgctgc
tttgcccggg aactcattga gttcatctac ttcgaactcc ccaaggatca
27300ccctcaaggt ccggcccacg gagtgcggat ttctatcgaa ggcaaaatag
actctcgcct 27360gcaacgaatt ttctcccagc ggcccgtgct gatcgagcga
gaccagggaa acaccacggt 27420ttccatctac tgcatttgta atcaccccgg
attgcatgaa agcctttgct gtcttatgtg 27480tactgagttt aataaaaact
gaattaagac tctcctacgg actgccgctt cttcaacccg 27540gattttacaa
ccagaagaac gaaacttttc ctgtcgtcca ggactctgtt aacttcacct
27600ttcctactca caaactagaa gctcaacgac tacaccgctt ttccagaagc
attttcccta 27660ctaatactac tttcaaaacc ggaggtgagc tccaaggtct
tcctacagaa aacccttggg 27720tggaagcggg ccttgtagtg ctaggaattc
ttgcgggtgg gcttgtgatt attctttgct 27780acctatacac accttgcttc
actttcttag tggtgttgtg gtattggttt aaaaaatggg 27840gcccatacta
gtcttgcttg ttttactttc gcttttggaa ccgggttctg ccaattacga
27900tccatgtcta gacttcgacc cagaaaactg cacacttact tttgcacccg
acacaagccg 27960catctgtgga gttcatcgcc tctcttacga acttggcccc
caacgacaaa aatttacctg 28020catggtggga atcaacccca tagttatcac
ccagcaaagt ggagatacta agggttgcat 28080tcactgctcc tgcgattcca
tcgagtgcac ctacaccctg ctgaagaccc tatgcggcct 28140aagagacctg
ctaccaatga attaaaaaat gattaataaa aaatcactta cttgaaatca
28200gcaataaggt ctctgttgaa attttctccc agcagcacct cacttccctc
ttcccaactc 28260tggtattcta aaccccgttc agcggcatac tttctccata
ctttaaaggg gatgtcaaat 28320tttagctcct ctcctgtacc cacaatcttc
atgtctttct
tcccagatga ccaagagagt 28380ccggctcagt gactccttca accctgtcta
cccctatgaa gatgaaagca cctcccaaca 28440cccctttata aacccagggt
ttatttcccc aaatggcttc acacaaagcc caaacggagt 28500tcttacttta
aaatgtttaa ccccactaac aaccacaggc ggatctctac agctaaaagt
28560gggaggggga cttacagtgg atgacaccaa cggttttttg aaagaaaaca
taagtgccac 28620cacaccactc gttaagactg gtcactctat aggtttacca
ctaggagccg gattgggaac 28680gaatgaaaat aaactttgta tcaaattagg
acaaggactt acattcaatt caaacaacat 28740ttgcattgat gacaatatta
acaccttatg gacaggagtc aaccccaccg aagccaactg 28800tcaaatcatg
aactccagtg aatctaatga ttgcaaatta attctaacac tagttaaaac
28860tggagcacta gtcactgcat ttgtttatgt tataggagta tctaacaatt
ttaatatgct 28920aactacacac agaaatataa attttactgc agagctgttt
ttcgattcta ctggtaattt 28980actaactaga ctctcatccc tcaaaactcc
acttaatcat aaatcaggac aaaacatggc 29040tactggtgcc attactaatg
ctaaaggttt catgcccagc acgactgcct atcctttcaa 29100tgataattct
agagaaaaag aaaactacat ttacggaact tgttactaca cagctagtga
29160tcgcactgct tttcccattg acatatctgt catgcttaac cgaagagcaa
taaatgacga 29220gacatcatat tgtattcgta taacttggtc ctggaacaca
ggagatgccc cagaggtgca 29280aacctctgct acaaccctag tcacctcccc
atttaccttt tactacatca gagaagacga 29340ctgacaaata aagtttaact
tgtttatttg aaaatcaatt cacaaaatcc gagtagttat 29400tttgcctccc
ccttcccatt taacagaata caccaatctc tccccacgca cagctttaaa
29460catttggata ccattagata tagacatggt tttagattcc acattccaaa
cagtttcaga 29520gcgagccaat ctggggtcag tgatagataa aaatccatcg
ggatagtctt ttaaagcgct 29580ttcacagtcc aactgctgcg gatgcgactc
cggagtctgg atcacggtca tctggaagaa 29640gaacgatggg aatcataatc
cgaaaacggt atcggacgat tgtgtctcat caaacccaca 29700agcagccgct
gtctgcgtcg ctccgtgcga ctgctgttta tgggatcagg gtccacagtg
29760tcctgaagca tgattttaat agcccttaac atcaactttc tggtgcgatg
cgcgcagcaa 29820cgcattctga tttcactcaa atctttgcag taggtacaac
acattattac aatattgttt 29880aataaaccat aattaaaagc gctccagcca
aaactcatat ctgatataat cgcccctgca 29940tgaccatcat accaaagttt
aatataaatt aaatgacgtt ccctcaaaaa cacactaccc 30000acatacatga
tctcttttgg catgtgcata ttaacaatct gtctgtacca tggacaacgt
30060tggttaatca tgcaacccaa tataaccttc cggaaccaca ctgccaacac
cgctccccca 30120gccatgcatt gaagtgaacc ctgctgatta caatgacaat
gaagaaccca attctctcga 30180ccgtgaatca cttgagaatg aaaaatatct
atagtggcac aacatagaca taaatgcatg 30240catcttctca taatttttaa
ctcctcagga tttagaaaca tatcccaggg aataggaagc 30300tcttgcagaa
cagtaaagct ggcagaacaa ggaagaccac gaacacaact tacactatgc
30360atagtcatag tatcacaatc tggcaacagc gggtggtctt cagtcataga
agctcgggtt 30420tcattttcct cacaacgtgg taactgggct ctggtgtaag
ggtgatgtct ggcgcatgat 30480gtcgagcgtg cgcgcaacct tgtcataatg
gagttgcttc ctgacattct cgtattttgt 30540atagcaaaac gcggccctgg
cagaacacac tcttcttcgc cttctatcct gccgcttagc 30600gtgttccgtg
tgatagttca agtacaacca cactcttaag ttggtcaaaa gaatgctggc
30660ttcagttgta atcaaaactc catcgcatct aatcgttctg aggaaatcat
ccaagcaatg 30720caactggatt gtgtttcaag caggagagga gagggaagag
acggaagaac catgttaatt 30780tttattccaa acgatctcgc agtacttcaa
attgtagatc gcgcagatgg catctctcgc 30840ccccactgtg ttggtgaaaa
agcacagcta gatcaaaaga aatgcgattt tcaaggtgct 30900caacggtggc
ttccagcaaa gcctccacgc gcacatccaa gaacaaaaga ataccaaaag
30960aaggagcatt ttctaactcc tcaatcatca tattacattc ctgcaccatt
cccagataat 31020tttcagcttt ccagccttga attattcgtg tcagttcttg
tggtaaatcc aatccacaca 31080ttacaaacag gtcccggagg gcgccctcca
ccaccattct taaacacacc ctcataatga 31140caaaatatct tgctcctgtg
tcacctgtag cgaattgaga atggcaacat caattgacat 31200gcccttggct
ctaagttctt ctttaagttc tagttgtaaa aactctctca tattatcacc
31260aaactgctta gccagaagcc ccccgggaac aagagcaggg gacgctacag
tgcagtacaa 31320gcgcagacct ccccaattgg ctccagcaaa aacaagattg
gaataagcat attgggaacc 31380gccagtaata tcatcgaagt tgctggaaat
ataatcaggc agagtttctt gtaaaaattg 31440aataaaagaa aaatttgcca
aaaaaacatt caaaacctct gggatgcaaa tgcaataggt 31500taccgcgctg
cgctccaaca ttgttagttt tgaattagtc tgcaaaaata aaaaaaaaaa
31560caagcgtcat atcatagtag cctgacgaac agatggataa atcagtcttt
ccatcacaag 31620acaagccaca gggtctccag ctcgaccctc gtaaaacctg
tcatcatgat taaacaacag 31680caccgaaagt tcctcgcggt gaccagcatg
aataattctt gatgaagcat acaatccaga 31740catgttagca tcagttaacg
agaaaaaaca gccaacatag cctttgggta taattatgct 31800taatcgtaag
tatagcaaag ccacccctcg cggatacaaa gtaaaaggca caggagaata
31860aaaaatataa ttatttctct gctgctgttc aggcaacgtc gcccccggtc
cctctaaata 31920cacatacaaa gcctcatcag ccatggctta ccagacaaag
tacagcgggc acacaaagca 31980caagctctaa agtgactctc caacctctcc
acaatatata tatacacaag ccctaaactg 32040acgtaatggg agtaaagtgt
aaaaaatccc gccaaaccca acacacaccc cgaaactgcg 32100tcaccaggga
aaagtacagt ttcacttccg caatcccaac aggcgtaact tcctctttct
32160cacggtacgt gatatcccac taacttgcaa cgtcattttc ccacggtcgc
accgcccctt 32220ttagccgtta accccacagc caatcaccac acgatccaca
ctttttaaaa tcacctcatt 32280tacatattgg caccattcca tctataaggt
atattatata gatag 32325
* * * * *