U.S. patent application number 16/584608 was filed with the patent office on 2020-01-23 for ptd-smad7 therapeutics.
The applicant listed for this patent is THE REGENTS OF THE UNIVERSITY OF COLORADO, A BODY CORPORATE. Invention is credited to Yosef REFAELI, Xiao-Jing Wang, Qinghong ZHANG.
Application Number | 20200023037 16/584608 |
Document ID | / |
Family ID | 51492023 |
Filed Date | 2020-01-23 |
![](/patent/app/20200023037/US20200023037A1-20200123-D00001.png)
![](/patent/app/20200023037/US20200023037A1-20200123-D00002.png)
![](/patent/app/20200023037/US20200023037A1-20200123-D00003.png)
![](/patent/app/20200023037/US20200023037A1-20200123-D00004.png)
![](/patent/app/20200023037/US20200023037A1-20200123-D00005.png)
![](/patent/app/20200023037/US20200023037A1-20200123-D00006.png)
![](/patent/app/20200023037/US20200023037A1-20200123-D00007.png)
![](/patent/app/20200023037/US20200023037A1-20200123-D00008.png)
![](/patent/app/20200023037/US20200023037A1-20200123-D00009.png)
![](/patent/app/20200023037/US20200023037A1-20200123-D00010.png)
![](/patent/app/20200023037/US20200023037A1-20200123-D00011.png)
View All Diagrams
United States Patent
Application |
20200023037 |
Kind Code |
A1 |
Wang; Xiao-Jing ; et
al. |
January 23, 2020 |
PTD-SMAD7 THERAPEUTICS
Abstract
The present technology provides methods and compositions for the
treatment of inflammatory and/or tissue damage conditions. In
particular, the use of Smad7 compositions delivered locally or
systemically to a site of inflammation and/or tissue damage is
described. Other specific embodiments concern treatment or
prevention of side effects caused by radiation and/or chemotherapy,
including but not limited to oral and gastric mucositis. Also
provided are codon-optimized nucleic acids encoding for Smad7
fusion proteins.
Inventors: |
Wang; Xiao-Jing; (Greenwood
Village, CO) ; ZHANG; Qinghong; (Englewood, CO)
; REFAELI; Yosef; (Denver, CO) |
|
Applicant: |
Name |
City |
State |
Country |
Type |
THE REGENTS OF THE UNIVERSITY OF COLORADO, A BODY
CORPORATE |
Denver |
CO |
US |
|
|
Family ID: |
51492023 |
Appl. No.: |
16/584608 |
Filed: |
September 26, 2019 |
Related U.S. Patent Documents
|
|
|
|
|
|
Application
Number |
Filing Date |
Patent Number |
|
|
14773167 |
Sep 4, 2015 |
10456448 |
|
|
PCT/US2014/022052 |
Mar 7, 2014 |
|
|
|
16584608 |
|
|
|
|
61775252 |
Mar 8, 2013 |
|
|
|
Current U.S.
Class: |
1/1 |
Current CPC
Class: |
A61P 17/06 20180101;
C07K 14/475 20130101; C12N 2740/16322 20130101; A61P 29/00
20180101; A61P 17/02 20180101; C12N 2800/22 20130101; C07K 2319/23
20130101; C07K 14/005 20130101; C07K 14/4703 20130101; C12N 7/00
20130101; A61K 38/162 20130101; A61P 37/02 20180101; A61K 38/18
20130101; A61K 48/005 20130101; C07K 2319/10 20130101; C07K 2319/20
20130101; A61K 38/00 20130101; C07K 14/4702 20130101; A61P 1/02
20180101; C12N 2740/16311 20130101 |
International
Class: |
A61K 38/18 20060101
A61K038/18; C07K 14/47 20060101 C07K014/47; A61K 38/16 20060101
A61K038/16; C07K 14/005 20060101 C07K014/005; C07K 14/475 20060101
C07K014/475; C12N 7/00 20060101 C12N007/00 |
Goverment Interests
STATEMENT OF GOVERNMENT INTEREST
[0002] This invention was made with government support under grant
number AR061796 awarded by the National Institutes of Health. The
government has certain rights in the invention.
Claims
1-48. (canceled)
49. A protein molecule comprising a protein transduction domain and
a human mothers against decapentaplegic-7 protein (Smad7) fragment
comprising amino acids 259-426 of the human Smad7 protein, wherein
the Smad7 fragment retains one or more biological activities of the
functional full-length Smad7 protein, and with the proviso that the
protein molecule lacks at least amino acids 1-203 of the human
Smad7 protein.
50. The protein molecule of claim 49, wherein the human mothers
against decapentaplegic-7 protein (Smad7) fragment consists of
amino acids 259-426 of the human Smad7 protein.
51. The protein molecule of claim 49, wherein the protein
transduction domain is Tat.
52. The protein molecule of claim 50, wherein the protein
transduction domain is Tat.
53. A pharmaceutical composition comprising the protein molecule of
claim 49, and one or more pharmaceutically acceptable
excipients.
54. A method for treating an inflammatory condition in a subject in
need thereof, comprising administering to the subject a
therapeutically effective amount of the protein molecule of claim
49.
55. The method of claim 54, wherein the inflammatory condition is
selected from the group consisting of radiation-induced damage,
psoriasis, stomatitis, proctitis, ulcerative colitis, Crohn's
disease, irritable bowel disease, and inflammatory bowel
disease.
56. A method for treating a disease or disorder in a subject in
need thereof, comprising administering to the subject a
therapeutically effective amount of the protein of claim 49,
wherein the administration of the protein results in an increase in
one or more of cell proliferation or cell migration, or reducing
one or more of apoptosis or DNA damage in the subject.
57. A method for treating a disease or disorder in a subject in
need thereof, comprising administering to the subject a
therapeutically effective amount of the protein of claim 49,
wherein the disease or disorder is selected from the group
consisting of promoting wound healing, treating inflammatory
diseases, aberrant healing, auto-immune diseases and auto-immune
disorders.
58. The method of claim 57, wherein the promoting wound healing
comprises promoting wound healing of chronic wounds, acute wounds
or mucositis.
59. The method of claim 58, wherein the chronic wounds are selected
from the group consisting of diabetic ulcers, pressure ulcers,
venous ulcers, and oral ulcers.
60. The method of claim 58, wherein the acute wounds are selected
from the group consisting of trauma-induced wounds, surgical
wounds, and scarring.
61. The method of claim 58, wherein the promoting healing in
mucositis comprises promoting healing in radiation-induced
mucositis and chemotherapy-induced mucositis.
62. The method of claim 58, wherein the mucositis is selected from
the group consisting of oral mucositis and gastrointestinal
mucositis.
63. The method of claim 57, wherein the aberrant healing comprises
scarring and fibrosis.
64. The method claim 57, wherein the autoimmune diseases and
auto-immune disorders are selected from the group consisting of
psoriasis, stomatitis, proctitis, ulcerative colitis, Crohn's
disease, irritable bowel disease, and inflammatory bowel disease.
Description
CROSS-REFERENCE TO RELATED APPLICATIONS
[0001] The present application is a continuation of U.S. patent
application Ser. No. 14/773,167, filed Sep. 4, 2015, which is a
National Stage Application under 35 U.S.C. .sctn. 371 of
International Application Serial No. PCT/US2014/022052, filed on
Mar. 7, 2014, which claims the benefit of and priority to U.S.
provisional patent application U.S. Ser. No. 61/775,252, filed Mar.
8, 2013, all of which are incorporated herein by reference in their
entireties.
SEQUENCE LISTING
[0003] The instant application contains a Sequence Listing which
has been submitted electronically in ASCII format and is hereby
incorporated by reference in its entirety. Said ASCII copy, created
on Apr. 29, 2014, is named 089491-0303_SL.txt and is 120,505 bytes
in size.
BACKGROUND
[0004] Oral mucositis, a severe oral ulceration, is a common
adverse effect of a large dose of radiation for bone marrow
transplant or craniofacial radiotherapy for cancer. Severe oral
mucositis could require feeding tubes, management of severe pain,
and prematurely halting radiotherapy. Excessive inflammation and
epithelial ablation are key features of oral mucositis.
[0005] Palifermin, a KGF (human keratinocyte growth factor)
recombinant protein, is approved for preventing oral mucositis in
bone-marrow transplant patients. Two Palifermin clinical trials in
head and neck cancer patients showed that Palifermin reduced severe
oral mucositis incidence from 67% and 69% to 51% and 54%,
respectively. Other oral mucositis drugs in clinical trials or
pre-clinical studies include growth factors, agents for
radioprotection, anti-inflammatory agents or immune modulators.
[0006] The modest effects of Palifermin and drugs being developed
in the above mentioned categories highlight the need for
identification of biomarkers for novel therapies. However, the lack
of routine diagnostic biopsies or discarded tissues from oral
mucositis patients has hindered this effort.
[0007] Cutaneous wound healing progresses through three overlapping
phases: inflammation, tissue formation, and tissue remodeling.
These are dynamic processes that involve interactions among the
epidermis, leukocytes, extracellular matrix (ECM), and dermal
fibroblasts. In response to skin injury, blood clots, infiltrated
inflammatory cells and other cell types in the wound release
multiple cytokines and chemokines. These cytokines initiate
fibroblast proliferation and synthesis of ECM that fill the wound
deficit and lead to wound closure.
[0008] Meanwhile, keratinocytes at the wound edge begin to
proliferate and migrate to cover the wound surface. Underneath the
re-epithelialized epidermis, new stroma, called granulation tissue,
begins to fill the wound space, which contains provisional ECM,
inflammatory cells, fibroblasts, and blood vessels. Once the wound
area is filled with the granulation tissue and covered by newly
re-epithelialized epidermis, the process of wound closure is
completed. Later on, the wound gradually returns to normal strength
and texture through tissue remodeling.
[0009] Among the many molecules known to influence wound healing,
transforming growth factor .beta. (TGF-.beta.) has the broadest
spectrum of action, affecting all cell types that are involved in
all stages of wound healing (Feng et al., Annu Rev Cell Dev Biol
21:659-693, 2005). The various functions of TGF-.beta. are mediated
by a number of signaling molecules, including the Smad family
members. When a ligand binds to TGF-.beta. type I and type II
receptors (TGF.beta.RI and TGF-.beta.RII), TGF-.beta.RI
phosphorylates Smad2 and Smad3. Phosphorylated Smad2 and Smad3 bind
a co-Smad, Smad4, to form heteromeric Smad complexes and
translocate into the nucleus to regulate transcription of
TGF-.beta. target genes.
[0010] TGF-.beta. signaling has been reported to exert both
positive and negative effects on wound healing (Wang et al., J
Investig Dermatol Symp Proc 11: 112-117, 2006). For instance, Smad3
deficient mice, in which TGF-.beta. signaling is partially
abrogated, exhibit accelerated wound healing (Ashcroft et al., Nat
Cell Biol 1:260-266, 1999). In contrast, the introduction of
exogenous Smad3 to wound sites to enhance TGF-.beta. signaling also
accelerated wound healing in a rabbit dermal ulcer model (Sumiyoshi
et al., J Invest Dermatol 123:229-236, 2004). Skin wounds in
Smad4-deficient mice have a dramatic increase in inflammation and
angiogenesis causing a delay in wound closure and formed an
excessive scar (Owens et al., Am J Pathol 176:122-133, 2010).
Transient adenoviral gene transfer of Smad7, an antagonist of
TGF-.beta. signaling, in corneal epithelium and stroma resulted in
accelerated corneal wound healing with reduced inflammation (Saika
et al., Am J Pathol 166:1405-1418, 2005). Further, Smad7 gene
transfer to the lens epithelium and stroma prevented injury-induced
epithelial-mesenchymal transition of lens epithelial cells and
suggests a potential role of Smad7 in prevention of capsular
fibrosis (Saika et al., Lab Invest 84:1259-1270, 2004). However,
adenoviral vector delivery of Smad7 to balloon injury in rat
carotid arteries resulted in reduced vascular healing
(Mallawaarachchi et al., Arterioscler Thromb Vasc Biol 25:
1383-1387, 2005). These studies suggest that the effects of
TGF-.beta. signaling components, such as Smad7, on wound healing
are complex and highly context-specific. Additionally, the effect
of Smad7 may not always be explained by its role in TGF-.beta.
signaling. For instance, Smad7 has also been shown to interact with
components of the Wnt/.beta.-catenin (Han et al., Dev Cell Biol
11:301-312, 2006) and the TNF.beta./NF-.kappa.B (Hong et al., Nat
Immunol 8:504-513, 2007) families.
SUMMARY
[0011] The present technology provides a nucleic acid molecule
comprising a codon-optimized human Smad7 cDNA nucleotide sequence.
In some embodiments, the codon-optimized human Smad7 nucleotide
sequence may include one or more codons for arginine optimized for
expression in one or more of bacteria or yeast, including one or
more codons for serine optimized for expression in one or more of
bacteria or yeast, and/or including one or more codons for
histidine optimized for expression in one or more of bacteria or
yeast. In some embodiments, the codon-optimized human Smad7
nucleotide sequence may include 28 serine codons, 6 histidine
codons, and 9 arginine codons optimized for expression in one or
more of bacteria or yeast. In some embodiments, the codon-optimized
human Smad7 nucleotide sequence may be selected from the group
consisting of SEQ ID NOs: 9, 21, 23, 24, 26, 28, 30, 32-34, 36, 38,
39, 87, 89, 91, 93, 96, 97, 99, and 100. In some embodiments, the
codon-optimized human Smad7 nucleotide sequence may have about 65
to 75 percent homology to human Smad7 cDNA, may comprise a
nucleotide sequence encoding an N-terminal fragment SMAD7, may
comprise a nucleotide sequence encoding a C-terminal fragment of
SMAD7, may comprise nucleotides encoding amino acids 2-258 of the
human Smad7 protein, may comprise nucleotides encoding amino acids
259-426 of the human Smad7 protein, or may comprise nucleotides
encoding amino acids 203-258 of the human Smad7 protein. In some
embodiments, any of the foregoing may further comprise a nucleotide
sequence encoding a protein transduction domain, such as Tat. In
some embodiments, any of the foregoing may also further comprise a
nucleotide sequence encoding one or more of an epitope tag or a
purification tag, such as V5, glutathione-S-transferase, or
6-Histidine (SEQ ID NO: 40).
[0012] In some embodiments, any of the foregoing may be isolated
and/or purified. In some embodiments, any one of the foregoing may
also encode a polypeptide having one or more biological activities
selected from the group consisting of reducing or eliminating
phosphorylation of Smad2, reducing or eliminating nuclear
translocation of the NF-.kappa.B p50 subunit, increasing cell
proliferation, reducing apoptosis, reducing radiation-induced DNA
damage, reducing inflammation, reducing angiogenesis, promoting
healing in oral mucositis, promoting wound healing, and treating
auto-immune disease. In some embodiments, pharmaceutical
compositions comprising the nucleic acid molecules above and one or
more pharmaceutically acceptable excipients are provided. In some
embodiments, expression vectors comprising the nucleic acid
molecules above operably linked to a promoter are provided, as are
host cells comprising such expression vectors, and pharmaceutical
compositions comprising such vectors and host cells with one or
more pharmaceutically acceptable excipients.
[0013] In one aspect, a protein molecule comprising a human Smad7
protein having leucine at position 216 is provided. In some
embodiments, the human Smad7 protein may be truncated at the
C-terminal, or truncated at the N-terminal. In some embodiments,
the truncated human Smad7 protein may include about 50% of the
full-length Smad7 sequence, or may include about 13% of the
full-length Smad7 sequence. In some embodiments, the human Smad7
protein may comprise or consist of amino acids 2-258, amino acids
203-258, or amino acids 259-426 of the human Smad7 protein. In some
embodiments, the protein molecule may have one or more biological
activities selected from the group consisting of reducing or
eliminating phosphorylation of Smad2, reducing or eliminating
nuclear translocation of the NF-.kappa.B p50 subunit, increasing
cell proliferation, reducing apoptosis, reducing radiation-induced
DNA damage, reducing inflammation, reducing angiogenesis, promoting
healing in oral mucositis, promoting wound healing, and treating
auto-immune disease. In some embodiments, any of the foregoing may
further comprise a protein transduction domain, such as Tat. In
some embodiments, any of the foregoing may also further comprise
one or more of an epitope tag or a purification tag, such as V5,
glutathione-S-transferase or 6-histidine (SEQ ID NO: 40). In some
embodiments, a pharmaceutical composition comprising any of the
foregoing, a protein molecule, and one or more pharmaceutically
acceptable excipients is provided.
[0014] In another aspect, a method is provided for treating or
preventing an inflammatory condition in a subject comprising
providing to the subject a therapeutically effective amount of the
pharmaceutical composition described above. In some embodiments,
the inflammatory condition may be one or more of a chronic wound,
skin inflammation, psoriasis, or an autoimmune disease. In some
embodiments, the composition may reduce inflammation through
inhibition of TGF-.beta. and NF-.kappa.B signaling.
[0015] In another aspect, a method is provided for preventing or
treating a disease or disorder in a subject comprising one or more
of increasing one or more of cell proliferation or cell migration,
or preventing one or more of apoptosis or DNA damage in the subject
comprising providing to the subject a therapeutically effective
amount of the pharmaceutical composition as described above,
wherein one or more of increasing one or more of cell proliferation
or cell migration, or preventing one or more of apoptosis or DNA
damage is useful in preventing or treating the disease or disorder.
In some embodiments, the disease or disorder may include one or
more of chronic wounds, acute wounds, or mucositis. In some
embodiments, the chronic wounds may include one or more of diabetic
ulcers, pressure ulcers, venous ulcers, or oral ulcers, the acute
wounds may include one or more of trauma-induced wounds, surgical
wounds, or scarring, the mucositis may include one or more of
radiation-induced mucositis or chemotherapy-induced mucositis and
the mucositis may include one or more of oral mucositis or gut
mucositis.
[0016] It is contemplated that any method or composition described
herein can be implemented with respect to any other method or
composition described herein.
[0017] The use of the word "a" or "an" when used in conjunction
with the term "comprising" in the claims and/or the specification
may mean "one," but it is also consistent with the meaning of "one
or more," "at least one," and "one or more than one." The word
"about" means plus or minus 5% of the stated number.
[0018] Other objects, features and advantages of the present
technology will become apparent from the following detailed
description. It should be understood, however, that the detailed
description and the specific examples, while indicating specific
embodiments of the present technology, are given by way of
illustration only, since various changes and modifications within
the spirit and scope of the present technology will become apparent
to those skilled in the art from this detailed description.
BRIEF DESCRIPTION OF THE DRAWINGS
[0019] The following drawings form part of the present
specification and are included to further demonstrate certain
embodiments of the present technology. The embodiments may be
better understood by reference to one or more of these drawings in
combination with the detailed description of specific embodiments
presented herein.
[0020] FIGS. 1A-G provide an illustrative embodiment of data
showing that K5.Smad7 mice were resistant to radiation-induced oral
mucositis. FIG. 1A provides an illustrative embodiment of H&E
staining in non-irradiated and irradiated (day 9 after initiation
of radiation) wild-type (WT) and K5.Smad7 tongues. The vertical
lines in the images of tongues from WT mice highlight the ulcer
boundary and dotted lines in the images indicate the
epithelial-stromal boundary (scale bar, 50 .mu.m). FIG. 1B provides
a graphical representation of the quantification of sizes of tongue
ulcers (mean+s.e.m); n=8 for WT mice and n=7 for K5.Smad7 mice in 8
Gy.times.3 radiation; n=5 for WT mice and n=4 for K5.Smad7 mice in
18-Gy radiation; n=5 per group for WT and K5.Smad7 mice in 22-Gy
radiation. FIG. 1C provides an illustrative embodiment of human
ventricular posterior of the tongue (top) and radiation-induced
tongue mucositis (bottom) visualized using H&E (left) and CD45
staining (right). The solid line indicates the ulcer boundary, and
dotted lines indicate the basement membrane (scale bar, 25 .mu.m).
FIG. 1D provides an illustrative embodiment of immunostaining of
CD45, proliferating cell nuclear antigen (PCNA), and TUNEL assay in
irradiated sections adjacent to an ulcer from WT mice and in
damaged areas from K5.Smad7 mice (PI, propidium iodide). Dotted
lines indicate the basement membrane (scale bar, 25 .mu.m). FIGS.
1E-1G provide graphical representations of the quantification of
staining in FIG. 1D (n=3 or 4 per group). Data are expressed as
mean+s.e.m (FIG. 1B) or mean+s.d (FIGS. 1E-1G), and two-tail
Student t-test is used to calculate P values. *P<0.05,
**P<0.01, ***P<0.001, NS, no significance determined by
two-tailed Student's t-test. Dotted lines in (FIG. 1A), (FIG. 1C)
and (FIG. 1D) highlight the basement membrane. Scale bar: 50 .mu.m
for all panels in (FIG. 1A) and (FIG. 1C), 25 .mu.m for all panels
in (FIG. 1D).
[0021] FIGS. 2A-G provide an illustrative embodiment of data
showing molecular alterations attenuated by Smad7. FIG. 2A provides
an illustrative embodiment of immune-staining of NF-.kappa.B p50,
TGF-.beta.1 and pSmad2. Irradiated tongue sections of wild-type
(WT) were adjacent to ulcer and sections of K5.Smad7 were from the
damaged area. Human samples were from non-irradiated oral mucosa
and radiation-induced mucositis. Dotted lines delineate
epithelial-stromal boundary. Scale bar, 25 .mu.m for all panels.
FIG. 2B provides a graphical representation of the quantification
of immunostaining of NF-.kappa.B p50 and pSmad2 shown in (FIG. 2A).
FIG. 2C provides an illustrative embodiment of qRT-PCR of
TGF-.beta.1 (normalized by Keratin 5, n=6 per group for day 0, n=4
for day 7 and day 9, and n=7 for day 10). FIG. 2D provides a
graphical representation of the quantification of human oral
keratinocyte migration (see images in FIG. 8). Scrambled, scrambled
siRNA; n=3 per group. FIG. 2E provides an illustrative embodiment
of a western analysis of knockdown efficiency of siSmad7-1 and
siSmad7-2 for Smad7 and for Rac1, 72 hours after Smad7 knockdown.
M, molecular markers. FIG. 2F provides an illustrative embodiment
of western analysis of total and activated (GTP-bound) Rac1
protein. M: molecular markers. FIG. 2G provides a graphical
representation of the quantification of the effect of Rac1
knockdown on Smad7-mediated keratinocyte migration (see knockdown
efficiency in FIG. 9A and images in FIG. 9D). n=3 per group. Data
are presented as mean+s.d. and two-tail Student's t-test was used
to calculate P values for (FIG. 2B-2D) and (FIG. 2G). *P<0.05,
**P<0.01, ***P<0.001. NS, no significance.
[0022] FIGS. 3A-H provide an illustrative embodiment of data
showing Smad7 increased Rac1 expression by repressing individual
Smad and CtBP1 binding to the SBE of the Rac1 promoter. FIG. 3A
provides a graphical representation of the quantification of Rac1
mRNA in wild-type (WT) and Smad7 transgenic keratinocytes. n=4 per
group. FIG. 3B provides an illustrative embodiment of western
analysis of GTP-Rac1 and total Rac1 in WT and Smad7 keratinocytes.
Smad7 protein levels in WT and Smad7 keratinocytes were determined
by reprobing the tubulin western blot with an antibody to Smad7
(see an additional western blot and quantification in FIGS. 10A-B).
FIG. 3C provides an illustrative embodiment of western analysis of
Rac1 protein level after knocking down individual Smad2, Smad3 or
Smad4 in human keratinocytes (see FIG. 10C-10E for Smad knockdown
efficiencies). FIG. 3D provides an illustrative embodiment of a
ChIP assay for Smad-2, -3, -4, and -7 binding to the -1.5 Kb SBE
site of the Rac1 promoter in WT and Smad7 transgenic keratinocytes.
FIG. 3E provides a graphical representation of the quantification
of Rac1 luciferase reporter assay in mouse keratinocytes.
Scrambled: scrambled siRNA. n=6. FIG. 3F provides a graphical
representation of the quantification of the activities of Rac1
luciferase reporter containing SBE or mutant (mut) SBE in WT or
Smad7 transgenic keratinocytes. n=6. FIG. 3G provides an
illustrative embodiment of images of ChIP assays of CtBP1 binding
to the SBE-1.5 Kb site of the Rac1 promoter in WT or K5.Smad7
keratinocytes. FIG. 3H provides a graphical representation of
ChIP-qPCR quantification of CtBP1 binding to the SBE shown in FIG.
3G in WT and Smad7 transgenic keratinocytes. n=4. Data are
presented as mean+s.d. and two-tail Student's t-test is used to
calculate P values for FIGS. 3A, 3E, 3F and 3H. *P<0.05, **P
<0.01, ***P<0.001.
[0023] FIGS. 4A-G provide an illustrative embodiment of data
showing CtBP1-associated Rac1 repression contributed to inhibition
of keratinocyte migration. FIG. 4A provides an illustrative
embodiment of western analysis of Rac1 protein after knockdown of
CtBP1 in human oral keratinocytes. FIG. 4B provides a graphical
representation of the quantification of SBE-containing Rac1 luc
reporter activity. n=6. FIG. 4C provides a graphical representation
of the quantification of the effect of CtBP1 knockdown on human
oral keratinocyte migration. n=3 per group. FIG. 4D provides an
illustrative embodiment of immunostaining of CtBP1. Irradiated
sections were adjacent to the ulcer (WT) or the damaged area
(K5.Smad7). Dotted lines denote the basement membrane. Scale bar,
50 .mu.m for all panels. FIG. 4E provides an illustrative
embodiment of immunostaining of CtBP1 in non-irradiated oral mucosa
and radiation-induced oral mucositis in human specimens. Dotted
lines denote the basement membrane. Scale bar, 50 .mu.m for both
panels. FIG. 4F provides a graphical representation of the
quantification of CtBP1 nuclear positive cells in FIGS. 4D-E. n=3
or 4 per group. FIG. 4G provides a graphical representation of the
quantification of qRT-PCR for CtBP1 (normalized with Keratin K5).
n=6 per group for day 0, n=4 for day 7 and day 9, and n=7 for day
10. Data are presented as mean+s.d. and two-tail Student's t-test
is used to calculate P values for FIGS. 4B, 4C, 4F and 4G.
*P<0.05, **P<0.01, ***P<0.001.
[0024] FIGS. 5A-G provide an illustrative embodiment of data
showing oral Tat-Smad7 application prevented radiation-induced oral
mucositis in mice. FIG. 5A provides a graphical representation of
the quantification of oral mucositis ulcer sizes on day 9 after
initiation of 8 Gy.times.3 radiation. Vehicle=saline or 50%
glycerol/PBS. FIG. 5B provides an illustrative embodiment of
pathological alterations on day 9 of initiation of 8 Gy.times.3
radiation. Vehicle=saline or 50% glycerol/PBS. Scale bar, 50 .mu.m
for H&E panels and 25 .mu.m for remaining panels. Dotted lines
delineate epithelial-stromal boundary; the solid line highlights
the ulcer boundary. FIGS. 5C, 5D, 5E, 5F, and 5G provide a
graphical representation of the quantification of immunostaining
shown in FIG. 5B. n=3 or 4 per group. Data are presented as
mean+s.e.m (FIG. 5A) or mean+s.d. (FIGS. 5C-5G) and two-tail
Student's t-test is used to calculate P values. *P<0.05,
**P<0.01, ***P<0.001. NS, no significance.
[0025] FIGS. 6A-G provide an illustrative embodiment of data
showing Tat-Smad7 treatment on oral mucositis. FIG. 6A provides a
graphical representation of the quantification of ulcer sizes
measured on day 10 after initiation of 8 Gy.times.3 radiation.
Glycerol=50% glycerol/PBS. FIG. 6B provides an illustrative
embodiment of H&E staining of oral mucosa. Upper panels: open
ulcer in Palifermin treated but not Tat-Smad7 treated mucosa. Lower
panels: comparison of epithelial thickness between Palifermin
treated and Tat-Smad7 treated mucosa. Dotted lines delineate the
basement membrane. The vertical line highlights the ulcer boundary.
Scale bar, 50 .mu.m for all panels. FIG. 6C provides an
illustrative embodiment of immune-staining of Tat-Smad7 treatment
in 20 Gy-induced oral mucositis after ulcers healed. V5
immunostaining visualizes Tat-Smad7 in oral epithelia (sections are
away from the damaged regions). K14 immunostaining was used as
counterstain. Dotted lines delineate basement membrane. Scale bar,
25 .mu.m for all panels. FIG. 6D provides an illustrative
embodiment of Rac1 western analysis of Tat-Smad7 treated mouse
tongues, day 10 after initiation of 8 Gy.times.3 radiation. FIG. 6E
provides an illustrative embodiment of Rac1 western analysis on
Tat-Smad7 treated normal human oral keratinocytes 48 hours after
treatment. FIG. 6F provides an illustrative embodiment of the
effect of Tat-Smad7 treatment on oral human keratinocyte migration
(NOK-SI, see images in FIG. 13A). n=4 per group. FIG. 6G provides a
graphical representation of the quantification of survival curves
of NOK-SI keratinocytes and SCC lines (Cal27 and MSK921) with or
without Tat-Smad7 treatment. n=4 per group for each radiation dose.
Data are presented as mean+s.e.m (FIG. 6A) or mean+s.d. (FIGS. 6F,
6G) and two-tail Student's t-test is used to calculate P values.
*P<0.05, **P<0.01, ***P<0.001. NS, no significance.
[0026] FIGS. 7A-E provide an illustrative embodiment of data
showing K5. Smad7 oral mucosal tissues were resistant to
radiation-induced oral mucositis. FIG. 7A provides an illustrative
embodiment of Smad7 western blots: undetectable in non-irradiated
wild-type (WT) tongue and barely detectable after radiation.
K5.Smad7 tongues have comparable Smad7 protein levels before and
after radiation. M: molecular marker. FIG. 7B provides an
illustrative embodiment of Smad7 immunostaining. Note that nuclei
in some irradiated epithelial cells are hypertrophic. Dotted lines
delineate epithelial-stromal boundary. FIG. 7C provides a graphical
representation of the quantification of reduced incidence of oral
mucositis-induced morbidity in K5.Smad7 mice. Fisher's exact test
is used to calculate the p value. **P=0.007. FIG. 7D provides an
illustrative embodiment of immune-staining of K5.Smad7 tongue
showing reduced infiltration of neutrophils (Ly-6G), macrophages
(BM8) and activated T cells (CD4) compared to WT oral mucositis.
Dotted lines delineate epithelial-stromal boundary. FIG. 7E
provides an illustrative embodiment of immune-staining showing no
significant difference in pSmad1/5/8-nuclear positive cells (light)
between WT and K5.Smad7 oral mucosa before or after radiation.
Keratin (K14) immunostaining (dark) highlights the epithelial
compartment. Note that nuclei of irradiated epithelial cells are
hypertrophic. The scale bar is 50 .mu.m for all panels.
[0027] FIGS. 8A-D provide an illustrative embodiment of data
showing migration in spontaneously immortalized human oral
epithelial cells (NOK-SI) was delayed by knocking down Smad7 but
accelerated by knocking down TGF-.beta.1. FIGS. 8A and 8B provide
an illustrative embodiment of representative images of cell
migration. Pairs of dotted lines delineate the scratch wound.
Quantification of cell migration and efficiency of Smad7 knockdown
are presented in FIG. 2D and FIG. 2E (above). Scrambled, scrambled
siRNA. FIG. 8C provides a graphical representation of the
quantification of cell migration after TGF-.beta.1 knockdown from 3
separate experiments. FIG. 8D provides a graphical representation
of qRT-PCR showing TGF-.beta.1 knockdown efficiency. Data are
presented as mean+s.d. and two-tail Student's t-test was used to
calculate P values. *P<0.05, **P<0.01. NS, no
significance.
[0028] FIGS. 9A-D provide an illustrative embodiment of data
showing knocking down Rac1 reduced proliferation and migration of
wild-type (WT) and Smad7 transgenic keratinocytes. FIG. 9A provides
an illustrative embodiment of western blot analysis for Rac1 48
hours after Rac1 siRNA (siRac1-1, siRac1-2) transfection. Control,
scrambled siRNA. FIG. 9B provides a graphical representation of the
percentage of BrdU labeled cells in WT and Smad7 cultured cells in
BrdU incorporation assay with or without Rac1 knockdown. Data from
3 separate experiments were presented as mean+s.d. ***P<0.001.
FIG. 9C provides an illustrative embodiment of representative
immunofluorescence of BrdU positive cells presented in (FIG. 9B).
An antibody against keratin 14 (K14, red) was used for
counterstain. FIG. 9D provides an illustrative embodiment of in
vitro cell migration assay for Smad7 transgenic and WT
keratinocytes after Rac1 knockdown. Pairs of dotted lines delineate
the scratch wound. Quantification of cell migration is presented in
FIG. 2G.
[0029] FIGS. 10A-F provide an illustrative embodiment of data
showing Smad7 increased Rac1 expression by repressing Smad and
CtBP1 binding to the SBE of the Rac1 promoter. FIG. 10A provides an
illustrative embodiment of western blot analysis for GTP-Rac1 and
total Rac1 in Smad7 transgenic keratinocytes. Additional samples
are shown in FIG. 3B. M, molecular marker. FIG. 10B provides a
graphical representation of the quantification of GTP-Rac1, total
Rac1 and Smad7 in WT and K5.Smad7 keratinocytes shown in FIG. 10A
and in FIG. 3B. The protein level in WT keratinocytes of each blot
was normalized as "1". Data is presented as mean+s.d. and two-tail
Student's t-test was used to calculate P values. **P<0.01,
***P<0.001. FIGS. 10C and 10D provide an illustrative embodiment
of western blot analysis for Smad2, Smad3, and Smad4 knockdown in
NOK-SI cells. Their effects on Rac1 expression are shown in FIG.
3C. M, molecular marker. GAPDH, internal protein control by
reprobing same blot. FIG. 10F provides an illustrative embodiment
showing CtBP1 knockdown promotes NOK-SI cell migration. Pairs of
dotted lines delineate the scratch wound. Quantification of cell
migration and efficiency of CtBP1 knockdown are shown in FIG. 4A
and FIG. 4C.
[0030] FIGS. 11A-G provide an illustrative embodiment of data
showing the purification and characterization of Tat-Smad7 and
Tat-Cre proteins. FIG. 11A shows an illustrative embodiment of a
schematic representation of Tat-Smad7 protein. FIG. 11A discloses
SEQ ID NOs: 49 and 101, respectively, in order of appearance. FIG.
11B provides an illustrative embodiment of a western blot of
purified Tat-Smad7 protein. FIG. 11C provides an illustrative
embodiment of immune-staining of Tat-Smad7 protein transduction in
keratinocytes. Left and middle panels: Tat-Smad7 staining (light)
using a V5 antibody, counterstained with a K14 antibody (dark).
Cells showed Tat-Smad7 in the nucleus 5 min after transduction and
in both nucleus and cytoplasm 12 hours after transduction. Right
panels: Tat-Smad7 abrogated Smad2 phosphorylation (pSmad2, light).
V5 (dark) counterstain visualizes Tat-Smad7 transduced cells. FIG.
11D provides an illustrative embodiment of immune-staining showing
that V5 antibody staining detects Tat-Smad7 transduction in buccal
mucosa 12 hours after Tat-Smad7 topical application. A K14 antibody
was used for counterstain. Scale bar, 50 .mu.m for both panels.
FIG. 11E provides an illustrative embodiment of a western blot of
purified Tat-Cre protein with the same Tat and V5 tags shown in
FIG. 11A. FIG. 11F provides an illustrative embodiment of an
agarose gel showing activity of Tat-Cre: Tat-Cre cuts out a 1,460
bp floxed fragment from the 7,650 bp vector pLL3.7. FIG. 11G
provides a graphical representation showing Tat-Smad7 protein
preventive treatment reduced 20 Gy radiation-induced oral ulcers.
Data are expressed as mean+s.e.m. Two-tail Student's t-test is used
to calculate P values. *P<0.05, ***P<0.001.
[0031] FIGS. 12A-I provide an illustrative embodiment of data
showing effects of Tat-Smad7 treatment on oral mucositis. FIG. 12A
provides a graphical representation of the quantification of
reduced ulcer size in Tat-Smad7 (0.8 .mu.g daily, day 6 to day 9)
treated oral mucosa. Samples were harvested on day 10. n=8 per
group. FIG. 12B provides an illustrative embodiment of
immunostaining of molecular markers for samples from FIG. 12A.
Scale bar, 50 .mu.m for the top two panels and 25 .mu.m for other
panels. Propidium iodide (PI) and K14 were used as counterstain.
FIGS. 12C-G provide graphical representation of the quantifications
of immunostaining shown in FIG. 12C. 3-4 samples were used. FIG.
12H provides a graphical representation quantification of the
Luciferase assay. Tat-Smad7 treatment increased the activity of the
Rac1 promoter with SBE but not the mutant SBE in mouse
keratinocytes. FIG. 12I provides an illustrative embodiment of a
ChIP assay for CtBP1 binding to the SBE of mouse Rac1 promoter in
Tat-Smad7 treated mouse keratinocytes. Data are expressed as
mean+s.e.m (a) or mean+s.d (c-h) and two-tail Student's t-test is
used to calculate P values. *P<0.05, **P<0.01, ***P<0.001.
NS, no significance.
[0032] FIGS. 13A-H provide an illustrative embodiment of data
showing effects of Tat-Smad7 treatment on migration of human
keratinocytes and tumor cell lines. FIG. 13A provides an
illustrative embodiment showing Tat-Smad7 accelerates NOK-SI cell
migration. Quantification from four separate experiments is shown
in FIG. 6F (above). Pairs of dotted lines delineate initial wounds.
FIG. 13B provides an illustrative embodiment of immunostaining of
Tat-Smad7 treatment in NOK-SI cells showing attenuated
radiation-induced pSmad2 and NF-.kappa.B p50 nuclear localization.
FIG. 13C provides an illustrative embodiment showing V5 staining of
MSK921 cells 2 hours after Tat-Smad7 treatment. K14 staining was
used as counterstain. FIG. 13D provides an illustrative embodiment
of a Rac1 western analysis in MSK921 60 hours after Tat-Smad7
treatment. M, molecular marker. FIG. 13E provides a graphical
representation of quantification of MSK921 cell migration from 3
separate experiments. FIG. 13F provides an illustrative embodiment
showing a representative MSK921 cell migration assay treated with
Tat-Smad7 and PBS. Pairs of solid lines delineate initial wounds.
Dotted lines highlight the forefront of migrated cells. FIG. 13G
provides a graphical representation of quantification of Cal27 cell
migration from 3 separate experiments. FIG. 13H provides an
illustrative embodiment showing representative images for FIG. 13G.
Pairs of solid lines delineate initial wounds. Dotted lines
highlight the forefront of migrated cells. Data are expressed as
mean+s.d. and the two-tail Student's t-test is used to calculate P
values. NS, no significance.
[0033] FIGS. 14A-B show an illustrative schematic of a summary of
potential mechanisms of Smad7-mediated protection and healing of
oral mucositis. FIG. 14A shows an illustrative schematic of how
radiation activates NF-.kappa.B, increases TGF-.beta.1 and CtBP1.
NF-.kappa.B and TGF-.beta.1 induce inflammation. TGF-.beta.1
induces apoptosis, growth arrest and activates Smad-2, -3 and -4,
which recruit CtBP1 to the Rac1 promoter to repress Rac1
transcription, leading to blunted re-epithelialization. FIG. 14B
shows an illustrative schematic of how Smad7 blocks NF-.kappa.B and
TGF-.beta.1-induced inflammation and blocks TGF-.beta.1-induced
apoptosis and growth arrest. Smad7 relieves Rac1 transcriptional
repression by either preventing TGF-.beta.1-mediated Smad
activation (phosphorylation) or competing with signaling
Smads/CtBP1 transcriptional repression complex in binding to the
Rac1 promoter. Increased Rac1 induced by Smad7 contributes to
keratinocyte migration during re-epithelialization.
[0034] FIG. 15 shows an illustrative schematic of Smad 7 domains
associated with protein partners, potential target effects, and
potential physiological effects.
[0035] FIGS. 16A-B are graphs demonstrating the ability of
truncated Smad7 proteins to accelerate wound healing in a mouse
wound healing model. FIG. 16A is a graph showing the effect of
C-terminally truncated (259-426aa) Tat-C-Smad7 on average percent
wound healing over time relative to full-length Tat-Smad7 and
control (PBS). n=3 for each group. FIG. 16B is a graph illustrating
the effect of Tat-N-Smad7 (1-258aa) on average percent wound
healing over time relative to full-length Tat-Smad7 and control
(PBS). n=6 for each group. Data are presented as mean+s.d., and
two-tail Student's t-test was used to calculate P values.
*p<0.05 compared to control (PBS), # p<0.05 compared to
Tat-Smad7.
[0036] FIGS. 17A-C are photographs and graphs demonstrating that
Smad7 accelerates wound healing in an impaired wound-healing model.
FIG. 17A is a digital photograph illustrating the gross appearance
of wounds in diabetic (db/db) mice treated with PBS or Tat-Smad7
over a time period of thirteen days. FIG. 17B is a graph showing
the effect of Tat-Smad7 on average percent wound healing over time
relative to REGRANEX.RTM. and control (PBS). n=6 for each group.
Data are presented as mean+s.d., and two-tail Student's t-test was
used to calculate P values. *p<0.05 compared to control (PBS).
FIG. 17C is a histological comparison of wound samples taken eight
days after wounding. The vertical dotted lines in the image from
the control (PBS) db/db mouse (top panel) highlight the wound
boundary.
DETAILED DESCRIPTION
[0037] As further described herein, the disclosure provides Smad7
proteins and biologically active fragments and derivatives thereof,
nucleic acids encoding such proteins, vectors including such
nucleic acids, and cells encompassing the vectors, nucleic acids,
and/or proteins all for use in formulating medicaments and for
treating and/or preventing one or more diseases or disorders. Also
provided are methods for making and for screening Smad7 proteins
and biologically active fragments and derivatives thereof useful
for treating and/or preventing one or more diseases or disorders.
Also provided are methods for predicting and/or evaluating a
response to treatment using one or more markers associated with
exposure to Smad7. Such markers may include, but are not limited
to, Rac1 for cell migration, NF-.kappa.B for inflammation, and
TGF-.beta. for growth arrest and inflammation.
[0038] Smad7 treatable diseases and disorders may include those
including one or more of reduced cell proliferation, reduced cell
migration, increased cell death, excessive inflammation, and/or DNA
damage. Smad7 treatable diseases and disorders may include those
where treatment with a Smad7 protein and biologically active
fragments and derivatives thereof that have one or more activities
including but not limited to increasing proliferation, reducing or
inhibiting cell death, reducing excessive inflammation, preventing
DNA damage, and/or increasing cell migration. Such diseases and/or
disorders may include but are not limited to acute (e.g., through
surgery, combat, trauma) and chronic wounds (e.g., ulcers, such as
diabetic, pressure, venous), scarring, fibrosis, and aberrant
healing, mucositis (e.g., oral and/or gastro-intestinal),
stomatitis, proctitis, autoimmune disease (e.g., psoriasis,
arthritis), and cancer.
[0039] It is critical for oral mucositis prevention and treatment
to overcome epithelial ablation due to massive apoptosis and
blunted keratinocyte proliferation. The proliferative and
anti-apoptotic effects of Smad7 are more obvious in oral mucositis
than in normal oral mucosa, when TGF-.beta.1, a potent growth
inhibitor and apoptosis inducer for epithelial cells, was
increased.
[0040] Although not wishing to be bound by theory, it is believed
that increased Rac1 activation is largely responsible for
Smad7-mediated keratinocyte migration in wound closure. This
finding was unexpected, given the documented role of TGF-.beta.
signaling in Rho/Rac activation in cancer cells via a
Smad-independent mechanism (Dernyck et al., Nature 415:577-584,
2003).
[0041] It is believed that during oral mucositis, Smad-dependent
Rac1 repression overcomes Smad-independent Rac1 activation (if any)
due to increased Smad signaling (evidenced by increased pSmad2) and
Smad transcriptional co-repressor CtBP1. When this repression is
abrogated by Smad7, it permits Rac1 activation-mediated
keratinocyte migration. However, in oral cancer cells, signaling
Smads are lost or inactivated, or other mechanisms independently
activate Rac1. As a result, Smad7-mediated abrogation of Rac1
repression would no longer occur.
[0042] Although Rac1 activation also contributed to keratinocyte
proliferation, knocking down Rac1 only partially attenuated the
proliferative effect of Smad7. Therefore, Rac1's contribution to
proliferation appears to be limited, and blocking
TGF-.beta.1-induced growth arrest is also needed to overcome
radiation-induced growth inhibitory effects.
[0043] Dampening excessive inflammation creates a microenvironment
for oral mucositis healing. The antagonistic effect of Smad7 on
both TGF-.beta. and NF-.kappa.B signaling makes Smad7 a more
efficient anti-inflammatory molecule than other agents targeting
only NF-.kappa.B. Because inflammatory cells produce cytokines that
further activate TGF-.beta. and NF-.kappa.B, reduced TGF-.beta. and
NF-.kappa.B signaling, found in K5.Smad7 or Tat-Smad7 treated oral
mucosa after radiation, reflects the direct antagonistic effect of
Smad7 on these two pathways and the consequence of reduced
inflammatory cytokines from infiltrated leukocytes. However, Smad7
did not reduce NF-.kappa.B or TGF-.beta. signaling below their
normal physiological conditions. This incomplete blockade of
NF-.kappa.B or TGF-.beta. signaling may be beneficial to oral
mucositis healing, as a complete loss of either pathway could
induce excessive inflammation.
[0044] The primary obstacle to using growth factors to treat oral
mucositis in cancer patients is the potential risk of promoting
cancer cell growth. The majority of human oral cancers lose
TGF-.beta. signaling in tumor epithelial cells. Thus,
anti-Smad-associated cell proliferation and migration by Smad7
would not be effective in cancer cells. In tumors with intact
TGF-.beta. signaling, activation of other oncogenic pathways could
override TGF-.beta.-induced tumor suppressive effects. These two
scenarios could explain why there was no observation of Smad7
increasing proliferation and migration in oral cancer cells with
mutant or intact TGF-.beta. signaling components.
[0045] Additionally, TGF-.beta. signaling promotes tumor invasion
mainly through Smad-independent mechanisms after loss of
TGF-.beta.-induced tumor suppression. Thus, blocking TGF-.beta.
signaling by Smad7 in cancer cells could abrogate
TGF-.beta.-mediated tumor promotion effects, which behaves
similarly to TGF-.beta. inhibitors currently being used in clinical
trials for advanced cancers. Further, the potent anti-inflammatory
effect of Smad7 may reduce the risk of tumor progression.
Therefore, long-term Smad7 application may also be helpful in
cancer treatment.
[0046] Spontaneous tumor formation in K5.Smad7 mice has not been
observed. Because Smad7 is not a secreted protein, local and
short-term Smad7 protein delivery in oral mucositis treatment
should have few systemic effects. In bone marrow transplant
patients, whose oral epithelia do not contain cancer cells, Smad7
topical application may be suitable for both prevention and
treatment of oral mucositis.
[0047] Although not wishing to be bound by any theory,
Smad7-mediated oral mucositis healing appears to be a result of
targeting multiple pathogenic processes mediated by one or more
molecules (see, e.g., FIGS. 14A-B). It is believed that one or more
of these molecules (e.g., TGF-.beta., NF-.kappa.B, CtBP1, Rac1) may
also be helpful as predictive and therapeutic responsive markers of
oral mucositis in patients.
A. Nucleic Acids, Vectors and Host Cells
[0048] The present disclosure also provides, in another embodiment,
genes encoding Smad7. In addition to the wild-type SMAD7 gene (SEQ
ID NOs: 22, 88), as well as various codon-optimized versions (SEQ
ID NOs: 9, 21, 23, 24, 26, 28, 30, 32-34, 36, 38, 39, 87, 89, 91,
93, 96, 97, 99, and 100), it should be clear that the present
technology is not limited to the specific nucleic acids disclosed
herein. As discussed below, a "Smad7 gene" may contain a variety of
different bases and yet still produce a corresponding polypeptide
that is functionally indistinguishable from, and in some cases
structurally identical to, the human gene disclosed herein.
[0049] 1. Nucleic Acids Encoding Smad7
[0050] Nucleic acids according to the present technology may
represent an entire Smad7 gene, a truncated portion, and/or a
fragment of Smad7 that expresses a polypeptide with one or more
activity associated with Smad7 such as but not limited to
increasing proliferation, reducing or inhibiting cell death,
reducing excessive inflammation, preventing DNA damage, and/or
increasing cell migration, as well as treating or preventing one or
more disease or disorders in which such treatment would be helpful
as further discussed herein. Such activities can be assessed using
one or more assays including, but not limited to, the ability to
block phosphorylation of Smad2 and/or nuclear translocation of the
NF-.kappa.B p50 subunit, increase cell proliferation, reduce
apoptosis and/or radiation-induced DNA damage, reduce inflammation
and/or angiogenesis, promote healing in oral mucositis, surgical
wounds, diabetes wounds, and/or wounds associated with chronic
inflammation in mice. The nucleic acid may be derived from genomic
DNA, i.e., cloned directly from the genome of a particular
organism. In particular embodiments, however, the nucleic acid
would comprise complementary DNA (cDNA). Also provided is a cDNA
plus a natural intron or an intron derived from another gene; such
engineered molecules are sometime referred to as "mini-genes." At a
minimum, these and other nucleic acids of the present technology
may be used as molecular weight standards in, for example, gel
electrophoresis.
[0051] The term "cDNA" is intended to refer to DNA prepared using
messenger RNA (mRNA) as template. The advantage of using a cDNA, as
opposed to genomic DNA or DNA polymerized from a genomic, non- or
partially-processed RNA template, is that the cDNA primarily
contains coding sequences of the corresponding protein. There may
be times when the full or partial genomic sequence is preferred,
such as where the non-coding regions are required for optimal
expression or where non-coding regions such as introns are to be
targeted in an antisense strategy.
[0052] As used in this application, the term "a nucleic acid
encoding a Smad7" may refer to a nucleic acid molecule that has
been isolated free of total cellular nucleic acid and/or may refer
to a cDNA encoding a Smad7 polypeptide. As used herein, the term
"isolated free of total cellular nucleic acid" means that the
nucleic acid molecule is about or at least about 75% pure, 80%
pure, 85% pure, 90% pure, 95% pure, 96% pure, 97% pure, 98% pure,
99% pure, or 100% pure of other cellular nucleic acid molecules as
determined using standard biochemical techniques, such as but not
limited to agarose gel electrophoresis. As used herein, the term
"isolated free of total cellular protein" means that the protein
molecule is about or at least about 75% pure, 80% pure, 85% pure,
90% pure, 95% pure, 96% pure, 97% pure, 98% pure, 99% pure, or 100%
pure of other cellular nucleic acid molecules as determined using
standard biochemical techniques, such as but not limited to a
western blot. In certain embodiments, the present technology
concerns a nucleic acid sequence essentially as set forth in,
and/or including any one of SEQ ID NOs: 9, 21, 23, 24, 26, 28, 30,
32-34, 36, 38, 39, 87, 89, 91, 93, 96, 97, 99, and 100.
[0053] An isolated nucleic acid molecule may be produced using
recombinant DNA technology (e.g., polymerase chain reaction (PCR)
amplification, cloning) or chemical synthesis. Isolated nucleic
acid molecules include natural nucleic acid molecules and
homologues thereof, including, but not limited to, natural allelic
variants and modified nucleic acid molecules in which nucleotides
have been inserted, deleted, substituted, and/or inverted in such a
manner that such modifications provide the desired effect (e.g.,
production of Smad7 protein in non-human expression systems).
[0054] The term "essentially as set forth in one or more nucleic
acid sequence (e.g., SEQ ID NOs: 9-11, 21, 23-41" means that the
nucleic acid sequence substantially corresponds to at least a
portion, and in some cases the entirety, of the one or more nucleic
acid sequence (e.g., SEQ ID NOs: 9, 21, 23, 24, 26, 28, 30, 32-34,
36, 38, 39, 87, 89, 91, 93, 96, 97, 99, and 100). In some
embodiments, sequences that substantially correspond to at least a
portion of a nucleic acid sequence, may correspond to about, or at
least about 50 nucleic acids, 75 nucleic acids, 150 nucleic acids,
200 nucleic acids, 250 nucleic acids, 300 nucleic acids, 350
nucleic acids, 400 nucleic acids, 450 nucleic acids, 500 nucleic
acids, 550 nucleic acids, 600 nucleic acids, 650 nucleic acids, 700
nucleic acids, 750 nucleic acids, 800 nucleic acids, 900 nucleic
acids, 1000 nucleic acids, 1100 nucleic acids, 1200 nucleic acids,
or 1250 nucleic acids of one or more of the sequences described
herein. In some embodiments, sequences that substantially
correspond to at least a portion of a nucleic acid sequence, may
correspond to about a range of about 50-1250 nucleic acids, 75-1250
nucleic acids, 150-1250 nucleic acids, 200-1250 nucleic acids,
250-1250 nucleic acids, 300-1250 nucleic acids, 350-1250 nucleic
acids, 400-1250 nucleic acids, 450-1250 nucleic acids, 500-1250
nucleic acids, 550-1250 nucleic acids, 600-1250 nucleic acids,
650-1250 nucleic acids, 700-1250 nucleic acids, 750-1250 nucleic
acids, 800-1250 nucleic acids, 900-1250 nucleic acids, 1000-1250
nucleic acids, 1100-1250 nucleic acids, 1200-1250 nucleic acids, at
least about 50-75 nucleic acids, 75-150 nucleic acids, 75-200
nucleic acids, 75-250 nucleic acids, 75-300 nucleic acids, 75-350
nucleic acids, 75-400 nucleic acids, 75-450 nucleic acids, 75-500
nucleic acids, 75-550 nucleic acids, 75-600 nucleic acids, 75-650
nucleic acids, 75-700 nucleic acids, 75-750 nucleic acids, 75-800
nucleic acids, 75-900 nucleic acids, 75-1000 nucleic acids, 75-1100
nucleic acids, 75-1200 nucleic acids, or 75-1250 nucleic acids or
1250 nucleic acids of one or more of the sequences described
herein.
[0055] In some embodiments, sequences that substantially correspond
to at least a portion of a nucleic acid sequence include identical
sequences to that portion of the nucleic acid sequence. In some
embodiments, sequences that substantially correspond to at least a
portion of a nucleic acid sequence or the entirety of a nucleic
acid sequence may include one or more functionally equivalent
codons. The term "functionally equivalent codon" is used herein to
refer to one or more codons that encode the same amino acid, such
as the six codons for arginine or serine, and in some embodiments
refers to codons that encode biologically equivalent amino acids,
as discussed in the following pages. The term "biologically
equivalent" amino acid is used herein to refer to one or more amino
acids that when changed from the amino acid present in the amino
acid sequence of human Smad7 wild-type protein, do not change one
or more (or in some embodiments any) of the biological activities
of Smad7 described herein, such as but not limited to, increasing
proliferation, reducing or inhibiting cell death, reducing
excessive inflammation, preventing DNA damage, and/or increasing
cell migration, as well as treating or preventing one or more
disease or disorders in which such treatment would be helpful as
further discussed herein.
[0056] In some embodiments, allowing for the degeneracy of the
genetic code, sequences that have about or at least about 60%, 70%,
80%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, and/or 99% of
nucleotides that are identical to the nucleotides of any one of the
codon-optimized nucleic acid sequences (e.g., SEQ ID NOs: 9-11, 21,
23-41) may be considered substantially corresponding nucleic acid
sequences. Sequences that are essentially the same as those set
forth in any one of the nucleic acid sequences (e.g., SEQ ID NOs:
9-11, 21, 23-41) also may be functionally defined as sequences that
are capable of hybridizing to a nucleic acid segment containing the
complement of SEQ ID NOs: 9-11, 21, 23-41 under various standard
conditions.
[0057] For applications requiring high selectivity, one will
typically desire to employ relatively high stringency conditions to
form the hybrids. For example, relatively low salt and/or high
temperature conditions, such as provided by about 0.02 M to about
0.10 M NaCl at temperatures of about 50.degree. C. to about
70.degree. C. Such high stringency conditions tolerate little, if
any, mismatch between the probe or primers and the template or
target strand and would be particularly suitable for isolating
specific genes or for detecting specific mRNA transcripts. It is
generally appreciated that conditions can be rendered more
stringent by the addition of increasing amounts of formamide.
[0058] For certain applications it is appreciated that lower
stringency conditions are preferred. Under these conditions,
hybridization may occur even though the sequences of the
hybridizing strands are not perfectly complementary, but are
mismatched at one or more positions. Conditions may be rendered
less stringent by increasing salt concentration and/or decreasing
temperature. For example, a medium stringency condition could be
provided by about 0.1 to 0.25 M NaCl at temperatures of about
37.degree. C. to about 55.degree. C., while a low stringency
condition could be provided by about 0.15 M to about 0.9 M salt, at
temperatures ranging from about 20.degree. C. to about 55.degree.
C. Hybridization conditions can be readily manipulated depending on
the desired results.
[0059] In other embodiments, hybridization may be achieved under
conditions of, for example, 50 mM Tris-HCl (pH 8.3), 75 mM KCl, 3
mM MgCl.sub.2, 1.0 mM dithiothreitol, at temperatures between
approximately 20.degree. C. to about 37.degree. C. Other
hybridization conditions utilized could include approximately 10 mM
Tris-HCl (pH 8.3), 50 mM KCl, 1.5 mM MgCl.sub.2, at temperatures
ranging from approximately 40.degree. C. to about 72.degree. C.
[0060] To determine the percent homology of two amino acid
sequences or of two nucleic acids, the sequences are aligned for
optimal comparison purposes (e.g., gaps are introduced in the
sequence of a first amino acid or nucleic acid sequence for optimal
alignment with a second amino acid or nucleic acid sequence). The
amino acid residues or nucleotides at corresponding amino acid
positions or nucleotide positions can then be compared. When a
position in the first sequence is occupied by the same amino acid
residue or nucleotide as the corresponding position in the second
sequence, then the molecules are identical at that position. The
percent homology between the two sequences is a function of the
number of identical positions shared by the sequences (% identity=#
of identical positions/total # of positions (e.g., overlapping
positions).times.100). In some embodiments the two sequences are
the same length.
[0061] To determine percent homology between two sequences, the
algorithm of Karlin and Altschul (1990) Proc. Natl. Acad. Sci. USA
87:2264-2268, modified as in Karlin and Altschul (1993) Proc. Natl.
Acad. Sci. USA 90:5873-5877 can be used. Such an algorithm is
incorporated into the NBLAST and XBLAST programs of Altschul et al.
(1990) J. Mol Biol. 215:403-410. BLAST nucleotide searches is
performed with the NBLAST program, score=100, wordlength=12 to
obtain nucleotide sequences homologous to a nucleic acid molecules
described or disclose herein. BLAST protein searches is performed
with the XBLAST program, score=50, wordlength=3. To obtain gapped
alignments for comparison purposes, Gapped BLAST may be utilized as
described in Altschul et al. (1997) Nucleic Acids Res.
25:3389-3402. When utilizing BLAST and Gapped BLAST programs, the
default parameters of the respective programs (for example, XBLAST
and NBLAST) are used. See the website of the National Center for
Biotechnology Information for further details (on the World Wide
Web at ncbi.nlm.nih.gov). Proteins suitable for use in the methods
described herein also includes proteins having between 1 to 15
amino acid changes, for example, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11,
12, 13, 14, or 15 amino acid substitutions, deletions, or
additions, compared to the amino acid sequence of any protein
described herein. In other embodiments, the altered amino acid
sequence is at least 75% identical, for example, 77%, 80%, 82%,
85%, 88%, 90%, 92%, 95%, 97%, 98%, 99%, or 100% identical to the
amino acid sequence of any protein inhibitor described herein. Such
sequence-variant proteins are suitable for the methods described
herein as long as the altered amino acid sequence retains
sufficient biological activity to be functional in the compositions
and methods described herein. In certain instances conservative
amino acid substitutions are utilized. Illustrative conservative
substitution among amino acids are within each of the following
groups: (1) glycine, alanine, valine, leucine, and isoleucine, (2)
phenylalanine, tyrosine, and tryptophan, (3) serine and threonine,
(4) aspartate and glutamate, (5) glutamine and asparagine, and (6)
lysine, arginine and histidine. The BLOSUM62 table is an amino acid
substitution matrix derived from about 2,000 local multiple
alignments of protein sequence segments, representing highly
conserved regions of more than 500 groups of related proteins
(Henikoff et al. (1992), Proc. Natl Acad. Sci. USA,
89:10915-10919). The BLOSUM62 substitution frequencies can be used
to define conservative amino acid substitutions that, in some
embodiments, are introduced into the amino acid sequences described
or disclosed herein. Although it is possible to design amino acid
substitutions based solely upon chemical properties (as discussed
above), the language "conservative amino acid substitution"
preferably refers to a substitution represented by a BLOSUM62 value
of greater than -1. For example, an amino acid substitution is
conservative if the substitution is characterized by a BLOSUM62
value of 0, 1, 2, or 3. According to this system, preferred
conservative amino acid substitutions are characterized by a
BLOSUM62 value of at least 1 (e.g., 1, 2 or 3), while more
preferred conservative amino acid substitutions are characterized
by a BLOSUM62 value of at least 2 (e.g., 2 or 3).
[0062] The DNA segments of the present technology include those
encoding biologically functional equivalent Smad7 proteins and
peptides, as described above. Such sequences may arise as a
consequence of codon redundancy and amino acid functional
equivalency that are known to occur naturally within nucleic acid
sequences and the proteins thus encoded. Alternatively,
functionally equivalent proteins or peptides may be created via the
application of recombinant DNA technology, in which changes in the
protein structure may be engineered, based on considerations of the
properties of the amino acids being exchanged. Changes designed by
man may be introduced through the application of site-directed
mutagenesis techniques or may be introduced randomly and screened
later for the desired function, as described elsewhere.
[0063] As described in greater detail below, the Smad7 nucleic acid
sequence has been optimized for expression in alternative host
organisms (e.g., non-human). Although as described above, the
genetic code is degenerate, so frequently one amino acid may be
coded for by two or more nucleotide codons. Thus, multiple nucleic
acid sequences may encode one amino acid sequence. Although this
creates identical proteins, the nucleic acids themselves are
distinct, and can have distinct properties. As described herein,
one aspect of the choice of codon usage can be (but is not limited
to) the ability to express a protein in a non-native cells (e.g., a
human protein in bacteria or yeast), or the level of expression in
such cells. In order to obtain enough protein for purification,
testing, and use in in vitro assays, in animal models, and
eventually in clinical development, efficient protein expression in
non-human systems is needed.
[0064] A series of 23 arginine amino acids in the human Smad7
protein sequence coded for by one or more of AGG (1.7% codon
utilization; 9 residues), AGA (2.8% codon utilization; 2 residues),
CGA (3.5% codon utilization; 4 residues), or CGG (5.4% codon
utilization; 8 residues) has been identified, and it has been
determined that in order to have efficient protein expression from
non-human sources, such as, but not limited to, bacteria and/or
yeast that one or more, and potentially all the arginine codons
should be modified to CGT (20.6% codon utilization). Therefore, in
some embodiments, the Smad7 codon-optimized nucleic acid sequence
includes at least 1, at least 2, at least 3, at least 4, at least
5, at least 6, at least 7, at least 8, at least 9, at least 10, at
least 11, at least 12, at least 13, at least 14, at least 15, at
least 16, at least 17, at least 18, at least 19, at least 20, at
least 21, at least 22, or 23 codons for arginine that have been
changed to CGT. In some embodiments, the Smad7 codon-optimized
nucleic acid sequence includes one or more or all of the arginine
codons at nucleic acid sequence positions 7-9, 43-45, 169-171,
403-405, 490-492, 526-528, 526-528, 823-825, 1057-1059, 16-18,
136-138, 199-201, 598-600, 31-33, 112-114, 316-318, 772-774,
940-942, 973-975, 1135-1137, 1276-1278, 637-639, or 814-816 be
changed to CGT.
[0065] A series of 33 serine residues in the human Smad7 protein
sequence coded for by TCC or TCG (9%) has been identified, and it
has been determined that it may be beneficial to efficient protein
expression and purification from non-human sources, such as, but
not limited to, bacteria and/or yeast, that one or more, and
potentially all the serine codons be modified to AGC (15% codon
utilization). Therefore, in some embodiments, the Smad7
codon-optimized nucleic acid sequence includes at least 1, at least
2, at least 3, at least 4, at least 5, at least 6, at least 7, at
least 8, at least 9, at least 10, at least 11, at least 12, at
least 13, at least 14, at least 15, at least 16, at least 17, at
least 18, at least 19, at least 20, at least 21, at least 22, at
least 23, at least 24, at least 25, at least 26, at least 27, at
least 28, at least 29, at least 30, at least 31, at least 32 or 33
codons for serine that have been changed to (AGC). In some
embodiments, the Smad7 codon-optimized nucleic acid sequence
includes one or more or all of the serine codons at nucleic acid
sequence positions 19-21, 46-48, 133-135, 292-294, 349-351,
451-453, 454-456, 460-462, 511-513, 514-516, 544-546, 595-597,
616-618, 634-636, 691-693, 694-696, 739-741, 745-747, 775-777,
847-849, 907-909, 919-921, 943-945, 1006-1008, 1009-1101,
1030-1032, 1054-1056, 1093-1095, 1126-1128, 1192-1194, 1237-1239,
1240-1242, 1273-1275. Of these, 23 codons (19-21, 292-294, 349-351,
451-453, 454-456, 460-462, 511-513, 514-516, 544-546, 616-618,
634-636, 691-693, 694-696, 739-741, 745-747, 775-777, 847-849,
907-909, 919-921, 1009-1101, 1030-1032, 1054-1056, 1093-1095) can
be changed without introducing potential alternative open reading
frames.
[0066] A series of 12 histidine residues in the human Smad7 protein
sequence coded for by CAC (9.6% codon usage) has also been
identified, and it has been determined that it may be beneficial to
efficient protein expression and purification from non-human
sources, such as but not limited to bacteria and/or yeast, that one
or more, and potentially all the serine codons be modified to CAT
(optionally to 12.6% usage). Therefore, in some embodiments, the
Smad7 codon-optimized nucleic acid sequence includes at least 1, at
least 2, at least 3, at least 4, at least 5, at least 6, at least
7, at least 8, at least 9, at least 10, at least 11, or 12 codons
for histidine that have been changed to (CAT). In some embodiments,
the Smad7 codon-optimized nucleic acid sequence includes one or
more or all of the serine codons at nucleic acid sequence positions
142-144, 214-216, 217-219, 220-222, 226-228, 289-291, 589-591,
778-780, 1072-1074, 1147-1149. Of these, 4 codons (nucleotides
217-219, 220-222, 589-591, 778-780) can be changed without
introducing potential alternative open reading frames.
[0067] In some embodiments, one or more codon-optimized nucleic
acids may include one or more of at least one and any integer up to
22 of its arginine codons modified to CGT, at least one and any
integer up to 28 of its serine codons (optionally that are able to
be modified with introducing open reading frames) modified to AGC,
or at least one and any integer up to 12 of its histidine codons
(optionally that are able to be modified with introducing open
reading frames) modified to CAT. In some embodiments, one or more
codon-optimized nucleic acid may include at least one and any
integer up to 22 of its arginine codons modified to CGT, at least
one and any integer up to 28 of its serine codons (optionally that
are able to be modified with introducing open reading frames)
modified to AGC, and at least one and any integer up to 12
(optionally that are able to be modified with introducing open
reading frames) of its histidine codons modified to CAT. In some
embodiments, one or more codon-optimized nucleic acid may include
22 of its arginine codons modified to CGT, 28 of its serine codons
(optionally that are able to be modified with introducing open
reading frames) modified to AGC, and 12 of its histidine codons
(optionally that are able to be modified with introducing open
reading frames) modified to CAT. In some embodiments, one or more
codon-optimized nucleic acid may also have a nucleotide
substitution in the codon for Met216 (ATG), to form the codon for
Leu216 (CTG).
[0068] In some embodiments, one or more codon-optimized nucleic
acids may have about 65% to 75%, about 65% to 68%, about 68% to
75%, or about 68% to 71% homology to human Smad7 wild-type cDNA
(SEQ ID NOs: 22, 88). In some embodiments, one or more
codon-optimized nucleic acid may have about 65%, 66%, 67%, 68%,
69%, 70%, 71%, 72%, 73%, 74%, or 75%, homology to human Smad7
wild-type cDNA (SEQ ID NOs: 12, 22). In some embodiments, one or
more codon-optimized nucleic acid may also have a nucleotide
substitution in the codon for Met216 (ATG), to form the codon for
Leu216 (CTG).
[0069] A methionine codon (Met216; ATG) that has the potential for
being perceived by translation machinery (e.g., such as but not
limited bacteria or yeast) as an alternative open reading frame has
been identified. Although not intending to be bound by theory, it
is believed that the presence of the second potential open reading
frame may decrease expression of the Smad7 protein. In some
embodiments, one or more Smad7 nucleic acid sequences are modified
at nucleotide position (646-648) to encode a human Smad7 protein
where Met216 (ATG) is modified to Leu216 (CTG).
[0070] It has also been discovered that various truncated forms and
fragments of Smad7 protein retain one or more of the activities of
full-length human Smad7, such as, but not limited to, increasing
proliferation, reducing or inhibiting cell death, reducing
excessive inflammation, preventing DNA damage, and/or increasing
cell migration, as well as treating or preventing one or more
disease or disorders in which such treatment would be helpful as
further discussed herein. Such activities can be assessed using one
or more assays including, but not limited to, the ability to block
phosphorylation of Smad2 and/or nuclear translocation of the
NF-.kappa.B p50 subunit, increase cell proliferation, reduce
apoptosis and/or radiation-induced DNA damage, reduce inflammation
and/or angiogenesis, promote healing in oral mucositis, surgical
wounds, diabetes wounds, and/or wounds associated with chronic
inflammation in mice.
[0071] Further, in some embodiments, various truncated forms and
fragments of Smad7 protein retain only a subset of the one or more
of the activities of full-length human Smad7. For example, the
C-terminal MH2 domain of Smad7 may primarily mediate the
anti-inflammatory effect of Smad7. Smad7 peptides having this
anti-inflammatory function may be sufficient and optionally an
improvement for treating chronic inflammation associated
conditions, such as but not limited to, oral mucositis, stomatitis,
arthritis, and psoriasis, among others. The N-terminal MH1 domain
may primarily mediate cell migration and/or blocking
TGF-.beta.-induced growth arrest and/or fibrotic response. Smad7
peptides having this cell migration and proliferation function may
be sufficient, and optionally an improvement, for enhancing healing
that is not associated with excessive inflammation. Types of wounds
that might benefit from this form of treatment include, but are not
limited to, surgical wounds, fibrotic scarring, and diabetes
wounds, defective healing and/or scarring among others.
[0072] In some embodiments, nucleic acid molecules (optionally
codon-optimized nucleic acid molecules as described above and
herein) encode fragments or truncated forms of Smad7 protein
(optionally including Leu216). In some embodiments, these fragments
and/or truncated forms of Smad7 protein retain one or more or all
of the activities of full-length human Smad7 protein. In some
embodiments, such truncated nucleic acid sequences encode the
N-terminal portion of the Smad7 protein. In some embodiments, such
truncated nucleic acid sequences encode the C-terminal portion of
the Smad7 protein. In some embodiments, such truncated nucleic acid
sequences (nucleotide positions 4-774) encode amino acids 2-258 of
the human Smad7 protein. In some embodiments, such truncated
nucleic acid sequences (nucleotide positions 775-1278) encode amino
acids 259-426 of the human Smad7 protein. In some embodiments, such
fragments of the nucleic acid sequences (nucleotide positions
607-774) encode amino acids 203-258 of the human Smad7 protein.
[0073] The term "truncated" as used herein in reference to nucleic
acid molecules refers to a molecule that contains nucleotide
sequences encoding the natural N-terminus of a corresponding
protein (with or without a cleaved leader sequence), but lacks one
or more nucleotides starting from the C-terminus-encoding portion
of the molecule, or a molecule that contains nucleotide sequences
encoding the natural C-terminus of a corresponding protein (with or
without a cleaved leader sequence), but lacks one or more
nucleotides starting from the N-terminus-encoding portion of the
molecule. In some embodiments, molecules lacking nucleotides
encoding at least about 25, at least about 50, at least about 75,
at least about 100, at least about 125, at least about 150, at
least about 200, at least about 250, at least about 300, or at
least about 350, or at least about 400 amino acids from one or the
other terminus are specifically provided. Similarly, the term
"truncated" may also be used in reference to protein molecules
encoded by truncated nucleic acid molecules. In some embodiments, a
"truncated" molecule is biologically active, having (or encoding a
polypeptide having) one or more of the Smad7 activities described
herein.
[0074] The term "fragment" as used herein in reference to nucleic
acid molecules refers to a molecule containing contiguous residues
of a full length sequence but lacking some 5' and/or 3' sequences
of the full length sequence. In some embodiments, a "fragment"
includes a portion of one or more of the full length sequences
described herein. In some embodiments, the "fragment" does not
include sequences encoding either the N-terminal or the C-terminal,
but only internal fragments. In some embodiments, a "fragment"
encodes a polypeptide that is biologically active, having one or
more of the Smad7 activities described herein. In some embodiments,
nucleic acid fragments may encode proteins having at least about
25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100,
150, 200, 250, 300, 350, 400, 450, 500, 550, 600, 650, 700, 750,
800, 850, 900, 950, 1000, 1050, 1100, 1150 amino acids. Similarly,
"fragment" may also be used in reference to protein molecules
encoded by Smad7 nucleic acid fragments.
[0075] The term "N-terminal portion" as used herein in refers to a
fragment of a corresponding protein that contains the protein's
N-terminus but lacks all sequences C-terminal to an internal
residue.
[0076] The term "C-terminal portion" as used herein in refers to a
fragment of a corresponding protein that contains the protein's
C-terminus but lacks all sequences N-terminal to an internal
residue.
[0077] Although not intending to be bound by theory, the Smad7
protein activity is generally believed to be the result of
interactions in both the cytoplasm and nucleus of a cell. For that
reason among others, there existed a general belief that Smad7
protein was not a candidate for a therapeutic role. However, it was
decided to pursue development of Smad7 as a protein therapeutic,
and modify the Smad7 nucleic acid sequence to encode a protein
transduction domain (PTD) in frame with the Smad7 nucleic acid
sequence (e.g., optionally any nucleic acid sequence described
herein encoding Smad7 protein, including human wild-type and
codon-optimized sequences, both full-length and biologically active
fragments or truncated portions). In some embodiments, the PTD is
located at the 3' end of the Smad7 nucleic acid sequence, and in
some embodiments the PTD is located at the 5' end of the Smad7
nucleic acid sequence. In some embodiments, there is a linker
sequence encoding 1, 2, 3, 4, 5, or 6 amino acids that connects the
PTD and the Smad7 nucleic acid sequence.
[0078] In some embodiments, the PTD nucleic acid sequence is a Tat
nucleic acid sequence. ggccgtaaaaaacgccgtcaacgccgccgt (SEQ ID NO:
1) encoding GRKKRRQRRR (SEQ ID NO: 2),
tatggccgtaaaaaacgccgtcaacgccgccgt (SEQ ID NO: 3) encoding
YGRKKRRQRRR (SEQ ID NO: 4), or ggccgtaaaaaacgccgtcaa (SEQ ID NO: 5)
encoding GRKKRRQ (SEQ ID NO: 6).
[0079] In some embodiments, the nucleic acid sequence further
includes a nucleotide sequence encoding one or more of an epitope
tag or a purification tag. In some embodiments, the epitope tag is
V5. In some embodiments, the purification tag is one or more of
glutathione-S-Transferase (GST) or 6-histidine (H6) (SEQ ID NO:
40).
[0080] The term "epitope tag" as used herein in reference to
nucleic acid molecules refers to nucleotides encoding peptide
sequences that are recognized and bound by the variable region of
an antibody or fragment. In some embodiments, the epitope tag is
not part of the native protein. In some embodiments, the epitope
tag is removable. In some embodiments, the epitope tag is not
intrinsic to the protein's native biological activity. Examples of
epitope tags include, but are not limited to V5.
[0081] The term "purification tag" as used herein in reference to
nucleic acid molecules refers to nucleotides encoding peptide
sequences that facilitate the purification of the protein, but are
generally not necessary for the protein's biological activity. In
some embodiments, purification tags may be removed following
protein purification. Examples of purification tags include, but
are not limited to GST and H-6 (SEQ ID NO: 40).
[0082] 2. Vectors for Cloning, Gene Transfer and Expression
[0083] Within certain embodiments, expression vectors are employed
to express the Smad7 polypeptide product, which can then be
purified for various uses. In other embodiments, the expression
vectors are used in gene therapy. Expression requires that
appropriate signals be provided in the vectors, and which include
various regulatory elements, such as enhancers/promoters from both
viral and mammalian sources that drive expression of the genes of
interest in host cells. Elements designed to optimize messenger RNA
stability and translatability in host cells also are defined. The
conditions for the use of a number of dominant drug selection
markers for establishing permanent, stable cell clones expressing
the products are also provided, as is an element that links
expression of the drug selection markers to expression of the
polypeptide.
[0084] Throughout this application, the term "expression construct"
is meant to include any type of genetic construct containing a
nucleic acid coding for a gene product in which part or all of the
nucleic acid encoding sequence is capable of being transcribed. The
transcript may be translated into a protein, but it need not be. In
certain embodiments, expression includes both transcription of a
gene and translation of mRNA into a gene product. In other
embodiments, expression only includes transcription of the nucleic
acid encoding a gene of interest.
[0085] The term "vector" is used to refer to a carrier nucleic acid
molecule into which a nucleic acid sequence can be inserted for
introduction into a cell where it can be replicated. A nucleic acid
sequence can be "exogenous," which means that it is foreign to the
cell into which the vector is being introduced or that the sequence
is homologous to a sequence in the cell but in a position within
the host cell nucleic acid in which the sequence is ordinarily not
found. Vectors include plasmids, cosmids, viruses (bacteriophage,
animal viruses, and plant viruses), and artificial chromosomes
(e.g., YACs). One of skill in the art would be well-equipped to
construct a vector through standard recombinant techniques, which
are described, e.g., in Sambrook, et al., Molecular Cloning (Cold
Spring Harbor Lab Press, 1989), and Ausubel, et al., Current
Protocols in Molecular Biology (Wiley, 1994), both incorporated
herein by reference.
[0086] The term "expression vector" refers to a vector containing a
nucleic acid sequence coding for at least part of a gene product
capable of being transcribed. In some cases, RNA molecules are then
translated into a protein, polypeptide, or peptide. In other cases,
these sequences are not translated, for example, in the production
of antisense molecules or ribozymes. Expression vectors can contain
a variety of "control sequences," which refer to nucleic acid
sequences necessary for the transcription and possibly translation
of an operably linked coding sequence in a particular host
organism, including promoters and enhancers. In addition to control
sequences that govern transcription and translation, vectors and
expression vectors may contain nucleic acid sequences that serve
other functions, such as transcription termination signals and
poly-adenylation sites.
[0087] The capacity of certain viral vectors to efficiently infect
or enter cells, to integrate into a host cell genome and stably
express viral genes, have led to the development and application of
a number of different viral vector systems. Robbins, et al.,
Pharmacol. Ther. 80:35-47 (1998). Viral systems are currently used
as vectors for ex vivo and in vivo gene transfer. For example,
adenovirus, herpes-simplex virus, lentiviruses, retrovirus and
adeno-associated virus vectors are being evaluated currently for
treatment of diseases such as cancer, cystic fibrosis, Gaucher
disease, renal disease and arthritis. Robbins, et al., Pharmacol.
Ther. 80:35-47 (1998); Imai, et al., Nephrologie 19:379-402 (1998);
U.S. Pat. No. 5,670,488. The various viral vectors present specific
advantages and disadvantages, depending on the particular
gene-therapeutic application.
[0088] Suitable non-viral methods for nucleic acid delivery for
transformation of an organelle, a cell, a tissue or an organism for
use with the present technology are believed to include virtually
any method by which a nucleic acid (e.g., DNA) can be introduced
into an organelle, a cell, a tissue or an organism, as described
herein or as would be known to one of ordinary skill in the art.
Such methods include, but are not limited to, direct delivery of
DNA such as by injection (U.S. Pat. Nos. 5,994,624, 5,981,274,
5,945,100, 5,780,448, 5,736,524, 5,702,932, 5,656,610, 5,589,466
and 5,580,859, each incorporated herein by reference), including
microinjection (Harland and Weintraub, 1985; U.S. Pat. No.
5,789,215, incorporated herein by reference); by electroporation
(U.S. Pat. No. 5,384,253, incorporated herein by reference); by
calcium phosphate precipitation (Graham, et al., Virology
52:456-467 (1973); Chen, et al., Mol. Cell Biol. 7:2745-2752
(1987); Rippe, et al., Mol. Cell Biol. 10:689-695 (1990)); by using
DEAE-dextran followed by polyethylene glycol (Gopal, Mol. Cell
Biol. 5:1188-1190 (1985)); by direct sonic loading (Fechheimer, et
al., PNAS 84:8463-8467 (1987)); by liposome mediated transfection
(Nicolau, et al., Biochim. Biophys. Acta 721:185-190 (1982);
Fraley, et al., PNAS 76:3348-3352 (1979); Nicolau, et al., Methods
Enzymol. 149: 157-176 (1987); Wong, et al., Gene 10:87-94 (1980);
Kaneda, et al., J. Biol. Chem. 264:12126-12129 (1989); Kato, et
al., J. Biol. Chem. 266:3361-3364 (1991)); by microprojectile
bombardment (PCT Application Nos. WO 94/09699 and 95/06128; U.S.
Pat. Nos. 5,610,042; 5,322,783, 5,563,055, 5,550,318, 5,538,877 and
5,538,880, and each incorporated herein by reference); by agitation
with silicon carbide fibers (Kaeppler, et al., Plant Cell Rep.
9:415-418 (1990); U.S. Pat. Nos. 5,302,523 and 5,464,765, each
incorporated herein by reference); or by PEG-mediated
transformation of protoplasts (Omirulleh, et al., Plant Mol. Biol.
21:415-428 (1993); U.S. Pat. Nos. 4,684,611 and 4,952,500, each
incorporated herein by reference); by
desiccation/inhibition-mediated DNA uptake (Potrykus, et al., Mol.
Gen. Genet. 199:169-177 (1985)). Through the application of
techniques such as these, organelle(s), cell(s), tissue(s) or
organism(s) may be stably or transiently transformed.
[0089] 3. Expression Systems
[0090] Numerous expression systems exist that comprise at least a
part or all of the compositions discussed above. Prokaryote- and/or
eukaryote-based systems can be employed for use with the present
technology to produce nucleic acid sequences, or their cognate
polypeptides, proteins and peptides. Many such systems are
commercially and widely available.
[0091] The insect cell/baculovirus system can produce a high level
of protein expression of a heterologous nucleic acid segment, such
as described in U.S. Pat. Nos. 5,871,986 and 4,879,236, both herein
incorporated by reference, and which can be bought, for example,
under the name MAXBAC.RTM. 2.0 from INVITROGEN.RTM. and BACPACK.TM.
BACULOVIRUS EXPRESSION SYSTEM FROM CLONTECH.RTM..
[0092] Other examples of expression systems include
STRATAGENE.RTM.'s COMPLETE CONTROL.TM. Inducible Mammalian
Expression System, which involves a synthetic ecdysone-inducible
receptor, or its pET Expression System, an E. coli expression
system. Another example of an inducible expression system is
available from INVITROGEN.RTM., which carries the T-REX.TM.
(tetracycline-regulated expression) System, an inducible mammalian
expression system that uses the full-length CMV promoter.
INVITROGEN.RTM. also provides a yeast expression system called the
Pichia methanolica Expression System, which is designed for
high-level production of recombinant proteins in the methylotrophic
yeast Pichia methanolica. One of skill in the art would know how to
express a vector, such as an expression construct, to produce a
nucleic acid sequence or its cognate polypeptide, protein, or
peptide.
[0093] Primary mammalian cell cultures may be prepared in various
ways. In order for the cells to be kept viable while in vitro and
in contact with the expression construct, it is necessary to ensure
that the cells maintain contact with the correct ratio of oxygen
and carbon dioxide and nutrients but are protected from microbial
contamination. Cell culture techniques are well documented.
[0094] One embodiment of the foregoing involves the use of gene
transfer to immortalize cells for the production of proteins. The
gene for the protein of interest may be transferred as described
above into appropriate host cells followed by culture of cells
under the appropriate conditions. The gene for virtually any
polypeptide may be employed in this manner. The generation of
recombinant expression vectors, and the elements included therein,
are discussed above. Alternatively, the protein to be produced may
be an endogenous protein normally synthesized by the cell in
question.
[0095] Examples of useful mammalian host cell lines are Vero and
HeLa cells and cell lines of Chinese hamster ovary, W138, BHK,
COS-7, 293, HepG2, NIH3T3, RIN and MDCK cells. In addition, a host
cell strain may be chosen that modulates the expression of the
inserted sequences, or modifies and process the gene product in the
manner desired. Such modifications (e.g., glycosylation) and
processing (e.g., cleavage) of protein products may be important
for the function of the protein. Different host cells have
characteristic and specific mechanisms for the post-translational
processing and modification of proteins. Appropriate cell lines or
host systems can be chosen to insure the correct modification and
processing of the foreign protein expressed.
[0096] A number of selection systems may be used including, but not
limited to, HSV thymidine kinase, hypoxanthine-guanine
phosphoribosyltransferase and adenine phosphoribosyltransferase
genes, in tk-, hgprt- or aprt-cells, respectively. Also,
anti-metabolite resistance can be used as the basis of selection
for dhfr, that confers resistance to; gpt, that confers resistance
to mycophenolic acid; neo, that confers resistance to the
aminoglycoside G418; and hygro, that confers resistance to
hygromycin.
[0097] As used herein, the terms "cell," "cell line," and "cell
culture" may be used interchangeably. All of these terms also
include their progeny, which are any and all subsequent
generations. It is understood that all progeny may not be identical
due to deliberate or inadvertent mutations. In the context of
expressing a heterologous nucleic acid sequence, "host cell" refers
to a prokaryotic or eukaryotic cell, and it includes any
transformable organism that is capable of replicating a vector
and/or expressing a heterologous gene encoded by a vector. A host
cell can, and has been, used as a recipient for vectors. A host
cell may be "transfected" or "transformed," which refers to a
process by which exogenous nucleic acid is transferred or
introduced into the host cell. A transformed cell includes the
primary subject cell and its progeny.
[0098] Host cells may be derived from prokaryotes or eukaryotes
(e.g., bacteria or yeast), depending upon whether the desired
result is replication of the vector or expression of part or all of
the vector-encoded nucleic acid sequences. Numerous cell lines and
cultures are available for use as a host cell, and they can be
obtained through the American Type Culture Collection (ATCC), which
is an organization that serves as an archive for living cultures
and genetic materials (atcc.org). An appropriate host can be
determined by one of skill in the art based on the vector backbone
and the desired result. A plasmid or cosmid, for example, can be
introduced into a prokaryote host cell for replication of many
vectors. Bacterial cells used as host cells for vector replication
and/or expression include DH5.alpha., JM109, and KC8, as well as a
number of commercially available bacterial hosts such as SURE.RTM.
Competent Cells and SOLOPACK.TM. Gold Cells (STRATAGENE.RTM., La
Jolla). Alternatively, bacterial cells such as E. coli LE392 could
be used as host cells for phage viruses.
[0099] Examples of eukaryotic host cells for replication and/or
expression of a vector include HeLa, NIH3T3, Jurkat, 293, Cos, CHO,
Saos, and PC12. Many host cells from various cell types and
organisms are available and would be known to one of skill in the
art. Similarly, a viral vector may be used in conjunction with
either a eukaryotic or prokaryotic host cell, particularly one that
is permissive for replication or expression of the vector.
[0100] Some vectors may employ control sequences that allow it to
be replicated and/or expressed in both prokaryotic and eukaryotic
cells. One of skill in the art would further understand the
conditions under which to incubate all of the above described host
cells to maintain them and to permit replication of a vector. Also
understood and known are techniques and conditions that would allow
large-scale production of vectors, as well as production of the
nucleic acids encoded by vectors and their cognate polypeptides,
proteins, or peptides.
B. Smad7 Proteins and Protein Fragments
[0101] Mothers against decapentaplegic homolog 7 (Smad7) was
previously identified as an antagonist of TGF-.beta. signaling by
several mechanisms including: (a) blockade of TGF-.beta.
receptor-mediated phosphorylation and nuclear translocation of
signaling Smads; (b) increased degradation of TGF-.beta. receptors
and signaling Smads through specific ubiquitin-proteasome pathways
and (c) inhibition of signaling Smads for their binding to Smad
binding elements (SBEs). Smad7 also antagonizes other signaling
pathways, like the NF-.kappa.B pathway.
[0102] Smad7 protein is encoded by the SMAD7 gene, discussed above.
Like many other TGF-.beta. family members, Smad7 is involved in
cell signaling. It is a TGF-.beta. type 1 receptor antagonist. It
blocks TGF-.beta.1 and activin associating with the receptor,
blocking access to Smad2. It is an inhibitory Smad (I-SMAD) and is
enhanced by SMURF2. Smad7 also enhances muscle differentiation.
[0103] The terms "polypeptide," "peptide," and "protein" are used
interchangeably herein to refer to a polymer of amino acid
residues. The terms apply to naturally occurring amino acid
polymers as well as amino acid polymers in which one or more amino
acid residues is a non-naturally occurring amino acid, for example,
an amino acid analog. As used herein, the terms encompass amino
acid chains of any length, including full length proteins, wherein
the amino acid residues are linked by covalent peptide bonds.
[0104] In one embodiment, the present technology relates to Smad7
protein compositions. In addition to the entire Smad7 molecule, the
present technology also relates to truncated portions and fragments
of the polypeptide that retain one or more activity associated with
Smad7, such as, but not limited to, increasing proliferation,
reducing or inhibiting cell death, reducing excessive inflammation,
preventing DNA damage, and/or increasing cell migration, as well as
treating or preventing one or more disease or disorders in which
such treatment would be helpful as further discussed herein. Such
activities can be assessed using one or more assays including, but
not limited to, the ability to block phosphorylation of Smad2
and/or nuclear translocation of the NF-.kappa.B p50 subunit,
increase cell proliferation, reduce apoptosis and/or
radiation-induced DNA damage, reduce inflammation and/or
angiogenesis, promote healing in oral mucositis, surgical wounds,
diabetes wounds, and/or wounds associated with chronic inflammation
in mice.
[0105] Protein fragments may be generated by genetic engineering of
translation stop sites within the coding region (discussed below).
Alternatively, treatment of the Smad7 molecule with proteolytic
enzymes, known as proteases, can produces a variety of N-terminal,
C-terminal and internal fragments. These fragments may be purified
according to known methods, such as precipitation (e.g., ammonium
sulfate), HPLC, ion exchange chromatography, affinity
chromatography (including immunoaffinity chromatography) or various
size separations (sedimentation, gel electrophoresis, gel
filtration).
[0106] As used herein, reference to an isolated protein or
polypeptide in the present embodiments include full-length
proteins, fusion proteins, chimeric proteins, or any fragment
(truncated form, portion) or homologue of such a protein. More
specifically, an isolated protein can be a protein (including a
polypeptide or peptide) that has been removed from its natural
milieu (i.e., that has been subject to human manipulation), and can
include, but is not limited to, purified proteins, partially
purified proteins, recombinantly produced proteins, proteins
complexed with lipids, soluble proteins, synthetically produced
proteins, and isolated proteins associated with other proteins. As
such, "isolated" does not reflect the extent to which the protein
has been purified. Preferably, an isolated protein is produced
recombinantly.
[0107] Variants of Smad7 are also provided--these can be
substitutional, insertional or deletion variants. Deletion variants
lack one or more residues of the native protein that are not
essential for activity, including the truncation mutants described
above and herein. Substitutional variants typically contain the
exchange of one amino acid for another at one or more sites within
the protein, and may be designed to modulate one or more properties
of the polypeptide, such as stability against proteolytic cleavage
and/or translation and/or transcription (protein expression),
without the loss of other functions or properties. Substitutions of
this kind preferably are conservative, that is, one amino acid is
replaced with one of similar shape and charge. Conservative
substitutions are well known in the art and include, for example,
each amino acid can be changed or substituted with a different
amino acid. In making substitutional variants, the hydropathic
index, hydrophilicity, charge and size are normally considered.
[0108] Specifically contemplated deletion variants of Smad7 include
truncations and fragments, for example, including polypeptide
molecules having N-terminal sequences, but not C-terminal
sequences, having C-terminal sequences but not N-terminal
sequences, or having internal sequences, but not N-terminal or
c-terminal sequences. Specifically contemplated Smad7 polypeptide
truncations or fragments include, but are not limited to, molecules
including amino acid residues 2-258, 259-426, and 203-258
corresponding to the native human Smad7 protein sequence.
[0109] The term "truncated" as used herein in reference to protein
sequences refers to a molecule that contains the natural N-terminus
of a corresponding protein (with or without a cleaved leader
sequence), but lacks one or more amino acids starting from the
C-terminus of the molecule, or a molecule that contains the natural
C-terminus of a corresponding protein (with or without a cleaved
leader sequence), but lacks one or more amino acids starting from
the N-terminus of the molecule. In some embodiments, molecules
lacking at least about 25, at least about 50, at least about 75, at
least about 100, at least about 125, at least about 150, at least
about 200, at least about 250, at least about 300, or at least
about 350, or at least about 400 amino acids from one or the other
terminus are specifically provided. In some embodiments, a
"truncated" molecule is biologically active, having one or more of
the Smad7 activities described herein.
[0110] The term "fragment" as used herein in reference to
polypeptide sequences refers to a molecule containing contiguous
residues of a full length sequence but lacking some N-terminal
and/or C-terminal residues of the full length sequence. In some
embodiments, a "fragment" includes a portion of one or more of the
full length sequences described herein. In some embodiments, the
"fragment" does not include sequences encoding either the
N-terminal or the C-terminal, but only internal fragments. In some
embodiments, a "fragment" encodes a polypeptide that is
biologically active, having one or more of the Smad7 activities
described herein. In some embodiments, polypeptide fragments have
at least about 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85,
90, 95, 100, 150, 200, 250, 300, 350, 400, 450, 500, 550, 600, 650,
700, 750, 800, 850, 900, 950, 1000, 1050, 1100, 1150 amino
acids.
[0111] A specialized kind of variant is the fusion protein. This
molecule generally has all or a substantial portion of the native
molecule, linked at the N- or C-terminus, to all or a portion of a
second polypeptide. However, in some embodiments, the fusion
protein may include any one of the fragments and/or truncated
(N-terminal, C-terminal) Smad7 proteins described throughout the
disclosure. For example, fusions may employ leader sequences from
other species to permit the recombinant expression of a protein in
a heterologous host. Another useful fusion includes the addition of
an optional functionally active domain, such as but not limited to
an antibody epitope and/or a purification tag (e.g., V5:
GKPIPNPLLGLDST (SEQ ID NO: 41); Flag: KYKDDDDK (SEQ ID NO: 42); HA:
YPYDVPDYA (SEQ ID NO: 43)). Another type of fusion includes
attaching a domain that can act as the target for an activating or
inactivating ligand, thereby permitting control of the fusion
protein's function once delivered to a subject. Such domains
include, for example, steroid ligand binding (e.g., ER, PR, GR),
which can be activated by small molecules, e.g., 4-hydroxyl
tamoxifen or RU486 that are either uniquely able to activate those
steroid ligand binding domains and/or do not exist in nature and
will therefore enable full control of the Smad7 function by the
presence of these small molecules.
[0112] Another specific form of a fusion protein finding particular
utility in the present technology is a fusion including a protein
transduction domain (PTD), also called a cell delivery domain or
cell transduction domain. Such domains have been described in the
art and are generally characterized as short amphipathic or
cationic peptides and peptide derivatives, often containing
multiple lysine and arginine resides (Fischer, Med. Res. Rev.
27:755-795 (2007)). In some embodiments, the PTD is one or more
variant of TAT protein from HIV (GRKKRRQRRR (SEQ ID NO: 2),
YGRKKRRQRRR (SEQ ID NO: 4), or GRKKRRQ (SEQ ID NO: 6)) or
alternatively, HSV VP 16. Alternate forms of Tat may be used. In
some embodiments, a linker may be used to connect one or more PTDs
and SMad7. In some embodiments, the PTD (optionally Tat) is fused
or linked in frame to the N-terminal and/or C-terminal end of any
one of the Smad7 full-length, fragments, and/or truncated
(N-terminal, C-terminal) proteins described throughout the
disclosure. Other examples of PTDs provided by the present
technology are shown in Table 1.
TABLE-US-00001 TABLE 1 PROTEIN TRANSDUCTION DOMAINS SEQ ID NO:
GALFLGWLGAAGSTMGAKK 44 KRKV RQIKIWFQNRRMKWKK 45 RRMKWKK 46
RRWRRWWRRWWRRWRR 47 RGGRLSYSRRRFSTSTGR 48 YGRKKRRQRRR 4 RKKRRQRRR
49 YARAAARQARA 50 RRRRRRRR 51 KKKKKKKK 52 GWTLNSAGYLLGKINLKALA 53
ALAKXIL LLILLRRRIRKQANAHSK 54 SRRHHCRSKAKRSRHH 55 NRARRNRRRVR 56
RQLRIAGRRLRGRSR 57 KLIKGRTPIKFGK 58 RRIPNRRPRR 59
KLALKLALKALKAALKLA 60 KLAKLAKKLAKLAK 61 GALFLGFLGAAGSTNGAWSQ 62
PKKKRKV KETWWETWWTEWSQPKKKR 63 KV LKKLLKKLLKKLLKKLLKKL 64
QAATATRGRSAASRPTERPR 65 APARSASRPRRPVE MGLGLHLLVLAAALQGAKS 66 KRKV
AAVALLPAVLLALLAPAAA 67 NYKKPKL MANLGYWLLALFVTMWTD 68 VGLCKKRPKP
LGTYTQDFNKFHTFPQTAIG 69 VGAP DPKGDPKGVTVTVTVTVTG 70 KGDPXPD
PPPPPPPPPPPPPP 71 VRLPPPVRLPPPVRLPPP 72 PRPLPPPRPG 73
SVRRRPRPPYLPRPRPPPFFPP 74 RLPPRIPP TRSSRAGLQFPVGRVHRLLRK 75
GIGKFLHSAKKFGKAFVGEI 76 MNS KWKLFKKIEKVGQNIRDGII 77
KAGPAVAVVGQATQIAK ALWMTLLKKVLKAAAKAA 78 LNAVLVGANA
GIGAVLKVLTTGLPALISWIK 79 RKRQQ INLKALAALAKKIL 80
GFFALIPKIISSPLPKTLLSAV 81 GSALGGSGGQE LAKWALKQGFAKLKS 82
SMAQDIISTIGDLVKWIIQTV 83 NXFTKK LLGDFFRKSKEKIGKEFKRIV 84
QRIKQRIKDFLANLVPRTES PAWRKAFRWAWRMLKKAA 85 KLKLKLKLKLKLKLKLKL
86
[0113] In particular embodiments, the present technology provides
for sequence variants of Smad7 in which one or more residues have
been altered. For example, in one embodiment, the methionine
residue found at position 216 of the human Smad7 sequence is
modified to a leucine residue (ATG to CTG).
C. Methods of Treatment
[0114] Smad7-treatable diseases and disorders may include those
including one or more of reduced cell proliferation, reduced cell
migration, increased cell death, excessive inflammation, and/or DNA
damage. Smad7-related diseases and disorders may include those
where treatment with a Smad7 protein and biologically active
fragments and derivatives thereof that have one or more activities
including but not limited to increasing proliferation, reducing or
inhibiting cell death, reducing excessive inflammation, preventing
DNA damage, and/or increasing cell migration is helpful. Such
diseases and/or disorders may include but are not limited to acute
(e.g., through surgery, combat, trauma) and chronic wounds (e.g.,
ulcers, such as diabetic, pressure, venous), scarring, fibrosis,
and aberrant healing, mucositis (e.g., oral and/or
gastro-intestinal), stomatitis, proctitis, autoimmune disease
(e.g., psoriasis, arthritis), and cancer.
[0115] In some embodiments, one or more of the diseases and or
disorders described herein may be prevented, treated, and/or
ameliorated by providing to a subject in need of such treatment a
therapeutically effective amount of one or more of the Smad7
proteins (e.g., full-length or biologically active truncated (e.g.,
N-terminal or C-terminal) or fragment thereof) described in the
disclosure. In some embodiments, the one or more Smad7 proteins are
fusion proteins including a PTD domain. In some embodiments, the
one or more Smad7 proteins includes Leu216. In some embodiments,
the Smad7 proteins make part of a pharmaceutical composition
including one or more pharmaceutically acceptable excipients.
[0116] In some embodiments, one or more of the diseases and or
disorders described herein may be prevented, treated, and/or
ameliorated by providing to a subject in need of such treatment a
therapeutically effective amount of one or more of the nucleic acid
molecules encoding one or more Smad7 proteins (e.g., full-length or
biologically active truncated (e.g., N-terminal or C-terminal) or
fragment thereof) described in the disclosure. In some embodiments,
the one or more nucleic acid molecules include codon-optimized
nucleotide sequences and/or sequences that encode Leu216. In some
embodiments, the one or more Smad7 nucleic acid molecules are
provided to the subject in a construct including an expression
vector. In some embodiments, the Smad7 nucleic acid molecules
(optionally part of an expression vector) make part of a
pharmaceutical composition including one or more pharmaceutically
acceptable excipients.
[0117] The term "subject" or "patient" as used herein refers to
persons or non-human animals in need of treatment and or prevention
using one or more of the treatments described herein. In some
embodiments, non-human animals include laboratory animals such as
monkeys, mice, rats, and rabbits, domestic pets such as dogs and
cats, and livestock such as cattle, horses, pigs, goats and
sheep.
[0118] 1. Chronic Wounds
[0119] A chronic wound is a wound that does not heal in an orderly
set of stages and in a predictable amount of time the way most
wounds do; wounds that do not heal within three months are often
considered chronic. Chronic wounds seem to be detained in one or
more of the phases of wound healing. For example, chronic wounds
often remain in the inflammatory stage for too long. In acute
wounds, there is a precise balance between production and
degradation of molecules such as collagen; in chronic wounds this
balance is lost and degradation plays too large a role.
[0120] As described in more detail elsewhere herein, PTD-Smad7 has
been shown to enhance wound healing in a mouse skin model and a
mucosal model. Application of PTD-Smad7 was effective through a
topical route, which is desirable for wound treatment. Although not
intending to be bound by theory, it is believed that PTD-Smad7 may
act to treat or ameliorate chronic wounds through multiple routes,
which may include one or more of reducing inflammation, increasing
cell proliferation (e.g., keratinocytes), increasing cell migration
(e.g., keratinocytes), or reducing fibrosis (e.g., through
modulation of collagen), among others.
[0121] Chronic wounds may never heal or may take years to do so.
These wounds cause patients severe emotional and physical stress as
well as creating a significant financial burden on patients and the
whole healthcare system. Acute and chronic wounds are at opposite
ends of a spectrum of wound healing types that progress toward
being healed at different rates. The vast majority of chronic
wounds can be classified into three categories: venous ulcers,
diabetic, and pressure ulcers. A small number of wounds that do not
fall into these categories may be due to causes such as radiation
poisoning or ischemia.
[0122] Venous and Arterial Ulcers.
[0123] Venous ulcers, which usually occur in the legs, account for
about 70% to 90% of chronic wounds and mostly affect the elderly.
They are thought to be due to venous hypertension caused by
improper function of valves that exist in the veins to prevent
blood from flowing backward. Ischemia results from the dysfunction
and, combined with reperfusion injury, causes the tissue damage
that leads to the wounds.
[0124] Diabetic Ulcers.
[0125] Another major cause of chronic wounds, diabetes, is
increasing in prevalence. Diabetics have a 15% higher risk for
amputation than the general population due to chronic ulcers.
Diabetes causes neuropathy, which inhibits nociception and the
perception of pain. Thus patients may not initially notice small
wounds to legs and feet, and may therefore fail to prevent
infection or repeated injury. Further, diabetes causes immune
compromise and damage to small blood vessels, preventing adequate
oxygenation of tissue, which can cause chronic wounds. Pressure
also plays a role in the formation of diabetic ulcers.
[0126] Pressure Ulcers.
[0127] Another leading type of chronic wounds is pressure ulcers,
which usually occur in people with conditions such as paralysis
that inhibit movement of body parts that are commonly subjected to
pressure such as the heels, shoulder blades, and sacrum. Pressure
ulcers are caused by ischemia that occurs when pressure on the
tissue is greater than the pressure in capillaries, and thus
restricts blood flow into the area. Muscle tissue, which needs more
oxygen and nutrients than skin does, shows the worst effects from
prolonged pressure. As in other chronic ulcers, reperfusion injury
damages tissue.
[0128] Chronic wounds may affect only the epidermis and dermis, or
they may affect tissues all the way to the fascia. They may be
formed originally by the same things that cause acute wounds, such
as surgery or accidental trauma, or they may form as the result of
systemic infection, vascular, immune, or nerve insufficiency, or
comorbidities such as neoplasias or metabolic disorders. Although
not intending to be bound by theory, the reason a wound becomes
chronic is that the body's ability to deal with the damage is
overwhelmed by factors such as repeated trauma, continued pressure,
ischemia, or illness. Some of the major factors that lead to
chronic wounds include, but are not limited to, ischemia,
reperfusion injury, and bacterial colonization.
[0129] Ischemia.
[0130] Ischemia is an important factor in the formation and
persistence of wounds, especially when it occurs repetitively (as
it usually does) or when combined with a patient's old age.
Ischemia causes tissue to become inflamed and cells to release
factors that attract neutrophils such as interleukins, chemokines,
leukotrienes, and complement factors.
[0131] While they fight pathogens, neutrophils also release
inflammatory cytokines and enzymes that damage cells. One of their
important functions is to produce Reactive Oxygen Species (ROS) to
kill bacteria, for which they use an enzyme called myeloperoxidase.
The enzymes and ROS produced by neutrophils and other leukocytes
damage cells and prevent cell proliferation and wound closure by
damaging DNA, lipids, proteins, the ECM, and cytokines that speed
healing. Neutrophils remain in chronic wounds for longer than they
do in acute wounds, and contribute to the fact that chronic wounds
have higher levels of inflammatory cytokines and ROS. Because wound
fluid from chronic wounds has an excess of proteases and ROS, the
fluid itself can inhibit healing by inhibiting cell growth and
breaking down growth factors and proteins in the ECM.
[0132] Bacterial Colonization.
[0133] Since more oxygen in the wound environment allows white
blood cells to produce ROS to kill bacteria, patients with
inadequate tissue oxygenation, for example, those who suffered
hypothermia during surgery, are at higher risk for infection. The
host's immune response to the presence of bacteria prolongs
inflammation, delays healing, and damages tissue. Infection can
lead not only to chronic wounds but also to gangrene, loss of the
infected limb, and death of the patient.
[0134] Like ischemia, bacterial colonization and infection damage
tissue by causing a greater number of neutrophils to enter the
wound site. In patients with chronic wounds, bacteria with
resistance to antibiotics may have time to develop. In addition,
patients carrying drug resistant bacterial strains, such as
methicillin-resistant Staphylococcus aureus (MRSA), have more
chronic wounds.
[0135] Growth Factors Andproteolytic Enzymes.
[0136] Chronic wounds also differ in makeup from acute wounds in
that their levels of proteolytic enzymes such as elastase and
matrix metalloproteinases (MMPs) are higher, while their
concentrations of growth factors such as Platelet-derived growth
factor and Keratinocyte Growth Factor are lower.
[0137] Since growth factors (GFs) are imperative in timely wound
healing, inadequate GF levels may be an important factor in chronic
wound formation. In chronic wounds, the formation and release of
growth factors may be prevented, the factors may be sequestered and
unable to perform their metabolic roles, or degraded in excess by
cellular or bacterial proteases.
[0138] Chronic wounds such as diabetic and venous ulcers are also
caused by a failure of fibroblasts to produce adequate ECM proteins
and by keratinocytes to epithelialize the wound. Fibroblast gene
expression is different in chronic wounds than in acute wounds.
[0139] Although all wounds require a certain level of elastase and
proteases for proper healing, too high a concentration is damaging.
Leukocytes in the wound area release elastase, which increases
inflammation, destroys tissue, proteoglycans, and collagen, and
damages growth factors, fibronectin, and factors that inhibit
proteases. The activity of elastase is increased by human serum
albumin, which is the most abundant protein found in chronic
wounds. However, chronic wounds with inadequate albumin are
especially unlikely to heal, so regulating the wound's levels of
that protein may in the future prove helpful in healing chronic
wounds.
[0140] Excess matrix metalloproteinases, which are released by
leukocytes, may also cause wounds to become chronic. MMPs break
down ECM molecules, growth factors, and protease inhibitors, and
thus increase degradation while reducing construction, throwing the
delicate compromise between production and degradation out of
balance.
[0141] Oral Ulcers.
[0142] A mouth ulcer (also termed an oral ulcer, or a mucosal
ulcer) is an ulcer that occurs on the mucous membrane of the oral
cavity. More plainly, a mouth ulcer is a sore or open lesion in the
mouth. Mouth ulcers are very common, occurring in association with
many diseases and by many different mechanisms, but usually there
is no serious underlying cause. The two most common causes of oral
ulceration are local trauma (e.g., rubbing from a sharp edge on a
filling) and aphthous stomatitis ("canker sores"), a condition
characterized by recurrent formation of oral ulcers for largely
unknown reasons. Some consider ulcers on the lips or on the skin
around the mouth to be included under the general term oral
ulceration (e.g., an ulcer left by rupture of a blister caused by
herpes labialis, i.e., a cold sore). Mouth ulcers often cause pain
and discomfort, and may alter the person's choice of food while
healing occurs (e.g., avoiding acidic or spicy foods and
beverages). They may occur singly or multiple ulcers may occur at
the same time (a "crop" of ulcers). Once formed, the ulcer may be
maintained by inflammation and/or secondary infection. Rarely, a
mouth ulcer that does not heal for many weeks may be a sign of oral
cancer. Other causes include burns, chemical injury, or
infection.
[0143] A mucosal ulcer is an ulcer which specifically occurs on a
mucous membrane. An ulcer is a tissue defect which has penetrated
the epithelial-connective tissue border, with its base at a deep
level in the submucosa, or even within muscle or periosteum. An
ulcer is a deeper breech of the epithelium than an erosion or an
excoriation, and involves damage to both epithelium and lamina
propria. An erosion is a superficial breach of the epithelium, with
little damage to the underlying lamina propria. A mucosal erosion
is an erosion which specifically occurs on a mucous membrane. Only
the superficial epithelial cells of the epidermis or of the mucosa
are lost, and the lesion can reach the depth of the basement
membrane. Erosions heal without scar formation. Excoriation is a
term sometimes used to describe a breach of the epithelium which is
deeper than an erosion but shallower than an ulcer. This type of
lesion is tangential to the rete pegs and shows punctiform (small
pinhead spots) bleeding, caused by exposed capillary loops.
[0144] 2. Acute Wounds/Trauma
[0145] Physical trauma is a serious and body-altering physical
injury, such as the removal of a limb. Blunt force trauma, a type
of physical trauma caused by impact or other force applied from or
with a blunt object, whereas penetrating trauma is a type of
physical trauma in which the skin or tissues are pierced by an
object. Trauma can also be described as both unplanned, such as an
accident, or planned, in the case of surgery. Both can be
characterized by mild to severe tissue damage, blood loss and/or
shock, and both may lead to subsequent infection, including sepsis.
The present technology provides for treatment of trauma, including
both pre-treatment (in the case of a medical procedure) and
treatment after trauma injury has occurred.
[0146] As described in more detail elsewhere herein (and briefly
mentioned above), PTD-Smad7 has been shown to enhance wound healing
in a mouse skin model and a mucosal model. Application of PTD-Smad7
was effective through a topical route, which is desirable for wound
treatment. Although not intending to be bound by theory, it is
believed that PTD-Smad7 may act to treat or ameliorate wounds
through multiple routes, which may include one or more of reducing
inflammation, increasing cell proliferation (e.g., keratinocytes),
increasing cell migration (e.g., keratinocytes), or reducing
fibrosis (e.g., through modulation of collagen), among others. As
described briefly below, reduced inflammation could significantly
contribute to accelerated wound healing, optionally through reduced
angiogenesis and collagen production and/or reduced leukocyte
infiltration leading to reduction of cytokines and chemokines
normally released by leukocytes, which are angiogenic and
fibrogenic. Temporal treatment with Smad7 may allow early stage
angiogenesis and collagen production required for wound repair,
while preventing prolonged angiogenesis and collagen production.
These changes could potentially accelerate wound stromal remodeling
and prevent excessive scarring due to unresolved inflammation or
collagen overproduction. For surgical procedures (as well as
everyday injuries), particularly where the potential for scarring
is an issue, treatment with Smad7 may be beneficial.
[0147] Surgery.
[0148] Surgery uses operative manual and instrumental techniques on
a patient to investigate and/or treat a pathological condition such
as disease or injury, to help improve bodily function or
appearance, or sometimes for some other reason. The present
technology can address trauma resulting from surgeries, as defined
further below.
[0149] As a general rule, a procedure is considered surgical when
it involves cutting of a patient's tissues or closure of a
previously sustained wound. Other procedures that do not
necessarily fall under this rubric, such as angioplasty or
endoscopy, may be considered surgery if they involve common
surgical procedure or settings, such as use of a sterile
environment, anesthesia, antiseptic conditions, typical surgical
instruments, and suturing or stapling. All forms of surgery are
considered invasive procedures; so-called noninvasive surgery
usually refers to an excision that does not penetrate the structure
being addressed (e.g., laser ablation of the cornea) or to a
radiosurgical procedure (e.g., irradiation of a tumor). Surgery can
last from minutes to hours.
[0150] Surgical procedures are commonly categorized by urgency,
type of procedure, body system involved, degree of invasiveness,
and special instrumentation. Elective surgery is done to correct a
non-life-threatening condition, and is carried out at the patient's
request, subject to the surgeon's and the surgical facility's
availability. Emergency surgery is surgery which must be done
quickly to save life, limb, or functional capacity. Exploratory
surgery is performed to aid or confirm a diagnosis. Therapeutic
surgery treats a previously diagnosed condition.
[0151] Amputation involves cutting off a body part, usually a limb
or digit. Replantation involves reattaching a severed body part.
Reconstructive surgery involves reconstruction of an injured,
mutilated, or deformed part of the body. Cosmetic surgery is done
to improve the appearance of an otherwise normal structure.
Excision is the cutting out of an organ, tissue, or other body part
from the patient. Transplant surgery is the replacement of an organ
or body part by insertion of another from different human (or
animal) into the patient. Removing an organ or body part from a
live human or animal for use in transplant is also a type of
surgery.
[0152] When surgery is performed on one organ system or structure,
it may be classified by the organ, organ system or tissue involved.
Examples include cardiac surgery (performed on the heart),
gastrointestinal surgery (performed within the digestive tract and
its accessory organs), and orthopedic surgery (performed on bones
and/or muscles).
[0153] Minimally invasive surgery involves smaller outer
incision(s) to insert miniaturized instruments within a body cavity
or structure, as in laparoscopic surgery or angioplasty. By
contrast, an open surgical procedure requires a large incision to
access the area of interest. Laser surgery involves use of a laser
for cutting tissue instead of a scalpel or similar surgical
instruments. Microsurgery involves the use of an operating
microscope for the surgeon to see small structures. Robotic surgery
makes use of a surgical robot, such as Da Vinci or Zeus surgical
systems, to control the instrumentation under the direction of the
surgeon.
[0154] 3. Autoimmune/Inflammatory Disease
[0155] The present technology contemplates the treatment of a
variety of autoimmune and/or inflammatory disease states such as
spondyloarthropathy, ankylosing spondylitis, psoriatic arthritis,
reactive arthritis, enteropathic arthritis, ulcerative colitis,
Crohn's disease, irritable bowel disease, inflammatory bowel
disease, rheumatoid arthritis, juvenile rheumatoid arthritis,
familial Mediterranean fever, amyotrophic lateral sclerosis,
Sjogren's syndrome, early arthritis, viral arthritis, multiple
sclerosis, or psoriasis. The diagnosis and treatment of these
diseases are well documented in the literature.
[0156] In general, autoimmune diseases are associated with an
overactive immune response of a body against substances and tissues
normally present in the body, and not normally the focus of an
immune response. There are more than 80 types of autoimmune
diseases, some of which have similar symptoms, and they may arise
from a similar underlying cause. The classic sign of an autoimmune
disease is inflammation, which as disclosed herein is amenable to
treatment with Smad7 (optionally PTD-Smad7) compositions.
[0157] 4. Chemotherapy, Radiotherapy and Cytokine Therapy
Toxicity
[0158] Various forms of cancer therapy, including chemotherapy,
radiation, and cytokines, are associated with toxicity, sometimes
severe, in the cancer patient. The present technology seeks to
reduce this toxicity using the pharmaceutical compositions of the
present technology, thereby reducing or alleviating discomfort on
the part of the patient, as well as permitting higher doses of the
therapy.
[0159] As described at length throughout this disclosure, it has
been found that PTD-Smad7 acts to heal as well as to prevent oral
mucositis in a mouse model. PTD-Smad7 was shown to be more
effective than palifermin, the existing approved drug for
preventing oral mucositis, in direct comparisons.
[0160] Oral cancer, the 6.sup.th most common cancer worldwide, is a
subtype of head and neck cancer, and includes any cancerous tissue
growth located in the oral cavity. It may arise as a primary lesion
originating in any of the oral tissues, by metastasis from a
distant site of origin, or by extension from a neighboring anatomic
structure, such as the nasal cavity or the oral cancers may
originate in any of the tissues of the mouth, and may be of varied
histologic types: teratoma, adenocarcinoma derived from a major or
minor salivary gland, lymphoma from tonsillar or other lymphoid
tissue, or melanoma from the pigment-producing cells of the oral
mucosa. There are several types of oral cancers, but around 90% are
squamous cell carcinomas, originating in the tissues that line the
mouth and lips. Oral or mouth cancer most commonly involves the
tongue. It may also occur on the floor of the mouth, cheek lining,
gingiva (gums), lips, or palate (roof of the mouth). Most oral
cancers look very similar under the microscope and are called
squamous cell carcinoma. These are malignant and tend to spread
rapidly.
[0161] Over 80% of oral cancer patients are treated with radiation
therapy and at least 75% of these individuals will develop oral
mucositis. Oral mucositis is a chronic oral ulceration. This
disease frequently occurs in radiation-treated patients of all
cancer types, including but not limited to patients who are
radiation-treated for organ transplants (to eliminate rejection of
the transplants), and patients undergoing routine chemotherapy.
Severe oral mucositis is extremely painful and impairs food/liquid
intake, hence is often the most severe complication of cancer
therapy. Oral mucositis is a major factor in determining the
maximum dose possible of radiation and chemotherapy to the head and
neck region; it can significantly complicate cancer treatment,
extend hospitalization, decrease quality of life and increase
costs.
[0162] Currently, there is no established therapy to effectively
treat severe oral mucositis. To date, palifermin (KEPIVANCE.RTM.),
a recombinant protein of human keratinocyte growth factor (KGF), is
the only FDA approved drug for intravenous (i.v.) injections for
severe oral mucositis in bone-marrow transplant patients, and its
use in cancer patients remains to be determined. It is also used
for prevention of oral mucositis. Hence, this drug is available for
only 4% of the at-risk population. It also suffers from the need
for medical service providers due to the i.v. administration route.
Other potential therapies include topical rinses, such as viscous
2% lidocaine rinses, or baking soda and saline solutions, or a
cocktail solution, for instance BAX (lidocaine, diphenhyramine,
sorbitol and MYLANTA.RTM.). Other investigative or mucoprotective
adjuvant therapies include, but are not limited to, beta carotene,
tocopherol, laser irradiation, prophylactic brushing the oral
mucosa with silver-nitrate, misoprostol, leucovorin, systemic KGF,
pentoxifylline, allopurinol mouthwash, systemic sucralfate,
chlorhexidine gluconate, and cryotherapy.
[0163] Chemotherapy- and radiation-induced gut mucositis is an
inflammatory condition that arises as a result of the acute death
of rapidly dividing intestinal epithelial cells. Most
chemotherapeutic drugs used for treatment of solid tumors, alone,
in a combination of drugs, or with radiation, will result in the
death of a large number of intestinal epithelial cells. The
clinical manifestations of the ensuing mucositis include digestive
symptoms such as nausea and vomiting, serious diarrhea, acute
weight loss and wasting. This is fast becoming one of the limiting
factors for administering chemotherapy for many cancer patients.
The ability of Tat-Smad7 to protect intestinal epithelial cells
from either chemotherapeutic agents, radiation, or a combinations
of those, will significantly decrease the undesirable side effects
of cancer therapies, and enable more aggressive ways to treat the
disease with existing tools.
[0164] Bone marrow failure syndromes are a set of conditions that
develop when the hematopoietic stem cell compartment is compromised
and fails to give rise to normal cell types. Bone marrow failure
occurs as a result of inherited genetic abnormalities, exposure to
a noxious substance, such as toxins, chemicals or viruses. Although
the nature and identity of environmental factors that can lead to
the development of acquired bone marrow failure is still not
completely understood, a few factors have been linked to the
development of acquired bone marrow failure among military
personnel including exposure to mustard gas, ionizing radiation,
and infectious agents such as visceral leishmaniasis or African
trypanosomiasis. The best approach for management of bone marrow
failure syndromes is still the transplantation of hematopoietic
stem cells (HSCs), unless a sufficient number of the remaining
resident bone marrow HSCs can be spared from these stresses and
encouraged to repopulate the hematopoietic compartment. The
modulation of Smad 7, as described here, should enable for the
deliberate protection of the remaining resident HSCs in patients
that exhibit clinical signs consistent with bone marrow
failure.
[0165] 5. Cancer
[0166] TGF-.beta. and NF-.kappa.B activations are known to promote
cancer invasion and metastasis. Currently, TGF-.beta. inhibitors
are in clinical trials for treating metastatic cancer and
NF-.kappa.B inhibitors are used in cancer prevention. The
demonstrated effect of Smad7 on blocking both TGF-.beta. and
NF-.kappa.B signaling present the possibility that it is an even
stronger anti-cancer/anti-metastasis agent than other inhibitors
that inhibit only one of these two pathways. Smad7 has been shown
to prevent angiogenesis and fibrogenesis, and may therefore be
particularly useful in situations where the tumor needs to develop
a blood supply and/or stroma.
[0167] The cancer may be selected from the group consisting of
brain, lung, liver, spleen, kidney, lymph node, small intestine,
pancreas, blood cells, colon, stomach, breast, endometrium,
prostate, testicle, cervix, uterus, ovary, skin, head & neck,
esophagus, bone marrow and blood cancer. The cancers may be
metastatic or primary, recurrent or multi-drug resistant. In some
embodiments, the cancer is a solid tumor (organ tumor). Solid
tumors refer to a mass of cells that grow in organ systems and can
occur anywhere in the body. Two types of solid tumors include
epithelial tumors (carcinomas) that occur in the epithelial tissue
inside or outside an organ, and sarcomas (connective tissue tumors)
that occur in connective tissue such as, but not limited to,
muscles, tendons, fat, nerves and other connective tissues that
support, surround, or connect structures and organs in the body. In
some embodiments the cancer is a liquid tumor or cancer of the
blood, bone marrow, or lymph nodes. These tumors include, but are
not limited to, leukemia, lymphoma, and myeloma.
[0168] 6. Scarring, Fibrosis, and Aberrant Healing
[0169] In addition to accelerated re-epithelialization (e.g.,
through increasing cell proliferation and/or increasing cell
migration), Smad7 effects on wound stroma include one or more of
reducing inflammation, angiogenesis, or collagen production, among
others. Although not intending to be bound by theory these effects
may be mediated through reduction of NF-.kappa.B signaling
(evidenced by reduced p50), and blocking TGF-.beta. signaling
(evidenced by reduced pSmad2). As a result, reduced inflammation
could significantly contribute to accelerated wound healing,
optionally through reduced angiogenesis and collagen production
and/or reduced leukocyte infiltration leading to reduction of
cytokines and chemokines normally released by leukocytes, which are
angiogenic and fibrogenic. Temporal treatment with Smad7 may allow
early stage angiogenesis and collagen production required for wound
repair, while preventing prolonged angiogenesis and collagen
production. These changes could potentially accelerate wound
stromal remodeling and prevent excessive scarring due to unresolved
inflammation or collagen overproduction.
[0170] 7. Stomatitis
[0171] Stomatitis is an inflammation of the mucous lining of any of
the structures in the mouth, which may involve the cheeks, gums,
tongue, lips, throat, and roof or floor of the mouth. The
inflammation can be caused by conditions in the mouth itself, such
as poor oral hygiene, dietary protein deficiency, poorly fitted
dentures, or from mouth burns from hot food or drinks, toxic
plants, or by conditions that affect the entire body, such as
medications, allergic reactions, radiation therapy, or infections.
Severe iron deficiency anemia can lead to stomatitis. Iron is
necessary for the upregulation of transcriptional elements for cell
replication and repair. Lack of iron can cause the genetic
downregulation of these elements, leading to ineffective repair and
regeneration of epithelial cells, especially in the mouth and lips.
This condition is also prevalent in people who have a deficiency in
vitamin B2 (Riboflavin), B3 (Niacin), B6 (Pyridoxine), B9 (folic
acid) or B12 (cobalamine). When it also involves an inflammation of
the gingiva (gums), it is called gingivostomatitis. It may also be
seen in ariboflavinosis (riboflavin deficiency) or neutropenia.
[0172] Irritation and fissuring in the corners of the lips is
termed angular stomatitis or angular cheilitis. In children,
angular stomatitis is a frequent cause is repeated lip-licking and
in adults it may be a sign of underlying iron deficiency anemia, or
vitamin B deficiencies (e.g., B2-riboflavin, B9-folate or
B12-cobalamin), which in turn may be evidence of poor diets or
malnutrition (e.g., celiac disease). Also, angular cheilitis can be
caused by a patient's jaws at rest being "overclosed" due to
edentulousness or tooth wear, causing the jaws to come to rest
closer together than if the complete/unaffected dentition were
present. This causes skin folds around the angle of the mouth which
are kept moist by saliva which in turn favours infection; mostly by
Candida albicans or similar species. Treatment usually involves the
administration of topical nystatin or similar antifungal agents.
Another treatment can be to correct the jaw relationship with
dental treatment (e.g., dentures or occlusal adjustment).
[0173] Migratory stomatitis is a condition in which extensive areas
in the oral cavity mucosa are affected by annular atrophic red
lesions that are surrounded by a thin white rim. This is a
relatively uncommon form of the geographic tongue condition, that,
as opposed to migratory stomatitis, is confined to the dorsal and
lateral aspects of the tongue mucosa only.
[0174] 8. Proctitis
[0175] Proctitis is inflammation of the lining of the rectum, the
lower end of the large intestine leading to the anus. With
proctitis, inflammation of the rectal lining-called the rectal
mucosa--is uncomfortable and sometimes painful. The condition may
lead to bleeding or mucous discharge from the rectum, among other
symptoms. Some causes of proctitis include, but are not limited to:
sexually transmitted diseases (STDs), such as those that can be
transmitted during anal sex (e.g., gonorrhea, chlamydia, syphilis,
and herpes); non-STD infections from, for example, food borne
bacteria (e.g., Salmonella and Shigella); anorectal trauma from,
for example, anal sex or the insertion of objects or substances
into the rectum (e.g., chemicals from enemas); ulcerative colitis
and Crohn's disease or other inflammatory bowel diseases, may cause
ulcers (e.g., sores) in the inner lining of the colon and rectum;
radiation therapy, particularly of the pelvic area (e.g., rectal,
ovarian, or prostate cancer) which may lead to rectal bleeding;
antibiotics which lead to a loss of commensal bacteria allowing
harmful bacteria (e.g., Clostridium difficile) to cause
disease.
[0176] 9. Formulations and Routes of Administration
[0177] Where clinical applications are contemplated, it will be
necessary to prepare pharmaceutical compositions--proteins,
expression vectors, virus stocks, proteins and drugs--in a form
appropriate for the intended application. Generally, this will
entail preparing compositions that are essentially free of
pyrogens, as well as other impurities that could be harmful to
humans or animals.
[0178] PTD-Smad7 (and truncated variants) were purified extensively
prior to use in animal models. PTD-Smad7 (and truncated versions)
were prepared for topical and trans-mucosal application using a
mixture of glycerol and PBS.
[0179] One will generally desire to employ appropriate salts and
buffers to render delivery vectors stable and allow for uptake by
target cells. Buffers also will be employed when recombinant cells
are introduced into a patient. Aqueous compositions of the present
technology comprise an effective amount of the vector to cells,
dissolved or dispersed in a pharmaceutically acceptable carrier or
aqueous medium. Such compositions also are referred to as inocula.
The phrase "pharmaceutically or pharmacologically acceptable" refer
to molecular entities and compositions that do not produce adverse,
allergic, or other untoward reactions when administered to an
animal or a human. As used herein, "pharmaceutically acceptable
carrier" includes any and all solvents, dispersion media, coatings,
antibacterial and antifungal agents, isotonic and absorption
delaying agents and the like. The use of such media and agents for
pharmaceutically active substances is well known in the art. Except
insofar as any conventional media or agent is incompatible with the
vectors or cells of the present technology, its use in therapeutic
compositions is contemplated. Supplementary active ingredients also
can be incorporated into the compositions.
[0180] The active compositions of the present technology may
include classic pharmaceutical preparations. Administration of
these compositions according to the present technology will be via
any common route so long as the target tissue is available via that
route. Such routes of administration may include oral parenteral
(including intravenous, intramuscular, subcutaneous, intradermal,
intra-articular, intra-synovial, intrathecal, intra-arterial,
intracardiac, subcutaneous, intraorbital, intracapsular,
intraspinal, intrastemal, and transdermal), nasal, buccal,
urethral, rectal, vaginal, mucosal, dermal, or topical (including
dermal, buccal, and sublingual). Alternatively, administration may
be by orthotopic, intradermal, subcutaneous, intramuscular,
intraperitoneal or intravenous injection. Such compositions would
normally be administered as pharmaceutically acceptable
compositions, described supra. Of particular interest is direct
intratumoral administration, perfusion of a tumor, or
administration local or regional to a tumor, for example, in the
local or regional vasculature or lymphatic system, or in a resected
tumor bed. Administration can also be via nasal spray, surgical
implant, internal surgical paint, infusion pump, or via catheter,
stent, balloon or other delivery device. The most useful and/or
beneficial mode of administration can vary, especially depending
upon the condition of the recipient and the disorder being
treated.
[0181] Solutions of the active compounds as free base or
pharmacologically acceptable salts can be prepared in water
suitably mixed with a surfactant, such as hydroxypropylcellulose.
Dispersions can also be prepared in glycerol, liquid polyethylene
glycols, and mixtures thereof and in oils. Under ordinary
conditions of storage and use, these preparations contain a
preservative to prevent the growth of microorganisms.
[0182] The pharmaceutical forms suitable for injectable use include
sterile aqueous solutions or dispersions and sterile powders for
the extemporaneous preparation of sterile injectable solutions or
dispersions. In all cases, the form must be sterile and must be
fluid to the extent that easy syringability exists. It must be
stable under the conditions of manufacture and storage and must be
preserved against the contaminating action of microorganisms, such
as bacteria and fungi. The carrier can be a solvent or dispersion
medium containing, for example, water, ethanol, polyol (e.g.,
glycerol, propylene glycol, and liquid polyethylene glycol, and the
like), suitable mixtures thereof, and vegetable oils. The proper
fluidity can be maintained, for example, by the use of a coating,
such as lecithin, by the maintenance of the required particle size
in the case of dispersion and by the use of surfactants. The
prevention of the action of microorganisms can be brought about by
various antibacterial an antifungal agents, for example, parabens,
chlorobutanol, phenol, sorbic acid, thimerosal, and the like. In
many cases, it will be preferable to include isotonic agents, for
example, sugars or sodium chloride. Prolonged absorption of the
injectable compositions can be brought about by the use in the
compositions of agents delaying absorption, for example, aluminum
monostearate and gelatin.
[0183] Sterile injectable solutions are prepared by incorporating
the active compounds in the required amount in the appropriate
solvent with various of the other ingredients enumerated above, as
required, followed by filtered sterilization. Generally,
dispersions are prepared by incorporating the various sterilized
active ingredients into a sterile vehicle which contains the basic
dispersion medium and the required other ingredients from those
enumerated above. In the case of sterile powders for the
preparation of sterile injectable solutions, the preferred methods
of preparation are vacuum-drying and freeze-drying techniques which
yield a powder of the active ingredient plus any additional desired
ingredient from a previously sterile-filtered solution thereof.
[0184] As used herein, "pharmaceutically acceptable carrier"
includes any and all solvents, dispersion media, coatings,
antibacterial and antifungal agents, isotonic and absorption
delaying agents and the like. The use of such media and agents for
pharmaceutical active substances is well known in the art. Except
insofar as any conventional media or agent is incompatible with the
active ingredient, its use in the therapeutic compositions is
contemplated. Supplementary active ingredients can also be
incorporated into the compositions.
[0185] The compositions of the present technology may be formulated
in a neutral or salt form. Pharmaceutically-acceptable salts
include the acid addition salts (formed with the free amino groups
of the protein) and which are formed with inorganic acids such as,
for example, hydrochloric or phosphoric acids, or such organic
acids as acetic, oxalic, tartaric, mandelic, and the like. Salts
formed with the free carboxyl groups can also be derived from
inorganic bases such as, for example, sodium, potassium, ammonium,
calcium, or ferric hydroxides, and such organic bases as
isopropylamine, trimethylamine, histidine, procaine and the
like.
[0186] The formulations are easily administered in a variety of
dosage forms. Some variation in dosage will necessarily occur
depending on the condition of the subject being treated. The person
responsible for administration will, in any event, determine the
appropriate dose for the individual subject. Moreover, for human
administration, preparations should meet sterility, pyrogenicity,
general safety and purity standards as required by FDA Office of
Biologics standards.
[0187] For oral administration the polypeptides of the present
technology may be incorporated with excipients and used in the form
of non-ingestible mouthwashes and dentifrices. It is anticipated
that virtually any pill or capsule type known to one of skill in
the art including, e.g., coated, and time delay, slow release,
etc., may be used with the present technology. A mouthwash may be
prepared incorporating the active ingredient in the required amount
in an appropriate solvent, such as a sodium borate solution
(Dobell's Solution). Alternatively, the active ingredient may be
incorporated into an antiseptic wash containing sodium borate,
glycerin and potassium bicarbonate. The active ingredient may also
be dispersed in dentifrices, including: gels, pastes, creams,
powders and slurries. The active ingredient may be added in a
therapeutically effective amount to a paste dentifrice that may
include water, binders, abrasives, flavoring agents, foaming
agents, and humectants.
[0188] Pharmaceutical compositions suitable for oral dosage may
take various forms, such as tablets, capsules, caplets, and wafers
(including rapidly dissolving or effervescing), each containing a
predetermined amount of the active agent. The compositions may also
be in the form of a powder or granules, a solution or suspension in
an aqueous or non-aqueous liquid, and as a liquid emulsion
(oil-in-water and water-in-oil). The active agents may also be
delivered as a bolus, electuary, or paste. It is generally
understood that methods of preparations of the above dosage forms
are generally known in the art, and any such method would be
suitable for the preparation of the respective dosage forms for use
in delivery of the compositions.
[0189] In one embodiment, an active agent compound may be
administered orally in combination with a pharmaceutically
acceptable vehicle such as an inert diluent or an edible carrier.
Oral compositions may be enclosed in hard or soft shell gelatin
capsules, may be compressed into tablets or may be incorporated
directly with the food of the patient's diet. The percentage of the
composition and preparations may be varied; however, the amount of
substance in such therapeutically useful compositions is preferably
such that an effective dosage level will be obtained.
[0190] Hard capsules containing the active agent compounds may be
made using a physiologically degradable composition, such as
gelatin. Such hard capsules comprise the compound, and may further
comprise additional ingredients including, for example, an inert
solid diluent such as calcium carbonate, calcium phosphate, or
kaolin. Soft gelatin capsules containing the compound may be made
using a physiologically degradable composition, such as gelatin.
Such soft capsules comprise the compound, which may be mixed with
water or an oil medium such as peanut oil, liquid paraffin, or
olive oil.
[0191] Sublingual tablets are designed to dissolve very rapidly.
Examples of such compositions include ergotamine tartrate,
isosorbide dinitrate, and isoproterenol HCL. The compositions of
these tablets contain, in addition to the drug, various soluble
excipients, such as lactose, powdered sucrose, dextrose, and
mannitol. The solid dosage forms of the present technology may
optionally be coated, and examples of suitable coating materials
include, but are not limited to, cellulose polymers (such as
cellulose acetate phthalate, hydroxypropyl cellulose, hydroxypropyl
methylcellulose, hydroxypropyl methylcellulose phthalate, and
hydroxypropyl methylcellulose acetate succinate), polyvinyl acetate
phthalate, acrylic acid polymers and copolymers, and methacrylic
resins (such as those commercially available under the trade name
EUDRAGIT.RTM.), zein, shellac, and polysaccharides.
[0192] Powdered and granular compositions of a pharmaceutical
preparation may be prepared using known methods. Such compositions
may be administered directly to a patient or used in the
preparation of further dosage forms, such as to form tablets, fill
capsules, or prepare an aqueous or oily suspension or solution by
addition of an aqueous or oily vehicle thereto. Each of these
compositions may further comprise one or more additives, such as
dispersing or wetting agents, suspending agents, and preservatives.
Additional excipients (e.g., fillers, sweeteners, flavoring, or
coloring agents) may also be included in these compositions.
[0193] Liquid compositions of pharmaceutical compositions which are
suitable for oral administration may be prepared, packaged, and
sold either in liquid form or in the form of a dry product intended
for reconstitution with water or another suitable vehicle prior to
use.
[0194] A tablet containing one or more active agent compounds
described herein may be manufactured by any standard process
readily known to one of skill in the art, such as, for example, by
compression or molding, optionally with one or more adjuvant or
accessory ingredient. The tablets may optionally be coated or
scored and may be formulated so as to provide slow or controlled
release of the active agents.
[0195] Solid dosage forms may be formulated so as to provide a
delayed release of the active agents, such as by application of a
coating. Delayed release coatings are known in the art, and dosage
forms containing such may be prepared by any known suitable method.
Such methods generally include that, after preparation of the solid
dosage form (e.g., a tablet or caplet), a delayed release coating
composition is applied. Application can be by methods, such as
airless spraying, fluidized bed coating, use of a coating pan, or
the like. Materials for use as a delayed release coating can be
polymeric in nature, such as cellulosic material (e.g., cellulose
butyrate phthalate, hydroxypropyl methylcellulose phthalate, and
carboxymethyl ethylcellulose), and polymers and copolymers of
acrylic acid, methacrylic acid, and esters thereof.
[0196] Solid dosage forms according to the present technology may
also be sustained release (i.e., releasing the active agents over a
prolonged period of time), and may or may not also be delayed
release. Sustained release compositions are known in the art and
are generally prepared by dispersing a drug within a matrix of a
gradually degradable or hydrolyzable material, such as an insoluble
plastic, a hydrophilic polymer, or a fatty compound. Alternatively,
a solid dosage form may be coated with such a material.
[0197] Compositions for parenteral administration include aqueous
and non-aqueous sterile injection solutions, which may further
contain additional agents, such as antioxidants, buffers,
bacteriostats, and solutes, which render the compositions isotonic
with the blood of the intended recipient. The compositions may
include aqueous and non-aqueous sterile suspensions, which contain
suspending agents and thickening agents. Such compositions for
parenteral administration may be presented in unit-dose or
multi-dose containers, such as, for example, sealed ampoules and
vials, and may be stores in a freeze-dried (lyophilized) condition
requiring only the addition of the sterile liquid carrier, for
example, water (for injection), immediately prior to use.
Extemporaneous injection solutions and suspensions may be prepared
from sterile powders, granules, and tablets of the kind previously
described.
[0198] Compositions for rectal delivery include rectal
suppositories, creams, ointments, and liquids. Suppositories may be
presented as the active agents in combination with a carrier
generally known in the art, such as polyethylene glycol. Such
dosage forms may be designed to disintegrate rapidly or over an
extended period of time, and the time to complete disintegration
can range from a short time, such as about 10 minutes, to an
extended period of time, such as about 6 hours.
[0199] Topical compositions may be in any form suitable and readily
known in the art for delivery of active agents to the body surface,
including dermally, buccally, and sublingually. Typical examples of
topical compositions include ointments, creams, gels, pastes, and
solutions. Compositions for administration in the mouth include
lozenges.
[0200] In accordance with these embodiments, oral (topical,
mucosal, and/or dermal) delivery materials can also include creams,
salves, ointments, patches, liposomes, nanoparticles,
microparticles, timed-release formulations and other materials
known in the art for delivery to the oral cavity, mucosa, and/or to
the skin of a subject for treatment and/or prevention of a
condition disclosed herein. Certain embodiments concern the use of
a biodegradable oral (topical, mucosal, and/or dermal) patch
delivery system or gelatinous material. These compositions can be a
liquid formulation or a pharmaceutically acceptable delivery system
treated with a formulation of these compositions, and may also
include activator/inducers.
[0201] The compositions for use in the methods of the present
technology may also be administered transdermally, wherein the
active agents are incorporated into a laminated structure
(generally referred to as a "patch") that is adapted to remain in
intimate contact with the epidermis of the recipient for a
prolonged period of time. Typically, such patches are available as
single layer "drug-in-adhesive" patches or as multi-layer patches
where the active agents are contained in a layer separate from the
adhesive layer. Both types of patches also generally contain a
backing layer and a liner that is removed prior to attachment to
the recipient's skin. Transdermal drug delivery patches may also be
comprised of a reservoir underlying the backing layer that is
separated from the skin of the recipient by a semi-permeable
membrane and adhesive layer. Transdermal drug delivery may occur
through passive diffusion, electrotransport, or iontophoresis.
[0202] In certain embodiments, a patch contemplated herein may be a
slowly dissolving or a time-released patch. In accordance with
these embodiments, a slowly dissolving patch can be an alginate
patch. In certain examples, a patch may contain a detectible
indicator dye or agent such as a fluorescent agent. In other
embodiments, a tag (e.g., detectible tag such as a biotin or
fluorescently tagged agent) can be associated with a treatment
molecule in order to detect the molecule after delivery to the
subject. In certain embodiments, one or more oral delivery patches
or other treatment contemplated herein may be administered to a
subject three times daily, twice daily, once a day, every other
day, weekly, and the like, depending on the need of the subject as
assessed by a health professional. Patches contemplated herein may
be oral-biodegradable patches or patches for exterior use that may
or may not degrade. Patches contemplated herein may be 1 mm, 2 mm,
3 mm, 4 mm to 5 mm in size or more depending on need. In addition,
skin patches are contemplated herein for use for example in a
subject suffering from psoriasis. In treating psoriasis and chronic
wounds, Smad7 can be delivered topically using vehicles such as
glycerol, carboxymethycellulose. It can also use transdermal system
(e.g., commercially available from 3M) for delivery. Subcutaneous
injection into the lesion (in normal saline or PBS) can also be
used.
[0203] In some embodiments, compositions may include short-term,
rapid-onset, rapid-offset, controlled release, sustained release,
delayed release, and pulsatile release compositions, providing the
compositions achieve administration of a compound as described
herein. See Remington's Pharmaceutical Sciences (18th ed.; Mack
Publishing Company, Eaton, Pa., 1990), herein incorporated by
reference in its entirety.
[0204] In certain embodiments, the compounds and compositions
disclosed herein can be delivered via a medical device. Such
delivery can generally be via any insertable or implantable medical
device, including, but not limited to stents, catheters, balloon
catheters, shunts, or coils. In one embodiment, the present
technology provides medical devices, such as stents, the surface of
which is coated with a compound or composition as described herein.
The medical device of this technology can be used, for example, in
any application for treating, preventing, or otherwise affecting
the course of a disease or condition, such as those disclosed
herein.
[0205] It is contemplated that any molecular biology, cellular
biology or biochemical technique known in the art may be used to
generate and/or test treatments provided herein. In addition,
protein chemistry techniques are contemplated to assess utility of
treatments in model systems developed herein (e.g., mouse model
system).
[0206] 10. Combination Therapies
[0207] It is common in many fields of medicine to treat a disease
with multiple therapeutic modalities, often called "combination
therapies." Many of the diseases described herein (e.g.,
inflammatory disease and cancer) are no exception. In some
embodiments, to treat inflammatory disorders using the methods and
compositions of the present technology, one would contact a target
cell, organ or subject with a Smad7 protein, expression construct
or activator and at least one other therapy. These therapies would
be provided in a combined amount effective to achieve a reduction
in one or more disease parameter. This process may involve
contacting the cells/subjects with the both agents/therapies at the
same time, e.g., using a single composition or pharmacological
formulation that includes both agents, or by contacting the
cell/subject with two distinct compositions or formulations, at the
same time, wherein one composition includes the Smad7 agent and the
other includes the other agent.
[0208] Alternatively, the Smad7 agent may precede or follow the
other treatment by intervals ranging from minutes to weeks. One
would generally ensure that a significant period of time did not
expire between the time of each delivery, such that the therapies
would still be able to exert an advantageously combined effect on
the cell/subject. In such instances, it is contemplated that one
would contact the cell with both modalities within about 12-24
hours of each other, within about 6-12 hours of each other, or with
a delay time of only about 12 hours. In some situations, it may be
desirable to extend the time period for treatment significantly;
however, where several days (2, 3, 4, 5, 6 or 7) to several weeks
(1, 2, 3, 4, 5, 6, 7 or 8) lapse between the respective
administrations.
[0209] It also is conceivable that more than one administration of
either the Smad7 agent or the other therapy will be desired.
Various combinations may be employed, where the Smad7 agent is "A,"
and the other therapy is "B," as exemplified below:
[0210] A/B/A B/A/B B/B/A A/A/B B/A/A A/B/B B/B/B/A B/B/A/B
[0211] A/A/B/B A/B/A/B A/B/B/A B/B/A/A B/A/B/A B/A/A/B B/B/B/A
[0212] A/A/A/B B/A/A/A A/B/A/A A/A/B/A A/B/B/B B/A/B/B B/B/A/B
[0213] Other combinations are provided. Other agents suitable for
use in a combined therapy against an inflammatory disorder include
steroids, glucocorticoids, non-steriodal anti-inflammatory drugs
(NSAIDS; including COX-1 and COX-2 inhibitors), aspirin, ibuprofen,
and naproxen. Analgesics are commonly associated with
anti-inflammatory drugs but which have no anti-inflammatory
effects. An example is paracetamol, called acetaminophen in the
U.S. and sold under the brand name of Tylenol. As opposed to
NSAIDS, which reduce pain and inflammation by inhibiting COX
enzymes, paracetamol has recently been shown to block the reuptake
of endocannabinoids, which only reduces pain, likely explaining why
it has minimal effect on inflammation. A particular agent for
combination use is an anti-TGF-.beta. antibody.
[0214] The skilled artisan is directed to Remington's
Pharmaceutical Sciences, 15th Edition, chapter 33, in particular,
pages 624-652, 1990. Some variation in dosage will necessarily
occur depending on the condition of the subject being treated. The
person responsible for administration will, in any event, determine
the appropriate dose for the individual subject. Moreover, for
human administration, preparations should meet sterility,
pyrogenicity, general safety and purity standards as required by
FDA Office of Biologics standards.
[0215] It also should be pointed out that any of the foregoing
therapies may prove useful by themselves in treating
inflammation.
[0216] As discussed above, the present technology has particular
relevance to the treatment of DNA damage and/or inflammation
resulting from certain anti-cancer therapies, and for the treatment
of cancer. Thus, in particular, the present technology may be
applied as a combination with cancer therapies. This process may
involve contacting the cells, organ, or patient with the
agents/therapies at the same time, including by contacting the
cells, organ or patient with a single composition or
pharmacological formulation that includes both agents, or with two
distinct compositions or formulations at the same time, wherein one
composition includes the Smad7 agent and the other includes the
other agent. Alternatively, analogous to the chart set forth above,
the compositions can be delivered at different times, including
repeated doses of one or both agents.
[0217] Agents or factors suitable for use in a combined therapy
include any chemical compound or treatment method that induces DNA
damage when applied to a cell. Such agents and factors include
radiation and waves that induce DNA damage such as, irradiation,
microwaves, electronic emissions, and the like. A variety of
chemical compounds, also described as "chemotherapeutic" or
"genotoxic agents," are intended to be of use in the combined
treatment methods disclosed herein. In treating cancer according to
the present technology, one would contact the tumor cells with an
agent in addition to the expression construct. This may be achieved
by irradiating the localized tumor site; alternatively, the tumor
cells may be contacted with the agent by administering to the
subject a therapeutically effective amount of a pharmaceutical
composition.
[0218] Various classes of chemotherapeutic agents are provided for
use with in combination with peptides of the present technology.
For example, selective estrogen receptor antagonists ("SERMs"),
such as Tamoxifen, 4-hydroxy Tamoxifen (Afimoxfene), Falsodex,
Raloxifene, Bazedoxifene, Clomifene, Femarelle, Lasofoxifene,
Ormeloxifene, and Toremifene.
[0219] Chemotherapeutic agents contemplated to be of use, include,
e.g., camptothecin, actinomycin-D, mitomycin C. The present
technology also encompasses the use of a combination of one or more
DNA damaging agents, whether radiation-based or actual compounds,
such as the use of X-rays with cisplatin or the use of cisplatin
with etoposide. The agent may be prepared and used as a combined
therapeutic composition, or kit, by combining it with a MUC1
peptide, as described above.
[0220] Heat shock protein 90 is a regulatory protein found in many
eukaryotic cells. HSP90 inhibitors have been shown to be useful in
the treatment of cancer. Such inhibitors include Geldanamycin,
17-(Allylamino)-17-demethoxygeldanamycin, PU-H71 and Rifabutin.
[0221] Agents that directly cross-link DNA or form adducts are also
envisaged. Agents such as cisplatin, and other DNA alkylating
agents may be used. Cisplatin has been widely used to treat cancer,
with efficacious doses used in clinical applications of 20
mg/m.sup.2 for 5 days every three weeks for a total of three
courses. Cisplatin is not absorbed orally and must therefore be
delivered via injection intravenously, subcutaneously,
intratumorally or intraperitoneally.
[0222] Agents that damage DNA also include compounds that interfere
with DNA replication, mitosis and chromosomal segregation. Such
chemotherapeutic compounds include adriamycin, also known as
doxorubicin, etoposide, verapamil, podophyllotoxin, and the like.
Widely used in a clinical setting for the treatment of neoplasms,
these compounds are administered through bolus injections
intravenously at doses ranging from 25-75 mg/m.sup.2 at 21 day
intervals for doxorubicin, to 35-50 mg/m.sup.2 for etoposide
intravenously or double the intravenous dose orally. Microtubule
inhibitors, such as taxanes, also are contemplated. These molecules
are diterpenes produced by the plants of the genus Taxus, and
include paclitaxel and docetaxel.
[0223] Epidermal growth factor receptor inhibitors, such as Iressa,
mTOR, the mammalian target of rapamycin, also known as
FK506-binding protein 12-rapamycin associated protein 1 (FRAP1) is
a serine/threonine protein kinase that regulates cell growth, cell
proliferation, cell motility, cell survival, protein synthesis, and
transcription. Rapamycin and analogs thereof ("rapalogs") are
therefore provided for use in combination cancer therapy in
accordance with the present technology.
[0224] Another possible combination therapy with the peptides
claimed herein is TNF-.alpha. (tumor necrosis factor-alpha), a
cytokine involved in systemic inflammation and a member of a group
of cytokines that stimulate the acute phase reaction. The primary
role of TNF is in the regulation of immune cells. TNF is also able
to induce apoptotic cell death, to induce inflammation, and to
inhibit tumorigenesis and viral replication.
[0225] Agents that disrupt the synthesis and fidelity of nucleic
acid precursors and subunits also lead to DNA damage. As such a
number of nucleic acid precursors have been developed. Particularly
useful are agents that have undergone extensive testing and are
readily available. As such, agents such as 5-fluorouracil (5-FU),
are preferentially used by neoplastic tissue, making this agent
particularly useful for targeting to neoplastic cells. Although
quite toxic, 5-FU, is applicable in a wide range of carriers,
including topical, however intravenous administration with doses
ranging from 3 to 15 mg/kg/day being commonly used.
[0226] Other factors that cause DNA damage and have been used
extensively include what are commonly known as .gamma.-rays,
x-rays, and/or the directed delivery of radioisotopes to tumor
cells. Other forms of DNA damaging factors are also contemplated
such as microwaves and UV-irradiation. It is most likely that all
of these factors effect a broad range of damage DNA, on the
precursors of DNA, the replication and repair of DNA, and the
assembly and maintenance of chromosomes. Dosage ranges for x-rays
range from daily doses of 50 to 200 roentgens for prolonged periods
of time (3 to 4 weeks), to single doses of 2000 to 6000 roentgens.
Dosage ranges for radioisotopes vary widely, and depend on the
half-life of the isotope, the strength and type of radiation
emitted, and the uptake by the neoplastic cells.
[0227] The skilled artisan is directed to Remington's
Pharmaceutical Sciences, 15th Edition, chapter 33, in particular
pages 624-652. Some variation in dosage will necessarily occur
depending on the condition of the subject being treated. The person
responsible for administration will, in any event, determine the
appropriate dose for the individual subject. Moreover, for human
administration, preparations should meet sterility, pyrogenicity,
general safety and purity standards as required by FDA Office of
Biologics standards.
[0228] In addition to combining Smad7 therapies with chemo- and
radiotherapies, it also is contemplated that combination with
immunotherapy, hormone therapy, toxin therapy and surgery. In
particular, one may employ targeted therapies such as AVASTIN.RTM.,
ERBITUX.RTM., GLEEVEC.RTM., HERCEPTIN.RTM., and RITUXAN.RTM..
[0229] In other embodiments, to assess the roles and mechanisms of
Smad7 within the context of oral mucositis, "gene-switch"
transgenic mouse models were developed to allow control of the
level and duration of Smad7 transgene expression specifically in
oral epithelia. In accordance with these embodiments, these models
may be used to test other genes or downstream molecules for their
effects on oral epithelia and oral mucosa. Thus, these models can
be used for, but are not limited to, further analysis of oral wound
healing biology and testing therapeutic approaches to oral wound
healing. Molecular Smad7 targets identified in these studies can
provide additional therapeutic targets for subjects suffering from
oral mucositis. Models and resources developed herein can provide
unique tools for analytical studies to identify biomarkers and
therapeutic targets related to Smad7 overexpression and control,
for example, downstream molecules turned on or bound by Smad7 can
be identified as additional therapeutic targets for example, to
treat oral mucositis, psoriasis and other conditions aggravated by
TGF-.beta. activities and NF-.kappa.B activities.
D. Kits
[0230] In certain embodiments, a kit provided herein may include
compositions discussed above for treating a subject having a
condition provided herein, such as but not limited to oral
mucositis, psoriasis, or wound healing. The kits can include one or
more containers containing the therapeutic Smad7 compositions of
the present technology. Any of the kits will generally include at
least one vial, test tube, flask, bottle, syringe or other
container, into which compositions may be preferably and/or
suitably aliquoted. Kits herein may also include a kit for
assessing biological targets that contribute to a condition
provided herein.
E. Methods of Predicting or Evaluating Responses
[0231] Also provided are methods for predicting and/or evaluating a
response to treatment with Smad7 using by assessing the level of
expression of one or more markers associated with exposure to
Smad7. Such markers may include, but are not limited to, Rac1 for
cell migration, NF-.kappa.B for inflammation, and TGF-.beta. for
growth arrest and inflammation. As is discussed in the Examples,
methods for detection of and/or changes in the levels of one or
more markers associated with Smad7 activity are provided and/or
known in the art. In some embodiments, the level of expression of
one or more of the Smad7 markers in a subject may be assessed, and
based on the level detected, a decision may be made to treat (or to
continue or discontinue treatment) with Smad7, or to employ an
alternate treatment.
[0232] The term "detection of" as used herein refers to the ability
to measure the presence or absence of a marker at some repeatable
and controlled level. Typically, detection is performed over
background values, which may include the noise (or detection
limits) inherent in the testing system. As such, there is typically
a "lower limit" of detection associated with an assay, and in order
to be detected, a change may need to be above a certain cut-off
level, for example. Determination of such limits is well-known in
the art.
[0233] In some embodiments, detection is performed as compared to
controls, which may include, but are not limited to, a comparison
with data from normal subjects and/or comparable normal tissue (in
the same or different subjects) absent the disease or disorder
present in the subject (or the specific tissue of the subject
tested). In some embodiments, the comparison may be between levels
detected at a variety of time intervals (and/or locations) in a
patient. In some embodiments, the detection needs to be
statistically significant as compared to background or control
levels; the ability to assess significance is well-known in the
art, and exemplified in the Examples.
[0234] The term "changes in the levels" as used herein refers to a
detectable change from a control or background level, and or a
previously detected level. In some embodiments, the change is an
increase as compared to another level, and in some embodiments the
change is a decrease as compared to another level. In some
embodiments, the detectable change (increase or decrease) is
statistically significant. In some embodiments, such changes can be
assessed quantitatively as at least about a 5%, 10%, 25%, 50%,
100%, 200%, 500% or greater change, and/or about a 5-10%, 10-25%,
10-50%, 25-50%, 50-75%, 50-100%, 100-150%, 100-200%, 200-300%,
300-500%, or 500-1000% change.
F. Method of Screening for Additional Biologically Active
Fragments
[0235] In another aspect, methods of screening for additional
biologically active fragments (including, but not limited to
truncations) of Smad7 are contemplated. In some embodiments,
biological activity may be assessed using one of the methods
described herein, including those described below in Examples 5 and
8. Some of the biological activities that can be assessed include,
but are not limited to, increasing cell proliferation, reducing or
inhibiting cell death, reducing excessive inflammation, preventing
DNA damage, and/or increasing cell migration, as well as animal
models treating or preventing one or more disease or disorders in
which such treatment would be helpful as further discussed herein.
Such activities can be assessed using one or more assays including,
but not limited to, the ability to block phosphorylation of Smad2
and/or nuclear translocation of the NF-.kappa.B p50 subunit,
increase cell proliferation, reduce apoptosis and/or
radiation-induced DNA damage, reduce inflammation and/or
angiogenesis, promote healing in oral mucositis, surgical wounds,
diabetes wounds, and/or wounds associated with chronic inflammation
in mice and other laboratory models. Some specific examples
include, but are not limited to, immunofluorescence (IF),
immunohistochemistry (IHC), and TUNEL assay for apoptosis.
[0236] In some embodiments, biologically active fragments are those
that are selected to include one or more or all of the activities
described herein. In some embodiments, biologically active
fragments selected to include only or primarily 1, only or
primarily 2, only or primarily 3, only or primarily 4, or only or
primarily 5 of the activities described herein. In some
embodiments, biologically active fragments selected to exclude only
or primarily 1, only or primarily 2, only or primarily 3, only or
primarily 4, or only or primarily 5 of the activities described
herein. In some embodiments, the biologically active fragments are
selected to include or to exclude a specific subset of the
activities described herein. For instance, increased proliferation
and migration may be sufficient for treating diabetic wounds,
whereas anti-inflammation is needed in chronic inflammatory wounds.
Reduced apoptosis and DNA damage activities are needed for treating
oral mucositis but not for treating surgical wounds
[0237] The term "primarily includes" as used herein refers to
fragments in which although some level of other biological activity
may remain, that activity is reduced as compared with full-length
fragments, whereas the activity that is considered "primary"
remains at about the same or an increased level as that observed in
the full-length native protein. Similarly, the term "primarily
excludes" as used herein refers to fragments in which although some
level of a particular biological activity may remain, the level of
that particular activity is reduced (optionally significantly
and/or statistically significantly reduced) as compared with
full-length fragments, whereas one or more other biological
activities remains at about the same or increased level as that
observed in the full-length native protein.
[0238] In some embodiments involving selection of biologically
active fragments, the methods include assessing changes in the
level of expression of one or more biological activities, including
increases and decreases of one or more activities in a selected
fragment are assessed as changes in reference to the activities
observed in the full-length protein. In some embodiments, one or
more biological activities are being selected to remain the same as
that observed in the full-length fragments while other activities
may be increased or decreased or even eliminated (e.g., such
fragments would lack one or more of the activities discussed). In
some embodiments, the change is an increase as compared to another
level, and in some embodiments the change is a decrease as compared
to another level. In some embodiments, the detectable change
(increase or decrease) is statistically significant. In some
embodiments, such changes can be assessed quantitatively as at
least about a 5%, 10%, 25%, 50%, 100%, 200%, 500% or greater
change, and/or about a 5-10%, 10-25%, 10-50%, 25-50%, 50-75%,
50-100%, 100-150%, 100-200%, 200-300%, 300-500%, or 500-1000%
change. In some embodiments, an activity that "remains the same"
can still be observed to have some change from the activity of the
full-length protein, but such change might be limited to, for
example, about a 1%, 2%, 5%, 10%, or 20% change or less.
[0239] In a non-limiting example, fragments of interest may include
those that primarily mediate the anti-inflammatory effect of Smad7.
Smad7 peptides having this anti-inflammatory function may be
sufficient and optionally an improvement for treating chronic
inflammation associated conditions, such as but not limited to,
oral mucositis, stomatitis and psoriasis, among others. In another
non-limiting example, fragments of interest may include those that
primarily mediate cell migration and/or blocking TGF-.beta.-induced
growth arrest and/or fibrotic response. Smad7 peptides having this
cell migration and proliferation function may be sufficient, and
optionally an improvement, for enhancing healing that is not
associated with excessive inflammation. Types of wounds that might
benefit from this form of treatment include, but are not limited
to, surgical wounds, fibrotic scarring, and diabetes wounds,
defective healing and/or scarring among others.
G. Methods of Producing Smad7 Protein
[0240] In another aspect, methods for producing Smad7 protein,
including any of the Smad7 variants, fragments, truncations, fusion
proteins (e.g., PTD-Smad7) described herein are contemplated. The
inventors have discovered methods of producing Smad7 protein at
levels and purity sufficient for research, development, or
commercialization that include nucleic acid codon optimization. As
a result, methods for producing Smad7 including the use of one or
more of the codon-optimized Smad7 nucleic acid molecules described
herein (e.g., within the Examples) are expressly contemplated.
EXAMPLES
[0241] The following examples are included to illustrate various
embodiments. It should be appreciated by those of skill in the art
that the techniques disclosed in the examples that follow represent
techniques discovered to function well in the practice of the
claimed methods, compositions and apparatus. However, those of
skill in the art should, in light of the present disclosure,
appreciate that many changes may be made in the specific
embodiments which are disclosed and still obtain a like or similar
result without departing from the spirit and scope of the present
technology.
Example 1: K5.Smad7 Mice are Resistant to Oral Mucositis
[0242] A transgenic mouse model expressing a human Smad7 protein in
keratinocytes (K5.Smad7) was generated as previously described (Han
et al., Dev. Cell, 11:301-312, 2006). Transgene expression in oral
epithelia was confirmed (FIGS. 7A-B). The mice were bred into in
the C57BL/6 background, and 8-10 weeks old male and female
transgenic mice and wild-type littermates were used in studies.
These mice showed improved healing of excisional skin wounds (Han
et al., Am. J. Pathol., 179:1768-1779, 2011) and radiation-induced
oral mucositis.
[0243] K5.Smad7 mice and wild-type littermates were exposed to
cranial radiation to determine the biological equivalent dose (BED)
required to induce oral mucositis in mice. It was determined that 8
Gy.times.3 (BED=43.2), a regimen relevant to hypo-fractionated
radiotherapy in clinic, was the minimal dose needed to induce oral
mucositis (FIGS. 1A-B). To evaluate the potency of Smad7 effects,
they also tested single doses of cranial radiation and found that
oral mucositis severity correlated with BED values between 18 Gy
(BED=50.4) and 22 Gy (BED=70.4) (FIGS. 1A-B, FIG. 7C). By day 9
after initiation of radiation, wild-type mice developed oral ulcers
(FIGS. 1A-B).
[0244] K5.Smad7 oral mucosa prior to irradiation had morphology
similar to wild-type mice, but exhibited resistance to
radiation-induced oral mucositis (FIGS. 1A-B). Histological
analyses revealed that wild-type mice developed oral mucositis
(FIG. 1A) similar to that in humans (FIG. 1C). The First Affiliated
Hospital of Kunming Medical University, China provided
de-identified archived human tissue paraffin sections and approved
the study as an exempt for human subjects. Oral mucositis lesions
were from the tongue, buccal or oropharyngeal mucosa adjacent to
recurrent oral cancers that had undergone radiotherapy.
Non-irradiated oral mucosa sections were from surgically removed
sleep apnea oral tissues and a tongue biopsy adjacent to a cyst
(mucocele).
[0245] K5.Smad7 oral epithelia typically showed radiation
dose-dependent damage, i.e., thinning epithelium and flattened
tongue papillae after 8 Gy.times.3 radiation, and more damaged
(hypo- or hypertrophic) epithelial cells after 18 Gy and 22 Gy
radiation (FIG. 1A). Consistent with increased leukocyte
infiltration in human oral mucositis lesions (FIG. 1C), lesions in
wild-type mice harbored numerous infiltrated leukocytes (FIGS.
1D-E) consisting of neutrophils, macrophages, and lymphocytes (FIG.
7D); all were substantially reduced in K5.Smad7 oral mucosa (FIGS.
1D-E and 7D).
[0246] Because it is difficult to capture human oral mucositis
pathology at the acute phase, a mouse model was utilized to assess
proliferation and apoptosis when ulcers are just formed. Similar to
previous reports, proliferative cells were sparse in irradiated
wild-type oral epithelium, but were seen more in irradiated
K5.Smad7 oral epithelium (FIGS. 1D and 1F). Conversely, apoptotic
cells were significantly reduced in irradiated K5.Smad7 oral mucosa
compared to wild-type mice (FIGS. 1D and 1G).
[0247] As expected, cells with nuclear NF-.kappa.B p50 subunit were
significantly increased in oral mucositis compared to
non-irradiated wild-type oral mucosa (FIGS. 2A-B). Interestingly,
TGF-.beta.1, an immune suppressant in internal organ, but
pro-inflammatory in oral mucosa, together with its activated
signaling mediator, phosphorylated (p) Smad2, were also increased
in oral mucositis compared to non-irradiated oral mucosa in
wild-type mice (FIGS. 2A-B). Similar changes were also detected in
human oral mucositis lesions (FIGS. 2A-B).
[0248] Irradiated K5.Smad7 oral epithelia significantly reduced
cells positive for nuclear NF-.kappa.B p50 and pSmad2, even though
they still had abundant TGF-.beta.1 protein (FIGS. 2A-B).
TGF-.beta.1 mRNA in irradiated wild-type oral mucosa was
significantly increased on day 9 and day 10 (FIG. 2C). TGF-.beta.1
mRNA level in K5.Smad7 mucosa was similar to wild-type mucosa at
earlier time points, but was back to normal by day 10 (FIG. 2C).
Although not wishing to be bound by any theory, these data suggest
that TGF-.beta.1 transcription is not inhibited by Smad7, but its
more rapid decline in K5.Smad7 mucosa could be a consequence of
accelerated healing.
[0249] Phospho-Smad1/5/8, markers for activated BMP signaling, were
not affected by Smad7 before and after radiation (FIG. 7E). This
result is consistent with the ability of Smad7 to preferentially
inhibit TGF-.beta. signaling.
Example 2: Rac1 Contributes to Smad7-Mediated Keratinocyte
Migration
[0250] To determine if Smad7 contributes to healing in human oral
keratinocytes, Smad7 was knocked down in spontaneously immortalized
human oral keratinocytes (NOK-SI) Smad7 knockdown blunted
keratinocyte migration after wounding (FIG. 2D and FIG. 8A).
Conversely, knocking down TGF-.beta.1 accelerated keratinocyte
migration (FIGS. 8B-8D), consistent with accelerated wound healing
seen in mice null for TGF-.beta.1 or Smad3.
[0251] To search for molecular mechanisms associated with
Smad7-mediated keratinocyte migration, Rac1, a protein
indispensable for oral wound healing was examined. Rac1 was reduced
after Smad7 knockdown (FIG. 2E). It was expected that TGF-.beta.1
overexpression in oral mucositis would activate Rac1 through a
Smad-independent mechanism. However, although total Rac1 protein
increased by 2-fold after irradiation, activated Rac1 protein did
not change considerably in wild-type tongues (FIG. 2F).
[0252] In K5.Smad7 oral mucosa, both total and activated Rac1 were
significantly increased by 4-fold and 8-fold, respectively,
compared to wild-type oral mucosa (FIG. 2F). To determine the
functional significance of Smad7-induced Rac1 activation, Rac1 was
knocked down in primary keratinocytes isolated from wild-type and
Smad7 transgenic neonatal skin, and assays for cell proliferation
and migration were performed. Rac1 knockdown showed modestly
reduced proliferation in wild-type and Smad7 keratinocytes (FIGS.
9A-9C), but almost completely abrogated Smad7-induced migration
(FIG. 2G and FIG. 9D), suggesting that increased Rac1 contributes
to Smad7-mediated cell migration.
[0253] It was observed that increased Rac1 mRNA levels in Smad7
transgenic keratinocytes correlated with total and active Rac1
protein levels (FIGS. 3A-B and FIGS. 10A-B), suggesting that
increased Rac1 activation in Smad7 keratinocytes is, at least in
part, a consequence of increased Rac1 transcripts. Further, Rac1
protein increased by .about.3-fold (FIG. 3C) after knockdown of
individual Smads in NOK-SI cells (FIGS. 10C-10E). These data
suggest that normal Smad signaling represses Rac1
transcription.
[0254] Among the two putative Smad binding elements (SBEs) in the
mouse Rac1 promoter (-2.1 Kb and -1.5 Kb upstream of the coding
sequence), which are in similar regions of the human Rac1 promoter,
chromatin immunoprecipitation (ChIP) identified Smad-2, -3, -4, and
-7 binding to the -1.5 Kb site (FIG. 3D), but not the -2.1 Kb site
in wild-type keratinocytes; binding of Smad-2, -3 and -4 was
significantly reduced in Smad7 transgenic keratinocytes (FIG.
3D).
[0255] Luciferase reporter assays using a SBE-containing Rac1-Luc
construct show that knockdown of Smad7 in wild-type keratinocytes
significantly reduced luciferase activity (FIG. 3E). Conversely,
Smad7 transgenic cells had increased luciferase activity compared
to wild-type cells, and mutating the SBE attenuated this increase
(FIG. 3F). Thus, Smad7 binding to SBE appears necessary to expel
signaling Smads to abrogate Rac1 repression.
[0256] Among known Smad transcriptional co-repressors, it was found
that CtBP1 bound to the Rac1 promoter SBE-1.5 Kb site in wild-type
keratinocytes (FIG. 3G), and Smad7 transgene expression
significantly reduced CtBP1 binding to the SBE (FIGS. 3G-H). When
CtBP1 was knocked down in NOK-SI cells, Rac1 protein and Rac1-Luc
activity were increased compared to keratinocytes transfected with
scrambled siRNA (FIGS. 4A-B), suggesting that CtBP1 binding to
SBE-1.5 Kb represses Rac1 expression. Further, knocking down CtBP1
in NOK-SI cells increased their migration (FIG. 4C and FIG.
10F).
[0257] Upon examination of CtBP1 protein in radiation-induced oral
mucositis, it was found that CtBP1 is barely detectable in
non-irradiated mouse and human oral mucosa (FIGS. 4D-4F); however,
CtBP1 positive cells were significantly increased in irradiated
oral mucosa of wild-type and K5.Smad7 mice as well as in human oral
mucositis (FIGS. 4D-4F). Additionally, CtBP1 mRNA in irradiated
wild-type oral mucosa was significantly increased on day 9 and day
10 (FIG. 4G). CtBP1 mRNA level in K5.Smad7 mucosa was similar to
wild-type mucosa at earlier time points, but declined to normal by
day 10 (FIG. 4G). These results indicate that Smad7 does not reduce
CtBP1 mRNA but instead inhibits CtBP1 binding to the Rac1 promoter
by repelling the Smad/CtBP1 complex from the SBE binding site;
further, more rapid CtBP1 reduction in K5.Smad7 mucosa serves as a
marker of healing.
Example 3: Tat-Smad7 Alleviates Radiation-Induced Oral
Mucositis
[0258] Smad7 transgene's ability to block multiple pathological
processes of oral mucositis prompted us to explore if localized
Smad7 delivery can be used to prevent and treat oral mucositis.
Because Smad7 is a nuclear protein, local Smad7 delivery needs to
allow Smad7 to rapidly enter into cells before saliva washes off
the protein. Thus, a recombinant human Smad7 with an N-terminal
Tat-tag allowing proteins to rapidly permeate the cell membrane and
enter the nucleus was produced. A V5 epitope was added to the
C-terminal end of the Tat-Smad7 protein to track Tat-Smad7 cell
penetration (FIGS. 11A-11D).
[0259] Using its ability to block Smad2 phosphorylation, Tat-Smad7
bioactivity was tested (FIG. 11C). Tat-Cre recombinant protein with
the same tags as a control (FIGS. 11E-F) was produced, and cloned
into the pET101-Topo protein expression vector (Invitrogen) that
contains a sequence encoding C-terminal 6.lamda.His (SEQ ID NO:
40). Tat-Cre was transformed into BL-21 STAR.TM. E. coli
(Invitrogen) to produce Tat-Cre protein and was purified with
Ni-NTA column.
[0260] The purity and size of both proteins was verified using
SDS-PAGE electrophoresis. To evaluate transduction and activity of
Tat-Smad7 protein in vitro, Tat-Smad7 was added to primary mouse
keratinocytes. Slides were fixed in cold methanol for 5 minutes and
stained for V5 and pSmad2. Tat-Cre activity was verified by
digesting a 1,460 bp floxed fragment from the 7,650 bp vector
pLL3.7 (Addgene). For in vivo treatments, 30 L 50% glycerol/PBS as
a vehicle control and Tat-Cre as a non-irrelevant protein control
were used. Tat-Smad7 or Tat-Cre (in 30 .mu.L 50% glycerol/PBS,
doses and regimens are specified in each figure) was topically
applied to mouse oral cavity and mice were restricted from oral
intake for 1 hour.
[0261] For oral mucositis prevention, both Tat-Smad7 and Tat-Cre
(in 50% glycerol/PBS) were topically applied to the oral cavity of
8-10 week old C3H females (Jackson Laboratory) or C57BL/6 mice
daily, starting 24 hours prior to radiation through day 8 after
initiation of radiation. Treated tissues were examined on day 9.
Mouse tongues were harvested, fixed in 10% formalin, embedded in
paraffin, and cut into 5 .mu.m sections. Histological changes were
analyzed and ulcers were measured using H&E stained slides. An
additional group received Palifermin treatment with a clinical
regimen, i.e., 6.25 mg kg.sup.-1 (i.p.) daily for 3 days prior to
irradiation, and daily for 3 days 24 hours after the last dose of
radiation.
[0262] Tat-Cre showed no effect compared to vehicle controls (FIGS.
5A-B). Tat-Smad7 treatments showed preventive effects on ulcer
formation similar to Palifermin (FIG. 5A). The dose-dependent
effect of Tat-Smad7 was more obvious when used on animals given a
20 Gy (BED=60) single dose of radiation that induced larger oral
ulcers than fractionated radiation (FIG. 11G). Microscopically,
both Palifermin and Tat-Smad7 treated oral mucosa prevented open
ulceration in the majority of cases (FIG. 5B). Palifermin-treated
mucosa exhibited more keratinocyte down-growth but also more
damaged keratinocytes (condensed or charcoal-like nuclei, swelled
mono- or multi-nucleated cells and shattered nuclear fragments in
conified layers) than Tat-Smad7-treated mucosa (FIG. 5B).
Immunostaining revealed that Palifermin increased proliferation
more significantly than Tat-Smad7. Tat-Smad7 reduced apoptosis,
leukocyte infiltration, nuclear pSmad2 and NF-.kappa.B p50, but
Palifermin did not (FIGS. 5B-5G).
[0263] To test whether Tat-Smad7 can be used to treat existing oral
mucositis, mice were exposed to fractionated (8 Gy.times.3) cranial
radiation and Tat-Smad7 (topically) or Palifermin (6.25 mg
kg.sup.-1, i.p.) was applied daily from day 6 after initiation of
radiation (when mucosal damage was obvious) till day 9. Treated
tissues were examined on day 10. Although beginning post-radiation
administration of Palifermin at earlier time points than the
current protocol reduced oral mucositis in mice, Palifermin
administration with the current protocol did not accelerate ulcer
closure (FIG. 6A), regardless of its hyperproliferative effect on
the entire oral mucosa (FIG. 6B). This is not surprising, as
Palifermin is approved to prevent but not treat oral mucositis.
[0264] Tat-Smad7 treated oral mucositis reduced ulcer sizes and
pathological alterations after both fractionated and single dose
radiation (FIGS. 6A-B and FIGS. 12A-12G). Away from ulcers,
Tat-Smad7 treated oral mucosa exhibited less hyperplasia and more
differentiated epithelia than Palifermin-treated oral mucosa (FIG.
6B). With a 20 Gy single dose radiation that caused slower healing
than fractionated radiation, the effect of Tat-Smad7 on recovery
after wound closure was more obvious. When vehicle treated ulcer
was just re-epithelialized, Tat-Smad7 treated mucosa had almost
recovered to normal morphology (FIG. 6C).
[0265] Consistent with observations in K5.Smad7 mice, Tat-Smad7
increased Rac1 promoter activity and reduced CtBP1 binding to the
SBE of the mouse Rac1 promoter (FIGS. 12G and 12I), and increased
Rac1 protein in mouse oral mucositis and human oral keratinocytes
(FIGS. 6D-E).
[0266] Tat-Smad7-treated human oral keratinocytes after wound
scratch had accelerated wound closure (FIG. 6F and FIG. 13A).
Further, irradiated human oral keratinocytes increased nuclear
pSmad2 and NF-.kappa.B p50, which were attenuated by Tat-Smad7
treatment (FIG. 13B). In contrast, although Tat-Smad7 penetrated
oral cancer cells efficiently (FIG. 13C), it did not further
elevate Rac1 protein level that is already abundant in cancer cells
(FIG. 13D). This result could account for faster migration of
cancer cells than normal keratinocytes (FIG. 6F and FIGS. 13A,
13E-13H), and the lack of an effect of Tat-Smad7 on migration in
two oral cancer cell lines: MSK921 which does not contain genetic
loss of TGF-.beta. signaling components; and Cal27 which has a
mutated Smad4 (FIGS. 13E-13H).
[0267] Colony assays show that survival of human oral keratinocytes
was slightly increased by Tat-Smad7 treatment with or without
radiation (FIG. 6G). Consistent with the notion that reduced
survival after irradiation is more prominent in cancer cells than
in normal cells, SCC cells showed substantial reductions in cell
survival after radiation. Treatment with Tat-Smad7 did not affect
survival in SCC cells with or without radiation (FIG. 6G).
Example 4: Design of a Cell-Penetrating Smad7 Protein
[0268] It was hypothesized that in order to be effective as a
therapeutic, SMAD7 needed to be able to penetrate cells
efficiently. In order to achieve this, the Smad7 sequence was
modified to include a protein transduction domain.
[0269] The Tat sequence from HIV was selected to test with Smad7 as
a protein transduction domain. The nucleotide and protein sequences
of Tat that were used in fusion proteins with Smad7 and Smad7
fragments are derived from Cardarelli et al., Traffic Apr
9(4):528-39 (2008). The Tat nucleotide and amino acid sequences are
provided below:
TABLE-US-00002 (SEQ ID NO: 1) ggccgtaaaaaacgccgtcaacgccgccgt (SEQ
ID NO: 2) G R K K R R Q R R R
[0270] Fusion proteins were prepared having Tat directly linked in
frame to human Smad7 complementary DNA (cDNA) either at the 5' or
3' ends of Smad7 as shown below:
TABLE-US-00003 (SEQ ID NO: 7) 5' Tat:
Ggccgtaaaaaacgccgtcaacgccgccgt -Smad7 (SEQ ID NO: 8) 3' Tat:
Smad7-Ggccgtaaaaaacgccgtcaacgccgccgt
[0271] The 5' Tat-Smad7 construct included a 3' V5 tag sequence,
and was cloned into the pGEX-6p-1 protein expression vector (New
England Biolabs) to make a GST-Tat-Smad7 fusion protein. Tat-Smad7
gene was transformed into BL-21 Star Escherichia coli (Invitrogen)
to produce Tat-Smad7 protein. The protein was purified by
glutathione column purification and elution, using enzymatic
cleavage from the Glutathione S Transferase (GST) fusion (Precision
enzyme, GE Life Sciences).
[0272] While creating a PTD-Smad7 fusion protein, a V5 tag at the
3' end was included to monitor Tat-Smad7 penetration into cells by
immunostaining using a V5 antibody. This epitope tag can be deleted
for use in the clinic (e.g., by re-cloning the sequence in the
absence of the V5 tag), if appropriate.
[0273] A PTD-Smad7 fusion protein (Tat-Smad7-V5-6H) ("6H" disclosed
as SEQ ID NO: 40) was also created having a 6-Histidine (6-H) tag
(SEQ ID NO: 40) for protein purification, and is shown below.
Tat-Smad7-V5-6H ("6H" disclosed as SEQ ID NO: 40) has the following
nucleotide sequence: 1-53 include the 5' sequence of pET-TOPO;
54-1365 include Tat-Smad7; 1366-1497 include 3' pET-TOPO containing
the V5 epitope and 6.times.His tag (SEQ ID NO: 40) (V5 includes
1393-1434, His tag includes 1444-1461, and the Stop includes
1462-1464).
[0274] Tat-human Smad7, codon-optimized for protein production,
cloned to pET101/D-Topo vector is shown below:
TABLE-US-00004
ttcccctctagaaataattttgtttaactttaagaaggaattcaggagcccttcaccatg M
cgtaaaaaacgccgtcaacgccgccgtggtttccgtacgaaacgctcggccctggtccgt R K K
R R Q R R R G F R T K R S A L V R
cgcctgtggcgctcccgtgctccgggtggtgaagatgaagaagaaggtgctggcggcggt R L W
R S R A P G G E D E E E G A G G G
ggcggtggcggtgaactgcgtggcgagggtgcaaccgatagtcgtgcacacggtgcaggc G G G
G E L R G E G A T D S R A H G A G
ggtggcggtccgggtcgtgctggttgctgtctgggtaaagctgtgcgcggcgcgaaaggt G G G
P G R A G C C L G K A V R G A K G
catcaccatccgcacccgccggcagcaggtgcaggtgcagctggcggtgcggaagccgat H H H
P H P P A A G A G A A G G A E A D
ctgaaagccctgacccatagtgtcctgaaaaaactgaaagaacgtcagctggagctgctg L K A
L T H S V L K K L K E R Q L E L L
ctgcaagcagtagaatcccgtggcggtacccgtacggcttgtctgctgctgccgggtcgt L Q A
V E S R G G T R T A C L L L P G R
ctggattgccgtctgggtccgggtgcaccggctggtgcgcagccggcacaaccgccgagc L D C
R L G P G A P A G A Q P A Q P P S
tcttacagcctgccgctgctgctgtgtaaagtgtttcgttggccggacctgcgccacagt S Y S
L P L L L C K V F R W P D L R H S
tccgaagttaaacgcctgtgctgttgcgagagctatggcaaaattaacccggaactggtt S E V
K R L C C C E S Y G K I N P E L V
tgttgcaatccgcaccatctgtctcgtctgtgtgaactggagagcccgccgccgccgtat C C N
P H H L S R L C E L E S P P P P Y
tctcgttacccgatggatttcctgaaaccgactgcagattgcccggacgcagtcccgtca S R Y
P M D F L K P T A D C P D A V P S
tcggctgagaccggcggcaccaactatctggcaccgggcggtctgagtgattcccagctg S A E
T G G T N Y L A P G G L S D S Q L
ctgctggaaccgggcgaccgttcacattggtgtgtggttgcctattgggaagagaaaacg L L E
P G D R S H W C V V A Y W E E K T
cgtgtcggtcgcctgtactgcgtacaggaaccgtcgctggatatcttttatgacctgccg R V G
R L Y C V Q E P S L D I F Y D L P
cagggcaatggtttctgtctgggccaactgaactcagataataaatcgcagctggtgcaa Q G N
G F C L G Q L N S D N K S Q L V Q
aaagttcgctcaaaaattggctgcggtatccagctgacccgtgaagttgacggtgtctgg K V R
S K I G C G I Q L T R E V D G V W
gtatataaccgcagctcttacccgatttttatcaaaagtgccaccctggataatccggac V Y N
R S S Y P I F I K S A T L D N P D
tcccgtacgctgctggtccacaaagtatttccgggcttctcaatcaaagcgttcgattac S R T
L L V H K V F P G F S I K A F D Y
gagaaagcctactcgctgcagcgcccgaacgaccatgaattcatgcagcaaccgtggacg E K A
Y S L Q R P N D H E F M Q Q P W T
ggttttactgtgcagatctctttcgttaaaggctggggtcaatgctacacccgtcagttt G F T
V Q I S F V K G W G Q C Y T R Q F
atctcgtcctgtccgtgctggctggaagtgattttcaatagccgcaagggcgagctcaat I S S
C P C W L E V I F N S R K G E L N
tcgaagcttgaaggtaagcctatccctaaccctctcctcggtctcgattctacgcgtacc S K L
E G K P I P N P L L G L D S T R T
ggtcatcatcaccatcaccattgagtttgatccggctgctaacaaagcccgaaagga (SEQ ID
NO: 9) G H H H H H H - (SEQ ID NO: 10)
[0275] A comparison of the protein sequence of Tat-Smad7-v5 and
Smad7 is provided below. The first amino acid of Smad7 in Tat-Smad7
is not M (unlike Smad7), because Tat-Smad7 is designed to be
in-frame with Tat and/or GST to form a GST fusion protein.
Tat-Smad7 is then cleaved from the GST fusion protein after
purification. Upper case nucleotides identify the V5 tag.
Underlined italics indicate amino acids from the optional
pET101-Topo backbone vector.
[0276] Below a Tat-Smad7-v5 and Smad7 comparison is presented:
TABLE-US-00005 Tat-Smad7-V5 1
gsgrkkrrqrrrgfrtkrsalvrrlwrsrapggedeeegagggggggelr human Smad7 1
------------mfrtkrsalvrrlwrsrapggedeeegagggggggelr Tat-Smad7-V5 51
gegatdsrahgaggggpgragcclgkavrgakghhhphppaagagaagga human Smad7 39
gegatdsrahgaggggpgragcclgkavrgakghhhphppaagagaagga Tat-Smad7-V5 101
eadlkalthsvlkklkerqlelllqavesrggtrtaclllpgrldcrlgp human Smad7 89
eadlkalthsvlkklkerqlelllqavesrggtrtaclllpgrldcrlgp Tat-Smad7-V5 151
gapagaqpaqppssyslplllckvfrwpdlrhssevkrlcccesygkinp human Smad7 139
gapagaqpaqppssyslplllckvfrwpdlrhssevkrlcccesygkinp Tat-Smad7-V5 201
elvccnphhlsrlcelesppppysrypmdflkptadcpdavpssaetggt human Smad7 189
elvccnphhlsrlcelesppppysrypmdflkptadcpdavpssaetggt Tat-Smad7-V5 251
nylapgglsdsqlllepgdrshwcvvayweektrvgrlycvqepsldify human Smad7 239
nylapgglsdsqlllepgdrshwcvvayweektrvgrlycvqepsldify Tat-Smad7-V5 301
dlpqgngfclgqlnsdnksqlvqkvrskigcgiqltrevdgvwvynrssy human Smad7 289
dlpqgngfclgqlnsdnksqlvqkvrskigcgiqltrevdgvwvynrssy Tat-Smad7-V5 351
pifiksatldnpdsrtllvhkvfpgfsikafdyekayslqrpndhefmqq human Smad7 339
pifiksatldnpdsrtllvhkvfpgfsikafdyekayslqrpndhefmqq Tat-Smad7-V5 401
pwtgftvgisfvkgwgqcytrqfisscpcwlevifnsrkgelnskleGKP human Smad7 389
pwtgftvqisfvkgwgqcytrqfisscpcwlevifnsr------------ Tat-Smad7-V5 451
IPNPLLGLDST (SEQ ID NO: 11) human Smad7 427 ----------- (SEQ ID NO:
12)
Example 5: Additional Assays for PTD-Smad7 Protein Activity
[0277] Immunofluorescence (IF), immunohistochemistry (IHC), and
TUNEL assay for apoptosis.
[0278] IF and IHC were performed as previously described (Han, G.,
Li, F., Ten Dijke, P. & Wang, X. J. Temporal smad7 transgene
induction in mouse epidermis accelerates skin wound healing. Am J
Pathol 179, 1768-1779 (2011)). Primary antibodies used were guinea
pig antibody to K14 (1:400, Fitzgerald, 20R-CP200), rat antibody to
CD4 (1:20, BD Bioscience, 550278), Ly-6G (1:20, BD Bioscience,
550291), BM8 (antibody to F4/80, 1:20, Invitrogen, MF48000),
FITC-labeled antibody to BrdU (BD Bioscience, 347583), rat antibody
to CD45 (1:50, BD Bioscience, 550539) for mouse samples, mouse
antibody to CD45 (1:50, Abcam, Ab781) for human samples, chicken
antibody to TGF-.beta.1 (1:50, R&D, AF-101-NA), rabbit antibody
to CtBP1 (1:100, Millipore, 07-306), rabbit antibody to NF-.kappa.B
p50 (1:200, Santa Cruz Biotechnology, SC-7178), rabbit antibody to
PCNA (1:200, Santa Cruz Biotechnology, SC-7907), rabbit antibody to
pSmad2 (1:100, Cell Signaling Technology, 3101), and mouse antibody
to V5 (1:500, Invitrogen, 460705). For IF, secondary antibodies to
different species IgG were Alexa Fluor.RTM. 594 (red) or 488
(green) conjugated (1:200 for all, Invitrogen). For IHC, secondary
biotinylated antibodies to different species IgG (1:300, Vector
Labs) were used and were developed using Vectastain ABC kit (Vector
Labs). A Terminal deoxynucleotidyl transferase uridine nick
end-labeling (TUNEL, G3250) kit (Promega) was used on formalin
fixed tissue sections to detect apoptotic cells. BrdU labeling was
performed in vivo by i.p. injection of 0.125 mg g.sup.-1 BrdU 1
hour prior to euthanization. PCNA or BrdU were quantified as cells
mm.sup.-1 epithelial length including all epithelial cells, TUNEL
or CD45-positive cells as cells mm.sup.- epithelial length
including all epithelial layers and stroma above the muscle layer,
nuclear pSmad2 or NF-.kappa.B p50 positive cells as the number of
positive cells/existing total remaining epithelial cells (i.e.,
excluding sloughed epithelial cells induced by irradiation).
Consecutive fields of slides were used to count BrdU-labeled cells
using MetaMorph software.
[0279] Cell Culture.
[0280] Smad7 transgenic and wild-type primary keratinocytes were
prepared from neonatal mouse skin as previously described (Han, G.,
Li, F., Ten Dijke, P. & Wang, X. J. Temporal smad7 transgene
induction in mouse epidermis accelerates skin wound healing. Am J
Pathol 179, 1768-1779 (2011)), and cultured in PCT medium
(CELLnTEC). Spontaneously immortalized normal oral keratinocytes
(NOK-SI) derived from gingival tissues of healthy volunteers were
cultured and maintained in defined keratinocyte medium (Castilho,
R. M., et al. Rac1 is required for epithelial stem cell function
during dermal and oral mucosal wound healing but not for tissue
homeostasis in mice. PloS one 5, e10503 (2010)). Oral cancer cells
Cal27 (ATCC) and MSK921 were cultured (D. Raben's lab,
fingerprinted by University of Colorado Cancer Center Tissue
Culture Core) in Dulbecco Modified Eagle Medium supplemented with
10% fetal bovine serum (GIBCO.RTM.; Invitrogen). To assess the
effect of Tat-Smad7 in irradiated cells, the above human cell lines
were cultured in chamber slides (BD Bioscience, 354108), irradiated
with 3 Gy, and Tat-Smad7 (1 .mu.g mL.sup.-1) was added to the
culture medium immediately after irradiation. Cells were fixed in
100% cold methanol 4 hours after Tat-Smad7 treatment for
immunostaining of pSmad2, NF-.kappa.B p50 and V5.
[0281] Transfection with siRNA.
[0282] When cultured keratinocytes reached 70% confluency, 100 nM
of target siRNA or scrambled siRNA (Dharmacon) was transfected
using LIPOFECTAMINE.RTM. 2000 (Invitrogen). Cells were harvested
48-72 hours after transfection and subjected to western analyses to
determine knockdown efficiency. For migration assays, siRNA was
transfected when cells were plated. Target siRNAs included in this
study are: mouse siRac1-1 (Invitrogen, MSS237708) and siRac1-2
(IDT, MMC.RNAI.N009007.12.3); human siSmad2 (Dharmacon,
L-003561-00-0005), siSmad3 (Invitrogen, HSS106252), and siSmad4
(Invitrogen, HSS118066); human siCtBP1-1 and siCtBP1-2; human
siSmad7-1 and siSmad7-2; human TGF-.beta.1 (Dharmacon,
J-012562-08-0005); mouse siSmad7.
[0283] In Vitro Keratinocyte Proliferation Assay.
[0284] In vitro keratinocyte proliferation was determined by BrdU
incorporation in wild-type and Smad7 transgenic keratinocytes.
Cells at 70% confluency were transfected with Rac1 siRNAs, and
changed to regular culture medium 24 hours later. An in situ cell
proliferation kit (Roche Applied Science) was used to perform in
vitro BrdU labeling and detection, and MetaMorph software was used
to count BrdU-labeled cells.
[0285] In Vitro Cell Migration Assays.
[0286] When cells reached 100% confluency, the cells were treated
with mitomycin C (Sigma) at 10 .mu.g mL.sup.-1 for 2 hours to
inhibit cell proliferation and a scratch wound was introduced with
a Fisherbrand pipet tip. Cell migration was photographed daily.
Migration assays were performed when cells reached confluency after
24 to 36 hours of siRNA transfection, and Image-J software was used
to document cell migration as the wound area occupied with
migrating cells. For Tat-Smad7 treatment, cells were exposed to
Tat-Smad7 protein at 1 .mu.g mL.sup.-1 or vehicle control (PBS) in
medium after wound scratch, and medium was changed every other day
with freshly added Tat-Smad7 until migrating cells fully covered
the scratched wound.
[0287] Cell Survival Assay.
[0288] Cell survival assays were performed as previously described
(Munshi, A., Hobbs, M. & Meyn, R. E. Clonogenic cell survival
assay. Methods in molecular medicine 110, 21-28 (2005)), with
slight modifications. Briefly, cells were plated in 12-well plates
at 500 cells well.sup.-1 for non-irradiated wells, and increased up
to 1,500 cells well.sup.-1 along with increased radiation doses.
Cells were irradiated 24 hours after they were plated. Tat-Smad7
was added at 1 .mu.g mL.sup.-1 or the same volume of PBS used to
dissolve Tat-Smad7 (control) to culture medium of irradiated and
non-irradiated cells. The medium was changed every other day with
freshly added Tat-Smad7 or PBS for 10 to 14 days. Colonies were
fixed in methanol, stained in 0.5% crystal violet solution
(containing 25% methanol), counted and the average from 4 wells in
each experiment was calculated. Two to three separate experiments
were performed for each cell line. The relative surviving fraction
was calculated as previously described, i.e., the absolute
surviving fraction (colony numbers/total plated cells) under each
radiation dose divided by the absolute surviving fraction of
non-irradiated cells.
[0289] Western Analysis.
[0290] Protein extraction and western analyses were performed as
previously described (Li, A. G., Lu, S. L., Zhang, M. X., Deng, C.
& Wang, X. J. Smad3 knockout mice exhibit a resistance to skin
chemical carcinogenesis. Cancer Res 64, 7836-7845 (2004)). The
antibodies used in this study included rabbit antibody to Smad7
(1:500), rabbit antibodies to Smad2 (1:300, Zymed, 51-1300) and
Smad4 (1:300, Epitomics, 1676-1), rabbit antibody to Smad3 (1:300,
Cell Signaling Technology, 9513), mouse antibody to Rac1 (1:500, BD
Biosciences, 610651), rabbit antibody to CtBP1 (1:500, Millipore,
07-306), mouse antibody to tubulin (1:3000, Sigma, T5168), mouse
antibody to GAPDH (1:5000, Abcam, Ab8245) and goat antibody to
actin (1:1000, Santa Cruz Biotechnology, SC1616). Gray-scale images
were obtained using the ODYSSEY.RTM. v.1.2 software (LI-COR
Biosciences).
[0291] Rac1 Activation Assay.
[0292] Active GTP-bound Rac1 was examined using a BIOCHEM.TM. Kit
for Rac1 activation (Cytoskeleton Inc, BK035). Wild-type and Smad7
transgenic keratinocytes were cultured in 15 cm diameter tissue
culture plates and prepared protein lysates using the provided
lysis buffer. To assay Rac1 activity, 1 mg of cell lysate was used.
To examine total Rac1 and Smad7 proteins, 50 .mu.g of lysate was
used. To measure GTP-bound Rac1 in mouse tongues, half of the
tongue was ground to a powder in liquid nitrogen and lysed with
lysis buffer to extract protein, GTP-bound Rac1 was assayed in 2 mg
of protein lysate per sample and 50 .mu.g of protein lysate was
loaded for total Rac1 protein western blot.
[0293] Chip Assays.
[0294] ChIP assays were performed using the ChIP-IT express kit
(Active Motive, 53009) as previously described (Hoot, K. E., et al.
HGF upregulation contributes to angiogenesis in mice with
keratinocyte-specific Smad2 deletion. J Clin Invest 120, 3606-3616
(2010); Hoot, K. E., et al. Keratinocyte-specific Smad2 ablation
results in increased epithelial-mesenchymal transition during skin
cancer formation and progression. Owens, et al., J. Clin. Invest
118, 2722-2732 (2008). Smad4-dependent desmoglein-4 expression
contributes to hair follicle integrity. Owens, et al., Dev. Biol.
322:156-166 (2008). DNA-protein complex was isolated from primary
mouse keratinocytes. For ChIP, 6.3 .mu.g sheared chromatin was
incubated with protein-G magnetic beads and 2 .mu.g each of rabbit
antibodies to Smad2 (Cell Signaling Technology, 3122), Smad3 (Cell
Signaling Technology, 9523), Smad4 (Cell Signaling Technology,
9515), Smad7 antibody (Santa Cruz Biotechnology, SC-11392), CtBP1
(Millipore) or a negative control rabbit IgG (Santa Cruz
Biotechnology, SC-2027). Eluted DNA from the protein-DNA complex
was used for PCR analyses, and CtBP1 binding to the Rac1 promoter
was compared in wild-type and Smad7 transgenic keratinocytes by
ChIP band intensities on gel images or by quantitative PCR using
Power SYBR Green Master Mix (Applied Biosystems). Primers used to
amplify the Rac1 SBE-1.5 Kb promoter regions:
TABLE-US-00006 (SEQ ID NO: 13) 5'-TGGAATTCCTGGTCTGGTTT-3' (sense)
(SEQ ID NO: 14) 5'-GCCAAGCTGCTCTTCCAGTA-3' (antisense) (SEQ ID NO:
15) 5'-TCTCAGGGGGCCAAAGGTGTT-3' (sense) (SEQ ID NO: 16)
5'-TCCCAGCACCTGAATCACATGG-3' (antisense)
[0295] Rac1 Promoter Luciferase Reporter Construct, Site-Directed
Mutagenesis and Luciferase Assay.
[0296] The 883 bp fragment of -1671 bp to -789 bp of the Rac1
promoter, encompassing the SBE-1.5 Kb site, was amplified from
wild-type mouse DNA using 5' XhoI and 3' HindIII tagged primers,
and this Rac1 promoter fragment was cloned into pGL4.26 vector
(Promega) to make the Rac1 promoter-pGL4.26 luciferase reporter
(Rac1-Luc) construct. For site-directed mutagenesis, the SBE
sequence 5'-TGTCTGTGCT-3' (SEQ ID NO: 17) was mutated to
5'-TGATAGAGCT-3' (SEQ ID NO: 18). Rac1-Luc and pGL4.74 (1:20) were
co-transfected with Smad7 siRNA, CtBP1 siRNA or scrambled siRNA
using Lipofectamine 2000 (Invitrogen) to primary mouse
keratinocytes, or primary mouse keratinocytes with Tat-Smad7
treatment (1 .mu.g mL.sup.-1). Cell lysates were collected and
luciferase assays were performed 48 hours after transfection or
Tat-Smad7 treatment, using the DUAL-LUCIFERASE.RTM. Reporter Assay
kit (Promega) following manufacturer's instructions.
Rac1-luciferase activity was measured with the Glomax machine
(Promega) and expressed by the ratio of firefly activity to Renilla
activity. Primers used for amplification of Rac1 promoter sequence
were:
TABLE-US-00007 (SEQ ID NO: 19) 5'-ATCCTCGAG-TATCCTCCAGGTCTGGG-3'
(SEQ ID NO: 20) 5'-GCCAAGCTT-AGCGTCCAGCGTTAACCTG-3'
[0297] Statistical Analysis.
[0298] Statistical differences in molecular analyses and oral
mucositis ulcer size were analyzed using the Student's t-test and
all data was presented by mean+s.d. except ulcer size, which was
presented by mean+s.e.m. Oral mucositis incidences were analyzed by
Fisher's exact test.
Example 6: Codon Optimization for Smad7 Protein Production in E.
coli or Yeast
[0299] Although many mammalian proteins can be produced in bacteria
without nucleotide sequence modification, the analysis indicated
that the Smad7 nucleotide sequence would need to be modified to
allow protein expression in bacteria.
[0300] Analysis of Smad7 cDNA mammalian codon use revealed nine
arginine amino acids coded for by the following nucleotides: 7-9,
43-45, 169-171, 403-405, 490-492, 526-528, 526-528, 823-825,
1057-1059 are a rare codon (AGG, codon utilization 1.7%). Since
these codons are rare codons in bacteria, it is expected that they
could halt or reduce protein translation and/or production in
bacteria. The amino acids coded for by rare arginine codons are
indicated by bold capitals below in the illustrated human Smad7
protein, including arginines at positions 3, 15, 57, 135, 164, 169,
176, 275, and 353. Additionally, the following arginine codons also
have low frequency usages. CGA (3.5% codon utilization):
nucleotides 16-18, 136-138, 199-201, 598-600, which code for
arginine at positions 6, 46, 67, 200; CGG (5.4% codon utilization):
nucleotides 31-33, 112-114, 316-318, 772-774, 940-942, 973-975,
1135-1137, 1276-1278, which code for arginine at positions 11, 38,
106, 258, 314, 325, 379, 426; AGA (2.8% codon utilization):
nucleotides 637-639, 814-816, which code for arginine at positions
213, 272. These arginine residues are highlighted in bold upper
case R below and they are changed to CGC in at least one of the
codon-optimized nucleic acid sequences (20.6% codon
utilization):
TABLE-US-00008 (SEQ ID NO: 12) 1 mfRtkRsalv RrlwRsrapg gedeeegagg
gggggelRge gatdsRahga 51 ggggpgRagc clgkavrgak ghhhphppaa
gagaaggaea dlkalthsvl 101 kklkeRqlel llqavesrgg trtaclllpg
rldcRlgpga pagaqpaqpp 151 ssyslplllc kvfRwpdlRh ssevkRlccc
esygkinpel vccnphhlsR 201 lcelespppp ysRypmdflk ptadcpdavp
ssaetggtny lapgglsdsq 251 lllepgdRsh wcvvayweek tRvgRlycvq
epsldifydl pqgngfclgq 301 lnsdnksqlv qkvRskigcg iqltRevdgv
wvynrssypi fiksatldnp 351 dsRtllvhkv fpgfsikafd yekayslqRp
ndhefmqqpw tgftvqisfy 401 kgwgqcytrq fisscpcwle vifnsR
[0301] Based on this analysis, it was decided to optimize the Smad7
nucleotide sequence to codons that were believed to allow increased
Tat-Smad7 protein production in E. coli or yeast. Provided below is
the optimized nucleic acid codon sequence made by Genscript.
Briefly, the sequence has the following composition: nucleotides
1-6 include the restriction recognition site for BamHI; nucleotides
7-36 include the Tat sequence; nucleotides 37-1314 include
codon-optimized human Smad7 cDNA; nucleotides 1342-1383 include the
V5 epitope; nucleotides 1384-1386 are the stop codon; and
nucleotides 1387-1392 including the restriction recognition site
for SalI. In this sequence, ATG is removed to be used with GST. The
entire designed sequence was converted to E. coli codons based on
"Codon-Usage Database." The initial optimized Smad7 sequence (SEQ
ID NO: 23) is shown below:
TABLE-US-00009 (SEQ ID NO: 23) 1 ggatccggcc gtaaaaaacg ccgtcaacgc
cgccgtggtt tccgtacgaa acgctcggcc 61 ctggtccgtc gcctgtggcg
ctcccgtgct ccgggtggtg aagatgaaga agaaggtgct 121 ggcggcggtg
gcggtggcgg tgaactgcgt ggcgagggtg caaccgatag tcgtgcacac 181
ggtgcaggcg gtggcggtcc gggtcgtgct ggttgctgtc tgggtaaagc tgtgcgcggc
241 gcgaaaggtc atcaccatcc gcacccgccg gcagcaggtg caggtgcagc
tggcggtgcg 301 gaagccgatc tgaaagccct gacccatagt gtcctgaaaa
aactgaaaga acgtcagctg 361 gagctgctgc tgcaagcagt agaatcccgt
ggcggtaccc gtacggcttg tctgctgctg 421 ccgggtcgtc tggattgccg
tctgggtccg ggtgcaccgg ctggtgcgca gccggcacaa 481 ccgccgagct
cttacagcct gccgctgctg ctgtgtaaag tgtttcgttg gccggacctg 541
cgccacagtt ccgaagttaa acgcctgtgc tgttgcgaga gctatggcaa aattaacccg
601 gaactggttt gttgcaatcc gcaccatctg tctcgtctgt gtgaactgga
gagcccgccg 661 ccgccgtatt ctcgttaccc gatggatttc ctgaaaccga
ctgcagattg cccggacgca 721 gtcccgtcat cggctgagac cggcggcacc
aactatctgg caccgggcgg tctgagtgat 781 tcccagctgc tgctggaacc
gggcgaccgt tcacattggt gtgtggttgc ctattgggaa 841 gagaaaacgc
gtgtcggtcg cctgtactgc gtacaggaac cgtcgctgga tatcttttat 901
gacctgccgc agggcaatgg tttctgtctg ggccaactga actcagataa taaatcgcag
961 ctggtgcaaa aagttcgctc aaaaattggc tgcggtatcc agctgacccg
tgaagttgac 1021 ggtgtctggg tatataaccg cagctcttac ccgattttta
tcaaaagtgc caccctggat 1081 aatccggact cccgtacgct gctggtccac
aaagtatttc cgggcttctc aatcaaagcg 1141 ttcgattacg agaaagccta
ctcgctgcag cgcccgaacg accatgaatt catgcagcaa 1201 ccgtggacgg
gttttactgt gcagatctct ttcgttaaag gctggggtca atgctacacc 1261
cgtcagttta tctcgtcctg tccgtgctgg ctggaagtga ttttcaatag ccgcaagggc
1321 gagctcaatt cgaagcttga aggtaagcct atccctaacc ctctcctcgg
tctcgattct 1381 acgtgagtcg ac
[0302] A nucleotide sequence comparison between Tat-Smad7-V5 (SEQ
ID NO: 23) and human Smad7 (SEQ ID NO: 22) cDNA is provided below.
Human Smad7 and codon-optimized Tat-Smad7-V5 share 68% codon
homology. Human Smad7 and codon-optimized Tat-Smad7 share 71% codon
homology. Human Smad7 and codon-optimized Smad7 share 73% codon
homology.
TABLE-US-00010 Alignment: Global DNA alignment against reference
molecule Parameters: Scoring matrix: Linear (Mismatch 2, OpenGap 4,
ExtGap 1) Reference molecule: human Smad7 mRNA, Region 1-1281
Number of sequences to align: 2 Settings: Similarity significance
value cutoff: >= 90% Summary of Percent Matches: Reference:
human Smad7 mRNA 1-1281 (1281 bps) -- Sequence 2: Tat-Smad7-V5
1-1392 (1392 bps) 68% human Smad7 1
--atg----------------------------------ttcaggaccaaacgatctgcg
Tat-Smad7-V5 1
ggatccggccgtaaaaaacgccgtcaacgccgccgtggtttccgtacgaaacgctcggcc human
Smad7 25
ctcgtccggcgtctctggaggagccgtgcgcccggcggcgaagacgaggaagagggcgca
Tat-Smad7-V5 61
ctggtccgtcgcctgtggcgctcccgtgctccgggtggtgaagatgaagaaaaaggtgct human
Smad7 85
gggggaggtggaggaggaggcgagctgcggggagaaggggcgacggacagccgagcgcat
Tat-Smad7-V5 121
ggcggcggtggcggtggcggtgaactgcgtggcgagggtgcaaccgatagtcgtgcacac human
Smad7 145
ggggccggtggcggcggcccgggcagggctggatgctgcctgggcaaggcaatgcgaggt
Tat-Smad7-V5 181
ggtgcaggcggtggcggtccgggtcgtgctggttgctgtctgggtaaagctgtgcgcggc human
Smad7 205
gccaaaggtcaccaccatccccacccgccagccgcgggcgccggcgcggccgggggcgcc
Tat-Smad7-V5 241
gcgaaaggtcatcaccatccgcacccgccggcagcaggtgcaggtgcagctggcggtgcg human
Smad7 265
gaggcggatctgaaggcgctcacgcactcggtgctcaagaaactgaaggagcggcagcta
Tat-Smad7-V5 301
gaagccgatctgaaagccctgacccatagtgtcctgaaaaaactgaaagaacgtcagctg human
Smad7 325
gagctgctgctccaggccgtggagtcccgcggcgggacgcgcaccgcgtgcctcctgctg
Tat-Smad7-V5 361
gagctgctgctgcaagcagtagaatcccgtggcggtacccgtacggcttgtctgctgctg human
Smad7 385
cccggccgcctggactgcaggctaggcccgggggcgcccgccggcgcgcagcctgcgcag
Tat-Smad7-V5 421
ccgggtcgtctggattgccgtctgggtccgggtgcaccggctggtgcgcagccggcacaa human
Smad7 445
ccgccctcgtcctactcgctccccctcctgctgtgcaaagtgttcaggtggccggatctc
Tat-Smad7-V5 481
ccgccgagctcttacagcctgccgctgctgctgtgtaaagtgtttcgttggccggaccta human
Smad7 505
aggcattcctcggaagtcaagaggctgtgttgctgtgaatcttacgggaagatcaacccc
Tat-Smad7-V5 541
cgccacagttccgaagttaaacgcctgtgctgttgcgagagctatggcaaaattaacccg human
Smad7 565
gagctggtgtgctgcaacccccatcaccttagccgactctgcgaactagagtctcccccc
Tat-Smad7-V5 601
gaactggtttgttgcaatccgcaccatctgtctcgtctgtgtgaactggagagcccgcca human
Smad7 625
cctccttactccagatacccgatggattttctcaaaccaactgcagactgtccagatgct
Tat-Smad7-V5 661
ccgccgtattctcgttacccgatggatttcctgaaaccgactgcagattgcccggacgca human
Smad7 685
gtgccttcctccgctgaaacagggggaacgaattatctggcccctgggggactttcagat
Tat-Smad7-V5 721
gtcccgtcatcggctgagaccggcggcaccaactatctggcaccgggcggtctgagtgat human
Smad7 745
tcccaacttcttctggagcctggggatcggtcacactggtgcgtggtggcatactgggag
Tat-Smad7-V5 781
tcccagctgctgctggaaccgggcgaccgttcacattggtgtgtggttgcctattgggaa human
Smad7 805
gagaagacgagagtggggaggctctactgtgtccaggagccctctctggatatcttctat
Tat-Smad7-V5 841
gagaaaacgcgtgtcggtcgcctgtactgcgtacaggaaccgtcgctggatatcttttat human
Smad7 865
gatctacctcaggggaatggcttttgcctcggacagctcaattcggacaacaagagtcag
Tat-Smad7-V5 901
gacctgccgcagggcaatggtttctgtctgggccaactgaactcagataataaatcgcag human
Smad7 925
ctggtgcagaaggtgcggagcaaaatcggctgcggcatccagctgacgcgggaggtggat
Tat-Smad7-V5 961
ctggtgcaaaaagttcgctcaaaaattggctgcggtatccagctgacccgtgaagttgac human
Smad7 985
ggtgtgtgggtgtacaaccgcagcagttaccccatcttcatcaagtccgccacactggac
Tat-Smad7-V5 1021
ggtgtctgggtatataaccgcagctattacccgatttttatcaaaagtgccaccctggat human
Smad7 1045
aacccggactccaggacgctgttggtacacaaggtgttccccggtttctccatcaaggct
Tat-Smad7-V5 1081
aatccggactcccgtacgctgctggtccacaaagtatttccgggcttctcaatcaaagcg human
Smad7 1105
ttcgactacgagaaggcgtacagcctgcagcggcccaatgaccacgagtttatgcagcag
Tat-Smad7-V5 1141
ttcgattacgagaaagcctactcgctgcagcgcccgaacgaccatgaattcatgcagcaa human
Smad7 1165
ccgtggacgggctttaccgtgcagatcagctttgtgaagggctggggtcagtgctacacc
Tat-Smad7-V5 1201
ccgtggacgggttttactgtgcagatctctttcgttaaaggctggggtcaatgctacacc human
Smad7 1225
cgccagttcatcagcagctgcccgtgctggctagaggtcatcttcaacagccggtag---
Tat-Smad7-V5 1261
cgtcagtttatctcgtcctgtccgtgctggctagaagtgattttcaatagccgcaagaac human
Smad7 1282
------------------------------------------------------------
Tat-Smad7-V5 1321
gagctcaattcgaagcttgaaggtaagcctatccctaaccctctcctcggtctcgattct human
Smad7 1282 ------------ Tat-Smad7-V5 1381 acgtgagtcgac
[0303] In this optimization, Met216, which may form an alternative
open reading frame, was not altered as it was desired to preserve
the amino acid sequence of Smad7, if possible. In future codon
optimizations, Met216 will be mutated to Leu216 to improve protein
production without impacting function in vitro and in vivo.
Example 7: Production of Truncated Smad7 Proteins
[0304] It is believed that Smad7 has several activities in vivo
including, but not limited to, one or more of enhancing cell
proliferation, enhancing cell migration, reducing DNA damage,
reducing cell apoptosis, and decreasing inflammation. Smad7's
effects on these processes are due to one or more of blocking
TGF-.beta. signaling, blocking NF-.kappa.B signaling, blocking
CtBP1 activity, and/or increasing Rac1 expression and/or activity.
It is believed that a smaller functional domain of Ptd-Smad7 may be
sufficient to deliver a therapeutic effect (see, e.g., FIG. 15). In
addition, the resulting shorter protein sequence is expected to
enhance protein production. Additionally, it is believed that
different truncated Tat-Smad7 proteins that contain partial Smad7
sequences may be useful for different treatments.
[0305] For example, it is believed that the C-terminal MH2 domain
of Smad7 (about half length of Smad7 protein, e.g., 208-426aa) may
primarily mediate the anti-inflammatory effect of Smad7 (Hong et
al., Nat Immunology, 8, 504-513, 2007). Smad7 peptides having this
anti-inflammatory function may be sufficient and optionally an
improvement for treating chronic inflammation associated
conditions, such as but not limited to, oral mucositis, stomatitis
and psoriasis, among others.
[0306] The N-terminal MH1 domain plus the linker region of Smad7
(about half of the protein, e.g., 2-208aa) is known to activate
MAPK and binds to Smurf, a ubiquitin E3 ligase to degrade
TGF-.beta. receptor (Aragon, et al., Structure 20:1726-1736
(2012)). It is believed that it may primarily mediate cell
migration and/or blocking TGF-.beta.-induced growth arrest and/or
fibrotic response. Smad7 peptides having this cell migration and
proliferation function may be sufficient, and optionally an
improvement, for enhancing healing that is not associated with
excessive inflammation. Types of wounds that might benefit from
this form of treatment include, but are not limited to, surgical
wounds, fibrotic scarring, and diabetes wounds, defective healing
and/or scarring among others.
[0307] Truncated Smad7 N-terminal and C-terminal PTD-fusion
proteins were designed. One example of a Tat-Smad7-C-terminal
codon-optimized nucleotide and protein sequence is provided below.
In the nucleic acid sequence, nucleotides 1-6 include the
restriction recognition site for BamHI; nucleotides 7-36 include
the Tat PTD sequence; nucleotides 37-810 include codon-optimized
for the C terminal amino acids 258 to 426 of human Smad7;
nucleotides 568-609 include the V5 epitope sequence; nucleotides
610-612 include the stop sequence; and nucleotides 613-618 include
the restriction recognition site for SalI:
TABLE-US-00011
ggatccggccgtaaaaaacgccgtcaacgccgccgttcacattggtgtgtggttgcctat G S G
R K K R R Q R R R S H W C V V A Y
tgggaagagaaaacgcgtgtcggtcgcctgtactgcgtacaggaaccgtcgctggatatc W E E
K T R V G R L Y C V Q E P S L D I
ttttatgacctgccgcagggcaatggtttctgtctgggccaactgaactcagataataaa F Y D
L P Q G N G F C L G Q L N S D N K
tcgcagctggtgcaaaaagttcgctcaaaaattggctgcggtatccagctgacccgtgaa S Q L
V Q K V R S K I G C G I Q L T R E
gttgacggtgtctgggtatataaccgcagctcttacccgatttttatcaaaagtgccacc V D G
V W V Y N R S S Y P I F I K S A T
ctggataatccggactcccgtacgctgctggtccacaaagtatttccgggcttctcaatc L D N
P D S R T L L V H K V F P G F S I
aaagcgttcgattacgagaaagcctactcgctgcagcgcccgaacgaccatgaattcatg K A F
D Y E K A Y S L Q R P N D H E F M
cagcaaccgtggacgggttttactgtgcagatctctttcgttaaaggctggggtcaatgc Q Q P
W T G F T V Q I S F V K G W G Q C
tacacccgtcagtttatctcgtcctgtccgtgctggctggaagtgattttcaatagccgc Y T R
Q F I S S C P C W L E V I F N S R
aagggcgagctcaattcgaagcttgaaggtaagcctatccctaaccctctcctcggtctc K G E
L N S K L E G K P I P N P L L G L gattctacgtgagtcgac (SEQ ID NO:
24) D S T - (SEQ ID NO: 25)
[0308] In the nucleic acid sequence, nucleotides 1-6 include the
restriction recognition site for BamHI; nucleotides 7-36 include
the Tat PTD sequence; nucleotides 37-810 include codon-optimized
for the N terminal amino acids 1-258 of human Smad7; nucleotides
811-852 include the V5 epitope sequence (corresponding amino acid
sequence in bold); nucleotides 853-855 include the stop sequence;
and nucleotides 856-861 include the restriction recognition site
for SalI. ATG is removed to allow for fusion with GST:
TABLE-US-00012
ggatccggccgtaaaaaacgccgtcaacgccgccgtggtttccgtacgaaacgctcggcc G S G
R K K R R Q R R R G F R T K R S A
ctggtccgtcgcctgtggcgctcccgtgctccgggtggtgaagatgaagaagaaggtgct L V R
R L W R S R A P G G E D E E E G A
ggcggcggtggcggtggcggtgaactgcgtggcgagggtgcaaccgatagtcgtgcacac G G G
G G G G E L R G E G A T D S R A H
ggtgcaggcggtggcggtccgggtcgtgctggttgctgtctgggtaaagctgtgcgcggc G A G
G G G P G R A G C C L G K A V R G
gcgaaaggtcatcaccatccgcacccgccggcagcaggtgcaggtgcagctggcggtgcg A K G
H H H P H P P A A G A G A A G G A
gaagccgatctgaaagccctgacccatagtgtcctgaaaaaactgaaagaacgtcagctg E A D
L K A L T H S V L K K L K E R Q L
gagctgctgctgcaagcagtagaatcccgtggcggtacccgtacggcttgtctgctgctg E L L
L Q A V E S R G G T R T A C L L L
ccgggtcgtctggattgccgtctgggtccgggtgcaccggctggtgcgcagccggcacaa P G R
L D C R L G P G A P A G A Q P A Q
ccgccgagctcttacagcctgccgctgctgctgtgtaaagtgtttcgttggccggacctg P P S
S Y S L P L L L C K V F R W P D L
cgccacagttccgaagttaaacgcctgtgctgttgcgagagctatggcaaaattaacccg R H S
S E V K R L C C C E S Y G K I N P
gaactggtttgttgcaatccgcaccatctgtctcgtctgtgtgaactggagagcccgccg E L V
C C N P H H L S R L C E L E S P P
ccgccgtattctcgttacccgatggatttcctgaaaccgactgcagattgcccggacgca P P Y
S R Y P M D F L K P T A D C P D A
gtcccgtcatcggctgagaccggcggcaccaactatctggcaccgggcggtctgagtgat V P S
S A E T G G T N Y L A P G G L S D
tcccagctgctgctggaaccgggcgaccgtggtaagcctatccctaaccctctcctcggt S Q L
L L E P G D R G K P I P N P L L G ctcgattctacgtgagtcgac (SEQ ID NO:
26) L D S T - (SEQ ID NO: 27)
Example 8: Testing of Truncated Smad7 Proteins
[0309] Activity of truncated Smad7 proteins is tested using the in
vitro and in vivo assays used to test full-length Smad7 described
above, among other assays. Such assays include, but are not limited
to, the ability to block phosphorylation of Smad2 and/or nuclear
translocation of the NF-.kappa.B p50 subunit, increase cell
proliferation, reduce apoptosis and/or radiation-induced DNA
damage, reduce inflammation and/or angiogenesis, promote healing in
oral mucositis, surgical wounds, diabetes wounds, and/or wounds
associated with chronic inflammation in mice.
[0310] In a wound healing assay, 6-mm punch biopsies were performed
in wild-type mice followed by daily topical application of
C-terminal or N-terminal Tat-Smad7. By measuring gross wound
closure, both truncated Smad7 proteins described above (e.g.,
Tat-Smad7 C-terminal and N-terminal protein) were found to have
promoted wound healing similar to full length Tat-Smad7.
Example 9: Truncated Smad7 Proteins Accelerate Wound Healing
[0311] The ability of truncated Smad7 proteins to accelerate wound
healing was examined in wild-type mice. FIG. 16A demonstrates the
effect of C-terminally truncated (259-426aa) Tat-C-Smad7 on a mouse
wound healing model. Wild-type C57BL/6 mice were anaesthetized,
dorsally wounded by a 6-mm dermal punch biopsy, and treated every
other day with the topical application of PBS (control, 3 mice, 4
wounds/mouse), full-length Tat-Smad7 (0.4 .mu.g/10 .mu.L PBS/wound,
3 mice, 4 wounds/mouse) or Tat-C-Smad7 (0.4 .mu.g/10 .mu.L/wound
PBS, 3 mice, 4 wounds/mouse, total of 12 wounds were treated).
Wound area was photographed with Canon digital camera and measured
by imaging analysis using the Image J software with a 6-mm circle
inside of the photo for normalization at 1, 2, 4 and 5 days after
wounding, and the average percent wound area remaining was
calculated for each treatment group. The ability of Tat-C-Smad7 to
accelerate wound healing was similar to that of full-length Smad7
(FIG. 16A).
[0312] FIG. 16B illustrates the effect of N-terminally truncated
(1-258aa) Tat-N-Smad7 on wound healing. Wild-type C57BL/6 mice were
anaesthetized, dorsally wounded by a 6-mm dermal punch biopsy, and
treated every other day with the topical application of PBS
(control, 6 mice, 4 wounds/mouse), full-length Tat-Smad7 (0.4
.mu.g/10 .mu.L PBS, 6 mice, 4 wounds/mouse), or Tat-N-Smad7 (0.4
.mu.g/10 .mu.L PBS, 6 mice, 4 wounds/mouse). Wound area was
measured by imaging analysis using the Image J software with a 6-mm
circle inside of the photo for normalization at 1, 2, and 4 days
after wounding, and the average percent wound area remaining was
calculated for each treatment group. There was a significant
difference in the rate of wound healing between the control and
Tat-Smad7-treated mice after two days, with the latter healing
faster (FIG. 16B, p<0.05). Although Tat-N-Smad7 was tested as a
negative control, it unexpectedly promoted significantly
accelerated wound healing relative to control mice after just one
day, and significantly accelerated wound healing relative to
full-length Tat-Smad7-treated mice after two days (FIG. 16B,
p<0.05). Tat-Cre had no effect on wound closure relative to
control treatment (data not shown).
[0313] These results demonstrate that both full-length Tat-Smad7
and truncated Tat-Smad7 proteins (Tat-C-Smad7 and Tat-N-Smad7)
promote wound healing. Moreover, these results demonstrate that
certain truncated Tat-Smad7 proteins are more effective in
accelerating wound healing than full-length Tat-Smad7.
[0314] Accordingly, compositions comprising truncated Tat-Smad7
proteins are useful in treating wounds and in accelerating wound
healing.
Example 10: Smad7 Accelerates Wound Healing in an Impaired
Wound-Healing Model
[0315] Diabetic (db/db) mice were dorsally wounded by a 6-mm dermal
punch biopsy as described above in Example 9, and treated every
other day with the topical application of PBS (control, 6 mice, 4
wounds/mouse), full-length Tat-Smad7 (0.4 .mu.g/10 .mu.L PBS, 6
mice, 4 wounds/mouse), or REGRANEX.RTM. cream (smear with cotton
swab, 6 mice 4 wounds/mouse) to the wounds before day 8 when the
wounds were not completely covered by scabs. After day 10, the
treatments were topically applied to the gap between the scab and
wound periphery to avoid the barrier of the hard scab and the
unwounded stratum corneum. REGRANEX.RTM., which contains a
recombinant human platelet-derived growth factor (PDGF), is
approved for topical administration to diabetic ulcers.
[0316] Wound closure was visually assessed at 1, 2, 4, 6, 8, 9, 10,
11, 12, and 13 days after wounding (FIG. 17A). At each time point,
the wounds were photographed with Canon digital camera. After 8
days, there was a marked visual improvement in the wound closure in
Tat-Smad7-treated mice compared to controls (FIG. 17A).
[0317] Wound size was measured by using the Image J software with a
6-mm circle inside of the photo for normalization at 1, 2, 3, 4, 7,
9, and 11 days after wounding, and the average percent wound area
remaining was calculated for each treatment group. The ability of
Tat-Smad7 to accelerate wound healing was similar to that of
REGRANEX.RTM. (FIG. 17B). On day 7, image analysis results
indicated that wound closure was significantly accelerated in
Tat-Smad7-treated mice relative to controls (FIG. 17B, p<0.05).
These results were similar to those achieved with
REGRANEX.RTM..
[0318] Formalin fixed paraffin dorsal wound sections (1 mm) from
day 8 wound samples were stained using hematoxylin and eosin
(H&E). Histological comparison of day 8 wound samples revealed
complete re-epithelialization and an accelerated wound closure in
Smad7-treated db/db mice relative to controls (FIG. 17C).
[0319] Collectively, these results demonstrate that Tat-Smad7 is
useful for accelerating wound closure in poorly healing diabetic
wounds, and provides an alternative to treatment with
REGRANEX.RTM..
Example 11: Additional Codon Optimization for Smad7 Protein
Production
[0320] Smad7 nucleic acid molecules are designed with additional
nucleotide changes selected to increase protein production. For
example, the utilization of codons encoding the amino acids Ser and
His will be manipulated. In codon-optimized human Smad7 in examples
above, the Ser codon (TCC or TCG) has an amino acid frequency of
approximately 9% of codon utilization. It is believed that changing
the codon of Ser to AGC will increase Smad7 protein production, at
least partly because it can optionally increase codon usage to 15%.
There are 33 Ser amino acids in Smad7 protein (nucleotides at
positions 19-21, 46-48, 133-135, 292-294, 349-351, 451-453,
454-456, 460-462, 511-513, 514-516, 544-546, 595-597, 616-618,
634-636, 691-693, 694-696, 739-741, 745-747, 775-777, 847-849,
907-909, 919-921, 943-945, 1006-1008, 1009-1101, 1030-1032,
1054-1056, 1093-1095, 1126-1128, 1192-1194, 1237-1239, 1240-1242,
1273-1275; with a corresponding serine amino acid position of 7,
16, 45, 98, 117, 151, 152, 154, 171, 172, 182, 199, 206, 212, 231,
232, 247, 249, 259, 283, 303, 307, 315, 336, 337, 344, 352, 365,
376, 398, 413, 414, 425). Of these, 23 (nucleotides 19-21, 292-294,
349-351, 451-453, 454-456, 460-462, 511-513, 514-516, 544-546,
616-618, 634-636, 691-693, 694-696, 739-741, 745-747, 775-777,
847-849, 907-909, 919-921, 1009-1101, 1030-1032, 1054-1056,
1093-1095; corresponding serine amino acid position of 7, 98, 117,
151, 152, 154, 171, 172, 182, 206, 212, 231, 232, 247, 249, 259,
283, 303, 307, 337, 344, 352, 365) can be changed without
introducing potential alternative open reading frames.
[0321] Similarly, in codon-optimized human Smad7 in the examples
above, the His codon (CAC) has 9.6% of codon usage. It is believed
that changing the His codon to CAT (optionally to 12.6% usage) will
increase Smad7 protein production. There are 12 His (nucleotides
142-144, 214-216, 217-219, 220-222, 226-228, 289-291, 589-591,
778-780, 1072-1074, 1147-1149; corresponding to histidine amino
acids at position 48, 72, 73, 74, 76, 97, 170, 196, 197, 260, 358,
383) in Smad7 protein. Of these, 4 (nucleotides 217-219. 220-222,
589-591, 778-780, histidine residues 73, 76, 197, 260) can be
changed without introducing potential alternative open reading
frames.
[0322] In addition, wild-type human Smad7 includes a Met amino acid
as amino acid 216 (Met216). This may be perceived as an alternative
open reading frame by bacterial machinery, for example, and
decrease protein production. It is believed that changing Met216 to
Leu216 (ATG to CTG), the amino acid that has the biochemical
property the closest to Met and thus not expected to change 3D
structure of the protein, will increase protein production.
[0323] The comparison between original codon-optimized Tat-Smad7-V5
and further changes is provided below. Top strand: Tat-Smad7-V5
(SEQ ID NO: 23); bottom strand: after optimized Ser, His and M216
.mu.L mutation (SEQ ID NO: 30).
TABLE-US-00013 Alignment: Local DNA homologies. Parameters: Both
strands. Method: FastScan - Max Score Mol 1
20120323113102B12-854366-164160-1-PGEX-5.seq (1-1463) Mol 2
TatSmad7 Ser-His optim Number of sequences to align: 2 Settings:
Similarity significance value cutoff: >=60% Homology Block:
Percent Matches 94 Score 1227 Length 1392 201203231131 30
ggatccggccgtaaaaaacgccgtcaacgccgccgtggtttccgtacgaaacgctcggcc
TatSmad7 Ser 1
......................................................agc...
201203231131 90
ctggtccgtcgcctgtggcgctcccgtgctccgggtggtgaagatgaagaagaaggtgct
TatSmad7 Ser 61
.....................ag.....................................
201203231131 150
ggcggcggtggcggtggcggtgaactgcgtggcgagggtgcaaccgatagtcgtgcacac
TatSmad7 Ser 121
..................................................c........t
201203231131 210
ggtgcaggcggtggcggtccgggtcgtgctggttgctgtctgggtaaagctgtgcgcggc
TatSmad7 Ser 181
............................................................
201203231131 270
gcgaaaggtcatcaccatccgcacccgccggcagcaggtgcaggtgcagctggcggtgcg
TatSmad7 Ser 241
..............t........t....................................
201203231131 330
gaagccgatctgaaagccctgacccatagtgtcctgaaaaaaccgaaagaacgtcagctg
TatSmad7 Ser 301
.............................c..............................
201203231131 390
gagctgctgctgcaagcagtagaatcccgtggcggtacccgtacggcttgtctgctgctg
TatSmad7 Ser 361
........................ag..................................
201203231131 450
ccgggtcgtctggattgccgtctgggtccgggtgcaccggctggtgcgcagccggcacaa
TatSmad7 Ser 421
............................................................
201203231131 510
ccgccgagctcttacagcctgccgctgctgctgtgtaaagtgtttcgttggccggacctg
TatSmad7 Ser 481
.........agc................................................
201203231131 570
cgccacagttccgaagttaaacgcctgtgctgttgcgagagctatggcaaaattaacccg
TatSmad7 Ser 541
.....t..cag.................................................
201203231131 630
gaactggtttgttgcaatccgcaccatctgtctcgtctgtgtgaactggagagcccgccg
TatSmad7 Ser 601
.......................t......agc...........................
201203231131 690
ccgccgtattctcgttacccgatggatttcctgaaaccgactgcagattgcccggacgca
TatSmad7 Ser 661
.........agc.........c......................................
201203231131 750
gtcccgtcatcggctgagaccggcggcaccaactatctggcaccgggcggtctgagtgat
TatSmad7 Ser 721
......agcagc............................................c...
201203231131 810
tcccagctgctgctggaaccgggcgaccgttcacattggtgtgtggttgcctattgggaa
TatSmad7 Ser 781
ag............................agc...........................
201203231131 870
gagaaaacgcgtgtcggtcgcctgtactgcgtacaggaaccgtcgctggatatcttttat
TatSmad7 Ser 841
..........................................agc...............
201203231131 930
gacctgccgcagggcaatggtttctgtctgggccaactgaactcagataataaatcgcag
TatSmad7 Ser 901
..........................................agc.........agc...
201203231131 990
ctggtgcaaaaagttcgctcaaaaattggctgcggtatccagctgacccgtgaagttgac
TatSmad7 Ser 961
..................agc.......................................
201203231131 1050
ggtgtctgggtatataaccgcagctcttacccgatttttatcaaaagtgccaccctggat
TatSmad7 Ser 1021
........................agc....................c............
201203231131 1110
aatccggactcccgtacgctgctggtccacaaagtatttccgggcttctcaatcaaagcg
TatSmad7 Ser 1081
.........ag..................t..................agc.........
201203231131 1170
ttcgattacgagaaagcctactcgctgcagcgcccgaacgaccatgaattcatgcagcaa
TatSmad7 Ser 1141
.....................agc....................................
201203231131 1230
ccgtggacgggttttactgtgcagatctctttcattaaaggctggggtcaatgctacacc
TatSmad7 Ser 1201
...........................agc..............................
201203231131 1290
cgtcagtttatctcgtcctgtccgtgctggctggaagtgattttcaatagccgcaagggc
TatSmad7 Ser 1261
............agcag...........................................
201203231131 1350
gagctcaattcgaagcttgaaggtaagcctatccctaaccctctcctcggtctcgattct
TatSmad7 Ser 1321
.........agc.............................................agc
201203231131 1410 acgtgagtcgac TatSmad7 Ser 1381 ............
[0324] Nucleic acid sequences and their corresponding amino acid
sequences that would include all these changes are provided below.
The amino acid sequence includes the V5 epitope indicated in bold,
and the pET101-Topo backbone indicated by italics and underlining.
The Tat-Smad7.sup.M216L fully-optimized full length nucleotide and
protein sequence is shown below:
TABLE-US-00014
ggatccggccgtaaaaaacgccgtcaacgccgccgtggtttccgtacgaaacgcagcgcc G S G
R K K R R Q R R R G F R T K R S A
ctggtccgtcgcctgtggcgcagccgtgctccgggtggtgaagatgaagaagaaggtgct L V R
R L W R S R A P G G E D E E E G A
ggcggcggtggcggtggcggtgaactgcgtggcgagggtgcaaccgatagccgtgcacat G G G
G G G G E L R G E G A T D S R A H
ggtgcaggcggtggcggtccgggtcgtgctggttgctgtctgggtaaagctgtgcgcggc G A G
G G G P G R A G C C L G K A V R G
gcgaaaggtcatcatcatccgcatccgccggcagcaggtgcaggtgcagctggcggtgcg A K G
H H H P H P P A A G A G A A G G A
gaagccgatctgaaagccctgacccatagcgtcctgaaaaaactgaaagaacgtcagctg E A D
L K A L T H S V L K K L K E R Q L
gagctgctgctgcaagcagtagaaagccgtggcggtacccgtacggcttgtctgctgctg E L L
L Q A V E S R G G T R T A C L L L
ccgggtcgtctggattgccgtctgggtccgggtgcaccggctggtgcgcagccggcacaa P G R
L D C R L G P G A P A G A Q P A Q
ccgccgagcagctacagcctgccgctgctgctgtgtaaagtgttcgttggccggacctg P P S S
Y S L P L L L C K V F R W P D L
cgccatagcagcgaagttaaacgcctgtgctgttgcgagagctatggcaaaattaacccg R H S
S E V K R L C C C E S Y G K I N P
gaactggtttgttgcaatccgcatcatctgagccgtctgtgtgaactggagagcccgccg E L V
C C N P H H L S R L C E L E S P P
ccgccgtatagccgttacccgctggatttcctgaaaccgactgcagattgcccggacgca P P Y
S R Y P L D F L K P T A D C P D A
gtcccgagcagcgctgagaccggcggcaccaactatctggcaccgggcggtctgagcgat V P S
S A E T G G T N Y L A P G G L S D
agccagctgctgctggaaccgggcgaccgtagccattggtgtgtggttgcctattgggaa S Q L
L L E P G D R S H W C V V A Y W E
gagaaaacgcgtgtcggtcgcctgtactgcgtacaggaaccgagcctggatatcttttat E K T
R V G R L Y C V Q E P S L D I F Y
gacctgccgcagggcaatggtttctgtctgggccaactgaacagcgataataaaagccag D L P
Q G N G F C L G Q L N S D N K S Q
ctggtgcaaaaagttcgcagcaaaattggctgcggtatccagctgacccgtgaagttgac L V Q
K V R S K I G C G I Q L T R E V D
ggtgtctgggtatataaccgcagcagctacccgatttttatcaaaagcgccaccctggat G V W
V Y N R S S Y P I F I K S A T L D
aatccggacagccgtacgctgctggtccataaagtatttccgggcttcagcatcaaagcg N P D
S R T L L V H K V F P G F S I K A
ttcgattacgagaaagcctacagcctgcagcgcccgaacgaccatgaattcatgcagcaa F D Y
E K A Y S L Q R P N D H E F M Q Q
ccgtggacgggttttactgtgcagatcagcttcgttaaaggctggggtcaatgctacacc P W T
G F T V Q I S F V K G W G Q C Y T
cgtcagtttatcagcagctgtccgtgctggctggaagtgattttcaatagccgcaagggc R Q F
I S S C P C W L E V I F N S R K G
gagctcaatagcaagcttgaaggtaagcctatccctaaccctctcctcggtctcgatagc E L N
S K L E G K P I P N P L L G L D S acgtgagtcgac (SEQ ID NO: 30) T -
(SEQ ID NO: 31)
[0325] The optimized nucleotide and amino acid sequences will also
be used to make a variety of N-terminal and C-terminal Tat-Smad7
fragments. Representative examples are provided below.
[0326] The Tat-N-Smad7-V5 most optimized nucleotide and amino acid
sequences are provided. The protein sequence includes the V5
epitope, which is indicated by bold capital letters.
TABLE-US-00015
ggatccggccgtaaaaaacgccgtcaacgccgccgtggtttccgtacgaaacgcagcgcc G S G
R K K R R Q R R R G F R T K R S A
ctggtccgtcgcctgtggcgcagccgtgctccgggtggtgaagatgaagaagaaggtgct L V R
R L W R S R A P G G E D E E E G A
ggcggcggtggcggtggcggtgaactgcgtggcgagggtgcaaccgatagccgtgcacat G G G
G G G G E L R G E G A T D S R A H
ggtgcaggcggtggcggtccgggtcgtgctggttgctgtctgggtaaagctgtgcgcggc G A G
G G G P G R A G C C L G K A V R G
gcgaaaggtcatcatcatccgcatccgccggcagcaggtgcaggtgcagctggcggtgcg A K G
H H H P H P P A A G A G A A G G A
gaagccgatctgaaagccctgacccatagcgtcctgaaaaaactgaaagaacgtcagctg E A D
L K A L T H S V L K K L K E R Q L
gagctgctgctgcaagcagtagaaagccgtggcggtacccgtacggcttgtctgctgctg E L L
L Q A V E S R G G T R T A C L L L
ccgggtcgtctggattgccgtctgggtccgggtgcaccggctggtgcgcagccggcacaa P G R
L D C R L G P G A P A G A Q P A Q
ccgccgagcagctacagcctgccgctgctgctgtgtaaagtgtttcgttggccggacctg P P S
S Y S L P L L L C K V F R W P D L
cgccatagcagcgaagttaaacgcctgtgctgttgcgagagctatggcaaaattaacccg R H S
S E V K R L C C C E S Y G K I N P
gaactggtttgttgcaatccgcatcatctgagccgtctgtgtgaactggagagcccgccg E L V
C C N P H H L S R L C E L E S P P
ccgccgtatagccgttacccgatggatttcctgaaaccgactgcagattgcccggacgca P P Y
S R Y P M D F L K P T A D C P D A
gtcccgagcagcgctgagaccggcggcaccaactatctggcaccgggcggtctgagcgat V P S
S A E T G G T N Y L A P G G L S D
agccagctgctgctggaaccgggcgaccgtggtaagcctatccctaaccctctcctcggt S Q L
L L E P G D R G K P I P N P L L G ctcgattctacgtgagtcgac (SEQ ID NO:
32) L D S T - (SEQ ID NO: 27)
[0327] Tat-C-Smad7-V5 most optimized nucleotide and amino acid
sequences are provided. The protein sequence includes the V5
epitope (indicated by bold capital letters), and the pET101-Topo
backbone (indicated by underlined italics).
TABLE-US-00016
ggatccggccgtaaaaaacgccgtcaacgccgccgttcacattggtgtgtggttgcctat G S G
R K K R R Q R R R S H W C V V A Y
tgggaagagaaaacgcgtgtcggtcgcctgtactgcgtacaggaaccgagcctggatatc W E E
K T R V G R L Y C V Q E P S L D I
ttttatgacctgccgcagggcaatggtttctgtctgggccaactgaacagcgataataaa F Y D
L P Q G N G F C L G Q L N S D N K
agccagctggtgcaaaaagttcgcagcaaaattggctgcggtatccagctgacccgtgaa S Q L
V Q K V R S K I G C G I Q L T R E
gttgacggtgtctgggtatataaccgcagcagctacccgatttttatcaaaagcgccacc V D G
V W V Y N R S S Y P I F I K S A T
ctggataatccggacagccgtacgctgctggtccataaagtatttccgggcttcagcatc L D N
P D S R T L L V H K V F P G F S I
aaagcgttcgattacgagaaagcctacagcctgcagcgcccgaacgaccatgaattcatg K A F
D Y E K A Y S L Q R P N D H E F M
cagcaaccgtggacgggttttactgtgcagatcagcttcgttaaaggctggggtcaatgc Q Q P
W T G F T V Q I S F V K G W G Q C
tacacccgtcagtttatcagcagctgtccgtgctggctggaagtgattttcaatagccgc Y T R
Q F I S S C P C W L E V I F N S R
aagggcgagctcaatagcaagcttgaaggtaagcctatccctaaccctctcctcggtctc K G E
L N S K L E G K P I P N P L L G L gatagcacgtgagtcgac (SEQ ID NO:
34) D S T - (SEQ ID NO: 25)
[0328] The comparisons before and after the above optimizations are
provided below. C-terminal optimization (top strand) (alignment
discloses SEQ ID NOs: 34 and 24, respectively, in order of
appearance):
TABLE-US-00017 Alignment: Local DNA homologies. Parameters: Both
strands. Method: PastScan - Max Score Mol 1 Tat-C-Smad7-ser-his
optimized (1-618) Mol 2 Tat-C-termal Smad7-VS (1-618) Number of
sequences to align: 2 Settings: Similarity significance value
cutoff: >=60% Homology Block: Percent Matches 93 Score 541
Length 618 Tat-C-Smad7- 1
ggatccggccgtaaaaaacgccgtcaacgccgccgttcacattggtgtgtggttgcctat
Tat-C-termal 1
............................................................
Tat-C-Smad7- 61
tgggaagagaaaacgcgtgtcggtcgcctgtactgcgtacaggaaccgagcctggatatc
Tat-C-termal 61
................................................tcg.........
Tat-C-Smad7- 121
ttttatgacctgccgcagggcaatggtttctgtctgggccaactgaacagcgataataaa
Tat-C-termal 121
................................................tca.........
Tat-C-Smad7- 181
agccagctggtgcaaaaagttcgcagcaaaattggctgcggtatccagctgacccgtgaa
Tat-C-termal 181
tcg.....................tca.................................
Tat-C-Smad7- 241
gttgacggtgtctgggtatataaccgcagcagctacccgatttttatcaaaagcgccacc
Tat-C-termal 241
..............................tct....................t......
Tat-C-Smad7- 301
ctggataatccggacagccgtacgctgctggtccataaagtatttccgggcttcagcatc
Tat-C-termal 301
...............tc..................c..................tca...
Tat-C-Smad7- 361
aaagcgttcgattacgagaaagcctacagcctgcagcgcccgaacgaccatgaattcatg
Tat-C-termal 361
...........................tcg..............................
Tat-C-Smad7- 421
cagcaaccgtggacgggttttactgtgcagatcagcttcgttaaaggctggggtcaatgc
Tat-C-termal 421
.................................tct........................
Tat-C-Smad7- 481
tacacccgtcagtttatcagcagctgtccgcactgactagaagtgattttcaatagccgc
Tat-C-termal 481
..................tcgtc.....................................
Tat-C-Smad7- 541
aagggcgagctcaatagcaagcttgaaggtaagcctatccctaaccctctcctcggtctc
Tat-C-termal 541
...............tcg..........................................
Tat-C-Smad7- 601 gatagcacgtgagtcgac Tat-C-termal 601
...tct............
N-terminal optimization (top strand) (alignment discloses SEQ ID
NOs: 32 and 26, respectively, in order of appearance):
TABLE-US-00018 Alignment: Local DNA homologies. Parameters: Both
strands. Method: FastScan - Max Score Mol 1 Tat-N-Smad7-V5-Ser-His
optimized (1-861) Mol 2 Tat-N-Smad7-V5 (1-861) Number of sequences
to align: 2 Settings: Similarity significance value cutoff:
>=60% Homology Block: Percent Matches 95 Score 781 Length 861
Tat-N-Smad7- 1
ggatccggccgtaaaaaacgccgtcaacgccgccgtggtttccgtacgaaacgcagcgcc
Tat-N-Smad7- 1
......................................................tcg...
Tat-N-Smad7- 61
ctggtccgtcgcctgtggcgcagccgtgctccgggtggtgaagatgaagaagaaggtgct
Tat-N-Smad7- 61
.....................tc.....................................
Tat-N-Smad7- 121
ggcggcggtggcggtggcggtgaactgcgtggcgagggtgcaaccgatagccgtgcacat
Tat-N-Smad7- 121
..................................................t........c
Tat-N-Smad7- 181
ggtgcaggcggtggcggtccgggtcgtgctggttgctgtctgggtaaagctgtgcgcggc
Tat-N-Smad7- 181
............................................................
Tat-N-Smad7- 241
gcgaaaggtcatcatcatccgcatccgccggcagcaggtgcaggtgcagctggcggtgcg
Tat-N-Smad7- 241
..............c........c....................................
Tat-N-Smad7- 301
gaagccgatctgaaagccctgacccatagcgtcctgaaaaaactgaaagaacgtcagctg
Tat-N-Smad7- 301
.............................t..............................
Tat-N-Smad7- 361
gagctgctgctgcaagcagtagaaagccgtggcggtacccgtacggcttgtctgctgctg
Tat-N-Smad7- 361
........................tc..................................
Tat-N-Smad7- 421
ccgggtcgtctggattgccgtctgggtccgggtgcaccggctggtgcgcagccggcacaa
Tat-N-Smad7- 421
............................................................
Tat-N-Smad7- 481
ccgccgagcagctacagcctgccgctgctgctgtgtaaagtgtttcgttggccggacctg
Tat-N-Smad7- 481
.........tct................................................
Tat-N-Smad7- 541
cgccatagcagcgaagttaaacgcctgtgctgttgcgagagctatggcaaaattaacccg
Tat-N-Smad7- 541
.....c.ttc..................................................
Tat-N-Smad7- 601
gaactggtttgttgcaatccgcatcatctgagccgtctgtgtgaactggagagcccgccg
Tat-N-Smad7- 601
.......................c......tct...........................
Tat-N-Smad7- 661
ccgccgtatagccgttacccgatggatttcctgaaaccgactgcagattgcccggacgca
Tat-N-Smad7- 661
.........tct................................................
Tat-N-Smad7- 721
gtcccgagcagcgctgagaccggcggcaccaactatctggcaccgggcggtctgagcgat
Tat-N-Smad7- 721
......tcatcg............................................t...
Tat-N-Smad7- 781
agccagctgctgctggaaccgggcgaccgtggtaagcctatccctaaccctctcctcggt
Tat-N-Smad7- 781
tc..........................................................
Tat-N-Smad7- 841 ctcgattctacgtgagtcgac Tat-N-Smad7- 841
.....................
[0329] In addition, other codon-optimized nucleic acids will also
be assessed for their ability to produce Smad7 protein in one or
more expression systems. Provided below is another example of such
a sequence.
[0330] Tat-Smad7.sup.M7216L-V5 optimized by Optimizer program:
TABLE-US-00019
ggatccggtcgtaaaaaacgtcgtcagcgtcgtcgtggtttccgtaccaaacgttctgcg G S G
R K K R R Q R R R G F R T K R S A
ctggttcgtcgtctgtggcgttctcgtgcgccgggtggtgaagacgaagaagaaggtgcg L V R
R L W R S R A P G G E D E E E G A
ggtggtggtggtggtggtggtgaactgcgtggtgaaggtgcgaccgactctcgtgcgcac G G G
G G G G E L R G E G A T D S R A H
ggtgcgggtggtggtggtccgggtcgtgcgggttgctgcctgggtaaagcggttcgtggt G A G
G G G P G R A G C C L G K A V R G
gcgaaaggtcaccaccacccgcacccgccggcggcgggtgcgggtgcggcgggtggtgcg A K G
H H H P H P P A A G A G A A G G A
gaagcggacctgaaagcgctgacccactctgttctgaaaaaactgaaagaacgtcagctg E A D
L K A L T H S V L K K L K E R Q L
gaactgctgctgcaggcggttgaatctcgtggtggtacccgtaccgcgtgcctgctgctg E L L
L Q A V E S R G G T R T A C L L L
ccgggtcgtctggactgccgtctgggtccgggtgcgccggcgggtgcgcagccggcgcag P G R
L D C R L G P G A P A G A Q P A Q
ccgccgtcttcttactctctgccgctgctgctgtgcaaagttttccgttggccggacctg P P S
S Y S L P L L L C K V F R W P D L
cgtcactcttctgaagttaaacgtctgtgctgctgcgaatcttacggtaaaatcaacccg R H S
S E V K R L C C C E S Y G K I N P
gaactggtttgctgcaacccgcaccacctgtctcgtctgtgcgaactggaatctccgccg E L V
C C N P H H L S R L C E L E S P P
ccgccgtactctcgttacccgctggacttcctgaaaccgaccgcggactgcccggacgcg P P Y
S R Y P L D F L K P T A D C P D A
gttccgtcttctgcggaaaccggtggtaccaactacctggcgccgggtggtctgtctgac V P S
S A E T G G T N Y L A P G G L S D
tctcagctgctgctggaaccgggtgaccgttctcactggtgcgttgttgcgtactgggaa S Q L
L L E P G D R S H W C V V A Y W E
gaaaaaacccgtgttggtcgtctgtactgcgttcaggaaccgtctctggacatcttctac E K T
R V G R L Y C V Q E P S L D I F Y
gacctgccgcagggtaacggtttctgcctgggtcagctgaactctgacaacaaatctcag D L P
Q G N G F C L G Q L N S D N K S Q
ctggttcagaaagttcgttctaaaatcggttgcggtatccagctgacccgtgaagttgac L V Q
K V R S K I G C G I Q L T R E V D
ggtgtttgggtttacaaccgttcttcttacccgatcttcatcaaatctgcgaccctggac G V W
V Y N R S S Y P I F I K S A T L D
aacccggactctcgtaccctgctggttcacaaagttttcccgggtttctctatcaaagcg N P D
S R T L L V H K V F P G F S I K A
ttcgactacgaaaaagcgtactctctgcagcgtccgaacgaccacgaattcatgcagcag F D Y
E K A Y S L Q R P N D H E F M Q Q
ccgtggaccggtttcaccgttcagatctctttcgttaaaggttggggtcagtgctacacc P W T
G F T V Q I S F V K G W G Q C Y T
cgtcagttcatctcttcttgcccgtgctggctggaagttatcttcaactctcgtggtaaa R Q F
I S S C P C W L E V I F N S R G K
ccgatcccgaacccgctgctgggtctggactctacctgagtcgac (SEQ ID NO: 36) P I P
N P L L G L D S T - - (SEQ ID NO: 37)
[0331] Nucleotide Sequence:
1-6: BamHI; 7-36: Tat; 37-1314: codon-optimized human Smad7;
1315-1356: V5; 137-1359: stop; 1360-1365 SalI ATG is removed to be
used with GST; 682 ATG to CTG (M216 to L)
TABLE-US-00020 (SEQ ID NO: 36) 1 ggatccggtc gtaaaaaacg tcgtcagcgt
cgtcgtggtt tccgtaccaa acgttctgcg 61 ctggttcgtc gtctgtggcg
ttctcgtgcg ccgggtggtg aagacgaaga agaaggtgcg 121 ggtggtggtg
gtggtggtgg tgaactgcgt ggtgaaggtg cgaccgactc tcgtgcgcac 181
ggtgcgggtg gtggtggtcc gggtcgtgcg ggttgctgcc tgggtaaagc ggttcgtggt
241 gcgaaaggtc accaccaccc gcacccgccg gcggcgggtg cgggtgcggc
gggtggtgcg 301 gaagcggacc tgaaagcgct gacccactct gttctgaaaa
aactgaaaga acgtcagctg 361 gaactgctgc tgcaggcggt tgaatctcgt
ggtggtaccc gtaccgcgtg cctgctgctg 421 ccgggtcgtc tggactgccg
tctgggtccg ggtgcgccgg cgggtgcgca gccggcgcag 481 ccgccgtctt
cttactctct gccgctgctg ctgtgcaaag ttttccgttg gccggacctg 541
cgtcactctt ctgaagttaa acgtctgtgc tgctgcgaat cttacggtaa aatcaacccg
601 gaactggttt gctgcaaccc gcaccacctg tctcgtctgt gcgaactgga
atctccgccg 661 ccgccgtact ctcgttaccc gctggacttc ctgaaaccga
ccgcggactg cccggacgcg 721 gttccgtctt ctgcggaaac cggtggtacc
aactacctgg cgccgggtgg tctgtctgac 781 tctcagctgc tgctggaacc
gggtgaccgt tctcactggt gcgttgttgc gtactgggaa 841 gaaaaaaccc
gtgttggtcg tctgtactgc gttcaggaac cgtctctgga catcttctac 901
gacctgccgc agggtaacgg tttctgcctg ggtcagctga actctgacaa caaatctcag
961 ctggttcaga aagttcgttc taaaatcggt tgcggtatcc agctgacccg
tgaagttgac 1021 ggtgtttggg tttacaaccg ttcttcttac ccgatcttca
tcaaatctgc gaccctggac 1081 aacccggact ctcgtaccct gctggttcac
aaagttttcc cgggtttctc tatcaaagcg 1141 ttcgactacg aaaaagcgta
ctctctgcag cgtccgaacg accacgaatt catgcagcag 1201 ccgtggaccg
gtttcaccgt tcagatctct ttcgttaaag gttggggtca gtgctacacc 1261
cgtcagttca tctcttcttg cccgtgctgg ctggaagtta tcttcaactc tcgtggtaaa
1321 ccgatcccga acccgctgct gggtctggac tctacctgag tcgac
[0332] Sequence comparison with Tat-Smad7.sup.M7216L-V5 described
in Example 4 (alignment discloses SEQ ID NOs: 36 and 30,
respectively, in order of appearance):
TABLE-US-00021 Alignment: Global DNA alignment against reference
molecule Parameters: Scoring matrix: Linear (Mismatch 2, OpenGap 4,
ExtGap 1) Reference molecule: Tat-Smad7-216L-V5-optimizer, Region
1-1365 Number of sequences to align: 2 Settings: Similarity
significance value cutoff: >=60% Summary of Percent Matches:
Reference: Tat-Smad7-216L-V5-optimizer 1-1365 (1365 bps) --
Sequence 2: TatSmad7 Ser-His optimized-682 mutant 1-1392 (1392 bps)
79% Tat-Smad7-21 1
ggatccggtcgtaaaaaacgtcgtcagcgtcgtcgtggtttccgtaccaaacgttctgcg
TatSmad7 Ser 1
ggatccggccgtaaaaaacgccgtcaacgccgccgtggtttccgtacgaaacgcagcgcc
Tat-Smad7-21 61
ctggttcgtcgtctgtggcgttctcgtgcgccgggtggtgaagacgaagaagaaggtgcg
TatSmad7 Ser 61
ctggtccgtcgcctgtggcgcagccgtgctccgggtggtgaagatgaagaagaaggtgct
Tat-Smad7-21 121
ggtggtggtggtggtggtggtgaactgcgtggtgaaggtgcgaccgactctcgtgcgcac
TatSmad7 Ser 121
ggcggcggtggcggtggcggtgaactgcgtggcgagggtgcaaccgatagccgtgcacat
Tat-Smad7-21 181
ggtgcgggtggtggtggtccgggtcgtgcgggttgctgcctgggtaaagcggttcgtggt
TatSmad7 Ser 181
ggtgcaggcggtggcggtccgggtcgtgctggttgctgtctgggtaaagctgtgcgcggc
Tat-Smad7-21 241
gcgaaaggtcaccaccacccgcacccgccggcggcgggtgcgggtgcggcgggtggtgcg
TatSmad7 Ser 241
gcgaaaggtcatcatcatccgcatccgccggcagcaggtgcaggtgcagctggcggtgcg
Tat-Smad7-21 301
gaagcggacctgaaagcgctgacccactctgttctgaaaaaactgaaagaacgtcagctg
TatSmad7 Ser 301
gaagccgatctgaaagccctgacccatagcgtcctgaaaaaactgaaagaacgtcagctg
Tat-Smad7-21 361
gaactgctgctgcaggcggttgaatctcgtggtggtacccgtaccgcgtgcctgctgctg
TatSmad7 Ser 361
gagctgctgctgcaagcagtagaaagccgtggcggtacccgtacggcttgtctgctgctg
Tat-Smad7-21 421
ccgggtcgtctggactgccgtctgggtccgggtgcgccggcgggtgcgcagccggcgcag
TatSmad7 Ser 421
ccgggtcgtctggattgccgtctgggtccgggtgcaccggctggtgcgcagccggcacaa
Tat-Smad7-21 481
ccgccgtcttcttactctctgccgctgctgctgtgcaaagttttccgttggccggacctg
TatSmad7 Ser 481
ccgccgagcagctacagcctgccgctgctgctgtgtaaagtgtttcgttggccggacctg
Tat-Smad7-21 541
cgtcactcttctgaagttaaacgtctgtgctgctgcgaatcttacggtaaaatcaacccg
TatSmad7 Ser 541
cgccatagcagcgaagttaaacgcctgtgctgttgcgagagctatggcaaaattaacccg
Tat-Smad7-21 601
gaactggtttgctgcaacccgcaccacctgtctcgtctgtgcgaactggaatctccgccg
TatSmad7 Ser 601
gaactggtttgttgcaatccgcatcatctgagccgtctgtgtgaactggagagcccgccg
Tat-Smad7-21 661
ccgccgtactctcgttacccgctggacttcctgaaaccgaccgcggactgcccggacgcg
TatSmad7 Ser 661
ccgccgtatagccgttacccgctggatttcctgaaaccgactgcagattgcccggacgca
Tat-Smad7-21 721
gttccgtcttctgcggaaaccggtggtaccaactacctggcgccgggtggtctgtctgac
TatSmad7 Ser 721
gtcccgagcagcgctgagaccggcggcaccaactatctggcaccgggcggtctgagcgat
Tat-Smad7-21 781
tctcagctgctgctggaaccgggtgaccgttctcactggtgcgttgttgcgtactgggaa
TatSmad7 Ser 781
agccagctgctgctggaaccgggcgaccgtagccattggtgtgtggttgcctattgggaa
Tat-Smad7-21 841
gaaaaaacccgtgttggtcgtctgtactgcgttcaggaaccgtctctggacatcttctac
TatSmad7 Ser 841
gagaaaacgcgtgtcggtcgcctgtactgcgtacaggaaccgagcctggatatcttttat
Tat-Smad7-21 901
gacctgccgcagggtaacggtttctgcctgggtcagctgaactctgacaacaaatctcag
TatSmad7 Ser 901
gacctgccgcagggcaatggtttctgtctgggccaactgaacagcgataataaaagccag
Tat-Smad7-21 961
ctggttcagaaagttcgttctaaaatcggttgcggtatccagctgacccgtgaagttgac
TatSmad7 Ser 961
ctggtgcaaaaagttcgcagcaaaattggctgcggtatccagctgacccgtgaagttgac
Tat-Smad7-21 1021
ggtgtttgggtttacaaccgttcttcttacccgatcttcatcaaatctgcgaccctggac
TatSmad7 Ser 1021
ggtgtctgggtatataaccgcagcagctacccgatttttatcaaaagcgccaccctggat
Tat-Smad7-21 1081
aacccggactctcgtaccctgctggttcacaaagttttcccgggtttctctatcaaagcg
TatSmad7 Ser 1081
aatccggacagccgtacgctgctggtccataaagtatttccgggcttcagcatcaaagcg
Tat-Smad7-21 1141
ttcgactacgaaaaagcgtactctctgcagcgtccgaacgaccacgaattcatgcagcag
TatSmad7 Ser 1141
ttcgattacgagaaagcctacagcctgcagcgcccgaacgaccatgaattcatgcagcaa
Tat-Smad7-21 1201
ccgtggaccggtttcaccgttcagatctctttcgttaaaggttggggtcagtgctacacc
TatSmad7 Ser 1201
ccgtggacgggttttaccgtgcagatcagcttcgttaaaggctggggtcaatgctacacc
Tat-Smad7-21 1261
cgtcagttcatctcttcttgcccgtgctggctggaagttatcttcaactctcgt------
TatSmad7 Ser 1261
cgtcagtttatcagcagctgtccgtgctggctggaagtgattttcaatagccgcaagggc
Tat-Smad7-21 1315
---------------------ggtaaaccgatcccgaacccgctgctgggtctggactct
TatSmad7 Ser 1321
gagctcaatagcaagcttgaaggtaagcctatccctaaccctctcctcggtctcgatagc
Tat-Smad7-21 1354 acctgagtcgac TatSmad7 Ser 1381 acgtgagtcgac
[0333] Protein production using the sequences below is carried out
by transfecting mammalian expression vectors (e.g., pCMV-6-Entry)
into HEK293 or CHO cells.
[0334] Optimized GST-Tat-Smad7-myc-flag nucleotide sequence, cloned
in pCMV6-Entry plasmid:
TABLE-US-00022 (SEQ ID NO: 87) 1 gacgttgtat acgactccta tagggcggcc
gggaattcgt cgactggatc cggtaccgag 61 gagatctgcc gccgcgatcg
ccatgtcccc cattctgggc tactggaaga ttaagggcct 121 ggtgcagcct
actagactgc tgctggaata cctggaggaa aaatatgaag agcatctgta 181
tgaaagagac gagggggata aatggaggaa caagaaattc gaactgggac tggagtttcc
241 taatctgcca tactatattg acggcgatgt gaagctgact cagtctatgg
ctatcattag 301 atacatcgca gacaaacaca acatgctggg cgggtgtcct
aaggaaaggg cagagattag 361 tatgctggag ggagccgtgc tggatattag
atacggcgtc tcacgcatcg cctatagcaa 421 agacttcgaa accctgaagg
tggattttct gagcaaactg cctgaaatgc tgaagatgtt 481 cgaggacaga
ctgtgccaca aaacctacct gaatggcgac catgtcacac acccagattt 541
tatgctgtac gacgccctgg atgtggtcct gtatatggac cccatgtgtc tggatgcttt
601 ccctaagctg gtgtgcttta agaaaaggat cgaggcaatt ccccagatcg
ataagtacct 661 gaaaagctcc aagtatatcg cttggcctct ccagggctgg
caggcaacat tcggaggcgg 721 ggaccatccc cctaaaagcg acctggaggt
gctgtttcag ggaccactgg gcagcggccg 781 gaagaagcgg cggcagaggc
gccgaagtag gttccgcact aagcggtcag cactggtgcg 841 gagactgtgg
cgatctcggg ctcctggagg agaggacgag gaagagggag caggcggcgg 901
cggaggagga ggagaactgc gcggggaggg agctacagat agccgagccc acggagctgg
961 aggaggagga ccagggcgag ccggatgctg tctgggcaaa gcagtgagag
gcgccaaggg 1021 gcaccatcac ccccatccac ccgccgctgg cgcaggagca
gccggcggag ctgaggcaga 1081 cctgaaagcc ctgactcaca gtgtgctgaa
gaaactgaag gaaagacagc tggagctgct 1141 gctccaggca gtcgaatcac
gcggaggcac ccgaacagct tgtctgctgc tgcccggccg 1201 gctggactgc
cggctgggac ccggcgcccc tgctggggca cagccagccc agcctccatc 1261
tagttatagc ctgcccctgc tgctgtgtaa ggtgttccga tggcctgatc tgcggcattc
1321 aagcgaagtc aaaaggctgt gctgttgcga gtcctacggc aagattaacc
cagaactggt 1381 gtgttgcaat ccccatcacc tgtctcgact gtgtgaactg
gagtcccccc ctccacccta 1441 ctctaggtat cctatggact ttctgaagcc
aaccgctgac tgcccagatg cagtgccctc 1501 ctctgccgag actgggggaa
ccaactacct ggctcctggc ggactgagcg actcccagct 1561 gctgctggaa
ccaggggatc gcagccactg gtgtgtggtc gcctactggg aagagaagac 1621
aagagtggga aggctgtatt gcgtccagga gccttccctg gacatcttct acgatctgcc
1681 acaggggaat ggattttgtc tgggccagct gaactctgac aataagagtc
agctggtgca 1741 gaaagtccgg agcaagattg gctgcggcat ccagctgacc
agggaggtgg acggcgtgtg 1801 ggtctacaac cgcagttcat atccaatctt
catcaagagc gccactctgg acaatcccga 1861 ttcccgcacc ctgctggtgc
ataaggtctt ccccggcttc agcatcaagg ccttcgacta 1921 cgagaaggct
tatagtctcc agcggcccaa cgatcacgag ttcatgcagc agccttggac 1981
aggcttcact gtgcagatca gcttcgtcaa gggatggggc cagtgctaca caaggcagtt
2041 catctcaagc tgtccctgtt ggctggaagt cattttcaat agtaggacgc
gtacqcqqcc 2101 gctcgagcag aaactcatct cagaagagga tctggcagca
aatgatatcc tggattacaa 2161 ggatgacgac gataaggttt aa
[0335] The entire sequence (up to the NotI site) is codon optimized
for mammalian expression, using the OPTIMUMGENE.TM. program from
GenScript.
[0336] 1790-1792: changes GAT to GAC, (kept Asp) to avoid
alternative ORF (highlight)
[0337] 1-82: cloning sites+kozac sequence in pCMV6-Entry
[0338] 83-777: GST+ precision enzyme sites; 776-805: Tat;
[0339] 806-2086: full length human Smad7
[0340] 2087-2107: MluI, NotI, XhoI
[0341] 2108-2182: myc-flag from pCMV6-Entry
[0342] Bold underline: unique NotI site
[0343] The above nucleotide sequence is 79% homologous to native
human Smad7.
TABLE-US-00023 Mol 1 human Smad7 mRNA (1-3103) Mol 2 optimized
GST-Tat-Smad7-myc-flag (1-2182) Number of sequences to align: 2
Settings: Sort results list by Score human Smad7 500
tcaccaccatccccacccgccagccgcgggcgccggcgcggccgggggcg optimized GS
1027 .....c......--.......----..t.....a..a..a.....c.a. human Smad7
550 ccgaggcggatctgaaggcgctcacgcactcggtgctcaagaaactgaag optimized GS
1071 .t.....a..c.....a..c..g..t...agt.....g............ human Smad7
600 gagcggcagctggagctgctgctccaggccgtggagtcccgcggcgggac optimized GS
1121 ..aa.a.......................a..c..a..a.....a..c. human Smad7
650 gcgcaccgcgtgcctcctgctgcccggccgcctggactgcaggctgggcc optimized GS
1171 c..a..a..t..t..g..............g.........c.......a. human Smad7
700 cgggggcgcccgccggcgcgcagcctgcgcagccgccctcgtcctactcg optimized GS
1221 .c..c..c..t..t..g..a.....a..c.....t..a..tagt..tagc human Smad7
750 ctccccctcctgctgtgcaaagtgttcaggtggccggatctcaggcattc optimized GS
1271 ..g.....g........t..g......c.a.....t.....gc....... human Smad7
800 ctcggaagtcaagaggctgtgttgctgtgaatcttacgggaagatcaacc optimized GS
1321 aagc........a........c..t..c..g..c.....c.....t.... human Smad7
850 ccgagctggtgtgctgcaacccccatcaccttagccgactctgcgaacta optimized GS
1371 .a..a........t.....t...........gtct.....g..t.....g human Smad7
900 gagtctcccccccctccttactccagatacccgatggattttctcaaacc optimized GS
1421 .....c.....t..a..c.....t..g..t..t.....c.....g..g.. human Smad7
950 aactgcagactgtccagatgctgtgccttcctccgctgaaacagggggaa optimized GS
1471 ...c.t.....c........a.....c.....t..c..g..t....... human Smad7
1000 cgaattatctggcccctggggggctttcagattcccaacttcttctggag optimized
GS 1521 .c.c.c.....t.....c..a..gagc..c.....g..g..g.....a human
Smad7 1050 cctggggatcggtcacactggtgcgtggtggcatactgggaggagaagac
optimized GS 1571 ..a........cagc........t.....c.c........a........
human Smad7 1100 gagagtggggaggctctactgtgtccaggagccctctctggatatcttct
optimized GS 1621
a........a.....g..t..c...........t..c.....c....... human Smad7 1150
atgatctacctcaggggaatggcttttgcctcggacagctcaattcggac optimized GS
1671 .c.....g..a...........a.....t..g..c.....g..c..t... human Smad7
1200 aacaagagtcagctggtgcagaaggtgcggagcaaaatcggctgcggcat optimized
GS 1721 ..t....................a..c........g..t........... human
Smad7 1250 ccagctgacgcgggaggtggatggtgtgtgggtgtacaaccgcagcagtt
optimized GS 1771 .........ca..........c.c........c...........ttca.
human Smad7 1300 accccatcttcatcaagtccgccacactggacaacccggactccaggacg
optimized GS 1821
.t..a............ag......t........t..c..t...c.c..c human Smad7 1350
ctgttggtacacaaggtgttccccggtttctccatcaaggctttcgacta optimized GS
1871 ...c....g..t.....c........c...ag.........c........ human Smad7
1400 cgagaaggcgtacagcctgcagcggcccaatgaccacgagtttatgcagc optimized
GS 1921 .........t..t..t..c...........c.t........c....... human
Smad7 1450 agccgtggacgggctttaccgtgcagatcagctttgtgaagggctggggt
optimized GS 1971 ....t.....a.....c.t..............c.c.....a.....c
human Smad7 1500 cagtgctacacccgccagttcatcagcagctgcccgtgctggctagaggt
optimized GS 2021
...........aa.g.........tca.....t..c..t.....g..a.. human Smad7 1550
catcttcaa (SEQ ID NO: 88) optimized GS 2071 ...t..... (SEQ ID NO:
89)
[0344] Optimized GST-Tat-Smad7-myc-flag protein sequence from the
above nucleotide sequence:
TABLE-US-00024 (SEQ ID NO: 90) 1 mspilgywki kglvqptrll leyleekyee
hlyerdegdk wrnkkfelgl efpnlpyyid 61 gdvkltqsma iiryiadkhn
mlggcpkera eismlegavl dirygvsria yskdfetlkv 121 dflsklpeml
kmfedrlchk tylngdhvth pdfmlydald vvlymdpmcl dafpklvcfk 181
krieaipqid kylksskyia wplqgwqatf gggdhppksd levlfqgplg sgrkkrrqrr
241 rsrfrtkrsa lvrrlwrsra pggedeeega gggggggelr gegatdsrah
gaggggpgra 301 gcclgkavrg akghhhphpp aagagaagga eadlkalths
vlkklkerql elllqavesr 361 ggtrtaclll pgrldcrlgp gapagagpaq
ppssyslpll lckvfrwpdl rhssevkrlc 421 ccesygkinp elvccnphhl
srlcelespp ppysrypmdf lkptadcpda vpssaetggt 481 nylapgglsd
sqlllepgdr shwcvvaywe ektrvgrlyc vqepsldify dlpqgngfcl 541
gqlnsdnksq lvqkvrskig cgiqltrevd gvwvynrssy pifiksatld npdsrtllvh
601 kvfpgfsika fdyekayslq rpndhefmqq pwtgftvqis fvkgwgqcyt
rqfisscpcw 661 levifnsrtr trpleqklis eedlaandil dykddddkv 1-229:
GST+precision sites 232-668 Tat-Smad7 671-699: restriction
sites+myc-Flag
[0345] Optimized GST-Tat-N-Smad7-myc-flag nucleotide sequence,
cloned in pCMV6-Entry plasmid:
TABLE-US-00025 (SEQ ID NO: 91) 1 gaattcgtcg actggatccg gtaccgagga
gatctgccgc cgcgatcgcc atgtccccca 61 ttctgggcta ctggaagatt
aagggcctgg tgcagcctac tagactgctg ctggaatacc 121 tggaggaaaa
atatgaagag catctgtatg aaagagacga gggggataaa tggaggaaca 181
agaaattcga actgggactg gagtttccta atctgccata ctatattgac ggcgatgtga
241 agctgactca gtctatggct atcattagat acatcgcaga caaacacaac
atgctgggcg 301 ggtgtcctaa ggaaagggca gagattagta tgctggaggg
agccgtgctg gatattagat 361 acggcgtctc acgcatcgcc tatagcaaag
acttcgaaac cctgaaggtg gattttctga 421 gcaaactgcc tgaaatgctg
aagatgttcg aggacagact gtgccacaaa acctacctga 481 atggcgacca
tgtcacacac ccagatttta tgctgtacga cgccctggat gtggtcctgt 541
atatggaccc catgtgtctg gatgctttcc ctaagctggt gtgctttaag aaaaggatcg
601 aggcaattcc ccagatcgat aagtacctga aaagctccaa gtatatcgct
tggcctctcc 661 agggctggca ggcaacattc ggaggcgggg accatccccc
taaaagcgac ctggaggtgc 721 tgtttcaggg accactgggc agcggccgga
agaagcggcg gcagaggcgc cgaagtaggt 781 tccgcactaa gcggtcagca
ctggtgcgga gactgtggcg atctcgggct cctggaggag 841 aggacgagga
agagggagca ggcggcggcg gaggaggagg agaactgcgc ggggagggag 901
ctacagatag ccgagcccac ggagctggag gaggaggacc agggcgagcc ggatgctgtc
961 tgggcaaagc agtgagaggc gccaaggggc accatcaccc ccatccaccc
gccgctggcg 1021 caggagcagc cggcggagct gaggcagacc tgaaagccct
gactcacagt gtgctgaaga 1081 aactgaagga aagacagctg gagctgctgc
tccaggcagt cgaatcacgc ggaggcaccc 1141 gaacagcttg tctgctgctg
cccggccggc tggactgccg gctgggaccc ggcgcccctg 1201 ctggggcaca
gccagcccag cctccatcta gttatagcct gcccctgctg ctgtgtaagg 1261
tgttccgatg gcctgatctg cggcattcaa gcgaagtcaa aaggctgtgc tgttgcgagt
1321 cctacggcaa gattaaccca gaactggtgt gttgcaatcc ccatcacctg
tctcgactgt 1381 gtgaactgga gtccccccct ccaccctact ctaggtatcc
tatggacttt ctgaagccaa 1441 ccgctgactg cccagatgca gtgccctcct
ctgccgagac tgggggaacc aactacctgg 1501 ctcctggcgg actgagcgac
tcccagctgc tgctggaacc aggggatcgg cqqccqctcg 1561 agcagaaact
catctcagaa gaggatctgg cagcaaatga tatcctggat tacaaggatg 1621
acgacgataa ggtttaa
[0346] The entire sequence (up to the NotI site) is codon optimized
for mammalian expression, using the OPTIMUMGENE.TM. program from
GenScript.
[0347] 1-50: cloning sites+kozac sequence in pCMV6-Entry
[0348] 51-743: GST+ precision enzyme sites
[0349] 744-773-: Tat
[0350] 774-1549-: human Smad7 encoding N-terminal 1-258aa
[0351] 1550-1563: NotI, XhoI, Bold underline: unique NotI site
[0352] 1564-1637: myc-flag from pCMV6-Entry
[0353] Optimized GST-Tat-N-Smad7-myc-flag protein sequence from the
above nucleotide sequence:
TABLE-US-00026 (SEQ ID NO: 92) 1 mspilgywki kglvqptrll leyleekyee
hlyerdegdk wrnkkfelgl efpnlpyyid 61 gdvkltqsma iiryiadkhn
mlggcpkera eismlegavl dirygvsria yskdfetlkv 121 dflsklpeml
kmfedrlchk tylngdhvth pdfmlydald vvlymdpmcl dafpklvcfk 181
krieaipqid kylksskyia wplqgwqatf gggdhppksd levlfqgplg sgrkkrrqrr
241 rsrfrtkrsa lvrrlwrsra pggedeeega gggggggelr gegatdsrah
gaggggpgra 301 gcclgkavrg akghhhphpp aagagaagga eadlkalths
vlkklkerql elllqavesr 361 ggtrtaclll pgrldcrlgp gapagaqpaq
ppssyslpll lckvfrwpdl rhssevkrlc 421 ccesygkinp elvccnphhl
srlcelespp ppysrypmdf lkptadcpda vpssaetggt 481 nylapgglsd
sqlllepgdr rpleqklise edlaandild ykddddkv 1-241: GST-Tat 244-500:
N-Smad7 2-258 501-528: restriction sites+myc-Flag
[0354] Optimized GST-Tat-C-Smad7-myc-flag nucleotide sequence,
cloned in pCMV6-Entry plasmid:
TABLE-US-00027 (SEQ ID NO: 93) 1 gaattcgtcg actggatccg gtaccgagga
gatctgccgc cgcgatcgcc atgtccccca 61 ttctgggcta ctggaagatt
aagggcctgg tgcagcctac tagactgctg ctggaatacc 121 tggaggaaaa
atatgaagag catctgtatg aaagagacga gggggataaa tggaggaaca 181
agaaattcga actgggactg gagtttccta atctgccata ctatattgac ggcgatgtga
241 agctgactca gtctatggct atcattagat acatcgcaga caaacacaac
atgctgggcg 301 ggtgtcctaa ggaaagggca gagattagta tgctggaggg
agccgtgctg gatattagat 361 acggcgtctc acgcatcgcc tatagcaaag
acttcgaaac cctgaaggtg gattttctga 421 gcaaactgcc tgaaatgctg
aagatgttcg aggacagact gtgccacaaa acctacctga 481 atggcgacca
tgtcacacac ccagatttta tgctgtacga cgccctggat gtggtcctgt 541
atatggaccc catgtgtctg gatgctttcc ctaagctggt gtgctttaag aaaaggatcg
601 aggcaattcc ccagatcgat aagtacctga aaagctccaa gtatatcgct
tggcctctcc 661 agggctggca ggcaacattc ggaggcgggg accatccccc
taaaagcgac ctggaggtgc 721 tgtttcaggg accactgggc agcggccgga
agaagcggcg gcagaggcgc agccactggt 781 gtgtggtcgc ctactgggaa
gagaagacaa gagtgggaag gctgtattgc gtccaggagc 841 cttccctgga
catcttctac gatctgccac aggggaatgg attttgtctg ggccagctga 901
actctgacaa taagagtcag ctggtgcaga aagtccggag caagattggc tgcggcatcc
961 agctgaccag ggaggtggac ggcgtgtggg tctacaaccg cagttcatat
ccaatcttca 1021 tcaagagcgc cactctggac aatcccgatt cccgcaccct
gctggtgcat aaggtcttcc 1081 ccggcttcag catcaaggcc ttcgactacg
agaaggctta tagtctccag cggcccaacg 1141 atcacgagtt catgcagcag
ccttggacag gcttcactgt gcagatcagc ttcgtcaagg 1201 gatggggcca
gtgctacaca aggcagttca tctcaagctg tccctgttgg ctggaagtca 1261
ttttcaatag taggacgcgt acgcggccgc tcgagcagaa actcatctca gaagaggatc
1321 tggcagcaaa tgatatcctg gattacaagg atgacgacga taaggtttaa
[0355] The entire sequence (up to the NotI site) is codon optimized
for mammalian expression, using the OPTIMUMGENE.TM. program from
GenScript.
[0356] 1-50: cloning sites+kozac sequence in pCMV6-Entry
[0357] 51-743: GST+ precision enzyme sites
[0358] 744-773: Tat
[0359] 774-1274: c-terminal Smad7
[0360] 1275: Mlu
[0361] 1283: NotI
[0362] 1275-1370: mluI, NotI, XhoI, myc+flag
[0363] Optimized GST-Tat-C-Smad7-myc-flag protein sequence from the
above nucleotide sequence:
TABLE-US-00028 (SEQ ID NO: 94) 1 mspilgywki kglvqptrll leyleekyee
hlyerdegdk wrnkkfelgl efpnlpyyid 61 gdvkltqsma iiryiadkhn
mlggcpkera eismlegavl dirygvsria yskdfetlkv 121 dflsklpeml
kmfedrlchk tylngdhvth pdfmlydald vvlymdpmcl dafpklvcfk 181
krieaipqid kylksskyia wplqgwqatf gggdhppksd levlfqgplg sgrkkrrqrr
241 rshwcvvayw eektrvgrly cvqepsldif ydlpqgngfc lgqlnsdnks
qlvqkvrski 301 gcgiqltrev dgvwvynrss ypifiksatl dnpdsrtllv
hkvfpgfsik afdyekaysl 361 qrpndhefmq qpwtgftvqi sfvkgwgqcy
trqfisscpc wlevifnsrt rtrpleqkli 421 seedlaandi ldykddddkv 1-241:
GST+precision+Tat 242-409: c-Smad7 (259-426aa) 410-440:
Mlu+Not+myc+flag: restriction sites+myc-Flag
[0364] Based on the Smad7 structure, it is anticipated that the
linker (203-258aa), which contains the PY domain (203-217aa), will
have therapeutic effects by blocking TGF-.beta.-induced
inflammation or growth inhibition (FIG. 15).
[0365] The amino acid sequence for the Tat-Smad7-linker peptide
(203-258aa) is:
TABLE-US-00029 (SEQ ID NO: 95)
GRKKRRQRRR-ELESPPPPYSRYPMDFLKPTADCPDAVPSSAETGGTNYL
APGGLSDSQLLLEPGDR 1-10: Tat
[0366] The codon optimized nucleotide sequence for the above
peptide in bacterial production is:
TABLE-US-00030 (SEQ ID NO: 96) 1 ggccgtaaaa aacgccgtca acgccgccgt
gaactggaga gcccgccgcc gccgtatagc 61 cgttacccga tggatttcct
gaaaccgact gcagattgcc cggacgcagt cccgagcagc 121 gctgagaccg
gcggcaccaa ctatctggca ccgggcggtc tgagcgatag ccagctgctg 181
ctggaaccgg gcgaccgtgg taagcctatc cctaaccctc tcctcggtct cgattctacg
241 tgagtcgac 1-30: Tat 31-198: Smad7 linker 199-249: V5
[0367] The codon optimized nucleotide sequence for the above
peptide in mammalian cell production is:
TABLE-US-00031 (SEQ ID NO: 97) 1 ggccggaaga agcggcggca gaggcgccga
gaactggagt ccccccctcc accctactct 61 aggtatccta tggactttct
gaagccaacc gctgactgcc cagatgcagt gccctcctct 121 gccgagactg
ggggaaccaa ctacctggct cctggcggac tgagcgactc ccagctgctg 181
ctggaaccag gggatcggcg gccgctcgag cagaaactca tctcagaaga ggatctggca
241 gcaaatgata tcctggatta caaggatgac gacgataagg tttaa 1-30:
mammalian codon optimized Tat 31-198: Smad7 linker 199-285: NotI,
Myc tag, Flag tag
[0368] The amino acid sequence for the Tat-Smad7-PY peptide
(203-217aa) is:
TABLE-US-00032 (SEQ ID NO: 98) GRKKRRQRRR-ELESPPPPYSRYPMD 1-10:
Tat
[0369] The codon optimized nucleotide sequence for the above
peptide in bacterial production is:
TABLE-US-00033 (SEQ ID NO: 99) 1 ggccgtaaaa aacgccgtca acgccgccgt
gaactggaga gcccgccgcc gccgtatagc 61 cgttacccga tggatggtaa
gcctatccct aaccctctcc tcggtctcga ttctacgtga 121 gtcgac 1-30: Tat
31-75: Smad7-PY 76-126: V5
[0370] The codon optimized nucleotide sequence for the above
peptide in mammalian cell production is:
TABLE-US-00034 (SEQ ID NO: 100) 1 ggccggaaga agcggcggca gaggcgccga
gaactggagt ccccccctcc accctactct 61 aggtatccta tggaccggcc
gctcgagcag aaactcatct cagaagagga tctggcagca 121 aatgatatcc
tggattacaa ggatgacgac gataaggttt aa 1-30: mammalian codon optimized
Tat 31-75: Smad7 PY 76-162: NotI, Myc tag, Flag tag
[0371] The foregoing discussion of the present technology has been
presented for purposes of illustration and description. The
foregoing is not intended to limit the present technology to the
form or forms disclosed herein. Although the description of the
present technology has included description of one or more
embodiments and certain variations and modifications, other
variations and modifications are within the scope of the present
technology, e.g., as may be within the skill and knowledge of those
in the art, after understanding the present disclosure. It is
intended to obtain rights which include alternative embodiments to
the extent permitted, including alternate, interchangeable and/or
equivalent structures, functions, ranges or steps to those claimed,
whether or not such alternate, interchangeable and/or equivalent
structures, functions, ranges or steps are disclosed herein, and
without intending to publicly dedicate any patentable subject
matter.
Sequence CWU 1
1
101130DNAHuman immunodeficiency virusCDS(1)..(30) 1ggc cgt aaa aaa
cgc cgt caa cgc cgc cgt 30Gly Arg Lys Lys Arg Arg Gln Arg Arg Arg1
5 10210PRTHuman immunodeficiency virus 2Gly Arg Lys Lys Arg Arg Gln
Arg Arg Arg1 5 10333DNAHuman immunodeficiency virus 3tatggccgta
aaaaacgccg tcaacgccgc cgt 33411PRTHuman immunodeficiency virus 4Tyr
Gly Arg Lys Lys Arg Arg Gln Arg Arg Arg1 5 10521DNAHuman
immunodeficiency virus 5ggccgtaaaa aacgccgtca a 2167PRTHuman
immunodeficiency virus 6Gly Arg Lys Lys Arg Arg Gln1 5730DNAHuman
immunodeficiency virus 7ggccgtaaaa aacgccgtca acgccgccgt
30830DNAHuman immunodeficiency virus 8ggccgtaaaa aacgccgtca
acgccgccgt 3091497DNAArtificial SequenceDescription of Artificial
Sequence Synthetic polynucleotideCDS(58)..(1461) 9ttcccctcta
gaaataattt tgtttaactt taagaaggaa ttcaggagcc cttcacc 57atg cgt aaa
aaa cgc cgt caa cgc cgc cgt ggt ttc cgt acg aaa cgc 105Met Arg Lys
Lys Arg Arg Gln Arg Arg Arg Gly Phe Arg Thr Lys Arg1 5 10 15tcg gcc
ctg gtc cgt cgc ctg tgg cgc tcc cgt gct ccg ggt ggt gaa 153Ser Ala
Leu Val Arg Arg Leu Trp Arg Ser Arg Ala Pro Gly Gly Glu 20 25 30gat
gaa gaa gaa ggt gct ggc ggc ggt ggc ggt ggc ggt gaa ctg cgt 201Asp
Glu Glu Glu Gly Ala Gly Gly Gly Gly Gly Gly Gly Glu Leu Arg 35 40
45ggc gag ggt gca acc gat agt cgt gca cac ggt gca ggc ggt ggc ggt
249Gly Glu Gly Ala Thr Asp Ser Arg Ala His Gly Ala Gly Gly Gly Gly
50 55 60ccg ggt cgt gct ggt tgc tgt ctg ggt aaa gct gtg cgc ggc gcg
aaa 297Pro Gly Arg Ala Gly Cys Cys Leu Gly Lys Ala Val Arg Gly Ala
Lys65 70 75 80ggt cat cac cat ccg cac ccg ccg gca gca ggt gca ggt
gca gct ggc 345Gly His His His Pro His Pro Pro Ala Ala Gly Ala Gly
Ala Ala Gly 85 90 95ggt gcg gaa gcc gat ctg aaa gcc ctg acc cat agt
gtc ctg aaa aaa 393Gly Ala Glu Ala Asp Leu Lys Ala Leu Thr His Ser
Val Leu Lys Lys 100 105 110ctg aaa gaa cgt cag ctg gag ctg ctg ctg
caa gca gta gaa tcc cgt 441Leu Lys Glu Arg Gln Leu Glu Leu Leu Leu
Gln Ala Val Glu Ser Arg 115 120 125ggc ggt acc cgt acg gct tgt ctg
ctg ctg ccg ggt cgt ctg gat tgc 489Gly Gly Thr Arg Thr Ala Cys Leu
Leu Leu Pro Gly Arg Leu Asp Cys 130 135 140cgt ctg ggt ccg ggt gca
ccg gct ggt gcg cag ccg gca caa ccg ccg 537Arg Leu Gly Pro Gly Ala
Pro Ala Gly Ala Gln Pro Ala Gln Pro Pro145 150 155 160agc tct tac
agc ctg ccg ctg ctg ctg tgt aaa gtg ttt cgt tgg ccg 585Ser Ser Tyr
Ser Leu Pro Leu Leu Leu Cys Lys Val Phe Arg Trp Pro 165 170 175gac
ctg cgc cac agt tcc gaa gtt aaa cgc ctg tgc tgt tgc gag agc 633Asp
Leu Arg His Ser Ser Glu Val Lys Arg Leu Cys Cys Cys Glu Ser 180 185
190tat ggc aaa att aac ccg gaa ctg gtt tgt tgc aat ccg cac cat ctg
681Tyr Gly Lys Ile Asn Pro Glu Leu Val Cys Cys Asn Pro His His Leu
195 200 205tct cgt ctg tgt gaa ctg gag agc ccg ccg ccg ccg tat tct
cgt tac 729Ser Arg Leu Cys Glu Leu Glu Ser Pro Pro Pro Pro Tyr Ser
Arg Tyr 210 215 220ccg atg gat ttc ctg aaa ccg act gca gat tgc ccg
gac gca gtc ccg 777Pro Met Asp Phe Leu Lys Pro Thr Ala Asp Cys Pro
Asp Ala Val Pro225 230 235 240tca tcg gct gag acc ggc ggc acc aac
tat ctg gca ccg ggc ggt ctg 825Ser Ser Ala Glu Thr Gly Gly Thr Asn
Tyr Leu Ala Pro Gly Gly Leu 245 250 255agt gat tcc cag ctg ctg ctg
gaa ccg ggc gac cgt tca cat tgg tgt 873Ser Asp Ser Gln Leu Leu Leu
Glu Pro Gly Asp Arg Ser His Trp Cys 260 265 270gtg gtt gcc tat tgg
gaa gag aaa acg cgt gtc ggt cgc ctg tac tgc 921Val Val Ala Tyr Trp
Glu Glu Lys Thr Arg Val Gly Arg Leu Tyr Cys 275 280 285gta cag gaa
ccg tcg ctg gat atc ttt tat gac ctg ccg cag ggc aat 969Val Gln Glu
Pro Ser Leu Asp Ile Phe Tyr Asp Leu Pro Gln Gly Asn 290 295 300ggt
ttc tgt ctg ggc caa ctg aac tca gat aat aaa tcg cag ctg gtg 1017Gly
Phe Cys Leu Gly Gln Leu Asn Ser Asp Asn Lys Ser Gln Leu Val305 310
315 320caa aaa gtt cgc tca aaa att ggc tgc ggt atc cag ctg acc cgt
gaa 1065Gln Lys Val Arg Ser Lys Ile Gly Cys Gly Ile Gln Leu Thr Arg
Glu 325 330 335gtt gac ggt gtc tgg gta tat aac cgc agc tct tac ccg
att ttt atc 1113Val Asp Gly Val Trp Val Tyr Asn Arg Ser Ser Tyr Pro
Ile Phe Ile 340 345 350aaa agt gcc acc ctg gat aat ccg gac tcc cgt
acg ctg ctg gtc cac 1161Lys Ser Ala Thr Leu Asp Asn Pro Asp Ser Arg
Thr Leu Leu Val His 355 360 365aaa gta ttt ccg ggc ttc tca atc aaa
gcg ttc gat tac gag aaa gcc 1209Lys Val Phe Pro Gly Phe Ser Ile Lys
Ala Phe Asp Tyr Glu Lys Ala 370 375 380tac tcg ctg cag cgc ccg aac
gac cat gaa ttc atg cag caa ccg tgg 1257Tyr Ser Leu Gln Arg Pro Asn
Asp His Glu Phe Met Gln Gln Pro Trp385 390 395 400acg ggt ttt act
gtg cag atc tct ttc gtt aaa ggc tgg ggt caa tgc 1305Thr Gly Phe Thr
Val Gln Ile Ser Phe Val Lys Gly Trp Gly Gln Cys 405 410 415tac acc
cgt cag ttt atc tcg tcc tgt ccg tgc tgg ctg gaa gtg att 1353Tyr Thr
Arg Gln Phe Ile Ser Ser Cys Pro Cys Trp Leu Glu Val Ile 420 425
430ttc aat agc cgc aag ggc gag ctc aat tcg aag ctt gaa ggt aag cct
1401Phe Asn Ser Arg Lys Gly Glu Leu Asn Ser Lys Leu Glu Gly Lys Pro
435 440 445atc cct aac cct ctc ctc ggt ctc gat tct acg cgt acc ggt
cat cat 1449Ile Pro Asn Pro Leu Leu Gly Leu Asp Ser Thr Arg Thr Gly
His His 450 455 460cac cat cac cat tgagtttgat ccggctgcta acaaagcccg
aaagga 1497His His His His46510468PRTArtificial SequenceDescription
of Artificial Sequence Synthetic polypeptide 10Met Arg Lys Lys Arg
Arg Gln Arg Arg Arg Gly Phe Arg Thr Lys Arg1 5 10 15Ser Ala Leu Val
Arg Arg Leu Trp Arg Ser Arg Ala Pro Gly Gly Glu 20 25 30Asp Glu Glu
Glu Gly Ala Gly Gly Gly Gly Gly Gly Gly Glu Leu Arg 35 40 45Gly Glu
Gly Ala Thr Asp Ser Arg Ala His Gly Ala Gly Gly Gly Gly 50 55 60Pro
Gly Arg Ala Gly Cys Cys Leu Gly Lys Ala Val Arg Gly Ala Lys65 70 75
80Gly His His His Pro His Pro Pro Ala Ala Gly Ala Gly Ala Ala Gly
85 90 95Gly Ala Glu Ala Asp Leu Lys Ala Leu Thr His Ser Val Leu Lys
Lys 100 105 110Leu Lys Glu Arg Gln Leu Glu Leu Leu Leu Gln Ala Val
Glu Ser Arg 115 120 125Gly Gly Thr Arg Thr Ala Cys Leu Leu Leu Pro
Gly Arg Leu Asp Cys 130 135 140Arg Leu Gly Pro Gly Ala Pro Ala Gly
Ala Gln Pro Ala Gln Pro Pro145 150 155 160Ser Ser Tyr Ser Leu Pro
Leu Leu Leu Cys Lys Val Phe Arg Trp Pro 165 170 175Asp Leu Arg His
Ser Ser Glu Val Lys Arg Leu Cys Cys Cys Glu Ser 180 185 190Tyr Gly
Lys Ile Asn Pro Glu Leu Val Cys Cys Asn Pro His His Leu 195 200
205Ser Arg Leu Cys Glu Leu Glu Ser Pro Pro Pro Pro Tyr Ser Arg Tyr
210 215 220Pro Met Asp Phe Leu Lys Pro Thr Ala Asp Cys Pro Asp Ala
Val Pro225 230 235 240Ser Ser Ala Glu Thr Gly Gly Thr Asn Tyr Leu
Ala Pro Gly Gly Leu 245 250 255Ser Asp Ser Gln Leu Leu Leu Glu Pro
Gly Asp Arg Ser His Trp Cys 260 265 270Val Val Ala Tyr Trp Glu Glu
Lys Thr Arg Val Gly Arg Leu Tyr Cys 275 280 285Val Gln Glu Pro Ser
Leu Asp Ile Phe Tyr Asp Leu Pro Gln Gly Asn 290 295 300Gly Phe Cys
Leu Gly Gln Leu Asn Ser Asp Asn Lys Ser Gln Leu Val305 310 315
320Gln Lys Val Arg Ser Lys Ile Gly Cys Gly Ile Gln Leu Thr Arg Glu
325 330 335Val Asp Gly Val Trp Val Tyr Asn Arg Ser Ser Tyr Pro Ile
Phe Ile 340 345 350Lys Ser Ala Thr Leu Asp Asn Pro Asp Ser Arg Thr
Leu Leu Val His 355 360 365Lys Val Phe Pro Gly Phe Ser Ile Lys Ala
Phe Asp Tyr Glu Lys Ala 370 375 380Tyr Ser Leu Gln Arg Pro Asn Asp
His Glu Phe Met Gln Gln Pro Trp385 390 395 400Thr Gly Phe Thr Val
Gln Ile Ser Phe Val Lys Gly Trp Gly Gln Cys 405 410 415Tyr Thr Arg
Gln Phe Ile Ser Ser Cys Pro Cys Trp Leu Glu Val Ile 420 425 430Phe
Asn Ser Arg Lys Gly Glu Leu Asn Ser Lys Leu Glu Gly Lys Pro 435 440
445Ile Pro Asn Pro Leu Leu Gly Leu Asp Ser Thr Arg Thr Gly His His
450 455 460His His His His46511461PRTArtificial SequenceDescription
of Artificial Sequence Synthetic polypeptide 11Gly Ser Gly Arg Lys
Lys Arg Arg Gln Arg Arg Arg Gly Phe Arg Thr1 5 10 15Lys Arg Ser Ala
Leu Val Arg Arg Leu Trp Arg Ser Arg Ala Pro Gly 20 25 30Gly Glu Asp
Glu Glu Glu Gly Ala Gly Gly Gly Gly Gly Gly Gly Glu 35 40 45Leu Arg
Gly Glu Gly Ala Thr Asp Ser Arg Ala His Gly Ala Gly Gly 50 55 60Gly
Gly Pro Gly Arg Ala Gly Cys Cys Leu Gly Lys Ala Val Arg Gly65 70 75
80Ala Lys Gly His His His Pro His Pro Pro Ala Ala Gly Ala Gly Ala
85 90 95Ala Gly Gly Ala Glu Ala Asp Leu Lys Ala Leu Thr His Ser Val
Leu 100 105 110Lys Lys Leu Lys Glu Arg Gln Leu Glu Leu Leu Leu Gln
Ala Val Glu 115 120 125Ser Arg Gly Gly Thr Arg Thr Ala Cys Leu Leu
Leu Pro Gly Arg Leu 130 135 140Asp Cys Arg Leu Gly Pro Gly Ala Pro
Ala Gly Ala Gln Pro Ala Gln145 150 155 160Pro Pro Ser Ser Tyr Ser
Leu Pro Leu Leu Leu Cys Lys Val Phe Arg 165 170 175Trp Pro Asp Leu
Arg His Ser Ser Glu Val Lys Arg Leu Cys Cys Cys 180 185 190Glu Ser
Tyr Gly Lys Ile Asn Pro Glu Leu Val Cys Cys Asn Pro His 195 200
205His Leu Ser Arg Leu Cys Glu Leu Glu Ser Pro Pro Pro Pro Tyr Ser
210 215 220Arg Tyr Pro Met Asp Phe Leu Lys Pro Thr Ala Asp Cys Pro
Asp Ala225 230 235 240Val Pro Ser Ser Ala Glu Thr Gly Gly Thr Asn
Tyr Leu Ala Pro Gly 245 250 255Gly Leu Ser Asp Ser Gln Leu Leu Leu
Glu Pro Gly Asp Arg Ser His 260 265 270Trp Cys Val Val Ala Tyr Trp
Glu Glu Lys Thr Arg Val Gly Arg Leu 275 280 285Tyr Cys Val Gln Glu
Pro Ser Leu Asp Ile Phe Tyr Asp Leu Pro Gln 290 295 300Gly Asn Gly
Phe Cys Leu Gly Gln Leu Asn Ser Asp Asn Lys Ser Gln305 310 315
320Leu Val Gln Lys Val Arg Ser Lys Ile Gly Cys Gly Ile Gln Leu Thr
325 330 335Arg Glu Val Asp Gly Val Trp Val Tyr Asn Arg Ser Ser Tyr
Pro Ile 340 345 350Phe Ile Lys Ser Ala Thr Leu Asp Asn Pro Asp Ser
Arg Thr Leu Leu 355 360 365Val His Lys Val Phe Pro Gly Phe Ser Ile
Lys Ala Phe Asp Tyr Glu 370 375 380Lys Ala Tyr Ser Leu Gln Arg Pro
Asn Asp His Glu Phe Met Gln Gln385 390 395 400Pro Trp Thr Gly Phe
Thr Val Gln Ile Ser Phe Val Lys Gly Trp Gly 405 410 415Gln Cys Tyr
Thr Arg Gln Phe Ile Ser Ser Cys Pro Cys Trp Leu Glu 420 425 430Val
Ile Phe Asn Ser Arg Lys Gly Glu Leu Asn Ser Lys Leu Glu Gly 435 440
445Lys Pro Ile Pro Asn Pro Leu Leu Gly Leu Asp Ser Thr 450 455
46012426PRTHomo sapiens 12Met Phe Arg Thr Lys Arg Ser Ala Leu Val
Arg Arg Leu Trp Arg Ser1 5 10 15Arg Ala Pro Gly Gly Glu Asp Glu Glu
Glu Gly Ala Gly Gly Gly Gly 20 25 30Gly Gly Gly Glu Leu Arg Gly Glu
Gly Ala Thr Asp Ser Arg Ala His 35 40 45Gly Ala Gly Gly Gly Gly Pro
Gly Arg Ala Gly Cys Cys Leu Gly Lys 50 55 60Ala Val Arg Gly Ala Lys
Gly His His His Pro His Pro Pro Ala Ala65 70 75 80Gly Ala Gly Ala
Ala Gly Gly Ala Glu Ala Asp Leu Lys Ala Leu Thr 85 90 95His Ser Val
Leu Lys Lys Leu Lys Glu Arg Gln Leu Glu Leu Leu Leu 100 105 110Gln
Ala Val Glu Ser Arg Gly Gly Thr Arg Thr Ala Cys Leu Leu Leu 115 120
125Pro Gly Arg Leu Asp Cys Arg Leu Gly Pro Gly Ala Pro Ala Gly Ala
130 135 140Gln Pro Ala Gln Pro Pro Ser Ser Tyr Ser Leu Pro Leu Leu
Leu Cys145 150 155 160Lys Val Phe Arg Trp Pro Asp Leu Arg His Ser
Ser Glu Val Lys Arg 165 170 175Leu Cys Cys Cys Glu Ser Tyr Gly Lys
Ile Asn Pro Glu Leu Val Cys 180 185 190Cys Asn Pro His His Leu Ser
Arg Leu Cys Glu Leu Glu Ser Pro Pro 195 200 205Pro Pro Tyr Ser Arg
Tyr Pro Met Asp Phe Leu Lys Pro Thr Ala Asp 210 215 220Cys Pro Asp
Ala Val Pro Ser Ser Ala Glu Thr Gly Gly Thr Asn Tyr225 230 235
240Leu Ala Pro Gly Gly Leu Ser Asp Ser Gln Leu Leu Leu Glu Pro Gly
245 250 255Asp Arg Ser His Trp Cys Val Val Ala Tyr Trp Glu Glu Lys
Thr Arg 260 265 270Val Gly Arg Leu Tyr Cys Val Gln Glu Pro Ser Leu
Asp Ile Phe Tyr 275 280 285Asp Leu Pro Gln Gly Asn Gly Phe Cys Leu
Gly Gln Leu Asn Ser Asp 290 295 300Asn Lys Ser Gln Leu Val Gln Lys
Val Arg Ser Lys Ile Gly Cys Gly305 310 315 320Ile Gln Leu Thr Arg
Glu Val Asp Gly Val Trp Val Tyr Asn Arg Ser 325 330 335Ser Tyr Pro
Ile Phe Ile Lys Ser Ala Thr Leu Asp Asn Pro Asp Ser 340 345 350Arg
Thr Leu Leu Val His Lys Val Phe Pro Gly Phe Ser Ile Lys Ala 355 360
365Phe Asp Tyr Glu Lys Ala Tyr Ser Leu Gln Arg Pro Asn Asp His Glu
370 375 380Phe Met Gln Gln Pro Trp Thr Gly Phe Thr Val Gln Ile Ser
Phe Val385 390 395 400Lys Gly Trp Gly Gln Cys Tyr Thr Arg Gln Phe
Ile Ser Ser Cys Pro 405 410 415Cys Trp Leu Glu Val Ile Phe Asn Ser
Arg 420 4251320DNAArtificial SequenceDescription of Artificial
Sequence Synthetic primer 13tggaattcct ggtctggttt
201420DNAArtificial SequenceDescription of Artificial Sequence
Synthetic primer 14gccaagctgc tcttccagta 201521DNAArtificial
SequenceDescription of Artificial Sequence Synthetic primer
15tctcaggggg ccaaaggtgt t 211622DNAArtificial SequenceDescription
of Artificial Sequence Synthetic primer 16tcccagcacc tgaatcacat gg
221710DNAUnknownDescription of Unknown Smad binding element
oligonucleotide 17tgtctgtgct 101810DNAArtificial
SequenceDescription of Artificial Sequence Synthetic
oligonucleotide 18tgatagagct 101926DNAArtificial
SequenceDescription of Artificial Sequence Synthetic primer
19atcctcgagt atcctccagg tctggg 262028DNAArtificial
SequenceDescription of Artificial Sequence Synthetic primer
20gccaagctta gcgtccagcg ttaacctg 28211392DNAArtificial
SequenceDescription of
Artificial Sequence Synthetic polynucleotide 21ggatccggcc
gtaaaaaacg ccgtcaacgc cgccgtggtt tccgtacgaa acgctcggcc 60ctggtccgtc
gcctgtggcg ctcccgtgct ccgggtggtg aagatgaaga agaaggtgct
120ggcggcggtg gcggtggcgg tgaactgcgt ggcgagggtg caaccgatag
tcgtgcacac 180ggtgcaggcg gtggcggtcc gggtcgtgct ggttgctgtc
tgggtaaagc tgtgcgcggc 240gcgaaaggtc atcaccatcc gcacccgccg
gcagcaggtg caggtgcagc tggcggtgcg 300gaagccgatc tgaaagccct
gacccatagt gtcctgaaaa aactgaaaga acgtcagctg 360gagctgctgc
tgcaagcagt agaatcccgt ggcggtaccc gtacggcttg tctgctgctg
420ccgggtcgtc tggattgccg tctgggtccg ggtgcaccgg ctggtgcgca
gccggcacaa 480ccgccgagct cttacagcct gccgctgctg ctgtgtaaag
tgtttcgttg gccggacctg 540cgccacagtt ccgaagttaa acgcctgtgc
tgttgcgaga gctatggcaa aattaacccg 600gaactggttt gttgcaatcc
gcaccatctg tctcgtctgt gtgaactgga gagcccgccg 660ccgccgtatt
ctcgttaccc gatggatttc ctgaaaccga ctgcagattg cccggacgca
720gtcccgtcat cggctgagac cggcggcacc aactatctgg caccgggcgg
tctgagtgat 780tcccagctgc tgctggaacc gggcgaccgt tcacattggt
gtgtggttgc ctattgggaa 840gagaaaacgc gtgtcggtcg cctgtactgc
gtacaggaac cgtcgctgga tatcttttat 900gacctgccgc agggcaatgg
tttctgtctg ggccaactga actcagataa taaatcgcag 960ctggtgcaaa
aagttcgctc aaaaattggc tgcggtatcc agctgacccg tgaagttgac
1020ggtgtctggg tatataaccg cagctcttac ccgattttta tcaaaagtgc
caccctggat 1080ggtgtctggg tatataaccg cagctcttac ccgattttta
tcaaaagtgc caccctggat 1140aatccggact cccgtacgct gctggtccac
aaagtatttc cgggcttctc aatcaaagcg 1200ccgtggacgg gttttactgt
gcagatctct ttcgttaaag gctggggtca atgctacacc 1260cgtcagttta
tctcgtcctg tccgtgctgg ctggaagtga ttttcaatag ccgcaagggc
1320gagctcaatt cgaagcttga aggtaagcct atccctaacc ctctcctcgg
tctcgattct 1380acgtgagtcg ac 1392221281DNAHomo sapiens 22atgttcagga
ccaaacgatc tgcgctcgtc cggcgtctct ggaggagccg tgcgcccggc 60ggcgaggacg
aggaggaggg cgcaggggga ggtggaggag gaggcgagct gcggggagaa
120ggggcgacgg acagccgagc gcatggggcc ggtggcggcg gcccgggcag
ggctggatgc 180tgcctgggca aggcggtgcg aggtgccaaa ggtcaccacc
atccccaccc gccagccgcg 240ggcgccggcg cggccggggg cgccgaggcg
gatctgaagg cgctcacgca ctcggtgctc 300aagaaactga aggagcggca
gctggagctg ctgctccagg ccgtggagtc ccgcggcggg 360acgcgcaccg
cgtgcctcct gctgcccggc cgcctggact gcaggctggg cccgggggcg
420cccgccggcg cgcagcctgc gcagccgccc tcgtcctact cgctccccct
cctgctgtgc 480aaagtgttca ggtggccgga tctcaggcat tcctcggaag
tcaagaggct gtgttgctgt 540gaatcttacg ggaagatcaa ccccgagctg
gtgtgctgca acccccatca ccttagccga 600ctctgcgaac tagagtctcc
cccccctcct tactccagat acccgatgga ttttctcaaa 660ccaactgcag
actgtccaga tgctgtgcct tcctccgctg aaacaggggg aacgaattat
720ctggcccctg gggggctttc agattcccaa cttcttctgg agcctgggga
tcggtcacac 780tggtgcgtgg tggcatactg ggaggagaag acgagagtgg
ggaggctcta ctgtgtccag 840gagccctctc tggatatctt ctatgatcta
cctcagggga atggcttttg cctcggacag 900ctcaattcgg acaacaagag
tcagctggtg cagaaggtgc ggagcaaaat cggctgcggc 960atccagctga
cgcgggaggt ggatggtgtg tgggtgtaca accgcagcag ttaccccatc
1020ttcatcaagt ccgccacact ggacaacccg gactccagga cgctgttggt
acacaaggtg 1080ttccccggtt tctccatcaa ggctttcgac tacgagaagg
cgtacagcct gcagcggccc 1140aatgaccacg agtttatgca gcagccgtgg
acgggcttta ccgtgcagat cagctttgtg 1200aagggctggg gtcagtgcta
cacccgccag ttcatcagca gctgcccgtg ctggctagag 1260gtcatcttca
acagccggta g 1281231392DNAArtificial SequenceDescription of
Artificial Sequence Synthetic polynucleotide 23ggatccggcc
gtaaaaaacg ccgtcaacgc cgccgtggtt tccgtacgaa acgctcggcc 60ctggtccgtc
gcctgtggcg ctcccgtgct ccgggtggtg aagatgaaga agaaggtgct
120ggcggcggtg gcggtggcgg tgaactgcgt ggcgagggtg caaccgatag
tcgtgcacac 180ggtgcaggcg gtggcggtcc gggtcgtgct ggttgctgtc
tgggtaaagc tgtgcgcggc 240gcgaaaggtc atcaccatcc gcacccgccg
gcagcaggtg caggtgcagc tggcggtgcg 300gaagccgatc tgaaagccct
gacccatagt gtcctgaaaa aactgaaaga acgtcagctg 360gagctgctgc
tgcaagcagt agaatcccgt ggcggtaccc gtacggcttg tctgctgctg
420ccgggtcgtc tggattgccg tctgggtccg ggtgcaccgg ctggtgcgca
gccggcacaa 480ccgccgagct cttacagcct gccgctgctg ctgtgtaaag
tgtttcgttg gccggacctg 540cgccacagtt ccgaagttaa acgcctgtgc
tgttgcgaga gctatggcaa aattaacccg 600gaactggttt gttgcaatcc
gcaccatctg tctcgtctgt gtgaactgga gagcccgccg 660ccgccgtatt
ctcgttaccc gatggatttc ctgaaaccga ctgcagattg cccggacgca
720gtcccgtcat cggctgagac cggcggcacc aactatctgg caccgggcgg
tctgagtgat 780tcccagctgc tgctggaacc gggcgaccgt tcacattggt
gtgtggttgc ctattgggaa 840gagaaaacgc gtgtcggtcg cctgtactgc
gtacaggaac cgtcgctgga tatcttttat 900gacctgccgc agggcaatgg
tttctgtctg ggccaactga actcagataa taaatcgcag 960ctggtgcaaa
aagttcgctc aaaaattggc tgcggtatcc agctgacccg tgaagttgac
1020ggtgtctggg tatataaccg cagctcttac ccgattttta tcaaaagtgc
caccctggat 1080aatccggact cccgtacgct gctggtccac aaagtatttc
cgggcttctc aatcaaagcg 1140ttcgattacg agaaagccta ctcgctgcag
cgcccgaacg accatgaatt catgcagcaa 1200ccgtggacgg gttttactgt
gcagatctct ttcgttaaag gctggggtca atgctacacc 1260cgtcagttta
tctcgtcctg tccgtgctgg ctggaagtga ttttcaatag ccgcaagggc
1320gagctcaatt cgaagcttga aggtaagcct atccctaacc ctctcctcgg
tctcgattct 1380acgtgagtcg ac 139224618DNAArtificial
SequenceDescription of Artificial Sequence Synthetic
polynucleotideCDS(1)..(609) 24gga tcc ggc cgt aaa aaa cgc cgt caa
cgc cgc cgt tca cat tgg tgt 48Gly Ser Gly Arg Lys Lys Arg Arg Gln
Arg Arg Arg Ser His Trp Cys1 5 10 15gtg gtt gcc tat tgg gaa gag aaa
acg cgt gtc ggt cgc ctg tac tgc 96Val Val Ala Tyr Trp Glu Glu Lys
Thr Arg Val Gly Arg Leu Tyr Cys 20 25 30gta cag gaa ccg tcg ctg gat
atc ttt tat gac ctg ccg cag ggc aat 144Val Gln Glu Pro Ser Leu Asp
Ile Phe Tyr Asp Leu Pro Gln Gly Asn 35 40 45ggt ttc tgt ctg ggc caa
ctg aac tca gat aat aaa tcg cag ctg gtg 192Gly Phe Cys Leu Gly Gln
Leu Asn Ser Asp Asn Lys Ser Gln Leu Val 50 55 60caa aaa gtt cgc tca
aaa att ggc tgc ggt atc cag ctg acc cgt gaa 240Gln Lys Val Arg Ser
Lys Ile Gly Cys Gly Ile Gln Leu Thr Arg Glu65 70 75 80gtt gac ggt
gtc tgg gta tat aac cgc agc tct tac ccg att ttt atc 288Val Asp Gly
Val Trp Val Tyr Asn Arg Ser Ser Tyr Pro Ile Phe Ile 85 90 95aaa agt
gcc acc ctg gat aat ccg gac tcc cgt acg ctg ctg gtc cac 336Lys Ser
Ala Thr Leu Asp Asn Pro Asp Ser Arg Thr Leu Leu Val His 100 105
110aaa gta ttt ccg ggc ttc tca atc aaa gcg ttc gat tac gag aaa gcc
384Lys Val Phe Pro Gly Phe Ser Ile Lys Ala Phe Asp Tyr Glu Lys Ala
115 120 125tac tcg ctg cag cgc ccg aac gac cat gaa ttc atg cag caa
ccg tgg 432Tyr Ser Leu Gln Arg Pro Asn Asp His Glu Phe Met Gln Gln
Pro Trp 130 135 140acg ggt ttt act gtg cag atc tct ttc gtt aaa ggc
tgg ggt caa tgc 480Thr Gly Phe Thr Val Gln Ile Ser Phe Val Lys Gly
Trp Gly Gln Cys145 150 155 160tac acc cgt cag ttt atc tcg tcc tgt
ccg tgc tgg ctg gaa gtg att 528Tyr Thr Arg Gln Phe Ile Ser Ser Cys
Pro Cys Trp Leu Glu Val Ile 165 170 175ttc aat agc cgc aag ggc gag
ctc aat tcg aag ctt gaa ggt aag cct 576Phe Asn Ser Arg Lys Gly Glu
Leu Asn Ser Lys Leu Glu Gly Lys Pro 180 185 190atc cct aac cct ctc
ctc ggt ctc gat tct acg tgagtcgac 618Ile Pro Asn Pro Leu Leu Gly
Leu Asp Ser Thr 195 20025203PRTArtificial SequenceDescription of
Artificial Sequence Synthetic polypeptide 25Gly Ser Gly Arg Lys Lys
Arg Arg Gln Arg Arg Arg Ser His Trp Cys1 5 10 15Val Val Ala Tyr Trp
Glu Glu Lys Thr Arg Val Gly Arg Leu Tyr Cys 20 25 30Val Gln Glu Pro
Ser Leu Asp Ile Phe Tyr Asp Leu Pro Gln Gly Asn 35 40 45Gly Phe Cys
Leu Gly Gln Leu Asn Ser Asp Asn Lys Ser Gln Leu Val 50 55 60Gln Lys
Val Arg Ser Lys Ile Gly Cys Gly Ile Gln Leu Thr Arg Glu65 70 75
80Val Asp Gly Val Trp Val Tyr Asn Arg Ser Ser Tyr Pro Ile Phe Ile
85 90 95Lys Ser Ala Thr Leu Asp Asn Pro Asp Ser Arg Thr Leu Leu Val
His 100 105 110Lys Val Phe Pro Gly Phe Ser Ile Lys Ala Phe Asp Tyr
Glu Lys Ala 115 120 125Tyr Ser Leu Gln Arg Pro Asn Asp His Glu Phe
Met Gln Gln Pro Trp 130 135 140Thr Gly Phe Thr Val Gln Ile Ser Phe
Val Lys Gly Trp Gly Gln Cys145 150 155 160Tyr Thr Arg Gln Phe Ile
Ser Ser Cys Pro Cys Trp Leu Glu Val Ile 165 170 175Phe Asn Ser Arg
Lys Gly Glu Leu Asn Ser Lys Leu Glu Gly Lys Pro 180 185 190Ile Pro
Asn Pro Leu Leu Gly Leu Asp Ser Thr 195 20026861DNAArtificial
SequenceDescription of Artificial Sequence Synthetic
polynucleotideCDS(1)..(852) 26gga tcc ggc cgt aaa aaa cgc cgt caa
cgc cgc cgt ggt ttc cgt acg 48Gly Ser Gly Arg Lys Lys Arg Arg Gln
Arg Arg Arg Gly Phe Arg Thr1 5 10 15aaa cgc tcg gcc ctg gtc cgt cgc
ctg tgg cgc tcc cgt gct ccg ggt 96Lys Arg Ser Ala Leu Val Arg Arg
Leu Trp Arg Ser Arg Ala Pro Gly 20 25 30ggt gaa gat gaa gaa gaa ggt
gct ggc ggc ggt ggc ggt ggc ggt gaa 144Gly Glu Asp Glu Glu Glu Gly
Ala Gly Gly Gly Gly Gly Gly Gly Glu 35 40 45ctg cgt ggc gag ggt gca
acc gat agt cgt gca cac ggt gca ggc ggt 192Leu Arg Gly Glu Gly Ala
Thr Asp Ser Arg Ala His Gly Ala Gly Gly 50 55 60ggc ggt ccg ggt cgt
gct ggt tgc tgt ctg ggt aaa gct gtg cgc ggc 240Gly Gly Pro Gly Arg
Ala Gly Cys Cys Leu Gly Lys Ala Val Arg Gly65 70 75 80gcg aaa ggt
cat cac cat ccg cac ccg ccg gca gca ggt gca ggt gca 288Ala Lys Gly
His His His Pro His Pro Pro Ala Ala Gly Ala Gly Ala 85 90 95gct ggc
ggt gcg gaa gcc gat ctg aaa gcc ctg acc cat agt gtc ctg 336Ala Gly
Gly Ala Glu Ala Asp Leu Lys Ala Leu Thr His Ser Val Leu 100 105
110aaa aaa ctg aaa gaa cgt cag ctg gag ctg ctg ctg caa gca gta gaa
384Lys Lys Leu Lys Glu Arg Gln Leu Glu Leu Leu Leu Gln Ala Val Glu
115 120 125tcc cgt ggc ggt acc cgt acg gct tgt ctg ctg ctg ccg ggt
cgt ctg 432Ser Arg Gly Gly Thr Arg Thr Ala Cys Leu Leu Leu Pro Gly
Arg Leu 130 135 140gat tgc cgt ctg ggt ccg ggt gca ccg gct ggt gcg
cag ccg gca caa 480Asp Cys Arg Leu Gly Pro Gly Ala Pro Ala Gly Ala
Gln Pro Ala Gln145 150 155 160ccg ccg agc tct tac agc ctg ccg ctg
ctg ctg tgt aaa gtg ttt cgt 528Pro Pro Ser Ser Tyr Ser Leu Pro Leu
Leu Leu Cys Lys Val Phe Arg 165 170 175tgg ccg gac ctg cgc cac agt
tcc gaa gtt aaa cgc ctg tgc tgt tgc 576Trp Pro Asp Leu Arg His Ser
Ser Glu Val Lys Arg Leu Cys Cys Cys 180 185 190gag agc tat ggc aaa
att aac ccg gaa ctg gtt tgt tgc aat ccg cac 624Glu Ser Tyr Gly Lys
Ile Asn Pro Glu Leu Val Cys Cys Asn Pro His 195 200 205cat ctg tct
cgt ctg tgt gaa ctg gag agc ccg ccg ccg ccg tat tct 672His Leu Ser
Arg Leu Cys Glu Leu Glu Ser Pro Pro Pro Pro Tyr Ser 210 215 220cgt
tac ccg atg gat ttc ctg aaa ccg act gca gat tgc ccg gac gca 720Arg
Tyr Pro Met Asp Phe Leu Lys Pro Thr Ala Asp Cys Pro Asp Ala225 230
235 240gtc ccg tca tcg gct gag acc ggc ggc acc aac tat ctg gca ccg
ggc 768Val Pro Ser Ser Ala Glu Thr Gly Gly Thr Asn Tyr Leu Ala Pro
Gly 245 250 255ggt ctg agt gat tcc cag ctg ctg ctg gaa ccg ggc gac
cgt ggt aag 816Gly Leu Ser Asp Ser Gln Leu Leu Leu Glu Pro Gly Asp
Arg Gly Lys 260 265 270cct atc cct aac cct ctc ctc ggt ctc gat tct
acg tgagtcgac 861Pro Ile Pro Asn Pro Leu Leu Gly Leu Asp Ser Thr
275 28027284PRTArtificial SequenceDescription of Artificial
Sequence Synthetic polypeptide 27Gly Ser Gly Arg Lys Lys Arg Arg
Gln Arg Arg Arg Gly Phe Arg Thr1 5 10 15Lys Arg Ser Ala Leu Val Arg
Arg Leu Trp Arg Ser Arg Ala Pro Gly 20 25 30Gly Glu Asp Glu Glu Glu
Gly Ala Gly Gly Gly Gly Gly Gly Gly Glu 35 40 45Leu Arg Gly Glu Gly
Ala Thr Asp Ser Arg Ala His Gly Ala Gly Gly 50 55 60Gly Gly Pro Gly
Arg Ala Gly Cys Cys Leu Gly Lys Ala Val Arg Gly65 70 75 80Ala Lys
Gly His His His Pro His Pro Pro Ala Ala Gly Ala Gly Ala 85 90 95Ala
Gly Gly Ala Glu Ala Asp Leu Lys Ala Leu Thr His Ser Val Leu 100 105
110Lys Lys Leu Lys Glu Arg Gln Leu Glu Leu Leu Leu Gln Ala Val Glu
115 120 125Ser Arg Gly Gly Thr Arg Thr Ala Cys Leu Leu Leu Pro Gly
Arg Leu 130 135 140Asp Cys Arg Leu Gly Pro Gly Ala Pro Ala Gly Ala
Gln Pro Ala Gln145 150 155 160Pro Pro Ser Ser Tyr Ser Leu Pro Leu
Leu Leu Cys Lys Val Phe Arg 165 170 175Trp Pro Asp Leu Arg His Ser
Ser Glu Val Lys Arg Leu Cys Cys Cys 180 185 190Glu Ser Tyr Gly Lys
Ile Asn Pro Glu Leu Val Cys Cys Asn Pro His 195 200 205His Leu Ser
Arg Leu Cys Glu Leu Glu Ser Pro Pro Pro Pro Tyr Ser 210 215 220Arg
Tyr Pro Met Asp Phe Leu Lys Pro Thr Ala Asp Cys Pro Asp Ala225 230
235 240Val Pro Ser Ser Ala Glu Thr Gly Gly Thr Asn Tyr Leu Ala Pro
Gly 245 250 255Gly Leu Ser Asp Ser Gln Leu Leu Leu Glu Pro Gly Asp
Arg Gly Lys 260 265 270Pro Ile Pro Asn Pro Leu Leu Gly Leu Asp Ser
Thr 275 280281442DNAArtificial SequenceDescription of Artificial
Sequence Synthetic polynucleotide 28ggatccggcc gtaaaaaacg
ccgtcaacgc cgccgtggtt tccgtacgaa acgctcggcc 60ctggtccgtc gcctgtggcg
ctcccgtgct ccgggtggtg aagatgaaga agaaggtgct 120ggcggcggtg
gcggtggcgg tgaactgcgt aagatgaaga agaaggtgct ggcggcggtg
180gcggtggcgg tgaactgcgt ggcgagggtg caaccgatag tcgtgcacac
ggtgcaggcg 240gtggcggtcc gggtcgtgct ggttgctgtc tgggtaaagc
tgtgcgcggc gcgaaaggtc 300atcaccatcc gcacccgccg gcagcaggtg
caggtgcagc tggcggtgcg gaagccgatc 360tgaaagccct gacccatagt
gtcctgaaaa aactgaaaga acgtcagctg gagctgctgc 420tgcaagcagt
agaatcccgt ggcggtaccc gtacggcttg tctgctgctg ccgggtcgtc
480tggattgccg tctgggtccg ggtgcaccgg ctggtgcgca gccggcacaa
ccgccgagct 540cttacagcct gccgctgctg ctgtgtaaag tgtttcgttg
gccggacctg cgccacagtt 600ccgaagttaa acgcctgtgc tgttgcgaga
gctatggcaa aattaacccg gaactggttt 660gttgcaatcc gcaccatctg
tctcgtctgt gtgaactgga gagcccgccg ccgccgtatt 720ctcgttaccc
gatggatttc ctgaaaccga ctgcagattg cccggacgca gtcccgtcat
780cggctgagac cggcggcacc aactatctgg caccgggcgg tctgagtgat
tcccagctgc 840tgctggaacc gggcgaccgt tcacattggt gtgtggttgc
ctattgggaa gagaaaacgc 900gtgtcggtcg cctgtactgc gtacaggaac
cgtcgctgga tatcttttat gacctgccgc 960agggcaatgg tttctgtctg
ggccaactga actcagataa taaatcgcag ctggtgcaaa 1020aagttcgctc
aaaaattggc tgcggtatcc agctgacccg tgaagttgac ggtgtctggg
1080tatataaccg cagctcttac ccgattttta tcaaaagtgc caccctggat
aatccggact 1140cccgtacgct gctggtccac aaagtatttc cgggcttctc
aatcaaagcg ttcgattacg 1200agaaagccta ctcgctgcag cgcccgaacg
accatgaatt catgcagcaa ccgtggacgg 1260gttttactgt gcagatctct
ttcgttaaag gctggggtca atgctacacc cgtcagttta 1320tctcgtcctg
tccgtgctgg ctggaagtga ttttcaatag ccgcaagggc gagctcaatt
1380cgaagcttga aggtaagcct atccctaacc ctctcctcgg tctcgattct
acgtgagtcg 1440ac 14422966DNAArtificial SequenceDescription of
Artificial Sequence Synthetic oligonucleotide 29agcagctttc
agtcagtagc agccagcagc cagagcagca gccagtagca gcagcagcag 60agcagc
66301392DNAArtificial SequenceDescription of Artificial Sequence
Synthetic polynucleotideCDS(1)..(1383) 30gga tcc ggc cgt aaa aaa
cgc cgt caa cgc cgc cgt ggt ttc cgt acg 48Gly Ser Gly Arg Lys Lys
Arg Arg Gln Arg Arg Arg Gly Phe Arg Thr1 5 10 15aaa cgc agc gcc ctg
gtc cgt cgc ctg tgg cgc agc cgt gct ccg ggt 96Lys Arg Ser Ala Leu
Val Arg Arg Leu Trp Arg Ser Arg Ala Pro Gly 20 25 30ggt gaa gat gaa
gaa gaa ggt gct ggc ggc ggt ggc ggt ggc ggt gaa 144Gly Glu Asp Glu
Glu Glu Gly Ala Gly Gly Gly Gly Gly Gly Gly Glu 35 40 45ctg cgt ggc
gag ggt gca acc gat agc cgt gca cat ggt gca ggc ggt 192Leu Arg Gly
Glu Gly Ala Thr Asp Ser Arg Ala His Gly Ala Gly Gly 50 55 60ggc ggt
ccg ggt cgt gct ggt tgc tgt ctg ggt aaa gct gtg cgc ggc 240Gly Gly
Pro Gly Arg Ala Gly Cys Cys Leu Gly Lys Ala Val Arg Gly65
70 75 80gcg aaa ggt cat cat cat ccg cat ccg ccg gca gca ggt gca ggt
gca 288Ala Lys Gly His His His Pro His Pro Pro Ala Ala Gly Ala Gly
Ala 85 90 95gct ggc ggt gcg gaa gcc gat ctg aaa gcc ctg acc cat agc
gtc ctg 336Ala Gly Gly Ala Glu Ala Asp Leu Lys Ala Leu Thr His Ser
Val Leu 100 105 110aaa aaa ctg aaa gaa cgt cag ctg gag ctg ctg ctg
caa gca gta gaa 384Lys Lys Leu Lys Glu Arg Gln Leu Glu Leu Leu Leu
Gln Ala Val Glu 115 120 125agc cgt ggc ggt acc cgt acg gct tgt ctg
ctg ctg ccg ggt cgt ctg 432Ser Arg Gly Gly Thr Arg Thr Ala Cys Leu
Leu Leu Pro Gly Arg Leu 130 135 140gat tgc cgt ctg ggt ccg ggt gca
ccg gct ggt gcg cag ccg gca caa 480Asp Cys Arg Leu Gly Pro Gly Ala
Pro Ala Gly Ala Gln Pro Ala Gln145 150 155 160ccg ccg agc agc tac
agc ctg ccg ctg ctg ctg tgt aaa gtg ttt cgt 528Pro Pro Ser Ser Tyr
Ser Leu Pro Leu Leu Leu Cys Lys Val Phe Arg 165 170 175tgg ccg gac
ctg cgc cat agc agc gaa gtt aaa cgc ctg tgc tgt tgc 576Trp Pro Asp
Leu Arg His Ser Ser Glu Val Lys Arg Leu Cys Cys Cys 180 185 190gag
agc tat ggc aaa att aac ccg gaa ctg gtt tgt tgc aat ccg cat 624Glu
Ser Tyr Gly Lys Ile Asn Pro Glu Leu Val Cys Cys Asn Pro His 195 200
205cat ctg agc cgt ctg tgt gaa ctg gag agc ccg ccg ccg ccg tat agc
672His Leu Ser Arg Leu Cys Glu Leu Glu Ser Pro Pro Pro Pro Tyr Ser
210 215 220cgt tac ccg ctg gat ttc ctg aaa ccg act gca gat tgc ccg
gac gca 720Arg Tyr Pro Leu Asp Phe Leu Lys Pro Thr Ala Asp Cys Pro
Asp Ala225 230 235 240gtc ccg agc agc gct gag acc ggc ggc acc aac
tat ctg gca ccg ggc 768Val Pro Ser Ser Ala Glu Thr Gly Gly Thr Asn
Tyr Leu Ala Pro Gly 245 250 255ggt ctg agc gat agc cag ctg ctg ctg
gaa ccg ggc gac cgt agc cat 816Gly Leu Ser Asp Ser Gln Leu Leu Leu
Glu Pro Gly Asp Arg Ser His 260 265 270tgg tgt gtg gtt gcc tat tgg
gaa gag aaa acg cgt gtc ggt cgc ctg 864Trp Cys Val Val Ala Tyr Trp
Glu Glu Lys Thr Arg Val Gly Arg Leu 275 280 285tac tgc gta cag gaa
ccg agc ctg gat atc ttt tat gac ctg ccg cag 912Tyr Cys Val Gln Glu
Pro Ser Leu Asp Ile Phe Tyr Asp Leu Pro Gln 290 295 300ggc aat ggt
ttc tgt ctg ggc caa ctg aac agc gat aat aaa agc cag 960Gly Asn Gly
Phe Cys Leu Gly Gln Leu Asn Ser Asp Asn Lys Ser Gln305 310 315
320ctg gtg caa aaa gtt cgc agc aaa att ggc tgc ggt atc cag ctg acc
1008Leu Val Gln Lys Val Arg Ser Lys Ile Gly Cys Gly Ile Gln Leu Thr
325 330 335cgt gaa gtt gac ggt gtc tgg gta tat aac cgc agc agc tac
ccg att 1056Arg Glu Val Asp Gly Val Trp Val Tyr Asn Arg Ser Ser Tyr
Pro Ile 340 345 350ttt atc aaa agc gcc acc ctg gat aat ccg gac agc
cgt acg ctg ctg 1104Phe Ile Lys Ser Ala Thr Leu Asp Asn Pro Asp Ser
Arg Thr Leu Leu 355 360 365gtc cat aaa gta ttt ccg ggc ttc agc atc
aaa gcg ttc gat tac gag 1152Val His Lys Val Phe Pro Gly Phe Ser Ile
Lys Ala Phe Asp Tyr Glu 370 375 380aaa gcc tac agc ctg cag cgc ccg
aac gac cat gaa ttc atg cag caa 1200Lys Ala Tyr Ser Leu Gln Arg Pro
Asn Asp His Glu Phe Met Gln Gln385 390 395 400ccg tgg acg ggt ttt
act gtg cag atc agc ttc gtt aaa ggc tgg ggt 1248Pro Trp Thr Gly Phe
Thr Val Gln Ile Ser Phe Val Lys Gly Trp Gly 405 410 415caa tgc tac
acc cgt cag ttt atc agc agc tgt ccg tgc tgg ctg gaa 1296Gln Cys Tyr
Thr Arg Gln Phe Ile Ser Ser Cys Pro Cys Trp Leu Glu 420 425 430gtg
att ttc aat agc cgc aag ggc gag ctc aat agc aag ctt gaa ggt 1344Val
Ile Phe Asn Ser Arg Lys Gly Glu Leu Asn Ser Lys Leu Glu Gly 435 440
445aag cct atc cct aac cct ctc ctc ggt ctc gat agc acg tgagtcgac
1392Lys Pro Ile Pro Asn Pro Leu Leu Gly Leu Asp Ser Thr 450 455
46031461PRTArtificial SequenceDescription of Artificial Sequence
Synthetic polypeptide 31Gly Ser Gly Arg Lys Lys Arg Arg Gln Arg Arg
Arg Gly Phe Arg Thr1 5 10 15Lys Arg Ser Ala Leu Val Arg Arg Leu Trp
Arg Ser Arg Ala Pro Gly 20 25 30Gly Glu Asp Glu Glu Glu Gly Ala Gly
Gly Gly Gly Gly Gly Gly Glu 35 40 45Leu Arg Gly Glu Gly Ala Thr Asp
Ser Arg Ala His Gly Ala Gly Gly 50 55 60Gly Gly Pro Gly Arg Ala Gly
Cys Cys Leu Gly Lys Ala Val Arg Gly65 70 75 80Ala Lys Gly His His
His Pro His Pro Pro Ala Ala Gly Ala Gly Ala 85 90 95Ala Gly Gly Ala
Glu Ala Asp Leu Lys Ala Leu Thr His Ser Val Leu 100 105 110Lys Lys
Leu Lys Glu Arg Gln Leu Glu Leu Leu Leu Gln Ala Val Glu 115 120
125Ser Arg Gly Gly Thr Arg Thr Ala Cys Leu Leu Leu Pro Gly Arg Leu
130 135 140Asp Cys Arg Leu Gly Pro Gly Ala Pro Ala Gly Ala Gln Pro
Ala Gln145 150 155 160Pro Pro Ser Ser Tyr Ser Leu Pro Leu Leu Leu
Cys Lys Val Phe Arg 165 170 175Trp Pro Asp Leu Arg His Ser Ser Glu
Val Lys Arg Leu Cys Cys Cys 180 185 190Glu Ser Tyr Gly Lys Ile Asn
Pro Glu Leu Val Cys Cys Asn Pro His 195 200 205His Leu Ser Arg Leu
Cys Glu Leu Glu Ser Pro Pro Pro Pro Tyr Ser 210 215 220Arg Tyr Pro
Leu Asp Phe Leu Lys Pro Thr Ala Asp Cys Pro Asp Ala225 230 235
240Val Pro Ser Ser Ala Glu Thr Gly Gly Thr Asn Tyr Leu Ala Pro Gly
245 250 255Gly Leu Ser Asp Ser Gln Leu Leu Leu Glu Pro Gly Asp Arg
Ser His 260 265 270Trp Cys Val Val Ala Tyr Trp Glu Glu Lys Thr Arg
Val Gly Arg Leu 275 280 285Tyr Cys Val Gln Glu Pro Ser Leu Asp Ile
Phe Tyr Asp Leu Pro Gln 290 295 300Gly Asn Gly Phe Cys Leu Gly Gln
Leu Asn Ser Asp Asn Lys Ser Gln305 310 315 320Leu Val Gln Lys Val
Arg Ser Lys Ile Gly Cys Gly Ile Gln Leu Thr 325 330 335Arg Glu Val
Asp Gly Val Trp Val Tyr Asn Arg Ser Ser Tyr Pro Ile 340 345 350Phe
Ile Lys Ser Ala Thr Leu Asp Asn Pro Asp Ser Arg Thr Leu Leu 355 360
365Val His Lys Val Phe Pro Gly Phe Ser Ile Lys Ala Phe Asp Tyr Glu
370 375 380Lys Ala Tyr Ser Leu Gln Arg Pro Asn Asp His Glu Phe Met
Gln Gln385 390 395 400Pro Trp Thr Gly Phe Thr Val Gln Ile Ser Phe
Val Lys Gly Trp Gly 405 410 415Gln Cys Tyr Thr Arg Gln Phe Ile Ser
Ser Cys Pro Cys Trp Leu Glu 420 425 430Val Ile Phe Asn Ser Arg Lys
Gly Glu Leu Asn Ser Lys Leu Glu Gly 435 440 445Lys Pro Ile Pro Asn
Pro Leu Leu Gly Leu Asp Ser Thr 450 455 46032861DNAArtificial
SequenceDescription of Artificial Sequence Synthetic
polynucleotideCDS(1)..(852) 32gga tcc ggc cgt aaa aaa cgc cgt caa
cgc cgc cgt ggt ttc cgt acg 48Gly Ser Gly Arg Lys Lys Arg Arg Gln
Arg Arg Arg Gly Phe Arg Thr1 5 10 15aaa cgc agc gcc ctg gtc cgt cgc
ctg tgg cgc agc cgt gct ccg ggt 96Lys Arg Ser Ala Leu Val Arg Arg
Leu Trp Arg Ser Arg Ala Pro Gly 20 25 30ggt gaa gat gaa gaa gaa ggt
gct ggc ggc ggt ggc ggt ggc ggt gaa 144Gly Glu Asp Glu Glu Glu Gly
Ala Gly Gly Gly Gly Gly Gly Gly Glu 35 40 45ctg cgt ggc gag ggt gca
acc gat agc cgt gca cat ggt gca ggc ggt 192Leu Arg Gly Glu Gly Ala
Thr Asp Ser Arg Ala His Gly Ala Gly Gly 50 55 60ggc ggt ccg ggt cgt
gct ggt tgc tgt ctg ggt aaa gct gtg cgc ggc 240Gly Gly Pro Gly Arg
Ala Gly Cys Cys Leu Gly Lys Ala Val Arg Gly65 70 75 80gcg aaa ggt
cat cat cat ccg cat ccg ccg gca gca ggt gca ggt gca 288Ala Lys Gly
His His His Pro His Pro Pro Ala Ala Gly Ala Gly Ala 85 90 95gct ggc
ggt gcg gaa gcc gat ctg aaa gcc ctg acc cat agc gtc ctg 336Ala Gly
Gly Ala Glu Ala Asp Leu Lys Ala Leu Thr His Ser Val Leu 100 105
110aaa aaa ctg aaa gaa cgt cag ctg gag ctg ctg ctg caa gca gta gaa
384Lys Lys Leu Lys Glu Arg Gln Leu Glu Leu Leu Leu Gln Ala Val Glu
115 120 125agc cgt ggc ggt acc cgt acg gct tgt ctg ctg ctg ccg ggt
cgt ctg 432Ser Arg Gly Gly Thr Arg Thr Ala Cys Leu Leu Leu Pro Gly
Arg Leu 130 135 140gat tgc cgt ctg ggt ccg ggt gca ccg gct ggt gcg
cag ccg gca caa 480Asp Cys Arg Leu Gly Pro Gly Ala Pro Ala Gly Ala
Gln Pro Ala Gln145 150 155 160ccg ccg agc agc tac agc ctg ccg ctg
ctg ctg tgt aaa gtg ttt cgt 528Pro Pro Ser Ser Tyr Ser Leu Pro Leu
Leu Leu Cys Lys Val Phe Arg 165 170 175tgg ccg gac ctg cgc cat agc
agc gaa gtt aaa cgc ctg tgc tgt tgc 576Trp Pro Asp Leu Arg His Ser
Ser Glu Val Lys Arg Leu Cys Cys Cys 180 185 190gag agc tat ggc aaa
att aac ccg gaa ctg gtt tgt tgc aat ccg cat 624Glu Ser Tyr Gly Lys
Ile Asn Pro Glu Leu Val Cys Cys Asn Pro His 195 200 205cat ctg agc
cgt ctg tgt gaa ctg gag agc ccg ccg ccg ccg tat agc 672His Leu Ser
Arg Leu Cys Glu Leu Glu Ser Pro Pro Pro Pro Tyr Ser 210 215 220cgt
tac ccg atg gat ttc ctg aaa ccg act gca gat tgc ccg gac gca 720Arg
Tyr Pro Met Asp Phe Leu Lys Pro Thr Ala Asp Cys Pro Asp Ala225 230
235 240gtc ccg agc agc gct gag acc ggc ggc acc aac tat ctg gca ccg
ggc 768Val Pro Ser Ser Ala Glu Thr Gly Gly Thr Asn Tyr Leu Ala Pro
Gly 245 250 255ggt ctg agc gat agc cag ctg ctg ctg gaa ccg ggc gac
cgt ggt aag 816Gly Leu Ser Asp Ser Gln Leu Leu Leu Glu Pro Gly Asp
Arg Gly Lys 260 265 270cct atc cct aac cct ctc ctc ggt ctc gat tct
acg tgagtcgac 861Pro Ile Pro Asn Pro Leu Leu Gly Leu Asp Ser Thr
275 28033558DNAArtificial SequenceDescription of Artificial
Sequence Synthetic polynucleotide 33ggatccggcc gtaaaaaacg
ccgtcaacgc cgccgttcac attggtgtgt ggttgcctat 60tgggaagaga aaacgcgtgt
cggtcgcctg tactgcgtac aggaaccgag cctggatatc 120ttttatgacc
tgccgcaggg caatggtttc tgtctgggcc aactgaacag cgataataaa
180agccagctgg tgcaaaaagt tcgcagcaaa attggctgcg gtatccagct
gacccgtgaa 240gttgacggtg tctgggtata taaccgcagc agctacccga
tttttatcaa aagcgccacc 300ctggataatc cggacagccg tacgctgctg
gtccataaag tatttccggg cttcagcatc 360aaagcgttcg attacgagaa
agcctacagc ctgcagcgcc cgaacgacca tgaattcatg 420cagcaaccgt
ggacgggttt tactgtgcag atcagcttcg ttaaaggctg gggtcaatgc
480aagggcgagc tcaatagcaa gcttgaaggt aagcctatcc ctaaccctct
cctcggtctc 540gatagcacgt gagtcgac 55834618DNAArtificial
SequenceDescription of Artificial Sequence Synthetic
polynucleotideCDS(1)..(609) 34gga tcc ggc cgt aaa aaa cgc cgt caa
cgc cgc cgt tca cat tgg tgt 48Gly Ser Gly Arg Lys Lys Arg Arg Gln
Arg Arg Arg Ser His Trp Cys1 5 10 15gtg gtt gcc tat tgg gaa gag aaa
acg cgt gtc ggt cgc ctg tac tgc 96Val Val Ala Tyr Trp Glu Glu Lys
Thr Arg Val Gly Arg Leu Tyr Cys 20 25 30gta cag gaa ccg agc ctg gat
atc ttt tat gac ctg ccg cag ggc aat 144Val Gln Glu Pro Ser Leu Asp
Ile Phe Tyr Asp Leu Pro Gln Gly Asn 35 40 45ggt ttc tgt ctg ggc caa
ctg aac agc gat aat aaa agc cag ctg gtg 192Gly Phe Cys Leu Gly Gln
Leu Asn Ser Asp Asn Lys Ser Gln Leu Val 50 55 60caa aaa gtt cgc agc
aaa att ggc tgc ggt atc cag ctg acc cgt gaa 240Gln Lys Val Arg Ser
Lys Ile Gly Cys Gly Ile Gln Leu Thr Arg Glu65 70 75 80gtt gac ggt
gtc tgg gta tat aac cgc agc agc tac ccg att ttt atc 288Val Asp Gly
Val Trp Val Tyr Asn Arg Ser Ser Tyr Pro Ile Phe Ile 85 90 95aaa agc
gcc acc ctg gat aat ccg gac agc cgt acg ctg ctg gtc cat 336Lys Ser
Ala Thr Leu Asp Asn Pro Asp Ser Arg Thr Leu Leu Val His 100 105
110aaa gta ttt ccg ggc ttc agc atc aaa gcg ttc gat tac gag aaa gcc
384Lys Val Phe Pro Gly Phe Ser Ile Lys Ala Phe Asp Tyr Glu Lys Ala
115 120 125tac agc ctg cag cgc ccg aac gac cat gaa ttc atg cag caa
ccg tgg 432Tyr Ser Leu Gln Arg Pro Asn Asp His Glu Phe Met Gln Gln
Pro Trp 130 135 140acg ggt ttt act gtg cag atc agc ttc gtt aaa ggc
tgg ggt caa tgc 480Thr Gly Phe Thr Val Gln Ile Ser Phe Val Lys Gly
Trp Gly Gln Cys145 150 155 160tac acc cgt cag ttt atc agc agc tgt
ccg tgc tgg ctg gaa gtg att 528Tyr Thr Arg Gln Phe Ile Ser Ser Cys
Pro Cys Trp Leu Glu Val Ile 165 170 175ttc aat agc cgc aag ggc gag
ctc aat agc aag ctt gaa ggt aag cct 576Phe Asn Ser Arg Lys Gly Glu
Leu Asn Ser Lys Leu Glu Gly Lys Pro 180 185 190atc cct aac cct ctc
ctc ggt ctc gat agc acg tgagtcgac 618Ile Pro Asn Pro Leu Leu Gly
Leu Asp Ser Thr 195 2003539DNAArtificial SequenceDescription of
Artificial Sequence Synthetic oligonucleotide 35tcgtcatcgt
catctttcct catcgtcttc gtctcgtct 39361365DNAArtificial
SequenceDescription of Artificial Sequence Synthetic
polynucleotideCDS(1)..(1356) 36gga tcc ggt cgt aaa aaa cgt cgt cag
cgt cgt cgt ggt ttc cgt acc 48Gly Ser Gly Arg Lys Lys Arg Arg Gln
Arg Arg Arg Gly Phe Arg Thr1 5 10 15aaa cgt tct gcg ctg gtt cgt cgt
ctg tgg cgt tct cgt gcg ccg ggt 96Lys Arg Ser Ala Leu Val Arg Arg
Leu Trp Arg Ser Arg Ala Pro Gly 20 25 30ggt gaa gac gaa gaa gaa ggt
gcg ggt ggt ggt ggt ggt ggt ggt gaa 144Gly Glu Asp Glu Glu Glu Gly
Ala Gly Gly Gly Gly Gly Gly Gly Glu 35 40 45ctg cgt ggt gaa ggt gcg
acc gac tct cgt gcg cac ggt gcg ggt ggt 192Leu Arg Gly Glu Gly Ala
Thr Asp Ser Arg Ala His Gly Ala Gly Gly 50 55 60ggt ggt ccg ggt cgt
gcg ggt tgc tgc ctg ggt aaa gcg gtt cgt ggt 240Gly Gly Pro Gly Arg
Ala Gly Cys Cys Leu Gly Lys Ala Val Arg Gly65 70 75 80gcg aaa ggt
cac cac cac ccg cac ccg ccg gcg gcg ggt gcg ggt gcg 288Ala Lys Gly
His His His Pro His Pro Pro Ala Ala Gly Ala Gly Ala 85 90 95gcg ggt
ggt gcg gaa gcg gac ctg aaa gcg ctg acc cac tct gtt ctg 336Ala Gly
Gly Ala Glu Ala Asp Leu Lys Ala Leu Thr His Ser Val Leu 100 105
110aaa aaa ctg aaa gaa cgt cag ctg gaa ctg ctg ctg cag gcg gtt gaa
384Lys Lys Leu Lys Glu Arg Gln Leu Glu Leu Leu Leu Gln Ala Val Glu
115 120 125tct cgt ggt ggt acc cgt acc gcg tgc ctg ctg ctg ccg ggt
cgt ctg 432Ser Arg Gly Gly Thr Arg Thr Ala Cys Leu Leu Leu Pro Gly
Arg Leu 130 135 140gac tgc cgt ctg ggt ccg ggt gcg ccg gcg ggt gcg
cag ccg gcg cag 480Asp Cys Arg Leu Gly Pro Gly Ala Pro Ala Gly Ala
Gln Pro Ala Gln145 150 155 160ccg ccg tct tct tac tct ctg ccg ctg
ctg ctg tgc aaa gtt ttc cgt 528Pro Pro Ser Ser Tyr Ser Leu Pro Leu
Leu Leu Cys Lys Val Phe Arg 165 170 175tgg ccg gac ctg cgt cac tct
tct gaa gtt aaa cgt ctg tgc tgc tgc 576Trp Pro Asp Leu Arg His Ser
Ser Glu Val Lys Arg Leu Cys Cys Cys 180 185 190gaa tct tac ggt aaa
atc aac ccg gaa ctg gtt tgc tgc aac ccg cac 624Glu Ser Tyr Gly Lys
Ile Asn Pro Glu Leu Val Cys Cys Asn Pro His 195 200 205cac ctg tct
cgt ctg tgc gaa ctg gaa tct ccg ccg ccg ccg tac tct 672His Leu Ser
Arg Leu Cys Glu Leu Glu Ser Pro Pro Pro Pro Tyr Ser 210 215 220cgt
tac ccg ctg gac ttc ctg aaa ccg acc gcg gac tgc ccg gac gcg 720Arg
Tyr Pro Leu Asp Phe Leu Lys Pro Thr Ala Asp Cys Pro Asp Ala225 230
235 240gtt ccg tct tct gcg gaa acc ggt ggt acc aac tac ctg gcg ccg
ggt 768Val Pro Ser Ser Ala Glu Thr Gly Gly Thr Asn Tyr Leu Ala Pro
Gly 245
250 255ggt ctg tct gac tct cag ctg ctg ctg gaa ccg ggt gac cgt tct
cac 816Gly Leu Ser Asp Ser Gln Leu Leu Leu Glu Pro Gly Asp Arg Ser
His 260 265 270tgg tgc gtt gtt gcg tac tgg gaa gaa aaa acc cgt gtt
ggt cgt ctg 864Trp Cys Val Val Ala Tyr Trp Glu Glu Lys Thr Arg Val
Gly Arg Leu 275 280 285tac tgc gtt cag gaa ccg tct ctg gac atc ttc
tac gac ctg ccg cag 912Tyr Cys Val Gln Glu Pro Ser Leu Asp Ile Phe
Tyr Asp Leu Pro Gln 290 295 300ggt aac ggt ttc tgc ctg ggt cag ctg
aac tct gac aac aaa tct cag 960Gly Asn Gly Phe Cys Leu Gly Gln Leu
Asn Ser Asp Asn Lys Ser Gln305 310 315 320ctg gtt cag aaa gtt cgt
tct aaa atc ggt tgc ggt atc cag ctg acc 1008Leu Val Gln Lys Val Arg
Ser Lys Ile Gly Cys Gly Ile Gln Leu Thr 325 330 335cgt gaa gtt gac
ggt gtt tgg gtt tac aac cgt tct tct tac ccg atc 1056Arg Glu Val Asp
Gly Val Trp Val Tyr Asn Arg Ser Ser Tyr Pro Ile 340 345 350ttc atc
aaa tct gcg acc ctg gac aac ccg gac tct cgt acc ctg ctg 1104Phe Ile
Lys Ser Ala Thr Leu Asp Asn Pro Asp Ser Arg Thr Leu Leu 355 360
365gtt cac aaa gtt ttc ccg ggt ttc tct atc aaa gcg ttc gac tac gaa
1152Val His Lys Val Phe Pro Gly Phe Ser Ile Lys Ala Phe Asp Tyr Glu
370 375 380aaa gcg tac tct ctg cag cgt ccg aac gac cac gaa ttc atg
cag cag 1200Lys Ala Tyr Ser Leu Gln Arg Pro Asn Asp His Glu Phe Met
Gln Gln385 390 395 400ccg tgg acc ggt ttc acc gtt cag atc tct ttc
gtt aaa ggt tgg ggt 1248Pro Trp Thr Gly Phe Thr Val Gln Ile Ser Phe
Val Lys Gly Trp Gly 405 410 415cag tgc tac acc cgt cag ttc atc tct
tct tgc ccg tgc tgg ctg gaa 1296Gln Cys Tyr Thr Arg Gln Phe Ile Ser
Ser Cys Pro Cys Trp Leu Glu 420 425 430gtt atc ttc aac tct cgt ggt
aaa ccg atc ccg aac ccg ctg ctg ggt 1344Val Ile Phe Asn Ser Arg Gly
Lys Pro Ile Pro Asn Pro Leu Leu Gly 435 440 445ctg gac tct acc
tgagtcgac 1365Leu Asp Ser Thr 45037452PRTArtificial
SequenceDescription of Artificial Sequence Synthetic polypeptide
37Gly Ser Gly Arg Lys Lys Arg Arg Gln Arg Arg Arg Gly Phe Arg Thr1
5 10 15Lys Arg Ser Ala Leu Val Arg Arg Leu Trp Arg Ser Arg Ala Pro
Gly 20 25 30Gly Glu Asp Glu Glu Glu Gly Ala Gly Gly Gly Gly Gly Gly
Gly Glu 35 40 45Leu Arg Gly Glu Gly Ala Thr Asp Ser Arg Ala His Gly
Ala Gly Gly 50 55 60Gly Gly Pro Gly Arg Ala Gly Cys Cys Leu Gly Lys
Ala Val Arg Gly65 70 75 80Ala Lys Gly His His His Pro His Pro Pro
Ala Ala Gly Ala Gly Ala 85 90 95Ala Gly Gly Ala Glu Ala Asp Leu Lys
Ala Leu Thr His Ser Val Leu 100 105 110Lys Lys Leu Lys Glu Arg Gln
Leu Glu Leu Leu Leu Gln Ala Val Glu 115 120 125Ser Arg Gly Gly Thr
Arg Thr Ala Cys Leu Leu Leu Pro Gly Arg Leu 130 135 140Asp Cys Arg
Leu Gly Pro Gly Ala Pro Ala Gly Ala Gln Pro Ala Gln145 150 155
160Pro Pro Ser Ser Tyr Ser Leu Pro Leu Leu Leu Cys Lys Val Phe Arg
165 170 175Trp Pro Asp Leu Arg His Ser Ser Glu Val Lys Arg Leu Cys
Cys Cys 180 185 190Glu Ser Tyr Gly Lys Ile Asn Pro Glu Leu Val Cys
Cys Asn Pro His 195 200 205His Leu Ser Arg Leu Cys Glu Leu Glu Ser
Pro Pro Pro Pro Tyr Ser 210 215 220Arg Tyr Pro Leu Asp Phe Leu Lys
Pro Thr Ala Asp Cys Pro Asp Ala225 230 235 240Val Pro Ser Ser Ala
Glu Thr Gly Gly Thr Asn Tyr Leu Ala Pro Gly 245 250 255Gly Leu Ser
Asp Ser Gln Leu Leu Leu Glu Pro Gly Asp Arg Ser His 260 265 270Trp
Cys Val Val Ala Tyr Trp Glu Glu Lys Thr Arg Val Gly Arg Leu 275 280
285Tyr Cys Val Gln Glu Pro Ser Leu Asp Ile Phe Tyr Asp Leu Pro Gln
290 295 300Gly Asn Gly Phe Cys Leu Gly Gln Leu Asn Ser Asp Asn Lys
Ser Gln305 310 315 320Leu Val Gln Lys Val Arg Ser Lys Ile Gly Cys
Gly Ile Gln Leu Thr 325 330 335Arg Glu Val Asp Gly Val Trp Val Tyr
Asn Arg Ser Ser Tyr Pro Ile 340 345 350Phe Ile Lys Ser Ala Thr Leu
Asp Asn Pro Asp Ser Arg Thr Leu Leu 355 360 365Val His Lys Val Phe
Pro Gly Phe Ser Ile Lys Ala Phe Asp Tyr Glu 370 375 380Lys Ala Tyr
Ser Leu Gln Arg Pro Asn Asp His Glu Phe Met Gln Gln385 390 395
400Pro Trp Thr Gly Phe Thr Val Gln Ile Ser Phe Val Lys Gly Trp Gly
405 410 415Gln Cys Tyr Thr Arg Gln Phe Ile Ser Ser Cys Pro Cys Trp
Leu Glu 420 425 430Val Ile Phe Asn Ser Arg Gly Lys Pro Ile Pro Asn
Pro Leu Leu Gly 435 440 445Leu Asp Ser Thr 450381365DNAArtificial
SequenceDescription of Artificial Sequence Synthetic polynucleotide
38ggatccggtc gtaaaaaacg tcgtcagcgt cgtcgtggtt tccgtaccaa acgttctgcg
60ctggttcgtc gtctgtggcg ttctcgtgcg ccgggtggtg aagacgaaga agaaggtgcg
120ggtggtggtg gtggtggtgg tgaactgcgt ggtgaaggtg cgaccgactc
tcgtgcgcac 180ggtgcgggtg gtggtggtcc gggtcgtgcg ggttgctgcc
tgggtaaagc ggttcgtggt 240gcgaaaggtc accaccaccc gcacccgccg
gcggcgggtg cgggtgcggc gggtggtgcg 300gaagcggacc tgaaagcgct
gacccactct gttctgaaaa aactgaaaga acgtcagctg 360gaactgctgc
tgcaggcggt tgaatctcgt ggtggtaccc gtaccgcgtg cctgctgctg
420ccgggtcgtc tggactgccg tctgggtccg ggtgcgccgg cgggtgcgca
gccggcgcag 480ccgccgtctt cttactctct gccgctgctg ctgtgcaaag
ttttccgttg gccggacctg 540cgtcactctt ctgaagttaa acgtctgtgc
tgctgcgaat cttacggtaa aatcaacccg 600gaactggttt gctgcaaccc
gcaccacctg tctcgtctgt gcgaactgga atctccgccg 660ccgccgtact
ctcgttaccc gctggacttc ctgaaaccga ccgcggactg cccggacgcg
720gttccgtctt ctgcggaaac cggtggtacc aactacctgg cgccgggtgg
tctgtctgac 780tctcagctgc tgctggaacc gggtgaccgt tctcactggt
gcgttgttgc gtactgggaa 840gaaaaaaccc gtgttggtcg tctgtactgc
gttcaggaac cgtctctgga catcttctac 900gacctgccgc agggtaacgg
tttctgcctg ggtcagctga actctgacaa caaatctcag 960ctggttcaga
aagttcgttc taaaatcggt tgcggtatcc agctgacccg tgaagttgac
1020ggtgtttggg tttacaaccg ttcttcttac ccgatcttca tcaaatctgc
gaccctggac 1080aacccggact ctcgtaccct gctggttcac aaagttttcc
cgggtttctc tatcaaagcg 1140ttcgactacg aaaaagcgta ctctctgcag
cgtccgaacg accacgaatt catgcagcag 1200ccgtggaccg gtttcaccgt
tcagatctct ttcgttaaag gttggggtca gtgctacacc 1260cgtcagttca
tctcttcttg cccgtgctgg ctggaagtta tcttcaactc tcgtggtaaa
1320ccgatcccga acccgctgct gggtctggac tctacctgag tcgac
1365391392DNAArtificial SequenceDescription of Artificial Sequence
Synthetic polynucleotide 39ggatccggcc gtaaaaaacg ccgtcaacgc
cgccgtggtt tccgtacgaa acgcagcgcc 60ctggtccgtc gcctgtggcg cagccgtgct
ccgggtggtg aagatgaaga agaaggtgct 120ggcggcggtg gcggtggcgg
tgaactgcgt ggcgagggtg caaccgatag ccgtgcacat 180ggtgcaggcg
gtggcggtcc gggtcgtgct ggttgctgtc tgggtaaagc tgtgcgcggc
240gcgaaaggtc atcatcatcc gcatccgccg gcagcaggtg caggtgcagc
tggcggtgcg 300gaagccgatc tgaaagccct gacccatagc gtcctgaaaa
aactgaaaga acgtcagctg 360gagctgctgc tgcaagcagt agaaagccgt
ggcggtaccc gtacggcttg tctgctgctg 420ccgggtcgtc tggattgccg
tctgggtccg ggtgcaccgg ctggtgcgca gccggcacaa 480ccgccgagca
gctacagcct gccgctgctg ctgtgtaaag tgtttcgttg gccggacctg
540cgccatagca gcgaagttaa acgcctgtgc tgttgcgaga gctatggcaa
aattaacccg 600gaactggttt gttgcaatcc gcatcatctg agccgtctgt
gtgaactgga gagcccgccg 660ccgccgtata gccgttaccc gctggatttc
ctgaaaccga ctgcagattg cccggacgca 720gtcccgagca gcgctgagac
cggcggcacc aactatctgg caccgggcgg tctgagcgat 780agccagctgc
tgctggaacc gggcgaccgt agccattggt gtgtggttgc ctattgggaa
840gagaaaacgc gtgtcggtcg cctgtactgc gtacaggaac cgagcctgga
tatcttttat 900gacctgccgc agggcaatgg tttctgtctg ggccaactga
acagcgataa taaaagccag 960ctggtgcaaa aagttcgcag caaaattggc
tgcggtatcc agctgacccg tgaagttgac 1020ggtgtctggg tatataaccg
cagcagctac ccgattttta tcaaaagcgc caccctggat 1080aatccggaca
gccgtacgct gctggtccat aaagtatttc cgggcttcag catcaaagcg
1140ttcgattacg agaaagccta cagcctgcag cgcccgaacg accatgaatt
catgcagcaa 1200ccgtggacgg gttttactgt gcagatcagc ttcgttaaag
gctggggtca atgctacacc 1260cgtcagttta tcagcagctg tccgtgctgg
ctggaagtga ttttcaatag ccgcaagggc 1320gagctcaata gcaagcttga
aggtaagcct atccctaacc ctctcctcgg tctcgatagc 1380acgtgagtcg ac
1392406PRTArtificial SequenceDescription of Artificial Sequence
Synthetic 6xHis tag 40His His His His His His1 54114PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 41Gly
Lys Pro Ile Pro Asn Pro Leu Leu Gly Leu Asp Ser Thr1 5
10428PRTArtificial SequenceDescription of Artificial Sequence
Synthetic peptide 42Lys Tyr Lys Asp Asp Asp Asp Lys1
5439PRTArtificial SequenceDescription of Artificial Sequence
Synthetic peptide 43Tyr Pro Tyr Asp Val Pro Asp Tyr Ala1
54423PRTArtificial SequenceDescription of Artificial Sequence
Synthetic peptide 44Gly Ala Leu Phe Leu Gly Trp Leu Gly Ala Ala Gly
Ser Thr Met Gly1 5 10 15Ala Lys Lys Lys Arg Lys Val
204516PRTArtificial SequenceDescription of Artificial Sequence
Synthetic peptide 45Arg Gln Ile Lys Ile Trp Phe Gln Asn Arg Arg Met
Lys Trp Lys Lys1 5 10 15467PRTArtificial SequenceDescription of
Artificial Sequence Synthetic peptide 46Arg Arg Met Lys Trp Lys
Lys1 54716PRTArtificial SequenceDescription of Artificial Sequence
Synthetic peptide 47Arg Arg Trp Arg Arg Trp Trp Arg Arg Trp Trp Arg
Arg Trp Arg Arg1 5 10 154818PRTArtificial SequenceDescription of
Artificial Sequence Synthetic peptide 48Arg Gly Gly Arg Leu Ser Tyr
Ser Arg Arg Arg Phe Ser Thr Ser Thr1 5 10 15Gly Arg499PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 49Arg
Lys Lys Arg Arg Gln Arg Arg Arg1 55011PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 50Tyr
Ala Arg Ala Ala Ala Arg Gln Ala Arg Ala1 5 10518PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 51Arg
Arg Arg Arg Arg Arg Arg Arg1 5528PRTArtificial SequenceDescription
of Artificial Sequence Synthetic peptide 52Lys Lys Lys Lys Lys Lys
Lys Lys1 55327PRTArtificial SequenceDescription of Artificial
Sequence Synthetic peptideMOD_RES(25)..(25)Any amino acid 53Gly Trp
Thr Leu Asn Ser Ala Gly Tyr Leu Leu Gly Lys Ile Asn Leu1 5 10 15Lys
Ala Leu Ala Ala Leu Ala Lys Xaa Ile Leu 20 255418PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 54Leu
Leu Ile Leu Leu Arg Arg Arg Ile Arg Lys Gln Ala Asn Ala His1 5 10
15Ser Lys5516PRTArtificial SequenceDescription of Artificial
Sequence Synthetic peptide 55Ser Arg Arg His His Cys Arg Ser Lys
Ala Lys Arg Ser Arg His His1 5 10 155611PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 56Asn
Arg Ala Arg Arg Asn Arg Arg Arg Val Arg1 5 105715PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 57Arg
Gln Leu Arg Ile Ala Gly Arg Arg Leu Arg Gly Arg Ser Arg1 5 10
155813PRTArtificial SequenceDescription of Artificial Sequence
Synthetic peptide 58Lys Leu Ile Lys Gly Arg Thr Pro Ile Lys Phe Gly
Lys1 5 105910PRTArtificial SequenceDescription of Artificial
Sequence Synthetic peptide 59Arg Arg Ile Pro Asn Arg Arg Pro Arg
Arg1 5 106018PRTArtificial SequenceDescription of Artificial
Sequence Synthetic peptide 60Lys Leu Ala Leu Lys Leu Ala Leu Lys
Ala Leu Lys Ala Ala Leu Lys1 5 10 15Leu Ala6114PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 61Lys
Leu Ala Lys Leu Ala Lys Lys Leu Ala Lys Leu Ala Lys1 5
106227PRTArtificial SequenceDescription of Artificial Sequence
Synthetic peptide 62Gly Ala Leu Phe Leu Gly Phe Leu Gly Ala Ala Gly
Ser Thr Asn Gly1 5 10 15Ala Trp Ser Gln Pro Lys Lys Lys Arg Lys Val
20 256321PRTArtificial SequenceDescription of Artificial Sequence
Synthetic peptide 63Lys Glu Thr Trp Trp Glu Thr Trp Trp Thr Glu Trp
Ser Gln Pro Lys1 5 10 15Lys Lys Arg Lys Val 206420PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 64Leu
Lys Lys Leu Leu Lys Lys Leu Leu Lys Lys Leu Leu Lys Lys Leu1 5 10
15Leu Lys Lys Leu 206534PRTArtificial SequenceDescription of
Artificial Sequence Synthetic polypeptide 65Gln Ala Ala Thr Ala Thr
Arg Gly Arg Ser Ala Ala Ser Arg Pro Thr1 5 10 15Glu Arg Pro Arg Ala
Pro Ala Arg Ser Ala Ser Arg Pro Arg Arg Pro 20 25 30Val
Glu6623PRTArtificial SequenceDescription of Artificial Sequence
Synthetic peptide 66Met Gly Leu Gly Leu His Leu Leu Val Leu Ala Ala
Ala Leu Gln Gly1 5 10 15Ala Lys Ser Lys Arg Lys Val
206726PRTArtificial SequenceDescription of Artificial Sequence
Synthetic peptide 67Ala Ala Val Ala Leu Leu Pro Ala Val Leu Leu Ala
Leu Leu Ala Pro1 5 10 15Ala Ala Ala Asn Tyr Lys Lys Pro Lys Leu 20
256828PRTArtificial SequenceDescription of Artificial Sequence
Synthetic peptide 68Met Ala Asn Leu Gly Tyr Trp Leu Leu Ala Leu Phe
Val Thr Met Trp1 5 10 15Thr Asp Val Gly Leu Cys Lys Lys Arg Pro Lys
Pro 20 256924PRTArtificial SequenceDescription of Artificial
Sequence Synthetic peptide 69Leu Gly Thr Tyr Thr Gln Asp Phe Asn
Lys Phe His Thr Phe Pro Gln1 5 10 15Thr Ala Ile Gly Val Gly Ala Pro
207026PRTArtificial SequenceDescription of Artificial Sequence
Synthetic peptideMOD_RES(24)..(24)Any amino acid 70Asp Pro Lys Gly
Asp Pro Lys Gly Val Thr Val Thr Val Thr Val Thr1 5 10 15Val Thr Gly
Lys Gly Asp Pro Xaa Pro Asp 20 257114PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 71Pro
Pro Pro Pro Pro Pro Pro Pro Pro Pro Pro Pro Pro Pro1 5
107218PRTArtificial SequenceDescription of Artificial Sequence
Synthetic peptide 72Val Arg Leu Pro Pro Pro Val Arg Leu Pro Pro Pro
Val Arg Leu Pro1 5 10 15Pro Pro7310PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 73Pro
Arg Pro Leu Pro Pro Pro Arg Pro Gly1 5 107430PRTArtificial
SequenceDescription of Artificial Sequence Synthetic polypeptide
74Ser Val Arg Arg Arg Pro Arg Pro Pro Tyr Leu Pro Arg Pro Arg Pro1
5 10 15Pro Pro Phe Phe Pro Pro Arg Leu Pro Pro Arg Ile Pro Pro 20
25 307521PRTArtificial SequenceDescription of Artificial Sequence
Synthetic peptide 75Thr Arg Ser Ser Arg Ala Gly Leu Gln Phe Pro Val
Gly Arg Val His1 5 10 15Arg Leu Leu Arg Lys 207623PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 76Gly
Ile Gly Lys Phe Leu His Ser Ala Lys Lys Phe Gly Lys Ala Phe1 5 10
15Val Gly Glu Ile Met Asn Ser 207737PRTArtificial
SequenceDescription of Artificial Sequence Synthetic polypeptide
77Lys Trp Lys Leu Phe Lys Lys Ile Glu Lys Val Gly Gln Asn Ile Arg1
5 10 15Asp Gly Ile Ile Lys Ala Gly Pro Ala Val Ala Val Val Gly Gln
Ala 20 25 30Thr Gln Ile Ala Lys 357828PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 78Ala
Leu Trp Met Thr Leu Leu Lys Lys Val Leu Lys Ala Ala Ala Lys1 5 10
15Ala Ala Leu Asn Ala Val Leu Val Gly Ala Asn Ala 20
257926PRTArtificial SequenceDescription of Artificial Sequence
Synthetic peptide 79Gly Ile Gly Ala Val Leu Lys Val Leu Thr Thr Gly
Leu Pro Ala
Leu1 5 10 15Ile Ser Trp Ile Lys Arg Lys Arg Gln Gln 20
258014PRTArtificial SequenceDescription of Artificial Sequence
Synthetic peptide 80Ile Asn Leu Lys Ala Leu Ala Ala Leu Ala Lys Lys
Ile Leu1 5 108133PRTArtificial SequenceDescription of Artificial
Sequence Synthetic polypeptide 81Gly Phe Phe Ala Leu Ile Pro Lys
Ile Ile Ser Ser Pro Leu Pro Lys1 5 10 15Thr Leu Leu Ser Ala Val Gly
Ser Ala Leu Gly Gly Ser Gly Gly Gln 20 25 30Glu8215PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 82Leu
Ala Lys Trp Ala Leu Lys Gln Gly Phe Ala Lys Leu Lys Ser1 5 10
158327PRTArtificial SequenceDescription of Artificial Sequence
Synthetic peptideMOD_RES(23)..(23)Any amino acid 83Ser Met Ala Gln
Asp Ile Ile Ser Thr Ile Gly Asp Leu Val Lys Trp1 5 10 15Ile Ile Gln
Thr Val Asn Xaa Phe Thr Lys Lys 20 258441PRTArtificial
SequenceDescription of Artificial Sequence Synthetic polypeptide
84Leu Leu Gly Asp Phe Phe Arg Lys Ser Lys Glu Lys Ile Gly Lys Glu1
5 10 15Phe Lys Arg Ile Val Gln Arg Ile Lys Gln Arg Ile Lys Asp Phe
Leu 20 25 30Ala Asn Leu Val Pro Arg Thr Glu Ser 35
408518PRTArtificial SequenceDescription of Artificial Sequence
Synthetic peptide 85Pro Ala Trp Arg Lys Ala Phe Arg Trp Ala Trp Arg
Met Leu Lys Lys1 5 10 15Ala Ala8618PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 86Lys
Leu Lys Leu Lys Leu Lys Leu Lys Leu Lys Leu Lys Leu Lys Leu1 5 10
15Lys Leu872182DNAArtificial SequenceDescription of Artificial
Sequence Synthetic polynucleotide 87gacgttgtat acgactccta
tagggcggcc gggaattcgt cgactggatc cggtaccgag 60gagatctgcc gccgcgatcg
ccatgtcccc cattctgggc tactggaaga ttaagggcct 120ggtgcagcct
actagactgc tgctggaata cctggaggaa aaatatgaag agcatctgta
180tgaaagagac gagggggata aatggaggaa caagaaattc gaactgggac
tggagtttcc 240taatctgcca tactatattg acggcgatgt gaagctgact
cagtctatgg ctatcattag 300atacatcgca gacaaacaca acatgctggg
cgggtgtcct aaggaaaggg cagagattag 360tatgctggag ggagccgtgc
tggatattag atacggcgtc tcacgcatcg cctatagcaa 420agacttcgaa
accctgaagg tggattttct gagcaaactg cctgaaatgc tgaagatgtt
480cgaggacaga ctgtgccaca aaacctacct gaatggcgac catgtcacac
acccagattt 540tatgctgtac gacgccctgg atgtggtcct gtatatggac
cccatgtgtc tggatgcttt 600ccctaagctg gtgtgcttta agaaaaggat
cgaggcaatt ccccagatcg ataagtacct 660gaaaagctcc aagtatatcg
cttggcctct ccagggctgg caggcaacat tcggaggcgg 720ggaccatccc
cctaaaagcg acctggaggt gctgtttcag ggaccactgg gcagcggccg
780gaagaagcgg cggcagaggc gccgaagtag gttccgcact aagcggtcag
cactggtgcg 840gagactgtgg cgatctcggg ctcctggagg agaggacgag
gaagagggag caggcggcgg 900cggaggagga ggagaactgc gcggggaggg
agctacagat agccgagccc acggagctgg 960aggaggagga ccagggcgag
ccggatgctg tctgggcaaa gcagtgagag gcgccaaggg 1020gcaccatcac
ccccatccac ccgccgctgg cgcaggagca gccggcggag ctgaggcaga
1080cctgaaagcc ctgactcaca gtgtgctgaa gaaactgaag gaaagacagc
tggagctgct 1140gctccaggca gtcgaatcac gcggaggcac ccgaacagct
tgtctgctgc tgcccggccg 1200gctggactgc cggctgggac ccggcgcccc
tgctggggca cagccagccc agcctccatc 1260tagttatagc ctgcccctgc
tgctgtgtaa ggtgttccga tggcctgatc tgcggcattc 1320aagcgaagtc
aaaaggctgt gctgttgcga gtcctacggc aagattaacc cagaactggt
1380gtgttgcaat ccccatcacc tgtctcgact gtgtgaactg gagtcccccc
ctccacccta 1440ctctaggtat cctatggact ttctgaagcc aaccgctgac
tgcccagatg cagtgccctc 1500ctctgccgag actgggggaa ccaactacct
ggctcctggc ggactgagcg actcccagct 1560gctgctggaa ccaggggatc
gcagccactg gtgtgtggtc gcctactggg aagagaagac 1620aagagtggga
aggctgtatt gcgtccagga gccttccctg gacatcttct acgatctgcc
1680acaggggaat ggattttgtc tgggccagct gaactctgac aataagagtc
agctggtgca 1740gaaagtccgg agcaagattg gctgcggcat ccagctgacc
agggaggtgg acggcgtgtg 1800ggtctacaac cgcagttcat atccaatctt
catcaagagc gccactctgg acaatcccga 1860ttcccgcacc ctgctggtgc
ataaggtctt ccccggcttc agcatcaagg ccttcgacta 1920cgagaaggct
tatagtctcc agcggcccaa cgatcacgag ttcatgcagc agccttggac
1980aggcttcact gtgcagatca gcttcgtcaa gggatggggc cagtgctaca
caaggcagtt 2040catctcaagc tgtccctgtt ggctggaagt cattttcaat
agtaggacgc gtacgcggcc 2100gctcgagcag aaactcatct cagaagagga
tctggcagca aatgatatcc tggattacaa 2160ggatgacgac gataaggttt aa
2182881059DNAHomo sapiens 88tcaccaccat ccccacccgc cagccgcggg
cgccggcgcg gccgggggcg ccgaggcgga 60tctgaaggcg ctcacgcact cggtgctcaa
gaaactgaag gagcggcagc tggagctgct 120gctccaggcc gtggagtccc
gcggcgggac gcgcaccgcg tgcctcctgc tgcccggccg 180cctggactgc
aggctgggcc cgggggcgcc cgccggcgcg cagcctgcgc agccgccctc
240gtcctactcg ctccccctcc tgctgtgcaa agtgttcagg tggccggatc
tcaggcattc 300ctcggaagtc aagaggctgt gttgctgtga atcttacggg
aagatcaacc ccgagctggt 360gtgctgcaac ccccatcacc ttagccgact
ctgcgaacta gagtctcccc cccctcctta 420ctccagatac ccgatggatt
ttctcaaacc aactgcagac tgtccagatg ctgtgccttc 480ctccgctgaa
acagggggaa cgaattatct ggcccctggg gggctttcag attcccaact
540tcttctggag cctggggatc ggtcacactg gtgcgtggtg gcatactggg
aggagaagac 600gagagtgggg aggctctact gtgtccagga gccctctctg
gatatcttct atgatctacc 660tcaggggaat ggcttttgcc tcggacagct
caattcggac aacaagagtc agctggtgca 720gaaggtgcgg agcaaaatcg
gctgcggcat ccagctgacg cgggaggtgg atggtgtgtg 780ggtgtacaac
cgcagcagtt accccatctt catcaagtcc gccacactgg acaacccgga
840ctccaggacg ctgttggtac acaaggtgtt ccccggtttc tccatcaagg
ctttcgacta 900cgagaaggcg tacagcctgc agcggcccaa tgaccacgag
tttatgcagc agccgtggac 960gggctttacc gtgcagatca gctttgtgaa
gggctggggt cagtgctaca cccgccagtt 1020catcagcagc tgcccgtgct
ggctagaggt catcttcaa 1059891053DNAArtificial SequenceDescription of
Artificial Sequence Synthetic polynucleotide 89tcacccccat
ccacccgccg ctggcgcagg agcagccggc ggagctgagg cagacctgaa 60agccctgact
cacagtgtgc tgaagaaact gaaggaaaga cagctggagc tgctgctcca
120ggcagtcgaa tcacgcggag gcacccgaac agcttgtctg ctgctgcccg
gccggctgga 180ctgccggctg ggacccggcg cccctgctgg ggcacagcca
gcccagcctc catctagtta 240tagcctgccc ctgctgctgt gtaaggtgtt
ccgatggcct gatctgcggc attcaagcga 300agtcaaaagg ctgtgctgtt
gcgagtccta cggcaagatt aacccagaac tggtgtgttg 360caatccccat
cacctgtctc gactgtgtga actggagtcc ccccctccac cctactctag
420gtatcctatg gactttctga agccaaccgc tgactgccca gatgcagtgc
cctcctctgc 480cgagactggg ggaaccaact acctggctcc tggcggactg
agcgactccc agctgctgct 540ggaaccaggg gatcgcagcc actggtgtgt
ggtcgcctac tgggaagaga agacaagagt 600gggaaggctg tattgcgtcc
aggagccttc cctggacatc ttctacgatc tgccacaggg 660gaatggattt
tgtctgggcc agctgaactc tgacaataag agtcagctgg tgcagaaagt
720ccggagcaag attggctgcg gcatccagct gaccagggag gtggacggcg
tgtgggtcta 780caaccgcagt tcatatccaa tcttcatcaa gagcgccact
ctggacaatc ccgattcccg 840caccctgctg gtgcataagg tcttccccgg
cttcagcatc aaggccttcg actacgagaa 900ggcttatagt ctccagcggc
ccaacgatca cgagttcatg cagcagcctt ggacaggctt 960cactgtgcag
atcagcttcg tcaagggatg gggccagtgc tacacaaggc agttcatctc
1020aagctgtccc tgttggctgg aagtcatttt caa 105390699PRTArtificial
SequenceDescription of Artificial Sequence Synthetic polypeptide
90Met Ser Pro Ile Leu Gly Tyr Trp Lys Ile Lys Gly Leu Val Gln Pro1
5 10 15Thr Arg Leu Leu Leu Glu Tyr Leu Glu Glu Lys Tyr Glu Glu His
Leu 20 25 30Tyr Glu Arg Asp Glu Gly Asp Lys Trp Arg Asn Lys Lys Phe
Glu Leu 35 40 45Gly Leu Glu Phe Pro Asn Leu Pro Tyr Tyr Ile Asp Gly
Asp Val Lys 50 55 60Leu Thr Gln Ser Met Ala Ile Ile Arg Tyr Ile Ala
Asp Lys His Asn65 70 75 80Met Leu Gly Gly Cys Pro Lys Glu Arg Ala
Glu Ile Ser Met Leu Glu 85 90 95Gly Ala Val Leu Asp Ile Arg Tyr Gly
Val Ser Arg Ile Ala Tyr Ser 100 105 110Lys Asp Phe Glu Thr Leu Lys
Val Asp Phe Leu Ser Lys Leu Pro Glu 115 120 125Met Leu Lys Met Phe
Glu Asp Arg Leu Cys His Lys Thr Tyr Leu Asn 130 135 140Gly Asp His
Val Thr His Pro Asp Phe Met Leu Tyr Asp Ala Leu Asp145 150 155
160Val Val Leu Tyr Met Asp Pro Met Cys Leu Asp Ala Phe Pro Lys Leu
165 170 175Val Cys Phe Lys Lys Arg Ile Glu Ala Ile Pro Gln Ile Asp
Lys Tyr 180 185 190Leu Lys Ser Ser Lys Tyr Ile Ala Trp Pro Leu Gln
Gly Trp Gln Ala 195 200 205Thr Phe Gly Gly Gly Asp His Pro Pro Lys
Ser Asp Leu Glu Val Leu 210 215 220Phe Gln Gly Pro Leu Gly Ser Gly
Arg Lys Lys Arg Arg Gln Arg Arg225 230 235 240Arg Ser Arg Phe Arg
Thr Lys Arg Ser Ala Leu Val Arg Arg Leu Trp 245 250 255Arg Ser Arg
Ala Pro Gly Gly Glu Asp Glu Glu Glu Gly Ala Gly Gly 260 265 270Gly
Gly Gly Gly Gly Glu Leu Arg Gly Glu Gly Ala Thr Asp Ser Arg 275 280
285Ala His Gly Ala Gly Gly Gly Gly Pro Gly Arg Ala Gly Cys Cys Leu
290 295 300Gly Lys Ala Val Arg Gly Ala Lys Gly His His His Pro His
Pro Pro305 310 315 320Ala Ala Gly Ala Gly Ala Ala Gly Gly Ala Glu
Ala Asp Leu Lys Ala 325 330 335Leu Thr His Ser Val Leu Lys Lys Leu
Lys Glu Arg Gln Leu Glu Leu 340 345 350Leu Leu Gln Ala Val Glu Ser
Arg Gly Gly Thr Arg Thr Ala Cys Leu 355 360 365Leu Leu Pro Gly Arg
Leu Asp Cys Arg Leu Gly Pro Gly Ala Pro Ala 370 375 380Gly Ala Gln
Pro Ala Gln Pro Pro Ser Ser Tyr Ser Leu Pro Leu Leu385 390 395
400Leu Cys Lys Val Phe Arg Trp Pro Asp Leu Arg His Ser Ser Glu Val
405 410 415Lys Arg Leu Cys Cys Cys Glu Ser Tyr Gly Lys Ile Asn Pro
Glu Leu 420 425 430Val Cys Cys Asn Pro His His Leu Ser Arg Leu Cys
Glu Leu Glu Ser 435 440 445Pro Pro Pro Pro Tyr Ser Arg Tyr Pro Met
Asp Phe Leu Lys Pro Thr 450 455 460Ala Asp Cys Pro Asp Ala Val Pro
Ser Ser Ala Glu Thr Gly Gly Thr465 470 475 480Asn Tyr Leu Ala Pro
Gly Gly Leu Ser Asp Ser Gln Leu Leu Leu Glu 485 490 495Pro Gly Asp
Arg Ser His Trp Cys Val Val Ala Tyr Trp Glu Glu Lys 500 505 510Thr
Arg Val Gly Arg Leu Tyr Cys Val Gln Glu Pro Ser Leu Asp Ile 515 520
525Phe Tyr Asp Leu Pro Gln Gly Asn Gly Phe Cys Leu Gly Gln Leu Asn
530 535 540Ser Asp Asn Lys Ser Gln Leu Val Gln Lys Val Arg Ser Lys
Ile Gly545 550 555 560Cys Gly Ile Gln Leu Thr Arg Glu Val Asp Gly
Val Trp Val Tyr Asn 565 570 575Arg Ser Ser Tyr Pro Ile Phe Ile Lys
Ser Ala Thr Leu Asp Asn Pro 580 585 590Asp Ser Arg Thr Leu Leu Val
His Lys Val Phe Pro Gly Phe Ser Ile 595 600 605Lys Ala Phe Asp Tyr
Glu Lys Ala Tyr Ser Leu Gln Arg Pro Asn Asp 610 615 620His Glu Phe
Met Gln Gln Pro Trp Thr Gly Phe Thr Val Gln Ile Ser625 630 635
640Phe Val Lys Gly Trp Gly Gln Cys Tyr Thr Arg Gln Phe Ile Ser Ser
645 650 655Cys Pro Cys Trp Leu Glu Val Ile Phe Asn Ser Arg Thr Arg
Thr Arg 660 665 670Pro Leu Glu Gln Lys Leu Ile Ser Glu Glu Asp Leu
Ala Ala Asn Asp 675 680 685Ile Leu Asp Tyr Lys Asp Asp Asp Asp Lys
Val 690 695911637DNAArtificial SequenceDescription of Artificial
Sequence Synthetic polynucleotide 91gaattcgtcg actggatccg
gtaccgagga gatctgccgc cgcgatcgcc atgtccccca 60ttctgggcta ctggaagatt
aagggcctgg tgcagcctac tagactgctg ctggaatacc 120tggaggaaaa
atatgaagag catctgtatg aaagagacga gggggataaa tggaggaaca
180agaaattcga actgggactg gagtttccta atctgccata ctatattgac
ggcgatgtga 240agctgactca gtctatggct atcattagat acatcgcaga
caaacacaac atgctgggcg 300ggtgtcctaa ggaaagggca gagattagta
tgctggaggg agccgtgctg gatattagat 360acggcgtctc acgcatcgcc
tatagcaaag acttcgaaac cctgaaggtg gattttctga 420gcaaactgcc
tgaaatgctg aagatgttcg aggacagact gtgccacaaa acctacctga
480atggcgacca tgtcacacac ccagatttta tgctgtacga cgccctggat
gtggtcctgt 540atatggaccc catgtgtctg gatgctttcc ctaagctggt
gtgctttaag aaaaggatcg 600aggcaattcc ccagatcgat aagtacctga
aaagctccaa gtatatcgct tggcctctcc 660agggctggca ggcaacattc
ggaggcgggg accatccccc taaaagcgac ctggaggtgc 720tgtttcaggg
accactgggc agcggccgga agaagcggcg gcagaggcgc cgaagtaggt
780tccgcactaa gcggtcagca ctggtgcgga gactgtggcg atctcgggct
cctggaggag 840aggacgagga agagggagca ggcggcggcg gaggaggagg
agaactgcgc ggggagggag 900ctacagatag ccgagcccac ggagctggag
gaggaggacc agggcgagcc ggatgctgtc 960tgggcaaagc agtgagaggc
gccaaggggc accatcaccc ccatccaccc gccgctggcg 1020caggagcagc
cggcggagct gaggcagacc tgaaagccct gactcacagt gtgctgaaga
1080aactgaagga aagacagctg gagctgctgc tccaggcagt cgaatcacgc
ggaggcaccc 1140gaacagcttg tctgctgctg cccggccggc tggactgccg
gctgggaccc ggcgcccctg 1200ctggggcaca gccagcccag cctccatcta
gttatagcct gcccctgctg ctgtgtaagg 1260tgttccgatg gcctgatctg
cggcattcaa gcgaagtcaa aaggctgtgc tgttgcgagt 1320cctacggcaa
gattaaccca gaactggtgt gttgcaatcc ccatcacctg tctcgactgt
1380gtgaactgga gtccccccct ccaccctact ctaggtatcc tatggacttt
ctgaagccaa 1440ccgctgactg cccagatgca gtgccctcct ctgccgagac
tgggggaacc aactacctgg 1500ctcctggcgg actgagcgac tcccagctgc
tgctggaacc aggggatcgg cggccgctcg 1560agcagaaact catctcagaa
gaggatctgg cagcaaatga tatcctggat tacaaggatg 1620acgacgataa ggtttaa
163792528PRTArtificial SequenceDescription of Artificial Sequence
Synthetic polypeptide 92Met Ser Pro Ile Leu Gly Tyr Trp Lys Ile Lys
Gly Leu Val Gln Pro1 5 10 15Thr Arg Leu Leu Leu Glu Tyr Leu Glu Glu
Lys Tyr Glu Glu His Leu 20 25 30Tyr Glu Arg Asp Glu Gly Asp Lys Trp
Arg Asn Lys Lys Phe Glu Leu 35 40 45Gly Leu Glu Phe Pro Asn Leu Pro
Tyr Tyr Ile Asp Gly Asp Val Lys 50 55 60Leu Thr Gln Ser Met Ala Ile
Ile Arg Tyr Ile Ala Asp Lys His Asn65 70 75 80Met Leu Gly Gly Cys
Pro Lys Glu Arg Ala Glu Ile Ser Met Leu Glu 85 90 95Gly Ala Val Leu
Asp Ile Arg Tyr Gly Val Ser Arg Ile Ala Tyr Ser 100 105 110Lys Asp
Phe Glu Thr Leu Lys Val Asp Phe Leu Ser Lys Leu Pro Glu 115 120
125Met Leu Lys Met Phe Glu Asp Arg Leu Cys His Lys Thr Tyr Leu Asn
130 135 140Gly Asp His Val Thr His Pro Asp Phe Met Leu Tyr Asp Ala
Leu Asp145 150 155 160Val Val Leu Tyr Met Asp Pro Met Cys Leu Asp
Ala Phe Pro Lys Leu 165 170 175Val Cys Phe Lys Lys Arg Ile Glu Ala
Ile Pro Gln Ile Asp Lys Tyr 180 185 190Leu Lys Ser Ser Lys Tyr Ile
Ala Trp Pro Leu Gln Gly Trp Gln Ala 195 200 205Thr Phe Gly Gly Gly
Asp His Pro Pro Lys Ser Asp Leu Glu Val Leu 210 215 220Phe Gln Gly
Pro Leu Gly Ser Gly Arg Lys Lys Arg Arg Gln Arg Arg225 230 235
240Arg Ser Arg Phe Arg Thr Lys Arg Ser Ala Leu Val Arg Arg Leu Trp
245 250 255Arg Ser Arg Ala Pro Gly Gly Glu Asp Glu Glu Glu Gly Ala
Gly Gly 260 265 270Gly Gly Gly Gly Gly Glu Leu Arg Gly Glu Gly Ala
Thr Asp Ser Arg 275 280 285Ala His Gly Ala Gly Gly Gly Gly Pro Gly
Arg Ala Gly Cys Cys Leu 290 295 300Gly Lys Ala Val Arg Gly Ala Lys
Gly His His His Pro His Pro Pro305 310 315 320Ala Ala Gly Ala Gly
Ala Ala Gly Gly Ala Glu Ala Asp Leu Lys Ala 325 330 335Leu Thr His
Ser Val Leu Lys Lys Leu Lys Glu Arg Gln Leu Glu Leu 340 345 350Leu
Leu Gln Ala Val Glu Ser Arg Gly Gly Thr Arg Thr Ala Cys Leu 355 360
365Leu Leu Pro Gly Arg Leu Asp Cys Arg Leu Gly Pro Gly Ala Pro Ala
370 375 380Gly Ala Gln Pro Ala Gln Pro Pro Ser Ser Tyr Ser Leu Pro
Leu Leu385 390 395 400Leu Cys Lys Val Phe Arg Trp Pro Asp Leu Arg
His Ser Ser Glu Val 405 410 415Lys Arg Leu Cys Cys Cys Glu Ser Tyr
Gly Lys Ile Asn Pro Glu Leu 420 425 430Val Cys Cys Asn Pro His His
Leu Ser Arg Leu Cys Glu Leu
Glu Ser 435 440 445Pro Pro Pro Pro Tyr Ser Arg Tyr Pro Met Asp Phe
Leu Lys Pro Thr 450 455 460Ala Asp Cys Pro Asp Ala Val Pro Ser Ser
Ala Glu Thr Gly Gly Thr465 470 475 480Asn Tyr Leu Ala Pro Gly Gly
Leu Ser Asp Ser Gln Leu Leu Leu Glu 485 490 495Pro Gly Asp Arg Arg
Pro Leu Glu Gln Lys Leu Ile Ser Glu Glu Asp 500 505 510Leu Ala Ala
Asn Asp Ile Leu Asp Tyr Lys Asp Asp Asp Asp Lys Val 515 520
525931370DNAArtificial SequenceDescription of Artificial Sequence
Synthetic polynucleotide 93gaattcgtcg actggatccg gtaccgagga
gatctgccgc cgcgatcgcc atgtccccca 60ttctgggcta ctggaagatt aagggcctgg
tgcagcctac tagactgctg ctggaatacc 120tggaggaaaa atatgaagag
catctgtatg aaagagacga gggggataaa tggaggaaca 180agaaattcga
actgggactg gagtttccta atctgccata ctatattgac ggcgatgtga
240agctgactca gtctatggct atcattagat acatcgcaga caaacacaac
atgctgggcg 300ggtgtcctaa ggaaagggca gagattagta tgctggaggg
agccgtgctg gatattagat 360acggcgtctc acgcatcgcc tatagcaaag
acttcgaaac cctgaaggtg gattttctga 420gcaaactgcc tgaaatgctg
aagatgttcg aggacagact gtgccacaaa acctacctga 480atggcgacca
tgtcacacac ccagatttta tgctgtacga cgccctggat gtggtcctgt
540atatggaccc catgtgtctg gatgctttcc ctaagctggt gtgctttaag
aaaaggatcg 600aggcaattcc ccagatcgat aagtacctga aaagctccaa
gtatatcgct tggcctctcc 660agggctggca ggcaacattc ggaggcgggg
accatccccc taaaagcgac ctggaggtgc 720tgtttcaggg accactgggc
agcggccgga agaagcggcg gcagaggcgc agccactggt 780gtgtggtcgc
ctactgggaa gagaagacaa gagtgggaag gctgtattgc gtccaggagc
840cttccctgga catcttctac gatctgccac aggggaatgg attttgtctg
ggccagctga 900actctgacaa taagagtcag ctggtgcaga aagtccggag
caagattggc tgcggcatcc 960agctgaccag ggaggtggac ggcgtgtggg
tctacaaccg cagttcatat ccaatcttca 1020tcaagagcgc cactctggac
aatcccgatt cccgcaccct gctggtgcat aaggtcttcc 1080ccggcttcag
catcaaggcc ttcgactacg agaaggctta tagtctccag cggcccaacg
1140atcacgagtt catgcagcag ccttggacag gcttcactgt gcagatcagc
ttcgtcaagg 1200gatggggcca gtgctacaca aggcagttca tctcaagctg
tccctgttgg ctggaagtca 1260ttttcaatag taggacgcgt acgcggccgc
tcgagcagaa actcatctca gaagaggatc 1320tggcagcaaa tgatatcctg
gattacaagg atgacgacga taaggtttaa 137094440PRTArtificial
SequenceDescription of Artificial Sequence Synthetic polypeptide
94Met Ser Pro Ile Leu Gly Tyr Trp Lys Ile Lys Gly Leu Val Gln Pro1
5 10 15Thr Arg Leu Leu Leu Glu Tyr Leu Glu Glu Lys Tyr Glu Glu His
Leu 20 25 30Tyr Glu Arg Asp Glu Gly Asp Lys Trp Arg Asn Lys Lys Phe
Glu Leu 35 40 45Gly Leu Glu Phe Pro Asn Leu Pro Tyr Tyr Ile Asp Gly
Asp Val Lys 50 55 60Leu Thr Gln Ser Met Ala Ile Ile Arg Tyr Ile Ala
Asp Lys His Asn65 70 75 80Met Leu Gly Gly Cys Pro Lys Glu Arg Ala
Glu Ile Ser Met Leu Glu 85 90 95Gly Ala Val Leu Asp Ile Arg Tyr Gly
Val Ser Arg Ile Ala Tyr Ser 100 105 110Lys Asp Phe Glu Thr Leu Lys
Val Asp Phe Leu Ser Lys Leu Pro Glu 115 120 125Met Leu Lys Met Phe
Glu Asp Arg Leu Cys His Lys Thr Tyr Leu Asn 130 135 140Gly Asp His
Val Thr His Pro Asp Phe Met Leu Tyr Asp Ala Leu Asp145 150 155
160Val Val Leu Tyr Met Asp Pro Met Cys Leu Asp Ala Phe Pro Lys Leu
165 170 175Val Cys Phe Lys Lys Arg Ile Glu Ala Ile Pro Gln Ile Asp
Lys Tyr 180 185 190Leu Lys Ser Ser Lys Tyr Ile Ala Trp Pro Leu Gln
Gly Trp Gln Ala 195 200 205Thr Phe Gly Gly Gly Asp His Pro Pro Lys
Ser Asp Leu Glu Val Leu 210 215 220Phe Gln Gly Pro Leu Gly Ser Gly
Arg Lys Lys Arg Arg Gln Arg Arg225 230 235 240Arg Ser His Trp Cys
Val Val Ala Tyr Trp Glu Glu Lys Thr Arg Val 245 250 255Gly Arg Leu
Tyr Cys Val Gln Glu Pro Ser Leu Asp Ile Phe Tyr Asp 260 265 270Leu
Pro Gln Gly Asn Gly Phe Cys Leu Gly Gln Leu Asn Ser Asp Asn 275 280
285Lys Ser Gln Leu Val Gln Lys Val Arg Ser Lys Ile Gly Cys Gly Ile
290 295 300Gln Leu Thr Arg Glu Val Asp Gly Val Trp Val Tyr Asn Arg
Ser Ser305 310 315 320Tyr Pro Ile Phe Ile Lys Ser Ala Thr Leu Asp
Asn Pro Asp Ser Arg 325 330 335Thr Leu Leu Val His Lys Val Phe Pro
Gly Phe Ser Ile Lys Ala Phe 340 345 350Asp Tyr Glu Lys Ala Tyr Ser
Leu Gln Arg Pro Asn Asp His Glu Phe 355 360 365Met Gln Gln Pro Trp
Thr Gly Phe Thr Val Gln Ile Ser Phe Val Lys 370 375 380Gly Trp Gly
Gln Cys Tyr Thr Arg Gln Phe Ile Ser Ser Cys Pro Cys385 390 395
400Trp Leu Glu Val Ile Phe Asn Ser Arg Thr Arg Thr Arg Pro Leu Glu
405 410 415Gln Lys Leu Ile Ser Glu Glu Asp Leu Ala Ala Asn Asp Ile
Leu Asp 420 425 430Tyr Lys Asp Asp Asp Asp Lys Val 435
4409566PRTArtificial SequenceDescription of Artificial Sequence
Synthetic polypeptide 95Gly Arg Lys Lys Arg Arg Gln Arg Arg Arg Glu
Leu Glu Ser Pro Pro1 5 10 15Pro Pro Tyr Ser Arg Tyr Pro Met Asp Phe
Leu Lys Pro Thr Ala Asp 20 25 30Cys Pro Asp Ala Val Pro Ser Ser Ala
Glu Thr Gly Gly Thr Asn Tyr 35 40 45Leu Ala Pro Gly Gly Leu Ser Asp
Ser Gln Leu Leu Leu Glu Pro Gly 50 55 60Asp Arg6596249DNAArtificial
SequenceDescription of Artificial Sequence Synthetic polynucleotide
96ggccgtaaaa aacgccgtca acgccgccgt gaactggaga gcccgccgcc gccgtatagc
60cgttacccga tggatttcct gaaaccgact gcagattgcc cggacgcagt cccgagcagc
120gctgagaccg gcggcaccaa ctatctggca ccgggcggtc tgagcgatag
ccagctgctg 180ctggaaccgg gcgaccgtgg taagcctatc cctaaccctc
tcctcggtct cgattctacg 240tgagtcgac 24997285DNAArtificial
SequenceDescription of Artificial Sequence Synthetic polynucleotide
97ggccggaaga agcggcggca gaggcgccga gaactggagt ccccccctcc accctactct
60aggtatccta tggactttct gaagccaacc gctgactgcc cagatgcagt gccctcctct
120gccgagactg ggggaaccaa ctacctggct cctggcggac tgagcgactc
ccagctgctg 180ctggaaccag gggatcggcg gccgctcgag cagaaactca
tctcagaaga ggatctggca 240gcaaatgata tcctggatta caaggatgac
gacgataagg tttaa 2859825PRTArtificial SequenceDescription of
Artificial Sequence Synthetic peptide 98Gly Arg Lys Lys Arg Arg Gln
Arg Arg Arg Glu Leu Glu Ser Pro Pro1 5 10 15Pro Pro Tyr Ser Arg Tyr
Pro Met Asp 20 2599126DNAArtificial SequenceDescription of
Artificial Sequence Synthetic polynucleotide 99ggccgtaaaa
aacgccgtca acgccgccgt gaactggaga gcccgccgcc gccgtatagc 60cgttacccga
tggatggtaa gcctatccct aaccctctcc tcggtctcga ttctacgtga 120gtcgac
126100162DNAArtificial SequenceDescription of Artificial Sequence
Synthetic polynucleotide 100ggccggaaga agcggcggca gaggcgccga
gaactggagt ccccccctcc accctactct 60aggtatccta tggaccggcc gctcgagcag
aaactcatct cagaagagga tctggcagca 120aatgatatcc tggattacaa
ggatgacgac gataaggttt aa 16210113PRTArtificial SequenceDescription
of Artificial Sequence Synthetic peptide 101Gly Leu Pro Ile Pro Asn
Pro Leu Leu Gly Leu Asp Ser1 5 10
* * * * *