U.S. patent application number 15/754078 was filed with the patent office on 2018-09-13 for t cells modified to overexpress c-myb.
This patent application is currently assigned to The USA, as represented by the Secretary Department of Health and Human Services. The applicant listed for this patent is The USA, as represented by the Secretary Department of Health and Human Services, The USA, as represented by the Secretary Department of Health and Human Services. Invention is credited to Luca Gattinoni, Sanjivan Gautam, Yun Ji.
Application Number | 20180258394 15/754078 |
Document ID | / |
Family ID | 56940360 |
Filed Date | 2018-09-13 |
United States Patent
Application |
20180258394 |
Kind Code |
A1 |
Gautam; Sanjivan ; et
al. |
September 13, 2018 |
T CELLS MODIFIED TO OVEREXPRESS C-MYB
Abstract
Provided is an isolated or purified T cell comprising an
antigen-specific receptor, wherein the antigen-specific receptor is
a T cell receptor (TCR) or a chimeric antigen receptor (CAR),
wherein the T cell has been modified to express a transcription
factor at a level that is higher than the level of the
transcription factor expressed by a T cell that has not been
modified to express the transcription factor, wherein the
transcription factor is V-Myb Avian Myeloblastosis Viral Oncogene
Homolog (c-Myb), a functional variant of c-Myb, or a functional
fragment of c-Myb. Related populations of cells, pharmaceutical
compositions, methods of treating a disease, and methods of
inhibiting the differentiation of T cells are also provided.
Inventors: |
Gautam; Sanjivan;
(Rockville, MD) ; Ji; Yun; (Germantown, MD)
; Gattinoni; Luca; (Washington, DC) |
|
Applicant: |
Name |
City |
State |
Country |
Type |
The USA, as represented by the Secretary Department of Health and
Human Services |
Bethesda |
MD |
US |
|
|
Assignee: |
The USA, as represented by the
Secretary Department of Health and Human Services
Bethesda
MD
|
Family ID: |
56940360 |
Appl. No.: |
15/754078 |
Filed: |
August 24, 2016 |
PCT Filed: |
August 24, 2016 |
PCT NO: |
PCT/US2016/048435 |
371 Date: |
February 21, 2018 |
Related U.S. Patent Documents
|
|
|
|
|
|
Application
Number |
Filing Date |
Patent Number |
|
|
62209497 |
Aug 25, 2015 |
|
|
|
Current U.S.
Class: |
1/1 |
Current CPC
Class: |
A61K 35/17 20130101;
A61K 2035/124 20130101; A61P 35/00 20180101; C12N 5/0638 20130101;
C07K 14/7051 20130101; C07K 14/82 20130101; A61P 31/12
20180101 |
International
Class: |
C12N 5/0783 20060101
C12N005/0783; C07K 14/82 20060101 C07K014/82; C07K 14/725 20060101
C07K014/725; A61K 35/17 20060101 A61K035/17 |
Claims
1. An isolated or purified T cell comprising an antigen-specific
receptor, wherein the antigen-specific receptor is a T cell
receptor (TCR) or a chimeric antigen receptor (CAR), wherein the T
cell has been modified to express a transcription factor at a level
that is higher than the level of the transcription factor expressed
by a T cell that has not been modified to express the transcription
factor, wherein the transcription factor is V-Myb Avian
Myeloblastosis Viral Oncogene Homolog (c-Myb), a functional variant
of c-Myb, or a functional fragment of c-Myb.
2. The isolated or purified T cell according to claim 1, comprising
a vector comprising (i) a nucleic acid encoding the transcription
factor, and (ii) a heterologous nucleic acid sequence, wherein the
level of the transcription factor expressed by the T cell is
increased as compared to the level of the transcription factor
expressed by a T cell that lacks the vector.
3. The isolated or purified T cell of claim 2, wherein the vector
is a viral vector.
4. The isolated or purified T cell of claim 2, wherein the nucleic
acid encodes a c-Myb amino acid sequence having at least 90%
identity to an amino acid sequence selected from the group
consisting of SEQ ID NOs: 3-4 and 13-20.
5. The isolated or purified T cell of claim 2, wherein the nucleic
acid has at least 90% identity to a nucleotide sequence selected
from the group consisting of SEQ ID NOs: 1-2, 5-12, and 21.
6. The isolated or purified T cell according to claim 1, wherein
the receptor has antigenic specificity for a cancer antigen.
7. The isolated or purified T cell according to claim 1, wherein
the receptor has antigenic specificity for a viral antigen.
8. The isolated or purified T cell according to claim 1, wherein
the T cell is CD62L.sup.+.
9. The isolated or purified T cell according to claim 1, wherein
the T cell is KLRG1.sup.-.
10. The isolated or purified T cell of claim 1, wherein the c-Myb
is human c-Myb.
11. The isolated or purified T cell of claim 1, wherein the T cell
is CD8.sup.+.
12. The isolated or purified T cell of claim 1, wherein the
receptor is a recombinant TCR.
13. The isolated or purified T cell of claim 1, wherein the
receptor is a chimeric antigen receptor (CAR).
14. The isolated or purified T cell of claim 1, wherein the
receptor is an endogenous TCR.
15. A population of T cells comprising at least two T cells of
claim 1.
16. A pharmaceutical composition comprising the population of T
cells of claim 15 and a pharmaceutically acceptable carrier.
17.-18. (canceled)
19. A method of inhibiting the differentiation of T cells, the
method comprising introducing a nucleic acid encoding a
transcription factor into isolated or purified T cells under
conditions sufficient to obtain an increased expression of the
transcription factor as compared to T cells that lack the
introduced nucleic acid, wherein the transcription factor is c-Myb,
a functional variant of c-Myb, or a functional fragment of c-Myb,
and wherein the increased expression of the transcription factor
inhibits differentiation of the T cells.
20. The method according to claim 19, comprising increasing CD62L
expression by the T cells.
21. The method according to claim 19, comprising decreasing KLRG1
expression by the T cells.
22. A method of treating or preventing a disease in a mammal, the
method comprising administering to the mammal the population of T
cells of claim 15 in an amount effective to treat or prevent the
disease in the mammal, wherein the disease is a cancer or a viral
disease.
23. The method of claim 22, wherein the T cells of the population
are autologous to the mammal.
Description
CROSS-REFERENCE TO RELATED APPLICATION
[0001] This patent application claims the benefit of U.S.
Provisional Patent Application No, 62/209,497, filed Aug. 25, 2015,
which is incorporated by reference in its entirety herein.
INCORPORATION-BY-REFERENCE OF MATERIAL SUBMITTED ELECTRONICALLY
[0002] Incorporated by reference in its entirety herein is a
computer-readable nucleotide/amino acid sequence listing submitted
concurrently herewith and identified as follows: One 106,200 Byte
ASCII (Text) file named "726282_ST25.txt" dated Aug. 24, 2016.
BACKGROUND OF THE INVENTION
[0003] Adoptive cell therapy can be an effective treatment for
diseases (e,g., cancer) in some patients. However, obstacles to the
overall success of adoptive cell therapy still exist. For example,
the in vivo persistence, survival, and anti-tumor activity of T
cells can, in some cases, decrease following adoptive transfer.
Alternatively or additionally, in some cases, the increased
differentiation of T cells can pose obstacles to the treatment of
diseases.
[0004] In spite of considerable research into methods of producing
cells for adoptive cell therapy and treatments for cancer and viral
diseases, there still exists a need for improved methods for
producing cells for adoptive cell therapy and treating and/or
preventing cancer and viral diseases.
BRIEF SUMMARY OF THE INVENTION
[0005] An embodiment of the invention provides an isolated or
purified T cell comprising an antigen-specific receptor, wherein
the antigen-specific receptor is a T cell receptor (TCR) or a
chimeric antigen receptor (CAR), wherein the T cell has been
modified to express a transcription factor at a level that is
higher than the level of the transcription factor expressed by a T
cell that has not been modified to express the transcription
factor, wherein the transcription factor is V-Myb Avian
Myeloblastosis Viral Oncogene Homolog (c-Myb), a functional variant
of c-Myb, or a functional fragment of c-Myb.
[0006] Another embodiment of the invention provides a method of
inhibiting the differentiation of T cells, the method comprising
introducing a nucleic acid encoding a transcription factor into
isolated or purified T cells under conditions sufficient to obtain
an increased expression of the transcription factor as compared to
T cells that lack the introduced nucleic acid, wherein the
transcription factor is c-Myb, a functional variant of c-Myb, or a
functional fragment of c-Myb, and wherein the increased expression
of the transcription factor inhibits differentiation of the T
cells.
[0007] Further embodiments of the invention provide related
populations of T cells, pharmaceutical compositions, and methods of
treating a disease.
BRIEF DESCRIPTION OF THE. SEVERAL VIEWS OF THE DRAWINGS
[0008] FIG. 1A is a graph showing the quantity of Thy1.1.sup.+
pme1-1 CD8.sup.+ T cells measured at three and five days after
adoptive transfer of wild-type (WT) CD8.sup.+ T (circles) or pme1-1
Myb.sup.-/- CD8.sup.+ T (squares) cells into WT mice infected with
gp-100 vaccinia virus (VV).
[0009] FIG. 1B is a graph showing the percentage of KLRG1.sup.30
CD62L.sup.- cells obtained from the spleen on day five after
adoptive transfer of WT CD8.sup.+T (circles) or pme1-1 Myb.sup.-/-
CD8.sup.+ T (squares) cells into WT mice infected with gp-100
vaccinia virus (VV). There were four mice in each treatment
group.
[0010] FIG. 2A is a graph showing the total number of Thy1.1.sup.+
pme1-1 CD8.sup.+ T cells in the spleen after adoptive transfer of
Thy1.1 CD8.sup.+ (circles) or Thy1.1 Myb CD8.sup.+ T cells
(squares) into WT mice infected with gp100-VV, assessed at 0-30
days after transfer. Data represent mean and error bars are
S.E.M.
[0011] FIG. 2B is a graph showing the total number of Thy1.1.sup.+
pme1-1 CD8.sup.+ T cells in the spleen after adoptive transfer of
Thy1.1 CD8.sup.+ (shaded bars) or Thy1.1 Myb CD8.sup.+ T cells
(unshaded bars) into WT mice infected with gp100-VV, assessed at 5
days after heterologous reinfection (on day 30 of gp100-VV
infection) with gp100 adenovirus. Data represent mean and error
bars are S.E.M.
[0012] FIGS. 2C and 2E are graphs showing the total number of
KLRG1.sup.+CD62L.sup.- cells (FIG. 2C) or KLRG1.sup.-CD62L.sup.+
cells (FIG. 2E) from spleen 3-30 days post transfer of Thy1.1
CD8.sup.+ (circles) or Thy1.1 Myb CD8.sup.+ T cells (squares) into
WT mice infected with gp100-VV (after gating on Thy1.1.sup.+
Ly5.1.sup.-CD8.sup.+ cells or Thy1.1.sup.+ Ly5.1.sup.+ CD8.sup.+ T
cells).
[0013] FIGS. 2D and 2F are graphs showing the total number of
KLRG1.sup.+CD62L.sup.-cells (FIG. 2D) or KLRG1.sup.-CD62L.sup.+
cells (FIG. 2F) from spleen 5 days post transfer after heterologous
reinfection (on day 30 of gp100-VV infection) with gp100 adenovirus
(after gating on Thy1.1.sup.+ Ly5.1.sup.-CD8.sup.+ cells or
Thy1.1.sup.+ Ly5.1.sup.+CD8.sup.+ T cells).
[0014] FIGS. 3A and 3B are graphs showing the percentage of
Thy1.1.sup.+ cells from the lymph nodes (A) and lungs (B) after
adoptive transfer of Thy1.1 CD8.sup.+ (shaded bars) or Thy1.1 Myb
CD8.sup.+ T cells (unshaded bars) into WT mice infected with
gp100-VV 5-30 days post-transfer.
[0015] FIGS. 3C and 3D are graphs showing the percentage of KLRG1-
CD62L.sup.+Thy1.1.sup.+ cells (C) or KLRG1.sup.+CD62L.sup.- (D)
cells from lymph node and lungs 5-30 days (C) or 5 days (D) post
transfer of Thy1.1 CD8.sup.+ (shaded bars) or Thy1.1 Myb CD8.sup.+
T cells (unshaded bars) into WT mice infected with gp100-VV.
[0016] FIGS. 3E and 3F are graphs showing the percentage of
Thy1.1.sup.+ cells from lymph nodes (E) and lungs (F) 5 days post
transfer after heterologous reinfection with gp100 adenovirus (on
day 30 of gp100-VV infection) after gating on Thy1.1.sup.+
Ly5.1.sup.-CD8.sup.+ cells or Thy1.1.sup.+ Ly5.1.sup.+ CD8.sup.+ T
cells.
[0017] FIGS. 4A and 4B are graphs showing the percentage of KLRG1-
CD62L.sup.+Thy1.1.sup.+ cells (A) or KLRG1.sup.+CD62L.sup.- (B)
cells from lymph node and lungs 5 days post transfer after
heterologous reinfection with gp100 adenovirus (on day 30 of
gp100-VV infection) after gating on Thy1.1.sup.30
Ly5.1.sup.-CD8.sup.+ cells or Thy1.1.sup.+ Ly5.1.sup.+ CD8.sup.+ T
cells.
[0018] FIG. 5A is a graph showing the percentage of BrdU-positive,
Thy1.1.sup.+ pme1-1.sup.+ WT (circles) and BrdU-positive,
Thy1.1.sup.+ pme1-1.sup.+ c-Myb.sup.-/- CD8.sup.+ T cells (squares)
after transfer into gp-100VV-infected mice. Data were obtained from
3 or 4 mice per group. Data show mean and error bars are standard
errors of mean. **P<0.01.
[0019] FIG. 5B is a graph showing the percentage of Annexin
V-positive, Thy1.1.sup.+ pme1-1.sup.+ WT (circles) and Annexin
V-positive, Thy1.1.sup.+ pme1-1.sup.+ c-Myb.sup.-/- CD8.sup.+ T
cells (squares) after transfer into gp-100VV-infected mice. Data
were obtained from 3 or 4 mice per group. Data show mean and error
bars are standard errors of mean. *P<0.05.
[0020] FIG. 6A is a graph showing the expression of Zeb2 relative
to the expression of Rp113 in WT or c-Myb.sup.-/- naive,
CD62L.sup.+ and CD62L.sup.-populations of pme1-1 CD8.sup.+ T cells
before (0) or 5 days after adoptive transfer into recipient
wild-type mice infected with gp100-VV. WT naive (.box-solid.),
c-Myb.sup.-/- naive (.quadrature.), WT CD62L.sup.HI
(.tangle-solidup.), c-Myb.sup.-/- CD62L.sup.HI (.DELTA.), WT
CD62L.sup.LO (.circle-solid.), c-Myb.sup.-/- CD62L.sup.LO
(.smallcircle.).
[0021] FIG. 68 is a graph showing the expression of Tcf7 relative
to the expression of beta-actin in WT or c-Myb.sup.-/- naive,
CD62L.sup.+ and CD62L.sup.- populations of pme1-1 CD8.sup.+ T cells
before (0) or 5 days after adoptive transfer into recipient
wild-type mice infected with gp100-VV. WT naive (.box-solid.),
c-Myb.sup.-/- naive (.quadrature.), WT CD62L.sup.HI
(.tangle-solidup.), c-Myb.sup.-/- CD62L.sup.HI (.DELTA.), WT
CD62L.sup.LO (.circle-solid.), c-Myb.sup.-/- CD62L.sup.LO
(.smallcircle.).
[0022] FIG. 6C is a graph showing the expression of Zeb2 relative
to the expression of Rp113 by CD62L.sup.- populations of Thy1.1
(.tangle-solidup.) or Thy1.1 c-Myb overexpressing () pme1-1
CD8.sup.+ T cells 5 days after adoptive transfer into recipient
wild-type mice infected with gp100-VV. WT (circles) and MYB.sup.-/-
(squares) T cells served as controls.
[0023] FIG. 6D is a graph showing the expression of Tcf7 relative
to the expression of beta-actin by CD62L.sup.- populations of
Thy1.1 (.tangle-solidup.) or Thy1.1 c-Myb overexpressing () pme1-1
CD8.sup.+ T cells 5 d after adoptive transfer into recipient
wild-type mice infected with gp100-VV. WT (circles) and MYB.sup.-/-
(squares) T cells served as controls.
[0024] FIG. 6E is a graph showing the expression pattern of c-Myb
(circles) and Tcf7 (squares) in naive, CD62L.sup.+ and CD62L.sup.-
populations 5 days after adoptive transfer into recipient wild-type
mice infected with gp100-VV. Results are presented relative to Actb
(encoding .beta.-actin).
[0025] FIG. 6F is a graph showing the expression pattern of c-Myb
(circles) and Zeb2 (squares) in naive, CD62L.sup.+ and CD62L.sup.-
populations 5 days after adoptive transfer into recipient wild-type
mice infected with gp100-VV. Results are presented relative to Actb
(encoding .beta.-actin) err Rp113 mRNA.
[0026] FIG. 6G is a graph showing the results or a quantitative PCR
analysis of the promoter regions of Tcf7 and Zeb2 in chromatin
immunoprecipitation with anti-IgG or anti-c-Myb. Error bars
represent standard errors of mean. ***P<0.001, **P<0.1,
*P<0.05.
[0027] FIG. 7 is a graph showing the mean fluorescence index (MFI)
of GFP expression measured in Thy1.1 or Thy1.1 c-Myb CD8.sup.+ T
cells from Tcf7 GFP reporter mice (p45 mice) on day 3 after in
vitro activation of naive cells and on day 2 after retroviral
transduction of either Thy1.1 (circles) or Thy1.1 c-Myb
(squares).
[0028] FIG. 8 is a graph showing the percentage of KLRG1.sup.hi CD8
T cells five days after adoptive transfer of WT (circles) or
c-Myb-/- CD8+ T cells transduced either with an empty vector
control MIG (squares) or Tcf7 MIG (triangles) into recipient
mice.
DETAILED DESCRIPTION OF THE INVENTION
[0029] V-Myb Avian Myeloblastosis Viral Oncogene Homolog (c-Myb) is
a transcription factor that is a member of the MYB family of
transcription factor genes. The c-Myb protein contains three
domains: an N-terminal DNA-binding domain, a central
transcriptional activation domain, and a C;-terminal
transcriptional repression domain.
[0030] It has been discovered that c-Myb is involved i T cell
differentiation. In particular, it has been discovered that T cells
lacking c-Myb are more differentiated as compared to their
wild-type counterparts, whereas T cells that are modified to
overexpress c-Myb are less differentiated than their wild-type
counterparts.
[0031] An embodiment of the invention provides an isolated or
purified T cell comprising an antigen-specific receptor. The T cell
has been modified to express a transcription factor at a level that
is higher than the level of the transcription factor expressed by a
T cell that has not been modified to express the transcription
factor, wherein the transcription factor is c-Myb, a functional
variant of c-Myb, or a functional fragment of c-Myb. Hereinafter,
c-Myb, functional variants of c-Myb, and functional fragments of
c-Myb are referred to collectively as "c-Myb," unless specified
otherwise.
[0032] The inventive T cells have been modified to overexpress
c-Myb. In this regard, the modified T cell expresses c-Myb at a
level that is higher than the level of c-Myb expressed by a T cell
that has not been modified with respect to c-Myb expression (e.g.,
wild-type T cells). For example, if the T cell has been modified to
comprise a vector encoding c-Myb, as described in more detail
below, the modified T cell including the vector expresses c-Myb at
a level that is higher than the level of c-Myb expressed by a
control T cell that does not contain the vector,
[0033] The inventive T cells may provide many advantages, for
example, an increase of any one or more of in vivo proliferation,
survival, persistence, anti-tumor activity, and anti-viral activity
as compared to T cells that have not been modified to overexpress
c-Myb (e.g., T cells that lack the vector). The inventive T cells
may be less differentiated as compared to T cells that have not
been modified to overexpress c-Myb T cells that lack the vector).
The numbers of less differentiated T cells are believed to expand
to greater numbers in vitro as compared to more differentiated T
cells. Therefore, the inventive T cells may he suitable for
adoptive cell therapy.
[0034] The T cell may be isolated or purified. The term "isolated,"
as used herein, means having been removed from its natural
environment. The term "purified," as used herein, means having been
increased in purity, wherein "purity" is a relative term, and not
to be necessarily construed as absolute purity. A "purified" T cell
refers to a T cell which has been separated from other natural
components, such as tissues, cells, proteins, nucleic acids,
etc.
[0035] The T cell can be any T cell, such as a cultured T e.g., a
primary T cell, or a T cell from a cultured T cell line, e.g.,
Jurkat, SupT1 etc., or a T cell obtained from a mammal. If obtained
from a mammal, the T cell can be obtained from numerous sources,
including but not limited to blood, bone marrow, lymph node,
thymus, spleen, or other tissues or fluids. Cells can also be
enriched for or purified. Preferably, the T cell is a human T cell.
The T cell can be any type of T cell and can be of any
developmental stage, including but not limited to,
CD4.sup.+/CD8.sup.+ double positive T cells, CD4.sup.+ helper T
cells, e.g., Th.sub.1 and Th.sub.2 cells, CD4.sup.+ T cells,
CD8.sup.+ T cells (e.g., cytotoxic T cells), peripheral blood
mononuclear cells (PBMCs), peripheral blood leukocytes (PBLs),
tumor infiltrating cells (TILs), memory T cells, naive T cells, and
the like. Preferably, the cell is a CD8.sup.+ T cell.
[0036] The inventive compositions can comprise a single T cell or a
population thereof. The population of T cells can be a
heterogeneous population comprising the T cell that has been
modified to overexpress c-Myb (e.g., a T cell comprising the
vector), in addition to at least one other cell, e.g., a T cell,
which has not been modified to overexpress c-Myb (e.g., T cell
lacking the vector), or a cell other than a T cell, e.g., a B cell,
a macrophage, a neutrophil, an erythrocyte, a melanocyte, a
hepatocyte, an endothelial cell, an epithelial cell, a muscle cell,
a brain cell, etc. Alternatively, the population of cells can be a
substantially homogeneous population, in which the population
mainly comprises T cells that have been modified to overexpress
c-Myb. The population also can be a clonal population of T cells,
in which all T cells of the population are clones of a single T
cell that has been modified to overexpress c-Myb, such that all T
cells of the population overexpress c-Myb and have genetically
identical TCRs.
[0037] A T cell of the invention can be present in a population of
cells or a composition in an amount of about 10% or more, e.g.,
about 30% or more, about 50% or more, about 60% or more, about 70%
or more, about 75% or more, about 80% or more, about 85% or more,
or about 90% or more, based on the total number of cells in the
population or composition. Alternatively, or in addition, the T
cell of the invention can be present in a population of cells or a
composition in an amount of about 95% or less, e.g., about 90% or
less, about 85% or less, about 80% or less, about 75% or less,
about 70% or less, about 60% or less, about 40% or less, or about
30% or less based on the total number of cells in the population or
composition. Thus, the T cell of the invention can be present in a
population of cells or a composition in an amount bounded by any
two of the above endpoints. For example, the T cell of the
invention can be present in a population of cells or a composition
in an amount of about 30 to about 60%, about 50 to about 90%, about
60 to about 80%, about 80 to about 90%, or about 75 to about
85%.
[0038] In an embodiment of the invention, the T cell comprises an
antigen-specific receptor. The phrases "antigen-specific" and
"antigenic specificity," as used herein, mean that the receptor can
specifically bind to and immunologically recognize an antigen, or
an epitope thereof, such that binding of the receptor to antigen,
or the epitope thereof, elicits an immune response. In an
embodiment of the invention, the antigen-specific receptor is a T
cell receptor (TCR). The antigen-specific TCR generally comprises
two polypeptides (i.e., polypeptide chains), such as an
.alpha.-chain of a TCR, a .beta.-chain of a TCR, a .gamma.-chain of
a TCR, a .delta.-chain of a. TCR, or a combination thereof. Such
polypeptide chains of TCR.s are known in the art. The
antigen-specific receptor can comprise any amino acid sequence,
provided that the receptor can specifically bind to and
immunologically recognize an antigen, such as a disease-associated
antigen or epitope thereof.
[0039] The antigen-specific receptor can be an endogenous TCR,
i.e., the antigen-specific TCR that is endogenous or native to
(naturally-occurring on) the T cell. In such a case, the T cell
comprising the endogenous TCR can be a T cell that was isolated
from a mammal which is known to express the particular
disease-specific antigen. In certain embodiments, the T cell is a
primary T cell isolated from a host afflicted with a cancer. In
some embodiments, the T cell is a tumor infiltrating lymphocyte
(TIL) or a peripheral blood lymphocyte (PBL) isolated from a human
cancer patient.
[0040] In some embodiments, the mammal from which a T cell is
isolated is immunized with an antigen of, or specific for, a
disease. Desirably, the mammal is immunized prior to obtaining the
T cell from the mammal, In this way, the isolated T cells can
include T cells induced to have specificity for the disease to be
treated, or can include a higher proportion of cells specific for
the disease.
[0041] Alternatively, a T cell comprising an endogenous
antigen-specific TCR can be a T cell within a mixed population of
cells isolated from a mammal, and the mixed population can be
exposed to the antigen which is recognized by the endogenous TCR
while being cultured in vitro, in this manner, the T cell which
comprises the TCR that recognizes the disease-specific antigen,
expands or proliferates in vitro, thereby increasing the number of
T cells having the endogenous antigen-specific receptor.
[0042] The antigen-specific TCR can be an exogenous TCR, i.e., an
antigen-specific TCR that is not native to (not naturally-occurring
on) the T cell. A recombinant TCR is a TCR which has been generated
through recombinant expression of one or more exogenous TCR
.alpha.-, .beta.-, .gamma.-, and/or .delta.-chain encoding genes, A
recombinant TCR can comprise polypeptide chains derived entirely
from a single mammalian species, or the antigen-specific TCR can be
a chimeric or hybrid TCR comprised of amino acid sequences derived
from TCRs from two different mammalian species. For example, the
antigen-specific TCR can comprise a variable region derived from a
murine TCR, and a constant region of a human TCR such that the TCR
is "humanized." Methods of making recombinant TCRs are known in the
art. See, for example, U.S. Pat. Nos. 7,820,174; 8,785,601;
8,216,565; and U.S. Patent Application Publication No.
2013/0274203.
[0043] A T cell of the invention comprising an endogenous
antigen-specific TCR can also be transformed, e.g., transduced or
transfected, with one or more nucleic acids encoding an exogenous
(e.g., recombinant) TCR or other recombinant chimeric receptor.
Such exogenous chimeric receptors, e.g., chimeric TCRs, can confer
specificity for additional antigens to the transformed T cell
beyond the antigens for which the endogenous TCR is naturally
specific. This can, but need not, result in the production of T
cell having dual antigen specificities.
[0044] In an embodiment of the invention, the antigen-specific
receptor is a "chimeric antigen receptor" (CAR). Typically, a CAR
comprises the antigen binding domain of an antibody, e.g., a
single-chain variable fragment (scFv), fused to the transmembrane
and intracellular domains of a TCR. Thus, the antigenic specificity
of a TCR of the invention can be encoded by a scFv which
specifically binds to the antigen, or an epitope thereof. Methods
of making such chimeric TCRs are known in the art. See, for
example, U.S. Pat. No. 8,465,743 and U.S. Patent Application
Publication Nos. 2014/0037628 and 2014/0274909.
[0045] Any suitable nucleic acid encoding a CAR, TCR, or TCR-like
protein or polypeptide can be used. In these embodiments,
transformation with a nucleic acid encoding c-Myb, as discussed
below, can occur before, after, or simultaneously with,
antigen-specific receptor transformation. The antigen-specific
receptor encoded by the transformed nucleic acids can be of any
suitable form including for example, a single-chain TCR or a fusion
with other proteins or polypeptides (e.g., without limitation
co-stimulatory molecules).
[0046] The antigen which is recognized by the antigen-specific
receptor can be any antigen which is characteristic of a disease.
For example, the antigen may be, but is not limited to, a cancer
antigen (also termed a tumor antigen or a tumor associated antigen)
or a viral antigen. Viral antigens are known in the art and
include, for example, any viral protein, e.g., env, gag, pol,
gp120, thymidine kinase, and the like.
[0047] The term "cancer antigen," as used herein, refers to any
molecule (e.g., protein, polypeptide, peptide, lipid, carbohydrate,
etc.) solely or predominantly expressed or over-expressed by a
tumor cell or cancer cell, such that the antigen is associated with
the tumor or cancer. The cancer antigen can additionally be
expressed by normal, non-tumor, or non-cancerous cells. However, in
such cases, the expression of the cancer antigen by normal,
non-tumor, or non-cancerous cells is not as robust as the
expression by tumor or cancer cells. In this regard, the tumor or
cancer cells can over-express the antigen or express the antigen at
a significantly higher level, as compared to the expression of the
antigen by normal, non-tumor, or non-cancerous cells. Also, the
cancer antigen can additionally be expressed by cells of a
different state of development or maturation. For instance, the
cancer antigen can be additionally expressed by cells of the
embryonic or fetal stage, which cells are not normally found in an
adult host. Alternatively, the cancer antigen can be additionally
expressed by stern cells or precursor cells, which cells are not
normally found in an adult host. Cancer antigens are known in the
art and include, for instance, mesothelin, CD19, CD22, CD276
(B7H3), gp100, MART-1, Epidermal Growth Factor Receptor Variant III
(EGFRVIII), TRP-1, TRP-2, tyrosinase, NY-ESO-1 (also known as
CAG-3), MAGE-1, MAGE-3, etc.
[0048] The cancer antigen can he an antigen expressed by any cell
of any cancer or tumor, including the cancers and tumors described
herein. The cancer antigen may be a cancer antigen of only one type
of cancer or tumor, such that the cancer antigen is associated with
or characteristic of only one type of cancer or tumor.
Alternatively, the cancer antigen may be a cancer antigen (e.g.,
may be characteristic) of more than one type of cancer or tumor.
For example, the cancer antigen may be expressed by both breast and
prostate cancer cells and not expressed at all by normal,
non-tumor, or non-cancer cells.
[0049] The disease which is associated with or is characterized by
the antigen recognized by the antigen-specific receptor can be any
disease. For instance, the disease can be a cancer or a viral
disease, as discussed herein.
[0050] The cancer may be any cancer, including any of acute
lymphocytic cancer, acute myeloid leukemia, alveolar
rhabdomyosarcoma, bone cancer, brain cancer, breast cancer, cancer
of the anus, anal canal, or anorectum, cancer of the eye, cancer of
the intrahepatic bile duct, cancer of the joints, cancer of the
neck, gallbladder, or pleura, cancer of the nose, nasal cavity, or
middle ear, cancer of the oral cavity, cancer of the vulva, chronic
lymphocytic leukemia, chronic myeloid cancer, colon cancer,
esophageal cancer, cervical cancer, gastrointestinal carcinoid
tumor, Hodgkin lymphoma, hypopharynx cancer, kidney cancer, larynx
cancer, liver cancer, lung cancer, malignant mesothelioma,
melanoma, multiple myeloma, nasopharynx cancer, non-Hodgkin
lymphoma, ovarian cancer, pancreatic cancer, peritoneum, omentum,
and mesentery cancer, pharynx cancer, prostate cancer, rectal
cancer, renal cancer (e.g., renal cell carcinoma (RCC)), small
intestine cancer, soft tissue cancer, stomach cancer, testicular
cancer, thyroid cancer, ureter cancer, and urinary bladder cancer.
In certain preferred embodiments, the antigen-specific receptor has
specificity for an antigen derived from melanoma.
[0051] For purposes herein, "viral disease" means a disease that
can be transmitted from person to person or from organism to
organism, and is caused by a virus. In an embodiment of the
invention, the viral disease is caused by a virus selected from the
group consisting of herpes viruses, pox viruses, hepadnaviruses,
papilloma viruses, adenoviruses, coronoviruses, orthomyxoviruses,
paramyxoviruses, flaviviruses, and caliciviruses. For example, the
viral disease may be caused by a virus selected from the group
consisting of respiratory syncytial virus (RSV), influenza virus,
herpes simplex virus, Epstein-Barr virus, varicella virus,
cytomegalovirus, hepatitis A virus, hepatitis B virus, hepatitis C
virus, human immunodeficiency virus (HIV), human T-lymphotropic
virus, calicivirus, adenovirus, and Arena virus.
[0052] The viral disease may be, for example, influenza, pneumonia,
herpes, hepatitis, hepatitis A, hepatitis B. hepatitis C, chronic
fatigue syndrome, sudden acute respiratory syndrome (SARS),
gastroenteritis, enteritis, carditis, encephalitis, bronchiolitis,
respiratory papillomatosis, meningitis, HIV/AIDS, and
mononucleosis.
[0053] A T cell comprising an antigen-specific receptor can be
isolated or purified from a source using any suitable technique
known in the art. For example, a T cell comprising an
antigen-specific TCR present in a mammalian tissue, biological
fluid (e.g., blood), or in vitro culture medium can be separated
from impurities, e.g., other cell types, proteins, nucleic acids,
etc. using flow cytometry, immunomagnetic separation, or a
combination thereof.
[0054] An isolated or purified T cell may be modified to
overexpress c-Myb. The T cell may be modified to overexpress c-Myb
in any suitable manner. In an embodiment of the invention, the T
cell may he modified to overexpress c-Myb using genome editing
techniques. Genome editing techniques can modify gene expression in
a target cell by inserting, replacing, or removing DNA in the
genome using an artificially engineered nuclease. Examples of such
nucleases may include zinc finger nucleases (ZFNs) (Gommans et al.,
J. Mol. Biol., 354(3): 507-519 (2005)), transcription
activator-like effector nucleases (TALENs) (Zhang et al., Nature
Biotechnol., 29: 149-153 (2011)), the CRISPR/Cas system (Cheng et
al., Cell Res., 23: 1163-71 (2013)), and engineered meganucleases
(Riviere et al., Gene Ther., 21(5): 529-32 (2014)). The nucleases
create specific double-stranded breaks (DSBs) at targeted locations
in the genome, and use endogenous mechanisms in the cell to repair
the induced break by homologous recombination (HR) and
nonhomologous end-joining (NHEJ). Such techniques may be used to
achieve overexpression of c-Myb in T cells.
[0055] In another embodiment of the invention, the T cell may be
modified (e.g., transduced or transfected) so as to comprise a
nucleic acid encoding c-Myb. Preferably, the nucleic acid is a
recombinant nucleic acid. As used herein, the term "recombinant"
refers to (i) molecules that are constructed outside living cells
by joining natural or synthetic nucleic acid segments to nucleic
acid molecules that can replicate in a living cell, or (ii)
molecules that result from the replication of those described in
(i) above. For purposes herein, the replication can be in vitro
replication or in viva replication.
[0056] The terms "nucleic acid" and "polynucleotide," as used
herein, refer to a polymeric form of nucleotides of any length,
either ribonucleotides (RNA) or deoxyribonucleotides (DNA). These
terms refer to the primary structure of the molecule, and thus
include double- and single-stranded DNA, double- and
single-stranded RNA, and double-stranded DNA-RNA hybrids. The terms
include, as equivalents, analogs of either RNA or DNA made from
nucleotide analogs and modified polynucleotides such as, though not
limited to, methylated and/or capped polynucleotides. Suitable
nucleotide analogs are known and are described in, e.g., U.S.
Patent Application Publication 2012/0101148, and references cited
therein. In an embodiment of the invention, the nucleic acid is
complementary DNA (cDNA).
[0057] The term "nucleotide" as used herein refers to a monomeric
subunit of a polynucleotide that consists of a heterocyclic base, a
sugar, and one or more phosphate groups. The naturally occurring
bases (guanine (G), adenine (A), cytosine (C), thymine (T), and
uracil (U)) are typically derivatives of purine or pyrimidine,
though the invention includes the use of naturally and
non-naturally occurring base analogs. The naturally occurring sugar
is the pentose (five-carbon sugar) deoxyribose (which forms DNA) or
ribose (which forms RNA), though the invention includes the use of
naturally and non-naturally occurring sugar analogs. Nucleic acids
are typically linked via phosphate bonds to form nucleic acids or
polynucleotides, though many other linkages are known in the art
(e.g., phosphorothioates, boranophosphates and the like). Methods
of preparing polynucleotides are within the ordinary skill in the
art (Green and Sambrook, Molecular Cloning: A Laboratoty Manual,
(4th Ed.) Cold Spring Harbor Laboratory Press, New York
(2012)).
[0058] The nucleic acid may comprise any suitable c-Myb nucleotide
sequence, which may encode any suitable c-Myb amino acid sequence
from any mammal. In an embodiment of the invention, the c-Myb
sequence is a mouse c-Myb sequence. Two mouse e-Myb transcriptional
variants include mRNA Genbank Accession Nos.: NM_001198914.1 (SEQ
NO: 1) and NM_010848.3 (SEQ ID NO: 2), with corresponding protein
sequence Genbank Accession Nos. NP_001185843.1 (SEQ ID NO: 3) and
NP_034978.3 (SEQ ID NO: 4), respectively. Mouse genomic c-Myb
sequences include Genbank Accession Nos: NC_000076.6, CH466562.2,
K03547.1, and AC_00032.1. Mouse c-Myb mRNA sequences also include
Genbank Accession Nos: AA170760.1AK036518.1, AK038118.1 AK084390.1,
and AK088020.1. Mouse c-Myb amino acid sequences also include
Genbank Accession Nos: EDL03418.1, EDL03419.1AAA39786.1,
AAA39787.1AAA39783.1, AAA39782.1, AAA37505.1, CAA27724.1,
CAA34425.1, and CAA34426.1. Other mouse sequences, as well as other
e-Myb species, can be employed in accordance with the
invention.
[0059] In a preferred embodiment of the invention, the c-Myb
sequence is a human c-Myb sequence. Eight human c-Myb
transcriptional variants and their corresponding amino acid
sequences are set forth in Table A.
TABLE-US-00001 TABLE A Isoform mRNA Amino acid sequence Number
Genbank Accession No. Genbank Accession No. 1 NM_001130173.1
NP_001123645.1 (SEQ ID NO: 5) (SEQ ID NO: 13) 2 NM_005375.2
NP_005366.2 (SEQ ID NO: 6) (SEQ ID NO: 14) 3 NM_001130172.1
NP_001123644.1 (SEQ ID NO: 7) (SEQ ID NO: 15) 4 NM_001161656.1
NP_001155128.1 (SEQ ID NO: 8) (SEQ ID NO: 16) 5 NM_001161657.1
NP_001155129.1 (SEQ ID NO: 9) (SEQ ID NO: 17) 6 NM_001161658.1
NP_001155130.1 (SEQ ID NO: 10) (SEQ ID NO: 18) 7 NM_001161659.1
NP_001155131.1 (SEQ ID NO: 11) (SEQ ID NO: 19) 8 NM_001161660.1
NP_001155132.1 (SEQ ID NO: 12) (SEQ ID NO: 20)
Human genomic c-Myb sequences include Genbank Accession Nos:
NG_012330.1, NC_900006.12, and NC_018917.2. Human c-Myb mRNA
sequences also include Genbank Accession Nos: AJ606317.1,
AJ606318.1, AJ606319.1, AJ606320.1, AJ606321.1AJ606322.1,
AJ606323.1, AJ606324.1, AJ616235.1, and AJ616791.1. Human c-Myb
amino acid sequences also include Genbank Accession Nos:
EAW47968.1, CAS01767.1, ADL14499.1, CBX51725.1, AAA72118.1,
AAB49034.1, AAB49035.1, AAC96326.1, CAE55168,1, CAE55169.1,
CAE55170.1, CAE55171.1, CAE55172.1, CAE55173.1, CAE55174.1,
CAE55175.1, CAE82649.1 and CAF04477.1. Other human sequences, as
well as other c-Myb species can be employed in accordance with the
invention.
[0060] In an embodiment of the invention, the nucleic acid
comprises a codon-optimized nucleotide sequence encoding c-Myb.
Without being bound to any particular theory or mechanism, it is
believed that codon optimization of the nucleotide sequence
increases the translation efficiency of the mRNA transcripts. Codon
optimization of the nucleotide sequence may involve substituting a
native codon for another codon that encodes the same amino acid,
but can be translated by tRNA that is more readily available within
a cell, thus increasing translation efficiency. Optimization of the
nucleotide sequence may also reduce secondary mRNA structures that
would interfere with translation, thus increasing translation
efficiency. In this regard, the nucleic acid encoding c-Myb may
comprise the codon-optimized nucleotide sequence of SEQ ID NO: 21,
which encodes mouse c-Myb having the amino acid sequence of SEQ ID
NO: 4.
[0061] In other embodiments, the nucleic acid comprises, consists
essentially of, or consists of a nucleic acid sequence which is at
least about 75%, e.g., at least about 80%, at least about 85%, at
least about 90%, at least about 91%, at least about 92%, at least
about 93%, at least about 94%, at least about 95%, at least about
96%, at least about 97%, at least about 98%, or at least about 99%
identical to any of the nucleotide sequences described herein,
e.g., any one of SEQ ID NOs: 1-2, 5-12, and 21. In an embodiment of
the invention, the nucleic acid comprises, consists essentially of,
or consists of the nucleotide sequence of any one of SEQ ID NOs:
1-2, 5-12, and 21.
[0062] In certain preferred embodiments, the nucleic acid encoding
c-Myb is carried in a recombinant expression vector, Accordingly,
an embodiment of the invention provides an isolated or purified T
cell comprising a vector comprising (i) a nucleic acid encoding
c-Myb and (ii) a heterologous nucleic acid sequence, The phrase
"heterologous nucleic acid sequence," as used herein, means a
nucleic acid sequence that does not naturally occur in the species
that expresses the c-Myb encoded by the vector. For example, if the
c-Myb encoded by the vector is mouse c-Myb, the heterologous
nucleic acid sequence in the vector may be any sequence that does
not naturally occur in a mouse. In an embodiment in which the c-Myb
encoded by the vector is human c-Myb, the heterologous nucleic acid
sequence in the vector may be any sequence that does not naturally
occur in a human. The heterologous nucleic acid sequence may be a
nucleic acid sequence from any species other than the species that
expresses the c-Myb encoded by the vector.
[0063] The recombinant expression vector can comprise any type of
nucleotides, including, but not limited to DNA and RNA, which can
he single-stranded or double-stranded, synthesized or obtained in
part from natural sources, and which can contain natural,
non-natural or altered nucleotides. The recombinant expression
vectors can comprise naturally-occurring or non-naturally-occurring
intemucleotide linkages, or both types of linkages. The vector may
contain regulatory nucleic acid sequences which provide for c-Myb
expression.
[0064] The recombinant expression vector can be any suitable
recombinant expression vector that contains a heterologous nucleic
acid sequence as described above. Suitable vectors include those
designed for propagation and expansion or for expression or both,
such as plasmids and viruses. For example, the vector can be
selected from the pUC series (Fermentas Life Sciences, Glen Burnie,
Md.), the pBluescript series (Stratagene, LaJolla, Calif.), the pET
series (Novagen, Madison, Wis.), the pGEX series (Pharmacia
Biotech, Uppsala, Sweden), and the pEX series (Clontech, Palo Alto,
Calif.). Bacteriophage vectors, such as .lamda.GT10, .lamda.GT11,
.lamda.Zap11 (Stratagene), .lamda.EMBL4, and .lamda.NM1149, also
can be used. Examples of plant expression vectors useful in the
context of the invention include pBI01, pBI01.2, pBI101.3, pBI121
and pBIN19 (Clontech). Examples of animal expression vectors useful
in the context of the invention include pEUK-C1, pMAM, and pMAMneo
(Clontech).
[0065] In some embodiments, the recombinant expression vector is a
viral vector. Suitable viral vectors include, without limitation,
retroviral vectors, alphaviral, vaccinial, adenoviral,
adenoassociated viral, herpes viral, and fowl pox viral vectors,
and preferably have a native or engineered capacity to transform T
cells.
[0066] The recombinant expression vectors can he prepared using
standard recombinant DNA techniques described in, for example,
Green and Sambrook, supra. Constructs of expression vectors, which
are circular or linear, can be prepared to contain a replication
system functional in a prokaryotic or eukaryotic host cell.
Replication systems can be derived, e.g., from ColEl, 2.mu.
plasmid, .lamda., SV40, bovine papilloma virus, and the like.
[0067] The recombinant expression vector can comprise regulatory
sequences, such as transcription and translation initiation and
termination codons, which are specific to the type of host (e.g.,
bacterium, fungus, plant, or animal) into which the vector is to be
introduced, as appropriate, and taking into consideration whether
the vector is DNA- or RNA-based,
[0068] The recombinant expression vector can include one or more
marker genes, which allow for selection of transformed or
transfected hosts. Marker genes include biocide resistance, e.g.,
resistance to antibiotics, heavy metals, etc., complementation in
an auxotrophic host to provide prototrophy, and the like. Suitable
marker genes for the recombinant expression vectors include, for
instance, neomycin/G418 resistance genes, hygromycin resistance
genes, histidinol resistance genes, tetracycline resistance genes,
and ampicillin resistance genes.
[0069] The recombinant expression vector can comprise a native or
nonnative promoter operably linked to the nucleic acid encoding
c-Myb. Preferably, the promoter is functional in T cells. The
selection of a promoter, e.g., strong, weak, inducible,
tissue-specific and developmental-specific, is within the ordinary
skill of the artisan. Similarly, the combining of a nucleotide
sequence with a promoter is also within the skill of the artisan.
The promoter can be a non-viral promoter or a viral promoter, e.g.,
a cytomegalovirus (CMV) promoter, an SV40 promoter, an RSV
promoter, or a promoter found in the long-terminal repeat of the
murine stem cell virus.
[0070] The recombinant expression vector can be designed for either
transient expression, for stable expression, or for both. Also, the
recombinant expression vectors can be made for constitutive
expression or for inducible expression.
[0071] The vectors useful in the context of the invention can be
"naked" nucleic acid vectors (i.e., vectors having little or no
proteins, sugars, and/or lipids encapsulating them), or vectors
complexed with other molecules. Other molecules that can be
suitably combined with the vectors include without limitation viral
coats, cationic lipids, liposomes, polyamines, gold particles, and
targeting moieties such as ligands, receptors, or antibodies that
target cellular molecules.
[0072] The nucleic acid may encode a polypeptide or protein
comprising any of the c-Myb amino acid sequences described herein.
In an embodiment of the invention, the nucleic acid encodes a c-Myb
polypeptide or protein comprising, consisting, or consisting
essentially of an amino acid sequence selected from the group
consisting of SEQ ID NOs: 3-4 and 13-20.
[0073] In an embodiment of the invention, the nucleic acid encodes
a functional fragment of any of the c-Myb proteins or polypeptides
described herein. The functional fragment of the c-Myb protein or
polypeptide can comprise any contiguous part of the c-Myb protein
or polypeptide that retains a relevant biological activity of the
c-Myb protein or polypeptide, e.g., any one or more of inhibiting
the differentiation of T cells, binding to DNA, activating
transcription, and repressing transcription. Any given fragment of
a c-Myb protein or polypeptide can be tested for such biological
activity using methods known in the art. For example, the
functional fragment can comprise, consist essentially of, or
consist of any one or two of the N-terminal DNA-binding domain, the
central transcriptional activation domain, or the C-terminal
transcriptional repression domain of any of the c-Myb proteins or
polypeptides described herein. In reference to the parent c-Myb
protein or polypeptide, the functional fragment preferably
comprises, for instance, about 10% or more, 25% or more, 30% or
more, 50% or more, 60% or more, 80% or more, 90% or more, or even
95% or more of the parent c-Myb protein or polypeptide.
[0074] In an embodiment of the invention, the nucleic acid encodes
a functional variant of any of the c-Myb proteins or polypeptides
described herein. The term "functional variant," as used herein,
refers to a c-Myb protein or polypeptide having substantial or
significant sequence identity or similarity to a parent c-Myb
protein or polypeptide, which functional variant retains the
biological activity of the c-Myb protein or polypeptide of which it
is a variant. Functional variants encompass, for example, those
variants of the c-Myb protein or polypeptide described herein (the
parent c-Myb protein or polypeptide) that retain the ability to
provide any one or more of inhibition of T cell differentiation,
binding to DNA, activation of transcription, and repression of
transcription to a similar extent, the same extent, or to a higher
extent, as the parent c-Myb protein or polypeptide. In reference to
the parent c-Myb protein or polypeptide, the functional variant
can, for instance, be at least about 75%, e.g., at least about 80%,
at least about 85%, at least about 90%, at least about 91%, at
least about 92%, at least about 93%, at least about 94%, at least
about 95%, at least about 96%, at least about 97%, at least about
98%, or at least about 99% identical in amino acid sequence to any
of the c,-Myb proteins or polypeptides described herein, e.g., any
one of SEQ ID NOs: 3-4 and 13-20.
[0075] The functional variant can, for example, comprise the amino
acid sequence of the parent c-Myb protein or polypeptide with at
least one conservative amino acid substitution. Conservative amino
acid substitutions are known in the art, and include amino acid
substitutions in whiCh one amino acid having certain physical
andlor chemical properties is exchanged for another amino acid that
has the same chemical or physical properties. For instance, the
conservative amino acid substitution can be an acidic amino acid
substituted for another acidic amino acid (e.g., Asp or Glu), an
amino acid with a nonpolar side chain substituted for another amino
acid with a nonpolar side chain (e,g., Ala, Gly, Val, Ile, Leu,
Met, Phe, Pro, Trp, Val, etc.), a basic amino acid substituted for
another basic amino acid (Lys, Arg, etc.), an amino acid with a
polar side chain substituted for another ammo acid with a polar
side chain (Asn, Cys, Gln, Ser, Thr, Tyr, etc.), etc.
[0076] Alternatively or additionally, the functional variants can
comprise the amino acid sequence of the parent c-Myb protein or
polypeptide with at least one non-conservative amino acid
substitution. In this case, it is preferable for the
non-conservative amino acid substitution to not interfere with or
inhibit the biological activity of the functional variant.
Preferably, the non-conservative amino acid substitution enhances
the biological activity of the functional variant, such that the
biological activity of the functional variant is increased as
compared to the parent c-Myb protein or polypeptide.
[0077] The c-Myb protein or polypeptide can consist essentially of
the specified amino acid sequence or sequences described herein,
such that other components of the functional variant, e.g., other
amino acids, do not materially change the biological activity of
the functional variant. In this regard, the c-Myb protein or
polypeptide can, for example, consist essentially of the amino acid
sequence of any of SEQ ID NOs: 3-4 and 13-20.
[0078] Preferably, a T cell comprising an antigen-specific receptor
is isolated or purified as described herein, and then contacted
with a nucleic acid encoding c-Myb ex vivo or in vitro using
methods described herein or any other method known in the art.
Examples of such methods include, but are not limited to, the use
of a lipid, protein, particle, or other molecule capable of
facilitating cell transformation with the nucleic acid. However, a
T cell comprising an antigen-specific receptor also can he
contacted with a nucleic acid encoding c-Myb in vivo, such as by
way of a gene gun, for example.
[0079] T cells which have not been modified to overexpress c-Myb
(including, e.g., T cells that lack the vector encoding c-Myb) may
express a basil level of c-Myb mRNA, polypeptide, or protein.
However, the isolated or purified T cell of the invention
overexpresses c-Myb mRNA, polypeptide, or protein as compared to a
control T cell that has not been modified to overexpress c-Myb. For
example, without limiting the invention, a T cell or a population
thereof overexpressing c-Myb can contain an amount of c-Myb (mRNA,
protein, or polypeptide) that is 1.5-fold higher or more, e.g.,
2-fold higher or more, 3-fold higher or more, 5-fold higher or
more, 10-fold higher or more, 20-fold higher or more, or 50-fold
higher or more, than the amount of c-Myb present in a control T
cell or a population thereof that has not been modified to
overexpress c-Myb. Thus, the c-Myb can be present in a T cell or
population thereof in an amount bounded by any two of the above
endpoints. For example, the T cell or a population thereof
overexpressing c-Myb can contain an amount of c-Myb (mRNA, protein,
or polypeptide) that is about 1.5-fold to about 20-fold higher,
about 2-fold to about 5-fold higher, about 3-fold to about 50-fold
higher, about 10-fold or higher, or about 20-fold to about 50-fold
higher, than the amount of c-Myb present in a control T cell or a
population thereof that has not been modified to overexpress c-Myb.
Any suitable method known in the art can be utilized to determine
the amount of c-Myb mRNA, protein, or polypeptide present in a T
cell or a population thereof, such as quantitative reverse
transcription polymerase chain reaction (RT-PCR.) or stem-loop
quantitative RT-PCR.
[0080] The isolated or purified T cell of the invention may have a
less differentiated phenotype as compared to a T cell that has not
been modified to overexpress c-Myb. In an embodiment of the
invention, the isolated or purified T cell invention may have a
naive T cell (T.sub.N), T memory stem cell (T.sub.SCM), or central
memory T cell (T.sub.CM) phenotype. Alternatively or additionally,
the inventive T cell lacks an effector memory T cell (T.sub.EM)
phenotype. For example, CCR7 and CD62L are expressed by T.sub.N,
T.sub.SCM, and T.sub.CM cells, but are not expressed by T.sub.EM
cells. The transcription factors LEF1, FOXP1, and KLF7 are
expressed by T.sub.N, T.sub.SCM, and T.sub.CM cells, but are not
expressed by T.sub.EM cells. CD45RO and KLRG1 are not expressed by
T.sub.N or T.sub.SCM cells, but are expressed by T.sub.EM cells,
Gattinoni et al., Nat. Rev. Cancer, 12: 671-84 (2012). In an
embodiment of the invention, the isolated or purified T cell of the
invention may be any one or more of CD62L.sup.+, KLRG1.sup.-,
LEF1.sup.+, FOXP1.sup.+, and KLF7.sup.+, CCR7.sup.+, CD57.sup.-,
and CD45RO.sup.-. Preferably, the T cell is one or both of CD62L
and KLRG1.sup.-. In an especially preferred embodiment, the T cell
is both CD62L.sup.+ and KLRG1.sup.-. Alternatively or additionally,
T.sub.N, T.sub.SCM, and T.sub.CM cells may be characterized by
longer telomeres as compared to those of T.sub.EM cells.
[0081] The invention also provides a method of treating or
preventing a disease in a mammal. The method comprises
administering to the mammal any of the T cells described herein, or
a population thereof, or a composition comprising any of the T
cells described herein, in an amount effective to treat or prevent
the disease in the mammal. In an embodiment of the invention, the
disease is cancer or a viral disease. The cancer and viral disease
may be any of the cancers and viral diseases described herein with
respect to other aspects of the invention.
[0082] The terms "treat," and "prevent" as well as words stemming
therefrom, as used herein, do not necessarily imply 100% or
complete treatment or prevention. Rather, there are varying degrees
of treatment or prevention of which one of ordinary skill in the
art recognizes as having a potential benefit or therapeutic effect.
In this respect, the inventive methods can provide any amount of
any level of treatment or prevention of cancer or a viral disease
in a patient. Furthermore, the treatment or prevention provided by
the inventive method can include treatment or prevention of one or
more conditions or symptoms of the cancer or viral disease being
treated or prevented. For example, treatment or prevention can
include promoting the regression of a tumor. Also, for purposes
herein, "prevention" can encompass delaying the onset of the cancer
or viral disease, or a symptom or condition thereof.
[0083] The term "mammal" as used herein refers to any mammal,
including, but not limited to, mice, hamsters, rats, rabbits, cats,
dogs, cows, pigs, horses, monkeys, apes, and humans. Preferably,
the mammal is a human.
[0084] In the treatment or prevention of a disease in a mammal, the
T cells that have been modified to overexpress c-Myb can be
transferred into the same mammal from which T cells were obtained.
In other words, the T cell used in the inventive method of treating
or preventing can be an autologous T cell, i.e., can be obtained
from the mammal in which the disease is treated or prevented.
Alternatively, the T cell can he allogenically transferred into
another mammal. Preferably, the T cell is autologous to the mammal
in the inventive method of treating or preventing a disease in the
mammal.
[0085] In the instance that the T cells are autologous to the
mammal, the mammal can be immunologically naive, immunized,
diseased, or in another condition prior to isolation of the T cells
from the mammal, In some instances, it is preferable for the method
to comprise immunizing the mammal with an antigen of the disease
prior to isolating the T cell from the mammal, modifying the
obtained T cell to overexpress c-Myb, and the administering of the
T cell, or a population or composition thereof. As discussed
herein, immunization of the mammal with the antigen of disease will
allow the population of T cells having an endogenous TCR reactive
with the disease-specific antigen to increase in numbers, which
will increase the likelihood that the T cell obtained for being
modified to overexpress c-Myb will have a desired antigen-specific
TCR.
[0086] In accordance with the invention, a mammal with a disease
can be therapeutically immunized with an antigen from, or
associated with, that disease, including immunization via a
vaccine. While not desiring to be bound by any particular theory,
the vaccine or immunogen is provided to enhance the mammal's immune
response to the disease antigen present in or on the infectious
agent or diseased tissue. Such a therapeutic immunization includes,
but is not limited to, the use of recombinant or natural disease
proteins, peptides, or analogs thereof, or modified disease
peptides, or analogs thereof that can be used as a vaccine
therapeutically as part of adoptive immunotherapy. The vaccine or
immunogen, can be a cell, cell lysate (e.g., from cells transfected
with a recombinant expression vector), a recombinant expression
vector, or antigenic protein or polypeptide. Alternatively, the
vaccine, or immunogen, can be a partially or substantially purified
recombinant disease protein, polypeptide, peptide or analog
thereof, or modified proteins, polypeptides, peptides or analogs
thereof. The protein, polypeptide, or peptide may be conjugated
with lipoprotein or administered in liposomal form or with
adjuvant. Preferably, the vaccine comprises one or more of (i) the
disease-antigen for which the antigen-specific receptor of the T
cell of the invention is specific, (ii) an epitope of the antigen,
and (iii) a vector encoding the antigen or the epitope.
[0087] The inventive method of treating or preventing a disease in
a mammal can comprise additional steps. For instance, a variety of
procedures, as discussed below, can be performed on the T cells
prior to, substantially simultaneously with, or after their
isolation from a mammal. Similarly, a variety of procedures can be
performed on the T cells prior to, substantially simultaneously
with, or after modifying the T cells to over-express c-Myb.
[0088] In an embodiment of the invention, the T cells are expanded
in vitro after modifying the T cells to over-express c-Myb, but
prior to the administration to a mammal. Expansion of the numbers
of T cells can be accomplished by any of a number of methods as are
known in the art as described in, for example, U.S. Pat. No.
8,034,334; U.S. Pat. No. 8,383,099; and U.S. Patent Application
Publication No, 2012/0244133. For example, expansion of the numbers
of T cells may be carried out by culturing the T cells with OKT3
antibody, IL-2, and feeder PBMC (e.g., irradiated allogeneic PBMC).
In another embodiment of the invention, the T cells are not
expanded in vitro after modifying the T cells to over-express c-Myb
and prior to the administration to a mammal.
[0089] An embodiment of the invention provides a pharmaceutical
composition comprising the inventive T cell or a population thereof
and a pharmaceutically acceptable carrier. The pharmaceutically
acceptable carriers described herein, for example, vehicles,
adjuvants, excipients, and diluents, are well-known and readily
available to those skilled in the art. Preferably, the
pharmaceutically acceptable carrier is chemically inert to the
active agent(s), e.g., the T cell, and does not elicit any
detrimental side effects or toxicity under the conditions of
use.
[0090] The composition can be formulated for administration by any
suitable route, such as, for example, an administration route
selected from the group consisting of intravenous, intratumoral,
intraarterial, intramuscular, intraperitoneal, intrathecal,
epidural, and subcutaneous administration routes. Preferably, the
composition is formulated for a. parenteral route of
administration. An exemplary pharmaceutically acceptable carrier
for cells for injection may include any isotonic carrier such as,
for example, normal saline (about 0.90% w/v of NaCl in water, about
300 mOsm/L NaCl in water, or about 9.0 g NaCl per liter of water),
NORMOSOL R electrolyte solution (Abbott, Chicago, Ill.),
PLASMA-LYTE A (Baxter, Deerfield, Ill.), about 5% dextrose in
water, or Ringer's lactate. In an embodiment, the pharmaceutically
acceptable carrier is supplemented with human serum albumin.
[0091] For purposes of the invention, the amount or dose of the
inventive population of cells or pharmaceutical composition
administered (e.g., numbers of cells when the inventive population
of cells is administered) should be sufficient to effect, e.g., a
therapeutic or prophylactic response, in the patient over a
reasonable time frame. For example, the dose of the inventive
population of cells or pharmaceutical composition should be
sufficient to treat or prevent cancer or a viral disease in a
period of from about 2 hours or longer, e.g., 12 to 24 or more
hours, from the time of administration. In certain embodiments, the
time period could be even longer. The dose will be determined by
the efficacy of the particular inventive population of cells or
pharmaceutical composition administered and the condition of the
patient, as well as the body weight of the patient to be
treated.
[0092] Many assays for determining an administered dose are known
in the art. For purposes of the invention, an assay, which
comprises comparing the extent to which target cells are lysed upon
administration of a given dose of such T cells to a mammal among a
set of mammals of which is each given a different dose of the
cells, could be used to determine a starting dose to be
administered to a patient. The extent to which target cells are
lysed upon administration of a certain dose can be assayed by
methods known in the art.
[0093] The dose of the inventive population of cells or
pharmaceutical composition also will be determined by the
existence, nature and extent of any adverse side effects that might
accompany the administration of a particular inventive population
of cells or pharmaceutical composition. Typically, the attending
physician will decide the dosage of the population of cells or
pharmaceutical composition with which to treat each individual
patient, taking into consideration a variety of factors, such as
age, body weight, general health, diet, sex, inventive population
of cells or pharmaceutical composition to be administered, route of
administration, and the severity of the condition being
treated.
[0094] Any suitable number of T cells of the invention can be
administered to a mammal. While a single T cell of the invention
theoretically is capable of expanding, and providing a therapeutic
benefit, it is preferable to administer about 10.sup.2 or more,
e.g., about 10.sup.3 or more, about 10.sup.4 or more, about
10.sup.5 or more, about 10.sup.8 or more, T cells of the invention.
Alternatively, or additionally about 10.sup.12 or less, e.g., about
10.sup.11 or less, about 10.sup.9 or less, about 10.sup.7 or less,
or about 10.sup.5 or less, T cells of the invention can be
administered to a mammal. The number of T cells of the invention
can be administered to a mammal in an amount bounded by any two of
the above endpoints, e.g., about 10.sup.2 to about 10.sup.5, about
10.sup.3 to about 10.sup.7, about 10.sup.3 to about 10.sup.9, or
about 10.sup.5 to about 10.sup.10.
[0095] As explained above, the isolated or purified T cell of the
invention that is modified to over-express c-Myb may be less
differentiated as compared to a control T cell that has not been
modified to overexpress c-Myb. Accordingly, an embodiment of the
invention provides a method of inhibiting the differentiation of T
cells.
[0096] The method may comprise isolating or purifying a T cell from
a source. The method may comprise introducing any of the nucleic
acids encoding c-Myb described herein with respect to other aspects
of the invention into the isolated or purified T cells under
conditions sufficient to obtain an increased expression of c-Myb as
compared to T cells that lack the introduced nucleic acid. The T
cells may be isolated from a source, the nucleic acid may be
introduced into the T cells, and the c-Myb expression may be
increased as described herein with respect to other aspects of the
invention.
[0097] The increased expression of c-Myb may inhibit
differentiation of the T cells. In this regard, the T cells
produced by the method may have a less differentiated phenotype as
compared to a T cell that does not contain the introduced nucleic
acid encoding c-Myb. In an embodiment of the invention, the T cell
produced by the inventive method may have a T.sub.N, T.sub.SCM, or
T.sub.CM phenotype. Alternatively or additionally, the T cell
produced by the inventive method may lack a T.sub.EM phenotype. In
an embodiment of the invention, the method may further comprise one
or both of (i) increasing the expression of CD62L by the T cells
and (ii) decreasing the expression of KLRG1 by the T cells.
Preferably, the method comprises both (i) and (ii).
[0098] The following examples further illustrate the invention but,
of course, should not be construed as in any way limiting its
scope.
EXAMPLES
[0099] The following materials and methods were employed in the
experiments described in Examples 1-9.
Mice.
[0100] Myb fl/fl (C57BL/6) mice were a generous gift from Tim
Bender, Va. C57BL/6 mice, Ly5.1 (B6.SJL-PtprcaPepcb/BoyJ) mice and
pme1-1 (B6.Cg-Thy1a/Cy Tg(TcraTerb)8Rest/J) mice were from the
Jackson Laboratory (Bar Harbor, Me.); Cre-ERT2
(B6-Gt(ROSA)26Sortm9(cre/Esr1)Arte) mice were from Taconic (Hudson,
N.Y.). The pme1-1 mice were crossed with myb fl/fl mice and
Cre-ERT2 mice to generate pme1-1 Cre-ERT2 myb fl/fl mice. All mouse
experiments were done with the approval of the National Cancer
institute Animal Use and Care Committee.
Antibodies, Flow Cytometry and Cell Sorting.
[0101] Anti-Ly5.1, anti-Thy-1.1 (OX-7), anti-CD62L (MEL-14),
anti-CD8.alpha. (53-6.7), anti-KLRG-1 (2F1) antibodies were from BD
Biosciences (Franklin Lakes, N.J.). A LSRII flow cytometer (BD
Biosciences) was used for flow cytometry acquisition. Samples were
analyzed with FLOWJO software (TreeStar, Ashland, Oreg.). Naive
CD8.sup.+ T cells were sorted with a FACSARIA cell sorter (BD
Biosciences).
Retroviral Vector Construction and Virus Production.
[0102] A sequence composed of a codon-optimized myb cDNA (SEQ ID
NO: 21) and a sequence encoding Thy-1.1 joined by a sequence
encoding the picornavirus 2A ribosomal skip peptide21 was cloned
into a MSGV-1 vector. The sequence encoded a mouse c-Myb protein
having the amino acid sequence of SEQ ID NO: 4. PLATINUM ECO cell
lines (Cell Biolabs, San Diego, Calif.) were used for
gamma-retroviral production by transfection with DNA plasmids
through the use of LIPOFECTAMINE 2000 reagent (Invitrogen, Waltham,
Mass.). Virus was collected 36 hours (h) after transfection.
In Vitro Activation and Transduction of CD8.sup.+ T Cells.
[0103] CD8.sup.+ T cells were separated from non-CD8.sup.+ T cells
using a MACS negative selection kit (Miltenyi Biotech, Bergisch
Gladbach, Germany). CD8.sup.+ T cells were activated on plates
coated with anti-CD3.epsilon. antibody (2 .mu.g/ml; 145-2C11; BD
Biosciences) and soluble anti-CD28 antibody (1 .mu.g/ml; 37.51; BD
Biosciences) in culture medium containing interleukin (IL)-2 (10
ng/ml; Chiron, Emeryville, Calif.). Virus was `spin-inoculated` at
2,000 g for 2 h at 32.degree. C. onto plates coated with
RETRONECTIN reagent (Takara, Shiga, Japan). CD8+ T cells activated
for 24 h were spun onto plates after aspiration of viral
supernatants.
Adoptive Cell Transfer and Infection
[0104] Adoptive transfer of cells (50.times.10.sup.3 to
100.times.10.sup.3 cells) and infection with recombinant vaccinia
virus or adenovorus virus expressing human gp100 (rFPhgp100;
Therion Biologics, Cambridge, Mass.) were carried out.
[0105] Counting of Adoptively Transferred Cells.
[0106] Mice were sacrificed after infection. Samples were enriched
for CD8.sup.+ T cells, (MACS negative selection kit) and cells were
counted by trypan blue exclusion. The frequency of transferred T
cells was determined by measuring the expression of CD8 and Thy-1.1
Ly5.2 or CFSE (carboxyfluorescein succinimidyl ester) by flow
cytometry. The absolute number of pme1-1 cells was calculated by
multiplying the total cell count by the percentage of
CD8.sup.+Thy-1.1.sup.+ or CD8.sup.+Ly5.2.sup.+.
Example 1
[0107] This example demonstrates that deficiency of c-Myb impairs
accumulation of the numbers of T cells.
[0108] The pme1-1 mouse model is a useful tool for modeling the
treatment of malignant melanoma using adoptive cell therapy
(Overwijk et al., J. Exp. Med., 198(4): 569-80 (2003)). Adoptive
transfer of T cells expressing the anti-gp100 TCR from pme1-1 mice
can effectively mediate the regression of tumors when administered
in combination with a lymphodepleting pretreatment regimen,
cytokine administration, and vaccination (Zeng et al., J. Exp. Med,
201: 139-148 (2005); Gattinoni et al., J. Exp. Med., 202: 907-912
(2005); Klebanoff et al., Proc. Natl. Acad. Sci. USA, 101:1969-1974
(2004)).
[0109] CD8.sup.+ T cells were isolated from pme1-1 wild-type (WT)
mice and Myb knockout mice (Myb.sup.-/-). The isolated T cells were
transduced with a retroviral vector encoding the Thy1.1 antigen in
order to facilitate assessment of the T cells by flow cytometry
upon transfer into host mice. The transduced CD8.sup.+ T cells
(1.times.10.sup.5) were adoptively transferred into WT mice
infected with gp100 vaccinia virus (VV). The expression of Thy1.1
by the pme1-1 CD8.sup.+ T cells was assessed by flow cytometry 0-5
days after infection. The percentage of Thy1.1.sup.+ T cells (after
gating on CD8.sup.+ cells) on Day 3 or Day 5 after infection is
shown in Table 1. The quantification of Thy1.1.sup.+ pme1-1
CD8.sup.+ T cells 3 and 5 days after infection is shown in FIG.
1A.
TABLE-US-00002 TABLE 1 Day 3 Day 5 WT 0.0997 2.12 Myb.sup.-/-
0.0205 0.365
[0110] As shown in Table 1 and FIG. 1A, deficiency of c-Myb impairs
accumulation of the numbers of T cells.
Example 2
[0111] This example demonstrates that deficiency of c-Myb promotes
differentiation.
[0112] Pme1-1 WT CD8.sup.+ T cells or pme1-1 Myb.sup.-/- CD8.sup.+
T cells were transduced and adoptively transferred into infected WT
mice as described in Example 1. The expression of KLRG1 and CD62L
by Thy1.1.sup.+ pme1-1 CD8.sup.+ T cells that were isolated from
spleen was measured by flow cytometry on day 5 after adoptive
transfer.
[0113] The percentages of cells expressing the indicated phenotypes
are shown in Table 2. The percentage of KLRG1.sup.+CD26L.sup.-
cells obtained from four mice is shown in FIG. 1B.
TABLE-US-00003 TABLE 2 WT Myb.sup.-/- KLRG1.sup.+/CD62L.sup.+ 4.64
8.94 KLRG1.sup.-/CD62L.sup.+ 48 34.1 KLRG1.sup.+/CD62L.sup.- 5.09
18.3 KLRG1.sup.-/CD62L.sup.- 42.3 38.7
[0114] As shown in Table 2 and FIG. 1B, deficiency of c-Myb
promotes T cell differentiation.
Example 3
[0115] This example demonstrates that overexpression of c-Myb
enhances T cell proliferation.
[0116] CD8.sup.+ T cells were isolated from pme1-1 WT mice. The
isolated T cells were transduced with a retroviral vector encoding
(i) Thy1.1 only ("Thy1.1") or (ii) Thy1.1 and c-Myb (SEQ ID NO: 4)
("Thy1.1 Myb").
[0117] Thy1.1 CD8.sup.+ T cells (5.times.10.sup.4) or Thy1.1 Myb
CD8.sup.+ T cells (5.times.10.sup.4) were adoptively co-transferred
into WT mice infected with gp100-VV. The expression of Thy1.1 and
Ly5.1 by splenic T cells was measured by flow cytometry (after
gating on CD8.sup.+ T cells) 3-30 days after infection and 5 days
after heterologous infection with gp100 adenovirus (on day 30 of
gp100-VV infection). The percentages of Thy1.1.sup.+ Ly5.1.sup.- or
Thy1.1.sup.+ Ly5.1.sup.+ cells are shown in Table 3 and the
abundance of Thy1.1.sup.+ T cells 0-30 days (FIG. 2A) and 5 days
(FIG. 2B) after heterologous infection with gp100 adenovirus is
shown in FIGS. 2A-2B.
TABLE-US-00004 TABLE 3 Thy1.1.sup.+ Ly5.1.sup.- Thy1.1.sup.+
Ly5.1.sup.+ Day 0 49.5 49.3 Day 3 after infection 0.0185 0.179 Day
5 after infection 0.332 1.23 Day 10 after infection 0.429 0.717 Day
30 after infection 0.0973 0.0836 Day 5 after 1.56 7.25 heterologous
infection with gp100 adenovirus
[0118] As shown in Table 3 and FIGS. 2A and 2B, the overexpression
of c-Myb enhances T cell proliferation.
Example 4
[0119] This example demonstrates that overexpression of c-Myb
produces cells having a central memory phenotype.
[0120] Thy1.1 and Thy1.1 Myb CD8+ T cells were adoptively
co-transferred into infected WT mice as described in Example 3.
KLRG1 and CD62L expression by Thy1.1.sup.+ Ly5.1.sup.- and
Thy1.1.sup.+ Ly5.1.sup.+ CD8.sup.+ T cells from spleen was measured
by flow cytometry. The percentages of cells having the indicated
phenotype after gating on Thy1.1.sup.+ Ly5.1.sup.-CD8.sup.+ cells
or Thy1.1.sup.+ Ly5.1.sup.+ CD8.sup.+ T cells is shown in Table
4.
TABLE-US-00005 TABLE 4 Day 5 Day 30 Day 5 recall Thy1.1 Thy1.1
Thy1.1 Thy1.1 Myb Thy1.1 Myb Thy1.1 Myb KLRG1.sup.+/ 2.3 0.265 0
2.34 1.65 0.478 CD62L.sup.+ KLRG1.sup.-/ 22.5 80.2 28.9 52.3 10.2
39.3 CD62L.sup.+ KLRG1.sup.+/ 9.84 0.309 11.4 0.781 25.8 3.11
CD62L.sup.- KLRG1.sup.-/ 65.6 19.2 59.7 44.5 62.3 57.2
CD62L.sup.-
[0121] The total number of KLRG1.sup.+CD62L.sup.-cells (FIGS. 2C
and 2D) and KLRG1.sup.- CD62L.sup.+ cells (FIGS. 2E and 2F) from
spleen 3-5 days post transfer and 5 days after heterologous
infection with gp100 adenovirus (after gating on Thy1.1.sup.+
Ly5.1.sup.-CD8.sup.+ cells or Thy1.1.sup.+ Ly5.1.sup.+ CD8.sup.+ T
cells) is shown in FIGS. 2C-2F.
[0122] The percentage of Thy1.1.sup.+ cells from the lymph nodes
and lungs after adoptive transfer of Thy1.1 CD8.sup.+ or Thy1.1 Myb
CD8.sup.+ T cells into WT mice infected with gp100-VV 5-30 days
post-transfer is shown in FIGS. 3A-3B and 3E-F.
[0123] The percentage of KLRG1-CD62L.sup.+Thy1.1.sup.+ cells and
KLRG1.sup.+CD62L.sup.-cells from lymph node and lungs 5-30 days or
5 days post transfer of Thy1.1 CD8.sup.+ or Thy1.1 Myb CD8.sup.+ T
cells into WT mice infected with gp100-VV is shown in FIGS. 3C-3D
and 4A-4B.
[0124] As shown in Table 4, FIGS. 2C-2F, FIGS. 3A-3F, and 4A-4B,
overexpression of c-Myb produces T cells having a central memory
phenotype.
Example 5
[0125] This example demonstrates that c-Myb deficiency promotes
proliferation and apoptosis.
[0126] Thy1.1.sup.+ pme1-1.sup.+ WT or Thy1.1.sup.+ pme1-1.sup.+
c-Myb.sup.-/-CD8.sup.+ T cells (1.times.10.sup.5 cells) were
transferred into WT mice infected with gp-100VV. Three days after
infection, the frequency of bromodeoxyuridine (BrdU)-positive cells
was measured by flow cytometry. The results are shown in Table 5
and FIG. 5A. The numbers in Table 5 indicate the frequencies of
BrdU-positive cells after gating on Thy1.1+ CD8+ T cells.
TABLE-US-00006 TABLE 5 WT c-Myb.sup.-/- 33 45
[0127] After cell transfer and 3 days after infection as described
in this Example, the frequencies of Annexin V +/-, Propidium iodide
+/- and Annexin V and Propidium iodide double positive WT and
c-Myb.sup.-/- CD8.sup.+ T cells were also measured by flow
cytometry after gating on Thy1.1.sup.+ CD8.sup.+ T cells. The
results are shown in Table 6.
TABLE-US-00007 TABLE 6 WT Myb.sup.-/- Annexin V.sup.+Propidium
iodide.sup.+ 0.698 4.16 Annexin V.sup.-Propidium iodide.sup.- 93.2
79.9 Annexin V.sup.+Propidium iodide.sup.- 4.47 13 Annexin
V.sup.-Propidium iodide.sup.+ 1.68 2.89
[0128] As shown in Tables 5-6 and FIGS. 5A-5B, c-Myb deficiency
promotes proliferation and apoptosis.
Example 6
[0129] This example demonstrates that c-Myb deficiency
downregulates Tcf7 expression and upregulates Zeb2 expression.
[0130] Thy1.1.sup.+ pme1-1.sup.+ WT or Thy1.1.sup.+ pme1-1.sup.+
c-Myb.sup.-/- CD8.sup.+ T cells (3.times.10.sup.5 cells) were
injected into WT mice infected with gp-100VV. RNA sequencing data
were obtained from CD62L.sup.-KLRG1.sup.- Thy1.1.sup.+ WT and
CD62L.sup.-KLRG1.sup.- Thy1.1.sup.+ Pme1 c-Myb.sup.-/- CD8.sup.+ T
cells on day 5 post transfer. The results showed that c-Myb
deficiency downregulates Tcf7 expression and upregulates Zeb2
expression.
Example 7
[0131] This example demonstrates that c-Myb binds to the Tcf7 and
Zeb2 promoters activating Tcf7 expression and repressing Zeb2
expression.
[0132] Thy1.1.sup.+ pme1-1.sup.+ WT or Thy1.1.sup.+ pme1-1.sup.+
c-Myb.sup.-/- CD8.sup.+ T cells were transferred into WT mice
infected with gp-100VV. A quantitative RT-PCR analysis was carried
out before or 5 days after adoptive transfer to measure the
expression of Zeb2 and Tcf7 mRNA in WT or c-Myb.sup.-/- naive,
CD62L.sup.+ and CD62L.sup.- populations of pme1-1 CD8.sup.+ T
cells. The results are shown in FIGS. 6A-6B.
[0133] A quantitative RT-PCR analysis of the expression of Zeb2 and
Tcf7 mRNA by CD62L.sup.- populations of Thy1.1.sup.+ or Thy1.1
c-Myb overexpressing CD8.sup.+ T cells 5 days after adoptive
transfer into recipient wild-type mice infected with gp100-VV. The
results are shown in FIGS. 6C-6D.
[0134] The expression pattern of c-Myb, Tcf7 and Zeb2 in naive,
CD62L.sup.+ and CD62L.sup.- populations was measured on Day 5 after
adoptive transfer into recipient wild-type mice infected with
gp100-VV. The results are shown in FIGS. 6E-6F.
[0135] A quantitative RT-PCR analysis of the promoter regions of
Tcf7 and Zeb2 in chromatin immunoprecipitation with anti-IgG or
anti-c-Myb was carried out. The results are shown in FIG. 6G.
[0136] As shown in FIGS. 6A-6G, c-Myb binds to the Tcf7 and Zeb2
promoters activating Tcf7 expression and repressing Zeb2
expression.
Example 8
[0137] This example demonstrates that Tcf7 protein expression is
downregulated in c-Myb deficient CD8+ T cells and that enforced
c-Myb expression enhances Tcf7 promoter activity in the reporter
assay.
[0138] Thy1.1.sup.+ pme1-1.sup.+WT or Thy1.1.sup.+ pme1-1.sup.+
c-Myb.sup.-/- CD8.sup.+ T cells were transferred into WT mice
infected with gp-100VV. The expression of Tcf7 in WT and
c-Myb.sup.-/- CD62L.sup.+, CD62L.sup.- and KLRG1.sup.+ populations
of pme1-1 CD8.sup.+ T cells from lungs was measured by flow
cytometry 5 days after adoptive transfer. The percentages of cells
expressing one or both of KLRG1 and CD62L are shown in. Table
7.
TABLE-US-00008 TABLE 7 WT Myb.sup.-/- KLRG1.sup.+CD62L.sup.+ 8.44 8
KLRG1.sup.-CD62L.sup.- 27.6 24.7 KLRG1.sup.+CD62L.sup.- 53.4 61
KLRG1.sup.-CD62L.sup.+ 10.6 6.33
[0139] The expression level of Tcf7 protein was measured by flow
cytometry and compared between KLRG1+CD62L-, KLRG1-CD62L- and
KLRG1-CD62L subpopulations of WT and c-Myb.sup.-/- cells. The
results showed that TCF-7 protein expression is higher in WT as
compared to c-Myb.sup.-/- cells.
[0140] GFP expression by Thy1.1 or Thy1.1 c-Myb CD8.sup.+ T cells
from Tcf7 GFP reporter mice (p45 mice) on day 3 after in vitro
activation of naive cells and day 2 after retroviral transduction
of either Thy1.1 or Thy1.1 c-Myb was measured by flow cytometry.
The results are shown in FIG. 7.
[0141] Taken together, these data show that Tcf7 protein expression
is downregulated in c-Myb deficient CD8+ T cells and enforced c-Myb
expression enhances Tcf7 promoter activity in the reporter
assay.
Example 9
[0142] This example demonstrates that enforced Tcf7 expression
rescues the phenotype of c-Myb deficient cells.
[0143] WT and c-Myb.sup.-/- CD8.sup.+ T cells were transduced
either with an empty vector MIG (control) or Tcf7 MIG 5 and
transferred into recipient wild-type mice infected with gp100-VV.
Five days after transfer, KLRG1 and CD62L expression by WT and
c-Myb.sup.-/- CD8.sup.+ T cells was measured by flow cytometry. The
percentages of cells expressing one or both of KLRG1 and CD62L are
shown in Table 8. The percentage of KLRG1.sup.hi CD8 T cells is
shown in FIG. 8.
TABLE-US-00009 TABLE 8 WT MIG c-Myb.sup.-/- MIG c-Myb.sup.-/- Tcf7
MIG KLRG1.sup.+CD62L.sup.+ 0.799 0.97 1 KLRG1.sup.-CD62L.sup.- 64
31.5 55.9 KLRG1.sup.+CD62L.sup.- 24.3 65.7 37.9
KLRG1.sup.-CD62L.sup.+ 10.9 1.8 5.25
[0144] All references, including publications, patent applications,
and patents, cited herein are hereby incorporated by reference to
the same extent as if each reference were individually and
specifically indicated to be incorporated by reference and were set
forth in its entirety herein.
[0145] The use of the terms "a" and "an" and "the" and "at least
one" and similar referents in the context of describing the
invention (especially in the context of the following claims) are
to be construed to cover both the singular and the plural, unless
otherwise indicated herein or clearly contradicted by context. The
use of the term "at least one" followed by a list of one or more
items (for example, "at least one of A and B") is to be construed
to mean one item selected from the listed items (A or B) or any
combination of two or more of the listed items (A and B), unless
otherwise indicated herein or clearly contradicted by context. The
terms "comprising," "having," "including," and "containing" are to
be construed as open-ended terms (i.e., meaning "including, but not
limited to,") unless otherwise noted. Recitation of ranges of
values herein are merely intended to serve as a shorthand method of
referring individually to each separate value falling within the
range, unless otherwise indicated herein, and each separate value
is incorporated into the specification as if it were individually
recited herein. All methods described herein can be performed in
any suitable order unless otherwise indicated herein or otherwise
clearly contradicted by context. The use of any and all examples,
or exemplary language (e.g., "such as") provided herein, is
intended merely to better illuminate the invention and does not
pose a limitation on the scope of the invention unless otherwise
claimed. No language in the specification should be construed as
indicating any non-claimed element as essential to the practice of
the invention.
[0146] Preferred embodiments of this invention are described
herein, including the best mode known to the inventors for carrying
out the invention. Variations of those preferred embodiments may
become apparent to those of ordinary skill in the art upon reading
the foregoing description. The inventors expect skilled artisans to
employ such variations as appropriate, and the inventors intend for
the invention to be practiced otherwise than as specifically
described herein. Accordingly, this invention includes all
modifications and equivalents of the subject matter recited in the
claims appended hereto as permitted by applicable law. Moreover,
any combination of the above-described elements in all possible
variations thereof is encompassed by the invention unless otherwise
indicated herein or otherwise clearly contradicted by context.
Sequence CWU 1
1
2113777DNAMus musculus 1gggagtgtcc aaacctcttt gtttgatggc atctgtttac
agagttacac tttaatatca 60acctgtttcc tcctcctcct tctcctcctt ctcctcctcc
tcctcggtga cctccttctc 120ctcccctttc tccggagaaa cttcgccccg
gcggtgcgga gcgccgctgc gcagccgggg 180gaggacgcag gcaaggcgga
gggcagcggg aggcggcaac cggtgcggtc cccggggctc 240ttggcggagc
cccggcccgc ctcgccatgg cccggagacc ccgacacagc atctacagta
300gcgatgaaga tgatgaagac attgagatgt gtgaccatga ctacgatggg
ctgctgccca 360aatctggaaa gcgtcacttg gggaaaacta ggtggacaag
ggaagaggat gagaagctga 420agaagctggt ggaacagaac ggaacagacg
actggaaagt cattgccaat tatctgccca 480accggacaga tgtgcagtgc
caacaccggt ggcagaaagt gctgaaccct gaactcatca 540aaggtccctg
gaccaaagaa gaagatcaga gagtcataga gcttgtccag aaatatggtc
600cgaagcgttg gtctgttatt gccaagcact taaaagggag aattggaaag
cagtgtcggg 660agaggtggca caaccatttg aatccagaag ttaagaaaac
ctcctggaca gaagaggagg 720acagaatcat ttaccaggca cacaagcgtc
tggggaacag atgggcagag atcgcaaagc 780tgctgcccgg acggactgat
aatgctatca agaaccactg gaattccacc atgcgtcgca 840aggtggaaca
ggaaggctac ctgcaggagc cttccaaagc cagccagacg ccagtggcca
900cgagcttcca gaagaacaat catttgatgg ggtttgggca tgcctcacct
ccatctcagc 960tctctccaag tggccagtcc tccgtcaaca gcgaatatcc
ctattaccac atcgccgaag 1020cacaaaacat ctccagtcac gttccctatc
ctgtcgcatt gcatgttaat atagtcaacg 1080tccctcagcc ggctgcggca
gccatccaga gacactataa cgacgaagac cctgagaagg 1140aaaagcgaat
aaaggagctg gagttgctcc tgatgtcaac agagaacgag ctgaagggac
1200agcaggcatt accaacacag aaccacactt gcagctaccc cgggtggcac
agcacctcca 1260ttgtggacca gaccagacct catggggata gtgcacctgt
ttcctgtttg ggagaacacc 1320atgccacccc atctctgcct gcagatcccg
gctccctacc tgaagaaagt gcctcaccag 1380caaggtgcat gatcgtccac
cagggcacca ttctggacaa tgttaagaac ctcttagaat 1440ttgcagaaac
actccagttt atagattctg attcttcgtg gtgtgatctc agcagttttg
1500aattctctga agaagcggca gctttttcac ctagccagca gcccacaggc
aaagccttcc 1560agcttcagca aagagagggc catgggacta gatctgcagg
agagcctagc ctgagggtga 1620ccaggcgagt gctgagcgag gcatccctcg
gcccagactc accccaagcg aggcacagca 1680aggttccgct ggtcgtccta
cgaaaaaggc ggggccaggc cagcccccta gccgctggag 1740agcctagccc
ctccctcttt gctgacgtca tcagctcaac tctcaagcgt tcccctgtca
1800aaagcctacc cttctctccc tcgcagttct tgaacacttc cagcaaccat
gaaagctcgg 1860gcttagatgc acctacctta ccctccactc ctctcattgg
tcacaaactg acaccatgtc 1920gagaccagac tgtgaaaacc cagaaggaaa
attccatctt tagaactcca gctatcaaaa 1980ggtcaatcct cgaaagctct
cctcgaactc ccacaccatt caaacatgcc cttgcagctc 2040aagaaattaa
atacggtccc ctgaagatgc tacctcagac cccctcccat gcagtggagg
2100acctacaaga tgtgattaag caggaatcgg atgaatctgg aattgtggct
gagtttcaag 2160agagtggacc accgttactg aaaaaaatca agcaggaggt
ggagtcgcca actgagaaat 2220cgggaaactt cttctgctca aaccactggg
cagagaacag cctgagcacc cagctgttct 2280cgcaggcgtc tcctgtggca
gatgccccaa atattcttac aagctctgtt ttaatgacac 2340ctgtatcaga
agatgaagac aatgtcctca aagcctttac cgtacctaag aacaggcccc
2400tggtgggtcc cttgcagcca tgcagtggtg cctgggagcc agcatcctgt
gggaagacag 2460aggaccagat gacggcctcc ggtccggctc ggaaatacgt
gaacgcgttc tcagctcgaa 2520ctctggtcat gtgagacatt tccagaaaag
cattatggtt ttcagaacac ttaaaagttg 2580actttcgaca catggctcct
cagcgtggag cgctccatgg ctgagagaag agcctgattt 2640tgttgtggta
caacagttga gagcagcacc aagtgcattt ttagttgctt gagatctcac
2700ttgatttcac acaactaaaa aggatttttt tttttaaaaa taataataat
gaataacagt 2760cttacctaaa ttattaggta atgaattgtg accatttgtt
aatatcataa tcagattttt 2820taaaaaaaat aaaatgattt atttgtattt
tagaggatac aacagatcag tatttttgac 2880tgtggtgaat ttaaaaaaaa
aatttacaca aagaaatatc ccagtattcc atgtatctca 2940gtcactaaac
atacacagag agatttttaa aaaccaggag aagcattatt ttgaatgtta
3000gctaaatccc aagtaatact taatgcaacc ctctaggagc tcatttgtgg
ctaataatct 3060tggaaatatc tttattatac taaaccattt catgaggaga
attttgttgt cagcttgctt 3120gaaaagttat tactgtatga aatagtttta
ttgaaaaaat tatattttta ttcagtaatt 3180taattttgta aatgccaaat
ggagaaatgt gttcgctgct atggttttag cctgtagtca 3240tgctgctagc
tagtgtcagg gggcaataga gcttagatgg aaaaaagaga aagagactcg
3300gtgttagata acggactatg cactagtatt ccagactttt ttatttttat
atatatgtac 3360cttttccttt tgtaattgga aaacttattt gggagaattt
tgcatttgtt gtacattttt 3420gttttttagg attttttttt tttgttgtta
ttgtcgattt ataaaagcat tgcacttctt 3480tttctttttt tgggagattt
gtgttgttta tgtcatatgt tttgttttga gttcagcctg 3540aatgttcatc
cgtttgggcg tttttctgac ttggaagaac attctctgta ggtttctaag
3600tgtacagagc cggaactgcc tcgtggttcc tgggcttcag ggaagacaaa
tatggaagtc 3660aacagccagt ttctgccttg agagcatttg caagaatgct
ggccttgaat tctgaaatga 3720cagtgtatct actgccttgt agcaaaataa
agctatcctc ttattttaca tacttcc 377723420DNAMus musculus 2gggagtgtcc
aaacctcttt gtttgatggc atctgtttac agagttacac tttaatatca 60acctgtttcc
tcctcctcct tctcctcctt ctcctcctcc tcctcggtga cctccttctc
120ctcccctttc tccggagaaa cttcgccccg gcggtgcgga gcgccgctgc
gcagccgggg 180gaggacgcag gcaaggcgga gggcagcggg aggcggcaac
cggtgcggtc cccggggctc 240ttggcggagc cccggcccgc ctcgccatgg
cccggagacc ccgacacagc atctacagta 300gcgatgaaga tgatgaagac
attgagatgt gtgaccatga ctacgatggg ctgctgccca 360aatctggaaa
gcgtcacttg gggaaaacta ggtggacaag ggaagaggat gagaagctga
420agaagctggt ggaacagaac ggaacagacg actggaaagt cattgccaat
tatctgccca 480accggacaga tgtgcagtgc caacaccggt ggcagaaagt
gctgaaccct gaactcatca 540aaggtccctg gaccaaagaa gaagatcaga
gagtcataga gcttgtccag aaatatggtc 600cgaagcgttg gtctgttatt
gccaagcact taaaagggag aattggaaag cagtgtcggg 660agaggtggca
caaccatttg aatccagaag ttaagaaaac ctcctggaca gaagaggagg
720acagaatcat ttaccaggca cacaagcgtc tggggaacag atgggcagag
atcgcaaagc 780tgctgcccgg acggactgat aatgctatca agaaccactg
gaattccacc atgcgtcgca 840aggtggaaca ggaaggctac ctgcaggagc
cttccaaagc cagccagacg ccagtggcca 900cgagcttcca gaagaacaat
catttgatgg ggtttgggca tgcctcacct ccatctcagc 960tctctccaag
tggccagtcc tccgtcaaca gcgaatatcc ctattaccac atcgccgaag
1020cacaaaacat ctccagtcac gttccctatc ctgtcgcatt gcatgttaat
atagtcaacg 1080tccctcagcc ggctgcggca gccatccaga gacactataa
cgacgaagac cctgagaagg 1140aaaagcgaat aaaggagctg gagttgctcc
tgatgtcaac agagaacgag ctgaagggac 1200agcaggcatt accaacacag
aaccacactt gcagctaccc cgggtggcac agcacctcca 1260ttgtggacca
gaccagacct catggggata gtgcacctgt ttcctgtttg ggagaacacc
1320atgccacccc atctctgcct gcagatcccg gctccctacc tgaagaaagt
gcctcaccag 1380caaggtgcat gatcgtccac cagggcacca ttctggacaa
tgttaagaac ctcttagaat 1440ttgcagaaac actccagttt atagattctt
tcttgaacac ttccagcaac catgaaagct 1500cgggcttaga tgcacctacc
ttaccctcca ctcctctcat tggtcacaaa ctgacaccat 1560gtcgagacca
gactgtgaaa acccagaagg aaaattccat ctttagaact ccagctatca
1620aaaggtcaat cctcgaaagc tctcctcgaa ctcccacacc attcaaacat
gcccttgcag 1680ctcaagaaat taaatacggt cccctgaaga tgctacctca
gaccccctcc catgcagtgg 1740aggacctaca agatgtgatt aagcaggaat
cggatgaatc tggaattgtg gctgagtttc 1800aagagagtgg accaccgtta
ctgaaaaaaa tcaagcagga ggtggagtcg ccaactgaga 1860aatcgggaaa
cttcttctgc tcaaaccact gggcagagaa cagcctgagc acccagctgt
1920tctcgcaggc gtctcctgtg gcagatgccc caaatattct tacaagctct
gttttaatga 1980cacctgtatc agaagatgaa gacaatgtcc tcaaagcctt
taccgtacct aagaacaggc 2040ccctggtggg tcccttgcag ccatgcagtg
gtgcctggga gccagcatcc tgtgggaaga 2100cagaggacca gatgacggcc
tccggtccgg ctcggaaata cgtgaacgcg ttctcagctc 2160gaactctggt
catgtgagac atttccagaa aagcattatg gttttcagaa cacttaaaag
2220ttgactttcg acacatggct cctcagcgtg gagcgctcca tggctgagag
aagagcctga 2280ttttgttgtg gtacaacagt tgagagcagc accaagtgca
tttttagttg cttgagatct 2340cacttgattt cacacaacta aaaaggattt
ttttttttaa aaataataat aatgaataac 2400agtcttacct aaattattag
gtaatgaatt gtgaccattt gttaatatca taatcagatt 2460ttttaaaaaa
aataaaatga tttatttgta ttttagagga tacaacagat cagtattttt
2520gactgtggtg aatttaaaaa aaaaatttac acaaagaaat atcccagtat
tccatgtatc 2580tcagtcacta aacatacaca gagagatttt taaaaaccag
gagaagcatt attttgaatg 2640ttagctaaat cccaagtaat acttaatgca
accctctagg agctcatttg tggctaataa 2700tcttggaaat atctttatta
tactaaacca tttcatgagg agaattttgt tgtcagcttg 2760cttgaaaagt
tattactgta tgaaatagtt ttattgaaaa aattatattt ttattcagta
2820atttaatttt gtaaatgcca aatggagaaa tgtgttcgct gctatggttt
tagcctgtag 2880tcatgctgct agctagtgtc agggggcaat agagcttaga
tggaaaaaag agaaagagac 2940tcggtgttag ataacggact atgcactagt
attccagact tttttatttt tatatatatg 3000taccttttcc ttttgtaatt
ggaaaactta tttgggagaa ttttgcattt gttgtacatt 3060tttgtttttt
aggatttttt ttttttgttg ttattgtcga tttataaaag cattgcactt
3120ctttttcttt ttttgggaga tttgtgttgt ttatgtcata tgttttgttt
tgagttcagc 3180ctgaatgttc atccgtttgg gcgtttttct gacttggaag
aacattctct gtaggtttct 3240aagtgtacag agccggaact gcctcgtggt
tcctgggctt cagggaagac aaatatggaa 3300gtcaacagcc agtttctgcc
ttgagagcat ttgcaagaat gctggccttg aattctgaaa 3360tgacagtgta
tctactgcct tgtagcaaaa taaagctatc ctcttatttt acatacttcc
34203755PRTMus musculus 3Met Ala Arg Arg Pro Arg His Ser Ile Tyr
Ser Ser Asp Glu Asp Asp 1 5 10 15 Glu Asp Ile Glu Met Cys Asp His
Asp Tyr Asp Gly Leu Leu Pro Lys 20 25 30 Ser Gly Lys Arg His Leu
Gly Lys Thr Arg Trp Thr Arg Glu Glu Asp 35 40 45 Glu Lys Leu Lys
Lys Leu Val Glu Gln Asn Gly Thr Asp Asp Trp Lys 50 55 60 Val Ile
Ala Asn Tyr Leu Pro Asn Arg Thr Asp Val Gln Cys Gln His 65 70 75 80
Arg Trp Gln Lys Val Leu Asn Pro Glu Leu Ile Lys Gly Pro Trp Thr 85
90 95 Lys Glu Glu Asp Gln Arg Val Ile Glu Leu Val Gln Lys Tyr Gly
Pro 100 105 110 Lys Arg Trp Ser Val Ile Ala Lys His Leu Lys Gly Arg
Ile Gly Lys 115 120 125 Gln Cys Arg Glu Arg Trp His Asn His Leu Asn
Pro Glu Val Lys Lys 130 135 140 Thr Ser Trp Thr Glu Glu Glu Asp Arg
Ile Ile Tyr Gln Ala His Lys 145 150 155 160 Arg Leu Gly Asn Arg Trp
Ala Glu Ile Ala Lys Leu Leu Pro Gly Arg 165 170 175 Thr Asp Asn Ala
Ile Lys Asn His Trp Asn Ser Thr Met Arg Arg Lys 180 185 190 Val Glu
Gln Glu Gly Tyr Leu Gln Glu Pro Ser Lys Ala Ser Gln Thr 195 200 205
Pro Val Ala Thr Ser Phe Gln Lys Asn Asn His Leu Met Gly Phe Gly 210
215 220 His Ala Ser Pro Pro Ser Gln Leu Ser Pro Ser Gly Gln Ser Ser
Val 225 230 235 240 Asn Ser Glu Tyr Pro Tyr Tyr His Ile Ala Glu Ala
Gln Asn Ile Ser 245 250 255 Ser His Val Pro Tyr Pro Val Ala Leu His
Val Asn Ile Val Asn Val 260 265 270 Pro Gln Pro Ala Ala Ala Ala Ile
Gln Arg His Tyr Asn Asp Glu Asp 275 280 285 Pro Glu Lys Glu Lys Arg
Ile Lys Glu Leu Glu Leu Leu Leu Met Ser 290 295 300 Thr Glu Asn Glu
Leu Lys Gly Gln Gln Ala Leu Pro Thr Gln Asn His 305 310 315 320 Thr
Cys Ser Tyr Pro Gly Trp His Ser Thr Ser Ile Val Asp Gln Thr 325 330
335 Arg Pro His Gly Asp Ser Ala Pro Val Ser Cys Leu Gly Glu His His
340 345 350 Ala Thr Pro Ser Leu Pro Ala Asp Pro Gly Ser Leu Pro Glu
Glu Ser 355 360 365 Ala Ser Pro Ala Arg Cys Met Ile Val His Gln Gly
Thr Ile Leu Asp 370 375 380 Asn Val Lys Asn Leu Leu Glu Phe Ala Glu
Thr Leu Gln Phe Ile Asp 385 390 395 400 Ser Asp Ser Ser Trp Cys Asp
Leu Ser Ser Phe Glu Phe Ser Glu Glu 405 410 415 Ala Ala Ala Phe Ser
Pro Ser Gln Gln Pro Thr Gly Lys Ala Phe Gln 420 425 430 Leu Gln Gln
Arg Glu Gly His Gly Thr Arg Ser Ala Gly Glu Pro Ser 435 440 445 Leu
Arg Val Thr Arg Arg Val Leu Ser Glu Ala Ser Leu Gly Pro Asp 450 455
460 Ser Pro Gln Ala Arg His Ser Lys Val Pro Leu Val Val Leu Arg Lys
465 470 475 480 Arg Arg Gly Gln Ala Ser Pro Leu Ala Ala Gly Glu Pro
Ser Pro Ser 485 490 495 Leu Phe Ala Asp Val Ile Ser Ser Thr Leu Lys
Arg Ser Pro Val Lys 500 505 510 Ser Leu Pro Phe Ser Pro Ser Gln Phe
Leu Asn Thr Ser Ser Asn His 515 520 525 Glu Ser Ser Gly Leu Asp Ala
Pro Thr Leu Pro Ser Thr Pro Leu Ile 530 535 540 Gly His Lys Leu Thr
Pro Cys Arg Asp Gln Thr Val Lys Thr Gln Lys 545 550 555 560 Glu Asn
Ser Ile Phe Arg Thr Pro Ala Ile Lys Arg Ser Ile Leu Glu 565 570 575
Ser Ser Pro Arg Thr Pro Thr Pro Phe Lys His Ala Leu Ala Ala Gln 580
585 590 Glu Ile Lys Tyr Gly Pro Leu Lys Met Leu Pro Gln Thr Pro Ser
His 595 600 605 Ala Val Glu Asp Leu Gln Asp Val Ile Lys Gln Glu Ser
Asp Glu Ser 610 615 620 Gly Ile Val Ala Glu Phe Gln Glu Ser Gly Pro
Pro Leu Leu Lys Lys 625 630 635 640 Ile Lys Gln Glu Val Glu Ser Pro
Thr Glu Lys Ser Gly Asn Phe Phe 645 650 655 Cys Ser Asn His Trp Ala
Glu Asn Ser Leu Ser Thr Gln Leu Phe Ser 660 665 670 Gln Ala Ser Pro
Val Ala Asp Ala Pro Asn Ile Leu Thr Ser Ser Val 675 680 685 Leu Met
Thr Pro Val Ser Glu Asp Glu Asp Asn Val Leu Lys Ala Phe 690 695 700
Thr Val Pro Lys Asn Arg Pro Leu Val Gly Pro Leu Gln Pro Cys Ser 705
710 715 720 Gly Ala Trp Glu Pro Ala Ser Cys Gly Lys Thr Glu Asp Gln
Met Thr 725 730 735 Ala Ser Gly Pro Ala Arg Lys Tyr Val Asn Ala Phe
Ser Ala Arg Thr 740 745 750 Leu Val Met 755 4 636PRTMus musculus
4Met Ala Arg Arg Pro Arg His Ser Ile Tyr Ser Ser Asp Glu Asp Asp 1
5 10 15 Glu Asp Ile Glu Met Cys Asp His Asp Tyr Asp Gly Leu Leu Pro
Lys 20 25 30 Ser Gly Lys Arg His Leu Gly Lys Thr Arg Trp Thr Arg
Glu Glu Asp 35 40 45 Glu Lys Leu Lys Lys Leu Val Glu Gln Asn Gly
Thr Asp Asp Trp Lys 50 55 60 Val Ile Ala Asn Tyr Leu Pro Asn Arg
Thr Asp Val Gln Cys Gln His 65 70 75 80 Arg Trp Gln Lys Val Leu Asn
Pro Glu Leu Ile Lys Gly Pro Trp Thr 85 90 95 Lys Glu Glu Asp Gln
Arg Val Ile Glu Leu Val Gln Lys Tyr Gly Pro 100 105 110 Lys Arg Trp
Ser Val Ile Ala Lys His Leu Lys Gly Arg Ile Gly Lys 115 120 125 Gln
Cys Arg Glu Arg Trp His Asn His Leu Asn Pro Glu Val Lys Lys 130 135
140 Thr Ser Trp Thr Glu Glu Glu Asp Arg Ile Ile Tyr Gln Ala His Lys
145 150 155 160 Arg Leu Gly Asn Arg Trp Ala Glu Ile Ala Lys Leu Leu
Pro Gly Arg 165 170 175 Thr Asp Asn Ala Ile Lys Asn His Trp Asn Ser
Thr Met Arg Arg Lys 180 185 190 Val Glu Gln Glu Gly Tyr Leu Gln Glu
Pro Ser Lys Ala Ser Gln Thr 195 200 205 Pro Val Ala Thr Ser Phe Gln
Lys Asn Asn His Leu Met Gly Phe Gly 210 215 220 His Ala Ser Pro Pro
Ser Gln Leu Ser Pro Ser Gly Gln Ser Ser Val 225 230 235 240 Asn Ser
Glu Tyr Pro Tyr Tyr His Ile Ala Glu Ala Gln Asn Ile Ser 245 250 255
Ser His Val Pro Tyr Pro Val Ala Leu His Val Asn Ile Val Asn Val 260
265 270 Pro Gln Pro Ala Ala Ala Ala Ile Gln Arg His Tyr Asn Asp Glu
Asp 275 280 285 Pro Glu Lys Glu Lys Arg Ile Lys Glu Leu Glu Leu Leu
Leu Met Ser 290 295 300 Thr Glu Asn Glu Leu Lys Gly Gln Gln Ala Leu
Pro Thr Gln Asn His 305 310 315 320 Thr Cys Ser Tyr Pro Gly Trp His
Ser Thr Ser Ile Val Asp Gln Thr 325 330 335 Arg Pro His Gly Asp Ser
Ala Pro Val Ser Cys Leu Gly Glu His His 340 345 350 Ala Thr Pro Ser
Leu Pro Ala Asp Pro Gly Ser Leu Pro Glu Glu Ser 355 360 365 Ala Ser
Pro Ala Arg Cys Met Ile Val His Gln Gly Thr Ile Leu Asp 370 375 380
Asn Val Lys Asn Leu Leu Glu Phe Ala Glu Thr Leu Gln Phe Ile Asp 385
390 395 400 Ser Phe Leu Asn Thr Ser Ser Asn His Glu Ser Ser Gly Leu
Asp Ala 405 410 415 Pro Thr Leu Pro Ser Thr Pro Leu Ile Gly His Lys
Leu Thr Pro Cys 420 425 430 Arg Asp Gln Thr Val Lys Thr Gln
Lys Glu Asn Ser Ile Phe Arg Thr 435 440 445 Pro Ala Ile Lys Arg Ser
Ile Leu Glu Ser Ser Pro Arg Thr Pro Thr 450 455 460 Pro Phe Lys His
Ala Leu Ala Ala Gln Glu Ile Lys Tyr Gly Pro Leu 465 470 475 480 Lys
Met Leu Pro Gln Thr Pro Ser His Ala Val Glu Asp Leu Gln Asp 485 490
495 Val Ile Lys Gln Glu Ser Asp Glu Ser Gly Ile Val Ala Glu Phe Gln
500 505 510 Glu Ser Gly Pro Pro Leu Leu Lys Lys Ile Lys Gln Glu Val
Glu Ser 515 520 525 Pro Thr Glu Lys Ser Gly Asn Phe Phe Cys Ser Asn
His Trp Ala Glu 530 535 540 Asn Ser Leu Ser Thr Gln Leu Phe Ser Gln
Ala Ser Pro Val Ala Asp 545 550 555 560 Ala Pro Asn Ile Leu Thr Ser
Ser Val Leu Met Thr Pro Val Ser Glu 565 570 575 Asp Glu Asp Asn Val
Leu Lys Ala Phe Thr Val Pro Lys Asn Arg Pro 580 585 590 Leu Val Gly
Pro Leu Gln Pro Cys Ser Gly Ala Trp Glu Pro Ala Ser 595 600 605 Cys
Gly Lys Thr Glu Asp Gln Met Thr Ala Ser Gly Pro Ala Arg Lys 610 615
620 Tyr Val Asn Ala Phe Ser Ala Arg Thr Leu Val Met 625 630 635
53681DNAHomo sapiens 5aatatcaacc tgtttcctcc tcctccttct cctcctcctc
cgtgacctcc tcctcctctt 60tctcctgaga aacttcgccc cagcggtgcg gagcgccgct
gcgcagccgg ggagggacgc 120aggcaggcgg cgggcagcgg gaggcggcag
cccggtgcgg tccccgcggc tctcggcgga 180gccccgcgcc cgccgcgcca
tggcccgaag accccggcac agcatatata gcagtgacga 240ggatgatgag
gactttgaga tgtgtgacca tgactatgat gggctgcttc ccaagtctgg
300aaagcgtcac ttggggaaaa caaggtggac ccgggaagag gatgaaaaac
tgaagaagct 360ggtggaacag aatggaacag atgactggaa agttattgcc
aattatctcc cgaatcgaac 420agatgtgcag tgccagcacc gatggcagaa
agtactaaac cctgagctca tcaagggtcc 480ttggaccaaa gaagaagatc
agagagtgat agagcttgta cagaaatacg gtccgaaacg 540ttggtctgtt
attgccaagc acttaaaggg gagaattgga aaacaatgta gggagaggtg
600gcataaccac ttgaatccag aagttaagaa aacctcctgg acagaagagg
aagacagaat 660tatttaccag gcacacaaga gactggggaa cagatgggca
gaaatcgcaa agctactgcc 720tggacgaact gataatgcta tcaagaacca
ctggaattct acaatgcgtc ggaaggtcga 780acaggaaggt tatctgcagg
agtcttcaaa agccagccag ccagcagtgg ccacaagctt 840ccagaagaac
agtcatttga tgggttttgc tcaggctccg cctacagctc aactccctgc
900cactggccag cccactgtta acaacgacta ttcctattac cacatttctg
aagcacaaaa 960tgtctccagt catgttccat accctgtagc gttacatgta
aatatagtca atgtccctca 1020gccagctgcc gcagccattc agagacacta
taatgatgaa gaccctgaga aggaaaagcg 1080aataaaggaa ttagaattgc
tcctaatgtc aaccgagaat gagctaaaag gacagcaggt 1140gctaccaaca
cagaaccaca catgcagcta ccccgggtgg cacagcacca ccattgccga
1200ccacaccaga cctcatggag acagtgcacc tgtttcctgt ttgggagaac
accactccac 1260tccatctctg ccagcggatc ctggctccct acctgaagaa
agcgcctcgc cagcaaggtg 1320catgatcgtc caccagggca ccattctgga
taatgttaag aacctcttag aatttgcaga 1380aacactccaa tttatagatt
ctgattcttc atcatggtgt gatctcagca gttttgaatt 1440ctttgaagaa
gcagattttt cacctagcca acatcacaca ggcaaagccc tacagcttca
1500gcaaagagag ggcaatggga ctaaacctgc aggagaacct agcccaaggg
tgaacaaacg 1560tatgttgagt gagagttcac ttgacccacc caaggtctta
cctcctgcaa ggcacagcac 1620aattccactg gtcatccttc gaaaaaaacg
gggccaggcc agccccttag ccactggaga 1680ctgtagctcc ttcatatttg
ctgacgtcag cagttcaact cccaagcgtt cccctgtcaa 1740aagcctaccc
ttctctccct cgcagttctt aaacacttcc agtaaccatg aaaactcaga
1800cttggaaatg ccttctttaa cttccacccc cctcattggt cacaaattga
ctgttacaac 1860accatttcat agagaccaga ctgtgaaaac tcaaaaggaa
aatactgttt ttagaacccc 1920agctatcaaa aggtcaatct tagaaagctc
tccaagaact cctacaccat tcaaacatgc 1980acttgcagct caagaaatta
aatacggtcc cctgaagatg ctacctcaga caccctctca 2040tctagtagaa
gatctgcagg atgtgatcaa acaggaatct gatgaatctg gaattgttgc
2100tgagtttcaa gaaaatggac cacccttact gaagaaaatc aaacaagagg
tggaatctcc 2160aactgataaa tcaggaaact tcttctgctc acaccactgg
gaaggggaca gtctgaatac 2220ccaactgttc acgcagacct cgcctgtggc
agatgcaccg aatattctta caagctccgt 2280tttaatggca ccagcatcag
aagatgaaga caatgttctc aaagcattta cagtacctaa 2340aaacaggtcc
ctggcgagcc ccttgcagcc ttgtagcagt acctgggaac ctgcatcctg
2400tggaaagatg gaggagcaga tgacatcttc cagtcaagct cgtaaatacg
tgaatgcatt 2460ctcagcccgg acgctggtca tgtgagacat ttccagaaaa
gcattatggt tttcagaaca 2520cttcaagttg acttgggata tatcattcct
caacatgaaa cttttcatga atgggagaag 2580aacctatttt tgttgtggta
caacagttga gagcagcacc aagtgcattt agttgaatga 2640agtcttcttg
gatttcaccc aactaaaagg atttttaaaa ataaataaca gtcttaccta
2700aattattagg taatgaattg tagccagttg ttaatatctt aatgcagatt
tttttaaaaa 2760aaacataaaa tgatttatct gtattttaaa ggatccaaca
gatcagtatt ttttcctgtg 2820atgggttttt tgaaatttga cacattaaaa
ggtactccag tatttcactt ttctcgatca 2880ctaaacatat gcatatattt
ttaaaaatca gtaaaagcat tactctaagt gtagacttaa 2940taccatgtga
catttaatcc agattgtaaa tgctcattta tggttaatga cattgaaggt
3000acatttattg taccaaacca ttttatgagt tttctgttag cttgctttaa
aaattattac 3060tgtaagaaat agttttataa aaaattatat ttttattcag
taatttaatt ttgtaaatgc 3120caaatgaaaa acgttttttg ctgctatggt
cttagcctgt agacatgctg ctagtatcag 3180aggggcagta gagcttggac
agaaagaaaa gaaacttggt gttaggtaat tgactatgca 3240ctagtatttc
agacttttta attttatata tatatacatt ttttttcctt ctgcaataca
3300tttgaaaact tgtttgggag actctgcatt ttttattgtg gtttttttgt
tattgttggt 3360ttatacaagc atgcgttgca cttctttttt gggagatgtg
tgttgttgat gttctatgtt 3420ttgttttgag tgtagcctga ctgttttata
atttgggagt tctgcatttg atccgcatcc 3480cctgtggttt ctaagtgtat
ggtctcagaa ctgttgcatg gatcctgtgt ttgcaactgg 3540ggagacagaa
actgtggttg atagccagtc actgccttaa gaacatttga tgcaagatgg
3600ccagcactga acttttgaga tatgacggtg tacttactgc cttgtagcaa
aataaagatg 3660tgcccttatt ttacctacaa a 368163342DNAHomo sapiens
6aatatcaacc tgtttcctcc tcctccttct cctcctcctc cgtgacctcc tcctcctctt
60tctcctgaga aacttcgccc cagcggtgcg gagcgccgct gcgcagccgg ggagggacgc
120aggcaggcgg cgggcagcgg gaggcggcag cccggtgcgg tccccgcggc
tctcggcgga 180gccccgcgcc cgccgcgcca tggcccgaag accccggcac
agcatatata gcagtgacga 240ggatgatgag gactttgaga tgtgtgacca
tgactatgat gggctgcttc ccaagtctgg 300aaagcgtcac ttggggaaaa
caaggtggac ccgggaagag gatgaaaaac tgaagaagct 360ggtggaacag
aatggaacag atgactggaa agttattgcc aattatctcc cgaatcgaac
420agatgtgcag tgccagcacc gatggcagaa agtactaaac cctgagctca
tcaagggtcc 480ttggaccaaa gaagaagatc agagagtgat agagcttgta
cagaaatacg gtccgaaacg 540ttggtctgtt attgccaagc acttaaaggg
gagaattgga aaacaatgta gggagaggtg 600gcataaccac ttgaatccag
aagttaagaa aacctcctgg acagaagagg aagacagaat 660tatttaccag
gcacacaaga gactggggaa cagatgggca gaaatcgcaa agctactgcc
720tggacgaact gataatgcta tcaagaacca ctggaattct acaatgcgtc
ggaaggtcga 780acaggaaggt tatctgcagg agtcttcaaa agccagccag
ccagcagtgg ccacaagctt 840ccagaagaac agtcatttga tgggttttgc
tcaggctccg cctacagctc aactccctgc 900cactggccag cccactgtta
acaacgacta ttcctattac cacatttctg aagcacaaaa 960tgtctccagt
catgttccat accctgtagc gttacatgta aatatagtca atgtccctca
1020gccagctgcc gcagccattc agagacacta taatgatgaa gaccctgaga
aggaaaagcg 1080aataaaggaa ttagaattgc tcctaatgtc aaccgagaat
gagctaaaag gacagcaggt 1140gctaccaaca cagaaccaca catgcagcta
ccccgggtgg cacagcacca ccattgccga 1200ccacaccaga cctcatggag
acagtgcacc tgtttcctgt ttgggagaac accactccac 1260tccatctctg
ccagcggatc ctggctccct acctgaagaa agcgcctcgc cagcaaggtg
1320catgatcgtc caccagggca ccattctgga taatgttaag aacctcttag
aatttgcaga 1380aacactccaa tttatagatt ctttcttaaa cacttccagt
aaccatgaaa actcagactt 1440ggaaatgcct tctttaactt ccacccccct
cattggtcac aaattgactg ttacaacacc 1500atttcataga gaccagactg
tgaaaactca aaaggaaaat actgttttta gaaccccagc 1560tatcaaaagg
tcaatcttag aaagctctcc aagaactcct acaccattca aacatgcact
1620tgcagctcaa gaaattaaat acggtcccct gaagatgcta cctcagacac
cctctcatct 1680agtagaagat ctgcaggatg tgatcaaaca ggaatctgat
gaatctggaa ttgttgctga 1740gtttcaagaa aatggaccac ccttactgaa
gaaaatcaaa caagaggtgg aatctccaac 1800tgataaatca ggaaacttct
tctgctcaca ccactgggaa ggggacagtc tgaataccca 1860actgttcacg
cagacctcgc ctgtggcaga tgcaccgaat attcttacaa gctccgtttt
1920aatggcacca gcatcagaag atgaagacaa tgttctcaaa gcatttacag
tacctaaaaa 1980caggtccctg gcgagcccct tgcagccttg tagcagtacc
tgggaacctg catcctgtgg 2040aaagatggag gagcagatga catcttccag
tcaagctcgt aaatacgtga atgcattctc 2100agcccggacg ctggtcatgt
gagacatttc cagaaaagca ttatggtttt cagaacactt 2160caagttgact
tgggatatat cattcctcaa catgaaactt ttcatgaatg ggagaagaac
2220ctatttttgt tgtggtacaa cagttgagag cagcaccaag tgcatttagt
tgaatgaagt 2280cttcttggat ttcacccaac taaaaggatt tttaaaaata
aataacagtc ttacctaaat 2340tattaggtaa tgaattgtag ccagttgtta
atatcttaat gcagattttt ttaaaaaaaa 2400cataaaatga tttatctgta
ttttaaagga tccaacagat cagtattttt tcctgtgatg 2460ggttttttga
aatttgacac attaaaaggt actccagtat ttcacttttc tcgatcacta
2520aacatatgca tatattttta aaaatcagta aaagcattac tctaagtgta
gacttaatac 2580catgtgacat ttaatccaga ttgtaaatgc tcatttatgg
ttaatgacat tgaaggtaca 2640tttattgtac caaaccattt tatgagtttt
ctgttagctt gctttaaaaa ttattactgt 2700aagaaatagt tttataaaaa
attatatttt tattcagtaa tttaattttg taaatgccaa 2760atgaaaaacg
ttttttgctg ctatggtctt agcctgtaga catgctgcta gtatcagagg
2820ggcagtagag cttggacaga aagaaaagaa acttggtgtt aggtaattga
ctatgcacta 2880gtatttcaga ctttttaatt ttatatatat atacattttt
tttccttctg caatacattt 2940gaaaacttgt ttgggagact ctgcattttt
tattgtggtt tttttgttat tgttggttta 3000tacaagcatg cgttgcactt
cttttttggg agatgtgtgt tgttgatgtt ctatgttttg 3060ttttgagtgt
agcctgactg ttttataatt tgggagttct gcatttgatc cgcatcccct
3120gtggtttcta agtgtatggt ctcagaactg ttgcatggat cctgtgtttg
caactgggga 3180gacagaaact gtggttgata gccagtcact gccttaagaa
catttgatgc aagatggcca 3240gcactgaact tttgagatat gacggtgtac
ttactgcctt gtagcaaaat aaagatgtgc 3300ccttatttta cctaaaaaaa
aaaaaaaaaa aaaaaaaaaa aa 334273309DNAHomo sapiens 7aatatcaacc
tgtttcctcc tcctccttct cctcctcctc cgtgacctcc tcctcctctt 60tctcctgaga
aacttcgccc cagcggtgcg gagcgccgct gcgcagccgg ggagggacgc
120aggcaggcgg cgggcagcgg gaggcggcag cccggtgcgg tccccgcggc
tctcggcgga 180gccccgcgcc cgccgcgcca tggcccgaag accccggcac
agcatatata gcagtgacga 240ggatgatgag gactttgaga tgtgtgacca
tgactatgat gggctgcttc ccaagtctgg 300aaagcgtcac ttggggaaaa
caaggtggac ccgggaagag gatgaaaaac tgaagaagct 360ggtggaacag
aatggaacag atgactggaa agttattgcc aattatctcc cgaatcgaac
420agatgtgcag tgccagcacc gatggcagaa agtactaaac cctgagctca
tcaagggtcc 480ttggaccaaa gaagaagatc agagagtgat agagcttgta
cagaaatacg gtccgaaacg 540ttggtctgtt attgccaagc acttaaaggg
gagaattgga aaacaatgta gggagaggtg 600gcataaccac ttgaatccag
aagttaagaa aacctcctgg acagaagagg aagacagaat 660tatttaccag
gcacacaaga gactggggaa cagatgggca gaaatcgcaa agctactgcc
720tggacgaact gataatgcta tcaagaacca ctggaattct acaatgcgtc
ggaaggtcga 780acaggaaggt tatctgcagg agtcttcaaa agccagccag
ccagcagtgg ccacaagctt 840ccagaagaac agtcatttga tgggttttgc
tcaggctccg cctacagctc aactccctgc 900cactggccag cccactgtta
acaacgacta ttcctattac cacatttctg aagcacaaaa 960tgtctccagt
catgttccat accctgtagc gttacatgta aatatagtca atgtccctca
1020gccagctgcc gcagccattc agagacacta taatgatgaa gaccctgaga
aggaaaagcg 1080aataaaggaa ttagaattgc tcctaatgtc aaccgagaat
gagctaaaag gacagcagac 1140acagaaccac acatgcagct accccgggtg
gcacagcacc accattgccg accacaccag 1200acctcatgga gacagtgcac
ctgtttcctg tttgggagaa caccactcca ctccatctct 1260gccagcggat
cctggctccc tacctgaaga aagcgcctcg ccagcaaggt gcatgatcgt
1320ccaccagggc accattctgg ataatgttaa gaacctctta gaatttgcag
aaacactcca 1380atttatagat tctttcttaa acacttccag taaccatgaa
aactcagact tggaaatgcc 1440ttctttaact tccacccccc tcattggtca
caaattgact gttacaacac catttcatag 1500agaccagact gtgaaaactc
aaaaggaaaa tactgttttt agaaccccag ctatcaaaag 1560gtcaatctta
gaaagctctc caagaactcc tacaccattc aaacatgcac ttgcagctca
1620agaaattaaa tacggtcccc tgaagatgct acctcagaca ccctctcatc
tagtagaaga 1680tctgcaggat gtgatcaaac aggaatctga tgaatctgga
attgttgctg agtttcaaga 1740aaatggacca cccttactga agaaaatcaa
acaagaggtg gaatctccaa ctgataaatc 1800aggaaacttc ttctgctcac
accactggga aggggacagt ctgaataccc aactgttcac 1860gcagacctcg
cctgtggcag atgcaccgaa tattcttaca agctccgttt taatggcacc
1920agcatcagaa gatgaagaca atgttctcaa agcatttaca gtacctaaaa
acaggtccct 1980ggcgagcccc ttgcagcctt gtagcagtac ctgggaacct
gcatcctgtg gaaagatgga 2040ggagcagatg acatcttcca gtcaagctcg
taaatacgtg aatgcattct cagcccggac 2100gctggtcatg tgagacattt
ccagaaaagc attatggttt tcagaacact tcaagttgac 2160ttgggatata
tcattcctca acatgaaact tttcatgaat gggagaagaa cctatttttg
2220ttgtggtaca acagttgaga gcagcaccaa gtgcatttag ttgaatgaag
tcttcttgga 2280tttcacccaa ctaaaaggat ttttaaaaat aaataacagt
cttacctaaa ttattaggta 2340atgaattgta gccagttgtt aatatcttaa
tgcagatttt tttaaaaaaa acataaaatg 2400atttatctgt attttaaagg
atccaacaga tcagtatttt ttcctgtgat gggttttttg 2460aaatttgaca
cattaaaagg tactccagta tttcactttt ctcgatcact aaacatatgc
2520atatattttt aaaaatcagt aaaagcatta ctctaagtgt agacttaata
ccatgtgaca 2580tttaatccag attgtaaatg ctcatttatg gttaatgaca
ttgaaggtac atttattgta 2640ccaaaccatt ttatgagttt tctgttagct
tgctttaaaa attattactg taagaaatag 2700ttttataaaa aattatattt
ttattcagta atttaatttt gtaaatgcca aatgaaaaac 2760gttttttgct
gctatggtct tagcctgtag acatgctgct agtatcagag gggcagtaga
2820gcttggacag aaagaaaaga aacttggtgt taggtaattg actatgcact
agtatttcag 2880actttttaat tttatatata tatacatttt ttttccttct
gcaatacatt tgaaaacttg 2940tttgggagac tctgcatttt ttattgtggt
ttttttgtta ttgttggttt atacaagcat 3000gcgttgcact tcttttttgg
gagatgtgtg ttgttgatgt tctatgtttt gttttgagtg 3060tagcctgact
gttttataat ttgggagttc tgcatttgat ccgcatcccc tgtggtttct
3120aagtgtatgg tctcagaact gttgcatgga tcctgtgttt gcaactgggg
agacagaaac 3180tgtggttgat agccagtcac tgccttaaga acatttgatg
caagatggcc agcactgaac 3240ttttgagata tgacggtgta cttactgcct
tgtagcaaaa taaagatgtg cccttatttt 3300acctacaaa 330983669DNAHomo
sapiens 8aatatcaacc tgtttcctcc tcctccttct cctcctcctc cgtgacctcc
tcctcctctt 60tctcctgaga aacttcgccc cagcggtgcg gagcgccgct gcgcagccgg
ggagggacgc 120aggcaggcgg cgggcagcgg gaggcggcag cccggtgcgg
tccccgcggc tctcggcgga 180gccccgcgcc cgccgcgcca tggcccgaag
accccggcac agcatatata gcagtgacga 240ggatgatgag gactttgaga
tgtgtgacca tgactatgat gggctgcttc ccaagtctgg 300aaagcgtcac
ttggggaaaa caaggtggac ccgggaagag gatgaaaaac tgaagaagct
360ggtggaacag aatggaacag atgactggaa agttattgcc aattatctcc
cgaatcgaac 420agatgtgcag tgccagcacc gatggcagaa agtactaaac
cctgagctca tcaagggtcc 480ttggaccaaa gaagaagatc agagagtgat
agagcttgta cagaaatacg gtccgaaacg 540ttggtctgtt attgccaagc
acttaaaggg gagaattgga aaacaatgta gggagaggtg 600gcataaccac
ttgaatccag aagttaagaa aacctcctgg acagaagagg aagacagaat
660tatttaccag gcacacaaga gactggggaa cagatgggca gaaatcgcaa
agctactgcc 720tggacgaact gataatgcta tcaagaacca ctggaattct
acaatgcgtc ggaaggtcga 780acaggaaggt tatctgcagg agtcttcaaa
agccagccag ccagcagtgg ccacaagctt 840ccagaagaac agtcatttga
tgggttttgc tcaggctccg cctacagctc aactccctgc 900cactggccag
cccactgtta acaacgacta ttcctattac cacatttctg aagcacaaaa
960tgtctccagt catgttccat accctgtagc gttacatgta aatatagtca
atgtccctca 1020gccagctgcc gcagccattc agagacacta taatgatgaa
gaccctgaga aggaaaagcg 1080aataaaggaa ttagaattgc tcctaatgtc
aaccgagaat gagctaaaag gacagcagac 1140acagaaccac acatgcagct
accccgggtg gcacagcacc accattgccg accacaccag 1200acctcatgga
gacagtgcac ctgtttcctg tttgggagaa caccactcca ctccatctct
1260gccagcggat cctggctccc tacctgaaga aagcgcctcg ccagcaaggt
gcatgatcgt 1320ccaccagggc accattctgg ataatgttaa gaacctctta
gaatttgcag aaacactcca 1380atttatagat tctgattctt catcatggtg
tgatctcagc agttttgaat tctttgaaga 1440agcagatttt tcacctagcc
aacatcacac aggcaaagcc ctacagcttc agcaaagaga 1500gggcaatggg
actaaacctg caggagaacc tagcccaagg gtgaacaaac gtatgttgag
1560tgagagttca cttgacccac ccaaggtctt acctcctgca aggcacagca
caattccact 1620ggtcatcctt cgaaaaaaac ggggccaggc cagcccctta
gccactggag actgtagctc 1680cttcatattt gctgacgtca gcagttcaac
tcccaagcgt tcccctgtca aaagcctacc 1740cttctctccc tcgcagttct
taaacacttc cagtaaccat gaaaactcag acttggaaat 1800gccttcttta
acttccaccc ccctcattgg tcacaaattg actgttacaa caccatttca
1860tagagaccag actgtgaaaa ctcaaaagga aaatactgtt tttagaaccc
cagctatcaa 1920aaggtcaatc ttagaaagct ctccaagaac tcctacacca
ttcaaacatg cacttgcagc 1980tcaagaaatt aaatacggtc ccctgaagat
gctacctcag acaccctctc atctagtaga 2040agatctgcag gatgtgatca
aacaggaatc tgatgaatct ggaattgttg ctgagtttca 2100agaaaatgga
ccacccttac tgaagaaaat caaacaagag gtggaatctc caactgataa
2160atcaggaaac ttcttctgct cacaccactg ggaaggggac agtctgaata
cccaactgtt 2220cacgcagacc tcgcctgtgg cagatgcacc gaatattctt
acaagctccg ttttaatggc 2280accagcatca gaagatgaag acaatgttct
caaagcattt acagtaccta aaaacaggtc 2340cctggcgagc cccttgcagc
cttgtagcag tacctgggaa cctgcatcct gtggaaagat 2400ggaggagcag
atgacatctt ccagtcaagc tcgtaaatac gtgaatgcat tctcagcccg
2460gacgctggtc atgtgagaca tttccagaaa agcattatgg ttttcagaac
acttcaagtt 2520gacttgggat atatcattcc tcaacatgaa acttttcatg
aatgggagaa gaacctattt 2580ttgttgtggt acaacagttg agagcagcac
caagtgcatt tagttgaatg aagtcttctt 2640ggatttcacc caactaaaag
gatttttaaa aataaataac agtcttacct aaattattag 2700gtaatgaatt
gtagccagtt gttaatatct taatgcagat ttttttaaaa aaaacataaa
2760atgatttatc tgtattttaa aggatccaac agatcagtat tttttcctgt
gatgggtttt 2820ttgaaatttg acacattaaa aggtactcca gtatttcact
tttctcgatc actaaacata 2880tgcatatatt tttaaaaatc agtaaaagca
ttactctaag tgtagactta ataccatgtg 2940acatttaatc cagattgtaa
atgctcattt atggttaatg acattgaagg tacatttatt 3000gtaccaaacc
attttatgag ttttctgtta gcttgcttta aaaattatta ctgtaagaaa
3060tagttttata aaaaattata tttttattca gtaatttaat tttgtaaatg
ccaaatgaaa 3120aacgtttttt gctgctatgg tcttagcctg tagacatgct
gctagtatca gaggggcagt 3180agagcttgga cagaaagaaa agaaacttgg
tgttaggtaa ttgactatgc actagtattt
3240cagacttttt aattttatat atatatacat tttttttcct tctgcaatac
atttgaaaac 3300ttgtttggga gactctgcat tttttattgt ggtttttttg
ttattgttgg tttatacaag 3360catgcgttgc acttcttttt tgggagatgt
gtgttgttga tgttctatgt tttgttttga 3420gtgtagcctg actgttttat
aatttgggag ttctgcattt gatccgcatc ccctgtggtt 3480tctaagtgta
tggtctcaga actgttgcat ggatcctgtg tttgcaactg gggagacaga
3540aactgtggtt gatagccagt cactgcctta agaacatttg atgcaagatg
gccagcactg 3600aacttttgag atatgacggt gtacttactg ccttgtagca
aaataaagat gtgcccttat 3660tttacctac 366993060DNAHomo sapiens
9aatatcaacc tgtttcctcc tcctccttct cctcctcctc cgtgacctcc tcctcctctt
60tctcctgaga aacttcgccc cagcggtgcg gagcgccgct gcgcagccgg ggagggacgc
120aggcaggcgg cgggcagcgg gaggcggcag cccggtgcgg tccccgcggc
tctcggcgga 180gccccgcgcc cgccgcgcca tggcccgaag accccggcac
agcatatata gcagtgacga 240ggatgatgag gactttgaga tgtgtgacca
tgactatgat gggctgcttc ccaagtctgg 300aaagcgtcac ttggggaaaa
caaggtggac ccgggaagag gatgaaaaac tgaagaagct 360ggtggaacag
aatggaacag atgactggaa agttattgcc aattatctcc cgaatcgaac
420agatgtgcag tgccagcacc gatggcagaa agtactaaac cctgagctca
tcaagggtcc 480ttggaccaaa gaagaagatc agagagtgat agagcttgta
cagaaatacg gtccgaaacg 540ttggtctgtt attgccaagc acttaaaggg
gagaattgga aaacaatgta gggagaggtg 600gcataaccac ttgaatccag
aagttaagaa aacctcctgg acagaagagg aagacagaat 660tatttaccag
gcacacaaga gactggggaa cagatgggca gaaatcgcaa agctactgcc
720tggacgaact gataatgcta tcaagaacca ctggaattct acaatgcgtc
ggaaggtcga 780acaggaaggt tatctgcagg agtcttcaaa agccagccag
ccagcagtgg ccacaagctt 840ccagaagaac agtcatttga tgggttttgc
tcaggctccg cctacagctc aactccctgc 900cactggccag cccactgtta
acaacgacta ttcctattac cacatttctg aagcacaaaa 960tgtctccagt
catgttccat accctgtagc gttacatgta aatatagtca atgtccctca
1020gccagctgcc gcagccattc agagacacta taatgatgaa gaccctgaga
aggaaaagcg 1080aataaaggaa ttagaattgc tcctaatgtc aaccgagaat
gagctaaaag gacagcaggt 1140gctaccattc ttaaacactt ccagtaacca
tgaaaactca gacttggaaa tgccttcttt 1200aacttccacc cccctcattg
gtcacaaatt gactgttaca acaccatttc atagagacca 1260gactgtgaaa
actcaaaagg aaaatactgt ttttagaacc ccagctatca aaaggtcaat
1320cttagaaagc tctccaagaa ctcctacacc attcaaacat gcacttgcag
ctcaagaaat 1380taaatacggt cccctgaaga tgctacctca gacaccctct
catctagtag aagatctgca 1440ggatgtgatc aaacaggaat ctgatgaatc
tggaattgtt gctgagtttc aagaaaatgg 1500accaccctta ctgaagaaaa
tcaaacaaga ggtggaatct ccaactgata aatcaggaaa 1560cttcttctgc
tcacaccact gggaagggga cagtctgaat acccaactgt tcacgcagac
1620ctcgcctgtg gcagatgcac cgaatattct tacaagctcc gttttaatgg
caccagcatc 1680agaagatgaa gacaatgttc tcaaagcatt tacagtacct
aaaaacaggt ccctggcgag 1740ccccttgcag ccttgtagca gtacctggga
acctgcatcc tgtggaaaga tggaggagca 1800gatgacatct tccagtcaag
ctcgtaaata cgtgaatgca ttctcagccc ggacgctggt 1860catgtgagac
atttccagaa aagcattatg gttttcagaa cacttcaagt tgacttggga
1920tatatcattc ctcaacatga aacttttcat gaatgggaga agaacctatt
tttgttgtgg 1980tacaacagtt gagagcagca ccaagtgcat ttagttgaat
gaagtcttct tggatttcac 2040ccaactaaaa ggatttttaa aaataaataa
cagtcttacc taaattatta ggtaatgaat 2100tgtagccagt tgttaatatc
ttaatgcaga tttttttaaa aaaaacataa aatgatttat 2160ctgtatttta
aaggatccaa cagatcagta ttttttcctg tgatgggttt tttgaaattt
2220gacacattaa aaggtactcc agtatttcac ttttctcgat cactaaacat
atgcatatat 2280ttttaaaaat cagtaaaagc attactctaa gtgtagactt
aataccatgt gacatttaat 2340ccagattgta aatgctcatt tatggttaat
gacattgaag gtacatttat tgtaccaaac 2400cattttatga gttttctgtt
agcttgcttt aaaaattatt actgtaagaa atagttttat 2460aaaaaattat
atttttattc agtaatttaa ttttgtaaat gccaaatgaa aaacgttttt
2520tgctgctatg gtcttagcct gtagacatgc tgctagtatc agaggggcag
tagagcttgg 2580acagaaagaa aagaaacttg gtgttaggta attgactatg
cactagtatt tcagactttt 2640taattttata tatatataca ttttttttcc
ttctgcaata catttgaaaa cttgtttggg 2700agactctgca ttttttattg
tggttttttt gttattgttg gtttatacaa gcatgcgttg 2760cacttctttt
ttgggagatg tgtgttgttg atgttctatg ttttgttttg agtgtagcct
2820gactgtttta taatttggga gttctgcatt tgatccgcat cccctgtggt
ttctaagtgt 2880atggtctcag aactgttgca tggatcctgt gtttgcaact
ggggagacag aaactgtggt 2940tgatagccag tcactgcctt aagaacattt
gatgcaagat ggccagcact gaacttttga 3000gatatgacgg tgtacttact
gccttgtagc aaaataaaga tgtgccctta ttttacctac 3060103630DNAHomo
sapiens 10aatatcaacc tgtttcctcc tcctccttct cctcctcctc cgtgacctcc
tcctcctctt 60tctcctgaga aacttcgccc cagcggtgcg gagcgccgct gcgcagccgg
ggagggacgc 120aggcaggcgg cgggcagcgg gaggcggcag cccggtgcgg
tccccgcggc tctcggcgga 180gccccgcgcc cgccgcgcca tggcccgaag
accccggcac agcatatata gcagtgacga 240ggatgatgag gactttgaga
tgtgtgacca tgactatgat gggctgcttc ccaagtctgg 300aaagcgtcac
ttggggaaaa caaggtggac ccgggaagag gatgaaaaac tgaagaagct
360ggtggaacag aatggaacag atgactggaa agttattgcc aattatctcc
cgaatcgaac 420agatgtgcag tgccagcacc gatggcagaa agtactaaac
cctgagctca tcaagggtcc 480ttggaccaaa gaagaagatc agagagtgat
agagcttgta cagaaatacg gtccgaaacg 540ttggtctgtt attgccaagc
acttaaaggg gagaattgga aaacaatgta gggagaggtg 600gcataaccac
ttgaatccag aagttaagaa aacctcctgg acagaagagg aagacagaat
660tatttaccag gcacacaaga gactggggaa cagatgggca gaaatcgcaa
agctactgcc 720tggacgaact gataatgcta tcaagaacca ctggaattct
acaatgcgtc ggaaggtcga 780acaggaaggt tatctgcagg agtcttcaaa
agccagccag ccagcagtgg ccacaagctt 840ccagaagaac agtcatttga
tgggttttgc tcaggctccg cctacagctc aactccctgc 900cactggccag
cccactgtta acaacgacta ttcctattac cacatttctg aagcacaaaa
960tgtctccagt catgttccat accctgtagc gttacatgta aatatagtca
atgtccctca 1020gccagctgcc gcagccattc agagacacta taatgatgaa
gaccctgaga aggaaaagcg 1080aataaaggaa ttagaattgc tcctaatgtc
aaccgagaat gagctaaaag gacagcaggt 1140gctaccaaca cagaaccaca
catgcagcta ccccgggtgg cacagcacca ccattgccga 1200ccacaccaga
cctcatggag acagtgcacc tgtttcctgt ttgggagaac accactccac
1260tccatctctg ccagcggatc ctggctccct acctgaagaa agcgcctcgc
cagcaaggtg 1320catgatcgtc caccagggca ccattctgga taatgattct
tcatcatggt gtgatctcag 1380cagttttgaa ttctttgaag aagcagattt
ttcacctagc caacatcaca caggcaaagc 1440cctacagctt cagcaaagag
agggcaatgg gactaaacct gcaggagaac ctagcccaag 1500ggtgaacaaa
cgtatgttga gtgagagttc acttgaccca cccaaggtct tacctcctgc
1560aaggcacagc acaattccac tggtcatcct tcgaaaaaaa cggggccagg
ccagcccctt 1620agccactgga gactgtagct ccttcatatt tgctgacgtc
agcagttcaa ctcccaagcg 1680ttcccctgtc aaaagcctac ccttctctcc
ctcgcagttc ttaaacactt ccagtaacca 1740tgaaaactca gacttggaaa
tgccttcttt aacttccacc cccctcattg gtcacaaatt 1800gactgttaca
acaccatttc atagagacca gactgtgaaa actcaaaagg aaaatactgt
1860ttttagaacc ccagctatca aaaggtcaat cttagaaagc tctccaagaa
ctcctacacc 1920attcaaacat gcacttgcag ctcaagaaat taaatacggt
cccctgaaga tgctacctca 1980gacaccctct catctagtag aagatctgca
ggatgtgatc aaacaggaat ctgatgaatc 2040tggaattgtt gctgagtttc
aagaaaatgg accaccctta ctgaagaaaa tcaaacaaga 2100ggtggaatct
ccaactgata aatcaggaaa cttcttctgc tcacaccact gggaagggga
2160cagtctgaat acccaactgt tcacgcagac ctcgcctgtg gcagatgcac
cgaatattct 2220tacaagctcc gttttaatgg caccagcatc agaagatgaa
gacaatgttc tcaaagcatt 2280tacagtacct aaaaacaggt ccctggcgag
ccccttgcag ccttgtagca gtacctggga 2340acctgcatcc tgtggaaaga
tggaggagca gatgacatct tccagtcaag ctcgtaaata 2400cgtgaatgca
ttctcagccc ggacgctggt catgtgagac atttccagaa aagcattatg
2460gttttcagaa cacttcaagt tgacttggga tatatcattc ctcaacatga
aacttttcat 2520gaatgggaga agaacctatt tttgttgtgg tacaacagtt
gagagcagca ccaagtgcat 2580ttagttgaat gaagtcttct tggatttcac
ccaactaaaa ggatttttaa aaataaataa 2640cagtcttacc taaattatta
ggtaatgaat tgtagccagt tgttaatatc ttaatgcaga 2700tttttttaaa
aaaaacataa aatgatttat ctgtatttta aaggatccaa cagatcagta
2760ttttttcctg tgatgggttt tttgaaattt gacacattaa aaggtactcc
agtatttcac 2820ttttctcgat cactaaacat atgcatatat ttttaaaaat
cagtaaaagc attactctaa 2880gtgtagactt aataccatgt gacatttaat
ccagattgta aatgctcatt tatggttaat 2940gacattgaag gtacatttat
tgtaccaaac cattttatga gttttctgtt agcttgcttt 3000aaaaattatt
actgtaagaa atagttttat aaaaaattat atttttattc agtaatttaa
3060ttttgtaaat gccaaatgaa aaacgttttt tgctgctatg gtcttagcct
gtagacatgc 3120tgctagtatc agaggggcag tagagcttgg acagaaagaa
aagaaacttg gtgttaggta 3180attgactatg cactagtatt tcagactttt
taattttata tatatataca ttttttttcc 3240ttctgcaata catttgaaaa
cttgtttggg agactctgca ttttttattg tggttttttt 3300gttattgttg
gtttatacaa gcatgcgttg cacttctttt ttgggagatg tgtgttgttg
3360atgttctatg ttttgttttg agtgtagcct gactgtttta taatttggga
gttctgcatt 3420tgatccgcat cccctgtggt ttctaagtgt atggtctcag
aactgttgca tggatcctgt 3480gtttgcaact ggggagacag aaactgtggt
tgatagccag tcactgcctt aagaacattt 3540gatgcaagat ggccagcact
gaacttttga gatatgacgg tgtacttact gccttgtagc 3600aaaataaaga
tgtgccctta ttttacctac 3630113204DNAHomo sapiens 11aatatcaacc
tgtttcctcc tcctccttct cctcctcctc cgtgacctcc tcctcctctt 60tctcctgaga
aacttcgccc cagcggtgcg gagcgccgct gcgcagccgg ggagggacgc
120aggcaggcgg cgggcagcgg gaggcggcag cccggtgcgg tccccgcggc
tctcggcgga 180gccccgcgcc cgccgcgcca tggcccgaag accccggcac
agcatatata gcagtgacga 240ggatgatgag gactttgaga tgtgtgacca
tgactatgat gggctgcttc ccaagtctgg 300aaagcgtcac ttggggaaaa
caaggtggac ccgggaagag gatgaaaaac tgaagaagct 360ggtggaacag
aatggaacag atgactggaa agttattgcc aattatctcc cgaatcgaac
420agatgtgcag tgccagcacc gatggcagaa agtactaaac cctgagctca
tcaagggtcc 480ttggaccaaa gaagaagatc agagagtgat agagcttgta
cagaaatacg gtccgaaacg 540ttggtctgtt attgccaagc acttaaaggg
gagaattgga aaacaatgta gggagaggtg 600gcataaccac ttgaatccag
aagttaagaa aacctcctgg acagaagagg aagacagaat 660tatttaccag
gcacacaaga gactggggaa cagatgggca gaaatcgcaa agctactgcc
720tggacgaact gataatgcta tcaagaacca ctggaattct acaatgcgtc
ggaaggtcga 780acaggaaggt tatctgcagg agtcttcaaa agccagccag
ccagcagtgg ccacaagctt 840ccagaagaac agtcatttga tgggttttgc
tcaggctccg cctacagctc aactccctgc 900cactggccag cccactgtta
acaacgacta ttcctattac cacatttctg aagcacaaaa 960tgtctccagt
catgttccat accctgtagc gttacatgta aatatagtca atgtccctca
1020gccagctgcc gcagccattc agagacacta taatgatgaa gaccctgaga
aggaaaagcg 1080aataaaggaa ttagaattgc tcctaatgtc aaccgagaat
gagctaaaag gacagcaggt 1140gctaccaaca cagaaccaca catgcagcta
ccccgggtgg cacagcacca ccattgccga 1200ccacaccaga cctcatggag
acagtgcacc tgtttcctgt ttgggagaac accactccac 1260tccatctctg
ccagcggatc ctggctccct acctgaagaa agcgcctcgc cagcaaggtg
1320catgatcgtc caccagggca ccattctgga taatgttaag aacctcttag
aatttgcaga 1380aacactccaa tttatagatt ctttcttaaa cacttccagt
aaccatgaaa actcagactt 1440ggaaatgcct tctttaactt ccacccccct
cattggtcac aaattgactg ttacaacacc 1500atttcataga gaccagactg
tgaaaactca aaaggaaaat actgttttta gaaccccagc 1560tatcaaaagg
tcaatcttag aaagctctcc aagaactcct acaccattca aacatgcact
1620tgcagctcaa gaaattaaat acggtcccct gaagatgcta cctcagacac
cctctcatct 1680agtagaagat ctgcaggatg tgatcaaaca ggaatctgat
gaatctggaa ttgttgctga 1740gtttcaagaa aatggaccac ccttactgaa
gaaaatcaaa caagagaata ttcttacaag 1800ctccgtttta atggcaccag
catcagaaga tgaagacaat gttctcaaag catttacagt 1860acctaaaaac
aggtccctgg cgagcccctt gcagccttgt agcagtacct gggaacctgc
1920atcctgtgga aagatggagg agcagatgac atcttccagt caagctcgta
aatacgtgaa 1980tgcattctca gcccggacgc tggtcatgtg agacatttcc
agaaaagcat tatggttttc 2040agaacacttc aagttgactt gggatatatc
attcctcaac atgaaacttt tcatgaatgg 2100gagaagaacc tatttttgtt
gtggtacaac agttgagagc agcaccaagt gcatttagtt 2160gaatgaagtc
ttcttggatt tcacccaact aaaaggattt ttaaaaataa ataacagtct
2220tacctaaatt attaggtaat gaattgtagc cagttgttaa tatcttaatg
cagatttttt 2280taaaaaaaac ataaaatgat ttatctgtat tttaaaggat
ccaacagatc agtatttttt 2340cctgtgatgg gttttttgaa atttgacaca
ttaaaaggta ctccagtatt tcacttttct 2400cgatcactaa acatatgcat
atatttttaa aaatcagtaa aagcattact ctaagtgtag 2460acttaatacc
atgtgacatt taatccagat tgtaaatgct catttatggt taatgacatt
2520gaaggtacat ttattgtacc aaaccatttt atgagttttc tgttagcttg
ctttaaaaat 2580tattactgta agaaatagtt ttataaaaaa ttatattttt
attcagtaat ttaattttgt 2640aaatgccaaa tgaaaaacgt tttttgctgc
tatggtctta gcctgtagac atgctgctag 2700tatcagaggg gcagtagagc
ttggacagaa agaaaagaaa cttggtgtta ggtaattgac 2760tatgcactag
tatttcagac tttttaattt tatatatata tacatttttt ttccttctgc
2820aatacatttg aaaacttgtt tgggagactc tgcatttttt attgtggttt
ttttgttatt 2880gttggtttat acaagcatgc gttgcacttc ttttttggga
gatgtgtgtt gttgatgttc 2940tatgttttgt tttgagtgta gcctgactgt
tttataattt gggagttctg catttgatcc 3000gcatcccctg tggtttctaa
gtgtatggtc tcagaactgt tgcatggatc ctgtgtttgc 3060aactggggag
acagaaactg tggttgatag ccagtcactg ccttaagaac atttgatgca
3120agatggccag cactgaactt ttgagatatg acggtgtact tactgccttg
tagcaaaata 3180aagatgtgcc cttattttac ctac 3204123210DNAHomo sapiens
12aatatcaacc tgtttcctcc tcctccttct cctcctcctc cgtgacctcc tcctcctctt
60tctcctgaga aacttcgccc cagcggtgcg gagcgccgct gcgcagccgg ggagggacgc
120aggcaggcgg cgggcagcgg gaggcggcag cccggtgcgg tccccgcggc
tctcggcgga 180gccccgcgcc cgccgcgcca tggcccgaag accccggcac
agcatatata gcagtgacga 240ggatgatgag gactttgaga tgtgtgacca
tgactatgat gggctgcttc ccaagtctgg 300aaagcgtcac ttggggaaaa
caaggtggac ccgggaagag gatgaaaaac tgaagaagct 360ggtggaacag
aatggaacag atgactggaa agttattgcc aattatctcc cgaatcgaac
420agatgtgcag tgccagcacc gatggcagaa agtactaaac cctgagctca
tcaagggtcc 480ttggaccaaa gaagaagatc agagagtgat agagcttgta
cagaaatacg gtccgaaacg 540ttggtctgtt attgccaagc acttaaaggg
gagaattgga aaacaatgta gggagaggtg 600gcataaccac ttgaatccag
aagttaagaa aacctcctgg acagaagagg aagacagaat 660tatttaccag
gcacacaaga gactggggaa cagatgggca gaaatcgcaa agctactgcc
720tggacgaact gataatgcta tcaagaacca ctggaattct acaatgcgtc
ggaaggtcga 780acaggaaggt tatctgcagg agtcttcaaa agccagccag
ccagcagtgg ccacaagctt 840ccagaagaac agtcatttga tgggttttgc
tcaggctccg cctacagctc aactccctgc 900cactggccag cccactgtta
acaacgacta ttcctattac cacatttctg aagcacaaaa 960tgtctccagt
catgttccat accctgtagc gttacatgta aatatagtca atgtccctca
1020gccagctgcc gcagccattc agacacagaa ccacacatgc agctaccccg
ggtggcacag 1080caccaccatt gccgaccaca ccagacctca tggagacagt
gcacctgttt cctgtttggg 1140agaacaccac tccactccat ctctgccagc
ggatcctggc tccctacctg aagaaagcgc 1200ctcgccagca aggtgcatga
tcgtccacca gggcaccatt ctggataatg ttaagaacct 1260cttagaattt
gcagaaacac tccaatttat agattctttc ttaaacactt ccagtaacca
1320tgaaaactca gacttggaaa tgccttcttt aacttccacc cccctcattg
gtcacaaatt 1380gactgttaca acaccatttc atagagacca gactgtgaaa
actcaaaagg aaaatactgt 1440ttttagaacc ccagctatca aaaggtcaat
cttagaaagc tctccaagaa ctcctacacc 1500attcaaacat gcacttgcag
ctcaagaaat taaatacggt cccctgaaga tgctacctca 1560gacaccctct
catctagtag aagatctgca ggatgtgatc aaacaggaat ctgatgaatc
1620tggaattgtt gctgagtttc aagaaaatgg accaccctta ctgaagaaaa
tcaaacaaga 1680ggtggaatct ccaactgata aatcaggaaa cttcttctgc
tcacaccact gggaagggga 1740cagtctgaat acccaactgt tcacgcagac
ctcgcctgtg gcagatgcac cgaatattct 1800tacaagctcc gttttaatgg
caccagcatc agaagatgaa gacaatgttc tcaaagcatt 1860tacagtacct
aaaaacaggt ccctggcgag ccccttgcag ccttgtagca gtacctggga
1920acctgcatcc tgtggaaaga tggaggagca gatgacatct tccagtcaag
ctcgtaaata 1980cgtgaatgca ttctcagccc ggacgctggt catgtgagac
atttccagaa aagcattatg 2040gttttcagaa cacttcaagt tgacttggga
tatatcattc ctcaacatga aacttttcat 2100gaatgggaga agaacctatt
tttgttgtgg tacaacagtt gagagcagca ccaagtgcat 2160ttagttgaat
gaagtcttct tggatttcac ccaactaaaa ggatttttaa aaataaataa
2220cagtcttacc taaattatta ggtaatgaat tgtagccagt tgttaatatc
ttaatgcaga 2280tttttttaaa aaaaacataa aatgatttat ctgtatttta
aaggatccaa cagatcagta 2340ttttttcctg tgatgggttt tttgaaattt
gacacattaa aaggtactcc agtatttcac 2400ttttctcgat cactaaacat
atgcatatat ttttaaaaat cagtaaaagc attactctaa 2460gtgtagactt
aataccatgt gacatttaat ccagattgta aatgctcatt tatggttaat
2520gacattgaag gtacatttat tgtaccaaac cattttatga gttttctgtt
agcttgcttt 2580aaaaattatt actgtaagaa atagttttat aaaaaattat
atttttattc agtaatttaa 2640ttttgtaaat gccaaatgaa aaacgttttt
tgctgctatg gtcttagcct gtagacatgc 2700tgctagtatc agaggggcag
tagagcttgg acagaaagaa aagaaacttg gtgttaggta 2760attgactatg
cactagtatt tcagactttt taattttata tatatataca ttttttttcc
2820ttctgcaata catttgaaaa cttgtttggg agactctgca ttttttattg
tggttttttt 2880gttattgttg gtttatacaa gcatgcgttg cacttctttt
ttgggagatg tgtgttgttg 2940atgttctatg ttttgttttg agtgtagcct
gactgtttta taatttggga gttctgcatt 3000tgatccgcat cccctgtggt
ttctaagtgt atggtctcag aactgttgca tggatcctgt 3060gtttgcaact
ggggagacag aaactgtggt tgatagccag tcactgcctt aagaacattt
3120gatgcaagat ggccagcact gaacttttga gatatgacgg tgtacttact
gccttgtagc 3180aaaataaaga tgtgccctta ttttacctac 321013761PRTHomo
sapiens 13Met Ala Arg Arg Pro Arg His Ser Ile Tyr Ser Ser Asp Glu
Asp Asp 1 5 10 15 Glu Asp Phe Glu Met Cys Asp His Asp Tyr Asp Gly
Leu Leu Pro Lys 20 25 30 Ser Gly Lys Arg His Leu Gly Lys Thr Arg
Trp Thr Arg Glu Glu Asp 35 40 45 Glu Lys Leu Lys Lys Leu Val Glu
Gln Asn Gly Thr Asp Asp Trp Lys 50 55 60 Val Ile Ala Asn Tyr Leu
Pro Asn Arg Thr Asp Val Gln Cys Gln His 65 70 75 80 Arg Trp Gln Lys
Val Leu Asn Pro Glu Leu Ile Lys Gly Pro Trp Thr 85 90 95 Lys Glu
Glu Asp Gln Arg Val Ile Glu Leu Val Gln Lys Tyr Gly Pro 100 105 110
Lys Arg Trp Ser Val Ile Ala Lys His Leu Lys Gly Arg Ile Gly Lys 115
120 125 Gln Cys Arg Glu Arg Trp His Asn His Leu Asn Pro Glu Val Lys
Lys 130 135 140 Thr Ser Trp Thr Glu Glu Glu Asp Arg Ile Ile Tyr Gln
Ala His Lys 145 150 155 160 Arg Leu Gly Asn Arg Trp Ala Glu Ile Ala
Lys Leu Leu Pro Gly Arg 165 170 175 Thr Asp Asn Ala Ile Lys Asn His
Trp Asn Ser Thr Met Arg Arg Lys 180 185 190 Val Glu Gln Glu Gly Tyr
Leu Gln
Glu Ser Ser Lys Ala Ser Gln Pro 195 200 205 Ala Val Ala Thr Ser Phe
Gln Lys Asn Ser His Leu Met Gly Phe Ala 210 215 220 Gln Ala Pro Pro
Thr Ala Gln Leu Pro Ala Thr Gly Gln Pro Thr Val 225 230 235 240 Asn
Asn Asp Tyr Ser Tyr Tyr His Ile Ser Glu Ala Gln Asn Val Ser 245 250
255 Ser His Val Pro Tyr Pro Val Ala Leu His Val Asn Ile Val Asn Val
260 265 270 Pro Gln Pro Ala Ala Ala Ala Ile Gln Arg His Tyr Asn Asp
Glu Asp 275 280 285 Pro Glu Lys Glu Lys Arg Ile Lys Glu Leu Glu Leu
Leu Leu Met Ser 290 295 300 Thr Glu Asn Glu Leu Lys Gly Gln Gln Val
Leu Pro Thr Gln Asn His 305 310 315 320 Thr Cys Ser Tyr Pro Gly Trp
His Ser Thr Thr Ile Ala Asp His Thr 325 330 335 Arg Pro His Gly Asp
Ser Ala Pro Val Ser Cys Leu Gly Glu His His 340 345 350 Ser Thr Pro
Ser Leu Pro Ala Asp Pro Gly Ser Leu Pro Glu Glu Ser 355 360 365 Ala
Ser Pro Ala Arg Cys Met Ile Val His Gln Gly Thr Ile Leu Asp 370 375
380 Asn Val Lys Asn Leu Leu Glu Phe Ala Glu Thr Leu Gln Phe Ile Asp
385 390 395 400 Ser Asp Ser Ser Ser Trp Cys Asp Leu Ser Ser Phe Glu
Phe Phe Glu 405 410 415 Glu Ala Asp Phe Ser Pro Ser Gln His His Thr
Gly Lys Ala Leu Gln 420 425 430 Leu Gln Gln Arg Glu Gly Asn Gly Thr
Lys Pro Ala Gly Glu Pro Ser 435 440 445 Pro Arg Val Asn Lys Arg Met
Leu Ser Glu Ser Ser Leu Asp Pro Pro 450 455 460 Lys Val Leu Pro Pro
Ala Arg His Ser Thr Ile Pro Leu Val Ile Leu 465 470 475 480 Arg Lys
Lys Arg Gly Gln Ala Ser Pro Leu Ala Thr Gly Asp Cys Ser 485 490 495
Ser Phe Ile Phe Ala Asp Val Ser Ser Ser Thr Pro Lys Arg Ser Pro 500
505 510 Val Lys Ser Leu Pro Phe Ser Pro Ser Gln Phe Leu Asn Thr Ser
Ser 515 520 525 Asn His Glu Asn Ser Asp Leu Glu Met Pro Ser Leu Thr
Ser Thr Pro 530 535 540 Leu Ile Gly His Lys Leu Thr Val Thr Thr Pro
Phe His Arg Asp Gln 545 550 555 560 Thr Val Lys Thr Gln Lys Glu Asn
Thr Val Phe Arg Thr Pro Ala Ile 565 570 575 Lys Arg Ser Ile Leu Glu
Ser Ser Pro Arg Thr Pro Thr Pro Phe Lys 580 585 590 His Ala Leu Ala
Ala Gln Glu Ile Lys Tyr Gly Pro Leu Lys Met Leu 595 600 605 Pro Gln
Thr Pro Ser His Leu Val Glu Asp Leu Gln Asp Val Ile Lys 610 615 620
Gln Glu Ser Asp Glu Ser Gly Ile Val Ala Glu Phe Gln Glu Asn Gly 625
630 635 640 Pro Pro Leu Leu Lys Lys Ile Lys Gln Glu Val Glu Ser Pro
Thr Asp 645 650 655 Lys Ser Gly Asn Phe Phe Cys Ser His His Trp Glu
Gly Asp Ser Leu 660 665 670 Asn Thr Gln Leu Phe Thr Gln Thr Ser Pro
Val Ala Asp Ala Pro Asn 675 680 685 Ile Leu Thr Ser Ser Val Leu Met
Ala Pro Ala Ser Glu Asp Glu Asp 690 695 700 Asn Val Leu Lys Ala Phe
Thr Val Pro Lys Asn Arg Ser Leu Ala Ser 705 710 715 720 Pro Leu Gln
Pro Cys Ser Ser Thr Trp Glu Pro Ala Ser Cys Gly Lys 725 730 735 Met
Glu Glu Gln Met Thr Ser Ser Ser Gln Ala Arg Lys Tyr Val Asn 740 745
750 Ala Phe Ser Ala Arg Thr Leu Val Met 755 760 14640PRTHomo
sapiens 14Met Ala Arg Arg Pro Arg His Ser Ile Tyr Ser Ser Asp Glu
Asp Asp 1 5 10 15 Glu Asp Phe Glu Met Cys Asp His Asp Tyr Asp Gly
Leu Leu Pro Lys 20 25 30 Ser Gly Lys Arg His Leu Gly Lys Thr Arg
Trp Thr Arg Glu Glu Asp 35 40 45 Glu Lys Leu Lys Lys Leu Val Glu
Gln Asn Gly Thr Asp Asp Trp Lys 50 55 60 Val Ile Ala Asn Tyr Leu
Pro Asn Arg Thr Asp Val Gln Cys Gln His 65 70 75 80 Arg Trp Gln Lys
Val Leu Asn Pro Glu Leu Ile Lys Gly Pro Trp Thr 85 90 95 Lys Glu
Glu Asp Gln Arg Val Ile Glu Leu Val Gln Lys Tyr Gly Pro 100 105 110
Lys Arg Trp Ser Val Ile Ala Lys His Leu Lys Gly Arg Ile Gly Lys 115
120 125 Gln Cys Arg Glu Arg Trp His Asn His Leu Asn Pro Glu Val Lys
Lys 130 135 140 Thr Ser Trp Thr Glu Glu Glu Asp Arg Ile Ile Tyr Gln
Ala His Lys 145 150 155 160 Arg Leu Gly Asn Arg Trp Ala Glu Ile Ala
Lys Leu Leu Pro Gly Arg 165 170 175 Thr Asp Asn Ala Ile Lys Asn His
Trp Asn Ser Thr Met Arg Arg Lys 180 185 190 Val Glu Gln Glu Gly Tyr
Leu Gln Glu Ser Ser Lys Ala Ser Gln Pro 195 200 205 Ala Val Ala Thr
Ser Phe Gln Lys Asn Ser His Leu Met Gly Phe Ala 210 215 220 Gln Ala
Pro Pro Thr Ala Gln Leu Pro Ala Thr Gly Gln Pro Thr Val 225 230 235
240 Asn Asn Asp Tyr Ser Tyr Tyr His Ile Ser Glu Ala Gln Asn Val Ser
245 250 255 Ser His Val Pro Tyr Pro Val Ala Leu His Val Asn Ile Val
Asn Val 260 265 270 Pro Gln Pro Ala Ala Ala Ala Ile Gln Arg His Tyr
Asn Asp Glu Asp 275 280 285 Pro Glu Lys Glu Lys Arg Ile Lys Glu Leu
Glu Leu Leu Leu Met Ser 290 295 300 Thr Glu Asn Glu Leu Lys Gly Gln
Gln Val Leu Pro Thr Gln Asn His 305 310 315 320 Thr Cys Ser Tyr Pro
Gly Trp His Ser Thr Thr Ile Ala Asp His Thr 325 330 335 Arg Pro His
Gly Asp Ser Ala Pro Val Ser Cys Leu Gly Glu His His 340 345 350 Ser
Thr Pro Ser Leu Pro Ala Asp Pro Gly Ser Leu Pro Glu Glu Ser 355 360
365 Ala Ser Pro Ala Arg Cys Met Ile Val His Gln Gly Thr Ile Leu Asp
370 375 380 Asn Val Lys Asn Leu Leu Glu Phe Ala Glu Thr Leu Gln Phe
Ile Asp 385 390 395 400 Ser Phe Leu Asn Thr Ser Ser Asn His Glu Asn
Ser Asp Leu Glu Met 405 410 415 Pro Ser Leu Thr Ser Thr Pro Leu Ile
Gly His Lys Leu Thr Val Thr 420 425 430 Thr Pro Phe His Arg Asp Gln
Thr Val Lys Thr Gln Lys Glu Asn Thr 435 440 445 Val Phe Arg Thr Pro
Ala Ile Lys Arg Ser Ile Leu Glu Ser Ser Pro 450 455 460 Arg Thr Pro
Thr Pro Phe Lys His Ala Leu Ala Ala Gln Glu Ile Lys 465 470 475 480
Tyr Gly Pro Leu Lys Met Leu Pro Gln Thr Pro Ser His Leu Val Glu 485
490 495 Asp Leu Gln Asp Val Ile Lys Gln Glu Ser Asp Glu Ser Gly Ile
Val 500 505 510 Ala Glu Phe Gln Glu Asn Gly Pro Pro Leu Leu Lys Lys
Ile Lys Gln 515 520 525 Glu Val Glu Ser Pro Thr Asp Lys Ser Gly Asn
Phe Phe Cys Ser His 530 535 540 His Trp Glu Gly Asp Ser Leu Asn Thr
Gln Leu Phe Thr Gln Thr Ser 545 550 555 560 Pro Val Ala Asp Ala Pro
Asn Ile Leu Thr Ser Ser Val Leu Met Ala 565 570 575 Pro Ala Ser Glu
Asp Glu Asp Asn Val Leu Lys Ala Phe Thr Val Pro 580 585 590 Lys Asn
Arg Ser Leu Ala Ser Pro Leu Gln Pro Cys Ser Ser Thr Trp 595 600 605
Glu Pro Ala Ser Cys Gly Lys Met Glu Glu Gln Met Thr Ser Ser Ser 610
615 620 Gln Ala Arg Lys Tyr Val Asn Ala Phe Ser Ala Arg Thr Leu Val
Met 625 630 635 640 15637PRTHomo sapiens 15Met Ala Arg Arg Pro Arg
His Ser Ile Tyr Ser Ser Asp Glu Asp Asp 1 5 10 15 Glu Asp Phe Glu
Met Cys Asp His Asp Tyr Asp Gly Leu Leu Pro Lys 20 25 30 Ser Gly
Lys Arg His Leu Gly Lys Thr Arg Trp Thr Arg Glu Glu Asp 35 40 45
Glu Lys Leu Lys Lys Leu Val Glu Gln Asn Gly Thr Asp Asp Trp Lys 50
55 60 Val Ile Ala Asn Tyr Leu Pro Asn Arg Thr Asp Val Gln Cys Gln
His 65 70 75 80 Arg Trp Gln Lys Val Leu Asn Pro Glu Leu Ile Lys Gly
Pro Trp Thr 85 90 95 Lys Glu Glu Asp Gln Arg Val Ile Glu Leu Val
Gln Lys Tyr Gly Pro 100 105 110 Lys Arg Trp Ser Val Ile Ala Lys His
Leu Lys Gly Arg Ile Gly Lys 115 120 125 Gln Cys Arg Glu Arg Trp His
Asn His Leu Asn Pro Glu Val Lys Lys 130 135 140 Thr Ser Trp Thr Glu
Glu Glu Asp Arg Ile Ile Tyr Gln Ala His Lys 145 150 155 160 Arg Leu
Gly Asn Arg Trp Ala Glu Ile Ala Lys Leu Leu Pro Gly Arg 165 170 175
Thr Asp Asn Ala Ile Lys Asn His Trp Asn Ser Thr Met Arg Arg Lys 180
185 190 Val Glu Gln Glu Gly Tyr Leu Gln Glu Ser Ser Lys Ala Ser Gln
Pro 195 200 205 Ala Val Ala Thr Ser Phe Gln Lys Asn Ser His Leu Met
Gly Phe Ala 210 215 220 Gln Ala Pro Pro Thr Ala Gln Leu Pro Ala Thr
Gly Gln Pro Thr Val 225 230 235 240 Asn Asn Asp Tyr Ser Tyr Tyr His
Ile Ser Glu Ala Gln Asn Val Ser 245 250 255 Ser His Val Pro Tyr Pro
Val Ala Leu His Val Asn Ile Val Asn Val 260 265 270 Pro Gln Pro Ala
Ala Ala Ala Ile Gln Arg His Tyr Asn Asp Glu Asp 275 280 285 Pro Glu
Lys Glu Lys Arg Ile Lys Glu Leu Glu Leu Leu Leu Met Ser 290 295 300
Thr Glu Asn Glu Leu Lys Gly Gln Gln Thr Gln Asn His Thr Cys Ser 305
310 315 320 Tyr Pro Gly Trp His Ser Thr Thr Ile Ala Asp His Thr Arg
Pro His 325 330 335 Gly Asp Ser Ala Pro Val Ser Cys Leu Gly Glu His
His Ser Thr Pro 340 345 350 Ser Leu Pro Ala Asp Pro Gly Ser Leu Pro
Glu Glu Ser Ala Ser Pro 355 360 365 Ala Arg Cys Met Ile Val His Gln
Gly Thr Ile Leu Asp Asn Val Lys 370 375 380 Asn Leu Leu Glu Phe Ala
Glu Thr Leu Gln Phe Ile Asp Ser Phe Leu 385 390 395 400 Asn Thr Ser
Ser Asn His Glu Asn Ser Asp Leu Glu Met Pro Ser Leu 405 410 415 Thr
Ser Thr Pro Leu Ile Gly His Lys Leu Thr Val Thr Thr Pro Phe 420 425
430 His Arg Asp Gln Thr Val Lys Thr Gln Lys Glu Asn Thr Val Phe Arg
435 440 445 Thr Pro Ala Ile Lys Arg Ser Ile Leu Glu Ser Ser Pro Arg
Thr Pro 450 455 460 Thr Pro Phe Lys His Ala Leu Ala Ala Gln Glu Ile
Lys Tyr Gly Pro 465 470 475 480 Leu Lys Met Leu Pro Gln Thr Pro Ser
His Leu Val Glu Asp Leu Gln 485 490 495 Asp Val Ile Lys Gln Glu Ser
Asp Glu Ser Gly Ile Val Ala Glu Phe 500 505 510 Gln Glu Asn Gly Pro
Pro Leu Leu Lys Lys Ile Lys Gln Glu Val Glu 515 520 525 Ser Pro Thr
Asp Lys Ser Gly Asn Phe Phe Cys Ser His His Trp Glu 530 535 540 Gly
Asp Ser Leu Asn Thr Gln Leu Phe Thr Gln Thr Ser Pro Val Ala 545 550
555 560 Asp Ala Pro Asn Ile Leu Thr Ser Ser Val Leu Met Ala Pro Ala
Ser 565 570 575 Glu Asp Glu Asp Asn Val Leu Lys Ala Phe Thr Val Pro
Lys Asn Arg 580 585 590 Ser Leu Ala Ser Pro Leu Gln Pro Cys Ser Ser
Thr Trp Glu Pro Ala 595 600 605 Ser Cys Gly Lys Met Glu Glu Gln Met
Thr Ser Ser Ser Gln Ala Arg 610 615 620 Lys Tyr Val Asn Ala Phe Ser
Ala Arg Thr Leu Val Met 625 630 635 16758PRTHomo sapiens 16Met Ala
Arg Arg Pro Arg His Ser Ile Tyr Ser Ser Asp Glu Asp Asp 1 5 10 15
Glu Asp Phe Glu Met Cys Asp His Asp Tyr Asp Gly Leu Leu Pro Lys 20
25 30 Ser Gly Lys Arg His Leu Gly Lys Thr Arg Trp Thr Arg Glu Glu
Asp 35 40 45 Glu Lys Leu Lys Lys Leu Val Glu Gln Asn Gly Thr Asp
Asp Trp Lys 50 55 60 Val Ile Ala Asn Tyr Leu Pro Asn Arg Thr Asp
Val Gln Cys Gln His 65 70 75 80 Arg Trp Gln Lys Val Leu Asn Pro Glu
Leu Ile Lys Gly Pro Trp Thr 85 90 95 Lys Glu Glu Asp Gln Arg Val
Ile Glu Leu Val Gln Lys Tyr Gly Pro 100 105 110 Lys Arg Trp Ser Val
Ile Ala Lys His Leu Lys Gly Arg Ile Gly Lys 115 120 125 Gln Cys Arg
Glu Arg Trp His Asn His Leu Asn Pro Glu Val Lys Lys 130 135 140 Thr
Ser Trp Thr Glu Glu Glu Asp Arg Ile Ile Tyr Gln Ala His Lys 145 150
155 160 Arg Leu Gly Asn Arg Trp Ala Glu Ile Ala Lys Leu Leu Pro Gly
Arg 165 170 175 Thr Asp Asn Ala Ile Lys Asn His Trp Asn Ser Thr Met
Arg Arg Lys 180 185 190 Val Glu Gln Glu Gly Tyr Leu Gln Glu Ser Ser
Lys Ala Ser Gln Pro 195 200 205 Ala Val Ala Thr Ser Phe Gln Lys Asn
Ser His Leu Met Gly Phe Ala 210 215 220 Gln Ala Pro Pro Thr Ala Gln
Leu Pro Ala Thr Gly Gln Pro Thr Val 225 230 235 240 Asn Asn Asp Tyr
Ser Tyr Tyr His Ile Ser Glu Ala Gln Asn Val Ser 245 250 255 Ser His
Val Pro Tyr Pro Val Ala Leu His Val Asn Ile Val Asn Val 260 265 270
Pro Gln Pro Ala Ala Ala Ala Ile Gln Arg His Tyr Asn Asp Glu Asp 275
280 285 Pro Glu Lys Glu Lys Arg Ile Lys Glu Leu Glu Leu Leu Leu Met
Ser 290 295 300 Thr Glu Asn Glu Leu Lys Gly Gln Gln Thr Gln Asn His
Thr Cys Ser 305 310 315 320 Tyr Pro Gly Trp His Ser Thr Thr Ile Ala
Asp His Thr Arg Pro His 325 330 335 Gly Asp Ser Ala Pro Val Ser Cys
Leu Gly Glu His His Ser Thr Pro 340 345 350 Ser Leu Pro Ala Asp Pro
Gly Ser Leu Pro Glu Glu Ser Ala Ser Pro 355 360 365 Ala Arg Cys Met
Ile Val His Gln Gly Thr Ile Leu Asp Asn Val Lys 370 375 380 Asn Leu
Leu Glu Phe Ala Glu Thr Leu Gln Phe Ile Asp Ser Asp Ser 385 390 395
400 Ser Ser Trp Cys Asp Leu Ser Ser Phe Glu Phe Phe Glu Glu Ala Asp
405 410 415 Phe Ser Pro Ser Gln His His Thr Gly Lys Ala Leu Gln Leu
Gln Gln 420 425 430 Arg Glu Gly Asn Gly Thr Lys Pro Ala Gly Glu Pro
Ser Pro Arg Val 435 440 445 Asn Lys Arg Met Leu Ser Glu Ser Ser Leu
Asp Pro Pro Lys Val Leu 450
455 460 Pro Pro Ala Arg His Ser Thr Ile Pro Leu Val Ile Leu Arg Lys
Lys 465 470 475 480 Arg Gly Gln Ala Ser Pro Leu Ala Thr Gly Asp Cys
Ser Ser Phe Ile 485 490 495 Phe Ala Asp Val Ser Ser Ser Thr Pro Lys
Arg Ser Pro Val Lys Ser 500 505 510 Leu Pro Phe Ser Pro Ser Gln Phe
Leu Asn Thr Ser Ser Asn His Glu 515 520 525 Asn Ser Asp Leu Glu Met
Pro Ser Leu Thr Ser Thr Pro Leu Ile Gly 530 535 540 His Lys Leu Thr
Val Thr Thr Pro Phe His Arg Asp Gln Thr Val Lys 545 550 555 560 Thr
Gln Lys Glu Asn Thr Val Phe Arg Thr Pro Ala Ile Lys Arg Ser 565 570
575 Ile Leu Glu Ser Ser Pro Arg Thr Pro Thr Pro Phe Lys His Ala Leu
580 585 590 Ala Ala Gln Glu Ile Lys Tyr Gly Pro Leu Lys Met Leu Pro
Gln Thr 595 600 605 Pro Ser His Leu Val Glu Asp Leu Gln Asp Val Ile
Lys Gln Glu Ser 610 615 620 Asp Glu Ser Gly Ile Val Ala Glu Phe Gln
Glu Asn Gly Pro Pro Leu 625 630 635 640 Leu Lys Lys Ile Lys Gln Glu
Val Glu Ser Pro Thr Asp Lys Ser Gly 645 650 655 Asn Phe Phe Cys Ser
His His Trp Glu Gly Asp Ser Leu Asn Thr Gln 660 665 670 Leu Phe Thr
Gln Thr Ser Pro Val Ala Asp Ala Pro Asn Ile Leu Thr 675 680 685 Ser
Ser Val Leu Met Ala Pro Ala Ser Glu Asp Glu Asp Asn Val Leu 690 695
700 Lys Ala Phe Thr Val Pro Lys Asn Arg Ser Leu Ala Ser Pro Leu Gln
705 710 715 720 Pro Cys Ser Ser Thr Trp Glu Pro Ala Ser Cys Gly Lys
Met Glu Glu 725 730 735 Gln Met Thr Ser Ser Ser Gln Ala Arg Lys Tyr
Val Asn Ala Phe Ser 740 745 750 Ala Arg Thr Leu Val Met 755
17555PRTHomo sapiens 17Met Ala Arg Arg Pro Arg His Ser Ile Tyr Ser
Ser Asp Glu Asp Asp 1 5 10 15 Glu Asp Phe Glu Met Cys Asp His Asp
Tyr Asp Gly Leu Leu Pro Lys 20 25 30 Ser Gly Lys Arg His Leu Gly
Lys Thr Arg Trp Thr Arg Glu Glu Asp 35 40 45 Glu Lys Leu Lys Lys
Leu Val Glu Gln Asn Gly Thr Asp Asp Trp Lys 50 55 60 Val Ile Ala
Asn Tyr Leu Pro Asn Arg Thr Asp Val Gln Cys Gln His 65 70 75 80 Arg
Trp Gln Lys Val Leu Asn Pro Glu Leu Ile Lys Gly Pro Trp Thr 85 90
95 Lys Glu Glu Asp Gln Arg Val Ile Glu Leu Val Gln Lys Tyr Gly Pro
100 105 110 Lys Arg Trp Ser Val Ile Ala Lys His Leu Lys Gly Arg Ile
Gly Lys 115 120 125 Gln Cys Arg Glu Arg Trp His Asn His Leu Asn Pro
Glu Val Lys Lys 130 135 140 Thr Ser Trp Thr Glu Glu Glu Asp Arg Ile
Ile Tyr Gln Ala His Lys 145 150 155 160 Arg Leu Gly Asn Arg Trp Ala
Glu Ile Ala Lys Leu Leu Pro Gly Arg 165 170 175 Thr Asp Asn Ala Ile
Lys Asn His Trp Asn Ser Thr Met Arg Arg Lys 180 185 190 Val Glu Gln
Glu Gly Tyr Leu Gln Glu Ser Ser Lys Ala Ser Gln Pro 195 200 205 Ala
Val Ala Thr Ser Phe Gln Lys Asn Ser His Leu Met Gly Phe Ala 210 215
220 Gln Ala Pro Pro Thr Ala Gln Leu Pro Ala Thr Gly Gln Pro Thr Val
225 230 235 240 Asn Asn Asp Tyr Ser Tyr Tyr His Ile Ser Glu Ala Gln
Asn Val Ser 245 250 255 Ser His Val Pro Tyr Pro Val Ala Leu His Val
Asn Ile Val Asn Val 260 265 270 Pro Gln Pro Ala Ala Ala Ala Ile Gln
Arg His Tyr Asn Asp Glu Asp 275 280 285 Pro Glu Lys Glu Lys Arg Ile
Lys Glu Leu Glu Leu Leu Leu Met Ser 290 295 300 Thr Glu Asn Glu Leu
Lys Gly Gln Gln Val Leu Pro Phe Leu Asn Thr 305 310 315 320 Ser Ser
Asn His Glu Asn Ser Asp Leu Glu Met Pro Ser Leu Thr Ser 325 330 335
Thr Pro Leu Ile Gly His Lys Leu Thr Val Thr Thr Pro Phe His Arg 340
345 350 Asp Gln Thr Val Lys Thr Gln Lys Glu Asn Thr Val Phe Arg Thr
Pro 355 360 365 Ala Ile Lys Arg Ser Ile Leu Glu Ser Ser Pro Arg Thr
Pro Thr Pro 370 375 380 Phe Lys His Ala Leu Ala Ala Gln Glu Ile Lys
Tyr Gly Pro Leu Lys 385 390 395 400 Met Leu Pro Gln Thr Pro Ser His
Leu Val Glu Asp Leu Gln Asp Val 405 410 415 Ile Lys Gln Glu Ser Asp
Glu Ser Gly Ile Val Ala Glu Phe Gln Glu 420 425 430 Asn Gly Pro Pro
Leu Leu Lys Lys Ile Lys Gln Glu Val Glu Ser Pro 435 440 445 Thr Asp
Lys Ser Gly Asn Phe Phe Cys Ser His His Trp Glu Gly Asp 450 455 460
Ser Leu Asn Thr Gln Leu Phe Thr Gln Thr Ser Pro Val Ala Asp Ala 465
470 475 480 Pro Asn Ile Leu Thr Ser Ser Val Leu Met Ala Pro Ala Ser
Glu Asp 485 490 495 Glu Asp Asn Val Leu Lys Ala Phe Thr Val Pro Lys
Asn Arg Ser Leu 500 505 510 Ala Ser Pro Leu Gln Pro Cys Ser Ser Thr
Trp Glu Pro Ala Ser Cys 515 520 525 Gly Lys Met Glu Glu Gln Met Thr
Ser Ser Ser Gln Ala Arg Lys Tyr 530 535 540 Val Asn Ala Phe Ser Ala
Arg Thr Leu Val Met 545 550 555 18745PRTHomo sapiens 18Met Ala Arg
Arg Pro Arg His Ser Ile Tyr Ser Ser Asp Glu Asp Asp 1 5 10 15 Glu
Asp Phe Glu Met Cys Asp His Asp Tyr Asp Gly Leu Leu Pro Lys 20 25
30 Ser Gly Lys Arg His Leu Gly Lys Thr Arg Trp Thr Arg Glu Glu Asp
35 40 45 Glu Lys Leu Lys Lys Leu Val Glu Gln Asn Gly Thr Asp Asp
Trp Lys 50 55 60 Val Ile Ala Asn Tyr Leu Pro Asn Arg Thr Asp Val
Gln Cys Gln His 65 70 75 80 Arg Trp Gln Lys Val Leu Asn Pro Glu Leu
Ile Lys Gly Pro Trp Thr 85 90 95 Lys Glu Glu Asp Gln Arg Val Ile
Glu Leu Val Gln Lys Tyr Gly Pro 100 105 110 Lys Arg Trp Ser Val Ile
Ala Lys His Leu Lys Gly Arg Ile Gly Lys 115 120 125 Gln Cys Arg Glu
Arg Trp His Asn His Leu Asn Pro Glu Val Lys Lys 130 135 140 Thr Ser
Trp Thr Glu Glu Glu Asp Arg Ile Ile Tyr Gln Ala His Lys 145 150 155
160 Arg Leu Gly Asn Arg Trp Ala Glu Ile Ala Lys Leu Leu Pro Gly Arg
165 170 175 Thr Asp Asn Ala Ile Lys Asn His Trp Asn Ser Thr Met Arg
Arg Lys 180 185 190 Val Glu Gln Glu Gly Tyr Leu Gln Glu Ser Ser Lys
Ala Ser Gln Pro 195 200 205 Ala Val Ala Thr Ser Phe Gln Lys Asn Ser
His Leu Met Gly Phe Ala 210 215 220 Gln Ala Pro Pro Thr Ala Gln Leu
Pro Ala Thr Gly Gln Pro Thr Val 225 230 235 240 Asn Asn Asp Tyr Ser
Tyr Tyr His Ile Ser Glu Ala Gln Asn Val Ser 245 250 255 Ser His Val
Pro Tyr Pro Val Ala Leu His Val Asn Ile Val Asn Val 260 265 270 Pro
Gln Pro Ala Ala Ala Ala Ile Gln Arg His Tyr Asn Asp Glu Asp 275 280
285 Pro Glu Lys Glu Lys Arg Ile Lys Glu Leu Glu Leu Leu Leu Met Ser
290 295 300 Thr Glu Asn Glu Leu Lys Gly Gln Gln Val Leu Pro Thr Gln
Asn His 305 310 315 320 Thr Cys Ser Tyr Pro Gly Trp His Ser Thr Thr
Ile Ala Asp His Thr 325 330 335 Arg Pro His Gly Asp Ser Ala Pro Val
Ser Cys Leu Gly Glu His His 340 345 350 Ser Thr Pro Ser Leu Pro Ala
Asp Pro Gly Ser Leu Pro Glu Glu Ser 355 360 365 Ala Ser Pro Ala Arg
Cys Met Ile Val His Gln Gly Thr Ile Leu Asp 370 375 380 Asn Asp Ser
Ser Ser Trp Cys Asp Leu Ser Ser Phe Glu Phe Phe Glu 385 390 395 400
Glu Ala Asp Phe Ser Pro Ser Gln His His Thr Gly Lys Ala Leu Gln 405
410 415 Leu Gln Gln Arg Glu Gly Asn Gly Thr Lys Pro Ala Gly Glu Pro
Ser 420 425 430 Pro Arg Val Asn Lys Arg Met Leu Ser Glu Ser Ser Leu
Asp Pro Pro 435 440 445 Lys Val Leu Pro Pro Ala Arg His Ser Thr Ile
Pro Leu Val Ile Leu 450 455 460 Arg Lys Lys Arg Gly Gln Ala Ser Pro
Leu Ala Thr Gly Asp Cys Ser 465 470 475 480 Ser Phe Ile Phe Ala Asp
Val Ser Ser Ser Thr Pro Lys Arg Ser Pro 485 490 495 Val Lys Ser Leu
Pro Phe Ser Pro Ser Gln Phe Leu Asn Thr Ser Ser 500 505 510 Asn His
Glu Asn Ser Asp Leu Glu Met Pro Ser Leu Thr Ser Thr Pro 515 520 525
Leu Ile Gly His Lys Leu Thr Val Thr Thr Pro Phe His Arg Asp Gln 530
535 540 Thr Val Lys Thr Gln Lys Glu Asn Thr Val Phe Arg Thr Pro Ala
Ile 545 550 555 560 Lys Arg Ser Ile Leu Glu Ser Ser Pro Arg Thr Pro
Thr Pro Phe Lys 565 570 575 His Ala Leu Ala Ala Gln Glu Ile Lys Tyr
Gly Pro Leu Lys Met Leu 580 585 590 Pro Gln Thr Pro Ser His Leu Val
Glu Asp Leu Gln Asp Val Ile Lys 595 600 605 Gln Glu Ser Asp Glu Ser
Gly Ile Val Ala Glu Phe Gln Glu Asn Gly 610 615 620 Pro Pro Leu Leu
Lys Lys Ile Lys Gln Glu Val Glu Ser Pro Thr Asp 625 630 635 640 Lys
Ser Gly Asn Phe Phe Cys Ser His His Trp Glu Gly Asp Ser Leu 645 650
655 Asn Thr Gln Leu Phe Thr Gln Thr Ser Pro Val Ala Asp Ala Pro Asn
660 665 670 Ile Leu Thr Ser Ser Val Leu Met Ala Pro Ala Ser Glu Asp
Glu Asp 675 680 685 Asn Val Leu Lys Ala Phe Thr Val Pro Lys Asn Arg
Ser Leu Ala Ser 690 695 700 Pro Leu Gln Pro Cys Ser Ser Thr Trp Glu
Pro Ala Ser Cys Gly Lys 705 710 715 720 Met Glu Glu Gln Met Thr Ser
Ser Ser Gln Ala Arg Lys Tyr Val Asn 725 730 735 Ala Phe Ser Ala Arg
Thr Leu Val Met 740 745 19603PRTHomo sapiens 19Met Ala Arg Arg Pro
Arg His Ser Ile Tyr Ser Ser Asp Glu Asp Asp 1 5 10 15 Glu Asp Phe
Glu Met Cys Asp His Asp Tyr Asp Gly Leu Leu Pro Lys 20 25 30 Ser
Gly Lys Arg His Leu Gly Lys Thr Arg Trp Thr Arg Glu Glu Asp 35 40
45 Glu Lys Leu Lys Lys Leu Val Glu Gln Asn Gly Thr Asp Asp Trp Lys
50 55 60 Val Ile Ala Asn Tyr Leu Pro Asn Arg Thr Asp Val Gln Cys
Gln His 65 70 75 80 Arg Trp Gln Lys Val Leu Asn Pro Glu Leu Ile Lys
Gly Pro Trp Thr 85 90 95 Lys Glu Glu Asp Gln Arg Val Ile Glu Leu
Val Gln Lys Tyr Gly Pro 100 105 110 Lys Arg Trp Ser Val Ile Ala Lys
His Leu Lys Gly Arg Ile Gly Lys 115 120 125 Gln Cys Arg Glu Arg Trp
His Asn His Leu Asn Pro Glu Val Lys Lys 130 135 140 Thr Ser Trp Thr
Glu Glu Glu Asp Arg Ile Ile Tyr Gln Ala His Lys 145 150 155 160 Arg
Leu Gly Asn Arg Trp Ala Glu Ile Ala Lys Leu Leu Pro Gly Arg 165 170
175 Thr Asp Asn Ala Ile Lys Asn His Trp Asn Ser Thr Met Arg Arg Lys
180 185 190 Val Glu Gln Glu Gly Tyr Leu Gln Glu Ser Ser Lys Ala Ser
Gln Pro 195 200 205 Ala Val Ala Thr Ser Phe Gln Lys Asn Ser His Leu
Met Gly Phe Ala 210 215 220 Gln Ala Pro Pro Thr Ala Gln Leu Pro Ala
Thr Gly Gln Pro Thr Val 225 230 235 240 Asn Asn Asp Tyr Ser Tyr Tyr
His Ile Ser Glu Ala Gln Asn Val Ser 245 250 255 Ser His Val Pro Tyr
Pro Val Ala Leu His Val Asn Ile Val Asn Val 260 265 270 Pro Gln Pro
Ala Ala Ala Ala Ile Gln Arg His Tyr Asn Asp Glu Asp 275 280 285 Pro
Glu Lys Glu Lys Arg Ile Lys Glu Leu Glu Leu Leu Leu Met Ser 290 295
300 Thr Glu Asn Glu Leu Lys Gly Gln Gln Val Leu Pro Thr Gln Asn His
305 310 315 320 Thr Cys Ser Tyr Pro Gly Trp His Ser Thr Thr Ile Ala
Asp His Thr 325 330 335 Arg Pro His Gly Asp Ser Ala Pro Val Ser Cys
Leu Gly Glu His His 340 345 350 Ser Thr Pro Ser Leu Pro Ala Asp Pro
Gly Ser Leu Pro Glu Glu Ser 355 360 365 Ala Ser Pro Ala Arg Cys Met
Ile Val His Gln Gly Thr Ile Leu Asp 370 375 380 Asn Val Lys Asn Leu
Leu Glu Phe Ala Glu Thr Leu Gln Phe Ile Asp 385 390 395 400 Ser Phe
Leu Asn Thr Ser Ser Asn His Glu Asn Ser Asp Leu Glu Met 405 410 415
Pro Ser Leu Thr Ser Thr Pro Leu Ile Gly His Lys Leu Thr Val Thr 420
425 430 Thr Pro Phe His Arg Asp Gln Thr Val Lys Thr Gln Lys Glu Asn
Thr 435 440 445 Val Phe Arg Thr Pro Ala Ile Lys Arg Ser Ile Leu Glu
Ser Ser Pro 450 455 460 Arg Thr Pro Thr Pro Phe Lys His Ala Leu Ala
Ala Gln Glu Ile Lys 465 470 475 480 Tyr Gly Pro Leu Lys Met Leu Pro
Gln Thr Pro Ser His Leu Val Glu 485 490 495 Asp Leu Gln Asp Val Ile
Lys Gln Glu Ser Asp Glu Ser Gly Ile Val 500 505 510 Ala Glu Phe Gln
Glu Asn Gly Pro Pro Leu Leu Lys Lys Ile Lys Gln 515 520 525 Glu Asn
Ile Leu Thr Ser Ser Val Leu Met Ala Pro Ala Ser Glu Asp 530 535 540
Glu Asp Asn Val Leu Lys Ala Phe Thr Val Pro Lys Asn Arg Ser Leu 545
550 555 560 Ala Ser Pro Leu Gln Pro Cys Ser Ser Thr Trp Glu Pro Ala
Ser Cys 565 570 575 Gly Lys Met Glu Glu Gln Met Thr Ser Ser Ser Gln
Ala Arg Lys Tyr 580 585 590 Val Asn Ala Phe Ser Ala Arg Thr Leu Val
Met 595 600 20605PRTHomo sapiens 20Met Ala Arg Arg Pro Arg His Ser
Ile Tyr Ser Ser Asp Glu Asp Asp 1 5 10 15 Glu Asp Phe Glu Met Cys
Asp His Asp Tyr Asp Gly Leu Leu Pro Lys 20 25 30 Ser Gly Lys Arg
His Leu Gly Lys Thr Arg Trp Thr Arg Glu Glu Asp 35 40 45 Glu Lys
Leu Lys Lys Leu Val Glu Gln Asn Gly Thr Asp Asp Trp Lys 50 55 60
Val Ile Ala Asn Tyr Leu Pro Asn Arg Thr Asp Val Gln Cys Gln His 65
70 75 80 Arg Trp Gln Lys Val Leu Asn Pro Glu Leu Ile Lys Gly Pro
Trp Thr
85 90 95 Lys Glu Glu Asp Gln Arg Val Ile Glu Leu Val Gln Lys Tyr
Gly Pro 100 105 110 Lys Arg Trp Ser Val Ile Ala Lys His Leu Lys Gly
Arg Ile Gly Lys 115 120 125 Gln Cys Arg Glu Arg Trp His Asn His Leu
Asn Pro Glu Val Lys Lys 130 135 140 Thr Ser Trp Thr Glu Glu Glu Asp
Arg Ile Ile Tyr Gln Ala His Lys 145 150 155 160 Arg Leu Gly Asn Arg
Trp Ala Glu Ile Ala Lys Leu Leu Pro Gly Arg 165 170 175 Thr Asp Asn
Ala Ile Lys Asn His Trp Asn Ser Thr Met Arg Arg Lys 180 185 190 Val
Glu Gln Glu Gly Tyr Leu Gln Glu Ser Ser Lys Ala Ser Gln Pro 195 200
205 Ala Val Ala Thr Ser Phe Gln Lys Asn Ser His Leu Met Gly Phe Ala
210 215 220 Gln Ala Pro Pro Thr Ala Gln Leu Pro Ala Thr Gly Gln Pro
Thr Val 225 230 235 240 Asn Asn Asp Tyr Ser Tyr Tyr His Ile Ser Glu
Ala Gln Asn Val Ser 245 250 255 Ser His Val Pro Tyr Pro Val Ala Leu
His Val Asn Ile Val Asn Val 260 265 270 Pro Gln Pro Ala Ala Ala Ala
Ile Gln Thr Gln Asn His Thr Cys Ser 275 280 285 Tyr Pro Gly Trp His
Ser Thr Thr Ile Ala Asp His Thr Arg Pro His 290 295 300 Gly Asp Ser
Ala Pro Val Ser Cys Leu Gly Glu His His Ser Thr Pro 305 310 315 320
Ser Leu Pro Ala Asp Pro Gly Ser Leu Pro Glu Glu Ser Ala Ser Pro 325
330 335 Ala Arg Cys Met Ile Val His Gln Gly Thr Ile Leu Asp Asn Val
Lys 340 345 350 Asn Leu Leu Glu Phe Ala Glu Thr Leu Gln Phe Ile Asp
Ser Phe Leu 355 360 365 Asn Thr Ser Ser Asn His Glu Asn Ser Asp Leu
Glu Met Pro Ser Leu 370 375 380 Thr Ser Thr Pro Leu Ile Gly His Lys
Leu Thr Val Thr Thr Pro Phe 385 390 395 400 His Arg Asp Gln Thr Val
Lys Thr Gln Lys Glu Asn Thr Val Phe Arg 405 410 415 Thr Pro Ala Ile
Lys Arg Ser Ile Leu Glu Ser Ser Pro Arg Thr Pro 420 425 430 Thr Pro
Phe Lys His Ala Leu Ala Ala Gln Glu Ile Lys Tyr Gly Pro 435 440 445
Leu Lys Met Leu Pro Gln Thr Pro Ser His Leu Val Glu Asp Leu Gln 450
455 460 Asp Val Ile Lys Gln Glu Ser Asp Glu Ser Gly Ile Val Ala Glu
Phe 465 470 475 480 Gln Glu Asn Gly Pro Pro Leu Leu Lys Lys Ile Lys
Gln Glu Val Glu 485 490 495 Ser Pro Thr Asp Lys Ser Gly Asn Phe Phe
Cys Ser His His Trp Glu 500 505 510 Gly Asp Ser Leu Asn Thr Gln Leu
Phe Thr Gln Thr Ser Pro Val Ala 515 520 525 Asp Ala Pro Asn Ile Leu
Thr Ser Ser Val Leu Met Ala Pro Ala Ser 530 535 540 Glu Asp Glu Asp
Asn Val Leu Lys Ala Phe Thr Val Pro Lys Asn Arg 545 550 555 560 Ser
Leu Ala Ser Pro Leu Gln Pro Cys Ser Ser Thr Trp Glu Pro Ala 565 570
575 Ser Cys Gly Lys Met Glu Glu Gln Met Thr Ser Ser Ser Gln Ala Arg
580 585 590 Lys Tyr Val Asn Ala Phe Ser Ala Arg Thr Leu Val Met 595
600 605 21 1910DNAArtificial Sequencesynthetic 21atggctagac
gacctcgaca ctccatctac tcctccgatg aggatgatga ggacatcgag 60atgtgtgacc
acgattacga cggcctgctg ccaaaatctg gaaaacggca cctgggcaaa
120acacgctgga ctcgggagga ggacgaaaaa ctcaaaaaac tcgtcgaaca
gaatggcacc 180gatgactgga aagtgatcgc caactacctg ccaaatcgaa
ccgacgtcca gtgtcagcac 240cggtggcaga aagtcctgaa tcctgaactc
attaagggcc cttggacaaa agaggaggat 300cagcgagtca ttgaactggt
ccagaaatac ggccctaaac ggtggtctgt gattgctaaa 360cacctgaagg
gacggattgg gaaacagtgt agggaacgat ggcataacca tctgaaccct
420gaggtgaaaa aaacttcttg gactgaggag gaggatcgga tcatctacca
ggcccacaaa 480cgactgggga accgctgggc tgagattgcc aaactgctgc
ccgggagaac cgataatgct 540atcaaaaacc attggaactc cactatgagg
cggaaagtcg aacaggaggg atacctccag 600gaaccttcta aagcatccca
gacacctgtg gctacaagct ttcagaaaaa caaccatctc 660atgggctttg
gacatgctag tccccctagt cagctgtcac ccagtggaca gtctagtgtg
720aactctgaat acccctacta ccacattgct gaggctcaga acatttcctc
tcacgtgcct 780taccctgtcg ctctccacgt caacattgtg aacgtgcccc
agcctgccgc tgctgctatc 840cagagacact acaacgatga ggaccctgag
aaagagaaac gaatcaagga gctggaactg 900ctgctcatgt ctaccgagaa
tgagctgaag ggacagcagg ctctgcctac tcagaatcac 960acctgctcct
accctggctg gcattcaacc tcaatcgtcg accagacacg acctcatggg
1020gatagtgccc ctgtgtcatg cctgggcgaa catcatgcta caccttccct
gcctgccgac 1080cccggatctc tgcctgagga atctgcttct cctgcccgct
gtatgatcgt ccatcagggc 1140acaattctcg ataacgtgaa aaacctgctc
gaattcgccg aaacactcca gtttatcgat 1200tccttcctga acacctcttc
caaccacgaa tcttccggac tggatgcccc aactctccca 1260tccacaccac
tcattggcca caaactcacc ccttgtcggg atcagaccgt gaaaacccag
1320aaagaaaact ccattttccg gacacctgct atcaaacgga gcattctgga
gagtagtcct 1380agaaccccca cccccttcaa acatgctctg gccgctcagg
aaatcaaata cgggcccctg 1440aaaatgctgc ctcagacccc ttctcatgct
gtggaggacc tccaggacgt gatcaaacag 1500gaatccgacg aatctggcat
tgtcgctgag tttcaggagt ctggccctcc tctgctgaaa 1560aaaatcaaac
aggaggtgga gtctcctacc gaaaaatccg gcaacttctt ctgctccaat
1620cattgggccg agaactcact gagcacccag ctgtttagtc aggcatctcc
tgtggccgac 1680gctcccaata ttctcacttc ctccgtgctc atgacccccg
tgagtgagga tgaggataac 1740gtgctgaagg cctttactgt gcctaaaaac
cgccctctgg tgggaccact ccagccttgt 1800tccggagctt gggaacctgc
ctcttgtgga aaaaccgagg accagatgac tgctagtgga 1860cccgctagaa
aatacgtgaa cgccttctcc gctcgaactc tcgtgatggc 1910
* * * * *