U.S. patent application number 15/936601 was filed with the patent office on 2018-11-01 for ovary tumor markers and methods of use thereof.
The applicant listed for this patent is Externautics S.P.A.. Invention is credited to Susanna Campagnoli, Alberto Grandi, Renata Grifantini, Renzo Nogarotto, Matteo Parri, Andrea Pierleoni, Piero Pileri.
Application Number | 20180313841 15/936601 |
Document ID | / |
Family ID | 43302460 |
Filed Date | 2018-11-01 |
United States Patent
Application |
20180313841 |
Kind Code |
A1 |
Grifantini; Renata ; et
al. |
November 1, 2018 |
Ovary Tumor Markers And Methods Of Use Thereof
Abstract
Newly identified proteins as markers for the detection of ovary
tumors, or as therapeutic targets for treatment thereof; affinity
ligands capable of selectively interacting with the newly
identified markers, methods for tumor diagnosis and therapy using
the same.
Inventors: |
Grifantini; Renata; (Siena,
IT) ; Pileri; Piero; (Siena, IT) ; Campagnoli;
Susanna; (Siena, IT) ; Grandi; Alberto;
(Siena, IT) ; Parri; Matteo; (Siena, IT) ;
Pierleoni; Andrea; (Siena, IT) ; Nogarotto;
Renzo; (Siena, IT) |
|
Applicant: |
Name |
City |
State |
Country |
Type |
Externautics S.P.A. |
Siena |
|
IT |
|
|
Family ID: |
43302460 |
Appl. No.: |
15/936601 |
Filed: |
March 27, 2018 |
Related U.S. Patent Documents
|
|
|
|
|
|
Application
Number |
Filing Date |
Patent Number |
|
|
13503417 |
Sep 10, 2012 |
|
|
|
PCT/EP2010/066154 |
Oct 26, 2010 |
|
|
|
15936601 |
|
|
|
|
Current U.S.
Class: |
1/1 |
Current CPC
Class: |
C12N 2310/14 20130101;
C12N 15/113 20130101; C07K 16/3069 20130101; G01N 33/57449
20130101; G01N 2500/02 20130101 |
International
Class: |
G01N 33/574 20060101
G01N033/574; C07K 16/30 20060101 C07K016/30; C12N 15/113 20060101
C12N015/113 |
Foreign Application Data
Date |
Code |
Application Number |
Oct 26, 2009 |
EP |
09174062.1 |
Claims
1.-11. (canceled)
12. A method of treating a proliferative disease in a subject, the
method comprising administering to the subject an antibody or a
fragment thereof that specifically recognizes and binds to a KLRG2
protein.
13. The method of claim 12, wherein the proliferative disease is an
ovarian cancer.
14. The method of claim 12, wherein the KLRG2 protein has an amino
acid sequence that has at least 80% identity to SEQ ID NO: 85 or
SEQ ID NO: 86.
15. The method of claim 12, wherein the KLRG2 protein has an amino
acid sequence that has at least 90% identity to SEQ ID NO: 85 or
SEQ ID NO: 86.
16. The method of claim 12, wherein the KLRG2 protein has an amino
acid sequence that has at least 95% identity to SEQ ID NO: 85 or
SEQ ID NO: 86.
17. The method of claim 12, wherein the KLRG2 protein has an amino
acid sequence selected from SEQ ID NO: 85 or SEQ ID NO: 86.
18. The method of claim 12, wherein the antibody or fragment
thereof modulates KLRG2 activity.
19. The method of claim 12, wherein the antibody or fragment
thereof inhibits or abolishes KLRG2 activity.
20. The method of claim 12, wherein the antibody is a monoclonal
antibody.
21. The method of claim 12, wherein the antibody is a polyclonal
antibody.
22. The method of claim 12, wherein the antibody is labelled with a
radioactive, fluorescent or enzyme label.
23. The method of claim 12 further comprising detecting the
expression of KLRG2 protein in a tissue sample of the subject.
24. The method of claim 23, wherein the tissue sample is an ovarian
tissue sample.
25. The method of claim 23, wherein the KLRG2 protein is detected
by an immunoradiometric, immunoenzymatic or immunohistochemical
assay.
26. The method of claim 12 further comprising detecting the
expression of KLRG2 mRNA in a tissue sample of the subject.
27. The method of claim 26, wherein the tissue sample is an ovarian
tissue sample.
28. The method of claim 26, wherein the KLRG2 mRNA is detected by
polymerase chain reaction (PCR).
29. The method of claim 12, wherein the subject is a human.
30. A method of treating a proliferative disease in a subject, the
method comprising administering to the subject a siRNA molecule
that specifically silences KLRG2.
31. The method of claim 30, wherein the proliferative disease is an
ovarian cancer.
32. The method of claim 30, wherein the siRNA molecule has a
sequence complementary to SEQ ID NO: 87 or SEQ ID NO: 88.
33. The method of claim 29, wherein the subject is a human.
Description
[0001] The present invention relates to newly identified proteins
as markers for the detection of ovary tumors, or as targets for
their treatment. Also provided are affinity ligands capable of
selectively interacting with the newly identified markers, as well
as methods for tumor diagnosis and therapy using such ligands.
BACKGROUND OF THE INVENTION
[0002] Tumor Markers (or Biomarkers)
[0003] Tumor markers are substances that can be produced by tumor
cells or by other cells of the body in response to cancer. In
particular, a protein biomarker is either a single protein or a
panel of different proteins that could be used to unambiguously
distinguish a disease state. Ideally, a biomarker would have both a
high specificity and sensitivity, being represented in a
significant percentage of the cases of given disease and not in
healthy state.
[0004] Biomarkers can be identified in different biological
samples, like tissue biopsies or preferably biological fluids
(saliva, urine, blood-derivatives and other body fluids), whose
collection does not necessitate invasive treatments. Tumor marker
levels may be categorized in three major classes on the basis of
their clinical use. Diagnostic markers can be used in the detection
and diagnosis of cancer. Prognostics markers are indicative of
specific outcomes of the disease and can be used to define
predictive models that allow the clinicians to predict the likely
prognosis of the disease at time of diagnosis. Moreover, prognosis
markers are helpful to monitor the patient response to a drug
therapy and facilitate a more personalized patient management. A
decrease or return to a normal level may indicate that the cancer
is responding to therapy, whereas an increase may indicate that the
cancer is not responding. After treatment has ended, tumor marker
levels may be used to check for recurrence of the tumor. Finally,
therapeutic markers can be used to develop tumor-specific drugs or
affinity ligand (i.e. antibodies) for a prophylactic
intervention.
[0005] Currently, although an abnormal tumor marker level may
suggest cancer, this alone is usually not enough to accurately
diagnose cancer and their measurement in body fluids is frequently
combined with other tests, such as a biopsy and radioscopic
examination. Frequently, tumor marker levels are not altered in all
of people with a certain cancer disease, especially if the cancer
is at early stage. Some tumor marker levels can also be altered in
patients with noncancerous conditions. Most biomarkers commonly
used in clinical practice do not reach a sufficiently high level of
specificity and sensitivity to unambiguously distinguish a tumor
from a normal state.
[0006] To date the number of markers that are expressed abnormally
is limited to certain types/subtypes of cancer, some of which are
also found in other diseases.
(http://www.cancer.gov/cancertopics/factsheet).
[0007] For example, prostate-specific antigen (PSA) levels are
often used to screen men for prostate cancer, but this is
controversial since elevated PSA levels can be caused by both
prostate cancer or benign conditions, and most men with elevated
PSA levels turn out not to have prostate cancer.
[0008] Another tumor marker, Cancer Antigen 125, (CA 125), is
sometimes used to screen women who have an increased risk for
ovarian cancer. Scientists are studying whether measurement of CA
125, along with other tests and exams, is useful to find ovarian
cancer before symptoms develop. So far, CA 125 measurement is not
sensitive or specific enough to be used to screen all women for
ovarian cancer. Mostly, CA 125 is used to monitor response to
treatment and check for recurrence in women with ovarian cancer.
Finally, human epidermal growth factor receptor (HER2) is a marker
protein overproduced in about 20% of breast cancers, whose
expression is typically associated with a more aggressive and
recurrent tumors of this class.
[0009] Routine Screening Test for Tumor Diagnosis
[0010] Screening tests are a way of detecting cancer early, before
there are any symptoms. For a screening test to be helpful, it
should have high sensitivity and specificity. Sensitivity refers to
the test's ability to identify people who have the disease.
Specificity refers to the test's ability to identify people who do
not have the disease. Different molecular biology approaches such
as analysis of DNA sequencing, small nucleotide polymorphyms, in
situ hybridization and whole transcriptional profile analysis have
done remarkable progresses to discriminate a tumor state from a
normal state and are accelerating the knowledge process in the
tumor field. However so far different reasons are delaying their
use in the common clinical practice, including the higher analysis
complexity and their expensiveness. Other diagnosis tools whose
application is increasing in clinics include in situ hybridization
and gene sequencing.
[0011] Currently, Immuno-HistoChemistry (IHC), a technique that
allows the detection of proteins expressed in tissues and cells
using specific antibodies, is the most commonly used method for the
clinical diagnosis of tumor samples. This technique enables the
analysis of cell morphology and the classification of tissue
samples on the basis of their immunoreactivity. However, at
present, IHC can be used in clinical practice to detect cancerous
cells of tumor types for which protein markers and specific
antibodies are available. In this context, the identification of a
large panel of markers for the most frequent cancer classes would
have a great impact in the clinical diagnosis of the disease.
[0012] Anti-Cancer Therapies
[0013] In the last decades, an overwhelming number of studies
remarkably contributed to the comprehension of the molecular
mechanisms leading to cancer. However, this scientific progress in
the molecular oncology field has not been paralleled by a
comparable progress in cancer diagnosis and therapy. Surgery and/or
radiotherapy are the still the main modality of local treatment of
cancer in the majority of patients. However, these treatments are
effective only at initial phases of the disease and in particular
for solid tumors of epithelial origin, as is the case of colon,
lung, breast, prostate and others, while they are not effective for
distant recurrence of the disease. In some tumor classes,
chemotherapy treatments have been developed, which generally relies
on drugs, hormones and antibodies, targeting specific biological
processes used by cancers to grow and spread. However, so far many
cancer therapies had limited efficacy due to severity of side
effects and overall toxicity. Indeed, a major effort in cancer
therapy is the development of treatments able to target
specifically tumor cells causing limited damages to surrounding
normal cells thereby decreasing adverse side effects. Recent
developments in cancer therapy in this direction are encouraging,
indicating that in some cases a cancer specific therapy is
feasible. In particular, the development and commercialization of
humanized monoclonal antibodies that recognize specifically
tumor-associated markers and promote the elimination of cancer is
one of the most promising solution that appears to be an extremely
favorable market opportunity for pharmaceutical companies. However,
at present the number of therapeutic antibodies available on the
market or under clinical studies is very limited and restricted to
specific cancer classes. So far licensed monoclonal antibodies
currently used in clinics for the therapy of specific tumor
classes, show only a partial efficacy and are frequently associated
with chemotherapies to increase their therapeutic effect.
Administration of Trastuzumab (Herceptin), a commercial monoclonal
antibody targeting HER2, a protein overproduced in about 20% of
breast cancers, in conjunction with Taxol adjuvant chemotherapy
induces tumor remission in about 42% of the cases. Bevacizumab
(Avastin) and Cetuximab (Erbitux) are two monoclonal antibodies
recently licensed for use in humans, targeting the endothelial and
epithelial growth factors respectively that, combined with adjuvant
chemotherapy, proved to be effective against different tumor
diseases. Bevacizumab proved to be effective in prolonging the life
of patients with metastatic colorectal, breast and lung cancers.
Cetuximab demostrated efficacy in patients with tumor types
refractory to standard chemotherapeutic treatments (Adams G. P. and
Weiner L. M. (2005) Monoclonal antibody therapy cancer. Nat
Biotechnol. 23:1147-57).
[0014] In summary, available screening tests for tumor diagnosis
are uncomfortable or invasive and this sometimes limits their
applications. Moreover tumor markers available today have a limited
utility in clinics due to either their incapability to detect all
tumor subtypes of the defined cancers types and/or to distinguish
unambiguously tumor vs. normal tissues. Similarly, licensed
monoclonal antibodies combined with standard chemotherapies are not
effective against the majority of cases. Therefore, there is a
great demand for new tools to advance the diagnosis and treatment
of cancer.
[0015] Experimental Approaches Commonly Used to Identify Tumor
Markers
[0016] Most popular approaches used to discover new tumor markers
are based on genome-wide transcription profile or total protein
content analyses of tumor. These studies usually lead to the
identification of groups of mRNAs and proteins, which are
differentially expressed in tumors. Validation experiments then
follow to eventually single out, among the hundreds of
RNAs/proteins identified, the very few that have the potential to
become useful markers. Although often successful, these approaches
have several limitations and often, do not provide firm indications
on the association of protein markers with tumor. A first
limitation is that, since frequently mRNA levels not always
correlate with corresponding protein abundance (approx. 50%
correlation), studies based on transcription profile do not provide
solid information regarding the expression of protein markers in
tumor (1, 2, 3, 4).
[0017] A second limitation is that neither transcription profiles
nor analysis of total protein content discriminate post-translation
modifications, which often occur during oncogenesis. These
modifications, including phosphorylations, acetylations, and
glycosylations, or protein cleavages influence significantly
protein stability, localization, interactions, and functions
(5).
[0018] As a consequence, large scale studies generally result in
long lists of differentially expressed genes that would require
complex experimental paths in order to validate the potential
markers. However, large scale genomic/proteomic studies reporting
novel tumor markers frequently lack of confirmation data on the
reported potential novel markers and thus do not provide solid
demonstration on the association of the described protein markers
with tumor.
[0019] The approach that we used to identify the protein markers
included in the present invention is based on an innovative
immuno-proteomic technology. In essence, a library of recombinant
human proteins has been produced from E. coli and is being used to
generate polyclonal antibodies against each of the recombinant
proteins.
[0020] The screening of the antibodies library on Tissue
microarrays (TMAs) carrying clinical samples from different
patients affected by the tumor under investigation lead to the
identification of specific tumor marker proteins. Therefore, by
screening TMAs with the antibody library, the tumor markers are
visualized by immuno-histochemistry, the classical technology
applied in all clinical pathology laboratories. Since TMAs also
include healthy tissues, the specificity of the antibodies for the
tumors can be immediately appreciated and information on the
relative level of expression and cellular localization of the
markers can be obtained. In our approach the markers are subjected
to a validation process consisting in a molecular and cellular
characterization.
[0021] Altogether, the detection the marker proteins disclosed in
the present invention selectively in tumor samples and the
subsequent validation experiments lead to an unambiguous
confirmation of the marker identity and confirm its association
with defined tumor classes. Moreover this process provides an
indication of the possible use of the proteins as tools for
diagnostic or therapeutic intervention. For instance, markers
showing a surface cellular localization could be both diagnostic
and therapeutic markers against which both chemical and antibody
therapies can be developed. Differently, markers showing a
cytoplasmic expression could be more likely considered for the
development of tumor diagnostic tests and chemotherapy/small
molecules treatments.
SUMMARY OF THE INVENTION
[0022] The present invention provides new means for the detection
and treatment of tumors, in particular ovary tumor, based on the
identification of protein markers specific for these tumor type,
namely: [0023] i) Killer cell lectin-like receptor subfamily G
member 2 (C-type lectin domain family 15 member B) (KLRG2); [0024]
ii) Solute carrier family 39 (zinc transporter), member 10
(SLC39A10); [0025] iii) G protein-coupled receptor 107 (GPR107);
[0026] iv) DPY-19-like 3 (DPY19L3); [0027] v) Uncharacterized
protein FLJ42986 (FLJ42986); [0028] vi) Collagen, type XX, alpha 1
(COL20A1); [0029] vii) Glycosyltransferase 25 domain containing 2
(GLT25D2); [0030] viii) Synaptotagmin-like 3 (SYTL3); [0031] ix)
DENN/MADD domain containing 1B (DENND1B); [0032] x) Putative
uncharacterized protein (FLJ3710); [0033] xi) Chromosome 6 open
reading frame 98 (C6orf98); [0034] xii) Family with sequence
similarity 69, member B (Fam69B); [0035] xiii) EMI
domain-containing protein 1 Precursor (EMID1); [0036] xiv)
Endoplasmic reticulum metallopeptidase 1 (ERMP1); [0037] xv)
Vitelline membrane outer layer protein 1 homolog Precursor
(VMO1)
[0038] The invention also provides a method for the diagnosis of
these cancer types, comprising a step of detecting the
above-identified markers in a biological sample, e.g. in a tissue
sample of a subject suspected of having or at risk of developing
malignancies or susceptible to cancer recurrences.
[0039] In addition, the tumor markers identify novel targets for
affinity ligands, which can be used for therapeutic applications.
Also provided are affinity ligands, particularly antibodies,
capable of selectively interacting with the newly identified
protein markers.
DETAILED DISCLOSURE OF THE INVENTION
[0040] The present invention is based on the surprising finding of
antibodies that are able to specifically stain tumor tissues from
patients, while negative or very poor staining is observed in
normal tissues from the same patients. These antibodies have been
found to specifically bind to proteins for which no previous
association with tumor has been reported. Hence, in a first aspect,
the invention provides an ovarian tumor marker, which is selected
from the group consisting of:
[0041] i) KLRG2, SEQ ID NO:85, SEQ ID NO:86 or a different isoform
having sequence identity of at least 80%, preferably at least 90%,
more preferably at least 95% to SEQ ID NO:85, SEQ ID NO:86 or a
nucleic acid molecule containing a sequence coding for a KLRG2
protein, said encoding sequence being preferably selected from SEQ
ID NO: 87 and SEQ ID NO:88;
[0042] ii) SLC39A10 in one of its variant isoforms SEQ ID NO:1, SEQ
ID NO:2 or a different isoform having sequence identity of at least
80%, preferably at least 90%, more preferably at least 95% to SEQ
ID NO:1 or SEQ ID NO:2; or a nucleic acid molecule containing a
sequence coding for a SLC39A10 protein, said encoding sequence
being preferably selected from SEQ ID NO: 3 and SEQ ID NO: 4;
[0043] iii) GPR107, in one of its variant isoforms SEQ ID NO:5, SEQ
ID NO:6, SEQ ID NO:7, or a different isoform having sequence
identity of at least 80%, preferably at least 90%, more preferably
at least 95% to any of SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, or a
nucleic acid molecule containing a sequence coding for a GPR107
protein, said encoding sequence being preferably selected from SEQ
ID NO:8, SEQ ID NO:9 and SEQ ID NO:10;
[0044] iv) DPY19L3, in one of its variant isoforms SEQ ID NO:11,
SEQ ID NO:12, SEQ ID NO:13, SEQ ID NO:14, or a different isoform
having sequence identity of at least 80%, preferably at least 90%,
more preferably at least 95% to SEQ ID NO:11, SEQ ID NO:12, SEQ ID
NO:13 or SEQ ID NO:14, or a nucleic acid molecule containing a
sequence coding for a DPY19L3 protein, said encoding sequence being
preferably selected from SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17
and SEQ ID NO:18;
[0045] v) FLJ42986, SEQ ID NO:19 or a different isoform having
sequence identity of at least 80%, preferably at least 90%, more
preferably at least 95% to SEQ ID NO:19, or a nucleic acid molecule
containing a sequence coding for a FLJ42986 protein, said encoding
sequence being preferably SEQ ID NO:20;
[0046] vi) COL20A1, in one of its variant isoforms SEQ ID NO:21,
SEQ ID NO:22, SEQ ID NO:23, or a different isoform having sequence
identity of at least 80%, preferably at least 90%, more preferably
at least 95% to any of SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, or
a nucleic acid molecule containing a sequence coding for a COL20A1
protein, said encoding sequence being preferably selected from SEQ
ID NO:24, SEQ ID NO:25 and SEQ ID NO:26;
[0047] vii) GLT25D2, in one of its variant isoforms SEQ ID NO:27,
SEQ ID NO:28, SEQ ID NO:29, SEQ ID NO:30, or a different isoform
having sequence identity of at least 80%, preferably at least 90%,
more preferably at least 95% to any of SEQ ID NO:27, SEQ ID NO:28,
SEQ ID NO:29, SEQ ID NO:30, or a nucleic acid molecule containing a
sequence coding for a GLT25D2 protein, said encoding sequence being
preferably selected from SEQ ID NO:31, SEQ ID NO:32, SEQ ID NO:33
and SEQ ID NO:34;
[0048] viii) SYTL3, in one of its variant isoforms SEQ ID NO:35,
SEQ ID NO:36, SEQ ID NO:37, or a different isoform having sequence
identity of at least 80%, preferably at least 90%, more preferably
at least 95% to any of SEQ ID NO:35, SEQ ID NO:36, SEQ ID NO:37, or
a nucleic acid molecule containing a sequence coding for a SYTL3
protein, said encoding sequence being preferably selected from SEQ
ID NO:38, SEQ ID NO:39 and SEQ ID NO:40;
[0049] ix) DENND1B; in one of its variant isoforms SEQ ID NO:41,
SEQ ID NO:42, SEQ ID NO:43, SEQ ID NO:44, or a different isoform
having sequence identity of at least 80%, preferably at least 90%,
more preferably at least 95% to any of SEQ ID NO:41, SEQ ID NO:42,
SEQ ID NO:43, SEQ ID NO:44, or a nucleic acid molecule containing a
sequence coding for a DENND1B protein, said encoding sequence being
preferably selected from SEQ ID NO:45, SEQ ID NO:46, SEQ ID NO:47
and SEQ ID NO:48;
[0050] x) FLJ3710, SEQ ID NO:49 or a different isoform having
sequence identity of at least 80%, preferably at least 90%, more
preferably at least 95% to SEQ ID NO:49, or a nucleic acid molecule
containing a sequence coding for a FLJ3710 protein, said encoding
sequence being preferably SEQ ID NO:50;
[0051] xi) Chromosome 6 orf 98 (C6orf98) SEQ ID NO:51 or a
different isoform having sequence identity of at least 80%,
preferably at least 90%, more preferably at least 95% to SEQ ID
NO:51, or a nucleic acid molecule containing a sequence coding for
a C6orf98 protein, said encoding sequence being preferably SEQ ID
NO:52;
[0052] xii) Family with sequence similarity 69, member B (Fam69B)
SEQ ID NO:53, SEQ ID NO:54 or a different isoform having sequence
identity of at least 80%, preferably at least 90%, more preferably
at least 95% to SEQ ID NO:53 or SEQ ID NO:54, or a nucleic acid
molecule containing a sequence coding for a Fam69B protein, said
encoding sequence being preferably selected from SEQ ID NO:55 and
SEQ ID NO:56;
[0053] xiii) EMI domain-containing protein 1 Precursor (EMID1) in
one of its variant isoforms SEQ ID NO:57, SEQ ID NO:58, SEQ ID
NO:59, SEQ ID NO:60, SEQ ID NO:61, SEQ ID NO:62, SEQ ID NO:63, SEQ
ID NO:64, SEQ ID NO:65, SEQ ID NO:66, SEQ ID NO:67, SEQ ID NO:68,
SEQ ID NO:69, SEQ ID NO:70, or a different isoform having sequence
identity of at least 80%, preferably at least 90%, more preferably
at least 95% to SEQ ID NO:57, SEQ ID NO:58, SEQ ID NO:59, SEQ ID
NO:60, SEQ ID NO:61, SEQ ID NO:62, SEQ ID NO:63, SEQ ID NO:64, SEQ
ID NO:65, SEQ ID NO:66, SEQ ID NO:67, SEQ ID NO:68, SEQ ID NO:69 OR
SEQ ID NO:70, or a nucleic acid molecule containing a sequence
coding for a EMID protein, said encoding sequence being preferably
selected from SEQ ID NO:71, SEQ ID NO:72, SEQ ID NO:73, SEQ ID
NO:74, SEQ ID NO:75, SEQ ID NO:76, SEQ ID NO:77, SEQ ID NO:78, SEQ
ID NO:79, SEQ ID NO:80, SEQ ID NO:81, SEQ ID NO:82, SEQ ID NO:83
and SEQ ID NO:84;
[0054] xiv) ERMP1, SEQ ID NO:89, SEQ ID NO:90 and SEQ ID NO:91, or
a different isoform having sequence identity of at least 80%,
preferably at least 90%, more preferably at least 95% to SEQ ID
NO:89, SEQ ID NO:90 or SEQ ID NO:91, or a nucleic acid molecule
containing a sequence coding for a ERMP1 protein, said encoding
sequence being preferably selected from SEQ ID NO:92, SEQ ID NO:93
and SEQ ID NO:94;
[0055] xv) VMO1, SEQ ID NO:95, SEQ ID NO:96, SEQ ID NO:97, SEQ ID
NO: 98 or a different isoform having sequence identity of at least
80%, preferably at least 90%, more preferably at least 95% to SEQ
ID NO:95, SEQ ID NO:96, SEQ ID NO:97, SEQ ID NO: 98 or a nucleic
acid molecule containing a sequence coding for a VMO1 protein, said
encoding sequence being preferably selected from SEQ ID NO:99, SEQ
ID NO:100 SEQ ID NO:101 and SEQ ID NO: 102.
[0056] As used herein, "Percent (%) amino acid sequence identity"
with respect to the marker protein sequences identified herein
indicates the percentage of amino acid residues in a full-length
protein variant or isoform according to the invention, or in a
portion thereof, that are identical with the amino acid residues in
the specific marker sequence, after aligning the sequences and
introducing gaps, if necessary, to achieve the maximum percent
sequence identity, and not considering any conservative
substitutions as part of the sequence identity. Alignment for
purposes of determining percent amino acid sequence identity can be
achieved in various ways that are within the skill in the art, for
instance, using publicly available computer software such as BLAST,
BLAST-2, ALIGN or Megalign (DNASTAR) software. Those skilled in the
art can determine appropriate parameters for measuring alignment,
including any algorithms needed to achieve maximal alignment over
the full length of the sequences being compared. Identity between
nucleotide sequences is preferably determined by the Smith-Waterman
homology search algorithm as implemented in the MPSRCH program
(Oxford Molecular), using an affine gap search with parameters gap
open penalty=12 and gap extension penalty=1.
[0057] SLC39A10 mRNA has been shown to be moderately upregulated in
breast cancer tissues as compared to normal samples (approximately
1.5 fold). In the same sudy, loss of SLC39A10 transcription in
breast cell lines has been reported to reduce cell migratory
activity (6). However, published studies on the expression of
SLC39A10 in breast tumor cells are limited to the analysis of
SLC39A10 transcript whilst, to the best of our knowledge, no data
have been reported documenting the presence of SLC39A10 protein in
these tumor cells.
[0058] SLC39A10 is also mentioned in a patent application reporting
long lists of differentially transcribed genes in tumor cells by
using genome-scale transcription profile analysis (e.g. in
Publication Number: US20070237770A1). Again, no data are given
documenting the expression of SLC39A10 in tumor at protein level.
The lack of correlation between mRNA and protein expression,
besides being a general fact, has been specifically demonstrated
for LIV-1, another member of the zinc transporter family,
suggesting that a similar phenomenon could be extended to other
proteins of this class (7). Finally, no evidence exists on the
association of SLC39A10 protein with other tumors, such as ovary
tumor classes.
[0059] In the present invention we disclose SLC39A10 as a protein
without previous known association with ovarian tumor classes and
preferably used as a marker ovary tumors and in general for cancers
of these types. As described below, an antibody generated towards
the SLC39A10 protein shows a selective immunoreactivity in
histological preparation of ovary cancer tissues which indicates
the presence of SLC39A10 in these cancer samples and makes SLC39A10
protein and its antibody highly interesting tools for specifically
distinguishing these cancer types from a normal state.
[0060] Moreover, this antibody also detected its target protein on
the surface of ovary tumor cell lines and in cells transfected with
a SLC39A10 encoding construct, indicating that the protein is
exposed to the external environment and easily accessible to
affinity ligands such as antibodies.
[0061] G protein-coupled receptor 107 (GPR107, synonyms: Protein
GPR107 Precursor, Lung seven transmembrane receptor 1, Gene ID:
ENSG00000148358; Transcript IDs: ENST00000347136, ENST00000372410,
ENST00000372406; Protein IDs:ENSP00000336988, ENSP00000361487,
ENSP00000361483) is a partially characterized protein. GPR107 has
been mentioned in patent applications on pancreatic tumor (e.g.
US20050260639A1) and in different patent application based on whole
genome transcription profile analysis of cancer, but no supporting
data are provided on the expression of GPR107 protein in tumor
samples. Based on the above, in the present invention we disclose
GPR107 as a protein without previous known association with the
tumor class under investigation and preferably used as a marker for
ovary tumor, and in general for cancers of this type. As described
below, an antibody generated towards GPR107 protein shows a
selective immunoreactivity in histological preparation of ovary
cancer tissues, which indicates the presence of this protein in
these cancer samples.
[0062] The protein was detected on the surface of tumor cell lines
by anti-GPR107 antibodies, indicating that it is accessible to the
binding of specific antibodies.
[0063] Protein dpy-19 homolog 3 (DPY19L3; synonym: Dpy-19-like
protein 3; Gene ID: ENSG00000178904; Transcript IDs:
ENST00000319326, ENST00000392250, ENST00000342179, ENST00000392248;
Protein IDs: ENSP00000315672, ENSP00000376081. ENSP00000344937,
ENSP00000376079) Dpy 19L3 transcript has been reported as
differentially expressed in multiple myeloma (Publication Number:
US20080280779A1). However not data are available at level of
protein expression. In the present invention we disclose Dpy 19L3
protein as associated with tumor and preferably used as a marker
for ovary tumor, and in general for these cancer types. As
described below, an antibody generated towards DPY19L3 protein
shows a selective immunoreactivity in histological preparation of
ovary cancer tissues, which indicates the presence of this protein
in these cancer samples.
[0064] Moreover the protein is detected on a panel of ovary tumor
cell lines reinforcing the evidence. Finally the protein was
detected on the surface of tumor cell lines by the specific
antibody, suggesting that it can be exploited as target for
affinity ligands with therapeutic activity.
[0065] Uncharacterized protein FLJ42986 (FLJ42986, Gene ID:
ENSG00000196460; Transcript ID: ENST00000376826; Protein ID:
ENSP00000366022) is a hypothetical protein without previous known
association with ovarian tumor classes and is preferably used as a
marker for ovary tumor and in general for these cancer types. As
described below, an antibody generated towards FLJ42986 protein
shows a selective immunoreactivity in histological preparation of
ovary cancer tissues, which indicates the presence of this protein
in these cancer samples.
[0066] Collagen, type XX, alpha 1 (COL20A1); Synonyms: Collagen
alpha-1(XX) chain Precursor; Gene ID: ENSG00000101203; Protein IDs:
ENSP00000323077; ENSP00000346302; ENSP00000351767; Transcript IDs:
ENST00000326996; ENST00000354338; ENST00000358894) belongs to the
family of collagenous domain, Fibronectin type III domain, heparin
binding domain, von Willebrand type A domain proteins. COL20A1 is a
protein without previous known association with tumor and is
preferably used as a marker for ovary tumor and in general for
these cancer types. As described below, an antibody generated
towards COL20A1 protein shows a selective immunoreactivity in
histological preparation of ovary tissues, which indicates the
presence of this protein in these cancer samples.
[0067] Glycosyltransferase 25 domain containing 2 (GLT25D2;
synonyms: Glycosyltransferase 25 family member 2 Precursor) Gene
ID: ENSG00000198756; Trasnscript IDs: ENST00000367522,
ENST00000361927, ENST00000367520, ENST00000367521; Protein IDs:
ENSP00000356492, ENSP00000354960, ENSP00000356490,
ENSP00000356491), is a protein without previous known association
with ovarian tumor and is preferably used as a marker for ovary
tumor and in general for these cancer types. As described below, an
antibody generated towards GLT25D2 protein shows a selective
immunoreactivity in histological preparation of ovarian cancer,
which indicates the presence of this protein in these cancer
samples.
[0068] Synaptotagmin-like 3 (SYTL3; synonyms: SLP3, SLP3-B,
Synaptotagmin-like protein 3, Exophilin-6; Gene ID:
ENSG00000164674; Transcript ID: ENST00000360448, ENST00000297239,
ENST00000367081; Protein ID: ENSP00000353631, ENSP00000297239,
ENSP00000356048) belongs the family of proteins containing C2
domain, calcium-dependent phospholipid binding, neurexin binding,
phospholipid binding, protein binding, rab3A effector domain, Slp
homology domain. SYTL3 is a protein without previous known
association with ovarian tumor and is preferably used as a marker
for ovary tumor and in general for these cancer types. As described
below, an antibody generated towards SYTL3 protein shows a
selective immunoreactivity in histological preparation of ovary
cancer tissues, which indicates the presence of this protein in
these cancer samples.
[0069] DENN/MADD domain containing 1B (DENND1B; synonyms: DENN
domain-containing protein 1B, Protein FAM31B, C1orf218; Gene ID:
ENSG00000162701. Transcript IDs: ENST00000294738, ENST00000367396,
ENST00000400967, ENST00000235453; Protein IDs: ENSP00000294738,
ENSP00000356366, ENSP00000383751, ENSP00000235453) is a poorly
characterized protein without previous known association with
ovarian tumors and is preferably used as a marker for ovary tumor
and in general for these cancer types. As described below, an
antibody generated towards DENND1B protein shows a selective
immunoreactivity in histological preparation of ovary cancer
tissues, which indicates the presence of this protein in these
cancer samples.
[0070] Putative uncharacterized protein FLJ37107 (FLJ37107;
synonyms: LOC284581; Gene ID: ENSG00000177990, Transcript ID:
gi|58218993|ref|NM_001010882.1, Protein ID:
gi|58218994|ref|NP_001010882.1| hypothetical protein LOC284581
[Homo sapiens], gi|74729692|sp|Q8N9I1.1|YA028_HUMAN) is an
uncharacterized protein without previous known association with
tumor and is preferably used as a marker for ovary tumor and in
general for these cancer types. As described below, an antibody
generated towards FLJ3710 protein shows a selective
immunoreactivity in histological preparation of ovary cancer
tissues, which indicates the presence of this protein in these
cancer samples.
[0071] Chromosome 6 open reading frame 98 (C6orf98; synonym:
dJ45H2.2; Gene ID: EG:387079, da ENSG00000222029 has 1 transcript:
ENST00000409023, associated peptide: ENSP00000386324 and 1 exon:
ENSE00001576965) is an uncharacterized protein. Analysis of human
genome databases (E.g. Ensembl) erroneously assigns C6orf98 as
SYNE1. Although SYNE nucleic acid sequences overlap with C6ORF98
transcript, the encoded proteins show no match. In fact C6orf98
maps on an SYNE1 untranslated region (intron) and its product
derives from a different reading frame than those annotated for
SYNE1 isoforms in public databases. C6orf98 is a protein without
C6orf98 is a protein without previous known association with tumor
and is preferably used as a marker for ovary tumor and in general
for these cancer types. As described below, an antibody generated
towards C6orf98 protein shows a selective immunoreactivity in
histological preparation of ovary cancer tissues, which indicates
the presence of this protein in these cancer samples.
[0072] Family with sequence similarity 69, member B (Fam69B;
synonym: C9orf136; Gene ID: ENSG00000165716; Transcript IDs:
ENST00000371692, ENST00000371691; Protein IDs: ENSP00000360757,
ENSP00000360756) is a hypothetical protein without previous known
association with tumor. This protein has been recently associated
with Type 2 diabetes mellitus disease (8) and included in patent
application on diabetes (Patent publication number:
WO2008065544A2). In the present invention we disclose FAM69B as
associated with tumor and preferably used as a marker for ovarian
tumor and in general for these cancer types. As described below, an
antibody generated towards Fam69B protein shows a selective
immunoreactivity in histological preparation of ovary cancer
tissues, which indicates the presence of this protein in these
cancer samples.
[0073] EMI domain-containing protein 1 Precursor (EMID1; synonyms:
Emilin and multimerin domain-containing protein 1, Protein Emu1;
Gene ID: >OTTHUMG00000030824
TABLE-US-00001 Transcript IDs: Protein IDs: OTTHUMT00000075712
OTTHUMP00000028901, ENST00000429226 ENSP00000403816,
ENST00000430127 ENSP00000399760, ENST00000435427 ENSP00000402621,
ENST00000404820 ENSP00000384452, ENST00000334018 ENSP00000335481,
ENST00000429415 ENSP00000409801, ENST00000448676 ENSP00000413034,
ENST00000404755 ENSP00000385414, ENST00000435194 ENSP00000417004,
ENST00000426629 ENSP00000403484, ENST00000457925 ENSP00000405422,
ENST00000433143 ENSP00000408339, ENST00000455501
ENSP00000413947),
[0074] is a poorly characterized protein. EMID1 gene is mentioned
in a patent application on follicular thyroid carcinoma
(Publication number US2006035244 (A1). However, no data are
available on the presence of this protein in the tumor class under
investigation. Therefore, we disclose EMID1 as a protein without
previous known association with ovary tumors and preferably used as
a marker for ovary tumor and in general for these cancer types. As
described below, an antibody generated towards EMID1 protein shows
a selective immunoreactivity in histological preparation of ovary
cancer tissues, which indicates the presence of this protein in
these cancer samples. In particular this antibody stains also tumor
secretion products indicating that EMID1 protein is specifically
released by tumor cells.
[0075] Killer cell lectin-like receptor subfamily G member 2
(C-type lectin domain family 15 member B) (KLRG2, synonyms:
CLEC15B, FLJ44186; GENE ID: ENSG00000188883; Transcript IDs:
ENST00000340940, ENST00000393039; Protein IDs: ENSP00000339356,
ENSP00000376759) is a poorly characterized protein. A KLRG2
sequence is included in a patent application on the use of an agent
with tumor-inhibiting action of a panel of targets associated with
different tumors, whose expression is mainly shown at RNA level
(Publication number WO2005030250). However no data are provided
documenting the presence of KLRG2 protein in the tumors. Moreover,
no experimental evidence is given on the specificity of the
proposed anti-tumor agent for KLRG2. Based on these considerations,
in the present invention we disclose KLRG2 as a protein without
previous known association with tumor class under investigation and
preferably used as a marker for ovary tumor, and in general for
cancers of this type. As described below, an antibody generated
towards KLRG2 protein shows a selective immunoreactivity in
histological preparation of ovary cancer tissues, which indicates
the presence of this protein in this cancer type. In particular,
the immunoistochemistry staining accumulates in the plasma membrane
of tumor cells. Moreover, localization analysis of tumor cell lines
showed that the protein is exposed on the cell surface and
accessible to the binding of specific antibodies. Finally,
silencing of KLRG2 significantly reduced the invasiveness and
proliferation properties of tumor cells lines. Overall, KLRG2 is a
likely target for the development of anti-cancer therapies being
accessible to the action of affinity ligands and being involved in
cellular processes relevant for tumor development.
[0076] Endoplasmic reticulum metallopeptidase 1 (ERMP1, synonyms:
FLJ23309, FXNA, KIAA1815; GENE ID: ENSG00000099219; Transcript IDs:
ENST00000214893, ENST00000339450, ENST00000381506; Protein IDs:
ENSP00000214893, ENSP00000340427, ENSP00000370917) is a
transmembrane metallopeptidase, so far described as localized to
the endoplasmic reticulum. ERMP1 transcript has been found
differentially expressed in the rat ovary at the time of
folliculogenesis. A lower level of ERMP1 transcript in the rat
ovary resulted in substantial loss of primordial, primary and
secondary follicles, and structural disorganization of the ovary,
suggesting that is required for normal ovarian histogenesis (9).
ERMP1 has been also included in a patent application (Publication
Number: US 2003064439) on novel nucleic acid sequences encoding
melanoma associated antigen molecules. However in these
publications, no solid data documented the relation of ERMP1
protein with tumor. Based on available information, ERMP1 protein
has never been previously associated with tumor. In the present
invention, differently with published scientific data, we disclose
ERMP1 as a protein associated with tumor, preferably used as a
marker for ovarian tumor, and in general for cancers of this type.
As described below, an antibody generated towards ERMP1 protein
shows a selective immunoreactivity in histological preparation of
ovary cancer tissues, which indicates the presence of this protein
in this cancer type. In particular our immunoistochemistry analysis
of ovarian tissues indicates that the protein shows plasma membrane
localization. Moreover, localization analysis of ovary tumor cell
lines showed that the protein is exposed on the cell surface and
accessible to the binding of specific antibodies. Finally,
silencing of ERMP1 significantly reduced the invasiveness and
proliferation properties of tumor cells lines. Based on the above
evidences, ERMP1 is a likely target for the development of
anti-cancer therapies being exposed to the action of affinity
ligand and being involved in cellular processes relevant for tumor
development.
[0077] Vitelline membrane outer layer protein 1 homolog Precursor
(VMO1, synonyms: ERGA6350, PRO21055; GeneID: ENSG00000182853;
Transcript ID: ENST00000328739, ENST00000354194, ENST00000416307,
ENST00000441199; Protein IDs: ENSP00000328397, ENSP00000346133,
ENSP00000390450, ENSP00000408166) is a marginally characterized
protein. Evidences on the expression of VMO1 human protein are
essentially based on studies in which the protein was detected in
sputum from smoking human female and urine (10, 11).
[0078] In the present invention we disclose VMO1 as a protein
without previous known association with tumor classes under
investigation and preferably used as a marker for ovarian tumor,
and in general for cancers of this type. As described below, an
antibody generated towards VMO1 protein shows a selective
immunoreactivity in histological preparation of ovarian cancer
tissues, which indicates the presence of this protein in this
cancer type. Immunoreactivity accumulates at the plasma membrane of
tumor cells, providing a first indication that this protein is
accessible to the action of affinity ligands (such as antibodies)
with anti-cancer activities.
[0079] A further aspect of this invention is a method of screening
a tissue sample for malignancy, which comprises determining the
presence in said sample of at least one of the above-mentioned
tumor markers. This method includes detecting either the marker
protein, e.g. by means of labeled monoclonal or polyclonal
antibodies that specifically bind to the target protein, or the
respective mRNA, e.g. by means of polymerase chain reaction
techniques such as RT-PCR. The methods for detecting proteins in a
tissue sample are known to one skilled in the art and include
immunoradiometric, immunoenzymatic or immunohistochemical
techniques, such as radioimmunoassays, immunofluorescent assays or
enzyme-linked immunoassays. Other known protein analysis
techniques, such as polyacrylamide gel electrophoresis (PAGE),
Western blot or Dot blot are suitable as well. Preferably, the
detection of the protein marker is carried out with the
immune-histochemistry technology, particularly by means of High
Through-Put methods that allow the analyses of the antibody
immune-reactivity simultaneously on different tissue samples
immobilized on a microscope slide. Briefly, each Tissue Micro Array
(TMA) slide includes tissue samples suspected of malignancy taken
from different patients, and an equal number of normal tissue
samples from the same patients as controls. The direct comparison
of samples by qualitative or quantitative measurement, e.g. by
enzimatic or colorimetric reactions, allows the identification of
tumors.
[0080] In one embodiment, the invention provides a method of
screening a sample of ovary tissue for malignancy, which comprises
determining the presence in said sample of the SLC39A10, GPR107,
DPY19L3, FLJ42986, COL20A1, GLT25D2, SYTL3, DENND1B, FLJ3710,
C6orf98, FAM69B, EMID1, KLRG2, ERMP1, VMO1 protein tumor marker,
variants or isoforms thereof as described above.
[0081] A further aspect of the invention is a method in vitro for
determining the presence of an ovary tumor in a subject, which
comprises the steps of:
[0082] (1) providing a sample of the tissue suspected of containing
tumor cells;
[0083] (2) determining the presence of a tumor marker as above
defined, or a combination thereof in said tissue sample by
detecting the expression of the marker protein or the presence of
the respective mRNA transcript;
[0084] wherein the detection of one or more tumor markers in the
tissue sample is indicative of the presence of tumor in said
subject.
[0085] The methods and techniques for carrying out the assay are
known to one skilled in the art and are preferably based on
immunoreactions for detecting proteins and on PCR methods for the
detection of mRNAs. The same methods for detecting proteins or
mRNAs from a tissue sample as disclosed above can be applied.
[0086] A further aspect of this invention is the use of the tumor
markers herein provided as targets for the identification of
candidate antitumor agents. Accordingly, the invention provides a
method for screening a test compound which comprises contacting the
cells expressing a tumor-associated protein selected from: Solute
carrier family 39 (zinc transporter), member 10 (SLC39A10); G
protein-coupled receptor 107 (GPR107); DPY19-like 3 (DPY19L3);
Uncharacterized protein FLJ42986 (FLJ42986); Collagen, type XX,
alpha 1 (COL20A1); Glycosyltransferase 25 domain containing 2
(GLT25D2); Synaptotagmin-like 3 (SYTL3); DENN/MADD domain
containing 1B (DENND1B); Putative uncharacterized protein
(FLJ3710); Chromosome 6 open reading frame 98 (C6orf98); Family
with sequence similarity 69, member B (Fam69B); EMI
domain-containing protein 1 Precursor (EMID1); Killer cell
lectin-like receptor subfamily G member 2 (KLRG2); Endoplasmic
reticulum metallopeptidase 1 (ERMP1); Vitelline membrane outer
layer protein 1 homolog precursor (VMO1);
[0087] with the test compound, and determining the binding of said
compound to said tumor-associated protein. In addition, the ability
of the test compound to modulate the activity of each target
molecule can be assayed.
[0088] A further aspect of the invention is an antibody or a
fragment thereof, which is able to specifically recognize and bind
to one of the tumor-associated proteins described above. The term
"antibody" as used herein refers to all types of immunoglobulins,
including IgG, IgM, IgA, IgD and IgE. Such antibodies may include
polyclonal, monoclonal, chimeric, single chain, antibodies or
fragments such as Fab or scFv. The antibodies may be of various
origin, including human, mouse, rat, rabbit and horse, or chimeric
antibodies. The production of antibodies is well known in the art.
For the production of antibodies in experimental animals, various
hosts including goats, rabbits, rats, mice, and others, may be
immunized by injection with polypeptides of the present invention
or any fragment or oligopeptide or derivative thereof which has
immunogenic properties or forms a suitable epitope. Monoclonal
antibodies may be produced following the procedures described in
Kohler and Milstein, Nature 265:495 (1975) or other techniques
known in the art.
[0089] The antibodies to the tumor markers of the invention can be
used to detect the presence of the marker in histologic
preparations or to distinguish tumor cells from normal cells. To
that purpose, the antibodies may be labeled with radiocative,
fluorescent or enzyme labels.
[0090] In addition, the antibodies can be used for treating
proliferative diseases by modulating, e.g. inhibiting or abolishing
the activity of a target protein according to the invention.
Therefore, in a further aspect the invention provides the use of
antibodies to a tumor-associated protein selected from: Solute
carrier family 39 (zinc transporter), member 10 (SLC39A10); G
protein-coupled receptor 107 (GPR107); DPY19-like 3 (DPY19L3);
Uncharacterized protein FLJ42986 (FLJ42986); Collagen type XX,
alpha 1 (COL20A1); glycosyltransferase 25 domain containing 2
(GLT25D2); Synaptotagmin-like 3 (SYTL3); DENN/MADD domain
containing 1B (DENND1B); Putative uncharacterized protein
(FLJ3710); Chromosome 6 open reading frame 98 (C6orf98); Family
with sequence similarity 69, member B (Fam69B); EMI
domain-containing protein 1 Precursor (EMID1); Killer cell
lectin-like receptor subfamily G member 2 (KLRG2); Endoplasmic
reticulum metallopeptidase 1 (ERMP1); Vitelline membrane outer
layer protein 1 homolog Precursor (VMO1);
[0091] for the preparation of a therapeutic agent for the treatment
of proliferative diseases. For use in therapy, the antibodies can
be formulated with suitable carriers and excipients, optionally
with the addition of adjuvants to enhance their effetcs.
[0092] A further aspect of the invention relates to a diagnostic
kit containing suitable means for detection, in particular the
polypeptides or polynucleotides, antibodies or fragments or
derivatives thereof described above, reagents, buffers, solutions
and materials needed for setting up and carrying out the
immunoassays, nucleic acid hybridization or PCR assays described
above. Parts of the kit of the invention can be packaged
individually in vials or bottles or in combination in containers or
multicontainer units.
DESCRIPTION OF THE FIGURES
[0093] FIG. 1. Analysis of purified SLC39A10 recombinant
protein
[0094] Left panel: Comassie staining of purified His-tag SLC39A10
fusion protein separated by SDS-PAGE; Right panel: WB on the
recombinant SLC39A10 protein stained with anti-SLC39A10 antibody.
Arrow marks the protein band of the expected size. The low
molecular weight bands correspond to partially degraded forms of
SLC39A10 protein. Molecular weight markers are reported on the
left.
[0095] FIG. 2. Staining of ovary tumor TMA with anti-SLC39A10
antibodies
[0096] In the case of the ovarian sample, both tumor and normal
cells are represented on the same image. The antibody stains
specifically tumor cells (in dark gray); negative or poor staining
is visible in normal cells.
[0097] FIG. 3. Confocal microscopy analysis of expression and
localization of SLC39A10 in transfected cells
[0098] HeLa cells transfected with the empty pcDNA3 vector (upper
panels) or with the plasmid construct encoding the SLC39A10 gene
(lower panels) stained with secondary antibodies (left panels) and
with anti-SLC39A10 antibodies (right panels). Arrowheads mark
surface specific localization.
[0099] FIG. 4. Expression and localization of SLC39A10 in ovary
tumor cell lines
[0100] Flow cytometry analysis of SLC39A10 cell-surface
localization in OVCAR-5 (left panel) and OVCAR-8 (right panel)
tumor cells stained with a negative control antibody (filled curve
or with anti-SLC39A10 antibody (empty curve). X axis, Fluorescence
scale; Y axis, Cells (expressed as % relatively to major
peaks).
[0101] FIG. 5. Analysis of purified GPR107 recombinant protein
[0102] Left panel: Comassie staining of purified His-tag GPR107
fusion protein separated by SDS-PAGE; Right panel: WB on the
recombinant GPR107 protein stained with anti-GPR107 antibody. Arrow
marks the protein band of the expected size. The high molecular
weight band is consistent with a protein dimer of GPR107. Molecular
weight markers are reported on the left.
[0103] FIG. 6. Staining of ovary tumor TMA with anti-GPR107
antibodies
[0104] In the case of the ovarian sample, both tumor and normal
cells are represented on the same image. The antibody stains
specifically tumor cells (in dark gray); negative or poor staining
is visible in normal cells.
[0105] FIG. 7. Expression and localization of GPR107 in ovary tumor
cells
[0106] Flow cytometry analysis of GPR107 cell surface localization
in OVCAR-8 cells stained with a control antibody (filled curve or
with anti-GPR107 antibody (empty curve). X axis, Fluorescence
scale; Y axis, Cells (expressed as percentage, relatively to major
peaks).
[0107] FIG. 8. Analysis of purified DPY19L3 recombinant protein
[0108] Left panel: Comassie staining of purified His-tag DPY19L3
fusion protein separated by SDS-PAGE; Right panel: WB on the
recombinant DPY19L3 protein stained with anti-DPY19L3 antibody.
Arrow marks the protein band of the expected size. Molecular weight
markers are reported on the left.
[0109] FIG. 9. Staining of ovary tumor TMA with anti-DPY19L3
antibodies
[0110] In the case of the ovarian sample, both tumor and normal
cells are represented on the same image. The antibody stains
specifically tumor cells (in dark gray); negative or poor staining
is visible in normal cells.
[0111] FIG. 10. Expression and localization of DPY19L3 in ovary
tumor cell lines
[0112] Left panel: Western blot analysis of DPY19L3 expression in
total protein extracts separated by SDS-PAGE from the ovary-derived
tumor cells OVCAR-3 (lane 1), OVCAR-4 (lane 2), OVCAR-5 (lane 3),
OVCAR-8 (lane 4). Arrow marks the protein band of the expected
size. Molecular weight markers are reported on the left.
[0113] Right panel: Flow cytometry analysis of DPY19L3 cell surface
localization in OVCAR-5 cells stained with a negative control
antibody (filled curve or with anti-DPY19L3 antibody (empty curve).
X axis, Fluorescence scale; Y axis, Cells (expressed as percentage,
relatively to major peaks).
[0114] FIG. 11. Analysis of purified FLJ42986 recombinant
protein
[0115] Left panel: Comassie staining of purified His-tag FLJ42986
fusion protein separated by SDS-PAGE; Right panel: WB on the
recombinant FLJ42986 protein stained with anti-FLJ42986 antibody.
Arrow marks the protein band of the expected size. Molecular weight
markers are reported on the left.
[0116] FIG. 12. Staining of ovary tumor TMA with anti-FLJ42986
antibodies
[0117] In the case of the ovarian sample, both tumor and normal
cells are represented on the same image. The antibody stains
specifically tumor cells (in dark gray); negative or poor staining
is visible in normal cells.
[0118] FIG. 13. Analysis of purified COL20A1 recombinant
protein
[0119] Left panel: Comassie staining of purified His-tag COL20A1
fusion protein separated by SDS-PAGE; Right panel: WB on the
recombinant COL20A1 protein stained with anti-COL20A1 antibody.
Arrow marks the protein band of the expected size. The high
molecular weight band is consistent with the formation of protein
dimer. Molecular weight markers are reported on the left.
[0120] FIG. 14. Staining of ovary tumor TMA with anti-COL20A1
antibodies
[0121] In the case of the ovarian sample, both tumor and normal
cells are represented on the same image. The antibody stains
specifically tumor cells (in dark gray); negative or poor staining
is visible in normal cells.
[0122] FIG. 15. Analysis of purified GLT25D2 recombinant
protein
[0123] Left panel: Comassie staining of purified His-tag GLT25D2
fusion protein separated by SDS-PAGE; Right panel: WB on the
recombinant GLT25D2 protein stained with anti-GLT25D2 antibody.
Arrow marks the protein band of the expected size. Molecular weight
markers are reported on the left.
[0124] FIG. 16. Staining of ovary tumor TMA with anti-GLT25D2
antibodies
[0125] In the case of the ovarian sample, both tumor and normal
cells are represented on the same image. The antibody stains
specifically tumor cells (in dark gray); negative or poor staining
is visible in normal cells.
[0126] FIG. 17. Analysis of purified SYTL3 recombinant protein Left
panel: Comassie staining of purified His-tag SYTL3 fusion protein
espressed; Right panel: WB on the recombinant SYTL3 protein stained
with anti-SYTL3 antibody. Arrow marks the protein band of the
expected size. Molecular weight markers are reported on the
left.
[0127] FIG. 18. Staining of ovary tumor TMA with anti-SYTL3
antibodies
[0128] In the case of the ovarian sample, both tumor and normal
cells are represented on the same image. The antibody stains
specifically tumor cells (in dark gray); negative or poor staining
is visible in normal cells.
[0129] FIG. 19. Analysis of purified DENND1B recombinant
protein
[0130] Left panel: Comassie staining of purified His-tag DENND1B
fusion protein separated by SDS-PAGE; Right panel: WB on the
recombinant DENND1B protein stained with anti-DENND1B antibody.
Arrow marks the protein band of the expected size. The low
molecular weight band corresponds to partially degraded forms of
the protein. Molecular weight markers are reported on the left.
[0131] FIG. 20. Staining of ovary tumor TMA with anti DENND1B
antibodies
[0132] In the case of the ovarian sample, both tumor and normal
cells are represented on the same image. The antibody stains
specifically tumor cells (in dark gray); negative or poor staining
is visible in normal cells.
[0133] FIG. 21. Analysis of purified FLJ37107 recombinant
protein
[0134] Left panel: Comassie staining of purified His-tag FLJ37107
fusion protein separated by SDS-PAGE; Right panel: WB on the
recombinant protein stained with anti-FLJ37107 antibody. Arrow
marks the protein band of the expected size. Molecular weight
markers are reported on the left.
[0135] FIG. 22. Staining of ovary tumor TMA with anti-FLJ37107
antibodies
[0136] In the case of the ovarian sample, both tumor and normal
cells are represented on the same image. The antibody stains
specifically tumor cells (in dark gray); negative or poor staining
is visible in normal cells.
[0137] FIG. 23. Analysis of purified C6orf98 recombinant
protein
[0138] Left panel: Comassie staining of purified His-tag C6orf98
fusion protein separated by SDS-PAGE; Right panel: WB on the
recombinant C6orf98 protein stained with anti-C6orf98 antibody.
Arrow marks the protein band of the expected size. Molecular weight
markers are reported on the left.
[0139] FIG. 24. Staining of ovary tumor TMA with anti-C6orf98
antibodies
[0140] In the case of the ovarian sample, both tumor and normal
cells are represented on the same image. The antibody stains
specifically tumor cells (in dark gray); negative or poor staining
is visible in normal cells.
[0141] FIG. 25. Analysis of purified Fam69B recombinant protein
[0142] Left panel: Comassie staining of purified His-tag Fam69B
fusion protein separated by SDS-PAGE; Right panel: WB on the
recombinant Fam69B protein stained with anti-Fam69B antibody. Arrow
marks the protein band of the expected size. Molecular weight
markers are reported on the left.
[0143] FIG. 26. Staining of ovary tumor TMA with anti-Fam69B
antibodies
[0144] In the case of the ovarian sample, both tumor and normal
cells are represented on the same image. The antibody stains
specifically tumor cells (in dark gray); negative or poor staining
is visible in normal cells.
[0145] FIG. 27. Analysis of purified EMID1 recombinant protein
[0146] Left panel: Comassie staining of purified His-tag EMID1
fusion protein separated by SDS-PAGE; Right panel: WB on the
recombinant EMID1 protein stained with anti-EMID1 antibody. Arrow
marks the protein band of the expected size. Molecular weight
markers are reported on the left.
[0147] FIG. 28. Staining of ovary tumor TMA with anti-EMID1
antibodies
[0148] In the case of the ovarian sample, both tumor and normal
cells are represented on the same image. The antibody stains
specifically tumor cells (in dark gray); negative or poor staining
is visible in normal cells.
[0149] FIG. 29. Analysis of purified KLRG2 recombinant protein
[0150] Left panel: Comassie staining of purified His-tag KLRG2
fusion protein separated by SDS-PAGE; Right panel: WB on the
recombinant protein stained with anti-KLRG2 antibody. Arrow marks
the protein band of the expected size. Molecular weight markers are
reported on the left.
[0151] FIG. 30. Staining of ovary tumor TMA with anti-KLRG2
antibodies
[0152] In the case of the ovarian sample, both tumor and normal
cells are represented on the same image. The antibody stains
specifically tumor cells (in dark gray) and the stain clearly
delineates the plasma membrane; negative or poor staining is
visible in normal cells.
[0153] FIG. 31. Expression and localization of KLRG2 in tumor cell
lines
[0154] Panel A. KLRG2 expression in transfected HeLa cells. Western
blot analysis of KLRG2 expression in total protein extracts
separated by SDS-PAGE from HeLa cells (corresponding to
2.times.10.sup.5 cells) transfected with the empty pcDNA3 vector
(lane 1), with the plasmid construct encoding the isoform 2 of the
KLRG2 gene (lane 2); or with the plasmid construct encoding the
isoform1 of the KLRG2 gene (lane 3);
[0155] Panel B. KLRG2 expression in ovary-derived tumor cell lines.
Western blot analysis of KLRG2 expression in total protein extracts
separated by SDS-PAGE from OVCAR-5 (lane 1) and from OVCAR-8 tumor
cells (corresponding to 2.times.10.sup.5 cells) (lane 2);
[0156] Panel C. Surface exposure of KLRG2 in tumor cell lines. Flow
cytometry analysis of KLRG2 cell surface localization in OVCAR-8
cells stained with a negative control antibody (filled curve or
with anti-KLRG2 antibody (empty curve). X axis, Fluorescence scale;
Y axis, Cells (expressed as percentage relatively to major
peaks).
[0157] FIG. 32. KLRG2 confer malignant cell phenotypes
[0158] The proliferation and the migration/invasive phenotypes of
MCF7 cell line were assessed after transfection with KLRG2-siRNA
and a scrambled siRNA control using the MTT and the Boyden in vitro
invasion assay, respectively.
[0159] Panel A. Cell migration/invasiveness measured by the Boyden
migration assay. The graph represents the reduced
migration/invasiveness of MCF7 treated with the KLRG2 specific
siRNA. Small boxes below the columns show the visual counting of
the migrated cells.
[0160] Panel B. Cell proliferation determined by the MTT
incorporation assay. The graph represents the reduced proliferation
of the MCF7 tumor cells upon treatment with KLRG2-siRNA, as
determined by spectrophotometric reading.
[0161] FIG. 33. Analysis of purified ERMP1 recombinant protein
[0162] Left panel: Comassie staining of purified His-tag ERMP1
fusion protein separated by SDS-PAGE; Right panel: WB on the
recombinant protein stained with anti-ERMP1 antibody. Arrow marks
the protein band of the expected size. Molecular weight markers are
reported on the left.
[0163] FIG. 34. Staining of ovary tumor TMA with anti-ERMP1
antibodies
[0164] In the case of the ovarian sample, both tumor and normal
cells are represented on the same image. The antibody stains
specifically tumor cells (in dark gray) and the stain clearly
delineates the plasma membrane; negative or poor staining is
visible in normal cells.
[0165] FIG. 35. Expression and localization of ERMP1 in tumor cell
lines
[0166] Panel A. ERMP1 expression in transfected HEK-293T cells.
Western blot analysis of ERMP1 expression in total protein extracts
separated by SDS-PAGE from HEK-293T cells (corresponding to
2.times.10.sup.5 cells) transfected with the empty pcDNA3 vector
(lane 1) or with the plasmid construct encoding the ERMP1 gene
(lane 2).
[0167] Panel B. Western blot analysis of ERMP1 expression in total
protein extracts separated by SDS-PAGE from OVCAR-4 tumor cells
(corresponding to 2.times.10.sup.5 cells) (lane 1). Arrow marks the
ERMP1 band. Molecular weight markers are reported on the left.
[0168] Panel C. Flow cytometry analysis of ERMP1 cell surface
localization in OVCAR-8 tumor cells stained with a negative control
antibody (filled curve or with anti-ERMP1 antibody (empty curve). X
axis, Fluorescence scale; Y axis, Cells (expressed as % relatively
to major peaks).
[0169] FIG. 36. ERMP1 confers malignant cell phenotypes
[0170] The proliferation and the invasive properties of the MCF7
cell line were assessed after transfection with ERMP1-siRNA and a
scramble siRNA control using the MTT and the Boyden in vitro
invasion assay, respectively.
[0171] Panel A. Cell migration/invasiveness measured by the Boyden
migration assay. The graph represents the reduced
migration/invasiveness of MCF7 treated with the ERMP1-specific
siRNA. Small boxes below the columns show the visual counting of
the migrated cells.
[0172] Panel B. Cell proliferation determined by the MTT
incorporation assay. The graph represents the reduced proliferation
of the MCF7 tumor cells upon treatment with ERMP1-siRNA, as
determined by spectrophotometric reading.
[0173] FIG. 37. Analysis of purified VMO1 recombinant protein
[0174] Left panel: Comassie staining of purified His-tag VMO1
fusion protein separated by SDS-PAGE; Right panel: WB on the
purified VMO1 protein stained with a specific antibody. Arrow marks
the protein band of the expected size. Molecular weight markers are
reported on the left
[0175] FIG. 38. Staining of ovary tumor TMA with anti-VMO1
antibodies
[0176] In the case of the ovarian sample, both tumor and normal
cells are represented on the same image. The antibody stains
specifically tumor cells (in dark gray); negative or poor staining
is visible in normal cells.
[0177] The following examples further illustrate the invention.
EXAMPLES
Example 1. Generation of Recombinant Human Protein Antigens and
Antibodies to Identify Tumor Markers
[0178] Methods
[0179] The entire coding region or suitable fragments of the genes
encoding the target proteins were designed for cloning and
expression using bioinformatic tools with the human genome sequence
as template (Lindskog M et al (2005). Where present, the leader
sequence for secretion was replaced with the ATG codon to drive the
expression of the recombinant proteins in the cytoplasm of E. coli.
For cloning, genes were PCR-amplified from templates derived from
Mammalian Gene Collection (http://mgc.nci.nih.gov/) clones or from
cDNAs mixtures generated from pools of total RNA derived from Human
testis, Human placenta, Human bone marrow, Human fetal brain, using
specific primers. Clonings were designed so as to fuse a 10
histidine tag sequence at the 5' end, annealed to in house
developed vectors, derivatives of vector pSP73 (Promega) adapted
for the T4 ligation independent cloning method (Nucleic Acids Res.
1990 Oct. 25; 18(20): 6069-6074) and used to transform E. coli
NovaBlue cells recipient strain. E. coli tranformants were plated
onto selective LB plates containing 100 .mu.g/ml ampicillin (LB
Amp) and positive E. coli clones were identified by restriction
enzyme analysis of purified plasmid followed by DNA sequence
analysis. For expression, plasmids were used to transform
BL21-(DE3) E. coli cells and BL21-(DE3) E. coli cells harbouring
the plasmid were inoculated in ZYP-5052 growth medium (Studier,
2005) and grown at 37.degree. C. for 24 hours. Afterwards, bacteria
were collected by centrifugation, lysed into B-Per Reagent
containing 1 mM MgCl2, 100 units DNAse I (Sigma), and 1 mg/ml
lysozime (Sigma). After 30 min at room temperature under gentle
shaking, the lysate was clarified by centrifugation at 30.000 g for
40 min at 4.degree. C. All proteins were purified from the
inclusion bodies by resuspending the pellet coming from lysate
centrifugation in 40 mM TRIS-HCl, 1 mM TCEP
{Tris(2-carboxyethyl)-phosphine hydrochloride, Pierce} and 6M
guanidine hydrochloride, pH 8 and performing an IMAC in denaturing
conditions. Briefly, the resuspended material was clarified by
centrifugation at 30.000 g for 30 min and the supernatant was
loaded on 0.5 ml columns of Ni-activated Chelating Sepharose Fast
Flow (Pharmacia). The column was washed with 50 mM TRIS-HCl buffer,
1 mM TCEP, 6M urea, 60 mM imidazole, 0.5M NaCl, pH 8. Recombinant
proteins were eluted with the same buffer containing 500 mM
imidazole. Proteins were analysed by SDS-Page and their
concentration was determined by Bradford assay using the BIORAD
reagent (BIORAD) with a bovine serum albumin standard according to
the manufacturer's recommendations.
[0180] To generate antisera, the purified proteins were used to
immunize CD1 mice (6 week-old females, Charles River laboratories,
5 mice per group) intraperitoneally, with 3 protein doses of 20
micrograms each, at 2 week-interval. Freund's complete adjuvant was
used for the first immunization, while Freund's incomplete adjuvant
was used for the two booster doses. Two weeks after the last
immunization animals were bled and sera collected from each animal
was pooled.
[0181] Results
[0182] Gene fragments of the expected size were obtained by PCR
from specific clones of the Mammalian Gene Collection or,
alternatively, from cDNA generated from pools of total RNA derived
from Human testis, Human placenta, Human bone marrow, Human fetal
brain, using primers specific for each gene.
[0183] In particular, for the SLC39A10 gene, a DNA fragment
corresponding to nucleotides 154-1287 of the transcript
ENST00000359634 and encoding a protein of 378 residues,
corresponding to the amino acid region from 26 to 403 of
ENSP00000352656 sequence was obtained.
[0184] For the GPR107 gene, a fragment corresponding to nucleotides
291 to 968 of the transcript ENST00000347136 and encoding a protein
of 226 residues, corresponding to the amino acid region from 39 to
264 of ENSP00000336988 sequence was obtained.
[0185] For the DPY19L3 gene, a fragment corresponding to
nucleotides 158 to 463 of the transcript ENST00000392250 and
encoding a protein of 102 residues, corresponding to the amino acid
region from 1 to 102 of ENSP00000376081 sequence was obtained.
[0186] For the FLJ42986 gene, a fragment corresponding to
nucleotides 1287 to 1717 of the transcript ENST00000376826 and
encoding protein of 144 residues, corresponding to the amino acid
region from 30 to 173 of ENSP00000366022 sequence was obtained.
[0187] For the COL20A1 gene, a fragment corresponding to
nucleotides 577 to 1095 of the transcript ENST00000354338 and
encoding a protein of 173 residues, corresponding to the amino acid
region from 193 to 365 of ENSP00000346302 sequence was
obtained.
[0188] For the GLT25D2 gene, a fragment corresponding to
nucleotides 454 to 831 of the transcript ENST00000361927 and
encoding a protein of 126 residues, corresponding to the amino acid
region from 25 to 153 of ENSP00000354960 sequence was obtained.
[0189] For the SYTL3 gene, a fragment corresponding to nucleotides
267 to 569 of the transcript ENST00000360448 and encoding a protein
of 101 residues, corresponding to the amino acid region from 50 to
150 of ENSP00000353631 sequence was obtained.
[0190] For the DENND1B gene, a fragment corresponding to
nucleotides 563 to 1468 of the transcript ENST00000235453 and
encoding a protein of 302 residues, corresponding to the amino acid
region from 95 to 396 of ENSP00000235453 sequence was obtained.
[0191] For the FLJ37107 gene, a fragment corresponding to
nucleotides 661-972 of the transcript
gi|58218993|ref|NM_001010882.1 and encoding a protein of 104
residues, corresponding to the amino acid region from 1 to 104 of
gi|58218994|ref|NP_001010882.1 sequence was obtained.
[0192] For the C6orf98 gene, a fragment corresponding to
nucleotides 67 to 396 of the transcript ENST00000409023 and
encoding a protein of 110 residues, corresponding to the amino acid
region from 22 to 132 of ENSP00000386324 sequence was obtained.
[0193] For the Fam69B gene, a fragment corresponding to nucleotides
233 to 688 of the transcript ENST00000371692 and encoding a protein
of 152 residues, corresponding to the amino acid region from 49 to
200 of ENSP00000360757 sequence was obtained.
[0194] For the EMID1 gene, a fragment corresponding to nucleotides
203 to 670 of the transcript OTTHUMT00000075712 and encoding a
protein of 156 residues, corresponding to the amino acid region
from 33 to 188 of OTTHUMP00000028901 sequence was obtained.
[0195] For the KLRG2 gene, a fragment corresponding to nucleotides
70 to 849 of the transcript ENST00000340940 and encoding a protein
of 260 residues, corresponding to the amino acid region from 1 to
260 of ENSP00000339356 sequence was obtained.
[0196] For the ERMP1 gene, a fragment corresponding to nucleotides
55 to 666 of the transcript ENST00000339450 and encoding a protein
of 204 residues, corresponding to the amino acid region from 1 to
204 of ENSP00000340427 sequence was obtained.
[0197] For the VMO1 gene, a fragment corresponding to nucleotides
157 to 690 of the transcript ENST00000328739 and encoding a protein
of 178 residues corresponding to the amino acid region from 25-202
of ENSP00000328397 sequence was obtained.
[0198] A clone encoding the correct amino acid sequence was
identified for each gene/gene fragment and, upon expression in E.
coli, a protein of the correct size was produced and subsequently
purified using affinity chromatography (FIGS. 1; 5; 8; 11; 13; 15;
17; 19; 21; 23; 25, 27, 29, 33, 37, left panels). As shown in the
figures, in some case SDS-PAGE analysis of affinity-purified
recombinant proteins revealed the presence of extra bands, of
either higher and/or lower masses. Mass spectrometry analysis
confirmed that they corresponded to either aggregates or
degradation products of the protein under analysis.
[0199] Antibodies generated by immunization specifically recognized
their target proteins in Western blot (WB) (FIGS. 1; 5; 8; 11; 13;
15; 17; 19; 21; 23; 25, 27, 29, 33, 37, right panels).
Example 2. Tissue Profiling by Immune-Histochemistry
[0200] Methods
[0201] The analysis of the antibodies' capability to recognize
their target proteins in tumor samples was carried out by Tissue
Micro Array (TMA), a miniaturized immuno-histochemistry technology
suitable for HTP analysis that allows to analyse the antibody
immuno-reactivity simultaneously on different tissue samples
immobilized on a microscope slide.
[0202] Since the TMAs include both tumor and healthy tissues, the
specificity of the antibodies for the tumors can be immediately
appreciated. The use of this technology, differently from
approaches based on transcription profile, has the important
advantage of giving a first hand evaluation on the potential of the
markers in clinics. Conversely, since mRNA levels not always
correlate with protein levels (approx. 50% correlation), studies
based on transcription profile do not provide solid information
regarding the expression of protein markers.
[0203] A tissue microarray was prepared containing formalin-fixed
paraffin-embedded cores of human tissues from patients affected by
ovary cancer and analyzed using the specific antibody sample. In
total, the TMA design consisted in 20 ovary tissue samples from 5
well pedigreed patients, each including both tumor and normal
epithelium (equal to four tumor samples from each patient) to
identify promising target molecules differentially expressed in
cancer and normal cells. The direct comparison between tumor and
normal epithelium of each patient allowed the identification of
antibodies that stain specifically tumor cells and provided
indication of target expression in ovary tumor. To confirm the
association of each protein with ovary tumors a tissue microarray
was prepared containing 100 formalin-fixed paraffin-embedded cores
of human ovary tissues from 50 patients (equal to two tissue
samples from each patient).
[0204] All formalin fixed, paraffin embedded tissues used as donor
blocks for TMA production were selected from the archives at the
IEO (Istituto Europeo Oncologico, Milan). Corresponding whole
tissue sections were examined to confirm diagnosis and tumour
classification, and to select representative areas in donor blocks.
Normal tissues were defined as microscopically normal
(non-neoplastic) and were generally selected from specimens
collected from the vicinity of surgically removed tumors. The TMA
production was performed essentially as previously described
(Kononen J et al (1998) Nature Med. 4:844-847; Kallioniemi O P et
a/(2001) Hum. MoI. Genet. 10:657-662). Briefly, a hole was made in
the recipient TMA block. A cylindrical core tissue sample (1 mm in
diameter) from the donor block was acquired and deposited in the
recipient TMA block. This was repeated in an automated tissue
arrayer "Galileo TMA CK 3500" BioRep (Milan) until a complete TMA
design was produced. TMA recipient blocks were baked at
42<0>C for 2 h prior to sectioning. The TMA blocks were
sectioned with 2-3 mm thicknes using a waterfall microtome (Leica),
and placed onto poli-L-lysinated glass slides for
immunohistochemical analysis. Automated immunohistochemistry was
performed as previously described (Kampf C. et al (2004) Clin.
Proteomics 1:285-300). In brief, the glass slides were incubated
for 30' min in 60.degree. C., de-paraffinized in xylene (2.times.15
min) using the Bio-Clear solution (Midway. Scientific, Melbourne,
Australia), and re-hydrated in graded alcohols. For antigen
retrieval, slides were immersed 0.01 M Na-citrate buffer, pH 6.0 at
99.degree. C. for 30 min Slides were placed in the Autostainer.RTM.
(DakoCytomation) and endogenous peroxidase was initially blocked
with 3% H2O2, for 5 min. Slides were then blocked in Dako
Cytomation Wash Buffer containing 5% Bovine serum albumin (BSA) and
subsequently incubated with mouse antibodies for 30' (dilution
1:200 in Dako Real.TM. dilution buffer). After washing with
DakoCytomation wash buffer, slides were incubated with the goat
anti-mouse peroxidase conjugated Envision.RTM. for 30 min each at
room temperature (DakoCytomation). Finally, diaminobenzidine
(DakoCytomation) was used as chromogen and Harris hematoxylin
(Sigma-Aldrich) was used for counterstaining. The slides were
mounted with Pertex.RTM. (Histolab).
[0205] The staining results have been evaluated by a trained
pathologist at the light microscope, and scored according to both
the percentage of immunostained cells and the intensity of
staining. The individual values and the combined score (from 0 to
300) were recorded in a custom-tailored database. Digital images of
the immunocytochemical findings have been taken at a Leica DM LB
light microscope, equipped with a Leica DFC289 color camera.
[0206] Results
[0207] The results from tissue profiling showed that the antibodies
specific for the recombinant proteins (see Example 1) are strongly
immunoreactive on ovary cancer tissues from patients with varying
frequencies, indicating the presence of the target proteins in
tumors tissues, while no or poor reactivity was detected in normal
tissues.
[0208] Based on this finding, the detection of target proteins in
tissue samples can be associated with the specific tumor/s. In some
cases the antibody staining accumulated at the plasma membrane of
tumor cells, providing a first indication of the cell localization
of target proteins. The capability of target-specific antibodies to
stain different tumor tissues is summarized in Table I.
Representative examples of microscopic enlargements of tissue
samples stained by each antibody are reported in FIGS. 2; 6; 9; 12;
14; 16; 18, 20; 22; 24; 26, 28, 30, 34, 38.
[0209] Table I reports the percentage of positive ovarian tumor
samples after staining with the target-specific antibodies
TABLE-US-00002 TABLE I Percentage of ovary tumor samples Gene
showing positive IHC staining SLC39A10 80 GPR107 40 DPY19L3 93
FLJ42986 80 COL20A1 20 GLT25D2 20 SYTL3 60 DENND1B 20 FLJ3710 80
C6orf98 100 Fam69B 73 EMID1 20* KLRG2 29** ERMP1 69** VMO1 20**
*The antibody specifically stains secretion products of ovary tumor
cells indicating that the corresponding protein is released by
tumor cells. **The antibody stains the plasma membrane of tumor
cells
Example 3. Expression of Target Proteins in Transfected Mammalian
Cells
[0210] Methods
[0211] The specificity of the antibodies for each target proteins
was assessed by western blot and/or confocal microscopy analysis of
eukaryotic cells transiently transfected with a plasmid construct
containing the complete sequence of the gene encoding the target
proteins. Examples of this type of confocal microscopy experiments
are represented for SLC39A10 (corresponding to Transcript ID
ENST00000359634), KLRG2 (two cloned sequences corresponding to
Transcripts ENST00000340940 and ENST00000393039, corresponding to
two transcript variants), ERMP1 (cloned sequence corresponding to
Transcripts ENST00000339450).
[0212] For cloning, cDNA were generated from pools of total RNA
derived from Human testis, Human placenta, Human bone marrow, Human
fetal brain, in reverse transcription reactions and the entire
coding regions were PCR-amplified with specific primers pairs. PCR
products were cloned into plasmid pcDNA3 (Invitrogen). HeLa and
Hek-293T cells were grown in DMEM-10% FCS supplemented with 1 mM
Glutamine were transiently transfected with preparation of the
resulting plasmid and with the empty vector as negative control
using the Lipofectamine-2000 transfection reagent (Invitrogen).
After 48 hours, cells were collected and analysed by Western blot
or confocal microscopy. For Western blot, cells were lysed with PBS
buffer containing 1% Triton X100 and total cell extracts
(corresponding to 1.times.10.sup.6 cells) were separated on
pre-cast SDS-PAGE gradient gels (NuPage 4-12% Bis-Tris gel,
Invitrogen) under reducing conditions, followed by electro-transfer
to nitrocellulose membranes (Invitrogen) according to the
manufacturer's recommendations. The membranes were blocked in
blocking buffer composed of 1.times.PBS-0.1% Tween 20 (PBST) added
with 10% dry milk, for 1 h at room temperature, incubated with the
antibody diluted 1:2500 in blocking buffer containing 1% dry milk
and washed in PBST-1%. The secondary HRP-conjugated antibody (goat
anti-mouse immunoglobulin/HRP, Perkin Elmer) was diluted 1:5000 in
blocking buffer and chemiluminescence detection was carried out
using a Chemidoc-IT UVP CCD camera (UVP) and the Western
Lightning.TM. cheminulescence Reagent Plus (Perkin Elmer),
according to the manufacturer's protocol.
[0213] For confocal microscopy analysis, the cells were plated on
glass cover slips and after 48 h were washed with PBS and fixed
with 3% p-formaldheyde solution in PBS for 20 min at RT. For
surface staining, cells were incubated overnight at 4.degree. C.
with polyclonal antibodies (1:200). The cells were then stained
with Alexafluor 488-labeled goat anti-mouse antibodies (Molecular
Probes). DAPI (Molecular Probes) was used to visualize nuclei;
Live/Dead.RTM. red fixable (Molecular Probes) was used to visualize
membrane. The cells were mounted with glycerol plastine and
observed under a laser-scanning confocal microscope (LeicaSP5).
[0214] Results
[0215] The selected coding sequences for SLC39A10, KLRG2 and ERMP1
were cloned in a eukaryotic expression vector and the derived
plasmids were used for transient transfection of HeLa or HEK293T
cell lines. Expression of target proteins KLRG2 and ERMP1 was
detected in total protein extracts from HeLa and HEK-293T cells,
respectively, by Western blot using the protein-specific
antibodies. Overall the data confirmed that the marker-specific
antibodies recognized specifically their target proteins while no
or marginal staining was visible in cell transfected with the empty
pcDNA3. In the case of KLRG2, specific protein bands of expected
size were detected in cells transfected with either of the two
plasmids encoding the two annotated KLRG2 variants (FIG. 31A). As
for cells transfected with ERMP1-encoding plasmid, a band of
molecular mass higher than expected (theoretical mass of
approximately 100 KDa) was specifically detected by the anti-ERMP1
antibody in transfected cells indicating that the protein forms
stable aggregates (FIG. 35A). Expression of protein SLC3910 was
carried by confocal microscopy of transfected HeLa cells. The
anti-SLC39A10 antibody specifically detected its target protein
expressed by transfected cells. In particular, the antibody mainly
stained the surface of transfected cells (FIG. 3).
Example 4. Expression and Localization of Target Proteins in Tumor
Cell Lines
[0216] Expression of marker genes was assessed by WB and/or flow
cytometry analysis of tumor cell lines. Cells were cultured in
under ATCC recommended conditions, and sub-confluent cell
monolayers were detached with PBS-0.5 mM EDTA for subsequent
analysis. For Western blot analysis, cells were lysed by several
freeze-thaw passages in PBS-1% Triton. Total protein extracts were
loaded on SDS-PAGE (2.times.10.sup.5 cells/lane), and subjected to
WB with specific antibodies as described above. For flow cytometry
analysis, cells (2.times.10.sup.4 per well) were pelletted in 96
U-bottom microplates by centrifugation at 200.times.g for 5 min at
4.degree. C. and incubated for 1 hour at 4.degree. C. with the
appropriate dilutions of the marker-specific antibodies. Cells were
washed twice in PBS-5% FCS and incubated for 20 min with the
appropriate dilution of R-Phycoerythrin (PE)-conjugated secondary
antibodies (Jackson Immuno Research, PA, USA) at 4.degree. C. After
washing, cells were analysed by a FACS Canto II flow cytometer
(Becton Dickinson). Data were analyzed with FlowJo 8.3.3
program.
[0217] Results
[0218] Western blot analysis of tumor cells lines is represented
for KLRG2, ERMP1 and DPY19L3. Concerning KLRG2, two major protein
bands were detected in the ovary tumor cell lines OVCAR-5 and
OVCAR-8, that could be ascribed to the annotated protein variants
(FIG. 31B). As regards ERMP1 a band of high molecular mass was
detected in the OVCAR-4 cell line (FIG. 35B) showing an
electrophoretic pattern similar to that reported in transfected
cells (see previous example). This further confirms the existence
of stable aggregates for this protein As for DPY19L3, a band of
expected size was detected by the antibody on a panel of tumor
cells lines (FIG. 10, left panel). Expression and localization
analysis was also performed by surface staining and flow cytometry
analysis of tumor cell lines. Results, represented for SLC39A10,
KLRG2, DPY10L3 and ERMP1, show that these proteins were detected on
the surface of ovary tumor cell lines by the marker-specific
antibodies (FIGS. 4, 31C, 35C and FIG. 10 right panel).
Example 5. Expression of the Marker Proteins Confer Malignant Cell
Phenotype
[0219] To verify that the proteins included in the present
invention can be exploited as targets for therapeutic applications,
the effect of marker depletion was evaluated in vitro in cellular
studies generally used to define the role of newly discovered
proteins in tumor development. Marker-specific knock-down and
control tumor cell lines were assayed for their proliferation and
the migration/invasive phenotypes using the MTT assays and the
Boyden in vitro invasion assay, respectively.
[0220] Method
[0221] Expression of marker genes were silenced in tumor cell lines
by the siRNA technology and the influence of the reduction of
marker expression on cell parameters relevant for tumor development
was assessed in in vitro assays. The expression of marker genes was
knocked down in a panel of epithelial tumor cell lines previously
shown to express the tumor markers using a panel of marker-specific
siRNAs (whose target sequence is reported in Table II) using the
HiPerfect transfection reagent (QIAGEN) following the
manufacturer's protocol. As control, cells treated with irrelevant
siRNA (scrambled siRNA) were analysed in parallel. At different
time points (ranging from 24 to 72 hours) post transfection, the
reduction of gene transcription was assessed by quantitative RT-PCR
(Q-RT-PCR) on total RNA, by evaluating the relative marker
transcript level, using the beta-actin, GAPDH or MAPK genes as
internal normalization control. Afterwards, cell proliferation and
migration/invasiveness assays were carried out to assess the effect
of the reduced marker expression. Cell proliferation was determined
using the MTT assay, a colorimetric assay based on the cellular
conversion of a tetrazolium salt into a purple colored formazan
product. Absorbance of the colored solution can be quantified using
a spectrophotometer to provide an estimate of the number of
attached living cells. Approximately 5.times.10.sup.3 cells/100
.mu.l were seeded in 96-well plates in DMEM with 10% FCS to allow
cell attachment. After overnight incubation with DMEM without FCS,
the cells were treated with 2.5% FBS for 72 hours. Four hours
before harvest 15 .mu.L of the MTT dye solution (Promega) were
added to each well. After 4-hour incubation at 37.degree. C., the
formazan precipitates were solubilized by the addition of 100 .mu.L
of solubilization solution (Promega) for 1 h at 37.degree. C.
Absorbance at 570 nm was determined on a multiwell plate reader
(SpectraMax, Molecular Devices).
[0222] Cell migration/invasiveness was tested using the Boyden in
vitro invasion assay, as compared to control cell lines treated
with a scramble siRNA. This assay is based on a chamber of two
medium-filled compartments separated by a microporous membrane.
Cells are placed in the upper compartment and are allowed to
migrate through the pores of the membrane into the lower
compartment, in which chemotactic agents are present. After an
appropriate incubation time, the membrane between the two
compartments is fixed and stained, and the number of cells that
have migrated to the lower side of the membrane is determined. For
this assay, a transwell system, equipped with 8-.mu.m pore
polyvinylpirrolidone-free polycarbonate filters, was used. The
upper sides of the porous polycarbonate filters were coated with 50
.mu.g/cm.sup.2 of reconstituted Matrigel basement membrane and
placed into six-well culture dishes containing complete growth
medium. Cells (1.times.10.sup.4 cells/well) were loaded into the
upper compartment in serum-free growth medium. After 16 h of
incubation at 37.degree. C., non-invading cells were removed
mechanically using cotton swabs, and the microporous membrane was
stained with Diff-Quick solution. Chemotaxis was evaluated by
counting the cells migrated to the lower surface of the
polycarbonate filters (six randomly chosen fields, mean.+-.SD).
[0223] Results
[0224] Examples of this analysis are reported for ERMP1 and KLRG2
in MCF7 tumor cell line. Gene silencing experiments with
marker-specific siRNA reduced the marker transcripts (approximately
30-40 fold reduction), as determined by Q-RT_PCR. Table II below
reports the sequences targeted by the siRNA molecules. The
reduction of the expression of either of the two genes expression
significantly impairs the proliferation and the invasive phenotype
of the MCF7 tumor cell line (FIGS. 32 and 36). This indicates that
the proteins are involved in tumor development and are therefore
likely targets for the development of anti-cancer therapies.
TABLE-US-00003 TABLE II Sequences of the marker-specific siRNA NCBI
gene siRNA Target Sequence KLRG2 CGAGGACAATCTGGATATCAA
CTGGAGCCCTCGAGCAAGAAA ERMP1 CCCGTGGTTCATCTGATATAA
AAGGACTTTGCTCGGCGTTTA TACGTGGATGTTTGTAACGTA
CTCGTATTGGCTCAATCATAA
REFERENCES
[0225] 1) Anderson, L., and Seilhamer, J. (1997). A comparison of
selected mRNA and protein abundances in human liver.
Electrophoresis 18, 533-537; [0226] 2) Chen, G., Gharib, T. G.,
Wang, H., Huang, C. C., Kuick, R., Thomas, D. G., Shedden, K. A.,
Misek, D. E., Taylor, J. M., Giordano, T. J., Kardia, S. L.,
Iannettoni, M. D., Yee, J., Hogg, P. J., Orringer, M. B., Hanash,
S. M., and Beer, D. G. (2003) Protein profiles associated with
survival in lung adenocarcinoma. Proc. Natl. Acad. Sci. U. S. A
100, 13537-13542; [0227] 3) Ginestier, C., Charafe-Jauffret, E.,
Bertucci, F., Eisinger, F., Geneix, J., Bechlian, D., Conte, N.,
Adelaide, J., Toiron, Y., Nguyen, C., Viens, P., Mozziconacci, M.
J., Houlgatte, R., Birnbaum, D., and Jacquemier, J. (2002) Distinct
and complementary information provided by use of tissue and DNA
microarrays in the study of breast tumor markers. Am. J. Pathol.
161, 1223-1233; [0228] 4) Gygi, S. P., Rochon, Y., Franza, B. R.,
and Aebersold, R. (1999) Correlation between protein and mRNA
abundance in yeast. Mol. Cell. Biol. 19, 1720-1730; Nishizuka, S.,
Charboneau, L., Young, L., Major, S., Reinhold, W. C., Waltham, M.,
Kouros-Mehr, H., Bussey, K. J., Lee, J. K., Espina, V., Munson, P.
J., Petricoin, E., III, Liotta, L. A., and Weinstein, J. N. (2003)
Proteomic profiling of the NCI-60 cancer cell lines using new
high-density reverse-phase lysate microarrays. Proc. Natl. Acad.
Sci. U. S. A 100, 14229-14234; [0229] 5) Tyers, M., and Mann, M.
(2003) From genomics to proteomics. Nature 422, 193-197; [0230] 6)
Kagara N, Tanaka N, Noguchi S, Hirano T. (2007) Zinc and its
transporter ZIP10 are involved in invasive behavior of breast
cancer cells. Cancer Sci. 98:692-697; [0231] 7) Kasper G, Weiser A
A, Rump A, Sparbier K, Dahl E, Hartmann A, Wild P, Schwidetzky U,
Castanos-Velez E, Lehmann K. (2005) Expression levels of the
putative zinc transporter LIV-1 are associated with a better
outcome of breast cancer patients. Int J Cancer. 117:961-973;
[0232] 8) Sladek R, Rocheleau G, Rung J, Dina C, Shen L, Serre D,
Boutin P, Vincent D, Belisle A, Hadjadj S, Balkau B, Heude B,
Charpentier G, Hudson T J, Montpetit A, Pshezhetsky A V, Prentki M,
Posner B I, Balding D J, Meyre D, Polychronakos C, Froguel P.
(2007) A genome-wide association study identifies novel risk loci
for type 2 diabetes. Nature. 445:881-885; [0233] 9) Garcia-Rudaz,
C., Luna, F., Tapia, V., Kerr, B., Colgin, L., Galimi, F., Dissen,
G. A., Rawlings, N. D. and Ojeda, S. R. (2007) Fxna, a novel gene
differentially expressed in the rat ovary at the time of
folliculogenesis, is required for normal ovarian histogenesis.
Development. 134, 945-957; [0234] 10) Nicholas B, Skipp P, Mould R,
Rennard S, Davies D E, O'Connor C D, Djukanovi R. (2006) Shotgun
proteomic analysis of human-induced sputum. (2006) Proteomics.
6:4390-4401; [0235] 11) Adachi J, Kumar C, Zhang Y, Olsen J V, Mann
M. The human urinary proteome contains more than 1500 proteins
including a large proportion of membranes proteins. (2006) Genome
Biol. 7:R80.
Sequence CWU 1
1
1081831PRTHomo sapiens 1Met Lys Val His Met His Thr Lys Phe Cys Leu
Ile Cys Leu Leu Thr 1 5 10 15 Phe Ile Phe His His Cys Asn His Cys
His Glu Glu His Asp His Gly 20 25 30 Pro Glu Ala Leu His Arg Gln
His Arg Gly Met Thr Glu Leu Glu Pro 35 40 45 Ser Lys Phe Ser Lys
Gln Ala Ala Glu Asn Glu Lys Lys Tyr Tyr Ile 50 55 60 Glu Lys Leu
Phe Glu Arg Tyr Gly Glu Asn Gly Arg Leu Ser Phe Phe 65 70 75 80 Gly
Leu Glu Lys Leu Leu Thr Asn Leu Gly Leu Gly Glu Arg Lys Val 85 90
95 Val Glu Ile Asn His Glu Asp Leu Gly His Asp His Val Ser His Leu
100 105 110 Asp Ile Leu Ala Val Gln Glu Gly Lys His Phe His Ser His
Asn His 115 120 125 Gln His Ser His Asn His Leu Asn Ser Glu Asn Gln
Thr Val Thr Ser 130 135 140 Val Ser Thr Lys Arg Asn His Lys Cys Asp
Pro Glu Lys Glu Thr Val 145 150 155 160 Glu Val Ser Val Lys Ser Asp
Asp Lys His Met His Asp His Asn His 165 170 175 Arg Leu Arg His His
His Arg Leu His His His Leu Asp His Asn Asn 180 185 190 Thr His His
Phe His Asn Asp Ser Ile Thr Pro Ser Glu Arg Gly Glu 195 200 205 Pro
Ser Asn Glu Pro Ser Thr Glu Thr Asn Lys Thr Gln Glu Gln Ser 210 215
220 Asp Val Lys Leu Pro Lys Gly Lys Arg Lys Lys Lys Gly Arg Lys Ser
225 230 235 240 Asn Glu Asn Ser Glu Val Ile Thr Pro Gly Phe Pro Pro
Asn His Asp 245 250 255 Gln Gly Glu Gln Tyr Glu His Asn Arg Val His
Lys Pro Asp Arg Val 260 265 270 His Asn Pro Gly His Ser His Val His
Leu Pro Glu Arg Asn Gly His 275 280 285 Asp Pro Gly Arg Gly His Gln
Asp Leu Asp Pro Asp Asn Glu Gly Glu 290 295 300 Leu Arg His Thr Arg
Lys Arg Glu Ala Pro His Val Lys Asn Asn Ala 305 310 315 320 Ile Ile
Ser Leu Arg Lys Asp Leu Asn Glu Asp Asp His His His Glu 325 330 335
Cys Leu Asn Val Thr Gln Leu Leu Lys Tyr Tyr Gly His Gly Ala Asn 340
345 350 Ser Pro Ile Ser Thr Asp Leu Phe Thr Tyr Leu Cys Pro Ala Leu
Leu 355 360 365 Tyr Gln Ile Asp Ser Arg Leu Cys Ile Glu His Phe Asp
Lys Leu Leu 370 375 380 Val Glu Asp Ile Asn Lys Asp Lys Asn Leu Val
Pro Glu Asp Glu Ala 385 390 395 400 Asn Ile Gly Ala Ser Ala Trp Ile
Cys Gly Ile Ile Ser Ile Thr Val 405 410 415 Ile Ser Leu Leu Ser Leu
Leu Gly Val Ile Leu Val Pro Ile Ile Asn 420 425 430 Gln Gly Cys Phe
Lys Phe Leu Leu Thr Phe Leu Val Ala Leu Ala Val 435 440 445 Gly Thr
Met Ser Gly Asp Ala Leu Leu His Leu Leu Pro His Ser Gln 450 455 460
Gly Gly His Asp His Ser His Gln His Ala His Gly His Gly His Ser 465
470 475 480 His Gly His Glu Ser Asn Lys Phe Leu Glu Glu Tyr Asp Ala
Val Leu 485 490 495 Lys Gly Leu Val Ala Leu Gly Gly Ile Tyr Leu Leu
Phe Ile Ile Glu 500 505 510 His Cys Ile Arg Met Phe Lys His Tyr Lys
Gln Gln Arg Gly Lys Gln 515 520 525 Lys Trp Phe Met Lys Gln Asn Thr
Glu Glu Ser Thr Ile Gly Arg Lys 530 535 540 Leu Ser Asp His Lys Leu
Asn Asn Thr Pro Asp Ser Asp Trp Leu Gln 545 550 555 560 Leu Lys Pro
Leu Ala Gly Thr Asp Asp Ser Val Val Ser Glu Asp Arg 565 570 575 Leu
Asn Glu Thr Glu Leu Thr Asp Leu Glu Gly Gln Gln Glu Ser Pro 580 585
590 Pro Lys Asn Tyr Leu Cys Ile Glu Glu Glu Lys Ile Ile Asp His Ser
595 600 605 His Ser Asp Gly Leu His Thr Ile His Glu His Asp Leu His
Ala Ala 610 615 620 Ala His Asn His His Gly Glu Asn Lys Thr Val Leu
Arg Lys His Asn 625 630 635 640 His Gln Trp His His Lys His Ser His
His Ser His Gly Pro Cys His 645 650 655 Ser Gly Ser Asp Leu Lys Glu
Thr Gly Ile Ala Asn Ile Ala Trp Met 660 665 670 Val Ile Met Gly Asp
Gly Ile His Asn Phe Ser Asp Gly Leu Ala Ile 675 680 685 Gly Ala Ala
Phe Ser Ala Gly Leu Thr Gly Gly Ile Ser Thr Ser Ile 690 695 700 Ala
Val Phe Cys His Glu Leu Pro His Glu Leu Gly Asp Phe Ala Val 705 710
715 720 Leu Leu Lys Ala Gly Met Thr Val Lys Gln Ala Ile Val Tyr Asn
Leu 725 730 735 Leu Ser Ala Met Met Ala Tyr Ile Gly Met Leu Ile Gly
Thr Ala Val 740 745 750 Gly Gln Tyr Ala Asn Asn Ile Thr Leu Trp Ile
Phe Ala Val Thr Ala 755 760 765 Gly Met Phe Leu Tyr Val Ala Leu Val
Asp Met Leu Pro Glu Met Leu 770 775 780 His Gly Asp Gly Asp Asn Glu
Glu His Gly Phe Cys Pro Val Gly Gln 785 790 795 800 Phe Ile Leu Gln
Asn Leu Gly Leu Leu Phe Gly Phe Ala Ile Met Leu 805 810 815 Val Ile
Ala Leu Tyr Glu Asp Lys Ile Val Phe Asp Ile Gln Phe 820 825 830
2831PRTHomo sapiens 2Met Lys Val His Met His Thr Lys Phe Cys Leu
Ile Cys Leu Leu Thr 1 5 10 15 Phe Ile Phe His His Cys Asn His Cys
His Glu Glu His Asp His Gly 20 25 30 Pro Glu Ala Leu His Arg Gln
His Arg Gly Met Thr Glu Leu Glu Pro 35 40 45 Ser Lys Phe Ser Lys
Gln Ala Ala Glu Asn Glu Lys Lys Tyr Tyr Ile 50 55 60 Glu Lys Leu
Phe Glu Arg Tyr Gly Glu Asn Gly Arg Leu Ser Phe Phe 65 70 75 80 Gly
Leu Glu Lys Leu Leu Thr Asn Leu Gly Leu Gly Glu Arg Lys Val 85 90
95 Val Glu Ile Asn His Glu Asp Leu Gly His Asp His Val Ser His Leu
100 105 110 Asp Ile Leu Ala Val Gln Glu Gly Lys His Phe His Ser His
Asn His 115 120 125 Gln His Ser His Asn His Leu Asn Ser Glu Asn Gln
Thr Val Thr Ser 130 135 140 Val Ser Thr Lys Arg Asn His Lys Cys Asp
Pro Glu Lys Glu Thr Val 145 150 155 160 Glu Val Ser Val Lys Ser Asp
Asp Lys His Met His Asp His Asn His 165 170 175 Arg Leu Arg His His
His Arg Leu His His His Leu Asp His Asn Asn 180 185 190 Thr His His
Phe His Asn Asp Ser Ile Thr Pro Ser Glu Arg Gly Glu 195 200 205 Pro
Ser Asn Glu Pro Ser Thr Glu Thr Asn Lys Thr Gln Glu Gln Ser 210 215
220 Asp Val Lys Leu Pro Lys Gly Lys Arg Lys Lys Lys Gly Arg Lys Ser
225 230 235 240 Asn Glu Asn Ser Glu Val Ile Thr Pro Gly Phe Pro Pro
Asn His Asp 245 250 255 Gln Gly Glu Gln Tyr Glu His Asn Arg Val His
Lys Pro Asp Arg Val 260 265 270 His Asn Pro Gly His Ser His Val His
Leu Pro Glu Arg Asn Gly His 275 280 285 Asp Pro Gly Arg Gly His Gln
Asp Leu Asp Pro Asp Asn Glu Gly Glu 290 295 300 Leu Arg His Thr Arg
Lys Arg Glu Ala Pro His Val Lys Asn Asn Ala 305 310 315 320 Ile Ile
Ser Leu Arg Lys Asp Leu Asn Glu Asp Asp His His His Glu 325 330 335
Cys Leu Asn Val Thr Gln Leu Leu Lys Tyr Tyr Gly His Gly Ala Asn 340
345 350 Ser Pro Ile Ser Thr Asp Leu Phe Thr Tyr Leu Cys Pro Ala Leu
Leu 355 360 365 Tyr Gln Ile Asp Ser Arg Leu Cys Ile Glu His Phe Asp
Lys Leu Leu 370 375 380 Val Glu Asp Ile Asn Lys Asp Lys Asn Leu Val
Pro Glu Asp Glu Ala 385 390 395 400 Asn Ile Gly Ala Ser Ala Trp Ile
Cys Gly Ile Ile Ser Ile Thr Val 405 410 415 Ile Ser Leu Leu Ser Leu
Leu Gly Val Ile Leu Val Pro Ile Ile Asn 420 425 430 Gln Gly Cys Phe
Lys Phe Leu Leu Thr Phe Leu Val Ala Leu Ala Val 435 440 445 Gly Thr
Met Ser Gly Asp Ala Leu Leu His Leu Leu Pro His Ser Gln 450 455 460
Gly Gly His Asp His Ser His Gln His Ala His Gly His Gly His Ser 465
470 475 480 His Gly His Glu Ser Asn Lys Phe Leu Glu Glu Tyr Asp Ala
Val Leu 485 490 495 Lys Gly Leu Val Ala Leu Gly Gly Ile Tyr Leu Leu
Phe Ile Ile Glu 500 505 510 His Cys Ile Arg Met Phe Lys His Tyr Lys
Gln Gln Arg Gly Lys Gln 515 520 525 Lys Trp Phe Met Lys Gln Asn Thr
Glu Glu Ser Thr Ile Gly Arg Lys 530 535 540 Leu Ser Asp His Lys Leu
Asn Asn Thr Pro Asp Ser Asp Trp Leu Gln 545 550 555 560 Leu Lys Pro
Leu Ala Gly Thr Asp Asp Ser Val Val Ser Glu Asp Arg 565 570 575 Leu
Asn Glu Thr Glu Leu Thr Asp Leu Glu Gly Gln Gln Glu Ser Pro 580 585
590 Pro Lys Asn Tyr Leu Cys Ile Glu Glu Glu Lys Ile Ile Asp His Ser
595 600 605 His Ser Asp Gly Leu His Thr Ile His Glu His Asp Leu His
Ala Ala 610 615 620 Ala His Asn His His Gly Glu Asn Lys Thr Val Leu
Arg Lys His Asn 625 630 635 640 His Gln Trp His His Lys His Ser His
His Ser His Gly Pro Cys His 645 650 655 Ser Gly Ser Asp Leu Lys Glu
Thr Gly Ile Ala Asn Ile Ala Trp Met 660 665 670 Val Ile Met Gly Asp
Gly Ile His Asn Phe Ser Asp Gly Leu Ala Ile 675 680 685 Gly Ala Ala
Phe Ser Ala Gly Leu Thr Gly Gly Ile Ser Thr Ser Ile 690 695 700 Ala
Val Phe Cys His Glu Leu Pro His Glu Leu Gly Asp Phe Ala Val 705 710
715 720 Leu Leu Lys Ala Gly Met Thr Val Lys Gln Ala Ile Val Tyr Asn
Leu 725 730 735 Leu Ser Ala Met Met Ala Tyr Ile Gly Met Leu Ile Gly
Thr Ala Val 740 745 750 Gly Gln Tyr Ala Asn Asn Ile Thr Leu Trp Ile
Phe Ala Val Thr Ala 755 760 765 Gly Met Phe Leu Tyr Val Ala Leu Val
Asp Met Leu Pro Glu Met Leu 770 775 780 His Gly Asp Gly Asp Asn Glu
Glu His Gly Phe Cys Pro Val Gly Gln 785 790 795 800 Phe Ile Leu Gln
Asn Leu Gly Leu Leu Phe Gly Phe Ala Ile Met Leu 805 810 815 Val Ile
Ala Leu Tyr Glu Asp Lys Ile Val Phe Asp Ile Gln Phe 820 825 830
35227DNAHomo sapiens 3cacgatttgg tgcagccggg gtttggtacc gagcggagag
gagatgcaca cggcactcga 60gtgtgaggaa aaatagaaat gaaggtacat atgcacacaa
aattttgcct catttgtttg 120ctgacattta tttttcatca ttgcaaccat
tgccatgaag aacatgacca tggccctgaa 180gcgcttcaca gacagcatcg
tggaatgaca gaattggagc caagcaaatt ttcaaagcaa 240gctgctgaaa
atgaaaaaaa atactatatt gaaaaacttt ttgagcgtta tggtgaaaat
300ggaagattat ccttttttgg tttggagaaa cttttaacaa acttgggcct
tggagagaga 360aaagtagttg agattaatca tgaggatctt ggccacgatc
atgtttctca tttagatatt 420ttggcagttc aagagggaaa gcattttcac
tcacataacc accagcattc ccataatcat 480ttaaattcag aaaatcaaac
tgtgaccagt gtatccacaa aaagaaacca taaatgtgat 540ccagagaaag
agacagttga agtgtctgta aaatctgatg ataaacatat gcatgaccat
600aatcaccgcc tacgtcatca ccatcgtttg catcatcatc ttgatcataa
caacactcac 660cattttcata atgattccat tactcccagt gagcgtgggg
agcctagcaa tgaaccttca 720acagagacca ataaaaccca ggaacaatct
gatgttaaac taccgaaagg aaagaggaag 780aaaaaaggga ggaaaagtaa
tgaaaattct gaggttatta caccaggttt tccccctaac 840catgatcagg
gtgaacagta tgagcataat cgggtccaca aacctgatcg tgtacataac
900ccaggtcatt ctcatgtaca tcttccagaa cgtaatggtc atgatcctgg
tcgtggacac 960caagatcttg atcctgataa tgaaggtgaa cttcgacata
ctagaaagag agaagcacca 1020catgttaaaa ataatgcaat aatttctttg
agaaaagatc taaatgaaga tgaccatcat 1080catgaatgtt tgaacgtcac
tcagttatta aaatactatg gtcatggtgc caactctccc 1140atctcaactg
atttatttac atacctttgc cctgcattgt tatatcaaat cgacagcaga
1200ctttgtattg agcattttga caaactttta gttgaagata taaataagga
taaaaacctg 1260gttcctgaag atgaggcaaa tataggggca tcagcctgga
tttgtggtat catttctatc 1320actgtcatta gcctgctttc cttgctaggc
gtgatcttgg ttcctatcat taaccaagga 1380tgcttcaaat tccttcttac
attccttgtt gcattagctg taggaacaat gagtggagac 1440gcccttcttc
atctactgcc ccattctcag ggtggacatg atcacagtca ccaacatgca
1500catgggcatg gacattctca tggacatgaa tctaacaagt ttttggaaga
atatgatgct 1560gtattgaaag gacttgttgc tctaggaggc atttacttgc
tatttatcat tgaacactgc 1620attagaatgt ttaagcacta caaacaacaa
agaggaaaac agaaatggtt tatgaaacag 1680aacacagaag aatcaactat
tggaagaaag ctttcagatc acaagttaaa caatacacca 1740gattctgact
ggcttcaact caagcctctt gccggaactg atgactcggt tgtttctgaa
1800gatcgactta atgaaactga actgacagat ttagaaggcc aacaagaatc
ccctcctaaa 1860aattaccttt gtatagaaga ggagaaaatc atagaccatt
ctcacagtga tggattacat 1920accattcatg agcatgatct ccatgctgct
gcacataacc accacggcga gaacaaaact 1980gtgctgagga agcataatca
ccagtggcac cacaagcatt ctcatcattc ccatggcccc 2040tgtcattctg
gatccgatct gaaagaaaca ggaatagcta atatagcctg gatggtgatc
2100atgggggatg gcatccacaa cttcagtgat gggctcgcaa ttggtgcagc
tttcagtgct 2160ggattgacag gaggaatcag tacttctata gccgtcttct
gtcatgaact gccacatgaa 2220ttaggagatt ttgcagttct tcttaaagca
ggcatgactg taaagcaagc aattgtatac 2280aacctcctct ctgccatgat
ggcttacata ggcatgctca taggcacagc tgttggtcag 2340tatgccaata
acatcacact ttggatcttt gcagtcactg caggcatgtt cctctatgta
2400gccttggtgg atatgcttcc agaaatgttg catggtgatg gtgacaatga
agaacatggc 2460ttttgtcctg tggggcaatt catccttcag aatttaggat
tgctctttgg atttgccatt 2520atgctggtga ttgccctcta tgaagataaa
attgtgtttg acatccagtt ttgacctttc 2580ccagtaatca ctgttgatta
cgagaatgtt accatgcagc tttgcatctg ttccttgtac 2640tgtatgcaca
ttgctcaaag gaaagtcagt ggcttgcact acttacaagt ttcatagatt
2700tgagcctaac cacaagaggc tggtgcttag tactgttttc cctgcacgta
ggggtctttt 2760aaaaatataa agcttgtgat aaagagagga gaatatggga
ctccatgaac cagtgttgat 2820atgtttgatt aagacttttc acaaaataat
catataaaac actagtctct ttattagtag 2880aaacttctgt ggctatgcag
aaatagagat cgaaccaaaa aaaatcattt aaactttaaa 2940aatattttaa
atggactttg gggagacatt ttttgtgtgt tttaagaatg aattgtagtg
3000ctctttaatt cagctacata tattcatgtg gtgataggga tcaacttgac
acaactttga 3060aactgcataa agtagacata ggaactagag gaaagctcag
gctgcattag agtatgaatt 3120tagcattggg aaaagccctt attcttgaat
ctagagttac tatttttgta tatatttgca 3180tagtgtttaa acctgcagcc
taaactactg aaatttgtga ttgtatgttt gtgtgagctt 3240cagtttaatg
aaagattcat aatggttctt tgtattatta taatacttgg tgttggggtg
3300ttctttctgt tttgtttttt actttaattt tgttttgatt tttttttttt
ttttttggcg 3360ggggtaggtg agggtttgga gcatgtggtc tttttaaaaa
attgtaaccc tctagaaaat 3420atcaaagaaa tgaaccagac gtggtttaaa
tagttgattt tcctatttta acagtaccaa 3480ctagttaatt gggaaatgta
agttctgaat gttcacattg ctttaccagt ttggcactgg 3540aaccaagagc
acatgtcgtg gctggctaca aggttgtaaa gcagaaaatc gaagtttacc
3600atgtctgtaa tgtgtacatg aagtgtcaat ttagaacagt tactaggata
aactccatta 3660ttgccatggc tgtcatggta cccaagtgac ttggaagatg
catttaaatt actcagctga 3720aatcacttga tcatcttgtg ccaagatatg
ctgttggtgc ctgataggga ttagtctttt 3780aggtgccctg ttctcctacc
ataattgtga atgatttgtg agaagtgcaa gccatgttta 3840tcctgaattt
ttacttaata atttgtatta ctagtcatat gcatgtagct ttctgtttac
3900atcctatgcc acatggtctt catttatgcc aggtaaactg tatttgaact
atgtgcagct 3960agctttgttt taatctgctt ggcaaccagt gtagctgctg
taacaatcta tcttattgtt 4020caaatatata agagccaaac tcttttccat
tccatctaaa atgttttcat ttagtactct 4080tctttcctcc tactctatga
acttcaaaac aaaaacaaaa ctttgagagc agcacatgca 4140tccaggtatt
tatagattat
tgccagtgtc ttttctgtat gctataagca agggagctta 4200ggtgttattt
ctttaattta tgcttgaatc tgaaaaatta tttctgactt actccatggc
4260ctccttataa taagtagaag ttttatatat aattaatttt cagcattggg
cactgaatta 4320ggacagtcct catctcattg cttggccctt caagcaacct
agctaaaagg tgctgatatt 4380ttatttagta ctgccaactt caagtgattt
agatatctat ctatctagat ttctgaacca 4440agatatattt atagttcact
tttgggtttt tatacccacg gtaggattct gcattccagc 4500attaaatctg
cttcatttta gaacctttat aaaagcaata gctggaatat actcccagtt
4560ttaaaataaa tgcctgattg atttaaagca agtaggttat gctgaagtat
ataaagaagt 4620tttatattct ctcaaaaatg gtattatctt tctttatttg
ctagattctt acaaatcttt 4680taagagggct gtaacagttg ctgctagtat
tagggttcca catcattcta atgtatagtt 4740tcaagtctta atagacaatc
tgaattccac tacatttctt ttggctccaa cattcctttt 4800agcttgacca
gtctaattta aaatgtgttt gttggaggtc attaacgtta cttgtacaat
4860gctgtcactg tgtgacatcc atatgaattt tggtatatat caatcaatca
atcaatcaca 4920ttgcattcaa tcaatcagct gtgattgatt gattatgctt
agaaatacta tagtaactag 4980atgcagtgtg aattttttcc attaacaaac
aaacaagtca gtggcttaaa tgtgattatg 5040gtcctgcaag gtgattcttg
ctaaaatatc taaacttttg ttttgtttta actgaatcat 5100tttttaactt
aaaaagctgg aaaatatcaa atgctgtttt ttttttttca ttgtcaacag
5160tggtgtgtca ttttatgtat gttcctaatg cttatggaac tcctccaaaa
taaagttact 5220caaagag 522745432DNAHomo sapiens 4agttgatcac
tctgaagctt tttggctaaa gcgtttgggt ttagagcttc cattactcat 60tcgccttgcc
caaggcctca gcaaccgacg ttcgaaagcc aggagaaaag gcgaatgata
120aagggcgctc cacgcatgcg ttaagaagcc gccccaactc ccccgcggcg
ttctttcttg 180gaacaaaact agcgcggagc cacggaactc cgcagtttgc
gtagacttga atttcctatt 240cctcggacga tccatgtgga atccgaaaaa
tagaaatgaa ggtacatatg cacacaaaat 300tttgcctcat ttgtttgctg
acatttattt ttcatcattg caaccattgc catgaagaac 360atgaccatgg
ccctgaagcg cttcacagac agcatcgtgg aatgacagaa ttggagccaa
420gcaaattttc aaagcaagct gctgaaaatg aaaaaaaata ctatattgaa
aaactttttg 480agcgttatgg tgaaaatgga agattatcct tttttggttt
ggagaaactt ttaacaaact 540tgggccttgg agagagaaaa gtagttgaga
ttaatcatga ggatcttggc cacgatcatg 600tttctcattt agatattttg
gcagttcaag agggaaagca ttttcactca cataaccacc 660agcattccca
taatcattta aattcagaaa atcaaactgt gaccagtgta tccacaaaaa
720gaaaccataa atgtgatcca gagaaagaga cagttgaagt gtctgtaaaa
tctgatgata 780aacatatgca tgaccataat caccgcctac gtcatcacca
tcgtttgcat catcatcttg 840atcataacaa cactcaccat tttcataatg
attccattac tcccagtgag cgtggggagc 900ctagcaatga accttcaaca
gagaccaata aaacccagga acaatctgat gttaaactac 960cgaaaggaaa
gaggaagaaa aaagggagga aaagtaatga aaattctgag gttattacac
1020caggttttcc ccctaaccat gatcagggtg aacagtatga gcataatcgg
gtccacaaac 1080ctgatcgtgt acataaccca ggtcattctc atgtacatct
tccagaacgt aatggtcatg 1140atcctggtcg tggacaccaa gatcttgatc
ctgataatga aggtgaactt cgacatacta 1200gaaagagaga agcaccacat
gttaaaaata atgcaataat ttctttgaga aaagatctaa 1260atgaagatga
ccatcatcat gaatgtttga acgtcactca gttattaaaa tactatggtc
1320atggtgccaa ctctcccatc tcaactgatt tatttacata cctttgccct
gcattgttat 1380atcaaatcga cagcagactt tgtattgagc attttgacaa
acttttagtt gaagatataa 1440ataaggataa aaacctggtt cctgaagatg
aggcaaatat aggggcatca gcctggattt 1500gtggtatcat ttctatcact
gtcattagcc tgctttcctt gctaggcgtg atcttggttc 1560ctatcattaa
ccaaggatgc ttcaaattcc ttcttacatt ccttgttgca ttagctgtag
1620gaacaatgag tggagacgcc cttcttcatc tactgcccca ttctcagggt
ggacatgatc 1680acagtcacca acatgcacat gggcatggac attctcatgg
acatgaatct aacaagtttt 1740tggaagaata tgatgctgta ttgaaaggac
ttgttgctct aggaggcatt tacttgctat 1800ttatcattga acactgcatt
agaatgttta agcactacaa acaacaaaga ggaaaacaga 1860aatggtttat
gaaacagaac acagaagaat caactattgg aagaaagctt tcagatcaca
1920agttaaacaa tacaccagat tctgactggc ttcaactcaa gcctcttgcc
ggaactgatg 1980actcggttgt ttctgaagat cgacttaatg aaactgaact
gacagattta gaaggccaac 2040aagaatcccc tcctaaaaat tacctttgta
tagaagagga gaaaatcata gaccattctc 2100acagtgatgg attacatacc
attcatgagc atgatctcca tgctgctgca cataaccacc 2160acggcgagaa
caaaactgtg ctgaggaagc ataatcacca gtggcaccac aagcattctc
2220atcattccca tggcccctgt cattctggat ccgatctgaa agaaacagga
atagctaata 2280tagcctggat ggtgatcatg ggggatggca tccacaactt
cagtgatggg ctcgcaattg 2340gtgcagcttt cagtgctgga ttgacaggag
gaatcagtac ttctatagcc gtcttctgtc 2400atgaactgcc acatgaatta
ggagattttg cagttcttct taaagcaggc atgactgtaa 2460agcaagcaat
tgtatacaac ctcctctctg ccatgatggc ttacataggc atgctcatag
2520gcacagctgt tggtcagtat gccaataaca tcacactttg gatctttgca
gtcactgcag 2580gcatgttcct ctatgtagcc ttggtggata tgcttccaga
aatgttgcat ggtgatggtg 2640acaatgaaga acatggcttt tgtcctgtgg
ggcaattcat ccttcagaat ttaggattgc 2700tctttggatt tgccattatg
ctggtgattg ccctctatga agataaaatt gtgtttgaca 2760tccagttttg
acctttccca gtaatcactg ttgattacga gaatgttacc atgcagcttt
2820gcatctgttc cttgtactgt atgcacattg ctcaaaggaa agtcagtggc
ttgcactact 2880tacaagtttc atagatttga gcctaaccac aagaggctgg
tgcttagtac tgttttccct 2940gcacgtaggg gtcttttaaa aatataaagc
ttgtgataaa gagaggagaa tatgggactc 3000catgaaccag tgttgatatg
tttgattaag acttttcaca aaataatcat ataaaacact 3060agtctcttta
ttagtagaaa cttctgtggc tatgcagaaa tagagatcga accaaaaaaa
3120atcatttaaa ctttaaaaat attttaaatg gactttgggg agacattttt
tgtgtgtttt 3180aagaatgaat tgtagtgctc tttaattcag ctacatatat
tcatgtggtg atagggatca 3240acttgacaca actttgaaac tgcataaagt
agacatagga actagaggaa agctcaggct 3300gcattagagt atgaatttag
cattgggaaa agcccttatt cttgaatcta gagttactat 3360ttttgtatat
atttgcatag tgtttaaacc tgcagcctaa actactgaaa tttgtgattg
3420tatgtttgtg tgagcttcag tttaatgaaa gattcataat ggttctttgt
attattataa 3480tacttggtgt tggggtgttc tttctgtttt gttttttact
ttaattttgt tttgattttt 3540tttttttttt tttggcgggg gtaggtgagg
gtttggagca tgtggtcttt ttaaaaaatt 3600gtaaccctct agaaaatatc
aaagaaatga accagacgtg gtttaaatag ttgattttcc 3660tattttaaca
gtaccaacta gttaattggg aaatgtaagt tctgaatgtt cacattgctt
3720taccagtttg gcactggaac caagagcaca tgtcgtggct ggctacaagg
ttgtaaagca 3780gaaaatcgaa gtttaccatg tctgtaatgt gtacatgaag
tgtcaattta gaacagttac 3840taggataaac tccattattg ccatggctgt
catggtaccc aagtgacttg gaagatgcat 3900ttaaattact cagctgaaat
cacttgatca tcttgtgcca agatatgctg ttggtgcctg 3960atagggatta
gtcttttagg tgccctgttc tcctaccata attgtgaatg atttgtgaga
4020agtgcaagcc atgtttatcc tgaattttta cttaataatt tgtattacta
gtcatatgca 4080tgtagctttc tgtttacatc ctatgccaca tggtcttcat
ttatgccagg taaactgtat 4140ttgaactatg tgcagctagc tttgttttaa
tctgcttggc aaccagtgta gctgctgtaa 4200caatctatct tattgttcaa
atatataaga gccaaactct tttccattcc atctaaaatg 4260ttttcattta
gtactcttct ttcctcctac tctatgaact tcaaaacaaa aacaaaactt
4320tgagagcagc acatgcatcc aggtatttat agattattgc cagtgtcttt
tctgtatgct 4380ataagcaagg gagcttaggt gttatttctt taatttatgc
ttgaatctga aaaattattt 4440ctgacttact ccatggcctc cttataataa
gtagaagttt tatatataat taattttcag 4500cattgggcac tgaattagga
cagtcctcat ctcattgctt ggcccttcaa gcaacctagc 4560taaaaggtgc
tgatatttta tttagtactg ccaacttcaa gtgatttaga tatctatcta
4620tctagatttc tgaaccaaga tatatttata gttcactttt gggtttttat
acccacggta 4680ggattctgca ttccagcatt aaatctgctt cattttagaa
cctttataaa agcaatagct 4740ggaatatact cccagtttta aaataaatgc
ctgattgatt taaagcaagt aggttatgct 4800gaagtatata aagaagtttt
atattctctc aaaaatggta ttatctttct ttatttgcta 4860gattcttaca
aatcttttaa gagggctgta acagttgctg ctagtattag ggttccacat
4920cattctaatg tatagtttca agtcttaata gacaatctga attccactac
atttcttttg 4980gctccaacat tccttttagc ttgaccagtc taatttaaaa
tgtgtttgtt ggaggtcatt 5040aacgttactt gtacaatgct gtcactgtgt
gacatccata tgaattttgg tatatatcaa 5100tcaatcaatc aatcacattg
cattcaatca atcagctgtg attgattgat tatgcttaga 5160aatactatag
taactagatg cagtgtgaat tttttccatt aacaaacaaa caagtcagtg
5220gcttaaatgt gattatggtc ctgcaaggtg attcttgcta aaatatctaa
acttttgttt 5280tgttttaact gaatcatttt ttaacttaaa aagctggaaa
atatcaaatg ctgttttttt 5340tttttcattg tcaacagtgg tgtgtcattt
tatgtatgtt cctaatgctt atggaactcc 5400tccaaaataa agttactcaa
agagagcaaa ta 54325552PRTHomo sapiens 5Met Ala Ala Leu Ala Pro Val
Gly Ser Pro Ala Ser Arg Gly Pro Arg 1 5 10 15 Leu Ala Ala Gly Leu
Arg Leu Leu Pro Met Leu Gly Leu Leu Gln Leu 20 25 30 Leu Ala Glu
Pro Gly Leu Gly Arg Val His His Leu Ala Leu Lys Asp 35 40 45 Asp
Val Arg His Lys Val His Leu Asn Thr Phe Gly Phe Phe Lys Asp 50 55
60 Gly Tyr Met Val Val Asn Val Ser Ser Leu Ser Leu Asn Glu Pro Glu
65 70 75 80 Asp Lys Asp Val Thr Ile Gly Phe Ser Leu Asp Arg Thr Lys
Asn Asp 85 90 95 Gly Phe Ser Ser Tyr Leu Asp Glu Asp Val Asn Tyr
Cys Ile Leu Lys 100 105 110 Lys Gln Ser Val Ser Val Thr Leu Leu Ile
Leu Asp Ile Ser Arg Ser 115 120 125 Glu Val Arg Val Lys Ser Pro Pro
Glu Ala Gly Thr Gln Leu Pro Lys 130 135 140 Ile Ile Phe Ser Arg Asp
Glu Lys Val Leu Gly Gln Ser Gln Glu Pro 145 150 155 160 Asn Val Asn
Pro Ala Ser Ala Gly Asn Gln Thr Gln Lys Thr Gln Asp 165 170 175 Gly
Gly Lys Ser Lys Arg Ser Thr Val Asp Ser Lys Ala Met Gly Glu 180 185
190 Lys Ser Phe Ser Val His Asn Asn Gly Gly Ala Val Ser Phe Gln Phe
195 200 205 Phe Phe Asn Ile Ser Thr Asp Asp Gln Glu Gly Leu Tyr Ser
Leu Tyr 210 215 220 Phe His Lys Cys Leu Gly Lys Glu Leu Pro Ser Asp
Lys Phe Thr Phe 225 230 235 240 Ser Leu Asp Ile Glu Ile Thr Glu Lys
Asn Pro Asp Ser Tyr Leu Ser 245 250 255 Ala Gly Glu Ile Pro Leu Pro
Lys Leu Tyr Ile Ser Met Ala Phe Phe 260 265 270 Phe Phe Leu Ser Gly
Thr Ile Trp Ile His Ile Leu Arg Lys Arg Arg 275 280 285 Asn Asp Val
Phe Lys Ile His Trp Leu Met Ala Ala Leu Pro Phe Thr 290 295 300 Lys
Ser Leu Ser Leu Val Phe His Ala Ile Asp Tyr His Tyr Ile Ser 305 310
315 320 Ser Gln Gly Phe Pro Ile Glu Gly Trp Ala Val Val Tyr Tyr Ile
Thr 325 330 335 His Leu Leu Lys Gly Ala Leu Leu Phe Ile Thr Ile Ala
Leu Ile Gly 340 345 350 Thr Gly Trp Ala Phe Ile Lys His Ile Leu Ser
Asp Lys Asp Lys Lys 355 360 365 Ile Phe Met Ile Val Ile Pro Leu Gln
Val Leu Ala Asn Val Ala Tyr 370 375 380 Ile Ile Ile Glu Ser Thr Glu
Glu Gly Thr Thr Glu Tyr Gly Leu Trp 385 390 395 400 Lys Asp Ser Leu
Phe Leu Val Asp Leu Leu Cys Cys Gly Ala Ile Leu 405 410 415 Phe Pro
Val Val Trp Ser Ile Arg His Leu Gln Glu Ala Ser Ala Thr 420 425 430
Asp Gly Lys Ala Ala Ile Asn Leu Ala Lys Leu Lys Leu Phe Arg His 435
440 445 Tyr Tyr Val Leu Ile Val Cys Tyr Ile Tyr Phe Thr Arg Ile Ile
Ala 450 455 460 Phe Leu Leu Lys Leu Ala Val Pro Phe Gln Trp Lys Trp
Leu Tyr Gln 465 470 475 480 Leu Leu Asp Glu Thr Ala Thr Leu Val Phe
Phe Val Leu Thr Gly Tyr 485 490 495 Lys Phe Arg Pro Ala Ser Asp Asn
Pro Tyr Leu Gln Leu Ser Gln Glu 500 505 510 Glu Glu Asp Leu Glu Met
Glu Ser Val Val Thr Thr Ser Gly Val Met 515 520 525 Glu Ser Met Lys
Lys Val Lys Lys Val Thr Asn Gly Ser Val Glu Pro 530 535 540 Gln Gly
Glu Trp Glu Gly Ala Val 545 550 6600PRTHomo sapiens 6Met Ala Ala
Leu Ala Pro Val Gly Ser Pro Ala Ser Arg Gly Pro Arg 1 5 10 15 Leu
Ala Ala Gly Leu Arg Leu Leu Pro Met Leu Gly Leu Leu Gln Leu 20 25
30 Leu Ala Glu Pro Gly Leu Gly Arg Val His His Leu Ala Leu Lys Asp
35 40 45 Asp Val Arg His Lys Val His Leu Asn Thr Phe Gly Phe Phe
Lys Asp 50 55 60 Gly Tyr Met Val Val Asn Val Ser Ser Leu Ser Leu
Asn Glu Pro Glu 65 70 75 80 Asp Lys Asp Val Thr Ile Gly Phe Ser Leu
Asp Arg Thr Lys Asn Asp 85 90 95 Gly Phe Ser Ser Tyr Leu Asp Glu
Asp Val Asn Tyr Cys Ile Leu Lys 100 105 110 Lys Gln Ser Val Ser Val
Thr Leu Leu Ile Leu Asp Ile Ser Arg Ser 115 120 125 Glu Val Arg Val
Lys Ser Pro Pro Glu Ala Gly Thr Gln Leu Pro Lys 130 135 140 Ile Ile
Phe Ser Arg Asp Glu Lys Val Leu Gly Gln Ser Gln Glu Pro 145 150 155
160 Asn Val Asn Pro Ala Ser Ala Gly Asn Gln Thr Gln Lys Thr Gln Asp
165 170 175 Gly Gly Lys Ser Lys Arg Ser Thr Val Asp Ser Lys Ala Met
Gly Glu 180 185 190 Lys Ser Phe Ser Val His Asn Asn Gly Gly Ala Val
Ser Phe Gln Phe 195 200 205 Phe Phe Asn Ile Ser Thr Asp Asp Gln Glu
Gly Leu Tyr Ser Leu Tyr 210 215 220 Phe His Lys Cys Leu Gly Lys Glu
Leu Pro Ser Asp Lys Phe Thr Phe 225 230 235 240 Ser Leu Asp Ile Glu
Ile Thr Glu Lys Asn Pro Asp Ser Tyr Leu Ser 245 250 255 Ala Gly Glu
Ile Pro Leu Pro Lys Leu Tyr Ile Ser Met Ala Phe Phe 260 265 270 Phe
Phe Leu Ser Gly Thr Ile Trp Ile His Ile Leu Arg Lys Arg Arg 275 280
285 Asn Asp Val Phe Lys Ile His Trp Leu Met Ala Ala Leu Pro Phe Thr
290 295 300 Lys Ser Leu Ser Leu Val Phe His Ala Ile Asp Tyr His Tyr
Ile Ser 305 310 315 320 Ser Gln Gly Phe Pro Ile Glu Gly Trp Ala Val
Val Tyr Tyr Ile Thr 325 330 335 His Leu Leu Lys Gly Ala Leu Leu Phe
Ile Thr Ile Ala Leu Ile Gly 340 345 350 Thr Gly Trp Ala Phe Ile Lys
His Ile Leu Ser Asp Lys Asp Lys Lys 355 360 365 Ile Phe Met Ile Val
Ile Pro Leu Gln Val Leu Ala Asn Val Ala Tyr 370 375 380 Ile Ile Ile
Glu Ser Thr Glu Glu Gly Thr Thr Glu Tyr Gly Leu Trp 385 390 395 400
Lys Asp Ser Leu Phe Leu Val Asp Leu Leu Cys Cys Gly Ala Ile Leu 405
410 415 Phe Pro Val Val Trp Ser Ile Arg His Leu Gln Glu Ala Ser Ala
Thr 420 425 430 Asp Gly Lys Gly Asp Ser Met Gly Pro Leu Gln Gln Arg
Ala Asn Leu 435 440 445 Arg Ala Gly Ser Arg Ile Glu Ser His His Phe
Ala Gln Ala Asp Leu 450 455 460 Glu Leu Leu Ala Ser Ser Cys Pro Pro
Ala Ser Val Ser Gln Arg Ala 465 470 475 480 Gly Ile Thr Ala Ala Ile
Asn Leu Ala Lys Leu Lys Leu Phe Arg His 485 490 495 Tyr Tyr Val Leu
Ile Val Cys Tyr Ile Tyr Phe Thr Arg Ile Ile Ala 500 505 510 Phe Leu
Leu Lys Leu Ala Val Pro Phe Gln Trp Lys Trp Leu Tyr Gln 515 520 525
Leu Leu Asp Glu Thr Ala Thr Leu Val Phe Phe Val Leu Thr Gly Tyr 530
535 540 Lys Phe Arg Pro Ala Ser Asp Asn Pro Tyr Leu Gln Leu Ser Gln
Glu 545 550 555 560 Glu Glu Asp Leu Glu Met Glu Ser Val Val Thr Thr
Ser Gly Val Met 565 570 575 Glu Ser Met Lys Lys Val Lys Lys Val Thr
Asn Gly Ser Val Glu Pro 580 585 590 Gln Gly Glu Trp Glu Gly Ala Val
595 600 7600PRTHomo sapiens 7Met Ala Ala Leu Ala Pro Val Gly Ser
Pro Ala Ser Arg Gly Pro Arg 1 5 10 15 Leu Ala Ala Gly Leu Arg Leu
Leu Pro Met Leu Gly Leu Leu Gln Leu 20 25 30 Leu Ala Glu Pro Gly
Leu Gly Arg Val His His Leu Ala Leu Lys Asp 35 40 45 Asp Val Arg
His Lys Val His Leu Asn Thr Phe Gly Phe Phe Lys Asp 50 55 60 Gly
Tyr Met Val Val Asn Val Ser Ser Leu Ser Leu Asn Glu Pro Glu 65 70
75 80 Asp Lys Asp Val Thr Ile Gly Phe Ser Leu Asp Arg Thr Lys Asn
Asp 85 90 95 Gly Phe Ser Ser Tyr Leu Asp Glu Asp Val Asn Tyr Cys
Ile Leu Lys 100 105 110 Lys Gln Ser Val Ser Val Thr Leu Leu Ile Leu
Asp Ile Ser Arg Ser 115 120 125 Glu Val Arg Val Lys Ser Pro Pro Glu
Ala Gly Thr Gln Leu Pro Lys 130
135 140 Ile Ile Phe Ser Arg Asp Glu Lys Val Leu Gly Gln Ser Gln Glu
Pro 145 150 155 160 Asn Val Asn Pro Ala Ser Ala Gly Asn Gln Thr Gln
Lys Thr Gln Asp 165 170 175 Gly Gly Lys Ser Lys Arg Ser Thr Val Asp
Ser Lys Ala Met Gly Glu 180 185 190 Lys Ser Phe Ser Val His Asn Asn
Gly Gly Ala Val Ser Phe Gln Phe 195 200 205 Phe Phe Asn Ile Ser Thr
Asp Asp Gln Glu Gly Leu Tyr Ser Leu Tyr 210 215 220 Phe His Lys Cys
Leu Gly Lys Glu Leu Pro Ser Asp Lys Phe Thr Phe 225 230 235 240 Ser
Leu Asp Ile Glu Ile Thr Glu Lys Asn Pro Asp Ser Tyr Leu Ser 245 250
255 Ala Gly Glu Ile Pro Leu Pro Lys Leu Tyr Ile Ser Met Ala Phe Phe
260 265 270 Phe Phe Leu Ser Gly Thr Ile Trp Ile His Ile Leu Arg Lys
Arg Arg 275 280 285 Asn Asp Val Phe Lys Ile His Trp Leu Met Ala Ala
Leu Pro Phe Thr 290 295 300 Lys Ser Leu Ser Leu Val Phe His Ala Ile
Asp Tyr His Tyr Ile Ser 305 310 315 320 Ser Gln Gly Phe Pro Ile Glu
Gly Trp Ala Val Val Tyr Tyr Ile Thr 325 330 335 His Leu Leu Lys Gly
Ala Leu Leu Phe Ile Thr Ile Ala Leu Ile Gly 340 345 350 Thr Gly Trp
Ala Phe Ile Lys His Ile Leu Ser Asp Lys Asp Lys Lys 355 360 365 Ile
Phe Met Ile Val Ile Pro Leu Gln Val Leu Ala Asn Val Ala Tyr 370 375
380 Ile Ile Ile Glu Ser Thr Glu Glu Gly Thr Thr Glu Tyr Gly Leu Trp
385 390 395 400 Lys Asp Ser Leu Phe Leu Val Asp Leu Leu Cys Cys Gly
Ala Ile Leu 405 410 415 Phe Pro Val Val Trp Ser Ile Arg His Leu Gln
Glu Ala Ser Ala Thr 420 425 430 Asp Gly Lys Gly Asp Ser Met Gly Pro
Leu Gln Gln Arg Ala Asn Leu 435 440 445 Arg Ala Gly Ser Arg Ile Glu
Ser His His Phe Ala Gln Ala Asp Leu 450 455 460 Glu Leu Leu Ala Ser
Ser Cys Pro Pro Ala Ser Val Ser Gln Arg Ala 465 470 475 480 Gly Ile
Thr Ala Ala Ile Asn Leu Ala Lys Leu Lys Leu Phe Arg His 485 490 495
Tyr Tyr Val Leu Ile Val Cys Tyr Ile Tyr Phe Thr Arg Ile Ile Ala 500
505 510 Phe Leu Leu Lys Leu Ala Val Pro Phe Gln Trp Lys Trp Leu Tyr
Gln 515 520 525 Leu Leu Asp Glu Thr Ala Thr Leu Val Phe Phe Val Leu
Thr Gly Tyr 530 535 540 Lys Phe Arg Pro Ala Ser Asp Asn Pro Tyr Leu
Gln Leu Ser Gln Glu 545 550 555 560 Glu Glu Asp Leu Glu Met Glu Ser
Val Val Thr Thr Ser Gly Val Met 565 570 575 Glu Ser Met Lys Lys Val
Lys Lys Val Thr Asn Gly Ser Val Glu Pro 580 585 590 Gln Gly Glu Trp
Glu Gly Ala Val 595 600 86874DNAHomo sapiens 8ccgaggtggg cagcacaggc
tcctcgacga cttcctaggt cgcaatctcc aggaaaacga 60ccacagggtc agcggagcta
gccgccgagc cccgctcccc gggcccttcc ggcggctgcg 120ccctttcacc
ccggacgtgg gcgggagagg aagcggctgg tgatgctgga acaaacatgg
180ccgctctggc gcccgtcggc tcccccgcct cccgcggtcc taggctggcc
gcgggcctcc 240ggctgctccc aatgctgggt ttgctgcagt tgctggccga
gcctggcctg ggccgcgtcc 300atcacctggc actcaaggat gatgtgaggc
ataaagttca tctgaacacc tttggcttct 360tcaaggatgg gtacatggtg
gtgaatgtca gtagcctctc actgaatgag cctgaagaca 420aggatgtgac
tattggattt agcctagacc gtacaaagaa tgatggcttt tcttcttacc
480tggatgaaga tgtgaattac tgtattttaa agaaacagtc tgtctctgtc
acccttttaa 540tcctagacat ctccagaagt gaggtaagag taaagtctcc
accagaagct ggtacccagt 600taccaaagat catcttcagc agggatgaga
aagtccttgg tcagagccag gagcctaatg 660ttaaccctgc ttcagcaggc
aaccagaccc agaagacaca agatggtgga aagtctaaaa 720gaagtacagt
ggattcaaag gccatgggag agaaatcctt ttctgttcat aataatggtg
780gggcagtgtc atttcagttt ttctttaaca tcagcactga tgaccaagaa
ggcctttaca 840gtctttattt tcataaatgc cttggaaaag aattgccaag
tgacaagttt acattcagcc 900ttgatattga gatcacagag aagaatcctg
acagctacct ctcagcagga gaaattcctc 960tccccaaatt atacatctca
atggcctttt tcttctttct ttctgggacc atctggattc 1020atatccttcg
aaaacgacgg aatgatgtat ttaaaatcca ctggctgatg gcggcccttc
1080ctttcaccaa gtctctttcc ttggtgttcc atgcaattga ctaccactac
atctcctccc 1140agggcttccc tatcgaaggc tgggctgttg tgtactacat
aactcacctt ttgaaagggg 1200cgctactctt catcaccatt gcactcattg
gcactggctg ggctttcatt aagcacatcc 1260tttctgataa agacaaaaag
atcttcatga ttgtcattcc actccaggtc ctggcaaatg 1320tagcctacat
catcatagag tccaccgagg agggcacgac tgaatatggc ttgtggaagg
1380actctctatt tctggtcgac ctgttgtgtt gtggtgccat cctcttccca
gtggtgtggt 1440caatcagaca tttacaagaa gcatcagcaa cagatggaaa
agctgctatt aacttagcaa 1500agctgaaact tttcagacat tattacgtct
tgattgtgtg ttacatatac ttcactagga 1560tcattgcatt tctcctcaaa
ctcgctgttc cattccagtg gaagtggctc taccagctcc 1620tggatgaaac
ggccacactg gtcttctttg ttctaacggg gtataaattc cgtccggctt
1680cagataaccc ctacctacaa ctttctcagg aagaagaaga cttggaaatg
gagtccgttg 1740tgacaacatc tggggtgatg gaaagtatga agaaagtcaa
gaaggtgacc aacggctccg 1800tggagcccca gggcgagtgg gaaggcgccg
tgtgacagag ccgaccctga ggatggcact 1860gtccaaggaa actgttaact
tattcatagt cctattggac agcaggagca gctcctacag 1920tgaactattg
gcaccaccga cagtgacacc agggcacatg gctggagcac agtgccgcgg
1980aaacctgatt ttgtactctc ttttatggaa acgatctgtg gctgtttaga
ggcagctgga 2040tcctctttca ggcgggaatg ggagggcggg cacagggagg
aggagaggaa gagaaaagga 2100agaattcatt tttaatttag gtttcttttt
ttcttcttca tttcggagct ctaaggtgta 2160tgcagttgtg accccatgtg
tggggaagtg tagcaaggac ggctggtgga gggggaagga 2220gggtgcgagg
tgtctgtctg atgctttagg aaatgtctac tgaggaccct gggacttaag
2280aagaagggcg gggagagtgc cattgcctgt ttgggagaca aaaatgaacg
aaaacaggtg 2340actttggaaa gcaaagtcaa aacccagttt aggatgtagc
acctgcccca ggattcctgc 2400cctcggcttt gccccagacc cttattccag
atgctgagag tgaccaggac agcagctcct 2460gaggcccagt ggtcttcttt
ccaacaggaa aagaaggctg tgatgtcgct gtcaggatca 2520tgccctgtgg
cacagcacag gtggtgggag gtggttttct gactgagatg ttgcctgatg
2580gatggaaaga aatgtatttt taagttcaaa aagcattatc ctgtggcgtt
gcctggacat 2640ccactccctg acagcccaga gcagcactgt ctggcttccc
ttcatgcttg tggctttgtt 2700gtgtttgatc agaattttgg gggaaatgga
aagttttcct caaggagcag ctgggggcag 2760aataggtagt atttaagcaa
atacttaagt ccaagcaaat catccccatt aaaaagcttt 2820tcctgtaggc
tagtaggatt tctaaataga tgaattcaac agacttggtc cccatagtcc
2880aagagtatgt atgtgaagaa agtgagcatg attcaacagt ttcactctca
gggattttag 2940gatggcaaaa tacttcacag aaactcaatg attaagttcc
cttccacact tccagagctt 3000gaatgaacac aggtagccac ctaaattgag
cagtattgca actcagagag aaaatcatct 3060gaatagtagg acaagctcag
aaggtacatt gtgactgagg gcttaaaagg agaccaaaac 3120atggccccat
cagggaagct tcttaatgct tggggggcca gctaggtagg gttgcttcca
3180aaagctggag cccacccctg cctaggggtt gtcagagagc cacacctgca
ggggaacagg 3240tacctccgag ggtgagagtc gtggtctctg ggagttgttt
tctcacctct ggcttagaag 3300ggtcaggcag aaaccacagg atgtggggtc
acactcactg tcccaagttt gggaacctga 3360aaaagtctcc attcagaaca
tggttgttct ccctgtccca tgctatctta tcttcctaaa 3420tgactaatga
ggaagcgggt gttctttttc tgcactttga ttcgccatct gggttctgta
3480gggtgctctg aaggtgtgat ctgccttctg gctgatgtgg aggaagagca
agcgccttcc 3540caggccacag ctgctcacct ctcggcagat attttaggca
agcatccgtg tgtcttccca 3600tcttcaggag aaaggtaaat gcaccctaag
tgttcacttc tggacctttt tcaagttcac 3660ttgggactgt gtgacagaag
ggagttggag ggaggatggg aatattttta acactttgtt 3720ttcctgtgca
gaaacataat accagttttc gcagaaatgt gtctcaatct gtgactacca
3780aagccctcct cagtccttcc ctcagaggga cacatttgct gtttctcccg
caagcagatg 3840ttgtggatga ggcgatagac tccttggcaa gaacgaaagg
tgtgatgaaa cctccctgct 3900cggaagggtc tccgtggagg tgtcctcatt
tcacatgctg ggttttgcaa gcgaggaagc 3960caggcagtgg aggaactaga
gagaggcagg cgtgtgtgtg gacaagcgct ggagccgcag 4020ccctcagact
ggcacgggaa cgccagcgtt gggtgttcag attccacgcg tatgtctggg
4080ctcactcaca gcatggccga gtgtctgcag tgctggtcct gacccttcca
gagcagcagt 4140ggacagatga gataagactg tttcagaaac aaagatggcc
acagccttcc taacaagcag 4200gtcatctggc catgtctgta ttgtaactgg
taaaaggctt caagtcagat tgatgatcaa 4260gaaaagtcaa aaccccagcc
caagattggg aaagcaggtg gtggttccaa gcttttaaaa 4320aattattgaa
gctctccatc ctgttctgtg agtgtgtctt ctctttctcc ttcacgtcat
4380agccgtgacc caccgttcat ctctgctctt gcgtaaagat gaccgatgga
gtccaaagcc 4440aagtggcttc accagctgac aagccaccct cctgcagcct
gagtttcaca gtccactggg 4500ttcgttgtca tgcggtgttt gaatggttaa
gcccttgcag tatttcagat cgggcaaaaa 4560atatcggatg cacatagcag
aaccattggt ggtatttata gctttgcttt gtactcctca 4620ctgtttctgc
ctacgcaaaa tatccatgtt tcctctgaga aatctgttgt ggactgaaag
4680cgctgctggc tgtgaaattt aataaagtgt gtatgctttg ctagaaaatt
atttcttgga 4740caataggaac agtcattgat ctgtaaatcc tggctcttaa
cagtgagtgg ccaaggactt 4800gatcagccca tttcttggtc cctcagtgct
ttaaaattta agtagcactg cattttgtaa 4860tgttgaatat gactctagtg
acttgtagga ggcacttgtg aggagatgct tgcttcagtg 4920taaaagatgc
tcatggcctg agtcagttga gttttctttc aagaaaccac ttcagagtga
4980aatatccagg gtttccccgc cctggacatg tccagcctgc ccaggcagca
cacagccctg 5040taagtccacc tcgtgtgggt gagatttcct cctgcgtgat
gacctcatcg ccatctctgc 5100tgtctcattc cacagcctcc ctccctcttc
tctcctcctc tgccctcgcc cttccccctt 5160ccccatcccc tccccctcct
cctctgccct cgcccttacc cctccccctt ccccttccgc 5220tcctcctccc
tcctccacct ctttctcctc ctccttccct cctcctccct cctcttccct
5280tctctgccat ctttctcccc gtgcctattg atcccacata ggctcattct
gggtacaccg 5340gctaaaggct ttggtgcatt gcagcgtttt ctcccagcag
ctgtgtgaaa gatgcatttt 5400ctaagctaag gagaattttc tcaagagtgg
catactcatg ccaaatatta ttgctctggg 5460ccatataggc tggtcttcct
ccacactaaa atgggtgtct tgttttggta cttaaaacag 5520tctactccag
gcatccagtc cttacagacc aaggaagagc atagcgatgc ctgttggaat
5580tgcagatgca ttctggcctt ctcccccgtc ctgaaacatt ttctttgagg
aaggctctta 5640gaacattaga tagtctgctg aggttgttgg cccagctcca
tacacccagt agaacagtgg 5700aacaactcat gcttcatgct gccaagctgc
tgtacttcaa aggaaacaga tctagcacac 5760tgctgcaccc ctgcttccac
actccacact tcaccccgct gcttttctct gacccgcccc 5820tggccttgta
agactcacgt aagctaagtc caggatgcct gtggcctgcg gcttgattct
5880tccctttagg attcagcaag ttaatggctt cctcgctata gaagtgagac
tttgacttga 5940tgcctcttgg tatatcaaaa agatattcat ccagaaagta
ccaaatgttc tgaaagaccc 6000gctcttcact ccagttttcc ctagggtgtt
tctggcaggg cgtttttaaa aggcatctac 6060ctgagttgac gctaatactt
gtcaccacct ggaacgtagt tatcggtcgg caggctgaac 6120atactccaga
ttccccagag gccacttctg tagcccagcg atgcatctga gcctctctgc
6180gtggtttatg cttgaaaaat agataatgct tttagatggt tcactgccag
gccatgggcc 6240ccacacatct caggccctgt gtgagggagc acactgagat
ggtgcaggag tgaatgggca 6300tggcttggcc tcgctacctc ggggacctgt
tggagttctg gcagcagggt gtctgcaggt 6360gggacggcgt tctgggcaga
gtcagaatgg tcagaatgaa acagaacagc caactcaccc 6420acaggacagc
ttattttgag gcaaggtttt ggattttgga ggaagcagcc agatgaggcg
6480gtgagcctcc agaaggtcag cctttggagc acgtaagata ctgttacagg
gtccagaaat 6540cgtgttcaca tgggggcttt gactcttcaa acagcttttg
cagatcgtaa attgcatttg 6600cctagtcgtg tgacctcaaa agaagtcaga
catatttaat ccagaaatag tttcgtttga 6660gggagggctt gcaggtctgt
aaatagcatt tgctttcctg gttagagatt gggatgcaga 6720aggagttttc
agtatttttt ttaaaacact aatgatcatt gaagagtatt tatgtaaaca
6780tacaacgtat aatgggtggg ggatccgatc atggtgatgt acggggtgaa
ttctcttgcc 6840gtgttgcaaa tgtgtaaaat aaagattatc tggc
687497018DNAHomo sapiens 9ccgaggtggg cagcacaggc tcctcgacga
cttcctaggt cgcaatctcc aggaaaacga 60ccacagggtc agcggagcta gccgccgagc
cccgctcccc gggcccttcc ggcggctgcg 120ccctttcacc ccggacgtgg
gcgggagagg aagcggctgg tgatgctgga acaaacatgg 180ccgctctggc
gcccgtcggc tcccccgcct cccgcggtcc taggctggcc gcgggcctcc
240ggctgctccc aatgctgggt ttgctgcagt tgctggccga gcctggcctg
ggccgcgtcc 300atcacctggc actcaaggat gatgtgaggc ataaagttca
tctgaacacc tttggcttct 360tcaaggatgg gtacatggtg gtgaatgtca
gtagcctctc actgaatgag cctgaagaca 420aggatgtgac tattggattt
agcctagacc gtacaaagaa tgatggcttt tcttcttacc 480tggatgaaga
tgtgaattac tgtattttaa agaaacagtc tgtctctgtc acccttttaa
540tcctagacat ctccagaagt gaggtaagag taaagtctcc accagaagct
ggtacccagt 600taccaaagat catcttcagc agggatgaga aagtccttgg
tcagagccag gagcctaatg 660ttaaccctgc ttcagcaggc aaccagaccc
agaagacaca agatggtgga aagtctaaaa 720gaagtacagt ggattcaaag
gccatgggag agaaatcctt ttctgttcat aataatggtg 780gggcagtgtc
atttcagttt ttctttaaca tcagcactga tgaccaagaa ggcctttaca
840gtctttattt tcataaatgc cttggaaaag aattgccaag tgacaagttt
acattcagcc 900ttgatattga gatcacagag aagaatcctg acagctacct
ctcagcagga gaaattcctc 960tccccaaatt atacatctca atggcctttt
tcttctttct ttctgggacc atctggattc 1020atatccttcg aaaacgacgg
aatgatgtat ttaaaatcca ctggctgatg gcggcccttc 1080ctttcaccaa
gtctctttcc ttggtgttcc atgcaattga ctaccactac atctcctccc
1140agggcttccc tatcgaaggc tgggctgttg tgtactacat aactcacctt
ttgaaagggg 1200cgctactctt catcaccatt gcactcattg gcactggctg
ggctttcatt aagcacatcc 1260tttctgataa agacaaaaag atcttcatga
ttgtcattcc actccaggtc ctggcaaatg 1320tagcctacat catcatagag
tccaccgagg agggcacgac tgaatatggc ttgtggaagg 1380actctctatt
tctggtcgac ctgttgtgtt gtggtgccat cctcttccca gtggtgtggt
1440caatcagaca tttacaagaa gcatcagcaa cagatggaaa aggtgacagc
atgggacctc 1500ttcagcagag agcgaatcta agagcaggaa gtcgcataga
gtctcaccat tttgcccagg 1560ctgatcttga actcctggcc tctagctgtc
ctcctgcctc agtctcccaa agggctggga 1620ttacagctgc tattaactta
gcaaagctga aacttttcag acattattac gtcttgattg 1680tgtgttacat
atacttcact aggatcattg catttctcct caaactcgct gttccattcc
1740agtggaagtg gctctaccag ctcctggatg aaacggccac actggtcttc
tttgttctaa 1800cggggtataa attccgtccg gcttcagata acccctacct
acaactttct caggaagaag 1860aagacttgga aatggagtcc gttgtgacaa
catctggggt gatggaaagt atgaagaaag 1920tcaagaaggt gaccaacggc
tccgtggagc cccagggcga gtgggaaggc gccgtgtgac 1980agagccgacc
ctgaggatgg cactgtccaa ggaaactgtt aacttattca tagtcctatt
2040ggacagcagg agcagctcct acagtgaact attggcacca ccgacagtga
caccagggca 2100catggctgga gcacagtgcc gcggaaacct gattttgtac
tctcttttat ggaaacgatc 2160tgtggctgtt tagaggcagc tggatcctct
ttcaggcggg aatgggaggg cgggcacagg 2220gaggaggaga ggaagagaaa
aggaagaatt catttttaat ttaggtttct ttttttcttc 2280ttcatttcgg
agctctaagg tgtatgcagt tgtgacccca tgtgtgggga agtgtagcaa
2340ggacggctgg tggaggggga aggagggtgc gaggtgtctg tctgatgctt
taggaaatgt 2400ctactgagga ccctgggact taagaagaag ggcggggaga
gtgccattgc ctgtttggga 2460gacaaaaatg aacgaaaaca ggtgactttg
gaaagcaaag tcaaaaccca gtttaggatg 2520tagcacctgc cccaggattc
ctgccctcgg ctttgcccca gacccttatt ccagatgctg 2580agagtgacca
ggacagcagc tcctgaggcc cagtggtctt ctttccaaca ggaaaagaag
2640gctgtgatgt cgctgtcagg atcatgccct gtggcacagc acaggtggtg
ggaggtggtt 2700ttctgactga gatgttgcct gatggatgga aagaaatgta
tttttaagtt caaaaagcat 2760tatcctgtgg cgttgcctgg acatccactc
cctgacagcc cagagcagca ctgtctggct 2820tcccttcatg cttgtggctt
tgttgtgttt gatcagaatt ttgggggaaa tggaaagttt 2880tcctcaagga
gcagctgggg gcagaatagg tagtatttaa gcaaatactt aagtccaagc
2940aaatcatccc cattaaaaag cttttcctgt aggctagtag gatttctaaa
tagatgaatt 3000caacagactt ggtccccata gtccaagagt atgtatgtga
agaaagtgag catgattcaa 3060cagtttcact ctcagggatt ttaggatggc
aaaatacttc acagaaactc aatgattaag 3120ttcccttcca cacttccaga
gcttgaatga acacaggtag ccacctaaat tgagcagtat 3180tgcaactcag
agagaaaatc atctgaatag taggacaagc tcagaaggta cattgtgact
3240gagggcttaa aaggagacca aaacatggcc ccatcaggga agcttcttaa
tgcttggggg 3300gccagctagg tagggttgct tccaaaagct ggagcccacc
cctgcctagg ggttgtcaga 3360gagccacacc tgcaggggaa caggtacctc
cgagggtgag agtcgtggtc tctgggagtt 3420gttttctcac ctctggctta
gaagggtcag gcagaaacca caggatgtgg ggtcacactc 3480actgtcccaa
gtttgggaac ctgaaaaagt ctccattcag aacatggttg ttctccctgt
3540cccatgctat cttatcttcc taaatgacta atgaggaagc gggtgttctt
tttctgcact 3600ttgattcgcc atctgggttc tgtagggtgc tctgaaggtg
tgatctgcct tctggctgat 3660gtggaggaag agcaagcgcc ttcccaggcc
acagctgctc acctctcggc agatatttta 3720ggcaagcatc cgtgtgtctt
cccatcttca ggagaaaggt aaatgcaccc taagtgttca 3780cttctggacc
tttttcaagt tcacttggga ctgtgtgaca gaagggagtt ggagggagga
3840tgggaatatt tttaacactt tgttttcctg tgcagaaaca taataccagt
tttcgcagaa 3900atgtgtctca atctgtgact accaaagccc tcctcagtcc
ttccctcaga gggacacatt 3960tgctgtttct cccgcaagca gatgttgtgg
atgaggcgat agactccttg gcaagaacga 4020aaggtgtgat gaaacctccc
tgctcggaag ggtctccgtg gaggtgtcct catttcacat 4080gctgggtttt
gcaagcgagg aagccaggca gtggaggaac tagagagagg caggcgtgtg
4140tgtggacaag cgctggagcc gcagccctca gactggcacg ggaacgccag
cgttgggtgt 4200tcagattcca cgcgtatgtc tgggctcact cacagcatgg
ccgagtgtct gcagtgctgg 4260tcctgaccct tccagagcag cagtggacag
atgagataag actgtttcag aaacaaagat 4320ggccacagcc ttcctaacaa
gcaggtcatc tggccatgtc tgtattgtaa ctggtaaaag 4380gcttcaagtc
agattgatga tcaagaaaag tcaaaacccc agcccaagat tgggaaagca
4440ggtggtggtt ccaagctttt aaaaaattat tgaagctctc catcctgttc
tgtgagtgtg 4500tcttctcttt ctccttcacg tcatagccgt gacccaccgt
tcatctctgc tcttgcgtaa 4560agatgaccga tggagtccaa agccaagtgg
cttcaccagc tgacaagcca ccctcctgca 4620gcctgagttt cacagtccac
tgggttcgtt gtcatgcggt gtttgaatgg ttaagccctt 4680gcagtatttc
agatcgggca aaaaatatcg gatgcacata gcagaaccat tggtggtatt
4740tatagctttg ctttgtactc ctcactgttt ctgcctacgc aaaatatcca
tgtttcctct 4800gagaaatctg ttgtggactg aaagcgctgc tggctgtgaa
atttaataaa gtgtgtatgc 4860tttgctagaa aattatttct tggacaatag
gaacagtcat tgatctgtaa atcctggctc 4920ttaacagtga gtggccaagg
acttgatcag cccatttctt ggtccctcag tgctttaaaa 4980tttaagtagc
actgcatttt gtaatgttga atatgactct agtgacttgt aggaggcact
5040tgtgaggaga tgcttgcttc agtgtaaaag atgctcatgg cctgagtcag
ttgagttttc 5100tttcaagaaa ccacttcaga gtgaaatatc cagggtttcc
ccgccctgga catgtccagc 5160ctgcccaggc agcacacagc cctgtaagtc
cacctcgtgt gggtgagatt tcctcctgcg 5220tgatgacctc atcgccatct
ctgctgtctc attccacagc ctccctccct cttctctcct 5280cctctgccct
cgcccttccc ccttccccat cccctccccc tcctcctctg ccctcgccct
5340tacccctccc ccttcccctt ccgctcctcc tccctcctcc acctctttct
cctcctcctt 5400ccctcctcct ccctcctctt cccttctctg ccatctttct
ccccgtgcct attgatccca 5460cataggctca ttctgggtac accggctaaa
ggctttggtg cattgcagcg ttttctccca 5520gcagctgtgt gaaagatgca
ttttctaagc taaggagaat tttctcaaga gtggcatact 5580catgccaaat
attattgctc tgggccatat aggctggtct tcctccacac taaaatgggt
5640gtcttgtttt ggtacttaaa acagtctact ccaggcatcc agtccttaca
gaccaaggaa 5700gagcatagcg atgcctgttg gaattgcaga tgcattctgg
ccttctcccc cgtcctgaaa 5760cattttcttt gaggaaggct cttagaacat
tagatagtct gctgaggttg ttggcccagc 5820tccatacacc cagtagaaca
gtggaacaac tcatgcttca tgctgccaag ctgctgtact 5880tcaaaggaaa
cagatctagc acactgctgc acccctgctt ccacactcca cacttcaccc
5940cgctgctttt ctctgacccg cccctggcct tgtaagactc acgtaagcta
agtccaggat 6000gcctgtggcc tgcggcttga ttcttccctt taggattcag
caagttaatg gcttcctcgc 6060tatagaagtg agactttgac ttgatgcctc
ttggtatatc aaaaagatat tcatccagaa 6120agtaccaaat gttctgaaag
acccgctctt cactccagtt ttccctaggg tgtttctggc 6180agggcgtttt
taaaaggcat ctacctgagt tgacgctaat acttgtcacc acctggaacg
6240tagttatcgg tcggcaggct gaacatactc cagattcccc agaggccact
tctgtagccc 6300agcgatgcat ctgagcctct ctgcgtggtt tatgcttgaa
aaatagataa tgcttttaga 6360tggttcactg ccaggccatg ggccccacac
atctcaggcc ctgtgtgagg gagcacactg 6420agatggtgca ggagtgaatg
ggcatggctt ggcctcgcta cctcggggac ctgttggagt 6480tctggcagca
gggtgtctgc aggtgggacg gcgttctggg cagagtcaga atggtcagaa
6540tgaaacagaa cagccaactc acccacagga cagcttattt tgaggcaagg
ttttggattt 6600tggaggaagc agccagatga ggcggtgagc ctccagaagg
tcagcctttg gagcacgtaa 6660gatactgtta cagggtccag aaatcgtgtt
cacatggggg ctttgactct tcaaacagct 6720tttgcagatc gtaaattgca
tttgcctagt cgtgtgacct caaaagaagt cagacatatt 6780taatccagaa
atagtttcgt ttgagggagg gcttgcaggt ctgtaaatag catttgcttt
6840cctggttaga gattgggatg cagaaggagt tttcagtatt ttttttaaaa
cactaatgat 6900cattgaagag tatttatgta aacatacaac gtataatggg
tgggggatcc gatcatggtg 6960atgtacgggg tgaattctct tgccgtgttg
caaatgtgta aaataaagat tatctggc 7018107353DNAHomo sapiens
10attattgcaa ccatttcgct tgtatttgag tgtgaagcgc ctagaaaacc acaggacccc
60tacggcgagc cgggaatttt tagatatttt cctccgagtc aacgctcagt gaaatcagtt
120caatcagtgg cccgacaccg tgcggctgac acagcttatc ccccgaccct
gagatcaggg 180gtccccggag cccaaggtcg cttccaaagc tcagcgaggc
ggaggtgcgg cccgggctgg 240tctggttcgg ccaccgttgt tatggcaacc
gccaataggt tggcttcatc tctaactgaa 300agtctgcaca ggagcggccg
atcgaagggc cccgaggtgg gcagcacagg ctcctcgacg 360acttcctagg
tcgcaatctc caggaaaacg accacagggt cagcggagct agccgccgag
420ccccgctccc cgggcccttc cggcggctgc gccctttcac cccggacgtg
ggcgggagag 480gaagcggctg gtgatgctgg aacaaacatg gccgctctgg
cgcccgtcgg ctcccccgcc 540tcccgcggtc ctaggctggc cgcgggcctc
cggctgctcc caatgctggg tttgctgcag 600ttgctggccg agcctggcct
gggccgcgtc catcacctgg cactcaagga tgatgtgagg 660cataaagttc
atctgaacac ctttggcttc ttcaaggatg ggtacatggt ggtgaatgtc
720agtagcctct cactgaatga gcctgaagac aaggatgtga ctattggatt
tagcctagac 780cgtacaaaga atgatggctt ttcttcttac ctggatgaag
atgtgaatta ctgtatttta 840aagaaacagt ctgtctctgt caccctttta
atcctagaca tctccagaag tgaggtaaga 900gtaaagtctc caccagaagc
tggtacccag ttaccaaaga tcatcttcag cagggatgag 960aaagtccttg
gtcagagcca ggagcctaat gttaaccctg cttcagcagg caaccagacc
1020cagaagacac aagatggtgg aaagtctaaa agaagtacag tggattcaaa
ggccatggga 1080gagaaatcct tttctgttca taataatggt ggggcagtgt
catttcagtt tttctttaac 1140atcagcactg atgaccaaga aggcctttac
agtctttatt ttcataaatg ccttggaaaa 1200gaattgccaa gtgacaagtt
tacattcagc cttgatattg agatcacaga gaagaatcct 1260gacagctacc
tctcagcagg agaaattcct ctccccaaat tatacatctc aatggccttt
1320ttcttctttc tttctgggac catctggatt catatccttc gaaaacgacg
gaatgatgta 1380tttaaaatcc actggctgat ggcggccctt cctttcacca
agtctctttc cttggtgttc 1440catgcaattg actaccacta catctcctcc
cagggcttcc ctatcgaagg ctgggctgtt 1500gtgtactaca taactcacct
tttgaaaggg gcgctactct tcatcaccat tgcactcatt 1560ggcactggct
gggctttcat taagcacatc ctttctgata aagacaaaaa gatcttcatg
1620attgtcattc cactccaggt cctggcaaat gtagcctaca tcatcataga
gtccaccgag 1680gagggcacga ctgaatatgg cttgtggaag gactctctat
ttctggtcga cctgttgtgt 1740tgtggtgcca tcctcttccc agtggtgtgg
tcaatcagac atttacaaga agcatcagca 1800acagatggaa aaggtgacag
catgggacct cttcagcaga gagcgaatct aagagcagga 1860agtcgcatag
agtctcacca ttttgcccag gctgatcttg aactcctggc ctctagctgt
1920cctcctgcct cagtctccca aagggctggg attacagctg ctattaactt
agcaaagctg 1980aaacttttca gacattatta cgtcttgatt gtgtgttaca
tatacttcac taggatcatt 2040gcatttctcc tcaaactcgc tgttccattc
cagtggaagt ggctctacca gctcctggat 2100gaaacggcca cactggtctt
ctttgttcta acggggtata aattccgtcc ggcttcagat 2160aacccctacc
tacaactttc tcaggaagaa gaagacttgg aaatggagtc cgttgtgaca
2220acatctgggg tgatggaaag tatgaagaaa gtcaagaagg tgaccaacgg
ctccgtggag 2280ccccagggcg agtgggaagg cgccgtgtga cagagccgac
cctgaggatg gcactgtcca 2340aggaaactgt taacttattc atagtcctat
tggacagcag gagcagctcc tacagtgaac 2400tattggcacc accgacagtg
acaccagggc acatggctgg agcacagtgc cgcggaaacc 2460tgattttgta
ctctctttta tggaaacgat ctgtggctgt ttagaggcag ctggatcctc
2520tttcaggcgg gaatgggagg gcgggcacag ggaggaggag aggaagagaa
aaggaagaat 2580tcatttttaa tttaggtttc tttttttctt cttcatttcg
gagctctaag gtgtatgcag 2640ttgtgacccc atgtgtgggg aagtgtagca
aggacggctg gtggaggggg aaggagggtg 2700cgaggtgtct gtctgatgct
ttaggaaatg tctactgagg accctgggac ttaagaagaa 2760gggcggggag
agtgccattg cctgtttggg agacaaaaat gaacgaaaac aggtgacttt
2820ggaaagcaaa gtcaaaaccc agtttaggat gtagcacctg ccccaggatt
cctgccctcg 2880gctttgcccc agacccttat tccagatgct gagagtgacc
aggacagcag ctcctgaggc 2940ccagtggtct tctttccaac aggaaaagaa
ggctgtgatg tcgctgtcag gatcatgccc 3000tgtggcacag cacaggtggt
gggaggtggt tttctgactg agatgttgcc tgatggatgg 3060aaagaaatgt
atttttaagt tcaaaaagca ttatcctgtg gcgttgcctg gacatccact
3120ccctgacagc ccagagcagc actgtctggc ttcccttcat gcttgtggct
ttgttgtgtt 3180tgatcagaat tttgggggaa atggaaagtt ttcctcaagg
agcagctggg ggcagaatag 3240gtagtattta agcaaatact taagtccaag
caaatcatcc ccattaaaaa gcttttcctg 3300taggctagta ggatttctaa
atagatgaat tcaacagact tggtccccat agtccaagag 3360tatgtatgtg
aagaaagtga gcatgattca acagtttcac tctcagggat tttaggatgg
3420caaaatactt cacagaaact caatgattaa gttcccttcc acacttccag
agcttgaatg 3480aacacaggta gccacctaaa ttgagcagta ttgcaactca
gagagaaaat catctgaata 3540gtaggacaag ctcagaaggt acattgtgac
tgagggctta aaaggagacc aaaacatggc 3600cccatcaggg aagcttctta
atgcttgggg ggccagctag gtagggttgc ttccaaaagc 3660tggagcccac
ccctgcctag gggttgtcag agagccacac ctgcagggga acaggtacct
3720ccgagggtga gagtcgtggt ctctgggagt tgttttctca cctctggctt
agaagggtca 3780ggcagaaacc acaggatgtg gggtcacact cactgtccca
agtttgggaa cctgaaaaag 3840tctccattca gaacatggtt gttctccctg
tcccatgcta tcttatcttc ctaaatgact 3900aatgaggaag cgggtgttct
ttttctgcac tttgattcgc catctgggtt ctgtagggtg 3960ctctgaaggt
gtgatctgcc ttctggctga tgtggaggaa gagcaagcgc cttcccaggc
4020cacagctgct cacctctcgg cagatatttt aggcaagcat ccgtgtgtct
tcccatcttc 4080aggagaaagg taaatgcacc ctaagtgttc acttctggac
ctttttcaag ttcacttggg 4140actgtgtgac agaagggagt tggagggagg
atgggaatat ttttaacact ttgttttcct 4200gtgcagaaac ataataccag
ttttcgcaga aatgtgtctc aatctgtgac taccaaagcc 4260ctcctcagtc
cttccctcag agggacacat ttgctgtttc tcccgcaagc agatgttgtg
4320gatgaggcga tagactcctt ggcaagaacg aaaggtgtga tgaaacctcc
ctgctcggaa 4380gggtctccgt ggaggtgtcc tcatttcaca tgctgggttt
tgcaagcgag gaagccaggc 4440agtggaggaa ctagagagag gcaggcgtgt
gtgtggacaa gcgctggagc cgcagccctc 4500agactggcac gggaacgcca
gcgttgggtg ttcagattcc acgcgtatgt ctgggctcac 4560tcacagcatg
gccgagtgtc tgcagtgctg gtcctgaccc ttccagagca gcagtggaca
4620gatgagataa gactgtttca gaaacaaaga tggccacagc cttcctaaca
agcaggtcat 4680ctggccatgt ctgtattgta actggtaaaa ggcttcaagt
cagattgatg atcaagaaaa 4740gtcaaaaccc cagcccaaga ttgggaaagc
aggtggtggt tccaagcttt taaaaaatta 4800ttgaagctct ccatcctgtt
ctgtgagtgt gtcttctctt tctccttcac gtcatagccg 4860tgacccaccg
ttcatctctg ctcttgcgta aagatgaccg atggagtcca aagccaagtg
4920gcttcaccag ctgacaagcc accctcctgc agcctgagtt tcacagtcca
ctgggttcgt 4980tgtcatgcgg tgtttgaatg gttaagccct tgcagtattt
cagatcgggc aaaaaatatc 5040ggatgcacat agcagaacca ttggtggtat
ttatagcttt gctttgtact cctcactgtt 5100tctgcctacg caaaatatcc
atgtttcctc tgagaaatct gttgtggact gaaagcgctg 5160ctggctgtga
aatttaataa agtgtgtatg ctttgctaga aaattatttc ttggacaata
5220ggaacagtca ttgatctgta aatcctggct cttaacagtg agtggccaag
gacttgatca 5280gcccatttct tggtccctca gtgctttaaa atttaagtag
cactgcattt tgtaatgttg 5340aatatgactc tagtgacttg taggaggcac
ttgtgaggag atgcttgctt cagtgtaaaa 5400gatgctcatg gcctgagtca
gttgagtttt ctttcaagaa accacttcag agtgaaatat 5460ccagggtttc
cccgccctgg acatgtccag cctgcccagg cagcacacag ccctgtaagt
5520ccacctcgtg tgggtgagat ttcctcctgc gtgatgacct catcgccatc
tctgctgtct 5580cattccacag cctccctccc tcttctctcc tcctctgccc
tcgcccttcc cccttcccca 5640tcccctcccc ctcctcctct gccctcgccc
ttacccctcc cccttcccct tccgctcctc 5700ctccctcctc cacctctttc
tcctcctcct tccctcctcc tccctcctct tcccttctct 5760gccatctttc
tccccgtgcc tattgatccc acataggctc attctgggta caccggctaa
5820aggctttggt gcattgcagc gttttctccc agcagctgtg tgaaagatgc
attttctaag 5880ctaaggagaa ttttctcaag agtggcatac tcatgccaaa
tattattgct ctgggccata 5940taggctggtc ttcctccaca ctaaaatggg
tgtcttgttt tggtacttaa aacagtctac 6000tccaggcatc cagtccttac
agaccaagga agagcatagc gatgcctgtt ggaattgcag 6060atgcattctg
gccttctccc ccgtcctgaa acattttctt tgaggaaggc tcttagaaca
6120ttagatagtc tgctgaggtt gttggcccag ctccatacac ccagtagaac
agtggaacaa 6180ctcatgcttc atgctgccaa gctgctgtac ttcaaaggaa
acagatctag cacactgctg 6240cacccctgct tccacactcc acacttcacc
ccgctgcttt tctctgaccc gcccctggcc 6300ttgtaagact cacgtaagct
aagtccagga tgcctgtggc ctgcggcttg attcttccct 6360ttaggattca
gcaagttaat ggcttcctcg ctatagaagt gagactttga cttgatgcct
6420cttggtatat caaaaagata ttcatccaga aagtaccaaa tgttctgaaa
gacccgctct 6480tcactccagt tttccctagg gtgtttctgg cagggcgttt
ttaaaaggca tctacctgag 6540ttgacgctaa tacttgtcac cacctggaac
gtagttatcg gtcggcaggc tgaacatact 6600ccagattccc cagaggccac
ttctgtagcc cagcgatgca tctgagcctc tctgcgtggt 6660ttatgcttga
aaaatagata atgcttttag atggttcact gccaggccat gggccccaca
6720catctcaggc cctgtgtgag ggagcacact gagatggtgc aggagtgaat
gggcatggct 6780tggcctcgct acctcgggga cctgttggag ttctggcagc
agggtgtctg caggtgggac 6840ggcgttctgg gcagagtcag aatggtcaga
atgaaacaga acagccaact cacccacagg 6900acagcttatt ttgaggcaag
gttttggatt ttggaggaag cagccagatg aggcggtgag 6960cctccagaag
gtcagccttt ggagcacgta agatactgtt acagggtcca gaaatcgtgt
7020tcacatgggg gctttgactc ttcaaacagc ttttgcagat cgtaaattgc
atttgcctag 7080tcgtgtgacc tcaaaagaag tcagacatat ttaatccaga
aatagtttcg tttgagggag 7140ggcttgcagg tctgtaaata gcatttgctt
tcctggttag agattgggat gcagaaggag 7200ttttcagtat tttttttaaa
acactaatga tcattgaaga gtatttatgt aaacatacaa 7260cgtataatgg
gtgggggatc cgatcatggt gatgtacggg gtgaattctc ttgccgtgtt
7320gcaaatgtgt aaaataaaga ttatctggca gaa 735311279PRTHomo sapiens
11Met Met Leu Met Gln Ala Leu Val Leu Phe Thr Leu Asp Ser Leu Asp 1
5 10 15 Met Leu Pro Ala Val Lys Ala Thr Trp Leu Tyr Gly Ile Gln Ile
Thr 20 25 30 Ser Leu Leu Leu Val Cys Ile Leu Gln Phe Phe Asn Ser
Met Ile Leu 35 40 45 Gly Ser Leu Leu Ile Ser Phe Asn Leu Ser Val
Phe Ile Ala Arg Lys 50 55 60 Leu Gln Lys Asn Leu Lys Thr Gly Ser
Phe Leu Asn Arg Leu Gly Lys 65 70 75 80 Leu Leu Leu His Leu Phe Met
Val Leu Cys Leu Thr Leu Phe Leu Asn 85 90 95 Asn Ile Ile Lys Lys
Ile Leu Asn Leu Lys Ser Asp Glu His Ile Phe 100 105 110 Lys Phe Leu
Lys Ala Lys Phe Gly Leu Gly Ala Thr Arg Asp Phe Asp 115 120 125 Ala
Asn Leu Tyr Leu Cys Glu Glu Ala Phe Gly Leu Leu Pro Phe Asn 130 135
140 Thr Phe Gly Arg Leu Ser Asp Thr Leu Leu Phe Tyr Ala Tyr Ile Phe
145 150 155 160 Val Leu Ser Ile Thr Val Ile Val Ala Phe Val Val Ala
Phe His Asn 165 170 175 Leu Ser Asp Ser Thr Asn Gln Gln Ser Val Gly
Lys Met Glu Lys Gly 180 185 190 Thr Val Asp Leu Lys Pro Glu Thr Ala
Tyr Asn Leu Ile His Thr Ile 195 200 205 Leu Phe Gly Phe Leu Ala Leu
Ser Thr Met Arg Met Lys Tyr Leu Trp 210 215 220 Thr Ser His Met Cys
Val Phe Ala Ser Phe Gly Leu Cys Ser Pro Glu 225 230 235 240 Ile Trp
Glu Leu Leu Leu Lys Ser Val His Leu Tyr Asn Pro Lys Arg 245 250 255
Ile Cys Ile Met Arg Tyr Ser Val Pro Ile Leu Ile Leu Leu Tyr Leu 260
265 270 Cys Tyr Lys Asn Gln Lys Ser 275 12102PRTHomo sapiens 12Met
Met Ser Ile Arg Gln Arg Arg Glu Ile Arg Ala Thr Glu Val Ser 1 5 10
15 Glu Asp Phe Pro Ala Gln Glu Glu Asn Val Lys Leu Glu Asn Lys Leu
20 25 30 Pro Ser Gly Cys Thr Ser Arg Arg Leu Trp Lys Ile Leu Ser
Leu Thr 35 40 45 Ile Gly Gly Thr Ile Ala Leu Cys Ile Gly Leu Leu
Thr Ser Val Tyr 50 55 60 Leu Ala Thr Leu His Glu Asn Asp Leu Trp
Phe Ser Asn Ile Lys Val 65 70 75 80 Trp Ser Phe Phe Asp His Cys Ile
Ile His Ser Val Gly Ser Pro Val 85 90 95 Val Ser His Val Asp Glu
100 13716PRTHomo sapiens 13Met Met Ser Ile Arg Gln Arg Arg Glu Ile
Arg Ala Thr Glu Val Ser 1 5 10 15 Glu Asp Phe Pro Ala Gln Glu Glu
Asn Val Lys Leu Glu Asn Lys Leu 20 25 30 Pro Ser Gly Cys Thr Ser
Arg Arg Leu Trp Lys Ile Leu Ser Leu Thr 35 40 45 Ile Gly Gly Thr
Ile Ala Leu Cys Ile Gly Leu Leu Thr Ser Val Tyr 50 55 60 Leu Ala
Thr Leu His Glu Asn Asp Leu Trp Phe Ser Asn Ile Lys Glu 65 70 75 80
Val Glu Arg Glu Ile Ser Phe Arg Thr Glu Cys Gly Leu Tyr Tyr Ser 85
90 95 Tyr Tyr Lys Gln Met Leu Gln Ala Pro Thr Leu Val Gln Gly Phe
His 100 105 110 Gly Leu Ile Tyr Asp Asn Lys Thr Glu Ser Met Lys Thr
Ile Asn Leu 115 120 125 Leu Gln Arg Met Asn Ile Tyr Gln Glu Val Phe
Leu Ser Ile Leu Tyr 130 135 140 Arg Val Leu Pro Ile Gln Lys Tyr Leu
Glu Pro Val Tyr Phe Tyr Ile 145 150 155 160 Tyr Thr Leu Phe Gly Leu
Gln Ala Ile Tyr Val Thr Ala Leu Tyr Ile 165 170 175 Thr Ser Trp Leu
Leu Ser Gly Thr Trp Leu Ser Gly Leu Leu Ala Ala 180 185 190 Phe Trp
Tyr Val Thr Asn Arg Ile Asp Thr Thr Arg Val Glu Phe Thr 195 200 205
Ile Pro Leu Arg Glu Asn Trp Ala Leu Pro Phe Phe Ala Ile Gln Ile 210
215 220 Ala Ala Ile Thr Tyr Phe Leu Arg Pro Asn Leu Gln Pro Leu Ser
Glu 225 230 235 240 Arg Leu Thr Leu Leu Ala Ile Phe Ile Ser Thr Phe
Leu Phe Ser Leu 245 250 255 Thr Trp Gln Phe Asn Gln Phe Met Met Leu
Met Gln Ala Leu Val Leu 260 265 270 Phe Thr Leu Asp Ser Leu Asp Met
Leu Pro Ala Val Lys Ala Thr Trp 275 280 285 Leu Tyr Gly Ile Gln Ile
Thr Ser Leu Leu Leu Val Cys Ile Leu Gln 290 295 300 Phe Phe Asn Ser
Met Ile Leu Gly Ser Leu Leu Ile Ser Phe Asn Leu 305 310 315 320 Ser
Val Phe Ile Ala Arg Lys Leu Gln Lys Asn Leu Lys Thr Gly Ser 325 330
335 Phe Leu Asn Arg Leu Gly Lys Leu Leu Leu His Leu Phe Met Val Leu
340 345 350 Cys Leu Thr Leu Phe Leu Asn Asn Ile Ile Lys Lys Ile Leu
Asn Leu 355 360 365 Lys Ser Asp Glu His Ile Phe Lys Phe Leu Lys Ala
Lys Phe Gly Leu 370 375 380 Gly Ala Thr Arg Asp Phe Asp Ala Asn Leu
Tyr Leu Cys Glu Glu Ala 385 390 395 400 Phe Gly Leu Leu Pro Phe Asn
Thr Phe Gly Arg Leu Ser Asp Thr Leu 405 410 415 Leu Phe Tyr Ala Tyr
Ile Phe Val Leu Ser Ile Thr Val Ile Val Ala 420 425 430 Phe Val Val
Ala Phe His Asn Leu Ser Asp Ser Thr Asn Gln Gln Ser 435 440 445 Val
Gly Lys Met Glu Lys Gly Thr Val Asp Leu Lys Pro Glu Thr Ala 450 455
460 Tyr Asn Leu Ile His Thr Ile Leu Phe Gly Phe Leu Ala Leu Ser Thr
465 470 475 480 Met Arg Met Lys Tyr Leu Trp Thr Ser His Met
Cys Val Phe Ala Ser 485 490 495 Phe Gly Leu Cys Ser Pro Glu Ile Trp
Glu Leu Leu Leu Lys Ser Val 500 505 510 His Leu Tyr Asn Pro Lys Arg
Ile Cys Ile Met Arg Tyr Ser Val Pro 515 520 525 Ile Leu Ile Leu Leu
Tyr Leu Cys Tyr Lys Phe Trp Pro Gly Met Met 530 535 540 Asp Glu Leu
Ser Glu Leu Arg Glu Phe Tyr Asp Pro Asp Thr Val Glu 545 550 555 560
Leu Met Asn Trp Ile Asn Ser Asn Thr Pro Arg Lys Ala Val Phe Ala 565
570 575 Gly Ser Met Gln Leu Leu Ala Gly Val Lys Leu Cys Thr Gly Arg
Thr 580 585 590 Leu Thr Asn His Pro His Tyr Glu Asp Ser Ser Leu Arg
Glu Arg Thr 595 600 605 Arg Ala Val Tyr Gln Ile Tyr Ala Lys Arg Ala
Pro Glu Glu Val His 610 615 620 Ala Leu Leu Arg Ser Phe Gly Thr Asp
Tyr Val Ile Leu Glu Asp Ser 625 630 635 640 Ile Cys Tyr Glu Arg Arg
His Arg Arg Gly Cys Arg Leu Arg Asp Leu 645 650 655 Leu Asp Ile Ala
Asn Gly His Met Met Asp Gly Pro Gly Glu Asn Asp 660 665 670 Pro Asp
Leu Lys Pro Ala Asp His Pro Arg Phe Cys Glu Glu Ile Lys 675 680 685
Arg Asn Leu Pro Pro Tyr Val Ala Tyr Phe Thr Arg Val Phe Gln Asn 690
695 700 Lys Thr Phe His Val Tyr Lys Leu Ser Arg Asn Lys 705 710 715
14112PRTHomo sapiens 14Met Met Ser Ile Arg Gln Arg Arg Glu Ile Arg
Ala Thr Glu Val Ser 1 5 10 15 Glu Asp Phe Pro Ala Gln Glu Glu Asn
Val Lys Leu Glu Asn Lys Leu 20 25 30 Pro Ser Gly Cys Thr Ser Arg
Arg Leu Trp Lys Ile Leu Ser Leu Thr 35 40 45 Ile Gly Gly Thr Pro
Phe Ala Leu Asp Phe Leu His Leu Ser Thr Leu 50 55 60 Pro Arg Tyr
Met Lys Met Ile Tyr Gly Phe Leu Ile Leu Arg Lys Trp 65 70 75 80 Ser
Glu Lys Ser His Ser Glu Gln Ser Val Ala Cys Ile Thr Pro Thr 85 90
95 Thr Ser Arg Cys Cys Arg Leu Gln Pro Ser Cys Lys Val Ile Thr Thr
100 105 110 151841DNAHomo sapiens 15aagtttgcgg agcggcttct
gctcgtcggc cgtgcggcga ggcagggcct gggctgcgac 60cccggcggcc gctcgcggtc
ttgggagagc tggggcgcgt gcctgaactt cccggctgcc 120cctgtccttg
gagacctacc tgatggggac gccaggtgtg caggggcgtg gcgcgtagga
180gtgatttgga gaacaatgca tgtaagtctg acatcatgat gtccatccgg
caaagaagag 240aaataagagc cacagaagtt tctgaagact ttccagccca
agaagaaaat gtgaagttgg 300aaaataaatt gccatctggt tgtaccagta
gaagattatg gaagattttg tcattgacaa 360ttggtggaac cattgccctt
tgcattggac ttcttacatc tgtctacctt gccacgttac 420atgaaaatga
tttatggttt tctaatatta aggaagtgga gcgagaaatc tcattcagaa
480cagagtgtgg cctgtattac tcctactaca agcagatgct gcaggctcca
accctcgtgc 540aaggttttca tggcctaata tatgataata aaactgaatc
tatgaagaca attaacctcc 600ttcagcgaat gaatatttac caagaggttt
ttctcagtat tttatataga gttctaccca 660tacagaaata tttagagcca
gtttattttt atatttacac cttatttggg ctccaggcga 720tctatgtcac
agctctctac ataaccagct ggctactcag tggtacatgg ctgtcaggac
780tgttggcagc tttctggtat gtcacaaata gaatagatac cacaagagtt
gagtttacca 840tcccactgag ggagaactgg gcgctgccat tctttgcaat
tcagatagca gcaattacat 900atttcctgag accaaactta cagcctcttt
ctgaaaggct gacacttctt gccattttca 960tatcaacttt tctctttagt
ctgacatggc aatttaatca atttatgatg ctgatgcaag 1020cattagtgct
gttcacactg gactccctgg acatgctgcc agcagtgaag gcgacatggc
1080tgtatggaat acagataaca agtttactcc tggtctgcat tcttcagttt
tttaattcca 1140tgattcttgg atcactgctt atcagtttta acctttcagt
attcattgca agaaaacttc 1200agaaaaatct gaaaactgga agcttcctta
ataggcttgg gaaacttttg ttacatttat 1260ttatggtttt atgtttgaca
ctttttctca acaacataat taagaaaatt cttaacctga 1320agtcagatga
acacatattt aaatttctga aggcaaaatt tgggcttgga gcaacaaggg
1380attttgatgc aaatctctat ctgtgtgaag aagcttttgg cctcctgcct
tttaatacat 1440ttggaaggct ttcagatact ctgctttttt atgcttacat
attcgttctg tccatcacag 1500tgattgtagc attcgttgtt gcctttcata
atctcagtga ttctacaaat caacaatccg 1560tgggtaaaat ggaaaaaggc
acagttgacc tgaaaccaga aactgcctac aacttaatac 1620ataccattct
gtttggattc ttggcattga gtacaatgag aatgaagtac ctctggacgt
1680cacacatgtg tgtgttcgca tcattcggcc tatgtagccc tgaaatatgg
gagttacttc 1740tgaagtcagt ccatctttat aacccaaaga ggatatgtat
aatgcgatat tcagtaccga 1800tattaatact gctgtatcta tgctataaga
atcagaaatc t 1841162206DNAHomo sapiens 16cggttctgcc ctccttgtac
ccgcggcgcg ctgcggcccg tggcgcggcc ccgttcccgc 60ctagccccgt cggcctcctt
cccctcccgg agccgcgcgt gaggacggct gaggccgcag 120gagtgatttg
gagaacaatg catgtaagtc tgacatcatg atgtccatcc ggcaaagaag
180agaaataaga gccacagaag tttctgaaga ctttccagcc caagaagaaa
atgtgaagtt 240ggaaaataaa ttgccatctg gttgtaccag tagaagatta
tggaagattt tgtcattgac 300aattggtgga accattgccc tttgcattgg
acttcttaca tctgtctacc ttgccacgtt 360acatgaaaat gatttatggt
tttctaatat taaggtatgg agtttctttg accattgtat 420cattcactca
gtgggatctc cagtagtaag ccatgtggat gaatgaccaa ggcaacacag
480ttttgccata aagaatccaa tctctagaaa ggttggacta tagagtgaaa
taacttttgt 540gtttattatt ttaaaataac atattagaat ctttttttaa
atttttcttt attatttatt 600tatttttgag atggagtctc actctgtcac
ccaggctgga gtgcggtggc gcaatcttgg 660ctcactacaa cctctgcctc
gcaggttcag gtgattcttc tggcttagcc tcccaagtag 720ctgggactat
aggtgcgtgc caccacaccc agctaatttt tgtattttta ctagagacgg
780ggtttcagca tattgaccag gctgatctcg aactcctgac cttgtgatct
gcctgtctca 840gcctcccaaa gtgctgggat tacaggcgtg agccactgcg
tccagccaga atctttattt 900ttcattttaa ttttttgaga tagggtattg
ctctgtcacc caggctagaa tgcagtggtg 960caaacatggg tcactgcagc
ctcaacctcc tgggctcaag tgagtatcct gcctaagctt 1020cctgtgtcac
tgggacccca ggcatgcacc acctcaccaa gctaaatttg atttttttgt
1080agagacaggg tctcactttg ttgcccatgc tggtctcgaa ctcctgggct
caagcgatcc 1140tactgccctg gtcttccaaa atatgagaat gagccatagc
acccagccca gaatttttat 1200aatcaagtga gttttttctt tttcattaac
ttattccatt tatttagcag ttattctaaa 1260ttagtatttt tcaagttata
gattgtgaaa ttagtgcagt aggtcatgag taacattttt 1320cttaatgaaa
tcaaaaagaa agaatactat cacatctagt agggttgagg attgttttgt
1380gaaactttta attttatata tatatatata tgcacaaact gggtcacagt
atacaaggta 1440cttccttttc ttttttttct tgttggctac aacaggaaaa
aaaaaaaaca gaaaaggaaa 1500taaaaaagcc actgctttaa atcatggggt
ctaaatgtgg ctccacagag ggtcctcagc 1560atgttcatga ctatctaata
ctctgtgcaa gtggttttgc agggcatagg gcgatgggga 1620agccatatgt
ttccagggaa aggaactgta attttaatca gattttcagg agggttagcc
1680gggcgtcacg cctgtaatcc cagcactttg ggaggtcgag gcgggcagat
cacttgaagt 1740caggagttca agaccagcct ggccaacatg gtggaaccct
atctctacta aaaatacaaa 1800aattagccgg gcatggtgac acacacctgt
aatctcagct actcaggagg ctgaggcaca 1860agaatcactt gaactcggga
ggaagaggtt gcagtgagct gagatcccac cactgcactc 1920cagcctgggc
aacagagcaa tactctttat caaaaaaaaa aagaaaaaag ttgagggggt
1980ggtctgtgac tctttaaaca cgtttccttg ttttctttct ctctctcttt
ttcaacattt 2040ctagaactcc tcttggcatt gttttcagaa ctcgtatata
acttacatgt ggaaatttgc 2100atccaaatat accttacatt ttaatctaat
atgtcatgat ctttaaccta aactgtggtg 2160tctaatgact agttgcttgt
aaaaataaac aaacaccttc aaagcc 2206174456DNAHomo sapiens 17aagtttgcgg
agcggcttct gctcgtcggc cgtgcggcga ggcagggcct gggctgcgac 60cccggcggcc
gctcgcggtc ttgggagagc tggggcgcgt gcctgaactt cccggctgcc
120cctgtccttg gagacctacc tgatggggac gccaggtgtg caggggcgtg
gcgcgtagga 180gtgatttgga gaacaatgca tgtaagtctg acatcatgat
gtccatccgg caaagaagag 240aaataagagc cacagaagtt tctgaagact
ttccagccca agaagaaaat gtgaagttgg 300aaaataaatt gccatctggt
tgtaccagta gaagattatg gaagattttg tcattgacaa 360ttggtggaac
cattgccctt tgcattggac ttcttacatc tgtctacctt gccacgttac
420atgaaaatga tttatggttt tctaatatta aggaagtgga gcgagaaatc
tcattcagaa 480cagagtgtgg cctgtattac tcctactaca agcagatgct
gcaggctcca accctcgtgc 540aaggttttca tggcctaata tatgataata
aaactgaatc tatgaagaca attaacctcc 600ttcagcgaat gaatatttac
caagaggttt ttctcagtat tttatataga gttctaccca 660tacagaaata
tttagagcca gtttattttt atatttacac cttatttggg ctccaggcga
720tctatgtcac agctctctac ataaccagct ggctactcag tggtacatgg
ctgtcaggac 780tgttggcagc tttctggtat gtcacaaata gaatagatac
cacaagagtt gagtttacca 840tcccactgag ggagaactgg gcgctgccat
tctttgcaat tcagatagca gcaattacat 900atttcctgag accaaactta
cagcctcttt ctgaaaggct gacacttctt gccattttca 960tatcaacttt
tctctttagt ctgacatggc aatttaatca atttatgatg ctgatgcaag
1020cattagtgct gttcacactg gactccctgg acatgctgcc agcagtgaag
gcgacatggc 1080tgtatggaat acagataaca agtttactcc tggtctgcat
tcttcagttt tttaattcca 1140tgattcttgg atcactgctt atcagtttta
acctttcagt attcattgca agaaaacttc 1200agaaaaatct gaaaactgga
agcttcctta ataggcttgg gaaacttttg ttacatttat 1260ttatggtttt
atgtttgaca ctttttctca acaacataat taagaaaatt cttaacctga
1320agtcagatga acacatattt aaatttctga aggcaaaatt tgggcttgga
gcaacaaggg 1380attttgatgc aaatctctat ctgtgtgaag aagcttttgg
cctcctgcct tttaatacat 1440ttggaaggct ttcagatact ctgctttttt
atgcttacat attcgttctg tccatcacag 1500tgattgtagc attcgttgtt
gcctttcata atctcagtga ttctacaaat caacaatccg 1560tgggtaaaat
ggaaaaaggc acagttgacc tgaaaccaga aactgcctac aacttaatac
1620ataccattct gtttggattc ttggcattga gtacaatgag aatgaagtac
ctctggacgt 1680cacacatgtg tgtgttcgca tcattcggcc tatgtagccc
tgaaatatgg gagttacttc 1740tgaagtcagt ccatctttat aacccaaaga
ggatatgtat aatgcgatat tcagtaccga 1800tattaatact gctgtatcta
tgctataagt tctggccagg aatgatggat gaactctccg 1860agttgagaga
attctatgat ccagatacag tggagctgat gaactggatt aactctaaca
1920ctccaagaaa ggctgtgttt gcgggaagca tgcagttgct ggccggagtc
aagctgtgca 1980cgggaaggac cctaaccaac cacccgcact atgaagacag
cagcctgaga gagcggacca 2040gagcggttta tcagatatat gccaagaggg
caccagagga agtgcatgcc ctcctaaggt 2100ccttcggcac tgactacgta
atcctggaag acagcatctg ctacgagcgg aggcaccgcc 2160ggggctgccg
actccgggac ctgctggaca ttgccaacgg ccacatgatg gatggcccag
2220gagagaatga tcctgatttg aaacctgcag accaccctcg cttctgtgaa
gagatcaaaa 2280gaaacctgcc tccctacgtg gcctacttca ccagagtgtt
ccagaacaaa accttccacg 2340tttacaagct gtccagaaac aagtagcgca
gatttctgcc cagtgtctat ttttgatacg 2400gagaaactgc atcatgatga
aactcaatag atgacgtttc ctatgtaagt aggtagccca 2460aaccttcaag
ctgtgatatg agtaagttct acagatgttt acacaagtgt tgccatcttt
2520gaaagcatct tctacaagca gaagtctttt tcgttgtgtg tctatctttc
tcattaatgt 2580tctttagcct aaatgttaac aactttctaa gagtgaccta
gaattatgtt gttggagaga 2640atgatgtgtg ttccatggat acctggatag
gcacataaca tgttggaaga tgagcacctg 2700ctcaggattt gaaatacgtt
taattttcag gtgacttaag acagctatga ttgaatcaac 2760tagagatgat
gatcgactta tttaatatga tttcactggt gaagaccaat tggtagcttt
2820ttaaaaagca ctttagtgtc ctgttttacc ttaaaatgtt ataatatttt
ccagttgtca 2880tgctgtcaac attaacaaaa aaaatcatgt taaggctttg
tatcaaacat tttgttacac 2940tctgtctgaa atgtaatgtg gagtacttca
gcagtatgtg tcatgtattg tgtgtgtctg 3000tgtgtgtgca tgtgcacaca
tgtgttttaa tgctgggcac agaaaagtgt tacaagttcc 3060atatcgtaag
tccttaaagg ggcagaaata tatgtagcca agtagaattt attacatttt
3120agtgttatta ttttaaaact tactgatact ctttaacctc tcctgcagta
atagttttgc 3180tttatttctt actcatttca atttattggg tttgcaaaat
tttgtaaact ttttgtgttt 3240ttagcctttg tattttttac agcctagaat
cttgcaaagt ctgaatattt tttaaatgtt 3300ctatcttaac tagttcacta
atacagtatt tttagcagac agcattttca gacagcattt 3360tcataccaag
tttgacttgt ggtctccaat cttactggga aggccctggt agtgtaattc
3420ttttccttat taaaaggtaa ccaagtgcct ctaagtcatg cttatttgta
aacaacaaag 3480aagagtatat gtacctgctc aaaatttttt tgataatcgc
ttatataatt aatttctaat 3540gatgaggaca tgtaaaagtt gccagtaaga
acatagtatg catttaatta aatcaagatg 3600gctaatggaa ttaactttct
cccctgttct tgccaggtgg aaatgattta agcatttctc 3660cttgcagttg
tattgaagta aattaccata ggcatcaaga tggctgcatc acattttcaa
3720atgattttat attcagttgc tacttataaa gcagcattca aaaagtcttt
tacactgtca 3780tgttggacac aagcagactc agcttttatc aaaacttgtt
taaataaaaa attgacagta 3840gctgggttat taaattatgc aactgaaact
cctgaattat atcttttctg tatcccttaa 3900taagattgga gaccactgcc
gtttaggata atacaataat aaaacgtttt aatcagtact 3960aaaactttaa
ttaagccaat aatgatgcat gcctgttgta gctgacagca tgggtcagta
4020catccttcag cgagtgcctt actctaattg aaaccaagca cacgtaaggt
acaatatgtt 4080agactctgtg attttgtttt caaaatcctc tgttatggct
atatttaaat ttattttaaa 4140tattcctgta tgtattcatc taagcatttg
ggcatttgga gtcttaatat acaagaaaca 4200cgtacttaaa tttttatgct
tatcaccgca atgatggcaa acagtgattt tttttttcat 4260agtttaggtg
tcattgttgc cagcaccttt agtgctcagt cttcagtgaa aaatataaag
4320tgccaaaaaa atcttgcaag acagaatcca tacttaacac tctttccaag
acactgtgac 4380catgtacagt agctatttcc tgatgaccaa atctctcaac
gaatcatgtt attaataaat 4440atttttagca ctcatc 445618336DNAHomo
sapiens 18atgatgtcca tccggcaaag aagagaaata agagccacag aagtttctga
agactttcca 60gcccaagaag aaaatgtgaa gttggaaaat aaattgccat ctggttgtac
cagtagaaga 120ttatggaaga ttttgtcatt gacaattggt ggaaccccct
ttgcattgga cttcttacat 180ctgtctacct tgccacgtta catgaaaatg
atttatggtt ttctaatatt aaggaagtgg 240agcgagaaat ctcattcaga
acagagtgtg gcctgtatta ctcctactac aagcagatgc 300tgcaggctcc
aaccctcgtg caaggtaatt acaact 33619173PRTHomo sapiens 19Met Thr Leu
Cys His Arg Asp Ser Phe Gly Ser Trp His Leu Phe His 1 5 10 15 Leu
Leu Leu Leu Glu Tyr Met Ile His Ile Leu Gln Ser Cys Leu Glu 20 25
30 Glu Glu Glu Glu Glu Glu Asp Met Gly Thr Val Lys Glu Met Leu Pro
35 40 45 Asp Asp Pro Thr Leu Gly Gln Pro Asp Gln Ala Leu Phe His
Ser Leu 50 55 60 Asn Ser Ser Leu Ser Gln Ala Cys Ala Ser Pro Ser
Met Glu Pro Leu 65 70 75 80 Gly Val Met Pro Thr His Met Gly Gln Gly
Arg Tyr Pro Val Gly Val 85 90 95 Ser Asn Met Val Leu Arg Ile Leu
Gly Phe Leu Val Asp Thr Ala Met 100 105 110 Gly Asn Lys Leu Ile Gln
Val Leu Leu Glu Asp Glu Thr Thr Glu Ser 115 120 125 Ala Val Lys Leu
Ser Leu Pro Met Gly Gln Glu Ala Leu Ile Thr Leu 130 135 140 Lys Asp
Gly Gln Gln Phe Val Ile Gln Ile Ser Asp Val Pro Gln Ser 145 150 155
160 Ser Glu Asp Ile Tyr Phe Arg Glu Asn Asn Ala Asn Val 165 170
201907DNAHomo sapiens 20ataactccaa cgctcaagca agtcaaggac acccacggac
tcaacaccgc gaccagattg 60gaaaaggtgt tggtcgacaa cttctgcatt tgcgaagagt
gcagcgtccc tcgctgtctc 120atgtatgaga tttacgtgga gacctgtggg
caaaacactg agaaccaagt caacccggcc 180acctttggga agcttgtgag
attggttttt ccggaccttg gcacccggag gctgggcact 240agaggaagtg
ccaggtatca ttatgatgga atctgtatca agaaaagctc tttcttctat
300gcccagtatt gctacctgat aggtgaaaaa aggtatcaca gtggagatgc
cattgccttt 360gaaaaatcta ctaattataa cagcattatc caacaagaag
caacatgtga agatcattca 420ccgatgaaga cagacccagt tggatcccct
ttgtctgaat tcaggagatg tccatttctg 480gagcaagaac aggcaaagaa
atactcctgt aatatgatgg ccttccttgc tgacgaatac 540tgcaactatt
gtcgagacat tttacgaaat gtgaggaact gagaacttga gagggtggag
600gacttgctta cttccttctg gaagtctctg cagcaagaca cagtcatgct
gatgtcattg 660cctgacgtgt gccagctctt taaatgctac gacgtccagc
tgtacaaggg aattgaggat 720gttctccttc atgacttctt ggaagatgtt
tctattcagt acctgaaatc tgtgcagtta 780tttagtaaga aatttaagct
gtggctcctt aatgctttgg aaggtgttcc agccctcttg 840cagatctcca
aactcaaaga ctatgcgaat ggtattgaaa agtaagaggc gtgtcagcgt
900tttgaagtca gatctacagg ccatcatcaa tcaaggcact ttggctactt
ctaagaaagc 960cctggcaagt gaccggagtg gcgcagatga actggagaac
aacccagaga tgaaatgttt 1020aagaaactta atttctttgc tgggaacatc
aacagatctc agggtattcc tcagctgtct 1080gtcttcacat ctccaagcat
ttgtgttcca gacaagcaga agcaaagaag agtttaccaa 1140attggccgcc
agcttccagc tgagatggaa tcttcttctc actgctgtaa gcaaagccat
1200gaccctctgc cacagagata gttttggctc ctggcatctg tttcacttgt
tgcttttgga 1260atatatgatt catatacttc agtcatgcct agaggaggaa
gaggaggagg aggacatggg 1320gactgtcaag gaaatgctac cagatgaccc
gactctcggc cagccagacc aggcactttt 1380ccattctctg aattcctcac
tgtcgcaggc gtgtgccagc cccagcatgg agccactggg 1440ggtgatgccc
acacacatgg gccagggccg atatcccgtg ggtgtgagca acatggtcct
1500caggatcctg ggcttcctgg tggacactgc catgggcaat aagctcatcc
aggtgctgtt 1560ggaagatgaa accactgaaa gcgcagttaa actcagcctt
cctatgggac aagaagccct 1620cataacccta aaagatggac aacaatttgt
gattcagata tcagatgtac cccaaagctc 1680tgaagatatt tatttcagag
aaaacaatgc taatgtgtga gattatttat ttgaatagag 1740aataagaaaa
ctgatagact tgcattctta aaaatattaa atactaaagt ttttctattg
1800acgaaagatg atgttatgta tataatagat gtagcattgt ctattttatg
tttatatgta 1860tttcaaggag gtggtttcga taaaatatgt aaactgattt ggagaat
1907211328PRTHomo sapiens 21Met Ser Ser Gly Asp Pro Ala His Leu Gly
Leu Cys Leu Trp Leu Trp 1 5 10 15 Leu Gly Ala Thr Leu Gly Arg Glu
Gln Val Gln Ala Ser Gly Leu Leu 20 25 30 Arg Leu Ala Val Leu Pro
Glu Asp Arg Leu Gln Met Lys Trp Arg Glu 35 40 45 Ser Glu Gly Ser
Gly Leu Gly Tyr Leu Val Gln Val Lys Pro Met Ala 50 55 60 Gly Asp
Ser Glu Gln Glu Val Ile Leu Thr Thr Lys Thr Pro Lys Ala 65 70 75 80
Thr Val Gly Gly Leu Ser Pro Ser Lys Gly Tyr Thr Leu Gln Ile Phe 85
90 95 Glu Leu Thr Gly Ser Gly Arg Phe Leu Leu Ala
Arg Arg Glu Phe Val 100 105 110 Ile Glu Asp Leu Lys Ser Ser Ser Leu
Asp Arg Ser Ser Gln Arg Pro 115 120 125 Leu Gly Ser Gly Ala Pro Glu
Pro Thr Pro Ser His Thr Gly Ser Pro 130 135 140 Asp Pro Glu Gln Ala
Ser Glu Pro Gln Val Ala Phe Thr Pro Ser Gln 145 150 155 160 Asp Pro
Arg Thr Pro Gly Gly Ser Glu Trp Arg Glu Thr Gly Pro Gln 165 170 175
Phe Arg Cys Leu Pro Pro Val Pro Ala Asp Met Val Phe Leu Val Asp 180
185 190 Gly Ser Trp Ser Ile Gly His Ser His Phe Gln Gln Val Lys Asp
Phe 195 200 205 Leu Ala Ser Val Ile Ala Pro Phe Glu Ile Gly Pro Asp
Lys Val Gln 210 215 220 Val Gly Leu Thr Gln Tyr Ser Gly Asp Ala Gln
Thr Glu Trp Asp Leu 225 230 235 240 Asn Ser Leu Ser Thr Lys Glu Gln
Val Leu Ala Ala Val Arg Arg Leu 245 250 255 Arg Tyr Lys Gly Gly Asn
Thr Phe Thr Gly Leu Ala Leu Thr His Val 260 265 270 Leu Gly Gln Asn
Leu Gln Pro Ala Ala Gly Leu Arg Pro Glu Ala Ala 275 280 285 Lys Val
Val Ile Leu Val Thr Asp Gly Lys Ser Gln Asp Asp Val His 290 295 300
Thr Ala Ala Arg Val Leu Lys Asp Leu Gly Val Asn Val Phe Ala Val 305
310 315 320 Gly Val Lys Asn Ala Asp Glu Ala Glu Leu Arg Leu Leu Ala
Ser Pro 325 330 335 Pro Arg Asp Ile Thr Val His Ser Val Leu Asp Phe
Leu Gln Leu Gly 340 345 350 Ala Leu Ala Gly Leu Leu Ser Arg Leu Ile
Cys Gln Arg Leu Gln Gly 355 360 365 Gly Ser Pro Arg Gln Gly Pro Ala
Ala Ala Pro Ala Leu Asp Thr Leu 370 375 380 Pro Ala Pro Thr Ser Leu
Val Leu Ser Gln Val Thr Ser Ser Ser Ile 385 390 395 400 Arg Leu Ser
Trp Thr Pro Ala Pro Arg His Pro Leu Lys Tyr Leu Ile 405 410 415 Val
Trp Arg Ala Ser Arg Gly Gly Thr Pro Arg Glu Val Val Val Glu 420 425
430 Gly Pro Ala Ala Ser Thr Glu Leu His Asn Leu Ala Ser Arg Thr Glu
435 440 445 Tyr Leu Val Ser Val Phe Pro Ile Tyr Glu Gly Gly Val Gly
Glu Gly 450 455 460 Leu Arg Gly Leu Val Thr Thr Ala Pro Leu Pro Pro
Pro Arg Ala Leu 465 470 475 480 Thr Leu Ala Ala Val Thr Pro Arg Thr
Val His Leu Thr Trp Gln Pro 485 490 495 Ser Ala Gly Ala Thr His Tyr
Leu Val Arg Cys Ser Pro Ala Ser Pro 500 505 510 Lys Gly Glu Glu Glu
Glu Arg Glu Val Gln Val Gly Arg Pro Glu Val 515 520 525 Leu Leu Asp
Gly Leu Glu Pro Gly Arg Asp Tyr Glu Val Ser Val Gln 530 535 540 Ser
Leu Arg Gly Pro Glu Gly Ser Glu Ala Arg Gly Ile Arg Ala Arg 545 550
555 560 Thr Pro Thr Leu Ala Pro Pro Arg His Leu Gly Phe Ser Asp Val
Ser 565 570 575 His Asp Ala Ala Arg Val Phe Trp Glu Gly Ala Pro Arg
Pro Val Arg 580 585 590 Leu Val Arg Val Thr Tyr Val Ser Ser Glu Gly
Gly His Ser Gly Gln 595 600 605 Thr Glu Ala Pro Gly Asn Ala Thr Ser
Ala Thr Leu Gly Pro Leu Ser 610 615 620 Ser Ser Thr Thr Tyr Thr Val
Arg Val Thr Cys Leu Tyr Pro Gly Gly 625 630 635 640 Gly Ser Ser Thr
Leu Thr Gly Arg Val Thr Thr Lys Lys Ala Pro Ser 645 650 655 Pro Ser
Gln Leu Ser Met Thr Glu Leu Pro Gly Asp Ala Val Gln Leu 660 665 670
Ala Trp Val Ala Ala Ala Pro Ser Gly Val Leu Val Tyr Gln Ile Thr 675
680 685 Trp Thr Pro Leu Gly Glu Gly Lys Ala His Glu Ile Ser Val Pro
Gly 690 695 700 Asn Leu Gly Thr Ala Val Leu Pro Gly Leu Gly Arg His
Thr Glu Tyr 705 710 715 720 Asp Val Thr Ile Leu Ala Tyr Tyr Arg Asp
Gly Ala Arg Ser Asp Pro 725 730 735 Val Ser Leu Arg Tyr Thr Pro Ser
Thr Val Ser Arg Ser Pro Pro Ser 740 745 750 Asn Leu Ala Leu Ala Ser
Glu Thr Pro Asp Ser Leu Gln Val Ser Trp 755 760 765 Thr Pro Pro Leu
Gly Arg Val Leu His Tyr Trp Leu Thr Tyr Ala Pro 770 775 780 Ala Ser
Gly Leu Gly Pro Glu Lys Ser Val Ser Val Pro Gly Ala Arg 785 790 795
800 Ser His Val Thr Leu Pro Asp Leu Gln Ala Ala Thr Lys Tyr Arg Val
805 810 815 Leu Val Ser Ala Ile Tyr Ala Ala Gly Arg Ser Glu Ala Val
Ser Ala 820 825 830 Thr Gly Gln Thr Ala Cys Pro Ala Leu Arg Pro Asp
Gly Ser Leu Pro 835 840 845 Gly Phe Asp Leu Met Val Ala Phe Ser Leu
Val Glu Lys Ala Tyr Ala 850 855 860 Ser Ile Arg Gly Val Ala Met Glu
Pro Ser Ala Phe Gly Gly Thr Pro 865 870 875 880 Thr Phe Thr Leu Phe
Lys Asp Ala Gln Leu Thr Arg Arg Val Ser Asp 885 890 895 Val Tyr Pro
Ala Pro Leu Pro Pro Glu His Thr Ile Val Phe Leu Val 900 905 910 Arg
Leu Leu Pro Glu Thr Pro Arg Glu Ala Phe Ala Leu Trp Gln Met 915 920
925 Thr Ala Glu Asp Phe Gln Pro Leu Leu Gly Val Leu Leu Asp Ala Gly
930 935 940 Lys Lys Ser Leu Thr Tyr Phe His Arg Asp Pro Arg Ala Ala
Leu Gln 945 950 955 960 Glu Ala Thr Phe Asp Pro Gln Glu Val Arg Lys
Ile Phe Phe Gly Ser 965 970 975 Phe His Lys Val His Val Ala Val Gly
Arg Ser Lys Val Arg Leu Tyr 980 985 990 Val Asp Cys Arg Lys Val Ala
Glu Arg Pro Leu Gly Glu Met Gly Ser 995 1000 1005 Pro Pro Ala Ala
Gly Phe Val Thr Leu Gly Arg Leu Ala Lys Ala 1010 1015 1020 Arg Gly
Pro Arg Ser Ser Ser Ala Ala Phe Gln Leu Gln Met Leu 1025 1030 1035
Gln Ile Val Cys Ser Asp Thr Trp Ala Asp Glu Asp Arg Cys Cys 1040
1045 1050 Glu Leu Pro Ala Ser Arg Asp Gly Glu Thr Cys Pro Ala Phe
Val 1055 1060 1065 Ser Ala Cys Ser Cys Ser Ser Glu Thr Pro Gly Pro
Pro Gly Pro 1070 1075 1080 Gln Gly Pro Pro Gly Leu Pro Gly Arg Asn
Gly Thr Pro Gly Glu 1085 1090 1095 Gln Gly Phe Pro Gly Pro Arg Gly
Glu Pro Gly Pro Pro Gly Gln 1100 1105 1110 Met Gly Pro Glu Gly Pro
Gly Gly Gln Gln Gly Ser Pro Gly Thr 1115 1120 1125 Gln Gly Arg Ala
Val Gln Gly Pro Val Gly Pro Pro Gly Val Lys 1130 1135 1140 Gly Glu
Lys Gly Asp His Gly Leu Pro Gly Leu Gln Gly His Pro 1145 1150 1155
Gly His Gln Gly Ile Pro Gly Arg Val Gly Leu Gln Gly Pro Lys 1160
1165 1170 Gly Met Arg Gly Leu Glu Gly Thr Ala Gly Leu Pro Gly Pro
Gly 1175 1180 1185 Pro Arg Gly Phe Gln Gly Met Ala Gly Ala Arg Gly
Thr Ser Gly 1190 1195 1200 Glu Arg Gly Pro Pro Gly Thr Val Gly Pro
Thr Gly Leu Pro Gly 1205 1210 1215 Pro Lys Gly Glu Arg Gly Glu Lys
Gly Glu Pro Gln Ser Leu Ala 1220 1225 1230 Thr Leu Tyr Gln Leu Val
Ser Gln Ala Cys Glu Ser Ala Ile Gln 1235 1240 1245 Thr His Val Ser
Lys Phe Asp Ser Phe His Glu Asn Thr Arg Pro 1250 1255 1260 Pro Met
Pro Ile Leu Glu Gln Lys Leu Glu Pro Gly Thr Glu Pro 1265 1270 1275
Leu Gly Ser Pro Gly Thr Arg Ser Lys Ala Leu Val Pro Gly Glu 1280
1285 1290 Trp Gly Arg Gly Gly Arg His Leu Glu Gly Arg Gly Glu Pro
Gly 1295 1300 1305 Ala Val Gly Gln Met Gly Ser Pro Gly Gln Gln Gly
Ala Ser Thr 1310 1315 1320 Gln Gly Leu Trp Glu 1325 221278PRTHomo
sapiens 22Met Ser Ser Gly Asp Pro Ala His Leu Gly Leu Cys Leu Trp
Leu Trp 1 5 10 15 Leu Gly Ala Thr Leu Gly Arg Glu Gln Val Gln Ala
Ser Gly Leu Leu 20 25 30 Arg Leu Ala Val Leu Pro Glu Asp Arg Leu
Gln Met Lys Trp Arg Glu 35 40 45 Ser Glu Gly Ser Gly Leu Gly Tyr
Leu Val Gln Val Lys Pro Met Ala 50 55 60 Gly Asp Ser Glu Gln Glu
Val Ile Leu Thr Thr Lys Thr Pro Lys Ala 65 70 75 80 Thr Val Gly Gly
Leu Ser Pro Ser Lys Gly Tyr Thr Leu Gln Ile Phe 85 90 95 Glu Leu
Thr Gly Ser Gly Arg Phe Leu Leu Ala Arg Arg Glu Phe Val 100 105 110
Ile Glu Asp Leu Lys Ser Ser Ser Leu Asp Arg Ser Ser Gln Arg Pro 115
120 125 Leu Gly Ser Gly Ala Pro Glu Pro Thr Pro Ser His Thr Gly Ser
Pro 130 135 140 Asp Pro Glu Gln Ala Ser Glu Pro Gln Val Ala Phe Thr
Pro Ser Gln 145 150 155 160 Asp Pro Arg Thr Pro Gly Gly Ser Glu Trp
Arg Glu Thr Gly Pro Gln 165 170 175 Phe Arg Cys Leu Pro Pro Val Pro
Ala Asp Met Val Phe Leu Val Asp 180 185 190 Gly Ser Trp Ser Ile Gly
His Ser His Phe Gln Gln Val Lys Asp Phe 195 200 205 Leu Ala Ser Val
Ile Ala Pro Phe Glu Ile Gly Pro Asp Lys Val Gln 210 215 220 Val Gly
Leu Thr Gln Tyr Ser Gly Asp Ala Gln Thr Glu Trp Asp Leu 225 230 235
240 Asn Ser Leu Ser Thr Lys Glu Gln Val Leu Ala Ala Val Arg Arg Leu
245 250 255 Arg Tyr Lys Gly Gly Asn Thr Phe Thr Gly Leu Ala Leu Thr
His Val 260 265 270 Leu Gly Gln Asn Leu Gln Pro Ala Ala Gly Leu Arg
Pro Glu Ala Ala 275 280 285 Lys Val Val Ile Leu Val Thr Asp Gly Lys
Ser Gln Asp Asp Val His 290 295 300 Thr Ala Ala Arg Val Leu Lys Asp
Leu Gly Val Asn Val Phe Ala Val 305 310 315 320 Gly Val Lys Asn Ala
Asp Glu Ala Glu Leu Arg Leu Leu Ala Ser Pro 325 330 335 Pro Arg Asp
Ile Thr Val His Ser Val Leu Asp Phe Leu Gln Leu Gly 340 345 350 Ala
Leu Ala Gly Leu Leu Ser Arg Leu Ile Cys Gln Arg Leu Gln Gly 355 360
365 Gly Ser Pro Arg Gln Gly Pro Ala Ala Ala Pro Ala Leu Asp Thr Leu
370 375 380 Pro Ala Pro Thr Ser Leu Val Leu Ser Gln Val Thr Ser Ser
Ser Ile 385 390 395 400 Arg Leu Ser Trp Thr Pro Ala Pro Arg His Pro
Leu Lys Tyr Leu Ile 405 410 415 Val Trp Arg Ala Ser Arg Gly Gly Thr
Pro Arg Glu Val Val Val Glu 420 425 430 Gly Pro Ala Ala Ser Thr Glu
Leu His Asn Leu Ala Ser Arg Thr Glu 435 440 445 Tyr Leu Val Ser Val
Phe Pro Ile Tyr Glu Gly Gly Val Gly Glu Gly 450 455 460 Leu Arg Gly
Leu Val Thr Thr Ala Pro Leu Pro Pro Pro Arg Ala Leu 465 470 475 480
Thr Leu Ala Ala Val Thr Pro Arg Thr Val His Leu Thr Trp Gln Pro 485
490 495 Ser Ala Gly Ala Thr His Tyr Leu Val Arg Cys Ser Pro Ala Ser
Pro 500 505 510 Lys Gly Glu Glu Glu Glu Arg Glu Val Gln Val Gly Arg
Pro Glu Val 515 520 525 Leu Leu Asp Gly Leu Glu Pro Gly Arg Asp Tyr
Glu Val Ser Val Gln 530 535 540 Ser Leu Arg Gly Pro Glu Gly Ser Glu
Ala Arg Gly Ile Arg Ala Arg 545 550 555 560 Thr Pro Thr Leu Ala Pro
Pro Arg His Leu Gly Phe Ser Asp Val Ser 565 570 575 His Asp Ala Ala
Arg Val Phe Trp Glu Gly Ala Pro Arg Pro Val Arg 580 585 590 Leu Val
Arg Val Thr Tyr Val Ser Ser Glu Gly Gly His Ser Gly Gln 595 600 605
Thr Glu Ala Pro Gly Asn Ala Thr Ser Ala Thr Leu Gly Pro Leu Ser 610
615 620 Ser Ser Thr Thr Tyr Thr Val Arg Val Thr Cys Leu Tyr Pro Gly
Gly 625 630 635 640 Gly Ser Ser Thr Leu Thr Gly Arg Val Thr Thr Lys
Lys Ala Pro Ser 645 650 655 Pro Ser Gln Leu Ser Met Thr Glu Leu Pro
Gly Asp Ala Val Gln Leu 660 665 670 Ala Trp Val Ala Ala Ala Pro Ser
Gly Val Leu Val Tyr Gln Ile Thr 675 680 685 Trp Thr Pro Leu Gly Glu
Gly Lys Ala His Glu Ile Ser Val Pro Gly 690 695 700 Asn Leu Gly Thr
Ala Val Leu Pro Gly Leu Gly Arg His Thr Glu Tyr 705 710 715 720 Asp
Val Thr Ile Leu Ala Tyr Tyr Arg Asp Gly Ala Arg Ser Asp Pro 725 730
735 Val Ser Leu Arg Tyr Thr Pro Ser Thr Val Ser Arg Ser Pro Pro Ser
740 745 750 Asn Leu Ala Leu Ala Ser Glu Thr Pro Asp Ser Leu Gln Val
Ser Trp 755 760 765 Thr Pro Pro Leu Gly Arg Val Leu His Tyr Trp Leu
Thr Tyr Ala Pro 770 775 780 Ala Ser Gly Leu Gly Pro Glu Lys Ser Val
Ser Val Pro Gly Ala Arg 785 790 795 800 Ser His Val Thr Leu Pro Asp
Leu Gln Ala Ala Thr Lys Tyr Arg Val 805 810 815 Leu Val Ser Ala Ile
Tyr Ala Ala Gly Arg Ser Glu Ala Val Ser Ala 820 825 830 Thr Gly Gln
Thr Ala Cys Pro Ala Leu Arg Pro Asp Gly Ser Leu Pro 835 840 845 Gly
Phe Asp Leu Met Val Ala Phe Ser Leu Val Glu Lys Ala Tyr Ala 850 855
860 Ser Ile Arg Gly Val Ala Met Glu Pro Ser Ala Phe Gly Gly Thr Pro
865 870 875 880 Thr Phe Thr Leu Phe Lys Asp Ala Gln Leu Thr Arg Arg
Val Ser Asp 885 890 895 Val Tyr Pro Ala Pro Leu Pro Pro Glu His Thr
Ile Val Phe Leu Val 900 905 910 Arg Leu Leu Pro Glu Thr Pro Arg Glu
Ala Phe Ala Leu Trp Gln Met 915 920 925 Thr Ala Glu Asp Phe Gln Pro
Leu Leu Gly Val Leu Leu Asp Ala Gly 930 935 940 Lys Lys Ser Leu Thr
Tyr Phe His Arg Asp Pro Arg Ala Ala Leu Gln 945 950 955 960 Glu Ala
Thr Phe Asp Pro Gln Glu Val Arg Lys Ile Phe Phe Gly Ser 965 970 975
Phe His Lys Val His Val Ala Val Gly Arg Ser Lys Val Arg Leu Tyr 980
985 990 Val Asp Cys Arg Lys Val Ala Glu Arg Pro Leu Gly Glu Met Gly
Ser 995 1000 1005 Pro Pro Ala Ala Gly Phe Val Thr Leu Gly Arg Leu
Ala Lys Ala 1010 1015 1020 Arg Gly Pro Arg Ser Ser Ser Ala Ala Phe
Gln Leu Gln Met Leu 1025 1030 1035 Gln Ile Val Cys Ser Asp Thr Trp
Ala Asp Glu Asp Arg Cys Cys 1040 1045 1050 Glu Leu Pro Ala Ser Arg
Asp Gly
Glu Thr Cys Pro Ala Phe Val 1055 1060 1065 Ser Ala Cys Ser Cys Ser
Ser Glu Thr Pro Gly Pro Pro Gly Pro 1070 1075 1080 Gln Gly Pro Pro
Gly Pro Pro Gly Val Lys Gly Glu Lys Gly Asp 1085 1090 1095 His Gly
Leu Pro Gly Leu Gln Gly His Pro Gly His Gln Gly Ile 1100 1105 1110
Pro Gly Arg Val Gly Leu Gln Gly Pro Lys Gly Met Arg Gly Leu 1115
1120 1125 Glu Gly Thr Ala Gly Leu Pro Gly Pro Gly Pro Arg Gly Phe
Gln 1130 1135 1140 Gly Met Ala Gly Ala Arg Gly Thr Ser Gly Glu Arg
Gly Pro Pro 1145 1150 1155 Gly Thr Val Gly Pro Thr Gly Leu Pro Gly
Pro Lys Gly Glu Arg 1160 1165 1170 Gly Glu Lys Gly Glu Pro Gln Ser
Leu Ala Thr Leu Tyr Gln Leu 1175 1180 1185 Val Ser Gln Ala Cys Glu
Ser Ala Ile Gln Thr His Val Ser Lys 1190 1195 1200 Phe Asp Ser Phe
His Glu Asn Thr Arg Pro Pro Met Pro Ile Leu 1205 1210 1215 Glu Gln
Lys Leu Glu Pro Gly Thr Glu Pro Leu Gly Ser Pro Gly 1220 1225 1230
Thr Arg Ser Lys Ala Leu Val Pro Gly Glu Trp Gly Arg Gly Gly 1235
1240 1245 Arg His Leu Glu Gly Arg Gly Glu Pro Gly Ala Val Gly Gln
Met 1250 1255 1260 Gly Ser Pro Gly Gln Gln Gly Ala Ser Thr Gln Gly
Leu Trp Glu 1265 1270 1275 231283PRTHomo sapiens 23Met Ser Ser Gly
Asp Pro Ala His Leu Gly Leu Cys Leu Trp Leu Trp 1 5 10 15 Leu Gly
Ala Thr Leu Gly Arg Glu Gln Val Gln Ala Ser Gly Leu Leu 20 25 30
Arg Leu Ala Val Leu Pro Glu Asp Arg Leu Gln Met Lys Trp Arg Glu 35
40 45 Ser Glu Gly Ser Gly Leu Gly Tyr Leu Val Gln Val Lys Pro Met
Ala 50 55 60 Gly Asp Ser Glu Gln Glu Val Ile Leu Thr Thr Lys Thr
Pro Lys Ala 65 70 75 80 Thr Val Gly Gly Leu Ser Pro Ser Lys Gly Tyr
Thr Leu Gln Ile Phe 85 90 95 Glu Leu Thr Gly Ser Gly Arg Phe Leu
Leu Ala Arg Arg Glu Phe Val 100 105 110 Ile Glu Asp Leu Lys Ser Ser
Ser Leu Asp Arg Ser Ser Gln Arg Pro 115 120 125 Leu Gly Ser Gly Ala
Pro Glu Pro Thr Pro Ser His Thr Gly Ser Pro 130 135 140 Asp Pro Glu
Gln Ala Ser Glu Pro Gln Val Ala Phe Thr Pro Ser Gln 145 150 155 160
Asp Pro Arg Thr Pro Ala Gly Pro Gln Phe Arg Cys Leu Pro Pro Val 165
170 175 Pro Ala Asp Met Val Phe Leu Val Asp Gly Ser Trp Ser Ile Gly
His 180 185 190 Ser His Phe Gln Gln Val Lys Asp Phe Leu Ala Ser Val
Ile Ala Pro 195 200 205 Phe Glu Ile Gly Pro Asp Lys Val Gln Val Gly
Leu Thr Gln Tyr Ser 210 215 220 Gly Asp Ala Gln Thr Glu Trp Asp Leu
Asn Ser Leu Ser Thr Lys Glu 225 230 235 240 Gln Val Leu Ala Ala Val
Arg Arg Leu Arg Tyr Lys Gly Gly Asn Thr 245 250 255 Phe Thr Gly Leu
Ala Leu Thr His Val Leu Gly Gln Asn Leu Gln Pro 260 265 270 Ala Ala
Gly Leu Arg Pro Glu Ala Ala Lys Val Val Ile Leu Val Thr 275 280 285
Asp Gly Lys Ser Gln Asp Asp Val His Thr Ala Ala Arg Val Leu Lys 290
295 300 Asp Leu Gly Val Asn Val Phe Ala Val Gly Val Lys Asn Ala Asp
Glu 305 310 315 320 Ala Glu Leu Arg Leu Leu Ala Ser Pro Pro Arg Asp
Ile Thr Val His 325 330 335 Ser Val Leu Asp Phe Leu Gln Leu Gly Ala
Leu Ala Gly Leu Leu Ser 340 345 350 Arg Leu Ile Cys Gln Arg Leu Gln
Gly Gly Ser Pro Arg Gln Gly Pro 355 360 365 Ala Ala Ala Pro Ala Leu
Asp Thr Leu Pro Ala Pro Thr Ser Leu Val 370 375 380 Leu Ser Gln Val
Thr Ser Ser Ser Ile Arg Leu Ser Trp Thr Pro Ala 385 390 395 400 Pro
Arg His Pro Leu Lys Tyr Leu Ile Val Trp Arg Ala Ser Arg Gly 405 410
415 Gly Thr Pro Arg Glu Val Val Val Glu Gly Pro Ala Ala Ser Thr Glu
420 425 430 Leu His Asn Leu Ala Ser Arg Thr Glu Tyr Leu Val Ser Val
Phe Pro 435 440 445 Ile Tyr Glu Gly Gly Val Gly Glu Gly Leu Arg Gly
Leu Val Thr Thr 450 455 460 Ala Pro Leu Pro Pro Pro Arg Ala Leu Thr
Leu Ala Ala Val Thr Pro 465 470 475 480 Arg Thr Val His Leu Thr Trp
Gln Pro Ser Ala Gly Ala Thr His Tyr 485 490 495 Leu Val Arg Cys Ser
Pro Ala Ser Pro Lys Gly Glu Glu Glu Glu Arg 500 505 510 Glu Val Gln
Val Gly Arg Pro Glu Val Leu Leu Asp Gly Leu Glu Pro 515 520 525 Gly
Arg Asp Tyr Glu Val Ser Val Gln Ser Leu Arg Gly Pro Glu Gly 530 535
540 Ser Glu Ala Arg Gly Ile Arg Ala Arg Thr Pro Thr Leu Ala Pro Pro
545 550 555 560 Arg His Leu Gly Phe Ser Asp Val Ser His Asp Ala Ala
Arg Val Phe 565 570 575 Trp Glu Gly Ala Pro Arg Pro Val Arg Leu Val
Arg Val Thr Tyr Val 580 585 590 Ser Ser Glu Gly Gly His Ser Gly Gln
Thr Glu Ala Pro Gly Asn Ala 595 600 605 Thr Ser Ala Thr Leu Gly Pro
Leu Ser Ser Ser Thr Thr Tyr Thr Val 610 615 620 Arg Val Thr Cys Leu
Tyr Pro Gly Gly Gly Ser Ser Thr Leu Thr Gly 625 630 635 640 Arg Val
Thr Thr Lys Lys Ala Pro Ser Pro Ser Gln Leu Ser Met Thr 645 650 655
Glu Leu Pro Gly Asp Ala Val Gln Leu Ala Trp Val Ala Ala Ala Pro 660
665 670 Ser Gly Val Leu Val Tyr Gln Ile Thr Trp Thr Pro Leu Gly Glu
Gly 675 680 685 Lys Ala His Glu Ile Ser Val Pro Gly Asn Leu Gly Thr
Ala Val Leu 690 695 700 Pro Gly Leu Gly Arg His Thr Glu Tyr Asp Val
Thr Ile Leu Ala Tyr 705 710 715 720 Tyr Arg Asp Gly Ala Arg Ser Asp
Pro Val Ser Leu Arg Tyr Thr Pro 725 730 735 Ser Thr Val Ser Arg Ser
Pro Pro Ser Asn Leu Ala Leu Ala Ser Glu 740 745 750 Thr Pro Asp Ser
Leu Gln Val Ser Trp Thr Pro Pro Leu Gly Arg Val 755 760 765 Leu His
Tyr Trp Leu Thr Tyr Ala Pro Ala Ser Gly Leu Gly Pro Glu 770 775 780
Lys Ser Val Ser Val Pro Gly Ala Arg Ser His Val Thr Leu Pro Asp 785
790 795 800 Leu Gln Ala Ala Thr Lys Tyr Arg Val Leu Val Ser Ala Ile
Tyr Ala 805 810 815 Ala Gly Arg Ser Glu Ala Val Ser Ala Thr Gly Gln
Thr Ala Cys Pro 820 825 830 Ala Leu Arg Pro Asp Gly Ser Leu Pro Gly
Phe Asp Leu Met Val Ala 835 840 845 Phe Ser Leu Val Glu Lys Ala Tyr
Ala Ser Ile Arg Gly Val Ala Met 850 855 860 Glu Pro Ser Ala Phe Gly
Gly Thr Pro Thr Phe Thr Leu Phe Lys Asp 865 870 875 880 Ala Gln Leu
Thr Arg Arg Val Ser Asp Val Tyr Pro Ala Pro Leu Pro 885 890 895 Pro
Glu His Thr Ile Val Phe Leu Val Arg Leu Leu Pro Glu Thr Pro 900 905
910 Arg Glu Ala Phe Ala Leu Trp Gln Met Thr Ala Glu Asp Phe Gln Pro
915 920 925 Leu Leu Gly Val Leu Leu Asp Ala Gly Lys Lys Ser Leu Thr
Tyr Phe 930 935 940 His Arg Asp Pro Arg Ala Ala Leu Gln Glu Ala Thr
Phe Asp Pro Gln 945 950 955 960 Glu Val Arg Lys Ile Phe Phe Gly Ser
Phe His Lys Val His Val Ala 965 970 975 Val Gly Arg Ser Lys Val Arg
Leu Tyr Val Asp Cys Arg Lys Val Ala 980 985 990 Glu Arg Pro Leu Gly
Glu Met Gly Ser Pro Pro Ala Ala Gly Phe Val 995 1000 1005 Thr Leu
Gly Arg Leu Ala Lys Ala Arg Gly Pro Arg Ser Ser Ser 1010 1015 1020
Ala Ala Phe Gln Leu Gln Met Leu Gln Ile Val Cys Ser Asp Thr 1025
1030 1035 Trp Ala Asp Glu Asp Arg Cys Cys Glu Leu Pro Ala Ser Arg
Asp 1040 1045 1050 Gly Glu Thr Cys Pro Ala Phe Val Ser Ala Cys Ser
Cys Ser Ser 1055 1060 1065 Glu Thr Pro Gly Pro Pro Gly Pro Gln Gly
Pro Pro Gly Leu Pro 1070 1075 1080 Gly Arg Asn Gly Thr Pro Gly Glu
Gln Gly Phe Pro Gly Pro Arg 1085 1090 1095 Gly Pro Pro Gly Val Lys
Gly Glu Lys Gly Asp His Gly Leu Pro 1100 1105 1110 Gly Leu Gln Gly
His Pro Gly His Gln Gly Ile Pro Gly Arg Val 1115 1120 1125 Gly Leu
Gln Gly Pro Lys Gly Met Arg Gly Leu Glu Gly Thr Ala 1130 1135 1140
Gly Leu Pro Gly Pro Gly Pro Arg Gly Phe Gln Gly Met Ala Gly 1145
1150 1155 Ala Arg Gly Thr Ser Gly Glu Arg Gly Pro Pro Gly Thr Val
Gly 1160 1165 1170 Pro Thr Gly Leu Pro Gly Pro Lys Gly Glu Arg Gly
Glu Lys Gly 1175 1180 1185 Glu Pro Gln Ser Leu Ala Thr Leu Tyr Gln
Leu Val Ser Gln Ala 1190 1195 1200 Ser His Val Ser Lys Phe Asp Ser
Phe His Glu Asn Thr Arg Pro 1205 1210 1215 Pro Met Pro Ile Leu Glu
Gln Lys Leu Glu Pro Gly Thr Glu Pro 1220 1225 1230 Leu Gly Ser Pro
Gly Thr Arg Ser Lys Ala Leu Val Pro Gly Glu 1235 1240 1245 Trp Gly
Arg Gly Gly Arg His Leu Glu Gly Arg Gly Glu Pro Gly 1250 1255 1260
Ala Val Gly Gln Met Gly Ser Pro Gly Gln Gln Gly Ala Ser Thr 1265
1270 1275 Gln Gly Leu Trp Glu 1280 243987DNAHomo sapiens
24atgagctccg gagaccctgc acacctcggc ctctgcctct ggctgtggct gggcgccacc
60ctgggaagag agcaagttca agcaagcggt ctcctgaggc tggctgtgct gcctgaggac
120cggctgcaga tgaagtggag agagtcggag gggagcggcc tcggctacct
ggtgcaggtg 180aagcccatgg caggggactc ggaacaggag gtgatactga
ccaccaagac ccctaaggcc 240acagtggggg gcctgagccc ctccaagggc
tacaccttgc agatcttcga gctcactggc 300tctgggcgct tcctgctagc
tcggagggag tttgtgattg aggatctgaa gagtagctcc 360ctggacagga
gcagccagag gcccctcggc tctggagccc cggagcccac cccctcccac
420acggggagcc cagaccctga gcaggcttct gagccccaag ttgccttcac
accaagccag 480gatccgcgca ctcctggtgg gtcagagtgg agagagaccg
gcccccagtt ccgctgcctg 540ccccccgtgc ctgctgacat ggtcttcctg
gtggacgggt cctggagcat tggccacagt 600cacttccagc aggtcaagga
cttcctggcc agtgtcatcg caccctttga aatcgggccg 660gataaggtcc
aagtaggcct gactcagtac agcggggatg ctcagactga gtgggacctg
720aactccctca gcaccaagga acaggtgctg gcagctgtgc gccgcctccg
ctacaagggg 780gggaacacgt tcacaggcct tgccctgacc cacgtgctgg
ggcagaacct gcagccggcg 840gctggcctcc gtccagaggc agccaaggtg
gtgattctgg tgacggacgg caagtcccag 900gacgatgtgc acactgctgc
ccgtgtcctc aaggacctgg gcgtgaacgt cttcgctgtg 960ggtgtgaaga
acgccgatga ggctgagctg aggctcctgg cgtccccgcc gagggacatc
1020accgtccaca gcgtgctgga cttcctgcag ctcggcgcgc tggctggcct
gctcagccgt 1080ctcatctgcc agaggctcca gggtgggagc ccgcggcagg
gcccagcagc ggctccagcc 1140ctggacaccc tccctgcccc caccagcctg
gtcctgagcc aggtgacctc ctccagcatc 1200cgcctgtcct ggactccagc
cccccggcac cccctcaagt atctgatcgt ttggcgagcc 1260tctagaggtg
gcacccccag ggaggtggtg gtggagggac ccgccgcctc cacggagctg
1320cacaacctgg cctcccgcac agagtacctg gtctccgtgt tccccatcta
tgagggcggg 1380gttggcgaag gcctgcgggg cctggtgacc acagcacctc
tgcctccgcc ccgggcgctg 1440accctggccg cagtgacgcc cagaaccgtc
cacctcacct ggcagccctc ggccggggcc 1500acccactacc tggtgcgatg
ttctcctgct tcccccaagg gtgaagagga ggagcgagag 1560gtgcaggtcg
ggcggcccga ggtgctgctg gatggcctgg aacctggcag ggactatgag
1620gtctcggtgc agagcctgcg aggccctgag ggcagcgagg cccggggcat
ccgtgccagg 1680acccccaccc tggccccccc gagacacctg ggcttctcag
acgtgagcca cgacgcggca 1740cgagtgttct gggagggtgc cccgaggcct
gtgcgcctgg tcagggtcac ctatgtgtcc 1800agcgagggtg gacactcggg
gcagacagag gctcctggga acgccacctc ggccacgctg 1860gggcctctct
cttcctccac cacctacact gtccgtgtca cctgcctcta ccctgggggt
1920ggctcctcta cgctgactgg ccgggtgacc accaagaaag ctcccagccc
aagccagctg 1980tccatgacgg agctgccagg ggatgcagtc cagctggcgt
gggtggccgc agccccgtct 2040ggcgtgcttg tctaccagat cacgtggacg
cccctgggag aggggaaggc tcacgagatc 2100tctgtcccag ggaacctcgg
cacggccgtc ctgcctggcc tagggaggca cacagagtac 2160gacgtcacca
tcttggccta ctacagggac ggggcccgca gtgaccctgt gtccctccgc
2220tataccccct ccacggtgag caggagccca ccctccaacc tggccctggc
ctcggagacc 2280cccgacagcc tgcaggtcag ctggacgccc ccgcttggcc
gcgtgctcca ttactggctc 2340acctacgccc ccgcctctgg cttgggaccc
gagaaatccg tctctgtgcc aggagccagg 2400agccacgtga cactgcccga
cctgcaggca gccacgaagt acagggtcct ggtctcagct 2460atctatgcag
caggcaggag tgaggctgtg tctgccacgg gccagacagc ctgcccagcc
2520ctccgccctg acggctccct cccagggttt gacctgatgg tggccttcag
cctggtggaa 2580aaggcttatg cgtccatccg gggcgtggcc atggagccct
ctgccttcgg tgggaccccg 2640accttcacgc tcttcaagga cgcccagctg
acaagacggg tcagtgacgt ctacccagcc 2700cccctacctc cagagcacac
catcgtcttc cttgtgcgcc tacttcccga gacaccccgt 2760gaggccttcg
cgctgtggca gatgacagcc gaggacttcc agcccctcct tggggttctg
2820ctggatgccg ggaagaagtc cctgacctac ttccaccgtg accccagggc
tgccttgcag 2880gaggccacct tcgacccgca ggaagtgagg aagattttct
tcgggagctt ccacaaggtg 2940cacgtggctg tgggccgctc caaggtcagg
ctctatgtgg actgccggaa ggtggctgag 3000cggccccttg gggagatggg
cagcccaccc gctgcgggct tcgtcacgct ggggaggctg 3060gccaaggcca
ggggcccccg gagcagttcg gccgcgtttc agctccagat gctgcagatc
3120gtgtgcagtg acacctgggc cgatgaggac cggtgctgtg agctccctgc
ctcgagggat 3180ggagagacct gccccgcctt cgtgtctgcc tgttcctgtt
cctcagagac ccctgggccc 3240ccaggacctc aaggaccccc aggcctccct
gggaggaatg gcaccccagg agagcagggc 3300ttcccagggc ccaggggaga
gcccgggcca cccggacaga tgggaccaga aggtcctgga 3360ggccagcagg
gctcgccggg gacccagggc cgtgcagtcc aggggcctgt gggtccacca
3420ggggtcaaag gagagaaggg agaccatggg cttccaggct tgcagggcca
ccccggccac 3480cagggcatcc ccgggagagt tggcctccag ggaccaaagg
gaatgagagg cctggaggga 3540actgctggcc tgcctggacc cggccccagg
gggttccagg gcatggcagg ggccaggggc 3600actagtggag agcgaggacc
tccagggacc gtggggccca caggactgcc agggcccaaa 3660ggggaacgag
gagagaaggg cgagccgcag tcccttgcca ccctctacca gcttgtgagc
3720caggcctgtg agtctgccat tcagacacac gtgtcaaagt tcgactcctt
ccacgagaac 3780accaggcccc ccatgcccat cttggagcag aagctggagc
cgggcactga gcccctgggg 3840tcccctggca cccgcagcaa ggccctggtt
cctggagaat gggggcgtgg tggccgccac 3900cttgagggca gaggggagcc
tggagctgtt ggtcagatgg gcagccctgg gcagcagggg 3960gctagcaccc
agggcctctg ggagtga 3987253837DNAHomo sapiens 25atgagctccg
gagaccctgc acacctcggc ctctgcctct ggctgtggct gggcgccacc 60ctgggaagag
agcaagttca agcaagcggt ctcctgaggc tggctgtgct gcctgaggac
120cggctgcaga tgaagtggag agagtcggag gggagcggcc tcggctacct
ggtgcaggtg 180aagcccatgg caggggactc ggaacaggag gtgatactga
ccaccaagac ccctaaggcc 240acagtggggg gcctgagccc ctccaagggc
tacaccttgc agatcttcga gctcactggc 300tctgggcgct tcctgctagc
tcggagggag tttgtgattg aggatctgaa gagtagctcc 360ctggacagga
gcagccagag gcccctcggc tctggagccc cggagcccac cccctcccac
420acggggagcc cagaccctga gcaggcttct gagccccaag ttgccttcac
accaagccag 480gatccgcgca ctcctggtgg gtcagagtgg agagagaccg
gcccccagtt ccgctgcctg 540ccccccgtgc ctgctgacat ggtcttcctg
gtggacgggt cctggagcat tggccacagt 600cacttccagc aggtcaagga
cttcctggcc agtgtcatcg caccctttga aatcgggccg 660gataaggtcc
aagtaggcct gactcagtac agcggggatg ctcagactga gtgggacctg
720aactccctca gcaccaagga acaggtgctg gcagctgtgc gccgcctccg
ctacaagggg 780gggaacacgt tcacaggcct tgccctgacc cacgtgctgg
ggcagaacct gcagccggcg 840gctggcctcc gtccagaggc agccaaggtg
gtgattctgg tgacggacgg caagtcccag 900gacgatgtgc acactgctgc
ccgtgtcctc aaggacctgg gcgtgaacgt cttcgctgtg 960ggtgtgaaga
acgccgatga
ggctgagctg aggctcctgg cgtccccgcc gagggacatc 1020accgtccaca
gcgtgctgga cttcctgcag ctcggcgcgc tggctggcct gctcagccgt
1080ctcatctgcc agaggctcca gggtgggagc ccgcggcagg gcccagcagc
ggctccagcc 1140ctggacaccc tccctgcccc caccagcctg gtcctgagcc
aggtgacctc ctccagcatc 1200cgcctgtcct ggactccagc cccccggcac
cccctcaagt atctgatcgt ttggcgagcc 1260tctagaggtg gcacccccag
ggaggtggtg gtggagggac ccgccgcctc cacggagctg 1320cacaacctgg
cctcccgcac agagtacctg gtctccgtgt tccccatcta tgagggcggg
1380gttggcgaag gcctgcgggg cctggtgacc acagcacctc tgcctccgcc
ccgggcgctg 1440accctggccg cagtgacgcc cagaaccgtc cacctcacct
ggcagccctc ggccggggcc 1500acccactacc tggtgcgatg ttctcctgct
tcccccaagg gtgaagagga ggagcgagag 1560gtgcaggtcg ggcggcccga
ggtgctgctg gatggcctgg aacctggcag ggactatgag 1620gtctcggtgc
agagcctgcg aggccctgag ggcagcgagg cccggggcat ccgtgccagg
1680acccccaccc tggccccccc gagacacctg ggcttctcag acgtgagcca
cgacgcggca 1740cgagtgttct gggagggtgc cccgaggcct gtgcgcctgg
tcagggtcac ctatgtgtcc 1800agcgagggtg gacactcggg gcagacagag
gctcctggga acgccacctc ggccacgctg 1860gggcctctct cttcctccac
cacctacact gtccgtgtca cctgcctcta ccctgggggt 1920ggctcctcta
cgctgactgg ccgggtgacc accaagaaag ctcccagccc aagccagctg
1980tccatgacgg agctgccagg ggatgcagtc cagctggcgt gggtggccgc
agccccgtct 2040ggcgtgcttg tctaccagat cacgtggacg cccctgggag
aggggaaggc tcacgagatc 2100tctgtcccag ggaacctcgg cacggccgtc
ctgcctggcc tagggaggca cacagagtac 2160gacgtcacca tcttggccta
ctacagggac ggggcccgca gtgaccctgt gtccctccgc 2220tataccccct
ccacggtgag caggagccca ccctccaacc tggccctggc ctcggagacc
2280cccgacagcc tgcaggtcag ctggacgccc ccgcttggcc gcgtgctcca
ttactggctc 2340acctacgccc ccgcctctgg cttgggaccc gagaaatccg
tctctgtgcc aggagccagg 2400agccacgtga cactgcccga cctgcaggca
gccacgaagt acagggtcct ggtctcagct 2460atctatgcag caggcaggag
tgaggctgtg tctgccacgg gccagacagc ctgcccagcc 2520ctccgccctg
acggctccct cccagggttt gacctgatgg tggccttcag cctggtggaa
2580aaggcttatg cgtccatccg gggcgtggcc atggagccct ctgccttcgg
tgggaccccg 2640accttcacgc tcttcaagga cgcccagctg acaagacggg
tcagtgacgt ctacccagcc 2700cccctacctc cagagcacac catcgtcttc
cttgtgcgcc tacttcccga gacaccccgt 2760gaggccttcg cgctgtggca
gatgacagcc gaggacttcc agcccctcct tggggttctg 2820ctggatgccg
ggaagaagtc cctgacctac ttccaccgtg accccagggc tgccttgcag
2880gaggccacct tcgacccgca ggaagtgagg aagattttct tcgggagctt
ccacaaggtg 2940cacgtggctg tgggccgctc caaggtcagg ctctatgtgg
actgccggaa ggtggctgag 3000cggccccttg gggagatggg cagcccaccc
gctgcgggct tcgtcacgct ggggaggctg 3060gccaaggcca ggggcccccg
gagcagttcg gccgcgtttc agctccagat gctgcagatc 3120gtgtgcagtg
acacctgggc cgatgaggac cggtgctgtg agctccctgc ctcgagggat
3180ggagagacct gccccgcctt cgtgtctgcc tgttcctgtt cctcagagac
ccctgggccc 3240ccaggacctc aaggaccccc aggtccacca ggggtcaaag
gagagaaggg agaccatggg 3300cttccaggct tgcagggcca ccccggccac
cagggcatcc ccgggagagt tggcctccag 3360ggaccaaagg gaatgagagg
cctggaggga actgctggcc tgcctggacc tggccccagg 3420gggttccagg
gcatggcagg ggccaggggc actagtggag agcgaggacc tccagggacc
3480gtggggccca caggactgcc agggcccaaa ggggaacgag gagagaaggg
cgagccgcag 3540tcccttgcca ccctctacca gcttgtgagc caggcctgtg
agtctgccat tcagacacac 3600gtgtcaaagt tcgactcctt ccacgagaac
accaggcccc ccatgcccat cttggagcag 3660aagctggagc cgggcactga
gcccctgggg tcccctggca cccgcagcaa ggccctggtt 3720cctggagaat
gggggcgtgg tggccgccac cttgagggca gaggggagcc tggagctgtt
3780ggtcagatgg gcagccctgg gcagcagggg gctagcaccc agggcctctg ggagtga
3837264168DNAHomo sapiens 26ataagctcca gccttcctgt ggccacagca
ggaccagagt ggaccagcac accccaggag 60agaggactgg ggtcccagga gtaggaggag
cccgagcacc atgagctccg gagaccctgc 120acacctcggc ctctgcctct
ggctgtggct gggcgccacc ctgggaagag agcaagttca 180agcaagcggt
ctcctgaggc tggctgtgct gcctgaggac cggctgcaga tgaagtggag
240agagtcggag gggagcggcc tcggctacct ggtgcaggtg aagcccatgg
caggggactc 300ggaacaggag gtgatactga ccaccaagac ccctaaggcc
acagtggggg gcctgagccc 360ctccaagggc tacaccttgc agatcttcga
gctcactggc tctgggcgct tcctgctagc 420tcggagggag tttgtgattg
aggatctgaa gagtagctcc ctggacagga gcagccagag 480gcccctcggc
tctggagccc cggagcccac cccctcccac acggggagcc cagaccctga
540gcaggcttct gagccccaag ttgccttcac accaagccag gatccgcgca
ctcctgccgg 600cccccagttc cgctgcctgc cccccgtgcc tgctgacatg
gtcttcctgg tggacgggtc 660ctggagcatt ggccacagtc acttccagca
ggtcaaggac ttcctggcca gtgtcatcgc 720accctttgaa atcgggccgg
ataaggtcca agtaggcctg actcagtaca gcggggatgc 780tcagactgag
tgggacctga actccctcag caccaaggaa caggtgctgg cagctgtgcg
840ccgcctccgc tacaaggggg ggaacacgtt cacaggcctt gccctgaccc
acgtgctggg 900gcagaacctg cagccggcgg ctggcctccg tccagaggca
gccaaggtgg tgattctggt 960gacggacggc aagtcccagg acgatgtgca
cactgctgcc cgtgtcctca aggacctggg 1020cgtgaacgtc ttcgctgtgg
gtgtgaagaa cgccgatgag gctgagctga ggctcctggc 1080gtccccgccg
agggacatca ccgtccacag cgtgctggac ttcctgcagc tcggcgcgct
1140ggctggcctg ctcagccgtc tcatctgcca gaggctccag ggtgggagcc
cgcggcaggg 1200cccagcagcg gctccagccc tggacaccct ccctgccccc
accagcctgg tcctgagcca 1260ggtgacctcc tccagcatcc gcctgtcctg
gactccagcc ccccggcacc ccctcaagta 1320tctgatcgtt tggcgagcct
ctagaggtgg cacccccagg gaggtggtgg tggagggacc 1380cgccgcctcc
acggagctgc acaacctggc ctcccgcaca gagtacctgg tctccgtgtt
1440ccccatctat gagggcgggg ttggcgaagg cctgcggggc ctggtgacca
cagcacctct 1500gcctccgccc cgggcgctga ccctggccgc agtgacgccc
agaaccgtcc acctcacctg 1560gcagccctcg gccggggcca cccactacct
ggtgcgatgt tctcctgctt cccccaaggg 1620tgaagaggag gagcgagagg
tgcaggtcgg gcggcccgag gtgctgctgg atggcctgga 1680acctggcagg
gactatgagg tctcggtgca gagcctgcga ggccctgagg gcagcgaggc
1740ccggggcatc cgtgccagga cccccaccct ggcccccccg agacacctgg
gcttctcaga 1800cgtgagccac gacgcggcac gagtgttctg ggagggtgcc
ccgaggcctg tgcgcctggt 1860cagggtcacc tatgtgtcca gcgagggtgg
acactcgggg cagacagagg ctcctgggaa 1920cgccacctcg gccacgctgg
ggcctctctc ttcctccacc acctacactg tccgtgtcac 1980ctgcctctac
cctgggggtg gctcctctac gctgactggc cgggtgacca ccaagaaagc
2040tcccagccca agccagctgt ccatgacgga gctgccaggg gatgcagtcc
agctggcgtg 2100ggtggccgca gccccgtctg gcgtgcttgt ctaccagatc
acgtggacgc ccctgggaga 2160ggggaaggct cacgagatct ctgtcccagg
gaacctcggc acggccgtcc tgcctggcct 2220agggaggcac acagagtacg
acgtcaccat cttggcctac tacagggacg gggcccgcag 2280tgaccctgtg
tccctccgct ataccccctc cacggtgagc aggagcccac cctccaacct
2340ggccctggcc tcggagaccc ccgacagcct gcaggtcagc tggacgcccc
cgcttggccg 2400cgtgctccat tactggctca cctacgcccc cgcctctggc
ttgggacccg agaaatccgt 2460ctctgtgcca ggagccagga gccacgtgac
actgcccgac ctgcaggcag ccacgaagta 2520cagggtcctg gtctcagcta
tctatgcagc aggcaggagt gaggctgtgt ctgccacggg 2580ccagacagcc
tgcccagccc tccgccctga cggctccctc ccagggtttg acctgatggt
2640ggccttcagc ctggtggaaa aggcttatgc gtccatccgg ggcgtggcca
tggagccctc 2700tgccttcggt gggaccccga ccttcacgct cttcaaggac
gcccagctga caagacgggt 2760cagtgacgtc tacccagccc ccctacctcc
agagcacacc atcgtcttcc ttgtgcgcct 2820acttcccgag acaccccgtg
aggccttcgc gctgtggcag atgacagccg aggacttcca 2880gcccctcctt
ggggttctgc tggatgccgg gaagaagtcc ctgacctact tccaccgtga
2940ccccagggct gccttgcagg aggccacctt cgacccgcag gaagtgagga
agattttctt 3000cgggagcttc cacaaggtgc acgtggctgt gggccgctcc
aaggtcaggc tctatgtgga 3060ctgccggaag gtggctgagc ggccccttgg
ggagatgggc agcccacccg ctgcgggctt 3120cgtcacgctg gggaggctgg
ccaaggccag gggcccccgg agcagttcgg ccgcgtttca 3180gctccagatg
ctgcagatcg tgtgcagtga cacctgggcc gatgaggacc ggtgctgtga
3240gctccctgcc tcgagggatg gagagacctg ccccgccttc gtgtctgcct
gttcctgttc 3300ctcagagacc cctgggcccc caggacctca aggaccccca
ggcctccctg ggaggaatgg 3360caccccagga gagcagggct tcccagggcc
caggggtcca ccaggggtca aaggagagaa 3420gggagaccat gggcttccag
gcttgcaggg ccaccccggc caccagggca tccccgggag 3480agttggcctc
cagggaccaa agggaatgag aggcctggag ggaactgctg gcctgcctgg
3540acctggcccc agggggttcc agggcatggc aggggccagg ggcactagtg
gagagcgagg 3600acctccaggg accgtggggc ccacaggact gccagggccc
aaaggggaac gaggagagaa 3660gggcgagccg cagtcccttg ccaccctcta
ccagcttgtg agccaggcct cacacgtgtc 3720aaagttcgac tccttccacg
agaacaccag gccccccatg cccatcttgg agcagaagct 3780ggagccgggc
actgagcccc tggggtcccc tggcacccgc agcaaggccc tggttcctgg
3840agaatggggg cgtggtggcc gccaccttga gggcagaggg gagcctggag
ctgttggtca 3900gatgggcagc cctgggcagc agggggctag cacccagggc
ctctgggagt gacaggacat 3960tttctgcact gccccgagga acgctgagcc
ttcctccctg ggtttgtctg gacaccgaga 4020gcgaccacat cctggagaag
ccaggagaaa agctcaggaa gagcctgcag gtggaaggag 4080agggaagcag
cggcctcggc caaggcccac cccatactct tggctctgta gcatttccaa
4140gttcagataa acccctgagt gctcaccc 416827740PRTHomo sapiens 27Ala
Val Arg Ala Glu Pro Gly Leu Glu Glu Leu Ser Ser Gly Leu Arg 1 5 10
15 Ala His Ser Pro Ser Ala Thr Thr Val Cys Glu Pro Glu Ala Gln Gly
20 25 30 Ser Ala Ser Gly Cys Arg Tyr Ala Ala His Pro His Trp Gly
Leu Gly 35 40 45 Gly Ala Ala Ala Ala Gly Gly Ser Trp Glu Pro Gln
Pro Pro Arg Pro 50 55 60 Val Cys Glu Pro Ala Gly Arg Gly Lys Pro
His Pro Pro Ala Ala Pro 65 70 75 80 Arg Ser Pro Leu Leu Pro Gly Ser
Arg Arg Arg Pro Gln Ala Ala Gln 85 90 95 Pro Gly Ala Arg Ala Arg
Thr Ser Pro Pro Pro Ala Ser Ala Arg Asn 100 105 110 Met Ala Ala Arg
Pro Ala Ala Thr Leu Ala Trp Ser Leu Leu Leu Leu 115 120 125 Ser Ser
Ala Leu Leu Arg Glu Gly Cys Arg Ala Arg Phe Val Ala Glu 130 135 140
Arg Asp Ser Glu Asp Asp Gly Glu Glu Pro Val Val Phe Pro Glu Ser 145
150 155 160 Pro Leu Gln Ser Pro Thr Val Leu Val Ala Val Leu Ala Arg
Asn Ala 165 170 175 Ala His Thr Leu Pro His Phe Leu Gly Cys Leu Glu
Arg Leu Asp Tyr 180 185 190 Pro Lys Ser Arg Met Ala Ile Trp Ala Ala
Thr Asp His Asn Val Asp 195 200 205 Asn Thr Thr Glu Ile Phe Arg Glu
Trp Leu Lys Asn Val Gln Arg Leu 210 215 220 Tyr His Tyr Val Glu Trp
Arg Pro Met Asp Glu Pro Glu Ser Tyr Pro 225 230 235 240 Asp Glu Ile
Gly Pro Lys His Trp Pro Thr Ser Arg Phe Ala His Val 245 250 255 Met
Lys Leu Arg Gln Ala Ala Leu Arg Thr Ala Arg Glu Lys Trp Ser 260 265
270 Asp Tyr Ile Leu Phe Ile Asp Val Asp Asn Phe Leu Thr Asn Pro Gln
275 280 285 Thr Leu Asn Leu Leu Ile Ala Glu Asn Lys Thr Ile Val Ala
Pro Met 290 295 300 Leu Glu Ser Arg Gly Leu Tyr Ser Asn Phe Trp Cys
Gly Ile Thr Pro 305 310 315 320 Lys Ala Lys Gly Phe Tyr Lys Arg Thr
Pro Asp Tyr Val Gln Ile Arg 325 330 335 Glu Trp Lys Arg Thr Gly Cys
Phe Pro Val Pro Met Val His Ser Thr 340 345 350 Phe Leu Ile Asp Leu
Arg Lys Glu Ala Ser Asp Lys Leu Thr Phe Tyr 355 360 365 Pro Pro His
Gln Asp Tyr Thr Trp Thr Phe Asp Asp Ile Ile Val Phe 370 375 380 Ala
Phe Ser Ser Arg Gln Ala Gly Ile Gln Met Tyr Leu Cys Asn Arg 385 390
395 400 Glu His Tyr Gly Tyr Leu Pro Ile Pro Leu Lys Pro His Gln Thr
Leu 405 410 415 Gln Glu Asp Ile Glu Asn Leu Ile His Val Gln Ile Glu
Ala Met Ile 420 425 430 Asp Arg Pro Pro Met Glu Pro Ser Gln Tyr Val
Ser Val Val Pro Lys 435 440 445 Tyr Pro Asp Lys Met Gly Phe Asp Glu
Ile Phe Met Ile Asn Leu Lys 450 455 460 Arg Arg Lys Asp Arg Arg Asp
Arg Met Leu Arg Thr Leu Tyr Glu Gln 465 470 475 480 Glu Ile Glu Val
Lys Ile Val Glu Ala Val Asp Gly Lys Ala Leu Asn 485 490 495 Thr Ser
Gln Leu Lys Ala Leu Asn Ile Glu Met Leu Pro Gly Tyr Arg 500 505 510
Asp Pro Tyr Ser Ser Arg Pro Leu Thr Arg Gly Glu Ile Gly Cys Phe 515
520 525 Leu Ser His Tyr Ser Val Trp Lys Glu Val Ile Asp Arg Glu Leu
Glu 530 535 540 Lys Thr Leu Val Ile Glu Asp Asp Val Arg Phe Glu His
Gln Phe Lys 545 550 555 560 Lys Lys Leu Met Lys Leu Met Asp Asn Ile
Asp Gln Ala Gln Leu Asp 565 570 575 Trp Glu Leu Ile Tyr Ile Gly Arg
Lys Arg Met Gln Val Lys Glu Pro 580 585 590 Glu Lys Ala Val Pro Asn
Val Ala Asn Leu Val Glu Ala Asp Tyr Ser 595 600 605 Tyr Trp Thr Leu
Gly Tyr Val Ile Ser Leu Glu Gly Ala Gln Lys Leu 610 615 620 Val Gly
Ala Asn Pro Phe Gly Lys Met Leu Pro Val Asp Glu Phe Leu 625 630 635
640 Pro Val Met Tyr Asn Lys His Pro Val Ala Glu Tyr Lys Glu Tyr Tyr
645 650 655 Glu Ser Arg Asp Leu Lys Ala Phe Ser Ala Glu Pro Leu Leu
Ile Tyr 660 665 670 Pro Thr His Tyr Thr Gly Gln Pro Gly Tyr Leu Ser
Asp Thr Glu Thr 675 680 685 Ser Thr Ile Trp Asp Asn Glu Thr Val Ala
Thr Asp Trp Asp Arg Thr 690 695 700 His Ala Trp Lys Ser Arg Lys Gln
Ser Arg Ile Tyr Ser Asn Ala Lys 705 710 715 720 Asn Thr Glu Ala Leu
Pro Pro Pro Thr Ser Leu Asp Thr Val Pro Ser 725 730 735 Arg Asp Glu
Leu 740 28626PRTHomo sapiens 28Met Ala Ala Arg Pro Ala Ala Thr Leu
Ala Trp Ser Leu Leu Leu Leu 1 5 10 15 Ser Ser Ala Leu Leu Arg Glu
Gly Cys Arg Ala Arg Phe Val Ala Glu 20 25 30 Arg Asp Ser Glu Asp
Asp Gly Glu Glu Pro Val Val Phe Pro Glu Ser 35 40 45 Pro Leu Gln
Ser Pro Thr Val Leu Val Ala Val Leu Ala Arg Asn Ala 50 55 60 Ala
His Thr Leu Pro His Phe Leu Gly Cys Leu Glu Arg Leu Asp Tyr 65 70
75 80 Pro Lys Ser Arg Met Ala Ile Trp Ala Ala Thr Asp His Asn Val
Asp 85 90 95 Asn Thr Thr Glu Ile Phe Arg Glu Trp Leu Lys Asn Val
Gln Arg Leu 100 105 110 Tyr His Tyr Val Glu Trp Arg Pro Met Asp Glu
Pro Glu Ser Tyr Pro 115 120 125 Asp Glu Ile Gly Pro Lys His Trp Pro
Thr Ser Arg Phe Ala His Val 130 135 140 Met Lys Leu Arg Gln Ala Ala
Leu Arg Thr Ala Arg Glu Lys Trp Ser 145 150 155 160 Asp Tyr Ile Leu
Phe Ile Asp Val Asp Asn Phe Leu Thr Asn Pro Gln 165 170 175 Thr Leu
Asn Leu Leu Ile Ala Glu Asn Lys Thr Ile Val Ala Pro Met 180 185 190
Leu Glu Ser Arg Gly Leu Tyr Ser Asn Phe Trp Cys Gly Ile Thr Pro 195
200 205 Lys Gly Phe Tyr Lys Arg Thr Pro Asp Tyr Val Gln Ile Arg Glu
Trp 210 215 220 Lys Arg Thr Gly Cys Phe Pro Val Pro Met Val His Ser
Thr Phe Leu 225 230 235 240 Ile Asp Leu Arg Lys Glu Ala Ser Asp Lys
Leu Thr Phe Tyr Pro Pro 245 250 255 His Gln Asp Tyr Thr Trp Thr Phe
Asp Asp Ile Ile Val Phe Ala Phe 260 265 270 Ser Ser Arg Gln Ala Gly
Ile Gln Met Tyr Leu Cys Asn Arg Glu His 275 280 285 Tyr Gly Tyr Leu
Pro Ile Pro Leu Lys Pro His Gln Thr Leu Gln Glu 290 295 300 Asp Ile
Glu Asn Leu Ile His Val Gln Ile Glu Ala Met Ile Asp Arg 305 310 315
320 Pro Pro Met Glu Pro Ser Gln Tyr Val Ser Val Val Pro Lys Tyr Pro
325 330 335 Asp Lys Met Gly Phe Asp Glu Ile Phe Met Ile Asn Leu Lys
Arg Arg 340 345 350 Lys Asp Arg Arg Asp Arg Met Leu Arg Thr Leu Tyr
Glu Gln Glu Ile 355 360 365 Glu Val Lys Ile Val Glu Ala Val Asp Gly
Lys Ala Leu Asn Thr Ser 370 375 380 Gln Leu Lys Ala Leu Asn Ile Glu
Met Leu Pro Gly Tyr Arg Asp Pro 385 390 395 400 Tyr Ser Ser Arg Pro
Leu Thr Arg Gly Glu Ile Gly Cys Phe Leu Ser 405 410 415 His Tyr Ser
Val Trp Lys Glu Val Ile Asp Arg Glu Leu Glu Lys Thr 420 425 430 Leu
Val Ile Glu Asp Asp Val Arg Phe Glu His Gln Phe Lys Lys Lys 435 440
445 Leu Met Lys Leu Met Asp Asn Ile Asp Gln Ala Gln Leu Asp Trp Glu
450 455 460 Leu Ile Tyr Ile Gly Arg Lys Arg Met Gln Val Lys Glu Pro
Glu Lys 465 470
475 480 Ala Val Pro Asn Val Ala Asn Leu Val Glu Ala Asp Tyr Ser Tyr
Trp 485 490 495 Thr Leu Gly Tyr Val Ile Ser Leu Glu Gly Ala Gln Lys
Leu Val Gly 500 505 510 Ala Asn Pro Phe Gly Lys Met Leu Pro Val Asp
Glu Phe Leu Pro Val 515 520 525 Met Tyr Asn Lys His Pro Val Ala Glu
Tyr Lys Glu Tyr Tyr Glu Ser 530 535 540 Arg Asp Leu Lys Ala Phe Ser
Ala Glu Pro Leu Leu Ile Tyr Pro Thr 545 550 555 560 His Tyr Thr Gly
Gln Pro Gly Tyr Leu Ser Asp Thr Glu Thr Ser Thr 565 570 575 Ile Trp
Asp Asn Glu Thr Val Ala Thr Asp Trp Asp Arg Thr His Ala 580 585 590
Trp Lys Ser Arg Lys Gln Ser Arg Ile Tyr Ser Asn Ala Lys Asn Thr 595
600 605 Glu Ala Leu Pro Pro Pro Thr Ser Leu Asp Thr Val Pro Ser Arg
Asp 610 615 620 Glu Leu 625 29363PRTHomo sapiens 29Met Phe Ser Asp
Cys Trp Leu Cys Pro Ser Leu Gly Ser Ser Gly Ile 1 5 10 15 Gln Met
Tyr Leu Cys Asn Arg Glu His Tyr Gly Tyr Leu Pro Ile Pro 20 25 30
Leu Lys Pro His Gln Thr Leu Gln Glu Asp Ile Glu Asn Leu Ile His 35
40 45 Val Gln Ile Glu Ala Met Ile Asp Arg Pro Pro Met Glu Pro Ser
Gln 50 55 60 Tyr Val Ser Val Val Pro Lys Tyr Pro Asp Lys Met Gly
Phe Asp Glu 65 70 75 80 Ile Phe Met Ile Asn Leu Lys Arg Arg Lys Asp
Arg Arg Asp Arg Met 85 90 95 Leu Arg Thr Leu Tyr Glu Gln Glu Ile
Glu Val Lys Ile Val Glu Ala 100 105 110 Val Asp Gly Lys Ala Leu Asn
Thr Ser Gln Leu Lys Ala Leu Asn Ile 115 120 125 Glu Met Leu Pro Gly
Tyr Arg Asp Pro Tyr Ser Ser Arg Pro Leu Thr 130 135 140 Arg Gly Glu
Ile Gly Cys Phe Leu Ser His Tyr Ser Val Trp Lys Glu 145 150 155 160
Val Ile Asp Arg Glu Leu Glu Lys Thr Leu Val Ile Glu Asp Asp Val 165
170 175 Arg Phe Glu His Gln Phe Lys Lys Lys Leu Met Lys Leu Met Asp
Asn 180 185 190 Ile Asp Gln Ala Gln Leu Asp Trp Glu Leu Ile Tyr Ile
Gly Arg Lys 195 200 205 Arg Met Gln Val Lys Glu Pro Glu Lys Ala Val
Pro Asn Val Ala Asn 210 215 220 Leu Val Glu Ala Asp Tyr Ser Tyr Trp
Thr Leu Gly Tyr Val Ile Ser 225 230 235 240 Leu Glu Gly Ala Gln Lys
Leu Val Gly Ala Asn Pro Phe Gly Lys Met 245 250 255 Leu Pro Val Asp
Glu Phe Leu Pro Val Met Tyr Asn Lys His Pro Val 260 265 270 Ala Glu
Tyr Lys Glu Tyr Tyr Glu Ser Arg Asp Leu Lys Ala Phe Ser 275 280 285
Ala Glu Pro Leu Leu Ile Tyr Pro Thr His Tyr Thr Gly Gln Pro Gly 290
295 300 Tyr Leu Ser Asp Thr Glu Thr Ser Thr Ile Trp Asp Asn Glu Thr
Val 305 310 315 320 Ala Thr Asp Trp Asp Arg Thr His Ala Trp Lys Ser
Arg Lys Gln Ser 325 330 335 Arg Ile Tyr Ser Asn Ala Lys Asn Thr Glu
Ala Leu Pro Pro Pro Thr 340 345 350 Ser Leu Asp Thr Val Pro Ser Arg
Asp Glu Leu 355 360 30234PRTHomo sapiens 30Met Leu Pro Gly Tyr Arg
Asp Pro Tyr Ser Ser Arg Pro Leu Thr Arg 1 5 10 15 Gly Glu Ile Gly
Cys Phe Leu Ser His Tyr Ser Val Trp Lys Glu Val 20 25 30 Ile Asp
Arg Glu Leu Glu Lys Thr Leu Val Ile Glu Asp Asp Val Arg 35 40 45
Phe Glu His Gln Phe Lys Lys Lys Leu Met Lys Leu Met Asp Asn Ile 50
55 60 Asp Gln Ala Gln Leu Asp Trp Glu Leu Ile Tyr Ile Gly Arg Lys
Arg 65 70 75 80 Met Gln Val Lys Glu Pro Glu Lys Ala Val Pro Asn Val
Ala Asn Leu 85 90 95 Val Glu Ala Asp Tyr Ser Tyr Trp Thr Leu Gly
Tyr Val Ile Ser Leu 100 105 110 Glu Gly Ala Gln Lys Leu Val Gly Ala
Asn Pro Phe Gly Lys Met Leu 115 120 125 Pro Val Asp Glu Phe Leu Pro
Val Met Tyr Asn Lys His Pro Val Ala 130 135 140 Glu Tyr Lys Glu Tyr
Tyr Glu Ser Arg Asp Leu Lys Ala Phe Ser Ala 145 150 155 160 Glu Pro
Leu Leu Ile Tyr Pro Thr His Tyr Thr Gly Gln Pro Gly Tyr 165 170 175
Leu Ser Asp Thr Glu Thr Ser Thr Ile Trp Asp Asn Glu Thr Val Ala 180
185 190 Thr Asp Trp Asp Arg Thr His Ala Trp Lys Ser Arg Lys Gln Ser
Arg 195 200 205 Ile Tyr Ser Asn Ala Lys Asn Thr Glu Ala Leu Pro Pro
Pro Thr Ser 210 215 220 Leu Asp Thr Val Pro Ser Arg Asp Glu Leu 225
230 315188DNAHomo sapiens 31cacaactcgc ggctctaggg aaggccacag
acctcagctg tacgagcgga acctggacta 60gaggaacttt cctcaggact cagggcacac
agcccctcgg ccactactgt ctgcgagccc 120gaggcgcagg ggagcgccag
cggctgcagg tacgctgcgc acccgcactg gggcctgggc 180ggtgcagcgg
cggcgggagg gtcctgggaa ccgcagccgc cgaggccagt gtgtgagccg
240gccggccgcg gcaagccgca tcccccggcc gcccctcgca gccccttgct
ccccggcagc 300cgccgccgcc ctcaggcagc ccagccgggc gctcgcgcca
ggacttcccc gccgcccgcc 360tcggcccgga acatggctgc gcgccctgct
gccaccctcg cctggtcgct actgctcctc 420tcctcagccc tgctccgcga
aggctgccga gcgcgcttcg tcgccgagcg ggactcggag 480gacgacggag
aggagccggt ggttttcccg gagtcgcccc tgcagagccc cacggtgctc
540gtggcggtcc tcgcccgcaa cgcggcgcac acgctgccgc acttcctcgg
ctgcctggag 600cggctggact accccaagag caggatggcc atctgggcag
ccactgatca caatgtggat 660aatacaacag aaatattcag ggagtggttg
aaaaatgtac agagactcta tcactatgtg 720gagtggaggc ctatggatga
accagagtct taccctgatg aaattggacc aaagcactgg 780ccaacctccc
ggtttgccca tgtgatgaaa ctacgacagg cagcccttcg aactgcgagg
840gaaaaatggt cagactacat tctgttcata gatgttgaca atttcctgac
taatccacag 900accctcaatc tactgattgc agaaaacaaa actattgtgg
cccccatgct ggagtctcgg 960ggcctgtatt ctaatttctg gtgcggaatc
acccctaagg caaagggctt ctataagagg 1020accccagact acgttcagat
tcgagaatgg aagaggacag gctgcttccc cgtccccatg 1080gtccactcca
ccttcctaat tgacctcagg aaggaggcct cggacaagct gactttctac
1140cccccacacc aggactacac ctggaccttt gatgacatca ttgtctttgc
cttctccagc 1200aggcaagcag gcatccagat gtacctctgc aacagagagc
actatggcta cctgcccatc 1260cccctgaagc cccatcagac actgcaggaa
gacatcgaga acctcatcca tgtgcagatt 1320gaagcaatga ttgaccgtcc
tccaatggaa ccctcccagt atgtctcagt tgtccctaaa 1380tatccagaca
agatgggatt tgatgagatt ttcatgataa acctcaaacg cagaaaggac
1440aggcgggacc ggatgctgcg cacactgtat gaacaggaga ttgaggtcaa
gattgtcgag 1500gctgtggatg gaaaggcact caacacaagc cagctgaagg
cactgaatat tgaaatgctg 1560cctggctatc gagatcccta ttcctccagg
cctctaacaa ggggtgaaat cggctgcttt 1620ctcagccact actcagtctg
gaaagaggta attgatcgag agctagagaa gactcttgta 1680attgaagacg
atgtgcgttt tgagcatcag tttaagaaga agctgatgaa gctgatggat
1740aacattgacc aggctcagct ggactgggaa ctgatttata ttggtaggaa
gaggatgcaa 1800gtaaaggagc cagagaaagc agtgcccaat gtggcaaacc
tggtcgaagc cgactattcc 1860tactggaccc tgggctacgt catctctctg
gaaggagcac agaagctggt tggagccaat 1920ccttttggga agatgctgcc
agtggatgag tttctgccag tcatgtacaa caagcatccc 1980gtagccgagt
acaaggagta ttatgaatcc agggacctga aagccttctc tgcagaaccc
2040ttgctcatct accctacgca ctacacaggc cagccggggt acctgagtga
cacggagacc 2100tccaccatct gggacaatga gacagtggcc accgactggg
ataggacaca tgcctggaag 2160tcccggaagc aaagccgcat ctacagcaat
gccaagaaca cagaggccct gccaccgcca 2220acctccctgg acactgtgcc
ttcaagggat gagctatgaa ggctccctgg gagtgtggcc 2280cacatcagtt
caacatcctc tggtttttct aaagggctat tcatctgttt gctccagttt
2340tctgttttgt tttgttctta gtggtcacag tcatctaacc aaagtgatct
agtgtgatag 2400atcgaaatta acatattttt gaccatggaa ggaaataagg
aaattcaacc caaatttccc 2460aagacggctg aaagacaggt tttttggaaa
ctgttaagat aaactgtaat ccagacacct 2520aattcttcag ttcactactc
atgtgatact gattcccaca ttaaggttga acaacatggc 2580tcagagtctt
gttcaagaga aagtgatcac cgagctgtca catcagcaaa tatgtagtca
2640aggcagccag gccaactaga ccacacttat tggtctagtt tgtccgtttt
atatgacatt 2700gaaaacttgt gtgtgcaact tttgggggac aggaatcact
taaaatcata tttatttggc 2760tttttattta aaggattctg tcacaagtct
tattgaaaag tagatttttt aaaaaaaaaa 2820aatcttagtc cctgttatcc
agtaggggtg ggtatttggg tccgactgag acttggcctg 2880tgaccatcat
ggcagttgga gttctcatat agaggtgacc agtttgccat gtggatataa
2940tttagtagat atttgacagt ttgtgtaggt atttgaggga aaaaactcaa
tgtttggctt 3000ttttattatg gccactcgag tcaggatgct ctatttataa
agataaatgt aatatataaa 3060gggtgaggac tggctgtgca tcctgccctg
tcccgggttt gcgcgctgct acagagcttc 3120acgctctccg ctccacccct
tagcctggga acccaccgca ggtgtgagtt ctgtgagtca 3180ctgctaagag
acagagcaca ttttcaggcc agcaactatc cttgccagag ttttttcatt
3240atattttgaa ttatttattt tacaaaatgg gcgaagatat tgtctttagg
ataaggcaga 3300gaaacagatg ttgcagactt ccacggcacc cgggggagtg
gtgggtgtgg acacattggt 3360tcggcaatct gattctcctg aatttcccag
ccaggctctt gtggggaggc ctgtggatgg 3420ggggatttga actatttgga
aacaaatgat tctctatctc aggtgagaaa cctggtcaga 3480aacaaagggc
tggtcacctg atttaggcca gcaaccaggg aagctcttag aatcccaggc
3540ggacaccctt tctcaaaaga tatcccctaa gagtcctttc tgctttcttc
acagattgat 3600tttatgtaaa atgcagagtt ggactacacg atttcttccc
actccacaat ctgtcatcct 3660agtatagatc atggtggttt ccctcaagtt
tatgttctca tgccctcaat ctgtaaattt 3720ttgtctccag aaaaaccctc
ccaggcatcc cataccagca ccgttcctca tcactgtcca 3780tgcaccatgc
agccatatgg ggggccgtgc acaccccaaa ccctgagctt cacacttaaa
3840ctcatgggga gggcccttca gagcagagtc cacaggcggg tggtgctaca
tacacaagct 3900tagtgtacga gtgtaagata cactttaagc cagacaccta
attcttcagt tcactgccca 3960tgtgatactg attgccacat taaggttgaa
cagcatggct cagagtctag agaaagtgat 4020caccaagctg tctcatcacc
aaataggtag tcaaggcagc ctcatctccc caggtgaggg 4080gcgggtcccc
actttaggac aagaggcagc ttgccttcca ccagacgcca gcctcggcct
4140tccttcccga ctcactgtgg gtacccttct acactgacca gcaagctagg
ccgctggagg 4200aaagggaact cacccaactc taaattgtgc cgcttagact
tagctgtcag tgtgacttcc 4260tttcccaccc acccccagaa aaacagaaag
agcatctggg gagcgagtga aaattcctta 4320ggtgattcct aagatttcct
tgggtatctg gtttttgttt tcatatttga gtgtgtgcat 4380gtgtgcatga
ctttaatgac ttttttaatg gggtgggagg tggctggggt gctggggttg
4440aaggaagttt gggttgattt ttgtggtgtt ttgtttaata gagaattttt
tttttcctgt 4500tcccctgtca gctggtctga cagatttaag aactctcatt
cttaaaagac tttggactta 4560aattctagca ttttagacta ggactgttct
actgtgaaga aagttctgtc tcctttagcc 4620cggtttgttt ctccctgctc
aggtctagaa tcccaagcag tgttcttttc tggtgaacac 4680tgtgagccgc
agatgtgact ttttttttaa agtcatctct tcagcaatcc agaggttcct
4740tgacctcatt atttgtccta tctctccctt atagtcctaa gccaagacat
ttgacctttg 4800acatttgacc tttgcagtgt catgtgaggg cgtcagtata
gaggcctttg catctgggcc 4860tggcacccgc tctctgcctc tggaggctaa
accctgtctg gatttctctt gggatctaac 4920gtgggatctt ctggacagac
aaccgtgaca tcagcagtgc tggtgctgct gtgtgtggac 4980tgaacacctg
cactttgcag aggacacgct gcatgggccc cgcttgcggt tcattcaggc
5040ctgctgcagg agctctgaga acaagaaaga gtggacaccc gttcccctgc
atcatctgtc 5100ttgcgtgcta tttcagagtg gggaagtgat aaactatttg
ccttctggag ctctttgtga 5160aaaattaaaa aaaaacttag ctcaaaga
5188325182DNAHomo sapiens 32cacaactcgc ggctctaggg aaggccacag
acctcagctg tacgagcgga acctggacta 60gaggaacttt cctcaggact cagggcacac
agcccctcgg ccactactgt ctgcgagccc 120gaggcgcagg ggagcgccag
cggctgcagg tacgctgcgc acccgcactg gggcctgggc 180ggtgcagcgg
cggcgggagg gtcctgggaa ccgcagccgc cgaggccagt gtgtgagccg
240gccggccgcg gcaagccgca tcccccggcc gcccctcgca gccccttgct
ccccggcagc 300cgccgccgcc ctcaggcagc ccagccgggc gctcgcgcca
ggacttcccc gccgcccgcc 360tcggcccgga acatggctgc gcgccctgct
gccaccctcg cctggtcgct actgctcctc 420tcctcagccc tgctccgcga
aggctgccga gcgcgcttcg tcgccgagcg ggactcggag 480gacgacggag
aggagccggt ggttttcccg gagtcgcccc tgcagagccc cacggtgctc
540gtggcggtcc tcgcccgcaa cgcggcgcac acgctgccgc acttcctcgg
ctgcctggag 600cggctggact accccaagag caggatggcc atctgggcag
ccactgatca caatgtggat 660aatacaacag aaatattcag ggagtggttg
aaaaatgtac agagactcta tcactatgtg 720gagtggaggc ctatggatga
accagagtct taccctgatg aaattggacc aaagcactgg 780ccaacctccc
ggtttgccca tgtgatgaaa ctacgacagg cagcccttcg aactgcgagg
840gaaaaatggt cagactacat tctgttcata gatgttgaca atttcctgac
taatccacag 900accctcaatc tactgattgc agaaaacaaa actattgtgg
cccccatgct ggagtctcgg 960ggcctgtatt ctaatttctg gtgcggaatc
acccctaagg gcttctataa gaggacccca 1020gactacgttc agattcgaga
atggaagagg acaggctgct tccccgtccc catggtccac 1080tccaccttcc
taattgacct caggaaggag gcctcggaca agctgacttt ctacccccca
1140caccaggact acacctggac ctttgatgac atcattgtct ttgccttctc
cagcaggcaa 1200gcaggcatcc agatgtacct ctgcaacaga gagcactatg
gctacctgcc catccccctg 1260aagccccatc agacactgca ggaagacatc
gagaacctca tccatgtgca gattgaagca 1320atgattgacc gtcctccaat
ggaaccctcc cagtatgtct cagttgtccc taaatatcca 1380gacaagatgg
gatttgatga gattttcatg ataaacctca aacgcagaaa ggacaggcgg
1440gaccggatgc tgcgcacact gtatgaacag gagattgagg tcaagattgt
cgaggctgtg 1500gatggaaagg cactcaacac aagccagctg aaggcactga
atattgaaat gctgcctggc 1560tatcgagatc cctattcctc caggcctcta
acaaggggtg aaatcggctg ctttctcagc 1620cactactcag tctggaaaga
ggtaattgat cgagagctag agaagactct tgtaattgaa 1680gacgatgtgc
gttttgagca tcagtttaag aagaagctga tgaagctgat ggataacatt
1740gaccaggctc agctggactg ggaactgatt tatattggta ggaagaggat
gcaagtaaag 1800gagccagaga aagcagtgcc caatgtggca aacctggtcg
aagccgacta ttcctactgg 1860accctgggct acgtcatctc tctggaagga
gcacagaagc tggttggagc caatcctttt 1920gggaagatgc tgccagtgga
tgagtttctg ccagtcatgt acaacaagca tcccgtagcc 1980gagtacaagg
agtattatga atccagggac ctgaaagcct tctctgcaga acccttgctc
2040atctacccta cgcactacac aggccagccg gggtacctga gtgacacgga
gacctccacc 2100atctgggaca atgagacagt ggccaccgac tgggatagga
cacatgcctg gaagtcccgg 2160aagcaaagcc gcatctacag caatgccaag
aacacagagg ccctgccacc gccaacctcc 2220ctggacactg tgccttcaag
ggatgagcta tgaaggctcc ctgggagtgt ggcccacatc 2280agttcaacat
cctctggttt ttctaaaggg ctattcatct gtttgctcca gttttctgtt
2340ttgttttgtt cttagtggtc acagtcatct aaccaaagtg atctagtgtg
atagatcgaa 2400attaacatat ttttgaccat ggaaggaaat aaggaaattc
aacccaaatt tcccaagacg 2460gctgaaagac aggttttttg gaaactgtta
agataaactg taatccagac acctaattct 2520tcagttcact actcatgtga
tactgattcc cacattaagg ttgaacaaca tggctcagag 2580tcttgttcaa
gagaaagtga tcaccgagct gtcacatcag caaatatgta gtcaaggcag
2640ccaggccaac tagaccacac ttattggtct agtttgtccg ttttatatga
cattgaaaac 2700ttgtgtgtgc aacttttggg ggacaggaat cacttaaaat
catatttatt tggcttttta 2760tttaaaggat tctgtcacaa gtcttattga
aaagtagatt ttttaaaaaa aaaaaatctt 2820agtccctgtt atccagtagg
ggtgggtatt tgggtccgac tgagacttgg cctgtgacca 2880tcatggcagt
tggagttctc atatagaggt gaccagtttg ccatgtggat ataatttagt
2940agatatttga cagtttgtgt aggtatttga gggaaaaaac tcaatgtttg
gcttttttat 3000tatggccact cgagtcagga tgctctattt ataaagataa
atgtaatata taaagggtga 3060ggactggctg tgcatcctgc cctgtcccgg
gtttgcgcgc tgctacagag cttcacgctc 3120tccgctccac cccttagcct
gggaacccac cgcaggtgtg agttctgtga gtcactgcta 3180agagacagag
cacattttca ggccagcaac tatccttgcc agagtttttt cattatattt
3240tgaattattt attttacaaa atgggcgaag atattgtctt taggataagg
cagagaaaca 3300gatgttgcag acttccacgg cacccggggg agtggtgggt
gtggacacat tggttcggca 3360atctgattct cctgaatttc ccagccaggc
tcttgtgggg aggcctgtgg atggggggat 3420ttgaactatt tggaaacaaa
tgattctcta tctcaggtga gaaacctggt cagaaacaaa 3480gggctggtca
cctgatttag gccagcaacc agggaagctc ttagaatccc aggcggacac
3540cctttctcaa aagatatccc ctaagagtcc tttctgcttt cttcacagat
tgattttatg 3600taaaatgcag agttggacta cacgatttct tcccactcca
caatctgtca tcctagtata 3660gatcatggtg gtttccctca agtttatgtt
ctcatgccct caatctgtaa atttttgtct 3720ccagaaaaac cctcccaggc
atcccatacc agcaccgttc ctcatcactg tccatgcacc 3780atgcagccat
atggggggcc gtgcacaccc caaaccctga gcttcacact taaactcatg
3840gggagggccc ttcagagcag agtccacagg cgggtggtgc tacatacaca
agcttagtgt 3900acgagtgtaa gatacacttt aagccagaca cctaattctt
cagttcactg cccatgtgat 3960actgattgcc acattaaggt tgaacagcat
ggctcagagt ctagagaaag tgatcaccaa 4020gctgtctcat caccaaatag
gtagtcaagg cagcctcatc tccccaggtg aggggcgggt 4080ccccacttta
ggacaagagg cagcttgcct tccaccagac gccagcctcg gccttccttc
4140ccgactcact gtgggtaccc ttctacactg accagcaagc taggccgctg
gaggaaaggg 4200aactcaccca actctaaatt gtgccgctta gacttagctg
tcagtgtgac ttcctttccc 4260acccaccccc agaaaaacag aaagagcatc
tggggagcga gtgaaaattc cttaggtgat 4320tcctaagatt tccttgggta
tctggttttt gttttcatat ttgagtgtgt gcatgtgtgc 4380atgactttaa
tgactttttt aatggggtgg gaggtggctg gggtgctggg gttgaaggaa
4440gtttgggttg atttttgtgg tgttttgttt aatagagaat tttttttttc
ctgttcccct 4500gtcagctggt ctgacagatt taagaactct cattcttaaa
agactttgga cttaaattct 4560agcattttag actaggactg ttctactgtg
aagaaagttc tgtctccttt agcccggttt 4620gtttctccct gctcaggtct
agaatcccaa gcagtgttct tttctggtga acactgtgag 4680ccgcagatgt
gacttttttt ttaaagtcat ctcttcagca atccagaggt tccttgacct
4740cattatttgt cctatctctc ccttatagtc ctaagccaag acatttgacc
tttgacattt 4800gacctttgca gtgtcatgtg agggcgtcag tatagaggcc
tttgcatctg ggcctggcac 4860ccgctctctg cctctggagg
ctaaaccctg tctggatttc tcttgggatc taacgtggga 4920tcttctggac
agacaaccgt gacatcagca gtgctggtgc tgctgtgtgt ggactgaaca
4980cctgcacttt gcagaggaca cgctgcatgg gccccgcttg cggttcattc
aggcctgctg 5040caggagctct gagaacaaga aagagtggac acccgttccc
ctgcatcatc tgtcttgcgt 5100gctatttcag agtggggaag tgataaacta
tttgccttct ggagctcttt gtgaaaaatt 5160aaaaaaaaac ttagctcaaa ga
5182332325DNAHomo sapiens 33ttttcgtttt tttgttctat gtatgcctag
atttagcttt aataatctgt cagcgttaca 60gaattttcct gtttgctttc atatcaaatg
tggatttcat tcctcatttt cttgttattt 120cctgtaggtg gattacagtg
ttccattctg ctctccagaa tgggaattcc ttgcctccct 180tttgatcctt
tcatgccttt tttgttcatt ccaggccatg agtttgcttt ttcttcagag
240ggaaaaaatc aatttcctct gggctgaaga atagccgagt gtttttatta
gtccccaatg 300ctgtgtttta cttttttaat aaaggagcat tttttaggtc
tatagacata cctgaaacaa 360ccacttattt aattgcagtg tcaaactcag
catcagacac agttcaggtt ccttctcctc 420gcttgtcact cagcagtatg
ttttctgatt gttggctttg tccctctctt ggatcctcag 480gcatccagat
gtacctctgc aacagagagc actatggcta cctgcccatc cccctgaagc
540cccatcagac actgcaggaa gacatcgaga acctcatcca tgtgcagatt
gaagcaatga 600ttgaccgtcc tccaatggaa ccctcccagt atgtctcagt
tgtccctaaa tatccagaca 660agatgggatt tgatgagatt ttcatgataa
acctcaaacg cagaaaggac aggcgggacc 720ggatgctgcg cacactgtat
gaacaggaga ttgaggtcaa gattgtcgag gctgtggatg 780gaaaggcact
caacacaagc cagctgaagg cactgaatat tgaaatgctg cctggctatc
840gagatcccta ttcctccagg cctctaacaa ggggtgaaat cggctgcttt
ctcagccact 900actcagtctg gaaagaggta attgatcgag agctagagaa
gactcttgta attgaagacg 960atgtgcgttt tgagcatcag tttaagaaga
agctgatgaa gctgatggat aacattgacc 1020aggctcagct ggactgggaa
ctgatttata ttggtaggaa gaggatgcaa gtaaaggagc 1080cagagaaagc
agtgcccaat gtggcaaacc tggtcgaagc cgactattcc tactggaccc
1140tgggctacgt catctctctg gaaggagcac agaagctggt tggagccaat
ccttttggga 1200agatgctgcc agtggatgag tttctgccag tcatgtacaa
caagcatccc gtagccgagt 1260acaaggagta ttatgaatcc agggacctga
aagccttctc tgcagaaccc ttgctcatct 1320accctacgca ctacacaggc
cagccggggt acctgagtga cacggagacc tccaccatct 1380gggacaatga
gacagtggcc accgactggg ataggacaca tgcctggaag tcccggaagc
1440aaagccgcat ctacagcaat gccaagaaca cagaggccct gccaccgcca
acctccctgg 1500acactgtgcc ttcaagggat gagctatgaa ggctccctgg
gagtgtggcc cacatcagtt 1560caacatcctc tggtttttct aaagggctat
tcatctgttt gctccagttt tctgttttgt 1620tttgttctta gtggtcacag
tcatctaacc aaagtgatct agtgtgatag atcgaaatta 1680acatattttt
gaccatggaa ggaaataagg aaattcaacc caaatttccc aagacggctg
1740aaagacaggt tttttggaaa ctgttaagat aaactgtaat ccagacacct
aattcttcag 1800ttcactactc atgtgatact gattcccaca ttaaggttga
acaacatggc tcagagtctt 1860gttcaagaga aagtgatcac cgagctgtca
catcagcaaa tatgtagtca aggcagccag 1920gccaactaga ccacacttat
tggtctagtt tgtccgtttt atatgacatt gaaaacttgt 1980gtgtgcaact
tttgggggac aggaatcact taaaatcata tttatttggc tttttattta
2040aaggattctg tcacaagtct tattgaaaag tagatttttt aaaaaaaaaa
aatcttagtc 2100cctgttatcc agtaggggtg ggtatttggg tccgactgag
acttggcctg tgaccatcat 2160ggcagttgga gttctcatat agaggtgacc
agtttgccat gtggatataa tttagtagat 2220atttgacagt ttgtgtaggt
atttgaggga aaaaactcaa tgtttggctt ttttattatg 2280gccactcgag
tcaggatgct ctatttataa agataaatgt aatat 2325344723DNAHomo sapiens
34ttttctattc ttttactttc aacctatttt tgtcttttaa tctaaaggct gcctcttata
60gataccacat ggttgtatta atttttaaaa cttactatgc caaattcttc cttcattgca
120gtgttttatt catttgcctt cactacattt ttttccagat ccttaaggtc
agccagaggt 180gagagactag ggacttctca agtttttcct gggcgtgtgg
acagccctga atatgcctgg 240ggccttctag attctcaggg atattttgga
gcaaatcaaa ctccctatgg acctcttatt 300ccttggtttt tccttttaag
ttttttggtg ggcttcttgt tagcatcagc tgataatgct 360tctcaaacag
ctgtaatgtt aaacagttgt tgctcatggg ttttgacaaa tgacctgcag
420atagtgctgt tcacagaaaa tgaactctga gtcaggtcaa atcaagacaa
gctctgagaa 480tgtagctttt caaggagcta ctagacaggt tgaatagaga
ttattttctg ggggcaggtc 540tttaagggag tctcagtcca ttttgtccct
gccagtggct tctgggttgg tggggtttgc 600cgctctctta gttgtgaggc
tactggtttt caaggctttt gtagggctgg ggtgaacagg 660ttaggattag
ggcagctgtg aacttcacaa agctcactgt gttaacagag attcagcagt
720aggtcttgat caaacactcc ttggattatt gcagtacttt aactaatttc
cataggagtg 780tacttattaa atcttgtcaa aaaactgatg atgcttagaa
ctatgttggg cactatgagg 840gatacacaaa aatatgatac cctcttgtct
tcccagacct cataatacta gtgtgacaca 900ttagcctgga ggaacttcta
aggaataaag ggaagcctct ctgtctttgt ctgtgtgggg 960gaggggtgtt
tgcagaacta actctctctt tctctggcct ttgtggtggg gacctttgac
1020ttgttaatct ccagacgtgt tttatgtgat gcagggcact caacacaagc
cagctgaagg 1080cactgaatat tgaaatgctg cctggctatc gagatcccta
ttcctccagg cctctaacaa 1140ggggtgaaat cggctgcttt ctcagccact
actcagtctg gaaagaggta attgatcgag 1200agctagagaa gactcttgta
attgaagacg atgtgcgttt tgagcatcag tttaagaaga 1260agctgatgaa
gctgatggat aacattgacc aggctcagct ggactgggaa ctgatttata
1320ttggtaggaa gaggatgcaa gtaaaggagc cagagaaagc agtgcccaat
gtggcaaacc 1380tggtcgaagc cgactattcc tactggaccc tgggctacgt
catctctctg gaaggagcac 1440agaagctggt tggagccaat ccttttggga
agatgctgcc agtggatgag tttctgccag 1500tcatgtacaa caagcatccc
gtagccgagt acaaggagta ttatgaatcc agggacctga 1560aagccttctc
tgcagaaccc ttgctcatct accctacgca ctacacaggc cagccggggt
1620acctgagtga cacggagacc tccaccatct gggacaatga gacagtggcc
accgactggg 1680ataggacaca tgcctggaag tcccggaagc aaagccgcat
ctacagcaat gccaagaaca 1740cagaggccct gccaccgcca acctccctgg
acactgtgcc ttcaagggat gagctatgaa 1800ggctccctgg gagtgtggcc
cacatcagtt caacatcctc tggtttttct aaagggctat 1860tcatctgttt
gctccagttt tctgttttgt tttgttctta gtggtcacag tcatctaacc
1920aaagtgatct agtgtgatag atcgaaatta acatattttt gaccatggaa
ggaaataagg 1980aaattcaacc caaatttccc aagacggctg aaagacaggt
tttttggaaa ctgttaagat 2040aaactgtaat ccagacacct aattcttcag
ttcactactc atgtgatact gattcccaca 2100ttaaggttga acaacatggc
tcagagtctt gttcaagaga aagtgatcac cgagctgtca 2160catcagcaaa
tatgtagtca aggcagccag gccaactaga ccacacttat tggtctagtt
2220tgtccgtttt atatgacatt gaaaacttgt gtgtgcaact tttgggggac
aggaatcact 2280taaaatcata tttatttggc tttttattta aaggattctg
tcacaagtct tattgaaaag 2340tagatttttt aaaaaaaaaa aatcttagtc
cctgttatcc agtaggggtg ggtatttggg 2400tccgactgag acttggcctg
tgaccatcat ggcagttgga gttctcatat agaggtgacc 2460agtttgccat
gtggatataa tttagtagat atttgacagt ttgtgtaggt atttgaggga
2520aaaaactcaa tgtttggctt ttttattatg gccactcgag tcaggatgct
ctatttataa 2580agataaatgt aatatataaa gggtgaggac tggctgtgca
tcctgccctg tcccgggttt 2640gcgcgctgct acagagcttc acgctctccg
ctccacccct tagcctggga acccaccgca 2700ggtgtgagtt ctgtgagtca
ctgctaagag acagagcaca ttttcaggcc agcaactatc 2760cttgccagag
ttttttcatt atattttgaa ttatttattt tacaaaatgg gcgaagatat
2820tgtctttagg ataaggcaga gaaacagatg ttgcagactt ccacggcacc
cgggggagtg 2880gtgggtgtgg acacattggt tcggcaatct gattctcctg
aatttcccag ccaggctctt 2940gtggggaggc ctgtggatgg ggggatttga
actatttgga aacaaatgat tctctatctc 3000aggtgagaaa cctggtcaga
aacaaagggc tggtcacctg atttaggcca gcaaccaggg 3060aagctcttag
aatcccaggc ggacaccctt tctcaaaaga tatcccctaa gagtcctttc
3120tgctttcttc acagattgat tttatgtaaa atgcagagtt ggactacacg
atttcttccc 3180actccacaat ctgtcatcct agtatagatc atggtggttt
ccctcaagtt tatgttctca 3240tgccctcaat ctgtaaattt ttgtctccag
aaaaaccctc ccaggcatcc cataccagca 3300ccgttcctca tcactgtcca
tgcaccatgc agccatatgg ggggccgtgc acaccccaaa 3360ccctgagctt
cacacttaaa ctcatgggga gggcccttca gagcagagtc cacaggcggg
3420tggtgctaca tacacaagct tagtgtacga gtgtaagata cactttaagc
cagacaccta 3480attcttcagt tcactgccca tgtgatactg attgccacat
taaggttgaa cagcatggct 3540cagagtctag agaaagtgat caccaagctg
tctcatcacc aaataggtag tcaaggcagc 3600ctcatctccc caggtgaggg
gcgggtcccc actttaggac aagaggcagc ttgccttcca 3660ccagacgcca
gcctcggcct tccttcccga ctcactgtgg gtacccttct acactgacca
3720gcaagctagg ccgctggagg aaagggaact cacccaactc taaattgtgc
cgcttagact 3780tagctgtcag tgtgacttcc tttcccaccc acccccagaa
aaacagaaag agcatctggg 3840gagcgagtga aaattcctta ggtgattcct
aagatttcct tgggtatctg gtttttgttt 3900tcatatttga gtgtgtgcat
gtgtgcatga ctttaatgac ttttttaatg gggtgggagg 3960tggctggggt
gctggggttg aaggaagttt gggttgattt ttgtggtgtt ttgtttaata
4020gagaattttt tttttcctgt tcccctgtca gctggtctga cagatttaag
aactctcatt 4080cttaaaagac tttggactta aattctagca ttttagacta
ggactgttct actgtgaaga 4140aagttctgtc tcctttagcc cggtttgttt
ctccctgctc aggtctagaa tcccaagcag 4200tgttcttttc tggtgaacac
tgtgagccgc agatgtgact ttttttttaa agtcatctct 4260tcagcaatcc
agaggttcct tgacctcatt atttgtccta tctctccctt atagtcctaa
4320gccaagacat ttgacctttg acatttgacc tttgcagtgt catgtgaggg
cgtcagtata 4380gaggcctttg catctgggcc tggcacccgc tctctgcctc
tggaggctaa accctgtctg 4440gatttctctt gggatctaac gtgggatctt
ctggacagac aaccgtgaca tcagcagtgc 4500tggtgctgct gtgtgtggac
tgaacacctg cactttgcag aggacacgct gcatgggccc 4560cgcttgcggt
tcattcaggc ctgctgcagg agctctgaga acaagaaaga gtggacaccc
4620gttcccctgc atcatctgtc ttgcgtgcta tttcagagtg gggaagtgat
aaactatttg 4680ccttctggag ctctttgtga aaaattaaaa aaaaacttag ctc
472335542PRTHomo sapiens 35Met Ala Gln Glu Ile Asp Leu Ser Ala Leu
Lys Glu Leu Glu Arg Glu 1 5 10 15 Ala Ile Leu Gln Val Leu Tyr Arg
Asp Gln Ala Val Gln Asn Thr Glu 20 25 30 Glu Glu Arg Thr Arg Lys
Leu Lys Thr His Leu Gln His Leu Arg Trp 35 40 45 Lys Gly Ala Lys
Asn Thr Asp Trp Glu His Lys Glu Lys Cys Cys Ala 50 55 60 Arg Cys
Gln Gln Val Leu Gly Phe Leu Leu His Arg Gly Ala Val Cys 65 70 75 80
Arg Gly Cys Ser His Arg Val Cys Ala Gln Cys Arg Val Phe Leu Arg 85
90 95 Gly Thr His Ala Trp Lys Cys Thr Val Cys Phe Glu Asp Arg Asn
Val 100 105 110 Lys Ile Lys Thr Gly Glu Trp Phe Tyr Glu Glu Arg Ala
Lys Lys Phe 115 120 125 Pro Thr Gly Gly Lys His Glu Thr Val Gly Gly
Gln Leu Leu Gln Ser 130 135 140 Tyr Gln Lys Leu Ser Lys Ile Ser Val
Val Pro Pro Thr Pro Pro Pro 145 150 155 160 Val Ser Glu Ser Gln Cys
Ser Arg Ser Pro Gly Arg Lys Val Ser Ala 165 170 175 Pro Asp Ile Leu
Lys Pro Leu Asn Gln Glu Asp Pro Lys Cys Ser Thr 180 185 190 Asn Pro
Ile Leu Lys Gln Gln Asn Leu Pro Ser Ser Pro Ala Pro Ser 195 200 205
Thr Ile Phe Ser Gly Gly Phe Arg His Gly Ser Leu Ile Ser Ile Asp 210
215 220 Ser Thr Cys Thr Glu Met Gly Asn Phe Asp Asn Ala Asn Val Thr
Gly 225 230 235 240 Glu Ile Glu Phe Ala Ile His Tyr Cys Phe Lys Thr
His Ser Leu Glu 245 250 255 Ile Cys Ile Lys Ala Cys Lys Asn Leu Ala
Tyr Gly Glu Glu Lys Lys 260 265 270 Lys Lys Cys Asn Pro Tyr Val Lys
Thr Tyr Leu Leu Pro Asp Arg Ser 275 280 285 Ser Gln Gly Lys Arg Lys
Thr Gly Val Gln Arg Asn Thr Val Asp Pro 290 295 300 Thr Phe Gln Glu
Thr Leu Lys Tyr Gln Val Ala Pro Ala Gln Leu Val 305 310 315 320 Thr
Arg Gln Leu Gln Val Ser Val Trp His Leu Gly Thr Leu Ala Arg 325 330
335 Arg Val Phe Leu Gly Glu Val Ile Ile Pro Leu Ala Thr Trp Asp Phe
340 345 350 Glu Asp Ser Thr Thr Gln Ser Phe Arg Trp His Pro Leu Arg
Ala Lys 355 360 365 Ala Glu Lys Tyr Glu Asp Ser Val Pro Gln Ser Asn
Gly Glu Leu Thr 370 375 380 Val Arg Ala Lys Leu Val Leu Pro Ser Arg
Pro Arg Lys Leu Gln Glu 385 390 395 400 Ala Gln Glu Gly Thr Asp Gln
Pro Ser Leu His Gly Gln Leu Cys Leu 405 410 415 Val Val Leu Gly Ala
Lys Asn Leu Pro Val Arg Pro Asp Gly Thr Leu 420 425 430 Asn Ser Phe
Val Lys Gly Cys Leu Thr Leu Pro Asp Gln Gln Lys Leu 435 440 445 Arg
Leu Lys Ser Pro Val Leu Arg Lys Gln Ala Cys Pro Gln Trp Lys 450 455
460 His Ser Phe Val Phe Ser Gly Val Thr Pro Ala Gln Leu Arg Gln Ser
465 470 475 480 Ser Leu Glu Leu Thr Val Trp Asp Gln Ala Leu Phe Gly
Met Asn Asp 485 490 495 Arg Leu Leu Gly Gly Thr Arg Leu Gly Ser Lys
Gly Asp Thr Ala Val 500 505 510 Gly Gly Asp Ala Cys Ser Leu Ser Lys
Leu Gln Trp Gln Lys Val Leu 515 520 525 Ser Ser Pro Asn Leu Trp Thr
Asp Met Thr Leu Val Leu His 530 535 540 36610PRTHomo sapiens 36Met
Ala Gln Glu Ile Asp Leu Ser Ala Leu Lys Glu Leu Glu Arg Glu 1 5 10
15 Ala Ile Leu Gln Val Leu Tyr Arg Asp Gln Ala Val Gln Asn Thr Glu
20 25 30 Glu Glu Arg Thr Arg Lys Leu Lys Thr His Leu Gln His Leu
Arg Trp 35 40 45 Lys Gly Ala Lys Asn Thr Asp Trp Glu His Lys Glu
Lys Cys Cys Ala 50 55 60 Arg Cys Gln Gln Val Leu Gly Phe Leu Leu
His Arg Gly Ala Val Cys 65 70 75 80 Arg Gly Cys Ser His Arg Val Cys
Ala Gln Cys Arg Val Phe Leu Arg 85 90 95 Gly Thr His Ala Trp Lys
Cys Thr Val Cys Phe Glu Asp Arg Asn Val 100 105 110 Lys Ile Lys Thr
Gly Glu Trp Phe Tyr Glu Glu Arg Ala Lys Lys Phe 115 120 125 Pro Thr
Gly Gly Lys His Glu Thr Val Gly Gly Gln Leu Leu Gln Ser 130 135 140
Tyr Gln Lys Leu Ser Lys Ile Ser Val Val Pro Pro Thr Pro Pro Pro 145
150 155 160 Val Ser Glu Ser Gln Cys Ser Arg Ser Pro Gly Arg Leu Gln
Glu Phe 165 170 175 Gly Gln Phe Arg Gly Phe Asn Lys Ser Val Glu Asn
Leu Phe Leu Ser 180 185 190 Leu Ala Thr His Val Lys Lys Leu Ser Lys
Ser Gln Asn Asp Met Thr 195 200 205 Ser Glu Lys His Leu Leu Ala Thr
Gly Pro Arg Gln Cys Val Gly Gln 210 215 220 Thr Glu Arg Arg Ser Gln
Ser Asp Thr Ala Val Asn Val Thr Thr Arg 225 230 235 240 Lys Val Ser
Ala Pro Asp Ile Leu Lys Pro Leu Asn Gln Glu Asp Pro 245 250 255 Lys
Cys Ser Thr Asn Pro Ile Leu Lys Gln Gln Asn Leu Pro Ser Ser 260 265
270 Pro Ala Pro Ser Thr Ile Phe Ser Gly Gly Phe Arg His Gly Ser Leu
275 280 285 Ile Ser Ile Asp Ser Thr Cys Thr Glu Met Gly Asn Phe Asp
Asn Ala 290 295 300 Asn Val Thr Gly Glu Ile Glu Phe Ala Ile His Tyr
Cys Phe Lys Thr 305 310 315 320 His Ser Leu Glu Ile Cys Ile Lys Ala
Cys Lys Asn Leu Ala Tyr Gly 325 330 335 Glu Glu Lys Lys Lys Lys Cys
Asn Pro Tyr Val Lys Thr Tyr Leu Leu 340 345 350 Pro Asp Arg Ser Ser
Gln Gly Lys Arg Lys Thr Gly Val Gln Arg Asn 355 360 365 Thr Val Asp
Pro Thr Phe Gln Glu Thr Leu Lys Tyr Gln Val Ala Pro 370 375 380 Ala
Gln Leu Val Thr Arg Gln Leu Gln Val Ser Val Trp His Leu Gly 385 390
395 400 Thr Leu Ala Arg Arg Val Phe Leu Gly Glu Val Ile Ile Pro Leu
Ala 405 410 415 Thr Trp Asp Phe Glu Asp Ser Thr Thr Gln Ser Phe Arg
Trp His Pro 420 425 430 Leu Arg Ala Lys Ala Glu Lys Tyr Glu Asp Ser
Val Pro Gln Ser Asn 435 440 445 Gly Glu Leu Thr Val Arg Ala Lys Leu
Val Leu Pro Ser Arg Pro Arg 450 455 460 Lys Leu Gln Glu Ala Gln Glu
Gly Thr Asp Gln Pro Ser Leu His Gly 465 470 475 480 Gln Leu Cys Leu
Val Val Leu Gly Ala Lys Asn Leu Pro Val Arg Pro 485 490 495 Asp Gly
Thr Leu Asn Ser Phe Val Lys Gly Cys Leu Thr Leu Pro Asp 500 505 510
Gln Gln Lys Leu Arg Leu Lys Ser Pro Val Leu Arg Lys Gln Ala Cys 515
520 525 Pro Gln Trp Lys His Ser Phe Val Phe Ser Gly Val Thr Pro Ala
Gln 530 535 540 Leu Arg Gln Ser Ser Leu Glu Leu Thr Val Trp Asp Gln
Ala Leu Phe 545 550 555 560 Gly Met Asn Asp Arg Leu Leu Gly Gly Thr
Arg Leu Gly Ser Lys Gly 565 570 575 Asp Thr Ala Val Gly Gly Asp Ala
Cys Ser Leu Ser Lys Leu Gln Trp 580 585 590 Gln Lys Val Leu Ser Ser
Pro Asn Leu Trp Thr Asp Met Thr Leu Val 595 600 605 Leu His 610
37542PRTHomo sapiens 37Met Ala Gln Glu Ile Asp Leu Ser
Ala Leu Lys Glu Leu Glu Arg Glu 1 5 10 15 Ala Ile Leu Gln Val Leu
Tyr Arg Asp Gln Ala Val Gln Asn Thr Glu 20 25 30 Glu Glu Arg Thr
Arg Lys Leu Lys Thr His Leu Gln His Leu Arg Trp 35 40 45 Lys Gly
Ala Lys Asn Thr Asp Trp Glu His Lys Glu Lys Cys Cys Ala 50 55 60
Arg Cys Gln Gln Val Leu Gly Phe Leu Leu His Arg Gly Ala Val Cys 65
70 75 80 Arg Gly Cys Ser His Arg Val Cys Ala Gln Cys Arg Val Phe
Leu Arg 85 90 95 Gly Thr His Ala Trp Lys Cys Thr Val Cys Phe Glu
Asp Arg Asn Val 100 105 110 Lys Ile Lys Thr Gly Glu Trp Phe Tyr Glu
Glu Arg Ala Lys Lys Phe 115 120 125 Pro Thr Gly Gly Lys His Glu Thr
Val Gly Gly Gln Leu Leu Gln Ser 130 135 140 Tyr Gln Lys Leu Ser Lys
Ile Ser Val Val Pro Pro Thr Pro Pro Pro 145 150 155 160 Val Ser Glu
Ser Gln Cys Ser Arg Ser Pro Gly Arg Lys Val Ser Ala 165 170 175 Pro
Asp Ile Leu Lys Pro Leu Asn Gln Glu Asp Pro Lys Cys Ser Thr 180 185
190 Asn Pro Ile Leu Lys Gln Gln Asn Leu Pro Ser Ser Pro Ala Pro Ser
195 200 205 Thr Ile Phe Ser Gly Gly Phe Arg His Gly Ser Leu Ile Ser
Ile Asp 210 215 220 Ser Thr Cys Thr Glu Met Gly Asn Phe Asp Asn Ala
Asn Val Thr Gly 225 230 235 240 Glu Ile Glu Phe Ala Ile His Tyr Cys
Phe Lys Thr His Ser Leu Glu 245 250 255 Ile Cys Ile Lys Ala Cys Lys
Asn Leu Ala Tyr Gly Glu Glu Lys Lys 260 265 270 Lys Lys Cys Asn Pro
Tyr Val Lys Thr Tyr Leu Leu Pro Asp Arg Ser 275 280 285 Ser Gln Gly
Lys Arg Lys Thr Gly Val Gln Arg Asn Thr Val Asp Pro 290 295 300 Thr
Phe Gln Glu Thr Leu Lys Tyr Gln Val Ala Pro Ala Gln Leu Val 305 310
315 320 Thr Arg Gln Leu Gln Val Ser Val Trp His Leu Gly Thr Leu Ala
Arg 325 330 335 Arg Val Phe Leu Gly Glu Val Ile Ile Pro Leu Ala Thr
Trp Asp Phe 340 345 350 Glu Asp Ser Thr Thr Gln Ser Phe Arg Trp His
Pro Leu Arg Ala Lys 355 360 365 Ala Glu Lys Tyr Glu Asp Ser Val Pro
Gln Ser Asn Gly Glu Leu Thr 370 375 380 Val Arg Ala Lys Leu Val Leu
Pro Ser Arg Pro Arg Lys Leu Gln Glu 385 390 395 400 Ala Gln Glu Gly
Thr Asp Gln Pro Ser Leu His Gly Gln Leu Cys Leu 405 410 415 Val Val
Leu Gly Ala Lys Asn Leu Pro Val Arg Pro Asp Gly Thr Leu 420 425 430
Asn Ser Phe Val Lys Gly Cys Leu Thr Leu Pro Asp Gln Gln Lys Leu 435
440 445 Arg Leu Lys Ser Pro Val Leu Arg Lys Gln Ala Cys Pro Gln Trp
Lys 450 455 460 His Ser Phe Val Phe Ser Gly Val Thr Pro Ala Gln Leu
Arg Gln Ser 465 470 475 480 Ser Leu Glu Leu Thr Val Trp Asp Gln Ala
Leu Phe Gly Met Asn Asp 485 490 495 Arg Leu Leu Gly Gly Thr Arg Leu
Gly Ser Lys Gly Asp Thr Ala Val 500 505 510 Gly Gly Asp Ala Cys Ser
Leu Ser Lys Leu Gln Trp Gln Lys Val Leu 515 520 525 Ser Ser Pro Asn
Leu Trp Thr Asp Met Thr Leu Val Leu His 530 535 540 382277DNAHomo
sapiens 38aagaagaatt gacttccttc tctgcagagc cggctctggt ctcttctctt
gaagcagatg 60cgaaggctcc ccgaatgaga aagaatactc ggaatcagcg gtgaattgca
gtgatctttc 120agagaaagcg cctgttcaac tttgtcctct ctcagagact
cagagccttg gggcactgag 180ggatgccagt tctgcctgtt catctggaac
ctggatctaa ggagggaaga ggcgttgccc 240ctgctggcat agtcaggtat
tgaacgggct gagcttttca tgatggttcc tgctgacctg 300gaaacatctt
aaatggaagg gcgtgagcgc ttggtccatg cagtgaagct cttccaacct
360gggtcaacga aaacggagaa gaaatggccc aagaaataga tctgagtgct
ctcaaggagt 420tagaacgcga ggccattctc caggtcctgt accgagacca
ggcggttcaa aacacagagg 480aggagaggac acggaaactg aaaacacacc
tgcagcatct ccggtggaaa ggagcgaaga 540acacggactg ggagcacaaa
gagaagtgct gtgcgcgctg ccagcaggtg ctggggttcc 600tgctgcaccg
gggcgccgtg tgccggggct gcagccaccg cgtgtgtgcc cagtgccgag
660tgttcctgag ggggacccat gcctggaagt gcacggtgtg cttcgaggac
aggaatgtca 720aaataaaaac tggagaatgg ttctatgagg aacgagccaa
gaaatttcca actggaggca 780aacatgagac agttggaggg cagctcttgc
aatcttatca gaagctgagc aaaatttctg 840tggttcctcc tactccacct
cctgtcagcg agagccagtg cagccgcagt cctggcagga 900aggtcagtgc
accagatatt ctgaaacctc tcaatcaaga ggatcccaaa tgctctacta
960accctatttt gaagcaacag aatctcccat ccagtccggc acccagtacc
atattctctg 1020gaggttttag acacggaagt ttaattagca ttgacagcac
ctgtacagag atgggcaatt 1080ttgacaatgc taatgtcact ggagaaatag
aatttgccat tcattattgc ttcaaaaccc 1140attctttaga aatatgcatc
aaggcctgta agaaccttgc ctatggagaa gaaaagaaga 1200aaaagtgcaa
tccgtatgtg aagacctacc tgttgcccga cagatcctcc cagggaaagc
1260gcaagactgg agtccaaagg aacaccgtgg acccgacctt tcaggagacc
ttgaagtatc 1320aggtggcccc tgcccagctg gtgacccggc agctgcaggt
ctcggtgtgg catctgggca 1380cgctggcccg gagagtgttt cttggagaag
tgatcattcc tctggccacg tgggactttg 1440aagacagcac aacacagtcc
ttccgctggc atccgctccg ggccaaggcg gagaaatacg 1500aagacagcgt
tcctcagagt aatggagagc tcacagtccg ggctaagctg gttctccctt
1560cacggcccag aaaactccaa gaggctcaag aagggacaga tcagccatca
cttcatggtc 1620aactttgttt ggtagtgcta ggagccaaga atttacctgt
gcggccagat ggcaccttga 1680actcatttgt taagggctgt ctcactctgc
cagaccaaca aaaactgaga ctgaagtcgc 1740cagtcctgag gaagcaggct
tgcccccagt ggaaacactc atttgtcttc agtggcgtaa 1800ccccagctca
gctgaggcag tcaagcttgg agttaactgt ctgggatcag gccctctttg
1860gaatgaacga ccgcttgctt ggaggaacca gacttggttc aaagggagac
acagctgttg 1920gcggggatgc atgctcacta tcgaagctcc agtggcagaa
agtcctttcc agccccaatc 1980tatggacaga catgactctt gtcctgcact
gacatgaagg cctcaaggtt ccaggttgca 2040gcaggcgtga ggcactgtgc
gtctgcagag gggctacgaa ccaggtgcag ggtcccagct 2100ggagacccct
ttgaccttga gcagtctcca tctgcggccc tgtcccatgg cttaaccgcc
2160tattggtatc tgtgtatatt tacgttaaac acaattatgt tacctaagcc
tctggtgggt 2220tatctcctct ttgagatgta gaaaatggcc agattttaat
aaacgttgtt acccatg 2277392131DNAHomo sapiens 39aggagggaag
aggcgttgcc cctgctggca tagtcaggta ccagcccagc caggtattga 60acgggctgag
cttttcatga tggttcctgc tgacctggaa acatcttaaa tggaagggcg
120tgagcgcttg gtccatgcag tgaagctctt ccaacctggg tcaacgaaaa
cggagaagaa 180atggcccaag aaatagatct gagtgctctc aaggagttag
aacgcgaggc cattctccag 240gtcctgtacc gagaccaggc ggttcaaaac
acagaggagg agaggacacg gaaactgaaa 300acacacctgc agcatctccg
gtggaaagga gcgaagaaca cggactggga gcacaaagag 360aagtgctgtg
cgcgctgcca gcaggtgctg gggttcctgc tgcaccgggg cgccgtgtgc
420cggggctgca gccaccgcgt gtgtgcccag tgccgagtgt tcctgagggg
gacccatgcc 480tggaagtgca cggtgtgctt cgaggacagg aatgtcaaaa
taaaaactgg agaatggttc 540tatgaggaac gagccaagaa atttccaact
ggaggcaaac atgagacagt tggagggcag 600ctcttgcaat cttatcagaa
gctgagcaaa atttctgtgg ttcctcctac tccacctcct 660gtcagcgaga
gccagtgcag ccgcagtcct ggcaggttac aggaatttgg tcagtttaga
720ggatttaata agtccgtgga aaatttgttt ctgtctcttg ctacccacgt
gaaaaagctc 780tccaaatccc agaatgatat gacttctgag aagcatcttc
tcgccacggg ccccaggcag 840tgtgtgggac agacagagag acggagccag
tctgacactg cggtcaacgt caccaccagg 900aaggtcagtg caccagatat
tctgaaacct ctcaatcaag aggatcccaa atgctctact 960aaccctattt
tgaagcaaca gaatctccca tccagtccgg cacccagtac catattctct
1020ggaggtttta gacacggaag tttaattagc attgacagca cctgtacaga
gatgggcaat 1080tttgacaatg ctaatgtcac tggagaaata gaatttgcca
ttcattattg cttcaaaacc 1140cattctttag aaatatgcat caaggcctgt
aagaaccttg cctatggaga agaaaagaag 1200aaaaagtgca atccgtatgt
gaagacctac ctgttgcccg acagatcctc ccagggaaag 1260cgcaagactg
gagtccaaag gaacaccgtg gacccgacct ttcaggagac cttgaagtat
1320caggtggccc ctgcccagct ggtgacccgg cagctgcagg tctcggtgtg
gcatctgggc 1380acgctggccc ggagagtgtt tcttggagaa gtgatcattc
ctctggccac gtgggacttt 1440gaagacagca caacacagtc cttccgctgg
catccgctcc gggccaaggc ggagaaatac 1500gaagacagcg ttcctcagag
taatggagag ctcacagtcc gggctaagct ggttctccct 1560tcacggccca
gaaaactcca agaggctcaa gaagggacag atcagccatc acttcatggt
1620caactttgtt tggtagtgct aggagccaag aatttacctg tgcggccaga
tggcaccttg 1680aactcatttg ttaagggctg tctcactctg ccagaccaac
aaaaactgag actgaagtcg 1740ccagtcctga ggaagcaggc ttgcccccag
tggaaacact catttgtctt cagtggcgta 1800accccagctc agctgaggca
gtcaagcttg gagttaactg tctgggatca ggccctcttt 1860ggaatgaacg
accgcttgct tggaggaacc agacttggtt caaagggaga cacagctgtt
1920ggcggggatg catgctcact atcgaagctc cagtggcaga aagtcctttc
cagccccaat 1980ctatggacag acatgactct tgtcctgcac tgacatgaag
gcctcaaggt tccaggttgc 2040agcaggcgtg aggcactgtg cgtctgcaga
ggggctacga accaggtgca gggtcccagc 2100tggagacccc tttgaccttg
agcagtctcc a 2131401927DNAHomo sapiens 40aggagggaag aggcgttgcc
cctgctggca tagtcaggta ccagcccagc caggtattga 60acgggctgag cttttcatga
tggttcctgc tgacctggaa acatcttaaa tggaagggcg 120tgagcgcttg
gtccatgcag tgaagctctt ccaacctggg tcaacgaaaa cggagaagaa
180atggcccaag aaatagatct gagtgctctc aaggagttag aacgcgaggc
cattctccag 240gtcctgtacc gagaccaggc ggttcaaaac acagaggagg
agaggacacg gaaactgaaa 300acacacctgc agcatctccg gtggaaagga
gcgaagaaca cggactggga gcacaaagag 360aagtgctgtg cgcgctgcca
gcaggtgctg gggttcctgc tgcaccgggg cgccgtgtgc 420cggggctgca
gccaccgcgt gtgtgcccag tgccgagtgt tcctgagggg gacccatgcc
480tggaagtgca cggtgtgctt cgaggacagg aatgtcaaaa taaaaactgg
agaatggttc 540tatgaggaac gagccaagaa atttccaact ggaggcaaac
atgagacagt tggagggcag 600ctcttgcaat cttatcagaa gctgagcaaa
atttctgtgg ttcctcctac tccacctcct 660gtcagcgaga gccagtgcag
ccgcagtcct ggcaggaagg tcagtgcacc agatattctg 720aaacctctca
atcaagagga tcccaaatgc tctactaacc ctattttgaa gcaacagaat
780ctcccatcca gtccggcacc cagtaccata ttctctggag gttttagaca
cggaagttta 840attagcattg acagcacctg tacagagatg ggcaattttg
acaatgctaa tgtcactgga 900gaaatagaat ttgccattca ttattgcttc
aaaacccatt ctttagaaat atgcatcaag 960gcctgtaaga accttgccta
tggagaagaa aagaagaaaa agtgcaatcc gtatgtgaag 1020acctacctgt
tgcccgacag atcctcccag ggaaagcgca agactggagt ccaaaggaac
1080accgtggacc cgacctttca ggagaccttg aagtatcagg tggcccctgc
ccagctggtg 1140acccggcagc tgcaggtctc ggtgtggcat ctgggcacgc
tggcccggag agtgtttctt 1200ggagaagtga tcattcctct ggccacgtgg
gactttgaag acagcacaac acagtccttc 1260cgctggcatc cgctccgggc
caaggcggag aaatacgaag acagcgttcc tcagagtaat 1320ggagagctca
cagtccgggc taagctggtt ctcccttcac ggcccagaaa actccaagag
1380gctcaagaag ggacagatca gccatcactt catggtcaac tttgtttggt
agtgctagga 1440gccaagaatt tacctgtgcg gccagatggc accttgaact
catttgttaa gggctgtctc 1500actctgccag accaacaaaa actgagactg
aagtcgccag tcctgaggaa gcaggcttgc 1560ccccagtgga aacactcatt
tgtcttcagt ggcgtaaccc cagctcagct gaggcagtca 1620agcttggagt
taactgtctg ggatcaggcc ctctttggaa tgaacgaccg cttgcttgga
1680ggaaccagac ttggttcaaa gggagacaca gctgttggcg gggatgcatg
ctcactatcg 1740aagctccagt ggcagaaagt cctttccagc cccaatctat
ggacagacat gactcttgtc 1800ctgcactgac atgaaggcct caaggttcca
ggttgcagca ggcgtgaggc actgtgcgtc 1860tgcagagggg ctacgaacca
ggtgcagggt cccagctgga gacccctttg accttgagca 1920gtctcca
192741406PRTHomo sapiens 41Met Asp Cys Arg Thr Lys Ala Asn Pro Asp
Arg Thr Phe Asp Leu Val 1 5 10 15 Leu Lys Val Lys Cys His Ala Ser
Glu Asn Glu Asp Pro Val Val Leu 20 25 30 Trp Lys Phe Pro Glu Asp
Phe Gly Asp Gln Glu Ile Leu Gln Ser Val 35 40 45 Pro Lys Phe Cys
Phe Pro Phe Asp Val Glu Arg Val Ser Gln Asn Gln 50 55 60 Val Gly
Gln His Phe Thr Phe Val Leu Thr Asp Ile Glu Ser Lys Gln 65 70 75 80
Arg Phe Gly Phe Cys Arg Leu Thr Ser Gly Gly Thr Ile Cys Leu Cys 85
90 95 Ile Leu Ser Tyr Leu Pro Trp Phe Glu Val Tyr Tyr Lys Leu Leu
Asn 100 105 110 Thr Leu Ala Asp Tyr Leu Ala Lys Glu Leu Glu Asn Asp
Leu Asn Glu 115 120 125 Thr Leu Arg Ser Leu Tyr Asn His Pro Val Pro
Lys Ala Asn Thr Pro 130 135 140 Val Asn Leu Ser Val His Ser Tyr Phe
Ile Ala Pro Asp Val Thr Gly 145 150 155 160 Leu Pro Thr Ile Pro Glu
Ser Arg Asn Leu Thr Glu Tyr Phe Val Ala 165 170 175 Val Asp Val Asn
Asn Met Leu Gln Leu Tyr Ala Ser Met Leu His Glu 180 185 190 Arg Arg
Ile Val Ile Ile Ser Ser Lys Leu Ser Thr Leu Thr Ala Cys 195 200 205
Ile His Gly Ser Ala Ala Leu Leu Tyr Pro Met Tyr Trp Gln His Ile 210
215 220 Tyr Ile Pro Val Leu Pro Pro His Leu Leu Asp Tyr Cys Cys Ala
Pro 225 230 235 240 Met Pro Tyr Leu Ile Gly Ile His Ser Ser Leu Ile
Glu Arg Val Lys 245 250 255 Asn Lys Ser Leu Glu Asp Val Val Met Leu
Asn Val Asp Thr Asn Thr 260 265 270 Leu Glu Ser Pro Phe Ser Asp Leu
Asn Asn Leu Pro Ser Asp Val Val 275 280 285 Ser Ala Leu Lys Asn Lys
Leu Lys Lys Gln Ser Thr Ala Thr Gly Asp 290 295 300 Gly Val Ala Arg
Ala Phe Leu Arg Ala Gln Ala Ala Leu Phe Gly Ser 305 310 315 320 Tyr
Arg Asp Ala Leu Arg Tyr Lys Pro Gly Glu Pro Ile Thr Phe Cys 325 330
335 Glu Glu Ser Phe Val Lys His Arg Ser Ser Val Met Lys Gln Phe Leu
340 345 350 Glu Thr Ala Ile Asn Leu Gln Leu Phe Lys Gln Phe Ile Asp
Gly Arg 355 360 365 Leu Ala Lys Leu Asn Ala Gly Arg Gly Phe Ser Asp
Val Phe Glu Glu 370 375 380 Glu Ile Thr Ser Gly Gly Phe Cys Gly Gly
Lys Asp Lys Leu Gln Tyr 385 390 395 400 Asp Tyr Pro Phe Ser Gln 405
42426PRTHomo sapiens 42Met Asp Cys Arg Thr Lys Ala Asn Pro Asp Arg
Thr Phe Asp Leu Val 1 5 10 15 Leu Lys Val Lys Cys His Ala Ser Glu
Asn Glu Asp Pro Val Val Leu 20 25 30 Trp Lys Phe Pro Glu Asp Phe
Gly Asp Gln Glu Ile Leu Gln Ser Val 35 40 45 Pro Lys Phe Cys Phe
Pro Phe Asp Val Glu Arg Val Ser Gln Asn Gln 50 55 60 Val Gly Gln
His Phe Thr Phe Val Leu Thr Asp Ile Glu Ser Lys Gln 65 70 75 80 Arg
Phe Gly Phe Cys Arg Leu Thr Ser Gly Gly Thr Ile Cys Leu Cys 85 90
95 Ile Leu Ser Tyr Leu Pro Trp Phe Glu Val Tyr Tyr Lys Leu Leu Asn
100 105 110 Thr Leu Ala Asp Tyr Leu Ala Lys Glu Leu Glu Asn Asp Leu
Asn Glu 115 120 125 Thr Leu Arg Ser Leu Tyr Asn His Pro Val Pro Lys
Ala Asn Thr Pro 130 135 140 Val Asn Leu Ser Val Asn Gln Glu Ile Phe
Ile Ala Cys Glu Gln Val 145 150 155 160 Leu Lys Asp Gln Pro Ala Leu
Val Pro His Ser Tyr Phe Ile Ala Pro 165 170 175 Asp Val Thr Gly Leu
Pro Thr Ile Pro Glu Ser Arg Asn Leu Thr Glu 180 185 190 Tyr Phe Val
Ala Val Asp Val Asn Asn Met Leu Gln Leu Tyr Ala Ser 195 200 205 Met
Leu His Glu Arg Arg Ile Val Ile Ile Ser Ser Lys Leu Ser Thr 210 215
220 Leu Thr Ala Cys Ile His Gly Ser Ala Ala Leu Leu Tyr Pro Met Tyr
225 230 235 240 Trp Gln His Ile Tyr Ile Pro Val Leu Pro Pro His Leu
Leu Asp Tyr 245 250 255 Cys Cys Ala Pro Met Pro Tyr Leu Ile Gly Ile
His Ser Ser Leu Ile 260 265 270 Glu Arg Val Lys Asn Lys Ser Leu Glu
Asp Val Val Met Leu Asn Val 275 280 285 Asp Thr Asn Thr Leu Glu Ser
Pro Phe Ser Asp Leu Asn Asn Leu Pro 290 295 300 Ser Asp Val Val Ser
Ala Leu Lys Asn Lys Leu Lys Lys Gln Ser Thr 305 310 315 320 Ala Thr
Gly Asp Gly Val Ala Arg Ala Phe Leu Arg Ala Gln Ala Ala 325 330 335
Leu Phe Gly Ser Tyr Arg Asp Ala Leu Arg Tyr Lys Pro Gly Glu Pro 340
345 350 Ile Thr Phe Cys Glu Glu Ser Phe Val Lys His Arg Ser Ser Val
Met 355
360 365 Lys Gln Phe Leu Glu Thr Ala Ile Asn Leu Gln Leu Phe Lys Gln
Phe 370 375 380 Ile Asp Gly Arg Leu Ala Lys Leu Asn Ala Gly Arg Gly
Phe Ser Asp 385 390 395 400 Val Phe Glu Glu Glu Ile Thr Ser Gly Gly
Phe Cys Gly Gly Lys Asp 405 410 415 Lys Leu Gln Tyr Asp Tyr Pro Phe
Ser Gln 420 425 43426PRTHomo sapiens 43Ile Glu Thr Lys Thr Arg Ala
Asn Pro Asp Arg Thr Phe Asp Leu Val 1 5 10 15 Leu Lys Val Lys Cys
His Ala Ser Glu Asn Glu Asp Pro Val Val Leu 20 25 30 Trp Lys Phe
Pro Glu Asp Phe Gly Asp Gln Glu Ile Leu Gln Ser Val 35 40 45 Pro
Lys Phe Cys Phe Pro Phe Asp Val Glu Arg Val Ser Gln Asn Gln 50 55
60 Val Gly Gln His Phe Thr Phe Val Leu Thr Asp Ile Glu Ser Lys Gln
65 70 75 80 Arg Phe Gly Phe Cys Arg Leu Thr Ser Gly Gly Thr Ile Cys
Leu Cys 85 90 95 Ile Leu Ser Tyr Leu Pro Trp Phe Glu Val Tyr Tyr
Lys Leu Leu Asn 100 105 110 Thr Leu Ala Asp Tyr Leu Ala Lys Glu Leu
Glu Asn Asp Leu Asn Glu 115 120 125 Thr Leu Arg Ser Leu Tyr Asn His
Pro Val Pro Lys Ala Asn Thr Pro 130 135 140 Val Asn Leu Ser Val Asn
Gln Glu Ile Phe Ile Ala Cys Glu Gln Val 145 150 155 160 Leu Lys Asp
Gln Pro Ala Leu Val Pro His Ser Tyr Phe Ile Ala Pro 165 170 175 Asp
Val Thr Gly Leu Pro Thr Ile Pro Glu Ser Arg Asn Leu Thr Glu 180 185
190 Tyr Phe Val Ala Val Asp Val Asn Asn Met Leu Gln Leu Tyr Ala Ser
195 200 205 Met Leu His Glu Arg Arg Ile Val Ile Ile Ser Ser Lys Leu
Ser Thr 210 215 220 Leu Thr Ala Cys Ile His Gly Ser Ala Ala Leu Leu
Tyr Pro Met Tyr 225 230 235 240 Trp Gln His Ile Tyr Ile Pro Val Leu
Pro Pro His Leu Leu Asp Tyr 245 250 255 Cys Cys Ala Pro Met Pro Tyr
Leu Ile Gly Ile His Ser Ser Leu Ile 260 265 270 Glu Arg Val Lys Asn
Lys Ser Leu Glu Asp Val Val Met Leu Asn Val 275 280 285 Asp Thr Asn
Thr Leu Glu Ser Pro Phe Ser Asp Leu Asn Asn Leu Pro 290 295 300 Ser
Asp Val Val Ser Ala Leu Lys Asn Lys Leu Lys Lys Gln Ser Thr 305 310
315 320 Ala Thr Gly Asp Gly Val Ala Arg Ala Phe Leu Arg Ala Gln Ala
Ala 325 330 335 Leu Phe Gly Ser Tyr Arg Asp Ala Leu Arg Tyr Lys Pro
Gly Glu Pro 340 345 350 Ile Thr Phe Cys Glu Glu Ser Phe Val Lys His
Arg Ser Ser Val Met 355 360 365 Lys Gln Phe Leu Glu Thr Ala Ile Asn
Leu Gln Leu Phe Lys Gln Phe 370 375 380 Ile Asp Gly Arg Leu Ala Lys
Leu Asn Ala Gly Arg Gly Phe Ser Asp 385 390 395 400 Val Phe Glu Glu
Glu Ile Thr Ser Gly Gly Phe Cys Gly Gly Lys Asp 405 410 415 Lys Leu
Gln Tyr Asp Tyr Pro Phe Ser Gln 420 425 44396PRTHomo sapiens 44Met
Ala Ala Ala Pro Arg Glu Glu Lys Arg Trp Pro Gln Pro Val Phe 1 5 10
15 Ser Asn Pro Val Val Leu Trp Lys Phe Pro Glu Asp Phe Gly Asp Gln
20 25 30 Glu Ile Leu Gln Ser Val Pro Lys Phe Cys Phe Pro Phe Asp
Val Glu 35 40 45 Arg Val Ser Gln Asn Gln Val Gly Gln His Phe Thr
Phe Val Leu Thr 50 55 60 Asp Ile Glu Ser Lys Gln Arg Phe Gly Phe
Cys Arg Leu Thr Ser Gly 65 70 75 80 Gly Thr Ile Cys Leu Cys Ile Leu
Ser Tyr Leu Pro Trp Phe Glu Val 85 90 95 Tyr Tyr Lys Leu Leu Asn
Thr Leu Ala Asp Tyr Leu Ala Lys Glu Leu 100 105 110 Glu Asn Asp Leu
Asn Glu Thr Leu Arg Ser Leu Tyr Asn His Pro Val 115 120 125 Pro Lys
Ala Asn Thr Pro Val Asn Leu Ser Val His Ser Tyr Phe Ile 130 135 140
Ala Pro Asp Val Thr Gly Leu Pro Thr Ile Pro Glu Ser Arg Asn Leu 145
150 155 160 Thr Glu Tyr Phe Val Ala Val Asp Val Asn Asn Met Leu Gln
Leu Tyr 165 170 175 Ala Ser Met Leu His Glu Arg Arg Ile Val Ile Ile
Ser Ser Lys Leu 180 185 190 Ser Thr Leu Thr Ala Cys Ile His Gly Ser
Ala Ala Leu Leu Tyr Pro 195 200 205 Met Tyr Trp Gln His Ile Tyr Ile
Pro Val Leu Pro Pro His Leu Leu 210 215 220 Asp Tyr Cys Cys Ala Pro
Met Pro Tyr Leu Ile Gly Ile His Ser Ser 225 230 235 240 Leu Ile Glu
Arg Val Lys Asn Lys Ser Leu Glu Asp Val Val Met Leu 245 250 255 Asn
Val Asp Thr Asn Thr Leu Glu Ser Pro Phe Ser Asp Leu Asn Asn 260 265
270 Leu Pro Ser Asp Val Val Ser Ala Leu Lys Asn Lys Leu Lys Lys Gln
275 280 285 Ser Thr Ala Thr Gly Asp Gly Val Ala Arg Ala Phe Leu Arg
Ala Gln 290 295 300 Ala Ala Leu Phe Gly Ser Tyr Arg Asp Ala Leu Arg
Tyr Lys Pro Gly 305 310 315 320 Glu Pro Ile Thr Phe Cys Glu Glu Ser
Phe Val Lys His Arg Ser Ser 325 330 335 Val Met Lys Gln Phe Leu Glu
Thr Ala Ile Asn Leu Gln Leu Phe Lys 340 345 350 Gln Phe Ile Asp Gly
Arg Leu Ala Lys Leu Asn Ala Gly Arg Gly Phe 355 360 365 Ser Asp Val
Phe Glu Glu Glu Ile Thr Ser Gly Gly Phe Cys Gly Gly 370 375 380 Lys
Asp Lys Leu Gln Tyr Asp Tyr Pro Phe Ser Gln 385 390 395
452117DNAHomo sapiens 45gccgggggcg cagccgacat gggcccgccg ccacggctgc
tgtgagcagc ctctttccct 60gtgtggccgc cggcgtgggc ggggacggcg cgaccctcgc
gcggccgggc tgcgggcttc 120caggccagcg cgcgggggcc ggacggacag
ccccacaccg acatgtaacc atggactgca 180ggaccaaggc aaatccagac
agaacctttg acttggtgtt gaaagtgaaa tgtcatgcct 240ctgaaaatga
agatcctgtg gtattgtgga aattcccaga ggactttgga gaccaggaaa
300tactacagag tgtgccaaag ttctgttttc cctttgacgt tgaaagggtg
tctcagaatc 360aagttggaca gcactttacc tttgtactga cagacattga
aagtaaacag agatttggat 420tctgcagact gacgtcagga ggcacaattt
gtttatgcat ccttagttac cttccctggt 480ttgaagtgta ttacaagctt
ctaaatactc ttgcagatta cttggctaag gaactggaaa 540atgatttgaa
tgaaactctc agatcactgt ataaccaccc agtaccaaag gcaaatactc
600ctgtaaattt gagtgtgcat tcctacttca ttgcccctga tgtaactgga
ctcccaacaa 660tacccgagag tagaaatctt acagaatatt ttgttgccgt
ggatgtgaac aacatgctgc 720agctgtatgc cagtatgctg catgaaaggc
gcatcgtgat tatctcgagc aaattaagca 780ctttaactgc ctgtatccat
ggatcagctg ctcttctata cccaatgtat tggcaacaca 840tatacatccc
agtgcttcct ccacacctgc tggactactg ctgtgcccca atgccatacc
900tgattggaat acactccagc ctcatagaga gagtgaaaaa caaatcattg
gaagatgttg 960ttatgttaaa tgttgataca aacacattag aatcaccatt
tagtgacttg aacaacctac 1020caagtgatgt ggtctcggcc ttgaaaaata
aactgaagaa gcagtctaca gctacgggtg 1080atggagtagc tagggccttt
cttagagcac aggctgcttt gtttggatcc tacagagatg 1140cactgagata
caaacctggt gagcccatca ctttctgtga ggagagtttt gtaaagcacc
1200gctcaagcgt gatgaaacag ttcctggaaa ctgccattaa cctccagctt
tttaagcagt 1260ttatcgatgg tcgactggca aaactaaatg caggaagggg
tttctctgat gtatttgaag 1320aagagatcac ttcaggtggc ttttgtggag
gtaaagacaa gttacaatat gattatccat 1380tttctcaata acaattttct
tggtctttgc acttgtgtct gataaaacct atttcataaa 1440caactaatga
tttcctccta aatatgtaat gtcttaaata catttttcat cttataaaag
1500ctatggaatt agcttatttt gcctgatacc tgttactcaa ggcattaagt
tggcctcctg 1560aattggcagc tgttggcctc gataatctct taatattgct
ggaaattagt aatacagaaa 1620tccaatcaac tcatatcttc ctgtctttcc
ttctgaatag tagtattctc tgctagaaaa 1680ctactagtga tggttattac
tgagtatgaa tttaagaact gaggttatga ttggtaatac 1740aatccaaaaa
gaagggtctg aacaccaaaa ttctttatac atatttaagt aactgtatta
1800ttattataca gatgtcttta cctttttgac tttatagatc actgcagcat
taagaaagtt 1860tccagtttac cattccataa gtacaattaa tccttctagt
gtaaatgttc aaatactgtt 1920ataattatct aggcaattaa taatttacaa
actgatattt ttgcacgatt gtagtggtgt 1980atagtcttga cttgcagagc
attttgcttg agtccttgaa atgtcgtgtt cattcattat 2040ttgctgagtg
cttacaatgt attaggcact gttctaaata ttaagtgtac taaataaaca
2100aaaatccttg tattctg 2117462177DNAHomo sapiens 46gccgggggcg
cagccgacat gggcccgccg ccacggctgc tgtgagcagc ctctttccct 60gtgtggccgc
cggcgtgggc ggggacggcg cgaccctcgc gcggccgggc tgcgggcttc
120caggccagcg cgcgggggcc ggacggacag ccccacaccg acatgtaacc
atggactgca 180ggaccaaggc aaatccagac agaacctttg acttggtgtt
gaaagtgaaa tgtcatgcct 240ctgaaaatga agatcctgtg gtattgtgga
aattcccaga ggactttgga gaccaggaaa 300tactacagag tgtgccaaag
ttctgttttc cctttgacgt tgaaagggtg tctcagaatc 360aagttggaca
gcactttacc tttgtactga cagacattga aagtaaacag agatttggat
420tctgcagact gacgtcagga ggcacaattt gtttatgcat ccttagttac
cttccctggt 480ttgaagtgta ttacaagctt ctaaatactc ttgcagatta
cttggctaag gaactggaaa 540atgatttgaa tgaaactctc agatcactgt
ataaccaccc agtaccaaag gcaaatactc 600ctgtaaattt gagtgtgaac
caagagatat ttattgcctg tgagcaagtt ctgaaagatc 660agcctgctct
agtaccgcat tcctacttca ttgcccctga tgtaactgga ctcccaacaa
720tacccgagag tagaaatctt acagaatatt ttgttgccgt ggatgtgaac
aacatgctgc 780agctgtatgc cagtatgctg catgaaaggc gcatcgtgat
tatctcgagc aaattaagca 840ctttaactgc ctgtatccat ggatcagctg
ctcttctata cccaatgtat tggcaacaca 900tatacatccc agtgcttcct
ccacacctgc tggactactg ctgtgcccca atgccatacc 960tgattggaat
acactccagc ctcatagaga gagtgaaaaa caaatcattg gaagatgttg
1020ttatgttaaa tgttgataca aacacattag aatcaccatt tagtgacttg
aacaacctac 1080caagtgatgt ggtctcggcc ttgaaaaata aactgaagaa
gcagtctaca gctacgggtg 1140atggagtagc tagggccttt cttagagcac
aggctgcttt gtttggatcc tacagagatg 1200cactgagata caaacctggt
gagcccatca ctttctgtga ggagagtttt gtaaagcacc 1260gctcaagcgt
gatgaaacag ttcctggaaa ctgccattaa cctccagctt tttaagcagt
1320ttatcgatgg tcgactggca aaactaaatg caggaagggg tttctctgat
gtatttgaag 1380aagagatcac ttcaggtggc ttttgtggag gtaaagacaa
gttacaatat gattatccat 1440tttctcaata acaattttct tggtctttgc
acttgtgtct gataaaacct atttcataaa 1500caactaatga tttcctccta
aatatgtaat gtcttaaata catttttcat cttataaaag 1560ctatggaatt
agcttatttt gcctgatacc tgttactcaa ggcattaagt tggcctcctg
1620aattggcagc tgttggcctc gataatctct taatattgct ggaaattagt
aatacagaaa 1680tccaatcaac tcatatcttc ctgtctttcc ttctgaatag
tagtattctc tgctagaaaa 1740ctactagtga tggttattac tgagtatgaa
tttaagaact gaggttatga ttggtaatac 1800aatccaaaaa gaagggtctg
aacaccaaaa ttctttatac atatttaagt aactgtatta 1860ttattataca
gatgtcttta cctttttgac tttatagatc actgcagcat taagaaagtt
1920tccagtttac cattccataa gtacaattaa tccttctagt gtaaatgttc
aaatactgtt 1980ataattatct aggcaattaa taatttacaa actgatattt
ttgcacgatt gtagtggtgt 2040atagtcttga cttgcagagc attttgcttg
agtccttgaa atgtcgtgtt cattcattat 2100ttgctgagtg cttacaatgt
attaggcact gttctaaata ttaagtgtac taaataaaca 2160aaaatccttg tattctg
2177472007DNAHomo sapiens 47attgagacaa aaacaagggc aaatccagac
agaacctttg acttggtgtt gaaagtgaaa 60tgtcatgcct ctgaaaatga agatcctgtg
gtattgtgga aattcccaga ggactttgga 120gaccaggaaa tactacagag
tgtgccaaag ttctgttttc cctttgacgt tgaaagggtg 180tctcagaatc
aagttggaca gcactttacc tttgtactga cagacattga aagtaaacag
240agatttggat tctgcagact gacgtcagga ggcacaattt gtttatgcat
ccttagttac 300cttccctggt ttgaagtgta ttacaagctt ctaaatactc
ttgcagatta cttggctaag 360gaactggaaa atgatttgaa tgaaactctc
agatcactgt ataaccaccc agtaccaaag 420gcaaatactc ctgtaaattt
gagtgtgaac caagagatat ttattgcctg tgagcaagtt 480ctgaaagatc
agcctgctct agtaccgcat tcctacttca ttgcccctga tgtaactgga
540ctcccaacaa tacccgagag tagaaatctt acagaatatt ttgttgccgt
ggatgtgaac 600aacatgctgc agctgtatgc cagtatgctg catgaaaggc
gcatcgtgat tatctcgagc 660aaattaagca ctttaactgc ctgtatccat
ggatcagctg ctcttctata cccaatgtat 720tggcaacaca tatacatccc
agtgcttcct ccacacctgc tggactactg ctgtgcccca 780atgccatacc
tgattggaat acactccagc ctcatagaga gagtgaaaaa caaatcattg
840gaagatgttg ttatgttaaa tgttgataca aacacattag aatcaccatt
tagtgacttg 900aacaacctac caagtgatgt ggtctcggcc ttgaaaaata
aactgaagaa gcagtctaca 960gctacgggtg atggagtagc tagggccttt
cttagagcac aggctgcttt gtttggatcc 1020tacagagatg cactgagata
caaacctggt gagcccatca ctttctgtga ggagagtttt 1080gtaaagcacc
gctcaagcgt gatgaaacag ttcctggaaa ctgccattaa cctccagctt
1140tttaagcagt ttatcgatgg tcgactggca aaactaaatg caggaagggg
tttctctgat 1200gtatttgaag aagagatcac ttcaggtggc ttttgtggag
gtaaagacaa gttacaatat 1260gattatccat tttctcaata acaattttct
tggtctttgc acttgtgtct gataaaacct 1320atttcataaa caactaatga
tttcctccta aatatgtaat gtcttaaata catttttcat 1380cttataaaag
ctatggaatt agcttatttt gcctgatacc tgttactcaa ggcattaagt
1440tggcctcctg aattggcagc tgttggcctc gataatctct taatattgct
ggaaattagt 1500aatacagaaa tccaatcaac tcatatcttc ctgtctttcc
ttctgaatag tagtattctc 1560tgctagaaaa ctactagtga tggttattac
tgagtatgaa tttaagaact gaggttatga 1620ttggtaatac aatccaaaaa
gaagggtctg aacaccaaaa ttctttatac atatttaagt 1680aactgtatta
ttattataca gatgtcttta cctttttgac tttatagatc actgcagcat
1740taagaaagtt tccagtttac cattccataa gtacaattaa tccttctagt
gtaaatgttc 1800aaatactgtt ataattatct aggcaattaa taatttacaa
actgatattt ttgcacgatt 1860gtagtggtgt atagtcttga cttgcagagc
attttgcttg agtccttgaa atgtcgtgtt 1920cattcattat ttgctgagtg
cttacaatgt attaggcact gttctaaata ttaagtgtac 1980taaataaaca
aaaatccttg tattctg 2007482197DNAHomo sapiens 48gcgggggccg
gacggacagc cccacaccga catgtaacca tggactgcag gaccaaggca 60aatccagaca
gaacctttga cttggtgttg aaagtgaaat gtcatgcctc tgaaaatgaa
120gaggacagtc cagcttatct gccgaggatt ccccctggaa aagtacgccg
attcgcattt 180tgcattaaga aactggaaaa ctttcctgtc ggtcctggcg
tagcgcctcc cgtgtccggg 240gtagaccttg taccggctga aaccgcatag
ctcgaccttc atggcggcag ctccacggga 300ggagaaaaga tggccccaac
ctgtattttc gaatcctgtg gtattgtgga aattcccaga 360ggactttgga
gaccaggaaa tactacagag tgtgccaaag ttctgttttc cctttgacgt
420tgaaagggtg tctcagaatc aagttggaca gcactttacc tttgtactga
cagacattga 480aagtaaacag agatttggat tctgcagact gacgtcagga
ggcacaattt gtttatgcat 540ccttagttac cttccctggt ttgaagtgta
ttacaagctt ctaaatactc ttgcagatta 600cttggctaag gaactggaaa
atgatttgaa tgaaactctc agatcactgt ataaccaccc 660agtaccaaag
gcaaatactc ctgtaaattt gagtgtgcat tcctacttca ttgcccctga
720tgtaactgga ctcccaacaa tacccgagag tagaaatctt acagaatatt
ttgttgccgt 780ggatgtgaac aacatgctgc agctgtatgc cagtatgctg
catgaaaggc gcatcgtgat 840tatctcgagc aaattaagca ctttaactgc
ctgtatccat ggatcagctg ctcttctata 900cccaatgtat tggcaacaca
tatacatccc agtgcttcct ccacacctgc tggactactg 960ctgtgcccca
atgccatacc tgattggaat acactccagc ctcatagaga gagtgaaaaa
1020caaatcattg gaagatgttg ttatgttaaa tgttgataca aacacattag
aatcaccatt 1080tagtgacttg aacaacctac caagtgatgt ggtctcggcc
ttgaaaaata aactgaagaa 1140gcagtctaca gctacgggtg atggagtagc
tagggccttt cttagagcac aggctgcttt 1200gtttggatcc tacagagatg
cactgagata caaacctggt gagcccatca ctttctgtga 1260ggagagtttt
gtaaagcacc gctcaagcgt gatgaaacag ttcctggaaa ctgccattaa
1320cctccagctt tttaagcagt ttatcgatgg tcgactggca aaactaaatg
caggaagggg 1380tttctctgat gtatttgaag aagagatcac ttcaggtggc
ttttgtggag gtaaagacaa 1440gttacaatat gattatccat tttctcaata
acaattttct tggtctttgc acttgtgtct 1500gataaaacct atttcataaa
caactaatga tttcctccta aatatgtaat gtcttaaata 1560catttttcat
cttataaaag ctatggaatt agcttatttt gcctgatacc tgttactcaa
1620ggcattaagt tggcctcctg aattggcagc tgttggcctc gataatctct
taatattgct 1680ggaaattagt aatacagaaa tccaatcaac tcatatcttc
ctgtctttcc ttctgaatag 1740tagtattctc tgctagaaaa ctactagtga
tggttattac tgagtatgaa tttaagaact 1800gaggttatga ttggtaatac
aatccaaaaa gaagggtctg aacaccaaaa ttctttatac 1860atatttaagt
aactgtatta ttattataca gatgtcttta cctttttgac tttatagatc
1920actgcagcat taagaaagtt tccagtttac cattccataa gtacaattaa
tccttctagt 1980gtaaatgttc aaatactgtt ataattatct aggcaattaa
taatttacaa actgatattt 2040ttgcacgatt gtagtggtgt atagtcttga
cttgcagagc attttgcttg agtccttgaa 2100atgtcgtgtt cattcattat
ttgctgagtg cttacaatgt attaggcact gttctaaata 2160ttaagtgtac
taaataaaca aaaatccttg tattctg 219749131PRTHomo sapiens 49Met Phe
Leu Gly Leu Val Gly Leu Arg Thr Lys Gly Arg Arg Trp Ile 1 5 10 15
Ser Ser Trp Ser Glu Gly Glu Asp Arg Gly Gln Ser Pro Glu Gly Val 20
25 30 Leu Leu Thr Trp Val Phe Gly Thr Lys Cys Val Met His Pro Cys
Glu 35 40 45 Glu Thr Thr Lys Gln Ala Leu Cys Glu Gln Gln Gly Cys
Leu Phe His 50 55 60 Leu Gly Ala Asp Glu Leu Ser Pro Lys Arg Glu
Ser Ala Gln Ser Ile 65 70
75 80 Ser Phe Lys Trp Glu Asn Ser Ile Tyr Leu His Ala Thr Leu Phe
Leu 85 90 95 Ile Gly Glu Tyr Leu His Leu Ala Phe Tyr Tyr Phe Leu
Leu Val Leu 100 105 110 Tyr Ile Leu Cys Ser Phe Leu Ser Tyr Cys Leu
Leu Leu Trp Leu Gly 115 120 125 Ser Phe Leu 130 502950DNAHomo
sapiens 50attatctagg tctcggagga tggagaaatc aaaagtgcca ttttctggcc
atttagaacc 60attgtcgagt ttgtattggg gccaagcagt gttgcagaag aaaataagac
atttagattt 120tagttcaggt gatagttgaa gaaattttaa gttcttgaga
acacaggcta agggagaaga 180aggaggaatg gagggtggaa gtttgcccat
agtgaaggag gcaagtttaa agagaaaggt 240agagacatgg agaaagggtt
ggggagcagc cctgggctgc aatgtgggtg agcagccaaa 300gcaggcatcc
ccgcaattga cttgccacca agggaatgtg gttgaatgac caaggcaggc
360atccctgaag atatcagacg ccaatggaat gtgggtgaat aatcaggcag
gcatccccgg 420aatgattaaa cactaaggga aggctgcctt cctgagtaca
tgaccagcac cagagttttg 480ggtccatgga taaaatgtgt ctcctttgtc
tctactagaa aatgaaagga attgaaatta 540agagaagaga gggagtgaag
ggtggcacca agaatgaaag gagaaagagg ttgagggata 600gtgagaaagg
ttggagaaga gagtaaaaag aggccactta cccgatttaa aatttgtgag
660atgttccttg ggctggttgg tctgaggacc aaaggtcgta ggtggatctc
ttcatggagt 720gagggtgagg acaggggaca gtctcctgaa ggagtcctgc
tgacctgggt ctttggcacc 780aaatgtgtca tgcatccatg tgaagagacc
accaaacagg ctttgtgtga gcaacagggc 840tgtttgtttc acctgggtgc
agacgagttg agtccaaaaa gagagtcagc ccagtctata 900tcttttaaat
gggaaaattc aatctattta catgcaacgt tattcttgat aggtgagtac
960ttacatctag cattctatta ttttctgctt gttttgtata tcctttgttc
cttcctctct 1020tactgtttac ttttgtggtt gggcagtttt ctgtagggat
aagatttgag tcttatctct 1080ttctccctat gtgttagctt taccagtgag
tttttatagt ttcacatatt tttatgatgc 1140tggttatcat cttctctgtg
gggaacaggc cccccaaaac ctggccataa actggcccca 1200aaactggcca
taaacaaaat ctctgcagca ctgtgacatg tacatgatgg tcttaacgcc
1260cacgctggaa ggttgtgggt ttaccagaat gagggcaagg aacacctggc
ccacccaggg 1320tggaaaaccg cttaaaggca ttcttaaacc acaaacaata
gcatgagtga tctgtgcctt 1380aaggccatgc tcctgctgca gatagctagt
ccaacccatc cctttatttc agcccatctc 1440ttcatttccc ataaggaata
attttagtta atctaatatc tatagaaaga atgctaatga 1500ctagcttgct
gttaataaat acatgggtaa acctctgttg gaggctctca gctctgaagg
1560ctgtgagacc cttgatttcc tacttcactc ctctatattt ctgtgtcttt
aattcctcta 1620gtgccactgg gttagagtct ccccgaccaa gctggtctca
gcaagtggtc tccatcatgg 1680gggctcgaat ccaggttgaa gggtcaccag
agtgatggtt ggagaacatg gaactagctg 1740gaggacacct gagtactctt
aaagcaaacc ccgtggtgag taagaagggg agctcagaag 1800catcagggta
acaatgggac aagtgtgggg tctggttcgt tccatcttgg aactttttca
1860cactgatgat gaggaagaag gagagtataa tgaagtaaca gaagaggtta
tagagcaggt 1920ttatttgcca gctaaagcta aagtggcaaa ggagggagag
gttcatccct acccttctgc 1980accccctcat tattattttg aagaaaaaga
gtggcctgac cctccagatc tttcttttcc 2040agaggacagt gggcaaaaat
tagttgcccc agtgactgtt caagcagcac ctcgagcgac 2100tgctcttagt
tctattcagt caggaattca gcaagctaga tgagaaggtg attaagaggc
2160ttggcagttc cctgttagac tacactgccc agaccaacag ggaaatattg
tagctacatt 2220tgagcctttt tgttttaaat tactcaaaga atttaaacaa
gctattaatc agtatggacc 2280aggttctcct tttgtaatgg gactattaaa
gaacattgct gtttccagtc agatgattcc 2340tactgactgg gacgctctta
ctcaagcttg tctaactcct gcttagttct tacaatttaa 2400aacttggtgg
gcagatgaag cttccattca ggcttctcac aacacgcagg accaacctca
2460aattaatata actgcagacc aacttttggg ggttggcagt tgggctggtt
tagatgcaca 2520aatggtcatg caggatgatg ccatagaaca gcttagagga
gcgtgcatta gagcttgggg 2580aaaaaaaatc acttcaagtg gagaacaata
ccctttcttt agtgctataa aacagggacc 2640agaagaatca tatgtggatt
ttatagctca gttacaggag tctcttaaaa agatgactgc 2700agatttggct
gctcaggata tagtgttgca attattagct ttcaacaatg ctaatcctga
2760ttgccaggct gctctgtgac ctatcagagg gaaagcacat ttagttgatt
atatcaaggc 2820ctgtggtggt atcagaggta atctgcatca ggccacctgc
tagcacgggc aatggcagga 2880ctgagagtgg atacagaaag tactccattt
cctggagctt gttttaactg tgggaagcat 2940ggtcatactg 295051132PRTHomo
sapiens 51Met Ser Gln Gly Arg His Leu Leu Glu Phe Leu Pro Leu Tyr
Ile Ala 1 5 10 15 Phe Met Leu Arg Gly Val Cys Arg Ile Asp Ala Gly
Ser Leu Asn Pro 20 25 30 Glu Leu Phe Leu Pro Met Leu His Glu Glu
Asp Trp Cys Trp Glu Ile 35 40 45 Ala Gly His Val Asp Ser Gln Glu
Leu Phe Val Gly Leu Phe Ser Ser 50 55 60 Thr Ser Thr Gly His Ala
Glu Leu Asp Lys Lys Val Asn Gly Leu Tyr 65 70 75 80 Tyr Asp Ser Val
Phe Gln Leu Ser Leu Asp Arg Met Arg His Thr Arg 85 90 95 Ser Met
Ala Arg Val Glu Arg Leu Arg His Arg Lys Ala Ile Gln Lys 100 105 110
Lys Thr Gln Leu Val His His Leu Leu Phe Lys Gly Trp Ala Ser Asp 115
120 125 Glu Thr Glu Ile 130 52399DNAHomo sapiens 52atgtcacaag
gcaggcatct tcttgagttt cttccattgt acatagcttt catgttacgt 60ggggtttgta
ggatagacgc tggaagcctt aatccagaac tgtttttgcc aatgttacat
120gaagaggatt ggtgttggga gatagctggc catgtggact cccaagagtt
attcgttggt 180ttgttttcta gtacctctac tgggcatgca gagctggaca
aaaaggttaa tggactttat 240tatgactctg tattccagtt gtctctggac
cgtatgcgtc atacaaggag tatggctaga 300gtagagaggc tgagacacag
gaaagcgatc cagaaaaaga ctcagttagt ccatcatctg 360ctatttaaag
gatgggcttc tgatgaaact gaaatttag 39953431PRTHomo sapiens 53Met Arg
Arg Leu Arg Arg Leu Ala His Leu Val Leu Phe Cys Pro Phe 1 5 10 15
Ser Lys Arg Leu Gln Gly Arg Leu Pro Gly Leu Arg Val Arg Cys Ile 20
25 30 Phe Leu Ala Trp Leu Gly Val Phe Ala Gly Ser Trp Leu Val Tyr
Val 35 40 45 His Tyr Ser Ser Tyr Ser Glu Arg Cys Arg Gly His Val
Cys Gln Val 50 55 60 Val Ile Cys Asp Gln Tyr Arg Lys Gly Ile Ile
Ser Gly Ser Val Cys 65 70 75 80 Gln Asp Leu Cys Glu Leu His Met Val
Glu Trp Arg Thr Cys Leu Ser 85 90 95 Val Ala Pro Gly Gln Gln Val
Tyr Ser Gly Leu Trp Arg Asp Lys Asp 100 105 110 Val Thr Ile Lys Cys
Gly Ile Glu Glu Thr Leu Asp Ser Lys Ala Arg 115 120 125 Ser Asp Ala
Ala Pro Arg Arg Glu Leu Val Leu Phe Asp Lys Pro Thr 130 135 140 Arg
Gly Thr Ser Ile Lys Glu Phe Arg Glu Met Thr Leu Ser Phe Leu 145 150
155 160 Lys Ala Asn Leu Gly Asp Leu Pro Ser Leu Pro Ala Leu Val Gly
Gln 165 170 175 Val Leu Leu Met Ala Asp Phe Asn Lys Asp Asn Arg Val
Ser Leu Ala 180 185 190 Glu Ala Lys Ser Val Trp Ala Leu Leu Gln Arg
Asn Glu Phe Leu Leu 195 200 205 Leu Leu Ser Leu Gln Glu Lys Glu His
Ala Ser Arg Leu Leu Gly Tyr 210 215 220 Cys Gly Asp Leu Tyr Leu Thr
Glu Gly Val Pro His Gly Ala Trp His 225 230 235 240 Ala Ala Ala Leu
Pro Pro Leu Leu Arg Pro Leu Leu Pro Pro Ala Leu 245 250 255 Gln Gly
Ala Leu Gln Gln Trp Leu Gly Pro Ala Trp Pro Trp Arg Ala 260 265 270
Lys Ile Ala Ile Gly Leu Leu Glu Phe Val Glu Glu Leu Phe His Gly 275
280 285 Ser Tyr Gly Thr Phe Tyr Met Cys Glu Thr Thr Leu Ala Asn Val
Gly 290 295 300 Tyr Thr Ala Thr Tyr Asp Phe Lys Met Ala Asp Leu Gln
Gln Val Ala 305 310 315 320 Pro Glu Ala Thr Val Arg Arg Phe Leu Gln
Gly Arg Arg Cys Glu His 325 330 335 Ser Thr Asp Cys Thr Tyr Gly Arg
Asp Cys Arg Ala Pro Cys Asp Arg 340 345 350 Leu Met Arg Gln Cys Lys
Gly Asp Leu Ile Gln Pro Asn Leu Ala Lys 355 360 365 Val Cys Ala Leu
Leu Arg Gly Tyr Leu Leu Pro Gly Ala Pro Ala Asp 370 375 380 Leu Arg
Glu Glu Leu Gly Thr Gln Leu Arg Thr Cys Thr Thr Leu Ser 385 390 395
400 Gly Leu Ala Ser Gln Val Glu Ala His His Ser Leu Val Leu Ser His
405 410 415 Leu Lys Thr Leu Leu Trp Lys Lys Ile Ser Asn Thr Lys Tyr
Ser 420 425 430 54344PRTHomo sapiens 54Met Val Glu Trp Arg Thr Cys
Leu Ser Val Ala Pro Gly Gln Gln Val 1 5 10 15 Tyr Ser Gly Leu Trp
Arg Asp Lys Asp Val Thr Ile Lys Cys Gly Ile 20 25 30 Glu Glu Thr
Leu Asp Ser Lys Ala Arg Ser Asp Ala Ala Pro Arg Arg 35 40 45 Glu
Leu Val Leu Phe Asp Lys Pro Thr Arg Gly Thr Ser Ile Lys Glu 50 55
60 Phe Arg Glu Met Thr Leu Ser Phe Leu Lys Ala Asn Leu Gly Asp Leu
65 70 75 80 Pro Ser Leu Pro Ala Leu Val Gly Gln Val Leu Leu Met Ala
Asp Phe 85 90 95 Asn Lys Asp Asn Arg Val Ser Leu Ala Glu Ala Lys
Ser Val Trp Ala 100 105 110 Leu Leu Gln Arg Asn Glu Phe Leu Leu Leu
Leu Ser Leu Gln Glu Lys 115 120 125 Glu His Ala Ser Arg Leu Leu Gly
Tyr Cys Gly Asp Leu Tyr Leu Thr 130 135 140 Glu Gly Val Pro His Gly
Ala Trp His Ala Ala Ala Leu Pro Pro Leu 145 150 155 160 Leu Arg Pro
Leu Leu Pro Pro Ala Leu Gln Gly Ala Leu Gln Gln Trp 165 170 175 Leu
Gly Pro Ala Trp Pro Trp Arg Ala Lys Ile Ala Ile Gly Leu Leu 180 185
190 Glu Phe Val Glu Glu Leu Phe His Gly Ser Tyr Gly Thr Phe Tyr Met
195 200 205 Cys Glu Thr Thr Leu Ala Asn Val Gly Tyr Thr Ala Thr Tyr
Asp Phe 210 215 220 Lys Met Ala Asp Leu Gln Gln Val Ala Pro Glu Ala
Thr Val Arg Arg 225 230 235 240 Phe Leu Gln Gly Arg Arg Cys Glu His
Ser Thr Asp Cys Thr Tyr Gly 245 250 255 Arg Asp Cys Arg Ala Pro Cys
Asp Arg Leu Met Arg Gln Cys Lys Gly 260 265 270 Asp Leu Ile Gln Pro
Asn Leu Ala Lys Val Cys Ala Leu Leu Arg Gly 275 280 285 Tyr Leu Leu
Pro Gly Ala Pro Ala Asp Leu Arg Glu Glu Leu Gly Thr 290 295 300 Gln
Leu Arg Thr Cys Thr Thr Leu Ser Gly Leu Ala Ser Gln Val Glu 305 310
315 320 Ala His His Ser Leu Val Leu Ser His Leu Lys Thr Leu Leu Trp
Lys 325 330 335 Lys Ile Ser Asn Thr Lys Tyr Ser 340 551645DNAHomo
sapiens 55agggagcggc ggccgctgcg ggccgggccg ggccggggct gaggccgagc
gagccgcggg 60gcccgcgcag ccccggccgg agcccaccat gcggcggctg cggcgcctgg
cgcacctggt 120gctcttctgc cccttctcca agcgcctgca gggccggctc
ccaggcctca gggtccgctg 180catcttcctg gcctggctgg gcgtctttgc
aggcagctgg ctggtgtacg tgcactactc 240gtcctactcg gagcgctgtc
gcggccatgt ctgccaggtg gtcatttgtg accagtaccg 300caaggggatc
atctcgggct ccgtctgcca ggacctgtgt gagctgcata tggtggagtg
360gaggacctgc ctctcggtgg ccccgggcca gcaggtgtac agcgggctct
ggcgggacaa 420ggatgtaacc atcaagtgtg gcattgagga gaccctcgac
tccaaggccc ggtcggatgc 480ggccccccgg cgggagctgg tactgtttga
caagcccacc cggggcacct ccatcaagga 540attccgggag atgaccctca
gcttcctcaa ggcgaacctg ggagacctgc cttccctgcc 600ggcgctggtt
ggccaggtcc tgctcatggc tgacttcaac aaggacaacc gggtgtccct
660ggcggaagcc aagtccgtgt gggccctgct gcagcgtaac gagttcctgc
tgctgctgtc 720cctgcaggag aaggagcacg cctccagact gctgggctac
tgtggggacc tctacctcac 780cgagggcgtg ccgcatggcg cctggcacgc
ggccgccctt ccacccctgt tgcgcccact 840gctgccgcct gccctgcagg
gtgctctcca gcagtggctg gggcctgcgt ggccttggcg 900ggccaagatc
gccatcggcc tgctggagtt cgtggaggag ctcttccacg gctcttacgg
960gactttctac atgtgtgaga ccacactggc caacgtgggc tacacagcca
cctacgactt 1020caagatggcc gacctgcagc aggtggcacc cgaggccacc
gtgcgccgct tcctgcaggg 1080ccgccgctgc gagcacagca ccgactgcac
ctacgggcgc gactgcaggg ccccgtgtga 1140caggctcatg aggcagtgca
agggcgacct catccagccc aacctggcca aggtgtgcgc 1200actgctacgg
ggctacctgc tgcctggcgc gcccgccgac ctccgcgagg agctgggcac
1260acagctgcgc acctgtacca cgctgagcgg gctggccagc caggtggagg
cccatcactc 1320gctggtgctc agccacctca agactctgct ctggaagaag
atctccaaca ccaagtactc 1380ttgatggggc agtgaggggc ctggccaccc
ttcctggagc tggccaggtg ccagggtcca 1440accctccctc aaggaatcct
gtcagaagat gtgaaatgca actgtgttgc aaaatcactc 1500ccctaccgtc
agggctctgg attccagcac cacagacatg agaccccagc tcggagcaaa
1560ggcggacatg gacatcccgg caggagagtc ctccaagggg gtttgttact
ctgaagaacg 1620taatgtcaat aaacagcttt tatgt 1645562410DNAHomo
sapiens 56ggccgcagcc tgtcctcctg cctcagcctg gggaaggctg ggccgggcca
aagagggagg 60ccaagtcctt gggacaggag gagacccaca cctgagatta gtggaaaccc
agccagaagc 120tgccatgcag gtgtggctct gggcatcagg atctgtcgtg
agagccccca tgagtgccca 180gtgcagaatg gctggcagcc cgccctgacg
gggagcagag gggctggacg cggtgccctt 240cacgggatga cacccagctg
tttgccctgt gtccaggggg ttgcttctct gacagaggcc 300ctatggctcg
tgtctgactc ctgtccaggt tctgccagcc tgaccatcca tcgctctggc
360accaagagcc cacccttttg ttcttcctgg cgtcccaggg aaagccctgc
ctgggtgggg 420cagctcctgg cccttcagat ggaagacgca gtccagtcag
caccatcata ggaaacaagt 480tcagaaatgt ctcacttact attccgggca
gggagggcgc catgagtcag ggggtgcatc 540ctccctcctg gcgtcacccg
aggcaggaat gaagagtcag gcagagagcg cgcgtgtggc 600agctggtggt
gtagatatta gggactagtg tgaattctag ttcaccggcc aatgcctgga
660tggtccagag ctgggtcggc tgggcggaga gctgcctcca ggttcctgcc
tctggccctg 720gcgtggggtc gacactgggt gtggtgtgtg tctcatgtcc
aggcagtggc ctttgctgtg 780ccgtcctgtt acaggagcca ggatggtggg
gacgggaccg gaccggaggg ttggcggggc 840tgcccctgca gccgacagcc
ccatcctgca gccaccaatg gcatgaccca gggccccggc 900actgcctgtg
tgaggggctg gcagctttcc aactgcagca agtggaggcc cctgccagct
960tcgggcctgt gggcaggggc tcagtggggc aggggtgtgg ctgccccgcc
cggcacgcct 1020gcacctgtct cctcagtgtg accagtaccg caaggggatc
atctcgggct ccgtctgcca 1080ggacctgtgt gagctgcata tggtggagtg
gaggacctgc ctctcggtgg ccccgggcca 1140gcaggtgtac agcgggctct
ggcgggacaa ggatgtaacc atcaagtgtg gcattgagga 1200gaccctcgac
tccaaggccc ggtcggatgc ggccccccgg cgggagctgg tactgtttga
1260caagcccacc cggggcacct ccatcaagga attccgggag atgaccctca
gcttcctcaa 1320ggcgaacctg ggagacctgc cttccctgcc ggcgctggtt
ggccaggtcc tgctcatggc 1380tgacttcaac aaggacaacc gggtgtccct
ggcggaagcc aagtccgtgt gggccctgct 1440gcagcgtaac gagttcctgc
tgctgctgtc cctgcaggag aaggagcacg cctccagact 1500gctgggctac
tgtggggacc tctacctcac cgagggcgtg ccgcatggcg cctggcacgc
1560ggccgccctt ccacccctgt tgcgcccact gctgccgcct gccctgcagg
gtgctctcca 1620gcagtggctg gggcctgcgt ggccttggcg ggccaagatc
gccatcggcc tgctggagtt 1680cgtggaggag ctcttccacg gctcttacgg
gactttctac atgtgtgaga ccacactggc 1740caacgtgggc tacacagcca
cctacgactt caagatggcc gacctgcagc aggtggcacc 1800cgaggccacc
gtgcgccgct tcctgcaggg ccgccgctgc gagcacagca ccgactgcac
1860ctacgggcgc gactgcaggg ccccgtgtga caggctcatg aggcagtgca
agggcgacct 1920catccagccc aacctggcca aggtgtgcgc actgctacgg
ggctacctgc tgcctggcgc 1980gcccgccgac ctccgcgagg agctgggcac
acagctgcgc acctgtacca cgctgagcgg 2040gctggccagc caggtggagg
cccatcactc gctggtgctc agccacctca agactctgct 2100ctggaagaag
atctccaaca ccaagtactc ttgatggggc agtgaggggc ctggccaccc
2160ttcctggagc tggccaggtg ccagggtcca accctccctc aaggaatcct
gtcagaagat 2220gtgaaatgca actgtgttgc aaaatcactc ccctaccgtc
agggctctgg attccagcac 2280cacagacatg agaccccagc tcggagcaaa
ggcggacatg gacatcccgg caggagagtc 2340ctccaagggg gtttgttact
ctgaagaacg taatgtcaat aaacagcttt tatgtaatgc 2400ccagggctga
241057423PRTHomo sapiens 57Met Gly Gly Pro Arg Ala Trp Ala Leu Leu
Cys Leu Gly Leu Leu Leu 1 5 10 15 Pro Gly Gly Gly Ala Ala Trp Ser
Ile Gly Ala Ala Pro Phe Ser Gly 20 25 30 Arg Arg Asn Trp Cys Ser
Tyr Val Val Thr Arg Thr Ile Ser Cys His 35 40 45 Val Gln Asn Gly
Thr Tyr Leu Gln Arg Val Leu Gln Asn Cys Pro Trp 50 55 60 Pro Met
Ser Cys Pro Gly Ser Arg Thr Val Val Arg Pro Thr Tyr Lys 65 70 75 80
Val Met Tyr Lys Ile Val Thr Ala Pro Ser Ser Ala Ser Leu Glu Pro 85
90 95 Met Trp Ser Gly Ser Thr Met Arg Arg Met Ala Leu Arg Pro Thr
Ala 100 105 110 Phe Ser Gly Cys Leu Asn Cys Ser Lys Val Ser Glu Leu
Thr Glu Arg 115 120 125 Leu Lys Val Leu Glu Ala Lys Met Thr Met Leu
Thr Val Ile Glu Gln 130 135 140 Pro Val Pro Pro Thr Pro Ala Thr Pro
Glu Asp Pro Ala Pro Leu Trp 145 150 155 160 Gly Pro Pro Pro
Ala Gln Gly Ser Pro Gly Asp Gly Gly Leu Gln Asp 165 170 175 Gln Val
Gly Ala Trp Gly Leu Pro Gly Pro Thr Gly Pro Lys Gly Asp 180 185 190
Ala Gly Ser Arg Gly Pro Met Gly Met Arg Gly Pro Pro Gly Pro Gln 195
200 205 Gly Pro Pro Gly Ser Pro Gly Arg Ala Gly Ala Val Gly Thr Pro
Gly 210 215 220 Glu Arg Gly Pro Pro Gly Pro Pro Gly Pro Pro Gly Pro
Pro Gly Pro 225 230 235 240 Pro Ala Pro Val Gly Pro Pro His Ala Arg
Ile Ser Gln His Gly Asp 245 250 255 Pro Leu Leu Ser Asn Thr Phe Thr
Glu Thr Asn Asn His Trp Pro Gln 260 265 270 Gly Pro Thr Gly Pro Pro
Gly Pro Pro Gly Pro Met Gly Pro Pro Gly 275 280 285 Pro Pro Gly Pro
Thr Gly Val Pro Gly Ser Pro Gly His Ile Gly Pro 290 295 300 Pro Gly
Pro Thr Gly Pro Lys Gly Ile Ser Gly His Pro Gly Glu Lys 305 310 315
320 Gly Glu Arg Gly Leu Arg Gly Glu Pro Gly Pro Gln Gly Ser Ala Gly
325 330 335 Gln Arg Gly Glu Pro Gly Pro Lys Gly Asp Pro Gly Glu Lys
Ser His 340 345 350 Trp Gly Glu Gly Leu His Gln Leu Arg Glu Ala Leu
Lys Ile Leu Ala 355 360 365 Glu Arg Val Leu Ile Leu Glu Thr Met Ile
Gly Leu Tyr Glu Pro Glu 370 375 380 Leu Gly Ser Gly Ala Gly Pro Ala
Gly Thr Gly Thr Pro Ser Leu Leu 385 390 395 400 Arg Gly Lys Arg Gly
Gly His Ala Thr Asn Tyr Arg Ile Val Ala Pro 405 410 415 Arg Ser Arg
Asp Glu Arg Gly 420 58212PRTHomo sapiens 58Met Gly Gly Pro Arg Ala
Trp Ala Leu Leu Cys Leu Gly Leu Leu Leu 1 5 10 15 Pro Gly Gly Gly
Ala Ala Trp Ser Ile Gly Ala Ala Pro Phe Ser Gly 20 25 30 Arg Arg
Asn Trp Cys Ser Tyr Val Val Thr Arg Thr Ile Ser Cys His 35 40 45
Val Gln Asn Gly Thr Tyr Leu Gln Arg Val Leu Gln Asn Cys Pro Trp 50
55 60 Pro Met Ser Cys Pro Gly Ser Ser Tyr Arg Thr Val Val Arg Pro
Thr 65 70 75 80 Tyr Lys Val Met Tyr Lys Ile Val Thr Ala Arg Glu Trp
Arg Cys Cys 85 90 95 Pro Gly His Ser Gly Val Ser Cys Glu Glu Val
Ala Ala Ser Ser Ala 100 105 110 Ser Leu Glu Pro Met Trp Ser Gly Ser
Thr Met Arg Arg Met Ala Leu 115 120 125 Arg Pro Thr Ala Phe Ser Gly
Cys Leu Asn Cys Ser Lys Val Ser Glu 130 135 140 Leu Thr Glu Arg Leu
Lys Val Leu Glu Ala Lys Met Thr Met Leu Thr 145 150 155 160 Val Ile
Glu Gln Pro Val Pro Pro Thr Pro Ala Thr Pro Glu Asp Pro 165 170 175
Ala Pro Leu Trp Gly Pro Pro Pro Ala Gln Gly Ser Pro Gly Asp Gly 180
185 190 Gly Leu Gln Gly Asp Pro Leu Leu Ser Asn Thr Phe Thr Glu Thr
Asn 195 200 205 Asn His Trp Pro 210 59175PRTHomo sapiens 59Met Gly
Gly Pro Arg Ala Trp Ala Leu Leu Cys Leu Gly Leu Leu Leu 1 5 10 15
Pro Gly Gly Gly Ala Ala Trp Ser Ile Gly Ala Ala Pro Phe Ser Gly 20
25 30 Arg Arg Asn Trp Cys Ser Tyr Val Val Thr Arg Thr Ile Ser Cys
His 35 40 45 Val Gln Asn Gly Thr Tyr Leu Gln Arg Val Leu Gln Asn
Cys Pro Trp 50 55 60 Pro Met Ser Cys Pro Gly Ser Ser Tyr Arg Thr
Val Val Arg Pro Thr 65 70 75 80 Tyr Lys Val Met Tyr Lys Ile Val Thr
Ala Arg Glu Trp Arg Cys Cys 85 90 95 Pro Gly His Ser Gly Val Ser
Cys Glu Glu Gly Cys Leu Asn Cys Ser 100 105 110 Lys Val Ser Glu Leu
Thr Glu Arg Leu Lys Val Leu Glu Ala Lys Met 115 120 125 Thr Met Leu
Thr Val Ile Glu Gln Pro Val Pro Pro Thr Pro Ala Thr 130 135 140 Pro
Glu Asp Pro Ala Pro Leu Trp Gly Pro Pro Pro Ala Gln Gly Ser 145 150
155 160 Pro Gly Asp Gly Gly Leu Gln Asp Gln Val Gly Ala Trp Gly Leu
165 170 175 6034PRTHomo sapiens 60Met Gly Gly Pro Arg Ala Trp Ala
Leu Leu Cys Leu Gly Leu Leu Leu 1 5 10 15 Pro Gly Gly Gly Ala Ala
Trp Ser Ile Gly Ala Ala Pro Phe Ser Gly 20 25 30 Arg Arg
61445PRTHomo sapiens 61Met Gly Gly Pro Arg Ala Trp Ala Leu Leu Cys
Leu Gly Leu Leu Leu 1 5 10 15 Pro Gly Gly Gly Ala Ala Trp Ser Ile
Gly Ala Ala Pro Phe Ser Gly 20 25 30 Arg Arg Asn Trp Cys Ser Tyr
Val Val Thr Arg Thr Ile Ser Cys His 35 40 45 Val Gln Asn Gly Thr
Tyr Leu Gln Arg Val Leu Gln Asn Cys Pro Trp 50 55 60 Pro Met Ser
Cys Pro Gly Ser Ser Tyr Arg Thr Val Val Arg Pro Thr 65 70 75 80 Tyr
Lys Val Met Tyr Lys Ile Val Thr Ala Arg Glu Trp Arg Cys Cys 85 90
95 Pro Gly His Ser Gly Val Ser Cys Glu Glu Val Ala Ala Ser Ser Ala
100 105 110 Ser Leu Glu Pro Met Trp Ser Gly Ser Thr Met Arg Arg Met
Ala Leu 115 120 125 Arg Pro Thr Ala Phe Ser Gly Cys Leu Asn Cys Ser
Lys Val Ser Glu 130 135 140 Leu Thr Glu Arg Leu Lys Val Leu Glu Ala
Lys Met Thr Met Leu Thr 145 150 155 160 Val Ile Glu Gln Pro Val Pro
Pro Thr Pro Ala Thr Pro Glu Asp Pro 165 170 175 Ala Pro Leu Trp Gly
Pro Pro Pro Ala Gln Gly Ser Pro Gly Asp Gly 180 185 190 Gly Leu Gln
Asp Gln Val Gly Ala Trp Gly Leu Pro Gly Pro Thr Gly 195 200 205 Pro
Lys Gly Asp Ala Gly Ser Arg Gly Pro Met Gly Met Arg Gly Pro 210 215
220 Pro Gly Pro Gln Gly Pro Pro Gly Ser Pro Gly Arg Ala Gly Ala Val
225 230 235 240 Gly Thr Pro Gly Glu Arg Gly Pro Pro Gly Pro Pro Gly
Pro Pro Gly 245 250 255 Pro Pro Gly Pro Pro Ala Pro Val Gly Pro Pro
His Ala Arg Ile Ser 260 265 270 Gln His Gly Asp Pro Leu Leu Ser Asn
Thr Phe Thr Glu Thr Asn Asn 275 280 285 His Trp Pro Gln Gly Pro Thr
Gly Pro Pro Gly Pro Pro Gly Pro Met 290 295 300 Gly Pro Pro Gly Pro
Pro Gly Pro Thr Gly Val Pro Gly Ser Pro Gly 305 310 315 320 His Ile
Gly Pro Pro Gly Pro Thr Gly Pro Lys Gly Ile Ser Gly His 325 330 335
Pro Gly Glu Lys Gly Glu Arg Gly Leu Arg Gly Glu Pro Gly Pro Gln 340
345 350 Gly Ser Ala Gly Gln Arg Gly Glu Pro Gly Pro Lys Gly Asp Pro
Gly 355 360 365 Glu Lys Ser His Trp Ala Pro Ser Leu Gln Ser Phe Leu
Gln Gln Gln 370 375 380 Ala Gln Leu Glu Leu Leu Ala Arg Arg Val Thr
Leu Leu Glu Ala Ile 385 390 395 400 Ile Trp Pro Glu Pro Glu Leu Gly
Ser Gly Ala Gly Pro Ala Gly Thr 405 410 415 Gly Thr Pro Ser Leu Leu
Arg Gly Lys Arg Gly Gly His Ala Thr Asn 420 425 430 Tyr Arg Ile Val
Ala Pro Arg Ser Arg Asp Glu Arg Gly 435 440 445 62443PRTHomo
sapiens 62Met Gly Gly Pro Arg Ala Trp Ala Leu Leu Cys Leu Gly Leu
Leu Leu 1 5 10 15 Pro Gly Gly Gly Ala Ala Trp Ser Ile Gly Ala Ala
Pro Phe Ser Gly 20 25 30 Arg Arg Asn Trp Cys Ser Tyr Val Val Thr
Arg Thr Ile Ser Cys His 35 40 45 Val Gln Asn Gly Thr Tyr Leu Gln
Arg Val Leu Gln Asn Cys Pro Trp 50 55 60 Pro Met Ser Cys Pro Gly
Ser Ser Tyr Arg Thr Val Val Arg Pro Thr 65 70 75 80 Tyr Lys Val Met
Tyr Lys Ile Val Thr Ala Arg Glu Trp Arg Cys Cys 85 90 95 Pro Gly
His Ser Gly Val Ser Cys Glu Glu Val Ala Ala Ser Ser Ala 100 105 110
Ser Leu Glu Pro Met Trp Ser Gly Ser Thr Met Arg Arg Met Ala Leu 115
120 125 Arg Pro Thr Ala Phe Ser Gly Cys Leu Asn Cys Ser Lys Val Ser
Glu 130 135 140 Leu Thr Glu Arg Leu Lys Val Leu Glu Ala Lys Met Thr
Met Leu Thr 145 150 155 160 Val Ile Glu Gln Pro Val Pro Pro Thr Pro
Ala Thr Pro Glu Asp Pro 165 170 175 Ala Pro Leu Trp Gly Pro Pro Pro
Ala Gln Gly Ser Pro Gly Asp Gly 180 185 190 Gly Leu Gln Asp Gln Val
Gly Ala Trp Gly Leu Pro Gly Pro Thr Gly 195 200 205 Pro Lys Gly Asp
Ala Gly Ser Arg Gly Pro Met Gly Met Arg Gly Pro 210 215 220 Pro Gly
Pro Gln Gly Pro Pro Gly Ser Pro Gly Arg Ala Gly Ala Val 225 230 235
240 Gly Thr Pro Gly Glu Arg Gly Pro Pro Gly Pro Pro Gly Pro Pro Gly
245 250 255 Pro Pro Gly Pro Pro Ala Pro Val Gly Pro Pro His Ala Arg
Ile Ser 260 265 270 Gln His Gly Asp Pro Leu Leu Ser Asn Thr Phe Thr
Glu Thr Asn Asn 275 280 285 His Trp Pro Gln Gly Pro Thr Gly Pro Pro
Gly Pro Pro Gly Pro Met 290 295 300 Gly Pro Pro Gly Pro Pro Gly Pro
Thr Gly Val Pro Gly Ser Pro Gly 305 310 315 320 His Ile Gly Pro Pro
Gly Pro Thr Gly Pro Lys Gly Ile Ser Gly His 325 330 335 Pro Gly Glu
Lys Gly Glu Arg Gly Leu Arg Gly Glu Pro Gly Pro Gln 340 345 350 Gly
Ser Ala Gly Gln Arg Gly Glu Pro Gly Pro Lys Gly Asp Pro Gly 355 360
365 Glu Lys Ser His Trp Gly Glu Gly Leu His Gln Leu Arg Glu Ala Leu
370 375 380 Lys Ile Leu Ala Glu Arg Val Leu Ile Leu Glu Thr Met Ile
Gly Leu 385 390 395 400 Tyr Glu Pro Glu Leu Gly Ser Gly Ala Gly Pro
Ala Gly Thr Gly Thr 405 410 415 Pro Ser Leu Leu Arg Gly Lys Arg Gly
Gly His Ala Thr Asn Tyr Arg 420 425 430 Ile Val Ala Pro Arg Ser Arg
Asp Glu Arg Gly 435 440 63441PRTHomo sapiens 63Met Gly Gly Pro Arg
Ala Trp Ala Leu Leu Cys Leu Gly Leu Leu Leu 1 5 10 15 Pro Gly Gly
Gly Ala Ala Trp Ser Ile Gly Ala Ala Pro Phe Ser Gly 20 25 30 Arg
Arg Asn Trp Cys Ser Tyr Val Val Thr Arg Thr Ile Ser Cys His 35 40
45 Val Gln Asn Gly Thr Tyr Leu Gln Arg Val Leu Gln Asn Cys Pro Trp
50 55 60 Pro Met Ser Cys Pro Gly Ser Ser Tyr Arg Thr Val Val Arg
Pro Thr 65 70 75 80 Tyr Lys Val Met Tyr Lys Ile Val Thr Ala Arg Glu
Trp Arg Cys Cys 85 90 95 Pro Gly His Ser Gly Val Ser Cys Glu Glu
Ala Ser Ser Ala Ser Leu 100 105 110 Glu Pro Met Trp Ser Gly Ser Thr
Met Arg Arg Met Ala Leu Arg Pro 115 120 125 Thr Ala Phe Ser Gly Cys
Leu Asn Cys Ser Lys Val Ser Glu Leu Thr 130 135 140 Glu Arg Leu Lys
Val Leu Glu Ala Lys Met Thr Met Leu Thr Val Ile 145 150 155 160 Glu
Gln Pro Val Pro Pro Thr Pro Ala Thr Pro Glu Asp Pro Ala Pro 165 170
175 Leu Trp Gly Pro Pro Pro Ala Gln Gly Ser Pro Gly Asp Gly Gly Leu
180 185 190 Gln Asp Gln Val Gly Ala Trp Gly Leu Pro Gly Pro Thr Gly
Pro Lys 195 200 205 Gly Asp Ala Gly Ser Arg Gly Pro Met Gly Met Arg
Gly Pro Pro Gly 210 215 220 Pro Gln Gly Pro Pro Gly Ser Pro Gly Arg
Ala Gly Ala Val Gly Thr 225 230 235 240 Pro Gly Glu Arg Gly Pro Pro
Gly Pro Pro Gly Pro Pro Gly Pro Pro 245 250 255 Gly Pro Pro Ala Pro
Val Gly Pro Pro His Ala Arg Ile Ser Gln His 260 265 270 Gly Asp Pro
Leu Leu Ser Asn Thr Phe Thr Glu Thr Asn Asn His Trp 275 280 285 Pro
Gln Gly Pro Thr Gly Pro Pro Gly Pro Pro Gly Pro Met Gly Pro 290 295
300 Pro Gly Pro Pro Gly Pro Thr Gly Val Pro Gly Ser Pro Gly His Ile
305 310 315 320 Gly Pro Pro Gly Pro Thr Gly Pro Lys Gly Ile Ser Gly
His Pro Gly 325 330 335 Glu Lys Gly Glu Arg Gly Leu Arg Gly Glu Pro
Gly Pro Gln Gly Ser 340 345 350 Ala Gly Gln Arg Gly Glu Pro Gly Pro
Lys Gly Asp Pro Gly Glu Lys 355 360 365 Ser His Trp Gly Glu Gly Leu
His Gln Leu Arg Glu Ala Leu Lys Ile 370 375 380 Leu Ala Glu Arg Val
Leu Ile Leu Glu Thr Met Ile Gly Leu Tyr Glu 385 390 395 400 Pro Glu
Leu Gly Ser Gly Ala Gly Pro Ala Gly Thr Gly Thr Pro Ser 405 410 415
Leu Leu Arg Gly Lys Arg Gly Gly His Ala Thr Asn Tyr Arg Ile Val 420
425 430 Ala Pro Arg Ser Arg Asp Glu Arg Gly 435 440 64439PRTHomo
sapiens 64Met Gly Gly Pro Arg Ala Trp Ala Leu Leu Cys Leu Gly Leu
Leu Leu 1 5 10 15 Pro Gly Gly Gly Ala Ala Trp Ser Ile Gly Ala Ala
Pro Phe Ser Gly 20 25 30 Arg Arg Asn Trp Cys Ser Tyr Val Val Thr
Arg Thr Ile Ser Cys His 35 40 45 Val Gln Asn Gly Thr Tyr Leu Gln
Arg Val Leu Gln Asn Cys Pro Trp 50 55 60 Pro Met Ser Cys Pro Gly
Ser Arg Thr Val Val Arg Pro Thr Tyr Lys 65 70 75 80 Val Met Tyr Lys
Ile Val Thr Ala Arg Glu Trp Arg Cys Cys Pro Gly 85 90 95 His Ser
Gly Val Ser Cys Glu Glu Ala Ser Ser Ala Ser Leu Glu Pro 100 105 110
Met Trp Ser Gly Ser Thr Met Arg Arg Met Ala Leu Arg Pro Thr Ala 115
120 125 Phe Ser Gly Cys Leu Asn Cys Ser Lys Val Ser Glu Leu Thr Glu
Arg 130 135 140 Leu Lys Val Leu Glu Ala Lys Met Thr Met Leu Thr Val
Ile Glu Gln 145 150 155 160 Pro Val Pro Pro Thr Pro Ala Thr Pro Glu
Asp Pro Ala Pro Leu Trp 165 170 175 Gly Pro Pro Pro Ala Gln Gly Ser
Pro Gly Asp Gly Gly Leu Gln Asp 180 185 190 Gln Val Gly Ala Trp Gly
Leu Pro Gly Pro Thr Gly Pro Lys Gly Asp 195 200 205 Ala Gly Ser Arg
Gly Pro Met Gly Met Arg Gly Pro Pro Gly Pro Gln 210 215 220 Gly Pro
Pro Gly Ser Pro Gly Arg Ala Gly Ala Val Gly Thr Pro Gly 225 230 235
240 Glu Arg Gly Pro Pro Gly Pro Pro Gly Pro Pro Gly Pro Pro Gly Pro
245 250 255 Pro Ala Pro Val Gly Pro Pro His Ala Arg Ile Ser Gln His
Gly Asp 260 265 270 Pro Leu Leu Ser Asn Thr Phe Thr Glu
Thr Asn Asn His Trp Pro Gln 275 280 285 Gly Pro Thr Gly Pro Pro Gly
Pro Pro Gly Pro Met Gly Pro Pro Gly 290 295 300 Pro Pro Gly Pro Thr
Gly Val Pro Gly Ser Pro Gly His Ile Gly Pro 305 310 315 320 Pro Gly
Pro Thr Gly Pro Lys Gly Ile Ser Gly His Pro Gly Glu Lys 325 330 335
Gly Glu Arg Gly Leu Arg Gly Glu Pro Gly Pro Gln Gly Ser Ala Gly 340
345 350 Gln Arg Gly Glu Pro Gly Pro Lys Gly Asp Pro Gly Glu Lys Ser
His 355 360 365 Trp Gly Glu Gly Leu His Gln Leu Arg Glu Ala Leu Lys
Ile Leu Ala 370 375 380 Glu Arg Val Leu Ile Leu Glu Thr Met Ile Gly
Leu Tyr Glu Pro Glu 385 390 395 400 Leu Gly Ser Gly Ala Gly Pro Ala
Gly Thr Gly Thr Pro Ser Leu Leu 405 410 415 Arg Gly Lys Arg Gly Gly
His Ala Thr Asn Tyr Arg Ile Val Ala Pro 420 425 430 Arg Ser Arg Asp
Glu Arg Gly 435 65422PRTHomo sapiens 65Met Gly Gly Pro Arg Ala Trp
Ala Leu Leu Cys Leu Gly Leu Leu Leu 1 5 10 15 Pro Gly Gly Gly Ala
Ala Trp Ser Ile Gly Ala Ala Pro Phe Ser Gly 20 25 30 Arg Arg Asn
Trp Cys Ser Tyr Val Val Thr Arg Thr Ile Ser Cys His 35 40 45 Val
Gln Asn Gly Thr Tyr Leu Gln Arg Val Leu Gln Asn Cys Pro Trp 50 55
60 Pro Met Ser Cys Pro Gly Ser Ser Tyr Arg Thr Val Val Arg Pro Thr
65 70 75 80 Tyr Lys Val Met Tyr Lys Ile Val Thr Ala Arg Glu Trp Arg
Cys Cys 85 90 95 Pro Gly His Ser Gly Val Ser Cys Glu Glu Val Ala
Ala Ser Ser Ala 100 105 110 Ser Leu Glu Pro Met Trp Ser Gly Ser Thr
Met Arg Arg Met Ala Leu 115 120 125 Arg Pro Thr Ala Phe Ser Gly Cys
Leu Asn Cys Ser Lys Val Ser Glu 130 135 140 Leu Thr Glu Arg Leu Lys
Val Leu Glu Ala Lys Met Thr Met Leu Thr 145 150 155 160 Val Ile Glu
Gln Pro Val Pro Pro Thr Pro Ala Thr Pro Glu Asp Pro 165 170 175 Ala
Pro Leu Trp Gly Pro Pro Pro Ala Gln Gly Ser Pro Gly Asp Gly 180 185
190 Gly Leu Gln Asp Gln Val Gly Ala Trp Gly Leu Pro Gly Pro Thr Gly
195 200 205 Pro Lys Gly Asp Ala Gly Ser Arg Gly Pro Met Gly Met Arg
Gly Pro 210 215 220 Pro Gly Pro Gln Gly Pro Pro Gly Ser Pro Gly Arg
Ala Gly Ala Val 225 230 235 240 Gly Thr Pro Gly Glu Arg Gly Pro Pro
Gly Pro Pro Gly Pro Pro Gly 245 250 255 Pro Pro Gly Pro Pro Ala Pro
Val Gly Pro Pro His Ala Arg Ile Ser 260 265 270 Gln His Gly Asp Pro
Leu Leu Ser Asn Thr Phe Thr Glu Thr Asn Asn 275 280 285 His Trp Pro
Gln Gly Pro Thr Gly Pro Pro Gly Pro Pro Gly Pro Met 290 295 300 Gly
Pro Pro Gly Pro Pro Gly Pro Thr Gly Val Pro Gly Ser Pro Gly 305 310
315 320 His Ile Gly Leu Arg Gly Glu Pro Gly Pro Gln Gly Ser Ala Gly
Gln 325 330 335 Arg Gly Glu Pro Gly Pro Lys Gly Asp Pro Gly Glu Lys
Ser His Trp 340 345 350 Gly Glu Gly Leu His Gln Leu Arg Glu Ala Leu
Lys Ile Leu Ala Glu 355 360 365 Arg Val Leu Ile Leu Glu Thr Met Ile
Gly Leu Tyr Glu Pro Glu Leu 370 375 380 Gly Ser Gly Ala Gly Pro Ala
Gly Thr Gly Thr Pro Ser Leu Leu Arg 385 390 395 400 Gly Lys Arg Gly
Gly His Ala Thr Asn Tyr Arg Ile Val Ala Pro Arg 405 410 415 Ser Arg
Asp Glu Arg Gly 420 66212PRTHomo sapiens 66Met Gly Gly Pro Arg Ala
Trp Ala Leu Leu Cys Leu Gly Leu Leu Leu 1 5 10 15 Pro Gly Gly Gly
Ala Ala Trp Ser Ile Gly Ala Ala Pro Phe Ser Gly 20 25 30 Arg Arg
Asn Trp Cys Ser Tyr Val Val Thr Arg Thr Ile Ser Cys His 35 40 45
Val Gln Asn Gly Thr Tyr Leu Gln Arg Val Leu Gln Asn Cys Pro Trp 50
55 60 Pro Met Ser Cys Pro Gly Ser Ser Tyr Arg Thr Val Val Arg Pro
Thr 65 70 75 80 Tyr Lys Val Met Tyr Lys Ile Val Thr Ala Arg Glu Trp
Arg Cys Cys 85 90 95 Pro Gly His Ser Gly Val Ser Cys Glu Glu Val
Ala Ala Ser Ser Ala 100 105 110 Ser Leu Glu Pro Met Trp Ser Gly Ser
Thr Met Arg Arg Met Ala Leu 115 120 125 Arg Pro Thr Ala Phe Ser Gly
Cys Leu Asn Cys Ser Lys Val Ser Glu 130 135 140 Leu Thr Glu Arg Leu
Lys Val Leu Glu Ala Lys Met Thr Met Leu Thr 145 150 155 160 Val Ile
Glu Gln Pro Val Pro Pro Thr Pro Ala Thr Pro Glu Asp Pro 165 170 175
Ala Pro Leu Trp Gly Pro Pro Pro Ala Gln Gly Ser Pro Gly Asp Gly 180
185 190 Gly Leu Gln Gly Asp Pro Leu Leu Ser Asn Thr Phe Thr Glu Thr
Asn 195 200 205 Asn His Trp Pro 210 67175PRTHomo sapiens 67Met Gly
Gly Pro Arg Ala Trp Ala Leu Leu Cys Leu Gly Leu Leu Leu 1 5 10 15
Pro Gly Gly Gly Ala Ala Trp Ser Ile Gly Ala Ala Pro Phe Ser Gly 20
25 30 Arg Arg Asn Trp Cys Ser Tyr Val Val Thr Arg Thr Ile Ser Cys
His 35 40 45 Val Gln Asn Gly Thr Tyr Leu Gln Arg Val Leu Gln Asn
Cys Pro Trp 50 55 60 Pro Met Ser Cys Pro Gly Ser Ser Tyr Arg Thr
Val Val Arg Pro Thr 65 70 75 80 Tyr Lys Val Met Tyr Lys Ile Val Thr
Ala Arg Glu Trp Arg Cys Cys 85 90 95 Pro Gly His Ser Gly Val Ser
Cys Glu Glu Gly Cys Leu Asn Cys Ser 100 105 110 Lys Val Ser Glu Leu
Thr Glu Arg Leu Lys Val Leu Glu Ala Lys Met 115 120 125 Thr Met Leu
Thr Val Ile Glu Gln Pro Val Pro Pro Thr Pro Ala Thr 130 135 140 Pro
Glu Asp Pro Ala Pro Leu Trp Gly Pro Pro Pro Ala Gln Gly Ser 145 150
155 160 Pro Gly Asp Gly Gly Leu Gln Asp Gln Val Gly Ala Trp Gly Leu
165 170 175 68236PRTHomo sapiensmisc_feature(1)..(1)Xaa can be any
naturally occurring amino acid 68Xaa Met Thr Met Leu Thr Val Ile
Glu Gln Pro Val Pro Pro Thr Pro 1 5 10 15 Ala Thr Pro Glu Asp Pro
Ala Pro Leu Trp Gly Pro Pro Pro Ala Gln 20 25 30 Gly Ser Pro Gly
Asp Gly Gly Leu Gln Gly Leu Pro Gly Ala Ile Glu 35 40 45 Ser Val
Arg Val Pro Leu Leu Pro Arg Asn Asp Gln Val Gly Ala Trp 50 55 60
Gly Leu Pro Gly Pro Thr Gly Pro Lys Gly Asp Ala Gly Ser Arg Gly 65
70 75 80 Pro Met Gly Met Arg Gly Pro Pro Gly Pro Gln Gly Pro Pro
Gly Ser 85 90 95 Pro Gly Arg Ala Gly Ala Val Gly Thr Pro Gly Glu
Arg Gly Pro Pro 100 105 110 Gly Pro Pro Gly Pro Pro Gly Pro Pro Gly
Pro Pro Ala Pro Val Gly 115 120 125 Pro Pro His Ala Arg Ile Ser Gln
His Gly Asp Pro Leu Leu Ser Asn 130 135 140 Thr Phe Thr Glu Thr Asn
Asn His Trp Pro Gln Gly Pro Thr Gly Pro 145 150 155 160 Pro Gly Pro
Pro Gly Pro Met Gly Pro Pro Gly Pro Pro Gly Pro Thr 165 170 175 Gly
Val Pro Gly Ser Pro Gly His Ile Gly Pro Pro Gly Pro Thr Gly 180 185
190 Pro Lys Gly Ile Ser Gly His Pro Gly Glu Lys Gly Glu Arg Gly Leu
195 200 205 Arg Gly Glu Pro Gly Pro Gln Gly Ser Ala Gly Gln Arg Gly
Glu Pro 210 215 220 Gly Pro Lys Gly Asp Pro Gly Glu Lys Ser His Trp
225 230 235 69305PRTHomo sapiens 69Met Thr Met Leu Thr Val Ile Glu
Gln Pro Val Pro Pro Thr Pro Ala 1 5 10 15 Thr Pro Glu Asp Pro Ala
Pro Leu Trp Gly Pro Pro Pro Ala Gln Gly 20 25 30 Ser Pro Gly Asp
Gly Gly Leu Gln Gly Leu Pro Gly Ala Ile Glu Ser 35 40 45 Val Arg
Val Pro Leu Leu Pro Arg Asn Asp Gln Val Gly Ala Trp Gly 50 55 60
Leu Pro Gly Pro Thr Gly Pro Lys Gly Asp Ala Gly Ser Arg Gly Pro 65
70 75 80 Met Gly Met Arg Gly Pro Pro Gly Pro Gln Gly Pro Pro Gly
Ser Pro 85 90 95 Gly Arg Ala Gly Ala Val Gly Thr Pro Gly Glu Arg
Gly Pro Pro Gly 100 105 110 Pro Pro Gly Pro Pro Gly Pro Pro Gly Pro
Pro Ala Pro Val Gly Pro 115 120 125 Pro His Ala Arg Ile Ser Gln His
Gly Asp Pro Leu Leu Ser Asn Thr 130 135 140 Phe Thr Glu Thr Asn Asn
His Trp Pro Gln Gly Pro Thr Gly Pro Pro 145 150 155 160 Gly Pro Pro
Gly Pro Met Gly Pro Pro Gly Pro Pro Gly Pro Thr Gly 165 170 175 Val
Pro Gly Ser Pro Gly His Ile Gly Pro Pro Gly Pro Thr Gly Pro 180 185
190 Lys Gly Ile Ser Gly His Pro Gly Glu Lys Gly Glu Arg Gly Leu Arg
195 200 205 Gly Glu Pro Gly Pro Gln Gly Ser Ala Gly Gln Arg Gly Glu
Pro Gly 210 215 220 Pro Lys Gly Asp Pro Gly Glu Lys Ser His Trp Gly
Glu Gly Leu His 225 230 235 240 Gln Leu Arg Glu Ala Leu Lys Ile Leu
Ala Glu Arg Val Leu Ile Leu 245 250 255 Glu Thr Met Ile Gly Leu Tyr
Glu Pro Glu Leu Gly Ser Gly Ala Gly 260 265 270 Pro Ala Gly Thr Gly
Thr Pro Ser Leu Leu Arg Gly Lys Arg Gly Gly 275 280 285 His Ala Thr
Asn Tyr Arg Ile Val Ala Pro Arg Ser Arg Asp Glu Arg 290 295 300 Gly
305 70226PRTHomo sapiens 70Met Lys Ser Ser Leu Met Phe Thr Asp Pro
His Ser Leu Gly Thr Tyr 1 5 10 15 Thr Tyr Gln Ala Leu Ser Trp Ala
Leu Gly Gly Val Arg His Val Pro 20 25 30 Ala Leu Leu Glu Leu Pro
Cys Cys Trp Glu Gln Gly Trp Ala Glu Glu 35 40 45 Lys Gln Gln Cys
Leu Pro His Val Thr Arg Val Ser Met Arg Gly Phe 50 55 60 Gly Gly
Leu Gly Ala Pro Arg Lys Glu Asp Ser Ala Trp Thr Arg Trp 65 70 75 80
Arg Thr Arg Cys Cys Ala His Pro Pro Val Arg Leu Pro Gly Ser Leu 85
90 95 Gly Leu Trp Thr Pro Gly Pro Ser Leu Met Pro Thr Ala Pro Gly
Cys 100 105 110 Leu Val Leu Ser Leu Lys Ala Thr Leu Gly Leu Leu Ala
Ser Cys Ile 115 120 125 Pro Thr Asn Pro Cys Asp Ser Ile Ala Gly Pro
Gln Gly Pro Pro Gly 130 135 140 Ser Pro Gly Arg Ala Gly Ala Val Gly
Thr Pro Gly Glu Arg Gly Pro 145 150 155 160 Pro Gly Pro Pro Gly Pro
Pro Gly Pro Pro Gly Pro Pro Ala Pro Val 165 170 175 Gly Pro Pro His
Ala Arg Ile Ser Gln His Gly Glu Ser Pro Trp Asp 180 185 190 Pro Ser
Arg Trp Arg Trp Gly Trp Ser Ser His Gln His Ser Ala Arg 195 200 205
Tyr His Leu Pro Arg Ala Phe Cys Val Pro Ala Leu Leu Thr Ile Gly 210
215 220 His Met 225 712031DNAHomo sapiens 71gggctccgcg cgtccggggc
ggctggcggc gcgggcaggc aggcggggag gacaggctgg 60gggcggcgac cgcgaggggc
cgcgcgcgga gggcgcctgg tgcagcatgg gcggcccgcg 120ggcttgggcg
ctgctctgcc tcgggctcct gctcccggga ggcggcgctg cgtggagcat
180cggggcagct ccgttctccg gacgcaggaa ctggtgctcc tatgtggtga
cccgcaccat 240ctcatgccat gtgcagaatg gcacctacct tcagcgagtg
ctgcagaact gcccctggcc 300catgagctgt ccggggagca gaactgtggt
gagacccaca tacaaggtga tgtacaagat 360agtgaccgcc ccttcctctg
cctccttgga gcccatgtgg tcgggcagta ccatgcggcg 420gatggcgctt
cggcccacag ccttctcagg ttgtctcaac tgcagcaaag tgtcagagct
480gacagagcgg ctgaaggtgc tggaggccaa gatgaccatg ctgactgtca
tagagcagcc 540agtacctcca acaccagcta cccctgagga ccctgccccg
ctctggggtc cccctcctgc 600ccagggcagc cccggagatg gaggcctcca
ggaccaagtc ggtgcttggg ggcttcccgg 660gcccaccggc cccaagggag
atgccggcag tcggggccca atggggatga gaggcccacc 720aggtccacag
ggccccccag ggagccctgg ccgggctgga gctgtgggca cccctggaga
780gaggggacct cctgggccac cagggcctcc tggcccccct gggcccccag
cccctgttgg 840gccaccccat gcccggatct cccagcatgg agacccattg
ctgtccaaca ccttcactga 900gaccaacaac cactggcccc agggacccac
tgggcctcca ggccctccag ggcccatggg 960tccccctggg cctcctggcc
ccacaggtgt ccctgggagt cctggtcaca taggaccccc 1020aggccccact
ggacccaaag gaatctctgg ccacccagga gagaagggcg agagaggact
1080gcgtggggag cctggccccc aaggctctgc tgggcagcgg ggggaacctg
gccctaaggg 1140agaccctggt gagaagagcc actgggggga ggggttgcac
cagctacgcg aggctttgaa 1200gattttagct gagagggttt taatcttgga
aacaatgatt gggctctatg aaccagagct 1260ggggtctggg gcgggccctg
ccggcacagg cacccccagc ctccttcggg gcaagagggg 1320cggacatgca
accaactacc ggatcgtggc ccccaggagc cgggacgaga gaggctgagg
1380gtggtggcgg cccctgaggc agaccaggcc aggcttcccc tcctacctgg
actcggccag 1440ctgcctccag ggaccgcccg tccatattta ttaatgtcct
cagggtccct tctgccatct 1500aggccttagg ggtaagcagg tctcagtcct
ggcaccatgc acatgtctga ggctgagcaa 1560gggctgagag gagaggcttg
ggcctcagtt tccctctgtg aagtgggggg aggcaggcct 1620tcaaggaggg
atagaggtac aaggcttcgt ctcatctgct gtctgagcat ccaggcccaa
1680aggcactgag ggagtcagga gctggggctc ggcacatgca gagatgacag
ggcagggggc 1740agtcttcctc cccctccccg accaaacctc ggggagccct
cctgtgcccc tccctccttg 1800ttgtccagtg ctgggctccc caccccgagg
tcaggctgcc caatcctctg actggatcac 1860cgggggcttc ttgcctcagt
tcttccctct gagcccccag gccctcccgc atctcaggtt 1920ggggatgggg
acatggagag gaaggggccg cctactcctg caaatgcttg tgacagatgc
1980caggaggtag atgtgtgctg gccaataaag gcccctacct gattccccgc a
203172769DNAHomo sapiens 72cgccctccgg ccgcggagct ggaaaccggg
ctccgcgcgt ccggggcggc tggcggcgcg 60ggcaggcagg cggggaggac aggctggggg
cggcgaccgc gaggggccgc gcgcggaggg 120cgcctggtgc agcatgggcg
gcccgcgggc ttgggcgctg ctctgcctcg ggctcctgct 180cccgggaggc
ggcgctgcgt ggagcatcgg ggcagctccg ttctccggac gcaggaactg
240gtgctcctat gtggtgaccc gcaccatctc atgccatgtg cagaatggca
cctaccttca 300gcgagtgctg cagaactgcc cctggcccat gagctgtccg
gggagcagct acagaactgt 360ggtgagaccc acatacaagg tgatgtacaa
gatagtgacc gcccgtgagt ggaggtgctg 420ccctgggcac tcaggagtga
gctgcgagga agttgcagct tcctctgcct ccttggagcc 480catgtggtcg
ggcagtacca tgcggcggat ggcgcttcgg cccacagcct tctcaggttg
540tctcaactgc agcaaagtgt cagagctgac agagcggctg aaggtgctgg
aggccaagat 600gaccatgctg actgtcatag agcagccagt acctccaaca
ccagctaccc ctgaggaccc 660tgccccgctc tggggtcccc ctcctgccca
gggcagcccc ggagatggag gcctccaggg 720agacccattg ctgtccaaca
ccttcactga gaccaacaac cactggccc 76973641DNAHomo sapiens
73gctggaaacc gggctccgcg cgtccggggc ggctggcggc gcgggcaggc aggcggggag
60gacaggctgg gggcggcgac cgcgaggggc cgcgcgcgga gggcgcctgg tgcagcatgg
120gcggcccgcg ggcttgggcg ctgctctgcc tcgggctcct gctcccggga
ggcggcgctg 180cgtggagcat cggggcagct ccgttctccg gacgcaggaa
ctggtgctcc tatgtggtga 240cccgcaccat ctcatgccat gtgcagaatg
gcacctacct tcagcgagtg ctgcagaact 300gcccctggcc catgagctgt
ccggggagca gctacagaac tgtggtgaga cccacataca 360aggtgatgta
caagatagtg accgcccgtg agtggaggtg ctgccctggg cactcaggag
420tgagctgcga ggaaggttgt ctcaactgca gcaaagtgtc agagctgaca
gagcggctga 480aggtgctgga ggccaagatg accatgctga ctgtcataga
gcagccagta cctccaacac 540cagctacccc tgaggaccct gccccgctct
ggggtccccc tcctgcccag ggcagccccg 600gagatggagg cctccaggac
caagtcggtg cttgggggct t
64174482DNAHomo sapiens 74cggcgcgggc aggcaggcgg ggaggacagg
ctgggggcgg cgaccgcgag gggccgcgcg 60cggagggcgc ctggtgcagc atgggcggcc
cgcgggcttg ggcgctgctc tgcctcgggc 120tcctgctccc gggaggcggc
gctgcgtgga gcatcggggc agctccgttc tccggacgca 180gatgaccatg
ctgactgtca tagagcagcc agtacctcca acaccagcta cccctgagga
240ccctgccccg ctctggggtc cccctcctgc ccagggcagc cccggagatg
gaggcctcca 300ggaccaagtc ggtgcttggg ggcttcccgg gcccaccggc
cccaagggag atgccggcag 360tcggggccca atggggatga gaggcccacc
aggtccacag ggccccccag ggagccctgg 420ccgggctgga gctgtgggca
cccctggaga gaggggacct cctgggccac cagggcctcc 480tg 482752066DNAHomo
sapiens 75cgggcaggca ggcggggagg acaggctggg ggcggcgacc gcgaggggcc
gcgcgcggag 60ggcgcctggt gcagcatggg cggcccgcgg gcttgggcgc tgctctgcct
cgggctcctg 120ctcccgggag gcggcgctgc gtggagcatc ggggcagctc
cgttctccgg acgcaggaac 180tggtgctcct atgtggtgac ccgcaccatc
tcatgccatg tgcagaatgg cacctacctt 240cagcgagtgc tgcagaactg
cccctggccc atgagctgtc cggggagcag ctacagaact 300gtggtgagac
ccacatacaa ggtgatgtac aagatagtga ccgcccgtga gtggaggtgc
360tgccctgggc actcaggagt gagctgcgag gaagttgcag cttcctctgc
ctccttggag 420cccatgtggt cgggcagtac catgcggcgg atggcgcttc
ggcccacagc cttctcaggt 480tgtctcaact gcagcaaagt gtcagagctg
acagagcggc tgaaggtgct ggaggccaag 540atgaccatgc tgactgtcat
agagcagcca gtacctccaa caccagctac ccctgaggac 600cctgccccgc
tctggggtcc ccctcctgcc cagggcagcc ccggagatgg aggcctccag
660gaccaagtcg gtgcttgggg gcttcccggg cccaccggcc ccaagggaga
tgccggcagt 720cggggcccaa tggggatgag aggcccacca ggtccacagg
gccccccagg gagccctggc 780cgggctggag ctgtgggcac ccctggagag
aggggacctc ctgggccacc agggcctcct 840ggcccccctg ggcccccagc
ccctgttggg ccaccccatg cccggatctc ccagcatgga 900gacccattgc
tgtccaacac cttcactgag accaacaacc actggcccca gggacccact
960gggcctccag gccctccagg gcccatgggt ccccctgggc ctcctggccc
cacaggtgtc 1020cctgggagtc ctggtcacat aggaccccca ggccccactg
gacccaaagg aatctctggc 1080cacccaggag agaagggcga gagaggactg
cgtggggagc ctggccccca aggctctgct 1140gggcagcggg gggaacctgg
ccctaaggga gaccctggtg agaagagcca ctgggctcct 1200agcttacaga
gcttcctgca gcagcaggct cagctggagc tcctggccag acgggtcacc
1260ctcctggaag ccatcatctg gccagaacca gagctggggt ctggggcggg
ccctgccggc 1320acaggcaccc ccagcctcct tcggggcaag aggggcggac
atgcaaccaa ctaccggatc 1380gtggccccca ggagccggga cgagagaggc
tgagggtggt ggcggcccct gaggcagacc 1440aggccaggct tcccctccta
cctggactcg gccagctgcc tccagggacc gcccgtccat 1500atttattaat
gtcctcaggg tcccttctgc catctaggcc ttaggggtaa gcaggtctca
1560gtcctggcac catgcacatg tctgaggctg agcaagggct gagaggagag
gcttgggcct 1620cagtttccct ctgtgaagtg gggggaggca ggccttcaag
gagggataga ggtacaaggc 1680ttcgtctcat ctgctgtctg agcatccagg
cccaaaggca ctgagggagt caggagctgg 1740ggctcggcac atgcagagat
gacagggcag ggggcagtct tcctccccct ccccgaccaa 1800acctcgggga
gccctcctgt gcccctccct ccttgttgtc cagtgctggg ctccccaccc
1860cgaggtcagg ctgcccaatc ctctgactgg atcaccgggg gcttcttgcc
tcagttcttc 1920cctctgagcc cccaggccct cccgcatctc aggttgggga
tggggacatg gagaggaagg 1980ggccgcctac tcctgcaaat gcttgtgaca
gatgccagga ggtagatgtg tgctggccaa 2040taaaggcccc tacctgattc cccgca
2066762060DNAHomo sapiens 76cgggcaggca ggcggggagg acaggctggg
ggcggcgacc gcgaggggcc gcgcgcggag 60ggcgcctggt gcagcatggg cggcccgcgg
gcttgggcgc tgctctgcct cgggctcctg 120ctcccgggag gcggcgctgc
gtggagcatc ggggcagctc cgttctccgg acgcaggaac 180tggtgctcct
atgtggtgac ccgcaccatc tcatgccatg tgcagaatgg cacctacctt
240cagcgagtgc tgcagaactg cccctggccc atgagctgtc cggggagcag
ctacagaact 300gtggtgagac ccacatacaa ggtgatgtac aagatagtga
ccgcccgtga gtggaggtgc 360tgccctgggc actcaggagt gagctgcgag
gaagttgcag cttcctctgc ctccttggag 420cccatgtggt cgggcagtac
catgcggcgg atggcgcttc ggcccacagc cttctcaggt 480tgtctcaact
gcagcaaagt gtcagagctg acagagcggc tgaaggtgct ggaggccaag
540atgaccatgc tgactgtcat agagcagcca gtacctccaa caccagctac
ccctgaggac 600cctgccccgc tctggggtcc ccctcctgcc cagggcagcc
ccggagatgg aggcctccag 660gaccaagtcg gtgcttgggg gcttcccggg
cccaccggcc ccaagggaga tgccggcagt 720cggggcccaa tggggatgag
aggcccacca ggtccacagg gccccccagg gagccctggc 780cgggctggag
ctgtgggcac ccctggagag aggggacctc ctgggccacc agggcctcct
840ggcccccctg ggcccccagc ccctgttggg ccaccccatg cccggatctc
ccagcatgga 900gacccattgc tgtccaacac cttcactgag accaacaacc
actggcccca gggacccact 960gggcctccag gccctccagg gcccatgggt
ccccctgggc ctcctggccc cacaggtgtc 1020cctgggagtc ctggtcacat
aggaccccca ggccccactg gacccaaagg aatctctggc 1080cacccaggag
agaagggcga gagaggactg cgtggggagc ctggccccca aggctctgct
1140gggcagcggg gggaacctgg ccctaaggga gaccctggtg agaagagcca
ctggggggag 1200gggttgcacc agctacgcga ggctttgaag attttagctg
agagggtttt aatcttggaa 1260acaatgattg ggctctatga accagagctg
gggtctgggg cgggccctgc cggcacaggc 1320acccccagcc tccttcgggg
caagaggggc ggacatgcaa ccaactaccg gatcgtggcc 1380cccaggagcc
gggacgagag aggctgaggg tggtggcggc ccctgaggca gaccaggcca
1440ggcttcccct cctacctgga ctcggccagc tgcctccagg gaccgcccgt
ccatatttat 1500taatgtcctc agggtccctt ctgccatcta ggccttaggg
gtaagcaggt ctcagtcctg 1560gcaccatgca catgtctgag gctgagcaag
ggctgagagg agaggcttgg gcctcagttt 1620ccctctgtga agtgggggga
ggcaggcctt caaggaggga tagaggtaca aggcttcgtc 1680tcatctgctg
tctgagcatc caggcccaaa ggcactgagg gagtcaggag ctggggctcg
1740gcacatgcag agatgacagg gcagggggca gtcttcctcc ccctccccga
ccaaacctcg 1800gggagccctc ctgtgcccct ccctccttgt tgtccagtgc
tgggctcccc accccgaggt 1860caggctgccc aatcctctga ctggatcacc
gggggcttct tgcctcagtt cttccctctg 1920agcccccagg ccctcccgca
tctcaggttg gggatgggga catggagagg aaggggccgc 1980ctactcctgc
aaatgcttgt gacagatgcc aggaggtaga tgtgtgctgg ccaataaagg
2040cccctacctg attccccgca 2060772054DNAHomo sapiens 77cgggcaggca
ggcggggagg acaggctggg ggcggcgacc gcgaggggcc gcgcgcggag 60ggcgcctggt
gcagcatggg cggcccgcgg gcttgggcgc tgctctgcct cgggctcctg
120ctcccgggag gcggcgctgc gtggagcatc ggggcagctc cgttctccgg
acgcaggaac 180tggtgctcct atgtggtgac ccgcaccatc tcatgccatg
tgcagaatgg cacctacctt 240cagcgagtgc tgcagaactg cccctggccc
atgagctgtc cggggagcag ctacagaact 300gtggtgagac ccacatacaa
ggtgatgtac aagatagtga ccgcccgtga gtggaggtgc 360tgccctgggc
actcaggagt gagctgcgag gaagcttcct ctgcctcctt ggagcccatg
420tggtcgggca gtaccatgcg gcggatggcg cttcggccca cagccttctc
aggttgtctc 480aactgcagca aagtgtcaga gctgacagag cggctgaagg
tgctggaggc caagatgacc 540atgctgactg tcatagagca gccagtacct
ccaacaccag ctacccctga ggaccctgcc 600ccgctctggg gtccccctcc
tgcccagggc agccccggag atggaggcct ccaggaccaa 660gtcggtgctt
gggggcttcc cgggcccacc ggccccaagg gagatgccgg cagtcggggc
720ccaatgggga tgagaggccc accaggtcca cagggccccc cagggagccc
tggccgggct 780ggagctgtgg gcacccctgg agagagggga cctcctgggc
caccagggcc tcctggcccc 840cctgggcccc cagcccctgt tgggccaccc
catgcccgga tctcccagca tggagaccca 900ttgctgtcca acaccttcac
tgagaccaac aaccactggc cccagggacc cactgggcct 960ccaggccctc
cagggcccat gggtccccct gggcctcctg gccccacagg tgtccctggg
1020agtcctggtc acataggacc cccaggcccc actggaccca aaggaatctc
tggccaccca 1080ggagagaagg gcgagagagg actgcgtggg gagcctggcc
cccaaggctc tgctgggcag 1140cggggggaac ctggccctaa gggagaccct
ggtgagaaga gccactgggg ggaggggttg 1200caccagctac gcgaggcttt
gaagatttta gctgagaggg ttttaatctt ggaaacaatg 1260attgggctct
atgaaccaga gctggggtct ggggcgggcc ctgccggcac aggcaccccc
1320agcctccttc ggggcaagag gggcggacat gcaaccaact accggatcgt
ggcccccagg 1380agccgggacg agagaggctg agggtggtgg cggcccctga
ggcagaccag gccaggcttc 1440ccctcctacc tggactcggc cagctgcctc
cagggaccgc ccgtccatat ttattaatgt 1500cctcagggtc ccttctgcca
tctaggcctt aggggtaagc aggtctcagt cctggcacca 1560tgcacatgtc
tgaggctgag caagggctga gaggagaggc ttgggcctca gtttccctct
1620gtgaagtggg gggaggcagg ccttcaagga gggatagagg tacaaggctt
cgtctcatct 1680gctgtctgag catccaggcc caaaggcact gagggagtca
ggagctgggg ctcggcacat 1740gcagagatga cagggcaggg ggcagtcttc
ctccccctcc ccgaccaaac ctcggggagc 1800cctcctgtgc ccctccctcc
ttgttgtcca gtgctgggct ccccaccccg aggtcaggct 1860gcccaatcct
ctgactggat caccgggggc ttcttgcctc agttcttccc tctgagcccc
1920caggccctcc cgcatctcag gttggggatg gggacatgga gaggaagggg
ccgcctactc 1980ctgcaaatgc ttgtgacaga tgccaggagg tagatgtgtg
ctggccaata aaggccccta 2040cctgattccc cgca 2054782048DNAHomo sapiens
78cgggcaggca ggcggggagg acaggctggg ggcggcgacc gcgaggggcc gcgcgcggag
60ggcgcctggt gcagcatggg cggcccgcgg gcttgggcgc tgctctgcct cgggctcctg
120ctcccgggag gcggcgctgc gtggagcatc ggggcagctc cgttctccgg
acgcaggaac 180tggtgctcct atgtggtgac ccgcaccatc tcatgccatg
tgcagaatgg cacctacctt 240cagcgagtgc tgcagaactg cccctggccc
atgagctgtc cggggagcag aactgtggtg 300agacccacat acaaggtgat
gtacaagata gtgaccgccc gtgagtggag gtgctgccct 360gggcactcag
gagtgagctg cgaggaagct tcctctgcct ccttggagcc catgtggtcg
420ggcagtacca tgcggcggat ggcgcttcgg cccacagcct tctcaggttg
tctcaactgc 480agcaaagtgt cagagctgac agagcggctg aaggtgctgg
aggccaagat gaccatgctg 540actgtcatag agcagccagt acctccaaca
ccagctaccc ctgaggaccc tgccccgctc 600tggggtcccc ctcctgccca
gggcagcccc ggagatggag gcctccagga ccaagtcggt 660gcttgggggc
ttcccgggcc caccggcccc aagggagatg ccggcagtcg gggcccaatg
720gggatgagag gcccaccagg tccacagggc cccccaggga gccctggccg
ggctggagct 780gtgggcaccc ctggagagag gggacctcct gggccaccag
ggcctcctgg cccccctggg 840cccccagccc ctgttgggcc accccatgcc
cggatctccc agcatggaga cccattgctg 900tccaacacct tcactgagac
caacaaccac tggccccagg gacccactgg gcctccaggc 960cctccagggc
ccatgggtcc ccctgggcct cctggcccca caggtgtccc tgggagtcct
1020ggtcacatag gacccccagg ccccactgga cccaaaggaa tctctggcca
cccaggagag 1080aagggcgaga gaggactgcg tggggagcct ggcccccaag
gctctgctgg gcagcggggg 1140gaacctggcc ctaagggaga ccctggtgag
aagagccact ggggggaggg gttgcaccag 1200ctacgcgagg ctttgaagat
tttagctgag agggttttaa tcttggaaac aatgattggg 1260ctctatgaac
cagagctggg gtctggggcg ggccctgccg gcacaggcac ccccagcctc
1320cttcggggca agaggggcgg acatgcaacc aactaccgga tcgtggcccc
caggagccgg 1380gacgagagag gctgagggtg gtggcggccc ctgaggcaga
ccaggccagg cttcccctcc 1440tacctggact cggccagctg cctccaggga
ccgcccgtcc atatttatta atgtcctcag 1500ggtcccttct gccatctagg
ccttaggggt aagcaggtct cagtcctggc accatgcaca 1560tgtctgaggc
tgagcaaggg ctgagaggag aggcttgggc ctcagtttcc ctctgtgaag
1620tggggggagg caggccttca aggagggata gaggtacaag gcttcgtctc
atctgctgtc 1680tgagcatcca ggcccaaagg cactgaggga gtcaggagct
ggggctcggc acatgcagag 1740atgacagggc agggggcagt cttcctcccc
ctccccgacc aaacctcggg gagccctcct 1800gtgcccctcc ctccttgttg
tccagtgctg ggctccccac cccgaggtca ggctgcccaa 1860tcctctgact
ggatcaccgg gggcttcttg cctcagttct tccctctgag cccccaggcc
1920ctcccgcatc tcaggttggg gatggggaca tggagaggaa ggggccgcct
actcctgcaa 1980atgcttgtga cagatgccag gaggtagatg tgtgctggcc
aataaaggcc cctacctgat 2040tccccgca 2048791997DNAHomo sapiens
79cgggcaggca ggcggggagg acaggctggg ggcggcgacc gcgaggggcc gcgcgcggag
60ggcgcctggt gcagcatggg cggcccgcgg gcttgggcgc tgctctgcct cgggctcctg
120ctcccgggag gcggcgctgc gtggagcatc ggggcagctc cgttctccgg
acgcaggaac 180tggtgctcct atgtggtgac ccgcaccatc tcatgccatg
tgcagaatgg cacctacctt 240cagcgagtgc tgcagaactg cccctggccc
atgagctgtc cggggagcag ctacagaact 300gtggtgagac ccacatacaa
ggtgatgtac aagatagtga ccgcccgtga gtggaggtgc 360tgccctgggc
actcaggagt gagctgcgag gaagttgcag cttcctctgc ctccttggag
420cccatgtggt cgggcagtac catgcggcgg atggcgcttc ggcccacagc
cttctcaggt 480tgtctcaact gcagcaaagt gtcagagctg acagagcggc
tgaaggtgct ggaggccaag 540atgaccatgc tgactgtcat agagcagcca
gtacctccaa caccagctac ccctgaggac 600cctgccccgc tctggggtcc
ccctcctgcc cagggcagcc ccggagatgg aggcctccag 660gaccaagtcg
gtgcttgggg gcttcccggg cccaccggcc ccaagggaga tgccggcagt
720cggggcccaa tggggatgag aggcccacca ggtccacagg gccccccagg
gagccctggc 780cgggctggag ctgtgggcac ccctggagag aggggacctc
ctgggccacc agggcctcct 840ggcccccctg ggcccccagc ccctgttggg
ccaccccatg cccggatctc ccagcatgga 900gacccattgc tgtccaacac
cttcactgag accaacaacc actggcccca gggacccact 960gggcctccag
gccctccagg gcccatgggt ccccctgggc ctcctggccc cacaggtgtc
1020cctgggagtc ctggtcacat aggactgcgt ggggagcctg gcccccaagg
ctctgctggg 1080cagcgggggg aacctggccc taagggagac cctggtgaga
agagccactg gggggagggg 1140ttgcaccagc tacgcgaggc tttgaagatt
ttagctgaga gggttttaat cttggaaaca 1200atgattgggc tctatgaacc
agagctgggg tctggggcgg gccctgccgg cacaggcacc 1260cccagcctcc
ttcggggcaa gaggggcgga catgcaacca actaccggat cgtggccccc
1320aggagccggg acgagagagg ctgagggtgg tggcggcccc tgaggcagac
caggccaggc 1380ttcccctcct acctggactc ggccagctgc ctccagggac
cgcccgtcca tatttattaa 1440tgtcctcagg gtcccttctg ccatctaggc
cttaggggta agcaggtctc agtcctggca 1500ccatgcacat gtctgaggct
gagcaagggc tgagaggaga ggcttgggcc tcagtttccc 1560tctgtgaagt
ggggggaggc aggccttcaa ggagggatag aggtacaagg cttcgtctca
1620tctgctgtct gagcatccag gcccaaaggc actgagggag tcaggagctg
gggctcggca 1680catgcagaga tgacagggca gggggcagtc ttcctccccc
tccccgacca aacctcgggg 1740agccctcctg tgcccctccc tccttgttgt
ccagtgctgg gctccccacc ccgaggtcag 1800gctgcccaat cctctgactg
gatcaccggg ggcttcttgc ctcagttctt ccctctgagc 1860ccccaggccc
tcccgcatct caggttgggg atggggacat ggagaggaag gggccgccta
1920ctcctgcaaa tgcttgtgac agatgccagg aggtagatgt gtgctggcca
ataaaggccc 1980ctacctgatt ccccgca 1997801823DNAHomo sapiens
80cgggcaggca ggcggggagg acaggctggg ggcggcgacc gcgaggggcc gcgcgcggag
60ggcgcctggt gcagcatggg cggcccgcgg gcttgggcgc tgctctgcct cgggctcctg
120ctcccgggag gcggcgctgc gtggagcatc ggggcagctc cgttctccgg
acgcaggaac 180tggtgctcct atgtggtgac ccgcaccatc tcatgccatg
tgcagaatgg cacctacctt 240cagcgagtgc tgcagaactg cccctggccc
atgagctgtc cggggagcag ctacagaact 300gtggtgagac ccacatacaa
ggtgatgtac aagatagtga ccgcccgtga gtggaggtgc 360tgccctgggc
actcaggagt gagctgcgag gaagttgcag cttcctctgc ctccttggag
420cccatgtggt cgggcagtac catgcggcgg atggcgcttc ggcccacagc
cttctcaggt 480tgtctcaact gcagcaaagt gtcagagctg acagagcggc
tgaaggtgct ggaggccaag 540atgaccatgc tgactgtcat agagcagcca
gtacctccaa caccagctac ccctgaggac 600cctgccccgc tctggggtcc
ccctcctgcc cagggcagcc ccggagatgg aggcctccag 660ggagacccat
tgctgtccaa caccttcact gagaccaaca accactggcc ccagggaccc
720actgggcctc caggccctcc agggcccatg ggtccccctg ggcctcctgg
ccccacaggt 780gtccctggga gtcctggtca cataggaccc ccaggcccca
ctggacccaa aggaatctct 840ggccacccag gagagaaggg cgagagagga
ctgcgtgggg agcctggccc ccaaggctct 900gctgggcagc ggggggaacc
tggccctaag ggagaccctg gtgagaagag ccactggggg 960gaggggttgc
accagctacg cgaggctttg aagattttag ctgagagggt tttaatcttg
1020gaaacaatga ttgggctcta tgaaccagag ctggggtctg gggcgggccc
tgccggcaca 1080ggcaccccca gcctccttcg gggcaagagg ggcggacatg
caaccaacta ccggatcgtg 1140gcccccagga gccgggacga gagaggctga
gggtggtggc ggcccctgag gcagaccagg 1200ccaggcttcc cctcctacct
ggactcggcc agctgcctcc agggaccgcc cgtccatatt 1260tattaatgtc
ctcagggtcc cttctgccat ctaggcctta ggggtaagca ggtctcagtc
1320ctggcaccat gcacatgtct gaggctgagc aagggctgag aggagaggct
tgggcctcag 1380tttccctctg tgaagtgggg ggaggcaggc cttcaaggag
ggatagaggt acaaggcttc 1440gtctcatctg ctgtctgagc atccaggccc
aaaggcactg agggagtcag gagctggggc 1500tcggcacatg cagagatgac
agggcagggg gcagtcttcc tccccctccc cgaccaaacc 1560tcggggagcc
ctcctgtgcc cctccctcct tgttgtccag tgctgggctc cccaccccga
1620ggtcaggctg cccaatcctc tgactggatc accgggggct tcttgcctca
gttcttccct 1680ctgagccccc aggccctccc gcatctcagg ttggggatgg
ggacatggag aggaaggggc 1740cgcctactcc tgcaaatgct tgtgacagat
gccaggaggt agatgtgtgc tggccaataa 1800aggcccctac ctgattcccc gca
1823811976DNAHomo sapiens 81cgggcaggca ggcggggagg acaggctggg
ggcggcgacc gcgaggggcc gcgcgcggag 60ggcgcctggt gcagcatggg cggcccgcgg
gcttgggcgc tgctctgcct cgggctcctg 120ctcccgggag gcggcgctgc
gtggagcatc ggggcagctc cgttctccgg acgcaggaac 180tggtgctcct
atgtggtgac ccgcaccatc tcatgccatg tgcagaatgg cacctacctt
240cagcgagtgc tgcagaactg cccctggccc atgagctgtc cggggagcag
ctacagaact 300gtggtgagac ccacatacaa ggtgatgtac aagatagtga
ccgcccgtga gtggaggtgc 360tgccctgggc actcaggagt gagctgcgag
gaaggttgtc tcaactgcag caaagtgtca 420gagctgacag agcggctgaa
ggtgctggag gccaagatga ccatgctgac tgtcatagag 480cagccagtac
ctccaacacc agctacccct gaggaccctg ccccgctctg gggtccccct
540cctgcccagg gcagccccgg agatggaggc ctccaggacc aagtcggtgc
ttgggggctt 600cccgggccca ccggccccaa gggagatgcc ggcagtcggg
gcccaatggg gatgagaggc 660ccaccaggtc cacagggccc cccagggagc
cctggccggg ctggagctgt gggcacccct 720ggagagaggg gacctcctgg
gccaccaggg cctcctggcc cccctgggcc cccagcccct 780gttgggccac
cccatgcccg gatctcccag catggagacc cattgctgtc caacaccttc
840actgagacca acaaccactg gccccaggga cccactgggc ctccaggccc
tccagggccc 900atgggtcccc ctgggcctcc tggccccaca ggtgtccctg
ggagtcctgg tcacatagga 960cccccaggcc ccactggacc caaaggaatc
tctggccacc caggagagaa gggcgagaga 1020ggactgcgtg gggagcctgg
cccccaaggc tctgctgggc agcgggggga acctggccct 1080aagggagacc
ctggtgagaa gagccactgg ggggaggggt tgcaccagct acgcgaggct
1140ttgaagattt tagctgagag ggttttaatc ttggaaacaa tgattgggct
ctatgaacca 1200gagctggggt ctggggcggg ccctgccggc acaggcaccc
ccagcctcct tcggggcaag 1260aggggcggac atgcaaccaa ctaccggatc
gtggccccca ggagccggga cgagagaggc 1320tgagggtggt ggcggcccct
gaggcagacc aggccaggct tcccctccta cctggactcg 1380gccagctgcc
tccagggacc gcccgtccat atttattaat gtcctcaggg tcccttctgc
1440catctaggcc ttaggggtaa gcaggtctca gtcctggcac catgcacatg
tctgaggctg 1500agcaagggct gagaggagag gcttgggcct cagtttccct
ctgtgaagtg gggggaggca 1560ggccttcaag gagggataga ggtacaaggc
ttcgtctcat ctgctgtctg agcatccagg 1620cccaaaggca ctgagggagt
caggagctgg ggctcggcac atgcagagat gacagggcag 1680ggggcagtct
tcctccccct ccccgaccaa acctcgggga gccctcctgt gcccctccct
1740ccttgttgtc cagtgctggg ctccccaccc cgaggtcagg ctgcccaatc
ctctgactgg 1800atcaccgggg gcttcttgcc tcagttcttc cctctgagcc
cccaggccct cccgcatctc 1860aggttgggga tggggacatg gagaggaagg
ggccgcctac tcctgcaaat gcttgtgaca 1920gatgccagga ggtagatgtg
tgctggccaa taaaggcccc tacctgattc cccgca 1976822111DNAHomo sapiens
82cgggcaggca ggcggggagg acaggctggg ggcggcgacc gcgaggggcc
gcgcgcggag
60ggcgcctggt gcagcatggg cggcccgcgg gcttgggcgc tgctctgcct cgggctcctg
120ctcccgggag gcggcgctgc gtggagcatc ggggcagctc cgttctccgg
acgcaggaac 180tggtgctcct atgtggtgac ccgcaccatc tcatgccatg
tgcagaatgg cacctacctt 240cagcgagtgc tgcagaactg cccctggccc
atgagctgtc cggggagcag ctacagaact 300gtggtgagac ccacatacaa
ggtgatgtac aagatagtga ccgcccgtga gtggaggtgc 360tgccctgggc
actcaggagt gagctgcgag gaagttgcag cttcctctgc ctccttggag
420cccatgtggt cgggcagtac catgcggcgg atggcgcttc ggcccacagc
cttctcaggt 480tgtctcaact gcagcaaagt gtcagagctg acagagcggc
tgaaggtgct ggaggccaag 540atgaccatgc tgactgtcat agagcagcca
gtacctccaa caccagctac ccctgaggac 600cctgccccgc tctggggtcc
ccctcctgcc cagggcagcc ccggagatgg aggcctccag 660gggctgccag
gagccataga gagtgtgagg gtcccgctgc ttccccgaaa tgaccaagtc
720ggtgcttggg ggcttcccgg gcccaccggc cccaagggag atgccggcag
tcggggccca 780atggggatga gaggcccacc aggtccacag ggccccccag
ggagccctgg ccgggctgga 840gctgtgggca cccctggaga gaggggacct
cctgggccac cagggcctcc tggcccccct 900gggcccccag cccctgttgg
gccaccccat gcccggatct cccagcatgg agacccattg 960ctgtccaaca
ccttcactga gaccaacaac cactggcccc agggacccac tgggcctcca
1020ggccctccag ggcccatggg tccccctggg cctcctggcc ccacaggtgt
ccctgggagt 1080cctggtcaca taggaccccc aggccccact ggacccaaag
gaatctctgg ccacccagga 1140gagaagggcg agagaggact gcgtggggag
cctggccccc aaggctctgc tgggcagcgg 1200ggggaacctg gccctaaggg
agaccctggt gagaagagcc actgggggga ggggttgcac 1260cagctacgcg
aggctttgaa gattttagct gagagggttt taatcttgga aacaatgatt
1320gggctctatg aaccagagct ggggtctggg gcgggccctg ccggcacagg
cacccccagc 1380ctccttcggg gcaagagggg cggacatgca accaactacc
ggatcgtggc ccccaggagc 1440cgggacgaga gaggctgagg gtggtggcgg
cccctgaggc agaccaggcc aggcttcccc 1500tcctacctgg actcggccag
ctgcctccag ggaccgcccg tccatattta ttaatgtcct 1560cagggtccct
tctgccatct aggccttagg ggtaagcagg tctcagtcct ggcaccatgc
1620acatgtctga ggctgagcaa gggctgagag gagaggcttg ggcctcagtt
tccctctgtg 1680aagtgggggg aggcaggcct tcaaggaggg atagaggtac
aaggcttcgt ctcatctgct 1740gtctgagcat ccaggcccaa aggcactgag
ggagtcagga gctggggctc ggcacatgca 1800gagatgacag ggcagggggc
agtcttcctc cccctccccg accaaacctc ggggagccct 1860cctgtgcccc
tccctccttg ttgtccagtg ctgggctccc caccccgagg tcaggctgcc
1920caatcctctg actggatcac cgggggcttc ttgcctcagt tcttccctct
gagcccccag 1980gccctcccgc atctcaggtt ggggatgggg acatggagag
gaaggggccg cctactcctg 2040caaatgcttg tgacagatgc caggaggtag
atgtgtgctg gccaataaag gcccctacct 2100gattccccgc a 211183707DNAHomo
sapiens 83agatgaccat gctgactgtc atagagcagc cagtacctcc aacaccagct
acccctgagg 60accctgcccc gctctggggt ccccctcctg cccagggcag ccccggagat
ggaggcctcc 120aggggctgcc aggagccata gagagtgtga gggtcccgct
gcttccccga aatgaccaag 180tcggtgcttg ggggcttccc gggcccaccg
gccccaaggg agatgccggc agtcggggcc 240caatggggat gagaggccca
ccaggtccac agggcccccc agggagccct ggccgggctg 300gagctgtggg
cacccctgga gagaggggac ctcctgggcc accagggcct cctggccccc
360ctgggccccc agcccctgtt gggccacccc atgcccggat ctcccagcat
ggagacccat 420tgctgtccaa caccttcact gagaccaaca accactggcc
ccagggaccc actgggcctc 480caggccctcc agggcccatg ggtccccctg
ggcctcctgg ccccacaggt gtccctggga 540gtcctggtca cataggaccc
ccaggcccca ctggacccaa aggaatctct ggccacccag 600gagagaaggg
cgagagagga ctgcgtgggg agcctggccc ccaaggctct gctgggcagc
660ggggggaacc tggccctaag ggagaccctg gtgagaagag ccactgg
707842034DNAHomo sapiens 84ggtgagtgcc cgcaatgctg ccccacagct
cctctggcca tcccctccac caggtgggcc 60cttccctgct cctgacatgg ccaggatgac
ctgggccctt tcatctactt gcctcttcac 120tcagcacccc accacggagt
gccctgccca cgcctgggct ccatgaagtc ctctcttatg 180ttcactgacc
cacattccct gggcacctac acttatcagg ctctgagctg ggcactgggt
240ggggtcagac atgtccctgc ccttctggag cttccatgct gctgggagca
gggctgggca 300gaggagaagc agcaatgctt gccccatgtg accagggttt
ctatgagggg ttttgggggt 360ttgggagccc caaggaagga agactcagcc
tggacgaggt ggagaactag gtgctgtgct 420catccccctg ttagactacc
aggcagccta ggcctgtgga ctccggggcc ctctctcatg 480cccactgctc
caggctgcct tgtcctgtcg ctcaaggcca ccctgggcct ccttgcctcc
540tgtataccca caaatccgtg tgattccatt gcaggtccac agggcccccc
agggagccct 600ggccgggctg gagctgtggg cacccctgga gagaggggac
ctcctgggcc accagggcct 660cctggccccc ctgggccccc agcccctgtt
gggccacccc atgcccggat ctcccagcat 720ggtgagtccc cctgggatcc
cagcaggtgg aggtgggggt ggagtagcca tcagcacagt 780gcccgctacc
atctgccacg tgccttctgt gtgccagccc tgctcacgat aggccacatg
840tgacccagtc ctccagcagg cgccgttgtc ctcctgtggt tacaggtgag
gaacactgag 900gaccagagag ggaaggtggc ttgccagggt cccacagcct
gggcgtaggg gaacggcttc 960aaacccaggc tgcctccaga acctgtgctt
agagccaccg ggcatcaggc cctcccaagc 1020cttggaactg gctggaatcc
agttctcgga acactgggac gcaaaagacc cggcggcagg 1080aagtgagtcc
tgaactccca aggccacagg cccggcccct cctccaggcc ctgacgtgcg
1140tccttggctt cttccctttg gcagcccagc ctgacctgcc catgggctgc
caggggtcag 1200agtgtggagc gccaggtttc agcctcttct ccactgtgtt
tttggtgcac aacccagcac 1260accattcatt cattctgcca tcccagcatt
cattccatct cactatccat acgatgggga 1320caatgacagt gccagcctcc
cagagctgcg taacatccat gtacagaagc ctggcacaca 1380gtaggtggtg
gataaatggt atcttttatt gtcattccca tttgacaggt gacagtacag
1440gctctgaaaa gtagaaagtg ttgctggatg tcaccagctg gattgcagtg
gggttagaac 1500ccacatctcc ctgcctcctg gtcttgcggg accaacactc
tccacactcc tcaccctgga 1560gcaggtgccc aggtggtacc agccatgctg
caggctgccc catagggcag tccaagctgt 1620cttggcagag gtggcaggtg
aagactaacc accccactct acccagctct actcactcat 1680catctttgct
cacccaggag acccattgct gtccaacacc ttcactgaga ccaacaacca
1740ctggccccag ggacccactg ggcctccagg ccctccaggg cccatgggtc
cccctgggcc 1800tcctggcccc acaggtgtcc ctgggagtcc tggtcacata
gtgagtagtt ctccttgtac 1860tctcacccat gtgtctgtcc atctttccat
ctatgcatac atccatacat ctgtccatca 1920tccacccttg tatccatcta
tccatccatc cattcatcct tccattcatt cattcaacaa 1980gtatttattg
agcacttaat atgcaaacta ccttccataa atcttattca atcc 203485409PRTHomo
sapiens 85Met Glu Glu Ser Trp Glu Ala Ala Pro Gly Gly Gln Ala Gly
Ala Glu 1 5 10 15 Leu Pro Met Glu Pro Val Gly Ser Leu Val Pro Thr
Leu Glu Gln Pro 20 25 30 Gln Val Pro Ala Lys Val Arg Gln Pro Glu
Gly Pro Glu Ser Ser Pro 35 40 45 Ser Pro Ala Gly Ala Val Glu Lys
Ala Ala Gly Ala Gly Leu Glu Pro 50 55 60 Ser Ser Lys Lys Lys Pro
Pro Ser Pro Arg Pro Gly Ser Pro Arg Val 65 70 75 80 Pro Pro Leu Ser
Leu Gly Tyr Gly Val Cys Pro Glu Pro Pro Ser Pro 85 90 95 Gly Pro
Ala Leu Val Lys Leu Pro Arg Asn Gly Glu Ala Pro Gly Ala 100 105 110
Glu Pro Ala Pro Ser Ala Trp Ala Pro Met Glu Leu Gln Val Asp Val 115
120 125 Arg Val Lys Pro Val Gly Ala Ala Gly Gly Ser Ser Thr Pro Ser
Pro 130 135 140 Arg Pro Ser Thr Arg Phe Leu Lys Val Pro Val Pro Glu
Ser Pro Ala 145 150 155 160 Phe Ser Arg His Ala Asp Pro Ala His Gln
Leu Leu Leu Arg Ala Pro 165 170 175 Ser Gln Gly Gly Thr Trp Gly Arg
Arg Ser Pro Leu Ala Ala Ala Arg 180 185 190 Thr Glu Ser Gly Cys Asp
Ala Glu Gly Arg Ala Ser Pro Ala Glu Gly 195 200 205 Ser Ala Gly Ser
Pro Gly Ser Pro Thr Cys Cys Arg Cys Lys Glu Leu 210 215 220 Gly Leu
Glu Lys Glu Asp Ala Ala Leu Leu Pro Arg Ala Gly Leu Asp 225 230 235
240 Gly Asp Glu Lys Leu Pro Arg Ala Val Thr Leu Thr Gly Leu Pro Met
245 250 255 Tyr Val Lys Ser Leu Tyr Trp Ala Leu Ala Phe Met Ala Val
Leu Leu 260 265 270 Ala Val Ser Gly Val Val Ile Val Val Leu Ala Ser
Arg Ala Gly Ala 275 280 285 Arg Cys Gln Gln Cys Pro Pro Gly Trp Val
Leu Ser Glu Glu His Cys 290 295 300 Tyr Tyr Phe Ser Ala Glu Ala Gln
Ala Trp Glu Ala Ser Gln Ala Phe 305 310 315 320 Cys Ser Ala Tyr His
Ala Thr Leu Pro Leu Leu Ser His Thr Gln Asp 325 330 335 Phe Leu Gly
Arg Tyr Pro Val Ser Arg His Ser Trp Val Gly Ala Trp 340 345 350 Arg
Gly Pro Gln Gly Trp His Trp Ile Asp Glu Ala Pro Leu Pro Pro 355 360
365 Gln Leu Leu Pro Glu Asp Gly Glu Asp Asn Leu Asp Ile Asn Cys Gly
370 375 380 Ala Leu Glu Glu Gly Thr Leu Val Ala Ala Asn Cys Ser Thr
Pro Arg 385 390 395 400 Pro Trp Val Cys Ala Lys Gly Thr Gln 405
86314PRTHomo sapiens 86Met Glu Glu Ser Trp Glu Ala Ala Pro Gly Gly
Gln Ala Gly Ala Glu 1 5 10 15 Leu Pro Met Glu Pro Val Gly Ser Leu
Val Pro Thr Leu Glu Gln Pro 20 25 30 Gln Val Pro Ala Lys Val Arg
Gln Pro Glu Gly Pro Glu Ser Ser Pro 35 40 45 Ser Pro Ala Gly Ala
Val Glu Lys Ala Ala Gly Ala Gly Leu Glu Pro 50 55 60 Ser Ser Lys
Lys Lys Pro Pro Ser Pro Arg Pro Gly Ser Pro Arg Val 65 70 75 80 Pro
Pro Leu Ser Leu Gly Tyr Gly Val Cys Pro Glu Pro Pro Ser Pro 85 90
95 Gly Pro Ala Leu Val Lys Leu Pro Arg Asn Gly Glu Ala Pro Gly Ala
100 105 110 Glu Pro Ala Pro Ser Ala Trp Ala Pro Met Glu Leu Gln Val
Asp Val 115 120 125 Arg Val Lys Pro Val Gly Ala Ala Gly Gly Ser Ser
Thr Pro Ser Pro 130 135 140 Arg Pro Ser Thr Arg Phe Leu Lys Val Pro
Val Pro Glu Ser Pro Ala 145 150 155 160 Phe Ser Arg His Ala Asp Pro
Ala His Gln Leu Leu Leu Arg Ala Pro 165 170 175 Ser Gln Gly Gly Thr
Trp Gly Arg Arg Ser Pro Leu Ala Ala Ala Arg 180 185 190 Thr Glu Ser
Gly Cys Asp Ala Glu Gly Arg Ala Ser Pro Ala Glu Gly 195 200 205 Ser
Ala Gly Ser Pro Gly Ser Pro Thr Cys Cys Arg Cys Lys Glu Leu 210 215
220 Gly Leu Glu Lys Glu Asp Ala Ala Leu Leu Pro Arg Ala Gly Leu Asp
225 230 235 240 Gly Asp Glu Lys Leu Pro Arg Ala Val Thr Leu Thr Asp
Ser Leu Arg 245 250 255 Thr Ala Arg Thr Ile Trp Ile Ser Thr Val Gly
Pro Trp Arg Lys Ala 260 265 270 Arg Trp Trp Leu Gln Thr Ala Ala Leu
Gln Asp Pro Gly Ser Val Pro 275 280 285 Arg Gly Pro Ser Asp Leu Gly
Ser Ala Trp Ser Ser Ala Cys Gln Ala 290 295 300 Asp Ala Ala Pro Pro
Thr Gly Glu Ala Ser 305 310 871544DNAHomo sapiens 87gagagcgaag
ctcctctgca ctgggcccag gtgcgctcct cagcgtctcc gggtggcggg 60gcgcgcggga
tggaggagtc ttgggaggct gcgcccggag gccaagccgg ggcagagctc
120ccaatggagc ccgtgggaag cctggtcccc acgctggagc agccgcaggt
gcccgcgaag 180gtgcgacaac ctgaaggtcc cgaaagcagc ccaagtccgg
ccggggccgt ggagaaggcg 240gcgggcgcag gcctggagcc ctcgagcaag
aaaaagccgc cttcgcctcg ccccgggtcc 300ccgcgcgtgc cgccgctcag
cctgggctac ggggtctgcc ccgagccgcc gtcaccgggc 360cctgccttgg
tcaagctgcc ccggaatggc gaggcgcccg gggctgagcc tgcgcccagc
420gcctgggcgc ccatggagct gcaggtagat gtgcgcgtga agcccgtggg
cgcggccggt 480ggcagcagca cgccatcgcc caggccctcc acgcgcttcc
tcaaggtgcc ggtgcccgag 540tcccctgcct tctcccgcca cgcggacccg
gcgcaccagc tcctgctgcg cgcaccatcc 600cagggcggca cgtggggccg
ccgctcgccg ctggctgcag cccggacgga gagcggctgc 660gacgcagagg
gccgggccag ccccgcggaa ggaagcgccg gctccccggg ctcccccacg
720tgctgccgct gcaaggagct ggggctggag aaggaggatg cggcgctgtt
gccccgcgcg 780gggttggacg gcgacgagaa gctgccccgg gccgtaacgc
ttacggggct acccatgtac 840gtgaagtccc tgtactgggc cctggcgttc
atggctgtgc tcctggcagt ctctggggtt 900gtcattgtgg tcctggcctc
aagagcagga gccagatgcc agcagtgccc cccaggctgg 960gtgttgtccg
aggagcactg ttactacttc tctgcagaag cgcaggcctg ggaagccagc
1020caggctttct gctcagccta ccacgctacc ctccccctgc taagccacac
ccaggacttc 1080ctgggcagat acccagtctc caggcactcc tgggtggggg
cctggcgagg cccccagggc 1140tggcactgga tcgacgaggc cccactcccg
ccccagctac tccctgagga cggcgaggac 1200aatctggata tcaactgtgg
ggccctggag gaaggcacgc tggtggctgc aaactgcagc 1260actccaagac
cctgggtctg tgccaagggg acccagtgat ctgggctctg cctggtcctc
1320agcctgccag gcagatgcag caccccctac aggggaggcc agttgagagc
ttgggcagcc 1380tcttcctgga cccagttatc caggtcttca tgctctgctc
aagggggcca catgagcgag 1440cctaggagct ggacttcaac ccaggaagat
gcatccgagg gaaaggagat tttctatggc 1500ctcaggcctg agtgccaata
ttagtctcca gcttctgtgg atga 1544881192DNAHomo sapiens 88gagagcgaag
ctcctctgca ctgggcccag gtgcgctcct cagcgtctcc gggtggcggg 60gcgcgcggga
tggaggagtc ttgggaggct gcgcccggag gccaagccgg ggcagagctc
120ccaatggagc ccgtgggaag cctggtcccc acgctggagc agccgcaggt
gcccgcgaag 180gtgcgacaac ctgaaggtcc cgaaagcagc ccaagtccgg
ccggggccgt ggagaaggcg 240gcgggcgcag gcctggagcc ctcgagcaag
aaaaagccgc cttcgcctcg ccccgggtcc 300ccgcgcgtgc cgccgctcag
cctgggctac ggggtctgcc ccgagccgcc gtcaccgggc 360cctgccttgg
tcaagctgcc ccggaatggc gaggcgcccg gggctgagcc tgcgcccagc
420gcctgggcgc ccatggagct gcaggtagat gtgcgcgtga agcccgtggg
cgcggccggt 480ggcagcagca cgccatcgcc caggccctcc acgcgcttcc
tcaaggtgcc ggtgcccgag 540tcccctgcct tctcccgcca cgcggacccg
gcgcaccagc tcctgctgcg cgcaccatcc 600cagggcggca cgtggggccg
ccgctcgccg ctggctgcag cccggacgga gagcggctgc 660gacgcagagg
gccgggccag ccccgcggaa ggaagcgccg gctccccggg ctcccccacg
720tgctgccgct gcaaggagct ggggctggag aaggaggatg cggcgctgtt
gccccgcgcg 780gggttggacg gcgacgagaa gctgccccgg gccgtaacgc
ttacggactc cctgaggacg 840gcgaggacaa tctggatatc aactgtgggg
ccctggagga aggcacgctg gtggctgcaa 900actgcagcac tccaagaccc
tgggtctgtg ccaaggggac ccagtgatct gggctctgcc 960tggtcctcag
cctgccaggc agatgcagca ccccctacag gggaggccag ttgagagctt
1020gggcagcctc ttcctggacc cagttatcca ggtcttcatg ctctgctcaa
gggggccaca 1080tgagcgagcc taggagctgg acttcaaccc aggaagatgc
atccgaggga aaggagattt 1140tctatggcct caggcctgag tgccaatatt
agtctccagc ttctgtggat ga 119289518PRTHomo sapiens 89Met Leu Ala Ala
Ala Ser Lys Tyr Arg His Gly Asn Met Val Phe Phe 1 5 10 15 Asp Val
Leu Gly Leu Phe Val Ile Ala Tyr Pro Ser Arg Ile Gly Ser 20 25 30
Ile Ile Asn Tyr Met Val Val Met Gly Val Val Leu Tyr Leu Gly Lys 35
40 45 Lys Phe Leu Gln Pro Lys His Lys Thr Gly Asn Tyr Lys Lys Asp
Phe 50 55 60 Leu Cys Gly Leu Gly Ile Thr Leu Ile Ser Trp Phe Thr
Ser Leu Val 65 70 75 80 Thr Val Leu Ile Ile Ala Val Phe Ile Ser Leu
Ile Gly Gln Ser Leu 85 90 95 Ser Trp Tyr Asn His Phe Tyr Val Ser
Val Cys Leu Tyr Gly Thr Ala 100 105 110 Thr Val Ala Lys Ile Ile Leu
Ile His Thr Leu Ala Lys Arg Phe Tyr 115 120 125 Tyr Met Asn Ala Ser
Ala Gln Tyr Leu Gly Glu Val Phe Phe Asp Ile 130 135 140 Ser Leu Phe
Val His Cys Cys Phe Leu Val Thr Leu Thr Tyr Gln Gly 145 150 155 160
Leu Cys Ser Ala Phe Ile Ser Ala Val Trp Val Ala Phe Pro Leu Leu 165
170 175 Thr Lys Leu Cys Val His Lys Asp Phe Lys Gln His Gly Ala Gln
Gly 180 185 190 Lys Phe Ile Ala Phe Tyr Leu Leu Gly Met Phe Ile Pro
Tyr Leu Tyr 195 200 205 Ala Leu Tyr Leu Ile Trp Ala Val Phe Glu Met
Phe Thr Pro Ile Leu 210 215 220 Gly Arg Ser Gly Ser Glu Ile Pro Pro
Asp Val Val Leu Ala Ser Ile 225 230 235 240 Leu Ala Gly Cys Thr Met
Ile Leu Ser Ser Tyr Phe Ile Asn Phe Ile 245 250 255 Tyr Leu Ala Lys
Ser Thr Lys Lys Thr Met Leu Thr Leu Thr Leu Val 260 265 270 Cys Ala
Ile Thr Phe Leu Leu Val Cys Ser Gly Thr Phe Phe Pro Tyr 275 280 285
Ser Ser Asn Pro Ala Asn Pro Lys Pro Lys Arg Val Phe Leu Gln His 290
295 300 Met Thr Arg Thr Phe His Asp Leu Glu Gly Asn Ala Val Lys Arg
Asp 305 310 315 320 Ser Gly Ile Trp Ile Asn Gly Phe Asp Tyr Thr Gly
Ile Ser His Ile 325 330 335 Thr Pro His Ile Pro Glu Ile Asn Asp Ser
Ile Arg Ala His Cys Glu 340 345 350 Glu Asn Ala Pro Leu Cys Gly Phe
Pro Trp Tyr Leu Pro Val His Phe 355 360 365 Leu Ile Arg Lys Asn Trp
Tyr Leu Pro Ala Pro Glu Val Ser Pro Arg 370 375 380 Asn Pro Pro His
Phe Arg Leu Ile Ser Lys Glu Gln Thr Pro Trp Asp 385 390
395 400 Ser Ile Lys Leu Thr Phe Glu Ala Thr Gly Pro Ser His Met Ser
Phe 405 410 415 Tyr Val Arg Ala His Lys Gly Ser Thr Leu Ser Gln Trp
Ser Leu Gly 420 425 430 Asn Gly Thr Pro Val Thr Ser Lys Gly Gly Asp
Tyr Phe Val Phe Tyr 435 440 445 Ser His Gly Leu Gln Ala Ser Ala Trp
Gln Phe Trp Ile Glu Val Gln 450 455 460 Val Ser Glu Glu His Pro Glu
Gly Met Val Thr Val Ala Ile Ala Ala 465 470 475 480 His Tyr Leu Ser
Gly Glu Asp Lys Arg Ser Pro Gln Leu Asp Ala Leu 485 490 495 Lys Glu
Lys Phe Pro Asp Trp Thr Phe Pro Ser Ala Trp Val Cys Thr 500 505 510
Tyr Asp Leu Phe Val Phe 515 90904PRTHomo sapiens 90Met Glu Trp Gly
Ser Glu Ser Ala Ala Val Arg Arg His Arg Val Gly 1 5 10 15 Val Glu
Arg Arg Glu Gly Ala Ala Ala Ala Pro Pro Pro Glu Arg Glu 20 25 30
Ala Arg Ala Gln Glu Pro Leu Val Asp Gly Cys Ser Gly Gly Gly Arg 35
40 45 Thr Arg Lys Arg Ser Pro Gly Gly Ser Gly Gly Ala Ser Arg Gly
Ala 50 55 60 Gly Thr Gly Leu Ser Glu Val Arg Ala Ala Leu Gly Leu
Ala Leu Tyr 65 70 75 80 Leu Ile Ala Leu Arg Thr Leu Val Gln Leu Ser
Leu Gln Gln Leu Val 85 90 95 Leu Arg Gly Ala Ala Gly His Arg Gly
Glu Phe Asp Ala Leu Gln Ala 100 105 110 Arg Asp Tyr Leu Glu His Ile
Thr Ser Ile Gly Pro Arg Thr Thr Gly 115 120 125 Ser Pro Glu Asn Glu
Ile Leu Thr Val His Tyr Leu Leu Glu Gln Ile 130 135 140 Lys Leu Ile
Glu Val Gln Ser Asn Ser Leu His Lys Ile Ser Val Asp 145 150 155 160
Val Gln Arg Pro Thr Gly Ser Phe Ser Ile Asp Phe Leu Gly Gly Phe 165
170 175 Thr Ser Tyr Tyr Asp Asn Ile Thr Asn Val Val Val Lys Leu Glu
Pro 180 185 190 Arg Asp Gly Ala Gln His Ala Val Leu Ala Asn Cys His
Phe Asp Ser 195 200 205 Val Ala Asn Ser Pro Gly Ala Ser Asp Asp Ala
Val Ser Cys Ser Val 210 215 220 Met Leu Glu Val Leu Arg Val Leu Ser
Thr Ser Ser Glu Ala Leu His 225 230 235 240 His Ala Val Ile Phe Leu
Phe Asn Gly Ala Glu Glu Asn Val Leu Gln 245 250 255 Ala Ser His Gly
Phe Ile Thr Gln His Pro Trp Ala Ser Leu Ile Arg 260 265 270 Ala Phe
Ile Asn Leu Glu Ala Ala Gly Val Gly Gly Lys Glu Leu Val 275 280 285
Phe Gln Thr Gly Pro Glu Asn Pro Trp Leu Val Gln Ala Tyr Val Ser 290
295 300 Ala Ala Lys His Pro Phe Ala Ser Val Val Ala Gln Glu Val Phe
Gln 305 310 315 320 Ser Gly Ile Ile Pro Ser Asp Thr Asp Phe Arg Ile
Tyr Arg Asp Phe 325 330 335 Gly Asn Ile Pro Gly Ile Asp Leu Ala Phe
Ile Glu Asn Gly Tyr Ile 340 345 350 Tyr His Thr Lys Tyr Asp Thr Ala
Asp Arg Ile Leu Thr Asp Ser Ile 355 360 365 Gln Arg Ala Gly Asp Asn
Ile Leu Ala Val Leu Lys His Leu Ala Thr 370 375 380 Ser Asp Met Leu
Ala Ala Ala Ser Lys Tyr Arg His Gly Asn Met Val 385 390 395 400 Phe
Phe Asp Val Leu Gly Leu Phe Val Ile Ala Tyr Pro Ser Arg Ile 405 410
415 Gly Ser Ile Ile Asn Tyr Met Val Val Met Gly Val Val Leu Tyr Leu
420 425 430 Gly Lys Lys Phe Leu Gln Pro Lys His Lys Thr Gly Asn Tyr
Lys Lys 435 440 445 Asp Phe Leu Cys Gly Leu Gly Ile Thr Leu Ile Ser
Trp Phe Thr Ser 450 455 460 Leu Val Thr Val Leu Ile Ile Ala Val Phe
Ile Ser Leu Ile Gly Gln 465 470 475 480 Ser Leu Ser Trp Tyr Asn His
Phe Tyr Val Ser Val Cys Leu Tyr Gly 485 490 495 Thr Ala Thr Val Ala
Lys Ile Ile Leu Ile His Thr Leu Ala Lys Arg 500 505 510 Phe Tyr Tyr
Met Asn Ala Ser Ala Gln Tyr Leu Gly Glu Val Phe Phe 515 520 525 Asp
Ile Ser Leu Phe Val His Cys Cys Phe Leu Val Thr Leu Thr Tyr 530 535
540 Gln Gly Leu Cys Ser Ala Phe Ile Ser Ala Val Trp Val Ala Phe Pro
545 550 555 560 Leu Leu Thr Lys Leu Cys Val His Lys Asp Phe Lys Gln
His Gly Ala 565 570 575 Gln Gly Lys Phe Ile Ala Phe Tyr Leu Leu Gly
Met Phe Ile Pro Tyr 580 585 590 Leu Tyr Ala Leu Tyr Leu Ile Trp Ala
Val Phe Glu Met Phe Thr Pro 595 600 605 Ile Leu Gly Arg Ser Gly Ser
Glu Ile Pro Pro Asp Val Val Leu Ala 610 615 620 Ser Ile Leu Ala Gly
Cys Thr Met Ile Leu Ser Ser Tyr Phe Ile Asn 625 630 635 640 Phe Ile
Tyr Leu Ala Lys Ser Thr Lys Lys Thr Met Leu Thr Leu Thr 645 650 655
Leu Val Cys Ala Ile Thr Phe Leu Leu Val Cys Ser Gly Thr Phe Phe 660
665 670 Pro Tyr Ser Ser Asn Pro Ala Asn Pro Lys Pro Lys Arg Val Phe
Leu 675 680 685 Gln His Met Thr Arg Thr Phe His Asp Leu Glu Gly Asn
Ala Val Lys 690 695 700 Arg Asp Ser Gly Ile Trp Ile Asn Gly Phe Asp
Tyr Thr Gly Ile Ser 705 710 715 720 His Ile Thr Pro His Ile Pro Glu
Ile Asn Asp Ser Ile Arg Ala His 725 730 735 Cys Glu Glu Asn Ala Pro
Leu Cys Gly Phe Pro Trp Tyr Leu Pro Val 740 745 750 His Phe Leu Ile
Arg Lys Asn Trp Tyr Leu Pro Ala Pro Glu Val Ser 755 760 765 Pro Arg
Asn Pro Pro His Phe Arg Leu Ile Ser Lys Glu Gln Thr Pro 770 775 780
Trp Asp Ser Ile Lys Leu Thr Phe Glu Ala Thr Gly Pro Ser His Met 785
790 795 800 Ser Phe Tyr Val Arg Ala His Lys Gly Ser Thr Leu Ser Gln
Trp Ser 805 810 815 Leu Gly Asn Gly Thr Pro Val Thr Ser Lys Gly Gly
Asp Tyr Phe Val 820 825 830 Phe Tyr Ser His Gly Leu Gln Ala Ser Ala
Trp Gln Phe Trp Ile Glu 835 840 845 Val Gln Val Ser Glu Glu His Pro
Glu Gly Met Val Thr Val Ala Ile 850 855 860 Ala Ala His Tyr Leu Ser
Gly Glu Asp Lys Arg Ser Pro Gln Leu Asp 865 870 875 880 Ala Leu Lys
Glu Lys Phe Pro Asp Trp Thr Phe Pro Ser Ala Trp Val 885 890 895 Cys
Thr Tyr Asp Leu Phe Val Phe 900 91419PRTHomo sapiens 91Met Val Val
Met Gly Val Val Leu Tyr Leu Gly Lys Lys Phe Leu Gln 1 5 10 15 Pro
Lys His Lys Thr Gly Asn Tyr Lys Lys Asp Phe Leu Cys Gly Leu 20 25
30 Gly Ile Thr Leu Ile Ser Trp Phe Thr Ser Leu Val Thr Val Leu Ile
35 40 45 Ile Ala Val Phe Ile Ser Leu Ile Gly Gln Ser Leu Ser Trp
Tyr Asn 50 55 60 His Phe Tyr Val Ser Val Cys Leu Tyr Gly Thr Ala
Thr Val Ala Lys 65 70 75 80 Ile Ile Leu Ile His Thr Leu Ala Lys Arg
Phe Tyr Tyr Met Asn Ala 85 90 95 Ser Ala Gln Tyr Leu Gly Glu Val
Phe Phe Asp Ile Ser Leu Phe Val 100 105 110 His Cys Cys Phe Leu Val
Thr Leu Thr Tyr Gln Gly Leu Cys Ser Ala 115 120 125 Phe Ile Ser Ala
Val Trp Val Ala Phe Pro Leu Leu Thr Lys Leu Cys 130 135 140 Val His
Lys Asp Phe Lys Gln His Gly Ala Gln Gly Lys Phe Ile Ala 145 150 155
160 Phe Tyr Leu Leu Gly Met Phe Ile Pro Tyr Leu Tyr Ala Leu Tyr Leu
165 170 175 Ile Trp Ala Val Phe Glu Met Phe Thr Pro Ile Leu Gly Arg
Ser Gly 180 185 190 Ser Glu Ile Pro Pro Asp Val Val Leu Ala Ser Ile
Leu Ala Gly Cys 195 200 205 Thr Met Ile Leu Ser Ser Tyr Phe Ile Asn
Phe Ile Tyr Leu Ala Lys 210 215 220 Ser Thr Lys Lys Thr Met Leu Thr
Leu Thr Leu Val Cys Ala Ile Thr 225 230 235 240 Phe Leu Leu Val Cys
Ser Gly Thr Phe Phe Pro Tyr Ser Ser Asn Pro 245 250 255 Ala Asn Pro
Lys Pro Lys Arg Val Phe Leu Gln His Met Thr Arg Thr 260 265 270 Phe
His Asp Leu Glu Gly Asn Ala Val Lys Arg Asp Ser Gly Ile Trp 275 280
285 Ile Asn Gly Phe Asp Tyr Thr Gly Ile Ser His Ile Thr Pro His Ile
290 295 300 Pro Glu Ile Asn Asp Ser Ile Arg Ala His Cys Glu Glu Asn
Ala Pro 305 310 315 320 Leu Cys Gly Phe Pro Trp Tyr Leu Pro Val His
Phe Leu Ile Arg Lys 325 330 335 Asn Trp Tyr Leu Pro Ala Pro Glu Val
Ser Pro Arg Asn Pro Pro His 340 345 350 Phe Arg Leu Ile Ser Lys Glu
Gln Thr Pro Trp Asp Ser Ile Lys Leu 355 360 365 Thr Phe Glu Ala Thr
Ala Cys Leu Pro Ile Leu Gln Ile Leu Asp Leu 370 375 380 Pro Ala Ser
Thr Ile Met Thr Lys Pro Tyr Val Leu Leu Cys Ser Ser 385 390 395 400
Pro Gln Arg Val Asn Thr Phe Ser Val Val Ser Trp Gln Trp His Pro 405
410 415 Ser His Lys 924974DNAHomo sapiens 92ggcgcgggga ccgggctgtc
tgaggtgcgc gccgcgctgg ggctcgcgct ctacctgatc 60gcgctgcgga cgctggtgca
gctctcgctg cagcagctcg tgctacgcgg ggccgctgga 120caccgcgggg
agttcgacgc gctccaagcc agggattatc ttgaacacat aacctccatt
180ggccccagga ctacaggaag tccagaaaat gaaattctga ccgtgcacta
ccttttggaa 240cagattaaac tgattgaagt gcaaagcaac agccttcata
agatttcagt agatgtacaa 300cggcccacag gctcttttag cattgatttc
ttgggaggtt ttacaagcta ttatgacaac 360atcaccaatg ttgtggtaaa
gctggaaccc agagatggag cccagcatgc tgtcttggct 420aattgtcatt
ttgactcagt agcaaactca ccaggccagt catggtttca ttactcagca
480cccctgggct agcttgattc gtgcattcat taacctagag gcagcaggtg
taggagggaa 540agaacttgta ttccaaacag gtcctgaaaa tccttggttg
gttcaagctt atgtttcagc 600agctaaacac ccttttgctt ctgtggtggc
tcaggaggtt tttcagagtg gaatcattcc 660ttcagatact gactttcgta
tctacaggga ttttgggaac attccaggaa tagacttagc 720ttttattgag
aatggataca tttatcacac caagtatgac acagcggaca gaattctaac
780agattccatt cagagagcag gtgacaacat tttagcagtt cttaagcatc
tagctacatc 840tgatatgctg gctgctgctt ctaagtatcg acatggaaac
atggtcttct ttgatgtgct 900gggcctgttt gtcattgcct acccctctcg
tattggctca atcataaact acatggtggt 960aatgggtgtt gttttgtacc
tgggcaaaaa atttttgcag cccaaacata agactggtaa 1020ctacaagaag
gacttcttgt gtggacttgg catcactttg atcagctggt tcactagcct
1080tgttaccgtt ctcattatag cagtgttcat ctctcttatt ggacagtctc
tctcatggta 1140taaccacttc tatgtctccg tttgtctgta tggaactgca
actgtagcca aaataatact 1200tatacatact cttgcgaaaa gattttatta
catgaatgcc agtgcccagt atctgggaga 1260agtatttttt gacatttcgc
tgtttgtcca ttgctgtttt cttgttaccc tcacttacca 1320aggactttgc
tcggcgttta ttagtgctgt ctgggtagca ttcccattgc tcacaaagct
1380ctgtgtgcat aaggacttca agcagcatgg tgcccaagga aaatttattg
ctttttacct 1440tttggggatg tttattcctt atctttatgc attgtacctc
atctgggcag tatttgagat 1500gtttacccct atcctcggga gaagtggttc
tgaaatccca cctgatgttg tgctggcatc 1560cattttggct ggctgtacaa
tgattctctc gtcctatttt attaacttca tctaccttgc 1620caagagcaca
aaaaaaacca tgctaacttt aactttggta tgtgcaatta cattcctcct
1680tgtttgcagt ggaacatttt ttccatatag ctccaatcct gctaatccga
agccaaagag 1740agtgtttctt cagcatatga ctagaacatt ccatgacttg
gaaggaaatg cagttaaacg 1800ggactctgga atatggatca atgggtttga
ttatactgga atttctcaca taacccctca 1860cattcctgag atcaatgata
gtatccgagc tcactgtgag gagaatgcac ctctttgtgg 1920ttttccttgg
tatcttccag tgcactttct gatcaggaaa aactggtatc ttcctgcccc
1980agaagtttct ccaagaaatc ctcctcattt ccgactcata tccaaagaac
agacaccttg 2040ggattctata aaattgactt ttgaagcaac aggaccaagc
catatgtcct tctatgttcg 2100agcccacaaa gggtcaacac tttctcagtg
gtctcttggc aatggcaccc cagtcacaag 2160taaaggagga gactactttg
tcttttactc ccatggactc caggcctctg catggcagtt 2220ctggatagaa
gtgcaggttt cagaagaaca tcctgaagga atggtcaccg tggccattgc
2280tgcccactat ctgtctgggg aagacaagag atcccctcaa ctggatgctc
tgaaggaaaa 2340gttcccagat tggacatttc cctctgcctg ggtgtgcacc
tacgatctct ttgtatttta 2400atcttgtgga tgagctctaa gtacatgccc
agtggatact ccatgtgaca tggtttctcc 2460ctatgttacg tggatgtttg
taacgtaagt caatgaattt taatgatcat atgttcaaag 2520agctttctgg
gttaacgctt ttcagggcca agcactataa gggtttagct gtggcgcagt
2580gatgcatggc ctgttgacac ttgaaaatgc cagtcttttg gcacttcagc
acatgtgggt 2640actgccacta cacacacgtc attttatatg accttaagga
caaagccaac aatccacttc 2700aatagctgcc cctttaggat caagaaagat
gtacactgtc agagcattgt taatgagaca 2760aaagttgttt ccaatttaag
ccccaaaacc atttgttgta ttagtggatg gtgggtaaaa 2820tatcattcac
tgaggtaatg attccccttg agaatataac tctgtgtagg tcactggaaa
2880gtgattgcca tagggctggg agagaagcat tgcactcttg aggctgtagc
ctgtgtcaag 2940ctgtttcttc aggcagcctc tcaaatgtgc tttgtctctc
tgtgctgagg cctggaccct 3000gtgctgagct ggtgactcac tgtcctgaca
agtggacaca cagatgcact gctgtgctgc 3060tttcctgagg tggttttcta
tgcctgtttt cctctgaaac atgtctgtta cccctctcca 3120tcttaccaag
ttgaaaaggg gaatatttgg ccacataccc ctctggtttt cgtaggttct
3180tttggttcag aatattgttt gtgccagtac atgaccttaa cttccttcct
cagagcactg 3240agctgccatc tgggctattc tggggtagaa ggaaggctgg
gagtggtggg aattttataa 3300atatttattc tcttttcttt gtttcatagg
agtcttgtgt tatacaaggt tagtccttca 3360tggtataatc ttactgatgc
actgggccta tctttttgtt ttccagccag ttgaatagat 3420tagtttttct
cagtaactta ctatccagca gactggcttt cctgagactt gaggttgtgg
3480cttatactgg aatgagacca ctgtacgtgt aggtggttca gatcctgcgt
aatggcagca 3540tgaggactta aaaggtggtt ttcattttga agatggctat
gtagcttgta aggtgtatca 3600cagcagtacc tctcatggct ttttggttcc
agcagtgagg gcattggtga gatcaatggt 3660aaactgtgca agctttcttt
ttatcattag gaaatgtgaa acgttggaca aattttgagt 3720tttaacaagg
acaaaaagtt gaaagaaaag gcacagttaa caaaaaaggg tggctagatt
3780tatcttgggt gatggaggaa atgagagagg aatgctcttg aaaggtggtc
tgtggatctg 3840tctgaataga aagagcacag taagtatgca ttgccggaga
aaacgtcctt gaagctgctt 3900gtctcatgtg tatgatgtgc tttttaaatc
atgcccctcg ttgcctgcct aatctgtgac 3960tccctaaaaa ctaactgggc
ccatgtagat ggggctgcaa ccagagctga ataacatgtt 4020aggctcacac
atgcatcagc actgcacact ggaatcattg ctcttcctgg actttgtaga
4080aatcagtctc aagtgcttca agagtctggc tcctgctact tttatctgtc
aggtagcaca 4140taaggtttgc agggtttata ttttgtatag aatcacagtt
gtggagaaaa agtaataatt 4200tctcaatgaa ttttaaaaat gggcctattt
tctatccccg tggttcatct gatataatta 4260gtgttccctg tgaattcccc
ccctctatgg gaaggatgcc tttactcttt atcagtaata 4320aattatgact
gttttcatat tgccttaggg ttatttccct gtgtaaacca ttgtcttttg
4380ttttggtttt ctttagcatt atgaagcttt ggtattgtac aaggtcagta
gtaagatgct 4440cactagtctc agggcttgtg taatattctg ggaggtcatt
taaatgccag aaatggtcaa 4500gcaattatac acagtattta tgactctgtt
aagcataccg tttgtctgtc acattagtag 4560attctgagat taaaaaaaat
ttttaaagag tgatcattta aataatttct aaaagggtct 4620tttcaagctc
taacaaagtc actaacaaat gcattatttt ctacagaatt agatgttagt
4680agtacagtac tgcatattca gggaaaaagt gtgaggaatt gatttcaaaa
tagttcgttc 4740ttgtgtttga cctaagaatg attgtcgcat gaagtgtttg
tttttacagt ttagcatata 4800taaacaaaca tgataggatt ccttaagatg
ttaccaccca gggggccaca agccagcctg 4860ctgtctcagg aagctgtaga
aggagtgttt gtcaatttct tgtcactggt ttgctgactt 4920actgaggatt
aattgttgcc ttacaatgtt actgaaataa actgtttaat atac 4974935338DNAHomo
sapiens 93ggccggggct gtcgcgggtt ggggcggttg ggctggcagc tgaggctcgt
ggccatggag 60tggggttctg agtcggctgc tgtgaggcgg caccgcgtcg gagtagagcg
tcgagaggga 120gcggcggccg cgccaccgcc ggagagggag gcccgagcgc
aggagcctct ggtggatggg 180tgcagcggcg gcgggaggac gcggaagagg
agccccgggg gtagcggcgg cgcgagcagg 240ggcgcgggga ccgggctgtc
tgaggtgcgc gccgcgctgg ggctcgcgct ctacctgatc 300gcgctgcgga
cgctggtgca gctctcgctg cagcagctcg tgctacgcgg ggccgctgga
360caccgcgggg agttcgacgc gctccaagcc agggattatc ttgaacacat
aacctccatt 420ggccccagga ctacaggaag tccagaaaat gaaattctga
ccgtgcacta ccttttggaa 480cagattaaac tgattgaagt gcaaagcaac
agccttcata agatttcagt agatgtacaa 540cggcccacag gctcttttag
cattgatttc ttgggaggtt
ttacaagcta ttatgacaac 600atcaccaatg ttgtggtaaa gctggaaccc
agagatggag cccagcatgc tgtcttggct 660aattgtcatt ttgactcagt
agcaaactca ccaggtgcca gtgatgatgc agttagctgc 720tcagtgatgc
tggaagtcct tcgcgtcttg tcaacatctt cagaagcctt gcatcatgct
780gtcatatttc tctttaatgg tgctgaggaa aatgtcttgc aagccagtca
tggtttcatt 840actcagcacc cctgggctag cttgattcgt gcattcatta
acctagaggc agcaggtgta 900ggagggaaag aacttgtatt ccaaacaggt
cctgaaaatc cttggttggt tcaagcttat 960gtttcagcag ctaaacaccc
ttttgcttct gtggtggctc aggaggtttt tcagagtgga 1020atcattcctt
cagatactga ctttcgtatc tacagggatt ttgggaacat tccaggaata
1080gacttagctt ttattgagaa tggatacatt tatcacacca agtatgacac
agcggacaga 1140attctaacag attccattca gagagcaggt gacaacattt
tagcagttct taagcatcta 1200gctacatctg atatgctggc tgctgcttct
aagtatcgac atggaaacat ggtcttcttt 1260gatgtgctgg gcctgtttgt
cattgcctac ccctctcgta ttggctcaat cataaactac 1320atggtggtaa
tgggtgttgt tttgtacctg ggcaaaaaat ttttgcagcc caaacataag
1380actggtaact acaagaagga cttcttgtgt ggacttggca tcactttgat
cagctggttc 1440actagccttg ttaccgttct cattatagca gtgttcatct
ctcttattgg acagtctctc 1500tcatggtata accacttcta tgtctccgtt
tgtctgtatg gaactgcaac tgtagccaaa 1560ataatactta tacatactct
tgcgaaaaga ttttattaca tgaatgccag tgcccagtat 1620ctgggagaag
tattttttga catttcgctg tttgtccatt gctgttttct tgttaccctc
1680acttaccaag gactttgctc ggcgtttatt agtgctgtct gggtagcatt
cccattgctc 1740acaaagctct gtgtgcataa ggacttcaag cagcatggtg
cccaaggaaa atttattgct 1800ttttaccttt tggggatgtt tattccttat
ctttatgcat tgtacctcat ctgggcagta 1860tttgagatgt ttacccctat
cctcgggaga agtggttctg aaatcccacc tgatgttgtg 1920ctggcatcca
ttttggctgg ctgtacaatg attctctcgt cctattttat taacttcatc
1980taccttgcca agagcacaaa aaaaaccatg ctaactttaa ctttggtatg
tgcaattaca 2040ttcctccttg tttgcagtgg aacatttttt ccatatagct
ccaatcctgc taatccgaag 2100ccaaagagag tgtttcttca gcatatgact
agaacattcc atgacttgga aggaaatgca 2160gttaaacggg actctggaat
atggatcaat gggtttgatt atactggaat ttctcacata 2220acccctcaca
ttcctgagat caatgatagt atccgagctc actgtgagga gaatgcacct
2280ctttgtggtt ttccttggta tcttccagtg cactttctga tcaggaaaaa
ctggtatctt 2340cctgccccag aagtttctcc aagaaatcct cctcatttcc
gactcatatc caaagaacag 2400acaccttggg attctataaa attgactttt
gaagcaacag gaccaagcca tatgtccttc 2460tatgttcgag cccacaaagg
gtcaacactt tctcagtggt ctcttggcaa tggcacccca 2520gtcacaagta
aaggaggaga ctactttgtc ttttactccc atggactcca ggcctctgca
2580tggcagttct ggatagaagt gcaggtttca gaagaacatc ctgaaggaat
ggtcaccgtg 2640gccattgctg cccactatct gtctggggaa gacaagagat
cccctcaact ggatgctctg 2700aaggaaaagt tcccagattg gacatttccc
tctgcctggg tgtgcaccta cgatctcttt 2760gtattttaat cttgtggatg
agctctaagt acatgcccag tggatactcc atgtgacatg 2820gtttctccct
atgttacgtg gatgtttgta acgtaagtca atgaatttta atgatcatat
2880gttcaaagag ctttctgggt taacgctttt cagggccaag cactataagg
gtttagctgt 2940ggcgcagtga tgcatggcct gttgacactt gaaaatgcca
gtcttttggc acttcagcac 3000atgtgggtac tgccactaca cacacgtcat
tttatatgac cttaaggaca aagccaacaa 3060tccacttcaa tagctgcccc
tttaggatca agaaagatgt acactgtcag agcattgtta 3120atgagacaaa
agttgtttcc aatttaagcc ccaaaaccat ttgttgtatt agtggatggt
3180gggtaaaata tcattcactg aggtaatgat tccccttgag aatataactc
tgtgtaggtc 3240actggaaagt gattgccata gggctgggag agaagcattg
cactcttgag gctgtagcct 3300gtgtcaagct gtttcttcag gcagcctctc
aaatgtgctt tgtctctctg tgctgaggcc 3360tggaccctgt gctgagctgg
tgactcactg tcctgacaag tggacacaca gatgcactgc 3420tgtgctgctt
tcctgaggtg gttttctatg cctgttttcc tctgaaacat gtctgttacc
3480cctctccatc ttaccaagtt gaaaagggga atatttggcc acatacccct
ctggttttcg 3540taggttcttt tggttcagaa tattgtttgt gccagtacat
gaccttaact tccttcctca 3600gagcactgag ctgccatctg ggctattctg
gggtagaagg aaggctggga gtggtgggaa 3660ttttataaat atttattctc
ttttctttgt ttcataggag tcttgtgtta tacaaggtta 3720gtccttcatg
gtataatctt actgatgcac tgggcctatc tttttgtttt ccagccagtt
3780gaatagatta gtttttctca gtaacttact atccagcaga ctggctttcc
tgagacttga 3840ggttgtggct tatactggaa tgagaccact gtacgtgtag
gtggttcaga tcctgcgtaa 3900tggcagcatg aggacttaaa aggtggtttt
cattttgaag atggctatgt agcttgtaag 3960gtgtatcaca gcagtacctc
tcatggcttt ttggttccag cagtgagggc attggtgaga 4020tcaatggtaa
actgtgcaag ctttcttttt atcattagga aatgtgaaac gttggacaaa
4080ttttgagttt taacaaggac aaaaagttga aagaaaaggc acagttaaca
aaaaagggtg 4140gctagattta tcttgggtga tggaggaaat gagagaggaa
tgctcttgaa aggtggtctg 4200tggatctgtc tgaatagaaa gagcacagta
agtatgcatt gccggagaaa acgtccttga 4260agctgcttgt ctcatgtgta
tgatgtgctt tttaaatcat gcccctcgtt gcctgcctaa 4320tctgtgactc
cctaaaaact aactgggccc atgtagatgg ggctgcaacc agagctgaat
4380aacatgttag gctcacacat gcatcagcac tgcacactgg aatcattgct
cttcctggac 4440tttgtagaaa tcagtctcaa gtgcttcaag agtctggctc
ctgctacttt tatctgtcag 4500gtagcacata aggtttgcag ggtttatatt
ttgtatagaa tcacagttgt ggagaaaaag 4560taataatttc tcaatgaatt
ttaaaaatgg gcctattttc tatccccgtg gttcatctga 4620tataattagt
gttccctgtg aattcccccc ctctatggga aggatgcctt tactctttat
4680cagtaataaa ttatgactgt tttcatattg ccttagggtt atttccctgt
gtaaaccatt 4740gtcttttgtt ttggttttct ttagcattat gaagctttgg
tattgtacaa ggtcagtagt 4800aagatgctca ctagtctcag ggcttgtgta
atattctggg aggtcattta aatgccagaa 4860atggtcaagc aattatacac
agtatttatg actctgttaa gcataccgtt tgtctgtcac 4920attagtagat
tctgagatta aaaaaaattt ttaaagagtg atcatttaaa taatttctaa
4980aagggtcttt tcaagctcta acaaagtcac taacaaatgc attattttct
acagaattag 5040atgttagtag tacagtactg catattcagg gaaaaagtgt
gaggaattga tttcaaaata 5100gttcgttctt gtgtttgacc taagaatgat
tgtcgcatga agtgtttgtt tttacagttt 5160agcatatata aacaaacatg
ataggattcc ttaagatgtt accacccagg gggccacaag 5220ccagcctgct
gtctcaggaa gctgtagaag gagtgtttgt caatttcttg tcactggttt
5280gctgacttac tgaggattaa ttgttgcctt acaatgttac tgaaataaac tgtttaat
5338945387DNAHomo sapiens 94ggccggggct gtcgcgggtt ggggcggttg
ggctggcagc tgaggctcgt ggccatggag 60tggggttctg agtcggctgc tgtgaggcgg
caccgcgtcg gagtagagcg tcgagaggga 120gcggcggccg cgccaccgcc
ggagagggag gcccgagcgc aggagcctct ggtggatggg 180tgcagcggcg
gcgggaggac gcggaagagg agccccgggg gtagcggcgg cgcgagcagg
240ggcgcgggga ccgggctgtc tgaggtgcgc gccgcgctgg ggctcgcgct
ctacctgatc 300gcgctgcgga cgctggtgca gctctcgctg cagcagctcg
tgctacgcgg ggccgctgga 360caccgcgggg agttcgacgc gctccaagcc
agggattatc ttgaacacat aacctccatt 420ggccccagga ctacaggaag
tccagaaaat gaaattctga ccgtgcacta ccttttggaa 480cagattaaac
tgattgaagt gcaaagcaac agccttcata agatttcagt agatgtacaa
540cggcccacag gctcttttag cattgatttc ttgggaggtt ttacaagcta
ttatgacaac 600atcaccaatg ttgtggtaaa gctggaaccc agagatggag
cccagcatgc tgtcttggct 660aattgtcatt ttgactcagt agcaaactca
ccaggtgcca gtgatgatgc agttagctgc 720tcagtgatgc tggaagtcct
tcgcgtcttg tcaacatctt cagaagcctt gcatcatgct 780gtcatatttc
tctttaatgg tgctgaggaa aatgtcttgc aagccagtca tggtttcatt
840actcagcacc cctgggctag cttgattcgt gcattcatta acctagaggc
agcaggtgta 900ggagggaaag aacttgtatt ccaaacaggt cctgaaaatc
cttggttggt tcaagcttat 960gtttcagcag ctaaacaccc ttttgcttct
gtggtggctc aggaggtttt tcagagtgga 1020atcattcctt cagatactga
ctttcgtatc tacagggatt ttgggaacat tccaggaata 1080gacttagctt
ttattgagaa tggatacatt tatcacacca agtatgacac agcggacaga
1140attctaacag attccattca gagagcaggt gacaacattt tagcagttct
taagcatcta 1200gctacatctg atatgctggc tgctgcttct aagtatcgac
atggaaacat ggtcttcttt 1260gatgtgctgg gcctgtttgt cattgcctac
ccctctcgta ttggctcaat cataaactac 1320atggtggtaa tgggtgttgt
tttgtacctg ggcaaaaaat ttttgcagcc caaacataag 1380actggtaact
acaagaagga cttcttgtgt ggacttggca tcactttgat cagctggttc
1440actagccttg ttaccgttct cattatagca gtgttcatct ctcttattgg
acagtctctc 1500tcatggtata accacttcta tgtctccgtt tgtctgtatg
gaactgcaac tgtagccaaa 1560ataatactta tacatactct tgcgaaaaga
ttttattaca tgaatgccag tgcccagtat 1620ctgggagaag tattttttga
catttcgctg tttgtccatt gctgttttct tgttaccctc 1680acttaccaag
gactttgctc ggcgtttatt agtgctgtct gggtagcatt cccattgctc
1740acaaagctct gtgtgcataa ggacttcaag cagcatggtg cccaaggaaa
atttattgct 1800ttttaccttt tggggatgtt tattccttat ctttatgcat
tgtacctcat ctgggcagta 1860tttgagatgt ttacccctat cctcgggaga
agtggttctg aaatcccacc tgatgttgtg 1920ctggcatcca ttttggctgg
ctgtacaatg attctctcgt cctattttat taacttcatc 1980taccttgcca
agagcacaaa aaaaaccatg ctaactttaa ctttggtatg tgcaattaca
2040ttcctccttg tttgcagtgg aacatttttt ccatatagct ccaatcctgc
taatccgaag 2100ccaaagagag tgtttcttca gcatatgact agaacattcc
atgacttgga aggaaatgca 2160gttaaacggg actctggaat atggatcaat
gggtttgatt atactggaat ttctcacata 2220acccctcaca ttcctgagat
caatgatagt atccgagctc actgtgagga gaatgcacct 2280ctttgtggtt
ttccttggta tcttccagtg cactttctga tcaggaaaaa ctggtatctt
2340cctgccccag aagtttctcc aagaaatcct cctcatttcc gactcatatc
caaagaacag 2400acaccttggg attctataaa attgactttt gaagcaacag
cctgcctgcc tatccttcag 2460attttggact tgccagcctc aacaatcatg
accaagccat atgtccttct atgttcgagc 2520ccacaaaggg tcaacacttt
ctcagtggtc tcttggcaat ggcaccccag tcacaagtaa 2580aggaggagac
tactttgtct tttactccca tggactccag gcctctgcat ggcagttctg
2640gatagaagtg caggtttcag aagaacatcc tgaaggaatg gtcaccgtgg
ccattgctgc 2700ccactatctg tctggggaag acaagagatc ccctcaactg
gatgctctga aggaaaagtt 2760cccagattgg acatttccct ctgcctgggt
gtgcacctac gatctctttg tattttaatc 2820ttgtggatga gctctaagta
catgcccagt ggatactcca tgtgacatgg tttctcccta 2880tgttacgtgg
atgtttgtaa cgtaagtcaa tgaattttaa tgatcatatg ttcaaagagc
2940tttctgggtt aacgcttttc agggccaagc actataaggg tttagctgtg
gcgcagtgat 3000gcatggcctg ttgacacttg aaaatgccag tcttttggca
cttcagcaca tgtgggtact 3060gccactacac acacgtcatt ttatatgacc
ttaaggacaa agccaacaat ccacttcaat 3120agctgcccct ttaggatcaa
gaaagatgta cactgtcaga gcattgttaa tgagacaaaa 3180gttgtttcca
atttaagccc caaaaccatt tgttgtatta gtggatggtg ggtaaaatat
3240cattcactga ggtaatgatt ccccttgaga atataactct gtgtaggtca
ctggaaagtg 3300attgccatag ggctgggaga gaagcattgc actcttgagg
ctgtagcctg tgtcaagctg 3360tttcttcagg cagcctctca aatgtgcttt
gtctctctgt gctgaggcct ggaccctgtg 3420ctgagctggt gactcactgt
cctgacaagt ggacacacag atgcactgct gtgctgcttt 3480cctgaggtgg
ttttctatgc ctgttttcct ctgaaacatg tctgttaccc ctctccatct
3540taccaagttg aaaaggggaa tatttggcca catacccctc tggttttcgt
aggttctttt 3600ggttcagaat attgtttgtg ccagtacatg accttaactt
ccttcctcag agcactgagc 3660tgccatctgg gctattctgg ggtagaagga
aggctgggag tggtgggaat tttataaata 3720tttattctct tttctttgtt
tcataggagt cttgtgttat acaaggttag tccttcatgg 3780tataatctta
ctgatgcact gggcctatct ttttgttttc cagccagttg aatagattag
3840tttttctcag taacttacta tccagcagac tggctttcct gagacttgag
gttgtggctt 3900atactggaat gagaccactg tacgtgtagg tggttcagat
cctgcgtaat ggcagcatga 3960ggacttaaaa ggtggttttc attttgaaga
tggctatgta gcttgtaagg tgtatcacag 4020cagtacctct catggctttt
tggttccagc agtgagggca ttggtgagat caatggtaaa 4080ctgtgcaagc
tttcttttta tcattaggaa atgtgaaacg ttggacaaat tttgagtttt
4140aacaaggaca aaaagttgaa agaaaaggca cagttaacaa aaaagggtgg
ctagatttat 4200cttgggtgat ggaggaaatg agagaggaat gctcttgaaa
ggtggtctgt ggatctgtct 4260gaatagaaag agcacagtaa gtatgcattg
ccggagaaaa cgtccttgaa gctgcttgtc 4320tcatgtgtat gatgtgcttt
ttaaatcatg cccctcgttg cctgcctaat ctgtgactcc 4380ctaaaaacta
actgggccca tgtagatggg gctgcaacca gagctgaata acatgttagg
4440ctcacacatg catcagcact gcacactgga atcattgctc ttcctggact
ttgtagaaat 4500cagtctcaag tgcttcaaga gtctggctcc tgctactttt
atctgtcagg tagcacataa 4560ggtttgcagg gtttatattt tgtatagaat
cacagttgtg gagaaaaagt aataatttct 4620caatgaattt taaaaatggg
cctattttct atccccgtgg ttcatctgat ataattagtg 4680ttccctgtga
attccccccc tctatgggaa ggatgccttt actctttatc agtaataaat
4740tatgactgtt ttcatattgc cttagggtta tttccctgtg taaaccattg
tcttttgttt 4800tggttttctt tagcattatg aagctttggt attgtacaag
gtcagtagta agatgctcac 4860tagtctcagg gcttgtgtaa tattctggga
ggtcatttaa atgccagaaa tggtcaagca 4920attatacaca gtatttatga
ctctgttaag cataccgttt gtctgtcaca ttagtagatt 4980ctgagattaa
aaaaaatttt taaagagtga tcatttaaat aatttctaaa agggtctttt
5040caagctctaa caaagtcact aacaaatgca ttattttcta cagaattaga
tgttagtagt 5100acagtactgc atattcaggg aaaaagtgtg aggaattgat
ttcaaaatag ttcgttcttg 5160tgtttgacct aagaatgatt gtcgcatgaa
gtgtttgttt ttacagttta gcatatataa 5220acaaacatga taggattcct
taagatgtta ccacccaggg ggccacaagc cagcctgctg 5280tctcaggaag
ctgtagaagg agtgtttgtc aatttcttgt cactggtttg ctgacttact
5340gaggattaat tgttgcctta caatgttact gaaataaact gtttaat
538795202PRTHomo sapiens 95Met Glu Arg Gly Ala Gly Ala Lys Leu Leu
Pro Leu Leu Leu Leu Leu 1 5 10 15 Arg Ala Thr Gly Phe Thr Cys Ala
Gln Thr Asp Gly Arg Asn Gly Tyr 20 25 30 Thr Ala Val Ile Glu Val
Thr Ser Gly Gly Pro Trp Gly Asp Trp Ala 35 40 45 Trp Pro Glu Met
Cys Pro Asp Gly Phe Phe Ala Ser Gly Phe Ser Leu 50 55 60 Lys Val
Glu Pro Pro Gln Gly Ile Pro Gly Asp Asp Thr Ala Leu Asn 65 70 75 80
Gly Ile Arg Leu His Cys Ala Arg Gly Asn Val Leu Gly Asn Thr His 85
90 95 Val Val Glu Ser Gln Ser Gly Ser Trp Gly Glu Trp Ser Glu Pro
Leu 100 105 110 Trp Cys Arg Gly Gly Ala Tyr Leu Val Ala Phe Ser Leu
Arg Val Glu 115 120 125 Ala Pro Thr Thr Leu Gly Asp Asn Thr Ala Ala
Asn Asn Val Arg Phe 130 135 140 Arg Cys Ser Asp Gly Glu Glu Leu Gln
Gly Pro Gly Leu Ser Trp Gly 145 150 155 160 Asp Phe Gly Asp Trp Ser
Asp His Cys Pro Lys Gly Ala Cys Gly Leu 165 170 175 Gln Thr Lys Ile
Gln Gly Pro Arg Gly Leu Gly Asp Asp Thr Ala Leu 180 185 190 Asn Asp
Ala Arg Leu Phe Cys Cys Arg Ser 195 200 9670PRTHomo sapiens 96Met
Glu Arg Gly Ala Gly Ala Lys Leu Leu Pro Leu Leu Leu Leu Leu 1 5 10
15 Arg Ala Thr Gly Phe Thr Cys Ala Gln Thr Asp Gly Arg Asn Gly Tyr
20 25 30 Thr Ala Val Ile Glu Val Thr Ser Gly Gly Pro Trp Gly Asp
Trp Ala 35 40 45 Trp Pro Glu Met Cys Pro Asp Gly Phe Phe Ala Ser
Gly Phe Ser Leu 50 55 60 Lys Leu Gly Arg Met Glu 65 70 97102PRTHomo
sapiens 97Met Glu Arg Gly Ala Gly Ala Lys Leu Leu Pro Leu Leu Leu
Leu Leu 1 5 10 15 Arg Ala Thr Gly Phe Thr Cys Ala Gln Thr Asp Gly
Arg Asn Gly Tyr 20 25 30 Thr Ala Val Ile Glu Val Thr Ser Gly Gly
Pro Trp Gly Asp Trp Ala 35 40 45 Trp Pro Glu Met Cys Pro Asp Gly
Phe Phe Ala Ser Gly Phe Ser Leu 50 55 60 Lys Val Glu Pro Pro Gln
Gly Ile Pro Gly Asp Asp Thr Ala Leu Asn 65 70 75 80 Gly Ile Arg Leu
His Cys Ala Arg Gly Asn Val Leu Gly Asn Thr His 85 90 95 Val Leu
Gly Arg Met Glu 100 98114PRTHomo sapiens 98Met Glu Arg Gly Ala Gly
Ala Lys Leu Leu Pro Leu Leu Leu Leu Leu 1 5 10 15 Arg Ala Thr Gly
Phe Thr Cys Ala Gln Thr Asp Gly Arg Asn Gly Tyr 20 25 30 Thr Ala
Val Ile Glu Val Thr Ser Gly Gly Pro Trp Gly Asp Trp Ala 35 40 45
Trp Pro Glu Met Cys Pro Asp Gly Phe Phe Ala Ser Gly Phe Ser Leu 50
55 60 Lys Val Glu Pro Pro Gln Gly Ile Pro Gly Asp Asp Thr Ala Leu
Asn 65 70 75 80 Gly Ile Arg Leu His Cys Ala Arg Gly Asn Val Leu Gly
Asn Thr His 85 90 95 Val Val Glu Ser Gln Ser Gly Arg Trp Gly Ala
Gly Val Glu Asp Pro 100 105 110 Leu Gly 99769DNAHomo sapiens
99ggaggccctt ccgcaatcgg agccctcaca gaggccaaac tgatataaat ctgcttagga
60ggcctgattc acagacgcta caggatggag cggggcgcag gagccaagct gctgccgctg
120ctgctgcttc tgcgggcgac tggtttcaca tgtgcacaga cagatggccg
gaacggctac 180acggcggtca tcgaagtgac cagcgggggt ccctggggcg
actgggcctg gcctgagatg 240tgtcccgatg gattcttcgc cagcgggttc
tcgctcaagg tggagcctcc ccaaggcatt 300cctggcgacg acactgcact
gaatgggatc aggctgcact gcgcgcgcgg gaacgtccta 360ggcaatacgc
acgtggtaga gtcccagtct ggaagctggg gcgaatggag tgagccgctg
420tggtgtcgcg gcggcgccta cctagtggct ttctcgcttc gcgtggaggc
acccacgacc 480ctcggtgaca acacagcagc gaacaacgtg cgcttccgct
gttcagacgg cgaggaactg 540caggggcctg ggctgagctg gggagacttt
ggagactgga gtgaccattg ccccaagggc 600gcgtgcggcc tgcagaccaa
gatccaggga cctagaggcc tcggcgatga cactgcgctg 660aacgacgcgc
gcttattctg ctgccgcagt tgaacggcgc cgccgccgcc gctctctccc
720gggccaggag gctagtccca cctcttgcta ttaaagcttc tctgagttg
769100653DNAHomo sapiens 100ggaggccctt ccgcaatcgg agccctcaca
gaggccaaac tgatataaat ctgcttagga 60ggcctgattc acagacgcta caggatggag
cggggcgcag gagccaagct gctgccgctg 120ctgctgcttc tgcgggcgac
tggtttcaca tgtgcacaga cagatggccg gaacggctac 180acggcggtca
tcgaagtgac cagcgggggt ccctggggcg actgggcctg gcctgagatg
240tgtcccgatg gattcttcgc cagcgggttc tcgctcaagc tggggcgaat
ggagtgagcc 300gctgtggtgt cgcggcggcg cctacctagt ggctttctcg
cttcgcgtgg aggcacccac 360gaccctcggt gacaacacag cagcgaacaa
cgtgcgcttc cgctgttcag acggcgagga 420actgcagggg cctgggctga
gctggggaga ctttggagac tggagtgacc attgccccaa 480gggcgcgtgc
ggcctgcaga ccaagatcca gggacctaga ggcctcggcg atgacactgc
540gctgaacgac gcgcgcttat tctgctgccg cagttgaacg gcgccgccgc
cgccgctctc 600tcccgggcca ggaggctagt cccacctctt gctattaaag
cttctctgag ttg 653101747DNAHomo sapiens 101aggcccttcc gcaatcggag
ccctcacaga ggccaaactg atataaatct gcttaggagg 60cctgattcac agacgctaca
ggatggagcg gggcgcagga
gccaagctgc tgccgctgct 120gctgcttctg cgggcgactg gtttcacatg
tgcacagaca gatggccgga acggctacac 180ggcggtcatc gaagtgacca
gcgggggtcc ctggggcgac tgggcctggc ctgagatgtg 240tcccgatgga
ttcttcgcca gcgggttctc gctcaaggtg gagcctcccc aaggcattcc
300tggcgacgac actgcactga atgggatcag gctgcactgc gcgcgcggga
acgtcctagg 360caatacgcac gtgctggggc gaatggagtg agccgctgtg
gtgtcgcggc ggcgcctacc 420tagtggcttt ctcgcttcgc gtggaggcac
ccacgaccct cggtgacaac acagcagcga 480acaacgtgcg cttccgctgt
tcagacggcg aggaactgca ggggcctggg ctgagctggg 540gagactttgg
agactggagt gaccattgcc ccaagggcgc gtgcggcctg cagaccaaga
600tccagggacc tagaggcctc ggcgatgaca ctgcgctgaa cgacgcgcgc
ttattctgct 660gccgcagttg aacggcgccg ccgccgccgc tctctcccgg
gccaggaggc tagtcccacc 720tcttgctatt aaagcttctc tgagttg
747102803DNAHomo sapiens 102aggcccttcc gcaatcggag ccctcacaga
ggccaaactg atataaatct gcttaggagg 60cctgattcac agacgctaca ggatggagcg
gggcgcagga gccaagctgc tgccgctgct 120gctgcttctg cgggcgactg
gtttcacatg tgcacagaca gatggccgga acggctacac 180ggcggtcatc
gaagtgacca gcgggggtcc ctggggcgac tgggcctggc ctgagatgtg
240tcccgatgga ttcttcgcca gcgggttctc gctcaaggtg gagcctcccc
aaggcattcc 300tggcgacgac actgcactga atgggatcag gctgcactgc
gcgcgcggga acgtcctagg 360caatacgcac gtggtagagt cccagtctgg
aaggtggggc gcaggggtcg aggatccctt 420ggggtgatgc tggggcgaat
ggagtgagcc gctgtggtgt cgcggcggcg cctacctagt 480ggctttctcg
cttcgcgtgg aggcacccac gaccctcggt gacaacacag cagcgaacaa
540cgtgcgcttc cgctgttcag acggcgagga actgcagggg cctgggctga
gctggggaga 600ctttggagac tggagtgacc attgccccaa gggcgcgtgc
ggcctgcaga ccaagatcca 660gggacctaga ggcctcggcg atgacactgc
gctgaacgac gcgcgcttat tctgctgccg 720cagttgaacg gcgccgccgc
cgccgctctc tcccgggcca ggaggctagt cccacctctt 780gctattaaag
cttctctgag ttg 80310321DNAHomo sapiens 103cgaggacaat ctggatatca a
2110421DNAHomo sapiens 104ctggagccct cgagcaagaa a 2110521DNAHomo
sapiens 105cccgtggttc atctgatata a 2110621DNAHomo sapiens
106aaggactttg ctcggcgttt a 2110721DNAHomo sapiens 107tacgtggatg
tttgtaacgt a 2110821DNAHomo sapiens 108ctcgtattgg ctcaatcata a
21
* * * * *
References