Ovary Tumor Markers And Methods Of Use Thereof

Grifantini; Renata ;   et al.

Patent Application Summary

U.S. patent application number 15/936601 was filed with the patent office on 2018-11-01 for ovary tumor markers and methods of use thereof. The applicant listed for this patent is Externautics S.P.A.. Invention is credited to Susanna Campagnoli, Alberto Grandi, Renata Grifantini, Renzo Nogarotto, Matteo Parri, Andrea Pierleoni, Piero Pileri.

Application Number20180313841 15/936601
Document ID /
Family ID43302460
Filed Date2018-11-01

United States Patent Application 20180313841
Kind Code A1
Grifantini; Renata ;   et al. November 1, 2018

Ovary Tumor Markers And Methods Of Use Thereof

Abstract

Newly identified proteins as markers for the detection of ovary tumors, or as therapeutic targets for treatment thereof; affinity ligands capable of selectively interacting with the newly identified markers, methods for tumor diagnosis and therapy using the same.


Inventors: Grifantini; Renata; (Siena, IT) ; Pileri; Piero; (Siena, IT) ; Campagnoli; Susanna; (Siena, IT) ; Grandi; Alberto; (Siena, IT) ; Parri; Matteo; (Siena, IT) ; Pierleoni; Andrea; (Siena, IT) ; Nogarotto; Renzo; (Siena, IT)
Applicant:
Name City State Country Type

Externautics S.P.A.

Siena

IT
Family ID: 43302460
Appl. No.: 15/936601
Filed: March 27, 2018

Related U.S. Patent Documents

Application Number Filing Date Patent Number
13503417 Sep 10, 2012
PCT/EP2010/066154 Oct 26, 2010
15936601

Current U.S. Class: 1/1
Current CPC Class: C12N 2310/14 20130101; C12N 15/113 20130101; C07K 16/3069 20130101; G01N 33/57449 20130101; G01N 2500/02 20130101
International Class: G01N 33/574 20060101 G01N033/574; C07K 16/30 20060101 C07K016/30; C12N 15/113 20060101 C12N015/113

Foreign Application Data

Date Code Application Number
Oct 26, 2009 EP 09174062.1

Claims



1.-11. (canceled)

12. A method of treating a proliferative disease in a subject, the method comprising administering to the subject an antibody or a fragment thereof that specifically recognizes and binds to a KLRG2 protein.

13. The method of claim 12, wherein the proliferative disease is an ovarian cancer.

14. The method of claim 12, wherein the KLRG2 protein has an amino acid sequence that has at least 80% identity to SEQ ID NO: 85 or SEQ ID NO: 86.

15. The method of claim 12, wherein the KLRG2 protein has an amino acid sequence that has at least 90% identity to SEQ ID NO: 85 or SEQ ID NO: 86.

16. The method of claim 12, wherein the KLRG2 protein has an amino acid sequence that has at least 95% identity to SEQ ID NO: 85 or SEQ ID NO: 86.

17. The method of claim 12, wherein the KLRG2 protein has an amino acid sequence selected from SEQ ID NO: 85 or SEQ ID NO: 86.

18. The method of claim 12, wherein the antibody or fragment thereof modulates KLRG2 activity.

19. The method of claim 12, wherein the antibody or fragment thereof inhibits or abolishes KLRG2 activity.

20. The method of claim 12, wherein the antibody is a monoclonal antibody.

21. The method of claim 12, wherein the antibody is a polyclonal antibody.

22. The method of claim 12, wherein the antibody is labelled with a radioactive, fluorescent or enzyme label.

23. The method of claim 12 further comprising detecting the expression of KLRG2 protein in a tissue sample of the subject.

24. The method of claim 23, wherein the tissue sample is an ovarian tissue sample.

25. The method of claim 23, wherein the KLRG2 protein is detected by an immunoradiometric, immunoenzymatic or immunohistochemical assay.

26. The method of claim 12 further comprising detecting the expression of KLRG2 mRNA in a tissue sample of the subject.

27. The method of claim 26, wherein the tissue sample is an ovarian tissue sample.

28. The method of claim 26, wherein the KLRG2 mRNA is detected by polymerase chain reaction (PCR).

29. The method of claim 12, wherein the subject is a human.

30. A method of treating a proliferative disease in a subject, the method comprising administering to the subject a siRNA molecule that specifically silences KLRG2.

31. The method of claim 30, wherein the proliferative disease is an ovarian cancer.

32. The method of claim 30, wherein the siRNA molecule has a sequence complementary to SEQ ID NO: 87 or SEQ ID NO: 88.

33. The method of claim 29, wherein the subject is a human.
Description



[0001] The present invention relates to newly identified proteins as markers for the detection of ovary tumors, or as targets for their treatment. Also provided are affinity ligands capable of selectively interacting with the newly identified markers, as well as methods for tumor diagnosis and therapy using such ligands.

BACKGROUND OF THE INVENTION

[0002] Tumor Markers (or Biomarkers)

[0003] Tumor markers are substances that can be produced by tumor cells or by other cells of the body in response to cancer. In particular, a protein biomarker is either a single protein or a panel of different proteins that could be used to unambiguously distinguish a disease state. Ideally, a biomarker would have both a high specificity and sensitivity, being represented in a significant percentage of the cases of given disease and not in healthy state.

[0004] Biomarkers can be identified in different biological samples, like tissue biopsies or preferably biological fluids (saliva, urine, blood-derivatives and other body fluids), whose collection does not necessitate invasive treatments. Tumor marker levels may be categorized in three major classes on the basis of their clinical use. Diagnostic markers can be used in the detection and diagnosis of cancer. Prognostics markers are indicative of specific outcomes of the disease and can be used to define predictive models that allow the clinicians to predict the likely prognosis of the disease at time of diagnosis. Moreover, prognosis markers are helpful to monitor the patient response to a drug therapy and facilitate a more personalized patient management. A decrease or return to a normal level may indicate that the cancer is responding to therapy, whereas an increase may indicate that the cancer is not responding. After treatment has ended, tumor marker levels may be used to check for recurrence of the tumor. Finally, therapeutic markers can be used to develop tumor-specific drugs or affinity ligand (i.e. antibodies) for a prophylactic intervention.

[0005] Currently, although an abnormal tumor marker level may suggest cancer, this alone is usually not enough to accurately diagnose cancer and their measurement in body fluids is frequently combined with other tests, such as a biopsy and radioscopic examination. Frequently, tumor marker levels are not altered in all of people with a certain cancer disease, especially if the cancer is at early stage. Some tumor marker levels can also be altered in patients with noncancerous conditions. Most biomarkers commonly used in clinical practice do not reach a sufficiently high level of specificity and sensitivity to unambiguously distinguish a tumor from a normal state.

[0006] To date the number of markers that are expressed abnormally is limited to certain types/subtypes of cancer, some of which are also found in other diseases. (http://www.cancer.gov/cancertopics/factsheet).

[0007] For example, prostate-specific antigen (PSA) levels are often used to screen men for prostate cancer, but this is controversial since elevated PSA levels can be caused by both prostate cancer or benign conditions, and most men with elevated PSA levels turn out not to have prostate cancer.

[0008] Another tumor marker, Cancer Antigen 125, (CA 125), is sometimes used to screen women who have an increased risk for ovarian cancer. Scientists are studying whether measurement of CA 125, along with other tests and exams, is useful to find ovarian cancer before symptoms develop. So far, CA 125 measurement is not sensitive or specific enough to be used to screen all women for ovarian cancer. Mostly, CA 125 is used to monitor response to treatment and check for recurrence in women with ovarian cancer. Finally, human epidermal growth factor receptor (HER2) is a marker protein overproduced in about 20% of breast cancers, whose expression is typically associated with a more aggressive and recurrent tumors of this class.

[0009] Routine Screening Test for Tumor Diagnosis

[0010] Screening tests are a way of detecting cancer early, before there are any symptoms. For a screening test to be helpful, it should have high sensitivity and specificity. Sensitivity refers to the test's ability to identify people who have the disease. Specificity refers to the test's ability to identify people who do not have the disease. Different molecular biology approaches such as analysis of DNA sequencing, small nucleotide polymorphyms, in situ hybridization and whole transcriptional profile analysis have done remarkable progresses to discriminate a tumor state from a normal state and are accelerating the knowledge process in the tumor field. However so far different reasons are delaying their use in the common clinical practice, including the higher analysis complexity and their expensiveness. Other diagnosis tools whose application is increasing in clinics include in situ hybridization and gene sequencing.

[0011] Currently, Immuno-HistoChemistry (IHC), a technique that allows the detection of proteins expressed in tissues and cells using specific antibodies, is the most commonly used method for the clinical diagnosis of tumor samples. This technique enables the analysis of cell morphology and the classification of tissue samples on the basis of their immunoreactivity. However, at present, IHC can be used in clinical practice to detect cancerous cells of tumor types for which protein markers and specific antibodies are available. In this context, the identification of a large panel of markers for the most frequent cancer classes would have a great impact in the clinical diagnosis of the disease.

[0012] Anti-Cancer Therapies

[0013] In the last decades, an overwhelming number of studies remarkably contributed to the comprehension of the molecular mechanisms leading to cancer. However, this scientific progress in the molecular oncology field has not been paralleled by a comparable progress in cancer diagnosis and therapy. Surgery and/or radiotherapy are the still the main modality of local treatment of cancer in the majority of patients. However, these treatments are effective only at initial phases of the disease and in particular for solid tumors of epithelial origin, as is the case of colon, lung, breast, prostate and others, while they are not effective for distant recurrence of the disease. In some tumor classes, chemotherapy treatments have been developed, which generally relies on drugs, hormones and antibodies, targeting specific biological processes used by cancers to grow and spread. However, so far many cancer therapies had limited efficacy due to severity of side effects and overall toxicity. Indeed, a major effort in cancer therapy is the development of treatments able to target specifically tumor cells causing limited damages to surrounding normal cells thereby decreasing adverse side effects. Recent developments in cancer therapy in this direction are encouraging, indicating that in some cases a cancer specific therapy is feasible. In particular, the development and commercialization of humanized monoclonal antibodies that recognize specifically tumor-associated markers and promote the elimination of cancer is one of the most promising solution that appears to be an extremely favorable market opportunity for pharmaceutical companies. However, at present the number of therapeutic antibodies available on the market or under clinical studies is very limited and restricted to specific cancer classes. So far licensed monoclonal antibodies currently used in clinics for the therapy of specific tumor classes, show only a partial efficacy and are frequently associated with chemotherapies to increase their therapeutic effect. Administration of Trastuzumab (Herceptin), a commercial monoclonal antibody targeting HER2, a protein overproduced in about 20% of breast cancers, in conjunction with Taxol adjuvant chemotherapy induces tumor remission in about 42% of the cases. Bevacizumab (Avastin) and Cetuximab (Erbitux) are two monoclonal antibodies recently licensed for use in humans, targeting the endothelial and epithelial growth factors respectively that, combined with adjuvant chemotherapy, proved to be effective against different tumor diseases. Bevacizumab proved to be effective in prolonging the life of patients with metastatic colorectal, breast and lung cancers. Cetuximab demostrated efficacy in patients with tumor types refractory to standard chemotherapeutic treatments (Adams G. P. and Weiner L. M. (2005) Monoclonal antibody therapy cancer. Nat Biotechnol. 23:1147-57).

[0014] In summary, available screening tests for tumor diagnosis are uncomfortable or invasive and this sometimes limits their applications. Moreover tumor markers available today have a limited utility in clinics due to either their incapability to detect all tumor subtypes of the defined cancers types and/or to distinguish unambiguously tumor vs. normal tissues. Similarly, licensed monoclonal antibodies combined with standard chemotherapies are not effective against the majority of cases. Therefore, there is a great demand for new tools to advance the diagnosis and treatment of cancer.

[0015] Experimental Approaches Commonly Used to Identify Tumor Markers

[0016] Most popular approaches used to discover new tumor markers are based on genome-wide transcription profile or total protein content analyses of tumor. These studies usually lead to the identification of groups of mRNAs and proteins, which are differentially expressed in tumors. Validation experiments then follow to eventually single out, among the hundreds of RNAs/proteins identified, the very few that have the potential to become useful markers. Although often successful, these approaches have several limitations and often, do not provide firm indications on the association of protein markers with tumor. A first limitation is that, since frequently mRNA levels not always correlate with corresponding protein abundance (approx. 50% correlation), studies based on transcription profile do not provide solid information regarding the expression of protein markers in tumor (1, 2, 3, 4).

[0017] A second limitation is that neither transcription profiles nor analysis of total protein content discriminate post-translation modifications, which often occur during oncogenesis. These modifications, including phosphorylations, acetylations, and glycosylations, or protein cleavages influence significantly protein stability, localization, interactions, and functions (5).

[0018] As a consequence, large scale studies generally result in long lists of differentially expressed genes that would require complex experimental paths in order to validate the potential markers. However, large scale genomic/proteomic studies reporting novel tumor markers frequently lack of confirmation data on the reported potential novel markers and thus do not provide solid demonstration on the association of the described protein markers with tumor.

[0019] The approach that we used to identify the protein markers included in the present invention is based on an innovative immuno-proteomic technology. In essence, a library of recombinant human proteins has been produced from E. coli and is being used to generate polyclonal antibodies against each of the recombinant proteins.

[0020] The screening of the antibodies library on Tissue microarrays (TMAs) carrying clinical samples from different patients affected by the tumor under investigation lead to the identification of specific tumor marker proteins. Therefore, by screening TMAs with the antibody library, the tumor markers are visualized by immuno-histochemistry, the classical technology applied in all clinical pathology laboratories. Since TMAs also include healthy tissues, the specificity of the antibodies for the tumors can be immediately appreciated and information on the relative level of expression and cellular localization of the markers can be obtained. In our approach the markers are subjected to a validation process consisting in a molecular and cellular characterization.

[0021] Altogether, the detection the marker proteins disclosed in the present invention selectively in tumor samples and the subsequent validation experiments lead to an unambiguous confirmation of the marker identity and confirm its association with defined tumor classes. Moreover this process provides an indication of the possible use of the proteins as tools for diagnostic or therapeutic intervention. For instance, markers showing a surface cellular localization could be both diagnostic and therapeutic markers against which both chemical and antibody therapies can be developed. Differently, markers showing a cytoplasmic expression could be more likely considered for the development of tumor diagnostic tests and chemotherapy/small molecules treatments.

SUMMARY OF THE INVENTION

[0022] The present invention provides new means for the detection and treatment of tumors, in particular ovary tumor, based on the identification of protein markers specific for these tumor type, namely: [0023] i) Killer cell lectin-like receptor subfamily G member 2 (C-type lectin domain family 15 member B) (KLRG2); [0024] ii) Solute carrier family 39 (zinc transporter), member 10 (SLC39A10); [0025] iii) G protein-coupled receptor 107 (GPR107); [0026] iv) DPY-19-like 3 (DPY19L3); [0027] v) Uncharacterized protein FLJ42986 (FLJ42986); [0028] vi) Collagen, type XX, alpha 1 (COL20A1); [0029] vii) Glycosyltransferase 25 domain containing 2 (GLT25D2); [0030] viii) Synaptotagmin-like 3 (SYTL3); [0031] ix) DENN/MADD domain containing 1B (DENND1B); [0032] x) Putative uncharacterized protein (FLJ3710); [0033] xi) Chromosome 6 open reading frame 98 (C6orf98); [0034] xii) Family with sequence similarity 69, member B (Fam69B); [0035] xiii) EMI domain-containing protein 1 Precursor (EMID1); [0036] xiv) Endoplasmic reticulum metallopeptidase 1 (ERMP1); [0037] xv) Vitelline membrane outer layer protein 1 homolog Precursor (VMO1)

[0038] The invention also provides a method for the diagnosis of these cancer types, comprising a step of detecting the above-identified markers in a biological sample, e.g. in a tissue sample of a subject suspected of having or at risk of developing malignancies or susceptible to cancer recurrences.

[0039] In addition, the tumor markers identify novel targets for affinity ligands, which can be used for therapeutic applications. Also provided are affinity ligands, particularly antibodies, capable of selectively interacting with the newly identified protein markers.

DETAILED DISCLOSURE OF THE INVENTION

[0040] The present invention is based on the surprising finding of antibodies that are able to specifically stain tumor tissues from patients, while negative or very poor staining is observed in normal tissues from the same patients. These antibodies have been found to specifically bind to proteins for which no previous association with tumor has been reported. Hence, in a first aspect, the invention provides an ovarian tumor marker, which is selected from the group consisting of:

[0041] i) KLRG2, SEQ ID NO:85, SEQ ID NO:86 or a different isoform having sequence identity of at least 80%, preferably at least 90%, more preferably at least 95% to SEQ ID NO:85, SEQ ID NO:86 or a nucleic acid molecule containing a sequence coding for a KLRG2 protein, said encoding sequence being preferably selected from SEQ ID NO: 87 and SEQ ID NO:88;

[0042] ii) SLC39A10 in one of its variant isoforms SEQ ID NO:1, SEQ ID NO:2 or a different isoform having sequence identity of at least 80%, preferably at least 90%, more preferably at least 95% to SEQ ID NO:1 or SEQ ID NO:2; or a nucleic acid molecule containing a sequence coding for a SLC39A10 protein, said encoding sequence being preferably selected from SEQ ID NO: 3 and SEQ ID NO: 4;

[0043] iii) GPR107, in one of its variant isoforms SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, or a different isoform having sequence identity of at least 80%, preferably at least 90%, more preferably at least 95% to any of SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, or a nucleic acid molecule containing a sequence coding for a GPR107 protein, said encoding sequence being preferably selected from SEQ ID NO:8, SEQ ID NO:9 and SEQ ID NO:10;

[0044] iv) DPY19L3, in one of its variant isoforms SEQ ID NO:11, SEQ ID NO:12, SEQ ID NO:13, SEQ ID NO:14, or a different isoform having sequence identity of at least 80%, preferably at least 90%, more preferably at least 95% to SEQ ID NO:11, SEQ ID NO:12, SEQ ID NO:13 or SEQ ID NO:14, or a nucleic acid molecule containing a sequence coding for a DPY19L3 protein, said encoding sequence being preferably selected from SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17 and SEQ ID NO:18;

[0045] v) FLJ42986, SEQ ID NO:19 or a different isoform having sequence identity of at least 80%, preferably at least 90%, more preferably at least 95% to SEQ ID NO:19, or a nucleic acid molecule containing a sequence coding for a FLJ42986 protein, said encoding sequence being preferably SEQ ID NO:20;

[0046] vi) COL20A1, in one of its variant isoforms SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, or a different isoform having sequence identity of at least 80%, preferably at least 90%, more preferably at least 95% to any of SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, or a nucleic acid molecule containing a sequence coding for a COL20A1 protein, said encoding sequence being preferably selected from SEQ ID NO:24, SEQ ID NO:25 and SEQ ID NO:26;

[0047] vii) GLT25D2, in one of its variant isoforms SEQ ID NO:27, SEQ ID NO:28, SEQ ID NO:29, SEQ ID NO:30, or a different isoform having sequence identity of at least 80%, preferably at least 90%, more preferably at least 95% to any of SEQ ID NO:27, SEQ ID NO:28, SEQ ID NO:29, SEQ ID NO:30, or a nucleic acid molecule containing a sequence coding for a GLT25D2 protein, said encoding sequence being preferably selected from SEQ ID NO:31, SEQ ID NO:32, SEQ ID NO:33 and SEQ ID NO:34;

[0048] viii) SYTL3, in one of its variant isoforms SEQ ID NO:35, SEQ ID NO:36, SEQ ID NO:37, or a different isoform having sequence identity of at least 80%, preferably at least 90%, more preferably at least 95% to any of SEQ ID NO:35, SEQ ID NO:36, SEQ ID NO:37, or a nucleic acid molecule containing a sequence coding for a SYTL3 protein, said encoding sequence being preferably selected from SEQ ID NO:38, SEQ ID NO:39 and SEQ ID NO:40;

[0049] ix) DENND1B; in one of its variant isoforms SEQ ID NO:41, SEQ ID NO:42, SEQ ID NO:43, SEQ ID NO:44, or a different isoform having sequence identity of at least 80%, preferably at least 90%, more preferably at least 95% to any of SEQ ID NO:41, SEQ ID NO:42, SEQ ID NO:43, SEQ ID NO:44, or a nucleic acid molecule containing a sequence coding for a DENND1B protein, said encoding sequence being preferably selected from SEQ ID NO:45, SEQ ID NO:46, SEQ ID NO:47 and SEQ ID NO:48;

[0050] x) FLJ3710, SEQ ID NO:49 or a different isoform having sequence identity of at least 80%, preferably at least 90%, more preferably at least 95% to SEQ ID NO:49, or a nucleic acid molecule containing a sequence coding for a FLJ3710 protein, said encoding sequence being preferably SEQ ID NO:50;

[0051] xi) Chromosome 6 orf 98 (C6orf98) SEQ ID NO:51 or a different isoform having sequence identity of at least 80%, preferably at least 90%, more preferably at least 95% to SEQ ID NO:51, or a nucleic acid molecule containing a sequence coding for a C6orf98 protein, said encoding sequence being preferably SEQ ID NO:52;

[0052] xii) Family with sequence similarity 69, member B (Fam69B) SEQ ID NO:53, SEQ ID NO:54 or a different isoform having sequence identity of at least 80%, preferably at least 90%, more preferably at least 95% to SEQ ID NO:53 or SEQ ID NO:54, or a nucleic acid molecule containing a sequence coding for a Fam69B protein, said encoding sequence being preferably selected from SEQ ID NO:55 and SEQ ID NO:56;

[0053] xiii) EMI domain-containing protein 1 Precursor (EMID1) in one of its variant isoforms SEQ ID NO:57, SEQ ID NO:58, SEQ ID NO:59, SEQ ID NO:60, SEQ ID NO:61, SEQ ID NO:62, SEQ ID NO:63, SEQ ID NO:64, SEQ ID NO:65, SEQ ID NO:66, SEQ ID NO:67, SEQ ID NO:68, SEQ ID NO:69, SEQ ID NO:70, or a different isoform having sequence identity of at least 80%, preferably at least 90%, more preferably at least 95% to SEQ ID NO:57, SEQ ID NO:58, SEQ ID NO:59, SEQ ID NO:60, SEQ ID NO:61, SEQ ID NO:62, SEQ ID NO:63, SEQ ID NO:64, SEQ ID NO:65, SEQ ID NO:66, SEQ ID NO:67, SEQ ID NO:68, SEQ ID NO:69 OR SEQ ID NO:70, or a nucleic acid molecule containing a sequence coding for a EMID protein, said encoding sequence being preferably selected from SEQ ID NO:71, SEQ ID NO:72, SEQ ID NO:73, SEQ ID NO:74, SEQ ID NO:75, SEQ ID NO:76, SEQ ID NO:77, SEQ ID NO:78, SEQ ID NO:79, SEQ ID NO:80, SEQ ID NO:81, SEQ ID NO:82, SEQ ID NO:83 and SEQ ID NO:84;

[0054] xiv) ERMP1, SEQ ID NO:89, SEQ ID NO:90 and SEQ ID NO:91, or a different isoform having sequence identity of at least 80%, preferably at least 90%, more preferably at least 95% to SEQ ID NO:89, SEQ ID NO:90 or SEQ ID NO:91, or a nucleic acid molecule containing a sequence coding for a ERMP1 protein, said encoding sequence being preferably selected from SEQ ID NO:92, SEQ ID NO:93 and SEQ ID NO:94;

[0055] xv) VMO1, SEQ ID NO:95, SEQ ID NO:96, SEQ ID NO:97, SEQ ID NO: 98 or a different isoform having sequence identity of at least 80%, preferably at least 90%, more preferably at least 95% to SEQ ID NO:95, SEQ ID NO:96, SEQ ID NO:97, SEQ ID NO: 98 or a nucleic acid molecule containing a sequence coding for a VMO1 protein, said encoding sequence being preferably selected from SEQ ID NO:99, SEQ ID NO:100 SEQ ID NO:101 and SEQ ID NO: 102.

[0056] As used herein, "Percent (%) amino acid sequence identity" with respect to the marker protein sequences identified herein indicates the percentage of amino acid residues in a full-length protein variant or isoform according to the invention, or in a portion thereof, that are identical with the amino acid residues in the specific marker sequence, after aligning the sequences and introducing gaps, if necessary, to achieve the maximum percent sequence identity, and not considering any conservative substitutions as part of the sequence identity. Alignment for purposes of determining percent amino acid sequence identity can be achieved in various ways that are within the skill in the art, for instance, using publicly available computer software such as BLAST, BLAST-2, ALIGN or Megalign (DNASTAR) software. Those skilled in the art can determine appropriate parameters for measuring alignment, including any algorithms needed to achieve maximal alignment over the full length of the sequences being compared. Identity between nucleotide sequences is preferably determined by the Smith-Waterman homology search algorithm as implemented in the MPSRCH program (Oxford Molecular), using an affine gap search with parameters gap open penalty=12 and gap extension penalty=1.

[0057] SLC39A10 mRNA has been shown to be moderately upregulated in breast cancer tissues as compared to normal samples (approximately 1.5 fold). In the same sudy, loss of SLC39A10 transcription in breast cell lines has been reported to reduce cell migratory activity (6). However, published studies on the expression of SLC39A10 in breast tumor cells are limited to the analysis of SLC39A10 transcript whilst, to the best of our knowledge, no data have been reported documenting the presence of SLC39A10 protein in these tumor cells.

[0058] SLC39A10 is also mentioned in a patent application reporting long lists of differentially transcribed genes in tumor cells by using genome-scale transcription profile analysis (e.g. in Publication Number: US20070237770A1). Again, no data are given documenting the expression of SLC39A10 in tumor at protein level. The lack of correlation between mRNA and protein expression, besides being a general fact, has been specifically demonstrated for LIV-1, another member of the zinc transporter family, suggesting that a similar phenomenon could be extended to other proteins of this class (7). Finally, no evidence exists on the association of SLC39A10 protein with other tumors, such as ovary tumor classes.

[0059] In the present invention we disclose SLC39A10 as a protein without previous known association with ovarian tumor classes and preferably used as a marker ovary tumors and in general for cancers of these types. As described below, an antibody generated towards the SLC39A10 protein shows a selective immunoreactivity in histological preparation of ovary cancer tissues which indicates the presence of SLC39A10 in these cancer samples and makes SLC39A10 protein and its antibody highly interesting tools for specifically distinguishing these cancer types from a normal state.

[0060] Moreover, this antibody also detected its target protein on the surface of ovary tumor cell lines and in cells transfected with a SLC39A10 encoding construct, indicating that the protein is exposed to the external environment and easily accessible to affinity ligands such as antibodies.

[0061] G protein-coupled receptor 107 (GPR107, synonyms: Protein GPR107 Precursor, Lung seven transmembrane receptor 1, Gene ID: ENSG00000148358; Transcript IDs: ENST00000347136, ENST00000372410, ENST00000372406; Protein IDs:ENSP00000336988, ENSP00000361487, ENSP00000361483) is a partially characterized protein. GPR107 has been mentioned in patent applications on pancreatic tumor (e.g. US20050260639A1) and in different patent application based on whole genome transcription profile analysis of cancer, but no supporting data are provided on the expression of GPR107 protein in tumor samples. Based on the above, in the present invention we disclose GPR107 as a protein without previous known association with the tumor class under investigation and preferably used as a marker for ovary tumor, and in general for cancers of this type. As described below, an antibody generated towards GPR107 protein shows a selective immunoreactivity in histological preparation of ovary cancer tissues, which indicates the presence of this protein in these cancer samples.

[0062] The protein was detected on the surface of tumor cell lines by anti-GPR107 antibodies, indicating that it is accessible to the binding of specific antibodies.

[0063] Protein dpy-19 homolog 3 (DPY19L3; synonym: Dpy-19-like protein 3; Gene ID: ENSG00000178904; Transcript IDs: ENST00000319326, ENST00000392250, ENST00000342179, ENST00000392248; Protein IDs: ENSP00000315672, ENSP00000376081. ENSP00000344937, ENSP00000376079) Dpy 19L3 transcript has been reported as differentially expressed in multiple myeloma (Publication Number: US20080280779A1). However not data are available at level of protein expression. In the present invention we disclose Dpy 19L3 protein as associated with tumor and preferably used as a marker for ovary tumor, and in general for these cancer types. As described below, an antibody generated towards DPY19L3 protein shows a selective immunoreactivity in histological preparation of ovary cancer tissues, which indicates the presence of this protein in these cancer samples.

[0064] Moreover the protein is detected on a panel of ovary tumor cell lines reinforcing the evidence. Finally the protein was detected on the surface of tumor cell lines by the specific antibody, suggesting that it can be exploited as target for affinity ligands with therapeutic activity.

[0065] Uncharacterized protein FLJ42986 (FLJ42986, Gene ID: ENSG00000196460; Transcript ID: ENST00000376826; Protein ID: ENSP00000366022) is a hypothetical protein without previous known association with ovarian tumor classes and is preferably used as a marker for ovary tumor and in general for these cancer types. As described below, an antibody generated towards FLJ42986 protein shows a selective immunoreactivity in histological preparation of ovary cancer tissues, which indicates the presence of this protein in these cancer samples.

[0066] Collagen, type XX, alpha 1 (COL20A1); Synonyms: Collagen alpha-1(XX) chain Precursor; Gene ID: ENSG00000101203; Protein IDs: ENSP00000323077; ENSP00000346302; ENSP00000351767; Transcript IDs: ENST00000326996; ENST00000354338; ENST00000358894) belongs to the family of collagenous domain, Fibronectin type III domain, heparin binding domain, von Willebrand type A domain proteins. COL20A1 is a protein without previous known association with tumor and is preferably used as a marker for ovary tumor and in general for these cancer types. As described below, an antibody generated towards COL20A1 protein shows a selective immunoreactivity in histological preparation of ovary tissues, which indicates the presence of this protein in these cancer samples.

[0067] Glycosyltransferase 25 domain containing 2 (GLT25D2; synonyms: Glycosyltransferase 25 family member 2 Precursor) Gene ID: ENSG00000198756; Trasnscript IDs: ENST00000367522, ENST00000361927, ENST00000367520, ENST00000367521; Protein IDs: ENSP00000356492, ENSP00000354960, ENSP00000356490, ENSP00000356491), is a protein without previous known association with ovarian tumor and is preferably used as a marker for ovary tumor and in general for these cancer types. As described below, an antibody generated towards GLT25D2 protein shows a selective immunoreactivity in histological preparation of ovarian cancer, which indicates the presence of this protein in these cancer samples.

[0068] Synaptotagmin-like 3 (SYTL3; synonyms: SLP3, SLP3-B, Synaptotagmin-like protein 3, Exophilin-6; Gene ID: ENSG00000164674; Transcript ID: ENST00000360448, ENST00000297239, ENST00000367081; Protein ID: ENSP00000353631, ENSP00000297239, ENSP00000356048) belongs the family of proteins containing C2 domain, calcium-dependent phospholipid binding, neurexin binding, phospholipid binding, protein binding, rab3A effector domain, Slp homology domain. SYTL3 is a protein without previous known association with ovarian tumor and is preferably used as a marker for ovary tumor and in general for these cancer types. As described below, an antibody generated towards SYTL3 protein shows a selective immunoreactivity in histological preparation of ovary cancer tissues, which indicates the presence of this protein in these cancer samples.

[0069] DENN/MADD domain containing 1B (DENND1B; synonyms: DENN domain-containing protein 1B, Protein FAM31B, C1orf218; Gene ID: ENSG00000162701. Transcript IDs: ENST00000294738, ENST00000367396, ENST00000400967, ENST00000235453; Protein IDs: ENSP00000294738, ENSP00000356366, ENSP00000383751, ENSP00000235453) is a poorly characterized protein without previous known association with ovarian tumors and is preferably used as a marker for ovary tumor and in general for these cancer types. As described below, an antibody generated towards DENND1B protein shows a selective immunoreactivity in histological preparation of ovary cancer tissues, which indicates the presence of this protein in these cancer samples.

[0070] Putative uncharacterized protein FLJ37107 (FLJ37107; synonyms: LOC284581; Gene ID: ENSG00000177990, Transcript ID: gi|58218993|ref|NM_001010882.1, Protein ID: gi|58218994|ref|NP_001010882.1| hypothetical protein LOC284581 [Homo sapiens], gi|74729692|sp|Q8N9I1.1|YA028_HUMAN) is an uncharacterized protein without previous known association with tumor and is preferably used as a marker for ovary tumor and in general for these cancer types. As described below, an antibody generated towards FLJ3710 protein shows a selective immunoreactivity in histological preparation of ovary cancer tissues, which indicates the presence of this protein in these cancer samples.

[0071] Chromosome 6 open reading frame 98 (C6orf98; synonym: dJ45H2.2; Gene ID: EG:387079, da ENSG00000222029 has 1 transcript: ENST00000409023, associated peptide: ENSP00000386324 and 1 exon: ENSE00001576965) is an uncharacterized protein. Analysis of human genome databases (E.g. Ensembl) erroneously assigns C6orf98 as SYNE1. Although SYNE nucleic acid sequences overlap with C6ORF98 transcript, the encoded proteins show no match. In fact C6orf98 maps on an SYNE1 untranslated region (intron) and its product derives from a different reading frame than those annotated for SYNE1 isoforms in public databases. C6orf98 is a protein without C6orf98 is a protein without previous known association with tumor and is preferably used as a marker for ovary tumor and in general for these cancer types. As described below, an antibody generated towards C6orf98 protein shows a selective immunoreactivity in histological preparation of ovary cancer tissues, which indicates the presence of this protein in these cancer samples.

[0072] Family with sequence similarity 69, member B (Fam69B; synonym: C9orf136; Gene ID: ENSG00000165716; Transcript IDs: ENST00000371692, ENST00000371691; Protein IDs: ENSP00000360757, ENSP00000360756) is a hypothetical protein without previous known association with tumor. This protein has been recently associated with Type 2 diabetes mellitus disease (8) and included in patent application on diabetes (Patent publication number: WO2008065544A2). In the present invention we disclose FAM69B as associated with tumor and preferably used as a marker for ovarian tumor and in general for these cancer types. As described below, an antibody generated towards Fam69B protein shows a selective immunoreactivity in histological preparation of ovary cancer tissues, which indicates the presence of this protein in these cancer samples.

[0073] EMI domain-containing protein 1 Precursor (EMID1; synonyms: Emilin and multimerin domain-containing protein 1, Protein Emu1; Gene ID: >OTTHUMG00000030824

TABLE-US-00001 Transcript IDs: Protein IDs: OTTHUMT00000075712 OTTHUMP00000028901, ENST00000429226 ENSP00000403816, ENST00000430127 ENSP00000399760, ENST00000435427 ENSP00000402621, ENST00000404820 ENSP00000384452, ENST00000334018 ENSP00000335481, ENST00000429415 ENSP00000409801, ENST00000448676 ENSP00000413034, ENST00000404755 ENSP00000385414, ENST00000435194 ENSP00000417004, ENST00000426629 ENSP00000403484, ENST00000457925 ENSP00000405422, ENST00000433143 ENSP00000408339, ENST00000455501 ENSP00000413947),

[0074] is a poorly characterized protein. EMID1 gene is mentioned in a patent application on follicular thyroid carcinoma (Publication number US2006035244 (A1). However, no data are available on the presence of this protein in the tumor class under investigation. Therefore, we disclose EMID1 as a protein without previous known association with ovary tumors and preferably used as a marker for ovary tumor and in general for these cancer types. As described below, an antibody generated towards EMID1 protein shows a selective immunoreactivity in histological preparation of ovary cancer tissues, which indicates the presence of this protein in these cancer samples. In particular this antibody stains also tumor secretion products indicating that EMID1 protein is specifically released by tumor cells.

[0075] Killer cell lectin-like receptor subfamily G member 2 (C-type lectin domain family 15 member B) (KLRG2, synonyms: CLEC15B, FLJ44186; GENE ID: ENSG00000188883; Transcript IDs: ENST00000340940, ENST00000393039; Protein IDs: ENSP00000339356, ENSP00000376759) is a poorly characterized protein. A KLRG2 sequence is included in a patent application on the use of an agent with tumor-inhibiting action of a panel of targets associated with different tumors, whose expression is mainly shown at RNA level (Publication number WO2005030250). However no data are provided documenting the presence of KLRG2 protein in the tumors. Moreover, no experimental evidence is given on the specificity of the proposed anti-tumor agent for KLRG2. Based on these considerations, in the present invention we disclose KLRG2 as a protein without previous known association with tumor class under investigation and preferably used as a marker for ovary tumor, and in general for cancers of this type. As described below, an antibody generated towards KLRG2 protein shows a selective immunoreactivity in histological preparation of ovary cancer tissues, which indicates the presence of this protein in this cancer type. In particular, the immunoistochemistry staining accumulates in the plasma membrane of tumor cells. Moreover, localization analysis of tumor cell lines showed that the protein is exposed on the cell surface and accessible to the binding of specific antibodies. Finally, silencing of KLRG2 significantly reduced the invasiveness and proliferation properties of tumor cells lines. Overall, KLRG2 is a likely target for the development of anti-cancer therapies being accessible to the action of affinity ligands and being involved in cellular processes relevant for tumor development.

[0076] Endoplasmic reticulum metallopeptidase 1 (ERMP1, synonyms: FLJ23309, FXNA, KIAA1815; GENE ID: ENSG00000099219; Transcript IDs: ENST00000214893, ENST00000339450, ENST00000381506; Protein IDs: ENSP00000214893, ENSP00000340427, ENSP00000370917) is a transmembrane metallopeptidase, so far described as localized to the endoplasmic reticulum. ERMP1 transcript has been found differentially expressed in the rat ovary at the time of folliculogenesis. A lower level of ERMP1 transcript in the rat ovary resulted in substantial loss of primordial, primary and secondary follicles, and structural disorganization of the ovary, suggesting that is required for normal ovarian histogenesis (9). ERMP1 has been also included in a patent application (Publication Number: US 2003064439) on novel nucleic acid sequences encoding melanoma associated antigen molecules. However in these publications, no solid data documented the relation of ERMP1 protein with tumor. Based on available information, ERMP1 protein has never been previously associated with tumor. In the present invention, differently with published scientific data, we disclose ERMP1 as a protein associated with tumor, preferably used as a marker for ovarian tumor, and in general for cancers of this type. As described below, an antibody generated towards ERMP1 protein shows a selective immunoreactivity in histological preparation of ovary cancer tissues, which indicates the presence of this protein in this cancer type. In particular our immunoistochemistry analysis of ovarian tissues indicates that the protein shows plasma membrane localization. Moreover, localization analysis of ovary tumor cell lines showed that the protein is exposed on the cell surface and accessible to the binding of specific antibodies. Finally, silencing of ERMP1 significantly reduced the invasiveness and proliferation properties of tumor cells lines. Based on the above evidences, ERMP1 is a likely target for the development of anti-cancer therapies being exposed to the action of affinity ligand and being involved in cellular processes relevant for tumor development.

[0077] Vitelline membrane outer layer protein 1 homolog Precursor (VMO1, synonyms: ERGA6350, PRO21055; GeneID: ENSG00000182853; Transcript ID: ENST00000328739, ENST00000354194, ENST00000416307, ENST00000441199; Protein IDs: ENSP00000328397, ENSP00000346133, ENSP00000390450, ENSP00000408166) is a marginally characterized protein. Evidences on the expression of VMO1 human protein are essentially based on studies in which the protein was detected in sputum from smoking human female and urine (10, 11).

[0078] In the present invention we disclose VMO1 as a protein without previous known association with tumor classes under investigation and preferably used as a marker for ovarian tumor, and in general for cancers of this type. As described below, an antibody generated towards VMO1 protein shows a selective immunoreactivity in histological preparation of ovarian cancer tissues, which indicates the presence of this protein in this cancer type. Immunoreactivity accumulates at the plasma membrane of tumor cells, providing a first indication that this protein is accessible to the action of affinity ligands (such as antibodies) with anti-cancer activities.

[0079] A further aspect of this invention is a method of screening a tissue sample for malignancy, which comprises determining the presence in said sample of at least one of the above-mentioned tumor markers. This method includes detecting either the marker protein, e.g. by means of labeled monoclonal or polyclonal antibodies that specifically bind to the target protein, or the respective mRNA, e.g. by means of polymerase chain reaction techniques such as RT-PCR. The methods for detecting proteins in a tissue sample are known to one skilled in the art and include immunoradiometric, immunoenzymatic or immunohistochemical techniques, such as radioimmunoassays, immunofluorescent assays or enzyme-linked immunoassays. Other known protein analysis techniques, such as polyacrylamide gel electrophoresis (PAGE), Western blot or Dot blot are suitable as well. Preferably, the detection of the protein marker is carried out with the immune-histochemistry technology, particularly by means of High Through-Put methods that allow the analyses of the antibody immune-reactivity simultaneously on different tissue samples immobilized on a microscope slide. Briefly, each Tissue Micro Array (TMA) slide includes tissue samples suspected of malignancy taken from different patients, and an equal number of normal tissue samples from the same patients as controls. The direct comparison of samples by qualitative or quantitative measurement, e.g. by enzimatic or colorimetric reactions, allows the identification of tumors.

[0080] In one embodiment, the invention provides a method of screening a sample of ovary tissue for malignancy, which comprises determining the presence in said sample of the SLC39A10, GPR107, DPY19L3, FLJ42986, COL20A1, GLT25D2, SYTL3, DENND1B, FLJ3710, C6orf98, FAM69B, EMID1, KLRG2, ERMP1, VMO1 protein tumor marker, variants or isoforms thereof as described above.

[0081] A further aspect of the invention is a method in vitro for determining the presence of an ovary tumor in a subject, which comprises the steps of:

[0082] (1) providing a sample of the tissue suspected of containing tumor cells;

[0083] (2) determining the presence of a tumor marker as above defined, or a combination thereof in said tissue sample by detecting the expression of the marker protein or the presence of the respective mRNA transcript;

[0084] wherein the detection of one or more tumor markers in the tissue sample is indicative of the presence of tumor in said subject.

[0085] The methods and techniques for carrying out the assay are known to one skilled in the art and are preferably based on immunoreactions for detecting proteins and on PCR methods for the detection of mRNAs. The same methods for detecting proteins or mRNAs from a tissue sample as disclosed above can be applied.

[0086] A further aspect of this invention is the use of the tumor markers herein provided as targets for the identification of candidate antitumor agents. Accordingly, the invention provides a method for screening a test compound which comprises contacting the cells expressing a tumor-associated protein selected from: Solute carrier family 39 (zinc transporter), member 10 (SLC39A10); G protein-coupled receptor 107 (GPR107); DPY19-like 3 (DPY19L3); Uncharacterized protein FLJ42986 (FLJ42986); Collagen, type XX, alpha 1 (COL20A1); Glycosyltransferase 25 domain containing 2 (GLT25D2); Synaptotagmin-like 3 (SYTL3); DENN/MADD domain containing 1B (DENND1B); Putative uncharacterized protein (FLJ3710); Chromosome 6 open reading frame 98 (C6orf98); Family with sequence similarity 69, member B (Fam69B); EMI domain-containing protein 1 Precursor (EMID1); Killer cell lectin-like receptor subfamily G member 2 (KLRG2); Endoplasmic reticulum metallopeptidase 1 (ERMP1); Vitelline membrane outer layer protein 1 homolog precursor (VMO1);

[0087] with the test compound, and determining the binding of said compound to said tumor-associated protein. In addition, the ability of the test compound to modulate the activity of each target molecule can be assayed.

[0088] A further aspect of the invention is an antibody or a fragment thereof, which is able to specifically recognize and bind to one of the tumor-associated proteins described above. The term "antibody" as used herein refers to all types of immunoglobulins, including IgG, IgM, IgA, IgD and IgE. Such antibodies may include polyclonal, monoclonal, chimeric, single chain, antibodies or fragments such as Fab or scFv. The antibodies may be of various origin, including human, mouse, rat, rabbit and horse, or chimeric antibodies. The production of antibodies is well known in the art. For the production of antibodies in experimental animals, various hosts including goats, rabbits, rats, mice, and others, may be immunized by injection with polypeptides of the present invention or any fragment or oligopeptide or derivative thereof which has immunogenic properties or forms a suitable epitope. Monoclonal antibodies may be produced following the procedures described in Kohler and Milstein, Nature 265:495 (1975) or other techniques known in the art.

[0089] The antibodies to the tumor markers of the invention can be used to detect the presence of the marker in histologic preparations or to distinguish tumor cells from normal cells. To that purpose, the antibodies may be labeled with radiocative, fluorescent or enzyme labels.

[0090] In addition, the antibodies can be used for treating proliferative diseases by modulating, e.g. inhibiting or abolishing the activity of a target protein according to the invention. Therefore, in a further aspect the invention provides the use of antibodies to a tumor-associated protein selected from: Solute carrier family 39 (zinc transporter), member 10 (SLC39A10); G protein-coupled receptor 107 (GPR107); DPY19-like 3 (DPY19L3); Uncharacterized protein FLJ42986 (FLJ42986); Collagen type XX, alpha 1 (COL20A1); glycosyltransferase 25 domain containing 2 (GLT25D2); Synaptotagmin-like 3 (SYTL3); DENN/MADD domain containing 1B (DENND1B); Putative uncharacterized protein (FLJ3710); Chromosome 6 open reading frame 98 (C6orf98); Family with sequence similarity 69, member B (Fam69B); EMI domain-containing protein 1 Precursor (EMID1); Killer cell lectin-like receptor subfamily G member 2 (KLRG2); Endoplasmic reticulum metallopeptidase 1 (ERMP1); Vitelline membrane outer layer protein 1 homolog Precursor (VMO1);

[0091] for the preparation of a therapeutic agent for the treatment of proliferative diseases. For use in therapy, the antibodies can be formulated with suitable carriers and excipients, optionally with the addition of adjuvants to enhance their effetcs.

[0092] A further aspect of the invention relates to a diagnostic kit containing suitable means for detection, in particular the polypeptides or polynucleotides, antibodies or fragments or derivatives thereof described above, reagents, buffers, solutions and materials needed for setting up and carrying out the immunoassays, nucleic acid hybridization or PCR assays described above. Parts of the kit of the invention can be packaged individually in vials or bottles or in combination in containers or multicontainer units.

DESCRIPTION OF THE FIGURES

[0093] FIG. 1. Analysis of purified SLC39A10 recombinant protein

[0094] Left panel: Comassie staining of purified His-tag SLC39A10 fusion protein separated by SDS-PAGE; Right panel: WB on the recombinant SLC39A10 protein stained with anti-SLC39A10 antibody. Arrow marks the protein band of the expected size. The low molecular weight bands correspond to partially degraded forms of SLC39A10 protein. Molecular weight markers are reported on the left.

[0095] FIG. 2. Staining of ovary tumor TMA with anti-SLC39A10 antibodies

[0096] In the case of the ovarian sample, both tumor and normal cells are represented on the same image. The antibody stains specifically tumor cells (in dark gray); negative or poor staining is visible in normal cells.

[0097] FIG. 3. Confocal microscopy analysis of expression and localization of SLC39A10 in transfected cells

[0098] HeLa cells transfected with the empty pcDNA3 vector (upper panels) or with the plasmid construct encoding the SLC39A10 gene (lower panels) stained with secondary antibodies (left panels) and with anti-SLC39A10 antibodies (right panels). Arrowheads mark surface specific localization.

[0099] FIG. 4. Expression and localization of SLC39A10 in ovary tumor cell lines

[0100] Flow cytometry analysis of SLC39A10 cell-surface localization in OVCAR-5 (left panel) and OVCAR-8 (right panel) tumor cells stained with a negative control antibody (filled curve or with anti-SLC39A10 antibody (empty curve). X axis, Fluorescence scale; Y axis, Cells (expressed as % relatively to major peaks).

[0101] FIG. 5. Analysis of purified GPR107 recombinant protein

[0102] Left panel: Comassie staining of purified His-tag GPR107 fusion protein separated by SDS-PAGE; Right panel: WB on the recombinant GPR107 protein stained with anti-GPR107 antibody. Arrow marks the protein band of the expected size. The high molecular weight band is consistent with a protein dimer of GPR107. Molecular weight markers are reported on the left.

[0103] FIG. 6. Staining of ovary tumor TMA with anti-GPR107 antibodies

[0104] In the case of the ovarian sample, both tumor and normal cells are represented on the same image. The antibody stains specifically tumor cells (in dark gray); negative or poor staining is visible in normal cells.

[0105] FIG. 7. Expression and localization of GPR107 in ovary tumor cells

[0106] Flow cytometry analysis of GPR107 cell surface localization in OVCAR-8 cells stained with a control antibody (filled curve or with anti-GPR107 antibody (empty curve). X axis, Fluorescence scale; Y axis, Cells (expressed as percentage, relatively to major peaks).

[0107] FIG. 8. Analysis of purified DPY19L3 recombinant protein

[0108] Left panel: Comassie staining of purified His-tag DPY19L3 fusion protein separated by SDS-PAGE; Right panel: WB on the recombinant DPY19L3 protein stained with anti-DPY19L3 antibody. Arrow marks the protein band of the expected size. Molecular weight markers are reported on the left.

[0109] FIG. 9. Staining of ovary tumor TMA with anti-DPY19L3 antibodies

[0110] In the case of the ovarian sample, both tumor and normal cells are represented on the same image. The antibody stains specifically tumor cells (in dark gray); negative or poor staining is visible in normal cells.

[0111] FIG. 10. Expression and localization of DPY19L3 in ovary tumor cell lines

[0112] Left panel: Western blot analysis of DPY19L3 expression in total protein extracts separated by SDS-PAGE from the ovary-derived tumor cells OVCAR-3 (lane 1), OVCAR-4 (lane 2), OVCAR-5 (lane 3), OVCAR-8 (lane 4). Arrow marks the protein band of the expected size. Molecular weight markers are reported on the left.

[0113] Right panel: Flow cytometry analysis of DPY19L3 cell surface localization in OVCAR-5 cells stained with a negative control antibody (filled curve or with anti-DPY19L3 antibody (empty curve). X axis, Fluorescence scale; Y axis, Cells (expressed as percentage, relatively to major peaks).

[0114] FIG. 11. Analysis of purified FLJ42986 recombinant protein

[0115] Left panel: Comassie staining of purified His-tag FLJ42986 fusion protein separated by SDS-PAGE; Right panel: WB on the recombinant FLJ42986 protein stained with anti-FLJ42986 antibody. Arrow marks the protein band of the expected size. Molecular weight markers are reported on the left.

[0116] FIG. 12. Staining of ovary tumor TMA with anti-FLJ42986 antibodies

[0117] In the case of the ovarian sample, both tumor and normal cells are represented on the same image. The antibody stains specifically tumor cells (in dark gray); negative or poor staining is visible in normal cells.

[0118] FIG. 13. Analysis of purified COL20A1 recombinant protein

[0119] Left panel: Comassie staining of purified His-tag COL20A1 fusion protein separated by SDS-PAGE; Right panel: WB on the recombinant COL20A1 protein stained with anti-COL20A1 antibody. Arrow marks the protein band of the expected size. The high molecular weight band is consistent with the formation of protein dimer. Molecular weight markers are reported on the left.

[0120] FIG. 14. Staining of ovary tumor TMA with anti-COL20A1 antibodies

[0121] In the case of the ovarian sample, both tumor and normal cells are represented on the same image. The antibody stains specifically tumor cells (in dark gray); negative or poor staining is visible in normal cells.

[0122] FIG. 15. Analysis of purified GLT25D2 recombinant protein

[0123] Left panel: Comassie staining of purified His-tag GLT25D2 fusion protein separated by SDS-PAGE; Right panel: WB on the recombinant GLT25D2 protein stained with anti-GLT25D2 antibody. Arrow marks the protein band of the expected size. Molecular weight markers are reported on the left.

[0124] FIG. 16. Staining of ovary tumor TMA with anti-GLT25D2 antibodies

[0125] In the case of the ovarian sample, both tumor and normal cells are represented on the same image. The antibody stains specifically tumor cells (in dark gray); negative or poor staining is visible in normal cells.

[0126] FIG. 17. Analysis of purified SYTL3 recombinant protein Left panel: Comassie staining of purified His-tag SYTL3 fusion protein espressed; Right panel: WB on the recombinant SYTL3 protein stained with anti-SYTL3 antibody. Arrow marks the protein band of the expected size. Molecular weight markers are reported on the left.

[0127] FIG. 18. Staining of ovary tumor TMA with anti-SYTL3 antibodies

[0128] In the case of the ovarian sample, both tumor and normal cells are represented on the same image. The antibody stains specifically tumor cells (in dark gray); negative or poor staining is visible in normal cells.

[0129] FIG. 19. Analysis of purified DENND1B recombinant protein

[0130] Left panel: Comassie staining of purified His-tag DENND1B fusion protein separated by SDS-PAGE; Right panel: WB on the recombinant DENND1B protein stained with anti-DENND1B antibody. Arrow marks the protein band of the expected size. The low molecular weight band corresponds to partially degraded forms of the protein. Molecular weight markers are reported on the left.

[0131] FIG. 20. Staining of ovary tumor TMA with anti DENND1B antibodies

[0132] In the case of the ovarian sample, both tumor and normal cells are represented on the same image. The antibody stains specifically tumor cells (in dark gray); negative or poor staining is visible in normal cells.

[0133] FIG. 21. Analysis of purified FLJ37107 recombinant protein

[0134] Left panel: Comassie staining of purified His-tag FLJ37107 fusion protein separated by SDS-PAGE; Right panel: WB on the recombinant protein stained with anti-FLJ37107 antibody. Arrow marks the protein band of the expected size. Molecular weight markers are reported on the left.

[0135] FIG. 22. Staining of ovary tumor TMA with anti-FLJ37107 antibodies

[0136] In the case of the ovarian sample, both tumor and normal cells are represented on the same image. The antibody stains specifically tumor cells (in dark gray); negative or poor staining is visible in normal cells.

[0137] FIG. 23. Analysis of purified C6orf98 recombinant protein

[0138] Left panel: Comassie staining of purified His-tag C6orf98 fusion protein separated by SDS-PAGE; Right panel: WB on the recombinant C6orf98 protein stained with anti-C6orf98 antibody. Arrow marks the protein band of the expected size. Molecular weight markers are reported on the left.

[0139] FIG. 24. Staining of ovary tumor TMA with anti-C6orf98 antibodies

[0140] In the case of the ovarian sample, both tumor and normal cells are represented on the same image. The antibody stains specifically tumor cells (in dark gray); negative or poor staining is visible in normal cells.

[0141] FIG. 25. Analysis of purified Fam69B recombinant protein

[0142] Left panel: Comassie staining of purified His-tag Fam69B fusion protein separated by SDS-PAGE; Right panel: WB on the recombinant Fam69B protein stained with anti-Fam69B antibody. Arrow marks the protein band of the expected size. Molecular weight markers are reported on the left.

[0143] FIG. 26. Staining of ovary tumor TMA with anti-Fam69B antibodies

[0144] In the case of the ovarian sample, both tumor and normal cells are represented on the same image. The antibody stains specifically tumor cells (in dark gray); negative or poor staining is visible in normal cells.

[0145] FIG. 27. Analysis of purified EMID1 recombinant protein

[0146] Left panel: Comassie staining of purified His-tag EMID1 fusion protein separated by SDS-PAGE; Right panel: WB on the recombinant EMID1 protein stained with anti-EMID1 antibody. Arrow marks the protein band of the expected size. Molecular weight markers are reported on the left.

[0147] FIG. 28. Staining of ovary tumor TMA with anti-EMID1 antibodies

[0148] In the case of the ovarian sample, both tumor and normal cells are represented on the same image. The antibody stains specifically tumor cells (in dark gray); negative or poor staining is visible in normal cells.

[0149] FIG. 29. Analysis of purified KLRG2 recombinant protein

[0150] Left panel: Comassie staining of purified His-tag KLRG2 fusion protein separated by SDS-PAGE; Right panel: WB on the recombinant protein stained with anti-KLRG2 antibody. Arrow marks the protein band of the expected size. Molecular weight markers are reported on the left.

[0151] FIG. 30. Staining of ovary tumor TMA with anti-KLRG2 antibodies

[0152] In the case of the ovarian sample, both tumor and normal cells are represented on the same image. The antibody stains specifically tumor cells (in dark gray) and the stain clearly delineates the plasma membrane; negative or poor staining is visible in normal cells.

[0153] FIG. 31. Expression and localization of KLRG2 in tumor cell lines

[0154] Panel A. KLRG2 expression in transfected HeLa cells. Western blot analysis of KLRG2 expression in total protein extracts separated by SDS-PAGE from HeLa cells (corresponding to 2.times.10.sup.5 cells) transfected with the empty pcDNA3 vector (lane 1), with the plasmid construct encoding the isoform 2 of the KLRG2 gene (lane 2); or with the plasmid construct encoding the isoform1 of the KLRG2 gene (lane 3);

[0155] Panel B. KLRG2 expression in ovary-derived tumor cell lines. Western blot analysis of KLRG2 expression in total protein extracts separated by SDS-PAGE from OVCAR-5 (lane 1) and from OVCAR-8 tumor cells (corresponding to 2.times.10.sup.5 cells) (lane 2);

[0156] Panel C. Surface exposure of KLRG2 in tumor cell lines. Flow cytometry analysis of KLRG2 cell surface localization in OVCAR-8 cells stained with a negative control antibody (filled curve or with anti-KLRG2 antibody (empty curve). X axis, Fluorescence scale; Y axis, Cells (expressed as percentage relatively to major peaks).

[0157] FIG. 32. KLRG2 confer malignant cell phenotypes

[0158] The proliferation and the migration/invasive phenotypes of MCF7 cell line were assessed after transfection with KLRG2-siRNA and a scrambled siRNA control using the MTT and the Boyden in vitro invasion assay, respectively.

[0159] Panel A. Cell migration/invasiveness measured by the Boyden migration assay. The graph represents the reduced migration/invasiveness of MCF7 treated with the KLRG2 specific siRNA. Small boxes below the columns show the visual counting of the migrated cells.

[0160] Panel B. Cell proliferation determined by the MTT incorporation assay. The graph represents the reduced proliferation of the MCF7 tumor cells upon treatment with KLRG2-siRNA, as determined by spectrophotometric reading.

[0161] FIG. 33. Analysis of purified ERMP1 recombinant protein

[0162] Left panel: Comassie staining of purified His-tag ERMP1 fusion protein separated by SDS-PAGE; Right panel: WB on the recombinant protein stained with anti-ERMP1 antibody. Arrow marks the protein band of the expected size. Molecular weight markers are reported on the left.

[0163] FIG. 34. Staining of ovary tumor TMA with anti-ERMP1 antibodies

[0164] In the case of the ovarian sample, both tumor and normal cells are represented on the same image. The antibody stains specifically tumor cells (in dark gray) and the stain clearly delineates the plasma membrane; negative or poor staining is visible in normal cells.

[0165] FIG. 35. Expression and localization of ERMP1 in tumor cell lines

[0166] Panel A. ERMP1 expression in transfected HEK-293T cells. Western blot analysis of ERMP1 expression in total protein extracts separated by SDS-PAGE from HEK-293T cells (corresponding to 2.times.10.sup.5 cells) transfected with the empty pcDNA3 vector (lane 1) or with the plasmid construct encoding the ERMP1 gene (lane 2).

[0167] Panel B. Western blot analysis of ERMP1 expression in total protein extracts separated by SDS-PAGE from OVCAR-4 tumor cells (corresponding to 2.times.10.sup.5 cells) (lane 1). Arrow marks the ERMP1 band. Molecular weight markers are reported on the left.

[0168] Panel C. Flow cytometry analysis of ERMP1 cell surface localization in OVCAR-8 tumor cells stained with a negative control antibody (filled curve or with anti-ERMP1 antibody (empty curve). X axis, Fluorescence scale; Y axis, Cells (expressed as % relatively to major peaks).

[0169] FIG. 36. ERMP1 confers malignant cell phenotypes

[0170] The proliferation and the invasive properties of the MCF7 cell line were assessed after transfection with ERMP1-siRNA and a scramble siRNA control using the MTT and the Boyden in vitro invasion assay, respectively.

[0171] Panel A. Cell migration/invasiveness measured by the Boyden migration assay. The graph represents the reduced migration/invasiveness of MCF7 treated with the ERMP1-specific siRNA. Small boxes below the columns show the visual counting of the migrated cells.

[0172] Panel B. Cell proliferation determined by the MTT incorporation assay. The graph represents the reduced proliferation of the MCF7 tumor cells upon treatment with ERMP1-siRNA, as determined by spectrophotometric reading.

[0173] FIG. 37. Analysis of purified VMO1 recombinant protein

[0174] Left panel: Comassie staining of purified His-tag VMO1 fusion protein separated by SDS-PAGE; Right panel: WB on the purified VMO1 protein stained with a specific antibody. Arrow marks the protein band of the expected size. Molecular weight markers are reported on the left

[0175] FIG. 38. Staining of ovary tumor TMA with anti-VMO1 antibodies

[0176] In the case of the ovarian sample, both tumor and normal cells are represented on the same image. The antibody stains specifically tumor cells (in dark gray); negative or poor staining is visible in normal cells.

[0177] The following examples further illustrate the invention.

EXAMPLES

Example 1. Generation of Recombinant Human Protein Antigens and Antibodies to Identify Tumor Markers

[0178] Methods

[0179] The entire coding region or suitable fragments of the genes encoding the target proteins were designed for cloning and expression using bioinformatic tools with the human genome sequence as template (Lindskog M et al (2005). Where present, the leader sequence for secretion was replaced with the ATG codon to drive the expression of the recombinant proteins in the cytoplasm of E. coli. For cloning, genes were PCR-amplified from templates derived from Mammalian Gene Collection (http://mgc.nci.nih.gov/) clones or from cDNAs mixtures generated from pools of total RNA derived from Human testis, Human placenta, Human bone marrow, Human fetal brain, using specific primers. Clonings were designed so as to fuse a 10 histidine tag sequence at the 5' end, annealed to in house developed vectors, derivatives of vector pSP73 (Promega) adapted for the T4 ligation independent cloning method (Nucleic Acids Res. 1990 Oct. 25; 18(20): 6069-6074) and used to transform E. coli NovaBlue cells recipient strain. E. coli tranformants were plated onto selective LB plates containing 100 .mu.g/ml ampicillin (LB Amp) and positive E. coli clones were identified by restriction enzyme analysis of purified plasmid followed by DNA sequence analysis. For expression, plasmids were used to transform BL21-(DE3) E. coli cells and BL21-(DE3) E. coli cells harbouring the plasmid were inoculated in ZYP-5052 growth medium (Studier, 2005) and grown at 37.degree. C. for 24 hours. Afterwards, bacteria were collected by centrifugation, lysed into B-Per Reagent containing 1 mM MgCl2, 100 units DNAse I (Sigma), and 1 mg/ml lysozime (Sigma). After 30 min at room temperature under gentle shaking, the lysate was clarified by centrifugation at 30.000 g for 40 min at 4.degree. C. All proteins were purified from the inclusion bodies by resuspending the pellet coming from lysate centrifugation in 40 mM TRIS-HCl, 1 mM TCEP {Tris(2-carboxyethyl)-phosphine hydrochloride, Pierce} and 6M guanidine hydrochloride, pH 8 and performing an IMAC in denaturing conditions. Briefly, the resuspended material was clarified by centrifugation at 30.000 g for 30 min and the supernatant was loaded on 0.5 ml columns of Ni-activated Chelating Sepharose Fast Flow (Pharmacia). The column was washed with 50 mM TRIS-HCl buffer, 1 mM TCEP, 6M urea, 60 mM imidazole, 0.5M NaCl, pH 8. Recombinant proteins were eluted with the same buffer containing 500 mM imidazole. Proteins were analysed by SDS-Page and their concentration was determined by Bradford assay using the BIORAD reagent (BIORAD) with a bovine serum albumin standard according to the manufacturer's recommendations.

[0180] To generate antisera, the purified proteins were used to immunize CD1 mice (6 week-old females, Charles River laboratories, 5 mice per group) intraperitoneally, with 3 protein doses of 20 micrograms each, at 2 week-interval. Freund's complete adjuvant was used for the first immunization, while Freund's incomplete adjuvant was used for the two booster doses. Two weeks after the last immunization animals were bled and sera collected from each animal was pooled.

[0181] Results

[0182] Gene fragments of the expected size were obtained by PCR from specific clones of the Mammalian Gene Collection or, alternatively, from cDNA generated from pools of total RNA derived from Human testis, Human placenta, Human bone marrow, Human fetal brain, using primers specific for each gene.

[0183] In particular, for the SLC39A10 gene, a DNA fragment corresponding to nucleotides 154-1287 of the transcript ENST00000359634 and encoding a protein of 378 residues, corresponding to the amino acid region from 26 to 403 of ENSP00000352656 sequence was obtained.

[0184] For the GPR107 gene, a fragment corresponding to nucleotides 291 to 968 of the transcript ENST00000347136 and encoding a protein of 226 residues, corresponding to the amino acid region from 39 to 264 of ENSP00000336988 sequence was obtained.

[0185] For the DPY19L3 gene, a fragment corresponding to nucleotides 158 to 463 of the transcript ENST00000392250 and encoding a protein of 102 residues, corresponding to the amino acid region from 1 to 102 of ENSP00000376081 sequence was obtained.

[0186] For the FLJ42986 gene, a fragment corresponding to nucleotides 1287 to 1717 of the transcript ENST00000376826 and encoding protein of 144 residues, corresponding to the amino acid region from 30 to 173 of ENSP00000366022 sequence was obtained.

[0187] For the COL20A1 gene, a fragment corresponding to nucleotides 577 to 1095 of the transcript ENST00000354338 and encoding a protein of 173 residues, corresponding to the amino acid region from 193 to 365 of ENSP00000346302 sequence was obtained.

[0188] For the GLT25D2 gene, a fragment corresponding to nucleotides 454 to 831 of the transcript ENST00000361927 and encoding a protein of 126 residues, corresponding to the amino acid region from 25 to 153 of ENSP00000354960 sequence was obtained.

[0189] For the SYTL3 gene, a fragment corresponding to nucleotides 267 to 569 of the transcript ENST00000360448 and encoding a protein of 101 residues, corresponding to the amino acid region from 50 to 150 of ENSP00000353631 sequence was obtained.

[0190] For the DENND1B gene, a fragment corresponding to nucleotides 563 to 1468 of the transcript ENST00000235453 and encoding a protein of 302 residues, corresponding to the amino acid region from 95 to 396 of ENSP00000235453 sequence was obtained.

[0191] For the FLJ37107 gene, a fragment corresponding to nucleotides 661-972 of the transcript gi|58218993|ref|NM_001010882.1 and encoding a protein of 104 residues, corresponding to the amino acid region from 1 to 104 of gi|58218994|ref|NP_001010882.1 sequence was obtained.

[0192] For the C6orf98 gene, a fragment corresponding to nucleotides 67 to 396 of the transcript ENST00000409023 and encoding a protein of 110 residues, corresponding to the amino acid region from 22 to 132 of ENSP00000386324 sequence was obtained.

[0193] For the Fam69B gene, a fragment corresponding to nucleotides 233 to 688 of the transcript ENST00000371692 and encoding a protein of 152 residues, corresponding to the amino acid region from 49 to 200 of ENSP00000360757 sequence was obtained.

[0194] For the EMID1 gene, a fragment corresponding to nucleotides 203 to 670 of the transcript OTTHUMT00000075712 and encoding a protein of 156 residues, corresponding to the amino acid region from 33 to 188 of OTTHUMP00000028901 sequence was obtained.

[0195] For the KLRG2 gene, a fragment corresponding to nucleotides 70 to 849 of the transcript ENST00000340940 and encoding a protein of 260 residues, corresponding to the amino acid region from 1 to 260 of ENSP00000339356 sequence was obtained.

[0196] For the ERMP1 gene, a fragment corresponding to nucleotides 55 to 666 of the transcript ENST00000339450 and encoding a protein of 204 residues, corresponding to the amino acid region from 1 to 204 of ENSP00000340427 sequence was obtained.

[0197] For the VMO1 gene, a fragment corresponding to nucleotides 157 to 690 of the transcript ENST00000328739 and encoding a protein of 178 residues corresponding to the amino acid region from 25-202 of ENSP00000328397 sequence was obtained.

[0198] A clone encoding the correct amino acid sequence was identified for each gene/gene fragment and, upon expression in E. coli, a protein of the correct size was produced and subsequently purified using affinity chromatography (FIGS. 1; 5; 8; 11; 13; 15; 17; 19; 21; 23; 25, 27, 29, 33, 37, left panels). As shown in the figures, in some case SDS-PAGE analysis of affinity-purified recombinant proteins revealed the presence of extra bands, of either higher and/or lower masses. Mass spectrometry analysis confirmed that they corresponded to either aggregates or degradation products of the protein under analysis.

[0199] Antibodies generated by immunization specifically recognized their target proteins in Western blot (WB) (FIGS. 1; 5; 8; 11; 13; 15; 17; 19; 21; 23; 25, 27, 29, 33, 37, right panels).

Example 2. Tissue Profiling by Immune-Histochemistry

[0200] Methods

[0201] The analysis of the antibodies' capability to recognize their target proteins in tumor samples was carried out by Tissue Micro Array (TMA), a miniaturized immuno-histochemistry technology suitable for HTP analysis that allows to analyse the antibody immuno-reactivity simultaneously on different tissue samples immobilized on a microscope slide.

[0202] Since the TMAs include both tumor and healthy tissues, the specificity of the antibodies for the tumors can be immediately appreciated. The use of this technology, differently from approaches based on transcription profile, has the important advantage of giving a first hand evaluation on the potential of the markers in clinics. Conversely, since mRNA levels not always correlate with protein levels (approx. 50% correlation), studies based on transcription profile do not provide solid information regarding the expression of protein markers.

[0203] A tissue microarray was prepared containing formalin-fixed paraffin-embedded cores of human tissues from patients affected by ovary cancer and analyzed using the specific antibody sample. In total, the TMA design consisted in 20 ovary tissue samples from 5 well pedigreed patients, each including both tumor and normal epithelium (equal to four tumor samples from each patient) to identify promising target molecules differentially expressed in cancer and normal cells. The direct comparison between tumor and normal epithelium of each patient allowed the identification of antibodies that stain specifically tumor cells and provided indication of target expression in ovary tumor. To confirm the association of each protein with ovary tumors a tissue microarray was prepared containing 100 formalin-fixed paraffin-embedded cores of human ovary tissues from 50 patients (equal to two tissue samples from each patient).

[0204] All formalin fixed, paraffin embedded tissues used as donor blocks for TMA production were selected from the archives at the IEO (Istituto Europeo Oncologico, Milan). Corresponding whole tissue sections were examined to confirm diagnosis and tumour classification, and to select representative areas in donor blocks. Normal tissues were defined as microscopically normal (non-neoplastic) and were generally selected from specimens collected from the vicinity of surgically removed tumors. The TMA production was performed essentially as previously described (Kononen J et al (1998) Nature Med. 4:844-847; Kallioniemi O P et a/(2001) Hum. MoI. Genet. 10:657-662). Briefly, a hole was made in the recipient TMA block. A cylindrical core tissue sample (1 mm in diameter) from the donor block was acquired and deposited in the recipient TMA block. This was repeated in an automated tissue arrayer "Galileo TMA CK 3500" BioRep (Milan) until a complete TMA design was produced. TMA recipient blocks were baked at 42<0>C for 2 h prior to sectioning. The TMA blocks were sectioned with 2-3 mm thicknes using a waterfall microtome (Leica), and placed onto poli-L-lysinated glass slides for immunohistochemical analysis. Automated immunohistochemistry was performed as previously described (Kampf C. et al (2004) Clin. Proteomics 1:285-300). In brief, the glass slides were incubated for 30' min in 60.degree. C., de-paraffinized in xylene (2.times.15 min) using the Bio-Clear solution (Midway. Scientific, Melbourne, Australia), and re-hydrated in graded alcohols. For antigen retrieval, slides were immersed 0.01 M Na-citrate buffer, pH 6.0 at 99.degree. C. for 30 min Slides were placed in the Autostainer.RTM. (DakoCytomation) and endogenous peroxidase was initially blocked with 3% H2O2, for 5 min. Slides were then blocked in Dako Cytomation Wash Buffer containing 5% Bovine serum albumin (BSA) and subsequently incubated with mouse antibodies for 30' (dilution 1:200 in Dako Real.TM. dilution buffer). After washing with DakoCytomation wash buffer, slides were incubated with the goat anti-mouse peroxidase conjugated Envision.RTM. for 30 min each at room temperature (DakoCytomation). Finally, diaminobenzidine (DakoCytomation) was used as chromogen and Harris hematoxylin (Sigma-Aldrich) was used for counterstaining. The slides were mounted with Pertex.RTM. (Histolab).

[0205] The staining results have been evaluated by a trained pathologist at the light microscope, and scored according to both the percentage of immunostained cells and the intensity of staining. The individual values and the combined score (from 0 to 300) were recorded in a custom-tailored database. Digital images of the immunocytochemical findings have been taken at a Leica DM LB light microscope, equipped with a Leica DFC289 color camera.

[0206] Results

[0207] The results from tissue profiling showed that the antibodies specific for the recombinant proteins (see Example 1) are strongly immunoreactive on ovary cancer tissues from patients with varying frequencies, indicating the presence of the target proteins in tumors tissues, while no or poor reactivity was detected in normal tissues.

[0208] Based on this finding, the detection of target proteins in tissue samples can be associated with the specific tumor/s. In some cases the antibody staining accumulated at the plasma membrane of tumor cells, providing a first indication of the cell localization of target proteins. The capability of target-specific antibodies to stain different tumor tissues is summarized in Table I. Representative examples of microscopic enlargements of tissue samples stained by each antibody are reported in FIGS. 2; 6; 9; 12; 14; 16; 18, 20; 22; 24; 26, 28, 30, 34, 38.

[0209] Table I reports the percentage of positive ovarian tumor samples after staining with the target-specific antibodies

TABLE-US-00002 TABLE I Percentage of ovary tumor samples Gene showing positive IHC staining SLC39A10 80 GPR107 40 DPY19L3 93 FLJ42986 80 COL20A1 20 GLT25D2 20 SYTL3 60 DENND1B 20 FLJ3710 80 C6orf98 100 Fam69B 73 EMID1 20* KLRG2 29** ERMP1 69** VMO1 20** *The antibody specifically stains secretion products of ovary tumor cells indicating that the corresponding protein is released by tumor cells. **The antibody stains the plasma membrane of tumor cells

Example 3. Expression of Target Proteins in Transfected Mammalian Cells

[0210] Methods

[0211] The specificity of the antibodies for each target proteins was assessed by western blot and/or confocal microscopy analysis of eukaryotic cells transiently transfected with a plasmid construct containing the complete sequence of the gene encoding the target proteins. Examples of this type of confocal microscopy experiments are represented for SLC39A10 (corresponding to Transcript ID ENST00000359634), KLRG2 (two cloned sequences corresponding to Transcripts ENST00000340940 and ENST00000393039, corresponding to two transcript variants), ERMP1 (cloned sequence corresponding to Transcripts ENST00000339450).

[0212] For cloning, cDNA were generated from pools of total RNA derived from Human testis, Human placenta, Human bone marrow, Human fetal brain, in reverse transcription reactions and the entire coding regions were PCR-amplified with specific primers pairs. PCR products were cloned into plasmid pcDNA3 (Invitrogen). HeLa and Hek-293T cells were grown in DMEM-10% FCS supplemented with 1 mM Glutamine were transiently transfected with preparation of the resulting plasmid and with the empty vector as negative control using the Lipofectamine-2000 transfection reagent (Invitrogen). After 48 hours, cells were collected and analysed by Western blot or confocal microscopy. For Western blot, cells were lysed with PBS buffer containing 1% Triton X100 and total cell extracts (corresponding to 1.times.10.sup.6 cells) were separated on pre-cast SDS-PAGE gradient gels (NuPage 4-12% Bis-Tris gel, Invitrogen) under reducing conditions, followed by electro-transfer to nitrocellulose membranes (Invitrogen) according to the manufacturer's recommendations. The membranes were blocked in blocking buffer composed of 1.times.PBS-0.1% Tween 20 (PBST) added with 10% dry milk, for 1 h at room temperature, incubated with the antibody diluted 1:2500 in blocking buffer containing 1% dry milk and washed in PBST-1%. The secondary HRP-conjugated antibody (goat anti-mouse immunoglobulin/HRP, Perkin Elmer) was diluted 1:5000 in blocking buffer and chemiluminescence detection was carried out using a Chemidoc-IT UVP CCD camera (UVP) and the Western Lightning.TM. cheminulescence Reagent Plus (Perkin Elmer), according to the manufacturer's protocol.

[0213] For confocal microscopy analysis, the cells were plated on glass cover slips and after 48 h were washed with PBS and fixed with 3% p-formaldheyde solution in PBS for 20 min at RT. For surface staining, cells were incubated overnight at 4.degree. C. with polyclonal antibodies (1:200). The cells were then stained with Alexafluor 488-labeled goat anti-mouse antibodies (Molecular Probes). DAPI (Molecular Probes) was used to visualize nuclei; Live/Dead.RTM. red fixable (Molecular Probes) was used to visualize membrane. The cells were mounted with glycerol plastine and observed under a laser-scanning confocal microscope (LeicaSP5).

[0214] Results

[0215] The selected coding sequences for SLC39A10, KLRG2 and ERMP1 were cloned in a eukaryotic expression vector and the derived plasmids were used for transient transfection of HeLa or HEK293T cell lines. Expression of target proteins KLRG2 and ERMP1 was detected in total protein extracts from HeLa and HEK-293T cells, respectively, by Western blot using the protein-specific antibodies. Overall the data confirmed that the marker-specific antibodies recognized specifically their target proteins while no or marginal staining was visible in cell transfected with the empty pcDNA3. In the case of KLRG2, specific protein bands of expected size were detected in cells transfected with either of the two plasmids encoding the two annotated KLRG2 variants (FIG. 31A). As for cells transfected with ERMP1-encoding plasmid, a band of molecular mass higher than expected (theoretical mass of approximately 100 KDa) was specifically detected by the anti-ERMP1 antibody in transfected cells indicating that the protein forms stable aggregates (FIG. 35A). Expression of protein SLC3910 was carried by confocal microscopy of transfected HeLa cells. The anti-SLC39A10 antibody specifically detected its target protein expressed by transfected cells. In particular, the antibody mainly stained the surface of transfected cells (FIG. 3).

Example 4. Expression and Localization of Target Proteins in Tumor Cell Lines

[0216] Expression of marker genes was assessed by WB and/or flow cytometry analysis of tumor cell lines. Cells were cultured in under ATCC recommended conditions, and sub-confluent cell monolayers were detached with PBS-0.5 mM EDTA for subsequent analysis. For Western blot analysis, cells were lysed by several freeze-thaw passages in PBS-1% Triton. Total protein extracts were loaded on SDS-PAGE (2.times.10.sup.5 cells/lane), and subjected to WB with specific antibodies as described above. For flow cytometry analysis, cells (2.times.10.sup.4 per well) were pelletted in 96 U-bottom microplates by centrifugation at 200.times.g for 5 min at 4.degree. C. and incubated for 1 hour at 4.degree. C. with the appropriate dilutions of the marker-specific antibodies. Cells were washed twice in PBS-5% FCS and incubated for 20 min with the appropriate dilution of R-Phycoerythrin (PE)-conjugated secondary antibodies (Jackson Immuno Research, PA, USA) at 4.degree. C. After washing, cells were analysed by a FACS Canto II flow cytometer (Becton Dickinson). Data were analyzed with FlowJo 8.3.3 program.

[0217] Results

[0218] Western blot analysis of tumor cells lines is represented for KLRG2, ERMP1 and DPY19L3. Concerning KLRG2, two major protein bands were detected in the ovary tumor cell lines OVCAR-5 and OVCAR-8, that could be ascribed to the annotated protein variants (FIG. 31B). As regards ERMP1 a band of high molecular mass was detected in the OVCAR-4 cell line (FIG. 35B) showing an electrophoretic pattern similar to that reported in transfected cells (see previous example). This further confirms the existence of stable aggregates for this protein As for DPY19L3, a band of expected size was detected by the antibody on a panel of tumor cells lines (FIG. 10, left panel). Expression and localization analysis was also performed by surface staining and flow cytometry analysis of tumor cell lines. Results, represented for SLC39A10, KLRG2, DPY10L3 and ERMP1, show that these proteins were detected on the surface of ovary tumor cell lines by the marker-specific antibodies (FIGS. 4, 31C, 35C and FIG. 10 right panel).

Example 5. Expression of the Marker Proteins Confer Malignant Cell Phenotype

[0219] To verify that the proteins included in the present invention can be exploited as targets for therapeutic applications, the effect of marker depletion was evaluated in vitro in cellular studies generally used to define the role of newly discovered proteins in tumor development. Marker-specific knock-down and control tumor cell lines were assayed for their proliferation and the migration/invasive phenotypes using the MTT assays and the Boyden in vitro invasion assay, respectively.

[0220] Method

[0221] Expression of marker genes were silenced in tumor cell lines by the siRNA technology and the influence of the reduction of marker expression on cell parameters relevant for tumor development was assessed in in vitro assays. The expression of marker genes was knocked down in a panel of epithelial tumor cell lines previously shown to express the tumor markers using a panel of marker-specific siRNAs (whose target sequence is reported in Table II) using the HiPerfect transfection reagent (QIAGEN) following the manufacturer's protocol. As control, cells treated with irrelevant siRNA (scrambled siRNA) were analysed in parallel. At different time points (ranging from 24 to 72 hours) post transfection, the reduction of gene transcription was assessed by quantitative RT-PCR (Q-RT-PCR) on total RNA, by evaluating the relative marker transcript level, using the beta-actin, GAPDH or MAPK genes as internal normalization control. Afterwards, cell proliferation and migration/invasiveness assays were carried out to assess the effect of the reduced marker expression. Cell proliferation was determined using the MTT assay, a colorimetric assay based on the cellular conversion of a tetrazolium salt into a purple colored formazan product. Absorbance of the colored solution can be quantified using a spectrophotometer to provide an estimate of the number of attached living cells. Approximately 5.times.10.sup.3 cells/100 .mu.l were seeded in 96-well plates in DMEM with 10% FCS to allow cell attachment. After overnight incubation with DMEM without FCS, the cells were treated with 2.5% FBS for 72 hours. Four hours before harvest 15 .mu.L of the MTT dye solution (Promega) were added to each well. After 4-hour incubation at 37.degree. C., the formazan precipitates were solubilized by the addition of 100 .mu.L of solubilization solution (Promega) for 1 h at 37.degree. C. Absorbance at 570 nm was determined on a multiwell plate reader (SpectraMax, Molecular Devices).

[0222] Cell migration/invasiveness was tested using the Boyden in vitro invasion assay, as compared to control cell lines treated with a scramble siRNA. This assay is based on a chamber of two medium-filled compartments separated by a microporous membrane. Cells are placed in the upper compartment and are allowed to migrate through the pores of the membrane into the lower compartment, in which chemotactic agents are present. After an appropriate incubation time, the membrane between the two compartments is fixed and stained, and the number of cells that have migrated to the lower side of the membrane is determined. For this assay, a transwell system, equipped with 8-.mu.m pore polyvinylpirrolidone-free polycarbonate filters, was used. The upper sides of the porous polycarbonate filters were coated with 50 .mu.g/cm.sup.2 of reconstituted Matrigel basement membrane and placed into six-well culture dishes containing complete growth medium. Cells (1.times.10.sup.4 cells/well) were loaded into the upper compartment in serum-free growth medium. After 16 h of incubation at 37.degree. C., non-invading cells were removed mechanically using cotton swabs, and the microporous membrane was stained with Diff-Quick solution. Chemotaxis was evaluated by counting the cells migrated to the lower surface of the polycarbonate filters (six randomly chosen fields, mean.+-.SD).

[0223] Results

[0224] Examples of this analysis are reported for ERMP1 and KLRG2 in MCF7 tumor cell line. Gene silencing experiments with marker-specific siRNA reduced the marker transcripts (approximately 30-40 fold reduction), as determined by Q-RT_PCR. Table II below reports the sequences targeted by the siRNA molecules. The reduction of the expression of either of the two genes expression significantly impairs the proliferation and the invasive phenotype of the MCF7 tumor cell line (FIGS. 32 and 36). This indicates that the proteins are involved in tumor development and are therefore likely targets for the development of anti-cancer therapies.

TABLE-US-00003 TABLE II Sequences of the marker-specific siRNA NCBI gene siRNA Target Sequence KLRG2 CGAGGACAATCTGGATATCAA CTGGAGCCCTCGAGCAAGAAA ERMP1 CCCGTGGTTCATCTGATATAA AAGGACTTTGCTCGGCGTTTA TACGTGGATGTTTGTAACGTA CTCGTATTGGCTCAATCATAA

REFERENCES

[0225] 1) Anderson, L., and Seilhamer, J. (1997). A comparison of selected mRNA and protein abundances in human liver. Electrophoresis 18, 533-537; [0226] 2) Chen, G., Gharib, T. G., Wang, H., Huang, C. C., Kuick, R., Thomas, D. G., Shedden, K. A., Misek, D. E., Taylor, J. M., Giordano, T. J., Kardia, S. L., Iannettoni, M. D., Yee, J., Hogg, P. J., Orringer, M. B., Hanash, S. M., and Beer, D. G. (2003) Protein profiles associated with survival in lung adenocarcinoma. Proc. Natl. Acad. Sci. U. S. A 100, 13537-13542; [0227] 3) Ginestier, C., Charafe-Jauffret, E., Bertucci, F., Eisinger, F., Geneix, J., Bechlian, D., Conte, N., Adelaide, J., Toiron, Y., Nguyen, C., Viens, P., Mozziconacci, M. J., Houlgatte, R., Birnbaum, D., and Jacquemier, J. (2002) Distinct and complementary information provided by use of tissue and DNA microarrays in the study of breast tumor markers. Am. J. Pathol. 161, 1223-1233; [0228] 4) Gygi, S. P., Rochon, Y., Franza, B. R., and Aebersold, R. (1999) Correlation between protein and mRNA abundance in yeast. Mol. Cell. Biol. 19, 1720-1730; Nishizuka, S., Charboneau, L., Young, L., Major, S., Reinhold, W. C., Waltham, M., Kouros-Mehr, H., Bussey, K. J., Lee, J. K., Espina, V., Munson, P. J., Petricoin, E., III, Liotta, L. A., and Weinstein, J. N. (2003) Proteomic profiling of the NCI-60 cancer cell lines using new high-density reverse-phase lysate microarrays. Proc. Natl. Acad. Sci. U. S. A 100, 14229-14234; [0229] 5) Tyers, M., and Mann, M. (2003) From genomics to proteomics. Nature 422, 193-197; [0230] 6) Kagara N, Tanaka N, Noguchi S, Hirano T. (2007) Zinc and its transporter ZIP10 are involved in invasive behavior of breast cancer cells. Cancer Sci. 98:692-697; [0231] 7) Kasper G, Weiser A A, Rump A, Sparbier K, Dahl E, Hartmann A, Wild P, Schwidetzky U, Castanos-Velez E, Lehmann K. (2005) Expression levels of the putative zinc transporter LIV-1 are associated with a better outcome of breast cancer patients. Int J Cancer. 117:961-973; [0232] 8) Sladek R, Rocheleau G, Rung J, Dina C, Shen L, Serre D, Boutin P, Vincent D, Belisle A, Hadjadj S, Balkau B, Heude B, Charpentier G, Hudson T J, Montpetit A, Pshezhetsky A V, Prentki M, Posner B I, Balding D J, Meyre D, Polychronakos C, Froguel P. (2007) A genome-wide association study identifies novel risk loci for type 2 diabetes. Nature. 445:881-885; [0233] 9) Garcia-Rudaz, C., Luna, F., Tapia, V., Kerr, B., Colgin, L., Galimi, F., Dissen, G. A., Rawlings, N. D. and Ojeda, S. R. (2007) Fxna, a novel gene differentially expressed in the rat ovary at the time of folliculogenesis, is required for normal ovarian histogenesis. Development. 134, 945-957; [0234] 10) Nicholas B, Skipp P, Mould R, Rennard S, Davies D E, O'Connor C D, Djukanovi R. (2006) Shotgun proteomic analysis of human-induced sputum. (2006) Proteomics. 6:4390-4401; [0235] 11) Adachi J, Kumar C, Zhang Y, Olsen J V, Mann M. The human urinary proteome contains more than 1500 proteins including a large proportion of membranes proteins. (2006) Genome Biol. 7:R80.

Sequence CWU 1

1

1081831PRTHomo sapiens 1Met Lys Val His Met His Thr Lys Phe Cys Leu Ile Cys Leu Leu Thr 1 5 10 15 Phe Ile Phe His His Cys Asn His Cys His Glu Glu His Asp His Gly 20 25 30 Pro Glu Ala Leu His Arg Gln His Arg Gly Met Thr Glu Leu Glu Pro 35 40 45 Ser Lys Phe Ser Lys Gln Ala Ala Glu Asn Glu Lys Lys Tyr Tyr Ile 50 55 60 Glu Lys Leu Phe Glu Arg Tyr Gly Glu Asn Gly Arg Leu Ser Phe Phe 65 70 75 80 Gly Leu Glu Lys Leu Leu Thr Asn Leu Gly Leu Gly Glu Arg Lys Val 85 90 95 Val Glu Ile Asn His Glu Asp Leu Gly His Asp His Val Ser His Leu 100 105 110 Asp Ile Leu Ala Val Gln Glu Gly Lys His Phe His Ser His Asn His 115 120 125 Gln His Ser His Asn His Leu Asn Ser Glu Asn Gln Thr Val Thr Ser 130 135 140 Val Ser Thr Lys Arg Asn His Lys Cys Asp Pro Glu Lys Glu Thr Val 145 150 155 160 Glu Val Ser Val Lys Ser Asp Asp Lys His Met His Asp His Asn His 165 170 175 Arg Leu Arg His His His Arg Leu His His His Leu Asp His Asn Asn 180 185 190 Thr His His Phe His Asn Asp Ser Ile Thr Pro Ser Glu Arg Gly Glu 195 200 205 Pro Ser Asn Glu Pro Ser Thr Glu Thr Asn Lys Thr Gln Glu Gln Ser 210 215 220 Asp Val Lys Leu Pro Lys Gly Lys Arg Lys Lys Lys Gly Arg Lys Ser 225 230 235 240 Asn Glu Asn Ser Glu Val Ile Thr Pro Gly Phe Pro Pro Asn His Asp 245 250 255 Gln Gly Glu Gln Tyr Glu His Asn Arg Val His Lys Pro Asp Arg Val 260 265 270 His Asn Pro Gly His Ser His Val His Leu Pro Glu Arg Asn Gly His 275 280 285 Asp Pro Gly Arg Gly His Gln Asp Leu Asp Pro Asp Asn Glu Gly Glu 290 295 300 Leu Arg His Thr Arg Lys Arg Glu Ala Pro His Val Lys Asn Asn Ala 305 310 315 320 Ile Ile Ser Leu Arg Lys Asp Leu Asn Glu Asp Asp His His His Glu 325 330 335 Cys Leu Asn Val Thr Gln Leu Leu Lys Tyr Tyr Gly His Gly Ala Asn 340 345 350 Ser Pro Ile Ser Thr Asp Leu Phe Thr Tyr Leu Cys Pro Ala Leu Leu 355 360 365 Tyr Gln Ile Asp Ser Arg Leu Cys Ile Glu His Phe Asp Lys Leu Leu 370 375 380 Val Glu Asp Ile Asn Lys Asp Lys Asn Leu Val Pro Glu Asp Glu Ala 385 390 395 400 Asn Ile Gly Ala Ser Ala Trp Ile Cys Gly Ile Ile Ser Ile Thr Val 405 410 415 Ile Ser Leu Leu Ser Leu Leu Gly Val Ile Leu Val Pro Ile Ile Asn 420 425 430 Gln Gly Cys Phe Lys Phe Leu Leu Thr Phe Leu Val Ala Leu Ala Val 435 440 445 Gly Thr Met Ser Gly Asp Ala Leu Leu His Leu Leu Pro His Ser Gln 450 455 460 Gly Gly His Asp His Ser His Gln His Ala His Gly His Gly His Ser 465 470 475 480 His Gly His Glu Ser Asn Lys Phe Leu Glu Glu Tyr Asp Ala Val Leu 485 490 495 Lys Gly Leu Val Ala Leu Gly Gly Ile Tyr Leu Leu Phe Ile Ile Glu 500 505 510 His Cys Ile Arg Met Phe Lys His Tyr Lys Gln Gln Arg Gly Lys Gln 515 520 525 Lys Trp Phe Met Lys Gln Asn Thr Glu Glu Ser Thr Ile Gly Arg Lys 530 535 540 Leu Ser Asp His Lys Leu Asn Asn Thr Pro Asp Ser Asp Trp Leu Gln 545 550 555 560 Leu Lys Pro Leu Ala Gly Thr Asp Asp Ser Val Val Ser Glu Asp Arg 565 570 575 Leu Asn Glu Thr Glu Leu Thr Asp Leu Glu Gly Gln Gln Glu Ser Pro 580 585 590 Pro Lys Asn Tyr Leu Cys Ile Glu Glu Glu Lys Ile Ile Asp His Ser 595 600 605 His Ser Asp Gly Leu His Thr Ile His Glu His Asp Leu His Ala Ala 610 615 620 Ala His Asn His His Gly Glu Asn Lys Thr Val Leu Arg Lys His Asn 625 630 635 640 His Gln Trp His His Lys His Ser His His Ser His Gly Pro Cys His 645 650 655 Ser Gly Ser Asp Leu Lys Glu Thr Gly Ile Ala Asn Ile Ala Trp Met 660 665 670 Val Ile Met Gly Asp Gly Ile His Asn Phe Ser Asp Gly Leu Ala Ile 675 680 685 Gly Ala Ala Phe Ser Ala Gly Leu Thr Gly Gly Ile Ser Thr Ser Ile 690 695 700 Ala Val Phe Cys His Glu Leu Pro His Glu Leu Gly Asp Phe Ala Val 705 710 715 720 Leu Leu Lys Ala Gly Met Thr Val Lys Gln Ala Ile Val Tyr Asn Leu 725 730 735 Leu Ser Ala Met Met Ala Tyr Ile Gly Met Leu Ile Gly Thr Ala Val 740 745 750 Gly Gln Tyr Ala Asn Asn Ile Thr Leu Trp Ile Phe Ala Val Thr Ala 755 760 765 Gly Met Phe Leu Tyr Val Ala Leu Val Asp Met Leu Pro Glu Met Leu 770 775 780 His Gly Asp Gly Asp Asn Glu Glu His Gly Phe Cys Pro Val Gly Gln 785 790 795 800 Phe Ile Leu Gln Asn Leu Gly Leu Leu Phe Gly Phe Ala Ile Met Leu 805 810 815 Val Ile Ala Leu Tyr Glu Asp Lys Ile Val Phe Asp Ile Gln Phe 820 825 830 2831PRTHomo sapiens 2Met Lys Val His Met His Thr Lys Phe Cys Leu Ile Cys Leu Leu Thr 1 5 10 15 Phe Ile Phe His His Cys Asn His Cys His Glu Glu His Asp His Gly 20 25 30 Pro Glu Ala Leu His Arg Gln His Arg Gly Met Thr Glu Leu Glu Pro 35 40 45 Ser Lys Phe Ser Lys Gln Ala Ala Glu Asn Glu Lys Lys Tyr Tyr Ile 50 55 60 Glu Lys Leu Phe Glu Arg Tyr Gly Glu Asn Gly Arg Leu Ser Phe Phe 65 70 75 80 Gly Leu Glu Lys Leu Leu Thr Asn Leu Gly Leu Gly Glu Arg Lys Val 85 90 95 Val Glu Ile Asn His Glu Asp Leu Gly His Asp His Val Ser His Leu 100 105 110 Asp Ile Leu Ala Val Gln Glu Gly Lys His Phe His Ser His Asn His 115 120 125 Gln His Ser His Asn His Leu Asn Ser Glu Asn Gln Thr Val Thr Ser 130 135 140 Val Ser Thr Lys Arg Asn His Lys Cys Asp Pro Glu Lys Glu Thr Val 145 150 155 160 Glu Val Ser Val Lys Ser Asp Asp Lys His Met His Asp His Asn His 165 170 175 Arg Leu Arg His His His Arg Leu His His His Leu Asp His Asn Asn 180 185 190 Thr His His Phe His Asn Asp Ser Ile Thr Pro Ser Glu Arg Gly Glu 195 200 205 Pro Ser Asn Glu Pro Ser Thr Glu Thr Asn Lys Thr Gln Glu Gln Ser 210 215 220 Asp Val Lys Leu Pro Lys Gly Lys Arg Lys Lys Lys Gly Arg Lys Ser 225 230 235 240 Asn Glu Asn Ser Glu Val Ile Thr Pro Gly Phe Pro Pro Asn His Asp 245 250 255 Gln Gly Glu Gln Tyr Glu His Asn Arg Val His Lys Pro Asp Arg Val 260 265 270 His Asn Pro Gly His Ser His Val His Leu Pro Glu Arg Asn Gly His 275 280 285 Asp Pro Gly Arg Gly His Gln Asp Leu Asp Pro Asp Asn Glu Gly Glu 290 295 300 Leu Arg His Thr Arg Lys Arg Glu Ala Pro His Val Lys Asn Asn Ala 305 310 315 320 Ile Ile Ser Leu Arg Lys Asp Leu Asn Glu Asp Asp His His His Glu 325 330 335 Cys Leu Asn Val Thr Gln Leu Leu Lys Tyr Tyr Gly His Gly Ala Asn 340 345 350 Ser Pro Ile Ser Thr Asp Leu Phe Thr Tyr Leu Cys Pro Ala Leu Leu 355 360 365 Tyr Gln Ile Asp Ser Arg Leu Cys Ile Glu His Phe Asp Lys Leu Leu 370 375 380 Val Glu Asp Ile Asn Lys Asp Lys Asn Leu Val Pro Glu Asp Glu Ala 385 390 395 400 Asn Ile Gly Ala Ser Ala Trp Ile Cys Gly Ile Ile Ser Ile Thr Val 405 410 415 Ile Ser Leu Leu Ser Leu Leu Gly Val Ile Leu Val Pro Ile Ile Asn 420 425 430 Gln Gly Cys Phe Lys Phe Leu Leu Thr Phe Leu Val Ala Leu Ala Val 435 440 445 Gly Thr Met Ser Gly Asp Ala Leu Leu His Leu Leu Pro His Ser Gln 450 455 460 Gly Gly His Asp His Ser His Gln His Ala His Gly His Gly His Ser 465 470 475 480 His Gly His Glu Ser Asn Lys Phe Leu Glu Glu Tyr Asp Ala Val Leu 485 490 495 Lys Gly Leu Val Ala Leu Gly Gly Ile Tyr Leu Leu Phe Ile Ile Glu 500 505 510 His Cys Ile Arg Met Phe Lys His Tyr Lys Gln Gln Arg Gly Lys Gln 515 520 525 Lys Trp Phe Met Lys Gln Asn Thr Glu Glu Ser Thr Ile Gly Arg Lys 530 535 540 Leu Ser Asp His Lys Leu Asn Asn Thr Pro Asp Ser Asp Trp Leu Gln 545 550 555 560 Leu Lys Pro Leu Ala Gly Thr Asp Asp Ser Val Val Ser Glu Asp Arg 565 570 575 Leu Asn Glu Thr Glu Leu Thr Asp Leu Glu Gly Gln Gln Glu Ser Pro 580 585 590 Pro Lys Asn Tyr Leu Cys Ile Glu Glu Glu Lys Ile Ile Asp His Ser 595 600 605 His Ser Asp Gly Leu His Thr Ile His Glu His Asp Leu His Ala Ala 610 615 620 Ala His Asn His His Gly Glu Asn Lys Thr Val Leu Arg Lys His Asn 625 630 635 640 His Gln Trp His His Lys His Ser His His Ser His Gly Pro Cys His 645 650 655 Ser Gly Ser Asp Leu Lys Glu Thr Gly Ile Ala Asn Ile Ala Trp Met 660 665 670 Val Ile Met Gly Asp Gly Ile His Asn Phe Ser Asp Gly Leu Ala Ile 675 680 685 Gly Ala Ala Phe Ser Ala Gly Leu Thr Gly Gly Ile Ser Thr Ser Ile 690 695 700 Ala Val Phe Cys His Glu Leu Pro His Glu Leu Gly Asp Phe Ala Val 705 710 715 720 Leu Leu Lys Ala Gly Met Thr Val Lys Gln Ala Ile Val Tyr Asn Leu 725 730 735 Leu Ser Ala Met Met Ala Tyr Ile Gly Met Leu Ile Gly Thr Ala Val 740 745 750 Gly Gln Tyr Ala Asn Asn Ile Thr Leu Trp Ile Phe Ala Val Thr Ala 755 760 765 Gly Met Phe Leu Tyr Val Ala Leu Val Asp Met Leu Pro Glu Met Leu 770 775 780 His Gly Asp Gly Asp Asn Glu Glu His Gly Phe Cys Pro Val Gly Gln 785 790 795 800 Phe Ile Leu Gln Asn Leu Gly Leu Leu Phe Gly Phe Ala Ile Met Leu 805 810 815 Val Ile Ala Leu Tyr Glu Asp Lys Ile Val Phe Asp Ile Gln Phe 820 825 830 35227DNAHomo sapiens 3cacgatttgg tgcagccggg gtttggtacc gagcggagag gagatgcaca cggcactcga 60gtgtgaggaa aaatagaaat gaaggtacat atgcacacaa aattttgcct catttgtttg 120ctgacattta tttttcatca ttgcaaccat tgccatgaag aacatgacca tggccctgaa 180gcgcttcaca gacagcatcg tggaatgaca gaattggagc caagcaaatt ttcaaagcaa 240gctgctgaaa atgaaaaaaa atactatatt gaaaaacttt ttgagcgtta tggtgaaaat 300ggaagattat ccttttttgg tttggagaaa cttttaacaa acttgggcct tggagagaga 360aaagtagttg agattaatca tgaggatctt ggccacgatc atgtttctca tttagatatt 420ttggcagttc aagagggaaa gcattttcac tcacataacc accagcattc ccataatcat 480ttaaattcag aaaatcaaac tgtgaccagt gtatccacaa aaagaaacca taaatgtgat 540ccagagaaag agacagttga agtgtctgta aaatctgatg ataaacatat gcatgaccat 600aatcaccgcc tacgtcatca ccatcgtttg catcatcatc ttgatcataa caacactcac 660cattttcata atgattccat tactcccagt gagcgtgggg agcctagcaa tgaaccttca 720acagagacca ataaaaccca ggaacaatct gatgttaaac taccgaaagg aaagaggaag 780aaaaaaggga ggaaaagtaa tgaaaattct gaggttatta caccaggttt tccccctaac 840catgatcagg gtgaacagta tgagcataat cgggtccaca aacctgatcg tgtacataac 900ccaggtcatt ctcatgtaca tcttccagaa cgtaatggtc atgatcctgg tcgtggacac 960caagatcttg atcctgataa tgaaggtgaa cttcgacata ctagaaagag agaagcacca 1020catgttaaaa ataatgcaat aatttctttg agaaaagatc taaatgaaga tgaccatcat 1080catgaatgtt tgaacgtcac tcagttatta aaatactatg gtcatggtgc caactctccc 1140atctcaactg atttatttac atacctttgc cctgcattgt tatatcaaat cgacagcaga 1200ctttgtattg agcattttga caaactttta gttgaagata taaataagga taaaaacctg 1260gttcctgaag atgaggcaaa tataggggca tcagcctgga tttgtggtat catttctatc 1320actgtcatta gcctgctttc cttgctaggc gtgatcttgg ttcctatcat taaccaagga 1380tgcttcaaat tccttcttac attccttgtt gcattagctg taggaacaat gagtggagac 1440gcccttcttc atctactgcc ccattctcag ggtggacatg atcacagtca ccaacatgca 1500catgggcatg gacattctca tggacatgaa tctaacaagt ttttggaaga atatgatgct 1560gtattgaaag gacttgttgc tctaggaggc atttacttgc tatttatcat tgaacactgc 1620attagaatgt ttaagcacta caaacaacaa agaggaaaac agaaatggtt tatgaaacag 1680aacacagaag aatcaactat tggaagaaag ctttcagatc acaagttaaa caatacacca 1740gattctgact ggcttcaact caagcctctt gccggaactg atgactcggt tgtttctgaa 1800gatcgactta atgaaactga actgacagat ttagaaggcc aacaagaatc ccctcctaaa 1860aattaccttt gtatagaaga ggagaaaatc atagaccatt ctcacagtga tggattacat 1920accattcatg agcatgatct ccatgctgct gcacataacc accacggcga gaacaaaact 1980gtgctgagga agcataatca ccagtggcac cacaagcatt ctcatcattc ccatggcccc 2040tgtcattctg gatccgatct gaaagaaaca ggaatagcta atatagcctg gatggtgatc 2100atgggggatg gcatccacaa cttcagtgat gggctcgcaa ttggtgcagc tttcagtgct 2160ggattgacag gaggaatcag tacttctata gccgtcttct gtcatgaact gccacatgaa 2220ttaggagatt ttgcagttct tcttaaagca ggcatgactg taaagcaagc aattgtatac 2280aacctcctct ctgccatgat ggcttacata ggcatgctca taggcacagc tgttggtcag 2340tatgccaata acatcacact ttggatcttt gcagtcactg caggcatgtt cctctatgta 2400gccttggtgg atatgcttcc agaaatgttg catggtgatg gtgacaatga agaacatggc 2460ttttgtcctg tggggcaatt catccttcag aatttaggat tgctctttgg atttgccatt 2520atgctggtga ttgccctcta tgaagataaa attgtgtttg acatccagtt ttgacctttc 2580ccagtaatca ctgttgatta cgagaatgtt accatgcagc tttgcatctg ttccttgtac 2640tgtatgcaca ttgctcaaag gaaagtcagt ggcttgcact acttacaagt ttcatagatt 2700tgagcctaac cacaagaggc tggtgcttag tactgttttc cctgcacgta ggggtctttt 2760aaaaatataa agcttgtgat aaagagagga gaatatggga ctccatgaac cagtgttgat 2820atgtttgatt aagacttttc acaaaataat catataaaac actagtctct ttattagtag 2880aaacttctgt ggctatgcag aaatagagat cgaaccaaaa aaaatcattt aaactttaaa 2940aatattttaa atggactttg gggagacatt ttttgtgtgt tttaagaatg aattgtagtg 3000ctctttaatt cagctacata tattcatgtg gtgataggga tcaacttgac acaactttga 3060aactgcataa agtagacata ggaactagag gaaagctcag gctgcattag agtatgaatt 3120tagcattggg aaaagccctt attcttgaat ctagagttac tatttttgta tatatttgca 3180tagtgtttaa acctgcagcc taaactactg aaatttgtga ttgtatgttt gtgtgagctt 3240cagtttaatg aaagattcat aatggttctt tgtattatta taatacttgg tgttggggtg 3300ttctttctgt tttgtttttt actttaattt tgttttgatt tttttttttt ttttttggcg 3360ggggtaggtg agggtttgga gcatgtggtc tttttaaaaa attgtaaccc tctagaaaat 3420atcaaagaaa tgaaccagac gtggtttaaa tagttgattt tcctatttta acagtaccaa 3480ctagttaatt gggaaatgta agttctgaat gttcacattg ctttaccagt ttggcactgg 3540aaccaagagc acatgtcgtg gctggctaca aggttgtaaa gcagaaaatc gaagtttacc 3600atgtctgtaa tgtgtacatg aagtgtcaat ttagaacagt tactaggata aactccatta 3660ttgccatggc tgtcatggta cccaagtgac ttggaagatg catttaaatt actcagctga 3720aatcacttga tcatcttgtg ccaagatatg ctgttggtgc ctgataggga ttagtctttt 3780aggtgccctg ttctcctacc ataattgtga atgatttgtg agaagtgcaa gccatgttta 3840tcctgaattt ttacttaata atttgtatta ctagtcatat gcatgtagct ttctgtttac 3900atcctatgcc acatggtctt catttatgcc aggtaaactg tatttgaact atgtgcagct 3960agctttgttt taatctgctt ggcaaccagt gtagctgctg taacaatcta tcttattgtt 4020caaatatata agagccaaac tcttttccat tccatctaaa atgttttcat ttagtactct 4080tctttcctcc tactctatga acttcaaaac aaaaacaaaa ctttgagagc agcacatgca 4140tccaggtatt tatagattat

tgccagtgtc ttttctgtat gctataagca agggagctta 4200ggtgttattt ctttaattta tgcttgaatc tgaaaaatta tttctgactt actccatggc 4260ctccttataa taagtagaag ttttatatat aattaatttt cagcattggg cactgaatta 4320ggacagtcct catctcattg cttggccctt caagcaacct agctaaaagg tgctgatatt 4380ttatttagta ctgccaactt caagtgattt agatatctat ctatctagat ttctgaacca 4440agatatattt atagttcact tttgggtttt tatacccacg gtaggattct gcattccagc 4500attaaatctg cttcatttta gaacctttat aaaagcaata gctggaatat actcccagtt 4560ttaaaataaa tgcctgattg atttaaagca agtaggttat gctgaagtat ataaagaagt 4620tttatattct ctcaaaaatg gtattatctt tctttatttg ctagattctt acaaatcttt 4680taagagggct gtaacagttg ctgctagtat tagggttcca catcattcta atgtatagtt 4740tcaagtctta atagacaatc tgaattccac tacatttctt ttggctccaa cattcctttt 4800agcttgacca gtctaattta aaatgtgttt gttggaggtc attaacgtta cttgtacaat 4860gctgtcactg tgtgacatcc atatgaattt tggtatatat caatcaatca atcaatcaca 4920ttgcattcaa tcaatcagct gtgattgatt gattatgctt agaaatacta tagtaactag 4980atgcagtgtg aattttttcc attaacaaac aaacaagtca gtggcttaaa tgtgattatg 5040gtcctgcaag gtgattcttg ctaaaatatc taaacttttg ttttgtttta actgaatcat 5100tttttaactt aaaaagctgg aaaatatcaa atgctgtttt ttttttttca ttgtcaacag 5160tggtgtgtca ttttatgtat gttcctaatg cttatggaac tcctccaaaa taaagttact 5220caaagag 522745432DNAHomo sapiens 4agttgatcac tctgaagctt tttggctaaa gcgtttgggt ttagagcttc cattactcat 60tcgccttgcc caaggcctca gcaaccgacg ttcgaaagcc aggagaaaag gcgaatgata 120aagggcgctc cacgcatgcg ttaagaagcc gccccaactc ccccgcggcg ttctttcttg 180gaacaaaact agcgcggagc cacggaactc cgcagtttgc gtagacttga atttcctatt 240cctcggacga tccatgtgga atccgaaaaa tagaaatgaa ggtacatatg cacacaaaat 300tttgcctcat ttgtttgctg acatttattt ttcatcattg caaccattgc catgaagaac 360atgaccatgg ccctgaagcg cttcacagac agcatcgtgg aatgacagaa ttggagccaa 420gcaaattttc aaagcaagct gctgaaaatg aaaaaaaata ctatattgaa aaactttttg 480agcgttatgg tgaaaatgga agattatcct tttttggttt ggagaaactt ttaacaaact 540tgggccttgg agagagaaaa gtagttgaga ttaatcatga ggatcttggc cacgatcatg 600tttctcattt agatattttg gcagttcaag agggaaagca ttttcactca cataaccacc 660agcattccca taatcattta aattcagaaa atcaaactgt gaccagtgta tccacaaaaa 720gaaaccataa atgtgatcca gagaaagaga cagttgaagt gtctgtaaaa tctgatgata 780aacatatgca tgaccataat caccgcctac gtcatcacca tcgtttgcat catcatcttg 840atcataacaa cactcaccat tttcataatg attccattac tcccagtgag cgtggggagc 900ctagcaatga accttcaaca gagaccaata aaacccagga acaatctgat gttaaactac 960cgaaaggaaa gaggaagaaa aaagggagga aaagtaatga aaattctgag gttattacac 1020caggttttcc ccctaaccat gatcagggtg aacagtatga gcataatcgg gtccacaaac 1080ctgatcgtgt acataaccca ggtcattctc atgtacatct tccagaacgt aatggtcatg 1140atcctggtcg tggacaccaa gatcttgatc ctgataatga aggtgaactt cgacatacta 1200gaaagagaga agcaccacat gttaaaaata atgcaataat ttctttgaga aaagatctaa 1260atgaagatga ccatcatcat gaatgtttga acgtcactca gttattaaaa tactatggtc 1320atggtgccaa ctctcccatc tcaactgatt tatttacata cctttgccct gcattgttat 1380atcaaatcga cagcagactt tgtattgagc attttgacaa acttttagtt gaagatataa 1440ataaggataa aaacctggtt cctgaagatg aggcaaatat aggggcatca gcctggattt 1500gtggtatcat ttctatcact gtcattagcc tgctttcctt gctaggcgtg atcttggttc 1560ctatcattaa ccaaggatgc ttcaaattcc ttcttacatt ccttgttgca ttagctgtag 1620gaacaatgag tggagacgcc cttcttcatc tactgcccca ttctcagggt ggacatgatc 1680acagtcacca acatgcacat gggcatggac attctcatgg acatgaatct aacaagtttt 1740tggaagaata tgatgctgta ttgaaaggac ttgttgctct aggaggcatt tacttgctat 1800ttatcattga acactgcatt agaatgttta agcactacaa acaacaaaga ggaaaacaga 1860aatggtttat gaaacagaac acagaagaat caactattgg aagaaagctt tcagatcaca 1920agttaaacaa tacaccagat tctgactggc ttcaactcaa gcctcttgcc ggaactgatg 1980actcggttgt ttctgaagat cgacttaatg aaactgaact gacagattta gaaggccaac 2040aagaatcccc tcctaaaaat tacctttgta tagaagagga gaaaatcata gaccattctc 2100acagtgatgg attacatacc attcatgagc atgatctcca tgctgctgca cataaccacc 2160acggcgagaa caaaactgtg ctgaggaagc ataatcacca gtggcaccac aagcattctc 2220atcattccca tggcccctgt cattctggat ccgatctgaa agaaacagga atagctaata 2280tagcctggat ggtgatcatg ggggatggca tccacaactt cagtgatggg ctcgcaattg 2340gtgcagcttt cagtgctgga ttgacaggag gaatcagtac ttctatagcc gtcttctgtc 2400atgaactgcc acatgaatta ggagattttg cagttcttct taaagcaggc atgactgtaa 2460agcaagcaat tgtatacaac ctcctctctg ccatgatggc ttacataggc atgctcatag 2520gcacagctgt tggtcagtat gccaataaca tcacactttg gatctttgca gtcactgcag 2580gcatgttcct ctatgtagcc ttggtggata tgcttccaga aatgttgcat ggtgatggtg 2640acaatgaaga acatggcttt tgtcctgtgg ggcaattcat ccttcagaat ttaggattgc 2700tctttggatt tgccattatg ctggtgattg ccctctatga agataaaatt gtgtttgaca 2760tccagttttg acctttccca gtaatcactg ttgattacga gaatgttacc atgcagcttt 2820gcatctgttc cttgtactgt atgcacattg ctcaaaggaa agtcagtggc ttgcactact 2880tacaagtttc atagatttga gcctaaccac aagaggctgg tgcttagtac tgttttccct 2940gcacgtaggg gtcttttaaa aatataaagc ttgtgataaa gagaggagaa tatgggactc 3000catgaaccag tgttgatatg tttgattaag acttttcaca aaataatcat ataaaacact 3060agtctcttta ttagtagaaa cttctgtggc tatgcagaaa tagagatcga accaaaaaaa 3120atcatttaaa ctttaaaaat attttaaatg gactttgggg agacattttt tgtgtgtttt 3180aagaatgaat tgtagtgctc tttaattcag ctacatatat tcatgtggtg atagggatca 3240acttgacaca actttgaaac tgcataaagt agacatagga actagaggaa agctcaggct 3300gcattagagt atgaatttag cattgggaaa agcccttatt cttgaatcta gagttactat 3360ttttgtatat atttgcatag tgtttaaacc tgcagcctaa actactgaaa tttgtgattg 3420tatgtttgtg tgagcttcag tttaatgaaa gattcataat ggttctttgt attattataa 3480tacttggtgt tggggtgttc tttctgtttt gttttttact ttaattttgt tttgattttt 3540tttttttttt tttggcgggg gtaggtgagg gtttggagca tgtggtcttt ttaaaaaatt 3600gtaaccctct agaaaatatc aaagaaatga accagacgtg gtttaaatag ttgattttcc 3660tattttaaca gtaccaacta gttaattggg aaatgtaagt tctgaatgtt cacattgctt 3720taccagtttg gcactggaac caagagcaca tgtcgtggct ggctacaagg ttgtaaagca 3780gaaaatcgaa gtttaccatg tctgtaatgt gtacatgaag tgtcaattta gaacagttac 3840taggataaac tccattattg ccatggctgt catggtaccc aagtgacttg gaagatgcat 3900ttaaattact cagctgaaat cacttgatca tcttgtgcca agatatgctg ttggtgcctg 3960atagggatta gtcttttagg tgccctgttc tcctaccata attgtgaatg atttgtgaga 4020agtgcaagcc atgtttatcc tgaattttta cttaataatt tgtattacta gtcatatgca 4080tgtagctttc tgtttacatc ctatgccaca tggtcttcat ttatgccagg taaactgtat 4140ttgaactatg tgcagctagc tttgttttaa tctgcttggc aaccagtgta gctgctgtaa 4200caatctatct tattgttcaa atatataaga gccaaactct tttccattcc atctaaaatg 4260ttttcattta gtactcttct ttcctcctac tctatgaact tcaaaacaaa aacaaaactt 4320tgagagcagc acatgcatcc aggtatttat agattattgc cagtgtcttt tctgtatgct 4380ataagcaagg gagcttaggt gttatttctt taatttatgc ttgaatctga aaaattattt 4440ctgacttact ccatggcctc cttataataa gtagaagttt tatatataat taattttcag 4500cattgggcac tgaattagga cagtcctcat ctcattgctt ggcccttcaa gcaacctagc 4560taaaaggtgc tgatatttta tttagtactg ccaacttcaa gtgatttaga tatctatcta 4620tctagatttc tgaaccaaga tatatttata gttcactttt gggtttttat acccacggta 4680ggattctgca ttccagcatt aaatctgctt cattttagaa cctttataaa agcaatagct 4740ggaatatact cccagtttta aaataaatgc ctgattgatt taaagcaagt aggttatgct 4800gaagtatata aagaagtttt atattctctc aaaaatggta ttatctttct ttatttgcta 4860gattcttaca aatcttttaa gagggctgta acagttgctg ctagtattag ggttccacat 4920cattctaatg tatagtttca agtcttaata gacaatctga attccactac atttcttttg 4980gctccaacat tccttttagc ttgaccagtc taatttaaaa tgtgtttgtt ggaggtcatt 5040aacgttactt gtacaatgct gtcactgtgt gacatccata tgaattttgg tatatatcaa 5100tcaatcaatc aatcacattg cattcaatca atcagctgtg attgattgat tatgcttaga 5160aatactatag taactagatg cagtgtgaat tttttccatt aacaaacaaa caagtcagtg 5220gcttaaatgt gattatggtc ctgcaaggtg attcttgcta aaatatctaa acttttgttt 5280tgttttaact gaatcatttt ttaacttaaa aagctggaaa atatcaaatg ctgttttttt 5340tttttcattg tcaacagtgg tgtgtcattt tatgtatgtt cctaatgctt atggaactcc 5400tccaaaataa agttactcaa agagagcaaa ta 54325552PRTHomo sapiens 5Met Ala Ala Leu Ala Pro Val Gly Ser Pro Ala Ser Arg Gly Pro Arg 1 5 10 15 Leu Ala Ala Gly Leu Arg Leu Leu Pro Met Leu Gly Leu Leu Gln Leu 20 25 30 Leu Ala Glu Pro Gly Leu Gly Arg Val His His Leu Ala Leu Lys Asp 35 40 45 Asp Val Arg His Lys Val His Leu Asn Thr Phe Gly Phe Phe Lys Asp 50 55 60 Gly Tyr Met Val Val Asn Val Ser Ser Leu Ser Leu Asn Glu Pro Glu 65 70 75 80 Asp Lys Asp Val Thr Ile Gly Phe Ser Leu Asp Arg Thr Lys Asn Asp 85 90 95 Gly Phe Ser Ser Tyr Leu Asp Glu Asp Val Asn Tyr Cys Ile Leu Lys 100 105 110 Lys Gln Ser Val Ser Val Thr Leu Leu Ile Leu Asp Ile Ser Arg Ser 115 120 125 Glu Val Arg Val Lys Ser Pro Pro Glu Ala Gly Thr Gln Leu Pro Lys 130 135 140 Ile Ile Phe Ser Arg Asp Glu Lys Val Leu Gly Gln Ser Gln Glu Pro 145 150 155 160 Asn Val Asn Pro Ala Ser Ala Gly Asn Gln Thr Gln Lys Thr Gln Asp 165 170 175 Gly Gly Lys Ser Lys Arg Ser Thr Val Asp Ser Lys Ala Met Gly Glu 180 185 190 Lys Ser Phe Ser Val His Asn Asn Gly Gly Ala Val Ser Phe Gln Phe 195 200 205 Phe Phe Asn Ile Ser Thr Asp Asp Gln Glu Gly Leu Tyr Ser Leu Tyr 210 215 220 Phe His Lys Cys Leu Gly Lys Glu Leu Pro Ser Asp Lys Phe Thr Phe 225 230 235 240 Ser Leu Asp Ile Glu Ile Thr Glu Lys Asn Pro Asp Ser Tyr Leu Ser 245 250 255 Ala Gly Glu Ile Pro Leu Pro Lys Leu Tyr Ile Ser Met Ala Phe Phe 260 265 270 Phe Phe Leu Ser Gly Thr Ile Trp Ile His Ile Leu Arg Lys Arg Arg 275 280 285 Asn Asp Val Phe Lys Ile His Trp Leu Met Ala Ala Leu Pro Phe Thr 290 295 300 Lys Ser Leu Ser Leu Val Phe His Ala Ile Asp Tyr His Tyr Ile Ser 305 310 315 320 Ser Gln Gly Phe Pro Ile Glu Gly Trp Ala Val Val Tyr Tyr Ile Thr 325 330 335 His Leu Leu Lys Gly Ala Leu Leu Phe Ile Thr Ile Ala Leu Ile Gly 340 345 350 Thr Gly Trp Ala Phe Ile Lys His Ile Leu Ser Asp Lys Asp Lys Lys 355 360 365 Ile Phe Met Ile Val Ile Pro Leu Gln Val Leu Ala Asn Val Ala Tyr 370 375 380 Ile Ile Ile Glu Ser Thr Glu Glu Gly Thr Thr Glu Tyr Gly Leu Trp 385 390 395 400 Lys Asp Ser Leu Phe Leu Val Asp Leu Leu Cys Cys Gly Ala Ile Leu 405 410 415 Phe Pro Val Val Trp Ser Ile Arg His Leu Gln Glu Ala Ser Ala Thr 420 425 430 Asp Gly Lys Ala Ala Ile Asn Leu Ala Lys Leu Lys Leu Phe Arg His 435 440 445 Tyr Tyr Val Leu Ile Val Cys Tyr Ile Tyr Phe Thr Arg Ile Ile Ala 450 455 460 Phe Leu Leu Lys Leu Ala Val Pro Phe Gln Trp Lys Trp Leu Tyr Gln 465 470 475 480 Leu Leu Asp Glu Thr Ala Thr Leu Val Phe Phe Val Leu Thr Gly Tyr 485 490 495 Lys Phe Arg Pro Ala Ser Asp Asn Pro Tyr Leu Gln Leu Ser Gln Glu 500 505 510 Glu Glu Asp Leu Glu Met Glu Ser Val Val Thr Thr Ser Gly Val Met 515 520 525 Glu Ser Met Lys Lys Val Lys Lys Val Thr Asn Gly Ser Val Glu Pro 530 535 540 Gln Gly Glu Trp Glu Gly Ala Val 545 550 6600PRTHomo sapiens 6Met Ala Ala Leu Ala Pro Val Gly Ser Pro Ala Ser Arg Gly Pro Arg 1 5 10 15 Leu Ala Ala Gly Leu Arg Leu Leu Pro Met Leu Gly Leu Leu Gln Leu 20 25 30 Leu Ala Glu Pro Gly Leu Gly Arg Val His His Leu Ala Leu Lys Asp 35 40 45 Asp Val Arg His Lys Val His Leu Asn Thr Phe Gly Phe Phe Lys Asp 50 55 60 Gly Tyr Met Val Val Asn Val Ser Ser Leu Ser Leu Asn Glu Pro Glu 65 70 75 80 Asp Lys Asp Val Thr Ile Gly Phe Ser Leu Asp Arg Thr Lys Asn Asp 85 90 95 Gly Phe Ser Ser Tyr Leu Asp Glu Asp Val Asn Tyr Cys Ile Leu Lys 100 105 110 Lys Gln Ser Val Ser Val Thr Leu Leu Ile Leu Asp Ile Ser Arg Ser 115 120 125 Glu Val Arg Val Lys Ser Pro Pro Glu Ala Gly Thr Gln Leu Pro Lys 130 135 140 Ile Ile Phe Ser Arg Asp Glu Lys Val Leu Gly Gln Ser Gln Glu Pro 145 150 155 160 Asn Val Asn Pro Ala Ser Ala Gly Asn Gln Thr Gln Lys Thr Gln Asp 165 170 175 Gly Gly Lys Ser Lys Arg Ser Thr Val Asp Ser Lys Ala Met Gly Glu 180 185 190 Lys Ser Phe Ser Val His Asn Asn Gly Gly Ala Val Ser Phe Gln Phe 195 200 205 Phe Phe Asn Ile Ser Thr Asp Asp Gln Glu Gly Leu Tyr Ser Leu Tyr 210 215 220 Phe His Lys Cys Leu Gly Lys Glu Leu Pro Ser Asp Lys Phe Thr Phe 225 230 235 240 Ser Leu Asp Ile Glu Ile Thr Glu Lys Asn Pro Asp Ser Tyr Leu Ser 245 250 255 Ala Gly Glu Ile Pro Leu Pro Lys Leu Tyr Ile Ser Met Ala Phe Phe 260 265 270 Phe Phe Leu Ser Gly Thr Ile Trp Ile His Ile Leu Arg Lys Arg Arg 275 280 285 Asn Asp Val Phe Lys Ile His Trp Leu Met Ala Ala Leu Pro Phe Thr 290 295 300 Lys Ser Leu Ser Leu Val Phe His Ala Ile Asp Tyr His Tyr Ile Ser 305 310 315 320 Ser Gln Gly Phe Pro Ile Glu Gly Trp Ala Val Val Tyr Tyr Ile Thr 325 330 335 His Leu Leu Lys Gly Ala Leu Leu Phe Ile Thr Ile Ala Leu Ile Gly 340 345 350 Thr Gly Trp Ala Phe Ile Lys His Ile Leu Ser Asp Lys Asp Lys Lys 355 360 365 Ile Phe Met Ile Val Ile Pro Leu Gln Val Leu Ala Asn Val Ala Tyr 370 375 380 Ile Ile Ile Glu Ser Thr Glu Glu Gly Thr Thr Glu Tyr Gly Leu Trp 385 390 395 400 Lys Asp Ser Leu Phe Leu Val Asp Leu Leu Cys Cys Gly Ala Ile Leu 405 410 415 Phe Pro Val Val Trp Ser Ile Arg His Leu Gln Glu Ala Ser Ala Thr 420 425 430 Asp Gly Lys Gly Asp Ser Met Gly Pro Leu Gln Gln Arg Ala Asn Leu 435 440 445 Arg Ala Gly Ser Arg Ile Glu Ser His His Phe Ala Gln Ala Asp Leu 450 455 460 Glu Leu Leu Ala Ser Ser Cys Pro Pro Ala Ser Val Ser Gln Arg Ala 465 470 475 480 Gly Ile Thr Ala Ala Ile Asn Leu Ala Lys Leu Lys Leu Phe Arg His 485 490 495 Tyr Tyr Val Leu Ile Val Cys Tyr Ile Tyr Phe Thr Arg Ile Ile Ala 500 505 510 Phe Leu Leu Lys Leu Ala Val Pro Phe Gln Trp Lys Trp Leu Tyr Gln 515 520 525 Leu Leu Asp Glu Thr Ala Thr Leu Val Phe Phe Val Leu Thr Gly Tyr 530 535 540 Lys Phe Arg Pro Ala Ser Asp Asn Pro Tyr Leu Gln Leu Ser Gln Glu 545 550 555 560 Glu Glu Asp Leu Glu Met Glu Ser Val Val Thr Thr Ser Gly Val Met 565 570 575 Glu Ser Met Lys Lys Val Lys Lys Val Thr Asn Gly Ser Val Glu Pro 580 585 590 Gln Gly Glu Trp Glu Gly Ala Val 595 600 7600PRTHomo sapiens 7Met Ala Ala Leu Ala Pro Val Gly Ser Pro Ala Ser Arg Gly Pro Arg 1 5 10 15 Leu Ala Ala Gly Leu Arg Leu Leu Pro Met Leu Gly Leu Leu Gln Leu 20 25 30 Leu Ala Glu Pro Gly Leu Gly Arg Val His His Leu Ala Leu Lys Asp 35 40 45 Asp Val Arg His Lys Val His Leu Asn Thr Phe Gly Phe Phe Lys Asp 50 55 60 Gly Tyr Met Val Val Asn Val Ser Ser Leu Ser Leu Asn Glu Pro Glu 65 70 75 80 Asp Lys Asp Val Thr Ile Gly Phe Ser Leu Asp Arg Thr Lys Asn Asp 85 90 95 Gly Phe Ser Ser Tyr Leu Asp Glu Asp Val Asn Tyr Cys Ile Leu Lys 100 105 110 Lys Gln Ser Val Ser Val Thr Leu Leu Ile Leu Asp Ile Ser Arg Ser 115 120 125 Glu Val Arg Val Lys Ser Pro Pro Glu Ala Gly Thr Gln Leu Pro Lys 130

135 140 Ile Ile Phe Ser Arg Asp Glu Lys Val Leu Gly Gln Ser Gln Glu Pro 145 150 155 160 Asn Val Asn Pro Ala Ser Ala Gly Asn Gln Thr Gln Lys Thr Gln Asp 165 170 175 Gly Gly Lys Ser Lys Arg Ser Thr Val Asp Ser Lys Ala Met Gly Glu 180 185 190 Lys Ser Phe Ser Val His Asn Asn Gly Gly Ala Val Ser Phe Gln Phe 195 200 205 Phe Phe Asn Ile Ser Thr Asp Asp Gln Glu Gly Leu Tyr Ser Leu Tyr 210 215 220 Phe His Lys Cys Leu Gly Lys Glu Leu Pro Ser Asp Lys Phe Thr Phe 225 230 235 240 Ser Leu Asp Ile Glu Ile Thr Glu Lys Asn Pro Asp Ser Tyr Leu Ser 245 250 255 Ala Gly Glu Ile Pro Leu Pro Lys Leu Tyr Ile Ser Met Ala Phe Phe 260 265 270 Phe Phe Leu Ser Gly Thr Ile Trp Ile His Ile Leu Arg Lys Arg Arg 275 280 285 Asn Asp Val Phe Lys Ile His Trp Leu Met Ala Ala Leu Pro Phe Thr 290 295 300 Lys Ser Leu Ser Leu Val Phe His Ala Ile Asp Tyr His Tyr Ile Ser 305 310 315 320 Ser Gln Gly Phe Pro Ile Glu Gly Trp Ala Val Val Tyr Tyr Ile Thr 325 330 335 His Leu Leu Lys Gly Ala Leu Leu Phe Ile Thr Ile Ala Leu Ile Gly 340 345 350 Thr Gly Trp Ala Phe Ile Lys His Ile Leu Ser Asp Lys Asp Lys Lys 355 360 365 Ile Phe Met Ile Val Ile Pro Leu Gln Val Leu Ala Asn Val Ala Tyr 370 375 380 Ile Ile Ile Glu Ser Thr Glu Glu Gly Thr Thr Glu Tyr Gly Leu Trp 385 390 395 400 Lys Asp Ser Leu Phe Leu Val Asp Leu Leu Cys Cys Gly Ala Ile Leu 405 410 415 Phe Pro Val Val Trp Ser Ile Arg His Leu Gln Glu Ala Ser Ala Thr 420 425 430 Asp Gly Lys Gly Asp Ser Met Gly Pro Leu Gln Gln Arg Ala Asn Leu 435 440 445 Arg Ala Gly Ser Arg Ile Glu Ser His His Phe Ala Gln Ala Asp Leu 450 455 460 Glu Leu Leu Ala Ser Ser Cys Pro Pro Ala Ser Val Ser Gln Arg Ala 465 470 475 480 Gly Ile Thr Ala Ala Ile Asn Leu Ala Lys Leu Lys Leu Phe Arg His 485 490 495 Tyr Tyr Val Leu Ile Val Cys Tyr Ile Tyr Phe Thr Arg Ile Ile Ala 500 505 510 Phe Leu Leu Lys Leu Ala Val Pro Phe Gln Trp Lys Trp Leu Tyr Gln 515 520 525 Leu Leu Asp Glu Thr Ala Thr Leu Val Phe Phe Val Leu Thr Gly Tyr 530 535 540 Lys Phe Arg Pro Ala Ser Asp Asn Pro Tyr Leu Gln Leu Ser Gln Glu 545 550 555 560 Glu Glu Asp Leu Glu Met Glu Ser Val Val Thr Thr Ser Gly Val Met 565 570 575 Glu Ser Met Lys Lys Val Lys Lys Val Thr Asn Gly Ser Val Glu Pro 580 585 590 Gln Gly Glu Trp Glu Gly Ala Val 595 600 86874DNAHomo sapiens 8ccgaggtggg cagcacaggc tcctcgacga cttcctaggt cgcaatctcc aggaaaacga 60ccacagggtc agcggagcta gccgccgagc cccgctcccc gggcccttcc ggcggctgcg 120ccctttcacc ccggacgtgg gcgggagagg aagcggctgg tgatgctgga acaaacatgg 180ccgctctggc gcccgtcggc tcccccgcct cccgcggtcc taggctggcc gcgggcctcc 240ggctgctccc aatgctgggt ttgctgcagt tgctggccga gcctggcctg ggccgcgtcc 300atcacctggc actcaaggat gatgtgaggc ataaagttca tctgaacacc tttggcttct 360tcaaggatgg gtacatggtg gtgaatgtca gtagcctctc actgaatgag cctgaagaca 420aggatgtgac tattggattt agcctagacc gtacaaagaa tgatggcttt tcttcttacc 480tggatgaaga tgtgaattac tgtattttaa agaaacagtc tgtctctgtc acccttttaa 540tcctagacat ctccagaagt gaggtaagag taaagtctcc accagaagct ggtacccagt 600taccaaagat catcttcagc agggatgaga aagtccttgg tcagagccag gagcctaatg 660ttaaccctgc ttcagcaggc aaccagaccc agaagacaca agatggtgga aagtctaaaa 720gaagtacagt ggattcaaag gccatgggag agaaatcctt ttctgttcat aataatggtg 780gggcagtgtc atttcagttt ttctttaaca tcagcactga tgaccaagaa ggcctttaca 840gtctttattt tcataaatgc cttggaaaag aattgccaag tgacaagttt acattcagcc 900ttgatattga gatcacagag aagaatcctg acagctacct ctcagcagga gaaattcctc 960tccccaaatt atacatctca atggcctttt tcttctttct ttctgggacc atctggattc 1020atatccttcg aaaacgacgg aatgatgtat ttaaaatcca ctggctgatg gcggcccttc 1080ctttcaccaa gtctctttcc ttggtgttcc atgcaattga ctaccactac atctcctccc 1140agggcttccc tatcgaaggc tgggctgttg tgtactacat aactcacctt ttgaaagggg 1200cgctactctt catcaccatt gcactcattg gcactggctg ggctttcatt aagcacatcc 1260tttctgataa agacaaaaag atcttcatga ttgtcattcc actccaggtc ctggcaaatg 1320tagcctacat catcatagag tccaccgagg agggcacgac tgaatatggc ttgtggaagg 1380actctctatt tctggtcgac ctgttgtgtt gtggtgccat cctcttccca gtggtgtggt 1440caatcagaca tttacaagaa gcatcagcaa cagatggaaa agctgctatt aacttagcaa 1500agctgaaact tttcagacat tattacgtct tgattgtgtg ttacatatac ttcactagga 1560tcattgcatt tctcctcaaa ctcgctgttc cattccagtg gaagtggctc taccagctcc 1620tggatgaaac ggccacactg gtcttctttg ttctaacggg gtataaattc cgtccggctt 1680cagataaccc ctacctacaa ctttctcagg aagaagaaga cttggaaatg gagtccgttg 1740tgacaacatc tggggtgatg gaaagtatga agaaagtcaa gaaggtgacc aacggctccg 1800tggagcccca gggcgagtgg gaaggcgccg tgtgacagag ccgaccctga ggatggcact 1860gtccaaggaa actgttaact tattcatagt cctattggac agcaggagca gctcctacag 1920tgaactattg gcaccaccga cagtgacacc agggcacatg gctggagcac agtgccgcgg 1980aaacctgatt ttgtactctc ttttatggaa acgatctgtg gctgtttaga ggcagctgga 2040tcctctttca ggcgggaatg ggagggcggg cacagggagg aggagaggaa gagaaaagga 2100agaattcatt tttaatttag gtttcttttt ttcttcttca tttcggagct ctaaggtgta 2160tgcagttgtg accccatgtg tggggaagtg tagcaaggac ggctggtgga gggggaagga 2220gggtgcgagg tgtctgtctg atgctttagg aaatgtctac tgaggaccct gggacttaag 2280aagaagggcg gggagagtgc cattgcctgt ttgggagaca aaaatgaacg aaaacaggtg 2340actttggaaa gcaaagtcaa aacccagttt aggatgtagc acctgcccca ggattcctgc 2400cctcggcttt gccccagacc cttattccag atgctgagag tgaccaggac agcagctcct 2460gaggcccagt ggtcttcttt ccaacaggaa aagaaggctg tgatgtcgct gtcaggatca 2520tgccctgtgg cacagcacag gtggtgggag gtggttttct gactgagatg ttgcctgatg 2580gatggaaaga aatgtatttt taagttcaaa aagcattatc ctgtggcgtt gcctggacat 2640ccactccctg acagcccaga gcagcactgt ctggcttccc ttcatgcttg tggctttgtt 2700gtgtttgatc agaattttgg gggaaatgga aagttttcct caaggagcag ctgggggcag 2760aataggtagt atttaagcaa atacttaagt ccaagcaaat catccccatt aaaaagcttt 2820tcctgtaggc tagtaggatt tctaaataga tgaattcaac agacttggtc cccatagtcc 2880aagagtatgt atgtgaagaa agtgagcatg attcaacagt ttcactctca gggattttag 2940gatggcaaaa tacttcacag aaactcaatg attaagttcc cttccacact tccagagctt 3000gaatgaacac aggtagccac ctaaattgag cagtattgca actcagagag aaaatcatct 3060gaatagtagg acaagctcag aaggtacatt gtgactgagg gcttaaaagg agaccaaaac 3120atggccccat cagggaagct tcttaatgct tggggggcca gctaggtagg gttgcttcca 3180aaagctggag cccacccctg cctaggggtt gtcagagagc cacacctgca ggggaacagg 3240tacctccgag ggtgagagtc gtggtctctg ggagttgttt tctcacctct ggcttagaag 3300ggtcaggcag aaaccacagg atgtggggtc acactcactg tcccaagttt gggaacctga 3360aaaagtctcc attcagaaca tggttgttct ccctgtccca tgctatctta tcttcctaaa 3420tgactaatga ggaagcgggt gttctttttc tgcactttga ttcgccatct gggttctgta 3480gggtgctctg aaggtgtgat ctgccttctg gctgatgtgg aggaagagca agcgccttcc 3540caggccacag ctgctcacct ctcggcagat attttaggca agcatccgtg tgtcttccca 3600tcttcaggag aaaggtaaat gcaccctaag tgttcacttc tggacctttt tcaagttcac 3660ttgggactgt gtgacagaag ggagttggag ggaggatggg aatattttta acactttgtt 3720ttcctgtgca gaaacataat accagttttc gcagaaatgt gtctcaatct gtgactacca 3780aagccctcct cagtccttcc ctcagaggga cacatttgct gtttctcccg caagcagatg 3840ttgtggatga ggcgatagac tccttggcaa gaacgaaagg tgtgatgaaa cctccctgct 3900cggaagggtc tccgtggagg tgtcctcatt tcacatgctg ggttttgcaa gcgaggaagc 3960caggcagtgg aggaactaga gagaggcagg cgtgtgtgtg gacaagcgct ggagccgcag 4020ccctcagact ggcacgggaa cgccagcgtt gggtgttcag attccacgcg tatgtctggg 4080ctcactcaca gcatggccga gtgtctgcag tgctggtcct gacccttcca gagcagcagt 4140ggacagatga gataagactg tttcagaaac aaagatggcc acagccttcc taacaagcag 4200gtcatctggc catgtctgta ttgtaactgg taaaaggctt caagtcagat tgatgatcaa 4260gaaaagtcaa aaccccagcc caagattggg aaagcaggtg gtggttccaa gcttttaaaa 4320aattattgaa gctctccatc ctgttctgtg agtgtgtctt ctctttctcc ttcacgtcat 4380agccgtgacc caccgttcat ctctgctctt gcgtaaagat gaccgatgga gtccaaagcc 4440aagtggcttc accagctgac aagccaccct cctgcagcct gagtttcaca gtccactggg 4500ttcgttgtca tgcggtgttt gaatggttaa gcccttgcag tatttcagat cgggcaaaaa 4560atatcggatg cacatagcag aaccattggt ggtatttata gctttgcttt gtactcctca 4620ctgtttctgc ctacgcaaaa tatccatgtt tcctctgaga aatctgttgt ggactgaaag 4680cgctgctggc tgtgaaattt aataaagtgt gtatgctttg ctagaaaatt atttcttgga 4740caataggaac agtcattgat ctgtaaatcc tggctcttaa cagtgagtgg ccaaggactt 4800gatcagccca tttcttggtc cctcagtgct ttaaaattta agtagcactg cattttgtaa 4860tgttgaatat gactctagtg acttgtagga ggcacttgtg aggagatgct tgcttcagtg 4920taaaagatgc tcatggcctg agtcagttga gttttctttc aagaaaccac ttcagagtga 4980aatatccagg gtttccccgc cctggacatg tccagcctgc ccaggcagca cacagccctg 5040taagtccacc tcgtgtgggt gagatttcct cctgcgtgat gacctcatcg ccatctctgc 5100tgtctcattc cacagcctcc ctccctcttc tctcctcctc tgccctcgcc cttccccctt 5160ccccatcccc tccccctcct cctctgccct cgcccttacc cctccccctt ccccttccgc 5220tcctcctccc tcctccacct ctttctcctc ctccttccct cctcctccct cctcttccct 5280tctctgccat ctttctcccc gtgcctattg atcccacata ggctcattct gggtacaccg 5340gctaaaggct ttggtgcatt gcagcgtttt ctcccagcag ctgtgtgaaa gatgcatttt 5400ctaagctaag gagaattttc tcaagagtgg catactcatg ccaaatatta ttgctctggg 5460ccatataggc tggtcttcct ccacactaaa atgggtgtct tgttttggta cttaaaacag 5520tctactccag gcatccagtc cttacagacc aaggaagagc atagcgatgc ctgttggaat 5580tgcagatgca ttctggcctt ctcccccgtc ctgaaacatt ttctttgagg aaggctctta 5640gaacattaga tagtctgctg aggttgttgg cccagctcca tacacccagt agaacagtgg 5700aacaactcat gcttcatgct gccaagctgc tgtacttcaa aggaaacaga tctagcacac 5760tgctgcaccc ctgcttccac actccacact tcaccccgct gcttttctct gacccgcccc 5820tggccttgta agactcacgt aagctaagtc caggatgcct gtggcctgcg gcttgattct 5880tccctttagg attcagcaag ttaatggctt cctcgctata gaagtgagac tttgacttga 5940tgcctcttgg tatatcaaaa agatattcat ccagaaagta ccaaatgttc tgaaagaccc 6000gctcttcact ccagttttcc ctagggtgtt tctggcaggg cgtttttaaa aggcatctac 6060ctgagttgac gctaatactt gtcaccacct ggaacgtagt tatcggtcgg caggctgaac 6120atactccaga ttccccagag gccacttctg tagcccagcg atgcatctga gcctctctgc 6180gtggtttatg cttgaaaaat agataatgct tttagatggt tcactgccag gccatgggcc 6240ccacacatct caggccctgt gtgagggagc acactgagat ggtgcaggag tgaatgggca 6300tggcttggcc tcgctacctc ggggacctgt tggagttctg gcagcagggt gtctgcaggt 6360gggacggcgt tctgggcaga gtcagaatgg tcagaatgaa acagaacagc caactcaccc 6420acaggacagc ttattttgag gcaaggtttt ggattttgga ggaagcagcc agatgaggcg 6480gtgagcctcc agaaggtcag cctttggagc acgtaagata ctgttacagg gtccagaaat 6540cgtgttcaca tgggggcttt gactcttcaa acagcttttg cagatcgtaa attgcatttg 6600cctagtcgtg tgacctcaaa agaagtcaga catatttaat ccagaaatag tttcgtttga 6660gggagggctt gcaggtctgt aaatagcatt tgctttcctg gttagagatt gggatgcaga 6720aggagttttc agtatttttt ttaaaacact aatgatcatt gaagagtatt tatgtaaaca 6780tacaacgtat aatgggtggg ggatccgatc atggtgatgt acggggtgaa ttctcttgcc 6840gtgttgcaaa tgtgtaaaat aaagattatc tggc 687497018DNAHomo sapiens 9ccgaggtggg cagcacaggc tcctcgacga cttcctaggt cgcaatctcc aggaaaacga 60ccacagggtc agcggagcta gccgccgagc cccgctcccc gggcccttcc ggcggctgcg 120ccctttcacc ccggacgtgg gcgggagagg aagcggctgg tgatgctgga acaaacatgg 180ccgctctggc gcccgtcggc tcccccgcct cccgcggtcc taggctggcc gcgggcctcc 240ggctgctccc aatgctgggt ttgctgcagt tgctggccga gcctggcctg ggccgcgtcc 300atcacctggc actcaaggat gatgtgaggc ataaagttca tctgaacacc tttggcttct 360tcaaggatgg gtacatggtg gtgaatgtca gtagcctctc actgaatgag cctgaagaca 420aggatgtgac tattggattt agcctagacc gtacaaagaa tgatggcttt tcttcttacc 480tggatgaaga tgtgaattac tgtattttaa agaaacagtc tgtctctgtc acccttttaa 540tcctagacat ctccagaagt gaggtaagag taaagtctcc accagaagct ggtacccagt 600taccaaagat catcttcagc agggatgaga aagtccttgg tcagagccag gagcctaatg 660ttaaccctgc ttcagcaggc aaccagaccc agaagacaca agatggtgga aagtctaaaa 720gaagtacagt ggattcaaag gccatgggag agaaatcctt ttctgttcat aataatggtg 780gggcagtgtc atttcagttt ttctttaaca tcagcactga tgaccaagaa ggcctttaca 840gtctttattt tcataaatgc cttggaaaag aattgccaag tgacaagttt acattcagcc 900ttgatattga gatcacagag aagaatcctg acagctacct ctcagcagga gaaattcctc 960tccccaaatt atacatctca atggcctttt tcttctttct ttctgggacc atctggattc 1020atatccttcg aaaacgacgg aatgatgtat ttaaaatcca ctggctgatg gcggcccttc 1080ctttcaccaa gtctctttcc ttggtgttcc atgcaattga ctaccactac atctcctccc 1140agggcttccc tatcgaaggc tgggctgttg tgtactacat aactcacctt ttgaaagggg 1200cgctactctt catcaccatt gcactcattg gcactggctg ggctttcatt aagcacatcc 1260tttctgataa agacaaaaag atcttcatga ttgtcattcc actccaggtc ctggcaaatg 1320tagcctacat catcatagag tccaccgagg agggcacgac tgaatatggc ttgtggaagg 1380actctctatt tctggtcgac ctgttgtgtt gtggtgccat cctcttccca gtggtgtggt 1440caatcagaca tttacaagaa gcatcagcaa cagatggaaa aggtgacagc atgggacctc 1500ttcagcagag agcgaatcta agagcaggaa gtcgcataga gtctcaccat tttgcccagg 1560ctgatcttga actcctggcc tctagctgtc ctcctgcctc agtctcccaa agggctggga 1620ttacagctgc tattaactta gcaaagctga aacttttcag acattattac gtcttgattg 1680tgtgttacat atacttcact aggatcattg catttctcct caaactcgct gttccattcc 1740agtggaagtg gctctaccag ctcctggatg aaacggccac actggtcttc tttgttctaa 1800cggggtataa attccgtccg gcttcagata acccctacct acaactttct caggaagaag 1860aagacttgga aatggagtcc gttgtgacaa catctggggt gatggaaagt atgaagaaag 1920tcaagaaggt gaccaacggc tccgtggagc cccagggcga gtgggaaggc gccgtgtgac 1980agagccgacc ctgaggatgg cactgtccaa ggaaactgtt aacttattca tagtcctatt 2040ggacagcagg agcagctcct acagtgaact attggcacca ccgacagtga caccagggca 2100catggctgga gcacagtgcc gcggaaacct gattttgtac tctcttttat ggaaacgatc 2160tgtggctgtt tagaggcagc tggatcctct ttcaggcggg aatgggaggg cgggcacagg 2220gaggaggaga ggaagagaaa aggaagaatt catttttaat ttaggtttct ttttttcttc 2280ttcatttcgg agctctaagg tgtatgcagt tgtgacccca tgtgtgggga agtgtagcaa 2340ggacggctgg tggaggggga aggagggtgc gaggtgtctg tctgatgctt taggaaatgt 2400ctactgagga ccctgggact taagaagaag ggcggggaga gtgccattgc ctgtttggga 2460gacaaaaatg aacgaaaaca ggtgactttg gaaagcaaag tcaaaaccca gtttaggatg 2520tagcacctgc cccaggattc ctgccctcgg ctttgcccca gacccttatt ccagatgctg 2580agagtgacca ggacagcagc tcctgaggcc cagtggtctt ctttccaaca ggaaaagaag 2640gctgtgatgt cgctgtcagg atcatgccct gtggcacagc acaggtggtg ggaggtggtt 2700ttctgactga gatgttgcct gatggatgga aagaaatgta tttttaagtt caaaaagcat 2760tatcctgtgg cgttgcctgg acatccactc cctgacagcc cagagcagca ctgtctggct 2820tcccttcatg cttgtggctt tgttgtgttt gatcagaatt ttgggggaaa tggaaagttt 2880tcctcaagga gcagctgggg gcagaatagg tagtatttaa gcaaatactt aagtccaagc 2940aaatcatccc cattaaaaag cttttcctgt aggctagtag gatttctaaa tagatgaatt 3000caacagactt ggtccccata gtccaagagt atgtatgtga agaaagtgag catgattcaa 3060cagtttcact ctcagggatt ttaggatggc aaaatacttc acagaaactc aatgattaag 3120ttcccttcca cacttccaga gcttgaatga acacaggtag ccacctaaat tgagcagtat 3180tgcaactcag agagaaaatc atctgaatag taggacaagc tcagaaggta cattgtgact 3240gagggcttaa aaggagacca aaacatggcc ccatcaggga agcttcttaa tgcttggggg 3300gccagctagg tagggttgct tccaaaagct ggagcccacc cctgcctagg ggttgtcaga 3360gagccacacc tgcaggggaa caggtacctc cgagggtgag agtcgtggtc tctgggagtt 3420gttttctcac ctctggctta gaagggtcag gcagaaacca caggatgtgg ggtcacactc 3480actgtcccaa gtttgggaac ctgaaaaagt ctccattcag aacatggttg ttctccctgt 3540cccatgctat cttatcttcc taaatgacta atgaggaagc gggtgttctt tttctgcact 3600ttgattcgcc atctgggttc tgtagggtgc tctgaaggtg tgatctgcct tctggctgat 3660gtggaggaag agcaagcgcc ttcccaggcc acagctgctc acctctcggc agatatttta 3720ggcaagcatc cgtgtgtctt cccatcttca ggagaaaggt aaatgcaccc taagtgttca 3780cttctggacc tttttcaagt tcacttggga ctgtgtgaca gaagggagtt ggagggagga 3840tgggaatatt tttaacactt tgttttcctg tgcagaaaca taataccagt tttcgcagaa 3900atgtgtctca atctgtgact accaaagccc tcctcagtcc ttccctcaga gggacacatt 3960tgctgtttct cccgcaagca gatgttgtgg atgaggcgat agactccttg gcaagaacga 4020aaggtgtgat gaaacctccc tgctcggaag ggtctccgtg gaggtgtcct catttcacat 4080gctgggtttt gcaagcgagg aagccaggca gtggaggaac tagagagagg caggcgtgtg 4140tgtggacaag cgctggagcc gcagccctca gactggcacg ggaacgccag cgttgggtgt 4200tcagattcca cgcgtatgtc tgggctcact cacagcatgg ccgagtgtct gcagtgctgg 4260tcctgaccct tccagagcag cagtggacag atgagataag actgtttcag aaacaaagat 4320ggccacagcc ttcctaacaa gcaggtcatc tggccatgtc tgtattgtaa ctggtaaaag 4380gcttcaagtc agattgatga tcaagaaaag tcaaaacccc agcccaagat tgggaaagca 4440ggtggtggtt ccaagctttt aaaaaattat tgaagctctc catcctgttc tgtgagtgtg 4500tcttctcttt ctccttcacg tcatagccgt gacccaccgt tcatctctgc tcttgcgtaa 4560agatgaccga tggagtccaa agccaagtgg cttcaccagc tgacaagcca ccctcctgca 4620gcctgagttt cacagtccac tgggttcgtt gtcatgcggt gtttgaatgg ttaagccctt 4680gcagtatttc agatcgggca aaaaatatcg gatgcacata gcagaaccat tggtggtatt 4740tatagctttg ctttgtactc ctcactgttt ctgcctacgc aaaatatcca tgtttcctct 4800gagaaatctg ttgtggactg aaagcgctgc tggctgtgaa atttaataaa gtgtgtatgc 4860tttgctagaa aattatttct tggacaatag gaacagtcat tgatctgtaa atcctggctc 4920ttaacagtga gtggccaagg acttgatcag cccatttctt ggtccctcag tgctttaaaa 4980tttaagtagc actgcatttt gtaatgttga atatgactct agtgacttgt aggaggcact 5040tgtgaggaga tgcttgcttc agtgtaaaag atgctcatgg cctgagtcag

ttgagttttc 5100tttcaagaaa ccacttcaga gtgaaatatc cagggtttcc ccgccctgga catgtccagc 5160ctgcccaggc agcacacagc cctgtaagtc cacctcgtgt gggtgagatt tcctcctgcg 5220tgatgacctc atcgccatct ctgctgtctc attccacagc ctccctccct cttctctcct 5280cctctgccct cgcccttccc ccttccccat cccctccccc tcctcctctg ccctcgccct 5340tacccctccc ccttcccctt ccgctcctcc tccctcctcc acctctttct cctcctcctt 5400ccctcctcct ccctcctctt cccttctctg ccatctttct ccccgtgcct attgatccca 5460cataggctca ttctgggtac accggctaaa ggctttggtg cattgcagcg ttttctccca 5520gcagctgtgt gaaagatgca ttttctaagc taaggagaat tttctcaaga gtggcatact 5580catgccaaat attattgctc tgggccatat aggctggtct tcctccacac taaaatgggt 5640gtcttgtttt ggtacttaaa acagtctact ccaggcatcc agtccttaca gaccaaggaa 5700gagcatagcg atgcctgttg gaattgcaga tgcattctgg ccttctcccc cgtcctgaaa 5760cattttcttt gaggaaggct cttagaacat tagatagtct gctgaggttg ttggcccagc 5820tccatacacc cagtagaaca gtggaacaac tcatgcttca tgctgccaag ctgctgtact 5880tcaaaggaaa cagatctagc acactgctgc acccctgctt ccacactcca cacttcaccc 5940cgctgctttt ctctgacccg cccctggcct tgtaagactc acgtaagcta agtccaggat 6000gcctgtggcc tgcggcttga ttcttccctt taggattcag caagttaatg gcttcctcgc 6060tatagaagtg agactttgac ttgatgcctc ttggtatatc aaaaagatat tcatccagaa 6120agtaccaaat gttctgaaag acccgctctt cactccagtt ttccctaggg tgtttctggc 6180agggcgtttt taaaaggcat ctacctgagt tgacgctaat acttgtcacc acctggaacg 6240tagttatcgg tcggcaggct gaacatactc cagattcccc agaggccact tctgtagccc 6300agcgatgcat ctgagcctct ctgcgtggtt tatgcttgaa aaatagataa tgcttttaga 6360tggttcactg ccaggccatg ggccccacac atctcaggcc ctgtgtgagg gagcacactg 6420agatggtgca ggagtgaatg ggcatggctt ggcctcgcta cctcggggac ctgttggagt 6480tctggcagca gggtgtctgc aggtgggacg gcgttctggg cagagtcaga atggtcagaa 6540tgaaacagaa cagccaactc acccacagga cagcttattt tgaggcaagg ttttggattt 6600tggaggaagc agccagatga ggcggtgagc ctccagaagg tcagcctttg gagcacgtaa 6660gatactgtta cagggtccag aaatcgtgtt cacatggggg ctttgactct tcaaacagct 6720tttgcagatc gtaaattgca tttgcctagt cgtgtgacct caaaagaagt cagacatatt 6780taatccagaa atagtttcgt ttgagggagg gcttgcaggt ctgtaaatag catttgcttt 6840cctggttaga gattgggatg cagaaggagt tttcagtatt ttttttaaaa cactaatgat 6900cattgaagag tatttatgta aacatacaac gtataatggg tgggggatcc gatcatggtg 6960atgtacgggg tgaattctct tgccgtgttg caaatgtgta aaataaagat tatctggc 7018107353DNAHomo sapiens 10attattgcaa ccatttcgct tgtatttgag tgtgaagcgc ctagaaaacc acaggacccc 60tacggcgagc cgggaatttt tagatatttt cctccgagtc aacgctcagt gaaatcagtt 120caatcagtgg cccgacaccg tgcggctgac acagcttatc ccccgaccct gagatcaggg 180gtccccggag cccaaggtcg cttccaaagc tcagcgaggc ggaggtgcgg cccgggctgg 240tctggttcgg ccaccgttgt tatggcaacc gccaataggt tggcttcatc tctaactgaa 300agtctgcaca ggagcggccg atcgaagggc cccgaggtgg gcagcacagg ctcctcgacg 360acttcctagg tcgcaatctc caggaaaacg accacagggt cagcggagct agccgccgag 420ccccgctccc cgggcccttc cggcggctgc gccctttcac cccggacgtg ggcgggagag 480gaagcggctg gtgatgctgg aacaaacatg gccgctctgg cgcccgtcgg ctcccccgcc 540tcccgcggtc ctaggctggc cgcgggcctc cggctgctcc caatgctggg tttgctgcag 600ttgctggccg agcctggcct gggccgcgtc catcacctgg cactcaagga tgatgtgagg 660cataaagttc atctgaacac ctttggcttc ttcaaggatg ggtacatggt ggtgaatgtc 720agtagcctct cactgaatga gcctgaagac aaggatgtga ctattggatt tagcctagac 780cgtacaaaga atgatggctt ttcttcttac ctggatgaag atgtgaatta ctgtatttta 840aagaaacagt ctgtctctgt caccctttta atcctagaca tctccagaag tgaggtaaga 900gtaaagtctc caccagaagc tggtacccag ttaccaaaga tcatcttcag cagggatgag 960aaagtccttg gtcagagcca ggagcctaat gttaaccctg cttcagcagg caaccagacc 1020cagaagacac aagatggtgg aaagtctaaa agaagtacag tggattcaaa ggccatggga 1080gagaaatcct tttctgttca taataatggt ggggcagtgt catttcagtt tttctttaac 1140atcagcactg atgaccaaga aggcctttac agtctttatt ttcataaatg ccttggaaaa 1200gaattgccaa gtgacaagtt tacattcagc cttgatattg agatcacaga gaagaatcct 1260gacagctacc tctcagcagg agaaattcct ctccccaaat tatacatctc aatggccttt 1320ttcttctttc tttctgggac catctggatt catatccttc gaaaacgacg gaatgatgta 1380tttaaaatcc actggctgat ggcggccctt cctttcacca agtctctttc cttggtgttc 1440catgcaattg actaccacta catctcctcc cagggcttcc ctatcgaagg ctgggctgtt 1500gtgtactaca taactcacct tttgaaaggg gcgctactct tcatcaccat tgcactcatt 1560ggcactggct gggctttcat taagcacatc ctttctgata aagacaaaaa gatcttcatg 1620attgtcattc cactccaggt cctggcaaat gtagcctaca tcatcataga gtccaccgag 1680gagggcacga ctgaatatgg cttgtggaag gactctctat ttctggtcga cctgttgtgt 1740tgtggtgcca tcctcttccc agtggtgtgg tcaatcagac atttacaaga agcatcagca 1800acagatggaa aaggtgacag catgggacct cttcagcaga gagcgaatct aagagcagga 1860agtcgcatag agtctcacca ttttgcccag gctgatcttg aactcctggc ctctagctgt 1920cctcctgcct cagtctccca aagggctggg attacagctg ctattaactt agcaaagctg 1980aaacttttca gacattatta cgtcttgatt gtgtgttaca tatacttcac taggatcatt 2040gcatttctcc tcaaactcgc tgttccattc cagtggaagt ggctctacca gctcctggat 2100gaaacggcca cactggtctt ctttgttcta acggggtata aattccgtcc ggcttcagat 2160aacccctacc tacaactttc tcaggaagaa gaagacttgg aaatggagtc cgttgtgaca 2220acatctgggg tgatggaaag tatgaagaaa gtcaagaagg tgaccaacgg ctccgtggag 2280ccccagggcg agtgggaagg cgccgtgtga cagagccgac cctgaggatg gcactgtcca 2340aggaaactgt taacttattc atagtcctat tggacagcag gagcagctcc tacagtgaac 2400tattggcacc accgacagtg acaccagggc acatggctgg agcacagtgc cgcggaaacc 2460tgattttgta ctctctttta tggaaacgat ctgtggctgt ttagaggcag ctggatcctc 2520tttcaggcgg gaatgggagg gcgggcacag ggaggaggag aggaagagaa aaggaagaat 2580tcatttttaa tttaggtttc tttttttctt cttcatttcg gagctctaag gtgtatgcag 2640ttgtgacccc atgtgtgggg aagtgtagca aggacggctg gtggaggggg aaggagggtg 2700cgaggtgtct gtctgatgct ttaggaaatg tctactgagg accctgggac ttaagaagaa 2760gggcggggag agtgccattg cctgtttggg agacaaaaat gaacgaaaac aggtgacttt 2820ggaaagcaaa gtcaaaaccc agtttaggat gtagcacctg ccccaggatt cctgccctcg 2880gctttgcccc agacccttat tccagatgct gagagtgacc aggacagcag ctcctgaggc 2940ccagtggtct tctttccaac aggaaaagaa ggctgtgatg tcgctgtcag gatcatgccc 3000tgtggcacag cacaggtggt gggaggtggt tttctgactg agatgttgcc tgatggatgg 3060aaagaaatgt atttttaagt tcaaaaagca ttatcctgtg gcgttgcctg gacatccact 3120ccctgacagc ccagagcagc actgtctggc ttcccttcat gcttgtggct ttgttgtgtt 3180tgatcagaat tttgggggaa atggaaagtt ttcctcaagg agcagctggg ggcagaatag 3240gtagtattta agcaaatact taagtccaag caaatcatcc ccattaaaaa gcttttcctg 3300taggctagta ggatttctaa atagatgaat tcaacagact tggtccccat agtccaagag 3360tatgtatgtg aagaaagtga gcatgattca acagtttcac tctcagggat tttaggatgg 3420caaaatactt cacagaaact caatgattaa gttcccttcc acacttccag agcttgaatg 3480aacacaggta gccacctaaa ttgagcagta ttgcaactca gagagaaaat catctgaata 3540gtaggacaag ctcagaaggt acattgtgac tgagggctta aaaggagacc aaaacatggc 3600cccatcaggg aagcttctta atgcttgggg ggccagctag gtagggttgc ttccaaaagc 3660tggagcccac ccctgcctag gggttgtcag agagccacac ctgcagggga acaggtacct 3720ccgagggtga gagtcgtggt ctctgggagt tgttttctca cctctggctt agaagggtca 3780ggcagaaacc acaggatgtg gggtcacact cactgtccca agtttgggaa cctgaaaaag 3840tctccattca gaacatggtt gttctccctg tcccatgcta tcttatcttc ctaaatgact 3900aatgaggaag cgggtgttct ttttctgcac tttgattcgc catctgggtt ctgtagggtg 3960ctctgaaggt gtgatctgcc ttctggctga tgtggaggaa gagcaagcgc cttcccaggc 4020cacagctgct cacctctcgg cagatatttt aggcaagcat ccgtgtgtct tcccatcttc 4080aggagaaagg taaatgcacc ctaagtgttc acttctggac ctttttcaag ttcacttggg 4140actgtgtgac agaagggagt tggagggagg atgggaatat ttttaacact ttgttttcct 4200gtgcagaaac ataataccag ttttcgcaga aatgtgtctc aatctgtgac taccaaagcc 4260ctcctcagtc cttccctcag agggacacat ttgctgtttc tcccgcaagc agatgttgtg 4320gatgaggcga tagactcctt ggcaagaacg aaaggtgtga tgaaacctcc ctgctcggaa 4380gggtctccgt ggaggtgtcc tcatttcaca tgctgggttt tgcaagcgag gaagccaggc 4440agtggaggaa ctagagagag gcaggcgtgt gtgtggacaa gcgctggagc cgcagccctc 4500agactggcac gggaacgcca gcgttgggtg ttcagattcc acgcgtatgt ctgggctcac 4560tcacagcatg gccgagtgtc tgcagtgctg gtcctgaccc ttccagagca gcagtggaca 4620gatgagataa gactgtttca gaaacaaaga tggccacagc cttcctaaca agcaggtcat 4680ctggccatgt ctgtattgta actggtaaaa ggcttcaagt cagattgatg atcaagaaaa 4740gtcaaaaccc cagcccaaga ttgggaaagc aggtggtggt tccaagcttt taaaaaatta 4800ttgaagctct ccatcctgtt ctgtgagtgt gtcttctctt tctccttcac gtcatagccg 4860tgacccaccg ttcatctctg ctcttgcgta aagatgaccg atggagtcca aagccaagtg 4920gcttcaccag ctgacaagcc accctcctgc agcctgagtt tcacagtcca ctgggttcgt 4980tgtcatgcgg tgtttgaatg gttaagccct tgcagtattt cagatcgggc aaaaaatatc 5040ggatgcacat agcagaacca ttggtggtat ttatagcttt gctttgtact cctcactgtt 5100tctgcctacg caaaatatcc atgtttcctc tgagaaatct gttgtggact gaaagcgctg 5160ctggctgtga aatttaataa agtgtgtatg ctttgctaga aaattatttc ttggacaata 5220ggaacagtca ttgatctgta aatcctggct cttaacagtg agtggccaag gacttgatca 5280gcccatttct tggtccctca gtgctttaaa atttaagtag cactgcattt tgtaatgttg 5340aatatgactc tagtgacttg taggaggcac ttgtgaggag atgcttgctt cagtgtaaaa 5400gatgctcatg gcctgagtca gttgagtttt ctttcaagaa accacttcag agtgaaatat 5460ccagggtttc cccgccctgg acatgtccag cctgcccagg cagcacacag ccctgtaagt 5520ccacctcgtg tgggtgagat ttcctcctgc gtgatgacct catcgccatc tctgctgtct 5580cattccacag cctccctccc tcttctctcc tcctctgccc tcgcccttcc cccttcccca 5640tcccctcccc ctcctcctct gccctcgccc ttacccctcc cccttcccct tccgctcctc 5700ctccctcctc cacctctttc tcctcctcct tccctcctcc tccctcctct tcccttctct 5760gccatctttc tccccgtgcc tattgatccc acataggctc attctgggta caccggctaa 5820aggctttggt gcattgcagc gttttctccc agcagctgtg tgaaagatgc attttctaag 5880ctaaggagaa ttttctcaag agtggcatac tcatgccaaa tattattgct ctgggccata 5940taggctggtc ttcctccaca ctaaaatggg tgtcttgttt tggtacttaa aacagtctac 6000tccaggcatc cagtccttac agaccaagga agagcatagc gatgcctgtt ggaattgcag 6060atgcattctg gccttctccc ccgtcctgaa acattttctt tgaggaaggc tcttagaaca 6120ttagatagtc tgctgaggtt gttggcccag ctccatacac ccagtagaac agtggaacaa 6180ctcatgcttc atgctgccaa gctgctgtac ttcaaaggaa acagatctag cacactgctg 6240cacccctgct tccacactcc acacttcacc ccgctgcttt tctctgaccc gcccctggcc 6300ttgtaagact cacgtaagct aagtccagga tgcctgtggc ctgcggcttg attcttccct 6360ttaggattca gcaagttaat ggcttcctcg ctatagaagt gagactttga cttgatgcct 6420cttggtatat caaaaagata ttcatccaga aagtaccaaa tgttctgaaa gacccgctct 6480tcactccagt tttccctagg gtgtttctgg cagggcgttt ttaaaaggca tctacctgag 6540ttgacgctaa tacttgtcac cacctggaac gtagttatcg gtcggcaggc tgaacatact 6600ccagattccc cagaggccac ttctgtagcc cagcgatgca tctgagcctc tctgcgtggt 6660ttatgcttga aaaatagata atgcttttag atggttcact gccaggccat gggccccaca 6720catctcaggc cctgtgtgag ggagcacact gagatggtgc aggagtgaat gggcatggct 6780tggcctcgct acctcgggga cctgttggag ttctggcagc agggtgtctg caggtgggac 6840ggcgttctgg gcagagtcag aatggtcaga atgaaacaga acagccaact cacccacagg 6900acagcttatt ttgaggcaag gttttggatt ttggaggaag cagccagatg aggcggtgag 6960cctccagaag gtcagccttt ggagcacgta agatactgtt acagggtcca gaaatcgtgt 7020tcacatgggg gctttgactc ttcaaacagc ttttgcagat cgtaaattgc atttgcctag 7080tcgtgtgacc tcaaaagaag tcagacatat ttaatccaga aatagtttcg tttgagggag 7140ggcttgcagg tctgtaaata gcatttgctt tcctggttag agattgggat gcagaaggag 7200ttttcagtat tttttttaaa acactaatga tcattgaaga gtatttatgt aaacatacaa 7260cgtataatgg gtgggggatc cgatcatggt gatgtacggg gtgaattctc ttgccgtgtt 7320gcaaatgtgt aaaataaaga ttatctggca gaa 735311279PRTHomo sapiens 11Met Met Leu Met Gln Ala Leu Val Leu Phe Thr Leu Asp Ser Leu Asp 1 5 10 15 Met Leu Pro Ala Val Lys Ala Thr Trp Leu Tyr Gly Ile Gln Ile Thr 20 25 30 Ser Leu Leu Leu Val Cys Ile Leu Gln Phe Phe Asn Ser Met Ile Leu 35 40 45 Gly Ser Leu Leu Ile Ser Phe Asn Leu Ser Val Phe Ile Ala Arg Lys 50 55 60 Leu Gln Lys Asn Leu Lys Thr Gly Ser Phe Leu Asn Arg Leu Gly Lys 65 70 75 80 Leu Leu Leu His Leu Phe Met Val Leu Cys Leu Thr Leu Phe Leu Asn 85 90 95 Asn Ile Ile Lys Lys Ile Leu Asn Leu Lys Ser Asp Glu His Ile Phe 100 105 110 Lys Phe Leu Lys Ala Lys Phe Gly Leu Gly Ala Thr Arg Asp Phe Asp 115 120 125 Ala Asn Leu Tyr Leu Cys Glu Glu Ala Phe Gly Leu Leu Pro Phe Asn 130 135 140 Thr Phe Gly Arg Leu Ser Asp Thr Leu Leu Phe Tyr Ala Tyr Ile Phe 145 150 155 160 Val Leu Ser Ile Thr Val Ile Val Ala Phe Val Val Ala Phe His Asn 165 170 175 Leu Ser Asp Ser Thr Asn Gln Gln Ser Val Gly Lys Met Glu Lys Gly 180 185 190 Thr Val Asp Leu Lys Pro Glu Thr Ala Tyr Asn Leu Ile His Thr Ile 195 200 205 Leu Phe Gly Phe Leu Ala Leu Ser Thr Met Arg Met Lys Tyr Leu Trp 210 215 220 Thr Ser His Met Cys Val Phe Ala Ser Phe Gly Leu Cys Ser Pro Glu 225 230 235 240 Ile Trp Glu Leu Leu Leu Lys Ser Val His Leu Tyr Asn Pro Lys Arg 245 250 255 Ile Cys Ile Met Arg Tyr Ser Val Pro Ile Leu Ile Leu Leu Tyr Leu 260 265 270 Cys Tyr Lys Asn Gln Lys Ser 275 12102PRTHomo sapiens 12Met Met Ser Ile Arg Gln Arg Arg Glu Ile Arg Ala Thr Glu Val Ser 1 5 10 15 Glu Asp Phe Pro Ala Gln Glu Glu Asn Val Lys Leu Glu Asn Lys Leu 20 25 30 Pro Ser Gly Cys Thr Ser Arg Arg Leu Trp Lys Ile Leu Ser Leu Thr 35 40 45 Ile Gly Gly Thr Ile Ala Leu Cys Ile Gly Leu Leu Thr Ser Val Tyr 50 55 60 Leu Ala Thr Leu His Glu Asn Asp Leu Trp Phe Ser Asn Ile Lys Val 65 70 75 80 Trp Ser Phe Phe Asp His Cys Ile Ile His Ser Val Gly Ser Pro Val 85 90 95 Val Ser His Val Asp Glu 100 13716PRTHomo sapiens 13Met Met Ser Ile Arg Gln Arg Arg Glu Ile Arg Ala Thr Glu Val Ser 1 5 10 15 Glu Asp Phe Pro Ala Gln Glu Glu Asn Val Lys Leu Glu Asn Lys Leu 20 25 30 Pro Ser Gly Cys Thr Ser Arg Arg Leu Trp Lys Ile Leu Ser Leu Thr 35 40 45 Ile Gly Gly Thr Ile Ala Leu Cys Ile Gly Leu Leu Thr Ser Val Tyr 50 55 60 Leu Ala Thr Leu His Glu Asn Asp Leu Trp Phe Ser Asn Ile Lys Glu 65 70 75 80 Val Glu Arg Glu Ile Ser Phe Arg Thr Glu Cys Gly Leu Tyr Tyr Ser 85 90 95 Tyr Tyr Lys Gln Met Leu Gln Ala Pro Thr Leu Val Gln Gly Phe His 100 105 110 Gly Leu Ile Tyr Asp Asn Lys Thr Glu Ser Met Lys Thr Ile Asn Leu 115 120 125 Leu Gln Arg Met Asn Ile Tyr Gln Glu Val Phe Leu Ser Ile Leu Tyr 130 135 140 Arg Val Leu Pro Ile Gln Lys Tyr Leu Glu Pro Val Tyr Phe Tyr Ile 145 150 155 160 Tyr Thr Leu Phe Gly Leu Gln Ala Ile Tyr Val Thr Ala Leu Tyr Ile 165 170 175 Thr Ser Trp Leu Leu Ser Gly Thr Trp Leu Ser Gly Leu Leu Ala Ala 180 185 190 Phe Trp Tyr Val Thr Asn Arg Ile Asp Thr Thr Arg Val Glu Phe Thr 195 200 205 Ile Pro Leu Arg Glu Asn Trp Ala Leu Pro Phe Phe Ala Ile Gln Ile 210 215 220 Ala Ala Ile Thr Tyr Phe Leu Arg Pro Asn Leu Gln Pro Leu Ser Glu 225 230 235 240 Arg Leu Thr Leu Leu Ala Ile Phe Ile Ser Thr Phe Leu Phe Ser Leu 245 250 255 Thr Trp Gln Phe Asn Gln Phe Met Met Leu Met Gln Ala Leu Val Leu 260 265 270 Phe Thr Leu Asp Ser Leu Asp Met Leu Pro Ala Val Lys Ala Thr Trp 275 280 285 Leu Tyr Gly Ile Gln Ile Thr Ser Leu Leu Leu Val Cys Ile Leu Gln 290 295 300 Phe Phe Asn Ser Met Ile Leu Gly Ser Leu Leu Ile Ser Phe Asn Leu 305 310 315 320 Ser Val Phe Ile Ala Arg Lys Leu Gln Lys Asn Leu Lys Thr Gly Ser 325 330 335 Phe Leu Asn Arg Leu Gly Lys Leu Leu Leu His Leu Phe Met Val Leu 340 345 350 Cys Leu Thr Leu Phe Leu Asn Asn Ile Ile Lys Lys Ile Leu Asn Leu 355 360 365 Lys Ser Asp Glu His Ile Phe Lys Phe Leu Lys Ala Lys Phe Gly Leu 370 375 380 Gly Ala Thr Arg Asp Phe Asp Ala Asn Leu Tyr Leu Cys Glu Glu Ala 385 390 395 400 Phe Gly Leu Leu Pro Phe Asn Thr Phe Gly Arg Leu Ser Asp Thr Leu 405 410 415 Leu Phe Tyr Ala Tyr Ile Phe Val Leu Ser Ile Thr Val Ile Val Ala 420 425 430 Phe Val Val Ala Phe His Asn Leu Ser Asp Ser Thr Asn Gln Gln Ser 435 440 445 Val Gly Lys Met Glu Lys Gly Thr Val Asp Leu Lys Pro Glu Thr Ala 450 455 460 Tyr Asn Leu Ile His Thr Ile Leu Phe Gly Phe Leu Ala Leu Ser Thr 465 470 475 480 Met Arg Met Lys Tyr Leu Trp Thr Ser His Met

Cys Val Phe Ala Ser 485 490 495 Phe Gly Leu Cys Ser Pro Glu Ile Trp Glu Leu Leu Leu Lys Ser Val 500 505 510 His Leu Tyr Asn Pro Lys Arg Ile Cys Ile Met Arg Tyr Ser Val Pro 515 520 525 Ile Leu Ile Leu Leu Tyr Leu Cys Tyr Lys Phe Trp Pro Gly Met Met 530 535 540 Asp Glu Leu Ser Glu Leu Arg Glu Phe Tyr Asp Pro Asp Thr Val Glu 545 550 555 560 Leu Met Asn Trp Ile Asn Ser Asn Thr Pro Arg Lys Ala Val Phe Ala 565 570 575 Gly Ser Met Gln Leu Leu Ala Gly Val Lys Leu Cys Thr Gly Arg Thr 580 585 590 Leu Thr Asn His Pro His Tyr Glu Asp Ser Ser Leu Arg Glu Arg Thr 595 600 605 Arg Ala Val Tyr Gln Ile Tyr Ala Lys Arg Ala Pro Glu Glu Val His 610 615 620 Ala Leu Leu Arg Ser Phe Gly Thr Asp Tyr Val Ile Leu Glu Asp Ser 625 630 635 640 Ile Cys Tyr Glu Arg Arg His Arg Arg Gly Cys Arg Leu Arg Asp Leu 645 650 655 Leu Asp Ile Ala Asn Gly His Met Met Asp Gly Pro Gly Glu Asn Asp 660 665 670 Pro Asp Leu Lys Pro Ala Asp His Pro Arg Phe Cys Glu Glu Ile Lys 675 680 685 Arg Asn Leu Pro Pro Tyr Val Ala Tyr Phe Thr Arg Val Phe Gln Asn 690 695 700 Lys Thr Phe His Val Tyr Lys Leu Ser Arg Asn Lys 705 710 715 14112PRTHomo sapiens 14Met Met Ser Ile Arg Gln Arg Arg Glu Ile Arg Ala Thr Glu Val Ser 1 5 10 15 Glu Asp Phe Pro Ala Gln Glu Glu Asn Val Lys Leu Glu Asn Lys Leu 20 25 30 Pro Ser Gly Cys Thr Ser Arg Arg Leu Trp Lys Ile Leu Ser Leu Thr 35 40 45 Ile Gly Gly Thr Pro Phe Ala Leu Asp Phe Leu His Leu Ser Thr Leu 50 55 60 Pro Arg Tyr Met Lys Met Ile Tyr Gly Phe Leu Ile Leu Arg Lys Trp 65 70 75 80 Ser Glu Lys Ser His Ser Glu Gln Ser Val Ala Cys Ile Thr Pro Thr 85 90 95 Thr Ser Arg Cys Cys Arg Leu Gln Pro Ser Cys Lys Val Ile Thr Thr 100 105 110 151841DNAHomo sapiens 15aagtttgcgg agcggcttct gctcgtcggc cgtgcggcga ggcagggcct gggctgcgac 60cccggcggcc gctcgcggtc ttgggagagc tggggcgcgt gcctgaactt cccggctgcc 120cctgtccttg gagacctacc tgatggggac gccaggtgtg caggggcgtg gcgcgtagga 180gtgatttgga gaacaatgca tgtaagtctg acatcatgat gtccatccgg caaagaagag 240aaataagagc cacagaagtt tctgaagact ttccagccca agaagaaaat gtgaagttgg 300aaaataaatt gccatctggt tgtaccagta gaagattatg gaagattttg tcattgacaa 360ttggtggaac cattgccctt tgcattggac ttcttacatc tgtctacctt gccacgttac 420atgaaaatga tttatggttt tctaatatta aggaagtgga gcgagaaatc tcattcagaa 480cagagtgtgg cctgtattac tcctactaca agcagatgct gcaggctcca accctcgtgc 540aaggttttca tggcctaata tatgataata aaactgaatc tatgaagaca attaacctcc 600ttcagcgaat gaatatttac caagaggttt ttctcagtat tttatataga gttctaccca 660tacagaaata tttagagcca gtttattttt atatttacac cttatttggg ctccaggcga 720tctatgtcac agctctctac ataaccagct ggctactcag tggtacatgg ctgtcaggac 780tgttggcagc tttctggtat gtcacaaata gaatagatac cacaagagtt gagtttacca 840tcccactgag ggagaactgg gcgctgccat tctttgcaat tcagatagca gcaattacat 900atttcctgag accaaactta cagcctcttt ctgaaaggct gacacttctt gccattttca 960tatcaacttt tctctttagt ctgacatggc aatttaatca atttatgatg ctgatgcaag 1020cattagtgct gttcacactg gactccctgg acatgctgcc agcagtgaag gcgacatggc 1080tgtatggaat acagataaca agtttactcc tggtctgcat tcttcagttt tttaattcca 1140tgattcttgg atcactgctt atcagtttta acctttcagt attcattgca agaaaacttc 1200agaaaaatct gaaaactgga agcttcctta ataggcttgg gaaacttttg ttacatttat 1260ttatggtttt atgtttgaca ctttttctca acaacataat taagaaaatt cttaacctga 1320agtcagatga acacatattt aaatttctga aggcaaaatt tgggcttgga gcaacaaggg 1380attttgatgc aaatctctat ctgtgtgaag aagcttttgg cctcctgcct tttaatacat 1440ttggaaggct ttcagatact ctgctttttt atgcttacat attcgttctg tccatcacag 1500tgattgtagc attcgttgtt gcctttcata atctcagtga ttctacaaat caacaatccg 1560tgggtaaaat ggaaaaaggc acagttgacc tgaaaccaga aactgcctac aacttaatac 1620ataccattct gtttggattc ttggcattga gtacaatgag aatgaagtac ctctggacgt 1680cacacatgtg tgtgttcgca tcattcggcc tatgtagccc tgaaatatgg gagttacttc 1740tgaagtcagt ccatctttat aacccaaaga ggatatgtat aatgcgatat tcagtaccga 1800tattaatact gctgtatcta tgctataaga atcagaaatc t 1841162206DNAHomo sapiens 16cggttctgcc ctccttgtac ccgcggcgcg ctgcggcccg tggcgcggcc ccgttcccgc 60ctagccccgt cggcctcctt cccctcccgg agccgcgcgt gaggacggct gaggccgcag 120gagtgatttg gagaacaatg catgtaagtc tgacatcatg atgtccatcc ggcaaagaag 180agaaataaga gccacagaag tttctgaaga ctttccagcc caagaagaaa atgtgaagtt 240ggaaaataaa ttgccatctg gttgtaccag tagaagatta tggaagattt tgtcattgac 300aattggtgga accattgccc tttgcattgg acttcttaca tctgtctacc ttgccacgtt 360acatgaaaat gatttatggt tttctaatat taaggtatgg agtttctttg accattgtat 420cattcactca gtgggatctc cagtagtaag ccatgtggat gaatgaccaa ggcaacacag 480ttttgccata aagaatccaa tctctagaaa ggttggacta tagagtgaaa taacttttgt 540gtttattatt ttaaaataac atattagaat ctttttttaa atttttcttt attatttatt 600tatttttgag atggagtctc actctgtcac ccaggctgga gtgcggtggc gcaatcttgg 660ctcactacaa cctctgcctc gcaggttcag gtgattcttc tggcttagcc tcccaagtag 720ctgggactat aggtgcgtgc caccacaccc agctaatttt tgtattttta ctagagacgg 780ggtttcagca tattgaccag gctgatctcg aactcctgac cttgtgatct gcctgtctca 840gcctcccaaa gtgctgggat tacaggcgtg agccactgcg tccagccaga atctttattt 900ttcattttaa ttttttgaga tagggtattg ctctgtcacc caggctagaa tgcagtggtg 960caaacatggg tcactgcagc ctcaacctcc tgggctcaag tgagtatcct gcctaagctt 1020cctgtgtcac tgggacccca ggcatgcacc acctcaccaa gctaaatttg atttttttgt 1080agagacaggg tctcactttg ttgcccatgc tggtctcgaa ctcctgggct caagcgatcc 1140tactgccctg gtcttccaaa atatgagaat gagccatagc acccagccca gaatttttat 1200aatcaagtga gttttttctt tttcattaac ttattccatt tatttagcag ttattctaaa 1260ttagtatttt tcaagttata gattgtgaaa ttagtgcagt aggtcatgag taacattttt 1320cttaatgaaa tcaaaaagaa agaatactat cacatctagt agggttgagg attgttttgt 1380gaaactttta attttatata tatatatata tgcacaaact gggtcacagt atacaaggta 1440cttccttttc ttttttttct tgttggctac aacaggaaaa aaaaaaaaca gaaaaggaaa 1500taaaaaagcc actgctttaa atcatggggt ctaaatgtgg ctccacagag ggtcctcagc 1560atgttcatga ctatctaata ctctgtgcaa gtggttttgc agggcatagg gcgatgggga 1620agccatatgt ttccagggaa aggaactgta attttaatca gattttcagg agggttagcc 1680gggcgtcacg cctgtaatcc cagcactttg ggaggtcgag gcgggcagat cacttgaagt 1740caggagttca agaccagcct ggccaacatg gtggaaccct atctctacta aaaatacaaa 1800aattagccgg gcatggtgac acacacctgt aatctcagct actcaggagg ctgaggcaca 1860agaatcactt gaactcggga ggaagaggtt gcagtgagct gagatcccac cactgcactc 1920cagcctgggc aacagagcaa tactctttat caaaaaaaaa aagaaaaaag ttgagggggt 1980ggtctgtgac tctttaaaca cgtttccttg ttttctttct ctctctcttt ttcaacattt 2040ctagaactcc tcttggcatt gttttcagaa ctcgtatata acttacatgt ggaaatttgc 2100atccaaatat accttacatt ttaatctaat atgtcatgat ctttaaccta aactgtggtg 2160tctaatgact agttgcttgt aaaaataaac aaacaccttc aaagcc 2206174456DNAHomo sapiens 17aagtttgcgg agcggcttct gctcgtcggc cgtgcggcga ggcagggcct gggctgcgac 60cccggcggcc gctcgcggtc ttgggagagc tggggcgcgt gcctgaactt cccggctgcc 120cctgtccttg gagacctacc tgatggggac gccaggtgtg caggggcgtg gcgcgtagga 180gtgatttgga gaacaatgca tgtaagtctg acatcatgat gtccatccgg caaagaagag 240aaataagagc cacagaagtt tctgaagact ttccagccca agaagaaaat gtgaagttgg 300aaaataaatt gccatctggt tgtaccagta gaagattatg gaagattttg tcattgacaa 360ttggtggaac cattgccctt tgcattggac ttcttacatc tgtctacctt gccacgttac 420atgaaaatga tttatggttt tctaatatta aggaagtgga gcgagaaatc tcattcagaa 480cagagtgtgg cctgtattac tcctactaca agcagatgct gcaggctcca accctcgtgc 540aaggttttca tggcctaata tatgataata aaactgaatc tatgaagaca attaacctcc 600ttcagcgaat gaatatttac caagaggttt ttctcagtat tttatataga gttctaccca 660tacagaaata tttagagcca gtttattttt atatttacac cttatttggg ctccaggcga 720tctatgtcac agctctctac ataaccagct ggctactcag tggtacatgg ctgtcaggac 780tgttggcagc tttctggtat gtcacaaata gaatagatac cacaagagtt gagtttacca 840tcccactgag ggagaactgg gcgctgccat tctttgcaat tcagatagca gcaattacat 900atttcctgag accaaactta cagcctcttt ctgaaaggct gacacttctt gccattttca 960tatcaacttt tctctttagt ctgacatggc aatttaatca atttatgatg ctgatgcaag 1020cattagtgct gttcacactg gactccctgg acatgctgcc agcagtgaag gcgacatggc 1080tgtatggaat acagataaca agtttactcc tggtctgcat tcttcagttt tttaattcca 1140tgattcttgg atcactgctt atcagtttta acctttcagt attcattgca agaaaacttc 1200agaaaaatct gaaaactgga agcttcctta ataggcttgg gaaacttttg ttacatttat 1260ttatggtttt atgtttgaca ctttttctca acaacataat taagaaaatt cttaacctga 1320agtcagatga acacatattt aaatttctga aggcaaaatt tgggcttgga gcaacaaggg 1380attttgatgc aaatctctat ctgtgtgaag aagcttttgg cctcctgcct tttaatacat 1440ttggaaggct ttcagatact ctgctttttt atgcttacat attcgttctg tccatcacag 1500tgattgtagc attcgttgtt gcctttcata atctcagtga ttctacaaat caacaatccg 1560tgggtaaaat ggaaaaaggc acagttgacc tgaaaccaga aactgcctac aacttaatac 1620ataccattct gtttggattc ttggcattga gtacaatgag aatgaagtac ctctggacgt 1680cacacatgtg tgtgttcgca tcattcggcc tatgtagccc tgaaatatgg gagttacttc 1740tgaagtcagt ccatctttat aacccaaaga ggatatgtat aatgcgatat tcagtaccga 1800tattaatact gctgtatcta tgctataagt tctggccagg aatgatggat gaactctccg 1860agttgagaga attctatgat ccagatacag tggagctgat gaactggatt aactctaaca 1920ctccaagaaa ggctgtgttt gcgggaagca tgcagttgct ggccggagtc aagctgtgca 1980cgggaaggac cctaaccaac cacccgcact atgaagacag cagcctgaga gagcggacca 2040gagcggttta tcagatatat gccaagaggg caccagagga agtgcatgcc ctcctaaggt 2100ccttcggcac tgactacgta atcctggaag acagcatctg ctacgagcgg aggcaccgcc 2160ggggctgccg actccgggac ctgctggaca ttgccaacgg ccacatgatg gatggcccag 2220gagagaatga tcctgatttg aaacctgcag accaccctcg cttctgtgaa gagatcaaaa 2280gaaacctgcc tccctacgtg gcctacttca ccagagtgtt ccagaacaaa accttccacg 2340tttacaagct gtccagaaac aagtagcgca gatttctgcc cagtgtctat ttttgatacg 2400gagaaactgc atcatgatga aactcaatag atgacgtttc ctatgtaagt aggtagccca 2460aaccttcaag ctgtgatatg agtaagttct acagatgttt acacaagtgt tgccatcttt 2520gaaagcatct tctacaagca gaagtctttt tcgttgtgtg tctatctttc tcattaatgt 2580tctttagcct aaatgttaac aactttctaa gagtgaccta gaattatgtt gttggagaga 2640atgatgtgtg ttccatggat acctggatag gcacataaca tgttggaaga tgagcacctg 2700ctcaggattt gaaatacgtt taattttcag gtgacttaag acagctatga ttgaatcaac 2760tagagatgat gatcgactta tttaatatga tttcactggt gaagaccaat tggtagcttt 2820ttaaaaagca ctttagtgtc ctgttttacc ttaaaatgtt ataatatttt ccagttgtca 2880tgctgtcaac attaacaaaa aaaatcatgt taaggctttg tatcaaacat tttgttacac 2940tctgtctgaa atgtaatgtg gagtacttca gcagtatgtg tcatgtattg tgtgtgtctg 3000tgtgtgtgca tgtgcacaca tgtgttttaa tgctgggcac agaaaagtgt tacaagttcc 3060atatcgtaag tccttaaagg ggcagaaata tatgtagcca agtagaattt attacatttt 3120agtgttatta ttttaaaact tactgatact ctttaacctc tcctgcagta atagttttgc 3180tttatttctt actcatttca atttattggg tttgcaaaat tttgtaaact ttttgtgttt 3240ttagcctttg tattttttac agcctagaat cttgcaaagt ctgaatattt tttaaatgtt 3300ctatcttaac tagttcacta atacagtatt tttagcagac agcattttca gacagcattt 3360tcataccaag tttgacttgt ggtctccaat cttactggga aggccctggt agtgtaattc 3420ttttccttat taaaaggtaa ccaagtgcct ctaagtcatg cttatttgta aacaacaaag 3480aagagtatat gtacctgctc aaaatttttt tgataatcgc ttatataatt aatttctaat 3540gatgaggaca tgtaaaagtt gccagtaaga acatagtatg catttaatta aatcaagatg 3600gctaatggaa ttaactttct cccctgttct tgccaggtgg aaatgattta agcatttctc 3660cttgcagttg tattgaagta aattaccata ggcatcaaga tggctgcatc acattttcaa 3720atgattttat attcagttgc tacttataaa gcagcattca aaaagtcttt tacactgtca 3780tgttggacac aagcagactc agcttttatc aaaacttgtt taaataaaaa attgacagta 3840gctgggttat taaattatgc aactgaaact cctgaattat atcttttctg tatcccttaa 3900taagattgga gaccactgcc gtttaggata atacaataat aaaacgtttt aatcagtact 3960aaaactttaa ttaagccaat aatgatgcat gcctgttgta gctgacagca tgggtcagta 4020catccttcag cgagtgcctt actctaattg aaaccaagca cacgtaaggt acaatatgtt 4080agactctgtg attttgtttt caaaatcctc tgttatggct atatttaaat ttattttaaa 4140tattcctgta tgtattcatc taagcatttg ggcatttgga gtcttaatat acaagaaaca 4200cgtacttaaa tttttatgct tatcaccgca atgatggcaa acagtgattt tttttttcat 4260agtttaggtg tcattgttgc cagcaccttt agtgctcagt cttcagtgaa aaatataaag 4320tgccaaaaaa atcttgcaag acagaatcca tacttaacac tctttccaag acactgtgac 4380catgtacagt agctatttcc tgatgaccaa atctctcaac gaatcatgtt attaataaat 4440atttttagca ctcatc 445618336DNAHomo sapiens 18atgatgtcca tccggcaaag aagagaaata agagccacag aagtttctga agactttcca 60gcccaagaag aaaatgtgaa gttggaaaat aaattgccat ctggttgtac cagtagaaga 120ttatggaaga ttttgtcatt gacaattggt ggaaccccct ttgcattgga cttcttacat 180ctgtctacct tgccacgtta catgaaaatg atttatggtt ttctaatatt aaggaagtgg 240agcgagaaat ctcattcaga acagagtgtg gcctgtatta ctcctactac aagcagatgc 300tgcaggctcc aaccctcgtg caaggtaatt acaact 33619173PRTHomo sapiens 19Met Thr Leu Cys His Arg Asp Ser Phe Gly Ser Trp His Leu Phe His 1 5 10 15 Leu Leu Leu Leu Glu Tyr Met Ile His Ile Leu Gln Ser Cys Leu Glu 20 25 30 Glu Glu Glu Glu Glu Glu Asp Met Gly Thr Val Lys Glu Met Leu Pro 35 40 45 Asp Asp Pro Thr Leu Gly Gln Pro Asp Gln Ala Leu Phe His Ser Leu 50 55 60 Asn Ser Ser Leu Ser Gln Ala Cys Ala Ser Pro Ser Met Glu Pro Leu 65 70 75 80 Gly Val Met Pro Thr His Met Gly Gln Gly Arg Tyr Pro Val Gly Val 85 90 95 Ser Asn Met Val Leu Arg Ile Leu Gly Phe Leu Val Asp Thr Ala Met 100 105 110 Gly Asn Lys Leu Ile Gln Val Leu Leu Glu Asp Glu Thr Thr Glu Ser 115 120 125 Ala Val Lys Leu Ser Leu Pro Met Gly Gln Glu Ala Leu Ile Thr Leu 130 135 140 Lys Asp Gly Gln Gln Phe Val Ile Gln Ile Ser Asp Val Pro Gln Ser 145 150 155 160 Ser Glu Asp Ile Tyr Phe Arg Glu Asn Asn Ala Asn Val 165 170 201907DNAHomo sapiens 20ataactccaa cgctcaagca agtcaaggac acccacggac tcaacaccgc gaccagattg 60gaaaaggtgt tggtcgacaa cttctgcatt tgcgaagagt gcagcgtccc tcgctgtctc 120atgtatgaga tttacgtgga gacctgtggg caaaacactg agaaccaagt caacccggcc 180acctttggga agcttgtgag attggttttt ccggaccttg gcacccggag gctgggcact 240agaggaagtg ccaggtatca ttatgatgga atctgtatca agaaaagctc tttcttctat 300gcccagtatt gctacctgat aggtgaaaaa aggtatcaca gtggagatgc cattgccttt 360gaaaaatcta ctaattataa cagcattatc caacaagaag caacatgtga agatcattca 420ccgatgaaga cagacccagt tggatcccct ttgtctgaat tcaggagatg tccatttctg 480gagcaagaac aggcaaagaa atactcctgt aatatgatgg ccttccttgc tgacgaatac 540tgcaactatt gtcgagacat tttacgaaat gtgaggaact gagaacttga gagggtggag 600gacttgctta cttccttctg gaagtctctg cagcaagaca cagtcatgct gatgtcattg 660cctgacgtgt gccagctctt taaatgctac gacgtccagc tgtacaaggg aattgaggat 720gttctccttc atgacttctt ggaagatgtt tctattcagt acctgaaatc tgtgcagtta 780tttagtaaga aatttaagct gtggctcctt aatgctttgg aaggtgttcc agccctcttg 840cagatctcca aactcaaaga ctatgcgaat ggtattgaaa agtaagaggc gtgtcagcgt 900tttgaagtca gatctacagg ccatcatcaa tcaaggcact ttggctactt ctaagaaagc 960cctggcaagt gaccggagtg gcgcagatga actggagaac aacccagaga tgaaatgttt 1020aagaaactta atttctttgc tgggaacatc aacagatctc agggtattcc tcagctgtct 1080gtcttcacat ctccaagcat ttgtgttcca gacaagcaga agcaaagaag agtttaccaa 1140attggccgcc agcttccagc tgagatggaa tcttcttctc actgctgtaa gcaaagccat 1200gaccctctgc cacagagata gttttggctc ctggcatctg tttcacttgt tgcttttgga 1260atatatgatt catatacttc agtcatgcct agaggaggaa gaggaggagg aggacatggg 1320gactgtcaag gaaatgctac cagatgaccc gactctcggc cagccagacc aggcactttt 1380ccattctctg aattcctcac tgtcgcaggc gtgtgccagc cccagcatgg agccactggg 1440ggtgatgccc acacacatgg gccagggccg atatcccgtg ggtgtgagca acatggtcct 1500caggatcctg ggcttcctgg tggacactgc catgggcaat aagctcatcc aggtgctgtt 1560ggaagatgaa accactgaaa gcgcagttaa actcagcctt cctatgggac aagaagccct 1620cataacccta aaagatggac aacaatttgt gattcagata tcagatgtac cccaaagctc 1680tgaagatatt tatttcagag aaaacaatgc taatgtgtga gattatttat ttgaatagag 1740aataagaaaa ctgatagact tgcattctta aaaatattaa atactaaagt ttttctattg 1800acgaaagatg atgttatgta tataatagat gtagcattgt ctattttatg tttatatgta 1860tttcaaggag gtggtttcga taaaatatgt aaactgattt ggagaat 1907211328PRTHomo sapiens 21Met Ser Ser Gly Asp Pro Ala His Leu Gly Leu Cys Leu Trp Leu Trp 1 5 10 15 Leu Gly Ala Thr Leu Gly Arg Glu Gln Val Gln Ala Ser Gly Leu Leu 20 25 30 Arg Leu Ala Val Leu Pro Glu Asp Arg Leu Gln Met Lys Trp Arg Glu 35 40 45 Ser Glu Gly Ser Gly Leu Gly Tyr Leu Val Gln Val Lys Pro Met Ala 50 55 60 Gly Asp Ser Glu Gln Glu Val Ile Leu Thr Thr Lys Thr Pro Lys Ala 65 70 75 80 Thr Val Gly Gly Leu Ser Pro Ser Lys Gly Tyr Thr Leu Gln Ile Phe 85 90 95 Glu Leu Thr Gly Ser Gly Arg Phe Leu Leu Ala

Arg Arg Glu Phe Val 100 105 110 Ile Glu Asp Leu Lys Ser Ser Ser Leu Asp Arg Ser Ser Gln Arg Pro 115 120 125 Leu Gly Ser Gly Ala Pro Glu Pro Thr Pro Ser His Thr Gly Ser Pro 130 135 140 Asp Pro Glu Gln Ala Ser Glu Pro Gln Val Ala Phe Thr Pro Ser Gln 145 150 155 160 Asp Pro Arg Thr Pro Gly Gly Ser Glu Trp Arg Glu Thr Gly Pro Gln 165 170 175 Phe Arg Cys Leu Pro Pro Val Pro Ala Asp Met Val Phe Leu Val Asp 180 185 190 Gly Ser Trp Ser Ile Gly His Ser His Phe Gln Gln Val Lys Asp Phe 195 200 205 Leu Ala Ser Val Ile Ala Pro Phe Glu Ile Gly Pro Asp Lys Val Gln 210 215 220 Val Gly Leu Thr Gln Tyr Ser Gly Asp Ala Gln Thr Glu Trp Asp Leu 225 230 235 240 Asn Ser Leu Ser Thr Lys Glu Gln Val Leu Ala Ala Val Arg Arg Leu 245 250 255 Arg Tyr Lys Gly Gly Asn Thr Phe Thr Gly Leu Ala Leu Thr His Val 260 265 270 Leu Gly Gln Asn Leu Gln Pro Ala Ala Gly Leu Arg Pro Glu Ala Ala 275 280 285 Lys Val Val Ile Leu Val Thr Asp Gly Lys Ser Gln Asp Asp Val His 290 295 300 Thr Ala Ala Arg Val Leu Lys Asp Leu Gly Val Asn Val Phe Ala Val 305 310 315 320 Gly Val Lys Asn Ala Asp Glu Ala Glu Leu Arg Leu Leu Ala Ser Pro 325 330 335 Pro Arg Asp Ile Thr Val His Ser Val Leu Asp Phe Leu Gln Leu Gly 340 345 350 Ala Leu Ala Gly Leu Leu Ser Arg Leu Ile Cys Gln Arg Leu Gln Gly 355 360 365 Gly Ser Pro Arg Gln Gly Pro Ala Ala Ala Pro Ala Leu Asp Thr Leu 370 375 380 Pro Ala Pro Thr Ser Leu Val Leu Ser Gln Val Thr Ser Ser Ser Ile 385 390 395 400 Arg Leu Ser Trp Thr Pro Ala Pro Arg His Pro Leu Lys Tyr Leu Ile 405 410 415 Val Trp Arg Ala Ser Arg Gly Gly Thr Pro Arg Glu Val Val Val Glu 420 425 430 Gly Pro Ala Ala Ser Thr Glu Leu His Asn Leu Ala Ser Arg Thr Glu 435 440 445 Tyr Leu Val Ser Val Phe Pro Ile Tyr Glu Gly Gly Val Gly Glu Gly 450 455 460 Leu Arg Gly Leu Val Thr Thr Ala Pro Leu Pro Pro Pro Arg Ala Leu 465 470 475 480 Thr Leu Ala Ala Val Thr Pro Arg Thr Val His Leu Thr Trp Gln Pro 485 490 495 Ser Ala Gly Ala Thr His Tyr Leu Val Arg Cys Ser Pro Ala Ser Pro 500 505 510 Lys Gly Glu Glu Glu Glu Arg Glu Val Gln Val Gly Arg Pro Glu Val 515 520 525 Leu Leu Asp Gly Leu Glu Pro Gly Arg Asp Tyr Glu Val Ser Val Gln 530 535 540 Ser Leu Arg Gly Pro Glu Gly Ser Glu Ala Arg Gly Ile Arg Ala Arg 545 550 555 560 Thr Pro Thr Leu Ala Pro Pro Arg His Leu Gly Phe Ser Asp Val Ser 565 570 575 His Asp Ala Ala Arg Val Phe Trp Glu Gly Ala Pro Arg Pro Val Arg 580 585 590 Leu Val Arg Val Thr Tyr Val Ser Ser Glu Gly Gly His Ser Gly Gln 595 600 605 Thr Glu Ala Pro Gly Asn Ala Thr Ser Ala Thr Leu Gly Pro Leu Ser 610 615 620 Ser Ser Thr Thr Tyr Thr Val Arg Val Thr Cys Leu Tyr Pro Gly Gly 625 630 635 640 Gly Ser Ser Thr Leu Thr Gly Arg Val Thr Thr Lys Lys Ala Pro Ser 645 650 655 Pro Ser Gln Leu Ser Met Thr Glu Leu Pro Gly Asp Ala Val Gln Leu 660 665 670 Ala Trp Val Ala Ala Ala Pro Ser Gly Val Leu Val Tyr Gln Ile Thr 675 680 685 Trp Thr Pro Leu Gly Glu Gly Lys Ala His Glu Ile Ser Val Pro Gly 690 695 700 Asn Leu Gly Thr Ala Val Leu Pro Gly Leu Gly Arg His Thr Glu Tyr 705 710 715 720 Asp Val Thr Ile Leu Ala Tyr Tyr Arg Asp Gly Ala Arg Ser Asp Pro 725 730 735 Val Ser Leu Arg Tyr Thr Pro Ser Thr Val Ser Arg Ser Pro Pro Ser 740 745 750 Asn Leu Ala Leu Ala Ser Glu Thr Pro Asp Ser Leu Gln Val Ser Trp 755 760 765 Thr Pro Pro Leu Gly Arg Val Leu His Tyr Trp Leu Thr Tyr Ala Pro 770 775 780 Ala Ser Gly Leu Gly Pro Glu Lys Ser Val Ser Val Pro Gly Ala Arg 785 790 795 800 Ser His Val Thr Leu Pro Asp Leu Gln Ala Ala Thr Lys Tyr Arg Val 805 810 815 Leu Val Ser Ala Ile Tyr Ala Ala Gly Arg Ser Glu Ala Val Ser Ala 820 825 830 Thr Gly Gln Thr Ala Cys Pro Ala Leu Arg Pro Asp Gly Ser Leu Pro 835 840 845 Gly Phe Asp Leu Met Val Ala Phe Ser Leu Val Glu Lys Ala Tyr Ala 850 855 860 Ser Ile Arg Gly Val Ala Met Glu Pro Ser Ala Phe Gly Gly Thr Pro 865 870 875 880 Thr Phe Thr Leu Phe Lys Asp Ala Gln Leu Thr Arg Arg Val Ser Asp 885 890 895 Val Tyr Pro Ala Pro Leu Pro Pro Glu His Thr Ile Val Phe Leu Val 900 905 910 Arg Leu Leu Pro Glu Thr Pro Arg Glu Ala Phe Ala Leu Trp Gln Met 915 920 925 Thr Ala Glu Asp Phe Gln Pro Leu Leu Gly Val Leu Leu Asp Ala Gly 930 935 940 Lys Lys Ser Leu Thr Tyr Phe His Arg Asp Pro Arg Ala Ala Leu Gln 945 950 955 960 Glu Ala Thr Phe Asp Pro Gln Glu Val Arg Lys Ile Phe Phe Gly Ser 965 970 975 Phe His Lys Val His Val Ala Val Gly Arg Ser Lys Val Arg Leu Tyr 980 985 990 Val Asp Cys Arg Lys Val Ala Glu Arg Pro Leu Gly Glu Met Gly Ser 995 1000 1005 Pro Pro Ala Ala Gly Phe Val Thr Leu Gly Arg Leu Ala Lys Ala 1010 1015 1020 Arg Gly Pro Arg Ser Ser Ser Ala Ala Phe Gln Leu Gln Met Leu 1025 1030 1035 Gln Ile Val Cys Ser Asp Thr Trp Ala Asp Glu Asp Arg Cys Cys 1040 1045 1050 Glu Leu Pro Ala Ser Arg Asp Gly Glu Thr Cys Pro Ala Phe Val 1055 1060 1065 Ser Ala Cys Ser Cys Ser Ser Glu Thr Pro Gly Pro Pro Gly Pro 1070 1075 1080 Gln Gly Pro Pro Gly Leu Pro Gly Arg Asn Gly Thr Pro Gly Glu 1085 1090 1095 Gln Gly Phe Pro Gly Pro Arg Gly Glu Pro Gly Pro Pro Gly Gln 1100 1105 1110 Met Gly Pro Glu Gly Pro Gly Gly Gln Gln Gly Ser Pro Gly Thr 1115 1120 1125 Gln Gly Arg Ala Val Gln Gly Pro Val Gly Pro Pro Gly Val Lys 1130 1135 1140 Gly Glu Lys Gly Asp His Gly Leu Pro Gly Leu Gln Gly His Pro 1145 1150 1155 Gly His Gln Gly Ile Pro Gly Arg Val Gly Leu Gln Gly Pro Lys 1160 1165 1170 Gly Met Arg Gly Leu Glu Gly Thr Ala Gly Leu Pro Gly Pro Gly 1175 1180 1185 Pro Arg Gly Phe Gln Gly Met Ala Gly Ala Arg Gly Thr Ser Gly 1190 1195 1200 Glu Arg Gly Pro Pro Gly Thr Val Gly Pro Thr Gly Leu Pro Gly 1205 1210 1215 Pro Lys Gly Glu Arg Gly Glu Lys Gly Glu Pro Gln Ser Leu Ala 1220 1225 1230 Thr Leu Tyr Gln Leu Val Ser Gln Ala Cys Glu Ser Ala Ile Gln 1235 1240 1245 Thr His Val Ser Lys Phe Asp Ser Phe His Glu Asn Thr Arg Pro 1250 1255 1260 Pro Met Pro Ile Leu Glu Gln Lys Leu Glu Pro Gly Thr Glu Pro 1265 1270 1275 Leu Gly Ser Pro Gly Thr Arg Ser Lys Ala Leu Val Pro Gly Glu 1280 1285 1290 Trp Gly Arg Gly Gly Arg His Leu Glu Gly Arg Gly Glu Pro Gly 1295 1300 1305 Ala Val Gly Gln Met Gly Ser Pro Gly Gln Gln Gly Ala Ser Thr 1310 1315 1320 Gln Gly Leu Trp Glu 1325 221278PRTHomo sapiens 22Met Ser Ser Gly Asp Pro Ala His Leu Gly Leu Cys Leu Trp Leu Trp 1 5 10 15 Leu Gly Ala Thr Leu Gly Arg Glu Gln Val Gln Ala Ser Gly Leu Leu 20 25 30 Arg Leu Ala Val Leu Pro Glu Asp Arg Leu Gln Met Lys Trp Arg Glu 35 40 45 Ser Glu Gly Ser Gly Leu Gly Tyr Leu Val Gln Val Lys Pro Met Ala 50 55 60 Gly Asp Ser Glu Gln Glu Val Ile Leu Thr Thr Lys Thr Pro Lys Ala 65 70 75 80 Thr Val Gly Gly Leu Ser Pro Ser Lys Gly Tyr Thr Leu Gln Ile Phe 85 90 95 Glu Leu Thr Gly Ser Gly Arg Phe Leu Leu Ala Arg Arg Glu Phe Val 100 105 110 Ile Glu Asp Leu Lys Ser Ser Ser Leu Asp Arg Ser Ser Gln Arg Pro 115 120 125 Leu Gly Ser Gly Ala Pro Glu Pro Thr Pro Ser His Thr Gly Ser Pro 130 135 140 Asp Pro Glu Gln Ala Ser Glu Pro Gln Val Ala Phe Thr Pro Ser Gln 145 150 155 160 Asp Pro Arg Thr Pro Gly Gly Ser Glu Trp Arg Glu Thr Gly Pro Gln 165 170 175 Phe Arg Cys Leu Pro Pro Val Pro Ala Asp Met Val Phe Leu Val Asp 180 185 190 Gly Ser Trp Ser Ile Gly His Ser His Phe Gln Gln Val Lys Asp Phe 195 200 205 Leu Ala Ser Val Ile Ala Pro Phe Glu Ile Gly Pro Asp Lys Val Gln 210 215 220 Val Gly Leu Thr Gln Tyr Ser Gly Asp Ala Gln Thr Glu Trp Asp Leu 225 230 235 240 Asn Ser Leu Ser Thr Lys Glu Gln Val Leu Ala Ala Val Arg Arg Leu 245 250 255 Arg Tyr Lys Gly Gly Asn Thr Phe Thr Gly Leu Ala Leu Thr His Val 260 265 270 Leu Gly Gln Asn Leu Gln Pro Ala Ala Gly Leu Arg Pro Glu Ala Ala 275 280 285 Lys Val Val Ile Leu Val Thr Asp Gly Lys Ser Gln Asp Asp Val His 290 295 300 Thr Ala Ala Arg Val Leu Lys Asp Leu Gly Val Asn Val Phe Ala Val 305 310 315 320 Gly Val Lys Asn Ala Asp Glu Ala Glu Leu Arg Leu Leu Ala Ser Pro 325 330 335 Pro Arg Asp Ile Thr Val His Ser Val Leu Asp Phe Leu Gln Leu Gly 340 345 350 Ala Leu Ala Gly Leu Leu Ser Arg Leu Ile Cys Gln Arg Leu Gln Gly 355 360 365 Gly Ser Pro Arg Gln Gly Pro Ala Ala Ala Pro Ala Leu Asp Thr Leu 370 375 380 Pro Ala Pro Thr Ser Leu Val Leu Ser Gln Val Thr Ser Ser Ser Ile 385 390 395 400 Arg Leu Ser Trp Thr Pro Ala Pro Arg His Pro Leu Lys Tyr Leu Ile 405 410 415 Val Trp Arg Ala Ser Arg Gly Gly Thr Pro Arg Glu Val Val Val Glu 420 425 430 Gly Pro Ala Ala Ser Thr Glu Leu His Asn Leu Ala Ser Arg Thr Glu 435 440 445 Tyr Leu Val Ser Val Phe Pro Ile Tyr Glu Gly Gly Val Gly Glu Gly 450 455 460 Leu Arg Gly Leu Val Thr Thr Ala Pro Leu Pro Pro Pro Arg Ala Leu 465 470 475 480 Thr Leu Ala Ala Val Thr Pro Arg Thr Val His Leu Thr Trp Gln Pro 485 490 495 Ser Ala Gly Ala Thr His Tyr Leu Val Arg Cys Ser Pro Ala Ser Pro 500 505 510 Lys Gly Glu Glu Glu Glu Arg Glu Val Gln Val Gly Arg Pro Glu Val 515 520 525 Leu Leu Asp Gly Leu Glu Pro Gly Arg Asp Tyr Glu Val Ser Val Gln 530 535 540 Ser Leu Arg Gly Pro Glu Gly Ser Glu Ala Arg Gly Ile Arg Ala Arg 545 550 555 560 Thr Pro Thr Leu Ala Pro Pro Arg His Leu Gly Phe Ser Asp Val Ser 565 570 575 His Asp Ala Ala Arg Val Phe Trp Glu Gly Ala Pro Arg Pro Val Arg 580 585 590 Leu Val Arg Val Thr Tyr Val Ser Ser Glu Gly Gly His Ser Gly Gln 595 600 605 Thr Glu Ala Pro Gly Asn Ala Thr Ser Ala Thr Leu Gly Pro Leu Ser 610 615 620 Ser Ser Thr Thr Tyr Thr Val Arg Val Thr Cys Leu Tyr Pro Gly Gly 625 630 635 640 Gly Ser Ser Thr Leu Thr Gly Arg Val Thr Thr Lys Lys Ala Pro Ser 645 650 655 Pro Ser Gln Leu Ser Met Thr Glu Leu Pro Gly Asp Ala Val Gln Leu 660 665 670 Ala Trp Val Ala Ala Ala Pro Ser Gly Val Leu Val Tyr Gln Ile Thr 675 680 685 Trp Thr Pro Leu Gly Glu Gly Lys Ala His Glu Ile Ser Val Pro Gly 690 695 700 Asn Leu Gly Thr Ala Val Leu Pro Gly Leu Gly Arg His Thr Glu Tyr 705 710 715 720 Asp Val Thr Ile Leu Ala Tyr Tyr Arg Asp Gly Ala Arg Ser Asp Pro 725 730 735 Val Ser Leu Arg Tyr Thr Pro Ser Thr Val Ser Arg Ser Pro Pro Ser 740 745 750 Asn Leu Ala Leu Ala Ser Glu Thr Pro Asp Ser Leu Gln Val Ser Trp 755 760 765 Thr Pro Pro Leu Gly Arg Val Leu His Tyr Trp Leu Thr Tyr Ala Pro 770 775 780 Ala Ser Gly Leu Gly Pro Glu Lys Ser Val Ser Val Pro Gly Ala Arg 785 790 795 800 Ser His Val Thr Leu Pro Asp Leu Gln Ala Ala Thr Lys Tyr Arg Val 805 810 815 Leu Val Ser Ala Ile Tyr Ala Ala Gly Arg Ser Glu Ala Val Ser Ala 820 825 830 Thr Gly Gln Thr Ala Cys Pro Ala Leu Arg Pro Asp Gly Ser Leu Pro 835 840 845 Gly Phe Asp Leu Met Val Ala Phe Ser Leu Val Glu Lys Ala Tyr Ala 850 855 860 Ser Ile Arg Gly Val Ala Met Glu Pro Ser Ala Phe Gly Gly Thr Pro 865 870 875 880 Thr Phe Thr Leu Phe Lys Asp Ala Gln Leu Thr Arg Arg Val Ser Asp 885 890 895 Val Tyr Pro Ala Pro Leu Pro Pro Glu His Thr Ile Val Phe Leu Val 900 905 910 Arg Leu Leu Pro Glu Thr Pro Arg Glu Ala Phe Ala Leu Trp Gln Met 915 920 925 Thr Ala Glu Asp Phe Gln Pro Leu Leu Gly Val Leu Leu Asp Ala Gly 930 935 940 Lys Lys Ser Leu Thr Tyr Phe His Arg Asp Pro Arg Ala Ala Leu Gln 945 950 955 960 Glu Ala Thr Phe Asp Pro Gln Glu Val Arg Lys Ile Phe Phe Gly Ser 965 970 975 Phe His Lys Val His Val Ala Val Gly Arg Ser Lys Val Arg Leu Tyr 980 985 990 Val Asp Cys Arg Lys Val Ala Glu Arg Pro Leu Gly Glu Met Gly Ser 995 1000 1005 Pro Pro Ala Ala Gly Phe Val Thr Leu Gly Arg Leu Ala Lys Ala 1010 1015 1020 Arg Gly Pro Arg Ser Ser Ser Ala Ala Phe Gln Leu Gln Met Leu 1025 1030 1035 Gln Ile Val Cys Ser Asp Thr Trp Ala Asp Glu Asp Arg Cys Cys 1040 1045 1050 Glu Leu Pro Ala Ser Arg Asp Gly

Glu Thr Cys Pro Ala Phe Val 1055 1060 1065 Ser Ala Cys Ser Cys Ser Ser Glu Thr Pro Gly Pro Pro Gly Pro 1070 1075 1080 Gln Gly Pro Pro Gly Pro Pro Gly Val Lys Gly Glu Lys Gly Asp 1085 1090 1095 His Gly Leu Pro Gly Leu Gln Gly His Pro Gly His Gln Gly Ile 1100 1105 1110 Pro Gly Arg Val Gly Leu Gln Gly Pro Lys Gly Met Arg Gly Leu 1115 1120 1125 Glu Gly Thr Ala Gly Leu Pro Gly Pro Gly Pro Arg Gly Phe Gln 1130 1135 1140 Gly Met Ala Gly Ala Arg Gly Thr Ser Gly Glu Arg Gly Pro Pro 1145 1150 1155 Gly Thr Val Gly Pro Thr Gly Leu Pro Gly Pro Lys Gly Glu Arg 1160 1165 1170 Gly Glu Lys Gly Glu Pro Gln Ser Leu Ala Thr Leu Tyr Gln Leu 1175 1180 1185 Val Ser Gln Ala Cys Glu Ser Ala Ile Gln Thr His Val Ser Lys 1190 1195 1200 Phe Asp Ser Phe His Glu Asn Thr Arg Pro Pro Met Pro Ile Leu 1205 1210 1215 Glu Gln Lys Leu Glu Pro Gly Thr Glu Pro Leu Gly Ser Pro Gly 1220 1225 1230 Thr Arg Ser Lys Ala Leu Val Pro Gly Glu Trp Gly Arg Gly Gly 1235 1240 1245 Arg His Leu Glu Gly Arg Gly Glu Pro Gly Ala Val Gly Gln Met 1250 1255 1260 Gly Ser Pro Gly Gln Gln Gly Ala Ser Thr Gln Gly Leu Trp Glu 1265 1270 1275 231283PRTHomo sapiens 23Met Ser Ser Gly Asp Pro Ala His Leu Gly Leu Cys Leu Trp Leu Trp 1 5 10 15 Leu Gly Ala Thr Leu Gly Arg Glu Gln Val Gln Ala Ser Gly Leu Leu 20 25 30 Arg Leu Ala Val Leu Pro Glu Asp Arg Leu Gln Met Lys Trp Arg Glu 35 40 45 Ser Glu Gly Ser Gly Leu Gly Tyr Leu Val Gln Val Lys Pro Met Ala 50 55 60 Gly Asp Ser Glu Gln Glu Val Ile Leu Thr Thr Lys Thr Pro Lys Ala 65 70 75 80 Thr Val Gly Gly Leu Ser Pro Ser Lys Gly Tyr Thr Leu Gln Ile Phe 85 90 95 Glu Leu Thr Gly Ser Gly Arg Phe Leu Leu Ala Arg Arg Glu Phe Val 100 105 110 Ile Glu Asp Leu Lys Ser Ser Ser Leu Asp Arg Ser Ser Gln Arg Pro 115 120 125 Leu Gly Ser Gly Ala Pro Glu Pro Thr Pro Ser His Thr Gly Ser Pro 130 135 140 Asp Pro Glu Gln Ala Ser Glu Pro Gln Val Ala Phe Thr Pro Ser Gln 145 150 155 160 Asp Pro Arg Thr Pro Ala Gly Pro Gln Phe Arg Cys Leu Pro Pro Val 165 170 175 Pro Ala Asp Met Val Phe Leu Val Asp Gly Ser Trp Ser Ile Gly His 180 185 190 Ser His Phe Gln Gln Val Lys Asp Phe Leu Ala Ser Val Ile Ala Pro 195 200 205 Phe Glu Ile Gly Pro Asp Lys Val Gln Val Gly Leu Thr Gln Tyr Ser 210 215 220 Gly Asp Ala Gln Thr Glu Trp Asp Leu Asn Ser Leu Ser Thr Lys Glu 225 230 235 240 Gln Val Leu Ala Ala Val Arg Arg Leu Arg Tyr Lys Gly Gly Asn Thr 245 250 255 Phe Thr Gly Leu Ala Leu Thr His Val Leu Gly Gln Asn Leu Gln Pro 260 265 270 Ala Ala Gly Leu Arg Pro Glu Ala Ala Lys Val Val Ile Leu Val Thr 275 280 285 Asp Gly Lys Ser Gln Asp Asp Val His Thr Ala Ala Arg Val Leu Lys 290 295 300 Asp Leu Gly Val Asn Val Phe Ala Val Gly Val Lys Asn Ala Asp Glu 305 310 315 320 Ala Glu Leu Arg Leu Leu Ala Ser Pro Pro Arg Asp Ile Thr Val His 325 330 335 Ser Val Leu Asp Phe Leu Gln Leu Gly Ala Leu Ala Gly Leu Leu Ser 340 345 350 Arg Leu Ile Cys Gln Arg Leu Gln Gly Gly Ser Pro Arg Gln Gly Pro 355 360 365 Ala Ala Ala Pro Ala Leu Asp Thr Leu Pro Ala Pro Thr Ser Leu Val 370 375 380 Leu Ser Gln Val Thr Ser Ser Ser Ile Arg Leu Ser Trp Thr Pro Ala 385 390 395 400 Pro Arg His Pro Leu Lys Tyr Leu Ile Val Trp Arg Ala Ser Arg Gly 405 410 415 Gly Thr Pro Arg Glu Val Val Val Glu Gly Pro Ala Ala Ser Thr Glu 420 425 430 Leu His Asn Leu Ala Ser Arg Thr Glu Tyr Leu Val Ser Val Phe Pro 435 440 445 Ile Tyr Glu Gly Gly Val Gly Glu Gly Leu Arg Gly Leu Val Thr Thr 450 455 460 Ala Pro Leu Pro Pro Pro Arg Ala Leu Thr Leu Ala Ala Val Thr Pro 465 470 475 480 Arg Thr Val His Leu Thr Trp Gln Pro Ser Ala Gly Ala Thr His Tyr 485 490 495 Leu Val Arg Cys Ser Pro Ala Ser Pro Lys Gly Glu Glu Glu Glu Arg 500 505 510 Glu Val Gln Val Gly Arg Pro Glu Val Leu Leu Asp Gly Leu Glu Pro 515 520 525 Gly Arg Asp Tyr Glu Val Ser Val Gln Ser Leu Arg Gly Pro Glu Gly 530 535 540 Ser Glu Ala Arg Gly Ile Arg Ala Arg Thr Pro Thr Leu Ala Pro Pro 545 550 555 560 Arg His Leu Gly Phe Ser Asp Val Ser His Asp Ala Ala Arg Val Phe 565 570 575 Trp Glu Gly Ala Pro Arg Pro Val Arg Leu Val Arg Val Thr Tyr Val 580 585 590 Ser Ser Glu Gly Gly His Ser Gly Gln Thr Glu Ala Pro Gly Asn Ala 595 600 605 Thr Ser Ala Thr Leu Gly Pro Leu Ser Ser Ser Thr Thr Tyr Thr Val 610 615 620 Arg Val Thr Cys Leu Tyr Pro Gly Gly Gly Ser Ser Thr Leu Thr Gly 625 630 635 640 Arg Val Thr Thr Lys Lys Ala Pro Ser Pro Ser Gln Leu Ser Met Thr 645 650 655 Glu Leu Pro Gly Asp Ala Val Gln Leu Ala Trp Val Ala Ala Ala Pro 660 665 670 Ser Gly Val Leu Val Tyr Gln Ile Thr Trp Thr Pro Leu Gly Glu Gly 675 680 685 Lys Ala His Glu Ile Ser Val Pro Gly Asn Leu Gly Thr Ala Val Leu 690 695 700 Pro Gly Leu Gly Arg His Thr Glu Tyr Asp Val Thr Ile Leu Ala Tyr 705 710 715 720 Tyr Arg Asp Gly Ala Arg Ser Asp Pro Val Ser Leu Arg Tyr Thr Pro 725 730 735 Ser Thr Val Ser Arg Ser Pro Pro Ser Asn Leu Ala Leu Ala Ser Glu 740 745 750 Thr Pro Asp Ser Leu Gln Val Ser Trp Thr Pro Pro Leu Gly Arg Val 755 760 765 Leu His Tyr Trp Leu Thr Tyr Ala Pro Ala Ser Gly Leu Gly Pro Glu 770 775 780 Lys Ser Val Ser Val Pro Gly Ala Arg Ser His Val Thr Leu Pro Asp 785 790 795 800 Leu Gln Ala Ala Thr Lys Tyr Arg Val Leu Val Ser Ala Ile Tyr Ala 805 810 815 Ala Gly Arg Ser Glu Ala Val Ser Ala Thr Gly Gln Thr Ala Cys Pro 820 825 830 Ala Leu Arg Pro Asp Gly Ser Leu Pro Gly Phe Asp Leu Met Val Ala 835 840 845 Phe Ser Leu Val Glu Lys Ala Tyr Ala Ser Ile Arg Gly Val Ala Met 850 855 860 Glu Pro Ser Ala Phe Gly Gly Thr Pro Thr Phe Thr Leu Phe Lys Asp 865 870 875 880 Ala Gln Leu Thr Arg Arg Val Ser Asp Val Tyr Pro Ala Pro Leu Pro 885 890 895 Pro Glu His Thr Ile Val Phe Leu Val Arg Leu Leu Pro Glu Thr Pro 900 905 910 Arg Glu Ala Phe Ala Leu Trp Gln Met Thr Ala Glu Asp Phe Gln Pro 915 920 925 Leu Leu Gly Val Leu Leu Asp Ala Gly Lys Lys Ser Leu Thr Tyr Phe 930 935 940 His Arg Asp Pro Arg Ala Ala Leu Gln Glu Ala Thr Phe Asp Pro Gln 945 950 955 960 Glu Val Arg Lys Ile Phe Phe Gly Ser Phe His Lys Val His Val Ala 965 970 975 Val Gly Arg Ser Lys Val Arg Leu Tyr Val Asp Cys Arg Lys Val Ala 980 985 990 Glu Arg Pro Leu Gly Glu Met Gly Ser Pro Pro Ala Ala Gly Phe Val 995 1000 1005 Thr Leu Gly Arg Leu Ala Lys Ala Arg Gly Pro Arg Ser Ser Ser 1010 1015 1020 Ala Ala Phe Gln Leu Gln Met Leu Gln Ile Val Cys Ser Asp Thr 1025 1030 1035 Trp Ala Asp Glu Asp Arg Cys Cys Glu Leu Pro Ala Ser Arg Asp 1040 1045 1050 Gly Glu Thr Cys Pro Ala Phe Val Ser Ala Cys Ser Cys Ser Ser 1055 1060 1065 Glu Thr Pro Gly Pro Pro Gly Pro Gln Gly Pro Pro Gly Leu Pro 1070 1075 1080 Gly Arg Asn Gly Thr Pro Gly Glu Gln Gly Phe Pro Gly Pro Arg 1085 1090 1095 Gly Pro Pro Gly Val Lys Gly Glu Lys Gly Asp His Gly Leu Pro 1100 1105 1110 Gly Leu Gln Gly His Pro Gly His Gln Gly Ile Pro Gly Arg Val 1115 1120 1125 Gly Leu Gln Gly Pro Lys Gly Met Arg Gly Leu Glu Gly Thr Ala 1130 1135 1140 Gly Leu Pro Gly Pro Gly Pro Arg Gly Phe Gln Gly Met Ala Gly 1145 1150 1155 Ala Arg Gly Thr Ser Gly Glu Arg Gly Pro Pro Gly Thr Val Gly 1160 1165 1170 Pro Thr Gly Leu Pro Gly Pro Lys Gly Glu Arg Gly Glu Lys Gly 1175 1180 1185 Glu Pro Gln Ser Leu Ala Thr Leu Tyr Gln Leu Val Ser Gln Ala 1190 1195 1200 Ser His Val Ser Lys Phe Asp Ser Phe His Glu Asn Thr Arg Pro 1205 1210 1215 Pro Met Pro Ile Leu Glu Gln Lys Leu Glu Pro Gly Thr Glu Pro 1220 1225 1230 Leu Gly Ser Pro Gly Thr Arg Ser Lys Ala Leu Val Pro Gly Glu 1235 1240 1245 Trp Gly Arg Gly Gly Arg His Leu Glu Gly Arg Gly Glu Pro Gly 1250 1255 1260 Ala Val Gly Gln Met Gly Ser Pro Gly Gln Gln Gly Ala Ser Thr 1265 1270 1275 Gln Gly Leu Trp Glu 1280 243987DNAHomo sapiens 24atgagctccg gagaccctgc acacctcggc ctctgcctct ggctgtggct gggcgccacc 60ctgggaagag agcaagttca agcaagcggt ctcctgaggc tggctgtgct gcctgaggac 120cggctgcaga tgaagtggag agagtcggag gggagcggcc tcggctacct ggtgcaggtg 180aagcccatgg caggggactc ggaacaggag gtgatactga ccaccaagac ccctaaggcc 240acagtggggg gcctgagccc ctccaagggc tacaccttgc agatcttcga gctcactggc 300tctgggcgct tcctgctagc tcggagggag tttgtgattg aggatctgaa gagtagctcc 360ctggacagga gcagccagag gcccctcggc tctggagccc cggagcccac cccctcccac 420acggggagcc cagaccctga gcaggcttct gagccccaag ttgccttcac accaagccag 480gatccgcgca ctcctggtgg gtcagagtgg agagagaccg gcccccagtt ccgctgcctg 540ccccccgtgc ctgctgacat ggtcttcctg gtggacgggt cctggagcat tggccacagt 600cacttccagc aggtcaagga cttcctggcc agtgtcatcg caccctttga aatcgggccg 660gataaggtcc aagtaggcct gactcagtac agcggggatg ctcagactga gtgggacctg 720aactccctca gcaccaagga acaggtgctg gcagctgtgc gccgcctccg ctacaagggg 780gggaacacgt tcacaggcct tgccctgacc cacgtgctgg ggcagaacct gcagccggcg 840gctggcctcc gtccagaggc agccaaggtg gtgattctgg tgacggacgg caagtcccag 900gacgatgtgc acactgctgc ccgtgtcctc aaggacctgg gcgtgaacgt cttcgctgtg 960ggtgtgaaga acgccgatga ggctgagctg aggctcctgg cgtccccgcc gagggacatc 1020accgtccaca gcgtgctgga cttcctgcag ctcggcgcgc tggctggcct gctcagccgt 1080ctcatctgcc agaggctcca gggtgggagc ccgcggcagg gcccagcagc ggctccagcc 1140ctggacaccc tccctgcccc caccagcctg gtcctgagcc aggtgacctc ctccagcatc 1200cgcctgtcct ggactccagc cccccggcac cccctcaagt atctgatcgt ttggcgagcc 1260tctagaggtg gcacccccag ggaggtggtg gtggagggac ccgccgcctc cacggagctg 1320cacaacctgg cctcccgcac agagtacctg gtctccgtgt tccccatcta tgagggcggg 1380gttggcgaag gcctgcgggg cctggtgacc acagcacctc tgcctccgcc ccgggcgctg 1440accctggccg cagtgacgcc cagaaccgtc cacctcacct ggcagccctc ggccggggcc 1500acccactacc tggtgcgatg ttctcctgct tcccccaagg gtgaagagga ggagcgagag 1560gtgcaggtcg ggcggcccga ggtgctgctg gatggcctgg aacctggcag ggactatgag 1620gtctcggtgc agagcctgcg aggccctgag ggcagcgagg cccggggcat ccgtgccagg 1680acccccaccc tggccccccc gagacacctg ggcttctcag acgtgagcca cgacgcggca 1740cgagtgttct gggagggtgc cccgaggcct gtgcgcctgg tcagggtcac ctatgtgtcc 1800agcgagggtg gacactcggg gcagacagag gctcctggga acgccacctc ggccacgctg 1860gggcctctct cttcctccac cacctacact gtccgtgtca cctgcctcta ccctgggggt 1920ggctcctcta cgctgactgg ccgggtgacc accaagaaag ctcccagccc aagccagctg 1980tccatgacgg agctgccagg ggatgcagtc cagctggcgt gggtggccgc agccccgtct 2040ggcgtgcttg tctaccagat cacgtggacg cccctgggag aggggaaggc tcacgagatc 2100tctgtcccag ggaacctcgg cacggccgtc ctgcctggcc tagggaggca cacagagtac 2160gacgtcacca tcttggccta ctacagggac ggggcccgca gtgaccctgt gtccctccgc 2220tataccccct ccacggtgag caggagccca ccctccaacc tggccctggc ctcggagacc 2280cccgacagcc tgcaggtcag ctggacgccc ccgcttggcc gcgtgctcca ttactggctc 2340acctacgccc ccgcctctgg cttgggaccc gagaaatccg tctctgtgcc aggagccagg 2400agccacgtga cactgcccga cctgcaggca gccacgaagt acagggtcct ggtctcagct 2460atctatgcag caggcaggag tgaggctgtg tctgccacgg gccagacagc ctgcccagcc 2520ctccgccctg acggctccct cccagggttt gacctgatgg tggccttcag cctggtggaa 2580aaggcttatg cgtccatccg gggcgtggcc atggagccct ctgccttcgg tgggaccccg 2640accttcacgc tcttcaagga cgcccagctg acaagacggg tcagtgacgt ctacccagcc 2700cccctacctc cagagcacac catcgtcttc cttgtgcgcc tacttcccga gacaccccgt 2760gaggccttcg cgctgtggca gatgacagcc gaggacttcc agcccctcct tggggttctg 2820ctggatgccg ggaagaagtc cctgacctac ttccaccgtg accccagggc tgccttgcag 2880gaggccacct tcgacccgca ggaagtgagg aagattttct tcgggagctt ccacaaggtg 2940cacgtggctg tgggccgctc caaggtcagg ctctatgtgg actgccggaa ggtggctgag 3000cggccccttg gggagatggg cagcccaccc gctgcgggct tcgtcacgct ggggaggctg 3060gccaaggcca ggggcccccg gagcagttcg gccgcgtttc agctccagat gctgcagatc 3120gtgtgcagtg acacctgggc cgatgaggac cggtgctgtg agctccctgc ctcgagggat 3180ggagagacct gccccgcctt cgtgtctgcc tgttcctgtt cctcagagac ccctgggccc 3240ccaggacctc aaggaccccc aggcctccct gggaggaatg gcaccccagg agagcagggc 3300ttcccagggc ccaggggaga gcccgggcca cccggacaga tgggaccaga aggtcctgga 3360ggccagcagg gctcgccggg gacccagggc cgtgcagtcc aggggcctgt gggtccacca 3420ggggtcaaag gagagaaggg agaccatggg cttccaggct tgcagggcca ccccggccac 3480cagggcatcc ccgggagagt tggcctccag ggaccaaagg gaatgagagg cctggaggga 3540actgctggcc tgcctggacc cggccccagg gggttccagg gcatggcagg ggccaggggc 3600actagtggag agcgaggacc tccagggacc gtggggccca caggactgcc agggcccaaa 3660ggggaacgag gagagaaggg cgagccgcag tcccttgcca ccctctacca gcttgtgagc 3720caggcctgtg agtctgccat tcagacacac gtgtcaaagt tcgactcctt ccacgagaac 3780accaggcccc ccatgcccat cttggagcag aagctggagc cgggcactga gcccctgggg 3840tcccctggca cccgcagcaa ggccctggtt cctggagaat gggggcgtgg tggccgccac 3900cttgagggca gaggggagcc tggagctgtt ggtcagatgg gcagccctgg gcagcagggg 3960gctagcaccc agggcctctg ggagtga 3987253837DNAHomo sapiens 25atgagctccg gagaccctgc acacctcggc ctctgcctct ggctgtggct gggcgccacc 60ctgggaagag agcaagttca agcaagcggt ctcctgaggc tggctgtgct gcctgaggac 120cggctgcaga tgaagtggag agagtcggag gggagcggcc tcggctacct ggtgcaggtg 180aagcccatgg caggggactc ggaacaggag gtgatactga ccaccaagac ccctaaggcc 240acagtggggg gcctgagccc ctccaagggc tacaccttgc agatcttcga gctcactggc 300tctgggcgct tcctgctagc tcggagggag tttgtgattg aggatctgaa gagtagctcc 360ctggacagga gcagccagag gcccctcggc tctggagccc cggagcccac cccctcccac 420acggggagcc cagaccctga gcaggcttct gagccccaag ttgccttcac accaagccag 480gatccgcgca ctcctggtgg gtcagagtgg agagagaccg gcccccagtt ccgctgcctg 540ccccccgtgc ctgctgacat ggtcttcctg gtggacgggt cctggagcat tggccacagt 600cacttccagc aggtcaagga cttcctggcc agtgtcatcg caccctttga aatcgggccg 660gataaggtcc aagtaggcct gactcagtac agcggggatg ctcagactga gtgggacctg 720aactccctca gcaccaagga acaggtgctg gcagctgtgc gccgcctccg ctacaagggg 780gggaacacgt tcacaggcct tgccctgacc cacgtgctgg ggcagaacct gcagccggcg 840gctggcctcc gtccagaggc agccaaggtg gtgattctgg tgacggacgg caagtcccag 900gacgatgtgc acactgctgc ccgtgtcctc aaggacctgg gcgtgaacgt cttcgctgtg 960ggtgtgaaga acgccgatga

ggctgagctg aggctcctgg cgtccccgcc gagggacatc 1020accgtccaca gcgtgctgga cttcctgcag ctcggcgcgc tggctggcct gctcagccgt 1080ctcatctgcc agaggctcca gggtgggagc ccgcggcagg gcccagcagc ggctccagcc 1140ctggacaccc tccctgcccc caccagcctg gtcctgagcc aggtgacctc ctccagcatc 1200cgcctgtcct ggactccagc cccccggcac cccctcaagt atctgatcgt ttggcgagcc 1260tctagaggtg gcacccccag ggaggtggtg gtggagggac ccgccgcctc cacggagctg 1320cacaacctgg cctcccgcac agagtacctg gtctccgtgt tccccatcta tgagggcggg 1380gttggcgaag gcctgcgggg cctggtgacc acagcacctc tgcctccgcc ccgggcgctg 1440accctggccg cagtgacgcc cagaaccgtc cacctcacct ggcagccctc ggccggggcc 1500acccactacc tggtgcgatg ttctcctgct tcccccaagg gtgaagagga ggagcgagag 1560gtgcaggtcg ggcggcccga ggtgctgctg gatggcctgg aacctggcag ggactatgag 1620gtctcggtgc agagcctgcg aggccctgag ggcagcgagg cccggggcat ccgtgccagg 1680acccccaccc tggccccccc gagacacctg ggcttctcag acgtgagcca cgacgcggca 1740cgagtgttct gggagggtgc cccgaggcct gtgcgcctgg tcagggtcac ctatgtgtcc 1800agcgagggtg gacactcggg gcagacagag gctcctggga acgccacctc ggccacgctg 1860gggcctctct cttcctccac cacctacact gtccgtgtca cctgcctcta ccctgggggt 1920ggctcctcta cgctgactgg ccgggtgacc accaagaaag ctcccagccc aagccagctg 1980tccatgacgg agctgccagg ggatgcagtc cagctggcgt gggtggccgc agccccgtct 2040ggcgtgcttg tctaccagat cacgtggacg cccctgggag aggggaaggc tcacgagatc 2100tctgtcccag ggaacctcgg cacggccgtc ctgcctggcc tagggaggca cacagagtac 2160gacgtcacca tcttggccta ctacagggac ggggcccgca gtgaccctgt gtccctccgc 2220tataccccct ccacggtgag caggagccca ccctccaacc tggccctggc ctcggagacc 2280cccgacagcc tgcaggtcag ctggacgccc ccgcttggcc gcgtgctcca ttactggctc 2340acctacgccc ccgcctctgg cttgggaccc gagaaatccg tctctgtgcc aggagccagg 2400agccacgtga cactgcccga cctgcaggca gccacgaagt acagggtcct ggtctcagct 2460atctatgcag caggcaggag tgaggctgtg tctgccacgg gccagacagc ctgcccagcc 2520ctccgccctg acggctccct cccagggttt gacctgatgg tggccttcag cctggtggaa 2580aaggcttatg cgtccatccg gggcgtggcc atggagccct ctgccttcgg tgggaccccg 2640accttcacgc tcttcaagga cgcccagctg acaagacggg tcagtgacgt ctacccagcc 2700cccctacctc cagagcacac catcgtcttc cttgtgcgcc tacttcccga gacaccccgt 2760gaggccttcg cgctgtggca gatgacagcc gaggacttcc agcccctcct tggggttctg 2820ctggatgccg ggaagaagtc cctgacctac ttccaccgtg accccagggc tgccttgcag 2880gaggccacct tcgacccgca ggaagtgagg aagattttct tcgggagctt ccacaaggtg 2940cacgtggctg tgggccgctc caaggtcagg ctctatgtgg actgccggaa ggtggctgag 3000cggccccttg gggagatggg cagcccaccc gctgcgggct tcgtcacgct ggggaggctg 3060gccaaggcca ggggcccccg gagcagttcg gccgcgtttc agctccagat gctgcagatc 3120gtgtgcagtg acacctgggc cgatgaggac cggtgctgtg agctccctgc ctcgagggat 3180ggagagacct gccccgcctt cgtgtctgcc tgttcctgtt cctcagagac ccctgggccc 3240ccaggacctc aaggaccccc aggtccacca ggggtcaaag gagagaaggg agaccatggg 3300cttccaggct tgcagggcca ccccggccac cagggcatcc ccgggagagt tggcctccag 3360ggaccaaagg gaatgagagg cctggaggga actgctggcc tgcctggacc tggccccagg 3420gggttccagg gcatggcagg ggccaggggc actagtggag agcgaggacc tccagggacc 3480gtggggccca caggactgcc agggcccaaa ggggaacgag gagagaaggg cgagccgcag 3540tcccttgcca ccctctacca gcttgtgagc caggcctgtg agtctgccat tcagacacac 3600gtgtcaaagt tcgactcctt ccacgagaac accaggcccc ccatgcccat cttggagcag 3660aagctggagc cgggcactga gcccctgggg tcccctggca cccgcagcaa ggccctggtt 3720cctggagaat gggggcgtgg tggccgccac cttgagggca gaggggagcc tggagctgtt 3780ggtcagatgg gcagccctgg gcagcagggg gctagcaccc agggcctctg ggagtga 3837264168DNAHomo sapiens 26ataagctcca gccttcctgt ggccacagca ggaccagagt ggaccagcac accccaggag 60agaggactgg ggtcccagga gtaggaggag cccgagcacc atgagctccg gagaccctgc 120acacctcggc ctctgcctct ggctgtggct gggcgccacc ctgggaagag agcaagttca 180agcaagcggt ctcctgaggc tggctgtgct gcctgaggac cggctgcaga tgaagtggag 240agagtcggag gggagcggcc tcggctacct ggtgcaggtg aagcccatgg caggggactc 300ggaacaggag gtgatactga ccaccaagac ccctaaggcc acagtggggg gcctgagccc 360ctccaagggc tacaccttgc agatcttcga gctcactggc tctgggcgct tcctgctagc 420tcggagggag tttgtgattg aggatctgaa gagtagctcc ctggacagga gcagccagag 480gcccctcggc tctggagccc cggagcccac cccctcccac acggggagcc cagaccctga 540gcaggcttct gagccccaag ttgccttcac accaagccag gatccgcgca ctcctgccgg 600cccccagttc cgctgcctgc cccccgtgcc tgctgacatg gtcttcctgg tggacgggtc 660ctggagcatt ggccacagtc acttccagca ggtcaaggac ttcctggcca gtgtcatcgc 720accctttgaa atcgggccgg ataaggtcca agtaggcctg actcagtaca gcggggatgc 780tcagactgag tgggacctga actccctcag caccaaggaa caggtgctgg cagctgtgcg 840ccgcctccgc tacaaggggg ggaacacgtt cacaggcctt gccctgaccc acgtgctggg 900gcagaacctg cagccggcgg ctggcctccg tccagaggca gccaaggtgg tgattctggt 960gacggacggc aagtcccagg acgatgtgca cactgctgcc cgtgtcctca aggacctggg 1020cgtgaacgtc ttcgctgtgg gtgtgaagaa cgccgatgag gctgagctga ggctcctggc 1080gtccccgccg agggacatca ccgtccacag cgtgctggac ttcctgcagc tcggcgcgct 1140ggctggcctg ctcagccgtc tcatctgcca gaggctccag ggtgggagcc cgcggcaggg 1200cccagcagcg gctccagccc tggacaccct ccctgccccc accagcctgg tcctgagcca 1260ggtgacctcc tccagcatcc gcctgtcctg gactccagcc ccccggcacc ccctcaagta 1320tctgatcgtt tggcgagcct ctagaggtgg cacccccagg gaggtggtgg tggagggacc 1380cgccgcctcc acggagctgc acaacctggc ctcccgcaca gagtacctgg tctccgtgtt 1440ccccatctat gagggcgggg ttggcgaagg cctgcggggc ctggtgacca cagcacctct 1500gcctccgccc cgggcgctga ccctggccgc agtgacgccc agaaccgtcc acctcacctg 1560gcagccctcg gccggggcca cccactacct ggtgcgatgt tctcctgctt cccccaaggg 1620tgaagaggag gagcgagagg tgcaggtcgg gcggcccgag gtgctgctgg atggcctgga 1680acctggcagg gactatgagg tctcggtgca gagcctgcga ggccctgagg gcagcgaggc 1740ccggggcatc cgtgccagga cccccaccct ggcccccccg agacacctgg gcttctcaga 1800cgtgagccac gacgcggcac gagtgttctg ggagggtgcc ccgaggcctg tgcgcctggt 1860cagggtcacc tatgtgtcca gcgagggtgg acactcgggg cagacagagg ctcctgggaa 1920cgccacctcg gccacgctgg ggcctctctc ttcctccacc acctacactg tccgtgtcac 1980ctgcctctac cctgggggtg gctcctctac gctgactggc cgggtgacca ccaagaaagc 2040tcccagccca agccagctgt ccatgacgga gctgccaggg gatgcagtcc agctggcgtg 2100ggtggccgca gccccgtctg gcgtgcttgt ctaccagatc acgtggacgc ccctgggaga 2160ggggaaggct cacgagatct ctgtcccagg gaacctcggc acggccgtcc tgcctggcct 2220agggaggcac acagagtacg acgtcaccat cttggcctac tacagggacg gggcccgcag 2280tgaccctgtg tccctccgct ataccccctc cacggtgagc aggagcccac cctccaacct 2340ggccctggcc tcggagaccc ccgacagcct gcaggtcagc tggacgcccc cgcttggccg 2400cgtgctccat tactggctca cctacgcccc cgcctctggc ttgggacccg agaaatccgt 2460ctctgtgcca ggagccagga gccacgtgac actgcccgac ctgcaggcag ccacgaagta 2520cagggtcctg gtctcagcta tctatgcagc aggcaggagt gaggctgtgt ctgccacggg 2580ccagacagcc tgcccagccc tccgccctga cggctccctc ccagggtttg acctgatggt 2640ggccttcagc ctggtggaaa aggcttatgc gtccatccgg ggcgtggcca tggagccctc 2700tgccttcggt gggaccccga ccttcacgct cttcaaggac gcccagctga caagacgggt 2760cagtgacgtc tacccagccc ccctacctcc agagcacacc atcgtcttcc ttgtgcgcct 2820acttcccgag acaccccgtg aggccttcgc gctgtggcag atgacagccg aggacttcca 2880gcccctcctt ggggttctgc tggatgccgg gaagaagtcc ctgacctact tccaccgtga 2940ccccagggct gccttgcagg aggccacctt cgacccgcag gaagtgagga agattttctt 3000cgggagcttc cacaaggtgc acgtggctgt gggccgctcc aaggtcaggc tctatgtgga 3060ctgccggaag gtggctgagc ggccccttgg ggagatgggc agcccacccg ctgcgggctt 3120cgtcacgctg gggaggctgg ccaaggccag gggcccccgg agcagttcgg ccgcgtttca 3180gctccagatg ctgcagatcg tgtgcagtga cacctgggcc gatgaggacc ggtgctgtga 3240gctccctgcc tcgagggatg gagagacctg ccccgccttc gtgtctgcct gttcctgttc 3300ctcagagacc cctgggcccc caggacctca aggaccccca ggcctccctg ggaggaatgg 3360caccccagga gagcagggct tcccagggcc caggggtcca ccaggggtca aaggagagaa 3420gggagaccat gggcttccag gcttgcaggg ccaccccggc caccagggca tccccgggag 3480agttggcctc cagggaccaa agggaatgag aggcctggag ggaactgctg gcctgcctgg 3540acctggcccc agggggttcc agggcatggc aggggccagg ggcactagtg gagagcgagg 3600acctccaggg accgtggggc ccacaggact gccagggccc aaaggggaac gaggagagaa 3660gggcgagccg cagtcccttg ccaccctcta ccagcttgtg agccaggcct cacacgtgtc 3720aaagttcgac tccttccacg agaacaccag gccccccatg cccatcttgg agcagaagct 3780ggagccgggc actgagcccc tggggtcccc tggcacccgc agcaaggccc tggttcctgg 3840agaatggggg cgtggtggcc gccaccttga gggcagaggg gagcctggag ctgttggtca 3900gatgggcagc cctgggcagc agggggctag cacccagggc ctctgggagt gacaggacat 3960tttctgcact gccccgagga acgctgagcc ttcctccctg ggtttgtctg gacaccgaga 4020gcgaccacat cctggagaag ccaggagaaa agctcaggaa gagcctgcag gtggaaggag 4080agggaagcag cggcctcggc caaggcccac cccatactct tggctctgta gcatttccaa 4140gttcagataa acccctgagt gctcaccc 416827740PRTHomo sapiens 27Ala Val Arg Ala Glu Pro Gly Leu Glu Glu Leu Ser Ser Gly Leu Arg 1 5 10 15 Ala His Ser Pro Ser Ala Thr Thr Val Cys Glu Pro Glu Ala Gln Gly 20 25 30 Ser Ala Ser Gly Cys Arg Tyr Ala Ala His Pro His Trp Gly Leu Gly 35 40 45 Gly Ala Ala Ala Ala Gly Gly Ser Trp Glu Pro Gln Pro Pro Arg Pro 50 55 60 Val Cys Glu Pro Ala Gly Arg Gly Lys Pro His Pro Pro Ala Ala Pro 65 70 75 80 Arg Ser Pro Leu Leu Pro Gly Ser Arg Arg Arg Pro Gln Ala Ala Gln 85 90 95 Pro Gly Ala Arg Ala Arg Thr Ser Pro Pro Pro Ala Ser Ala Arg Asn 100 105 110 Met Ala Ala Arg Pro Ala Ala Thr Leu Ala Trp Ser Leu Leu Leu Leu 115 120 125 Ser Ser Ala Leu Leu Arg Glu Gly Cys Arg Ala Arg Phe Val Ala Glu 130 135 140 Arg Asp Ser Glu Asp Asp Gly Glu Glu Pro Val Val Phe Pro Glu Ser 145 150 155 160 Pro Leu Gln Ser Pro Thr Val Leu Val Ala Val Leu Ala Arg Asn Ala 165 170 175 Ala His Thr Leu Pro His Phe Leu Gly Cys Leu Glu Arg Leu Asp Tyr 180 185 190 Pro Lys Ser Arg Met Ala Ile Trp Ala Ala Thr Asp His Asn Val Asp 195 200 205 Asn Thr Thr Glu Ile Phe Arg Glu Trp Leu Lys Asn Val Gln Arg Leu 210 215 220 Tyr His Tyr Val Glu Trp Arg Pro Met Asp Glu Pro Glu Ser Tyr Pro 225 230 235 240 Asp Glu Ile Gly Pro Lys His Trp Pro Thr Ser Arg Phe Ala His Val 245 250 255 Met Lys Leu Arg Gln Ala Ala Leu Arg Thr Ala Arg Glu Lys Trp Ser 260 265 270 Asp Tyr Ile Leu Phe Ile Asp Val Asp Asn Phe Leu Thr Asn Pro Gln 275 280 285 Thr Leu Asn Leu Leu Ile Ala Glu Asn Lys Thr Ile Val Ala Pro Met 290 295 300 Leu Glu Ser Arg Gly Leu Tyr Ser Asn Phe Trp Cys Gly Ile Thr Pro 305 310 315 320 Lys Ala Lys Gly Phe Tyr Lys Arg Thr Pro Asp Tyr Val Gln Ile Arg 325 330 335 Glu Trp Lys Arg Thr Gly Cys Phe Pro Val Pro Met Val His Ser Thr 340 345 350 Phe Leu Ile Asp Leu Arg Lys Glu Ala Ser Asp Lys Leu Thr Phe Tyr 355 360 365 Pro Pro His Gln Asp Tyr Thr Trp Thr Phe Asp Asp Ile Ile Val Phe 370 375 380 Ala Phe Ser Ser Arg Gln Ala Gly Ile Gln Met Tyr Leu Cys Asn Arg 385 390 395 400 Glu His Tyr Gly Tyr Leu Pro Ile Pro Leu Lys Pro His Gln Thr Leu 405 410 415 Gln Glu Asp Ile Glu Asn Leu Ile His Val Gln Ile Glu Ala Met Ile 420 425 430 Asp Arg Pro Pro Met Glu Pro Ser Gln Tyr Val Ser Val Val Pro Lys 435 440 445 Tyr Pro Asp Lys Met Gly Phe Asp Glu Ile Phe Met Ile Asn Leu Lys 450 455 460 Arg Arg Lys Asp Arg Arg Asp Arg Met Leu Arg Thr Leu Tyr Glu Gln 465 470 475 480 Glu Ile Glu Val Lys Ile Val Glu Ala Val Asp Gly Lys Ala Leu Asn 485 490 495 Thr Ser Gln Leu Lys Ala Leu Asn Ile Glu Met Leu Pro Gly Tyr Arg 500 505 510 Asp Pro Tyr Ser Ser Arg Pro Leu Thr Arg Gly Glu Ile Gly Cys Phe 515 520 525 Leu Ser His Tyr Ser Val Trp Lys Glu Val Ile Asp Arg Glu Leu Glu 530 535 540 Lys Thr Leu Val Ile Glu Asp Asp Val Arg Phe Glu His Gln Phe Lys 545 550 555 560 Lys Lys Leu Met Lys Leu Met Asp Asn Ile Asp Gln Ala Gln Leu Asp 565 570 575 Trp Glu Leu Ile Tyr Ile Gly Arg Lys Arg Met Gln Val Lys Glu Pro 580 585 590 Glu Lys Ala Val Pro Asn Val Ala Asn Leu Val Glu Ala Asp Tyr Ser 595 600 605 Tyr Trp Thr Leu Gly Tyr Val Ile Ser Leu Glu Gly Ala Gln Lys Leu 610 615 620 Val Gly Ala Asn Pro Phe Gly Lys Met Leu Pro Val Asp Glu Phe Leu 625 630 635 640 Pro Val Met Tyr Asn Lys His Pro Val Ala Glu Tyr Lys Glu Tyr Tyr 645 650 655 Glu Ser Arg Asp Leu Lys Ala Phe Ser Ala Glu Pro Leu Leu Ile Tyr 660 665 670 Pro Thr His Tyr Thr Gly Gln Pro Gly Tyr Leu Ser Asp Thr Glu Thr 675 680 685 Ser Thr Ile Trp Asp Asn Glu Thr Val Ala Thr Asp Trp Asp Arg Thr 690 695 700 His Ala Trp Lys Ser Arg Lys Gln Ser Arg Ile Tyr Ser Asn Ala Lys 705 710 715 720 Asn Thr Glu Ala Leu Pro Pro Pro Thr Ser Leu Asp Thr Val Pro Ser 725 730 735 Arg Asp Glu Leu 740 28626PRTHomo sapiens 28Met Ala Ala Arg Pro Ala Ala Thr Leu Ala Trp Ser Leu Leu Leu Leu 1 5 10 15 Ser Ser Ala Leu Leu Arg Glu Gly Cys Arg Ala Arg Phe Val Ala Glu 20 25 30 Arg Asp Ser Glu Asp Asp Gly Glu Glu Pro Val Val Phe Pro Glu Ser 35 40 45 Pro Leu Gln Ser Pro Thr Val Leu Val Ala Val Leu Ala Arg Asn Ala 50 55 60 Ala His Thr Leu Pro His Phe Leu Gly Cys Leu Glu Arg Leu Asp Tyr 65 70 75 80 Pro Lys Ser Arg Met Ala Ile Trp Ala Ala Thr Asp His Asn Val Asp 85 90 95 Asn Thr Thr Glu Ile Phe Arg Glu Trp Leu Lys Asn Val Gln Arg Leu 100 105 110 Tyr His Tyr Val Glu Trp Arg Pro Met Asp Glu Pro Glu Ser Tyr Pro 115 120 125 Asp Glu Ile Gly Pro Lys His Trp Pro Thr Ser Arg Phe Ala His Val 130 135 140 Met Lys Leu Arg Gln Ala Ala Leu Arg Thr Ala Arg Glu Lys Trp Ser 145 150 155 160 Asp Tyr Ile Leu Phe Ile Asp Val Asp Asn Phe Leu Thr Asn Pro Gln 165 170 175 Thr Leu Asn Leu Leu Ile Ala Glu Asn Lys Thr Ile Val Ala Pro Met 180 185 190 Leu Glu Ser Arg Gly Leu Tyr Ser Asn Phe Trp Cys Gly Ile Thr Pro 195 200 205 Lys Gly Phe Tyr Lys Arg Thr Pro Asp Tyr Val Gln Ile Arg Glu Trp 210 215 220 Lys Arg Thr Gly Cys Phe Pro Val Pro Met Val His Ser Thr Phe Leu 225 230 235 240 Ile Asp Leu Arg Lys Glu Ala Ser Asp Lys Leu Thr Phe Tyr Pro Pro 245 250 255 His Gln Asp Tyr Thr Trp Thr Phe Asp Asp Ile Ile Val Phe Ala Phe 260 265 270 Ser Ser Arg Gln Ala Gly Ile Gln Met Tyr Leu Cys Asn Arg Glu His 275 280 285 Tyr Gly Tyr Leu Pro Ile Pro Leu Lys Pro His Gln Thr Leu Gln Glu 290 295 300 Asp Ile Glu Asn Leu Ile His Val Gln Ile Glu Ala Met Ile Asp Arg 305 310 315 320 Pro Pro Met Glu Pro Ser Gln Tyr Val Ser Val Val Pro Lys Tyr Pro 325 330 335 Asp Lys Met Gly Phe Asp Glu Ile Phe Met Ile Asn Leu Lys Arg Arg 340 345 350 Lys Asp Arg Arg Asp Arg Met Leu Arg Thr Leu Tyr Glu Gln Glu Ile 355 360 365 Glu Val Lys Ile Val Glu Ala Val Asp Gly Lys Ala Leu Asn Thr Ser 370 375 380 Gln Leu Lys Ala Leu Asn Ile Glu Met Leu Pro Gly Tyr Arg Asp Pro 385 390 395 400 Tyr Ser Ser Arg Pro Leu Thr Arg Gly Glu Ile Gly Cys Phe Leu Ser 405 410 415 His Tyr Ser Val Trp Lys Glu Val Ile Asp Arg Glu Leu Glu Lys Thr 420 425 430 Leu Val Ile Glu Asp Asp Val Arg Phe Glu His Gln Phe Lys Lys Lys 435 440 445 Leu Met Lys Leu Met Asp Asn Ile Asp Gln Ala Gln Leu Asp Trp Glu 450 455 460 Leu Ile Tyr Ile Gly Arg Lys Arg Met Gln Val Lys Glu Pro Glu Lys 465 470

475 480 Ala Val Pro Asn Val Ala Asn Leu Val Glu Ala Asp Tyr Ser Tyr Trp 485 490 495 Thr Leu Gly Tyr Val Ile Ser Leu Glu Gly Ala Gln Lys Leu Val Gly 500 505 510 Ala Asn Pro Phe Gly Lys Met Leu Pro Val Asp Glu Phe Leu Pro Val 515 520 525 Met Tyr Asn Lys His Pro Val Ala Glu Tyr Lys Glu Tyr Tyr Glu Ser 530 535 540 Arg Asp Leu Lys Ala Phe Ser Ala Glu Pro Leu Leu Ile Tyr Pro Thr 545 550 555 560 His Tyr Thr Gly Gln Pro Gly Tyr Leu Ser Asp Thr Glu Thr Ser Thr 565 570 575 Ile Trp Asp Asn Glu Thr Val Ala Thr Asp Trp Asp Arg Thr His Ala 580 585 590 Trp Lys Ser Arg Lys Gln Ser Arg Ile Tyr Ser Asn Ala Lys Asn Thr 595 600 605 Glu Ala Leu Pro Pro Pro Thr Ser Leu Asp Thr Val Pro Ser Arg Asp 610 615 620 Glu Leu 625 29363PRTHomo sapiens 29Met Phe Ser Asp Cys Trp Leu Cys Pro Ser Leu Gly Ser Ser Gly Ile 1 5 10 15 Gln Met Tyr Leu Cys Asn Arg Glu His Tyr Gly Tyr Leu Pro Ile Pro 20 25 30 Leu Lys Pro His Gln Thr Leu Gln Glu Asp Ile Glu Asn Leu Ile His 35 40 45 Val Gln Ile Glu Ala Met Ile Asp Arg Pro Pro Met Glu Pro Ser Gln 50 55 60 Tyr Val Ser Val Val Pro Lys Tyr Pro Asp Lys Met Gly Phe Asp Glu 65 70 75 80 Ile Phe Met Ile Asn Leu Lys Arg Arg Lys Asp Arg Arg Asp Arg Met 85 90 95 Leu Arg Thr Leu Tyr Glu Gln Glu Ile Glu Val Lys Ile Val Glu Ala 100 105 110 Val Asp Gly Lys Ala Leu Asn Thr Ser Gln Leu Lys Ala Leu Asn Ile 115 120 125 Glu Met Leu Pro Gly Tyr Arg Asp Pro Tyr Ser Ser Arg Pro Leu Thr 130 135 140 Arg Gly Glu Ile Gly Cys Phe Leu Ser His Tyr Ser Val Trp Lys Glu 145 150 155 160 Val Ile Asp Arg Glu Leu Glu Lys Thr Leu Val Ile Glu Asp Asp Val 165 170 175 Arg Phe Glu His Gln Phe Lys Lys Lys Leu Met Lys Leu Met Asp Asn 180 185 190 Ile Asp Gln Ala Gln Leu Asp Trp Glu Leu Ile Tyr Ile Gly Arg Lys 195 200 205 Arg Met Gln Val Lys Glu Pro Glu Lys Ala Val Pro Asn Val Ala Asn 210 215 220 Leu Val Glu Ala Asp Tyr Ser Tyr Trp Thr Leu Gly Tyr Val Ile Ser 225 230 235 240 Leu Glu Gly Ala Gln Lys Leu Val Gly Ala Asn Pro Phe Gly Lys Met 245 250 255 Leu Pro Val Asp Glu Phe Leu Pro Val Met Tyr Asn Lys His Pro Val 260 265 270 Ala Glu Tyr Lys Glu Tyr Tyr Glu Ser Arg Asp Leu Lys Ala Phe Ser 275 280 285 Ala Glu Pro Leu Leu Ile Tyr Pro Thr His Tyr Thr Gly Gln Pro Gly 290 295 300 Tyr Leu Ser Asp Thr Glu Thr Ser Thr Ile Trp Asp Asn Glu Thr Val 305 310 315 320 Ala Thr Asp Trp Asp Arg Thr His Ala Trp Lys Ser Arg Lys Gln Ser 325 330 335 Arg Ile Tyr Ser Asn Ala Lys Asn Thr Glu Ala Leu Pro Pro Pro Thr 340 345 350 Ser Leu Asp Thr Val Pro Ser Arg Asp Glu Leu 355 360 30234PRTHomo sapiens 30Met Leu Pro Gly Tyr Arg Asp Pro Tyr Ser Ser Arg Pro Leu Thr Arg 1 5 10 15 Gly Glu Ile Gly Cys Phe Leu Ser His Tyr Ser Val Trp Lys Glu Val 20 25 30 Ile Asp Arg Glu Leu Glu Lys Thr Leu Val Ile Glu Asp Asp Val Arg 35 40 45 Phe Glu His Gln Phe Lys Lys Lys Leu Met Lys Leu Met Asp Asn Ile 50 55 60 Asp Gln Ala Gln Leu Asp Trp Glu Leu Ile Tyr Ile Gly Arg Lys Arg 65 70 75 80 Met Gln Val Lys Glu Pro Glu Lys Ala Val Pro Asn Val Ala Asn Leu 85 90 95 Val Glu Ala Asp Tyr Ser Tyr Trp Thr Leu Gly Tyr Val Ile Ser Leu 100 105 110 Glu Gly Ala Gln Lys Leu Val Gly Ala Asn Pro Phe Gly Lys Met Leu 115 120 125 Pro Val Asp Glu Phe Leu Pro Val Met Tyr Asn Lys His Pro Val Ala 130 135 140 Glu Tyr Lys Glu Tyr Tyr Glu Ser Arg Asp Leu Lys Ala Phe Ser Ala 145 150 155 160 Glu Pro Leu Leu Ile Tyr Pro Thr His Tyr Thr Gly Gln Pro Gly Tyr 165 170 175 Leu Ser Asp Thr Glu Thr Ser Thr Ile Trp Asp Asn Glu Thr Val Ala 180 185 190 Thr Asp Trp Asp Arg Thr His Ala Trp Lys Ser Arg Lys Gln Ser Arg 195 200 205 Ile Tyr Ser Asn Ala Lys Asn Thr Glu Ala Leu Pro Pro Pro Thr Ser 210 215 220 Leu Asp Thr Val Pro Ser Arg Asp Glu Leu 225 230 315188DNAHomo sapiens 31cacaactcgc ggctctaggg aaggccacag acctcagctg tacgagcgga acctggacta 60gaggaacttt cctcaggact cagggcacac agcccctcgg ccactactgt ctgcgagccc 120gaggcgcagg ggagcgccag cggctgcagg tacgctgcgc acccgcactg gggcctgggc 180ggtgcagcgg cggcgggagg gtcctgggaa ccgcagccgc cgaggccagt gtgtgagccg 240gccggccgcg gcaagccgca tcccccggcc gcccctcgca gccccttgct ccccggcagc 300cgccgccgcc ctcaggcagc ccagccgggc gctcgcgcca ggacttcccc gccgcccgcc 360tcggcccgga acatggctgc gcgccctgct gccaccctcg cctggtcgct actgctcctc 420tcctcagccc tgctccgcga aggctgccga gcgcgcttcg tcgccgagcg ggactcggag 480gacgacggag aggagccggt ggttttcccg gagtcgcccc tgcagagccc cacggtgctc 540gtggcggtcc tcgcccgcaa cgcggcgcac acgctgccgc acttcctcgg ctgcctggag 600cggctggact accccaagag caggatggcc atctgggcag ccactgatca caatgtggat 660aatacaacag aaatattcag ggagtggttg aaaaatgtac agagactcta tcactatgtg 720gagtggaggc ctatggatga accagagtct taccctgatg aaattggacc aaagcactgg 780ccaacctccc ggtttgccca tgtgatgaaa ctacgacagg cagcccttcg aactgcgagg 840gaaaaatggt cagactacat tctgttcata gatgttgaca atttcctgac taatccacag 900accctcaatc tactgattgc agaaaacaaa actattgtgg cccccatgct ggagtctcgg 960ggcctgtatt ctaatttctg gtgcggaatc acccctaagg caaagggctt ctataagagg 1020accccagact acgttcagat tcgagaatgg aagaggacag gctgcttccc cgtccccatg 1080gtccactcca ccttcctaat tgacctcagg aaggaggcct cggacaagct gactttctac 1140cccccacacc aggactacac ctggaccttt gatgacatca ttgtctttgc cttctccagc 1200aggcaagcag gcatccagat gtacctctgc aacagagagc actatggcta cctgcccatc 1260cccctgaagc cccatcagac actgcaggaa gacatcgaga acctcatcca tgtgcagatt 1320gaagcaatga ttgaccgtcc tccaatggaa ccctcccagt atgtctcagt tgtccctaaa 1380tatccagaca agatgggatt tgatgagatt ttcatgataa acctcaaacg cagaaaggac 1440aggcgggacc ggatgctgcg cacactgtat gaacaggaga ttgaggtcaa gattgtcgag 1500gctgtggatg gaaaggcact caacacaagc cagctgaagg cactgaatat tgaaatgctg 1560cctggctatc gagatcccta ttcctccagg cctctaacaa ggggtgaaat cggctgcttt 1620ctcagccact actcagtctg gaaagaggta attgatcgag agctagagaa gactcttgta 1680attgaagacg atgtgcgttt tgagcatcag tttaagaaga agctgatgaa gctgatggat 1740aacattgacc aggctcagct ggactgggaa ctgatttata ttggtaggaa gaggatgcaa 1800gtaaaggagc cagagaaagc agtgcccaat gtggcaaacc tggtcgaagc cgactattcc 1860tactggaccc tgggctacgt catctctctg gaaggagcac agaagctggt tggagccaat 1920ccttttggga agatgctgcc agtggatgag tttctgccag tcatgtacaa caagcatccc 1980gtagccgagt acaaggagta ttatgaatcc agggacctga aagccttctc tgcagaaccc 2040ttgctcatct accctacgca ctacacaggc cagccggggt acctgagtga cacggagacc 2100tccaccatct gggacaatga gacagtggcc accgactggg ataggacaca tgcctggaag 2160tcccggaagc aaagccgcat ctacagcaat gccaagaaca cagaggccct gccaccgcca 2220acctccctgg acactgtgcc ttcaagggat gagctatgaa ggctccctgg gagtgtggcc 2280cacatcagtt caacatcctc tggtttttct aaagggctat tcatctgttt gctccagttt 2340tctgttttgt tttgttctta gtggtcacag tcatctaacc aaagtgatct agtgtgatag 2400atcgaaatta acatattttt gaccatggaa ggaaataagg aaattcaacc caaatttccc 2460aagacggctg aaagacaggt tttttggaaa ctgttaagat aaactgtaat ccagacacct 2520aattcttcag ttcactactc atgtgatact gattcccaca ttaaggttga acaacatggc 2580tcagagtctt gttcaagaga aagtgatcac cgagctgtca catcagcaaa tatgtagtca 2640aggcagccag gccaactaga ccacacttat tggtctagtt tgtccgtttt atatgacatt 2700gaaaacttgt gtgtgcaact tttgggggac aggaatcact taaaatcata tttatttggc 2760tttttattta aaggattctg tcacaagtct tattgaaaag tagatttttt aaaaaaaaaa 2820aatcttagtc cctgttatcc agtaggggtg ggtatttggg tccgactgag acttggcctg 2880tgaccatcat ggcagttgga gttctcatat agaggtgacc agtttgccat gtggatataa 2940tttagtagat atttgacagt ttgtgtaggt atttgaggga aaaaactcaa tgtttggctt 3000ttttattatg gccactcgag tcaggatgct ctatttataa agataaatgt aatatataaa 3060gggtgaggac tggctgtgca tcctgccctg tcccgggttt gcgcgctgct acagagcttc 3120acgctctccg ctccacccct tagcctggga acccaccgca ggtgtgagtt ctgtgagtca 3180ctgctaagag acagagcaca ttttcaggcc agcaactatc cttgccagag ttttttcatt 3240atattttgaa ttatttattt tacaaaatgg gcgaagatat tgtctttagg ataaggcaga 3300gaaacagatg ttgcagactt ccacggcacc cgggggagtg gtgggtgtgg acacattggt 3360tcggcaatct gattctcctg aatttcccag ccaggctctt gtggggaggc ctgtggatgg 3420ggggatttga actatttgga aacaaatgat tctctatctc aggtgagaaa cctggtcaga 3480aacaaagggc tggtcacctg atttaggcca gcaaccaggg aagctcttag aatcccaggc 3540ggacaccctt tctcaaaaga tatcccctaa gagtcctttc tgctttcttc acagattgat 3600tttatgtaaa atgcagagtt ggactacacg atttcttccc actccacaat ctgtcatcct 3660agtatagatc atggtggttt ccctcaagtt tatgttctca tgccctcaat ctgtaaattt 3720ttgtctccag aaaaaccctc ccaggcatcc cataccagca ccgttcctca tcactgtcca 3780tgcaccatgc agccatatgg ggggccgtgc acaccccaaa ccctgagctt cacacttaaa 3840ctcatgggga gggcccttca gagcagagtc cacaggcggg tggtgctaca tacacaagct 3900tagtgtacga gtgtaagata cactttaagc cagacaccta attcttcagt tcactgccca 3960tgtgatactg attgccacat taaggttgaa cagcatggct cagagtctag agaaagtgat 4020caccaagctg tctcatcacc aaataggtag tcaaggcagc ctcatctccc caggtgaggg 4080gcgggtcccc actttaggac aagaggcagc ttgccttcca ccagacgcca gcctcggcct 4140tccttcccga ctcactgtgg gtacccttct acactgacca gcaagctagg ccgctggagg 4200aaagggaact cacccaactc taaattgtgc cgcttagact tagctgtcag tgtgacttcc 4260tttcccaccc acccccagaa aaacagaaag agcatctggg gagcgagtga aaattcctta 4320ggtgattcct aagatttcct tgggtatctg gtttttgttt tcatatttga gtgtgtgcat 4380gtgtgcatga ctttaatgac ttttttaatg gggtgggagg tggctggggt gctggggttg 4440aaggaagttt gggttgattt ttgtggtgtt ttgtttaata gagaattttt tttttcctgt 4500tcccctgtca gctggtctga cagatttaag aactctcatt cttaaaagac tttggactta 4560aattctagca ttttagacta ggactgttct actgtgaaga aagttctgtc tcctttagcc 4620cggtttgttt ctccctgctc aggtctagaa tcccaagcag tgttcttttc tggtgaacac 4680tgtgagccgc agatgtgact ttttttttaa agtcatctct tcagcaatcc agaggttcct 4740tgacctcatt atttgtccta tctctccctt atagtcctaa gccaagacat ttgacctttg 4800acatttgacc tttgcagtgt catgtgaggg cgtcagtata gaggcctttg catctgggcc 4860tggcacccgc tctctgcctc tggaggctaa accctgtctg gatttctctt gggatctaac 4920gtgggatctt ctggacagac aaccgtgaca tcagcagtgc tggtgctgct gtgtgtggac 4980tgaacacctg cactttgcag aggacacgct gcatgggccc cgcttgcggt tcattcaggc 5040ctgctgcagg agctctgaga acaagaaaga gtggacaccc gttcccctgc atcatctgtc 5100ttgcgtgcta tttcagagtg gggaagtgat aaactatttg ccttctggag ctctttgtga 5160aaaattaaaa aaaaacttag ctcaaaga 5188325182DNAHomo sapiens 32cacaactcgc ggctctaggg aaggccacag acctcagctg tacgagcgga acctggacta 60gaggaacttt cctcaggact cagggcacac agcccctcgg ccactactgt ctgcgagccc 120gaggcgcagg ggagcgccag cggctgcagg tacgctgcgc acccgcactg gggcctgggc 180ggtgcagcgg cggcgggagg gtcctgggaa ccgcagccgc cgaggccagt gtgtgagccg 240gccggccgcg gcaagccgca tcccccggcc gcccctcgca gccccttgct ccccggcagc 300cgccgccgcc ctcaggcagc ccagccgggc gctcgcgcca ggacttcccc gccgcccgcc 360tcggcccgga acatggctgc gcgccctgct gccaccctcg cctggtcgct actgctcctc 420tcctcagccc tgctccgcga aggctgccga gcgcgcttcg tcgccgagcg ggactcggag 480gacgacggag aggagccggt ggttttcccg gagtcgcccc tgcagagccc cacggtgctc 540gtggcggtcc tcgcccgcaa cgcggcgcac acgctgccgc acttcctcgg ctgcctggag 600cggctggact accccaagag caggatggcc atctgggcag ccactgatca caatgtggat 660aatacaacag aaatattcag ggagtggttg aaaaatgtac agagactcta tcactatgtg 720gagtggaggc ctatggatga accagagtct taccctgatg aaattggacc aaagcactgg 780ccaacctccc ggtttgccca tgtgatgaaa ctacgacagg cagcccttcg aactgcgagg 840gaaaaatggt cagactacat tctgttcata gatgttgaca atttcctgac taatccacag 900accctcaatc tactgattgc agaaaacaaa actattgtgg cccccatgct ggagtctcgg 960ggcctgtatt ctaatttctg gtgcggaatc acccctaagg gcttctataa gaggacccca 1020gactacgttc agattcgaga atggaagagg acaggctgct tccccgtccc catggtccac 1080tccaccttcc taattgacct caggaaggag gcctcggaca agctgacttt ctacccccca 1140caccaggact acacctggac ctttgatgac atcattgtct ttgccttctc cagcaggcaa 1200gcaggcatcc agatgtacct ctgcaacaga gagcactatg gctacctgcc catccccctg 1260aagccccatc agacactgca ggaagacatc gagaacctca tccatgtgca gattgaagca 1320atgattgacc gtcctccaat ggaaccctcc cagtatgtct cagttgtccc taaatatcca 1380gacaagatgg gatttgatga gattttcatg ataaacctca aacgcagaaa ggacaggcgg 1440gaccggatgc tgcgcacact gtatgaacag gagattgagg tcaagattgt cgaggctgtg 1500gatggaaagg cactcaacac aagccagctg aaggcactga atattgaaat gctgcctggc 1560tatcgagatc cctattcctc caggcctcta acaaggggtg aaatcggctg ctttctcagc 1620cactactcag tctggaaaga ggtaattgat cgagagctag agaagactct tgtaattgaa 1680gacgatgtgc gttttgagca tcagtttaag aagaagctga tgaagctgat ggataacatt 1740gaccaggctc agctggactg ggaactgatt tatattggta ggaagaggat gcaagtaaag 1800gagccagaga aagcagtgcc caatgtggca aacctggtcg aagccgacta ttcctactgg 1860accctgggct acgtcatctc tctggaagga gcacagaagc tggttggagc caatcctttt 1920gggaagatgc tgccagtgga tgagtttctg ccagtcatgt acaacaagca tcccgtagcc 1980gagtacaagg agtattatga atccagggac ctgaaagcct tctctgcaga acccttgctc 2040atctacccta cgcactacac aggccagccg gggtacctga gtgacacgga gacctccacc 2100atctgggaca atgagacagt ggccaccgac tgggatagga cacatgcctg gaagtcccgg 2160aagcaaagcc gcatctacag caatgccaag aacacagagg ccctgccacc gccaacctcc 2220ctggacactg tgccttcaag ggatgagcta tgaaggctcc ctgggagtgt ggcccacatc 2280agttcaacat cctctggttt ttctaaaggg ctattcatct gtttgctcca gttttctgtt 2340ttgttttgtt cttagtggtc acagtcatct aaccaaagtg atctagtgtg atagatcgaa 2400attaacatat ttttgaccat ggaaggaaat aaggaaattc aacccaaatt tcccaagacg 2460gctgaaagac aggttttttg gaaactgtta agataaactg taatccagac acctaattct 2520tcagttcact actcatgtga tactgattcc cacattaagg ttgaacaaca tggctcagag 2580tcttgttcaa gagaaagtga tcaccgagct gtcacatcag caaatatgta gtcaaggcag 2640ccaggccaac tagaccacac ttattggtct agtttgtccg ttttatatga cattgaaaac 2700ttgtgtgtgc aacttttggg ggacaggaat cacttaaaat catatttatt tggcttttta 2760tttaaaggat tctgtcacaa gtcttattga aaagtagatt ttttaaaaaa aaaaaatctt 2820agtccctgtt atccagtagg ggtgggtatt tgggtccgac tgagacttgg cctgtgacca 2880tcatggcagt tggagttctc atatagaggt gaccagtttg ccatgtggat ataatttagt 2940agatatttga cagtttgtgt aggtatttga gggaaaaaac tcaatgtttg gcttttttat 3000tatggccact cgagtcagga tgctctattt ataaagataa atgtaatata taaagggtga 3060ggactggctg tgcatcctgc cctgtcccgg gtttgcgcgc tgctacagag cttcacgctc 3120tccgctccac cccttagcct gggaacccac cgcaggtgtg agttctgtga gtcactgcta 3180agagacagag cacattttca ggccagcaac tatccttgcc agagtttttt cattatattt 3240tgaattattt attttacaaa atgggcgaag atattgtctt taggataagg cagagaaaca 3300gatgttgcag acttccacgg cacccggggg agtggtgggt gtggacacat tggttcggca 3360atctgattct cctgaatttc ccagccaggc tcttgtgggg aggcctgtgg atggggggat 3420ttgaactatt tggaaacaaa tgattctcta tctcaggtga gaaacctggt cagaaacaaa 3480gggctggtca cctgatttag gccagcaacc agggaagctc ttagaatccc aggcggacac 3540cctttctcaa aagatatccc ctaagagtcc tttctgcttt cttcacagat tgattttatg 3600taaaatgcag agttggacta cacgatttct tcccactcca caatctgtca tcctagtata 3660gatcatggtg gtttccctca agtttatgtt ctcatgccct caatctgtaa atttttgtct 3720ccagaaaaac cctcccaggc atcccatacc agcaccgttc ctcatcactg tccatgcacc 3780atgcagccat atggggggcc gtgcacaccc caaaccctga gcttcacact taaactcatg 3840gggagggccc ttcagagcag agtccacagg cgggtggtgc tacatacaca agcttagtgt 3900acgagtgtaa gatacacttt aagccagaca cctaattctt cagttcactg cccatgtgat 3960actgattgcc acattaaggt tgaacagcat ggctcagagt ctagagaaag tgatcaccaa 4020gctgtctcat caccaaatag gtagtcaagg cagcctcatc tccccaggtg aggggcgggt 4080ccccacttta ggacaagagg cagcttgcct tccaccagac gccagcctcg gccttccttc 4140ccgactcact gtgggtaccc ttctacactg accagcaagc taggccgctg gaggaaaggg 4200aactcaccca actctaaatt gtgccgctta gacttagctg tcagtgtgac ttcctttccc 4260acccaccccc agaaaaacag aaagagcatc tggggagcga gtgaaaattc cttaggtgat 4320tcctaagatt tccttgggta tctggttttt gttttcatat ttgagtgtgt gcatgtgtgc 4380atgactttaa tgactttttt aatggggtgg gaggtggctg gggtgctggg gttgaaggaa 4440gtttgggttg atttttgtgg tgttttgttt aatagagaat tttttttttc ctgttcccct 4500gtcagctggt ctgacagatt taagaactct cattcttaaa agactttgga cttaaattct 4560agcattttag actaggactg ttctactgtg aagaaagttc tgtctccttt agcccggttt 4620gtttctccct gctcaggtct agaatcccaa gcagtgttct tttctggtga acactgtgag 4680ccgcagatgt gacttttttt ttaaagtcat ctcttcagca atccagaggt tccttgacct 4740cattatttgt cctatctctc ccttatagtc ctaagccaag acatttgacc tttgacattt 4800gacctttgca gtgtcatgtg agggcgtcag tatagaggcc tttgcatctg ggcctggcac 4860ccgctctctg cctctggagg

ctaaaccctg tctggatttc tcttgggatc taacgtggga 4920tcttctggac agacaaccgt gacatcagca gtgctggtgc tgctgtgtgt ggactgaaca 4980cctgcacttt gcagaggaca cgctgcatgg gccccgcttg cggttcattc aggcctgctg 5040caggagctct gagaacaaga aagagtggac acccgttccc ctgcatcatc tgtcttgcgt 5100gctatttcag agtggggaag tgataaacta tttgccttct ggagctcttt gtgaaaaatt 5160aaaaaaaaac ttagctcaaa ga 5182332325DNAHomo sapiens 33ttttcgtttt tttgttctat gtatgcctag atttagcttt aataatctgt cagcgttaca 60gaattttcct gtttgctttc atatcaaatg tggatttcat tcctcatttt cttgttattt 120cctgtaggtg gattacagtg ttccattctg ctctccagaa tgggaattcc ttgcctccct 180tttgatcctt tcatgccttt tttgttcatt ccaggccatg agtttgcttt ttcttcagag 240ggaaaaaatc aatttcctct gggctgaaga atagccgagt gtttttatta gtccccaatg 300ctgtgtttta cttttttaat aaaggagcat tttttaggtc tatagacata cctgaaacaa 360ccacttattt aattgcagtg tcaaactcag catcagacac agttcaggtt ccttctcctc 420gcttgtcact cagcagtatg ttttctgatt gttggctttg tccctctctt ggatcctcag 480gcatccagat gtacctctgc aacagagagc actatggcta cctgcccatc cccctgaagc 540cccatcagac actgcaggaa gacatcgaga acctcatcca tgtgcagatt gaagcaatga 600ttgaccgtcc tccaatggaa ccctcccagt atgtctcagt tgtccctaaa tatccagaca 660agatgggatt tgatgagatt ttcatgataa acctcaaacg cagaaaggac aggcgggacc 720ggatgctgcg cacactgtat gaacaggaga ttgaggtcaa gattgtcgag gctgtggatg 780gaaaggcact caacacaagc cagctgaagg cactgaatat tgaaatgctg cctggctatc 840gagatcccta ttcctccagg cctctaacaa ggggtgaaat cggctgcttt ctcagccact 900actcagtctg gaaagaggta attgatcgag agctagagaa gactcttgta attgaagacg 960atgtgcgttt tgagcatcag tttaagaaga agctgatgaa gctgatggat aacattgacc 1020aggctcagct ggactgggaa ctgatttata ttggtaggaa gaggatgcaa gtaaaggagc 1080cagagaaagc agtgcccaat gtggcaaacc tggtcgaagc cgactattcc tactggaccc 1140tgggctacgt catctctctg gaaggagcac agaagctggt tggagccaat ccttttggga 1200agatgctgcc agtggatgag tttctgccag tcatgtacaa caagcatccc gtagccgagt 1260acaaggagta ttatgaatcc agggacctga aagccttctc tgcagaaccc ttgctcatct 1320accctacgca ctacacaggc cagccggggt acctgagtga cacggagacc tccaccatct 1380gggacaatga gacagtggcc accgactggg ataggacaca tgcctggaag tcccggaagc 1440aaagccgcat ctacagcaat gccaagaaca cagaggccct gccaccgcca acctccctgg 1500acactgtgcc ttcaagggat gagctatgaa ggctccctgg gagtgtggcc cacatcagtt 1560caacatcctc tggtttttct aaagggctat tcatctgttt gctccagttt tctgttttgt 1620tttgttctta gtggtcacag tcatctaacc aaagtgatct agtgtgatag atcgaaatta 1680acatattttt gaccatggaa ggaaataagg aaattcaacc caaatttccc aagacggctg 1740aaagacaggt tttttggaaa ctgttaagat aaactgtaat ccagacacct aattcttcag 1800ttcactactc atgtgatact gattcccaca ttaaggttga acaacatggc tcagagtctt 1860gttcaagaga aagtgatcac cgagctgtca catcagcaaa tatgtagtca aggcagccag 1920gccaactaga ccacacttat tggtctagtt tgtccgtttt atatgacatt gaaaacttgt 1980gtgtgcaact tttgggggac aggaatcact taaaatcata tttatttggc tttttattta 2040aaggattctg tcacaagtct tattgaaaag tagatttttt aaaaaaaaaa aatcttagtc 2100cctgttatcc agtaggggtg ggtatttggg tccgactgag acttggcctg tgaccatcat 2160ggcagttgga gttctcatat agaggtgacc agtttgccat gtggatataa tttagtagat 2220atttgacagt ttgtgtaggt atttgaggga aaaaactcaa tgtttggctt ttttattatg 2280gccactcgag tcaggatgct ctatttataa agataaatgt aatat 2325344723DNAHomo sapiens 34ttttctattc ttttactttc aacctatttt tgtcttttaa tctaaaggct gcctcttata 60gataccacat ggttgtatta atttttaaaa cttactatgc caaattcttc cttcattgca 120gtgttttatt catttgcctt cactacattt ttttccagat ccttaaggtc agccagaggt 180gagagactag ggacttctca agtttttcct gggcgtgtgg acagccctga atatgcctgg 240ggccttctag attctcaggg atattttgga gcaaatcaaa ctccctatgg acctcttatt 300ccttggtttt tccttttaag ttttttggtg ggcttcttgt tagcatcagc tgataatgct 360tctcaaacag ctgtaatgtt aaacagttgt tgctcatggg ttttgacaaa tgacctgcag 420atagtgctgt tcacagaaaa tgaactctga gtcaggtcaa atcaagacaa gctctgagaa 480tgtagctttt caaggagcta ctagacaggt tgaatagaga ttattttctg ggggcaggtc 540tttaagggag tctcagtcca ttttgtccct gccagtggct tctgggttgg tggggtttgc 600cgctctctta gttgtgaggc tactggtttt caaggctttt gtagggctgg ggtgaacagg 660ttaggattag ggcagctgtg aacttcacaa agctcactgt gttaacagag attcagcagt 720aggtcttgat caaacactcc ttggattatt gcagtacttt aactaatttc cataggagtg 780tacttattaa atcttgtcaa aaaactgatg atgcttagaa ctatgttggg cactatgagg 840gatacacaaa aatatgatac cctcttgtct tcccagacct cataatacta gtgtgacaca 900ttagcctgga ggaacttcta aggaataaag ggaagcctct ctgtctttgt ctgtgtgggg 960gaggggtgtt tgcagaacta actctctctt tctctggcct ttgtggtggg gacctttgac 1020ttgttaatct ccagacgtgt tttatgtgat gcagggcact caacacaagc cagctgaagg 1080cactgaatat tgaaatgctg cctggctatc gagatcccta ttcctccagg cctctaacaa 1140ggggtgaaat cggctgcttt ctcagccact actcagtctg gaaagaggta attgatcgag 1200agctagagaa gactcttgta attgaagacg atgtgcgttt tgagcatcag tttaagaaga 1260agctgatgaa gctgatggat aacattgacc aggctcagct ggactgggaa ctgatttata 1320ttggtaggaa gaggatgcaa gtaaaggagc cagagaaagc agtgcccaat gtggcaaacc 1380tggtcgaagc cgactattcc tactggaccc tgggctacgt catctctctg gaaggagcac 1440agaagctggt tggagccaat ccttttggga agatgctgcc agtggatgag tttctgccag 1500tcatgtacaa caagcatccc gtagccgagt acaaggagta ttatgaatcc agggacctga 1560aagccttctc tgcagaaccc ttgctcatct accctacgca ctacacaggc cagccggggt 1620acctgagtga cacggagacc tccaccatct gggacaatga gacagtggcc accgactggg 1680ataggacaca tgcctggaag tcccggaagc aaagccgcat ctacagcaat gccaagaaca 1740cagaggccct gccaccgcca acctccctgg acactgtgcc ttcaagggat gagctatgaa 1800ggctccctgg gagtgtggcc cacatcagtt caacatcctc tggtttttct aaagggctat 1860tcatctgttt gctccagttt tctgttttgt tttgttctta gtggtcacag tcatctaacc 1920aaagtgatct agtgtgatag atcgaaatta acatattttt gaccatggaa ggaaataagg 1980aaattcaacc caaatttccc aagacggctg aaagacaggt tttttggaaa ctgttaagat 2040aaactgtaat ccagacacct aattcttcag ttcactactc atgtgatact gattcccaca 2100ttaaggttga acaacatggc tcagagtctt gttcaagaga aagtgatcac cgagctgtca 2160catcagcaaa tatgtagtca aggcagccag gccaactaga ccacacttat tggtctagtt 2220tgtccgtttt atatgacatt gaaaacttgt gtgtgcaact tttgggggac aggaatcact 2280taaaatcata tttatttggc tttttattta aaggattctg tcacaagtct tattgaaaag 2340tagatttttt aaaaaaaaaa aatcttagtc cctgttatcc agtaggggtg ggtatttggg 2400tccgactgag acttggcctg tgaccatcat ggcagttgga gttctcatat agaggtgacc 2460agtttgccat gtggatataa tttagtagat atttgacagt ttgtgtaggt atttgaggga 2520aaaaactcaa tgtttggctt ttttattatg gccactcgag tcaggatgct ctatttataa 2580agataaatgt aatatataaa gggtgaggac tggctgtgca tcctgccctg tcccgggttt 2640gcgcgctgct acagagcttc acgctctccg ctccacccct tagcctggga acccaccgca 2700ggtgtgagtt ctgtgagtca ctgctaagag acagagcaca ttttcaggcc agcaactatc 2760cttgccagag ttttttcatt atattttgaa ttatttattt tacaaaatgg gcgaagatat 2820tgtctttagg ataaggcaga gaaacagatg ttgcagactt ccacggcacc cgggggagtg 2880gtgggtgtgg acacattggt tcggcaatct gattctcctg aatttcccag ccaggctctt 2940gtggggaggc ctgtggatgg ggggatttga actatttgga aacaaatgat tctctatctc 3000aggtgagaaa cctggtcaga aacaaagggc tggtcacctg atttaggcca gcaaccaggg 3060aagctcttag aatcccaggc ggacaccctt tctcaaaaga tatcccctaa gagtcctttc 3120tgctttcttc acagattgat tttatgtaaa atgcagagtt ggactacacg atttcttccc 3180actccacaat ctgtcatcct agtatagatc atggtggttt ccctcaagtt tatgttctca 3240tgccctcaat ctgtaaattt ttgtctccag aaaaaccctc ccaggcatcc cataccagca 3300ccgttcctca tcactgtcca tgcaccatgc agccatatgg ggggccgtgc acaccccaaa 3360ccctgagctt cacacttaaa ctcatgggga gggcccttca gagcagagtc cacaggcggg 3420tggtgctaca tacacaagct tagtgtacga gtgtaagata cactttaagc cagacaccta 3480attcttcagt tcactgccca tgtgatactg attgccacat taaggttgaa cagcatggct 3540cagagtctag agaaagtgat caccaagctg tctcatcacc aaataggtag tcaaggcagc 3600ctcatctccc caggtgaggg gcgggtcccc actttaggac aagaggcagc ttgccttcca 3660ccagacgcca gcctcggcct tccttcccga ctcactgtgg gtacccttct acactgacca 3720gcaagctagg ccgctggagg aaagggaact cacccaactc taaattgtgc cgcttagact 3780tagctgtcag tgtgacttcc tttcccaccc acccccagaa aaacagaaag agcatctggg 3840gagcgagtga aaattcctta ggtgattcct aagatttcct tgggtatctg gtttttgttt 3900tcatatttga gtgtgtgcat gtgtgcatga ctttaatgac ttttttaatg gggtgggagg 3960tggctggggt gctggggttg aaggaagttt gggttgattt ttgtggtgtt ttgtttaata 4020gagaattttt tttttcctgt tcccctgtca gctggtctga cagatttaag aactctcatt 4080cttaaaagac tttggactta aattctagca ttttagacta ggactgttct actgtgaaga 4140aagttctgtc tcctttagcc cggtttgttt ctccctgctc aggtctagaa tcccaagcag 4200tgttcttttc tggtgaacac tgtgagccgc agatgtgact ttttttttaa agtcatctct 4260tcagcaatcc agaggttcct tgacctcatt atttgtccta tctctccctt atagtcctaa 4320gccaagacat ttgacctttg acatttgacc tttgcagtgt catgtgaggg cgtcagtata 4380gaggcctttg catctgggcc tggcacccgc tctctgcctc tggaggctaa accctgtctg 4440gatttctctt gggatctaac gtgggatctt ctggacagac aaccgtgaca tcagcagtgc 4500tggtgctgct gtgtgtggac tgaacacctg cactttgcag aggacacgct gcatgggccc 4560cgcttgcggt tcattcaggc ctgctgcagg agctctgaga acaagaaaga gtggacaccc 4620gttcccctgc atcatctgtc ttgcgtgcta tttcagagtg gggaagtgat aaactatttg 4680ccttctggag ctctttgtga aaaattaaaa aaaaacttag ctc 472335542PRTHomo sapiens 35Met Ala Gln Glu Ile Asp Leu Ser Ala Leu Lys Glu Leu Glu Arg Glu 1 5 10 15 Ala Ile Leu Gln Val Leu Tyr Arg Asp Gln Ala Val Gln Asn Thr Glu 20 25 30 Glu Glu Arg Thr Arg Lys Leu Lys Thr His Leu Gln His Leu Arg Trp 35 40 45 Lys Gly Ala Lys Asn Thr Asp Trp Glu His Lys Glu Lys Cys Cys Ala 50 55 60 Arg Cys Gln Gln Val Leu Gly Phe Leu Leu His Arg Gly Ala Val Cys 65 70 75 80 Arg Gly Cys Ser His Arg Val Cys Ala Gln Cys Arg Val Phe Leu Arg 85 90 95 Gly Thr His Ala Trp Lys Cys Thr Val Cys Phe Glu Asp Arg Asn Val 100 105 110 Lys Ile Lys Thr Gly Glu Trp Phe Tyr Glu Glu Arg Ala Lys Lys Phe 115 120 125 Pro Thr Gly Gly Lys His Glu Thr Val Gly Gly Gln Leu Leu Gln Ser 130 135 140 Tyr Gln Lys Leu Ser Lys Ile Ser Val Val Pro Pro Thr Pro Pro Pro 145 150 155 160 Val Ser Glu Ser Gln Cys Ser Arg Ser Pro Gly Arg Lys Val Ser Ala 165 170 175 Pro Asp Ile Leu Lys Pro Leu Asn Gln Glu Asp Pro Lys Cys Ser Thr 180 185 190 Asn Pro Ile Leu Lys Gln Gln Asn Leu Pro Ser Ser Pro Ala Pro Ser 195 200 205 Thr Ile Phe Ser Gly Gly Phe Arg His Gly Ser Leu Ile Ser Ile Asp 210 215 220 Ser Thr Cys Thr Glu Met Gly Asn Phe Asp Asn Ala Asn Val Thr Gly 225 230 235 240 Glu Ile Glu Phe Ala Ile His Tyr Cys Phe Lys Thr His Ser Leu Glu 245 250 255 Ile Cys Ile Lys Ala Cys Lys Asn Leu Ala Tyr Gly Glu Glu Lys Lys 260 265 270 Lys Lys Cys Asn Pro Tyr Val Lys Thr Tyr Leu Leu Pro Asp Arg Ser 275 280 285 Ser Gln Gly Lys Arg Lys Thr Gly Val Gln Arg Asn Thr Val Asp Pro 290 295 300 Thr Phe Gln Glu Thr Leu Lys Tyr Gln Val Ala Pro Ala Gln Leu Val 305 310 315 320 Thr Arg Gln Leu Gln Val Ser Val Trp His Leu Gly Thr Leu Ala Arg 325 330 335 Arg Val Phe Leu Gly Glu Val Ile Ile Pro Leu Ala Thr Trp Asp Phe 340 345 350 Glu Asp Ser Thr Thr Gln Ser Phe Arg Trp His Pro Leu Arg Ala Lys 355 360 365 Ala Glu Lys Tyr Glu Asp Ser Val Pro Gln Ser Asn Gly Glu Leu Thr 370 375 380 Val Arg Ala Lys Leu Val Leu Pro Ser Arg Pro Arg Lys Leu Gln Glu 385 390 395 400 Ala Gln Glu Gly Thr Asp Gln Pro Ser Leu His Gly Gln Leu Cys Leu 405 410 415 Val Val Leu Gly Ala Lys Asn Leu Pro Val Arg Pro Asp Gly Thr Leu 420 425 430 Asn Ser Phe Val Lys Gly Cys Leu Thr Leu Pro Asp Gln Gln Lys Leu 435 440 445 Arg Leu Lys Ser Pro Val Leu Arg Lys Gln Ala Cys Pro Gln Trp Lys 450 455 460 His Ser Phe Val Phe Ser Gly Val Thr Pro Ala Gln Leu Arg Gln Ser 465 470 475 480 Ser Leu Glu Leu Thr Val Trp Asp Gln Ala Leu Phe Gly Met Asn Asp 485 490 495 Arg Leu Leu Gly Gly Thr Arg Leu Gly Ser Lys Gly Asp Thr Ala Val 500 505 510 Gly Gly Asp Ala Cys Ser Leu Ser Lys Leu Gln Trp Gln Lys Val Leu 515 520 525 Ser Ser Pro Asn Leu Trp Thr Asp Met Thr Leu Val Leu His 530 535 540 36610PRTHomo sapiens 36Met Ala Gln Glu Ile Asp Leu Ser Ala Leu Lys Glu Leu Glu Arg Glu 1 5 10 15 Ala Ile Leu Gln Val Leu Tyr Arg Asp Gln Ala Val Gln Asn Thr Glu 20 25 30 Glu Glu Arg Thr Arg Lys Leu Lys Thr His Leu Gln His Leu Arg Trp 35 40 45 Lys Gly Ala Lys Asn Thr Asp Trp Glu His Lys Glu Lys Cys Cys Ala 50 55 60 Arg Cys Gln Gln Val Leu Gly Phe Leu Leu His Arg Gly Ala Val Cys 65 70 75 80 Arg Gly Cys Ser His Arg Val Cys Ala Gln Cys Arg Val Phe Leu Arg 85 90 95 Gly Thr His Ala Trp Lys Cys Thr Val Cys Phe Glu Asp Arg Asn Val 100 105 110 Lys Ile Lys Thr Gly Glu Trp Phe Tyr Glu Glu Arg Ala Lys Lys Phe 115 120 125 Pro Thr Gly Gly Lys His Glu Thr Val Gly Gly Gln Leu Leu Gln Ser 130 135 140 Tyr Gln Lys Leu Ser Lys Ile Ser Val Val Pro Pro Thr Pro Pro Pro 145 150 155 160 Val Ser Glu Ser Gln Cys Ser Arg Ser Pro Gly Arg Leu Gln Glu Phe 165 170 175 Gly Gln Phe Arg Gly Phe Asn Lys Ser Val Glu Asn Leu Phe Leu Ser 180 185 190 Leu Ala Thr His Val Lys Lys Leu Ser Lys Ser Gln Asn Asp Met Thr 195 200 205 Ser Glu Lys His Leu Leu Ala Thr Gly Pro Arg Gln Cys Val Gly Gln 210 215 220 Thr Glu Arg Arg Ser Gln Ser Asp Thr Ala Val Asn Val Thr Thr Arg 225 230 235 240 Lys Val Ser Ala Pro Asp Ile Leu Lys Pro Leu Asn Gln Glu Asp Pro 245 250 255 Lys Cys Ser Thr Asn Pro Ile Leu Lys Gln Gln Asn Leu Pro Ser Ser 260 265 270 Pro Ala Pro Ser Thr Ile Phe Ser Gly Gly Phe Arg His Gly Ser Leu 275 280 285 Ile Ser Ile Asp Ser Thr Cys Thr Glu Met Gly Asn Phe Asp Asn Ala 290 295 300 Asn Val Thr Gly Glu Ile Glu Phe Ala Ile His Tyr Cys Phe Lys Thr 305 310 315 320 His Ser Leu Glu Ile Cys Ile Lys Ala Cys Lys Asn Leu Ala Tyr Gly 325 330 335 Glu Glu Lys Lys Lys Lys Cys Asn Pro Tyr Val Lys Thr Tyr Leu Leu 340 345 350 Pro Asp Arg Ser Ser Gln Gly Lys Arg Lys Thr Gly Val Gln Arg Asn 355 360 365 Thr Val Asp Pro Thr Phe Gln Glu Thr Leu Lys Tyr Gln Val Ala Pro 370 375 380 Ala Gln Leu Val Thr Arg Gln Leu Gln Val Ser Val Trp His Leu Gly 385 390 395 400 Thr Leu Ala Arg Arg Val Phe Leu Gly Glu Val Ile Ile Pro Leu Ala 405 410 415 Thr Trp Asp Phe Glu Asp Ser Thr Thr Gln Ser Phe Arg Trp His Pro 420 425 430 Leu Arg Ala Lys Ala Glu Lys Tyr Glu Asp Ser Val Pro Gln Ser Asn 435 440 445 Gly Glu Leu Thr Val Arg Ala Lys Leu Val Leu Pro Ser Arg Pro Arg 450 455 460 Lys Leu Gln Glu Ala Gln Glu Gly Thr Asp Gln Pro Ser Leu His Gly 465 470 475 480 Gln Leu Cys Leu Val Val Leu Gly Ala Lys Asn Leu Pro Val Arg Pro 485 490 495 Asp Gly Thr Leu Asn Ser Phe Val Lys Gly Cys Leu Thr Leu Pro Asp 500 505 510 Gln Gln Lys Leu Arg Leu Lys Ser Pro Val Leu Arg Lys Gln Ala Cys 515 520 525 Pro Gln Trp Lys His Ser Phe Val Phe Ser Gly Val Thr Pro Ala Gln 530 535 540 Leu Arg Gln Ser Ser Leu Glu Leu Thr Val Trp Asp Gln Ala Leu Phe 545 550 555 560 Gly Met Asn Asp Arg Leu Leu Gly Gly Thr Arg Leu Gly Ser Lys Gly 565 570 575 Asp Thr Ala Val Gly Gly Asp Ala Cys Ser Leu Ser Lys Leu Gln Trp 580 585 590 Gln Lys Val Leu Ser Ser Pro Asn Leu Trp Thr Asp Met Thr Leu Val 595 600 605 Leu His 610 37542PRTHomo sapiens 37Met Ala Gln Glu Ile Asp Leu Ser

Ala Leu Lys Glu Leu Glu Arg Glu 1 5 10 15 Ala Ile Leu Gln Val Leu Tyr Arg Asp Gln Ala Val Gln Asn Thr Glu 20 25 30 Glu Glu Arg Thr Arg Lys Leu Lys Thr His Leu Gln His Leu Arg Trp 35 40 45 Lys Gly Ala Lys Asn Thr Asp Trp Glu His Lys Glu Lys Cys Cys Ala 50 55 60 Arg Cys Gln Gln Val Leu Gly Phe Leu Leu His Arg Gly Ala Val Cys 65 70 75 80 Arg Gly Cys Ser His Arg Val Cys Ala Gln Cys Arg Val Phe Leu Arg 85 90 95 Gly Thr His Ala Trp Lys Cys Thr Val Cys Phe Glu Asp Arg Asn Val 100 105 110 Lys Ile Lys Thr Gly Glu Trp Phe Tyr Glu Glu Arg Ala Lys Lys Phe 115 120 125 Pro Thr Gly Gly Lys His Glu Thr Val Gly Gly Gln Leu Leu Gln Ser 130 135 140 Tyr Gln Lys Leu Ser Lys Ile Ser Val Val Pro Pro Thr Pro Pro Pro 145 150 155 160 Val Ser Glu Ser Gln Cys Ser Arg Ser Pro Gly Arg Lys Val Ser Ala 165 170 175 Pro Asp Ile Leu Lys Pro Leu Asn Gln Glu Asp Pro Lys Cys Ser Thr 180 185 190 Asn Pro Ile Leu Lys Gln Gln Asn Leu Pro Ser Ser Pro Ala Pro Ser 195 200 205 Thr Ile Phe Ser Gly Gly Phe Arg His Gly Ser Leu Ile Ser Ile Asp 210 215 220 Ser Thr Cys Thr Glu Met Gly Asn Phe Asp Asn Ala Asn Val Thr Gly 225 230 235 240 Glu Ile Glu Phe Ala Ile His Tyr Cys Phe Lys Thr His Ser Leu Glu 245 250 255 Ile Cys Ile Lys Ala Cys Lys Asn Leu Ala Tyr Gly Glu Glu Lys Lys 260 265 270 Lys Lys Cys Asn Pro Tyr Val Lys Thr Tyr Leu Leu Pro Asp Arg Ser 275 280 285 Ser Gln Gly Lys Arg Lys Thr Gly Val Gln Arg Asn Thr Val Asp Pro 290 295 300 Thr Phe Gln Glu Thr Leu Lys Tyr Gln Val Ala Pro Ala Gln Leu Val 305 310 315 320 Thr Arg Gln Leu Gln Val Ser Val Trp His Leu Gly Thr Leu Ala Arg 325 330 335 Arg Val Phe Leu Gly Glu Val Ile Ile Pro Leu Ala Thr Trp Asp Phe 340 345 350 Glu Asp Ser Thr Thr Gln Ser Phe Arg Trp His Pro Leu Arg Ala Lys 355 360 365 Ala Glu Lys Tyr Glu Asp Ser Val Pro Gln Ser Asn Gly Glu Leu Thr 370 375 380 Val Arg Ala Lys Leu Val Leu Pro Ser Arg Pro Arg Lys Leu Gln Glu 385 390 395 400 Ala Gln Glu Gly Thr Asp Gln Pro Ser Leu His Gly Gln Leu Cys Leu 405 410 415 Val Val Leu Gly Ala Lys Asn Leu Pro Val Arg Pro Asp Gly Thr Leu 420 425 430 Asn Ser Phe Val Lys Gly Cys Leu Thr Leu Pro Asp Gln Gln Lys Leu 435 440 445 Arg Leu Lys Ser Pro Val Leu Arg Lys Gln Ala Cys Pro Gln Trp Lys 450 455 460 His Ser Phe Val Phe Ser Gly Val Thr Pro Ala Gln Leu Arg Gln Ser 465 470 475 480 Ser Leu Glu Leu Thr Val Trp Asp Gln Ala Leu Phe Gly Met Asn Asp 485 490 495 Arg Leu Leu Gly Gly Thr Arg Leu Gly Ser Lys Gly Asp Thr Ala Val 500 505 510 Gly Gly Asp Ala Cys Ser Leu Ser Lys Leu Gln Trp Gln Lys Val Leu 515 520 525 Ser Ser Pro Asn Leu Trp Thr Asp Met Thr Leu Val Leu His 530 535 540 382277DNAHomo sapiens 38aagaagaatt gacttccttc tctgcagagc cggctctggt ctcttctctt gaagcagatg 60cgaaggctcc ccgaatgaga aagaatactc ggaatcagcg gtgaattgca gtgatctttc 120agagaaagcg cctgttcaac tttgtcctct ctcagagact cagagccttg gggcactgag 180ggatgccagt tctgcctgtt catctggaac ctggatctaa ggagggaaga ggcgttgccc 240ctgctggcat agtcaggtat tgaacgggct gagcttttca tgatggttcc tgctgacctg 300gaaacatctt aaatggaagg gcgtgagcgc ttggtccatg cagtgaagct cttccaacct 360gggtcaacga aaacggagaa gaaatggccc aagaaataga tctgagtgct ctcaaggagt 420tagaacgcga ggccattctc caggtcctgt accgagacca ggcggttcaa aacacagagg 480aggagaggac acggaaactg aaaacacacc tgcagcatct ccggtggaaa ggagcgaaga 540acacggactg ggagcacaaa gagaagtgct gtgcgcgctg ccagcaggtg ctggggttcc 600tgctgcaccg gggcgccgtg tgccggggct gcagccaccg cgtgtgtgcc cagtgccgag 660tgttcctgag ggggacccat gcctggaagt gcacggtgtg cttcgaggac aggaatgtca 720aaataaaaac tggagaatgg ttctatgagg aacgagccaa gaaatttcca actggaggca 780aacatgagac agttggaggg cagctcttgc aatcttatca gaagctgagc aaaatttctg 840tggttcctcc tactccacct cctgtcagcg agagccagtg cagccgcagt cctggcagga 900aggtcagtgc accagatatt ctgaaacctc tcaatcaaga ggatcccaaa tgctctacta 960accctatttt gaagcaacag aatctcccat ccagtccggc acccagtacc atattctctg 1020gaggttttag acacggaagt ttaattagca ttgacagcac ctgtacagag atgggcaatt 1080ttgacaatgc taatgtcact ggagaaatag aatttgccat tcattattgc ttcaaaaccc 1140attctttaga aatatgcatc aaggcctgta agaaccttgc ctatggagaa gaaaagaaga 1200aaaagtgcaa tccgtatgtg aagacctacc tgttgcccga cagatcctcc cagggaaagc 1260gcaagactgg agtccaaagg aacaccgtgg acccgacctt tcaggagacc ttgaagtatc 1320aggtggcccc tgcccagctg gtgacccggc agctgcaggt ctcggtgtgg catctgggca 1380cgctggcccg gagagtgttt cttggagaag tgatcattcc tctggccacg tgggactttg 1440aagacagcac aacacagtcc ttccgctggc atccgctccg ggccaaggcg gagaaatacg 1500aagacagcgt tcctcagagt aatggagagc tcacagtccg ggctaagctg gttctccctt 1560cacggcccag aaaactccaa gaggctcaag aagggacaga tcagccatca cttcatggtc 1620aactttgttt ggtagtgcta ggagccaaga atttacctgt gcggccagat ggcaccttga 1680actcatttgt taagggctgt ctcactctgc cagaccaaca aaaactgaga ctgaagtcgc 1740cagtcctgag gaagcaggct tgcccccagt ggaaacactc atttgtcttc agtggcgtaa 1800ccccagctca gctgaggcag tcaagcttgg agttaactgt ctgggatcag gccctctttg 1860gaatgaacga ccgcttgctt ggaggaacca gacttggttc aaagggagac acagctgttg 1920gcggggatgc atgctcacta tcgaagctcc agtggcagaa agtcctttcc agccccaatc 1980tatggacaga catgactctt gtcctgcact gacatgaagg cctcaaggtt ccaggttgca 2040gcaggcgtga ggcactgtgc gtctgcagag gggctacgaa ccaggtgcag ggtcccagct 2100ggagacccct ttgaccttga gcagtctcca tctgcggccc tgtcccatgg cttaaccgcc 2160tattggtatc tgtgtatatt tacgttaaac acaattatgt tacctaagcc tctggtgggt 2220tatctcctct ttgagatgta gaaaatggcc agattttaat aaacgttgtt acccatg 2277392131DNAHomo sapiens 39aggagggaag aggcgttgcc cctgctggca tagtcaggta ccagcccagc caggtattga 60acgggctgag cttttcatga tggttcctgc tgacctggaa acatcttaaa tggaagggcg 120tgagcgcttg gtccatgcag tgaagctctt ccaacctggg tcaacgaaaa cggagaagaa 180atggcccaag aaatagatct gagtgctctc aaggagttag aacgcgaggc cattctccag 240gtcctgtacc gagaccaggc ggttcaaaac acagaggagg agaggacacg gaaactgaaa 300acacacctgc agcatctccg gtggaaagga gcgaagaaca cggactggga gcacaaagag 360aagtgctgtg cgcgctgcca gcaggtgctg gggttcctgc tgcaccgggg cgccgtgtgc 420cggggctgca gccaccgcgt gtgtgcccag tgccgagtgt tcctgagggg gacccatgcc 480tggaagtgca cggtgtgctt cgaggacagg aatgtcaaaa taaaaactgg agaatggttc 540tatgaggaac gagccaagaa atttccaact ggaggcaaac atgagacagt tggagggcag 600ctcttgcaat cttatcagaa gctgagcaaa atttctgtgg ttcctcctac tccacctcct 660gtcagcgaga gccagtgcag ccgcagtcct ggcaggttac aggaatttgg tcagtttaga 720ggatttaata agtccgtgga aaatttgttt ctgtctcttg ctacccacgt gaaaaagctc 780tccaaatccc agaatgatat gacttctgag aagcatcttc tcgccacggg ccccaggcag 840tgtgtgggac agacagagag acggagccag tctgacactg cggtcaacgt caccaccagg 900aaggtcagtg caccagatat tctgaaacct ctcaatcaag aggatcccaa atgctctact 960aaccctattt tgaagcaaca gaatctccca tccagtccgg cacccagtac catattctct 1020ggaggtttta gacacggaag tttaattagc attgacagca cctgtacaga gatgggcaat 1080tttgacaatg ctaatgtcac tggagaaata gaatttgcca ttcattattg cttcaaaacc 1140cattctttag aaatatgcat caaggcctgt aagaaccttg cctatggaga agaaaagaag 1200aaaaagtgca atccgtatgt gaagacctac ctgttgcccg acagatcctc ccagggaaag 1260cgcaagactg gagtccaaag gaacaccgtg gacccgacct ttcaggagac cttgaagtat 1320caggtggccc ctgcccagct ggtgacccgg cagctgcagg tctcggtgtg gcatctgggc 1380acgctggccc ggagagtgtt tcttggagaa gtgatcattc ctctggccac gtgggacttt 1440gaagacagca caacacagtc cttccgctgg catccgctcc gggccaaggc ggagaaatac 1500gaagacagcg ttcctcagag taatggagag ctcacagtcc gggctaagct ggttctccct 1560tcacggccca gaaaactcca agaggctcaa gaagggacag atcagccatc acttcatggt 1620caactttgtt tggtagtgct aggagccaag aatttacctg tgcggccaga tggcaccttg 1680aactcatttg ttaagggctg tctcactctg ccagaccaac aaaaactgag actgaagtcg 1740ccagtcctga ggaagcaggc ttgcccccag tggaaacact catttgtctt cagtggcgta 1800accccagctc agctgaggca gtcaagcttg gagttaactg tctgggatca ggccctcttt 1860ggaatgaacg accgcttgct tggaggaacc agacttggtt caaagggaga cacagctgtt 1920ggcggggatg catgctcact atcgaagctc cagtggcaga aagtcctttc cagccccaat 1980ctatggacag acatgactct tgtcctgcac tgacatgaag gcctcaaggt tccaggttgc 2040agcaggcgtg aggcactgtg cgtctgcaga ggggctacga accaggtgca gggtcccagc 2100tggagacccc tttgaccttg agcagtctcc a 2131401927DNAHomo sapiens 40aggagggaag aggcgttgcc cctgctggca tagtcaggta ccagcccagc caggtattga 60acgggctgag cttttcatga tggttcctgc tgacctggaa acatcttaaa tggaagggcg 120tgagcgcttg gtccatgcag tgaagctctt ccaacctggg tcaacgaaaa cggagaagaa 180atggcccaag aaatagatct gagtgctctc aaggagttag aacgcgaggc cattctccag 240gtcctgtacc gagaccaggc ggttcaaaac acagaggagg agaggacacg gaaactgaaa 300acacacctgc agcatctccg gtggaaagga gcgaagaaca cggactggga gcacaaagag 360aagtgctgtg cgcgctgcca gcaggtgctg gggttcctgc tgcaccgggg cgccgtgtgc 420cggggctgca gccaccgcgt gtgtgcccag tgccgagtgt tcctgagggg gacccatgcc 480tggaagtgca cggtgtgctt cgaggacagg aatgtcaaaa taaaaactgg agaatggttc 540tatgaggaac gagccaagaa atttccaact ggaggcaaac atgagacagt tggagggcag 600ctcttgcaat cttatcagaa gctgagcaaa atttctgtgg ttcctcctac tccacctcct 660gtcagcgaga gccagtgcag ccgcagtcct ggcaggaagg tcagtgcacc agatattctg 720aaacctctca atcaagagga tcccaaatgc tctactaacc ctattttgaa gcaacagaat 780ctcccatcca gtccggcacc cagtaccata ttctctggag gttttagaca cggaagttta 840attagcattg acagcacctg tacagagatg ggcaattttg acaatgctaa tgtcactgga 900gaaatagaat ttgccattca ttattgcttc aaaacccatt ctttagaaat atgcatcaag 960gcctgtaaga accttgccta tggagaagaa aagaagaaaa agtgcaatcc gtatgtgaag 1020acctacctgt tgcccgacag atcctcccag ggaaagcgca agactggagt ccaaaggaac 1080accgtggacc cgacctttca ggagaccttg aagtatcagg tggcccctgc ccagctggtg 1140acccggcagc tgcaggtctc ggtgtggcat ctgggcacgc tggcccggag agtgtttctt 1200ggagaagtga tcattcctct ggccacgtgg gactttgaag acagcacaac acagtccttc 1260cgctggcatc cgctccgggc caaggcggag aaatacgaag acagcgttcc tcagagtaat 1320ggagagctca cagtccgggc taagctggtt ctcccttcac ggcccagaaa actccaagag 1380gctcaagaag ggacagatca gccatcactt catggtcaac tttgtttggt agtgctagga 1440gccaagaatt tacctgtgcg gccagatggc accttgaact catttgttaa gggctgtctc 1500actctgccag accaacaaaa actgagactg aagtcgccag tcctgaggaa gcaggcttgc 1560ccccagtgga aacactcatt tgtcttcagt ggcgtaaccc cagctcagct gaggcagtca 1620agcttggagt taactgtctg ggatcaggcc ctctttggaa tgaacgaccg cttgcttgga 1680ggaaccagac ttggttcaaa gggagacaca gctgttggcg gggatgcatg ctcactatcg 1740aagctccagt ggcagaaagt cctttccagc cccaatctat ggacagacat gactcttgtc 1800ctgcactgac atgaaggcct caaggttcca ggttgcagca ggcgtgaggc actgtgcgtc 1860tgcagagggg ctacgaacca ggtgcagggt cccagctgga gacccctttg accttgagca 1920gtctcca 192741406PRTHomo sapiens 41Met Asp Cys Arg Thr Lys Ala Asn Pro Asp Arg Thr Phe Asp Leu Val 1 5 10 15 Leu Lys Val Lys Cys His Ala Ser Glu Asn Glu Asp Pro Val Val Leu 20 25 30 Trp Lys Phe Pro Glu Asp Phe Gly Asp Gln Glu Ile Leu Gln Ser Val 35 40 45 Pro Lys Phe Cys Phe Pro Phe Asp Val Glu Arg Val Ser Gln Asn Gln 50 55 60 Val Gly Gln His Phe Thr Phe Val Leu Thr Asp Ile Glu Ser Lys Gln 65 70 75 80 Arg Phe Gly Phe Cys Arg Leu Thr Ser Gly Gly Thr Ile Cys Leu Cys 85 90 95 Ile Leu Ser Tyr Leu Pro Trp Phe Glu Val Tyr Tyr Lys Leu Leu Asn 100 105 110 Thr Leu Ala Asp Tyr Leu Ala Lys Glu Leu Glu Asn Asp Leu Asn Glu 115 120 125 Thr Leu Arg Ser Leu Tyr Asn His Pro Val Pro Lys Ala Asn Thr Pro 130 135 140 Val Asn Leu Ser Val His Ser Tyr Phe Ile Ala Pro Asp Val Thr Gly 145 150 155 160 Leu Pro Thr Ile Pro Glu Ser Arg Asn Leu Thr Glu Tyr Phe Val Ala 165 170 175 Val Asp Val Asn Asn Met Leu Gln Leu Tyr Ala Ser Met Leu His Glu 180 185 190 Arg Arg Ile Val Ile Ile Ser Ser Lys Leu Ser Thr Leu Thr Ala Cys 195 200 205 Ile His Gly Ser Ala Ala Leu Leu Tyr Pro Met Tyr Trp Gln His Ile 210 215 220 Tyr Ile Pro Val Leu Pro Pro His Leu Leu Asp Tyr Cys Cys Ala Pro 225 230 235 240 Met Pro Tyr Leu Ile Gly Ile His Ser Ser Leu Ile Glu Arg Val Lys 245 250 255 Asn Lys Ser Leu Glu Asp Val Val Met Leu Asn Val Asp Thr Asn Thr 260 265 270 Leu Glu Ser Pro Phe Ser Asp Leu Asn Asn Leu Pro Ser Asp Val Val 275 280 285 Ser Ala Leu Lys Asn Lys Leu Lys Lys Gln Ser Thr Ala Thr Gly Asp 290 295 300 Gly Val Ala Arg Ala Phe Leu Arg Ala Gln Ala Ala Leu Phe Gly Ser 305 310 315 320 Tyr Arg Asp Ala Leu Arg Tyr Lys Pro Gly Glu Pro Ile Thr Phe Cys 325 330 335 Glu Glu Ser Phe Val Lys His Arg Ser Ser Val Met Lys Gln Phe Leu 340 345 350 Glu Thr Ala Ile Asn Leu Gln Leu Phe Lys Gln Phe Ile Asp Gly Arg 355 360 365 Leu Ala Lys Leu Asn Ala Gly Arg Gly Phe Ser Asp Val Phe Glu Glu 370 375 380 Glu Ile Thr Ser Gly Gly Phe Cys Gly Gly Lys Asp Lys Leu Gln Tyr 385 390 395 400 Asp Tyr Pro Phe Ser Gln 405 42426PRTHomo sapiens 42Met Asp Cys Arg Thr Lys Ala Asn Pro Asp Arg Thr Phe Asp Leu Val 1 5 10 15 Leu Lys Val Lys Cys His Ala Ser Glu Asn Glu Asp Pro Val Val Leu 20 25 30 Trp Lys Phe Pro Glu Asp Phe Gly Asp Gln Glu Ile Leu Gln Ser Val 35 40 45 Pro Lys Phe Cys Phe Pro Phe Asp Val Glu Arg Val Ser Gln Asn Gln 50 55 60 Val Gly Gln His Phe Thr Phe Val Leu Thr Asp Ile Glu Ser Lys Gln 65 70 75 80 Arg Phe Gly Phe Cys Arg Leu Thr Ser Gly Gly Thr Ile Cys Leu Cys 85 90 95 Ile Leu Ser Tyr Leu Pro Trp Phe Glu Val Tyr Tyr Lys Leu Leu Asn 100 105 110 Thr Leu Ala Asp Tyr Leu Ala Lys Glu Leu Glu Asn Asp Leu Asn Glu 115 120 125 Thr Leu Arg Ser Leu Tyr Asn His Pro Val Pro Lys Ala Asn Thr Pro 130 135 140 Val Asn Leu Ser Val Asn Gln Glu Ile Phe Ile Ala Cys Glu Gln Val 145 150 155 160 Leu Lys Asp Gln Pro Ala Leu Val Pro His Ser Tyr Phe Ile Ala Pro 165 170 175 Asp Val Thr Gly Leu Pro Thr Ile Pro Glu Ser Arg Asn Leu Thr Glu 180 185 190 Tyr Phe Val Ala Val Asp Val Asn Asn Met Leu Gln Leu Tyr Ala Ser 195 200 205 Met Leu His Glu Arg Arg Ile Val Ile Ile Ser Ser Lys Leu Ser Thr 210 215 220 Leu Thr Ala Cys Ile His Gly Ser Ala Ala Leu Leu Tyr Pro Met Tyr 225 230 235 240 Trp Gln His Ile Tyr Ile Pro Val Leu Pro Pro His Leu Leu Asp Tyr 245 250 255 Cys Cys Ala Pro Met Pro Tyr Leu Ile Gly Ile His Ser Ser Leu Ile 260 265 270 Glu Arg Val Lys Asn Lys Ser Leu Glu Asp Val Val Met Leu Asn Val 275 280 285 Asp Thr Asn Thr Leu Glu Ser Pro Phe Ser Asp Leu Asn Asn Leu Pro 290 295 300 Ser Asp Val Val Ser Ala Leu Lys Asn Lys Leu Lys Lys Gln Ser Thr 305 310 315 320 Ala Thr Gly Asp Gly Val Ala Arg Ala Phe Leu Arg Ala Gln Ala Ala 325 330 335 Leu Phe Gly Ser Tyr Arg Asp Ala Leu Arg Tyr Lys Pro Gly Glu Pro 340 345 350 Ile Thr Phe Cys Glu Glu Ser Phe Val Lys His Arg Ser Ser Val Met 355

360 365 Lys Gln Phe Leu Glu Thr Ala Ile Asn Leu Gln Leu Phe Lys Gln Phe 370 375 380 Ile Asp Gly Arg Leu Ala Lys Leu Asn Ala Gly Arg Gly Phe Ser Asp 385 390 395 400 Val Phe Glu Glu Glu Ile Thr Ser Gly Gly Phe Cys Gly Gly Lys Asp 405 410 415 Lys Leu Gln Tyr Asp Tyr Pro Phe Ser Gln 420 425 43426PRTHomo sapiens 43Ile Glu Thr Lys Thr Arg Ala Asn Pro Asp Arg Thr Phe Asp Leu Val 1 5 10 15 Leu Lys Val Lys Cys His Ala Ser Glu Asn Glu Asp Pro Val Val Leu 20 25 30 Trp Lys Phe Pro Glu Asp Phe Gly Asp Gln Glu Ile Leu Gln Ser Val 35 40 45 Pro Lys Phe Cys Phe Pro Phe Asp Val Glu Arg Val Ser Gln Asn Gln 50 55 60 Val Gly Gln His Phe Thr Phe Val Leu Thr Asp Ile Glu Ser Lys Gln 65 70 75 80 Arg Phe Gly Phe Cys Arg Leu Thr Ser Gly Gly Thr Ile Cys Leu Cys 85 90 95 Ile Leu Ser Tyr Leu Pro Trp Phe Glu Val Tyr Tyr Lys Leu Leu Asn 100 105 110 Thr Leu Ala Asp Tyr Leu Ala Lys Glu Leu Glu Asn Asp Leu Asn Glu 115 120 125 Thr Leu Arg Ser Leu Tyr Asn His Pro Val Pro Lys Ala Asn Thr Pro 130 135 140 Val Asn Leu Ser Val Asn Gln Glu Ile Phe Ile Ala Cys Glu Gln Val 145 150 155 160 Leu Lys Asp Gln Pro Ala Leu Val Pro His Ser Tyr Phe Ile Ala Pro 165 170 175 Asp Val Thr Gly Leu Pro Thr Ile Pro Glu Ser Arg Asn Leu Thr Glu 180 185 190 Tyr Phe Val Ala Val Asp Val Asn Asn Met Leu Gln Leu Tyr Ala Ser 195 200 205 Met Leu His Glu Arg Arg Ile Val Ile Ile Ser Ser Lys Leu Ser Thr 210 215 220 Leu Thr Ala Cys Ile His Gly Ser Ala Ala Leu Leu Tyr Pro Met Tyr 225 230 235 240 Trp Gln His Ile Tyr Ile Pro Val Leu Pro Pro His Leu Leu Asp Tyr 245 250 255 Cys Cys Ala Pro Met Pro Tyr Leu Ile Gly Ile His Ser Ser Leu Ile 260 265 270 Glu Arg Val Lys Asn Lys Ser Leu Glu Asp Val Val Met Leu Asn Val 275 280 285 Asp Thr Asn Thr Leu Glu Ser Pro Phe Ser Asp Leu Asn Asn Leu Pro 290 295 300 Ser Asp Val Val Ser Ala Leu Lys Asn Lys Leu Lys Lys Gln Ser Thr 305 310 315 320 Ala Thr Gly Asp Gly Val Ala Arg Ala Phe Leu Arg Ala Gln Ala Ala 325 330 335 Leu Phe Gly Ser Tyr Arg Asp Ala Leu Arg Tyr Lys Pro Gly Glu Pro 340 345 350 Ile Thr Phe Cys Glu Glu Ser Phe Val Lys His Arg Ser Ser Val Met 355 360 365 Lys Gln Phe Leu Glu Thr Ala Ile Asn Leu Gln Leu Phe Lys Gln Phe 370 375 380 Ile Asp Gly Arg Leu Ala Lys Leu Asn Ala Gly Arg Gly Phe Ser Asp 385 390 395 400 Val Phe Glu Glu Glu Ile Thr Ser Gly Gly Phe Cys Gly Gly Lys Asp 405 410 415 Lys Leu Gln Tyr Asp Tyr Pro Phe Ser Gln 420 425 44396PRTHomo sapiens 44Met Ala Ala Ala Pro Arg Glu Glu Lys Arg Trp Pro Gln Pro Val Phe 1 5 10 15 Ser Asn Pro Val Val Leu Trp Lys Phe Pro Glu Asp Phe Gly Asp Gln 20 25 30 Glu Ile Leu Gln Ser Val Pro Lys Phe Cys Phe Pro Phe Asp Val Glu 35 40 45 Arg Val Ser Gln Asn Gln Val Gly Gln His Phe Thr Phe Val Leu Thr 50 55 60 Asp Ile Glu Ser Lys Gln Arg Phe Gly Phe Cys Arg Leu Thr Ser Gly 65 70 75 80 Gly Thr Ile Cys Leu Cys Ile Leu Ser Tyr Leu Pro Trp Phe Glu Val 85 90 95 Tyr Tyr Lys Leu Leu Asn Thr Leu Ala Asp Tyr Leu Ala Lys Glu Leu 100 105 110 Glu Asn Asp Leu Asn Glu Thr Leu Arg Ser Leu Tyr Asn His Pro Val 115 120 125 Pro Lys Ala Asn Thr Pro Val Asn Leu Ser Val His Ser Tyr Phe Ile 130 135 140 Ala Pro Asp Val Thr Gly Leu Pro Thr Ile Pro Glu Ser Arg Asn Leu 145 150 155 160 Thr Glu Tyr Phe Val Ala Val Asp Val Asn Asn Met Leu Gln Leu Tyr 165 170 175 Ala Ser Met Leu His Glu Arg Arg Ile Val Ile Ile Ser Ser Lys Leu 180 185 190 Ser Thr Leu Thr Ala Cys Ile His Gly Ser Ala Ala Leu Leu Tyr Pro 195 200 205 Met Tyr Trp Gln His Ile Tyr Ile Pro Val Leu Pro Pro His Leu Leu 210 215 220 Asp Tyr Cys Cys Ala Pro Met Pro Tyr Leu Ile Gly Ile His Ser Ser 225 230 235 240 Leu Ile Glu Arg Val Lys Asn Lys Ser Leu Glu Asp Val Val Met Leu 245 250 255 Asn Val Asp Thr Asn Thr Leu Glu Ser Pro Phe Ser Asp Leu Asn Asn 260 265 270 Leu Pro Ser Asp Val Val Ser Ala Leu Lys Asn Lys Leu Lys Lys Gln 275 280 285 Ser Thr Ala Thr Gly Asp Gly Val Ala Arg Ala Phe Leu Arg Ala Gln 290 295 300 Ala Ala Leu Phe Gly Ser Tyr Arg Asp Ala Leu Arg Tyr Lys Pro Gly 305 310 315 320 Glu Pro Ile Thr Phe Cys Glu Glu Ser Phe Val Lys His Arg Ser Ser 325 330 335 Val Met Lys Gln Phe Leu Glu Thr Ala Ile Asn Leu Gln Leu Phe Lys 340 345 350 Gln Phe Ile Asp Gly Arg Leu Ala Lys Leu Asn Ala Gly Arg Gly Phe 355 360 365 Ser Asp Val Phe Glu Glu Glu Ile Thr Ser Gly Gly Phe Cys Gly Gly 370 375 380 Lys Asp Lys Leu Gln Tyr Asp Tyr Pro Phe Ser Gln 385 390 395 452117DNAHomo sapiens 45gccgggggcg cagccgacat gggcccgccg ccacggctgc tgtgagcagc ctctttccct 60gtgtggccgc cggcgtgggc ggggacggcg cgaccctcgc gcggccgggc tgcgggcttc 120caggccagcg cgcgggggcc ggacggacag ccccacaccg acatgtaacc atggactgca 180ggaccaaggc aaatccagac agaacctttg acttggtgtt gaaagtgaaa tgtcatgcct 240ctgaaaatga agatcctgtg gtattgtgga aattcccaga ggactttgga gaccaggaaa 300tactacagag tgtgccaaag ttctgttttc cctttgacgt tgaaagggtg tctcagaatc 360aagttggaca gcactttacc tttgtactga cagacattga aagtaaacag agatttggat 420tctgcagact gacgtcagga ggcacaattt gtttatgcat ccttagttac cttccctggt 480ttgaagtgta ttacaagctt ctaaatactc ttgcagatta cttggctaag gaactggaaa 540atgatttgaa tgaaactctc agatcactgt ataaccaccc agtaccaaag gcaaatactc 600ctgtaaattt gagtgtgcat tcctacttca ttgcccctga tgtaactgga ctcccaacaa 660tacccgagag tagaaatctt acagaatatt ttgttgccgt ggatgtgaac aacatgctgc 720agctgtatgc cagtatgctg catgaaaggc gcatcgtgat tatctcgagc aaattaagca 780ctttaactgc ctgtatccat ggatcagctg ctcttctata cccaatgtat tggcaacaca 840tatacatccc agtgcttcct ccacacctgc tggactactg ctgtgcccca atgccatacc 900tgattggaat acactccagc ctcatagaga gagtgaaaaa caaatcattg gaagatgttg 960ttatgttaaa tgttgataca aacacattag aatcaccatt tagtgacttg aacaacctac 1020caagtgatgt ggtctcggcc ttgaaaaata aactgaagaa gcagtctaca gctacgggtg 1080atggagtagc tagggccttt cttagagcac aggctgcttt gtttggatcc tacagagatg 1140cactgagata caaacctggt gagcccatca ctttctgtga ggagagtttt gtaaagcacc 1200gctcaagcgt gatgaaacag ttcctggaaa ctgccattaa cctccagctt tttaagcagt 1260ttatcgatgg tcgactggca aaactaaatg caggaagggg tttctctgat gtatttgaag 1320aagagatcac ttcaggtggc ttttgtggag gtaaagacaa gttacaatat gattatccat 1380tttctcaata acaattttct tggtctttgc acttgtgtct gataaaacct atttcataaa 1440caactaatga tttcctccta aatatgtaat gtcttaaata catttttcat cttataaaag 1500ctatggaatt agcttatttt gcctgatacc tgttactcaa ggcattaagt tggcctcctg 1560aattggcagc tgttggcctc gataatctct taatattgct ggaaattagt aatacagaaa 1620tccaatcaac tcatatcttc ctgtctttcc ttctgaatag tagtattctc tgctagaaaa 1680ctactagtga tggttattac tgagtatgaa tttaagaact gaggttatga ttggtaatac 1740aatccaaaaa gaagggtctg aacaccaaaa ttctttatac atatttaagt aactgtatta 1800ttattataca gatgtcttta cctttttgac tttatagatc actgcagcat taagaaagtt 1860tccagtttac cattccataa gtacaattaa tccttctagt gtaaatgttc aaatactgtt 1920ataattatct aggcaattaa taatttacaa actgatattt ttgcacgatt gtagtggtgt 1980atagtcttga cttgcagagc attttgcttg agtccttgaa atgtcgtgtt cattcattat 2040ttgctgagtg cttacaatgt attaggcact gttctaaata ttaagtgtac taaataaaca 2100aaaatccttg tattctg 2117462177DNAHomo sapiens 46gccgggggcg cagccgacat gggcccgccg ccacggctgc tgtgagcagc ctctttccct 60gtgtggccgc cggcgtgggc ggggacggcg cgaccctcgc gcggccgggc tgcgggcttc 120caggccagcg cgcgggggcc ggacggacag ccccacaccg acatgtaacc atggactgca 180ggaccaaggc aaatccagac agaacctttg acttggtgtt gaaagtgaaa tgtcatgcct 240ctgaaaatga agatcctgtg gtattgtgga aattcccaga ggactttgga gaccaggaaa 300tactacagag tgtgccaaag ttctgttttc cctttgacgt tgaaagggtg tctcagaatc 360aagttggaca gcactttacc tttgtactga cagacattga aagtaaacag agatttggat 420tctgcagact gacgtcagga ggcacaattt gtttatgcat ccttagttac cttccctggt 480ttgaagtgta ttacaagctt ctaaatactc ttgcagatta cttggctaag gaactggaaa 540atgatttgaa tgaaactctc agatcactgt ataaccaccc agtaccaaag gcaaatactc 600ctgtaaattt gagtgtgaac caagagatat ttattgcctg tgagcaagtt ctgaaagatc 660agcctgctct agtaccgcat tcctacttca ttgcccctga tgtaactgga ctcccaacaa 720tacccgagag tagaaatctt acagaatatt ttgttgccgt ggatgtgaac aacatgctgc 780agctgtatgc cagtatgctg catgaaaggc gcatcgtgat tatctcgagc aaattaagca 840ctttaactgc ctgtatccat ggatcagctg ctcttctata cccaatgtat tggcaacaca 900tatacatccc agtgcttcct ccacacctgc tggactactg ctgtgcccca atgccatacc 960tgattggaat acactccagc ctcatagaga gagtgaaaaa caaatcattg gaagatgttg 1020ttatgttaaa tgttgataca aacacattag aatcaccatt tagtgacttg aacaacctac 1080caagtgatgt ggtctcggcc ttgaaaaata aactgaagaa gcagtctaca gctacgggtg 1140atggagtagc tagggccttt cttagagcac aggctgcttt gtttggatcc tacagagatg 1200cactgagata caaacctggt gagcccatca ctttctgtga ggagagtttt gtaaagcacc 1260gctcaagcgt gatgaaacag ttcctggaaa ctgccattaa cctccagctt tttaagcagt 1320ttatcgatgg tcgactggca aaactaaatg caggaagggg tttctctgat gtatttgaag 1380aagagatcac ttcaggtggc ttttgtggag gtaaagacaa gttacaatat gattatccat 1440tttctcaata acaattttct tggtctttgc acttgtgtct gataaaacct atttcataaa 1500caactaatga tttcctccta aatatgtaat gtcttaaata catttttcat cttataaaag 1560ctatggaatt agcttatttt gcctgatacc tgttactcaa ggcattaagt tggcctcctg 1620aattggcagc tgttggcctc gataatctct taatattgct ggaaattagt aatacagaaa 1680tccaatcaac tcatatcttc ctgtctttcc ttctgaatag tagtattctc tgctagaaaa 1740ctactagtga tggttattac tgagtatgaa tttaagaact gaggttatga ttggtaatac 1800aatccaaaaa gaagggtctg aacaccaaaa ttctttatac atatttaagt aactgtatta 1860ttattataca gatgtcttta cctttttgac tttatagatc actgcagcat taagaaagtt 1920tccagtttac cattccataa gtacaattaa tccttctagt gtaaatgttc aaatactgtt 1980ataattatct aggcaattaa taatttacaa actgatattt ttgcacgatt gtagtggtgt 2040atagtcttga cttgcagagc attttgcttg agtccttgaa atgtcgtgtt cattcattat 2100ttgctgagtg cttacaatgt attaggcact gttctaaata ttaagtgtac taaataaaca 2160aaaatccttg tattctg 2177472007DNAHomo sapiens 47attgagacaa aaacaagggc aaatccagac agaacctttg acttggtgtt gaaagtgaaa 60tgtcatgcct ctgaaaatga agatcctgtg gtattgtgga aattcccaga ggactttgga 120gaccaggaaa tactacagag tgtgccaaag ttctgttttc cctttgacgt tgaaagggtg 180tctcagaatc aagttggaca gcactttacc tttgtactga cagacattga aagtaaacag 240agatttggat tctgcagact gacgtcagga ggcacaattt gtttatgcat ccttagttac 300cttccctggt ttgaagtgta ttacaagctt ctaaatactc ttgcagatta cttggctaag 360gaactggaaa atgatttgaa tgaaactctc agatcactgt ataaccaccc agtaccaaag 420gcaaatactc ctgtaaattt gagtgtgaac caagagatat ttattgcctg tgagcaagtt 480ctgaaagatc agcctgctct agtaccgcat tcctacttca ttgcccctga tgtaactgga 540ctcccaacaa tacccgagag tagaaatctt acagaatatt ttgttgccgt ggatgtgaac 600aacatgctgc agctgtatgc cagtatgctg catgaaaggc gcatcgtgat tatctcgagc 660aaattaagca ctttaactgc ctgtatccat ggatcagctg ctcttctata cccaatgtat 720tggcaacaca tatacatccc agtgcttcct ccacacctgc tggactactg ctgtgcccca 780atgccatacc tgattggaat acactccagc ctcatagaga gagtgaaaaa caaatcattg 840gaagatgttg ttatgttaaa tgttgataca aacacattag aatcaccatt tagtgacttg 900aacaacctac caagtgatgt ggtctcggcc ttgaaaaata aactgaagaa gcagtctaca 960gctacgggtg atggagtagc tagggccttt cttagagcac aggctgcttt gtttggatcc 1020tacagagatg cactgagata caaacctggt gagcccatca ctttctgtga ggagagtttt 1080gtaaagcacc gctcaagcgt gatgaaacag ttcctggaaa ctgccattaa cctccagctt 1140tttaagcagt ttatcgatgg tcgactggca aaactaaatg caggaagggg tttctctgat 1200gtatttgaag aagagatcac ttcaggtggc ttttgtggag gtaaagacaa gttacaatat 1260gattatccat tttctcaata acaattttct tggtctttgc acttgtgtct gataaaacct 1320atttcataaa caactaatga tttcctccta aatatgtaat gtcttaaata catttttcat 1380cttataaaag ctatggaatt agcttatttt gcctgatacc tgttactcaa ggcattaagt 1440tggcctcctg aattggcagc tgttggcctc gataatctct taatattgct ggaaattagt 1500aatacagaaa tccaatcaac tcatatcttc ctgtctttcc ttctgaatag tagtattctc 1560tgctagaaaa ctactagtga tggttattac tgagtatgaa tttaagaact gaggttatga 1620ttggtaatac aatccaaaaa gaagggtctg aacaccaaaa ttctttatac atatttaagt 1680aactgtatta ttattataca gatgtcttta cctttttgac tttatagatc actgcagcat 1740taagaaagtt tccagtttac cattccataa gtacaattaa tccttctagt gtaaatgttc 1800aaatactgtt ataattatct aggcaattaa taatttacaa actgatattt ttgcacgatt 1860gtagtggtgt atagtcttga cttgcagagc attttgcttg agtccttgaa atgtcgtgtt 1920cattcattat ttgctgagtg cttacaatgt attaggcact gttctaaata ttaagtgtac 1980taaataaaca aaaatccttg tattctg 2007482197DNAHomo sapiens 48gcgggggccg gacggacagc cccacaccga catgtaacca tggactgcag gaccaaggca 60aatccagaca gaacctttga cttggtgttg aaagtgaaat gtcatgcctc tgaaaatgaa 120gaggacagtc cagcttatct gccgaggatt ccccctggaa aagtacgccg attcgcattt 180tgcattaaga aactggaaaa ctttcctgtc ggtcctggcg tagcgcctcc cgtgtccggg 240gtagaccttg taccggctga aaccgcatag ctcgaccttc atggcggcag ctccacggga 300ggagaaaaga tggccccaac ctgtattttc gaatcctgtg gtattgtgga aattcccaga 360ggactttgga gaccaggaaa tactacagag tgtgccaaag ttctgttttc cctttgacgt 420tgaaagggtg tctcagaatc aagttggaca gcactttacc tttgtactga cagacattga 480aagtaaacag agatttggat tctgcagact gacgtcagga ggcacaattt gtttatgcat 540ccttagttac cttccctggt ttgaagtgta ttacaagctt ctaaatactc ttgcagatta 600cttggctaag gaactggaaa atgatttgaa tgaaactctc agatcactgt ataaccaccc 660agtaccaaag gcaaatactc ctgtaaattt gagtgtgcat tcctacttca ttgcccctga 720tgtaactgga ctcccaacaa tacccgagag tagaaatctt acagaatatt ttgttgccgt 780ggatgtgaac aacatgctgc agctgtatgc cagtatgctg catgaaaggc gcatcgtgat 840tatctcgagc aaattaagca ctttaactgc ctgtatccat ggatcagctg ctcttctata 900cccaatgtat tggcaacaca tatacatccc agtgcttcct ccacacctgc tggactactg 960ctgtgcccca atgccatacc tgattggaat acactccagc ctcatagaga gagtgaaaaa 1020caaatcattg gaagatgttg ttatgttaaa tgttgataca aacacattag aatcaccatt 1080tagtgacttg aacaacctac caagtgatgt ggtctcggcc ttgaaaaata aactgaagaa 1140gcagtctaca gctacgggtg atggagtagc tagggccttt cttagagcac aggctgcttt 1200gtttggatcc tacagagatg cactgagata caaacctggt gagcccatca ctttctgtga 1260ggagagtttt gtaaagcacc gctcaagcgt gatgaaacag ttcctggaaa ctgccattaa 1320cctccagctt tttaagcagt ttatcgatgg tcgactggca aaactaaatg caggaagggg 1380tttctctgat gtatttgaag aagagatcac ttcaggtggc ttttgtggag gtaaagacaa 1440gttacaatat gattatccat tttctcaata acaattttct tggtctttgc acttgtgtct 1500gataaaacct atttcataaa caactaatga tttcctccta aatatgtaat gtcttaaata 1560catttttcat cttataaaag ctatggaatt agcttatttt gcctgatacc tgttactcaa 1620ggcattaagt tggcctcctg aattggcagc tgttggcctc gataatctct taatattgct 1680ggaaattagt aatacagaaa tccaatcaac tcatatcttc ctgtctttcc ttctgaatag 1740tagtattctc tgctagaaaa ctactagtga tggttattac tgagtatgaa tttaagaact 1800gaggttatga ttggtaatac aatccaaaaa gaagggtctg aacaccaaaa ttctttatac 1860atatttaagt aactgtatta ttattataca gatgtcttta cctttttgac tttatagatc 1920actgcagcat taagaaagtt tccagtttac cattccataa gtacaattaa tccttctagt 1980gtaaatgttc aaatactgtt ataattatct aggcaattaa taatttacaa actgatattt 2040ttgcacgatt gtagtggtgt atagtcttga cttgcagagc attttgcttg agtccttgaa 2100atgtcgtgtt cattcattat ttgctgagtg cttacaatgt attaggcact gttctaaata 2160ttaagtgtac taaataaaca aaaatccttg tattctg 219749131PRTHomo sapiens 49Met Phe Leu Gly Leu Val Gly Leu Arg Thr Lys Gly Arg Arg Trp Ile 1 5 10 15 Ser Ser Trp Ser Glu Gly Glu Asp Arg Gly Gln Ser Pro Glu Gly Val 20 25 30 Leu Leu Thr Trp Val Phe Gly Thr Lys Cys Val Met His Pro Cys Glu 35 40 45 Glu Thr Thr Lys Gln Ala Leu Cys Glu Gln Gln Gly Cys Leu Phe His 50 55 60 Leu Gly Ala Asp Glu Leu Ser Pro Lys Arg Glu Ser Ala Gln Ser Ile 65 70

75 80 Ser Phe Lys Trp Glu Asn Ser Ile Tyr Leu His Ala Thr Leu Phe Leu 85 90 95 Ile Gly Glu Tyr Leu His Leu Ala Phe Tyr Tyr Phe Leu Leu Val Leu 100 105 110 Tyr Ile Leu Cys Ser Phe Leu Ser Tyr Cys Leu Leu Leu Trp Leu Gly 115 120 125 Ser Phe Leu 130 502950DNAHomo sapiens 50attatctagg tctcggagga tggagaaatc aaaagtgcca ttttctggcc atttagaacc 60attgtcgagt ttgtattggg gccaagcagt gttgcagaag aaaataagac atttagattt 120tagttcaggt gatagttgaa gaaattttaa gttcttgaga acacaggcta agggagaaga 180aggaggaatg gagggtggaa gtttgcccat agtgaaggag gcaagtttaa agagaaaggt 240agagacatgg agaaagggtt ggggagcagc cctgggctgc aatgtgggtg agcagccaaa 300gcaggcatcc ccgcaattga cttgccacca agggaatgtg gttgaatgac caaggcaggc 360atccctgaag atatcagacg ccaatggaat gtgggtgaat aatcaggcag gcatccccgg 420aatgattaaa cactaaggga aggctgcctt cctgagtaca tgaccagcac cagagttttg 480ggtccatgga taaaatgtgt ctcctttgtc tctactagaa aatgaaagga attgaaatta 540agagaagaga gggagtgaag ggtggcacca agaatgaaag gagaaagagg ttgagggata 600gtgagaaagg ttggagaaga gagtaaaaag aggccactta cccgatttaa aatttgtgag 660atgttccttg ggctggttgg tctgaggacc aaaggtcgta ggtggatctc ttcatggagt 720gagggtgagg acaggggaca gtctcctgaa ggagtcctgc tgacctgggt ctttggcacc 780aaatgtgtca tgcatccatg tgaagagacc accaaacagg ctttgtgtga gcaacagggc 840tgtttgtttc acctgggtgc agacgagttg agtccaaaaa gagagtcagc ccagtctata 900tcttttaaat gggaaaattc aatctattta catgcaacgt tattcttgat aggtgagtac 960ttacatctag cattctatta ttttctgctt gttttgtata tcctttgttc cttcctctct 1020tactgtttac ttttgtggtt gggcagtttt ctgtagggat aagatttgag tcttatctct 1080ttctccctat gtgttagctt taccagtgag tttttatagt ttcacatatt tttatgatgc 1140tggttatcat cttctctgtg gggaacaggc cccccaaaac ctggccataa actggcccca 1200aaactggcca taaacaaaat ctctgcagca ctgtgacatg tacatgatgg tcttaacgcc 1260cacgctggaa ggttgtgggt ttaccagaat gagggcaagg aacacctggc ccacccaggg 1320tggaaaaccg cttaaaggca ttcttaaacc acaaacaata gcatgagtga tctgtgcctt 1380aaggccatgc tcctgctgca gatagctagt ccaacccatc cctttatttc agcccatctc 1440ttcatttccc ataaggaata attttagtta atctaatatc tatagaaaga atgctaatga 1500ctagcttgct gttaataaat acatgggtaa acctctgttg gaggctctca gctctgaagg 1560ctgtgagacc cttgatttcc tacttcactc ctctatattt ctgtgtcttt aattcctcta 1620gtgccactgg gttagagtct ccccgaccaa gctggtctca gcaagtggtc tccatcatgg 1680gggctcgaat ccaggttgaa gggtcaccag agtgatggtt ggagaacatg gaactagctg 1740gaggacacct gagtactctt aaagcaaacc ccgtggtgag taagaagggg agctcagaag 1800catcagggta acaatgggac aagtgtgggg tctggttcgt tccatcttgg aactttttca 1860cactgatgat gaggaagaag gagagtataa tgaagtaaca gaagaggtta tagagcaggt 1920ttatttgcca gctaaagcta aagtggcaaa ggagggagag gttcatccct acccttctgc 1980accccctcat tattattttg aagaaaaaga gtggcctgac cctccagatc tttcttttcc 2040agaggacagt gggcaaaaat tagttgcccc agtgactgtt caagcagcac ctcgagcgac 2100tgctcttagt tctattcagt caggaattca gcaagctaga tgagaaggtg attaagaggc 2160ttggcagttc cctgttagac tacactgccc agaccaacag ggaaatattg tagctacatt 2220tgagcctttt tgttttaaat tactcaaaga atttaaacaa gctattaatc agtatggacc 2280aggttctcct tttgtaatgg gactattaaa gaacattgct gtttccagtc agatgattcc 2340tactgactgg gacgctctta ctcaagcttg tctaactcct gcttagttct tacaatttaa 2400aacttggtgg gcagatgaag cttccattca ggcttctcac aacacgcagg accaacctca 2460aattaatata actgcagacc aacttttggg ggttggcagt tgggctggtt tagatgcaca 2520aatggtcatg caggatgatg ccatagaaca gcttagagga gcgtgcatta gagcttgggg 2580aaaaaaaatc acttcaagtg gagaacaata ccctttcttt agtgctataa aacagggacc 2640agaagaatca tatgtggatt ttatagctca gttacaggag tctcttaaaa agatgactgc 2700agatttggct gctcaggata tagtgttgca attattagct ttcaacaatg ctaatcctga 2760ttgccaggct gctctgtgac ctatcagagg gaaagcacat ttagttgatt atatcaaggc 2820ctgtggtggt atcagaggta atctgcatca ggccacctgc tagcacgggc aatggcagga 2880ctgagagtgg atacagaaag tactccattt cctggagctt gttttaactg tgggaagcat 2940ggtcatactg 295051132PRTHomo sapiens 51Met Ser Gln Gly Arg His Leu Leu Glu Phe Leu Pro Leu Tyr Ile Ala 1 5 10 15 Phe Met Leu Arg Gly Val Cys Arg Ile Asp Ala Gly Ser Leu Asn Pro 20 25 30 Glu Leu Phe Leu Pro Met Leu His Glu Glu Asp Trp Cys Trp Glu Ile 35 40 45 Ala Gly His Val Asp Ser Gln Glu Leu Phe Val Gly Leu Phe Ser Ser 50 55 60 Thr Ser Thr Gly His Ala Glu Leu Asp Lys Lys Val Asn Gly Leu Tyr 65 70 75 80 Tyr Asp Ser Val Phe Gln Leu Ser Leu Asp Arg Met Arg His Thr Arg 85 90 95 Ser Met Ala Arg Val Glu Arg Leu Arg His Arg Lys Ala Ile Gln Lys 100 105 110 Lys Thr Gln Leu Val His His Leu Leu Phe Lys Gly Trp Ala Ser Asp 115 120 125 Glu Thr Glu Ile 130 52399DNAHomo sapiens 52atgtcacaag gcaggcatct tcttgagttt cttccattgt acatagcttt catgttacgt 60ggggtttgta ggatagacgc tggaagcctt aatccagaac tgtttttgcc aatgttacat 120gaagaggatt ggtgttggga gatagctggc catgtggact cccaagagtt attcgttggt 180ttgttttcta gtacctctac tgggcatgca gagctggaca aaaaggttaa tggactttat 240tatgactctg tattccagtt gtctctggac cgtatgcgtc atacaaggag tatggctaga 300gtagagaggc tgagacacag gaaagcgatc cagaaaaaga ctcagttagt ccatcatctg 360ctatttaaag gatgggcttc tgatgaaact gaaatttag 39953431PRTHomo sapiens 53Met Arg Arg Leu Arg Arg Leu Ala His Leu Val Leu Phe Cys Pro Phe 1 5 10 15 Ser Lys Arg Leu Gln Gly Arg Leu Pro Gly Leu Arg Val Arg Cys Ile 20 25 30 Phe Leu Ala Trp Leu Gly Val Phe Ala Gly Ser Trp Leu Val Tyr Val 35 40 45 His Tyr Ser Ser Tyr Ser Glu Arg Cys Arg Gly His Val Cys Gln Val 50 55 60 Val Ile Cys Asp Gln Tyr Arg Lys Gly Ile Ile Ser Gly Ser Val Cys 65 70 75 80 Gln Asp Leu Cys Glu Leu His Met Val Glu Trp Arg Thr Cys Leu Ser 85 90 95 Val Ala Pro Gly Gln Gln Val Tyr Ser Gly Leu Trp Arg Asp Lys Asp 100 105 110 Val Thr Ile Lys Cys Gly Ile Glu Glu Thr Leu Asp Ser Lys Ala Arg 115 120 125 Ser Asp Ala Ala Pro Arg Arg Glu Leu Val Leu Phe Asp Lys Pro Thr 130 135 140 Arg Gly Thr Ser Ile Lys Glu Phe Arg Glu Met Thr Leu Ser Phe Leu 145 150 155 160 Lys Ala Asn Leu Gly Asp Leu Pro Ser Leu Pro Ala Leu Val Gly Gln 165 170 175 Val Leu Leu Met Ala Asp Phe Asn Lys Asp Asn Arg Val Ser Leu Ala 180 185 190 Glu Ala Lys Ser Val Trp Ala Leu Leu Gln Arg Asn Glu Phe Leu Leu 195 200 205 Leu Leu Ser Leu Gln Glu Lys Glu His Ala Ser Arg Leu Leu Gly Tyr 210 215 220 Cys Gly Asp Leu Tyr Leu Thr Glu Gly Val Pro His Gly Ala Trp His 225 230 235 240 Ala Ala Ala Leu Pro Pro Leu Leu Arg Pro Leu Leu Pro Pro Ala Leu 245 250 255 Gln Gly Ala Leu Gln Gln Trp Leu Gly Pro Ala Trp Pro Trp Arg Ala 260 265 270 Lys Ile Ala Ile Gly Leu Leu Glu Phe Val Glu Glu Leu Phe His Gly 275 280 285 Ser Tyr Gly Thr Phe Tyr Met Cys Glu Thr Thr Leu Ala Asn Val Gly 290 295 300 Tyr Thr Ala Thr Tyr Asp Phe Lys Met Ala Asp Leu Gln Gln Val Ala 305 310 315 320 Pro Glu Ala Thr Val Arg Arg Phe Leu Gln Gly Arg Arg Cys Glu His 325 330 335 Ser Thr Asp Cys Thr Tyr Gly Arg Asp Cys Arg Ala Pro Cys Asp Arg 340 345 350 Leu Met Arg Gln Cys Lys Gly Asp Leu Ile Gln Pro Asn Leu Ala Lys 355 360 365 Val Cys Ala Leu Leu Arg Gly Tyr Leu Leu Pro Gly Ala Pro Ala Asp 370 375 380 Leu Arg Glu Glu Leu Gly Thr Gln Leu Arg Thr Cys Thr Thr Leu Ser 385 390 395 400 Gly Leu Ala Ser Gln Val Glu Ala His His Ser Leu Val Leu Ser His 405 410 415 Leu Lys Thr Leu Leu Trp Lys Lys Ile Ser Asn Thr Lys Tyr Ser 420 425 430 54344PRTHomo sapiens 54Met Val Glu Trp Arg Thr Cys Leu Ser Val Ala Pro Gly Gln Gln Val 1 5 10 15 Tyr Ser Gly Leu Trp Arg Asp Lys Asp Val Thr Ile Lys Cys Gly Ile 20 25 30 Glu Glu Thr Leu Asp Ser Lys Ala Arg Ser Asp Ala Ala Pro Arg Arg 35 40 45 Glu Leu Val Leu Phe Asp Lys Pro Thr Arg Gly Thr Ser Ile Lys Glu 50 55 60 Phe Arg Glu Met Thr Leu Ser Phe Leu Lys Ala Asn Leu Gly Asp Leu 65 70 75 80 Pro Ser Leu Pro Ala Leu Val Gly Gln Val Leu Leu Met Ala Asp Phe 85 90 95 Asn Lys Asp Asn Arg Val Ser Leu Ala Glu Ala Lys Ser Val Trp Ala 100 105 110 Leu Leu Gln Arg Asn Glu Phe Leu Leu Leu Leu Ser Leu Gln Glu Lys 115 120 125 Glu His Ala Ser Arg Leu Leu Gly Tyr Cys Gly Asp Leu Tyr Leu Thr 130 135 140 Glu Gly Val Pro His Gly Ala Trp His Ala Ala Ala Leu Pro Pro Leu 145 150 155 160 Leu Arg Pro Leu Leu Pro Pro Ala Leu Gln Gly Ala Leu Gln Gln Trp 165 170 175 Leu Gly Pro Ala Trp Pro Trp Arg Ala Lys Ile Ala Ile Gly Leu Leu 180 185 190 Glu Phe Val Glu Glu Leu Phe His Gly Ser Tyr Gly Thr Phe Tyr Met 195 200 205 Cys Glu Thr Thr Leu Ala Asn Val Gly Tyr Thr Ala Thr Tyr Asp Phe 210 215 220 Lys Met Ala Asp Leu Gln Gln Val Ala Pro Glu Ala Thr Val Arg Arg 225 230 235 240 Phe Leu Gln Gly Arg Arg Cys Glu His Ser Thr Asp Cys Thr Tyr Gly 245 250 255 Arg Asp Cys Arg Ala Pro Cys Asp Arg Leu Met Arg Gln Cys Lys Gly 260 265 270 Asp Leu Ile Gln Pro Asn Leu Ala Lys Val Cys Ala Leu Leu Arg Gly 275 280 285 Tyr Leu Leu Pro Gly Ala Pro Ala Asp Leu Arg Glu Glu Leu Gly Thr 290 295 300 Gln Leu Arg Thr Cys Thr Thr Leu Ser Gly Leu Ala Ser Gln Val Glu 305 310 315 320 Ala His His Ser Leu Val Leu Ser His Leu Lys Thr Leu Leu Trp Lys 325 330 335 Lys Ile Ser Asn Thr Lys Tyr Ser 340 551645DNAHomo sapiens 55agggagcggc ggccgctgcg ggccgggccg ggccggggct gaggccgagc gagccgcggg 60gcccgcgcag ccccggccgg agcccaccat gcggcggctg cggcgcctgg cgcacctggt 120gctcttctgc cccttctcca agcgcctgca gggccggctc ccaggcctca gggtccgctg 180catcttcctg gcctggctgg gcgtctttgc aggcagctgg ctggtgtacg tgcactactc 240gtcctactcg gagcgctgtc gcggccatgt ctgccaggtg gtcatttgtg accagtaccg 300caaggggatc atctcgggct ccgtctgcca ggacctgtgt gagctgcata tggtggagtg 360gaggacctgc ctctcggtgg ccccgggcca gcaggtgtac agcgggctct ggcgggacaa 420ggatgtaacc atcaagtgtg gcattgagga gaccctcgac tccaaggccc ggtcggatgc 480ggccccccgg cgggagctgg tactgtttga caagcccacc cggggcacct ccatcaagga 540attccgggag atgaccctca gcttcctcaa ggcgaacctg ggagacctgc cttccctgcc 600ggcgctggtt ggccaggtcc tgctcatggc tgacttcaac aaggacaacc gggtgtccct 660ggcggaagcc aagtccgtgt gggccctgct gcagcgtaac gagttcctgc tgctgctgtc 720cctgcaggag aaggagcacg cctccagact gctgggctac tgtggggacc tctacctcac 780cgagggcgtg ccgcatggcg cctggcacgc ggccgccctt ccacccctgt tgcgcccact 840gctgccgcct gccctgcagg gtgctctcca gcagtggctg gggcctgcgt ggccttggcg 900ggccaagatc gccatcggcc tgctggagtt cgtggaggag ctcttccacg gctcttacgg 960gactttctac atgtgtgaga ccacactggc caacgtgggc tacacagcca cctacgactt 1020caagatggcc gacctgcagc aggtggcacc cgaggccacc gtgcgccgct tcctgcaggg 1080ccgccgctgc gagcacagca ccgactgcac ctacgggcgc gactgcaggg ccccgtgtga 1140caggctcatg aggcagtgca agggcgacct catccagccc aacctggcca aggtgtgcgc 1200actgctacgg ggctacctgc tgcctggcgc gcccgccgac ctccgcgagg agctgggcac 1260acagctgcgc acctgtacca cgctgagcgg gctggccagc caggtggagg cccatcactc 1320gctggtgctc agccacctca agactctgct ctggaagaag atctccaaca ccaagtactc 1380ttgatggggc agtgaggggc ctggccaccc ttcctggagc tggccaggtg ccagggtcca 1440accctccctc aaggaatcct gtcagaagat gtgaaatgca actgtgttgc aaaatcactc 1500ccctaccgtc agggctctgg attccagcac cacagacatg agaccccagc tcggagcaaa 1560ggcggacatg gacatcccgg caggagagtc ctccaagggg gtttgttact ctgaagaacg 1620taatgtcaat aaacagcttt tatgt 1645562410DNAHomo sapiens 56ggccgcagcc tgtcctcctg cctcagcctg gggaaggctg ggccgggcca aagagggagg 60ccaagtcctt gggacaggag gagacccaca cctgagatta gtggaaaccc agccagaagc 120tgccatgcag gtgtggctct gggcatcagg atctgtcgtg agagccccca tgagtgccca 180gtgcagaatg gctggcagcc cgccctgacg gggagcagag gggctggacg cggtgccctt 240cacgggatga cacccagctg tttgccctgt gtccaggggg ttgcttctct gacagaggcc 300ctatggctcg tgtctgactc ctgtccaggt tctgccagcc tgaccatcca tcgctctggc 360accaagagcc cacccttttg ttcttcctgg cgtcccaggg aaagccctgc ctgggtgggg 420cagctcctgg cccttcagat ggaagacgca gtccagtcag caccatcata ggaaacaagt 480tcagaaatgt ctcacttact attccgggca gggagggcgc catgagtcag ggggtgcatc 540ctccctcctg gcgtcacccg aggcaggaat gaagagtcag gcagagagcg cgcgtgtggc 600agctggtggt gtagatatta gggactagtg tgaattctag ttcaccggcc aatgcctgga 660tggtccagag ctgggtcggc tgggcggaga gctgcctcca ggttcctgcc tctggccctg 720gcgtggggtc gacactgggt gtggtgtgtg tctcatgtcc aggcagtggc ctttgctgtg 780ccgtcctgtt acaggagcca ggatggtggg gacgggaccg gaccggaggg ttggcggggc 840tgcccctgca gccgacagcc ccatcctgca gccaccaatg gcatgaccca gggccccggc 900actgcctgtg tgaggggctg gcagctttcc aactgcagca agtggaggcc cctgccagct 960tcgggcctgt gggcaggggc tcagtggggc aggggtgtgg ctgccccgcc cggcacgcct 1020gcacctgtct cctcagtgtg accagtaccg caaggggatc atctcgggct ccgtctgcca 1080ggacctgtgt gagctgcata tggtggagtg gaggacctgc ctctcggtgg ccccgggcca 1140gcaggtgtac agcgggctct ggcgggacaa ggatgtaacc atcaagtgtg gcattgagga 1200gaccctcgac tccaaggccc ggtcggatgc ggccccccgg cgggagctgg tactgtttga 1260caagcccacc cggggcacct ccatcaagga attccgggag atgaccctca gcttcctcaa 1320ggcgaacctg ggagacctgc cttccctgcc ggcgctggtt ggccaggtcc tgctcatggc 1380tgacttcaac aaggacaacc gggtgtccct ggcggaagcc aagtccgtgt gggccctgct 1440gcagcgtaac gagttcctgc tgctgctgtc cctgcaggag aaggagcacg cctccagact 1500gctgggctac tgtggggacc tctacctcac cgagggcgtg ccgcatggcg cctggcacgc 1560ggccgccctt ccacccctgt tgcgcccact gctgccgcct gccctgcagg gtgctctcca 1620gcagtggctg gggcctgcgt ggccttggcg ggccaagatc gccatcggcc tgctggagtt 1680cgtggaggag ctcttccacg gctcttacgg gactttctac atgtgtgaga ccacactggc 1740caacgtgggc tacacagcca cctacgactt caagatggcc gacctgcagc aggtggcacc 1800cgaggccacc gtgcgccgct tcctgcaggg ccgccgctgc gagcacagca ccgactgcac 1860ctacgggcgc gactgcaggg ccccgtgtga caggctcatg aggcagtgca agggcgacct 1920catccagccc aacctggcca aggtgtgcgc actgctacgg ggctacctgc tgcctggcgc 1980gcccgccgac ctccgcgagg agctgggcac acagctgcgc acctgtacca cgctgagcgg 2040gctggccagc caggtggagg cccatcactc gctggtgctc agccacctca agactctgct 2100ctggaagaag atctccaaca ccaagtactc ttgatggggc agtgaggggc ctggccaccc 2160ttcctggagc tggccaggtg ccagggtcca accctccctc aaggaatcct gtcagaagat 2220gtgaaatgca actgtgttgc aaaatcactc ccctaccgtc agggctctgg attccagcac 2280cacagacatg agaccccagc tcggagcaaa ggcggacatg gacatcccgg caggagagtc 2340ctccaagggg gtttgttact ctgaagaacg taatgtcaat aaacagcttt tatgtaatgc 2400ccagggctga 241057423PRTHomo sapiens 57Met Gly Gly Pro Arg Ala Trp Ala Leu Leu Cys Leu Gly Leu Leu Leu 1 5 10 15 Pro Gly Gly Gly Ala Ala Trp Ser Ile Gly Ala Ala Pro Phe Ser Gly 20 25 30 Arg Arg Asn Trp Cys Ser Tyr Val Val Thr Arg Thr Ile Ser Cys His 35 40 45 Val Gln Asn Gly Thr Tyr Leu Gln Arg Val Leu Gln Asn Cys Pro Trp 50 55 60 Pro Met Ser Cys Pro Gly Ser Arg Thr Val Val Arg Pro Thr Tyr Lys 65 70 75 80 Val Met Tyr Lys Ile Val Thr Ala Pro Ser Ser Ala Ser Leu Glu Pro 85 90 95 Met Trp Ser Gly Ser Thr Met Arg Arg Met Ala Leu Arg Pro Thr Ala 100 105 110 Phe Ser Gly Cys Leu Asn Cys Ser Lys Val Ser Glu Leu Thr Glu Arg 115 120 125 Leu Lys Val Leu Glu Ala Lys Met Thr Met Leu Thr Val Ile Glu Gln 130 135 140 Pro Val Pro Pro Thr Pro Ala Thr Pro Glu Asp Pro Ala Pro Leu Trp 145 150 155 160 Gly Pro Pro Pro

Ala Gln Gly Ser Pro Gly Asp Gly Gly Leu Gln Asp 165 170 175 Gln Val Gly Ala Trp Gly Leu Pro Gly Pro Thr Gly Pro Lys Gly Asp 180 185 190 Ala Gly Ser Arg Gly Pro Met Gly Met Arg Gly Pro Pro Gly Pro Gln 195 200 205 Gly Pro Pro Gly Ser Pro Gly Arg Ala Gly Ala Val Gly Thr Pro Gly 210 215 220 Glu Arg Gly Pro Pro Gly Pro Pro Gly Pro Pro Gly Pro Pro Gly Pro 225 230 235 240 Pro Ala Pro Val Gly Pro Pro His Ala Arg Ile Ser Gln His Gly Asp 245 250 255 Pro Leu Leu Ser Asn Thr Phe Thr Glu Thr Asn Asn His Trp Pro Gln 260 265 270 Gly Pro Thr Gly Pro Pro Gly Pro Pro Gly Pro Met Gly Pro Pro Gly 275 280 285 Pro Pro Gly Pro Thr Gly Val Pro Gly Ser Pro Gly His Ile Gly Pro 290 295 300 Pro Gly Pro Thr Gly Pro Lys Gly Ile Ser Gly His Pro Gly Glu Lys 305 310 315 320 Gly Glu Arg Gly Leu Arg Gly Glu Pro Gly Pro Gln Gly Ser Ala Gly 325 330 335 Gln Arg Gly Glu Pro Gly Pro Lys Gly Asp Pro Gly Glu Lys Ser His 340 345 350 Trp Gly Glu Gly Leu His Gln Leu Arg Glu Ala Leu Lys Ile Leu Ala 355 360 365 Glu Arg Val Leu Ile Leu Glu Thr Met Ile Gly Leu Tyr Glu Pro Glu 370 375 380 Leu Gly Ser Gly Ala Gly Pro Ala Gly Thr Gly Thr Pro Ser Leu Leu 385 390 395 400 Arg Gly Lys Arg Gly Gly His Ala Thr Asn Tyr Arg Ile Val Ala Pro 405 410 415 Arg Ser Arg Asp Glu Arg Gly 420 58212PRTHomo sapiens 58Met Gly Gly Pro Arg Ala Trp Ala Leu Leu Cys Leu Gly Leu Leu Leu 1 5 10 15 Pro Gly Gly Gly Ala Ala Trp Ser Ile Gly Ala Ala Pro Phe Ser Gly 20 25 30 Arg Arg Asn Trp Cys Ser Tyr Val Val Thr Arg Thr Ile Ser Cys His 35 40 45 Val Gln Asn Gly Thr Tyr Leu Gln Arg Val Leu Gln Asn Cys Pro Trp 50 55 60 Pro Met Ser Cys Pro Gly Ser Ser Tyr Arg Thr Val Val Arg Pro Thr 65 70 75 80 Tyr Lys Val Met Tyr Lys Ile Val Thr Ala Arg Glu Trp Arg Cys Cys 85 90 95 Pro Gly His Ser Gly Val Ser Cys Glu Glu Val Ala Ala Ser Ser Ala 100 105 110 Ser Leu Glu Pro Met Trp Ser Gly Ser Thr Met Arg Arg Met Ala Leu 115 120 125 Arg Pro Thr Ala Phe Ser Gly Cys Leu Asn Cys Ser Lys Val Ser Glu 130 135 140 Leu Thr Glu Arg Leu Lys Val Leu Glu Ala Lys Met Thr Met Leu Thr 145 150 155 160 Val Ile Glu Gln Pro Val Pro Pro Thr Pro Ala Thr Pro Glu Asp Pro 165 170 175 Ala Pro Leu Trp Gly Pro Pro Pro Ala Gln Gly Ser Pro Gly Asp Gly 180 185 190 Gly Leu Gln Gly Asp Pro Leu Leu Ser Asn Thr Phe Thr Glu Thr Asn 195 200 205 Asn His Trp Pro 210 59175PRTHomo sapiens 59Met Gly Gly Pro Arg Ala Trp Ala Leu Leu Cys Leu Gly Leu Leu Leu 1 5 10 15 Pro Gly Gly Gly Ala Ala Trp Ser Ile Gly Ala Ala Pro Phe Ser Gly 20 25 30 Arg Arg Asn Trp Cys Ser Tyr Val Val Thr Arg Thr Ile Ser Cys His 35 40 45 Val Gln Asn Gly Thr Tyr Leu Gln Arg Val Leu Gln Asn Cys Pro Trp 50 55 60 Pro Met Ser Cys Pro Gly Ser Ser Tyr Arg Thr Val Val Arg Pro Thr 65 70 75 80 Tyr Lys Val Met Tyr Lys Ile Val Thr Ala Arg Glu Trp Arg Cys Cys 85 90 95 Pro Gly His Ser Gly Val Ser Cys Glu Glu Gly Cys Leu Asn Cys Ser 100 105 110 Lys Val Ser Glu Leu Thr Glu Arg Leu Lys Val Leu Glu Ala Lys Met 115 120 125 Thr Met Leu Thr Val Ile Glu Gln Pro Val Pro Pro Thr Pro Ala Thr 130 135 140 Pro Glu Asp Pro Ala Pro Leu Trp Gly Pro Pro Pro Ala Gln Gly Ser 145 150 155 160 Pro Gly Asp Gly Gly Leu Gln Asp Gln Val Gly Ala Trp Gly Leu 165 170 175 6034PRTHomo sapiens 60Met Gly Gly Pro Arg Ala Trp Ala Leu Leu Cys Leu Gly Leu Leu Leu 1 5 10 15 Pro Gly Gly Gly Ala Ala Trp Ser Ile Gly Ala Ala Pro Phe Ser Gly 20 25 30 Arg Arg 61445PRTHomo sapiens 61Met Gly Gly Pro Arg Ala Trp Ala Leu Leu Cys Leu Gly Leu Leu Leu 1 5 10 15 Pro Gly Gly Gly Ala Ala Trp Ser Ile Gly Ala Ala Pro Phe Ser Gly 20 25 30 Arg Arg Asn Trp Cys Ser Tyr Val Val Thr Arg Thr Ile Ser Cys His 35 40 45 Val Gln Asn Gly Thr Tyr Leu Gln Arg Val Leu Gln Asn Cys Pro Trp 50 55 60 Pro Met Ser Cys Pro Gly Ser Ser Tyr Arg Thr Val Val Arg Pro Thr 65 70 75 80 Tyr Lys Val Met Tyr Lys Ile Val Thr Ala Arg Glu Trp Arg Cys Cys 85 90 95 Pro Gly His Ser Gly Val Ser Cys Glu Glu Val Ala Ala Ser Ser Ala 100 105 110 Ser Leu Glu Pro Met Trp Ser Gly Ser Thr Met Arg Arg Met Ala Leu 115 120 125 Arg Pro Thr Ala Phe Ser Gly Cys Leu Asn Cys Ser Lys Val Ser Glu 130 135 140 Leu Thr Glu Arg Leu Lys Val Leu Glu Ala Lys Met Thr Met Leu Thr 145 150 155 160 Val Ile Glu Gln Pro Val Pro Pro Thr Pro Ala Thr Pro Glu Asp Pro 165 170 175 Ala Pro Leu Trp Gly Pro Pro Pro Ala Gln Gly Ser Pro Gly Asp Gly 180 185 190 Gly Leu Gln Asp Gln Val Gly Ala Trp Gly Leu Pro Gly Pro Thr Gly 195 200 205 Pro Lys Gly Asp Ala Gly Ser Arg Gly Pro Met Gly Met Arg Gly Pro 210 215 220 Pro Gly Pro Gln Gly Pro Pro Gly Ser Pro Gly Arg Ala Gly Ala Val 225 230 235 240 Gly Thr Pro Gly Glu Arg Gly Pro Pro Gly Pro Pro Gly Pro Pro Gly 245 250 255 Pro Pro Gly Pro Pro Ala Pro Val Gly Pro Pro His Ala Arg Ile Ser 260 265 270 Gln His Gly Asp Pro Leu Leu Ser Asn Thr Phe Thr Glu Thr Asn Asn 275 280 285 His Trp Pro Gln Gly Pro Thr Gly Pro Pro Gly Pro Pro Gly Pro Met 290 295 300 Gly Pro Pro Gly Pro Pro Gly Pro Thr Gly Val Pro Gly Ser Pro Gly 305 310 315 320 His Ile Gly Pro Pro Gly Pro Thr Gly Pro Lys Gly Ile Ser Gly His 325 330 335 Pro Gly Glu Lys Gly Glu Arg Gly Leu Arg Gly Glu Pro Gly Pro Gln 340 345 350 Gly Ser Ala Gly Gln Arg Gly Glu Pro Gly Pro Lys Gly Asp Pro Gly 355 360 365 Glu Lys Ser His Trp Ala Pro Ser Leu Gln Ser Phe Leu Gln Gln Gln 370 375 380 Ala Gln Leu Glu Leu Leu Ala Arg Arg Val Thr Leu Leu Glu Ala Ile 385 390 395 400 Ile Trp Pro Glu Pro Glu Leu Gly Ser Gly Ala Gly Pro Ala Gly Thr 405 410 415 Gly Thr Pro Ser Leu Leu Arg Gly Lys Arg Gly Gly His Ala Thr Asn 420 425 430 Tyr Arg Ile Val Ala Pro Arg Ser Arg Asp Glu Arg Gly 435 440 445 62443PRTHomo sapiens 62Met Gly Gly Pro Arg Ala Trp Ala Leu Leu Cys Leu Gly Leu Leu Leu 1 5 10 15 Pro Gly Gly Gly Ala Ala Trp Ser Ile Gly Ala Ala Pro Phe Ser Gly 20 25 30 Arg Arg Asn Trp Cys Ser Tyr Val Val Thr Arg Thr Ile Ser Cys His 35 40 45 Val Gln Asn Gly Thr Tyr Leu Gln Arg Val Leu Gln Asn Cys Pro Trp 50 55 60 Pro Met Ser Cys Pro Gly Ser Ser Tyr Arg Thr Val Val Arg Pro Thr 65 70 75 80 Tyr Lys Val Met Tyr Lys Ile Val Thr Ala Arg Glu Trp Arg Cys Cys 85 90 95 Pro Gly His Ser Gly Val Ser Cys Glu Glu Val Ala Ala Ser Ser Ala 100 105 110 Ser Leu Glu Pro Met Trp Ser Gly Ser Thr Met Arg Arg Met Ala Leu 115 120 125 Arg Pro Thr Ala Phe Ser Gly Cys Leu Asn Cys Ser Lys Val Ser Glu 130 135 140 Leu Thr Glu Arg Leu Lys Val Leu Glu Ala Lys Met Thr Met Leu Thr 145 150 155 160 Val Ile Glu Gln Pro Val Pro Pro Thr Pro Ala Thr Pro Glu Asp Pro 165 170 175 Ala Pro Leu Trp Gly Pro Pro Pro Ala Gln Gly Ser Pro Gly Asp Gly 180 185 190 Gly Leu Gln Asp Gln Val Gly Ala Trp Gly Leu Pro Gly Pro Thr Gly 195 200 205 Pro Lys Gly Asp Ala Gly Ser Arg Gly Pro Met Gly Met Arg Gly Pro 210 215 220 Pro Gly Pro Gln Gly Pro Pro Gly Ser Pro Gly Arg Ala Gly Ala Val 225 230 235 240 Gly Thr Pro Gly Glu Arg Gly Pro Pro Gly Pro Pro Gly Pro Pro Gly 245 250 255 Pro Pro Gly Pro Pro Ala Pro Val Gly Pro Pro His Ala Arg Ile Ser 260 265 270 Gln His Gly Asp Pro Leu Leu Ser Asn Thr Phe Thr Glu Thr Asn Asn 275 280 285 His Trp Pro Gln Gly Pro Thr Gly Pro Pro Gly Pro Pro Gly Pro Met 290 295 300 Gly Pro Pro Gly Pro Pro Gly Pro Thr Gly Val Pro Gly Ser Pro Gly 305 310 315 320 His Ile Gly Pro Pro Gly Pro Thr Gly Pro Lys Gly Ile Ser Gly His 325 330 335 Pro Gly Glu Lys Gly Glu Arg Gly Leu Arg Gly Glu Pro Gly Pro Gln 340 345 350 Gly Ser Ala Gly Gln Arg Gly Glu Pro Gly Pro Lys Gly Asp Pro Gly 355 360 365 Glu Lys Ser His Trp Gly Glu Gly Leu His Gln Leu Arg Glu Ala Leu 370 375 380 Lys Ile Leu Ala Glu Arg Val Leu Ile Leu Glu Thr Met Ile Gly Leu 385 390 395 400 Tyr Glu Pro Glu Leu Gly Ser Gly Ala Gly Pro Ala Gly Thr Gly Thr 405 410 415 Pro Ser Leu Leu Arg Gly Lys Arg Gly Gly His Ala Thr Asn Tyr Arg 420 425 430 Ile Val Ala Pro Arg Ser Arg Asp Glu Arg Gly 435 440 63441PRTHomo sapiens 63Met Gly Gly Pro Arg Ala Trp Ala Leu Leu Cys Leu Gly Leu Leu Leu 1 5 10 15 Pro Gly Gly Gly Ala Ala Trp Ser Ile Gly Ala Ala Pro Phe Ser Gly 20 25 30 Arg Arg Asn Trp Cys Ser Tyr Val Val Thr Arg Thr Ile Ser Cys His 35 40 45 Val Gln Asn Gly Thr Tyr Leu Gln Arg Val Leu Gln Asn Cys Pro Trp 50 55 60 Pro Met Ser Cys Pro Gly Ser Ser Tyr Arg Thr Val Val Arg Pro Thr 65 70 75 80 Tyr Lys Val Met Tyr Lys Ile Val Thr Ala Arg Glu Trp Arg Cys Cys 85 90 95 Pro Gly His Ser Gly Val Ser Cys Glu Glu Ala Ser Ser Ala Ser Leu 100 105 110 Glu Pro Met Trp Ser Gly Ser Thr Met Arg Arg Met Ala Leu Arg Pro 115 120 125 Thr Ala Phe Ser Gly Cys Leu Asn Cys Ser Lys Val Ser Glu Leu Thr 130 135 140 Glu Arg Leu Lys Val Leu Glu Ala Lys Met Thr Met Leu Thr Val Ile 145 150 155 160 Glu Gln Pro Val Pro Pro Thr Pro Ala Thr Pro Glu Asp Pro Ala Pro 165 170 175 Leu Trp Gly Pro Pro Pro Ala Gln Gly Ser Pro Gly Asp Gly Gly Leu 180 185 190 Gln Asp Gln Val Gly Ala Trp Gly Leu Pro Gly Pro Thr Gly Pro Lys 195 200 205 Gly Asp Ala Gly Ser Arg Gly Pro Met Gly Met Arg Gly Pro Pro Gly 210 215 220 Pro Gln Gly Pro Pro Gly Ser Pro Gly Arg Ala Gly Ala Val Gly Thr 225 230 235 240 Pro Gly Glu Arg Gly Pro Pro Gly Pro Pro Gly Pro Pro Gly Pro Pro 245 250 255 Gly Pro Pro Ala Pro Val Gly Pro Pro His Ala Arg Ile Ser Gln His 260 265 270 Gly Asp Pro Leu Leu Ser Asn Thr Phe Thr Glu Thr Asn Asn His Trp 275 280 285 Pro Gln Gly Pro Thr Gly Pro Pro Gly Pro Pro Gly Pro Met Gly Pro 290 295 300 Pro Gly Pro Pro Gly Pro Thr Gly Val Pro Gly Ser Pro Gly His Ile 305 310 315 320 Gly Pro Pro Gly Pro Thr Gly Pro Lys Gly Ile Ser Gly His Pro Gly 325 330 335 Glu Lys Gly Glu Arg Gly Leu Arg Gly Glu Pro Gly Pro Gln Gly Ser 340 345 350 Ala Gly Gln Arg Gly Glu Pro Gly Pro Lys Gly Asp Pro Gly Glu Lys 355 360 365 Ser His Trp Gly Glu Gly Leu His Gln Leu Arg Glu Ala Leu Lys Ile 370 375 380 Leu Ala Glu Arg Val Leu Ile Leu Glu Thr Met Ile Gly Leu Tyr Glu 385 390 395 400 Pro Glu Leu Gly Ser Gly Ala Gly Pro Ala Gly Thr Gly Thr Pro Ser 405 410 415 Leu Leu Arg Gly Lys Arg Gly Gly His Ala Thr Asn Tyr Arg Ile Val 420 425 430 Ala Pro Arg Ser Arg Asp Glu Arg Gly 435 440 64439PRTHomo sapiens 64Met Gly Gly Pro Arg Ala Trp Ala Leu Leu Cys Leu Gly Leu Leu Leu 1 5 10 15 Pro Gly Gly Gly Ala Ala Trp Ser Ile Gly Ala Ala Pro Phe Ser Gly 20 25 30 Arg Arg Asn Trp Cys Ser Tyr Val Val Thr Arg Thr Ile Ser Cys His 35 40 45 Val Gln Asn Gly Thr Tyr Leu Gln Arg Val Leu Gln Asn Cys Pro Trp 50 55 60 Pro Met Ser Cys Pro Gly Ser Arg Thr Val Val Arg Pro Thr Tyr Lys 65 70 75 80 Val Met Tyr Lys Ile Val Thr Ala Arg Glu Trp Arg Cys Cys Pro Gly 85 90 95 His Ser Gly Val Ser Cys Glu Glu Ala Ser Ser Ala Ser Leu Glu Pro 100 105 110 Met Trp Ser Gly Ser Thr Met Arg Arg Met Ala Leu Arg Pro Thr Ala 115 120 125 Phe Ser Gly Cys Leu Asn Cys Ser Lys Val Ser Glu Leu Thr Glu Arg 130 135 140 Leu Lys Val Leu Glu Ala Lys Met Thr Met Leu Thr Val Ile Glu Gln 145 150 155 160 Pro Val Pro Pro Thr Pro Ala Thr Pro Glu Asp Pro Ala Pro Leu Trp 165 170 175 Gly Pro Pro Pro Ala Gln Gly Ser Pro Gly Asp Gly Gly Leu Gln Asp 180 185 190 Gln Val Gly Ala Trp Gly Leu Pro Gly Pro Thr Gly Pro Lys Gly Asp 195 200 205 Ala Gly Ser Arg Gly Pro Met Gly Met Arg Gly Pro Pro Gly Pro Gln 210 215 220 Gly Pro Pro Gly Ser Pro Gly Arg Ala Gly Ala Val Gly Thr Pro Gly 225 230 235 240 Glu Arg Gly Pro Pro Gly Pro Pro Gly Pro Pro Gly Pro Pro Gly Pro 245 250 255 Pro Ala Pro Val Gly Pro Pro His Ala Arg Ile Ser Gln His Gly Asp 260 265 270 Pro Leu Leu Ser Asn Thr Phe Thr Glu

Thr Asn Asn His Trp Pro Gln 275 280 285 Gly Pro Thr Gly Pro Pro Gly Pro Pro Gly Pro Met Gly Pro Pro Gly 290 295 300 Pro Pro Gly Pro Thr Gly Val Pro Gly Ser Pro Gly His Ile Gly Pro 305 310 315 320 Pro Gly Pro Thr Gly Pro Lys Gly Ile Ser Gly His Pro Gly Glu Lys 325 330 335 Gly Glu Arg Gly Leu Arg Gly Glu Pro Gly Pro Gln Gly Ser Ala Gly 340 345 350 Gln Arg Gly Glu Pro Gly Pro Lys Gly Asp Pro Gly Glu Lys Ser His 355 360 365 Trp Gly Glu Gly Leu His Gln Leu Arg Glu Ala Leu Lys Ile Leu Ala 370 375 380 Glu Arg Val Leu Ile Leu Glu Thr Met Ile Gly Leu Tyr Glu Pro Glu 385 390 395 400 Leu Gly Ser Gly Ala Gly Pro Ala Gly Thr Gly Thr Pro Ser Leu Leu 405 410 415 Arg Gly Lys Arg Gly Gly His Ala Thr Asn Tyr Arg Ile Val Ala Pro 420 425 430 Arg Ser Arg Asp Glu Arg Gly 435 65422PRTHomo sapiens 65Met Gly Gly Pro Arg Ala Trp Ala Leu Leu Cys Leu Gly Leu Leu Leu 1 5 10 15 Pro Gly Gly Gly Ala Ala Trp Ser Ile Gly Ala Ala Pro Phe Ser Gly 20 25 30 Arg Arg Asn Trp Cys Ser Tyr Val Val Thr Arg Thr Ile Ser Cys His 35 40 45 Val Gln Asn Gly Thr Tyr Leu Gln Arg Val Leu Gln Asn Cys Pro Trp 50 55 60 Pro Met Ser Cys Pro Gly Ser Ser Tyr Arg Thr Val Val Arg Pro Thr 65 70 75 80 Tyr Lys Val Met Tyr Lys Ile Val Thr Ala Arg Glu Trp Arg Cys Cys 85 90 95 Pro Gly His Ser Gly Val Ser Cys Glu Glu Val Ala Ala Ser Ser Ala 100 105 110 Ser Leu Glu Pro Met Trp Ser Gly Ser Thr Met Arg Arg Met Ala Leu 115 120 125 Arg Pro Thr Ala Phe Ser Gly Cys Leu Asn Cys Ser Lys Val Ser Glu 130 135 140 Leu Thr Glu Arg Leu Lys Val Leu Glu Ala Lys Met Thr Met Leu Thr 145 150 155 160 Val Ile Glu Gln Pro Val Pro Pro Thr Pro Ala Thr Pro Glu Asp Pro 165 170 175 Ala Pro Leu Trp Gly Pro Pro Pro Ala Gln Gly Ser Pro Gly Asp Gly 180 185 190 Gly Leu Gln Asp Gln Val Gly Ala Trp Gly Leu Pro Gly Pro Thr Gly 195 200 205 Pro Lys Gly Asp Ala Gly Ser Arg Gly Pro Met Gly Met Arg Gly Pro 210 215 220 Pro Gly Pro Gln Gly Pro Pro Gly Ser Pro Gly Arg Ala Gly Ala Val 225 230 235 240 Gly Thr Pro Gly Glu Arg Gly Pro Pro Gly Pro Pro Gly Pro Pro Gly 245 250 255 Pro Pro Gly Pro Pro Ala Pro Val Gly Pro Pro His Ala Arg Ile Ser 260 265 270 Gln His Gly Asp Pro Leu Leu Ser Asn Thr Phe Thr Glu Thr Asn Asn 275 280 285 His Trp Pro Gln Gly Pro Thr Gly Pro Pro Gly Pro Pro Gly Pro Met 290 295 300 Gly Pro Pro Gly Pro Pro Gly Pro Thr Gly Val Pro Gly Ser Pro Gly 305 310 315 320 His Ile Gly Leu Arg Gly Glu Pro Gly Pro Gln Gly Ser Ala Gly Gln 325 330 335 Arg Gly Glu Pro Gly Pro Lys Gly Asp Pro Gly Glu Lys Ser His Trp 340 345 350 Gly Glu Gly Leu His Gln Leu Arg Glu Ala Leu Lys Ile Leu Ala Glu 355 360 365 Arg Val Leu Ile Leu Glu Thr Met Ile Gly Leu Tyr Glu Pro Glu Leu 370 375 380 Gly Ser Gly Ala Gly Pro Ala Gly Thr Gly Thr Pro Ser Leu Leu Arg 385 390 395 400 Gly Lys Arg Gly Gly His Ala Thr Asn Tyr Arg Ile Val Ala Pro Arg 405 410 415 Ser Arg Asp Glu Arg Gly 420 66212PRTHomo sapiens 66Met Gly Gly Pro Arg Ala Trp Ala Leu Leu Cys Leu Gly Leu Leu Leu 1 5 10 15 Pro Gly Gly Gly Ala Ala Trp Ser Ile Gly Ala Ala Pro Phe Ser Gly 20 25 30 Arg Arg Asn Trp Cys Ser Tyr Val Val Thr Arg Thr Ile Ser Cys His 35 40 45 Val Gln Asn Gly Thr Tyr Leu Gln Arg Val Leu Gln Asn Cys Pro Trp 50 55 60 Pro Met Ser Cys Pro Gly Ser Ser Tyr Arg Thr Val Val Arg Pro Thr 65 70 75 80 Tyr Lys Val Met Tyr Lys Ile Val Thr Ala Arg Glu Trp Arg Cys Cys 85 90 95 Pro Gly His Ser Gly Val Ser Cys Glu Glu Val Ala Ala Ser Ser Ala 100 105 110 Ser Leu Glu Pro Met Trp Ser Gly Ser Thr Met Arg Arg Met Ala Leu 115 120 125 Arg Pro Thr Ala Phe Ser Gly Cys Leu Asn Cys Ser Lys Val Ser Glu 130 135 140 Leu Thr Glu Arg Leu Lys Val Leu Glu Ala Lys Met Thr Met Leu Thr 145 150 155 160 Val Ile Glu Gln Pro Val Pro Pro Thr Pro Ala Thr Pro Glu Asp Pro 165 170 175 Ala Pro Leu Trp Gly Pro Pro Pro Ala Gln Gly Ser Pro Gly Asp Gly 180 185 190 Gly Leu Gln Gly Asp Pro Leu Leu Ser Asn Thr Phe Thr Glu Thr Asn 195 200 205 Asn His Trp Pro 210 67175PRTHomo sapiens 67Met Gly Gly Pro Arg Ala Trp Ala Leu Leu Cys Leu Gly Leu Leu Leu 1 5 10 15 Pro Gly Gly Gly Ala Ala Trp Ser Ile Gly Ala Ala Pro Phe Ser Gly 20 25 30 Arg Arg Asn Trp Cys Ser Tyr Val Val Thr Arg Thr Ile Ser Cys His 35 40 45 Val Gln Asn Gly Thr Tyr Leu Gln Arg Val Leu Gln Asn Cys Pro Trp 50 55 60 Pro Met Ser Cys Pro Gly Ser Ser Tyr Arg Thr Val Val Arg Pro Thr 65 70 75 80 Tyr Lys Val Met Tyr Lys Ile Val Thr Ala Arg Glu Trp Arg Cys Cys 85 90 95 Pro Gly His Ser Gly Val Ser Cys Glu Glu Gly Cys Leu Asn Cys Ser 100 105 110 Lys Val Ser Glu Leu Thr Glu Arg Leu Lys Val Leu Glu Ala Lys Met 115 120 125 Thr Met Leu Thr Val Ile Glu Gln Pro Val Pro Pro Thr Pro Ala Thr 130 135 140 Pro Glu Asp Pro Ala Pro Leu Trp Gly Pro Pro Pro Ala Gln Gly Ser 145 150 155 160 Pro Gly Asp Gly Gly Leu Gln Asp Gln Val Gly Ala Trp Gly Leu 165 170 175 68236PRTHomo sapiensmisc_feature(1)..(1)Xaa can be any naturally occurring amino acid 68Xaa Met Thr Met Leu Thr Val Ile Glu Gln Pro Val Pro Pro Thr Pro 1 5 10 15 Ala Thr Pro Glu Asp Pro Ala Pro Leu Trp Gly Pro Pro Pro Ala Gln 20 25 30 Gly Ser Pro Gly Asp Gly Gly Leu Gln Gly Leu Pro Gly Ala Ile Glu 35 40 45 Ser Val Arg Val Pro Leu Leu Pro Arg Asn Asp Gln Val Gly Ala Trp 50 55 60 Gly Leu Pro Gly Pro Thr Gly Pro Lys Gly Asp Ala Gly Ser Arg Gly 65 70 75 80 Pro Met Gly Met Arg Gly Pro Pro Gly Pro Gln Gly Pro Pro Gly Ser 85 90 95 Pro Gly Arg Ala Gly Ala Val Gly Thr Pro Gly Glu Arg Gly Pro Pro 100 105 110 Gly Pro Pro Gly Pro Pro Gly Pro Pro Gly Pro Pro Ala Pro Val Gly 115 120 125 Pro Pro His Ala Arg Ile Ser Gln His Gly Asp Pro Leu Leu Ser Asn 130 135 140 Thr Phe Thr Glu Thr Asn Asn His Trp Pro Gln Gly Pro Thr Gly Pro 145 150 155 160 Pro Gly Pro Pro Gly Pro Met Gly Pro Pro Gly Pro Pro Gly Pro Thr 165 170 175 Gly Val Pro Gly Ser Pro Gly His Ile Gly Pro Pro Gly Pro Thr Gly 180 185 190 Pro Lys Gly Ile Ser Gly His Pro Gly Glu Lys Gly Glu Arg Gly Leu 195 200 205 Arg Gly Glu Pro Gly Pro Gln Gly Ser Ala Gly Gln Arg Gly Glu Pro 210 215 220 Gly Pro Lys Gly Asp Pro Gly Glu Lys Ser His Trp 225 230 235 69305PRTHomo sapiens 69Met Thr Met Leu Thr Val Ile Glu Gln Pro Val Pro Pro Thr Pro Ala 1 5 10 15 Thr Pro Glu Asp Pro Ala Pro Leu Trp Gly Pro Pro Pro Ala Gln Gly 20 25 30 Ser Pro Gly Asp Gly Gly Leu Gln Gly Leu Pro Gly Ala Ile Glu Ser 35 40 45 Val Arg Val Pro Leu Leu Pro Arg Asn Asp Gln Val Gly Ala Trp Gly 50 55 60 Leu Pro Gly Pro Thr Gly Pro Lys Gly Asp Ala Gly Ser Arg Gly Pro 65 70 75 80 Met Gly Met Arg Gly Pro Pro Gly Pro Gln Gly Pro Pro Gly Ser Pro 85 90 95 Gly Arg Ala Gly Ala Val Gly Thr Pro Gly Glu Arg Gly Pro Pro Gly 100 105 110 Pro Pro Gly Pro Pro Gly Pro Pro Gly Pro Pro Ala Pro Val Gly Pro 115 120 125 Pro His Ala Arg Ile Ser Gln His Gly Asp Pro Leu Leu Ser Asn Thr 130 135 140 Phe Thr Glu Thr Asn Asn His Trp Pro Gln Gly Pro Thr Gly Pro Pro 145 150 155 160 Gly Pro Pro Gly Pro Met Gly Pro Pro Gly Pro Pro Gly Pro Thr Gly 165 170 175 Val Pro Gly Ser Pro Gly His Ile Gly Pro Pro Gly Pro Thr Gly Pro 180 185 190 Lys Gly Ile Ser Gly His Pro Gly Glu Lys Gly Glu Arg Gly Leu Arg 195 200 205 Gly Glu Pro Gly Pro Gln Gly Ser Ala Gly Gln Arg Gly Glu Pro Gly 210 215 220 Pro Lys Gly Asp Pro Gly Glu Lys Ser His Trp Gly Glu Gly Leu His 225 230 235 240 Gln Leu Arg Glu Ala Leu Lys Ile Leu Ala Glu Arg Val Leu Ile Leu 245 250 255 Glu Thr Met Ile Gly Leu Tyr Glu Pro Glu Leu Gly Ser Gly Ala Gly 260 265 270 Pro Ala Gly Thr Gly Thr Pro Ser Leu Leu Arg Gly Lys Arg Gly Gly 275 280 285 His Ala Thr Asn Tyr Arg Ile Val Ala Pro Arg Ser Arg Asp Glu Arg 290 295 300 Gly 305 70226PRTHomo sapiens 70Met Lys Ser Ser Leu Met Phe Thr Asp Pro His Ser Leu Gly Thr Tyr 1 5 10 15 Thr Tyr Gln Ala Leu Ser Trp Ala Leu Gly Gly Val Arg His Val Pro 20 25 30 Ala Leu Leu Glu Leu Pro Cys Cys Trp Glu Gln Gly Trp Ala Glu Glu 35 40 45 Lys Gln Gln Cys Leu Pro His Val Thr Arg Val Ser Met Arg Gly Phe 50 55 60 Gly Gly Leu Gly Ala Pro Arg Lys Glu Asp Ser Ala Trp Thr Arg Trp 65 70 75 80 Arg Thr Arg Cys Cys Ala His Pro Pro Val Arg Leu Pro Gly Ser Leu 85 90 95 Gly Leu Trp Thr Pro Gly Pro Ser Leu Met Pro Thr Ala Pro Gly Cys 100 105 110 Leu Val Leu Ser Leu Lys Ala Thr Leu Gly Leu Leu Ala Ser Cys Ile 115 120 125 Pro Thr Asn Pro Cys Asp Ser Ile Ala Gly Pro Gln Gly Pro Pro Gly 130 135 140 Ser Pro Gly Arg Ala Gly Ala Val Gly Thr Pro Gly Glu Arg Gly Pro 145 150 155 160 Pro Gly Pro Pro Gly Pro Pro Gly Pro Pro Gly Pro Pro Ala Pro Val 165 170 175 Gly Pro Pro His Ala Arg Ile Ser Gln His Gly Glu Ser Pro Trp Asp 180 185 190 Pro Ser Arg Trp Arg Trp Gly Trp Ser Ser His Gln His Ser Ala Arg 195 200 205 Tyr His Leu Pro Arg Ala Phe Cys Val Pro Ala Leu Leu Thr Ile Gly 210 215 220 His Met 225 712031DNAHomo sapiens 71gggctccgcg cgtccggggc ggctggcggc gcgggcaggc aggcggggag gacaggctgg 60gggcggcgac cgcgaggggc cgcgcgcgga gggcgcctgg tgcagcatgg gcggcccgcg 120ggcttgggcg ctgctctgcc tcgggctcct gctcccggga ggcggcgctg cgtggagcat 180cggggcagct ccgttctccg gacgcaggaa ctggtgctcc tatgtggtga cccgcaccat 240ctcatgccat gtgcagaatg gcacctacct tcagcgagtg ctgcagaact gcccctggcc 300catgagctgt ccggggagca gaactgtggt gagacccaca tacaaggtga tgtacaagat 360agtgaccgcc ccttcctctg cctccttgga gcccatgtgg tcgggcagta ccatgcggcg 420gatggcgctt cggcccacag ccttctcagg ttgtctcaac tgcagcaaag tgtcagagct 480gacagagcgg ctgaaggtgc tggaggccaa gatgaccatg ctgactgtca tagagcagcc 540agtacctcca acaccagcta cccctgagga ccctgccccg ctctggggtc cccctcctgc 600ccagggcagc cccggagatg gaggcctcca ggaccaagtc ggtgcttggg ggcttcccgg 660gcccaccggc cccaagggag atgccggcag tcggggccca atggggatga gaggcccacc 720aggtccacag ggccccccag ggagccctgg ccgggctgga gctgtgggca cccctggaga 780gaggggacct cctgggccac cagggcctcc tggcccccct gggcccccag cccctgttgg 840gccaccccat gcccggatct cccagcatgg agacccattg ctgtccaaca ccttcactga 900gaccaacaac cactggcccc agggacccac tgggcctcca ggccctccag ggcccatggg 960tccccctggg cctcctggcc ccacaggtgt ccctgggagt cctggtcaca taggaccccc 1020aggccccact ggacccaaag gaatctctgg ccacccagga gagaagggcg agagaggact 1080gcgtggggag cctggccccc aaggctctgc tgggcagcgg ggggaacctg gccctaaggg 1140agaccctggt gagaagagcc actgggggga ggggttgcac cagctacgcg aggctttgaa 1200gattttagct gagagggttt taatcttgga aacaatgatt gggctctatg aaccagagct 1260ggggtctggg gcgggccctg ccggcacagg cacccccagc ctccttcggg gcaagagggg 1320cggacatgca accaactacc ggatcgtggc ccccaggagc cgggacgaga gaggctgagg 1380gtggtggcgg cccctgaggc agaccaggcc aggcttcccc tcctacctgg actcggccag 1440ctgcctccag ggaccgcccg tccatattta ttaatgtcct cagggtccct tctgccatct 1500aggccttagg ggtaagcagg tctcagtcct ggcaccatgc acatgtctga ggctgagcaa 1560gggctgagag gagaggcttg ggcctcagtt tccctctgtg aagtgggggg aggcaggcct 1620tcaaggaggg atagaggtac aaggcttcgt ctcatctgct gtctgagcat ccaggcccaa 1680aggcactgag ggagtcagga gctggggctc ggcacatgca gagatgacag ggcagggggc 1740agtcttcctc cccctccccg accaaacctc ggggagccct cctgtgcccc tccctccttg 1800ttgtccagtg ctgggctccc caccccgagg tcaggctgcc caatcctctg actggatcac 1860cgggggcttc ttgcctcagt tcttccctct gagcccccag gccctcccgc atctcaggtt 1920ggggatgggg acatggagag gaaggggccg cctactcctg caaatgcttg tgacagatgc 1980caggaggtag atgtgtgctg gccaataaag gcccctacct gattccccgc a 203172769DNAHomo sapiens 72cgccctccgg ccgcggagct ggaaaccggg ctccgcgcgt ccggggcggc tggcggcgcg 60ggcaggcagg cggggaggac aggctggggg cggcgaccgc gaggggccgc gcgcggaggg 120cgcctggtgc agcatgggcg gcccgcgggc ttgggcgctg ctctgcctcg ggctcctgct 180cccgggaggc ggcgctgcgt ggagcatcgg ggcagctccg ttctccggac gcaggaactg 240gtgctcctat gtggtgaccc gcaccatctc atgccatgtg cagaatggca cctaccttca 300gcgagtgctg cagaactgcc cctggcccat gagctgtccg gggagcagct acagaactgt 360ggtgagaccc acatacaagg tgatgtacaa gatagtgacc gcccgtgagt ggaggtgctg 420ccctgggcac tcaggagtga gctgcgagga agttgcagct tcctctgcct ccttggagcc 480catgtggtcg ggcagtacca tgcggcggat ggcgcttcgg cccacagcct tctcaggttg 540tctcaactgc agcaaagtgt cagagctgac agagcggctg aaggtgctgg aggccaagat 600gaccatgctg actgtcatag agcagccagt acctccaaca ccagctaccc ctgaggaccc 660tgccccgctc tggggtcccc ctcctgccca gggcagcccc ggagatggag gcctccaggg 720agacccattg ctgtccaaca ccttcactga gaccaacaac cactggccc 76973641DNAHomo sapiens 73gctggaaacc gggctccgcg cgtccggggc ggctggcggc gcgggcaggc aggcggggag 60gacaggctgg gggcggcgac cgcgaggggc cgcgcgcgga gggcgcctgg tgcagcatgg 120gcggcccgcg ggcttgggcg ctgctctgcc tcgggctcct gctcccggga ggcggcgctg 180cgtggagcat cggggcagct ccgttctccg gacgcaggaa ctggtgctcc tatgtggtga 240cccgcaccat ctcatgccat gtgcagaatg gcacctacct tcagcgagtg ctgcagaact 300gcccctggcc catgagctgt ccggggagca gctacagaac tgtggtgaga cccacataca 360aggtgatgta caagatagtg accgcccgtg agtggaggtg ctgccctggg cactcaggag 420tgagctgcga ggaaggttgt ctcaactgca gcaaagtgtc agagctgaca gagcggctga 480aggtgctgga ggccaagatg accatgctga ctgtcataga gcagccagta cctccaacac 540cagctacccc tgaggaccct gccccgctct ggggtccccc tcctgcccag ggcagccccg 600gagatggagg cctccaggac caagtcggtg cttgggggct t

64174482DNAHomo sapiens 74cggcgcgggc aggcaggcgg ggaggacagg ctgggggcgg cgaccgcgag gggccgcgcg 60cggagggcgc ctggtgcagc atgggcggcc cgcgggcttg ggcgctgctc tgcctcgggc 120tcctgctccc gggaggcggc gctgcgtgga gcatcggggc agctccgttc tccggacgca 180gatgaccatg ctgactgtca tagagcagcc agtacctcca acaccagcta cccctgagga 240ccctgccccg ctctggggtc cccctcctgc ccagggcagc cccggagatg gaggcctcca 300ggaccaagtc ggtgcttggg ggcttcccgg gcccaccggc cccaagggag atgccggcag 360tcggggccca atggggatga gaggcccacc aggtccacag ggccccccag ggagccctgg 420ccgggctgga gctgtgggca cccctggaga gaggggacct cctgggccac cagggcctcc 480tg 482752066DNAHomo sapiens 75cgggcaggca ggcggggagg acaggctggg ggcggcgacc gcgaggggcc gcgcgcggag 60ggcgcctggt gcagcatggg cggcccgcgg gcttgggcgc tgctctgcct cgggctcctg 120ctcccgggag gcggcgctgc gtggagcatc ggggcagctc cgttctccgg acgcaggaac 180tggtgctcct atgtggtgac ccgcaccatc tcatgccatg tgcagaatgg cacctacctt 240cagcgagtgc tgcagaactg cccctggccc atgagctgtc cggggagcag ctacagaact 300gtggtgagac ccacatacaa ggtgatgtac aagatagtga ccgcccgtga gtggaggtgc 360tgccctgggc actcaggagt gagctgcgag gaagttgcag cttcctctgc ctccttggag 420cccatgtggt cgggcagtac catgcggcgg atggcgcttc ggcccacagc cttctcaggt 480tgtctcaact gcagcaaagt gtcagagctg acagagcggc tgaaggtgct ggaggccaag 540atgaccatgc tgactgtcat agagcagcca gtacctccaa caccagctac ccctgaggac 600cctgccccgc tctggggtcc ccctcctgcc cagggcagcc ccggagatgg aggcctccag 660gaccaagtcg gtgcttgggg gcttcccggg cccaccggcc ccaagggaga tgccggcagt 720cggggcccaa tggggatgag aggcccacca ggtccacagg gccccccagg gagccctggc 780cgggctggag ctgtgggcac ccctggagag aggggacctc ctgggccacc agggcctcct 840ggcccccctg ggcccccagc ccctgttggg ccaccccatg cccggatctc ccagcatgga 900gacccattgc tgtccaacac cttcactgag accaacaacc actggcccca gggacccact 960gggcctccag gccctccagg gcccatgggt ccccctgggc ctcctggccc cacaggtgtc 1020cctgggagtc ctggtcacat aggaccccca ggccccactg gacccaaagg aatctctggc 1080cacccaggag agaagggcga gagaggactg cgtggggagc ctggccccca aggctctgct 1140gggcagcggg gggaacctgg ccctaaggga gaccctggtg agaagagcca ctgggctcct 1200agcttacaga gcttcctgca gcagcaggct cagctggagc tcctggccag acgggtcacc 1260ctcctggaag ccatcatctg gccagaacca gagctggggt ctggggcggg ccctgccggc 1320acaggcaccc ccagcctcct tcggggcaag aggggcggac atgcaaccaa ctaccggatc 1380gtggccccca ggagccggga cgagagaggc tgagggtggt ggcggcccct gaggcagacc 1440aggccaggct tcccctccta cctggactcg gccagctgcc tccagggacc gcccgtccat 1500atttattaat gtcctcaggg tcccttctgc catctaggcc ttaggggtaa gcaggtctca 1560gtcctggcac catgcacatg tctgaggctg agcaagggct gagaggagag gcttgggcct 1620cagtttccct ctgtgaagtg gggggaggca ggccttcaag gagggataga ggtacaaggc 1680ttcgtctcat ctgctgtctg agcatccagg cccaaaggca ctgagggagt caggagctgg 1740ggctcggcac atgcagagat gacagggcag ggggcagtct tcctccccct ccccgaccaa 1800acctcgggga gccctcctgt gcccctccct ccttgttgtc cagtgctggg ctccccaccc 1860cgaggtcagg ctgcccaatc ctctgactgg atcaccgggg gcttcttgcc tcagttcttc 1920cctctgagcc cccaggccct cccgcatctc aggttgggga tggggacatg gagaggaagg 1980ggccgcctac tcctgcaaat gcttgtgaca gatgccagga ggtagatgtg tgctggccaa 2040taaaggcccc tacctgattc cccgca 2066762060DNAHomo sapiens 76cgggcaggca ggcggggagg acaggctggg ggcggcgacc gcgaggggcc gcgcgcggag 60ggcgcctggt gcagcatggg cggcccgcgg gcttgggcgc tgctctgcct cgggctcctg 120ctcccgggag gcggcgctgc gtggagcatc ggggcagctc cgttctccgg acgcaggaac 180tggtgctcct atgtggtgac ccgcaccatc tcatgccatg tgcagaatgg cacctacctt 240cagcgagtgc tgcagaactg cccctggccc atgagctgtc cggggagcag ctacagaact 300gtggtgagac ccacatacaa ggtgatgtac aagatagtga ccgcccgtga gtggaggtgc 360tgccctgggc actcaggagt gagctgcgag gaagttgcag cttcctctgc ctccttggag 420cccatgtggt cgggcagtac catgcggcgg atggcgcttc ggcccacagc cttctcaggt 480tgtctcaact gcagcaaagt gtcagagctg acagagcggc tgaaggtgct ggaggccaag 540atgaccatgc tgactgtcat agagcagcca gtacctccaa caccagctac ccctgaggac 600cctgccccgc tctggggtcc ccctcctgcc cagggcagcc ccggagatgg aggcctccag 660gaccaagtcg gtgcttgggg gcttcccggg cccaccggcc ccaagggaga tgccggcagt 720cggggcccaa tggggatgag aggcccacca ggtccacagg gccccccagg gagccctggc 780cgggctggag ctgtgggcac ccctggagag aggggacctc ctgggccacc agggcctcct 840ggcccccctg ggcccccagc ccctgttggg ccaccccatg cccggatctc ccagcatgga 900gacccattgc tgtccaacac cttcactgag accaacaacc actggcccca gggacccact 960gggcctccag gccctccagg gcccatgggt ccccctgggc ctcctggccc cacaggtgtc 1020cctgggagtc ctggtcacat aggaccccca ggccccactg gacccaaagg aatctctggc 1080cacccaggag agaagggcga gagaggactg cgtggggagc ctggccccca aggctctgct 1140gggcagcggg gggaacctgg ccctaaggga gaccctggtg agaagagcca ctggggggag 1200gggttgcacc agctacgcga ggctttgaag attttagctg agagggtttt aatcttggaa 1260acaatgattg ggctctatga accagagctg gggtctgggg cgggccctgc cggcacaggc 1320acccccagcc tccttcgggg caagaggggc ggacatgcaa ccaactaccg gatcgtggcc 1380cccaggagcc gggacgagag aggctgaggg tggtggcggc ccctgaggca gaccaggcca 1440ggcttcccct cctacctgga ctcggccagc tgcctccagg gaccgcccgt ccatatttat 1500taatgtcctc agggtccctt ctgccatcta ggccttaggg gtaagcaggt ctcagtcctg 1560gcaccatgca catgtctgag gctgagcaag ggctgagagg agaggcttgg gcctcagttt 1620ccctctgtga agtgggggga ggcaggcctt caaggaggga tagaggtaca aggcttcgtc 1680tcatctgctg tctgagcatc caggcccaaa ggcactgagg gagtcaggag ctggggctcg 1740gcacatgcag agatgacagg gcagggggca gtcttcctcc ccctccccga ccaaacctcg 1800gggagccctc ctgtgcccct ccctccttgt tgtccagtgc tgggctcccc accccgaggt 1860caggctgccc aatcctctga ctggatcacc gggggcttct tgcctcagtt cttccctctg 1920agcccccagg ccctcccgca tctcaggttg gggatgggga catggagagg aaggggccgc 1980ctactcctgc aaatgcttgt gacagatgcc aggaggtaga tgtgtgctgg ccaataaagg 2040cccctacctg attccccgca 2060772054DNAHomo sapiens 77cgggcaggca ggcggggagg acaggctggg ggcggcgacc gcgaggggcc gcgcgcggag 60ggcgcctggt gcagcatggg cggcccgcgg gcttgggcgc tgctctgcct cgggctcctg 120ctcccgggag gcggcgctgc gtggagcatc ggggcagctc cgttctccgg acgcaggaac 180tggtgctcct atgtggtgac ccgcaccatc tcatgccatg tgcagaatgg cacctacctt 240cagcgagtgc tgcagaactg cccctggccc atgagctgtc cggggagcag ctacagaact 300gtggtgagac ccacatacaa ggtgatgtac aagatagtga ccgcccgtga gtggaggtgc 360tgccctgggc actcaggagt gagctgcgag gaagcttcct ctgcctcctt ggagcccatg 420tggtcgggca gtaccatgcg gcggatggcg cttcggccca cagccttctc aggttgtctc 480aactgcagca aagtgtcaga gctgacagag cggctgaagg tgctggaggc caagatgacc 540atgctgactg tcatagagca gccagtacct ccaacaccag ctacccctga ggaccctgcc 600ccgctctggg gtccccctcc tgcccagggc agccccggag atggaggcct ccaggaccaa 660gtcggtgctt gggggcttcc cgggcccacc ggccccaagg gagatgccgg cagtcggggc 720ccaatgggga tgagaggccc accaggtcca cagggccccc cagggagccc tggccgggct 780ggagctgtgg gcacccctgg agagagggga cctcctgggc caccagggcc tcctggcccc 840cctgggcccc cagcccctgt tgggccaccc catgcccgga tctcccagca tggagaccca 900ttgctgtcca acaccttcac tgagaccaac aaccactggc cccagggacc cactgggcct 960ccaggccctc cagggcccat gggtccccct gggcctcctg gccccacagg tgtccctggg 1020agtcctggtc acataggacc cccaggcccc actggaccca aaggaatctc tggccaccca 1080ggagagaagg gcgagagagg actgcgtggg gagcctggcc cccaaggctc tgctgggcag 1140cggggggaac ctggccctaa gggagaccct ggtgagaaga gccactgggg ggaggggttg 1200caccagctac gcgaggcttt gaagatttta gctgagaggg ttttaatctt ggaaacaatg 1260attgggctct atgaaccaga gctggggtct ggggcgggcc ctgccggcac aggcaccccc 1320agcctccttc ggggcaagag gggcggacat gcaaccaact accggatcgt ggcccccagg 1380agccgggacg agagaggctg agggtggtgg cggcccctga ggcagaccag gccaggcttc 1440ccctcctacc tggactcggc cagctgcctc cagggaccgc ccgtccatat ttattaatgt 1500cctcagggtc ccttctgcca tctaggcctt aggggtaagc aggtctcagt cctggcacca 1560tgcacatgtc tgaggctgag caagggctga gaggagaggc ttgggcctca gtttccctct 1620gtgaagtggg gggaggcagg ccttcaagga gggatagagg tacaaggctt cgtctcatct 1680gctgtctgag catccaggcc caaaggcact gagggagtca ggagctgggg ctcggcacat 1740gcagagatga cagggcaggg ggcagtcttc ctccccctcc ccgaccaaac ctcggggagc 1800cctcctgtgc ccctccctcc ttgttgtcca gtgctgggct ccccaccccg aggtcaggct 1860gcccaatcct ctgactggat caccgggggc ttcttgcctc agttcttccc tctgagcccc 1920caggccctcc cgcatctcag gttggggatg gggacatgga gaggaagggg ccgcctactc 1980ctgcaaatgc ttgtgacaga tgccaggagg tagatgtgtg ctggccaata aaggccccta 2040cctgattccc cgca 2054782048DNAHomo sapiens 78cgggcaggca ggcggggagg acaggctggg ggcggcgacc gcgaggggcc gcgcgcggag 60ggcgcctggt gcagcatggg cggcccgcgg gcttgggcgc tgctctgcct cgggctcctg 120ctcccgggag gcggcgctgc gtggagcatc ggggcagctc cgttctccgg acgcaggaac 180tggtgctcct atgtggtgac ccgcaccatc tcatgccatg tgcagaatgg cacctacctt 240cagcgagtgc tgcagaactg cccctggccc atgagctgtc cggggagcag aactgtggtg 300agacccacat acaaggtgat gtacaagata gtgaccgccc gtgagtggag gtgctgccct 360gggcactcag gagtgagctg cgaggaagct tcctctgcct ccttggagcc catgtggtcg 420ggcagtacca tgcggcggat ggcgcttcgg cccacagcct tctcaggttg tctcaactgc 480agcaaagtgt cagagctgac agagcggctg aaggtgctgg aggccaagat gaccatgctg 540actgtcatag agcagccagt acctccaaca ccagctaccc ctgaggaccc tgccccgctc 600tggggtcccc ctcctgccca gggcagcccc ggagatggag gcctccagga ccaagtcggt 660gcttgggggc ttcccgggcc caccggcccc aagggagatg ccggcagtcg gggcccaatg 720gggatgagag gcccaccagg tccacagggc cccccaggga gccctggccg ggctggagct 780gtgggcaccc ctggagagag gggacctcct gggccaccag ggcctcctgg cccccctggg 840cccccagccc ctgttgggcc accccatgcc cggatctccc agcatggaga cccattgctg 900tccaacacct tcactgagac caacaaccac tggccccagg gacccactgg gcctccaggc 960cctccagggc ccatgggtcc ccctgggcct cctggcccca caggtgtccc tgggagtcct 1020ggtcacatag gacccccagg ccccactgga cccaaaggaa tctctggcca cccaggagag 1080aagggcgaga gaggactgcg tggggagcct ggcccccaag gctctgctgg gcagcggggg 1140gaacctggcc ctaagggaga ccctggtgag aagagccact ggggggaggg gttgcaccag 1200ctacgcgagg ctttgaagat tttagctgag agggttttaa tcttggaaac aatgattggg 1260ctctatgaac cagagctggg gtctggggcg ggccctgccg gcacaggcac ccccagcctc 1320cttcggggca agaggggcgg acatgcaacc aactaccgga tcgtggcccc caggagccgg 1380gacgagagag gctgagggtg gtggcggccc ctgaggcaga ccaggccagg cttcccctcc 1440tacctggact cggccagctg cctccaggga ccgcccgtcc atatttatta atgtcctcag 1500ggtcccttct gccatctagg ccttaggggt aagcaggtct cagtcctggc accatgcaca 1560tgtctgaggc tgagcaaggg ctgagaggag aggcttgggc ctcagtttcc ctctgtgaag 1620tggggggagg caggccttca aggagggata gaggtacaag gcttcgtctc atctgctgtc 1680tgagcatcca ggcccaaagg cactgaggga gtcaggagct ggggctcggc acatgcagag 1740atgacagggc agggggcagt cttcctcccc ctccccgacc aaacctcggg gagccctcct 1800gtgcccctcc ctccttgttg tccagtgctg ggctccccac cccgaggtca ggctgcccaa 1860tcctctgact ggatcaccgg gggcttcttg cctcagttct tccctctgag cccccaggcc 1920ctcccgcatc tcaggttggg gatggggaca tggagaggaa ggggccgcct actcctgcaa 1980atgcttgtga cagatgccag gaggtagatg tgtgctggcc aataaaggcc cctacctgat 2040tccccgca 2048791997DNAHomo sapiens 79cgggcaggca ggcggggagg acaggctggg ggcggcgacc gcgaggggcc gcgcgcggag 60ggcgcctggt gcagcatggg cggcccgcgg gcttgggcgc tgctctgcct cgggctcctg 120ctcccgggag gcggcgctgc gtggagcatc ggggcagctc cgttctccgg acgcaggaac 180tggtgctcct atgtggtgac ccgcaccatc tcatgccatg tgcagaatgg cacctacctt 240cagcgagtgc tgcagaactg cccctggccc atgagctgtc cggggagcag ctacagaact 300gtggtgagac ccacatacaa ggtgatgtac aagatagtga ccgcccgtga gtggaggtgc 360tgccctgggc actcaggagt gagctgcgag gaagttgcag cttcctctgc ctccttggag 420cccatgtggt cgggcagtac catgcggcgg atggcgcttc ggcccacagc cttctcaggt 480tgtctcaact gcagcaaagt gtcagagctg acagagcggc tgaaggtgct ggaggccaag 540atgaccatgc tgactgtcat agagcagcca gtacctccaa caccagctac ccctgaggac 600cctgccccgc tctggggtcc ccctcctgcc cagggcagcc ccggagatgg aggcctccag 660gaccaagtcg gtgcttgggg gcttcccggg cccaccggcc ccaagggaga tgccggcagt 720cggggcccaa tggggatgag aggcccacca ggtccacagg gccccccagg gagccctggc 780cgggctggag ctgtgggcac ccctggagag aggggacctc ctgggccacc agggcctcct 840ggcccccctg ggcccccagc ccctgttggg ccaccccatg cccggatctc ccagcatgga 900gacccattgc tgtccaacac cttcactgag accaacaacc actggcccca gggacccact 960gggcctccag gccctccagg gcccatgggt ccccctgggc ctcctggccc cacaggtgtc 1020cctgggagtc ctggtcacat aggactgcgt ggggagcctg gcccccaagg ctctgctggg 1080cagcgggggg aacctggccc taagggagac cctggtgaga agagccactg gggggagggg 1140ttgcaccagc tacgcgaggc tttgaagatt ttagctgaga gggttttaat cttggaaaca 1200atgattgggc tctatgaacc agagctgggg tctggggcgg gccctgccgg cacaggcacc 1260cccagcctcc ttcggggcaa gaggggcgga catgcaacca actaccggat cgtggccccc 1320aggagccggg acgagagagg ctgagggtgg tggcggcccc tgaggcagac caggccaggc 1380ttcccctcct acctggactc ggccagctgc ctccagggac cgcccgtcca tatttattaa 1440tgtcctcagg gtcccttctg ccatctaggc cttaggggta agcaggtctc agtcctggca 1500ccatgcacat gtctgaggct gagcaagggc tgagaggaga ggcttgggcc tcagtttccc 1560tctgtgaagt ggggggaggc aggccttcaa ggagggatag aggtacaagg cttcgtctca 1620tctgctgtct gagcatccag gcccaaaggc actgagggag tcaggagctg gggctcggca 1680catgcagaga tgacagggca gggggcagtc ttcctccccc tccccgacca aacctcgggg 1740agccctcctg tgcccctccc tccttgttgt ccagtgctgg gctccccacc ccgaggtcag 1800gctgcccaat cctctgactg gatcaccggg ggcttcttgc ctcagttctt ccctctgagc 1860ccccaggccc tcccgcatct caggttgggg atggggacat ggagaggaag gggccgccta 1920ctcctgcaaa tgcttgtgac agatgccagg aggtagatgt gtgctggcca ataaaggccc 1980ctacctgatt ccccgca 1997801823DNAHomo sapiens 80cgggcaggca ggcggggagg acaggctggg ggcggcgacc gcgaggggcc gcgcgcggag 60ggcgcctggt gcagcatggg cggcccgcgg gcttgggcgc tgctctgcct cgggctcctg 120ctcccgggag gcggcgctgc gtggagcatc ggggcagctc cgttctccgg acgcaggaac 180tggtgctcct atgtggtgac ccgcaccatc tcatgccatg tgcagaatgg cacctacctt 240cagcgagtgc tgcagaactg cccctggccc atgagctgtc cggggagcag ctacagaact 300gtggtgagac ccacatacaa ggtgatgtac aagatagtga ccgcccgtga gtggaggtgc 360tgccctgggc actcaggagt gagctgcgag gaagttgcag cttcctctgc ctccttggag 420cccatgtggt cgggcagtac catgcggcgg atggcgcttc ggcccacagc cttctcaggt 480tgtctcaact gcagcaaagt gtcagagctg acagagcggc tgaaggtgct ggaggccaag 540atgaccatgc tgactgtcat agagcagcca gtacctccaa caccagctac ccctgaggac 600cctgccccgc tctggggtcc ccctcctgcc cagggcagcc ccggagatgg aggcctccag 660ggagacccat tgctgtccaa caccttcact gagaccaaca accactggcc ccagggaccc 720actgggcctc caggccctcc agggcccatg ggtccccctg ggcctcctgg ccccacaggt 780gtccctggga gtcctggtca cataggaccc ccaggcccca ctggacccaa aggaatctct 840ggccacccag gagagaaggg cgagagagga ctgcgtgggg agcctggccc ccaaggctct 900gctgggcagc ggggggaacc tggccctaag ggagaccctg gtgagaagag ccactggggg 960gaggggttgc accagctacg cgaggctttg aagattttag ctgagagggt tttaatcttg 1020gaaacaatga ttgggctcta tgaaccagag ctggggtctg gggcgggccc tgccggcaca 1080ggcaccccca gcctccttcg gggcaagagg ggcggacatg caaccaacta ccggatcgtg 1140gcccccagga gccgggacga gagaggctga gggtggtggc ggcccctgag gcagaccagg 1200ccaggcttcc cctcctacct ggactcggcc agctgcctcc agggaccgcc cgtccatatt 1260tattaatgtc ctcagggtcc cttctgccat ctaggcctta ggggtaagca ggtctcagtc 1320ctggcaccat gcacatgtct gaggctgagc aagggctgag aggagaggct tgggcctcag 1380tttccctctg tgaagtgggg ggaggcaggc cttcaaggag ggatagaggt acaaggcttc 1440gtctcatctg ctgtctgagc atccaggccc aaaggcactg agggagtcag gagctggggc 1500tcggcacatg cagagatgac agggcagggg gcagtcttcc tccccctccc cgaccaaacc 1560tcggggagcc ctcctgtgcc cctccctcct tgttgtccag tgctgggctc cccaccccga 1620ggtcaggctg cccaatcctc tgactggatc accgggggct tcttgcctca gttcttccct 1680ctgagccccc aggccctccc gcatctcagg ttggggatgg ggacatggag aggaaggggc 1740cgcctactcc tgcaaatgct tgtgacagat gccaggaggt agatgtgtgc tggccaataa 1800aggcccctac ctgattcccc gca 1823811976DNAHomo sapiens 81cgggcaggca ggcggggagg acaggctggg ggcggcgacc gcgaggggcc gcgcgcggag 60ggcgcctggt gcagcatggg cggcccgcgg gcttgggcgc tgctctgcct cgggctcctg 120ctcccgggag gcggcgctgc gtggagcatc ggggcagctc cgttctccgg acgcaggaac 180tggtgctcct atgtggtgac ccgcaccatc tcatgccatg tgcagaatgg cacctacctt 240cagcgagtgc tgcagaactg cccctggccc atgagctgtc cggggagcag ctacagaact 300gtggtgagac ccacatacaa ggtgatgtac aagatagtga ccgcccgtga gtggaggtgc 360tgccctgggc actcaggagt gagctgcgag gaaggttgtc tcaactgcag caaagtgtca 420gagctgacag agcggctgaa ggtgctggag gccaagatga ccatgctgac tgtcatagag 480cagccagtac ctccaacacc agctacccct gaggaccctg ccccgctctg gggtccccct 540cctgcccagg gcagccccgg agatggaggc ctccaggacc aagtcggtgc ttgggggctt 600cccgggccca ccggccccaa gggagatgcc ggcagtcggg gcccaatggg gatgagaggc 660ccaccaggtc cacagggccc cccagggagc cctggccggg ctggagctgt gggcacccct 720ggagagaggg gacctcctgg gccaccaggg cctcctggcc cccctgggcc cccagcccct 780gttgggccac cccatgcccg gatctcccag catggagacc cattgctgtc caacaccttc 840actgagacca acaaccactg gccccaggga cccactgggc ctccaggccc tccagggccc 900atgggtcccc ctgggcctcc tggccccaca ggtgtccctg ggagtcctgg tcacatagga 960cccccaggcc ccactggacc caaaggaatc tctggccacc caggagagaa gggcgagaga 1020ggactgcgtg gggagcctgg cccccaaggc tctgctgggc agcgggggga acctggccct 1080aagggagacc ctggtgagaa gagccactgg ggggaggggt tgcaccagct acgcgaggct 1140ttgaagattt tagctgagag ggttttaatc ttggaaacaa tgattgggct ctatgaacca 1200gagctggggt ctggggcggg ccctgccggc acaggcaccc ccagcctcct tcggggcaag 1260aggggcggac atgcaaccaa ctaccggatc gtggccccca ggagccggga cgagagaggc 1320tgagggtggt ggcggcccct gaggcagacc aggccaggct tcccctccta cctggactcg 1380gccagctgcc tccagggacc gcccgtccat atttattaat gtcctcaggg tcccttctgc 1440catctaggcc ttaggggtaa gcaggtctca gtcctggcac catgcacatg tctgaggctg 1500agcaagggct gagaggagag gcttgggcct cagtttccct ctgtgaagtg gggggaggca 1560ggccttcaag gagggataga ggtacaaggc ttcgtctcat ctgctgtctg agcatccagg 1620cccaaaggca ctgagggagt caggagctgg ggctcggcac atgcagagat gacagggcag 1680ggggcagtct tcctccccct ccccgaccaa acctcgggga gccctcctgt gcccctccct 1740ccttgttgtc cagtgctggg ctccccaccc cgaggtcagg ctgcccaatc ctctgactgg 1800atcaccgggg gcttcttgcc tcagttcttc cctctgagcc cccaggccct cccgcatctc 1860aggttgggga tggggacatg gagaggaagg ggccgcctac tcctgcaaat gcttgtgaca 1920gatgccagga ggtagatgtg tgctggccaa taaaggcccc tacctgattc cccgca 1976822111DNAHomo sapiens 82cgggcaggca ggcggggagg acaggctggg ggcggcgacc gcgaggggcc gcgcgcggag

60ggcgcctggt gcagcatggg cggcccgcgg gcttgggcgc tgctctgcct cgggctcctg 120ctcccgggag gcggcgctgc gtggagcatc ggggcagctc cgttctccgg acgcaggaac 180tggtgctcct atgtggtgac ccgcaccatc tcatgccatg tgcagaatgg cacctacctt 240cagcgagtgc tgcagaactg cccctggccc atgagctgtc cggggagcag ctacagaact 300gtggtgagac ccacatacaa ggtgatgtac aagatagtga ccgcccgtga gtggaggtgc 360tgccctgggc actcaggagt gagctgcgag gaagttgcag cttcctctgc ctccttggag 420cccatgtggt cgggcagtac catgcggcgg atggcgcttc ggcccacagc cttctcaggt 480tgtctcaact gcagcaaagt gtcagagctg acagagcggc tgaaggtgct ggaggccaag 540atgaccatgc tgactgtcat agagcagcca gtacctccaa caccagctac ccctgaggac 600cctgccccgc tctggggtcc ccctcctgcc cagggcagcc ccggagatgg aggcctccag 660gggctgccag gagccataga gagtgtgagg gtcccgctgc ttccccgaaa tgaccaagtc 720ggtgcttggg ggcttcccgg gcccaccggc cccaagggag atgccggcag tcggggccca 780atggggatga gaggcccacc aggtccacag ggccccccag ggagccctgg ccgggctgga 840gctgtgggca cccctggaga gaggggacct cctgggccac cagggcctcc tggcccccct 900gggcccccag cccctgttgg gccaccccat gcccggatct cccagcatgg agacccattg 960ctgtccaaca ccttcactga gaccaacaac cactggcccc agggacccac tgggcctcca 1020ggccctccag ggcccatggg tccccctggg cctcctggcc ccacaggtgt ccctgggagt 1080cctggtcaca taggaccccc aggccccact ggacccaaag gaatctctgg ccacccagga 1140gagaagggcg agagaggact gcgtggggag cctggccccc aaggctctgc tgggcagcgg 1200ggggaacctg gccctaaggg agaccctggt gagaagagcc actgggggga ggggttgcac 1260cagctacgcg aggctttgaa gattttagct gagagggttt taatcttgga aacaatgatt 1320gggctctatg aaccagagct ggggtctggg gcgggccctg ccggcacagg cacccccagc 1380ctccttcggg gcaagagggg cggacatgca accaactacc ggatcgtggc ccccaggagc 1440cgggacgaga gaggctgagg gtggtggcgg cccctgaggc agaccaggcc aggcttcccc 1500tcctacctgg actcggccag ctgcctccag ggaccgcccg tccatattta ttaatgtcct 1560cagggtccct tctgccatct aggccttagg ggtaagcagg tctcagtcct ggcaccatgc 1620acatgtctga ggctgagcaa gggctgagag gagaggcttg ggcctcagtt tccctctgtg 1680aagtgggggg aggcaggcct tcaaggaggg atagaggtac aaggcttcgt ctcatctgct 1740gtctgagcat ccaggcccaa aggcactgag ggagtcagga gctggggctc ggcacatgca 1800gagatgacag ggcagggggc agtcttcctc cccctccccg accaaacctc ggggagccct 1860cctgtgcccc tccctccttg ttgtccagtg ctgggctccc caccccgagg tcaggctgcc 1920caatcctctg actggatcac cgggggcttc ttgcctcagt tcttccctct gagcccccag 1980gccctcccgc atctcaggtt ggggatgggg acatggagag gaaggggccg cctactcctg 2040caaatgcttg tgacagatgc caggaggtag atgtgtgctg gccaataaag gcccctacct 2100gattccccgc a 211183707DNAHomo sapiens 83agatgaccat gctgactgtc atagagcagc cagtacctcc aacaccagct acccctgagg 60accctgcccc gctctggggt ccccctcctg cccagggcag ccccggagat ggaggcctcc 120aggggctgcc aggagccata gagagtgtga gggtcccgct gcttccccga aatgaccaag 180tcggtgcttg ggggcttccc gggcccaccg gccccaaggg agatgccggc agtcggggcc 240caatggggat gagaggccca ccaggtccac agggcccccc agggagccct ggccgggctg 300gagctgtggg cacccctgga gagaggggac ctcctgggcc accagggcct cctggccccc 360ctgggccccc agcccctgtt gggccacccc atgcccggat ctcccagcat ggagacccat 420tgctgtccaa caccttcact gagaccaaca accactggcc ccagggaccc actgggcctc 480caggccctcc agggcccatg ggtccccctg ggcctcctgg ccccacaggt gtccctggga 540gtcctggtca cataggaccc ccaggcccca ctggacccaa aggaatctct ggccacccag 600gagagaaggg cgagagagga ctgcgtgggg agcctggccc ccaaggctct gctgggcagc 660ggggggaacc tggccctaag ggagaccctg gtgagaagag ccactgg 707842034DNAHomo sapiens 84ggtgagtgcc cgcaatgctg ccccacagct cctctggcca tcccctccac caggtgggcc 60cttccctgct cctgacatgg ccaggatgac ctgggccctt tcatctactt gcctcttcac 120tcagcacccc accacggagt gccctgccca cgcctgggct ccatgaagtc ctctcttatg 180ttcactgacc cacattccct gggcacctac acttatcagg ctctgagctg ggcactgggt 240ggggtcagac atgtccctgc ccttctggag cttccatgct gctgggagca gggctgggca 300gaggagaagc agcaatgctt gccccatgtg accagggttt ctatgagggg ttttgggggt 360ttgggagccc caaggaagga agactcagcc tggacgaggt ggagaactag gtgctgtgct 420catccccctg ttagactacc aggcagccta ggcctgtgga ctccggggcc ctctctcatg 480cccactgctc caggctgcct tgtcctgtcg ctcaaggcca ccctgggcct ccttgcctcc 540tgtataccca caaatccgtg tgattccatt gcaggtccac agggcccccc agggagccct 600ggccgggctg gagctgtggg cacccctgga gagaggggac ctcctgggcc accagggcct 660cctggccccc ctgggccccc agcccctgtt gggccacccc atgcccggat ctcccagcat 720ggtgagtccc cctgggatcc cagcaggtgg aggtgggggt ggagtagcca tcagcacagt 780gcccgctacc atctgccacg tgccttctgt gtgccagccc tgctcacgat aggccacatg 840tgacccagtc ctccagcagg cgccgttgtc ctcctgtggt tacaggtgag gaacactgag 900gaccagagag ggaaggtggc ttgccagggt cccacagcct gggcgtaggg gaacggcttc 960aaacccaggc tgcctccaga acctgtgctt agagccaccg ggcatcaggc cctcccaagc 1020cttggaactg gctggaatcc agttctcgga acactgggac gcaaaagacc cggcggcagg 1080aagtgagtcc tgaactccca aggccacagg cccggcccct cctccaggcc ctgacgtgcg 1140tccttggctt cttccctttg gcagcccagc ctgacctgcc catgggctgc caggggtcag 1200agtgtggagc gccaggtttc agcctcttct ccactgtgtt tttggtgcac aacccagcac 1260accattcatt cattctgcca tcccagcatt cattccatct cactatccat acgatgggga 1320caatgacagt gccagcctcc cagagctgcg taacatccat gtacagaagc ctggcacaca 1380gtaggtggtg gataaatggt atcttttatt gtcattccca tttgacaggt gacagtacag 1440gctctgaaaa gtagaaagtg ttgctggatg tcaccagctg gattgcagtg gggttagaac 1500ccacatctcc ctgcctcctg gtcttgcggg accaacactc tccacactcc tcaccctgga 1560gcaggtgccc aggtggtacc agccatgctg caggctgccc catagggcag tccaagctgt 1620cttggcagag gtggcaggtg aagactaacc accccactct acccagctct actcactcat 1680catctttgct cacccaggag acccattgct gtccaacacc ttcactgaga ccaacaacca 1740ctggccccag ggacccactg ggcctccagg ccctccaggg cccatgggtc cccctgggcc 1800tcctggcccc acaggtgtcc ctgggagtcc tggtcacata gtgagtagtt ctccttgtac 1860tctcacccat gtgtctgtcc atctttccat ctatgcatac atccatacat ctgtccatca 1920tccacccttg tatccatcta tccatccatc cattcatcct tccattcatt cattcaacaa 1980gtatttattg agcacttaat atgcaaacta ccttccataa atcttattca atcc 203485409PRTHomo sapiens 85Met Glu Glu Ser Trp Glu Ala Ala Pro Gly Gly Gln Ala Gly Ala Glu 1 5 10 15 Leu Pro Met Glu Pro Val Gly Ser Leu Val Pro Thr Leu Glu Gln Pro 20 25 30 Gln Val Pro Ala Lys Val Arg Gln Pro Glu Gly Pro Glu Ser Ser Pro 35 40 45 Ser Pro Ala Gly Ala Val Glu Lys Ala Ala Gly Ala Gly Leu Glu Pro 50 55 60 Ser Ser Lys Lys Lys Pro Pro Ser Pro Arg Pro Gly Ser Pro Arg Val 65 70 75 80 Pro Pro Leu Ser Leu Gly Tyr Gly Val Cys Pro Glu Pro Pro Ser Pro 85 90 95 Gly Pro Ala Leu Val Lys Leu Pro Arg Asn Gly Glu Ala Pro Gly Ala 100 105 110 Glu Pro Ala Pro Ser Ala Trp Ala Pro Met Glu Leu Gln Val Asp Val 115 120 125 Arg Val Lys Pro Val Gly Ala Ala Gly Gly Ser Ser Thr Pro Ser Pro 130 135 140 Arg Pro Ser Thr Arg Phe Leu Lys Val Pro Val Pro Glu Ser Pro Ala 145 150 155 160 Phe Ser Arg His Ala Asp Pro Ala His Gln Leu Leu Leu Arg Ala Pro 165 170 175 Ser Gln Gly Gly Thr Trp Gly Arg Arg Ser Pro Leu Ala Ala Ala Arg 180 185 190 Thr Glu Ser Gly Cys Asp Ala Glu Gly Arg Ala Ser Pro Ala Glu Gly 195 200 205 Ser Ala Gly Ser Pro Gly Ser Pro Thr Cys Cys Arg Cys Lys Glu Leu 210 215 220 Gly Leu Glu Lys Glu Asp Ala Ala Leu Leu Pro Arg Ala Gly Leu Asp 225 230 235 240 Gly Asp Glu Lys Leu Pro Arg Ala Val Thr Leu Thr Gly Leu Pro Met 245 250 255 Tyr Val Lys Ser Leu Tyr Trp Ala Leu Ala Phe Met Ala Val Leu Leu 260 265 270 Ala Val Ser Gly Val Val Ile Val Val Leu Ala Ser Arg Ala Gly Ala 275 280 285 Arg Cys Gln Gln Cys Pro Pro Gly Trp Val Leu Ser Glu Glu His Cys 290 295 300 Tyr Tyr Phe Ser Ala Glu Ala Gln Ala Trp Glu Ala Ser Gln Ala Phe 305 310 315 320 Cys Ser Ala Tyr His Ala Thr Leu Pro Leu Leu Ser His Thr Gln Asp 325 330 335 Phe Leu Gly Arg Tyr Pro Val Ser Arg His Ser Trp Val Gly Ala Trp 340 345 350 Arg Gly Pro Gln Gly Trp His Trp Ile Asp Glu Ala Pro Leu Pro Pro 355 360 365 Gln Leu Leu Pro Glu Asp Gly Glu Asp Asn Leu Asp Ile Asn Cys Gly 370 375 380 Ala Leu Glu Glu Gly Thr Leu Val Ala Ala Asn Cys Ser Thr Pro Arg 385 390 395 400 Pro Trp Val Cys Ala Lys Gly Thr Gln 405 86314PRTHomo sapiens 86Met Glu Glu Ser Trp Glu Ala Ala Pro Gly Gly Gln Ala Gly Ala Glu 1 5 10 15 Leu Pro Met Glu Pro Val Gly Ser Leu Val Pro Thr Leu Glu Gln Pro 20 25 30 Gln Val Pro Ala Lys Val Arg Gln Pro Glu Gly Pro Glu Ser Ser Pro 35 40 45 Ser Pro Ala Gly Ala Val Glu Lys Ala Ala Gly Ala Gly Leu Glu Pro 50 55 60 Ser Ser Lys Lys Lys Pro Pro Ser Pro Arg Pro Gly Ser Pro Arg Val 65 70 75 80 Pro Pro Leu Ser Leu Gly Tyr Gly Val Cys Pro Glu Pro Pro Ser Pro 85 90 95 Gly Pro Ala Leu Val Lys Leu Pro Arg Asn Gly Glu Ala Pro Gly Ala 100 105 110 Glu Pro Ala Pro Ser Ala Trp Ala Pro Met Glu Leu Gln Val Asp Val 115 120 125 Arg Val Lys Pro Val Gly Ala Ala Gly Gly Ser Ser Thr Pro Ser Pro 130 135 140 Arg Pro Ser Thr Arg Phe Leu Lys Val Pro Val Pro Glu Ser Pro Ala 145 150 155 160 Phe Ser Arg His Ala Asp Pro Ala His Gln Leu Leu Leu Arg Ala Pro 165 170 175 Ser Gln Gly Gly Thr Trp Gly Arg Arg Ser Pro Leu Ala Ala Ala Arg 180 185 190 Thr Glu Ser Gly Cys Asp Ala Glu Gly Arg Ala Ser Pro Ala Glu Gly 195 200 205 Ser Ala Gly Ser Pro Gly Ser Pro Thr Cys Cys Arg Cys Lys Glu Leu 210 215 220 Gly Leu Glu Lys Glu Asp Ala Ala Leu Leu Pro Arg Ala Gly Leu Asp 225 230 235 240 Gly Asp Glu Lys Leu Pro Arg Ala Val Thr Leu Thr Asp Ser Leu Arg 245 250 255 Thr Ala Arg Thr Ile Trp Ile Ser Thr Val Gly Pro Trp Arg Lys Ala 260 265 270 Arg Trp Trp Leu Gln Thr Ala Ala Leu Gln Asp Pro Gly Ser Val Pro 275 280 285 Arg Gly Pro Ser Asp Leu Gly Ser Ala Trp Ser Ser Ala Cys Gln Ala 290 295 300 Asp Ala Ala Pro Pro Thr Gly Glu Ala Ser 305 310 871544DNAHomo sapiens 87gagagcgaag ctcctctgca ctgggcccag gtgcgctcct cagcgtctcc gggtggcggg 60gcgcgcggga tggaggagtc ttgggaggct gcgcccggag gccaagccgg ggcagagctc 120ccaatggagc ccgtgggaag cctggtcccc acgctggagc agccgcaggt gcccgcgaag 180gtgcgacaac ctgaaggtcc cgaaagcagc ccaagtccgg ccggggccgt ggagaaggcg 240gcgggcgcag gcctggagcc ctcgagcaag aaaaagccgc cttcgcctcg ccccgggtcc 300ccgcgcgtgc cgccgctcag cctgggctac ggggtctgcc ccgagccgcc gtcaccgggc 360cctgccttgg tcaagctgcc ccggaatggc gaggcgcccg gggctgagcc tgcgcccagc 420gcctgggcgc ccatggagct gcaggtagat gtgcgcgtga agcccgtggg cgcggccggt 480ggcagcagca cgccatcgcc caggccctcc acgcgcttcc tcaaggtgcc ggtgcccgag 540tcccctgcct tctcccgcca cgcggacccg gcgcaccagc tcctgctgcg cgcaccatcc 600cagggcggca cgtggggccg ccgctcgccg ctggctgcag cccggacgga gagcggctgc 660gacgcagagg gccgggccag ccccgcggaa ggaagcgccg gctccccggg ctcccccacg 720tgctgccgct gcaaggagct ggggctggag aaggaggatg cggcgctgtt gccccgcgcg 780gggttggacg gcgacgagaa gctgccccgg gccgtaacgc ttacggggct acccatgtac 840gtgaagtccc tgtactgggc cctggcgttc atggctgtgc tcctggcagt ctctggggtt 900gtcattgtgg tcctggcctc aagagcagga gccagatgcc agcagtgccc cccaggctgg 960gtgttgtccg aggagcactg ttactacttc tctgcagaag cgcaggcctg ggaagccagc 1020caggctttct gctcagccta ccacgctacc ctccccctgc taagccacac ccaggacttc 1080ctgggcagat acccagtctc caggcactcc tgggtggggg cctggcgagg cccccagggc 1140tggcactgga tcgacgaggc cccactcccg ccccagctac tccctgagga cggcgaggac 1200aatctggata tcaactgtgg ggccctggag gaaggcacgc tggtggctgc aaactgcagc 1260actccaagac cctgggtctg tgccaagggg acccagtgat ctgggctctg cctggtcctc 1320agcctgccag gcagatgcag caccccctac aggggaggcc agttgagagc ttgggcagcc 1380tcttcctgga cccagttatc caggtcttca tgctctgctc aagggggcca catgagcgag 1440cctaggagct ggacttcaac ccaggaagat gcatccgagg gaaaggagat tttctatggc 1500ctcaggcctg agtgccaata ttagtctcca gcttctgtgg atga 1544881192DNAHomo sapiens 88gagagcgaag ctcctctgca ctgggcccag gtgcgctcct cagcgtctcc gggtggcggg 60gcgcgcggga tggaggagtc ttgggaggct gcgcccggag gccaagccgg ggcagagctc 120ccaatggagc ccgtgggaag cctggtcccc acgctggagc agccgcaggt gcccgcgaag 180gtgcgacaac ctgaaggtcc cgaaagcagc ccaagtccgg ccggggccgt ggagaaggcg 240gcgggcgcag gcctggagcc ctcgagcaag aaaaagccgc cttcgcctcg ccccgggtcc 300ccgcgcgtgc cgccgctcag cctgggctac ggggtctgcc ccgagccgcc gtcaccgggc 360cctgccttgg tcaagctgcc ccggaatggc gaggcgcccg gggctgagcc tgcgcccagc 420gcctgggcgc ccatggagct gcaggtagat gtgcgcgtga agcccgtggg cgcggccggt 480ggcagcagca cgccatcgcc caggccctcc acgcgcttcc tcaaggtgcc ggtgcccgag 540tcccctgcct tctcccgcca cgcggacccg gcgcaccagc tcctgctgcg cgcaccatcc 600cagggcggca cgtggggccg ccgctcgccg ctggctgcag cccggacgga gagcggctgc 660gacgcagagg gccgggccag ccccgcggaa ggaagcgccg gctccccggg ctcccccacg 720tgctgccgct gcaaggagct ggggctggag aaggaggatg cggcgctgtt gccccgcgcg 780gggttggacg gcgacgagaa gctgccccgg gccgtaacgc ttacggactc cctgaggacg 840gcgaggacaa tctggatatc aactgtgggg ccctggagga aggcacgctg gtggctgcaa 900actgcagcac tccaagaccc tgggtctgtg ccaaggggac ccagtgatct gggctctgcc 960tggtcctcag cctgccaggc agatgcagca ccccctacag gggaggccag ttgagagctt 1020gggcagcctc ttcctggacc cagttatcca ggtcttcatg ctctgctcaa gggggccaca 1080tgagcgagcc taggagctgg acttcaaccc aggaagatgc atccgaggga aaggagattt 1140tctatggcct caggcctgag tgccaatatt agtctccagc ttctgtggat ga 119289518PRTHomo sapiens 89Met Leu Ala Ala Ala Ser Lys Tyr Arg His Gly Asn Met Val Phe Phe 1 5 10 15 Asp Val Leu Gly Leu Phe Val Ile Ala Tyr Pro Ser Arg Ile Gly Ser 20 25 30 Ile Ile Asn Tyr Met Val Val Met Gly Val Val Leu Tyr Leu Gly Lys 35 40 45 Lys Phe Leu Gln Pro Lys His Lys Thr Gly Asn Tyr Lys Lys Asp Phe 50 55 60 Leu Cys Gly Leu Gly Ile Thr Leu Ile Ser Trp Phe Thr Ser Leu Val 65 70 75 80 Thr Val Leu Ile Ile Ala Val Phe Ile Ser Leu Ile Gly Gln Ser Leu 85 90 95 Ser Trp Tyr Asn His Phe Tyr Val Ser Val Cys Leu Tyr Gly Thr Ala 100 105 110 Thr Val Ala Lys Ile Ile Leu Ile His Thr Leu Ala Lys Arg Phe Tyr 115 120 125 Tyr Met Asn Ala Ser Ala Gln Tyr Leu Gly Glu Val Phe Phe Asp Ile 130 135 140 Ser Leu Phe Val His Cys Cys Phe Leu Val Thr Leu Thr Tyr Gln Gly 145 150 155 160 Leu Cys Ser Ala Phe Ile Ser Ala Val Trp Val Ala Phe Pro Leu Leu 165 170 175 Thr Lys Leu Cys Val His Lys Asp Phe Lys Gln His Gly Ala Gln Gly 180 185 190 Lys Phe Ile Ala Phe Tyr Leu Leu Gly Met Phe Ile Pro Tyr Leu Tyr 195 200 205 Ala Leu Tyr Leu Ile Trp Ala Val Phe Glu Met Phe Thr Pro Ile Leu 210 215 220 Gly Arg Ser Gly Ser Glu Ile Pro Pro Asp Val Val Leu Ala Ser Ile 225 230 235 240 Leu Ala Gly Cys Thr Met Ile Leu Ser Ser Tyr Phe Ile Asn Phe Ile 245 250 255 Tyr Leu Ala Lys Ser Thr Lys Lys Thr Met Leu Thr Leu Thr Leu Val 260 265 270 Cys Ala Ile Thr Phe Leu Leu Val Cys Ser Gly Thr Phe Phe Pro Tyr 275 280 285 Ser Ser Asn Pro Ala Asn Pro Lys Pro Lys Arg Val Phe Leu Gln His 290 295 300 Met Thr Arg Thr Phe His Asp Leu Glu Gly Asn Ala Val Lys Arg Asp 305 310 315 320 Ser Gly Ile Trp Ile Asn Gly Phe Asp Tyr Thr Gly Ile Ser His Ile 325 330 335 Thr Pro His Ile Pro Glu Ile Asn Asp Ser Ile Arg Ala His Cys Glu 340 345 350 Glu Asn Ala Pro Leu Cys Gly Phe Pro Trp Tyr Leu Pro Val His Phe 355 360 365 Leu Ile Arg Lys Asn Trp Tyr Leu Pro Ala Pro Glu Val Ser Pro Arg 370 375 380 Asn Pro Pro His Phe Arg Leu Ile Ser Lys Glu Gln Thr Pro Trp Asp 385 390

395 400 Ser Ile Lys Leu Thr Phe Glu Ala Thr Gly Pro Ser His Met Ser Phe 405 410 415 Tyr Val Arg Ala His Lys Gly Ser Thr Leu Ser Gln Trp Ser Leu Gly 420 425 430 Asn Gly Thr Pro Val Thr Ser Lys Gly Gly Asp Tyr Phe Val Phe Tyr 435 440 445 Ser His Gly Leu Gln Ala Ser Ala Trp Gln Phe Trp Ile Glu Val Gln 450 455 460 Val Ser Glu Glu His Pro Glu Gly Met Val Thr Val Ala Ile Ala Ala 465 470 475 480 His Tyr Leu Ser Gly Glu Asp Lys Arg Ser Pro Gln Leu Asp Ala Leu 485 490 495 Lys Glu Lys Phe Pro Asp Trp Thr Phe Pro Ser Ala Trp Val Cys Thr 500 505 510 Tyr Asp Leu Phe Val Phe 515 90904PRTHomo sapiens 90Met Glu Trp Gly Ser Glu Ser Ala Ala Val Arg Arg His Arg Val Gly 1 5 10 15 Val Glu Arg Arg Glu Gly Ala Ala Ala Ala Pro Pro Pro Glu Arg Glu 20 25 30 Ala Arg Ala Gln Glu Pro Leu Val Asp Gly Cys Ser Gly Gly Gly Arg 35 40 45 Thr Arg Lys Arg Ser Pro Gly Gly Ser Gly Gly Ala Ser Arg Gly Ala 50 55 60 Gly Thr Gly Leu Ser Glu Val Arg Ala Ala Leu Gly Leu Ala Leu Tyr 65 70 75 80 Leu Ile Ala Leu Arg Thr Leu Val Gln Leu Ser Leu Gln Gln Leu Val 85 90 95 Leu Arg Gly Ala Ala Gly His Arg Gly Glu Phe Asp Ala Leu Gln Ala 100 105 110 Arg Asp Tyr Leu Glu His Ile Thr Ser Ile Gly Pro Arg Thr Thr Gly 115 120 125 Ser Pro Glu Asn Glu Ile Leu Thr Val His Tyr Leu Leu Glu Gln Ile 130 135 140 Lys Leu Ile Glu Val Gln Ser Asn Ser Leu His Lys Ile Ser Val Asp 145 150 155 160 Val Gln Arg Pro Thr Gly Ser Phe Ser Ile Asp Phe Leu Gly Gly Phe 165 170 175 Thr Ser Tyr Tyr Asp Asn Ile Thr Asn Val Val Val Lys Leu Glu Pro 180 185 190 Arg Asp Gly Ala Gln His Ala Val Leu Ala Asn Cys His Phe Asp Ser 195 200 205 Val Ala Asn Ser Pro Gly Ala Ser Asp Asp Ala Val Ser Cys Ser Val 210 215 220 Met Leu Glu Val Leu Arg Val Leu Ser Thr Ser Ser Glu Ala Leu His 225 230 235 240 His Ala Val Ile Phe Leu Phe Asn Gly Ala Glu Glu Asn Val Leu Gln 245 250 255 Ala Ser His Gly Phe Ile Thr Gln His Pro Trp Ala Ser Leu Ile Arg 260 265 270 Ala Phe Ile Asn Leu Glu Ala Ala Gly Val Gly Gly Lys Glu Leu Val 275 280 285 Phe Gln Thr Gly Pro Glu Asn Pro Trp Leu Val Gln Ala Tyr Val Ser 290 295 300 Ala Ala Lys His Pro Phe Ala Ser Val Val Ala Gln Glu Val Phe Gln 305 310 315 320 Ser Gly Ile Ile Pro Ser Asp Thr Asp Phe Arg Ile Tyr Arg Asp Phe 325 330 335 Gly Asn Ile Pro Gly Ile Asp Leu Ala Phe Ile Glu Asn Gly Tyr Ile 340 345 350 Tyr His Thr Lys Tyr Asp Thr Ala Asp Arg Ile Leu Thr Asp Ser Ile 355 360 365 Gln Arg Ala Gly Asp Asn Ile Leu Ala Val Leu Lys His Leu Ala Thr 370 375 380 Ser Asp Met Leu Ala Ala Ala Ser Lys Tyr Arg His Gly Asn Met Val 385 390 395 400 Phe Phe Asp Val Leu Gly Leu Phe Val Ile Ala Tyr Pro Ser Arg Ile 405 410 415 Gly Ser Ile Ile Asn Tyr Met Val Val Met Gly Val Val Leu Tyr Leu 420 425 430 Gly Lys Lys Phe Leu Gln Pro Lys His Lys Thr Gly Asn Tyr Lys Lys 435 440 445 Asp Phe Leu Cys Gly Leu Gly Ile Thr Leu Ile Ser Trp Phe Thr Ser 450 455 460 Leu Val Thr Val Leu Ile Ile Ala Val Phe Ile Ser Leu Ile Gly Gln 465 470 475 480 Ser Leu Ser Trp Tyr Asn His Phe Tyr Val Ser Val Cys Leu Tyr Gly 485 490 495 Thr Ala Thr Val Ala Lys Ile Ile Leu Ile His Thr Leu Ala Lys Arg 500 505 510 Phe Tyr Tyr Met Asn Ala Ser Ala Gln Tyr Leu Gly Glu Val Phe Phe 515 520 525 Asp Ile Ser Leu Phe Val His Cys Cys Phe Leu Val Thr Leu Thr Tyr 530 535 540 Gln Gly Leu Cys Ser Ala Phe Ile Ser Ala Val Trp Val Ala Phe Pro 545 550 555 560 Leu Leu Thr Lys Leu Cys Val His Lys Asp Phe Lys Gln His Gly Ala 565 570 575 Gln Gly Lys Phe Ile Ala Phe Tyr Leu Leu Gly Met Phe Ile Pro Tyr 580 585 590 Leu Tyr Ala Leu Tyr Leu Ile Trp Ala Val Phe Glu Met Phe Thr Pro 595 600 605 Ile Leu Gly Arg Ser Gly Ser Glu Ile Pro Pro Asp Val Val Leu Ala 610 615 620 Ser Ile Leu Ala Gly Cys Thr Met Ile Leu Ser Ser Tyr Phe Ile Asn 625 630 635 640 Phe Ile Tyr Leu Ala Lys Ser Thr Lys Lys Thr Met Leu Thr Leu Thr 645 650 655 Leu Val Cys Ala Ile Thr Phe Leu Leu Val Cys Ser Gly Thr Phe Phe 660 665 670 Pro Tyr Ser Ser Asn Pro Ala Asn Pro Lys Pro Lys Arg Val Phe Leu 675 680 685 Gln His Met Thr Arg Thr Phe His Asp Leu Glu Gly Asn Ala Val Lys 690 695 700 Arg Asp Ser Gly Ile Trp Ile Asn Gly Phe Asp Tyr Thr Gly Ile Ser 705 710 715 720 His Ile Thr Pro His Ile Pro Glu Ile Asn Asp Ser Ile Arg Ala His 725 730 735 Cys Glu Glu Asn Ala Pro Leu Cys Gly Phe Pro Trp Tyr Leu Pro Val 740 745 750 His Phe Leu Ile Arg Lys Asn Trp Tyr Leu Pro Ala Pro Glu Val Ser 755 760 765 Pro Arg Asn Pro Pro His Phe Arg Leu Ile Ser Lys Glu Gln Thr Pro 770 775 780 Trp Asp Ser Ile Lys Leu Thr Phe Glu Ala Thr Gly Pro Ser His Met 785 790 795 800 Ser Phe Tyr Val Arg Ala His Lys Gly Ser Thr Leu Ser Gln Trp Ser 805 810 815 Leu Gly Asn Gly Thr Pro Val Thr Ser Lys Gly Gly Asp Tyr Phe Val 820 825 830 Phe Tyr Ser His Gly Leu Gln Ala Ser Ala Trp Gln Phe Trp Ile Glu 835 840 845 Val Gln Val Ser Glu Glu His Pro Glu Gly Met Val Thr Val Ala Ile 850 855 860 Ala Ala His Tyr Leu Ser Gly Glu Asp Lys Arg Ser Pro Gln Leu Asp 865 870 875 880 Ala Leu Lys Glu Lys Phe Pro Asp Trp Thr Phe Pro Ser Ala Trp Val 885 890 895 Cys Thr Tyr Asp Leu Phe Val Phe 900 91419PRTHomo sapiens 91Met Val Val Met Gly Val Val Leu Tyr Leu Gly Lys Lys Phe Leu Gln 1 5 10 15 Pro Lys His Lys Thr Gly Asn Tyr Lys Lys Asp Phe Leu Cys Gly Leu 20 25 30 Gly Ile Thr Leu Ile Ser Trp Phe Thr Ser Leu Val Thr Val Leu Ile 35 40 45 Ile Ala Val Phe Ile Ser Leu Ile Gly Gln Ser Leu Ser Trp Tyr Asn 50 55 60 His Phe Tyr Val Ser Val Cys Leu Tyr Gly Thr Ala Thr Val Ala Lys 65 70 75 80 Ile Ile Leu Ile His Thr Leu Ala Lys Arg Phe Tyr Tyr Met Asn Ala 85 90 95 Ser Ala Gln Tyr Leu Gly Glu Val Phe Phe Asp Ile Ser Leu Phe Val 100 105 110 His Cys Cys Phe Leu Val Thr Leu Thr Tyr Gln Gly Leu Cys Ser Ala 115 120 125 Phe Ile Ser Ala Val Trp Val Ala Phe Pro Leu Leu Thr Lys Leu Cys 130 135 140 Val His Lys Asp Phe Lys Gln His Gly Ala Gln Gly Lys Phe Ile Ala 145 150 155 160 Phe Tyr Leu Leu Gly Met Phe Ile Pro Tyr Leu Tyr Ala Leu Tyr Leu 165 170 175 Ile Trp Ala Val Phe Glu Met Phe Thr Pro Ile Leu Gly Arg Ser Gly 180 185 190 Ser Glu Ile Pro Pro Asp Val Val Leu Ala Ser Ile Leu Ala Gly Cys 195 200 205 Thr Met Ile Leu Ser Ser Tyr Phe Ile Asn Phe Ile Tyr Leu Ala Lys 210 215 220 Ser Thr Lys Lys Thr Met Leu Thr Leu Thr Leu Val Cys Ala Ile Thr 225 230 235 240 Phe Leu Leu Val Cys Ser Gly Thr Phe Phe Pro Tyr Ser Ser Asn Pro 245 250 255 Ala Asn Pro Lys Pro Lys Arg Val Phe Leu Gln His Met Thr Arg Thr 260 265 270 Phe His Asp Leu Glu Gly Asn Ala Val Lys Arg Asp Ser Gly Ile Trp 275 280 285 Ile Asn Gly Phe Asp Tyr Thr Gly Ile Ser His Ile Thr Pro His Ile 290 295 300 Pro Glu Ile Asn Asp Ser Ile Arg Ala His Cys Glu Glu Asn Ala Pro 305 310 315 320 Leu Cys Gly Phe Pro Trp Tyr Leu Pro Val His Phe Leu Ile Arg Lys 325 330 335 Asn Trp Tyr Leu Pro Ala Pro Glu Val Ser Pro Arg Asn Pro Pro His 340 345 350 Phe Arg Leu Ile Ser Lys Glu Gln Thr Pro Trp Asp Ser Ile Lys Leu 355 360 365 Thr Phe Glu Ala Thr Ala Cys Leu Pro Ile Leu Gln Ile Leu Asp Leu 370 375 380 Pro Ala Ser Thr Ile Met Thr Lys Pro Tyr Val Leu Leu Cys Ser Ser 385 390 395 400 Pro Gln Arg Val Asn Thr Phe Ser Val Val Ser Trp Gln Trp His Pro 405 410 415 Ser His Lys 924974DNAHomo sapiens 92ggcgcgggga ccgggctgtc tgaggtgcgc gccgcgctgg ggctcgcgct ctacctgatc 60gcgctgcgga cgctggtgca gctctcgctg cagcagctcg tgctacgcgg ggccgctgga 120caccgcgggg agttcgacgc gctccaagcc agggattatc ttgaacacat aacctccatt 180ggccccagga ctacaggaag tccagaaaat gaaattctga ccgtgcacta ccttttggaa 240cagattaaac tgattgaagt gcaaagcaac agccttcata agatttcagt agatgtacaa 300cggcccacag gctcttttag cattgatttc ttgggaggtt ttacaagcta ttatgacaac 360atcaccaatg ttgtggtaaa gctggaaccc agagatggag cccagcatgc tgtcttggct 420aattgtcatt ttgactcagt agcaaactca ccaggccagt catggtttca ttactcagca 480cccctgggct agcttgattc gtgcattcat taacctagag gcagcaggtg taggagggaa 540agaacttgta ttccaaacag gtcctgaaaa tccttggttg gttcaagctt atgtttcagc 600agctaaacac ccttttgctt ctgtggtggc tcaggaggtt tttcagagtg gaatcattcc 660ttcagatact gactttcgta tctacaggga ttttgggaac attccaggaa tagacttagc 720ttttattgag aatggataca tttatcacac caagtatgac acagcggaca gaattctaac 780agattccatt cagagagcag gtgacaacat tttagcagtt cttaagcatc tagctacatc 840tgatatgctg gctgctgctt ctaagtatcg acatggaaac atggtcttct ttgatgtgct 900gggcctgttt gtcattgcct acccctctcg tattggctca atcataaact acatggtggt 960aatgggtgtt gttttgtacc tgggcaaaaa atttttgcag cccaaacata agactggtaa 1020ctacaagaag gacttcttgt gtggacttgg catcactttg atcagctggt tcactagcct 1080tgttaccgtt ctcattatag cagtgttcat ctctcttatt ggacagtctc tctcatggta 1140taaccacttc tatgtctccg tttgtctgta tggaactgca actgtagcca aaataatact 1200tatacatact cttgcgaaaa gattttatta catgaatgcc agtgcccagt atctgggaga 1260agtatttttt gacatttcgc tgtttgtcca ttgctgtttt cttgttaccc tcacttacca 1320aggactttgc tcggcgttta ttagtgctgt ctgggtagca ttcccattgc tcacaaagct 1380ctgtgtgcat aaggacttca agcagcatgg tgcccaagga aaatttattg ctttttacct 1440tttggggatg tttattcctt atctttatgc attgtacctc atctgggcag tatttgagat 1500gtttacccct atcctcggga gaagtggttc tgaaatccca cctgatgttg tgctggcatc 1560cattttggct ggctgtacaa tgattctctc gtcctatttt attaacttca tctaccttgc 1620caagagcaca aaaaaaacca tgctaacttt aactttggta tgtgcaatta cattcctcct 1680tgtttgcagt ggaacatttt ttccatatag ctccaatcct gctaatccga agccaaagag 1740agtgtttctt cagcatatga ctagaacatt ccatgacttg gaaggaaatg cagttaaacg 1800ggactctgga atatggatca atgggtttga ttatactgga atttctcaca taacccctca 1860cattcctgag atcaatgata gtatccgagc tcactgtgag gagaatgcac ctctttgtgg 1920ttttccttgg tatcttccag tgcactttct gatcaggaaa aactggtatc ttcctgcccc 1980agaagtttct ccaagaaatc ctcctcattt ccgactcata tccaaagaac agacaccttg 2040ggattctata aaattgactt ttgaagcaac aggaccaagc catatgtcct tctatgttcg 2100agcccacaaa gggtcaacac tttctcagtg gtctcttggc aatggcaccc cagtcacaag 2160taaaggagga gactactttg tcttttactc ccatggactc caggcctctg catggcagtt 2220ctggatagaa gtgcaggttt cagaagaaca tcctgaagga atggtcaccg tggccattgc 2280tgcccactat ctgtctgggg aagacaagag atcccctcaa ctggatgctc tgaaggaaaa 2340gttcccagat tggacatttc cctctgcctg ggtgtgcacc tacgatctct ttgtatttta 2400atcttgtgga tgagctctaa gtacatgccc agtggatact ccatgtgaca tggtttctcc 2460ctatgttacg tggatgtttg taacgtaagt caatgaattt taatgatcat atgttcaaag 2520agctttctgg gttaacgctt ttcagggcca agcactataa gggtttagct gtggcgcagt 2580gatgcatggc ctgttgacac ttgaaaatgc cagtcttttg gcacttcagc acatgtgggt 2640actgccacta cacacacgtc attttatatg accttaagga caaagccaac aatccacttc 2700aatagctgcc cctttaggat caagaaagat gtacactgtc agagcattgt taatgagaca 2760aaagttgttt ccaatttaag ccccaaaacc atttgttgta ttagtggatg gtgggtaaaa 2820tatcattcac tgaggtaatg attccccttg agaatataac tctgtgtagg tcactggaaa 2880gtgattgcca tagggctggg agagaagcat tgcactcttg aggctgtagc ctgtgtcaag 2940ctgtttcttc aggcagcctc tcaaatgtgc tttgtctctc tgtgctgagg cctggaccct 3000gtgctgagct ggtgactcac tgtcctgaca agtggacaca cagatgcact gctgtgctgc 3060tttcctgagg tggttttcta tgcctgtttt cctctgaaac atgtctgtta cccctctcca 3120tcttaccaag ttgaaaaggg gaatatttgg ccacataccc ctctggtttt cgtaggttct 3180tttggttcag aatattgttt gtgccagtac atgaccttaa cttccttcct cagagcactg 3240agctgccatc tgggctattc tggggtagaa ggaaggctgg gagtggtggg aattttataa 3300atatttattc tcttttcttt gtttcatagg agtcttgtgt tatacaaggt tagtccttca 3360tggtataatc ttactgatgc actgggccta tctttttgtt ttccagccag ttgaatagat 3420tagtttttct cagtaactta ctatccagca gactggcttt cctgagactt gaggttgtgg 3480cttatactgg aatgagacca ctgtacgtgt aggtggttca gatcctgcgt aatggcagca 3540tgaggactta aaaggtggtt ttcattttga agatggctat gtagcttgta aggtgtatca 3600cagcagtacc tctcatggct ttttggttcc agcagtgagg gcattggtga gatcaatggt 3660aaactgtgca agctttcttt ttatcattag gaaatgtgaa acgttggaca aattttgagt 3720tttaacaagg acaaaaagtt gaaagaaaag gcacagttaa caaaaaaggg tggctagatt 3780tatcttgggt gatggaggaa atgagagagg aatgctcttg aaaggtggtc tgtggatctg 3840tctgaataga aagagcacag taagtatgca ttgccggaga aaacgtcctt gaagctgctt 3900gtctcatgtg tatgatgtgc tttttaaatc atgcccctcg ttgcctgcct aatctgtgac 3960tccctaaaaa ctaactgggc ccatgtagat ggggctgcaa ccagagctga ataacatgtt 4020aggctcacac atgcatcagc actgcacact ggaatcattg ctcttcctgg actttgtaga 4080aatcagtctc aagtgcttca agagtctggc tcctgctact tttatctgtc aggtagcaca 4140taaggtttgc agggtttata ttttgtatag aatcacagtt gtggagaaaa agtaataatt 4200tctcaatgaa ttttaaaaat gggcctattt tctatccccg tggttcatct gatataatta 4260gtgttccctg tgaattcccc ccctctatgg gaaggatgcc tttactcttt atcagtaata 4320aattatgact gttttcatat tgccttaggg ttatttccct gtgtaaacca ttgtcttttg 4380ttttggtttt ctttagcatt atgaagcttt ggtattgtac aaggtcagta gtaagatgct 4440cactagtctc agggcttgtg taatattctg ggaggtcatt taaatgccag aaatggtcaa 4500gcaattatac acagtattta tgactctgtt aagcataccg tttgtctgtc acattagtag 4560attctgagat taaaaaaaat ttttaaagag tgatcattta aataatttct aaaagggtct 4620tttcaagctc taacaaagtc actaacaaat gcattatttt ctacagaatt agatgttagt 4680agtacagtac tgcatattca gggaaaaagt gtgaggaatt gatttcaaaa tagttcgttc 4740ttgtgtttga cctaagaatg attgtcgcat gaagtgtttg tttttacagt ttagcatata 4800taaacaaaca tgataggatt ccttaagatg ttaccaccca gggggccaca agccagcctg 4860ctgtctcagg aagctgtaga aggagtgttt gtcaatttct tgtcactggt ttgctgactt 4920actgaggatt aattgttgcc ttacaatgtt actgaaataa actgtttaat atac 4974935338DNAHomo sapiens 93ggccggggct gtcgcgggtt ggggcggttg ggctggcagc tgaggctcgt ggccatggag 60tggggttctg agtcggctgc tgtgaggcgg caccgcgtcg gagtagagcg tcgagaggga 120gcggcggccg cgccaccgcc ggagagggag gcccgagcgc aggagcctct ggtggatggg 180tgcagcggcg gcgggaggac gcggaagagg agccccgggg gtagcggcgg cgcgagcagg 240ggcgcgggga ccgggctgtc tgaggtgcgc gccgcgctgg ggctcgcgct ctacctgatc 300gcgctgcgga cgctggtgca gctctcgctg cagcagctcg tgctacgcgg ggccgctgga 360caccgcgggg agttcgacgc gctccaagcc agggattatc ttgaacacat aacctccatt 420ggccccagga ctacaggaag tccagaaaat gaaattctga ccgtgcacta ccttttggaa 480cagattaaac tgattgaagt gcaaagcaac agccttcata agatttcagt agatgtacaa 540cggcccacag gctcttttag cattgatttc ttgggaggtt

ttacaagcta ttatgacaac 600atcaccaatg ttgtggtaaa gctggaaccc agagatggag cccagcatgc tgtcttggct 660aattgtcatt ttgactcagt agcaaactca ccaggtgcca gtgatgatgc agttagctgc 720tcagtgatgc tggaagtcct tcgcgtcttg tcaacatctt cagaagcctt gcatcatgct 780gtcatatttc tctttaatgg tgctgaggaa aatgtcttgc aagccagtca tggtttcatt 840actcagcacc cctgggctag cttgattcgt gcattcatta acctagaggc agcaggtgta 900ggagggaaag aacttgtatt ccaaacaggt cctgaaaatc cttggttggt tcaagcttat 960gtttcagcag ctaaacaccc ttttgcttct gtggtggctc aggaggtttt tcagagtgga 1020atcattcctt cagatactga ctttcgtatc tacagggatt ttgggaacat tccaggaata 1080gacttagctt ttattgagaa tggatacatt tatcacacca agtatgacac agcggacaga 1140attctaacag attccattca gagagcaggt gacaacattt tagcagttct taagcatcta 1200gctacatctg atatgctggc tgctgcttct aagtatcgac atggaaacat ggtcttcttt 1260gatgtgctgg gcctgtttgt cattgcctac ccctctcgta ttggctcaat cataaactac 1320atggtggtaa tgggtgttgt tttgtacctg ggcaaaaaat ttttgcagcc caaacataag 1380actggtaact acaagaagga cttcttgtgt ggacttggca tcactttgat cagctggttc 1440actagccttg ttaccgttct cattatagca gtgttcatct ctcttattgg acagtctctc 1500tcatggtata accacttcta tgtctccgtt tgtctgtatg gaactgcaac tgtagccaaa 1560ataatactta tacatactct tgcgaaaaga ttttattaca tgaatgccag tgcccagtat 1620ctgggagaag tattttttga catttcgctg tttgtccatt gctgttttct tgttaccctc 1680acttaccaag gactttgctc ggcgtttatt agtgctgtct gggtagcatt cccattgctc 1740acaaagctct gtgtgcataa ggacttcaag cagcatggtg cccaaggaaa atttattgct 1800ttttaccttt tggggatgtt tattccttat ctttatgcat tgtacctcat ctgggcagta 1860tttgagatgt ttacccctat cctcgggaga agtggttctg aaatcccacc tgatgttgtg 1920ctggcatcca ttttggctgg ctgtacaatg attctctcgt cctattttat taacttcatc 1980taccttgcca agagcacaaa aaaaaccatg ctaactttaa ctttggtatg tgcaattaca 2040ttcctccttg tttgcagtgg aacatttttt ccatatagct ccaatcctgc taatccgaag 2100ccaaagagag tgtttcttca gcatatgact agaacattcc atgacttgga aggaaatgca 2160gttaaacggg actctggaat atggatcaat gggtttgatt atactggaat ttctcacata 2220acccctcaca ttcctgagat caatgatagt atccgagctc actgtgagga gaatgcacct 2280ctttgtggtt ttccttggta tcttccagtg cactttctga tcaggaaaaa ctggtatctt 2340cctgccccag aagtttctcc aagaaatcct cctcatttcc gactcatatc caaagaacag 2400acaccttggg attctataaa attgactttt gaagcaacag gaccaagcca tatgtccttc 2460tatgttcgag cccacaaagg gtcaacactt tctcagtggt ctcttggcaa tggcacccca 2520gtcacaagta aaggaggaga ctactttgtc ttttactccc atggactcca ggcctctgca 2580tggcagttct ggatagaagt gcaggtttca gaagaacatc ctgaaggaat ggtcaccgtg 2640gccattgctg cccactatct gtctggggaa gacaagagat cccctcaact ggatgctctg 2700aaggaaaagt tcccagattg gacatttccc tctgcctggg tgtgcaccta cgatctcttt 2760gtattttaat cttgtggatg agctctaagt acatgcccag tggatactcc atgtgacatg 2820gtttctccct atgttacgtg gatgtttgta acgtaagtca atgaatttta atgatcatat 2880gttcaaagag ctttctgggt taacgctttt cagggccaag cactataagg gtttagctgt 2940ggcgcagtga tgcatggcct gttgacactt gaaaatgcca gtcttttggc acttcagcac 3000atgtgggtac tgccactaca cacacgtcat tttatatgac cttaaggaca aagccaacaa 3060tccacttcaa tagctgcccc tttaggatca agaaagatgt acactgtcag agcattgtta 3120atgagacaaa agttgtttcc aatttaagcc ccaaaaccat ttgttgtatt agtggatggt 3180gggtaaaata tcattcactg aggtaatgat tccccttgag aatataactc tgtgtaggtc 3240actggaaagt gattgccata gggctgggag agaagcattg cactcttgag gctgtagcct 3300gtgtcaagct gtttcttcag gcagcctctc aaatgtgctt tgtctctctg tgctgaggcc 3360tggaccctgt gctgagctgg tgactcactg tcctgacaag tggacacaca gatgcactgc 3420tgtgctgctt tcctgaggtg gttttctatg cctgttttcc tctgaaacat gtctgttacc 3480cctctccatc ttaccaagtt gaaaagggga atatttggcc acatacccct ctggttttcg 3540taggttcttt tggttcagaa tattgtttgt gccagtacat gaccttaact tccttcctca 3600gagcactgag ctgccatctg ggctattctg gggtagaagg aaggctggga gtggtgggaa 3660ttttataaat atttattctc ttttctttgt ttcataggag tcttgtgtta tacaaggtta 3720gtccttcatg gtataatctt actgatgcac tgggcctatc tttttgtttt ccagccagtt 3780gaatagatta gtttttctca gtaacttact atccagcaga ctggctttcc tgagacttga 3840ggttgtggct tatactggaa tgagaccact gtacgtgtag gtggttcaga tcctgcgtaa 3900tggcagcatg aggacttaaa aggtggtttt cattttgaag atggctatgt agcttgtaag 3960gtgtatcaca gcagtacctc tcatggcttt ttggttccag cagtgagggc attggtgaga 4020tcaatggtaa actgtgcaag ctttcttttt atcattagga aatgtgaaac gttggacaaa 4080ttttgagttt taacaaggac aaaaagttga aagaaaaggc acagttaaca aaaaagggtg 4140gctagattta tcttgggtga tggaggaaat gagagaggaa tgctcttgaa aggtggtctg 4200tggatctgtc tgaatagaaa gagcacagta agtatgcatt gccggagaaa acgtccttga 4260agctgcttgt ctcatgtgta tgatgtgctt tttaaatcat gcccctcgtt gcctgcctaa 4320tctgtgactc cctaaaaact aactgggccc atgtagatgg ggctgcaacc agagctgaat 4380aacatgttag gctcacacat gcatcagcac tgcacactgg aatcattgct cttcctggac 4440tttgtagaaa tcagtctcaa gtgcttcaag agtctggctc ctgctacttt tatctgtcag 4500gtagcacata aggtttgcag ggtttatatt ttgtatagaa tcacagttgt ggagaaaaag 4560taataatttc tcaatgaatt ttaaaaatgg gcctattttc tatccccgtg gttcatctga 4620tataattagt gttccctgtg aattcccccc ctctatggga aggatgcctt tactctttat 4680cagtaataaa ttatgactgt tttcatattg ccttagggtt atttccctgt gtaaaccatt 4740gtcttttgtt ttggttttct ttagcattat gaagctttgg tattgtacaa ggtcagtagt 4800aagatgctca ctagtctcag ggcttgtgta atattctggg aggtcattta aatgccagaa 4860atggtcaagc aattatacac agtatttatg actctgttaa gcataccgtt tgtctgtcac 4920attagtagat tctgagatta aaaaaaattt ttaaagagtg atcatttaaa taatttctaa 4980aagggtcttt tcaagctcta acaaagtcac taacaaatgc attattttct acagaattag 5040atgttagtag tacagtactg catattcagg gaaaaagtgt gaggaattga tttcaaaata 5100gttcgttctt gtgtttgacc taagaatgat tgtcgcatga agtgtttgtt tttacagttt 5160agcatatata aacaaacatg ataggattcc ttaagatgtt accacccagg gggccacaag 5220ccagcctgct gtctcaggaa gctgtagaag gagtgtttgt caatttcttg tcactggttt 5280gctgacttac tgaggattaa ttgttgcctt acaatgttac tgaaataaac tgtttaat 5338945387DNAHomo sapiens 94ggccggggct gtcgcgggtt ggggcggttg ggctggcagc tgaggctcgt ggccatggag 60tggggttctg agtcggctgc tgtgaggcgg caccgcgtcg gagtagagcg tcgagaggga 120gcggcggccg cgccaccgcc ggagagggag gcccgagcgc aggagcctct ggtggatggg 180tgcagcggcg gcgggaggac gcggaagagg agccccgggg gtagcggcgg cgcgagcagg 240ggcgcgggga ccgggctgtc tgaggtgcgc gccgcgctgg ggctcgcgct ctacctgatc 300gcgctgcgga cgctggtgca gctctcgctg cagcagctcg tgctacgcgg ggccgctgga 360caccgcgggg agttcgacgc gctccaagcc agggattatc ttgaacacat aacctccatt 420ggccccagga ctacaggaag tccagaaaat gaaattctga ccgtgcacta ccttttggaa 480cagattaaac tgattgaagt gcaaagcaac agccttcata agatttcagt agatgtacaa 540cggcccacag gctcttttag cattgatttc ttgggaggtt ttacaagcta ttatgacaac 600atcaccaatg ttgtggtaaa gctggaaccc agagatggag cccagcatgc tgtcttggct 660aattgtcatt ttgactcagt agcaaactca ccaggtgcca gtgatgatgc agttagctgc 720tcagtgatgc tggaagtcct tcgcgtcttg tcaacatctt cagaagcctt gcatcatgct 780gtcatatttc tctttaatgg tgctgaggaa aatgtcttgc aagccagtca tggtttcatt 840actcagcacc cctgggctag cttgattcgt gcattcatta acctagaggc agcaggtgta 900ggagggaaag aacttgtatt ccaaacaggt cctgaaaatc cttggttggt tcaagcttat 960gtttcagcag ctaaacaccc ttttgcttct gtggtggctc aggaggtttt tcagagtgga 1020atcattcctt cagatactga ctttcgtatc tacagggatt ttgggaacat tccaggaata 1080gacttagctt ttattgagaa tggatacatt tatcacacca agtatgacac agcggacaga 1140attctaacag attccattca gagagcaggt gacaacattt tagcagttct taagcatcta 1200gctacatctg atatgctggc tgctgcttct aagtatcgac atggaaacat ggtcttcttt 1260gatgtgctgg gcctgtttgt cattgcctac ccctctcgta ttggctcaat cataaactac 1320atggtggtaa tgggtgttgt tttgtacctg ggcaaaaaat ttttgcagcc caaacataag 1380actggtaact acaagaagga cttcttgtgt ggacttggca tcactttgat cagctggttc 1440actagccttg ttaccgttct cattatagca gtgttcatct ctcttattgg acagtctctc 1500tcatggtata accacttcta tgtctccgtt tgtctgtatg gaactgcaac tgtagccaaa 1560ataatactta tacatactct tgcgaaaaga ttttattaca tgaatgccag tgcccagtat 1620ctgggagaag tattttttga catttcgctg tttgtccatt gctgttttct tgttaccctc 1680acttaccaag gactttgctc ggcgtttatt agtgctgtct gggtagcatt cccattgctc 1740acaaagctct gtgtgcataa ggacttcaag cagcatggtg cccaaggaaa atttattgct 1800ttttaccttt tggggatgtt tattccttat ctttatgcat tgtacctcat ctgggcagta 1860tttgagatgt ttacccctat cctcgggaga agtggttctg aaatcccacc tgatgttgtg 1920ctggcatcca ttttggctgg ctgtacaatg attctctcgt cctattttat taacttcatc 1980taccttgcca agagcacaaa aaaaaccatg ctaactttaa ctttggtatg tgcaattaca 2040ttcctccttg tttgcagtgg aacatttttt ccatatagct ccaatcctgc taatccgaag 2100ccaaagagag tgtttcttca gcatatgact agaacattcc atgacttgga aggaaatgca 2160gttaaacggg actctggaat atggatcaat gggtttgatt atactggaat ttctcacata 2220acccctcaca ttcctgagat caatgatagt atccgagctc actgtgagga gaatgcacct 2280ctttgtggtt ttccttggta tcttccagtg cactttctga tcaggaaaaa ctggtatctt 2340cctgccccag aagtttctcc aagaaatcct cctcatttcc gactcatatc caaagaacag 2400acaccttggg attctataaa attgactttt gaagcaacag cctgcctgcc tatccttcag 2460attttggact tgccagcctc aacaatcatg accaagccat atgtccttct atgttcgagc 2520ccacaaaggg tcaacacttt ctcagtggtc tcttggcaat ggcaccccag tcacaagtaa 2580aggaggagac tactttgtct tttactccca tggactccag gcctctgcat ggcagttctg 2640gatagaagtg caggtttcag aagaacatcc tgaaggaatg gtcaccgtgg ccattgctgc 2700ccactatctg tctggggaag acaagagatc ccctcaactg gatgctctga aggaaaagtt 2760cccagattgg acatttccct ctgcctgggt gtgcacctac gatctctttg tattttaatc 2820ttgtggatga gctctaagta catgcccagt ggatactcca tgtgacatgg tttctcccta 2880tgttacgtgg atgtttgtaa cgtaagtcaa tgaattttaa tgatcatatg ttcaaagagc 2940tttctgggtt aacgcttttc agggccaagc actataaggg tttagctgtg gcgcagtgat 3000gcatggcctg ttgacacttg aaaatgccag tcttttggca cttcagcaca tgtgggtact 3060gccactacac acacgtcatt ttatatgacc ttaaggacaa agccaacaat ccacttcaat 3120agctgcccct ttaggatcaa gaaagatgta cactgtcaga gcattgttaa tgagacaaaa 3180gttgtttcca atttaagccc caaaaccatt tgttgtatta gtggatggtg ggtaaaatat 3240cattcactga ggtaatgatt ccccttgaga atataactct gtgtaggtca ctggaaagtg 3300attgccatag ggctgggaga gaagcattgc actcttgagg ctgtagcctg tgtcaagctg 3360tttcttcagg cagcctctca aatgtgcttt gtctctctgt gctgaggcct ggaccctgtg 3420ctgagctggt gactcactgt cctgacaagt ggacacacag atgcactgct gtgctgcttt 3480cctgaggtgg ttttctatgc ctgttttcct ctgaaacatg tctgttaccc ctctccatct 3540taccaagttg aaaaggggaa tatttggcca catacccctc tggttttcgt aggttctttt 3600ggttcagaat attgtttgtg ccagtacatg accttaactt ccttcctcag agcactgagc 3660tgccatctgg gctattctgg ggtagaagga aggctgggag tggtgggaat tttataaata 3720tttattctct tttctttgtt tcataggagt cttgtgttat acaaggttag tccttcatgg 3780tataatctta ctgatgcact gggcctatct ttttgttttc cagccagttg aatagattag 3840tttttctcag taacttacta tccagcagac tggctttcct gagacttgag gttgtggctt 3900atactggaat gagaccactg tacgtgtagg tggttcagat cctgcgtaat ggcagcatga 3960ggacttaaaa ggtggttttc attttgaaga tggctatgta gcttgtaagg tgtatcacag 4020cagtacctct catggctttt tggttccagc agtgagggca ttggtgagat caatggtaaa 4080ctgtgcaagc tttcttttta tcattaggaa atgtgaaacg ttggacaaat tttgagtttt 4140aacaaggaca aaaagttgaa agaaaaggca cagttaacaa aaaagggtgg ctagatttat 4200cttgggtgat ggaggaaatg agagaggaat gctcttgaaa ggtggtctgt ggatctgtct 4260gaatagaaag agcacagtaa gtatgcattg ccggagaaaa cgtccttgaa gctgcttgtc 4320tcatgtgtat gatgtgcttt ttaaatcatg cccctcgttg cctgcctaat ctgtgactcc 4380ctaaaaacta actgggccca tgtagatggg gctgcaacca gagctgaata acatgttagg 4440ctcacacatg catcagcact gcacactgga atcattgctc ttcctggact ttgtagaaat 4500cagtctcaag tgcttcaaga gtctggctcc tgctactttt atctgtcagg tagcacataa 4560ggtttgcagg gtttatattt tgtatagaat cacagttgtg gagaaaaagt aataatttct 4620caatgaattt taaaaatggg cctattttct atccccgtgg ttcatctgat ataattagtg 4680ttccctgtga attccccccc tctatgggaa ggatgccttt actctttatc agtaataaat 4740tatgactgtt ttcatattgc cttagggtta tttccctgtg taaaccattg tcttttgttt 4800tggttttctt tagcattatg aagctttggt attgtacaag gtcagtagta agatgctcac 4860tagtctcagg gcttgtgtaa tattctggga ggtcatttaa atgccagaaa tggtcaagca 4920attatacaca gtatttatga ctctgttaag cataccgttt gtctgtcaca ttagtagatt 4980ctgagattaa aaaaaatttt taaagagtga tcatttaaat aatttctaaa agggtctttt 5040caagctctaa caaagtcact aacaaatgca ttattttcta cagaattaga tgttagtagt 5100acagtactgc atattcaggg aaaaagtgtg aggaattgat ttcaaaatag ttcgttcttg 5160tgtttgacct aagaatgatt gtcgcatgaa gtgtttgttt ttacagttta gcatatataa 5220acaaacatga taggattcct taagatgtta ccacccaggg ggccacaagc cagcctgctg 5280tctcaggaag ctgtagaagg agtgtttgtc aatttcttgt cactggtttg ctgacttact 5340gaggattaat tgttgcctta caatgttact gaaataaact gtttaat 538795202PRTHomo sapiens 95Met Glu Arg Gly Ala Gly Ala Lys Leu Leu Pro Leu Leu Leu Leu Leu 1 5 10 15 Arg Ala Thr Gly Phe Thr Cys Ala Gln Thr Asp Gly Arg Asn Gly Tyr 20 25 30 Thr Ala Val Ile Glu Val Thr Ser Gly Gly Pro Trp Gly Asp Trp Ala 35 40 45 Trp Pro Glu Met Cys Pro Asp Gly Phe Phe Ala Ser Gly Phe Ser Leu 50 55 60 Lys Val Glu Pro Pro Gln Gly Ile Pro Gly Asp Asp Thr Ala Leu Asn 65 70 75 80 Gly Ile Arg Leu His Cys Ala Arg Gly Asn Val Leu Gly Asn Thr His 85 90 95 Val Val Glu Ser Gln Ser Gly Ser Trp Gly Glu Trp Ser Glu Pro Leu 100 105 110 Trp Cys Arg Gly Gly Ala Tyr Leu Val Ala Phe Ser Leu Arg Val Glu 115 120 125 Ala Pro Thr Thr Leu Gly Asp Asn Thr Ala Ala Asn Asn Val Arg Phe 130 135 140 Arg Cys Ser Asp Gly Glu Glu Leu Gln Gly Pro Gly Leu Ser Trp Gly 145 150 155 160 Asp Phe Gly Asp Trp Ser Asp His Cys Pro Lys Gly Ala Cys Gly Leu 165 170 175 Gln Thr Lys Ile Gln Gly Pro Arg Gly Leu Gly Asp Asp Thr Ala Leu 180 185 190 Asn Asp Ala Arg Leu Phe Cys Cys Arg Ser 195 200 9670PRTHomo sapiens 96Met Glu Arg Gly Ala Gly Ala Lys Leu Leu Pro Leu Leu Leu Leu Leu 1 5 10 15 Arg Ala Thr Gly Phe Thr Cys Ala Gln Thr Asp Gly Arg Asn Gly Tyr 20 25 30 Thr Ala Val Ile Glu Val Thr Ser Gly Gly Pro Trp Gly Asp Trp Ala 35 40 45 Trp Pro Glu Met Cys Pro Asp Gly Phe Phe Ala Ser Gly Phe Ser Leu 50 55 60 Lys Leu Gly Arg Met Glu 65 70 97102PRTHomo sapiens 97Met Glu Arg Gly Ala Gly Ala Lys Leu Leu Pro Leu Leu Leu Leu Leu 1 5 10 15 Arg Ala Thr Gly Phe Thr Cys Ala Gln Thr Asp Gly Arg Asn Gly Tyr 20 25 30 Thr Ala Val Ile Glu Val Thr Ser Gly Gly Pro Trp Gly Asp Trp Ala 35 40 45 Trp Pro Glu Met Cys Pro Asp Gly Phe Phe Ala Ser Gly Phe Ser Leu 50 55 60 Lys Val Glu Pro Pro Gln Gly Ile Pro Gly Asp Asp Thr Ala Leu Asn 65 70 75 80 Gly Ile Arg Leu His Cys Ala Arg Gly Asn Val Leu Gly Asn Thr His 85 90 95 Val Leu Gly Arg Met Glu 100 98114PRTHomo sapiens 98Met Glu Arg Gly Ala Gly Ala Lys Leu Leu Pro Leu Leu Leu Leu Leu 1 5 10 15 Arg Ala Thr Gly Phe Thr Cys Ala Gln Thr Asp Gly Arg Asn Gly Tyr 20 25 30 Thr Ala Val Ile Glu Val Thr Ser Gly Gly Pro Trp Gly Asp Trp Ala 35 40 45 Trp Pro Glu Met Cys Pro Asp Gly Phe Phe Ala Ser Gly Phe Ser Leu 50 55 60 Lys Val Glu Pro Pro Gln Gly Ile Pro Gly Asp Asp Thr Ala Leu Asn 65 70 75 80 Gly Ile Arg Leu His Cys Ala Arg Gly Asn Val Leu Gly Asn Thr His 85 90 95 Val Val Glu Ser Gln Ser Gly Arg Trp Gly Ala Gly Val Glu Asp Pro 100 105 110 Leu Gly 99769DNAHomo sapiens 99ggaggccctt ccgcaatcgg agccctcaca gaggccaaac tgatataaat ctgcttagga 60ggcctgattc acagacgcta caggatggag cggggcgcag gagccaagct gctgccgctg 120ctgctgcttc tgcgggcgac tggtttcaca tgtgcacaga cagatggccg gaacggctac 180acggcggtca tcgaagtgac cagcgggggt ccctggggcg actgggcctg gcctgagatg 240tgtcccgatg gattcttcgc cagcgggttc tcgctcaagg tggagcctcc ccaaggcatt 300cctggcgacg acactgcact gaatgggatc aggctgcact gcgcgcgcgg gaacgtccta 360ggcaatacgc acgtggtaga gtcccagtct ggaagctggg gcgaatggag tgagccgctg 420tggtgtcgcg gcggcgccta cctagtggct ttctcgcttc gcgtggaggc acccacgacc 480ctcggtgaca acacagcagc gaacaacgtg cgcttccgct gttcagacgg cgaggaactg 540caggggcctg ggctgagctg gggagacttt ggagactgga gtgaccattg ccccaagggc 600gcgtgcggcc tgcagaccaa gatccaggga cctagaggcc tcggcgatga cactgcgctg 660aacgacgcgc gcttattctg ctgccgcagt tgaacggcgc cgccgccgcc gctctctccc 720gggccaggag gctagtccca cctcttgcta ttaaagcttc tctgagttg 769100653DNAHomo sapiens 100ggaggccctt ccgcaatcgg agccctcaca gaggccaaac tgatataaat ctgcttagga 60ggcctgattc acagacgcta caggatggag cggggcgcag gagccaagct gctgccgctg 120ctgctgcttc tgcgggcgac tggtttcaca tgtgcacaga cagatggccg gaacggctac 180acggcggtca tcgaagtgac cagcgggggt ccctggggcg actgggcctg gcctgagatg 240tgtcccgatg gattcttcgc cagcgggttc tcgctcaagc tggggcgaat ggagtgagcc 300gctgtggtgt cgcggcggcg cctacctagt ggctttctcg cttcgcgtgg aggcacccac 360gaccctcggt gacaacacag cagcgaacaa cgtgcgcttc cgctgttcag acggcgagga 420actgcagggg cctgggctga gctggggaga ctttggagac tggagtgacc attgccccaa 480gggcgcgtgc ggcctgcaga ccaagatcca gggacctaga ggcctcggcg atgacactgc 540gctgaacgac gcgcgcttat tctgctgccg cagttgaacg gcgccgccgc cgccgctctc 600tcccgggcca ggaggctagt cccacctctt gctattaaag cttctctgag ttg 653101747DNAHomo sapiens 101aggcccttcc gcaatcggag ccctcacaga ggccaaactg atataaatct gcttaggagg 60cctgattcac agacgctaca ggatggagcg gggcgcagga

gccaagctgc tgccgctgct 120gctgcttctg cgggcgactg gtttcacatg tgcacagaca gatggccgga acggctacac 180ggcggtcatc gaagtgacca gcgggggtcc ctggggcgac tgggcctggc ctgagatgtg 240tcccgatgga ttcttcgcca gcgggttctc gctcaaggtg gagcctcccc aaggcattcc 300tggcgacgac actgcactga atgggatcag gctgcactgc gcgcgcggga acgtcctagg 360caatacgcac gtgctggggc gaatggagtg agccgctgtg gtgtcgcggc ggcgcctacc 420tagtggcttt ctcgcttcgc gtggaggcac ccacgaccct cggtgacaac acagcagcga 480acaacgtgcg cttccgctgt tcagacggcg aggaactgca ggggcctggg ctgagctggg 540gagactttgg agactggagt gaccattgcc ccaagggcgc gtgcggcctg cagaccaaga 600tccagggacc tagaggcctc ggcgatgaca ctgcgctgaa cgacgcgcgc ttattctgct 660gccgcagttg aacggcgccg ccgccgccgc tctctcccgg gccaggaggc tagtcccacc 720tcttgctatt aaagcttctc tgagttg 747102803DNAHomo sapiens 102aggcccttcc gcaatcggag ccctcacaga ggccaaactg atataaatct gcttaggagg 60cctgattcac agacgctaca ggatggagcg gggcgcagga gccaagctgc tgccgctgct 120gctgcttctg cgggcgactg gtttcacatg tgcacagaca gatggccgga acggctacac 180ggcggtcatc gaagtgacca gcgggggtcc ctggggcgac tgggcctggc ctgagatgtg 240tcccgatgga ttcttcgcca gcgggttctc gctcaaggtg gagcctcccc aaggcattcc 300tggcgacgac actgcactga atgggatcag gctgcactgc gcgcgcggga acgtcctagg 360caatacgcac gtggtagagt cccagtctgg aaggtggggc gcaggggtcg aggatccctt 420ggggtgatgc tggggcgaat ggagtgagcc gctgtggtgt cgcggcggcg cctacctagt 480ggctttctcg cttcgcgtgg aggcacccac gaccctcggt gacaacacag cagcgaacaa 540cgtgcgcttc cgctgttcag acggcgagga actgcagggg cctgggctga gctggggaga 600ctttggagac tggagtgacc attgccccaa gggcgcgtgc ggcctgcaga ccaagatcca 660gggacctaga ggcctcggcg atgacactgc gctgaacgac gcgcgcttat tctgctgccg 720cagttgaacg gcgccgccgc cgccgctctc tcccgggcca ggaggctagt cccacctctt 780gctattaaag cttctctgag ttg 80310321DNAHomo sapiens 103cgaggacaat ctggatatca a 2110421DNAHomo sapiens 104ctggagccct cgagcaagaa a 2110521DNAHomo sapiens 105cccgtggttc atctgatata a 2110621DNAHomo sapiens 106aaggactttg ctcggcgttt a 2110721DNAHomo sapiens 107tacgtggatg tttgtaacgt a 2110821DNAHomo sapiens 108ctcgtattgg ctcaatcata a 21

* * * * *

References


uspto.report is an independent third-party trademark research tool that is not affiliated, endorsed, or sponsored by the United States Patent and Trademark Office (USPTO) or any other governmental organization. The information provided by uspto.report is based on publicly available data at the time of writing and is intended for informational purposes only.

While we strive to provide accurate and up-to-date information, we do not guarantee the accuracy, completeness, reliability, or suitability of the information displayed on this site. The use of this site is at your own risk. Any reliance you place on such information is therefore strictly at your own risk.

All official trademark data, including owner information, should be verified by visiting the official USPTO website at www.uspto.gov. This site is not intended to replace professional legal advice and should not be used as a substitute for consulting with a legal professional who is knowledgeable about trademark law.

© 2024 USPTO.report | Privacy Policy | Resources | RSS Feed of Trademarks | Trademark Filings Twitter Feed