Compositions And Methods For The Detection Diagnosis And Therapy Of Hematological Malignancies Algate; Paul A. ; et al. [CORIXA CORPORATION]

Compositions And Methods For The Detection Diagnosis And Therapy Of Hematological Malignancies

Algate; Paul A. ; et al.

Patent Application Summary

U.S. patent application number 15/269027 was filed with the patent office on 2017-01-05 for compositions and methods for the detection diagnosis and therapy of hematological malignancies. The applicant listed for this patent is CORIXA CORPORATION. Invention is credited to Paul A. Algate, Lauren Carter, Jonathan David Clapper, Alexander Gaiger, Jane Mannion, Patricia Dianne McNeill, Nadia Ordonez, Aijun Wang.

Application Number	20170002088 15/269027
Document ID	/
Family ID	27609444
Filed Date	2017-01-05

United States Patent Application	20170002088
Kind Code	A1
Algate; Paul A. ; et al.	January 5, 2017

COMPOSITIONS AND METHODS FOR THE DETECTION DIAGNOSIS AND THERAPY OF HEMATOLOGICAL MALIGNANCIES

Abstract

Disclosed are methods and compositions for the detection, diagnosis, prognosis, and therapy of hematological malignancies, and in particular, B cell leukemias, lymphomas and multiple myelomas. Disclosed are compositions, methods and kits for eliciting immune and T cell responses to specific malignancy-related antigenic polypeptides and antigenic polypeptide fragments thereof in an animal. Also disclosed are compositions and methods for use in the identification of cells and biological samples containing one or more hematological malignancy-related compositions, and methods for the detection and diagnosis of such diseases and affected cell types. Also disclosed are diagnostic and therapeutic kits, as well as methods for the diagnosis, therapy and/or prevention of a variety of leukemias and lymphomas.

Inventors:

Algate; Paul A.; (Issaquah, WA) ; Carter; Lauren; (Seattle, WA) ; Clapper; Jonathan David; (Seattle, WA) ; Gaiger; Alexander; (Vienna, AT) ; Mannion; Jane; (Edmonds, WA) ; McNeill; Patricia Dianne; (Federal Way, WA) ; Ordonez; Nadia; (Brier, WA) ; Wang; Aijun; (Issaquah, WA)

Applicant:

Name	City	State	Country	Type
CORIXA CORPORATION	Wilmington	DE	US

Family ID:

27609444

Appl. No.:

15/269027

Filed:

September 19, 2016

Related U.S. Patent Documents


Application Number	Filing Date	Patent Number
14298475	Jun 6, 2014
15269027
10501841	Aug 31, 2005	8920776
PCT/US03/02353	Jan 22, 2003
14298475

Current U.S. Class:	1/1
Current CPC Class:	A61P 35/00 20180101; C12Q 1/6886 20130101; G01N 33/57426 20130101; C07K 16/28 20130101; C07K 2317/55 20130101; A61K 38/00 20130101; C12Q 2600/106 20130101; C07K 16/18 20130101; C07K 2317/569 20130101; A61K 39/0011 20130101; A61K 39/00 20130101; C07K 2317/24 20130101; C07K 2317/56 20130101; C12Q 2600/158 20130101; A61K 2039/505 20130101; C07K 14/47 20130101; C12Q 2600/112 20130101; C07K 2317/92 20130101; A61K 47/6851 20170801; C07K 16/3061 20130101; C12Q 2600/118 20130101; C07K 14/70535 20130101; A61K 51/1018 20130101
International Class:	C07K 16/30 20060101 C07K016/30; A61K 51/10 20060101 A61K051/10; C07K 16/28 20060101 C07K016/28; C12Q 1/68 20060101 C12Q001/68

Claims

1-54: (canceled)

55. A method for treating multiple myeloma in a mammalian subject wherein a B-cell from said subject overexpresses SEQ ID NO:4, comprising administering to said subject an effective amount of an isolated monoclonal antibody that specifically binds to a polypeptide comprising the sequence set forth in SEQ ID NO: 4, wherein said monoclonal antibody has a binding constant for SEQ ID NO:4 that exceeds 10.sup.3 L/mol.

56. The method of claim 55, wherein said antibody is a humanized antibody.

57. The method of claim 55, wherein said antibody is a chimeric antibody.

58. The method of claim 55, wherein said antibody is a Fab fragment.

59. The method of claim 55, wherein said antibody is a Fv fragment.

60. The method of claim 55, wherein said antibody is a scFv.

61. The method of claim 55, wherein said antibody further comprises a therapeutic moiety.

62. The method of claim 61, wherein the therapeutic moiety is a radionuclide.

63. The method of claim 62, wherein the radionuclide is chosen from: .sup.90Y, .sup.123I, .sup.125I, .sup.131I, .sup.186Re, .sup.211At, and .sup.212Bi.

64. The method of claim 55, wherein the mammalian subject is a human.

65. The method of claim 55, wherein administration is intravenous.

Description

CROSS-REFERENCES TO RELATED APPLICATIONS

[0001] This application is a continuation in part of U.S. patent application Ser. No. 10/057,475, filed Jan. 22, 2002, which is a continuation in part of U.S. patent application Ser. No. 10/040,862, filed Nov. 6, 2001, which is a continuation in part of U.S. Ser. No. 09/796,692 filed Mar. 1, 2001, which claims priority to U.S. Provisional Patent Application Ser. No. 60/186,126, filed Mar. 1, 2000; Ser. No. 60/190,479, filed Mar. 17, 2000; Ser. No. 60/200,545, filed Apr. 27, 2000; Ser. No. 60/200,303, filed Apr. 28, 2000; Ser. No. 60/200,779, filed Apr. 28, 2000; Ser. No. 60/200,999; filed May 1, 2000; Ser. No. 60/202,084, filed May 4, 2000; Ser. No. 60/206,201, filed May 22, 2000; Ser. No. 60/218,950, filed Jul. 14, 2000; Ser. No. 60/222,903, filed Aug. 3, 2000; Ser. No. 60/223,416, filed Aug. 4, 2000; and Ser. No. 60/223,378, filed Aug. 7, 2000; the entire specification, claims, sequences and figures of each of which is specifically incorporated herein by reference in its entirety without disclaimer and for all purposes.

STATEMENT AS TO RIGHTS TO INVENTIONS MADE UNDER FEDERALLY SPONSORED RESEARCH AND DEVELOPMENT

[0002] Not applicable.

REFERENCE TO A "SEQUENCE LISTING," A TABLE, OR A COMPUTER PROGRAM LISTING APPENDIX SUBMITTED ON A COMPACT DISK

[0003] Not applicable.

1. BACKGROUND OF THE INVENTION

[0004] 1.1 Field of the Invention

[0005] The present invention relates generally to the fields of cancer diagnosis and therapy. More particularly, it concerns the surprising discovery of compositions and methods for the detection and immunotherapy of hematological malignancies, and particularly, B cell leukemias, and lymphomas and multiple myelomas. The invention provides new, effective methods, compositions and kits for eliciting immune and T-cell response to antigenic polypeptides, and antigenic peptide fragments isolated therefrom, and methods for the use of such compositions for diagnosis, detection, treatment, monitoring, and/or prevention of various types of human hematological malignancies. In particular, the invention provides polypeptide, peptide, antibody, antigen binding fragment, hybridoma, host cell, vector, and polynucleotide compounds and compositions for use in identification and discrimination between various types of hematological malignancies, and methods for the detection, diagnosis, prognosis, monitoring, and therapy of such conditions in an affected animal.

[0006] 1.2 Description of Related Art

[0007] 1.2.1 Hematological Malignancies

[0008] Hematological malignancies, such as leukemias and lymphomas, are conditions characterized by abnormal growth and maturation of hematopoietic cells. Leukemias are generally neoplastic disorders of hematopoietic stem cells, and include adult and pediatric acute myeloid leukemia (AML), chronic myeloid leukemia (CML), acute lymphocytic leukemia (ALL), chronic lymphocytic leukemia (CLL) and secondary leukemia. Among lymphomas, there are two distinct groups: non-Hodgkin's lymphoma (NHL) and Hodgkin's disease. NHLs are the result of a clonal expansion of B- or T-cells, but the molecular pathogenesis of Hodgkin's disease, including lineage derivation and clonality, remains obscure. Other hematological malignancies include myelodysplastic syndromes (MDS), myeloproliferative syndromes (MPS) and multiple myeloma. Hematological malignancies are generally serious disorders, resulting in a variety of symptoms, including bone marrow failure and organ failure.

[0009] NHLs are the sixth most common cause of cancer related deaths in the United States. Only prostate, breast, lung, colorectal and bladder cancer currently exceed lymphoma in annual incidence. In 1995, more than 45,000 new NHLs were diagnosed, and over 21,000 patients died of these diseases. The average age of lymphoma patients is relatively young (42 years), and the resulting number of years of life lost to these diseases renders NHLs fourth in economic impact among cancers in the United States. In the past 15 years, the American Cancer Society reported a 50% increase in the incidence of NHLs, one of the largest increases for any cancer group. Much of this increase has been attributed to the development of lymphomas in younger men who have acquired AIDS. Lymphomas are also the third most common childhood malignancy and account for approximately 10% of cancers in children. The survival rate (all ages) varies from 73% (low risk) to 26% (high risk).

[0010] 1.3 Deficiencies in the Prior Art

[0011] Treatment for many hematological malignancies, including leukemias and lymphomas, remains difficult, and existing therapies are not universally effective. While treatments involving specific immunotherapy appear to have considerable potential, such treatments have been limited by the small number of known malignancy-associated antigens. Moreover the ability to detect such hematological malignancies in their early stages can be quite difficult depending upon the particular malady. The lack of a sufficient number of specific diagnostic and prognostic markers of the diseases, and identification of cells and tissues that can be affected, has significantly limited the field of oncology.

[0012] Accordingly, there remains a need in the art for improved methods for detecting, screening, diagnosis and treatment of hematological malignancies such as B cell leukemias and lymphomas and multiple myelomas. The present invention fulfills these and other inherent needs in the field, and provides significant advantages in the detection of cells, and cell types that express one or more polypeptides that have been shown to be over-expressed in one or more of such hematological malignancies.

2. SUMMARY OF THE INVENTION

[0013] The present invention addresses the foregoing long-felt need and other deficiencies in the art by identifying new and effective strategies for the identification, detection, screening, diagnosis, prognosis, prophylaxis, therapy, and immunomodulation of one or more hematological malignancies, and in particular, B cell leukemias and lymphomas, and multiple myelomas.

[0014] The present invention is based, in part, upon the surprising and unexpected discovery that certain previously unknown or unidentified human polypeptides, peptides, and antigenic fragments derived therefrom have now been identified that are overexpressed in one or more types of hematological malignancies. The genes encoding several of these polypeptides are now identified and obtained in isolated form, and have been characterized using a series of molecular biology methodologies including subtractive library analysis, microarray screening, polynucleotide sequencing, peptide and epitopic identification and characterization, as well as expression profiling, and in vitro whole gene cell priming. A set of these polynucleotides, and the polypeptides, peptides, and antigenic fragments they encode are now identified and implicated in the complex processes of hematological malignancy disease onset, progression, and/or outcome, and in particular, diseases such as leukemias and lymphomas.

[0015] The inventors have further demonstrated that a number of these polynucleotides, and their encoded polypeptides, as well as antibodies, antigen presenting cells, T cells, and the antigen binding fragments derived from such antibodies are useful in the development of particularly advantageous compositions and methods for the detection, diagnosis, prognosis, prophylaxis and/or therapy of one or more of these diseases, and particularly those conditions that are characterized by (a) an increased, altered, elevated, or sustained expression of one or more polynucleotides that comprise at least a first sequence region that comprises a nucleic acid sequence as disclosed in any one of SEQ ID NOS: 1-3, 5, 7, 9, 11, 13-14, 16-17, 19-20, 22-25, 27-28, 30-31, 33-34, 36, 38-39, 41-42, 44, 46-47, 49, 51, 53, 55, 57, 59-60, 62, 64-65, 67-70, 72-73, 75, 77-81, 83, 85-86, 88-100, 102-103, 105-106, 108, 110-113, 115-116, 118, or 124 or (b) an increased, altered, elevated, or sustained biological activity of one or more polypeptides that comprise at least a first sequence region that comprises an amino acid sequence as encoded by any one of SEQ ID NOS: 1-3, 5, 7, 9, 11, 13-14, 16-17, 19-20, 22-25, 27-28, 30-31, 33-34, 36, 38-39, 41-42, 44, 46-47, 49, 51, 53, 55, 57, 59-60, 62, 64-65, 67-70, 72-73, 75, 77-81, 83, 85-86, 88-100, 102-103, 105-106, 108, 110-113, 115-116, 118, or 124 or disclosed in any one of SEQ ID NOS: 4, 6, 8, 10, 12, 15, 18, 21, 26, 29, 32, 35, 37, 40, 43, 45, 48, 50, 52, 54, 56, 58, 61, 63, 66, 71, 74, 76, 82, 84, 87, 101, 104, 107, 109, 114, 117, or 119-121.

[0016] The present invention also provides methods and uses for one or more of the disclosed peptide, polypeptide, antibody, antigen binding fragment, and polynucleotide compositions of the present invention in generating an immune response or in generating a T-cell response in an animal, and in particular in a mammal such as a human. The invention also provides methods and uses for one or more of these compositions in the identification, detection, and quantitation of hematological malignancy compositions in clinical samples, isolated cells, whole tissues, and even affected individuals. The compositions and methods disclosed herein also may be used in the preparation of one or more diagnostic reagents, assays, medicaments, or therapeutics, for diagnosis and/or therapy of such diseases.

[0017] In a first important embodiment, there is provided a composition comprising at least a first isolated peptide or polypeptide that comprises an amino acid sequence that is at least about 80%, about 81%, about 82%, about 83%, about 84%, about 85%, about 86%, about 87%, about 88%, about 89%, about 90%, about 91%, about 92%, about 93%, about 94%, about 95%, about 96%, about 97%, about 98%, or about 99% identical to the amino acid sequence encoded by any one of SEQ ID NOS: 1-3, 5, 7, 9, 11, 13-14, 16-17, 19-20, 22-25, 27-28, 30-31, 33-34, 36, 38-39, 41-42, 44, 46-47, 49, 51, 53, 55, 57, 59-60, 62, 64-65, 67-70, 72-73, 75, 77-81, 83, 85-86, 88-100, 102-103, 105-106, 108, 110-113, 115-116, 118, or 124 or disclosed in any one of SEQ ID NOS: 4, 6, 8, 10, 12, 15, 18, 21, 26, 29, 32, 35, 37, 40, 43, 45, 48, 50, 52, 54, 56, 58, 61, 63, 66, 71, 74, 76, 82, 84, 87, 101, 104, 107, 109, 114, 117, or 119-121. Exemplary preferred sequences are those that comprise at least a first coding region that comprises an amino acid sequence that is at least about 85%, about 86%, about 87%, about 88%, about 89%, about 90%, about 91%, about 92%, about 93%, or about 94% identical to the amino acid sequence as encoded by any one of SEQ ID NOS: 1-3, 5, 7, 9, 11, 13-14, 16-17, 19-20, 22-25, 27-28, 30-31, 33-34, 36, 38-39, 41-42, 44, 46-47, 49, 51, 53, 55, 57, 59-60, 62, 64-65, 67-70, 72-73, 75, 77-81, 83, 85-86, 88-100, 102-103, 105-106, 108, 110-113, 115-116, 118, or 124 or disclosed in any one of SEQ ID NOS: 4, 6, 8, 10, 12, 15, 18, 21, 26, 29, 32, 35, 37, 40, 43, 45, 48, 50, 52, 54, 56, 58, 61, 63, 66, 71, 74, 76, 82, 84, 87, 101, 104, 107, 109, 114, 117, or 119-121, with those sequences that comprise at least a first coding region that comprises an amino acid sequence that is at least about 95%, about 96%, about 97%, about 98%, or about 99% identical to the amino acid sequence as encoded by any one of SEQ ID NOS: 1-3, 5, 7, 9, 11, 13-14, 16-17, 19-20, 22-25, 27-28, 30-31, 33-34, 36, 38-39, 41-42, 44, 46-47, 49, 51, 53, 55, 57, 59-60, 62, 64-65, 67-70, 72-73, 75, 77-81, 83, 85-86, 88-100, 102-103, 105-106, 108, 110-113, 115-116, 118, or 124 or disclosed in any one of SEQ ID NOS: 4, 6, 8, 10, 12, 15, 18, 21, 26, 29, 32, 35, 37, 40, 43, 45, 48, 50, 52, 54, 56, 58, 61, 63, 66, 71, 74, 76, 82, 84, 87, 101, 104, 107, 109, 114, 117, or 119-121 being examples of particularly preferred sequences in the practice of the present invention. Likewise, peptide and polypeptide compounds and compositions are also provided that comprise, consist essentially of, or consist of the amino acid sequence as encoded by any one of SEQ ID NOS: 1-3, 5, 7, 9, 11, 13-14, 16-17, 19-20, 22-25, 27-28, 30-31, 33-34, 36, 38-39, 41-42, 44, 46-47, 49, 51, 53, 55, 57, 59-60, 62, 64-65, 67-70, 72-73, 75, 77-81, 83, 85-86, 88-100, 102-103, 105-106, 108, 110-113, 115-116, 118, or 124 or disclosed in any one of SEQ ID NOS: 4, 6, 8, 10, 12, 15, 18, 21, 26, 29, 32, 35, 37, 40, 43, 45, 48, 50, 52, 54, 56, 58, 61, 63, 66, 71, 74, 76, 82, 84, 87, 101, 104, 107, 109, 114, 117, or 119-121.

[0018] In a similar fashion, there are also embodiments disclosed herein that provide compositions and methods for the detection, diagnosis, prognosis, prophylaxis, treatment, and therapy of B cell leukemia, lymphoma and multiple myeloma. Exemplary preferred peptide and polypeptide compounds and compositions relating to this aspect of the invention include, but are not limited to, those peptide and polypeptide compounds or compositions that comprise at least a first isolated peptide or polypeptide that comprises an amino acid sequence that is at least about 80%, about 81%, about 82%, about 83%, about 84%, about 85%, about 86%, about 87%, about 88%, about 89%, about 90%, about 91%, about 92%, about 93%, about 94%, about 95%, about 96%, about 97%, about 98%, or about 99% identical to the amino acid sequence as encoded by any one of SEQ ID NOS: 1-3, 5, 7, 9, 11, 13-14, 16-17, 19-20, 22-25, 27-28, 30-31, 33-34, 36, 38-39, 41-42, 44, 46-47, 49, 51, 53, 55, 57, 59-60, 62, 64-65, 67-70, 72-73, 75, 77-81, 83, 85-86, 88-100, 102-103, 105-106, 108, 110-113, 115-116, 118, or 124 or disclosed in any one of SEQ ID NOS: 4, 6, 8, 10, 12, 15, 18, 21, 26, 29, 32, 35, 37, 40, 43, 45, 48, 50, 52, 54, 56, 58, 61, 63, 66, 71, 74, 76, 82, 84, 87, 101, 104, 107, 109, 114, 117, or 119-121, and those that comprise at least a first coding region that comprises an amino acid sequence that is at least about 85%, about 86%, about 87%, about 88%, about 89%, about 90%, about 91%, about 92%, about 93%, or about 94% identical to the amino acid sequence as encoded by any one of SEQ ID NOS: 1-3, 5, 7, 9, 11, 13-14, 16-17, 19-20, 22-25, 27-28, 30-31, 33-34, 36, 38-39, 41-42, 44, 46-47, 49, 51, 53, 55, 57, 59-60, 62, 64-65, 67-70, 72-73, 75, 77-81, 83, 85-86, 88-100, 102-103, 105-106, 108, 110-113, 115-116, 118, or 124 or disclosed in any one of SEQ ID NOS: 4, 6, 8, 10, 12, 15, 18, 21, 26, 29, 32, 35, 37, 40, 43, 45, 48, 50, 52, 54, 56, 58, 61, 63, 66, 71, 74, 76, 82, 84, 87, 101, 104, 107, 109, 114, 117, or 119-121, and even those sequences that comprise at least a first coding region that comprises an amino acid sequence that is at least about 95%, about 96%, about 97%, about 98%, or about 99% identical to the amino acid sequence as encoded by any one of SEQ ID NOS: 1-3, 5, 7, 9, 11, 13-14, 16-17, 19-20, 22-25, 27-28, 30-31, 33-34, 36, 38-39, 41-42, 44, 46-47, 49, 51, 53, 55, 57, 59-60, 62, 64-65, 67-70, 72-73, 75, 77-81, 83, 85-86, 88-100, 102-103, 105-106, 108, 110-113, 115-116, 118, or 124 or disclosed in any one of SEQ ID NOS: 4, 6, 8, 10, 12, 15, 18, 21, 26, 29, 32, 35, 37, 40, 43, 45, 48, 50, 52, 54, 56, 58, 61, 63, 66, 71, 74, 76, 82, 84, 87, 101, 104, 107, 109, 114, 117, or 119-121.

[0019] Exemplary peptides of the present invention may be of any suitable length, depending upon the particular application thereof; and encompass those peptides that are about 9, about 10, about 15, about 20, about 25, about 30, about 35, about 40, about 45, about 50, about 55, about 60, about 65, about 70, about 75, about 80, about 85, about 90, about 95, or about 100 or so amino acids in length. Of course, the peptides of the invention may also encompass any intermediate lengths or integers within the stated ranges.

[0020] Exemplary polypeptides and proteins of the present invention may be of any suitable length, depending upon the particular application thereof, and encompass those polypeptides and proteins that are about 100, about 150, about 200, about 250, about 300, about 350, or about 400 or so amino acids in length. Of course, the polypeptides and proteins of the invention may also encompass any intermediate lengths or integers within the stated ranges.

[0021] The peptides, polypeptides, proteins, antibodies, and antigen binding fragments of the present invention will preferably comprise a sequence of at least about 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95 or 100 contiguous amino acids from any one of the peptides as encoded by any one of SEQ ID NOS: 1-3, 5, 7, 9, 11, 13-14, 16-17, 19-20, 22-25, 27-28, 30-31, 33-34, 36, 38-39, 41-42, 44, 46-47, 49, 51, 53, 55, 57, 59-60, 62, 64-65, 67-70, 72-73, 75, 77-81, 83, 85-86, 88-100, 102-103, 105-106, 108, 110-113, 115-116, 118, or 124 or disclosed in any one of SEQ ID NOS: 4, 6, 8, 10, 12, 15, 18, 21, 26, 29, 32, 35, 37, 40, 43, 45, 48, 50, 52, 54, 56, 58, 61, 63, 66, 71, 74, 76, 82, 84, 87, 101, 104, 107, 109, 114, 117, or 119-121.

[0022] Furthermore, the polypeptides, proteins, antibodies, and antigen binding fragments of the present invention will even more preferably comprise at least a first isolated coding region that comprises a sequence of at least about 100, 110, 120, 130, 140, 150, 160, 170, 180, 190, or 200 contiguous amino acids from any one of the peptides as encoded by any one of SEQ ID NOS: 1-3, 5, 7, 9, 11, 13-14, 16-17, 19-20, 22-25, 27-28, 30-31, 33-34, 36, 38-39, 41-42, 44, 46-47, 49, 51, 53, 55, 57, 59-60, 62, 64-65, 67-70, 72-73, 75, 77-81, 83, 85-86, 88-100, 102-103, 105-106, 108, 110-113, 115-116, 118, or 124 or disclosed in any one of SEQ ID NOS: 4, 6, 8, 10, 12, 15, 18, 21, 26, 29, 32, 35, 37, 40, 43, 45, 48, 50, 52, 54, 56, 58, 61, 63, 66, 71, 74, 76, 82, 84, 87, 101, 104, 107, 109, 114, 117, or 119-121.

[0023] Likewise, the polypeptides, proteins, antibodies, and antigen binding fragments of the present invention may comprise at least a first isolated coding region that comprises a substantially longer sequence, such as for example, one of at least about 200, 220, 240, 260, 280, or 300 or more contiguous amino acids from any one of the peptides as encoded by any one of SEQ ID NOS: 1-3, 5, 7, 9, 11, 13-14, 16-17, 19-20, 22-25, 27-28, 30-31, 33-34, 36, 38-39, 41-42, 44, 46-47, 49, 51, 53, 55, 57, 59-60, 62, 64-65, 67-70, 72-73, 75, 77-81, 83, 85-86, 88-100, 102-103, 105-106, 108, 110-113, 115-116, 118, or 124 or disclosed in any one of SEQ ID NOS: 4, 6, 8, 10, 12, 15, 18, 21, 26, 29, 32, 35, 37, 40, 43, 45, 48, 50, 52, 54, 56, 58, 61, 63, 66, 71, 74, 76, 82, 84, 87, 101, 104, 107, 109, 114, 117, or 119-121.

[0024] In illustrative embodiments, and particularly in those embodiments concerning methods and compositions relating to B cell leukemias, lymphomas and multiple myelomas, the polypeptides of the invention comprise an amino acid sequence that (a) comprises, (b) consists essentially of, or (c) consists of, the amino acid sequence encoded by any one of SEQ ID NOS: 1-3, 5, 7, 9, 11, 13-14, 16-17, 19-20, 22-25, 27-28, 30-31, 33-34, 36, 38-39, 41-42, 44, 46-47, 49, 51, 53, 55, 57, 59-60, 62, 64-65, 67-70, 72-73, 75, 77-81, 83, 85-86, 88-100, 102-103, 105-106, 108, 110-113, 115-116, 118, or 124 or disclosed in any one of SEQ ID NOS: 4, 6, 8, 10, 12, 15, 18, 21, 26, 29, 32, 35, 37, 40, 43, 45, 48, 50, 52, 54, 56, 58, 61, 63, 66, 71, 74, 76, 82, 84, 87, 101, 104, 107, 109, 114, 117, or 119-121.

[0025] The polypeptides and proteins of the invention preferably comprise an amino acid sequence that is encoded by at least a first nucleic acid segment that comprises an at least 21, 22, 23, 24, 25, 26, 27, 28, 29, or 30 contiguous nucleotide sequence of any one of SEQ ID NOS: 1-3, 5, 7, 9, 11, 13-14, 16-17, 19-20, 22-25, 27-28, 30-31, 33-34, 36, 38-39, 41-42, 44, 46-47, 49, 51, 53, 55, 57, 59-60, 62, 64-65, 67-70, 72-73, 75, 77-81, 83, 85-86, 88-100, 102-103, 105-106, 108, 110-113, 115-116, 118, or 124.

[0026] The polypeptides and proteins of the invention may also preferably comprise an amino acid sequence encoded by at least a first nucleic acid segment that comprises an at least about 31, 32, 33, 34, 35, 36, 37, 38, 39, or 40 contiguous nucleotide sequence of any one of SEQ ID NOS: 1-3, 5, 7, 9, 11, 13-14, 16-17, 19-20, 22-25, 27-28, 30-31, 33-34, 36, 38-39, 41-42, 44, 46-47, 49, 51, 53, 55, 57, 59-60, 62, 64-65, 67-70, 72-73, 75, 77-81, 83, 85-86, 88-100, 102-103, 105-106, 108, 110-113, 115-116, 118, or 124. The polypeptides and proteins of the invention may also preferably comprise one or more coding regions that comprise an amino acid sequence encoded by at least a first nucleic acid segment that comprises an at least about 41, 42, 43, 44, 45, 46, 47, 48, 49, or 50 contiguous nucleotide sequence of any one of SEQ ID NOS: 1-3, 5, 7, 9, 11, 13-14, 16-17, 19-20, 22-25, 27-28, 30-31, 33-34, 36, 38-39, 41-42, 44, 46-47, 49, 51, 53, 55, 57, 59-60, 62, 64-65, 67-70, 72-73, 75, 77-81, 83, 85-86, 88-100, 102-103, 105-106, 108, 110-113, 115-116, 118, or 124. The polypeptides and proteins of the invention may also preferably comprise one or more coding regions that comprise an amino acid sequence encoded by at least a first nucleic acid segment that comprises an at least about 51, 52, 53, 54, 55, 56, 57, 58, 59, or 60 contiguous nucleotide sequence of any one of SEQ ID NOS: 1-3, 5, 7, 9, 11, 13-14, 16-17, 19-20, 22-25, 27-28, 30-31, 33-34, 36, 38-39, 41-42, 44, 46-47, 49, 51, 53, 55, 57, 59-60, 62, 64-65, 67-70, 72-73, 75, 77-81, 83, 85-86, 88-100, 102-103, 105-106, 108, 110-113, 115-116, 118, or 124. The polypeptides and proteins of the invention may also preferably comprise one or more coding regions that comprise an amino acid sequence encoded by at least a first nucleic acid segment that comprises an at least about 70, 80, 90, 100, 110, 120, 130, 140 or 150 contiguous nucleotide sequence of any one of SEQ ID NOS: 1-3, 5, 7, 9, 11, 13-14, 16-17, 19-20, 22-25, 27-28, 30-31, 33-34, 36, 38-39, 41-42, 44, 46-47, 49, 51, 53, 55, 57, 59-60, 62, 64-65, 67-70, 72-73, 75, 77-81, 83, 85-86, 88-100, 102-103, 105-106, 108, 110-113, 115-116, 118, or 124. The polypeptides and proteins of the invention may also preferably comprise one or more coding regions that comprise an amino acid sequence encoded by at least a first nucleic acid segment that comprises an at least about 175, 200, 225, 250, 275, 300, 325, 350, 375, or 400 contiguous nucleotide sequence of any one of SEQ ID NOS: 1-3, 5, 7, 9, 11, 13-14, 16-17, 19-20, 22-25, 27-28, 30-31, 33-34, 36, 38-39, 41-42, 44, 46-47, 49, 51, 53, 55, 57, 59-60, 62, 64-65, 67-70, 72-73, 75, 77-81, 83, 85-86, 88-100, 102-103, 105-106, 108, 110-113, 115-116, 118, or 124. The polypeptides and proteins of the invention may also preferably comprise one or more coding regions that comprise an amino acid sequence encoded by at least a first nucleic acid segment that comprises an at least about 500, 600, 700, 800, 900, 1000, 1100, 1200, 1300, 1400, or 1500 contiguous nucleotide sequence of any one of SEQ ID NOS: 1-3, 5, 7, 9, 11, 13-14, 16-17, 19-20, 22-25, 27-28, 30-31, 33-34, 36, 38-39, 41-42, 44, 46-47, 49, 51, 53, 55, 57, 59-60, 62, 64-65, 67-70, 72-73, 75, 77-81, 83, 85-86, 88-100, 102-103, 105-106, 108, 110-113, 115-116, 118, or 124.

[0027] In a second important embodiment, there is provided a composition comprising at least a first isolated polynucleotide that comprises a nucleic acid sequence that is at least about 80%, about 81%, about 82%, about 83%, about 84%, about 85%, about 86%, about 87%, about 88%, about 89%, about 90%, about 91%, about 92%, about 93%, about 94%, about 95%, about 96%, about 97%, about 98%, or about 99% identical to the nucleic acid sequence of any one of SEQ ID NOS: 1-3, 5, 7, 9, 11, 13-14, 16-17, 19-20, 22-25, 27-28, 30-31, 33-34, 36, 38-39, 41-42, 44, 46-47, 49, 51, 53, 55, 57, 59-60, 62, 64-65, 67-70, 72-73, 75, 77-81, 83, 85-86, 88-100, 102-103, 105-106, 108, 110-113, 115-116, 118, or 124. Exemplary preferred sequences are those that comprise a nucleic acid sequence that is at least about 85%, about 86%, about 87%, about 88%, about 89%, about 90%, about 91%, about 92%, about 93%, or about 94% identical to the nucleic acid sequence of any one of SEQ ID NOS: 1-3, 5, 7, 9, 11, 13-14, 16-17, 19-20, 22-25, 27-28, 30-31, 33-34, 36, 38-39, 41-42, 44, 46-47, 49, 51, 53, 55, 57, 59-60, 62, 64-65, 67-70, 72-73, 75, 77-81, 83, 85-86, 88-100, 102-103, 105-106, 108, 110-113, 115-116, 118, or 124, with those sequences that comprise at least a nucleic acid sequence that is at least about 95%, about 96%, about 97%, about 98%, or about 99% identical to the nucleic acid sequence of any one of SEQ ID NOS: 1-3, 5, 7, 9, 11, 13-14, 16-17, 19-20, 22-25, 27-28, 30-31, 33-34, 36, 38-39, 41-42, 44, 46-47, 49, 51, 53, 55, 57, 59-60, 62, 64-65, 67-70, 72-73, 75, 77-81, 83, 85-86, 88-100, 102-103, 105-106, 108, 110-113, 115-116, 118, or 124 being examples of particularly preferred sequences in the practice of the present invention.

[0028] In embodiments that relate particularly to compositions and methods for the detection, diagnosis, prognosis, prophylaxis, treatment, and therapy of B cell leukemias, lymphomas, and multiple myelomas exemplary preferred polynucleotide compositions include those compositions that comprise at least a first isolated nucleic acid segment that comprises a sequence that is at least about 80%, about 81%, about 82%, about 83%, about 84%, about 85%, about 86%, about 87%, about 88%, about 89%, about 90%, about 91%, about 92%, about 93%, about 94%, about 95%, about 96%, about 97%, about 98%, or about 99% identical to the nucleic acid sequence of any one of SEQ ID NOS: 1-3, 5, 7, 9, 11, 13-14, 16-17, 19-20, 22-25, 27-28, 30-31, 33-34, 36, 38-39, 41-42, 44, 46-47, 49, 51, 53, 55, 57, 59-60, 62, 64-65, 67-70, 72-73, 75, 77-81, 83, 85-86, 88-100, 102-103, 105-106, 108, 110-113, 115-116, 118, or 124. Such polynucleotides will preferably comprise one or more isolated coding region, each of which may (a) comprise, (b) consist essentially of, or (c) consist of, the nucleic acid sequence of SEQ ID NOS: 1-3, 5, 7, 9, 11, 13-14, 16-17, 19-20, 22-25, 27-28, 30-31, 33-34, 36, 38-39, 41-42, 44, 46-47, 49, 51, 53, 55, 57, 59-60, 62, 64-65, 67-70, 72-73, 75, 77-81, 83, 85-86, 88-100, 102-103, 105-106, 108, 110-113, 115-116, 118, or 124.

[0029] Exemplary polynucleotides of the present invention may be of any suitable length, depending upon the particular application thereof, and encompass those polynucleotides that (a) are at least about, or (b) comprise at least a first isolated nucleic acid segment that is at least about 27, 30, 40, 50, 60, 70, 80, 90, 100, 110, 120, 120, 140, 150, 160, 170, 180, 190, 200, 210, 220, 230, 240, 250, 260, 270, 280, 290, 300, 320, 340, 360, 380, 400, 420, 440, 460, 480, 500, 520, 540, 560, 580, 600, 625, 650, 675, 700, 750, 800, 850, 900, 950, or 1000 or so nucleic acids in length, as well as longer polynucleotides that (a) are at least about, or (b) comprise at least a first isolated nucleic acid segment that is at least about 1000, 1025, 1050, 1075, 1100, 150, 1200, 1250, 1300, 1350, 1400, 1450, 1500, 1550, 1600, 1650, 1700, 1750, 1800, 1850, 1900, 1950, 2000, 2100, 2200, 2300, 2400, 2500, 2600, 2700, 2800, 2900, or 3000 or so nucleic acids in length, as well as substantially larger polynucleotides. Of course, the polynucleotides and nucleic acid segments of the invention may also encompass any intermediate lengths or integers within the stated ranges.

[0030] The compositions of the present invention may comprise a single polypeptide or polynucleotide, or alternatively, may comprise two or more such hematological malignancy compounds, such as for example, two or more polypeptides, two or more polynucleotides, or even combinations of one or more peptides or polypeptides, along with one or more polynucleotides. When two or more polypeptides are contemplated for particular applications, the second and/or third and/or fourth, etc. isolated peptides and/or polypeptides will preferably comprise an amino acid sequence that is at least about 91%, 93%, 95%, 97%, or 99% identical to the amino acid sequence encoded by any one of SEQ ID NOS: 1-3, 5, 7, 9, 11, 13-14, 16-17, 19-20, 22-25, 27-28, 30-31, 33-34, 36, 38-39, 41-42, 44, 46-47, 49, 51, 53, 55, 57, 59-60, 62, 64-65, 67-70, 72-73, 75, 77-81, 83, 85-86, 88-100, 102-103, 105-106, 108, 110-113, 115-116, 118, or 124 or disclosed in any one of SEQ ID NOS: 4, 6, 8, 10, 12, 15, 18, 21, 26, 29, 32, 35, 37, 40, 43, 45, 48, 50, 52, 54, 56, 58, 61, 63, 66, 71, 74, 76, 82, 84, 87, 101, 104, 107, 109, 114, 117, or 119-121. Alternatively, the polynucleotides of the invention may comprise one or more coding regions that encode a first fusion protein or peptide, such as an adjuvant-coding region fused in correct reading frame to one or more of the disclosed hematological malignancy peptides or polypeptides. Alternatively, the fusion protein may comprise a hematological malignancy polypeptide or peptide fused, in correct reading frame, to a detectable protein or peptide, or to an immunostimulant protein or peptide, or other such construct. Fusion proteins such as these are particularly useful in those embodiments relating to diagnosis, detection, and therapy of one or more of the hematological malignancies as discussed herein.

[0031] The invention also provides a composition comprising at least a first hybridoma cell line that produces a monoclonal antibody having immunospecificity for one or more of the peptides or polypeptides as disclosed herein, or at least a first monoclonal antibody, or an antigen-binding fragment thereof, that has immunospecificity for such a peptide or polypeptide. The antigen binding fragments may comprise a light chain variable region, a heavy-chain variable region, a Fab fragment, a F(ab).sub.2 fragment, an Fv fragment, an scFv fragment, or an antigen-binding fragment of such an antibody.

[0032] The invention also provides a composition comprising at least a first isolated antigen-presenting cell that expresses a peptide or polypeptide as disclosed herein, or a plurality of isolated T cells that specifically react with such a peptide or polypeptide. Such pluralities of isolated T cells may be stimulated or expanded by contacting the T cells with one or more peptides or polypeptides as described herein. The T cells may be cloned prior to expansion, and may be obtained from bone marrow, a bone marrow fraction, peripheral blood, or a peripheral blood fraction from a healthy mammal, or from a mammal that is afflicted with at least a first hematological malignancy such as leukemia or lymphoma.

[0033] As described above, the isolated polypeptides of the invention may be on the order of from 9 to about 1000 amino acids in length, or alternatively, may be on the order of from 50 to about 900 amino acids in length, from 75 to about 800 amino acids in length, from 100 to about 700 amino acids in length, or from 125 to about 600 amino acids in length, or any other such suitable range.

[0034] The isolated nucleic acid segments that encode such isolated polypeptides may be on the order of from 27 to about 10,000 nucleotides in length, from 150 to about 8000 nucleotides in length, from 250 to about 6000 nucleotides in length, from 350 to about 4000 nucleotides in length, or from 450 to about 2000 nucleotides in length, or any other such suitable range.

[0035] The nucleic acid segment may be operably positioned under the control of at least a first heterologous, recombinant promoter, such as a tissue-specific, cell-specific, inducible, or otherwise regulated promoter. Such promoters may be further controlled or regulated by the presence of one or more additional enhancers or regulatory regions depending upon the particular cell type in which expression of the polynucleotide is desired. The polynucleotides and nucleic acid segments of the invention may also be comprised within a vector, such as a plasmid, or viral vector. The polypeptides and polynucleotides of the invention may also be comprised within a host cell, such as a recombinant host cell, or a human host cell such as a blood or bone marrow cell.

[0036] The polynucleotides of the invention may comprise at least a first isolated nucleic acid segment operably attached, in frame, to at least a second isolated nucleic acid segment, such that the polynucleotide encodes a fusion protein in which the first peptide or polypeptide is linked to the second peptide or polypeptide.

[0037] The polypeptides of the present invention may comprise a contiguous amino acid of any suitable length, such as for example, those of about 2000, about 1900, about 1850, about 1800, about 1750, about 1700, about 1650, about 1600, about 1550, about 1500, about 1450, about 1400, about 1350, about 1300, about 1250, about 1200, about 1150, about 1100 amino acids, or about 1000 or so amino acids in length. Likewise, the polypeptides and peptides of the present invention may comprise slightly shorter contiguous amino acid coding regions, such as for example, those of about 950, about 900, about 850, about 800, about 750, about 700, about 650, about 600, about 550, about 500, about 450, about 400, about 350, about 300, about 250, about 200, about 150, or even about 100 amino acids or so in length.

[0038] In similar fashion, the polypeptides and peptides of the present invention may comprise even smaller contiguous amino acid coding regions, such as for example, those of about 95, about 90, about 85, about 80, about 75, about 70, about 65, about 60, about 55, about 50, about 45, about 40, about 35, about 30, about 25, about 20, about 15, or even about 9 amino acids or so in length.

[0039] In all such embodiments, those peptides and polypeptides having intermediate lengths including all integers within the preferred ranges (e.g., those peptides and polypeptides that comprise at least a first coding region of at least about 94, about 93, about 92, about 91, about 89, about 88, about 87, about 86, about 84, about 83, about 82, about 81, about 79, about 78, about 77, about 76, about 74, about 73, about 72, about 71, about 69, about 68, about 67, about 66 or so amino acids in length, etc.) are all contemplated to fall within the scope of the present invention.

[0040] In particular embodiments, the peptides and polypeptides of the present invention may comprise a sequence of at least about 9, or about 10, or about 11, or about 12, or about 13, or about 14, or about 15, or about 16, or about 17, or about 18, or about 19, or about 20, or about 21, or about 22, or about 23, or about 24, or about 25, or about 26, or about 27, or about 28, or about 29, or about 30, or about 31, or about 32, or about 33, or about 34, or about 35, or about 36, or about 37, or about 38, or about 39, or about 40, or about 41, or about 42, or about 43, or about 44, or about 45, or about 46, or about 47, or about 48, or about 49, or about 50 contiguous amino acids as disclosed in any one or more of the peptides encoded by any one of SEQ ID NOS: 1-3, 5, 7, 9, 11, 13-14, 16-17, 19-20, 22-25, 27-28, 30-31, 33-34, 36, 38-39, 41-42, 44, 46-47, 49, 51, 53, 55, 57, 59-60, 62, 64-65, 67-70, 72-73, 75, 77-81, 83, 85-86, 88-100, 102-103, 105-106, 108, 110-113, 115-116, 118, or 124 or disclosed in any one of SEQ ID NOS: 4, 6, 8, 10, 12, 15, 18, 21, 26, 29, 32, 35, 37, 40, 43, 45, 48, 50, 52, 54, 56, 58, 61, 63, 66, 71, 74, 76, 82, 84, 87, 101, 104, 107, 109, 114, 117, or 119-121 herein.

[0041] In other embodiments, the peptides and polypeptides of the present invention may comprise a sequence of at least about 51, or about 52, or about 53, or about 54, or about 55, or about 56, or about 57, or about 58, or about 59, or about 60, or about 61, or about 62, or about 63, or about 64, or about 65, or about 66, or about 67, or about 68, or about 69, or about 70, or about 71, or about 72, or about 73, or about 74, or about 75, or about 76, or about 77, or about 78, or about 79, or about 80, or about 81, or about 82, or about 83, or about 84, or about 85, or about 86, or about 87, or about 88, or about 89, or about 90, about 91, or about 92, or about 93, or about 94, or about 95, or about 96, or about 97, or about 98, or about 99, or 100 contiguous amino acids as disclosed in any one or more of the peptides encoded by any one of SEQ ID NOS: 1-3, 5, 7, 9, 11, 13-14, 16-17, 19-20, 22-25, 27-28, 30-31, 33-34, 36, 38-39, 41-42, 44, 46-47, 49, 51, 53, 55, 57, 59-60, 62, 64-65, 67-70, 72-73, 75, 77-81, 83, 85-86, 88-100, 102-103, 105-106, 108, 110-113, 115-116, 118, or 124 or disclosed in any one of SEQ ID NOS: 4, 6, 8, 10, 1.2, 15, 18, 21, 26, 29, 32, 35, 37, 40, 43, 45, 48, 50, 52, 54, 56, 58, 61, 63, 66, 71, 74, 76, 82, 84, 87, 101, 104, 107, 109, 114, 117, or 119-121 herein.

[0042] In still other embodiments, the preferred peptides and polypeptides of the present invention comprise a sequence of at least about 100, 125, 150, 175, 200, 225, 250, 275, 300, 325, 350, 375, or 400 or more contiguous amino acids as disclosed in any one or more of the peptides encoded by any one of SEQ ID NOS: 1-3, 5, 7, 9, 11, 13-14, 16-17, 19-20, 22-25, 27-28, 30-31, 33-34, 36, 38-39, 41-42, 44, 46-47, 49, 51, 53, 55, 57, 59-60, 62, 64-65, 67-70, 72-73, 75, 77-81, 83, 85-86, 88-100, 102-103, 105-106, 108, 110-113, 115-116, 118, or 124 or disclosed in any one of SEQ ID NOS: 4, 6, 8, 10, 12, 15, 18, 21, 26, 29, 32, 35, 37, 40, 43, 45, 48, 50, 52, 54, 56, 58, 61, 63, 66, 71, 74, 76, 82, 84, 87, 101, 104, 107, 109, 114, 117, or 119-121 herein.

[0043] The polypeptides of the invention typically will comprise at least a first contiguous amino acid sequence according to any one of the peptides encoded by any one of the above polynucleotides or disclosed in any one of SEQ ID NOs:10,471-10,474; SEQ ID NO: 10,481; SEQ ID NOs:10,599-10,819; SEQ ID NOs:10,820-10,842; SEQ ID NOs:10,849-10,908; and SEQ ID NOs:10,909-10,968, but may also, optionally comprise at least a second, at least a third, or even at least a fourth or greater contiguous amino acid sequence according to any one of the peptides encoded by any one of SEQ ID NOS: 1-3, 5, 7, 9, 11, 13-14, 16-17, 19-20, 22-25, 27-28, 30-31, 33-34, 36, 38-39, 41-42, 44, 46-47, 49, 51, 53, 55, 57, 59-60, 62, 64-65, 67-70, 72-73, 75, 77-81, 83, 85-86, 88-100, 102-103, 105-106, 108, 110-113, 115-116, 118, or 124 or disclosed in any one of SEQ ID NOS: 4, 6, 8, 10, 12, 15, 18, 21, 26, 29, 32, 35, 37, 40, 43, 45, 48, 50, 52, 54, 56, 58, 61, 63, 66, 71, 74, 76, 82, 84, 87, 101, 104, 107, 109, 114, 117, or 119-121. A single polypeptide may contain only a single coding region, or alternatively, a single polypeptide may comprise a plurality of identical or distinctly different contiguous amino acid sequences in accordance with any one of the peptides encoded by any one of SEQ ID NOS: 1-3, 5, 7, 9, 11, 13-14, 16-17, 19-20, 22-25, 27-28, 30-31, 33-34, 36, 38-39, 41-42, 44, 46-47, 49, 51, 53, 55, 57, 59-60, 62, 64-65, 67-70, 72-73, 75, 77-81, 83, 85-86, 88-100, 102-103, 105-106, 108, 110-113, 115-116, 118, or 124 or disclosed in any one of SEQ ID NOS: 4, 6, 8, 10, 12, 15, 18, 21, 26, 29, 32, 35, 37, 40, 43, 45, 48, 50, 52, 54, 56, 58, 61, 63, 66, 71, 74, 76, 82, 84, 87, 101, 104, 107, 109, 114, 117, or 119-121. In fact, the polypeptide may comprise a plurality of the same contiguous amino acid sequences, or they may comprise one or more different contiguous amino acid sequences of any of the peptides encoded by any one of SEQ ID NOS: 1-3, 5, 7, 9, 11, 13-14, 16-17, 19-20, 22-25, 27-28, 30-31, 33-34, 36, 38-39, 41-42, 44, 46-47, 49, 51, 53, 55, 57, 59-60, 62, 64-65, 67-70, 72-73, 75, 77-81, 83, 85-86, 88-100, 102-103, 105-106, 108, 110-113, 115-116, 118, or 124 or disclosed in any one of SEQ ID NOS: 4, 6, 8, 10, 12, 15, 18, 21, 26, 29, 32, 35, 37, 40, 43, 45, 48, 50, 52, 54, 56, 58, 61, 63, 66, 71, 74, 76, 82, 84, 87, 101, 104, 107, 109, 114, 117, or 119-121. For example, a single polypeptide can comprise a single contiguous amino acid sequence from one or more of the peptides encoded by any one of SEQ ID NOS: 1-3, 5, 7, 9, 11, 13-14, 16-17, 19-20, 22-25, 27-28, 30-31, 33-34, 36, 38-39, 41-42, 44, 46-47, 49, 51, 53, 55, 57, 59-60, 62, 64-65, 67-70, 72-73, 75, 77-81, 83, 85-86, 88-100, 102-103, 105-106, 108, 110-113, 115-116, 118, or 124 or disclosed in any one of SEQ ID NOS: 4, 6, 8, 10, 12, 15, 18, 21, 26, 29, 32, 35, 37, 40, 43, 45, 48, 50, 52, 54, 56, 58, 61, 63, 66, 71, 74, 76, 82, 84, 87, 101, 104, 107, 109, 114, 117, or 119-121, or alternatively, may comprise two or more distinctly different contiguous amino acid sequences from one or more of the peptides encoded by any one of SEQ ID NOS: 1-3, 5, 7, 9, 11, 13-14, 16-17, 19-20, 22-25, 27-28, 30-31, 33-34, 36, 38-39, 41-42, 44, 46-47, 49, 51, 53, 55, 57, 59-60, 62, 64-65, 67-70, 72-73, 75, 77-81, 83, 85-86, 88-100, 102-103, 105-106, 108, 110-113, 115-116, 118, or 124 or disclosed in any one of SEQ ID NOS: 4, 6, 8, 10, 12, 15, 18, 21, 26, 29, 32, 35, 37, 40, 43, 45, 48, 50, 52, 54, 56, 58, 61, 63, 66, 71, 74, 76, 82, 84, 87, 101, 104, 107, 109, 114, 117, or 119-121. In fact, the polypeptide may comprise 2, 3, 4, or even 5 distinct contiguous amino sequences of any one of the peptides encoded by any one of SEQ ID NOS: 1-3, 5, 7, 9, 11, 13-14, 16-17, 19-20, 22-25, 27-28, 30-31, 33-34, 36, 38-39, 41-42, 44, 46-47, 49, 51, 53, 55, 57, 59-60, 62, 64-65, 67-70, 72-73, 75, 77-81, 83, 85-86, 88-100, 102-103, 105-106, 108, 110-113, 115-116, 118, or 124 or disclosed in any one of SEQ ID NOS: 4, 6, 8, 10, 12, 15, 18, 21, 26, 29, 32, 35, 37, 40, 43, 45, 48, 50, 52, 54, 56, 58, 61, 63, 66, 71, 74, 76, 82, 84, 87, 101, 104, 107, 109, 114, 117, or 119-121. Alternatively, a single polypeptide may comprise 2, 3, 4, or even 5 distinct coding regions. For example, a polypeptide may comprise at least a first coding region that comprises a first contiguous amino acid sequence of any one of the peptides encoded by any one of SEQ ID NOS: 1-3, 5, 7, 9, 11, 13-14, 16-17, 19-20, 22-25, 27-28, 30-31, 33-34, 36, 38-39, 41-42, 44, 46-47, 49, 51, 53, 55, 57, 59-60, 62, 64-65, 67-70, 72-73, 75, 77-81, 83, 85-86, 88-100, 102-103, 105-106, 108, 110-113, 115-116, 118, or 124 or disclosed in any one of SEQ ID NOS: 4, 6, 8, 10, 12, 15, 18, 21, 26, 29, 32, 35, 37, 40, 43, 45, 48, 50, 52, 54, 56, 58, 61, 63, 66, 71, 74, 76, 82, 84, 87, 101, 104, 107, 109, 114, 117, or 119-121, and at least a second coding region that comprises a second contiguous amino acid sequence of any one of the peptides encoded by any one of SEQ ID NOS: 1-3, 5, 7, 9, 11, 13-14, 16-17, 19-20, 22-25, 27-28, 30-31, 33-34, 36, 38-39, 41-42, 44, 46-47, 49, 51, 53, 55, 57, 59-60, 62, 64-65, 67-70, 72-73, 75, 77-81, 83, 85-86, 88-100, 102-103, 105-106, 108, 110-113, 115-116, 118, or 124 or disclosed in any one of SEQ ID NOS: 4, 6, 8, 10, 12, 15, 18, 21, 26, 29, 32, 35, 37, 40, 43, 45, 48, 50, 52, 54, 56, 58, 61, 63, 66, 71, 74, 76, 82, 84, 87, 101, 104, 107, 109, 114, 117, or 119-121. In contrast, a polypeptide may comprise at least a first coding region that comprises a first contiguous amino acid sequence of any one of the peptides encoded by any one of SEQ ID NOS: 1-3, 5, 7, 9, 11, 13-14, 16-17, 19-20, 22-25, 27-28, 30-31, 33-34, 36, 38-39, 41-42, 44, 46-47, 49, 51, 53, 55, 57, 59-60, 62, 64-65, 67-70, 72-73, 75, 77-81, 83, 85-86, 88-100, 102-103, 105-106, 108, 110-113, 115-116, 118, or 124 or disclosed in any one of SEQ ID NOS: 4, 6, 8, 10, 12, 15, 18, 21, 26, 29, 32, 35, 37, 40, 43, 45, 48, 50, 52, 54, 56, 58, 61, 63, 66, 71, 74, 76, 82, 84, 87, 101, 104, 107, 109, 114, 117, or 119-121, and at least a second coding region that comprises a second distinctly different peptide or polypeptide, such as for example, an adjuvant or an immunostimulant peptide or polypeptide.

[0044] In such cases, the two coding regions may be separate on the same polypeptide, or the two coding regions may be operatively attached, each in the correct reading frame, such that a fusion polypeptide is produced, in which the first amino acid sequence of the first coding region is linked to the second amino acid sequence of the second coding region.

[0045] Throughout this disclosure, a phrase such as "a sequence as disclosed in SEQ ID NO:1 to SEQ ID NO:4" is intended to encompass any and all contiguous sequences disclosed by any one of these sequence identifiers. That is to say, "a sequence as disclosed in any of SEQ ID NO:1 through SEQ ID NO:4" means any sequence that is disclosed in any one of SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:3, or SEQ ID NO:4. Likewise, "a sequence as disclosed in any of SEQ ID NOs:25 to 37" means any sequence that is disclosed in any one of SEQ ID NO:25, SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:28, SEQ ID NO:29, SEQ ID NO:30, SEQ ID NO:31, SEQ ID NO:32, SEQ ID NO:33, SEQ ID NO:34, SEQ ID NO:35, SEQ ID NO:36, or SEQ ID NO:37, and so forth.

[0046] Likewise, a phrase such as "at least a first sequence from any one of SEQ ID NO:55 to SEQ ID NO:62" is intended to refer to a first sequence that is disclosed in any one of SEQ ID NO:55, SEQ ID NO:56, SEQ ID NO:57, SEQ ID NO:58, SEQ ID NO:59, SEQ ID NO:60, SEQ ID NO:61, or SEQ ID NO:62.

[0047] It will also be understood that the kits, and compositions of the present invention comprise in an overall and general sense at least one or more particular polynucleotides, polypeptides, and peptides that comprise one or more contiguous sequence regions from one or more of the nucleic acid sequences disclosed herein in SEQ ID NOS: 1-3, 5, 7, 9, 11, 13-14, 16-17, 19-20, 22-25, 27-28, 30-31, 33-34, 36, 38-39, 41-42, 44, 46-47, 49, 51, 53, 55, 57, 59-60, 62, 64-65, 67-70, 72-73, 75, 77-81, 83, 85-86, 88-100, 102-103, 105-106, 108, 110-113, 115-116, 118, or 124 or from one or more of the amino acid sequences encoded by any one of SEQ ID NOS: 1-3, 5, 7, 9, 11, 13-14, 16-17, 19-20, 22-25, 27-28, 30-31, 33-34, 36, 38-39, 41-42, 44, 46-47, 49, 51, 53, 55, 57, 59-60, 62, 64-65, 67-70, 72-73, 75, 77-81, 83, 85-86, 88-100, 102-103, 105-106, 108, 110-113, 115-116, 118, or 124 or disclosed in any one of SEQ ID NOS: 4, 6, 8, 10, 12, 15, 18, 21, 26, 29, 32, 35, 37, 40, 43, 45, 48, 50, 52, 54, 56, 58, 61, 63, 66, 71, 74, 76, 82, 84, 87, 101, 104, 107, 109, 114, 117, or 119-121, and that such peptide, polypeptide and polynucleotide compositions may be used in one or more of the particular methods and uses disclosed herein for the diagnosis, detection, prophylaxis, and therapy of one or more hematological cancers, and in particular, lymphomas of a variety of specific types. It will also be understood to the skilled artisan having benefit of the teachings of the present specification, that the peptide and polypeptide compositions may be used to generate a T cell or an immune response in an animal, and that such compositions may also be administered to an animal from which immunospecific antibodies and antigen binding fragments may be isolated or identified that specifically bind to such peptides or polypeptides. Such an artisan will also recognize that the polynucleotides identified by the present disclosure may be used to produce such peptides, polypeptides, antibodies, and antigen binding fragments, by recombinant protein production methodologies that are also within the capability of the skilled artisan having benefit of the specific amino acid and nucleic acid sequences provided herein.

[0048] Likewise, it will be understood by a skilled artisan in the field, that one or more of the disclosed compositions may used in one or more diagnostic or detection methodologies to identify certain antibodies, peptides, polynucleotides, or polypeptides in a biological sample, in a host cell, or even within the body or tissues of an animal. It will be understood by a skilled artisan in the field, that one or more of the disclosed nucleic acid or amino acid compositions may used in the preparation or manufacture of one or more medicaments for use in the diagnosis, detection, prognosis, prophylaxis, or therapy of one or more hematological malignancies in an animal, and particularly those malignant conditions disclosed and claimed herein.

[0049] It will also be readily apparent to those of skill in the art, that the methods, kits, and uses, of the present invention preferably employ one or more of the compounds and/or compositions disclosed herein that comprise one or more contiguous nucleotide sequences as may be presented in SEQ ID NOS: 1-3, 5, 7, 9, 11, 13-14, 16-17, 19-20, 22-25, 27-28, 30-31, 33-34, 36, 38-39, 41-42, 44, 46-47, 49, 51, 53, 55, 57, 59-60, 62, 64-65, 67-70, 72-73, 75, 77-81, 83, 85-86, 88-100, 102-103, 105-106, 108, 110-113, 115-116, 118, or 124 of the attached sequence listing.

[0050] Likewise, it will also be readily apparent to those of skill in the art, that the methods, kits, and uses, of the present invention may also employ one or more of the compounds and compositions disclosed herein that comprise one or more contiguous amino acid sequences of any of the peptides encoded by any one of SEQ ID NOS: 1-3, 5, 7, 9, 11, 13-14, 16-17, 19-20, 22-25, 27-28, 30-31, 33-34, 36, 38-39, 41-42, 44, 46-47, 49, 51, 53, 55, 57, 59-60, 62, 64-65, 67-70, 72-73, 75, 77-81, 83, 85-86, 88-100, 102-103, 105-106, 108, 110-113, 115-116, 118, or 124 or presented in any one of SEQ ID NOS: 4, 6, 8, 10, 12, 15, 18, 21, 26, 29, 32, 35, 37, 40, 43, 45, 48, 50, 52, 54, 56, 58, 61, 63, 66, 71, 74, 76, 82, 84, 87, 101, 104, 107, 109, 114, 117, or 119-121 of the attached sequence listing.

3. BRIEF DESCRIPTION OF THE DRAWINGS AND SEQUENCES

[0051] The invention may be understood by reference to the following description taken in conjunction with the accompanying drawings, in which like reference numerals identify like elements, and in which:

[0052] FIG. 1 illustrates a schematic outline of the microarray chip technology approach used to identify the cDNA targets of the present invention as described Section 5.1.

[0053] FIG. 2 illustrates a schematic outline of the general protocol for in vitro whole gene CD8.sup.+ T cell priming procedure used to generate antigen-specific lines and to identify clones of interest.

[0054] FIG. 3 illustrates a schematic outline of the general protocol for in vitro whole gene CD4.sup.+ T cell priming procedure used to generate antigen-specific lines and to identify clones of interest.

[0055] FIG. 4 illustrates the panel of probes used to identify cDNAs that are overexpressed in lymphoma cells.

[0056] FIG. 5 lists the antigens that have similar tissue expression profiles as the known therapeutics, CD20 and CD52.

[0057] FIG. 6 illustrates the results of the TMpred report for Ly1484 long and Ly1484 short.

[0058] FIG. 7 illustrates the results of the TSITES analysis of Ly1484 long.

[0059] FIG. 8 illustrates the results of the TSITES analysis of Ly1484 short.

[0060] SEQ ID NO:1 is a full-length cDNA for Ly1728P.

[0061] SEQ ID NO:2 is a full-length protein sequence for Ly1728P.

[0062] SEQ ID NO:3 is a full-length cDNA sequence of Ly1732P.

[0063] SEQ ID NO:4 is a full-length protein of Ly1732P.

[0064] SEQ ID NO:5 is a full length cDNA sequence of Ly1888P.

[0065] SEQ ID NO:6 is a full length protein sequence of Ly1888P.

[0066] SEQ ID NO:7 is a full length cDNA sequence of Ly1452_His-tag-fusion.

[0067] SEQ ID NO:8 is a full length protein sequence of Ly1452_His-tag-fusion.

[0068] SEQ ID NO:9 is a full length cDNA sequence of Ly1452P, splice variant 1.

[0069] SEQ ID NO:10 is a full length protein sequence of Ly1452P, splice variant 1.

[0070] SEQ ID NO:11 is a full length cDNA sequence of Ly1452P, splice variant 2.

[0071] SEQ ID NO:12 is a full length protein sequence of Ly1452P, splice variant 2.

[0072] SEQ ID NO:13 is a partial cDNA sequence of Ly1462P.

[0073] SEQ ID NO:14 is a full length cDNA sequence of Ly1462P.

[0074] SEQ ID NO:15 is a full length protein sequence of Ly1462P.

[0075] SEQ ID NO:16 is a partial cDNA sequence of Ly1484P.

[0076] SEQ ID NO:17 is a full length cDNA sequence of Ly1484P.

[0077] SEQ ID NO:18 is a full length protein sequence of Ly1484P.

[0078] SEQ ID NO:19 is a partial cDNA sequence of Ly1486P.

[0079] SEQ ID NO:20 is a full length cDNA sequence of Ly1486P.

[0080] SEQ ID NO:21 is a full length protein sequence of Ly1486P.

[0081] SEQ ID NO:22 is a partial cDNA sequence of Ly1677P.

[0082] SEQ ID NO:23 is a partial cDNA sequence of Ly1682P.

[0083] SEQ ID NO:24 is a partial cDNA sequence of Ly1693P.

[0084] SEQ ID NO:25 is a full-length cDNA sequence of Ly1693P.

[0085] SEQ ID NO:26 is a full-length protein sequence of Ly1693P.

[0086] SEQ ID NO:27 is a partial cDNA sequence of Ly1697P.

[0087] SEQ ID NO:28 is a full-length cDNA sequence of Ly1715P.

[0088] SEQ ID NO:29 is a full-length protein sequence of Ly1715P.

[0089] SEQ ID NO:30 is a partial cDNA sequence of Ly1727P.

[0090] SEQ ID NO:31 is a full-length cDNA sequence of Ly1727P.

[0091] SEQ ID NO:32 is a full-length protein sequence of Ly1727P.

[0092] SEQ ID NO:33 is a partial cDNA sequence of Ly1885P.

[0093] SEQ ID NO:34 is a full-length cDNA sequence of Ly1885P.

[0094] SEQ ID NO:35 is a full-length protein sequence of Ly1885P.

[0095] SEQ ID NO:36 is a partial cDNA sequence of Ly1905P.

[0096] SEQ ID NO:37 is a partial protein sequence of Ly1905P.

[0097] SEQ ID NO:38 is a partial cDNA sequence of Ly1905P.

[0098] SEQ ID NO:39 is a full-length cDNA sequence of Ly1905P.

[0099] SEQ ID NO:40 is a full-length protein sequence of Ly1905P.

[0100] SEQ ID NO:41 is a partial cDNA sequence of Ly663S.

[0101] SEQ ID NO:42 is a full-length cDNA sequence of Ly663S.

[0102] SEQ ID NO:43 is a full-length protein sequence of Ly663S.

[0103] SEQ ID NO:44 is a full-length cDNA sequence of Ly664S.

[0104] SEQ ID NO:45 is a full-length protein sequence of Ly664S.

[0105] SEQ ID NO:46 is a partial cDNA sequence of Ly667S.

[0106] SEQ ID NO:47 is a full-length cDNA sequence of Ly667S.

[0107] SEQ ID NO:48 is a full-length protein sequence of Ly667S.

[0108] SEQ ID NO:49 is a partial cDNA sequence of Ly677S.

[0109] SEQ ID NO:50 is a partial protein sequence of Ly677S.

[0110] SEQ ID NO:51 is a partial cDNA sequence of Ly677S.

[0111] SEQ ID NO:52 is a partial protein sequence of Ly677S.

[0112] SEQ ID NO:53 is a full-length cDNA sequence of Ly677S.

[0113] SEQ ID NO:54 is a full-length protein sequence of Ly677S.

[0114] SEQ ID NO:55 is a full-length cDNA sequence of Ly1891P.

[0115] SEQ ID NO:56 is a full-length protein sequence of Ly1891P.

[0116] SEQ ID NO:57 is a full-length cDNA sequence of CD138.

[0117] SEQ ID NO:58 is a full-length protein sequence of CD138.

[0118] SEQ ID NO:59 is a partial cDNA sequence of CD22.

[0119] SEQ ID NO:60 is a full-length cDNA sequence of CD22.

[0120] SEQ ID NO:61 is a full-length protein sequence of CD22.

[0121] SEQ ID NO:62 is a partial cDNA sequence of CD79beta.

[0122] SEQ ID NO:63 is a partial protein sequence of CD79beta.

[0123] SEQ ID NO:64 is a partial cDNA sequence of CD79beta.

[0124] SEQ ID NO:65 is a full-length cDNA sequence of CD79beta.

[0125] SEQ ID NO:66 is a full-length protein sequence of CD79beta.

[0126] SEQ ID NO:67 is a partial cDNA sequence of Ly1450P.

[0127] SEQ ID NO:68 is a partial cDNA sequence of Ly1450P.

[0128] SEQ ID NO:69 is a partial cDNA sequence of Ly1451P.

[0129] SEQ ID NO:70 is a partial cDNA sequence of Ly1451P.

[0130] SEQ ID NO:71 is a partial protein sequence of Ly1451P.

[0131] SEQ ID NO:7272>Ly1454P, Old-SEQ-ID_3577, partial cDNA

[0132] SEQ ID NO:73 is a full-length cDNA sequence of Ly1454P.

[0133] SEQ ID NO:74 is a full-length protein sequence of Ly1454P.

[0134] SEQ ID NO:75 is a partial cDNA sequence of Ly1485P.

[0135] SEQ ID NO:76 is a partial protein sequence of Ly1485P.

[0136] SEQ ID NO:77 is a partial cDNA sequence of Ly1485P.

[0137] SEQ ID NO:78 is a partial cDNA sequence of Ly1500P.

[0138] SEQ ID NO:79 is a full-length cDNA sequence of Ly1500P, splice variant 1.

[0139] SEQ ID NO:80 is a full-length protein sequence of Ly500P, splice variant 1.

[0140] SEQ ID NO:81 is a full-length cDNA sequence of Ly1500P, splice variant 2.

[0141] SEQ ID NO:82 is a full-length protein sequence of Ly1500P, splice variant 2.

[0142] SEQ ID NO:83 is a full-length cDNA sequence of Ly1500P, splice variant 3.

[0143] SEQ ID NO:84 is a full-length protein sequence of Ly1500P, splice variant 3.

[0144] SEQ ID NO:85 is a partial cDNA sequence of Ly1516P.

[0145] SEQ ID NO:86 is a full-length cDNA sequence of Ly1516P, splice variant 1.

[0146] SEQ ID NO:87 is a full-length protein sequence of Ly1516P, splice variant 1.

[0147] SEQ ID NO:88 is a partial cDNA sequence of Ly1516P, splice variant 2.

[0148] SEQ ID NO:89 is a partial cDNA sequence of Ly1516P, splice variant 3.

[0149] SEQ ID NO:90 is a partial cDNA sequence of Ly1678P.

[0150] SEQ ID NO:91 is a partial cDNA sequence of Ly1678P.

[0151] SEQ ID NO:92 is a partial cDNA sequence of Ly1678P.

[0152] SEQ ID NO:93 is a partial cDNA sequence of Ly1678P.

[0153] SEQ ID NO:94 is a partial cDNA sequence of Ly1680P.

[0154] SEQ ID NO:95 is a partial cDNA sequence of Ly1686P.

[0155] SEQ ID NO:96 is a partial cDNA sequence of Ly1687P.

[0156] SEQ ID NO:97 is a partial cDNA sequence of Ly1706P.

[0157] SEQ ID NO:98 is a partial cDNA sequence of Ly1712P.

[0158] SEQ ID NO:99 is a partial cDNA sequence of Ly1729P.

[0159] SEQ ID NO:100 is a full-length cDNA sequence of Ly1729P.

[0160] SEQ ID NO:101 is a full-length protein sequence of Ly1729P.

[0161] SEQ ID NO:102 is a partial cDNA sequence of Ly1848P.

[0162] SEQ ID NO:103 is a partial cDNA sequence of Ly1859P.

[0163] SEQ ID NO:104 is a partial protein sequence of Ly1859P.

[0164] SEQ ID NO:105 is a partial cDNA sequence of Ly1859P.

[0165] SEQ ID NO:106 is a full-length cDNA sequence of Ly1859P.

[0166] SEQ ID NO:107 is a full-length protein sequence of Ly1859P.

[0167] SEQ ID NO:108 is a full length cDNA sequence for Ly1866P

[0168] SEQ ID NO:109 is a full length protein sequence for Ly1866P

[0169] SEQ ID NO:110 is a partial cDNA sequence for Ly1867P.

[0170] SEQ ID NO:111 is a partial cDNA sequence for Ly1868P.

[0171] SEQ ID NO:112 is a partial cDNA sequence for Ly1886P.

[0172] SEQ ID NO:113 is a full length cDNA sequence for Ly669S.

[0173] SEQ ID NO:114 is a full length protein sequence for Ly669S.

[0174] SEQ ID NO:115 is a partial cDNA sequence for Ly672S.

[0175] SEQ ID NO:116 is a full length cDNA sequence for Ly672S.

[0176] SEQ ID NO:117 is a full length cDNA sequence for Ly672S.

[0177] SEQ ID NO:118 is a partial cDNA sequence of Ly675S.

[0178] SEQ ID NO:119 is a partial protein sequence of Ly675S.

[0179] SEQ ID NO:120 is a partial protein sequence of Ly1484P.

[0180] SEQ ID NO:121 is a partial protein sequence of Ly1484P.

[0181] SEQ ID NO:122 is a PCR primer sequence for His-Ly1452P.

[0182] SEQ ID NO:123 is a PCR primer sequence for His-Ly1452P.

[0183] SEQ ID NO:124 is an open reading frame sequence for Ly1451P.

4. DESCRIPTION OF ILLUSTRATIVE EMBODIMENTS

[0184] In order that the invention herein described may be more fully understood, the following description of various illustrative embodiments is set forth.

[0185] The present invention is generally directed to compositions and methods for the immunotherapy and diagnosis of Hematological malignancies, such as B cell leukemias and lymphomas and multiple myelomas.

[0186] 4.1 Methods of Nucleic Acid Delivery and DNA Transfection

[0187] In certain embodiments, it is contemplated that one or more RNA or DNA and/or substituted polynucleotide compositions disclosed herein will be used to transfect an appropriate host cell. Technology for introduction of RNAs and DNAs, and vectors comprising them into suitable host cells is well known to those of skill in the art. In particular, such polynucleotides may be used to genetically transform one or more host cells, when therapeutic administration of one or more active peptides, compounds or vaccines is achieved through the expression of one or more polynucleotide constructs that encode one or more therapeutic compounds of interest.

[0188] A variety of means for introducing polynucleotides and/or polypeptides into suitable target cells is known to those of skill in the art. For example, when polynucleotides are contemplated for delivery to cells, several non-viral methods for the transfer of expression constructs into cultured mammalian cells are available to the skilled artisan for his use. These include, for example, calcium phosphate precipitation (Graham and Van Der Eb, 1973; Chen and Okayama, 1987; Rippe et al., 1990); DEAE-dextran precipitation (Gopal, 1985); electroporation (Wong and Neumann, 1982; Fromm et al., 1985; Tur-Kaspa et al., 1986; Potter et al., 1984; Suzuki et al., 1998; Vanbever et al., 1998), direct microinjection (Capecchi, 1980; Harland and Weintraub, 1985), DNA-loaded liposomes (Nicolau and Sene, 1982; Fraley et al., 1979; Takakura, 1998) and lipofectamine-DNA complexes, cell sonication (Fechheimer et al., 1987), gene bombardment using high velocity microprojectiles (Yang et al., 1990; Klein et al., 1992), and receptor-mediated transfection (Curiel et al., 1991; Wagner et al., 1992; Wu and Wu, 1987; Wu and Wu, 1988). Some of these techniques may be successfully adapted for in vivo or ex vivo use.

[0189] A bacterial cell, a yeast cell, or an animal cell transformed with one or more of the disclosed expression vectors represent an important aspect of the present invention. Such transformed host cells are often desirable for use in the expression of the various DNA gene constructs disclosed herein. In some aspects of the invention, it is often desirable to modulate, regulate, or otherwise control the expression of the gene segments disclosed herein. Such methods are routine to those of skill in the molecular genetic arts. Typically, when increased or over-expression of a particular gene is desired, various manipulations may be employed for enhancing the expression of the messenger RNA, particularly by using an active promoter, and in particular, a tissue-specific promoter such as those disclosed herein, as well as by employing sequences, which enhance the stability of the messenger RNA in the particular transformed host cell.

[0190] Typically, the initiation and translational termination region will involve stop codon(s), a terminator region, and optionally, a polyadenylation signal. In the direction of transcription, namely in the 5' to 3' direction of the coding or sense sequence, the construct will involve the transcriptional regulatory region, if any, and the promoter, where the regulatory region may be either 5' or 3' of the promoter, the ribosomal binding site, the initiation codon, the structural gene having an open reading frame in phase with the initiation codon, the stop codon(s), the polyadenylation signal sequence, if any, and the terminator region. This sequence as a double strand may be used by itself for transformation of a microorganism or eukaryotic host, but will usually be included with a DNA sequence involving a marker, where the second DNA sequence may be joined to the expression construct during introduction of the DNA into the host.

[0191] Where no functional replication system is present, the construct will also preferably include a sequence of at least about 30 or about 40 or about 50 base pairs (bp) or so, preferably at least about 60, about 70, about 80, or about 90 to about 100 or so bp, and usually not more than about 500 to about 1000 or so bp of a sequence homologous with a sequence in the host. In this way, the probability of legitimate recombination is enhanced, so that the gene will be integrated into the host and stably maintained by the host. Desirably, the regulatory regions of the expression construct will be in close proximity to (and also operably positioned relative to) the selected therapeutic gene providing for complementation as well as the gene providing for the competitive advantage. Therefore, in the event that the therapeutic gene is lost, the resulting organism will be likely to also lose the gene providing for the competitive advantage, so that it will be unable to compete in the environment with the gene retaining the intact construct.

[0192] The selected therapeutic gene can be introduced between the transcriptional and translational initiation region and the transcriptional and translational termination region, so as to be under the regulatory control of the initiation region. This construct may be included in a plasmid, which will include at least one replication system, but may include more than one, where one replication system is employed for cloning during the development of the plasmid and the second replication system is necessary for functioning in the ultimate host, in this case, a mammalian host cell. In addition, one or more markers may be present, which have been described previously. Where integration is desired, the plasmid will desirably include a sequence homologous with the host genome.

[0193] Genes or other nucleic acid segments, as disclosed herein, can be inserted into host cells using a variety of techniques that are well known in the art. Five general methods for delivering a nucleic segment into cells have been described: (1) chemical methods (Graham and Van Der Eb, 1973); (2) physical methods such as microinjection (Capecchi, 1980), electroporation (U.S. Pat. No. 5,472,869; Wong and Neumann, 1982; Fromm et al., 1985), microprojectiles bombardment (U.S. Pat. No. 5,874,265, specifically incorporated herein by reference in its entirety), "gene gun" (Yang et al., 1990); (3) viral vectors (Eglitis and Anderson, 1988); (4) receptor-mediated mechanisms (Curiel et al., 1991; Wagner et al., 1992); and (5) bacterial-mediated transformation.

[0194] 4.2 Hematological Malignancy Related-Specific Antibodies and Antigen-Binding Fragments Thereof

[0195] The present invention further provides antibodies and antigen-binding fragments thereof, that specifically bind to (or are immunospecific for) at least a first peptide or peptide variant as disclosed herein. As used herein, an antibody or an antigen-binding fragment is said to "specifically bind" to a peptide if it reacts at a detectable level (within, for example, an ELISA) with the peptide, and does not react detectably with unrelated peptides or proteins under similar conditions. As used herein, "binding" refers to a non-covalent association between two separate molecules such that a "complex" is formed. The ability to bind may be evaluated by, for example, determining a binding constant for the formation of the complex. The binding constant is the value obtained when the concentration of the complex is divided by the product of the component concentrations. In the context of the present invention, in general, two compounds are said to "bind" when the binding constant for complex formation exceeds about 10.sup.3 L/mol. The binding constant maybe determined using methods well known in the art.

[0196] Any agent that satisfies the above requirements may be a binding agent. In illustrative embodiments, a binding agent is an antibody or an antigen-binding fragment thereof. Such antibodies may be prepared by any of a variety of techniques known to those of ordinary skill in the art (Harlow and Lane, 1988). In general, antibodies can be produced by cell culture techniques, including the generation of monoclonal antibodies as described herein, or via transfection of antibody genes into suitable bacterial or mammalian cell hosts, in order to allow for the production of recombinant antibodies. In one technique, an immunogen comprising the peptide is initially injected into any of a wide variety of mammals (e.g., mice, rats, rabbits, sheep or goats). In this step, the peptides of this invention may serve as the immunogen without modification. Alternatively, particularly for relatively short peptides, a superior immune response may be elicited if the peptide is joined to a carrier protein, such as bovine serum albumin or keyhole limpet hemocyanin. The immunogen is injected into the animal host, preferably according to a predetermined schedule incorporating one or more booster immunizations, and the animals are bled periodically. Polyclonal antibodies specific for the peptide may then be purified from such antisera by, for example, affinity chromatography using the peptide coupled to a suitable solid support.

[0197] "Antibody" refers to a polypeptide encoded by an immunoglobulin gene or fragments thereof that specifically binds and recognizes an antigen. The recognized immunoglobulin genes include the kappa, lambda, alpha, gamma, delta, epsilon, and mu constant region genes, as well as the myriad immunoglobulin variable region genes. Light chains are classified as either kappa or lambda. Heavy chains are classified as gamma, mu, alpha, delta, or epsilon, which in turn define the immunoglobulin classes, IgG, IgM, IgA, IgD and IgE, respectively.

[0198] An exemplary immunoglobulin (antibody) structural unit comprises a tetramer. Each tetramer is composed of two identical pairs of polypeptide chains, each pair having one "light" (about 25 kD) and one "heavy" chain (about 50-70 kD). The N-terminus of each chain defines a variable region of about 100 to 110 or more amino acids primarily responsible for antigen recognition, i.e., an antigen recognition domain. As used herein, "antigen recognition domain" means that part of the antibody, recombinant molecule, the fusion protein, or the immunoconjugate of the invention which recognizes the target or portions thereof. Typically the antigen recognition domain comprises the variable region of the antibody or a portion thereof, e.g., one, two, three, four, five, six, or more hypervariable regions. The terms "V.sub.H" or "VH" refer to the variable region of an immunoglobulin heavy chain, including an Fv, scFv, dsFv or Fab. The terms "V.sub.L" or "VL" refer to the variable region of an immunoglobulin light chain, including an Fv, scFv, dsFv or Fab.

[0199] Antibodies exist, e.g., as intact immunoglobulins or as a number of well-characterized fragments produced by digestion with various peptidases. Thus, for example, pepsin digests an antibody below the disulfide linkages in the hinge region to produce F(ab)'2, a dimer of Fab which itself is a light chain joined to VH-CH1 by a disulfide bond. The F(ab)'2 may be reduced under mild conditions to break the disulfide linkage in the hinge region, thereby converting the F(ab)'2 dimer into an Fab' monomer. The Fab' monomer is essentially Fab with part of the hinge region (see Fundamental Immunology (Paul ed., 3d ed. 1993). Thus, the term antibody, as used herein, also includes antibody fragments either produced by the modification of whole antibodies.

[0200] As used herein, "fragment" is defined as at least a portion of the variable region of the immunoglobulin molecule, which binds to its target, i.e. the antigen recognition domain or the antigen binding region. Some of the constant region of the immunoglobulin may be included. Examples of antibody functional fragments include, but are not limited to, complete antibody molecules, humanized antibodies, antibody fragments, such as Fv, single chain Fv (scFv), hypervariable regions ro complementarity determining regions (CDRs), V.sub.L (light chain variable region), V.sub.H (heavy chain variable region), Fab, F(ab)2' and any combination of those or any other portion of an immunoglobulin peptide capable of binding to target antigen (see, e.g., Fundamental Immunology (Paul ed., 4th. 1999). As appreciated by one of skill in the art, various antibody fragments can be obtained by a variety of methods, for example, digestion of an intact antibody with an enzyme, such as pepsin; or de novo synthesis. Antibody fragments are often synthesized de novo either chemically or by using recombinant DNA methodology. Thus, the term antibody, as used herein, includes antibody fragments either produced by the modification of whole antibodies, or those synthesized de novo using recombinant DNA methodologies (e.g., single chain Fv) or those identified using phage display libraries (see, e.g., McCafferty et al., (1990) Nature 348:552). The term antibody also includes bivalent or bispecific molecules, diabodies, triabodies, and tetrabodies. Bivalent and bispecific molecules are described in, e.g., Kostelny et al., J. Immunol. 148: 1547 (1992), Pack and Pluckthun, Biochemistry 31: 1579 (1992), Zhu et al. Protein Sci. 6: 781 (1997), Hu et al. Cancer Res. 56: 3055 (1996), Adams et al., Cancer Res. 53: 4026 (1993), and McCartney, et al., Protein Eng. 8: 301 (1995).

[0201] A "humanized antibody" refers to an antibody in which the antigen binding loops, i.e., complementarity determining regions (CDRs), comprised by the V.sub.H and V.sub.L regions are grafted to a human framework sequence. Typically, the humanized antibodies have the same binding specificity as the non-humanized antibodies described herein. Techniques for humanizing antibodies are well known in the art and are described in e.g., U.S. Pat. Nos. 4,816,567; 5,530,101; 5,859,205; 5,585,089; 5,693,761; 5,693,762; 5,777,085; 6,180,370; 6,210,671; and 6,329,511; WO 87/02671; EP Patent Application 0173494; Jones et al., Nature 321: 522 (1986); and Verhoyen et al., Science 239: 1534 (1988). Humanized antibodies are further described in, e.g., Winter and Milstein, Nature 349: 293 (1991).

[0202] For preparation of monoclonal or polyclonal antibodies, any technique known in the art can be used (see, e.g., Kohler & Milstein, Nature 256: 495-497 (1975); Kozbor et al., Immunology Today 4: 72 (1983); Cole et al., pp. 77-96 in MONOCLONAL ANTIBODIES AND CANCER THERAPY, Alan R. Liss, Inc. (1985)).

[0203] Methods of producing of polyclonal antibodies are known to those of skill in the art. In an exemplary method, an inbred strain of mice (e.g., BALB/C mice) or rabbits is immunized with the chelate or a close structural analogue using a standard adjuvant, such as Freund's adjuvant, and a standard immunization protocol. Alternatively, or in addition to the use of an adjuvant, the chelate is coupled to a carrier that is itself immunogenic (e.g., keyhole limpit hemocyanin ("KLH"). The animal's immune response to the immunogen preparation is monitored by taking test bleeds and determining the titer of reactivity to the beta subunits. When appropriately high titers of antibody to the immunogen are obtained, blood is collected from the animal and antisera are prepared. Further fractionation of the antisera to enrich for antibodies reactive to the protein can be done if desired.

[0204] Monoclonal antibodies are obtained by various techniques familiar to those skilled in the art. Briefly, spleen cells from an animal immunized with a desired antigen are immortalized, commonly by fusion with a myeloma cell (see, for example, Kohler & Milstein, Eur. J. Immunol. 6: 511-519 (1976)). Alternative methods of immortalization include transformation with Epstein Barr Virus, oncogenes, or retroviruses, or other methods well known in the art. Colonies arising from single immortalized cells are screened for production of antibodies of the desired specificity and affinity for the antigen, and yield of the monoclonal antibodies produced by such cells may be enhanced by various techniques, including injection into the peritoneal cavity of a vertebrate host. Alternatively, one may isolate DNA sequences which encode a monoclonal antibody or a binding fragment thereof by screening a DNA library from human B cells according to the general protocol outlined by Huse et al., Science 246: 1275-1281 (1989).

[0205] Monoclonal antibodies and polyclonal sera are collected and titered against the immunogen in an immunoassay, for example, a solid phase immunoassay with the immunogen immobilized on a solid support. Typically, polyclonal antisera with a titer of 10.sup.4 or greater are selected and tested for cross reactivity against different chelates, using a competitive binding immunoassay. Specific polyclonal antisera and monoclonal antibodies will usually bind with a K.sub.d of at least about 0.1 mM, more usually at least about 1 .mu.M, preferably, at least about 0.1 .mu.M or better, and most preferably, 0.01 .mu.M or better.

[0206] Techniques for the production of single chain antibodies (U.S. Pat. No. 4,946,778) can be adapted to produce antibodies to reactive chelates and other diagnostic, analytical and therapeutic agents. Also, transgenic mice, or other organisms such as other mammals, may be used to express humanized antibodies. Alternatively, phage display technology can be used to produce and identify antibodies and heteromeric Fab fragments that specifically bind to selected antigens (see, e.g., McCafferty et al, Nature 348: 552-554 (1990); Marks et al., Biotechnology 10: 779-783 (1992)).

[0207] In an exemplary embodiment, an animal, such as a rabbit or mouse is immunized with a chelate, or an immunogenic construct. The antibodies produced as a result of the immunization are preferably isolated using standard methods.

[0208] In a still further preferred embodiment, the antibody is a humanized antibody. "Humanized" refers to a non-human polypeptide sequence that has been modified to minimize immunoreactivity in humans, typically by altering the amino acid sequence to mimic existing human sequences, without substantially altering the function of the polypeptide sequence (see, e.g., Jones et al., Nature 321: 522-525 (1986), and published UK patent application No. 8707252).

[0209] In another preferred embodiment, the present invention provides an antibody, as described above, further comprising a member selected from detectable labels, biologically active agents and combinations thereof attached to the antibody.

[0210] When the antibody is conjugated to a detectable label, the label is preferably a member selected from the group consisting of radioactive isotopes, fluorescent agents, fluorescent agent precursors, chromophores, enzymes and combinations thereof. Methods for conjugating various groups to antibodies are well known in the art. For example, a detectable label that is frequently conjugated to an antibody is an enzyme, such as horseradish peroxidase, alkaline phosphatase, .beta.-galactosidase, and glucose oxidase.

[0211] Methods of producing antibodies labeled with small molecules, for example, fluorescent agents, are well known in the art. Fluorescent labeled antibodies can be used in immunohistochemical staining (Osborn et al., Methods Cell Biol. 24: 97-132 (1990); in flow cytometry or cell sorting techniques (Ormerod, M. G. (ed.), FLOW CYTOMETRY. A PRACTICAL APPROACH, IRL Press, New York, 1990); for tracking and localization of antigens, and in various double-staining methods (Kawamura, A., Jr., FLUORESCENT ANTIBODY TECHNIQUES AND THEIR APPLICATION, Univ. Tokyo Press, Baltimore, 1977).

[0212] Many reactive fluorescent labels are available commercially (e.g., Molecular Probes, Eugene, Oreg.) or they can be synthesized using art-recognized techniques. In an exemplary embodiment, an antibody of the invention is labeled with an amine-reactive fluorescent agent, such as fluorescein isothiocyanate under mildly basic conditions. For other examples of antibody labeling techniques, see, Goding, J. Immunol. Methods 13: 215-226 (1976); and in, MONOCLONAL ANTIBODIES: PRINCIPLES AND PRACTICE, pp. 6-58, Academic Press, Orlando (1988).

[0213] Monoclonal antibodies specific for the antigenic peptide of interest may be prepared, for example, using the technique of Kohler and Milstein (1976) and improvements thereto. Briefly, these methods involve the preparation of immortal cell lines capable of producing antibodies having the desired specificity (i.e., reactivity with the peptide of interest). Such cell lines may be produced, for example, from spleen cells obtained from an animal immunized as described above. The spleen cells are then immortalized by, for example, fusion with a myeloma cell fusion partner, preferably one that is syngeneic with the immunized animal. A variety of fusion techniques may be employed. For example, the spleen cells and myeloma cells may be combined with a nonionic detergent for a few minutes and then plated at low density on a selective medium that supports the growth of hybrid cells, but not myeloma cells. A preferred selection technique uses HAT (hypoxanthine, aminopterin, thymidine) selection. After a sufficient time, usually about 1 to 2 weeks, colonies of hybrids are observed. Single colonies are selected and their culture supernatants tested for binding activity against the peptide. Hybridomas having high reactivity and specificity are preferred.

[0214] Monoclonal antibodies may be isolated from the supernatants of growing hybridoma colonies. In addition, various techniques may be employed to enhance the yield, such as injection of the hybridoma cell line into the peritoneal cavity of a suitable vertebrate host, such as a mouse. Monoclonal antibodies may then be harvested from the ascites fluid or the blood. Contaminants may be removed from the antibodies by conventional techniques, such as chromatography, gel filtration, precipitation, and extraction. The peptides of this invention may be used in the purification process in, for example, an affinity chromatography step.

[0215] Within certain embodiments, the use of antigen-binding fragments of antibodies may be preferred. Such fragments include Fab fragments, which may be prepared using standard techniques. Briefly, immunoglobulins may be purified from rabbit serum by affinity chromatography on Protein A bead columns (Harlow and Lane, 1988) and digested by papain to yield Fab and Fc fragments. The Fab and Fc fragments may be separated by affinity chromatography on Protein A bead columns.

[0216] Monoclonal antibodies and fragments thereof may be coupled to one or more therapeutic agents. Suitable agents in this regard include radioactive tracers and chemotherapeutic agents, which may be used, for example, to purge autologous bone marrow in vitro). Representative therapeutic agents include radionuclides, differentiation inducers, drugs, toxins, and derivatives thereof. Preferred radionuclides include .sup.90Y, .sup.123I, .sup.125I, .sup.131I, .sup.186Re, .sup.188Re, .sup.211At, and .sup.212Bi. Preferred drugs include methotrexate, and pyrimidine and purine analogs. Preferred differentiation inducers include phorbol esters and butyric acid. Preferred toxins include ricin, abrin, diptheria toxin, cholera toxin, gelonin, Pseudomonas exotoxin, Shigella toxin, and pokeweed antiviral protein. For diagnostic purposes, coupling of radioactive agents may be used to facilitate tracing of metastases or to determine the location of hematological malignancy related-positive tumors.

[0217] A therapeutic agent may be coupled (e.g., covalently bonded) to a suitable monoclonal antibody either directly or indirectly (e.g., via a linker group). A direct reaction between an agent and an antibody is possible when each possesses a substituent capable of reacting with the other. For example, a nucleophilic group, such as an amino or sulfhydryl group, on one may be capable of reacting with a carbonyl-containing group, such as an anhydride or an acid halide, or with an alkyl group containing a good leaving group (e.g., a halide) on the other.

[0218] Alternatively, it may be desirable to couple a therapeutic agent and an antibody via a linker group. A linker group can function as a spacer to distance an antibody from an agent in order to avoid interference with binding capabilities. A linker group can also serve to increase the chemical reactivity of a substituent on an agent or an antibody, and thus increase the coupling efficiency. An increase in chemical reactivity may also facilitate the use of agents, or functional groups on agents, which otherwise would not be possible.

[0219] It will be evident to those skilled in the art that a variety of bifunctional or polyfunctional reagents, both homo- and hetero-functional (such as those described in the catalog of the Pierce Chemical Co., Rockford, Ill.), may be employed as the linker group. Coupling may be affected, for example, through amino groups, carboxyl groups, and sulfhydryl groups or oxidized carbohydrate residues. There are numerous references describing such methodology, e.g., U.S. Pat. No. 4,671,958.

[0220] Where a therapeutic agent is more potent when free from the antibody portion of the immunoconjugates of the present invention, it may be desirable to use a linker group that is cleavable during or upon internalization into a cell. A number of different cleavable linker groups have been described. The mechanisms for the intracellular release of an agent from these linker groups include cleavage by reduction of a disulfide bond (U.S. Pat. No. 4,489,710), by irradiation of a photolabile bond (U.S. Pat. No. 4,625,014), by hydrolysis of derivatized amino acid side chains (U.S. Pat. No. 4,638,045), by serum complement-mediated hydrolysis (U.S. Pat. No. 4,671,958), and acid-catalyzed hydrolysis (U.S. Pat. No. 4,569,789).

[0221] It may be desirable to couple more than one agent to an antibody. In one embodiment, multiple molecules of an agent are coupled to one antibody molecule. In another embodiment, more than one type of agent may be coupled to one antibody. Regardless of the particular embodiment, immunoconjugates with more than one agent may be prepared in a variety of ways. For example, more than one agent may be coupled directly to an antibody molecule, or linkers that provide multiple sites for attachment can be used. Alternatively, a carrier can be used. A carrier may bear the agents in a variety of ways, including covalent bonding either directly or via a linker group. Suitable carriers include proteins such as albumins (U.S. Pat. No. 4,507,234), peptides and polysaccharides such as aminodextran (U.S. Pat. No. 4,699,784). A carrier may also bear an agent by noncovalent bonding or by encapsulation, such as within a liposome vesicle (U.S. Pat. No. 4,429,008 and U.S. Pat. No. 4,873,088). Carriers specific for radionuclide agents include radiohalogenated small molecules and chelating compounds. For example, U.S. Pat. No. 4,735,792 discloses representative radiohalogenated small molecules and their synthesis. A radionuclide chelate may be formed from chelating compounds that include those containing nitrogen and sulfur atoms as the donor atoms for binding the metal, or metal oxide, radionuclide. For example, U.S. Pat. No. 4,673,562 discloses representative chelating compounds and their synthesis.

[0222] A variety of routes of administration for the antibodies and immunoconjugates may be used. Typically, administration will be intravenous, intramuscular, subcutaneous or in the bed of a resected tumor. It will be evident that the precise dose of the antibody/immunoconjugate will vary depending upon the antibody used, the antigen density on the tumor, and the rate of clearance of the antibody.

[0223] Also provided herein are anti-idiotypic antibodies that mimic an immunogenic portion of hematological malignancy related. Such antibodies may be raised against an antibody, or an antigen-binding fragment thereof, that specifically binds to an immunogenic portion of hematological malignancy related, using well-known techniques. Anti-idiotypic antibodies that mimic an immunogenic portion of hematological malignancy related are those antibodies that bind to an antibody, or antigen-binding fragment thereof, that specifically binds to an immunogenic portion of hematological malignancy related, as described herein.

[0224] Irrespective of the source of the original hematological malignancy related peptide-specific antibody, the intact antibody, antibody multimers, or any one of a variety of functional, antigen-binding regions of the antibody may be used in the present invention. Exemplary functional regions include scFv, Fv, Fab', Fab and F(ab').sub.2 fragments of the hematological malignancy related peptide-specific antibodies. Techniques for preparing such constructs are well known to those in the art and are further exemplified herein.

[0225] The choice of antibody construct may be influenced by various factors. For example, prolonged half-life can result from the active readsorption of intact antibodies within the kidney, a property of the Fc piece of immunoglobulin. IgG based antibodies, therefore, are expected to exhibit slower blood clearance than their Fab' counterparts. However, Fab' fragment-based compositions will generally exhibit better tissue penetrating capability.

[0226] Antibody fragments can be obtained by proteolysis of the whole immunoglobulin by the non-specific thiol protease, papain. Papain digestion yields two identical antigen-binding fragments, termed "Fab fragments," each with a single antigen-binding site, and a residual "Fc fragment."

[0227] Papain should first be activated by reducing the sulfhydryl group in the active site with cysteine, 2-mercaptoethanol or dithiothreitol. Heavy metals in the stock enzyme should be removed by chelation with EDTA (2 mM) to ensure maximum enzyme activity. Enzyme and substrate are normally mixed together in the ratio of 1:100 by weight. After incubation, the reaction can be stopped by irreversible alkylation of the thiol group with iodoacetamide or simply by dialysis. The completeness of the digestion should be monitored by SDS-PAGE and the various fractions separated by Protein A-Sepharose or ion exchange chromatography.

[0228] The usual procedure for preparation of F(ab).sub.2 fragments from IgG of rabbit and human origin is limited proteolysis by the enzyme pepsin. The conditions, 100.times. antibody excess wt./wt. in acetate buffer at pH 4.5, 37.degree. C., suggest that antibody is cleaved at the C-terminal side of the inter-heavy-chain disulfide bond. Rates of digestion of mouse IgG may vary with subclass and it may be difficult to obtain high yields of active F(ab')2 fragments without some undigested or completely degraded IgG. In particular, IgG.sub.2b is highly susceptible to complete degradation. The other subclasses require different incubation conditions to produce optimal results, all of which is known in the art.

[0229] Pepsin treatment of intact antibodies yields an F(ab).sub.2 fragment that has two antigen-combining sites and is still capable of cross-linking antigen. Digestion of rat IgG by pepsin requires conditions including dialysis in 0.1 M acetate buffer, pH 4.5, and then incubation for four hrs with 1% wt./wt. pepsin; IgG.sub.1 and IgG.sub.2a digestion is improved if first dialyzed against 0.1 M formate buffer, pH 2.8, at 4.degree. C., for 16 hrs followed by acetate buffer. IgG.sub.2b gives more consistent results with incubation in staphylococcal V8 protease (3% wt./wt.) in 0.1 M sodium phosphate buffer, pH 7.8, for four hrs at 37.degree. C.

[0230] A Fab fragment also contains the constant domain of the light chain and the first constant domain (CH1) of the heavy chain. Fab' fragments differ from Fab fragments by the addition of a few residues at the carboxyl terminus of the heavy chain CH1 domain including one or more cysteine(s) from the antibody hinge region. F(ab').sub.2 antibody fragments were originally produced as pairs of Fab' fragments that have hinge cysteines between them. Other chemical couplings of antibody fragments are also known.

[0231] The term "variable," as used herein in reference to antibodies, means that certain portions of the variable domains differ extensively in sequence among antibodies, and are used in the binding and specificity of each particular antibody to its particular antigen. However, the variability is not evenly distributed throughout the variable domains of antibodies. It is concentrated in three segments termed "hypervariable regions," both in the light chain and the heavy chain variable domains.

[0232] The more highly conserved portions of variable domains are called the framework region (FR). The variable domains of native heavy and light chains each comprise four FRs (FR1, FR2, FR3 and FR4, respectively), largely adopting a R-sheet configuration, connected by three hypervariable regions, which form loops connecting, and in some cases, forming part of, the P-sheet structure.

[0233] The hypervariable regions in each chain are held together in close proximity by the FRs and, with the hypervariable regions from the other chain, contribute to the formation of the antigen-binding site of antibodies (Kabat et al., 1991, specifically incorporated herein by reference). The constant domains are not involved directly in binding an antibody to an antigen, but exhibit various effector functions, such as participation of the antibody in antibody-dependent cellular toxicity.

[0234] The term "hypervariable region," as used herein, refers to the amino acid residues of an antibody that are responsible for antigen-binding. The hypervariable region comprises amino acid residues from a "complementarity determining region" or "CDR" (i.e. residues 24-34 (L1), 50-56 (L2) and 89-97 (L3) in the light chain variable domain and 31-35 (H1), 50-56 (H2) and 95-102 (H3) in the heavy chain variable domain (Kabat et al., 1991, specifically incorporated herein by reference) and/or those residues from a "hypervariable loop" (i.e., residues 26-32 (L1), 50-52 (L2) and 91-96 (L3) in the light chain variable domain and 26-32 (H1), 53-55 (H2) and 96-101 (H3) in the heavy chain variable domain). "Framework" or "FR" residues are those variable domain residues other than the hypervariable region residues as herein defined.

[0235] An "Fv" fragment is the minimum antibody fragment that contains a complete antigen-recognition and binding site. This region consists of a dimer of one heavy chain and one light chain variable domain in tight, con-covalent association. It is in this configuration that three hypervariable regions of each variable domain interact to define an antigen-binding site on the surface of the V.sub.H-V.sub.L dimer. Collectively, six hypervariable regions confer antigen-binding specificity to the antibody. However, even a single variable domain (or half of an Fv comprising only three hypervariable regions specific for an antigen) has the ability to recognize and bind antigen, although at a lower affinity than the entire binding site.

[0236] "Single-chain Fv" or "sFv" antibody fragments comprise the V.sub.H and V.sub.L domains of antibody, wherein these domains are present in a single polypeptide chain. Generally, the Fv polypeptide further comprises a polypeptide linker between the V.sub.H and V.sub.L domains that enables the sFv to form the desired structure for antigen binding.

[0237] "Diabodies" are small antibody fragments with two antigen-binding sites, which fragments comprise a heavy chain variable domain (V.sub.H) connected to a light chain variable domain (V.sub.L) in the same polypeptide chain (V.sub.H-V.sub.L). By using a linker that is too short to allow pairing between the two domains on the same chain, the domains are forced to pair with the complementary domains of another chain and create two antigen-binding sites. Diabodies are described in European Pat. Appl. No. EP 404,097 and Intl. Pat. Appl. Publ. No. WO 93/11161, each specifically incorporated herein by reference. "Linear antibodies", which can be bispecific or monospecific, comprise a pair of tandem Fd segments (V.sub.H--C.sub.H1-V.sub.H-C.sub.H1) that form a pair of antigen binding regions, as described in Zapata et al. (1995), specifically incorporated herein by reference.

[0238] Other types of variants are antibodies with improved biological properties relative to the parent antibody from which they are generated. Such variants, or second-generation compounds, are typically substitutional variants involving one or more substituted hypervariable region residues of a parent antibody. A convenient way for generating such substitutional variants is affinity maturation using phage display.

[0239] In affinity maturation using phage display, several hypervariable region sites (e.g., 6 to 7 sites) are mutated to generate all possible amino substitutions at each site. The antibody variants thus generated are displayed in a monovalent fashion from filamentous phage particles as fusions to the gene III product of M13 packaged within each particle. The phage-displayed variants are then screened for their biological activity (e.g., binding affinity) as herein disclosed. In order to identify candidate hypervariable region sites for modification, alanine-scanning mutagenesis can be performed on hypervariable region residues identified as contributing significantly to antigen binding.

[0240] Alternatively, or in addition, the crystal structure of the antigen-antibody complex be delineated and analyzed to identify contact points between the antibody and target. Such contact residues and neighboring residues are candidates for substitution. Once such variants are generated, the panel of variants is subjected to screening, and antibodies with analogues but different or even superior properties in one or more relevant assays are selected for further development.

[0241] In using a Fab' or antigen binding fragment of an antibody, with the attendant benefits on tissue penetration, one may derive additional advantages from modifying the fragment to increase its half-life. A variety of techniques may be employed, such as manipulation or modification of the antibody molecule itself, and also conjugation to inert carriers. Any conjugation for the sole purpose of increasing half-life, rather than to deliver an agent to a target, should be approached carefully in that Fab' and other fragments are chosen to penetrate tissues. Nonetheless, conjugation to non-protein polymers, such PEG and the like, is contemplated.

[0242] Modifications other than conjugation are therefore based upon modifying the structure of the antibody fragment to render it more stable, and/or to reduce the rate of catabolism in the body. One mechanism for such modifications is the use of D-amino acids in place of L-amino acids. Those of ordinary skill in the art will understand that the introduction of such modifications needs to be followed by rigorous testing of the resultant molecule to ensure that it still retains the desired biological properties. Further stabilizing modifications include the use of the addition of stabilizing moieties to either the N-terminal or the C-terminal, or both, which is generally used to prolong the half-life of biological molecules. By way of example only, one may wish to modify the termini by acylation or amination.

[0243] Moderate conjugation-type modifications for use with the present invention include incorporating a salvage receptor binding epitope into the antibody fragment. Techniques for achieving this include mutation of the appropriate region of the antibody fragment or incorporating the epitope as a peptide tag that is attached to the antibody fragment. Intl. Pat. Appl. Publ. No. WO 96/32478 is specifically incorporated herein by reference for the purposes of further exemplifying such technology. Salvage receptor binding epitopes are typically regions of three or more amino acids from one or two lops of the Fc domain that are transferred to the analogous position on the antibody fragment. The salvage receptor-binding epitopes disclosed in Intl. Pat. Appl. Publ. No. WO 98/45331 are incorporated herein by reference for use with the present invention.

[0244] 4.3 T Cell Compositions Specific for Hematological Malignancy-Related Peptides

[0245] Immunotherapeutic compositions may also, or alternatively, comprise T cells specific for hematological malignancy related. Such cells may generally be prepared in vitro or ex vivo, using standard procedures. For example, T cells may be present within (or isolated from) bone marrow, peripheral blood or a fraction of bone marrow or peripheral blood of a mammal, such as a patient, using a commercially available cell separation system, such as the Isolex.TM. System, available from Nexell Therapeutics, Inc. (Irvine, Calif.; see also U.S. Pat. No. 5,240,856; U.S. Pat. No. 5,215,926; Intl. Pat. Appl. Publ. No. WO 89/06280; Intl. Pat. Appl. Publ. No. WO 91/16116 and Intl. Pat. Appl. Publ. No. WO 92/07243). Alternatively, T cells may be derived from related or unrelated humans, non-human mammals, cell lines or cultures.

[0246] T cells may be stimulated with hematological malignancy related peptide, polynucleotide encoding a hematological malignancy related peptide and/or an antigen-presenting cell (APC) that expresses a hematological malignancy related peptide. Such stimulation is performed under conditions and for a time sufficient to permit the generation of T cells that are specific for the hematological malignancy related peptide. Preferably, a hematological malignancy related peptide or polynucleotide is present within a delivery vehicle, such as a microsphere, to facilitate the generation of antigen-specific T cells. Briefly, T cells, which may be isolated from a patient or a related or unrelated donor by routine techniques (such as by Ficoll/Hypaque.RTM. density gradient centrifugation of peripheral blood lymphocytes), are incubated with hematological malignancy related peptide. For example, T cells may be incubated in vitro for 2-9 days (typically 4 days) at 37.degree. C. with hematological malignancy related peptide (e.g., 5 to 25 .mu.g/ml) or cells synthesizing a comparable amount of hematological malignancy related peptide. It may be desirable to incubate a separate aliquot of a T cell sample in the absence of hematological malignancy related peptide to serve as a control.

[0247] T cells are considered to be specific for a hematological malignancy related peptide if the T cells kill target cells coated with a hematological malignancy related peptide or expressing a gene encoding such a peptide. T cell specificity may be evaluated using any of a variety of standard techniques. For example, within a chromium release assay or proliferation assay, a stimulation index of more than two fold increase in lysis and/or proliferation, compared to negative controls, indicates T cell specificity. Such assays may be performed, for example, as described in Chen et al. (1994). Alternatively, detection of the proliferation of T cells may be accomplished by a variety of known techniques. For example, T cell proliferation can be detected by measuring an increased rate of DNA synthesis (e.g., by pulse-labeling cultures of T cells with tritiated thymidine and measuring the amount of tritiated thymidine incorporated into DNA). Other ways to detect T cell proliferation include measuring increases in interleukin-2 (IL-2) production, Ca.sup.2+ flux, or dye uptake, such as 3-(4,5-dimethylthiazol-2-yl)-2,5-diphenyl-tetrazolium. Alternatively, synthesis of lymphokines (such as interferon-gamma) can be measured or the relative number of T cells that can respond to a hematological malignancy related peptide may be quantified. Contact with a hematological malignancy related peptide (200 ng/ml -100 .mu.g/ml, preferably 100 ng/ml -25 .mu.g/ml) for 3-7 days should result in at least a two-fold increase in proliferation of the T cells and/or contact as described above for 2-3 hrs should result in activation of the T cells, as measured using standard cytokine assays in which a two-fold increase in the level of cytokine release (e.g., TNF or IFN-.gamma.) is indicative of T cell activation (Coligan et al., 1998). hematological malignancy related specific T cells may be expanded using standard techniques. Within preferred embodiments, the T cells are derived from a patient or a related or unrelated donor and are administered to the patient following stimulation and expansion.

[0248] T cells that have been activated in response to a hematological malignancy related peptide, polynucleotide or hematological malignancy related-expressing APC may be CD4.sup.+ and/or CD8.sup.+. Specific activation of CD4.sup.+ or CD8.sup.+ T cells may be detected in a variety of ways. Methods for detecting specific T cell activation include detecting the proliferation of T cells, the production of cytokines (e.g., lymphokines), or the generation of cytolytic activity (i.e., generation of cytotoxic T cells specific for hematological malignancy related). For CD4.sup.+ T cells, a preferred method for detecting specific T cell activation is the detection of the proliferation of T cells. For CD8.sup.+ T cells, a preferred method for detecting specific T cell activation is the detection of the generation of cytolytic activity.

[0249] For therapeutic purposes, CD4.sup.+ or CD8.sup.+ T cells that proliferate in response to the hematological malignancy related peptide, polynucleotide or APC can be expanded in number either in vitro or in vivo. Proliferation of such T cells in vitro may be accomplished in a variety of ways. For example, the T cells can be re-exposed to hematological malignancy related peptide, with or without the addition of T cell growth factors, such as interleukin-2, and/or stimulator cells that synthesize a hematological malignancy related peptide. The addition of stimulator cells is preferred where generating CD8.sup.+ T cell responses. T cells can be grown to large numbers in vitro with retention of specificity in response to intermittent restimulation with hematological malignancy related peptide. Briefly, for the primary in vitro stimulation (IVS), large numbers of lymphocytes (e.g., greater than 4.times.10.sup.7) may be placed in flasks with media containing human serum. hematological malignancy related peptide (e.g., peptide at 10 .mu.g/ml) may be added directly, along with tetanus Stoxoid (e.g., 5 .mu.g/ml). The flasks may then be incubated (e.g., 37.degree. C. for 7 days). For a second IVS, T cells are then harvested and placed in new flasks with 2-3.times.10.sup.7 irradiated peripheral blood mononuclear cells. hematological malignancy related peptide (e.g., 10 .mu.g/ml) is added directly. The flasks are incubated at 37.degree. C. for 7 days. On day 2 and day 4 after the second IVS, 2-5 units of interleukin-2 (IL-2) may be added. For a third IVS, the T cells may be placed in wells and stimulated with the individual's own EBV transformed B cells coated with the peptide. IL-2 may be added on days 2 and 4 of each cycle. As soon as the cells are shown to be specific cytotoxic T cells, they may be expanded using a 10-day stimulation cycle with higher IL-2 (20 units) on days 2, 4 and 6.

[0250] Alternatively, one or more T cells that proliferate in the presence of hematological malignancy related peptide can be expanded in number by cloning. Methods for cloning cells are well known in the art, and include limiting dilution. Responder T cells may be purified from the peripheral blood of sensitized patients by density gradient centrifugation and sheep red cell rosetting and established in culture by stimulating with the nominal antigen in the presence of irradiated autologous filler cells. In order to generate CD4.sup.+ T cell lines, hematological malignancy related peptide is used as the antigenic stimulus and autologous peripheral blood lymphocytes (PBL) or lymphoblastoid cell lines (LCL) immortalized by infection with Epstein Barr virus are used as antigen-presenting cells. In order to generate CD8.sup.+ T cell lines, autologous antigen-presenting cells transfected with an expression vector that produces hematological malignancy related peptide may be used as stimulator cells. Established T cell lines may be cloned 2-4 days following antigen stimulation by plating stimulated T cells at a frequency of 0.5 cells per well in 96-well flat-bottom plates with 1.times.10.sup.6 irradiated PBL or LCL cells and recombinant interleukin-2 (rIL2) (50 U/ml). Wells with established clonal growth may be identified at approximately 2-3 weeks after initial plating and restimulated with appropriate antigen in the presence of autologous antigen-presenting cells, then subsequently expanded by the addition of low doses of rIL2 (10 U/ml) 2-3 days following antigen stimulation. T cell clones may be maintained in 24-well plates by periodic restimulation with antigen and rIL2 approximately every two weeks. Cloned and/or expanded cells may be administered back to the patient as described, for example, by Chang et al., (1996).

[0251] Within certain embodiments, allogeneic T-cells may be primed (i.e., sensitized to hematological malignancy related) in vivo and/or in vitro. Such priming may be achieved by contacting T cells with a hematological malignancy related peptide, a polynucleotide encoding such a peptide or a cell producing such a peptide under conditions and for a time sufficient to permit the priming of T cells. In general, T cells are considered to be primed if, for example, contact with a hematological malignancy related peptide results in proliferation and/or activation of the T cells, as measured by standard proliferation, chromium release and/or cytokine release assays as described herein. A stimulation index of more than two fold increase in proliferation or lysis, and more than three fold increase in the level of cytokine, compared to negative controls indicates T-cell specificity. Cells primed in vitro may be employed, for example, within bone marrow transplantation or as donor lymphocyte infusion.

[0252] T cells specific for hematological malignancy related can kill cells that express hematological malignancy related protein. Introduction of genes encoding T-cell receptor (TCR) chains for hematological malignancy related are used as a means to quantitatively and qualitatively improve responses to hematological malignancy related bearing leukemia and cancer cells. Vaccines to increase the number of T cells that can react to hematological malignancy related positive cells are one method of targeting hematological malignancy related bearing cells. T cell therapy with T cells specific for hematological malignancy related is another method. An alternative method is to introduce the TCR chains specific for hematological malignancy related into T cells or other cells with lytic potential. In a suitable embodiment, the TCR alpha and beta chains are cloned out from a hematological malignancy related specific T cell line and used for adoptive T cell therapy, such as described in WO 96/30516, incorporated herein by reference.

[0253] 4.4 Pharmaceutical Compositions and Vaccine Formulations

[0254] Within certain aspects, peptides, polynucleotides, antibodies and/or T cells may be incorporated into pharmaceutical compositions or immunogenic compositions (i.e., vaccines). Alternatively, a pharmaceutical composition may comprise an antigen-presenting cell (e.g., a dendritic cell) transfected with a hematological malignancy related polynucleotide such that the antigen-presenting cell expresses a hematological malignancy related peptide. Pharmaceutical compositions comprise one or more such compounds or cells and a physiologically acceptable carrier or excipient. Vaccines may comprise one or more such compounds or cells and an immunostimulant, such as an adjuvant or a liposome (into which the compound is incorporated). An immunostimulant may be any substance that enhances or potentiates an immune response (antibody- and/or cell-mediated) to an exogenous antigen. Examples of immunostimulants include adjuvants, biodegradable microspheres (e.g., polylactic galactide) and liposomes (into which the compound is incorporated) (U.S. Pat. No. 4,235,877). Vaccine preparation is generally described in, for example, Powell and Newman (1995). Pharmaceutical compositions and vaccines within the scope of the present invention may also contain other compounds, which may be biologically active or inactive. For example, one or more immunogenic portions of other tumor antigens may be present, either incorporated into a fusion peptide or as a separate compound, within the composition or vaccine.

[0255] Within certain embodiments, pharmaceutical compositions and vaccines are designed to elicit T cell responses specific for a hematological malignancy related peptide in a patient, such as a human. In general, T cell responses may be favored through the use of relatively short peptides (e.g., comprising less than 23 consecutive amino acid residues of a native hematological malignancy related peptide, preferably 4-16 consecutive residues, more preferably 8-16 consecutive residues and still more preferably 8-10 consecutive residues). Alternatively, or in addition, a vaccine may comprise an immunostimulant that preferentially enhances a T cell response. In other words, the immunostimulant may enhance the level of a T cell response to a hematological malignancy related peptide by an amount that is proportionally greater than the amount by which an antibody response is enhanced. For example, when compared to a standard oil based adjuvant, such as CFA, an immunostimulant that preferentially enhances a T cell response may enhance a proliferative T cell response by at least two fold, a lytic response by at least 10%, and/or T cell activation by at least two fold compared to hematological malignancy related-negative control cell lines, while not detectably enhancing an antibody response. The amount by which a T cell or antibody response to a hematological malignancy related peptide is enhanced may generally be determined using any representative technique known in the art, such as the techniques provided herein.

[0256] A pharmaceutical composition or vaccine may contain DNA encoding one or more of the peptides as described above, such that the peptide is generated in situ. As noted above, the DNA may be present within any of a variety of delivery systems known to those of ordinary skill in the art, including nucleic acid expression systems, bacterial and viral expression systems and mammalian expression systems. Numerous gene delivery techniques are well known in the art (Rolland, 1998, and references cited therein). Appropriate nucleic acid expression systems contain the necessary DNA, cDNA or RNA sequences for expression in the patient (such as a suitable promoter and terminating signal). Bacterial delivery systems involve the administration of a bacterium (such as Bacillus-Calmette-Guerrin) that expresses an immunogenic portion of the peptide on its cell surface or secretes such an epitope. In a preferred embodiment, the DNA may be introduced using a viral expression system (e.g., vaccinia or other pox virus, retrovirus, or adenovirus), which may involve the use of a non-pathogenic (defective), replication competent virus (Fisher-Hoch et al., 1989; Flexner et al., 1989; Flexner et al., 1990; U.S. Pat. No. 4,603,112, U.S. Pat. No. 4,769,330, U.S. Pat. No. 5,017,487; Intl. Pat. Appl. Publ. No. WO 89/01973; U.S. Pat. No. 4,777,127; Great Britain Patent No. GB 2,200,651; European Patent No. EP 0,345,242; Intl. Pat. Appl. Publ. No. WO 91/02805; Berkner, 1988; Rosenfeld et al., 1991; Kolls et al., 1994; Kass-Eisler et al., 1993; Guzman et al., 1993a; and Guzman et al., 1993). Techniques for incorporating DNA into such expression systems are well known to those of ordinary skill in the art. The DNA may also be "naked," as described, for example, in Ulmer et al. (1993) and reviewed by Cohen (1993). The uptake of naked DNA may be increased by coating the DNA onto biodegradable beads, which are efficiently transported into the cells. It will be apparent that a vaccine may comprise both a polynucleotide and a peptide component. Such vaccines may provide for an enhanced immune response.

[0257] As noted above, a pharmaceutical composition or vaccine may comprise an antigen-presenting cell that expresses a hematological malignancy related peptide. For therapeutic purposes, as described herein, the antigen-presenting cell is preferably an autologous dendritic cell. Such cells may beprepared and transfected using standard techniques (Reeves et al., 1996; Tuting et al., 1998; and Nair et al., 1998). Expression of a hematological malignancy related peptide on the surface of an antigen-presenting cell may be confirmed by in vitro stimulation and standard proliferation as well as chromium release assays, as described herein.

[0258] It will be apparent to those of ordinary skill in the art having the benefit of the present teachings that a vaccine may contain pharmaceutically acceptable salts of the polynucleotides and peptides provided herein. Such salts may be prepared from pharmaceutically acceptable non-toxic bases, including organic bases (e.g., salts of primary, secondary and tertiary amines and basic amino acids) and inorganic bases (e.g., sodium, potassium, lithium, ammonium, calcium and magnesium salts). The phrases "pharmaceutically or pharmacologically acceptable" refer to molecular entities and compositions that do not produce an adverse, allergic or other significant untoward reaction when administered to an animal, or a human, as appropriate. As used herein, "pharmaceutically acceptable carrier" includes any and all solvents, dispersion media, coatings, antibacterial and antifungal agents, isotonic and absorption delaying agents and the like. The use of such media and agents for pharmaceutical active substances is well known in the art. Except insofar as any conventional media or agent is incompatible with the active ingredient, its use in the therapeutic compositions is contemplated. For human administration, preparations should meet sterility, pyrogenicity, and general safety and purity standards as required by the Food and Drug Administration Office of Biologics standards. Supplementary active ingredients can also be incorporated into the compositions.

[0259] While any suitable carrier known to those of ordinary skill in the art may be employed in the pharmaceutical compositions of this invention, the type of carrier will vary depending on the mode of administration. Compositions of the present invention may be formulated for any appropriate manner of administration, including for example, topical, oral, nasal, intravenous, intracranial, intraperitoneal, subcutaneous or intramuscular administration. For parenteral administration, such as subcutaneous injection, the carrier preferably comprises water, saline, alcohol, a fat, a wax or a buffer. For oral administration, any of the above carriers or a solid carrier, such as mannitol, lactose, starch, magnesium stearate, sodium saccharine, talcum, cellulose, glucose, sucrose, and magnesium carbonate, may be employed. Biodegradable microspheres (e.g., polylactate polyglycolate) may also be employed as carriers for the pharmaceutical compositions of this invention. Suitable biodegradable microspheres are disclosed, for example, in U.S. Pat. Nos. 4,897,268; 5,075,109; 5,928,647; 5,811,128; 5,820,883; 5,853,763; 5,814,344 and 5,942,252. For certain topical applications, formulation as a cream or lotion, using well-known components, is preferred.

[0260] Such compositions may also comprise buffers (e.g., neutral buffered saline or phosphate buffered saline), carbohydrates (e.g., glucose, mannose, sucrose or dextrans), mannitol, proteins, peptides or amino acids such as glycine, antioxidants, bacteriostats, chelating agents such as EDTA or glutathione, adjuvants (e.g., aluminum hydroxide), solutes that render the formulation isotonic, hypotonic or weakly hypertonic with the blood of a recipient, suspending agents, thickening agents and/or preservatives. Alternatively, compositions of the present invention may be formulated as a lyophilizate, or formulated with one or more liposomes, microspheres, nanoparticles, or micronized delivery systems using well-known technology.

[0261] Any of a variety of immunostimulants, such as adjuvants, may be employed in the preparation of vaccine compositions of this invention. Most adjuvants contain a substance designed to protect the antigen from rapid catabolism, such as aluminum hydroxide or mineral oil, and a stimulator of immune responses, such as lipid A, Bortadella pertussis or Mycobacterium tuberculosis derived proteins. Suitable adjuvants are commercially available as, for example, alum-based adjuvants (e.g., Alhydrogel, Rehydragel, aluminum phosphate, Algammulin, aluminum hydroxide); oil based adjuvants (Freund's Incomplete Adjuvant and Complete Adjuvant (Difco Laboratories, Detroit, Mich.), Specol, RIBI, TiterMax, Montanide ISA50 or Seppic MONTANIDE ISA 720); nonionic block copolymer-based adjuvants, cytokines (e.g., GM-CSF or Flat3-ligand); Merck Adjuvant 65 (Merck and Company, Inc., Rahway, N.J.); AS-2 (SmithKline Beecham, Philadelphia, Pa.); salts of calcium, iron or zinc; an insoluble suspension of acylated tyrosine; acylated sugars; cationically or anionically derivatized polysaccharides; polyphosphazenes; biodegradable microspheres; monophosphoryl lipid A and Quil A. Cytokines, such as GM-CSF or interleukin-2, -7, or -12, may also be used as adjuvants.

[0262] Hemocyanins and hemoerythrins may also be used in the invention. The use of hemocyanin from keyhole limpet (KLH) is particularly preferred, although other molluscan and arthropod hemocyanins and hemoerythrins may be employed. Various polysaccharide adjuvants may also be used. Polyamine varieties of polysaccharides are particularly preferred, such as chitin and chitosan, including deacetylated chitin.

[0263] A further preferred group of adjuvants are the muramyl dipeptide (MDP, N-acetylmuramyl-L-alanyl-a-isoglutamine) group of bacterial peptidoglycans. Derivatives of muramyl dipeptide, such as the amino acid derivative threonyl-MDP, and the fatty acid derivative MTPPE, are also contemplated.

[0264] U.S. Pat. No. 4,950,645 describes a lipophilic disaccharide-tripeptide derivative of muramyl dipeptide that is proposed for use in artificial liposomes formed from phosphatidyl choline and phosphatidyl glycerol. It is said to be effective in activating human monocytes and destroying tumor cells, but is non-toxic in generally high doses. The compounds of U.S. Pat. No. 4,950,645, and Intl. Pat. Appl. Publ. No. WO 91/16347 are also proposed for use in achieving particular aspects of the present invention.

[0265] BCG and BCG-cell wall skeleton (CWS) may also be used as adjuvants in the invention, with or without trehalose dimycolate. Trehalose dimycolate may be used itself. Azuma et al. (1988) show that trehalose dimycolate administration correlates with augmented resistance to influenza virus infection in mice. Trehalose dimycolate may be prepared as described in U.S. Pat. No. 4,579,945.

[0266] Amphipathic and surface-active agents, e.g., saponin and derivatives such as QS21 (Cambridge Biotech), form yet another group of preferred adjuvants for use with the immunogens of the present invention. Nonionic block copolymer surfactants (Rabinovich et al., 1994; Hunter et al., 1991) may also be employed. Oligonucleotides, as described by Yamamoto et al. (1988) are another useful group of adjuvants. Quil A and lentinen are also preferred adjuvants.

[0267] Superantigens are also contemplated for use as adjuvants in the present invention. "Superantigens" are generally bacterial products that stimulate a greater proportion of T lymphocytes than peptide antigens without a requirement for antigen processing (Mooney et. al., 1994). Superantigens include Staphylococcus exoproteins, such as the .alpha., .beta., .gamma. and .delta. enterotoxins from S. aureus and S. epidermidis, and the .alpha., .beta., .gamma. and .delta. E. coli exotoxins.

[0268] Common Staphylococcus enterotoxins are known as staphylococcal enterotoxin A (SEA) and staphylococcal enterotoxin B (SEB), with enterotoxins through E (SEE) being described (Rott et. al., 1992). Streptococcus pyogenes B (SEB), Clostridium perfringens enterotoxin (Bowness et. al., 1992), cytoplasmic membrane-associated protein (CAP) from S. pyogenes (Sato et. al., 1994) and toxic shock syndrome toxin-1 (TSST-1) from S. aureus (Schwab et. al., 1993) are further useful superantigens.

[0269] One group of adjuvants particularly preferred for use in the invention are the detoxified endotoxins, such as the refined detoxified endotoxin of U.S. Pat. No. 4,866,034. These refined detoxified endotoxins are effective in producing adjuvant responses in mammals.

[0270] The detoxified endotoxins may be combined with other adjuvants. Combination of detoxified endotoxins with trehalose dimycolate is contemplated, as described in U.S. Pat. No. 4,435,386. Combinations of detoxified endotoxins with trehalose dimycolate and endotoxic glycolipids is also contemplated (U.S. Pat. No. 4,505,899), as is combination of detoxified endotoxins with cell wall skeleton (CWS) or CWS and trehalose dimycolate, as described in U.S. Pat. Nos. 4,436,727, 4,436,728 and 4,505,900. Combinations of just CWS and trehalose dimycolate, without detoxified endotoxins are also envisioned to be useful, as described in U.S. Pat. No. 4,520,019.

[0271] MPL is currently one preferred immunopotentiating agent for use herein. References that concern the uses of MPL include Tomai et al. (1987), Chen et al. (1991) and Garg and Subbarao (1992), that each concern certain roles of MPL in the reactions of aging mice; Elliott et al. (1991), that concerns the D-galactosamine loaded mouse and its enhanced sensitivity to lipopolysaccharide and MPL; Chase et al. (1986), that relates to bacterial infections; and Masihi et al. (1988), that describes the effects of MPL and endotoxin on resistance of mice to Toxoplasma gondii. Fitzgerald (1991) also reported on the use of MPL to up-regulate the immunogenicty of a syphilis vaccine and to confer significant protection against challenge infection in rabbits.

[0272] Thus MPL is known to be safe for use, as shown in the above model systems. Phase-I clinical trials have also shown MPL to be safe for use (Vosika et al., 1984). Indeed, 100 .mu.g/m.sup.2 is known to be safe for human use, even on an outpatient basis (Vosika et al., 1984).

[0273] MPL generally induces polyclonal B cell activation (Baker et al., 1994), and has been shown to augment antibody production in many systems, for example, in immunologically immature mice (Baker et al., 1988); in aging mice (Tomai and Johnson, 1989); and in nude and Xid mice (Madonna and Vogel, 1986; Myers et al., 1995). Antibody production has been shown against erythrocytes (Hraba et al., 1993); T cell dependent and independent antigens; Pnu-immune vaccine (Garg and Subbarao, 1992); isolated tumor-associated antigens (U.S. Pat. No. 4,877,611); against syngeneic tumor cells (Livingston et al., 1985; Ravindranath et al., 1994a; b); and against tumor-associated gangliosides (Ravindranath et al., 1994a; b).

[0274] Another useful attribute of MPL is that is augments IgM responses, as shown by Baker et al. (1988a), who describe the ability of MPL to increase antibody responses in young mice. This is a particularly useful feature of an adjuvant for use in certain embodiments of the present invention. Myers et al. (1995) recently reported on the ability of MPL to induce IgM antibodies, by virtue T cell-independent antibody production.

[0275] In the Myers et al. (1995) studies, MPL was conjugated to the hapten, TNP. MPL was proposed for use as a carrier for other haptens, such as peptides.

[0276] MPL also activates and recruits macrophages (Verma et al., 1992). Tomai and Johnson (1989) showed that MPL-stimulated T cells enhance IL-1 secretion by macrophages. MPL is also known to activate superoxide production, lysozyme activity, phagocytosis, and killing of Candida in murine peritoneal macrophages (Chen et al., 1991).

[0277] The effects of MPL on T cells include the endogenous production of cytotoxic factors, such as TNF, in serum of BCG-primed mice by MPL (Bennett et al., 1988). Kovach et al. (1990) and Elliot et al. (1991) also show that MPL induces TNF activity. MPL is known to act with TNF-.alpha. to induce release of IFN-.gamma. by NK cells. IFN-.gamma. production by T cells in response to MPL was also documented by Tomai and Johnson (1989), and Odean et al. (1990).

[0278] MPL is also known to be a potent T cell adjuvant. For example, MPL stimulates proliferation of melanoma-antigen specific CTLs (Mitchell et al., 1988, 1993). Further, Baker et al. (1988b) showed that nontoxic MPL inactivated suppressor T cell activity. Naturally, in the physiological environment, the inactivation of T suppressor cells allows for increased benefit for the animal, as realized by, e.g., increased antibody production. Johnson and Tomai (1988) have reported on the possible cellular and molecular mediators of the adjuvant action of MPL.

[0279] MPL is also known to induce aggregation of platelets and to phosphorylate a platelet protein prior to induction of serotonin secretion (Grabarek et al., 1990). This study shows that MPL is involved in protein kinase C activation and signal transduction.

[0280] Many articles concern the structure and function of MPL include. These include Johnson et al. (1990), that describes the structural characterization of MPL homologs obtained from Salmonella minnesota Re595 lipopolysaccharide. The work of Johnson et al. (1990), in common with Grabarek et al. (1990), shows that the fatty acid moieties of MPL can vary, even in commercial species. In separating MPL into eight fractions by thin layer chromatography, Johnson et al. (1990) found that three were particularly active, as assessed using human platelet responses. The chemical components of the various MPL species were characterized by Johnson et al. (1990).

[0281] Baker et al. (1992) further analyzed the structural features that influence the ability of lipid A and its analogs to abolish expression of suppressor T cell activity. They reported that decreasing the number of phosphate groups in lipid A from two to one (i.e., creating monophosphoryl lipid A, MPL) as well as decreasing the fatty acyl content, primarily by removing the residue at the 3 position, resulted in a progressive reduction in toxicity; however, these structural modifications did not influence its ability to abolish the expression of Ts function (Baker et al., 1992). These types of MPL are ideal for use in the present invention.

[0282] Baker et al. (1992) also showed that reducing the fatty acyl content from five to four (lipid A precursor IV.sub.A or I.sub.a) eliminated the capacity to influence Ts function but not to induce polyclonal activation of B cells. These studies show that in order to be able to abolish the expression of Ts function, lipid A must be a glucosamine disaccharide; may have either one or two phosphate groups; and must have at least five fatty acyl groups. Also, the chain length of the nonhydroxylated fatty acid, as well as the location of acyloxyacyl groups (2' versus 3' position), may play an important role (Baker et al., 1992).

[0283] In examining the relationship between chain length and position of fatty acyl groups on the ability of lipid A to abolish the expression of suppressor T-cell (Ts) activity, Baker et al. (1994) found that fatty acyl chain lengths of C.sub.12 to C.sub.14 appeared to be optimal for bioactivity. Therefore, although their use is still possible, lipid A preparations with fatty acyl groups of relatively short chain length (C.sub.10 to C.sub.12 from Pseudomonas aeruginosa and Chromobacterium violaceum) or predominantly long chain length (C.sub.18 from Helicobacter pylori) are less preferred for use in this invention.

[0284] Baker et al. (1994) also showed that the lipid A proximal inner core region oligosaccharides of some bacterial lipopolysaccharides increase the expression of Ts activity; due mainly to the capacity of such oligosaccharides, which are relatively conserved in structure among gram-negative bacterial, to enlarge or expand upon the population of CD8.sup.+ Ts generated during the course of a normal antibody response to unrelated microbial antigens. The minimal structure required for the expression of the added immunosuppression observed was reported to be a hexasaccharide containing one 2-keto-3-deoxyoctonate residue, two glucose residues, and three heptose residues to which are attached two pyrophosphorylethanolamine groups (Baker et al., 1994). This information may be considered in utilizing or even designing further adjuvants for use in the invention.

[0285] In a generally related line of work, Tanamoto et al. (1994a; b; 1995) described the dissociation of endotoxic activities in a chemically synthesized Lipid A precursor after acetylation or succinylation. Thus, compounds such as "acetyl 406" and "succinyl 516" (Tanamoto et al., 1994a; b; 1995) are also contemplated for use in the invention.

[0286] Synthetic MPLs form a particularly preferred group of antigens. For example, Brade et al. (1993) described an artificial glycoconjugate containing the bisphosphorylated glucosamine disaccharide backbone of lipid A that binds to anti-Lipid A MAbs. This is one candidate for use in certain aspects of the invention.

[0287] The MPL derivatives described in U.S. Pat. No. 4,987,237 are particularly contemplated for use in the present invention. U.S. Pat. No. 4,987,237 describes MPL derivatives that contain one or more free groups, such as amines, on a side chain attached to the primary hydroxyl groups of the monophosphoryl lipid A nucleus through an ester group. The derivatives provide a convenient method for coupling the lipid A through coupling agents to various biologically active materials. The immunostimulant properties of lipid A are maintained. All MPL derivatives in accordance with U.S. Pat. No. 4,987,237 are envisioned for use in the MPL adjuvant-incorporated cells of this invention.

[0288] Various adjuvants, even those that are not commonly used in humans, may still be employed in animals, where, for example, one desires to raise antibodies or to subsequently obtain activated T cells. The toxicity or other adverse effects that may result from either the adjuvant or the cells, e.g., as may occur using non-irradiated tumor cells, is irrelevant in such circumstances.

[0289] Within the vaccines provided herein, the adjuvant composition is preferably designed to induce an immune response predominantly of the Th1 type. High levels of Th1-type cytokines (e.g., IFN-.gamma., TNF.alpha., IL-2 and IL-12) tend to favor the induction of cell-mediated immune responses to an administered antigen. In contrast, high levels of Th2-type cytokines (e.g., IL-4, IL-5, IL-6 and IL-10) tend to favor the induction of humoral immune responses. Following application of a vaccine as provided herein, a patient will support an immune response that includes Th1- and Th2-type responses. Within a preferred embodiment, in which a response is predominantly Th1-type, the level of Th1-type cytokines will increase to a greater extent than the level of Th2-type cytokines. The levels of these cytokines may be readily assessed using standard assays. For a review of the families of cytokines see e.g., Mosmann and Coffman (1989).

[0290] Preferred adjuvants for use in eliciting a predominantly Th1-type response include, for example, a combination of monophosphoryl lipid A, preferably 3-de-O-acylated monophosphoryl lipid A (3D-MPL), together with an aluminum salt. MPL adjuvants are available from Corixa Corporation (Seattle, Wash.; see e.g., U.S. Pat. Nos. 4,436,727; 4,877,611; 4,866,034 and 4,912,094, each of which is specifically incorporated herein by reference in its entirety). CpG-containing oligonucleotides (in which the CpG dinucleotide is unmethylated) also induce a predominantly Th1 response. Such oligonucleotides are well known and are described, for example, in Intl. Pat. Appl. Publ. No. WO 96/02555 and Intl. Pat. Appl. Publ. No. WO 99/33488. Immunostimulatory DNA sequences are also described, for example, by Sato et al. (1996). Another preferred adjuvant is a saponin, preferably QS21 (Aquila Biopharmaceuticals Inc., Framingham, Mass.), which may be used alone or in combination with other adjuvants. For example, an enhanced system involves the combination of a monophosphoryl lipid A and saponin derivative, such as the combination of QS21 and 3D-MPL (see e.g., Intl. Pat. Appl. Publ. No. WO 94/00153), or a less reactogenic composition where the QS21 is quenched with cholesterol (see e.g., Intl. Pat. Appl. Publ. No. WO 96/33739). Other preferred formulations comprise an oil-in-water emulsion and tocopherol. A particularly potent adjuvant formulation involving QS21, 3D-MPL and tocopherol in an oil-in-water emulsion has also been described (see e.g., Intl. Pat. Appl. Publ. No. WO 95/17210).

[0291] Other preferred adjuvants include Montanide ISA 720 (Seppic), SAF (Chiron), ISCOMS (CSL), MF-59 (Chiron), the SBAS series of adjuvants (e.g., SBAS-2 or SBAS-4, available from SmithKline Beecham, Rixensart, Belgium), Detox (Corixa Corporation), RC-529 (Corixa Corporation) and aminoalkyl glucosaminide 4-phosphates (AGPs).

[0292] Any vaccine provided herein may be prepared using well-known methods that result in a combination of one or more antigens, one or more immunostimulants or adjuvants and one or more suitable carriers, excipients, or pharmaceutically acceptable buffers. The compositions described herein may be administered as part of a sustained release formulation (i.e., a formulation such as a capsule, sponge or gel [composed of polysaccharides, for example] that effects a slow release of compound following administration). Such formulations may generally be prepared using well-known technology (Coombes et al., 1996) and administered by, for example, oral, rectal or subcutaneous implantation, or by implantation at the desired target site. Sustained-release formulations may contain a peptide, polynucleotide or antibody dispersed in a carrier matrix and/or contained within a reservoir surrounded by a rate-controlling membrane.

[0293] Carriers for use within such formulations are preferably biocompatible, and may also be biodegradable; preferably the formulation provides a relatively constant level of active component release. Such carriers include microparticles of poly(lactide-co-glycolide), as well as polyacrylate, latex, starch, cellulose and dextran. Other delayed-release carriers include supramolecular biovectors, which comprise a non-liquid hydrophilic core (e.g., a cross-linked polysaccharide or oligosaccharide) and, optionally, an external layer comprising an amphiphilic compound, such as a phospholipid (U.S. Pat. No. 5,151,254; Intl. Pat. Appl. Publ. No. WO 94/20078; Intl. Pat. Appl. Publ. No. WO/94/23701; and Intl. Pat. Appl. Publ. No. WO 96/06638). The amount of active compound contained within a sustained release formulation depends upon the site of implantation, the rate and expected duration of release and the nature of the condition to be treated or prevented.

[0294] Any of a variety of delivery vehicles may be employed within pharmaceutical compositions and vaccines to facilitate production of an antigen-specific immune response that targets tumor cells. Delivery vehicles include antigen-presenting cells (APCs), such as dendritic cells, macrophages, B cells, monocytes and other cells that may be engineered to be efficient APCs. Such cells may, but need not, be genetically modified to increase the capacity for presenting the antigen, to improve activation and/or maintenance of the T cell response, to have anti-tumor effects per se and/or to be immunologically compatible with the receiver (i.e., matched HLA haplotype). APCs may generally be isolated from any of a variety of biological fluids and organs, including tumor and peritumoral tissues, and may be autologous, allogeneic, syngeneic or xenogeneic cells.

[0295] Certain preferred embodiments of the present invention use dendritic cells or progenitors thereof as antigen-presenting cells. Dendritic cells are highly potent APCs (Banchereau and Steinman, 1998) and have been shown to be effective as a physiological adjuvant for eliciting prophylactic or therapeutic antitumor immunity (Timmerman and Levy, 1999). In general, dendritic cells may be identified based on their typical shape (stellate in situ, with marked cytoplasmic processes (dendrites) visible in vitro), their ability to take up, process and present antigens with high efficiency and their ability to activate naive T cell responses. Dendritic cells may, of course, be engineered to express specific cell-surface receptors or ligands that are not commonly found on dendritic cells in vivo or ex vivo, and such modified dendritic cells are contemplated by the present invention. As an alternative to dendritic cells, secreted vesicles antigen-loaded dendritic cells (called exosomes) may be used within a vaccine (Zitvogel et al., 1998).

[0296] Dendritic cells and progenitors may be obtained from peripheral blood, bone marrow, tumor-infiltrating cells, peritumoral tissues-infiltrating cells, lymph nodes, spleen, skin, umbilical cord blood or any other suitable tissue or fluid. For example, dendritic cells may be differentiated ex vivo by adding a combination of cytokines such as GM-CSF, IL-4, IL-13 and/or TNF.alpha. to cultures of monocytes harvested from peripheral blood. Alternatively, CD34 positive cells harvested from peripheral blood, umbilical cord blood or bone marrow may be differentiated into dendritic cells by adding to the culture medium combinations of GM-CSF, IL-3, TNF.alpha., CD40 ligand, LPS, flt3 ligand and/or other compound(s) that induce differentiation, maturation and proliferation of dendritic cells.

[0297] Dendritic cells are conveniently categorized as "immature" and "mature" cells, which allows a simple way to discriminate between two well characterized phenotypes. However, this nomenclature should not be construed to exclude all possible intermediate stages of differentiation. Immature dendritic cells are characterized as APC with a high capacity for antigen uptake and processing, which correlates with the high expression of Fc.gamma. receptor and mannose receptor. The mature phenotype is typically characterized by a lower expression of these markers, but a high expression of cell surface molecules responsible for T cell activation such as class I and class II MHC, adhesion molecules (e.g., CD54 and CD11) and costimulatory molecules (e.g., CD40, CD80, CD86 and 4-1BB).

[0298] APCs may generally be transfected with a polynucleotide encoding a hematological malignancy related peptide, such that the peptide, or an immunogenic portion thereof, is expressed on the cell surface. Such transfection may take place ex vivo, and a composition or vaccine comprising such transfected cells may then be used for therapeutic purposes, as described herein. Alternatively, a gene delivery vehicle that targets a dendritic or other antigen-presenting cell may be administered to a patient, resulting in transfection that occurs in vivo. In vivo and ex vivo transfection of dendritic cells, for example, may generally be performed using any methods known in the art, such as those described in Intl. Pat. Appl. Publ. No. WO 97/24447, or the gene gun approach described by Mahvi et al. (1997). Antigen loading of dendritic cells may be achieved by incubating dendritic cells or progenitor cells with the hematological malignancy related peptide, DNA (naked or within a plasmid vector) or RNA; or with antigen-expressing recombinant bacterium or viruses (e.g., vaccinia, fowlpox, adenovirus or lentivirus vectors). Prior to loading, the peptide may be covalently conjugated to an immunological partner that provides T cell help (e.g., a carrier molecule). Alternatively, a dendritic cell may be pulsed with a non-conjugated immunological partner, separately or in the presence of the peptide.

[0299] Combined therapeutics is also contemplated, and the same type of underlying pharmaceutical compositions may be employed for both single and combined medicaments. Vaccines and pharmaceutical compositions may be presented in unit-dose or multi-dose containers, such as sealed ampoules or vials. Such containers are preferably hermetically sealed to preserve sterility of the formulation until use. In general, formulations may be stored as suspensions, solutions or emulsions in oily or aqueous vehicles. Alternatively, a vaccine or pharmaceutical composition may be stored in a freeze-dried condition requiring only the addition of a sterile liquid carrier immediately prior to use.

[0300] 4.5 Diagnostic and Prognostic Methods for Hematological Malignancy Diseases

[0301] The present invention further provides methods for detecting a malignant disease associated with one or more of the polypeptide or polynucleotide compositions disclosed herein, and for monitoring the effectiveness of an immunization or therapy for such a disease. To determine the presence or absence of a malignant disease associated with one or more of the polypeptide or polynucleotide compositions disclosed herein, a patient may be tested for the level of T cells specific for one or more of such compositions. Within certain methods, a biological sample comprising CD4.sup.+ and/or CD8.sup.+ T cells isolated from a patient is incubated with one or more of the polypeptide or polynucleotide compositions disclosed herein, and/or an APC that expresses one or more of such peptides or polypeptides, and the presence or absence of specific activation of the T cells is detected, as described herein. Suitable biological samples include, but are not limited to, isolated T cells. For example, T cells may be isolated from a patient by routine techniques (such as by Ficoll/Hypaque density gradient centrifugation of peripheral blood lymphocytes). T cells may be incubated in vitro for 2-9 days (typically 4 days) at 37.degree. C. with one or more of the disclosed peptide, polypeptide or polynucleotide compositions (e.g., 5-25 .mu.g/ml). It may be desirable to incubate another aliquot of a T cell sample in the absence of the composition to serve as a control. For CD4.sup.+ T cells, activation is preferably detected by evaluating proliferation of the T cells. For CD8.sup.+ T cells, activation is preferably detected by evaluating cytolytic activity. A level of proliferation that is at least two fold greater and/or a level of cytolytic activity that is at least 20% greater than in disease-free patients indicates the presence of a malignant disease associated with expression or one or more of the disclosed polypeptide or polynucleotide compositions. Further correlation may be made, using methods well known in the art, between the level of proliferation and/or cytolytic activity and the predicted response to therapy. In particular, patients that display a higher antibody, proliferative and/or lytic response may be expected to show a greater response to therapy.

[0302] Within other methods, a biological sample obtained from a patient is tested for the level of antibody specific for one or more of the hematological malignancy-related peptides or polypeptide s disclosed herein. The biological sample is incubated with hematological malignancy-related peptide or polypeptide, or a polynucleotide encoding such a peptide or polypeptide, and/or an APC that expresses such a peptide or polypeptide under conditions and for a time sufficient to allow immunocomplexes to form. Immunocomplexes formed between the selected peptide or polypeptide and antibodies in the biological sample that specifically bind to the selected peptide or polypeptide are then detected. A biological sample for use within such methods may be any sample obtained from a patient that would be expected to contain antibodies. Suitable biological samples include blood, sera, ascites, bone marrow, pleural effusion, and cerebrospinal fluid.

[0303] The biological sample is incubated with the selected peptide or polypeptide in a reaction mixture under conditions and for a time sufficient to permit immunocomplexes to form between the selected peptide or polypeptide and antibodies that are immunospecific for such a peptide or polypeptide. For example, a biological sample and a selected peptide or polypeptide peptide may be incubated at 4.degree. C. for 24-48 hrs.

[0304] Following the incubation, the reaction mixture is tested for the presence of immunocomplexes. Detection of immunocomplexes formed between the selected peptide or polypeptide and antibodies present in the biological sample may be accomplished by a variety of known techniques, such as radioimmunoassays (RIA) and enzyme linked immunosorbent assays (ELISA). Suitable assays are well known in the art and are amply described in the scientific and patent literature (Harlow and Lane, 1988). Assays that may be used include, but are not limited to, the double monoclonal antibody sandwich immunoassay technique (U.S. Pat. No. 4,376,110); monoclonal-polyclonal antibody sandwich assays (Wide et al., 1970); the "western blot" method (U.S. Pat. No. 4,452,901); immunoprecipitation of labeled ligand (Brown et al., 1980); enzyme-linked immunosorbent assays (Raines and Ross, 1982); immunocytochemical techniques, including the use of fluorochromes (Brooks et al., 1980); and neutralization of activity (Bowen-Pope et al., 1984). Other immunoassays include, but are not limited to, those described in U.S. Pat. Nos. 3,817,827; 3,850,752; 3,901,654; 3,935,074; 3,984,533; 3,996,345; 4,034,074; and 4,098,876.

[0305] For detection purposes, the selected peptide or polypeptide may either be labeled or unlabeled. Unlabeled polypeptide peptide may be used in agglutination assays or in combination with labeled detection reagents that bind to the immunocomplexes (e.g., anti-immunoglobulin, protein G, Protein A or a lectin and secondary antibodies, or antigen-binding fragments thereof, capable of binding to the antibodies that specifically bind to the selected hematological malignancy-related peptide or polypeptide). If the selected peptide or polypeptide is labeled, the reporter group may be any suitable reporter group known in the art, including radioisotopes, fluorescent groups, luminescent groups, enzymes, biotin and dye particles.

[0306] Within certain assays, unlabeled peptide or polypeptide is immobilized on a solid support. The solid support may be any material known to those of ordinary skill in the art to which the peptide may be attached. For example, the solid support may be a test well in a microtiter plate or a nitrocellulose or other suitable membrane. Alternatively, the support may be a bead or disc, such as glass, fiberglass, latex or a plastic material such as polystyrene or polyvinylchloride. The support may also be a magnetic particle or a fiber optic sensor, such as those disclosed, for example, in U.S. Pat. No. 5,359,681. The peptide may be immobilized on the solid support using a variety of techniques known to those of skill in the art, which are amply described in the patent and scientific literature. In the context of the present invention, the term "immobilization" refers to both noncovalent association, such as adsorption, and covalent attachment (which may be a direct linkage between the antigen and functional groups on the support or may be a linkage by way of a cross-linking agent). Immobilization by adsorption to a well in a microtiter plate or to a membrane is preferred. In such cases, adsorption may be achieved by contacting the selected peptide or polypeptide, in a suitable buffer, with the solid support for a suitable amount of time. The contact time varies with temperature, but is typically between about 1 hour and about 1 day. In general, contacting a well of a plastic microtiter plate (such as polystyrene or polyvinylchloride) with an amount of peptide ranging from about 10 ng to about 10 .mu.g, and preferably about 100 ng to about 1 .mu.g, is sufficient to immobilize an adequate amount of peptide.

[0307] Following immobilization, the remaining protein binding sites on the support are typically blocked. Any suitable blocking agent known to those of ordinary skill in the art, such as bovine serum albumin, Tween.TM. 20.TM. (Sigma Chemical Co., St. Louis, Mo.), heat-inactivated normal goat serum (NGS), or BLOTTO (buffered solution of nonfat dry milk which also contains a preservative, salts, and an antifoaming agent) may be used. The support is then incubated with a biological sample suspected of containing specific antibody. The sample can be applied neat, or, more often, it can be diluted, usually in a buffered solution which contains a small amount (0.1%-5.0% by weight) of protein, such as BSA, NGS, or BLOTTO. In general, an appropriate contact time (i.e., incubation time) is a period of time that is sufficient to detect the presence of antibody or an antigen binding fragment that is immunospecific for the selected peptide or polypeptide within a sample containing such an antibody or binding fragment thereof. Preferably, the contact time is sufficient to achieve a level of binding that is at least about 95% of that achieved at equilibrium between bound and unbound antibody or antibody fragment. Those of ordinary skill in the art will recognize that the time necessary to achieve equilibrium may be readily determined by assaying the level of binding that occurs over a period of time. At room temperature, an incubation time of about 30 min is generally sufficient.

[0308] Unbound sample may then be removed by washing the solid support with an appropriate buffer, such as PBS containing 0.1% Tween.TM. 20. A detection reagent that binds to the immunocomplexes and that comprises at least a first detectable label or "reporter" molecule may then be added. The detection reagent is incubated with the immunocomplex for an amount of time sufficient to detect the bound antibody or antigen binding fragment thereof. An appropriate amount of time may generally be determined by assaying the level of binding that occurs over a period of time. Unbound label or detection reagent is then removed and bound label or detection reagent is detected using a suitable assay or analytical instrument. The method employed for detecting the reporter group depends upon the nature of the reporter group. For radioactive labels, scintillation counting or autoradiographic methods are generally appropriate. Spectroscopic methods may be used to detect dyes, luminescent or chemiluminescent moieties and various chromogens, fluorescent labels and such like. Biotin may be detected using avidin, coupled to a different reporter group (commonly a radioactive or fluorescent group or an enzyme). Enzyme reporter groups (e.g., horseradish peroxidase, .beta.-galactosidase, alkaline phosphatase and glucose oxidase) may generally be detected by the addition of substrate (generally for a specific period of time), followed by spectroscopic or other analysis of the reaction products. Regardless of the specific method employed, a level of bound detection reagent that is at least two fold greater than background (i.e., the level observed for a biological sample obtained from a disease-free individual) indicates the presence of a malignant disease associated with expression of the selected peptide or polypeptide.

[0309] In general, methods for monitoring the effectiveness of an immunization or therapy involve monitoring changes in the level of antibodies or T cells specific for the selected peptide or polypeptide in a sample, or in an animal such as a human patient. Methods in which antibody levels are monitored may comprise the steps of: (a) incubating a first biological sample, obtained from a patient prior to a therapy or immunization, with a selected peptide or polypeptide, wherein the incubation is performed under conditions and for a time sufficient to allow immunocomplexes to form; (b) detecting immunocomplexes formed between the selected peptide or polypeptide and antibodies or antigen binding fragments in the biological sample that specifically bind to the selected peptide or polypeptide; (c) repeating steps (a) and (b) using a second biological sample taken from the patient at later time, such as for example, following a given therapy or immunization; and (d) comparing the number of immunocomplexes detected in the first and second biological samples. Alternatively, a polynucleotide encoding the selected peptide or polypeptide, or an APC expressing the selected peptide or polypeptide may be employed in place of the selected peptide or polypeptide itself. Within such methods, immunocomplexes between the selected peptide or polypeptide encoded by a polynucleotide, or expressed by the APC, and antibodies and/or antigen binding fragments in the biological sample are detected.

[0310] Methods in which T cell activation and/or the number of hematological malignancy polypeptide-specific precursors are monitored may comprise the steps of: (a) incubating a first biological sample comprising CD4.sup.+ and/or CD8.sup.+ cells (e.g., bone marrow, peripheral blood or a fraction thereof), obtained from a patient prior to a therapy or immunization, with a hematological malignancy peptide or polypeptide, wherein the incubation is performed under conditions and for a time sufficient to allow specific activation, proliferation and/or lysis of T cells; (b) detecting an amount of activation, proliferation and/or lysis of the T cells; (c) repeating steps (a) and (b) using a second biological sample comprising CD4.sup.+ and/or CD8.sup.+ T cells, and taken from the same patient following therapy or immunization; and (d) comparing the amount of activation, proliferation and/or lysis of T cells in the first and second biological samples. Alternatively, a polynucleotide encoding a hematological malignancy related peptide, or an APC expressing such a peptide may be employed in place of the hematological malignancy peptide itself.

[0311] A biological sample for use within such methods may be any sample obtained from a patient that would be expected to contain antibodies, CD4.sup.+ T cells and/or CD8.sup.+ T cells. Suitable biological samples include blood, sera, ascites, bone marrow, pleural effusion and cerebrospinal fluid. A first biological sample may be obtained prior to initiation of therapy or immunization or part way through a therapy or vaccination regime. The second biological sample should be obtained in a similar manner, but at a time following additional therapy or immunization. The second biological sample may be obtained at the completion of, or part way through, therapy or immunization, provided that at least a portion of therapy or immunization takes place between the isolation of the first and second biological samples.

[0312] Incubation and detection steps for both samples may generally be performed as described above. A statistically significant increase in the number of immunocomplexes in the second sample relative to the first sample reflects successful therapy or immunization.

[0313] 4.6 Administration of Pharmaceutical Compositions and Formulations

[0314] In certain embodiments, the present invention concerns formulation of one or more of the polynucleotide, polypeptide, peptide, antibody, or antigen binding fragment compositions disclosed herein in pharmaceutically acceptable solutions for administration to a cell or an animal, either alone, or in combination with one or more other modalities of anti-cancer therapy, or in combination with one or more diagnostic or therapeutic agents.

[0315] It will also be understood that, if desired, the nucleic acid segment, RNA, or DNA compositions disclosed herein may be administered in combination with other agents as well, such as, e.g., proteins or peptides or various pharmaceutically-active agents. As long as the composition comprises at least one of the genetic expression constructs disclosed herein, there is virtually no limit to other components that may also be included, given that the additional agents do not cause a significant adverse effect upon contact with the target cells or host tissues. The RNA- or DNA-derived compositions may thus be delivered along with various other agents as required in the particular instance. Such RNA or DNA compositions may be purified from host cells or other biological sources, or alternatively may be chemically synthesized as described herein. Likewise, such compositions may comprise substituted or derivatized RNA or DNA compositions. Such compositions may include one or more therapeutic gene constructs, either alone, or in combination with one or more modified peptide or nucleic acid substituent derivatives, and/or other anticancer therapeutics.

[0316] The formulation of pharmaceutically-acceptable excipients and carrier solutions are well-known to those of skill in the art, as is the development of suitable dosing and treatment regimens for using the particular compositions described herein in a variety of treatment regimens, including e.g., oral, intravenous, intranasal, transdermal, intraprostatic, intratumoral, and/or intramuscular administration and formulation.

[0317] 4.6.1 Injectable Delivery

[0318] For example, the pharmaceutical compositions disclosed herein may be administered parenterally, intravenously, intramuscularly, or even intraperitoneally as described in U.S. Pat. No. 5,543,158, U.S. Pat. No. 5,641,515 and U.S. Pat. No. 5,399,363 (each specifically incorporated herein by reference in its entirety). Solutions of the active compounds as free-base or pharmacologically acceptable salts may be prepared in water suitably mixed with a surfactant, such as hydroxypropylcellulose. Dispersions may also be prepared in glycerol, liquid polyethylene glycols, and mixtures thereof and in oils. Under ordinary conditions of storage and use, these preparations contain a preservative to prevent the growth of microorganisms.

[0319] The pharmaceutical forms suitable for injectable use include sterile aqueous solutions or dispersions and sterile powders for the extemporaneous preparation of sterile injectable solutions or dispersions (U.S. Pat. No. 5,466,468, specifically incorporated herein by reference in its entirety). In all cases the form must be sterile and must be fluid to the extent that easy syringability exists. It must be stable under the conditions of manufacture and storage and must be preserved against the contaminating action of microorganisms, such as bacteria and fungi. The carrier can be a solvent or dispersion medium containing, for example, water, ethanol, polyol (e.g., glycerol, propylene glycol, and liquid polyethylene glycol, and the like), suitable mixtures thereof, and/or vegetable oils. Proper fluidity may be maintained, for example, by the use of a coating, such as lecithin, by the maintenance of the required particle size in the case of dispersion and by the use of surfactants. The prevention of the action of microorganisms can be brought about by various antibacterial and antifungal agents, for example, parabens, chlorobutanol, phenol, sorbic acid, thimerosal, and the like. In many cases, it will be preferable to include isotonic agents, for example, sugars or sodium chloride. Prolonged absorption of the injectable compositions can be brought about by the use in the compositions of agents delaying absorption, for example, aluminum monostearate and gelatin.

[0320] For parenteral administration in an aqueous solution, for example, the solution should be suitably buffered if necessary and the liquid diluent first rendered isotonic with sufficient saline or glucose. These particular aqueous solutions are especially suitable for intravenous, intramuscular, subcutaneous and intraperitoneal administration. In this connection, sterile aqueous media that can be employed will be known to those of skill in the art in light of the present disclosure. For example, one dosage may be dissolved in 1 ml of isotonic NaCl solution and either added to 1000 ml of hypodermoclysis fluid or injected at the proposed site of infusion, (see for example, Hoover, 1975). Some variation in dosage will necessarily occur depending on the condition of the subject being treated. The person responsible for administration will, in any event, determine the appropriate dose for the individual subject. Moreover, for human administration, preparations should meet sterility, pyrogenicity, and general safety and purity standards as required by FDA Office of Biologics standards.

[0321] Sterile injectable solutions may be prepared by incorporating the gene therapy constructs in the required amount in the appropriate solvent with several of the other ingredients enumerated above, as required, followed by filtered sterilization. Generally, dispersions are prepared by incorporating the various sterilized active ingredients into a sterile vehicle which contains the basic dispersion medium and the required other ingredients from those enumerated above. In the case of sterile powders for the preparation of sterile injectable solutions, the preferred methods of preparation are vacuum-drying and freeze-drying techniques which yield a powder of the active ingredient plus any additional desired ingredient from a previously sterile-filtered solution thereof.

[0322] The compositions disclosed herein may be formulated in a neutral or salt form. Pharmaceutically-acceptable salts, include the acid addition salts and which are formed with inorganic acids such as, for example, hydrochloric or phosphoric acids, or such organic acids as acetic, oxalic, tartaric, mandelic, and the like. Salts formed with the free carboxyl groups can also be derived from inorganic bases such as, for example, sodium, potassium, ammonium, calcium, or ferric hydroxides, and such organic bases as isopropylamine, trimethylamine, histidine, procaine and the like. Upon formulation, solutions will be administered in a manner compatible with the dosage formulation and in such amount as is therapeutically effective. The formulations are easily administered in a variety of dosage forms such as injectable solutions, drug release capsules and the like.

[0323] As used herein, "carrier" includes any and all solvents, dispersion media, vehicles, coatings, diluents, antibacterial and antifungal agents, isotonic and absorption delaying agents, buffers, carrier solutions, suspensions, colloids, and the like. The use of such media and agents for pharmaceutical active substances is well known in the art. Except insofar as any conventional media or agent is incompatible with the active ingredient, its use in the therapeutic compositions is contemplated. Supplementary active ingredients can also be incorporated into the compositions.

[0324] 4.6.2 Intranasal Delivery

[0325] One may use nasal solutions or sprays, aerosols or even inhalants for the treatment of hematological malignancies with one of more of the disclosed peptides and polynucleotides. Nasal solutions are usually aqueous solutions designed for administration to the nasal passages in drops or sprays. Nasal solutions are prepared so that they are similar in many respects to nasal secretions, so that normal ciliary action is maintained. Thus, the aqueous nasal solutions usually are isotonic and slightly buffered to maintain a pH of from about 5.5 to about 6.5. In addition, antimicrobial preservatives, similar to those used in ophthalmic preparations, and appropriate drug stabilizers, if required, may be included in the formulation. Various commercial nasal preparations are known.

[0326] Inhalations and inhalants are pharmaceutical preparations designed for delivering a drug or compound into the respiratory tree of a patient. A vapor or mist is administered and reaches the affected area, often to give relief from symptoms of bronchial and nasal congestion. However, this route can also be employed to deliver agents into the systemic circulation. Inhalations may be administered by the nasal or oral respiratory routes. The administration of inhalation solutions is only effective if the droplets are sufficiently fine and uniform in size so that the mist reaches the bronchioles.

[0327] Another group of products, also known as inhalations, and sometimes called insufflations, consists of finely powdered or liquid drugs that are carried into the respiratory passages by the use of special delivery systems, such as pharmaceutical aerosols, that hold a solution or suspension of the drug in a liquefied gas propellant. When released through a suitable valve and oral adapter, a metered does of the inhalation is propelled into the respiratory tract of the patient.

[0328] Particle size is of importance in the administration of this type of preparation. It has been reported that the optimum particle size for penetration into the pulmonary cavity is of the order of about 0.5 to about 7 .mu.m. Fine mists are produced by pressurized aerosols and hence their use in considered advantageous.

[0329] 4.6.3 Liposome-, Nanocapsule-, and Microparticle-Mediated Delivery

[0330] In certain embodiments, the inventors contemplate the use of liposomes, nanocapsules, microparticles, microspheres, lipid particles, vesicles, and the like, for the introduction of the polynucleotide compositions of the present invention into suitable host cells. In particular, the polynucleotide compositions of the present invention may be formulated for delivery either encapsulated in a lipid particle, a liposome, a vesicle, a nanosphere, or a nanoparticle or the like.

[0331] Such formulations may be preferred for the introduction of pharmaceutically acceptable formulations of the nucleic acids disclosed herein. The formation and use of liposomes is generally known to those of skill in the art (see for example, Couvreur et al., 1977; Couvreur, 1988; Lasic, 1998; which describes the use of liposomes and nanocapsules in the targeted antibiotic therapy for intracellular bacterial infections and diseases). Recently, liposomes were developed with improved serum stability and circulation half-lives (Gabizon and Papahadjopoulos, 1988; Allen and Choun, 1987; U.S. Pat. No. 5,741,516, specifically incorporated herein by reference in its entirety). Further, various methods of liposome and liposome like preparations as potential drug carriers have been reviewed (Takakura, 1998; Chandran et al., 1997; Margalit, 1995; U.S. Pat. No. 5,567,434; U.S. Pat. No. 5,552,157; U.S. Pat. No. 5,565,213; U.S. Pat. No. 5,738,868 and U.S. Pat. No. 5,795,587, each specifically incorporated herein by reference in its entirety).

[0332] Liposomes have been used successfully with a number of cell types that are normally resistant to transfection by other procedures including T cell suspensions, primary hepatocyte cultures and PC12 cells (Renneisen et al., 1990; Muller et al., 1990). In addition, liposomes are free of the DNA length constraints that are typical of viral-based delivery systems. Liposomes have been used effectively to introduce genes, drugs (Heath and Martin, 1986; Heath et al., 1986; Balazsovits et al., 1989; Fresta and Puglisi, 1996), radiotherapeutic agents (Pikul et al., 1987), enzymes (Imaizumi et al., 1990a; Imaizumi et al., 1990b), viruses (Faller and Baltimore, 1984), transcription factors and allosteric effectors (Nicolau and Gersonde, 1979) into a variety of cultured cell lines and animals. In addition, several successful clinical trails examining the effectiveness of liposome-mediated drug delivery have been completed (Lopez-Berestein et al., 1985a; 1985b; Coune, 1988; Sculier et al., 1988). Furthermore, several studies suggest that the use of liposomes is not associated with autoimmune responses, toxicity or gonadal localization after systemic delivery (Mori and Fukatsu, 1992).

[0333] Liposomes are formed from phospholipids that are dispersed in an aqueous medium and spontaneously form multilamellar concentric bilayer vesicles (also termed multilamellar vesicles (MLVs). MLVs generally have diameters of from 25 nm to 4 .mu.m. Sonication of MLVs results in the formation of small unilamellar vesicles (SUVs) with diameters in the range of 200 to 500 .ANG., containing an aqueous solution in the core.

[0334] Liposomes bear resemblance to cellular membranes and are contemplated for use in connection with the present invention as carriers for the peptide compositions. They are widely suitable as both water- and lipid-soluble substances can be entrapped, i.e. in the aqueous spaces and within the bilayer itself, respectively. It is possible that the drug-bearing liposomes may even be employed for site-specific delivery of active agents by selectively modifying the liposomal formulation.

[0335] In addition to the teachings of Couvreur et al. (1977; 1988), the following information may be utilized in generating liposomal formulations. Phospholipids can form a variety of structures other than liposomes when dispersed in water, depending on the molar ratio of lipid to water. At low ratios the liposome is the preferred structure. The physical characteristics of liposomes depend on pH, ionic strength and the presence of divalent cations. Liposomes can show low permeability to ionic and polar substances, but at elevated temperatures undergo a phase transition which markedly alters their permeability. The phase transition involves a change from a closely packed, ordered structure, known as the gel state, to a loosely packed, less-ordered structure, known as the fluid state. This occurs at a characteristic phase-transition temperature and results in an increase in permeability to ions, sugars, and drugs.

[0336] Alternatively, the invention provides for pharmaceutically acceptable nanocapsule formulations of the polynucleotide compositions of the present invention. Nanocapsules can generally entrap compounds in a stable and reproducible way (Henry-Michelland et al., 1987; Quintanar-Guerrero et al., 1998; Douglas et al., 1987). To avoid side effects due to intracellular polymeric overloading, such ultrafine particles (sized around 0.1 .mu.m) should be designed using polymers able to be degraded in vivo. Biodegradable polyalkyl-cyanoacrylate nanoparticles that meet these requirements are contemplated for use in the present invention, and such particles may be are easily made, as described (Couvreur et al., 1980; 1988; zur Muhlen et al., 1998; Zambaux et al. 1998; Pinto-Alphandry et al., 1995 and U.S. Pat. No. 5,145,684, specifically incorporated herein by reference in its entirety). In particular, methods of polynucleotide delivery to a target cell using either nanoparticles or nanospheres (Schwab et al., 1994; Truong-Le et al., 1998) are also particularly contemplated to be useful in formulating the disclosed compositions for administration to an animal, and to a human in particular.

[0337] 4.7 Therapeutic Agents and Kits

[0338] The invention also provides one or more of the hematological malignancy-related compositions formulated with one or more pharmaceutically acceptable excipients, carriers, diluents, adjuvants, and/or other components for use in the preparation of medicaments, or diagnostic reagents, as well as various kits comprising one or more of such compositions, medicaments, or formulations intended for administration to an animal in need thereof, or for use in one or more diagnostic assays for identifying polynucleotides, polypeptides, and/or antibodies that are specific for one or more hematological malignancy-related compounds as described herein. In addition to the disclosed epitopes, antibodies and antigen binding fragments, antibody- or antigen binding fragment-encoding polynucleotides or additional anticancer agents, polynucleotides, peptides, antigens, or other therapeutic compounds as may be employed in the formulation of particular compositions and formulations disclosed herein, and particularly in the preparation of anticancer agents or anti-hematological malignancies therapies for administration to the affected mammal:

[0339] As such, preferred animals for administration of the pharmaceutical compositions disclosed herein include mammals, and particularly humans. Other preferred animals include primates, sheep, goats, bovines, equines, porcines, lupines, canines, and felines, as well as any other mammalian species commonly considered pets, livestock, or commercially relevant animal species. The compositions and formulations may include partially or significantly purified polypeptide, polynucleotide, or antibody or antigen binding fragment compositions, either alone, or in combination with one or more additional active ingredients, anticancer agents, vaccines, adjuvants, or other therapeutics which may be obtained from natural or recombinant sources, or which may be obtainable naturally or either chemically synthesized, or alternatively produced in vitro from recombinant host cells expressing one or more nucleic acid segments that encode one or more such additional active ingredients, carriers, adjuvants, cofactors, or other therapeutic compound.

[0340] 4.8 Diagnostic Reagents and Kits

[0341] The invention further provides diagnostic reagents and kits comprising one or more such reagents for use in a variety of diagnostic assays, including for example, immunoassays such as ELISA and "sandwich"-type immunoassays. Such kits may preferably include at least a first peptide, or a first antibody or antigen binding fragment of the invention, a functional fragment thereof, or a cocktail thereof, and means for signal generation. The kit's components may be pre-attached to a solid support, or may be applied to the surface of a solid support when the kit is used. The signal generating means may come pre-associated with an antibody of the invention or may require combination with one or more components, e.g., buffers, antibody-enzyme conjugates, enzyme substrates, or the like, prior to use. Kits may also include additional reagents, e.g., blocking reagents for reducing nonspecific binding to the solid phase surface, washing reagents, enzyme substrates, and the like. The solid phase surface may be in the form of microtiter plates, microspheres, or other materials suitable for immobilizing proteins, peptides, or polypeptides. Preferably, an enzyme that catalyzes the formation of a chemiluminescent or chromogenic product or the reduction of a chemiluminescent or chromogenic substrate is a component of the signal generating means. Such enzymes are well known in the art.

[0342] Such kits are useful in the detection, monitoring and diagnosis of conditions characterized by over-expression or inappropriate expression of hematological malignancy-related peptides, polypeptides, antibodies, and/or polynucleotides, as well as hybridomas, host cells, and vectors comprising one or more such compositions as disclosed herein.

[0343] The therapeutic and diagnostic kits of the present invention may also be prepared that comprise at least one of the antibody, peptide, antigen binding fragment, hybridoma, vector, vaccine, polynucleotide, or cellular compositions disclosed herein and instructions for using the composition as a diagnostic reagent or therapeutic agent. Containers for use in such kits may typically comprise at least one vial, test tube, flask, bottle, syringe or other suitable container, into which one or more of the diagnostic and/or therapeutic composition(s) may be placed, and preferably suitably aliquoted. Where a second therapeutic agent is also provided, the kit may also contain a second distinct container into which this second diagnostic and/or therapeutic composition may be placed. Alternatively, a plurality of compounds may be prepared in a single pharmaceutical composition, and may be packaged in a single container means, such as a vial, flask, syringe, bottle, or other suitable single container. The kits of the present invention will also typically include a means for containing the vial(s) in close confinement for commercial sale, such as, e.g., injection or blow-molded plastic containers into which the desired vial(s) are retained. Where a radiolabel, chromogenic, fluorigenic, or other type of detectable label or detecting means is included within the kit, the labeling agent may be provided either in the same container as the diagnostic or therapeutic composition itself, or may alternatively be placed in a second distinct container means into which this second composition may be placed and suitably aliquoted. Alternatively, the detection reagent and the label may be prepared in a single container means, and in most cases, the kit will also typically include a means for containing the vial(s) in close confinement for commercial sale and/or convenient packaging and delivery.

[0344] 4.9 Polynucleotide Compositions

[0345] As used herein, the terms "DNA segment" and "polynucleotide" refer to a DNA molecule that has been isolated free of total genomic DNA of a particular species. Therefore, a DNA segment encoding a polypeptide refers to a DNA segment that contains one or more coding sequences yet is substantially isolated away from, or purified free from, total genomic DNA of the species from which the DNA segment is obtained. Included within the terms "DNA segment" and "polynucleotide" are DNA segments and smaller fragments of such segments, and also recombinant vectors, including, for example, plasmids, cosmids, phagemids, phage, viruses, and the like.

[0346] As will be understood by those skilled in the art, the DNA segments of this invention can include genomic sequences, extra-genomic and plasmid-encoded sequences and smaller engineered gene segments that express, or may be adapted to express, proteins, polypeptides, peptides and the like. Such segments may be naturally isolated, or modified synthetically by the hand of man.

[0347] "Isolated," as used herein, means that a polynucleotide is substantially away from other coding sequences, and that the DNA segment does not contain large portions of unrelated coding DNA, such as large chromosomal fragments or other functional genes or polypeptide coding regions. Of course, this refers to the DNA segment as originally isolated, and does not exclude genes or coding regions later added to the segment by the hand of man.

[0348] As will be recognized by the skilled artisan, polynucleotides may be single-stranded (coding or antisense) or double-stranded, and may be DNA (genomic, cDNA or synthetic) or RNA molecules. RNA molecules include HnRNA molecules, which contain introns and correspond to a DNA molecule in a one-to-one manner, and mRNA molecules, which do not contain introns. Additional coding or non-coding sequences may, but need not, be present within a polynucleotide of the present invention, and a polynucleotide may, but need not, be linked to other molecules and/or support materials.

[0349] Polynucleotides may comprise a native sequence (i.e., an endogenous sequence that encodes a hematological malignancy-related tumor protein or a portion thereof) or may comprise a variant, or a biological or antigenic functional equivalent of such a sequence. Polynucleotide variants may contain one or more substitutions, additions, deletions and/or insertions, as further described below, preferably such that the immunogenicity of the encoded polypeptide is not diminished, relative to a native tumor protein. The effect on the immunogenicity of the encoded polypeptide may generally be assessed as described herein. The term "variants" also encompasses homologous genes of xenogenic origin.

[0350] When comparing polynucleotide or polypeptide sequences, two sequences are said to be "identical" if the sequence of nucleotides or amino acids in the two sequences is the same when aligned for maximum correspondence, as described below. Comparisons between two sequences are typically performed by comparing the sequences over a comparison window to identify and compare local regions of sequence similarity. A "comparison window" as used herein, refers to a segment of at least about 20 contiguous positions, usually 30 to about 75, 40 to about 50, in which a sequence may be compared to a reference sequence of the same number of contiguous positions after the two sequences are optimally aligned.

[0351] Optimal alignment of sequences for comparison may be conducted using the Megalign program in the Lasergene suite of bioinformatics software (DNASTAR, Inc., Madison, Wis.), using default parameters. This program embodies several alignment schemes described in the following references: Dayhoff, M. O. (1978) A model of evolutionary change in proteins--Matrices for detecting distant relationships. In Dayhoff, M. O. (ed.) Atlas of Protein Sequence and Structure, National Biomedical Research Foundation, Washington D.C. Vol. 5, Suppl. 3, pp. 345-358; Hein J. (1990) Unified Approach to Alignment and Phylogenes pp. 626-645 Methods in Enzymology vol. 183, Academic Press, Inc., San Diego, Calif.; Higgins, D. G. and Sharp, P. M. (1989) CABIOS 5:151-153; Myers, E. W. and Muller W. (1988) CABIOS 4:11-17; Robinson, E. D. (1971) Comb. Theor 11:105; Santou, N. Nes, M. (1987) Mol. Biol. Evol. 4:406-425; Sneath, P. H. A. and Sokal, R. R. (1973) Numerical Taxonomy--the Principles and Practice of Numerical Taxonomy, Freeman Press, San Francisco, Calif.; Wilbur, W. J. and Lipman, D. J. (1983) Proc. Natl. Acad., Sci. USA 80:726-730.

[0352] Alternatively, optimal alignment of sequences for comparison may be conducted by the local identity algorithm of Smith and Waterman (1981) Add. APL. Math 2:482, by the identity alignment algorithm of Needleman and Wunsch (1970) J. Mol. Biol. 48:443, by the search for similarity methods of Pearson and Lipman (1988) Proc. Natl. Acad. Sci. USA 85: 2444, by computerized implementations of these algorithms (GAP, BESTFIT, BLAST, FASTA, and TFASTA in the Wisconsin Genetics Software Package, Genetics Computer Group (GCG), 575 Science Dr., Madison, Wis.), or by inspection.

[0353] One preferred example of algorithms that are suitable for determining percent sequence identity and sequence similarity are the BLAST and BLAST 2.0 algorithms, which are described in Altschul et al. (1977) Nucl. Acids Res. 25:3389-3402 and Altschul et al. (1990) J. Mol. Biol. 215:403-410, respectively. BLAST and BLAST 2:0 can be used, for example with the parameters described herein, to determine percent sequence identity for the polynucleotides and polypeptides of the invention. Software for performing BLAST analyses is publicly available through the National Center for Biotechnology Information. In one illustrative example, cumulative scores can be calculated using, for nucleotide sequences, the parameters M (reward score for a pair of matching residues; always >0) and N (penalty score for mismatching residues; always <0). For amino acid sequences, a scoring matrix can be used to calculate the cumulative score. Extension of the word hits in each direction are halted when: the cumulative alignment score falls off by the quantity X from its maximum achieved value; the cumulative score goes to zero or below, due to the accumulation of one or more negative-scoring residue alignments; or the end of either sequence is reached. The BLAST algorithm parameters W, T and X determine the sensitivity and speed of the alignment. The BLASTN program (for nucleotide sequences) uses as defaults a wordlength (W) of 11, and expectation (E) of 10, and the BLOSUM62 scoring matrix (see Henikoff and Henikoff (1989) Proc. Natl. Acad. Sci. USA 89:10915) alignments, (B) of 50, expectation (E) of 10, M=5, N=-4 and a comparison of both strands.

[0354] Preferably, the "percentage of sequence identity" is determined by comparing two optimally aligned sequences over a window of comparison of at least 20 positions, wherein the portion of the polynucleotide or polypeptide sequence in the comparison window may comprise additions or deletions (i.e., gaps) of 20 percent or less, usually 5 to 15 percent, or 10 to 12 percent, as compared to the reference sequences (which does not comprise additions or deletions) for optimal alignment of the two sequences. The percentage is calculated by determining the number of positions at which the identical nucleic acid bases or amino acid residue occurs in both sequences to yield the number of matched positions, dividing the number of matched positions by the total number of positions in the reference sequence (i.e., the window size) and multiplying the results by 100 to yield the percentage of sequence identity.

[0355] Therefore, the present invention encompasses polynucleotide and polypeptide sequences having substantial identity to the sequences disclosed herein, for example those comprising at least 50% sequence identity, preferably at least 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or higher, sequence identity compared to a polynucleotide or polypeptide sequence of this invention using the methods described herein, (e.g., BLAST analysis using standard parameters, as described below). One skilled in this art will recognize that these values can be appropriately adjusted to determine corresponding identity of proteins encoded by two nucleotide sequences by taking into account codon degeneracy, amino acid similarity, reading frame positioning and the like.

[0356] In additional embodiments, the present invention provides isolated polynucleotides and polypeptides comprising various lengths of contiguous stretches of sequence identical to or complementary to one or more of the sequences disclosed herein. For example, polynucleotides are provided by this invention that comprise at least about 15, 20, 30, 40, 50, 75, 100, 150, 200, 300, 400, 500 or 1000 or more contiguous nucleotides of one or more of the sequences disclosed herein as well as all intermediate lengths there between. It will be readily understood that "intermediate lengths", in this context, means any length between the quoted values, such as 16, 17, 18, 19, etc.; 21, 22, 23, etc.; 30, 31, 32, etc.; 50, 51, 52, 53, etc.; 100, 101, 102, 103, etc.; 150, 151, 152, 153, etc.; including all integers through 200-500; 500-1,000, and the like.

[0357] The polynucleotides of the present invention, or fragments thereof, regardless of the length of the coding sequence itself, may be combined with other DNA sequences, such as promoters, polyadenylation signals, additional restriction enzyme sites, multiple cloning sites, other coding segments, and the like, such that their overall length may vary considerably. It is therefore contemplated that a nucleic acid fragment of almost any length may be employed, with the total length preferably being limited by the ease of preparation and use in the intended recombinant DNA protocol. For example, illustrative DNA segments with total lengths of about 10,000, about 5000, about 3000, about 2,000, about 1,000, about 500, about 200, about 100, about 50 base pairs in length, and the like, (including all intermediate lengths) are contemplated to be useful in many implementations of this invention.

[0358] In other embodiments, the present invention is directed to polynucleotides that are capable of hybridizing under moderately stringent conditions to a polynucleotide sequence provided herein, or a fragment thereof, or a complementary sequence thereof. Hybridization techniques are well known in the art of molecular biology. For purposes of illustration, suitable moderately stringent conditions for testing the hybridization of a polynucleotide of this invention with other polynucleotides include prewashing in a solution of 5.times.SSC, 0.5% SDS, 1.0 mM EDTA (pH 8.0); hybridizing at 50.degree. C.-65.degree. C., 5.times.SSC, overnight; followed by washing twice at 65.degree. C. for 20 minutes with each of 2.times., 0.5.times. and 0.2.times.SSC containing 0.1% SDS.

[0359] Moreover, it will be appreciated by those of ordinary skill in the art that, as a result of the degeneracy of the genetic code, there are many nucleotide sequences that encode a polypeptide as described herein. Some of these polynucleotides bear minimal homology to the nucleotide sequence of any native gene. Nonetheless, polynucleotides that vary due to differences in codon usage are specifically contemplated by the present invention. Further, alleles of the genes comprising the polynucleotide sequences provided herein are within the scope of the present invention. Alleles are endogenous genes that are altered as a result of one or more mutations, such as deletions, additions and/or substitutions of nucleotides. The resulting mRNA and protein may, but need not, have an altered structure or function. Alleles may be identified using standard techniques (such as hybridization, amplification and/or database sequence comparison).

[0360] 4.10 Probes and Primers

[0361] In other embodiments of the present invention, the polynucleotide sequences provided herein can be advantageously used as probes or primers for nucleic acid hybridization. As such, it is contemplated that nucleic acid segments that comprise a sequence region of at least about 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, or 95 nucleotide long contiguous sequence that has the same sequence as, or is complementary to, at least a 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, or 95 nucleotide long contiguous sequence the disclosed polynucleotides will find particular utility in a variety of hybridization embodiments. Longer contiguous identical or complementary sequences, e.g., those of about 100, 110, 120, 130, 140, 150, 160, 170, 180, 190, 200, 210, 220, 230, 240, 250, 260, 270, 280, 290, 300, 310, 320, 330, 340, 350, 360, 370, 380, 390, 400, 410, 420, 430, 440, 450, 460, 470, 480, 490, 500, 525, 550, 575, 600, 650, 700, 750, 800, 850, 900, 950, or even 1000 or so nucleotides (including all intermediate lengths) and all full-length sequences as the disclosed polynucleotides will also be of use in certain embodiments as probes, primers, or amplification targets and such like.

[0362] The ability of such nucleic acid probes to specifically hybridize to a sequence of interest will enable them to be of use in detecting the presence of complementary sequences in a given sample. However, other uses are also envisioned, such as the use of the sequence information for the preparation of mutant species primers, or primers, for use in preparing other genetic constructions, and for identifying and characterizing full-length polynucleotides and full, or substantially full-length cDNAs, mRNAs, and such like.

[0363] Polynucleotide molecules having sequence regions consisting of contiguous nucleotide stretches identical or complementary to one or more polynucleotide sequences as disclosed herein, are particularly contemplated as hybridization probes for use in, e.g., Southern hybridization analyses and Northern blotting. This would allow a gene product, or fragment thereof, to be analyzed, both in diverse cell types and also in various bacterial cells. The total size of fragment, as well as the size of the complementary stretch(es), will ultimately depend on the intended use or application of the particular nucleic acid segment. Smaller fragments will generally find use in hybridization embodiments, wherein the length of the contiguous complementary region may be varied, such as between about 15, 20, 25, 30, 35, 40, 45, 50, 55, 60 or so and up to and including larger contiguous complementary sequences, including those of about 70, 80, 90, 100, 120, 140, 160, 180, or 200 or so nucleotides in length may also be used, according to the given desired goal, and the particular length of the complementary sequences one wishes to detect by hybridization analysis.

[0364] The use of a hybridization probe of about between about 20 and about 500 nucleotides in length allows the formation of a duplex molecule that is both stable and selective. Molecules having contiguous complementary sequences over stretches greater than about 20 or so bases in length are generally preferred, though, in order to increase stability and selectivity of the hybrid, and thereby improve the quality and degree of specific hybrid molecules obtained. One will generally prefer to design nucleic acid molecules having gene-complementary stretches of between about 25 and 300 or so contiguous nucleotides, or even longer where desired.

[0365] Hybridization probes may be selected from any portion of any of the sequences disclosed herein. All that is required is to review the disclosed sequences, or to any contiguous portion of such a sequence, from about 15 to 30 nucleotides in length up to and including the full length sequence, that one wishes to utilize as a probe or primer. The choice of probe and primer sequences may be governed by various factors. For example, one may wish to employ primers from towards the termini of the total sequence.

[0366] Small polynucleotide segments or fragments may be readily prepared by, for example, directly synthesizing the fragment by chemical means, as is commonly practiced using an automated oligonucleotide synthesizer. Also, fragments may be obtained by application of nucleic acid reproduction technology, such as the PCR.TM. technology of U.S. Pat. No. 4,683,202 (incorporated herein by reference), by introducing selected sequences into recombinant vectors for recombinant production, and by other recombinant DNA techniques generally known to those of skill in the art of molecular biology.

[0367] The nucleotide sequences of the invention may be used for their ability to selectively form duplex molecules with complementary stretches of the entire gene or gene fragments of interest. Depending on the application envisioned, one will typically desire to employ varying conditions of hybridization to achieve varying degrees of selectivity of probe towards target sequence. For applications requiring high selectivity, one will typically desire to employ relatively stringent conditions to form the hybrids, e.g., one will select relatively low salt and/or high temperature conditions, such as provided by a salt concentration of from about 0.02 M to about 0.15 M salt at temperatures of from about 50.degree. C. to about 70.degree. C. Such selective conditions tolerate little, if any, mismatch between the probe and the template or target strand, and would be particularly suitable for isolating related sequences.

[0368] Of course, for some applications, for example, where one desires to prepare mutants employing a mutant primer strand hybridized to an underlying template, less stringent (reduced stringency) hybridization conditions will typically be needed in order to allow formation of the heteroduplex. In these circumstances, one may desire to employ salt conditions such as those of from about 0.15 M to about 0.9 M salt, at temperatures ranging from about 20.degree. C. to about 55.degree. C. Cross-hybridizing species can thereby be readily identified as positively hybridizing signals with respect to control hybridizations. In any case, it is generally appreciated that conditions can be rendered more stringent by the addition of increasing amounts of formamide, which serves to destabilize the hybrid duplex in the same manner as increased temperature. Thus, hybridization conditions can be readily manipulated, and thus will generally be a method of choice depending on the desired results.

[0369] 4.11 Polynucleotide Identification and Characterization

[0370] Polynucleotides may be identified, prepared and/or manipulated using any of a variety of well established techniques. For example, a polynucleotide may be identified, as described in more detail below, by screening a microarray of cDNAs for tumor-associated expression (i.e., expression that is at least two fold greater in a tumor than in normal tissue, as determined using a representative assay provided herein). Such screens may be performed, for example, using a Synteni microarray (Palo Alto, Calif.) according to the manufacturer's instructions (and essentially as described by Schena et al., Proc. Natl. Acad. Sci. USA 93:10614-10619, 1996 and Heller et al., Proc. Natl. Acad. Sci. USA 94:2150-2155, 1997). Alternatively, polynucleotides may be amplified from cDNA prepared from cells expressing the proteins described herein, such as hematological malignancy-related tumor cells. Such polynucleotides may be amplified via polymerase chain reaction (PCR). For this approach, sequence-specific primers may be designed based on the sequences provided herein, and may be purchased or synthesized.

[0371] An amplified portion of a polynucleotide of the present invention may be used to isolate a full length gene from a suitable library (e.g., a hematological malignancy-related tumor cDNA library) using well known techniques. Within such techniques, a library (cDNA or genomic) is screened using one or more polynucleotide probes or primers suitable for amplification. Preferably, a library is size-selected to include larger molecules. Random primed libraries may also be preferred for identifying 5' and upstream regions of genes. Genomic libraries are preferred for obtaining introns and extending 5' sequences.

[0372] For hybridization techniques, a partial sequence may be labeled (e.g., by nick-translation or end-labeling with .sup.32P) using well known techniques. A bacterial or bacteriophage library is then generally screened by hybridizing filters containing denatured bacterial colonies (or lawns containing phage plaques) with the labeled probe (see Sambrook et al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratories, Cold Spring Harbor, N.Y., 1989). Hybridizing colonies or plaques are selected and expanded, and the DNA is isolated for further analysis. cDNA clones may be analyzed to determine the amount of additional sequence by, for example, PCR using a primer from the partial sequence and a primer from the vector. Restriction maps and partial sequences may be generated to identify one or more overlapping clones. The complete sequence may then be determined using standard techniques, which may involve generating a series of deletion clones. The resulting overlapping sequences can then assembled into a single contiguous sequence. A full length cDNA molecule can be generated by ligating suitable fragments, using well known techniques.

[0373] Alternatively, there are numerous amplification techniques for obtaining a full length coding sequence from a partial cDNA sequence. Within such techniques, amplification is generally performed via PCR. Any of a variety of commercially available kits may be used to perform the amplification step. Primers may be designed using, for example, software or algorithms or formulas well known in the art.

[0374] One such amplification technique is inverse PCR (see Triglia et al., Nucl. Acids Res. 16:8186, 1988), which uses restriction enzymes to generate a fragment in the known region of the gene. The fragment is then circularized by intramolecular ligation and used as a template for PCR with divergent primers derived from the known region. Within an alternative approach, sequences adjacent to a partial sequence may be retrieved by amplification with a primer to a linker sequence and a primer specific to a known region. The amplified sequences are typically subjected to a second round of amplification with the same linker primer and a second primer specific to the known region. A variation on this procedure, which employs two primers that initiate extension in opposite directions from the known sequence, is described in WO 96/38591. Another such technique is known as "rapid amplification of cDNA ends" or RACE. This technique involves the use of an internal primer and an external primer, which hybridizes to a polyA region or vector sequence, to identify sequences that are 5' and 3' of a known sequence. Additional techniques include capture PCR (Lagerstrom et al., PCR Methods Applic. 1:111-19, 1991) and walking PCR (Parker et al., Nucl. Acids. Res. 19:3055-60, 1991). Other methods employing amplification may also be employed to obtain a full length cDNA sequence.

[0375] In certain instances, it is possible to obtain a full length cDNA sequence by analysis of sequences provided in an expressed sequence tag (EST) database, such as that available from GenBank. Searches for overlapping ESTs may generally be performed using well known programs (e.g., NCBI BLAST searches), and such ESTs may be used to generate a contiguous full length sequence. Full length DNA sequences may also be obtained by analysis of genomic fragments.

[0376] 4.12 Polynucleotide Expression in Host Cells

[0377] In other embodiments of the invention, polynucleotide sequences or fragments thereof which encode polypeptides of the invention, or fusion proteins or functional equivalents thereof, may be used in recombinant DNA molecules to direct expression of a polypeptide in appropriate host cells. Due to the inherent degeneracy of the genetic code, other DNA sequences that encode substantially the same or a functionally equivalent amino acid sequence may be produced and these sequences may be used to clone and express a given polypeptide.

[0378] As will be understood by those of skill in the art, it may be advantageous in some instances to produce polypeptide-encoding nucleotide sequences possessing non-naturally occurring codons. For example, codons preferred by a particular prokaryotic or eukaryotic host can be selected to increase the rate of protein expression or to produce a recombinant RNA transcript having desirable properties, such as a half-life which is longer than that of a transcript generated from the naturally occurring sequence.

[0379] Moreover, the polynucleotide sequences of the present invention can be engineered using methods generally known in the art in order to alter polypeptide encoding sequences for a variety of reasons, including but not limited to, alterations which modify the cloning, processing, and/or expression of the gene product. For example, DNA shuffling by random fragmentation and PCR reassembly of gene fragments and synthetic oligonucleotides may be used to engineer the nucleotide sequences. In addition, site-directed mutagenesis may be used to insert new restriction sites, alter glycosylation patterns, change codon preference, produce splice variants, or introduce mutations, and so forth.

[0380] In another embodiment of the invention, natural, modified, or recombinant nucleic acid sequences may be ligated to a heterologous sequence to encode a fusion protein. For example, to screen peptide libraries for inhibitors of polypeptide activity, it may be useful to encode a chimeric protein that can be recognized by a commercially available antibody. A fusion protein may also be engineered to contain a cleavage site located between the polypeptide-encoding sequence and the heterologous protein sequence, so that the polypeptide may be cleaved and purified away from the heterologous moiety.

[0381] Sequences encoding a desired polypeptide may be synthesized, in whole or in part, using chemical methods well known in the art (see Caruthers, M. H. et al. (1980) Nucl. Acids Res. Symp. Ser. 215-223, Horn, T. et al. (1980) Nucl. Acids Res. Symp. Ser. 225-232). Alternatively, the protein itself may be produced using chemical methods to synthesize the amino acid sequence of a polypeptide, or a portion thereof. For example, peptide synthesis can be performed using various solid-phase techniques (Roberge, J. Y. et al. (1995) Science 269:202-204) and automated synthesis may be achieved, for example, using the ABI 431A Peptide Synthesizer (Perkin Elmer, Palo Alto, Calif.).

[0382] A newly synthesized peptide may be substantially purified by preparative high performance liquid chromatography (e.g., Creighton, T. (1983) Proteins, Structures and Molecular Principles, WH Freeman and Co., New York, N.Y.) or other comparable techniques available in the art. The composition of the synthetic peptides may be confirmed by amino acid analysis or sequencing (e.g., the Edman degradation procedure). Additionally, the amino acid sequence of a polypeptide, or any part thereof, may be altered during direct synthesis and/or combined using chemical methods with sequences from other proteins, or any part thereof, to produce a variant polypeptide.

[0383] In order to express a desired polypeptide, the nucleotide sequences encoding the polypeptide, or functional equivalents, may be inserted into appropriate expression vector, i.e., a vector which contains the necessary elements for the transcription and translation of the inserted coding sequence. Methods which are well known to those skilled in the art may be used to construct expression vectors containing sequences encoding a polypeptide of interest and appropriate transcriptional and translational control elements. These methods include in vitro recombinant DNA techniques, synthetic techniques, and in vivo genetic recombination. Such techniques are described in Sambrook, J. et al. (1989) Molecular Cloning, A Laboratory Manual, Cold Spring Harbor Press, Plainview, N.Y., and Ausubel, F. M. et al. (1989) Current Protocols in Molecular Biology, John Wiley & Sons, New York. N.Y.

[0384] A variety of expression vector/host systems may be utilized to contain and express polynucleotide sequences. These include, but are not limited to, microorganisms such as bacteria transformed with recombinant bacteriophage, plasmid, or cosmid DNA expression vectors; yeast transformed with yeast expression vectors; insect cell systems infected with virus expression vectors (e.g., baculovirus); plant cell systems transformed with virus expression vectors (e.g., cauliflower mosaic virus, CaMV; tobacco mosaic virus, TMV) or with bacterial expression vectors (e.g., Ti or pBR322 plasmids); or animal cell systems.

[0385] The "control elements" or "regulatory sequences" present in an expression vector are those non-translated regions of the vector--enhancers, promoters, 5' and 3' untranslated regions--which interact with host cellular proteins to carry out transcription and translation. Such elements may vary in their strength and specificity. Depending on the vector system and host utilized, any number of suitable transcription and translation elements, including constitutive and inducible promoters, may be used. For example, when cloning in bacterial systems, inducible promoters such as the hybrid lacZ promoter of the PBLUESCRIPT phagemid (Stratagene, La Jolla, Calif.) or PSPORT1 plasmid (Gibco BRL, Gaithersburg, Md.) and the like may be used. In mammalian cell systems, promoters from mammalian genes or from mammalian viruses are generally preferred. If it is necessary to generate a cell line that contains multiple copies of the sequence encoding a polypeptide, vectors based on SV40 or EBV may be advantageously used with an appropriate selectable marker.

[0386] In bacterial systems, a number of expression vectors may be selected depending upon the use intended for the expressed polypeptide. For example, when large quantities are needed, for example for the induction of antibodies, vectors which direct high level expression of fusion proteins that are readily purified may be used. Such vectors include, but are not limited to, the multifunctional E. coli cloning and expression vectors such as BLUESCRIPT (Stratagene), in which the sequence encoding the polypeptide of interest may be ligated into the vector in frame with sequences for the amino-terminal Met and the subsequent 7 residues of .beta.-galactosidase so that a hybrid protein is produced; pIN vectors (Van Heeke, G. and S. M. Schuster (1989) J. Biol. Chem. 264:5503-5509); and the like. pGEX Vectors (Promega, Madison, Wis.) may also be used to express foreign polypeptides as fusion proteins with glutathione S-transferase (GST). In general, such fusion proteins are soluble and can easily be purified from lysed cells by adsorption to glutathione-agarose beads followed by elution in the presence of free glutathione. Proteins made in such systems may be designed to include heparin, thrombin, or factor XA protease cleavage sites so that the cloned polypeptide of interest can be released from the GST moiety at will.

[0387] In the yeast, Saccharomyces cerevisiae, a number of vectors containing constitutive or inducible promoters such as alpha factor, alcohol oxidase, and PGH may be used. For reviews, see Ausubel et al. (supra) and Grant et al. (1987) Methods Enzymol. 153:516-544.

[0388] In cases where plant expression vectors are used, the expression of sequences encoding polypeptides may be driven by any of a number of promoters. For example, viral promoters such as the 35S and 19S promoters of CaMV may be used alone or in combination with the omega leader sequence from TMV (Takamatsu, N. (1987) EMBO J. 6:307-311. Alternatively, plant promoters such as the small subunit of RUBISCO or heat shock promoters may be used (Coruzzi, G. et al. (1984) EMBO J. 3:1671-1680; Broglie, R. et al. (1984) Science 224:838-843; and Winter, J. et al. (1991) Results Probl. Cell Differ. 17:85-105). These constructs can be introduced into plant cells by direct DNA transformation or pathogen-mediated transfection. Such techniques are described in a number of generally available reviews (see, for example, Hobbs, S. or Murry, L. E. in McGraw Hill Yearbook of Science and Technology (1992) McGraw Hill, New York, N.Y.; pp. 191-196).

[0389] An insect system may also be used to express a polypeptide of interest. For example, in one such system, Autographa californica nuclear polyhedrosis virus (AcNPV) is used as a vector to express foreign genes in Spodoptera frugiperda cells or in Trichoplusia larvae. The sequences encoding the polypeptide may be cloned into a non-essential region of the virus, such as the polyhedrin gene, and placed under control of the polyhedrin promoter. Successful insertion of the polypeptide-encoding sequence will render the polyhedrin gene inactive and produce recombinant virus lacking coat protein. The recombinant viruses may then be used to infect, for example, S. frugiperda cells or Trichoplusia larvae in which the polypeptide of interest may be expressed (Engelhard, E. K. et al. (1994) Proc. Natl. Acad. Sci. 91:3224-3227).

[0390] In mammalian host cells, a number of viral-based expression systems are generally available. For example, in cases where an adenovirus is used as an expression vector, sequences encoding a polypeptide of interest may be ligated into an adenovirus transcription/translation complex consisting of the late promoter and tripartite leader sequence. Insertion in a non-essential E1 or E3 region of the viral genome may be used to obtain a viable virus which is capable of expressing the polypeptide in infected host cells (Logan, J. and Shenk, T. (1984) Proc. Natl. Acad. Sci. 81:3655-3659). In addition, transcription enhancers, such as the Rous sarcoma virus (RSV) enhancer, may be used to increase expression in mammalian host cells.

[0391] Specific initiation signals may also be used to achieve more efficient translation of sequences encoding a polypeptide of interest. Such signals include the ATG initiation codon and adjacent sequences. In cases where sequences encoding the polypeptide, its initiation codon, and upstream sequences are inserted into the appropriate expression vector, no additional transcriptional or translational control signals may be needed. However, in cases where only coding sequence, or a portion thereof, is inserted, exogenous translational control signals including the ATG initiation codon should be provided. Furthermore, the initiation codon should be in the correct reading frame to ensure translation of the entire insert. Exogenous translational elements and initiation codons may be of various origins, both natural and synthetic. The efficiency of expression may be enhanced by the inclusion of enhancers which are appropriate for the particular cell system which is used, such as those described in the literature (Scharf, D. et al. (1994) Results Probl. Cell Differ. 20:125-162).

[0392] In addition, a host cell strain may be chosen for its ability to modulate the expression of the inserted sequences or to process the expressed protein in the desired fashion. Such modifications of the polypeptide include, but are not limited to, acetylation, carboxylation. glycosylation, phosphorylation, lipidation, and acylation. Post-translational processing which cleaves a "prepro" form of the protein may also be used to facilitate correct insertion, folding and/or function. Different host cells such as CHO, HeLa, MDCK, HEK293, and WI38, which have specific cellular machinery and characteristic mechanisms for such post-translational activities, may be chosen to ensure the correct modification and processing of the foreign protein.

[0393] For long-term, high-yield production of recombinant proteins, stable expression is generally preferred. For example, cell lines which stably express a polynucleotide of interest may be transformed using expression vectors which may contain viral origins of replication and/or endogenous expression elements and a selectable marker gene on the same or on a separate vector. Following the introduction of the vector, cells may be allowed to grow for 1-2 days in an enriched media before they are switched to selective media. The purpose of the selectable marker is to confer resistance to selection, and its presence allows growth and recovery of cells which successfully express the introduced sequences. Resistant clones of stably transformed cells may be proliferated using tissue culture techniques appropriate to the cell type.

[0394] Any number of selection systems may be used to recover transformed cell lines. These include, but are not limited to, the herpes simplex virus thymidine kinase (Wigler, M. et al. (1977) Cell 11:223-32) and adenine phosphoribosyltransferase (Lowy, I. et al. (1990) Cell 22:817-23) genes which can be employed in tk.sup.- or aprt.sup.-cells, respectively. Also, antimetabolite, antibiotic or herbicide resistance can be used as the basis for selection; for example, dhfr which confers resistance to methotrexate (Wigler, M. et al. (1980) Proc. Natl. Acad. Sci. 77:3567-70); npt, which confers resistance to the aminoglycosides, neomycin and G-418 (Colbere-Garapin, F. et al (1981) J. Mol. Biol. 150:1-14); and als or pat, which confer resistance to chlorsulfuron and phosphinotricin acetyltransferase, respectively (Murry, supra). Additional selectable genes have been described, for example, trpB, which allows cells to utilize indole in place of tryptophan, or hisD, which allows cells to utilize histinol in place of histidine (Hartman, S. C. and R. C. Mulligan (1988) Proc. Natl. Acad. Sci. 85:8047-51). Recently, the use of visible markers has gained popularity with such markers as anthocyanins, beta-glucuronidase and its substrate GUS, and luciferase and its substrate luciferin, being widely used not only to identify transformants, but also to quantify the amount of transient or stable protein expression attributable to a specific vector system (Rhodes, C. A. et al. (1995) Methods Mol. Biol. 55:121-131).

[0395] Although the presence/absence of marker gene expression suggests that the gene of interest is also present, its presence and expression may need to be confirmed. For example, if the sequence encoding a polypeptide is inserted within a marker gene sequence, recombinant cells containing sequences can be identified by the absence of marker gene function. Alternatively, a marker gene can be placed in tandem with a polypeptide-encoding sequence under the control of a single promoter. Expression of the marker gene in response to induction or selection usually indicates expression of the tandem gene as well.

[0396] Alternatively, host cells which contain and express a desired polynucleotide sequence may be identified by a variety of procedures known to those of skill in the art. These procedures include, but are not limited to, DNA-DNA or DNA-RNA hybridizations and protein bioassay or immunoassay techniques which include membrane, solution, or chip based technologies for the detection and/or quantification of nucleic acid or protein.

[0397] A variety of protocols for detecting and measuring the expression of polynucleotide-encoded products, using either polyclonal or monoclonal antibodies specific for the product are known in the art. Examples include enzyme-linked immunosorbent assay (ELISA), radioimmunoassay (RIA), and fluorescence activated cell sorting (FACS). A two-site, monoclonal-based immunoassay utilizing monoclonal antibodies reactive to two non-interfering epitopes on a given polypeptide may be preferred for some applications, but a competitive binding assay may also be employed. These and other assays are described, among other places, in Hampton, R. et al. (1990; Serological Methods, a Laboratory Manual, APS Press, St Paul. Minn.) and Maddox, D. E. et al. (1983; J Exp. Med. 158:1211-1216).

[0398] A wide variety of labels and conjugation techniques are known by those skilled in the art and may be used in various nucleic acid and amino acid assays. Means for producing labeled hybridization or PCR probes for detecting sequences related to polynucleotides include oligolabeling, nick translation, end-labeling or PCR amplification using a labeled nucleotide. Alternatively, the sequences, or any portions thereof may be cloned into a vector for the production of an mRNA probe. Such vectors are known in the art, are commercially available, and may be used to synthesize RNA probes in vitro by addition of an appropriate RNA polymerase such as T7, T3, or SP6 and labeled nucleotides. These procedures may be conducted using a-variety of commercially available kits. Suitable reporter molecules or labels, which may be used include radionuclides, enzymes, fluorescent, chemiluminescent, or chromogenic agents as well as substrates, cofactors, inhibitors, magnetic particles, and the like.

[0399] Host cells transformed with a polynucleotide sequence of interest may be cultured under conditions suitable for the expression and recovery of the protein from cell culture. The protein produced by a recombinant cell may be secreted or contained intracellularly depending on the sequence and/or the vector used. As will be understood by those of skill in the art, expression vectors containing polynucleotides of the invention may be designed to contain signal sequences which direct secretion of the encoded polypeptide through a prokaryotic or eukaryotic cell membrane. Other recombinant constructions may be used to join sequences encoding a polypeptide of interest to nucleotide sequence encoding a polypeptide domain which will facilitate purification of soluble proteins. Such purification facilitating domains include, but are not limited to, metal chelating peptides such as histidine-tryptophan modules that allow purification on immobilized metals, protein A domains that allow purification on immobilized immunoglobulin, and the domain utilized in the FLAGS extension/affinity purification system (Immunex Corp., Seattle, Wash.). The inclusion of cleavable linker sequences such as those specific for Factor XA or enterokinase (Invitrogen. San Diego, Calif.) between the purification domain and the encoded polypeptide may be used to facilitate purification. One such expression vector provides for expression of a fusion protein containing a polypeptide of interest and a nucleic acid encoding 6 histidine residues preceding a thioredoxin or an enterokinase cleavage site. The histidine residues facilitate purification on IMIAC (immobilized metal ion affinity chromatography) as described in Porath, J. et al. (1992, Prot. Exp. Purif. 3:263-281) while the enterokinase cleavage site provides a means for purifying the desired polypeptide from the fusion protein. A discussion of vectors which contain fusion proteins is provided in Kroll, D. J. et al. (1993; DNA Cell Biol. 12:441-453).

[0400] In addition to recombinant production methods, polypeptides of the invention, and fragments thereof, may be produced by direct peptide synthesis using solid-phase techniques (Merrifield J. (1963) J. Am. Chem. Soc. 85:2149-2154). Protein synthesis may be performed using manual techniques or by automation. Automated synthesis may be achieved, for example, using Applied Biosystems 431A Peptide Synthesizer (Perkin Elmer). Alternatively, various fragments may be chemically synthesized separately and combined using chemical methods to produce the full length molecule.

[0401] 4.13 Site-Specific Mutagenesis

[0402] Site-specific mutagenesis is a technique useful in the preparation of individual peptides, or biologically functional equivalent polypeptides, through specific mutagenesis of the underlying polynucleotides that encode them. The technique, well-known to those of skill in the art, further provides a ready ability to prepare and test sequence variants, for example, incorporating one or more of the foregoing considerations, by introducing one or more nucleotide sequence changes into the DNA. Site-specific mutagenesis allows the production of mutants through the use of specific oligonucleotide sequences which encode the DNA sequence of the desired mutation, as well as a sufficient number of adjacent nucleotides, to provide a primer sequence of sufficient size and sequence complexity to form a stable duplex on both sides of the deletion junction being traversed. Mutations may be employed in a selected polynucleotide sequence to improve, alter, decrease, modify, or otherwise change the properties of the polynucleotide itself, and/or alter the properties, activity, composition, stability, or primary sequence of the encoded polypeptide.

[0403] In certain embodiments of the present invention, the inventors contemplate the mutagenesis of the disclosed polynucleotide sequences to alter one or more properties of the encoded polypeptide, such as the antigenicity of a polypeptide vaccine. The techniques of site-specific mutagenesis are well-known in the art, and are widely used to create variants of both polypeptides and polynucleotides. For example, site-specific mutagenesis is often used to alter a specific portion of a DNA molecule. In such embodiments, a primer comprising typically about 14 to about 25 nucleotides or so in length is employed, with about 5 to about 10 residues on both sides of the junction of the sequence being altered.

[0404] As will be appreciated by those of skill in the art, site-specific mutagenesis techniques have often employed a phage vector that exists in both a single stranded and double stranded form. Typical vectors useful in site-directed mutagenesis include vectors such as the M13 phage. These phage are readily commercially-available and their use is generally well-known to those skilled in the art. Double-stranded plasmids are also routinely employed in site directed mutagenesis that eliminates the step of transferring the gene of interest from a plasmid to a phage.

[0405] In general, site-directed mutagenesis in accordance herewith is performed by first obtaining a single-stranded vector or melting apart of two strands of a double-stranded vector that includes within its sequence a DNA sequence that encodes the desired peptide. An oligonucleotide primer bearing the desired mutated sequence is prepared, generally synthetically. This primer is then annealed with the single-stranded vector, and subjected to DNA polymerizing enzymes such as E. coli polymerase I Klenow fragment, in order to complete the synthesis of the mutation-bearing strand. Thus, a heteroduplex is formed wherein one strand encodes the original non-mutated sequence and the second strand bears the desired mutation. This heteroduplex vector is then used to transform appropriate cells, such as E. coli cells, and clones are selected which include recombinant vectors bearing the mutated sequence arrangement.

[0406] The preparation of sequence variants of the selected peptide-encoding DNA segments using site-directed mutagenesis provides a means of producing potentially useful species and is not meant to be limiting as there are other ways in which sequence variants of peptides and the DNA sequences encoding them may be obtained. For example, recombinant vectors encoding the desired peptide sequence may be treated with mutagenic agents, such as hydroxylamine, to obtain sequence variants. Specific details regarding these methods and protocols are found in the teachings of Maloy et al., 1994; Segal, 1976; Prokop and Bajpai, 1991; Kuby, 1994; and Maniatis et al., 1982, each incorporated herein by reference, for that purpose.

[0407] As used herein, the term "oligonucleotide directed mutagenesis procedure" refers to template-dependent processes and vector-mediated propagation which result in an increase in the concentration of a specific nucleic acid molecule relative to its initial concentration, or in an increase in the concentration of a detectable signal, such as amplification. As used herein, the term "oligonucleotide directed mutagenesis procedure" is intended to refer to a process that involves the template-dependent extension of a primer molecule. The term template dependent process refers to nucleic acid synthesis of an RNA or a DNA molecule wherein the sequence of the newly synthesized strand of nucleic acid is dictated by the well-known rules of complementary base pairing (see, for example, Watson, 1987). Typically, vector mediated methodologies involve the introduction of the nucleic acid fragment into a DNA or RNA vector, the clonal amplification of the vector, and the recovery of the amplified nucleic acid fragment. Examples of such methodologies are provided by U.S. Pat. No. 4,237,224, specifically incorporated herein by reference in its entirety.

[0408] 4.14 Polynucleotide Amplification Techniques

[0409] A number of template dependent processes are available to amplify the target sequences of interest present in a sample. One of the best known amplification methods is the polymerase chain reaction (PCR.TM.) which is described in detail in U.S. Pat. Nos. 4,683,195, 4,683,202 and 4,800,159, each of which is incorporated herein by reference in its entirety. Briefly, in PCR.TM., two primer sequences are prepared which are complementary to regions on opposite complementary strands of the target sequence. An excess of deoxynucleoside triphosphates is added to a reaction mixture along with a DNA polymerase (e.g., Taq polymerase). If the target sequence is present in a sample, the primers will bind to the target and the polymerase will cause the primers to be extended along the target sequence by adding on nucleotides. By raising and lowering the temperature of the reaction mixture, the extended primers will dissociate from the target to form reaction products, excess primers will bind to the target and to the reaction product and the process is repeated. Preferably reverse transcription and PCR.TM. amplification procedure may be performed in order to quantify the amount of mRNA amplified. Polymerase chain reaction methodologies are well known in the art.

[0410] Another method for amplification is the ligase chain reaction (referred to as LCR), disclosed in Eur. Pat. Appl. Publ. No. 320,308 (specifically incorporated herein by reference in its entirety). In LCR, two complementary probe pairs are prepared, and in the presence of the target sequence, each pair will bind to opposite complementary strands of the target such that they abut. In the presence of a ligase, the two probe pairs will link to form a single unit. By temperature cycling, as in PCR.TM., bound ligated units dissociate from the target and then serve as "target sequences" for ligation of excess probe pairs. U.S. Pat. No. 4,883,750, incorporated herein by reference in its entirety, describes an alternative method of amplification similar to LCR for binding probe pairs to a target sequence.

[0411] Qbeta Replicase, described in PCT Intl. Pat. Appl. Publ. No. PCT/US87/00880, incorporated herein by reference in its entirety, may also be used as still another amplification method in the present invention. In this method, a replicative sequence of RNA that has a region complementary to that of a target is added to a sample in the presence of an RNA polymerase. The polymerase will copy the replicative sequence that can then be detected.

[0412] An isothermal amplification method, in which restriction endonucleases and ligases are used to achieve the amplification of target molecules that contain nucleotide 5'-[.alpha.-thio]triphosphates in one strand of a restriction site (Walker et al., 1992, incorporated herein by reference in its entirety), may also be useful in the amplification of nucleic acids in the present invention.

[0413] Strand Displacement Amplification (SDA) is another method of carrying out isothermal amplification of nucleic acids which involves multiple rounds of strand displacement and synthesis, i.e. nick translation. A similar method, called Repair Chain Reaction (RCR) is another method of amplification which may be useful in the present invention and is involves annealing several probes throughout a region targeted for amplification, followed by a repair reaction in which only two of the four bases are present. The other two bases can be added as biotinylated derivatives for easy detection. A similar approach is used in SDA.

[0414] Sequences can also be detected using a cyclic probe reaction (CPR). In CPR, a probe having a 3' and 5' sequences of non-target DNA and an internal or "middle" sequence of the target protein specific RNA is hybridized to DNA which is present in a sample. Upon hybridization, the reaction is treated with RNaseH, and the products of the probe are identified as distinctive products by generating a signal that is released after digestion. The original template is annealed to another cycling probe and the reaction is repeated. Thus, CPR involves amplifying a signal generated by hybridization of a probe to a target gene specific expressed nucleic acid.

[0415] Still other amplification methods described in Great Britain Pat. Appl. No. 2 202 328, and in PCT Intl. Pat. Appl. Publ. No. PCT/US89/01025, each of which is incorporated herein by reference in its entirety, may be used in accordance with the present invention. In the former application, "modified" primers are used in a PCR-like, template and enzyme dependent synthesis. The primers may be modified by labeling with a capture moiety (e.g., biotin) and/or a detector moiety (e.g., enzyme). In the latter application, an excess of labeled probes is added to a sample. In the presence of the target sequence, the probe binds and is cleaved catalytically. After cleavage, the target sequence is released intact to be bound by excess probe. Cleavage of the labeled probe signals the presence of the target sequence.

[0416] Other nucleic acid amplification procedures include transcription-based amplification systems (TAS) (Kwoh et al., 1989; PCT Intl. Pat. Appl. Publ. No. WO 88/10315, incorporated herein by reference in its entirety), including nucleic acid sequence based amplification (NASBA) and 3SR. In NASBA, the nucleic acids can be prepared for amplification by standard phenol/chloroform extraction, heat denaturation of a sample, treatment with lysis buffer and minispin columns for isolation of DNA and RNA or guanidinium chloride extraction of RNA. These amplification techniques involve annealing a primer that has sequences specific to the target sequence. Following polymerization, DNA/RNA hybrids are digested with RNase H while double stranded DNA molecules are heat-denatured again. In either case the single stranded DNA is made fully double stranded by addition of second target-specific primer, followed by polymerization. The double stranded DNA molecules are then multiply transcribed by a polymerase such as T7 or SP6. In an isothermal cyclic reaction, the RNAs are reverse transcribed into DNA, and transcribed once again with a polymerase such as T7 or SP6. The resulting products, whether truncated or complete, indicate target-specific sequences.

[0417] Eur. Pat. Appl. Publ. No. 329,822, incorporated herein by reference in its entirety, disclose a nucleic acid amplification process involving cyclically synthesizing single-stranded RNA ("ssRNA"), ssDNA, and double-stranded DNA (dsDNA), which may be used in accordance with the present invention. The ssRNA is a first template for a first primer oligonucleotide, which is elongated by reverse transcriptase (RNA-dependent DNA polymerase). The RNA is then removed from resulting DNA:RNA duplex by the action of ribonuclease H (RNase H, an RNase specific for RNA in a duplex with either DNA or RNA). The resultant ssDNA is a second template for a second primer, which also includes the sequences of an RNA polymerase promoter (exemplified by T7 RNA polymerase) 5' to its homology to its template. This primer is then extended by DNA polymerase (exemplified by the large "Klenow" fragment of E. coli DNA polymerase I), resulting as a double-stranded DNA ("dsDNA") molecule, having a sequence identical to that of the original RNA between the primers and having additionally, at one end, a promoter sequence. This promoter sequence can be used by the appropriate RNA polymerase to make many RNA copies of the DNA. These copies can then re-enter the cycle leading to very swift amplification. With proper choice of enzymes, this amplification can be done isothermally without addition of enzymes at each cycle. Because of the cyclical nature of this process, the starting sequence can be chosen to be in the form of either DNA or RNA.

[0418] PCT Intl. Pat. Appl. Publ. No. WO 89/06700, incorporated herein by reference in its entirety, disclose a nucleic acid sequence amplification scheme based on the hybridization of a promoter/primer sequence to a target single-stranded DNA ("ssDNA") followed by transcription of many RNA copies of the sequence. This scheme is not cyclic; i.e. new templates are not produced from the resultant RNA transcripts. Other amplification methods include "RACE" (Frohman, 1990), and "one-sided PCR" (Ohara, 1989) which are well-known to those of skill in the art.

[0419] Methods based on ligation of two (or more) oligonucleotides in the presence of nucleic acid having the sequence of the resulting "di-oligonucleotide", thereby amplifying the di-oligonucleotide (Wu and Dean, 1996, incorporated herein by reference in its entirety), may also be used in the amplification of DNA sequences of the present invention.

[0420] 4.15 In Vivo Polynucleotide Delivery Techniques

[0421] In additional embodiments, genetic constructs comprising one or more of the polynucleotides of the invention are introduced into cells in vivo. This may be achieved using any of a variety or well known approaches, several of which are outlined below for the purpose of illustration.

[0422] 4.15.1 Adenovirus

[0423] One of the preferred methods for in vivo delivery of one or more nucleic acid sequences involves the use of an adenovirus expression vector. "Adenovirus expression vector" is meant to include those constructs containing adenovirus sequences sufficient to (a) support packaging of the construct and (b) to express a polynucleotide that has been cloned therein in a sense or antisense orientation. Of course, in the context of an antisense construct, expression does not require that the gene product be synthesized.

[0424] The expression vector comprises a genetically engineered form of an adenovirus. Knowledge of the genetic organization of adenovirus, a 36 kb, linear, double-stranded DNA virus, allows substitution of large pieces of adenoviral DNA with foreign sequences up to 7 kb (Grunhaus and Horwitz, 1992). In contrast to retrovirus, the adenoviral infection of host cells does not result in chromosomal integration because adenoviral DNA can replicate in an episomal manner without potential genotoxicity. Also, adenoviruses are structurally stable, and no genome rearrangement has been detected after extensive amplification. Adenovirus can infect virtually all epithelial cells regardless of their cell cycle stage. So far, adenoviral infection appears to be linked only to mild disease such as acute respiratory disease in humans.

[0425] Adenovirus is particularly suitable for use as a gene transfer vector because of its mid-sized genome, ease of manipulation, high titer, wide target-cell range and high infectivity. Both ends of the viral genome contain 100-200 base pair inverted repeats (ITRs), which are cis elements necessary for viral DNA replication and packaging. The early (E) and late (L) regions of the genome contain different transcription units that are divided by the onset of viral DNA replication. The E1 region (E1A and E1B) encodes proteins responsible for the regulation of transcription of the viral genome and a few cellular genes. The expression of the E2 region (E2A and E2B) results in the synthesis of the proteins for viral DNA replication. These proteins are involved in DNA replication, late gene expression and host cell shut-off (Renan, 1990). The products of the late genes, including the majority of the viral capsid proteins, are expressed only after significant processing of a single primary transcript issued by the major late promoter (MLP). The MLP, (located at 16.8 m.u.) is particularly efficient during the late phase of infection, and all the mRNA's issued from this promoter possess a 5'-tripartite leader (TPL) sequence which makes them preferred mRNA's for translation.

[0426] In a current system, recombinant adenovirus is generated from homologous recombination between shuttle vector and provirus vector. Due to the possible recombination between two proviral vectors, wild-type adenovirus may be generated from this process. Therefore, it is critical to isolate a single clone of virus from an individual plaque and examine its genomic structure.

[0427] Generation and propagation of the current adenovirus vectors, which are replication deficient, depend on a unique helper cell line, designated 293, which was transformed from human embryonic kidney cells by Ad5 DNA fragments and constitutively expresses E1 proteins (Graham et al., 1977). Since the E3 region is dispensable from the adenovirus genome (Jones and Shenk, 1978), the current adenovirus vectors, with the help of 293 cells, carry foreign DNA in either the E1, the D3 or both regions (Graham and Preve., 1991). In nature, adenovirus can package approximately 105% of the wild-type genome (Ghosh-Choudhury et al., 1987), providing capacity for about 2 extra kB of DNA. Combined with the approximately 5.5 kB of DNA that is replaceable in the E1 and E3 regions, the maximum capacity of the current adenovirus vector is under 7.5 kB, or about 15% of the total length of the vector. More than 80% of the adenovirus viral genome remains in the vector backbone and is the source of vector-borne cytotoxicity. Also, the replication deficiency of the E1-deleted virus is incomplete. For example, leakage of viral gene expression has been observed with the currently available vectors at high multiplicities of infection (MOI) (Mulligan, 1993).

[0428] Helper cell lines may be derived from human cells such as human embryonic kidney cells, muscle cells, hematopoietic cells or other human embryonic mesenchymal or epithelial cells. Alternatively, the helper cells may be derived from the cells of other mammalian species that are permissive for human adenovirus. Such cells include, e.g., Vero cells or other monkey embryonic mesenchymal or epithelial cells. As stated above, the currently preferred helper cell line is 293.

[0429] Recently, Racher et al. (1995) disclosed improved methods for culturing 293 cells and propagating adenovirus. In one format, natural cell aggregates are grown by inoculating individual cells into 1 liter siliconized spinner flasks (Techne, Cambridge, UK) containing 100-200 ml of medium. Following stirring at 40 rpm, the cell viability is estimated with trypan blue. In another format, Fibra-Cel microcarriers (Bibby Sterlin, Stone, UK) (5 g/l) is employed as follows. A cell inoculum, resuspended in 5 ml of medium, is added to the carrier (50 ml) in a 250 ml Erlenmeyer flask and left stationary, with occasional agitation, for 1 to 4 h. The medium is then replaced with 50 ml of fresh medium and shaking initiated. For virus production, cells are allowed to grow to about 80% confluence, after which time the medium is replaced (to 25% of the final volume) and adenovirus added at an MOI of 0.05. Cultures are left stationary overnight, following which the volume is increased to 100% and shaking commenced for another 72 h.

[0430] Other than the requirement that the adenovirus vector be replication defective, or at least conditionally defective, the nature of the adenovirus vector is not believed to be crucial to the successful practice of the invention. The adenovirus may be of any of the 42 different known serotypes or subgroups A-F. Adenovirus type 5 of subgroup C is the preferred starting material in order to obtain a conditional replication-defective adenovirus vector for use in the present invention, since Adenovirus type 5 is a human adenovirus about which a great deal of biochemical and genetic information is known, and it has historically been used for most constructions employing adenovirus as a vector.

[0431] As stated above, the typical vector according to the present invention is replication defective and will not have an adenovirus E1 region. Thus, it will be most convenient to introduce the polynucleotide encoding the gene of interest at the position from which the E1-coding sequences have been removed. However, the position of insertion of the construct within the adenovirus sequences is not critical to the invention. The polynucleotide encoding the gene of interest may also be inserted in lieu of the deleted E3 region in E3 replacement vectors as described by Karlsson et al. (1986) or in the E4 region where a helper cell line or helper virus complements the E4 defect.

[0432] Adenovirus is easy to grow and manipulate and exhibits broad host range in vitro and in vivo. This group of viruses can be obtained in high titers, e.g., 10.sup.9-10.sup.11 plaque-forming units per ml, and they are highly infective. The life cycle of adenovirus does not require integration into the host cell genome. The foreign genes delivered by adenovirus vectors are episomal and, therefore, have low genotoxicity to host cells. No side effects have been reported in studies of vaccination with wild-type adenovirus (Couch et al., 1963; Top et al., 1971), demonstrating their safety and therapeutic potential as in vivo gene transfer vectors.

[0433] Adenovirus vectors have been used in eukaryotic gene expression (Levrero et al., 1991; Gomez-Foix et al., 1992) and vaccine development (Grunhaus and Horwitz, 1992; Graham and Prevec, 1992). Recently, animal studies suggested that recombinant adenovirus could be used for gene therapy (Stratford-Perricaudet and Perricaudet, 1991; Stratford-Perricaudet et al., 1990; Rich et al., 1993). Studies in administering recombinant adenovirus to different tissues include trachea instillation (Rosenfeld et al., 1991; Rosenfeld et al., 1992), muscle injection (Ragot et al., 1993), peripheral intravenous injections (Herz and Gerard, 1993) and stereotactic inoculation into the brain (Le Gal La Salle et al., 1993).

[0434] 4.15.2 Retroviruses

[0435] The retroviruses are a group of single-stranded RNA viruses characterized by an ability to convert their RNA to double-Stranded DNA in infected cells by a process of reverse-transcription (Coffin, 1990). The resulting DNA then stably integrates into cellular chromosomes as a provirus and directs synthesis of viral proteins. The integration results in the retention of the viral gene sequences in the recipient cell and its descendants. The retroviral genome contains three genes, gag, pol, and env that code for capsid proteins, polymerase enzyme, and envelope components, respectively. A sequence found upstream from the gag gene contains a signal for packaging of the genome into virions. Two long terminal repeat (LTR) sequences are present at the 5' and 3' ends of the viral genome. These contain strong promoter and enhancer sequences and are also required for integration in the host cell genome (Coffin, 1990).

[0436] In order to construct a retroviral vector, a nucleic acid encoding one or more oligonucleotide or polynucleotide sequences of interest is inserted into the viral genome in the place of certain viral sequences to produce a virus that is replication-defective. In order to produce virions, a packaging cell line containing the gag, pol, and env genes but without the LTR and packaging components is constructed (Mann et al., 1983). When a recombinant plasmid containing a cDNA, together with the retroviral LTR and packaging sequences is introduced into this cell line (by calcium phosphate precipitation for example), the packaging sequence allows the RNA transcript of the recombinant plasmid to be packaged into viral particles, which are then secreted into the culture media (Nicolas and Rubenstein, 1988; Temin, 1986; Mann et al., 1983). The media containing the recombinant retroviruses is then collected, optionally concentrated, and used for gene transfer. Retroviral vectors are able to infect a broad variety of cell types. However, integration and stable expression require the division of host cells (Paskind et al., 1975).

[0437] A novel approach designed to allow specific targeting of retrovirus vectors was recently developed based on the chemical modification of a retrovirus by the chemical addition of lactose residues to the viral envelope. This modification could permit the specific infection of hepatocytes via sialoglycoprotein receptors.

[0438] A different approach to targeting of recombinant retroviruses was designed in which biotinylated antibodies against a retroviral envelope protein and against a specific cell receptor were used. The antibodies were coupled via the biotin components by using streptavidin (Roux et al., 1989). Using antibodies against major histocompatibility complex class I and class II antigens, they demonstrated the infection of a variety of human cells that bore those surface antigens with an ecotropic virus in vitro (Roux et al., 1989).

[0439] 4.15.3 Adeno-Associated Viruses

[0440] AAV (Ridgeway, 1988; Hermonat and Muzycska, 1984) is a parovirus, discovered as a contamination of adenoviral stocks. It is a ubiquitous virus (antibodies are present in 85% of the US human population) that has not been linked to any disease. It is also classified as a dependovirus, because its replications is dependent on the presence of a helper virus, such as adenovirus. Five serotypes have been isolated, of which AAV-2 is the best characterized. AAV has a single-stranded linear DNA that is encapsidated into capsid proteins VP1, VP2 and VP3 to form an icosahedral virion of 20 to 24 nm in diameter (Muzyczka and McLaughlin, 1988).

[0441] The AAV DNA is approximately 4.7 kilobases long. It contains two open reading frames and is flanked by two ITRs. There are two major genes in the AAV genome: rep and cap. The rep gene codes for proteins responsible for viral replications, whereas cap codes for capsid protein VP1-3. Each ITR forms a T-shaped hairpin structure. These terminal repeats are the only essential cis components of the AAV for chromosomal integration. Therefore, the AAV can be used as a vector with all viral coding sequences removed and replaced by the cassette of genes for delivery. Three viral promoters have been identified and named p5, p19, and p40, according to their map position. Transcription from p5 and p19 results in production of rep proteins, and transcription from p40 produces the capsid proteins (Hermonat and Muzyczka, 1984).

[0442] There are several factors that prompted researchers to study the possibility of using rAAV as an expression vector. One is that the requirements for delivering a gene to integrate into the host chromosome are surprisingly few. It is necessary to have the 145-bp ITRs, which are only 6% of the AAV genome. This leaves room in the vector to assemble a 4.5-kb DNA insertion. While this carrying capacity may prevent the AAV from delivering large genes, it is amply suited for delivering the antisense constructs of the present invention.

[0443] AAV is also a good choice of delivery vehicles due to its safety. There is a relatively complicated rescue mechanism: not only wild type adenovirus but also AAV genes are required to mobilize rAAV. Likewise, AAV is not pathogenic and not associated with any disease. The removal of viral coding sequences minimizes immune reactions to viral gene expression, and therefore, rAAV does not evoke an inflammatory response.

[0444] 4.15.4 Other Viral Vectors as Expression Constructs

[0445] Other viral vectors may be employed as expression constructs in the present invention for the delivery of oligonucleotide or polynucleotide sequences to a host cell. Vectors derived from viruses such as vaccinia virus (Ridgeway, 1988; Coupar et al., 1988), lentiviruses, polio viruses and herpes viruses may be employed. They offer several attractive features for various mammalian cells (Friedmann, 1989; Ridgeway, 1988; Coupar et al., 1988; Horwich et al., 1990).

[0446] With the recent recognition of defective hepatitis B viruses, new insight was gained into the structure-function relationship of different viral sequences. In vitro studies showed that the virus could retain the ability for helper-dependent packaging and reverse transcription despite the deletion of up to 80% of its genome (Horwich et al., 1990). This suggested that large portions of the genome could be replaced with foreign genetic material. The hepatotropism and persistence (integration) were particularly attractive properties for liver-directed gene transfer. Chang et al. (1991) introduced the chloramphenicol acetyltransferase (CAT) gene into duck hepatitis B virus genome in the place of the polymerase, surface, and pre-surface coding sequences. It was cotransfected with wild-type virus into an avian hepatoma cell line. Culture media containing high titers of the recombinant virus were used to infect primary duckling hepatocytes. Stable CAT gene expression was detected for at least 24 days after transfection (Chang et al., 1991).

[0447] 4.15.5 Non-Viral Vectors

[0448] In order to effect expression of the oligonucleotide or polynucleotide sequences of the present invention, the expression construct must be delivered into a cell. This delivery may be accomplished in vitro, as in laboratory procedures for transforming cells lines, or in vivo or ex vivo, as in the treatment of certain disease states. As described above, one preferred mechanism for delivery is via viral infection where the expression construct is encapsulated in an infectious viral particle.

[0449] Once the expression construct has been delivered into the cell the nucleic acid encoding the desired oligonucleotide or polynucleotide sequences may be positioned and expressed at different sites. In certain embodiments, the nucleic acid encoding the construct may be stably integrated into the genome of the cell. This integration may be in the specific location and orientation via homologous recombination (gene replacement) or it may be integrated in a random, non-specific location (gene augmentation). In yet further embodiments, the nucleic acid may be stably maintained in the cell as a separate, episomal segment of DNA. Such nucleic acid segments or "episomes" encode sequences sufficient to permit maintenance and replication independent of or in synchronization with the host cell cycle. How the expression construct is delivered to a cell and where in the cell the nucleic acid remains is dependent on the type of expression construct employed.

[0450] In certain embodiments of the invention, the expression construct comprising one or more oligonucleotide or polynucleotide sequences may simply consist of naked recombinant DNA or plasmids. Transfer of the construct may be performed by any of the methods mentioned above which physically or chemically permeabilize the cell membrane. This is particularly applicable for transfer in vitro but it may be applied to in vivo use as well. Dubensky et al. (1984) successfully injected polyomavirus DNA in the form of calcium phosphate precipitates into liver and spleen of adult and newborn mice demonstrating active viral replication and acute infection. Benvenisty and Reshef (1986) also demonstrated that direct intraperitoneal injection of calcium phosphate-precipitated plasmids results in expression of the transfected genes. It is envisioned that DNA encoding a gene of interest may also be transferred in a similar manner in vivo and express the gene product.

[0451] Another embodiment of the invention for transferring a naked DNA expression construct into cells may involve particle bombardment. This method depends on the ability to accelerate DNA-coated microprojectiles to a high velocity allowing them to pierce cell membranes and enter cells without killing them (Klein et al., 1987). Several devices for accelerating small particles have been developed. One such device relies on a high voltage discharge to generate an electrical current, which in turn provides the motive force (Yang et al., 1990). The microprojectiles used have consisted of biologically inert substances such as tungsten or gold beads.

[0452] Selected organs including the liver, skin, and muscle tissue of rats and mice have been bombarded in vivo (Yang et al., 1990; Zelenin et al., 1991). This may require surgical exposure of the tissue or cells, to eliminate any intervening tissue between the gun and the target organ, i.e. ex vivo treatment. Again, DNA encoding a particular gene may be delivered via this method and still be incorporated by the present invention.

[0453] 4.16 Antisense Oligonucleotides

[0454] The end result of the flow of genetic information is the synthesis of protein. DNA is transcribed by polymerases into messenger RNA and translated on the ribosome to yield a folded, functional protein. Thus there are several steps along the route where protein synthesis can be inhibited. The native DNA segment coding for a polypeptide described herein, as all such mammalian DNA strands, has two strands: a sense strand and an antisense strand held together by hydrogen bonding. The messenger RNA coding for polypeptide has the same nucleotide sequence as the sense DNA strand except that the DNA thymidine is replaced by uridine. Thus, synthetic antisense nucleotide sequences will bind to a mRNA and inhibit expression of the protein encoded by that mRNA.

[0455] The targeting of antisense oligonucleotides to mRNA is thus one mechanism to shut down protein synthesis, and, consequently, represents a powerful and targeted therapeutic approach. For example, the synthesis of polygalactauronase and the muscarine type 2 acetylcholine receptor are inhibited by antisense oligonucleotides directed to their respective mRNA sequences (U.S. Pat. No. 5,739,119 and U.S. Pat. No. 5,759,829, each specifically incorporated herein by reference in its entirety). Further, examples of antisense inhibition have been demonstrated with the nuclear protein cyclin, the multiple drug resistance gene (MDG1), ICAM-1, E-selectin, STK-1, striatal GABA.sub.A receptor and human EGF (Jaskulski et al., 1988; Vasanthakumar and Ahmed, 1989; Peris et al., 1998; U.S. Pat. No. 5,801,154; U.S. Pat. No. 5,789,573; U.S. Pat. No. 5,718,709 and U.S. Pat. No. 5,610,288, each specifically incorporated herein by reference in its entirety). Antisense constructs have also been described that inhibit and can be used to treat a variety of abnormal cellular proliferations, e.g. cancer (U.S. Pat. No. 5,747,470; U.S. Pat. No. 5,591,317 and U.S. Pat. No. 5,783,683, each specifically incorporated herein by reference in its entirety).

[0456] Therefore, in exemplary embodiments, the invention provides oligonucleotide sequences that comprise all, or a portion of, any sequence that is capable of specifically binding to polynucleotide sequence described herein, or a complement thereof. In one embodiment, the antisense oligonucleotides comprise DNA or derivatives thereof. In another embodiment, the oligonucleotides comprise RNA or derivatives thereof. In a third embodiment, the oligonucleotides are modified DNAs comprising a phosphorothioated modified backbone. In a fourth embodiment, the oligonucleotide sequences comprise peptide nucleic acids or derivatives thereof. In each case, preferred compositions comprise a sequence region that is complementary, and more preferably substantially-complementary, and even more preferably, completely complementary to one or more portions of polynucleotides disclosed herein.

[0457] Selection of antisense compositions specific for a given gene sequence is based upon analysis of the chosen target sequence (i.e. in these illustrative examples the rat and human sequences) and determination of secondary structure, T.sub.m, binding energy, relative stability, and antisense compositions were selected based upon their relative inability to form dimers, hairpins, or other secondary structures that would reduce or prohibit specific binding to the target mRNA in a host cell.

[0458] Highly preferred target regions of the mRNA, are those which are at or near the AUG translation initiation codon, and those sequences which were substantially complementary to 5' regions of the mRNA. These secondary structure analyses and target site selection considerations were performed using v.4 of the OLIGO primer analysis software (Rychlik, 1997) and the BLASTN 2.0.5 algorithm software (Altschul et al., 1997).

[0459] The use of an antisense delivery method employing a short peptide vector, termed MPG (27 residues), is also contemplated. The MPG peptide contains a hydrophobic domain derived from the fusion sequence of HIV gp41 and a hydrophilic domain from the nuclear localization sequence of SV40 T-antigen (Morris et al., 1997). It has been demonstrated that several molecules of the MPG peptide coat the antisense oligonucleotides and can be delivered into cultured mammalian cells in less than 1 hour with relatively high efficiency (90%). Further, the interaction with MPG strongly increases both the stability of the oligonucleotide to nuclease and the ability to cross the plasma membrane (Morris et al., 1997).

[0460] 4.17 Ribozymes

[0461] Although proteins traditionally have been used for catalysis of nucleic acids, another class of macromolecules has emerged as useful in this endeavor. Ribozymes are RNA-protein complexes that cleave nucleic acids in a site-specific fashion. Ribozymes have specific catalytic domains that possess endonuclease activity (Kim and Cech, 1987; Gerlach et al., 1987; Forster and Symons, 1987). For example, a large number of ribozymes accelerate phosphoester transfer reactions with a high degree of specificity, often cleaving only one of several phosphoesters in an oligonucleotide substrate (Cech et al., 1981; Michel and Westhof, 1990; Reinhold-Hurek and Shub, 1992). This specificity has been attributed to the requirement that the substrate bind via specific base-pairing interactions to the internal guide sequence ("IGS") of the ribozyme prior to chemical reaction.

[0462] Ribozyme catalysis has primarily been observed as part of sequence-specific cleavage/ligation reactions involving nucleic acids (Joyce, 1989; Cech et al., 1981). For example, U.S. Pat. No. 5,354,855 (specifically incorporated herein by reference) reports that certain ribozymes can act as endonucleases with a sequence specificity greater than that of known ribonucleases and approaching that of the DNA restriction enzymes. Thus, sequence-specific ribozyme-mediated inhibition of gene expression may be particularly suited to therapeutic applications (Scanlon et al., 1991; Sarver et al., 1990). Recently, it was reported that ribozymes elicited genetic changes in some cells lines to which they were applied; the altered genes included the oncogenes H-ras, c-fos and genes of HIV. Most of this work involved the modification of a target mRNA, based on a specific mutant codon that is cleaved by a specific ribozyme.

[0463] Six basic varieties of naturally-occurring enzymatic RNAs are known presently. Each can catalyze the hydrolysis of RNA phosphodiester bonds in trans (and thus can cleave other RNA molecules) under physiological conditions. In general, enzymatic nucleic acids act by first binding to a target RNA. Such binding occurs through the target binding portion of a enzymatic nucleic acid which is held in close proximity to an enzymatic portion of the molecule that acts to cleave the target RNA. Thus, the enzymatic nucleic acid first recognizes and then binds a target RNA through complementary base-pairing, and once bound to the correct site, acts enzymatically to cut the target RNA. Strategic cleavage of such a target RNA will destroy its ability to direct synthesis of an encoded protein. After an enzymatic nucleic acid has bound and cleaved its RNA target, it is released from that RNA to search for another target and can repeatedly bind and cleave new targets.

[0464] The enzymatic nature of a ribozyme is advantageous over many technologies, such as antisense technology (where a nucleic acid molecule simply binds to a nucleic acid target to block its translation) since the concentration of ribozyme necessary to affect a therapeutic treatment is lower than that of an antisense oligonucleotide. This advantage reflects the ability of the ribozyme to act enzymatically. Thus, a single ribozyme molecule is able to cleave many molecules of target RNA. In addition, the ribozyme is a highly specific inhibitor, with the specificity of inhibition depending not only on the base pairing mechanism of binding to the target RNA, but also on the mechanism of target RNA cleavage. Single mismatches, or base-substitutions, near the site of cleavage can completely eliminate catalytic activity of a ribozyme. Similar mismatches in antisense molecules do not prevent their action (Woolf et al., 1992). Thus, the specificity of action of a ribozyme is greater than that of an antisense oligonucleotide binding the same RNA site.

[0465] The enzymatic nucleic acid molecule may be formed in a hammerhead, hairpin, a hepatitis .delta. virus, group I intron or RNaseP RNA (in association with an RNA guide sequence) or Neurospora VS RNA motif. Examples of hammerhead motifs are described by Rossi et al. (1992). Examples of hairpin motifs are described by Hampel et al. (Eur. Pat. Appl. Publ. No. EP 0360257), Hampel and Tritz (1989), Hampel et al. (1990) and U.S. Pat. No. 5,631,359 (specifically incorporated herein by reference). An example of the hepatitis .delta. virus motif is described by Perrotta and Been (1992); an example of the RNaseP motif is described by Guerrier-Takada et al. (1983); Neurospora VS RNA ribozyme motif is described by Collins (Saville and Collins, 1990; Saville and Collins, 1991; Collins and Olive, 1993); and an example of the Group I intron is described in (U.S. Pat. No. 4,987,071, specifically incorporated herein by reference). All that is important in an enzymatic nucleic acid molecule of this invention is that it has a specific substrate binding site which is complementary to one or more of the target gene RNA regions, and that it have nucleotide sequences within or surrounding that substrate binding site which impart an RNA cleaving activity to the molecule. Thus the ribozyme constructs need not be limited to specific motifs mentioned herein.

[0466] In certain embodiments, it may be important to produce enzymatic cleaving agents which exhibit a high degree of specificity for the RNA of a desired target, such as one of the sequences disclosed herein. The enzymatic nucleic acid molecule is preferably targeted to a highly conserved sequence region of a target mRNA. Such enzymatic nucleic acid molecules can be delivered exogenously to specific cells as required. Alternatively, the ribozymes can be expressed from DNA or RNA vectors that are delivered to specific cells.

[0467] Small enzymatic nucleic acid motifs (e.g., of the hammerhead or the hairpin structure) may also be used for exogenous delivery. The simple structure of these molecules increases the ability of the enzymatic nucleic acid to invade targeted regions of the mRNA structure. Alternatively, catalytic RNA molecules can be expressed within cells from eukaryotic promoters (e.g., Scanlon et al., 1991; Kashani-Sabet et al., 1992; Dropulic et al., 1992; Weerasinghe et al, 1991; Ojwang et al, 1992; Chen et al., 1992; Sarver et al., 1990). Those skilled in the art realize that any ribozyme can be expressed in eukaryotic cells from the appropriate DNA vector. The activity of such ribozymes can be augmented by their release from the primary transcript by a second ribozyme (Int. Pat. Appl. Publ. No. WO 93/23569, and Int. Pat. Appl. Publ. No. WO 94/02595, both hereby incorporated by reference; Ohkawa et al., 1992; Taira et al., 1991; and Ventura et al., 1993).

[0468] Ribozymes may be added directly, or can be complexed with cationic lipids, lipid complexes, packaged within liposomes, or otherwise delivered to target cells. The RNA or RNA complexes can be locally administered to relevant tissues ex vivo, or in vivo through injection, aerosol inhalation, infusion pump or stent, with or without their incorporation in biopolymers.

[0469] Ribozymes may be designed as described in Int. Pat. Appl. Publ. No. WO 93/23569 and Int. Pat. Appl. Publ. No. WO 94/02595, each specifically incorporated herein by reference) and synthesized to be tested in vitro and in vivo, as described. Such ribozymes can also be optimized for delivery. While specific examples are provided, those in the art will recognize that equivalent RNA targets in other species can be utilized when necessary.

[0470] Hammerhead or hairpin ribozymes may be individually analyzed by computer folding (Jaeger et al., 1989) to assess whether the ribozyme sequences fold into the appropriate secondary structure. Those ribozymes with unfavorable intramolecular interactions between the binding arms and the catalytic core are eliminated from consideration. Varying binding arm lengths can be chosen to optimize activity. Generally, at least 5 or so bases on each arm are able to bind to, or otherwise interact with, the target RNA.

[0471] Ribozymes of the hammerhead or hairpin motif may be designed to anneal to various sites in the mRNA message, and can be chemically synthesized. The method of synthesis used follows the procedure for normal RNA synthesis as described in Usman et al (1987) and in Scaringe et al. (1990) and makes use of common nucleic acid protecting and coupling groups, such as dimethoxytrityl at the 5'-end, and phosphoramidites at the 3'-end. Average stepwise coupling yields are typically >98%. Hairpin ribozymes may be synthesized in two parts and annealed to reconstruct an active ribozyme (Chowrira and Burke, 1992). Ribozymes may be modified extensively to enhance stability by modification with nuclease resistant groups, for example, 2'-amino, 2'-C-allyl, 2'-flouro, 2'-o-methyl, 2'-H (for a review see e.g., Usman and Cedergren, 1992). Ribozymes may be purified by gel electrophoresis using general methods or by high pressure liquid chromatography and resuspended in water.

[0472] Ribozyme activity can be optimized by altering the length of the ribozyme binding arms, or chemically synthesizing ribozymes with modifications that prevent their degradation by serum ribonucleases (see e.g., Int. Pat. Appl. Publ. No. WO 92/07065; Perrault et al, 1990; Pieken et al, 1991; Usman and Cedergren, 1992; Int. Pat. Appl. Publ. No. WO 93/15187; Int. Pat. Appl. Publ. No. WO 91/03162; Eur. Pat. Appl. Publ. No. 92110298.4; U.S. Pat. No. 5,334,711; and Int. Pat. Appl. Publ. No. WO 94/13688, which describe various chemical modifications that can be made to the sugar moieties of enzymatic RNA molecules), modifications which enhance their efficacy in cells, and removal of stem II bases to shorten RNA synthesis times and reduce chemical requirements.

[0473] Sullivan et al. (Int. Pat. Appl. Publ. No. WO 94/02595) describes the general methods for delivery of enzymatic RNA molecules. Ribozymes may be administered to cells by a variety of methods known to those familiar to the art, including, but not restricted to, encapsulation in liposomes, by iontophoresis, or by incorporation into other vehicles, such as hydrogels, cyclodextrins, biodegradable nanocapsules, and bioadhesive microspheres. For some indications, ribozymes may be directly delivered ex vivo to cells or tissues with or without the aforementioned vehicles. Alternatively, the RNA/vehicle combination may be locally delivered by direct inhalation, by direct injection or by use of a catheter, infusion pump or stent. Other routes of delivery include, but are not limited to, intravascular, intramuscular, subcutaneous or joint injection, aerosol inhalation, oral (tablet or pill form), topical, systemic, ocular, intraperitoneal and/or intrathecal delivery. More detailed descriptions of ribozyme delivery and administration are provided in Int. Pat. Appl. Publ. No. WO 94/02595 and Int. Pat. Appl. Publ. No. WO 93/23569, each specifically incorporated herein by reference.

[0474] Another means of accumulating high concentrations of a ribozyme(s) within cells is to incorporate the ribozyme-encoding sequences into a DNA expression vector. Transcription of the ribozyme sequences are driven from a promoter for eukaryotic RNA polymerase I (pol I), RNA polymerase II (pol II), or RNA polymerase III (pol III). Transcripts from pol II or pol III promoters will be expressed at high levels in all cells; the levels of a given pol II promoter in a given cell type will depend on the nature of the gene regulatory sequences (enhancers, silencers, etc.) present nearby. Prokaryotic RNA polymerase promoters may also be used, providing that the prokaryotic RNA polymerase enzyme is expressed in the appropriate cells (Elroy-Stein and Moss, 1990; Gao and Huang, 1993; Lieber et al., 1993; Zhou et al., 1990). Ribozymes expressed from such promoters can function in mammalian cells (e.g. Kashani-Saber et al., 1992; Ojwang et al., 1992; Chen et al., 1992; Yu et al., 1993; L'Huillier et al., 1992; Lisziewicz et al., 1993). Such transcription units can be incorporated into a variety of vectors for introduction into mammalian cells, including but not restricted to, plasmid DNA vectors, viral DNA vectors (such as adenovirus or adeno-associated vectors), or viral RNA vectors (such as retroviral, semliki forest virus, sindbis virus vectors).

[0475] Ribozymes may be used as diagnostic tools to examine genetic drift and mutations within diseased cells. They can also be used to assess levels of the target RNA molecule. The close relationship between ribozyme activity and the structure of the target RNA allows the detection of mutations in any region of the molecule which alters the base-pairing and three-dimensional structure of the target RNA. By using multiple ribozymes, one may map nucleotide changes which are important to RNA structure and function in vitro, as well as in cells and tissues. Cleavage of target RNAs with ribozymes may be used to inhibit gene expression and define the role (essentially) of specified gene products in the progression of disease. In this manner, other genetic targets may be defined as important mediators of the disease. These studies will lead to better treatment of the disease progression by affording the possibility of combinational therapies (e.g., multiple ribozymes targeted to different genes, ribozymes coupled with known small molecule inhibitors, or intermittent treatment with combinations of ribozymes and/or other chemical or biological molecules). Other in vitro uses of ribozymes are well known in the art, and include detection of the presence of mRNA associated with an IL-5 related condition. Such RNA is detected by determining the presence of a cleavage product after treatment with a ribozyme using standard methodology.

[0476] 4.18 Peptide Nucleic Acids

[0477] In certain embodiments, the inventors contemplate the use of peptide nucleic acids (PNAs) in the practice of the methods of the invention. PNA is a DNA mimic in which the nucleobases are attached to a pseudopeptide backbone (Good and Nielsen, 1997). PNA is able to be utilized in a number methods that traditionally have used RNA or DNA. Often PNA sequences perform better in techniques than the corresponding RNA or DNA sequences and have utilities that are not inherent to RNA or DNA. A review of PNA including methods of making, characteristics of, and methods of using, is provided by Corey (1997) and is incorporated herein by reference. As such, in certain embodiments, one may prepare PNA sequences that are complementary to one or more portions of the ACE mRNA sequence, and such PNA compositions may be used to regulate, alter, decrease, or reduce the translation of ACE-specific mRNA, and thereby alter the level of ACE activity in a host cell to which such PNA compositions have been administered.

[0478] PNAs have 2-aminoethyl-glycine linkages replacing the normal phosphodiester backbone of DNA (Nielsen et al., 1991; Hanvey et al., 1992; Hyrup and Nielsen, 1996; Neilsen, 1996). This chemistry has three important consequences: firstly, in contrast to DNA or phosphorothioate oligonucleotides, PNAs are neutral molecules; secondly, PNAs are achiral, which avoids the need to develop a stereoselective synthesis; and thirdly, PNA synthesis uses standard Boc (Dueholm et al., 1994) or Fmoc (Thomson et al., 1995) protocols for solid-phase peptide synthesis, although other methods, including a modified Merrifield method, have been used (Christensen et al., 1995).

[0479] PNA monomers or ready-made oligomers are commercially available from PerSeptive Biosystems (Framingham, Mass.). PNA syntheses by either Boc or Fmoc protocols are straightforward using manual or automated protocols (Norton et al., 1995). The manual protocol lends itself to the production of chemically modified PNAs or the simultaneous synthesis of families of closely related PNAs.

[0480] As with peptide synthesis, the success of a particular PNA synthesis will depend on the properties of the chosen sequence. For example, while in theory PNAs can incorporate any combination of nucleotide bases, the presence of adjacent purines can lead to deletions of one or more residues in the product. In expectation of this difficulty, it is suggested that, in producing PNAs with adjacent purines, one should repeat the coupling of residues likely to be added inefficiently. This should be followed by the purification of PNAs by reverse-phase high-pressure liquid chromatography (Norton et al., 1995) providing yields and purity of product similar to those observed during the synthesis of peptides.

[0481] Modifications of PNAs for a given application may be accomplished by coupling amino acids during solid-phase synthesis or by attaching compounds that contain a carboxylic acid group to the exposed N-terminal amine. Alternatively, PNAs can be modified after synthesis by coupling to an introduced lysine or cysteine. The ease with which PNAs can be modified facilitates optimization for better solubility or for specific functional requirements. Once synthesized, the identity of PNAs and their derivatives can be confirmed by mass spectrometry. Several studies have made and utilized modifications of PNAs (Norton et al., 1995; Haaima et al., 1996; Stetsenko et al., 1996; Petersen et al., 1995; Ulmann et al., 1996; Koch et al., 1995; Orum et al., 1995; Footer et al., 1996; Griffith et al., 1995; Kremsky et al., 1996; Pardridge et al., 1995; Boffa et al, 1995; Landsdorp et al., 1996; Gambacorti-Passerini et al., 1996; Armitage et al., 1997; Seeger et al., 1997; Ruskowski et al., 1997). U.S. Pat. No. 5,700,922 discusses PNA-DNA-PNA chimeric molecules and their uses in diagnostics, modulating protein in organisms, and treatment of conditions susceptible to therapeutics.

[0482] In contrast to DNA and RNA, which contain negatively charged linkages, the PNA backbone is neutral. In spite of this dramatic alteration, PNAs recognize complementary DNA and RNA by Watson-Crick pairing (Egholm et al., 1993), validating the initial modeling by Nielsen et al. (1991). PNAs lack 3' to 5' polarity and can bind in either parallel or anti-parallel fashion, with the anti-parallel mode being preferred (Egholm et al., 1993).

[0483] Hybridization of DNA oligonucleotides to DNA and RNA is destabilized by electrostatic repulsion between the negatively charged phosphate backbones of the complementary strands. By contrast, the absence of charge repulsion in PNA-DNA or PNA-RNA duplexes increases the melting temperature (T.sub.m) and reduces the dependence of T.sub.m on the concentration of mono- or divalent cations (Nielsen et al., 1991). The enhanced rate and affinity of hybridization are significant because they are responsible for the surprising ability of PNAs to perform strand invasion of complementary sequences within relaxed double-stranded DNA. In addition, the efficient hybridization at inverted repeats suggests that PNAs can recognize secondary structure effectively within double-stranded DNA. Enhanced recognition also occurs with PNAs immobilized on surfaces, and Wang et al. have shown that support-bound PNAs can be used to detect hybridization events (Wang et al., 1996).

[0484] One might expect that tight binding of PNAs to complementary sequences would also increase binding to similar (but not identical) sequences, reducing the sequence specificity of PNA recognition. As with DNA hybridization, however, selective recognition can be achieved by balancing oligomer length and incubation temperature. Moreover, selective hybridization of PNAs is encouraged by PNA-DNA hybridization being less tolerant of base mismatches than DNA-DNA hybridization. For example, a single mismatch within a 16 bp PNA-DNA duplex can reduce the T.sub.m by up to 15.degree. C. (Egholm et al., 1993). This high level of discrimination has allowed the development of several PNA-based strategies for the analysis of point mutations (Wang et al., 1996; Carlsson et al., 1996; Thiede et al., 1996; Webb and Hurskainen, 1996; Perry-O'Keefe et al., 1996).

[0485] High-affinity binding provides clear advantages for molecular recognition and the development of new applications for PNAs. For example, 11-13 nucleotide PNAs inhibit the activity of telomerase, a ribonucleo-protein that extends telomere ends using an essential RNA template, while the analogous DNA oligomers do not (Norton et al., 1996).

[0486] Neutral PNAs are more hydrophobic than analogous DNA oligomers, and this can lead to difficulty solubilizing them at neutral pH, especially if the PNAs have a high purine content or if they have the potential to form secondary structures. Their solubility can be enhanced by attaching one or more positive charges to the PNA termini (Nielsen et al., 1991).

[0487] Findings by Allfrey and colleagues suggest that strand invasion will occur spontaneously at sequences within chromosomal DNA (Boffa et al., 1995; Boffa et al., 1996). These studies targeted PNAs to triplet repeats of the nucleotides CAG and used this recognition to purify transcriptionally active DNA (Boffa et al., 1995) and to inhibit transcription (Boffa et al., 1996). This result suggests that if PNAs can be delivered within cells then they will have the potential to be general sequence-specific regulators of gene expression. Studies and reviews concerning the use of PNAs as antisense and anti-gene agents include Nielsen et al. (1993b), Hanvey et al. (1992), and Good and Nielsen (1997). Koppelhus et al. (1997) have used PNAs to inhibit HIV-1 inverse transcription, showing that PNAs may be used for antiviral therapies.

[0488] Methods of characterizing the antisense binding properties of PNAs are discussed in Rose (1993) and Jensen et al (1997). Rose uses capillary gel electrophoresis to determine binding of PNAs to their complementary oligonucleotide, measuring the relative binding kinetics and stoichiometry. Similar types of measurements were made by Jensen et al. using BIAcore.TM. technology.

[0489] Other applications of PNAs include use in DNA strand invasion (Nielsen et al., 1991), antisense inhibition (Hanvey et al., 1992), mutational analysis (Orum et al., 1993), enhancers of transcription (Mollegaard et al., 1994), nucleic acid purification (Orum et al., 1995), isolation of transcriptionally active genes (Boffa et al., 1995), blocking of transcription factor binding (Vickers et al., 1995), genome cleavage (Veselkov et al., 1996), biosensors (Wang et al., 1996), in situ hybridization (Thisted et al., 1996), and in a alternative to Southern blotting (Perry-O'Keefe, 1996).

[0490] 4.19 Polypeptide, Peptides and Peptide Variants

[0491] The present invention, in other aspects, provides polypeptide compositions. Generally, a polypeptide of the invention will be an isolated polypeptide (or an epitope, variant, or active fragment thereof) derived from a mammalian species. Preferably, the polypeptide is encoded by a polynucleotide sequence disclosed herein or a sequence which hybridizes under moderately stringent conditions to a polynucleotide sequence disclosed herein. Alternatively, the polypeptide may be defined as a polypeptide which comprises a contiguous amino acid sequence from an amino acid sequence disclosed herein, or which polypeptide comprises an entire amino acid sequence disclosed herein.

[0492] In the present invention, a polypeptide composition is also understood to comprise one or more polypeptides that are immunologically reactive with antibodies generated against a polypeptide of the invention, particularly a polypeptide having the amino acid sequence encoded by the disclosed polynucleotides, or to active fragments, or to variants or biological functional equivalents thereof.

[0493] Likewise, a polypeptide composition of the present invention is understood to comprise one or more polypeptides that are capable of eliciting antibodies that are immunologically reactive with one or more polypeptides encoded by one or more contiguous nucleic acid sequences disclosed in this application, or to active fragments, or to variants thereof, or to one or more nucleic acid sequences which hybridize to one or more of these sequences under conditions of moderate to high stringency.

[0494] As used herein, an active fragment of a polypeptide includes a whole or a portion of a polypeptide which is modified by conventional techniques, e.g., mutagenesis, or by addition, deletion, or substitution, but which active fragment exhibits substantially the same structure function, antigenicity, etc., as a polypeptide as described herein.

[0495] In certain illustrative embodiments, the polypeptides of the invention will comprise at least an immunogenic portion of a hematological malignancy-related tumor protein or a variant thereof, as described herein. As noted above, a "hematological malignancy-related tumor protein" is a protein that is expressed by hematological malignancy-related tumor cells. Proteins that are hematological malignancy-related tumor proteins also react detectably within an immunoassay (such as an ELISA) with antisera from a patient with hematological malignancy. Polypeptides as described herein may be of any length. Additional sequences derived from the native protein and/or heterologous sequences may be present, and such sequences may (but need not) possess further immunogenic or antigenic properties.

[0496] An "immunogenic portion," as used herein is a portion of a protein that is recognized (i.e., specifically bound) by a B-cell and/or T-cell surface antigen receptor. Such immunogenic portions generally comprise at least 5 amino acid residues, more preferably at least 10, and still more preferably at least 20 amino acid residues of a hematological malignancy-related tumor protein or a variant thereof. Certain preferred immunogenic portions include peptides in which an N-terminal leader sequence and/or transmembrane domain have been deleted. Other preferred immunogenic portions may contain a small N- and/or C-terminal deletion (e.g., 1-30 amino acids, preferably 5-15 amino acids), relative to the mature protein.

[0497] Immunogenic portions may generally be identified using well known techniques, such as those summarized in Paul, Fundamental Immunology, 3rd ed., 243-247 (Raven Press, 1993) and references cited therein. Such techniques include screening polypeptides for the ability to react with antigen-specific antibodies, antisera and/or T-cell lines or clones. As used herein, antisera and antibodies are "antigen-specific" if they specifically bind to an antigen (i.e., they react with the protein in an ELISA or other immunoassay, and do not react detectably with unrelated proteins). Such antisera and antibodies may be prepared as described herein, and using well known techniques. An immunogenic portion of a native hematological malignancy-related tumor protein is a portion that reacts with such antisera and/or T-cells at a level that is not substantially less than the reactivity of the full length polypeptide (e.g., in an ELISA and/or T-cell reactivity assay). Such immunogenic portions may react within such assays at a level that is similar to or greater than the reactivity of the full length polypeptide. Such screens may generally be performed using methods well known to those of ordinary skill in the art, such as those described in Harlow and Lane, Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory, 1988. For example, a polypeptide may be immobilized on a solid support and contacted with patient sera to allow binding of antibodies within the sera to the immobilized polypeptide. Unbound sera may then be removed and bound antibodies detected using, for example, .sup.125I-labeled Protein A.

[0498] As noted above, a composition may comprise a variant of a native hematological malignancy-related tumor protein. A polypeptide "variant," as used herein, is a polypeptide that differs from a native hematological malignancy-related tumor protein in one or more substitutions, deletions, additions and/or insertions, such that the immunogenicity of the polypeptide is not substantially diminished. In other words, the ability of a variant to react with antigen-specific antisera may be enhanced or unchanged, relative to the native protein, or may be diminished by less than 50%, and preferably less than 20%, relative to the native protein. Such variants may generally be identified by modifying one of the above polypeptide sequences and evaluating the reactivity of the modified polypeptide with antigen-specific antibodies or antisera as described herein. Preferred variants include those in which one or more portions, such as an N-terminal leader sequence or transmembrane domain, have been removed. Other preferred variants include variants in which a small portion (e.g., 1-30 amino acids, preferably 5-15 amino acids) has been removed from the N- and/or C-terminal of the mature protein.

[0499] Polypeptide variants encompassed by the present invention include those exhibiting at least about 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% or more identity (determined as described above) to the polypeptides disclosed herein.

[0500] Preferably, a variant contains conservative substitutions. A "conservative substitution" is one in which an amino acid is substituted for another amino acid that has similar properties, such that one skilled in the art of peptide chemistry would expect the secondary structure and hydropathic nature of the polypeptide to be substantially unchanged. Amino acid substitutions may generally be made on the basis of similarity in polarity, charge, solubility, hydrophobicity, hydrophilicity and/or the amphipathic nature of the residues. For example, negatively charged amino acids include aspartic acid and glutamic acid; positively charged amino acids include lysine and arginine; and amino acids with uncharged polar head groups having similar hydrophilicity values include leucine, isoleucine and valine; glycine and alanine; asparagine and glutamine; and serine, threonine, phenylalanine and tyrosine. Other groups of amino acids that may represent conservative changes include: (1) ala, pro, gly, glu, asp, gln, asn, ser, thr, (2) cys, ser, tyr, thr, (3) val, ile, leu, met, ala, phe; (4) lys, arg, his; and (5) phe, tyr, trp, his. A variant may also, or alternatively, contain nonconservative changes. In a preferred embodiment, variant polypeptides differ from a native sequence by substitution, deletion or addition of five amino acids or fewer. Variants may also (or alternatively) be modified by, for example, the deletion or addition of amino acids that have minimal influence on the immunogenicity, secondary structure and hydropathic nature of the polypeptide.

[0501] As noted above, polypeptides may comprise a signal (or leader) sequence at the N-terminal end of the protein, which co-translationally or post-translationally directs transfer of the protein. The polypeptide may also be conjugated to a linker or other sequence for ease of synthesis, purification or identification of the polypeptide (e.g., poly-His), or to enhance binding of the polypeptide to a solid support. For example, a polypeptide may be conjugated to an immunoglobulin Fc region.

[0502] Polypeptides may be prepared using any of a variety of well known techniques. Recombinant polypeptides encoded by DNA sequences as described above may be readily prepared from the DNA sequences using any of a variety of expression vectors known to those of ordinary skill in the art. Expression may be achieved in any appropriate host cell that has been transformed or transfected with an expression vector containing a DNA molecule that encodes a recombinant polypeptide. Suitable host cells include prokaryotes, yeast, and higher eukaryotic cells, such as mammalian cells and plant cells. Preferably, the host cells employed are E. coli, yeast or a mammalian cell line such as COS or CHO. Supernatants from suitable host/vector systems which secrete recombinant protein or polypeptide into culture media may be first concentrated using a commercially available filter. Following concentration, the concentrate may be applied to a suitable purification matrix such as an affinity matrix or an ion exchange resin. Finally, one or more reverse phase HPLC steps can be employed to further purify a recombinant polypeptide.

[0503] Portions and other variants having less than about 100 amino acids, and generally less than about 50 amino acids, may also be generated by synthetic means, using techniques well known to those of ordinary skill in the art. For example, such polypeptides may be synthesized using any of the commercially available solid-phase techniques, such as the Merrifield solid-phase synthesis method, where amino acids are sequentially added to a growing amino acid chain. See Merrifield, J. Am. Chem. Soc. 85:2149-2146, 1963. Equipment for automated synthesis of polypeptides is commercially available from suppliers such as Perkin Elmer/Applied BioSystems Division (Foster City, Calif.), and may be operated according to the manufacturer's instructions.

[0504] Within certain specific embodiments, a polypeptide may be a fusion protein that comprises multiple polypeptides as described herein, or that comprises at least one polypeptide as described herein and an unrelated sequence, such as a known tumor protein. A fusion partner may, for example, assist in providing T helper epitopes (an immunological fusion partner), preferably T helper epitopes recognized by humans, or may assist in expressing the protein (an expression enhancer) at higher yields than the native recombinant protein. Certain preferred fusion partners are both immunological and expression enhancing fusion partners. Other fusion partners may be selected so as to increase the solubility of the protein or to enable the protein to be targeted to desired intracellular compartments. Still further fusion partners include affinity tags, which facilitate purification of the protein.

[0505] Fusion proteins may generally be prepared using standard techniques, including chemical conjugation. Preferably, a fusion protein is expressed as a recombinant protein, allowing the production of increased levels, relative to a non-fused protein, in an expression system. Briefly, DNA sequences encoding the polypeptide components may be assembled separately, and ligated into an appropriate expression vector. The 3' end of the DNA sequence encoding one polypeptide component is ligated, with or without a peptide linker, to the 5' end of a DNA sequence encoding the second polypeptide component so that the reading frames of the sequences are in phase. This permits translation into a single fusion protein that retains the biological activity of both component polypeptides.

[0506] A peptide linker sequence may be employed to separate the first and second polypeptide components by a distance sufficient to ensure that each polypeptide folds into its secondary and tertiary structures. Such a peptide linker sequence is incorporated into the fusion protein using standard techniques well known in the art. Suitable peptide linker sequences may be chosen based on the following factors: (1) their ability to adopt a flexible extended conformation; (2) their inability to adopt a secondary structure that could interact with functional epitopes on the first and second polypeptides; and (3) the lack of hydrophobic or charged residues that might react with the polypeptide functional epitopes. Preferred peptide linker sequences contain Gly, Asn and Ser residues. Other near neutral amino acids, such as Thr and Ala may also be used in the linker sequence. Amino acid sequences which may be usefully employed as linkers include those disclosed in Maratea et al., Gene 40:39-46, 1985; Murphy et al., Proc. Natl. Acad. Sci. USA 83:8258-8262, 1986; U.S. Pat. No. 4,935,233 and U.S. Pat. No. 4,751,180. The linker sequence may generally be from 1 to about 50 amino acids in length. Linker sequences are not required when the first and second polypeptides have non-essential N-terminal amino acid regions that can be used to separate the functional domains and prevent steric interference.

[0507] The ligated DNA sequences are operably linked to suitable transcriptional or translational regulatory elements. The regulatory elements responsible for expression of DNA are located only 5' to the DNA sequence encoding the first polypeptides. Similarly, stop codons required to end translation and transcription termination signals are only present 3' to the DNA sequence encoding the second polypeptide.

[0508] Fusion proteins are also provided. Such proteins comprise a polypeptide as described herein together with an unrelated immunogenic protein. Preferably the immunogenic protein is capable of eliciting a recall response. Examples of such proteins include tetanus, tuberculosis and hepatitis proteins (see, for example, Stoute et al. New Engl. J. Med., 336:86-91, 1997).

[0509] Within preferred embodiments, an immunological fusion partner is derived from protein D, a surface protein of the gram-negative bacterium Haemophilus influenza B (WO 91/18926). Preferably, a protein D derivative comprises approximately the first third of the protein (e.g., the first N-terminal 100-110 amino acids), and a protein D derivative may be lipidated. Within certain preferred embodiments, the first 109 residues of a Lipoprotein D fusion partner is included on the N-terminus to provide the polypeptide with additional exogenous T-cell epitopes and to increase the expression level in E. coli (thus functioning as an expression enhancer). The lipid tail ensures optimal presentation of the antigen to antigen presenting cells. Other fusion partners include the non-structural protein from influenzae virus, NS1 (hemagglutinin). Typically, the N-terminal 81 amino acids are used, although different fragments that include T-helper epitopes may be used.

[0510] In another embodiment, the immunological fusion partner is the protein known as LYTA, or a portion thereof (preferably a C-terminal portion). LYTA is derived from Streptococcus pneumoniae, which synthesizes an N-acetyl-L-alanine amidase known as amidase LYTA (encoded by the LytA gene; Gene 43:265-292, 1986). LYTA is an autolysin that specifically degrades certain bonds in the peptidoglycan backbone. The C-terminal domain of the LYTA protein is responsible for the affinity to the choline or to some choline analogues such as DEAE. This property has been exploited for the development of E. coli C-LYTA expressing plasmids useful for expression of fusion proteins. Purification of hybrid proteins containing the C-LYTA fragment at the amino terminus has been described (see Biotechnology 10:795-798, 1992). Within a preferred embodiment, a repeat portion of LYTA may be incorporated into a fusion protein. A repeat portion is found in the C-terminal region starting at residue 178. A particularly preferred repeat portion incorporates residues 188-305.

[0511] In general, polypeptides (including fusion proteins) and polynucleotides as described herein are isolated. An "isolated" polypeptide or polynucleotide is one that is removed from its original environment. For example, a naturally-occurring protein is isolated if it is separated from some or all of the coexisting materials in the natural system. Preferably, such polypeptides are at least about 90% pure, more preferably at least about 95% pure and most preferably at least about 99% pure. A polynucleotide is considered to be isolated if, for example, it is cloned into a vector that is not a part of the natural environment.

[0512] 4.20 Binding Agents

[0513] The present invention further employs agents, such as antibodies and antigen-binding fragments thereof, that specifically bind to a hematological malignancy-related antigen. As used herein, an antibody, or antigen-binding fragment thereof, is said to "specifically bind" to a hematological malignancy-related antigen if it reacts at a detectable level (within, for example, an ELISA) with, and does not react detectably with unrelated proteins under similar conditions. As used herein, "binding" refers to a noncovalent association between two separate molecules such that a complex is formed. The ability to bind may be evaluated by, for example, determining a binding constant for the formation of the complex. The binding constant is the value obtained when the concentration of the complex is divided by the product of the component concentrations. In general, two compounds are said to "bind," in the context of the present invention, when the binding constant for complex formation exceeds about 10.sup.3 L/mol. The binding constant maybe determined using methods well known in the art.

[0514] Binding agents may be further capable of differentiating between patients with and without a hematological malignancy. Such binding agents generate a signal indicating the presence of a hematological malignancy in at least about 20% of patients with the disease, and will generate a negative signal indicating the absence of the disease in at least about 90% of individuals without the disease. To determine whether a binding agent satisfies this requirement, biological samples (e.g., blood, sera, urine and/or tumor biopsies) from patients with and without a hematological malignancy (as determined using standard clinical tests) may be assayed as described herein for the presence of polypeptides that bind to the binding agent. It will be apparent that a statistically significant number of samples with and without the disease should be assayed. Each binding agent should satisfy the above criteria; however, those of ordinary skill in the art will recognize that binding agents may be used in combination to improve sensitivity.

[0515] Any agent that satisfies the above requirements may be a binding agent. For example, a binding agent may be a ribosome, with or without a peptide component, an RNA molecule or a polypeptide. In a preferred embodiment, a binding agent is an antibody or an antigen-binding fragment thereof. Antibodies may be prepared by any of a variety of techniques known to those of ordinary skill in the art. See, e.g., Harlow and Lane, Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory, 1988. In general, antibodies can be produced by cell culture techniques, including the generation of monoclonal antibodies as described herein, or via transfection of antibody genes into suitable bacterial or mammalian cell hosts, in order to allow for the production of recombinant antibodies. In one technique, an immunogen comprising the polypeptide is initially injected into any of a wide variety of mammals (e.g., mice, rats, rabbits, sheep or goats). In this step, the polypeptides of this invention may serve as the immunogen without modification. Alternatively, particularly for relatively short polypeptides, a superior immune response may be elicited if the polypeptide is joined to a carrier protein, such as bovine serum albumin or keyhole limpet hemocyanin. The immunogen is injected into the animal host, preferably according to a predetermined schedule incorporating one or more booster immunizations, and the animals are bled periodically. Polyclonal antibodies specific for the polypeptide may then be purified from such antisera by, for example, affinity chromatography using the polypeptide coupled to a suitable solid support.

[0516] Monoclonal antibodies specific for an antigenic polypeptide of interest may be prepared, for example, using the technique of Kohler and Milstein, Eur. J. Immunol. 6:511-519, 1976, and improvements thereto. Briefly, these methods involve the preparation of immortal cell lines capable of producing antibodies having the desired specificity (i.e., reactivity with the polypeptide of interest). Such cell lines may be produced, for example, from spleen cells obtained from an animal immunized as described above. The spleen cells are then immortalized by, for example, fusion with a myeloma cell fusion partner, preferably one that is syngeneic with the immunized animal. A variety of fusion techniques may be employed. For example, the spleen cells and myeloma cells may be combined with a nonionic detergent for a few minutes and then plated at low density on a selective medium that supports the growth of hybrid cells, but not myeloma cells. A preferred selection technique uses HAT (hypoxanthine, aminopterin, thymidine) selection. After a sufficient time, usually about 1 to 2 weeks, colonies of hybrids are observed. Single colonies are selected and their culture supernatants tested for binding activity against the polypeptide. Hybridomas having high reactivity and specificity are preferred.

[0517] Monoclonal antibodies may be isolated from the supernatants of growing hybridoma colonies. In addition, various techniques may be employed to enhance the yield, such as injection of the hybridoma cell line into the peritoneal cavity of a suitable vertebrate host, such as a mouse. Monoclonal antibodies may then be harvested from the ascites fluid or the blood. Contaminants may be removed from the antibodies by conventional techniques, such as chromatography, gel filtration, precipitation, and extraction. The polypeptides of this invention may be used in the purification process in, for example, an affinity chromatography step.

[0518] Within certain embodiments, the use of antigen-binding fragments of antibodies may be preferred. Such fragments include Fab fragments, which may be prepared using standard techniques. Briefly, immunoglobulins may be purified from rabbit serum by affinity chromatography on Protein A bead columns (Harlow and Lane, Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory, 1988) and digested by papain to yield Fab and Fc fragments. The Fab and Fc fragments may be separated by affinity chromatography on protein A bead columns.

[0519] Monoclonal antibodies, and fragments thereof, of the present invention may be coupled to one or more therapeutic agents, such as radionuclides, differentiation inducers, drugs, toxins, and derivatives thereof. Preferred radionuclides include .sup.90Y, .sup.123I, .sup.125I, .sup.131I, .sup.186Re, .sup.188Re, .sup.211At, and .sup.212Bi. Preferred drugs include methotrexate, and pyrimidine and purine analogs. Preferred differentiation inducers include phorbol esters and butyric acid. Preferred toxins include ricin, abrin, diptheria toxin, cholera toxin, gelonin, Pseudomonas exotoxin, Shigella toxin, and pokeweed antiviral protein. For certain in vivo and ex vivo therapies, an antibody or fragment thereof is preferably coupled to a cytotoxic agent, such as a radioactive or chemotherapeutic moiety.

[0520] A therapeutic agent may be coupled (e.g., covalently bonded) to a suitable monoclonal antibody either directly or indirectly (e.g., via a linker group). A direct reaction between an agent and an antibody is possible when each possesses a substituent capable of reacting with the other. For example, a nucleophilic group, such as an amino or sulfhydryl group, on one may be capable of reacting with a carbonyl-containing group, such as an anhydride or an acid halide, or with an alkyl group containing a good leaving group (e.g., a halide) on the other.

[0521] Alternatively, it may be desirable to couple a therapeutic agent and an antibody via a linker group. A linker group can function as a spacer to distance an antibody from an agent in order to avoid interference with binding capabilities. A linker group can also serve to increase the chemical reactivity of a substituent on an agent or an antibody, and thus increase the coupling efficiency. An increase in chemical reactivity may also facilitate the use of agents, or functional groups on agents, which otherwise would not be possible.

[0522] It will be evident to those skilled in the art that a variety of bifunctional or polyfunctional reagents, both homo- and hetero-functional (such as those described in the catalog of the Pierce Chemical Co., Rockford, Ill.), may be employed as the linker group. Coupling may be effected, for example, through amino groups, carboxyl groups, sulfhydryl groups or oxidized carbohydrate residues. There are numerous references describing such methodology, e.g., U.S. Pat. No. 4,671,958.

[0523] Where a therapeutic agent is more potent when free from the antibody portion of the immunoconjugates of the present invention, it may be desirable to use a linker group which is cleavable during or upon internalization into a cell. A number of different cleavable linker groups have been described. The mechanisms for the intracellular release of an agent from these linker groups include cleavage by reduction of a disulfide bond (e.g., U.S. Pat. No. 4,489,710), by irradiation of a photolabile bond (e.g., U.S. Pat. No. 4,625,014), by hydrolysis of derivatized amino acid side chains (e.g., U.S. Pat. No. 4,638,045), by serum complement-mediated hydrolysis (e.g., U.S. Pat. No. 4,671,958), and acid-catalyzed hydrolysis (e.g., U.S. Pat. No. 4,569,789).

[0524] It may be desirable to couple more than one agent to an antibody. In one embodiment, multiple molecules of an agent are coupled to one antibody molecule. In another embodiment, more than one type of agent may be coupled to one antibody. Regardless of the particular embodiment, immunoconjugates with more than one agent may be prepared in a variety of ways. For example, more than one agent may be coupled directly to an antibody molecule, or linkers which provide multiple sites for attachment can be used. Alternatively, a carrier can be used.

[0525] A carrier may bear the agents in a variety of ways, including covalent bonding either directly or via a linker group. Suitable carriers include proteins such as albumins (e.g., U.S. Pat. No. 4,507,234), peptides and polysaccharides such as aminodextran (e.g., U.S. Pat. No. 4,699,784). A carrier may also bear an agent by noncovalent bonding or by encapsulation, such as within a liposome vesicle (e.g., U.S. Pat. Nos. 4,429,008 and 4,873,088). Carriers specific for radionuclide agents include radiohalogenated small molecules and chelating compounds. For example, U.S. Pat. No. 4,735,792 discloses representative radiohalogenated small molecules and their synthesis. A radionuclide chelate may be formed from chelating compounds that include those containing nitrogen and sulfur atoms as the donor atoms for binding the metal, or metal oxide, radionuclide. For example, U.S. Pat. No. 4,673,562 discloses representative chelating compounds and their synthesis.

[0526] A variety of routes of administration for the antibodies and immunoconjugates may be used. Typically, administration will be intravenous, intramuscular, subcutaneous or in the bed of a resected tumor. It will be evident that the precise dose of the antibody/immunoconjugate will vary depending upon the antibody used, the antigen density on the tumor, and the rate of clearance of the antibody.

[0527] 4.21 Vaccines

[0528] In certain preferred embodiments of the present invention, vaccines are provided. The vaccines will generally comprise one or more pharmaceutical compositions, such as those discussed above, in combination with an immunostimulant. An immunostimulant may be any substance that enhances or potentiates an immune response (antibody and/or cell-mediated) to an exogenous antigen. Examples of immunostimulants include adjuvants, biodegradable microspheres (e.g., polylactic galactide) and liposomes (into which the compound is incorporated; see e.g., Fullerton, U.S. Pat. No. 4,235,877). Vaccine preparation is generally described in, for example, M. F. Powell and M. J. Newman, eds., "Vaccine Design (the subunit and adjuvant approach)," Plenum Press (NY, 1995). Pharmaceutical compositions and vaccines within the scope of the present invention may also contain other compounds, which may be biologically active or inactive. For example, one or more immunogenic portions of other tumor antigens may be present, either incorporated into a fusion polypeptide or as a separate compound, within the composition or vaccine.

[0529] Illustrative vaccines may contain DNA encoding one or more of the polypeptides as described above, such that the polypeptide is generated in situ. As noted above, the DNA may be present within any of a variety of delivery systems known to those of ordinary skill in the art, including nucleic acid expression systems, bacteria and viral expression systems. Numerous gene delivery techniques are well known in the art, such as those described by Rolland, Crit. Rev. Therap. Drug Carrier Systems 15:143-198, 1998, and references cited therein. Appropriate nucleic acid expression systems contain the necessary DNA sequences for expression in the patient (such as a suitable promoter and terminating signal). Bacterial delivery systems involve the administration of a bacterium (such as Bacillus-Calmette-Guerrin) that expresses an immunogenic portion of the polypeptide on its cell surface or secretes such an epitope. In a preferred embodiment, the DNA may be introduced using a viral expression system (e.g., vaccinia or other pox virus, retrovirus, or adenovirus), which may involve the use of a non-pathogenic (defective), replication competent virus. Suitable systems are disclosed, for example, in Fisher-Hoch et al., Proc. Natl. Acad. Sci. USA 86:317-321, 1989; Flexner et al., Ann. N.Y. Acad. Sci. 569:86-103, 1989; Flexner et al., Vaccine 8:17-21, 1990; U.S. Pat. Nos. 4,603,112, 4,769,330, and 5,017,487; WO 89/01973; U.S. Pat. No. 4,777,127; GB 2,200,651; EP 0,345,242; WO 91/02805; Berkner, Biotechniques 6:616-627, 1988; Rosenfeld et al., Science 252:431-434, 1991; Kolls et al., Proc. Natl. Acad. Sci. USA 91:215-219, 1994; Kass-Eisler et al., Proc. Natl. Acad Sci. USA 90:11498-11502, 1993; Guzman et al., Circulation 88:2838-2848, 1993; and Guzman et al., Cir. Res. 73:1202-1207, 1993. Techniques for incorporating DNA into such expression systems are well known to those of ordinary skill in the art. The DNA may also be "naked," as described, for example, in Ulmer et al., Science 259:1745-1749, 1993 and reviewed by Cohen, Science 259:1691-1692, 1993. The uptake of naked DNA may be increased by coating the DNA onto biodegradable beads, which are efficiently transported into the cells. It will be apparent that a vaccine may comprise both a polynucleotide and a polypeptide component. Such vaccines may provide for an enhanced immune response.

[0530] It will be apparent that a vaccine may contain pharmaceutically acceptable salts of the polynucleotides and polypeptides provided herein. Such salts may be prepared from pharmaceutically acceptable non-toxic bases, including organic bases (e.g., salts of primary, secondary and tertiary amines and basic amino acids) and inorganic bases (e.g., sodium, potassium, lithium, ammonium, calcium and magnesium salts).

[0531] While any suitable carrier known to those of ordinary skill in the art may be employed in the vaccine compositions of this invention, the type of carrier will vary depending on the mode of administration. Compositions of the present invention may be formulated for any appropriate manner of administration, including for example, topical, oral, nasal, intravenous, intracranial, intraperitoneal, subcutaneous or intramuscular administration. For parenteral administration, such as subcutaneous injection, the carrier preferably comprises water, saline, alcohol, a fat, a wax or a buffer. For oral administration, any of the above carriers or a solid carrier, such as mannitol, lactose, starch, magnesium stearate, sodium saccharine, talcum, cellulose, glucose, sucrose, and magnesium carbonate, may be employed. Biodegradable microspheres (e.g., polylactate polyglycolate) may also be employed as carriers for the pharmaceutical compositions of this invention. Suitable biodegradable microspheres are disclosed, for example, in U.S. Pat. Nos. 4,897,268; 5,075,109; 5,928,647; 5,811,128; 5,820,883; 5,853,763; 5,814,344 and 5,942,252. One may also employ a carrier comprising the particulate-protein complexes described in U.S. Pat. No. 5,928,647, which are capable of inducing a class I-restricted cytotoxic T lymphocyte responses in a host.

[0532] Such compositions may also comprise buffers (e.g., neutral buffered saline or phosphate buffered saline), carbohydrates (e.g., glucose, mannose, sucrose or dextrans), mannitol, proteins, polypeptides or amino acids such as glycine, antioxidants, bacteriostats, chelating agents such as EDTA or glutathione, adjuvants (e.g., aluminum hydroxide), solutes that render the formulation isotonic, hypotonic or weakly hypertonic with the blood of a recipient, suspending agents, thickening agents and/or preservatives. Alternatively, compositions of the present invention may be formulated as a lyophilizate. Compounds may also be encapsulated within liposomes using well known technology.

[0533] Any of a variety of immunostimulants may be employed in the vaccines of this invention. For example, an adjuvant may be included. Most adjuvants contain a substance designed to protect the antigen from rapid catabolism, such as aluminum hydroxide or mineral oil, and a stimulator of immune responses, such as lipid A, Bortadella pertussis or Mycobacterium tuberculosis derived proteins. Suitable adjuvants are commercially available as, for example, Freund's Incomplete Adjuvant and Complete Adjuvant (Difco Laboratories, Detroit, Mich.); Merck Adjuvant 65 (Merck and Company, Inc., Rahway, N.J.); AS-2 (SmithKline Beecham, Philadelphia, Pa.); aluminum salts such as aluminum hydroxide gel (alum) or aluminum phosphate; salts of calcium, iron or zinc; an insoluble suspension of acylated tyrosine; acylated sugars; cationically or anionically derivatized polysaccharides; polyphosphazenes; biodegradable microspheres; monophosphoryl lipid A and quil A. Cytokines, such as GM-CSF or interleukin-2, -7, or -12, may also be used as adjuvants.

[0534] Within the vaccines provided herein, the adjuvant composition is preferably designed to induce an immune response predominantly of the Th1 type. High levels of Th1-type cytokines (e.g., IFN-.gamma., TNF.alpha., IL-2 and IL-12) tend to favor the induction of cell mediated immune responses to an administered antigen. In contrast, high levels of Th2-type cytokines (e.g., IL-4, IL-5, IL-6 and IL-10) tend to favor the induction of humoral immune responses. Following application of a vaccine as provided herein, a patient will support an immune response that includes Th1- and Th2-type responses. Within a preferred embodiment, in which a response is predominantly Th1-type, the level of Th1-type cytokines will increase to a greater extent than the level of Th2-type cytokines. The levels of these cytokines may be readily assessed using standard assays. For a review of the families of cytokines, see Mosmann and Coffman, Ann. Rev. Immunol. 7:145-173, 1989.

[0535] Preferred adjuvants for use in eliciting a predominantly Th1-type response include, for example, a combination of monophosphoryl lipid A, preferably 3-de-O-acylated monophosphoryl lipid A (3D-MPL), together with an aluminum salt. MPL adjuvants are available from Corixa Corporation (Seattle, Wash.; see U.S. Pat. Nos. 4,436,727; 4,877,611; 4,866,034 and 4,912,094). CpG-containing oligonucleotides (in which the CpG dinucleotide is unmethylated) also induce a predominantly Th1 response. Such oligonucleotides are well known and are described, for example, in WO 96/02555, WO 99/33488 and U.S. Pat. Nos. 6,008,200 and 5,856,462. Immunostimulatory DNA sequences are also described, for example, by Sato et al., Science 273:352, 1996. Another preferred adjuvant is a saponin, preferably QS21 (Aquila Biopharmaceuticals Inc., Framingham, Mass.), which may be used alone or in combination with other adjuvants. For example, an enhanced system involves the combination of a monophosphoryl lipid A and saponin derivative, such as the combination of QS21 and 3D-MPL as described in WO 94/00153, or a less reactogenic composition where the QS21 is quenched with cholesterol, as described in WO 96/33739. Other preferred formulations comprise an oil-in-water emulsion and tocopherol. A particularly potent adjuvant formulation involving QS21, 3D-MPL and tocopherol in an oil-in-water emulsion is described in WO 95/17210.

[0536] Other preferred adjuvants include Montanide ISA 720 (Seppic, France), SAF (Chiron, California, United States), ISCOMS (CSL), MF-59 (Chiron), the SBAS series of adjuvants (e.g., SBAS-2 or SBAS-4, available from SmithKline Beecham, Rixensart, Belgium), Detox (Corixa, Hamilton, Mont.), RC-529 (Corixa, Hamilton, Mont.) and other aminoalkyl glucosaminide 4-phosphates (AGPs), such as those described in pending U.S. patent application Ser. Nos. 08/853,826 and 09/074,720, the disclosures of which are incorporated herein by reference in their entireties.

[0537] Any vaccine provided herein may be prepared using well known methods that result in a combination of antigen, immune response enhancer and a suitable carrier or excipient. The compositions described herein may be administered as part of a sustained release formulation (i.e., a formulation such as a capsule, sponge or gel (composed of polysaccharides, for example) that effects a slow release of compound following administration). Such formulations may generally be prepared using well known technology (see, e.g., Coombes et al., Vaccine 14:1429-1438, 1996) and administered by, for example, oral, rectal or subcutaneous implantation, or by implantation at the desired target site. Sustained-release formulations may contain a polypeptide, polynucleotide or antibody dispersed in a carrier matrix and/or contained within a reservoir surrounded by a rate controlling membrane.

[0538] Carriers for use within such formulations are biocompatible, and may also be biodegradable; preferably the formulation provides a relatively constant level of active component release. Such carriers include microparticles of poly(lactide-co-glycolide), polyacrylate, latex, starch, cellulose, dextran and the like. Other delayed-release carriers include supramolecular biovectors, which comprise a non-liquid hydrophilic core (e.g., a cross-linked polysaccharide or oligosaccharide) and, optionally, an external layer comprising an amphiphilic compound, such as a phospholipid (see e.g., U.S. Pat. No. 5,151,254 and PCT applications WO 94/20078, WO/94/23701 and WO 96/06638). The amount of active compound contained within a sustained release formulation depends upon the site of implantation, the rate and expected duration of release and the nature of the condition to be treated or prevented.

[0539] Any of a variety of delivery vehicles may be employed within pharmaceutical compositions and vaccines to facilitate production of an antigen-specific immune response that targets tumor cells. Delivery vehicles include antigen presenting cells (APCs), such as dendritic cells, macrophages, B cells, monocytes and other cells that may be engineered to be efficient APCs. Such cells may, but need not, be genetically modified to increase the capacity for presenting the antigen, to improve activation and/or maintenance of the T cell response, to have anti-tumor effects per se and/or to be immunologically compatible with the receiver (i.e., matched HLA haplotype). APCs may generally be isolated from any of a variety of biological fluids and organs, including tumor and peritumoral tissues, and may be autologous, allogeneic, syngeneic or xenogeneic cells.

[0540] Certain preferred embodiments of the present invention use dendritic cells or progenitors thereof as antigen-presenting cells. Dendritic cells are highly potent APCs (Banchereau and Steinman, Nature 392:245-251, 1998) and have been shown to be effective as a physiological adjuvant for eliciting prophylactic or therapeutic antitumor immunity (see Timmerman and Levy, Ann. Rev. Med. 50:507-529, 1999). In general, dendritic cells may be identified based on their typical shape (stellate in situ, with marked cytoplasmic processes (dendrites) visible in vitro), their ability to take up, process and present antigens with high efficiency and their ability to activate naive T cell responses. Dendritic cells may, of course, be engineered to express specific cell-surface receptors or ligands that are not commonly found on dendritic cells in vivo or ex vivo, and such modified dendritic cells are contemplated by the present invention. As an alternative to dendritic cells, secreted vesicles antigen-loaded dendritic cells (called exosomes) may be used within a vaccine (see Zitvogel et al., Nature Med. 4:594-600, 1998).

[0541] Dendritic cells and progenitors may be obtained from peripheral blood, bone marrow, tumor-infiltrating cells, peritumoral tissues-infiltrating cells, lymph nodes, spleen, skin, umbilical cord blood or any other suitable tissue or fluid. For example, dendritic cells may be differentiated ex vivo by adding a combination of cytokines such as GM-CSF, IL-4, IL-13 and/or TNF.alpha. to cultures of monocytes harvested from peripheral blood. Alternatively, CD34 positive cells harvested from peripheral blood, umbilical cord blood or bone marrow may be differentiated into dendritic cells by adding to the culture medium combinations of GM-CSF, IL-3, TNF.alpha., CD40 ligand, LPS, flt3 ligand and/or other compound(s) that induce differentiation, maturation and proliferation of dendritic cells.

[0542] Dendritic cells are conveniently categorized as "immature" and "mature" cells, which allows a simple way to discriminate between two well characterized phenotypes. However, this nomenclature should not be construed to exclude all possible intermediate stages of differentiation. Immature dendritic cells are characterized as APC with a high capacity for antigen uptake and processing, which correlates with the high expression of Fey receptor and mannose receptor. The mature phenotype is typically characterized by a lower expression of these markers, but a high expression of cell surface molecules responsible for T cell activation such as class I and class II MHC, adhesion molecules (e.g., CD54 and CD11) and costimulatory molecules (e.g., CD40, CD80, CD86 and 4-1BB).

[0543] APCs may generally be transfected with a polynucleotide encoding a hematological malignancy-related tumor protein (or portion or other variant thereof) such that the hematological malignancy-related tumor polypeptide, or an immunogenic portion thereof, is expressed on the cell surface. Such transfection may take place ex vivo, and a composition or vaccine comprising such transfected cells may then be used for therapeutic purposes, as described herein. Alternatively, a gene delivery vehicle that targets a dendritic or other antigen presenting cell may be administered to a patient, resulting in transfection that occurs in vivo. In vivo and ex vivo transfection of dendritic cells, for example, may generally be performed using any methods known in the art, such as those described in WO 97/24447, or the gene gun approach described by Mahvi et al., Immunology and cell Biology 75:456-460, 1997. Antigen loading of dendritic cells may be achieved by incubating dendritic cells or progenitor cells with the hematological malignancy-related tumor polypeptide, DNA (naked or within a plasmid vector) or RNA; or with antigen-expressing recombinant bacterium or viruses (e.g., vaccinia, fowlpox, adenovirus or lentivirus vectors). Prior to loading, the polypeptide may be covalently conjugated to an immunological partner that provides T cell help (e.g., a carrier molecule). Alternatively, a dendritic cell may be pulsed with a non-conjugated immunological partner, separately or in the presence of the polypeptide.

[0544] Vaccines and pharmaceutical compositions may be presented in unit-dose or multi-dose containers, such as sealed ampoules or vials. Such containers are preferably hermetically sealed to preserve sterility of the formulation until use. In general, formulations may be stored as suspensions, solutions or emulsions in oily or aqueous vehicles. Alternatively, a vaccine or pharmaceutical composition may be stored in a freeze-dried condition requiring only the addition of a sterile liquid carrier immediately prior to use.

[0545] 4.22 Cancer Therapy

[0546] In further aspects of the present invention, the compositions described herein may be used for immunotherapy of cancer, such as hematological malignancy. Within such methods, pharmaceutical compositions and vaccines are typically administered to a patient. As used herein, a "patient" refers to any warm-blooded animal, preferably a human. A patient may or may not be afflicted with cancer. Accordingly, the above pharmaceutical compositions and vaccines may be used to prevent the development of a cancer or to treat a patient afflicted with a cancer. A cancer may be diagnosed using criteria generally accepted in the art, including the presence of a malignant tumor. Pharmaceutical compositions and vaccines may be administered either prior to or following surgical removal of primary tumors and/or treatment such as administration of radiotherapy or conventional chemotherapeutic drugs. Administration may be by any suitable method, including administration by intravenous, intraperitoneal, intramuscular, subcutaneous, intranasal, intradermal, anal, vaginal, topical and oral routes.

[0547] Within certain embodiments, immunotherapy may be active immunotherapy, in which treatment relies on the in vivo stimulation of the endogenous host immune system to react against tumors with the administration of immune response-modifying agents (such as polypeptides and polynucleotides as provided herein).

[0548] Within other embodiments, immunotherapy may be passive immunotherapy, in which treatment involves the delivery of agents with established tumor-immune reactivity (such as effector cells or antibodies) that can directly or indirectly mediate antitumor effects and does not necessarily depend on an intact host immune system. Examples of effector cells include T cells as discussed above, T lymphocytes (such as CD8.sup.+ cytotoxic T lymphocytes and CD4.sup.+ T-helper tumor-infiltrating lymphocytes), killer cells (such as Natural Killer cells and lymphokine-activated killer cells), B cells and antigen-presenting cells (such as dendritic cells and macrophages) expressing a polypeptide provided herein. T cell receptors and antibody receptors specific for the polypeptides recited herein may be cloned, expressed and transferred into other vectors or effector cells for adoptive immunotherapy. The polypeptides provided herein may also be used to generate antibodies or anti-idiotypic antibodies (as described above and in U.S. Pat. No. 4,918,164) for passive immunotherapy.

[0549] Effector cells may generally be obtained in sufficient quantities for adoptive immunotherapy by growth in vitro, as described herein. Culture conditions for expanding single antigen-specific effector cells to several billion in number with retention of antigen recognition in vivo are well known in the art. Such in vitro culture conditions typically use intermittent stimulation with antigen, often in the presence of cytokines (such as IL-2) and non-dividing feeder cells. As noted above, immunoreactive polypeptides as provided herein may be used to rapidly expand antigen-specific T cell cultures in order to generate a sufficient number of cells for immunotherapy. In particular, antigen-presenting cells, such as dendritic, macrophage, monocyte, fibroblast and/or B cells, may be pulsed with immunoreactive polypeptides or transfected with one or more polynucleotides using standard techniques well known in the art. For example, antigen-presenting cells can be transfected with a polynucleotide having a promoter appropriate for increasing expression in a recombinant virus or other expression system. Cultured effector cells for use in therapy must be able to grow and distribute widely, and to survive long term in vivo. Studies have shown that cultured effector cells can be induced to grow in vivo and to survive long term in substantial numbers by repeated stimulation with antigen supplemented with IL-2 (see, for example, Cheever et al., Immunological Reviews 157:177, 1997).

[0550] Alternatively, a vector expressing a polypeptide recited herein may be introduced into antigen presenting cells taken from a patient and clonally propagated ex vivo for transplant back into the same patient. Transfected cells may be reintroduced into the patient using any means known in the art, preferably in sterile form by intravenous, intracavitary, intraperitoneal or intratumor administration.

[0551] Routes and frequency of administration of the therapeutic compositions described herein, as well as dosage, will vary from individual to individual, and may be readily established using standard techniques. In general, the pharmaceutical compositions and vaccines may be administered by injection (e.g., intracutaneous, intramuscular, intravenous or subcutaneous), intranasally (e.g., by aspiration) or orally. Preferably, between 1 and 10 doses may be administered over a 52 week period. Preferably, 6 doses are administered, at intervals of 1 month, and booster vaccinations may be given periodically thereafter. Alternate protocols may be appropriate for individual patients. A suitable dose is an amount of a compound that, when administered as described above, is capable of promoting an anti-tumor immune response, and is at least 10-50% above the basal (i.e., untreated) level. Such response can be monitored by measuring the anti-tumor antibodies in a patient or by vaccine-dependent generation of cytolytic effector cells capable of killing the patient's tumor cells in vitro. Such vaccines should also be capable of causing an immune response that leads to an improved clinical outcome (e.g., more frequent remissions, complete or partial or longer disease-free survival) in vaccinated patients as compared to non-vaccinated patients. In general, for pharmaceutical compositions and vaccines comprising one or more polypeptides, the amount of each polypeptide present in a dose ranges from about 25 .mu.g to 5 mg per kg of host. Suitable dose sizes will vary with the size of the patient, but will typically range from about 0.1 mL to about 5 mL.

[0552] In general, an appropriate dosage and treatment regimen provides the active compound(s) in an amount sufficient to provide therapeutic and/or prophylactic benefit. Such a response can be monitored by establishing an improved clinical outcome (e.g., more frequent remissions, complete or partial, or longer disease-free survival) in treated patients as compared to non-treated patients. Increases in preexisting immune responses to a hematological malignancy-related tumor protein generally correlate with an improved clinical outcome. Such immune responses may generally be evaluated using standard proliferation, cytotoxicity or cytokine assays, which may be performed using samples obtained from a patient before and after treatment.

[0553] 4.23 Cancer Detection and Diagnosis

[0554] In general, a cancer may be detected in a patient based on the presence of one or more hematological malignancy-related tumor proteins and/or polynucleotides encoding such proteins in a biological sample (for example, blood, sera, sputum urine and/or tumor biopsies) obtained from the patient. In other words, such proteins may be used as markers to indicate the presence or absence of a cancer such as hematological malignancy. In addition, such proteins may be useful for the detection of other cancers. The binding agents provided herein generally permit detection of the level of antigen that binds to the agent in the biological sample. Polynucleotide primers and probes may be used to detect the level of mRNA encoding a tumor protein, which is also indicative of the presence or absence of a cancer. In general, a hematological malignancy-related tumor sequence should be present at a level that is at least three fold higher in tumor tissue than in normal tissue

[0555] There are a variety of assay formats known to those of ordinary skill in the art for using a binding agent to detect polypeptide markers in a sample. See, e.g., Harlow and Lane, Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory, 1988. In general, the presence or absence of a cancer in a patient may be determined by (a) contacting a biological sample obtained from a patient with a binding agent; (b) detecting in the sample a level of polypeptide that binds to the binding agent; and (c) comparing the level of polypeptide with a predetermined cut-off value.

[0556] In a preferred embodiment, the assay involves the use of binding agent immobilized on a solid support to bind to and remove the polypeptide from the remainder of the sample. The bound polypeptide may then be detected using a detection reagent that contains a reporter group and specifically binds to the binding agent/polypeptide complex. Such detection reagents may comprise, for example, a binding agent that specifically binds to the polypeptide or an antibody or other agent that specifically binds to the binding agent, such as an anti-immunoglobulin, protein G, protein A or a lectin. Alternatively, a competitive assay may be utilized, in which a polypeptide is labeled with a reporter group and allowed to bind to the immobilized binding agent after incubation of the binding agent with the sample. The extent to which components of the sample inhibit the binding of the labeled polypeptide to the binding agent is indicative of the reactivity of the sample with the immobilized binding agent. Suitable polypeptides for use within such assays include full length hematological malignancy-related tumor proteins and portions thereof to which the binding agent binds, as described above.

[0557] The solid support may be any material known to those of ordinary skill in the art to which the tumor protein may be attached. For example, the solid support may be a test well in a microtiter plate or a nitrocellulose or other suitable membrane. Alternatively, the support may be a bead or disc, such as glass, fiberglass, latex or a plastic material such as polystyrene or polyvinylchloride. The support may also be a magnetic particle or a fiber optic sensor, such as those disclosed, for example, in U.S. Pat. No. 5,359,681. The binding agent may be immobilized on the solid support using a variety of techniques known to those of skill in the art, which are amply described in the patent and scientific literature. In the context of the present invention, the term "immobilization" refers to both noncovalent association, such as adsorption, and covalent attachment (which may be a direct linkage between the agent and functional groups on the support or may be a linkage by way of a cross-linking agent). Immobilization by adsorption to a well in a microtiter plate or to a membrane is preferred. In such cases, adsorption may be achieved by contacting the binding agent, in a suitable buffer, with the solid support for a suitable amount of time. The contact time varies with temperature, but is typically between about 1 hour and about 1 day. In general, contacting a well of a plastic microtiter plate (such as polystyrene or polyvinylchloride) with an amount of binding agent ranging from about 10 ng to about 0.10 .mu.g, and preferably about 100 ng to about 1 .mu.g, is sufficient to immobilize an adequate amount of binding agent.

[0558] Covalent attachment of binding agent to a solid support may generally be achieved by first reacting the support with a bifunctional reagent that will react with both the support and a functional group, such as a hydroxyl or amino group, on the binding agent. For example, the binding agent may be covalently attached to supports having an appropriate polymer coating using benzoquinone or by condensation of an aldehyde group on the support with an amine and an active hydrogen on the binding partner (see, e.g., Pierce Immunotechnology Catalog and Handbook, 1991, at A12-A13).

[0559] In certain embodiments, the assay is a two-antibody sandwich assay. This assay may be performed by first contacting an antibody that has been immobilized on a solid support, commonly the well of a microtiter plate, with the sample, such that polypeptides within the sample are allowed to bind to the immobilized antibody. Unbound sample is then removed from the immobilized polypeptide-antibody complexes and a detection reagent (preferably a second antibody capable of binding to a different site on the polypeptide) containing a reporter group is added. The amount of detection reagent that remains bound to the solid support is then determined using a method appropriate for the specific reporter group.

[0560] More specifically, once the antibody is immobilized on the support as described above, the remaining protein binding sites on the support are typically blocked. Any suitable blocking agent known to those of ordinary skill in the art, such as bovine serum albumin or Tween 20.TM. (Sigma Chemical Co., St. Louis, Mo.). The immobilized antibody is then incubated with the sample, and polypeptide is allowed to bind to the antibody. The sample may be diluted with a suitable diluent, such as phosphate-buffered saline (PBS) prior to incubation. In general, an appropriate contact time (i.e., incubation time) is a period of time that is sufficient to detect the presence of polypeptide within a sample obtained from an individual with hematological malignancy. Preferably, the contact time is sufficient to achieve a level of binding that is at least about 95% of that achieved at equilibrium between bound and unbound polypeptide. Those of ordinary skill in the art will recognize that the time necessary to achieve equilibrium may be readily determined by assaying the level of binding that occurs over a period of time. At room temperature, an incubation time of about 30 minutes is generally sufficient.

[0561] Unbound sample may then be removed by washing the solid support with an appropriate buffer, such as PBS containing 0.1% Tween 20.TM.. The second antibody, which contains a reporter group, may then be added to the solid support. Preferred reporter groups include those groups recited above.

[0562] The detection reagent is then incubated with the immobilized antibody-polypeptide complex for an amount of time sufficient to detect the bound polypeptide. An appropriate amount of time may generally be determined by assaying the level of binding that occurs over a period of time. Unbound detection reagent is then removed and bound detection reagent is detected using the reporter group. The method employed for detecting the reporter group depends upon the nature of the reporter group. For radioactive groups, scintillation counting or autoradiographic methods are generally appropriate. Spectroscopic methods may be used to detect dyes, luminescent groups and fluorescent groups. Biotin may be detected using avidin, coupled to a different reporter group (commonly a radioactive or fluorescent group or an enzyme). Enzyme reporter groups may generally be detected by the addition of substrate (generally for a specific period of time), followed by spectroscopic or other analysis of the reaction products.

[0563] To determine the presence or absence of a cancer, such as hematological malignancy, the signal detected from the reporter group that remains bound to the solid support is generally compared to a signal that corresponds to a predetermined cut-off value. In one preferred embodiment, the cut-off value for the detection of a cancer is the average mean signal obtained when the immobilized antibody is incubated with samples from patients without the cancer. In general, a sample generating a signal that is three standard deviations above the predetermined cut-off value is considered positive for the cancer. In an alternate preferred embodiment, the cut-off value is determined using a Receiver Operator Curve, according to the method of Sackett et al., Clinical Epidemiology: A Basic Science for Clinical Medicine, Little Brown and Co., 1985, p. 106-7. Briefly, in this embodiment, the cut-off value may be determined from a plot of pairs of true positive rates (i.e., sensitivity) and false positive rates (100%-specificity) that correspond to each possible cut-off value for the diagnostic test result. The cut-off value on the plot that is the closest to the upper left-hand corner (i.e., the value that encloses the largest area) is the most accurate cut-off value, and a sample generating a signal that is higher than the cut-off value determined by this method may be considered positive. Alternatively, the cut-off value may be shifted to the left along the plot, to minimize the false positive rate, or to the right, to minimize the false negative rate. In general, a sample generating a signal that is higher than the cut-off value determined by this method is considered positive for a cancer.

[0564] In a related embodiment, the assay is performed in a flow-through or strip test format, wherein the binding agent is immobilized on a membrane, such as nitrocellulose. In the flow-through test, polypeptides within the sample bind to the immobilized binding agent as the sample passes through the membrane. A second, labeled binding agent then binds to the binding agent-polypeptide complex as a solution containing the second binding agent flows through the membrane. The detection of bound second binding agent may then be performed as described above. In the strip test format, one end of the membrane to which binding agent is bound is immersed in a solution containing the sample. The sample migrates along the membrane through a region containing second binding agent and to the area of immobilized binding agent. Concentration of second binding agent at the area of immobilized antibody indicates the presence of a cancer. Typically, the concentration of second binding agent at that site generates a pattern, such as a line, that can be read visually. The absence of such a pattern indicates a negative result. In general, the amount of binding agent immobilized on the membrane is selected to generate a visually discernible pattern when the biological sample contains a level of polypeptide that would be sufficient to generate a positive signal in the two-antibody sandwich assay, in the format discussed above. Preferred binding agents for use in such assays are antibodies and antigen-binding fragments thereof. Preferably, the amount of antibody immobilized on the membrane ranges from about 25 ng to about 1 .mu.g, and more preferably from about 50 ng to about 500 ng. Such tests can typically be performed with a very small amount of biological sample.

[0565] Of course, numerous other assay protocols exist that are suitable for use with the tumor proteins or binding agents of the present invention. The above descriptions are intended to be exemplary only. For example, it will be apparent to those of ordinary skill in the art that the above protocols may be readily modified to use hematological malignancy-related tumor polypeptides to detect antibodies that bind to such polypeptides in a biological sample. The detection of such hematological malignancy-related tumor protein specific antibodies may correlate with the presence of a cancer.

[0566] A cancer may also, or alternatively, be detected based on the presence of T cells that specifically react with a hematological malignancy-related tumor protein in a biological sample. Within certain methods, a biological sample comprising CD4.sup.+ and/or CD8.sup.+ T cells isolated from a patient is incubated with a hematological malignancy-related tumor polypeptide, a polynucleotide encoding such a polypeptide and/or an APC that expresses at least an immunogenic portion of such a polypeptide, and the presence or absence of specific activation of the T cells is detected. Suitable biological samples include, but are not limited to, isolated T cells. For example, T cells may be isolated from a patient by routine techniques (such as by Ficoll/Hypaque density gradient centrifugation of peripheral blood lymphocytes). T cells may be incubated in vitro for 2-9 days (typically 4 days) at 37.degree. C. with polypeptide (e.g., 5-25 .mu.g/ml). It may be desirable to incubate another aliquot of a T cell sample in the absence of hematological malignancy-related tumor polypeptide to serve as a control. For CD4.sup.+ T cells, activation is preferably detected by evaluating proliferation of the T cells. For CD8.sup.+ T cells, activation is preferably detected by evaluating cytolytic activity. A level of proliferation that is at least two fold greater and/or a level of cytolytic activity that is at least 20% greater than in disease-free patients indicates the presence of a cancer in the patient.

[0567] As noted above, a cancer may also, or alternatively, be detected based on the level of mRNA encoding a hematological malignancy-related tumor protein in a biological sample. For example, at least two oligonucleotide primers may be employed in a polymerase chain reaction (PCR) based assay to amplify a portion of a hematological malignancy-related tumor cDNA derived from a biological sample, wherein at least one of the oligonucleotide primers is specific for (i.e., hybridizes to) a polynucleotide encoding the hematological malignancy-related tumor protein. The amplified cDNA is then separated and detected using techniques well known in the art, such as gel electrophoresis. Similarly, oligonucleotide probes that specifically hybridize to a polynucleotide encoding a hematological malignancy-related tumor protein may be used in a hybridization assay to detect the presence of polynucleotide encoding the tumor protein in a biological sample.

[0568] To permit hybridization under assay conditions, oligonucleotide primers and probes should comprise an oligonucleotide sequence that has at least about 60%, preferably at least about 75% and more preferably at least about 90%, identity to a portion of a polynucleotide encoding a hematological malignancy-related tumor protein that is at least 10 nucleotides, and preferably at least 20 nucleotides, in length. Preferably, oligonucleotide primers and/or probes hybridize to a polynucleotide encoding a polypeptide described herein under moderately stringent conditions, as defined above. Oligonucleotide primers and/or probes which may be usefully employed in the diagnostic methods described herein preferably are at least 10-40 nucleotides in length. In a preferred embodiment, the oligonucleotide primers comprise at least 10 contiguous nucleotides, more preferably at least 15 contiguous nucleotides, of a DNA molecule having a sequence disclosed in this application. Techniques for both PCR based assays and hybridization assays are well known in the art (see, for example, Mullis et al., Cold Spring Harbor Symp. Quant. Biol., 51:263, 1987; Erlich ed., PCR Technology, Stockton Press, N Y, 1989).

[0569] One preferred assay employs RT-PCR, in which PCR is applied in conjunction with reverse transcription. Typically, RNA is extracted from a biological sample, such as biopsy tissue, and is reverse transcribed to produce cDNA molecules. PCR amplification using at least one specific primer generates a cDNA molecule, which may be separated and visualized using, for example, gel electrophoresis. Amplification may be performed on biological samples taken from a test patient and from an individual who is not afflicted with a cancer. The amplification reaction may be performed on several dilutions of cDNA spanning two orders of magnitude. A two-fold or greater increase in expression in several dilutions of the test patient sample as compared to the same dilutions of the non-cancerous sample is typically considered positive.

[0570] In another embodiment, the compositions described herein may be used as markers for the progression of cancer. In this embodiment, assays as described above for the diagnosis of a cancer may be performed over time, and the change in the level of reactive polypeptide(s) or polynucleotide(s) evaluated. For example, the assays may be performed every 24-72 hours for a period of 6 months to 1 year, and thereafter performed as needed. In general, a cancer is progressing in those patients in whom the level of polypeptide or polynucleotide detected increases over time. In contrast, the cancer is not progressing when the level of reactive polypeptide or polynucleotide either remains constant or decreases with time.

[0571] Certain in vivo diagnostic assays may be performed directly on a tumor. One such assay involves contacting tumor cells with a binding agent. The bound binding agent may then be detected directly or indirectly via a reporter group. Such binding agents may also be used in histological applications. Alternatively, polynucleotide probes may be used within such applications.

[0572] As noted above, to improve sensitivity, multiple hematological malignancy-related tumor protein markers may be assayed within a given sample. It will be apparent that binding agents specific for different proteins provided herein may be combined within a single assay. Further, multiple primers or probes may be used concurrently. The selection of tumor protein markers may be based on routine experiments to determine combinations that results in optimal sensitivity. In addition, or alternatively, assays for tumor proteins provided herein may be combined with assays for other known tumor antigens.

[0573] 4.24 Preparation of DNA Sequences

[0574] Certain nucleic acid sequences of cDNA molecules encoding portions of hematological malignancy-related antigens were isolated by PCR.TM.-based subtraction. This technique serves to normalize differentially expressed cDNAs, facilitating the recovery of rare transcripts, and also has the advantage of permitting enrichment of cDNAs with small amounts of polyA RNA material and without multiple rounds of hybridization. To obtain antigens overexpressed in non-Hodgkin's lymphomas, two subtractions were performed with a tester library prepared from a pool of three T cell non-Hodgkin's lymphoma mRNAs. The two libraries were independently subtracted with different pools of driver cDNAs. Driver #1 contained cDNA prepared from specific normal tissues (lymph node, bone marrow, T cells, heart and brain), and this subtraction generated the library TCS-D1 (T cell non-Hodgkin's lymphoma subtracted library with driver #1). Driver #2 contained non-specific normal tissues (colon, large intestine, lung, pancreas, spinal cord, skeletal muscle, liver, kidney, skin and brain), and this subtraction generated the library TCS-D2 (T cell non-Hodgkin's lymphoma subtraction library with driver #2). Two other subtractions were performed with a tester library prepared from a pool of three B cell non-Hodgkin's lymphoma mRNAs. The two libraries were independently subtracted with different pools of driver cDNAs. Driver #1 contained cDNA prepared from specific normal tissues (lymph node, bone marrow, B cells, heart and brain), and this subtraction generated the library BCNHL/D1 (B cell non-Hodgkin's lymphoma subtracted library with driver #1). Driver #2 contained non-specific normal tissues (brain, lung, pancreas, spinal cord, skeletal muscle, colon, spleen, large intestine and PBMC), and this subtraction generated the library BCNHL/D2 (B cell non-Hodgkin's lymphoma subtraction library with driver #2). PCR.TM.-amplified pools were generated from the subtracted libraries and clones were sequenced.

[0575] Hematological malignancy-related antigen sequences may be further characterized using any of a variety of well known techniques. For example, PCR.TM. amplified clones may be arrayed onto glass slides for microarray analysis. To determine tissue distribution, the arrayed clones may be used as targets to be hybridized with different first strand cDNA probes, including lymphoma probes, leukemia probes and probes from different normal tissues. Leukemia and lymphoma probes may be generated from cryopreserved samples obtained at the time of diagnosis from NHL, Hodgkin's disease, AML, CML, CLL, ALL, MDS and myeloma patients with poor outcome (patients who failed to achieve complete remission following conventional chemotherapy or relapsed) or good outcome (patients who achieved long term remission). To analyze gene expression during hematopoetic differentiation, probes may be generated from >95% pure fractions of CD34+, CD2+, CD14+, CD15+ and CD19+ cells derived from healthy individuals.

[0576] Polynucleotide variants may generally be prepared by any method known in the art, including chemical synthesis by, for example, solid phase phosphoramidite chemical synthesis. Modifications in a polynucleotide sequence may also be introduced using standard mutagenesis techniques, such as oligonucleotide-directed site-specific mutagenesis (see Adelman et al., DNA 2:183, 1983). Alternatively, RNA molecules may be generated by in vitro or in vivo transcription of DNA sequences, provided that the DNA is incorporated into a vector with a suitable RNA polymerase promoter (such as T7 or SP6). Certain portions may be used to prepare an encoded polypeptide, as described herein. In addition, or alternatively, a portion may be administered to a patient such that the encoded polypeptide is generated in vivo (e.g., by transfecting antigen-presenting cells, such as dendritic cells, with a cDNA construct encoding a hematological malignancy-related antigen, and administering the transfected cells to the patient).

[0577] A portion of a sequence complementary to a coding sequence (i.e., an antisense polynucleotide) may also be used as a probe or to modulate hematological malignancy-related antigen expression. cDNA constructs that can be transcribed into antisense RNA may also be introduced into cells or tissues to facilitate the production of antisense RNA. An antisense polynucleotide may be used, as described herein, to inhibit expression of a hematological malignancy-related antigen. Antisense technology can be used to control gene expression through triple-helix formation, which compromises the ability of the double helix to open sufficiently for the binding of polymerases, transcription factors or regulatory molecules (see Gee et al., In Huber and Canrr, Molecular and Immunologic Approaches, Futura Publishing Co. (Mt. Kisco, N.Y.; 1994)). Alternatively, an antisense molecule may be designed to hybridize with a control region of a gene (e.g., promoter, enhancer or transcription initiation site), and block transcription of the gene; or to block translation by inhibiting binding of a transcript to ribosomes.

[0578] A portion of a coding sequence or of a complementary sequence may also be designed as a probe or primer to detect gene expression. Probes may be labeled with a variety of reporter groups, such as radionuclides and enzymes, and are preferably at least 10 nucleotides in length, more preferably at least 20 nucleotides in length and still more preferably at least 30 nucleotides in length. Primers, as noted above, are preferably 22-30 nucleotides in length.

[0579] Any polynucleotide may be further modified to increase stability in vivo. Possible modifications include, but are not limited to, the addition of flanking sequences at the 5' and/or 3' ends; the use of phosphorothioate or 2' O-methyl rather than phosphodiesterase linkages in the backbone; and/or the inclusion of nontraditional bases such as inosine, queosine and wybutosine, as well as acetyl-methyl-, thio- and other modified forms of adenine, cytidine, guanine, thymine and uridine.

[0580] Hematological malignancy-related antigen polynucleotides may be joined to a variety of other nucleotide sequences using established recombinant DNA techniques. For example, a polynucleotide may be cloned into any of a variety of cloning vectors, including plasmids, phagemids, lambda phage derivatives and cosmids. Vectors of particular interest include expression vectors, replication vectors, probe generation vectors and sequencing vectors. In general, a vector will contain an origin of replication functional in at least one organism, convenient restriction endonuclease sites and one or more selectable markers. Other elements will depend upon the desired use, and will be apparent to those of ordinary skill in the art.

[0581] Within certain embodiments, polynucleotides may be formulated so as to permit entry into a cell of a mammal, and expression therein. Such formulations are particularly useful for therapeutic purposes, as described below. Those of ordinary skill in the art will appreciate that there are many ways to achieve expression of a polynucleotide in a target cell, and any suitable method may be employed. For example, a polynucleotide may be incorporated into a viral vector such as, but not limited to, adenovirus, adeno-associated virus, retrovirus, or vaccinia or other pox virus (e.g., avian pox virus). Techniques for incorporating DNA into such vectors are well known to those of ordinary skill in the art. A retroviral vector may additionally transfer or incorporate a gene for a selectable marker (to aid in the identification or selection of transduced cells) and/or a targeting moiety, such as a gene that encodes a ligand for a receptor on a specific target cell, to render the vector target specific. Targeting may also be accomplished using an antibody, by methods known to those of ordinary skill in the art.

[0582] Other formulations for therapeutic purposes include colloidal dispersion systems, such as macromolecule complexes, nanocapsules, microspheres, beads, and lipid-based systems including oil-in-water emulsions, micelles, mixed micelles, and liposomes. A preferred colloidal system for use as a delivery vehicle in vitro and in vivo is a liposome (i.e., an artificial membrane vesicle). The preparation and use of such systems is well known in the art.

[0583] 4.25 Therapeutic Methods

[0584] In further aspects of the present invention, the compositions described herein may be used for immunotherapy of hematological malignancies including adult and pediatric AML, CML, ALL, CLL, myelodysplastic syndromes (MDS), myeloproliferative syndromes (MPS), secondary leukemia, multiple myeloma, Hodgkin's lymphoma and Non-Hodgkin's lymphomas. In addition, compositions described herein may be used for therapy of diseases associated with an autoimmune response against hematopoetic precursor cells, such as severe aplastic anemia.

[0585] Immunotherapy may be performed using any of a variety of techniques, in which compounds or cells provided herein function to remove hematological malignancy-related antigen-expressing cells from a patient. Such removal may take place as a result of enhancing or inducing an immune response in a patient specific for hematological malignancy-related antigen or a cell expressing hematological malignancy-related antigen. Alternatively, hematological malignancy-related antigen-expressing cells may be removed ex vivo (e.g., by treatment of autologous bone marrow, peripheral blood or a fraction of bone marrow or peripheral blood). Fractions of bone marrow or peripheral blood may be obtained using any standard technique in the art.

[0586] Within such methods, pharmaceutical compositions and vaccines are typically administered to a patient. As used herein, a "patient" refers to any warm-blooded animal, preferably a human. A patient may or may not be afflicted with a hematological malignancy. Accordingly, the above pharmaceutical compositions and vaccines may be used to prevent the development of a malignancy or to treat a patient afflicted with a malignancy. A hematological malignancy may be diagnosed using criteria generally accepted in the art. Pharmaceutical compositions and vaccines may be administered either prior to or following surgical removal of primary tumors and/or treatment such as administration of radiotherapy or conventional chemotherapeutic drugs, or bone marrow transplantation (autologous, allogeneic or syngeneic).

[0587] Within certain embodiments, immunotherapy may be active immunotherapy, in which treatment relies on the in vivo stimulation of the endogenous host immune system to react against tumors with the administration of immune response-modifying agents (such as polypeptides and polynucleotides as provided herein).

[0588] Within other embodiments, immunotherapy may be passive immunotherapy, in which treatment involves the delivery of agents with established tumor-immune reactivity (such as effector cells or antibodies) that can directly or indirectly mediate antitumor effects and does not necessarily depend on an intact host immune system. Examples of effector cells include T cells as discussed above, T lymphocytes (such as CD8.sup.+ cytotoxic T lymphocytes and CD4.sup.+ T-helper tumor-infiltrating lymphocytes), killer cells (such as Natural Killer cells and lymphokine-activated killer cells), B cells and antigen-presenting cells (such as dendritic cells and macrophages) expressing a polypeptide provided herein. T cell receptors and antibody receptors specific for the polypeptides recited herein may be cloned, expressed and transferred into other vectors or effector cells for adoptive immunotherapy. The polypeptides provided herein may also be used to generate antibodies or anti-idiotypic antibodies (as described above and in U.S. Pat. No. 4,918,164) for passive immunotherapy.

[0589] Effector cells may generally be obtained in sufficient quantities for adoptive immunotherapy by growth in vitro, as described herein. Culture conditions for expanding single antigen-specific effector cells to several billion in number with retention of antigen recognition in vivo are well known in the art. Such in vitro culture conditions typically use intermittent stimulation with antigen, often in the presence of cytokines (such as IL-2) and non-dividing feeder cells. As noted above, immunoreactive polypeptides as provided herein may be used to rapidly expand antigen-specific T cell cultures in order to generate a sufficient number of cells for immunotherapy. In particular, antigen-presenting cells, such as dendritic, macrophage or B cells, may be pulsed with immunoreactive polypeptides or transfected with one or more polynucleotides using standard techniques well known in the art. For example, antigen-presenting cells can be transfected with a polynucleotide having a promoter appropriate for increasing expression in a recombinant virus or other expression system. Cultured effector cells for use in therapy must be able to grow and distribute widely, and to survive long term in vivo. Studies have shown that cultured effector cells can be induced to grow in vivo and to survive long term in substantial numbers by repeated stimulation with antigen supplemented with IL-2 (see, for example, Cheever et al., Immunological Reviews 157:177, 1997).

[0590] Alternatively, a vector expressing a polypeptide recited herein may be introduced into antigen presenting cells taken from a patient and clonally propagated ex vivo for transplant back into the same patient. Transfected cells may be reintroduced into the patient using any means known in the art, preferably in sterile form by intravenous, intracavitary, intraperitoneal or intratumor administration.

[0591] The compositions provided herein may be used alone or in combination with conventional therapeutic regimens such as surgery, irradiation, chemotherapy and/or bone marrow transplantation (autologous, syngeneic, allogeneic or unrelated). As discussed in greater detail below, binding agents and T cells as provided herein may be used for purging of autologous stem cells. Such purging may be beneficial prior to, for example, bone marrow transplantation or transfusion of blood or components thereof. Binding agents, T cells, antigen presenting cells (APC) and compositions provided herein may further be used for expanding and stimulating (or priming) autologous, allogeneic, syngeneic or unrelated hematological malignancy-related antigen-specific T-cells in vitro and/or in vivo. Such hematological malignancy-related antigen-specific T cells may be used, for example, within donor lymphocyte infusions.

[0592] Routes and frequency of administration of the therapeutic compositions described herein, as well as dosage, will vary from individual to individual, and may be readily established using standard techniques. In general, the pharmaceutical compositions and vaccines may be administered by injection (e.g., intracutaneous, intramuscular, intravenous or subcutaneous), intranasally (e.g., by aspiration) or orally. Preferably, between 1 and 10 doses may be administered over a 52 week period. Preferably, 6 doses are administered, at intervals of 1 month, and booster vaccinations may be given periodically thereafter. Alternate protocols may be appropriate for individual patients. A suitable dose is an amount of a compound that, when administered as described above, is capable of promoting an anti-tumor immune response, and is at least 10-50% above the basal (i.e., untreated) level. Such response can be monitored by measuring the anti-tumor antibodies in a patient or by vaccine-dependent generation of cytolytic effector cells capable of killing the patient's tumor cells in vitro. Such vaccines should also be capable of causing an immune response that leads to an improved clinical outcome (e.g., more frequent remissions, complete or partial or longer disease-free survival) in vaccinated patients as compared to non-vaccinated patients. In general, for pharmaceutical compositions and vaccines comprising one or more polypeptides, the amount of each polypeptide present in a dose ranges from about 100 .mu.g to 5 mg per kg of host. Suitable dose sizes will vary with the size of the patient, but will typically range from about 0.1 mL to about 5 mL.

[0593] In general, an appropriate dosage and treatment regimen provides the active compound(s) in an amount sufficient to provide therapeutic and/or prophylactic benefit. Such a response can be monitored by establishing an improved clinical outcome (e.g., more frequent remissions, complete or partial, or longer disease-free survival) in treated patients as compared to non-treated patients. Increases in preexisting immune responses to a hematological malignancy-related antigen generally correlate with an improved clinical outcome. Such immune responses may generally be evaluated using standard proliferation, cytotoxicity or cytokine assays, which may be performed using samples obtained from a patient before and after treatment.

[0594] Within further aspects, methods for inhibiting the development of a malignant disease associated with hematological malignancy-related antigen expression involve the administration of autologous T cells that have been activated in response to a hematological malignancy-related antigen polypeptide or hematological malignancy-related antigen-expressing APC, as described above. Such T cells may be CD4.sup.+ and/or CD8.sup.+, and may be proliferated as described above. The T cells may be administered to the individual in an amount effective to inhibit the development of a malignant disease. Typically, about 1.times.10.sup.9 to 1.times.10.sup.11 T cells/M.sup.2 are administered intravenously, intracavitary or in the bed of a resected tumor. It will be evident to those skilled in the art that the number of cells and the frequency of administration will be dependent upon the response of the patient.

[0595] Within certain embodiments, T cells may be stimulated prior to an autologous bone marrow transplantation. Such stimulation may take place in vivo or in vitro. For in vitro stimulation, bone marrow and/or peripheral blood (or a fraction of bone marrow or peripheral blood) obtained from a patient may be contacted with a hematological malignancy-related antigen polypeptide, a polynucleotide encoding a hematological malignancy-related antigen polypeptide and/or an APC that expresses a hematological malignancy-related antigen polypeptide under conditions and for a time sufficient to permit the stimulation of T cells as described above. Bone marrow, peripheral blood stem cells and/or hematological malignancy-related antigen-specific T cells may then be administered to a patient using standard techniques.

[0596] Within related embodiments, T cells of a related or unrelated donor may be stimulated prior to a syngeneic or allogeneic (related or unrelated) bone marrow transplantation. Such stimulation may take place in vivo or in vitro. For in vitro stimulation, bone marrow and/or peripheral blood (or a fraction of bone marrow or peripheral blood) obtained from a related or unrelated donor may be contacted with a-hematological malignancy-related antigen polypeptide, hematological malignancy-related antigen polynucleotide and/or APC that expresses a hematological malignancy-related antigen polypeptide under conditions and for a time sufficient to permit the stimulation of T cells as described above. Bone marrow, peripheral blood stem cells and/or hematological malignancy-related antigen-specific T cells may then be administered to a patient using standard techniques.

[0597] Within other embodiments, hematological malignancy-related antigen-specific T cells, antibodies or antigen-binding fragments thereof as described herein may be used to remove cells expressing hematological malignancy-related antigen from a biological sample, such as autologous bone marrow, peripheral blood or a fraction of bone marrow or peripheral blood (e.g., CD34.sup.+ enriched peripheral blood (PB) prior to administration to a patient). Such methods may be performed by contacting the biological sample with such T cells, antibodies or antibody fragments under conditions and for a time sufficient to permit the reduction of hematological malignancy-related antigen expressing cells to less than 10%, preferably less than 5% and more preferably less than 1%, of the total number of myeloid or lymphatic cells in the bone marrow or peripheral blood. Such contact may be achieved, for example, using a column to which antibodies are attached using standard techniques. Antigen-expressing cells are retained on the column. The extent to which such cells have been removed may be readily determined by standard methods such as, for example, qualitative and quantitative PCR analysis, morphology, immunohistochemistry and FACS analysis. Bone marrow or PB (or a fraction thereof) may then be administered to a patient using standard techniques.

[0598] 4.26 Diagnostic Methods

[0599] In general, a hematological malignancy may be detected in a patient based on the presence of hematological malignancy-related antigen and/or polynucleotide in a biological sample (such as blood, sera, urine and/or tumor biopsies) obtained from the patient. In other words, hematological malignancy-related antigens may be used as a marker to indicate the presence or absence of such a malignancy. The binding agents provided herein generally permit detection of the level of antigen that binds to the agent in the biological sample. Polynucleotide primers and probes may be used to detect the level of mRNA encoding hematological malignancy-related antigen, which is also indicative of the presence or absence of a hematological malignancy. In general, hematological malignancy-related antigen should be present at a level that is at least three fold higher in a sample obtained from a patient afflicted with a hematological malignancy than in the sample obtained from an individual not so afflicted.

[0600] There are a variety of assay formats known to those of ordinary skill in the art for using a binding agent to detect polypeptide markers in a sample. See, e.g., Harlow and Lane, Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory, 1988. In general, the presence or absence of a hematological malignancy in a patient may be determined by (a) contacting a biological sample obtained from a patient with a binding agent; (b) detecting in the sample a level of polypeptide that binds to the binding agent; and (c) comparing the level of polypeptide with a predetermined cut-off value.

[0601] In a preferred embodiment, the assay involves the use of binding agent immobilized on a solid support to bind to and remove the polypeptide from the remainder of the sample. The bound polypeptide may then be detected using a detection reagent that contains a reporter group and specifically binds to the binding agent/polypeptide complex. Such detection reagents may comprise, for example, a binding agent that specifically binds to the polypeptide or an antibody or other agent that specifically binds to the binding agent, such as an anti-immunoglobulin, protein G, protein A or a lectin. Alternatively, a competitive assay may be utilized, in which a polypeptide is labeled with a reporter group and allowed to bind to the immobilized binding agent after incubation of the binding agent with the sample. The extent to which components of the sample inhibit the binding of the labeled polypeptide to the binding agent is indicative of the reactivity of the sample with the immobilized binding agent. Suitable polypeptides for use within such assays include full length hematological malignancy-related antigens and portions thereof to which the binding agent binds, as described above.

[0602] The solid support may be any material known to those of ordinary skill in the art to which the hematological malignancy-related antigen polypeptide may be attached. For example, the solid support may be a test well in a microtiter plate or a nitrocellulose or other suitable membrane. Alternatively, the support may be a bead or disc, such as glass, fiberglass, latex or a plastic material such as polystyrene or polyvinylchloride. The support may also be a magnetic particle or a fiber optic sensor, such as those disclosed, for example, in U.S. Pat. No. 5,359,681. The binding agent may be immobilized on the solid support using a variety of techniques known to those of skill in the art, which are amply described in the patent and scientific literature. In the context of the present invention, the term "immobilization" refers to both noncovalent association, such as adsorption, and covalent attachment (which may be a direct linkage between the agent and functional groups on the support or may be a linkage by way of a cross-linking agent). Immobilization by adsorption to a well in a microtiter plate or to a membrane is preferred. In such cases, adsorption may be achieved by contacting the binding agent, in a suitable buffer, with the solid support for a suitable amount of time. The contact time varies with temperature, but is typically between about 1 hour and about 1 day. In general, contacting a well of a plastic microtiter plate (such as polystyrene or polyvinylchloride) with an amount of binding agent ranging from about 10 ng to about 10 .mu.g, and preferably about 100 ng to about 1 .mu.g, is sufficient to immobilize an adequate amount of binding agent.

[0603] Covalent attachment of binding agent to a solid support may generally be achieved by first reacting the support with a bifunctional reagent that will react with both the support and a functional group, such as a hydroxyl or amino group, on the binding agent. For example, the binding agent may be covalently attached to supports having an appropriate polymer coating using benzoquinone or by condensation of an aldehyde group on the support with an amine and an active hydrogen on the binding partner (see, e.g., Pierce Immunotechnology Catalog and Handbook, 1991, at A12-A13).

[0604] In certain embodiments, the assay is a two-antibody sandwich assay. This assay may be performed by first contacting an antibody that has been immobilized on a solid support, commonly the well of a microtiter plate, with the sample, such that polypeptides within the sample are allowed to bind to the immobilized antibody. Unbound sample is then removed from the immobilized polypeptide-antibody complexes and a detection reagent (preferably a second antibody capable of binding to a different site on the polypeptide) containing a reporter group is added. The amount of detection reagent that remains bound to the solid support is then determined using a method appropriate for the specific reporter group.

[0605] More specifically, once the antibody is immobilized on the support as described above, the remaining protein binding sites on the support are typically blocked. Any suitable blocking agent known to those of ordinary skill in the art, such as bovine serum albumin or Tween 20.TM. (Sigma Chemical Co., St. Louis, Mo.). The immobilized antibody is then incubated with the sample, and polypeptide is allowed to bind to the antibody. The sample may be diluted with a suitable diluent, such as phosphate-buffered saline (PBS) prior to incubation. In general, an appropriate contact time (i.e., incubation time) is a period of time that is sufficient to detect the presence of polypeptide within a sample obtained from an individual with a hematological malignancy. Preferably, the contact time is sufficient to achieve a level of binding that is at least about 95% of that achieved at equilibrium between bound and unbound polypeptide. Those of ordinary skill in the art will recognize that the time necessary to achieve equilibrium may be readily determined by assaying the level of binding that occurs over a period of time. At room temperature, an incubation time of about 30 minutes is generally sufficient.

[0606] Unbound sample may then be removed by washing the solid support with an appropriate buffer, such as PBS containing 0.1% Tween 20.TM.. The second antibody, which contains a reporter group, may then be added to the solid support. Preferred reporter groups include those groups recited above.

[0607] The detection reagent is then incubated with the immobilized antibody-polypeptide complex for an amount of time sufficient to detect the bound polypeptide. An appropriate amount of time may generally be determined by assaying the level of binding that occurs over a period of time. Unbound detection reagent is then removed and bound detection reagent is detected using the reporter group. The method employed for detecting the reporter group depends upon the nature of the reporter group. For radioactive groups, scintillation counting or autoradiographic methods are generally appropriate. Spectroscopic methods may be used to detect dyes, luminescent groups and fluorescent groups. Biotin may be detected using avidin, coupled to a different reporter group (commonly a radioactive or fluorescent group or an enzyme). Enzyme reporter groups may generally be detected by the addition of substrate (generally for a specific period of time), followed by spectroscopic or other analysis of the reaction products.

[0608] To determine the presence or absence of a hematological malignancy, the signal detected from the reporter group that remains bound to the solid support is generally compared to a signal that corresponds to a predetermined cut-off value. In one preferred embodiment, the cut-off value for the detection of a hematological malignancy is the average mean signal obtained when the immobilized antibody is incubated with samples from patients without the malignancy. In general, a sample generating a signal that is three standard deviations above the predetermined cut-off value is considered positive for the malignancy. In an alternate preferred embodiment, the cut-off value is determined using a Receiver Operator Curve, according to the method of Sackett et al., Clinical Epidemiology: A Basic Science for Clinical Medicine, Little Brown and Co., 1985, p. 106-7. Briefly, in this embodiment, the cut-off value may be determined from a plot of pairs of true positive rates (i.e., sensitivity) and false positive rates (100%-specificity) that correspond to each possible cut-off value for the diagnostic test result. The cut-off value on the plot that is the closest to the upper left-hand corner (i.e., the value that encloses the largest area) is the most accurate cut-off value, and a sample generating a signal that is higher than the cut-off value determined by this method may be considered positive. Alternatively, the cut-off value may be shifted to the left along the plot, to minimize the false positive rate, or to the right, to minimize the false negative rate. In general, a sample generating a signal that is higher than the cut-off value determined by this method is considered positive for a malignancy.

[0609] In a related embodiment, the assay is performed in a flow-through or strip test format, wherein the binding agent is immobilized on a membrane, such as nitrocellulose. In the flow-through test, polypeptides within the sample bind to the immobilized binding agent as the sample passes through the membrane. A second, labeled binding agent then binds to the binding agent-polypeptide complex as a solution containing the second binding agent flows through the membrane. The detection of bound second binding agent may then be performed as described above. In the strip test format, one end of the membrane to which binding agent is bound is immersed in a solution containing the sample. The sample migrates along the membrane through a region containing second binding agent and to the area of immobilized binding agent. Concentration of second binding agent at the area of immobilized antibody indicates the presence of a hematological malignancy. Typically, the concentration of second binding agent at that site generates a pattern, such as a line, that can be read visually. The absence of such a pattern indicates a negative result. In general, the amount of binding agent immobilized on the membrane is selected to generate a visually discernible pattern when the biological sample contains a level of polypeptide that would be sufficient to generate a positive signal in the two-antibody sandwich assay, in the format discussed above. Preferred binding agents for use in such assays are antibodies and antigen-binding fragments thereof. Preferably, the amount of antibody immobilized on the membrane ranges from about 25 ng to about 1 .mu.g, and more preferably from about 50 ng to about 500 ng. Such tests can typically be performed with a very small amount of biological sample.

[0610] Of course, numerous other assay protocols exist that are suitable for use with the hematological malignancy-related antigen sequences or binding agents of the present invention. The above descriptions are intended to be exemplary only. For example, it will be apparent to those of ordinary skill in the art that the above protocols may be readily modified to use hematological malignancy-related antigen polypeptides to detect antibodies that bind to such polypeptides in a biological sample. The detection of hematological malignancy-related antigen-specific antibodies may correlate with the presence of a hematological.

[0611] A malignancy may also, or alternatively, be detected based on the presence of T cells that specifically react with hematological malignancy-related antigen in a biological sample. Within certain methods, a biological sample comprising CD4.sup.+ and/or CD8.sup.+ T cells isolated from a patient is incubated with a hematological malignancy-related antigen polypeptide, a polynucleotide encoding such a polypeptide and/or an APC that expresses such a polypeptide, and the presence or absence of specific activation of the T cells is detected. Suitable biological samples include, but are not limited to, isolated T cells. For example, T cells may be isolated from a patient by routine techniques (such as by Ficoll/Hypaque density gradient centrifugation of peripheral blood lymphocytes). T cells may be incubated in vitro for 2-9 days (typically 4 days) at 37.degree. C. with Mtb-81 or Mtb-67.2 polypeptide (e.g., 5-25 .mu.g/ml). It may be desirable to incubate another aliquot of a T cell sample in the absence of hematological malignancy-related antigen polypeptide to serve as a control. For CD4.sup.+T cells, activation is preferably detected by evaluating proliferation of the T cells. For CD8.sup.+ T cells, activation is preferably detected by evaluating cytolytic activity. A level of proliferation that is at least two fold greater and/or a level of cytolytic activity that is at least 20% greater than in disease-free patients indicates the presence of a hematological malignancy in the patient.

[0612] As noted above, a hematological malignancy may also, or alternatively, be detected based on the level of mRNA encoding hematological malignancy-related antigen in a biological sample. For example, at least two oligonucleotide primers may be employed in a polymerase chain reaction (PCR) based assay to amplify a portion of hematological malignancy-related antigen cDNA derived from a biological sample, wherein at least one of the oligonucleotide primers is specific for (i.e., hybridizes to) a polynucleotide encoding the hematological malignancy-related antigen protein. The amplified cDNA is then separated and detected using techniques well known in the art, such as gel electrophoresis. Similarly, oligonucleotide probes that specifically hybridize to a polynucleotide encoding hematological malignancy-related antigen may be used in a hybridization assay to detect the presence of polynucleotide encoding hematological malignancy-related antigen in a biological sample.

[0613] To permit hybridization under assay conditions, oligonucleotide primers and probes should comprise an oligonucleotide sequence that has at least about 60%, preferably at least about 75% and more preferably at least about 90%, identity to a portion of a polynucleotide encoding hematological malignancy-related antigen that is at least 10 nucleotides, and preferably at least 20 nucleotides, in length. Preferably, oligonucleotide primers and/or probes hybridize to a polynucleotide encoding a polypeptide described herein under moderately stringent conditions, as defined above. Oligonucleotide primers and/or probes which may be usefully employed in the diagnostic methods described herein preferably are at least 10-40 nucleotides in length. Techniques for both PCR based assays and hybridization assays are well known in the art (see, for example, Mullis et al., Cold Spring Harbor Symp. Quant. Biol., 51:263, 1987; Erlich ed., PCR Technology, Stockton Press, N Y, 1989).

[0614] One preferred assay employs RT-PCR, in which PCR is applied in conjunction with reverse transcription. Typically, RNA is extracted from a biological sample such as a biopsy tissue and is reverse transcribed to produce cDNA molecules. PCR amplification using at least one specific primer generates a cDNA molecule, which may be separated and visualized using, for example, gel electrophoresis. Amplification may be performed on biological samples taken from a test patient and from an individual who is not afflicted with a hematological malignancy. The amplification reaction may be performed on several dilutions of cDNA spanning two orders of magnitude. A two-fold or greater increase in expression in several dilutions of the test patient sample as compared to the same dilutions of the sample from a normal individual is typically considered positive.

[0615] In preferred embodiments, such assays may be performed using samples enriched for cells expressing the hematological malignancy-related antigen(s) of interest. Such enrichment may be achieved, for example, using a binding agent as provided herein to remove the cells from the remainder of the biological sample. The removed cells may then be assayed as described above for biological samples.

[0616] In further embodiments, hematological malignancy-related antigens may be used as markers for monitoring disease progression or the response to therapy of a hematological malignancy. In this embodiment, assays as described above for the diagnosis of a hematological malignancy may be performed over time, and the change in the level of reactive polypeptide(s) evaluated. For example, the assays may be performed every 24-72 hours for a period of 6 months to 1 year, and thereafter performed as needed. In general, a malignancy is progressing in those patients in whom the level of polypeptide detected by the binding agent increases over time. In contrast, the malignancy is not progressing when the level of reactive polypeptide either remains constant or decreases with time.

[0617] Certain in vivo diagnostic assays may be performed directly on a tumor. One such assay involves contacting tumor cells with a binding agent. The bound binding agent may then be detected directly or indirectly via a reporter group. Such binding agents may also be used in histological applications. Alternatively, polynucleotide probes may be used within such applications.

[0618] As noted above, to improve sensitivity, multiple markers may be assayed within a given sample. It will be apparent that binding agents specific for different proteins provided herein may be combined within a single assay. Further, multiple primers or probes may be used concurrently. The selection of markers may be based on routine experiments to determine combinations that results in optimal sensitivity.

[0619] Further diagnostic applications include the detection of extramedullary disease (e.g., cerebral infiltration of blasts in leukemias). Within such methods, a binding agent may be coupled to a tracer substance, and the diagnosis is performed in vivo using well known techniques. Coupled binding agent may be administered as described above, and extramedullary disease may be detected based on assaying the presence of tracer substance. Alternatively, a tracer substance may be associated with a T cell specific for hematological malignancy-related antigen, permitting detection of extramedullary disease based on assays to detect the location of the tracer substance.

[0620] 4.27 Exemplary Definitions

[0621] In accordance with the present invention, nucleic acid sequences include, but are not limited to, DNAs (including and not limited to genomic or extragenomic DNAs), genes, peptide nucleic acids (PNAs) RNAs (including, but not limited to, rRNAs, mRNAs and tRNAs), nucleosides, and suitable nucleic acid segments either obtained from native sources, chemically synthesized, modified, or otherwise prepared in whole or in part by the hand of man.

[0622] Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. Although any methods and compositions similar or equivalent to those described herein can be used in the practice or testing of the present invention, the preferred methods and compositions are described herein. For purposes of the present invention, the following terms are defined below:

[0623] A, an: In accordance with long standing patent law convention, the words "a" and "an" when used in this application, including the claims, denotes "one or more".

[0624] Expression: The combination of intracellular processes, including transcription and translation undergone by a polynucleotide such as a structural gene to synthesize the encoded peptide or polypeptide.

[0625] Promoter: a term used to generally describe the region or regions of a nucleic acid sequence that regulates transcription.

[0626] Regulatory Element: a term used to generally describe the region or regions of a nucleic acid sequence that regulates transcription.

[0627] Structural gene: A gene or sequence region that is expressed to produce an encoded peptide or polypeptide.

[0628] Transformation: A process of introducing an exogenous polynucleotide sequence (e.g., a vector, a recombinant DNA or RNA molecule) into a host cell or protoplast in which that exogenous nucleic acid segment is incorporated into at least a first chromosome or is capable of autonomous replication within the transformed host cell. Transfection, electroporation, and naked nucleic acid uptake all represent examples of techniques used to transform a host cell with one or more polynucleotides.

[0629] Transformed cell: A host cell whose nucleic acid complement has been altered by the introduction of one or more exogenous polynucleotides into that cell.

[0630] Transgenic cell: Any cell derived or regenerated from a transformed cell or derived from a transgenic cell, or from the progeny or offspring of any generation of such a transformed host cell.

[0631] Transgenic animal: An animal or a progeny or an offspring of any generation thereof that is derived from a transformed animal cell, wherein the animal's DNA contains an introduced exogenous nucleic acid molecule not originally present in a native, wild type, non-transgenic animal of the same species. The terms "transgenic animal" and "transformed animal" have sometimes been used in the art as synonymous terms to define an animal, the genetic contents of which has been modified to contain one or more exogenous nucleic acid segments.

[0632] Vector: A nucleic acid molecule, typically comprised of DNA, capable of replication in a host cell and/or to which another nucleic acid segment can be operatively linked so as to bring about replication of the attached segment. A plasmid, cosmid, or a virus is an exemplary vector.

[0633] The terms "substantially corresponds to", "substantially homologous", or "substantial identity" as used herein denotes a characteristic of a nucleic acid or an amino acid sequence, wherein a selected nucleic acid or amino acid sequence has at least about 70 or about 75 percent sequence identity as compared to a selected reference nucleic acid or amino acid sequence. More typically, the selected sequence and the reference sequence will have at least about 76, 77, 78, 79, 80, 81, 82, 83, 84 or even 85 percent sequence identity, and more preferably at least about 86, 87, 88, 89, 90, 91, 92, 93, 94, or 95 percent sequence identity. More preferably still, highly homologous sequences often share greater than at least about 96, 97, 98, or 99 percent sequence identity between the selected sequence and the reference sequence to which it was compared. The percentage of sequence identity may be calculated over the entire length of the sequences to be compared, or may be calculated by excluding small deletions or additions which total less than about 25 percent or so of the chosen reference sequence. The reference sequence may be a subset of a larger sequence, such as a portion of a gene or flanking sequence, or a repetitive portion of a chromosome. However, in the case of sequence homology of two or more polynucleotide sequences, the reference sequence will typically comprise at least about 18-25 nucleotides, more typically at least about 26 to 35 nucleotides, and even more typically at least about 40, 50, 60, 70, 80, 90, or even 100 or so nucleotides. Desirably, which highly homologous fragments are desired, the extent of percent identity between the two sequences will be at least about 80%, preferably at least about 85%, and more preferably about 90% or 95% or higher, as readily determined by one or more of the sequence comparison algorithms well-known to those of skill in the art, such as e.g., the FASTA program analysis described by Pearson and Lipman (1988).

[0634] The term "naturally occurring" as used herein as applied to an object refers to the fact that an object can be found in nature. For example, a polypeptide or polynucleotide sequence that is present in an organism (including viruses) that can be isolated from a source in nature and which has not been intentionally modified by the hand of man in a laboratory is naturally-occurring. As used herein, laboratory strains of rodents that may have been selectively bred according to classical genetics are considered naturally occurring animals.

[0635] As used herein, a "heterologous" is defined in relation to a predetermined referenced gene sequence. For example, with respect to a structural gene sequence, a heterologous promoter is defined as a promoter which does not naturally occur adjacent to the referenced structural gene, but which is positioned by laboratory manipulation. Likewise, a heterologous gene or nucleic acid segment is defined as a gene or segment that does not naturally occur adjacent to the referenced promoter and/or enhancer elements.

[0636] "Transcriptional regulatory element" refers to a polynucleotide sequence that activates transcription alone or in combination with one or more other nucleic acid sequences. A transcriptional regulatory element can, for example, comprise one or more promoters, one or more response elements, one or more negative regulatory elements, and/or one or more enhancers.

[0637] As used herein, a "transcription factor recognition site" and a "transcription factor binding site" refer to a polynucleotide sequence(s) or sequence motif(s) which are identified as being sites for the sequence-specific interaction of one or more transcription factors, frequently taking the form of direct protein-DNA binding. Typically, transcription factor binding sites can be identified by DNA footprinting, gel mobility shift assays, and the like, and/or can be predicted on the basis of known consensus sequence motifs, or by other methods known to those of skill in the art.

[0638] As used herein, the term "operably linked" refers to a linkage of two or more polynucleotides or two or more nucleic acid sequences in a functional relationship. A nucleic acid is "operably linked" when it is placed into a functional relationship with another nucleic acid sequence. For instance, a promoter or enhancer is operably linked to a coding sequence if it affects the transcription of the coding sequence. Operably linked means that the DNA sequences being linked are typically contiguous and, where necessary to join two protein coding regions, contiguous and in reading frame. However, since enhancers generally function when separated from the promoter by several kilobases and intronic sequences may be of variable lengths, some polynucleotide elements may be operably linked but not contiguous.

[0639] "Transcriptional unit" refers to a polynucleotide sequence that comprises at least a first structural gene operably linked to at least a first cis-acting promoter sequence and optionally linked operably to one or more other cis-acting nucleic acid sequences necessary for efficient transcription of the structural gene sequences, and at least a first distal regulatory element as may be required for the appropriate tissue-specific and developmental transcription of the structural gene sequence operably positioned under the control of the promoter and/or enhancer elements, as well as any additional cis sequences that are necessary for efficient transcription and translation (e.g., polyadenylation site(s), mRNA stability controlling sequence(s), etc.

[0640] As noted above, the present invention is generally directed to compositions and methods for using the compositions, for example in the therapy and diagnosis of cancer, such as hematological malignancy. Certain illustrative compositions described herein include hematological malignancy-related tumor polypeptides, polynucleotides encoding such polypeptides, binding agents such as antibodies, antigen presenting cells (APCs) and/or immune system cells (e.g., T cells). A "hematological malignancy-related tumor protein," as the term is used herein, refers generally to a protein that is expressed in hematological malignancy-related tumor cells at a level that is at least two fold, and preferably at least five fold, greater than the level of expression in a normal tissue, as determined using a representative assay provided herein. Certain hematological malignancy-related tumor proteins are tumor proteins that react detectably (within an immunoassay, such as an ELISA or Western blot) with antisera of a patient afflicted with hematological malignancy.

[0641] 4.28 Biological Functional Equivalents

[0642] Modification and changes may be made in the structure of the polynucleotides and peptides of the present invention and still obtain a functional molecule that encodes a peptide with desirable characteristics, or still obtain a genetic construct with the desirable expression specificity and/or properties. As it is often desirable to introduce one or more mutations into a specific polynucleotide sequence, various means of introducing mutations into a polynucleotide or peptide sequence known to those of skill in the art may be employed for the preparation of heterologous sequences that may be introduced into the selected cell or animal species. In certain circumstances, the resulting encoded peptide sequence is altered by this mutation, or in other cases, the sequence of the peptide is unchanged by one or more mutations in the encoding polynucleotide. In other circumstances, one or more changes are introduced into the promoter and/or enhancer regions of the polynucleotide constructs to alter the activity, or specificity of the expression elements and thus alter the expression of the heterologous therapeutic nucleic acid segment operably positioned under the control of the elements.

[0643] When it is desirable to alter the amino acid sequence of one or more of the heterologous peptides encoded by the expression construct to create an equivalent, or even an improved, second-generation molecules, the amino acid changes may be achieved by changing one or more of the codons of the encoding DNA sequence, according to Table 1.

[0644] For example, certain amino acids may be substituted for other amino acids in a protein structure without appreciable loss of interactive binding capacity with structures such as, for example, antigen-binding regions of antibodies or binding sites on substrate molecules. Since it is the interactive capacity and nature of a protein that defines that protein's biological functional activity, certain amino acid sequence substitutions can be made in a protein sequence, and, of course, its underlying DNA coding sequence, and nevertheless obtain a protein with like properties. It is thus contemplated by the inventors that various changes may be made in the peptide sequences of the disclosed compositions, or corresponding DNA sequences which encode said peptides without appreciable loss of their biological utility or activity.

TABLE-US-00001 TABLE 1 Amino Acids Codons Alanine Ala A GCA GCC CCC GCU Cysteine Cys C UGC UGU Aspartic acid Asp D GAC GAU Glutamic acid Glu E GAA GAG Phenylalanine Phe F UUC UUU Glycine Gly G GGA CCC GGG GGU Histidine His H CAC CAU Isoleucine Ile I AUA AUC AUU Lysine Lys K AAA AAG Leucine Leu L UUA UUG CUA CUC CUG CUU Methionine Met M AUG Asparagine Asn N AAC AAU Proline Pro P CCA CCC CCG CCU Glutamine Gln Q CAA CAG Arginine Arg R AGA AGG CGA CGC CGG CGU Serine Ser S AGC AGU UCA UCC UCG UCU Threonine Thr T ACA ACC ACG ACU Valine Val V GUA GUC GUG GUU Tryptophan Trp W UGG Tyrosine Tyr Y UAC UAU

[0645] In making such changes, the hydropathic index of amino acids may be considered. The importance of the hydropathic amino acid index in conferring interactive biologic function on a protein is generally understood in the art (Kyte and Doolittle, 1982, incorporate herein by reference). It is accepted that the relative hydropathic character of the amino acid contributes to the secondary structure of the resultant protein, which in turn defines the interaction of the protein with other molecules, for example, enzymes, substrates, receptors, DNA, antibodies, antigens, and the like. Each amino acid has been assigned a hydropathic index on the basis of their hydrophobicity and charge characteristics (Kyte and Doolittle, 1982), these are: isoleucine (+4.5); valine (+4.2); leucine (+3.8); phenylalanine (+2.8); cysteine/cystine (+2.5); methionine (+1.9); alanine (+1.8); glycine (-0.4); threonine (-0.7); serine (-0.8); tryptophan (-0.9); tyrosine (-1.3); proline (-1.6); histidine (-3.2); glutamate (-3.5); glutamine (-3.5); aspartate (-3.5); asparagine (-3.5); lysine (-3.9); and arginine (-4.5).

[0646] It is known in the art that certain amino acids may be substituted by other amino acids having a similar hydropathic index or score and still result in a protein with similar biological activity, i.e. still obtain a biological functionally equivalent protein. In making such changes, the substitution of amino acids whose hydropathic indices are within .+-.2 is preferred, those that are within .+-.1 are particularly preferred, and those within .+-.0.5 are even more particularly preferred. It is also understood in the art that the substitution of like amino acids can be made effectively on the basis of hydrophilicity. U.S. Pat. No. 4,554,101, incorporated herein by reference, states that the greatest local average hydrophilicity of a protein, as governed by the hydrophilicity of its adjacent amino acids, correlates with a biological property of the protein.

[0647] As detailed in U.S. Pat. No. 4,554,101, the following hydrophilicity values have been assigned to amino acid residues: arginine (+3.0); lysine (+3.0); aspartate (+3.0.+-.1); glutamate (+3.0.+-.1); serine (+0.3); asparagine (+0.2); glutamine (+0.2); glycine (0); threonine (-0.4); proline (-0.5.+-.1); alanine (-0.5); histidine (-0.5); cysteine (-1.0); methionine (-1.3); valine (-1.5); leucine (-1.8); isoleucine (-1.8); tyrosine (-2.3); phenylalanine (-2.5); tryptophan (-3.4). It is understood that an amino acid can be substituted for another having a similar hydrophilicity value and still obtain a biologically equivalent, and in particular, an immunologically equivalent protein. In such changes, the substitution of amino acids whose hydrophilicity values are within .+-.2 is preferred, those that are within .+-.1 are particularly preferred, and those within .+-.0.5 are even more particularly preferred.

[0648] As outlined above, amino acid substitutions are generally therefore based on the relative similarity of the amino acid side-chain substituents, for example, their hydrophobicity, hydrophilicity, charge, size, and the like. Exemplary substitutions which take several of the foregoing characteristics into consideration are well known to those of skill in the art and include: arginine and lysine; glutamate and aspartate; serine and threonine; glutamine and asparagine; and valine, leucine and isoleucine.

5. EXAMPLES

[0649] The following examples are included to demonstrate preferred embodiments of the invention. However, those of skill in the art should, in light of the present disclosure, appreciate that many changes can be made in the specific embodiments which are disclosed and still obtain a like or similar result without departing from the spirit and scope of the invention described in the appended claims.

5.1 Example 1

Identification of Hematological Malignancy-Related Antigen Polynucleotides

[0650] This Example illustrates the identification of hematological malignancy-related antigen polynucleotides from non-Hodgkin's lymphomas.

[0651] Hematological malignancy-related antigen polynucleotides were isolated by PCR-based subtraction. PolyA mRNA was prepared from T cell non-Hodgkin's lymphomas, B cell non-Hodgkin's lymphomas and normal tissues. Six cDNA libraries were constructed, PCR-subtracted and analyzed. Two libraries were constructed using pools of three T cell non-Hodgkin's lymphoma mRNAs (referred to herein as TCS libraries). Two others were constructed using pools of three B cell non-Hodgkin's lymphoma mRNAs (referred to herein as BCNHL libraries). Two other libraries were constructed using a pool of 2 Hodgkin's lymphoma mRNAs (referred to herein as HLS libraries. cDNA synthesis, hybridization and PCR amplification were performed according to Clontech's user manual (PCR-Select cDNA Subtraction), with the following changes: 1) cDNA was restricted with a mixture of enzymes, including MscI, PvuII, StuI and DraI, instead of the single enzyme RsaI; and 2) the ratio of driver to tester cDNA was increased in the hybridization steps (to 76:1) to give a more stringent subtraction.

[0652] The two TCS libraries were independently subtracted with different pools of driver cDNAs. Driver #1 contained cDNA prepared from specific normal tissues (lymph node, bone marrow, T cells, heart and brain), and this subtraction generated the library TCS-D1 (T cell non-Hodgkin's lymphoma subtracted library with driver #1). Driver #2 contained non-specific normal tissues (colon, large intestine, lung, pancreas, spinal cord, skeletal muscle, liver, kidney, skin and brain), and this subtraction generated the library TCS-D2 (T cell non-Hodgkin's lymphoma subtraction library with driver #2).

[0653] Similarly, the two BCNHL libraries were independently subtracted with different pools of driver cDNAs. Driver #1 contained cDNA prepared from specific normal tissues (lymph node, bone marrow, B cells, heart and brain), and this subtraction generated the library BCNHL/D1 (B cell non-Hodgkin's lymphoma subtracted library with driver #1). Driver #2 contained non-specific normal tissues (brain, lung, pancreas, spinal cord, skeletal muscle, colon, spleen, large intestine and PBMC), and this subtraction generated the library BCNHL/D2 (B cell non-Hodgkin's lymphoma subtraction library with driver #2).

[0654] The two HLS libraries were independently subtracted with different pools of driver cDNAs. Driver #1 contained cDNA prepared from specific normal tissues (lymph node, bone marrow, B cells and lung) and this subtraction generated HLS-D1 (Hodgkin's lymphoma subtraction library with driver #1). Driver #2 contained non-specific normal tissues (colon, large intestine, lung, pancreas, spinal cord, skeletal muscle, liver, kidney, skin and brain) and this generated the library HLS-D2 (Hodgkin's lymphoma subtraction library with driver #2).

[0655] To analyze the efficiency of the subtraction, actin (a housekeeping gene) was PCR amplified from dilutions of subtracted as well as unsubtracted PCR samples. Furthermore, the complexity and redundancy of each library was characterized by sequencing 96 clones from each of the PCR subtraction libraries (TCS-D1, TCS-D2, BCNHL/D1, BCNHL/D2, HLS-D1 and HLS-D2). These analyses indicated that the libraries are enriched for genes overexpressed in leukemia tissues and specifically T cell and B cell non-Hodgkin's lymphoma and M. Hodgkin's lymphoma samples.

[0656] Following PCR amplification, the cDNAs were cloned into the pCR2.1-TOPO plasmid vector (Invitrogen).

[0657] Sequences obtained from these analyses were searched against known sequences in the publicly available databases using the BLAST 2.0 release. The default BLAST parameters used were as follows: GAP PARAMETERS: Open Gap=0, Extended Gap=0; OUTPUT PARAMETERS: Expect=10.0, Threshold=0, Number of Alignments=250; For BLASTN, the search parameters were as follows: Mismatch=-3, Reward=1, Word size=0. The alignments were presented pair-wise, with a window percent identity=22. All available protein and nucleotide databases were searched, including, PIR, SwissPROT, GenBank, Mouse EST, Human EST, Other EST, Human repeat and high throughput sequences, and published patents and patent application database.

[0658] From these, a number of unique sequences were identified that represented novel polynucleotide sequences that had not previously been described in the GenBank and other sequence databases. A number of other sequences were identified that appeared to contain significant homology with one or more sequences previously identified in the databases, although they were described only as genomic or cDNA clones, and had no known function. The remaining sequences corresponded to known genes. The clones obtained from this analysis are summarized in Tables 2-6 in co-pending application U.S. Ser. No. 09/796,692.

5.2 Example 2

Analysis of Subtracted cDNA Sequences by Microarray Analysis

[0659] Subtracted cDNA sequences were analyzed by microarray analysis to evaluate their expression in hematological malignancies and normal tissues. Using this approach, cDNA sequences were PCR amplified and their mRNA expression profiles in hematological malignancies and normal tissues are examined using cDNA microarray technology essentially as described (Shena et al., 1995).

[0660] In brief, the clones identified from the subtracted cDNA libraries analyses were immobilized and arrayed onto glass slides as multiple replicas on microarray slides and the slides were hybridized with two different sets of probes, with each location on the microarray slide corresponding to a unique cDNA clone (as many as 5500 clones can be arrayed on a single slide, or chip). Each chip is hybridized with a pair of cDNA probes that are fluorescence-labeled with Cy3 and Cy5, respectively. The set of probes derived from the hematological malignancies was labeled with cy3 while the other set of probes derived from a pool of normal tissues was labeled with cy5. Typically, 1 .mu.g of polyA.sup.+ RNA was used to generate each cDNA probe. After hybridization, the chips were scanned and the fluorescence intensity recorded for both Cy3 and Cy5 channels. The difference in intensities (i.e., cy3/cy5 ratios) following hybridization with both probe sets provided the information on the relative expression level of each cDNA sequences immobilized on the slide in tumor versus normal tissues. There are multiple built-in quality control steps. First, the probe quality is monitored using a panel of ubiquitously expressed genes. Secondly, the control plate also can include yeast DNA fragments of which complementary RNA may be spiked into the probe synthesis for measuring the quality of the probe and the sensitivity of the analysis. This methodology provides a sensitivity of 1 in 100,000 copies of mRNA, and the reproducibility of the technology may be ensured by including duplicated control cDNA elements at different locations.

[0661] Analysis of hematological malignancy subtracted clones by microarray analyses on a variety of microarray chips identified the sequences set forth in SEQ ID NO: 1 through SEQ ID NO:668 of co-pending application U.S. Ser. No. 09/796,692 as being at least two-fold overexpressed in hematological malignancies versus normal tissues.

5.3 Example 3

Polynucleotide and Polypeptide Compositions: Brief Description of the cDNA Clones and Open Reading Frames Identified by Subtractive Hybridization and Microarray Analysis

[0662] Table 7 in co-pending application U.S. Ser. No. 09/796,692 lists the sequences of the polynucleotides obtained during the analyses of the present invention. Shown are the 668 polynucleotide sequences, along with their clone name identifiers, as well as the serial number and filing date of the priority provisional patent application in which the clone was first identified.

[0663] Table 8 in co-pending application U.S. Ser. No. 09/796,692 identifies the putative open reading frames obtained from analyses of the cDNA sequences obtained in SEQ ID NO:1-SEQ ID NO:668 in the co-pending application. Shown are the sequence identifiers, the clone name and translation frame, and the start and stop nucleotides in the corresponding DNA sequence used to generate the polypeptide sequence of the open reading frame.

[0664] Table 9 in co-pending application U.S. Ser. No. 09/796,692 identifies an additional set of particular hematological malignancy-related cDNA sequences that were obtained using the subtractive library and microarray methods as described above. These sequences, designated SEQ ID NO:2533-SEQ ID NO:9597 in the co-pending application, are shown in the Table along with the original clone name, and the serial number and filing date of the priority provisional application in which the clone was first described.

5.4 Example 4

Additional Analysis of cDNA Clones and Orfs Identified by Subtractive Hybridization and Microarray Analysis

[0665] This example describes microarray analysis of leukemia tumor- and tissue-specific cDNAs.

[0666] Microarray analysis identifies many potential genes that are overexpressed in specific tissues/tumors. However, these genes often represent known genes or genes that subsequently are found by RealTime PCR analysis to have a broader expression profile. This disclosure describes analyses which combine microarray analysis (CorixArray) and comparisons to public databases to identify and prioritize candidate sequences for RealTime analysis, thus allowing the identification of sequences with favorable expression profiles in a more efficient manner.

[0667] Clones are tested for overexpression in lymphoma tumor samples as compared to normal tissues using Corixa Leukemia/Lymphoma Chip#3 (LyC3). The analyzed clones are originally randomly picked from lymphoma PCR subtracted libraries: B-cell non-Hodgkin's lymphoma libraries (BCNHL/D1 and BCNHL/D2; CID000153); T-cell non-Hodgkin's lymphoma libraries (TCS-D1 and TCS-D2; CID000166); Hodgkin's lymphoma libraries (HLS-D1 and HLS-D2; CID000204 and CID000275) and a Clontech-generated T8 leukemia PCR subtracted library. A total of 5184 clones were arrayed: 2304 from BCNHL libraries, 288 from TCS libraries, 1344 from HLS libraries, and 960 from a Clontech-T8 library. In addition, a selection of 288 clones from the above libraries that had been identified from prior leukemia/lymphoma chips were re-analyzed on LyC3.

[0668] cDNA inserts for arraying are amplified by PCR using vector-specific primers. The arrays are probed with 43 probe pairs. Analysis is performed using CorixArray computational analysis. Analysis consisted of determining the ratio of the mean or median hybridization signal for a particular element (cDNA) using two sets of probes. The ratio is a reflection of the over- or under-expression of the element (cDNA) within the probe population. Probe groups are set up to identify elements (cDNAs) with high differential expression in probe group #1. Probe group #1 typically consists of 20 tumor RNAs, each probe representing a subset of lymphoma (e.g., B-cell non-Hodgkins lymphoma, T-cell non-Hodgkins lymphoma and Hodgkins lymphoma). Probes in group #2 include 16 essential and non-essential normal tissues (see, FIG. 4). A threshold (fold-overexpression in probe group #1) is set at 3.0. This threshold is set to identify elements with overexpression that could be reproducibly detected based on the quality of the chip. The sequences are sorted initially based on their CorixArray analysis, specifically on the basis of their mean signal 2 values.

[0669] Sequences having a mean signal 2<0.1 can be considered as sequences with low/no expression in normal tissues. Sequences having a mean signal 2 between 0.1 and 0.2 can be considered as clones with a potential for some expression in normal tissues. Sequences having a mean signal 2>0.2 and can be considered as clones that have the potential to have expression in some normal tissues.

5.5 Example 5

Identification of Candidate Genes with the Same Tissue Expression Profile as CD20 and CD52

[0670] This example identifies leukemia tumor and tissue-specific genes that have similar tissue expression profiles as CD20 and CD52 (FIG. 3). Antibodies against these two markers have been used for the therapy of hematological malignancies and other diseases associated with expression of these markers, i.e., FDA-approved Rituximab (anti-CD20 Ab) and Campath (anti-CD52 Ab). The similarity in gene expression between our candidate genes and CD20 and CD52 suggests that the described genes will also be useful as compounds for the diagnosis and therapy of hematological malignancies and other cancerous and non-cancerous diseases associated with expression of one or more of the described antigens.

[0671] RealTime PCR was used to compare the expression profiles of the candidate genes with the expression profiles of CD20 and CD52. FIG. 8 illustrates the expression of the candidate genes in hematopoietic subsets and hematological malignancies. Data summarized in this sheets shows that using a combination of the PCR subtracted cDNA libraries, microarray analyses, and RealTime PCR, it is possible to identify genes differentially expressed in normal B-cells, lymphomas, myeloma, chronic lymphocytic leukemia, and acute myeloid leukemia.

5.6 Example 6

Analysis of Ly1484 (SEQ ID NOS:16-18 and 120-121), One of the Genes with a Similar Expression Profile as CD20 and CD52

[0672] This example illustrates the typical procedure used to identify antigens for use as therapeutics, diagnostics, etc. and preferred methods for developing therapeutics and diagnostics for leukemia/lymphoma diseases. First, candidate genes highly enriched in leukemia/lymphoma cells are identified using PCR subtraction library cloning. Next, subtracted cDNA sequences are analyzed by microarray analysis to evaluate their expression in hematological malignancies and normal tissues. Since microarray analysis often identifies genes that represent known genes or genes that subsequently are found by RealTime analysis to have broader expression profiles, the microarray analysis is combined with comparisons to public databases to identify and prioritize candidate sequences for analysis. Next, RealTime PCR is used to analyze the expression profiles in various hematological subsets. In some cases, further analysis is focused on antigens with expression profiles similar to known therapeutics. For these genes, structural prediction programs are used to identify transmembrane domains, antigen-specific CTL are generated using human in vitro priming, and humanized monoclonal and polyclonal antibodies are generated as reagents for the diagnosis and therapy of malignancies and autoimmune disorders associated with antigen expression.

[0673] Ly1484P PCR subtraction library clone sequences matched the Genbank clone KIAA1607 (acc. no. XM033378, XM033379 and AB046827) and FJL00111 (acc. no. AK024502). Overexpression of Ly1484 was documented by microarray analyses and RealTime PCR. Ly1484P is overexpressed in B cell neoplasms, while expression in normal tissues is restricted to normal B-cells.

[0674] A full length sequence of candidate Ly1484P was obtained using the Genbank database. Ly1484P was mapped to human chromosome 10. There is both a long and short version of Ly1484P (long version--SEQ ID NO: 120; short version--SEQ ID NO:121). TMpred analysis of Ly1484P indicates that this protein contains a transmembrane domain. Using the TSITES program, T-helper epitopes have also been identified (FIGS. 7 & 8). Polypeptides have been generated and are being used to generate antibodies that are specific for Ly1484P. These humanized monoclonal antibodies may be used (conjugated or unconjugated) for the diagnosis and therapy of malignancies and autoimmune disorders associated with Ly1484 expression.

5.7 Example 7

Identification of Transmembrane Domains

[0675] Structural prediction programs known to those of skill in the art were used to identify transmembrane domains in the candidate antigens described herein.

[0676] For Ly1728P, amino acid residues 6-22, 47-65, 227-243, and 228-245 of SEQ ID NO:2 were identified as putative transmembrane domains.

[0677] For Ly1732P, amino acid residues 60-76 and 55-76 of SEQ ID NO:4 were identified as putative transmembrane domains.

[0678] For Ly1888P, amino acids 1-22, 3-21, 52-68, 254-273, 252-273, and 256-272 of SEQ ID NO: 6 were identified as putative transmembrane domains.

[0679] For Ly1452P amino acids 354-375, 355-377 and 425-441 of SEQ ID NO: 10 (splice variant 1) were identified as putative transmembrane domains and amino acids 354-375, 355-377 and 425-441 of SEQ ID NO: 12 (splice variant 2) were identified as putative transmembrane domains.

[0680] Ly1462P, amino acids 2-23, 3-21, 369-385, 976-999, 977-993, and 979-1000 of SEQ ID NO: 15 were identified as putative transmembrane domains.

[0681] For Ly1484P, amino acids 51-67, 322-338, 666-682, 736-752, 1078-1094, 24-43, 53-69, 118-136, 319-335, 730-752, 1586-1602, 48-69, 88-109, 114-135, 196-217, 300-321, 323-344, 389-410, 502-523, 659-680, 714-735, 1076-1097, 1158-1179, and 1321-1342 of SEQ ID NO: 18 were identified as putative transmembrane domains, amino acids 10-86, 63-84, 118-139, 480-501, 562-583, 725-746, 70-86, 113-129, 134-156, 280-296, 481-497, 560-577, 653-674, 721-738, 734-752, 833-869, 879-895, 990-1006, 1023-1048, 1070-1087, 1112-1138, 1135-1170, and 1146-1170 of SEQ ID NO: 120 were identified as putative transmembrane domains, amino acids 94-129, 102-123, 367-383, 394-423, 447-464, and 493-513, of SEQ ID NO: 121 were identified as putative transmembrane domains.

[0682] For Ly1486P, amino acids 24-40 and 24-45 of SEQ ID NO: 21 were identified as putative transmembrane domains.

[0683] For Ly1693P, amino acids 47-63, 80-96, 114-130, 158-174, 207-223, 237-253, 289-305, 117-134, 144-160, 167-184, 516-535, 44-65, 85-106, 111-132, 150-171, 204-225, 242-263, and 290-311 of SEQ ID NO: 26 were identified as putative transmembrane domains.

[0684] For Ly1715P, amino acids 38-54, 38-56, and 34-55 of SEQ ID NO: 29 were identified as putative transmembrane domains.

[0685] For Ly1727P, amino acids 68-100, 214-234, 304-320, 84-105, 217-238, and 302-323 of SEQ ID NO: 32 were identified as putative transmembrane domains.

[0686] For Ly1905P, amino acids 68-100, 214-234, 84-105, and 217-238 of SEQ ID NO: 40 were identified as putative transmembrane domains.

[0687] For Ly1885P, amino acids 218-234, 219-235, 626-654, and 219-240 of SEQ ID NO: 35 were identified as putative transmembrane domains.

[0688] For Ly663S, amino acids 22-38, 41-57, 60-76, 92-108, 242-258, 20-38, 23-57, 60-77, 86-102, 241-257, 14-35, 89-110, and 248-269 of SEQ ID NO: 43 were identified as putative transmembrane domains.

[0689] For Ly664S, amino acids 11-27, 15-31, 74-93, 209-227, 8-29, and 67-88 of SEQ ID NO: 45 were identified as putative transmembrane domains.

[0690] For Ly667S, amino acids 13-31, 139-157, 184-202, 231-247, 329-351, 435-451, 473-490, 609-626, 685-705, 688-704, 7-28, 233-254, and 685-706 of SEQ ID NO: 48 were identified as putative transmembrane domains.

[0691] For Ly677S, amino acids 149-165, 10-30, 144-165, 7-28, and 144-165 of SEQ ID NO: 54 were identified as putative transmembrane domains.

[0692] For Ly1891P, amino acids 31-47, 66-82, 93-109, 128-144, 171-187, 205-221, 242-258, 31-48, 64-82, 94-111, 120-145, 170-187, 205-221, 244-266, and 29-50, 63-84, 93-114, 124-145, 168-189, 206-227, and 241-262 of SEQ ID NO: 56 were identified as putative transmembrane domains.

[0693] For CD138, amino acids 255-271, 4-21, 258-276, 6-27, and 256-277 of SEQ ID NO: 58 were identified as putative transmembrane domains.

[0694] For CD22, amino acids 688-704, 3-19, 29-46, 157-174, 185-201, 349-366, 386-406, 479-505, 688-709, 1-22, 159-180, and 689-710 of SEQ ID NO: 61 were identified as putative transmembrane domains.

[0695] For CD79beta, amino acids 161-177, 4-29, 59-78, 160-181, 4-25, and 161-180 of SEQ ID NO: 66 were identified as putative transmembrane domains.

[0696] For Ly1454P, amino acids 5-22, 231-249, 7-28, and 425-446 of SEQ ID NO: 74 were identified as putative transmembrane domains. 0.16561 For Ly1485P, amino acid residues 2-19 of SEQ ID NO:76 were identified as a putative transmembrane domain.

[0697] Ly1500P, amino acid residues 10-31 and 327-344 of SEQ ID NO: 80 (splice variant 1) were identified as putative transmembrane domains, amino acid residues 13-38, 71-92, and 388-405 of SEQ ID NO: 82 (splice variant 2) were identified as putative transmembrane domains, and amino acids 25-46 and 341-359 of SEQ ID NO: 84 (splice variant 3) were identified as putative transmembrane domains.

[0698] For Ly1516P, amino acids 142-158, 177-193, 207-223, 238-254, 271-287, 142-158, 177-193, 207-223, 238-254, and 271-287 of SEQ ID NO:87 were identified as putative transmembrane domains.

[0699] For Ly1729P, amino acids 420-436 431-437, and 412-433 of SEQ ID NO:101 were identified as putative transmembrane domains.

[0700] For Ly1859P, amino acid residues 128-144, 293-311, 408-425, 435-454, 465-483, 516-533, 290-311; 435-456, and 507-528 of SEQ ID NO 107 were identified as putative transmembrane domains.

[0701] For Ly1866P, amino acids 47-65 and 50-71 of SEQ ID NO: 109 were identified as putative transmembrane domains.

[0702] For Ly669S, amino acids 489-505, 13-29, 38-57, 73-89, 94-114, 252-268, 307-324, 329-346, 489-509, 4-25, and 486-507 of SEQ ID NO:114 were identified as putative transmembrane domains.

[0703] For Ly672S, amino acids 11-27, 284-300, 325-341, 345-361, 407-423, 7-28, 102-118, 174-198, 283-299, 325-341, 347-383, 403-423, 431-454, 473-492, 11-32, 286-307, 322-343, 345-366, 404-425, 430-451, and 469-490 of SEQ ID NO: 117 were identified as putative transmembrane domains.

[0704] For Ly675S, amino acids 154-170, 187-203, 428-444, 518-534, 846-862, 81-97, 155-172, 235-251, 374-391, 428-444, 477-195, 520-542, 539-573, 694-714, 807-823, 843-862, 50-71, 77-98, 145-166, 518-539, 802-823, and 845-866 of SEQ ID NO:119 were identified as putative transmembrane domains.

5.8 Example 8

Realtime PCR Analysis to Identify Antigens Overexpressed in Chronic Lymphocytic Leukemia and Multiple Myeloma

[0705] Overexpression of candidate antigens in chronic lymphocytic leukemia (CLL) and multiple myeloma (MM) was confirmed by RealTime PCR.

[0706] Real-time PCR evaluates the level of PCR product accumulation during amplification (see, e.g., Gibson et al., Genome Research 6:995-1001 (1996); Heid et al., Genome Research 6:986-994 (1996)). RealTime PCR permits quantitative evaluation of mRNA levels in multiple samples. Briefly, mRNA is extracted from tumor and normal tissue and cDNA is prepared using standard techniques. Real-time PCR is performed, for example, using a Perkin Elmer/Applied Biosystems (Foster City, Calif.) 9700 Prism instrument. Matching primers are designed for genes of interest using, for example, the primer express program provided by Perkin Elmer/Applied Biosystems (Foster City, Calif.). Optimal concentrations of primers and probes are initially determined by those of ordinary skill in the art, and control (e.g., ft-actin) primers and probes are obtained commercially from, for example, Perkin Elmer/Applied Biosystems (Foster City, Calif.). To quantitate the amount of specific RNA in a sample, a standard curve is generated using a plasmid containing the gene of interest. Standard curves are generated using the Ct values determined in the real-time PCR, which are related to the initial cDNA concentration used in the assay. Standard dilutions ranging from 10-10.sup.6 copies of the gene of interest are generally sufficient. In addition, a standard curve is generated for the control sequence. This permits standardization of initial RNA content of a tissue sample to the amount of control for comparison purposes.

Summary of Results

TABLE-US-00002 [0707] Antigen CLL MM Ly1728P no yes Ly1732P yes yes Ly1888P yes no Ly 1452P yes no Ly1462P yes no Ly1484P not determined not determined Ly1486P yes no Ly1677P yes no Ly1682P yes no Ly1693P yes yes Ly1697P yes no Ly1715P yes yes Ly1727P yes yes Ly1905P yes yes Ly1885P yes yes Ly663S yes yes Ly664S yes yes Ly667S yes yes Ly677S yes yes Ly1891P not determined not determined CD138 no yes CD22 yes no CD79beta yes yes Ly1450P not determined not determined Ly1451P yes yes Ly1454P yes not determined Ly1485P not determined not determined Ly1500P yes no Ly1516P not determined not determined Ly1678P yes yes Ly1680P yes no Ly1686P yes no Ly1687P yes no Ly1706P yes yes Ly1712P yes yes Ly1729P yes yes Ly1848P yes yes Ly1859P yes no Ly1866P yes yes Ly1867P yes no Ly1868P yes no Ly1886P yes no Ly669S yes yes Ly672S yes yes Ly675S yes yes

[0708] These sequences can conveniently be used to diagnose, treat, and prevent malignant diseases that overexpress these genes, including multiple myeloma, B-cell lymphomas, and B-CLL. For example, monoclonal antibodies, including humanized monoclonal antibodies can be used for diagnosis and therapy of disorders associated with expression of antigens overexpressed in hematological malignancies.

5.9 Example 9

Sequence Analyses, Expression Analyses, and Structure Analyses of Other Antigens with Similar Expression Profiles as CD20 & CD52

Summary of Results

TABLE-US-00003 [0709] Antigen Sequence Analysis Ly1728P FOAP-2 ("novel gene over-expressed in macrophages") Ly1732P B-cell maturation factor (BCM), tumor necrosis factor receptor superfamily, member 17. BCM bins to TALL-1, a member of the TNF family. Ly1888P anti-Fas-induced apoptosis protein (TOSO); experimentally shown to be expressed on the cell surface. Ly 1452P anti-Fas-induced apoptosis protein (TOSO); experimentally shown to be expressed on the cell surface. Ly1462P Human Epstein-Barr virus complement receptor type II Ly1484P KIAA1607 (cDNA sequence present in GenBank). Ly1486P Fc fragment of IgE, low affinity II receptor. Ly1677P novel Ly1682P novel Ly1693P Chemokine receptor CXCR4 Ly1697P novel Ly1715P lectin-like NK cell receptor Ly1727P Splice variants of the hpim-2 gene (homologs of the mouse pim-2 oncogene). Predicted to be a serine threonine kinase with a role in cell proliferation. Ly1905P Splice variants of the hpim-2 gene (homologs of the mouse pim-2 oncogene). Predicted to be a serine threonine kinase with a role in cell proliferation. Ly1885P An apparent splice form of the cell cycle progression 8 protein: one of a family of proteins involved in restoration of cell cycle progression (by blocking arrest in G1 phase). Ly663S leukocyte surface antigen CD37 Ly664S protein with unknown function that shares 30-40% identity with various thioredoxins Ly667S Semaphorin B (semaphorins are a large family of secreted and transmembrane proteins; setna domains occur in the hepatocyte growth factor receptor) Ly677S leukocyte surface recptor CD79A Ly1891P orphan G-protein coupled receptor (GPRC5D) CD138 CD138 CD22 CD22 CD79beta CD79beta Ly1450P novel (matches GenBank seq FLJ23202; no ORF) Ly1451P novel (matches GenBank seq FLJ39358 no ORF) Ly1454P novel (matches GenBank seq FLJ40597; ORF) Ly1485P novel (matches genomic DNA only) Ly1500P BANK protein: B-cell scaffold protein with ankyrin repeats; a substrate for tyrosine phosphorylation upon B-cell antigen receptor stimulation. Ly1516P Rh-related antigen CD47, a signal transducer integrin- associated protein Ly1678P novel (matches genomic DNA only) Ly1680P novel (matches genomic DNA only) Ly1686P novel (matches genornic DNA only) Ly1687P novel (matches genomic DNA only) Ly1706P novel (matches genomic DNA only) Ly1712P novel Ly1729P Hematopoietic cell-specific Lyn substrate 1 (HCLS1) protein; t N-erminal half hs repeats with significant identity to a helix-turn-helix DNA binding motif; the C-terminal half is similar to domains that act as substrates for protein tyrosine kinases suggesting that this protein may be involved in signal transduction and regulation of gene expression. Ly1848P novel (matches genomic DNA only) Ly1859P novel (matches GenBank seq FLJ00140; ORF) Ly1866P Matches a gene in GenBank referred to as "similar to hypothetical protein PR01722." Ly1867P novel (matches genomic DNA only) Ly1868P novel (matches genomic DNA only) Ly1886P novel (matches genomic DNA only) Ly669S intercellular adhesion molecule 3 (ICAM3) Ly672S cisplatin resistance related protein CRR9p Ly675S KIAA0906 (partial cDNA/protein sequences present in GenBank).

5.10 Example 10

Sequence Analysis of Ly1451

[0710] The Ly1451 (SEQ ID NOS:69-71 and 124) sequence derived from a lymphoma PCR subtraction library clone was used to query several public databases, including GenBank and GenSeq. No matches (>90 identity) were detected for the 5'-proximal 51 bp suggesting that this sequence may contain a repeat element. A BLASTN search of the LifeSeq database (Incyte) identified a 980 bp template (template #1076101.8; SEQ ID NO:124 that contained all 240 bp of Ly1451. This template consisted of sequences from 6 clones, of which 2 (33%) were derived from hematologic/immune tissue libraries. Template #1076101.8 was part of a bin containing 11 templates derived from a total of 104 clones, of which 12 (9%) were derived from hematologic/immune tissue libraries.

[0711] This sequence (SEQ ID NO: 124) was used to search further public databases but no additional sequences were obtained. However, these searches indicate this sequence is a human endogenous retroviral sequence (HERV) encoding polypeptides corresponding to portions of the integrase and envelope genes. A single ORF with an ATG translational start site is contained in the forward read of LS1076101.8

[0712] The polypeptide encoded by this ORF (SEQ ID NO:124) is not predicted to have a transmembrane domain.

5.11 Example 11

Expression of Ly1452 Lymphoma Antigens Encoded by a Specific Gene, Ly1452, Associated with B Cell Leukemias, Lymphomas and Multiple Myelomas

[0713] Recombinantly expressed Ly1452 antigens were constructed to allow for quick and easy purification of the protein.

[0714] The open reading frame for Ly1452 was PCR amplified and subcloned into a modified pET28 vector with a His tag in-frame and recombinantly expressed in E. coli (His-Ly1452: SEQ ID NO:10,482 (nt), SEQ ID NO:10,483 (protein).

Ly1452P expression in E. coli

[0715] The open reading frame of the LS coding region was PCR amplified with the following primers:

TABLE-US-00004 PDM-797 (SEQ ID NO: 10,975) 5'gtgtcacaatctacagtcaggcaggattctcc 3' Tin 64.degree. C. PDM-799 (SEQ ID NO: 10,976) 5'gttatgtagcggccgcttatcatgttgctgcagag 3' Tm 67.degree. C.

[0716] Using the following conditions:

10 .mu.l 10.times. Herculase buffer 1 .mu.l 10 mM dNTPs 2 .mu.l 10 .mu.M each oligo 83 .mu.l sterile water 1.0 .mu.l Herculase DNA polymerase (Stratagene, La Jolla, Calif.)

50 .eta.g DNA

TABLE-US-00005 [0717] 98.degree. C. 3 minutes 98.degree. C. 40 seconds 60.degree. C. 30 seconds 72.degree. C. 2 minute .times. 10 cycles 98.degree. C. 40 seconds 60.degree. C. 30 seconds 72.degree. C. 2 minutes 30 seconds .times. 10 cycles 98.degree. C. 40 seconds 60.degree. C. 30 seconds 72.degree. C. 3 minutes .times. 10 cycles 98.degree. C. 40 seconds 60.degree. C. 30 seconds 72.degree. C. 3 minutes 30 seconds .times. 10 cycles 72.degree. C. 4 minutes

[0718] The PCR product was digested with Xho I and cloned into pPDM His (a modified pET28 vector with a His tag in frame on the 5' end) that had been digested with Eco72I and XhoI. Construct was confirmed through sequence analysis and transformed into BLR (DE3) pLysS and HMS 174 pLys S cells.

[0719] Recombinant proteins are also expressed without a histidine tag or with other lymphoma antigens. They are also expressed in other vectors, including other E. coli constructs, Baculovirus, yeast, and mammalian expression vectors. This recombinant antigen can be used to make polyclonal and monoclonal antibodies or used in immunological assays.

6. REFERENCES

[0720] The following references, to the extent that they provide exemplary procedural or other details supplementary to those set forth herein, are specifically incorporated herein by reference. [0721] U.S. Pat. No. 3,817,827. [0722] U.S. Pat. No. 3,850,752. [0723] U.S. Pat. No. 3,901,654. [0724] U.S. Pat. No. 3,935,074. [0725] U.S. Pat. No. 3,984,533. [0726] U.S. Pat. No. 3,996,345. [0727] U.S. Pat. No. 4,034,074. [0728] U.S. Pat. No. 4,098,876. [0729] U.S. Pat. No. 4,235,877. [0730] U.S. Pat. No. 4,376,110. [0731] U.S. Pat. No. 4,429,008. [0732] U.S. Pat. No. 4,436,727. [0733] U.S. Pat. No. 4,452,901. [0734] U.S. Pat. No. 4,489,710. [0735] U.S. Pat. No. 4,507,234. [0736] U.S. Pat. No. 4,554,101. [0737] U.S. Pat. No. 4,569,789. [0738] U.S. Pat. No. 4,603,112. [0739] U.S. Pat. No. 4,625,014. [0740] U.S. Pat. No. 4,638,045. [0741] U.S. Pat. No. 4,671,958. [0742] U.S. Pat. No. 4,673,562. [0743] U.S. Pat. No. 4,683,195. [0744] U.S. Pat. No. 4,699,784. [0745] U.S. Pat. No. 4,735,792. [0746] U.S. Pat. No. 4,751,180. [0747] U.S. Pat. No. 4,769,330. [0748] U.S. Pat. No. 4,777,127. [0749] U.S. Pat. No. 4,866,034. [0750] U.S. Pat. No. 4,873,088. [0751] U.S. Pat. No. 4,877,611. [0752] U.S. Pat. No. 4,897,268. [0753] U.S. Pat. No. 4,912,094. [0754] U.S. Pat. No. 4,935,233. [0755] U.S. Pat. No. 5,017,487. [0756] U.S. Pat. No. 5,075,109. [0757] U.S. Pat. No. 5,145,684. [0758] U.S. Pat. No. 5,151,254. [0759] U.S. Pat. No. 5,215,926. [0760] U.S. Pat. No. 5,240,856. [0761] U.S. Pat. No. 5,350,840. [0762] U.S. Pat. No. 5,359,681. [0763] U.S. Pat. No. 5,399,346. [0764] U.S. Pat. No. 5,399,363. [0765] U.S. Pat. No. 5,466,468. [0766] U.S. Pat. No. 5,472,869. [0767] U.S. Pat. No. 5,543,158. [0768] U.S. Pat. No. 5,552,157. [0769] U.S. Pat. No. 5,565,213. [0770] U.S. Pat. No. 5,567,434. [0771] U.S. Pat. No. 5,633,234. [0772] U.S. Pat. No. 5,641,515. [0773] U.S. Pat. No. 5,738,868. [0774] U.S. Pat. No. 5,741,516. [0775] U.S. Pat. No. 5,795,587. [0776] U.S. Pat. No. 5,811,128. [0777] U.S. Pat. No. 5,814,344. [0778] U.S. Pat. No. 5,820,883. [0779] U.S. Pat. No. 5,853,763. [0780] U.S. Pat. No. 5,874,265. [0781] U.S. Pat. No. 5,928,647. [0782] U.S. Pat. No. 5,942,252. [0783] U.S. Pat. No. 6,110,702. [0784] European Patent No. EP 0,345,242. [0785] Great Britain Patent No. GB 2,200,651. [0786] Intl. Pat. Appl. Publ. No. WO 89/01973. [0787] Intl. Pat. Appl. Publ. No. WO 89/06280. [0788] Intl. Pat. Appl. Publ. No. WO 91/02805. [0789] Intl. Pat. Appl. Publ. No. WO 91/16116. [0790] Intl. Pat. Appl. Publ. No. WO 92/07243. [0791] Intl. Pat. Appl. Publ. No. WO 94/00153. [0792] Intl. Pat. Appl. Publ. No. WO 94/20078. [0793] Intl. Pat. Appl. Publ. No. WO/94/23701. [0794] Intl. Pat. Appl. Publ. No. WO 95/17210. [0795] Intl. Pat. Appl. Publ. No. WO 96/02555. [0796] Intl. Pat. Appl. Publ. No. WO 96/06638. [0797] Intl. Pat. Appl. Publ. No. WO 96/30516. [0798] Intl. Pat. Appl. Publ. No. WO 96/33739. [0799] Intl. Pat. Appl. Publ. No. WO 97/24447. [0800] Intl. Pat. Appl. Publ. No. WO 99/33488. [0801] Aaroston and Todaro, J. Cell. Physiol., 72:141-48, 1968. [0802] Adelman et al., DNA, 2:183, 1983. [0803] Amin et al., Am. J. Pathol., 146:344-56, 1995. [0804] Akaza et al., "Expression of antitumor response. Role of attachment and viability of bacillus Calmette-Guerin to bladder cancer cells," Cancer, 72:558-63, 1993. [0805] American Type Culture Collection, Catalogue of Cell Lines and Hybridomas, 7th ed., 1992. [0806] Avrameas, "Natural autoantibodies: Self-recognition and physiological autoimmunity," In: Natural autoantibodies: Their Physiological Role and Regulatory Significance, Shoenfeld and Isenberg (eds.), CRC Press, Boca Raton, Fla., pp. 1-14, 1993. [0807] Azuma et al., "Correlation Between Augmented Resistance to Influenza Virus Infection and Histological Changes in Lung of Mice Treated with Trehalose-6,6'-dimycolate," Journal of Biological Response Modifiers, 7:473-82, 1988. [0808] Bajorin et al., Proc. Annu. Meet. Am. Soc. Clin. Oncol., 7:A967, 1988. [0809] Baker et al., "Ability of Monophosphoryl Lipid A To Augment the Antibody Response of Young Mice," Infection and Immunity, 56:3064-66, 1988a. [0810] Baker et al., "Enrichment of Suppressor T Cells by Means of Binding to Monophosphoryl Lipid A," Infection and Immunity, 58:726-31, 1990. [0811] Baker et al., "Inactivation of Suppressor T-Cell Activity by Nontoxic Monophosphoryl Lipid A," Infection and Immunity, 56:1076-83, 1988b. [0812] Baker et al., "Molecular structures that influence the immunomodulatory properties of the lipid A and inner core region oligosaccharides of bacterial lipopolysaccharides," Infection Immunity, 62:2257-69, 1994. [0813] Baker et al., "Structural Features That Influence the Ability of Lipid A and Its Analogs To Abolish Expression of Suppressor T Cell Activity," Infection and Immunity, 60:2694-701, 1992. [0814] Bakker et al., "Melanocyte lineage-specific antigen gp100 is recognized by melanoma-derived tumor-infiltrating lymphocytes," J. Exp. Med., 179:1005, 1994. [0815] Banchereau and Steinman, Nature, 392:245-51, 1998. [0816] Banerji et al., "Membrane lipid composition modulates the binding specificity of a monoclonal antibody against liposomes," Biochim. Biophys. Acta., 689:319-26, 1982. [0817] Barnd et al., "Specific tumor histocompatibility complex-unrestricted recognition of tumor-associated mucins by human cytotoxic T cells," Proc. Natl. Acad. Sci. USA, 86:7159, 1989. [0818] Barnoud et al., Am. J. Surg. Pathol., 24:830-36, 2000). [0819] Bartlett and Zbar, J. Natl. Cancer Inst., 48:1709, 1972. [0820] Bast et al., "BCG and Cancer," N. Engl. J. Med., 290:1413-20, 1974. [0821] Bennett et al., "Endogenous Production of Cytotoxic Factors in Serum of BCG-Primed Mice by Monophosphoryl Lipid A, a Detoxified Form of Endotoxin," Journal of Biological Response Modifiers, 7:65-76, 1988 [0822] Berkner, Biotechniques, 6:616-27, 1988. [0823] Berra et al., Int. J. Cancer, 36:363-66, 1985. [0824] Berra et al., J. Neurochem., 40:777-82, 1983. [0825] Bogoch, "Demonstration of serum precipitin to brain gangliosides," Nature, 183:392-93, 1960. [0826] Bouchon et al., Biochem. Internatl., 10:531-38, 1985. [0827] Bowen-Pope et al., Proc. Nat'l Acad. Sci. USA, 81:2396-400, 1984. [0828] Bowness et al., "Clostridium perfringens enterotoxin is a superantigen reactive with human T cell receptors V beta 6.9 and V beta 22," J. Exp. Med., 176:893-96, 1992. [0829] Brade et al., "An Artificial Glycoconjugate Containing the Bisphosphorylated Glucosamine Disaccharide Backbone of Lipid A Binds Lipid A Monoclonal Antibodies," Infection and Immunity, 61:4514-17, 1993. [0830] Brichard et al., "The tyrosinase gene codes for an antigen recognized by autologous cytolytic T lymphocytes on HLA-A2 melanomas," J. Exp. Med., 178:489, 1993. [0831] Brodin et al., "Mouse monoclonal antibodies with specificity for the melanoma-associated ganglioside disialyllactosyl ceramide (GD3) also react with the structural analogue disialylparagloboside," Biochim. Biophys. Acta., 837:349-53, 1985. [0832] Brooks et al., Clin. Exp. Immunol., 39: 477, 1980. [0833] Brown et al., J. Biol. Chem., 255:4980-83, 1980. [0834] Burchell et al., J. Immunol., 131:508-13, 1983. [0835] Bystryn et al., Cancer, 61:1065, 1988. [0836] Cahan et al., "Identification of a human neuroectodermal tumor antigen (OFA-I-2) as ganglioside GD2," Proc. Natl. Acad Sci. USA, 79:7629-33, 1982. [0837] Campbell et al., "Intercellular adhesion molecule-1 expression by bladder cancer cells: functional effects," J. Urol., 151:1385-90, 1994. [0838] Campbell, in Monoclonal Antibody Technology, Laboratory Techniques in Biochemistry and Molecular Biology, Vol. 13, Burden and Von Knippenberg (eds.), Amsterdam, Elseview, pp. 75-83, 1984. [0839] Campbell et al., Int. J. Cancer, 78:182-88, 1998. [0840] Carr and Morrison, "A two-step mechanism for the interaction of Re lipopolysaccharide with erythrocyte membranes," Rev. Infect. Dis. 6:497-508, 1984. [0841] Carubia et al., Biochem. Biophys. Res. Commun., 120:500-04, 1984. [0842] Chang et al., Crit. Rev. Oncol. Hematol., 22:213, 1996. [0843] Chase et al., "Effect of Monophosphoryl Lipid A on Host Resistance to Bacterial Infection," Infection and Immunity, 53(3):711-12, 1986. [0844] Chen et al., "Activation of Macrophages From Aging Mice by Detoxified Lipid A," Journal of Leukocyte Biology, 49:416-22, 1991. [0845] Chen et al., Cancer Res., 54:1065-70, 1994. [0846] Cheng et al., "Bacillus Calmette-Gerin interacts with the carboxyl-terminal heparin bindings domain of fibronectin: implications for BCG-mediated antitumor activity," J. Urol., 152:1275-80, 1994. [0847] Cheresh and Klier, "Disialoganglioside GD3 distributes preferentially into substrate associated microprocesses on human melanoma cells during their attachment to fibronectin," J. Cell. Biol., 102:1887-97, 1986. [0848] Cheresh et al., "A monoclonal antibody recognizes an O-acetyl sialic acid in a human melanoma-associated ganglioside," J. Biol. Chem., 259:7453-59, 1984. [0849] Cheresh et al., "Disialoganglioside GD3 on human melanoma serves as a relevant target antigen for monoclonal antibody-mediated tumor cytolysis," Proc. Natl. Acad. Sci. USA, 82:5155-59, 1985. [0850] Cheresh et al., "Disialogangliosides GD2 and GD3 are involved in the attachment of human melanoma and neuroblastoma cells to extracellular matrix proteins," J. Cell. Biol., 102:688-96, 1986. [0851] Cheresh et al., "Localization of gangliosides GD2 and GD3 in adhesion plaques and on the surface of human melanoma cells," Proc. Natl. Acad. Sci. USA, 81:5767-71, 1984. [0852] Cheung et al., "Detection of neuroblastoma cells in bone marrow using GD2 specific monoclonal antibodies," J. Clin. Oncol., 4:363-69, 1986. [0853] Chu and Sharom, "Gangliosides inhibit T-lymphocyte proliferation by preventing the interaction of interleukin-2 with its cell surface receptors," Immunology, 79:10-16, 1993. [0854] Cohen, Science, 259:1691-92, 1993. [0855] Colcher et al., Proc. Natl. Acad. Sci. USA, 78:3199, 1987. [0856] Coligan et al., in Current Protocols in Immunology, Vol. 1, Wiley Interscience. Greene (ed.), 1998. [0857] Coombes et al., Vaccine, 14:1429-38, 1996. [0858] Coulie et al., "A new gene coding for a differentiation antigen recognized by autologous cytologic T lymphocytes on HLA-A2 melanomas," J. Exp. Med., 180:35, 1994. [0859] Deavin et al., Mol. Immunol., 33:145-55, 1996. [0860] DeBruijn et al., Eur. J. Immunol., 21:2963-70, 1991. [0861] Dippold et al., "Immunohistochemical localization of ganglioside GD3 in human malignant melanoma, epithelial tumors and normal tissues," Cancer Res., 45:3699-705, 1985. [0862] Dippold et al., "Inflammatory response at the tumor site after systemic application of monoclonal anti-GD3-ganglioside antibody to patients with malignant melanoma," Am. Assoc. Cancer Res., 978:247, 1984. [0863] Dippold et al., Proc. Natl. Acad. Sci. USA, 77:6115, 1980. [0864] Dresser and Phillips, in Immunopotentiation, CIBA Foundation Symposium 18, Elsevier, Amsterdam, p.3, 1973. [0865] Dwivedi et al., "Plasma lipid-bound sialic acid alterations in neoplastic diseases," Experientia, 46:91-94, 1990. [0866] Elder, "Skin Cancer," Cancer, 75:245-56, 1995. [0867] Elliott et al., "The D-Galactosamine Loaded Mouse and Its Enhanced Sensitivity to Lipopolysaccharide and Monophosphoryl Lipid A: A Role for Superoxide," J. Immunol., 10:69-74, 1991. [0868] Euhus et al., Cancer Immunol Immunother., 29:247-54, 1989. [0869] Fawwaz et al., Statutory Invention Registration Patent No. H819, application no. 6-6-5, 439, 1990. [0870] Fischer, Handb. Lipid Res., 6:123-234, 1990. [0871] Fisher-Hoch et al., Proc. Nat'l Acad. Sci. USA, 86:317-21, 1989. [0872] Fitzgerald, "Syphilis vaccine: up-regulation of immunogenicity by cyclophosphamide, Ribi adjuvant, and indomethacin confers significant protection against challenge infection in rabbits," Vaccine, 9:265-72, 1991. [0873] Fleischmann, Park and Hassan, "Fibronectin expression on surgical specimens correlated with the response to intravesical bacillus Calmette-Guerin therapy," J. Urol., 149:268-71, 1993. [0874] Flexner et al., Ann. N.Y. Acad. Sci., 569:86-103, 1989. [0875] Flexner et al., Vaccine, 8:17-21, 1990. [0876] Foster et al., Cancer Res., 57:3325-30, 1997. [0877] Fraizer et al., Blood, 86:4704-06, 1995. [0878] Fredman et al., Neurol. Res., 8:123-26, 1986. [0879] Freimer et al., "Gangliosides elicit a T-cell independent antibody response," J. Autoimmun., 6:281-89, 1993. [0880] Freudenberg et al., "ELISA for antibodies to Lipid A, Lipopolyasscharides and other hydrophobic antigens," Infection, 17:322-24, 1989. [0881] Gaiger et al., Blood, 96:1334, 2000 [0882] Garg and Subbarao, "Immune Responses of Systemic and Mucosal Lymphoid Organs to Pnu-Immune Vaccine as a Function of Age and the Efficacy of Monophosphoryl Lipid A as an Adjuvant," Infection and Immunity, 60:2329-36, 1992. [0883] Gaugler et al., "Human gene MAGE-3 codes for an antigen recognized on a human melanoma by autologous cytolytic T lymphocytes," J. Exp. Med., 179:921, 1994. [0884] Gefter et al., Somatic Cell Genet., 3:231-36, 1977. [0885] Gennaro et al., Am. J. Ind. Med., 37:275-82, 2000. [0886] Gillard et al., "Antibodies against ganglioside GT.sub.3 in the sera of patients with type I Diabetes mellitus," J. Immunol., 142:3826-32, 1989. [0887] Gillis, Nature, 268:154-56, 1977. [0888] Glynn, McCoy and Fefer, Cancer Res., 28:434-39, 1968. [0889] Goding, in Monoclonal Antibodies: Principles and Practice, 2d ed., Orlando, Fla., Academic Press, pp. 60-61, 65-66, 71-74, 1986. [0890] Goff et al., Eur. J. Biochem., 130:553-57, 1983. [0891] Grabarek et al., "Endotoxic Lipid A Interaction with Human Platelets," The Journal of Biological Chemistry, 265:8117-21, 1990. [0892] Graus et al., "Distribution of the ganglioside GD3 in the human nervous system detected by R24 mouse monoclonal antibody," Brain Res., 324:190-94, 1984. [0893] Guzman et al., Circulation, 88:2838-48, 1993a. [0894] Guzman et al., Cir. Res., 73:1202-07, 1993b. [0895] Hachida et al., Transplant Proc., 22:1663-70, 1990. [0896] Hachida et al., Transplantation, 56:479-82, 1993. [0897] Harada et al., Mol. Urol., 3:357-364, 1999. [0898] Hardings et al., "Effects of pH and polysaccharides on peptide binding to class II major histocompatibility complex molecules," Proc. Natl. Acad. Sci. USA, 88:2740-44, 1991. [0899] Harel et al., Cancer Res., 50:6311, 1990. [0900] Harlow and Lane, Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory, 1988. [0901] Helling et al., "Construction of Immunogenic GD.sub.3-conjugate vaccines," Ann. N. Y. Acad. Sci., 690:396-97, 1993. [0902] Hellstrom et al., "Strong anti-tumor activities of IgG 3 antibodies to a human melanoma-associated ganglioside," Proc. Natl. Acad. Sci. USA, 82:1499-1502, 1985. [0903] Hirabayashi et al., "Syngeneic monoclonal antibody against melanoma antigen with interspecies cross-reactivity recognize GM.sub.3, a prominent ganglioside of B16 melanoma," Biol. Chem., 260:13328-33, 1985. [0904] Hirabayashi et al.,

"Reactivity of mouse monoclonal antibody M2590 against B16 melanoma cells with chemically synthesized GM3 ganglioside," Biochim. Biophys. Acta, 875:126-28, 1986. [0905] Hirabayashi et al., Jpn. J. Cancer Res., 78:614-20, 1987. [0906] Hoon et al, "Gangliosides from melanoma immunomodulate response of T-cells to interleukin-2," Cell Immunol., 4:1111-19, 1988. [0907] Horibata and Harris, Exp. Cell. Res., 60:61, 1970. [0908] Horgan, "Total and lipid-bound sialic acid levels in sera from patients with Cancer," Clin. Chim. Acta., 118:327-31, 1982. [0909] Houghten et al., "Mouse monoclonal IgG3 antibody detecting GD3 ganglioside: A phase I trial in patients with malignant melanoma," Proc. Natl. Acad. Sci. USA, 82:1242-46, 1985. [0910] Houghton, "Cancer Antigens: Immune Recognition of Self and Altered Self," J. Exp. Med., 180:1-4, 1994. [0911] Hraba et al., "The Influence of Monophosphoryl Lipid A (MPL.TM.) on Erythrocyte Autoantibody Formation," Immunobiol., 189:448-56, 1993. [0912] Hunter et al., Vaccine, 9:250: 1991. [0913] Inoue et al., Blood, 88:2267-78, 1996. [0914] Irie and Morton, "Regression of cutaneous metastatic melanoma by intralesional injection with human monoclonal antibody to ganglioside GD2," Proc. Natl. Acad. Sci. USA, 83:8694-98, 1986. [0915] Irie and Ravindranath, "Gangliosides as targets for monoclonal antibody therapy of cancer," in Therapeutic monoclonal antibodies, Borrebaeck and Larrick (eds.), Stockton Press, New York, p. 75-94, 1990. [0916] Irie et al., "Human antibody to OFA-I, a tumor antigen, produced in vitro by Epstein-Barr virus transformed human B-lymphoid cell lines," Proc. Natl. Acad. Sci. USA, 79:5666-70, 1982. [0917] Irie et al., "Melanoma gangliosides and human monoclonal antibody," in Human Tumor Antigens and Specific Tumor Therapy, Metzgar and Mitchell (eds.), Alan R. Liss, Inc., New York, pp. 115-126, 1989. [0918] Ishioka et al., "MHC interaction and T cell recognition of carbohydrates and glycopeptides," J. Immunol., 148:2446-51, 1992. [0919] Jackson et al., "Induction of ICAM 1 expression on bladder tumours by BCG immunotherapy," J. Clin. Pathol., 47:309-12, 1994. [0920] Johnson and Tomai, "A Study of the Cellular and Molecular Mediators of the Adjuvant Action of a Nontoxic Monophosphoryl Lipid A," Adv. Exp. Med. Biol., 133:567-79, 1988. [0921] Johnson et al., "Characterization of a nontoxic monophosphoryl lipid A," Rev. Infect. Dis., 9:512-16, 1987. [0922] Johnson et al., "Structural Characterization of Monophosphoryl Lipid A Homologs Obtained from Salmonella minnesota Re595 Lipopolysaccharide," J. Biol. Chem., 265:8108-16, 1990. [0923] Johnston and Bystryn, "Effect of Cell Wall Skeleton and Monophosphoryl Lipid A Adjuvant on the Immunogenicity of a Murine B16 Melanoma Vaccine," Journal of the National Cancer Institute, 83:1240-45, 1991. [0924] Jones et al., J Natl Cancer Inst, 66:249-54, 1981. [0925] Kabat et al., "Sequences of Proteins of Immunological Interest," 5th ed., Public Health Service, National Institutes of Health, Bethesda, Md., pp 647-669, 1991. [0926] Kass-Eisler et al., Proc. Nat Acad. Sci. USA, 90:11498-502, 1993. [0927] Katopodis et al, "Lipid-associated sialic acid test for the detection of human cancer," Cancer Res., 42:5270-75, 1982. [0928] Kawaguchi et al., "Characteristic mode of action of gangliosides in selective modulation of CD4 on human T lymphocytes," Biochem. Biophys. Res. Commun., 158:1050-55, 1989. [0929] Kawakami et al., "Cloning of the gene coding for a shared human melanoma antigen recognized by autologous T cells infiltrating into tumor," Proc. Natl. Acad. Sci. USA, 91:3515, 1994. [0930] Kawakami, Eliyahu, Sakaguchi, Robbins, Rivoltini, Yannelli, Appella and Rosenberg, "Identification of the immunodominant peptides of the Mart-1 human melanoma antigen recognized by the majority of HLA-A2 restricted tumor infiltrating lymphocytes," J. Exp. Med., 180:347-52, 1994. [0931] Kensil et al., J. Am. Vet. Med. Assoc., 199:1423, 1991. [0932] Kloppel et al., "Glycolipid-bound sialic acid in serum: Increased levels in mice and humans bearing mammary carcinomas," Proc. Natl. Acad. Sci. USA, 74:3011-13, 1977. [0933] Kohler and Milstein, Nature, 256:495-97, 1975. [0934] Kohler and Milstein, Eur. J. Immunol., 6:511-19, 1976. [0935] Kolls et al., Proc. Nat'l Acad. Sci. USA, 91:215-19, 1994. [0936] Koscielak et al., "Glycolipid antigen and its antibody," Immunochemistry, 5:441-55, 1968. [0937] Kovach et al., "Lipid IV.sub.A Inhibits Synthesis and Release of Tumor Necrosis Factor Induced by Lipopolysaccharide in Human Whole Blood Ex Vivo," J. Exp. Med., 172:77-84, 1990. [0938] Kwok and Higuchi, Nature, 339:237-38, 1989. [0939] Kyogashima et al., Jpn. J. Cancer Res., 78:1229-32, 1987. [0940] Ladisch et al., "Shedding of GD2 ganglioside by human neuroblastoma," Int. J. Cancer, 39:73-76, 1987. [0941] Lamm et al., "A randomized trial of intravesical doxorubicin and immunotherapy with bacille Calmette-Guerin for transitional-cell carcinoma of the bladder," N. Engl. J. Med., 325:1205, 1991. [0942] Lengle et al., "Inhibition of the lectin-induced mitogenic response of thymocytes by glycolipids," Cancer Res., 39:817-922, 1979. [0943] Liepkalns et al., J. Neurochem., 36:1959-65, 1981. [0944] Livingston et al., "The Serologic Response to Meth A Sarcoma Vaccines After Cyclophosphamide Treatment is Additionally Increased by Various Adjuvants," The Journal of Immunology, 135(2):1505-09, 1985. [0945] Livingston et al., "Approaches to augmenting immunogenicity of the ganglioside GM.sub.2 in mice: purified GM.sub.2 is superior to whole cells," J. Immunol., 138:1524-29, 1987a. [0946] Livingston et al., "Vaccines containing purified GM.sub.2 gangliosides elicit GM.sub.2 antibodies in melanoma patients," Proc. Natl. Acad. Sci. USA, 84:2911-15, 1987b. [0947] Ljunggren et al., Nature, 346:476-80, 1990. [0948] Lozzio and Lozzio, Blood, 45:321-34, 1975. [0949] Madonna and Vogel, "Induction of Early-Phase Endotoxin Tolerance in Athymic (Nude) Mice, B-Cell-Deficient (xid) Mice, and Splenectomized Mice," Infection and Immunity, 53:707-10, 1986. [0950] Mahvi et al., Immunology and cell Biology, 75:456-60, 1997. [0951] Maratea et al., Gene, 40:39-46, 1985. [0952] Masihi et al., "Effects of Nontoxic Lipid A and Endotoxin on Resistance of Mice to Toxoplasma gondii," Journal of Biological Response Modifiers, 7:535-39, 1988 [0953] Melling et al., J. Immunol., 117:1267-74, 1976. [0954] Menssen et al., J. Cancer Res. Clin. Oncol., 126:226-32, 2000. [0955] Merrifield, J. Am. Chem. Soc., 85:2149-46, 1963. [0956] Miller and Esselman, "Modulation of immune response by antigen reactive lymphocytes after cultivation with gangliosides," J. Immunol., 115:839-43, 1975. [0957] Minden, "Shared Antigens Between Animal and Human Tumors and Microorganisms," in BCG in Cancer Immunotherapy, Lamoureux, Turcotte and Portelance (eds.); pp. 73-81, 1976. [0958] Miotti et al., Cancer Res., 65:826, 1985. [0959] Mitchell et al., "Active specific Immunotherapy of melanoma with allogeneic cell lysates: Rationale, results and possible mechanisms of action," Ann. N.Y. Acad. Sci., 690:153-66, 1993. [0960] Mitchell et al., "Active specific immunotherapy of melanoma: Phase I trial of allogeneic lysates and a novel adjuvant," Cancer Res., 48:5883-93, 1988. [0961] Mitchell et al., "Active-Specific Immunotherapy for Melanoma," Journal of Clinical Oncology, 8:856-59, 1990. [0962] Miyake et al., Cancer Res., 48:6154-60, 1988. [0963] Mooney et al., "Bacterial superantigen signaling via HLA class II on human B lymphocytes," Mol. Immunol., 31:675-81, 1994. [0964] Morrison et al., "Specific ganglioside binding to receptor sites on T lymphocytes that couple to ganglioside-induced decrease of CD4 expression," Life Sci., 45:1219-24, 1989. [0965] Morton and Ravindranath, in Cancer Medicine, 3rd ed., Holland et al. (eds.), Lea and Febiger, Philadelphia, p.967, 1993. [0966] Morton et al., "Polyvalent Melanoma Vaccine Improves Survival of Patients with Metastatic Melanoma," John Wayne Cancer Institute at Saint John's Hospital and Health Center, Santa Monica, Calif., reprinted from Specific Immunotherapy of Cancer with Vaccines, Volume 690 of the Annals of the New York Academy of Sciences, 1993. [0967] Morton et al., Ann. Surg., 216:463, 1992. [0968] Morton et al., in Biological Function of Gangliosides, Progress in Brain Research, Vol. 101, pp 251-275, 1994. [0969] Mosmann and Coffman, Ann. Rev. Immunol., 7:145-73, 1989. [0970] Munjal et al., "Combined measurement and significance of lipid-bound sialic acid and carcinoembryonic antigen in detection of human cancer," Diagn. Immunol., 2:36-43, 1984. [0971] Murphy et al., Proc. Natl. Acad. Sci. USA, 83:8258-62, 1986. [0972] Myers et al., "Monophosphoryl Lipid A Behaves as a T-Cell-Independent Type 1 Carrier for Hapten-Specific Antibody Responses in Mice," Infection and Immunity, 63:168-74, 1995. [0973] Naiki et al., "Properties of antisera to ganglioside GM.sub.1 and Asialo GM.sub.1", J. Immunol., 113:84-93, 1974. [0974] Nair et al., Nature Biotechnol., 16:364-69, 1998. [0975] Natoli et al., "A murine monoclonal antibody detecting the ganglioside GM2: Characterization of cell surface reactivity," Cancer Res., 46:4116-20, 1986. [0976] Nonomura et al., Hinyokika Kiyo, 45:593-97, 1999. [0977] Nudelman et al., "Characterization of a human melanoma-associated ganglioside antigen defined by a monoclonal antibody 4.2," J. Biol. Chem., 257:12752-56, 1982. [0978] Odean et al., "Involvement of Gamma Interferon in Antibody Enhancement by Adjuvants," Infection and Immunity, 58:427-32, 1990. [0979] Oji et al., Jpn. J. Cancer Res., 90:194-204, 1999. [0980] Old et al., Ann. N. Y. Acad. Sci., 101:80-106, 1962. [0981] Pan et al., Leukemia, 14:1634, 2000. [0982] Parker et al., J. Immunol., 152:163, 1994. [0983] Pascal et al., "Immunochemical studies on normal and Tay-Sachs' brain gangliosides," Proc. Soc. Exp. Biol. Med., 121:739-43, 1966. [0984] Patek, Collins and Cohn, "Transformed cell lines susceptible or resistant to in vive surveillance against tumorigenesis," Nature, 276:510-11, 1978. [0985] Patmasiriwat et al., Leukemia, 13:891-900, 1999. [0986] Paul, Fundamental Immunology, 3rd ed., Raven Press, pp. 243-47, 1993. [0987] Portoukalian et al., "Humoral immune response in disease-free advanced melanoma patients after vaccination with melanoma-associated gangliosides," Int. J. Cancer, 49:893-99, 1991. [0988] Portoukalian, "Alteration of gangliosides in plasma and red cells of human bearing melanoma tumors," Biochem. Biophys. Res. Commun., 85:916-20, 1978. [0989] Portoukalian, "Immunoregulatory activity of gangliosides shed by melanoma tumors," in Gangliosides and Cancer, Oettgen (ed.), New York, VCH Publishers, pp. 207-16, 1989. [0990] Powell and Newman (eds.), "Vaccine Design (the subunit and adjuvant approach)," Plenum Press, N Y, 1995. [0991] Prokazova et al., "Sialylated lactosylceramides. Possible inducers of non-specific immunosuppression and atherosclerotic lesions," Eur. J. Biochem., 171:1-10, 1988. [0992] Pukel et al., "GD3, a prominent ganglioside of human melanoma: Detection and characterization of mouse monoclonal antibody," J. Exp. Med., 155:1133-47, 1982. [0993] Qureshi et al., "Purification and structural determination of nontoxic lipid A obtained from the Lipopolysaccharide of Salmonella typhimurium," J. Biol. Chem., 257:11808-15, 1985. [0994] Rabinovich et al., "Vaccine Technologies: View to the Future," Science, 265:1401-02, 1994. [0995] Raines and Ross, J. Biol. Chem., 257:5154-60, 1982. [0996] Rapport and Graf, "Immunochemical Reactions of Lipids," Prog. Allergy, 13:273-331, 1969. [0997] Ravindranath and Irie, in Malignant Melanoma: Biology, Diagnosis, and Therapy, Nathanson (ed.), Kluwer Acad., Boston, p. 17, 1988. [0998] Ravindranath and Morton, "Role of gangliosides in active immunotherapy with melanoma vaccine," Int. Rev. Immunol., 7:303, 1991. [0999] Ravindranath et al., "Human melanoma antigen O-acetylated Ganglioside GD.sub.3 is recognized by Cancer antennarius lectin," J. Biol. Chem., 263:2079-86, 1988. [1000] Ravindranath et al., "An epitope common to gangliosides O-acetyl GD3 and GD3 recognized by antibodies in melanoma patients after active specific immunotherapy," Cancer Res., 49:3891-97, 1989. [1001] Ravindranath et al., "Ganglioside GM.sub.3:GD.sub.3 Ratio as an Index for the Management of Melanoma," Cancer, 67:3029-35, 1991. [1002] Ravindranath et al., "Efficacy of tumor cell vaccine after incorporating monophosphoryl lipid A (MPL) in tumor cell membranes containing tumor-associated ganglioside," Experientia, 50:648-653, 1994a. [1003] Ravindranath et al., "Attachment of Monophosphoryl Lipid A (MPL) to Cells and Liposomes Augments Antibody Response to membrane-bound Gangliosides," Journal of Autoimmunity, 7:803-16, 1994b. [1004] Ravindranath et al., "Factors affecting the fine specificity and sensitivity of serum antiganglioside antibodies in ELISA," J. Immunol. Methods, 169:257-72, 1994c. [1005] Real et al., "Class I (unique) tumor antigens of human melanoma. Identification of a 90,000 dalton cell surface glycoprotein by autologous antibody," J. Exp. Med., 160:1219, 1984. [1006] Reeves et al., Cancer Res., 56:5672-77, 1996. [1007] Reisfeld et al., Melanoma Antigens and Antibodies, p. 317, 1982. [1008] Ribi, "Beneficial modification of the endotoxin molecule," J. Biol. Resp. Mod., 3:1-9, 1984. [1009] Ribi et al., "Lipid A and immunotherapy," Rev. Infect. Dis., 6:567-72, 1984. [1010] Ribi et al., "Modulation of humoral and cell mediated immune responses by a structurally established nontoxic lipid A," in Immunobiology and Immunopharmacology of Bacterial Endotoxins, Szentivanji and Friedman (eds.), Plenum Press, New York, pp. 407-420, 1986. [1011] Riclman et al., Lancet 337:998, 1991. [1012] Rolland, Crit. Rev. Therap. Drug Carrier Systems, 15:143-98, 1998. [1013] Rosenberg et al., Ann. Surg., 210:474, 1989. [1014] Rosenberg et al., N. Engl. J. Med., 319:1676, 1988. [1015] Rosenfeld et al., Science, 252:431-34, 1991. [1016] Rothbard and Taylor, EMBO J., 7:93-100, 1988. [1017] Rott et al., "Protection from experimental allergic encephalomyelitis by application of a bacterial superantigen," Int. Immunol., 4:347-53, 1992. [1018] Sato et al., "Cytoplasmic membrane-associated protein (CAP) isolated from Streptococcus pyrogenes: as a new bacterial superantigen," Microbiol. Immunol., 38:139-47, 1994. [1019] Sato et al., Science, 273:352, 1996. [1020] Satoh et al., Pathol. Int., 50:458-71, 2000. [1021] Schuster et al., "Production of antibodies against phosphocholine, phosphatidylcholine, sphingomyelin, and lipid A by injection of liposomes containing lipid A," J. Immunol., 122:900-05, 1979. [1022] Schwab et al., "Superantigen can reactivate bacterial cell wall-induced arthritis," J. Immunol., 150:4151-59, 1993. [1023] Shafer and Spitznagel, "Sensitivity of Salmonella typhimurium to polymorphonuclear granulocyte extracts: Role of lipid A," Rev. Infect. Dis., 6:577-81, 1984. [1024] Shepard et al.,

J. Clin. Immunol., 11:117-27, 1991. [1025] Sherwin et al., "The production of antisera to gangliosides from human nervous tissue," Canad. J. Biochem., 42:1640-48, 1964. [1026] Shimizu et al., Int. J. Gynecol. Pathol., 19:158-63, 2000. [1027] Shy et al., "Antibodies to GM.sub.1 and GD.sub.1b in patients with motor neuron disease with plasma cell dyscrasia," Ann. Neurol., 25:511-18, 1989. [1028] Siddiqui et al., Cancer Res., 44:5262-65, 1984. [1029] Sidell et al., Cancer Immunol Immunother, 7:151-55, 1979. [1030] Sigma Cell Culture, Volume 9, Number 2, 1993. [1031] Skeiky et al., Infection and Immun., 67:3998-4007, 1999. [1032] Slavin and Strober, Nature, 272:624-26, 1977. [1033] Spinsanti et al., Leuk Lymphoma, 38:611-19, 2000. [1034] Stiess and Kruger, "Mammalian Cell Culture Media--Overview and Applications," The Source (Sigma Cell Culture Technical and Product News), 9:1-8, 1993. [1035] Stoute et al. New Engl. J. Med., 336:86-91, 1997. [1036] Svennerholm et al., "Tumor gangliosides as targets for active specific immunotherapy of melanoma in man," in Biological Function of Gangliosides, Progress in Brain Research, Vol. 101, 1994. [1037] Tai et al., "Ganglioside GM2 as a human tumor antigen (OFA-I-1)," Proc. Natl. Acad. Sci., 80:5392-96, 1983. [1038] Takada et al., "Molecular and Structural Requirements of a Lipoteichoic Acid from Enterococcus hirae ATCC 9790 for Cytokine-Inducing, Antitumor, and Antigenic Activities," Infection and Immunity, 63:57-65, 1995. [1039] Takahashi et al., J. Immunol., 140:3244, 1988. [1040] Tamaki et al., Leukemia, 13:393-99, 1999. [1041] Tamauchi et al., Immunology, 50:605, 1983. [1042] Tanamoto, "Dissociation of Endotoxic Activities in a Chemically Synthesized Lipid A Precursor after Acetylation," Infection and Immunity, 63:690-92, 1995. [1043] Tanamoto, "Free Hydroxyl Groups Are Not Required for Endotoxic Activity of Lipid A," Infection and Immunity, 62:1705-09, 1994a. [1044] Tanamoto, FEBS Lett., 351:325-29, 1994b. [1045] Tautu et al., "Improved procedure for determination of serum lipid-associated sialic acid: Application for early diagnosis of colorectal cancer," J. Natl. Cancer Inst., 80:1333-37, 1988. [1046] Thor et al., Cancer Res., 46:3118, 1986. [1047] Thurin et al., "Proton NMR and fast-atom bombardment mass spectrometry analysis of the melanoma-associated ganglioside 9-O-acetyl GD3," J. Biol. Chem., 260:14556-563, 1985. [1048] Timmerman and Levy, Ann. Rev. Med., 50:507-29, 1999. [1049] Tomai and Johnson, "T Cell and Interferon-.gamma. Involvement in the Adjuvant Action of a Detoxified Endotoxin," Journal of Biological Response Modifiers, 8:625-43, 1989. [1050] Tomai et al., "The Adjuvant Properties of a Nontoxic Monophosphoryl Lipid A in Hyporesponsive and Aging Mice," Journal of Biological Response Modifiers, 6:99-107, 1987. [1051] Tsuchida et al., J. Dermatol., 11:129-38, 1984. [1052] Tsuchida et al., "Gangliosides of Human Melanoma: Altered Expression in Vivo and in Vitro," Cancer Research, 47:1278-81, 1987. [1053] Tsuchida et al., "Gangliosides as tumor markers of human melanoma: bio-chemical and immunologic assays," in New Horizons of Tumor Immunotherapy, Torisu and Yoshida (eds.), pp. 315-325, 1989. [1054] Tuting et al., J. Immunol., 160:1139-47, 1998. [1055] Ulmer et al., Science, 259:1745-49, 1993. [1056] Vadhan-Raj et al., J. Clin. Oncol., 6:1636, 1988. [1057] van der Bruggen et al., "A gene encoding an antigen recognized by cytolytic T lymphocytes on a human melanoma," Science, 264:716, 1991. [1058] Verma et al., "Adjuvant Effects of Liposomes Containing Lipid A: Enhancement of Liposomal Antigen Presentation and Recruitment of Macrophages," Infection and Immunity, 60:2438-44, 1992. [1059] Vijayasaradhi et al., "The melanoma antigen gp75 is the human homologue of the mouse b (brown) locus gene product," J. Exp. Med., 171:1375, 1990. [1060] Von Bultzingslowen, Pneumologie, 53:266-75, 1999. [1061] Vosika et al., Cancer Immunol. Immunother., 18:107, 1984. [1062] Watson, Ralph, Sarkar and Cohn, "Leukemia viruses associated with mouse myeloma cells," Proc. Natl. Acad. Sci. USA, 66:344, 1970. [1063] Westrick et al., Biochim. Biophys. Acta, 750:141-48, 1983a [1064] Westrick et al., Cancer Res., 43:5890-94, 1983b [1065] Whisler and Yates, "Regulation of lymphocyte responses by human gangliosides. I. Characteristics of inhibitory effects and the induction of impaired activation," J Immuunol., 125:2106-12, 1980. [1066] Wide et al., in Radioimmunoassay Methods, Kirkham and Hunter (eds.), E. and S. Livingstone, Edinburgh, 1970. [1067] Wilschut, "Preparation and properties of phospholipid vesicles," in Methodologie des liposomes appliquee a la pharmacologie et a la biologies cellulaire, Leserman and Barbet (eds.), INSERM, Paris, pp. 1-10, 1982. [1068] Yamaguchi et al., "Cell-surface antigens of melanoma recognized by human monoclonal antibodies," Proc. Natl. Acad. Sci. USA, 84:2416-20, 1987. [1069] Yamamoto et al., "In vitro Augmentation of Natural Killer Cell Activity and Production of Interferon-.alpha./.beta. and -.gamma. with Deoxyribonucleic Acid Fraction from Mycobacterium bovis BCG," Jpn. J. Cancer Res., 79:866-73, 1988. [1070] Yeh et al., "A cell-surface antigen which is present in the ganglioside fraction and shared by human melanomas," Int. J. Cancer, 29:269-75, 1982. [1071] Yin et al., "Effect of Various Adjuvants on the Antibody Response of Mice to Pneumococcal Polysaccharides," J. Biol. Resp. Modifiers, 8:190-205, 1989. [1072] Yokoyama et al., "Immunochemical studies with gangliosides," J. Immunol., 90:372-80, 1963. [1073] Zapata et al., Protein Eng., 8:1057-62, 1995. [1074] Zitvogel et al., Nat. Med., 4:594-600, 1998.

[1075] All of the compositions and methods disclosed and claimed herein can be made and executed without undue experimentation in light of the present disclosure. While the compositions and methods of this invention have been described in terms of preferred embodiments, it will be apparent to those of skill in the art that variations may be applied to the composition, methods and in the steps or in the sequence of steps of the method described herein without departing from the concept, spirit and scope of the invention. More specifically, it will be apparent that certain agents which are both chemically and physiologically related may be substituted for the agents described herein while the same or similar results would be achieved. All such similar substitutes and modifications apparent to those skilled in the art are deemed to be within the spirit, scope and concept of the invention as defined by the appended claims. All publications, patents, and patent applications cited herein are hereby incorporated by reference in their entirety for all purposes. Accordingly, the exclusive rights sought to be patented are as described in the claims below:

Sequence CWU 1

1

12412672DNAHomo sapiens 1cttccagaga gcaatatggc tggttcccca acatgcctca ccctcatcta tatcctttgg 60cagctcacag ggtcagcagc ctctggaccc gtgaaagagc tggtcggttc cgttggtggg 120gccgtgactt tccccctgaa gtccaaagta aagcaagttg actctattgt ctggaccttc 180aacacaaccc ctcttgtcac catacagcca gaagggggca ctatcatagt gacccaaaat 240cgtaataggg agagagtaga cttcccagat ggaggctact ccctgaagct cagcaaactg 300aagaagaatg actcagggat ctactatgtg gggatataca gctcatcact ccagcagccc 360tccacccagg agtacgtgct gcatgtctac gagcacctgt caaagcctaa agtcaccatg 420ggtctgcaga gcaataagaa tggcacctgt gtgaccaatc tgacatgctg catggaacat 480ggggaagagg atgtgattta tacctggaag gccctggggc aagcagccaa tgagtcccat 540aatgggtcca tcctccccat ctcctggaga tggggagaaa gtgatatgac cttcatctgc 600gttgccagga accctgtcag cagaaacttc tcaagcccca tccttgccag gaagctctgt 660gaaggtgctg ctgatgaccc agattcctcc atggtcctcc tgtgtctcct gttggtgccc 720ctcctgctca gtctctttgt actggggcta tttctttggt ttctgaagag agagagacaa 780gaagagtaca ttgaagagaa gaagagagtg gacatttgtc gggaaactcc taacatatgc 840ccccattctg gagagaacac agagtacgac acaatccctc acactaatag aacaatccta 900aaggaagatc cagcaaatac ggtttactcc actgtggaaa taccgaaaaa gatggaaaat 960ccccactcac tgctcacgat gccagacaca ccaaggctat ttgcctatga gaatgttatc 1020tagacagcag tgcactcccc taagtctctg ctcaaaaaaa aaacaattct cggcccaaag 1080aaaacaatca gaagaattca ctgatttgac tagaaacatc aaggaagaat gaagaacgtt 1140gacttttttc caggataaat tatctctgat gcttctttag atttaagagt tcataattcc 1200atccactgct gagaaatctc ctcaaaccca gaaggtttaa tcacttcatc ccaaaaatgg 1260gattgtgaat gtcagcaaac cataaaaaaa gtgcttagaa gtattcctat agaaatgtaa 1320atgcaaggtc acacatatta atgacagcct gttgtattaa tgatggctcc aggtcagtgt 1380ctggagtttc attccatccc agggcttgga tgtaaggatt ataccaagag tcttgctacc 1440aggagggcaa gaagaccaaa acagacagac aagtccagca gaagcagatg cacctgacaa 1500aaatggatgt attaattggc tctataaact atgtgcccag cactatgctg agcttacact 1560aattggtcag acgtgctgtc tgccctcatg aaattggctc caaatgaatg aactactttc 1620atgagcagtt gtagcaggcc tgaccacaga ttcccagagg gccaggtgtg gatccacagg 1680acttgaaggt caaagttcac aaagatgaag aatcagggta gctgaccatg tttggcagat 1740actataatgg agacacagaa gtgtgcatgg cccaaggaca aggacctcca gccaggcttc 1800atttatgcac ttgtgctgca aaagaaaagt ctaggtttta aggctgtgcc agaacccatc 1860ccaataaaga gaccgagtct gaagtcacat tgtaaatcta gtgtaggaga cttggagtca 1920ggcagtgaga ctggtggggc acggggggca gtgggtactt gtaaaccttt aaagatggtt 1980aattcattca atagatattt attaagaacc tatgcggccc ggcatggtgg ctcacacctg 2040taatcccagc actttgggag gccaaggtgg gtgggtcatc tgaggtcagg agttcaagac 2100cagcctggcc aacatggtga aaccccatct ctactaaaga tacaaaaatt tgctgagcgt 2160ggtggtgtgc acctgtaatc ccagctactc gagaggccaa ggcatgagaa tcgcttgaac 2220ctgggaggtg gaggttgcag tgagctgaga tggcaccact gcactccggc ctaggcaacg 2280agagcaaaac tccaatacaa acaaacaaac aaacacctgt gctaggtcag tctggcacgt 2340aagatgaaca tccctaccaa cacagagctc accatctctt atacttaagt gaaaaacatg 2400gggaagggga aaggggaatg gctgcttttg atatgttccc tgacacatat cttgaatgga 2460gacctcccta ccaagtgatg aaagtgttga aaaacttaat aacaaatgct tgttgggcaa 2520gaatgggatt gaggattatc ttctctcaga aaggcattgt gaaggaattg agccagatct 2580ctctccctac tgcaaaaccc tattgtagta aaaaagtctt ctttactatc ttaataaaac 2640agatattgtg agattcaaaa aaaaaaaaaa aa 26722335PRTHomo sapiens 2Met Ala Gly Ser Pro Thr Cys Leu Thr Leu Ile Tyr Ile Leu Trp Gln1 5 10 15Leu Thr Gly Ser Ala Ala Ser Gly Pro Val Lys Glu Leu Val Gly Ser 20 25 30Val Gly Gly Ala Val Thr Phe Pro Leu Lys Ser Lys Val Lys Gln Val 35 40 45Asp Ser Ile Val Trp Thr Phe Asn Thr Thr Pro Leu Val Thr Ile Gln 50 55 60Pro Glu Gly Gly Thr Ile Ile Val Thr Gln Asn Arg Asn Arg Glu Arg65 70 75 80Val Asp Phe Pro Asp Gly Gly Tyr Ser Leu Lys Leu Ser Lys Leu Lys 85 90 95Lys Asn Asp Ser Gly Ile Tyr Tyr Val Gly Ile Tyr Ser Ser Ser Leu 100 105 110Gln Gln Pro Ser Thr Gln Glu Tyr Val Leu His Val Tyr Glu His Leu 115 120 125Ser Lys Pro Lys Val Thr Met Gly Leu Gln Ser Asn Lys Asn Gly Thr 130 135 140Cys Val Thr Asn Leu Thr Cys Cys Met Glu His Gly Glu Glu Asp Val145 150 155 160Ile Tyr Thr Trp Lys Ala Leu Gly Gln Ala Ala Asn Glu Ser His Asn 165 170 175Gly Ser Ile Leu Pro Ile Ser Trp Arg Trp Gly Glu Ser Asp Met Thr 180 185 190Phe Ile Cys Val Ala Arg Asn Pro Val Ser Arg Asn Phe Ser Ser Pro 195 200 205Ile Leu Ala Arg Lys Leu Cys Glu Gly Ala Ala Asp Asp Pro Asp Ser 210 215 220Ser Met Val Leu Leu Cys Leu Leu Leu Val Pro Leu Leu Leu Ser Leu225 230 235 240Phe Val Leu Gly Leu Phe Leu Trp Phe Leu Lys Arg Glu Arg Gln Glu 245 250 255Glu Tyr Ile Glu Glu Lys Lys Arg Val Asp Ile Cys Arg Glu Thr Pro 260 265 270Asn Ile Cys Pro His Ser Gly Glu Asn Thr Glu Tyr Asp Thr Ile Pro 275 280 285His Thr Asn Arg Thr Ile Leu Lys Glu Asp Pro Ala Asn Thr Val Tyr 290 295 300Ser Thr Val Glu Ile Pro Lys Lys Met Glu Asn Pro His Ser Leu Leu305 310 315 320Thr Met Pro Asp Thr Pro Arg Leu Phe Ala Tyr Glu Asn Val Ile 325 330 3353834DNAHomo sapiens 3ttgtaagata ttacttgtcc ttccaggctg ttctttctgt agctcccttg ttttcttttt 60gtgatcatgt tgcagatggc tgggcagtgc tcccaaaatg aatattttga cagtttgttg 120catgcttgca taccttgtca acttcgatgt tcttctaata ctcctcctct aacatgtcag 180cgttattgta atgcaagtgt gaccaattca gtgaaaggaa cgaatgcgat tctctggacc 240tgtttgggac tgagcttaat aatttctttg gcagttttcg tgctaatgtt tttgctaagg 300aagataagct ctgaaccatt aaaggacgag tttaaaaaca caggatcagg tctcctgggc 360atggctaaca ttgacctgga aaagagcagg actggtgatg aaattattct tccgagaggc 420ctcgagtaca cggtggaaga atgcacctgt gaagactgca tcaagagcaa accgaaggtc 480gactctgacc attgctttcc actcccagct atggaggaag gcgcaaccat tcttgtcacc 540acgaaaacga atgactattg caagagcctg ccagctgctt tgagtgctac ggagatagag 600aaatcaattt ctgctaggta attaaccatt tcgactcgag cagtgccact ttaaaaatct 660tttgtcagaa tagatgatgt gtcagatctc tttaggatga ctgtattttt cagttgccga 720tacagctttt tgtcctctaa ctgtggaaac tctttatgtt agatatattt ctctaggtta 780ctgttgggag cttaatggta gaaacttcct tggtttctat gattaaagtc tttt 8344184PRTHomo sapiens 4Met Leu Gln Met Ala Gly Gln Cys Ser Gln Asn Glu Tyr Phe Asp Ser1 5 10 15Leu Leu His Ala Cys Ile Pro Cys Gln Leu Arg Cys Ser Ser Asn Thr 20 25 30Pro Pro Leu Thr Cys Gln Arg Tyr Cys Asn Ala Ser Val Thr Asn Ser 35 40 45Val Lys Gly Thr Asn Ala Ile Leu Trp Thr Cys Leu Gly Leu Ser Leu 50 55 60Ile Ile Ser Leu Ala Val Phe Val Leu Met Phe Leu Leu Arg Lys Ile65 70 75 80Ser Ser Glu Pro Leu Lys Asp Glu Phe Lys Asn Thr Gly Ser Gly Leu 85 90 95Leu Gly Met Ala Asn Ile Asp Leu Glu Lys Ser Arg Thr Gly Asp Glu 100 105 110Ile Ile Leu Pro Arg Gly Leu Glu Tyr Thr Val Glu Glu Cys Thr Cys 115 120 125Glu Asp Cys Ile Lys Ser Lys Pro Lys Val Asp Ser Asp His Cys Phe 130 135 140Pro Leu Pro Ala Met Glu Glu Gly Ala Thr Ile Leu Val Thr Thr Lys145 150 155 160Thr Asn Asp Tyr Cys Lys Ser Leu Pro Ala Ala Leu Ser Ala Thr Glu 165 170 175Ile Glu Lys Ser Ile Ser Ala Arg 18051339DNAHomo sapiens 5ttgcactcta gaagggacaa tggacttctg gctttggcca ctttacttcc tgccagtatc 60gggggccctg aggatcctcc cagaagtaaa ggtagagggg gagctgggcg gatcagttac 120catcaagtgc ccacttcctg aaatgcatgt gaggatatat ctgtgccggg agatggctgg 180atctggaaca tgtggtaccg tggtatccac caccaacttc atcaaggcag aatacaaggg 240ccgagttact ctgaagcaat acccacgcaa gaatctgttc ctagtggagg taacacagct 300gacagaaagt gacagcggag tctatgcctg cggagcgggc atgaacacag accggggaaa 360gacccagaaa gtcaccctga atgtccacag tgaatacgag ccatcatggg aagagcagcc 420aatgcctgag actccaaaat ggtttcatct gccctatttg ttccagatgc ctgcatatgc 480cagttcttcc aaattcgtaa ccagagttac cacaccagct caaaggggca aggtccctcc 540agttcaccac tcctccccca ccacccaaat cacccaccgc cctcgagtgt ccagagcatc 600ttcagtagca ggtgacaagc cccgaacctt cctgccatcc actacagcct caaaaatctc 660agctctggag gggctgctca agccccagac gcccagctac aaccaccaca ccaggctgca 720caggcagaga gcactggact atggctcaca gtctgggagg gaaggccaag gatttcacat 780cctgatcccg accatcctgg gccttttcct gctggcactt ctggggctgg tggtgaaaag 840ggccgttgaa aggaggaaag ccctctccag gcgggcccgc cgactggccg tgaggatgcg 900cgccctggag agctcccaga ggccccgcgg gtcgccgcga ccgcgctccc aaaacaacat 960ctacagcgcc tgcccgcggc gcgctcgtgg agcggacgct gcaggcacag gggaagcccc 1020cgttcccggc cccggagcgc cgttgccccc cgccccgctg caggtgtctg aatctccctg 1080gctccatgcc ccatctctga agaccagctg tgaatacgtg agcctctacc accagcctgc 1140cgccatgatg gaggacagtg attcagatga ctacatcaat gttcctgcct gacaactccc 1200cagctatccc ccaaccccag gctcggactg tggtgccaag gagtctcatc tatctgctga 1260tgtccaatac ctgcttcatg tgttctcaga gccctcatca ttcccatgcc ccatctcgat 1320cccatcccca tctatctgt 13396390PRTHomo sapiens 6Met Asp Phe Trp Leu Trp Pro Leu Tyr Phe Leu Pro Val Ser Gly Ala1 5 10 15Leu Arg Ile Leu Pro Glu Val Lys Val Glu Gly Glu Leu Gly Gly Ser 20 25 30Val Thr Ile Lys Cys Pro Leu Pro Glu Met His Val Arg Ile Tyr Leu 35 40 45Cys Arg Glu Met Ala Gly Ser Gly Thr Cys Gly Thr Val Val Ser Thr 50 55 60Thr Asn Phe Ile Lys Ala Glu Tyr Lys Gly Arg Val Thr Leu Lys Gln65 70 75 80Tyr Pro Arg Lys Asn Leu Phe Leu Val Glu Val Thr Gln Leu Thr Glu 85 90 95Ser Asp Ser Gly Val Tyr Ala Cys Gly Ala Gly Met Asn Thr Asp Arg 100 105 110Gly Lys Thr Gln Lys Val Thr Leu Asn Val His Ser Glu Tyr Glu Pro 115 120 125Ser Trp Glu Glu Gln Pro Met Pro Glu Thr Pro Lys Trp Phe His Leu 130 135 140Pro Tyr Leu Phe Gln Met Pro Ala Tyr Ala Ser Ser Ser Lys Phe Val145 150 155 160Thr Arg Val Thr Thr Pro Ala Gln Arg Gly Lys Val Pro Pro Val His 165 170 175His Ser Ser Pro Thr Thr Gln Ile Thr His Arg Pro Arg Val Ser Arg 180 185 190Ala Ser Ser Val Ala Gly Asp Lys Pro Arg Thr Phe Leu Pro Ser Thr 195 200 205Thr Ala Ser Lys Ile Ser Ala Leu Glu Gly Leu Leu Lys Pro Gln Thr 210 215 220Pro Ser Tyr Asn His His Thr Arg Leu His Arg Gln Arg Ala Leu Asp225 230 235 240Tyr Gly Ser Gln Ser Gly Arg Glu Gly Gln Gly Phe His Ile Leu Ile 245 250 255Pro Thr Ile Leu Gly Leu Phe Leu Leu Ala Leu Leu Gly Leu Val Val 260 265 270Lys Arg Ala Val Glu Arg Arg Lys Ala Leu Ser Arg Arg Ala Arg Arg 275 280 285Leu Ala Val Arg Met Arg Ala Leu Glu Ser Ser Gln Arg Pro Arg Gly 290 295 300Ser Pro Arg Pro Arg Ser Gln Asn Asn Ile Tyr Ser Ala Cys Pro Arg305 310 315 320Arg Ala Arg Gly Ala Asp Ala Ala Gly Thr Gly Glu Ala Pro Val Pro 325 330 335Gly Pro Gly Ala Pro Leu Pro Pro Ala Pro Leu Gln Val Ser Glu Ser 340 345 350Pro Trp Leu His Ala Pro Ser Leu Lys Thr Ser Cys Glu Tyr Val Ser 355 360 365Leu Tyr His Gln Pro Ala Ala Met Met Glu Asp Ser Asp Ser Asp Asp 370 375 380Tyr Ile Asn Val Pro Ala385 39072007DNAArtificial SequenceDescription of Artificial SequenceLy1452 open reading frame His tag fusion 7atgcagcatc accaccatca ccacgtgtca caatctacag tcaggcagga ttctcctgtg 60gagccctggg aagggatcag cgatcactct ggcattattg atggttcgcc cagactcctg 120aacactgacc atcctccttg ccaattagac atcaggctca tgaggcacaa agctgtctgg 180attaaccccc aggatgtgca gcaacagccg caggacttgc aatctcaggt gccagcagca 240gggaacagtg ggacccattt tgtgacagat gctgcctctc cctcaggccc ttcaccttcg 300tgcctcgggg actccctggc agagacaacg ttgtctgagg ataccacaga ctccgttggc 360agcgcttctc cccatggctc gagtgaaaag agtagcagct tctctctgtc ctcaacagag 420gtacacatgg tccgcccagg atactctcat cgggtgtctc tgcccacaag ccctgggatt 480ttggccacct ccccatatcc tgagactgac agtgcttttt ttgagccttc ccatctgaca 540tctgctgctg atgaaggtgc tgttcaagtc agtagaagaa ccatttcttc gaattccttc 600tcaccagagg tatttgtgct gcctgttgat gtagaaaagg aaaatgccca cttttatgtt 660gcagatatga ttatatcagc aatggagaaa atgaagtgta acattctgag tcaacagcag 720acagagagct ggagtaaaga agtcagtggg ttacttggga gtgatcagcc tgactctgaa 780atgacttttg ataccaacat aaagcaagag tctgggtctt ctacttcttc atacagtggc 840tatgaaggtt gtgctgtgtt acaggtcagc ccagtgactg aaacacgtac ttaccatgat 900gtgaaagaga tttgcaaatg cgatgttgat gaatttgtta ttttagagct tggagatttt 960aatgatatca cagaaacctg tagctgttcc tgcagctcct ctaagagtgt cacttatgag 1020ccagacttca attctgcaga actattagcc aaagagctgt accgcgtgtt ccagaagtgc 1080tggatactgt cagtagttaa ttctcagctg gcaggttccc tgagtgcagc tggctcgata 1140gtcgtaaatg aagagtgtgt ccgaaaagac tttgaatcca gtatgaatgt agtacaggaa 1200attaaattta agtctaggat cagagggact gaagactggg ctcctcctag atttcaaatc 1260atatttaata ttcatccacc actcaagagg gaccttgtgg tggcagccca gaattttttc 1320tgtgccggct gtggaactcc agtagagcct aagtttgtga agcggctccg gtactgcgaa 1380tacctaggga agtatttctg tgactgctgc cactcatatg cagagtcgtg catccctgcc 1440cgaatcctga tgatgtggga cttcaagaag tactacgtca gcaatttctc caaacagctg 1500ctcgacagca tatggcacca gcccattttc aatttgctga gcatcggcca aagcctgtat 1560gcgaaagcca aggagctgga cagagtgaag gaaattcagg agcagctctt ccatatcaag 1620aagctgttga agacctgtag gtttgctaac agtgcattaa aggagttcga gcaggtgccg 1680ggacacttga ctgatgagct ccacctgttc tcccttgagg acctggtcag gatcaagaaa 1740gggctgctgg cacccttact caaggacatt ctgaaagctt cccttgcaca tgtggctggc 1800tgtgagctgt gtcaaggaaa gggctttatt tgtgaatttt gccagaatac gactgtcatc 1860ttcccatttc agacagcaac atgtagaaga tgttcagcgt gcagggcttg ctttcacaaa 1920cagtgcttcc agtcctccga gtgcccccgg tgtgcgagga tcacagcgag gagaaaactt 1980ctggaaagtg tggcctctgc agcaaca 20078669PRTArtificial SequenceDescription of Artificial SequenceLy1452 open reading frame His tag fusion 8Met Gln His His His His His His Val Ser Gln Ser Thr Val Arg Gln1 5 10 15Asp Ser Pro Val Glu Pro Trp Glu Gly Ile Ser Asp His Ser Gly Ile 20 25 30Ile Asp Gly Ser Pro Arg Leu Leu Asn Thr Asp His Pro Pro Cys Gln 35 40 45Leu Asp Ile Arg Leu Met Arg His Lys Ala Val Trp Ile Asn Pro Gln 50 55 60Asp Val Gln Gln Gln Pro Gln Asp Leu Gln Ser Gln Val Pro Ala Ala65 70 75 80Gly Asn Ser Gly Thr His Phe Val Thr Asp Ala Ala Ser Pro Ser Gly 85 90 95Pro Ser Pro Ser Cys Leu Gly Asp Ser Leu Ala Glu Thr Thr Leu Ser 100 105 110Glu Asp Thr Thr Asp Ser Val Gly Ser Ala Ser Pro His Gly Ser Ser 115 120 125Glu Lys Ser Ser Ser Phe Ser Leu Ser Ser Thr Glu Val His Met Val 130 135 140Arg Pro Gly Tyr Ser His Arg Val Ser Leu Pro Thr Ser Pro Gly Ile145 150 155 160Leu Ala Thr Ser Pro Tyr Pro Glu Thr Asp Ser Ala Phe Phe Glu Pro 165 170 175Ser His Leu Thr Ser Ala Ala Asp Glu Gly Ala Val Gln Val Ser Arg 180 185 190Arg Thr Ile Ser Ser Asn Ser Phe Ser Pro Glu Val Phe Val Leu Pro 195 200 205Val Asp Val Glu Lys Glu Asn Ala His Phe Tyr Val Ala Asp Met Ile 210 215 220Ile Ser Ala Met Glu Lys Met Lys Cys Asn Ile Leu Ser Gln Gln Gln225 230 235 240Thr Glu Ser Trp Ser Lys Glu Val Ser Gly Leu Leu Gly Ser Asp Gln 245 250 255Pro Asp Ser Glu Met Thr Phe Asp Thr Asn Ile Lys Gln Glu Ser Gly 260 265 270Ser Ser Thr Ser Ser Tyr Ser Gly Tyr Glu Gly Cys Ala Val Leu Gln 275 280 285Val Ser Pro Val Thr Glu Thr Arg Thr Tyr His Asp Val Lys Glu Ile 290 295 300Cys Lys Cys Asp Val Asp Glu Phe Val Ile Leu Glu Leu Gly Asp Phe305 310 315 320Asn Asp Ile Thr Glu Thr Cys Ser Cys Ser Cys Ser Ser Ser Lys Ser 325 330 335Val Thr Tyr Glu Pro Asp Phe Asn Ser Ala Glu Leu Leu Ala Lys Glu 340 345 350Leu Tyr Arg Val Phe Gln Lys Cys Trp Ile Leu Ser Val Val Asn Ser 355 360 365Gln Leu Ala Gly Ser Leu Ser Ala Ala

Gly Ser Ile Val Val Asn Glu 370 375 380Glu Cys Val Arg Lys Asp Phe Glu Ser Ser Met Asn Val Val Gln Glu385 390 395 400Ile Lys Phe Lys Ser Arg Ile Arg Gly Thr Glu Asp Trp Ala Pro Pro 405 410 415Arg Phe Gln Ile Ile Phe Asn Ile His Pro Pro Leu Lys Arg Asp Leu 420 425 430Val Val Ala Ala Gln Asn Phe Phe Cys Ala Gly Cys Gly Thr Pro Val 435 440 445Glu Pro Lys Phe Val Lys Arg Leu Arg Tyr Cys Glu Tyr Leu Gly Lys 450 455 460Tyr Phe Cys Asp Cys Cys His Ser Tyr Ala Glu Ser Cys Ile Pro Ala465 470 475 480Arg Ile Leu Met Met Trp Asp Phe Lys Lys Tyr Tyr Val Ser Asn Phe 485 490 495Ser Lys Gln Leu Leu Asp Ser Ile Trp His Gln Pro Ile Phe Asn Leu 500 505 510Leu Ser Ile Gly Gln Ser Leu Tyr Ala Lys Ala Lys Glu Leu Asp Arg 515 520 525Val Lys Glu Ile Gln Glu Gln Leu Phe His Ile Lys Lys Leu Leu Lys 530 535 540Thr Cys Arg Phe Ala Asn Ser Ala Leu Lys Glu Phe Glu Gln Val Pro545 550 555 560Gly His Leu Thr Asp Glu Leu His Leu Phe Ser Leu Glu Asp Leu Val 565 570 575Arg Ile Lys Lys Gly Leu Leu Ala Pro Leu Leu Lys Asp Ile Leu Lys 580 585 590Ala Ser Leu Ala His Val Ala Gly Cys Glu Leu Cys Gln Gly Lys Gly 595 600 605Phe Ile Cys Glu Phe Cys Gln Asn Thr Thr Val Ile Phe Pro Phe Gln 610 615 620Thr Ala Thr Cys Arg Arg Cys Ser Ala Cys Arg Ala Cys Phe His Lys625 630 635 640Gln Cys Phe Gln Ser Ser Glu Cys Pro Arg Cys Ala Arg Ile Thr Ala 645 650 655Arg Arg Lys Leu Leu Glu Ser Val Ala Ser Ala Ala Thr 660 66593604DNAHomo sapiens 9gggagcctat agggttcaat aacttgcagt gtggttgggg cttatggatg ctggatgaag 60ataagtgagg gaggtttaca gggacagcat catgtcaggc cttgagggca agaatagctc 120tccagacccc cagctggcca tgtggtgagt tcagggccca aatcaagtag taccagcaat 180cagggaactc ctatctgttt tgaatggatt cacaccagcc acaagcctgg aaagatggtg 240tcacaatcta cagtcaggca ggattctcct gtggagccct gggaagggat cagcgatcac 300tctggcatta ttgatggttc gcccagactc ctgaacactg accatcctcc ttgccaatta 360gacatcaggc tcatgaggca caaagctgtc tggattaacc cccaggatgt gcagcaacag 420ccgcaggact tgcaatctca ggtgccagca gcagggaaca gtgggaccca ttttgtgaca 480gatgctgcct ctccctcagg cccttcacct tcgtgcctcg gggactccct ggcagagaca 540acgttgtctg aggataccac agactccgtt ggcagcgctt ctccccatgg ctcgagtgaa 600aagagtagca gcttctctct gtcctcaaca gaggtacaca tggtccgccc aggatactct 660catcgggtgt ctctgcccac aagccctggg attttggcca cctccccata tcctgagact 720gacagtgctt tttttgagcc ttcccatctg acatctgctg ctgatgaagg tgctgttcaa 780gtcagtagaa gaaccatttc ttcgaattcc ttctcaccag aggtatttgt gctgcctgtt 840gatgtagaaa aggaaaatgc ccacttttat gttgcagata tgattatatc agcaatggag 900aaaatgaagt gtaacattct gagtcaacag cagacagaga gctggagtaa agaagtcagt 960gggttacttg ggagtgatca gcctgactct gaaatgactt ttgataccaa cataaagcaa 1020gagtctgggt cttctacttc ttcatacagt ggctatgaag gttgtgctgt gttacaggtc 1080agcccagtga ctgaaacacg tacttaccat gatgtgaaag agatttgcaa atgcgatgtt 1140gatgaatttg ttattttaga gcttggagat tttaatgata tcacagaaac ctgtagctgt 1200tcctgcagct cctctaagag tgtcacttat gagccagact tcaattctgc agaactatta 1260gccaaagagc tgtaccgcgt gttccagaag tgctggatac tgtcagtagt taattctcag 1320ctggcaggtt ccctgagtgc agctggctcg atagtcgtaa atgaagagtg tgtccgaaaa 1380gactttgaat ccagtatgaa tgtagtacag gaaattaaat ttaagtctag gatcagaggg 1440actgaagact gggctcctcc tagatttcaa atcatattta atattcatcc accactcaag 1500agggaccttg tggtggcagc ccagaatttt ttctgtgccg gctgtggaac tccagtagag 1560cctaagtttg tgaagcggct ccggtactgc gaatacctag ggaagtattt ctgtgactgc 1620tgccactcat atgcagagtc gtgcatccct gcccgaatcc tgatgatgtg ggacttcaag 1680aagtactacg tcagcaattt ctccaaacag ctgctcgaca gcatatggca ccagcccatt 1740ttcaatttgc tgagcatcgg ccaaagcctg tatgcgaaag ccaaggagct ggacagagtg 1800aaggaaattc aggagcagct cttccatatc aagaagctgt tgaagacctg taggtttgct 1860aacagtgcat taaaggagtt cgagcaggtg ccgggacact tgactgatga gctccacctg 1920ttctcccttg aggacctggt caggatcaag aaagggctgc tggcaccctt actcaaggac 1980attctgaaag cttcccttgc acatgtggct ggctgtgagc tgtgtcaagg aaagggcttt 2040atttgtgaat tttgccagaa tacgactgtc atcttcccat ttcagacagc aacatgtaga 2100agatgttcag cgtgcagggc ttgctttcac aaacagtgct tccagtcctc cgagtgcccc 2160cggtgtgcga ggatcacagc gaggagaaaa cttctggaaa gtgtggcctc tgcagcaaca 2220tgatgcccct gagtactgtg aaaaagactg ttcaacatgc cttatgataa caccgatttg 2280tgtctattat tggtgacatt gttttagata ttgggtattg tatattaagg aaaaagatgg 2340tctatattct ctttattgca tatacttaat gtttcaaaag aatgcagatt ctgtgtttaa 2400gcacagggct gatagttgtg gttttgttta caaatgttct gttttggctg ctattggttt 2460tttaaagagg ttttttatac ttttgtattt gaatagttat gtttcactga tgctgagcca 2520gtttgtatgt gtgtgcatat atgtgaactg taactgacaa gatgaattac tcagtttctc 2580tttctctaaa gcttgtttga tgaaactggt tggtcctttc agtgaacaaa aatatgaccc 2640caaatctgtt tgctctggct tttatttctt caggaagcag acttccactt aaatgccatt 2700ttgtgattgt gtcaatcata cacattttat ttacttcaga gtttgaatag agagtacaca 2760tttcttctgc agatttattt catgatgagt ttgagttgct tagcagggcg tgtgggtccc 2820gttgaagtgc agtttgaagc aactgcttct agatggcact ctttcaggtg gcacaaattg 2880aacctgtatt tgtcatctct gttccacaca ctgcaatgtc aagggatgca gaagtgagta 2940gaattccatc cctgcccttg aggatcttgc tttaacagat gtaaaactga acataaggta 3000tttgcagatt taaacgaact gggggaaata atgaacagtg tgattctagt aataacatta 3060aaatcataga cattgactaa taaggttaaa tgaatcacaa aacctttatg aatttctttt 3120ttctaatagt tcttatatgt tttcctgaaa catgtgagcc tattcttttt tcttctactt 3180tctatatact ttctcccact tgagaaaggg gccttgaggc tgggtccctt catggtatac 3240ctttagactg aacggtttgc aacctagggc ttgggcatta cattccctgg gattcacatg 3300ccctaactaa acctaccttg attttctcag acagcacagg caggcaataa agcgtcacag 3360attgtcccct aaccccatcc agccatgtgt atgagtgtgt tttattcaat gggatagtac 3420tgagcacatg aaagaaatga atgacttctg tcaatctctt ttcattcagt cttctcattc 3480tgtcaattgt tttctcatcc gcagtgcctc tgccagaact gtgctcacat ccattattta 3540agccagatct tttctaagta ttatagaagt gtagaggcac atagaataaa taaaaccaga 3600cttc 360410662PRTHomo sapiens 10Met Val Ser Gln Ser Thr Val Arg Gln Asp Ser Pro Val Glu Pro Trp1 5 10 15Glu Gly Ile Ser Asp His Ser Gly Ile Ile Asp Gly Ser Pro Arg Leu 20 25 30Leu Asn Thr Asp His Pro Pro Cys Gln Leu Asp Ile Arg Leu Met Arg 35 40 45His Lys Ala Val Trp Ile Asn Pro Gln Asp Val Gln Gln Gln Pro Gln 50 55 60Asp Leu Gln Ser Gln Val Pro Ala Ala Gly Asn Ser Gly Thr His Phe65 70 75 80Val Thr Asp Ala Ala Ser Pro Ser Gly Pro Ser Pro Ser Cys Leu Gly 85 90 95Asp Ser Leu Ala Glu Thr Thr Leu Ser Glu Asp Thr Thr Asp Ser Val 100 105 110Gly Ser Ala Ser Pro His Gly Ser Ser Glu Lys Ser Ser Ser Phe Ser 115 120 125Leu Ser Ser Thr Glu Val His Met Val Arg Pro Gly Tyr Ser His Arg 130 135 140Val Ser Leu Pro Thr Ser Pro Gly Ile Leu Ala Thr Ser Pro Tyr Pro145 150 155 160Glu Thr Asp Ser Ala Phe Phe Glu Pro Ser His Leu Thr Ser Ala Ala 165 170 175Asp Glu Gly Ala Val Gln Val Ser Arg Arg Thr Ile Ser Ser Asn Ser 180 185 190Phe Ser Pro Glu Val Phe Val Leu Pro Val Asp Val Glu Lys Glu Asn 195 200 205Ala His Phe Tyr Val Ala Asp Met Ile Ile Ser Ala Met Glu Lys Met 210 215 220Lys Cys Asn Ile Leu Ser Gln Gln Gln Thr Glu Ser Trp Ser Lys Glu225 230 235 240Val Ser Gly Leu Leu Gly Ser Asp Gln Pro Asp Ser Glu Met Thr Phe 245 250 255Asp Thr Asn Ile Lys Gln Glu Ser Gly Ser Ser Thr Ser Ser Tyr Ser 260 265 270Gly Tyr Glu Gly Cys Ala Val Leu Gln Val Ser Pro Val Thr Glu Thr 275 280 285Arg Thr Tyr His Asp Val Lys Glu Ile Cys Lys Cys Asp Val Asp Glu 290 295 300Phe Val Ile Leu Glu Leu Gly Asp Phe Asn Asp Ile Thr Glu Thr Cys305 310 315 320Ser Cys Ser Cys Ser Ser Ser Lys Ser Val Thr Tyr Glu Pro Asp Phe 325 330 335Asn Ser Ala Glu Leu Leu Ala Lys Glu Leu Tyr Arg Val Phe Gln Lys 340 345 350Cys Trp Ile Leu Ser Val Val Asn Ser Gln Leu Ala Gly Ser Leu Ser 355 360 365Ala Ala Gly Ser Ile Val Val Asn Glu Glu Cys Val Arg Lys Asp Phe 370 375 380Glu Ser Ser Met Asn Val Val Gln Glu Ile Lys Phe Lys Ser Arg Ile385 390 395 400Arg Gly Thr Glu Asp Trp Ala Pro Pro Arg Phe Gln Ile Ile Phe Asn 405 410 415Ile His Pro Pro Leu Lys Arg Asp Leu Val Val Ala Ala Gln Asn Phe 420 425 430Phe Cys Ala Gly Cys Gly Thr Pro Val Glu Pro Lys Phe Val Lys Arg 435 440 445Leu Arg Tyr Cys Glu Tyr Leu Gly Lys Tyr Phe Cys Asp Cys Cys His 450 455 460Ser Tyr Ala Glu Ser Cys Ile Pro Ala Arg Ile Leu Met Met Trp Asp465 470 475 480Phe Lys Lys Tyr Tyr Val Ser Asn Phe Ser Lys Gln Leu Leu Asp Ser 485 490 495Ile Trp His Gln Pro Ile Phe Asn Leu Leu Ser Ile Gly Gln Ser Leu 500 505 510Tyr Ala Lys Ala Lys Glu Leu Asp Arg Val Lys Glu Ile Gln Glu Gln 515 520 525Leu Phe His Ile Lys Lys Leu Leu Lys Thr Cys Arg Phe Ala Asn Ser 530 535 540Ala Leu Lys Glu Phe Glu Gln Val Pro Gly His Leu Thr Asp Glu Leu545 550 555 560His Leu Phe Ser Leu Glu Asp Leu Val Arg Ile Lys Lys Gly Leu Leu 565 570 575Ala Pro Leu Leu Lys Asp Ile Leu Lys Ala Ser Leu Ala His Val Ala 580 585 590Gly Cys Glu Leu Cys Gln Gly Lys Gly Phe Ile Cys Glu Phe Cys Gln 595 600 605Asn Thr Thr Val Ile Phe Pro Phe Gln Thr Ala Thr Cys Arg Arg Cys 610 615 620Ser Ala Cys Arg Ala Cys Phe His Lys Gln Cys Phe Gln Ser Ser Glu625 630 635 640Cys Pro Arg Cys Ala Arg Ile Thr Ala Arg Arg Lys Leu Leu Glu Ser 645 650 655Val Ala Ser Ala Ala Thr 660112494DNAHomo sapiens 11ctttctgctg ttaccgggag cgcggtggcc acggaacgct gcccggagcc gcgcgaggga 60ggacccgacg cgcggcgttt acccagcgca gcgttccacc gctcgggttt ggctggaata 120gctctccaga cccccagctg gccatgtggt gagttcaggg cccaaatcaa gtagtaccag 180caatcaggga actcctatct gttttgaatg gattcacacc agccacaagc ctggaaagat 240ggtgtcacaa tctacagtca ggcaggattc tcctgtggag ccctgggaag ggatcagcga 300tcactctggc attattgatg gttcgcccag actcctgaac actgaccatc ctccttgcca 360attagacatc aggctcatga ggcacaaagc tgtctggatt aacccccagg atgtgcagca 420acagccgcag gacttgcaat ctcaggtgcc agcagcaggg aacagtggga cccattttgt 480gacagatgct gcctctccct caggcccttc accttcgtgc ctcggggact ccctggcaga 540gacaacgttg tctgaggata ccacagactc cgttggcagc gcttctcccc atggctcgag 600tgaaaagagt agcagcttct ctctgtcctc aacagaggta cacatggtcc gcccaggata 660ctctcatcgg gtgtctctgc ccacaagccc tgggattttg gccacctccc catatcctga 720gactgacagt gctttttttg agccttccca tctgacatct gctgctgatg aaggtgctgt 780tcaagtcagt agaagaacca tttcttcgaa ttccttctca ccagaggtat ttgtgctgcc 840tgttgatgta gaaaaggaaa atgcccactt ttatgttgca gatatgatta tatcagcaat 900ggagaaaatg aagtgtaaca ttctgagtca acagcagaca gagagctgga gtaaagaagt 960cagtgggtta cttgggagtg atcagcctga ctctgaaatg acttttgata ccaacataaa 1020gcaagagtct gggtcttcta cttcttcata cagtggctat gaaggttgtg ctgtgttaca 1080ggtcagccca gtgactgaaa cacgtactta ccatgatgtg aaagagattt gcaaatgcga 1140tgttgatgaa tttgttattt tagagcttgg agattttaat gatatcacag aaacctgtag 1200ctgttcctgc agctcctcta agagtgtcac ttatgagcca gacttcaatt ctgcagaact 1260attagccaaa gagctgtacc gcgtgttcca gaagtgctgg atactgtcag tagttaattc 1320tcagctggca ggttccctga gtgcagctgg ctcgatagtc gtaaatgaag agtgtgtccg 1380aaaagacttt gaatccagta tgaatgtagt acaggaaatt aaatttaagt ctaggatcag 1440agggactgaa gactgggctc ctcctagatt tcaaatcata tttaatattc atccaccact 1500caagagggac cttgtggtgg cagcccagaa ttttttctgt gccggctgtg gaactccagt 1560agagcctaag tttgtgaagc ggctccggta ctgcgaatac ctagggaagt atttctgtga 1620ctgctgccac tcatatgcag agtcgtgcat ccctgcccga atcctgatga tgtgggactt 1680caagaagtac tacgtcagca atttctccaa acagctgctc gacagcatat ggcaccagcc 1740cattttcaat ttgctgagca tcggccaaag cctgtatgcg aaagccaagg agctggacag 1800agtgaaggaa attcaggagc agctcttcca tatcaagaag ctgttgaaga cctgtaggtt 1860tgctaacagc tgtgtcaagg aaagggcttt atttgtgaat tttgccagaa tacgactgtc 1920atcttcccat ttcagacagc aacatgtaga agatgttcag cgtgcagggc ttgctttcac 1980aaacagtgct tccagtcctc cgagtgcccc cggtgtgcga ggatcacagc gaggagaaaa 2040cttctggaaa gtgtggcctc tgcagcaaca tgatgcccct gagtactgtg aaaaagactg 2100ttcaacatgc cttatgataa caccgatttg tgtctattat tggtgacatt gttttagata 2160ttgggtattg tatattaagg aaaaagatgg tctatattct ctttattgca tatacttaat 2220gtttcaaaag aatgcagatt ctgtgtttaa gcacagggct gatagttgtg gttttgttta 2280caaatgttct gttttggctg ctattggttt tttaaagagg ttttttatac ttttgtattt 2340gaatagttat gtttcactga tgctgagcca gtttgtatgt gtgtgcatat atgtgaactg 2400taactgacaa gatgaattac tcagtttctc tttctctaaa gcttgtttga tgaaactggt 2460tggtcctttc agtgaacaaa aatatgaccc caaa 249412635PRTHomo sapiens 12Met Val Ser Gln Ser Thr Val Arg Gln Asp Ser Pro Val Glu Pro Trp1 5 10 15Glu Gly Ile Ser Asp His Ser Gly Ile Ile Asp Gly Ser Pro Arg Leu 20 25 30Leu Asn Thr Asp His Pro Pro Cys Gln Leu Asp Ile Arg Leu Met Arg 35 40 45His Lys Ala Val Trp Ile Asn Pro Gln Asp Val Gln Gln Gln Pro Gln 50 55 60Asp Leu Gln Ser Gln Val Pro Ala Ala Gly Asn Ser Gly Thr His Phe65 70 75 80Val Thr Asp Ala Ala Ser Pro Ser Gly Pro Ser Pro Ser Cys Leu Gly 85 90 95Asp Ser Leu Ala Glu Thr Thr Leu Ser Glu Asp Thr Thr Asp Ser Val 100 105 110Gly Ser Ala Ser Pro His Gly Ser Ser Glu Lys Ser Ser Ser Phe Ser 115 120 125Leu Ser Ser Thr Glu Val His Met Val Arg Pro Gly Tyr Ser His Arg 130 135 140Val Ser Leu Pro Thr Ser Pro Gly Ile Leu Ala Thr Ser Pro Tyr Pro145 150 155 160Glu Thr Asp Ser Ala Phe Phe Glu Pro Ser His Leu Thr Ser Ala Ala 165 170 175Asp Glu Gly Ala Val Gln Val Ser Arg Arg Thr Ile Ser Ser Asn Ser 180 185 190Phe Ser Pro Glu Val Phe Val Leu Pro Val Asp Val Glu Lys Glu Asn 195 200 205Ala His Phe Tyr Val Ala Asp Met Ile Ile Ser Ala Met Glu Lys Met 210 215 220Lys Cys Asn Ile Leu Ser Gln Gln Gln Thr Glu Ser Trp Ser Lys Glu225 230 235 240Val Ser Gly Leu Leu Gly Ser Asp Gln Pro Asp Ser Glu Met Thr Phe 245 250 255Asp Thr Asn Ile Lys Gln Glu Ser Gly Ser Ser Thr Ser Ser Tyr Ser 260 265 270Gly Tyr Glu Gly Cys Ala Val Leu Gln Val Ser Pro Val Thr Glu Thr 275 280 285Arg Thr Tyr His Asp Val Lys Glu Ile Cys Lys Cys Asp Val Asp Glu 290 295 300Phe Val Ile Leu Glu Leu Gly Asp Phe Asn Asp Ile Thr Glu Thr Cys305 310 315 320Ser Cys Ser Cys Ser Ser Ser Lys Ser Val Thr Tyr Glu Pro Asp Phe 325 330 335Asn Ser Ala Glu Leu Leu Ala Lys Glu Leu Tyr Arg Val Phe Gln Lys 340 345 350Cys Trp Ile Leu Ser Val Val Asn Ser Gln Leu Ala Gly Ser Leu Ser 355 360 365Ala Ala Gly Ser Ile Val Val Asn Glu Glu Cys Val Arg Lys Asp Phe 370 375 380Glu Ser Ser Met Asn Val Val Gln Glu Ile Lys Phe Lys Ser Arg Ile385 390 395 400Arg Gly Thr Glu Asp Trp Ala Pro Pro Arg Phe Gln Ile Ile Phe Asn 405 410 415Ile His Pro Pro Leu Lys Arg Asp Leu Val Val Ala Ala Gln Asn Phe 420 425 430Phe Cys Ala Gly Cys Gly Thr Pro Val Glu Pro Lys Phe Val Lys Arg 435 440 445Leu Arg Tyr Cys Glu Tyr Leu Gly Lys Tyr Phe Cys Asp Cys Cys His 450 455 460Ser Tyr Ala Glu Ser Cys Ile Pro Ala Arg Ile Leu Met Met Trp Asp465 470 475 480Phe Lys Lys Tyr Tyr Val Ser Asn Phe Ser Lys Gln

Leu Leu Asp Ser 485 490 495Ile Trp His Gln Pro Ile Phe Asn Leu Leu Ser Ile Gly Gln Ser Leu 500 505 510Tyr Ala Lys Ala Lys Glu Leu Asp Arg Val Lys Glu Ile Gln Glu Gln 515 520 525Leu Phe His Ile Lys Lys Leu Leu Lys Thr Cys Arg Phe Ala Asn Ser 530 535 540Cys Val Lys Glu Arg Ala Leu Phe Val Asn Phe Ala Arg Ile Arg Leu545 550 555 560Ser Ser Ser His Phe Arg Gln Gln His Val Glu Asp Val Gln Arg Ala 565 570 575Gly Leu Ala Phe Thr Asn Ser Ala Ser Ser Pro Pro Ser Ala Pro Gly 580 585 590Val Arg Gly Ser Gln Arg Gly Glu Asn Phe Trp Lys Val Trp Pro Leu 595 600 605Gln Gln His Asp Ala Pro Glu Tyr Cys Glu Lys Asp Cys Ser Thr Cys 610 615 620Leu Met Ile Thr Pro Ile Cys Val Tyr Tyr Trp625 630 63513148DNAHomo sapiens 13ctggttcacg tgtggagcta gttaatacgt cctgccaaga tgggtaccag ttgactggac 60atgcttatca gatgtgtcaa gatgctgaaa atggaatttg gttcaaaaag attccacttt 120gtaaagttat ccactgcacc ctccacca 148144094DNAHomo sapiens 14ccagagctgc cggacgctcg cgggtctcgg aacgcatccc gccgcggggg cttcggccgt 60ggcatgggcg ccgcgggcct gctcggggtt ttcttggctc tcgtcgcacc gggggtcctc 120gggatttctt gtggctctcc tccgcctatc ctaaatggcc ggattagtta ttattctacc 180cccattgctg ttggtaccgt gataaggtac agttgttcag gtaccttccg cctcattgga 240gaaaaaagtc tattatgcat aactaaagac aaagtggatg gaacctggga taaacctgct 300cctaaatgtg aatatttcaa taaatattct tcttgccctg agcccatagt accaggagga 360tacaaaatta gaggctctac accctacaga catggtgatt ctgtgacatt tgcctgtaaa 420accaacttct ccatgaacgg aaacaagtct gtttggtgtc aagcaaataa tatgtggggg 480ccgacacgac taccaacctg tgtaagtgtt ttccctctcg agtgtccagc acttcctatg 540atccacaatg gacatcacac aagtgagaat gttggctcca ttgctccagg attgtctgtg 600acttacagct gtgaatctgg ttacttgctt gttggagaaa agatcattaa ctgtttgtct 660tcgggaaaat ggagtgctgt cccccccaca tgtgaagagg cacgctgtaa atctctagga 720cgatttccca atgggaaggt aaaggagcct ccaattctcc gggttggtgt aactgcaaac 780tttttctgtg atgaagggta tcgactgcaa ggcccacctt ctagtcggtg tgtaattgct 840ggacagggag ttgcttggac caaaatgcca gtatgtgaag aaattttttg cccatcacct 900ccccctattc tcaatggaag acatataggc aactcactag caaatgtctc atatggaagc 960atagtcactt acacttgtga cccggaccca gaggaaggag tgaacttcat ccttattgga 1020gagagcactc tccgttgtac agttgatagt cagaagactg ggacctggag tggccctgcc 1080ccacgctgtg aactttctac ttctgcggtt cagtgtccac atccccagat cctaagaggc 1140cgaatggtat ctgggcagaa agatcgatat acctataacg acactgtgat atttgcttgc 1200atgtttggct tcaccttgaa gggcagcaag caaatccgat gcaatgccca aggcacatgg 1260gagccatctg caccagtctg tgaaaaggaa tgccaggccc ctcctaacat cctcaatggg 1320caaaaggaag atagacacat ggtccgcttt gaccctggaa catctataaa atatagctgt 1380aaccctggct atgtgctggt gggagaagaa tccatacagt gtacctctga gggggtgtgg 1440acaccccctg taccccaatg caaagtggca gcgtgtgaag ctacaggaag gcaactcttg 1500acaaaacccc agcaccaatt tgttagacca gatgtcaact cttcttgtgg tgaagggtac 1560aagttaagtg ggagtgttta tcaggagtgt caaggcacaa ttccttggtt tatggagatt 1620cgtctttgta aagaaatcac ctgcccacca ccccctgtta tctacaatgg ggcacacacc 1680gggagttcct tagaagattt tccatatgga accacggtca cttacacatg taaccctggg 1740ccagaaagag gagtggaatt cagcctcatt ggagagagca ccatccgttg tacaagcaat 1800gatcaagaaa gaggcacctg gagtggccct gctcccctat gtaaactttc cctccttgct 1860gtccagtgct cacatgtcca tattgcaaat ggatacaaga tatctggcaa ggaagcccca 1920tatttctaca atgacactgt gacattcaag tgttatagtg gatttacttt gaagggcagt 1980agtcagattc gttgcaaacg tgataacacc tgggatcctg aaataccagt ttgtgaaaaa 2040ggctgccagc cacctcctgg gctccaccat ggtcgtcata caggtggaaa tacggtcttc 2100tttgtctctg ggatgactgt agactacact tgtgaccctg gctatttgct tgtgggaaac 2160aaatccattc actgtatgcc ttcaggaaat tggagtcctt ctgccccacg gtgtgaagaa 2220acatgccagc atgtgagaca gagtcttcaa gaacttccag ctggttcacg tgtggagcta 2280gttaatacgt cctgccaaga tgggtaccag ttgactggac atgcttatca gatgtgtcaa 2340gatgctgaaa atggaatttg gttcaaaaag attccacttt gtaaagttat tcactgtcac 2400cctccaccag tgattgtcaa tgggaagcac acaggcatga tggcagaaaa ctttctatat 2460ggaaatgaag tctcttatga atgtgaccaa ggattctatc tcctgggaga gaaaaattgc 2520agtgcagaag tgattctaaa ggcatggatc ttggagcgag ccttcccaca gtgcttacga 2580tctctgtgcc ctaatccaga agtcaaacat gggtacaagc tcaataaaac acattctgca 2640tattcccaca atgacatagt gtatgttgac tgcaatcctg gcttcatcat gaatggtagt 2700cgcgtgatta ggtgtcatac tgataacaca tgggtgccag gtgtgccaac ttgtatcaaa 2760aaagccttca tagggtgtcc acctccgcct aagaccccta acgggaacca tactggtgga 2820aacatagctc gattttctcc tggaatgtca atcctgtaca gctgtgacca aggctacctg 2880gtggtgggag agccactcct tctttgcaca catgagggaa cctggagcca acctgcccct 2940cattgtaaag aggtaaactg tagctcacca gcagatatgg atggaatcca gaaagggctg 3000gaaccaagga aaatgtatca gtatggagct gttgtaactc tggagtgtga agatgggtat 3060atgctggaag gcagtcccca gagccagtgc caatcggatc accaatggaa ccctcccctg 3120gcggtttgca gatcccgttc acttgctcct gtcctttgtg gtattgctgc aggtttgata 3180cttcttacct tcttgattgt cattacctta tacgtgatat caaaacacag agaacgcaat 3240tattatacag atacaagcca gaaagaagct tttcatttag aagcacgaga agtatattct 3300gttgatccat acaacccagc cagctgatca gaagacaaaa ctggtgtgtg cctcattgct 3360tggaattcag cggaatattg attagaaaga aactgctcta atatcagcaa gtctctttat 3420atggcctcaa gatcaatgaa atgatgtcat aagcgatcac ttcctatatg cacttattct 3480caagaagaac atctttatgg taaagatggg agcccagttt cactgccata tactcttcaa 3540ggactttctg aagcctcact tatgagatgc ctgaagccag gccatggcta taaacattac 3600atggctctaa aagttttgcc ctttttaagg aggcactaaa aagagctgtc ctggtatcta 3660gacccatctt ctttttgaaa tcacatactc atgttactat ctgcttttgg ttataatgtg 3720tttttaatta tctaaagtat gaagcatttt ctggggttat gatggcctta cttttattag 3780gaagtatggt tttattttga tagtagcttc cttcctcggt ggtgttaatc atttcgtttt 3840taccctttac cttcggattt gagtttctct cacattactg tatatacttt gccttccata 3900atcactcagt gattgcaatt tgcacaagtt tttttaaatt atgggaatca agatttaatc 3960ctagagattt ggtgtacaat tcaggctttg gatgtttctt tagcagtttt gtgataagtt 4020ctagttgctt gtaaaatttc acttaataat gtgtacatta gtcattcaat aaattgtaat 4080tgtaaagaaa acat 4094151033PRTHomo sapiens 15Met Gly Ala Ala Gly Leu Leu Gly Val Phe Leu Ala Leu Val Ala Pro1 5 10 15Gly Val Leu Gly Ile Ser Cys Gly Ser Pro Pro Pro Ile Leu Asn Gly 20 25 30Arg Ile Ser Tyr Tyr Ser Thr Pro Ile Ala Val Gly Thr Val Ile Arg 35 40 45Tyr Ser Cys Ser Gly Thr Phe Arg Leu Ile Gly Glu Lys Ser Leu Leu 50 55 60Cys Ile Thr Lys Asp Lys Val Asp Gly Thr Trp Asp Lys Pro Ala Pro65 70 75 80Lys Cys Glu Tyr Phe Asn Lys Tyr Ser Ser Cys Pro Glu Pro Ile Val 85 90 95Pro Gly Gly Tyr Lys Ile Arg Gly Ser Thr Pro Tyr Arg His Gly Asp 100 105 110Ser Val Thr Phe Ala Cys Lys Thr Asn Phe Ser Met Asn Gly Asn Lys 115 120 125Ser Val Trp Cys Gln Ala Asn Asn Met Trp Gly Pro Thr Arg Leu Pro 130 135 140Thr Cys Val Ser Val Phe Pro Leu Glu Cys Pro Ala Leu Pro Met Ile145 150 155 160His Asn Gly His His Thr Ser Glu Asn Val Gly Ser Ile Ala Pro Gly 165 170 175Leu Ser Val Thr Tyr Ser Cys Glu Ser Gly Tyr Leu Leu Val Gly Glu 180 185 190Lys Ile Ile Asn Cys Leu Ser Ser Gly Lys Trp Ser Ala Val Pro Pro 195 200 205Thr Cys Glu Glu Ala Arg Cys Lys Ser Leu Gly Arg Phe Pro Asn Gly 210 215 220Lys Val Lys Glu Pro Pro Ile Leu Arg Val Gly Val Thr Ala Asn Phe225 230 235 240Phe Cys Asp Glu Gly Tyr Arg Leu Gln Gly Pro Pro Ser Ser Arg Cys 245 250 255Val Ile Ala Gly Gln Gly Val Ala Trp Thr Lys Met Pro Val Cys Glu 260 265 270Glu Ile Phe Cys Pro Ser Pro Pro Pro Ile Leu Asn Gly Arg His Ile 275 280 285Gly Asn Ser Leu Ala Asn Val Ser Tyr Gly Ser Ile Val Thr Tyr Thr 290 295 300Cys Asp Pro Asp Pro Glu Glu Gly Val Asn Phe Ile Leu Ile Gly Glu305 310 315 320Ser Thr Leu Arg Cys Thr Val Asp Ser Gln Lys Thr Gly Thr Trp Ser 325 330 335Gly Pro Ala Pro Arg Cys Glu Leu Ser Thr Ser Ala Val Gln Cys Pro 340 345 350His Pro Gln Ile Leu Arg Gly Arg Met Val Ser Gly Gln Lys Asp Arg 355 360 365Tyr Thr Tyr Asn Asp Thr Val Ile Phe Ala Cys Met Phe Gly Phe Thr 370 375 380Leu Lys Gly Ser Lys Gln Ile Arg Cys Asn Ala Gln Gly Thr Trp Glu385 390 395 400Pro Ser Ala Pro Val Cys Glu Lys Glu Cys Gln Ala Pro Pro Asn Ile 405 410 415Leu Asn Gly Gln Lys Glu Asp Arg His Met Val Arg Phe Asp Pro Gly 420 425 430Thr Ser Ile Lys Tyr Ser Cys Asn Pro Gly Tyr Val Leu Val Gly Glu 435 440 445Glu Ser Ile Gln Cys Thr Ser Glu Gly Val Trp Thr Pro Pro Val Pro 450 455 460Gln Cys Lys Val Ala Ala Cys Glu Ala Thr Gly Arg Gln Leu Leu Thr465 470 475 480Lys Pro Gln His Gln Phe Val Arg Pro Asp Val Asn Ser Ser Cys Gly 485 490 495Glu Gly Tyr Lys Leu Ser Gly Ser Val Tyr Gln Glu Cys Gln Gly Thr 500 505 510Ile Pro Trp Phe Met Glu Ile Arg Leu Cys Lys Glu Ile Thr Cys Pro 515 520 525Pro Pro Pro Val Ile Tyr Asn Gly Ala His Thr Gly Ser Ser Leu Glu 530 535 540Asp Phe Pro Tyr Gly Thr Thr Val Thr Tyr Thr Cys Asn Pro Gly Pro545 550 555 560Glu Arg Gly Val Glu Phe Ser Leu Ile Gly Glu Ser Thr Ile Arg Cys 565 570 575Thr Ser Asn Asp Gln Glu Arg Gly Thr Trp Ser Gly Pro Ala Pro Leu 580 585 590Cys Lys Leu Ser Leu Leu Ala Val Gln Cys Ser His Val His Ile Ala 595 600 605Asn Gly Tyr Lys Ile Ser Gly Lys Glu Ala Pro Tyr Phe Tyr Asn Asp 610 615 620Thr Val Thr Phe Lys Cys Tyr Ser Gly Phe Thr Leu Lys Gly Ser Ser625 630 635 640Gln Ile Arg Cys Lys Ala Asp Asn Thr Trp Asp Pro Glu Ile Pro Val 645 650 655Cys Glu Lys Glu Thr Cys Gln His Val Arg Gln Ser Leu Gln Glu Leu 660 665 670Pro Ala Gly Ser Arg Val Glu Leu Val Asn Thr Ser Cys Gln Asp Gly 675 680 685Tyr Gln Leu Thr Gly His Ala Tyr Gln Met Cys Gln Asp Ala Glu Asn 690 695 700Gly Ile Trp Phe Lys Lys Ile Pro Leu Cys Lys Val Ile His Cys His705 710 715 720Pro Pro Pro Val Ile Val Asn Gly Lys His Thr Gly Met Met Ala Glu 725 730 735Asn Phe Leu Tyr Gly Asn Glu Val Ser Tyr Glu Cys Asp Gln Gly Phe 740 745 750Tyr Leu Leu Gly Glu Lys Lys Leu Gln Cys Arg Ser Asp Ser Lys Gly 755 760 765His Gly Ser Trp Ser Gly Pro Ser Pro Gln Cys Leu Arg Ser Pro Pro 770 775 780Val Thr Arg Cys Pro Asn Pro Glu Val Lys His Gly Tyr Lys Leu Asn785 790 795 800Lys Thr His Ser Ala Tyr Ser His Asn Asp Ile Val Tyr Val Asp Cys 805 810 815Asn Pro Gly Phe Ile Met Asn Gly Ser Arg Val Ile Arg Cys His Thr 820 825 830Asp Asn Thr Trp Val Pro Gly Val Pro Thr Cys Met Lys Lys Ala Phe 835 840 845Ile Gly Cys Pro Pro Pro Pro Lys Thr Pro Asn Gly Asn His Thr Gly 850 855 860Gly Asn Ile Ala Arg Phe Ser Pro Gly Met Ser Ile Leu Tyr Ser Cys865 870 875 880Asp Gln Gly Tyr Leu Leu Val Gly Glu Ala Leu Leu Leu Cys Thr His 885 890 895Glu Gly Thr Trp Ser Gln Pro Ala Pro His Cys Lys Glu Val Asn Cys 900 905 910Ser Ser Pro Ala Asp Met Asp Gly Ile Gln Lys Gly Leu Glu Pro Arg 915 920 925Lys Met Tyr Gln Tyr Gly Ala Val Val Thr Leu Glu Cys Glu Asp Gly 930 935 940Tyr Met Leu Glu Gly Ser Pro Gln Ser Gln Cys Gln Ser Asp His Gln945 950 955 960Trp Asn Pro Pro Leu Ala Val Cys Arg Ser Arg Ser Leu Ala Pro Val 965 970 975Leu Cys Gly Ile Ala Ala Gly Leu Ile Leu Leu Thr Phe Leu Ile Val 980 985 990Ile Thr Leu Tyr Val Ile Ser Lys His Arg Glu Arg Asn Tyr Tyr Thr 995 1000 1005Asp Thr Ser Gln Lys Glu Ala Phe His Leu Glu Ala Arg Glu Val Tyr 1010 1015 1020Ser Val Asp Pro Tyr Asn Pro Ala Ser1025 103016637DNAHomo sapiensmodified_base(9)n = g, a, c or t 16ctgggcaana ccaagtcaca gtttccagcg tgctgctcag ccctccgagt gtgtgtgctc 60atccttttca tagaagtccc atmkgscatg gagagggttg ggctgcarag ctgwgattgc 120cagaggccct tccttgagaa ctgtggggaa ggaggccctg ggggtttctt ctgtaggcag 180agctcaggcc ccagtcacct ctgccaccct cagcctggca ctgttgtgcc agagcctctg 240ctgcctctct cttcctaccc atctgcagac cagcagaata ttctccccct ctcatcacca 300accaggagtt tggtgtggtt tctggacacg gccagagcag tcactgcggg gctggttttg 360ctgggcttcc ctgtcaaagc aatgctaacg tccagctctc gactcaaggc caggttcttc 420tcccacttgt ggcctcttgg gcttggaggc tgagccaggg gctcctctcc tgctggccgt 480ccaggaacag acatcttcac atcctcagtc ttccaaaccc ggaccatgcc gtcttgactc 540ccggtgatga tgatctggct tgtgtcccat gctgggccct ccatcaggca gcaacaggtt 600atggctcctt ctgggcccca ggctgtggtg atgctgg 637174191DNAHomo sapiens 17agcgagactt ccagtccgag gtcctgcttt ctgctatgga actattccac atgacaagtg 60gaggtgatgc agcgatgttc agagacggca aagagcctca gccaagtgca gaagctgctg 120ctgccccttc tcttgccaac atctcctgct tcacccagaa gctggtggag aagctgtaca 180gtgggatgtt ctcggcagac cccaggcata tcctcctctt catcctggag cacatcatgg 240tggtcattga gactgcctct tctcaaaggg acactgtcct cagcacttta tacagcagtt 300taaataaagt cattctttat tgcctatcca agccccagca gtccctctcc gaatgcctcg 360gccttctcag catcctgggc tttctgcagg agcactggga tgttgtcttt gccacctaca 420attccaacat cagcttcctc ctgtgtctca tgcattgcct tttgctactc aatgagagaa 480gttacccaga aggatttgga ttggagccca agcctagaat gtctacttat catcaagtct 540tcctttcccc aaatgaagac gtgaaagaaa aaagagaaga cttaccaagt ttgagtgatg 600tccaacacaa catccagaag acagtgcaga ctctctggca gcagctggtg gcacaaaggc 660agcagaccct ggaggatgcc ttcaagatcg atctctctgt gaaacctgga gagagggaag 720tgaagattga agaggtcaca ccgctctggg aggagacgat gctcaaggcc tggcagcatt 780acttagcatc tgagaagaag tcactggcaa gtcgttcaaa tgttgcacac cacagcaaag 840tcactttgtg gagtggaagc ctgtcctcag ccatgaagct gatgcccggg cggcaggcca 900aggaccctga gtgcaagaca gaggattttg tgtcatgtat agagaactac agaagaagag 960gacaagagct atatgcatct ttatacaaag accatgtgca aaggcgaaaa tgtggcaaca 1020tcaaggcagc caacgcctgg gccaggatcc aggagcagct ttttggggag ctgggcttgt 1080ggagccaggg ggaagaaacc aagccctgtt ccccatggga actcgactgg agagaaggac 1140cagctcgaat gaggaaacgc atcaaacgct tgtctccttt ggaggccctg agctcaggaa 1200ggcacaagga aagccaagac aaaaatgatc atatttctca aacaaatgct gaaaaccaag 1260atgaactgac actgagggag gctgagggcg agccggacga ggtgggggtg gactgcaccc 1320agctgacctt cttcccagcc ttacacgaaa gtctgcactc agaagacttc ttggaactgt 1380gtcgggaaag acaagttatt ttacaagagc ttcttgataa agaaaaggtg acgcagaagt 1440tctccctggt gattgtgcag ggccacctgg tgtcagaagg ggtcctgctt tttggccacc 1500aacacttcta catctgcgag aacttcacac tgtctcccac gggtgatgtc tactgtaccc 1560gtcactgctt atccaacatc agcgatccgt tcattttcaa cctgtgcagc aaagacaggt 1620ccactgacca ttactcgtgc cagtgccaca gctacgctga catgcgggag ctacggcagg 1680ctcgcttcct cctgcaggac atcgccctgg agatcttctt ccacaatgga tattccaagt 1740ttcttgtctt ctacaacaat gatcggagta aggcctttaa aagcttctgc tctttccaac 1800ccagcctgaa ggggaaagcc acctcggagg acaccctcaa tctaaggaga taccccggct 1860ctgacaggat catgctgcag aagtggcaga aaagggacat cagcaatttt gagtatctca 1920tgtacctcaa caccgcggct gggagaacct gcaatgacta catgcagtac ccagtgttcc 1980cctgggtcct cgcagactac acctcagaga cattgaactt ggcaaatccg aagattttcc 2040gggatctttc aaagcccatg ggggctcaga ccaaggaaag gaagctgaaa tttatccaga 2100ggtttaaaga agttgagaaa actgaaggag acatgactgt ccagtgccac tactacaccc 2160actactcctc ggccatcatc gtggcctcct acctggtccg gatgccaccc ttcacccagg 2220ccttctgcgc tctgcagggc ggaagcttcg acgtggcaga cagaatgttc cacagtgtga 2280agagcacgtg ggagtcggcc tccagagaga acatgagtga cgtcagggag ctgaccccag 2340agttcttcta cctgcctgag ttcttaacca actgcaacgg ggtagagttc ggctgcatgc 2400aggacgggac tgtgctagga gacgtgcagc tccctccctg ggctgatggg gaccctcgga 2460aattcatcag cctgcacaga aaggccctgg aaagtgactt tgtcagtgcc aacctccacc 2520attggataga ccttattttt gggtacaagc agcaggggcc agccgcagtg gatgctgtta 2580atatcttcca cccctacttc tacggtgaca gaatggacct cagcagcatc actgaccccc 2640tcatcaaaag caccatcctg gggtttgtca gcaactttgg acaggtgccc aaacagctct 2700ttaccaaacc tcacccagcc aggactgcag cagggaagcc tctgcctgga aaggatgtct

2760ccacccccgt gagcctgcct ggccacccac agcccttttt ctacagcctg cagtcgctga 2820ggccctccca ggtcacggtc aaagatatgt acctcttttc tctaggctca gagtccccca 2880aaggggccat tggccacatt gtctctactg agaagaccat tctggctgta gagaggaaca 2940aagtgctgct gcctcctctc tggaacagga ccttcagctg gggctttgat gacttcagct 3000gctgcttggg gagctacggc tccgacaagg tcctgatgac attcgagaac ctggctgcct 3060ggggccgctg tctgtgcgcc gtgtgcccat ccccaacaac gattgtcacc tctgggacca 3120gcactgtggt gtgtgtgtgg gagctcagca tgaccaaagg ccgcccgagg ggcttgcgcc 3180tccggcaggc cttgtatgga cacacacagg ctgtcacgtg cctggcagcg tcagtcacct 3240tcagcctcct ggtgagcggc tcccaggact gcacctgtat cctgtgggat ctggaccacc 3300tcacccacgt gacccgcctg cccgcccatc gggaaggcat ctcagccatc accatcagtg 3360acgtctcagg caccattgtc tcctgtgcgg gagcacactt gtccctgtgg aatgtcaatg 3420gacagcccct ggccagcatc accacagcct ggggcccaga aggagccata acctgttgct 3480gcctgatgga gggcccagca tgggacacaa gccagatcat catcaccggg agtcaagacg 3540gcatggtccg ggtttggaag actgaggatg tgaagatgtc tgttcctgga cggccagcag 3600gagaggagcc cctggctcag cctccaagcc caagaggcca caagtgggag aagaacctgg 3660ccttgagtcg agagctggac gttagcattg ctttgacagg gaagcccagc aaaaccagcc 3720ccgcagtgac tgctctggcc gtgtccagaa accacaccaa actcctggtt ggtgatgaga 3780gggggagaat attctgctgg tctgcagatg ggtaggaaga gagaggcagc agaggctctg 3840gcacaacagt gccaggctga gggtggcaga ggtgactggg gcctgagctc tgcctacaga 3900agaaaccccc agggcctcct tccccacagt tctcaaggaa gggcctctgg caatcacagc 3960tctgcagccc aaccctctcc atggccgatg ggacttctat gaaaaggatg agcacacaca 4020ctcggagggc tgagcagcac gctggaaact gtgacttggt gatgcccagc tgcacacgaa 4080attacacatg actcacctta ttaagggcta ttgcactgaa aaaaaaaaaa agatgggtcg 4140cttactggaa attattgtat tgtctttatt ttattaaagc aactatgttt t 4191181606PRTHomo sapiens 18Met Asn Ile Ser Ser Arg Asp Asn Ala Met Pro Val Phe Leu Leu Arg1 5 10 15Asn Cys Ala Gly His Leu Ser Gly Ser Leu Arg Thr Ile Gly Ala Val 20 25 30Ala Val Gly Gln Leu Gly Val Arg Val Phe His Ser Ser Pro Ala Ala 35 40 45Ser Ser Leu Asp Phe Ile Gly Gly Pro Ala Ile Leu Leu Gly Leu Ile 50 55 60Ser Leu Ala Thr Asp Asp His Thr Met Tyr Ala Ala Val Lys Val Leu65 70 75 80His Ser Val Leu Thr Ser Asn Ala Met Cys Asp Phe Leu Met Gln His 85 90 95Ile Cys Gly Tyr Gln Ile Met Ala Phe Leu Leu Arg Lys Lys Ala Ser 100 105 110Leu Leu Asn His Arg Ile Phe Gln Leu Ile Leu Ser Val Ala Gly Thr 115 120 125Val Glu Leu Gly Phe Arg Ser Ser Ala Ile Thr Asn Thr Gly Val Phe 130 135 140Gln His Ile Leu Cys Asn Phe Glu Leu Trp Met Asn Thr Ala Asp Asn145 150 155 160Leu Glu Leu Ser Leu Phe Ser His Leu Leu Glu Ile Leu Gln Ser Pro 165 170 175Arg Glu Gly Pro Arg Asn Ala Glu Ala Ala His Gln Ala Gln Leu Ile 180 185 190Pro Lys Leu Ile Phe Leu Phe Asn Glu Pro Ser Leu Ile Pro Ser Lys 195 200 205Ile Pro Thr Ile Ile Gly Ile Leu Ala Cys Gln Leu Arg Gly His Phe 210 215 220Ser Thr Gln Asp Leu Leu Arg Ile Gly Leu Phe Val Val Tyr Thr Leu225 230 235 240Lys Pro Ser Ser Val Asn Glu Arg Gln Ile Cys Met Asp Gly Ala Leu 245 250 255Asp Pro Ser Leu Pro Ala Gly Ser Gln Thr Ser Gly Lys Thr Ile Trp 260 265 270Leu Arg Asn Gln Leu Leu Glu Met Leu Leu Ser Val Ile Ser Ser Pro 275 280 285Gln Leu His Leu Ser Ser Glu Ser Lys Glu Glu Met Phe Leu Lys Leu 290 295 300Gly Pro Asp Trp Phe Leu Leu Leu Leu Gln Gly His Leu His Ala Ser305 310 315 320Thr Thr Val Leu Ala Leu Lys Leu Leu Leu Tyr Phe Leu Ala Ser Pro 325 330 335Ser Leu Arg Thr Arg Phe Arg Asp Gly Leu Cys Ala Gly Ser Trp Val 340 345 350Glu Arg Ser Thr Glu Gly Val Asp Ile Val Met Asp Asn Leu Lys Ser 355 360 365Gln Ser Pro Leu Pro Glu Gln Ser Pro Cys Leu Leu Pro Gly Phe Arg 370 375 380Val Leu Asn Asp Phe Leu Ala His His Val His Ile Pro Glu Val Tyr385 390 395 400Leu Ile Val Ser Thr Phe Phe Leu Gln Thr Pro Leu Thr Glu Leu Met 405 410 415Asp Gly Pro Lys Asp Ser Leu Asp Ala Met Leu Gln Trp Leu Leu Gln 420 425 430Arg His His Gln Glu Glu Val Leu Gln Ala Gly Leu Cys Thr Glu Gly 435 440 445Ala Leu Leu Leu Leu Glu Met Leu Lys Ala Thr Met Ser Gln Pro Leu 450 455 460Ala Gly Ser Glu Asp Gly Ala Trp Ala Gln Thr Phe Pro Ala Ser Val465 470 475 480Leu Gln Phe Leu Ser Leu Val His Arg Thr Tyr Pro Gln Asp Pro Ala 485 490 495Trp Arg Ala Pro Glu Phe Leu Gln Thr Leu Ala Ile Ala Ala Phe Pro 500 505 510Leu Gly Ala Gln Lys Gly Val Gly Ala Glu Ser Thr Arg Asn Thr Ser 515 520 525Ser Pro Glu Ala Ala Ala Glu Gly Asp Ser Thr Val Glu Gly Leu Gln 530 535 540Ala Pro Thr Lys Ala His Pro Ala Arg Arg Lys Leu Arg Glu Phe Thr545 550 555 560Gln Leu Leu Leu Arg Glu Leu Leu Leu Gly Ala Ser Ser Pro Lys Gln 565 570 575Trp Leu Pro Leu Glu Val Leu Leu Glu Ala Ser Pro Asp His Ala Thr 580 585 590Ser Gln Gln Lys Arg Asp Phe Gln Ser Glu Val Leu Leu Ser Ala Met 595 600 605Glu Leu Phe His Met Thr Ser Gly Gly Asp Ala Ala Met Phe Arg Asp 610 615 620Gly Lys Glu Pro Gln Pro Ser Ala Glu Ala Ala Ala Ala Pro Ser Leu625 630 635 640Ala Asn Ile Ser Cys Phe Thr Gln Lys Leu Val Glu Lys Leu Tyr Ser 645 650 655Gly Met Phe Ser Ala Asp Pro Arg His Ile Leu Leu Phe Ile Leu Glu 660 665 670His Ile Met Val Val Ile Glu Thr Ala Ser Ser Gln Arg Asp Thr Val 675 680 685Leu Ser Thr Leu Tyr Ser Ser Leu Asn Lys Val Ile Leu Tyr Cys Leu 690 695 700Ser Lys Pro Gln Gln Ser Leu Ser Glu Cys Leu Gly Leu Leu Ser Ile705 710 715 720Leu Gly Phe Leu Gln Glu His Trp Asp Val Val Phe Ala Thr Tyr Asn 725 730 735Ser Asn Ile Ser Phe Leu Leu Cys Leu Met His Cys Leu Leu Leu Leu 740 745 750Asn Glu Arg Ser Tyr Pro Glu Gly Phe Gly Leu Glu Pro Lys Pro Arg 755 760 765Met Ser Thr Tyr His Gln Val Phe Leu Ser Pro Asn Glu Asp Val Lys 770 775 780Glu Lys Arg Glu Asp Leu Pro Ser Leu Ser Asp Val Gln His Asn Ile785 790 795 800Gln Lys Thr Val Gln Thr Leu Trp Gln Gln Leu Val Ala Gln Arg Gln 805 810 815Gln Thr Leu Glu Asp Ala Phe Lys Ile Asp Leu Ser Val Lys Pro Gly 820 825 830Glu Arg Glu Val Lys Ile Glu Glu Val Thr Pro Leu Trp Glu Glu Thr 835 840 845Met Leu Lys Ala Trp Gln His Tyr Leu Ala Ser Glu Lys Lys Ser Leu 850 855 860Ala Ser Arg Ser Asn Val Ala His His Ser Lys Val Thr Leu Trp Ser865 870 875 880Gly Ser Leu Ser Ser Ala Met Lys Leu Met Pro Gly Arg Gln Ala Lys 885 890 895Asp Pro Glu Cys Lys Thr Glu Asp Phe Val Ser Cys Ile Glu Asn Tyr 900 905 910Arg Arg Arg Gly Gln Glu Leu Tyr Ala Ser Leu Tyr Lys Asp His Val 915 920 925Gln Arg Arg Lys Cys Gly Asn Ile Lys Ala Ala Asn Ala Trp Ala Arg 930 935 940Ile Gln Glu Gln Leu Phe Gly Glu Leu Gly Leu Trp Ser Gln Gly Glu945 950 955 960Glu Thr Lys Pro Cys Ser Pro Trp Glu Leu Asp Trp Arg Glu Gly Pro 965 970 975Ala Arg Met Arg Lys Arg Ile Lys Arg Leu Ser Pro Leu Glu Ala Leu 980 985 990Ser Ser Gly Arg His Lys Glu Ser Gln Asp Lys Asn Asp His Ile Ser 995 1000 1005Gln Thr Asn Ala Glu Asn Gln Asp Glu Leu Thr Leu Arg Glu Ala Glu 1010 1015 1020Gly Glu Pro Asp Glu Val Gly Val Asp Cys Thr Gln Leu Thr Phe Phe1025 1030 1035 1040Pro Ala Leu His Glu Ser Leu His Ser Glu Asp Phe Leu Glu Leu Cys 1045 1050 1055Arg Glu Arg Gln Val Ile Leu Gln Glu Leu Leu Asp Lys Glu Lys Val 1060 1065 1070Thr Gln Lys Phe Ser Leu Val Ile Val Gln Gly His Leu Val Ser Glu 1075 1080 1085Gly Val Leu Leu Phe Gly His Gln His Phe Tyr Ile Cys Glu Asn Phe 1090 1095 1100Thr Leu Ser Pro Thr Gly Asp Val Tyr Cys Thr Arg His Cys Leu Ser1105 1110 1115 1120Asn Ile Ser Asp Pro Phe Ile Phe Asn Leu Cys Ser Lys Asp Arg Ser 1125 1130 1135Thr Asp His Tyr Ser Cys Gln Cys His Ser Tyr Ala Asp Met Arg Glu 1140 1145 1150Leu Arg Gln Ala Arg Phe Leu Leu Gln Asp Ile Ala Leu Glu Ile Phe 1155 1160 1165Phe His Asn Gly Tyr Ser Lys Phe Leu Val Phe Tyr Asn Asn Asp Arg 1170 1175 1180Ser Lys Ala Phe Lys Ser Phe Cys Ser Phe Gln Pro Ser Leu Lys Gly1185 1190 1195 1200Lys Ala Thr Ser Glu Asp Thr Leu Asn Leu Arg Arg Tyr Pro Gly Ser 1205 1210 1215Asp Arg Ile Met Leu Gln Lys Trp Gln Lys Arg Asp Ile Ser Asn Phe 1220 1225 1230Glu Tyr Leu Met Tyr Leu Asn Thr Ala Ala Gly Arg Thr Cys Asn Asp 1235 1240 1245Tyr Met Gln Tyr Pro Val Phe Pro Trp Val Leu Ala Asp Tyr Thr Ser 1250 1255 1260Glu Thr Leu Asn Leu Ala Asn Pro Lys Ile Phe Arg Asp Leu Ser Lys1265 1270 1275 1280Pro Met Gly Ala Gln Thr Lys Glu Arg Lys Leu Lys Phe Ile Gln Arg 1285 1290 1295Phe Lys Glu Val Glu Lys Thr Glu Gly Asp Met Thr Val Gln Cys His 1300 1305 1310Tyr Tyr Thr His Tyr Ser Ser Ala Ile Ile Val Ala Ser Tyr Leu Val 1315 1320 1325Arg Met Pro Pro Phe Thr Gln Ala Phe Cys Ala Leu Gln Gly Gly Ser 1330 1335 1340Phe Asp Val Ala Asp Arg Met Phe His Ser Val Lys Ser Thr Trp Glu1345 1350 1355 1360Ser Ala Ser Arg Glu Asn Met Ser Asp Val Arg Glu Leu Thr Pro Glu 1365 1370 1375Phe Phe Tyr Leu Pro Glu Phe Leu Thr Asn Cys Asn Gly Val Glu Phe 1380 1385 1390Gly Cys Met Gln Asp Gly Thr Val Leu Gly Asp Val Gln Leu Pro Pro 1395 1400 1405Trp Ala Asp Gly Asp Pro Arg Lys Phe Ile Ser Leu His Arg Lys Ala 1410 1415 1420Leu Glu Ser Asp Phe Val Ser Ala Asn Leu His His Trp Ile Asp Leu1425 1430 1435 1440Ile Phe Gly Tyr Lys Gln Gln Gly Pro Ala Ala Val Asp Ala Val Asn 1445 1450 1455Ile Phe His Pro Tyr Phe Tyr Gly Asp Arg Met Asp Leu Ser Ser Ile 1460 1465 1470Thr Asp Pro Leu Ile Lys Ser Thr Ile Leu Gly Phe Val Ser Asn Phe 1475 1480 1485Gly Gln Val Pro Lys Gln Leu Phe Thr Lys Pro His Pro Ala Arg Thr 1490 1495 1500Ala Ala Gly Lys Pro Leu Pro Gly Lys Asp Val Ser Thr Pro Val Ser1505 1510 1515 1520Leu Pro Gly His Pro Gln Pro Phe Phe Tyr Ser Leu Gln Ser Leu Arg 1525 1530 1535Pro Ser Gln Val Thr Val Lys Asp Met Tyr Leu Phe Ser Leu Gly Ser 1540 1545 1550Glu Ser Pro Lys Gly Ala Ile Gly His Ile Val Ser Thr Glu Lys Thr 1555 1560 1565Ile Leu Ala Val Glu Arg Asn Lys Val Leu Leu Pro Pro Leu Trp Asn 1570 1575 1580Arg Thr Phe Ser Trp Gly Phe Asp Asp Phe Ser Cys Cys Leu Gly Ser1585 1590 1595 1600Tyr Gly Ser Asp Lys Ser 160519426DNAHomo sapiens 19ctgcccttcc atgtcgtcac aggcataccg ggcgtggacc cactgcttgg tgcccttgcc 60gaagtagtag cacttccgtt ggaaattgat ccacttttca gggcacgtgt tgcacacaaa 120gccgctggac acctgcaact ccatccttag ctttgtcacc tcctcccgga gtctttccag 180caaatctgaa gcttcgttcc tctcgttcaa ttcctgggac ttgaagctgc tcagatctgc 240ttgaagcccg ttcaggttcc aggacagctc caagtcctga gatttcaatc tctgctgttc 300agctcgaagt tcctccagtt cctgtgaaat ctgcgtggac tgggatttct gcgccatctg 360gtcaccgtgg tggctttcca agttcttgga aacttgagag acgttccggg cagccctctc 420ttccag 426201569DNAHomo sapiens 20ggcacgaggc tgcttaaacc tctgtctctg acggtccctg ccaatcgctc tggtcgaccc 60caacacacta ggaggacaga cacaggctcc aaactccact aaccagagct gtgattgtgc 120ccgctgagtg gactgcgttg tcagggagtg agtgctccat catcgggaga atccaagcag 180gaccgccatg gaggaaggtc aatattcaga gatcgaggag cttcccagga ggcggtgttg 240caggcgtggg actcagatcg tgctgctggg gctggtgacc gccgctctgt gggctgggct 300gctgactctg cttctcctgt ggcactggga caccacacag agtctaaaac agctggaaga 360gagggctgcc cggaacgtct ctcaagtttc caagaacttg gaaagccacc acggtgacca 420gatggcgcag aaatcccagt ccacgcagat ttcacaggaa ctggaggaac ttcgagctga 480acagcagaga ttgaaatctc aggacttgga gctgtcctgg aacctgaacg ggcttcaagc 540agatctgagc agcttcaagt cccaggaatt gaacgagagg aacgaagctt cagatttgct 600ggaaagactc cgggaggagg tgacaaagct aaggatggag ttgcaggtgt ccagcggctt 660tgtgtgcaac acgtgccctg aaaagtggat caatttccaa cggaagtgct actacttcgg 720caagggcacc aagcagtggg tccacgcccg gtatgcctgt gacgacatgg aagggcagct 780ggtcagcatc cacagcccgg aggagcagga cttcctgacc aagcatgcca gccacaccgg 840ctcctggatt ggccttcgga acttggacct gaagggggag tttatctggg tggatgggag 900ccacgtggac tacagcaact gggctccagg ggagcccacc agccggagcc agggcgagga 960ctgcgtgatg atgcggggct ccggtcgctg gaacgacgcc ttctgcgacc gtaagctggg 1020cgcctgggtg tgcgaccggc tggccacatg cacgccgcca gccagcgaag gttccgcgga 1080gtccatggga cctgattcaa gaccagaccc tgacggccgc ctgcccaccc cctctgcccc 1140tctccactct tgagcatgga tacagccagg cccagagcaa gaccctgaag acccccaacc 1200acggcctaaa agcctctttg tggctgaaag gtccctgtga cattttctgc cacccaaacg 1260gaggcagctg acacatctcc cgctcctcta tggcccctgc cttcccagga gtacacccca 1320acagcaccct ctccagatgg gagtgccccc aacagcaccc tctccagatg agagtacacc 1380ccaacagcac cctctccaga tgagagtaca ccccaacagc accctctcca gatgagagta 1440caccccaaca gcaccctctc cagatgcagc cccatctcct cagcacccca ggacctgagt 1500atccccagct caggtggtga gtcctcctgt ccagcctgca tcaataaaat ggggcagtga 1560tggcctccc 156921321PRTHomo sapiens 21Met Glu Glu Gly Gln Tyr Ser Glu Ile Glu Glu Leu Pro Arg Arg Arg1 5 10 15Cys Cys Arg Arg Gly Thr Gln Ile Val Leu Leu Gly Leu Val Thr Ala 20 25 30Ala Leu Trp Ala Gly Leu Leu Thr Leu Leu Leu Leu Trp His Trp Asp 35 40 45Thr Thr Gln Ser Leu Lys Gln Leu Glu Glu Arg Ala Ala Arg Asn Val 50 55 60Ser Gln Val Ser Lys Asn Leu Glu Ser His His Gly Asp Gln Met Ala65 70 75 80Gln Lys Ser Gln Ser Thr Gln Ile Ser Gln Glu Leu Glu Glu Leu Arg 85 90 95Ala Glu Gln Gln Arg Leu Lys Ser Gln Asp Leu Glu Leu Ser Trp Asn 100 105 110Leu Asn Gly Leu Gln Ala Asp Leu Ser Ser Phe Lys Ser Gln Glu Leu 115 120 125Asn Glu Arg Asn Glu Ala Ser Asp Leu Leu Glu Arg Leu Arg Glu Glu 130 135 140Val Thr Lys Leu Arg Met Glu Leu Gln Val Ser Ser Gly Phe Val Cys145 150 155 160Asn Thr Cys Pro Glu Lys Trp Ile Asn Phe Gln Arg Lys Cys Tyr Tyr 165 170 175Phe Gly Lys Gly Thr Lys Gln Trp Val His Ala Arg Tyr Ala Cys Asp 180 185 190Asp Met Glu Gly Gln Leu Val Ser Ile His Ser Pro Glu Glu Gln Asp 195 200 205Phe Leu Thr Lys His Ala Ser His Thr Gly Ser Trp Ile Gly Leu Arg 210 215 220Asn Leu Asp Leu Lys Gly Glu Phe Ile Trp Val Asp Gly Ser His Val225 230 235 240Asp Tyr Ser Asn Trp Ala Pro Gly Glu Pro Thr Ser Arg Ser Gln Gly 245 250 255Glu Asp Cys Val Met Met Arg Gly Ser Gly Arg Trp Asn Asp Ala Phe 260

265 270Cys Asp Arg Lys Leu Gly Ala Trp Val Cys Asp Arg Leu Ala Thr Cys 275 280 285Thr Pro Pro Ala Ser Glu Gly Ser Ala Glu Ser Met Gly Pro Asp Ser 290 295 300Arg Pro Asp Pro Asp Gly Arg Leu Pro Thr Pro Ser Ala Pro Leu His305 310 315 320Ser221076DNAHomo sapiensmodified_base(1)..(1076)n = g, a, c or t 22atcagcacga atacattcac gtccaacaac acatcaacta ccaacaccat caccacgagc 60acattcatgc ccaacaacac atcaaccacc aacacctttg ccacaaacac attcatgtct 120aacaacacat caattaccaa caccagcgcc acgaacacat tcacgtccat caacacatca 180accaccaaca ccatcagcac gagcacattc acatccaata acacatcaac taccaacacc 240agcaccatga acacattcat gcccaacaac acgtaaaccc ctaacactgt caccacaaac 300accttacagc cagcagaacg ccagtcacta acaccatcgc catcagcact tcgtggttag 360caacacctca gctgacgcca atgtcaccac aaacacctca tggccagaag cagctcaacc 420accaacaccg ttattataaa tacattcttg accatcaatg cttcaactgc tgacaccatt 480accataaata catccatggc cagcaatact tcaatcacca acaccatgac cagcagcacg 540tcagcggcca tcactgtcac cacaaacacc ttcatatcca ataacacttc aaccaccatc 600atcaccacaa acacctcata gccaacagtg cctcagccac caaccccatc atgacaaaca 660cctcatggcc agcagcactt caaccaccaa caaacccctc caaggtcagc aacaccttca 720tcaccgacat cattaacaga ggtacccacc accagcagca gcttatctcc accacccaca 780ccacagccaa caccatcttt accaaccaca ccagccgtgt cttcatcact ggcaccgaca 840gcaaaaccag tgctgtggcc aggtccacca gcgattactt tccccaagca ccatccctac 900caacagccct ggtcatcatc actggcagaa cccgccaaac cagcactcct agccaatgtc 960tgggaaattg ngatnatttt cttccagtgg gaggctntgg tcaggagagc caatgggatt 1020gcaagactag gtcccacaat ccctcaatat ggtctctttn tccccttccc cccacc 107623476DNAHomo sapiens 23aggtacgcgg ggacgttcaa cgacttactg gggagagaaa gaaaaggaac gggagctgag 60agctgggagt ggagtatgaa gaccaaggaa ttctcttaaa gacctgagca gttatctgga 120actcctcaca aaatcacagt aatggatatt atttcccagt tcctactcta cactgggcat 180agatgttatg gacatcttct gagtcccaca acacccctgc aaggcagata tgatacactc 240ctttcaccta tggagaacgg aggctcaaag aggctaggtg accctcagga aacacagatg 300agaggtcccc gcccagtctg cccagctctg aaatcttcca tgccaactcc cttagggcga 360tcctgagtct agcctgtaca ggcagttcat gtggttgtat ttgaataaaa tccctttcct 420ccagaataaa aaaaaaaaaa taaaaaaaaa aatgaaaaat tgaaagggaa aaaaaa 47624421DNAHomo sapiens 24ccaccaacag tcagaggcca aggaagctgt tggctgaaaa ggtggtctat gttggcgtct 60ggatccctgc cctcctgctg actattcccg acttcatctt tgccaacgtc agtgaggcag 120atgacagata tatctgtgac cgcttctacc ccaatgactt gtgggtggtt gtgttccagt 180ttcagcacat catggttggc cttatcctgc ctggtattgt catcctgtcc tgctattgca 240ttatcatctc caagctgtca cactccaagg gccaccagaa gcgcaaggcc ctcaagacca 300cagtcatcct catcctggct ttcttcgcct gttggctgcc ttactacatt gggatcagca 360tcgactcctt catcctcctg gaaatcatca agcaagggtg tgagtttgag aacactgtgc 420a 421258747DNAHomo sapiens 25caattctgaa tcctgccttt tgcacttaat gtttcataag tatttcccca tgtcactaaa 60aattcttcca aataacattc acgatgtcca tatggaattt cagatgtgga tgaaccaaaa 120tcttgtcaac tattccacta acagtggtta tttagggatg ttcagacatt tcactattta 180aaaaaaaatg tttccacaaa tacctttgtg gcataagttt ttatgagtgg agttactgtt 240ctgaagttcc tgctgaatag aaaatgcttt ccagtgaggc tgtcccaagc cacattccca 300tcagtgacaa gcgagagaca gctggtcttt tcaaatccgg agaccaaata ttatctttga 360aaaaaaatgg atttttgcct aatttggtag tcaccaaata gcatctcatt gttcttttaa 420ttatctgctt ccttttagta gagatcccta aaaagatctg aaaggagtct tcagataaag 480gaaggagctt tcttttgtct gtctacaatc aacaaatatt tattatgcaa accattttgc 540tccgagtttt ctcctctttc cctttttgga cagatttggg agatctcacc tttcaggttt 600tagacatcgt gcagggagga gttttgaggt agggtgcagc ttacggtcca ggataaaaca 660tactgattct gccactacca ggctttgtga aaagcaagtc atgaaaacgc tctgaaattc 720tagaccttca gtagatagga tctaccgtgt ctataaaaat atgaagatcc ttaagtttta 780ttaaagattc gaaaaaagta aaagtgtttt tacggtttta ttttcatttt tatttcttac 840cgttatcgtt tattataaag gatattataa aggatacaga tgaagagata cgtaatgcaa 900ggcctgtgag aaggggcgtg gagcttccga aacctcttcc agccaccacc ctccaagaac 960ctggagtttc tttttttttt tttaattcta caaatgtaat attagaattg attttatctg 1020gccattagtg tgtgtcctaa ctcgttcgtt tctgagagtc ccatctcccg gcccgggata 1080tcatctttcc tgtgtcagtg aaagtgcaga gtagatgaga acctttaacc accaacatta 1140gggaggggtc ccagacaaag ggggtaagtc atgctctgta gagaaaaggt tccctgcctc 1200cgaactacct ctggaacact ccagtaaatg tttcctcttt tgatatagaa aagagggatc 1260gtgtgtagag tgcagtctgg gcaatccctc tcctcgggac catttcgggg taggggcctc 1320tggggtccgt gtcgcgacgc acgcgcctcg gtcccagcta tctccgcagc gggccacccc 1380gcctgcggac gcagtttctc ggccccgccc cacactcgct cccccgcccc acccagtctc 1440cgcgccggag ggaagtggcg cgagggggaa agcactgtct gcgcgcccac tgcaaacctc 1500agccagtctg agatcgcttt aaacgtctga cccccacccc cactccgccc cgcccagttc 1560ttcaacctaa tttctgattc gtgccaaagc ttgtcctctg ctcaaaatcg tggaagacgc 1620cgagtatggg gaccgaagac ctgggttcaa gcccggcttg gaatccctgc ccatccctgg 1680catttcatct ctccgggctt atttgctggt ttctccgaat gcgggccttg tctggttcac 1740gctggatccc caacgcctag aacagtgcgt ggcacgcagt tcgtccttct ataaatatcg 1800gactaaatgc atctctgtga tggtaatacc cacacggtgt tgtgagaatg aatgagtgat 1860tctgtgcaag ttcctagtga tctgttacaa aaagtactgg tcgctaaatt actcttataa 1920taaagcatac ttttaggata ataaagcact attcgcgaat tggttaccgc tattatgaaa 1980ttactgagca atacatatct acatctgatc agtctccaga attatgccaa atcctacctt 2040cttctgaaag tatctcctaa ttatctgcac ctgaccctag tgatgctgtg aatgtgcaag 2100tatagctaca tcctccgaag gaaaggatct ttactccttt tacctcctga atgggctgcg 2160tctgctgaaa gcgcggggga atgggcggtt ggaagcttgg ccctacttcc agcattgccg 2220cctactggtt gggttactcc agcaagtcac tccccttccc tgggcctcag tgtctctact 2280gtagcattcc caggtctgga attccatcca ctttagcaag gatggacgcg ccacagagag 2340acgcgttcct agcccgcgct tcccacctgt cttcaggcgc atcccgcttc cctcaaactt 2400aggaaatgcc tctgggaggt cctgtccggc tccggactca ctaccgacca cccgcaaaca 2460gcagggtccc ctgggcttcc caagccgcgc acctctccgc cccgcccctg cgccctcctt 2520cctcgcgtct gcccctctcc cccaccccgc cttctccctc cccgccccag cggcgcatgc 2580gccgcgctcg gagcgtgttt ttataaaagt ccggccgcgg ccagaaactt cagtttgttg 2640gctgcggcag caggtagcaa agtgacgccg agggcctgag tgctccagta gccaccgcat 2700ctggagaacc agcggttacc atggagggga tcagtgtaag tccagtttca acctgctttg 2760tcataaatgt acaaacgttt gaacttagag cgcagcccct ctccgagcgg gcagaagcgg 2820ccaggacatt ggaggtaccc gtactccaaa aaagggtcac cgaaaggagt tttcttgacc 2880atgcctatat agtgcgggtg ggtggggggg gagcaggatt ggaatctttt tctctgtgag 2940tcgaggagaa acgactggaa agagcgttcc agtggctgca tgtgtctccc ccttgagtcc 3000cgccgcgcgc ggcggcttgc acgctgtttg caaacgtaag aacattctgt gcacaagtgc 3060agagaaggcg tgcgcgctgc ctcgggactc agaccaccgg tctcttcctt ggggaagcgg 3120ggatgtcttg gagcgagtta cattgtctga atttagaggc ggagggcggc gtgcctgggc 3180tgagttccca ggaggagatt gcgcccgctt taacttcggg gttaagcgcc tggtgactgt 3240tcttgacact gggtgcgtgt ttgttaaact ctgtgcggcc gacggagctg tgccagtctc 3300ccagcacagt aggcagaggg cgggagaggc gggtggaccc accgcgccga tcctctgagg 3360ggatcgagtg gtggcagcag ctaggagttg atccgcccgc gcgctttggg tttgaggggg 3420aaaccttccc gccgtccgaa gcgcgcctct tccccacggc cgcgagtggg tcctgcagtt 3480cgagagtttg gggtcgtgca gaggtcagcg gagtggtttg acctcccctt tgacaccgcg 3540cagctgccag ccctgagatt tgcgctccgg ggataggagc gggtacgggg tgaggggcgg 3600gggcggttaa gaccgcacct gggctgccag gtcgccgccg cgaagactgg caggtgcaag 3660tggggaaacc gtttggctct ctccgagtcc agttgtgatg tttaaccgtc ggtggtttcc 3720agaaaccttt tgaaaccctc ttgctaggga gtttttggtt tcctgcagcg gcgcgcaatt 3780caaagacgct cgcggcggag ccgcccagtc gctccccagc accctgtggg acagagcctg 3840gcgtgtcgcc cagcggagcc cctgcagcgc tgcttgcggg cggttggcgt gggtgtagtg 3900ggcagccgcg gcggcccggg gctggacgac ccggcccccc gcgtgcccac cgcctggagg 3960cttccagctg cccacctccg gccgggttaa ctggatcagt ggcggggtaa tgggaaacca 4020cccgggagag tgaggaaatg aaacttgggg cgaggaccac gggtgcagac cccgttacct 4080tctccaccca ggaaaatgcc ccgctcccta acgtcccaaa cgcgccaagt gataaacacg 4140aggatggcaa gagacccaca caccggagga gcgcccgctt gggggaggag gtgccgtttg 4200ttcattttct gacactcccg cccaatatac cccaagcacc gaagggcctt cgttttaaga 4260ccgcattctc tttacccact acaagttgct tgaagcccag aatggtttgt atttaggcag 4320gcgtgggaaa attaagtttt tgcgccttag gagaatgagt ctttgcaacg cccccgccct 4380ccccccgtga tcctcccttc tcccctcttc cctccctggg cgaaaaactt cttacaaaaa 4440gttaatcact gcccctccta gcagcaccca ccccaccccc cacgccgcct gggagtggcc 4500tctttgtgtg tatttttttt ttcctcctaa ggaaggtttt ttttcttccc tctagtgggc 4560ggggcagagg agttagccaa gatgtgactt tgaaaccctc agcgtctcag tgcccttttg 4620ttctaaacaa agaattttgt aattggttct accaaagaag gatataatga agtcactatg 4680ggaaaagatg gggaggagag ttgtaggatt ctacattaat tctcttgtgc ccttagccca 4740ctacttcaga atttcctgaa gaaagcaagc ctgaattggt tttttaaatt gctttaaaaa 4800atttttttaa ctgggttaat gcttgctgaa ttggaagtga atgtccattc ctttgcctct 4860tttgcagata tacacttcag ataactacac cgaggaaatg ggctcagggg actatgactc 4920catgaaggaa ccctgtttcc gtgaagaaaa tgctaatttc aataaaatct tcctgcccac 4980catctactcc atcatcttct taactggcat tgtgggcaat ggattggtca tcctggtcat 5040gggttaccag aagaaactga gaagcatgac ggacaagtac aggctgcacc tgtcagtggc 5100cgacctcctc tttgtcatca cgcttccctt ctgggcagtt gatgccgtgg caaactggta 5160ctttgggaac ttcctatgca aggcagtcca tgtcatttac acagtcaacc tctacagcag 5220tgtcctcatc ctggccttca tcagtctgga ccgctacctg gccatcgtcc acgccaccaa 5280cagtcagagg ccaaggaagc tgttggctga aaaggtggtc tatgttggcg tctggatccc 5340tgccctcctg ctgactattc ccgacttcat ctttgccaac gtcagtgagg cagatgacag 5400atatatctgt gaccgcttct accccaatga cttgtgggtg gttgtgttcc agtttcagca 5460catcatggtt ggccttatcc tgcctggtat tgtcatcctg tcctgctatt gcattatcat 5520ctccaagctg tcacactcca agggccacca gaagcgcaag gccctcaaga ccacagtcat 5580cctcatcctg gctttcttcg cctgttggct gccttactac attgggatca gcatcgactc 5640cttcatcctc ctggaaatca tcaagcaagg gtgtgagttt gagaacactg tgcacaagtg 5700gatttccatc accgaggccc tagctttctt ccactgttgt ctgaacccca tcctctatgc 5760tttccttgga gccaaattta aaacctctgc ccagcacgca ctcacctctg tgagcagagg 5820gtccagcctc aagatcctct ccaaaggaaa gcgaggtgga cattcatctg tttccactga 5880gtctgagtct tcaagttttc actccagcta acacagatgt aaaagacttt tttttatacg 5940ataaataact tttttttaag ttacacattt ttcagatata aaagactgac caatattgta 6000cagtttttat tgcttgttgg atttttgtct tgtgtttctt tagtttttgt gaagtttaat 6060tgacttattt atataaattt tttttgtttc atattgatgt gtgtctaggc aggacctgtg 6120gccaagttct tagttgctgt atgtctcgtg gtaggactgt agaaaaggga actgaacatt 6180ccagagcgtg tagtgaatca cgtaaagcta gaaatgatcc ccagctgttt atgcatagat 6240aatctctcca ttcccgtgga acgtttttcc tgttcttaag acgtgatttt gctgtagaag 6300atggcactta taaccaaagc ccaaagtggt atagaaatgc tggtttttca gttttcagga 6360gtgggttgat ttcagcacct acagtgtaca gtcttgtatt aagttgttaa taaaagtaca 6420tgttaaactt acttagtgtt atgttctgat ttctgttgac attcttttgg ctagtagaag 6480acaaaagtaa tacatttatg gtatgcaaag cactatccta ggtatttcat tgtaatattt 6540tacttacccc ttatcacaac tctgatagat tctgcttctg ttactaatta cattttatag 6600aagaggaaac ggaggcacag aaagcctaag taacttggtt aaaggcatgt agtaagtatc 6660aaatcctgta ttttaaacca ggtaacatga cttaacgaat ctgaagcctt caccacttta 6720aattcaaatg gaagtttaga aatggccagc cagcacctat ttgtatgaaa ggtcatcttt 6780cagaggataa gcatgtataa agaagaaaag gtatgcagtc gtgtttggat tttactccac 6840catccacttg tgaaacccag gtctgtgcaa tgccagacgg tgtgtgcttt cctcatccag 6900tatcctcagt gtagataacc atcactccct tttcacagac aagagaactg agattcagag 6960actttccata cattgcactt tcaagggggc aaagccaaga actaattctg tttattgttc 7020cagctcttgc tcttaactct tacctactat tgcccttcag aacacctggg cataagtcaa 7080ctgaactgct aataaagaaa gccaaaagtg aatgttttct tcataaaatt aaccatgacc 7140aaaatactcc tcttgtaata tcttctatgc aaatctcaac acttttattc ttaaactatc 7200gcaacaccta gcacctcctc aaggactcag ccaagcagct acaagttaat actgatattt 7260gttagagtca gaaggaaggt ccactgaagc aagctccctg ttgctcacat tttgcacaag 7320attttggaga cttatgtaac cacccgttgc tattaacacg accattgtgc aagccccagg 7380ctcttgagta aatttcagct ttggtttcta tttaaagata atttctaaac tctagccata 7440cctacctcac attggaacac aaacagggta cactccaggc atgcactcag ataataagta 7500ggatataatt acgacaatat ttggtctact tttagtaatt gtttctggca cagaaaatcc 7560attttggagg aaaaattgca atgccttatc tttctgaggc aaatcacatt tgttcaaggc 7620aaattataga tcctgtgaag ggaaataact taattactta aaatagaatc caatttggct 7680gtacattttt gctgccgtct atggatctgg ggtaattcaa agtggtattc atattctact 7740tgaggacaca attagatttc agataggaaa ttatcttgag gtttcttggt tttccctgag 7800aagcctaatt ggatcaccct tcatttaagc atagttttac atgcactctc tcaaaggctt 7860agtcttaaag ccacaaccat tgagacagac ttcacttgaa ccctctctat aaatatttat 7920tctccgggag acaatagaag aaatccttgg aaggcatgct ttttctttct catcttggct 7980tgaaacctcc ttaccccaga ttcctctcct ttaccgtgga gtcacaacaa aaggaactga 8040gccaaaacaa aattcccagt gtcaccagtc ttaatggata tttcattctc ccttggaaca 8100aagatggaat agcttttttt ccaaaagaaa aacaagcctt ggctctctcc ctgccccaaa 8160agggtgcccc ccacccccat cattctctgt cccaaccctg ccatgttaga gcgtctccaa 8220agccttccct gtgtcgtggt ttgtctgaca atgtggggaa acccagtctg ctggccagcc 8280cttgcatgaa gtagctgatt gttccctctc ctcatccctt atgaatgggg cccttgaagt 8340tcagtcatgt agattcagtt gtataatgaa agctaaaata tttaaattgt atgcatgctg 8400ccaataacag catacatctg acatctaact tattaataac attaagcctg caactagggg 8460ggaaagtgga tgttttttct tgcaaagcct ttgttttcct aaaatgacac ttgaaaattt 8520atctccccct actgcaggct tcccagcccc cttttataat tatgcttaaa ttaaaataat 8580gattctggga tactcttttg gggagatacc ctacaggctt tattttaata attgaactaa 8640gtgtttgtga ctttctccta gatattgtca aatattaaat aaaggctcca taaacaattg 8700agctgtctta ttcccagata atacccattt aggaggggca aggatcc 874726568PRTHomo sapiens 26Met Asp Val Asp Glu Gly Gln Asp Met Ser Gln Val Ser Gly Lys Glu1 5 10 15Ser Pro Pro Val Ser Asp Thr Pro Asp Glu Gly Asp Glu Pro Met Pro 20 25 30Val Pro Glu Asp Leu Ser Thr Thr Ser Gly Ala Gln Gln Asn Ser Lys 35 40 45Ser Asp Arg Gly Met Ala Ser Asn Val Lys Val Glu Thr Gln Ser Asp 50 55 60Glu Glu Asn Gly Arg Ala Cys Glu Met Asn Gly Glu Glu Cys Ala Glu65 70 75 80Asp Leu Arg Met Leu Asp Ala Ser Gly Glu Lys Met Asn Gly Ser His 85 90 95Arg Asp Gln Gly Ser Ser Ala Leu Ser Gly Val Gly Gly Ile Arg Leu 100 105 110Pro Asn Gly Lys Leu Lys Cys Asp Ile Cys Gly Ile Val Cys Ile Gly 115 120 125Pro Asn Val Leu Met Val His Lys Arg Ser His Thr Gly Glu Arg Pro 130 135 140Phe Gln Cys Asn Gln Cys Ser Ser Ala Leu Ser Gly Val Gly Gly Ile145 150 155 160Arg Leu Pro Asn Gly Lys Leu Lys Cys Asp Ile Cys Gly Ile Val Cys 165 170 175Ile Gly Pro Asn Val Leu Met Val His Lys Arg Ser His Thr Gly Glu 180 185 190Arg Pro Phe Gln Cys Asn Gln Cys Gly Ala Ser Phe Thr Gln Lys Gly 195 200 205Asn Leu Leu Arg His Ile Lys Leu His Ser Gly Glu Lys Pro Phe Lys 210 215 220Cys His Leu Cys Asn Tyr Ala Cys Arg Arg Arg Asp Ala Leu Thr Gly225 230 235 240His Leu Arg Thr His Ser Val Gly Lys Pro His Lys Cys Gly Tyr Cys 245 250 255Gly Arg Ser Tyr Lys Gln Arg Ser Ser Leu Glu Glu His Lys Glu Arg 260 265 270Cys His Asn Tyr Leu Glu Ser Met Gly Leu Pro Gly Met Tyr Pro Val 275 280 285Ile Lys Glu Glu Thr Asn His Asn Glu Met Ala Glu Asp Leu Cys Lys 290 295 300Ile Gly Ala Glu Arg Ser Leu Val Leu Asp Arg Leu Ala Ser Asn Val305 310 315 320Ala Lys Arg Lys Ser Ser Met Pro Gln Lys Phe Leu Gly Asp Lys Cys 325 330 335Leu Ser Asp Met Pro Tyr Asp Ser Ala Asn Tyr Glu Lys Glu Asp Met 340 345 350Met Thr Ser His Val Met Asp Gln Ala Ile Asn Asn Ala Ile Asn Tyr 355 360 365Leu Gly Ala Glu Ser Leu Arg Pro Leu Val Gln Thr Pro Pro Gly Ser 370 375 380Ser Glu Val Val Pro Val Ile Ser Ser Met Tyr Gln Leu His Lys Pro385 390 395 400Pro Ser Asp Gly Pro Pro Arg Ser Asn His Ser Ala Gln Asp Ala Val 405 410 415Asp Asn Leu Leu Leu Leu Ser Lys Ala Lys Ser Val Ser Ser Glu Arg 420 425 430Glu Ala Ser Pro Ser Asn Ser Cys Gln Asp Ser Thr Asp Thr Glu Ser 435 440 445Asn Ala Glu Glu Gln Arg Ser Gly Leu Ile Tyr Leu Thr Asn His Ile 450 455 460Asn Pro His Ala Arg Asn Gly Leu Ala Leu Lys Glu Glu Gln Arg Ala465 470 475 480Tyr Glu Val Leu Arg Ala Ala Ser Glu Asn Ser Gln Asp Ala Phe Arg 485 490 495Val Val Ser Thr Ser Gly Glu Gln Leu Lys Val Tyr Lys Cys Glu His 500 505 510Cys Arg Val Leu Phe Leu Asp His Val Met Tyr Thr Ile His Met Gly 515 520 525Cys His Gly Cys His Gly Phe Arg Asp Pro Phe Glu Cys Asn Met Cys 530 535 540Gly Tyr His Ser Gln Asp Arg Tyr Glu Phe Ser Ser His Ile Thr Arg545 550 555 560Gly Glu His Arg Tyr His Leu Ser 56527350DNAHomo sapiens 27ccagagagta agaataggag gagaaaacat gctgcagatg taggcggggc ccagattgta 60gacagcatag aaataatttt gggcttttcc tgttaaattc ctctagcttc taggatacat 120tttttttaac ttttgtcttt gagataattt tagatttaca gaagagttgc aaaaagagta 180gagagagttc ctgtacaccc ttcacccagc ttcctctact gctaacatct tacataatca 240tagtttcaac ctgagaaatt agcatggggt acagtcctat taatgaaacc ccaggcttta 300ttcagatttc accaggtttt cagtaacatc ctttatctgt ttcagaattt

35028850DNAHomo sapiens 28gaattccggc aaaatgcatg acagtaacaa tgtggagaaa gacattacac catctgaatt 60gcctgcaaac ccaggttgtc tgcattcaaa agagcattct attaaagcta ccttaatttg 120gcgcttattt ttcttaatca tgtttctgac aatcatagtg tgtggaatgg ttgctgcttt 180aagcgcaata agagctaact gccatcaaga gccatcagta tgtcttcaag ctgcatgccc 240agaaagctgg attggttttc aaagaaagtg tttctatttt tctgatgaca ccaagaactg 300gacatcaagt cagaggtttt gtgactcaca agatgctgat cttgctcagg ttgaaagctt 360ccaggaactg aatttcctgt tgagatataa aggcccatct gatcactgga ttgggctgag 420cagagaacaa ggccaaccat ggaaatggat aaatggtact gaatggacaa gacagtttcc 480tatcctggga gcaggagagt gtgcctattt gaatgacaaa ggtgccagta gtgccaggca 540ctacacagag aggaagtgga tttgttccaa atcagatata catgtctaga tgttacagca 600aagccccaac taatctttag aagcatattg gaactgataa ctccatttta aaatgagcaa 660agaatttatt tcttatacca acaggtatat gaaaatatgc tcaatatcac taataactgg 720gaaaatacaa atcaaaatca tagtaaaata ttacctgttt tcatggtgct aatattacct 780gttctcccac tgctaatgac atacccgaga atgagtaatt tataaataaa agagatttaa 840ttgaaaaaaa 85029191PRTHomo sapiens 29Met His Asp Ser Asn Asn Val Glu Lys Asp Ile Thr Pro Ser Glu Leu1 5 10 15Pro Ala Asn Pro Gly Cys Leu His Ser Lys Glu His Ser Ile Lys Ala 20 25 30Thr Leu Ile Trp Arg Leu Phe Phe Leu Ile Met Phe Leu Thr Ile Ile 35 40 45Val Cys Gly Met Val Ala Ala Leu Ser Ala Ile Arg Ala Asn Cys His 50 55 60Gln Glu Pro Ser Val Cys Leu Gln Ala Ala Cys Pro Glu Ser Trp Ile65 70 75 80Gly Phe Gln Arg Lys Cys Phe Tyr Phe Ser Asp Asp Thr Lys Asn Trp 85 90 95Thr Ser Ser Gln Arg Phe Cys Asp Ser Gln Asp Ala Asp Leu Ala Gln 100 105 110Val Glu Ser Phe Gln Glu Leu Asn Phe Leu Leu Arg Tyr Lys Gly Pro 115 120 125Ser Asp His Trp Ile Gly Leu Ser Arg Glu Gln Gly Gln Pro Trp Lys 130 135 140Trp Ile Asn Gly Thr Glu Trp Thr Arg Gln Phe Pro Ile Leu Gly Ala145 150 155 160Gly Glu Cys Ala Tyr Leu Asn Asp Lys Gly Ala Ser Ser Ala Arg His 165 170 175Tyr Thr Glu Arg Lys Trp Ile Cys Ser Lys Ser Asp Ile His Val 180 185 19030558DNAHomo sapiensmodified_base(1)..(558)n = g, a, c or t 30ccatgggatg gctcttctga ccattggggg ccaggccagg ccaggccagg cttagggcag 60caaggaccag gccaaagggg cagggcctcc tttggagggg ttgaggggta catcctcggc 120tggtgtttgc atccaggggt ccagcaggat ctcttccagt gagggtcggg aagaaggttt 180gggggccagg caccggcgga ttagggcaca gcaatcttgg ggaaaacatg ggcttgggaa 240gtggagctca gcttccagaa tctcctggtc cctctcaaag ggaatgtccc cacacaccat 300gtcatagagg aggatgccca gtgaccagac agtggccggg agtgcatggt actggtgtcg 360agagatccac tctggggggc tgtacaccct tgtcccatca aagtcagtgt agggttcatc 420atgaagcagg gcaccagaac caaaatcaat gagtttggca cagccacggc gtaggtctat 480caggatgntc tcatccttga tgtcacgatg gacaactnca cgggaaatgg cagtgctgga 540tggctgccac tactttgg 558312088DNAHomo sapiens 31gaattcggca cgagcgcgcg gcgaatctca acgctgcgcc gtctgcgggc gcttccgggc 60caccagtttc tctgctttcc accctggcgc cccccagccc tggctcccca gctgcgctgc 120cccgggcgtc cacgccctgc gggcttagcg ggttcagtgg gctcaatctg cgcagcgcca 180cctccatgtt gaccaagcct ctacaggggc ctcccgcgcc ccccgggacc cccacgccgc 240cgccaggagg caaggatcgg gaagcgttcg aggccgagta tcgactcggc cccctcctgg 300gtaagggggg ctttggcacc gtcttcgcag gacaccgcct cacagatcga ctccaggtgg 360ccatcaaagt gattccccgg aatcgtgtgc tgggctggtc ccccttgtca gactcagtca 420catgcccact cgaagtcgca ctgctatgga aagtgggtgc aggtggtggg caccctggcg 480tgatccgcct gcttgactgg tttgagacac aggaaggctt catgctggtc ctcgagcggc 540ctttgcccgc ccaggatctc tttgactata tcacagagaa gggcccactg ggtgaaggcc 600caagccgctg cttctttggc caagtagtgg cagccatcca gcactgccat tcccgtggag 660ttgtccatcg tgacatcaag gatgagaaca tcctgataga cctacgccgt ggctgtgcca 720aactcattga ttttggttct ggtgccctgc ttcatgatga accctacact gactttgatg 780ggacaagggt gtacagcccc ccagagtgga tctctcgaca ccagtaccat gcactcccgg 840ccactgtctg gtcactgggc atcctcctct atgacatggt gtgtggggac attccctttg 900agagggacca ggagattctg gaagctgagc tccacttccc agcccatgtc tccccagact 960gctgtgccct aatccgccgg tgcctggccc ccaaaccttc ttcccgaccc tcactggaag 1020agatcctgct ggacccctgg atgcaaacac cagccgagga tgttacccct caacccctcc 1080aaaggaggcc ctgccccttt ggcctggtcc ttgctaccct aagcctggcc tggcctggcc 1140tggcccccaa tggtcagaag agccatccca tggccatgtc acagggatag atggacattt 1200gttgacttgg ttttacaggt cattaccagt cattaaagtc cagtattact aaggtaaggg 1260attgaggatc aggggttaga agacataaac caagtttgcc cagttccctt cccaatccta 1320caaaggagcc ttcctcccag aacctgtggt ccctgatttt ggagggggaa cttcttgctt 1380ctcattttgc taaggaagtt tattttggtg aagttgttcc cattttgagc cccgggactc 1440ttattttgat gatgtgtcac cccacattgg cacctcctac taccaccaca caaacttagt 1500tcatatgctt ttacttgggc aagggtgctt tccttccaat accccagtag cttttatttt 1560agtaaaggga ccctttcccc tagcctaggg tcccatattg ggtcaagctg cttacctgcc 1620tcagcccagg attttttatt ttgggggagg taatgccctg ttgttacccc aaggcttctt 1680tttttttttt tttttttttg ggtgagggga ccctactttg ttatcccaag tgctcttatt 1740ctggtgagaa gaaccttaat tccataattt gggaaggaat ggaagatgga caccaccgga 1800caccaccaga caataggatg ggatggatgg ttttttgggg gatgggctag gggaaataag 1860gcttgctgtt tgttttcctg gggcgctccc tccaattttg cagatttttg caacctcctc 1920ctgagccggg attgtccaat tactaaaatg taaataatca cgtattgtgg ggaggggagt 1980tccaagtgtg ccctcctttt ttttcctgcc tggattattt aaaaagccat gtgtggaaac 2040ccactattta ataaaagtaa tagaatcaga aaaaaaaaaa aaaaaaaa 208832334PRTHomo sapiens 32Met Leu Thr Lys Pro Leu Gln Gly Pro Pro Ala Pro Pro Gly Thr Pro1 5 10 15Thr Pro Pro Pro Gly Gly Lys Asp Arg Glu Ala Phe Glu Ala Glu Tyr 20 25 30Arg Leu Gly Pro Leu Leu Gly Lys Gly Gly Phe Gly Thr Val Phe Ala 35 40 45Gly His Arg Leu Thr Asp Arg Leu Gln Val Ala Ile Lys Val Ile Pro 50 55 60Arg Asn Arg Val Leu Gly Trp Ser Pro Leu Ser Asp Ser Val Thr Cys65 70 75 80Pro Leu Glu Val Ala Leu Leu Trp Lys Val Gly Ala Gly Gly Gly His 85 90 95Pro Gly Val Ile Arg Leu Leu Asp Trp Phe Glu Thr Gln Glu Gly Phe 100 105 110Met Leu Val Leu Glu Arg Pro Leu Pro Ala Gln Asp Leu Phe Asp Tyr 115 120 125Ile Thr Glu Lys Gly Pro Leu Gly Glu Gly Pro Ser Arg Cys Phe Phe 130 135 140Gly Gln Val Val Ala Ala Ile Gln His Cys His Ser Arg Gly Val Val145 150 155 160His Arg Asp Ile Lys Asp Glu Asn Ile Leu Ile Asp Leu Arg Arg Gly 165 170 175Cys Ala Lys Leu Ile Asp Phe Gly Ser Gly Ala Leu Leu His Asp Glu 180 185 190Pro Tyr Thr Asp Phe Asp Gly Thr Arg Val Tyr Ser Pro Pro Glu Trp 195 200 205Ile Ser Arg His Gln Tyr His Ala Leu Pro Ala Thr Val Trp Ser Leu 210 215 220Gly Ile Leu Leu Tyr Asp Met Val Cys Gly Asp Ile Pro Phe Glu Arg225 230 235 240Asp Gln Glu Ile Leu Glu Ala Glu Leu His Phe Pro Ala His Val Ser 245 250 255Pro Asp Cys Cys Ala Leu Ile Arg Arg Cys Leu Ala Pro Lys Pro Ser 260 265 270Ser Arg Pro Ser Leu Glu Glu Ile Leu Leu Asp Pro Trp Met Gln Thr 275 280 285Pro Ala Glu Asp Val Thr Pro Gln Pro Leu Gln Arg Arg Pro Cys Pro 290 295 300Phe Gly Leu Val Leu Ala Thr Leu Ser Leu Ala Trp Pro Gly Leu Ala305 310 315 320Pro Asn Gly Gln Lys Ser His Pro Met Ala Met Ser Gln Gly 325 330331215DNAHomo sapiens 33ggggggactt gagtatcctt tgttaccctc aggagatcct gaaaccagtc ccccatggat 60actgagggct gactgtatag tcctatcctc acggaacttt cattctaatg ggggaagact 120gactataaac aaaatatatg taataggtgg tggtaagtac cgtggagaag taacaaatgg 180ggcaaagtga gttatacagc tccattctta gaaaccttgg agtacttttc ttagtttata 240ctcgtggtgg tttccttttg tctcctttat tacatgggac tctgacatgt gcccatagct 300agggtgacag taggatctac ccgatagtag ggtggcagta ggatctaccc aaaaagcgtc 360ctgctgatac aggaccaaag catcctgttg ttctcgagcc tataaaaaga gctaatggtg 420ttgcttctct taactgtggc ctcctacact gtgttttgga tgattggtga tgtcttggat 480attctgtttc tttggaactt tgaatataca acactttact agggaattag caatggaagc 540agagcaaaga tgtacagagg aaacaatgcg taactctgat ggaattgaag tcatgaggca 600gcagagagct taaattacag ctttaaaaat ttttattttt tagagggaat ttacttggga 660gtaacagcag taatagttaa cggagccaga atgcttgagt catataattg caaagcagag 720ttgggagcaa cagatgctaa agagtagttg ctgtagttcc tctttgggtc gtaggagcag 780ttgtcatatt actatatagc tactgcatga agaagagttc ttagtgaggc ctgggtgatc 840agctcttctt agtattctgt gtgaccccat ttgacctttt aacaaatccc taagtaaata 900aatagcccct caggaaaact aagtttttct ctgctgtttt tttgcttgag agagctataa 960ctgtaataga cttatatttc tgaacatttt agtgcttgcc aatatttggt aatatttatg 1020tttcctatat ttgtaatgaa cattcttctt ccggtacatt ttttgttaaa ttattgtttg 1080atggataaaa gttcaccttt tattgtataa aattgactga gattaattta tacacattga 1140caatgggtaa atagaatttt tcagattatt aaaagctgaa ggatgcccac gtaagcaaaa 1200aaaaaaaaaa aaaaa 1215343144DNAHomo sapiens 34tcctctttcc gtgcgcgagt gcacagctcc ggaggcccga gccgaccctg gggcgtccgg 60tccggtggtc ttgcagcctc caaaccccga gtgctatacc gaactgcgcg ccaagggtgg 120gagagctgac ggcctgggcc acccttcttc cttcactggg caggctttga ggtgcttgtc 180ggtctggact gatgaaaatc catatgacct gaaagatgtc tgaaaattcc agtgacagtg 240attcatcttg tggttggact gtcatcagtc atgaggggtc agatatagaa atgttgaatt 300ctgtgacccc cactgacagc tgtgagcccg ccccagaatg ttcatcttta gagcaagagg 360agcttcaagc attgcagata gagcaaggag aaagcagcca aaatggcaca gtgcttatgg 420aagaaactgc ttatccagct ttggaggaaa ccagctcaac aattgaggca gaggaacaaa 480agatacccga agacagtatc tatattggaa ctgccagtga tgattctgat attgttaccc 540ttgagccacc taagttagaa gaaattggaa atcaagaagt tgtcattgtt gaagaagcac 600agagttcaga agactttaac atgggctctt cctctagcag ccagtatact ttctgtcagc 660cagaaactgt attttcatct cagcctagtg acgatgaatc aagtagtgat gaaaccagta 720atcagcccag tcctgccttt agacgacgcc gtgctaggaa gaagaccgtt tctgcttcag 780aatctgaaga ccggctagtt gctgaacaag aaactgaacc ttctaaggag ttgagtaaac 840gtcagttcag tagtggtctc aataagtgtg ttatacttgc tttggtgatt gcaatcagca 900tgggatttgg ccatttctat ggcacaattc agattcagaa gcgtcaacag ttagtcagaa 960agatacatga agatgaattg aatgatatga aggattatct ttcccagtgt caacaggaac 1020aagaatcttt tatagattat aagtcattga aagaaaatct tgcaaggtgt tggacactta 1080ctgaagcaga gaagatgtcc tttgaaactc agaaaacgaa ccttgctaca gaaaatcagt 1140atttaagagt atccctggag aaggaagaaa aagccttatc ctcattacag gaagagttaa 1200acaaactaag agaacagatt agaatattgg aagataaagg gacaagtact gaattagtta 1260aagaaaatca gaaacttaag cagcatttgg aagaggaaaa gcagaaaaaa cacagctttc 1320ttagtcaaag ggagactctg ttgacagaag caaagatgct aaagagagaa ctggagagag 1380aacgactagt aactacggct ttaagggggg aactccagca gttaagtggt agtcagttac 1440atggcaagtc agattctccc aatgtatata ctgaaaaaaa ggaaatagca atcttacggg 1500aaagactcac tgagctggaa cggaagctaa ccttcgaaca gcagcgttct gatttgtggg 1560aaagattgta tgttgaggca aaagatcaaa atggaaaaca aggaacagat ggaaaaaaga 1620aagggggcag aggaagccac agggctaaaa ataagtcaaa ggaaacattt ttgggttcag 1680ttaaggaaac atttgatgcc atgaagaatt ctaccaagga gtttgtaagg catcataaag 1740agaaaattaa gcaggctaaa gaagctgtga aggaaaatct gaaaaaattc tcagattcag 1800ttaaatccac tttcagacac tttaaagata ccaccaagaa tatctttgat gaaaagggta 1860ataaaagatt tggtgctaca aaagaagcag ctgaaaaacc aagaacagtt tttagtgact 1920atttacatcc acagtataag gcacctacag aaaaccattc aaggccctac tatgcaaaaa 1980gatggaagga agaaaagcca gttcacttta aagaattcag aaaaaataca aattcaaaga 2040aatgcagtcc tgggcatgat tgtagagaaa attctcattc tttcagaaag gcttgttctg 2100gtgtatttga ttgtgctcaa caagagtcca tgagcctttt taacacagtg gtgatcccta 2160taaggatgga tgaatttaga cagataattc aaaggtacat gttaaaagaa ctggatactt 2220tttgtcgctg gaacgaactt gatcagttca tcaataagtt tttcctaaac ggtgtcttta 2280tacatgatca gaagctcttc actgactttg ttaatgatgt taagattatc ttaggaaaca 2340tgaaggaata tgaagtagat aatgatggag tatttgagaa gttggatgaa tatatatata 2400gacacttctt tggtcacact ttttcccctc catatggacc caggtcggtt tacataaaac 2460cgtgtcatta cagtagtttg taacatttgt agattggata cgatttttat gatttgatga 2520gtttcttgta aggttaccgt ttctaagagt tgtgctttat ggccactgag agaattcaga 2580ataaattgaa agatggagtc taaaaattat tagctgttac aaatggaaca atttcattat 2640aacgtgatca ctttgacttg agcaaatggt ttaattttta tcttaaaatc agttaagaat 2700atataaaatc ctacctttgg ccaagtttgt ttcttttcat tatagtttat atgaaaagat 2760caccttaagt gaaattattt tccttatttt cctttaatct tttatgtatt tattcacttc 2820tggaagctag gaatgagcaa cacaaatttt actctgaagt cagaagagct catatatata 2880attctaatgt cccacctatg tccattccat gtaccagctt agttatatac tagtcacata 2940attatctttg ataaaggtag aggcacaaag aggcaaacta acaagtcaaa ttctaatgtg 3000tgtacttcat aataattttt tatccatttt catcttcttt atctttatat tctgtaacat 3060gaaacttacc taatcttcaa atgttagctt cattttttac ctttgaaata cttaatcttt 3120ctgaataaat ataatggtct ataa 314435755PRTHomo sapiens 35Met Ser Glu Asn Ser Ser Asp Ser Asp Ser Ser Cys Gly Trp Thr Val1 5 10 15Ile Ser His Glu Gly Ser Asp Ile Glu Met Leu Asn Ser Val Thr Pro 20 25 30Thr Asp Ser Cys Glu Pro Ala Pro Glu Cys Ser Ser Leu Glu Gln Glu 35 40 45Glu Leu Gln Ala Leu Gln Ile Glu Gln Gly Glu Ser Ser Gln Asn Gly 50 55 60Thr Val Leu Met Glu Glu Thr Ala Tyr Pro Ala Leu Glu Glu Thr Ser65 70 75 80Ser Thr Ile Glu Ala Glu Glu Gln Lys Ile Pro Glu Asp Ser Ile Tyr 85 90 95Ile Gly Thr Ala Ser Asp Asp Ser Asp Ile Val Thr Leu Glu Pro Pro 100 105 110Lys Leu Glu Glu Ile Gly Asn Gln Glu Val Val Ile Val Glu Glu Ala 115 120 125Gln Ser Ser Glu Asp Phe Asn Met Gly Ser Ser Ser Ser Ser Gln Tyr 130 135 140Thr Phe Cys Gln Pro Glu Thr Val Phe Ser Ser Gln Pro Ser Asp Asp145 150 155 160Glu Ser Ser Ser Asp Glu Thr Ser Asn Gln Pro Ser Pro Ala Phe Arg 165 170 175Arg Arg Arg Ala Arg Lys Lys Thr Val Ser Ala Ser Glu Ser Glu Asp 180 185 190Arg Leu Val Ala Glu Gln Glu Thr Glu Pro Ser Lys Glu Leu Ser Lys 195 200 205Arg Gln Phe Ser Ser Gly Leu Asn Lys Cys Val Ile Leu Ala Leu Val 210 215 220Ile Ala Ile Ser Met Gly Phe Gly His Phe Tyr Gly Thr Ile Gln Ile225 230 235 240Gln Lys Arg Gln Gln Leu Val Arg Lys Ile His Glu Asp Glu Leu Asn 245 250 255Asp Met Lys Asp Tyr Leu Ser Gln Cys Gln Gln Glu Gln Glu Ser Phe 260 265 270Ile Asp Tyr Lys Ser Leu Lys Glu Asn Leu Ala Arg Cys Trp Thr Leu 275 280 285Thr Glu Ala Glu Lys Met Ser Phe Glu Thr Gln Lys Thr Asn Leu Ala 290 295 300Thr Glu Asn Gln Tyr Leu Arg Val Ser Leu Glu Lys Glu Glu Lys Ala305 310 315 320Leu Ser Ser Leu Gln Glu Glu Leu Asn Lys Leu Arg Glu Gln Ile Arg 325 330 335Ile Leu Glu Asp Lys Gly Thr Ser Thr Glu Leu Val Lys Glu Asn Gln 340 345 350Lys Leu Lys Gln His Leu Glu Glu Glu Lys Gln Lys Lys His Ser Phe 355 360 365Leu Ser Gln Arg Glu Thr Leu Leu Thr Glu Ala Lys Met Leu Lys Arg 370 375 380Glu Leu Glu Arg Glu Arg Leu Val Thr Thr Ala Leu Arg Gly Glu Leu385 390 395 400Gln Gln Leu Ser Gly Ser Gln Leu His Gly Lys Ser Asp Ser Pro Asn 405 410 415Val Tyr Thr Glu Lys Lys Glu Ile Ala Ile Leu Arg Glu Arg Leu Thr 420 425 430Glu Leu Glu Arg Lys Leu Thr Phe Glu Gln Gln Arg Ser Asp Leu Trp 435 440 445Glu Arg Leu Tyr Val Glu Ala Lys Asp Gln Asn Gly Lys Gln Gly Thr 450 455 460Asp Gly Lys Lys Lys Gly Gly Arg Gly Ser His Arg Ala Lys Asn Lys465 470 475 480Ser Lys Glu Thr Phe Leu Gly Ser Val Lys Glu Thr Phe Asp Ala Met 485 490 495Lys Asn Ser Thr Lys Glu Phe Val Arg His His Lys Glu Lys Ile Lys 500 505 510Gln Ala Lys Glu Ala Val Lys Glu Asn Leu Lys Lys Phe Ser Asp Ser 515 520 525Val Lys Ser Thr Phe Arg His Phe Lys Asp Thr Thr Lys Asn Ile Phe 530 535 540Asp Glu Lys Gly Asn Lys Arg Phe Gly Ala Thr Lys Glu Ala Ala Glu545 550 555 560Lys Pro Arg Thr Val Phe Ser Asp Tyr Leu His Pro Gln Tyr Lys Ala 565 570 575Pro Thr Glu Asn His Ser Arg Pro Tyr Tyr Ala Lys Arg Trp Lys Glu 580 585 590Glu Lys Pro Val His Phe Lys Glu Phe Arg Lys

Asn Thr Asn Ser Lys 595 600 605Lys Cys Ser Pro Gly His Asp Cys Arg Glu Asn Ser His Ser Phe Arg 610 615 620Lys Ala Cys Ser Gly Val Phe Asp Cys Ala Gln Gln Glu Ser Met Ser625 630 635 640Leu Phe Asn Thr Val Val Ile Pro Ile Arg Met Asp Glu Phe Arg Gln 645 650 655Ile Ile Gln Arg Tyr Met Leu Lys Glu Leu Asp Thr Phe Cys Arg Trp 660 665 670Asn Glu Leu Asp Gln Phe Ile Asn Lys Phe Phe Leu Asn Gly Val Phe 675 680 685Ile His Asp Gln Lys Leu Phe Thr Asp Phe Val Asn Asp Val Lys Ile 690 695 700Ile Leu Gly Asn Met Lys Glu Tyr Glu Val Asp Asn Asp Gly Val Phe705 710 715 720Glu Lys Leu Asp Glu Tyr Ile Tyr Arg His Phe Phe Gly His Thr Phe 725 730 735Ser Pro Pro Tyr Gly Pro Arg Ser Val Tyr Ile Lys Pro Cys His Tyr 740 745 750Ser Ser Leu 75536558DNAHomo sapiensmodified_base(1)..(558)n = g, a, c or t 36ccatgggatg gctcttctga ccattggggg ccaggccagg ccaggccagg cttagggcag 60caaggaccag gccaaagggg cagggcctcc tttggagggg ttgaggggta catcctcggc 120tggtgtttgc atccaggggt ccagcaggat ctcttccagt gagggtcggg aagaaggttt 180gggggccagg caccggcgga ttagggcaca gcaatcttgg ggaaaacatg ggcttgggaa 240gtggagctca gcttccagaa tctcctggtc cctctcaaag ggaatgtccc cacacaccat 300gtcatagagg aggatgccca gtgaccagac agtggccggg agtgcatggt actggtgtcg 360agagatccac tctggggggc tgtacaccct tgtcccatca aagtcagtgt agggttcatc 420atgaagcagg gcaccagaac caaaatcaat gagtttggca cagccacggc gtaggtctat 480caggatgntc tcatccttga tgtcacgatg gacaactnca cgggaaatgg cagtgctgga 540tggctgccac tactttgg 5583786PRTHomo sapiensMOD_RES(1)..(86)Xaa = any amino acid 37Gln Val Val Ala Xaa Ile Gln His Cys His Ser Arg Gly Val Val His1 5 10 15Arg Asp Ile Lys Asp Glu Asn Ile Leu Ile Asp Leu Arg Arg Gly Cys 20 25 30Ala Lys Leu Ile Asp Phe Gly Ser Gly Ala Leu Leu His Asp Glu Pro 35 40 45Tyr Thr Asp Phe Asp Gly Thr Arg Val Tyr Ser Pro Pro Glu Trp Ile 50 55 60Ser Arg His Gln Tyr His Ala Leu Pro Ala Thr Val Trp Ser Leu Gly65 70 75 80Ile Xaa Leu Tyr Asp Met 8538584DNAHomo sapiensmodified_base(1)..(584)n = g, a, c or t 38aaataatcca ggcaggagaa gagaggaggg cacacttgga actcccctcc ccacaatacg 60tgattattta cattttagta attggacaat cccggctcag gaggaggttg caagaatctg 120caaaagttgg agggagcgcc ccaggagaac aaacagcaag ccttatttcc cctagcccat 180cccccaaaaa accatccatc ccatcctagt gtctggtggt gtccggtggt gtccatcttc 240cattccttcc caaattatgg aagtaaggtt cttctcacca gaataagagc acttgggata 300acagagtagg gtcccctcac ccaaaaaaaa aaaaaaaaan gaagccttgg ggtaacaaca 360gggcattacc tcccccagaa taaagaatcc tgggctgagg caggtaagca gcttgaccca 420atatgggacc ctaggctagg ggaaagggtc cctttactaa aataaaagct actggggtat 480tggaaggaaa gcacccttgc ccaagtaaga gcatatgaac taagtttgng tggnggtagt 540aggaggngcc aatgtggggt gacacatcat cagaataaga gtcc 584392052DNAHomo sapiens 39cgcgcgcggc gaatctcaac gctgcgccgt ctgcgggcgc ttccgggcca ccagtttctc 60tgctttccac cctggcgccc cccagccctg gctccccagc tgcgctgccc cgggcgtcca 120cgccctgcgg gcttagcggg ttcagtgggc tcaatctgcg cagcgccacc tccatgttga 180ccaagcctct acaggggcct cccgcgcccc ccgggacccc cacgccgccg ccaggaggca 240aggatcggga agcgttcgag gccgagtatc gactcggccc cctcctgggt aaggggggct 300ttggcaccgt cttcgcagga caccgcctca cagatcgact ccaggtggcc atcaaagtga 360ttccccggaa tcgtgtgctg ggctggtccc ccttgtcaga ctcagtcaca tgcccactcg 420aagtcgcact gctatggaaa gtgggtgcag gtggtgggca ccctggcgtg atccgcctgc 480ttgactggtt tgagacacag gagggcttca tgctggtcct cgagcggcct ttgcccgccc 540aggatctctt tgactatatc acagagaagg gcccactggg tgaaggccca agccgctgct 600tctttggcca agtagtggca gccatccagc actgccattc ccgtggagtt gtccatcgtg 660acatcaagga tgagaacatc ctgatagacc tacgccgtgg ctgtgccaaa ctcattgatt 720ttggttctgg tgccctgctt catgatgaac cctacactga ctttgatggg acaagggtgt 780acagcccccc agagtggatc tctcgacacc agtaccatgc actcccggcc actgtctggt 840cactgggcat cctcctctat gacatggtgt gtggggacat tccctttgag agggaccagg 900agattctgga agctgagctc cacttcccag cccatgtctc cccagactgc tgtgccctaa 960tccgccggtg cctggccccc aaaccttctt cccgaccctc actggaagag atcctgctgg 1020acccctggat gcaaacacca gccgaggatg tacccctcaa cccctccaaa ggaggccctg 1080cccctttggc ctggtccttg ctaccctaag cctggcctgg cctggcctgg cccccaatgg 1140tcagaagagc catcccatgg ccatgtcaca gggatagatg gacatttgtt gacttggttt 1200tacaggtcat taccagtcat taaagtccag tattactaag gtaagggatt gaggatcagg 1260ggttagaaga cataaaccaa gtctgcccag ttcccttccc aatcctacaa aggagccttc 1320ctcccagaac ctgtggtccc tgattctgga gggggaactt cttgcttctc attttgctaa 1380ggaagtttat tttggtgaag ttgttcccat tctgagcccc gggactctta ttctgatgat 1440gtgtcacccc acattggcac ctcctactac caccacacaa acttagttca tatgctctta 1500cttgggcaag ggtgctttcc ttccaatacc ccagtagctt ttattttagt aaagggaccc 1560tttcccctag cctagggtcc catattgggt caagctgctt acctgcctca gcccaggatt 1620ctttattctg ggggaggtaa tgccctgttg ttaccccaag gcttcttttt tttttttttt 1680tgggtgaggg gaccctactc tgttatccca agtgctctta ttctggtgag aagaacctta 1740cttccataat ttgggaagga atggaagatg gacaccaccg gacaccacca gacactagga 1800tgggatggat ggttttttgg gggatgggct aggggaaata aggcttgctg tttgttctcc 1860tggggcgctc cctccaactt ttgcagattc ttgcaacctc ctcctgagcc gggattgtcc 1920aattactaaa atgtaaataa tcacgtattg tggggagggg agttccaagt gtgccctcct 1980ctcttctcct gcctggatta tttaaaaagc catgtgtgga aacccactat ttaataaaag 2040taatagaatc ag 205240311PRTHomo sapiens 40Met Leu Thr Lys Pro Leu Gln Gly Pro Pro Ala Pro Pro Gly Thr Pro1 5 10 15Thr Pro Pro Pro Gly Gly Lys Asp Arg Glu Ala Phe Glu Ala Glu Tyr 20 25 30Arg Leu Gly Pro Leu Leu Gly Lys Gly Gly Phe Gly Thr Val Phe Ala 35 40 45Gly His Arg Leu Thr Asp Arg Leu Gln Val Ala Ile Lys Val Ile Pro 50 55 60Arg Asn Arg Val Leu Gly Trp Ser Pro Leu Ser Asp Ser Val Thr Cys65 70 75 80Pro Leu Glu Val Ala Leu Leu Trp Lys Val Gly Ala Gly Gly Gly His 85 90 95Pro Gly Val Ile Arg Leu Leu Asp Trp Phe Glu Thr Gln Glu Gly Phe 100 105 110Met Leu Val Leu Glu Arg Pro Leu Pro Ala Gln Asp Leu Phe Asp Tyr 115 120 125Ile Thr Glu Lys Gly Pro Leu Gly Glu Gly Pro Ser Arg Cys Phe Phe 130 135 140Gly Gln Val Val Ala Ala Ile Gln His Cys His Ser Arg Gly Val Val145 150 155 160His Arg Asp Ile Lys Asp Glu Asn Ile Leu Ile Asp Leu Arg Arg Gly 165 170 175Cys Ala Lys Leu Ile Asp Phe Gly Ser Gly Ala Leu Leu His Asp Glu 180 185 190Pro Tyr Thr Asp Phe Asp Gly Thr Arg Val Tyr Ser Pro Pro Glu Trp 195 200 205Ile Ser Arg His Gln Tyr His Ala Leu Pro Ala Thr Val Trp Ser Leu 210 215 220Gly Ile Leu Leu Tyr Asp Met Val Cys Gly Asp Ile Pro Phe Glu Arg225 230 235 240Asp Gln Glu Ile Leu Glu Ala Glu Leu His Phe Pro Ala His Val Ser 245 250 255Pro Asp Cys Cys Ala Leu Ile Arg Arg Cys Leu Ala Pro Lys Pro Ser 260 265 270Ser Arg Pro Ser Leu Glu Glu Ile Leu Leu Asp Pro Trp Met Gln Thr 275 280 285Pro Ala Glu Asp Val Pro Leu Asn Pro Ser Lys Gly Gly Pro Ala Pro 290 295 300Leu Ala Trp Ser Leu Leu Pro305 31041105DNAHomo sapiensmodified_base(1)..(105)n = g, a, c or t 41ctggaactgc acntagtccc agctctcctc ggccgcggtc tcctcggggn tggtgccgta 60cttttggatg gttttctcta cnacntcccg caagcttccn tccag 105421125DNAHomo sapiens 42gtctccccca ctgtcagcac ctcttctgtg tggtgagtgg accgcttacc ccactaggtg 60aagatgtcag cccaggagag ctgcctcagc ctcatcaagt acttcctctt cgttttcaac 120ctcttcttct tcgtcctcgg cagcctgatc ttctgcttcg gcatctggat cctcatcgac 180aagaccagct tcgtgtcctt tgtgggcttg gccttcgtgc ctctgcagat ctggtccaaa 240gtcctggcca tctcaggaat cttcaccatg ggcatcgccc tcctgggttg tgtgggggcc 300ctcaaggagc tccgctgcct cctgggcctg tattttggga tgctgctgct cctgtttgcc 360acacagatca ccctgggaat cctcatctcc actcagcggg cccagctgga gcgaagcttg 420cgggacgtcg tagagaaaac catccaaaag tacggcacca accccgagga gaccgcggcc 480gaggagagct gggactatgt gcagttccag ctgcgctgct gcggctggca ctacccgcag 540gactggttcc aagtcctcat cctgagaggt aacgggtcgg aggcgcaccg cgtgccctgc 600tcctgctaca acttgtcggc gaccaacgac tccacaatcc tagataaggt gatcttgccc 660cagctcagca ggcttggaca cctggcgcgg tccagacaca gtgcagacat ctgcgctgtc 720cctgcagaga gccacatcta ccgcgagggc tgcgcgcagg gcctccagaa gtggctgcac 780aacaacctta tttccatagt gggcatttgc ctgggcgtcg gcctactcga gctcgggttc 840atgacgctct cgatattcct gtgcagaaac ctggaccacg tctacaaccg gctcgctcga 900taccgttagg ccccgccctc cccaaagtcc cgccccgccc ccgtcacgtg cgctgggcac 960ttccctgctg cctgtaaata tttgtttaat ccccagttcg cctggagccc tccgccttca 1020cattcccctg gggacccacg tggctgcgtg cccctgctgc tgtcacctct cccacgggac 1080ctggggcttt cgtccacagc ttcctgtccc catctgtcgg cctac 112543281PRTHomo sapiens 43Met Ser Ala Gln Glu Ser Cys Leu Ser Leu Ile Lys Tyr Phe Leu Phe1 5 10 15Val Phe Asn Leu Phe Phe Phe Val Leu Gly Ser Leu Ile Phe Cys Phe 20 25 30Gly Ile Trp Ile Leu Ile Asp Lys Thr Ser Phe Val Ser Phe Val Gly 35 40 45Leu Ala Phe Val Pro Leu Gln Ile Trp Ser Lys Val Leu Ala Ile Ser 50 55 60Gly Ile Phe Thr Met Gly Ile Ala Leu Leu Gly Cys Val Gly Ala Leu65 70 75 80Lys Glu Leu Arg Cys Leu Leu Gly Leu Tyr Phe Gly Met Leu Leu Leu 85 90 95Leu Phe Ala Thr Gln Ile Thr Leu Gly Ile Leu Ile Ser Thr Gln Arg 100 105 110Ala Gln Leu Glu Arg Ser Leu Arg Asp Val Val Glu Lys Thr Ile Gln 115 120 125Lys Tyr Gly Thr Asn Pro Glu Glu Thr Ala Ala Glu Glu Ser Trp Asp 130 135 140Tyr Val Gln Phe Gln Leu Arg Cys Cys Gly Trp His Tyr Pro Gln Asp145 150 155 160Trp Phe Gln Val Leu Ile Leu Arg Gly Asn Gly Ser Glu Ala His Arg 165 170 175Val Pro Cys Ser Cys Tyr Asn Leu Ser Ala Thr Asn Asp Ser Thr Ile 180 185 190Leu Asp Lys Val Ile Leu Pro Gln Leu Ser Arg Leu Gly His Leu Ala 195 200 205Arg Ser Arg His Ser Ala Asp Ile Cys Ala Val Pro Ala Glu Ser His 210 215 220Ile Tyr Arg Glu Gly Cys Ala Gln Gly Leu Gln Lys Trp Leu His Asn225 230 235 240Asn Leu Ile Ser Ile Val Gly Ile Cys Leu Gly Val Gly Leu Leu Glu 245 250 255Leu Gly Phe Met Thr Leu Ser Ile Phe Leu Cys Arg Asn Leu Asp His 260 265 270Val Tyr Asn Arg Leu Ala Arg Tyr Arg 275 280442915DNAHomo sapiens 44agccccgccg cgatgcccgc gcgcccagga cgcctcctcc cgctgctggc ccggccggcg 60gccctgactg cgctgctgct gctgctgctg ggccatggcg gcggcgggcg ctggggcgcc 120cgggcccagg aggcggcggc ggcggcggcg gacgggcccc ccgcggcaga cggcgaggac 180ggacaggacc cgcacagcaa gcacctgtac acggccgaca tgttcacgca cgggatccag 240agcgccgcgc acttcgtcat gttcttcgcg ccctggtgtg gacactgcca gcggctgcag 300ccgacttgga atgacctggg agacaaatac aacagcatgg aagatgccaa agtctatgtg 360gctaaagtgg actgcacggc ccactccgac gtgtgctccg cccagggggt gcgaggatac 420cccaccttaa agcttttcaa gccaggccaa gaagctgtga agtaccaggg tcctcgggac 480ttccagacac tggaaaactg gatgctgcag acactgaacg aggagccagt gacaccagag 540ccggaagtgg aaccgcccag tgcccccgag ctcaagcaag ggctgtatga gctctcagca 600agcaactttg agctgcacgt tgcacaaggc gaccacttta tcaagttctt cgctccgtgg 660tgtggtcact gcaaagccct ggctccaacc tgggagcagc tggctctggg ccttgaacat 720tccgaaactg tcaagattgg caaggttgat tgtacacagc actatgaact ctgctccgga 780aaccaggttc gtggctatcc cactcttctc tggttccgag atgggaaaaa ggtggatcag 840tacaagggaa agcgggattt ggagtcactg agggagtacg tggagtcgca gctgcagcgc 900acagagactg gagcgacgga gaccgtcacg ccctcagagg ccccggtgct ggcagctgag 960cccgaggctg acaagggcac tgtgttggca ctcactgaaa ataacttcga tgacaccatt 1020gcagaaggaa taaccttcat caagttttat gctccatggt gtggtcattg taggactctg 1080gctcctactt gggaggaact ctctaaaaag gaattccctg gtctggcggg ggtcaagatc 1140gccgaagtag actgcactgc tgaacggaat atctgcagca agtattcggt acgaggctac 1200cccacgttat tgcttttccg aggagggaag aaagtcagtg agcacagtgg aggcagagac 1260cttgactcgt tacaccgctt tgtcctgagc caagcgaaag acgaacttta ggaacacagt 1320tggaggtcac ctctcctgcc cagctcccgc accctgcgtt taggagttca gtcccacaga 1380ggccactggg ttcccagtgg tggctgttca gaaagcagaa catactaagc gtgaggtatc 1440ttctttgtgt gtgtgttttc caagccaaca cactctacag attctttatt aaatgtgtaa 1500ctcatggtca ctgtgtaaac attttcagtg gcgatatatc ccctttgacc ttctcttgat 1560gaaatttaca tggtttcctt tgagactaaa atagcgttga gggaaatgaa attgctggac 1620tatttgtggc tcctgagttg agtgattttg gtgaaagaaa gcacatccaa agcatagttt 1680acctgcccac gagttctgga aaggttgcct tgtggcagta ttgacgttcc tctgatctta 1740aggtcacagt tgactcaata ctgtgttggt ccgtagcatg gagcagattg aaatgcaaaa 1800acccacacct ctggaggata ccttcacggc cgctgctgga gcttctgttg ctgtgaatac 1860ttctctcagt gtgagaggtt agccgtgatg aaagcagcgt tacttctgac cgtgcctgag 1920taagagaatg ctgatgccat aactttatgt gtcgatactt gtcaaatcag ttactgttca 1980ggggatcctt ctgtttctca cggggtgaaa catgtcttta gttcctcatg ttaacacgaa 2040gccagagccc acatgaactg ttggatgtct tccttagaaa gggtaggcat ggaaaattcc 2100acgaggctca ttctcagtat ctcattaact cattgaaaga ttccagttgt atttgtcacc 2160tggggtgaca agaccagaca ggctttccca ggcctgggta tccagggagg ctctgcagcc 2220ctgctgaagg gccctaacta gagttctaga gtttctgatt ctgtttctca gtagtccttt 2280tagaggcttg ctatacttgg tctgcttcaa ggaggtcgac cttctaatgt atgaagaatg 2340ggatgcattt gatctcaaga ccaaagacag atgtcagtgg gctgctctgg ccctggtgtg 2400cacggctgtg gcagctgttg atgccagtgt cctctaactc atgctgtcct tgtgattaaa 2460cacctctatc tcccttggga ataagcacat acaggcttaa gctctaagat agataggtgt 2520ttgtcctttt accatcgagc tacttcccat aataaccact ttgcatccaa cactcttcac 2580ccacctccca tacgcaaggg gatgtggata cttggcccaa agtaactggt ggtaggaatc 2640ttagaaacaa gaccacttat actgtctgtc tgaggcagaa gataacagca gcatctcgac 2700cagcctctgc cttaaaggaa atctttatta atcacgtatg gttcacagat aattcttttt 2760ttaaaaaaac ccaacctcct agagaagcac aactgtcaag agtcttgtac acacaacttc 2820agctttgcat cacgagtctt gtattccaag aaaatcaaag tggtacaatt tgtttgttta 2880cactatgata ctttctaaat aaactccttt ttttt 291545432PRTHomo sapiens 45Met Pro Ala Arg Pro Gly Arg Leu Leu Pro Leu Leu Ala Arg Pro Ala1 5 10 15Ala Leu Thr Ala Leu Leu Leu Leu Leu Leu Gly His Gly Gly Gly Gly 20 25 30Arg Trp Gly Ala Arg Ala Gln Glu Ala Ala Ala Ala Ala Ala Asp Gly 35 40 45Pro Pro Ala Ala Asp Gly Glu Asp Gly Gln Asp Pro His Ser Lys His 50 55 60Leu Tyr Thr Ala Asp Met Phe Thr His Gly Ile Gln Ser Ala Ala His65 70 75 80Phe Val Met Phe Phe Ala Pro Trp Cys Gly His Cys Gln Arg Leu Gln 85 90 95Pro Thr Trp Asn Asp Leu Gly Asp Lys Tyr Asn Ser Met Glu Asp Ala 100 105 110Lys Val Tyr Val Ala Lys Val Asp Cys Thr Ala His Ser Asp Val Cys 115 120 125Ser Ala Gln Gly Val Arg Gly Tyr Pro Thr Leu Lys Leu Phe Lys Pro 130 135 140Gly Gln Glu Ala Val Lys Tyr Gln Gly Pro Arg Asp Phe Gln Thr Leu145 150 155 160Glu Asn Trp Met Leu Gln Thr Leu Asn Glu Glu Pro Val Thr Pro Glu 165 170 175Pro Glu Val Glu Pro Pro Ser Ala Pro Glu Leu Lys Gln Gly Leu Tyr 180 185 190Glu Leu Ser Ala Ser Asn Phe Glu Leu His Val Ala Gln Gly Asp His 195 200 205Phe Ile Lys Phe Phe Ala Pro Trp Cys Gly His Cys Lys Ala Leu Ala 210 215 220Pro Thr Trp Glu Gln Leu Ala Leu Gly Leu Glu His Ser Glu Thr Val225 230 235 240Lys Ile Gly Lys Val Asp Cys Thr Gln His Tyr Glu Leu Cys Ser Gly 245 250 255Asn Gln Val Arg Gly Tyr Pro Thr Leu Leu Trp Phe Arg Asp Gly Lys 260 265 270Lys Val Asp Gln Tyr Lys Gly Lys Arg Asp Leu Glu Ser Leu Arg Glu 275 280 285Tyr Val Glu Ser Gln Leu Gln Arg Thr Glu Thr Gly Ala Thr Glu Thr 290 295 300Val Thr Pro Ser Glu Ala Pro Val Leu Ala Ala Glu Pro Glu Ala Asp305 310 315 320Lys Gly Thr Val Leu Ala Leu Thr Glu Asn Asn Phe Asp Asp Thr Ile 325 330 335Ala Glu Gly Ile Thr Phe Ile Lys Phe Tyr Ala Pro Trp Cys Gly His 340 345 350Cys Arg Thr Leu Ala Pro Thr Trp Glu Glu Leu

Ser Lys Lys Glu Phe 355 360 365Pro Gly Leu Ala Gly Val Lys Ile Ala Glu Val Asp Cys Thr Ala Glu 370 375 380Arg Asn Ile Cys Ser Lys Tyr Ser Val Arg Gly Tyr Pro Thr Leu Leu385 390 395 400Leu Phe Arg Gly Gly Lys Lys Val Ser Glu His Ser Gly Gly Arg Asp 405 410 415Leu Asp Ser Leu His Arg Phe Val Leu Ser Gln Ala Lys Asp Glu Leu 420 425 43046551DNAHomo sapiens 46ccagccagtg acagaaaaaa gagtgaatgt gcctttaaga agaagagcaa tgagacacag 60tgtttcaact tcatccgtgt cctggtttct tacaatgtca cccatctcta cacctgcggc 120accttcgcct tcagccctgc ttgtaccttc attgaacttc aagattccta cctgttgccc 180atctcggagg acaaggtcat ggagggaaaa ggccaaagcc cctttgaccc cgctcacaag 240catacggctg tcttggtgga tgggatgctc tattctggta ctatgaacaa cttcctgggc 300agtgagccca tcctgatgcg cacactggga tcccagcctg tcctcaagac cgacaacttc 360ctccgctggc tgcatcatga cgcctccttt gtggcagcca tcccttcgac ccaggtcgtc 420tacttcttct tcgaggagac agccagcgag tttgacttct ttgagaggct ccacacatcg 480cgggtggcta gagtctgcaa gaatgacgtg ggcggcgaaa agctgctgca gaagaagtgg 540accaccttcc t 551473252DNAHomo sapiens 47aggatgatga aagtgagacc gtcttagggc ccttccagat agtgaacctt ctctgcccca 60atgccccacc cctgccacca atacacacgc ttctgctgcc tggggctctc ctattggtcc 120tcggggggat gtggtaagaa ctgctcaccc agaaagtgcc cgggtgcctg tttccccaga 180cctccctggt gacagtctgt ggctgagcat ggccctccca gccctgggcc tggacccctg 240gagcctcctg ggccttttcc tcttccaact gcttcagctg ctgctgccga cgacgaccgc 300ggggggaggc gggcaggggc ccatgcccag ggtcagatac tatgcagggg atgaacgtag 360ggcacttagc ttcttccacc agaagggcct ccaggatttt gacactctgc tcctgagtgg 420tgatggaaat actctctacg tgggggctcg agaagccatt ctggccttgg atatccagga 480tccaggggtc cccaggctaa agaacatgat accgtggcca gccagtgaca gaaaaaagag 540tgaatgtgcc tttaagaaga agagcaatga gacacagtgt ttcaacttca tccgtgtcct 600ggtttcttac aatgtcaccc atctctacac ctgcggcacc ttcgccttca gccctgcttg 660taccttcatt gaacttcaag attcctacct gttgcccatc tcggaggaca aggtcatgga 720gggaaaaggc caaagcccct ttgaccccgc tcacaagcat acggctgtct tggtggatgg 780gatgctctat tctggtacta tgaacaactt cctgggcagt gagcccatcc tgatgcgcac 840actgggatcc cagcctgtcc tcaagaccga caacttcctc cgctggctgc atcatgacgc 900ctcctttgtg gcagccatcc cttcgaccca ggtcgtctac ttcttcttcg aggagacagc 960cagcgagttt gacttctttg agaggctcca cacatcgcgg gtggctagag tctgcaagaa 1020tgacgtgggc ggcgaaaagc tgctgcagaa gaagtggacc accttcctga aggcccagct 1080gctctctgca cccagccggg gcagctgccc ttcaacgtca tccgccacgc ggtcctgctc 1140cccgccgatt ctcccacagc tccccacatc tacgcagtct tcacctccca gtgggcaggt 1200tggcgggacc aggagctctg cggtttgtgc cttctctctc ttggacattg aacgtgtctt 1260taaggggaaa ttcaaagagt tgaacaaaga aacttcacgc tggactactt ataggggccc 1320tgagaccaac ccccggccag gcagttgctc agtgggcccc tcctctgata aggccctgac 1380cttcatgaag gaccatttcc tgatggatga gcaagtggtg gggacgcccc tgctggtgaa 1440atctggcgtg gagtatacac ggcttgcagt ggagacagcc cagggccttg atgggcacag 1500ccatcttgtc atgtacctgg gaaccaccac agggtcgctc cacaaggctg tggtaagtgg 1560ggacagcagt gctcatctgg tggaagagat tcagctgttc cctgaccctg aacctgttcg 1620caacctgcag ctggccccca cccagggtgc agtgtttgta ggcttctcag gaggtgtctg 1680gagggtgccc cgagccaact gtagtgtcta tgagagctgt gtggactgtg tccttgcccg 1740ggacccccac tgtgcctggg accctgagtc ccgaacctgt tgcctcctgt ctgcccccaa 1800cctgaactcc tggaagcagg acatggagcg ggggaaccca gagtgggcat gtgccagtgg 1860ccccatgagc aggagccttc ggcctcagag ccgcccgcaa atcattaaag aagtcctggc 1920tgtccccaac tccatcctgg agctcccctg cccccacctg tcagccttgg cctcttatta 1980ttggagtcat ggcccagcag cagtcccaga agcctcttcc actgtctaca atggctccct 2040cttgctgata gtgcaggatg gagttggggg tctctaccag tgctgggcaa ctgagaatgg 2100cttttcatac cctgtgatct cctactgggt ggacagccag gaccagaccc tggccctgga 2160tcctgaactg gcaggcatcc cccgggagca tgtgaaggtc ccgttgacca gggtcagtgg 2220tggggccgcc ctggctgccc agcagtccta ctggccccac tttgtcactg tcactgtcct 2280ctttgcctta gtgctttcag gagccctcat catcctcgtg gcctccccat tgagagcact 2340ccgggctcgg ggcaaggttc agggctgtga gaccctgcgc cctggggaga aggccccgtt 2400aagcagagag caacacctcc agtctcccaa ggaatgcagg acctctgcca gtgatgtgga 2460cgctgacaac aactgcctag gcactgaggt agcttaaact ctaggcacag gccggggctg 2520cggtgcaggc acctggccat gctggctggg cggcccaagc acagccctga ctaggatgac 2580agcagcacaa aagaccacct ttctcccctg agaggagctt ctgctactct gcatcactga 2640tgacactcag cagggtgatg cacagcagtc tgcctcccct atgggactcc cttctaccaa 2700gcacatgagc tctctaacag ggtgggggct acccccagac ctgctcctac actgatattg 2760aagaacctgg agaggatcct tcagttctgg ccattccagg gaccctccag aaacacagtg 2820tttcaagaga ccctaaaaaa cctgcctgtc ccaggaccct atggtaatga acaccaaaca 2880tctaaacaat catatgctaa catgccactc ctggaaactc cactctgaag ctgccgcttt 2940ggacaccaac actcccttct cccagggtca tgcagggatc tgctccctcc tgcttccctt 3000accagtcgtg caccgctgac tcccaggaag tctttcctga agtctgacca cctttcttct 3060tgcttcagtt ggggcagact ctgatccctt ctgccctggc agaatggcag gggtaatctg 3120agccttcttc actcctttac cctagctgac cccttcacct ctccccctcc cttttccttt 3180gttttgggat tcagaaaact gcttgtcaga gactgtttat tttttattaa aaatataagg 3240cttatgtatg at 325248762PRTHomo sapiens 48Met Ala Leu Pro Ala Leu Gly Leu Asp Pro Trp Ser Leu Leu Gly Leu1 5 10 15Phe Leu Phe Gln Leu Leu Gln Leu Leu Leu Pro Thr Thr Thr Ala Gly 20 25 30Gly Gly Gly Gln Gly Pro Met Pro Arg Val Arg Tyr Tyr Ala Gly Asp 35 40 45Glu Arg Arg Ala Leu Ser Phe Phe His Gln Lys Gly Leu Gln Asp Phe 50 55 60Asp Thr Leu Leu Leu Ser Gly Asp Gly Asn Thr Leu Tyr Val Gly Ala65 70 75 80Arg Glu Ala Ile Leu Ala Leu Asp Ile Gln Asp Pro Gly Val Pro Arg 85 90 95Leu Lys Asn Met Ile Pro Trp Pro Ala Ser Asp Arg Lys Lys Ser Glu 100 105 110Cys Ala Phe Lys Lys Lys Ser Asn Glu Thr Gln Cys Phe Asn Phe Ile 115 120 125Arg Val Leu Val Ser Tyr Asn Val Thr His Leu Tyr Thr Cys Gly Thr 130 135 140Phe Ala Phe Ser Pro Ala Cys Thr Phe Ile Glu Leu Gln Asp Ser Tyr145 150 155 160Leu Leu Pro Ile Ser Glu Asp Lys Val Met Glu Gly Lys Gly Gln Ser 165 170 175Pro Phe Asp Pro Ala His Lys His Thr Ala Val Leu Val Asp Gly Met 180 185 190Leu Tyr Ser Gly Thr Met Asn Asn Phe Leu Gly Ser Glu Pro Ile Leu 195 200 205Met Arg Thr Leu Gly Ser Gln Pro Val Leu Lys Thr Asp Asn Phe Leu 210 215 220Arg Trp Leu His His Asp Ala Ser Phe Val Ala Ala Ile Pro Ser Thr225 230 235 240Gln Val Val Tyr Phe Phe Phe Glu Glu Thr Ala Ser Glu Phe Asp Phe 245 250 255Phe Glu Arg Leu His Thr Ser Arg Val Ala Arg Val Cys Lys Asn Asp 260 265 270Val Gly Gly Glu Lys Leu Leu Gln Lys Lys Trp Thr Thr Phe Leu Lys 275 280 285Ala Gln Leu Leu Ser Ala Pro Ser Arg Gly Ser Cys Pro Ser Thr Ser 290 295 300Ser Ala Thr Arg Ser Cys Ser Pro Pro Ile Leu Pro Gln Leu Pro Thr305 310 315 320Ser Thr Gln Ser Ser Pro Pro Ser Gly Gln Val Gly Gly Thr Arg Ser 325 330 335Ser Ala Val Cys Ala Phe Ser Leu Leu Asp Ile Glu Arg Val Phe Lys 340 345 350Gly Lys Phe Lys Glu Leu Asn Lys Glu Thr Ser Arg Trp Thr Thr Tyr 355 360 365Arg Gly Pro Glu Thr Asn Pro Arg Pro Gly Ser Cys Ser Val Gly Pro 370 375 380Ser Ser Asp Lys Ala Leu Thr Phe Met Lys Asp His Phe Leu Met Asp385 390 395 400Glu Gln Val Val Gly Thr Pro Leu Leu Val Lys Ser Gly Val Glu Tyr 405 410 415Thr Arg Leu Ala Val Glu Thr Ala Gln Gly Leu Asp Gly His Ser His 420 425 430Leu Val Met Tyr Leu Gly Thr Thr Thr Gly Ser Leu His Lys Ala Val 435 440 445Val Ser Gly Asp Ser Ser Ala His Leu Val Glu Glu Ile Gln Leu Phe 450 455 460Pro Asp Pro Glu Pro Val Arg Asn Leu Gln Leu Ala Pro Thr Gln Gly465 470 475 480Ala Val Phe Val Gly Phe Ser Gly Gly Val Trp Arg Val Pro Arg Ala 485 490 495Asn Cys Ser Val Tyr Glu Ser Cys Val Asp Cys Val Leu Ala Arg Asp 500 505 510Pro His Cys Ala Trp Asp Pro Glu Ser Arg Thr Cys Cys Leu Leu Ser 515 520 525Ala Pro Asn Leu Asn Ser Trp Lys Gln Asp Met Glu Arg Gly Asn Pro 530 535 540Glu Trp Ala Cys Ala Ser Gly Pro Met Ser Arg Ser Leu Arg Pro Gln545 550 555 560Ser Arg Pro Gln Ile Ile Lys Glu Val Leu Ala Val Pro Asn Ser Ile 565 570 575Leu Glu Leu Pro Cys Pro His Leu Ser Ala Leu Ala Ser Tyr Tyr Trp 580 585 590Ser His Gly Pro Ala Ala Val Pro Glu Ala Ser Ser Thr Val Tyr Asn 595 600 605Gly Ser Leu Leu Leu Ile Val Gln Asp Gly Val Gly Gly Leu Tyr Gln 610 615 620Cys Trp Ala Thr Glu Asn Gly Phe Ser Tyr Pro Val Ile Ser Tyr Trp625 630 635 640Val Asp Ser Gln Asp Gln Thr Leu Ala Leu Asp Pro Glu Leu Ala Gly 645 650 655Ile Pro Arg Glu His Val Lys Val Pro Leu Thr Arg Val Ser Gly Gly 660 665 670Ala Ala Leu Ala Ala Gln Gln Ser Tyr Trp Pro His Phe Val Thr Val 675 680 685Thr Val Leu Phe Ala Leu Val Leu Ser Gly Ala Leu Ile Ile Leu Val 690 695 700Ala Ser Pro Leu Arg Ala Leu Arg Ala Arg Gly Lys Val Gln Gly Cys705 710 715 720Glu Thr Leu Arg Pro Gly Glu Lys Ala Pro Leu Ser Arg Glu Gln His 725 730 735Leu Gln Ser Pro Lys Glu Cys Arg Thr Ser Ala Ser Asp Val Asp Ala 740 745 750Asp Asn Asn Cys Leu Gly Thr Glu Val Ala 755 76049182DNAHomo sapiensmodified_base(1)..(182)n = g, a, c or t 49accagcagtc ctgcggcacc tacctccgcg tgcgccagcc gccccccagg cccttcctgg 60acatggggga gggcaccaag aaccgaatca tcacagccga ggggatcatc ctcctgttct 120gcgcggtggt gcctgggacg ctgctgctgt tnaggaaacg atggcaagaa cganaactcn 180gg 1825060PRTHomo sapiensMOD_RES(1)..(60)Xaa = any amino acid 50Gln Gln Ser Cys Gly Thr Tyr Leu Arg Val Arg Gln Pro Pro Pro Arg1 5 10 15Pro Phe Leu Asp Met Gly Glu Gly Thr Lys Asn Arg Ile Ile Thr Ala 20 25 30Glu Gly Ile Ile Leu Leu Phe Cys Ala Val Val Pro Gly Thr Leu Leu 35 40 45Leu Xaa Arg Lys Arg Trp Gln Glu Arg Xaa Leu Xaa 50 55 6051182DNAHomo sapiensmodified_base(1)..(182)n = g, a, c or t 51accagcagtc ctgcggcacc tacctccgcg tgcgccagcc gccccccagg cccttcctgg 60acatggggga gggcaccaag aaccgaatca tcacagccga ggggatcatc ctcctgttct 120gcgcggtggt gcctgggacg ctgctgctgt tnaggaaacg atggcaagaa cganaactcn 180gg 1825260PRTHomo sapiensMOD_RES(1)..(60)Xaa = any amino acid 52Gln Gln Ser Cys Gly Thr Tyr Leu Arg Val Arg Gln Pro Pro Pro Arg1 5 10 15Pro Phe Leu Asp Met Gly Glu Gly Thr Lys Asn Arg Ile Ile Thr Ala 20 25 30Glu Gly Ile Ile Leu Leu Phe Cys Ala Val Val Pro Gly Thr Leu Leu 35 40 45Leu Xaa Arg Lys Arg Trp Gln Glu Arg Xaa Leu Xaa 50 55 60531107DNAHomo sapiens 53tgctgcaact caaactaacc aacccactgg gagaagatgc ctgggggtcc aggagtcctc 60caagctctgc ctgccaccat cttcctcctc ttcctgctgt ctgctgtcta cctgggccct 120gggtgccagg ccctgtggat gcacaaggtc ccagcatcat tgatggtgag cctgggggaa 180gacgcccact tccaatgccc gcacaatagc agcaacaacg ccaacgtcac ctggtggcgc 240gtcctccatg gcaactacac gtggccccct gagttcttgg gcccgggcga ggaccccaat 300ggtacgctga tcatccagaa tgtgaacaag agccatgggg gcatatacgt gtgccgggtc 360caggagggca acgagtcata ccagcagtcc tgcggcacct acctccgcgt gcgccagccg 420ccccccaggc ccttcctgga catgggggag ggcaccaaga accgaatcat cacagccgag 480gggatcatcc tcctgttctg cgcggtggtg cctgggacgc tgctgctgtt caggaaacga 540tggcagaacg agaagctcgg gttggatgcc ggggatgaat atgaagatga aaacctttat 600gaaggcctga acctggacga ctgctccatg tatgaggaca tctcccgggg cctccagggc 660acctaccagg atgtgggcag cctcaacata ggagatgtcc agctggagaa gccgtgacac 720ccctactcct gccaggctgc ccccgcctgc tgtgcaccca gctccagtgt ctcagctcac 780ttccctggga cattctcctt tcagcccttc tgggggcttc cttagtcata ttcccccagt 840ggggggtggg agggtaacct cactcttctc caggccaggc ctccttggac tcccctgggg 900gtgtcccact cttcttccct ctaaactgcc ccacctccta acctaatccc cacgccccgc 960tgcctttccc aggctcccct cacccagcgg gtaatgagcc cttaatcgct gcctctaggg 1020gagctgattg tagcagcctc gttagtgtca ccccctcctc cctgatctgt cagggccact 1080tagtgataat aaattcttcc caactgc 110754226PRTHomo sapiens 54Met Pro Gly Gly Pro Gly Val Leu Gln Ala Leu Pro Ala Thr Ile Phe1 5 10 15Leu Leu Phe Leu Leu Ser Ala Val Tyr Leu Gly Pro Gly Cys Gln Ala 20 25 30Leu Trp Met His Lys Val Pro Ala Ser Leu Met Val Ser Leu Gly Glu 35 40 45Asp Ala His Phe Gln Cys Pro His Asn Ser Ser Asn Asn Ala Asn Val 50 55 60Thr Trp Trp Arg Val Leu His Gly Asn Tyr Thr Trp Pro Pro Glu Phe65 70 75 80Leu Gly Pro Gly Glu Asp Pro Asn Gly Thr Leu Ile Ile Gln Asn Val 85 90 95Asn Lys Ser His Gly Gly Ile Tyr Val Cys Arg Val Gln Glu Gly Asn 100 105 110Glu Ser Tyr Gln Gln Ser Cys Gly Thr Tyr Leu Arg Val Arg Gln Pro 115 120 125Pro Pro Arg Pro Phe Leu Asp Met Gly Glu Gly Thr Lys Asn Arg Ile 130 135 140Ile Thr Ala Glu Gly Ile Ile Leu Leu Phe Cys Ala Val Val Pro Gly145 150 155 160Thr Leu Leu Leu Phe Arg Lys Arg Trp Gln Asn Glu Lys Leu Gly Leu 165 170 175Asp Ala Gly Asp Glu Tyr Glu Asp Glu Asn Leu Tyr Glu Gly Leu Asn 180 185 190Leu Asp Asp Cys Ser Met Tyr Glu Asp Ile Ser Arg Gly Leu Gln Gly 195 200 205Thr Tyr Gln Asp Val Gly Ser Leu Asn Ile Gly Asp Val Gln Leu Glu 210 215 220Lys Pro225551038DNAHomo sapiens 55atgtacaagg actgcatcga gtccactgga gactattttc ttctctgtga cgccgagggg 60ccatggggca tcattctgga gtccctggcc atacttggca tcgtggtcac aattctgcta 120ctcttagcat ttctcttcct catgcgaaag atccaagact gcagccagtg gaatgtcctc 180cccacccagc tcctcttcct cctgagtgtc ctggggctct tcggactcgc ttttgccttc 240atcatcgagc tcaatcaaca aactgccccc gtacgctact ttctctttgg ggttctcttt 300gctctctgtt tctcatgcct cttagctcat gcctccaatc tagtgaagct ggttcggggt 360tgtgtctcct tctcctggac gacaattctg tgcattgcta ttggttgcag tctgttgcaa 420atcattattg ccactgagta tgtgactctc atcatgacca gaggtatgat gtttgtgaat 480atgacaccct gccagctcaa tgtggacttt gttgtactcc tggtctatgt cctcttcctg 540atggccctca cattcttcgt ctccaaagcc accttctgtg gcccgtgtga gaactggaag 600cagcatggaa ggctcatctt tatcactgtg ctcttctcca tcatcatctg ggtggtgtgg 660atctccatgc tcctgagagg caacccgcag ttccagcgac agccccagtg ggacgacccg 720gtcgtctgca ttgctctggt caccaacgca tgggttttcc tgctgctgta catcgtccct 780gagctctgca ttctctacag atcgtgtaga caggagtgcc ctttacaagg caatgcctgc 840cccgtcacag cctaccaaca cagcttccaa gtggagaacc aggagctctc cagagcccga 900gacagtgatg gagctgagga ggatgtagca ttaacttcat atggtactcc cattcagccg 960cagactgttg atcccacaca agagtgtttc atcccacagg ctaaactaag cccccagcaa 1020gatgcaggag gagtataa 103856345PRTHomo sapiens 56Met Tyr Lys Asp Cys Ile Glu Ser Thr Gly Asp Tyr Phe Leu Leu Cys1 5 10 15Asp Ala Glu Gly Pro Trp Gly Ile Ile Leu Glu Ser Leu Ala Ile Leu 20 25 30Gly Ile Val Val Thr Ile Leu Leu Leu Leu Ala Phe Leu Phe Leu Met 35 40 45Arg Lys Ile Gln Asp Cys Ser Gln Trp Asn Val Leu Pro Thr Gln Leu 50 55 60Leu Phe Leu Leu Ser Val Leu Gly Leu Phe Gly Leu Ala Phe Ala Phe65 70 75 80Ile Ile Glu Leu Asn Gln Gln Thr Ala Pro Val Arg Tyr Phe Leu Phe 85 90 95Gly Val Leu Phe Ala Leu Cys Phe Ser Cys Leu Leu Ala His Ala Ser 100 105 110Asn Leu Val Lys Leu Val Arg Gly Cys Val Ser Phe Ser Trp Thr Thr 115 120 125Ile Leu Cys Ile Ala Ile Gly Cys Ser Leu Leu Gln Ile Ile Ile Ala 130 135 140Thr Glu Tyr Val Thr Leu Ile Met Thr Arg Gly Met Met Phe Val

Asn145 150 155 160Met Thr Pro Cys Gln Leu Asn Val Asp Phe Val Val Leu Leu Val Tyr 165 170 175Val Leu Phe Leu Met Ala Leu Thr Phe Phe Val Ser Lys Ala Thr Phe 180 185 190Cys Gly Pro Cys Glu Asn Trp Lys Gln His Gly Arg Leu Ile Phe Ile 195 200 205Thr Val Leu Phe Ser Ile Ile Ile Trp Val Val Trp Ile Ser Met Leu 210 215 220Leu Arg Gly Asn Pro Gln Phe Gln Arg Gln Pro Gln Trp Asp Asp Pro225 230 235 240Val Val Cys Ile Ala Leu Val Thr Asn Ala Trp Val Phe Leu Leu Leu 245 250 255Tyr Ile Val Pro Glu Leu Cys Ile Leu Tyr Arg Ser Cys Arg Gln Glu 260 265 270Cys Pro Leu Gln Gly Asn Ala Cys Pro Val Thr Ala Tyr Gln His Ser 275 280 285Phe Gln Val Glu Asn Gln Glu Leu Ser Arg Ala Arg Asp Ser Asp Gly 290 295 300Ala Glu Glu Asp Val Ala Leu Thr Ser Tyr Gly Thr Pro Ile Gln Pro305 310 315 320Gln Thr Val Asp Pro Thr Gln Glu Cys Phe Ile Pro Gln Ala Lys Leu 325 330 335Ser Pro Gln Gln Asp Ala Gly Gly Val 340 345572457DNAHomo sapiens 57ggcacgagga agggcctgtg ggtttattat aaggcggagc tcggcgggag aggtgcgggc 60cgaatccgag ccgagcggag aggaatccgg cagtagagag cggactccag ccggcggacc 120ctgcagccct cgcctgggac agcggcgcgc tgggcaggcg cccaagagag catcgagcag 180cggaacccgc gaagccggcc cgcagccgcg acccgcgcag cctgccgctc tcccgccgcc 240ggtccgggca gcatgaggcg cgcggcgctc tggctctggc tgtgcgcgct ggcgctgagc 300ctgcagccgg ccctgccgca aattgtggct actaatttgc cccctgaaga tcaagatggc 360tctggggatg actctgacaa cttctccggc tcaggtgcag gtgctttgca agatatcacc 420ttgtcacagc agaccccctc cacttggaag gacacgcagc tcctgacggc tattcccacg 480tctccagaac ccaccggcct ggaggctaca gctgcctcca cctccaccct gccggctgga 540gaggggccca aggagggaga ggctgtagtc ctgccagaag tggagcctgg cctcaccgcc 600cgggagcagg aggccacccc ccgacccagg gagaccacac agctcccgac cactcatcag 660gcctcaacga ccacagccac cacggcccag gagcccgcca cctcccaccc ccacagggac 720atgcagcctg gccaccatga gacctcaacc cctgcaggac ccagccaagc tgaccttcac 780actccccaca cagaggatgg aggtccttct gccaccgaga gggctgctga ggatggagcc 840tccagtcagc tcccagcagc agagggctct ggggagcagg acttcacctt tgaaacctcg 900ggggagaata cggctgtagt ggccgtggag cctgaccgcc ggaaccagtc cccagtggat 960cagggggcca cgggggcctc acagggcctc ctggacagga aagaggtgct gggaggggtc 1020attgccgtag gcctcgtggg gctcatcttt gctgtgtgcc tggtgggttt catgctgtac 1080cgcatgaaga agaaggacga aggcagctac tccttggagg agccgaaaca agccaacggc 1140ggggcctacc agaagcccac caaacaggag gaattctatg cctgacgcgg gagccatgcg 1200ccccctccgc cctgccactc actaggcccc cacttgcctc ttccttgaag aactgcaggc 1260cctggcctcc cctgccacca ggccacctcc ccagcattcc agcccctctg gtcgctcctg 1320cccacggagt cgtggggtgt gctgggagct ccactctgct tctctgactt ctgcctggag 1380acttagggca ccaggggttt ctcgcatagg acctttccac cacagccagc acctggcatc 1440gcaccattct gactcggttt ctccaaactg aagcagcctc tccccaggtc cagctctgga 1500ggggaggggg atccgactgc tttggaccta aatggcctca tgtggctgga agatcctgcg 1560ggtggggctt ggggctcaca cacctgtagc acttactggt aggaccaagc atcttggggg 1620ggtggccgct gagtggcagg ggacaggagt ccactttgtt tcgtggggag gtctaatcta 1680gatatcgact tgtttttgca catgtttcct ctagttcttt gttcatagcc cagtagacct 1740tgttacttct gaggtaagtt aagtaagttg attcggtatc cccccatctt gcttccctaa 1800tctatggtcg ggagacagca tcagggttaa gaagactttt tttttttttt tttttaaact 1860aggagaacca aatctggaag ccaaaatgta ggcttagttt gtgtgttgtc tcttgagttt 1920gtcgctcatg tgtgcaacag ggtatggact atctgtctgg tggccccgtt tctggtggtc 1980tgttggcagg ctggccagtc caggctgccg tggggccgcc gcctctttca agcagtcgtg 2040cctgtgtcca tgcgctcagg gccatgctga ggcctgggcc gctgccacgt tggagaagcc 2100cgtgtgagaa gtgaatgctg ggactcagcc ttcagacaga gaggactgta gggagggcgg 2160caggggcctg gagatcctcc tgcagaccac gcccgtcctg cctgtggcgc cgtctccagg 2220ggctgcttcc tcctggaaat tgacgagggg tgtcttgggc agagctggct ctgagcgcct 2280ccatccaagg ccaggttctc cgttagctcc tgtggcccca ccctgggccc tgggctggaa 2340tcaggaatat tttccaaaga gtgatagtct tttgcttttg gcaaaactct acttaatcca 2400atgggttttt ccctgtacag tagattttcc aaatgtaata aactttaata taaagta 245758310PRTHomo sapiens 58Met Arg Arg Ala Ala Leu Trp Leu Trp Leu Cys Ala Leu Ala Leu Ser1 5 10 15Leu Gln Pro Ala Leu Pro Gln Ile Val Ala Thr Asn Leu Pro Pro Glu 20 25 30Asp Gln Asp Gly Ser Gly Asp Asp Ser Asp Asn Phe Ser Gly Ser Gly 35 40 45Ala Gly Ala Leu Gln Asp Ile Thr Leu Ser Gln Gln Thr Pro Ser Thr 50 55 60Trp Lys Asp Thr Gln Leu Leu Thr Ala Ile Pro Thr Ser Pro Glu Pro65 70 75 80Thr Gly Leu Glu Ala Thr Ala Ala Ser Thr Ser Thr Leu Pro Ala Gly 85 90 95Glu Gly Pro Lys Glu Gly Glu Ala Val Val Leu Pro Glu Val Glu Pro 100 105 110Gly Leu Thr Ala Arg Glu Gln Glu Ala Thr Pro Arg Pro Arg Glu Thr 115 120 125Thr Gln Leu Pro Thr Thr His Gln Ala Ser Thr Thr Thr Ala Thr Thr 130 135 140Ala Gln Glu Pro Ala Thr Ser His Pro His Arg Asp Met Gln Pro Gly145 150 155 160His His Glu Thr Ser Thr Pro Ala Gly Pro Ser Gln Ala Asp Leu His 165 170 175Thr Pro His Thr Glu Asp Gly Gly Pro Ser Ala Thr Glu Arg Ala Ala 180 185 190Glu Asp Gly Ala Ser Ser Gln Leu Pro Ala Ala Glu Gly Ser Gly Glu 195 200 205Gln Asp Phe Thr Phe Glu Thr Ser Gly Glu Asn Thr Ala Val Val Ala 210 215 220Val Glu Pro Asp Arg Arg Asn Gln Ser Pro Val Asp Gln Gly Ala Thr225 230 235 240Gly Ala Ser Gln Gly Leu Leu Asp Arg Lys Glu Val Leu Gly Gly Val 245 250 255Ile Ala Val Gly Leu Val Gly Leu Ile Phe Ala Val Cys Leu Val Gly 260 265 270Phe Met Leu Tyr Arg Met Lys Lys Lys Asp Glu Gly Ser Tyr Ser Leu 275 280 285Glu Glu Pro Lys Gln Ala Asn Gly Gly Ala Tyr Gln Lys Pro Thr Lys 290 295 300Gln Glu Glu Phe Tyr Ala305 31059357DNAHomo sapiens 59ctggggctga ggatggagtc caagactgag aaatggatgg aacgaataca cctcaatgtc 60tctgaagggc cttttccacc tcatatccag ctccctccag aaattcaaga gtcccaggaa 120gtcactctga cctgcttgct gaatttctcc tgctatgggt atccgatcca attgcagtgg 180ctcctagagg gggttccaat gaggcaggct gctgtcacct cgacctcctt gaccatcaag 240tctgtcttca cccggagcga gctcaagttc tccccacagt ggagtcacca tgggaagatt 300gtgacctgcc agcttcagga tgcagatggg aagttcctct ccaatgacac ggtgcag 357603260DNAHomo sapiens 60ccatcccata gtgagggaag acacgcggaa acaggcttgc acccagacac gacaccatgc 60atctcctcgg cccctggctc ctgctcctgg ttctagaata cttggctttc tctgactcaa 120gtaaatgggt ttttgagcac cctgaaaccc tctacgcctg ggagggggcc tgcgtctgga 180tcccctgcac ctacagagcc ctagatggtg acctggaaag cttcatcctg ttccacaatc 240ctgagtataa caagaacacc tcgaagtttg atgggacaag actctatgaa agcacaaagg 300atgggaaggt tccttctgag cagaaaaggg tgcaattcct gggagacaag aataagaact 360gcacactgag tatccacccg gtgcacctca atgacagtgg tcagctgggg ctgaggatgg 420agtccaagac tgagaaatgg atggaacgaa tacacctcaa tgtctctgaa aggccttttc 480cacctcatat ccagctccct ccagaaattc aagagtccca ggaagtcact ctgacctgct 540tgctgaattt ctcctgctat gggtatccga tccaattgca gtggctccta gagggggttc 600caatgaggca ggctgctgtc acctcgacct ccttgaccat caagtctgtc ttcacccgga 660gcgagctcaa gttctcccca cagtggagtc accatgggaa gattgtgacc tgccagcttc 720aggatgcaga tgggaagttc ctctccaatg acacggtgca gctgaacgtg aagcacaccc 780cgaagttgga gatcaaggtc actcccagtg atgccatagt gagggagggg gactctgtga 840ccatgacctg cgaggtcagc agcagcaacc cggagtacac gacggtatcc tggctcaagg 900atgggacctc gctgaagaag cagaatacat tcacgctaaa cctgcgcgaa gtgaccaagg 960accagagtgg gaagtactgc tgtcaggtct ccaatgacgt gggcccggga aggtcggaag 1020aagtgttcct gcaagtgcag tatgccccgg aaccttccac ggttcagatc ctccactcac 1080cggctgtgga gggaagtcaa gtcgagtttc tttgcatgtc actggccaat cctcttccaa 1140caaattacac gtggtaccac aatgggaaag aaatgcaggg aaggacagag gagaaagtcc 1200acatcccaaa gatcctcccc tggcacgctg ggacttattc ctgtgtggca gaaaacattc 1260ttggtactgg acagaggggc ccgggagctg agctggatgt ccagtatcct cccaagaagg 1320tgaccacagt gattcaaaac cccatgccga ttcgagaagg agacacagtg accctttcct 1380gtaactacaa ttccagtaac cccagtgtta cccggtatga atggaaaccc catggcgcct 1440gggaggagcc atcgcttggg gtgctgaaga tccaaaacgt tggctgggac aacacaacca 1500tcgcctgcgc acgttgtaat agttggtgct cgtgggcctc ccctgtcgcc ctgaatgtcc 1560agtatgcccc ccgagacgtg agggtccgga aaatcaagcc cctttccgag attcactctg 1620gaaactcggt cagcctccaa tgtgacttct caagcagcca ccccaaagaa gtccagttct 1680tctgggagaa aaatggcagg cttctgggga aagaaagcca gctgaatttt gactccatct 1740ccccagaaga tgctgggagt tacagctgct gggtgaacaa ctccatagga cagacagcgt 1800ccaaggcctg gacacttgaa gtgctgtatg cacccaggag gctgcgtgtg tccatgagcc 1860cgggggacca agtgatggag gggaagagtg caaccctgac ctgtgagagt gacgccaacc 1920ctcccgtctc ccactacacc tggtttgact ggaataacca aagcctcccc caccacagcc 1980agaagctgag attggagccg gtgaaggtcc agcactcggg tgcctactgg tgccagggga 2040ccaacagtgt gggcaagggc cgttcgcctc tcagcaccct tactgtctac tatagcccgg 2100agaccatcgg caggcgagtg gctgtgggac tcgggtcctg cctcgccatc ctcatcctgg 2160caatctgtgg gctcaagctc cagcgacgtt ggaagaggac acagagccag caggggcttc 2220aggagaattc cagcggccag agcttctttg tgaggaataa aaaggttaga agggcccccc 2280tctctgaagg cccccactcc ctgggatgct acaatccaat gatggaagat ggcattagct 2340acaccaccct gcgctttccc gagatgaaca taccacgaac tggagatgca gagtcctcag 2400agatgcagag acctccccgg acctgcgatg acacggtcac ttattcagca ttgcacaagc 2460gccaagtggg cgactatgag aacgtcattc cagattttcc agaagatgag gggattcatt 2520actcagagct gatccagttt ggggtcgggg agcggcctca ggcacaagaa aatgtggact 2580atgtgatcct caaacattga cactggatgg gctgcagcag aggcactggg ggcagcgggg 2640gccagggaag tccccgagtt tccccagaca ccgccacatg gcttcctcct gcgtgcatgt 2700gcgcacacac acacacacac gcacacacac acacacacac tcactgcgga gaaccttgtg 2760cctggctcag agccagtctt tttggtgagg gtaaccccaa acctccaaaa ctcctgcccc 2820tgttctcttc cactctcctt gctacccaga aatcatctaa atacctgccc tgacatgcac 2880acctcccctg ccccaccagc ccactggcca tctccacccg gagctgctgt gtcctctgga 2940tctgctcgtc attttccttc ccttctccat ctctctggcc ctctacccct gatctgacat 3000ccccactcac gaatattatg cccagtttct gcctctgagg gaaagcccag aaaaggacag 3060aaacgaagta gaaaggggcc cagtcctggc ctggcttctc ctttggaagt gaggcattgc 3120acggggagac gtacgtatca gcggcccctt gactctgggg actccgggtt tgagatggac 3180acactggtgt ggattaacct gccagggaga cagagctcac aataaaaatg gctcagatgc 3240cacttcaaag aaaaaaaaaa 326061847PRTHomo sapiens 61Met His Leu Leu Gly Pro Trp Leu Leu Leu Leu Val Leu Glu Tyr Leu1 5 10 15Ala Phe Ser Asp Ser Ser Lys Trp Val Phe Glu His Pro Glu Thr Leu 20 25 30Tyr Ala Trp Glu Gly Ala Cys Val Trp Ile Pro Cys Thr Tyr Arg Ala 35 40 45Leu Asp Gly Asp Leu Glu Ser Phe Ile Leu Phe His Asn Pro Glu Tyr 50 55 60Asn Lys Asn Thr Ser Lys Phe Asp Gly Thr Arg Leu Tyr Glu Ser Thr65 70 75 80Lys Asp Gly Lys Val Pro Ser Glu Gln Lys Arg Val Gln Phe Leu Gly 85 90 95Asp Lys Asn Lys Asn Cys Thr Leu Ser Ile His Pro Val His Leu Asn 100 105 110Asp Ser Gly Gln Leu Gly Leu Arg Met Glu Ser Lys Thr Glu Lys Trp 115 120 125Met Glu Arg Ile His Leu Asn Val Ser Glu Arg Pro Phe Pro Pro His 130 135 140Ile Gln Leu Pro Pro Glu Ile Gln Glu Ser Gln Glu Val Thr Leu Thr145 150 155 160Cys Leu Leu Asn Phe Ser Cys Tyr Gly Tyr Pro Ile Gln Leu Gln Trp 165 170 175Leu Leu Glu Gly Val Pro Met Arg Gln Ala Ala Val Thr Ser Thr Ser 180 185 190Leu Thr Ile Lys Ser Val Phe Thr Arg Ser Glu Leu Lys Phe Ser Pro 195 200 205Gln Trp Ser His His Gly Lys Ile Val Thr Cys Gln Leu Gln Asp Ala 210 215 220Asp Gly Lys Phe Leu Ser Asn Asp Thr Val Gln Leu Asn Val Lys His225 230 235 240Thr Pro Lys Leu Glu Ile Lys Val Thr Pro Ser Asp Ala Ile Val Arg 245 250 255Glu Gly Asp Ser Val Thr Met Thr Cys Glu Val Ser Ser Ser Asn Pro 260 265 270Glu Tyr Thr Thr Val Ser Trp Leu Lys Asp Gly Thr Ser Leu Lys Lys 275 280 285Gln Asn Thr Phe Thr Leu Asn Leu Arg Glu Val Thr Lys Asp Gln Ser 290 295 300Gly Lys Tyr Cys Cys Gln Val Ser Asn Asp Val Gly Pro Gly Arg Ser305 310 315 320Glu Glu Val Phe Leu Gln Val Gln Tyr Ala Pro Glu Pro Ser Thr Val 325 330 335Gln Ile Leu His Ser Pro Ala Val Glu Gly Ser Gln Val Glu Phe Leu 340 345 350Cys Met Ser Leu Ala Asn Pro Leu Pro Thr Asn Tyr Thr Trp Tyr His 355 360 365Asn Gly Lys Glu Met Gln Gly Arg Thr Glu Glu Lys Val His Ile Pro 370 375 380Lys Ile Leu Pro Trp His Ala Gly Thr Tyr Ser Cys Val Ala Glu Asn385 390 395 400Ile Leu Gly Thr Gly Gln Arg Gly Pro Gly Ala Glu Leu Asp Val Gln 405 410 415Tyr Pro Pro Lys Lys Val Thr Thr Val Ile Gln Asn Pro Met Pro Ile 420 425 430Arg Glu Gly Asp Thr Val Thr Leu Ser Cys Asn Tyr Asn Ser Ser Asn 435 440 445Pro Ser Val Thr Arg Tyr Glu Trp Lys Pro His Gly Ala Trp Glu Glu 450 455 460Pro Ser Leu Gly Val Leu Lys Ile Gln Asn Val Gly Trp Asp Asn Thr465 470 475 480Thr Ile Ala Cys Ala Arg Cys Asn Ser Trp Cys Ser Trp Ala Ser Pro 485 490 495Val Ala Leu Asn Val Gln Tyr Ala Pro Arg Asp Val Arg Val Arg Lys 500 505 510Ile Lys Pro Leu Ser Glu Ile His Ser Gly Asn Ser Val Ser Leu Gln 515 520 525Cys Asp Phe Ser Ser Ser His Pro Lys Glu Val Gln Phe Phe Trp Glu 530 535 540Lys Asn Gly Arg Leu Leu Gly Lys Glu Ser Gln Leu Asn Phe Asp Ser545 550 555 560Ile Ser Pro Glu Asp Ala Gly Ser Tyr Ser Cys Trp Val Asn Asn Ser 565 570 575Ile Gly Gln Thr Ala Ser Lys Ala Trp Thr Leu Glu Val Leu Tyr Ala 580 585 590Pro Arg Arg Leu Arg Val Ser Met Ser Pro Gly Asp Gln Val Met Glu 595 600 605Gly Lys Ser Ala Thr Leu Thr Cys Glu Ser Asp Ala Asn Pro Pro Val 610 615 620Ser His Tyr Thr Trp Phe Asp Trp Asn Asn Gln Ser Leu Pro His His625 630 635 640Ser Gln Lys Leu Arg Leu Glu Pro Val Lys Val Gln His Ser Gly Ala 645 650 655Tyr Trp Cys Gln Gly Thr Asn Ser Val Gly Lys Gly Arg Ser Pro Leu 660 665 670Ser Thr Leu Thr Val Tyr Tyr Ser Pro Glu Thr Ile Gly Arg Arg Val 675 680 685Ala Val Gly Leu Gly Ser Cys Leu Ala Ile Leu Ile Leu Ala Ile Cys 690 695 700Gly Leu Lys Leu Gln Arg Arg Trp Lys Arg Thr Gln Ser Gln Gln Gly705 710 715 720Leu Gln Glu Asn Ser Ser Gly Gln Ser Phe Phe Val Arg Asn Lys Lys 725 730 735Val Arg Arg Ala Pro Leu Ser Glu Gly Pro His Ser Leu Gly Cys Tyr 740 745 750Asn Pro Met Met Glu Asp Gly Ile Ser Tyr Thr Thr Leu Arg Phe Pro 755 760 765Glu Met Asn Ile Pro Arg Thr Gly Asp Ala Glu Ser Ser Glu Met Gln 770 775 780Arg Pro Pro Arg Thr Cys Asp Asp Thr Val Thr Tyr Ser Ala Leu His785 790 795 800Lys Arg Gln Val Gly Asp Tyr Glu Asn Val Ile Pro Asp Phe Pro Glu 805 810 815Asp Glu Gly Ile His Tyr Ser Glu Leu Ile Gln Phe Gly Val Gly Glu 820 825 830Arg Pro Gln Ala Gln Glu Asn Val Asp Tyr Val Ile Leu Lys His 835 840 84562340DNAHomo sapiens 62ctggggggtc cgggaaaggg gttgggccat gagccaggca gctccgaagc agtcactgag 60gccagggagc ctgcacccag gtcatggggc gacctggctc tcactcctgg cctgggtgct 120cacctacaga ccacttcact tcccctgtcc gcagcgtcac tatgtcctca taggtggctg 180tctggtcaat gtccaggccc tcgtaggtgt gatcttcctc catgccagcc ttgctgtcat 240ccttgtccag cagcaggaag ataggcacga tgatgaagag gatgatcagc agcgtctgga 300tcatgatgat accatccttc agcgtgttcc tctgcttcag 3406379PRTHomo sapiens 63Leu Lys Gln Arg Asn Thr Leu Lys Asp Gly Ile Ile Met Ile Gln Thr1 5 10 15Leu Leu Ile Ile Leu Phe Ile Ile Val Pro Ile Phe Leu Leu Leu Asp 20 25 30Lys Asp Asp Ser

Lys Ala Gly Met Glu Glu Asp His Thr Tyr Glu Gly 35 40 45Leu Asp Ile Asp Gln Thr Ala Thr Tyr Glu Asp Ile Val Thr Leu Arg 50 55 60Thr Gly Glu Val Lys Trp Ser Val Gly Glu His Pro Gly Gln Glu65 70 7564340DNAHomo sapiens 64ctggggggtc cgggaaaggg gttgggccat gagccaggca gctccgaagc agtcactgag 60gccagggagc ctgcacccag gtcatggggc gacctggctc tcactcctgg cctgggtgct 120cacctacaga ccacttcact tcccctgtcc gcagcgtcac tatgtcctca taggtggctg 180tctggtcaat gtccaggccc tcgtaggtgt gatcttcctc catgccagcc ttgctgtcat 240ccttgtccag cagcaggaag ataggcacga tgatgaagag gatgatcagc agcgtctgga 300tcatgatgat accatccttc agcgtgttcc tctgcttcag 340651226DNAHomo sapiens 65ccacgcgtcc gcccacgcgt ccgcagagcg gtgaccatgg ccaggctggc gttgtctcct 60gtgcccagcc actggatggt ggcgttgctg ctgctgctct cagctgagcc agtaccagca 120gccagatcgg aggaccggta ccggaatccc aaaggtagtg cttgttcgcg gatctggcag 180agcccacgtt tcatagccag gaaacggggc ttcacggtga aaatgcactg ctacatgaac 240agcgcctccg gcaatgtgag ctggctctgg aagcaggaga tggacgagaa tccccagcag 300ctgaagctgg aaaagggccg catggaagag tcccagaacg aatctctcgc caccctcacc 360atccaaggca tccggtttga ggacaatggc atctacttct gccagcagaa gtgcaacaac 420acctcggagg tctaccaggg ctgcggcaca gagctgcgag tcatgggatt cagcaccttg 480gcacagctga agcagaggaa cacgctgaag gatggtatca tcatgatcca gacgctgctg 540atcatcctct tcatcatcgt gcctatcttc ctgctgctgg acaaggatga cagcaaggct 600ggcatggagg aagatcacac ctacgagggc ctggacattg accagacagc cacctatgag 660gacatagtga cgctgcggac aggggaagtg aagtggtctg taggtgagca cccaggccag 720gagtgagagc caggtcgccc catgacctgg gtgcaggctc cctggcctca gtgactgctt 780cggagctgcc tggctcatgg cccaacccct ttcccggacc ccccagctgg cctctgaagc 840tggcccacca gagctgccat ttgtctccag cccctggtcc ccagctcttg ccaaagggcc 900tggagtagaa ggacaacagg gcagcaactt ggagggagtt ctctggggat ggacgggacc 960cagccttctg ggggtgctat gaggtgatcc gtccccacac atgggatggg ggaggcagag 1020actggtccag agcccgcaaa tggactcgga gccgagggcc tcccagcaga gcttgggaag 1080ggccatggac ccaactgggc cccagaagag ccacaggaac atcattcctc tcccgcaacc 1140actcccaccc cagggaggcc ctggcctcca gtgccttccc ccgtggaata aacggtgtgt 1200cctgagaaac caaaaaaaaa aaaaaa 122666229PRTHomo sapiens 66Met Ala Arg Leu Ala Leu Ser Pro Val Pro Ser His Trp Met Val Ala1 5 10 15Leu Leu Leu Leu Leu Ser Ala Glu Pro Val Pro Ala Ala Arg Ser Glu 20 25 30Asp Arg Tyr Arg Asn Pro Lys Gly Ser Ala Cys Ser Arg Ile Trp Gln 35 40 45Ser Pro Arg Phe Ile Ala Arg Lys Arg Gly Phe Thr Val Lys Met His 50 55 60Cys Tyr Met Asn Ser Ala Ser Gly Asn Val Ser Trp Leu Trp Lys Gln65 70 75 80Glu Met Asp Glu Asn Pro Gln Gln Leu Lys Leu Glu Lys Gly Arg Met 85 90 95Glu Glu Ser Gln Asn Glu Ser Leu Ala Thr Leu Thr Ile Gln Gly Ile 100 105 110Arg Phe Glu Asp Asn Gly Ile Tyr Phe Cys Gln Gln Lys Cys Asn Asn 115 120 125Thr Ser Glu Val Tyr Gln Gly Cys Gly Thr Glu Leu Arg Val Met Gly 130 135 140Phe Ser Thr Leu Ala Gln Leu Lys Gln Arg Asn Thr Leu Lys Asp Gly145 150 155 160Ile Ile Met Ile Gln Thr Leu Leu Ile Ile Leu Phe Ile Ile Val Pro 165 170 175Ile Phe Leu Leu Leu Asp Lys Asp Asp Ser Lys Ala Gly Met Glu Glu 180 185 190Asp His Thr Tyr Glu Gly Leu Asp Ile Asp Gln Thr Ala Thr Tyr Glu 195 200 205Asp Ile Val Thr Leu Arg Thr Gly Glu Val Lys Trp Ser Val Gly Glu 210 215 220His Pro Gly Gln Glu22567449DNAHomo sapiensmodified_base(16)n = g, a, c or t 67aaaattgatc acaacnaggg aaaacaaaat aaaattaggg ggcaaagggt aggagtatgg 60ggggagggga gagcaaacct atcgaatata tcttagaatt ttgctcagaa atcactgctg 120cctctcaagt gttgcattgt ccctgcctaa accaagaagg ctaaacaaag cccctcctgt 180ttgaattctt aaggtaagaa atttctaagc taagaaaaca ctattgccta aaaccaatga 240tagtggagct catttacaaa taggcatgcc tcacacacac agtccaaagg caagacactg 300gctttgaaat taggctcatg atgtgattcc tattatatgt acctgatttt tttaggcccc 360aggtatgtgg accagagtta atgtcatgac tcttcaaaga tatgatgaaa agttgcccta 420gaaatctaga gatgcatgtt tatttaatt 449682359DNAHomo sapiens 68ctttcaagaa aatacatctg tgctgtattt tccccttccc tcaggccatg atctctgctg 60ttttccttac taactggcat gtcagtacaa gagtgattgt gaagctgctc cggaagggct 120ttatgctaac ctctgttgct tgatgacatg tcctcaggac tctgatatta aaactcaatc 180cttagataac aggtagcttt atcatggaag taggtagcaa tttggaatta gaccattctt 240agttattttt ttcttaatga attgatacat gcactttaaa aaatattttt gttattttgg 300gaagaaaaac tcagactttt aaaaaagtgt atattgtccc attataatat gtatatggaa 360gagtgaaatc tgaacgctgt cttatattaa gcagtagaat taggtattat cataaaaagt 420cttaatctgt agggaatatg agtttatgtt tatgagtcct gctcagtccc tctttgagag 480aattagttga aacccagact ctaaagtctg cttttatatt tgtttgttaa gaccacttat 540ctgcagaagg ttgcctttta accccagtgg ttctaaggtg tggaattgag tgaccctaat 600atttacataa gagacttgtt ttagtggagc ataagggagg ggcataagtt acaccgtttt 660gtgctgcttg agaactgtct tttaaaattg atcacaacga gggaaaacaa aataaaatta 720gggggcaaag ggtaggagta tggggggagg ggagagcaaa cctatcgaat atatcttaga 780attttgctca gaaatcactg ctgcctctca agtgttgcat tgtccctgcc taaaccaaga 840aggctaaaca aagcccctcc tgtttgaatt cttaaggtaa gaaatttcta agctaagaaa 900acactattgc ctaaaaccaa tgatagtgga gctcatttac aaataggcat gcctcacaca 960cacagtccaa aggcaagaca ctggctttga aattaggctc atgatgtgat tcctattata 1020tgtacctgat ttttttaggc cccaggtatg tggaccagag ttaatgtcat gactcttcaa 1080agatatgatg aaaagttgcc ctagaaatct agagatgcat gtttatttaa ttccatagtt 1140taaaaaaaaa tttaagcagg tagttgtggc ttatctgggg gcaaaataat atatgtgaaa 1200ttgcttccag aggacaaagt atattttcta aagtcctgaa ataggatcat gaacccttct 1260gaagttttgg tttgaaatat tatagtatat gatattacca aagagccctt aattcagagt 1320ttaaggggct ctcttcctga actctcttca tcactcaggg ttgaatgtgt aatgttcctt 1380gctattgatt gttattgttg attcttagga tcaggccaag aatcatctgg aaaacattat 1440cttaattccg tctctcatat cctaaacagt acattttact aagaaattcc atatgaaaaa 1500ctccactcat gtctcctgag attatcctgt aagtgaagta gctttcattt aaccaagcta 1560aattatttcc atttagccat gttaaagaga agccaagtct agagaaagca atcctgtaac 1620ccatgaatct ggtgtaccca ttttccctta acgtaacggg aagtgttttg aaattcccag 1680aagagagctg ttttgtaatc aaagtgatgg attataagaa agccagactt tggaaaagga 1740taattggaat aaagggaggt gcttgaagat tttccaaact actttatgtc atttagcttc 1800tattttctga agggctttct ttggtgccat gtactcagat cagtcagttg actgaaagat 1860gatcatgttt tcttcgtaaa gatttaagca attggcaact acaaagacat tattttctta 1920ctgttctata tcatgtactg ttgctgacat tacaaaaagg gtctggaagg gaaaccgtgt 1980cactgtttta tcttttttct ttaaaataca aaagtatccc aactaatcat ttattatggt 2040cagcttgttt tacatgtccc ctatgatgag aaatgctatc aacatctgtg atttctaaga 2100gtcttaccaa attgttactt taattcttgt gtcctgctga gtggtttttc ttttaaaata 2160ccatttttat caccctgtgg cactgggtgt gttactgcga ttacactgat gattctgagc 2220tgtgcttctt caagtagctc agttcttgcg ttttatatta ggtaacagtt ttgtgatgct 2280tttgtgcatt ctttgtcatc tcttctgagt tttcgaatct gtcataaata aactttttca 2340ctatgcacct ggtaaaaaa 235969240DNAHomo sapiens 69cctaagccgc ctaaggggct gcctcggctg tccatcagtt acctcgtttc ctgagcagag 60taattgggtg agattgttca tggaggcatt gctggctctc tagtcctgga acctacagtt 120ggtccaattc attatgccaa agggtccgtc taggaggttc ttgttccaag tattgagatt 180cccgagagaa gtaggtcccc ttagatagaa gcagagtttc tcagaggtat ttagcagcag 24070980DNAHomo sapiens 70gccgctgccg ctccaggaga caggttccca tgcaggaatg aaagacatgg aagggaagag 60gggggccagc tccctgagtc ctgtgtccac cagctgctgc taaatacctc tgagaaactc 120tgcttctatc taaggggacc tacttctctc gggaatctca atacttggaa caagaacctc 180ctagacggac cctttggcat aatgaattgg accaactgta ggttccagga ctagagagcc 240agcaatgcct ccatgaacaa tctcacccaa ttactctgct caggaaacga ggtaactgat 300ggacagccga ggcagcccct taggcggctt aggcctcccc tgtggagcat ccctgaggcg 360gactccggcc agcccgagtg atgcgatcca aagagcactc ccgggtagga aattgccccg 420gtggaatgcc tcaccagagc agcgtgtagc agttccctgt ggaggattaa cacagtggct 480gaacaccggg aaggaactgg cacttggagt ccggacatct gaaacttgta gactgggagc 540tgtacatgga tgggagcagc ttcaccaacc cctgcaaagt gactctgaag aagacgacaa 600gccctgctcc agtcacaccc ggaagctgac tggtccacgc acagctgaag catgaggaaa 660ctcatcgcgg gactaatttt ccttaaaatt tagacttgca cagtaaggac ttcaactgac 720cttcctcaga ctgagaactg tttccagtat atacatcaag tcactgaggt aggacaaaag 780attgctacat tcctattatt ttaaggttac atttttgggg acccctcttt cttctgttct 840agctattacc tttcttgtgt cacctagaaa aggaccagtc cttaattgta ttttaaaaac 900tgtgatcatg ggaagcttta aattggttca ataacacgca tcaagttggt tatttcctgg 960gctacatacc ttggatagat 98071118PRTHomo sapiens 71Met Asp Ser Arg Gly Ser Pro Leu Gly Gly Leu Gly Leu Pro Cys Gly1 5 10 15Ala Ser Leu Arg Arg Thr Pro Ala Ser Pro Ser Asp Ala Ile Gln Arg 20 25 30Ala Leu Pro Gly Arg Lys Leu Pro Arg Trp Asn Ala Ser Pro Glu Gln 35 40 45Arg Val Ala Val Pro Cys Gly Gly Leu Thr Gln Trp Leu Asn Thr Gly 50 55 60Lys Glu Leu Ala Leu Gly Val Arg Thr Ser Glu Thr Cys Arg Leu Gly65 70 75 80Ala Val His Gly Trp Glu Gln Leu His Gln Pro Leu Gln Ser Asp Ser 85 90 95Glu Glu Asp Asp Lys Pro Cys Ser Ser His Thr Arg Lys Leu Thr Gly 100 105 110Pro Arg Thr Ala Glu Ala 11572531DNAHomo sapiensmodified_base(519)n = g, a, c or t 72aaaaaggtaa ttttcagcat tttggcacct aaaagggaaa ctttcatctg cttacacagg 60ccagaagcaa agacaaagat tgcatgttgt tcttacagat gacttaaatc atctctttga 120tgataaaaat atttttaagc cgtgaaagtt atgagatatt ctgggtaagc ctgattatca 180aagaatacca caaatagctt tggagatcgt gtattgtttg tcactgagtc aaagagatct 240gtgggattgt gaggattctt gggtggaggg gtgactaatc ctgcacgtcc ctttgtgaag 300actccagtaa gtactcgcac aacgtacatg tgctttctcc cattgctgtc tggcttggag 360taggtgtcct tggcagaata actggcatcc acagcaaaat aggttccttt tccataggat 420acagcatttt tcccacacaa cttctattaa agccgtgctg attgacatat ggcactgagt 480ctgcatctgt cccatggaag aggagtctct cattattcnt atggtcattc t 531731956DNAHomo sapiens 73attgttatca actctttgat atctgatgat caatgctcca aagaattgga ttaatatttt 60tacacaatat tgttgtagtc agtaactgtt tctatttcca ggcattttta gatgaattca 120ctaactggtc aagaataaat cccaacaagg ccaggattcc catggcagga gatacccaag 180gtgtggtcgg gactgtctct aagccttgtt tcacagcata tgaaatgaaa atcggtgcaa 240ttacttttca ggttgctact ggagatatag ccactgaaca ggtagatgtt attgtaaact 300caacagcaag gacatttaat cggaaatcag gtgtgtcaag agctatttta gaaggtgctg 360gacaagctgt ggaaagtgaa tgtgctgtac tagctgcaca gcctcacaga gattttataa 420ttacaccagg tggatgctta aagtgcaaaa taataattca tgttcctggg ggaaaagatg 480tcaggaaaac ggtcaccagt gttctagaag agtgtgaaca gaggaagtac acatcggttt 540cccttccagc cattggaaca ggaaatgccg gaaaaaaccc tatcacagtt gctgataaca 600taatcgatgc tattgtagac ttctcatcac aacattccac cccatcatta aaaacagtta 660aagttgtcat ttttcaacct gagctgctaa atatattcta cgacagcatg aaaaaaagag 720acctctctgc atcactgaac tttcagtcca cattctccat gactacatgt aatcttcctg 780aacactggac tgacatgaat catcagctgt tttgcatggt ccagctagag ccaggacaat 840cagaatataa taccataaag gacaagttca cccgaacttg ttcttcctac gcaatagaga 900agattgagag gatacagaat gcatttctct ggcagagcta ccaggtaaag aaaaggcaaa 960tggatatcaa gaatgaccat aagaataatg agagactcct cttccatggg acagatgcag 1020actcagtgcc atatgtcaat cagcacggct ttaatagaag ttgtgctggg aaaaatgctg 1080tatcctatgg aaaaggaacc tattttgctg tggatgccag ttattctgcc aaggacacct 1140actccaagcc agacagcaat gggagaaagc acatgtacgt tgtgcgagta cttactggag 1200tcttcacaaa gggacgtgca ggattagtca cccctccacc caagaatcct cacaatccca 1260cagatctctt tgactcagtg acaaacaata cacgatctcc aaagctattt gtggtattct 1320ttgataatca ggcttaccca gaatatctca taactttcac ggcttaaaaa tatttttatc 1380atcaaagaga tgatttaagt catctgtaag aacaacatgc aatctttgtc tttgcttctg 1440gcctgtgtaa gcagatgaaa gtttcccttt taggtgccaa aatgctgaaa attacctttt 1500taaagtgctc tattgctgcg atttgtagca tacctttttt tctcagcaaa ttgatgggtg 1560gaagctgaga aatgtatggt aaatgtcaca gagctacaac cattcacaga caccaaatct 1620ctaggagaat aaaaagcaca ttattctttt tctatcagaa aaaaacaaga tgcatcacct 1680taaaaccaag atgacattgt tcttcttgga acatgttaag acatcgaatg gtggcgggtt 1740aaactgtact gcttaagtgg agcggctacc gttatgcatc tatcacagtt ggggattttg 1800ccttattaag gaaaacttgt caatagttca gctgaaatga ctgaatcaca gaatattaac 1860tctgttatgg aacaaatcat aacagatttt acctgtttac atttcaggta aaaatgtatc 1920gcattgttat ctaatattaa aaaattaccc ccaatt 195674444PRTHomo sapiens 74Met Leu Gln Arg Ile Gly Leu Ile Phe Leu His Asn Ile Val Val Val1 5 10 15Ser Asn Cys Phe Tyr Phe Gln Ala Phe Leu Asp Glu Phe Thr Asn Trp 20 25 30Ser Arg Ile Asn Pro Asn Lys Ala Arg Ile Pro Met Ala Gly Asp Thr 35 40 45Gln Gly Val Val Gly Thr Val Ser Lys Pro Cys Phe Thr Ala Tyr Glu 50 55 60Met Lys Ile Gly Ala Ile Thr Phe Gln Val Ala Thr Gly Asp Ile Ala65 70 75 80Thr Glu Gln Val Asp Val Ile Val Asn Ser Thr Ala Arg Thr Phe Asn 85 90 95Arg Lys Ser Gly Val Ser Arg Ala Ile Leu Glu Gly Ala Gly Gln Ala 100 105 110Val Glu Ser Glu Cys Ala Val Leu Ala Ala Gln Pro His Arg Asp Phe 115 120 125Ile Ile Thr Pro Gly Gly Cys Leu Lys Cys Lys Ile Ile Ile His Val 130 135 140Pro Gly Gly Lys Asp Val Arg Lys Thr Val Thr Ser Val Leu Glu Glu145 150 155 160Cys Glu Gln Arg Lys Tyr Thr Ser Val Ser Leu Pro Ala Ile Gly Thr 165 170 175Gly Asn Ala Gly Lys Asn Pro Ile Thr Val Ala Asp Asn Ile Ile Asp 180 185 190Ala Ile Val Asp Phe Ser Ser Gln His Ser Thr Pro Ser Leu Lys Thr 195 200 205Val Lys Val Val Ile Phe Gln Pro Glu Leu Leu Asn Ile Phe Tyr Asp 210 215 220Ser Met Lys Lys Arg Asp Leu Ser Ala Ser Leu Asn Phe Gln Ser Thr225 230 235 240Phe Ser Met Thr Thr Cys Asn Leu Pro Glu His Trp Thr Asp Met Asn 245 250 255His Gln Leu Phe Cys Met Val Gln Leu Glu Pro Gly Gln Ser Glu Tyr 260 265 270Asn Thr Ile Lys Asp Lys Phe Thr Arg Thr Cys Ser Ser Tyr Ala Ile 275 280 285Glu Lys Ile Glu Arg Ile Gln Asn Ala Phe Leu Trp Gln Ser Tyr Gln 290 295 300Val Lys Lys Arg Gln Met Asp Ile Lys Asn Asp His Lys Asn Asn Glu305 310 315 320Arg Leu Leu Phe His Gly Thr Asp Ala Asp Ser Val Pro Tyr Val Asn 325 330 335Gln His Gly Phe Asn Arg Ser Cys Ala Gly Lys Asn Ala Val Ser Tyr 340 345 350Gly Lys Gly Thr Tyr Phe Ala Val Asp Ala Ser Tyr Ser Ala Lys Asp 355 360 365Thr Tyr Ser Lys Pro Asp Ser Asn Gly Arg Lys His Met Tyr Val Val 370 375 380Arg Val Leu Thr Gly Val Phe Thr Lys Gly Arg Ala Gly Leu Val Thr385 390 395 400Pro Pro Pro Lys Asn Pro His Asn Pro Thr Asp Leu Phe Asp Ser Val 405 410 415Thr Asn Asn Thr Arg Ser Pro Lys Leu Phe Val Val Phe Phe Asp Asn 420 425 430Gln Ala Tyr Pro Glu Tyr Leu Ile Thr Phe Thr Ala 435 44075449DNAHomo sapiens 75cgaggtctga gctcctctgg ttcttctcta gacctgctcc ctctctgaaa tgcaaggccg 60tgcctttaat gggcttttgg cattctgtct ccagacctcc cttctcatct gaagggctct 120caggagaaca gagaaaaaac cagcctgtct ccaaactggc ccgtctcagg gactgggggc 180ctttaccccc agtgaaagat gcagacttta cagcgctgca gtacagtaga gtcaagtgac 240tccttcagat agttggatgg gtctctcgat cattcctgat aataacattt tgcctatgtt 300aagtgctttc cacctatcat gttaccttct aactactccc ttggttggat acaggtatta 360gccccatttc acaattaaga aattgaggct taaaaggatt aaagagtttt ttagaggaga 420aacagctctt ccttacagaa ggatcccaa 4497679PRTHomo sapiens 76Arg Ser His Leu Thr Leu Leu Tyr Cys Ser Ala Val Lys Ser Ala Ser1 5 10 15Phe Thr Gly Gly Lys Gly Pro Gln Ser Leu Arg Arg Ala Ser Leu Glu 20 25 30Thr Gly Trp Phe Phe Leu Cys Ser Pro Glu Ser Pro Ser Asp Glu Lys 35 40 45Gly Gly Leu Glu Thr Glu Cys Gln Lys Pro Ile Lys Gly Thr Ala Leu 50 55 60His Phe Arg Glu Gly Ala Gly Leu Glu Lys Asn Gln Arg Ser Ser65 70 75773067DNAHomo sapiens 77ggcacgagca atgggactta tcgctgctga tgttaacctt gatctcttgg ttcaggtggt 60gcctgccagc tgtctccact gtggagttac tatttttcct tttccccatt ttattcatca 120gaagccagtc actaagcgag gtcaaactcc aggacagggg aattaagtgc caccttctgg 180agagggagca ttcacattta ttacttggga tccttctgta aggaagagct gtttctcctc 240taaaaaactc tttaatcctt ttaagcctca atttcttaat tgtgaaatgg ggctaatacc 300tgtatccaac caagggagta gttagaaggt aacatgatag gtggaaagca cttaacatag 360gcaaaatgtt attatcagga atgatcgaga gacccatcca actatctgaa ggagtcactt 420aactctactg tactgcagcg ctgtaaagtc tgcatctttc

actgggggta aaggccccca 480gtccctgaga cgggccagtt tggagacagg ctggtttttt ctctgttctc ctgagagccc 540ttcagatgag aagggaggtc tggagacaga atgccaaaag cccattaaag gcacggcctt 600gcatttcaga gagggagcag gtctagagaa gaaccagagg agctcagctg agatatggtg 660tatggattgg attttggtag aagatgggaa gaaccaaaca cctgagaaac cactttgaag 720atcggggtca gagtaaggcc taacacatag ttggctccca gtaattattg gttgattgaa 780cagctcaaag agcaactcga ccaagaacac tggactggga gtccagttac ttggatcttg 840cattcctgat ttatttttat tttatatgta ttttttctat ttttttgaga cgaagtctca 900ctcactctgt cgcccaggct ggactacaat ggcacgatct cggctcactg caaactctgc 960ctcccaggtt caagcgattc tcctgcctca gcctctcgag tagctaggat tacaggcatg 1020caccaccacg ctggctaatt tttgtatttt tagtagagac ggggttttgc catgttggcc 1080atgctggtgt ccacctcctg acctcagttg atcttcctgc ctcagccttc caaaatgttg 1140ggattacagg cgtgagccac cgtgcctggc cgtgatttat tttttttgtg tatgtttgtt 1200tttgtcaact tgctgtgtga ccttaagcaa gttacttaac ttctctgggc ttcactttcc 1260atggatgaac attgtaaaga ggctggagag agatgaggac taggtacagg ctttagagga 1320gagccaccgc cccggacttc tccctctgtc accccgcttt ccatgaccct ccttgcctga 1380ctttgtgact ccttgcctcg ctatcaaaac aagtgctgca atctcagtgc tttccaagag 1440ccctgcattg ttagaaactt cccagcacgc agcaaaggct gctgcaatac tcgctctgcc 1500tgcctttgcc ctgcgcttcc tacttaccct ccttttgttt ctcccaaaca tctgtccctg 1560actatgctca tctcatgttt gtcctcagct gctgaaaggg ccacgtttgt tttcattaca 1620aataagacca ccgagtgggc tcctggcgtg ggggcgggag cagccgcgcg cagtcttcag 1680aggcagcccc ccaggctgtc tctggagggt gtgtctctgc ttccctttcc ccgtgtttat 1740tttcagacga agccaagtgg cccgggggga ccctccggac tcccagcctt cagagaggag 1800ggcagctcgg gctttcgccg cagtgcttcc tgcccgtcac gtgtgtgctc ctagccgggg 1860tcgggggagc tggtatcttg gcccttctgg gaggacgcgc acagcccgag gaggcagagc 1920cccagacggg aatgggcttt tcagaggtgg ggtgcgggcg aggggacgat gcattatttt 1980taatatttga tttatttttc caactggact tcttcccggg gctctttctg ggcccagctg 2040cctttgtgat cccgcgcccc ggtcctcggc ctctcacctc cagcgccggg gcgccccctg 2100ctgtcggaag cggctgtgac cgggcagagg tgctatctgg gactctgggt tctcagcccg 2160gggacagcga accgaggggc agatgatcca tcagaaaaga gccggcactg cccagccccg 2220cgcccctgcc cctgcctttt tccgggagcg cgccgcgccg cacccgctac ggccgcttga 2280ccccatcttt gagcccggcc ccaagctctg ggaccgtcgt gcccctcatc aaggaagagc 2340caaggacccc aaggagaagg tcaggagcgg cggtgtggat gtcccttggc tgcaggcccc 2400gccgcgcact cccttcagtc cttcccttct ctagggacca ggtagcatca gtgcctggat 2460ctcggccttg tgtgccctgc tccctgcccc acctactaag aaccaagtct ggttcaccgg 2520ctcccaagag ctggaaccca ttctcagcta gctgggggcc caggccaccc cttccctcca 2580gacctgtgtg ccttctgccc tggctccagg gccccccaca ccgtgaccag ggcgggatcc 2640ctatggggct ggccagtcgg caccgtgcca ggcccacagt gccctgggcg tccatggaag 2700tcgttctgtg tctttaaaat cagaaggaag acattaacct ttaggctgaa gaaaatgttt 2760tagtacacag caataactta tttgtcttta tccaacagcc ataaaatata actttaaata 2820ttctattgat agagaaagga gttcatgaag gcagaaatgc ctggggccca cgaacatccc 2880agtgtggccc tggacgggac atcatgctgg gcaacacagc taaaatgcgg gtgaagacca 2940gatttcttgc acatggcggt gacgggatgc tccctagaga gcttcaagtg gattctttgc 3000tttttatttt ctctcttaat aaaaatgtat gatgtttaca ttgtcagaga aaaaaaaaaa 3060aaaaaaa 306778554DNAHomo sapiensmodified_base(1)..(554)n = g, a, c or t 78aaagcgaatt catactataa cagcagaaac aaaacttcag atttcagaat ttgttattgg 60caaaatttat tctcattata cctgcttcat atgggtatat tactattaaa acagaatacc 120atagagtaat tgcattattt gaaaattctn tcattttaca atgcacttca ccaatgaaac 180agntaatttc cattttgaaa attaaaagaa aacagcacag agaagttaaa tgcggtgtag 240caaagttatg gggtctgctt gagggcacta acctcaacag attattcctc ctctccttag 300aataaccatg aaaatacaaa tttacttagc acatttttgc tttttaagta gctggttcat 360tttctgaatt tcccacattc agagttccag tcattattgt tacatcatgt ttgcagaaac 420cttgtcttat ttagtgtcta tttgcatata accctgaaaa cattattatt tgaaaacttt 480tctatatctc aaattaatat acattttcat aacctacctt tgnattaaga cttgcaattt 540tatcaatcta ttat 554793243DNAHomo sapiens 79ccgcagcctc cgcgggtggc aagcgggctg gggagagccg agggccaaag gaagagaaaa 60tcgcggggag tctctggccg ggagagtcca ggtagcgctc ggcgggcagc agtgcgcagg 120cccctcggct tcaaccgcca caatgctgcc agcagcgcca ggcaaggggc ttgggagccc 180ggaccccgcc ccctgcggcc cagcgccccc aggaaataca aaagatataa taatgatata 240tgaagaagat gctgaggaat gggctctgta cttgacagaa gtatttttac atgttgtgaa 300aagggaagcc atcctgttat atcgcttgga gaatttctct tttcggcatt tggagttgct 360gaacttaacg tcttacaaat gtaaactttt gatattatca aatagcctgc ttagagacct 420aactccaaag aaatgtcagt ttctggaaaa gatacttcat tcaccaaaaa gtgtagttac 480tttgctttgt ggagtgaaga gttcagatca gctctatgaa ttactaaata tctctcaaag 540cagatgggag atctcaactg aacaggaacc tgaagactac atctctgtaa tccagagtat 600catattcaaa gattctgaag actactttga ggtcaacatt ccaacagacc tacgagcaaa 660acattctggg gaaataagtg agagaaagga aattgaagaa ctatcagaag cttcaagaaa 720caccatacca ctagcagtgg tgcttcccac tgaaattcca tgtgagaatc ctggtgaaat 780attcataatt ttgagagatg aagtaattgg tgatactgta gaggttgaat ttacatcaag 840taataagcgc attagaacac ggccagccct ttggaataag aaagtctggt gcatgaaagc 900tttagagttt cctgctggtt cagtccatgt caatgtctac tgtgatggaa tcgttaaagc 960tacaaccaaa attaagtact acccaacagc aaaggcaaag gaatgcctat tcagaatggc 1020agattcagga gagagtttgt gccagaatag cattgaagaa cttgatggtg tccttacatc 1080catattcaaa catgagatac catattatga gttccagtct cttcaaactg aaatttgttc 1140tcaaaacaaa tatactcatt tcaaagaact tccaactctt ctccactgtg cagcaaaatt 1200tggcttaaag aacctggcta ttcatttgct tcaatgttca ggagcaacct gggcatctaa 1260gatgaaaaat atggagggtt cagaccccgc acatattgct gaaaggcatg gtcacaaaga 1320actcaagaaa atcttcgaag acttttcaat ccaagaaatt gacataaata atgagcaaga 1380aaatgattat gaagaggata ttgcctcatt ttccacatat attccttcca cacagaaccc 1440agcatttcat catgaaagca gaaagacata cgggcagagt gcagatggag ctgaggcaaa 1500tgaaatggaa ggggaaggaa aacagaatgg atcaggcatg gagaccaaac acagcccact 1560agaggttggc agtgagagtt ctgaagacca gtatgatgac ttgtatgtgt tcattcctgg 1620tgctgatcca gaaaataatt cacaagagcc actcatgagc agcagacctc ctctcccccc 1680gccgcgacct gtagctaatg ccttccaact ggaaagacct cacttcacct taccagggac 1740aatggtggaa ggccaaatgg aaagaagtca aaactggggt catcctggtg ttagacaaga 1800aacaggagat gaacccaaag gagaaaaaga gaagaaagaa gaggaaaaag agcaggagga 1860ggaagaagac ccatatactt ttgctgagat tgatgacagt gaatatgaca tgatattggc 1920caatctgagt ataaagaaaa aaactgggag tcggtctttc attataaata gacctcctgc 1980ccccacaccc cgacccacaa gtatacctcc aaaagaggaa actacacctt acatagctca 2040agtgtttcaa caaaagacag ccagaagaca atctgatgat gacaagttcc gtggtcttcc 2100taagaaacaa gacagagctc ggatagagag tccagccttt tctactctca ggggctgtct 2160aactgatggt caggaagaac tcatcctcct gcaggagaaa gtaaagaatg ggaaaatgtc 2220tatggatgaa gctctggaga aatttaaaca ctggcagatg ggaaaaagtg gcctggaaat 2280gattcagcag gagaaattac gacaactacg agactgcatt attgggaaaa ggccagaaga 2340agaaaatgtc tataataaac tcaccattgt gcaccatcca ggtggtaagg aaactgccca 2400caatgaaaat aagttttata atgtacactt cagcaataag cttcctgctc gaccccaagt 2460tgaaaaggaa tttggtttct gttgcaagaa agatcattaa agaaggttat tataatgaaa 2520ctcacgaatc tacggacatt ttgctttcag ggtgaagcaa gcttgaattt ggattgcctg 2580ctttctttaa agcgaattca tactataaca gcagaaacaa aacttcagat ttcagaattt 2640gttattggca aaatttattc tcattatacc tgcttcatat gggtatatta ctattaaaac 2700agaataccat agagtaattg cattatttga aaattctctc attttacaat gcacttcacc 2760aatgaaacag ctaatttcca ttttgaaaat taaaagaaaa cagcacagag aagttaaatg 2820cggtgtagca aagttatggg gtctgcttga gggcactaac ctcaacagat tattcctcct 2880ctccttagaa taaccatgaa aatacaaatt tacttagcac atttttgctt tttaagtagc 2940tggttcattt tctgaatttc tcacattcag agttccagtc attattgtta catcatgttt 3000gcagaaacct tgtcttattt agtgtctatt tgcatataac cctgaaaaca ttattatttg 3060aaaacttttc tatatctcaa attaatatac attttcataa cctacctttg tattaagact 3120tgcaatttta tcaatctatt atttcttaga aacaatttac tagcttagaa tagaaagcaa 3180tgttatcgtc atataatttt catgtacaaa tgccacaaat aaattgaatg tttaaagcta 3240aaa 324380755PRTHomo sapiens 80Met Ile Tyr Glu Glu Asp Ala Glu Glu Trp Ala Leu Tyr Leu Thr Glu1 5 10 15Val Phe Leu His Val Val Lys Arg Glu Ala Ile Leu Leu Tyr Arg Leu 20 25 30Glu Asn Phe Ser Phe Arg His Leu Glu Leu Leu Asn Leu Thr Ser Tyr 35 40 45Lys Cys Lys Leu Leu Ile Leu Ser Asn Ser Leu Leu Arg Asp Leu Thr 50 55 60Pro Lys Lys Cys Gln Phe Leu Glu Lys Ile Leu His Ser Pro Lys Ser65 70 75 80Val Val Thr Leu Leu Cys Gly Val Lys Ser Ser Asp Gln Leu Tyr Glu 85 90 95Leu Leu Asn Ile Ser Gln Ser Arg Trp Glu Ile Ser Thr Glu Gln Glu 100 105 110Pro Glu Asp Tyr Ile Ser Val Ile Gln Ser Ile Ile Phe Lys Asp Ser 115 120 125Glu Asp Tyr Phe Glu Val Asn Ile Pro Thr Asp Leu Arg Ala Lys His 130 135 140Ser Gly Glu Ile Ser Glu Arg Lys Glu Ile Glu Glu Leu Ser Glu Ala145 150 155 160Ser Arg Asn Thr Ile Pro Leu Ala Val Val Leu Pro Thr Glu Ile Pro 165 170 175Cys Glu Asn Pro Gly Glu Ile Phe Ile Ile Leu Arg Asp Glu Val Ile 180 185 190Gly Asp Thr Val Glu Val Glu Phe Thr Ser Ser Asn Lys Arg Ile Arg 195 200 205Thr Arg Pro Ala Leu Trp Asn Lys Lys Val Trp Cys Met Lys Ala Leu 210 215 220Glu Phe Pro Ala Gly Ser Val His Val Asn Val Tyr Cys Asp Gly Ile225 230 235 240Val Lys Ala Thr Thr Lys Ile Lys Tyr Tyr Pro Thr Ala Lys Ala Lys 245 250 255Glu Cys Leu Phe Arg Met Ala Asp Ser Gly Glu Ser Leu Cys Gln Asn 260 265 270Ser Ile Glu Glu Leu Asp Gly Val Leu Thr Ser Ile Phe Lys His Glu 275 280 285Ile Pro Tyr Tyr Glu Phe Gln Ser Leu Gln Thr Glu Ile Cys Ser Gln 290 295 300Asn Lys Tyr Thr His Phe Lys Glu Leu Pro Thr Leu Leu His Cys Ala305 310 315 320Ala Lys Phe Gly Leu Lys Asn Leu Ala Ile His Leu Leu Gln Cys Ser 325 330 335Gly Ala Thr Trp Ala Ser Lys Met Lys Asn Met Glu Gly Ser Asp Pro 340 345 350Ala His Ile Ala Glu Arg His Gly His Lys Glu Leu Lys Lys Ile Phe 355 360 365Glu Asp Phe Ser Ile Gln Glu Ile Asp Ile Asn Asn Glu Gln Glu Asn 370 375 380Asp Tyr Glu Glu Asp Ile Ala Ser Phe Ser Thr Tyr Ile Pro Ser Thr385 390 395 400Gln Asn Pro Ala Phe His His Glu Ser Arg Lys Thr Tyr Gly Gln Ser 405 410 415Ala Asp Gly Ala Glu Ala Asn Glu Met Glu Gly Glu Gly Lys Gln Asn 420 425 430Gly Ser Gly Met Glu Thr Lys His Ser Pro Leu Glu Val Gly Ser Glu 435 440 445Ser Ser Glu Asp Gln Tyr Asp Asp Leu Tyr Val Phe Ile Pro Gly Ala 450 455 460Asp Pro Glu Asn Asn Ser Gln Glu Pro Leu Met Ser Ser Arg Pro Pro465 470 475 480Leu Pro Pro Pro Arg Pro Val Ala Asn Ala Phe Gln Leu Glu Arg Pro 485 490 495His Phe Thr Leu Pro Gly Thr Met Val Glu Gly Gln Met Glu Arg Ser 500 505 510Gln Asn Trp Gly His Pro Gly Val Arg Gln Glu Thr Gly Asp Glu Pro 515 520 525Lys Gly Glu Lys Glu Lys Lys Glu Glu Glu Lys Glu Gln Glu Glu Glu 530 535 540Glu Asp Pro Tyr Thr Phe Ala Glu Ile Asp Asp Ser Glu Tyr Asp Met545 550 555 560Ile Leu Ala Asn Leu Ser Ile Lys Lys Lys Thr Gly Ser Arg Ser Phe 565 570 575Ile Ile Asn Arg Pro Pro Ala Pro Thr Pro Arg Pro Thr Ser Ile Pro 580 585 590Pro Lys Glu Glu Thr Thr Pro Tyr Ile Ala Gln Val Phe Gln Gln Lys 595 600 605Thr Ala Arg Arg Gln Ser Asp Asp Asp Lys Phe Arg Gly Leu Pro Lys 610 615 620Lys Gln Asp Arg Ala Arg Ile Glu Ser Pro Ala Phe Ser Thr Leu Arg625 630 635 640Gly Cys Leu Thr Asp Gly Gln Glu Glu Leu Ile Leu Leu Gln Glu Lys 645 650 655Val Lys Asn Gly Lys Met Ser Met Asp Glu Ala Leu Glu Lys Phe Lys 660 665 670His Trp Gln Met Gly Lys Ser Gly Leu Glu Met Ile Gln Gln Glu Lys 675 680 685Leu Arg Gln Leu Arg Asp Cys Ile Ile Gly Lys Arg Pro Glu Glu Glu 690 695 700Asn Val Tyr Asn Lys Leu Thr Ile Val His His Pro Gly Gly Lys Glu705 710 715 720Thr Ala His Asn Glu Asn Lys Phe Tyr Asn Val His Phe Ser Asn Lys 725 730 735Leu Pro Ala Arg Pro Gln Val Glu Lys Glu Phe Gly Phe Cys Cys Lys 740 745 750Lys Asp His 755813195DNAHomo sapiens 81ggaagagaaa atcgcgggga gtctctggcc gggagagtcc aggtagcgct cggcgggcag 60cagtgcgcag gcccctcggc ttcaaccgcc acaatgctgc cagcagcgcc aggcaagggg 120cttgggagcc cggaccccgc cccctgcggc ccagcgcccc caggaaatac aaaagatata 180ataatgatat atgaagaaga tgctgaggaa tgggctctgt acttgacaga agtattttta 240catgttgtga aaagggaagc catcctgtta tatcgcttgg agaatttctc ttttcggcat 300ttggagttgc tgaacttaac gtcttacaaa tgtaaacttt tgatattatc aaatagcctg 360cttagagacc taactccaaa gaaatgtcag tttctggaaa agatacttca ttcaccaaaa 420agtgtagtta ctttgctttg tggagtgaag agttcagatc agctctatga attactaaat 480atctctcaaa gcagatggga gatctcaact gaacaggaac ctgaagacta catctctgta 540atccagagta tcatattcaa agattctgaa gactactttg aggtcaacat tccaacagac 600ctacgagcaa aacattctgg ggaaataagt gagagaaagg aaattgaaga actatcagaa 660gcttcaagaa acaccatacc actagcagtg gtgcttccca ctgaaattcc atgtgagaat 720cctggtgaaa tattcataat tttgagagat gaagtaattg gtgatactgt agaggttgaa 780tttacatcaa gtaataagcg cattagaaca cggccagccc tttggaataa gaaagtctgg 840tgcatgaaag ctttagagtt tcctgctggt tcagtccatg tcaatgtcta ctgtgatgga 900atcgttaaag ctacaaccaa aattaagtac tacccaacag caaaggcaaa ggaatgccta 960ttcagaatgg cagattcagg agagagtttg tgccagaata gcattgaaga acttgatggt 1020gtccttacat ccatattcaa acatgagata ccatattatg agttccagtc tcttcaaact 1080gaaatttgtt ctcaaaacaa atatactcat ttcaaagaac ttccaactct tctccactgt 1140gcagcaaaat ttggcttaaa gaacctggct attcatttgc ttcaatgttc aggagcaacc 1200tgggcatcta agatgaaaaa tatggagggt tcagaccccg cacatattgc tgaaaggcat 1260ggtcacaaag aactcaagaa aatcttcgaa gacttttcaa tccaagaaat tgacataaat 1320aatgagcaag aaaatgatta tgaagaggat attgcctcat tttccacata tattccttcc 1380acacagaacc cagcatttca tcatgaaagc agaaagacat acgggcagag tgcagatgga 1440gctgaggcaa atgaaatgga aggggaagga aaacagaatg gatcaggcat ggagaccaaa 1500cacagcccac tagaggttgg cagtgagagt tctgaagacc agtatgatga cttgtatgtg 1560ttcattcctg gtgctgatcc agaaaataat tcacaagagc cactcatgag cagcagacct 1620cctctccccc cgccgcgacc tgtagctaat gccttccaac tggaaagacc tcacttcacc 1680ttaccaggga caatggtgga aggccaaatg gaaagaagtc aaaactgggg tcatcctggt 1740gttagacaag aaacaggaga tgaacccaaa ggagaaaaag agaagaaaga agaggaaaaa 1800gagcaggagg aggaagaaga cccatatact tttgctgaga ttgatgacag tgaatatgac 1860atgatattgg ccaatctgag tataaagaaa aaaactggga gtcggtcttt cattataaat 1920agacctcctg cccccacacc ccgacccaca agtatacctc caaaagagga aactacacct 1980tacatagctc aagtgtttca acaaaagaca gccagaagac aatctgatga tgacaagttc 2040cgtggtcttc ctaagaaaca agacagagct cggatagaga gtccagcctt ttctactctc 2100aggggctgtc taactgatgg tcaggaagaa ctcatcctcc tgcaggagaa agtaaagaat 2160gggaaaatgt ctatggatga agctctggag aaatttaaac actggcagat gggaaaaagt 2220ggcctggaaa tgattcagca ggagaaatta cgacaactac gagactgcat tattgggaaa 2280aggccagaag aagaaaatgt ctataataaa ctcaccattg tgcaccatcc aggtggtaag 2340gaaactgccc acaatgaaaa taagttttat aatgtacact tcagcaataa gcttcctgct 2400cgaccccaag ttgaaaagga atttggtttc tgttgcaaga aagatcatta aagaaggtta 2460ttataatgaa actcacgaat ctacggacat tttgctttca gggtgaagca agcttgaatt 2520tggattgcct gctttcttta aagcgaattc atactataac agcagaaaca aaacttcaga 2580tttcagaatt tgttattggc aaaatttatt ctcattatac ctgcttcata tgggtatatt 2640actattaaaa cagaatacca tagagtaatt gcattatttg aaaattctct cattttacaa 2700tgcacttcac caatgaaaca gctaatttcc attttgaaaa ttaaaagaaa acagcacaga 2760gaagttaaat gcggtgtagc aaagttatgg ggtctgcttg agggcactaa cctcaacaga 2820ttattcctcc tctccttaga ataaccatga aaatacaaat ttacttagca catttttgct 2880ttttaagtag ctggttcatt ttctgaattt ctcacattca gagttccagt cattattgtt 2940acatcatgtt tgcagaaacc ttgtcttatt tagtgtctat ttgcatataa ccctgaaaac 3000attattattt gaaaactttt ctatatctca aattaatata cattttcata acctaccttt 3060gtattaagac ttgcaatttt atcaatctat tatttcttag aaacaattta ctagcttaga 3120atagaaagca atgttatcgt catataattt tcatgtacaa atgccacaaa taaattgaat 3180gtttaaagct aaaaa 319582816PRTHomo sapiens 82Gly Arg Glu Asn Arg Gly Glu Ser Leu Ala Gly Arg Val Gln Val Ala1 5 10 15Leu Gly Gly Gln Gln Cys Ala Gly Pro Ser Ala Ser Thr Ala Thr Met 20 25 30Leu Pro Ala Ala Pro Gly Lys Gly Leu Gly Ser Pro Asp Pro Ala Pro 35 40 45Cys Gly Pro Ala Pro Pro Gly Asn Thr Lys Asp Ile Ile Met Ile Tyr 50 55 60Glu Glu Asp Ala Glu Glu Trp Ala Leu Tyr Leu Thr Glu Val Phe Leu65 70 75 80His Val Val Lys Arg Glu Ala Ile Leu Leu Tyr Arg Leu Glu Asn Phe

85 90 95Ser Phe Arg His Leu Glu Leu Leu Asn Leu Thr Ser Tyr Lys Cys Lys 100 105 110Leu Leu Ile Leu Ser Asn Ser Leu Leu Arg Asp Leu Thr Pro Lys Lys 115 120 125Cys Gln Phe Leu Glu Lys Ile Leu His Ser Pro Lys Ser Val Val Thr 130 135 140Leu Leu Cys Gly Val Lys Ser Ser Asp Gln Leu Tyr Glu Leu Leu Asn145 150 155 160Ile Ser Gln Ser Arg Trp Glu Ile Ser Thr Glu Gln Glu Pro Glu Asp 165 170 175Tyr Ile Ser Val Ile Gln Ser Ile Ile Phe Lys Asp Ser Glu Asp Tyr 180 185 190Phe Glu Val Asn Ile Pro Thr Asp Leu Arg Ala Lys His Ser Gly Glu 195 200 205Ile Ser Glu Arg Lys Glu Ile Glu Glu Leu Ser Glu Ala Ser Arg Asn 210 215 220Thr Ile Pro Leu Ala Val Val Leu Pro Thr Glu Ile Pro Cys Glu Asn225 230 235 240Pro Gly Glu Ile Phe Ile Ile Leu Arg Asp Glu Val Ile Gly Asp Thr 245 250 255Val Glu Val Glu Phe Thr Ser Ser Asn Lys Arg Ile Arg Thr Arg Pro 260 265 270Ala Leu Trp Asn Lys Lys Val Trp Cys Met Lys Ala Leu Glu Phe Pro 275 280 285Ala Gly Ser Val His Val Asn Val Tyr Cys Asp Gly Ile Val Lys Ala 290 295 300Thr Thr Lys Ile Lys Tyr Tyr Pro Thr Ala Lys Ala Lys Glu Cys Leu305 310 315 320Phe Arg Met Ala Asp Ser Gly Glu Ser Leu Cys Gln Asn Ser Ile Glu 325 330 335Glu Leu Asp Gly Val Leu Thr Ser Ile Phe Lys His Glu Ile Pro Tyr 340 345 350Tyr Glu Phe Gln Ser Leu Gln Thr Glu Ile Cys Ser Gln Asn Lys Tyr 355 360 365Thr His Phe Lys Glu Leu Pro Thr Leu Leu His Cys Ala Ala Lys Phe 370 375 380Gly Leu Lys Asn Leu Ala Ile His Leu Leu Gln Cys Ser Gly Ala Thr385 390 395 400Trp Ala Ser Lys Met Lys Asn Met Glu Gly Ser Asp Pro Ala His Ile 405 410 415Ala Glu Arg His Gly His Lys Glu Leu Lys Lys Ile Phe Glu Asp Phe 420 425 430Ser Ile Gln Glu Ile Asp Ile Asn Asn Glu Gln Glu Asn Asp Tyr Glu 435 440 445Glu Asp Ile Ala Ser Phe Ser Thr Tyr Ile Pro Ser Thr Gln Asn Pro 450 455 460Ala Phe His His Glu Ser Arg Lys Thr Tyr Gly Gln Ser Ala Asp Gly465 470 475 480Ala Glu Ala Asn Glu Met Glu Gly Glu Gly Lys Gln Asn Gly Ser Gly 485 490 495Met Glu Thr Lys His Ser Pro Leu Glu Val Gly Ser Glu Ser Ser Glu 500 505 510Asp Gln Tyr Asp Asp Leu Tyr Val Phe Ile Pro Gly Ala Asp Pro Glu 515 520 525Asn Asn Ser Gln Glu Pro Leu Met Ser Ser Arg Pro Pro Leu Pro Pro 530 535 540Pro Arg Pro Val Ala Asn Ala Phe Gln Leu Glu Arg Pro His Phe Thr545 550 555 560Leu Pro Gly Thr Met Val Glu Gly Gln Met Glu Arg Ser Gln Asn Trp 565 570 575Gly His Pro Gly Val Arg Gln Glu Thr Gly Asp Glu Pro Lys Gly Glu 580 585 590Lys Glu Lys Lys Glu Glu Glu Lys Glu Gln Glu Glu Glu Glu Asp Pro 595 600 605Tyr Thr Phe Ala Glu Ile Asp Asp Ser Glu Tyr Asp Met Ile Leu Ala 610 615 620Asn Leu Ser Ile Lys Lys Lys Thr Gly Ser Arg Ser Phe Ile Ile Asn625 630 635 640Arg Pro Pro Ala Pro Thr Pro Arg Pro Thr Ser Ile Pro Pro Lys Glu 645 650 655Glu Thr Thr Pro Tyr Ile Ala Gln Val Phe Gln Gln Lys Thr Ala Arg 660 665 670Arg Gln Ser Asp Asp Asp Lys Phe Arg Gly Leu Pro Lys Lys Gln Asp 675 680 685Arg Ala Arg Ile Glu Ser Pro Ala Phe Ser Thr Leu Arg Gly Cys Leu 690 695 700Thr Asp Gly Gln Glu Glu Leu Ile Leu Leu Gln Glu Lys Val Lys Asn705 710 715 720Gly Lys Met Ser Met Asp Glu Ala Leu Glu Lys Phe Lys His Trp Gln 725 730 735Met Gly Lys Ser Gly Leu Glu Met Ile Gln Gln Glu Lys Leu Arg Gln 740 745 750Leu Arg Asp Cys Ile Ile Gly Lys Arg Pro Glu Glu Glu Asn Val Tyr 755 760 765Asn Lys Leu Thr Ile Val His His Pro Gly Gly Lys Glu Thr Ala His 770 775 780Asn Glu Asn Lys Phe Tyr Asn Val His Phe Ser Asn Lys Leu Pro Ala785 790 795 800Arg Pro Gln Val Glu Lys Glu Phe Gly Phe Cys Cys Lys Lys Asp His 805 810 815833544DNAHomo sapiens 83attttggttt ctcttcaaga attaacaaac cacttactct tgaattctct tctagttaac 60acaggcatca ctacttccaa ttgatctcag gatgtgggat cctcatacac attttgaaca 120aaatcctctg tttcagcaag gaattcatat ttgcatatgg tgaagatggt ttctgaagtg 180agatcagaag tagagcttct aatgaccccc agaagcactg agtgaccaag tgacatacct 240gccaggccca ttgtgtccat cgctctcaga gcagctgggg attgtgcttg gctcccagag 300ctatggtgca aaaggcgggg tcgctagggc cactcaggga aagagaaccc agaaacatgg 360catgctgaca aaaggtagtc cctgcttatc cagcttcact ttctgctgat ttagttaccc 420atggtcaact gccatctgaa aataggaaat acaaaagata taataatgat atatgaagaa 480gatgctgagg aatgggctct gtacttgaca gaagtatttt tacatgttgt gaaaagggaa 540gccatcctgt tatatcgctt ggagaatttc tcttttcggc atttggagtt gctgaactta 600acgtcttaca aatgtaaact tttgatatta tcaaatagcc tgcttagaga cctaactcca 660aagaaatgtc agtttctgga aaagatactt cattcaccaa aaagtgtagt tactttgctt 720tgtggagtga agagttcaga tcagctctat gaattactaa atatctctca aagcagatgg 780gagatctcaa ctgaacagga acctgaagac tacatctctg taatccagag tatcatattc 840aaagattctg aagactactt tgaggtcaac attccaacag acctacgagc aaaacattct 900ggggaaataa gtgagagaaa ggaaattgaa gaactatcag aagcttcaag aaacaccata 960ccactagcag tggtgcttcc cactgaaatt ccatgtgagg atcctggtga aatattcata 1020attttgagag atgaagtaat tggtgatact gtagaggttg aatttacatc aagtaataag 1080cgcattagaa cacggccagc cctttggaat aagaaagtct ggtgcatgaa agctttagag 1140tttcctgctg gttcagtcca tgtcaatgtc tactgtgatg gaatcgttaa agctacaacc 1200aaaattaagt actacccaac agcaaaggca aaggaatgcc tattcagaat ggcagattca 1260ggagagagtt tgtgccagaa tagcattgaa gaacttgatg gtgtccttac atccatattc 1320aaacatgaga taccatatta tgagttccag tctcttcaaa ctgaaatttg ttctcaaaac 1380aaatatactc atttcaaaga acttccaact cttctccact gtgcagcaaa atttggctta 1440aagaacctgg ctattcattt gcttcaatgt tcaggagcaa cctgggcatc taagatgaaa 1500aatatggagg gttcagaccc cacacatatt gctgaaaggc atggtcacaa agaactcaag 1560aaaatcttcg aagacttttc aatccaagaa attgacataa ataatgagca agaaaatgat 1620tatgaagagg atattgcctc attttccaca tatattcctt ccacacagaa cccagcattt 1680catcatgaaa gcaggaagac atacgggcag agtgcagatg gagctgaggc aaatgaaatg 1740gaaggggaag gaaaacagaa tggatcaggc atggagacca aacacagccc actagaggtt 1800ggcagtgaga gttctgaaga ccagtatgat gacttgtatg tgttcattcc tggtgctgat 1860ccagaaaata attcacaaga gccactcatg agcagcagac ctcctctccc cccgccgcga 1920cctgtagcta atgccttcca actggaaaga cctcacttca ccttaccagg gacaatggtg 1980gaaggccaaa tggaaagaag tcaaaactgg ggtcatcctg gtgttagaca agaaacagga 2040gatgaaccca aaggagaaaa agagaagaaa gaagaggaaa aagagcagga ggaggaagaa 2100gacccatata cttttgctga gattgatgac agtgaatatg acatgatatt ggccaatctg 2160agtataaaga aaaaaactgg gagtcggtct ttcattataa atagacctcc tgcccccaca 2220ccccgaccca caagtatacc tccaaaagag gaaactacgc cttacatagc tcaagtgttt 2280caacaaaaga cagccagaag acaatctgat gatgacaagt tccgtggtct tcctaagaaa 2340caagacagag ctcggataga gagtccagcc ttttctactc tcaggggctg tctaactgat 2400ggtcaggaag aactcatcct cctgcaggag aaagtaaaga atgggaaaat gtctatggat 2460gaagctctgg agaaatttaa acactggcag atgggaaaaa gtggcctgga aatgattcag 2520caggagaaat tacgacaact acgagactgc attattggga aaaggccaga agaagaaaat 2580gtctataata aactcaccat tgtgcaccat ccaggtggta aggaaactgc ccacaatgaa 2640aataagtttt ataatgtaca cttcagcaat aagcttcctg ctcgacccca agttgaaaag 2700gaatttggtt tctgttgcaa gaaagatcat taaagaaggt tattataatg aaactcacga 2760atctacggac attttgcttt cagggtgaag caagcttgaa tttggattgc ctgctctctt 2820taaagcgaat tcatactatg acagcagaaa caaaacttca gatttcagaa tttgttattg 2880gcaaaattta ttctcattat acctgcttca tatgggtata ttactattaa aacagaatac 2940catagagtaa ttgcattatt tgaaaattct ctcattttac aatgcacttc accaatgaaa 3000cagctaattt ccattttgaa aattaaaaga aaacagcaca gagaagttaa atgcggtgta 3060gcaaagttat ggggtctgct tgagggcact aacctcaaca gattattcct cccctcctta 3120gaataaccat gaaaatacaa atttacttag cacatttctg ctttttaagt agctggttca 3180ttttctgaat ttctcacatt cagagttcca gtcattattg ttacatcatg tttgcagaaa 3240ccttgtctta tttagtgtct atttgcatat aaccctgaaa acattattat ttgaaaactt 3300ttctatatct caaattaata tacattttca taacctacct ttgtattaag acttgcaatt 3360ttatcaatct attatttctt agaaacaatt tactagctta gaatagaaag caatgttatc 3420gtcatataat tttcatgtac aaatgccaca aataaattga atgtttaaag ctatgtctga 3480gtttttaaag taaatttata agaattagcc aataaaattg cttctcggcc ttttggctaa 3540gatc 354484770PRTHomo sapiens 84Met Val Asn Cys His Leu Lys Ile Gly Asn Thr Lys Asp Ile Ile Met1 5 10 15Ile Tyr Glu Glu Asp Ala Glu Glu Trp Ala Leu Tyr Leu Thr Glu Val 20 25 30Phe Leu His Val Val Lys Arg Glu Ala Ile Leu Leu Tyr Arg Leu Glu 35 40 45Asn Phe Ser Phe Arg His Leu Glu Leu Leu Asn Leu Thr Ser Tyr Lys 50 55 60Cys Lys Leu Leu Ile Leu Ser Asn Ser Leu Leu Arg Asp Leu Thr Pro65 70 75 80Lys Lys Cys Gln Phe Leu Glu Lys Ile Leu His Ser Pro Lys Ser Val 85 90 95Val Thr Leu Leu Cys Gly Val Lys Ser Ser Asp Gln Leu Tyr Glu Leu 100 105 110Leu Asn Ile Ser Gln Ser Arg Trp Glu Ile Ser Thr Glu Gln Glu Pro 115 120 125Glu Asp Tyr Ile Ser Val Ile Gln Ser Ile Ile Phe Lys Asp Ser Glu 130 135 140Asp Tyr Phe Glu Val Asn Ile Pro Thr Asp Leu Arg Ala Lys His Ser145 150 155 160Gly Glu Ile Ser Glu Arg Lys Glu Ile Glu Glu Leu Ser Glu Ala Ser 165 170 175Arg Asn Thr Ile Pro Leu Ala Val Val Leu Pro Thr Glu Ile Pro Cys 180 185 190Glu Asp Pro Gly Glu Ile Phe Ile Ile Leu Arg Asp Glu Val Ile Gly 195 200 205Asp Thr Val Glu Val Glu Phe Thr Ser Ser Asn Lys Arg Ile Arg Thr 210 215 220Arg Pro Ala Leu Trp Asn Lys Lys Val Trp Cys Met Lys Ala Leu Glu225 230 235 240Phe Pro Ala Gly Ser Val His Val Asn Val Tyr Cys Asp Gly Ile Val 245 250 255Lys Ala Thr Thr Lys Ile Lys Tyr Tyr Pro Thr Ala Lys Ala Lys Glu 260 265 270Cys Leu Phe Arg Met Ala Asp Ser Gly Glu Ser Leu Cys Gln Asn Ser 275 280 285Ile Glu Glu Leu Asp Gly Val Leu Thr Ser Ile Phe Lys His Glu Ile 290 295 300Pro Tyr Tyr Glu Phe Gln Ser Leu Gln Thr Glu Ile Cys Ser Gln Asn305 310 315 320Lys Tyr Thr His Phe Lys Glu Leu Pro Thr Leu Leu His Cys Ala Ala 325 330 335Lys Phe Gly Leu Lys Asn Leu Ala Ile His Leu Leu Gln Cys Ser Gly 340 345 350Ala Thr Trp Ala Ser Lys Met Lys Asn Met Glu Gly Ser Asp Pro Thr 355 360 365His Ile Ala Glu Arg His Gly His Lys Glu Leu Lys Lys Ile Phe Glu 370 375 380Asp Phe Ser Ile Gln Glu Ile Asp Ile Asn Asn Glu Gln Glu Asn Asp385 390 395 400Tyr Glu Glu Asp Ile Ala Ser Phe Ser Thr Tyr Ile Pro Ser Thr Gln 405 410 415Asn Pro Ala Phe His His Glu Ser Arg Lys Thr Tyr Gly Gln Ser Ala 420 425 430Asp Gly Ala Glu Ala Asn Glu Met Glu Gly Glu Gly Lys Gln Asn Gly 435 440 445Ser Gly Met Glu Thr Lys His Ser Pro Leu Glu Val Gly Ser Glu Ser 450 455 460Ser Glu Asp Gln Tyr Asp Asp Leu Tyr Val Phe Ile Pro Gly Ala Asp465 470 475 480Pro Glu Asn Asn Ser Gln Glu Pro Leu Met Ser Ser Arg Pro Pro Leu 485 490 495Pro Pro Pro Arg Pro Val Ala Asn Ala Phe Gln Leu Glu Arg Pro His 500 505 510Phe Thr Leu Pro Gly Thr Met Val Glu Gly Gln Met Glu Arg Ser Gln 515 520 525Asn Trp Gly His Pro Gly Val Arg Gln Glu Thr Gly Asp Glu Pro Lys 530 535 540Gly Glu Lys Glu Lys Lys Glu Glu Glu Lys Glu Gln Glu Glu Glu Glu545 550 555 560Asp Pro Tyr Thr Phe Ala Glu Ile Asp Asp Ser Glu Tyr Asp Met Ile 565 570 575Leu Ala Asn Leu Ser Ile Lys Lys Lys Thr Gly Ser Arg Ser Phe Ile 580 585 590Ile Asn Arg Pro Pro Ala Pro Thr Pro Arg Pro Thr Ser Ile Pro Pro 595 600 605Lys Glu Glu Thr Thr Pro Tyr Ile Ala Gln Val Phe Gln Gln Lys Thr 610 615 620Ala Arg Arg Gln Ser Asp Asp Asp Lys Phe Arg Gly Leu Pro Lys Lys625 630 635 640Gln Asp Arg Ala Arg Ile Glu Ser Pro Ala Phe Ser Thr Leu Arg Gly 645 650 655Cys Leu Thr Asp Gly Gln Glu Glu Leu Ile Leu Leu Gln Glu Lys Val 660 665 670Lys Asn Gly Lys Met Ser Met Asp Glu Ala Leu Glu Lys Phe Lys His 675 680 685Trp Gln Met Gly Lys Ser Gly Leu Glu Met Ile Gln Gln Glu Lys Leu 690 695 700Arg Gln Leu Arg Asp Cys Ile Ile Gly Lys Arg Pro Glu Glu Glu Asn705 710 715 720Val Tyr Asn Lys Leu Thr Ile Val His His Pro Gly Gly Lys Glu Thr 725 730 735Ala His Asn Glu Asn Lys Phe Tyr Asn Val His Phe Ser Asn Lys Leu 740 745 750Pro Ala Arg Pro Gln Val Glu Lys Glu Phe Gly Phe Cys Cys Lys Lys 755 760 765Asp His 77085564DNAHomo sapiens 85cctctcagaa aactgagcat actagcaaga cagctcttct tgaaaaaaaa aatatgtata 60cacaaatata tacgtatatc tatatatacg tatgtatata cacacatgta tattcttcct 120tgattgtgta gctgtccaaa ataataacat atatagaggg agctgtattc ctttatacaa 180atctgatggc tcctgcagca ctttttcctt ctgaaaatat ttacattttg ctaacctagt 240ttgttacttt aaaaatcagt tttgatgaaa ggagggaaaa gcagatggac ttgaaaaaga 300tccaagctcc tattagaaaa ggtatgaaaa tctttatagt aaaatttttt ataaactaaa 360gttgtacctt ttaatatgta gtaaactctc atttatttgg ggttcgctct tggatctcat 420ccatccattg tgttctcttt aatgctgcct gccttttgag gcattcactg ccctagacaa 480tgccaccaga gatagtgggg gaaatgccag atgaaaccaa ctcttgctct cactagttgt 540cagcttctct ggataagtga ccac 564865024DNAHomo sapiens 86agcggagtgg gtcctgcctg tgacgcgcgg cggcggtcgg tcctgcctgt aacggcggcg 60gcggctgctg ctccagacac ctgcggcggc ggcggcgacc acgcggcggg cgcggagatg 120tggcccctgg tagcggcgct gttgctgggc tcggcgtgct gcggatcagc tcagctacta 180tttaataaaa caaaatctgt agaattcacg ttttgtaatg acactgtcgt cattccatgc 240tttgttacta atatggaggc acaaaacact actgaagtat acgtaaagtg gaaatttaaa 300ggaagagata tttacacctt tgatggagct ctaaacaagt ccactgtccc cactgacttt 360agtagtgcaa aaattgaagt ctcacaatta ctaaaaggag atgcctcttt gaagatggat 420aagagtgatg ctgtctcaca cacaggaaac tacacttgtg aagtaacaga attaaccaga 480gaaggtgaaa cgatcatcga gctaaaatat cgtgttgttt catggttttc tccaaatgaa 540aatattctta ttgttatttt cccaattttt gctatactcc tgttctgggg acagtttggt 600attaaaacac ttaaatatag atccggtggt atggatgaga aaacaattgc tttacttgtt 660gctggactag tgatcactgt cattgtcatt gttggagcca ttcttttcgt cccaggtgaa 720tattcattaa agaatgctac tggccttggt ttaattgtga cttctacagg gatattaata 780ttacttcact actatgtgtt tagtacagcg attggattaa cctccttcgt cattgccata 840ttggttattc aggtgatagc ctatatcctc gctgtggttg gactgagtct ctgtattgcg 900gcgtgtatac caatgcatgg ccctcttctg atttcaggtt tgagtatctt agctctagca 960caattacttg gactagttta tatgaaattt gtggcttcca atcagaagac tatacaacct 1020cctaggaata actgaagtga agtgatggac tccgatttgg agagtagtaa gacgtgaaag 1080gaatacactt gtgtttaagc accatggcct tgatgattca ctgttgggga gaagaaacaa 1140gaaaagtaac tggttgtcac ctatgagacc cttacgtgat tgttagttaa gtttttattc 1200aaagcagctg taatttagtt aataaaataa ttatgatcta tgttgtttgc ccaattgaga 1260tccagttttt tgttgttatt tttaatcaat taggggcaat agtagaatgg acaatttcca 1320agaatgatgc ctttcaggtc ctagggcctc tggcctctag gtaaccagtt taaattggtt 1380cagggtgata actacttagc actgccctgg tgattaccca gagatatcta tgaaaaccag 1440tggcttccat caaacctttg ccaactcagg ttcacagcag ctttgggcag ttatggcagt 1500atggcattag ctgagaggtg tctgccactt ctgggtcaat ggaataataa attaagtaca 1560ggcaggaatt tggttgggag catcttgtat gatctccgta tgatgtgata ttgatggaga 1620tagtggtcct cattcttggg ggttgccatt cccacattcc

cccttcaaca aacagtgtaa 1680caggtccttc ccagatttag ggtactttta ttgatggata tgttttcctt ttattcacat 1740aaccccttga aaccctgtct tgtcctcctg ttacttgctt ctgctgtaca agatgtagca 1800ccttttctcc tctttgaaca tggtctagtg acacggtagc accagttgca ggaaggagcc 1860agacttgttc tcagagcact gtgttcacac ttttcagcaa aaatagctat ggttgtaaca 1920tatgtattcc cttcctctga tttgaaggca aaaatctaca gtgtttcttc acttcttttc 1980tgatctgggg catgaaaaaa gcaagattga aatttgaact atgagtctcc tgcatggcaa 2040caaaatgtgt gtcaccatca ggccaacagg ccagcccttg aatggggatt tattactgtt 2100gtatctatgt tgcatgataa acattcatca ccttcctcct gtagtcctgc ctcgtactcc 2160ccttccccta tgattgaaaa gtaaacaaaa cccacatttc ctatcctggt tagaagaaaa 2220ttaatgttct gacagttgtg atcgcctgga gtacttttag acttttagca ttcgtttttt 2280acctgtttgt ggatgtgtgt ttgtatgtgc atacgtatga gataggcaca tgcatcttct 2340gtatggacaa aggtggggta cctacaggag agcaaaggtt aattttgtgc ttttagtaaa 2400aacatttaaa tacaaagttc tttattgggt ggaattatat ttgatgcaaa tatttgatca 2460cttaaaactt ttaaaacttc taggtaattt gccacgcttt ttgactgctc accaataccc 2520tgtaaaaata cgtaattctt cctgtttgtg taataagata ttcatatttg tagttgcatt 2580aataatagtt atttcttagt ccatcagatg ttcccgtgtg cctcttttat gccaaattga 2640ttgtcatatt tcatgttggg accaagtagt ttgcccatgg caaacctaaa tttatgacct 2700gctgaggcct ctcagaaaac tgagcatact agcaagacag ctcttcttga aaaaaaaaat 2760atgtatacac aaatatatac gtatatctat atatacgtat gtatatacac acatgtatat 2820tcttccttga ttgtgtagct gtccaaaata ataacatata tagagggagc tgtattcctt 2880tatacaaatc tgatggctcc tgcagcactt tttccttctg aaaatattta cattttgcta 2940acctagtttg ttactttaaa aatcagtttt gatgaaagga gggaaaagca gatggacttg 3000aaaaagatcc aagctcctat tagaaaaggt atgaaaatct ttatagtaaa attttttata 3060aactaaagtt gtacctttta atatgtagta aactctcatt tatttggggt tcgctcttgg 3120atctcatcca tccattgtgt tctctttaat gctgcctgcc ttttgaggca ttcactgccc 3180tagacaatgc caccagagat agtgggggaa atgccagatg aaaccaactc ttgctctcac 3240tagttgtcag cttctctgga taagtgacca cagaagcagg agtcctcctg cttgggcatc 3300attgggccag ttccttctct ttaaatcaga tttgtaatgg ctcccaaatt ccatcacatc 3360acatttaaat tgcagacagt gttttgcaca tcatgtatct gttttgtccc ataatatgct 3420ttttactccc tgatcccagt ttctgctgtt gactcttcca ttcagtttta tttattgtgt 3480gttctcacag tgacaccatt tgtccttttc tgcaacaacc tttccagcta cttttgccaa 3540attctatttg tcttctcctt caaaacattc tcctttgcag ttcctcttca tctgtgtagc 3600tgctcttttg tctcttaact taccattcct atagtacttt atgcatctct gcttagttct 3660attagttttt tggccttgct cttctccttg attttaaaat tccttctata gctagagctt 3720ttctttcttt cattctctct tcctgcagtg ttttgcatac atcagaagct aggtacataa 3780gttaaatgat tgagagttgg ctgtatttag atttatcact ttttaatagg gtgagcttga 3840gagttttctt tctttctgtt tttttttttt ttttttttga ctaatttcac atgctctaaa 3900aaccttcaaa ggtgattatt tttctcctgg aaactccagg tccattctgt ttaaatccct 3960aagaatgtca gaattaaaat aacagggcta tcgcgtaatt ggaaatattt cttttttcag 4020gatgctatag tcaatttagt aagtgaccac caaattgtta tttgcactaa caaagctcaa 4080aacacgataa gtttactcct ccatctcagt aataaaaatt aagctgtaat caaccttcta 4140ggtttctctt gtcttaaaat gggtattcaa aaatggggat ctgtggtgta tgtatggaaa 4200cacatactcc ttaatttacc tgttgttgga aactggagaa atgattgtcg ggcaaccgtt 4260tattttttat tgtattttat ttggttgagg gattttttta taaacagttt tacttgtgtc 4320atattttaaa attactaact gccatcacct gctggggtcc tttgttaggt cattttcagt 4380gactaatagg gataatccag gtaactttga agagatgagc agtgagtgac caggcagttt 4440ttctgccttt agctttgaca gttcttaatt aagatcattg aagaccagct ttctcataaa 4500tttctctttt tgaaaaaaag aaagcatttg tactaagctc ctctgtaaga caacatctta 4560aatcttaaaa gtgttgttat catgactggt gagagaagaa aacattttgt ttttattaaa 4620tggagcatta tttacaaaaa gccattgttg agaattagat cccacatcgt ataaatatct 4680attaaccatt ctaaataaag agaactccag tgttgctatg tgcaagatcc tctcttggag 4740cttttttgca tagcaattaa aggtgtgcta tttgtcagta gccatttttt tgcagtgatt 4800tgaagaccaa agttgtttta cagctgtgtt accgttaaag gttttttttt ttatatgtat 4860taaatcaatt tatcactgtt taaagctttg aatatctgca atctttgcca aggtactttt 4920ttatttaaaa aaaaacataa ctttgtaaat attaccctgt aatattatat atacttaata 4980aaacatttta agctaaaaaa aaaaaaacaa aaaaaaaaaa aaaa 502487305PRTHomo sapiens 87Met Trp Pro Leu Val Ala Ala Leu Leu Leu Gly Ser Ala Cys Cys Gly1 5 10 15Ser Ala Gln Leu Leu Phe Asn Lys Thr Lys Ser Val Glu Phe Thr Phe 20 25 30Cys Asn Asp Thr Val Val Ile Pro Cys Phe Val Thr Asn Met Glu Ala 35 40 45Gln Asn Thr Thr Glu Val Tyr Val Lys Trp Lys Phe Lys Gly Arg Asp 50 55 60Ile Tyr Thr Phe Asp Gly Ala Leu Asn Lys Ser Thr Val Pro Thr Asp65 70 75 80Phe Ser Ser Ala Lys Ile Glu Val Ser Gln Leu Leu Lys Gly Asp Ala 85 90 95Ser Leu Lys Met Asp Lys Ser Asp Ala Val Ser His Thr Gly Asn Tyr 100 105 110Thr Cys Glu Val Thr Glu Leu Thr Arg Glu Gly Glu Thr Ile Ile Glu 115 120 125Leu Lys Tyr Arg Val Val Ser Trp Phe Ser Pro Asn Glu Asn Ile Leu 130 135 140Ile Val Ile Phe Pro Ile Phe Ala Ile Leu Leu Phe Trp Gly Gln Phe145 150 155 160Gly Ile Lys Thr Leu Lys Tyr Arg Ser Gly Gly Met Asp Glu Lys Thr 165 170 175Ile Ala Leu Leu Val Ala Gly Leu Val Ile Thr Val Ile Val Ile Val 180 185 190Gly Ala Ile Leu Phe Val Pro Gly Glu Tyr Ser Leu Lys Asn Ala Thr 195 200 205Gly Leu Gly Leu Ile Val Thr Ser Thr Gly Ile Leu Ile Leu Leu His 210 215 220Tyr Tyr Val Phe Ser Thr Ala Ile Gly Leu Thr Ser Phe Val Ile Ala225 230 235 240Ile Leu Val Ile Gln Val Ile Ala Tyr Ile Leu Ala Val Val Gly Leu 245 250 255Ser Leu Cys Ile Ala Ala Cys Ile Pro Met His Gly Pro Leu Leu Ile 260 265 270Ser Gly Leu Ser Ile Leu Ala Leu Ala Gln Leu Leu Gly Leu Val Tyr 275 280 285Met Lys Phe Val Ala Ser Asn Gln Lys Thr Ile Gln Pro Pro Arg Asn 290 295 300Asn305886790DNAHomo sapiens 88ggagatattt tcttgttcaa tttaaggaga ggtaaatttg gtatcaatag aaaaaatgtt 60tctgaaaaat ttaaaccctg gaaatgtatt tatggcatgg agtcagatgt ttcagggaga 120gaagaacaaa tcaagaagca ttgcaagtat gctcatatgg aatgcttaag gcttgtggtt 180aaaaaatata tatatatggc tgtcaatgtc ttaggctcat ggtagcagca gaaatcgtaa 240taattctttt gtcacatggg ttatatccat attggagaga attaactcag gtgaaattaa 300cttgtacact gtttggtttt ataatattta gagggatcac aactgactga tgtccctttg 360aagtaccatt cttcataaat cttttttttt tcagaatggg ccagccaact gtgacatccc 420ttggatcgga gatttagaac tagaaagtat tctttctaca ttattaggga agaaaaggag 480ttacttggcg gttagcaata ttctattttg ttttgttttg tttttagaga cagggtctca 540ttatgttgac caggctggcc tcgagctcct gggctcaagc aatgctccca cctcagcctc 600ccaagtagct gggactacgg gcatgtgcca ctacacctgg cagtgtttat tctgataaat 660acatttatga gctcaaaaat gtaactctaa aaccttatct ctgaacttcc atattaccat 720cagaaattta gatagttgtt tagttctctt tttctttgta gaacatagat ataaggcatg 780gtttcattga agtcagttgt atatacatgt aactatcctg atgttcccaa ataaagctct 840gtatttatgc ttagtttatt ggggaggctg ctaaatgtag tgcatcccaa cccattttac 900cctgttctac tttaaaaaga ggttggcttc ttgtttggat acaaggacca agtcactccc 960ccaggttcct ccacagtaag ggaggcctat ttaaagccgc ccatggcact aacagaaact 1020ggactcctat gagctcagat acataactgg gcctcacagg ggtgggacag tatgtagtct 1080aggaattgga aggatccatt ccatatcaaa gaactgaagc atcgtgttgc cctctcagca 1140gcaagagtaa ggtgatgccc ctgtcagtta tagttcctga gttcctctgt ctttgattct 1200ttgcctatta gccagctagc tcaccctctt gtttatgcca ctgtttttta tcctattcat 1260gccttctcac agacaacttt tcttacctac agctttggac tcatccttgt ctcctttctg 1320tttctttttc actttccctt cccatcacca actttctggg tttttttctg tttcttctta 1380gagtccagtg gcagggagaa acttgtcagt ccagtctgtt gccatttttc ctgtttgaga 1440aagactcacc agcttttggc tggctcacag attggctttc cttgggtcag gacccaccct 1500tttccctgcc agctttggaa gcttgacaga attcgagtgt gcagtggtgg taaataaata 1560gtaaggaaca cagagcagtc ctggaggcgt gcctccatct gctgatgaga aaatccagtg 1620ctgtcatcca gcccaggtcc cagcggaatg ggcctctctg ttcagtagga tccccctcct 1680gctgagtggt tcatggcatg tttctgttca acgcttttcc atctgtagga ttcttattct 1740gtatttattt gtttttttgg gtttttttat tttttgagat ggagtctcgc tctgtcgccc 1800aggctggagt gcagtggcac gaccccagct cgctgcagcc tctgcctccc aggacgaggg 1860agatcctccc acctcagcct tccacgtagc tgggactaca ggcatgcacc acaggcatgc 1920accaccacgc cagctaattt ttgtattttt ggtagagaca gggttgcatc atgttgccca 1980ggctggtctt gaatgcctga gctcaagcaa tctatttgcc ttggcctccc aaagtgctgg 2040gattacaggc atgagccacc acggccagcc ttctcatttg ttttttttat aaggaagcta 2100tctcttcttc cctccccaac tagggtattc tttttccctt tcgtcacttt gctcatgtac 2160tgtattcctt caacttcatt aatgaatcca tttggaagca gtgaaaaagg caactcagaa 2220agctaagaag aaatagatag aggaatactc agagctatct gagtattttc tttagtttgt 2280tagctctttg gagctttgaa actggaaaga cccagggagt gatgtggaga aagagactga 2340gcttgtaaga cacaggagca gtgagctaag ggagatggag tagtggggac aaattctggc 2400acattctgtc tacactctgg gtagatagag gagggaggat ggagcaccca tggtgggggt 2460atgttggtga cagcattttc ccaccagcca gtgtaacaag tggctgattt gggggaaaga 2520tggcataaac aaatgagaga atgtgtttac tatttgatgt agatgggtta tttgcttcat 2580ttttcaaatc agtgtatata atcaagaata ttcagcatgt ttgaatagac tgtcagagct 2640ggaactcttt cattaacatc tctggcacct ttagttttag ccctgaacat tttatcttaa 2700aattaaacat taccaaatgc cttagtttat ttcatttatt aaatttatat tcttatttgt 2760tatttatatc agcttccaat cagaagacta tacaacctcc taggaataac tgaagtgaag 2820tgatggactc cgatttggag agtagtaaga cgtgaaagga atacacttgt gtttaagcac 2880catggccttg atgattcact gttggggaga agaaacaaga aaagtaactg gttgtcacct 2940atgagaccct tacgtgattg ttagttaagt ttttattcaa agcagctgta atttagttaa 3000taaaataatt atgatctatg ttgtttgccc aattgagatc cagttttttg ttgttatttt 3060taatcaatta ggggcaatag tagaatggac aatttccaag aatgatgcct ttcaggtcct 3120agggcctctg gcctctaggt aaccagttta aattggttca gggtgataac tacttagcac 3180tgcccctggt gattacccca gagatatcta tgaaaaccag tggcttccat caaacctttg 3240ccaactcagg ttcacagcag ctttgggcag ttatggcagt atggcattag ctgagaggtg 3300tctgccactt ctgggtcaat ggaataataa attaagtaca ggcaggaatt tggttgggag 3360catcttgtat gatctccgta tgatgtgata ttgatggaga tagtggtcct cattcttggg 3420ggttgccatt cccacattcc cccttcaaca aacagtgtaa caggtccttc ccagatttag 3480ggtactttta ttgatggata tgttttcctt ttattcacat aaccccttga aaccctgtct 3540tgtcctcctg ttacttgctt ctgctgtaca agatgtagca ccttttctcc tctttgaacg 3600tggtctagtg acacggtagc accagttgca ggaaggagcc agacttgttc tcagagcact 3660gtgttcacac ttttcagcaa aaatagctat ggttgtaaca tatgtattcc cttcctctga 3720tttgaaggca aaaatctaca gtgtttcttc acttcttttc tgatctgggg catgaaaaaa 3780gcaagattga aatttgaact atgagtctcc tgcatggcaa caaaatgtgt gtcaccatca 3840ggccaacagg ccagcccttg aatggggatt tattactgtt gtatctatgt tgcatgataa 3900acattcatca ccttcctcct gtagtcctgc ctcgtactcc ccttccccta tgattgaaaa 3960gtaaacaaaa cccacatttc ctatcctggt tagaagaaaa taaatgttct gacagttgtg 4020atcgcctgga gtacttttag acttttagca ttcgtttttt acctgtttgt ggatgtgtgt 4080ttgtatgtgc atacgtatga gataggcaca tgcatcttct gtatggacaa aggtggggta 4140cctacaggag agcaaaggtt aattttgtgc ttttagtaaa aacatttaaa tacaaagttc 4200tttattgggc ggaattatat ttgatgcaaa tatttgatca cttaaaactt ttaaaacttc 4260taggtaattt gccacgcttt ttgactgctc accaataccc tgtaaaaata cgtaattctt 4320cctgtttgtg taataagata ttcatatttg tagttgcatt aataatagtt atttcttagt 4380ccatcagatg ttcccgtgtg cctcttttat gccaaattga ttgtcatatt tcatgttggg 4440accaagtagt ttgcccatgg caaacctaaa tttatgacct gctgaggcct ctcagaaaac 4500tgagcatact agcaagacag ctcttcttga aaaaaaaaat atgtatacac aaatatatac 4560gtatatctat atatacgtat gtatatacac acatgtatat tcttccttga ttgtgtagct 4620gtccaaaata ataacatata tagagggagc tgtattcctt tatacaaatc tgatggctcc 4680tgcagcactt tttccttctg aaaatattta cattttgcta acctagtttg ttactttaaa 4740aatcagtttt gatgaaagga gggaaaagca gatggacttg aaaaagatcc aagctcctat 4800tagaaaaggt atgaaaatct ttatagtaaa attttttata aactaaagtt gtacctttta 4860atatgtagta aactctcatt tatttggggt tcgctcttgg atctcatcca tccattgtgt 4920tctctttaat gctgcctgcc ttttgaggca ttcactgccc tagacaatgc caccagagat 4980agtgggggaa atgccagatg aaaccaactc ttgctctcac tagttgtcag cttctctgga 5040taagtgacca cagaagcagg agtcctcctg cttgggcatc attgggccag ttccttctct 5100ttaaatcaga tttgtaatgg ctcccaaatt ccatcacatc acatttaaat tgcagacagt 5160gttttgcaca tcatgtatct gttttgtccc ataatatgct ttttactccc tgatcccagt 5220ttctgctgtt gactcttcca ttcagtttta ttaattgtgt gttctcacag tgacaccatt 5280tgtccttttc tgcaacaacc tttccagcta cttttgccaa attctatttg tcttctcctt 5340caaaacattc tcctttgcag ttcctcttca tctgtgtagc tgctcttttg tctcttaact 5400taccattcct atagtacttt atgcatctct gcttagttct attagttttt tggccttgct 5460cttctccttg attttaaaat tccttctata gctagagctt ttctttcttt cattctctct 5520tcctgcagtg ttttgcatac atcagaagct aggtacataa gttaaatgat tgagagttgg 5580ctgtatttag atttatcact ttttaatagg gtgagcttga gagttttctt tctttctggt 5640tttttttttt tttttttttt ttttttgact aatttcacat gctctaaaaa ccttcaaagg 5700tgattatttt tctcctggaa actccaggtc cattctgttt aaatccctaa gaatgtcaga 5760attaaaataa cagggctatc ccgtaattgg aaatatttct tttttcagga tgctatagtc 5820aatttagtaa gtggccacca aattgttatt tgcactaaca aagctcaaaa cacgataagt 5880ttactcctcc atctcagtaa taaaaattaa gctgtaatca accttctagg tttctcttgt 5940cttaaaatgg gtattcaaaa atggggatct gtggtgtatg tatggaaaca catactcctt 6000aatttacctg ttgttggaaa ctggagaaat gattgtcggg caaccgttta ttttttattg 6060tattttattt ggttgaggga tttttttata aacagtttta cttgtgtcat attttaaaat 6120tactaactgc catcacctgc tggggtcctt tgttaggtca ttttcagtga ctaataggga 6180taatccaggt aactttgaag agatgagcag ggagtgacca ggcagttttc ttgcctttag 6240ctttgacagt tcttaattaa gatcattgaa gaccagcttt ctcataaatt tctctttttg 6300aaaaaagaaa gcatttgtac taagctcctc tgtaagacaa catcttaaat cttaaaagtg 6360ttgttatcat gactggtgag agaagaaaac gttttgtttt tattaaatgg agcattattt 6420acaaaaagcc attgttgaga attagatccc acatcgtata aatatctatt aaccattcta 6480aataaagaga actccagtgt tgctatgtgc aagatcctct cttggagctt ttttgcatag 6540caattaaagg tgtgctattt gtcagtagcc atttttttgc agtgatttga agaccaaagt 6600tgttttacag ctgtgttacc gttaaaggtt ttttttttta tatgtattaa atcaatttat 6660cactgtttaa agctttgaat atctgcaatc tttgccaagg tactttttta tttaaaaaaa 6720aacataactt tgtaaatatt accctgtaat attatatata cttaataaaa cattttaagc 6780tataaaaaaa 6790892538DNAHomo sapiens 89ttttaaaact tctaggtaat ttgccacgct ttttgactgc tcaccaatac cctgtaaaaa 60tacgtaattc ttcctgtttg tgtaataaga tattcatatt tgtagttgca ttaataatag 120ttatttctta gtccatcaga tgttcccgtg tgcctctttt atgccaaatt gattgtcata 180tttcatgttg ggaccaagta gtttgcccat ggcaaaccta aatttatgac ctgctgaggc 240ctctcagaaa actgagcata ctagcaagac agctcttctt gaaaaaaaaa atatgtatac 300acaaatatat acgtatatct atatatacgt atgtatatac acacatgtat attcttcctt 360gattgtgtag ctgtccaaaa taataacata tatagaggga gctgtattcc tttatacaaa 420tctgatggct cctgcagcac tttttccttc tgaaaatatt tacattttgc taacctagtt 480tgttacttta aaaatcagtt ttgatgaaag gagggaaaag cagatggact tgaaaaagat 540ccaagctcct attagaaaag gtatgaaaat ctttatagta aaattcttta taaactaaag 600ttgtaccttt taatatgtag taaactctca tttatttggg gttcgctctt ggatctcatc 660catccattgt gttctcttta atgctgcctg ccttttgagg cattcactgc cctagacaat 720gccaccagag atagtggggg aaatgccaga tgaaaccaac tcttgctctc actagttgtc 780agcttctctg gataagtgac cacagaagca ggagtcctcc tgcttgggca tcattgggcc 840agttccttct ctttaaatca gatttgtaat ggctcccaaa ttccatcaca tcacatttaa 900attgcagaca gtgttttgca catcatgtat ctgttttgtc ccataatatg ctttttactc 960cctgatccca gtttctgctg ttgactcttc cattcagttt tatttattgt gtgttctcac 1020agtgacacca tttgtccttt tctgcaacaa cctttccagc tacttttgcc aaattctatt 1080tgtcttctcc ttcaaaacat tctcctttgc agttcctctt catctgtgta gctgctcttt 1140tgtctcttaa cttaccattc ctatagtact ttatgcatct ctgcttagtt ctattagttt 1200tttggccttg ctcttctcct tgattttaaa attccttcta tagctagagc ttttctttct 1260ttcattctct cttcctgcag tgttttgcat acatcagaag ctaggtacat aagttaaatg 1320attgagagtt ggctgtattt agatttatca ctttttaata gggtgagctt gagagttttc 1380tttctttctg tttttttttt tttttttttt tttttttttt ttgactaatt tcacatgctc 1440taaaaacctt caaaggtgat tatttttctc ctggaaactc caggtccatt ctgtttaaat 1500ccctaagaat gtcagaatta aaataacagg gctatcccgt aattggaaat atttcttttt 1560tcaggatgct atagtcaatt tagtaagtga ccaccaaatt gttatttgca ctaacaaagc 1620tcaaaacacg ataagtttac tccttcatct cagtaataaa aattaagctg taatcaacct 1680tctaggtttc tcttgtctta aaatgggtat tcaaaaatgg ggatctgtgg tgtatgtatg 1740gaaacacata ctccttaatt tacctgttgt tggaaactgg agaaatgatt gtcgggcaac 1800cgtttatttt ttattgtatt ttatttggtt gagggatttt tttataaaca gttttacttg 1860tgtcatattt taaaattact aactgccatc acctgctggg gtcctttgtt aggtcatttt 1920cagtgactaa tagggataat ccaggtaact ttgaagagat gagcagtgag tgaccaggca 1980gtttttctgc ctttagcttt gacagttctt aattaagatc attgaagacc agctttctca 2040taaatttctc tttttgaaaa aaagaaagca tttgtactaa gctcctctgt aagacaacat 2100cttaaatctt aaaagtgttg ttatcatgac tggtgagaga agaaaacatt ttgtttttat 2160taaatggagc attatttaca aaaagccatt gttgagaatt agatcccaca tcgtataaat 2220atctattaac cattctaaat aaagagaact ccagtgttgc tatgtgcaag atcctctctt 2280ggagcttttt tgcatagcaa ttaaaggtgt gctatttgtc agtagccatt tttttgcagt 2340gatttgaaga ccaaagttgt tttacagctg tgttaccgtt aaaggttttt ttttttatat 2400gtattaaatc aatttatcac tgtttaaagc tttgaatatc tgcaatcttt gccaaggtac 2460ttttttattt aaaaaaaaac ataactttgt aaatattacc ctgtaatatt atatatactt 2520aataaaacat tttaagct 253890550DNAHomo sapiensmodified_base(1)..(550)n = g, a, c or t 90ccatatcatg taccaaaagt tgctgaagtt tctcttctag ctggtaaagt aggagtttgc 60atgacttcac actttttttg cgtagtttct tctgttgtat gatggcgtga gtgtgtgtct 120tgggtaccgc tgtgtactac tgtgtgccta gattccatgc actctcgttg tgtttgaagt 180aaatattgga gaccggaggg taacaggttg gcctgttgat tacagctagt aatcgctgtg 240tcttgttccg ccccctccct gacaccccag cttcccagga tgtggaaagc ctggatctca

300gctccttgcc ccatatccct tctgtaattt gtacctaaag agtgtgatta tcctaattca 360agagtcacta aaactcatca cattatcatt gcatatcagc aaagggtaaa gtcctagcac 420caattgcttc acataccagc atgttccatt tccaatttag aattagccac ataataaaat 480cttagaatct tccttgagaa agagctgcct gagatgtagt tttgntatat ggntccccac 540cgaccatttt 550911209DNAHomo sapiens 91ccatatcatg taccaaaagt tgctgaagtt tctcttctag ctggtaaagt aggagtttgc 60atgacttcac actttttttg cgtagtttct tctgttgtat gatggcgtga gtgtgtgtct 120tgggtaccgc tgtgtactac tgtgtgccta gattccatgc actctcgttg tgtttgaagt 180aaatattgga gaccggaggg taacaggttg gcctgttgat tacagctagt aatcgctgtg 240tcttgttccg ccccctccct gacaccccag cttcccagga tgtggaaagc ctggatctca 300gctccttgcc ccatatccct tctgtaattt gtacctaaag agtgtgatta tcctaattca 360agagtcacta aaactcatca cattatcatt gcatatcagc aaagggtaaa gtcctagcac 420caattgcttc acataccagc atgttccatt tccaatttag aattagccac ataataaaat 480cttagaatct tccttgagaa agagctgcct gagatgtagt tttgttatat ggttccccac 540cgaccatttt tgtgcttttt tcttgttttg ttttgttttg actgcactgt gagttttgta 600gtgtcctctt cttgccaaaa caaacgcgag atgaactgga cttatgtaga caaatcgtga 660tgccagtgta tccttccttt cttcagttcc agcaataatg aatggtcaac ttttttaaaa 720tctagatctc tctcattcat ttcaatgtat ttttacttta agatgaacca aaattattag 780acttatttaa gatgtacagg catcagaaaa aagaagcaca taatgctttt ggtgcgatgg 840cactcactgt gaacatgtgt aaccacatat taatatgcaa tattgtttcc aatactttct 900aatacagttt tttataatgt tgtgtgtggt gattgttcag gtcgaatctg ttgtatccag 960tacagcttta ggtcttcagc tgcccttctg gcgagtacat gcacaggatt gtaaatgaga 1020aatgcagtca tatttccagt ctgcctctat gatgatgtta aattattgct gtttagctgt 1080gaacaaggga tgtaccactg gaggaataga gtatcctttt gtacacattt tgaaatgctt 1140cttctgtagt gatagaacaa ataaatgcaa cgaatactct gtcaaaaaaa aaaaaaaaaa 1200aaaaaaaaa 1209921661DNAHomo sapiens 92ccatatcatg taccaaaagt tgctgaagtt tctcttctag ctggtaaagt aggagtttgc 60atgacttcac actttttttg cgtagtttct tctgttgtat gatggcgtga gtgtgtgtct 120tgggtaccgc tgtgtactac tgtgtgccta gattccatgc actctcgttg tgtttgaagt 180aaatattgga gaccggaggg taacaggttg gcctgttgat tacagctagt aatcgctgtg 240tcttgttccg ccccctccct gacaccccag cttcccagga tgtggaaagc ctggatctca 300gctccttgcc ccatatccct tctgtaattt gtacctaaag agtgtgatta tcctaattca 360agagtcacta aaactcatca cattatcatt gcatatcagc aaagggtaaa gtcctagcac 420caattgcttc acataccagc atgttccatt tccaatttag aattagccac ataataaaat 480cttagaatct tccttgagaa agagctgcct gagatgtagt tttgttatat ggttccccac 540cgaccatttt tgtgcttttt tcttgttttg ttttgttttg actgcactgt gagttttgta 600gtgtcctctt cttgccaaaa caaacgcgag atgaactgga cttatgtaga caaatcgtga 660tgccagtgta tccttccttt cttcagttcc agcaataatg aatggtcaac ttttttaaaa 720tctagatctc tctcattcat ttcaatgtat ttttacttta agatgaacca aaattattag 780acttatttaa gatgtacagg catcagaaaa aagaagcaca taatgctttt ggtgcgatgg 840cactcactgt gaacatgtgt aaccacatat taatatgcaa tattgtttcc aatactttct 900aatacagttt tttataatgt tgtgtgtggt gattgttcag gtcgaatctg ttgtatccag 960tacagcttta ggtcttcagc tgcccttctg gcgagtacat gcacaggatt gtaaatgaga 1020aatgcagtca tatttccagt ctgcctctat gatgatgtta aattattgct gtttagctgt 1080gaacaaggga tgtaccactg gaggaataga gtatcctttt gtacacattt tgaaatgctt 1140cttctgtagt gatagaacaa ataaatgcaa cgaatactct gtctgcccta tcccgtgaag 1200tccacactgg cgtaagagaa ggcccagcag agcaggaatc tgcctagact ttctcccaat 1260gagatcccaa tatgagaggg agaagagatg ggcctcagga cagctgcaat accacttggg 1320aacacatgtg gtgtcttgat gtggccagcg cagcagttca gcacaacgta cctcccatct 1380acaacagtgc tggacgtggg aattctaagt cccagtcttg agggtgggtg gagatggagg 1440gcaacaagag atacatttcc agttctccac tgcagcatgc ttcagtcatt ctgtgagtgg 1500ccgggcccag ggccctcaca atttcactac cttgtcttta catagtcata agaattatcc 1560tcaacatagc cttttgacgc ttgtaaatct tgagtattca atttaaccct tttctgaatc 1620tccctggaaa caggtgcctg cctggattgc cttcttcttc c 1661936400DNAHomo sapiens 93gaattccggc gtcgcggacg catcccagtc tgggcgggac gctcggccgc ggcgaggcgg 60gcaagcctgg cagggcagag ggagccccgg ctccgaggtt gctcttcgcc cccgaggatc 120agtcttggcc ccaaagcgcg acgcacaaat ccacataacc tgaggaccat ggatgctgat 180gagggtcaag acatgtccca agtttcaggg aaggaaagcc cccctgtaag cgatactcca 240gatgagggcg atgagcccat gccgatcccc gaggacctct ccaccacctc gggaggacag 300caaagctcca agagtgacag agtcgtggcc agtaatgtta aagtagagac tcagagtgat 360gaagagaatg ggcgtgcctg tgaaatgaat ggggaagaat gtgcggagga tttacgaatg 420cttgatgcct cgggagagaa aatgaatggc tcccacaggg accaaggcag ctcggctttg 480tcgggagttg gaggcattcg acttcctaac ggaaaactaa agtgtgatat ctgtgggatc 540atttgcatcg ggcccaatgt gctcatggtt cacaaaagaa gccacactgg agaacggccc 600ttccagtgca atcagtgcgg ggcctcattc acccagaagg gcaacctgct ccggcacatc 660aagctgcatt ccggggagaa gcccttcaaa tgccacctct gcaactacgc ctgccgccgg 720agggacgccc tcactggcca cctgaggacg cactccgttg gtaaacctca caaatgtgga 780tattgtggcc gaagctataa acagcgaagc tctttagagg aacataaaga gcgctgccac 840aactacttgg aaagcatggg ccttccgggc acactgtacc cagtcattaa agaagaaact 900aatcacagtg aaatggcaga agacctgtgc aagataggat cagagagatc tctcgtgctg 960gacagactag caagtaacgt cgccaaacgt aagagctcta tgcctcagaa atttcttggg 1020gacaagggcc tgtccgacac gccctacgac agcagcgcca gctacgagaa ggagaacgaa 1080atgatgaagt cccacgtgat ggaccaagcc atcaacaacg ccatcaacta cctgggggcc 1140gagtccctgc gcccgctggt gcagacgccc ccgggcggtt ccgaggtggt cccggtcatc 1200agcccgatgt accagctgca caagccgctc gcggagggca ccccgcgctc caaccactcg 1260gcccaggaca gcgccgtgga gaacctgctg ctgctctcca aggccaagtt ggtgccctcg 1320gagcgcgagg cgtccccgag caacagctgt caagactcca cggacaccga gagcaacaac 1380gaggagcagc gcagcggtct catctacctg accaaccaca tcgccccgca cgcgcgcaac 1440ggcttgtcgc tcaaggagga gcaccgcgcc tacgacctgc tgcgcgccgc ctccgagaac 1500tcgcaggacg cgctccgcgt ggtcagcacc agcggggagc agatgaaggt gtacaagtgc 1560gaacactgcc gggtgctctt cctggatcac gtcatgtaca ccatccacat gggctgccac 1620ggcttccgtg atccttttga gtgcaacatg tgcggctacc acagccagga ccggtacgag 1680ttctcgtcgc acataacgcg aggggagcac cgcttccaca tgagctaaag ccctcccgcg 1740cccccacccc agaccccgag ccaccccagg aaaagcacaa ggactgccgc cttctcgctc 1800ccgccagcag catagactgg actggaccag acaatgttgt gtttggattt gtaactgttt 1860tttgtttttt gtttgagttg gttgattggg gtttgatttg cttttgaaaa gatttttatt 1920tttagaggca gggctgcatt gggagcatcc agaactgcta ccttcctaga tgtttcccca 1980gaccgctggc tgagattccc tcacctgtcg cttcctagaa tccccttctc caaacgatta 2040gtctaaattt tcagagagaa atagataaaa cacgccacag cctgggaagg agcgtgctct 2100accctgtgct aagcacgggg ttcgcgcacc aggtgtcttt ttccagtccc cagaagcaga 2160gagcacagcc cctgctgtgt gggtctgcag gtgagcagac aggacaggtg tgccgccacc 2220caagtgccaa gacacagcag ggccaacaac ctgtgcccag gccagcttcg agctacatgc 2280atctagggcg gagaggctgc acttgtgaga gaaaatactt atttcaagtc atattctgcg 2340gtaggaaaat gattgggttg gggaaagtcg gtgtctgtca gactgccctg ggtggaggga 2400gacgccgggt tagagccttt gggatcgtcc tggattcact ggcttggggg aggctgttca 2460gatggcctga gcctcccgag gcttgctgcc ccgtaggagg agactgtctt cccgtgggca 2520tatctgggga gccctgttcc ccgctttttc actcccatac ctttaatggc ccccaaaatc 2580tgtcactaca atttaaacac cagtcccgaa atttggatct tctttctttt tgaatctctc 2640aaacggcaac attcctcaga aaccaaagct ttatttcaaa tctcttcctt ccctggctgg 2700ttccatctag taccagaggc ctcttttcct gaagaaatcc aatcctagcc ctcattttaa 2760ttatgtacat ctgcttgtag ccacaagcct gaatttctca gtgttggtaa gtttctttac 2820ctaccctcac tatatattat tctcgtttta aaacccataa aggagtgatt tagaacagtc 2880attaattttc caactcaatg aaaatatgtg aagcccagca tctctgttgc taacacacag 2940agctcacctg tttgaaacca agctttcaaa catgttgaag ctctttactg taaaggcaag 3000ccagcatgtg tgtccacaca tacataggat ggctggctct gcacctgtag gatattggaa 3060tgcacagggc aattgaggga ctgagccaga ccttcggaga gtaatgccac cagatcccct 3120aggaaagagg aggcaaatgg cactgcaggt gagaaccccg cccatccgtg ctatgacatg 3180gaggcactga agcccgagga aggtgtgtgg agattctaat cccaacaagc aagggtctcc 3240ttcaagatta atgctatcaa tcattaaggt cattactctc aaccacctag gcaatgaaga 3300atataccatt tcaaatattt acagtacttg tcttcaccaa cactgtccca aggtgaaatg 3360aagcaacaga gaggaaattg tacataagta cctcagcatt taatccaaac aggggttctt 3420agtctcagca ctatgacatt ttgggctgac tacttatttg ttaggcggga gctctcctgt 3480gcattgtagg ataattagca gtatccctgg tggctaccca atagacgcca gtagcacccc 3540gaattgacaa cccaaactct ccagacatca ccaactgtcc cctgcgagga gaaatcactc 3600ctgggggaga accactgacc caaatgaatt ctaaaccaat caaatgtctg ggaagccctc 3660caagaaaaaa aatagaaaag cacttgaaga atattcccaa tattcccggt cagcagtatc 3720aaggctgact tgtgttcatg tggagtcatt ataaattcta taaatcaatt attccccttc 3780ggtcttcaaa aatatatttc ctcataaaca tttgagtttt gttgaaaaga tggagtttac 3840aaagatacca ttcttgagtc atggatttct ctgctcacag aagggtgtgg catttggaaa 3900cgggaataaa caaaattgct gcaccaatgc actgagtgaa ggaagagaga cagaggatca 3960agggctttag acagcactcc ttcaatatgc aatcacagag aaagatgcgc cttatccaag 4020ttaatatctc taaggtgaga gccttcttag agtcagtttg ttgcaaattt cacctactct 4080gttcttttcc atccatcccc ctgagtcagt tggttgaagg gagttatttt ttcaagtgga 4140attcaaacaa agctcaaacc agaactgtaa atagtgattg caggaattct tttctaaact 4200gctttgccct ttcctctcac tgccttttat agccaatata aatgtctctt tgcacacctt 4260ttgttgtggt tttatattgt aacaccattt ttctttgaaa ctattgtatt taaagtaagg 4320tttcatatta tgtcagcaag taattaactt atgtttaaaa ggtggccata tcatgtacca 4380aaagttgctg aagtttctct tctagctggt aaagtaggag tttgcatgac ttcacacttt 4440ttttgcgtag tttcttctgt tgtatgatgg cgtgagtgtg tgtcttgggt accgctgtgt 4500actactgtgt gcctagattc catgcactct cgttgtgttt gaagtaaata ttggagaccg 4560gagggtaaca ggttggcctg ttgattacag ctagtaatcg ctgtgtcttg ttccgccccc 4620tccctgacac cccagcttcc caggatgtgg aaagcctgga tctcagctcc ttgccccata 4680tcccttctgt aatttgtacc taaagagtgt gattatccta attcaagagt cactaaaact 4740catcacatta tcattgcata tcagcaaagg gtaaagtcct agcaccaatt gcttcacata 4800ccagcatgtt ccatttccaa tttagaatta gccacataat aaaatcttag aatcttcctt 4860gagaaagagc tgcctgagat gtagttttgt tatatggttc cccaccgacc atttttgtgc 4920ttttttcttg ttttgttttg ttttgactgc actgtgagtt ttgtagtgtc ctcttcttgc 4980caaaacaaac gcgagatgaa ctggacttat gtagacaaat cgtgatgcca gtgtatcctt 5040cctttcttca gttccagcaa taatgaatgg tcaacttttt taaaatctag atcattggag 5100accggagggt aacaggttgg cctgttgatt acagctagta atcgctgtgt cttgttccgc 5160cccctccctg acaccccagc ttcccaggat gtggaaagcc tggatctcag ctccttgccc 5220catatccctt ctgtaatttg tacctaaaga gtgtgattat cctaattgat ctctctcatt 5280catttcaatg tatttttact ttaagatgaa ccaaaattat tagacttatt taagatgtac 5340aggcatcaga aaaaagaagc acataatgct tttggtgcga tggcactcac tgtgaacatg 5400tgtaaccaca tattaatatg caatattgtt tccaatactt tctaatacag ttttttataa 5460tgttgtgtgt ggtgattgtt caggtcgaat ctgttgtatc cagtacagct ttaggtcttc 5520agctgccctt ctggcgagta catgcacagg attgtaaatg agaaatgcag tcatatttcc 5580agtctgcctc tatgatgatg ttaaattatt gctgtttagc tgtgaacaag ggatgtacca 5640ctggaggaat agagtatcct tttgtacaca ttttgaaatg cttcttctgt agtgatagaa 5700caaataaatg caacgaatac tctgtctgcc ctatcccgtg aagtccacac tggcgtaaga 5760gaaggcccag cagagcagga atctgcctag actttctccc aatgagatcc caatatgaga 5820gggagaagag atgggcctca ggacagctgc aataccactt gggaacacat gtggtgtctt 5880gatgtggcca gcgcacgagt tcagcacaac gtacctccca tctacaacag tgctggacgt 5940gggaattcta agtcccagtc ttgagggtgg gtggagatgg agggcaacaa gagatacatt 6000tccagttctc cactgcagca tgcttcagtc attctgtgag tggccgggcc cagggccctc 6060acaatttcac taccttgtct tttacatagt cataagaatt atcctcaaca tagccttttg 6120acgctgtaaa tcttgagtat tcatttaccc ttttctgatc tcctggaaac agctgcctgc 6180ctgcattgca cttctcttcc cgaggagtgg ggtaaattta aaagtcaagt tatagtttgg 6240atgttagtat agaattttga aattgggaat taaaaatcag gactggggac tgggagacca 6300aaaatttctg atcccatttc tgatggatgt gtcacacctt ttctgtcaaa ataaaatgtc 6360ttggaggtta tgactccttg gtgaaaaaaa aaaaaaaaaa 6400941364DNAHomo sapiens 94aatcaaaggt gggaggattt tccctaaact gacttagcag gactcttgtt acaattggac 60taggcaggct gaagacagga tgcaaggaca aagcttgttg aaaagaggcc tcagaggagc 120tcatctaaaa tttggtcaag gggagggtct ttcttggtcc ctcctcttgt tcaagggaaa 180aagagacatt cttctttcct ttgaacaata taagtcaatt tctcattggt ggcctttttt 240tcattaagga caagctgagc cccctgctga aacttggtag cagggcagcc agttgagaag 300atttctagat gtcaaggcat ctttagatga tggggtgagg actgcagtgg ccatcccaga 360tcatggattt tctggtttgc agtttgaatg tccttggtga tggcatagac atcagtgtca 420cagtcatggt tatttttccg agcagagtgt agaagtgtcc aacttcatct tgaagggctt 480ctttagrcag tcacaataaa gatctgggaa gttaggtttt agttctcagt gatgccaaat 540caggacagtg ggagaaaaat taaaaacctc agtttggaga gtggtagcca gatagtaaag 600ggaactagaa gaactgagaa tttggtaagg actgacaagc tgtgcatgat gacaggatcc 660cgttcaattt acaagtagat aacaaaacct gaaagacaag tacaggacca gaataataac 720ccataagaag gtgctatagt ttttataaaa tatctttcta cagtcatccc ccttttttga 780tccaaattaa ccaaagtaag attattcttg tttacaaaat aagtcttgtc tcattatatt 840tgacttactt atttgcataa ttgcagcaag aatggcaact gaccaggtag gcttatttaa 900gtttgcattg ctggaacttt ttacaagtaa tctcagatta tgctttcaag agttcttgaa 960gctataaagc caagtcaagc accaccaggc cttatctgca atgcctagag attccagatg 1020ggttcttctc ttcttgaggt cctaaaaaca tcctgagttt ctttggcctg ccagaaagtc 1080accttcctga ctcacctgta aggctgggaa ctccataatc caggtaccag gcagactttc 1140cgggagggct tcatatgcat tggctccata aagttaacct tagttcctca aaactgtctg 1200ttcatatgtg attttatgtc ttattctcag ttggaaatgc agaaatcacc tgtcttctgc 1260gtcgatcagg ctgggagctg cagaccggag ctgttcctat tcggccatct tggaatggac 1320ccccatgtct tattctcaaa taaaacattt tggtcaaaaa aaaa 136495411DNAHomo sapiens 95cctaatatga cattatttca aagcttatta taaaggaaca gtaatcaaac tagtgcaatt 60ttggcataaa gttagaaaaa cagatcaatg aagcagaaga gagagtccag aaacagaact 120gcacatttat gtgttggtga atgccaggga ttcagcttag gtctgattgc tcaccacaca 180gaaagccaat cactgagaca acaagtactg ccaggaagaa aggctttatt gctggtgatg 240ccagccagga tatgggagac aagtctaaaa tctgtctctg taaccaataa agttaggagt 300ttatgtagga gttgctcaac aggcagtagg tagttgaatc agggttctgg caccttgctg 360ttaggatgca gcgatctgga aatcttcagc tttctgatac tatctgggag g 411961632DNAHomo sapiensmodified_base(1)..(1632)n = g, a, c or t 96gtgccagtta taaaatatct tatattttct tataatgcct ccatagtttt attatatatt 60cactcaatac atcatttttc tatgtggtat gaggtaagaa tctaactttt actgatttta 120tctttatgca ttttttttaa tttaaaatgt ggggtaggga tctaactttt ttcaaacaca 180tataaatgtg cactactatt tatttaaata gtctgttctt ttccctttta ttattatgct 240atcttatctc acttgaattc aacctaagcc tgttttagac tccaactaat actacagatc 300ttcctaccac tcttcccctt gcataattaa cttcaagcac attagcctcc gggttcctca 360agcacaccaa atttagtccc agctcaggaa ctctgtactt tctatttcca tgctttaatg 420ttctttctct tgatatcctt gttttcttat ttccttcatt tgcatttctg ctttgatttt 480ctgtttctgg tccatggaca tttttatttt ctttttatag aacaaacaca gcttttttac 540attttgtatt tttcctgcca ttgctatgtg cttggagctc agggagggcc tcaaaggatg 600aaattggagt atggtgtgat cagaagtttg aacttctttg tattgtatga tcatcccttt 660accttaatac tcacatgaaa tgctatctat ggcttcttac attccacttc ttcttaatca 720atttctttct tcatgaactt aaacgttccc atcatttttg atagggtctg tgagtttatt 780tgtccaaaaa gcccaaaagc agaatttaag attgatagca tagctttgtg ctcaacagtt 840gtaatatttt tttccatggt cgtctagctt cttctgtttt ctttgagaaa tctatgtaat 900tgttgtttct ttataattaa tctatctttt ctctccagtt gcctttaaga ctttttatat 960ttgatattat gcaatttcac tatgatttgt ctaaatgtgc atttattttg ctagagatta 1020ataactcaag tctaaggtat catgtctttt ttcaagttta gaaaatattt ggctattatc 1080tctttattat catgctgcta cagcattatt tgaattcttt ccctcagaaa tttatattag 1140aagtttgcta gacttcattc tagtctcatg actcttaatt agtcttgcaa aattttcatt 1200tcattatcac ttattgcatt ttaggtaatt tcttaatctc tgtcttccag tttactggtt 1260ctttcttcag ctgtatctat tttattgttt aacctattta ttttctattt caatgattac 1320attttttgag attttattag caaaatggtt aaaagcatgg tttcagagga ttgtctggat 1380ttacattttg cctccattat ttactagctc tccagttttg gttaaattaa ttaacctttt 1440gtgcttcttg gtgtgtaaaa ttgaagtaac aattgtatat aaatatagtg ttttttggta 1500attaattaaa attatttgca taaaatgctt aagacagggc ctgaaatgac attgagtcct 1560caaaaaataa attattatta tcattccttc aaaaaaaaaa aaaaaaaaaa aaaaaaaana 1620aaaaaaanac ca 1632972378DNAHomo sapiens 97tctaaaagct gcggaattcc tcgagcactg ttggcctttg gtagatgccc ctctgggaga 60gatccccagg ggtgacagcc atgggccctg gaagggcctg ggctagggac agggaccaga 120gccagtccag ggagaggaca gagccaatgg actggggtgt actgtaacag ccctgctggc 180gagagggacc agggcaccgt cctccaggga gcccatgctg caagtcgggc cagaggtgcc 240cctgaacctg aaggccaatg agacccaaga caggccaagt gggttgtgag acccctgagg 300agctgggccc tggtcccagg cagcgctggc ccctgctgct gctgggtctg gccatggtcg 360cccatggcct gctgcgccca atggttgcac cgcaaagcgg ggacccagac cctggagcct 420cagttggaag cagccgatcc agcctgcgga gcctgtgggg caggtaaggg gcaagagata 480tgtgggggtc ctgcagcaga gctgggaaag ggtgaccaag gggggacaag ccagaggagt 540gaggaggaag gttaacccct aagaggggcc tgggctgaca ctggctttag taatgggttg 600atattttgtc catcacagat ttgtttgatt tactgttttt aatatcatat tacgatatta 660tttttcttca tttctgagtt ttctggcgcc acttaaattt tcaccagggt cagtgcctca 720atcacctagt cctagtcctc tgggtaggga aggaacagag gcagggacag gacatccaca 780gggggtggtg gccactgtcc ccacagggtg cccaggcctg ttcctccccc tcctcctctc 840tgcccatgtg cctcctgccc agtgagggca ggggccactc cctggagaag gcagcaaggg 900cttggtttgg tctcccccaa ggctgtctgt tcaccaactt gcacataaat acttactggg 960gccaggctca aggacacagg gagggtggga tgaaccgagg ggagctgtcc agtcattgga 1020acaggcccac ggcccatgtt tgcagcaatg aagggagagg gcatctccct ctgggatgat 1080gcccaggctg gtctcacaga tcgaggggca ctggctggtg atgggtgccc ccaaaagaca 1140gagcagtgtc agaggagagg agagcacagg atgaggctgg gagctcctgg gtgactggga 1200aggggaggca agaagaccat agggtccgtg caccattccc agtccaggac gagtccttgg 1260atggatttag gtagattgat tatcagagtc agatttgtgt ttttggaaaa atcagcaccg 1320gattggaggc tgatgcgacg cccaattaga ggagggagga gagggggtga tggccaagtc 1380cagggtaggt ggggatcctg gaggaagccg tgccttgggg atggggagga cactcagatt 1440cagagcaccc aggggcccag tttcctatga aatgggagca tgaggttgaa gtgagggctg 1500agcagagggg agcagacacg ctcggggact gtctatgggc attgaaaatg tataaccatt 1560ttagcaacag gcggcgagtc aaaacccaag gtgtgtttat ctaaactggg caattcctct 1620tctaggaatt tatcctaagg gttggttggg ggaataatca aagctgaaac caaatcttta 1680taacaagggt ggttaggtca gcattcttag tgatgggaga aaactggaaa aaatccaaat

1740atctaccaga aagggtgtga aaaaacacaa ttgtatttgg gggactgttg ttgattttgt 1800tttgaaacag tcttgatctg ttgctcaggc tggagtacag tggcgtggcc acagctcact 1860gcagcctcaa cctccagggc tcaaaagatc ctccagcctc agcctcctga gtagttagga 1920ctacagatgc aggccactac acctggctaa ttttgattag gattatcatt agtttagaga 1980cagagcctcg ctatattgct caggcctgtc tcaaattcct aagctcaagc aatctttctg 2040cctcagtttc ccacgtgctg gaattacagg cgtgagccac tgcacctgac ccaactgtgt 2100ttttaaagta tatatgcatt ttcaaaaacc tgtcagaaaa tatagaaaaa tgtcaatggt 2160gtgtctggct ggctgatggg atttcaccta attttaatgt ggctttataa ttttctggtt 2220ttgtgaagtt gttcacaaaa agagacattt cttctaatat aatttttaat acaacagtaa 2280tgtactcatg tgcattactc tttttgtaat gagtatatta caaaatgtaa tgacttttgt 2340acattactct tttttcttgc caaaaaaaaa aaaaaaaa 237898313DNAHomo sapiensmodified_base(1)..(313)n = g, a, c or t 98ccacaaaata aggtctaatt caataaatta tagtaaatta atgtaatata atattacatg 60ccactaaaaa gaataaggta gctgtatatt tcctggtatg gaaaaaacat attaatatgt 120tataaactat taggttggtg caaaactaat tgtggttttt gccattgaaa tggcattgaa 180ataaaagtgt aaagaaatct ataccagatg tagtaacagt ggtttggttc tgggaggttg 240gattacaggg agcatttgat ttctatgttg ngtatttcta tantgtttga attgtttaga 300atgaatctgt ntt 31399317DNAHomo sapiens 99ccagtatgga atccagaagg accgagtgga taagagcgct gtcggcttca atgaaatgga 60ggccccgacc acagcttata agaagacgac gcccatagaa gccgcttcta gtggtgcccg 120tgggctgaag gcgaaatttg agtccatggc tgaggagaag aggaagcgag aggaagagga 180gaaggcacag caggtggcca ggaggcaaca ggagcgaaag gctgtgacaa agaggagccc 240tgaggctcca cagccagtga tagctatgga agagccagca gtaccggccc cactgcccaa 300gaaaatctcc tcagagg 3171001968DNAHomo sapiens 100aattccgccg ggcgcttaga acagaggctt gcacaggtgg agatgtggaa gtctgtagtg 60ggccatgatg tgtctgtttc cgtggagacc cagggtgatg attgggacac agatcctgac 120tttgtgaatg acatctctga aaaggagcaa cgatggggag ccaagaccat cgaggggtct 180ggacgcacag aacacatcaa catccaccag ctgaggaaca aagtatcaga ggagcatgat 240gttctcagga agaaagagat ggagtcaggg cccaaagcat cccatggcta tggaggtcgg 300tttggagtag aaagagaccg aatggacaag agtgcagtgg gccatgagta tgttgccgag 360gtggagaagc actcttctca gacggatgct gccaaaggct ttgggggcaa gtacggagtt 420gagagggaca gggcagacaa gtcagcagtc ggctttgatt ataaaggaga agtggagaag 480catacatctc agaaagatta ctctcgtggc tttggtggcc ggtacggggt ggagaaggat 540aaatgggaca aagcagctct gggatatgac tacaagggag agacggagaa acacgagtcc 600cagagagatt atgccaaggg ctttggtggc cagtatggaa tccagaagga ccgagtggat 660aagagcgctg tcggcttcaa tgaaatggag gccccgacca cagcttataa gaagacgacg 720cccatagaag ccgcttctag tggtgcccgt gggctgaagg cgaaatttga gtccatggct 780gaggagaaga ggaagcgaga ggaagaggag aaggcacagc aggtggccag gaggcaacag 840gagcgaaagg ctgtgacaaa gaggagccct gaggctccac agccagtgat agctatggaa 900gagccagcag taccggcccc actgcccaag aaaatctcct cagaggcctg gcctccagtt 960gggactcctc catcatcaga gtctgagcct gtgagaacca gcagggaaca cccagtgccc 1020ttgctgccca ttaggcagac tctcccggag gacaatgagg agcccccagc tctgccccct 1080aggactctgg aaggcctcca ggtggaggaa gagccagtgt acgaagcaga gcctgagcct 1140gagcccgagc ctgagcccga gcctgagaat gactatgagg acgttgagga gatggacagg 1200catgagcagg aggatgaacc agagggggac tatgaggagg tgctcgagcc tgaagattct 1260tctttttctt ctgctctggc tggatcatca ggctgcccgg ctggggctgg ggctggggct 1320gtggctctgg ggatctcagc tgtggctcta tatgattacc aaggagaggg aagtgatgag 1380ctttcctttg atccggacga cgtaatcact gacattgaga tggtggacga gggctggtgg 1440cggggacgtt gccatggcca ctttggactc ttccctgcaa attatgtcaa gcttctggag 1500tgactagagc tcactgtcta ctgcaactgt gatttcccat gtccaaagtg gctctgctcc 1560accccctccc tattcctgat gcaaatgtct aaccagatga gtttctggac agacttccct 1620ctcctgcttc attaagggct tggggcagag acagcatggg gaaggaggtc cccttcccca 1680agagtcctct ctatcctgga tgagctcatg aacatttctc ttgtgttcct gactccttcc 1740caatgaacac ctctctgcca ccccaagctc tgctctcctc ctctgtgagc tctgggcttc 1800ccagtttgtt tacccgggaa agtacgtcta gattgtgtgg tttgcctcat tgtgctattt 1860gcccactttc cttccctgaa gaaatatctg tgaaccttct ttctgttcag tcctaaaatt 1920cgaaataaag tgagactatg gttcacctgt aaaaaaaaaa aaggaatt 1968101486PRTHomo sapiens 101Met Trp Lys Ser Val Val Gly His Asp Val Ser Val Ser Val Glu Thr1 5 10 15Gln Gly Asp Asp Trp Asp Thr Asp Pro Asp Phe Val Asn Asp Ile Ser 20 25 30Glu Lys Glu Gln Arg Trp Gly Ala Lys Thr Ile Glu Gly Ser Gly Arg 35 40 45Thr Glu His Ile Asn Ile His Gln Leu Arg Asn Lys Val Ser Glu Glu 50 55 60His Asp Val Leu Arg Lys Lys Glu Met Glu Ser Gly Pro Lys Ala Ser65 70 75 80His Gly Tyr Gly Gly Arg Phe Gly Val Glu Arg Asp Arg Met Asp Lys 85 90 95Ser Ala Val Gly His Glu Tyr Val Ala Glu Val Glu Lys His Ser Ser 100 105 110Gln Thr Asp Ala Ala Lys Gly Phe Gly Gly Lys Tyr Gly Val Glu Arg 115 120 125Asp Arg Ala Asp Lys Ser Ala Val Gly Phe Asp Tyr Lys Gly Glu Val 130 135 140Glu Lys His Thr Ser Gln Lys Asp Tyr Ser Arg Gly Phe Gly Gly Arg145 150 155 160Tyr Gly Val Glu Lys Asp Lys Trp Asp Lys Ala Ala Leu Gly Tyr Asp 165 170 175Tyr Lys Gly Glu Thr Glu Lys His Glu Ser Gln Arg Asp Tyr Ala Lys 180 185 190Gly Phe Gly Gly Gln Tyr Gly Ile Gln Lys Asp Arg Val Asp Lys Ser 195 200 205Ala Val Gly Phe Asn Glu Met Glu Ala Pro Thr Thr Ala Tyr Lys Lys 210 215 220Thr Thr Pro Ile Glu Ala Ala Ser Ser Gly Ala Arg Gly Leu Lys Ala225 230 235 240Lys Phe Glu Ser Met Ala Glu Glu Lys Arg Lys Arg Glu Glu Glu Glu 245 250 255Lys Ala Gln Gln Val Ala Arg Arg Gln Gln Glu Arg Lys Ala Val Thr 260 265 270Lys Arg Ser Pro Glu Ala Pro Gln Pro Val Ile Ala Met Glu Glu Pro 275 280 285Ala Val Pro Ala Pro Leu Pro Lys Lys Ile Ser Ser Glu Ala Trp Pro 290 295 300Pro Val Gly Thr Pro Pro Ser Ser Glu Ser Glu Pro Val Arg Thr Ser305 310 315 320Arg Glu His Pro Val Pro Leu Leu Pro Ile Arg Gln Thr Leu Pro Glu 325 330 335Asp Asn Glu Glu Pro Pro Ala Leu Pro Pro Arg Thr Leu Glu Gly Leu 340 345 350Gln Val Glu Glu Glu Pro Val Tyr Glu Ala Glu Pro Glu Pro Glu Pro 355 360 365Glu Pro Glu Pro Glu Pro Glu Asn Asp Tyr Glu Asp Val Glu Glu Met 370 375 380Asp Arg His Glu Gln Glu Asp Glu Pro Glu Gly Asp Tyr Glu Glu Val385 390 395 400Leu Glu Pro Glu Asp Ser Ser Phe Ser Ser Ala Leu Ala Gly Ser Ser 405 410 415Gly Cys Pro Ala Gly Ala Gly Ala Gly Ala Val Ala Leu Gly Ile Ser 420 425 430Ala Val Ala Leu Tyr Asp Tyr Gln Gly Glu Gly Ser Asp Glu Leu Ser 435 440 445Phe Asp Pro Asp Asp Val Ile Thr Asp Ile Glu Met Val Asp Glu Gly 450 455 460Trp Trp Arg Gly Arg Cys His Gly His Phe Gly Leu Phe Pro Ala Asn465 470 475 480Tyr Val Lys Leu Leu Glu 48510296DNAHomo sapiens 102ctgacagcat ctggctttca gttcctcagt caccactact ttgtaccaaa ttcactgttt 60tggctctgaa atctaatttt gagtttagca aggatg 96103349DNAHomo sapiens 103ccagagtgca ggatacatca ttggcaccaa gggtcttttt caattcttgg tcaatcctct 60gcagcaagca cccccggatg acgtcctcat agatgccctc agtggtcaga gcctggctgc 120ccacggcaag gacatccccc tcgaactcag gcagctcctt tttgcagcct ggctcgagtt 180ggctcagcac aaaaggtaaa aagatgcaga gaccccagcc tcggatgaac ctcctctgcg 240ccaacccgct gtccgatttg aatttcttca gcacgcgccc cctgactctc tccagcctct 300gggcagcctg gtcacagttg agggccgtcg tcagacactg gtcagccag 349104116PRTHomo sapiens 104Leu Ala Asp Gln Cys Leu Thr Thr Ala Leu Asn Cys Asp Gln Ala Ala1 5 10 15Gln Arg Leu Glu Arg Val Arg Gly Arg Val Leu Lys Lys Phe Lys Ser 20 25 30Asp Ser Gly Leu Ala Gln Arg Arg Phe Ile Arg Gly Trp Gly Leu Cys 35 40 45Ile Phe Leu Pro Phe Val Leu Ser Gln Leu Glu Pro Gly Cys Lys Lys 50 55 60Glu Leu Pro Glu Phe Glu Gly Asp Val Leu Ala Val Gly Ser Gln Ala65 70 75 80Leu Thr Thr Glu Gly Ile Tyr Glu Asp Val Ile Arg Gly Cys Leu Leu 85 90 95Gln Arg Ile Asp Gln Glu Leu Lys Lys Thr Leu Gly Ala Asn Asp Val 100 105 110Ser Cys Thr Leu 115105311DNAHomo sapiensmodified_base(1)..(311)n = g, a, c or t 105ctgcaagaca gcagagaanc tgccaatatc cagttagcag atgactttgc tggcaagcag 60aggaagncgg taaaagcttg tctcccagcc aggaaacttg acaccaagnt aagatttgga 120gctaggaaac aaacccaaaa ggctcacagc aagcggagaa aaaaacccca aaatctgtaa 180cctgtatcac aaagcgttca tatccttcag atataaagag ttattagata tcaataagaa 240aaatgcaaac actcctgaaa agtagaaaaa agctatgaac aggcaattca ctgaaattaa 300aaaaaaaaaa a 3111065107DNAHomo sapiens 106cgcaggcggt ggtcgtgggg aagggaagag gagccccggg agacgacagc agcatgggtg 60ggcggccttc gagccctctg gacaagcagc agcggcagca cctaaggggt caggtggaca 120ccctgctgag gaacttcctg ccttgctacc gtgggcagct ggcagcgtct gtcctgcggc 180agatctctcg agagctgggc cctcaggagc cgaccggaag ccagttgcta cgcagcaaaa 240agctgccccg agtccgtgag caccgaggac ccctgaccca gcttcggggc cacccacccc 300ggtggcagcc gatcttctgt gttctgcgtg gggacggccg cctagagtgg ttcagccaca 360aggaggaata tgaaaacggg ggccactgcc ttggctcaac agccctgaca ggatacacgc 420tcctgacttc ccagcgagaa tatctccgcc ttttggatgc tctctgccct gaatccttgg 480gagaccatac tcaggaagag cctgactccc tcttggaagt gcctgtgagc ttcccgctgt 540tcctgcagca ccccttccgc cggcacctct gcttctctgc agccaccagg gaggcacagc 600atgcctggag gctggccctg cagggtggca tccggcttca gggcacagtc ctgcagcgaa 660gccaggcccc tgctgcccgg gccttcctgg acgccgtccg actctaccgg cagcaccaag 720gccactttgg cgacgacgac gtgaccctag gctcagacgc cgaggtgctg accgcggtgc 780tgatgcggga gcaacttccc gcgctgcgag cccagaccct tcctggcctg cggggggcag 840gccgcgcccg cgcctgggcc tggaccgagc ttctagacgc cgttcacgca gctgtcctgg 900ccggggcctc cgccgggctc tgcgccttcc agcccgaaaa ggacgagctg cttgcgtcgc 960tggagaagac gatccgcccg gacgtggacc agctgctgcg gcagcgggcg cgtgtggcgg 1020ggcggctgag gacggatatc aggggaccgc tcgagtcgtg cctgcgccgg gaggtggacc 1080cgcagctgcc ccgggtcgtg cagaccctgc tgcgcaccgt ggaagcctcg ctcgaggcgg 1140tgcggaccct cctggctcaa ggcatggacc gactgtccca ccgcctgcgc cagagcccct 1200cgggcacgcg gctgcgcagg gaggtttact catttgggga gatgccgtgg gacttggcgc 1260tgatgcagac atgctaccgt gaggccgagc ggagccgggg gcgcttgggg cagctggcag 1320caccgtttgg ctttctgggg atgcagagcc tcgtgtttgg ggcccaagat cttgcacagc 1380agctcatggc tgacgccgtg gccaccttcc tgcagctggc tgaccagtgt ctgacgacgg 1440ccctcaactg tgaccaggct gcccagaggc tggagagagt cagggggcgc gtgctgaaga 1500aattcaaatc ggacagcggg ttggcgcaga ggaggttcat ccgaggctgg ggtctctgca 1560tctttttacc ttttgtgctg agccaactcg agccaggctg caaaaagacg gagtctcgct 1620ctgtcgccca ggctgtagtg cagtggtgtg atcttggctc gctgcggcct ccacctccta 1680ggttcaagcg atcctcccat ctcggcctcc caagtagctg ggattacagg cacccgctat 1740agggaccagc cccacagggt cggtgggtct ctccctgtgt gcagagacaa gagagtgtag 1800aaataaagac acaagacaaa gagataaaag aaaagacagc tgggcccggg ggaccactac 1860taccaagttg cggagaccgg tagtggcccc gaatgtctgg ctgcgctgtt atttattgga 1920tacaaagcaa aaggggcagg gtaaagagtg tgagtcatct ccaatgatag gtaaggtcac 1980gtgggtcatg tgtccactgg acagggggcc cctccctgcc tggcagctga ggcagagaga 2040gagaggagac aaagagaaag acagcttaag ccattatttc tgcatatcag agacttttag 2100tactttcact aactgactac tgctatctag aaggcagagc caggtgtaca ggatggaaca 2160cgaaggcgga ctaggagcga gaccactgaa gcacagcatc acagggagac ggttaggtct 2220ctggataact gtgggcaagc ctgactgata tcaggccctc cacaagaggt ggaggagcag 2280agtcttctct aaactccccc ggagaaaagg agactccctt tcccggtctg ctaagtagcc 2340ggtgtttttc cttgacactt ttcgctaccg ctagaccacg gtctgcctgg caacaggcat 2400cttcccagac gctggcgtca ccgctagacc aaggagccct tctgctggcc ctgtccgggc 2460ataacagaag gctcgcactc ttgtcttctg gtcatacctc actatgcccc ctcagctcct 2520atctctgtat ggcctggttt ttcctaggtt atgattgtag agtgaggatt attataatat 2580tggaataaag agtaactgct accaactaat cattaatgat attcatatat aatcatatct 2640aatatctata tctggtataa ctattcttgt tttatatttt gttatactgg aacagctcat 2700gtcctcggtc tcttgcctca gcacctgggt ggcttgccgc ccacaacccg ccaccacgcc 2760cagctaattt ttgtactttt ggtagagacg gtggtttcac catgttggtc aggctggtct 2820tgaactcctg acctcatgat ccgcccacct cagccaacca aagtgctggg attacaggca 2880tgagccaccg cacccggcct gtttatttta aaataaaaat atttaaaaat aaagataagg 2940aaactaaggc ccaagccccg ccccccaacc ccacagctaa tcaggcccag ggctagggca 3000gaagcctgtg ttgtaggcct ctagaggggc cctcctctcc atccgagccc ctaacccgcc 3060atggttccag gagctgcctg agttcgaggg ggatgtcctt gccgtgggca gccaggctct 3120gaccactgag ggcatctatg aggacgtcat ccgggggtgc ttgctgcaga ggattgacca 3180agacccttgg tgccaatgat gtatcctgca ctctggacgg ctgcttggag gtcccatggg 3240aacaggaggg agcagatgag gaaactgagg ctgagcggga aggaggggct tgtcccaggc 3300agccagactc tggtgcccag atccagccac tctgcccacc gccttctcca ggaacattcc 3360ggagctgaat cttcacccac atctatcttg tttctattgg ataaatgtct acaagtggaa 3420tttctgggcc aaaacggatg tgccatcttt aggcttttgt aacccctgca acttcagaaa 3480actgtaccat tttatactcc aagcagcagc atttatttgt gtattttccc caaggctttc 3540tttattttaa tttttttttt tttttttgag actgggtctt gctctgtcac ccgggctggg 3600gtgcagtggc aggatctcgg ctcactgcga cctccgcctc ccgggttcaa gcgattctcc 3660tgcctcagcc tcccgagtag ctgggatttc aggcacccgc caccatgcct ggttaattgt 3720gtttttggta gagatggggt ttcgccgtgt tggccaggct ggtctcgaac tcctgtcctt 3780aggtggtctg cccgcctcag cctcccggag tgctgggatt gcaggtgtga gccaccacac 3840gtggcctaat tttttttttt taaataatag agacaaggtc tcgctatgct gcccaggctg 3900atctcaaact cctggactca agcaatcctc ctgccttggc ctcccaaagt gctaggatta 3960taggagtgat ccactatgtc cagcctccaa atcctttcta aacactagga cttttcatga 4020aaagaaaaaa gctatgccag ttagacacac acagaaatct catgatttta ttttgaattt 4080ctttgactaa attgaactta caaataagtt tattatggcc gggcgtggcg gtgcacacct 4140gtggtcccgg cactttggga ggctgaggcg ggcagatcac ttgagctcag gagttcggga 4200ccagcctggc ggacgtggtg ggacctcatc tctacaaaaa atacaaaatt agcggccggg 4260agtggtggct cacgcctgtc atcccagcac tttgggaggc tgagacaggt ggattgcttg 4320agccaaggag ttttgaggcc agcttgggca atgtggtgaa acctgtctct actaaaaaat 4380aaaataaata aataaataaa taaataaata aataaataaa taaaatttaa aagaagctgg 4440gctgagatgg gagatttgcc tgagcctggg aactcaaggc tgcagtgagt ggtgattgca 4500ccactgcact ccagcctggg tgatgggagt gagaccctgt ctcaaaaaac aaaatccaaa 4560tatgttgatt agccatttac atgtttgtag tttttttttt tttaatttca gtgaattgcc 4620tgttcatagc ttttttctac ttttcaggag tgtttgcatt tttcttattg atatctaata 4680actctttata tctgaaggat atgaacgctt tgtgatacag gttacagatt ttggggtttt 4740tttctccgct tgctgtgagc cttttgggtt tgtttcctag ctccaaatct taacttggtg 4800tcaagtttcc tggctgggag acaagctttt accgacttcc tctgcttgcc agcaaagtca 4860tctgctaact ggatattggc agcttctctg ctgtcttgca gctgcttccg gagtgggttc 4920cacagggatt cccgtgtgtt cttggttcag cttgcagagg gactttcaca ctccctggag 4980accgtttcct cccattctgt ctggagtttt cggcctaccc caagacaatg agatattcct 5040gacctttcca cctatttccc tccaacccca ccttccaaaa tacatttgct caatacattt 5100gcacttc 5107107579PRTHomo sapiens 107Gln Ala Val Val Val Gly Lys Gly Arg Gly Ala Pro Gly Asp Asp Ser1 5 10 15Ser Met Gly Gly Arg Pro Ser Ser Pro Leu Asp Lys Gln Gln Arg Gln 20 25 30His Leu Arg Gly Gln Val Asp Thr Leu Leu Arg Asn Phe Leu Pro Cys 35 40 45Tyr Arg Gly Gln Leu Ala Ala Ser Val Leu Arg Gln Ile Ser Arg Glu 50 55 60Leu Gly Pro Gln Glu Pro Thr Gly Ser Gln Leu Leu Arg Ser Lys Lys65 70 75 80Leu Pro Arg Val Arg Glu His Arg Gly Pro Leu Thr Gln Leu Arg Gly 85 90 95His Pro Pro Arg Trp Gln Pro Ile Phe Cys Val Leu Arg Gly Asp Gly 100 105 110Arg Leu Glu Trp Phe Ser His Lys Glu Glu Tyr Glu Asn Gly Gly His 115 120 125Cys Leu Gly Ser Thr Ala Leu Thr Gly Tyr Thr Leu Leu Thr Ser Gln 130 135 140Arg Glu Tyr Leu Arg Leu Leu Asp Ala Leu Cys Pro Glu Ser Leu Gly145 150 155 160Asp His Thr Gln Glu Glu Pro Asp Ser Leu Leu Glu Val Pro Val Ser 165 170 175Phe Pro Leu Phe Leu Gln His Pro Phe Arg Arg His Leu Cys Phe Ser 180 185 190Ala Ala Thr Arg Glu Ala Gln His Ala Trp Arg Leu Ala Leu Gln Gly 195 200 205Gly Ile Arg Leu Gln Gly Thr Val Leu Gln Arg Ser Gln Ala Pro Ala 210 215 220Ala Arg Ala Phe Leu Asp Ala Val Arg Leu Tyr Arg Gln His Gln Gly225 230 235 240His Phe Gly Asp Asp Asp Val Thr Leu Gly Ser Asp Ala Glu Val Leu 245 250 255Thr Ala Val Leu Met Arg Glu Gln Leu Pro Ala Leu Arg Ala Gln Thr 260 265 270Leu Pro Gly Leu Arg Gly Ala Gly Arg Ala Arg Ala Trp Ala Trp Thr 275 280 285Glu Leu

Leu Asp Ala Val His Ala Ala Val Leu Ala Gly Ala Ser Ala 290 295 300Gly Leu Cys Ala Phe Gln Pro Glu Lys Asp Glu Leu Leu Ala Ser Leu305 310 315 320Glu Lys Thr Ile Arg Pro Asp Val Asp Gln Leu Leu Arg Gln Arg Ala 325 330 335Arg Val Ala Gly Arg Leu Arg Thr Asp Ile Arg Gly Pro Leu Glu Ser 340 345 350Cys Leu Arg Arg Glu Val Asp Pro Gln Leu Pro Arg Val Val Gln Thr 355 360 365Leu Leu Arg Thr Val Glu Ala Ser Leu Glu Ala Val Arg Thr Leu Leu 370 375 380Ala Gln Gly Met Asp Arg Leu Ser His Arg Leu Arg Gln Ser Pro Ser385 390 395 400Gly Thr Arg Leu Arg Arg Glu Val Tyr Ser Phe Gly Glu Met Pro Trp 405 410 415Asp Leu Ala Leu Met Gln Thr Cys Tyr Arg Glu Ala Glu Arg Ser Arg 420 425 430Gly Arg Leu Gly Gln Leu Ala Ala Pro Phe Gly Phe Leu Gly Met Gln 435 440 445Ser Leu Val Phe Gly Ala Gln Asp Leu Ala Gln Gln Leu Met Ala Asp 450 455 460Ala Val Ala Thr Phe Leu Gln Leu Ala Asp Gln Cys Leu Thr Thr Ala465 470 475 480Leu Asn Cys Asp Gln Ala Ala Gln Arg Leu Glu Arg Val Arg Gly Arg 485 490 495Val Leu Lys Lys Phe Lys Ser Asp Ser Gly Leu Ala Gln Arg Arg Phe 500 505 510Ile Arg Gly Trp Gly Leu Cys Ile Phe Leu Pro Phe Val Leu Ser Gln 515 520 525Leu Glu Pro Gly Cys Lys Lys Thr Glu Ser Arg Ser Val Ala Gln Ala 530 535 540Val Val Gln Trp Cys Asp Leu Gly Ser Leu Arg Pro Pro Pro Pro Arg545 550 555 560Phe Lys Arg Ser Ser His Leu Gly Leu Pro Ser Ser Trp Asp Tyr Arg 565 570 575His Pro Leu1082917DNAHomo sapiens 108ctagaatgct aattgcactt aggcctcatg gttctagtaa acggcagctg tgggcccttt 60tgcctcttcc cctgttcttg gcctcacatc tccagctgag ctgccggtct tggcttcctg 120gtcgcctctg tcccagagat ggtcccaggg agccatccta gggcaggtag cactgaggct 180cctgtggaaa caggagccac ctgctcagga gacccctttc ctgaggaagt ccttacctct 240ccccttgaga tgtaaaaatg gtccagcaga gacaagctcc cgtggaaaac agacaggagc 300atgggggcag ctgtcatggc tgtggcgggc acttttcctc agagtttctg ccttgcgctg 360gtccaggagc cattttgcac caaggacttg gtaggcagag gcagccccac tgtaaagaag 420ggtcagatta aaacaaaaaa ctgccaaaag catcccctct gccccccatg tggcactggc 480atcattctct gcttccctgg gaggaatttt ttcaccatgt tattgaaggg gatggttcat 540taaggactcc acccctcaga gctcactcag accccaagga cagaggtgac tggggcttgg 600tgacttgttc actccttttt tcccaggtat actgaagggg tgacagagag aggtcttcat 660ggcagaccag gccttcacag ctaatgggga gaggaactca tgttacctct gcaggcctgg 720ggtcctgagg gggtcttttg gcttcagcct gttcccccag aggcttgatc atcccacatt 780gtcccttcag ctcagctgct cttctccccc acccaccctg ggatgtgggt gctctgggct 840gaaccaaggc tatgacttct ggagagaggc tcaggggttg gtctgagagg cctgccatcc 900acccctcagg gagctaggtt ttctcagagg ctcagctgga cagcactttt tagaaaagtt 960tgtagcatta agctggttta aaatatgaag ttggttttgt tggatggctc ctgagctgac 1020tgactgatgt ctgaagtttg agacgaggga ttatttcagg gtggggccca atgtgatcta 1080atgcccagct ggggacaatt gtgcctcatc atttgctcaa attcctgggc ccccaagtta 1140gccccctccc aggagtggtc agcgggtcac agctgccccc actctataag cagggctaat 1200tgtgtaccct ttgcagaaat gcttttggtc tcctacccaa atactcacaa gggtcttatc 1260agacgcccgt cttaaagtcc agcatgctca gggaccctgt gtaggatctc gtttgtggtg 1320agtgggctgc tctgaggtct ccactgggct gccatttagc catgtgccat ctctgaagtc 1380agaggtgttt gactcccatt ccttgggctc tggagctttc cccaagaatt acatcagaga 1440aaaggaagaa ggggcctgca ggacccattg ggaatgagtt taatactgaa gtctggaatg 1500taagctcatg ccctagaggc ctctccatat ggctggtcag gggagctgcc ttcaggcttg 1560tgccccgtgt gctcagcagc tgcctctgtc cccctctact gtccctttca caccttgcct 1620ggccaagggg ctagacctcc caggctaagc ctcagattca gtgcaggaca caagctcatg 1680cccccgtctt gccagtgaca cttgaagcct cccgacttcc acagagtgct tcaggacaca 1740ttttgagtgg tattttcttt tctttttttc ttcttttttt tttttttgag atggagtctc 1800gctctgttgc ccaggctgga gtgcagtggc ctgatctcgg ctcactgcaa cctctgcctc 1860ccaggttcaa gcgattcttc tgcctcagcc tccagagtag ctgggactat agacatgcac 1920caccacgccc ggctaatttt gtatttttgg tcgagacggg gttttgccat gttagtcagg 1980ctggtcttga actcctgacc tcaagtgatc caccacctcg gcctcccaaa gtgttgagat 2040gacaggcacg agccaccagg cccagcctga gtggtatttt ctttagggac caggtagact 2100ttaaaacgag ggtaagagaa aagccagtgt ctttctgagg taaataattt ctgccaggaa 2160acttcccagc cccaccagca gcccccctaa aaaaatcact cgtgtcccca gggacttcta 2220aagcttgggg ctccaggaaa tcatccagta gagttggaga ttcagagatt tcttgaagcc 2280agggacatgc tcctaactcc tttcccatta aaggtgttag aatagaccag agggtgtccc 2340ttttccacag taatgggatc ggctggtgtg ccttcaggga ggaagaggga ggtggtcaag 2400cttgaaaaac tggctttagg atggttctga ctttgttctc cctccccaag tgttctcaac 2460ctccattctg cagtgttcag agttttaggg aaagggtttg ggtgccccag catccaggtg 2520ttgtgtggct tagcgcatgt gaagtgaaaa ccttctgggg ttgtttggaa gcagctttct 2580ggttcttgtg attgtatcct gaggtcccag aaccctattc tcccacgagg atcctcagtg 2640accatggtgg ccacacgcct ggccagcctg ctggctcctg ggtgagctga agaaccttgc 2700ctgtggcact tttcgagggt gagctggaac cgagagaaca tggtccccgt gctgggactc 2760atgcgggtca tttcctgccg gcctggtttc gcctggtcgt gtctttatga gcaccatgta 2820agcctccttg tattgagata attgggcatt aaacattaaa ctgcagctct gggaaaaaaa 2880aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaac 291710983PRTHomo sapiens 109Met Glu Ser Arg Ser Val Ala Gln Ala Gly Val Gln Trp Pro Asp Leu1 5 10 15Gly Ser Leu Gln Pro Leu Pro Pro Arg Phe Lys Arg Phe Phe Cys Leu 20 25 30Ser Leu Gln Ser Ser Trp Asp Tyr Arg His Ala Pro Pro Arg Pro Ala 35 40 45Asn Phe Val Phe Leu Val Glu Thr Gly Phe Cys His Val Ser Gln Ala 50 55 60Gly Leu Glu Leu Leu Thr Ser Ser Asp Pro Pro Pro Arg Pro Pro Lys65 70 75 80Val Leu Arg110509DNAHomo sapiensmodified_base(467)n = g, a, c or t 110aaaccttaag aaccacataa tactatataa tgcttttctg tacaaatctc aagaaacact 60ttcattcatt aaaacatcat gaaaatcctt aaatgtgtta aatggaaaaa aatgaaacca 120tgaacaaaaa agctatacat gtaggtgcat atttatctcc tcctgagttg ggagaaatct 180ttctaagcat agaaacaatg gtagcaaaag agaagaatag atttggctgg attaacaata 240aaaaatttct gccagaaata tgaaaattca atttagacaa aattcaatat aaacaaaatt 300aatatagaca aaggtggtaa acaggtggtt ctcagagaag ataaatacat gattatttaa 360cataaaaaga aatgttcaat gtttctagaa gacaaataat tacaaaccta aacaaattgt 420atatttgtta gattggcata aattataata atccaacatt gagttangtg gaatataaat 480tggtaaaata tttctggaag acaatttgg 509111525DNAHomo sapiens 111agaagtgatt atgggattaa aagaatacat aattacagtg ttttgggatt gggctctttt 60ttttcttaat agaaaagcag aaacttcata aataatagct gtgctttaga taccagataa 120caaatattgt ttcccctgaa gatatgacct actagaacta ctcacatata tagtccaata 180attgctgact taataggtat ggtaaaatag ctgataataa gtcagactct caagagtttc 240tgtaccttga ttattgacaa attcattgtt ttacatccta ctaaagaaca tgtgtgtggg 300gagggggtgg ggaactggtt cacaacataa tctgaaggag atcaaacatc tgtaaggaca 360ggtacccagt gatgataata tatctgaaaa cacaagccat ttttattctt tatcccaatt 420aacttgaggt actctaatga tgaagcactc gattgcacta tgacctcctt gagtgatggg 480cagcttggtt cctctctcac tttttgtttc tttttaatat gcaaa 525112183DNAHomo sapiensmodified_base(1)..(183)n = g, a, c or t 112aaaaaaagac aatttgagca ggacgaccct ctccaatctg ggtagcatgg ttagcctgtg 60cagtaacaac gtaggcttgg aggatgggtn caatgaaaat gattctgatt cggaaacgtt 120ttgactttgg actgtanaag cttttctttg atcacctgtg ntggaggaaa ggaaagaagc 180ctt 1831131750DNAHomo sapiens 113cagctctctg tcagaatggc caccatggta ccatccgtgt tgtggcccag ggcctgctgg 60actctgctgg tctgctgtct gctgacccca ggtgtccagg ggcaggagtt ccttttgcgg 120gtggagcccc agaaccctgt gctctctgct ggagggtccc tgtttgtgaa ctgcagtact 180gattgtccca gctctgagaa aatcgccttg gagacgtccc tatcaaagga gctggtggcc 240agtggcatgg gctgggcagc cttcaatctc agcaacgtga ctggcaacag tcggatcctc 300tgctcagtgt actgcaatgg ctcccagata acaggctcct ctaacatcac cgtgtacggg 360ctcccggagc gtgtggagct ggcacccctg cctccttggc agccggtggg ccagaacttc 420accctgcgct gccaagtgga gggtgggtcg ccccggacca gcctcacggt ggtgctgctt 480cgctgggagg aggagctgag ccggcagccc gcagtggagg agccagcgga ggtcactgcc 540actgtgctgg ccagcagaga cgaccacgga gcccctttct catgccgcac agaactggac 600atgcagcccc aggggctggg actgttcgtg aacacctcag ccccccgcca gctccgaacc 660tttgtcctgc ccgtgacccc cccgcgcctc gtggcccccc ggttcttgga ggtggaaacg 720tcgtggccgg tggactgcac cctagacggg ctttttccag cctcagaggc ccaggtctac 780ctggcgctgg gggaccagat gctgaatgcg acagtcatga accacgggga cacgctaacg 840gccacagcca cagccacggc gcgcgcggat caggagggtg cccgggagat cgtctgcaac 900gtgaccctag ggggcgagag acgggaggcc cgggagaact tgacggtctt tagcttccta 960ggacccattg tgaacctcag cgagcccacc gcccatgagg ggtccacagt gaccgtgagt 1020tgcatggctg gggctcgagt ccaggtcacg ctggacggag ttccggccgc ggccccgggg 1080cagccagctc aacttcagct aaatgctacc gagagtgacg acggacgcag cttcttctgc 1140agtgccactc tcgaggtgga cggcgagttc ttgcacagga acagtagcgt ccagctgcga 1200gtcctgtatg gtcccaaaat tgaccgagcc acatgccccc agcacttgaa atggaaagat 1260aaaacgagac acgtcctgca gtgccaagcc aggggcaacc cgtaccccga gctgcggtgt 1320ttgaaggaag gctccagccg ggaggtgccg gtggggatcc cgttcttcgt caacgtaaca 1380cataatggta cttatcagtg ccaagcgtcc agctcacgag gcaaatacac cctggtcgtg 1440gtgatggaca ttgaggctgg gagctcccac tttgtccccg tcttcgtggc ggtgttactg 1500accctgggcg tggtgactat cgtactggcc ttaatgtacg tcttcaggga gcaccaacgg 1560agcggcagtt accatgttag ggaggagagc acctatctgc ccctcacgtc tatgcagccg 1620acagaagcaa tgggggaaga accgtccaga gctgagtgac gctgggatcc gggatcaaag 1680ttggcggggg cttggctgtg ccctcagatt ccgcaccaat aaagccttca aactccctaa 1740aaaaaaaaaa 1750114547PRTHomo sapiens 114Met Ala Thr Met Val Pro Ser Val Leu Trp Pro Arg Ala Cys Trp Thr1 5 10 15Leu Leu Val Cys Cys Leu Leu Thr Pro Gly Val Gln Gly Gln Glu Phe 20 25 30Leu Leu Arg Val Glu Pro Gln Asn Pro Val Leu Ser Ala Gly Gly Ser 35 40 45Leu Phe Val Asn Cys Ser Thr Asp Cys Pro Ser Ser Glu Lys Ile Ala 50 55 60Leu Glu Thr Ser Leu Ser Lys Glu Leu Val Ala Ser Gly Met Gly Trp65 70 75 80Ala Ala Phe Asn Leu Ser Asn Val Thr Gly Asn Ser Arg Ile Leu Cys 85 90 95Ser Val Tyr Cys Asn Gly Ser Gln Ile Thr Gly Ser Ser Asn Ile Thr 100 105 110Val Tyr Gly Leu Pro Glu Arg Val Glu Leu Ala Pro Leu Pro Pro Trp 115 120 125Gln Pro Val Gly Gln Asn Phe Thr Leu Arg Cys Gln Val Glu Gly Gly 130 135 140Ser Pro Arg Thr Ser Leu Thr Val Val Leu Leu Arg Trp Glu Glu Glu145 150 155 160Leu Ser Arg Gln Pro Ala Val Glu Glu Pro Ala Glu Val Thr Ala Thr 165 170 175Val Leu Ala Ser Arg Asp Asp His Gly Ala Pro Phe Ser Cys Arg Thr 180 185 190Glu Leu Asp Met Gln Pro Gln Gly Leu Gly Leu Phe Val Asn Thr Ser 195 200 205Ala Pro Arg Gln Leu Arg Thr Phe Val Leu Pro Val Thr Pro Pro Arg 210 215 220Leu Val Ala Pro Arg Phe Leu Glu Val Glu Thr Ser Trp Pro Val Asp225 230 235 240Cys Thr Leu Asp Gly Leu Phe Pro Ala Ser Glu Ala Gln Val Tyr Leu 245 250 255Ala Leu Gly Asp Gln Met Leu Asn Ala Thr Val Met Asn His Gly Asp 260 265 270Thr Leu Thr Ala Thr Ala Thr Ala Thr Ala Arg Ala Asp Gln Glu Gly 275 280 285Ala Arg Glu Ile Val Cys Asn Val Thr Leu Gly Gly Glu Arg Arg Glu 290 295 300Ala Arg Glu Asn Leu Thr Val Phe Ser Phe Leu Gly Pro Ile Val Asn305 310 315 320Leu Ser Glu Pro Thr Ala His Glu Gly Ser Thr Val Thr Val Ser Cys 325 330 335Met Ala Gly Ala Arg Val Gln Val Thr Leu Asp Gly Val Pro Ala Ala 340 345 350Ala Pro Gly Gln Pro Ala Gln Leu Gln Leu Asn Ala Thr Glu Ser Asp 355 360 365Asp Gly Arg Ser Phe Phe Cys Ser Ala Thr Leu Glu Val Asp Gly Glu 370 375 380Phe Leu His Arg Asn Ser Ser Val Gln Leu Arg Val Leu Tyr Gly Pro385 390 395 400Lys Ile Asp Arg Ala Thr Cys Pro Gln His Leu Lys Trp Lys Asp Lys 405 410 415Thr Arg His Val Leu Gln Cys Gln Ala Arg Gly Asn Pro Tyr Pro Glu 420 425 430Leu Arg Cys Leu Lys Glu Gly Ser Ser Arg Glu Val Pro Val Gly Ile 435 440 445Pro Phe Phe Val Asn Val Thr His Asn Gly Thr Tyr Gln Cys Gln Ala 450 455 460Ser Ser Ser Arg Gly Lys Tyr Thr Leu Val Val Val Met Asp Ile Glu465 470 475 480Ala Gly Ser Ser His Phe Val Pro Val Phe Val Ala Val Leu Leu Thr 485 490 495Leu Gly Val Val Thr Ile Val Leu Ala Leu Met Tyr Val Phe Arg Glu 500 505 510His Gln Arg Ser Gly Ser Tyr His Val Arg Glu Glu Ser Thr Tyr Leu 515 520 525Pro Leu Thr Ser Met Gln Pro Thr Glu Ala Met Gly Glu Glu Pro Ser 530 535 540Arg Ala Glu545115275DNAHomo sapiens 115cctgatgccc gaatttcagt ttggcactta cagcgaatct gagaggaaaa ccgaggagta 60cgatactcag gccatgaagt acttgtcata cctgctgtac cctctctgtg tcgggggtgc 120tgtctattca ctcctgaata tcaaatataa gagctggtac tcctggttaa tcaacagctt 180cgtcaacggg gtctatgcct ttggtttcct cttcatgctg ccccagctct ttgtgaacta 240caagttgaag tcagtggcac atctgccctg gaagg 2751162040DNAHomo sapiens 116cagctccttc accagcttgg tggtgggcgt gttcgtggtc tacgtggtgc acacctgctg 60ggtcatgtac ggcatcgtct acacccgccc gtgctccggc gacgccaact gcatccagcc 120ctacctggcg cggcggccca agctgcagct gagcgtgtac accacgacga ggtcccacct 180gggtgctgag aacaacatcg acctggtctt gaatgtggaa gactttgatg tggagtccaa 240atttgaaagg acagttaatg tttctgtacc aaagaaaacg agaaacaatg ggacgctgta 300tgcctacatc ttcctccatc acgctggggt cctgccgtgg cacgacggga agcaggtgca 360cctggtcagt cctctgacca cctacatggt ccccaagcca gaagaaatca acctgctcac 420cggggagtct gatacacagc agatcgaggc ggagaagaag ccgacgagtg ccctggatga 480gccagtgtcc cactggcgac cgcggctggc gctgaacgtg atggcggaca actttgtctt 540tgacgggtcc tccctgcctg ccgatgtgca tcggtacatg aagatgatcc agctggggaa 600aaccgtgcat tacctgccca tcctgttcat cgaccagctc agcaaccgcg tgaaggacct 660gatggtcata aaccgctcca ccaccgagct gcccctcacc gtgtcctacg acaaggtctc 720actggggcgg ctgcgcttct ggatccacat gcaggacgcc gtgtactccc tgcagcagtt 780cgggttttca gagaaagatg ctgatgaggt gaaaggaatt tttgtagata ccaacttata 840cttcctggcg ctgaccttct ttgtcgcagc gttccatctt ctctttgatt tcctggcctt 900taaaaatgac atcagtttct ggaagaagaa gaagagcatg atcggcatgt ccaccaaggc 960agtgctctgg cgctgcttca gcaccgtggt catctttctg ttcctgctgg acgagcagac 1020gagcctgctg gtgctggtcc cggcgggtgt tggagccgcc attgagctgt ggaaagtgaa 1080gaaggcattg aagatgacta ttttttggag aggcctgatg cccgaatttc agtttggcac 1140ttacagcgaa tctgagagga aaaccgagga gtacgatact caggccatga agtacttgtc 1200atacctgctg taccctctct gtgtcggggg tgctgtctat tcactcctga atatcaaata 1260taagagctgg tactcctggt taatcaacag cttcgtcaac ggggtctatg cctttggttt 1320cctcttcatg ctgccccagc tctttgtgaa ctacaagttg aagtcagtgg cacatctgcc 1380ctggaaggcc ttcacctaca aggctttcaa caccttcatt gatgacgtct ttgccttcat 1440catcaccatg cccacgtctc accggctggc ctgcttccgg gacgacgtgg tgtttctggt 1500ctacctgtac cagcggtggc tttatcctgt ggataaacgc agagtgaacg agtttgggga 1560gtcctacgag gagaaggcca cgcgggcgcc ccacacggac tgaaggccgc ccgggctgcc 1620gccagccaag tgcaacttga attgtcaatg agtatttttg gaagcatttg gaggaattcc 1680tagacattgc gttttctgtg ttgccaaaat cccttcggac atttctcaga catctcccaa 1740gttcccatca cgtcagattt ggagctggta gcgcttacga tgcccccacg tgtgaacatc 1800tgtcttggtc acagagctgg gtgctgccgg tcaccttgag ctgtggtggc tcccggcaca 1860cgagtgtccg gggttcggcc atgtcctcac gcgggcaggg gtgggagccc tcacaggcaa 1920gggggctgtt ggatttccat ttcaggtggt tttctaagtg ctccttatgt gaatttcaaa 1980cacgtatgga attcattccg catggactct gggatcaaag gctctttcct cttttgtttg 2040117538PRTHomo sapiens 117Met Trp Ser Gly Arg Ser Ser Phe Thr Ser Leu Val Val Gly Val Phe1 5 10 15Val Val Tyr Val Val His Thr Cys Trp Val Met Tyr Gly Ile Val Tyr 20 25 30Thr Arg Pro Cys Ser Gly Asp Ala Asn Cys Ile Gln Pro Tyr Leu Ala 35 40 45Arg Arg Pro Lys Leu Gln Leu Ser Val Tyr Thr Thr Thr Arg Ser His 50 55 60Leu Gly Ala Glu Asn Asn Ile Asp Leu Val Leu Asn Val Glu Asp Phe65 70 75 80Asp Val Glu Ser Lys Phe Glu Arg Thr Val Asn Val Ser Val Pro Lys 85 90 95Lys Thr Arg Asn Asn Gly Thr Leu Tyr Ala Tyr Ile Phe Leu His His 100 105 110Ala Gly Val Leu Pro Trp His Asp Gly Lys Gln Val His Leu Val Ser 115 120 125Pro Leu Thr Thr Tyr Met Val Pro Lys Pro Glu Glu Ile Asn

Leu Leu 130 135 140Thr Gly Glu Ser Asp Thr Gln Gln Ile Glu Ala Glu Lys Lys Pro Thr145 150 155 160Ser Ala Leu Asp Glu Pro Val Ser His Trp Arg Pro Arg Leu Ala Leu 165 170 175Asn Val Met Ala Asp Asn Phe Val Phe Asp Gly Ser Ser Leu Pro Ala 180 185 190Asp Val His Arg Tyr Met Lys Met Ile Gln Leu Gly Lys Thr Val His 195 200 205Tyr Leu Pro Ile Leu Phe Ile Asp Gln Leu Ser Asn Arg Val Lys Asp 210 215 220Leu Met Val Ile Asn Arg Ser Thr Thr Glu Leu Pro Leu Thr Val Ser225 230 235 240Tyr Asp Lys Val Ser Leu Gly Arg Leu Arg Phe Trp Ile His Met Gln 245 250 255Asp Ala Val Tyr Ser Leu Gln Gln Phe Gly Phe Ser Glu Lys Asp Ala 260 265 270Asp Glu Val Lys Gly Ile Phe Val Asp Thr Asn Leu Tyr Phe Leu Ala 275 280 285Leu Thr Phe Phe Val Ala Ala Phe His Leu Leu Phe Asp Phe Leu Ala 290 295 300Phe Lys Asn Asp Ile Ser Phe Trp Lys Lys Lys Lys Ser Met Ile Gly305 310 315 320Met Ser Thr Lys Ala Val Leu Trp Arg Cys Phe Ser Thr Val Val Ile 325 330 335Phe Leu Phe Leu Leu Asp Glu Gln Thr Ser Leu Leu Val Leu Val Pro 340 345 350Ala Gly Val Gly Ala Ala Ile Glu Leu Trp Lys Val Lys Lys Ala Leu 355 360 365Lys Met Thr Ile Phe Trp Arg Gly Leu Met Pro Glu Phe Gln Phe Gly 370 375 380Thr Tyr Ser Glu Ser Glu Arg Lys Thr Glu Glu Tyr Asp Thr Gln Ala385 390 395 400Met Lys Tyr Leu Ser Tyr Leu Leu Tyr Pro Leu Cys Val Gly Gly Ala 405 410 415Val Tyr Ser Leu Leu Asn Ile Lys Tyr Lys Ser Trp Tyr Ser Trp Leu 420 425 430Ile Asn Ser Phe Val Asn Gly Val Tyr Ala Phe Gly Phe Leu Phe Met 435 440 445Leu Pro Gln Leu Phe Val Asn Tyr Lys Leu Lys Ser Val Ala His Leu 450 455 460Pro Trp Lys Ala Phe Thr Tyr Lys Ala Phe Asn Thr Phe Ile Asp Asp465 470 475 480Val Phe Ala Phe Ile Ile Thr Met Pro Thr Ser His Arg Leu Ala Cys 485 490 495Phe Arg Asp Asp Val Val Phe Leu Val Tyr Leu Tyr Gln Arg Trp Leu 500 505 510Tyr Pro Val Asp Lys Arg Arg Val Asn Glu Phe Gly Glu Ser Tyr Glu 515 520 525Glu Lys Ala Thr Arg Ala Pro His Thr Asp 530 5351184217DNAHomo sapiens 118cttcccggcc ccagccaagg ctgtcgttta cgtgtcggac attcaggagc tgtacatccg 60tgtggttgac aaggtggaga ttgggaagac agtgaaggca tacgtccgcg tgctggactt 120gcacaagaag cccttccttg ccaaatactt cccctttatg gacctgaagc tccgagcagc 180ctccccgatc attacattgg tggcccttga tgaagccctt gacaactaca ccatcacatt 240cctcatccgc ggtgtggcca tcggccagac cagtctaact gcaagtgtga ccaataaagc 300tggacagaga atcaactcag ccccacaaca gattgaagtc tttcccccgt tcaggctgat 360gcccaggaag gtgacactgc ttatcggggc cacgatgcag gtcacctccg agggcggccc 420ccagcctcag tccaacatcc ttttctccat cagcaatgag agcgttgcgc tggtgagcgc 480tgctgggctg gtacagggcc tcgccatcgg gaacggcact gtgtctgggc tcgtgcaggc 540agtggatgca gagaccggca aggtggtcat catctctcag gacctcgtgc aggtggaggt 600gctgctgcta agggccgtga ggatccgcgc ccccatcatg cggatgagga cgggcaccca 660gatgcccatc tatgtcaccg gcatcaccaa ccaccagaac cctttctcct ttggcaatgc 720cgtgccaggc ctgaccttcc actggtctgt caccaagcgg gacgtcctgg acctccgagg 780gcggcaccac gaggcgtcga tccgactccc gtcacagtac aactttgcca tgaacgtgct 840cggccgggta aaaggccgga ccgggctgag ggtggtggtc aaggctgtgg accccacatc 900ggggcagctg tatggcctgg ccagagaact ctcggatgag atccaagtcc aggtgtttga 960gaagctgcag ctgctcaacc ctgaaataga agcagaacaa atattaatgt cgcccaactc 1020atatataaag ctgcagacaa acagggatgg tgcagcctct ctgagctacc gcgtcctgga 1080tggacccgaa aaggttccag ttgtgcatgt tgatgagaaa ggctttctag catcagggtc 1140tatgatcggg acatccacca tcgaagtgat tgcacaagag ccctttgggg ccaaccaaac 1200catcattgtt gctgtaaagg tatcccctgt ttcctacctg agggtttcca tgagccctgt 1260cctgcacacc cagaacaagg aggccctggt ggccgtgcct ttgggaatga ccgtgacctt 1320cactgtccac ttccacgaca actctggaga tgtcttccat gctcacagtt cggtcctcaa 1380ctttgccact aacagagacg actttgtgca gatcgggaag ggccccacca acaacacctg 1440cgttgtccgc acagtcagcg tgggcctgac actgctccgt gtgtgggacg cagagcaccc 1500gggcctctcg gacttcatgc ccctgcctgt cctacaggcc atctccccag agctgtctgg 1560ggccatggtg gtgggggacg tgctctgtct ggccactgtt ctgaccagcc tggaaggcct 1620ctcaggaacc tggagctcct cagccaacag catcctccac atcgacccca agacgggtgt 1680ggctgtggcc cgggccgtgg gatccgtgac ggtttactat gaggtcgctg ggcacctgag 1740gacctacaag gaggtggtgg tcagcgtccc tcagaggatc atggcccgtc acctccaccc 1800catccagaca agcttccagg aggctacagc ctccaaagtg attgttgccg tgggagacag 1860aagctctaac ctgagaggcg agtgcacccc cacccagagg gaagtcatcc aggccttgca 1920cccagagacc ctcatcagct gccagtccca gttcaagccg gccgtctttg atttcccatc 1980tcaagatgtg ttcaccgtgg agccacagtt tgacactgct ctcggccagt acttctgctc 2040aatcacaatg cacaggctga cggacaagca gcggaagcac ctgagcatga agaagacagc 2100tctggtggtc agtgcctccc tctccagcag ccacttctcc acagagcagg tgggggccga 2160ggtgcccttc agcccaggtc tcttcgccga ccaggctgaa atccttttga gcaaccacta 2220caccagttcc gagatcaggg tctttggtgc cccggaggtt ctggagaact tggaggtgaa 2280atccgggtcc ccggccgtgc tggcattcgc aaaggagaag tcttttgggt ggcccagctt 2340catcacatac acggtcggcg tctcggaccc cgcggctggc agccaagggc ctctgtccac 2400taccctgacc ttctccagcc ccgtgaccaa ccaagccatt gccatcccag tgacagtggc 2460ttttgtgatg gatcgccgtg ggcccggtcc ttatggagcc agcctcttcc agcacttcct 2520ggattcctac caggtcatgt tcttcacgct cttcgccctg ttggctggga cagcggtcat 2580gatcatagcc taccacactg tctgcacgcc ccgggatctt gctgtgcctg cagccctcac 2640gcctcgagcc agccctggac acagccccca ctatttcgct gcctcatcac ccacatctcc 2700caatgcattg cctcctgctc gcaaagccag ccctccctca gggctgtgga gcccagccta 2760tgcctcccac taggccgcgt gaaggttccc ggaggatggg tctcagccga gcctcgtgca 2820cccccaagat ggaacatccc tgctgcattc acactggaac aagcccctcc agatgagtgc 2880cccggcccca ggccagcttc actgccgtct cttcacacag agctgtagtt tcggctctgc 2940ccattagctc attttatgta ggagttttaa atgtgtgttt ttttcctttc aagtcttaca 3000aagctaagac tttttggctc attccttttt gcatggttgt ctagggtttc tggacaatgt 3060gctgttgcat ttttattttc ctagccttgc taaaatcttt cccttctcaa gactttgagc 3120agttagaagt gctctttaga agttgtctgt gggtgatgtt actgtagtgg tctcagggaa 3180aggattgtcc agttacttta gggggttttt ggtggggttt ttccccctgt gaaaacttac 3240tttgccccta gtctggctgc tgctaggact tctgaggagc aatgggacat gagtgtccct 3300gtatctgcgc cactgccgca agggaagcct caggaaccag cacctggagg ccaggatagc 3360caagccctgg gtgagcgaga ggctggagaa cacaggagct cacccagggc tgctgcccaa 3420ccatgggcca ctgtgaacag acttcagtcc tctgtttttg tttcataagc cgttgagaca 3480tctgatggac ttggcttagg ccctgctggg acatcccacg tgtgatccct ttcactccat 3540caggacacca ggactgtcct taggaaaatg tccttgagat ggcagcagga gtcatatttt 3600ctgtgtgtgt gtttcggaaa gccgctgtgt cctgcctcag cacaaagacc cagtgtcatt 3660tgctcctcct gttcctgtgc cactccagaa cctcagcaga tctgagccac cgcctgccag 3720tgtgagaggc ggccactttc atggcagctt atcaggcgca gggccccaga cagcttccca 3780gccggcccta gagcccggcc tgggccaatg atggagggcg gccaccagcc cagggcctgc 3840ccatccagaa gggactcccc agggcctggg ggaggagacc cttggaaaag tcctctcttc 3900ccagctcctg attctggatc tgagattctc agatcacagg cccctgtgct ccaggccgag 3960gctgggccac cctcagggag atccagagac tcatgcccat ggccatccat gcgtggacgc 4020tgtgtggaga gtccaggatg acgggatccc gcacaagctc ccttcagtcc ttcagggctg 4080ggccatgtgg ttgatttttc taaagctgga gaaaggaaga attgtgcctt gcatattact 4140tgagcttaaa ctgacaacct ggatgtaaat aggagccttt ctactggttt atttaataaa 4200gttctatgtg atttttt 4217119923PRTHomo sapiens 119Phe Pro Ala Pro Ala Lys Ala Val Val Tyr Val Ser Asp Ile Gln Glu1 5 10 15Leu Tyr Ile Arg Val Val Asp Lys Val Glu Ile Gly Lys Thr Val Lys 20 25 30Ala Tyr Val Arg Val Leu Asp Leu His Lys Lys Pro Phe Leu Ala Lys 35 40 45Tyr Phe Pro Phe Met Asp Leu Lys Leu Arg Ala Ala Ser Pro Ile Ile 50 55 60Thr Leu Val Ala Leu Asp Glu Ala Leu Asp Asn Tyr Thr Ile Thr Phe65 70 75 80Leu Ile Arg Gly Val Ala Ile Gly Gln Thr Ser Leu Thr Ala Ser Val 85 90 95Thr Asn Lys Ala Gly Gln Arg Ile Asn Ser Ala Pro Gln Gln Ile Glu 100 105 110Val Phe Pro Pro Phe Arg Leu Met Pro Arg Lys Val Thr Leu Leu Ile 115 120 125Gly Ala Thr Met Gln Val Thr Ser Glu Gly Gly Pro Gln Pro Gln Ser 130 135 140Asn Ile Leu Phe Ser Ile Ser Asn Glu Ser Val Ala Leu Val Ser Ala145 150 155 160Ala Gly Leu Val Gln Gly Leu Ala Ile Gly Asn Gly Thr Val Ser Gly 165 170 175Leu Val Gln Ala Val Asp Ala Glu Thr Gly Lys Val Val Ile Ile Ser 180 185 190Gln Asp Leu Val Gln Val Glu Val Leu Leu Leu Arg Ala Val Arg Ile 195 200 205Arg Ala Pro Ile Met Arg Met Arg Thr Gly Thr Gln Met Pro Ile Tyr 210 215 220Val Thr Gly Ile Thr Asn His Gln Asn Pro Phe Ser Phe Gly Asn Ala225 230 235 240Val Pro Gly Leu Thr Phe His Trp Ser Val Thr Lys Arg Asp Val Leu 245 250 255Asp Leu Arg Gly Arg His His Glu Ala Ser Ile Arg Leu Pro Ser Gln 260 265 270Tyr Asn Phe Ala Met Asn Val Leu Gly Arg Val Lys Gly Arg Thr Gly 275 280 285Leu Arg Val Val Val Lys Ala Val Asp Pro Thr Ser Gly Gln Leu Tyr 290 295 300Gly Leu Ala Arg Glu Leu Ser Asp Glu Ile Gln Val Gln Val Phe Glu305 310 315 320Lys Leu Gln Leu Leu Asn Pro Glu Ile Glu Ala Glu Gln Ile Leu Met 325 330 335Ser Pro Asn Ser Tyr Ile Lys Leu Gln Thr Asn Arg Asp Gly Ala Ala 340 345 350Ser Leu Ser Tyr Arg Val Leu Asp Gly Pro Glu Lys Val Pro Val Val 355 360 365His Val Asp Glu Lys Gly Phe Leu Ala Ser Gly Ser Met Ile Gly Thr 370 375 380Ser Thr Ile Glu Val Ile Ala Gln Glu Pro Phe Gly Ala Asn Gln Thr385 390 395 400Ile Ile Val Ala Val Lys Val Ser Pro Val Ser Tyr Leu Arg Val Ser 405 410 415Met Ser Pro Val Leu His Thr Gln Asn Lys Glu Ala Leu Val Ala Val 420 425 430Pro Leu Gly Met Thr Val Thr Phe Thr Val His Phe His Asp Asn Ser 435 440 445Gly Asp Val Phe His Ala His Ser Ser Val Leu Asn Phe Ala Thr Asn 450 455 460Arg Asp Asp Phe Val Gln Ile Gly Lys Gly Pro Thr Asn Asn Thr Cys465 470 475 480Val Val Arg Thr Val Ser Val Gly Leu Thr Leu Leu Arg Val Trp Asp 485 490 495Ala Glu His Pro Gly Leu Ser Asp Phe Met Pro Leu Pro Val Leu Gln 500 505 510Ala Ile Ser Pro Glu Leu Ser Gly Ala Met Val Val Gly Asp Val Leu 515 520 525Cys Leu Ala Thr Val Leu Thr Ser Leu Glu Gly Leu Ser Gly Thr Trp 530 535 540Ser Ser Ser Ala Asn Ser Ile Leu His Ile Asp Pro Lys Thr Gly Val545 550 555 560Ala Val Ala Arg Ala Val Gly Ser Val Thr Val Tyr Tyr Glu Val Ala 565 570 575Gly His Leu Arg Thr Tyr Lys Glu Val Val Val Ser Val Pro Gln Arg 580 585 590Ile Met Ala Arg His Leu His Pro Ile Gln Thr Ser Phe Gln Glu Ala 595 600 605Thr Ala Ser Lys Val Ile Val Ala Val Gly Asp Arg Ser Ser Asn Leu 610 615 620Arg Gly Glu Cys Thr Pro Thr Gln Arg Glu Val Ile Gln Ala Leu His625 630 635 640Pro Glu Thr Leu Ile Ser Cys Gln Ser Gln Phe Lys Pro Ala Val Phe 645 650 655Asp Phe Pro Ser Gln Asp Val Phe Thr Val Glu Pro Gln Phe Asp Thr 660 665 670Ala Leu Gly Gln Tyr Phe Cys Ser Ile Thr Met His Arg Leu Thr Asp 675 680 685Lys Gln Arg Lys His Leu Ser Met Lys Lys Thr Ala Leu Val Val Ser 690 695 700Ala Ser Leu Ser Ser Ser His Phe Ser Thr Glu Gln Val Gly Ala Glu705 710 715 720Val Pro Phe Ser Pro Gly Leu Phe Ala Asp Gln Ala Glu Ile Leu Leu 725 730 735Ser Asn His Tyr Thr Ser Ser Glu Ile Arg Val Phe Gly Ala Pro Glu 740 745 750Val Leu Glu Asn Leu Glu Val Lys Ser Gly Ser Pro Ala Val Leu Ala 755 760 765Phe Ala Lys Glu Lys Ser Phe Gly Trp Pro Ser Phe Ile Thr Tyr Thr 770 775 780Val Gly Val Ser Asp Pro Ala Ala Gly Ser Gln Gly Pro Leu Ser Thr785 790 795 800Thr Leu Thr Phe Ser Ser Pro Val Thr Asn Gln Ala Ile Ala Ile Pro 805 810 815Val Thr Val Ala Phe Val Met Asp Arg Arg Gly Pro Gly Pro Tyr Gly 820 825 830Ala Ser Leu Phe Gln His Phe Leu Asp Ser Tyr Gln Val Met Phe Phe 835 840 845Thr Leu Phe Ala Leu Leu Ala Gly Thr Ala Val Met Ile Ile Ala Tyr 850 855 860His Thr Val Cys Thr Pro Arg Asp Leu Ala Val Pro Ala Ala Leu Thr865 870 875 880Pro Arg Ala Ser Pro Gly His Ser Pro His Tyr Phe Ala Ala Ser Ser 885 890 895Pro Thr Ser Pro Asn Ala Leu Pro Pro Ala Arg Lys Ala Ser Pro Pro 900 905 910Ser Gly Leu Trp Ser Pro Ala Tyr Ala Ser His 915 9201201270PRTHomo sapiens 120Arg Asp Phe Gln Ser Glu Val Leu Leu Ser Ala Met Glu Leu Phe His1 5 10 15Met Thr Ser Gly Gly Asp Ala Ala Met Phe Arg Asp Gly Lys Glu Pro 20 25 30Gln Pro Ser Ala Glu Ala Ala Ala Ala Pro Ser Leu Ala Asn Ile Ser 35 40 45Cys Phe Thr Gln Lys Leu Val Glu Lys Leu Tyr Ser Gly Met Phe Ser 50 55 60Ala Asp Pro Arg His Ile Leu Leu Phe Ile Leu Glu His Ile Met Val65 70 75 80Val Ile Glu Thr Ala Ser Ser Gln Arg Asp Thr Val Leu Ser Thr Leu 85 90 95Tyr Ser Ser Leu Asn Lys Val Ile Leu Tyr Cys Leu Ser Lys Pro Gln 100 105 110Gln Ser Leu Ser Glu Cys Leu Gly Leu Leu Ser Ile Leu Gly Phe Leu 115 120 125Gln Glu His Trp Asp Val Val Phe Ala Thr Tyr Asn Ser Asn Ile Ser 130 135 140Phe Leu Leu Cys Leu Met His Cys Leu Leu Leu Leu Asn Glu Arg Ser145 150 155 160Tyr Pro Glu Gly Phe Gly Leu Glu Pro Lys Pro Arg Met Ser Thr Tyr 165 170 175His Gln Val Phe Leu Ser Pro Asn Glu Asp Val Lys Glu Lys Arg Glu 180 185 190Asp Leu Pro Ser Leu Ser Asp Val Gln His Asn Ile Gln Lys Thr Val 195 200 205Gln Thr Leu Trp Gln Gln Leu Val Ala Gln Arg Gln Gln Thr Leu Glu 210 215 220Asp Ala Phe Lys Ile Asp Leu Ser Val Lys Pro Gly Glu Arg Glu Val225 230 235 240Lys Ile Glu Glu Val Thr Pro Leu Trp Glu Glu Thr Met Leu Lys Ala 245 250 255Trp Gln His Tyr Leu Ala Ser Glu Lys Lys Ser Leu Ala Ser Arg Ser 260 265 270Asn Val Ala His His Ser Lys Val Thr Leu Trp Ser Gly Ser Leu Ser 275 280 285Ser Ala Met Lys Leu Met Pro Gly Arg Gln Ala Lys Asp Pro Glu Cys 290 295 300Lys Thr Glu Asp Phe Val Ser Cys Ile Glu Asn Tyr Arg Arg Arg Gly305 310 315 320Gln Glu Leu Tyr Ala Ser Leu Tyr Lys Asp His Val Gln Arg Arg Lys 325 330 335Cys Gly Asn Ile Lys Ala Ala Asn Ala Trp Ala Arg Ile Gln Glu Gln 340 345 350Leu Phe Gly Glu Leu Gly Leu Trp Ser Gln Gly Glu Glu Thr Lys Pro 355 360 365Cys Ser Pro Trp Glu Leu Asp Trp Arg Glu Gly Pro Ala Arg Met Arg 370 375 380Lys Arg Ile Lys Arg Leu Ser Pro Leu Glu Ala Leu Ser Ser Gly Arg385 390 395 400His Lys Glu Ser Gln Asp Lys Asn Asp His Ile Ser Gln Thr Asn Ala 405 410 415Glu Asn Gln Asp Glu Leu Thr Leu Arg Glu Ala Glu Gly Glu Pro Asp 420 425 430Glu Val Gly Val Asp Cys Thr Gln Leu Thr Phe Phe Pro Ala

Leu His 435 440 445Glu Ser Leu His Ser Glu Asp Phe Leu Glu Leu Cys Arg Glu Arg Gln 450 455 460Val Ile Leu Gln Glu Leu Leu Asp Lys Glu Lys Val Thr Gln Lys Phe465 470 475 480Ser Leu Val Ile Val Gln Gly His Leu Val Ser Glu Gly Val Leu Leu 485 490 495Phe Gly His Gln His Phe Tyr Ile Cys Glu Asn Phe Thr Leu Ser Pro 500 505 510Thr Gly Asp Val Tyr Cys Thr Arg His Cys Leu Ser Asn Ile Ser Asp 515 520 525Pro Phe Ile Phe Asn Leu Cys Ser Lys Asp Arg Ser Thr Asp His Tyr 530 535 540Ser Cys Gln Cys His Ser Tyr Ala Asp Met Arg Glu Leu Arg Gln Ala545 550 555 560Arg Phe Leu Leu Gln Asp Ile Ala Leu Glu Ile Phe Phe His Asn Gly 565 570 575Tyr Ser Lys Phe Leu Val Phe Tyr Asn Asn Asp Arg Ser Lys Ala Phe 580 585 590Lys Ser Phe Cys Ser Phe Gln Pro Ser Leu Lys Gly Lys Ala Thr Ser 595 600 605Glu Asp Thr Leu Asn Leu Arg Arg Tyr Pro Gly Ser Asp Arg Ile Met 610 615 620Leu Gln Lys Trp Gln Lys Arg Asp Ile Ser Asn Phe Glu Tyr Leu Met625 630 635 640Tyr Leu Asn Thr Ala Ala Gly Arg Thr Cys Asn Asp Tyr Met Gln Tyr 645 650 655Pro Val Phe Pro Trp Val Leu Ala Asp Tyr Thr Ser Glu Thr Leu Asn 660 665 670Leu Ala Asn Pro Lys Ile Phe Arg Asp Leu Ser Lys Pro Met Gly Ala 675 680 685Gln Thr Lys Glu Arg Lys Leu Lys Phe Ile Gln Arg Phe Lys Glu Val 690 695 700Glu Lys Thr Glu Gly Asp Met Thr Val Gln Cys His Tyr Tyr Thr His705 710 715 720Tyr Ser Ser Ala Ile Ile Val Ala Ser Tyr Leu Val Arg Met Pro Pro 725 730 735Phe Thr Gln Ala Phe Cys Ala Leu Gln Gly Gly Ser Phe Asp Val Ala 740 745 750Asp Arg Met Phe His Ser Val Lys Ser Thr Trp Glu Ser Ala Ser Arg 755 760 765Glu Asn Met Ser Asp Val Arg Glu Leu Thr Pro Glu Phe Phe Tyr Leu 770 775 780Pro Glu Phe Leu Thr Asn Cys Asn Gly Val Glu Phe Gly Cys Met Gln785 790 795 800Asp Gly Thr Val Leu Gly Asp Val Gln Leu Pro Pro Trp Ala Asp Gly 805 810 815Asp Pro Arg Lys Phe Ile Ser Leu His Arg Lys Ala Leu Glu Ser Asp 820 825 830Phe Val Ser Ala Asn Leu His His Trp Ile Asp Leu Ile Phe Gly Tyr 835 840 845Lys Gln Gln Gly Pro Ala Ala Val Asp Ala Val Asn Ile Phe His Pro 850 855 860Tyr Phe Tyr Gly Asp Arg Met Asp Leu Ser Ser Ile Thr Asp Pro Leu865 870 875 880Ile Lys Ser Thr Ile Leu Gly Phe Val Ser Asn Phe Gly Gln Val Pro 885 890 895Lys Gln Leu Phe Thr Lys Pro His Pro Ala Arg Thr Ala Ala Gly Lys 900 905 910Pro Leu Pro Gly Lys Asp Val Ser Thr Pro Val Ser Leu Pro Gly His 915 920 925Pro Gln Pro Phe Phe Tyr Ser Leu Gln Ser Leu Arg Pro Ser Gln Val 930 935 940Thr Val Lys Asp Met Tyr Leu Phe Ser Leu Gly Ser Glu Ser Pro Lys945 950 955 960Gly Ala Ile Gly His Ile Val Ser Thr Glu Lys Thr Ile Leu Ala Val 965 970 975Glu Arg Asn Lys Val Leu Leu Pro Pro Leu Trp Asn Arg Thr Phe Ser 980 985 990Trp Gly Phe Asp Asp Phe Ser Cys Cys Leu Gly Ser Tyr Gly Ser Asp 995 1000 1005Lys Val Leu Met Thr Phe Glu Asn Leu Ala Ala Trp Gly Arg Cys Leu 1010 1015 1020Cys Ala Val Cys Pro Ser Pro Thr Thr Ile Val Thr Ser Gly Thr Ser1025 1030 1035 1040Thr Val Val Cys Val Trp Glu Leu Ser Met Thr Lys Gly Arg Pro Arg 1045 1050 1055Gly Leu Arg Leu Arg Gln Ala Leu Tyr Gly His Thr Gln Ala Val Thr 1060 1065 1070Cys Leu Ala Ala Ser Val Thr Phe Ser Leu Leu Val Ser Gly Ser Gln 1075 1080 1085Asp Cys Thr Cys Ile Leu Trp Asp Leu Asp His Leu Thr His Val Thr 1090 1095 1100Arg Leu Pro Ala His Arg Glu Gly Ile Ser Ala Ile Thr Ile Ser Asp1105 1110 1115 1120Val Ser Gly Thr Ile Val Ser Cys Ala Gly Ala His Leu Ser Leu Trp 1125 1130 1135Asn Val Asn Gly Gln Pro Leu Ala Ser Ile Thr Thr Ala Trp Gly Pro 1140 1145 1150Glu Gly Ala Ile Thr Cys Cys Cys Leu Met Glu Gly Pro Ala Trp Asp 1155 1160 1165Thr Ser Gln Ile Ile Ile Thr Gly Ser Gln Asp Gly Met Val Arg Val 1170 1175 1180Trp Lys Thr Glu Asp Val Lys Met Ser Val Pro Gly Arg Pro Ala Gly1185 1190 1195 1200Glu Glu Pro Leu Ala Gln Pro Pro Ser Pro Arg Gly His Lys Trp Glu 1205 1210 1215Lys Asn Leu Ala Leu Ser Arg Glu Leu Asp Val Ser Ile Ala Leu Thr 1220 1225 1230Gly Lys Pro Ser Lys Thr Ser Pro Ala Val Thr Ala Leu Ala Val Ser 1235 1240 1245Arg Asn His Thr Lys Leu Leu Val Gly Asp Glu Arg Gly Arg Ile Phe 1250 1255 1260Cys Trp Ser Ala Asp Gly1265 1270121647PRTHomo sapiens 121Met Leu Gln Lys Trp Gln Lys Arg Asp Ile Ser Asn Phe Glu Tyr Leu1 5 10 15Met Tyr Leu Asn Thr Ala Ala Gly Arg Thr Cys Asn Asp Tyr Met Gln 20 25 30Tyr Pro Val Phe Pro Trp Val Leu Ala Asp Tyr Thr Ser Glu Thr Leu 35 40 45Asn Leu Ala Asn Pro Lys Ile Phe Arg Asp Leu Ser Lys Pro Met Gly 50 55 60Ala Gln Thr Lys Glu Arg Lys Leu Lys Phe Ile Gln Arg Phe Lys Glu65 70 75 80Val Glu Lys Thr Glu Gly Asp Met Thr Val Gln Cys His Tyr Tyr Thr 85 90 95His Tyr Ser Ser Ala Ile Ile Val Ala Ser Tyr Leu Val Arg Met Pro 100 105 110Pro Phe Thr Gln Ala Phe Cys Ala Leu Gln Gly Gly Ser Phe Asp Val 115 120 125Ala Asp Arg Met Phe His Ser Val Lys Ser Thr Trp Glu Ser Ala Ser 130 135 140Arg Glu Asn Met Ser Asp Val Arg Glu Leu Thr Pro Glu Phe Phe Tyr145 150 155 160Leu Pro Glu Phe Leu Thr Asn Cys Asn Gly Val Glu Phe Gly Cys Met 165 170 175Gln Asp Gly Thr Val Leu Gly Asp Val Gln Leu Pro Pro Trp Ala Asp 180 185 190Gly Asp Pro Arg Lys Phe Ile Ser Leu His Arg Lys Ala Leu Glu Ser 195 200 205Asp Phe Val Ser Ala Asn Leu His His Trp Ile Asp Leu Ile Phe Gly 210 215 220Tyr Lys Gln Gln Gly Pro Ala Ala Val Asp Ala Val Asn Ile Phe His225 230 235 240Pro Tyr Phe Tyr Gly Asp Arg Met Asp Leu Ser Ser Ile Thr Asp Pro 245 250 255Leu Ile Lys Ser Thr Ile Leu Gly Phe Val Ser Asn Phe Gly Gln Val 260 265 270Pro Lys Gln Leu Phe Thr Lys Pro His Pro Ala Arg Thr Ala Ala Gly 275 280 285Lys Pro Leu Pro Gly Lys Asp Val Ser Thr Pro Val Ser Leu Pro Gly 290 295 300His Pro Gln Pro Phe Phe Tyr Ser Leu Gln Ser Leu Arg Pro Ser Gln305 310 315 320Val Thr Val Lys Asp Met Tyr Leu Phe Ser Leu Gly Ser Glu Ser Pro 325 330 335Lys Gly Ala Ile Gly His Ile Val Ser Thr Glu Lys Thr Ile Leu Ala 340 345 350Val Glu Arg Asn Lys Val Leu Leu Pro Pro Leu Trp Asn Arg Thr Phe 355 360 365Ser Trp Gly Phe Asp Asp Phe Ser Cys Cys Leu Gly Ser Tyr Gly Ser 370 375 380Asp Lys Val Leu Met Thr Phe Glu Asn Leu Ala Ala Trp Gly Arg Cys385 390 395 400Leu Cys Ala Val Cys Pro Ser Pro Thr Thr Ile Val Thr Ser Gly Thr 405 410 415Ser Thr Val Val Cys Val Trp Glu Leu Ser Met Thr Lys Gly Arg Pro 420 425 430Arg Gly Leu Arg Leu Arg Gln Ala Leu Tyr Gly His Thr Gln Ala Val 435 440 445Thr Cys Leu Ala Ala Ser Val Thr Phe Ser Leu Leu Val Ser Gly Ser 450 455 460Gln Asp Cys Thr Cys Ile Leu Trp Asp Leu Asp His Leu Thr His Val465 470 475 480Thr Arg Leu Pro Ala His Arg Glu Gly Ile Ser Ala Ile Thr Ile Ser 485 490 495Asp Val Ser Gly Thr Ile Val Ser Cys Ala Gly Ala His Leu Ser Leu 500 505 510Trp Asn Val Asn Gly Gln Pro Leu Ala Ser Ile Thr Thr Ala Trp Gly 515 520 525Pro Glu Gly Ala Ile Thr Cys Cys Cys Leu Met Glu Gly Pro Ala Trp 530 535 540Asp Thr Ser Gln Ile Ile Ile Thr Gly Ser Gln Asp Gly Met Val Arg545 550 555 560Val Trp Lys Thr Glu Asp Val Lys Met Ser Val Pro Gly Arg Pro Ala 565 570 575Gly Glu Glu Pro Leu Ala Gln Pro Pro Ser Pro Arg Gly His Lys Trp 580 585 590Glu Lys Asn Leu Ala Leu Ser Arg Glu Leu Asp Val Ser Ile Ala Leu 595 600 605Thr Gly Lys Pro Ser Lys Thr Ser Pro Ala Val Thr Ala Leu Ala Val 610 615 620Ser Arg Asn His Thr Lys Leu Leu Val Gly Asp Glu Arg Gly Arg Ile625 630 635 640Phe Cys Trp Ser Ala Asp Gly 64512232DNAArtificial SequenceDescription of Artificial SequencePCR amplification primer PDM-797 122gtgtcacaat ctacagtcag gcaggattct cc 3212335DNAArtificial SequenceDescription of Artificial SequencePCR amplification primer PDM-799 123gttatgtagc ggccgcttat catgttgctg cagag 35124980DNAHomo sapiens 124gccgctgccg ctccaggaga caggttccca tgcaggaatg aaagacatgg aagggaagag 60gggggccagc tccctgagtc ctgtgtccac cagctgctgc taaatacctc tgagaaactc 120tgcttctatc taaggggacc tacttctctc gggaatctca atacttggaa caagaacctc 180ctagacggac cctttggcat aatgaattgg accaactgta ggttccagga ctagagagcc 240agcaatgcct ccatgaacaa tctcacccaa ttactctgct caggaaacga ggtaactgat 300ggacagccga ggcagcccct taggcggctt aggcctcccc tgtggagcat ccctgaggcg 360gactccggcc agcccgagtg atgcgatcca aagagcactc ccgggtagga aattgccccg 420gtggaatgcc tcaccagagc agcgtgtagc agttccctgt ggaggattaa cacagtggct 480gaacaccggg aaggaactgg cacttggagt ccggacatct gaaacttgta gactgggagc 540tgtacatgga tgggagcagc ttcaccaacc cctgcaaagt gactctgaag aagacgacaa 600gccctgctcc agtcacaccc ggaagctgac tggtccacgc acagctgaag catgaggaaa 660ctcatcgcgg gactaatttt ccttaaaatt tagacttgca cagtaaggac ttcaactgac 720cttcctcaga ctgagaactg tttccagtat atacatcaag tcactgaggt aggacaaaag 780attgctacat tcctattatt ttaaggttac atttttgggg acccctcttt cttctgttct 840agctattacc tttcttgtgt cacctagaaa aggaccagtc cttaattgta ttttaaaaac 900tgtgatcatg ggaagcttta aattggttca ataacacgca tcaagttggt tatttcctgg 960gctacatacc ttggatagat 980

* * * * *