Diagnosis Of Systemic Lupus Erythematosus Using Protein, Peptide And Oligonucleotide Antigens

SOREK; Rachel ;   et al.

Patent Application Summary

U.S. patent application number 17/332178 was filed with the patent office on 2022-01-06 for diagnosis of systemic lupus erythematosus using protein, peptide and oligonucleotide antigens. This patent application is currently assigned to YEDA RESEARCH AND DEVELOPMENT CO., LTD.. The applicant listed for this patent is YEDA RESEARCH AND DEVELOPMENT CO., LTD.. Invention is credited to Irun R. COHEN, Keren JAKOBI, Anat REINER-BENAIM, Pennina SAFER, Rachel SOREK.

Application Number20220003762 17/332178
Document ID /
Family ID1000005839551
Filed Date2022-01-06

United States Patent Application 20220003762
Kind Code A1
SOREK; Rachel ;   et al. January 6, 2022

DIAGNOSIS OF SYSTEMIC LUPUS ERYTHEMATOSUS USING PROTEIN, PEPTIDE AND OLIGONUCLEOTIDE ANTIGENS

Abstract

Methods and kits for diagnosing or monitoring systemic lupus erythematosus (SLE) in a subject are provided. Particularly, the present invention relates to a specific antibody reactivity profile useful in diagnosing or monitoring SLE in a subject.


Inventors: SOREK; Rachel; (Zafaria, IL) ; JAKOBI; Keren; (Reut-Maccabim, IL) ; SAFER; Pennina; (Rechovot, IL) ; REINER-BENAIM; Anat; (Binyamina, IL) ; COHEN; Irun R.; (Rehovot, IL)
Applicant:
Name City State Country Type

YEDA RESEARCH AND DEVELOPMENT CO., LTD.

Rehovot

IL
Assignee: YEDA RESEARCH AND DEVELOPMENT CO., LTD.
Rehovot
IL

Family ID: 1000005839551
Appl. No.: 17/332178
Filed: May 27, 2021

Related U.S. Patent Documents

Application Number Filing Date Patent Number
15555258 Sep 1, 2017 11047855
PCT/IL2016/050229 Feb 29, 2016
17332178
62126616 Mar 1, 2015
62181231 Jun 18, 2015
62249284 Nov 1, 2015

Current U.S. Class: 1/1
Current CPC Class: G01N 2800/104 20130101; G16B 40/20 20190201; G01N 33/564 20130101; G16B 40/00 20190201; G01N 33/6854 20130101
International Class: G01N 33/564 20060101 G01N033/564; G01N 33/68 20060101 G01N033/68; G16B 40/20 20060101 G16B040/20

Claims



1. A method of detecting the levels of antibodies in a sample of a subject having or suspected of having systemic lupus erythematosus (SLE), the method comprising: (i) obtaining a serum, plasma or blood sample from the subject; (ii) exposing antibodies in the sample to at least four antigens, the antigens comprising ssDNA, U1 snRNP and Ro52, and at least one of: Sm, DNAse I, Histone H2a, Collagen III and Apo-SAA; and (iii) detecting in said sample the levels of antibodies reactive with the at least four antigens by quantifying the amount of antigen-antibody complex formed for each antigen, wherein the amount of antigen-antibody complex is indicative of the level of each respective antibody in said sample.

2. The method of claim 1, comprising detecting the levels of IgG antibodies to ssDNA, U1 snRNP, Sm, Apo-SAA and Ro52, and levels of IgM antibodies to H2a, in said sample and comparing the reactivity of antibodies in the sample to a reactivity of a healthy control by quadratic discriminant analysis (QDA).

3. The method of claim 1, comprising detecting the levels of IgG antibodies to ssDNA, U1 snRNP, Ro52, Collagen III and Apo-SAA, and levels of IgM antibodies in the sample to Histone III-S, in said sample and comparing the reactivity of antibodies in the sample to a reactivity of a healthy control by logistic regression (LR).

4. The method of claim 1, comprising detecting the levels of IgG antibodies to ssDNA, Sm, DNAse I, Ro52 and U1 snRNP, and levels of IgM antibodies to Histone III-S, in said sample, and comparing the reactivity of antibodies in the sample to a reactivity of a healthy control by support vector machines (SVMs).

5. The method of claim 1, wherein said sample is a serum sample.

6. The method of claim 1, wherein the antigens are provided in the form of an antigen array, or an antigen chip.
Description



CROSS-REFERENCE TO RELATED APPLICATIONS

[0001] This application is a continuation application of U.S. application Ser. No. 15/555,258, filed on Sep. 1, 2017, which is a 371 application of International Application No. PCT/IL2016/050229, filed on Feb. 29, 2016, which claims priority to U.S. Application No. 62/126,616, filed on Mar. 1, 2015, U.S. Application No. 62/181,231, filed on Jun. 18, 2015, and U.S. Application No. 62/249,284, filed on Nov. 1, 2015, which are all incorporated herein by reference.

SEQUENCE LISTING

[0002] The Sequence Listing submitted herewith as an ASCII text file (2021-07-20_Sequence Listing.txt, created on Jul. 20, 2021, 104469 bytes) via EFS-Web is hereby incorporated by reference.

FIELD OF THE INVENTION

[0003] The present invention relates to protein, peptide, polynucleotide and oligonucleotide antigens useful in diagnosing or monitoring an autoimmune disorder such as systemic lupus erythematosus (SLE) in a subject.

BACKGROUND OF THE INVENTION

[0004] Systemic lupus erythematosus (SLE) is a chronic, recurrent, potentially fatal multisystem inflammatory disorder mainly affecting women. SLE is associated with a large spectrum of autoantibodies. IgG antibodies to more than 100 different antigens including DNA, nucleosomes, histones, viral antigens, transcription factors and more have been reported in different SLE patients (Sherer et al., 2004, Semin. Arthritis. Rheum. 34:501-37). Surprisingly, there is no serologic diagnosis of SLE and SLE is diagnosed on the basis of eleven criteria defined by the American College of Rheumatology (ACR). These criteria include malar rash, discoid rash, photosensitivity, oral ulcers, arthritis, serositis, renal disorder, neurologic disorder, hematologic disorder (e.g., leucopenia, lymphopenia, hemolytic anemia or thrombocytopenia), immunologic disorder and antibody abnormalities (particularly anti-nuclear antibodies (ANA) and anti-DNA antibodies) (Tan et al., 1997, Arthritis Rheum 1997, 40:1725). According to these criteria, subjects can be clinically diagnosed with SLE if they meet at least four of the eleven criteria. Recently, the Systemic Lupus Collaborating Clinics (SLICC) revised these criteria, as reviewed in Petri et al. (Arthritis and Rheumatism, 2012, Vol. 64, pages 2677-2686). Nevertheless, SLE is still possible even in cases when less than four criteria are present.

[0005] ANA laboratory testing is one of the standards in SLE diagnosis; negative test results help `rule out` SLE in >95% of cases. Positive ANAs lack specificity, since any antibody to nuclear components is an ANA and can occur in many autoimmune rheumatic diseases, chronic inflammatory and infectious diseases, malignancies, and can also be induced by certain drugs. Furthermore, the unaffected healthy population is estimated to be 20% ANA positive (ANA+) at a 1:80 serum dilution-level. As a result, ANA+test results contribute to the false positive rate of the laboratory testing component when initially diagnosing SLE. Therefore, it is recommended that ANA+test results be followed by testing for antigen-specific ANAs, including anti double stranded DNA (dsDNA) and anti-Smith antibodies; however, these are detectable in only .about.30% of SLE patients.

[0006] Although the precise pathology of SLE is not clear, it is widely accepted that autoantibodies play an important role. Autoantibodies to DNA are highly heterogeneous with respect to their avidity, immunoglobulin subclass composition, cross-reactivity and complement fixing ability. A number of techniques have been utilized for DNA autoantibodies detection, including immunofluorescent assays (IFA), enzyme-linked immunosorbent assays (ELISAs) and radioimmunoassays (RIA). However, the clinical value of anti-dsDNA antibodies largely depends on the assay principle and analytical variables of the methods used to quantitate and immunologically characterize them.

[0007] F.J. Quintana et al. ("Antigen-chip technology for accessing global information about the state of the body", Lupus, 2006, Vol. 15(7), pages 428-30) describe the use of microarray technology and informatics to develop an antigen chip capable of detecting global patterns of antibodies binding to hundreds of antigens simultaneously. Lupus is disclosed to be one of the interests of the authors.

[0008] J.G. Hanly at al. ("Measurement of autoantibodies using multiplex methodology in patients with systemic lupus erythematosus", Journal of Immunological Methods, 2010, Vol. 352, pages 147-152) have compared laser bead immunoassay technology to more traditional measures of autoantibody detection in diagnosis and assessment of systemic lupus erythematosus (SLE). The autoantigens used included, for example, dsDNA, Sm, and RNP.

[0009] Q.Z. Li et al. ("Protein array autoantibody profiles for insights into systemic lupus erythematosus and incomplete lupus syndromes", Clinical & Experimental Immunology, 2006, Vol. 147 (1), pages 60-70) investigated the prevalence and clinical significance of a spectrum of autoantibodies in systemic lupus erythematosus and incomplete lupus syndromes using a proteome microarray bearing 70 autoantigens, such as ssDNA and U1 snRNP.

[0010] W.H. Robinson et al. ("Autoantigen microarrays for multiplex characterization of autoantibody responses", Nature Medicine, 2002, Vol. 8, pages 295-301) describe and characterize arrays bearing 196 autoantigens containing the major autoantigens in eight distinct human autoimmune diseases, including systemic lupus erythematosus. The autoantigens included, for example, ssDNA, Sm/RNP and U1 snRNP.

[0011] Flares occur in approximately 80% of patients during the course of their disease [Petri M, et al., Am J Med 1991; 91: 345-54], and generally require the introduction or increase in dose of a variety of potentially toxic therapies. The morbidity and mortality associated with flares can be substantial, and is related to organ damage resulting from active SLE per se and the adverse effects of corticosteroids and immunosuppressive drugs [Abu-Sharaka M, et al., J Rheumatol 1995; 22: 1259-64].

[0012] One of the most difficult challenges in clinical management of complex autoimmune diseases such as SLE is the accurate and early identification of the disease in a patient and differentiation between patients with active disease (flare) and those with non-active disease (in remission). There remains a need for improved diagnostic methods and kits useful in diagnosing SLE in a subject.

SUMMARY OF THE INVENTION

[0013] The present invention provides methods and kits for diagnosing or monitoring an autoimmune disorder, particularly systemic lupus erythematosus (SLE). The present invention further provides antigen probe arrays for practicing such a diagnosis, and antigen probe sets for generating such arrays.

[0014] The present invention is based, in part, on the unexpected results obtained when testing the antibody reactivity of SLE patients compared to healthy controls. Surprisingly, significantly different immunoglobulin G (IgG) and IgM reactivities to specific protein, peptide, polynucleotide and oligonucleotide antigens were found in the tested SLE patients, compared to healthy controls. Thus, the present invention provides unique protein, peptide polynucleotide and oligonucleotide antigens indicative to SLE. The present invention further provides antigen-autoantibody reactivity patterns relevant to SLE. In particular embodiments, the present invention provides highly sensitive, specific, reliable, accurate and discriminatory assays for diagnosing SLE, based on the indicative protein, peptide and oligonucleotide antigens, or on reactivity patterns thereof.

[0015] The present invention is also based, in part; on the use of specific classifiers involve machine learning algorithms on pre-selected features which contain the highest ranking of information discriminating SLE samples from healthy controls. For example, the logistic regression (LR) analysis of a particular antibody immune signature as described herein, provided an assay for diagnosing SLE with remarkably high sensitivity and specificity (0.98 and 0.59, respectively). The present invention is further based, in part, on the unexpected finding that the antibody reactivity profile in serum of SLE patients was clearly distinct from healthy control individuals.

[0016] Thus, according to embodiments of the invention, there are provided novel methods for diagnosing, ruling out a diagnosis, and monitoring the progression of SLE. According to embodiments of the invention, there are provided methods for diagnosing pre-lupus state or early lupus state.

[0017] According to embodiments of the invention, the methods comprise determining the reactivity of antibodies in a sample obtained or derived from a subject to a plurality of antigens as described herein. The methods of the invention further comprise a step of comparing the reactivity of antibodies in the sample to the plurality of antigens to control reactivity to said plurality of antigens. According to certain embodiments, a significantly different reactivity of the antibodies in the sample compared to the reactivity of the healthy control is an indication that the subject is afflicted with SLE.

[0018] According to a first aspect, the present invention provides a method of diagnosing SLE in a subject suspected of having SLE, the method comprising the steps of: obtaining a sample from the subject; determining the reactivity of antibodies in the sample to at least four antigens selected from the group consisting of ssDNA, Sm, DNAse I, Histone Ro52 (TRIM21), U1 snRNP, Collagen III, Apo-SAA, H2a and Oligo21, thereby determining the reactivity pattern of the sample to the plurality of antigens; and comparing the reactivity of antibodies in the sample to a reactivity of a healthy control by a supervised classification algorithm; wherein a significantly different reactivity of the antibodies in the sample compared to the reactivity of the healthy control is an indication that the subject is afflicted with SLE.

[0019] According to another embodiment, the source of said ssDNA antigen is selected from a human, a salmon and a calf. According to said embodiment, determining the reactivity of ssDNA comprises determining the reactivity of ssDNA from a human source, a salmon source or reactivity of ssDNA from a calf source. According to another embodiment, the source of said ssDNA antigen is a calf.

[0020] According to some embodiments, the reactivity of antibodies comprises IgG reactivities, IgM reactivities, or any combination thereof. According to some embodiments, the reactivity of the antibodies comprises increased IgG and IgM reactivities.

[0021] According to certain embodiments, the supervised classification algorithm is selected from the group consisting of a decision tree classifier, logistic regression (LR) classifier, nearest neighbor classifier, neural network classifier, Gaussian mixture model (GMM), Support Vector Machine (SVM) classifier, nearest centroid classifier, linear regression classifier, linear discriminant analysis (LDA) classifier, quadratic discriminant analysis (QDA) classifier and random forest classifier.

[0022] According to some embodiments, the supervised classification algorithm is selected from the group consisting of support vector machines (SVMs), logistic regression (LR), quadratic discriminant analysis (QDA), and linear discriminant analysis (LDA).

[0023] According to some embodiments, the method of the present invention comprising determining the reactivities of IgG antibodies in the sample to ssDNA, Sm, DNAse I, Ro52 and U1 snRNP, determining the reactivities of IgM antibodies in the sample to Histone III-S, and comparing the reactivity of antibodies in the sample to a reactivity of a healthy control by support vector machines (SVMs).

[0024] According to some embodiments, the method of the present invention comprising determining the reactivities of IgG antibodies in the sample to ssDNA, U1 snRNP, Ro52, Collagen III and Apo-SAA, determining the reactivities of IgM antibodies in the sample to Histone III-S, and comparing the reactivity of antibodies in the sample to a reactivity of a healthy control by logistic regression (LR).

[0025] According to some embodiments, the method of the present invention comprising determining the reactivities of IgG antibodies in the sample to ssDNA, U1 snRNP, Sm, Apo-SAA and Ro52, determining the reactivities of IgM antibodies in the sample to H2a, and comparing the reactivity of antibodies in the sample to a reactivity of a healthy control by quadratic discriminant analysis (QDA).

[0026] According to some embodiments, the method of the present invention comprising determining the reactivities of IgG antibodies in the sample to ssDNA, U1 snRNP and Sm, determining the reactivities of IgM antibodies in the sample to Histone U1 snRNP and Oligo21, and comparing the reactivity of antibodies in the sample to a reactivity of a healthy control by linear discriminant analysis (LDA).

[0027] According to some embodiments, the method of the present invention comprising determining the reactivities of antibodies in the sample to ssDNA, U1 snRNP and Histone III-S.

[0028] According to some embodiments, the logistic regression (LR), quadratic discriminant analysis (QDA), and linear discriminant analysis (LDA) have predicted probability values of 0-1.

[0029] According to some embodiments of the methods of the present invention, the sample obtained from the subject is a biological fluid. According to some embodiments, the sample is selected from the group consisting of plasma, serum, blood, cerebrospinal fluid, synovial fluid, sputum, urine, saliva, tears, lymph specimen, or any other biological fluid known in the art. Each possibility represents a separate embodiment of the invention. According to certain embodiments, the sample obtained from the subject is selected from the group consisting of serum, plasma and blood. According to one embodiment, the sample is a serum sample.

[0030] According to certain embodiments of the methods of the present invention, the control is selected from the group consisting of a sample from at least one healthy individual, a panel of control samples from a set of healthy individuals, a baseline sample from same subject, and a stored set of data from healthy individuals. Typically, a healthy individual is a subject not afflicted with SLE (or any other form of lupus). In another embodiment, a healthy individual is a subject not afflicted with an autoimmune disease.

[0031] According to certain embodiments, the subject is positive for an antinuclear antibody (ANA) test. According to certain embodiments, the method of the present invention can be used to rule out a diagnosis of SLE. According to certain embodiments, the method of the present invention can be used to rule in a diagnosis of SLE.

[0032] According to another aspect, the present invention provides a method for classifying a subject as having systemic lupus erythematosus (SLE) in an active phase or in a non-active phase, the method comprising the steps of: obtaining a sample from the subject; determining the reactivity of antibodies in the sample to at least four antigens selected from Table 8, thereby determining the reactivity pattern of the sample to the plurality of antigens; calculating a score based on the reactivity of antibodies in the sample by a supervised classification algorithm and comparing said score to a pre-determined threshold level; wherein a significantly different reactivity of the antibodies in the sample with a score above the pre-determined threshold level, is an indication that the subject is afflicted with an active phase of SLE

[0033] According to certain embodiments, the supervised classification algorithm is selected from the group consisting of support vector machines (SVMs), logistic regression (LR), and classification and regression tree (CART).

[0034] According to another aspect the present invention provides a kit for the diagnosis or monitoring of SLE in a subject comprising the plurality of antigens of the invention or a subset thereof.

[0035] According to another aspect, the present invention provides an antigen probe set comprising the plurality of antigen probes of the invention, or a subset thereof.

[0036] According to another aspect, the present invention provides an article of manufacture comprising the antigen probe set of the present invention.

[0037] According to another aspect, there is provided use of an antigen probe set comprising a plurality of antigen probes of the invention, for the preparation of a diagnostic kit for diagnosing SLE in a subject. Said diagnostic kit is, in some embodiments, useful for determining the reactivity of antibodies in a sample, thereby determining the reactivity pattern of the sample to said plurality of antigens. In some embodiments, a significant difference between the reactivity pattern of said sample compared to a reactivity pattern of a control sample is an indication for SLE.

[0038] Other objects, features and advantages of the present invention will become clear from the following description and drawings.

BRIEF DESCRIPTION OF THE DRAWINGS

[0039] FIG. 1 demonstrates the performance of five classification methods (SVM, LR, QDA, LDA and Majority Vote) in the validation test. Validation of the performance of the different classification methods was performed on a set of 50 SLE patients and 50 healthy controls comparing the sensitivity (y-axis) and the specificity (1-specificity, x-axis) of the classification methods. Dashed lines represent the thresholds.

[0040] FIG. 2 demonstrates the verification of four classification methods (SVM, LR, QDA and LDA) as was performed by determining the area under the Receiver Operating Characteristics (ROC) curve for each classification method. The area under the curve (AUC) calculated were 0.95, 0.95, 0.95 and 0.94 respectively.

[0041] FIG. 3 demonstrates the distribution of score/probabilities (y-axis) of the validation samples (x-axis) using the SVM classifier for the separation between samples originating from healthy control (circle labeled) and samples originating from SLE patients (cross labeled). Dotted line represents the threshold.

[0042] FIG. 4 demonstrates the distribution of score/probabilities (y-axis) of the validation samples (x-axis) using the LDA classifier for the separation between samples originating from healthy control (circle labeled) and samples originating from SLE patients (cross labeled). Dotted line represents the threshold.

[0043] FIG. 5 demonstrates the distribution of score/probabilities (y-axis) of the validation samples (x-axis) using the QDA classifier for the separation between samples originating from healthy control (circle labeled) and samples originating from SLE patients (cross labeled). Dotted line represents the threshold.

[0044] FIG. 6 demonstrates the distribution of score/probabilities (y-axis) of the validation samples (x-axis) using the LR classifier for the separation between samples originating from healthy control (circle labeled) and samples originating from SLE patients (cross labeled). Dotted line represents the threshold.

[0045] FIG. 7 demonstrates the correlation heat-map for selected antigens using the logistic regression classifier for the prediction of disease activity.

[0046] FIG. 8 demonstrates the correlation heat-map for selected antigens using the SVM classifier for the prediction of disease activity.

[0047] FIG. 9 demonstrates the correlation heat-map for selected antigens using the CART classifier for the prediction of disease activity.

[0048] FIG. 10 demonstrates the ROC curve model performance comparison.

[0049] FIG. 11 demonstrates the total misclassification rate model performance comparison.

[0050] FIG. 12 demonstrates the agreement between the referring physician's clinical impression and the SLE-test results for both ANA+ and ANA- subjects, for the four classification models (SVM, LR, QDA, LDA).

[0051] FIG. 13 demonstrates the correlation between SLE disease activity Index (SLEDAI) and LDA Score (from the validation sample set).

DETAILED DESCRIPTION OF THE INVENTION

[0052] The present invention provides methods of diagnosing an autoimmune disease or disorder, specifically systemic lupus erythematosus (SLE), in a subject. The present invention further provides antigen probe sets or arrays for practicing such a diagnosis, and identifies specific antigen probe sets for generating such arrays. The platform technology of the present invention apply novel biomarker signature to measure changes in immune system response by observing changes in autoantibodies. The present invention can predict response prior to therapy and identify adverse events prior to irreversible injury/damage. The present invention can objectively identify disease or healthy state such as excluding SLE from patient with unclear diagnosis and excluding SLE from otherwise healthy ANA(+) patient. The classification methods of the present invention can be used to support decision making to the diagnosis or ruling out of SLE. The methods may also be used to track patients' immune profiles over time to monitor changes in disease state and/or response to therapy.

[0053] Without wishing to be bound by any particular theory or mechanism of action, the invention is based, in part, on the finding of unique, highly distinctive antibody reactivity profiles in serum of SLE patients, clearly distinct from healthy control individuals. Although serum autoantibodies have been extensively investigated in SLE, the unique antibody immune signatures as described herein have not been described before. Advantageously, the unique antibody signatures of the present disclosure provide highly sensitive and specific assays for diagnosing SLE or for ruling out a diagnosis of SLE.

[0054] The methods of the present invention allow the determination of the pattern of circulating antibodies to said array of antigens. This pattern is compared to SLE affected and healthy control patterns. The classifier algorithms of the present invention are used to determine the likelihood of the patient being affected with SLE, along with a probability score.

[0055] Further, the present invention provides, in some embodiments, unique antigen-autoantibody reactivity patterns particularly relevant to SLE. As exemplified herein below, a quadratic discriminant classifier (QDA) analysis including the following antigens: ssDNA, U1 snRNP, Sm, Apo-SAA and Ro52, H2a, exhibited an AUC value of 95%. Additional SLE-related antigens are presented herein below in Table 1.

[0056] As exemplified herein below, antigen analysis of autoantibodies (e.g., using microarray analysis) can identify serum autoantibody patterns associated with SLE. In particular embodiments, the methods of the invention are based on collective autoantibody patterns. The informative patterns include, in some embodiments, decreases and increases of IgG antibodies as well as decreases and increases of IgM antibodies, relative to those found in healthy controls.

[0057] In some embodiment, the method comprises: obtaining a sample from a subject; determining the reactivity of IgG and/or IgM antibodies in the sample to the plurality of antigens described herein; thereby determining the reactivity pattern of the sample to the plurality of antigens; and comparing the reactivity pattern of said sample to a control reactivity pattern; wherein a significant difference between the reactivity pattern of said sample obtained from the subject compared to the reactivity pattern of a control sample is an indication that the subject is afflicted with SLE.

[0058] In some embodiment, the plurality of antigens for discriminating SLE and healthy controls is selected from the group consisting of: ssDNA, Sm, DNAse I, Histone IIIS, Ro52, U1 snRNP, Collagen III, Apo-SAA, H2a and Oligo21. In particular embodiments, said plurality of antigens comprises Histone HIS and at least one, at least two, at least three, at least four, at least five, at least six or at least seven antigens selected from the group consisting of: U1 snRNP, ssDNA, Sm, DNAse I, Ro52, Collagen III, Apo-SAA, H2a and Oligo21.

[0059] In some embodiment, the plurality of antigens for discriminating SLE and healthy controls is selected from the group of antigens listed in Table 1 (SEQ ID NO: 1-11) and any combinations thereof.

TABLE-US-00001 TABLE 1 List of SLE related antigens used for discriminating SLE and healthy controls SEQ ID Manufacture Antigen NO: Full name/amino acid or oligonucleotide sequence (catalog no.) DNAse 1 Deoxyribonuclease I AKRON biotech I MRGMKLLGALLALAALLQGAVSLKIAAFNIQTFGETKMSNATLVSYIVQILSR (AK3778) YDIALVQEVRDSHLTAVGKLLDNLNQDAPDTYHYVVSEPLGRNSYKERYLFVY RPDQVSAVDSYYYDDGCEPCGNDTFNREPAIVRFFSRFTEVREFAIVPLHAAP GDAVAEIDALYDVYLDVQEKWGLEDVMLMGDFNAGCSYVRPSQWSSIRLWTSP TFQWLIPDSADTTATPTHCAYDRIVVAGMLLRGAVVPDSALPFNFQAAYGLSD QLAQAISDHYPVEVMLK salmon single stranded DNA (salmon testes) Sigma (D9156) ssDNA calf single stranded DNA (calf) Sigma (D8899) ssDNA Histone Histone from calf thymus Sigma (H5505) IIIS (Type III-S) Collagen 2 Type III collagen AKRON biotech III MMSFVQKGSWLLLALLHPTIILAQQEAVEGGCSHLGQSYADRDVWKPEPCQIC (AK9914) VCDSGSVLCDDIICDDQELDCPNPEIPFGECCAVCPQPPTAPTRPPNGQGPQG PKGDPGPPGIPGRNGDPGIPGQPGSPGSPGPPGICESCPTGPQNYSPQYDSYD VKSGVAVGGLAGYPGPAGPPGPPGPPGTSGHPGSPGSPGYQGPPGEPGQAGPS GPPGPPGAIGPSGPAGKDGESGRPGRPGERGLPGPPGIKGPAGIPGFPGMKGH RGFDGRNGEKGETGAPGLKGENGLPGENGAPGPMGPRGAPGERGRPGLPGAAG ARGNDGARGSDGQPGPPGPPGTAGFPGSPGAKGEVGPAGSPGSNGAPGQRGEP GPQGHAGAQGPPGPPGINGSPGGKGEMGPAGIPGAPGLMGARGPPGPAGANGA PGLRGGAGEPGKNGAKGEPGPRGERGEAGIPGVPGAKGEDGKDGSPGEPGANG LPGAAGERGAPGFRGPAGPNGIPGEKGPAGERGAPGPAGPRGAAGEPGRDGVP GGPGMRGMPGSPGGPGSDGKPGPPGSQGESGRPGPPGPSGPRGQPGVMGFPGP KGNDGAPGKNGERGGPGGPGPQGPPGKNGETGPQGPPGPTGPGGDKGDTGPPG PQGLQGLPGTGGPPGENGKPGEPGPKGDAGAPGAPGGKGDAGAPGERGPPGLA GAPGLRGGAGPPGPEGGKGAAGPPGPPGAAGTPGLQGMPGERGGLGSPGPKGD KGEPGGPGADGVPGKDGPRGPTGPIGPPGPAGQPGDKGEGGAPGLPGIAGPRG SPGERGETGPPGPAGFPGAPGQNGEPGGKGERGAPGEKGEGGPPGVAGPPGGS GPAGPPGPQGVKGERGSPGGPGAAGFPGARGLPGPPGSNGNPGPPGPSGSPGK DGPPGPAGNTGAPGSPGVSGPKGDAGQPGEKGSPGAQGPPGAPGPLGIAGITG ARGLAGPPGMPGPRGSPGPQGVKGESGKPGANGLSGERGPPGPQGLPGLAGTA GEPGRDGNPGSDGLPGRDGSPGGKGDRGENGSPGAPGAPGHPGPPGPVGPAGK SGDRGESGPAGPAGAPGPAGSRGAPGPQGPRGDKGETGERGAAGIKGHRGFPG NPGAPGSPGPAGQQGAIGSPGPAGPRGPVGPSGPPGKDGTSGHPGPIGPPGPR GNRGERGSEGSPGHPGQPGPPGPPGAPGPCCGGVGAAAIAGIGGEKAGGFAPY YGDEPMDFKINTDEIMTSLKSVNGQIESLISPDGSRKNPARNCRDLKFCHPEL KSGEYWVDPNQGCKLDAIKVFCNMETGETCISANPLNVPRKHWWTDSSAEKKH VWFGESMDGGFQFSYGNPELPEDVLDVHLAFLRLLSSRASQNITYHCKNSIAY MDQASGNVKKALKLMGSNEGEFKAEGNSKFTYTVLEDGCTKHTGEWSKTVFEY RTRKAVRLPIVDIAPYDIGGPDQEFGVDVGPVCFL U1 Small Nuclear Ribonucleoprotein 70 kDa Human Prospec (pro-445) snRNP Recombinant Ro52 3 52 kDa Ro protein Prospec (PRO-328) MASAARLTMMWEEVTCPICLDPFVEPVSIECGHSFCQECISQVGKGGGSVCPV CRQRFLLKNLRPNRQLANMVNNLKEISQEAREGTQGERCAVHGERLHLFCEKD GKALCWVCAQSRKHRDHAMVPLEEAAQEYQEKLQVALGELRRKQELAEKLEVE IAIKRADWKKTVETQKSRIHAEFVQQKNFLVEEEQRQLQELEKDEREQLRILG EKEAKLAQQSQALQELISELDRRCHSSALELLQEVIIVLERSESWNLKDLDIT SPELRSVCHVPGLKKMLRTCAVHITLDPDTANPWLILSEDRRQVRLGDTQQSI PGNEERFDSYPMVLGAQHFHSGKHYWEVDVTGKEAWDLGVCRDSVRRKGHFLL SSKSGFWTIWLWNKQKYEAGTYPQTPLHLQVPPCQVGIFLDYEAGMVSFYNIT DHGSLIYSFSECAFTGPLRPFFSPGFNDGGKNTAPLTLCPLNIGSQGSTDY Sm Smith antigen US Biological (s1014-29F) Apo- 4 Recombinant Human Apo-SAA Peprotec (300-13) SAA MRSFFSFLGE AFDGARDMWR AYSDMREANY IGSDKYFHAR GNYDAAKRGP GGVWAAEAIS NARENIQRFF GRGAEDSLAD QAANEWGRSG KDPNHFRPAG LPEKY H2a 5 Histone H2A Human Sigma (H9250) MSGRGKQGGKARAKAKTRSSRAGLQFPVGRVHRLLRKGNYAERVGAGAPVYLA AVLEYLTAEILELAGNAARDNKKTRIIPRHLQLAIRNDEELNKLLGKVTIAQG GVLPNIQAVLLPKKTESHHKAKGK Oligo21 6 TTA GGG TTA GGG TTA GGG TTA GGG SBSGenetechCo.,Ltd

Definitions

[0060] It is to be understood that the terminology used herein is for the purpose of describing particular embodiments only and is not intended to be limiting. It must be noted that, as used in the specification and the appended claims, the singular forms "a," "an" and "the" include plural referents unless the context clearly dictates otherwise.

[0061] The term "about" as used herein means in quantitative terms plus or minus 5%, or in another embodiment plus or minus 10%, or in another embodiment plus or minus 15%, or in another embodiment plus or minus 20%.

[0062] The terms "systemic lupus erythematosus", "lupus" and "SLE" as used herein are interchangeable, and generally refer to an autoimmune disease characterized by the criteria set by the 1982 American College of Rheumatology (ACR) for the diagnosis of SLE, and/or by the Systemic Lupus Collaborating Clinics (SLICC) revised criteria, reviewed in Petri et al. (Arthritis and Rheumatism, 2012, Vol. 64, pages 2677-2686).

[0063] "An SLE flare" is defined herein as either SLEDAI score>4 or physician assessed flares. The rationale for including physician assessed flares is that the SLEDAI does not cover some rarer manifestations of flares, e.g. gut vasculitis, new onset of peripheral neuropathy.

[0064] As used herein, `incipient flare` means the sub-clinical, beginning, early or emerging stages of a flare.

[0065] As used herein, `quiescent state` means the subject is not experiencing a clinical flare, in the presence or absence of serological indications. About 10% of SLE patients are "clinically quiescent, biochemically active" (i.e. no clinical flare though existing biomarkers suggest the presence of a flare) and another 10% of SLE patients are "clinically active, biochemically quiescent" (i.e. clinical flare though existing biomarkers do not suggest the presence of a flare).

[0066] The terms "patient," "individual," or "subject" are used interchangeably herein, and refer to a mammal, particularly, a human. The patient may have mild, intermediate or severe disease. The patient may be treatment naive, responding to any form of treatment, or refractory. The patient may be an individual in need of treatment or in need of diagnosis based on particular symptoms or family history. In some cases, the terms may refer to treatment in experimental animals, in veterinary application, and in the development of animal models for disease, including, but not limited to, rodents including mice, rats, and hamsters; and primates.

[0067] The term "healthy control" as used herein refers to a healthy individual; a plurality of healthy individuals, a data set or value corresponding to or obtained from a healthy individual or a plurality of healthy individuals. According to some embodiments, the control group comprises patients with other rheumatologic disorders or baseline sample from same patient.

[0068] As used herein the term "reference control" means a value that statistically correlates to a particular outcome when compared to an assay result. The reference value may be a threshold score value or a cutoff score value. Typically a reference value will be a threshold above which one outcome is more probable and below which an alternative threshold is more probable.

[0069] The terms "measuring", "detecting" and "determining" are used interchangeably throughout, and refer to methods which include obtaining a patient sample and detecting reactivity of antibodies in a sample. In some embodiments, the terms refer to obtaining a patient sample and detecting the reactivity of antibodies in the sample to one or more antigens. Measuring can be accomplished by methods known in the art and those further described herein.

[0070] The terms "sample," "patient sample," "biological sample," and the like, encompass a variety of sample types obtained from a patient, individual, or subject and can be used in a diagnostic or monitoring assay. The patient sample may be obtained from a healthy subject, a diseased patient or a patient having associated symptoms of SLE. Moreover, a sample obtained from a patient can be divided and only a portion may be used for diagnosis. Further, the sample, or a portion thereof, can be stored under conditions to maintain sample for later analysis. The definition specifically encompasses blood and other liquid samples of biological origin (including, but not limited to, peripheral blood, serum, plasma, cerebrospinal fluid, urine, saliva, stool and synovial fluid). In a specific embodiment, a sample comprises a blood sample. In another embodiment, a serum sample is used. The definition also includes samples that have been manipulated in any way after their procurement, such as by centrifugation, filtration, precipitation, dialysis, chromatography, treatment with reagents, washed or enriched for certain cell populations. The terms further encompass a clinical sample, and also include cells in culture, cell supernatants, tissue samples, organs, and the like. Samples may also comprise fresh-frozen and/or formalin-fixed, paraffin-embedded tissue blocks, such as blocks prepared from clinical or pathological biopsies, prepared for pathological analysis or study by immunohistochemistry. The samples may be tested immediately after collection, after storage at RT, 4 degrees, -20 degrees, or -80 degrees Celsius. After storage for 24 hours, 1 week, 1 month, 1 year, 10 years or up to 30 years.

[0071] As used herein, the term "autoantibodies" refers to antibodies that are capable of reacting against an antigenic constituent of an individual's own tissue or cells (e.g., the antibodies recognize and bind to "self-antigens").

[0072] Unless otherwise indicated, the term "oligonucleotide antigen" as used herein relates to a nucleotide sequence of between 15 and 40 nucleotides in length, alternatively between 17 and 28 nucleotides in length, or between 18-25 nucleotides in length. In certain embodiments, an oligonucleotide antigen consists of at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, at least 16, at least 24, or more contiguous nucleotides. Each possibility represents a separate embodiment of the invention. In certain embodiments, an antigen consists of not more than 50, not more than 45, not more than 40, not more than 35, not more than 30, not more than 25, not more than 20, not more than 16, or less contiguous nucleotides. Each possibility represents a separate embodiment of the invention. In certain embodiments, an antigen consists of 10-30, 15-25 or 17-20 contiguous nucleotides. In certain embodiments, an antigen consists of 17, 18, 19 or 20 contiguous nucleotides.

[0073] The nomenclature used to refer to the oligonucleotide sequence of the oligonucleotide antigen disclosed in the present invention is as follows: an oligonucleotide antigen consisting of the oligonucleotide sequence of X2Y3Z2, i.e. two oligonucleotides of X followed by three oligonucleotides of Y followed by two oligonucleotides of Z is labeled as X2Y3Z2, (X)2(Y)3(Z)2, or XXYYYZZ, or referred to by its corresponding SEQ ID NO. It should be understood that in this example, X, Y and Z may relate to more than one oligonucleotide, e.g. to 2-20 oligonucleotides. Therefore, an oligonucleotide antigen consisting of the oligonucleotide sequence of X2, wherein X is a stretch of e.g. two oligonucleotides, e.g. YZ, is labeled as X2, (X)2, or YZYZ, or referred to by its corresponding SEQ ID NO.

[0074] Unless otherwise indicated, the terms "peptide antigen" as used herein relate to an amino-acid sequence of between 15 and 40 amino-acids in length, alternatively between 17 and 28 amino-acids in length, or between 18-25 amino-acids in length. In certain embodiments, a peptide antigen consists of at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, at least 16, or more contiguous amino-acids. Each possibility represents a separate embodiment of the invention. In certain embodiments, an antigen consists of not more than 50, not more than 45, not more than 40, not more than 35, not more than 30, not more than 25, not more than 20, not more than 16, or less contiguous amino-acids. Each possibility represents a separate embodiment of the invention. In certain embodiments, an antigen consists of 10-30, 15-25 or 17-20 contiguous amino-acids. In certain embodiments, an antigen consists of 17, 18, 19 or 20 contiguous amino-acids.

[0075] As defined herein, "U1 SnRNP" refers to a ribonuclear protein, which is conserved between species. In a particular embodiment, Small Nuclear Ribonucleoprotein 70 kDa Human Recombinant (U1 SnRNP) is commercially available, e.g., from Prospec, catalog number pro-445.

[0076] "DNAse I" is considered the major serum nuclease. DNAse I is the founding member of the DNAse I-like family of divalent cation-dependent endonucleases. In a particular embodiment, DNAse I antigen is commercially available, e.g., from AKRON biotech, catalog number AK3778.

[0077] "Histones" are the chief protein components of chromatin. They act as spools around which DNA winds and they play a role in gene regulation. Six major histone classes are known: H1 (sometimes called the linker histone; also related to Histone H5); H2A; H2B; H3; H4; and archaeal histones. Two each of the class H2A, H2B, H3 and H4, so-called core histones, assemble to form one octameric nucleosome core particle by wrapping 146 base pairs of DNA around the protein spool in 1.65 left-handed super-helical turn. The linker histone H1 binds the nucleosome and the entry and exit sites of the DNA, thus locking the DNA into place and allowing the formation of higher order structure. The most basic such formation is the 10 nm fiber or beads on a string conformation. This involves the wrapping of DNA around nucleosomes with approximately 50 base pairs of DNA spaced between each nucleosome (also referred to as linker DNA). The assembled histones and DNA is called chromatin. Higher order structures include the 30 nm fiber (forming an irregular zigzag) and 100 nm fiber, these being the structures found in normal cells. During mitosis and meiosis, the condensed chromosomes are assembled through interactions between nucleosomes and other regulatory proteins. In a particular embodiment, Histone H2A human antigen is commercially available, e.g., from Sigma Aldrich, catalog number H9250. In another particular embodiment, Histone Type III-S calf antigen is commercially available, e.g., from Sigma Aldrich, catalog number H5505.

[0078] Single Strand Deoxyribonucleic Acid (ssDNA)

[0079] The reactivity of antibodies to the ssDNA antigen may be determined according to techniques known in the art. The ssDNA antigen may be obtained from any source, such as but not limit to, calf, human, horse, pig or bovine source. In a particular embodiment, ssDNA has a CAS number of 91080-16-9. The ssDNA antigen is commercially available, e.g., from Sigma Aldrich, catalog number D8899.

[0080] Collagen Type III

[0081] Type III collagen is the second most abundant collagen in human tissues and occurs particularly in tissues exhibiting elastic properties, such as skin, blood vessels and various internal organs. Mutations of type III collagen cause the most severe form of Ehlers-Danlos syndrome, EDS IV, which affect arteries, internal organs, joints and skin, and may cause sudden death when the large arteries rupture. In a particular embodiment, the type III collagen antigen of the present invention is a Bornstein and Traub Type III collagen, e.g., from human placenta. The reactivity of antibodies to the collagen-III antigen may be determined according to techniques known in the art. In a particular embodiment, collagen-III has a CAS number of 9007-34-5. The collagen-III antigen is commercially available, e.g., from Sigma Aldrich, catalog number C4407.

[0082] Ro52

[0083] The function of the Ro52 protein has not been fully established, although a role in ubiquitination and other regulatory processes has been proposed. Ro52 includes several predicted functional domains; two zinc-finger motifs are situated in the N-terminal region and a SPRY-region is near the C-terminus. The central part of Ro52 consists of a coiled-coil region, including a leucine zipper comprising amino acid (aa) residues 200-232. Leucine zippers, which contain periodic repeats of leucine amino acids every seventh residue, give rise to a helical structure, and are likely to be of importance for the correct folding of the protein, as well as its interaction with other molecules. Based on an analysis of sequence similarity, the 475 amino acid (aa) protein Ro52 belongs to the tripartite motif (TRIM) family. In a particular embodiment, the Ro52 antigen is commercially available, e.g., from Prospec catalog number PRO-328.

[0084] Sm

[0085] Sm antigen is a non-histone nuclear protein composed of several polypeptides of differing molecular weights. They include B (26 kD), B'(27 kD), and D (13 kD). The principle reactivity has been shown to reside in the B, B', and D polypeptides. The Sm antigen is involved in normal post-transcriptional, premessenger RNA processing to excise introns. It has been demonstrated that the Sm antigenicity is both RNase and DNase resistant and partially resistant to tryptic digestion. In a particular embodiment, the Sm antigen is commercially available, e.g., from US Biological catalog number s1014-29F.

[0086] Apo-SAA

[0087] Human Apo-SAA is a 104 amino acid polypeptide that circulates primarily in association with high-density lipoproteins (HDL). The level of Apo-SAA, normally 1-5 .mu.g/ml in plasma, increases 500-1000 fold within 24 hours of an inflammatory stimulus and, under these conditions, is the most abundant HDL apolipoprotein. The human SAA gene codes for a 122 amino acid polypeptide, which contains an 18 amino acid N-terminal signal sequence. Recombinant Apo-SAA is a consensus SAA molecule corresponding to human Apo-SAA1a, except for the presence of an N-terminal methionine, the substitution of asparagine for aspartic acid at position 60, and arginine for histidine at position 71 (the latter two substituted residues are present in Apo-SAA2.beta.). The calculated molecular weight of Recombinant Human Apo-SAA is 11.7 kDa. In a particular embodiment, the Apo-SAA antigen is commercially available, e.g., from Peprotec catalog number 300-13.

[0088] In terms of "functional analogues", it is well understood by those skilled in the art, that inherent in the definition of a biologically functional polypeptide or peptide analogue is the concept that there is a limit to the number of changes that may be made within a defined portion of the molecule and still result in a molecule with an acceptable level of equivalent biological activity. A plurality of distinct polypeptides or peptides with different substitutions may easily be made and used in accordance with the invention. It is also understood that certain residues are particularly important to the biological or structural properties of a polypeptide, and such residues may not generally be exchanged.

[0089] Functional analogues can be generated by conservative or non-conservative amino acid substitutions. Amino acid substitutions are generally based on the relative similarity of the amino acid side-chain substituents, for example, their hydrophobicity, hydrophilicity, charge, size and the like. Thus, within the scope of the invention, conservative amino acid changes means, an amino acid change at a particular position which is of the same type as originally present; i.e. a hydrophobic amino acid exchanged for a hydrophobic amino acid, a basic amino acid for a basic amino acid, etc. Examples of conservative substitutions include the substitution of non-polar (hydrophobic) residues such as isoleucine, valine, leucine or methionine for another, the substitution of one polar (hydrophilic) residue for another such as between arginine and lysine, between glutamine and asparagine, between glycine and serine, the substitution of one basic residue such as lysine, arginine or histidine for another, or the substitution of one acidic residue, such as aspartic acid or glutamic acid for another, the substitution of a branched chain amino acid, such as isoleucine, leucine, or valine for another, the substitution of one aromatic amino acid, such as phenylalanine, tyrosine or tryptophan for another. Such amino acid changes result in functional analogues in that they do not significantly alter the overall charge and/or configuration of the polypeptide. Examples of such conservative changes are well-known to the skilled artisan and are within the scope of the present invention. Conservative substitution also includes the use of a chemically derivatized residue in place of a non-derivatized residue provided that the resulting polypeptide is a biologically functional equivalent to the polypeptide antigens.

[0090] As used herein, the "reactivity of antibodies in a sample" or "reactivity of an antibody in a sample" to "an antigen" or to "a plurality of antigens" refers to the immune reactivity of at least one antibody in the sample to at least one specific antigen selected from the plurality of antigens. The immune reactivity of the antibody to the antigen, i.e. its ability to specifically bind the antigen, may be used to determine the amount of the antibody in the sample. The calculated levels of each one of the tested antibodies in the sample are collectively referred to as the reactivity pattern of the sample to these antigens. The reactivity pattern of the sample reflects the levels of each one of the tested antibodies in the sample, thereby providing a quantitative assay. In a preferred embodiment, the antibodies are quantitatively determined.

[0091] A "significant difference" between reactivity patterns refers, in different embodiments, to a statistically significant difference, or in other embodiments to a significant difference as recognized by a skilled artisan. In yet another preferred embodiment, a significant (quantitative) difference between the reactivity pattern of the sample obtained from the subject compared to the control reactivity pattern is an indication that the subject is afflicted with SLE. In specific embodiments, up-regulation or higher reactivity of the reactivity of an antibody in a sample to an antigen refers to an increase (i.e., elevation) of about at least two, about at least three, about at least four, or about at least five times higher (i.e., greater) than the reactivity levels of the antibody to the antigen in the control. In another embodiment, down-regulation or lower reactivity of the reactivity of an antibody in a sample to an antigen refers to a decrease (i.e., reduction) of about at least two, about at least three, about at least four, or about at least five times lower than the reactivity levels of the antibody to the antigen in the control.

[0092] In particular embodiments, said significant difference is determined using a cutoff of a positive predictive value (PPV) of at least 70%, at least 85%, at least 90%. Determining a PPV for a selected marker (e.g., an antigen) is well known to the ordinarily skilled artisan and is exemplified in the methods described below. Typically, positivity for an antigen is determined if it detected above 10% of the subjects in a specific study subgroup using a selected cutoff value, such as PPV.gtoreq.90%. For example, antigen i is determined to specifically characterize group A if it detected at least 10% of the subjects in group A with a PPV.gtoreq.90% when compared to a different test group B. Subjects in group A that are above the cutoff of PPV.gtoreq.90% for antigen i are considered to be positive for antigen i.

[0093] An antibody "directed to" an antigen, as used herein is an antibody which is capable of specifically binding the antigen. Determining the levels of antibodies directed to a plurality of antigens includes measuring the level of each antibody in the sample, wherein each antibody is directed to a specific antigen, including but not limited to, an antigen selected from Table 1. This step is typically performed using an immunoassay, as detailed herein.

[0094] In other embodiments, determining the reactivity of antibodies in said sample to said plurality of antigens, (and the levels of each one of the tested antibodies in the sample) is performed by a process comprising contacting the sample, under conditions such that a specific antigen-antibody complex may be formed, with an antigen probe set comprising said plurality of antigens, and quantifying the amount of antigen-antibody complex formed for each antigen probe. The amount of antigen-antibody complex is indicative of the level of the tested antibody in the sample (or the reactivity of the sample with the antigen).

[0095] In another embodiment the method comprises determining the reactivity of at least one IgG antibody and at least one IgM antibody in the sample to the plurality of antigens. In another embodiment, the method comprises determining the reactivity of a plurality of IgG antibodies and at least one IgM antibody in the sample to the plurality of antigens. In another embodiment, the method comprises determining the reactivity of at least one IgG antibody and a plurality of IgM antibodies in the sample to the plurality of antigens. According to another embodiment, the method comprises determining the reactivity of antibodies in the sample to a plurality of oligonucleotide antigens.

[0096] The reactivity of antibodies to the plurality of the antigens may be determined according to techniques known in the art. Typically, determining the reactivity of antibodies in the sample to the plurality of antigens is performed using an immunoassay. Advantageously, the plurality of antigens may be used in the form of an antigen array.

[0097] Antigen Probes and Antigen Probe Sets

[0098] According to further embodiments, the invention provides antigen probes and antigen probe sets useful for diagnosing SLE, as detailed herein.

[0099] The invention further provides a plurality of antigens also referred to herein as antigen probe sets. These antigen probe sets comprise a plurality of antigens which are reactive specifically with the sera of subjects having SLE. According to the principles of the invention, the plurality of antigens may advantageously be used in the form of an antigen array. According to some embodiments the antigen array is conveniently arranged in the form of an antigen chip.

[0100] A "probe" as used herein means any compound capable of specific binding to a component. According to one aspect, the present invention provides an antigen probe set comprising a plurality of antigens selected from Table 1. In one embodiment, said plurality of antigens is selected from the group consisting of: ssDNA, Sm, DNAse I, Histone Ro52, U1 snRNP, Collagen III, Apo-SAA, H2a and Oligo21, or any combinations or subset thereof.

[0101] As exemplified herein below, a subject suspected of having SLE can be differentiated from healthy controls by assaying and determining IgG and/or IgM antibody reactivities in a sample obtained from said subject. The reactivity of antibodies to the plurality of antigens of the invention may be determined according to techniques known in the art. Further, the antigens used in the present invention are known in the art and are commercially available, e.g., from Prospec or Sigma-Aldrich.

[0102] Preferably, the plurality of antigens of the methods and kits of the invention comprises a set of the antigens as disclosed herein. Yet in other embodiments, the plurality of antigens (or the antigen probe set) comprises or consists of a subset thereof, e.g. at least 2, 3, 4, 5, 6, 7, 8, 9 or 10 different antigens, each selected from the antigens of the present invention. Each possibility represents a separate embodiment of the invention. Such subsets may be selected so as to result in optimal sensitivity and/or specificity of the diagnostic assay.

[0103] Antigen probes to be used in the assays of the invention may be purified or synthesized using methods well known in the art. For example, an antigenic protein or peptide may be produced using known recombinant or synthetic methods, including, but not limited to, solid phase (e.g. Boc or f-Moc chemistry) and solution phase synthesis methods (Stewart and Young, 1963; Meienhofer, 1973; Schroder and Lupke, 1965; Sambrook et al., 2001). One of skill in the art will possess the required expertise to obtain or synthesize the antigen probes of the invention. Table 1 lists the SLE-related antigens of the invention as well as a non-limiting characterization of said antigens. Some antigen probes are also commercially available, e.g. from Prospec (Ness-Ziona, Israel) or Sigma Aldrich or additional manufactures listed in Table 1.

[0104] It should be noted, that the invention utilizes antigen probes as well as homologs, fragments, isoforms, partial sequences, mutant forms, post translationally modified forms, and derivatives thereof, as long as these homologs, fragments, isoforms, partial sequences, mutant forms, post translationally modified forms and derivatives are immunologically cross-reactive with these antigen probes. The term "immunologically cross-reactive" as used herein refers to two or more antigens that are specifically bound by the same antibody. The term "homolog" as used herein refers to a peptide or oligonucleotide which having at least 70%, at least 75%, at least 80%, at least 85% or at least 90% identity to the antigen's amino acid or nucleotide sequence. Cross-reactivity can be determined by any of a number of immunoassay techniques, such as a competition assay (measuring the ability of a test antigen to competitively inhibit the binding of an antibody to its known antigen).

[0105] The term "fragment" as used herein refers to a portion of a polypeptide, or polypeptide analog which remains immunologically cross-reactive with the antigen probes, e.g., to recognize immuno-specifically the target antigen. The fragment may have the length of about 40%, about 50%, about 60%, about 70%, about 80%, about 85%, about 90% or about 95% of the respective antigen.

[0106] The term peptide typically refers to a polypeptide of up to about 50 amino acid residues in length. According to particular embodiments, the antigenic peptides of the invention may be about 10-100, 10-80, 10-75, 10-50 or about 10-30 amino acids in length.

[0107] The term encompasses native peptides (including degradation products, synthetically synthesized peptides, or recombinant peptides), peptidomimetics (typically, synthetically synthesized peptides), and the peptide analogues peptoids and semipeptoids, and may have, for example, modifications rendering the peptides more stable while in a body or more capable of penetrating into cells. Such modifications include, but are not limited to: N-terminus modifications; C-terminus modifications; peptide bond modifications, including but not limited to CH.sub.2--NH, CH.sub.2--S, CH.sub.2-S=O, O=C--NH, CH.sub.2--O, CH.sub.2--CH.sub.2, S.dbd.C--NH, CH.dbd.CH, and CF=CH; backbone modifications; and residue modifications. According to some embodiments, the peptide antigens of the invention are BSA-conjugated peptides.

[0108] The antigens of the invention may be used having a terminal carboxy acid, as a carboxy amide, as a reduced terminal alcohol or as any pharmaceutically acceptable salt, e.g., as metal salt, including sodium, potassium, lithium or calcium salt, or as a salt with an organic base, or as a salt with a mineral acid, including sulfuric acid, hydrochloric acid or phosphoric acid, or with an organic acid e.g., acetic acid or maleic acid.

[0109] Functional derivatives consist of chemical modifications to amino acid side chains and/or the carboxyl and/or amino moieties of said peptides. Such derivatized molecules include, for example, those molecules in which free amino groups have been derivatized to form amine hydrochlorides, p-toluene sulfonyl groups, carbobenzoxy groups, t-butyloxycarbonyl groups, chloroacetyl groups or formyl groups. Free carboxyl groups may be derivatized to form salts, methyl and ethyl esters or other types of esters or hydrazides. Free hydroxyl groups may be derivatized to form O-acyl or O-alkyl derivatives. The imidazole nitrogen of histidine may be derivatized to form N-im-benzylhistidine. Also included as chemical derivatives, are those polypeptides, which contain one or more naturally occurring or modified amino acid derivatives of the twenty standard amino acid residues. For example: 4-hydroxyproline may be substituted for proline; 5-hydroxylysine may be substituted for lysine; 3-methylhistidine may be substituted for histidine; homoserine may be substituted or serine; and ornithine may be substituted for lysine.

[0110] The amino acid residues described herein are in the "L" isomeric form, unless otherwise indicated. However, residues in the "D" isomeric form can be substituted for any L-amino acid residue, as long as the peptide substantially retains the desired antibody specificity.

[0111] Suitable analogs may be readily synthesized by now-standard peptide synthesis methods and apparatus or recombinant methods. All such analogs will essentially be based on the antigens of the invention as regards their amino acid sequence but will have one or more amino acid residues deleted, substituted or added. When amino acid residues are substituted, such conservative replacements which are envisaged are those which do not significantly alter the structure or antigenicity of the polypeptide. For example basic amino acids will be replaced with other basic amino acids, acidic ones with acidic ones and neutral ones with neutral ones. In addition to analogs comprising conservative substitutions as detailed above, analogs comprising non-conservative amino acid substitutions are further contemplated, as long as these analogs are immunologically cross reactive with an antigen of the invention.

[0112] In other aspects, there are provided nucleic acids encoding these peptides, vectors comprising these nucleic acids and host cells containing them. These nucleic acids, vectors and host cells are readily produced by recombinant methods known in the art (see, e.g., Sambrook et al., 2001). For example, an isolated nucleic acid sequence encoding an antigen of the invention can be obtained from its natural source, either as an entire (i.e., complete) gene or a portion thereof. A nucleic acid molecule can also be produced using recombinant DNA technology (e.g., polymerase chain reaction (PCR) amplification, cloning) or chemical synthesis. Nucleic acid sequences include natural nucleic acid sequences and homologs thereof, including, but not limited to, natural allelic variants and modified nucleic acid sequences in which nucleotides have been inserted, deleted, substituted, and/or inverted in such a manner that such modifications do not substantially interfere with the nucleic acid molecule's ability to encode a functional peptide of the present invention.

[0113] According to the principles of the invention the kits comprise a plurality of antigens also referred to herein as antigen probe sets. These antigen probe sets comprising a plurality of antigens are reactive specifically with the sera of subjects having SLE. According to the principles of the invention, the plurality of antigens may advantageously be used in the form of an antigen array. According to some embodiments the antigen array is conveniently arranged in the form of an antigen chip.

[0114] According to another aspect, the present invention provides an article of manufacture comprising the at least one of the antigen probe sets described above.

[0115] In certain embodiments, the article of manufacture is in the form of an antigen probe array or in the form of an antigen chip or in the form of a dipstick or in the form of a lateral flow test or any other platform known to those skilled in the art. An "antigen probe array" generally refers to a plurality of antigen probes, either mixed in a single container or arranges in to or more containers. An "antigen chip" generally refers to a substantially two dimensional surface, onto which a plurality of antigens are attached or adhered. A "dipstick" generally refers to an object, onto which a plurality of antigens are attached or adhered, which is dipped into a liquid to perform a chemical test or to provide a measure of quantity found in the liquid. A "lateral flow test" generally refers to devices intended to detect the presence (or absence) of a target analyte in sample (matrix) without the need for specialized and costly equipment. In certain embodiments, the article of manufacture is in the form of a kit.

[0116] According to certain embodiments, the kit further comprises means for determining the reactivity of antibodies in a sample to at least one antigen of the plurality of antigens. According to another embodiment, the kit further comprises means for comparing reactivity of antibody in different samples to at least one antigen of the plurality of antigens. According to another embodiment, the kit further comprises instructions for use. For example, the aforementioned means may include reagents, detectable labels and/or containers which may be used for measuring specific binding of antibodies to the antigen probes of the invention. "Means" as used herein may also refer to devices, reagents and chemicals, such as vials, buffers and written protocols or instructions, used to perform biological or chemical assays.

[0117] In other embodiments, the kit may further comprise means for determining the reactivity of antibodies in a sample to the plurality of antigens. For example, the kit may contain reagents, detectable labels and/or containers which may be used for measuring specific binding of antibodies to the antigen probes of the invention. In a particular embodiment, said kit is in the form of an antigen array. In some embodiments, said kit comprises means for comparing reactivity patterns of antibodies in different samples to the plurality of antigens. In other embodiments, said kit may further comprise negative and/or positive control samples.

[0118] For example, a negative control sample may contain a sample from at least one healthy individual (e.g., an individual not-afflicted with SLE). A positive control may contain a sample from at least one individual afflicted with SLE, or a subtype of SLE which is being diagnosed. Other non-limiting examples are a panel of control samples from a set of healthy individuals or diseased individuals, or a stored set of data from control individuals.

[0119] Antibodies, Samples and Immunoassays

[0120] Antibodies, or immunoglobulins, comprise two heavy chains linked together by disulfide bonds and two light chains, each light chain being linked to a respective heavy chain by disulfide bonds in a "Y" shaped configuration. Each heavy chain has at one end a variable domain (VH) followed by a number of constant domains (CH). Each light chain has a variable domain (VL) at one end and a constant domain (CL) at its other end, the light chain variable domain being aligned with the variable domain of the heavy chain and the light chain constant domain being aligned with the first constant domain of the heavy chain (CH1). The variable domains of each pair of light and heavy chains form the antigen binding site.

[0121] The isotype of the heavy chain (gamma, alpha, delta, epsilon or mu) determines immunoglobulin class (IgG, IgA, IgD, IgE or IgM, respectively). The light chain is either of two isotypes (kappa, .kappa. or lambda, .lamda.) found in all antibody classes.

[0122] It should be understood that when the terms "antibody" or "antibodies" are used, this is intended to include intact antibodies, such as polyclonal antibodies or monoclonal antibodies (mAbs), as well as proteolytic fragments thereof such as the Fab or F(ab').sub.2 fragments. Further included within the scope of the invention (for example as immunoassay reagents, as detailed herein) are chimeric antibodies; recombinant and engineered antibodies, and fragments thereof.

[0123] Exemplary functional antibody fragments comprising whole or essentially whole variable regions of both light and heavy chains are defined as follows: (i) Fv, defined as a genetically engineered fragment consisting of the variable region of the light chain and the variable region of the heavy chain expressed as two chains; (ii) single-chain Fv ("scFv"), a genetically engineered single-chain molecule including the variable region of the light chain and the variable region of the heavy chain, linked by a suitable polypeptide linker, (iii) Fab, a fragment of an antibody molecule containing a monovalent antigen-binding portion of an antibody molecule, obtained by treating whole antibody with the enzyme papain to yield the intact light chain and the Fd fragment of the heavy chain, which consists of the variable and CH1 domains thereof (iv) Fab', a fragment of an antibody molecule containing a monovalent antigen-binding portion of an antibody molecule, obtained by treating whole antibody with the enzyme pepsin, followed by reduction (two Fab' fragments are obtained per antibody molecule); and (v) F(ab')2, a fragment of an antibody molecule containing a monovalent antigen-binding portion of an antibody molecule, obtained by treating whole antibody with the enzyme pepsin (i.e., a dimer of Fab' fragments held together by two disulfide bonds).

[0124] The term "antigen" as used herein is a molecule or a portion of a molecule capable of being bound by an antibody. The antigen is typically capable of inducing an animal to produce antibody capable of binding to an epitope of that antigen. An antigen may have one or more epitopes. The specific reaction referred to above is meant to indicate that the antigen will react, in a highly selective manner, with its corresponding antibody and not with the multitude of other antibodies which may be evoked by other antigens. An "antigenic peptide" is a peptide which is capable of specifically binding an antibody.

[0125] In another embodiment, detection of the capacity of an antibody to specifically bind an antigen probe may be performed by quantifying specific antigen-antibody complex formation. The term "specifically bind" as used herein means that the binding of an antibody to an antigen probe is not competitively inhibited by the presence of non-related molecules.

[0126] In certain embodiments, the method of the present invention is performed by determining the capacity of an antigen of the invention to specifically bind antibodies of the IgG isotype, or, in other embodiments, antibodies of the IgM, isolated from a subject.

[0127] Methods for obtaining suitable antibody-containing biological samples from a subject are well within the ability of those of skill in the art. Typically, suitable samples comprise whole blood and products derived therefrom, such as plasma and serum. In other embodiments, other antibody-containing samples may be used, e.g. CSF, urine and saliva samples.

[0128] Numerous well known fluid collection methods can be utilized to collect the biological sample from the subject in order to perform the methods of the invention.

[0129] In accordance with the present invention, any suitable immunoassay can be used with the subject peptides. Such techniques are well known to the ordinarily skilled artisan and have been described in many standard immunology manuals and texts. In certain preferable embodiments, determining the capacity of the antibodies to specifically bind the antigen probes is performed using an antigen probe array-based method. Preferably, the array is incubated with suitably diluted serum of the subject so as to allow specific binding between antibodies contained in the serum and the immobilized antigen probes, washing out unbound serum from the array, incubating the washed array with a detectable label-conjugated ligand of antibodies of the desired isotype, washing out unbound label from the array, and measuring levels of the label bound to each antigen probe.

[0130] In various embodiments, the method of the present invention further comprises diluting the sample before performing the determining step. In one embodiment, the sample is diluted 1:2, for instance, using PBS. In another embodiment, the sample is diluted 1:4, 1:6, 1:8, 1:15, 1:20, 1:50, or preferably 1:10. Each possibility represents a separate embodiment of the present invention. In another embodiment, the sample is diluted in the range of times 2-times 10. In another embodiment, the sample is diluted in the range of times 4-times 10. In another embodiment, the sample is diluted in the range of times 6-times 10. In another embodiment, the sample is diluted in the range of times 8-times 10.

[0131] The Antigen Chip

[0132] Antigen microarrays are used for the high-throughput characterization of the immune response (Robinson et al., 2002, Nat Med 8, 295-301), and have been used to analyze immune responses in vaccination and in autoimmune disorders (Robinson et al., 2002; Robinson et al., 2003, Nat Biotechnol. 21, 1033-9; Quintana et al., 2004; Kanter et al., 2006, Nat Med 12, 138-43). It has been hypothesized, that patterns of multiple reactivities may be more revealing than single antigen-antibody relationships (Quintana et al., 2006, Lupus 15, 428-30) as shown in previous analyses of autoimmune repertoires of mice (Quintana et al., 2004; Quintana et al., 2001, J Autoimmun 17, 191-7) and humans (Merbl et al., 2007, J Clin Invest 117, 712-8; Quintana et al., 2003, J Autoimmun 21, 65-75) in health and disease. Thus, autoantibody repertoires have the potential to provide both new insights into the pathogenesis of the disease and to serve as immune biomarkers (Cohen, 2007, Nat Rev Immunol. 7, 569-74) of the disease process.

[0133] According to some aspects the methods of the present invention may be practiced using antigen arrays as disclosed in WO 02/08755 and U.S. 2005/0260770, the contents of which are incorporated herein by reference. WO 02/08755 is directed to a system and an article of manufacture for clustering and thereby identifying predefined antigens reactive with undetermined immunoglobulins of sera derived from patient subjects in need of diagnosis of disease or monitoring of treatment. Further disclosed are diagnostic methods, and systems useful in these methods, employing the step of clustering a subset of antigens of a plurality of antigens, said subset of antigens being reactive with a plurality of antibodies being derived from a plurality of patients, and associating or disassociating the antibodies of a subject with the resulting cluster.

[0134] U.S. Pat. App. Pub. No. 2005/0260770 discloses an antigen array system and diagnostic uses thereof. The application provides a method of diagnosing an immune disease, particularly diabetes type 1, or a predisposition thereto in a subject, comprising determining a capacity of immunoglobulins of the subject to specifically bind each antigen probe of an antigen probe set. The teachings of said disclosures are incorporated in their entirety as if fully set forth herein.

[0135] In other embodiments, various other immunoassays may be used, including, without limitation, enzyme-linked immunosorbent assay (ELISA), flow cytometry with multiplex beads (such as the system made by Luminex), surface plasmon resonance (SPR), elipsometry, and various other immunoassays which employ, for example, laser scanning, light detecting, photon detecting via a photo-multiplier, photographing with a digital camera based system or video system, radiation counting, fluorescence detecting, electronic, magnetic detecting and any other system that allows quantitative measurement of antigen-antibody binding.

[0136] Various methods have been developed for preparing arrays suitable for the methods of the present invention. State-of-the-art methods involves using a robotic apparatus to apply or "spot" distinct solutions containing antigen probes to closely spaced specific addressable locations on the surface of a planar support, typically a glass support, such as a microscope slide, which is subsequently processed by suitable thermal and/or chemical treatment to attach antigen probes to the surface of the support. Conveniently, the glass surface is first activated by a chemical treatment that leaves a layer of reactive groups such as epoxy groups on the surface, which bind covalently any molecule containing free amine or thiol groups. Suitable supports may also include silicon, nitrocellulose, paper, cellulosic supports and the like.

[0137] Preferably, each antigen probe, or distinct subset of antigen probes of the present invention, which is attached to a specific addressable location of the array is attached independently to at least two, more preferably to at least three separate specific addressable locations of the array in order to enable generation of statistically robust data.

[0138] According to additional embodiments, the antigen probe set comprises at least 2, at least 3, at least 5, at least 10, at least 50, at least 100, at least 150, at least 200, at least 300 or more antigens, including one or a plurality of the antigens provided by the present invention. According to additional embodiments, the antigen probe set comprises at least 1, at least 6, at least 10, at least 100, at least 150, at least 200, or more oligonucleotide antigens, including one or a plurality of the oligonucleotide antigens provided by the present invention.

[0139] In addition to antigen probes of the invention, the array may advantageously include control antigen probes or other standard chemicals. Such control antigen probes may include normalization control probes. The signals obtained from the normalization control probes provide a control for variations in binding conditions, label intensity, "reading" efficiency and other factors that may cause the signal of a given binding antibody-probe ligand interaction to vary. For example, signals, such as fluorescence intensity, read from all other antigen probes of the antigen probe array are divided by the signal (e.g., fluorescence intensity) from the normalization control probes thereby normalizing the measurements. Normalization control probes can be bound to various addressable locations on the antigen probe array to control for spatial variation in antibody-ligand probe efficiency. Preferably, normalization control probes are located at the corners or edges of the array to control for edge effects, as well as in the middle of the array.

[0140] The labeled antibody ligands may be of any of various suitable types of antibody ligand. Preferably, the antibody ligand is an antibody which is capable of specifically binding the Fc portion of the antibodies of the subject used. For example, where the antibodies of the subject are of the IgM isotype, the antibody ligand is preferably an antibody capable of specifically binding to the Fc region of IgM antibodies of the subject.

[0141] The ligand of the antibodies of the subject may be conjugated to any of various types of detectable labels. Preferably the label is a fluorophore, most preferably Cy3. Alternately, the fluorophore may be any of various fluorophores, including Cy5, fluorescein isothiocyanate (FITC), phycoerythrin (PE), rhodamine, Texas red, and the like. Suitable fluorophore-conjugated antibodies specific for antibodies of a specific isotype are widely available from commercial suppliers and methods of their production are well established.

[0142] Antibodies of the subject may be isolated for analysis of their antigen probe binding capacity in any of various ways, depending on the application and purpose. While the subject's antibodies may be suitably and conveniently in the form of blood serum or plasma or a dilution thereof (e.g. 1:10 dilution), the antibodies may be subjected to any desired degree of purification prior to being tested for their capacity to specifically bind antigen probes. The method of the present invention may be practiced using whole antibodies of the subject, or antibody fragments of the subject which comprises an antibody variable region.

[0143] Data Analysis

[0144] Advantageously, the methods of the invention may employ the use of learning and pattern recognition analyzers, clustering algorithms and the like, in order to discriminate between reactivity patterns of healthy control subjects to those of patients having SLE. As such, this term specifically includes a difference measured by, for example, determining the reactivity of antibodies in a test sample to a plurality of antigens, and comparing the resulting reactivity pattern to the reactivity patterns of negative and positive control samples (e.g. samples obtained from control subjects which are not afflicted with SLE or patients afflicted with SLE, respectively) using such algorithms and/or analyzers. The difference may also be measured by comparing the reactivity pattern of the test sample to a predetermined classification rule obtained in such manner.

[0145] In some embodiments, the methods of the invention may employ the use of learning and pattern recognition analyzers, clustering algorithms and the like, in order to discriminate between reactivity patterns of subjects having a subtype of SLE to control subjects. For example, the methods may include determining the reactivity of antibodies in a test sample to a plurality of antigens, and comparing the resulting pattern to the reactivity patterns of negative and positive control samples using such algorithms and/or analyzers.

[0146] Thus, in another embodiment, a significant difference between the reactivity pattern of a test sample compared to a reactivity pattern of a control sample, wherein the difference is computed using a learning and pattern recognition algorithm, indicates that the subject is afflicted with SLE. For example, the algorithm may include, without limitation, supervised or non-supervised classifiers including statistical algorithms including, but not limited to, principal component analysis (PCA), partial least squares (PLS), multiple linear regression (MLR), principal component regression (PCR), discriminant function analysis (DFA) including linear discriminant analysis (LDA), and cluster analysis including nearest neighbor, artificial neural networks, coupled two-way clustering algorithms, multi-layer perceptrons (MLP), generalized regression neural network (GRNN), fuzzy inference systems (FIS), self-organizing map (SOM), genetic algorithms (GAS), neuro-fuzzy systems (NFS), adaptive resonance theory (ART).

[0147] In certain embodiments, the learning and pattern recognition algorithm is SVM. In machine learning, support vector machines (SVMs, also support vector networks) are supervised learning models with associated learning algorithms that analyze data and recognize patterns, used for classification and regression analysis. Given a set of training examples, each marked as belonging to one of two categories, an SVM training algorithm builds a model that assigns new examples into one category or the other, making it a non-probabilistic binary linear classifier. An SVM model is a representation of the examples as points in space, mapped so that the examples of the separate categories are divided by a clear gap that is as wide as possible. New examples are then mapped into that same space and predicted to belong to a category based on which side of the gap they fall on.

[0148] In certain embodiments, the learning and pattern recognition algorithm is logistic regression (LR). In statistics, logistic regression, or logit regression, or logit model is a type of probabilistic statistical classification model. It is also used to predict a binary response from a binary predictor, used for predicting the outcome of a categorical dependent variable (i.e., a class label) based on one or more predictor variables (features). That is, it is used in estimating the parameters of a qualitative response model. The probabilities describing the possible outcomes of a single trial are modeled, as a function of the explanatory (predictor) variables, using a logistic function. Frequently "logistic regression" is used to refer specifically to the problem in which the dependent variable is binary, that is, the number of available categories is two.

[0149] "Logistic regression" is part of a category of statistical models called generalized linear models. Logistic regression allows one to predict a discrete outcome, such as group membership, from a set of variables that may be continuous, discrete, dichotomous, or a mix of any of these. The dependent or response variable is dichotomous, for example, one of two possible types of cancer. Logistic regression models the natural log of the odds ratio, i.e., the ratio of the probability of belonging to the first group (P) over the probability of belonging to the second group (1-P), as a linear combination of the different expression levels (in log-space) and of other explaining variables. The logistic regression output can be used as a classifier by prescribing that a case or sample will be classified into the first type if P is greater than 0.5 or 50%. Alternatively, the calculated probability P can be used as a variable in other contexts such as a 1D or 2D threshold classifier.

[0150] In certain embodiments, the learning and pattern recognition algorithm is linear discriminant analysis (LDA). LDA and the related Fisher's linear discriminant are methods used in statistics, pattern recognition and machine learning to find a linear combination of features which characterizes or separates two or more classes of objects or events. The resulting combination may be used as a linear classifier or, more commonly, for dimensionality reduction before later classification.

[0151] In certain embodiments, the learning and pattern recognition algorithm is Quadratic Discriminant analysis (QDA). A quadratic classifier is used in machine learning and statistical classification to separate measurements of two or more classes of objects or events by a quadric surface. It is a more general version of the linear classifier. QDA is closely related to LDA, where it is assumed that the measurements from each class are normally distributed. Unlike LDA however, in QDA there is no assumption that the covariance of each of the classes is identical. When the normality assumption is true, the best possible test for the hypothesis that a given measurement is from a given class is the likelihood ratio test.

[0152] In certain embodiments, the learning and pattern recognition algorithm is Classification and Decision Tree (CART). Decision tree learning uses a decision tree as a predictive model which maps observations about an item to conclusions about the item's target value. It is one of the predictive modelling approaches used in statistics, data mining and machine learning. Tree models where the target variable can take a finite set of values are called classification trees. In these tree structures, leaves represent class labels and branches represent conjunctions of features that lead to those class labels.

[0153] In certain embodiments, the learning and pattern recognition algorithm is random forest. Random forests are an ensemble learning method for classification, regression and other tasks, that operate by constructing a multitude of decision trees at training time and outputting the class that is the mode of the classes (classification) or mean prediction (regression) of the individual trees. Random forests correct for decision trees' habit of over fitting to their training set.

[0154] In certain embodiments, one or more algorithms or computer programs may be used for comparing the amount of each antibody quantified in the test sample against a predetermined cutoff (or against a number of predetermined cutoffs). Alternatively, one or more instructions for manually performing the necessary steps by a human can be provided.

[0155] Algorithms for determining and comparing pattern analysis include, but are not limited to, principal component analysis, Fischer linear analysis, neural network algorithms, genetic algorithms, fuzzy logic pattern recognition, and the like. After analysis is completed, the resulting information can, for example, be displayed on display, transmitted to a host computer, or stored on a storage device for subsequent retrieval.

[0156] Many of the algorithms are neural network based algorithms. A neural network has an input layer, processing layers and an output layer. The information in a neural network is distributed throughout the processing layers. The processing layers are made up of nodes that simulate the neurons by the interconnection to their nodes. Similar to statistical analysis revealing underlying patterns in a collection of data, neural networks locate consistent patterns in a collection of data, based on predetermined criteria.

[0157] Suitable pattern recognition algorithms include, but are not limited to, principal component analysis (PCA), Fisher linear discriminant analysis (FLDA), soft independent modeling of class analogy (SIMCA), K-nearest neighbors (KNN), neural networks, genetic algorithms, fuzzy logic, and other pattern recognition algorithms. In some embodiments, the Fisher linear discriminant analysis (FLDA) and canonical discriminant analysis (CDA) as well as combinations thereof are used to compare the output signature and the available data from the database.

[0158] In other embodiments, principal component analysis is used. Principal component analysis (PCA) involves a mathematical technique that transforms a number of correlated variables into a smaller number of uncorrelated variables. The smaller number of uncorrelated variables is known as principal components. The first principal component or eigenvector accounts for as much of the variability in the data as possible, and each succeeding component accounts for as much of the remaining variability as possible. The main objective of PCA is to reduce the dimensionality of the data set and to identify new underlying variables.

[0159] Principal component analysis compares the structure of two or more covariance matrices in a hierarchical fashion. For instance, one matrix might be identical to another except that each element of the matrix is multiplied by a single constant. The matrices are thus proportional to one another. More particularly, the matrices share identical eigenvectors (or principal components), but their eigenvalues differ by a constant. Another relationship between matrices is that they share principal components in common, but their eigenvalues differ. The mathematical technique used in principal component analysis is called eigenanalysis. The eigenvector associated with the largest eigenvalue has the same direction as the first principal component. The eigenvector associated with the second largest eigenvalue determines the direction of the second principal component. The sum of the eigenvalues equals the trace of the square matrix and the maximum number of eigenvectors equals the number of rows of this matrix.

[0160] In another embodiment, the algorithm is a classifier. One type of classifier is created by "training" the algorithm with data from the training set and whose performance is evaluated with the test set data. Examples of classifiers used in conjunction with the invention are discriminant analysis, decision tree analysis, receiver operator curves or split and score analysis.

[0161] The term "classification" refers to a procedure and/or algorithm in which individual items are placed into groups or classes based on quantitative information on one or more characteristics inherent in the items (referred to as traits, variables, characters, features, etc.) and based on a statistical model and/or a training set of previously labeled items.

[0162] As use herein, the term "data set" refers to numerical values obtained from the analysis. These numerical values associated with analysis may be values such as peak height and area under the curve.

[0163] The phrase "k-nearest neighbor" refers to a classification method that classifies a point by calculating the distances between the point and points in the training data set. It then assigns the point to the class that is most common among its k-nearest neighbors (where k is an integer).

[0164] The term "FDR" used herein when performing multiple statistical tests, for example in comparing the signal between two groups in multiple data features, there is an increasingly high probability of obtaining false positive results, by random differences between the groups that can reach levels that would otherwise be considered statistically significant. In order to limit the proportion of such false discoveries, statistical significance is defined only for data features in which the differences reached a p-value (by two-sided t-test) below a threshold, which is dependent on the number of tests performed and the distribution of p-values obtained in these tests.

[0165] The term "decision tree" refers to a classifier with a flow-chart-like tree structure employed for classification. Decision trees consist of repeated splits of a data set into subsets. Each split consists of a simple rule applied to one variable, e.g., "if value of "variable 1" larger than "threshold 1"; then go left, else go right". Accordingly, the given feature space is partitioned into a set of rectangles with each rectangle assigned to one class.

[0166] The terms "test set" or "unknown" or "validation set" refer to a subset of the entire available data set consisting of those entries not included in the training set. Test data is applied to evaluate classifier performance.

[0167] The terms "training set" or "known set" or "reference set" refer to a subset of the respective entire available data set. This subset is typically randomly selected, and is solely used for the purpose of classifier construction.

[0168] "Sensitivity," as used herein, may mean a statistical measure of how well a binary classification test correctly identifies a condition, for example, how frequently it correctly classifies a sample into the correct type out of two possible types. The sensitivity for class A is the proportion of cases that are determined to belong to class "A" by the test out of the cases that are in class "A," as determined by some absolute or gold standard.

[0169] "Specificity," as used herein, may mean a statistical measure of how well a binary classification test correctly identifies a condition, for example, how frequently it correctly classifies a sample into the correct type out of two possible types. The sensitivity for class A is the proportion of cases that are determined to belong to class "not A" by the test out of the cases that are in class "not A," as determined by some absolute or gold standard.

[0170] As used herein, the term "threshold" means the numerical value assigned for each run, which reflects a statistically significant point above the calculated baseline.

[0171] Diagnostic Methods

[0172] As used herein the term "diagnosing" or "diagnosis" refers to the process of identifying a medical condition or disease (e.g., SLE) by its signs, symptoms, and in particular from the results of various diagnostic procedures, including e.g. detecting the reactivity of antibodies in a biological sample (e.g. serum) obtained from an individual, to a plurality of antigens. Furthermore, as used herein the term "diagnosing" or "diagnosis" encompasses screening for a disease, detecting a presence or a severity of a disease, distinguishing a disease from other diseases including those diseases that may feature one or more similar or identical symptoms, providing prognosis of a disease, monitoring disease progression or relapse, as well as assessment of treatment efficacy and/or relapse of a disease, disorder or condition, as well as selecting a therapy and/or a treatment for a disease, optimization of a given therapy for a disease, selecting effective dosages or schedules for administering a therapeutic product, monitoring the treatment of a disease, and/or predicting the suitability of a therapy for specific patients or subpopulations or determining the appropriate dosing of a therapeutic product in patients or subpopulations.

[0173] Diagnostic methods differ in their sensitivity and specificity. The "sensitivity" of a diagnostic assay is the percentage of diseased individuals who test positive (percent of "true positives"). Diseased individuals not detected by the assay are "false negatives." Subjects who are not diseased and who test negative in the assay, are termed "true negatives." The "specificity" of a diagnostic assay is 1 minus the false positive rate, where the "false positive" rate is defined as the proportion of those without the disease who test positive. While a particular diagnostic method may not provide a definitive diagnosis of a condition, it suffices if the method provides a positive indication that aids in diagnosis. The "accuracy" of a diagnostic assay is the proximity of measurement results to the true value. The "p value" of a diagnostic assay is the probability of obtaining the observed sample results (or a more extreme result) when the null hypothesis is actually true.

[0174] In some embodiments, the methods of the invention are useful in diagnosing systemic lupus erythematosus (SLE) or lupus. "Lupus" as used herein is an autoimmune disease or disorder involving antibodies that attack connective tissue.

[0175] In another embodiment, the methods may result in determining a level of SLE disease activity. In a further embodiment, the methods may result in providing the comparison to an entity for monitoring SLE disease activity. In these embodiments, the methods can be used, for example, to differentiate between subjects with active disease (flare) and those with non-active disease (in remission).

[0176] In one embodiment, the subject being diagnosed according to the methods of the invention is symptomatic. In other embodiments, the subject is asymptomatic. In certain embodiments, the subject is not or was not receiving an immunosuppressive drug or an immunosuppressive treatment.

[0177] In one embodiment, the subject being diagnosed according to the methods of the invention is symptomatic. In other embodiments, the subject is asymptomatic. The diagnostic procedure can be performed in vivo or in vitro, preferably in vitro. In certain embodiments of the methods of the present invention, the diagnostic procedure is performed by non-invasive means or methods. According to some embodiments, the invention provides diagnostic methods useful for the detection of SLE or for ruling out of SLE.

[0178] The diagnostic procedure and platform of the present invention may be suitable for use as point of care device or point of service in clinic, in physician's office, in hospital laboratories, or in commercial diagnostic laboratories.

[0179] Criteria for Diagnosing Systemic Lupus Erythematosus (SLE)

[0180] The 1982 American College of Rheumatology (ACR) criteria describes features necessary to diagnose SLE. The presence of as few as 4 of the 11 criteria yields a sensitivity of 85% and a specificity of 95% for SLE. Patients with SLE may present with any combination of clinical features and serologic evidence of lupus. The ACR's criteria are (1) Serositis (pleurisy, pericarditis on examination or diagnostic ECG or imaging), (2) Oral ulcers (oral or nasopharyngeal, usually painless; palate is most specific), (3) Arthritis (nonerosive, two or more peripheral joints with tenderness or swelling), (4) Photosensitivity (unusual skin reaction to light exposure), (5) Blood disorders (leukopenia (<4.times.10.sup.3 cells/4 on more than one occasion), lymphopenia (<1500 cells/4 on more than one occasion), thrombocytopenia (<100.times.10.sup.3 cells/4 in the absence of offending medications), hemolytic anemia), (6) Renal involvement (proteinuria (>0.5 g/d or 3+positive on dipstick testing) or cellular casts), (7) ANAs (higher titers generally more specific (>1:160); must be in the absence of medications associated with drug-induced lupus), (8) Immunologic phenomena (dsDNA; anti-Smith (Sm) antibodies; antiphospholipid antibodies (anticardiolipin immunoglobulin G [IgG] or immunoglobulin M [IgM] or lupus anticoagulant); biologic false-positive serologic test results for syphilis, lupus erythematosus (LE) cells (omitted in 1997)), (9) Neurologic disorder (seizures or psychosis in the absence of other causes), (10) Malar rash (fixed erythema over the cheeks and nasal bridge, flat or raised), and (11) Discoid rash (erythematous raised-rimmed lesions with keratotic scaling and follicular plugging, often scarring).

[0181] The Systemic Lupus Collaborating Clinics (SLICC) recently revised and validated the American College of Rheumatology (ACR) SLE classification criteria in order to improve clinical relevance, meet stringent methodology requirements and incorporate new knowledge in SLE immunology (Petri et al., Arthritis and Rheumatism, 2012, Vol. 64, pages 2677-2686). Seventeen criteria were identified, including 11 clinical criteria and 6 immunological criteria. The SLICC criteria for SLE classification requires fulfillment of at least four criteria, with at least one clinical criterion and one immunologic criterion, or lupus nephritis as the sole clinical criterion in the presence of ANA or anti-dsDNA antibodies.

[0182] Two of the most commonly used instruments for SLE diagnosis are the Systemic Lupus Erythematosus Disease Activity Index (SLEDAI) and the Systemic Lupus Activity Measure (SLAM).

[0183] The SLEDAI is an index that measures disease activity by weighting the importance of each organ system involved. The SLEDAI includes 24 items, representing nine organ systems. The variables are obtained by history, physical examination and laboratory assessment. Each item is weighted from 1 to 8 based on the significance of the organ involved. For example, mouth ulcers are scored as 2, while seizures are scored as 8. The laboratory parameters that are included in the SLEDAI include white blood cell count, platelet count, urinalysis, serum C3, C4 and anti-dsDNA. The total maximum score is 105. SLEDAI class definition according to the present invention: Low SLEDAI: <=4 ("under control"). High SLEDAI: =>5

[0184] The SLAM includes 32 items representing 11 organ systems. The items are scored not only as present/absent, but graded on a scale of 1 to 3 based on severity. The total possible score for the SLAM is 86. Both the SLEDAI and the SLAM have been shown to be valid, reliable, and sensitive to change over time (Liang et al. 1989, Arth Rheum 32:1107-18), and are widely used in research protocols and clinical trials. These indices are particularly useful for examining the value of newly proposed serologic or inflammatory markers of disease activity in SLE.

[0185] Despite the obvious utility of these instruments, there are some drawbacks. First, there is not always complete agreement between the SLAM and the SLEDAI in the same set of patients. There are several possible reasons for these discrepancies. Unlike the SLEDAI, the SLAM includes constitutional symptoms such as fatigue and fever, which may or may not be considered attributable to active SLE; this activity index relies on physician interpretation. In addition, the SLEDAI does not capture mild degrees of activity in some organ systems and does not have descriptors for several types of activity, such as hemolytic anemia.

[0186] The following examples are presented in order to more fully illustrate some embodiments of the invention. They should, in no way be construed, however, as limiting the broad scope of the invention.

EXAMPLES

Example 1

[0187] Unique Antigen-Autoantibody Reactivity Patterns Capable of Differentiating SLE Patients from Healthy Control Group

[0188] Materials and Methods

[0189] Human Subjects

[0190] The study was approved by the Institutional Review Boards of the participating clinical unit; informed consent was obtained from all participants. All patient identifiers were kept confidential.

[0191] Serum samples were collected from 246 SLE patients from four independent sources (Albert Einstein College of Medicine, Medical University of South Carolina, Johns Hopkins University and Emory University) and were compared with sera samples of 252 healthy controls obtained from five independent sources.

[0192] To be considered an SLE patient, the following criteria had to be met: females, age 18-60 at time of sample collection, not pregnant, known demographics (age, race), medication record at time of sample collection, serum samples collected within three years of diagnosis of SLE, definitive diagnosis of SLE: ACR score>=4 (out of 11 criteria).

[0193] To be considered a healthy control, the following criteria had to be met: females, age 18-60 at time of sample collection, not pregnant, known demographics (age, race); no record of immunologically active disease, no steroid use within the past three months and no first degree relatives with SLE.

[0194] Sample Demographics

[0195] For the classifier development and verification, 196 SLE samples (average age-35) were collected from African patients (109), Hispanic patients (40) White and Caucasian patients (37), Indian/Asian/middle eastern patients (4) and others (6). 201 healthy control samples were collected from subjects with similar demographic distribution and similar average age.

[0196] For the classifier validation, 50 SLE samples (average age-40) were collected from African patients (23), Hispanic patients (9), White and Caucasian patients (15), Indian/Asian/middle eastern patient (1) and others (2). 51 healthy control samples were collected from subjects with similar demographic distribution and similar average age.

[0197] Antigen Microarrays and Serum Testing

[0198] Antigen microarray chips were prepared as previously described (Quintana et al. Lupus. 2006; 15: 428-30). Briefly, the antigens were spotted on epoxy-activated glass substrates (in-house produced epoxyhexyltriethoxysilane (EHTES) activated epoxy slides) using a Scienion S-11 non-contact microarray printer (Scienion AG, Germany). The microarrays were then blocked with 1% casein for one hour at room temperature. Test serum samples in 1% casein blocking buffer (1:20 dilution) were incubated under a coverslip for one hour at 37.degree.. The arrays were then washed and incubated for one hour at 37.degree. with a 1:500 dilution of two detection antibodies, mixed together: a goat anti-human IgG Cy3-conjugated antibody, and a goat anti-human IgM AF647-conjugated antibody (Jackson ImmunoResearch Laboratories Inc., West Grove, Pa.). Image acquisition was performed by laser at two wavelengths: 530 nm and 630 nm (Agilent Technologies, Santa Clara, Calif.) and the results were analyzed using Genepix pro 7 software (Molecular devices, Sunnyvale, Calif.). The quantitative range of signal intensity of binding to each antigen spot was 0-65,000; this range of detection made it possible to obtain reliable data at a 1:20 dilution of test samples.

[0199] Classifier Development and Verification

[0200] A total of 397 sera samples from SLE patients and healthy controls (HC) were tested; and 45 slides were tested with reference serum used as process control. Training was performed on a subset of 146 SLE patients and 151 healthy controls using four independent classification methods. Verification was performed on an additional set of 50 SLE patients and 50 healthy control samples.

[0201] Testing sessions: 23 test sessions were performed. 15 print batches were mixed and split for the 23 testing sessions, keeping constant ratios of SLE/HC, race, age, ACR score and sample source between the different testing sessions in order to eliminate testing bias resulting from one of these parameters. Each test session contained two print lots. SLE pool control slides were added (one slide per each print lot on each test session) as a process control and a basis for comparison between different testing sessions and print lots.

[0202] Scanning was performed on an Agilent fluorescence reader using PMT20 setting due to saturated intensities (>65,000) obtained for some of the SLE entities.

[0203] Classifier Validation

[0204] A total of 115 slides tested over 101 SLE/HC sera samples and 14 slides were used as process controls. Of the 101 sera samples tested, 50 samples were obtained from SLE patients and 51 samples were obtained from HC.

[0205] Validation testing sessions: seven test sessions were performed. Six print batches were mixed and split for the seven testing sessions, keeping constant ratios of SLE/HC, race, age and ACR score matched samples between the different testing sessions in order to eliminate testing bias resulting from one of these parameters. Each test session contained two print lots. SLE pool control slides were added (one slide per each print lot on each test session) as a process control and basis for comparison between different testing sessions and print lots. Test procedure and scanning were performed as described in the Classifier Development section.

[0206] Preprocessing

[0207] Images were extracted using Genepix Pro 7.0 with default settings and preprocessed as follows: [0208] 1. Signals were represented by spot mean intensity minus the median of the local background, followed by log (base 2) transformation for non-negative spots. [0209] 2. Negative spots were imputed by artificial low intensity values, a process performed separately for each channel. [0210] 3. The median intensities of all slides were adjusted equal to 9, for each channel separately. [0211] 4. Antigens printed in two sets (such as base iChip) were considered as two independent antigens. [0212] a. Antigen intensity per slide was represented by the median across all spots, excluding outlier spots and spots flagged by Genepix.

[0213] Classifier Development

[0214] Various feature selection and classifier construction methods were evaluated. The upper limit of the training accuracy (median score of bootstrapping sessions on set 1) was 88%, with several algorithm combinations reaching this level of performance. Four leading algorithms (SVM, LR, QDA, and LDA) were further considered for classifier formation.

[0215] Antigen Selection by Linear Discriminant Analysis (LDA) Algorithm

[0216] Antigens were ranked according to their selection frequency over bootstrapping iterations. The antigen list was screened in order to remove highly correlated features, i.e., Rho>=0.9. The final list includes six features, most of which were also selected by other methods, three of which are in the IgG channel and three are in the IgM channel:

TABLE-US-00002 TABLE 2 List of SLE related antigens having high separation capabilities identified by the LDA algorithm Fraction of Appearances Antigen Isotype 1 Calf ssDNA IgG 0.793333 U1 snRNP IgG 0.636667 Sm IgG 0.656667 Histone III-S IgM 0.633333 U1 snRNP IgM 0.52 Oligo21 IgM

[0217] Antigen Selection by the Quadratic Discriminant Classifier (QDA) Algorithm

[0218] Antigens were ranked according to their selection frequency over bootstrapping. The antigen list was screened in order to remove highly correlated features, i.e. Rho>=0.9. The final list includes six features, five of which are in the IgG channel and one is in IgM channel.

TABLE-US-00003 TABLE 3 List of SLE related antigens having high separation capabilities identified by the QDA algorithm Fraction of Appearances Antigen Isotype 1 Calf ssDNA IgG 0.923333 U1 snRNP IgG 0.996667 H2a IgM 0.433333 Sm IgG 0.8 Apo-SAA IgG 0.753333 Ro52 IgG

[0219] Antigen Selection by the Support Vector Machines (SVM) Algorithm

[0220] Antigens were ranked according to their average score over bootstrapping. The antigen list was screened in order to remove correlated features, i.e., Rho>=0.9. The final list includes six features, most of which were also selected by other methods, five of which are in the IgG channel and one is in IgM channel:

TABLE-US-00004 TABLE 4 List of SLE related antigens having high separation capabilities identified by the SVM algorithm Average score Antigen Isotype 0.919544 Calf ssDNA IgG 0.477504 Ro52 IgG 0.312718 Sm IgG 0.559492 U1 snRNP IgG 0.479361 Histone III-S IgM 0.32738 DNAse I IgG

[0221] Antigen Selection by the Logistic Regression (LR) Algorithm

[0222] Antigens were ranked according to their selection frequency over bootstrapping. The antigen list was manually screened in order to remove correlated features, i.e., Rho>=0.9. The final list includes six features, most of which were also selected by other methods, five of which are in the IgG channel and one is in the IgM channel.

TABLE-US-00005 TABLE 5 List of SLE related antigens having high separation capabilities identified by the LR algorithm Fraction of Appearances Antigen Isotype 1 Calf ssDNA IgG 1 U1 snRNP IgG 1 Ro52 IgG 1 Collagen III IgG 0.99 Apo-SAA IgG 1 Histone III-S IgM

TABLE-US-00006 TABLE 6 The performance of the classification methods used in the validation test Logistic Validation SVM Regression QDA LDA Majority AUC 0.95 0.95 0.95 0.94 Sensitivity 98% 98% 88% 94% 98% Specificity 59% 59% 76% 75% 67% Accuracy 78% 78% 82% 84% 82% PPV 70% 70% 79% 78% 74% NPV 96.7%.sup. 97% 87% 93% 97%

[0223] As shown in FIG. 1 and Table 6, all five classification methods (SVM, LR, QDA, LDA and Majority Vote) used in the validation test allowed for the differentiation between SLE patients and healthy subjects with high sensitivity ranging between 88-98% and specificity between 59-76% using a relatively small subset of <10 SLE-specific antigens.

[0224] As demonstrated in FIG. 2 the verification of the four classification methods (SVM, LR, QDA and LDA) was performed by determining the area under the Receiver Operating Characteristics (ROC) curve for each method. The area under the curve (AUC) calculated were 0.95, 0.95, 0.95 and 0.94 respectively. Validation results appear in Table 6 above. Each of the different classification methods was based on subset of antigen features with some antigens being represented in all four classification methods. FIG. 3 demonstrates the distribution of score/probabilities for the validation samples using the SVM classifier between healthy control (circle labeled) and SLE patients (cross labeled) based on the antigens disclosed in Table 4. FIG. 4 demonstrates the distribution of score/probabilities for the validation samples using the LDA classifier between healthy control (circle labeled) and SLE patients (cross labeled) based on the antigens disclosed in Table 2. FIG. 5 demonstrates the distribution of score/probabilities for the validation samples using the QDA classifier between healthy control (circle labeled) and SLE patients (cross labeled) based on the antigens disclosed in Table 3. FIG. 6 demonstrates the distribution of score/probabilities for the LR validation samples using the classifier between healthy control (circle labeled) and SLE patients (cross labeled) based on the antigens disclosed in Table 5.

Example 2

Rule Out of ANA Positive Patients

[0225] Anti-nuclear antigen (ANA) testing is part of the diagnostic decision making matrix in the diagnosis of SLE. ANA testing is characterized by high false positive rate (10-20%).

[0226] In order to test whether the classification methods of the present invention can better assess potential for SLE in ANA (+) otherwise healthy-appearing patients, fluorescent ANA test (FANA) was conducted on 136 healthy control patients. 24 patients scored positive (17.6%) at a ratio greater or equal to 1:80. As shown in Table 7 the test of the present invention ruled out a diagnosis of SLE in 50-80% of ANA (+) patients. Accordingly, the Rule Out test of the present invention can exclude SLE from the diagnosis in ANA+patients with a high degree of confidence. The Rule-Out test of the present invention can be used as a decision support tool for physicians in ruling out a diagnosis of SLE with a sensitivity of 94%, specificity of 75% and NPV of 93%. In the validation study, the inventors of the present invention were able to successfully rule out the diagnosis of SLE in up to 80% of ANA+subjects depending on the analytic approach.

TABLE-US-00007 TABLE 7 The performance of the Rule Out test 24 ANA Positive Healthy Controls (>1:80) SLE-key .RTM. SVM Negative 17 (71%) Positive 7 Logistic Regression Negative 13 (54%) Positive 11 QDA Negative 20 (83%) Positive 4 LDA Negative 16 (67%) Positive 8 Majority Vote Negative 18 (75%) Positive 6

[0227] As part of the diagnostic program, 154 clinical samples were evaluated with the SLE Rule-Out test. ANA test results obtained from 58 of the patients. Of these 35 were ANA+(60%) and 23 patients were ANA- (40%). As demonstrated in FIG. 12, results of post-hoc analysis suggests that the Rule-Out test results are highly correlated with the referring rheumatologist's clinical impression for both the ANA+(up to 95% agreement) and ANA-populations (100% agreement).

Example 3

Development of Disease Activity Monitoring Test for SLE

[0228] In order to develop a model that can classify high and low disease activity (SLEDAI class) with high specificity and high confidence that the patient's disease is under control (SLEDAI<=4), three analytical stages were included: 1. Development of a classifier: feature selection by training and testing, prediction on full dataset, model evaluation and comparison. 2. Verification, applying the classifier on data from the same print-lots and evaluating performance; and 3. Validation, applying the classifier on data from different print-lots.

[0229] The analysis was performed on 382 antigen intensities (both isotypes the IgG and the IgM). Serum samples were collected from 232 female SLE patients, of which 162 (.about.70%) have SLEDAI<=4 and 70 (.about.30%) have SLEDAI=>5. The samples were obtained from four US sites: Medical University of South Carolina (MUSC), Albert Einstein College of Medicine, Johns Hopkins University and Emory University.

[0230] 1. Classification:

[0231] The analysis consisted of two stages: (i) Selection of a group of antigens that evidently have a classification potential, and (ii) final classification modeling. Three classification approaches were used: classification and regression tree (CART), support vector machine (SVM) and logistic regression (LR). The first stage consisted of repeated sampling of 80% of the data. For each sampled subset, a model was fitted. The fitting procedure included sequential model reduction procedures for the logistic regression and support vector machines, which are helpful in selecting a group of antigens that are not correlated with each other and thus their contribution to the model is not redundant. For each fitting iteration, the coefficients of the antigens that were included in the model were recorded, and the value zero was assigned for those that were not included. The coefficients were averaged across all 1000 iterations, and the averages were plotted in a descending order. Such a curve decays quickly towards a low baseline level that represents the antigens with no effect. Thus only antigens of which averages appeared before the baseline region were selected for the final classification model. In the second stage, the final model for each modeling approach was obtained by fitting a model on the full dataset using only the selected antigens and the class assignment in accordance with each model was done for each observation.

[0232] 2. Evaluation:

[0233] Sensitivity and specificity were used for evaluating the performance of each model and for comparing the performance between the models. The support vector machine and logistic regression models yield a probability of belonging to the high SLEDAI class for each observation. Thus for these models specificity and sensitivity are subjected to the probability threshold determined, and the collection of all their possible combinations yield an ROC curve. For model evaluation and comparison, the ROC curve of the two models was plotted and the single sensitivity-specificity combination achieved by the decision tree model was superimposed on the plot.

[0234] 3. Results:

[0235] A. Antigen Selection:

[0236] For the logistic regression model, 20 antigens were selected, for the SVM model 22 antigens were selected, and for the decision tree model, 16 antigens were selected. As shown in Table 8, SVM and logistic regression generated antigen lists that are relatively similar, while the decision tree model generated a list that was included antigens with an evident difference between the classes at the univariate level. However, while the antigens selected by the logistic regression and SVM models are not correlated (see the hit-maps presented in FIGS. 7 and 8), most of these antigens selected by the decision tree model are highly correlated with each other (FIG. 9).

[0237] As can be seen by ROC curves shown in FIG. 10, SVM has better performance in terms of errors, compared to logistic regression, and both models outperformed the decision trees model. Considering the optimal probability threshold as the value corresponding to the minimal total error (FIG. 11), the logistic regression achieved sensitivity of 0.73 and specificity of 0.93, and the SVM model achieves sensitivity 0.89 of and specificity of 1. The decision tree model achieved sensitivity 0.67 of and specificity of 0.94

TABLE-US-00008 TABLE 8 List of antigens used for SLE activity monitoring test SEQ Classifier Antigen ID used Antibody name full name NO: Manufacturer cat# SVM huIgG- huIgG huIgG Jacksonimmuno 009-000- Set1_IgM Reasearch 003 SVM Gal2_IgG Gal2 Galectin-2 Human 7 Prospec cyt-725 SVM MBPmu_IgM MBPmu Myelin Basic Protein 8 Sigma M2941 from mouse SVM PRSS3_IgG PRSS3 Protease, serine, 3 9 AntibodiesOnline ABIN807673 SVM TgondiiP30_IgG TgondiiP30 Toxoplasma Gondii 10 Prospec tox-264 p30 (SAG1) SVM CRCP_IgM CRCP Calcitonin gene- 11 Prospec pro-919 related peptide- receptor component protein SVM Vitronectin_IgM Vitronectin Serum spreading 12 Peprotech 140-09 factor SVM TREM2_IgG TREM2 triggering receptor 13 SINOBIOLOGICAL, 11084- expressed on myeloid INC NOSH cells 2 SVM CK8_IgM CK8 Cytokeratin 8 full 14 Prospec pro-298 length SVM IL12_IgM IL12 Interleukin-12 rh (p70) 15 Peprotech 200-12 SVM CRP-Set1_IgM CRP C-Reactive Protein 16 Sigma C4063 SVM U1 snRNP_IgM U1 snRNP U1 small nuclear Prospec pro-445 ribonucleoprotein 70 kDa SVM CENPH_IgM CENPH centromere protein H 17 Prospec pro-966 SVM HMGB1_IgG HMGB1 Human high mobility 18 Prospec pro-581 group box 1 SVM TPO_IgM TPO Thyroid Peroxidase 19 Prospec enz-285 SVM Microglobulin- Microglobulin- microglobulin 20 Sigma M4890 b2_IgG b2 beta2, monocyte chemoattractant protein, SVM HGF_IgM HGF Hepatocyte Growth 21 Prospec CYT-244 Factor SVM DNAssCalf- DNA ss Calf DNA single stranded Sigma D8899 Set1_IgG SVM RaHuC3dIgG_ RaHuC3dIgG Rat anti Human c3d USBIOLOGICAL C0010- IgM (Isotype IgG2a) 16C monoclonal SVM INFSF12_1gG TNFSF12 human TNF ligand 23 Peprotech 310-06 superfamily member12 SVM Laminarin_IgG Laminarin Laminarin from Sigma L9634 Laminaria digitata SVM HaRooster- HaRooster Hyaluronic acid Sigma H5388 Set1_IgG sodium salt from rooster comb Logistic Vitronectin_IgM Vitronectin Serum spreading 12 Peprotech 140-09 Regression factor Logistic MBP mu_IgM MBPmu Myelin Basic Protein 8 Sigma M2941 Regression from mouse Logistic ASF1A_IgG ASF1A ASF1 anti-silencing 25 Prospec pro-682 Regression function 1 homolog 1 Logistic CollagenIII_IgM CollagenIII Collagen III (s X) 2 AKRONbiotech AK9914 Regression Logistic ASF1A_IgM ASF1A ASF1 anti-silencing 25 Prospec pro-682 Regression function 1 homolog 1 Logistic ACTN1_IgM ACTN1 Alpha-actinin-1 26 Prospec pro-518 Regression Logistic PRSS3_IgG PRSS3 Protease, serine, 3 9 AntibodiesOnline ABIN807 Regression 673 Logistic CRP-Set1_IgM CRP C-Reactive Protein 16 Sigma C4063 Regression Logistic Laminarin_IgG Laminarin Laminarin from Sigma L9634 Regression Laminaria digitata Logistic Microglobulin- Microglobulin- microglobulin 20 Sigma M4890 Regression b2_IgG b2 beta2,monocyte chemoattractant protein, Logistic HSP90_IgG HSP90 HSP90 bovine,Hsp90 6 Sigma H6774 Regression Native Human Protein, Logistic LPSse_IgM LPSse Lipopolysaccharides Sigma L5886 Regression from Salmonella enterica serotype abortus equi purified by phenol extraction Logistic oligo17_IgG oligo17 CCATAATTGCAAAG 22 SBSGenetechCo.,Ltd n/a Regression CTTCTG (2) CCA TAA TTG CAA AGC TTC TG Logistic SM-F_IgG SM-F small nuclear 24 Prospec pro-041 Regression ribonucleoprotein polypeptide F Logistic TgondiiROP4_ TgondiiROP4 Toxoplasma Gondii 27 Prospec tox-266 Regression IgG ROP4 (RH2) Mosaic Logistic GPI_IgG GPI Glucose-6-Phosphate 28 Prospec enz-430 Regression Isomerase Logistic CKBBI_IgG CKBBI Creatine kinase B 29 Prospec CKI-268 Regression chain Logistic PARP1_IgG PARP1 Poly (ADP-Ribose) 30 Prospec enz-477 Regression Polymerase 1 Human Logistic INC_IgG INC Cardiac TrponinC 31 Prospec pro-322 Regression Human Logistic Vimentin_IgG Vimentin Vimentin from bovine 32 Peprotech 110-10 Regression lens CART Histone3SCalf_ Histone3SCalf Histone from calf Sigma H5505 IgG thymus Type III-S CART H2a_IgG H2a Histone H2a 5 Sigma H9250 CART oligo17_IgG oligo17 CCATAATTGCAAAG 22 SBSGenetechCo.,Ltd n/a CTTCTG (2) CCA TAA TTG CAA AGC TTC TG CART oligo7_IgG oligo7 G10A10 33 SBSGenetechCo.,Ltd n/a GAG AGA GAG AGA GAG AGA GA CART CENPA_IgM CENPA Centromere Protein- 34 Prospec pro-389 A,rh CART TgondiiMIC3_ TgondiiMIC3 Toxoplasma Gondii 35 Prospec tox-261 IgG MIC 3 CART oligo16_IgG oligo16 CCATAATTGCAAAC 36 SBSGenetechCo.,Ltd n/a GTTCTG (1) CCA TAA TTG CAA ACG TTC TG CART Vitronectin_IgG Vitronectin Serum spreading 12 Peprotech 140-09 factor CART PRSS3_IgG PRSS Protease, serine, 3 9 AntibodiesOnline ABIN807673 CART oligo29_IgG oligo29 G9 37 SBSGenetechCo.,Ltd n/a GGG GGG GGG CART CK8_IgG CK8 Cytokeratin 8 full 14 Prospec pro-298 ength CART Oligo26_IgG Oligo26 T16G2 38 SBSGenetechCo.,Ltd n/a TTT TTT TTT TTT TTT TGG CART oligo1_IgG oligo1 A20 39 SBSGenetechCo.,Ltd n/a AAA AAA AAA AAA AAA AAA AA CART BMP4_IgM BMP4 Bone Morphogenetic 40 Peprotech 120- protein-4,rh 05ET CART oligo27_IgG oligo27 G16T1 41 SBSGenetechCo.,Ltd n/a GGG GGG GGG GGG GGG GT CART oligo20_IgG oligo20 CCA TAT TCG AAA 42 SBSGenetechCo.,Ltd n/a CGT TCT G (3) CCA TAA TTC GAA ACG TTC TG

Example 4

Correlation Between SLE-Key LDA Classification Score and SLE Disease Activity Index Score

[0238] In order to test whether the classification method (LDA) of the present invention can identify SLE autoantibodies profile in both active and in-active SLE patients, the correlation between SLEDAI score and the classification method (LDA) score of the present invention was tested.

[0239] The analysis was performed on 100 female SLE patients (validation set). The samples were obtained from four US sites: Medical University of South Carolina (MUSC), Albert Einstein College of Medicine, Johns Hopkins University and Emory University.

[0240] As shown in FIG. 13, the LDA score was correlated with the SLEDAI score. The LDA classifier score is higher for active disease state patients. Furthermore, the LDA classifier does not exclude SLE from the diagnosis for patients with an in-active disease state (i.e. SLEDAI score=0). These results suggest that the lupus autoantibodies profile is maintained in both active and in-active SLE patients and that the LDA classifier is robust and reliable in distinguishing between SLE and healthy controls autoantibodies fingerprint, regardless of the disease state.

[0241] The foregoing description of the specific embodiments will so fully reveal the general nature of the invention that others can, by applying current knowledge, readily modify and/or adapt for various applications such specific embodiments without undue experimentation and without departing from the generic concept, and, therefore, such adaptations and modifications should and are intended to be comprehended within the meaning and range of equivalents of the disclosed embodiments. It is to be understood that the phraseology or terminology employed herein is for the purpose of description and not of limitation. The means, materials, and steps for carrying out various disclosed functions may take a variety of alternative forms without departing from the invention.

Sequence CWU 1

1

421282PRTHomo sapiens 1Met Arg Gly Met Lys Leu Leu Gly Ala Leu Leu Ala Leu Ala Ala Leu1 5 10 15Leu Gln Gly Ala Val Ser Leu Lys Ile Ala Ala Phe Asn Ile Gln Thr 20 25 30Phe Gly Glu Thr Lys Met Ser Asn Ala Thr Leu Val Ser Tyr Ile Val 35 40 45Gln Ile Leu Ser Arg Tyr Asp Ile Ala Leu Val Gln Glu Val Arg Asp 50 55 60Ser His Leu Thr Ala Val Gly Lys Leu Leu Asp Asn Leu Asn Gln Asp65 70 75 80Ala Pro Asp Thr Tyr His Tyr Val Val Ser Glu Pro Leu Gly Arg Asn 85 90 95Ser Tyr Lys Glu Arg Tyr Leu Phe Val Tyr Arg Pro Asp Gln Val Ser 100 105 110Ala Val Asp Ser Tyr Tyr Tyr Asp Asp Gly Cys Glu Pro Cys Gly Asn 115 120 125Asp Thr Phe Asn Arg Glu Pro Ala Ile Val Arg Phe Phe Ser Arg Phe 130 135 140Thr Glu Val Arg Glu Phe Ala Ile Val Pro Leu His Ala Ala Pro Gly145 150 155 160Asp Ala Val Ala Glu Ile Asp Ala Leu Tyr Asp Val Tyr Leu Asp Val 165 170 175Gln Glu Lys Trp Gly Leu Glu Asp Val Met Leu Met Gly Asp Phe Asn 180 185 190Ala Gly Cys Ser Tyr Val Arg Pro Ser Gln Trp Ser Ser Ile Arg Leu 195 200 205Trp Thr Ser Pro Thr Phe Gln Trp Leu Ile Pro Asp Ser Ala Asp Thr 210 215 220Thr Ala Thr Pro Thr His Cys Ala Tyr Asp Arg Ile Val Val Ala Gly225 230 235 240Met Leu Leu Arg Gly Ala Val Val Pro Asp Ser Ala Leu Pro Phe Asn 245 250 255Phe Gln Ala Ala Tyr Gly Leu Ser Asp Gln Leu Ala Gln Ala Ile Ser 260 265 270Asp His Tyr Pro Val Glu Val Met Leu Lys 275 28021466PRTHomo sapiens 2Met Met Ser Phe Val Gln Lys Gly Ser Trp Leu Leu Leu Ala Leu Leu1 5 10 15His Pro Thr Ile Ile Leu Ala Gln Gln Glu Ala Val Glu Gly Gly Cys 20 25 30Ser His Leu Gly Gln Ser Tyr Ala Asp Arg Asp Val Trp Lys Pro Glu 35 40 45Pro Cys Gln Ile Cys Val Cys Asp Ser Gly Ser Val Leu Cys Asp Asp 50 55 60Ile Ile Cys Asp Asp Gln Glu Leu Asp Cys Pro Asn Pro Glu Ile Pro65 70 75 80Phe Gly Glu Cys Cys Ala Val Cys Pro Gln Pro Pro Thr Ala Pro Thr 85 90 95Arg Pro Pro Asn Gly Gln Gly Pro Gln Gly Pro Lys Gly Asp Pro Gly 100 105 110Pro Pro Gly Ile Pro Gly Arg Asn Gly Asp Pro Gly Ile Pro Gly Gln 115 120 125Pro Gly Ser Pro Gly Ser Pro Gly Pro Pro Gly Ile Cys Glu Ser Cys 130 135 140Pro Thr Gly Pro Gln Asn Tyr Ser Pro Gln Tyr Asp Ser Tyr Asp Val145 150 155 160Lys Ser Gly Val Ala Val Gly Gly Leu Ala Gly Tyr Pro Gly Pro Ala 165 170 175Gly Pro Pro Gly Pro Pro Gly Pro Pro Gly Thr Ser Gly His Pro Gly 180 185 190Ser Pro Gly Ser Pro Gly Tyr Gln Gly Pro Pro Gly Glu Pro Gly Gln 195 200 205Ala Gly Pro Ser Gly Pro Pro Gly Pro Pro Gly Ala Ile Gly Pro Ser 210 215 220Gly Pro Ala Gly Lys Asp Gly Glu Ser Gly Arg Pro Gly Arg Pro Gly225 230 235 240Glu Arg Gly Leu Pro Gly Pro Pro Gly Ile Lys Gly Pro Ala Gly Ile 245 250 255Pro Gly Phe Pro Gly Met Lys Gly His Arg Gly Phe Asp Gly Arg Asn 260 265 270Gly Glu Lys Gly Glu Thr Gly Ala Pro Gly Leu Lys Gly Glu Asn Gly 275 280 285Leu Pro Gly Glu Asn Gly Ala Pro Gly Pro Met Gly Pro Arg Gly Ala 290 295 300Pro Gly Glu Arg Gly Arg Pro Gly Leu Pro Gly Ala Ala Gly Ala Arg305 310 315 320Gly Asn Asp Gly Ala Arg Gly Ser Asp Gly Gln Pro Gly Pro Pro Gly 325 330 335Pro Pro Gly Thr Ala Gly Phe Pro Gly Ser Pro Gly Ala Lys Gly Glu 340 345 350Val Gly Pro Ala Gly Ser Pro Gly Ser Asn Gly Ala Pro Gly Gln Arg 355 360 365Gly Glu Pro Gly Pro Gln Gly His Ala Gly Ala Gln Gly Pro Pro Gly 370 375 380Pro Pro Gly Ile Asn Gly Ser Pro Gly Gly Lys Gly Glu Met Gly Pro385 390 395 400Ala Gly Ile Pro Gly Ala Pro Gly Leu Met Gly Ala Arg Gly Pro Pro 405 410 415Gly Pro Ala Gly Ala Asn Gly Ala Pro Gly Leu Arg Gly Gly Ala Gly 420 425 430Glu Pro Gly Lys Asn Gly Ala Lys Gly Glu Pro Gly Pro Arg Gly Glu 435 440 445Arg Gly Glu Ala Gly Ile Pro Gly Val Pro Gly Ala Lys Gly Glu Asp 450 455 460Gly Lys Asp Gly Ser Pro Gly Glu Pro Gly Ala Asn Gly Leu Pro Gly465 470 475 480Ala Ala Gly Glu Arg Gly Ala Pro Gly Phe Arg Gly Pro Ala Gly Pro 485 490 495Asn Gly Ile Pro Gly Glu Lys Gly Pro Ala Gly Glu Arg Gly Ala Pro 500 505 510Gly Pro Ala Gly Pro Arg Gly Ala Ala Gly Glu Pro Gly Arg Asp Gly 515 520 525Val Pro Gly Gly Pro Gly Met Arg Gly Met Pro Gly Ser Pro Gly Gly 530 535 540Pro Gly Ser Asp Gly Lys Pro Gly Pro Pro Gly Ser Gln Gly Glu Ser545 550 555 560Gly Arg Pro Gly Pro Pro Gly Pro Ser Gly Pro Arg Gly Gln Pro Gly 565 570 575Val Met Gly Phe Pro Gly Pro Lys Gly Asn Asp Gly Ala Pro Gly Lys 580 585 590Asn Gly Glu Arg Gly Gly Pro Gly Gly Pro Gly Pro Gln Gly Pro Pro 595 600 605Gly Lys Asn Gly Glu Thr Gly Pro Gln Gly Pro Pro Gly Pro Thr Gly 610 615 620Pro Gly Gly Asp Lys Gly Asp Thr Gly Pro Pro Gly Pro Gln Gly Leu625 630 635 640Gln Gly Leu Pro Gly Thr Gly Gly Pro Pro Gly Glu Asn Gly Lys Pro 645 650 655Gly Glu Pro Gly Pro Lys Gly Asp Ala Gly Ala Pro Gly Ala Pro Gly 660 665 670Gly Lys Gly Asp Ala Gly Ala Pro Gly Glu Arg Gly Pro Pro Gly Leu 675 680 685Ala Gly Ala Pro Gly Leu Arg Gly Gly Ala Gly Pro Pro Gly Pro Glu 690 695 700Gly Gly Lys Gly Ala Ala Gly Pro Pro Gly Pro Pro Gly Ala Ala Gly705 710 715 720Thr Pro Gly Leu Gln Gly Met Pro Gly Glu Arg Gly Gly Leu Gly Ser 725 730 735Pro Gly Pro Lys Gly Asp Lys Gly Glu Pro Gly Gly Pro Gly Ala Asp 740 745 750Gly Val Pro Gly Lys Asp Gly Pro Arg Gly Pro Thr Gly Pro Ile Gly 755 760 765Pro Pro Gly Pro Ala Gly Gln Pro Gly Asp Lys Gly Glu Gly Gly Ala 770 775 780Pro Gly Leu Pro Gly Ile Ala Gly Pro Arg Gly Ser Pro Gly Glu Arg785 790 795 800Gly Glu Thr Gly Pro Pro Gly Pro Ala Gly Phe Pro Gly Ala Pro Gly 805 810 815Gln Asn Gly Glu Pro Gly Gly Lys Gly Glu Arg Gly Ala Pro Gly Glu 820 825 830Lys Gly Glu Gly Gly Pro Pro Gly Val Ala Gly Pro Pro Gly Gly Ser 835 840 845Gly Pro Ala Gly Pro Pro Gly Pro Gln Gly Val Lys Gly Glu Arg Gly 850 855 860Ser Pro Gly Gly Pro Gly Ala Ala Gly Phe Pro Gly Ala Arg Gly Leu865 870 875 880Pro Gly Pro Pro Gly Ser Asn Gly Asn Pro Gly Pro Pro Gly Pro Ser 885 890 895Gly Ser Pro Gly Lys Asp Gly Pro Pro Gly Pro Ala Gly Asn Thr Gly 900 905 910Ala Pro Gly Ser Pro Gly Val Ser Gly Pro Lys Gly Asp Ala Gly Gln 915 920 925Pro Gly Glu Lys Gly Ser Pro Gly Ala Gln Gly Pro Pro Gly Ala Pro 930 935 940Gly Pro Leu Gly Ile Ala Gly Ile Thr Gly Ala Arg Gly Leu Ala Gly945 950 955 960Pro Pro Gly Met Pro Gly Pro Arg Gly Ser Pro Gly Pro Gln Gly Val 965 970 975Lys Gly Glu Ser Gly Lys Pro Gly Ala Asn Gly Leu Ser Gly Glu Arg 980 985 990Gly Pro Pro Gly Pro Gln Gly Leu Pro Gly Leu Ala Gly Thr Ala Gly 995 1000 1005Glu Pro Gly Arg Asp Gly Asn Pro Gly Ser Asp Gly Leu Pro Gly 1010 1015 1020Arg Asp Gly Ser Pro Gly Gly Lys Gly Asp Arg Gly Glu Asn Gly 1025 1030 1035Ser Pro Gly Ala Pro Gly Ala Pro Gly His Pro Gly Pro Pro Gly 1040 1045 1050Pro Val Gly Pro Ala Gly Lys Ser Gly Asp Arg Gly Glu Ser Gly 1055 1060 1065Pro Ala Gly Pro Ala Gly Ala Pro Gly Pro Ala Gly Ser Arg Gly 1070 1075 1080Ala Pro Gly Pro Gln Gly Pro Arg Gly Asp Lys Gly Glu Thr Gly 1085 1090 1095Glu Arg Gly Ala Ala Gly Ile Lys Gly His Arg Gly Phe Pro Gly 1100 1105 1110Asn Pro Gly Ala Pro Gly Ser Pro Gly Pro Ala Gly Gln Gln Gly 1115 1120 1125Ala Ile Gly Ser Pro Gly Pro Ala Gly Pro Arg Gly Pro Val Gly 1130 1135 1140Pro Ser Gly Pro Pro Gly Lys Asp Gly Thr Ser Gly His Pro Gly 1145 1150 1155Pro Ile Gly Pro Pro Gly Pro Arg Gly Asn Arg Gly Glu Arg Gly 1160 1165 1170Ser Glu Gly Ser Pro Gly His Pro Gly Gln Pro Gly Pro Pro Gly 1175 1180 1185Pro Pro Gly Ala Pro Gly Pro Cys Cys Gly Gly Val Gly Ala Ala 1190 1195 1200Ala Ile Ala Gly Ile Gly Gly Glu Lys Ala Gly Gly Phe Ala Pro 1205 1210 1215Tyr Tyr Gly Asp Glu Pro Met Asp Phe Lys Ile Asn Thr Asp Glu 1220 1225 1230Ile Met Thr Ser Leu Lys Ser Val Asn Gly Gln Ile Glu Ser Leu 1235 1240 1245Ile Ser Pro Asp Gly Ser Arg Lys Asn Pro Ala Arg Asn Cys Arg 1250 1255 1260Asp Leu Lys Phe Cys His Pro Glu Leu Lys Ser Gly Glu Tyr Trp 1265 1270 1275Val Asp Pro Asn Gln Gly Cys Lys Leu Asp Ala Ile Lys Val Phe 1280 1285 1290Cys Asn Met Glu Thr Gly Glu Thr Cys Ile Ser Ala Asn Pro Leu 1295 1300 1305Asn Val Pro Arg Lys His Trp Trp Thr Asp Ser Ser Ala Glu Lys 1310 1315 1320Lys His Val Trp Phe Gly Glu Ser Met Asp Gly Gly Phe Gln Phe 1325 1330 1335Ser Tyr Gly Asn Pro Glu Leu Pro Glu Asp Val Leu Asp Val His 1340 1345 1350Leu Ala Phe Leu Arg Leu Leu Ser Ser Arg Ala Ser Gln Asn Ile 1355 1360 1365Thr Tyr His Cys Lys Asn Ser Ile Ala Tyr Met Asp Gln Ala Ser 1370 1375 1380Gly Asn Val Lys Lys Ala Leu Lys Leu Met Gly Ser Asn Glu Gly 1385 1390 1395Glu Phe Lys Ala Glu Gly Asn Ser Lys Phe Thr Tyr Thr Val Leu 1400 1405 1410Glu Asp Gly Cys Thr Lys His Thr Gly Glu Trp Ser Lys Thr Val 1415 1420 1425Phe Glu Tyr Arg Thr Arg Lys Ala Val Arg Leu Pro Ile Val Asp 1430 1435 1440Ile Ala Pro Tyr Asp Ile Gly Gly Pro Asp Gln Glu Phe Gly Val 1445 1450 1455Asp Val Gly Pro Val Cys Phe Leu 1460 14653475PRTHomo sapiens 3Met Ala Ser Ala Ala Arg Leu Thr Met Met Trp Glu Glu Val Thr Cys1 5 10 15Pro Ile Cys Leu Asp Pro Phe Val Glu Pro Val Ser Ile Glu Cys Gly 20 25 30His Ser Phe Cys Gln Glu Cys Ile Ser Gln Val Gly Lys Gly Gly Gly 35 40 45Ser Val Cys Pro Val Cys Arg Gln Arg Phe Leu Leu Lys Asn Leu Arg 50 55 60Pro Asn Arg Gln Leu Ala Asn Met Val Asn Asn Leu Lys Glu Ile Ser65 70 75 80Gln Glu Ala Arg Glu Gly Thr Gln Gly Glu Arg Cys Ala Val His Gly 85 90 95Glu Arg Leu His Leu Phe Cys Glu Lys Asp Gly Lys Ala Leu Cys Trp 100 105 110Val Cys Ala Gln Ser Arg Lys His Arg Asp His Ala Met Val Pro Leu 115 120 125Glu Glu Ala Ala Gln Glu Tyr Gln Glu Lys Leu Gln Val Ala Leu Gly 130 135 140Glu Leu Arg Arg Lys Gln Glu Leu Ala Glu Lys Leu Glu Val Glu Ile145 150 155 160Ala Ile Lys Arg Ala Asp Trp Lys Lys Thr Val Glu Thr Gln Lys Ser 165 170 175Arg Ile His Ala Glu Phe Val Gln Gln Lys Asn Phe Leu Val Glu Glu 180 185 190Glu Gln Arg Gln Leu Gln Glu Leu Glu Lys Asp Glu Arg Glu Gln Leu 195 200 205Arg Ile Leu Gly Glu Lys Glu Ala Lys Leu Ala Gln Gln Ser Gln Ala 210 215 220Leu Gln Glu Leu Ile Ser Glu Leu Asp Arg Arg Cys His Ser Ser Ala225 230 235 240Leu Glu Leu Leu Gln Glu Val Ile Ile Val Leu Glu Arg Ser Glu Ser 245 250 255Trp Asn Leu Lys Asp Leu Asp Ile Thr Ser Pro Glu Leu Arg Ser Val 260 265 270Cys His Val Pro Gly Leu Lys Lys Met Leu Arg Thr Cys Ala Val His 275 280 285Ile Thr Leu Asp Pro Asp Thr Ala Asn Pro Trp Leu Ile Leu Ser Glu 290 295 300Asp Arg Arg Gln Val Arg Leu Gly Asp Thr Gln Gln Ser Ile Pro Gly305 310 315 320Asn Glu Glu Arg Phe Asp Ser Tyr Pro Met Val Leu Gly Ala Gln His 325 330 335Phe His Ser Gly Lys His Tyr Trp Glu Val Asp Val Thr Gly Lys Glu 340 345 350Ala Trp Asp Leu Gly Val Cys Arg Asp Ser Val Arg Arg Lys Gly His 355 360 365Phe Leu Leu Ser Ser Lys Ser Gly Phe Trp Thr Ile Trp Leu Trp Asn 370 375 380Lys Gln Lys Tyr Glu Ala Gly Thr Tyr Pro Gln Thr Pro Leu His Leu385 390 395 400Gln Val Pro Pro Cys Gln Val Gly Ile Phe Leu Asp Tyr Glu Ala Gly 405 410 415Met Val Ser Phe Tyr Asn Ile Thr Asp His Gly Ser Leu Ile Tyr Ser 420 425 430Phe Ser Glu Cys Ala Phe Thr Gly Pro Leu Arg Pro Phe Phe Ser Pro 435 440 445Gly Phe Asn Asp Gly Gly Lys Asn Thr Ala Pro Leu Thr Leu Cys Pro 450 455 460Leu Asn Ile Gly Ser Gln Gly Ser Thr Asp Tyr465 470 4754105PRTHomo sapiens 4Met Arg Ser Phe Phe Ser Phe Leu Gly Glu Ala Phe Asp Gly Ala Arg1 5 10 15Asp Met Trp Arg Ala Tyr Ser Asp Met Arg Glu Ala Asn Tyr Ile Gly 20 25 30Ser Asp Lys Tyr Phe His Ala Arg Gly Asn Tyr Asp Ala Ala Lys Arg 35 40 45Gly Pro Gly Gly Val Trp Ala Ala Glu Ala Ile Ser Asn Ala Arg Glu 50 55 60Asn Ile Gln Arg Phe Phe Gly Arg Gly Ala Glu Asp Ser Leu Ala Asp65 70 75 80Gln Ala Ala Asn Glu Trp Gly Arg Ser Gly Lys Asp Pro Asn His Phe 85 90 95Arg Pro Ala Gly Leu Pro Glu Lys Tyr 100 1055130PRTHomo sapiens 5Met Ser Gly Arg Gly Lys Gln Gly Gly Lys Ala Arg Ala Lys Ala Lys1 5 10 15Thr Arg Ser Ser Arg Ala Gly Leu Gln Phe Pro Val Gly Arg Val His 20 25 30Arg Leu Leu Arg Lys Gly Asn Tyr Ala Glu Arg Val Gly Ala Gly Ala 35 40 45Pro Val Tyr Leu Ala Ala Val Leu Glu Tyr Leu Thr Ala Glu Ile Leu 50 55 60Glu Leu Ala Gly Asn Ala Ala Arg Asp Asn Lys Lys Thr Arg Ile Ile65 70 75 80Pro Arg His Leu Gln Leu Ala Ile Arg Asn Asp Glu Glu Leu Asn Lys 85 90 95Leu Leu Gly Lys Val Thr Ile Ala Gln Gly Gly Val Leu Pro Asn Ile 100 105 110Gln Ala Val Leu Leu Pro Lys Lys Thr Glu Ser His His Lys Ala Lys 115 120 125Gly Lys

130624DNAArtificial Sequencesynthetic 6ttagggttag ggttagggtt aggg 247132PRTHomo sapiens 7Met Thr Gly Glu Leu Glu Val Lys Asn Met Asp Met Lys Pro Gly Ser1 5 10 15Thr Leu Lys Ile Thr Gly Ser Ile Ala Asp Gly Thr Asp Gly Phe Val 20 25 30Ile Asn Leu Gly Gln Gly Thr Asp Lys Leu Asn Leu His Phe Asn Pro 35 40 45Arg Phe Ser Glu Ser Thr Ile Val Cys Asn Ser Leu Asp Gly Ser Asn 50 55 60Trp Gly Gln Glu Gln Arg Glu Asp His Leu Cys Phe Ser Pro Gly Ser65 70 75 80Glu Val Lys Phe Thr Val Thr Phe Glu Ser Asp Lys Phe Lys Val Lys 85 90 95Leu Pro Asp Gly His Glu Leu Thr Phe Pro Asn Arg Leu Gly His Ser 100 105 110His Leu Ser Tyr Leu Ser Val Arg Gly Gly Phe Asn Met Ser Ser Phe 115 120 125Lys Leu Lys Glu 1308250PRTMus musculus 8Met Gly Asn His Ser Gly Lys Arg Glu Leu Ser Ala Glu Lys Ala Ser1 5 10 15Lys Asp Gly Glu Ile His Arg Gly Glu Ala Gly Lys Lys Arg Ser Val 20 25 30Gly Lys Leu Ser Gln Thr Ala Ser Glu Asp Ser Asp Val Phe Gly Glu 35 40 45Ala Asp Ala Ile Gln Asn Asn Gly Thr Ser Ala Glu Asp Thr Ala Val 50 55 60Thr Asp Ser Lys His Thr Ala Asp Pro Lys Asn Asn Trp Gln Gly Ala65 70 75 80His Pro Ala Asp Pro Gly Asn Arg Pro His Leu Ile Arg Leu Phe Ser 85 90 95Arg Asp Ala Pro Gly Arg Glu Asp Asn Thr Phe Lys Asp Arg Pro Ser 100 105 110Glu Ser Asp Glu Leu Gln Thr Ile Gln Glu Asp Pro Thr Ala Ala Ser 115 120 125Gly Gly Leu Asp Val Met Ala Ser Gln Lys Arg Pro Ser Gln Arg Ser 130 135 140Lys Tyr Leu Ala Thr Ala Ser Thr Met Asp His Ala Arg His Gly Phe145 150 155 160Leu Pro Arg His Arg Asp Thr Gly Ile Leu Asp Ser Ile Gly Arg Phe 165 170 175Phe Ser Gly Asp Arg Gly Ala Pro Lys Arg Gly Ser Gly Lys Asp Ser 180 185 190His Thr Arg Thr Thr His Tyr Gly Ser Leu Pro Gln Lys Ser Gln His 195 200 205Gly Arg Thr Gln Asp Glu Asn Pro Val Val His Phe Phe Lys Asn Ile 210 215 220Val Thr Pro Arg Thr Pro Pro Pro Ser Gln Gly Lys Gly Gly Arg Asp225 230 235 240Ser Arg Ser Gly Ser Pro Met Ala Arg Arg 245 2509491PRTHomo sapiens 9Met Asn Pro Phe Leu Ile Leu Ala Phe Val Gly Ala Ala Val Ala Val1 5 10 15Pro Phe Asp Asp Asp Asp Lys Ile Val Gly Gly Tyr Thr Cys Glu Glu 20 25 30Asn Ser Leu Pro Tyr Gln Val Ser Leu Asn Ser Gly Ser His Phe Cys 35 40 45Gly Gly Ser Leu Ile Ser Glu Gln Trp Val Val Ser Ala Ala His Cys 50 55 60Tyr Lys Thr Arg Ile Gln Val Arg Leu Gly Glu His Asn Ile Lys Val65 70 75 80Leu Glu Gly Asn Glu Gln Phe Ile Asn Ala Ala Lys Ile Ile Arg His 85 90 95Pro Gln Tyr Asp Arg Lys Thr Leu Asn Asn Asp Ile Met Leu Ile Lys 100 105 110Leu Ser Ser Arg Ala Val Ile Asn Ala Arg Val Ser Thr Ile Ser Leu 115 120 125Pro Thr Ala Pro Pro Ala Thr Gly Thr Lys Cys Leu Ile Ser Gly Trp 130 135 140Gly Asn Thr Ala Ser Ser Gly Ala Asp Tyr Pro Asp Glu Leu Gln Cys145 150 155 160Leu Asp Ala Pro Val Leu Ser Gln Ala Lys Cys Glu Ala Ser Tyr Pro 165 170 175Gly Lys Ile Thr Ser Asn Met Phe Cys Val Gly Phe Leu Glu Gly Gly 180 185 190Lys Asp Ser Cys Gln Gly Asp Ser Gly Gly Pro Val Val Cys Asn Gly 195 200 205Gln Leu Gln Gly Val Val Ser Trp Gly Asp Gly Cys Ala Gln Lys Asn 210 215 220Lys Pro Gly Val Tyr Thr Lys Val Tyr Asn Tyr Val Lys Trp Ile Lys225 230 235 240Asn Thr Ile Ala Ala Asn Ser Asp Ala Asp Gly Cys Glu Ala Leu Gly 245 250 255Thr Val Ala Val Pro Phe Asp Asp Asp Asp Lys Ile Val Gly Gly Tyr 260 265 270Thr Cys Glu Glu Asn Ser Leu Pro Tyr Gln Val Ser Leu Asn Ser Gly 275 280 285Ser His Phe Cys Gly Gly Ser Leu Ile Ser Glu Gln Trp Val Val Ser 290 295 300Ala Ala His Cys Tyr Lys Thr Arg Ile Gln Val Arg Leu Gly Glu His305 310 315 320Asn Ile Lys Val Leu Glu Gly Asn Glu Gln Phe Ile Asn Ala Ala Lys 325 330 335Ile Ile Arg His Pro Lys Tyr Asn Arg Asp Thr Leu Asp Asn Asp Ile 340 345 350Met Leu Ile Lys Leu Ser Ser Pro Ala Val Ile Asn Ala Arg Val Ser 355 360 365Thr Ile Ser Leu Pro Thr Thr Pro Pro Ala Ala Gly Thr Glu Cys Leu 370 375 380Ile Ser Gly Trp Gly Asn Thr Leu Ser Phe Gly Ala Asp Tyr Pro Asp385 390 395 400Glu Leu Lys Cys Leu Asp Ala Pro Val Leu Thr Gln Ala Glu Cys Lys 405 410 415Ala Ser Tyr Pro Gly Lys Ile Thr Asn Ser Met Phe Cys Val Gly Phe 420 425 430Leu Glu Gly Gly Lys Asp Ser Cys Gln Arg Asp Ser Gly Gly Pro Val 435 440 445Val Cys Asn Gly Gln Leu Gln Gly Val Val Ser Trp Gly His Gly Cys 450 455 460Ala Trp Lys Asn Arg Pro Gly Val Tyr Thr Lys Val Tyr Asn Tyr Val465 470 475 480Asp Trp Ile Lys Asp Thr Ile Ala Ala Asn Ser 485 49010336PRTToxoplasma gondii 10Met Ser Val Ser Leu His His Phe Ile Ile Ser Ser Gly Phe Leu Thr1 5 10 15Ser Met Phe Pro Lys Ala Val Arg Arg Ala Val Thr Ala Gly Val Phe 20 25 30Ala Ala Pro Thr Leu Met Ser Phe Leu Arg Cys Gly Ala Met Ala Ser 35 40 45Asp Pro Pro Leu Val Ala Asn Gln Val Val Thr Cys Pro Asp Lys Lys 50 55 60Ser Thr Ala Ala Val Ile Leu Thr Pro Thr Glu Asn His Phe Thr Leu65 70 75 80Lys Cys Pro Lys Thr Ala Leu Thr Glu Pro Pro Thr Leu Ala Tyr Ser 85 90 95Pro Asn Arg Gln Ile Cys Pro Ala Gly Thr Thr Ser Ser Cys Thr Ser 100 105 110Lys Ala Val Thr Leu Ser Ser Leu Ile Pro Glu Ala Glu Asp Ser Trp 115 120 125Trp Thr Gly Asp Ser Ala Ser Leu Asp Thr Ala Gly Ile Lys Leu Thr 130 135 140Val Pro Ile Glu Lys Phe Pro Val Thr Thr Gln Thr Phe Val Val Gly145 150 155 160Cys Ile Lys Gly Asp Asp Ala Gln Ser Cys Met Val Thr Val Thr Val 165 170 175Gln Ala Arg Ala Ser Ser Val Val Asn Asn Val Ala Arg Cys Ser Tyr 180 185 190Gly Ala Asn Ser Thr Leu Gly Pro Val Lys Leu Ser Ala Glu Gly Pro 195 200 205Thr Thr Met Thr Leu Val Cys Gly Lys Asp Gly Val Lys Val Pro Gln 210 215 220Asp Asn Asn Gln Tyr Cys Ser Gly Thr Thr Leu Thr Gly Cys Asn Glu225 230 235 240Lys Ser Phe Lys Asp Ile Leu Pro Lys Leu Ser Glu Asn Pro Trp Gln 245 250 255Gly Asn Ala Ser Ser Asp Asn Gly Ala Thr Leu Thr Ile Asn Lys Glu 260 265 270Ala Phe Pro Ala Glu Ser Lys Ser Val Ile Ile Gly Cys Thr Gly Gly 275 280 285Ser Pro Glu Lys His His Cys Thr Val Gln Leu Glu Phe Ala Gly Ala 290 295 300Ala Gly Ser Ala Lys Ser Ser Ala Gly Thr Ala Ser His Val Ser Ile305 310 315 320Phe Ala Met Val Thr Gly Leu Ile Gly Ser Ile Ala Ala Cys Val Ala 325 330 33511168PRTHomo sapiens 11Met Gly Ser Ser His His His His His His Ser Ser Gly Leu Val Pro1 5 10 15Arg Gly Ser His Met Glu Val Lys Asp Ala Asn Ser Ala Leu Leu Ser 20 25 30Asn Tyr Glu Val Phe Gln Leu Leu Thr Asp Leu Lys Glu Gln Arg Lys 35 40 45Glu Ser Gly Lys Asn Lys His Ser Ser Gly Gln Gln Asn Leu Asn Thr 50 55 60Ile Thr Tyr Glu Thr Leu Lys Tyr Ile Ser Lys Thr Pro Cys Arg His65 70 75 80Gln Ser Pro Glu Ile Val Arg Glu Phe Leu Thr Ala Leu Lys Ser His 85 90 95Lys Leu Thr Lys Ala Glu Lys Leu Gln Leu Leu Asn His Arg Pro Val 100 105 110Thr Ala Val Glu Ile Gln Leu Met Val Glu Glu Ser Glu Glu Arg Leu 115 120 125Thr Glu Glu Gln Ile Glu Ala Leu Leu His Thr Val Thr Ser Ile Leu 130 135 140Pro Ala Glu Pro Glu Ala Glu Gln Lys Lys Asn Thr Asn Ser Asn Val145 150 155 160Ala Met Asp Glu Glu Asp Pro Ala 16512270PRTHomo sapiens 12Asp Gln Glu Ser Cys Lys Gly Arg Cys Thr Glu Gly Phe Asn Val Asp1 5 10 15Lys Lys Cys Gln Cys Asp Glu Leu Cys Ser Tyr Tyr Gln Ser Cys Cys 20 25 30Thr Asp Tyr Thr Ala Glu Cys Lys Pro Gln Val Thr Arg Gly Asp Val 35 40 45Phe Thr Met Pro Glu Asp Glu Tyr Thr Val Tyr Asp Asp Gly Glu Glu 50 55 60Lys Asn Asn Ala Thr Val His Glu Gln Val Gly Gly Pro Ser Leu Thr65 70 75 80Ser Asp Leu Gln Ala Gln Ser Lys Gly Asn Pro Glu Gln Thr Pro Val 85 90 95Leu Lys Pro Glu Glu Glu Ala Pro Ala Pro Glu Val Gly Ala Ser Lys 100 105 110Pro Glu Gly Ile Asp Ser Arg Pro Glu Thr Leu His Pro Gly Arg Pro 115 120 125Gln Pro Pro Ala Glu Glu Glu Leu Cys Ser Gly Lys Pro Phe Asp Ala 130 135 140Phe Thr Asp Leu Lys Asn Gly Ser Leu Phe Ala Phe Arg Gly Gln Tyr145 150 155 160Cys Tyr Glu Leu Asp Glu Lys Ala Val Arg Pro Gly Tyr Pro Lys Leu 165 170 175Ile Arg Asp Val Trp Gly Ile Glu Gly Pro Ile Asp Ala Ala Phe Thr 180 185 190Arg Ile Asn Cys Gln Gly Lys Thr Tyr Leu Phe Lys Gly Ser Gln Tyr 195 200 205Trp Arg Phe Glu Asp Gly Val Leu Asp Pro Asp Tyr Pro Arg Asn Ile 210 215 220Ser Asp Gly Phe Asp Gly Ile Pro Asp Asn Val Asp Ala Ala Leu Ala225 230 235 240Leu Pro Ala His Ser Tyr Ser Gly Arg Glu Arg Val Tyr Phe Phe Lys 245 250 255Gly Lys Gln Tyr Trp Glu Tyr Gln Phe Gln His Gln Pro Ser 260 265 27013230PRTHomo sapiens 13Met Glu Pro Leu Arg Leu Leu Ile Leu Leu Phe Val Thr Glu Leu Ser1 5 10 15Gly Ala His Asn Thr Thr Val Phe Gln Gly Val Ala Gly Gln Ser Leu 20 25 30Gln Val Ser Cys Pro Tyr Asp Ser Met Lys His Trp Gly Arg Arg Lys 35 40 45Ala Trp Cys Arg Gln Leu Gly Glu Lys Gly Pro Cys Gln Arg Val Val 50 55 60Ser Thr His Asn Leu Trp Leu Leu Ser Phe Leu Arg Arg Trp Asn Gly65 70 75 80Ser Thr Ala Ile Thr Asp Asp Thr Leu Gly Gly Thr Leu Thr Ile Thr 85 90 95Leu Arg Asn Leu Gln Pro His Asp Ala Gly Leu Tyr Gln Cys Gln Ser 100 105 110Leu His Gly Ser Glu Ala Asp Thr Leu Arg Lys Val Leu Val Glu Val 115 120 125Leu Ala Asp Pro Leu Asp His Arg Asp Ala Gly Asp Leu Trp Phe Pro 130 135 140Gly Glu Ser Glu Ser Phe Glu Asp Ala His Val Glu His Ser Ile Ser145 150 155 160Arg Ser Leu Leu Glu Gly Glu Ile Pro Phe Pro Pro Thr Ser Ile Leu 165 170 175Leu Leu Leu Ala Cys Ile Phe Leu Ile Lys Ile Leu Ala Ala Ser Ala 180 185 190Leu Trp Ala Ala Ala Trp His Gly Gln Lys Pro Gly Thr His Pro Pro 195 200 205Ser Glu Leu Asp Cys Gly His Asp Pro Gly Tyr Gln Leu Gln Thr Leu 210 215 220Pro Gly Leu Arg Asp Thr225 23014350PRTHomo sapiens 14Met Ser Ile Arg Val Thr Gln Lys Ser Tyr Lys Val Ser Thr Ser Gly1 5 10 15Pro Arg Ala Phe Ser Ser Arg Ser Tyr Thr Ser Gly Pro Gly Ser Arg 20 25 30Ile Ser Ser Ser Ser Phe Ser Arg Val Gly Ser Ser Asn Phe Arg Gly 35 40 45Gly Leu Gly Gly Gly Tyr Gly Gly Ala Ser Gly Met Gly Gly Ile Thr 50 55 60Ala Val Thr Val Asn Gln Ser Leu Leu Ser Pro Leu Val Leu Glu Val65 70 75 80Asp Pro Asn Ile Gln Ala Val Arg Thr Gln Glu Lys Glu Gln Ile Lys 85 90 95Thr Leu Asn Asn Lys Phe Ala Ser Phe Ile Asp Lys Val Arg Phe Leu 100 105 110Glu Gln Gln Asn Lys Met Leu Glu Thr Lys Trp Ser Leu Leu Gln Gln 115 120 125Gln Lys Thr Ala Arg Ser Asn Met Asp Asn Met Phe Glu Ser Tyr Ile 130 135 140Asn Asn Leu Arg Arg Gln Leu Glu Thr Leu Gly Gln Glu Lys Leu Lys145 150 155 160Leu Glu Ala Glu Leu Gly Asn Met Gln Gly Leu Val Glu Asp Phe Lys 165 170 175Asn Lys Tyr Glu Asp Glu Ile Asn Lys Arg Thr Glu Met Glu Asn Glu 180 185 190Phe Val Leu Ile Lys Lys Asp Val Asp Glu Ala Tyr Met Asn Lys Val 195 200 205Glu Leu Glu Ser Arg Leu Glu Gly Leu Thr Asp Glu Ile Asn Phe Leu 210 215 220Arg Gln Leu Tyr Glu Glu Glu Ile Arg Glu Leu Gln Ser Gln Ile Ser225 230 235 240Asp Thr Ser Val Val Leu Ser Met Asp Asn Ser Arg Ser Leu Asp Met 245 250 255Asp Ser Ile Ile Ala Glu Val Lys Ala Gln Tyr Glu Asp Ile Ala Asn 260 265 270Arg Ser Arg Ala Glu Ala Glu Ser Met Tyr Gln Ile Lys Tyr Glu Glu 275 280 285Leu Gln Ser Leu Ala Gly Lys His Gly Asp Asp Leu Arg Arg Thr Lys 290 295 300Thr Glu Ile Ser Glu Met Asn Arg Asn Ile Ser Arg Leu Gln Ala Glu305 310 315 320Ile Glu Gly Leu Lys Gly Gln Arg Ala Ser Leu Glu Ala Ala Ile Ala 325 330 335Asp Ala Glu Gln Arg Gly Glu Leu Ala Ile Lys Asp Ala Asn 340 345 35015197PRTHomo sapiens 15Arg Asn Leu Pro Val Ala Thr Pro Asp Pro Gly Met Phe Pro Cys Leu1 5 10 15His His Ser Gln Asn Leu Leu Arg Ala Val Ser Asn Met Leu Gln Lys 20 25 30Ala Arg Gln Thr Leu Glu Phe Tyr Pro Cys Thr Ser Glu Glu Ile Asp 35 40 45His Glu Asp Ile Thr Lys Asp Lys Thr Ser Thr Val Glu Ala Cys Leu 50 55 60Pro Leu Glu Leu Thr Lys Asn Glu Ser Cys Leu Asn Ser Arg Glu Thr65 70 75 80Ser Phe Ile Thr Asn Gly Ser Cys Leu Ala Ser Arg Lys Thr Ser Phe 85 90 95Met Met Ala Leu Cys Leu Ser Ser Ile Tyr Glu Asp Leu Lys Met Tyr 100 105 110Gln Val Glu Phe Lys Thr Met Asn Ala Lys Leu Leu Met Asp Pro Lys 115 120 125Arg Gln Ile Phe Leu Asp Gln Asn Met Leu Ala Val Ile Asp Glu Leu 130 135 140Met Gln Ala Leu Asn Phe Asn Ser Glu Thr Val Pro Gln Lys Ser Ser145 150 155 160Leu Glu Glu Pro Asp Phe Tyr Lys Thr Lys Ile Lys Leu Cys Ile Leu 165 170 175Leu His Ala Phe Arg Ile Arg Ala Val Thr Ile Asp Arg Val Met Ser 180 185 190Tyr Leu Asn Ala Ser 19516102PRTHomo sapiens 16Met Glu Lys Leu Leu Cys Phe Leu Val Leu Thr Ser Leu Ser His Ala1

5 10 15Phe Gly Gln Thr Asp Met Ser Arg Lys Ala Phe Val Phe Pro Lys Glu 20 25 30Ser Asp Thr Ser Tyr Val Ser Leu Lys Ala Pro Leu Thr Lys Pro Leu 35 40 45Lys Ala Phe Thr Val Cys Leu His Phe Tyr Thr Glu Leu Ser Ser Thr 50 55 60His Glu Ile Asn Thr Ile Tyr Leu Gly Gly Pro Phe Ser Pro Asn Val65 70 75 80Leu Asn Trp Arg Ala Leu Lys Tyr Glu Val Gln Gly Glu Val Phe Thr 85 90 95Lys Pro Gln Leu Trp Pro 10017247PRTHomo sapiens 17Met Glu Glu Gln Pro Gln Met Gln Asp Ala Asp Glu Pro Ala Asp Ser1 5 10 15Gly Gly Glu Gly Arg Ala Gly Gly Pro Pro Gln Val Ala Gly Ala Gln 20 25 30Ala Ala Cys Ser Glu Asp Arg Met Thr Leu Leu Leu Arg Leu Arg Ala 35 40 45Gln Thr Lys Gln Gln Leu Leu Glu Tyr Lys Ser Met Val Asp Ala Ser 50 55 60Glu Glu Lys Thr Pro Glu Gln Ile Met Gln Glu Lys Gln Ile Glu Ala65 70 75 80Lys Ile Glu Asp Leu Glu Asn Glu Ile Glu Glu Val Lys Val Ala Phe 85 90 95Glu Ile Lys Lys Leu Ala Leu Asp Arg Met Arg Leu Ser Thr Ala Leu 100 105 110Lys Lys Asn Leu Glu Lys Ile Ser Arg Gln Ser Ser Val Leu Met Asp 115 120 125Asn Met Lys His Leu Leu Glu Leu Asn Lys Leu Ile Met Lys Ser Gln 130 135 140Gln Glu Ser Trp Asp Leu Glu Glu Lys Leu Leu Asp Ile Arg Lys Lys145 150 155 160Arg Leu Gln Leu Lys Gln Ala Ser Glu Ser Lys Leu Leu Glu Ile Gln 165 170 175Thr Glu Lys Asn Lys Gln Lys Ile Asp Leu Asp Ser Met Glu Asn Ser 180 185 190Glu Arg Ile Lys Ile Ile Arg Gln Asn Leu Gln Met Glu Ile Lys Ile 195 200 205Thr Thr Val Ile Gln His Val Phe Gln Asn Leu Ile Leu Gly Ser Lys 210 215 220Val Asn Trp Ala Glu Asp Pro Ala Leu Lys Glu Ile Val Leu Gln Leu225 230 235 240Glu Lys Asn Val Asp Met Met 24518222PRTHomo sapiens 18Met Gly Lys Gly Asp Pro Lys Lys Pro Arg Gly Lys Met Ser Ser Tyr1 5 10 15Ala Phe Phe Val Gln Thr Cys Arg Glu Glu His Lys Lys Lys His Pro 20 25 30Asp Ala Ser Val Asn Phe Ser Glu Phe Ser Lys Lys Cys Ser Glu Arg 35 40 45Trp Lys Thr Met Ser Ala Lys Glu Lys Gly Lys Phe Glu Asp Met Ala 50 55 60Lys Ala Asp Lys Ala Arg Tyr Glu Arg Glu Met Lys Thr Tyr Ile Pro65 70 75 80Pro Lys Gly Glu Thr Lys Lys Lys Phe Lys Asp Pro Asn Ala Pro Lys 85 90 95Arg Pro Pro Ser Ala Phe Phe Leu Phe Cys Ser Glu Tyr Arg Pro Lys 100 105 110Ile Lys Gly Glu His Pro Gly Leu Ser Ile Gly Asp Val Ala Lys Lys 115 120 125Leu Gly Glu Met Trp Asn Asn Thr Ala Ala Asp Asp Lys Gln Pro Tyr 130 135 140Glu Lys Lys Ala Ala Lys Leu Lys Glu Lys Tyr Glu Lys Asp Ile Ala145 150 155 160Ala Tyr Arg Ala Lys Gly Lys Pro Asp Ala Ala Lys Lys Gly Val Val 165 170 175Lys Ala Glu Lys Ser Lys Lys Lys Glu Glu Glu Glu Asp Glu Glu Asp 180 185 190Glu Glu Asp Glu Glu Glu Glu Glu Asp Glu Glu Asp Glu Asp Glu Glu 195 200 205Glu Asp Asp Asp Asp Glu Leu Glu His His His His His His 210 215 22019933PRTHomo sapiens 19Met Arg Ala Leu Ala Val Leu Ser Val Thr Leu Val Met Ala Cys Thr1 5 10 15Glu Ala Phe Phe Pro Phe Ile Ser Arg Gly Lys Glu Leu Leu Trp Gly 20 25 30Lys Pro Glu Glu Ser Arg Val Ser Ser Val Leu Glu Glu Ser Lys Arg 35 40 45Leu Val Asp Thr Ala Met Tyr Ala Thr Met Gln Arg Asn Leu Lys Lys 50 55 60Arg Gly Ile Leu Ser Pro Ala Gln Leu Leu Ser Phe Ser Lys Leu Pro65 70 75 80Glu Pro Thr Ser Gly Val Ile Ala Arg Ala Ala Glu Ile Met Glu Thr 85 90 95Ser Ile Gln Ala Met Lys Arg Lys Val Asn Leu Lys Thr Gln Gln Ser 100 105 110Gln His Pro Thr Asp Ala Leu Ser Glu Asp Leu Leu Ser Ile Ile Ala 115 120 125Asn Met Ser Gly Cys Leu Pro Tyr Met Leu Pro Pro Lys Cys Pro Asn 130 135 140Thr Cys Leu Ala Asn Lys Tyr Arg Pro Ile Thr Gly Ala Cys Asn Asn145 150 155 160Arg Asp His Pro Arg Trp Gly Ala Ser Asn Thr Ala Leu Ala Arg Trp 165 170 175Leu Pro Pro Val Tyr Glu Asp Gly Phe Ser Gln Pro Arg Gly Trp Asn 180 185 190Pro Gly Phe Leu Tyr Asn Gly Phe Pro Leu Pro Pro Val Arg Glu Val 195 200 205Thr Arg His Val Ile Gln Val Ser Asn Glu Val Val Thr Asp Asp Asp 210 215 220Arg Tyr Ser Asp Leu Leu Met Ala Trp Gly Gln Tyr Ile Asp His Asp225 230 235 240Ile Ala Phe Thr Pro Gln Ser Thr Ser Lys Ala Ala Phe Gly Gly Gly 245 250 255Ala Asp Cys Gln Met Thr Cys Glu Asn Gln Asn Pro Cys Phe Pro Ile 260 265 270Gln Leu Pro Glu Glu Ala Arg Pro Ala Ala Gly Thr Ala Cys Leu Pro 275 280 285Phe Tyr Arg Ser Ser Ala Ala Cys Gly Thr Gly Asp Gln Gly Ala Leu 290 295 300Phe Gly Asn Leu Ser Thr Ala Asn Pro Arg Gln Gln Met Asn Gly Leu305 310 315 320Thr Ser Phe Leu Asp Ala Ser Thr Val Tyr Gly Ser Ser Pro Ala Leu 325 330 335Glu Arg Gln Leu Arg Asn Trp Thr Ser Ala Glu Gly Leu Leu Arg Val 340 345 350His Ala Arg Leu Arg Asp Ser Gly Arg Ala Tyr Leu Pro Phe Val Pro 355 360 365Pro Arg Ala Pro Ala Ala Cys Ala Pro Glu Pro Gly Ile Pro Gly Glu 370 375 380Thr Arg Gly Pro Cys Phe Leu Ala Gly Asp Gly Arg Ala Ser Glu Val385 390 395 400Pro Ser Leu Thr Ala Leu His Thr Leu Trp Leu Arg Glu His Asn Arg 405 410 415Leu Ala Ala Ala Leu Lys Ala Leu Asn Ala His Trp Ser Ala Asp Ala 420 425 430Val Tyr Gln Glu Ala Arg Lys Val Val Gly Ala Leu His Gln Ile Ile 435 440 445Thr Leu Arg Asp Tyr Ile Pro Arg Ile Leu Gly Pro Glu Ala Phe Gln 450 455 460Gln Tyr Val Gly Pro Tyr Glu Gly Tyr Asp Ser Thr Ala Asn Pro Thr465 470 475 480Val Ser Asn Val Phe Ser Thr Ala Ala Phe Arg Phe Gly His Ala Thr 485 490 495Ile His Pro Leu Val Arg Arg Leu Asp Ala Ser Phe Gln Glu His Pro 500 505 510Asp Leu Pro Gly Leu Trp Leu His Gln Ala Phe Phe Ser Pro Trp Thr 515 520 525Leu Leu Arg Gly Gly Gly Leu Asp Pro Leu Ile Arg Gly Leu Leu Ala 530 535 540Arg Pro Ala Lys Leu Gln Val Gln Asp Gln Leu Met Asn Glu Glu Leu545 550 555 560Thr Glu Arg Leu Phe Val Leu Ser Asn Ser Ser Thr Leu Asp Leu Ala 565 570 575Ser Ile Asn Leu Gln Arg Gly Arg Asp His Gly Leu Pro Gly Tyr Asn 580 585 590Glu Trp Arg Glu Phe Cys Gly Leu Pro Arg Leu Glu Thr Pro Ala Asp 595 600 605Leu Ser Thr Ala Ile Ala Ser Arg Ser Val Ala Asp Lys Ile Leu Asp 610 615 620Leu Tyr Lys His Pro Asp Asn Ile Asp Val Trp Leu Gly Gly Leu Ala625 630 635 640Glu Asn Phe Leu Pro Arg Ala Arg Thr Gly Pro Leu Phe Ala Cys Leu 645 650 655Ile Gly Lys Gln Met Lys Ala Leu Arg Asp Gly Asp Trp Phe Trp Trp 660 665 670Glu Asn Ser His Val Phe Thr Asp Ala Gln Arg Arg Glu Leu Glu Lys 675 680 685His Ser Leu Ser Arg Val Ile Cys Asp Asn Thr Gly Leu Thr Arg Val 690 695 700Pro Met Asp Ala Phe Gln Val Gly Lys Phe Pro Glu Asp Phe Glu Ser705 710 715 720Cys Asp Ser Ile Thr Gly Met Asn Leu Glu Ala Trp Arg Glu Thr Phe 725 730 735Pro Gln Asp Asp Lys Cys Gly Phe Pro Glu Ser Val Glu Asn Gly Asp 740 745 750Phe Val His Cys Glu Glu Ser Gly Arg Arg Val Leu Val Tyr Ser Cys 755 760 765Arg His Gly Tyr Glu Leu Gln Gly Arg Glu Gln Leu Thr Cys Thr Gln 770 775 780Glu Gly Trp Asp Phe Gln Pro Pro Leu Cys Lys Asp Val Asn Glu Cys785 790 795 800Ala Asp Gly Ala His Pro Pro Cys His Ala Ser Ala Arg Cys Arg Asn 805 810 815Thr Lys Gly Gly Phe Gln Cys Leu Cys Ala Asp Pro Tyr Glu Leu Gly 820 825 830Asp Asp Gly Arg Thr Cys Val Asp Ser Gly Arg Leu Pro Arg Val Thr 835 840 845Trp Ile Ser Met Ser Leu Ala Ala Leu Leu Ile Gly Gly Phe Ala Gly 850 855 860Leu Thr Ser Thr Val Ile Cys Arg Trp Thr Arg Thr Gly Thr Lys Ser865 870 875 880Thr Leu Pro Ile Ser Glu Thr Gly Gly Gly Thr Pro Glu Leu Arg Cys 885 890 895Gly Lys His Gln Ala Val Gly Thr Ser Pro Gln Arg Ala Ala Ala Gln 900 905 910Asp Ser Glu Gln Glu Ser Ala Gly Met Glu Gly Arg Asp Thr His Arg 915 920 925Leu Pro Arg Ala Leu 93020119PRTHomo sapiens 20Met Ser Arg Ser Val Ala Leu Ala Val Leu Ala Leu Leu Ser Leu Ser1 5 10 15Gly Leu Glu Ala Ile Gln Arg Thr Pro Lys Ile Gln Val Tyr Ser Arg 20 25 30His Pro Ala Glu Asn Gly Lys Ser Asn Phe Leu Asn Cys Tyr Val Ser 35 40 45Gly Phe His Pro Ser Asp Ile Glu Val Asp Leu Leu Lys Asn Gly Glu 50 55 60Arg Ile Glu Lys Val Glu His Ser Asp Leu Ser Phe Ser Lys Asp Trp65 70 75 80Ser Phe Tyr Leu Leu Tyr Tyr Thr Glu Phe Thr Pro Thr Glu Lys Asp 85 90 95Glu Tyr Ala Cys Arg Val Asn His Val Thr Leu Ser Gln Pro Lys Ile 100 105 110Val Lys Trp Asp Arg Asp Met 11521692PRTHomo sapiens 21Gln Arg Lys Arg Arg Asn Thr Ile His Glu Phe Lys Lys Ser Ala Lys1 5 10 15Thr Thr Leu Ile Lys Ile Asp Pro Ala Leu Lys Ile Lys Thr Lys Lys 20 25 30Val Asn Thr Ala Asp Gln Cys Ala Asn Arg Cys Thr Arg Asn Lys Gly 35 40 45Leu Pro Phe Thr Cys Lys Ala Phe Val Phe Asp Lys Ala Arg Lys Gln 50 55 60Cys Leu Trp Phe Pro Phe Asn Ser Met Ser Ser Gly Val Lys Lys Glu65 70 75 80Phe Gly His Glu Phe Asp Leu Tyr Glu Asn Lys Asp Tyr Ile Arg Asn 85 90 95Cys Ile Ile Gly Lys Gly Arg Ser Tyr Lys Gly Thr Val Ser Ile Thr 100 105 110Lys Ser Gly Ile Lys Cys Gln Pro Trp Ser Ser Met Ile Pro His Glu 115 120 125His Ser Tyr Arg Gly Lys Asp Leu Gln Glu Asn Tyr Cys Arg Asn Pro 130 135 140Arg Gly Glu Glu Gly Gly Pro Trp Cys Phe Thr Ser Asn Pro Glu Val145 150 155 160Arg Tyr Glu Val Cys Asp Ile Pro Gln Cys Ser Glu Val Glu Cys Met 165 170 175Thr Cys Asn Gly Glu Ser Tyr Arg Gly Leu Met Asp His Thr Glu Ser 180 185 190Gly Lys Ile Cys Gln Arg Trp Asp His Gln Thr Pro His Arg His Lys 195 200 205Phe Leu Pro Glu Arg Tyr Pro Asp Lys Gly Phe Asp Asp Asn Tyr Cys 210 215 220Arg Asn Pro Asp Gly Gln Pro Arg Pro Trp Cys Tyr Thr Leu Asp Pro225 230 235 240His Thr Arg Trp Glu Tyr Cys Ala Ile Lys Thr Cys Ala Asp Asn Thr 245 250 255Met Asn Asp Thr Asp Val Pro Leu Glu Thr Thr Glu Cys Ile Gln Gly 260 265 270Gln Gly Glu Gly Tyr Arg Gly Thr Val Asn Thr Ile Trp Asn Gly Ile 275 280 285Pro Cys Gln Arg Trp Asp Ser Gln Tyr Pro His Glu His Asp Met Thr 290 295 300Pro Glu Asn Phe Lys Cys Lys Asp Leu Arg Glu Asn Tyr Cys Arg Asn305 310 315 320Pro Asp Gly Ser Glu Ser Pro Trp Cys Phe Thr Thr Asp Pro Asn Ile 325 330 335Arg Val Gly Tyr Cys Ser Gln Ile Pro Asn Cys Asp Met Ser His Gly 340 345 350Gln Asp Cys Tyr Arg Gly Asn Gly Lys Asn Tyr Met Gly Asn Leu Ser 355 360 365Gln Thr Arg Ser Gly Leu Thr Cys Ser Met Trp Asp Lys Asn Met Glu 370 375 380Asp Leu His Arg His Ile Phe Trp Glu Pro Asp Ala Ser Lys Leu Asn385 390 395 400Glu Asn Tyr Cys Arg Asn Pro Asp Asp Asp Ala His Gly Pro Trp Cys 405 410 415Tyr Thr Gly Asn Pro Leu Ile Pro Trp Asp Tyr Cys Pro Ile Ser Arg 420 425 430Cys Glu Gly Asp Thr Thr Pro Thr Ile Val Asn Leu Asp His Pro Val 435 440 445Ile Ser Cys Ala Lys Thr Lys Gln Leu Arg Val Val Asn Gly Ile Pro 450 455 460Thr Arg Thr Asn Ile Gly Trp Met Val Ser Leu Arg Tyr Arg Asn Lys465 470 475 480His Ile Cys Gly Gly Ser Leu Ile Lys Glu Ser Trp Val Leu Thr Ala 485 490 495Arg Gln Cys Phe Pro Ser Arg Asp Leu Lys Asp Tyr Glu Ala Trp Leu 500 505 510Gly Ile His Asp Val His Gly Arg Gly Asp Glu Lys Cys Lys Gln Val 515 520 525Leu Asn Val Ser Gln Leu Val Tyr Gly Pro Glu Gly Ser Asp Leu Val 530 535 540Leu Met Lys Leu Ala Arg Pro Ala Val Leu Asp Asp Phe Val Ser Thr545 550 555 560Ile Asp Leu Pro Asn Tyr Gly Cys Thr Ile Pro Glu Lys Thr Ser Cys 565 570 575Ser Val Tyr Gly Trp Gly Tyr Thr Gly Leu Ile Asn Tyr Asp Gly Leu 580 585 590Leu Arg Val Ala His Leu Tyr Ile Met Gly Asn Glu Lys Cys Ser Gln 595 600 605His His Arg Gly Lys Val Thr Leu Asn Glu Ser Glu Ile Cys Ala Gly 610 615 620Ala Glu Lys Ile Gly Ser Gly Pro Cys Glu Gly Asp Tyr Gly Gly Pro625 630 635 640Leu Val Cys Glu Gln His Lys Met Arg Met Val Leu Gly Val Ile Val 645 650 655Pro Gly Arg Gly Cys Ala Ile Pro Asn Arg Pro Gly Ile Phe Val Arg 660 665 670Val Ala Tyr Tyr Ala Lys Trp Ile His Lys Ile Ile Leu Thr Tyr Lys 675 680 685Val Pro Gln Ser 6902220DNAArtificial Sequencesynthetic 22ccataattgc aaagcttctg 2023154PRTHomo sapiens 23Met Lys Gly Arg Lys Thr Arg Ala Arg Arg Ala Ile Ala Ala His Tyr1 5 10 15Glu Val His Pro Arg Pro Gly Gln Asp Gly Ala Gln Ala Gly Val Asp 20 25 30Gly Thr Val Ser Gly Trp Glu Glu Ala Arg Ile Asn Ser Ser Ser Pro 35 40 45Leu Arg Tyr Asn Arg Gln Ile Gly Glu Phe Ile Val Thr Arg Ala Gly 50 55 60Leu Tyr Tyr Leu Tyr Cys Gln Val His Phe Asp Glu Gly Lys Ala Val65 70 75 80Tyr Leu Lys Leu Asp Leu Leu Val Asp Gly Val Leu Ala Leu Arg Cys 85 90 95Leu Glu Glu Phe Ser Ala Thr Ala Ala Ser Ser Leu Gly Pro Gln Leu 100 105 110Arg Leu Cys Gln Val Ser Gly Leu Leu Ala Leu Arg Pro Gly Ser Ser 115 120 125Leu Arg Ile Arg Thr Leu Pro Trp Ala His Leu Lys Ala Ala Pro Phe 130 135 140Leu Thr Tyr Phe Gly

Leu Phe Gln Val His145 15024106PRTHomo sapiens 24Met Gly Ser Ser His His His His His His Ser Ser Gly Leu Val Pro1 5 10 15Arg Gly Ser His Met Ser Leu Pro Leu Asn Pro Lys Pro Phe Leu Asn 20 25 30Gly Leu Thr Gly Lys Pro Val Met Val Lys Leu Lys Trp Gly Met Glu 35 40 45Tyr Lys Gly Tyr Leu Val Ser Val Asp Gly Tyr Met Asn Met Gln Leu 50 55 60Ala Asn Thr Glu Glu Tyr Ile Asp Gly Ala Leu Ser Gly His Leu Gly65 70 75 80Glu Val Leu Ile Arg Cys Asn Asn Val Leu Tyr Ile Arg Gly Val Glu 85 90 95Glu Glu Glu Glu Asp Gly Glu Met Arg Glu 100 10525240PRTHomo sapiens 25Met Arg Gly Ser His His His His His His Gly Met Ala Ser Met Thr1 5 10 15Gly Gly Gln Gln Met Gly Arg Asp Leu Tyr Asp Asp Asp Asp Lys Asp 20 25 30Arg Trp Gly Ser Met Ala Lys Val Gln Val Asn Asn Val Val Val Leu 35 40 45Asp Asn Pro Ser Pro Phe Tyr Asn Pro Phe Gln Phe Glu Ile Thr Phe 50 55 60Glu Cys Ile Glu Asp Leu Ser Glu Asp Leu Glu Trp Lys Ile Ile Tyr65 70 75 80Val Gly Ser Ala Glu Ser Glu Glu Tyr Asp Gln Val Leu Asp Ser Val 85 90 95Leu Val Gly Pro Val Pro Ala Gly Arg His Met Phe Val Phe Gln Ala 100 105 110Asp Ala Pro Asn Pro Gly Leu Ile Pro Asp Ala Asp Ala Val Gly Val 115 120 125Thr Val Val Leu Ile Thr Cys Thr Tyr Arg Gly Gln Glu Phe Ile Arg 130 135 140Val Gly Tyr Tyr Val Asn Asn Glu Tyr Thr Glu Thr Glu Leu Arg Glu145 150 155 160Asn Pro Pro Val Lys Pro Asp Phe Ser Lys Leu Gln Arg Asn Ile Leu 165 170 175Ala Ser Asn Pro Arg Val Thr Arg Phe His Ile Asn Trp Glu Asp Asn 180 185 190Thr Glu Lys Leu Glu Asp Ala Glu Ser Ser Asn Pro Asn Leu Gln Ser 195 200 205Leu Leu Ser Thr Asp Ala Leu Pro Ser Ala Ser Lys Gly Trp Ser Thr 210 215 220Ser Glu Asn Ser Leu Asn Val Met Leu Glu Ser His Met Asp Cys Met225 230 235 24026893PRTGallus gallus 26Met Asp His His Tyr Asp Pro Gln Gln Thr Asn Asp Tyr Met Gln Pro1 5 10 15Glu Glu Asp Trp Asp Arg Asp Leu Leu Leu Asp Pro Ala Trp Glu Lys 20 25 30Gln Gln Arg Lys Thr Phe Thr Ala Trp Cys Asn Ser His Leu Arg Lys 35 40 45Ala Gly Thr Gln Ile Glu Asn Ile Glu Glu Asp Phe Arg Asp Gly Leu 50 55 60Lys Leu Met Leu Leu Leu Glu Val Ile Ser Gly Glu Arg Leu Ala Lys65 70 75 80Pro Glu Arg Gly Lys Met Arg Val His Lys Ile Ser Asn Val Asn Lys 85 90 95Ala Leu Asp Phe Ile Ala Ser Lys Gly Val Lys Leu Val Ser Ile Gly 100 105 110Ala Glu Glu Ile Val Asp Gly Asn Val Lys Met Thr Leu Gly Met Ile 115 120 125Trp Thr Ile Ile Leu Arg Phe Ala Ile Gln Asp Ile Ser Val Glu Glu 130 135 140Thr Ser Ala Lys Glu Gly Leu Leu Leu Trp Cys Gln Arg Lys Thr Ala145 150 155 160Pro Tyr Lys Asn Val Asn Ile Gln Asn Phe His Ile Ser Trp Lys Asp 165 170 175Gly Leu Gly Phe Cys Ala Leu Ile His Arg His Arg Pro Glu Leu Ile 180 185 190Asp Tyr Gly Lys Leu Arg Lys Asp Asp Pro Leu Thr Asn Leu Asn Thr 195 200 205Ala Phe Asp Val Ala Glu Lys Tyr Leu Asp Ile Pro Lys Met Leu Asp 210 215 220Ala Glu Asp Ile Val Gly Thr Ala Arg Pro Asp Glu Lys Ala Ile Met225 230 235 240Thr Tyr Val Ser Ser Phe Tyr His Ala Phe Ser Gly Ala Gln Lys Ala 245 250 255Glu Thr Ala Ala Asn Arg Ile Cys Lys Val Leu Ala Val Asn Gln Glu 260 265 270Asn Glu Gln Leu Met Glu Asp Tyr Glu Lys Leu Ala Ser Asp Leu Leu 275 280 285Glu Trp Ile Arg Arg Thr Ile Pro Trp Leu Glu Asn Arg Ala Pro Glu 290 295 300Asn Thr Met Gln Ala Met Gln Gln Lys Leu Glu Asp Phe Arg Asp Tyr305 310 315 320Arg Arg Leu His Lys Pro Pro Lys Val Gln Glu Lys Cys Gln Leu Glu 325 330 335Ile Asn Phe Asn Thr Leu Gln Thr Lys Leu Arg Leu Ser Asn Arg Pro 340 345 350Ala Phe Met Pro Ser Glu Gly Lys Met Val Ser Asp Ile Asn Asn Ala 355 360 365Trp Gly Gly Leu Glu Gln Ala Glu Lys Gly Tyr Glu Glu Trp Leu Leu 370 375 380Asn Glu Ile Arg Arg Leu Glu Arg Leu Asp His Leu Ala Glu Lys Phe385 390 395 400Arg Gln Lys Ala Ser Ile His Glu Ser Trp Thr Asp Gly Lys Glu Ala 405 410 415Met Leu Gln Gln Lys Asp Tyr Glu Thr Ala Thr Leu Ser Glu Ile Lys 420 425 430Ala Leu Leu Lys Lys His Glu Ala Phe Glu Ser Asp Leu Ala Ala His 435 440 445Gln Asp Arg Val Glu Gln Ile Ala Ala Ile Ala Gln Glu Leu Asn Glu 450 455 460Leu Asp Tyr Tyr Asp Ser Pro Ser Val Asn Ala Arg Cys Gln Lys Ile465 470 475 480Cys Asp Gln Trp Asp Asn Leu Gly Ala Leu Thr Gln Lys Arg Arg Glu 485 490 495Ala Leu Glu Arg Thr Glu Lys Leu Leu Glu Thr Ile Asp Gln Leu Tyr 500 505 510Leu Glu Tyr Ala Lys Arg Ala Ala Pro Phe Asn Asn Trp Met Glu Gly 515 520 525Ala Met Glu Asp Leu Gln Asp Thr Phe Ile Val His Thr Ile Glu Glu 530 535 540Ile Gln Gly Leu Thr Thr Ala His Glu Gln Phe Lys Ala Thr Leu Pro545 550 555 560Asp Ala Asp Lys Glu Arg Gln Ala Ile Leu Gly Ile His Asn Glu Val 565 570 575Ser Lys Ile Val Gln Thr Tyr His Val Asn Met Ala Gly Thr Asn Pro 580 585 590Tyr Thr Thr Ile Thr Pro Gln Glu Ile Asn Gly Lys Trp Glu His Val 595 600 605Arg Gln Leu Val Pro Arg Arg Asp Gln Ala Leu Met Glu Glu His Ala 610 615 620Arg Gln Gln Gln Asn Glu Arg Leu Arg Lys Gln Phe Gly Ala Gln Ala625 630 635 640Asn Val Ile Gly Pro Trp Ile Gln Thr Lys Met Glu Glu Ile Gly Arg 645 650 655Ile Ser Ile Glu Met His Gly Thr Leu Glu Asp Gln Leu Asn His Leu 660 665 670Arg Gln Tyr Glu Lys Ser Ile Val Asn Tyr Lys Pro Lys Ile Asp Gln 675 680 685Leu Glu Gly Asp His Gln Gln Ile Gln Glu Ala Leu Ile Phe Asp Asn 690 695 700Lys His Thr Asn Tyr Thr Met Glu His Ile Arg Val Gly Trp Glu Gln705 710 715 720Leu Leu Thr Thr Ile Ala Arg Thr Ile Asn Glu Val Glu Asn Gln Ile 725 730 735Leu Thr Arg Asp Ala Lys Gly Ile Ser Gln Glu Gln Met Asn Glu Phe 740 745 750Arg Ala Ser Phe Asn His Phe Asp Arg Asp His Ser Gly Thr Leu Gly 755 760 765Pro Glu Glu Phe Lys Ala Cys Leu Ile Ser Leu Gly Tyr Asp Ile Gly 770 775 780Asn Asp Ala Gln Gly Glu Ala Glu Phe Ala Arg Ile Met Ser Ile Val785 790 795 800Asp Pro Asn Arg Met Gly Val Val Thr Phe Gln Ala Phe Ile Asp Phe 805 810 815Met Ser Arg Glu Thr Ala Asp Thr Asp Thr Ala Asp Gln Val Met Ala 820 825 830Ser Phe Lys Ile Leu Ala Gly Asp Lys Asn Tyr Ile Thr Val Asp Glu 835 840 845Leu Arg Arg Glu Leu Pro Pro Asp Gln Ala Glu Tyr Cys Ile Ala Arg 850 855 860Met Ala Pro Tyr Asn Gly Arg Asp Ala Val Pro Gly Ala Leu Asp Tyr865 870 875 880Met Ser Phe Ser Thr Ala Leu Tyr Gly Glu Ser Asp Leu 885 89027578PRTToxoplasma gondii 27Met Gly His Pro Thr Ser Phe Gly Gln Pro Ser Cys Leu Val Trp Leu1 5 10 15Ala Ala Ala Phe Leu Val Leu Gly Leu Cys Leu Val Gln Gln Gly Ala 20 25 30Gly Arg Gln Arg Pro His Gln Trp Lys Ser Ser Glu Ala Ala Leu Ser 35 40 45Val Ser Pro Ala Gly Asp Ile Val Asp Lys Tyr Ser Arg Asp Ser Thr 50 55 60Glu Gly Glu Asn Thr Val Ser Glu Gly Glu Ala Glu Gly Ser Arg Gly65 70 75 80Gly Ser Trp Leu Glu Gln Glu Gly Val Glu Leu Arg Ser Pro Ser Gln 85 90 95Asp Ser Gln Thr Gly Thr Ser Thr Ala Ser Pro Thr Gly Phe Arg Arg 100 105 110Leu Leu Arg Arg Leu Arg Phe Trp Arg Arg Gly Ser Thr Arg Gly Ser 115 120 125Asp Asp Ala Ala Glu Val Ser Arg Arg Thr Arg Val Pro Leu His Thr 130 135 140Arg Leu Leu Gln His Leu Arg Arg Val Ala Arg Ile Ile Arg His Gly145 150 155 160Val Ser Ala Ala Ala Gly Arg Leu Phe Gly Arg Val Arg Gln Val Glu 165 170 175Ala Glu Arg Pro Gln Pro Val Phe Thr Glu Gly Asp Pro Pro Asp Leu 180 185 190Glu Thr Asn Ser Leu Tyr Tyr Arg Asp Lys Val Pro Gly Gln Gly Ile 195 200 205Ile Gln Glu Ile Leu Arg Gln Lys Pro Gly Ile Ala His His Pro Glu 210 215 220Ser Phe Ser Val Val Ala Ala Asp Glu Arg Val Ser Arg Thr Leu Trp225 230 235 240Ala Glu Gly Gly Val Val Arg Val Ala Ser Glu Leu Gly Gln Pro Gly 245 250 255Arg Val Leu Val Arg Gly Arg Arg Ile Gly Leu Phe Arg Pro Gly Met 260 265 270Gln Phe Glu Ala Thr Asp Gln Ala Thr Gly Glu Pro Met Thr Ala Leu 275 280 285Val Gly His Thr Val Leu Glu Ala Thr Ala Arg Asp Val Asp Ser Met 290 295 300Arg Asn Glu Gly Leu Ala Val Gly Leu Phe Gln Lys Val Lys Asn Pro305 310 315 320Tyr Leu Ala Asn Arg Tyr Leu Arg Phe Leu Ala Pro Phe Asp Leu Val 325 330 335Thr Ile Pro Gly Lys Pro Leu Val Gln Lys Ala Lys Ser Arg Asn Glu 340 345 350Val Gly Trp Val Lys Asn Leu Leu Phe Leu Leu Pro Pro Thr His Val 355 360 365Asp Met Glu Thr Phe Val Asp Glu Ile Gly Arg Phe Pro Gln Glu Asp 370 375 380Arg Pro Leu Ala Asp Ala Ala Arg Leu Tyr Leu Thr Val Gln Ala Val385 390 395 400Arg Leu Val Ala His Leu Gln Asp Glu Gly Val Val His Gly Lys Ile 405 410 415Met Pro Asp Ser Phe Cys Leu Lys Arg Glu Gly Gly Leu Tyr Leu Arg 420 425 430Asp Phe Gly Ser Leu Val Arg Ala Gly Ala Lys Val Val Val Pro Ala 435 440 445Glu Tyr Asp Glu Tyr Thr Pro Pro Glu Gly Arg Ala Ala Ala Arg Ser 450 455 460Arg Phe Gly Ser Gly Ala Thr Thr Met Thr Tyr Ala Phe Asp Ala Trp465 470 475 480Thr Leu Gly Ser Val Ile Phe Leu Ile Trp Cys Ser Arg Ala Pro Asp 485 490 495Thr Lys Ser Gly Tyr Glu Tyr Ser Val Glu Phe Phe Phe Ser Arg Cys 500 505 510Arg Arg Val Pro Glu Asn Val Lys Leu Leu Val Tyr Lys Leu Ile Asn 515 520 525Pro Ser Val Glu Ala Arg Leu Leu Ala Leu Gln Ala Ile Glu Thr Pro 530 535 540Glu Tyr Arg Glu Met Glu Glu Gln Leu Ser Ala Ala Ser Arg Leu Tyr545 550 555 560Ser Gly Asp Gly Thr Leu Thr Gly Gly Asp Asp Asp Met Pro Pro Leu 565 570 575Glu Thr28578PRTHomo sapiens 28Met Gly Ser Ser His His His His His His Ser Ser Gly Leu Val Pro1 5 10 15Arg Gly Ser His Met Ala Ala Leu Thr Arg Asp Pro Gln Phe Gln Lys 20 25 30Leu Gln Gln Trp Tyr Arg Glu His Arg Ser Glu Leu Asn Leu Arg Arg 35 40 45Leu Phe Asp Ala Asn Lys Asp Arg Phe Asn His Phe Ser Leu Thr Leu 50 55 60Asn Thr Asn His Gly His Ile Leu Val Asp Tyr Ser Lys Asn Leu Val65 70 75 80Thr Glu Asp Val Met Arg Met Leu Val Asp Leu Ala Lys Ser Arg Gly 85 90 95Val Glu Ala Ala Arg Glu Arg Met Phe Asn Gly Glu Lys Ile Asn Tyr 100 105 110Thr Glu Gly Arg Ala Val Leu His Val Ala Leu Arg Asn Arg Ser Asn 115 120 125Thr Pro Ile Leu Val Asp Gly Lys Asp Val Met Pro Glu Val Asn Lys 130 135 140Val Leu Asp Lys Met Lys Ser Phe Cys Gln Arg Val Arg Ser Gly Asp145 150 155 160Trp Lys Gly Tyr Thr Gly Lys Thr Ile Thr Asp Val Ile Asn Ile Gly 165 170 175Ile Gly Gly Ser Asp Leu Gly Pro Leu Met Val Thr Glu Ala Leu Lys 180 185 190Pro Tyr Ser Ser Gly Gly Pro Arg Val Trp Tyr Val Ser Asn Ile Asp 195 200 205Gly Thr His Ile Ala Lys Thr Leu Ala Gln Leu Asn Pro Glu Ser Ser 210 215 220Leu Phe Ile Ile Ala Ser Lys Thr Phe Thr Thr Gln Glu Thr Ile Thr225 230 235 240Asn Ala Glu Thr Ala Lys Glu Trp Phe Leu Gln Ala Ala Lys Asp Pro 245 250 255Ser Ala Val Ala Lys His Phe Val Ala Leu Ser Thr Asn Thr Thr Lys 260 265 270Val Lys Glu Phe Gly Ile Asp Pro Gln Asn Met Phe Glu Phe Trp Asp 275 280 285Trp Val Gly Gly Arg Tyr Ser Leu Trp Ser Ala Ile Gly Leu Ser Ile 290 295 300Ala Leu His Val Gly Phe Asp Asn Phe Glu Gln Leu Leu Ser Gly Ala305 310 315 320His Trp Met Asp Gln His Phe Arg Thr Thr Pro Leu Glu Lys Asn Ala 325 330 335Pro Val Leu Leu Ala Leu Leu Gly Ile Trp Tyr Ile Asn Cys Phe Gly 340 345 350Cys Glu Thr His Ala Met Leu Pro Tyr Asp Gln Tyr Leu His Arg Phe 355 360 365Ala Ala Tyr Phe Gln Gln Gly Asp Met Glu Ser Asn Gly Lys Tyr Ile 370 375 380Thr Lys Ser Gly Thr Arg Val Asp His Gln Thr Gly Pro Ile Val Trp385 390 395 400Gly Glu Pro Gly Thr Asn Gly Gln His Ala Phe Tyr Gln Leu Ile His 405 410 415Gln Gly Thr Lys Met Ile Pro Cys Asp Phe Leu Ile Pro Val Gln Thr 420 425 430Gln His Pro Ile Arg Lys Gly Leu His His Lys Ile Leu Leu Ala Asn 435 440 445Phe Leu Ala Gln Thr Glu Ala Leu Met Arg Gly Lys Ser Thr Glu Glu 450 455 460Ala Arg Lys Glu Leu Gln Ala Ala Gly Lys Ser Pro Glu Asp Leu Glu465 470 475 480Arg Leu Leu Pro His Lys Val Phe Glu Gly Asn Arg Pro Thr Asn Ser 485 490 495Ile Val Phe Thr Lys Leu Thr Pro Phe Met Leu Gly Ala Leu Val Ala 500 505 510Met Tyr Glu His Lys Ile Phe Val Gln Gly Ile Ile Trp Asp Ile Asn 515 520 525Ser Phe Asp Gln Trp Gly Val Glu Leu Gly Lys Gln Leu Ala Lys Lys 530 535 540Ile Glu Pro Glu Leu Asp Gly Ser Ala Gln Val Thr Ser His Asp Ala545 550 555 560Ser Thr Asn Gly Leu Ile Asn Phe Ile Lys Gln Gln Arg Glu Ala Arg 565 570 575Val Gln29381PRTHomo sapiens 29Met Pro Phe Ser Asn Ser His Asn Ala Leu Lys Leu Arg Phe Pro Ala1 5 10 15Glu Asp Glu Phe Pro Asp Leu Ser Ala His Asn Asn His Met Ala Lys 20 25 30Val Leu Thr Pro Glu Leu Tyr Ala Glu Leu Arg Ala Lys Ser Thr Pro 35 40 45Ser Gly Phe Thr Leu Asp Asp Val Ile Gln Thr Gly Val Asp Asn

Pro 50 55 60Gly His Pro Tyr Ile Met Thr Val Gly Cys Val Ala Gly Asp Glu Glu65 70 75 80Ser Tyr Glu Val Phe Lys Asp Leu Phe Asp Pro Ile Ile Glu Asp Arg 85 90 95His Gly Gly Tyr Lys Pro Ser Asp Glu His Lys Thr Asp Leu Asn Pro 100 105 110Asp Asn Leu Gln Gly Gly Asp Asp Leu Asp Pro Asn Tyr Val Leu Ser 115 120 125Ser Arg Val Arg Thr Gly Arg Ser Ile Arg Gly Phe Cys Leu Pro Pro 130 135 140His Cys Ser Arg Gly Glu Arg Arg Ala Ile Glu Lys Leu Ala Val Glu145 150 155 160Ala Leu Ser Ser Leu Asp Gly Asp Leu Ala Gly Arg Tyr Tyr Ala Leu 165 170 175Lys Ser Met Thr Glu Ala Glu Gln Gln Gln Leu Ile Asp Asp His Phe 180 185 190Leu Phe Asp Lys Pro Val Ser Pro Leu Leu Leu Ala Ser Gly Met Ala 195 200 205Arg Asp Trp Pro Asp Ala Arg Gly Ile Trp His Asn Asp Asn Lys Thr 210 215 220Phe Leu Val Trp Val Asn Glu Glu Asp His Leu Arg Val Ile Ser Met225 230 235 240Gln Lys Gly Gly Asn Met Lys Glu Val Phe Thr Arg Phe Cys Thr Gly 245 250 255Leu Thr Gln Ile Glu Thr Leu Phe Lys Ser Lys Asp Tyr Glu Phe Met 260 265 270Trp Asn Pro His Leu Gly Tyr Ile Leu Thr Cys Pro Ser Asn Leu Gly 275 280 285Thr Gly Leu Arg Ala Gly Val His Ile Lys Leu Pro Asn Leu Gly Lys 290 295 300His Glu Lys Phe Ser Glu Val Leu Lys Arg Leu Arg Leu Gln Lys Arg305 310 315 320Gly Thr Gly Gly Val Asp Thr Ala Ala Val Gly Gly Val Phe Asp Val 325 330 335Ser Asn Ala Asp Arg Leu Gly Phe Ser Glu Val Glu Leu Val Gln Met 340 345 350Val Val Asp Gly Val Lys Leu Leu Ile Glu Met Glu Gln Arg Leu Glu 355 360 365Gln Gly Gln Ala Ile Asp Asp Leu Met Pro Ala Gln Lys 370 375 38030354PRTHomo sapiens 30Met Lys Ser Lys Leu Pro Lys Pro Val Gln Asp Leu Ile Lys Met Ile1 5 10 15Phe Asp Val Glu Ser Met Lys Lys Ala Met Val Glu Tyr Glu Ile Asp 20 25 30Leu Gln Lys Met Pro Leu Gly Lys Leu Ser Lys Arg Gln Ile Gln Ala 35 40 45Ala Tyr Ser Ile Leu Ser Glu Val Gln Gln Ala Val Ser Gln Gly Ser 50 55 60Ser Asp Ser Gln Ile Leu Asp Leu Ser Asn Arg Phe Tyr Thr Leu Ile65 70 75 80Pro His Asp Phe Gly Met Lys Lys Pro Pro Leu Leu Asn Asn Ala Asp 85 90 95Ser Val Gln Ala Lys Ala Glu Met Leu Asp Asn Leu Leu Asp Ile Glu 100 105 110Val Ala Tyr Ser Leu Leu Arg Gly Gly Ser Asp Asp Ser Ser Lys Asp 115 120 125Pro Ile Asp Val Asn Tyr Glu Lys Leu Lys Thr Asp Ile Lys Val Val 130 135 140Asp Arg Asp Ser Glu Glu Ala Glu Ile Ile Arg Lys Tyr Val Lys Asn145 150 155 160Thr His Ala Thr Thr His Asn Ala Tyr Asp Leu Glu Val Ile Asp Ile 165 170 175Phe Lys Ile Glu Arg Glu Gly Glu Cys Gln Arg Tyr Lys Pro Phe Lys 180 185 190Gln Leu His Asn Arg Arg Leu Leu Trp His Gly Ser Arg Thr Thr Asn 195 200 205Phe Ala Gly Ile Leu Ser Gln Gly Leu Arg Ile Ala Pro Pro Glu Ala 210 215 220Pro Val Thr Gly Tyr Met Phe Gly Lys Gly Ile Tyr Phe Ala Asp Met225 230 235 240Val Ser Lys Ser Ala Asn Tyr Cys His Thr Ser Gln Gly Asp Pro Ile 245 250 255Gly Leu Ile Leu Leu Gly Glu Val Ala Leu Gly Asn Met Tyr Glu Leu 260 265 270Lys His Ala Ser His Ile Ser Lys Leu Pro Lys Gly Lys His Ser Val 275 280 285Lys Gly Leu Gly Lys Thr Thr Pro Asp Pro Ser Ala Asn Ile Ser Leu 290 295 300Asp Gly Val Asp Val Pro Leu Gly Thr Gly Ile Ser Ser Gly Val Asn305 310 315 320Asp Thr Ser Leu Leu Tyr Asn Glu Tyr Ile Val Tyr Asp Ile Ala Gln 325 330 335Val Asn Leu Lys Tyr Leu Leu Lys Leu Lys Phe Asn Phe Lys Thr Ser 340 345 350Leu Trp31161PRTHomo sapiens 31Met Asp Asp Ile Tyr Lys Ala Ala Val Glu Gln Leu Thr Glu Glu Gln1 5 10 15Lys Asn Glu Phe Lys Ala Ala Phe Asp Ile Phe Val Leu Gly Ala Glu 20 25 30Asp Gly Cys Ile Ser Thr Lys Glu Leu Gly Lys Val Met Arg Met Leu 35 40 45Gly Gln Asn Pro Thr Pro Glu Glu Leu Gln Glu Met Ile Asp Glu Val 50 55 60Asp Glu Asp Gly Ser Gly Thr Val Asp Phe Asp Glu Phe Leu Val Met65 70 75 80Met Val Arg Cys Met Lys Asp Asp Ser Lys Gly Lys Ser Glu Glu Glu 85 90 95Leu Ser Asp Leu Phe Arg Met Phe Asp Lys Asn Ala Asp Gly Tyr Ile 100 105 110Asp Leu Asp Glu Leu Lys Ile Met Leu Gln Ala Thr Gly Glu Thr Ile 115 120 125Thr Glu Asp Asp Ile Glu Glu Leu Met Lys Asp Gly Asp Lys Asn Asn 130 135 140Asp Gly Arg Ile Asp Tyr Asp Glu Phe Leu Glu Phe Met Lys Gly Val145 150 155 160Glu32471PRTHomo sapiens 32Ser Thr Arg Ser Val Ser Ser Ser Ser Tyr Arg Arg Met Phe Gly Gly1 5 10 15Pro Gly Thr Ala Ser Arg Pro Ser Ser Ser Arg Ser Tyr Val Thr Thr 20 25 30Ser Thr Arg Thr Tyr Ser Leu Gly Ser Ala Leu Arg Pro Ser Thr Ser 35 40 45Arg Ser Leu Tyr Ala Ser Ser Pro Gly Gly Val Tyr Ala Thr Arg Ser 50 55 60Ser Ala Val Arg Leu Arg Ser Ser Val Pro Gly Val Arg Leu Leu Gln65 70 75 80Asp Ser Val Asp Phe Ser Leu Ala Asp Ala Ile Asn Thr Glu Phe Lys 85 90 95Asn Thr Arg Thr Asn Glu Lys Val Glu Leu Gln Glu Leu Asn Asp Arg 100 105 110Phe Ala Asn Tyr Ile Asp Lys Val Arg Phe Leu Glu Gln Gln Asn Lys 115 120 125Ile Leu Leu Ala Glu Leu Glu Gln Leu Lys Gly Gln Gly Lys Ser Arg 130 135 140Leu Gly Asp Leu Tyr Glu Glu Glu Met Arg Glu Leu Arg Arg Gln Val145 150 155 160Asp Gln Leu Thr Asn Asp Lys Ala Arg Val Glu Val Glu Arg Asp Asn 165 170 175Leu Ala Glu Asp Ile Met Arg Leu Arg Glu Lys Leu Gln Glu Glu Met 180 185 190Leu Gln Arg Glu Glu Ala Glu Asn Thr Leu Gln Ser Phe Arg Gln Asp 195 200 205Val Asp Asn Ala Ser Leu Ala Arg Leu Asp Leu Glu Arg Lys Val Glu 210 215 220Ser Leu Gln Glu Glu Ile Ala Phe Leu Lys Lys Leu His Glu Glu Glu225 230 235 240Ile Gln Glu Leu Gln Ala Gln Ile Gln Glu Gln His Val Gln Ile Asp 245 250 255Val Asp Val Ser Lys Pro Asp Leu Thr Ala Ala Leu Arg Asp Val Arg 260 265 270Gln Gln Tyr Glu Ser Val Ala Ala Lys Asn Leu Gln Glu Ala Glu Glu 275 280 285Trp Tyr Lys Ser Lys Phe Ala Asp Leu Ser Glu Ala Ala Asn Arg Asn 290 295 300Asn Asp Ala Leu Arg Gln Ala Lys Gln Glu Ser Thr Glu Tyr Arg Arg305 310 315 320Gln Val Gln Ser Leu Thr Cys Glu Val Asp Ala Leu Lys Gly Thr Asn 325 330 335Glu Ser Leu Glu Arg Gln Met Arg Glu Met Glu Glu Asn Phe Ala Val 340 345 350Glu Ala Ala Asn Tyr Gln Asp Thr Ile Gly Arg Leu Gln Asp Glu Ile 355 360 365Gln Asn Met Lys Glu Glu Met Ala Arg His Leu Arg Glu Tyr Gln Asp 370 375 380Leu Leu Asn Val Lys Met Ala Leu Asp Ile Glu Ile Ala Thr Tyr Arg385 390 395 400Lys Leu Leu Glu Gly Glu Glu Ser Arg Ile Ser Leu Pro Leu Pro Asn 405 410 415Phe Ser Ser Leu Asn Leu Arg Glu Thr Asn Leu Asp Ser Leu Pro Leu 420 425 430Val Asp Thr His Ser Lys Arg Thr Leu Leu Ile Lys Thr Val Glu Thr 435 440 445Arg Asp Gly Gln Val Ile Asn Glu Thr Ser Gln His His Asp Asp Leu 450 455 460Glu His His His His His His465 4703320DNAArtificial Sequencesynthetic 33gagagagaga gagagagaga 2034140PRTHomo sapiens 34Met Gly Pro Arg Arg Arg Ser Arg Lys Pro Glu Ala Pro Arg Arg Arg1 5 10 15Ser Pro Ser Pro Thr Pro Thr Pro Gly Pro Ser Arg Arg Gly Pro Ser 20 25 30Leu Gly Ala Ser Ser His Gln His Ser Arg Arg Arg Gln Gly Trp Leu 35 40 45Lys Glu Ile Arg Lys Leu Gln Lys Ser Thr His Leu Leu Ile Arg Lys 50 55 60Leu Pro Phe Ser Arg Leu Ala Arg Glu Ile Cys Val Lys Phe Thr Arg65 70 75 80Gly Val Asp Phe Asn Trp Gln Ala Gln Ala Leu Leu Ala Leu Gln Glu 85 90 95Ala Ala Glu Ala Phe Leu Val His Leu Phe Glu Asp Ala Tyr Leu Leu 100 105 110Thr Leu His Ala Gly Arg Val Thr Leu Phe Pro Lys Asp Val Gln Leu 115 120 125Ala Arg Arg Ile Arg Gly Leu Glu Glu Gly Leu Gly 130 135 14035359PRTToxoplasma gondii 35Met Arg Gly Gly Thr Ser Ala Leu Leu His Ala Leu Thr Phe Ser Gly1 5 10 15Ala Val Trp Met Cys Thr Pro Ala Glu Ala Leu Pro Ile Gln Lys Ser 20 25 30Val Gln Leu Gly Ser Phe Asp Lys Val Val Pro Ser Arg Glu Val Val 35 40 45Ser Glu Ser Leu Ala Pro Ser Phe Ala Val Thr Glu Thr His Ser Ser 50 55 60Val Gln Ser Pro Ser Lys Gln Glu Thr Gln Leu Cys Ala Ile Ser Ser65 70 75 80Glu Gly Lys Pro Cys Arg Asn Arg Gln Leu His Thr Asp Asn Gly Tyr 85 90 95Phe Ile Gly Ala Ser Cys Pro Lys Ser Ala Cys Cys Ser Lys Thr Met 100 105 110Cys Gly Pro Gly Gly Cys Gly Glu Phe Cys Ser Ser Asn Trp Ile Phe 115 120 125Cys Ser Ser Ser Leu Ile Tyr His Pro Asp Lys Ser Tyr Gly Gly Asp 130 135 140Cys Ser Cys Glu Lys Gln Gly His Arg Cys Asp Lys Asn Ala Glu Cys145 150 155 160Val Glu Asn Leu Asp Ala Gly Gly Gly Val His Cys Lys Cys Lys Asp 165 170 175Gly Phe Val Gly Thr Gly Leu Thr Cys Ser Glu Asp Pro Cys Ser Lys 180 185 190Arg Gly Asn Ala Lys Cys Gly Pro Asn Gly Thr Cys Ile Val Val Asp 195 200 205Ser Val Ser Tyr Thr Cys Thr Cys Gly Asp Gly Glu Thr Leu Val Asn 210 215 220Leu Pro Glu Gly Gly Gln Gly Cys Lys Arg Thr Gly Cys His Ala Phe225 230 235 240Arg Glu Asn Cys Ser Pro Gly Arg Cys Ile Asp Asp Ala Ser His Glu 245 250 255Asn Gly Tyr Thr Cys Glu Cys Pro Thr Gly Tyr Ser Arg Glu Val Thr 260 265 270Ser Lys Ala Glu Glu Ser Cys Val Glu Gly Val Glu Val Thr Leu Ala 275 280 285Glu Lys Cys Glu Lys Glu Phe Gly Ile Ser Ala Ser Ser Cys Lys Cys 290 295 300Asp Asn Gly Tyr Ser Gly Ser Ala Ser Ala Thr Ser His His Gly Lys305 310 315 320Gly Glu Ser Gly Ser Glu Gly Ser Leu Ser Glu Lys Met Asn Ile Val 325 330 335Phe Lys Cys Pro Ser Gly Tyr His Pro Arg Tyr His Ala His Thr Val 340 345 350Thr Cys Glu Lys Ile Lys Gln 3553620DNAArtificial Sequencesynthetic 36ccataattgc aaacgttctg 203710DNAArtificial Sequencesynthetic 37gggggggggg 103818DNAArtificial Sequencesynthetic 38tttttttttt ttttttgg 183920DNAArtificial Sequencesynthetic 39aaaaaaaaaa aaaaaaaaaa 2040106PRTHomo sapiens 40Lys Lys Asn Lys Asn Cys Arg Arg His Ser Leu Tyr Val Asp Phe Ser1 5 10 15Asp Val Gly Trp Asn Asp Trp Ile Val Ala Pro Pro Gly Tyr Gln Ala 20 25 30Phe Tyr Cys His Gly Asp Cys Pro Phe Pro Leu Ala Asp His Leu Asn 35 40 45Ser Thr Asn His Ala Ile Val Gln Thr Leu Val Asn Ser Val Asn Ser 50 55 60Ser Ile Pro Lys Ala Cys Cys Val Pro Thr Glu Leu Ser Ala Ile Ser65 70 75 80Met Leu Tyr Leu Asp Glu Tyr Asp Lys Val Val Leu Lys Asn Tyr Gln 85 90 95Glu Met Val Val Glu Gly Cys Gly Cys Arg 100 1054117DNAArtificial Sequencesynthetic 41gggggggggg ggggggt 174220DNAArtificial Sequencesynthetic 42ccataattcg aaacgttctg 20

* * * * *


uspto.report is an independent third-party trademark research tool that is not affiliated, endorsed, or sponsored by the United States Patent and Trademark Office (USPTO) or any other governmental organization. The information provided by uspto.report is based on publicly available data at the time of writing and is intended for informational purposes only.

While we strive to provide accurate and up-to-date information, we do not guarantee the accuracy, completeness, reliability, or suitability of the information displayed on this site. The use of this site is at your own risk. Any reliance you place on such information is therefore strictly at your own risk.

All official trademark data, including owner information, should be verified by visiting the official USPTO website at www.uspto.gov. This site is not intended to replace professional legal advice and should not be used as a substitute for consulting with a legal professional who is knowledgeable about trademark law.

© 2024 USPTO.report | Privacy Policy | Resources | RSS Feed of Trademarks | Trademark Filings Twitter Feed