Beta-2 Microglobulin-deficient Cells

Russell; David W.

Patent Application Summary

U.S. patent application number 14/111837 was filed with the patent office on 2014-05-15 for beta-2 microglobulin-deficient cells. This patent application is currently assigned to University of Washington through its Center for Commercialization. The applicant listed for this patent is David W. Russell. Invention is credited to David W. Russell.

Application Number20140134195 14/111837
Document ID /
Family ID46051930
Filed Date2014-05-15

United States Patent Application 20140134195
Kind Code A1
Russell; David W. May 15, 2014

BETA-2 MICROGLOBULIN-DEFICIENT CELLS

Abstract

The invention provides isolated primate cells preferably human cells that comprise a genetically engineered disruption in a beta-2 microglobulin (B2M) gene, which results in deficiency in MHC class I expression and function. Also provided are the method of using the cells for transplantation and treating a disease condition.


Inventors: Russell; David W.; (Seattle, WA)
Applicant:
Name City State Country Type

Russell; David W.

Seattle

WA

US
Assignee: University of Washington through its Center for Commercialization
Seattle
WA

Family ID: 46051930
Appl. No.: 14/111837
Filed: April 18, 2012
PCT Filed: April 18, 2012
PCT NO: PCT/US12/34051
371 Date: December 18, 2013

Related U.S. Patent Documents

Application Number Filing Date Patent Number
61477474 Apr 20, 2011

Current U.S. Class: 424/184.1 ; 424/93.21; 435/363; 435/366
Current CPC Class: A61K 39/0005 20130101; C07K 2319/00 20130101; C12N 2740/17043 20130101; A61K 35/28 20130101; A61P 17/02 20180101; A61P 37/06 20180101; A61K 35/12 20130101; A61P 7/06 20180101; C12N 2800/30 20130101; A61P 19/02 20180101; A61P 37/02 20180101; A61P 19/00 20180101; A61P 9/04 20180101; A61K 2039/515 20130101; A61P 7/00 20180101; A61P 31/00 20180101; C12N 2750/14143 20130101; A61P 3/10 20180101; C07K 14/70539 20130101
Class at Publication: 424/184.1 ; 435/363; 435/366; 424/93.21
International Class: A61K 39/00 20060101 A61K039/00; A61K 35/28 20060101 A61K035/28

Goverment Interests



[0002] This invention was made with government support under grant numbers R01GM086497 and R01DK55759 awarded by the National Institutes of Health. The government has certain rights in the invention.
Claims



1. An isolated cell comprising a genetically engineered disruption in a beta-2 microglobulin (B2M) gene, wherein the cell is a primate cell.

2. The cell of claim 1, wherein the cell comprises genetically engineered disruptions of all copies of the B2M gene.

3. The cell of claim 1, wherein the cell is a human cell.

4. The cell of any claim 1, wherein the cell further comprises one or more recombinant immunomodulatory genes, each capable of expressing an immunomodulatory polypeptide in the cell.

5. The cell of claim 4, wherein the one or more immunomodulatory genes comprise a polynucleotide capable of encoding a single chain fusion human leukocyte antigen (HLA) class I protein.

6. The cell of claim 5, wherein the single chain fusion HLA class I protein comprises at least a portion of B2M covalently linked to at least a portion of an HLA class I .alpha. chain selected from the group consisting of HLA-A, HLA-B, HLA-C, HLA-E, HLA-F and HLA-G.

7. The cell of claim 5, wherein the single chain fusion HLA class I protein comprises at least a portion of B2M and at least a portion of HLA class I .alpha. chain is selected from the group consisting of HLA-E, HLA-G and HLA-A.

8.-14. (canceled)

15. The cell of claim 5, wherein the cell further expresses a target peptide antigen that is presented by the single chain fusion HLA class I protein on the cell surface.

16. The cell of claim 15 wherein the target peptide antigen is covalently linked to the single chain fusion HLA class I protein.

17. The cell of claim 1, wherein the cell further comprises one or more recombinant genes capable of encoding a suicide gene product.

18. The cell of claim 17 wherein the suicide gene product comprises a protein selected from the group consisting of thymidine kinase and an apoptotic signaling protein.

19. The cell of any claim 1, wherein the cell has a normal karyotype.

20. The cell of claim 1, wherein the cell is a non-transformed cell.

21. The cell of claim 1, wherein the cell is a stem cell.

22.-25. (canceled)

26. A vaccine comprising the cell of claim 15, wherein the vaccine is capable of eliciting in a primate an immune response specific for the target peptide antigen.

27. The vaccine of claim 26, wherein the primate is a human and the cell is a human cell.

28. A method of transplantation in a patient in need thereof comprising the step of administering to the patient an effective amount of the cell of claim 1.

29-31. (canceled)

32. The method of claim 28, wherein the cell further expresses a single chain fusion HLA class I protein.

33. A method of treating a disease condition in a patient in need thereof comprising the step of administering to the patient an effective amount of the cell of claim 1, wherein the disease condition is diabetes, an autoimmune disease, cancer, infection, anemia, cytopenia, myocardial infarction, heart failure, skeletal or joint condition, osteogenesis imperfecta or burns.

34.-38. (canceled)

39. A kit comprising the cell of claim 1.

40.-43. (canceled)
Description



[0001] This application relates to and claims the benefit of priority to U.S. provisional application Ser. No. 61/477,474, filed Apr. 20, 2011, the disclosure of which is hereby incorporated by reference in its entirety.

BACKGROUND OF THE INVENTION

[0003] Human pluripotent stem cells have the potential to treat diseases affecting almost every organ system. However, the clinical use of human pluripotent stem cells and their derivatives has a major limitation--rejection of transplanted cells by the recipient due to differences in the major histocompatibility complex.

[0004] The major histocompatibility complex (MHC) is a cell surface multi-component molecule found in all vertebrates that mediates interactions of leukocytes with other leukocytes or other cells. The MHC gene family is divided into three groups: class I, class II and class III. In humans, MHC is referred to as human leukocyte antigen (HLA). The HLA class I (HLA-I) protein is expressed on all nucleated cells and consists of an HLA class I heavy chain (or .alpha. chain) and .beta.-2 microglobulin (B2M). HLA class I protein presents peptides on the cell surface to CD8+ cytotoxic T cells. Six HLA class I .alpha. chains have been identified to date, including three classical (HLA-A, HLA-B and HLA-C) and three non-classical (HLA-E, HLA-F and HLA-G) .alpha. chains. The specificity for peptide binding on the HLA class I molecule peptide binding cleft is determined by the .alpha. chain. Recognition by CD8+ T cells of the peptides presented by the HLA class I molecule mediates cellular immunity.

[0005] The HLA class I protein itself from an allogeneic source constitutes a foreign antigen in the context of transplantation. The recognition of non-self HLA class I protein is a major hurdle in using pluripotent cells for transplantation or replacement therapies. The first two clinical trials of human embryonic stem cells (ESCs) have been conducted that delivered ESCs to immune-privileged sites (such as spinal cord and eye) where allogeneic cells might survive. However, even these immune-privileged sites can eventually reject allogeneic cells, and most potential clinical applications do not involve immune-privileged sites. Alternatively, HLA-matched or partially matched cells from HLA-typed stem cell banks or pluripotent stem cell (iPSC) lines derived from each patient can be developed for transplantation. However, the development of individually matched cell line requires significant costs, months of cell culture, highly trained personnel, and extensive validation of the final product, all of which must be done with the approval of regulatory agencies. Furthermore, each cell line will likely behave somewhat differently in gene expression patterns, culture characteristics, differentiation potentials, and genetic variations.

[0006] Thus, although individualized stem cell preparations or HLA-diverse stem cell banks may address the current problem of transplantation, they require that multiple cell lines be characterized, differentiated into therapeutic cell products, and approved for human administration. This time-consuming, technically difficult, and expensive process is a major factor preventing stem cell-based therapies from entering clinical trials. Thus, there exists a need for a more effective and less expensive cell-based therapies that are not impeded by rejection.

SUMMARY OF THE INVENTION

[0007] In accordance with the present invention, in one aspect the invention provides an isolated primate cell comprising a genetically engineered disruption in a beta-2 microglobulin (B2M) gene. In certain particular embodiments, the cell comprises genetically engineered disruptions of all copies of the B2M gene.

[0008] In certain other embodiments, the cell further comprises one or more recombinant immunomodulatory genes. Suitable immunomodulatory genes include without limitation a gene encoding a viral protein that inhibits antigen presentation, a microRNA gene, and a gene that encodes a single chain (SC) fusion human leukocyte antigen (HLA) class I protein as described below. In certain preferred embodiments, the primate cell is a human cell.

[0009] In certain preferred embodiments, the one or more immunomodulatory genes comprise a polynucleotide capable of encoding a single chain fusion HLA class I protein. In certain particular embodiments, the single chain fusion HLA class I protein comprises at least a portion of B2M covalently linked to at least a portion of an HLA class I .alpha. chain selected from the group consisting of HLA-A, HLA-B, HLA-C, HLA-E, HLA-F and HLA-G. In certain preferred embodiments, the single chain fusion HLA class I protein comprises at least a portion of B2M and at least a portion of an HLA class I .alpha. chain selected from the group consisting of HLA-C, HLA-E and HLA-G. In certain other preferred embodiments, the single chain fusion HLA class I protein comprises at least a portion of B2M and at least a portion of an HLA class I .alpha. chain selected from the group consisting of HLA-A, HLA-E and HLA-G. In certain particular embodiments, the single chain fusion HLA class I protein comprises at least a portion of B2M and at least a portion of HLA-A0201 (e.g., SEQ ID NO:16). In certain other particular embodiments, the single chain fusion HLA class I protein comprises at least a portion of B2M and at least a portion of HLA-E (e.g., SEQ ID NOs:18 and 20).

[0010] In yet other particular embodiments, the cell has a normal karyotype. In certain other particular embodiments, the cell is a non-transformed cell. In particular, the cell can be a stem cell selected from the group consisting of a hematopoietic stem cell, an embryonic stem cell, an induced pluripotent stem cell, a liver stem cell, a neural stem cell, a pancreatic stem cell and a mesenchymal stem cell. In certain further embodiments, the cell further comprises one or more recombinant genes capable of encoding a suicide gene product. In certain particular embodiments, the suicide gene product comprises a protein selected from the group consisting of thymidine kinase and an apoptotic signaling protein.

[0011] In certain preferred embodiments, the stem cell is a pluripotent stem cell that expresses a single chain fusion HLA class I protein comprising at least a portion of B2M and at least a portion of an HLA class I .alpha. chain selected from the group consisting of HLA-A, HLA-B, HLA-C, HLA-E, HLA-F and HLA-G. In certain particular embodiments, the single chain fusion HLA class I protein comprises at least a portion of B2M and at least a portion of HLA-A0201.

[0012] In certain other particular embodiments, the stem cell is a differentiated cell. In certain embodiments, the differentiated cell is selected from the group consisting of a dendritic cell, a pancreatic islet cell, a liver cell, a muscle cell, a keratinocyte, a neuronal cell, a hematopoietic cell, a lymphocyte, a red blood cell, a platelet, a skeletal muscle cell, an ocular cell, a mesenchymal cell, a fibroblast, a lung cell, a GI tract cell, a vascular cell, en endocrine cell, an adipocyte and a cardiomyocyte. In certain preferred embodiments, the differentiated cell is a human cell expressing a single chain fusion HLA class I protein comprising at least a portion of B2M and at least a portion of an HLA class I .alpha. chain selected from the group consisting of HLA-A, HLA-B, HLA-C, HLA-E, HLA-F and HLA-G. In certain particular embodiments, the single chain fusion HLA class I protein comprises at least a portion of B2M and at least a portion of HLA-A0201.

[0013] In certain other embodiments, the cell further expresses a target peptide antigen that is presented by the single chain fusion HLA class I protein on the cell surface. In certain particular embodiments, the target peptide antigen is covalently linked to the single chain fusion HLA class I protein. In certain preferred embodiments, the target peptide antigen is derived from a protein of a pathogen or a cancer cell. Thus, in a related aspect, the invention provides a vaccine comprising the B2M-/- cells of the invention, wherein the vaccine is capable of eliciting in a primate an immune response specific for the target peptide antigen. In certain particular embodiments, the vaccine comprises a cell of the invention that is a differentiated dendritic cell. In certain other embodiments, the cell is a human cell of the invention, wherein the cell expresses a cytokine that further enhances the immune response. In certain preferred embodiments, the cytokine is IL2. In certain other preferred embodiments, the cytokine is IFN-.gamma.. In certain embodiments, the immune response comprises a humoral immune response; while in other embodiments, the immune response comprises a cellular immune response. In a further related aspect, the invention provides a kit comprising a vaccine that comprises the isolated cells of the invention and an immune adjuvant. In certain embodiments, the cell is a human cell.

[0014] In yet another aspect, the invention provides a method of transplantation in a patient in need thereof comprising the step of administering to the patient an effective amount of the isolated cell of the invention. In certain embodiments, the patient is immune competent. In certain particular embodiments, the patient is a primate and preferably a human. In certain preferred embodiments, the patient is a human and the cell is a human cell. In further embodiments the cell is a stem cell or a differentiated cell, optionally expressing a single chain fusion HLA class I protein.

[0015] In yet a further aspect, the invention provides a method of treating a disease condition in a patient in need thereof comprising the step of administering to the patient an effective amount of the B2M-/- cells of the invention, wherein the disease condition includes without limitation an endocrine disorder, diabetes, an autoimmune disease, cancer, infection, anemia, a platelet disorder, immunodeficiency, cytopenia, myocardial infarction, heart failure, liver failure, skeletal or joint condition, a neurological condition, stroke, paralysis, blindness or another visual disorder, muscular dystrophy, osteogenesis imperfecta, pulmonary disease, skin condition, or burns. In certain embodiments, the patient is immune competent. In certain particular embodiments, the patient is a primate and preferably a human. In certain preferred embodiments, the patient is a human and the cell is a human cell. In further embodiments the cell is a stem cell or a differentiated cell, optionally expressing a single chain fusion HLA class I protein. In certain particular embodiments, the disease condition is diabetes and the cell is a differentiated pancreatic islet cell. In further embodiments the differentiated pancreatic islet cell expresses a single chain fusion HLA class I protein.

[0016] In another aspect, the invention provides a kit comprising the isolated primate cells, preferably human cells, of the invention. In certain embodiments, the kit is for use in transplantation or for use in treating a disease condition. In certain other embodiments, the kit comprises an implant comprising the isolated primate cells, preferably human cells, of the invention.

[0017] Specific embodiments of the present invention will become evident from the following more detailed description of certain preferred embodiments and the claims.

BRIEF DESCRIPTION OF THE DRAWINGS

[0018] FIG. 1. Creation of B2M-/- ESCs. FIG. 1A is an illustration of the AAV B2M targeting vectors with exons shown in large boxes. FIG. 1B presents Southern blots showing the results of gene targeting and Cre-mediated transgene excision (Cre-out). FIG. 1C presents results of flow cytometry showing a lack of HLA class I expression after gene targeting (with isotype controls).

[0019] FIG. 2. Tissue sections of teratoma developed from B2M-/- Cre-out human ESCs transplanted in immune-deficient mice. The sections were stained with DAPI, hematoxylin and eosin, or lineage-specific markers MAP-2 (microtubule associated protein-2) for ectoderm, .alpha.-SAM (.alpha.-smooth muscle actin) for mesoderm or FoxA2 (forkhead box protein A2) for endoderm. Scale bar=100 microns.

[0020] FIG. 3. Single chain fusion HLA class I constructs. FIG. 3A shows the foamy viral vector design for expressing single chain fusion HLA class I proteins. FIG. 3B illustrates the linear protein structure of single chain fusion HLA class I proteins. Sequences for exemplary single chain fusion HLA class I proteins are provided for HLA-bGBE (SEQ ID NOs:19 and 20, DNA and protein sequences, respectively), HLA-gBE (SEQ ID NOs:17 and 18) and HLA-bBA0201 (SEQ ID NOs:15 and 16). FIG. 3C presents results of flow cytometry showing single chain fusion HLA-E expression in B2M-/- ESCs (isotype controls). FIG. 3D presents results of flow cytometry showing single chain fusion HLA-A0201 expression in B2M-/- ESCs (isotype controls).

[0021] FIG. 4 outlines the experimental design for each differentiated cell type.

[0022] FIG. 5 shows differentiation of keratinocytes from B2M-/- ESCs.

DETAILED DESCRIPTION OF THE INVENTION

[0023] All publications, patents and patent applications cited herein are hereby expressly incorporated by reference for all purposes.

[0024] Within this application, unless otherwise stated, the techniques utilized may be found in any of several well-known references such as: Molecular Cloning: A Laboratory Manual (Sambrook, et al., 1989, Cold Spring Harbor Laboratory Press) and PCR Protocols: A Guide to Methods and Applications (Innis, et al. 1990. Academic Press, San Diego, Calif.).

[0025] As used herein, the singular forms "a", "an" and "the" include plural referents unless the context clearly dictates otherwise. For example, reference to "an isolated cell" means one or more isolated cells.

[0026] All embodiments disclosed herein can be combined unless the context clearly dictates otherwise.

[0027] In one aspect, the invention provides B2M deficient cells. In particular, the invention provides isolated primate cells, preferably human cells, comprising a genetically engineered disruption in a B2M gene. In certain preferred embodiments, the cell is a human cell comprising a genetically engineered disruption in the B2M gene. In a related aspect, the cell comprises genetically engineered disruptions of all copies of the B2M gene. In certain embodiments, the genetic disruptions in the B2M gene result in defective or no expression of the B2M protein. Since B2M is a common component of all HLA class I proteins, the disruptions preclude the expression of all natural HLA class I proteins on the cell surface. The B2M coding sequence is shown in SEQ ID NO:1 (GenBank Accession Number NM.sub.--004048) and the B2 M protein sequence is shown in SEQ ID NO:2. There may be many single nucleotide polymorphisms (SNPs) in the gene; as will be understood by those of skill in the art, the human cells and methods of the invention are applicable to any such B2M gene and SNPs.

[0028] The cells of these embodiments of the invention can be used, for example, as donor cells for transplantation in a recipient in need thereof. B2M deficient cells encompass cells that comprise a B2M-/- genetic background (referred to as B2M-/- cells). The term "B2M-/- cells" refers to primate cells, preferably human cells, that comprise genetically engineered disruptions in all copies of the B2M gene. The B2M-/- cells can serve as "universal donor cells" in that they are immunologically compatible to all or a significant percentage of recipients in a population. As used herein, a recipient or patient refers to a primate, and preferably a human. In certain particular embodiments, the cell is a human cell and the patient is a human.

[0029] The cells of the invention can be engineered to disrupt the B2M gene such that no functional endogenous B2M protein is produced from the disrupted genetic loci. In certain embodiments, the disruption results in expression of non-functional B2M proteins, including but not limited to truncations, deletions, point mutations and insertions. In other embodiments, the disruption results in no protein expression from the B2M gene.

[0030] Cells deficient in B2M expression are unable to express HLA class I proteins on the cell surface. HLA class I-deficiency provides further benefits; for example, cells without HLA class I expression cannot present auto-antigens that would otherwise prevent successful cell therapies for autoimmune diseases such as diabetes and rheumatoid arthritis. Similarly, therapeutic gene products introduced by the inventive cell therapies (e.g., dystrophin) that are missing in patients with certain genetic diseases (e.g., muscular dystrophy) will not be presented and recognized by the immune system as neo-antigens in replacement therapies.

[0031] Any suitable technique for disrupting one, two or all copies of the B2M gene can be used; exemplary techniques are disclosed throughout the application and are within the level of skill in the art based on the teachings herein and the teachings known in the art. Exemplary other techniques can be found, for example, in U.S. Patent Application Publication Number US2008/0219956, published Sep. 11, 2008, and incorporated by reference herein in its entirety. These techniques may optionally include steps to remove non-human DNA sequences from the cells after B2M gene disruption.

[0032] An exemplary embodiment of this method is as disclosed throughout the application, using an adeno-associated virus gene targeting vector, optionally including removing the transgene used for targeting via techniques such as those described below, or by removing the transgene used for targeting by Cre-mediated loxP recombination, or other suitable recombination techniques. See Khan et al. 2011, Protocol, 6:482-501, which is incorporated by reference in its entirety. Exemplary targeting vectors and exemplary vector diagrams are also disclosed herein. It is within the level of those of skill in the art, based on the teachings herein and known in the art, to utilize a variety of techniques for making the B2M-/- cells, preferably human cells, of the invention.

[0033] In certain embodiments, the cell genome of the B2M-/- cells may comprise no more than 100, no more than 50 or no more than 30 nucleotides of non-human DNA sequences. In certain other embodiments, the cell genome may comprise 6, 5, 4, 3, 2, 1, or 0 nucleotides of non-human DNA sequences. An exemplary technique for removing any non-human DNA introduced in disrupting the B2M gene is provided in FIG. 1A. The non-human DNA sequences can be removed by a second round of targeting to delete the HyTK or TKNeo transgenes in the first vectors or by the Cre-mediated loxP recombination.

[0034] In other embodiments, the cells instead can be engineered to recombinantly express a single chain fusion HLA class I protein in a B2M-/- genetic background. Thus, the B2M-/- cells as used herein also encompass primate, preferably human, cells that express one or more single chain fusion HLA class I proteins in a B2M-/- genetic background. The B2M-/- cells recombinantly expressing a single chain fusion HLA class I protein are nevertheless deficient in normal B2M function in that the cells do not express wild type B2M protein that form a non-covalently associated heterodimer with any HLA class I .alpha. chain on the cell surface.

[0035] The term "single chain fusion HLA class I protein," "single chain fusion HLA class I molecule" or "single chain fusion HLA class I antigen" refers to a fusion protein comprising at least a portion of the B2M protein covalently linked, either directly or via a linker sequence, to at least a portion of an HLA-I .alpha. chain. On the other hand, the term "HLA class I protein," "HLA class I molecule" or "HLA class I antigen" refers to a non-covalently associated heterodimer of B2M and an HLA .alpha. chain expressed on the surface of a wild type cell.

[0036] As used herein, the term "HLA class I .alpha. chain" or "HLA-I heavy chain" refers to the .alpha. chain of the HLA class I heterodimer. HLA class I heavy chain includes without limitation HLA class I .alpha. chains HLA-A, HLA-B, HLA-C, HLA-E, HLA-F, and HLA-G. Representative DNA and protein sequences are provided for HLA-A (GenBank No. K02883.1, SEQ ID NO:3; UniProt No. P01892, SEQ ID NO:4), HLA-B (NM.sub.--005514, SEQ ID NO:5; NP.sub.--005505; SEQ ID NO:6), HLA-C (NM.sub.--002117, SEQ ID NO:7; NP.sub.--002108, SEQ ID NO:8), HLA-E (NM.sub.--005516, SEQ ID NO:9; NP.sub.--005507, SEQ ID NO:10), HLA-F (NM.sub.--018950, SEQ ID NO:11; NP.sub.--061823, SEQ ID NO:12), and HLA-G (NM.sub.--002127, SEQ ID NO:13; NP.sub.--002118, SEQ ID NO:14).

[0037] In addition, although the term "HLA class I protein/molecule" is known to refer to the MHC class I protein/molecule in human, the terms HLA and MHC are sometimes used interchangeably throughout this application: for example, the term HLA class I protein can also be used to refer to the primate equivalent to the HLA class I protein in a primate. One of skill in the art will be able to discern the meaning of the term based on the content.

[0038] The term B2M-/- cells as used herein also encompasses cells having genetically engineered disruptions in all copies of the B2M gene, wherein one B2M allele is genetically engineered to express, instead of the wild type B2M protein, a single chain fusion HLA class I protein (i.e., genetically targeted knockin in one B2M allele). B2M-/- cells with such genetic background express B2M only in the context of the single chain fusion HLA class I protein from a B2M genetic locus. In certain advantageous embodiments, the expression of the single chain fusion HLA class I protein is regulated by the endogenous B2M regulatory sequence located at the B2M locus.

[0039] In related embodiments, B2M-/- cells further encompass cells having genetically engineered disruptions in all copies of the B2M gene, wherein all B2M alleles are genetically engineered to express, instead of the wild type B2M protein, single chain fusion HLA class I proteins (i.e., genetically targeted knockin in all B2M alleles). B2M-/- cells with such genetic disruptions express B2M only in the context of single chain fusion HLA class I proteins from the genetic loci of all the alleles of the B2M gene. In certain embodiments, the cells are genetically engineered to express the same type of single chain fusion HLA class I protein from the genetic loci of all alleles of the B2M gene; while in other embodiments, the cells are genetically engineered to express different types of single chain fusion HLA class I proteins from different genetic loci of different alleles of the B2M gene.

[0040] Throughout the application, the "cells of the invention," "isolated cells of the invention," "B2M-/- cells," "B2M-/- cells of the invention" or "stem cells or differentiated cells of the invention" sometimes can be used interchangeably to encompass all the B2M-/- cells described herein. In certain particular embodiments, the B2M-/- cells of the invention express a single chain fusion HLA class I protein as defined herein, in a B2M-/- background. The B2M-/- cells can be genetically engineered to express a single chain HLA class I protein either from the B2M locus or from other location of the genome. In certain particular embodiments, the cells of the invention comprise genetically engineered disruptions in all alleles of the B2M gene that preclude the expression of wild type B2M protein, and nevertheless express a single chain fusion HLA class I protein from a B2M genetic locus. In certain other particular embodiments, the cells of the invention comprise genetically engineered disruptions in all alleles of the B2M gene that preclude the expression of wild type B2M protein, and nevertheless express single chain fusion HLA class I proteins from all B2M genetic loci. The term "gene," "allele," and "genetic locus" may be used interchangeably throughout the application.

[0041] The "isolated cell" can be any suitable cell type for a given purpose. For example, the cell can be a pluripotent stem cell or a differentiated cell. "A stem cell" broadly encompasses any cells that are capable of further differentiation. "A pluripotent stem cell" refers to a stem cell that has the potential to differentiate into any of the three germ layers: endoderm, mesoderm or ectoderm. "An adult stem cell," on the other hand, is multipotent in that it can produce only a limited number of cell types. "An embryonic stem (ES) cell" refers to a pluripotent stem cell derived from the inner cell mass of the blastocyst, an early-stage embryo. "Induced pluripotent stem cells (iPS cells)" are pluripotent stem cell artificially derived from a non-pluripotent cell, typically an adult somatic cell, by artificially inducing expression of certain genes.

[0042] In certain embodiments, the single chain fusion HLA class I protein comprises at least a portion of B2M and at least a portion of HLA-A, HLA-B, HLA-C, HLA-E, HLA-F or HLA-G (also referred to as a dimeric construct). In certain preferred embodiments, the HLA .alpha. chain contained in the single chain fusion HLA class I protein does not contain the leader sequence (or signal sequence) of the HLA class I .alpha. chain (leaderless HLA .alpha. chain). In certain other embodiments, the single chain fusion HLA class I protein comprises at least a portion of B2M and at least a portion of HLA-C, HLA-E or HLA-G. In certain further embodiments, the single chain fusion HLA class I protein comprises at least a portion of B2M and at least a portion of HLA-A, HLA-E or HLA-G. In certain preferred embodiments, the single chain fusion HLA class I protein comprises a leader sequence (or signal peptide) covalently linked to the at least a portion of B2M and at least a portion of an HLA .alpha. chain to ensure proper folding of the single chain fusion on the cell surface. The leader sequence can be the leader sequence of the B2M protein, the leader sequence of an HLA .alpha. chain protein or the leader sequence of other secretary proteins. In certain particular embodiments, the single chain fusion HLA class I protein comprises a B2M protein with its leader sequence removed. In certain other particular embodiments, the single chain fusion HLA class I protein comprises an HLA .alpha. chain protein with its leader sequence removed. Certain HLA class I .alpha. chains are highly polymorphic. As will be understood by those of skill in the art, the human cells and methods of the invention are applicable to any such HLA .alpha. chains and polymorphism thereof.

[0043] Single chain fusion HLA class I proteins comprising sequence variants and fragments of B2M and/or HLA .alpha. chains are contemplated by the instant invention, wherein such single chain fusion constructs nevertheless possess normal HLA class I functions, e.g., forming proper secondary structure of the heterodimer on the cell surface, presenting peptides in the peptide binding cleft and engaging the inhibitory receptors on the surface of NK cells. In certain embodiments, the variants share at least 75%, 80%, 81%, 85%, 88%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or complete sequence homology with the naturally occurring HLA heavy chains and B2M sequences, wherein the variants possess normal HLA class I functions. In certain other embodiments, the variants share at least 75%, 80%, 81%, 85%, 88%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or complete sequence homology with the sequences of B2M or HLA heavy chains as shown in SEQ ID NOs:2, 4, 6, 8, 10, 12 or 14.

[0044] In certain particular embodiments, the HLA-A variants share at least 85%, 88,%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or complete sequence homology with SEQ ID NO:4. In certain other particular embodiments, the HLA-B variants share at least 81%, 83%, 85%, 88,%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or complete sequence homology with SEQ ID NO:6. In certain further embodiments, the HLA-C variants share at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or complete sequence homology with SEQ ID NO:8. In yet other embodiments, the HLA-E variants share at least 97%, 98%, 99%, or complete sequence homology with SEQ ID NO:10. In certain particular embodiments, the HLA-F variants share at least 99%, or complete sequence homology with SEQ ID NO:12. In certain other embodiments, the HLA-G variants share at least 98%, 99%, or complete sequence homology with SEQ ID NO:14.

[0045] In certain other embodiments, the single chain fusion HLA class I protein comprises a full length B2M (including its leader sequence) and an HLA .alpha. chain without the leader sequence (leaderless HLA .alpha. chain); while in certain other embodiments, the single chain fusion HLA class I protein comprises a B2M protein without the leader sequence. It is understood that B2M-/- cells expressing two, three or more different types of single chain fusion HLA class I protein in any combination, for example, expressing SC fusion comprising HLA-A (or a leaderless HLA-A) and SC fusion comprising HLA-C (or a leaderless HLA-C), expressing SC fusion comprising HLA-A (or a leaderless HLA-A) and SC fusion comprising HLA-E (or a leaderless HLA-E), or expressing SC fusion comprising HLA-B (or a leaderless HLA-B), SC fusion comprising HLA-E (or a leaderless HLA-E) and SC fusion comprising HLA-G (or a leaderless HLA-G), etc., are all contemplated by the invention.

[0046] Natural killer (NK) cells are part of the innate immune response. Several pathogens can down regulate HLA class I protein expression in infected cells. The NK cells monitor infection by recognizing and inducing apoptosis in cells that do not express HLA class I proteins. The inhibitory receptors on the NK cell surface recognize HLA class I .alpha. chain alleles thereby preventing NK-medicated apoptosis in uninfected normal cells. Thus, in certain particular embodiments, the single chain fusion HLA-I protein inhibits NK cell-mediated killing of cells that do not express endogenous HLA class I proteins by binding to the inhibitory receptors on the NK cells. For example, HLA-E is a ligand for the CD94/NKG2 receptor of NK cells that inhibits NK cell-mediated apoptosis. Thus, in certain particular embodiments, the B2M-/- cell expresses the single chain fusion HLA class I protein comprising at least a portion of B2M and at least a portion of HLA-E. In addition, HLA-G is normally expressed on the surface of placental cytotrophoblasts that do not express HLA-A, B or C, and it protects these cells from NK cell-mediated lysis by interacting with the inhibitory ILT2(LIR1) receptor on NK cells (Pazmany et al., 1996, Science 274, 792-795). Thus, in certain other preferred embodiments, the B2M-/- cell expresses the single chain fusion HLA class I protein comprising at least a portion of B2M and at least a portion of HLA-G.

[0047] In certain particular embodiments, the single chain fusion HLA class I protein comprises at least a portion of B2M and at least a portion of HLA-A0201, an allele of HLA-A. HLA-A0201 (SEQ ID NO:4) is a common HLA class I allele found in a large percentage of the population in the United States. Thus, in certain advantageous embodiments, the isolated cell expresses the single chain fusion HLA class I protein comprising at least a portion of B2M and at least a portion of HLA-A0201 in a B2M-/- genetic background, wherein the isolated cell is immune compatible with a large percentage of the human population in the United States. Other suitable common alleles that can be used include without limitation HLA-A0101, HLA-A0301, HLA-B0702, HLA-B0801, HLA-00401, HLA-00701, and HLA-00702. In certain preferred embodiments, the HLA allele comprises at least a portion of HLA-A0201 (SEQ ID NO:4), HLA-B0702 (SEQ ID NO:6) or HLA-00401 (SEQ ID NO:8).

[0048] In certain further embodiments, the single chain fusion HLA class I protein also comprises a specific peptide antigen that occupies the peptide binding cleft of the single chain fusion HLA class I protein, wherein the peptide antigen is covalently linked to the single chain fusion HLA class I protein (also referred to as a trimeric construct). An example of the trimeric construct is shown in FIG. 3B. The HLA-bGBE construct of FIG. 3B comprises B2M and HLA-E covalently linked to a peptide antigen (such as, but not limited to, the HLA-G peptide antigen as illustrated in the figure) (SEQ ID NO:23) designed to occupy the peptide binding cleft of the single chain fusion HLA class I protein. In certain other embodiments, the covalently linked peptide antigen is cleaved via a built in protease cleavage site, and the cleaved peptide antigen can bind to the peptide binding cleft of the single chain fusion HLA-I protein for presentation. In certain alternative embodiments, the peptide antigen occupying the peptide binding cleft of the single chain fusion HLA class I protein is produced by the intracellular antigen processing pathway, in which the peptide antigen is produced by proteasome, transported to and loaded onto the single chain fusion HLA class I protein in the endoplasmic reticulum. In certain particular embodiments, the peptide antigen comprises a peptide of a tumor antigen. In certain other embodiments, the peptide antigen comprises a peptide of a protein from a pathogen including without limitation a bacterium, a virus, a fungus and a parasite. In further embodiments, the peptide antigen comprises a peptide of a tumor antigen. In certain particular embodiments, the B2M-/- cell expresses a single chain fusion HLA class I protein that is covalently linked to a peptide that does not comprise an auto-antigen or neo-antigen to the patient. It is within the ability of a skilled person to design the single chain fusion HLA class I protein and the peptide antigen presented thereon to modulate the immune response that may be elicited in a recipient.

[0049] The isolated B2M-/- cell expressing a single chain fusion HLA class I protein comprising a specific peptide antigen either covalently or non-covalently bound to the single chain fusion HLA class I protein can be used, for example, for administration to a recipient to elicit an immune response. Accordingly, in a related aspect, the invention provides a vaccine comprising the isolated cell of the invention, wherein the vaccine is capable of eliciting in a recipient an immune response specific for the target peptide antigen. The immune response includes without limitation a cellular immune response and/or a humoral immune response. The vaccine may comprise a stem cell or a differentiated cell; in certain particular embodiments, the cell is a differentiated dendritic cell. In certain other embodiments, the cell further expresses a cytokine Any suitable cytokine can be used; in certain particular embodiments, the cytokine is IL2 or IFN-.gamma.. In certain preferred embodiments, the cell is a human cell and the recipient is a human.

[0050] The single chain fusion HLA class I protein can be expressed from an expression vector that allows either transient or more preferably, stable expression of the protein in a B2M-/- cell. Exemplary suitable expression vectors are known in the art. One such example is a retroviral vector, which is capable of integrating into the cellular genome to provide long-term, stable expression of an exogenous gene. In certain particular embodiments, the viral vector is derived from human foamy virus, a type of retrovirus. Other suitable viral vectors include without limitation vectors derived from retrovirus, adenoviral virus, adeno-associated virus, lentivirus, herpes simplex virus, vaccinia virus, and pox virus.

[0051] In certain preferred embodiments, the polynucleotide capable of encoding a single chain fusion HLA class I protein is integrated into the chromosome of the cells, preferably into the B2M or the HLA loci, for stable expression. Thus, in certain preferred embodiments, the B2M loci are disrupted by inserting in the B2M loci the polynucleotide capable of encoding a single chain fusion HLA class I protein to replace the expression of the endogenous wild type B2M protein. The result of such gene targeting disrupts normal B2M expression and precludes formation of wild type HLA class I proteins but permits expression of a predetermined single chain fusion HLA class I protein of choice on the surface of the otherwise B2M deficient cells. Other expression vectors are also contemplated and the selection of suitable expression vector is within the ability of one ordinary skill in the art.

[0052] According to the vector design, the polynucleotide capable of expressing a single chain fusion HLA class I protein is delivered to a cell by viral infection (when a viral vector is used) or by other delivery methods including without limitation transfection, electroporation, gene targeting or liposome-mediated DNA delivery.

[0053] Any immune effects of the single chain fusion HLA class I protein expressing B2M-/- cells can be studied by various means. For example, B2M-/- cells expressing a SC fusion HLA class I protein can be differentiated into antigen-presenting dendritic cells (iDCs). Suppression of NK cell-mediated lysis can be measured by chromium release assays after incubating iDCs with normal human NK cells and NKL cell lines. A variety of controls (untransduced B2M-/- iDCs, B2M+/+iDCs, the 721.221 class I-negative cell line, and anti-receptor and anti-HLA antibodies) can be used to establish the specificity of the interactions. Additional characterization can be done with Elispot assays by incubating the cells with T cells.

[0054] In a related aspect, the invention provides an HLA class I-typed B2M-/- cell bank, wherein the cells of the cell bank comprise a B2M-/- genetic background and are engineered to express one or more types of single chain fusion HLA class I proteins in which the HLA .alpha. chain is selected from the group consisting of HLA-A, HLA-B, HLA-C, HLA-E, HLA-F and HLA-G. In certain embodiments, the cell bank comprises a population of cells that expresses a single chain fusion HLA class I protein in which the HLA .alpha. chain comprises HLA-A. In certain other embodiments, the cell bank comprises a population of cells that expresses a single chain fusion HLA class I protein in which the HLA .alpha. chain comprises HLA-B. In certain further embodiments, the cell bank comprises a population of cells that expresses a single chain fusion HLA class I protein in which the HLA .alpha. chain comprises HLA-C. In yet other embodiments, the cell bank comprises a population of cells that expresses a single chain fusion HLA class I protein in which the HLA .alpha. chain comprises HLA-E. In certain other embodiments, the cell bank comprises a population of cells that expresses a single chain fusion HLA class I protein in which the HLA .alpha. chain comprises HLA-F. In certain particular embodiments, the cell bank comprises a population of cells that expresses a single chain fusion HLA class I protein in which the HLA .alpha. chain comprises HLA-G. In certain particular embodiments, the cell bank comprises the above-described one or more or preferably all populations of cells.

[0055] The cells of the cell bank can be pluripotent stem cells or differentiated cells. In certain particular embodiments, the cell bank comprises different types of differentiated cells, such as skin cells, pancreatic beta islet cells, etc., that express the same single chain fusion HLA class I protein. While in other particular embodiments, the cell bank comprises different types of differentiated cells, such as skin cells, pancreatic beta islet cells, etc., that each express different single chain fusion HLA class I proteins. It can be determined by a skilled researcher or clinician to choose suitable donor cells from the cell bank for a given patient. In certain other embodiments, some of the cells of the cell bank express the HLA class I allele that matches the HLA class I allele of the patient to whom the cells are administered. In certain preferred embodiments, the cell is a human cell and the patient is a human. In certain particular embodiments, the cells express a single chain fusion HLA class I protein comprising B2M and HLA-A0201 that matches the HLA allele of a large portion of the population in the United States.

[0056] In another aspect, the invention provides a method of transplantation in a patient in need thereof comprising the step of administering to the patient an effective amount of the cells of the invention for transplantation. Because the B2M-/- cells do not express wild type HLA class I protein on the cell surface, the cells when administered to a patient elicit minimal or no immune responses in the patient. Thus, transplantation using the B2M-/- cells limits the need for taking immune suppressant therapies. Thus, in certain preferred embodiments, the patient is immune competent. In certain other embodiments, the cell is an isogeneic cell; while in other embodiments, the cell is an allogeneic cell.

[0057] In certain further embodiments, the cells of the invention are pluripotent stem cells; while in other embodiments, the cells of the invention are differentiated cells. In certain preferred embodiments, the cell is a human cell and the patient is a human patient. In certain particular embodiments, the method of transplantation comprises administering to a human an effective amount of the pluripotent stem cells or differentiated cells. In certain preferred embodiments, the cells of the invention further express one or more engineered single chain fusion HLA class I proteins. In certain other embodiments, the cells are able to escape NK cell-mediated killing and elicit minimal or no immune response in the recipient after transplantation.

[0058] Transplantation therapy, replacement therapy or regenerative therapy refers to therapies for a disease condition by administering to a patient cells or tissues to replenish or replace defective cellular functions in a target organ. In certain particular embodiments, the need for transplantation arises as a result of physical or pathological injuries to a tissue or organ. In certain other particular embodiments, the need for transplantation arises as a result of one or more genetic defect or mutation in the patient and the transplantation of the cells of the invention replenishes or replaces defective cellular functions in the patient without the need for gene therapy to correct the underlying genetic mutation of the patient. In certain further embodiments, the transplantation includes without limitation hematopoietic stem cell transplantation, or transplantation of cells that are incorporated into an organ such as liver, kidney, pancreas, lung, brain, muscle, heart, gastrointestinal tract, nervous system, skin, bones, bone marrow, fat, connective tissue, immune system, or blood vessels. In certain particular embodiments, the target organ is a solid organ.

[0059] In certain particular embodiments, the cells administered to the recipient may or may not be incorporated into an organ in need of such therapy. In certain embodiments, the cells of the invention are differentiated into the desired cell type, either before or after transplantation, and provide the necessary cellular function without itself being incorporated into the tissue at the site of transplantation. For example, in certain embodiments for treating diabetes, the cells of the invention either as pluripotent stem cells or differentiated pancreatic beta islet cells are transplanted to a diabetic patient. The transplanted cells need not reconstitute a functioning pancreas: they just need to secrete insulin in response to glucose levels. In certain particular embodiments, the cells are transplanted into an ectopic location and are not fully incorporated into the pancreas. Transplantation of pluripotent cells of the invention, differentiated cells of the invention, or a tissue differentiated and developed ex vivo from the cells of the invention are all contemplated by the invention. In certain preferred embodiments, the cell is a human cell and the patient is a human patient. In certain other preferred embodiments, the cells of the invention express one or more single chain fusion HLA class I proteins.

[0060] In a further aspect, the invention provides a method of treating a disease condition in a patient in need thereof comprising the step of administering to the patient an effective amount of the cell of the invention to treat the disease condition, wherein the disease condition is diabetes, an autoimmune disease, cancer, infection, anemia, cytopenia, myocardial infarction, heart failure, skeletal or joint condition, osteogenesis imperfecta or burns. In certain particular embodiments, the disease condition results from pathological or physical injuries to a tissue or organ. In certain embodiments, the cells of the invention are stem cells; while in other embodiments, the cells of the invention are differentiated cells. In certain preferred embodiments, the cell is a human cell and the patient is a human patient. In certain particular embodiments, the human cell is a differentiated cell. Transplantation of a tissue developed ex vivo from the cells of the invention is also contemplated by the invention. In certain preferred embodiments, the cells of the invention further express one or more single chain fusion HLA class I proteins. In certain embodiments, the cell is an isogeneic cell; while in other embodiments, the cell is an allogeneic cell.

[0061] In certain particular embodiments, the cell is a differentiated cell including without limitation a dendritic cell, lymphocyte, red blood cell, platelet, hematopoietic cell, pancreatic islet cell, liver cell, muscle cell, keratinocyte, cardiomyocyte, neuronal cell, skeletal muscle cell, ocular cell, mesenchymal cell, fibroblast, lung cell, GI tract cell, vascular cell, endocrine cell and adipocyte. In certain other particular embodiments, the invention provides a method of treating a disease condition in a solid organ. In certain embodiments, the cells of the invention used in treating a disease condition express one or more single chain fusion HLA class I proteins.

[0062] "Treating" a patient having a disease or disorder means accomplishing one or more of the following: (a) reducing the severity of the disease; (b) arresting the development of the disease or disorder; (c) inhibiting worsening of the disease or disorder; (d) limiting or preventing recurrence of the disease or disorder in patients that have previously had the disease or disorder; (e) causing regression of the disease or disorder; (f) improving or eliminating the symptoms of the disease or disorder; and (f) improving survival. In certain preferred embodiments, the disease or disorder is a disease or disorder that can be treated by transplantation of tissues or cells.

[0063] The effective amount of the isolated cells of the invention for transplantation or for treating a disease condition depends on a number of factors, such as the type of tissue, the severity of the disease condition, the transplantation reaction, the reason for transplantation, and the age and general health of the patient. The effective amount can be determined by a skilled researcher or clinician by routine practice. Due to the reduced immunogenicity of the transplanted cells, relative large amount of cells can be tolerated by a patient to achieve the desired therapeutic effects. Alternatively, the cells can be repeatedly transplanted at intervals until a desired therapeutic effect is achieved.

[0064] The route for administration of the cells of the invention is not limited in any particular way. Exemplary delivery routes include without limitation intravenous, intramuscular, subdermal, intraperitoneal, transcutaneous, intracutaneous, and subcutaneous route. The cells of the present invention can also be administered topically by injection. For example, the cells can be injected into an injured joint, a fractured bone, an infarct site, an ischemic site or their periphery.

[0065] In certain particular embodiments, the cells are administered via a delivery device including without limitation a syringe. For example, the cells can be suspended in a solution or a pharmaceutical composition contained in such a delivery device. The "solution" or "pharmaceutical composition" comprises a physiological compatible buffer and optionally a pharmaceutically acceptable carrier or diluent in which the cells of the invention remain viable. The use of such carriers and diluents is well known in the art. The solution includes without limitation physiologically compatible buffers such as Hank's solution, Ringer' solution, or physiologically buffered saline. The cells can be kept in the solution or pharmaceutical composition for short term storage without losing viability. In certain particular embodiments, the cells are frozen for long term storage without losing viability according to cryopreservation methods well-known in the art.

[0066] Aqueous injection suspensions may contain substances which increase the viscosity of the suspension, such as sodium carboxymethyl cellulose, sorbitol, or dextran, but still fluid to the extent that can be easily delivered by syringe injection. The solution is preferably sterile, stable under the conditions of manufacture and storage and is free of microorganism contamination through the use of, for example, parabens, chlorobutanol, phenol, ascorbic acid, thimerosal, and the like. The cells contained in the solution can be stem cells or differentiated cells as described herein, in a pharmaceutically acceptable carrier or diluent and, as required, other ingredients indicated above.

[0067] The cells may be administered systemically (e.g., intravenously) or locally (e.g., directly into a myocardial defect under the guidance of echocardiogram, or by direct application to damaged tissues or organs accessible during open surgery). For injections, the cells may be in an injectable liquid suspension preparation or in a biocompatible medium which is injectable in liquid form and becomes semi-solid at the site of damaged tissue. A syringe, a controllable endoscopic delivery device or other similar devises can be used so long as the needle lumen is of sufficient diameter (e.g. at least 30 gauge or larger) to avoid physical damages to the cells during delivery.

[0068] In certain other embodiments, the cells can be transplanted via a solid support, e.g., a planar surface or three-dimensional matrix. The matrix or planar surface is surgically implanted into the appropriate site in a patient. For example, a patient needing a pancreatic graft can have differentiated cells on a solid support surgically implanted in the pancreas tissue. Exemplary solid support includes without limitation a patch, a gel matrix (such as GELFOAM.RTM. from Pharmacia-Upjohn), polyvinyl alcohol sponge (PVA)-collagen gel implants (such as IVALON, Unipoint Industries, High Point, N.C.) and other similar or equivalent devices. A variety of other encapsulation technologies can be used with the cells of the invention, for example, WO 91/10470; WO 91/10425; U.S. Pat. No. 5,837,234; U.S. Pat. No. 5,011,472; U.S. Pat. No. 4,892,538).

[0069] The cells of the invention can be differentiated into various cell types of all three lineages, including without limitation hematopoietic, mesenchymal, pancreatic endoderm, cardiac and keratinocytes cells. In certain embodiments, the differentiated cell further expresses a single chain fusion HLA class I protein. In general, each cell type can be analyzed for HLA class I protein expression, reactivity with human T cells and NK cells, appropriate differentiation markers, and xenotransplantation in immunodeficient mice to examine in vivo developmental potential. See FIG. 4. A brief discussion of each differentiated cell type follows.

[0070] In certain embodiments, the cells of the invention can be differentiated to hematopoietic cells for treating various hematopoietic diseases currently treated by bone marrow transplantation. Patients receiving transfusion can become refractory to platelet transfusions due to HLA mismatches. Anemic or cytopenic patients can be treated by delivering the cells of the invention-derived erythrocytes, platelets or neutrophils to treat bleeding or infection.

[0071] Further, stem cells of the invention-derived dendritic cells are antigen-presenting cells that can be used as cellular vaccines when properly engineered. In certain embodiments, the cells of the invention engineered to express a single chain fusion HLA class I protein and a unique peptide antigen are used to vaccinate against specific pathogen or tumor antigens. In certain other embodiments, differentiated B2M-/- cytotoxic lymphocytes with HLA-restricted reactivity against specific antigens are used to eliminate infected cells or tumor cells.

[0072] To obtain hematopoietic cells, the pluripotent cells are first allowed to form embryoid bodies, thereafter non-adherent cells were cultured in the presence of hematopoietic cytokines to develop into specific cell lineages. The differentiation of hematopoietic cells from the cells of the invention that express a single chain fusion HLA class I protein expressers can be analyzed by flow cytometry and colony assays. The different cell populations are sorted based on their surface markers, and used to monitor the expression of HLA genes and reactivity with human NK cells and T cells as measured by Elispot, mixed lymphocyte reactions, and cytotoxicity assays. The effectiveness of the single chain fusion HLA constructs on suppression of NK cell-mediated killing can be examined at different stages of differentiation and transplantation. See Bix et al., 1991, Nature 349, 329-331. The hematopoietic stem cells can also be assayed using xenotransplantation models in, for example, immunodeficient mice (SCID-repopulating cells or SRCs).

[0073] The cells of the invention can be differentiated into hematopoietic cell either before or after the cells are administered to a patient. In certain preferred embodiments, the cell is a human cell and the patient is a human. In vitro hematopoietic differentiation can be performed according to established protocols. See for example, Slukvin et al., 2006, J Immunol 176:2924-32, and Chang et al., 2006, Blood 108:1515-23.

[0074] In certain other embodiments, the cells of the invention can be differentiated into mesenchymal stem cells. In certain embodiments, the cells of the invention express one or more single chain fusion HLA class I proteins. MSCs have the potential to form several differentiated cell types, including marrow stromal cells, adipocytes, osteoblasts, and chondrocytes. Thus, inducing pluripotent stem cells to form MSCs (iMSCs) is useful in treating skeletal and joint conditions. The iMSCs can be further differentiated into osteoblasts and formed bone in vivo. Deyle et al., 2012, Mol Ther. 20(1):204-13. Cellular responses of T cells and NK cells to ESCs, iMSCs, and their more terminally differentiated derivatives such as osteoblasts can be examined.

[0075] In certain particular embodiments, the mesenchymal stem cells are capable of differentiating into non-limiting examples of cell types such as marrow stromal cells, adipocytes, osteoblasts, osteocytes and chondrocytes. The cells of the invention are differentiated into mesenchymal stem cells either before or after the cells are administered to a patient. In certain preferred embodiments, the cell is a human cell and the patient is a human. In vitro mesenchymal differentiation can be performed according to established protocols. See for example, Deyle et al., supra.

[0076] In yet other particular embodiments, the cells of the invention can be differentiated into insulin-producing pancreatic islet cells. In certain embodiments, the cells of the invention express one or more single chain fusion HLA class I proteins. The cells of the invention can be used to treat insulin-dependent diabetes mellitus. Advantageously, the transplanted cells do not need to reconstitute a functioning pancreas: they just need to secrete insulin in response to glucose levels. Therefore the treatment can succeed with different cell doses, with cells that are not perfectly differentiated into adult cell types, and when cells are transplanted into an ectopic location. Specific auto-antigens such as those derived from GAD65 or Insulin can cause autoimmune destruction of 13 cells in diabetes (Di Lorenzo et al., 2007, Clin Exp Immunol 148, 1-16). Thus, B2M-/- cells or B2M-/- cells expressing a single chain fusion HLA class I protein presenting a predetermined peptide antigen provide additional advantages in that they do not present these auto-antigens and can avoid autoimmune rejection and prevent a relapse of diabetes after transplantation.

[0077] The cells of the invention can be differentiated into pancreatic cells as described previously, which employs exposure of cells to different cytokines and drugs to promote sequential formation of mesendoderm, definitive endoderm, and pancreatic progenitors (Kroon et al., 2008, Nat Biotechnol 26, 443-452). These cells can be further cultured in implants in immunodeficient mice. The cells of the invention with or without expressing a single chain fusion HLA class I protein and wild-type cell lines can be analyzed at different developmental stages for their reactivity with T cells and NK cells.

[0078] The cells of the invention are differentiated into pancreatic islet cell either before or after patient administration. In certain preferred embodiments, the cell is a human cell and the patient is a human. In vitro hematopoietic differentiation can be performed according to established protocols. See for example, Kroon et al., 2008, Nat Biotechnol 26, 443-452.

[0079] In certain other particular embodiments, the cells of the invention can be differentiated into cardiomyocytes. In certain embodiments, the cells of the invention further express one or more single chain fusion HLA class I proteins. The common clinical problems of myocardial infarction and congestive heart failure can be treated by transplanting healthy stem cell-derived cardiomyocytes that engraft and re-establish functional myocardium. The cells of the invention-derived cardiomyocytes allow these treatments to proceed with pre-packaged cells and avoid the immunosuppression currently required for allogeneic heart transplants. Physiologically relevant tests can be performed on the cardiomyocytes derived from the cells of the invention, such as electrical conduction and contraction studies. B2M-/- stem cells or differentiated cardiomyocytes with or without expressing a single chain fusion HLA class I protein can be tested to determine their immunological reactivity when expressing cardiomyocyte genes, and to establish which HLA modifications minimize these immune responses.

[0080] The cells of the invention can be differentiated into cardiomyocytes either before or after the cells are administered to a patient. In certain preferred embodiments, the cell is a human cell and the patient is a human. In certain embodiments, the cells of the invention are differentiated into cardiomyocytes for treating diseases including without limitation myocardial infarction and congestive heart failure. In vitro cardiomyocyte differentiation can be performed according to established protocols. See for example, Laflamme et al., 2007, Nat Biotechnol 25, 1015-1024.

[0081] In yet other particular embodiments, the cells of the invention can be differentiated into keratinocytes. In certain embodiments, the cells of the invention used for differentiation into keratinocytes express one or more single chain fusion HLA class I proteins. Severe burns and genetic skin conditions require treatment with skin grafts, and this is currently done with a variety of cell sources such as porcine skin grafts and cultured autologous human keratinocytes. Keratinocytes derived from the cells of the invention can provide a major clinical advance, since burns could be treated as an emergency with pre-packaged cells, and genetic diseases such as epidermolysis bullosum can be treated with normal cells (albeit with the B2M-/- background in the cellular chromosome) that do not require correction of the responsible genetic mutations. In many cases the cells only need to engraft long enough for neighboring host cells to repopulate the affected area. FIG. 5 shows in vitro differentiation of Keratin 5+ and Keratin 14+ keratinocyte colonies from the cells of the invention. The cells of the invention were cultured in matrigel cultures, followed by expansion in serum-free keratinocyte medium containing all-trans retinoic acid and BMP4 as described previously (Itoh et al., 2011, PNAS USA 108, 8797-8802). For in vivo differentiation, the cells of the invention can be embedded in polyvinyl alcohol sponge (PVA)-collagen gel implants for transplantation into a recipient. The cells of the invention can be differentiated into keratinocytes either before or after transplantation. In certain preferred embodiments, the cell is a human cell and the patient is a human.

[0082] In yet another aspect, the invention provides a use of the cells of the invention for the preparation for a medicament for transplantation. In a related aspect, the invention provides a use of the cells of the invention for the preparation for a medicament for treating a disease condition.

[0083] Further, the cells of the invention can serve as a research tool to provide a system for studying the functions of immunoregulatory proteins in a B2M-/- genetic background. In certain embodiments, the cells of the invention further express one or more single chain fusion HLA class I proteins. Accordingly, in a related aspect, the invention provides a method of determining the function of an immunoregulatory protein comprising the steps of introducing one or more immunoregulatory genes into the cells of the invention of the invention and assaying for the activities of the immunoregulatory genes. In certain preferred embodiments, the cell is a human cell. For example, the cells of the invention can be used to study the function of an immune regulatory gene, or to study an immune response, in the absence of unwanted class I antigens. In certain embodiments, the cells of the invention express HLA-F or a single chain fusion HLA class I protein comprising B2M and HLA-F, wherein the function of the HLA-F can be studied in the B2M-/- background. In a further related aspect, the invention provides a method of identifying a compound or molecule that modulates the function of the immunoregulatory protein comprising the steps of contacted the B2M-/- cells comprising the one or more immunoregulatory genes with a compound or molecule of interest and assaying for the activities of the immunoregulatory genes. In certain preferred embodiments, the cell is a human cell.

[0084] In yet another related aspect, the invention provides an in vivo research tool in a mammal, particular in a non-human primate, that are administered the cells of the invention, for studying the functions of immunoregulatory genes, or identifying a compound that modulates the function of an immunoregulatory gene in the administered cells in a B2M-/- genetic background. In certain embodiments, the cells of the invention further express one or more single chain fusion HLA class I proteins.

[0085] Mice, especially immune deficient mice, have been used as a model system for studying human cells in vivo. Human stem cells can behave differently in mice. In addition, the mouse and human immune systems have different NK cell receptors and non-classical MHC class I genes (e.g. HLA-E, F and G). Therefore, a Macaca nemestrina (Mn, pigtailed macaque) model can be developed to study the cells of the invention. The Macaca mulatta genome has been sequenced, which is highly homologous to the nemestrina genome. Further, the organization of macaque MHC loci is similar to human HLA, including the non-classical genes. Homologs of the human HLA-E and HLA-G genes have been identified in macaques. The macaque MHC loci also contain homologs of many human NK cell receptors. Human B2M-/- ESCs as well as Mn B2M-/- ESCs can be used for transplantation in macaques.

[0086] MHC class I-deficient (B2M-/-) macaque ESCs can be developed using the same AAV-mediated gene targeting strategy described for human cells. Mn versions of the single-chain HLA class I fusion proteins are expressed in the B2M-/- macaques ESCs using the analogous viral vectors as described above.

[0087] Cells can be expanded in vitro and labeled with a vector expressing GFP for subsequent identification of transplanted cells. The cells can be embedded in polyvinyl alcohol sponge (PVA)-collagen gel implants, and placed subdermally into macaques. The implants can be harvested, sectioned and stained to determine the cell types that are present. Specific antibodies can be used to identify the differentiated cell types formed by the transplanted cells.

[0088] Any and every embodiment described above applies to any and every aspect of the invention, unless the context clearly indicates otherwise. All embodiments within and between different aspects can be combined unless the context clearly dictates otherwise.

[0089] The Examples, which follow, are illustrative of specific embodiments of the invention, and various uses thereof. They are set forth for explanatory purposes only, and are not to be taken as limiting the invention.

EXAMPLES

Example 1

Construction of Human Pluripotent Stem Cells with Knockout Mutations in B2M Genes

[0090] Human pluripotent stem cells were created with knockout mutations in both alleles of the beta-2 microglobulin (B2M) genes that encodes the common subunit required for surface expression of all HLA class I heterodimers (HLA-A, B, C, E, F and G). Adeno-associated virus (AAV) gene targeting vectors were used to construct B2M-/- (class I-negative) H1 human ESCs (University of Wisconsin). Human pluripotent stem cells were infected with AAV gene targeting vectors and the B2M gene was inactivated by homologous recombination. AAV mediated gene targeting methodology has been described previously in for example, Khan et al., 2011, Protocol, 482:482-501 and Khan et al., 1990, Science 248:1227-30. These references are hereby incorporated by reference in their entirety.

[0091] FIG. 1 describes the construction of HLA class I-negative human H1 ESCs cells using the adeno-associated virus (AAV) gene targeting vectors. The two AAV vectors used contain homologous arms surrounding exon 1 of the human B2M gene, and are designed to insert either a TKNeo or HyTK fusion gene encoding G418 or hygromycin resistance respectively into exon 1 (FIG. 1A). The H1 ESCs were infected with the AAV-B2M-ETKNpA vector and 30% of G418-resistant cells were targeted at one B2M allele based on Southern blot analysis. One of these clones was then infected with the AAV-B2M-EHyTKpA vector and 10% of hygromycin-resistant cells were targeted at B2M. Southern blot analysis of a representative clone that had deletions in both B2M alleles (B2M-/-) is shown in FIG. 1B (HyTK/TKN). None of the targeted clones analyzed contained random integrants. The sequences of the targeting vector plasmid pA2-B2METKMpA and pA2-B2MEHuTKpA are shown in SEQ ID NO:21 and SEQ ID NO:22, respectively.

[0092] Cre recombinase was then used to remove the floxed TKNeo and HyTK transgenes from the B2M loci. Cre was delivered transiently by a non-integrating foamy virus vector, which is a type of retroviral vector previously described that efficiently infects human ESCs. See Deyle, et al., 2010, J. Virol 84, 9341-9 and Gharwan et al., 2007, Mol Ther 15, 1827-1833. Four clones lacking the TKNeo and HyTK transgenes were selected by gancyclovir section that kills cells expressing thymidine kinase (TK). The results shown by Southern blot analysis demonstrated transgene-free double knockouts (FIG. 1B).

[0093] Karyotypes were checked on two of these clones and found to be normal (data not shown), and teratoma assays conducted in immunodeficient mice showed that these cells had trilineage developmental potential (FIG. 2). Flow cytometry with antibodies against B2M (anti-B2M-01-PE from SantaCruz Biotechnology) and pan-HLA class I antigens (W6/32 from Sigma-Aldrich) confirmed that these cells did not express HLA class I proteins on the cell surface (FIG. 1C).

Example 2

Expression of Single Chain Fusion HLA class I Proteins in B2M Knockout Cells

[0094] In mice, HLA class I-negative cells can be destructed by Natural Killer (NK) cells through the "missing self" mechanism. Bix et al., 1991, Nature 349, 329-331. Human NK cells have different receptors, but an analogous inhibition of NK cell killing is mediated through interactions of NK cell receptors with HLA-C, E and G. The "missing self" phenomenon has largely been described for class I-deficient hematopoietic cells, and the mouse transplantation data reported previously showing that many types of B2M-/- organs survived in B2M+/+ hosts suggests that it may be less important when transplanting cells form solid organs. However, given that it could significantly affect donor cell survival in some settings, specific HLA class I genes as single chain fusion proteins that suppress NK cell killing were introduced to the B2M-/- cells.

[0095] The strategy for expressing specific HLA class I genes in a B2M-/- background is shown in FIG. 3. The B2M chain was fused to the specific HLA class I heavy chain, thereby allowing surface expression of the class I chain even in B2M-/- cells. Integrating foamy virus vectors were used to express these single chain fusion proteins. Foamy virus vectors are a type of retroviral vector with a large packaging capacity that can efficiently infect human pluripotent stem cells (Gharwan at al., supra). One such representative single chain fusion HLA class I protein foamy virus construct is shown in FIG. 3A. The .DELTA..PHI.-EGP-PHLA-SC foamy vectors included a GFP-Puro fusion protein gene driven by an EF1 alpha promoter to allow for puromycin selection and GFP expression in transduced cells, and a separate expression cassette with a ubiquitously expressed PGK promoter driving an HLA single chain fusion (HLA-SC) construct (FIG. 3A). The vector design produced constitutive expression of both transgenes, but many other vector designs and internal promoters can also be used. For example, the GFP-Pur gene can be driven by the pGK promoter, and the EF1alpha promoter controls the expression of the SC HLA gene.

[0096] As shown in FIG. 3B, the HLA-bGBE trimeric single chain fusion construct included a covalently attached HLA-G peptide (SEQ ID NO:23) in the HLA-E peptide binding cleft, while the HLA-gBE dimeric construct included an HLA-G signal peptide that was cleaved off but still bound non-covalently to the HLA-E molecule. See Crew et al., 2005, Mol Immunol 42, 1205-1214. B2M-/- cells were transduced with these vectors. Puromycin-resistant clones were selected and HLA-E surface expression was analyzed by flow cytometry in pluripotent cells expressing these constructs (FIG. 3C).

[0097] Further, a specific classical HLA class I allele single chain fusion protein was constructed and expressed in B2M-/- H1 ESCs to create a "semi-universal" donor cell to facilitate compatibility with recipients. For example, in the U.S. the HLA-A0201 allele is present in 48% of Caucasians, 46% of Hispanics, and 24% of African-Americans, all of which should accept HLA-A0201+ stem cells. See www.allelfrequencies.net and Storkus et al., 1989, PNAS USA 86:2361-2364. The HLA-bBA0201 dimeric single chain fusion construct was introduced in B2M-/- H1 ESCs by foamy virus vectors (FIG. 3B) and the expression of the single chain fusion protein was analyzed by flow cytometry (FIG. 3D).

[0098] It should be understood that the foregoing disclosure emphasizes certain specific embodiments of the invention and that all modifications or alternatives equivalent thereto are within the spirit and scope of the invention as set forth in the appended claims.

Sequence CWU 1

1

231987DNAHomo sapiensCDS(61)..(420) 1aatataagtg gaggcgtcgc gctggcgggc attcctgaag ctgacagcat tcgggccgag 60atg tct cgc tcc gtg gcc tta gct gtg ctc gcg cta ctc tct ctt tct 108Met Ser Arg Ser Val Ala Leu Ala Val Leu Ala Leu Leu Ser Leu Ser1 5 10 15 ggc ctg gag gct atc cag cgt act cca aag att cag gtt tac tca cgt 156Gly Leu Glu Ala Ile Gln Arg Thr Pro Lys Ile Gln Val Tyr Ser Arg 20 25 30cat cca gca gag aat gga aag tca aat ttc ctg aat tgc tat gtg tct 204His Pro Ala Glu Asn Gly Lys Ser Asn Phe Leu Asn Cys Tyr Val Ser 35 40 45ggg ttt cat cca tcc gac att gaa gtt gac tta ctg aag aat gga gag 252Gly Phe His Pro Ser Asp Ile Glu Val Asp Leu Leu Lys Asn Gly Glu 50 55 60aga att gaa aaa gtg gag cat tca gac ttg tct ttc agc aag gac tgg 300Arg Ile Glu Lys Val Glu His Ser Asp Leu Ser Phe Ser Lys Asp Trp65 70 75 80tct ttc tat ctc ttg tac tac act gaa ttc acc ccc act gaa aaa gat 348Ser Phe Tyr Leu Leu Tyr Tyr Thr Glu Phe Thr Pro Thr Glu Lys Asp 85 90 95gag tat gcc tgc cgt gtg aac cat gtg act ttg tca cag ccc aag ata 396Glu Tyr Ala Cys Arg Val Asn His Val Thr Leu Ser Gln Pro Lys Ile 100 105 110gtt aag tgg gat cga gac atg taa gcagcatcat ggaggtttga agatgccgca 450Val Lys Trp Asp Arg Asp Met 115tttggattgg atgaattcca aattctgctt gcttgctttt taatattgat atgcttatac 510acttacactt tatgcacaaa atgtagggtt ataataatgt taacatggac atgatcttct 570ttataattct actttgagtg ctgtctccat gtttgatgta tctgagcagg ttgctccaca 630ggtagctcta ggagggctgg caacttagag gtggggagca gagaattctc ttatccaaca 690tcaacatctt ggtcagattt gaactcttca atctcttgca ctcaaagctt gttaagatag 750ttaagcgtgc ataagttaac ttccaattta catactctgc ttagaatttg ggggaaaatt 810tagaaatata attgacagga ttattggaaa tttgttataa tgaatgaaac attttgtcat 870ataagattca tatttacttc ttatacattt gataaagtaa ggcatggttg tggttaatct 930ggtttatttt tgttccacaa gttaaataaa tcataaaact tgatgtgtta tctctta 9872119PRTHomo sapiens 2Met Ser Arg Ser Val Ala Leu Ala Val Leu Ala Leu Leu Ser Leu Ser 1 5 10 15 Gly Leu Glu Ala Ile Gln Arg Thr Pro Lys Ile Gln Val Tyr Ser Arg 20 25 30 His Pro Ala Glu Asn Gly Lys Ser Asn Phe Leu Asn Cys Tyr Val Ser 35 40 45 Gly Phe His Pro Ser Asp Ile Glu Val Asp Leu Leu Lys Asn Gly Glu 50 55 60 Arg Ile Glu Lys Val Glu His Ser Asp Leu Ser Phe Ser Lys Asp Trp 65 70 75 80 Ser Phe Tyr Leu Leu Tyr Tyr Thr Glu Phe Thr Pro Thr Glu Lys Asp 85 90 95 Glu Tyr Ala Cys Arg Val Asn His Val Thr Leu Ser Gln Pro Lys Ile 100 105 110 Val Lys Trp Asp Arg Asp Met 115 31611DNAHomo sapiensCDS(85)..(1182) 3gagaagccaa tcagtgtcgt cgcggtcgct gttctaaagt ccgcacgcac ccaccgggac 60tcagattctc cccagacgcc gagg atg gcc gtc atg gcg ccc cga acc ctc 111 Met Ala Val Met Ala Pro Arg Thr Leu 1 5 ctc ctg cta ctc tcg ggg gcc ctg gcc ctg acc cag acc tgg gcg ggc 159Leu Leu Leu Leu Ser Gly Ala Leu Ala Leu Thr Gln Thr Trp Ala Gly10 15 20 25tcc cac tcc atg agg tat ttc ttc aca tcc gtg tcc cgg ccc ggc cgc 207Ser His Ser Met Arg Tyr Phe Phe Thr Ser Val Ser Arg Pro Gly Arg 30 35 40ggg gag ccc cgc ttc atc gcc gtg ggc tac gtg gac gac acg cag ttc 255Gly Glu Pro Arg Phe Ile Ala Val Gly Tyr Val Asp Asp Thr Gln Phe 45 50 55gtg cgg ttc gac agc gac gcc gcg agc cag aag atg gag ccg cgg gcg 303Val Arg Phe Asp Ser Asp Ala Ala Ser Gln Lys Met Glu Pro Arg Ala 60 65 70ccg tgg ata gag cag gag ggg ccg gag tat tgg gac cag gag aca cgg 351Pro Trp Ile Glu Gln Glu Gly Pro Glu Tyr Trp Asp Gln Glu Thr Arg 75 80 85aat atg aag gcc cac tca cag act gac cga gcg aac ctg ggg acc ctg 399Asn Met Lys Ala His Ser Gln Thr Asp Arg Ala Asn Leu Gly Thr Leu90 95 100 105cgc ggc tac tac aac cag agc gag gac ggt tct cac acc atc cag ata 447Arg Gly Tyr Tyr Asn Gln Ser Glu Asp Gly Ser His Thr Ile Gln Ile 110 115 120atg tat ggc tgc gac gtg ggg ccg gac ggg cgc ttc ctc cgc ggg tac 495Met Tyr Gly Cys Asp Val Gly Pro Asp Gly Arg Phe Leu Arg Gly Tyr 125 130 135cgg cag gac gcc tac gac ggc aag gat tac atc gcc ctg aac gag gac 543Arg Gln Asp Ala Tyr Asp Gly Lys Asp Tyr Ile Ala Leu Asn Glu Asp 140 145 150ctg cgc tct tgg acc gcg gcg gac atg gca gct cag atc acc aag cgc 591Leu Arg Ser Trp Thr Ala Ala Asp Met Ala Ala Gln Ile Thr Lys Arg 155 160 165aag tgg gag gcg gtc cat gcg gcg gag cag cgg aga gtc tac ctg gag 639Lys Trp Glu Ala Val His Ala Ala Glu Gln Arg Arg Val Tyr Leu Glu170 175 180 185ggc cgg tgc gtg gac ggg ctc cgc aga tac ctg gag aac ggg aag gag 687Gly Arg Cys Val Asp Gly Leu Arg Arg Tyr Leu Glu Asn Gly Lys Glu 190 195 200acg ctg cag cgc acg gac ccc ccc aag aca cat atg acc cac cac ccc 735Thr Leu Gln Arg Thr Asp Pro Pro Lys Thr His Met Thr His His Pro 205 210 215atc tct gac cat gag gcc acc ctg agg tgc tgg gcc ctg ggc ttc tac 783Ile Ser Asp His Glu Ala Thr Leu Arg Cys Trp Ala Leu Gly Phe Tyr 220 225 230cct gcg gag atc aca ctg acc tgg cag cgg gat ggg gag gac cag acc 831Pro Ala Glu Ile Thr Leu Thr Trp Gln Arg Asp Gly Glu Asp Gln Thr 235 240 245cag gac acg gag ctc gtg gag acc agg cct gca ggg gat gga acc ttc 879Gln Asp Thr Glu Leu Val Glu Thr Arg Pro Ala Gly Asp Gly Thr Phe250 255 260 265cag aag tgg gcg gct gtg gtg gtg cct tct gga gag gag cag aga tac 927Gln Lys Trp Ala Ala Val Val Val Pro Ser Gly Glu Glu Gln Arg Tyr 270 275 280acc tgc cat gtg cag cat gag ggt ctg ccc aag ccc ctc acc ctg aga 975Thr Cys His Val Gln His Glu Gly Leu Pro Lys Pro Leu Thr Leu Arg 285 290 295tgg gag ctg tct tcc cag ccc acc atc ccc atc gtg ggc atc att gct 1023Trp Glu Leu Ser Ser Gln Pro Thr Ile Pro Ile Val Gly Ile Ile Ala 300 305 310ggc ctg gtt ctc ctt gga gct gtg atc act gga gct gtg gtc gct gcc 1071Gly Leu Val Leu Leu Gly Ala Val Ile Thr Gly Ala Val Val Ala Ala 315 320 325gtg atg tgg agg agg aag agc tca gat aga aaa gga ggg agt tac act 1119Val Met Trp Arg Arg Lys Ser Ser Asp Arg Lys Gly Gly Ser Tyr Thr330 335 340 345cag gct gca agc agt gac agt gcc cag ggc tct gat gtg tct ctc aca 1167Gln Ala Ala Ser Ser Asp Ser Ala Gln Gly Ser Asp Val Ser Leu Thr 350 355 360gct tgt aaa gtg tga gacagctgcc ttgtgtggga ctgagaggca agagttgttc 1222Ala Cys Lys Val 365ctgcccttcc ctttgtgact tgaagaaccc tgactttgtt tctgcaaagg cacctgcatg 1282tgtctgtgtt cgtgtaggca taatgtgagg aggtggggag agcaccccac ccccatgtcc 1342accatgaccc tcttcccacg ctgacctgtg ctccctctcc aatcatcttt cctgttccag 1402agaggtgggg ctgaggtgtc tccatctctg tctcaacttc atggtgcact gagctgtaac 1462ttcttccttc cctattaaaa ttagaacctg agtataaatt tactttctca aattcttgcc 1522atgagaggtt gatgagttaa ttaaaggaga agattcctaa aatttgagag acaaaattaa 1582tggaacgcat gagaaccttc cagagtcca 16114365PRTHomo sapiens 4Met Ala Val Met Ala Pro Arg Thr Leu Leu Leu Leu Leu Ser Gly Ala 1 5 10 15 Leu Ala Leu Thr Gln Thr Trp Ala Gly Ser His Ser Met Arg Tyr Phe 20 25 30 Phe Thr Ser Val Ser Arg Pro Gly Arg Gly Glu Pro Arg Phe Ile Ala 35 40 45 Val Gly Tyr Val Asp Asp Thr Gln Phe Val Arg Phe Asp Ser Asp Ala 50 55 60 Ala Ser Gln Lys Met Glu Pro Arg Ala Pro Trp Ile Glu Gln Glu Gly 65 70 75 80 Pro Glu Tyr Trp Asp Gln Glu Thr Arg Asn Met Lys Ala His Ser Gln 85 90 95 Thr Asp Arg Ala Asn Leu Gly Thr Leu Arg Gly Tyr Tyr Asn Gln Ser 100 105 110 Glu Asp Gly Ser His Thr Ile Gln Ile Met Tyr Gly Cys Asp Val Gly 115 120 125 Pro Asp Gly Arg Phe Leu Arg Gly Tyr Arg Gln Asp Ala Tyr Asp Gly 130 135 140 Lys Asp Tyr Ile Ala Leu Asn Glu Asp Leu Arg Ser Trp Thr Ala Ala 145 150 155 160 Asp Met Ala Ala Gln Ile Thr Lys Arg Lys Trp Glu Ala Val His Ala 165 170 175 Ala Glu Gln Arg Arg Val Tyr Leu Glu Gly Arg Cys Val Asp Gly Leu 180 185 190 Arg Arg Tyr Leu Glu Asn Gly Lys Glu Thr Leu Gln Arg Thr Asp Pro 195 200 205 Pro Lys Thr His Met Thr His His Pro Ile Ser Asp His Glu Ala Thr 210 215 220 Leu Arg Cys Trp Ala Leu Gly Phe Tyr Pro Ala Glu Ile Thr Leu Thr 225 230 235 240 Trp Gln Arg Asp Gly Glu Asp Gln Thr Gln Asp Thr Glu Leu Val Glu 245 250 255 Thr Arg Pro Ala Gly Asp Gly Thr Phe Gln Lys Trp Ala Ala Val Val 260 265 270 Val Pro Ser Gly Glu Glu Gln Arg Tyr Thr Cys His Val Gln His Glu 275 280 285 Gly Leu Pro Lys Pro Leu Thr Leu Arg Trp Glu Leu Ser Ser Gln Pro 290 295 300 Thr Ile Pro Ile Val Gly Ile Ile Ala Gly Leu Val Leu Leu Gly Ala 305 310 315 320 Val Ile Thr Gly Ala Val Val Ala Ala Val Met Trp Arg Arg Lys Ser 325 330 335 Ser Asp Arg Lys Gly Gly Ser Tyr Thr Gln Ala Ala Ser Ser Asp Ser 340 345 350 Ala Gln Gly Ser Asp Val Ser Leu Thr Ala Cys Lys Val 355 360 365 51578DNAHomo sapiensCDS(55)..(1143) 5agttctaaag tccccacgca cccacccgga ctcagagtct cctcagacgc cgag atg 57 Met 1ctg gtc atg gcg ccc cga acc gtc ctc ctg ctg ctc tcg gcg gcc ctg 105Leu Val Met Ala Pro Arg Thr Val Leu Leu Leu Leu Ser Ala Ala Leu 5 10 15gcc ctg acc gag acc tgg gcc ggc tcc cac tcc atg agg tat ttc tac 153Ala Leu Thr Glu Thr Trp Ala Gly Ser His Ser Met Arg Tyr Phe Tyr 20 25 30acc tcc gtg tcc cgg ccc ggc cgc ggg gag ccc cgc ttc atc tca gtg 201Thr Ser Val Ser Arg Pro Gly Arg Gly Glu Pro Arg Phe Ile Ser Val 35 40 45ggc tac gtg gac gac acc cag ttc gtg agg ttc gac agc gac gcc gcg 249Gly Tyr Val Asp Asp Thr Gln Phe Val Arg Phe Asp Ser Asp Ala Ala50 55 60 65agt ccg aga gag gag ccg cgg gcg ccg tgg ata gag cag gag ggg ccg 297Ser Pro Arg Glu Glu Pro Arg Ala Pro Trp Ile Glu Gln Glu Gly Pro 70 75 80gag tat tgg gac cgg aac aca cag atc tac aag gcc cag gca cag act 345Glu Tyr Trp Asp Arg Asn Thr Gln Ile Tyr Lys Ala Gln Ala Gln Thr 85 90 95gac cga gag agc ctg cgg aac ctg cgc ggc tac tac aac cag agc gag 393Asp Arg Glu Ser Leu Arg Asn Leu Arg Gly Tyr Tyr Asn Gln Ser Glu 100 105 110gcc ggg tct cac acc ctc cag agc atg tac ggc tgc gac gtg ggg ccg 441Ala Gly Ser His Thr Leu Gln Ser Met Tyr Gly Cys Asp Val Gly Pro 115 120 125gac ggg cgc ctc ctc cgc ggg cat gac cag tac gcc tac gac ggc aag 489Asp Gly Arg Leu Leu Arg Gly His Asp Gln Tyr Ala Tyr Asp Gly Lys130 135 140 145gat tac atc gcc ctg aac gag gac ctg cgc tcc tgg acc gcc gcg gac 537Asp Tyr Ile Ala Leu Asn Glu Asp Leu Arg Ser Trp Thr Ala Ala Asp 150 155 160acg gcg gct cag atc acc cag cgc aag tgg gag gcg gcc cgt gag gcg 585Thr Ala Ala Gln Ile Thr Gln Arg Lys Trp Glu Ala Ala Arg Glu Ala 165 170 175gag cag cgg aga gcc tac ctg gag ggc gag tgc gtg gag tgg ctc cgc 633Glu Gln Arg Arg Ala Tyr Leu Glu Gly Glu Cys Val Glu Trp Leu Arg 180 185 190aga tac ctg gag aac ggg aag gac aag ctg gag cgc gct gac ccc cca 681Arg Tyr Leu Glu Asn Gly Lys Asp Lys Leu Glu Arg Ala Asp Pro Pro 195 200 205aag aca cac gtg acc cac cac ccc atc tct gac cat gag gcc acc ctg 729Lys Thr His Val Thr His His Pro Ile Ser Asp His Glu Ala Thr Leu210 215 220 225agg tgc tgg gcc ctg ggt ttc tac cct gcg gag atc aca ctg acc tgg 777Arg Cys Trp Ala Leu Gly Phe Tyr Pro Ala Glu Ile Thr Leu Thr Trp 230 235 240cag cgg gat ggc gag gac caa act cag gac act gag ctt gtg gag acc 825Gln Arg Asp Gly Glu Asp Gln Thr Gln Asp Thr Glu Leu Val Glu Thr 245 250 255aga cca gca gga gat aga acc ttc cag aag tgg gca gct gtg gtg gtg 873Arg Pro Ala Gly Asp Arg Thr Phe Gln Lys Trp Ala Ala Val Val Val 260 265 270cct tct gga gaa gag cag aga tac aca tgc cat gta cag cat gag ggg 921Pro Ser Gly Glu Glu Gln Arg Tyr Thr Cys His Val Gln His Glu Gly 275 280 285ctg ccg aag ccc ctc acc ctg aga tgg gag ccg tct tcc cag tcc acc 969Leu Pro Lys Pro Leu Thr Leu Arg Trp Glu Pro Ser Ser Gln Ser Thr290 295 300 305gtc ccc atc gtg ggc att gtt gct ggc ctg gct gtc cta gca gtt gtg 1017Val Pro Ile Val Gly Ile Val Ala Gly Leu Ala Val Leu Ala Val Val 310 315 320gtc atc gga gct gtg gtc gct gct gtg atg tgt agg agg aag agt tca 1065Val Ile Gly Ala Val Val Ala Ala Val Met Cys Arg Arg Lys Ser Ser 325 330 335ggt gga aaa gga ggg agc tac tct cag gct gcg tgc agc gac agt gcc 1113Gly Gly Lys Gly Gly Ser Tyr Ser Gln Ala Ala Cys Ser Asp Ser Ala 340 345 350cag ggc tct gat gtg tct ctc aca gct tga aaagcctgag acagctgtct 1163Gln Gly Ser Asp Val Ser Leu Thr Ala 355 360tgtgagggac tgagatgcag gatttcttca cgcctcccct ttgtgacttc aagagcctct 1223ggcatctctt tctgcaaagg cacctgaatg tgtctgcgtc cctgttagca taatgtgagg 1283aggtggagag acagcccacc cttgtgtcca ctgtgacccc tgttcccatg ctgacctgtg 1343tttcctcccc agtcatcttt cttgttccag agaggtgggg ctggatgtct ccatctctgt 1403ctcaacttta cgtgcactga gctgcaactt cttacttccc tactgaaaat aagaatctga 1463atataaattt gttttctcaa atatttgcta tgagaggttg atggattaat taaataagtc 1523aattcctgga atttgagaga gcaaataaag acctgagaac cttccagaaa aaaaa 15786362PRTHomo sapiens 6Met Leu Val Met Ala Pro Arg Thr Val Leu Leu Leu Leu Ser Ala Ala 1 5 10 15 Leu Ala Leu Thr Glu Thr Trp Ala Gly Ser His Ser Met Arg Tyr Phe 20 25 30 Tyr Thr Ser Val Ser Arg Pro Gly Arg Gly Glu Pro Arg Phe Ile Ser 35 40 45 Val Gly Tyr Val Asp Asp Thr Gln Phe Val Arg Phe Asp Ser Asp Ala 50 55 60 Ala Ser Pro Arg Glu Glu Pro Arg Ala Pro Trp Ile Glu Gln Glu Gly 65 70 75 80 Pro Glu Tyr Trp Asp Arg Asn Thr Gln Ile Tyr Lys Ala Gln Ala Gln 85 90 95 Thr Asp Arg Glu Ser Leu Arg Asn Leu Arg Gly Tyr Tyr Asn Gln Ser 100 105 110 Glu Ala Gly Ser His Thr Leu Gln Ser Met Tyr Gly Cys Asp Val Gly 115 120 125 Pro Asp Gly Arg Leu Leu Arg Gly His Asp Gln Tyr Ala Tyr Asp Gly 130 135 140 Lys Asp Tyr Ile Ala Leu Asn Glu Asp Leu Arg Ser Trp Thr Ala Ala 145 150 155 160 Asp Thr Ala Ala Gln Ile Thr Gln Arg Lys Trp Glu Ala Ala Arg Glu 165 170 175 Ala Glu Gln Arg Arg Ala Tyr Leu Glu Gly Glu Cys Val Glu Trp Leu 180 185 190 Arg Arg Tyr Leu Glu Asn Gly Lys Asp Lys Leu Glu Arg Ala Asp Pro 195 200 205 Pro Lys Thr His Val Thr His His Pro Ile Ser Asp His Glu Ala Thr 210 215 220 Leu Arg Cys Trp Ala Leu Gly Phe Tyr Pro Ala Glu Ile Thr Leu Thr 225 230 235 240 Trp Gln Arg Asp Gly Glu Asp Gln Thr Gln Asp Thr Glu Leu Val Glu

245 250 255 Thr Arg Pro Ala Gly Asp Arg Thr Phe Gln Lys Trp Ala Ala Val Val 260 265 270 Val Pro Ser Gly Glu Glu Gln Arg Tyr Thr Cys His Val Gln His Glu 275 280 285 Gly Leu Pro Lys Pro Leu Thr Leu Arg Trp Glu Pro Ser Ser Gln Ser 290 295 300 Thr Val Pro Ile Val Gly Ile Val Ala Gly Leu Ala Val Leu Ala Val 305 310 315 320 Val Val Ile Gly Ala Val Val Ala Ala Val Met Cys Arg Arg Lys Ser 325 330 335 Ser Gly Gly Lys Gly Gly Ser Tyr Ser Gln Ala Ala Cys Ser Asp Ser 340 345 350 Ala Gln Gly Ser Asp Val Ser Leu Thr Ala 355 360 71586DNAHomo sapiensCDS(66)..(1166) 7tccgcagtcc cggttctaaa gtccccagtc acccacccgg actcacattc tccccagagg 60ccgag atg cgg gtc atg gcg ccc cga gcc ctc ctc ctg ctg ctc tcg gga 110 Met Arg Val Met Ala Pro Arg Ala Leu Leu Leu Leu Leu Ser Gly 1 5 10 15ggc ctg gcc ctg acc gag acc tgg gcc tgc tcc cac tcc atg agg tat 158Gly Leu Ala Leu Thr Glu Thr Trp Ala Cys Ser His Ser Met Arg Tyr 20 25 30ttc gac acc gcc gtg tcc cgg ccc ggc cgc gga gag ccc cgc ttc atc 206Phe Asp Thr Ala Val Ser Arg Pro Gly Arg Gly Glu Pro Arg Phe Ile 35 40 45tca gtg ggc tac gtg gac gac acg cag ttc gtg cgg ttc gac agc gac 254Ser Val Gly Tyr Val Asp Asp Thr Gln Phe Val Arg Phe Asp Ser Asp 50 55 60gcc gcg agt ccg aga ggg gag ccg cgg gcg ccg tgg gtg gag cag gag 302Ala Ala Ser Pro Arg Gly Glu Pro Arg Ala Pro Trp Val Glu Gln Glu 65 70 75ggg ccg gag tat tgg gac cgg gag aca cag aag tac aag cgc cag gca 350Gly Pro Glu Tyr Trp Asp Arg Glu Thr Gln Lys Tyr Lys Arg Gln Ala80 85 90 95cag gct gac cga gtg agc ctg cgg aac ctg cgc ggc tac tac aac cag 398Gln Ala Asp Arg Val Ser Leu Arg Asn Leu Arg Gly Tyr Tyr Asn Gln 100 105 110agc gag gac ggg tct cac acc ctc cag agg atg tct ggc tgc gac ctg 446Ser Glu Asp Gly Ser His Thr Leu Gln Arg Met Ser Gly Cys Asp Leu 115 120 125ggg ccc gac ggg cgc ctc ctc cgc ggg tat gac cag tcc gcc tac gac 494Gly Pro Asp Gly Arg Leu Leu Arg Gly Tyr Asp Gln Ser Ala Tyr Asp 130 135 140ggc aag gat tac atc gcc ctg aac gag gac ctg cgc tcc tgg acc gcc 542Gly Lys Asp Tyr Ile Ala Leu Asn Glu Asp Leu Arg Ser Trp Thr Ala 145 150 155gcg gac acc gcg gct cag atc acc cag cgc aag ttg gag gcg gcc cgt 590Ala Asp Thr Ala Ala Gln Ile Thr Gln Arg Lys Leu Glu Ala Ala Arg160 165 170 175gcg gcg gag cag ctg aga gcc tac ctg gag ggc acg tgc gtg gag tgg 638Ala Ala Glu Gln Leu Arg Ala Tyr Leu Glu Gly Thr Cys Val Glu Trp 180 185 190ctc cgc aga tac ctg gag aac ggg aag gag acg ctg cag cgc gca gaa 686Leu Arg Arg Tyr Leu Glu Asn Gly Lys Glu Thr Leu Gln Arg Ala Glu 195 200 205ccc cca aag aca cac gtg acc cac cac ccc ctc tct gac cat gag gcc 734Pro Pro Lys Thr His Val Thr His His Pro Leu Ser Asp His Glu Ala 210 215 220acc ctg agg tgc tgg gcc ctg ggc ttc tac cct gcg gag atc aca ctg 782Thr Leu Arg Cys Trp Ala Leu Gly Phe Tyr Pro Ala Glu Ile Thr Leu 225 230 235acc tgg cag cgg gat ggg gag gac cag acc cag gac acc gag ctt gtg 830Thr Trp Gln Arg Asp Gly Glu Asp Gln Thr Gln Asp Thr Glu Leu Val240 245 250 255gag acc agg cca gca gga gat gga acc ttc cag aag tgg gca gct gtg 878Glu Thr Arg Pro Ala Gly Asp Gly Thr Phe Gln Lys Trp Ala Ala Val 260 265 270gtg gtg cct tct gga caa gag cag aga tac acg tgc cat atg cag cac 926Val Val Pro Ser Gly Gln Glu Gln Arg Tyr Thr Cys His Met Gln His 275 280 285gag ggg ctg caa gag ccc ctc acc ctg agc tgg gag cca tct tcc cag 974Glu Gly Leu Gln Glu Pro Leu Thr Leu Ser Trp Glu Pro Ser Ser Gln 290 295 300ccc acc atc ccc atc atg ggc atc gtt gct ggc ctg gct gtc ctg gtt 1022Pro Thr Ile Pro Ile Met Gly Ile Val Ala Gly Leu Ala Val Leu Val 305 310 315gtc cta gct gtc ctt gga gct gtg gtc acc gct atg atg tgt agg agg 1070Val Leu Ala Val Leu Gly Ala Val Val Thr Ala Met Met Cys Arg Arg320 325 330 335aag agc tca ggt gga aaa gga ggg agc tgc tct cag gct gcg tgc agc 1118Lys Ser Ser Gly Gly Lys Gly Gly Ser Cys Ser Gln Ala Ala Cys Ser 340 345 350aac agt gcc cag ggc tct gat gag tct ctc atc act tgt aaa gcc tga 1166Asn Ser Ala Gln Gly Ser Asp Glu Ser Leu Ile Thr Cys Lys Ala 355 360 365gacagctgcc tgtgtgggac tgagatgcag gatttcttca cacctctcct ttgtgacttc 1226aagagcctct ggcatctctt tctgcaaagg cacctgaatg tgtctgcgtt cctgttagca 1286taatgtgagg aggtggagag acagcccacc cccgtgtcca ccgtgacccc tgtccccaca 1346ctgacctgtg ttccctcccc gatcatcttt cctgttccag agaggtgggg ctggatgtct 1406ccatctctgt ctcaaattca tggtgcactg agctgcaact tcttacttcc ctaatgaagt 1466taagaacctg aatataaatt tgtgttctca aatatttgct atgaagcgtt gatggattaa 1526ttaaataagt caattcctag aagttgagag agcaaataaa gacctgagaa ccttccagaa 15868366PRTHomo sapiens 8Met Arg Val Met Ala Pro Arg Ala Leu Leu Leu Leu Leu Ser Gly Gly 1 5 10 15 Leu Ala Leu Thr Glu Thr Trp Ala Cys Ser His Ser Met Arg Tyr Phe 20 25 30 Asp Thr Ala Val Ser Arg Pro Gly Arg Gly Glu Pro Arg Phe Ile Ser 35 40 45 Val Gly Tyr Val Asp Asp Thr Gln Phe Val Arg Phe Asp Ser Asp Ala 50 55 60 Ala Ser Pro Arg Gly Glu Pro Arg Ala Pro Trp Val Glu Gln Glu Gly 65 70 75 80 Pro Glu Tyr Trp Asp Arg Glu Thr Gln Lys Tyr Lys Arg Gln Ala Gln 85 90 95 Ala Asp Arg Val Ser Leu Arg Asn Leu Arg Gly Tyr Tyr Asn Gln Ser 100 105 110 Glu Asp Gly Ser His Thr Leu Gln Arg Met Ser Gly Cys Asp Leu Gly 115 120 125 Pro Asp Gly Arg Leu Leu Arg Gly Tyr Asp Gln Ser Ala Tyr Asp Gly 130 135 140 Lys Asp Tyr Ile Ala Leu Asn Glu Asp Leu Arg Ser Trp Thr Ala Ala 145 150 155 160 Asp Thr Ala Ala Gln Ile Thr Gln Arg Lys Leu Glu Ala Ala Arg Ala 165 170 175 Ala Glu Gln Leu Arg Ala Tyr Leu Glu Gly Thr Cys Val Glu Trp Leu 180 185 190 Arg Arg Tyr Leu Glu Asn Gly Lys Glu Thr Leu Gln Arg Ala Glu Pro 195 200 205 Pro Lys Thr His Val Thr His His Pro Leu Ser Asp His Glu Ala Thr 210 215 220 Leu Arg Cys Trp Ala Leu Gly Phe Tyr Pro Ala Glu Ile Thr Leu Thr 225 230 235 240 Trp Gln Arg Asp Gly Glu Asp Gln Thr Gln Asp Thr Glu Leu Val Glu 245 250 255 Thr Arg Pro Ala Gly Asp Gly Thr Phe Gln Lys Trp Ala Ala Val Val 260 265 270 Val Pro Ser Gly Gln Glu Gln Arg Tyr Thr Cys His Met Gln His Glu 275 280 285 Gly Leu Gln Glu Pro Leu Thr Leu Ser Trp Glu Pro Ser Ser Gln Pro 290 295 300 Thr Ile Pro Ile Met Gly Ile Val Ala Gly Leu Ala Val Leu Val Val 305 310 315 320 Leu Ala Val Leu Gly Ala Val Val Thr Ala Met Met Cys Arg Arg Lys 325 330 335 Ser Ser Gly Gly Lys Gly Gly Ser Cys Ser Gln Ala Ala Cys Ser Asn 340 345 350 Ser Ala Gln Gly Ser Asp Glu Ser Leu Ile Thr Cys Lys Ala 355 360 365 92679DNAHomo sapiensCDS(127)..(1203) 9gcagactcag ttctcattcc caatgggtgt cgggtttcta gagaagccaa tcagcgtcgc 60cacgactccc gactataaag tccccatccg gactcaagaa gttctcagga ctcagaggct 120gggatc atg gta gat gga acc ctc ctt tta ctc ctc tcg gag gcc ctg 168 Met Val Asp Gly Thr Leu Leu Leu Leu Leu Ser Glu Ala Leu 1 5 10gcc ctt acc cag acc tgg gcg ggc tcc cac tcc ttg aag tat ttc cac 216Ala Leu Thr Gln Thr Trp Ala Gly Ser His Ser Leu Lys Tyr Phe His15 20 25 30act tcc gtg tcc cgg ccc ggc cgc ggg gag ccc cgc ttc atc tct gtg 264Thr Ser Val Ser Arg Pro Gly Arg Gly Glu Pro Arg Phe Ile Ser Val 35 40 45ggc tac gtg gac gac acc cag ttc gtg cgc ttc gac aac gac gcc gcg 312Gly Tyr Val Asp Asp Thr Gln Phe Val Arg Phe Asp Asn Asp Ala Ala 50 55 60agt ccg agg atg gtg ccg cgg gcg ccg tgg atg gag cag gag ggg tca 360Ser Pro Arg Met Val Pro Arg Ala Pro Trp Met Glu Gln Glu Gly Ser 65 70 75gag tat tgg gac cgg gag aca cgg agc gcc agg gac acc gca cag att 408Glu Tyr Trp Asp Arg Glu Thr Arg Ser Ala Arg Asp Thr Ala Gln Ile 80 85 90ttc cga gtg aat ctg cgg acg ctg cgc ggc tac tac aat cag agc gag 456Phe Arg Val Asn Leu Arg Thr Leu Arg Gly Tyr Tyr Asn Gln Ser Glu95 100 105 110gcc ggg tct cac acc ctg cag tgg atg cat ggc tgc gag ctg ggg ccc 504Ala Gly Ser His Thr Leu Gln Trp Met His Gly Cys Glu Leu Gly Pro 115 120 125gac ggg cgc ttc ctc cgc ggg tat gaa cag ttc gcc tac gac ggc aag 552Asp Gly Arg Phe Leu Arg Gly Tyr Glu Gln Phe Ala Tyr Asp Gly Lys 130 135 140gat tat ctc acc ctg aat gag gac ctg cgc tcc tgg acc gcg gtg gac 600Asp Tyr Leu Thr Leu Asn Glu Asp Leu Arg Ser Trp Thr Ala Val Asp 145 150 155acg gcg gct cag atc tcc gag caa aag tca aat gat gcc tct gag gcg 648Thr Ala Ala Gln Ile Ser Glu Gln Lys Ser Asn Asp Ala Ser Glu Ala 160 165 170gag cac cag aga gcc tac ctg gaa gac aca tgc gtg gag tgg ctc cac 696Glu His Gln Arg Ala Tyr Leu Glu Asp Thr Cys Val Glu Trp Leu His175 180 185 190aaa tac ctg gag aag ggg aag gag acg ctg ctt cac ctg gag ccc cca 744Lys Tyr Leu Glu Lys Gly Lys Glu Thr Leu Leu His Leu Glu Pro Pro 195 200 205aag aca cac gtg act cac cac ccc atc tct gac cat gag gcc acc ctg 792Lys Thr His Val Thr His His Pro Ile Ser Asp His Glu Ala Thr Leu 210 215 220agg tgc tgg gcc ctg ggc ttc tac cct gcg gag atc aca ctg acc tgg 840Arg Cys Trp Ala Leu Gly Phe Tyr Pro Ala Glu Ile Thr Leu Thr Trp 225 230 235cag cag gat ggg gag ggc cat acc cag gac acg gag ctc gtg gag acc 888Gln Gln Asp Gly Glu Gly His Thr Gln Asp Thr Glu Leu Val Glu Thr 240 245 250agg cct gca ggg gat gga acc ttc cag aag tgg gca gct gtg gtg gtg 936Arg Pro Ala Gly Asp Gly Thr Phe Gln Lys Trp Ala Ala Val Val Val255 260 265 270cct tct gga gag gag cag aga tac acg tgc cat gtg cag cat gag ggg 984Pro Ser Gly Glu Glu Gln Arg Tyr Thr Cys His Val Gln His Glu Gly 275 280 285cta ccc gag ccc gtc acc ctg aga tgg aag ccg gct tcc cag ccc acc 1032Leu Pro Glu Pro Val Thr Leu Arg Trp Lys Pro Ala Ser Gln Pro Thr 290 295 300atc ccc atc gtg ggc atc att gct ggc ctg gtt ctc ctt gga tct gtg 1080Ile Pro Ile Val Gly Ile Ile Ala Gly Leu Val Leu Leu Gly Ser Val 305 310 315gtc tct gga gct gtg gtt gct gct gtg ata tgg agg aag aag agc tca 1128Val Ser Gly Ala Val Val Ala Ala Val Ile Trp Arg Lys Lys Ser Ser 320 325 330ggt gga aaa gga ggg agc tac tct aag gct gag tgg agc gac agt gcc 1176Gly Gly Lys Gly Gly Ser Tyr Ser Lys Ala Glu Trp Ser Asp Ser Ala335 340 345 350cag ggg tct gag tct cac agc ttg taa agcctgagac agctgccttg 1223Gln Gly Ser Glu Ser His Ser Leu 355tgtgcgactg agatgcacag ctgccttgtg tgcgactgag atgcaggatt tcctcacgcc 1283tcccctatgt gtcttagggg actctggctt ctctttttgc aagggcctct gaatctgtct 1343gtgtccctgt tagcacaatg tgaggaggta gagaaacagt ccacctctgt gtctaccatg 1403acccccttcc tcacactgac ctgtgttcct tccctgttct cttttctatt aaaaataaga 1463acctgggcag agtgcggcag ctcatgcctg taatcccagc acttagggag gccgaggagg 1523gcagatcacg aggtcaggag atcgaaacca tcctggctaa cacggtgaaa ccccgtctct 1583actaaaaaat acaaaaaatt agctgggcgc agaggcacgg gcctgtagtc ccagctactc 1643aggaggcgga ggcaggagaa tggcgtcaac ccgggaggcg gaggttgcag tgagccagga 1703ttgtgcgact gcactccagc ctgggtgaca gggtgaaacg ccatctcaaa aaataaaaat 1763tgaaaaataa aaaaagaacc tggatctcaa tttaattttt catattcttg caatgaaatg 1823gacttgagga agctaagatc atagctagaa atacagataa ttccacagca catctctagc 1883aaatttagcc tattcctatt ctctagccta ttccttacca cctgtaatct tgaccatata 1943ccttggagtt gaatattgtt ttcatactgc tgtggtttga atgttccctc caacactcat 2003gttgagactt aatccctaat gtggcaatac tgaaaggtgg ggcctttgag atgtgattgg 2063atcgtaaggc tgtgccttca ttcatgggtt aatggattaa tgggttatca caggaatggg 2123actggtggct ttataagaag aggaaaagag aactgagcta gcatgcccag cccacagaga 2183gcctccacta gagtgatgct aagtggaaat gtgaggtgca gctgccacag agggccccca 2243ccagggaaat gtctagtgtc tagtggatcc aggccacagg agagagtgcc ttgtggagcg 2303ctgggagcag gacctgacca ccaccaggac cccagaactg tggagtcagt ggcagcatgc 2363agcgccccct tgggaaagct ttaggcacca gcctgcaacc cattcgagca gccacgtagg 2423ctgcacccag caaagccaca ggcacggggc tacctgaggc cttgggggcc caatccctgc 2483tccagtgtgt ccgtgaggca gcacacgaag tcaaaagaga ttattctctt cccacagata 2543ccttttctct cccatgaccc tttaacagca tctgcttcat tcccctcacc ttcccaggct 2603gatctgaggt aaactttgaa gtaaaataaa agctgtgttt gagcatcatt tgtatttcaa 2663aaaaaaaaaa aaaaaa 267910358PRTHomo sapiens 10Met Val Asp Gly Thr Leu Leu Leu Leu Leu Ser Glu Ala Leu Ala Leu 1 5 10 15 Thr Gln Thr Trp Ala Gly Ser His Ser Leu Lys Tyr Phe His Thr Ser 20 25 30 Val Ser Arg Pro Gly Arg Gly Glu Pro Arg Phe Ile Ser Val Gly Tyr 35 40 45 Val Asp Asp Thr Gln Phe Val Arg Phe Asp Asn Asp Ala Ala Ser Pro 50 55 60 Arg Met Val Pro Arg Ala Pro Trp Met Glu Gln Glu Gly Ser Glu Tyr 65 70 75 80 Trp Asp Arg Glu Thr Arg Ser Ala Arg Asp Thr Ala Gln Ile Phe Arg 85 90 95 Val Asn Leu Arg Thr Leu Arg Gly Tyr Tyr Asn Gln Ser Glu Ala Gly 100 105 110 Ser His Thr Leu Gln Trp Met His Gly Cys Glu Leu Gly Pro Asp Gly 115 120 125 Arg Phe Leu Arg Gly Tyr Glu Gln Phe Ala Tyr Asp Gly Lys Asp Tyr 130 135 140 Leu Thr Leu Asn Glu Asp Leu Arg Ser Trp Thr Ala Val Asp Thr Ala 145 150 155 160 Ala Gln Ile Ser Glu Gln Lys Ser Asn Asp Ala Ser Glu Ala Glu His 165 170 175 Gln Arg Ala Tyr Leu Glu Asp Thr Cys Val Glu Trp Leu His Lys Tyr 180 185 190 Leu Glu Lys Gly Lys Glu Thr Leu Leu His Leu Glu Pro Pro Lys Thr 195 200 205 His Val Thr His His Pro Ile Ser Asp His Glu Ala Thr Leu Arg Cys 210 215 220 Trp Ala Leu Gly Phe Tyr Pro Ala Glu Ile Thr Leu Thr Trp Gln Gln 225 230 235 240 Asp Gly Glu Gly His Thr Gln Asp Thr Glu Leu Val Glu Thr Arg Pro 245 250 255 Ala Gly Asp Gly Thr Phe Gln Lys Trp Ala Ala Val Val Val Pro Ser 260 265 270 Gly Glu Glu Gln Arg Tyr Thr Cys His Val Gln His Glu Gly Leu Pro 275 280 285 Glu Pro Val Thr Leu Arg Trp Lys Pro Ala Ser Gln Pro Thr Ile Pro 290 295 300 Ile Val Gly Ile Ile Ala Gly Leu Val Leu Leu Gly Ser Val Val Ser 305 310 315 320 Gly Ala Val Val Ala Ala Val Ile Trp Arg Lys Lys Ser Ser Gly Gly 325 330 335 Lys Gly Gly Ser Tyr Ser Lys Ala Glu Trp Ser Asp Ser Ala Gln Gly 340 345 350 Ser Glu Ser His Ser Leu 355 111301DNAHomo sapiensCDS(125)..(1165) 11tttctcactc ccattgggcg tcgcgtttct agagaagcca atcagtgtcg ccgcagttcc 60caggttctaa agtcccacgc accccgcggg actcatattt ttcccagacg cggaggttgg 120ggtc atg gcg ccc cga agc ctc ctc ctg ctg ctc tca ggg gcc ctg gcc 169 Met Ala Pro Arg Ser Leu Leu

Leu Leu Leu Ser Gly Ala Leu Ala 1 5 10 15ctg acc gat act tgg gcg ggc tcc cac tcc ttg agg tat ttc agc acc 217Leu Thr Asp Thr Trp Ala Gly Ser His Ser Leu Arg Tyr Phe Ser Thr 20 25 30gct gtg tcg cgg ccc ggc cgc ggg gag ccc cgc tac atc gcc gtg gag 265Ala Val Ser Arg Pro Gly Arg Gly Glu Pro Arg Tyr Ile Ala Val Glu 35 40 45tac gta gac gac acg caa ttc ctg cgg ttc gac agc gac gcc gcg att 313Tyr Val Asp Asp Thr Gln Phe Leu Arg Phe Asp Ser Asp Ala Ala Ile 50 55 60ccg agg atg gag ccg cgg gag ccg tgg gtg gag caa gag ggg ccg cag 361Pro Arg Met Glu Pro Arg Glu Pro Trp Val Glu Gln Glu Gly Pro Gln 65 70 75tat tgg gag tgg acc aca ggg tac gcc aag gcc aac gca cag act gac 409Tyr Trp Glu Trp Thr Thr Gly Tyr Ala Lys Ala Asn Ala Gln Thr Asp80 85 90 95cga gtg gcc ctg agg aac ctg ctc cgc cgc tac aac cag agc gag gct 457Arg Val Ala Leu Arg Asn Leu Leu Arg Arg Tyr Asn Gln Ser Glu Ala 100 105 110ggg tct cac acc ctc cag gga atg aat ggc tgc gac atg ggg ccc gac 505Gly Ser His Thr Leu Gln Gly Met Asn Gly Cys Asp Met Gly Pro Asp 115 120 125gga cgc ctc ctc cgc ggg tat cac cag cac gcg tac gac ggc aag gat 553Gly Arg Leu Leu Arg Gly Tyr His Gln His Ala Tyr Asp Gly Lys Asp 130 135 140tac atc tcc ctg aac gag gac ctg cgc tcc tgg acc gcg gcg gac acc 601Tyr Ile Ser Leu Asn Glu Asp Leu Arg Ser Trp Thr Ala Ala Asp Thr 145 150 155gtg gct cag atc acc cag cgc ttc tat gag gca gag gaa tat gca gag 649Val Ala Gln Ile Thr Gln Arg Phe Tyr Glu Ala Glu Glu Tyr Ala Glu160 165 170 175gag ttc agg acc tac ctg gag ggc gag tgc ctg gag ttg ctc cgc aga 697Glu Phe Arg Thr Tyr Leu Glu Gly Glu Cys Leu Glu Leu Leu Arg Arg 180 185 190tac ttg gag aat ggg aag gag acg cta cag cgc gca gat cct cca aag 745Tyr Leu Glu Asn Gly Lys Glu Thr Leu Gln Arg Ala Asp Pro Pro Lys 195 200 205gca cac gtt gcc cac cac ccc atc tct gac cat gag gcc acc ctg agg 793Ala His Val Ala His His Pro Ile Ser Asp His Glu Ala Thr Leu Arg 210 215 220tgc tgg gcc ctg ggc ttc tac cct gcg gag atc acg ctg acc tgg cag 841Cys Trp Ala Leu Gly Phe Tyr Pro Ala Glu Ile Thr Leu Thr Trp Gln 225 230 235cgg gat ggg gag gaa cag acc cag gac aca gag ctt gtg gag acc agg 889Arg Asp Gly Glu Glu Gln Thr Gln Asp Thr Glu Leu Val Glu Thr Arg240 245 250 255cct gca ggg gat gga acc ttc cag aag tgg gcc gct gtg gtg gtg cct 937Pro Ala Gly Asp Gly Thr Phe Gln Lys Trp Ala Ala Val Val Val Pro 260 265 270cct gga gag gaa cag aga tac aca tgc cat gtg cag cac gag ggg ctg 985Pro Gly Glu Glu Gln Arg Tyr Thr Cys His Val Gln His Glu Gly Leu 275 280 285ccc cag ccc ctc atc ctg aga tgg gag cag tct ccc cag ccc acc atc 1033Pro Gln Pro Leu Ile Leu Arg Trp Glu Gln Ser Pro Gln Pro Thr Ile 290 295 300ccc atc gtg ggc atc gtt gct ggc ctt gtt gtc ctt gga gct gtg gtc 1081Pro Ile Val Gly Ile Val Ala Gly Leu Val Val Leu Gly Ala Val Val 305 310 315act gga gct gtg gtc gct gct gtg atg tgg agg aag aag agc tca gat 1129Thr Gly Ala Val Val Ala Ala Val Met Trp Arg Lys Lys Ser Ser Asp320 325 330 335aga aac aga ggg agc tac tct cag gct gca gtg tga gacagcttcc 1175Arg Asn Arg Gly Ser Tyr Ser Gln Ala Ala Val 340 345ttgtgtggga ctgagaagca agatatcaat gtagcagaat tgcacttgtg cctcacgaac 1235atacataaat tttaaaaata aagaataaaa atatatcttt ttatagataa aaaaaaaaaa 1295aaaaaa 130112346PRTHomo sapiens 12Met Ala Pro Arg Ser Leu Leu Leu Leu Leu Ser Gly Ala Leu Ala Leu 1 5 10 15 Thr Asp Thr Trp Ala Gly Ser His Ser Leu Arg Tyr Phe Ser Thr Ala 20 25 30 Val Ser Arg Pro Gly Arg Gly Glu Pro Arg Tyr Ile Ala Val Glu Tyr 35 40 45 Val Asp Asp Thr Gln Phe Leu Arg Phe Asp Ser Asp Ala Ala Ile Pro 50 55 60 Arg Met Glu Pro Arg Glu Pro Trp Val Glu Gln Glu Gly Pro Gln Tyr 65 70 75 80 Trp Glu Trp Thr Thr Gly Tyr Ala Lys Ala Asn Ala Gln Thr Asp Arg 85 90 95 Val Ala Leu Arg Asn Leu Leu Arg Arg Tyr Asn Gln Ser Glu Ala Gly 100 105 110 Ser His Thr Leu Gln Gly Met Asn Gly Cys Asp Met Gly Pro Asp Gly 115 120 125 Arg Leu Leu Arg Gly Tyr His Gln His Ala Tyr Asp Gly Lys Asp Tyr 130 135 140 Ile Ser Leu Asn Glu Asp Leu Arg Ser Trp Thr Ala Ala Asp Thr Val 145 150 155 160 Ala Gln Ile Thr Gln Arg Phe Tyr Glu Ala Glu Glu Tyr Ala Glu Glu 165 170 175 Phe Arg Thr Tyr Leu Glu Gly Glu Cys Leu Glu Leu Leu Arg Arg Tyr 180 185 190 Leu Glu Asn Gly Lys Glu Thr Leu Gln Arg Ala Asp Pro Pro Lys Ala 195 200 205 His Val Ala His His Pro Ile Ser Asp His Glu Ala Thr Leu Arg Cys 210 215 220 Trp Ala Leu Gly Phe Tyr Pro Ala Glu Ile Thr Leu Thr Trp Gln Arg 225 230 235 240 Asp Gly Glu Glu Gln Thr Gln Asp Thr Glu Leu Val Glu Thr Arg Pro 245 250 255 Ala Gly Asp Gly Thr Phe Gln Lys Trp Ala Ala Val Val Val Pro Pro 260 265 270 Gly Glu Glu Gln Arg Tyr Thr Cys His Val Gln His Glu Gly Leu Pro 275 280 285 Gln Pro Leu Ile Leu Arg Trp Glu Gln Ser Pro Gln Pro Thr Ile Pro 290 295 300 Ile Val Gly Ile Val Ala Gly Leu Val Val Leu Gly Ala Val Val Thr 305 310 315 320 Gly Ala Val Val Ala Ala Val Met Trp Arg Lys Lys Ser Ser Asp Arg 325 330 335 Asn Arg Gly Ser Tyr Ser Gln Ala Ala Val 340 345 131578DNAHomo sapiensCDS(179)..(1195) 13agtgtggtac tttgtcttga ggagatgtcc tggactcaca cggaaactta gggctacgga 60atgaagttct cactcccatt aggtgacagg tttttagaga agccaatcag cgtcgccgcg 120gtcctggttc taaagtcctc gctcacccac ccggactcat tctccccaga cgccaagg 178atg gtg gtc atg gcg ccc cga acc ctc ttc ctg ctg ctc tcg ggg gcc 226Met Val Val Met Ala Pro Arg Thr Leu Phe Leu Leu Leu Ser Gly Ala1 5 10 15ctg acc ctg acc gag acc tgg gcg ggc tcc cac tcc atg agg tat ttc 274Leu Thr Leu Thr Glu Thr Trp Ala Gly Ser His Ser Met Arg Tyr Phe 20 25 30agc gcc gcc gtg tcc cgg ccc ggc cgc ggg gag ccc cgc ttc atc gcc 322Ser Ala Ala Val Ser Arg Pro Gly Arg Gly Glu Pro Arg Phe Ile Ala 35 40 45atg ggc tac gtg gac gac acg cag ttc gtg cgg ttc gac agc gac tcg 370Met Gly Tyr Val Asp Asp Thr Gln Phe Val Arg Phe Asp Ser Asp Ser 50 55 60gcg tgt ccg agg atg gag ccg cgg gcg ccg tgg gtg gag cag gag ggg 418Ala Cys Pro Arg Met Glu Pro Arg Ala Pro Trp Val Glu Gln Glu Gly65 70 75 80ccg gag tat tgg gaa gag gag aca cgg aac acc aag gcc cac gca cag 466Pro Glu Tyr Trp Glu Glu Glu Thr Arg Asn Thr Lys Ala His Ala Gln 85 90 95act gac aga atg aac ctg cag acc ctg cgc ggc tac tac aac cag agc 514Thr Asp Arg Met Asn Leu Gln Thr Leu Arg Gly Tyr Tyr Asn Gln Ser 100 105 110gag gcc agt tct cac acc ctc cag tgg atg att ggc tgc gac ctg ggg 562Glu Ala Ser Ser His Thr Leu Gln Trp Met Ile Gly Cys Asp Leu Gly 115 120 125tcc gac gga cgc ctc ctc cgc ggg tat gaa cag tat gcc tac gat ggc 610Ser Asp Gly Arg Leu Leu Arg Gly Tyr Glu Gln Tyr Ala Tyr Asp Gly 130 135 140aag gat tac ctc gcc ctg aac gag gac ctg cgc tcc tgg acc gca gcg 658Lys Asp Tyr Leu Ala Leu Asn Glu Asp Leu Arg Ser Trp Thr Ala Ala145 150 155 160gac act gcg gct cag atc tcc aag cgc aag tgt gag gcg gcc aat gtg 706Asp Thr Ala Ala Gln Ile Ser Lys Arg Lys Cys Glu Ala Ala Asn Val 165 170 175gct gaa caa agg aga gcc tac ctg gag ggc acg tgc gtg gag tgg ctc 754Ala Glu Gln Arg Arg Ala Tyr Leu Glu Gly Thr Cys Val Glu Trp Leu 180 185 190cac aga tac ctg gag aac ggg aag gag atg ctg cag cgc gcg gac ccc 802His Arg Tyr Leu Glu Asn Gly Lys Glu Met Leu Gln Arg Ala Asp Pro 195 200 205ccc aag aca cac gtg acc cac cac cct gtc ttt gac tat gag gcc acc 850Pro Lys Thr His Val Thr His His Pro Val Phe Asp Tyr Glu Ala Thr 210 215 220ctg agg tgc tgg gcc ctg ggc ttc tac cct gcg gag atc ata ctg acc 898Leu Arg Cys Trp Ala Leu Gly Phe Tyr Pro Ala Glu Ile Ile Leu Thr225 230 235 240tgg cag cgg gat ggg gag gac cag acc cag gac gtg gag ctc gtg gag 946Trp Gln Arg Asp Gly Glu Asp Gln Thr Gln Asp Val Glu Leu Val Glu 245 250 255acc agg cct gca ggg gat gga acc ttc cag aag tgg gca gct gtg gtg 994Thr Arg Pro Ala Gly Asp Gly Thr Phe Gln Lys Trp Ala Ala Val Val 260 265 270gtg cct tct gga gag gag cag aga tac acg tgc cat gtg cag cat gag 1042Val Pro Ser Gly Glu Glu Gln Arg Tyr Thr Cys His Val Gln His Glu 275 280 285ggg ctg ccg gag ccc ctc atg ctg aga tgg aag cag tct tcc ctg ccc 1090Gly Leu Pro Glu Pro Leu Met Leu Arg Trp Lys Gln Ser Ser Leu Pro 290 295 300acc atc ccc atc atg ggt atc gtt gct ggc ctg gtt gtc ctt gca gct 1138Thr Ile Pro Ile Met Gly Ile Val Ala Gly Leu Val Val Leu Ala Ala305 310 315 320gta gtc act gga gct gcg gtc gct gct gtg ctg tgg aga aag aag agc 1186Val Val Thr Gly Ala Ala Val Ala Ala Val Leu Trp Arg Lys Lys Ser 325 330 335tca gat tga aaaggaggga gctactctca ggctgcaatg tgaaacagct 1235Ser Asp gccctgtgtg ggactgagtg gcaagtccct ttgtgacttc aagaaccctg actcctcttt 1295gtgcagagac cagcccaccc ctgtgcccac catgaccctc ttcctcatgc tgaactgcat 1355tccttcccca atcacctttc ctgttccaga aaaggggctg ggatgtctcc gtctctgtct 1415caaatttgtg gtccactgag ctataactta cttctgtatt aaaattagaa tctgagtata 1475aatttacttt ttcaaattat ttccaagaga gattgatggg ttaattaaag gagaagattc 1535ctgaaatttg agagacaaaa taaatggaag acatgagaac ttt 157814338PRTHomo sapiens 14Met Val Val Met Ala Pro Arg Thr Leu Phe Leu Leu Leu Ser Gly Ala 1 5 10 15 Leu Thr Leu Thr Glu Thr Trp Ala Gly Ser His Ser Met Arg Tyr Phe 20 25 30 Ser Ala Ala Val Ser Arg Pro Gly Arg Gly Glu Pro Arg Phe Ile Ala 35 40 45 Met Gly Tyr Val Asp Asp Thr Gln Phe Val Arg Phe Asp Ser Asp Ser 50 55 60 Ala Cys Pro Arg Met Glu Pro Arg Ala Pro Trp Val Glu Gln Glu Gly 65 70 75 80 Pro Glu Tyr Trp Glu Glu Glu Thr Arg Asn Thr Lys Ala His Ala Gln 85 90 95 Thr Asp Arg Met Asn Leu Gln Thr Leu Arg Gly Tyr Tyr Asn Gln Ser 100 105 110 Glu Ala Ser Ser His Thr Leu Gln Trp Met Ile Gly Cys Asp Leu Gly 115 120 125 Ser Asp Gly Arg Leu Leu Arg Gly Tyr Glu Gln Tyr Ala Tyr Asp Gly 130 135 140 Lys Asp Tyr Leu Ala Leu Asn Glu Asp Leu Arg Ser Trp Thr Ala Ala 145 150 155 160 Asp Thr Ala Ala Gln Ile Ser Lys Arg Lys Cys Glu Ala Ala Asn Val 165 170 175 Ala Glu Gln Arg Arg Ala Tyr Leu Glu Gly Thr Cys Val Glu Trp Leu 180 185 190 His Arg Tyr Leu Glu Asn Gly Lys Glu Met Leu Gln Arg Ala Asp Pro 195 200 205 Pro Lys Thr His Val Thr His His Pro Val Phe Asp Tyr Glu Ala Thr 210 215 220 Leu Arg Cys Trp Ala Leu Gly Phe Tyr Pro Ala Glu Ile Ile Leu Thr 225 230 235 240 Trp Gln Arg Asp Gly Glu Asp Gln Thr Gln Asp Val Glu Leu Val Glu 245 250 255 Thr Arg Pro Ala Gly Asp Gly Thr Phe Gln Lys Trp Ala Ala Val Val 260 265 270 Val Pro Ser Gly Glu Glu Gln Arg Tyr Thr Cys His Val Gln His Glu 275 280 285 Gly Leu Pro Glu Pro Leu Met Leu Arg Trp Lys Gln Ser Ser Leu Pro 290 295 300 Thr Ile Pro Ile Met Gly Ile Val Ala Gly Leu Val Val Leu Ala Ala 305 310 315 320 Val Val Thr Gly Ala Ala Val Ala Ala Val Leu Trp Arg Lys Lys Ser 325 330 335 Ser Asp 159232DNAArtificial SequenceVector sequence for human B2M-HLA-A0201 fusion protein 15ctcgaggagc ttggcccatt gcatacgttg tatccatatc ataatatgta catttatatt 60ggctcatgtc caacattacc gccatgttga cattgattat tgactagtta ttaatagtaa 120tcaattacgg ggtcattagt tcatagccca tatatggagt tccgcgttac ataacttacg 180gtaaatggcc cgcctggctg accgcccaac gacccccgcc cattgacgtc aataatgacg 240tatgttccca tagtaacgcc aatagggact ttccattgac gtcaatgggt ggagtattta 300cgctaaactg cccacttggc agtacatcaa gtgtatcata tgccaagtac gccccctatt 360gacgtcaatg acggtaaatg gcccgcctgg cattatgccc agtacatgac cttatgggac 420tttcctactt ggcagtacat ctacgtatta gtcatcgcta ttaccatggt gatgcggttt 480tggcagtaca tcaatgggcg tggatagcgg tttgactcac ggggatttcc aagtctccac 540cccattgacg tcaatgggag tttgttttgg caccaaaatc aacgggactt tccaaaatgt 600cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc gtgtacggtg ggaggtctat 660ataagcagag cttctagatt gtacgggagc tcttcactac tcgctgcgtc gagagtgtac 720gagactctcc aggtttggta agaaatattt tatattgtta taatgttact atgatccatt 780aacactctgc ttatagattg taagggtgat tgcaatgctt tctgcataaa actttggttt 840tcttgttaat caataaaccg acttgattcg agaacctact catatattat tgtctctttt 900atactttatt aagtaaaagg atttgtatat tagccttgct aagggagaca tctagtgata 960taagtgtgaa ctacacttat cttaaatgat gtaactcctt aggataatca atatacaaaa 1020ttccatgaca attggcgccc aacgtggggc tcgaatataa gtcgggttta tttgtaaatt 1080atccctaggg acctccgagc aagcgagcat agcgggaggc atataaaagc caatagacaa 1140tggctagcag gaagtaatgt tgaagaatat gaacttgatg ttgaagctct ggttgtaatt 1200ttaagagata gaaatatacc aagaaatcct ttacatggag aagttatagg tcttcgcctt 1260actgaaggat ggtggggaca aattgagaga tttcagatgg tacgttgatc taaggctatg 1320gatttggcca tgggacaaga aatattagtt tatagtccca ttgtatctat gactaaaata 1380caaaaaactc cactaccaga aagaaaagct ttacccatta gatggataac atggatgact 1440tatttagaag atccaagaat ccaatttcat tatgataaaa ccttaccaga acttaagcat 1500attccagatg tatatacatc tagtcagtct cctgttaaac atccttctca atatgaagga 1560gtgttttata ctgatggctc ggccatcaaa agtcctgatc ctacaaaaag caataatgct 1620ggcatgggaa tagtacatgc cacatacaaa cctgaatatc aagttttgaa tcaatggtca 1680ataccactag gtaatcatac tgctcagatg gctgaaatag ctgcagttga atttgcctgt 1740aaaaaagctt taaaaatacc tggtcctgta ttagttataa ctgatagttt ctatgtagca 1800gaaagtgcta ataaagaatt accatactgg aaatctaatg ggtttgttaa taataagaaa 1860aagcctctta aacatatctc caaatggaaa tctattgctg agtgtttatc tatgaaacca 1920gacattacta ttcaacatga aaaaggcatc agcctacaaa taccagtatt catactgaaa 1980ggcaatgccc tagcagataa gcttgccacc caaggaagtt atgtggttaa ttgtaatacc 2040aaaaaaccaa acctggatgc agagttggat caattattac agggtcatta tataaaagga 2100tatcccaaac aatatacata ttttttagaa gatggcaaag taaaagtttc cagacctgaa 2160ggggttaaaa ttattccccc tcagtcagac agacaaaaaa ttgtgcttca agcccacaat 2220ttggctcaca ccggacgtga agccactctt ttaaaaattg ccaaccttta ttggtggcca 2280aatatgagaa aggatgtggt taaacaacta ggacgctgtc aacagtgttt aatcacaaat 2340gcttccaaca aagcctctgg tcctattcta agaccagata ggcctcaaaa accttttgat 2400aaattcttta ttgactatat tggacctttg ccaccttcac agggatacct atatgtatta 2460gtagttgttg atggaatgac aggattcact tggttatacc ccactaaggc tccttctact 2520agcgcaactg ttaaatctct caatgtactc actagtattg caattccaaa ggtgattcac 2580tctgatcaag gtgcagcatt cacttcttca acctttgctg aatgggcaaa ggaaagaggt 2640atacatttgg aattcagtac tccttatcac ccccaaagtg gtagtaaggt ggaaaggaaa 2700aatagtgata taaaacgact tttaactaaa ctgctagtag gaagacccac aaagtggtat 2760gacctattgc ctgttgtaca acttgcttta aacaacacct atagccctgt attaaaatat 2820actccacatc aactcttatt tggtatagat tcaaatactc catttgcaaa tcaagataca 2880cttgacttga ccagagaaga agaactttct cttttacagg aaattcgtac ttctttatac 2940catccatcca cccctccagc ctcctctcgt tcctggtctc ctgttgttgg ccaattggtc 3000caggagaggg tggctaggcc tgcttctttg

agacctcgtt ggcataaacc gtctactgta 3060cttaaggtgt tgaatccaag gactgttgtt attttggacc atcttggcaa caacagaact 3120gtaagtatag ataatttaaa acctacttct catcagaatg gcaccaccaa tgacactgca 3180acaatggatc atttggaaaa aaatgaataa agcgcatgag gcacttcaaa atacaacaac 3240tgtgactgaa cagcagaagg aacaaattat actggacatt caaaatgaag aagtacaacc 3300aactaggaga gataaattta gatatctgct ttatacttgt tgtgctacta gctcaagagt 3360attggcctgg atgtttttag tttgtatatt gttaatcatt gttttggttt catgctttgt 3420gactatatcc agaatacaat ggaataagga tattcaggta ttaggacctg taatagactg 3480gaatgttact caaagagctg tttatcaacc cttacagact agaaggattg cacgttccct 3540tagaatgcag catcctgttc caaaatatgt ggaggtaaat atgactagta ttccacaagg 3600tgtatactat gaaccccatc cggcctcgac ggtatcgatg gtaccggtat cgataagctt 3660gataacctcg agagatctaa ttctaccggg taggggaggc gcttttccca aggcagtctg 3720gagcatgcgc tttagcagcc ccgctggcac ttggcgctac acaagtggcc tctggcctcg 3780cacacattcc acatccaccg gtagcgccaa ccggctccgt tctttggtgg ccccttcgcg 3840ccacttctac tcctccccta gtcaggaagt ttcccccagc aagctcgcgt cgtgcaggac 3900gtgacaaatg gaagtagcac gtctcactag tctcgtgcag atggacagca ccgctgagca 3960atggaagcgg gtaggccttt ggggcagcgg ccaatagcag ctttgttcct tcgctttctg 4020ggctcagagg ctgggaaggg gtgggtccgg gggcgggctc aggggcgggc tcaggggcgg 4080gcgggcgccc gaaggtcctc ccgaggcccg gcattctgca cgcttcaaaa gcgcacgtct 4140gccgcgctgt tctcctcttc ctcatctccg ggcctttcga cctgcagccc gggggatcga 4200tcctaggtga tttaaatcca ccatgaccga gtacaagccc acggtgcgcc tcgccacccg 4260cgacgacgtc ccccgggccg tacgcaccct cgccgccgcg ttcgccgact accccgccac 4320gcgccacacc gtcgacccgg accgccacat cgagcgggtc accgagctgc aagaactctt 4380cctcacgcgc gtcgggctcg acatcggcaa ggtgtgggtc gcggacgacg gcgccgcggt 4440ggcggtctgg accacgccgg agagcgtcga agcgggggcg gtgttcgccg agatcggccc 4500gcgcatggcc gagttgagcg gttcccggct ggccgcgcag caacagatgg aaggcctcct 4560ggcgccgcac cggcccaagg agcccgcgtg gttcctggcc accgtcggcg tctcgcccga 4620ccaccagggc aagggtctgg gcagcgccgt cgtgctcccc ggagtggagg cggccgagcg 4680cgccggggtg cccgccttcc tggagacctc cgcgccccgc aacctcccct tctacgagcg 4740gctcggcttc accgtcaccg ccgacgtcga gtgcccgaag gaccgcgcga cctggtgcat 4800gacccgcaag cccggtgcct gacggatcca tcgctccggt gcccgtcagt gggcagagcg 4860cacatcgccc acagtccccg agaagttggg gggaggggtc ggcaattgaa ccggtgccta 4920gagaaggtgg cgcggggtaa actgggaaag tgatgtcgtg tactggctcc gcctttttcc 4980cgagggtggg ggagaaccgt atataagtgc agtagtcgcc gtgaacgttc tttttcgcaa 5040cgggtttgcc gccagaacac agctgcggcc gcgccgccac catgtctcgc tccgtggcct 5100tagctgtgct cgcgctactc tctctttctg gcctggaggc tatccagcgt actccaaaga 5160ttcaggttta ctcacgtcat ccagcagaga atggaaagtc aaatttcctg aattgctatg 5220tgtctgggtt tcatccatcc gacattgaag ttgacttact gaagaatgga gagagaattg 5280aaaaagtgga gcattcagac ttgtctttca gcaaggactg gtctttctat ctcttgtact 5340acactgaatt cacccccact gaaaaagatg agtatgcctg ccgtgtgaac catgtgactt 5400tgtcacagcc caagatagtt aagtgggatc gagacatggg cggaggcgga agcgggggcg 5460gcggatccgg cggaggcgga agcggcggcg ggggaagcgg ctctcactcc atgaggtatt 5520tcttcacatc cgtgtcccgg cccggccgcg gggagccccg cttcatcgca gtgggctacg 5580tggacgacac gcagttcgtg cggttcgaca gcgacgccgc gagccagagg atggagccgc 5640gggcgccgtg gatagagcag gagggtccgg agtattggga cggggagaca cggaaagtga 5700aggcccactc acagactcac cgagtggacc tggggaccct gcgcggctac tacaaccaga 5760gcgaggccgg ttctcacacc gtccagagga tgtatggctg cgacgtgggg tcggactggc 5820gcttcctccg cgggtaccac cagtacgcct acgacggcaa ggattacatc gccctgaaag 5880aggacctgcg ctcttggacc gcggcggaca tggcagctca gaccaccaag cacaagtggg 5940aggcggccca tgtggcggag cagttgagag cctacctgga gggcacgtgc gtggagtggc 6000tccgcagata cctggagaac gggaaggaga cgctgcagcg cacggacgcc cccaaaacgc 6060atatgactca ccacgctgtc tctgaccatg aagccaccct gaggtgctgg gccctgagct 6120tctaccctgc ggagatcaca ctgacctggc agcgggatgg ggaggaccag acccaggaca 6180cggagctcgt ggagaccagg cctgcagggg atggaacctt ccagaagtgg gcggctgtgg 6240tggtgccttc tggacaggag cagagataca cctgccatgt gcagcatgag ggtttgccca 6300agcccctcac cctgagatgg gagccgtctt cccagcccac catccccatc gtgggcatca 6360ttgctggcct ggttctcttt ggagctgtga tcactggagc tgtggtcgct gctgtgatgt 6420ggaggaggaa gagctcagat agaaaaggag ggagctactc tcaggctgca agcagtgaca 6480gtgcccaggg ctctgatgtg tctctcacag cttgtaaagt gtgagcggcc gcgactctag 6540atcaggcgcg ccgttaccaa gcagctatgg aagcttatgg acctcagaga ggaagtaacg 6600aggagagggt gtggtggaat gtcactagaa accagggaaa acaaggagga gagtattaca 6660gggaaggagg tgaagaacct cattacccaa atactcctgc tcctcataga cgtacctggg 6720atgagagaca caaggttctt aaattgtcct cattcgctac tccctctgac atccaacgct 6780gggctactaa ctctagattg tacgggagct ctcttcacta ctcgctgcgt cgagagtgta 6840cgagactctc caggtttggt aagaaatatt ttatattgtt ataatgttac tatgatccat 6900taacactctg cttatagatt gtaagggtga ttgcaatgct ttctgcataa aactttggtt 6960ttcttgttaa tcaataaacc gacttgattc gagaacctac tcatatatta ttgtctcttt 7020tatactttat taagtaaaag gatttgtata ttagccttgc taagggagac atctagtgat 7080ataagtgtga actacactta tcttaaatga tgtaactcct taggataatc aatatacaaa 7140attccatgac aattggcgat accgtcgacc gttcagctgc attaatgaat cggccaacgc 7200gcggggagag gcggtttgcg tattgggcgc tcttccgctt cctcgctcac tgactcgctg 7260cgctcggtcg ttcggctgcg gcgagcggta tcagctcact caaaggcggt aatacggtta 7320tccacagaat caggggataa cgcaggaaag aacatgtgag caaaaggcca gcaaaaggcc 7380aggaaccgta aaaaggccgc gttgctggcg tttttccata ggctccgccc ccctgacgag 7440catcacaaaa atcgacgctc aagtcagagg tggcgaaacc cgacaggact ataaagatac 7500caggcgtttc cccctggaag ctccctcgtg cgctctcctg ttccgaccct gccgcttacc 7560ggatacctgt ccgcctttct cccttcggga agcgtggcgc tttctcaatg ctcacgctgt 7620aggtatctca gttcggtgta ggtcgttcgc tccaagctgg gctgtgtgca cgaacccccc 7680gttcagcccg accgctgcgc cttatccggt aactatcgtc ttgagtccaa cccggtaaga 7740cacgacttat cgccactggc agcagccact ggtaacagga ttagcagagc gaggtatgta 7800ggcggtgcta cagagttctt gaagtggtgg cctaactacg gctacactag aaggacagta 7860tttggtatct gcgctctgct gaagccagtt accttcggaa aaagagttgg tagctcttga 7920tccggcaaac aaaccaccgc tggtagcggt ggtttttttg tttgcaagca gcagattacg 7980cgcagaaaaa aaggatctca agaagatcct ttgatctttt ctacggggtc tgacgctcag 8040tggaacgaaa actcacgtta agggattttg gtcatgagat tatcaaaaag gatcttcacc 8100tagatccttt taaattaaaa atgaagtttt aaatcaatct aaagtatata tgagtaaact 8160tggtctgaca gttaccaatg cttaatcagt gaggcaccta tctcagcgat ctgtctattt 8220cgttcatcca tagttgcctg actccccgtc gtgtagataa ctacgatacg ggagggctta 8280ccatctggcc ccagtgctgc aatgataccg cgagacccac gctcaccggc tccagattta 8340tcagcaataa accagccagc cggaagggcc gagcgcagaa gtggtcctgc aactttatcc 8400gcctccatcc agtctattaa ttgttgccgg gaagctagag taagtagttc gccagttaat 8460agtttgcgca acgttgttgc cattgctaca ggcatcgtgg tgtcacgctc gtcgtttggt 8520atggcttcat tcagctccgg ttcccaacga tcaaggcgag ttacatgatc ccccatgttg 8580tgcaaaaaag cggttagctc cttcggtcct ccgatcgttg tcagaagtaa gttggccgca 8640gtgttatcac tcatggttat ggcagcactg cataattctc ttactgtcat gccatccgta 8700agatgctttt ctgtgactgg tgagtactca accaagtcat tctgagaata gtgtatgcgg 8760cgaccgagtt gctcttgccc ggcgtcaata cgggataata ccgcgccaca tagcagaact 8820ttaaaagtgc tcatcattgg aaaacgttct tcggggcgaa aactctcaag gatcttaccg 8880ctgttgagat ccagttcgat gtaacccact cgtgcaccca actgatcttc agcatctttt 8940actttcacca gcgtttctgg gtgagcaaaa acaggaaggc aaaatgccgc aaaaaaggga 9000ataagggcga cacggaaatg ttgaatactc atactcttcc tttttcaata ttattgaagc 9060atttatcagg gttattgtct catgagcgga tacatatttg aatgtattta gaaaaataaa 9120caaatagggg ttccgcgcac atttccccga aaagtgccac ctgacgtcta agaaaccatt 9180attatcatga cattaaccta taaaaatagg cgtatcacga ggccctttcg tc 923216480PRTArtificial SequenceHLA-bBA0201 (human B2M-HLA-A0201 fusion peptide) 16Met Ser Arg Ser Val Ala Leu Ala Val Leu Ala Leu Leu Ser Leu Ser 1 5 10 15 Gly Leu Glu Ala Ile Gln Arg Thr Pro Lys Ile Gln Val Tyr Ser Arg 20 25 30 His Pro Ala Glu Asn Gly Lys Ser Asn Phe Leu Asn Cys Tyr Val Ser 35 40 45 Gly Phe His Pro Ser Asp Ile Glu Val Asp Leu Leu Lys Asn Gly Glu 50 55 60 Arg Ile Glu Lys Val Glu His Ser Asp Leu Ser Phe Ser Lys Asp Trp 65 70 75 80 Ser Phe Tyr Leu Leu Tyr Tyr Thr Glu Phe Thr Pro Thr Glu Lys Asp 85 90 95 Glu Tyr Ala Cys Arg Val Asn His Val Thr Leu Ser Gln Pro Lys Ile 100 105 110 Val Lys Trp Asp Arg Asp Met Gly Gly Gly Gly Ser Gly Gly Gly Gly 115 120 125 Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Ser His Ser Met 130 135 140 Arg Tyr Phe Phe Thr Ser Val Ser Arg Pro Gly Arg Gly Glu Pro Arg 145 150 155 160 Phe Ile Ala Val Gly Tyr Val Asp Asp Thr Gln Phe Val Arg Phe Asp 165 170 175 Ser Asp Ala Ala Ser Gln Arg Met Glu Pro Arg Ala Pro Trp Ile Glu 180 185 190 Gln Glu Gly Pro Glu Tyr Trp Asp Gly Glu Thr Arg Lys Val Lys Ala 195 200 205 His Ser Gln Thr His Arg Val Asp Leu Gly Thr Leu Arg Gly Tyr Tyr 210 215 220 Asn Gln Ser Glu Ala Gly Ser His Thr Val Gln Arg Met Tyr Gly Cys 225 230 235 240 Asp Val Gly Ser Asp Trp Arg Phe Leu Arg Gly Tyr His Gln Tyr Ala 245 250 255 Tyr Asp Gly Lys Asp Tyr Ile Ala Leu Lys Glu Asp Leu Arg Ser Trp 260 265 270 Thr Ala Ala Asp Met Ala Ala Gln Thr Thr Lys His Lys Trp Glu Ala 275 280 285 Ala His Val Ala Glu Gln Leu Arg Ala Tyr Leu Glu Gly Thr Cys Val 290 295 300 Glu Trp Leu Arg Arg Tyr Leu Glu Asn Gly Lys Glu Thr Leu Gln Arg 305 310 315 320 Thr Asp Ala Pro Lys Thr His Met Thr His His Ala Val Ser Asp His 325 330 335 Glu Ala Thr Leu Arg Cys Trp Ala Leu Ser Phe Tyr Pro Ala Glu Ile 340 345 350 Thr Leu Thr Trp Gln Arg Asp Gly Glu Asp Gln Thr Gln Asp Thr Glu 355 360 365 Leu Val Glu Thr Arg Pro Ala Gly Asp Gly Thr Phe Gln Lys Trp Ala 370 375 380 Ala Val Val Val Pro Ser Gly Gln Glu Gln Arg Tyr Thr Cys His Val 385 390 395 400 Gln His Glu Gly Leu Pro Lys Pro Leu Thr Leu Arg Trp Glu Pro Ser 405 410 415 Ser Gln Pro Thr Ile Pro Ile Val Gly Ile Ile Ala Gly Leu Val Leu 420 425 430 Phe Gly Ala Val Ile Thr Gly Ala Val Val Ala Ala Val Met Trp Arg 435 440 445 Arg Lys Ser Ser Asp Arg Lys Gly Gly Ser Tyr Ser Gln Ala Ala Ser 450 455 460 Ser Asp Ser Ala Gln Gly Ser Asp Val Ser Leu Thr Ala Cys Lys Val 465 470 475 480 179224DNAArtificial SequenceVector sequence for HLA-gBE 17ctcgaggagc ttggcccatt gcatacgttg tatccatatc ataatatgta catttatatt 60ggctcatgtc caacattacc gccatgttga cattgattat tgactagtta ttaatagtaa 120tcaattacgg ggtcattagt tcatagccca tatatggagt tccgcgttac ataacttacg 180gtaaatggcc cgcctggctg accgcccaac gacccccgcc cattgacgtc aataatgacg 240tatgttccca tagtaacgcc aatagggact ttccattgac gtcaatgggt ggagtattta 300cgctaaactg cccacttggc agtacatcaa gtgtatcata tgccaagtac gccccctatt 360gacgtcaatg acggtaaatg gcccgcctgg cattatgccc agtacatgac cttatgggac 420tttcctactt ggcagtacat ctacgtatta gtcatcgcta ttaccatggt gatgcggttt 480tggcagtaca tcaatgggcg tggatagcgg tttgactcac ggggatttcc aagtctccac 540cccattgacg tcaatgggag tttgttttgg caccaaaatc aacgggactt tccaaaatgt 600cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc gtgtacggtg ggaggtctat 660ataagcagag cttctagatt gtacgggagc tcttcactac tcgctgcgtc gagagtgtac 720gagactctcc aggtttggta agaaatattt tatattgtta taatgttact atgatccatt 780aacactctgc ttatagattg taagggtgat tgcaatgctt tctgcataaa actttggttt 840tcttgttaat caataaaccg acttgattcg agaacctact catatattat tgtctctttt 900atactttatt aagtaaaagg atttgtatat tagccttgct aagggagaca tctagtgata 960taagtgtgaa ctacacttat cttaaatgat gtaactcctt aggataatca atatacaaaa 1020ttccatgaca attggcgccc aacgtggggc tcgaatataa gtcgggttta tttgtaaatt 1080atccctaggg acctccgagc atagcgggag gcatataaaa gccaatagac aatggctagc 1140aggaagtaat gttgaagaat atgaacttga tgttgaagct ctggttgtaa ttttaagaga 1200tagaaatata ccaagaaatc ctttacatgg agaagttata ggtcttcgcc ttactgaagg 1260atggtgggga caaattgaga gatttcagat ggtacgttga tctaaggcta tggatttggc 1320catgggacaa gaaatattag tttatagtcc cattgtatct atgactaaaa tacaaaaaac 1380tccactacca gaaagaaaag ctttacccat tagatggata acatggatga cttatttaga 1440agatccaaga atccaatttc attatgataa aaccttacca gaacttaagc atattccaga 1500tgtatataca tctagtcagt ctcctgttaa acatccttct caatatgaag gagtgtttta 1560tactgatggc tcggccatca aaagtcctga tcctacaaaa agcaataatg ctggcatggg 1620aatagtacat gccacataca aacctgaata tcaagttttg aatcaatggt caataccact 1680aggtaatcat actgctcaga tggctgaaat agctgcagtt gaatttgcct gtaaaaaagc 1740tttaaaaata cctggtcctg tattagttat aactgatagt ttctatgtag cagaaagtgc 1800taataaagaa ttaccatact ggaaatctaa tgggtttgtt aataataaga aaaagcctct 1860taaacatatc tccaaatgga aatctattgc tgagtgttta tctatgaaac cagacattac 1920tattcaacat gaaaaaggca tcagcctaca aataccagta ttcatactga aaggcaatgc 1980cctagcagat aagcttgcca cccaaggaag ttatgtggtt aattgtaata ccaaaaaacc 2040aaacctggat gcagagttgg atcaattatt acagggtcat tatataaaag gatatcccaa 2100acaatataca tattttttag aagatggcaa agtaaaagtt tccagacctg aaggggttaa 2160aattattccc cctcagtcag acagacaaaa aattgtgctt caagcccaca atttggctca 2220caccggacgt gaagccactc ttttaaaaat tgccaacctt tattggtggc caaatatgag 2280aaaggatgtg gttaaacaac taggacgctg tcaacagtgt ttaatcacaa atgcttccaa 2340caaagcctct ggtcctattc taagaccaga taggcctcaa aaaccttttg ataaattctt 2400tattgactat attggacctt tgccaccttc acagggatac ctatatgtat tagtagttgt 2460tgatggaatg acaggattca cttggttata ccccactaag gctccttcta ctagcgcaac 2520tgttaaatct ctcaatgtac tcactagtat tgcaattcca aaggtgattc actctgatca 2580aggtgcagca ttcacttctt caacctttgc tgaatgggca aaggaaagag gtatacattt 2640ggaattcagt actccttatc acccccaaag tggtagtaag gtggaaagga aaaatagtga 2700tataaaacga cttttaacta aactgctagt aggaagaccc acaaagtggt atgacctatt 2760gcctgttgta caacttgctt taaacaacac ctatagccct gtattaaaat atactccaca 2820tcaactctta tttggtatag attcaaatac tccatttgca aatcaagata cacttgactt 2880gaccagagaa gaagaacttt ctcttttaca ggaaattcgt acttctttat accatccatc 2940cacccctcca gcctcctctc gttcctggtc tcctgttgtt ggccaattgg tccaggagag 3000ggtggctagg cctgcttctt tgagacctcg ttggcataaa ccgtctactg tacttaaggt 3060gttgaatcca aggactgttg ttattttgga ccatcttggc aacaacagaa ctgtaagtat 3120agataattta aaacctactt ctcatcagaa tggcaccacc aatgacactg caacaatgga 3180tcatttggaa aaaaatgaat aaagcgcatg aggcacttca aaatacaaca actgtgactg 3240aacagcagaa ggaacaaatt atactggaca ttcaaaatga agaagtacaa ccaactagga 3300gagataaatt tagatatctg ctttatactt gttgtgctac tagctcaaga gtattggcct 3360ggatgttttt agtttgtata ttgttaatca ttgttttggt ttcatgcttt gtgactatat 3420ccagaataca atggaataag gatattcagg tattaggacc tgtaatagac tggaatgtta 3480ctcaaagagc tgtttatcaa cccttacaga ctagaaggat tgcacgttcc cttagaatgc 3540agcatcctgt tccaaaatat gtggaggtaa atatgactag tattccacaa ggtgtatact 3600atgaacccca tccggcctcg acggtatcga tggtaccggt atcgataagc ttgataacct 3660cgagagatct aattctaccg ggtaggggag gcgcttttcc caaggcagtc tggagcatgc 3720gctttagcag ccccgctggc acttggcgct acacaagtgg cctctggcct cgcacacatt 3780ccacatccac cggtagcgcc aaccggctcc gttctttggt ggccccttcg cgccacttct 3840actcctcccc tagtcaggaa gtttccccca gcaagctcgc gtcgtgcagg acgtgacaaa 3900tggaagtagc acgtctcact agtctcgtgc agatggacag caccgctgag caatggaagc 3960gggtaggcct ttggggcagc ggccaatagc agctttgttc cttcgctttc tgggctcaga 4020ggctgggaag gggtgggtcc gggggcgggc tcaggggcgg gctcaggggc gggcgggcgc 4080ccgaaggtcc tcccgaggcc cggcattctg cacgcttcaa aagcgcacgt ctgccgcgct 4140gttctcctct tcctcatctc cgggcctttc gacctgcagc ccgggggatc gatcctaggt 4200gatttaaatc caccatgacc gagtacaagc ccacggtgcg cctcgccacc cgcgacgacg 4260tcccccgggc cgtacgcacc ctcgccgccg cgttcgccga ctaccccgcc acgcgccaca 4320ccgtcgaccc ggaccgccac atcgagcggg tcaccgagct gcaagaactc ttcctcacgc 4380gcgtcgggct cgacatcggc aaggtgtggg tcgcggacga cggcgccgcg gtggcggtct 4440ggaccacgcc ggagagcgtc gaagcggggg cggtgttcgc cgagatcggc ccgcgcatgg 4500ccgagttgag cggttcccgg ctggccgcgc agcaacagat ggaaggcctc ctggcgccgc 4560accggcccaa ggagcccgcg tggttcctgg ccaccgtcgg cgtctcgccc gaccaccagg 4620gcaagggtct gggcagcgcc gtcgtgctcc ccggagtgga ggcggccgag cgcgccgggg 4680tgcccgcctt cctggagacc tccgcgcccc gcaacctccc cttctacgag cggctcggct 4740tcaccgtcac cgccgacgtc gagtgcccga aggaccgcgc gacctggtgc atgacccgca 4800agcccggtgc ctgacggatc catcgctccg gtgcccgtca gtgggcagag cgcacatcgc 4860ccacagtccc cgagaagttg gggggagggg tcggcaattg aaccggtgcc tagagaaggt 4920ggcgcggggt aaactgggaa agtgatgtcg tgtactggct ccgccttttt cccgagggtg 4980ggggagaacc gtatataagt gcagtagtcg ccgtgaacgt tctttttcgc aacgggtttg 5040ccgccagaac acagctgcgg ccgcgttaac catggtggtc atggcccctc gaactctgtt 5100cctgctgctg agtggggctc tgaccctgac agagtccgga atccagcgga cccccaagat 5160tcaggtgtac agcagacacc ctgcagagaa cggcaaatcc aacttcctga attgctatgt 5220gtctgggttt catcccagtg acatcgaagt cgatctgctg aagaatggcg agagaattga 5280aaaagtcgag cactctgacc tgagcttctc caaggattgg tccttttacc tgctgtacta 5340taccgagttt actccaaccg aaaaagacga gtatgcctgt agggtgaacc atgtcacact 5400gagccagccc aagatcgtga aatgggaccg cgatatgggc gggggaggct cagggggagg 5460cgggagcgga ggcgggggct ccggcggggg aggatccggt tctcactcct tgaagtattt 5520ccacacttcc gtgtcccggc ccggccgcgg ggagccccgc ttcatctctg tgggctacgt

5580ggacgacacc cagttcgtgc gcttcgacaa cgacgccgcg agtccgagga tggtgccgcg 5640ggcgccgtgg atggagcagg aggggtcaga gtattgggac cgggagacac ggagcgccag 5700ggacaccgca cagattttcc gagtgaatct gcggacgctg cgcggctact acaatcagag 5760cgaggccggg tctcacaccc tgcagtggat gcatggctgc gagctggggc ccgacgggcg 5820cttcctccgc gggtatgaac agttcgccta cgacggcaag gattatctca ccctgaatga 5880ggacctgcgc tcctggaccg cggtggacac ggcggctcag atctccgagc aaaagtcaaa 5940tgatgcctct gaggcggagc accagagagc ctacctggaa gacacatgcg tggagtggct 6000ccacaaatac ctggagaagg ggaaggagac gctgcttcac ctggagcccc caaagacaca 6060cgtgactcac caccccatct ctgaccatga ggccaccctg aggtgctggg ccctgggctt 6120ctaccctgcg gagatcacac tgacctggca gcaggatggg gagggccata cccaggacac 6180ggagctcgtg gagaccaggc ctgcagggga tggaaccttc cagaagtggg cagctgtggt 6240ggtgccttct ggagaggagc agagatacac gtgccatgtg cagcatgagg ggctacccga 6300gcccgtcacc ctgagatgga agccggcttc ccagcccacc atccccatcg tgggcatcat 6360tgctggcctg gttctccttg gatctgtggt ctctggagct gtggttgctg ctgtgatatg 6420gaggaagaag agctcaggtg gaaaaggagg gagctactct aaggctgagt ggagcgacag 6480tgcccagggg tctgagtctc acagcttgta atctagagcg gccgcgactc tagatcaggc 6540gcgccgttac caagcagcta tggaagctta tggacctcag agaggaagta acgaggagag 6600ggtgtggtgg aatgtcacta gaaaccaggg aaaacaagga ggagagtatt acagggaagg 6660aggtgaagaa cctcattacc caaatactcc tgctcctcat agacgtacct gggatgagag 6720acacaaggtt cttaaattgt cctcattcgc tactccctct gacatccaac gctggactac 6780taactctaga ttgtacggga ggctcttcac tactcgctgc gtcgagagtg tacgagactc 6840tccaggtttg gtaagaaata ttttatattg ttataatgtt actatgatcc attaacactc 6900tgcttataga ttgtaagggt gattgcaatg ctttctgcat aaaactttgg ttttcttgtt 6960aatcaataaa ccgacttgat tcgagaacca actcctatat tattgtctct tttatacttt 7020attaagtaaa aggatttgta tattagcctt gctaagggag acatctagtg atataagtgt 7080gaactacact tatcttaaat gatgtaactc cttaggataa tcaatataca aaattccatg 7140acaattggcg ataccgtcga ccgttcagct gcattaatga atcggccaac gcgcggggag 7200aggcggtttg cgtattgggc gctcttccgc ttcctcgctc actgactcgc tgcgctcggt 7260cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg gtaatacggt tatccacaga 7320atcaggggat aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg ccaggaaccg 7380taaaaaggcc gcgttgctgg cgtttttcca taggctccgc ccccctgacg agcatcacaa 7440aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga ctataaagat accaggcgtt 7500tccccctgga agctccctcg tgcgctctcc tgttccgacc ctgccgctta ccggatacct 7560gtccgccttt ctcccttcgg gaagcgtggc gctttctcaa tgctcacgct gtaggtatct 7620cagttcggtg taggtcgttc gctccaagct gggctgtgtg cacgaacccc ccgttcagcc 7680cgaccgctgc gccttatccg gtaactatcg tcttgagtcc aacccggtaa gacacgactt 7740atcgccactg gcagcagcca ctggtaacag gattagcaga gcgaggtatg taggcggtgc 7800tacagagttc ttgaagtggt ggcctaacta cggctacact agaaggacag tatttggtat 7860ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt gatccggcaa 7920acaaaccacc gctggtagcg gtggtttttt tgtttgcaag cagcagatta cgcgcagaaa 7980aaaaggatct caagaagatc ctttgatctt ttctacgggg tctgacgctc agtggaacga 8040aaactcacgt taagggattt tggtcatgag attatcaaaa aggatcttca cctagatcct 8100tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa cttggtctga 8160cagttaccaa tgcttaatca gtgaggcacc tatctcagcg atctgtctat ttcgttcatc 8220catagttgcc tgactccccg tcgtgtagat aactacgata cgggagggct taccatctgg 8280ccccagtgct gcaatgatac cgcgagaccc acgctcaccg gctccagatt tatcagcaat 8340aaaccagcca gccggaaggg ccgagcgcag aagtggtcct gcaactttat ccgcctccat 8400ccagtctatt aattgttgcc gggaagctag agtaagtagt tcgccagtta atagtttgcg 8460caacgttgtt gccattgcta caggcatcgt ggtgtcacgc tcgtcgtttg gtatggcttc 8520attcagctcc ggttcccaac gatcaaggcg agttacatga tcccccatgt tgtgcaaaaa 8580agcggttagc tccttcggtc ctccgatcgt tgtcagaagt aagttggccg cagtgttatc 8640actcatggtt atggcagcac tgcataattc tcttactgtc atgccatccg taagatgctt 8700ttctgtgact ggtgagtact caaccaagtc attctgagaa tagtgtatgc ggcgaccgag 8760ttgctcttgc ccggcgtcaa tacgggataa taccgcgcca catagcagaa ctttaaaagt 8820gctcatcatt ggaaaacgtt cttcggggcg aaaactctca aggatcttac cgctgttgag 8880atccagttcg atgtaaccca ctcgtgcacc caactgatct tcagcatctt ttactttcac 8940cagcgtttct gggtgagcaa aaacaggaag gcaaaatgcc gcaaaaaagg gaataagggc 9000gacacggaaa tgttgaatac tcatactctt cctttttcaa tattattgaa gcatttatca 9060gggttattgt ctcatgagcg gatacatatt tgaatgtatt tagaaaaata aacaaatagg 9120ggttccgcgc acatttcccc gaaaagtgcc acctgacgtc taagaaacca ttattatcat 9180gacattaacc tataaaaata ggcgtatcac gaggcccttt cgtc 922418476PRTArtificial SequenceHLA-gBE 18Met Ala Pro Arg Thr Leu Phe Leu Leu Leu Ser Gly Ala Leu Thr Leu 1 5 10 15 Thr Glu Ser Gly Ile Gln Arg Thr Pro Lys Ile Gln Val Tyr Ser Arg 20 25 30 His Pro Ala Glu Asn Gly Lys Ser Asn Phe Leu Asn Cys Tyr Val Ser 35 40 45 Gly Phe His Pro Ser Asp Ile Glu Val Asp Leu Leu Lys Asn Gly Glu 50 55 60 Arg Ile Glu Lys Val Glu His Ser Asp Leu Ser Phe Ser Lys Asp Trp 65 70 75 80 Ser Phe Tyr Leu Leu Tyr Tyr Thr Glu Phe Thr Pro Thr Glu Lys Asp 85 90 95 Glu Tyr Ala Cys Arg Val Asn His Val Thr Leu Ser Gln Pro Lys Ile 100 105 110 Val Lys Trp Asp Arg Asp Met Gly Gly Gly Gly Ser Gly Gly Gly Gly 115 120 125 Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Ser His Ser Leu 130 135 140 Lys Tyr Phe His Thr Ser Val Ser Arg Pro Gly Arg Gly Glu Pro Arg 145 150 155 160 Phe Ile Ser Val Gly Tyr Val Asp Asp Thr Gln Phe Val Arg Phe Asp 165 170 175 Asn Asp Ala Ala Ser Pro Arg Met Val Pro Arg Ala Pro Trp Met Glu 180 185 190 Gln Glu Gly Ser Glu Tyr Trp Asp Arg Glu Thr Arg Ser Ala Arg Asp 195 200 205 Thr Ala Gln Ile Phe Arg Val Asn Leu Arg Thr Leu Arg Gly Tyr Tyr 210 215 220 Asn Gln Ser Glu Ala Gly Ser His Thr Leu Gln Trp Met His Gly Cys 225 230 235 240 Glu Leu Gly Pro Asp Gly Arg Phe Leu Arg Gly Tyr Glu Gln Phe Ala 245 250 255 Tyr Asp Gly Lys Asp Tyr Leu Thr Leu Asn Glu Asp Leu Arg Ser Trp 260 265 270 Thr Ala Val Asp Thr Ala Ala Gln Ile Ser Glu Gln Lys Ser Asn Asp 275 280 285 Ala Ser Glu Ala Glu His Gln Arg Ala Tyr Leu Glu Asp Thr Cys Val 290 295 300 Glu Trp Leu His Lys Tyr Leu Glu Lys Gly Lys Glu Thr Leu Leu His 305 310 315 320 Leu Glu Pro Pro Lys Thr His Val Thr His His Pro Ile Ser Asp His 325 330 335 Glu Ala Thr Leu Arg Cys Trp Ala Leu Gly Phe Tyr Pro Ala Glu Ile 340 345 350 Thr Leu Thr Trp Gln Gln Asp Gly Glu Gly His Thr Gln Asp Thr Glu 355 360 365 Leu Val Glu Thr Arg Pro Ala Gly Asp Gly Thr Phe Gln Lys Trp Ala 370 375 380 Ala Val Val Val Pro Ser Gly Glu Glu Gln Arg Tyr Thr Cys His Val 385 390 395 400 Gln His Glu Gly Leu Pro Glu Pro Val Thr Leu Arg Trp Lys Pro Ala 405 410 415 Ser Gln Pro Thr Ile Pro Ile Val Gly Ile Ile Ala Gly Leu Val Leu 420 425 430 Leu Gly Ser Val Val Ser Gly Ala Val Val Ala Ala Val Ile Trp Arg 435 440 445 Lys Lys Ser Ser Gly Gly Lys Gly Gly Ser Tyr Ser Lys Ala Glu Trp 450 455 460 Ser Asp Ser Ala Gln Gly Ser Glu Ser His Ser Leu 465 470 475 199287DNAArtificial SequenceVector sequence for HLA-bGBE 19ctcgaggagc ttggcccatt gcatacgttg tatccatatc ataatatgta catttatatt 60ggctcatgtc caacattacc gccatgttga cattgattat tgactagtta ttaatagtaa 120tcaattacgg ggtcattagt tcatagccca tatatggagt tccgcgttac ataacttacg 180gtaaatggcc cgcctggctg accgcccaac gacccccgcc cattgacgtc aataatgacg 240tatgttccca tagtaacgcc aatagggact ttccattgac gtcaatgggt ggagtattta 300cgctaaactg cccacttggc agtacatcaa gtgtatcata tgccaagtac gccccctatt 360gacgtcaatg acggtaaatg gcccgcctgg cattatgccc agtacatgac cttatgggac 420tttcctactt ggcagtacat ctacgtatta gtcatcgcta ttaccatggt gatgcggttt 480tggcagtaca tcaatgggcg tggatagcgg tttgactcac ggggatttcc aagtctccac 540cccattgacg tcaatgggag tttgttttgg caccaaaatc aacgggactt tccaaaatgt 600cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc gtgtacggtg ggaggtctat 660ataagcagag cttctagatt gtacgggagc tcttcactac tcgctgcgtc gagagtgtac 720gagactctcc aggtttggta agaaatattt tatattgtta taatgttact atgatccatt 780aacactctgc ttatagattg taagggtgat tgcaatgctt tctgcataaa actttggttt 840tcttgttaat caataaaccg acttgattcg agaacctact catatattat tgtctctttt 900atactttatt aagtaaaagg atttgtatat tagccttgct aagggagaca tctagtgata 960taagtgtgaa ctacacttat cttaaatgat gtaactcctt aggataatca atatacaaaa 1020ttccatgaca attggcgccc aacgtggggc tcgaatataa gtcgggttta tttgtaaatt 1080atccctaggg acctccgagc atagcgggag gcatataaaa gccaatagac aatggctagc 1140aggaagtaat gttgaagaat atgaacttga tgttgaagct ctggttgtaa ttttaagaga 1200tagaaatata ccaagaaatc ctttacatgg agaagttata ggtcttcgcc ttactgaagg 1260atggtgggga caaattgaga gatttcagat ggtacgttga tctaaggcta tggatttggc 1320catgggacaa gaaatattag tttatagtcc cattgtatct atgactaaaa tacaaaaaac 1380tccactacca gaaagaaaag ctttacccat tagatggata acatggatga cttatttaga 1440agatccaaga atccaatttc attatgataa aaccttacca gaacttaagc atattccaga 1500tgtatataca tctagtcagt ctcctgttaa acatccttct caatatgaag gagtgtttta 1560tactgatggc tcggccatca aaagtcctga tcctacaaaa agcaataatg ctggcatggg 1620aatagtacat gccacataca aacctgaata tcaagttttg aatcaatggt caataccact 1680aggtaatcat actgctcaga tggctgaaat agctgcagtt gaatttgcct gtaaaaaagc 1740tttaaaaata cctggtcctg tattagttat aactgatagt ttctatgtag cagaaagtgc 1800taataaagaa ttaccatact ggaaatctaa tgggtttgtt aataataaga aaaagcctct 1860taaacatatc tccaaatgga aatctattgc tgagtgttta tctatgaaac cagacattac 1920tattcaacat gaaaaaggca tcagcctaca aataccagta ttcatactga aaggcaatgc 1980cctagcagat aagcttgcca cccaaggaag ttatgtggtt aattgtaata ccaaaaaacc 2040aaacctggat gcagagttgg atcaattatt acagggtcat tatataaaag gatatcccaa 2100acaatataca tattttttag aagatggcaa agtaaaagtt tccagacctg aaggggttaa 2160aattattccc cctcagtcag acagacaaaa aattgtgctt caagcccaca atttggctca 2220caccggacgt gaagccactc ttttaaaaat tgccaacctt tattggtggc caaatatgag 2280aaaggatgtg gttaaacaac taggacgctg tcaacagtgt ttaatcacaa atgcttccaa 2340caaagcctct ggtcctattc taagaccaga taggcctcaa aaaccttttg ataaattctt 2400tattgactat attggacctt tgccaccttc acagggatac ctatatgtat tagtagttgt 2460tgatggaatg acaggattca cttggttata ccccactaag gctccttcta ctagcgcaac 2520tgttaaatct ctcaatgtac tcactagtat tgcaattcca aaggtgattc actctgatca 2580aggtgcagca ttcacttctt caacctttgc tgaatgggca aaggaaagag gtatacattt 2640ggaattcagt actccttatc acccccaaag tggtagtaag gtggaaagga aaaatagtga 2700tataaaacga cttttaacta aactgctagt aggaagaccc acaaagtggt atgacctatt 2760gcctgttgta caacttgctt taaacaacac ctatagccct gtattaaaat atactccaca 2820tcaactctta tttggtatag attcaaatac tccatttgca aatcaagata cacttgactt 2880gaccagagaa gaagaacttt ctcttttaca ggaaattcgt acttctttat accatccatc 2940cacccctcca gcctcctctc gttcctggtc tcctgttgtt ggccaattgg tccaggagag 3000ggtggctagg cctgcttctt tgagacctcg ttggcataaa ccgtctactg tacttaaggt 3060gttgaatcca aggactgttg ttattttgga ccatcttggc aacaacagaa ctgtaagtat 3120agataattta aaacctactt ctcatcagaa tggcaccacc aatgacactg caacaatgga 3180tcatttggaa aaaaatgaat aaagcgcatg aggcacttca aaatacaaca actgtgactg 3240aacagcagaa ggaacaaatt atactggaca ttcaaaatga agaagtacaa ccaactagga 3300gagataaatt tagatatctg ctttatactt gttgtgctac tagctcaaga gtattggcct 3360ggatgttttt agtttgtata ttgttaatca ttgttttggt ttcatgcttt gtgactatat 3420ccagaataca atggaataag gatattcagg tattaggacc tgtaatagac tggaatgtta 3480ctcaaagagc tgtttatcaa cccttacaga ctagaaggat tgcacgttcc cttagaatgc 3540agcatcctgt tccaaaatat gtggaggtaa atatgactag tattccacaa ggtgtatact 3600atgaacccca tccggcctcg acggtatcga tggtaccggt atcgataagc ttgataacct 3660cgagagatct aattctaccg ggtaggggag gcgcttttcc caaggcagtc tggagcatgc 3720gctttagcag ccccgctggc acttggcgct acacaagtgg cctctggcct cgcacacatt 3780ccacatccac cggtagcgcc aaccggctcc gttctttggt ggccccttcg cgccacttct 3840actcctcccc tagtcaggaa gtttccccca gcaagctcgc gtcgtgcagg acgtgacaaa 3900tggaagtagc acgtctcact agtctcgtgc agatggacag caccgctgag caatggaagc 3960gggtaggcct ttggggcagc ggccaatagc agctttgttc cttcgctttc tgggctcaga 4020ggctgggaag gggtgggtcc gggggcgggc tcaggggcgg gctcaggggc gggcgggcgc 4080ccgaaggtcc tcccgaggcc cggcattctg cacgcttcaa aagcgcacgt ctgccgcgct 4140gttctcctct tcctcatctc cgggcctttc gacctgcagc ccgggggatc gatcctaggt 4200gatttaaatc caccatgacc gagtacaagc ccacggtgcg cctcgccacc cgcgacgacg 4260tcccccgggc cgtacgcacc ctcgccgccg cgttcgccga ctaccccgcc acgcgccaca 4320ccgtcgaccc ggaccgccac atcgagcggg tcaccgagct gcaagaactc ttcctcacgc 4380gcgtcgggct cgacatcggc aaggtgtggg tcgcggacga cggcgccgcg gtggcggtct 4440ggaccacgcc ggagagcgtc gaagcggggg cggtgttcgc cgagatcggc ccgcgcatgg 4500ccgagttgag cggttcccgg ctggccgcgc agcaacagat ggaaggcctc ctggcgccgc 4560accggcccaa ggagcccgcg tggttcctgg ccaccgtcgg cgtctcgccc gaccaccagg 4620gcaagggtct gggcagcgcc gtcgtgctcc ccggagtgga ggcggccgag cgcgccgggg 4680tgcccgcctt cctggagacc tccgcgcccc gcaacctccc cttctacgag cggctcggct 4740tcaccgtcac cgccgacgtc gagtgcccga aggaccgcgc gacctggtgc atgacccgca 4800agcccggtgc ctgacggatc catcgctccg gtgcccgtca gtgggcagag cgcacatcgc 4860ccacagtccc cgagaagttg gggggagggg tcggcaattg aaccggtgcc tagagaaggt 4920ggcgcggggt aaactgggaa agtgatgtcg tgtactggct ccgccttttt cccgagggtg 4980ggggagaacc gtatataagt gcagtagtcg ccgtgaacgt tctttttcgc aacgggtttg 5040ccgccagaac acagctgcgg ccgcgttaac catgagccga tccgtggcac tggctgtcct 5100ggctctgctg tctctgagtg gcctggaagc agtgatggcc cctagaacac tgttcctggg 5160cggaggcggc tccggaggag gagggtctgg aggcggggga agtatccagc ggactcccaa 5220gattcaggtc tacagcagac accctgccga aaacgggaaa tccaacttcc tgaattgcta 5280tgtgtcaggc tttcatccca gcgacatcga ggtcgatctg ctgaagaatg gcgagcggat 5340tgaaaaagtg gagcactctg acctgtcatt cagcaaggat tggagctttt acctgctgta 5400ctatactgag tttaccccaa cagaaaaaga cgagtatgcc tgtagggtga accatgtcac 5460cctgagtcag cccaagatcg tgaaatggga ccgcgatatg ggcgggggag gctccggggg 5520aggcggctcc ggcggcgggg gaagtggcgg gggaggatcc ggttctcact ccttgaagta 5580tttccacact tccgtgtccc ggcccggccg cggggagccc cgcttcatct ctgtgggcta 5640cgtggacgac acccagttcg tgcgcttcga caacgacgcc gcgagtccga ggatggtgcc 5700gcgggcgccg tggatggagc aggaggggtc agagtattgg gaccgggaga cacggagcgc 5760cagggacacc gcacagattt tccgagtgaa tctgcggacg ctgcgcggct actacaatca 5820gagcgaggcc gggtctcaca ccctgcagtg gatgcatggc tgcgagctgg ggcccgacgg 5880gcgcttcctc cgcgggtatg aacagttcgc ctacgacggc aaggattatc tcaccctgaa 5940tgaggacctg cgctcctgga ccgcggtgga cacggcggct cagatctccg agcaaaagtc 6000aaatgatgcc tctgaggcgg agcaccagag agcctacctg gaagacacat gcgtggagtg 6060gctccacaaa tacctggaga aggggaagga gacgctgctt cacctggagc ccccaaagac 6120acacgtgact caccacccca tctctgacca tgaggccacc ctgaggtgct gggccctggg 6180cttctaccct gcggagatca cactgacctg gcagcaggat ggggagggcc atacccagga 6240cacggagctc gtggagacca ggcctgcagg ggatggaacc ttccagaagt gggcagctgt 6300ggtggtgcct tctggagagg agcagagata cacgtgccat gtgcagcatg aggggctacc 6360cgagcccgtc accctgagat ggaagccggc ttcccagccc accatcccca tcgtgggcat 6420cattgctggc ctggttctcc ttggatctgt ggtctctgga gctgtggttg ctgctgtgat 6480atggaggaag aagagctcag gtggaaaagg agggagctac tctaaggctg agtggagcga 6540cagtgcccag gggtctgagt ctcacagctt gtaatctaga gcggccgcga ctctagatca 6600ggcgcgccgt taccaagcag ctatggaagc ttatggacct cagagaggaa gtaacgagga 6660gagggtgtgg tggaatgtca ctagaaacca gggaaaacaa ggaggagagt attacaggga 6720aggaggtgaa gaacctcatt acccaaatac tcctgctcct catagacgta cctgggatga 6780gagacacaag gttcttaaat tgtcctcatt cgctactccc tctgacatcc aacgctggac 6840tactaactct agattgtacg ggaggctctt cactactcgc tgcgtcgaga gtgtacgaga 6900ctctccaggt ttggtaagaa atattttata ttgttataat gttactatga tccattaaca 6960ctctgcttat agattgtaag ggtgattgca atgctttctg cataaaactt tggttttctt 7020gttaatcaat aaaccgactt gattcgagaa ccaactccta tattattgtc tcttttatac 7080tttattaagt aaaaggattt gtatattagc cttgctaagg gagacatcta gtgatataag 7140tgtgaactac acttatctta aatgatgtaa ctccttagga taatcaatat acaaaattcc 7200atgacaattg gcgataccgt cgaccgttca gctgcattaa tgaatcggcc aacgcgcggg 7260gagaggcggt ttgcgtattg ggcgctcttc cgcttcctcg ctcactgact cgctgcgctc 7320ggtcgttcgg ctgcggcgag cggtatcagc tcactcaaag gcggtaatac ggttatccac 7380agaatcaggg gataacgcag gaaagaacat gtgagcaaaa ggccagcaaa aggccaggaa 7440ccgtaaaaag gccgcgttgc tggcgttttt ccataggctc cgcccccctg acgagcatca 7500caaaaatcga cgctcaagtc agaggtggcg aaacccgaca ggactataaa gataccaggc 7560gtttccccct ggaagctccc tcgtgcgctc tcctgttccg accctgccgc ttaccggata 7620cctgtccgcc tttctccctt cgggaagcgt ggcgctttct caatgctcac gctgtaggta 7680tctcagttcg gtgtaggtcg ttcgctccaa gctgggctgt gtgcacgaac cccccgttca 7740gcccgaccgc tgcgccttat ccggtaacta tcgtcttgag tccaacccgg taagacacga 7800cttatcgcca ctggcagcag ccactggtaa caggattagc agagcgaggt atgtaggcgg 7860tgctacagag ttcttgaagt ggtggcctaa ctacggctac actagaagga cagtatttgg 7920tatctgcgct ctgctgaagc cagttacctt cggaaaaaga gttggtagct cttgatccgg 7980caaacaaacc accgctggta gcggtggttt ttttgtttgc aagcagcaga ttacgcgcag 8040aaaaaaagga tctcaagaag atcctttgat cttttctacg gggtctgacg ctcagtggaa 8100cgaaaactca cgttaaggga ttttggtcat gagattatca aaaaggatct tcacctagat 8160ccttttaaat taaaaatgaa gttttaaatc aatctaaagt atatatgagt

aaacttggtc 8220tgacagttac caatgcttaa tcagtgaggc acctatctca gcgatctgtc tatttcgttc 8280atccatagtt gcctgactcc ccgtcgtgta gataactacg atacgggagg gcttaccatc 8340tggccccagt gctgcaatga taccgcgaga cccacgctca ccggctccag atttatcagc 8400aataaaccag ccagccggaa gggccgagcg cagaagtggt cctgcaactt tatccgcctc 8460catccagtct attaattgtt gccgggaagc tagagtaagt agttcgccag ttaatagttt 8520gcgcaacgtt gttgccattg ctacaggcat cgtggtgtca cgctcgtcgt ttggtatggc 8580ttcattcagc tccggttccc aacgatcaag gcgagttaca tgatccccca tgttgtgcaa 8640aaaagcggtt agctccttcg gtcctccgat cgttgtcaga agtaagttgg ccgcagtgtt 8700atcactcatg gttatggcag cactgcataa ttctcttact gtcatgccat ccgtaagatg 8760cttttctgtg actggtgagt actcaaccaa gtcattctga gaatagtgta tgcggcgacc 8820gagttgctct tgcccggcgt caatacggga taataccgcg ccacatagca gaactttaaa 8880agtgctcatc attggaaaac gttcttcggg gcgaaaactc tcaaggatct taccgctgtt 8940gagatccagt tcgatgtaac ccactcgtgc acccaactga tcttcagcat cttttacttt 9000caccagcgtt tctgggtgag caaaaacagg aaggcaaaat gccgcaaaaa agggaataag 9060ggcgacacgg aaatgttgaa tactcatact cttccttttt caatattatt gaagcattta 9120tcagggttat tgtctcatga gcggatacat atttgaatgt atttagaaaa ataaacaaat 9180aggggttccg cgcacatttc cccgaaaagt gccacctgac gtctaagaaa ccattattat 9240catgacatta acctataaaa ataggcgtat cacgaggccc tttcgtc 928720500PRTArtificial SequenceHLA-bGBE 20Met Ser Arg Ser Val Ala Leu Ala Val Leu Ala Leu Leu Ser Leu Ser 1 5 10 15 Gly Leu Glu Ala Val Met Ala Pro Arg Thr Leu Phe Leu Gly Gly Gly 20 25 30 Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Ile Gln Arg Thr 35 40 45 Pro Lys Ile Gln Val Tyr Ser Arg His Pro Ala Glu Asn Gly Lys Ser 50 55 60 Asn Phe Leu Asn Cys Tyr Val Ser Gly Phe His Pro Ser Asp Ile Glu 65 70 75 80 Val Asp Leu Leu Lys Asn Gly Glu Arg Ile Glu Lys Val Glu His Ser 85 90 95 Asp Leu Ser Phe Ser Lys Asp Trp Ser Phe Tyr Leu Leu Tyr Tyr Thr 100 105 110 Glu Phe Thr Pro Thr Glu Lys Asp Glu Tyr Ala Cys Arg Val Asn His 115 120 125 Val Thr Leu Ser Gln Pro Lys Ile Val Lys Trp Asp Arg Asp Met Gly 130 135 140 Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly 145 150 155 160 Gly Gly Ser Gly Ser His Ser Leu Lys Tyr Phe His Thr Ser Val Ser 165 170 175 Arg Pro Gly Arg Gly Glu Pro Arg Phe Ile Ser Val Gly Tyr Val Asp 180 185 190 Asp Thr Gln Phe Val Arg Phe Asp Asn Asp Ala Ala Ser Pro Arg Met 195 200 205 Val Pro Arg Ala Pro Trp Met Glu Gln Glu Gly Ser Glu Tyr Trp Asp 210 215 220 Arg Glu Thr Arg Ser Ala Arg Asp Thr Ala Gln Ile Phe Arg Val Asn 225 230 235 240 Leu Arg Thr Leu Arg Gly Tyr Tyr Asn Gln Ser Glu Ala Gly Ser His 245 250 255 Thr Leu Gln Trp Met His Gly Cys Glu Leu Gly Pro Asp Gly Arg Phe 260 265 270 Leu Arg Gly Tyr Glu Gln Phe Ala Tyr Asp Gly Lys Asp Tyr Leu Thr 275 280 285 Leu Asn Glu Asp Leu Arg Ser Trp Thr Ala Val Asp Thr Ala Ala Gln 290 295 300 Ile Ser Glu Gln Lys Ser Asn Asp Ala Ser Glu Ala Glu His Gln Arg 305 310 315 320 Ala Tyr Leu Glu Asp Thr Cys Val Glu Trp Leu His Lys Tyr Leu Glu 325 330 335 Lys Gly Lys Glu Thr Leu Leu His Leu Glu Pro Pro Lys Thr His Val 340 345 350 Thr His His Pro Ile Ser Asp His Glu Ala Thr Leu Arg Cys Trp Ala 355 360 365 Leu Gly Phe Tyr Pro Ala Glu Ile Thr Leu Thr Trp Gln Gln Asp Gly 370 375 380 Glu Gly His Thr Gln Asp Thr Glu Leu Val Glu Thr Arg Pro Ala Gly 385 390 395 400 Asp Gly Thr Phe Gln Lys Trp Ala Ala Val Val Val Pro Ser Gly Glu 405 410 415 Glu Gln Arg Tyr Thr Cys His Val Gln His Glu Gly Leu Pro Glu Pro 420 425 430 Val Thr Leu Arg Trp Lys Pro Ala Ser Gln Pro Thr Ile Pro Ile Val 435 440 445 Gly Ile Ile Ala Gly Leu Val Leu Leu Gly Ser Val Val Ser Gly Ala 450 455 460 Val Val Ala Ala Val Ile Trp Arg Lys Lys Ser Ser Gly Gly Lys Gly 465 470 475 480 Gly Ser Tyr Ser Lys Ala Glu Trp Ser Asp Ser Ala Gln Gly Ser Glu 485 490 495 Ser His Ser Leu 500 217159DNAArtificial SequenceVector sequence for pA2-B2METKNpA 21ctgcgcgctc gctcgctcac tgaggccgcc cgggcaaagc ccgggcgtcg ggcgaccttt 60ggtcgcccgg cctcagtgag cgagcgagcg cgcagagagg gagtggccaa ctccatcact 120aggggttcct gcggccgcga tgcagtccaa actctcacta aaattgccga gccctttgtc 180ttccagtgtc taaaatatta atgtcaatgg aatcaggcca gagtttgaat tctagtctct 240tagcctttgt ttcccctgtc cataaaatga atgggggtaa ttctttcctc ctacagttta 300tttatatatt cactaattca ttcattcatc catccattcg ttcattcggt ttactgagta 360cctactatgt gccagcccct gttctagggt ggaaactaag agaatgatgt acctagaggg 420cgctggaagc tctaaagccc tagcagttac tgcttttact attagtggtc gtttttttct 480cccccccgcc ccccgacaaa tcaacagaac aaagaaaatt acctaaacag caaggacata 540gggaggaact tcttggcaca gaactttcca aacacttttt cctgaaggga tacaagaagc 600aagaaaggta ctctttcact aggaccttct ctgagctgtc ctcaggatgc ttttgggact 660atttttctta cccagagaat ggagaaaccc tgcagggaat tcccaagctg tagttataaa 720cagaagttct ccttctgcta ggtagcattc aaagatctta atcttctggg tttccgtttt 780ctcgaatgaa aaatgcaggt ccgagcagtt aactggctgg ggcaccatta gcaagtcact 840tagcatctct ggggccagtc tgcaaagcga gggggcagcc ttaatgtgcc tccagcctga 900agtcctagaa tgagcgcccg gtgtcccaag ctggggcgcg caccccagat cggagggcgc 960cgatgtacag acagcaaact cacccagtct agtgcatgcc ttcttaaaca tcacgagact 1020ctaagaaaag gaaactgaaa acgggaaagt ccctctctct aacctggcac tgcgtcgctg 1080gcttggagac aggtgacggt ccctgcgggc cttgtcctga ttggctgggc acgcgtttaa 1140tataagtgga ggcgtcgcgc tggcgggcat tcctgaagct gacagcattc gggccgagat 1200ctagataact tcgtataatg tatgctatac gaagttatgg atccatcgat tctgcgatcg 1260ctccggtgcc cgtcagtggg cagagcgcac atcgcccaca gtccccgaga agttgggggg 1320aggggtcggc aattgaaccg gtgcctagag aaggtggcgc ggggtaaact gggaaagtga 1380tgtcgtgtac tggctccgcc tttttcccga gggtggggga gaaccgtata taagtgcagt 1440agtcgccgtg aacgttcttt ttcgcaacgg gtttgccgcc agaacacagc tgaagcttcc 1500accatgccca cgctactgcg ggtttatata gacggtcccc acgggatggg gaaaaccacc 1560accacgcaac tgctggtggc cctgggttcg cgcgacgata tcgtctacgt acccgagccg 1620atgacttact ggcgggtgct gggggcttcc gagacaatcg cgaacatcta caccacacaa 1680caccgcctcg accagggtga gatatcggcc ggggacgcgg cggtggtaat gacaagcgcc 1740cagataacaa tgggcatgcc ttatgccgtg accgacgccg ttctggctcc tcatatcggg 1800ggggaggctg ggagctcaca tgccccgccc ccggccctca ccctcatctt cgaccgccat 1860cccatcgccg ccctcctgtg ctacccggcc gcgcggtacc ttatgggcag catgaccccc 1920caggccgtgc tggcgttcgt ggccctcatc ccgccgacct tgcccggcac caacatcgtg 1980cttggggccc ttccggagga cagacacatc gaccgcctgg ccaaacgcca gcgccccggc 2040gagcggctgg acctggctat gctggctgcg attcgccgcg tttacgggct acttgccaat 2100acggtgcggt atctgcagtg cggcgggtcg tggcgggagg actggggaca gctttcgggg 2160acggccgtgc cgccccaggg tgccgagccc cagagcaacg cgggcccacg accccatatc 2220ggggacacgt tatttaccct gtttcgggcc cccgagttgc tggcccccaa cggcgacctg 2280tataacgtgt ttgcctgggc cttggacgtc ttggccaaac gcctccgttc catgcacgtc 2340tttatcctgg attacgacca atcgcccgcc ggctgccggg acgccctgct gcaacttacc 2400tccgggatgg tccagaccca cgtcaccacc cccggctcca taccgacgat atgcgacctg 2460gcgcgcacgt ttgcccggga gatgggatcg gccattgaac aagatggatt gcacgcaggt 2520tctccggccg cttgggtgga gaggctattc ggctatgact gggcacaaca gacaatcggc 2580tgctctgatg ccgccgtgtt ccggctgtca gcgcaggggc gcccggttct ttttgtcaag 2640accgacctgt ccggtgccct gaatgaactg caggacgagg cagcgcggct atcgtggctg 2700gccacgacgg gcgttccttg cgcagctgtg ctcgacgttg tcactgaagc gggaagggac 2760tggctgctat tgggcgaagt gccggggcag gatctcctgt catctcacct tgctcctgcc 2820gagaaagtat ccatcatggc tgatgcaatg cggcggctgc atacgcttga tccggctacc 2880tgcccattcg accaccaagc gaaacatcgc atcgagcgag cacgtactcg gatggaagcc 2940ggtcttgtcg atcaggatga tctggacgaa gagcatcagg ggctcgcgcc agccgaactg 3000ttcgccaggc tcaaggcgcg catgcccgac ggcgaggatc tcgtcgtgac ccatggcgat 3060gcctgcttgc cgaatatcat ggtggaaaat ggccgctttt ctggattcat cgactgtggc 3120cggctgggtg tggcggaccg ctatcaggac atagcgttgg ctacccgtga tattgctgaa 3180gagcttggcg gcgaatgggc tgaccgcttc ctcgtgcttt acggtatcgc cgctcccgat 3240tcgcagcgca tcgccttcta tcgccttctt gacgagttct tctgagggtc tctagcataa 3300cttcgtataa tgtatgctat acgaagttat aataaaagat ccttattttc attggatctg 3360tgtgttggtt ttttgtgtgg gatccgtcga taccgtcgac tgtctcgctc cgtggcctta 3420gctgtgctcg cgctactctc tctttctggc ctggaggcta tccagcgtga gtctctccta 3480ccctcccgct ctggtccttc ctctcccgct ctgcaccctc tgtggccctc gctgtgctct 3540ctcgctccgt gacttccctt ctccaagttc tccttggtgg cccgccgtgg ggctagtcca 3600gggctggatc tcggggaagc ggcggggtgg cctgggagtg gggaaggggg tgcgcacccg 3660ggacgcgcgc tacttgcccc tttcggcggg gagcagggga gacctttggc ctacggcgac 3720gggagggtcg ggacaaagtt tagggcgtcg ataagcgtca gagcgccgag gttgggggag 3780ggtttctctt ccgctctttc gcggggcctc tggctccccc agcgcagctg gagtggggga 3840cgggtaggct cgtcccaaag gcgcggcgct gaggtttgtg aacgcgtgga ggggcgcttg 3900gggtctgggg gaggcgtcgc ccgggtaagc ctgtctgctg cggctctgct tcccttagac 3960tggagagctg tggacttcgt ctaggcgccc gctaagttcg catgtcctag cacctctggg 4020tctatgtggg gccacaccgt ggggaggaaa cagcacgcga cgtttgtaga atgcttggct 4080gtgatacaaa gcggtttcga ataattaact tatttgttcc catcacatgt cacttttaaa 4140aaattataag aactacccgt tattgacatc tttctgtgtg ccaaggactt tatgtgcttt 4200gcgtcattta attttgaaaa cagttatctt ccgccataga taactactat ggttatcttc 4260tgcctctcac agatgaagaa actaaggcac cgagatttta agaaacttaa ttacacaggg 4320gataaatggc agcaatcgag attgaagtca agcctaacca gggcttttgc gggagcgcat 4380gccttttggc tgtaattcgt gcgcggccgc aggaacccct agtgatggag ttggccactc 4440cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc cgacgcccgg 4500gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag ctgcctgcag gggcgcctga 4560tgcggtattt tctccttacg catctgtgcg gtatttcaca ccgcatacgt caaagcaacc 4620atagtacgcg ccctgtagcg gcgcattaag cgcggcgggt gtggtggtta cgcgcagcgt 4680gaccgctaca cttgccagcg ccctagcgcc cgctcctttc gctttcttcc cttcctttct 4740cgccacgttc gccggctttc cccgtcaagc tctaaatcgg gggctccctt tagggttccg 4800atttagtgct ttacggcacc tcgaccccaa aaaacttgat ttgggtgatg gttcacgtag 4860tgggccatcg ccctgataga cggtttttcg ccctttgacg ttggagtcca cgttctttaa 4920tagtggactc ttgttccaaa ctggaacaac actcaaccct atctcgggct attcttttga 4980tttataaggg attttgccga tttcggccta ttggttaaaa aatgagctga tttaacaaaa 5040atttaacgcg aattttaaca aaatattaac gtttacaatt ttatggtgca ctctcagtac 5100aatctgctct gatgccgcat agttaagcca gccccgacac ccgccaacac ccgctgacgc 5160gccctgacgg gcttgtctgc tcccggcatc cgcttacaga caagctgtga ccgtctccgg 5220gagctgcatg tgtcagaggt tttcaccgtc atcaccgaaa cgcgcgagac gaaagggcct 5280cgtgatacgc ctatttttat aggttaatgt catgataata atggtttctt agacgtcagg 5340tggcactttt cggggaaatg tgcgcggaac ccctatttgt ttatttttct aaatacattc 5400aaatatgtat ccgctcatga gacaataacc ctgataaatg cttcaataat attgaaaaag 5460gaagagtatg agtattcaac atttccgtgt cgcccttatt cccttttttg cggcattttg 5520ccttcctgtt tttgctcacc cagaaacgct ggtgaaagta aaagatgctg aagatcagtt 5580gggtgcacga gtgggttaca tcgaactgga tctcaacagc ggtaagatcc ttgagagttt 5640tcgccccgaa gaacgttttc caatgatgag cacttttaaa gttctgctat gtggcgcggt 5700attatcccgt attgacgccg ggcaagagca actcggtcgc cgcatacact attctcagaa 5760tgacttggtt gagtactcac cagtcacaga aaagcatctt acggatggca tgacagtaag 5820agaattatgc agtgctgcca taaccatgag tgataacact gcggccaact tacttctgac 5880aacgatcgga ggaccgaagg agctaaccgc ttttttgcac aacatggggg atcatgtaac 5940tcgccttgat cgttgggaac cggagctgaa tgaagccata ccaaacgacg agcgtgacac 6000cacgatgcct gtagcaatgg caacaacgtt gcgcaaacta ttaactggcg aactacttac 6060tctagcttcc cggcaacaat taatagactg gatggaggcg gataaagttg caggaccact 6120tctgcgctcg gcccttccgg ctggctggtt tattgctgat aaatctggag ccggtgagcg 6180tgggtctcgc ggtatcattg cagcactggg gccagatggt aagccctccc gtatcgtagt 6240tatctacacg acggggagtc aggcaactat ggatgaacga aatagacaga tcgctgagat 6300aggtgcctca ctgattaagc attggtaact gtcagaccaa gtttactcat atatacttta 6360gattgattta aaacttcatt tttaatttaa aaggatctag gtgaagatcc tttttgataa 6420tctcatgacc aaaatccctt aacgtgagtt ttcgttccac tgagcgtcag accccgtaga 6480aaagatcaaa ggatcttctt gagatccttt ttttctgcgc gtaatctgct gcttgcaaac 6540aaaaaaacca ccgctaccag cggtggtttg tttgccggat caagagctac caactctttt 6600tccgaaggta actggcttca gcagagcgca gataccaaat actgtccttc tagtgtagcc 6660gtagttaggc caccacttca agaactctgt agcaccgcct acatacctcg ctctgctaat 6720cctgttacca gtggctgctg ccagtggcga taagtcgtgt cttaccgggt tggactcaag 6780acgatagtta ccggataagg cgcagcggtc gggctgaacg gggggttcgt gcacacagcc 6840cagcttggag cgaacgacct acaccgaact gagataccta cagcgtgagc tatgagaaag 6900cgccacgctt cccgaaggga gaaaggcgga caggtatccg gtaagcggca gggtcggaac 6960aggagagcgc acgagggagc ttccaggggg aaacgcctgg tatctttata gtcctgtcgg 7020gtttcgccac ctctgacttg agcgtcgatt tttgtgatgc tcgtcagggg ggcggagcct 7080atggaaaaac gccagcaacg cggccttttt acggttcctg gccttttgct ggccttttgc 7140tcacatgtcc tgcaggcag 7159227199DNAArtificial SequenceVector sequence for pA2-B2MEHyTKpA 22cctgcaggca gctgcgcgct cgctcgctca ctgaggccgc ccgggcaaag cccgggcgtc 60gggcgacctt tggtcgcccg gcctcagtga gcgagcgagc gcgcagagag ggagtggcca 120actccatcac taggggttcc tgcggccgcg aaactaagag aatgatgtac ctagagggcg 180ctggaagctc taaagcccta gcagttactg cttttactat tagtggtcgt ttttttctcc 240cccccgcccc ccgacaaatc aacagaacaa agaaaattac ctaaacagca aggacatagg 300gaggaacttc ttggcacaga actttccaaa cactttttcc tgaagggata caagaagcaa 360gaaaggtact ctttcactag gaccttctct gagctgtcct caggatgctt ttgggactat 420ttttcttacc cagagaatgg agaaaccctg cagggaattc ccaagctgta gttataaaca 480gaagttctcc ttctgctagg tagcattcaa agatcttaat cttctgggtt tccgttttct 540cgaatgaaaa atgcaggtcc gagcagttaa ctggctgggg caccattagc aagtcactta 600gcatctctgg ggccagtctg caaagcgagg gggcagcctt aatgtgcctc cagcctgaag 660tcctagaatg agcgcccggt gtcccaagct ggggcgcgca ccccagatcg gagggcgccg 720atgtacagac agcaaactca cccagtctag tgcatgcctt cttaaacatc acgagactct 780aagaaaagga aactgaaaac gggaaagtcc ctctctctaa cctggcactg cgtcgctggc 840ttggagacag gtgacggtcc ctgcgggcct tgtcctgatt ggctgggcac gcgtttaata 900taagtggagg cgtcgcgctg gcgggcattc ctgaagctga cagcattcgg gccgagatct 960agataacttc gtataatgta tgctatacga agttatggat ccatcgattc tgcgatcgct 1020ccggtgcccg tcagtgggca gagcgcacat cgcccacagt ccccgagaag ttggggggag 1080gggtcggcaa ttgaaccggt gcctagagaa ggtggcgcgg ggtaaactgg gaaagtgatg 1140tcgtgtactg gctccgcctt tttcccgagg gtgggggaga accgtatata agtgcagtag 1200tcgccgtgaa cgttcttttt cgcaacgggt ttgccgccag aacacagctg aagcttccac 1260catgaaaaag cctgaactca ccgcgacgtc tgtcgagaag tttctgatcg aaaagttcga 1320cagcgtctcc gacctgatgc agctctcgga gggcgaagaa tctcgtgctt tcagcttcga 1380tgtaggaggg cgtggatatg tcctgcgggt aaatagctgc gccgatggtt tctacaaaga 1440tcgttatgtt tatcggcact ttgcatcggc cgcgctcccg attccggaag tgcttgacat 1500tggggaattc agcgagagcc tgacctattg catctcccgc cgtgcacagg gtgtcacgtt 1560gcaagacctg cctgaaaccg aactgcccgc tgttctgcag ccggtcgcgg aggccatgga 1620tgcgatcgct gcggccgatc ttagccagac gagcgggttc ggcccattcg gaccgcaagg 1680aatcggtcaa tacactacat ggcgtgattt catatgcgcg attgctgatc cccatgtgta 1740tcactggcaa actgtgatgg acgacaccgt cagtgcgtcc gtcgcgcagg ctctcgatga 1800gctgatgctt tgggccgagg actgccccga agtccggcac ctcgtgcacg cggatttcgg 1860ctccaacaat gtcctgacgg acaatggccg cataacagcg gtcattgact ggagcgaggc 1920gatgttcggg gattcccaat acgaggtcgc caacatcttc ttctggaggc cgtggttggc 1980ttgtatggag cagcagacgc gctacttcga gcggaggcat ccggagcttg caggatcgcc 2040gcggctccgg gcgtatatgc tccgcattgg tcttgaccaa ctctatcaga gcttggttga 2100cggcaatttc gatgatgcag cttgggcgca gggtcgatgc gacgcaatcg tccgatccgg 2160agccgggact gtcgggcgta cacaaatcgc ccgcagaagc gcggccgtct ggaccgatgg 2220ctgtgtagaa gtcgcgtctg cgttcgacca ggctgcgcgt tctcgcggcc atagcaaccg 2280acgtacggcg ttgcgccctc gccggcagca agaagccacg gaagtccgcc cggagcagaa 2340aatgcccacg ctactgcggg tttatataga cggtccccac gggatgggga aaaccaccac 2400cacgcaactg ctggtggccc tgggttcgcg cgacgatatc gtctacgtac ccgagccgat 2460gacttactgg cgggtgctgg gggcttccga gacaatcgcg aacatctaca ccacacaaca 2520ccgcctcgac cagggtgaga tatcggccgg ggacgcggcg gtggtaatga caagcgccca 2580gataacaatg ggcatgcctt atgccgtgac cgacgccgtt ctggctcctc atatcggggg 2640ggaggctggg agctcacatg ccccgccccc ggccctcacc ctcatcttcg accgccatcc 2700catcgccgcc ctcctgtgct acccggccgc gcggtacctt atgggcagca tgacccccca 2760ggccgtgctg gcgttcgtgg ccctcatccc gccgaccttg cccggcacca acatcgtgct 2820tggggccctt ccggaggaca gacacatcga ccgcctggcc aaacgccagc gccccggcga 2880gcggctggac ctggctatgc tggctgcgat tcgccgcgtt tacgggctac ttgccaatac 2940ggtgcggtat ctgcagtgcg gcgggtcgtg gcgggaggac tggggacagc tttcggggac 3000ggccgtgccg ccccagggtg ccgagcccca gagcaacgcg ggcccacgac cccatatcgg 3060ggacacgtta tttaccctgt ttcgggcccc cgagttgctg gcccccaacg gcgacctgta 3120taacgtgttt gcctgggcct tggacgtctt ggccaaacgc ctccgttcca tgcacgtctt 3180tatcctggat tacgaccaat cgcccgccgg ctgccgggac gccctgctgc aacttacctc 3240cgggatggtc cagacccacg tcaccacccc cggctccata ccgacgatat gcgacctggc 3300gcgcacgttt gcccgggaga tgggggaggc taactgaggt ctctagcata acttcgtata

3360atgtatgcta tacgaagtta taataaaaga tccttatttt cattggatct gtgtgttggt 3420tttttgtgtg ggatccgtcg ataccgtcga ctgtctcgct ccgtggcctt agctgtgctc 3480gcgctactct ctctttctgg cctggaggct atccagcgtg agtctctcct accctcccgc 3540tctggtcctt cctctcccgc tctgcaccct ctgtggccct cgctgtgctc tctcgctccg 3600tgacttccct tctccaagtt ctccttggtg gcccgccgtg gggctagtcc agggctggat 3660ctcggggaag cggcggggtg gcctgggagt ggggaagggg gtgcgcaccc gggacgcgcg 3720ctacttgccc ctttcggcgg ggagcagggg agacctttgg cctacggcga cgggagggtc 3780gggacaaagt ttagggcgtc gataagcgtc agagcgccga ggttggggga gggtttctct 3840tccgctcttt cgcggggcct ctggctcccc cagcgcagct ggagtggggg acgggtaggc 3900tcgtcccaaa ggcgcggcgc tgaggtttgt gaacgcgtgg aggggcgctt ggggtctggg 3960ggaggcgtcg cccgggtaag cctgtctgct gcggctctgc ttcccttaga ctggagagct 4020gtggacttcg tctaggcgcc cgctaagttc gcatgtccta gcacctctgg gtctatgtgg 4080ggccacaccg tggggaggaa acagcacgcg acgtttgtag aatgcttggc tgtgatacaa 4140agcggtttcg aataattaac ttatttgttc ccatcacatg tcacttttaa aaaattataa 4200gaactacccg ttattgacat ctttctgtgt gccaaggact ttatgtgctt tgcgtcattt 4260aattttgaaa acagttatct tccgccatag ataactacta tggttatctt ctgcctctca 4320cagatgaaga aactaaggca ccgagatttt aagaaactta attacacagg ggataaatgg 4380cagcaatcga gattgaagtc aagcctaacc agggcttttg cgggagcgca tgccttttgg 4440ctgtaattcg tgcgcggccg caggaacccc tagtgatgga gttggccact ccctctctgc 4500gcgctcgctc gctcactgag gccgggcgac caaaggtcgc ccgacgcccg ggctttgccc 4560gggcggcctc agtgagcgag cgagcgcgca gctgcctgca ggggcgcctg atgcggtatt 4620ttctccttac gcatctgtgc ggtatttcac accgcatacg tcaaagcaac catagtacgc 4680gccctgtagc ggcgcattaa gcgcggcggg tgtggtggtt acgcgcagcg tgaccgctac 4740acttgccagc gccctagcgc ccgctccttt cgctttcttc ccttcctttc tcgccacgtt 4800cgccggcttt ccccgtcaag ctctaaatcg ggggctccct ttagggttcc gatttagtgc 4860tttacggcac ctcgacccca aaaaacttga tttgggtgat ggttcacgta gtgggccatc 4920gccctgatag acggtttttc gccctttgac gttggagtcc acgttcttta atagtggact 4980cttgttccaa actggaacaa cactcaaccc tatctcgggc tattcttttg atttataagg 5040gattttgccg atttcggcct attggttaaa aaatgagctg atttaacaaa aatttaacgc 5100gaattttaac aaaatattaa cgtttacaat tttatggtgc actctcagta caatctgctc 5160tgatgccgca tagttaagcc agccccgaca cccgccaaca cccgctgacg cgccctgacg 5220ggcttgtctg ctcccggcat ccgcttacag acaagctgtg accgtctccg ggagctgcat 5280gtgtcagagg ttttcaccgt catcaccgaa acgcgcgaga cgaaagggcc tcgtgatacg 5340cctattttta taggttaatg tcatgataat aatggtttct tagacgtcag gtggcacttt 5400tcggggaaat gtgcgcggaa cccctatttg tttatttttc taaatacatt caaatatgta 5460tccgctcatg agacaataac cctgataaat gcttcaataa tattgaaaaa ggaagagtat 5520gagtattcaa catttccgtg tcgcccttat tccctttttt gcggcatttt gccttcctgt 5580ttttgctcac ccagaaacgc tggtgaaagt aaaagatgct gaagatcagt tgggtgcacg 5640agtgggttac atcgaactgg atctcaacag cggtaagatc cttgagagtt ttcgccccga 5700agaacgtttt ccaatgatga gcacttttaa agttctgcta tgtggcgcgg tattatcccg 5760tattgacgcc gggcaagagc aactcggtcg ccgcatacac tattctcaga atgacttggt 5820tgagtactca ccagtcacag aaaagcatct tacggatggc atgacagtaa gagaattatg 5880cagtgctgcc ataaccatga gtgataacac tgcggccaac ttacttctga caacgatcgg 5940aggaccgaag gagctaaccg cttttttgca caacatgggg gatcatgtaa ctcgccttga 6000tcgttgggaa ccggagctga atgaagccat accaaacgac gagcgtgaca ccacgatgcc 6060tgtagcaatg gcaacaacgt tgcgcaaact attaactggc gaactactta ctctagcttc 6120ccggcaacaa ttaatagact ggatggaggc ggataaagtt gcaggaccac ttctgcgctc 6180ggcccttccg gctggctggt ttattgctga taaatctgga gccggtgagc gtgggtctcg 6240cggtatcatt gcagcactgg ggccagatgg taagccctcc cgtatcgtag ttatctacac 6300gacggggagt caggcaacta tggatgaacg aaatagacag atcgctgaga taggtgcctc 6360actgattaag cattggtaac tgtcagacca agtttactca tatatacttt agattgattt 6420aaaacttcat ttttaattta aaaggatcta ggtgaagatc ctttttgata atctcatgac 6480caaaatccct taacgtgagt tttcgttcca ctgagcgtca gaccccgtag aaaagatcaa 6540aggatcttct tgagatcctt tttttctgcg cgtaatctgc tgcttgcaaa caaaaaaacc 6600accgctacca gcggtggttt gtttgccgga tcaagagcta ccaactcttt ttccgaaggt 6660aactggcttc agcagagcgc agataccaaa tactgtcctt ctagtgtagc cgtagttagg 6720ccaccacttc aagaactctg tagcaccgcc tacatacctc gctctgctaa tcctgttacc 6780agtggctgct gccagtggcg ataagtcgtg tcttaccggg ttggactcaa gacgatagtt 6840accggataag gcgcagcggt cgggctgaac ggggggttcg tgcacacagc ccagcttgga 6900gcgaacgacc tacaccgaac tgagatacct acagcgtgag ctatgagaaa gcgccacgct 6960tcccgaaggg agaaaggcgg acaggtatcc ggtaagcggc agggtcggaa caggagagcg 7020cacgagggag cttccagggg gaaacgcctg gtatctttat agtcctgtcg ggtttcgcca 7080cctctgactt gagcgtcgat ttttgtgatg ctcgtcaggg gggcggagcc tatggaaaaa 7140cgccagcaac gcggcctttt tacggttcct ggccttttgc tggccttttg ctcacatgt 71992329PRTArtificial SequenceHLA-G peptide 23Met Ala Pro Arg Thr Leu Phe Leu Gly Gly Gly Gly Ser Gly Gly Gly 1 5 10 15 Gly Ser Gly Gly Gly Gly Ser Ile Gln Arg Thr Pro Lys 20 25

* * * * *

References


uspto.report is an independent third-party trademark research tool that is not affiliated, endorsed, or sponsored by the United States Patent and Trademark Office (USPTO) or any other governmental organization. The information provided by uspto.report is based on publicly available data at the time of writing and is intended for informational purposes only.

While we strive to provide accurate and up-to-date information, we do not guarantee the accuracy, completeness, reliability, or suitability of the information displayed on this site. The use of this site is at your own risk. Any reliance you place on such information is therefore strictly at your own risk.

All official trademark data, including owner information, should be verified by visiting the official USPTO website at www.uspto.gov. This site is not intended to replace professional legal advice and should not be used as a substitute for consulting with a legal professional who is knowledgeable about trademark law.

© 2024 USPTO.report | Privacy Policy | Resources | RSS Feed of Trademarks | Trademark Filings Twitter Feed