Biomarker Specific To Brain/nerve Or Specific To Neuronal Differentiation WAKAMATSU; Ai ; et al. [ISOGAI; Takao]

Biomarker Specific To Brain/nerve Or Specific To Neuronal Differentiation

WAKAMATSU; Ai ; et al.

Patent Application Summary

U.S. patent application number 13/324610 was filed with the patent office on 2013-04-18 for biomarker specific to brain/nerve or specific to neuronal differentiation. This patent application is currently assigned to REVERSE PROTEOMICS RESEARCH INSTITUTE CO., LTD.. The applicant listed for this patent is Takao ISOGAI, Ai WAKAMATSU, Junichi YAMAMOTO. Invention is credited to Takao ISOGAI, Ai WAKAMATSU, Junichi YAMAMOTO.

Application Number	20130095107 13/324610
Document ID	/
Family ID	39759565
Filed Date	2013-04-18

United States Patent Application	20130095107
Kind Code	A1
WAKAMATSU; Ai ; et al.	April 18, 2013

BIOMARKER SPECIFIC TO BRAIN/NERVE OR SPECIFIC TO NEURONAL DIFFERENTIATION

Abstract

The invention provides a novel polypeptide and a specific partial peptide thereof, as well as a novel polynucleotide and a specific partial nucleotide thereof, that can be used as a biomarker specific for the brain/nerves or specific for nerve differentiation; an expression vector for such a polynucleotide and a specific partial peptide thereof; a transformant incorporating such an expression vector; an antisense molecule, RNAi-inducing nucleic acid (e.g., siRNA), aptamer, or antibody for such a biomarker, and a composition comprising the same; a mammalian cell or non-human mammal wherein the expression or a function of such a biomarker is regulated; a measuring means (e.g., primer set, nucleic acid probe, antibody, aptamer) for such a biomarker, and a reagent comprising the same and the like.

Inventors:

WAKAMATSU; Ai; (Tokyo, JP) ; YAMAMOTO; Junichi; (Sakura-shi, JP) ; ISOGAI; Takao; (Inashiki-gun, JP)

Applicant:

Name	City	State	Country	Type
WAKAMATSU; Ai YAMAMOTO; Junichi ISOGAI; Takao	Tokyo Sakura-shi Inashiki-gun		JP JP JP

Assignee:

REVERSE PROTEOMICS RESEARCH INSTITUTE CO., LTD.

Family ID:

39759565

Appl. No.:

13/324610

Filed:

December 13, 2011

Related U.S. Patent Documents


Application Number	Filing Date	Patent Number
12531430	Sep 15, 2009	8153764
PCT/JP2008/054575	Mar 13, 2008
13324610

Current U.S. Class:	424/139.1 ; 435/252.3; 435/252.31; 435/252.33; 435/254.2; 435/320.1; 435/331; 514/44R; 530/300; 530/387.3; 530/387.9; 536/23.1
Current CPC Class:	C12N 2310/14 20130101; A61K 38/00 20130101; A61P 25/00 20180101; C12N 2310/11 20130101; A61P 25/28 20180101; C07K 14/47 20130101; C12Q 2600/158 20130101; G01N 2800/28 20130101; C12N 15/113 20130101; C12Q 1/6883 20130101; G01N 33/6896 20130101; C12Q 2600/112 20130101
Class at Publication:	424/139.1 ; 530/300; 536/23.1; 435/320.1; 435/252.33; 530/387.9; 530/387.3; 435/331; 514/44.R; 435/252.31; 435/252.3; 435/254.2
International Class:	A61K 39/395 20060101 A61K039/395; C12N 15/11 20060101 C12N015/11; C12N 15/63 20060101 C12N015/63; C12N 1/21 20060101 C12N001/21; A61P 25/28 20060101 A61P025/28; C07K 16/00 20060101 C07K016/00; C12N 5/16 20060101 C12N005/16; A61K 31/7088 20060101 A61K031/7088; C12N 1/19 20060101 C12N001/19; C12N 5/10 20060101 C12N005/10; C07K 2/00 20060101 C07K002/00; C07H 21/00 20060101 C07H021/00

Foreign Application Data

Date	Code	Application Number
Mar 15, 2007	JP	2007-066430

Claims

1.-27. (canceled)

28. An isolated peptide comprising the amino acid sequence of SEQ ID NO: 60.

29. A polynucleotide that encodes the peptide of claim 28.

30. An expression vector comprising the polynucleotide of claim 29 and a promoter operably linked thereto.

31. A transformant incorporating the expression vector of claim 30.

32. An aptamer that binds the peptide of claim 28.

33. An antibody that binds the peptide of claim 28.

34. The antibody of claim 33, wherein the antibody is any one of the (i) to (iii) below: (i) a polyclonal antibody; (ii) a monoclonal antibody or a portion thereof; (iii) a chimeric antibody, a humanized antibody or a human antibody.

35. A cell that produces the antibody of claim 33.

36. The cell of claim 35, wherein the cell is a hybridoma.

37. A composition comprising (a) the antibody of claim 33, or an expression vector therefor, and (b) a pharmaceutically acceptable carrier.

38. A reagent or kit for detection or quantification of any one of the polypeptides encoded by the brain/nerve-specific genes 1 to 10, which reagent or kit comprises one or more antibodies of claim 33.

39. The peptide of claim 28 consisting of the amino acid sequence of SEQ ID NO: 60.

40. A polynucleotide that encodes the peptide of claim 39.

41. An expression vector comprising the polynucleotide of claim 40 and a promoter operably linked thereto.

42. A transformant incorporating the expression vector of claim 41.

43. An aptamer that binds the peptide of claim 39.

44. An antibody that binds the peptide of claim 39.

45. The antibody of claim 44, wherein the antibody is any one of the (i) to (iii) below: (i) a polyclonal antibody; (ii) a monoclonal antibody or a portion thereof; (iii) a chimeric antibody, a humanized antibody or a human antibody.

46. A cell that produces the antibody of claim 44.

47. The cell of claim 46, wherein the cell is a hybridoma.

48. A composition comprising (a) the antibody of claim 44, or an expression vector therefor, and (b) a pharmaceutically acceptable carrier.

49. A reagent or kit for detection or quantification of any one of the polypeptides encoded by the brain/nerve-specific genes 1 to 10, which reagent or kit comprise one or more antibodies of claim 44.

Description

TECHNICAL FIELD

[0001] The present invention provides a polypeptide and a partial peptide thereof, as well as a polynucleotide and a partial nucleotide thereof, that can be used as biomarkers specific for the brain/nerves or specific for nerve differentiation; an expression vector; a transformant; an antisense molecule, an RNAi-inducing nucleic acid (e.g., siRNA), an aptamer, an antibody, and a composition comprising them; a mammalian cell or a non-human mammal; a measuring means for a biomarker specific for the brain/nerves or specific for nerve differentiation (e.g., primer set, nucleic acid probe, antibody, aptamer), a measuring method and the like.

BACKGROUND ART

[0002] Although there have been remarkable advances in the analysis of human chromosome sequences thanks to the progress in human genome research, this does not mean that all the human genetic functions have been clarified. In humans, gene diversity is significantly associated with changes in gene functions. In fact, it is known that in humans, a plurality of mRNAs are transcribed from a particular region of a chromosome to produce different variants.

[0003] For the series of genes that have been discovered by the present inventors, and that can be used as biomarkers specific for the brain/nerves or specific for nerve differentiation (abbreviated as "brain/nerve-specific genes" or "brain/nerve-specific genes 1 to 10" as required), known variants have been reported. Examples of such known variants include known variants of brain/nerve-specific gene 1 (Genbank accession number: NM.sub.--133460.1; non-patent documents 1 and 2), brain/nerve-specific gene 2 (Genbank accession number: NM.sub.--005163.1; non-patent documents 3 and 4), brain/nerve-specific gene 3 (Genbank accession number: NM.sub.--181784.1; non-patent documents 5 and 6), brain/nerve-specific gene 4 (Genbank accession number: NM.sub.--003930.3; non-patent documents 7 and 8), brain/nerve-specific gene 5 (Genbank accession number: NM.sub.--000898.3; non-patent documents 9 and 10), brain/nerve-specific gene 6 (Genbank accession number: NM.sub.--005079.1; non-patent documents 11 and 12), brain/nerve-specific gene 7% (Genbank accession number: NM.sub.--001679.2; non-patent document 13 and 14), brain/nerve-specific gene 8 (Genbank accession number: NM.sub.--000431.1; non-patent documents 15 and 16), brain/nerve-specific gene 9 (Genbank accession number: NM.sub.--153449.2; non-patent document 17), and brain/nerve-specific gene 10 (Genbank accession number: NM.sub.--015009.1; non-patent documents 18 and 19).

[0004] However, it is not known that the brain/nerve-specific genes 1 to 10 can be useful as biomarkers specific for the brain/nerves or specific for nerve cell differentiation, and that the particular variants discovered by the present inventors exist in the brain/nerve-specific genes 1 to 10. [0005] [Non-patent document 1] Ota, T. et al., Nat. Genet. 36 (1), 40-45 (2004) [0006] [Non-patent document 2] Strausberg, R. L. et al., Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) [0007] [Non-patent document 3] Staal, S. P., Proc. Natl. Acad. Sci. U.S.A. 84 (14), 5034-5037 (1987) [0008] [Non-patent document 4] Staal, S. P. et al., Genomics 2 (1), 96-98 (1988) [0009] [Non-patent document 5] Wakioka, T. et al., Nature 412 (6847), 647-651 (2001) [0010] [Non-patent document 6] Kato, R. et al., Biochem. Biophys. Res. Commun. 302 (4), 767-772 (2003) [0011] [Non-patent document 7] Marie-Cardine, A. et al., FEES Lett. 435 (1), 55-60 (1998) [0012] [Non-patent document 8] Kouroku, Y. et al., Biochem. Biophys. Res. Commun. 252 (3), 738-742 (1998) [0013] [Non-patent document 9] Kochersperger, L. M. et al., J. Neurosci. Res. 16 (4), 601-616 (1986) [0014] [Non-patent document 10] Bach, A. W. et al., Proc. Natl. Acad. Sci. U.S.A. 85 (13), 4934-4938 (1988) [0015] [Non-patent document 11] Chen, S. L. et al., Oncogene 12 (4), 741-751 (1996) [0016] [Non-patent document 12] Byrne, J. A. et al., Genomics 35 (3), 523-532 (1996) [0017] [Non-patent document 13] Lingrel, J. B. et al., Prog. Nucleic Acid Res. Mol. Biol. 38, 37-89 (1990) [0018] [Non-patent document 14] Malik, N. et al., J. Biol. Chem. 271 (37), 22754-22758 (1996) [0019] [Non-patent document 15] Kopito, R. R. et al., Proc. Natl. Acad. Sci. U.S.A. 77 (10), 5738-5740 (1980) [0020] [Non-patent document 16] Schafer, B. L. et al., J. Biol. Chem. 267 (19), 13229-13238 (1992) [0021] [Non-patent document 17] Wu, X. et al., Genomics 80 (6), 553-557 (2002) [0022] [Non-patent document 18] Bach, I. et al., Nat. Genet. 22 (4), 394-399 (1999) [0023] [Non-patent document 19] Katoh, M. et al., Int. J. Mol. Med. 13 (4), 607-613 (2004)

DISCLOSURE OF THE INVENTION

Problems to be Solved by the Invention

[0024] Analyzing a biomarker specific for the brain/nerve cells or specific for nerve cell differentiation leads to the development of, for example, a reagent for nerve cell identification or nerve cell differentiation state determination, a diagnostic reagent for a disease based on a nerve cell disorder, a pharmaceutical for a disease based on a nerve cell disorder, having a new mechanism of action, and the like. Based on the findings obtained by expression profile analysis of specified genes, the present invention is directed to providing such reagents, pharmaceuticals and the like, and providing a means useful in developing such reagents, pharmaceuticals and the like.

Means of Solving the Problems

[0025] The present inventors conducted extensive investigations and discovered brain/nerve-specific genes 1 to 10 as biomarkers specific for the brain/nerves or specific for nerve cell differentiation. The present inventors also discovered novel variants of the brain/nerve-specific genes 1 to 10 that can be used as biomarkers specific for the brain/nerves or specific for nerve cell differentiation. Therefore, it is thought that by utilizing the brain/nerve-specific genes 1 to 10 and/or novel variants thereof, it will become possible to identify nerve cells, to determine nerve cell differentiation states, to diagnose a disease based on a nerve cell disorder, and the like. In particular, because the brain/nerve-specific genes 1 to 10 and/or novel variants thereof are expressed specifically in particular differentiation stages of nerve cells, the accuracy of the determination of nerve cells in the particular differentiation stages can be increased. It is also thought that by utilizing the brain/nerve-specific genes 1 to 10 and/or novel variants thereof, it will become possible to develop a novel pharmaceutical for a specified disease such as a disease based on a nerve cell disorder, and the like.

[0026] Based on the findings shown above, the present inventors developed the present invention.

[0027] Accordingly, the present invention relates to the following aspects and the like.

[1] A polypeptide of any one of 1) to 10) below or a specific partial peptide thereof: 1) a polypeptide having an amino acid sequence shown by SEQ ID NO:18 or SEQ ID NO:10 or substantially the same amino acid sequence thereas; 2) a polypeptide having the amino acid sequence shown by SEQ ID NO:43 or substantially the same amino acid sequence thereas; 3) a polypeptide having the amino acid sequence shown by SEQ ID NO:58 or substantially the same amino acid sequence thereas; 4) a polypeptide having the amino acid sequence shown by SEQ ID NO:74 or substantially the same amino acid sequence thereas; 5) a polypeptide having an amino acid sequence shown by SEQ ID NO:89 or SEQ ID NO:99 or substantially the same amino acid sequence thereas; 6) a polypeptide having the amino acid sequence shown by SEQ ID NO:118 or substantially the same amino acid sequence thereas; 7) a polypeptide having the amino acid sequence shown by SEQ ID NO:133 or substantially the same amino acid sequence thereas; 8) a polypeptide having an amino acid sequence shown by SEQ ID NO:152 or SEQ ID NO:159 or substantially the same amino acid sequence thereas; 9) a polypeptide having an amino acid sequence shown by SEQ ID NO:184 or SEQ ID NO:190 or substantially the same amino acid sequence thereas; and 10) a polypeptide having an amino acid sequence shown by SEQ ID NO:207, SEQ ID NO:213, SEQ ID NO:219, SEQ ID NO:225, SEQ ID NO:231 or SEQ ID NO:236 or substantially the same amino acid sequence thereas. [2] The polypeptide or specific partial peptide thereof according to [1] above, wherein the polypeptide is any of the polypeptides 1) to 10) below: 1) a polypeptide consisting of an amino acid sequence shown by SEQ ID NO:18 or SEQ ID NO:10; 2) a polypeptide consisting of the amino acid sequence shown by SEQ ID NO:43; 3) a polypeptide consisting of the amino acid sequence shown by SEQ ID NO:58; 4) a polypeptide consisting of the amino acid sequence shown by SEQ ID NO:74; 5) a polypeptide consisting of an amino acid sequence shown by SEQ ID NO:89 or SEQ ID NO:99; 6) a polypeptide consisting of the amino acid sequence shown by SEQ ID NO:118; 7) a polypeptide consisting of the amino acid sequence shown by SEQ ID NO:133; 8) a polypeptide consisting of an amino acid sequence shown by SEQ ID NO:152 or SEQ ID NO:159; 9) a polypeptide consisting of an amino acid sequence shown by SEQ ID NO:184 or SEQ ID NO:190; and 10) a polypeptide consisting of an amino acid sequence shown by SEQ ID NO:207, SEQ ID NO:213, SEQ ID NO:219, SEQ ID NO:225, SEQ ID NO:231 or SEQ ID NO:236. [3] The polypeptide or specific partial peptide thereof according to [1] or [2] above, which is fused with a polypeptide consisting of a heterologous amino acid sequence. [4] A partial peptide specific for a polypeptide encoded by one of the brain/nerve-specific genes 1 to 10, being any one of the partial peptides 1) to 10) below: 1) a partial peptide consisting of an amino acid sequence shown by SEQ ID NO:12, SEQ ID NO:15, SEQ ID NO:20 or SEQ ID NO:22 or a partial amino acid sequence thereof; 2) a partial peptide consisting of the amino acid sequence shown by SEQ ID NO:264 or a partial amino acid sequence thereof; 3) a partial peptide having the amino acid sequence shown by SEQ ID NO:60; 4) a partial peptide consisting of the amino acid sequence shown by SEQ ID NO:265 or a partial amino acid sequence thereof; 5) a partial peptide consisting of an amino acid sequence shown by SEQ ID NO:93, SEQ ID NO:94, SEQ ID NO:96, or SEQ ID NO:266 or a partial amino acid sequence thereof; 6) a partial peptide consisting of the amino acid sequence shown by SEQ ID NO:120 or a partial amino acid sequence thereof; 7) a partial peptide consisting of an amino acid sequence shown by SEQ ID NO:135, SEQ ID NO:138 or SEQ ID NO:139 or a partial amino acid sequence thereof; 8) a partial peptide consisting of an amino acid sequence shown by SEQ ID NO:156, SEQ ID NO:161 or SEQ ID NO:163 or a partial amino acid sequence thereof; 9) a partial peptide consisting of an amino acid sequence shown by SEQ ID NO:186 or SEQ ID NO:192 or a partial amino acid sequence thereof; and 10) a partial peptide consisting of an amino acid sequence shown by SEQ ID NO:209, SEQ ID NO:215, SEQ ID NO:221 or SEQ ID NO:227 or a partial amino acid sequence thereof, or a partial peptide having the amino acid sequence shown by SEQ ID NO:238. [5] A polynucleotide that encodes any one of the polypeptides [1] to [3] above, or any one of the specific partial peptides [1] to [4] above. [6] A polynucleotide of any one of 1) to 10) below or a specific partial nucleotide thereof: 1) a polynucleotide having a nucleic acid sequence shown by SEQ ID NO:16 or SEQ ID NO:8, or a nucleic acid sequence corresponding to the ORF thereof, or substantially the same nucleic acid sequence thereas; 2) a polynucleotide having the nucleic acid sequence shown by SEQ ID NO:41, or a nucleic acid sequence corresponding to the ORF thereof, or substantially the same nucleic acid sequence thereas; 3) a polynucleotide having the nucleic acid sequence shown by SEQ ID NO:56, or a nucleic acid sequence corresponding to the ORF thereof, or substantially the same nucleic acid sequence thereas; 4) a polynucleotide having the nucleic acid sequence shown by SEQ ID NO:72, or a nucleic acid sequence corresponding to the ORF thereof, or substantially the same nucleic acid sequence thereas; 5) a polynucleotide having a nucleic acid sequence shown by SEQ ID NO:87 or SEQ ID NO:97, or a nucleic acid sequence corresponding to the ORF thereof, or substantially the same nucleic acid sequence thereas; 6) a polynucleotide having the nucleic acid sequence shown by SEQ ID NO:116, or a nucleic acid sequence corresponding to the ORF thereof, or substantially the same nucleic acid sequence thereas; 7) a polynucleotide having the nucleic acid sequence shown by SEQ ID NO:131, or a nucleic acid sequence corresponding to the ORF thereof, or substantially the same nucleic acid sequence thereas; 8) a polynucleotide having a nucleic acid sequence shown by SEQ ID NO:150 or SEQ ID NO:157, or a nucleic acid sequence corresponding to the ORF thereof, or substantially the same nucleic acid sequence thereas; 9) a polynucleotide having a nucleic acid sequence shown by SEQ ID NO:182 or SEQ ID NO:188, or a nucleic acid sequence corresponding to the ORF thereof, or substantially the same nucleic acid sequence thereas; and 10) a polynucleotide having a nucleic acid sequence shown by SEQ ID NO:205, SEQ ID NO:211, SEQ ID NO:217, SEQ ID NO:223, SEQ ID NO:229 or SEQ ID NO:234, or a nucleic acid sequence corresponding to the ORF thereof, or substantially the same nucleic acid sequence thereas. [7] The polynucleotide or specific partial nucleotide thereof according to [6] above, wherein the any one of the polynucleotides 1) to 10) is any one of the polynucleotides 1) to 10) below: 1) a polynucleotide consisting of a nucleic acid sequence shown by SEQ ID NO:16 or SEQ ID NO:8 or a nucleic acid sequence corresponding to the ORF thereof; 2) a polynucleotide consisting of the nucleic acid sequence shown by SEQ ID NO:41 or a nucleic acid sequence corresponding to the ORF thereof; 3) a polynucleotide consisting of the nucleic acid sequence shown by SEQ ID NO:56 or a nucleic acid sequence corresponding to the ORF thereof; 4) a polynucleotide consisting of the nucleic acid sequence shown by SEQ ID NO:72 or a nucleic acid sequence corresponding to the ORF thereof; 5) a polynucleotide consisting of a nucleic acid sequence shown by SEQ ID NO:87 or SEQ ID NO:97 or a nucleic acid sequence corresponding to the ORF thereof; 6) a polynucleotide consisting of the nucleic acid sequence shown by SEQ ID NO:116 or a nucleic acid sequence corresponding to the ORF thereof; 7) a polynucleotide consisting of the nucleic acid sequence shown by SEQ ID NO:131 or a nucleic acid sequence corresponding to the ORF thereof; 8) a polynucleotide consisting of a nucleic acid sequence shown by SEQ ID NO:150 or SEQ ID NO:157 or a nucleic acid sequence corresponding to the ORF thereof; 9) a polynucleotide consisting of a nucleic acid sequence shown by SEQ ID NO:182 or SEQ ID NO:188 or a nucleic acid sequence corresponding to the ORF thereof; and 10) a polynucleotide consisting of a nucleic acid sequence shown by SEQ ID NO:205, SEQ ID NO:211, SEQ ID NO:217, SEQ ID NO:223, SEQ ID NO:229 or SEQ ID NO:234 or a nucleic acid sequence corresponding to the ORF thereof. [8] A partial nucleotide specific for any one of the polynucleotides encoded by the brain/nerve-specific genes 1 to 10, being any one of the partial nucleotides 1) to 10) below: 1) a partial nucleotide consisting of a nucleic acid sequence shown by SEQ ID NO:11, SEQ ID NO:13, SEQ ID NO:19, SEQ ID NO:21, SEQ ID NO:25, SEQ ID NO:29, SEQ ID NO:33, SEQ ID NO:39 or SEQ ID NO:40 or a partial nucleic acid sequence thereof; 2) a partial nucleotide consisting of a nucleic acid sequence shown by SEQ ID NO:44, SEQ ID NO:45, SEQ ID NO:48, SEQ ID NO:51 or SEQ ID NO:55 or a partial nucleic acid sequence thereof; 3) a partial nucleotide consisting of a nucleic acid sequence shown by SEQ ID NO:59, SEQ ID NO:61, SEQ ID NO:64, SEQ ID NO:67 or SEQ ID NO:71 or a partial nucleic acid sequence thereof; 4) a partial nucleotide consisting of a nucleic acid sequence shown by SEQ ID NO:75, SEQ ID NO:76, SEQ ID NO:79, SEQ ID NO:82 or SEQ ID NO:86 or a partial nucleic acid sequence thereof; 5) a partial nucleotide consisting of a nucleic acid sequence shown by SEQ ID NO:90, SEQ ID NO:91, SEQ ID NO:92, SEQ ID NO:95, SEQ ID NO:100, SEQ ID NO:101, SEQ ID NO:104, SEQ ID NO:107, SEQ ID NO:110, SEQ ID NO:114 or SEQ ID NO:115 or a partial nucleic acid sequence thereof; 6) a partial nucleotide consisting of a nucleic acid sequence shown by SEQ ID NO:119, SEQ ID NO:123, SEQ ID NO:126 or SEQ ID NO:130 or a partial nucleic acid sequence thereof; 7) a partial nucleotide consisting of a nucleic acid sequence shown by SEQ ID NO:134, SEQ ID NO:136, SEQ ID NO:137, SEQ ID NO:142, SEQ ID NO:145 or SEQ ID NO:149 or a partial nucleic acid sequence thereof; 8) a partial nucleotide consisting of a nucleic acid sequence shown by SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:160, SEQ ID NO:162, SEQ ID NO:166, SEQ ID NO:170, SEQ ID NO:174, SEQ ID NO:180 or SEQ ID NO:181 or a partial nucleic acid sequence thereof; 9) a partial nucleotide consisting of a nucleic acid sequence shown by SEQ ID NO:185, SEQ ID NO:187, SEQ ID NO:191, SEQ ID NO:193, SEQ ID NO:199, SEQ ID NO:203 or SEQ ID NO:204 or a partial nucleic acid sequence thereof; and 10) a partial nucleotide consisting of a nucleic acid sequence shown by SEQ ID NO:208, SEQ ID NO:210, SEQ ID NO:214, SEQ ID NO:216, SEQ ID NO:220, SEQ ID NO:222, SEQ ID NO:226, SEQ ID NO:228, SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:237, SEQ ID NO:239, SEQ ID NO:242, SEQ ID NO:245, SEQ ID NO:248, SEQ ID NO:251, SEQ ID NO:254, SEQ ID NO:258, SEQ ID NO:259, SEQ ID NO:260, SEQ ID NO:261, SEQ ID NO:262 or SEQ ID NO:263 or a partial nucleic acid sequence thereof. [9] An expression vector for the polypeptide according to any one of [1] to [3] above or the specific partial peptide according to any one of [1] to [4] above, comprising the polynucleotide according to any one of [5] to [7] above or the specific partial nucleotide according to any one of [6] to [8] above, and a promoter operably linked thereto. [10] A transformant incorporating the expression vector according to [9] above. [11] An antisense molecule comprising a nucleic acid sequence complementary to the nucleic acid sequence of the specific partial nucleotide according to [7] or [8] above, and capable of suppressing the expression of any one of the polypeptides encoded by the brain/nerve-specific genes 1 to 10. [12] An RNAi-inducing nucleic acid capable of suppressing the expression of any one of the polypeptides encoded by the brain/nerve-specific genes 1 to 10, that is configured by a sense strand consisting of the nucleic acid sequence of the specific partial nucleotide according to [7] or [8] above, and an antisense strand consisting of a nucleic acid sequence complementary thereto, and that may have an overhang at the 5' terminus and/or 3' terminus of one or both of the sense strand and the antisense strand. [13] The RNAi-inducing nucleic acid according to [12] above, wherein the RNAi-inducing nucleic acid is an siRNA. [14] An aptamer capable of binding to any one of the polypeptides encoded by the brain/nerve-specific genes 1 to 10 via a region corresponding to the specific partial peptide according to any one of [2] to [4] above. [15] An antibody capable of binding to any one of the polypeptides encoded by the brain/nerve-specific genes 1 to 10 via a region corresponding to the specific partial peptide according to any one of [2] to [4] above. [16] The antibody according to [15] above, wherein the antibody is any one of the i) to iii) below: i) a polyclonal antibody; ii) a monoclonal antibody or a portion thereof; iii) a chimeric antibody, a humanized antibody or a human antibody. [17] A cell that produces the antibody according to [15] or [16] above. [18] The cell according to [17] above, wherein the cell is a hybridoma. [19] A composition comprising the polypeptide according to any one of [1] to [3] above, the antisense molecule according to [11] above, the RNAi-inducing nucleic acid according to [12] or [13] above, the aptamer according to [14] above, the antibody according to [15] or [16] above, or an expression vector therefor, and a pharmaceutically acceptable carrier. [20] A mammalian cell or non-human mammal wherein the expression or a function of the polypeptide according to any one of [1] to [3] above is regulated. [21] A primer set specific for any one of the polynucleotides encoded by the brain/nerve-specific genes 1 to 10 or a specific partial nucleotide thereof, comprising the following (a) or (b): (a) a sense primer corresponding to a first nucleic acid sequence of the polynucleotide according to [7] above or the specific partial nucleotide according to [7] or [8] above; and (b) an antisense primer corresponding to a nucleic acid sequence complementary to a second nucleic acid sequence of the polynucleotide according to [7] above or the specific partial nucleotide according to [7] or [8] above. [22] A nucleic acid probe specific for any one of the polynucleotides encoded by the brain/nerve-specific genes 1 to 10 or a specific partial nucleotide thereof, being any one of the following (a) or (b): (a) a single-stranded polynucleotide comprising a nucleic acid sequence complementary to the nucleic acid sequence of the specific partial nucleotide according to [7] or [8] above; or (b) a double-stranded polynucleotide configured by a sense strand comprising the nucleic acid sequence of the specific partial nucleotide according to [7] or [8] above, and an antisense strand comprising a nucleic acid sequence complementary thereto. [23] A reagent or kit for detection or quantification of any one of the polypeptides or polynucleotides encoded by the brain/nerve-specific genes 1 to 10, comprising one or more substances or sets selected from among the aptamer according to [14] above, the antibody according to [15] or [16] above, the primer set according to [21] above and the nucleic acid probe according to [22] above. [24] The reagent or kit according to [23] above, being a reagent or kit for determination of nerve cell differentiation. [25] A method of detecting or quantifying any one of the polypeptides or polynucleotides encoded by the brain/nerve-specific genes 1 to 10, comprising measuring the expression of the polypeptide or polynucleotide in a biological sample or cell or tissue culture obtained from a mammal, wherein the biological sample or the culture contains a nerve cell or a tissue in the brain. [26] A method of detecting or quantifying the polypeptide according to [2] or [3] above or the polynucleotide according to [7] above, comprising measuring the expression of the polypeptide or the polynucleotide in a biological sample or cell or tissue culture obtained from a mammal. [27] The method of detection or quantification according to [26] above, wherein the biological sample or the culture contains a nerve cell or a tissue in the brain.

Effect of the Invention

[0028] A polypeptide of the present invention and a partial peptide of the present invention can be useful, for example, as a biomarker specific for the brain/nerves or specific for nerve cell differentiation, and in developing a substance capable of specifically recognizing a polypeptide of the present invention or a known polypeptide, or a substance capable of comprehensively recognizing both a polypeptide of the present invention and a known polypeptide, and a substance capable of specifically regulating a function of a polypeptide of the present invention or a known polypeptide, or a substance capable of comprehensively regulating functions of both a polypeptide of the present invention and a known polypeptide.

[0029] A polynucleotide of the present invention and a partial nucleotide of the present invention can be useful, for example, as a biomarker specific for the brain/nerves or specific for nerve cell differentiation, and in developing a substance capable of specifically recognizing a polynucleotide of the present invention or a known polynucleotide, or a substance capable of comprehensively recognizing both a polynucleotide of the present invention and a known polynucleotide, and a substance capable of specifically regulating the expression of a polypeptide of the present invention or a known polypeptide, or a substance capable of comprehensively regulating the expression of both a polypeptide of the present invention and a known polypeptide.

[0030] Related substances of the present invention (e.g., antisense molecules, RNAi-inducing nucleic acids such as siRNAs, aptamers and antibodies, and expression vectors therefor) can be useful as, for example, pharmaceuticals or reagents.

[0031] A cell of the present invention can be useful in, for example, producing a polypeptide of the present invention and a partial peptide of the present invention, and an antibody of the present invention. A cell of the present invention can also be useful in developing a pharmaceutical (e.g., a prophylactic or therapeutic drug for a disease based on a nerve cell disorder), identifying a further marker gene specific for the brain/nerves or specific for nerve cell differentiation, and analyzing a mechanism associated with nerve cell differentiation.

[0032] An animal of the present invention can be useful in, for example, developing a pharmaceutical, identifying a further marker gene specific for the brain/nerves or specific for nerve cell differentiation, and analyzing a mechanism associated with nerve cell differentiation.

[0033] Measuring means (e.g., primer set, nucleic acid probe, antibody, aptamer) and measuring methods of the present invention can be useful in, for example, specific detection and quantitation of a polynucleotide of the present invention or a known polynucleotide, or a polypeptide of the present invention or a known polypeptide, or comprehensive detection and quantitation of both a polynucleotide of the present invention and a known polynucleotide, or both a polypeptide of the present invention and a known polypeptide. These means and methods can also be utilized for determining nerve cell differentiation states and screening for pharmaceuticals, reagents or foods.

BEST MODE FOR CARRYING OUT THE INVENTION

1. Brain/Nerve-Specific Genes

[0034] A gene of the present invention can be a gene derived from an optionally chosen mammal. As examples of the mammal, primates and rodents, as well as laboratory animals, domestic animals, working animals, companion animals and the like can be mentioned. In detail, as examples of the mammal, humans, monkeys, rats, mice, rabbits, horses, cattle, goat, sheep, dogs, cats and the like can be mentioned. Preferably, the mammal is a human.

[0035] A gene of the present invention is capable or incapable of being expressed specifically in a tissue in the brain. A gene of the present invention is also capable of being expressed at a higher or lower level in a tissue in the brain, compared with a known polynucleotide and/or a known polypeptide. As examples of such tissues in the brain, the cerebrum, cerebral cortex, cerebellum, caudate nucleus, corpus callosum, hippocampus, substantia nigra, thalamus, hypothalamus, subthalamic nucleus, hypophysis, amygdala and the like can be mentioned.

[0036] A gene of the present invention is capable or incapable of being expressed specifically in nerve cells. A gene of the present invention is also capable of being expressed at a higher or lower level in nerve cells, compared with a known polynucleotide and/or a known polypeptide. As examples of such nerve cells, nerve cells in the aforementioned tissues can be mentioned.

[0037] Hereinafter, the polypeptides and partial peptides thereof, and polynucleotides and partial nucleotides thereof, provided by the present invention, are described.

1.1. Polypeptides and Partial Peptides Thereof

[0038] The present invention provides a polypeptide having an amino acid sequence shown by SEQ ID NO:X or substantially the same amino acid sequence thereas (abbreviated as "amino acid sequence shown by SEQ ID NO:X and the like" as required).

[0039] "SEQ ID NO:X" denotes the SEQ ID NO of an optionally chosen amino acid sequence disclosed herein. A polypeptide "having" an amino acid sequence shown by SEQ ID NO:X and the like means a polypeptide "consisting of" an amino acid sequence shown by SEQ ID NO:X and the like, and a polypeptide "comprising" the amino acid sequence and the like.

[0040] In one embodiment, substantially the same amino acid sequence as an amino acid sequence shown by SEQ ID NO:X can be an amino acid sequence having a specified amino acid sequence identity to the amino acid sequence shown by SEQ ID NO:X. The degree of amino acid sequence identity can be about 90% or more, preferably about 92% or more, more preferably about 95% or more, still more preferably about 96% or more, and most preferably about 97% or more, about 98% or more or about 99% or more. Amino acid sequence identity can be determined by a method known per se. Unless otherwise specified, amino acid sequence identity (%) is calculated by, for example, executing the commands for the maximum matching method, using the DNASIS sequence analytical software (Hitachi Software Engineering). The parameters for the calculation should be used in default settings. Amino acid sequence identity (%) can also be determined, without following the above procedures, using a program in common use in the art (for example, BLAST, FASTA and the like) in the default settings thereof. In another aspect, the identity (%) can be determined using an optionally chosen algorithm publicly known in the art, for example, the algorithms of Needleman et al. (1970) (J. Mol. Biol. 48: 444-453) and Myers and Miller (CABIOS, 1988, 4: 11-17) and the like. The algorithm of Needleman et al. is incorporated in the GAP program in the GCG software package, and the identity (%) can be determined by, for example, using BLOSUM 62 matrix or PAM250 matrix, with a gap weight of 16, 14, 12, 10, 8, 6 or 4, and a length weight of 1, 2, 3, 4, 5 or 6. The algorithm of Myers and Miller is incorporated in the ALIGN program, which is a portion of the GCG sequence alignment software package. When the ALIGN program is utilized to compare amino acid sequences, for example, PAM120 weight residue table, gap length penalty 12, gap penalty 4, can be used. For calculating amino acid sequence identity, the method that produces the least value among the above-mentioned methods may be employed.

[0041] In another embodiment, substantially the same amino acid sequence as an amino acid sequence shown by SEQ ID NO:X can be an amino acid sequence shown by SEQ ID NO:X wherein one or more amino acids have one or more modifications selected from among substitutions, additions, deletions and insertions. The number of amino acids modified is not particularly limited, as far as it is one or more; the number can be, for example, 1 to about 50, preferably 1 to about 30, more preferably 1 to about 20, still more preferably 1 to about 10, and most preferably 1 to about 5 (e.g., 1 or 2).

[0042] Substantially the same amino acid sequence as an amino acid sequence shown by SEQ ID NO:X may completely retain a characteristic portion thereof (e.g., a portion corresponding to a specific partial polypeptide described below), and may have another portion (e.g., a portion present in a known polypeptide) being substantially the same as the corresponding portion of the amino acid sequence shown by SEQ ID NO:X. Alternatively, substantially the same amino acid sequence as an amino acid sequence shown by SEQ ID NO:X may have a non-characteristic portion thereof being identical to the corresponding portion of the amino acid sequence shown by SEQ ID NO:X, and a characteristic portion thereof being substantially identical to the corresponding portion of the amino acid sequence shown by SEQ ID NO:X.

[0043] A polypeptide of the present invention can have a function that is homogenous or heterogeneous to that of a known polypeptide (e.g., known variant). A polypeptide of the present invention can also have an enhanced or reduced function compared with a known polypeptide (e.g., known variant).

[0044] In detail, the novel polypeptides of the brain/nerve-specific genes 1 to 10 are as follows.

1) Brain/Nerve-Specific Gene 1

[0045] D-BRACE3000012.1 (SEQ ID NO:18)

[0046] D-UTERU2026184.1 (SEQ ID NO:10)

[0047] As a known variant of the brain/nerve-specific gene 1, for example, a variant disclosed in an Example (human zinc finger protein 418 (ZNF418); total number of nucleotides in the ORF nucleic acid sequence: 2031; total number of amino acids in the protein: 676; see GenBank accession number: NM.sub.--133460.1) has been reported. A known variant of the brain/nerve-specific gene 1 can have a specified function (e.g., transcription regulatory capacity) (see, e.g., Ota, T. et al., Nat. Genet. 36 (1), 40-45 (2004)). Generally, it is known that a plurality of variants resulting from a single locus (splicing variants) have similar functions, although the degree can vary. Therefore, novel variants of the brain/nerve-specific gene 1 can also have these functions.

2) Brain/Nerve-Specific Gene 0.2

[0048] D-NT2RP8004156.1 (SEQ ID NO:43)

[0049] As a known variant of the brain/nerve-specific gene 2, for example, a variant disclosed in an Example (human v-akt mouse thymoma virus oncogene homologue 1 (AKT1); total number of nucleotides in the ORF nucleic acid sequence: 1443; total number of amino acids in the protein: 480; see GenBank accession number: NM.sub.--005163.1) has been reported. It has been reported that known variants of the brain/nerve-specific gene 2 have a specified function (e.g., kinase activity, anti-apoptotic activity, or cell cycle regulatory capacity) (see, e.g., Mirza, A. M. et al., Mol. Cell. Biol. 24 (24), 10868-10881 (2004); Koga, M. et al., Biochem. Biophys. Res. Commun. 324 (1), 321-325 (2004)). Generally, it is known that a plurality of variants resulting from a single locus (splicing variants) have similar functions, although the degree can vary. Therefore, novel variants of the brain/nerve-specific gene 2 can also have these functions.

3) Brain/Nerve-Specific Gene 3

[0050] D-NT2RI3005525.1 (SEQ ID NO:58)

[0051] As a known variant of the brain/nerve-specific gene 3, for example, a variant disclosed in an Example (human budding-related, EVH1 domain-containing 2 (SPRED2); total number of nucleotides in the ORF nucleic acid sequence: 1257; total number of amino acids in the protein: 418; see GenBank accession number: NM.sub.--181784.1) has been reported. It has been reported that known variants of the brain/nerve-specific gene 3 have a specified function (e.g., MAP kinase activation inhibitory capacity, tyrosine kinase-mediated Erk activation inhibitory capacity) (see, e.g., Nobuhisa, I. et al., J. Exp. Med. 199 (5), 737-742 (2004); Kato, R. et al., Biochem. Biophys. Res. Commun. 302 (4), 767-772 (2003)). Generally, it is known that a plurality of variants resulting from a single locus (splicing variants) have similar functions, although the degree can vary. Therefore, novel variants of the brain/nerve-specific gene 3 can also have these functions.

4) Brain/Nerve-Specific Gene 4

[0052] D-NT2RP8004592.1 (SEQ ID NO:74)

[0053] As a known variant of the brain/nerve-specific gene 4, for example, a variant disclosed in an Example (human src kinase related phosphoprotein 2 (SKAP2); total number of nucleotides in the ORF nucleic acid sequence: 1080; total number of amino acids in the protein: 359; see GenBank accession number: NM.sub.--003930.3) has been reported. It has been reported that known variants of the brain/nerve-specific gene 4 have a specified function (e.g., .alpha.-synuclein phosphorylation inhibitory capacity) (see, e.g., Takahashi, T. et al., J. Biol. Chem. 278 (43), 42225-42233 (2003)). Generally, it is known that a plurality of variants resulting from a single locus (splicing variants) have similar functions, although the degree can vary. Therefore, novel variants of the brain/nerve-specific gene 4 can also have these functions.

5) Brain/Nerve-Specific Gene 5

[0054] D-NT2RI2014164.1 (SEQ ID NO:89)

[0055] D-BRAMY2029564.1 (SEQ ID NO:99)

[0056] As a known variant of the brain/nerve-specific gene 5, for example, a variant disclosed in an Example (human monoamine oxidase B (MAOB); total number of nucleotides in the ORF nucleic acid sequence: 1563; total number of amino acids in the protein: 520; see GenBank accession number: NM.sub.--000898.3) has been reported. It has been reported that known variants of the brain/nerve-specific gene 5 have a specified function (e.g., monoamine oxidase activity) (see, e.g., Bach, A. W. et al., Proc. Natl. Acad. Sci. U.S.A. 85 (13), 4934-4938 (1988)). Generally, it is known that a plurality of variants resulting from a single locus (splicing variants) have similar functions, although the degree can vary. Therefore, novel variants of the brain/nerve-specific gene 5 can also have these functions.

6) Brain/Nerve-Specific Gene 6

[0057] D-BRHIP2003515.1 (SEQ ID NO:118)

[0058] As a known variant of the brain/nerve-specific gene 6, for example, a variant disclosed in an Example (human tumor protein D52 (TPD52); total number of nucleotides in the ORF nucleic acid sequence: 555; total number of amino acids in the protein: 184; see GenBank accession number: NM.sub.--005079.1) has been reported. It has been reported that known variants of the brain/nerve-specific gene 6 have a specified function (e.g., capability of Ca.sup.2+ dependent interaction with annexin VI) (see, e.g., Tiacci, E. et al., Blood 105 (7), 2812-2820 (2005)). Generally, it is known that a plurality of variants resulting from a single locus (splicing variants) have similar functions, although the degree can vary. Therefore, novel variants of the brain/nerve-specific gene 6 can also have these functions.

7) Brain/Nerve-Specific Gene 7

[0059] D-BRACE2044661.1 (SEQ ID NO:133)

[0060] As a known variant of the brain/nerve-specific gene 7, for example, a variant disclosed in an Example (human ATPase, Na.sup.+/K.sup.+ transport property, .beta.3 polypeptide (ATP1B3); total number of nucleotides in the ORF nucleic acid sequence: 840; total number of amino acids in the protein: 279; see GenBank accession number: NM.sub.--001679.2) has been reported. It has been reported that known variants of the brain/nerve-specific gene 7 have a specified function (e.g., ATP hydrolysis activity in the presence of an ion such as Na.sup.+ or K.sup.+) (see, e.g., Malik, N. et al., J. Biol. Chem. 271 (37), 22754-22758 (1996)). Generally, it is known that a plurality of variants resulting from a single locus (splicing variants) have similar functions, although the degree can vary. Therefore, novel variants of the brain/nerve-specific gene 7 can also have these functions.

8) Brain/Nerve-Specific Gene 8

[0061] D-3NB692002462.1 (SEQ ID NO:152)

[0062] D-BRCAN2027778.1 (SEQ ID NO:159)

[0063] As a known variant of the brain/nerve-specific gene 8, for example, a variant disclosed in an Example (human mevalonic acid kinase (MVK); total number of nucleotides in the ORF nucleic acid sequence: 1191; total number of amino acids in the protein: 396; see GenBank accession number: NM.sub.--000431.1) has been reported. It has been reported that known variants brain/nerve-specific gene 8 have a specified function (e.g., mevalonic acid kinase activity) (see, e.g., Hogenboom, S. et al., J. Cell. Sci. 117 (PT 4), 631-639 (2004)). Generally, it is known that a plurality of variants resulting from a single locus (splicing variants) have similar functions, although the degree can vary. Therefore, novel variants of the brain/nerve-specific gene 8 can also have these functions.

9) Brain/Nerve-Specific Gene 9

[0064] D-NT2RI3001005.1 (SEQ ID NO:184)

[0065] D-NT2RI3005261.1 (SEQ ID NO:190)

[0066] As a known variant of the brain/nerve-specific gene 9, for example, a variant disclosed in an Example (human solute carrier family 2 (promoting glucose transporter), member 14 (SLC2A14); total number of nucleotides in the ORF nucleic acid sequence: 1563; total number of amino acids in the protein: 520; see GenBank accession number: NM.sub.--153449.2) has been reported. Known variants of the brain/nerve-specific gene 9 can have a specified function (e.g., glucose transportation capacity) (see, e.g., Wu, X. et al., Genomics 80 (6), 553-557 (2002)). Generally, it is known that a plurality of variants resulting from a single locus (splicing variants) have similar functions, although the degree can vary. Therefore, novel variants of the brain/nerve-specific gene 9 can also have these functions.

10) Brain/Nerve-Specific Gene 10

[0067] D-OCBBF2010718.1 (SEQ ID NO:207)

[0068] D-OCBBF3004194.1 (SEQ ID NO:213)

[0069] D-NT2RP8000826.1 (SEQ ID NO:219)

[0070] D-NT2RP7007268.1 (SEQ ID NO:225)

[0071] D-BRAWH3008172.1 (SEQ ID NO:331)

[0072] D-BRAWH3011965.1 (SEQ ID NO:236)

[0073] As a known variant of the brain/nerve-specific gene 10, for example, a variant disclosed in an Example (human PDZ domain-containing RING finger 3 (PDZRN3); total number of nucleotides in the ORF nucleic acid sequence: 3201; total number of amino acids in the protein: 1066; see GenBank accession number: NM.sub.--015009.1) has been reported. Known variants of the brain/nerve-specific gene 10 can have a specified function (e.g., capability of binding to a cell surface protein such as neuroligin via the PDZ domain thereof) (see, e.g., Meyer, G. et al., Neuropharmacology 47 (5), 724-733 (2004)). Generally, it is known that a plurality of variants resulting from a single locus (splicing variants) have similar functions, although the degree can vary. Therefore, novel variants of the brain/nerve-specific gene 10 can also have these functions.

[0074] A polypeptide of the present invention can be useful in, for example, developing a substance capable of specifically recognizing a polypeptide of the present invention, a substance incapable of specifically recognizing a polypeptide of the present invention, or a substance capable of comprehensively recognizing both a polypeptide of the present invention and a known polypeptide, and in developing a substance capable of specifically regulating a function of a polypeptide of the present invention, a substance incapable of specifically regulating a function of a polypeptide of the present invention, or a substance capable of comprehensively recognizing functions of both a polypeptide of the present invention and a known polypeptide.

[0075] The present invention also provides a partial peptide.

[0076] "A partial peptide" consists of at least 6, preferably at least 8, more preferably at least 10, still more preferably at least 12, and most preferably at least 15, consecutive amino acid residues selected from among subject polypeptides, that can have a specified utility (e.g., use as an immunogenic or antigenic peptide, a functional peptide having a particular domain and the like).

[0077] "An insert amino acid sequence of a polypeptide of the present invention" refers to an amino acid sequence that is incorporated in a polypeptide of the present invention (e.g., novel variant), but lacked in a known polypeptide (e.g., known variant). Meanwhile, "an insert amino acid sequence of a known polypeptide" refers to an amino acid sequence that is incorporated in a known polypeptide (e.g., known variant), but lacked in a polypeptide of the present invention (e.g., novel variant). These insert amino acid sequences are obvious from the disclosure herein.

[0078] "A deleted amino acid sequence of a polypeptide of the present invention" refers to an amino acid sequence that is lacked in a polypeptide of the present invention (e.g., novel variant), but incorporated in a known polypeptide (e.g., known variant). Meanwhile, "a deleted amino acid sequence of a known polypeptide" refers to an amino acid sequence that is lacked in a known polypeptide (e.g., known variant), but incorporated in a polypeptide of the present invention (e.g., novel variant). These deleted amino acid sequences are obvious from the disclosure herein. "A deleted amino acid sequence of a polypeptide of the present invention" can have the same definition as that for "an insert amino acid sequence of a known polypeptide"; "a deleted amino acid sequence of a known polypeptide" can have the same definition as that for "an insert amino acid sequence of a polypeptide of the present invention".

[0079] A partial peptide of the present invention can be a) a specific partial peptide of a polypeptide of the present invention, capable of distinguishing a polypeptide of the present invention from a known polypeptide (abbreviated as "specific partial peptide A" as required), b) a specific partial peptide of a known polypeptide, capable of distinguishing a known polypeptide from a polypeptide of the present invention (abbreviated as "specific partial peptide B" as required), or c) a partial peptide common to both a polypeptide of the present invention and a known polypeptide (abbreviated as "shared partial peptide" as required). For these particular partial peptides, there appears a motivation for preparing them or utilizing them as markers on the basis of the present inventors' findings; however, without these findings, there is no motivation for preparing them or utilizing them as markers. Being partial peptides specific for the polypeptides encoded by the brain/nerve-specific genes 1 to 10, the specific partial peptides A and B are abbreviated as "specific partial peptides of the present invention" or "specific partial peptides" as required.

[0080] The specific partial peptide A of the present invention is a partial peptide that is present only in a polypeptide having an amino acid sequence shown by SEQ ID NO:X and the like, and that is not present in any known polypeptide. As examples of the specific partial peptide A, i) a partial peptide consisting of an insert amino acid sequence of a polypeptide of the present invention or a partial amino acid sequence thereof, ii) a partial peptide consisting of an insert amino acid sequence of a polypeptide of the present invention or a terminal partial amino acid sequence thereof and an adjacent amino acid sequence thereof, and iii) a partial peptide consisting of an amino acid sequence wherein both amino acid sequences present on the N-terminal side and C-terminal side relative to an insert amino acid sequence of a known polypeptide are linked together, formed as a result of exon deletion, can be mentioned.

[0081] The specific partial peptide A of i) above consists of an insert amino acid sequence of a polypeptide of the present invention or a partial amino acid sequence thereof. Such partial amino acid sequences are obvious from the disclosure herein.

[0082] The specific partial peptide A of ii) above consists of an insert amino acid sequence of a polypeptide of the present invention or a terminal partial amino acid sequence thereof and an adjacent amino acid sequence thereof. As such terminal partial amino acid sequences, an amino acid sequence corresponding to an N-terminal portion of an insert amino acid sequence of a polypeptide of the present invention (abbreviated as "N-terminal partial amino acid sequence A" as required), and an amino acid sequence corresponding to a C-terminal portion of an insert amino acid sequence of a polypeptide of the present invention (abbreviated as "C-terminal partial amino acid sequence A" as required) can be mentioned. As such adjacent amino acid sequences, an amino acid sequence present on the N-terminal side relative to an insert amino acid sequence of a polypeptide of the present invention (abbreviated as "N-terminal adjacent amino acid sequence A" as required), and an amino acid sequence present on the C-terminal side relative to an insert amino acid sequence of a polypeptide of the present invention (abbreviated as "C-terminal adjacent amino acid sequence A" as required) can be mentioned. Therefore, the specific partial peptide A of ii) above can be a partial peptide consisting of an amino acid sequence spanning from a specified position of the N-terminal adjacent amino acid sequence A to a specified position of an insert amino acid sequence of a polypeptide of the present invention, a partial peptide consisting of an amino acid sequence spanning from a specified position of an insert amino acid sequence of a polypeptide of the present invention to a specified position of the C-terminal adjacent amino acid sequence A, or a partial peptide consisting of an amino acid sequence comprising the whole insert amino acid sequence of a polypeptide of the present invention, spanning from a specified position of the N-terminal adjacent amino acid sequence A to a specified position of the C-terminal adjacent amino acid sequence A. The number of amino acid residues in the insert amino acid sequence (or N-terminal or C-terminal partial amino acid sequence A) or adjacent amino acid sequence (or N-terminal or C-terminal adjacent amino acid sequence A), contained in the specific partial peptide A of ii) above, is not particularly limited, as far as it is a number that ensures the specificity of the specific partial peptide A of ii) above; the number can be, for example, at least 3, preferably at least 4, more preferably at least 5, still more preferably at least 6, and most preferably at least 7, 8, 9 or 10. Such terminal partial amino acid sequences and such adjacent amino acid sequences are obvious from the disclosure herein.

[0083] The specific partial peptide A of iii) above is a partial peptide not present in a known polypeptide, consisting of an amino acid sequence wherein both amino acid sequences present on the N-terminal side and C-terminal side relative to an insert amino acid sequence of a known polypeptide are linked together (in a polypeptide of the present invention, these amino acid sequences are linked together as a result of exon deletion). The number of amino acid residues in each amino acid sequence present on the N-terminal side and C-terminal side relative to an insert amino acid sequence of a known polypeptide, contained in the specific partial peptide A of iii) above, is not particularly limited, as far as it is a number that ensures the specificity of the specific partial peptide A of iii) above; the number can be, for example, at least 3, preferably at least 4, more preferably at least 5, still more preferably at least 6, and most preferably at least 7, 8, 9 or 10.

[0084] The specific partial peptide A of the present invention can be useful as, for example, a target for specifically detecting a polypeptide of the present invention, and as a marker specific for the brain/nerves or specific for nerve differentiation. The specific partial peptide A of the present invention can also be useful in developing a substance capable of specifically recognizing a polypeptide of the present invention, or a substance incapable of specifically recognizing a polypeptide of the present invention, or developing a substance capable of specifically regulating a function of a polypeptide of the present invention, or a substance incapable of specifically regulating a function of a polypeptide of the present invention.

[0085] The specific partial peptide B of the present invention is a partial peptide that is present only in a known polypeptide, and that is not present in a polypeptide having an amino acid sequence shown by SEQ ID NO:X and the like. As examples of the specific partial peptide B, i) a partial peptide consisting of an insert amino acid sequence of a known polypeptide or a partial amino acid sequence thereof, ii) a partial peptide consisting of an insert amino acid sequence of a known polypeptide or a terminal partial amino acid sequence thereof and an adjacent amino acid sequence thereof, and iii) a partial peptide consisting of an amino acid sequence wherein both amino acid sequences present on the N-terminal side and C-terminal side relative to an insert amino acid sequence of a polypeptide of the present invention are linked together, formed as a result of exon deletion, can be mentioned.

[0086] The specific partial peptide B of i) above consists of an insert amino acid sequence of a known polypeptide or a partial amino acid sequence thereof. Such partial amino acid sequences are obvious from the disclosure herein.

[0087] The specific partial peptide B of ii) above consists of an insert amino acid sequence of a known polypeptide or a terminal partial amino acid sequence thereof and an adjacent amino acid sequence thereof. As such terminal partial amino acid sequences, an amino acid sequence corresponding to an N-terminal portion of an insert amino acid sequence of a known polypeptide (abbreviated as "N-terminal partial amino acid sequence B" as required), and an amino acid sequence corresponding to a C-terminal portion of an insert amino acid sequence of a known polypeptide (abbreviated as "C-terminal partial amino acid sequence B" as required) can be mentioned. As such adjacent amino acid sequences, an amino acid sequence present on the N-terminal side relative to an insert amino acid sequence of a known polypeptide (abbreviated as "N-terminal adjacent amino acid sequence B" as required), and an amino acid sequence present on the C-terminal side relative to an insert amino acid sequence of a known polypeptide (abbreviated as "C-terminal adjacent amino acid sequence B" as required) can be mentioned. Therefore, the specific partial peptide B of ii) above can be a partial peptide consisting of an amino acid sequence spanning from a specified position of the N-terminal adjacent amino acid sequence B to a specified position of an insert amino acid sequence of a known polypeptide, a partial peptide consisting of an amino acid sequence spanning from a specified position of an insert amino acid sequence of a known polypeptide to a specified position of the C-terminal adjacent amino acid sequence B, or a partial peptide consisting of an amino acid sequence comprising the whole insert amino acid sequence of a known polypeptide, spanning from a specified position of the N-terminal adjacent amino acid sequence B to a specified position of the C-terminal adjacent amino acid sequence B. The number of amino acid residues in the insert amino acid sequence (or N-terminal or C-terminal partial amino acid sequence B) or adjacent amino acid sequence (or N-terminal or C-terminal adjacent amino acid sequence B), contained in the specific partial peptide B of ii) above, is not particularly limited, as far as it is a number that ensures the specificity of the specific partial peptide B of ii) above; the number can be, for example, at least 3, preferably at least 4, more preferably at least 5, still more preferably at least 6, and most preferably at least 7, 8, 9 or 10. Such terminal partial amino acid sequences and such adjacent amino acid sequences are obvious from the disclosure herein.

[0088] The specific partial peptide B of iii) above is a partial peptide that is not present in a polypeptide of the present invention, consisting of an amino acid sequence wherein both amino acid sequences present on the N-terminal side and C-terminal side relative to an insert amino acid sequence of a polypeptide of the present invention are linked together (in a known polypeptide, these amino acid sequences are linked together as a result of exon deletion). The number of amino acid residues in each amino acid sequence present on the N-terminal side and C-terminal side relative to the insert amino acid sequence of a polypeptide of the present invention, contained in the specific partial peptide B of iii) above, is not particularly limited, as far as it is a number that ensures the specificity of the specific partial peptide B of iii) above; the number can be, for example, at least 3, preferably at least 4, more preferably at least 5, still more preferably at least 6, and most preferably at least 7, 8, 9 or 10, respectively.

[0089] The specific partial peptide B of the present invention can be useful as, for example, a target for specifically detecting a known polypeptide, and as a marker specific for the brain/nerves or specific for nerve differentiation, or as a marker not specific therefor. The specific partial peptide B of the present invention can also be useful in developing a substance capable of specifically recognizing a known polypeptide, or a substance incapable of specifically recognizing a known polypeptide, or developing a substance capable of specifically regulating a function of a known polypeptide, or a substance incapable of specifically regulating a function of a known polypeptide.

[0090] A shared partial peptide of the present invention can be a non-specific partial peptide that is present in both a polypeptide of the present invention and a known polypeptide. Such partial peptides are obvious from the disclosure herein. A shared partial peptide of the present invention can be useful as, for example, a target for comprehensively detecting both a polypeptide of the present invention and a known polypeptide, and as a marker specific for the brain/nerves or specific for nerve differentiation, or as a marker not specific therefor. A shared partial peptide of the present invention can also be useful in developing a substance capable of comprehensively recognizing both a polypeptide of the present invention and a known polypeptide, or a substance capable of comprehensively regulating functions of both a polypeptide of the present invention and a known polypeptide.

[0091] A polypeptide of the present invention or a specific partial peptide thereof may be fused with a polypeptide consisting of a heterologous amino acid sequence. As such a polypeptide, a polypeptide that facilitates purification or solubilization can be mentioned. In detail, as such polypeptides, histidine tag, maltose-binding protein (MBP), glutathione-S-transferase (GST), calmodulin-binding peptide (CBP), FLAG, and the Fc region of IgG molecule can be mentioned.

[0092] A polypeptide of the present invention and a partial peptide thereof may be provided in the form of a salt. As examples of the salt, salts with inorganic bases (e.g., alkali metals such as sodium and potassium; alkaline earth metals such as calcium and magnesium; aluminum, ammonium), salts with organic bases (e.g., trimethylamine, triethylamine, pyridine, picoline, ethanolamine, diethanolamine, triethanolamine, dicyclohexylamine, N,N-dibenzylethylenediamine), salts with inorganic acids (e.g., hydrochloric acid, hydrobromic acid, nitric acid, sulfuric acid, phosphoric acid), salts with organic acids (e.g., formic acid, acetic acid, trifluoroacetic acid, fumaric acid, oxalic acid, tartaric acid, maleic acid, citric acid, succinic acid, malic acid, methanesulfonic acid, benzenesulfonic acid, p-toluenesulfonic acid), salts with basic amino acids (e.g., arginine, lysine, ornithine) or salts with acidic amino acids (e.g., aspartic acid, glutamic acid) and the like can be mentioned.

[0093] A polypeptide of the present invention and a partial peptide thereof can be prepared by a method known per se. For example, a polypeptide of the present invention and a partial peptide thereof 1) may be recovered from an expression site, 2) may be recovered from a transformant described below, which expresses a polypeptide of the present invention and a partial peptide thereof, or a culture supernatant thereof, 3) may be synthesized using a cell-free system based on a rabbit reticulocyte lysate, wheat germ lysate, Escherichia coli lysate and the like, or 4) may be synthesized organochemically (e.g., solid phase synthesis). A polypeptide of the present invention and a partial peptide thereof are purified as appropriate by methods based on differences in solubility, such as salting-out and solvent precipitation; methods based mainly on differences in molecular weight, such as dialysis, ultrafiltration, gel filtration, and SDS-polyacrylamide gel electrophoresis; methods based on differences in electric charge, such as ion exchange chromatography; methods based on specific affinity, such as affinity chromatography and use of antibody; methods based on differences in hydrophobicity, such as reverse phase high performance liquid chromatography; methods based on differences in isoelectric point, such as isoelectric focusing; and combinations thereof the like.

1.2. Polynucleotides and Partial Nucleotides Thereof

[0094] The present invention provides a polynucleotide having a nucleic acid sequence shown by SEQ ID NO:Y, or the nucleic acid sequence Y1 or the nucleic acid sequence Y2, or substantially the same nucleic acid sequence thereas (abbreviated as "nucleic acid sequence shown by SEQ ID NO:Y and the like" as required).

[0095] "SEQ ID NO:Y" denotes the SEQ ID NO of an optionally chosen nucleic acid sequence disclosed herein. A polynucleotide "having" SEQ ID NO:Y and the like means a polynucleotide "consisting of" SEQ ID NO:Y and the like, or a polynucleotide "comprising" the nucleic acid sequence and the like.

[0096] "The nucleic acid sequence Y1" denotes a nucleic acid sequence corresponding to the coding portion (that is, the entire open reading frame (ORF) or a portion thereof) in a nucleic acid sequence shown by SEQ ID NO:Y. In other words, "the nucleic acid sequence Y1" denotes a nucleic acid sequence shown by SEQ ID NO:Y when the nucleic acid sequence shown by SEQ ID NO:Y consists of a nucleic acid sequence corresponding to the coding portion only, and it denotes a nucleic acid sequence corresponding to the coding portion only when the nucleic acid sequence shown by SEQ ID NO:Y comprises nucleic acid sequences corresponding to both the coding portion and the non-coding portion.

[0097] "The nucleic acid sequence Y2" denotes a nucleic acid sequence corresponding to a non-coding portion (e.g., 5' or 3' noncoding region) in a nucleic acid sequence shown by SEQ ID NO:Y. In other words, "the nucleic acid sequence Y2" denotes a nucleic acid sequence shown by SEQ ID NO:Y when the nucleic acid sequence shown by SEQ ID NO:Y consists of a nucleic acid sequence corresponding to the non-coding portion only, and it denotes a nucleic acid sequence corresponding to the non-coding portion only when the nucleic acid sequence shown by SEQ ID NO:Y comprises nucleic acid sequences corresponding to both the non-coding portion and the coding portion.

[0098] Therefore, a nucleic acid sequence denoted by "SEQ ID NO:Y" can be denoted by any one of i) the nucleic acid sequence Y1 (when the nucleic acid sequence shown by SEQ ID NO:Y as a whole is a nucleic acid sequence corresponding to the coding portion), ii) the nucleic acid sequence Y2 (when the nucleic acid sequence shown by SEQ ID NO:Y as a whole is a nucleic acid sequence corresponding to the non-coding portion), or iii) a nucleic acid sequence comprising the nucleic acid sequence Y1 and the nucleic acid sequence Y2 (when the nucleic acid sequence shown by SEQ ID NO:Y comprises nucleic acid sequences corresponding to the coding portion and the non-coding portion).

[0099] In one embodiment, substantially the same nucleic acid sequence as a nucleic acid sequence shown by SEQ ID NO:Y, or the nucleic acid sequence Y1 or the nucleic acid sequence Y2 can be a nucleic acid sequence having a specified sequence identity to the nucleic acid sequence shown by SEQ ID NO:Y, or the nucleic acid sequence Y1 or the nucleic acid sequence Y2. The degree of nucleic acid sequence identity can be about 90% or more, preferably about 92% or more, more preferably about 95% or more, still more preferably about 96% or more, and most preferably about 97% or more, about 98% or more or about 99% or more. Nucleic acid sequence identity can be determined by a method known per se. For example, nucleic acid sequence identity (%) can be determined by the same method as that described above for amino acid sequence identity (%).

[0100] In another embodiment, substantially the same nucleic acid sequence as a nucleic acid sequence shown by SEQ ID NO:Y or the nucleic acid sequence Y1 or the nucleic acid sequence Y2 can be the nucleic acid sequence shown by SEQ ID NO:Y or the nucleic acid sequence Y1 or the nucleic acid sequence Y2, wherein one or more nucleotides have one or more modifications selected from among substitutions, additions, deletions and insertions. The number of nucleotides modified is not particularly limited, as far as it is one or more, and the number can be, for example, 1 to about 100, preferably 1 to about 70, more preferably 1 to about 50, still more preferably 1 to about 30, and most preferably 1 to about 20, 1 to about 10 or 1 to about 5 (e.g., 1 or 2).

[0101] In still another embodiment, substantially the same nucleic acid sequence as a nucleic acid sequence shown by SEQ ID NO:Y, or the nucleic acid sequence Y1 or the nucleic acid sequence Y2 can be a polynucleotide that can be hybridized to a nucleic acid sequence complementary to the nucleic acid sequence shown by SEQ ID NO:Y, or the nucleic acid sequence Y1 or the nucleic acid sequence Y2 under high stringent conditions. Hybridization conditions under high stringent conditions can be set with reference to reported conditions (see, e.g., Current Protocols in Molecular Biology, John Wiley & Sons, 6.3.1-6.3.6 (1999)). For example, as hybridization conditions under high stringent conditions, hybridization with 6.times.SSC (sodium chloride/sodium citrate)/45.degree. C., followed by washing with 0.2.times.SSC/0.1% SDS/50 to 65.degree. C. once or twice or more, can be mentioned.

[0102] Substantially the same nucleic acid sequence as a nucleic acid sequence shown by SEQ ID NO:Y, or the nucleic acid sequence Y1 or the nucleic acid sequence Y2 may completely retain a characteristic portion thereof (e.g., a portion corresponding to a specific partial nucleotide described below), and may have another portion (e.g., a portion present in a known polynucleotide) being substantially the same as the corresponding portion of the nucleic acid sequence shown by SEQ ID NO:Y, or the nucleic acid sequence Y1 or the nucleic acid sequence Y2. Alternatively, substantially the same nucleic acid sequence as a nucleic acid sequence shown by SEQ ID NO:Y, or the nucleic acid sequence Y1 or the nucleic acid sequence Y2 may have a non-characteristic portion thereof being the same as the corresponding portion of the nucleic acid sequence shown by SEQ ID NO:Y, or the nucleic acid sequence Y1 or the nucleic acid sequence Y2, and a characteristic portion thereof being substantially the same as the corresponding portion of the nucleic acid sequence shown by SEQ ID NO:Y, or the nucleic acid sequence Y1 or the nucleic acid sequence Y2.

[0103] A polynucleotide of the present invention is capable of encoding a polypeptide of the present invention. Therefore, a polynucleotide of the present invention can be a polynucleotide such that the polypeptide encoded thereby is capable of being functionally equivalent to a polypeptide of the present invention.

[0104] In detail, for the brain/nerve-specific genes 1 to 10, the nucleic acid sequence Y of the polynucleotide, and the SEQ ID NO:Y and Ya-th to Yb-th of the ORF-corresponding portion thereof (Ya-th to Yb-th nucleotide residues in the nucleic acid sequence Y) are as follows.

1) Brain/Nerve-Specific Gene 1

[0105] D-BRACE3000012.1 (SEQ ID NO:16 or SEQ ID NO:17, and 465th to 2558th)

[0106] D-UTERU2026184.1 (SEQ ID NO:8 or SEQ ID NO:9, and 191st to 2119th)

2) Brain/Nerve-Specific Gene 2

[0107] D-NT2RP8004156.1 (SEQ ID NO:41 or SEQ ID NO:42, and 131st to 1387th)

3) Brain/Nerve-Specific Gene 3

[0108] D-NT2RI3005525.1 (SEQ. ID NO:56 or SEQ ID NO:57, and 45th to 1292nd)

4) Brain/Nerve-Specific Gene 4

[0109] D-NT2RP8004592.1 (SEQ ID NO:72 or SEQ ID NO:73, and 620th to 1183rd)

5) Brain/Nerve-Specific Gene 5

[0110] D-NT2RI2014164.1 (SEQ ID NO:87 or SEQ ID NO:88, and 162nd to 1397th)

[0111] D-BRAMY2029564.1 (SEQ ID NO:97 or SEQ ID NO:98, and 143rd to 1657th)

6) Brain/Nerve-Specific Gene 6

[0112] D-BRHIP2003515.1 (SEQ ID NO:116 or SEQ ID NO:117, and 84th to 707th)

7) Brain/Nerve-Specific Gene 7

[0113] D-BRACE2044661.1 (SEQ ID NO:131 or SEQ ID NO:132, and 297th to 878th)

8) Brain/Nerve-Specific Gene 8

[0114] D-3NB692002462.1 (SEQ ID NO:150 or SEQ ID NO:151, and 343rd to 951st)

[0115] D-BRCAN2027778.1 (SEQ ID NO:157 or SEQ ID NO:158, and 52nd to 1086th)

9) Brain/Nerve-Specific Gene 9

[0116] D-NT2RI3001005.1 (SEQ ID NO:182 or SEQ ID NO:183, and 22nd to 1629th)

[0117] D-NT2RI3005261.1 (SEQ ID NO:188 or SEQ ID NO:189, and 22nd to 1629th)

10) Brain/Nerve-Specific Gene 10

[0118] D-OCBBF2010718.1 (SEQ ID NO:205 or SEQ ID NO:206, and 144th to 2495th)

[0119] D-OCBBF3004194.1 (SEQ ID NO:211 or SEQ ID NO:212, and 129th to 2480th)

[0120] D-NT2RP8000826.1 (SEQ ID NO:217 or SEQ ID NO:218, and 95th to 2461st)

[0121] D-NT2RP7007268.1 (SEQ ID NO:223 or SEQ ID NO:224, and 95th to 2461st)

[0122] D-BRAWH3008172.1 (SEQ ID NO:229 or SEQ ID NO:330, and 281st to 2452nd)

[0123] D-BRAWH3011965.1 (SEQ ID NO:234 or SEQ ID NO:235, and 300th to 1574th)

[0124] A polynucleotide of the present invention can be useful in, for example, developing a substance capable of specifically recognizing a polynucleotide of the present invention, a substance incapable of specifically recognizing a polynucleotide of the present invention, or a substance capable of comprehensively recognizing both a polynucleotide of the present invention and a known polynucleotide, and developing a substance capable of specifically regulating the expression of a polypeptide of the present invention, a substance incapable of specifically regulating the expression of a polypeptide of the present invention, or a substance capable of comprehensively regulating the expression of both a polypeptide of the present invention and a known polypeptide.

[0125] The present invention also provides a partial nucleotide.

[0126] "A partial nucleotide" consists of at least 15, preferably at least 16, more preferably at least 18, still more preferably at least 20, and most preferably at least 22, 23, 24 or 25, consecutive nucleotide residues selected from among subject polynucleotides, that can have a specified utility (e.g., use as a probe, a primer, a polynucleotide that encodes an immunogenic or antigenic peptide, a polynucleotide that encodes a functional peptide having a particular domain and the like).

[0127] "An insert nucleic acid sequence of a polynucleotide of the present invention" refers to a nucleic acid sequence that is incorporated in a polynucleotide of the present invention (e.g., novel variant), but lacked in a known polynucleotide (e.g., known variant). Meanwhile, "an insert nucleic acid sequence of a known polynucleotide" refers to a nucleic acid sequence that is incorporated in a known polynucleotide (e.g., known variant), but lacked in a polynucleotide of the present invention (e.g., novel variant). These insert nucleic acid sequences are obvious from the disclosure herein.

[0128] "A deletion nucleic acid sequence of a polynucleotide of the present invention" refers to a nucleic acid sequence that is lacked in a polynucleotide of the present invention (e.g., novel variant), but inserted in a known polynucleotide (e.g., known variant). Meanwhile, "a deletion nucleic acid sequence of a known polynucleotide" refers to a nucleic acid sequence that is lacked in a known polynucleotide (e.g., known variant), but inserted in a polynucleotide of the present invention (e.g., novel variant). These deletion nucleic acid sequences are obvious from the disclosure herein. "A deletion nucleic acid sequence of a polynucleotide of the present invention" can have the same definition as that for "an insert nucleic acid sequence of a known polynucleotide"; "a deletion nucleic acid sequence of a known polynucleotide" can have the same definition as that for "an insert nucleic acid sequence of a polynucleotide of the present invention".

[0129] A partial nucleotide of the present invention can be a) a specific partial nucleotide of a polynucleotide of the present invention, capable of distinguishing a polynucleotide of the present invention from a known polynucleotide (abbreviated as "specific partial nucleotide A" as required), b) a specific partial nucleotide of a known polynucleotide, capable of distinguishing a known polynucleotide from a polynucleotide of the present invention (abbreviated as "specific partial nucleotide B" as required, or c) a partial nucleotide common to both a polynucleotide of the present invention and a known polynucleotide (abbreviated as "shared partial nucleotide" as required). For these particular partial nucleotides, there appears a motivation for preparing them or utilizing them as markers on the basis of the present inventors' findings, but without these findings, there is no motivation for preparing them or utilizing them as markers. Being partial nucleotides specific for polynucleotides encoded by brain/nerve-specific genes 1 to 10, the specific partial nucleotides A and B are abbreviated as "specific partial nucleotides of the present invention" or "specific partial nucleotides" as required.

[0130] The specific partial nucleotide A of the present invention is a partial nucleotide that is present only in a polynucleotide having a nucleic acid sequence shown by SEQ ID NO:Y and the like, and that is not present in any known polynucleotide. As examples of the specific partial nucleotide A, i) a partial nucleotide consisting of an insert nucleic acid sequence of a polynucleotide of the present invention or a partial nucleic acid sequence thereof, ii) a partial nucleotide consisting of an insert nucleic acid sequence of a polynucleotide of the present invention or a terminal partial nucleic acid sequence thereof and an adjacent nucleic acid sequence thereof, and iii) a partial nucleotide consisting of a nucleic acid sequence wherein both nucleic acid sequences present on the 5' and 3' sides relative to an insert nucleic acid sequence of a known polynucleotide are linked together, formed as a result of exon deletion, can be mentioned.

[0131] The specific partial nucleotide A of i) above consists of an insert nucleic acid sequence of a polynucleotide of the present invention or a partial nucleic acid sequence thereof. Such partial nucleic acid sequences are obvious from the disclosure herein.

[0132] The specific partial nucleotide A of ii) above consists of an insert nucleic acid sequence of a polynucleotide of the present invention or a terminal partial nucleic acid sequence thereof and an adjacent nucleic acid sequence thereof. As such terminal partial nucleic acid sequences, a nucleic acid sequence corresponding to a 5'-terminal portion in an insert nucleic acid sequence of a polynucleotide of the present invention (abbreviated as "5'-terminal partial nucleic acid sequence A" as required), and a nucleic acid sequence corresponding to a 3'-terminal portion in an insert nucleic acid sequence of a polypeptide of the present invention (abbreviated as "3'-terminal partial nucleic acid sequence A" as required) can be mentioned. As such adjacent nucleic acid sequences, a nucleic acid sequence present on the 5' side relative to an insert nucleic acid sequence of a polynucleotide of the present invention (abbreviated as "5' adjacent nucleic acid sequence A" as required), and a nucleic acid sequence present on the 3' side relative to an insert nucleic acid sequence of a polynucleotide of the present invention (abbreviated as "3' adjacent nucleic acid sequence A" as required) can be mentioned. Therefore, the specific partial nucleotide A of ii) above can be a partial nucleotide consisting of a nucleic acid sequence spanning from a specified position of the 5' adjacent nucleic acid sequence A to a specified position of an insert nucleic acid sequence of a polynucleotide of the present invention, a partial nucleotide consisting of a nucleic acid sequence spanning from a specified position of an insert nucleic acid sequence of a polynucleotide of the present invention to a specified position of the 3' adjacent nucleic acid sequence A, or a partial nucleotide consisting of a nucleic acid sequence comprising the whole insert nucleic acid sequence of a polynucleotide of the present invention, spanning from a specified position of the 5' adjacent nucleic acid sequence A to a specified position of the 3' adjacent nucleic acid sequence A. The number of nucleotide residues in the insert nucleic acid sequence (or 5'-terminal or 3'-terminal partial nucleic acid sequence A) or adjacent nucleic acid sequence (or 5'-terminal or 3'-terminal adjacent nucleic acid sequence A), contained in the specific partial nucleotide A of ii) above, is not particularly limited, as far as it is a number that ensures the specificity of the specific partial nucleotide A of ii) above; the number can be, for example, at least 3, preferably at least 4, more preferably at least 5, still more preferably at least 6, and most preferably at least 7, 8, 9 or 10. Such terminal partial nucleic acid sequences and such adjacent nucleic acid sequences are obvious from the disclosure herein.

[0133] The specific partial nucleotide A of iii) above is a partial nucleotide not present in a known polynucleotide, which nucleotide consisting of a nucleic acid sequence wherein both nucleic acid sequences present on the 5' and 3' sides relative to an insert nucleic acid sequence of a known polynucleotide are linked together (in a polynucleotide of the present invention, these nucleic acid sequences are linked together as a result of exon deletion). The number of nucleotide residues in each nucleic acid sequence present on the 5' and 3' sides relative to an insert nucleic acid sequence of a known polynucleotide, contained in the specific partial nucleotide A of iii) above, is not particularly limited, as far as it is a number that ensures the specificity of the specific partial nucleotide A of iii) above; the number can be, for example, at least 3, preferably at least 4, more preferably at least 5, still more preferably at least 6, and most preferably at least 7, 8, 9 or 10, respectively.

[0134] The specific partial nucleotide A of the present invention can be useful as, for example, a target for specifically detecting a polynucleotide of the present invention, and as a biomarker specific for the brain/nerves or specific for nerve differentiation. The specific partial nucleotide A of the present invention can also be useful in developing a substance capable of specifically recognizing a polynucleotide of the present invention, or a substance incapable of specifically recognizing a polynucleotide of the present invention, or developing a substance capable of specifically regulating the expression of a polypeptide of the present invention, or a substance incapable of specifically regulating the expression of a polypeptide of the present invention.

[0135] The specific partial nucleotide B of the present invention is a partial nucleotide that is present only in a known polynucleotide, and not present in a polynucleotide having a nucleic acid sequence shown by SEQ ID NO:X and the like. As examples of the specific partial nucleotide B, i) a partial nucleotide consisting of an insert nucleic acid sequence of a known polynucleotide or a partial nucleic acid sequence thereof, ii) a partial nucleotide consisting of an insert nucleic acid sequence of a known polynucleotide or a terminal partial nucleic acid sequence thereof and an adjacent nucleic acid sequence thereof, and iii) a partial nucleotide consisting of a nucleic acid sequence wherein both nucleic acid sequences present on the 5' and 3' sides relative to an insert nucleic acid sequence of a polynucleotide of the present invention are linked together, formed as a result of exon deletion, can be mentioned.

[0136] The specific partial nucleotide B of i) above consists of an insert nucleic acid sequence of a known polynucleotide or a partial nucleic acid sequence thereof. Such partial nucleic acid sequences are obvious from the disclosure herein.

[0137] The specific partial nucleotide B of ii) above consists of an insert nucleic acid sequence of a known polynucleotide or a terminal partial nucleic acid sequence thereof and an adjacent nucleic acid sequence thereof. As such terminal partial nucleic acid sequences, a nucleic acid sequence corresponding to a 5'-terminal portion in an insert nucleic acid sequence of a known polynucleotide (abbreviated as "5'-terminal partial nucleic acid sequence B" as required), and a nucleic acid sequence corresponding to a 3'-terminal portion in an insert nucleic acid sequence of a known polynucleotide (abbreviated as "3'-terminal partial nucleic acid sequence B" as required) can be mentioned. As such adjacent nucleic acid sequences, a nucleic acid sequence present on the 5' side relative to an insert nucleic acid sequence of a known polynucleotide (abbreviated as "5' adjacent nucleic acid sequence B" as required), and a nucleic acid sequence present on the 3' side relative to an insert nucleic acid sequence of a known polynucleotide (abbreviated as "3' adjacent nucleic acid sequence B" as required) can be mentioned. Therefore, the specific partial nucleotide B of ii) above can be a partial nucleotide consisting of a nucleic acid sequence spanning from a specified position of the 5' adjacent nucleic acid sequence B to a specified position of an insert nucleic acid sequence of a known polynucleotide, a partial nucleotide consisting of a nucleic acid sequence spanning from a specified position of an insert nucleic acid sequence of a known polynucleotide to a specified position of the 3' adjacent nucleic acid sequence B, or a partial nucleotide consisting of a nucleic acid sequence comprising the whole insert nucleic acid sequence of a known polynucleotide, spanning from a specified position of the 5' adjacent nucleic acid sequence B to a specified position of the 3' adjacent nucleic acid sequence B. The number of nucleotide residues in the insert nucleic acid sequence (or 5'-terminal or 3'-terminal partial nucleic acid sequence B) or adjacent nucleic acid sequence (or 5'-terminal or 3'-terminal adjacent nucleic acid sequence B), contained in the specific partial nucleotide B of ii) above, is not particularly limited, as far as it is a number that ensures the specificity of the specific partial nucleotide B of ii) above; the number can be, for example, at least 3, preferably at least 4, more preferably at least 5, still more preferably at least 6, and most preferably at least 7, 8, 9 or 10. Such terminal partial nucleic acid sequences and such adjacent nucleic acid sequences are obvious from the disclosure herein.

[0138] The specific partial nucleotide B of iii) above is a partial nucleotide not present in a polynucleotide of the present invention, consisting of a nucleic acid sequence wherein both nucleic acid sequences present on the 5' and 3' sides relative to an insert nucleic acid sequence of a polynucleotide of the present invention are linked together (in a known polynucleotide, these nucleic acid sequences are linked together as a result of exon deletion). The number of nucleotide residues in each nucleic acid sequence present on the 5' and 3' sides relative to an insert nucleic acid sequence of a polynucleotide of the present invention, contained in the specific partial nucleotide B of iii) above, is not particularly limited, as far as it is a number that ensures the specificity of the specific partial nucleotide B of iii) above, and the number can be, for example, at least 3, preferably at least 4, more preferably at least 5, still more preferably at least 6, and most preferably at least 7, 8, 9 or 10, respectively.

[0139] The specific partial nucleotide B of the present invention can be useful as, for example, as a target for specifically detecting a known polynucleotide, and as a biomarker specific for the brain/nerves or specific for nerve differentiation, or as a marker not specific therefor. The specific partial nucleotide B of the present invention can also be useful in developing a substance capable of specifically recognizing a known polynucleotide, or a substance incapable of specifically recognizing a known polynucleotide, or developing a substance capable of specifically regulating the expression of a known polypeptide, or a substance incapable of specifically regulating the expression of a known polypeptide.

[0140] A shared partial nucleotide of the present invention can be a nonspecific partial nucleotide that is present in both a polynucleotide of the present invention and a known polynucleotide. Such partial nucleotides are obvious from the disclosure herein. A shared partial nucleotide of the present invention can be useful as, for example, a target for comprehensively detecting both a polynucleotide of the present invention and a known polynucleotide, and as a biomarker specific for the brain/nerves or specific for nerve differentiation, or as a marker not specific therefor. A shared partial nucleotide of the present invention can also be useful in developing a substance capable of comprehensively recognizing both a polynucleotide of the present invention and a known polynucleotide, or a substance capable of comprehensively regulating the expression of both a polypeptide of the present invention and a known polypeptide.

[0141] A polynucleotide of the present invention and a partial nucleotide thereof are capable of encoding a polypeptide of the present invention or a partial peptide of the present invention. A polynucleotide of the present invention or a partial nucleotide of the present invention may be fused with a polynucleotide consisting of a heterologous nucleic acid sequence. As such heterologous nucleic acid sequences, those that encode the above-described heterologous amino acid sequences can be mentioned.

[0142] A polynucleotide of the present invention and a partial nucleotide thereof may be provided in the form of a salt. As the salt, those described above can be mentioned.

[0143] A polynucleotide of the present invention and a partial nucleotide thereof can be prepared by a method known per se. For example, the same nucleic acid sequence as a nucleic acid sequence shown by SEQ ID NO:Y, or the nucleic acid sequence Y1 or the nucleic acid sequence Y2 can be cloned using a specified tissue or cell. Moreover, substantially the same nucleic acid sequence as a nucleic acid sequence shown by SEQ ID NO:Y or the nucleic acid sequence Y1 or the nucleic acid sequence Y2 can be prepared by introducing a mutation into a polynucleotide cloned as described above. As examples of the method of mutagenesis, methods such as the synthetic oligonucleotide site-directed mutagenesis method, the gapped duplex method, a method of randomly introducing point mutations (for example, treatment with nitrous acid or sulfurous acid), the cassette mutation method, the linker scanning method, and the mismatch primer method can be mentioned.

2. Related Substances

[0144] The present invention provides a series of related substances that can be developed on the basis of a polypeptide of the present invention and a partial peptide of the present invention, and a polynucleotide of the present invention and a partial nucleotide of the present invention. The related substances of the present invention described below can be useful as, for example, pharmaceuticals. When a related substance of the present invention is a pharmaceutical, the target disease can be, for example, a disease based on a nerve cell disorder. In detail, as such diseases, Parkinson's disease, Huntington's chorea, Alzheimer's disease, ischemic cerebral diseases (e.g., cerebral stroke), epilepsy, brain trauma, motor nerve disease, multiple sclerosis, amyotrophic lateral sclerosis, diseases caused by nerve toxic disorders and the like can be mentioned.

2.1. Antisense Molecules

[0145] The present invention provides antisense molecules.

[0146] The type of the antisense molecule may be a DNA or an RNA, or may be a DNA/RNA chimera. The antisense molecule may be one having a phosphodiester bond of the natural type, or a modified nucleotide of the thiophosphate type (P.dbd.O in phosphate bond replaced with P.dbd.S), 2'-O-methyl type or the like, which are stable to degrading enzymes. Other important factors for the designing of the antisense molecule include increases in water-solubility and cell membrane permeability and the like; these can also be cleared by choosing appropriate dosage forms such as those using liposome or microspheres. The length of the antisense molecule is not particularly limited, as far as the molecule is capable of specifically hybridizing to the transcription product; the antisense molecule may be of a sequence of about 15 nucleotides for the shortest, or of a sequence complementary to the entire sequence of the transcription product for the longest. Considering the ease of synthesis, antigenicity issue and the like, for example, oligonucleotides consisting of about 15 nucleotides or more, preferably about 15 to about 100 nucleotides, and more preferably about 18 to about 50 nucleotides, can be mentioned. Furthermore, the antisense molecule may be one capable of not only inhibiting the translation of the transcription product by hybridizing thereto, but also binding to a double-stranded DNA to form a triple strand (triplex) to inhibit the transcription into mRNA.

[0147] An antisense molecule of the present invention can comprise a nucleic acid sequence complementary to a nucleic acid sequence corresponding to a partial nucleotide of the present invention (e.g., specific partial nucleotides A and B of the present invention, a shared partial nucleotide of the present invention). Therefore, an antisense molecule of the present invention can be an antisense molecule specific for a polynucleotide of the present invention, an antisense molecule specific for a known polynucleotide, or an antisense molecule common to both a polynucleotide of the present invention and a known polynucleotide. An antisense molecule of the present invention can be useful in specifically suppressing the expression of a polypeptide of the present invention or a known polypeptide, or comprehensively suppressing the expression of both a polypeptide of the present invention and a known polypeptide.

2.2. RNAi-Inducing Nucleic Acids

[0148] The present invention provides RNAi-inducing nucleic acids.

[0149] An RNAi-inducible nucleic acid refers to a polynucleotide, preferably an RNA, capable of inducing the RNA interference (RNAi) effect when transferred into cells. The RNAi effect refers to the phenomenon in which a double-stranded RNA comprising the same nucleic acid sequence as that of mRNA, or a partial sequence thereof, suppresses the expression of the mRNA. To obtain the RNAi effect, it is preferable to use, for example, a double-stranded RNA having the same nucleic acid sequence as that of a target mRNA comprising at least 20 or more continuous bases (or a partial sequence thereof). The double-stranded structure may be configured by different strands, or may be a double strand conferred by stem loop structure of a single RNA. As examples of the RNAi-inducing nucleic acid, siRNA, miRNA and the like can be mentioned, and siRNA is preferable. The siRNA is not particularly limited, as far as it can induce RNAi, and the siRNA can be, for example, 21 to 27 bases long, preferably 21 to 25 bases long.

[0150] An RNAi-inducing nucleic acid of the present invention can be a double-stranded polynucleotide configured by a sense strand consisting of a nucleic acid sequence corresponding to a partial nucleotide of the present invention (e.g., specific partial nucleotides A and B of the present invention, a shared partial nucleotide of the present invention), and an antisense strand consisting of a nucleic acid sequence complementary thereto. An RNAi-inducing nucleic acid of the present invention may also have an overhang at the 5' terminus and/or 3' terminus of one or both of the sense strand and the antisense strand. The overhang can be one formed as a result of the addition of one to several (e.g., 1, 2 or 3) bases at the 5' terminus and/or 3' terminus of the sense strand and/or antisense strand. An RNAi-inducing nucleic acid of the present invention can be an RNAi-inducing nucleic acid specific for a polynucleotide of the present invention, an RNAi-inducing nucleic acid specific for a known polynucleotide, or an RNAi-inducing nucleic acid common to both a polynucleotide of the present invention and a known polynucleotide. An RNAi-inducing nucleic acid of the present invention can be useful in specifically suppressing the expression of a polypeptide of the present invention or a known polypeptide, or comprehensively suppressing the expression of both a polypeptide of the present invention and a known polypeptide.

2.3. Aptamers

[0151] The present invention provides aptamers.

[0152] An aptamer refers to a polynucleotide having a binding activity (or inhibitory activity) on a specified target molecule. An aptamer of the present invention can be an RNA, a DNA, a modified nucleotide or a mixture thereof. An aptamer of the present invention can also be in a linear or circular form. The length of the aptamer is not particularly limited, and can normally be about 16 to about 200 nucleotides, and can be, for example, about 100 nucleotides or less, preferably about 50 nucleotides or less, and more preferably about 40 nucleotides or less. The length of an aptamer of the present invention may be, for example, about 18, about 20, about 25 or about 30 nucleotides or more. The aptamer, for increasing the bindability, stability, drug delivering quality and the like, may be one wherein a sugar residue (e.g., ribose) of each nucleotide is modified. As examples of a portion of the sugar residue modified, ones wherein the oxygen atom at the 2'-position, 3'-position and/or 4'-position of the sugar residue is replaced with another atom and the like can be mentioned. As examples of types of modifications, fluorination, O-alkylation, O-allylation, S-alkylation, S-allylation and amination can be mentioned (see, e.g., Sproat et al., (1991) Nucle. Acid. Res. 19, 733-738; Cotton et al., (1991) Nucl. Acid. Res. 19, 2629-2635). The aptamer may be one wherein a purine or pyrimidine is altered. As examples of such alterations, alteration of the 5-position pyrimidine, alteration of the 8-position purine, alteration by an exocyclic amine, substitution by 4-thiouridine, and substitution by 5-bromo or 5-iodo-uracil can be mentioned. The phosphate group contained in an aptamer of the present invention may be altered to make it resistant to nuclease and hydrolysis. For example, the phosphate group may be substituted by a thioate, a dithioate or an amidate. An aptamer can be prepared according to available reports (for example, Ellington et al., (1990) Nature, 346, 818-822; Tuerk et al., (1990) Science, 249, 505-510).

[0153] An aptamer of the present invention is capable of binding specifically to a polypeptide of the present invention or a known polypeptide, or both a polypeptide of the present invention and a known polypeptide, via a region corresponding to a partial peptide of the present invention. Therefore, an aptamer of the present invention can be an aptamer specific for a polypeptide of the present invention, an aptamer specific for a known polypeptide, or an aptamer common to both a polypeptide of the present invention and a known polypeptide. Such a specific aptamer can be prepared by, for example, selecting (a) a polynucleotide that binds to a polypeptide of the present invention or a specific partial peptide thereof, and that does not bind to a known polypeptide, (b) a polynucleotide that binds to a known polypeptide or a specific partial peptide thereof, and that does not bind to a polypeptide of the present invention, or (c) a polynucleotide that binds to both a polypeptide of the present invention and a known polypeptide or to a shared partial peptide of the present invention, by the SELEX method.

2.4. Antibodies

[0154] The present invention provides antibodies.

[0155] An antibody of the present invention may be a polyclonal antibody (antiserum) or a monoclonal antibody, and can be prepared by a commonly known immunological technique. Although the monoclonal antibody may be of any isotype, IgG, IgM, IgA, IgD, IgE, or the like, IgG or IgM is preferable.

[0156] For example, the polyclonal antibody can be acquired by administering the above-described antigen (as required, may be prepared as a complex crosslinked to a carrier protein such as bovine serum albumin or KLH ((Keyhole Limpet Hemocyanin)), along with a commercially available adjuvant (for example, Freund's complete or incomplete adjuvant), to an animal subcutaneously or intraperitoneally about 2 to 4 times at intervals of 2 to 3 weeks (the antibody titer of partially drawn serum has been determined by a known antigen-antibody reaction and its elevation has been confirmed in advance), collecting whole blood about 3 to about 10 days after final immunization, and purifying the antiserum. As the animal to receive the antigen, mammals such as rats, mice, rabbits, goat, guinea pigs, and hamsters can be mentioned.

[0157] The monoclonal antibody can also be prepared by a cell fusion method. For example, the above-described antigen, along with a commercially available adjuvant, is subcutaneously or intraperitoneally administered to a mouse 2 to 4 times, and 3 days after final administration, the spleen or lymph nodes are collected, and leukocytes are collected. These leukocytes and myeloma cells (for example, NS-1, P3X63Ag8 and the like) are cell-fused to obtain a hybridoma that produces a monoclonal antibody against the factor. This cell fusion may be performed by the PEG method or the voltage pulse method. A hybridoma that produces the desired monoclonal antibody can be selected by detecting an antibody that binds specifically to the antigen, in the culture supernatant, using a widely known EIA or RIA method and the like. Cultivation of the hybridoma that produces the monoclonal antibody can be performed in vitro, or in vivo such as in ascitic fluid of a mouse or rat, preferably a mouse, and the antibody can be acquired from the culture supernatant of the hybridoma and the ascitic fluid of the animal.

[0158] An antibody of the present invention may also be a chimeric antibody, a humanized antibody or a human antibody.

[0159] A chimeric antibody means a monoclonal antibody derived from immunoglobulins of animal species having mutually different variable regions and constant regions. For example, a chimeric antibody can be a mouse/human chimeric monoclonal antibody whose variable region is a variable region derived from a mouse immunoglobulin, and whose constant region is a constant region derived from a human immunoglobulin. The constant region derived from a human immunoglobulin has an amino acid sequence unique depending on the isotype, such as IgG, IgM, IgA, IgD, and IgE, and the constant region of a recombinant chimeric monoclonal antibody in the present invention may be the constant region of a human immunoglobulin belonging to any isotype. The constant region of human IgG is preferable.

[0160] A chimeric antibody can be prepared by a method known per se. For example, a mouse/human chimeric monoclonal antibody can be prepared according to available reports (e.g., Jikken Igaku (extra issue), Vol. 6, No. 10, 1988 and JP-B-HEI-3-73280). In detail, a chimeric antibody can be prepared by inserting the C.sub.H gene acquired from the DNA that encodes a human immunoglobulin (C gene that encodes H chain constant region) downstream of the active V.sub.H gene acquired from the DNA that encodes a mouse monoclonal antibody isolated from a hybridoma that produces the mouse monoclonal antibody (rearranged VDJ gene that encodes H chain variable region), and inserting the C.sub.L gene acquired from the DNA that encodes a human immunoglobulin (C gene that encodes L chain constant region) downstream of the active V.sub.L gene acquired from the DNA that encodes a mouse monoclonal antibody isolated from the hybridoma (rearranged VJ gene that encodes L chain variable region), in a way that allows the expression of each gene, into one or separate expression vectors, transforming a host cell with the expression vector, and culturing the transformant cell.

[0161] A humanized antibody means a monoclonal antibody prepared by a gene engineering technique, for example, a human type monoclonal antibody wherein a portion or all of the complementarity-determining region of the ultra-variable region thereof is derived from a mouse monoclonal antibody, and the framework region of the variable region thereof and the constant region thereof are derived from a human immunoglobulin. The complementarity-determining regions of the ultra-variable region are three regions that are present in the ultra-variable region in the variable region of the antibody, and that complementarily directly bind to the antigen (Complementarity-determining regions; CDR1, CDR2, CDR3), and the framework regions of the variable region are four relatively highly conserved regions locating in the front and back of the three complementarity-determining regions (Framework; FR1, FR2, FR3, FR4). In other words, a humanized antibody means, for example, a monoclonal antibody wherein all regions other than a portion or all of the complementarity-determining region of the ultra-variable region of a mouse monoclonal antibody is replaced with a corresponding region of a human immunoglobulin.

[0162] A humanized antibody can be prepared by a method known per se. For example, a recombinant humanized antibody derived from a mouse monoclonal antibody can be prepared according to available reports (e.g., Japanese Patent Application Kohyo Publication No. HEI-4-506458 and JP-A-SHO-62-296890). In detail, from a hybridoma that produces a mouse monoclonal antibody, at least one mouse H chain CDR gene and at least one mouse L chain CDR gene corresponding to the mouse H chain CDR gene are isolated, and from a human immunoglobulin gene, the human H chain gene that encodes all regions other than the human H chain CDR corresponding to the mouse H chain gene and the human L chain gene that encodes all regions other than the human L chain CDR corresponding to the mouse L chain CDR are isolated. The mouse H chain CDR gene and human H chain gene isolated are introduced into an appropriate expression vector expressibly; likewise, the mouse L chain CDR gene and the human L chain gene are introduced into another appropriate expression vector expressively. Alternatively, the mouse H chain CDR gene/human H chain gene and the mouse L chain CDR gene/human L chain gene can be introduced into the same expression vector expressively. By transforming a host cell with the expression vector thus prepared to obtain a cell that produces a humanized antibody, and culturing the cell, a desired humanized antibody can be obtained from the culture supernatant.

[0163] A human antibody means an antibody wherein all regions comprising the variable regions and constant regions of the H chain and L chain constituting an immunoglobulin are derived from the gene that encodes a human immunoglobulin.

[0164] A human antibody can be prepared by a method known per se. For example, a human antibody can be produced by immunologically sensitizing with an antigen a transgenic animal prepared by incorporating at least a human immunoglobulin gene into a gene locus of a non-human mammal such as a mouse, in the same way as the above-described method of preparing a polyclonal antibody or a monoclonal antibody. For example, a transgenic mouse that produces a human antibody can be prepared according to available reports (Nature Genetics, Vol. 15, p. 146-156, 1997; Nature Genetics, Vol. 7, p. 13-21, 1994; Japanese Patent Application Kohyo Publication No. HEI-4-504365; International Patent Application Publication WO94/25585; Nature, Vol. 368, p. 856-859, 1994; and Japanese Patent Application Kohyo Publication No. HEI-6-500233).

[0165] An antibody of the present invention can also be a portion of an antibody of the present invention described above (e.g., monoclonal antibody). As examples of such antibodies, F(ab').sub.2, Fab', Fab, and Fv fragments, and single-chain antibodies can be mentioned.

[0166] An antibody of the present invention is capable of binding specifically to a polypeptide of the present invention or a known polypeptide, or both a polypeptide of the present invention and a known polypeptide, via a region corresponding to a partial peptide of the present invention. Therefore, an antibody of the present invention can be an antibody specific for a polypeptide of the present invention, an antibody specific for a known polypeptide, or an antibody common to both a polypeptide of the present invention and a known polypeptide. Such a specific antibody can be prepared by, for example, using a specific partial peptide of a polypeptide of the present invention, a specific partial peptide of a known polypeptide, or a shared partial peptide of the present invention as an antigen.

2.5. Expression Vectors

[0167] The present invention provides expression vectors for the above-described substances.

[0168] An expression vector of the present invention can comprise a polynucleotide that encodes a desired polypeptide to be expressed or a desired polynucleotide to be expressed, and a promoter operably linked to the polynucleotide. "A promoter is operably linked to a polynucleotide" means that the promoter is bound to a polynucleotide that encodes the gene in a way such that allows the expression of the polynucleotide under the control thereof, or the expression of the polypeptide encoded by the polynucleotide.

[0169] The backbone for an expression vector of the present invention is not particularly limited, as far as it allows production of a desired substance in a specified cell; for example, plasmid vectors and viral vectors can be mentioned. When an expression vector is used as a pharmaceutical, as vectors suitable for administration to mammals, viral vectors such as adenovirus, retrovirus, adeno-associated virus, herpesvirus, vaccinia virus, poxvirus, poliovirus, Sindbis virus, and Sendai virus can be mentioned.

[0170] When a prokaryotic cell is used as the host cell, an expression vector allowing the prokaryotic cell to be utilized as the host cell can be used. Such an expression vector can comprise, for example, elements such as a promoter-operator region, an initiation codon, a polynucleotide that encods a polypeptide of the present invention or a partial peptide thereof, a stop codon, a terminator region and a replication origin. A promoter-operator region for expressing a polypeptide of the present invention in a bacterium comprises a promoter, an operator and a Shine-Dalgarno (SD) sequence. These elements may be ones known per se.

[0171] When a eukaryotic cell is used as the host cell, an expression vector allowing the eukaryotic cell to be utilized as the host cell can be used. In this case, the promoter used is not particularly limited, as far as it is capable of functioning in eukaryotic organisms such as mammals. When the expression of a polypeptide is desired, as examples of such promoters, viral promoters such as SV40-derived initial promoter, cytomegalovirus LTR, Rous sarcoma virus LTR, MoMuLV-derived LTR, and adenovirus-derived initial promoter, and mammalian constituent protein gene promoters such as .beta.-actin gene promoter, PGK gene promoter, and transferrin gene promoter, and the like can be mentioned. When the expression of a polynucleotide is desired, the promoter can be a polIII promoter (e.g., tRNA promoter, U6 promoter, H1 promoter).

[0172] An expression vector of the present invention can further comprise sites for transcription initiation and transcription termination, and a ribosome-binding site required for translation in the transcription region, a replication origin and a selection marker gene (e.g., ampicillin, tetracycline, kanamycin, spectinomycin, erythromycin, chloramphenicol) and the like. An expression vector of the present invention can be prepared by a method known per se (see, e.g., Molecular Cloning, 2.sup.nd edition, Sambrook et al., Cold Spring Harbor Lab. Press (1989)).

3. Compositions

[0173] The present invention provides compositions comprising the above-described substances.

[0174] A composition of the present invention can comprise, in addition to the above-described substances, an optionally chosen carrier, for example, a pharmaceutically acceptable carrier. As examples of the pharmaceutically acceptable carrier, excipients such as sucrose, starch, mannitol, sorbitol, lactose, glucose, cellulose, talc, calcium phosphate, and calcium carbonate, binders such as cellulose, methylcellulose, hydroxypropylcellulose, polypropylpyrrolidone, gelatin, gum arabic, polyethylene glycol, sucrose, and starch, disintegrants such as starch, carboxymethylcellulose, hydroxypropylstarch, sodium-glycol-starch, sodium hydrogen carbonate, calcium phosphate, and calcium citrate, lubricants such as magnesium stearate, Aerosil, talc, and sodium lauryl sulfate, flavoring agents such as citric acid, menthol, glycyrrhizin ammonium salt, glycine, and orange flour, preservatives such as sodium benzoate, sodium hydrogen sulfite, methylparaben, and propylparaben, stabilizing agents such as citric acid, sodium citrate, and acetic acid, suspending agents such as methyl cellulose, polyvinylpyrrolidone, and aluminum stearate, dispersing agents such as surfactants, diluents such as water, physiological saline, and orange juice, and base waxes such as cacao butter, polyethylene glycol, and kerosene, and the like can be mentioned, which, however, are not to be construed as limiting.

[0175] Preparations suitable for oral administration are liquids prepared by dissolving an effective amount of a substance in a diluent such as water, physiological saline or orange juice, capsules, saches or tablets containing an effective amount of a substance in the form of solids or granules, suspensions prepared by suspending an effective amount of a substance in an appropriate dispersant, emulsions prepared by dispersing and emulsifying a solution, an effective amount of a substance is dissolved therein, in an appropriate dispersant, and the like.

[0176] Preparations suitable for parenteral administration (for example, intravenous injection, subcutaneous injection, intramuscular injection, topical injection, intraperitoneal administration and the like) are aqueous and non-aqueous isotonic sterile injectable liquids, which may contain an antioxidant, a buffer solution, a bacteriostatic agent, an isotonizing agent and the like. Aqueous and non-aqueous sterile suspensions can also be mentioned, which may contain a suspending agent, a solubilizer, a thickening agent, a stabilizer, an antiseptic and the like. These preparations can be enclosed in containers such as ampoules and vials for unit dosage or a plurality of dosages. It is also possible to freeze-dry the active ingredient and a pharmaceutically acceptable carrier, and store the preparation in a state that may be dissolved or suspended in an appropriate sterile vehicle just before use.

[0177] Although the dosage of a composition of the present invention varies depending on the activity and kind of active ingredient, seriousness of illness, recipient animal species, the recipient's drug tolerance, body weight, age, and the like, it is normally about 0.001 to about 500 mg/kg as the amount of active ingredient per day for an adult.

[0178] A composition of the present invention enables a regulation (e.g., promotion or suppression) of the expression or a function of a polypeptide of the present invention. A composition of the present invention can be useful as, for example, a pharmaceutical (e.g., a prophylactic or therapeutic drug for a disease as described above), reagent or food.

4. Cells

[0179] The present invention provides transformants that produce a polypeptide of the present invention or a partial peptide of the present invention, cells that produce an antibody of the present invention, and cells wherein the expression or a function of a polynucleotide or polypeptide of the present invention is regulated.

4.1. Transformants

[0180] A transformant of the present invention can be a cell transformed with an expression vector of the present invention, that expresses a polypeptide of the present invention or a partial peptide of the present invention. The host cell used to prepare the transformant is not particularly limited, as far as it is compatible with the expression vector, and capable of expressing the desired polynucleotide or polypeptide and the like; for example, primary culture cells or cell lines can be mentioned. In detail, as examples of such host cells, cells of prokaryotic organisms such as Escherichia coli, bacteria of the genus Bacillus (e.g., Bacillus subtilis), and actinomyces, and cells of eukaryotic organisms, such as yeast, insect cells, bird cells, and mammalian cells (e.g., cells derived from the above-described mammals: e.g., CHO cells) can be mentioned. A transformant of the present invention can be prepared by a method known per se (see, e.g., Molecular Cloning, 2.sup.nd edition, Sambrook et al., Cold Spring Harbor Lab. Press (1989)).

[0181] Cultivation of the transformant can be performed in a nutrient medium such as a liquid medium by a method known per se. The medium preferably contains a carbon source, a nitrogen source, an inorganic substance and the like necessary for the growth of the transformant. Here, as examples of the carbon source, glucose, dextrin, soluble starch, sucrose and the like can be mentioned; as examples of the nitrogen source, inorganic or organic substances such as an ammonium salt, a nitrate salt, corn steep liquor, peptone, casein, meat extract, soybean cake, potato extract and the like can be mentioned; as examples of the inorganic substance, calcium chloride, sodium dihydrogen phosphate, magnesium chloride and the like can be mentioned. In addition, the medium may be supplemented with yeast extract, vitamins and the like. Culturing conditions, for example, temperature, medium pH and culturing time, are chosen as appropriate to allow a polypeptide of the present invention to be produced in a large amount. Culturing temperature is, for example, 30 to 37.degree. C.

4.2. Antibody Producing Cells

[0182] An antibody-producing cell of the present invention can be an optionally chosen cell that produces an antibody of the present invention. As antibody-producing cells of the present invention, the above-described hybridomas, and a transformant cell incorporating an expression vector for one of the above-described antibodies can be mentioned. When an antibody-producing cell of the present invention is a transformant cell, details of the expression vector, host cell, cell culture and the like used to prepare the transformant cell can be the same as those described above.

4.3. Cells Wherein the Expression or a Function of a Polypeptide of the Present Invention is Regulated

[0183] The present invention provides cells wherein the expression or a function of a polypeptide of the present invention is regulated.

[0184] A cell of the present invention can be an isolated and/or purified one. A cell of the present invention can be a cell derived from one of the above-described tissues, or a cell of one of the above-described kinds. A cell of the present invention can be derived from one of the above-described mammals. A cell of the present invention can be a primary culture cell or cell line, or a normal cell, or a cell derived from a mammal with one of the above-described diseases. A cell of the present invention can be a cell wherein the expression or a function of a polypeptide of the present invention is regulated specifically. A cell of the present invention can have a nerve cell-related action or nerve cell-related phenotype thereof being variable as a result of a regulation (e.g., promotion, suppression) of the expression or a function of a polypeptide of the present invention. A cell of the present invention can be a cell wherein the expression of a polypeptide of the present invention is regulated transiently, or a cell wherein the expression is regulated permanently (e.g., homozygousity- or heterozygousity-deficient cells). A cell of the present invention can also be a transformant or a non-transformant.

[0185] A cell of the present invention can be prepared by, for example, treating a cell with one of the above-described substances capable of regulating the expression or a function of a polynucleotide of the present invention or a polypeptide of the present invention (e.g., polypeptides of the present invention, antisense molecules, RNAi-inducing nucleic acids, antibodies, or expression vectors therefor). A cell of the present invention can also be prepared by isolating and/or purifying a cell from a transgenic animal or gene-deficient (so-called knockout) animal described below.

[0186] A cell wherein the expression or a function of a polypeptide of the present invention is regulated can be useful in, for example, developing a pharmaceutical (e.g., a prophylactic or therapeutic drug as described above), reagent or food, identifying a further marker gene specific for the brain/nerves or specific for nerve cell differentiation, and analyzing mechanisms associated with nerve cell differentiation. These can be performed by, for example, an expression profile analysis comprising measuring the expression profile in a cell of the present invention using a microarray, protein chip (e.g., antibody chip, or non-antibody chip such as chip manufactured by Ciphergen) and the like, and comparing the profile with the expression profile of a control cell. A cell of the present invention can also be useful as a cell model of a disease as described above.

5. Animals

[0187] The present invention provides animals wherein the expression or a function of a polypeptide of the present invention is regulated.

[0188] An animal of the present invention can be an animal with or without a genome alteration. The species of an animal of the present invention can be, for example, the same as one of the above-described non-human mammals.

[0189] In one embodiment, an animal of the present invention can be a transgenic animal with a genome alteration. A transgenic animal of the present invention is capable of expressing a polypeptide of the present invention. A transgenic animal of the present invention is also capable of expressing a polypeptide of the present invention specifically in one of the above-described cells or tissues.

[0190] A transgenic animal of the present invention can be prepared by a method known per se. In more detail, a transgenic animal of the present invention can be prepared by, for example, introducing a polynucleotide of the present invention linked operably to a specified promoter (e.g., a promoter that is non-specific or specific for one of the above-described cells or tissues) (e.g., may be in the form of an expression vector of the present invention) into a fertilized egg of an animal or another cell (e.g., unfertilized egg, spermatozoon or a progenitor cell thereof) in the initial stage of development. As examples of the method of gene introduction, the electroporation method, lipofection method, aggregation method, calcium phosphate coprecipitation method, and microinjection method can be mentioned. A transgenic animal of the present invention may be an animal prepared by mating a thus-prepared animal and another animal of the same species (e.g., animal model of a disease as described above).

[0191] In another embodiment, an animal of the present invention can be a gene-deficient animal with a genome alteration. A gene-deficient animal of the present invention is incapable of expressing a polypeptide of the present invention. A gene-deficient animal of the present invention is also incapable of expressing a polypeptide of the present invention specifically in one of the above-described cells or tissues.

[0192] A gene-deficient animal of the present invention can be prepared by a method known per se. In more detail, a gene-deficient animal of the present invention can be prepared using an embryonic stem cell (ES cell) specifically lacking a brain/nerve-specific gene. Such an ES cell can be prepared by, for example, introducing a specified targeting vector into ES cells, and selecting an ES cell showing homologous recombination from among the ES cells incorporating the targeting vector.

[0193] As a targeting vector, a targeting vector capable of inducing homologous recombination that causes specific expressional failure of a polynucleotide or polypeptide of the present invention can be used. Such a targeting vector comprises a first polynucleotide and second polynucleotide that are homologous or specifically homologous to a brain/nerve-specific gene (of the polynucleotides, at least one comprises a splicing donor signal for the brain/nerve-specific gene, and comprises a mutation that nullifies the splicing that produces at least one isoform in the signal), and, as required, a selection marker. A splicing donor signal for the brain/nerve-specific gene, and a mutation that nullifies the splicing that produces at least one isoform in the signal can be easily determined by a person skilled in the art. The first and second polynucleotides are polynucleotides having a sequence identity and length that are sufficient to produce homologous recombination in the genomic DNA associated with the brain/nerve-specific gene. The first and second polynucleotides are chosen in a way such that specific deficiency of a particular isoform is produced. As selection markers, positive selection markers (e.g., neomycin resistance gene, hygromycin B phosphotransferase (BPH) gene, blasticidin S deaminase gene, puromycin resistance gene), negative selection markers (e.g., herpes simplex virus (HSV) thymidine kinase (tk) gene, diphtheria toxin A fragment (DTA) gene) and the like can be mentioned. The targeting vector can comprise either a positive selection marker or a negative selection marker or both. The targeting vector may comprise two or more recombinase target sequences (e.g., loxP sequence, which is used in the Cre/loxP system derived from bacteriophage P1, FRT sequence, which is used in yeast-derived FLP/FRT system). The present invention also provides such a targeting vector.

[0194] As the method for introducing a targeting vector into an ES cell, a method known per se can be used. As examples of such methods, the calcium phosphate method, lipofection method/liposome method, electroporation method and the like can be mentioned. When a targeting vector is introduced into a cell, homologous recombination of the genomic DNA associated with the brain/nerve-specific gene occurs in the cell. Although an ES cell may be established by culturing an inner cell mass separated from a blastocyst of an optionally chosen animal on feeder cells, an existing ES cell may be utilized.

[0195] To select an ES cell showing homologous recombination, cells after introduction of a targeting vector are screened for. For example, after selection is performed by positive selection, negative selection and the like, screening based on genotype (for example, PCR method, Southern blot hybridization method) is performed. It is also preferable to further perform karyotype analysis on the ES cell obtained. In the karyotype analysis, the absence of chromosome aberrations in the selected ES cell is checked. Karyotype analysis can be performed by a method known per se. It is preferable that the karyotype of the ES cell be confirmed in advance before introducing the targeting vector.

[0196] A gene-deficient animal of the present invention can be prepared by transplanting to an animal a chimeric embryo obtained by introducing an ES cell obtained as described above into an embryo, and then mating the chimeric animal obtained. As examples of the embryo, blastocysts, 8-cell stage embryos and the like can be mentioned. The embryo can be obtained by mating a female animal undergoing an overovulation treatment with a hormone preparation (for example, PMSG, which has FSH-like action, and hCG, which has LH action, are used) and the like with a male animal, and the like. As methods of introducing an ES cell into an embryo, the micromanipulation method, aggregation method and the like can be mentioned.

[0197] The animal receiving a chimeric embryo transplanted is preferably a pseudo-pregnant animal. A pseudo-pregnant animal can be obtained by mating a female animal in the normal sexual cycle with a male animal emasculated by vasoligation and the like. The animal incorporating the chimeric embryo becomes pregnant and delivers a chimeric animal. Next, it is determined whether or not the animal born is a chimeric animal. Whether or not the animal born is a chimeric animal can be determined by a method known per se, for example, by the body color or coat color. For the determination, a DNA may be extracted from a portion of the body and subjected to Southern blot analysis or PCR assay. The mating can be performed preferably between a wild-type animal and a chimeric animal, or between chimeric animals. Whether or not the deficiency of the brain/nerve-specific gene has been introduced into the germ cell line of the chimeric animal and heterozygous offspring lacking the brain/nerve-specific gene has been obtained can be determined by a method known per se with various characters as indexes; for example, this can be determined by the body color or coat color of the offspring animal. For the determination, a DNA may be extracted from a portion of the body and subjected to Southern blot analysis or PCR assay. Furthermore, by mating thus-obtained heterozygotes, a homozygote can be prepared. A gene-deficient animal of the present invention may also be an animal prepared by mating an animal thus prepared and another animal of the same species (e.g., animal model of disease based on nerve cell disorder, transgenic animal).

[0198] In a still another embodiment, an animal of the present invention can be an animal without a genome alteration. Such an animal can be prepared by treating an animal with one of the above-described substances capable of regulating the expression or a function of a polynucleotide of the present invention or a polypeptide of the present invention (e.g., polypeptides of the present invention, antisense molecules, RNAi-inducing nucleic acids, antibodies, or expression vectors therefor). Such an animal can also be an animal capable or incapable of expressing a polypeptide of the present invention specifically in one of the above-described tissues by topical treatment. The animal treatment can be performed using a method mentioned with respect to a composition of the present invention.

[0199] An animal of the present invention can be useful in, for example, developing a pharmaceutical (e.g., a prophylactic or therapeutic drug as described above), reagent or food, identifying a further marker gene specific for the brain/nerves or specific for nerve cell differentiation, and analyzing mechanisms associated with nerve cell differentiation. These can be performed by, for example, an expression profile analysis comprising measuring an expression profile (particularly expression profile of a nerve cell or a tissue in the brain) using a microarray, protein chip (e.g., antibody chip, or non-antibody chip such as a chip manufactured by Ciphergen) and the like in an animal of the present invention, and comparing the profile with the expression profile of a control animal. An animal of the present invention can also be useful as an animal model of a disease as described above.

6. Measuring Means and Measuring Method

[0200] The present invention provides measuring means (e.g., primer set, nucleic acid probe, antibody, aptamer) and measuring methods for target polynucleotides and polypeptides.

6.1. Primer Set and Method of Use Thereof

[0201] A primer set of the present invention can be used for specific detection and quantitation of a polynucleotide of the present invention or a known polynucleotide, or comprehensive detection and quantitation of both a polynucleotide of the present invention and a known polynucleotide. For example, such detection and quantitation can be achieved, after preparing total RNA from a biological sample, by utilizing a method of gene amplification such as a PCR (e.g., RT-PCR, real-time PCR, quantitative PCR), LAMP (Loop-mediated isothermal amplification) (see, e.g., WO00/28082), or ICAN (Isothermal and Chimeric primer-initiated Amplification of Nucleic acids) (see, e.g., WO00/56877). Because the number of primers required differs depending on the kind of the method of gene amplification, the number of primers is not particularly limited; for example, a primer set of the present invention can comprise two or more primers constituted by a sense and antisense primer. The two or more primers may be mixed in advance or not. Each of the sense and antisense primers is not particularly limited, as far as it is of a size enabling specific amplification of the target region; each primer consists of 12 (for example, at least about 15, preferably at least about 18, more preferably at least about 20 and the like) consecutive nucleotide residues. The sense and antisense primer, when the size of the polynucleotide amplified thereby is to be visually detected, can be designed to allow it to be visually detectable. The visually detectable size is not particularly limited, and can be, for example, at least about 50, preferably at least 70, more preferably at least about 100, still more preferably at least about 150, and most preferably at least about 200, about 300, about 400, about 500 or more nucleotide residues long. The sense and antisense primer do not require that the polynucleotide amplified thereby be visually detected, and may be detected by a fluorescence signal and the like, as is commonly used in real-time PCR.

[0202] A primer set of the present invention can be a) a primer set specific for a polynucleotide of the present invention, capable of distinguishing a polynucleotide of the present invention from a known polynucleotide (abbreviated as "specific primer set A" as required), b) a primer set specific for a known polynucleotide, capable of distinguishing a known polynucleotide from a polynucleotide of the present invention (abbreviated as "specific primer set B" as required), or c) a primer set common to both a polynucleotide of the present invention and a known polynucleotide (abbreviated as "shared primer set" as required) wherein a polynucleotide of the present invention and a known polynucleotide do not distinguish each other.

[0203] The specific primer set A of the present invention can comprise i) a sense and antisense primer designed to make it possible to distinguish the size of the polynucleotide of the present invention or partial nucleotide thereof to be amplified from the size of the known polynucleotide or partial nucleotide thereof to be amplified, or ii) a sense and antisense primer designed to allow a polynucleotide of the present invention or a partial nucleotide thereof alone to be amplified, and not to allow a known polynucleotide to be amplified.

[0204] The sense and antisense primers of i) above are preferably, for example, a) a sense primer corresponding to a nucleic acid sequence present on the 5' side relative to the nucleic acid sequence of the above-described specific partial nucleotide A (particularly an insert nucleic acid sequence of a polynucleotide of the present invention), and an antisense primer corresponding to a nucleic acid sequence complementary to a nucleic acid sequence present on the 3' side relative to the nucleic acid sequence, or b) a sense primer corresponding to a nucleic acid sequence present on the 5' side relative to the nucleic acid sequence of the above-described specific partial nucleotide B (particularly an insert nucleic acid sequence of a known polynucleotide), and an antisense primer corresponding to a nucleic acid sequence complementary to a nucleic acid sequence present on the 3' side relative to the nucleic acid sequence.

[0205] The sense and antisense primers of ii) above are preferably, for example, a) a sense primer corresponding to the nucleic acid sequence of the above-described specific partial nucleotide A (particularly an insert nucleic acid sequence of a polynucleotide of the present invention), and a specified antisense primer, b) a specified sense primer, and a sense primer corresponding to the nucleic acid sequence of the above-described specific partial nucleotide A (particularly an insert nucleic acid sequence of a polynucleotide of the present invention), or c) a sense and antisense primer corresponding to the nucleic acid sequence of the above-described specific partial nucleotide A (particularly an insert nucleic acid sequence of a polynucleotide of the present invention).

[0206] The specific primer set B of the present invention can comprise i) a sense and antisense primer designed to make it possible to distinguish the size of the known polynucleotide or partial nucleotide thereof to be amplified from the size of the polynucleotide of the present invention or partial nucleotide thereof to be amplified, or ii) a sense and antisense primer designed to allow a known polynucleotide or a partial nucleotide thereof alone to be amplified, and not to allow a polynucleotide of the present invention to be amplified.

[0207] The sense and antisense primers of i) above are preferably, for example, a) a sense primer corresponding to a nucleic acid sequence present on the 5' side relative to the nucleic acid sequence of the above-described specific partial nucleotide B (particularly an insert nucleic acid sequence of a known polynucleotide), and an antisense primer corresponding to a nucleic acid sequence complementary to a nucleic acid sequence present on the 3' side relative to the nucleic acid sequence, or b) a sense primer corresponding to a nucleic acid sequence present on the 5' side relative to the nucleic acid sequence of the above-described specific partial nucleotide A (particularly an insert nucleic acid sequence of a polynucleotide of the present invention), and an antisense primer corresponding to a nucleic acid sequence complementary to a nucleic acid sequence present on the 3' side relative to the nucleic acid sequence.

[0208] The sense and antisense primers of ii) above are preferably, for example, a) a sense primer corresponding to the nucleic acid sequence of the above-described specific partial nucleotide B (particularly an insert nucleic acid sequence of a known polynucleotide), and a specified antisense primer, b) a specified sense primer, and a sense primer corresponding to the nucleic acid sequence of the above-described specific partial nucleotide B (particularly an insert nucleic acid sequence of a known polynucleotide), or c) a sense and antisense primer corresponding to the nucleic acid sequence of the above-described specific partial nucleotide B (particularly an insert nucleic acid sequence of a known polynucleotide).

[0209] A shared primer set of the present invention can comprise a sense and antisense primer designed to equalize the size of the known polynucleotide or partial nucleotide thereof to be amplified to the size of the polynucleotide of the present invention or partial nucleotide thereof to be amplified. Such a sense and antisense primer are preferably, for example, a sense and antisense primer designed not to allow the polynucleotide of the present invention or partial nucleotide thereof to be amplified, and the known polynucleotide or partial nucleotide thereof to be amplified, to comprise the nucleic acid sequences of the above-described specific partial nucleotides A and B.

6.2. Nucleic Acid Probe and Method of Use Thereof

[0210] A nucleic acid probe of the present invention can be used for specific detection and quantitation of a polynucleotide of the present invention or a known polynucleotide, or comprehensive detection and quantitation of both a polynucleotide of the present invention and a known polynucleotide. For example, such a detection and quantitation can be achieved, after preparing total RNA from a biological sample, by utilizing Northern blotting, a nucleic acid array wherein a nucleic acid probe of the present invention is immobilized, and the like. Although the nucleic acid probe can be a DNA, an RNA, a modified nucleic acid or a chimeric molecule thereof and the like, a DNA is preferable in consideration of safety, convenience and the like. The nucleic acid probe may also be any one of a single-stranded or a double-stranded polynucleotide. The size of the nucleic acid probe is not particularly limited, as far as it is capable of specifically hybridizing to the transcription product of the target gene; the size is, for example, at least about 15 or 16, preferably about 15 to about 1000, more preferably about 20 to about 500, and still more preferably about 25 to about 300. When a nucleic acid probe of the present invention is a single-stranded polynucleotide, the nucleic acid probe of the present invention can be the same as an antisense molecule of the present invention. When a nucleic acid probe of the present invention is a double-stranded polynucleotide, the nucleic acid probe of the present invention can be configured by an antisense molecule of the present invention and a polynucleotide molecule complementary thereto.

[0211] A nucleic acid probe of the present invention can be a) a nucleic acid probe specific for a polynucleotide of the present invention, capable of distinguishing a polynucleotide of the present invention from a known polynucleotide (abbreviated as "specific nucleic acid probe A" as required), b) a nucleic acid probe specific for a known polynucleotide, capable of distinguishing a known polynucleotide from a polynucleotide of the present invention (abbreviated as "specific nucleic acid probe B" as required), or c) a nucleic acid probe common to both a polynucleotide of the present invention and a known polynucleotide, wherein a polynucleotide of the present invention and a known polynucleotide do not distinguish each other (abbreviated as "shared nucleic acid probe" as required).

[0212] The specific nucleic acid probe A of the present invention can be a polynucleotide having a nucleic acid sequence complementary to the nucleic acid sequence of the above-described specific partial nucleotide A (particularly an insert nucleic acid sequence of a polynucleotide of the present invention) (a single-stranded polynucleotide), or a polynucleotide having the nucleic acid sequence of the above-described specific partial nucleotide A (particularly an insert nucleic acid sequence of a polynucleotide of the present invention) and a nucleic acid sequence complementary to the nucleic acid sequence (a double-stranded polynucleotide).

[0213] The specific nucleic acid probe B of the present invention can be a polynucleotide having a nucleic acid sequence complementary to the nucleic acid sequence of the above-described specific partial nucleotide B (particularly an insert nucleic acid sequence of a known polynucleotide) (a single-stranded polynucleotide), or a polynucleotide having the nucleic acid sequence of the above-described specific partial nucleotide B (particularly an insert nucleic acid sequence of a known polynucleotide) and a nucleic acid sequence complementary to the nucleic acid sequence (a double-stranded polynucleotide).

[0214] A shared nucleic acid probe of the present invention can be a polynucleotide having a nucleic acid sequence complementary to the nucleic acid sequence of the above-described shared partial nucleotide (a single-stranded polynucleotide), or a polynucleotide having a nucleic acid sequence complementary to the nucleic acid sequence of the above-described shared partial nucleotide and the nucleic acid sequence (a double-stranded polynucleotide).

[0215] A nucleic acid probe of the present invention may be provided in a state immobilized on a support (i.e., as an array). The support for such a nucleic acid array is not particularly limited, as far as it is a support in common use in the art; for example, membranes (e.g., nylon membranes), glass, plastics, metals, plates and the like can be mentioned. A nucleic acid array in the present invention can assume a form known per se; for example, an array wherein a nucleic acid is directly synthesized on a support (so-called affimetrix type), an array wherein a nucleic acid is immobilized on a support (so-called Stanford type), fiber-type array, and electrochemical array (ECA) can be mentioned.

6.3. Antibodies and Aptamers and Method of Use Thereof

[0216] An antibody and aptamer of the present invention can be used for specific detection and quantitation of a polypeptide of the present invention, a known polypeptide, or both a polypeptide of the present invention and a known polypeptide. For example, such a detection and quantitation can be achieved, after preparing an extract from a biological sample, or using a biological sample, by an immunological technique or an affinity-based method. As examples of such immunological techniques, enzyme immunoassay (EIA) (e.g., direct competitive ELISA, indirect competitive ELISA, sandwich ELISA), radioimmunoassay (RIA), fluorescent immunoassay (FIA), immunochromatography, luminescence immunoassay, spin immunoassay, Western blotting, and immunohistochemical staining can be mentioned. An affinity-based method can be performed in accordance with one of the above-described immunological techniques. The antibody and aptamer used for a measurement of a polypeptide of the present invention, a known polypeptide, or both a polypeptide of the present invention and a known polypeptide can be the same as the above-described antibody and aptamer of the present invention.

[0217] An antibody and aptamer of the present invention can be a) an antibody and aptamer specific for a polypeptide of the present invention, that make it possible to distinguish a polypeptide of the present invention from a known polypeptide (abbreviated as "specific antibody and aptamer A" as required), b) an antibody and aptamer specific for a known polypeptide, that make it possible to distinguish a known polypeptide from a polypeptide of the present invention (abbreviated as "specific antibody and aptamer B" as required), or c) an antibody and an aptamer common to both a polypeptide of the present invention and a known polypeptide, that do not distinguish between a polypeptide of the present invention and a known polypeptide (abbreviated as "shared antibody and aptamer" as required). The specific antibody and aptamer A of the present invention are capable of binding to the above-described specific partial peptide A (particularly a partial peptide consisting of an insert amino acid sequence of a polypeptide of the present invention). The specific antibody and aptamer B of the present invention are capable of binding to the above-described specific partial peptide B (particularly a partial peptide consisting of an insert amino acid sequence of a known polypeptide). A shared antibody and aptamer of the present invention are capable of binding to the above-described shared partial peptide.

[0218] An antibody and aptamer of the present invention may be provided in a form immobilized on a support (i.e., as an array). The support for such a nucleic acid array is not particularly limited, as far as it is a support in common use in the art; for example, membranes (e.g., nitrocellulose membranes), glass, plastics, metals, and plates (e.g., multiwell plates) can be mentioned.

6.4. Supplementary Matters Concerning Measuring Means of the Present Invention

[0219] A measuring means of the present invention can be provided in a form labeled with a labeling substance as required. As examples of the labeling substance, fluorescent substances such as FITC and FAM, luminescent substances such as luminol, luciferin and lucigenin, radioisotopes such as .sup.3H, .sup.14C, .sup.32P, .sup.35S, and .sup.123I, affinity substances such as biotin and streptavidin, and the like can be mentioned.

[0220] A measuring means of the present invention may be provided in the form of a kit comprising an additional constituent, in addition to the measuring means. In this case, the various constituents contained in the kit can be provided in mutually isolated forms, for example, in forms housed in different containers. For example, when the measuring means is not labeled with a labeling substance, the kit can further comprise a labeling substance. A kit of the present invention can comprise two or more measuring means for two or more target genes (e.g., a combination of a brain/nerve-specific gene and a known gene, a combination of two or more brain/nerve-specific genes). When the measuring means of the present invention is provided in the form of an array, the array of the present invention can be one wherein two or more measuring means for two or more target genes are immobilized. A kit and array of to the present invention can also comprise a measuring means as described above with respect to a housekeeping gene (e.g., GAPDH, .beta.-actin).

6.5. Measuring Methods of the Present Invention

[0221] The present invention also provides a method of detecting or quantifying a target polypeptide or polynucleotide using a measuring means of the present invention.

[0222] A measurement of a target polynucleotide and polypeptide can be properly performed according to the kind of the measuring means by the above-described method.

[0223] In a method of the present invention, the expression level of a target polynucleotide or polypeptide in a biological sample obtained from one of the above-described mammals (e.g., human) or a culture (e.g., cell or tissue culture) can be measured. The biological sample is not particularly limited, as far as it is, for example, a sample containing a cell or tissue expressing the target polynucleotide or polynucleotide, or, if the target polynucleotide or polypeptide is secreted or oozed or the like, an animal-derived sample (e.g., blood, plasma, serum, saliva, cerebrospinal fluid, tear, urine) containing the polynucleotide or polypeptide secreted or oozed or the like. The biological sample can be one containing one of the above-described cells or tissues (e.g., nerve cell or a tissue in the brain). The biological sample used in the present invention, unless otherwise specified, can be a biological sample collected from a mammal in advance; in a particular aspect, a method of the present invention can comprise collecting a biological sample from a mammal.

[0224] In one embodiment, a method of the present invention can be utilized to identify a nerve cell, to determine a nerve cell differentiation state, or to diagnose a disease based on a nerve cell disorder (e.g., determination of onset or likelihood of onset). This method can comprise measuring the expression level of a target polynucleotide or polypeptide in a biological sample collected from an animal, and evaluating the onset or likelihood of onset of a target disease on the basis of the measured expression level or relative expression rate. For example, the measured expression level or relative expression rate is compared with the expression level in a mammal not suffering the target disease (e.g., normal animal). The expression level or expression rate in a mammal not suffering the target disease can be determined by a method known per se. By such a comparison, it is determined whether or not the animal possibly has the target disease, or whether or not the animal is likely to suffer the disease. It is known that in a mammal having a particular disease manifested, an expressional change in the gene associated with the disease is often observed. It is also known that before the onset of a particular disease, an expressional change in a particular gene is often observed. Therefore, by such an analysis, it is possible to determine the onset or likelihood of onset of a target disease. Such a method can be useful in, for example, conveniently determining and early detecting a target disease. Of course, a measuring means of the present invention and a reagent or kit of the present invention can also be utilized for such a determination.

[0225] In detail, the changes in the expression profiles of the brain/nerve-specific genes 1 to 10 in nerve cells or tissues in the brain are as described in Examples. Therefore, using a measuring means of the present invention that enables a specific measurement of a polynucleotide of the present invention and a partial nucleotide of the present invention (e.g., specific partial nucleotide A of the present invention, specific partial nucleotide B of the present invention, shared partial nucleotide of the present invention), and a polypeptide of the present invention and a partial peptide of the present invention (e.g., specific partial peptide A of the present invention, specific partial peptide B of the present invention, shared partial peptide of the present invention), by evaluating the degree of the expression of the brain/nerve-specific genes 1 to 10 and/or relative expression ratios thereof, it is possible to identify a nerve cell, to determine a differentiation state of a nerve cell, or to diagnose a disease based on a nerve cell disorder.

[0226] In another embodiment, a method of the present invention can be utilized for screening for a pharmaceutical, reagent or food and the like. For example, in one methodology, the screening method can comprise determining whether or not a test material is capable of regulating (e.g., increasing or decreasing) the number of nerve cells. Because the number of nerve cells and the expression level of a brain/nerve-specific gene can correlate with each other, such a screening can be performed by measuring the expression level of the brain/nerve-specific gene. In another methodology, the screening method can comprise determining whether or not a test material is capable of regulating the expression or a function of a target polynucleotide or polypeptide. Such a screening method can be utilized as, for example, a screening method for a pharmaceutical effective for a specified disease (e.g., disease based on a nerve cell disorder) and the like, comprising selecting a test substance capable of regulating the expression or a function of a target, and a screening method for a pharmaceutical with a decreased specified action (e.g., adverse reactions such as nerve cell differentiation regulatory action) and the like, comprising selecting a test substance incapable of regulating the expression or a function of a target. The test material subjected to the screening method can be a commonly known compound or a novel compound or a composition; as examples, nucleic acids, glucides, lipids, proteins, peptides, organic low molecular compounds, compound libraries prepared using combinatorial chemistry technology, random peptide libraries prepared by solid phase synthesis or the phage display method, or naturally occurring ingredients derived from microorganisms, animals, plants, marine organisms and the like, existing pharmaceuticals, reagents or foods and the like can be mentioned. In the screening method, mammals, cells and tissues (e.g., nerve cell and a tissue in the brain), or reconstitution systems (non-cell systems) as described above can be used. Pharmaceuticals and the like obtained by the screening method are also provided by the present invention.

[0227] The disclosures in all publications mentioned herein, including patents and patent application specifications, are incorporated herein by reference to the extent that all of them have been given expressly.

EXAMPLES

[0228] The present invention is hereinafter described in further detail with reference to Examples; however, the present invention is not limited to the Examples and the like by any means.

Example 1

Preparation and Sequence Analysis of Human cDNA Libraries

[0229] (1) Preparation and Sequence Analysis of cDNA Libraries by the Improved Oligocap method 1) Extraction and Purchase of mRNAs

[0230] From human tissues (shown below), by a method described in a literature document (J. Sambrook, E. F. Fritsch & T. Maniatis, Molecular Cloning Second edition, Cold Spring harbor Laboratory Press, 1989), mRNAs were extracted as total RNAs. After cultivation of cultured human cells or primary culture human cells (shown below) by the methods described in the catalogues thereof, mRNAs were extracted as total RNAs by a method described in a literature document (J. Sambrook, E. F. Fritsch & T. Maniatis, Molecular Cloning Second edition, Cold Spring harbor Laboratory Press, 1989).

[0231] Hereinafter, the relationships between the names of libraries and the derivations thereof are shown in the order of "name of library: derivation". If a library was generated by subtraction, how to generate the subtraction library is also shown.

<Extraction of mRNAs from Human Tissues> NTONG: Tongue; CTONG: Tongue, Tumor; FCBBF: Brain, Fetal; OCBBF: Brain, Fetal; PLACE: Placenta; SYNOV: Synovial membrane tissue from rheumatioid arthritis; CORDB: Cord blood. <Extraction of mRNAs from Cultured Cells> BNGH4: H4 cell (ATCC #HTB-148); IMR32: IMR32 cell (ATCC #CCL-127); SKNMC: SK-N-MC cell (ATCC #HTB-10); 3NB69: NB69 cell (RCB #RCB0480); BGGI1: GI1 cell (RCB #RCB0763); NB9N4: NB9 cell (RCB #RCB0477); SKNSH: SK-N-SH cell (RCB #RCB0426); AHMSC: HMSC cell (Human mesenchymal cell); CHONS: Chondrocyte; ERLTF: TF-1 cell (erythroleukemia); HELAC: HeLa cell; JCMLC: leukemia cell (Leukemia, myelogenous); MESTC: Mesenchyme stem cell; N1ESE: Mesenchymal stem cell; NCRRM: Embryonal carcinoma; NCRRP: Embryonal carcinoma treated with retinoic acid (RA) to induce differentiation; T1ESE: Mesenchymal stem cell treated with trichostatin and 5-azacytidine to induce differentiation; NT2RM: NT2 cell (STARATAGENE #204101); NT2RP: NT2 cell treated with retinoic acid (RA) to induce differentiation for 5 weeks; NT2RI: NT2 cell treated with RA to induce differentiation for 5 weeks, and thereafter treated with a growth inhibitor for 2 weeks; NT2NE: NT2 cell treated with RA and treated with a growth inhibitor to induce nerve differentiation, followed by nerve concentration and recovery (NT2 Neuron); NTISM: a library generated by subtracting cDNAs that overlap with the mRNA of undifferentiated NT2 cells from a cDNA library prepared from an mRNA of NT2 cell (STARATAGENE #204101) treated with RA to induce differentiation for 5 weeks, and thereafter treated with a growth inhibitor for 2 weeks, using Subtract Kit (Invitrogen #K4320-01) (NT2RI-NT2RM). RCB indicates that the cell line was supplied by the RIKEN Gene Bank--Cell Development Bank, and ATCC indicates that the cell line was supplied by the American Type Culture Collection. <Extraction of mRNAs from Primary Culture Cells> ASTRO: Normal Human Astrocyte NHA5732, Takara Shuzo #CC2565; DFNES: Normal Human Dermal Fibroblasts (Neonatal Skin); NHDF-Neo) NHDF2564, Takara Shuzo #CC2509; MESAN: Normal human mesangial cells NHMC56046-2, Takara Shuzo #CC2559; NHNPC: Normal human neural progenitor cells NHNP5958, Takara Shuzo #CC2599; PEBLM: Human peripheral blood mononuclear cells HPBMC5939, Takara Shuzo #CC2702; HSYRA: HS-RA (Human synoviocytes from rheumatioid arthritis), Toyobo #T404K-05; PUAEN: Human pulmonary artery endothelial cells, Toyobo #T302K-05; UMVEN: Human umbilical vein endothelial cells HUVEC, Toyobo #T200K-05; HCASM: HCASMC (Human coronary artery smooth muscle cells), Toyobo #T305K-05; HCHON: HC (Human Chondrocytes), Toyobo #T402K-05; HHDPC: HDPC (Human dermal papilla cells), Toyobo #THPCK-001; CD34C: CD34+ cell (AllCells, LLC #CB14435M); D3OST: CD34+ cells treated with osteoclast differentiation factor (ODF) to induce differentiation for 3 days; D6OST: CD34+ cells treated with an ODF to induce differentiation for 6 days; D9OST: CD34+ cells treated with ODF to induce differentiation for 9 days; ACTVT: activated T-cell; LYMPB: Lymphoblast, EB virus transferred B cell; NETRP: Neutrophil.

[0232] Next, mRNAs extracted as total RNAs from the human tissues shown below were purchased. Hereinafter, the relationships between the names of libraries and the derivations thereof are shown in the order of "name of library: derivation". If a library was generated by subtraction, how to generate the subtraction library is also shown.

<mRNAs from Human Tissues Purchased as Total RNAs> ADRGL: Adrenal gland, CLONTECH #64016-1; BRACE: Brain, cerebellum, CLONTECH #64035-1; BRAWH: Brain, whole, CLONTECH #64020-1; FEBRA: Brain, Fetal, CLONTECH #64019-1; FELIV: Liver, Fetal, CLONTECH #64018-1; HEART: Heart, CLONTECH #64025-1; HLUNG: Lung, CLONTECH #64023-1; KIDNE: Kidney, CLONTECH #64030-1; LIVER: Liver, CLONTECH #64022-1; MAMGL: Mammary Gland, CLONTECH #64037-1; PANCR: Pancreas, CLONTECH #64031-1; PROST: Prostate, CLONTECH #64038-1; SALGL: Salivary Gland, CLONTECH #64026-1; SKMUS: Skeletal Muscle, CLONTECH #64033-1; SMINT: Small Intestine, CLONTECH #64039-1; SPLEN: Spleen, CLONTECH #64034-1; STOMA: Stomach, CLONTECH #64090-1; TBAES: Breast, Tumor, CLONTECH #64015-1; TCERX: Cervix, Tumor, CLONTECH #64010-1; TCOLN: Colon, Tumor, CLONTECH #64014-1; TESTI: Testis, CLONTECH #64027-1; THYMU: Thymus, CLONTECH #64028-1; TLUNG: Lung, Tumor, CLONTECH #64013-1; TOVAR: Ovary, Tumor, CLONTECH #64011-1; TRACH: Trachea, CLONTECH #64091-1; TUTER: Uterus, Tumor, CLONTECH #64008-1; UTERU: Uterus, CLONTECH #64029-1; ADIPS: Adipose, Invitrogen #D6005-01; BLADE: Bladder, Invitrogen #D6020-01; BRALZ: Brain, cortex, Alzheimer, Invitrogen #D6830-01; CERVX: Cervix, Invitrogen #D6047-01; COLON: Colon, Invitrogen #D6050-0; NESOP: Esophagus, Invitrogen #D6060-01; PERIC: Pericardium, Invitrogen #D6105-01; RECTM: Rectum, Invitrogen #D6110-01; TESOP: Esophageal, Tumor, Invitrogen #D6860-01; TKIDN: Kidney, Tumor, Invitrogen #D6870-01; TLIVE: Liver, Tumor, Invitrogen #D6880-01; TSTOM: Stomach, Tumor, Invitrogen #D6920-01; BEAST: Adult Breast, STARATAGENE #735044; FEHRT: Heart, Fetal, STARATAGENE #738012; FEKID: Kidney, Fetal, STARATAGENE #738014; FELNG: Lung, Fetal, STARATAGENE #738020; NOVAR: Adult Ovary, STARATAGENE #735260; BRASW: a library generated by subtracting cDNAs that overlap with the mRNA of BRAWH (Brain, whole, CLONTECH #64020-1) from a cDNA library prepared from the mRNA of BRALZ (Brain, cortex, Alzheimer, Invitrogen #D6830-01), using Subtract Kit (Invitrogen #K4320-01) (BRALZ-BRAWH).

[0233] Furthermore, mRNAs extracted and purified as polyA(+) RNAs from the human tissues shown below were purchased. From an RNA prepared by mixing polyA(+) RNA derived from each tissue with polyA(-) RNA, a cDNA library was prepared. The polyA(-) RNA was prepared by removing the polyA(+) RNA from the total RNA of Brain, whole, CLONTECH #64020-1 by means of oligo dT cellulose. Hereinafter, the relationships between the names of libraries and the derivations thereof are shown in the order of "name of library: derivation".

<mRNAs from Human Tissues Purchased as PolyA(+) RNAs> BRAMY: Brain, amygdala, CLONTECH #6574-1; BRCAN: Brain, caudate nucleus, CLONTECH #6575-1; BRCOC: Brain, corpus callosum, CLONTECH #6577-1; BRHIP: Brain, hippocampus, CLONTECH #6578-1; BRSSN: Brain, substantia nigra, CLONTECH #6580-1; BRSTN: Brain, subthalamic nucleus, CLONTECH #6581-1; BRTHA: Brain, thalamus, CLONTECH #6582-1. 2) Preparation of cDNA Libraries by the Improved Oligocap Method

[0234] From each RNA, by a method (WO 01/04286) developed by improving the oligocap method [M. Maruyama and S. Sugano, Gene, 138: 171-174 (1994)], a cDNA library was prepared. Using an Oligo-cap linker (SEQ ID NO:1) and an Oligo dT primer (SEQ ID NO:2), as described in WO 01/04286, BAP (Bacterial Alkaline Phosphatase) treatment, TAP (Tobacco Acid Pyrophosphatase) treatment, RNA ligation, synthesis of first strand cDNA and removal of RNA were performed. Next, using 5' (SEQ ID NO:3) and 3' (SEQ ID NO:4) PCR primers, by PCR (polymerase chain reaction), the first strand cDNA was converted to a double-stranded cDNA, and cleaved with SfiI. Next, the cDNA fragment, usually fractionated into 2 kb or more (3 kb or more as the case may be), was cloned into the vector pME18SFL3 (GenBank AB009864, Expression vector), previously cleaved with DraIII, in a determined orientation of the cDNA, whereby a cDNA library was prepared.

[0235] The relationships between the names of the cDNA libraries used for 5'-terminal sequence analysis of the cDNAs and the derivations thereof are shown in Tables 1-1 to 1-6. The number of the 5'-terminal sequences of the cDNAs in each cDNA library after mapping onto the human genome are also shown in Table 1.

3) 5'-terminal sequence analysis of cDNAs from cDNA Libraries Prepared by the Improved Oligocap Method

[0236] The 5'-terminal nucleic acid sequences of cDNAs acquired from each cDNA library, after a sequencing reaction using a DNA sequencing reagent (BigDye Terminator Cycle Sequencing FS Ready Reaction Kit, manufactured by PE Biosystems) according to the manual, were analyzed using a DNA sequencer (ABI PRISM 3700, manufactured by PE Biosystems). For the data obtained, a database was constructed. The 5'-terminus full-length rate of each cDNA library prepared by the improved oligocap method was 90% on average, being a high full-length rate (calculated with the protein coding region of a known mRNA as an index).

4) Full-Length cDNA Nucleic Acid Analysis

[0237] For cDNAs selected for full-length cDNA nucleic acid analysis, the nucleic acid sequence of each full-length cDNA was determined. The nucleic acid sequences were determined mainly by a primer walking method based on the dideoxy terminator method using a custom-synthesized DNA primer. Specifically, a sequencing reaction was performed using a custom-synthesized DNA primer with a DNA sequencing reagent manufactured by PE Biosystem as directed in the manual, after which the DNA nucleic acid sequence was analyzed using a sequencer manufactured by the same company. The full-length nucleic acid sequence was finally established by completely overlapping the partial nucleic acid sequences determined by the above-described method. Next, the region of translation into protein was predicted from the determined full-length cDNA nucleic acid sequence, and the amino acid sequence was determined.

(2) Preparation of cDNA Libraries by the Oligocap Method and Sequence Analysis 1) Preparation of cDNA libraries by the oligocap method

[0238] Being human fetal testis derived teratocarcinoma cells, NT-2 neuronal precursor cells (purchased from Stratagene), which can be differentiated into nerve cells by retinoic acid treatment, were used after being treated per the attached manual as follows. [0239] NT-2 cells cultured without differentiation induction with retinoic acid (NT2RM) [0240] NT-2 cells cultured, followed by differentiation induction by the addition of retinoic acid, then cultured for 2 days and 2 weeks (NT2RP)

[0241] Cultured human cell SK-N-MC (ATCC HTB-10) (SKNMC), cultured human cell Y79 (ATCC HTB-18) (Y79AA), cultured human cell GI1 (RCB RCB0763) (BGGI1), cultured human cell H4 (ATCC HTB-148) (BNGH4), cultured human cell IMR32 (ATCC CCL-127) (IMR32), and cultured human cell NB9 (RCB #RCB0477) (NB9N4) were cultured by the methods described in the catalogues thereof. RCB indicates that the cell line was supplied by the RIKEN Gene Bank--Cell Development Bank, and ATCC indicates that the cell line was supplied by the American Type Culture Collection.

[0242] The cultured cells of each line were collected, and by a method described in a literature document (J. Sambrook, E. F. Fritsch & T. Maniatis, Molecular Cloning Second edition, Cold Spring harbor Laboratory Press 1989), mRNAs were extracted. Furthermore, poly(A)+ RNAs were purified by means of oligo dT cellulose. Likewise, from human placenta tissue (PLACE), human ovarian cancer tissue (OVARC), tissue rich in head portion from 10-week-gestional fetal human (HEMBA), tissue rich in trunk portion from 10-week-gestional fetal human (HEMBB), human mammary gland tissue (MAMMA), human thyroid tissue (THYRO), and human vascular endothelial tissue primary culture cell (VESEN), by a method described in a literature document (J. Sambrook, E. F. Fritsch & T. Maniatis, Molecular Cloning Second edition, Cold Spring harbor Laboratory Press, 1989), mRNAs were extracted. Furthermore, poly(A)+ RNAs were purified by means of oligo dT cellulose.

[0243] From all these poly(A)+ RNAs, by the oligocap method [M. Maruyama and S. Sugano, Gene, 138: 171-174 (1994)], respective cDNA libraries were prepared. Using an Oligo-cap linker (SEQ ID NO:1) and an Oligo dT primer (SEQ ID NO:2), as directed in a literature document [Suzuki and Sugano, Protein, Nucleic Acid and Enzyme, 41: 197-201 (1996), Y. Suzuki et al., Gene, 200: 149-156 (1997)], BAP (Bacterial Alkaline Phosphatase) treatment, TAP (Tobacco Acid Phosphatase) treatment, RNA ligation, synthesis of first strand cDNA and removal of RNA were performed. Next, using 5' (SEQ ID NO:3) and 3' (SEQ ID NO:4) PCR primers, the first strand cDNA was converted to a double-stranded cDNA by PCR (polymerase chain reaction), and cleaved with SfiI. Next, the cDNA was cloned into the vector pUC19FL3 (for some cases of NT2RM and NT2RP) or pME18SFL3 (GenBank AB009864, Expression vector), previously cleaved with DraIII, in a determined orientation of the cDNA, whereby a cDNA library was prepared.

[0244] The relationships between the names of the cDNA libraries used for 5'-terminal sequence analysis of the cDNAs and the derivations thereof are shown in Tables 1-1 to 1-6. The number of 5'-terminal sequences of the cDNAs in each cDNA library after mapping onto the human genome are also shown in Tables 1-1 to 1-6.

TABLE-US-00001 TABLES 1-1 to 1-6 number of 5'-terminal sequences (only those which permitted mapping onto human genome) Improved oligocap method CORDB Cord blood Extraction of mRNAs from 708 human tissues CTONG Tongue, Cancer Extraction of mRNAs from 31,371 human tissues FCBBF Brain, Fetal Extraction of mRNAs from 31,986 human tissues NTONG Tongue Extraction of mRNAs from 7,125 human tissues OCBBF Brain, Fetal Extraction of mRNAs from 47,574 human tissues PLACE Placenta Extraction of mRNAs from 33,231 human tissues SYNOV Synovial membrane tissue from Extraction of mRNAs from 27,489 rheumatoid arthritis human tissues BRAMY Brain, amygdala, CLONTECH #6574-1 mRNAs from human tissues 58,640 purchased as polyA(+) RNAs BRCAN Brain, caudate nucleus, CLONTECH mRNAs from human tissues 25,786 #6575-1 purchased as polyA(+) RNAs BRCOC Brain, corpus callosum, CLONTECH mRNAs from human tissues 16,718 #6577-1 purchased as polyA(+) RNAs BRHIP Brain, hippocampus, CLONTECH #6578-1 mRNAs from human tissues 57,918 purchased as polyA(+) RNAs BRSSN Brain, substantia nigra, CLONTECH mRNAs from human tissues 15,897 #6580-1 purchased as polyA(+) RNAs BRSTN Brain, subthalamic nucleus, CLONTECH mRNAs from human tissues 16,308 #6581-1 purchased as polyA(+) RNAs BRTHA Brain, thalamus, CLONTECH #6582-1 mRNAs from human tissues 53,267 purchased as polyA(+) RNAs ADIPS Adipose, Invitrogen #D6005-01 mRNAs from human tissues 608 purchased as total RNAs ADRGL Adrenal gland, CLONTECH #64016-1 mRNAs from human tissues 10,223 purchased as total RNAs BEAST Adult Breast, STARATAGENE #735044 mRNAs from human tissues 2,731 purchased as total RNAs BLADE Bladder, Invitrogen #D6020-01 mRNAs from human tissues 8,431 purchased as total RNAs BRACE Brain, cerebellum, CLONTECH #64035-1 mRNAs from human tissues 82,880 purchased as total RNAs BRALZ Brain, cortex, Alzheimer, Invitrogen mRNAs from human tissues 16,360 #D6830-01 purchased as total RNAs BRASW A library generated by subtracting cDNAs mRNAs from human tissues 157 that overlap with the mRNA of BRAWH purchased as total RNAs (Brain, whole, CLONTECH #64020-1) from a cDNA library prepared from the mRNA of BRALZ (Brain, cortex, Alzheimer, Invitrogen #D6830-01), using Subtract Kit (Invitrogen #K4320-01) (BRALZ-BRAWH) BRAWH Brain, whole, CLONTECH #64020-1 mRNAs from human tissues 59,069 purchased as total RNAs CERVX Cervix, Invitrogen #D6047-01 mRNAs from human tissues 2,836 purchased as total RNAs COLON Colon, Invitrogen #D6050-0 mRNAs from human tissues 8,398 purchased as total RNAs FEBRA Brain, Fetal, CLONTECH #64019-1 mRNAs from human tissues 23,578 purchased as total RNAs FEHRT Heart, Fetal, STARATAGENE #738012 mRNAs from human tissues 2,859 purchased as total RNAs FEKID Kidney, Fetal, STARATAGENE #738014 mRNAs from human tissues 2,747 purchased as total RNAs FELIV Liver, Fetal, CLONTECH #64018-1 mRNAs from human tissues 186 purchased as total RNAs FELNG Lung, Fetal, STARATAGENE #738020 mRNAs from human tissues 2,764 purchased as total RNAs HEART Heart, CLONTECH #64025-1 mRNAs from human tissues 8,889 purchased as total RNAs HLUNG Lung, CLONTECH #64023-1 mRNAs from human tissues 16,146 purchased as total RNAs KIDNE Kidney, CLONTECH #64030-1 mRNAs from human tissues 17,008 purchased as total RNAs LIVER Liver, CLONTECH #64022-1 mRNAs from human tissues 6,843 purchased as total RNAs MAMGL Mammary Gland, CLONTECH #64037-1 mRNAs from human tissues 182 purchased as total RNAs NESOP Esophagus, Invitrogen #D6060-01 mRNAs from human tissues 2,690 purchased as total RNAs NOVAR Adult Ovary, STARATAGENE #735260 mRNAs from human tissues 2,486 purchased as total RNAs PANCR Pancreas, CLONTECH #64031-1 mRNAs from human tissues 179 purchased as total RNAs PERIC Pericardium, Invitrogen #D6105-01 mRNAs from human tissues 8,781 purchased as total RNAs PROST Prostate, CLONTECH #64038-1 mRNAs from human tissues 16,671 purchased as total RNAs RECTM Rectum, Invitrogen #D6110-01 mRNAs from human tissues 2,723 purchased as total RNAs SALGL Salivary Gland, CLONTECH #64026-1 mRNAs from human tissues 183 purchased as total RNAs SKMUS Skeletal Muscle, CLONTECH #64033-1 mRNAs from human tissues 8,424 purchased as total RNAs SMINT Small Intestine, CLONTECH #64039-1 mRNAs from human tissues 16,767 purchased as total RNAs SPLEN Spleen, CLONTECH #64034-1 mRNAs from human tissues 33,950 purchased as total RNAs STOMA Stomach, CLONTECH #64090-1 mRNAs from human tissues 8,685 purchased as total RNAs TBAES Breast, Tumor, CLONTECH #64015-1 mRNAs from human tissues 8,416 purchased as total RNAs TCERX Cervix, Tumor, CLONTECH #64010-1 mRNAs from human tissues 2,797 purchased as total RNAs TCOLN Colon, Tumor, CLONTECH #64014-1 mRNAs from human tissues 2,798 purchased as total RNAs TESOP Esophageal, Tumor, Invitrogen #D6860-01 mRNAs from human tissues 8,500 purchased as total RNAs TESTI Testis, CLONTECH #64027-1 mRNAs from human tissues 90,188 purchased as total RNAs THYMU Thymus, CLONTECH #64028-1 mRNAs from human tissues 70,578 purchased as total RNAs TKIDN Kidney, Tumor, Invitrogen #D6870-01 mRNAs from human tissues 15,970 purchased as total RNAs TLIVE Liver, Tumor, Invitrogen #D6880-01 mRNAs from human tissues 8,627 purchased as total RNAs TLUNG Lung, Tumor, CLONTECH #64013-1 mRNAs from human tissues 2,844 purchased as total RNAs TOVAR Ovary, Tumor, CLONTECH #64011-1 mRNAs from human tissues 2,722 purchased as total RNAs TRACH Trachea, CLONTECH #64091-1 mRNAs from human tissues 52,352 purchased as total RNAs TSTOM Stomach, Tumor, Invitrogen #D6920-01 mRNAs from human tissues 2,757 purchased as total RNAs TUTER Uterus, Tumor, CLONTECH #64008-1 mRNAs from human tissues 2,668 purchased as total RNAs UTERU Uterus, CLONTECH #64029-1 mRNAs from human tissues 49,561 purchased as total RNAs ACTVT Activated T-cell Extraction of mRNAs from 679 primary culture human cells ASTRO Normal Human Astrocyte NHA5732, Extraction of mRNAs from 17,162 Takara Shuzo #CC2565 primary culture human cells CD34C CD34+ cell (AllCells, LLC #CB14435M) Extraction of mRNAs from 1,420 primary culture human cells D3OST CD34+ cells treated with osteoclast Extraction of mRNAs from 5,092 differentiation factor (ODF) to induce primary culture human cells differentiation for 3 days D6OST CD34+ cells treated with osteoclast Extraction of mRNAs from 888 differentiation factor (ODF) to induce primary culture human cells differentiation for 6 days D9OST CD34+ cells treated with osteoclast Extraction of mRNAs from 4,407 differentiation factor (ODF) to induce primary culture human cells differentiation for 9 days DFNES Normal Human Dermal Fibroblasts Extraction of mRNAs from 10,103 (Neonatal Skin; NHDF-Neo) NHDF2564, primary culture human cells Takara Shuzo #CC2509 HCASM HCASMC (Human coronary artery smooth Extraction of mRNAs from 8,949 muscle cells), Toyobo #T305K-05 primary culture human cells HCHON HC (Human Chondrocytes), Toyobo Extraction of mRNAs from 9,397 #T402K-05 primary culture human cells HHDPC HDPC (Human dermal papilla cells), Extraction of mRNAs from 8,453 Toyobo #THPCK-001 primary culture human cells HSYRA HS-RA (Human synoviocytes from Extraction of mRNAs from 7,955 rheumatioid arthritis), Toyobo #T404K-05 primary culture human cells LYMPB Lymphoblast, EB virus transferred B cell Extraction of mRNAs from 2,617 primary culture human cells MESAN Normal human mesangial cells Extraction of mRNAs from 16,053 NHMC56046-2, Takara Shuzo primary culture human cells NETRP Neutrophil Extraction of mRNAs from 9,170 primary culture human cells NHNPC Normal human neural progenitor cells Extraction of mRNAs from 2,377 NHNP5958, Takara Shuzo primary culture human cells PEBLM Human peripheral blood mononuclear cells Extraction of mRNAs from 7,900 HPBMC5939, Takara Shuzo #CC2702 primary culture human cells PUAEN Human pulmonary artery endothelial cells, Extraction of mRNAs from 10,544 Toyobo #T302K-05 primary culture human cells UMVEN Human umbilical vein endothelial cells Extraction of mRNAs from 631 HUVEC, Toyobo primary culture human cells 3NB69 NB69 cell (RCB #RCB0480) Extraction of mRNAs from 8,153 cultured human cells AHMSC HMSC cell (Human mesenchymal cell) Extraction of mRNAs from 668 cultured human cells BGGI1 GI1 cell (Gioma separated from Extraction of mRNAs from 1,899 gliosarcoma; RCB #RCB0763) cultured human cells BNGH4 H4 cell (Neuroglioma; ATCC #HTB-148) Extraction of mRNAs from 7,699 cultured human cells CHONS Chondrocyte; Cell Applications, Inc. Extraction of mRNAs from 2,687 #1205F cultured human cells ERLTF TF-1 cell (erythroleukemia) Extraction of mRNAs from 2,169 cultured human cells HELAC HeLa cell (from cervical cancer) Extraction of mRNAs from 676 cultured human cells IMR32 IMR32 cell (Neuroblastoma; ATCC #CCL- Extraction of mRNAs from 16,867 127) cultured human cells JCMLC Leukemia, myelogenous Extraction of mRNAs from 2,156 cultured human cells MESTC Mesenchyme stem cell Extraction of mRNAs from 687 cultured human cells N1ESE Mesenchymal stem cell Extraction of mRNAs from 2,624 cultured human cells NB9N4 NB9 cell (Neuroblastoma; RCB #RCB0477) Extraction of mRNAs from 1,759 cultured human cells NCRRM Embryonal carcinoma Extraction of mRNAs from 698 cultured human cells NCRRP Embryonal carcinoma treated with retinoic Extraction of mRNAs from 691 acid (RA) to induce differentiation cultured human cells NT2NE NT2 cell treated with RA and treated with a Extraction of mRNAs from 16,337 growth inhibitor to induce nerve cultured human cells differentiation, followed by nerve concentration and recovery (NT2 Neuron) NT2RI NT2 cell treated with RA to induce Extraction of mRNAs from 32,662 differentiation for 5 weeks, and thereafter cultured human cells treated with a growth inhibitor for 2 weeks NT2RM NT2 cell (STARATAGENE #204101) Extraction of mRNAs from 2,026 cultured human cells NT2RP NT2 cell treated with retinoic acid (RA) to Extraction of mRNAs from 24,634 induce differentiation for 5 weeks cultured human cells NTISM a library generated by subtracting cDNAs Extraction of mRNAs from 180 that overlap with the mRNA of cultured human cells undifferentiated NT2 cells from a cDNA library prepared from an mRNA of NT2 cell (STARATAGENE #204101) treated with RA to induce differentiation for 5 weeks, and thereafter treated with a growth inhibitor for 2 weeks, using Subtract Kit (Invitrogen #K4320-01) (NT2RI-NT2RM) SKNMC SK-N-MC cell (Neuroepithelioma; ATCC Extraction of mRNAs from 7,607 #HTB-10) cultured human cells SKNSH SK-N-SH cell (Neuroblastoma; RCB Extraction of mRNAs from 8,662 #RCB0426) cultured human cells T1ESE Mesenchymal stem cell treated with Extraction of mRNAs from 2,685 trichostatin and 5-azacytidine to induce cultured human cells differentiation Oligocap method

HEMBA tissue rich in head portion from 10-week- mRNAs from human tissues 7,033 gestional fetal human (whole embryo, mainly head) HEMBB tissue rich in trunk portion from 10-week- mRNAs from human tissues 2,581 gestional fetal human (whole embryo, mainly body) MAMMA Mammary Gland mRNAs from human tissues 2,987 OVARC Ovary, Tumor mRNAs from human tissues 2,058 PLACE Placenta mRNAs from human tissues 12,859 THYRO Thyroid gland mRNAs from human tissues 1,863 VESEN Human umbilical vein endothelial cells Extraction of mRNAs from 1,309 primary culture human cells NB9N3 NB9 cell (Neuroblastoma; RCB #RCB0477) Extraction of mRNAs from 96 cultured human cells NT2RM NT2 cell (STARATAGENE #204101) Extraction of mRNAs from 5,375 cultured human cells NT2RP NT2 cell treated with retinoic acid (RA) to Extraction of mRNAs from 14,608 induce differentiation for 2 days and 2 cultured human cells weeks Y79AA Y79 cell (Retinoblastoma; ATCC HTB-18) Extraction of mRNAs from 2,377 cultured human cells BGGI1 GI1 cell (Gioma separated from Extraction of mRNAs from 62 gliosarcoma; RCB #RCB0763) cultured human cells BNGH4 H4 cell (Neuroglioma; ATCC #HTB-148) Extraction of mRNAs from 89 cultured human cells IMR32 IMR32 cell (Neuroblastoma; ATCC #CCL- Extraction of mRNAs from 94 127) cultured human cells SKNMC SK-N-MC cell (Neuroepithelioma; ATCC Extraction of mRNAs from 92 #HTB-10) cultured human cells either oligocap method or improved oligocap method, not distinguished BGGI1 GI1 cell (Gioma separated from Extraction of mRNAs from 1 gliosarcoma; RCB #RCB0763) cultured human cells BNGH4 H4 cell (Neuroglioma; ATCC #HTB-148) Extraction of mRNAs from 3 cultured human cells IMR32 IMR32 cell (Neuroblastoma; ATCC #CCL- Extraction of mRNAs from 1 127) cultured human cells SKNMC SK-N-MC cell (Neuroepithelioma; ATCC Extraction of mRNAs from 1 #HTB-10) cultured human cells NT2RM NT2 cell (STARATAGENE #204101) Extraction of mRNAs from 48 cultured human cells Total 1,440,790

2) 5'-terminal sequence analysis of cDNAs from cDNA Libraries Prepared by the Oligocap Method

[0245] The 5'-terminal or 3'-terminal nucleic acid sequences of cDNAs acquired from each cDNA library, after a sequencing reaction using a DNA sequencing reagent (Dye Terminator Cycle Sequencing FS Ready Reaction Kit, dRhodamine Terminator Cycle Sequencing FS Ready Reaction Kit or BigDye Terminator Cycle Sequencing FS Ready Reaction Kit, manufactured by PE Biosystems) according to the manual, were analyzed for DNA nucleic acid sequences using a DNA sequencer (ABI PRISM 377, manufactured by PE Biosystems). For the data obtained, a database was constructed. The 5'-terminus full-length rate of each cDNA library prepared by the oligocap method was 60% on average (calculated with the protein coding region of a known mRNA as an index).

3) Full-Length cDNA Nucleic Acid Analysis

[0246] For cDNAs selected for full-length cDNA nucleic acid analysis, the nucleic acid sequence of each full-length cDNA was determined. The nucleic acid sequences were determined mainly by a primer walking method based on the dideoxy terminator method using a custom-synthesized DNA primer. Specifically, a sequencing reaction was performed using a custom-synthesized DNA primer with a DNA sequencing reagent manufactured by PE Biosystem as directed in the manual, after which the DNA nucleic acid sequence was analyzed using a sequencer manufactured by the same company. For some clones, a DNA sequencer manufactured by Licor was also utilized. For some cDNAs, no custom primer was used, but the shotgun method, in which cDNA-containing plasmids are randomly cleaved, was used with a DNA sequencer to determine the DNA nucleic acid sequence. The full-length nucleic acid sequence was finally established by completely overlapping the partial nucleic acid sequences determined by the above-described method. Next, the region of translation into protein was predicted from the determined full-length nucleic acid sequence, and the amino acid sequence was determined.

Example 2

Genome Mapping and Clustering

(1) Sequence Data Set

[0247] The following sequences were used as a data set.

[0248] Human genome sequence: UCSC hg 17 (NCBI Build 35) (http://www.genome.ucsc.edu/)

[0249] Human full-length cDNAs, 19,265 sequences, newly acquired and subjected to full-length cDNA sequence analysis by us

[0250] Out of human full-length cDNA sequences acquired and subjected to full-length cDNA sequence analysis by us, and registered with an existing public database (DDBJ/GenBank/EMBL) (accession numbers: AB038269, AB045981, AB056476, AB056477, AK000001 to AK002212, AK021413 to AK027260, AK027263 to AK027902, AK054561 to AK058202, AK074029 to AK074481, AK074483 to AK075325, AK075326 to AK075566, AK090395 to AK098842, AK122580 to AK129030, AK129488 to AK131107, AK131190 to AK131575, AK160364 to AK160386, AK172724 to AK172740, AK172741 to AK172866), 30,754 sequences that can be used for genome mapping

[0251] 2039 sequences that had been registered with the database HUGE of Kazusa DNA Research Institute by Feb. 3, 2005 (http://www.kazusa.or.jp/huge/)

[0252] Human full-length cDNAs, 20,878 sequences, that had been listed on the Full Length Clone List on the website of Mammalian Gene Collection (http://mgc.nci.nih.gov/) and included in GenBank gbpri (ftp://ftp.ncbi.nih.gov/genbank/) by Jan. 30, 2005

[0253] Human full-length cDNAs, 9,280 sequences, that had been registered as Deutsches Krebsforschungszentrum (DKFZ) in GenBank gbpri before Jan. 30, 2005

[0254] Human full-length cDNAs, 13,984 sequences, being constituent sequences of the human RefSeq sequences of the Jan. 31, 2005 version (http://www.ncbi.nlm.nih.gov/RefSeq/), registered as mRNAs, and included in GenBank gbpri

[0255] Human RefSeq sequences of the Jan. 31, 2005 version (http://www.ncbi.nlm.nih.gov/RefSeq/), 28,931 sequences

[0256] Out of the human genome assemble sequences in Feb. 10, 2005 Ensembl (http://www.ensembl.org/) (NCBI35.nov.sub.--26.35), 33,666 sequences of NCBI35.nov.sub.--26.35 that had been mapped to the hg 17 human genome in UCSC (University of California, Santa Cruz, http://www.genome.ucsc.edu/)

[0257] Human cDNA 5'-terminal sequence, 1,456,213 sequences, and 3'-terminal sequence, 109,283 sequences, subjected to sequence analysis in our project (including published sequences with accession numbers: AU116788 to AU160826, AU279383 to AU280837, DA000001 to DA999999, DB000001 to DB384947)

(2) Genome Mapping

[0258] The above-described data set was subjected to genome mapping using BLASTN (ftp://ftp.ncbi.nih.gov/blast/), under the conditions of Identity of 95% or more and consensus length of 50 base pairs (bp) or more. About 99% of the sequences in the data set used for the mapping permitted genome mapping.

(3) Clustering

[0259] After the genome mapping, a sequence group contained in a genome region, as a single assembly, was allowed to form a cluster. Hence, each cluster was chosen in a way such that the outer sides of both ends of each genome region in the sequence group would not overlap the sequences mapped on each genome region. As a result, a total of 87,173 clusters existed. Therefrom, 17,535 clusters configured solely with human cDNA 3'-terminal sequences that were acquired and subjected to sequence analysis in our project were excluded, leaving 69,638 clusters. Of these clusters, 36,782 clusters were excluded since they were configured solely with human cDNA 5'-terminal sequences that were acquired and subjected to sequence analysis in our project (those having none of full-length cDNA, RefSeq, and Ensembl sequences were excluded). As a result, 32,856 clusters were found to comprise at least one of full-length cDNAs, RefSeq, and Ensembl sequences. By selecting clusters comprising one or more of full-length cDNAs, RefSeq, and Ensembl sequences, which are expected to have an ORF (open reading frame, coding region) with a reliability above a given level, 21,703 clusters were acquired. For these 21,703 clusters, expression specificity was determined.

Example 3

Experimental Procedures for Real-Time PCR

[0260] (1) Synthesis of Template cDNAs 1) Human mRNA (Human Total RNA) Used as Template

[0261] A reaction was carried out with 50 .mu.g of Human Total RNA per 150 .mu.l of the system.

[0262] To 50 .mu.g of Total RNA dissolved in 87 .mu.l of H.sub.2O, 10 .mu.l of a random primer (concentration 65 ng/.mu.l) and 7.5 .mu.l of dNTP Mix (concentration 10 mM each dNTP Mix) were added. This was followed by incubation at 65.degree. C. for 5 minutes and on ice for 1 minute. 30 .mu.l of 5.times. reaction buffer solution (attached to the Invitrogen SuperScript III RT kit) and 7.5 .mu.l of 0.1M DTT and 3 .mu.l of RNase Inhibitor (STRATAGENE) and 5 .mu.l of SuperScript III RT (Invitrogen) were added. This was followed by incubation at 25.degree. C. for 5 minutes, incubation at 50.degree. C. for 60 minutes, and incubation at 70.degree. C. for 15 minutes. After the reaction, phenol-chloroform extraction was performed to deactivate the enzyme. By adding 3 .mu.l of EDTA (0.5M) and 22.5 .mu.l of 0.1N NaOH, alkali treatment was performed to degrade the RNA. After 30 .mu.l of Tris (1M pH 7.8) was added to neutralize the reaction liquid, ethanol precipitation was performed, and the precipitate was dissolved in 100 .mu.l of TE buffer solution.

[0263] Human mRNAs from the mRNA sources (Human Total RNAs) were acquired by the method described in Example 1.

[0264] A list of the human mRNAs used in the experiments is shown in Table 2.

2) Human mRNA (Human PolyA(+) RNA) Used as Template

[0265] A reaction was carried out with 5 .mu.g of human PolyA RNA per 100 .mu.l of the system.

[0266] To 5 .mu.g of PolyA(+) RNA dissolved in 58 .mu.l of H.sub.2O, 5 .mu.l of a random primer (concentration 65 ng/.mu.l) and 5 .mu.l of dNTP Mix (concentration 10 mM each dNTP Mix) were added. This was followed by incubation at 65.degree. C. for 5 minutes and incubation on ice for 1 minute. 20 .mu.l of 5.times. reaction buffer solution (attached to the Invitrogen SuperScript III RTkit), 5 .mu.l of 0.1M DTT, 2 .mu.l of RNase Inhibitor (STRATAGENE) and 5 .mu.l of SuperScript III RT (Invitrogens) were added. This was followed by incubation at 25.degree. C. for 5 minutes, incubation at 50.degree. C. for 60 minutes, and incubation at 70.degree. C. for 15 minutes. After the reaction, phenol-chloroform extraction was performed to deactivate the enzyme. By adding 2 .mu.l of EDTA (0.5M) and 15 .mu.l of 0.1N NaOH, alkali treatment was performed to degrade the RNA. After 20 .mu.l of Tris (1M pH 7.8) was added to neutralize the reaction liquid, ethanol precipitation was performed, and the precipitate was dissolved in 50 .mu.l of TE buffer solution.

[0267] A list of the human mRNAs used in the experiments is shown in Table 2.

TABLE-US-00002 TABLE 2 Product name Manufacturer Catalog number Human total RNA purchased 1 Bone Marrow Human Bone Marrow Total Clontech 636548 RNA 2 Brain, whole Human Brain Total RNA Clontech 636530 3 Fetal Brain Human Fetal Brain Total Clontech 636526 RNA 4 Heart Human Heart Total RNA Clontech 636532 5 Kidney Human Kidney Total RNA Clontech 636529/636514 6 Liver Human Liver Total RNA Clontech 636531 7 Lung Human Lung Total RNA Clontech 636524 8 Thymus Human Thymus Total RNA Clontech 636549 9 Uterus Human Uterus Total RNA Clontech 636551/636513 10 Spinal Cord Human Spinal Cord Total Clontech 636554 RNA 11 Colon Human Colon Total RNA Clontech 636521 12 Colon Tumor Human Colon Tumor Total Clontech 636634 RNA 13 Kidney Tumor Human Kidney Tumor Total Clontech 636632 RNA 14 Liver Tumor Human Liver Total RNA CHEMICOM RNA569 15 Lung Tumor Human Lung Tumor Total Clontech 636633 RNA 16 Ovary Human Ovary Total RNA Clontech 636555 17 Ovary Tumor Human Ovary Tumor Total Clontech 636631 RNA 18 Spleen Human Spleen Total RNA Clontech 636525 19 Stomach Human Stomach Total RNA Clontech 636522 20 Stomach Tumor Human Stomach Tumor Clontech 636629 Total RNA 21 Uterus Tumor Human Uterus Tumor Total Clontech 636628 RNA 22 ALZ Visual Cortex Occipital Human Visual Cortex Ambion B6336 Occipital ALZ Total RNA Human polyA(+) RNA purchased 1 Brain, whole Human Brain, whole Clontech 636102 PolyARNA 2 Brain cerebellum Brain, cerebellum Clontech 636122 3 Brain, amygdala Brain, amygdala Clontech 6574-1 4 Brain, caudate nucleus Brain, caudate nucleus Clontech 6575-1 5 Brain, corpus callosum Brain, corpus callosum Clontech 636133 6 Brain, hippocampus Brain, hippocampus Clontech 636134 7 Brain, substantia nigra Brain, substantia nigra Clontech 6580-1 8 Brain, thalamus Brain, thalamus Clontech 636135 9 Brain, subthalamic nucleus Brain, subthalamic nucleus Clontech 636167 Extraction of human total RNA Explanation of the derivation from an RNA source of mRNA 1 Tongue (normal) Normal tongue tissue 2 Tongue Tumor Tongue tumor tissue 3 NT2 cell (STARATAGENE Before treatment with NT2 #204101) retinoic acid (RA(-)) 4 NT2 cell treated with NT2 cell treated with retinoic retinoic acid (RA) to induce acid (RA) to induce differentiation differentiation for 5 weeks 5 NT2 cell treated with RA to NT2 cell treated with RA to induce differentiation induce differentiation for 5 followed by treatment with weeks, and thereafter a growth inhibitor (Inh) treated with a growth inhibitor for 2 weeks 6 NT2 cell treated with RA to NT2 cell treated with retinoic induce differentiation acid (RA) to induce differentiation for 1 day 7 NT2 cell treated with RA to NT2 cell treated with retinoic induce differentiation acid (RA) to induce differentiation for 2 days 8 NT2 cell treated with RA to NT2 cell treated with retinoic induce differentiation acid (RA) to induce differentiation for 1 week 9 NT2 cell treated with RA NT2 cell treated with RA and and treated with a-Inh to treated with a growth induce nerve differentiation inhibitor to induce nerve differentiation, followed by nerve cell concentration and recovery

(2) Design of Primers and Probes

[0268] Using Primer Express software 3.0, the primer design software attached to the Applied Biosystems real-time PCR 7500 Fast, with the sequences of portions that serve as the borders of the changing region, primers and probes were designed to allow the individual detection of cDNAs having other splice patterns transcribed from the same chromosome region as the cDNA to be comparatively examined under the conditions recommended by the software. Using the designed primers, real-time PCR was performed, and they were confirmed to produce a single band and to be capable of specifically detecting only one kind of cDNA.

(3) Expressional Analysis Using Real-Time PCR

[0269] 1) mRNAs Used

[0270] All mRNAs used were of human derivation.

[0271] The experiments on the four clusters chr14-45, chr7-2007, chr12-1875, and chr3-1507, out of the 10 experimental systems, were performed using SYBR GREEN as a real-time PCR reaction system, with 16 kinds of samples as template cDNAs: NT2 cells [NT2 RA(-)], NT2 cells treated with retinoic acid (RA) to induce differentiation for 24 hours [NT2 RA(+) 24 hr], NT2 cells treated with retinoic acid (RA) to induce differentiation for 48 hours [NT2 RA(+) 48 hr], NT2 cells treated with retinoic acid (RA) to induce differentiation for 1 week [NT2 RA(+) 1 week], NT2 cells treated with retinoic acid (RA) to induce differentiation for 5 weeks [NT2 RA(+)], NT2 cells treated with RA to induce differentiation for 5 weeks, and thereafter treated with a growth inhibitor for 2 weeks [NT2 RA(+) Inh(+)], NT2 cells treated with RA and treated with a growth inhibitor to induce nerve differentiation, followed by nerve concentration and recovery (NT2 Neuron), Brain, Fetal, Brain, whole, Alzheimer patient cerebral cortex (ALZ Visual Cortex Occipital), Mix, viscus tissues [Heart, Kidney, Liver, Lung, Colon, Stomach], Mix, blood cells and related tissues [Bone Marrow, Thymus, Spinal Cord, Spleen], Mix, tumor tissues [Colon Tumor, Kidney Tumor, Liver Tumor, Lung Tumor, Ovary Tumor, Stomach Tumor, Uterus Tumor, Tongue Tumor], Mix, normal tissues [Colon, Kidney, Liver, Lung, Ovary, Stomach, Uterus, Tongue], whole brain polyA(+)RNA [Brain, whole PolyA(+) RNA], and Brain, hippocampus.

[0272] For the cluster chr12-1875, experiments were also performed with, in addition to the foregoing 16 kinds, additional samples: Colon, Kidney, Liver, Lung, Ovary, Stomach, Uterus, Tongue, Colon Tumor, Kidney Tumor, Liver Tumor, Lung Tumor, Ovary Tumor, Stomach Tumor, Uterus Tumor, and Tongue Tumor.

[0273] For the cluster chr3-1507, experiments were also performed with, in addition to the foregoing 16 kinds, additional samples: Brain cerebellum, Brain, amygdala, Brain, caudate nucleus, Brain, corpus callosum, Brain, substantia nigra, Brain, thalamus, and Brain, subthalamic nucleus.

[0274] The experiments on the 2 clusters chr19-32 and chr12+1658, out of the 10 experimental systems, were performed using TaqMan manufactured by Applied Biosystems as a real-time PCR reaction system, with a total of 16 kinds of samples as template cDNAs: NT2 cells [NT2 RA(-)], NT2 cells treated with retinoic acid (RA) to induce differentiation for 24 hours [NT2 RA(+) 24 hr], NT2 cells treated with retinoic acid (RA) to induce differentiation for 48 hours [NT2 RA(+) 48 hr], NT2 cells treated with retinoic acid (RA) to induce differentiation for 1 week [NT2 RA(+) 1 week], NT2 cells treated with retinoic acid (RA) to induce differentiation for 5 weeks [NT2 RA(+)], NT2 cells treated with RA to induce differentiation for 5 weeks, and thereafter treated with a growth inhibitor for 2 weeks [NT2 RA(+) Inh(+)], NT2 cells treated with RA and treated with a growth inhibitor to induce nerve differentiation, followed by nerve concentration and recovery (NT2 Neuron), Brain, Fetal, Brain, whole, Alzheimer patient cerebral cortex (ALZ Visual Cortex Occipital), Mix, viscus tissues [Heart, Kidney, Liver, Lung, Colon, Stomach], Mix, blood cells and related tissues [Bone Marrow, Thymus, Spinal Cord, Spleen], Mix, tumor tissues [Colon Tumor, Kidney Tumor, Liver Tumor, Lung Tumor, Ovary Tumor, Stomach Tumor, Uterus Tumor, Tongue Tumor], Mix, normal tissues [Colon, Kidney, Liver, Lung, Ovary, Stomach, Uterus, Tongue], Brain, whole PolyA(+) RNA, and Brain, hippocampus.

[0275] The experiments on the 4 clusters chr2-2324, chrX-900, chr8-916, and chr3+2014, out of the 10 experimental systems, were performed using SYBR GREEN as a real-time PCR reaction system, with a total of 23 kinds of samples as template cDNAs: NT2 cells [NT2 RA(-)], NT2 cells treated with retinoic acid (RA) to induce differentiation for 24 hours [NT2 RA(+) 24 hr], NT2 cells treated with retinoic acid (RA) to induce differentiation for 48 hours [NT2 RA(+) 48 hr], NT2 cells treated with retinoic acid (RA) to induce differentiation for 1 week [NT2 RA(+) 1 week], NT2 cells treated with retinoic acid (RA) to induce differentiation for 5 weeks [NT2 RA(+)], NT2 cells treated with RA to induce differentiation for 5 weeks, and thereafter treated with a growth inhibitor for 2 weeks [NT2 RA(+) Inh(+)], NT2 cells treated with RA and treated with a growth inhibitor to induce nerve differentiation, followed by nerve concentration and recovery (NT2 Neuron), Brain, Fetal, Brain, whole, Alzheimer patient cerebral cortex (ALZ Visual Cortex Occipital), Mix, viscus tissues [Heart, Kidney, Liver, Lung, Colon, Stomach], Mix, blood cells and related tissues [Bone Marrow, Thymus, Spinal Cord, Spleen], Mix, tumor tissues [Colon Tumor, Kidney Tumor, Liver Tumor, Lung Tumor, Ovary Tumor, Stomach Tumor, Uterus Tumor, Tongue Tumor], Mix, normal tissues [Colon, Kidney, Liver, Lung, Ovary, Stomach, Uterus, Tongue], Brain, whole PolyA(+) RNA, Brain, hippocampus, Brain cerebellum, Brain, amygdala, Brain, caudate nucleus, Brain, corpus callosum, Brain, substantia nigra, Brain, thalamus, and Brain, subthalamic nucleus.

2) Reaction System Using SYBR GREEN

[0276] The SYBR GREEN I Dye assay chemistry is an experimental system based on the characteristic of SYBR GREEN to emit strong fluorescence by binding to a double-stranded DNA. When the DNA denatures to single-stranded during the PCR reaction, SYBR GREEN leaves from the DNA and the fluorescence decreases rapidly; however, with the subsequent annealing/extension reaction, it binds to the double-stranded DNA to emit fluorescence again. In the SYBR GREEN I Dye assay chemistry, fluorescence intensity, which increases with every PCR cycle, is detected.

[0277] To a cDNA derived from each tissue, 0.2 .mu.l (equivalent to 100 ng of Total RNA), as the template, Forward Primer (final concentration 250 nM), Reverse Primer (final concentration 250 nM), and SYBR Green PCR Master Mix (ABI 4309155) were added, to make a total volume of 20 .mu.l. For endogenous control, GAPDH (Accession No; NM.sub.--002046.2) always served as a reaction control for all templates.

[0278] A PCR was performed under the conditions shown below, which represent the standard protocol for Applied Biosystems real-time PCR 7500 Fast. After an initial step at 50.degree. C. for 2 minutes and at 95.degree. C. for 10 minutes, denaturation at 95.degree. C. for 15 seconds and annealing elongation at 60.degree. C. for 1 minute were repeated in 40 cycles.

GAPDH-F (SEQ ID NO:5): Forward Primer for endogenous control GAPDH GAPDH-R (SEQ ID NO:6): Reverse Primer for endogenous control GAPDH

3) Reaction System Using TaqMan

[0279] The TaqMan assay chemistry is an experimental system employing the TaqMan probe, a probe phosphorylated at the 3' terminus and labeled with a Fluorescenin-series fluorescent dye (reporter) at the 5' terminus, and a Rhodamine-series fluorescent dye (quencher) at the 3' terminus. When the TaqMan probe occurs alone, the fluorescence energy of the reporter is consumed as excitation energy for the quencher, and the fluorescence of the reporter is suppressed, because the fluorescence wavelength is close to that of the quencher even if reporter excitation light is irradiated. However, when the TaqMan probe is degraded by the 5'-3' exonuclease activity of DNA polymerase during the elongation from the primer in the PCR reaction, the fluorescent dye of the reporter leaves from the 5' terminus of the TaqMan probe, and the distance from the fluorescent dye of the quencher increases, resulting in the emission of fluorescence. In the TaqMan assay chemistry, the fluorescence intensity from the reporter, which increases with every PCR cycle, is detected.

[0280] To 0.2 .mu.l (equivalent to 100 ng as converted to Total RNA) of a cDNA derived from each tissue as a template, Forward Primer (final concentration 900 nM), Reverse Primer (final concentration 900 nM), TaqMan Probe (final concentration 250 nM), and TaqMan Fast Universal PCR Master Mix (ABI 466073) were added, to make a total volume of 20 .mu.l. For endogenous control, GAPDH always served as a reaction control for all templates.

[0281] A PCR was performed under the conditions shown below, which represent the Fast protocol for Applied Biosystems real-time PCR 7500 Fast. After enzyme activation 95.degree. C. for 20 seconds, denaturation at 95.degree. C. for 3 seconds and annealing elongation at 60.degree. C. for 30 seconds were repeated in 40 cycles. GAPDH-Probe (SEQ ID NO:7): TaqMan Probe for endogenous control GAPDH

(4) Method of Statistical Analysis of Data

[0282] The results were analyzed using a relative quantitation method.

[0283] Using the RQ study software for Applied Biosystems real-time PCR 7500 Fast, a threshold was set in an exponential functional amplification region of the amplification curve. The number of cycles at that time was used as the Ct (threshold cycle). To make a correction for initial RNA content, the Ct of the endogenous control GAPDH was subtracted from the Ct obtained, and this value was used as the dCt [dCt=Target Ct-ENDOGENUS Ct (GAPDH)]. The dCt of the sample serving as the reference standard (control) was further subtracted from the dCt obtained, and this value was used as the ddCt [ddCt=Target dCt-Control dCt]. On the basis of this value, relative value was calculated, and this was used as the RQ [RQ=2.sup.-ddCt]. On the basis of this result, a logarithmic graph was generated, and the amounts amplified and hence expression levels with each primer and probe were compared.

[0284] In each Example, analytical results for RQ and Log.sub.10 RQ are shown. RQ scores are shown to the first decimal point. For samples not allowing detection by real-time PCR, "Undet." was written in the fields for RQ score and the score of Log.sub.10 RQ. Log.sub.10 RQ scores are shown to the second decimal point. However, for a mixed sample of control normal visceral tissues (Mix, viscus tissues) (RQ value "1.0"), "0.0" was written in the field for Log.sub.10 RQ scores.

Example 4

Cluster chr19-32 (Data Set: 103)

(1) Cluster Analysis

1) Cluster Characteristics

[0285] An analysis was performed on 8 sequences of full-length cDNAs subjected to genome mapping onto the cluster chr19-32 (Human genome UCSC hg18 (NCBI Build34) chromosome 19, 63,124,000 bp to 63,140,000 bp) [D-UTERU2026184.1, D-BRACE3000012.1, AB075836.1, AY695825.1, C-NT2RI2001083, ENST00000358502, ENST00000361044, NM.sub.--133460.1]. They were classifiable according to expression pattern difference into the following 3 kinds.

[1] D-UTERU2026184.1

[2] D-BRACE3000012.1

[3] AB075836.1, AY695825.1, C-NT2RI2001083 (AK056113.1), ENST00000358502, ENST00000361044, NM.sub.--133460.1

[0286] [1] and [2] are cDNAs that were newly acquired and subjected to full-length cDNA sequence analysis by us, having an ORF different from that of [3], which had been registered in an existing public DB (DDBJ/Genbank/EMBL).

[0287] [1], compared with the known [3], had a different ORF region because of the deletion of portions corresponding to the second and third exons of [3] in the ORF region.

[0288] [2], compared with the known [3], had an altered translation initiation point and a different ORF region because of the insertion of an exon different from the other patterns into the ORF region.

[0289] It was found that the ORF regions present in the 3 kinds of cDNA patterns [1] to [3] undergo splicing in different patterns, such as exon deletions and insertions, from the same chromosome region, resulting in alterations of the amino acid sequences to produce diverse proteins and mRNAs.

2) Characteristics of D-UTERU2026184.1 ([1]), which was Newly Acquired and Subjected to Full-Length cDNA Sequence Analysis by Us 103.sub.--[1].sub.--1-N0 (SEQ ID NO:8): The entire nucleic acid sequence region of D-UTERU2026184.1 103.sub.--[1].sub.--1-NA0 (SEQ ID NO:9): Both the entire nucleic acid sequence region and amino acid sequence of D-UTERU2026184.1 103.sub.--[1].sub.--1-A0 (SEQ ID NO:10): The entire amino acid sequence region of D-UTERU2026184.1

[0290] The 213-base exon present at the 213th to 425th bases of NM.sub.--133460.1 (SEQ ID NO:13), which is registered with an existing public DB, and serves for control, is deleted and not present in the region at the 223rd to 224th bases of D-UTERU2026184.1. The 2 bases present at the 520th to 521st bases of NM.sub.--133460.1 (SEQ ID NO:14) are also deleted and not present in the region at the 317th to 318th bases of D-UTERU2026184.1 (SEQ ID NO:11). Although the translation initiation point of NM.sub.--133460.1 is present on the 128-base insertion exon, D-UTERU2026184.1 is present on the first exon, which is shared by NM.sub.--133460.1; therefore, compared with NM.sub.--133460.1, the N-terminal amino acids differed by 43 residues.

103.sub.--[1].sub.--1-N1 (SEQ ID NO:11): Deletion nucleic acid sequence region of D-UTERU2026184.1 103.sub.--[1].sub.--1-A1 (SEQ ID NO:12): Amino acid region altered as a result of deletion of D-UTERU2026184.1 103.sub.--[1].sub.--1-N2 (identical to SEQ ID NO:11): ORF nucleic acid region in the deletion nucleic acid region of D-UTERU2026184.1 103.sub.--[1].sub.--1-A2 (identical to SEQ ID NO:12): ORF amino acid region related to the deletion nucleic acid region of D-UTERU2026184.1 103.sub.--[1]_C-N1 (SEQ ID NO:13): 213-base insert nucleic acid sequence present at the 213th to 425th bases of NM.sub.--133460.1 inserted into the region at the 223rd to 224th bases of D-UTERU2026184.1 103.sub.--[1]_C--N2 (SEQ ID NO:14): 2-base insert nucleic acid sequence present at the 520th to 521 bases of NM.sub.--133460.1 inserted into the region at the 317th to 318th bases of D-UTERU2026184.1 103.sub.--[1]_C-A1 (SEQ ID NO:15): Amino acid region related to the insert nucleic acid sequences at the 213th to 425th bases and the 520th to 521st bases of NM.sub.--133460.1, inserted into the region at the 223rd to 224th bases and the region at the 317th to 318th bases of D-UTERU2026184.1.

[0291] With this change, the Pfam motif "KRAB box", which is present at the 5th to 45th amino acids of NM.sub.--133460.1, which serves for control, disappeared in D-UTERU2026184.1 (http://pfam.janelia.org/).

3) Characteristics of D-BRACE3000012.1 ([2]), which was Newly Acquired and Subjected to Full-Length cDNA Sequence Analysis by Us 103.sub.--[2].sub.--1-N0 (SEQ ID NO:16): The entire nucleic acid sequence region of D-BRACE3000012.1 103.sub.--[2].sub.--1-NA0 (SEQ ID NO:17): Both the entire nucleic acid sequence region and amino acid sequence of D-BRACE3000012.1 103.sub.--[2].sub.--1-A0 (SEQ ID NO:18): The entire amino acid sequence region of D-BRACE3000012.1

[0292] The sequence at the 314th to 533rd bases of D-BRACE3000012.1 (SEQ ID NO:19) is a variant with insertion of an exon not present in NM.sub.--133460.1, which is registered with an existing public DB and serves for control; because of its presence on the exon inserted, along with the translation initiation point, compared with the NM.sub.--133460.1, the N-terminal amino acids differed by 23 residues (SEQ ID NO:20).

103.sub.--[2].sub.--1-N1 (SEQ ID NO:19): 220-base insert nucleic acid sequence region of D-BRACE3000012.1 103.sub.--[2].sub.--1-A1 (SEQ ID NO:20): 23-residue insert amino acid sequence region of D-BRACE3000012.1 103.sub.--[2].sub.--1-N2 (SEQ ID NO:21): ORF nucleic acid sequence region in 220-base insert region of D-BRACE3000012.1 103.sub.--[2].sub.--1-A2 (SEQ ID NO:22): ORF amino acid region related to 220-base insert region of D-BRACE3000012.1

4) Expression Specificity Analysis and Design of Primers and TaqMan Probes for Real-Time PCR

[0293] To clearly distinguish between the characteristic regions shown above, and examine the respective expression levels thereof, the following regions were used as detection regions. It seemed possible to compare the expression levels of the individual characteristic regions by comparing the expression levels of the detection regions.

103.sub.--01--A region specifically extracted by means of the sequence information at the border of a region lacking an exon in the cDNA pattern [1]: an ORF-altering region with exon deletion in the cDNA pattern [1], which was newly subjected to full-length cDNA sequence analysis by us .fwdarw.Fragment 103.sub.--01 (SEQ ID NO:25) amplified by Primer103.sub.--01F (SEQ ID NO:23) and Primer103.sub.--01R (SEQ ID NO:24) TaqMan probe used 103.sub.--01TP: (SEQ ID NO:26) 103.sub.--02--A region specifically extracted by means of the sequence information on a region with exon insertion in the cDNA pattern [2]: an ORF-altering region with exon insertion in the cDNA pattern [2], which was newly subjected to full-length cDNA sequence analysis by us .fwdarw.Fragment 103.sub.--02 (SEQ ID NO:29) amplified by Primer103.sub.--02F (SEQ ID NO:27) and Primer103.sub.--02R (SEQ ID NO:28) TaqMan probe used 103.sub.--02TP: (SEQ ID NO:30) 103.sub.--03--A specific region that is distinguishable from both the deletion region [1] and insert region of [2] in the cDNA pattern [3] registered with an existing public DB, serving as a control for comparing [1] and [2] .fwdarw.Fragment 103.sub.--03 (SEQ ID NO:33) amplified by Primer103.sub.--03F (SEQ ID NO:31) and Primer103.sub.--03R (SEQ ID NO:32) TaqMan probe used 103.sub.--03TP: (SEQ ID NO:34) 103.sub.--04--A common region shared by all of [1] to [3]: a region common to all patterns, serving for control to compare the overall expression levels of the cDNA patterns [1] and [2], which were newly subjected to full-length cDNA sequence analysis by us, and the cDNA pattern [3], registered with an existing public DB .fwdarw.Fragment 103.sub.--04 (SEQ ID NO:37) amplified by Primer103.sub.--04F (SEQ ID NO:35) and Primer103.sub.--04R (SEQ ID NO:36) TaqMan probe 103.sub.--04TP used: (SEQ ID NO:38)

[0294] By mapping the 5'-terminal sequences of about 1.44 million sequences acquired using the oligocap method onto the human genome sequence, and comparatively analyzing them, the exon regions specific for the cDNA patterns [1], [2], and [3] shown above, respectively, were found to be expressed at the following frequencies.

[0295] In the cDNA pattern [1], which was newly acquired and analyzed by us, one 5'-terminal sequence was present, the derivation thereof being Uterus for 1 sequence (analytical parameter 49,561).

[0296] In the cDNA pattern [2], which was newly acquired and analyzed by us, two 5'-terminal sequences were present, the derivations thereof being Brain, cerebellum for 1 sequence (analytical parameter 82,880), and NT2 cells treated with retinoic acid (RA) to induce differentiation (NT2RP) for 1 sequence (analytical parameter 39,242).

[0297] In the cDNA pattern [3], which is registered with an existing public DB, fourteen 5'-terminal sequences were present, the derivations thereof being NT2 cells treated with retinoic acid (RA) to induce differentiation for 5 weeks (NT2RP) for 4 sequences (analytical parameter 39,242), NT2 cells treated with RA to induce differentiation for 5 weeks, and thereafter treated with a growth inhibitor for 2 weeks (NT2RI) for 2 sequences (analytical parameter 32,662), Brain, cerebellum for 1 sequence (analytical parameter 82,880), Brain, amygdala for 1 sequence (analytical parameter 58,640), Brain, hippocampus for 1 sequence (analytical parameter 57,918), Brain, substantia nigra for 1 sequence (analytical parameter 15,897), Normal Human Dermal Fibroblasts for 1 sequence (analytical parameter 10,103), Brain, Fetal for 2 sequences (analytical parameter 79,560), and Uterus for 1 sequence (analytical parameter 49,561).

[0298] From this result, it was found that the exon-deletion pattern [1] was expressed in Uterus, and that the exon-insertion pattern [2] was expressed in Brain, cerebellum and NT2 cells treated with retinoic acid to induce differentiation (NT2RP). It was found that the known sequence [3] was abundantly expressed in NT2 cells treated with retinoic acid to induce differentiation (NT2RP) and in brain tissues.

(2) Analysis of Expression Specificity by Real-Time PCR

[0299] To detect protein expression diversity changes due to exon selectivity, details of expression levels were analyzed by real-time PCR. The results are shown in Table 3.

TABLE-US-00003 TABLE 3 RQ Score Log.sub.10RQ Score 103_01 103_02 103_03 103_04 103_01 103_02 103_03 103_04 01 NT2RA(-) 0.1 6.3 0.9 0.4 -0.86 0.80 -0.04 -0.45 02 NT2RA(+) 24 hr 0.3 2.7 0.6 0.3 -0.54 0.42 -0.19 -0.59 03 NT2RA(+) 48 hr 0.2 2.0 0.7 0.3 -0.68 0.29 -0.14 -0.52 04 NT2RA(+) 1 week 1.9 1.5 0.9 0.8 0.27 0.17 -0.03 -0.11 05 NT2RA(+) 5 weeks 6.7 8.6 2.9 1.1 0.83 0.93 0.46 0.05 06 NT2RA(+) 5 weeks, Inh(+) 2.1 1.5 1.1 0.5 0.32 0.19 0.05 -0.30 07 NT2 Neuron 0.2 0.2 0.5 1.3 -0.77 -0.74 -0.30 0.10 08 Brain, Fetal 4.0 5.0 17.3 8.4 0.60 0.70 1.24 0.92 09 Brain, whole 8.6 3.2 6.1 5.3 0.93 0.51 0.78 0.73 10 ALZ Visual Cortex 1.2 0.7 2.2 3.0 0.08 -0.18 0.34 0.47 Occipital 11 Mix, viscus tissues 1.0 1.0 1.0 1.0 0.0 0.0 0.0 0.0 12 Mix, blood cells and 12.7 6.0 2.7 2.6 1.11 0.78 0.43 0.42 related tissues 13 Mix, tumor tissues 2.3 0.9 0.5 0.7 0.36 -0.02 -0.28 -0.17 14 Mix, normal tissues 2.4 1.9 1.2 1.8 0.37 0.29 0.09 0.24 15 Brain, whole PolyA(+) RNA 4.8 2.2 5.9 3.1 0.68 0.34 0.77 0.49 16 Brain, hippocampus 2.4 1.9 4.1 2.6 0.39 0.28 0.61 0.42

[0300] Expression levels were compared using the 16 samples shown in Example 3, including Brain, hippocampus, Brain, whole, Brain, Fetal, Alzheimer patient cerebral cortex (ALZ Visual Cortex Occipital), and NT2 cells in 7 differentiation stages. For experimental control, comparisons were made using the sample prepared by mixing normal visceral tissues in Example 3 (Mix, viscus tissues).

[0301] The ratio of ORF alteration due to exon insertion/deletion selectivity as compared between 103.sub.--01 (SEQ ID NO:25) and 103.sub.--02 (SEQ ID NO:29) changed greatly among the following differentiation stages of the brain and NT2 cells.

[0302] The expression of the exon-deletion pattern shown by 103.sub.--01 (SEQ ID NO:25) was low in undifferentiated NT2 cells NT2RA (-) and NT2RA (+) 48 hr, which represents the initial stage in which retinoic acid was added to induce differentiation; the expression was high in NT2RA(+) 1 week to NT2RA (+) 5 weeks, Inh (+), which represent the late stage of differentiation induction, and was low in NT2 Neuron. The expression in Brain, Fetal was also low (Table 3).

[0303] The expression of the exon insertion pattern shown by 103.sub.--02 (SEQ ID NO:29) was abundant in undifferentiated NT2 cells NT2RA(-) and the initial stage in which retinoic acid was added to induce differentiation, to NT2RA (+) 5 weeks, Inh (+); the expression was low in NT2 Neuron (Table 3). Not only in Fetal Brain, but also in the whole brain, the expression was low (Table 3).

[0304] These results demonstrated that by comparing the expression of selective exon regions 103.sub.--[1].sub.--1-N1 (SEQ ID NO:11) and 103.sub.--[2].sub.--1-N1 (SEQ ID NO:19) of newly acquired cDNAs shown by the detection regions 103.sub.--01 (SEQ ID NO:25) and 103.sub.--02 (SEQ ID NO:29), it is possible to use these regions as differentiation markers for detecting stages of nerve cell differentiation or regeneration. It also seems possible to develop a new drug by means of a compound, antibody, siRNA or the like that targets a region that exhibits specificity.

[0305] The following regions also seem to be useful as differentiation markers for detecting nerve cell differentiation or regeneration stages.

Upstream sequence 062.sub.--[1].sub.--1-N3 (SEQ ID NO:39), which comprises the 285th to 306th bases undergoing priming by Primer10301R (SEQ ID NO:24) in D-UTERU2026184.1 of the cDNA pattern [1]. Upstream sequence 062.sub.--[1].sub.--1-N3 (SEQ ID NO:40), which comprises the 521st to 541st bases undergoing priming by Primer103.sub.--02R (SEQ ID NO:28) in D-BRACE3000012.1 of the cDNA pattern [2]. Region 10301 (SEQ ID NO:25) amplified by Primer10301F (SEQ ID NO:23) and Primer103.sub.--01R (SEQ ID NO:24) in the cDNA pattern [1] Region 103.sub.--02 (SEQ ID NO:29) amplified by Primer103.sub.--02F (SEQ ID NO:27) and Primer103.sub.--02R (SEQ ID NO:28) in the cDNA pattern [2]

Example 5

Cluster chr14-45 (Data Set: 019)

(1) Cluster Analysis

1) Cluster Characteristics

[0306] An analysis was performed on 13 sequences of full-length cDNAs subjected to genome mapping onto the cluster chr14-45 (Human genome UCSC hg18 (NCBI Build34) chromosome 14, 104,305,000 bp to 104,335,000 bp) [D-NT2RP8004156.1, BC000479.2, BC084538.1, BX647722.1, BX648205.1, C-BRACE2006105, C-BRHIP2019884, C-PLACE7003657, C-TEST14021482, ENST00000310523, ENST00000349310, M63167.1, NM.sub.--005163.1]. They were classified according to expression pattern difference into 7 kinds, which mainly included the following 2 kinds.

[1] D-NT2RP8004156.1

[2] BC000479.2, BC084538.1, BX648205.1, C-PLACE7003657 (AK122894.1), ENST00000310523, M63167.1, NM.sub.--005163.1

[0307] [1] is a cDNA which was newly acquired and subjected to full-length cDNA sequence analysis by us, having an ORF different from that of [2] registered in an existing public DB.

[0308] [1] had a different ORF region because of its expression from a chromosome region located downstream of the known [2].

[0309] It was found that the ORF regions present in the 2 kinds of cDNA patterns [1] to [2] cause expression starting at different transcription initiation points, from the same chromosome region, resulting in alterations of the amino acid sequences to produce diverse proteins and mRNAs.

2) Characteristics of D-NT2RP8004156.1 ([1]), Which was Newly Acquired and Subjected to Full-Length cDNA Sequence Analysis by Us 019.sub.--[1].sub.--1-N0 (SEQ ID NO:41): The entire nucleic acid sequence region of D-NT2RP8004156.1 019.sub.--[1].sub.--1-NA0 (SEQ ID NO:42): Both the entire nucleic acid sequence region and amino acid sequence of D-NT2RP8004156.1 019.sub.--[1].sub.--1-A0 (SEQ ID NO:43): The entire amino acid sequence region of D-NT2RP8004156.1

[0310] The 1st to 119th bases of D-NT2RP8004156.1 (SEQ ID NO:44) is an exon that is not present in NM.sub.--005163.1, which is registered in an existing public DB and serves as a control, lacking homology to NM.sub.--005163.1.

[0311] With this change, the translation initiation point of D-NT2RP8004156.1 shifts toward the 3' side relative to NM.sub.--005163.1, and the 131st base of D-NT2RP8004156.1 becomes the translation initiation point. For this reason, the N-terminal amino acid sequence shortened by 62 residues compared with NM.sub.--005163.1 (SEQ ID NO:264).

019.sub.--[1].sub.--1-N1 (SEQ ID NO:44): A 119-base insert nucleic acid sequence region of D-NT2RP8004156.1 019.sub.--[1].sub.--1-N2 (SEQ ID NO:45): A 130-base 5'UTR region of an ORF whose translation initiation point is the 131st base of D-NT2RP8004156.1 019.sub.--[1]_C-A1 (SEQ ID NO:264): Amino acid sequence region lacking 62 residues of D-NT2RP8004156.1 present in NM.sub.--005163.1

[0312] With this change, the Pfam motif "PH domain" present at the 6th to 108th amino acids of NM.sub.--005163.1 disappeared in D-NT2RP8004156.1.

3) Expression Specificity Analysis and Design of Primers for Real-Time PCR

[0313] To clearly distinguish between the characteristic regions shown above, and examine the respective expression levels thereof, the following regions were used as detection regions. It seemed possible to compare the expression levels of the individual characteristic regions by comparing the expression levels of the detection regions.

019.sub.--01--A specific region present on the N-terminal side of the cDNA pattern [1]: a translation initiation region of the cDNA pattern [1], which was newly subjected to full-length cDNA sequence analysis by us, being a novel region not registered with an existing public DB .fwdarw.Fragment 019.sub.--01 (SEQ ID NO:48) amplified by Primer019.sub.--01F (SEQ ID NO:46) and Primer019.sub.--01R (SEQ ID NO:47) 019.sub.--02--A transcription initiation point region of [2], which is registered with an existing public DB, serving as a control for comparing [1] .fwdarw.Fragment 019.sub.--02 (SEQ ID NO:51) amplified by Primer019.sub.--02F (SEQ ID NO:49) and Primer01902R (SEQ ID NO:50) 019.sub.--03--A common region shared by all of [1] to [2]: a region common to all patterns, serving for control to compare the overall expression levels of the cDNA pattern [1], which was newly subjected to full-length cDNA sequence analysis by us, and the cDNA pattern [2], which is registered with an existing public DB .fwdarw.Fragment 019.sub.--03 (SEQ ID NO:54) amplified by Primer019.sub.--03F (SEQ ID NO:52) and Primer019.sub.--03R (SEQ ID NO:53)

[0314] By mapping the 5'-terminal sequences of about 1.44 million sequences acquired using the oligocap method onto the human genome sequence, and comparatively analyzing them, the exon regions specific for the respective cDNA patterns [1] to [2] shown above, respectively, were found to be expressed at the following frequencies.

[0315] In the cDNA pattern [1], which was newly acquired and analyzed by us, four 5'-terminal sequences were present, the derivation thereof being NT2 cells treated with retinoic acid (RA) to induce differentiation (NT2RP) for all sequences.

[0316] It was found that in the cDNA pattern [2], which is registered with an existing public DB, eleven 5'-terminal sequences were present: 4 sequences derived from brain tissues and 7 sequences from a plurality of other organs and the like were expressed.

[0317] From this result, it was found that the transcription initiation point of [1] was expressed specifically in NT2 cells after differentiation. From the transcription initiation point of [2], expression in a variety of organs was observed. Hence, it was thought that the mechanism of transcription in this chromosome region might be unique to the nerve cell differentiation stage of NT2 cells after differentiation, with a different transcription initiation point being used.

(2) Analysis of Expression Specificity by Real-Time PCR

[0318] To determine what are the states in which the transcription initiation point used for the expression changes, details of expression levels were analyzed by real-time PCR. The results are shown in Table 4 and Table 5.

TABLE-US-00004 TABLE 4 RQ Score Log.sub.10RQ Score 019_01 019_02 019_03 019_01 019_02 019_03 01 NT2RA(-) 0.2 0.1 0.1 -0.73 -1.02 -1.03 02 NT2RA(+) 24 hr 0.5 0.1 0.1 -0.29 -1.16 -1.15 03 NT2RA(+) 48 hr 0.2 0.1 0.1 -0.71 -1.05 -1.10 04 NT2RA(+) 1 week 1.4 0.1 0.2 0.16 -0.84 -0.75 05 NT2RA(+) 5 weeks 94.2 0.4 0.5 1.97 -0.35 -0.34 06 NT2RA(+) 5 weeks, Inh(+) 4.7 0.4 0.5 0.67 -0.37 -0.32 07 NT2 Neuron 0.0 0.1 0.0 -1.40 -1.03 -1.80 08 Brain, Fetal 1.1 1.5 1.4 0.03 0.17 0.16 09 Brain, whole 0.1 0.6 0.6 -1.06 -0.24 -0.25 10 ALZ Visual Cortex 0.1 0.2 0.2 -1.00 -0.74 -0.72 Occipital 11 Mix, viscus tissues 1.0 1.0 1.0 0.0 0.0 0.0 12 Mix, blood cells and 1.7 0.7 0.4 0.23 -0.15 -0.38 related tissues 13 Mix, tumor tissues 2.2 0.7 0.8 0.35 -0.18 -0.09 14 Mix, normal tissues 1.3 0.9 0.9 0.10 -0.02 -0.04 15 Brain, whole PolyA(+) RNA 0.2 0.4 0.3 -0.75 -0.35 -0.46 16 Brain, hippocampus 0.2 0.4 0.3 -0.81 -0.43 -0.51

TABLE-US-00005 TABLE 5 RQ Score Log.sub.10RQ Score 019_01 019_02 019_03 019_01 019_02 019_03 01 NT2RA(-) 0.1 0.1 0.1 -1.01 -1.01 -1.01 02 NT2RA(+) 24 hr 0.2 0.1 0.1 -0.66 -1.23 -1.11 03 NT2RA(+) 48 hr 0.0 0.1 0.1 -1.66 -1.02 -1.08 04 NT2RA(+) 1 week 0.6 0.2 0.2 -0.22 -0.80 -0.72 05 NT2RA(+) 5 weeks 40.2 0.5 0.5 1.60 -0.32 -0.29 06 NT2RA(+) 5 weeks, Inh(+) 2.0 0.5 0.6 0.30 -0.29 -0.25 07 NT2 Neuron 0.0 0.1 0.0 -1.52 -1.04 -1.80 08 Brain, Fetal 0.4 1.5 1.5 -0.36 0.19 0.17 09 Brain, whole 0.2 0.6 0.6 -0.73 -0.21 -0.22 10 ALZ Visual Cortex 0.0 0.2 0.2 -1.40 -0.72 -0.72 Occipital 11 Mix, viscus tissues 1.0 1.0 1.0 0.0 0.0 0.0 12 Mix, blood cells and 1.1 0.9 0.7 0.03 -0.07 -0.13 related tissues 13 Mix, tumor tissues 0.7 0.6 0.6 -0.17 -0.25 -0.19 14 Mix, normal tissues 0.3 0.9 0.9 -0.59 -0.03 -0.04 15 Brain, whole PolyA(+) RNA 0.1 0.6 0.4 -1.11 -0.25 -0.36 16 Brain, hippocampus 0.1 0.5 0.4 -1.08 -0.29 -0.40

[0319] Expression levels were compared using the 16 samples shown in Example 3, including Brain, hippocampus, Brain, whole, Brain, Fetal, ALZ Visual Cortex Occipital, and NT2 cells at 7 different differentiation stages. The comparison was made using the mixed sample of normal visceral tissues shown in Example 3 (Mix, viscus tissues) as an experimental control.

[0320] The ratio of ORF alteration due to transcription initiation point selectivity as compared between 019.sub.--01 (SEQ ID NO:48) and 019.sub.--02 (SEQ ID NO:51) changed greatly depending on NT2 cell differentiation stage. When compared in detail with respect to NT2 cell differentiation, no major difference was observed between the 2 kinds of transcription initiation points shown by 019.sub.--01 (SEQ ID NO:48) and 019.sub.--02 (SEQ ID NO:51) in undifferentiated NT2 cells NT2RA (-) and NT2RA (+) 48 hr, which represents the initial stage in which retinoic acid was added to induce differentiation (Table 4 and Table 5). However, in NT2RA (+) 1 week, which represents an advanced stage of differentiation, the difference widened; in NT2RA (+) 5 weeks, the ratio of transcription from the downstream transcription initiation point shown by 019.sub.--01 (SEQ ID NO:48) increased considerably (Table 4 and Table 5). However, thereafter in NT2RA (+) 5 weeks, Inh (+), the difference decreased; in NT2% Neuron, on the contrary, the ratio of transcription from the known transcription initiation point shown by 019.sub.--02 (SEQ ID NO:51) increased (Table 4 and Table 5). In other tissues, no major difference was observed.

[0321] These results demonstrated that by comparing the expression of the 5'-terminal region of a newly acquired cDNA region shown by the detection region 019.sub.--01 (sequence No. 019-8) (a region close to the transcription initiation point) 019.sub.--[1].sub.--1-N1 (SEQ ID NO:44), it is possible to use the 5'-terminal region as a differentiation marker for detecting cells in nerve cell differentiation or regeneration stages, particularly in the late stage of nerve differentiation or regeneration. It also seems possible to develop a new drug by means of a compound, antibody, siRNA or the like that targets a region that exhibits specificity.

[0322] The following regions also seem to be useful as differentiation markers for detecting cells in the late stage of nerve differentiation or regeneration.

Upstream sequence 019.sub.--[1].sub.--1-N3 (SEQ ID NO:55), which comprises the 195th to 213th bases undergoing priming by Primer019.sub.--01R (SEQ ID NO:47) in D-NT2RP8004156.1 of the cDNA pattern [1]. Region 019.sub.--01 (SEQ ID NO:48) amplified by Primer019.sub.--01F (SEQ ID NO:46) and Primer019.sub.--01R (SEQ ID NO:47) in the cDNA pattern [1]

Example 6

Cluster chr2-2324 (Data Set: 031)

(1) Cluster Analysis

1) Cluster Characteristics

[0323] An analysis was performed on 7 sequences of full-length cDNAs subjected to genome mapping onto the cluster chr2-2324 (Human genome UCSC hg18 (NCBI Build34) chromosome 2, 65,440,000 bp to 65,580,000 bp) [D-NT2RI3005525.1, D-TRACH3029063.1, AY299090.1, C-HEP03447, C-NT2RP7004925, ENST00000356388, NM.sub.--181784.1]. They were classified according to expression pattern difference into 5 kinds, which mainly included the following 2 kinds.

[1] D-NT2RI3005525.1

[2] AY299090.1, C-NT2RP7004925 (AK056479.1), NM.sub.--181784.1

[0324] [1] is a cDNA which was newly acquired and subjected to full-length cDNA by us, and had a different ORF from [2] registered with an existing public DB.

[0325] [1] had a different ORF region because of its expression from a chromosome region located downstream of the known [2], and also because of the presence of the translation initiation point on a new exon lacking identity to [2].

[0326] It was found that the ORF regions present in the 2 kinds of cDNA patterns [1] to [2] cause expression starting at different transcription initiation points, from the same chromosome region, resulting in alterations of the amino acid sequences to produce diverse proteins and mRNAs.

2) Characteristics of D-NT2RI3005525.1 ([1]), which was Newly Acquired and Subjected to Full-Length cDNA by Us 031.sub.--[1].sub.--1-N0 (SEQ ID NO:56): The entire nucleic acid sequence region of D-NT2RI3005525.1 031.sub.--[1].sub.--1-NA0 (SEQ ID NO:57): Both the entire nucleic acid sequence region and amino acid sequence of D-NT2RI3005525.1 031.sub.--[1].sub.--1-A0 (SEQ ID NO:58): The entire amino acid sequence region of D-NT2RI3005525.1

[0327] The sequence at the 1st to 61st bases of D-NT2RI3005525.1 (SEQ ID NO:59) is a variant incorporating an exon that is not present in NM.sub.--181784.1, which is registered with an existing public DB, and serves for control; because of the presence thereof along with the translation initiation point on the exon inserted, the N-terminal amino acids differed by 6 residues, compared with NM.sub.--181784.1 (SEQ ID NO:60).

031.sub.--[1].sub.--1-N1 (SEQ ID NO:59): 61-base insert nucleic acid sequence region of D-NT2RI3005525.1 031.sub.--[1].sub.--1-A1 (SEQ ID NO:60): 6-residue insert amino acid sequence region of D-NT2RI3005525.1 031.sub.--[1].sub.--1-N2 (SEQ ID NO:61): ORF nucleic acid sequence region in 61-base insert region of D-NT2RI3005525.1 031.sub.--[1].sub.--1-A2 (identical to SEQ ID NO:60): ORF amino acid region related to 61-base insert region of D-NT2RI3005525.1

3) Expression Specificity Analysis and Design of Primer for Real-Time PCR

[0328] To clearly distinguish between the characteristic regions shown above, and examine the respective expression levels thereof, the following regions were used as detection regions. It seemed possible to compare the expression levels of the individual characteristic regions by comparing the expression levels of the detection regions.

031.sub.--01--A specific region present on the N-terminal side of the cDNA pattern [1]: a translation initiation region of the cDNA pattern [1], which was newly subjected to full-length cDNA sequence analysis by us, being a novel region not registered with an existing public DB .fwdarw.Fragment 031.sub.--01 (SEQ ID NO:64) amplified by Primer031.sub.--01F (SEQ ID NO:62) and Primer031.sub.--01R (SEQ ID NO:63) 031.sub.--02--A transcription initiation point region of [2], registered with an existing public DB, serving as a control for comparing [1] .fwdarw.Fragment 031.sub.--02 (SEQ ID NO:67) amplified by Primer031.sub.--02F (SEQ ID NO:65) and Primer031.sub.--02R (SEQ ID NO:66) 031.sub.--03--A common region shared by all of [1] to [2]: a region common to all patterns, serving for control to compare the overall expression levels of the cDNA pattern [1], which was newly subjected to full-length cDNA sequence analysis by us, and the cDNA pattern [2], which is registered with an existing public DB .fwdarw.Fragment 031.sub.--03 (SEQ ID NO:70) amplified by Primer031.sub.--03F (SEQ ID NO:68) and Primer03103R (SEQ ID NO:69)

[0329] By mapping the 5'-terminal sequences of about 1.44 million sequences acquired using the oligocap method onto the human genome sequence, and comparatively analyzing them, the exon regions specific for the cDNA patterns [1] and [2] shown above, respectively, were found to be expressed at the following frequencies.

[0330] In the cDNA pattern [1], which was newly acquired and analyzed by us, twenty-eight 5'-terminal sequences were present, the derivations thereof being Brain, whole for 13 sequences (analytical parameter 59,069), Brain, hippocampus for 8 sequences (analytical parameter 57,918), Brain, amygdala for 5 sequences (analytical parameter 58,640), HDPC (Human dermal papilla cells) for 1 sequence (analytical parameter 8,453), and NT2 cells treated with retinoic acid (RA) to induce differentiation for 5 weeks, and thereafter treated with a growth inhibitor for 2 weeks (NT2RI) for 1 sequence (analytical parameter 32,662).

[0331] In the cDNA pattern [2], which is registered with an existing public DB, thirty-five 5'-terminal sequences were present, the derivations thereof being Brain, whole for 10 sequences (analytical parameter 59,069), Brain, cerebellum for 5 sequences (analytical parameter 82,880), Brain, Fetal for 5 sequences (analytical parameter 47,574), Brain, hippocampus for 3 sequences (analytical parameter 57,918), Trachea for 3 sequences (analytical parameter 52,352), Brain, thalamus for 2 sequences (analytical parameter 53,267), NT2 cells treated with retinoic acid (RA) to induce differentiation (NT2RP) for 2 sequences (analytical parameter 39,242), Thymus for 2 sequences (analytical parameter 70,578), NT2 cells treated with retinoic acid (RA) to induce differentiation for 5 weeks, and thereafter treated with a growth inhibitor for 2 weeks (NT2RI) for 1 sequence (analytical parameter 32,662), Testis for 1 sequence (analytical parameter 90,188), and Uterus for 1 sequence (analytical parameter 49,561).

[0332] From this result, it was found that the transcription initiation point of [1] was expressed abundantly in the brain, particularly in Brain, hippocampus and Brain, amygdala. It was found that the transcription initiation point of [2] was also abundantly expressed in the brain, but expressed in a wider variety of tissues compared with the transcription initiation point of [1]. From this result, it was thought that the mechanism of transcription in this chromosome region might be unique to particular portions of the brain, with a different is transcription initiation point being used.

(2) Analysis of Expression Specificity by Real-Time PCR

[0333] To determine what are the portions and states in which the transcription initiation point used for the expression changes, details of expression levels were analyzed by real-time PCR. The results are shown in Table 6 and Table 7.

TABLE-US-00006 TABLE 6 RQ Score Log.sub.10RQ Score 031_01 031_02 031_03 031_01 031_02 031_03 01 NT2RA(-) 0.0 0.1 0.2 -3.12 -0.85 -0.82 02 NT2RA(+) 24 hr 0.0 0.5 0.8 -2.48 -0.34 -0.09 03 NT2RA(+) 48 hr 0.0 0.4 0.9 -2.48 -0.41 -0.03 04 NT2RA(+) 1 week 0.0 0.2 0.4 -2.32 -0.81 -0.43 05 NT2RA(+) 5 weeks 0.9 0.4 0.4 -0.03 -0.45 -0.39 06 NT2RA(+) 5 weeks, Inh(+) 2.3 0.4 0.5 0.36 -0.37 -0.29 07 NT2 Neuron 0.1 0.0 0.1 -1.00 -1.51 -0.83 08 Brain, Fetal 0.5 1.7 2.1 -0.33 0.22 0.32 09 Brain, whole 15.4 1.4 2.1 1.19 0.16 0.31 10 ALZ Visual Cortex 8.1 0.4 0.6 0.91 -0.44 -0.20 Occipital 11 Mix, viscus tissues 1.0 1.0 1.0 0.0 0.0 0.0 12 Mix, blood cells and 0.6 0.7 0.9 -0.21 -0.17 -0.06 related tissues 13 Mix, tumor tissues 0.5 0.4 0.5 -0.31 -0.35 -0.29 14 Mix, normal tissues 0.9 0.9 1.2 -0.04 -0.04 0.08 15 Brain, whole PolyA(+) RNA 4.2 0.2 0.3 0.63 -0.71 -0.59 16 Brain, hippocampus 2.8 0.1 0.2 0.44 -0.87 -0.74 17 Brain, cerebellum 0.0 0.2 0.3 -1.61 -0.65 -0.55 18 Brain, amygdala 3.1 0.1 0.2 0.49 -0.95 -0.75 19 Brain, caudate nucleus 0.2 0.1 0.1 -0.78 -1.00 -0.88 20 Brain, corpus callosum 0.2 0.1 0.1 -0.61 -1.10 -1.02 21 Brain, substantia nigra 0.2 0.1 0.2 -0.72 -0.85 -0.78 22 Brain, thalamus 0.2 0.1 0.1 -0.75 -1.16 -1.05 23 Brain, subthalamic nucleus 0.1 0.1 0.1 -1.16 -1.24 -0.96

TABLE-US-00007 TABLE 7 RQ Score Log.sub.10RQ Score 031_01 031_02 031_03 031_01 031_02 031_03 01 Brain, Fetal 0.3 1.9 1.8 -0.46 0.28 0.27 02 Brain, whole 10.2 1.3 1.8 1.01 0.10 0.26 03 ALZ Visual Cortex 5.6 0.4 0.6 0.75 -0.44 -0.21 Occipital 04 Mix, viscus tissues 1.0 1.0 1.0 0.0 0.0 0.0 05 Mix, blood cells and 0.5 0.8 0.9 -0.31 -0.11 -0.03 related tissues 06 Mix, tumor tissues 0.8 0.7 0.8 -0.11 -0.17 -0.08 07 Mix, normal tissues 0.8 1.1 1.3 -0.11 0.05 0.13 08 Brain, whole PolyA(+) RNA 3.0 0.1 0.3 0.48 -0.82 -0.57 09 Brain, hippocampus 2.1 0.1 0.2 0.32 -0.88 -0.72 10 Brain, cerebellum 0.0 0.1 0.2 -1.96 -0.87 -0.80 11 Brain, amygdala 2.3 0.1 0.2 0.37 -0.97 -0.75 12 Brain, caudate nucleus 0.1 0.1 0.1 -0.96 -1.06 -0.96 13 Brain, corpus callosum 0.2 0.1 0.1 -0.82 -1.16 -1.09 14 Brain, substantia nigra 0.1 0.1 0.1 -0.99 -1.01 -0.95 15 Brain, thalamus 0.1 0.0 0.1 -1.05 -1.34 -1.23 16 Brain, subthalamic nucleus 0.0 0.1 0.1 -1.37 -1.28 -1.03

[0334] Expression levels were compared using the 23 kinds of samples shown in Example 3, including 11 kinds of brain tissues and NT2 cells at 7 different differentiation stages. The comparison was made using the mixed sample of normal visceral tissues shown in Example 3 (Mix, viscus tissues) as an experimental control.

[0335] The ratio of ORF alteration due to transcription initiation point selectivity as compared between 031.sub.--01 (SEQ ID NO:64) and 031.sub.--02 (SEQ ID NO:67) changed greatly among the following brain portions and NT2 cell differentiation stages.

[0336] In the brain, particularly in Brain, hippocampus and Brain, amygdala, the transcription from the downstream transcription initiation point shown by 031.sub.--01 (SEQ ID NO:64) was abundant (Table 6 and Table 7). No major difference was observed among the other portions of the brain.

[0337] Furthermore, when compared in detail with respect to NT2 cell differentiation, the expression of the mRNA transcribed from the transcription initiation point shown by 031.sub.--02 (SEQ ID NO:67), registered with an existing public DB, was abundant in undifferentiated NT2 cells NT2RA (-) and NT2RA (+) 1 week, which represents the initial stage in which retinoic acid was added to induce differentiation; however, in NT2RA (+) 5 weeks, predicted to be rich in nerve cells after differentiation, the expression level reversed; in the subsequent stages of NT2RA (+) 5 weeks, Inh (+), and NT2 Neuron, the expression of the mRNA transcribed from the downstream transcription initiation point shown by 031.sub.--01 (SEQ ID NO:64) was abundant (Table 6 and Table 7).

[0338] These results demonstrated that by comparing the expression of the 5'-terminal region 031.sub.--[1]-N1 (SEQ ID NO:59) of a newly acquired cDNA shown by the detection region 031.sub.--01 (SEQ ID NO:64) (a region close to the transcription initiation point), it is possible to use the 5'-terminal region as a marker specific for the brain, particularly for nerve-rich portions such as Brain, hippocampus (nerve differentiation, nerve regeneration marker and the like), and as a differentiation marker for detecting cells in nerve cell differentiation or regeneration stages, particularly those that have differentiated into a nerve. It also seems possible to develop a new drug by means of a compound, antibody, siRNA or the like that targets a region that exhibits specificity.

[0339] The following regions also seem to be useful as markers specific to the brain, particularly to the nerve-rich portions such as Brain, hippocampus (nerve differentiation, nerve regeneration marker and the like), and as differentiation markers for detecting nerve cells in differentiation or regeneration stages, particularly those that have differentiated into nerves.

Upstream sequence 031.sub.--[1].sub.--1-N3 (SEQ ID NO:71), which comprises the 80th to 101st bases undergoing priming by Primer031.sub.--01R (SEQ ID NO:63) in D-NT2RI3005525.1 of the cDNA pattern [1]. Region 031.sub.--01 (SEQ ID NO:64) amplified by Primer031.sub.--01F (SEQ ID NO:62) and Primer031.sub.--01R (SEQ ID NO:63) in the cDNA pattern [1]

Example 7

Cluster chr7-2007 (Data Set: 067)

(1) Cluster Analysis

1) Cluster Characteristics

[0340] An analysis was performed on 10 sequences of full-length cDNAs subjected to genome mapping onto the cluster chr7-2007 (Human genome UCSC hg18 (NCBI Build34) chromosome 7, 26,400,000 bp to 26,850,000 bp) [D-NT2RP8004592.1, D-NT2RP7010844.1, Z-NT2RP7020087-01, BC002893.2, BC036044.1, ENST00000338865, ENST00000345317, NM.sub.--003930.3, XM.sub.--498174.1, XM.sub.--499404.1]. They were classified according to expression pattern difference into 5 kinds, which mainly included the following 2 kinds.

[1] D-NT2RP8004592.1

[2] BC002893.2, BC036044.1, NM.sub.--003930.3

[0341] [1] is a cDNA which was newly acquired and subjected to full-length cDNA sequence analysis by us, having a different ORF from [2], which is registered with an existing public DB.

[0342] [1] had a different ORF region from [2] because of its expression from a chromosome region located downstream of the known [2], and hence a shift of the translation initiation point toward the C-terminal side.

[0343] It was found that the ORF regions present in the 2 kinds of cDNA patterns [1] and [2] cause expression starting at different transcription initiation points, from the same chromosome region, resulting in alterations of the amino acid sequences to produce diverse proteins and mRNAs.

2) Characteristics of D-NT2RP8004592.1 ([1]), which was Newly Acquired and Subjected to Full-Length cDNA Sequence Analysis by Us 067.sub.--[1].sub.--1-N0 (SEQ ID NO:72): The entire nucleic acid sequence region of D-NT2RP8004592.1 067.sub.--[1].sub.--1-NA0 (SEQ ID NO:73): Both the entire nucleic acid sequence region and amino acid sequence of D-NT2RP8004592.1 067.sub.--[1].sub.--1-A0 (SEQ ID NO:74): The entire amino acid sequence region of D-NT2RP8004592.1

[0344] The exon at the 1st to 169th bases of D-NT2RP8004592.1 (SEQ ID NO:75) (1st exon) is an exon that is not present in NM.sub.--003930.3, which is registered with an existing public DB, and serves for control, lacking homology thereto. The exon at the 1st to 359th bases of NM.sub.--003930.3 (first exon) is an exon that is not present in D-NT2RP8004592.1, lacking homology thereto. The second exon and beyond are present commonly in both cDNAs. The translation termination point of the ORF of NM.sub.--003930.3 is the same as that of D-NT2RP8004592.1; however, because the translation initiation point is present on the 1st exon, which is not present in D-NT2RP8004592.1, the N-terminus of the ORF differed. Because the translation initiation point of D-NT2RP8004592.1 is present on the 6th exon, which is shared by NM.sub.--003930.3, the amino acid sequence on the N-terminal side shortened by 172 residues, compared with NM.sub.--003930.3 (SEQ ID NO:265).

067.sub.--[1].sub.--1-N 1 (SEQ ID NO:75): A 169-base insert nucleic acid sequence region of D-NT2RP8004592.1 067.sub.--[1].sub.--1-N 2 (SEQ ID NO:76): A 619-base 5'UTR region of an ORF whose translation initiation point is the 620th base of D-NT2RP8004592.1 067.sub.--[1]_C-A1 (SEQ ID NO:265): A 172-residue deletion amino acid sequence region of D-NT2RP8004592.1 present in NM.sub.--003930.3

3) Expression Specificity Analysis and Design of Primers for Real-Time PCR

[0345] To clearly distinguish between the characteristic regions shown above, and examine the respective expression levels thereof, the following regions were used as detection regions. It seemed possible to compare the expression levels of the individual characteristic regions by comparing the expression levels of the detection regions.

067.sub.--01--A specific region present on the N-terminal side of the cDNA pattern [1]: a translation initiation region of the cDNA pattern [1], which was newly subjected to full-length cDNA sequence analysis by us, being a novel region not registered with an existing public DB .fwdarw.Fragment 067.sub.--01 (SEQ ID NO:79) amplified by Primer067.sub.--01F (SEQ ID NO:77) and Primer067.sub.--01R (SEQ ID NO:78) 067.sub.--03--Transcription initiation point region of [2], which is registered with an existing public DB, serving as a control for comparing [1] .fwdarw.Fragment 067.sub.--03 (SEQ ID NO:82) amplified by Primer067.sub.--03F (SEQ ID NO:80) and Primer067.sub.--03R (SEQ ID NO:81) 067.sub.--04--A common region shared by all of [1] to [2]: a region common to all patterns, serving for control to compare the overall expression levels of the cDNA pattern [1], which was newly subjected to full-length cDNA sequence analysis by us, and the cDNA pattern [2], which is registered with an existing public DB .fwdarw.Fragment 067.sub.--04 (SEQ ID NO:85) amplified by Primer067.sub.--04F (SEQ ID NO:83) and Primer067.sub.--04R (SEQ ID NO:84)

[0346] By mapping the 5'-terminal sequences of about 1.44 million sequences acquired using the oligocap method onto the human genome sequence, and comparatively analyzing them, the exon regions specific for the cDNA patterns [1] to [2] shown above, respectively, were found to be expressed at the following frequencies.

[0347] In the cDNA pattern [1], which was newly acquired and analyzed by us, eighteen 5'-terminal sequences were present, the derivations thereof being NT2 cells treated with retinoic acid (RA) to induce differentiation (NR2RP) for 16 sequences (analytical parameter 39,242), and NT2 cells treated with retinoic acid (RA) to induce differentiation for 5 weeks, and thereafter treated with a growth inhibitor for 2 weeks (NT2RI) for 2 sequences (analytical parameter 32,662); all were derived from NT2 cells after differentiation.

[0348] In the cDNA pattern [2], which is registered with an existing public DB, one hundred twenty-two (122) 5'-terminal sequences were present, the derivations thereof being NT2 cells for 45 sequences, brain tissues for 25 sequences, and others for 47 sequences.

[0349] From this result, it was found that the transcription initiation point of [1] was expressed specifically in NT2 cells after differentiation. From the transcription initiation point of [2], expression was observed in NT2 cells, brain tissues and various other tissues. Hence, it was suggested that in this chromosome region, the mechanism of transcription may differ, and may result in different transcription initiation points being used only at the nerve cell differentiation states of NT2 cells after differentiation.

(2) Analysis of Expression Specificity by Real-Time PCR

[0350] To determine what are the states in which the transcription initiation point used for the expression changes, details of expression levels were analyzed by real-time PCR. The results are shown in Table 8 and Table 9.

TABLE-US-00008 TABLE 8 RQ Score Log.sub.10RQ Score 067_01 067_03 067_04 067_01 067_03 067_04 01 NT2RA(-) 0.0 0.0 0.0 -1.58 -2.04 -1.79 02 NT2RA(+) 24 hr 3.1 0.5 0.6 0.49 -0.29 -0.19 03 NT2RA(+) 48 hr 8.6 1.6 1.5 0.93 0.21 0.19 04 NT2RA(+) 1 week 21.6 1.5 1.9 1.33 0.16 0.27 05 NT2RA(+) 5 weeks 103.8 3.3 11.3 2.02 0.52 1.05 06 NT2RA(+) 5 weeks, Inh(+) 3.2 1.2 2.6 0.51 0.07 0.41 07 NT2 Neuron 30.3 0.7 0.4 1.48 -0.16 -0.37 08 Brain, Fetal 0.1 0.2 0.3 -0.95 -0.66 -0.51 09 Brain, whole 0.9 0.6 1.0 -0.05 -0.19 0.01 10 ALZ Visual Cortex 0.4 0.3 0.7 -0.36 -0.51 -0.17 Occipital 11 Mix, viscus tissues 1.0 1.0 1.0 0.0 0.0 0.0 12 Mix, blood cells and 2.3 1.9 2.5 0.36 0.27 0.40 related tissues 13 Mix, tumor tissues 0.7 0.3 0.3 -0.14 -0.46 -0.49 14 Mix, normal tissues 2.3 1.0 1.2 0.37 -0.01 0.07 15 Brain, whole PolyA(+) RNA 0.2 0.2 0.5 -0.71 -0.66 -0.29 16 Brain, hippocampus 0.1 0.2 0.4 -0.94 -0.75 -0.35

TABLE-US-00009 TABLE 9 RQ Score Log.sub.10RQ Score 067_01 067_03 067_04 067_01 067_03 067_04 01 NT2RA(-) 0.0 0.0 0.0 -1.53 -2.04 -1.78 02 NT2RA(+) 24 hr 3.2 0.6 0.8 0.50 -0.23 -0.11 03 NT2RA(+) 48 hr 10.6 1.6 1.7 1.03 0.21 0.22 04 NT2RA(+) 1 week 25.0 1.5 1.9 1.40 0.18 0.28 05 NT2RA(+) 5 weeks 125.3 3.7 13.6 2.10 0.57 1.13 06 NT2RA(+) 5 weeks, Inh(+) 4.4 1.3 3.4 0.64 0.11 0.53 07 NT2 Neuron 25.5 0.6 0.4 1.41 -0.19 -0.37 08 Brain, Fetal 0.2 0.2 0.3 -0.63 -0.64 -0.48 09 Brain, whole 1.0 0.7 1.2 -0.01 -0.16 0.10 10 ALZ Visual Cortex 0.4 0.3 0.7 -0.35 -0.47 -0.15 Occipital 11 Mix, viscus tissues 1.0 1.0 1.0 0.0 0.0 0.0 12 Mix, blood cells and 2.1 2.1 3.2 0.32 0.32 0.50 related tissues 13 Mix, tumor tissues 0.4 0.3 0.4 -0.45 -0.48 -0.42 14 Mix, normal tissues 2.7 0.9 1.2 0.44 -0.02 0.08 15 Brain, whole PolyA(+) RNA 0.2 0.3 0.7 -0.60 -0.54 -0.16 16 Brain, hippocampus 0.1 0.2 0.6 -0.88 -0.62 -0.22

[0351] Expression levels were compared using the 16 samples shown in Example 3, including Brain, hippocampus, Brain, whole, Brain, Fetal, Alzheimer patient cerebral cortex (ALZ Visual Cortex Occipital), and NT2 cells at 7 different differentiation stages and the like. The comparison was made using the mixed sample of normal visceral tissues shown in Example 3 (Mix, viscus tissues) as an experimental control.

[0352] The ratio of ORF alteration due to transcription initiation point selectivity as compared between 067.sub.--01 (SEQ ID NO:79) and 067.sub.--03 (SEQ ID NO:82) changed greatly depending on NT2 cell differentiation stage. When compared in detail with respect to NT2 cell differentiation, the ratio of the transcription from the transcription initiation point shown by 067.sub.--01 (SEQ ID NO:79) was higher than that from the transcription initiation point shown by 067.sub.--03 (SEQ ID NO:82) in NT2RA (+) 1 week to NT2RA (+) 5 weeks, advanced stages of differentiation, compared with undifferentiated NT2 cells NT2RA (-) and NT2RA (+) 48 hr, which represents the initial stage in which retinoic acid was added to induce differentiation (Table 8 and Table 9). Subsequently, in NT2RA (+) 5 weeks, Inh (+), the difference narrowed, but in NT2 Neuron, the ratio of transcription represented by 067.sub.--01 (SEQ ID NO:79) increased again (Table 8 and Table 9).

[0353] These results demonstrated that by comparing the expression of the 5'-terminal region (a region close to the transcription initiation point) 067.sub.--[1]-N1 (SEQ ID NO:75) of a newly acquired cDNA shown by the detection region 067.sub.--01 (SEQ ID NO:79), it is possible to use the 5'-terminal region as a differentiation marker for detecting cells in nerve cell differentiation or regeneration stages, particularly those that have differentiated into a nerve. It also seems possible to develop a new drug by means of a compound, antibody, siRNA or the like that targets a region that exhibits specificity.

[0354] The following regions also seem to be useful as differentiation markers for detecting nerve cells in differentiation or regeneration stages, particularly those that have differentiated into nerves.

Upstream sequence 067.sub.--[1].sub.--1-N3 (SEQ ID NO:86), which comprises the 65th to 84th bases undergoing priming by Primer067.sub.--01R (SEQ ID NO:78) in D-NT2RP8004592.1 of the cDNA pattern [1]. Region 067.sub.--01 (SEQ ID NO:79) amplified by Primer067.sub.--01F (SEQ ID NO:77) and Primer067.sub.--01R (SEQ ID NO:78) in the cDNA pattern [1].

Example 8

Cluster chrX-900 (Data Set: 122)

(1) Cluster Analysis

1) Cluster Characteristics

[0355] An analysis was performed on 7 full-length cDNAs subjected to genome mapping onto the cluster chrX-900 (Human genome UCSC hg18 (NCBI Build34) chromosome X, 43,380,000 bp to 43,500,000 bp) [D-NT2RI2014164.1, D-BRAMY2029564.1, D-BRAMY2029564.1, BC022494.1, ENST00000265833, M69177.1, NM.sub.--000898.3]. They were classified according to expression pattern difference into 4 kinds, which mainly included the following 3 kinds.

[1] D-NT2RI2014164.1

[2] D-BRAMY2029564.1

[3] BC022494.1, ENST00000265833, M69177.1, NM.sub.--000898.3

[0356] [1] is a cDNA which was newly acquired and subjected to full-length cDNA sequence analysis by us, and had a different ORF from that of [3], which is registered with an existing public DB, because of the expression thereof from a chromosome region located downstream of the known [3].

[0357] [2] is a cDNA which was newly acquired and subjected to full-length cDNA sequence analysis by us, having a different ORF from that of the known [3] because of the insertion of an exon different from the other patterns in the ORF region [3].

[0358] It was found that the ORF regions present in the 3 kinds of cDNA patterns [1] to [3] cause expression starting at different transcription initiation points, from the same chromosome region, and have different splice patterns, such as exon insertions, resulting in alterations of the amino acid sequences to produce diverse proteins and mRNAs.

2) Characteristics of D-NT2RI2014164.1 ([1]), which was Newly Acquired and Subjected to Full-Length cDNA Sequence Analysis by Us 122.sub.--[1].sub.--1-N0 (SEQ ID NO:87): The entire nucleic acid sequence region of D-NT2RI2014164.1 122.sub.--[1].sub.--1-NA0 (SEQ ID NO:88): Both the entire nucleic acid sequence region and amino acid sequence of D-NT2RI2014164.1 122.sub.--[1].sub.--1-A0 (SEQ ID NO:89): The entire amino acid sequence region of D-NT2RI2014164.1

[0359] The sequence at the 1st to 156th bases of D-NT2RI2014164.1 (SEQ ID NO:90) is an exon that is not present in NM.sub.--000898.3, which is registered with an existing public DB, and serves for control, lacking homology to NM.sub.--000898.3. With this change, the translation initiation point of D-NT2RI2014164.1 shifts toward the 3' side relative to NM.sub.--000898.3, and the 162nd base of D-NT2RI2014164.1 becomes the translation initiation point. For this reason, the amino acid sequence shortened by 16 residues, compared with NM.sub.--000898.3 (SEQ ID NO:266).

[0360] The 98-base exon present at the 1,274th to 1,371st bases of NM.sub.--000898.3 (SEQ ID NO:95) is lacked and not present in the region at the 1,250th to 1,251st bases of D-NT2RI2014164.1 (SEQ ID NO:92).

[0361] With this change, because of a translation frame change to cause the termination of the ORF at a stop codon different from that of NM.sub.--000898.3, the C-terminal amino acids differed by 48 residues, compared with NM.sub.--000898.3 (SEQ ID NO:93). 122.sub.--[1].sub.--1-N1 (SEQ ID NO:90): A 156-base insert nucleic acid sequence region of D-NT2RI2014164.1

122.sub.--[1].sub.--1-N2 (SEQ ID NO:91): A 161-base 5'UTR region of an ORF whose translation initiation point is the 162nd base of D-NT2RI2014164.1 122.sub.--[1].sub.--1-N3 (SEQ ID NO:92): A deletion nucleic acid sequence region of D-NT2RI2014164.1 122.sub.--[1].sub.--1-A1 (SEQ ID NO:93): Amino acid sequence region 122.sub.--[1].sub.--1-N4 altered as a result of deletion of D-NT2RI2014164.1 (identical to SEQ ID NO:92): an ORF nucleic acid region in the deletion region of D-NT2RI2014164.1 122.sub.--[1].sub.--1-A2 (SEQ ID NO:94): An ORF amino acid sequence region related to the deletion region of D-NT2RI2014164.1 122.sub.--[1]_C-N1 (SEQ ID NO:95): A 98-base exon nucleic acid sequence present at the 1,274th to 1,371th bases of NM.sub.--000898.3 inserted into the region at the 1,250th to 1,251st bases of D-NT2RI2014164.1 122.sub.--[1]_C-A1 (SEQ ID NO:96): A 33-residue amino acid sequence related to the 98-base exon nucleic acid sequence present at the 1,274th to 1,371st bases of NM.sub.--000898.3 inserted into the region at the 1,250th to 1,251st bases of D-NT2RI2014164.1 122.sub.--[1]_C-A2 (SEQ ID NO:266): A 16-residue deletion amino acid sequence region of D-NT2RI2014164.1 present in NM.sub.--000898.3 3) Characteristics of D-BRAMY2029564.1 ([2]), which was Newly Acquired and Subjected to Full-Length cDNA Sequence Analysis by Us 122.sub.--[2].sub.--1-N0 (SEQ ID NO:97): The entire nucleic acid sequence region of D-BRAMY2029564.1 122.sub.--[2].sub.--1-NA0 (SEQ ID NO:98): Both the entire nucleic acid sequence region and amino acid sequence of D-BRAMY2029564.1 122.sub.--[2].sub.--1-A0 (SEQ ID NO:99): The entire amino acid sequence region of D-BRAMY2029564.1

[0362] The 90th to 140th bases of D-BRAMY2029564.1 (SEQ ID NO:100) is an exon that is not present in NM.sub.--000898.3, which is registered with an existing public DB, and serves for control, lacking homology to NM.sub.--000898.3. With this change, the translation initiation point of D-BRAMY2029564.1 shifts toward the 3' side, compared with NM.sub.--000898.3, and the 143rd base of D-BRAMY2029564.1 becomes a translation initiation point. For this reason, the amino acid sequence shortened by 16 residues, compared with NM.sub.--000898.3 (identical to SEQ ID NO:266).

122.sub.--[2].sub.--1-N1 (SEQ ID NO:100): A 43-base insert nucleic acid sequence region of D-BRAMY2029564.1 122.sub.--[2].sub.--1-N2 (SEQ ID NO:101): A 142-base 5'UTR region of an ORF whose translation initiation point is the 143rd base of D-BRAMY2029564.1 122.sub.--[2]_C-A1 (identical to SEQ ID NO:266): A 16-residue deletion amino acid sequence region of D-BRAMY2029564.1 present in NM.sub.--000898.3

4) Expression Specificity Analysis and Design of Primers for Real-Time PCR

[0363] To clearly distinguish between the characteristic regions shown above, and examine the respective expression levels thereof, the following regions were used as detection regions. It seemed possible to compare the expression levels of the individual characteristic regions by comparing the expression levels of the detection regions.

122.sub.--01--A specific region present on the N-terminal side of the cDNA pattern [1]: a translation initiation region of the cDNA pattern [1], which was newly subjected to full-length cDNA sequence analysis by us, being a novel region not registered with an existing public DB .fwdarw.Fragment 122.sub.--01 (SEQ ID NO:104) amplified by Primer122.sub.--01F (SEQ ID NO:102) and Primer122.sub.--01R (SEQ ID NO:103) 122.sub.--02--A region specifically extracted by means of the sequence information on regions of the exon insertion of cDNA pattern [2]: an ORF-altering exon insert region in the cDNA pattern [2], which was newly subjected to full-length cDNA sequence analysis by us .fwdarw.Fragment 122.sub.--02 (SEQ ID NO:107) amplified by Primer122.sub.--02F (SEQ ID NO:105) and Primer122.sub.--02R (SEQ ID NO:106) 122.sub.--03--A transcription initiation point region of [3], which is registered with an existing public DB, serving as a control for comparing [1] and [2] .fwdarw.Fragment 122.sub.--03 (SEQ ID NO:110) amplified by Primer122.sub.--03F (SEQ ID NO:108) and Primer122.sub.--03R (SEQ ID NO:109) 122.sub.--04--A common region shared by all of [1] to [3]: a region common to all patterns, serving for control to compare the overall expression levels of the cDNA patterns [1] and [2], which were newly subjected to full-length cDNA sequence analysis by us, and the cDNA pattern [3], which is registered with an existing public DB .fwdarw.Fragment 122.sub.--04 (SEQ ID NO:113) amplified by Primer122.sub.--04F (SEQ ID NO:111) and Primer122.sub.--04R (SEQ ID NO:112)

[0364] By mapping the 5'-terminal sequences of about 1.44 million sequences acquired using the oligocap method onto the human genome sequence, and comparatively analyzing them, the exon regions specific to the cDNA patterns [1] to [3] shown above, respectively, were found to be expressed at the following frequencies.

[0365] In the cDNA pattern [1], which was newly acquired and analyzed by us, four 5'-terminal sequences were present, the derivations thereof being NT2 cells treated with retinoic acid (RA) and treated with a growth inhibitor to induce nerve differentiation, followed by nerve concentration and recovery (NT2NE) for 2 sequences (analytical parameter 16,337), and NT2 cells treated with retinoic acid (RA) to induce differentiation for 5 weeks, and thereafter treated with a growth inhibitor for 2 weeks (NT2RI) for 2 sequences (analytical parameter 32,662).

[0366] In the cDNA pattern [2], which was newly acquired and analyzed by us, two 5'-terminal sequences were present, the derivation thereof being Brain, amygdala for the 2 sequences (analytical parameter 58,640).

[0367] In the cDNA pattern [3], which is registered with an existing public DB, fifty-nine 5'-terminal sequences were present, the derivations thereof being Uterus for 11 sequence (analytical parameter 49,561), brain tissues for 19 sequences, and a variety of other tissues for the other sequences.

[0368] From this result, it was found that the transcription initiation point of [1] was abundantly expressed in differentiated NT2 cells. It was also found that the exon insertion pattern [2] was abundantly expressed in the brain. The transcription initiation point of [3] was expressed in various tissues. Hence, it was thought that the mechanism of transcription or splice pattern in this chromosome region might be unique to particular tissues such as the brain and nerve cells after differentiation, to alter amino acids, with a selection mechanism arising for mRNA pattern changes resulting in the expression of different proteins.

(2) Analysis of Expression Specificity by Real-Time PCR

[0369] To detect protein expression diversity changes due to transcription initiation point or exon selectivity among different tissues, details of expression levels were analyzed by real-time PCR. The results are shown in Table 10 and Table 11.

TABLE-US-00010 TABLE 10 RQ Score Log.sub.10RQ Score 122_01 122_02 122_03 122_04 122_01 122_02 122_03 122_04 01 NT2RA(-) 0.0 0.0 0.0 0.0 -1.89 -1.37 -1.82 -1.89 02 NT2RA(+) 24 hr 0.3 0.1 0.0 0.0 -0.59 -0.99 -2.55 -2.85 03 NT2RA(+) 48 hr 1.3 0.6 0.0 0.0 0.11 -0.25 -2.28 -2.58 04 NT2RA(+) 1 week 3.4 1.3 0.1 0.0 0.54 0.10 -1.16 -1.65 05 NT2RA(+) 5 weeks 0.3 0.2 0.1 0.1 -0.51 -0.72 -1.00 -1.09 06 NT2RA(+) 5 weeks, Inh(+) 0.5 0.3 0.2 0.2 -0.28 -0.59 -0.61 -0.69 07 NT2 Neuron 5.1 0.6 0.0 0.0 0.71 -0.19 -1.34 -2.21 08 Brain, Fetal 0.6 1.7 0.4 0.2 -0.19 0.22 -0.43 -0.73 09 Brain, whole 2.3 13.4 1.1 0.5 0.36 1.13 0.02 -0.26 10 ALZ Visual Cortex 1.0 6.5 0.9 0.4 -0.01 0.82 -0.07 -0.42 Occipital 11 Mix, viscus tissues 1.0 1.0 1.0 1.0 0.0 0.0 0.0 0.0 12 Mix, blood cells and 1.1 2.1 1.1 0.7 0.06 0.33 0.03 -0.13 related tissues 13 Mix, tumor tissues 0.5 0.2 0.2 0.3 -0.33 -0.61 -0.67 -0.53 14 Mix, normal tissues 3.7 4.6 1.7 1.3 0.57 0.67 0.23 0.12 15 Brain, whole PolyA(+) RNA 0.3 7.2 0.7 0.3 -0.48 0.86 -0.17 -0.48 16 Brain, hippocampus 0.4 4.6 0.9 0.5 -0.44 0.66 -0.05 -0.35 17 Brain, cerebellum 0.2 2.9 0.5 0.2 -0.81 0.47 -0.31 -0.68 18 Brain, amygdala 0.5 4.2 1.0 0.5 -0.32 0.62 0.00 -0.29 19 Brain, caudate nucleus 0.4 4.5 0.9 0.7 -0.35 0.66 -0.06 -0.17 20 Brain, corpus callosum 0.2 0.6 1.1 0.6 -0.64 -0.20 0.05 -0.19 21 Brain, substantia nigra 0.4 2.2 1.1 0.6 -0.44 0.35 0.02 -0.23 22 Brain, thalamus 0.2 4.0 0.6 0.3 -0.76 0.60 -0.23 -0.48 23 Brain, subthalamic nucleus 0.1 0.8 0.8 0.4 -1.12 -0.09 -0.07 -0.43

TABLE-US-00011 TABLE 11 RQ Score Log.sub.10RQ Score 122_01 122_02 122_03 122_04 122_01 122_02 122_03 122_04 01 NT2RA(-) 0.0 0.0 0.0 0.0 -1.68 -1.42 -1.85 -1.83 02 NT2RA(+) 24 hr 0.8 0.2 0.0 0.0 -0.08 -0.82 -2.32 -2.79 03 NT2RA(+) 48 hr 3.4 0.7 0.0 0.0 0.54 -0.15 -2.32 -2.70 04 NT2RA(+) 1 week 8.5 2.3 0.1 0.0 0.93 0.36 -1.23 -1.65 05 NT2RA(+) 5 weeks 0.8 0.3 0.1 0.1 -0.11 -0.52 -0.98 -1.15 06 NT2RA(+) 5 weeks, Inh(+) 1.4 0.5 0.3 0.2 0.16 -0.32 -0.55 -0.62 07 NT2 Neuron 14.1 0.7 0.0 0.0 1.15 -0.18 -1.36 -2.21 08 Brain, Fetal 1.6 2.7 0.4 0.2 0.21 0.43 -0.45 -0.75 09 Brain, whole 7.5 19.4 1.4 0.6 0.87 1.29 0.13 -0.21 10 ALZ Visual Cortex 3.0 10.9 1.0 0.4 0.48 1.04 -0.02 -0.40 Occipital 11 Mix, viscus tissues 1.0 1.0 1.0 1.0 0.0 0.0 0.0 0.0 12 Mix, blood cells and 1.3 4.0 1.0 0.4 0.11 0.60 0.01 -0.42 related tissues 13 Mix, tumor tissues 1.2 0.5 0.2 0.2 0.08 -0.34 -0.64 -0.74 14 Mix, normal tissues 5.0 11.8 2.0 1.8 0.70 1.07 0.30 0.25 15 Brain, whole PolyA(+) RNA 1.1 12.0 0.7 0.4 0.04 1.08 -0.13 -0.45 16 Brain, hippocampus 1.2 8.0 1.0 0.5 0.06 0.90 -0.01 -0.30 17 Brain, cerebellum 0.4 4.0 0.5 0.2 -0.43 0.60 -0.30 -0.69 18 Brain, amygdala 0.9 5.6 0.9 0.5 -0.03 0.75 -0.03 -0.31 19 Brain, caudate nucleus 1.1 6.6 1.1 0.5 0.03 0.82 0.05 -0.27 20 Brain, corpus callosum 0.5 1.1 1.1 0.6 -0.26 0.04 0.04 -0.21 21 Brain, substantia nigra 0.8 2.6 0.7 0.4 -0.10 0.41 -0.19 -0.39 22 Brain, thalamus 0.4 5.2 0.5 0.3 -0.35 0.72 -0.32 -0.58 23 Brain, subthalamic nucleus 0.2 0.9 0.6 0.3 -0.77 -0.06 -0.25 -0.57

[0370] Expression levels were compared using the 23 kinds of samples shown in Example 3, including 11 kinds of brain tissues and NT2 cells at 7 different differentiation stages. The comparison was made using the mixed sample of normal visceral tissues shown in Example 3 (Mix, viscus tissues) as an experimental control.

[0371] The ratio of ORF alteration due to transcription initiation point selectivity and exon selectivity as compared among 122.sub.--01 (SEQ ID NO:104), 122.sub.--02 (SEQ ID NO:107) and 122.sub.--03 (SEQ ID NO:110) changed greatly among the following differentiation stages of the brain and NT2 cells.

[0372] In all portions of the brain, the expression in the pattern of insertion of the exon shown by 122.sub.--02 (SEQ ID NO:107) was more abundant than 122.sub.--03 (SEQ ID NO:110) (Table 10 and Table 11).

[0373] For the downstream transcription initiation point shown by 122.sub.--01 (SEQ ID NO:104), it was found that the expression level varied greatly among the differentiation stages of NT2 cells. When compared in detail with respect to NT2 cell differentiation, it was found that the expression level of the pattern with an insertion was the same as that of the pattern without an insertion at the stage of undifferentiated NT2 cells NT2RA (-); however, in initial stages of differentiation such as NT2RA (+) 24 hr, NT2RA (+) 48 hr, and NT2RA (+) 1 week, which represent the initial stage in which retinoic acid was added to induce differentiation, the ratio of selection of the downstream transcription initiation point increased greatly, the difference being smaller in the late stage of differentiation (Table 10 and Table 11).

[0374] These results demonstrated that by comparing the expression of the 5'-terminal region of a newly acquired cDNA shown by the detection region 122.sub.--01 (SEQ ID NO:104) (a region close to the transcription initiation point), 122.sub.--[1]-N1 (SEQ ID NO:90), or the expression of a newly acquired cDNA region 122.sub.--[2]-N1 (SEQ ID NO:100), shown by the detection region 122.sub.--02 (SEQ ID NO:107), it is possible to use these regions as differentiation markers for detecting cells in nerve cell differentiation or regeneration stages, particularly those in an early stage of differentiation into nerve cells. It also seems possible to develop a new drug by means of a compound, antibody, siRNA or the like that targets a region that exhibits specificity.

[0375] The following regions also seem to be useful as differentiation markers for detecting nerve cell differentiation or regeneration stages, particularly initial stages of differentiation into nerve cells.

Upstream sequence 031.sub.--[1].sub.--1-N3 (SEQ ID NO:114), which comprises the 138th to 162nd bases undergoing priming by Primer122.sub.--01R (SEQ ID NO:103) in D-NT2RI2014164.1 of the cDNA pattern [1]. Upstream sequence 031.sub.--[1].sub.--1-N3 (SEQ ID NO:115), which comprises the 177th to 198th bases undergoing priming by Primer122.sub.--02R (SEQ ID NO:106) in D-BRAMY2029564.1 of the cDNA pattern [2]. Region 122.sub.--01 (SEQ ID NO:104) amplified by Primer122.sub.--01F (SEQ ID NO:102) and Primer122.sub.--01R (SEQ ID NO:103) in the cDNA pattern [1]. Region 122.sub.--02 (SEQ ID NO:107) amplified by Primer122.sub.--02F (SEQ ID NO:105) and Primer122.sub.--02R (SEQ ID NO:106) in the cDNA pattern [2].

Example 9

Cluster chr8-916 (Data Set: 124)

(1) Cluster Analysis

1) Cluster Characteristics

[0376] An analysis was performed on 10 sequences of full-length cDNAs subjected to genome mapping onto the cluster chr8-916 (Human genome UCSC hg18 (NCBI Build34) chromosome 8, 81,100,000 bp to 81,325,000 bp) [D-BRHIP2003515.1, D-COLON2003937.1, Z-BRCOC2013886-01, BC018117.1, BX640835.1, C-SMINT1000078, ENST00000263850, NM.sub.--005079.1, U18914.1, XM.sub.--374275.1]. They were classified according to expression pattern difference into 4 kinds, which mainly included the following 2 kinds.

[1] D-BRHIP2003515.1

[2] BC018117.1, NM.sub.--005079.1, U18914.1

[0377] [1] is a cDNA newly acquired and subjected to full-length cDNA sequence analysis by us, and having a different ORF from [2], which had been registered with an existing public DB.

[0378] [1], compared with the known [2], had a different ORF region because of amino acid sequence alteration due to the insertion of an exon different from other patterns in the ORF region.

[0379] It was found that the ORF regions present in the 2 kinds of cDNA patterns [1] to [2] have different splice patterns, from the same chromosome region, resulting in alterations of the amino acid sequences to produce diverse proteins and mRNAs.

2) Characteristics of D-BRHIP2003515.1 ([1]), which was Newly Acquired and Subjected to Full-Length cDNA Sequence Analysis by Us 124.sub.--[1].sub.--1-N0 (SEQ ID NO:116): The entire nucleic acid sequence region of D-BRHIP2003515.1 124.sub.--[1].sub.--1-NA0 (SEQ ID NO:117): Both the entire nucleic acid sequence region and amino acid sequence of D-BRHIP2003515.1 124.sub.--[1].sub.--1-A0 (SEQ ID NO:118): The entire amino acid sequence region of D-BRHIP2003515.1

[0380] The sequence at the 471st to 539th bases of D-BRHIP2003515.1 (SEQ ID NO:119) is a variant incorporating an exon that is not present in NM.sub.--005079.1, which is registered with an existing public DB, and serves for control. The translation initiation point and translation termination point of D-BRHIP2003515.1 are the same as those of NM.sub.--005079.1; however, because of the insertion of a 69-base exon into D-BRHIP2003515.1, the amino acid length increased by 23 residues, compared with NM.sub.--005079.1 (SEQ ID NO:120).

124.sub.--[1].sub.--1-N1 (SEQ ID NO:119): A 69-base insert nucleic acid sequence region of D-BRHIP2003515.1 124.sub.--[1].sub.--1-A 1 (SEQ ID NO:120): A 23-residue insert amino acid sequence region of D-BRHIP2003515.1 124.sub.--[1].sub.--1-N 2 (identical to SEQ ID NO:119): An ORF nucleic acid sequence region in the 69-base insert region of D-BRHIP2003515.1 124.sub.--[1].sub.--1-A 2 (identical to SEQ ID NO:120): An ORF amino acid region related to the 69-base insert region of D-BRHIP2003515.1

3) Expression Specificity Analysis and Design of Primers for Real-Time PCR

[0381] To clearly distinguish between the characteristic regions shown above, and examine the respective expression levels thereof, the following regions were used as detection regions. It seemed possible to compare the expression levels of the individual characteristic regions by comparing the expression levels of the detection regions.

124.sub.--04--A region specifically extracted by means of the sequence information at the border of a region having an exon inserted therein in the cDNA pattern [1]: an insert region of an ORF altering exon in the cDNA pattern [1], which was newly subjected to full-length cDNA sequence analysis by us .fwdarw.Fragment 124.sub.--04 (SEQ ID NO:123) amplified by Primer124.sub.--04F (SEQ ID NO:121) and Primer12404R (SEQ ID NO:122) 124.sub.--05--A specific region corresponding to a deletion region of the cDNA pattern [2], which is registered with an existing public DB, compared with the insertion region of [1], serving as a control for comparing [1] .fwdarw.Fragment 124.sub.--05 (SEQ ID NO:126) amplified by Primer124.sub.--05F (SEQ ID NO:124) and Primer12405R (SEQ ID NO:125) 124.sub.--06--A common region shared by all of [1] to [2]: a region common to all patterns, serving for control to compare the overall expression levels of the cDNA pattern [1], which was newly subjected to full-length cDNA sequence analysis by us, and the cDNA pattern [2], which is registered with an existing public DB .fwdarw.Fragment 124.sub.--06 (SEQ ID NO:129) amplified by Primer124.sub.--06F (SEQ ID NO:127) and Primer124.sub.--06R (SEQ ID NO:128)

[0382] By mapping the 5'-terminal sequences of about 1.44 million sequences acquired using the oligocap method onto the human genome sequence, and comparatively analyzing them, the exon regions specific to the cDNA patterns [1] to [2] shown above, respectively, were found to be expressed at the following frequencies.

[0383] In the cDNA pattern [1], which was newly acquired and analyzed by us, twenty-one 5'-terminal sequences were present, the derivations thereof being brain tissues such as Brain, amygdala, Brain, cerebellum, and Brain, hippocampus for 18 sequences and Kidney, Tumor for 3 sequences.

[0384] In the cDNA pattern [2], which is registered with an existing public DB, fifty-one 5'-terminal sequences were present, the derivations thereof being brain tissues such as Brain, substantia nigra, Brain, hippocampus, Brain, amygdala, and Brain, corpus callosum for 17 sequences, tumor tissues such as Tongue, Tumor, and Kidney, Tumor for 9 sequences, and other normal tissues such as Lung, Small Intestine, and Trachea for sequences.

[0385] From this result, it was found that the exon insertion pattern [1] was abundantly expressed in the brain. It was also found that the exon deletion pattern [2] was expressed not only in the brain, but also in other various tissues. Hence, it was thought that the mechanism for amino acid alteration due to exon insertion in this chromosome region to cause the expression of different proteins, as with the pattern [1], might be unique to particular tissues.

(2) Analysis of Expression Specificity by Real-Time PCR

[0386] To detect protein expression diversity changes due to exon selectivity among different tissues, details of expression levels were analyzed by real-time PCR. The results are shown in Table 12 and Table 13.

TABLE-US-00012 TABLE 12 RQ Score Log.sub.10RQ Score 124_04 124_05 124_06 124_04 124_05 124_06 01 NT2RA(-) 0.1 0.1 0.1 -1.28 -1.07 -1.06 02 NT2RA(+) 24 hr 0.2 0.1 0.1 -0.74 -1.21 -1.22 03 NT2RA(+) 48 hr 0.0 0.1 0.1 -1.47 -1.25 -1.27 04 NT2RA(+) 1 week 0.1 0.0 0.0 -0.83 -1.58 -1.60 05 NT2RA(+) 5 weeks 8.6 0.0 0.1 0.93 -1.32 -1.07 06 NT2RA(+) 5 weeks, Inh(+) 4.7 0.1 0.1 0.67 -1.15 -1.03 07 NT2 Neuron 1.1 0.0 0.0 0.04 -2.08 -1.79 08 Brain, Fetal 148.6 0.0 0.4 2.17 -1.59 -0.38 09 Brain, whole 465.6 0.3 2.1 2.67 -0.47 0.32 10 ALZ Visual Cortex 286.6 0.3 1.1 2.46 -0.49 0.05 Occipital 11 Mix, viscus tissues 1.0 1.0 1.0 0.0 0.0 0.0 12 Mix, blood cells and 14.6 0.5 0.6 1.16 -0.34 -0.26 related tissues 13 Mix, tumor tissues 0.4 0.9 0.8 -0.40 -0.04 -0.09 14 Mix, normal tissues 1.0 0.9 0.8 -0.01 -0.06 -0.10 15 Brain, whole PolyA(+) RNA 190.4 0.3 1.3 2.28 -0.54 0.12 16 Brain, hippocampus 189.2 0.3 1.1 2.28 -0.50 0.06 17 Brain, cerebellum 247.5 0.1 1.6 2.39 -0.84 0.21 18 Brain, amygdala 191.9 0.2 0.9 2.28 -0.74 -0.03 19 Brain, caudate nucleus 134.8 0.4 0.9 2.13 -0.45 -0.05 20 Brain, corpus callosum 25.0 1.1 1.2 1.40 0.03 0.09 21 Brain, substantia nigra 70.3 0.5 0.9 1.85 -0.29 -0.06 22 Brain, thalamus 194.7 0.3 1.0 2.29 -0.57 0.01 23 Brain, subthalamic nucleus 22.2 0.6 0.6 1.35 -0.25 -0.25

TABLE-US-00013 TABLE 13 RQ Score Log.sub.10RQ Score 124_04 124_05 124_06 124_04 124_05 124_06 01 NT2RA(-) 0.2 0.1 0.1 -0.72 -1.19 -0.99 02 NT2RA(+) 24 hr 1.0 0.1 0.1 -0.02 -1.19 -1.14 03 NT2RA(+) 48 hr 0.3 0.1 0.1 -0.48 -1.24 -1.14 04 NT2RA(+) 1 week 0.7 0.0 0.0 -0.17 -1.56 -1.47 05 NT2RA(+) 5 weeks 41.4 0.0 0.1 1.62 -1.33 -0.97 06 NT2RA(+) 5 weeks, Inh(+) 30.7 0.1 0.1 1.49 -1.09 -0.89 07 NT2 Neuron 6.2 0.0 0.0 0.79 -1.95 -1.59 08 Brain, Fetal 839.9 0.0 0.5 2.92 -1.62 -0.27 09 Brain, whole 3655.9 0.3 2.6 3.56 -0.50 0.41 10 ALZ Visual Cortex 1899.0 0.3 1.6 3.28 -0.46 0.19 Occipital 11 Mix, viscus tissues 1.0 1.0 1.0 0.0 0.0 0.0 12 Mix, blood cells and 100.9 0.5 0.7 2.00 -0.28 -0.17 related tissues 13 Mix, tumor tissues 1.9 0.8 0.9 0.28 -0.11 -0.05 14 Mix, normal tissues 8.9 0.7 1.2 0.95 -0.16 0.08 15 Brain, whole PolyA(+) RNA 1539.2 0.3 1.7 3.19 -0.55 0.22 16 Brain, hippocampus 1524.4 0.3 1.5 3.18 -0.48 0.16 17 Brain, cerebellum 2130.5 0.2 2.3 3.33 -0.80 0.36 18 Brain, amygdala 1379.6 0.2 1.2 3.14 -0.75 0.09 19 Brain, caudate nucleus 804.0 0.4 1.1 2.91 -0.45 0.04 20 Brain, corpus callosum 163.7 1.1 1.4 2.21 0.04 0.16 21 Brain, substantia nigra 386.9 0.5 1.1 2.59 -0.32 0.04 22 Brain, thalamus 1285.4 0.3 1.3 3.11 -0.59 0.10 23 Brain, subthalamic nucleus 181.6 0.6 0.8 2.26 -0.26 -0.11

[0387] Expression levels were compared using the 23 kinds of samples shown in Example 3, including 11 kinds of brain tissues and NT2 cells at 7 different differentiation stages. The comparison was made using the mixed sample of normal visceral tissues shown in Example 3 (Mix, viscus tissues) as an experimental control.

[0388] The ratio of ORF alteration due to exon insertion/deletion selectivity as compared between 124.sub.--04 (SEQ ID NO:123) and 124.sub.--05 (SEQ ID NO:126) changed greatly among the following tissues and NT2 cell differentiation stages.

[0389] In all portions of the brain, the expression of the pattern for insertion of the exon shown by 124.sub.--04 (SEQ ID NO:123) was abundant (Table 12 and Table 13).

[0390] It was found that in NT2 cells, exon selectivity changed greatly depending on the stage of differentiation. When compared in detail with respect to NT2 cell differentiation, almost no difference was observed between the two patterns 124.sub.--04 (SEQ ID NO:123) and 124.sub.--05 (SEQ ID NO:126) in undifferentiated NT2 cells NT2RA (-) and NT2RA (+) 1 week, which represent the initial stage in which retinoic acid was added to NT2 cells to induce differentiation; however, in NT2RA (+) 5 weeks to NT2 Neuron, the expression of the pattern of insertion of the exon shown by 124.sub.--04 (SEQ ID NO:123) was considerably abundant (Table 12 and Table 13).

[0391] These results demonstrated that by comparing the expression of the selective exon region 124.sub.--[1].sub.--1-N1 (SEQ ID NO:119) of a newly acquired cDNA shown by the detection region 124.sub.--04 (SEQ ID NO:123), it is possible to use the exon region as a brain-specific marker, and as a differentiation marker for detecting cells in nerve cell differentiation or regeneration stages, particularly those after nerve differentiation or nerve regeneration. It also seems possible to develop a new drug by means of a compound, antibody, siRNA or the like that targets a region that exhibits specificity.

[0392] The following regions also seem to be useful as markers specific for the brain, and as differentiation markers for detecting nerve cells in differentiation or regeneration stages, particularly those after nerve differentiation or after nerve regeneration.

Upstream sequence 124.sub.--[1].sub.--1-N3 (SEQ ID NO:130), which comprises the 472nd to 491st bases undergoing priming by Primer124.sub.--04R (SEQ ID NO:122) in D-BRHIP2003515.1 of the cDNA pattern [1]. Region 12404 (SEQ ID NO:123) amplified by Primer124.sub.--04F (SEQ ID NO:121) and Primer12404R (SEQ ID NO:122) in the cDNA pattern [1]

Example 10

Cluster chr3+2014 (Data Set: 112)

(1) Cluster Analysis

1) Cluster Characteristics

[0393] An analysis was performed on 7 full-length cDNAs subjected to genome mapping onto the cluster chr3+2014 (Human genome UCSC hg18 (NCBI Build34) chromosome 3, 143,070,000 bp to 143,130,000 bp) [D-BRACE2044661.1, BC011835.2, C-BRAMY2022929, C-PRS09188, ENST00000286371, NM.sub.--001679.2, U51478.1]. They were classified according to expression pattern difference into 4 kinds, which mainly included the following 2 kinds.

[1] D-BRACE2044661.1

[2] BC011835.2, ENST00000286371, NM.sub.--001679.2, U51478.1

[0394] [1] is a cDNA newly acquired and subjected to full-length cDNA sequence analysis by us, and having a different ORF from [2], which had been registered with an existing public DB.

[0395] [1], compared with the known [2], had a different ORF because of translation initiation point alteration due to the insertion of an exon different from other patterns in the ORF region.

[0396] It was found that the ORF regions present in the 2 kinds of cDNA patterns [1] to [2] have different splice patterns, from the same chromosome region, resulting in alterations of the amino acid sequences to produce diverse proteins and mRNAs.

2) Characteristics of D-BRACE2044661.1 ([1]), which was Newly Acquired and Subjected to Full-Length cDNA Sequence Analysis by Us 112.sub.--[1].sub.--1-N0 (SEQ ID NO:131): The entire nucleic acid sequence region of D-BRACE2044661.1 112.sub.--[1].sub.--1-NA0 (SEQ ID NO:132): Both the entire nucleic acid sequence region and amino acid sequence of D-BRACE2044661.1 112.sub.--[1].sub.--1-A0 (SEQ ID NO:133): The entire amino acid sequence region of D-BRACE2044661.1

[0397] The 272nd to 363rd bases of D-BRACE2044661.1 (SEQ ID NO:134) is an exon that is not present in NM.sub.--001679.2, which is registered with an existing public DB, and serves for control, lacking homology to NM.sub.--001679.2. Because a translation initiation point is present on this exon, the amino acids on the N-terminal side changed by 23 residues (SEQ ID NO:135).

112.sub.--[1].sub.--1-N1 (SEQ ID NO:134): A 92-base insert nucleic acid sequence region of D-BRACE2044661.1 112.sub.--[1].sub.--1-A1 (SEQ ID NO:135): A 23-residue insert amino acid sequence region D-BRACE2044661.1 112.sub.--[1].sub.--1-N2 (SEQ ID NO:136): An ORF nucleic acid sequence region in the 92-base insert region of D-BRACE2044661.1 112.sub.--[1].sub.--1-A2 (identical to SEQ ID NO:135): An ORF amino acid sequence region in the 92-base insert region of D-BRACE2044661.1

[0398] The sequence at the 837th to 856th bases of D-BRACE2044661.1 (SEQ ID NO:137) is an exon that is not present in NM.sub.--001679.2, which is registered with an existing public DB, and serves for control, lacking homology to NM.sub.--001679.2. Because of a change in the translation frame by this insert sequence, the amino acids on the C-terminal side changed by 13 residues (SEQ ID NO:138).

112.sub.--[1].sub.--1-N3 (SEQ ID NO:137): A 20-base insert nucleic acid sequence region of D-BRACE2044661.1 112.sub.--[1].sub.--1-A3 (SEQ ID NO:138): A 13-residue insert amino acid sequence region of D-BRACE2044661.1 112.sub.--[1].sub.--1-N4 (identical to SEQ ID NO:137): An ORF nucleic acid sequence region in the 20-base insert region of D-BRACE2044661.1 112.sub.--[1].sub.--1-A4 (SEQ ID NO:139): An ORF amino acid sequence region in the 20-base insert region of D-BRACE2044661.1

3) Expression Specificity Analysis and Design of Primers for Real-Time PCR

[0399] To clearly distinguish between the characteristic regions shown above, and examine the respective expression levels thereof, the following regions were used as detection regions. It seemed possible to compare the expression levels of the individual characteristic regions by comparing the expression levels of the detection regions.

112.sub.--01--A region incorporating an exon of the cDNA pattern [1], specifically extracted by means of the sequence information at the border: an ORF-altering exon insert region in the cDNA pattern [1], which was newly subjected to full-length cDNA sequence analysis by us .fwdarw.Fragment 112.sub.--01 (SEQ ID NO:142) amplified by Primer112.sub.--01F (SEQ ID NO:140) and Primer112.sub.--01R (SEQ ID NO:141) 112.sub.--02--A specific region corresponding to a deletion region of the cDNA pattern [2], which is registered with an existing public DB, compared with the insert region of [1], serving as a control for comparatively examining [1] .fwdarw.Fragment 112.sub.--02 (SEQ ID NO:145) amplified by Primer112.sub.--02F (SEQ ID NO:143) and Primer112.sub.--02R (SEQ ID NO:144) 112.sub.--03--A common region shared by all of [1] to [2]: a region common to all patterns, serving for control to compare the overall expression levels of the cDNA pattern [1], which was newly subjected to full-length cDNA sequence analysis by us, and the cDNA pattern [2], which is registered with an existing public DB .fwdarw.Fragment 112.sub.--03 (SEQ ID NO:148) amplified by Primer112.sub.--03F (SEQ ID NO:146) and Primer112.sub.--03R (SEQ ID NO:147)

[0400] By mapping the 5'-terminal sequences of about 1.44 million sequences acquired using the oligocap method onto the human genome sequence, and comparatively analyzing them, the regions specific for the 2 kinds of cDNA patterns [1] to [2] shown above, respectively, were found to be expressed at the following frequencies.

[0401] In the cDNA pattern [1], which was newly acquired and analyzed by us, six 5'-terminal sequences were present, the derivations thereof being Brain, cerebellum for 3 sequences (analytical parameter 82,880), Brain, cortex, Alzheimer for 1 sequence (analytical parameter 16,360), Brain, amygdala for 1 sequence (analytical parameter 58,640), and tissues rich in head portion from 10-week-gestional fetal human (whole embryo, mainly head) for 1 sequence (analytical parameter 7,033).

[0402] In the cDNA pattern [2], which is registered with an existing public DB, twenty-four 5'-terminal sequences were present, the derivations thereof being Placenta for 4 sequences (analytical parameter 46,090), NT2 cells treated with retinoic acid (RA) to induce differentiation (NT2RP) for 3 sequences (analytical parameter 39,242), Tongue, Tumor for 2 sequences (analytical parameter 31,371), IMR32 cells (Neuroblastoma) for 2 sequences (analytical parameter 16964), NT2 cells treated with retinoic acid and a growth inhibitor to induce nerve differentiation, followed by nerve concentration and recovery (NT2NE) for 2 sequences (analytical parameter 16,337) and the like; this pattern was expressed in various tissues.

[0403] From this result, it was found that the exon insertion pattern [1] was abundantly expressed in the brain. It was also found that the exon deletion pattern [2] was expressed not only in the brain, but also in other various tissues. Hence, it was thought that the selection mechanism for mRNA pattern change in this chromosome region, which alters N-terminal amino acids and results in the expression of different proteins because of exon insertion as with the pattern [1], might be unique to particular tissues.

(2) Analysis of Expression Specificity by Real-Time PCR

[0404] To detect protein expression diversity changes due to exon selectivity among different tissues, details of expression levels were analyzed by real-time PCR. The results are shown in Table 14.

TABLE-US-00014 TABLE 14 RQ Score Log.sub.10RQ Score 112_01 112_02 112_03 112_01 112_02 112_03 01 NT2RA(-) 0.4 0.5 1.2 -0.35 -0.26 0.09 02 NT2RA(+) 24 hr 0.5 0.3 0.5 -0.33 -0.48 -0.33 03 NT2RA(+) 48 hr 0.4 0.5 0.6 -0.41 -0.32 -0.22 04 NT2RA(+) 1 week 0.2 0.5 0.6 -0.74 -0.32 -0.21 05 NT2RA(+) 5 weeks 2.0 0.9 2.0 0.29 -0.03 0.31 06 NT2RA(+) 5 weeks, Inh(+) 4.1 0.8 1.5 0.62 -0.12 0.18 07 NT2 Neuron 7.6 0.9 1.3 0.88 -0.03 0.11 08 Brain, Fetal 23.3 1.2 2.3 1.37 0.08 0.36 09 Brain, whole 158.2 0.6 1.8 2.20 -0.21 0.26 10 ALZ Visual Cortex 109.3 0.3 1.2 2.04 -0.55 0.08 Occipital 11 Mix, viscus tissues 1.0 1.0 1.0 0.0 0.0 0.0 12 Mix, blood cells and 10.1 0.8 1.2 1.00 -0.12 0.07 related tissues 13 Mix, tumor tissues 0.9 1.6 1.2 -0.03 0.22 0.06 14 Mix, normal tissues 2.4 1.3 1.6 0.37 0.10 0.20 15 Brain, whole PolyA(+) RNA 114.0 0.3 0.9 2.06 -0.56 -0.03 16 Brain, hippocampus 56.4 0.3 0.7 1.75 -0.55 -0.15 17 Brain, cerebellum 149.9 0.6 1.6 2.18 -0.20 0.21 18 Brain, amygdala 47.5 0.3 0.9 1.68 -0.47 -0.06 19 Brain, caudate nucleus 47.9 0.3 0.8 1.68 -0.59 -0.11 20 Brain, corpus callosum 8.7 0.3 0.6 0.94 -0.53 -0.19 21 Brain, substantia nigra 56.7 0.4 1.0 1.75 -0.37 0.01 22 Brain, thalamus 124.0 0.3 1.2 2.09 -0.60 0.07 23 Brain, subthalamic nucleus 26.1 0.4 0.7 1.42 -0.37 -0.16

[0405] Expression levels were compared using the 23 kinds of samples shown in Example 3, including 11 kinds of brain tissues and NT2 cells at 7 different differentiation stages. The comparison was made using the mixed sample of normal visceral tissues shown in Example 3 (Mix, viscus tissues) as an experimental control.

[0406] The ratio of ORF alteration due to exon insertion/deletion selectivity as compared between 112.sub.--01 (SEQ ID NO:142) and 112.sub.--02 (SEQ ID NO:145) changed greatly among the following brain portions and NT2 cell differentiation stages.

[0407] In the brain, particularly in Brain, cerebellum, Brain, hippocampus, Brain, amygdala, Brain, caudate nucleus, Brain, substantia nigra, and Brain, thalamus, the pattern of insertion of the exon shown by 112.sub.--01 (SEQ ID NO:142) was abundantly observed (Table 14).

[0408] It was also found that in NT2 cells, exon selectivity varied greatly depending on the stage of differentiation. When compared in detail with respect to NT2 cell differentiation, the expression of the exon deletion pattern shown by 112.sub.--02 (SEQ ID NO:145), which is registered with an existing public DB was more abundant in undifferentiated NT2 cells NT2RA (-) and NT2RA (+) 48 hr, NT2RA (+) 1 week, which represents the initial stage in which retinoic acid was added to induce differentiation; however, in NT2RA (+) 5 weeks, which is predicted to be rich in nerve cells after differentiation, the expression level reversed; even in NT2RA (+) 5 weeks, Inh (+) and NT2 Neuron, the expression of the exon insertion pattern shown by 112.sub.--01 (SEQ ID NO:142) was abundantly observed (Table 14).

[0409] These results demonstrated that by comparing the expression of the selective exon region 112.sub.--[1]-N1 (SEQ ID NO:134) of a newly acquired cDNA shown by the detection region 112.sub.--01 (SEQ ID NO:142), it is possible to use the exon region as a marker specific for the brain, particularly for portions such as Brain, cerebellum, Brain, hippocampus, Brain, amygdala, Brain, caudate nucleus, Brain, substantia nigra, and Brain, thalamus, and as a differentiation marker for detecting cells in nerve cell differentiation or regeneration stages, particularly those that have differentiated or regenerated into a nerve. It also seems possible to develop a new drug by means of a compound, antibody, siRNA or the like that targets a region that exhibits specificity.

[0410] The following regions also seem to be useful as differentiation markers.

Upstream sequence 112.sub.--[1].sub.--1-N5 (SEQ ID NO:149), which comprises the 363rd to 390th bases undergoing priming by Primer112.sub.--01R (SEQ ID NO:141) in D-BRACE2044661.1 of the cDNA pattern [1]. Region 112.sub.--01 (SEQ ID NO:142) amplified by Primer112.sub.--01F (SEQ ID NO:140) and Primer112.sub.--01R (SEQ ID NO:141) in the cDNA pattern [1].

Example 11

Cluster chr12+1658 (Data Set: 095)

(1) Cluster Analysis

1) Cluster Characteristics

[0411] An analysis was performed on 7 sequences of full-length cDNAs subjected to genome mapping onto the cluster chr12+1658 (Human genome UCSC hg18 (NCBI Build34) chromosome 12, 108,470,000 bp to 108,500,000 bp) [D-BRCAN2027778.1, D-3NB692002462.1, BC016140.1, C-NT2RP3000875, ENST00000228510, M88468.1, NM.sub.--000431.1]. They were classifiable according to expression pattern difference mainly into the following 3 kinds.

[1] D-3NB692002462.1

[2] D-BRCAN2027778.1

[3] BC016140.1, ENST00000228510, M88468.1, NM.sub.--000431.1

[0412] [1] and [2] are cDNAs which were newly acquired and subjected to full-length cDNA sequence analysis by us, and had a different ORF from [3], which had been registered with an existing public DB.

[0413] [1], compared with the known [3], had a different ORF region because of the deletion of portions corresponding to the third and fourth exons of [3] in the ORF region.

[0414] [2], compared with the known [3], had a different ORF region because of the deletion of a portion corresponding to the fourth exon of [3] in the ORF region.

[0415] It was found that the ORF regions present in the 3 kinds of cDNA patterns [1] to [3] have different splice patterns, such as exon deletions, from the same chromosome region, resulting in alterations of the amino acid sequences to produce diverse proteins and mRNAs.

2) Characteristics of D-3NB692002462.1 ([1]), which was Newly Acquired and Subjected to Full-Length cDNA Sequence Analysis by Us 095.sub.--[1].sub.--1-N0 (SEQ ID NO:150): The entire nucleic acid sequence region of D-3NB692002462.1 095.sub.--[1].sub.--1-NA0 (SEQ ID NO:151): Both the entire nucleic acid sequence region and amino acid sequence of D-3NB692002462.1 095.sub.--[1].sub.--1-A0 (SEQ ID NO:152): The entire amino acid sequence region of D-3NB692002462.1

[0416] The 301-base exon present at the 303rd to 603rd bases of NM.sub.--000431.1, which is registered with an existing public DB, and serves for control (SEQ ID NO:155), is lacked and not present in the region at the 287th to 288th bases of D-3NB692002462.1 (SEQ ID NO:153). The translation initiation point of NM.sub.--000431.1 is present on the first exon, shared by D-3NB692002462.1; however, in D-3NB692002462.1, because of the alteration of the frame due to deletion of the 301 bases, the translation initiation point shifts toward the 3' side, compared with NM.sub.--000431.1, and the 343rd base of D-3NB692002462.1 becomes the translation initiation point. For this reason, the N-terminal amino acid sequence shortened by 194 residues, compared with NM.sub.--000431.1.

095.sub.--[1].sub.--1-N1 (SEQ ID NO:153): A deletion nucleic acid sequence region of D-3NB692002462.1 095.sub.--[1].sub.--1-N2 (SEQ ID NO:154): A 342-base 5'UTR region of an ORF whose translation initiation point is the 343rd base of D-3NB692002462.1 095.sub.--[1]_C-N1 (SEQ ID NO:155): A 301-base exon nucleic acid sequence present in the region at the 303rd to 603rd bases of NM.sub.--000431.1 inserted into the region at the 287th to 288th bases of D-3NB692002462.1 095.sub.--[1]_C-A1 (SEQ ID NO:156): A 101-residue amino acid sequence related to the 301-base exon nucleic acid sequence present in the region at the 303rd to 603rd bases of NM.sub.--000431.1 inserted into the region at the 1,250th to 1,251st bases of D-3NB692002462.1

[0417] With this change, "GHMP kinase putative ATP-binding protein", the Pfam motif present at the 128th to 346th amino acids of NM.sub.--000431.1, disappeared in D-3NB692002462.1.

3) Characteristics of D-BRCAN2027778.1 ([2]), which was Newly Acquired and Subjected to Full-Length cDNA Sequence Analysis by Us 095.sub.--[2].sub.--1-N0 (SEQ ID NO:157): The entire nucleic acid sequence region of D-BRCAN2027778.1 095.sub.--[2].sub.--1-NA0 (SEQ ID NO:158): Both the entire nucleic acid sequence region and amino acid sequence of D-BRCAN2027778.1 095.sub.--[2].sub.--1-A0 (SEQ ID NO:159): The entire amino acid sequence region of D-BRCAN2027778.1

[0418] The 156-base exon present at the 448th to 603rd bases of NM.sub.--000431.1, which is registered with an existing public DB, and serves for control (SEQ ID NO:162), is lacked and not present in the region at the 422nd to 423rd bases of D-BRCAN2027778.1 (SEQ ID NO:160).

095.sub.--[2].sub.--1-N1 (SEQ ID NO:160): A deletion nucleic acid sequence region of D-BRCAN2027778.1 095.sub.--[2].sub.--1-A1 (SEQ ID NO:161): An altered amino acid sequence region of D-BRCAN2027778.1 095.sub.--[2].sub.--1-N2 (identical to SEQ ID NO:160): An ORF nucleic acid sequence region in the deletion region of D-BRCAN2027778.1 095.sub.--[2].sub.--1-A2 (identical to SEQ ID NO:161): An ORF amino acid region related to the deletion region of D-BRCAN2027778.1 095.sub.--[2]_C-N1 (SEQ ID NO:162): A 156-base exon nucleic acid sequence present in the region at the 448th to 603rd bases of NM.sub.--000431.1 inserted into the region at the 422nd to 423rd bases of D-BRCAN2027778.1 095.sub.--[2]_C-A1 (SEQ ID NO:163): A 101-residue amino acid sequence related to the 156-base exon nucleic acid sequence present in the region at the 448th to 603rd bases of NM.sub.--000431.1 inserted into the region at the 423rd to 424th bases of D-BRCAN2027778.1

4) Expression Specificity Analysis and Design of Primers for Real-Time PCR and TaqMan Probe

[0419] To clearly distinguish between the characteristic regions shown above, and examine the respective expression levels thereof, the following regions were used as detection regions. It seemed possible to compare the expression levels of the individual characteristic regions by comparing the expression levels of the detection regions.

095.sub.--01--A region specifically extracted by means of the sequence information at the border of regions of the exon deletion of cDNA pattern [1]: an ORF-altering exon deletion region in the cDNA pattern [1], which was newly subjected to full-length cDNA sequence analysis by us .fwdarw.Fragment 095.sub.--01 (SEQ ID NO:166) amplified by Primer095.sub.--01F (SEQ ID NO:164) and Primer09501R (SEQ ID NO:165) TaqMan probe used 095.sub.--01TP: (SEQ ID NO:167) 095.sub.--02--A region specifically extracted by means of the sequence information at the border of regions of the exon deletion of cDNA pattern [2]: an ORF-altering exon deletion in the cDNA pattern [2], which was newly subjected to full-length cDNA sequence analysis by us .fwdarw.Fragment 095.sub.--02 (SEQ ID NO:170) amplified by Primer095.sub.--02F (SEQ ID NO:168) and Primer09502R (SEQ ID NO:169) TaqMan probe used 095.sub.--02TP: (SEQ ID NO:171) 095.sub.--03--A specific region of the cDNA pattern [3], which is registered with an existing public DB, that can be distinguished from both the deletion regions of [1] and [2], serving as a control for comparing [1] and [2] .fwdarw.Fragment 095.sub.--03 (SEQ ID NO:174) amplified by Primer095.sub.--03F (SEQ ID NO:172) and Primer09503R (SEQ ID NO:173) TaqMan probe used 095.sub.--03TP: (SEQ ID NO:175) 095.sub.--04--A common region shared by all of [1] to [3]: a region common to all patterns, serving for control to compare the overall expression levels of the cDNA patterns [1] and [2], which were newly subjected to full-length cDNA sequence analysis by us, and the cDNA pattern [3], which is registered with an existing public DB .fwdarw.Fragment 095.sub.--04 (SEQ ID NO:178) amplified by Primer095.sub.--04F (SEQ ID NO:176) and Primer095.sub.--04R (SEQ ID NO:177) TaqMan probe used 095.sub.--04TP: (SEQ ID NO:179)

[0420] By mapping the 5'-terminal sequences of about 1.44 million sequences acquired using the oligocap method onto the human genome sequence, and comparatively analyzing them, the exon regions specific for the cDNA patterns [1] to [3] shown above, respectively, were found to be expressed at the following frequencies.

[0421] In the cDNA pattern [1], which was newly acquired and analyzed by us, three 5'-terminal sequences were present, the derivations thereof being NB69 cells for 1 sequence (analytical parameter 8,153), NT2 cells treated with retinoic acid (RA) to induce differentiation (NT2RP) for 1 sequence (analytical parameter 39,242), and SK-N-MC cells (Neuroepithelioma) for 1 sequence (analytical parameter 7,700).

[0422] In the cDNA pattern [2], which was newly acquired and analyzed by us, three 5'-terminal sequences were present, the derivations thereof being a library generated by subtracting cDNAs that overlap with the mRNA of BRAWH: Brain, whole from a cDNA library prepared from the mRNA of BRALZ [Alzheimer patient cerebral cortex (Brain, cortex, Alzheimer)] (BRALZ-BRAWH) for 1 sequence (analytical parameter 157), Brain, caudate nucleus for 1 sequence (analytical parameter 25,786), and NT2 cells treated with retinoic acid and a growth inhibitor to induce nerve differentiation, followed by nerve concentration and recovery (NT2NE) for 1 sequence (analytical parameter 16,337).

[0423] In the cDNA pattern [3], which is registered with an existing public DB, thirty-four 5'-terminal sequences were present, and expression was observed in various tissues, the derivations thereof being Brain, cerebellum for 4 sequences (analytical parameter 82,880), Testis for 4 sequences (analytical parameter 90,188), NT2 cells treated with RA and treated with a growth inhibitor to induce nerve differentiation, followed by nerve concentration and recovery (NT2NE) for 3 sequences (analytical parameter 16,337), Brain, whole for 2 sequences (analytical parameter 59,069), Brain, subthalamic nucleus for 2 sequences (analytical parameter 16,308), Kidney for 2 sequences (analytical parameter 17,008), and Thymus for 2 sequences (analytical parameter 70,578).

[0424] From this result, it was found that the exon deletion pattern [1] was expressed in differentiated NT2 cells and the like. It was also found that the exon deletion pattern [2] was abundantly expressed in the brain. The known sequence [3], compared with the patterns [1] and [2], was expressed in a wider variety of organs. Hence, it was thought that the selection mechanism for mRNA pattern change in this chromosome region, which alters amino acid sequences and results in the expression of different proteins because of exon selectivity as with the patterns [1] and [2], might be unique to particular tissues.

(2) Analysis of Expression Specificity by Real-Time PCR

[0425] To detect protein expression diversity changes due to exon selectivity among different tissues, details of expression levels were analyzed by real-time PCR. The results are shown in Table 15.

TABLE-US-00015 TABLE 15 RQ Score Log.sub.10RQ Score 095_01 095_02 095_03 095_04 095_01 095_02 095_03 095_04 01 NT2RA(-) 0.5 0.3 0.2 0.2 -0.34 -0.49 -0.78 -0.66 02 NT2RA(+) 24 hr 0.8 0.4 0.2 0.3 -0.11 -0.43 -0.78 -0.52 03 NT2RA(+) 48 hr 0.3 0.2 0.2 0.4 -0.49 -0.66 -0.70 -0.37 04 NT2RA(+) 1 week 0.9 0.4 0.7 1.2 -0.03 -0.36 -0.18 0.07 05 NT2RA(+) 5 weeks 0.2 0.2 0.3 0.3 -0.71 -0.80 -0.58 -0.48 06 NT2RA(+) 5 weeks, Inh(+) 0.1 0.2 0.3 0.2 -0.88 -0.64 -0.58 -0.63 07 NT2 Neuron 0.2 0.0 0.1 1.7 -0.72 -1.36 -1.16 0.23 08 Brain, Fetal 2.3 0.7 0.5 1.2 0.36 -0.16 -0.26 0.09 09 Brain, whole 1.0 0.3 0.4 0.7 -0.01 -0.52 -0.41 -0.16 10 ALZ Visual Cortex 0.5 0.2 0.2 0.3 -0.30 -0.73 -0.80 -0.60 Occipital 11 Mix, viscus tissues 1.0 1.0 1.0 1.0 0.0 0.0 0.0 0.0 12 Mix, blood cells and 2.1 1.0 0.9 1.0 0.33 0.01 -0.05 0.01 related tissues 13 Mix, tumor tissues 0.4 0.3 0.5 0.5 -0.45 -0.53 -0.30 -0.31 14 Mix, normal tissues 1.3 0.8 1.2 1.0 0.11 -0.11 0.07 0.00 15 Brain, whole PolyA(+) RNA 3.6 0.9 1.2 1.3 0.55 -0.05 0.09 0.10 16 Brain, hippocampus 1.9 0.4 0.7 0.7 0.27 -0.36 -0.13 -0.17

[0426] Expression levels were compared using the 16 samples shown in Example 3, including Brain, hippocampus, Brain, whole, Brain, Fetal, Alzheimer patient cerebral cortex (ALZ Visual Cortex Occipital) and NT2 cells at 7 different differentiation stages. The comparison was made using the mixed sample of normal visceral tissues shown in Example 3 (Mix, viscus tissues) as an experimental control.

[0427] The ratio of ORF alteration due to exon deletion selectivity as compared between 095.sub.--01 (SEQ ID NO:166) and 095.sub.--02 (SEQ ID NO:170) changed greatly among the following differentiation stages of the brain and NT2 cells.

The expression of the pattern of deletion of the exon shown by 095.sub.--01 (SEQ ID NO:166) was abundant in undifferentiated NT2 cells NT2RA (-) and NT2RA (+) 1 week, which represents the initial stage in which retinoic acid was added to induce differentiation. Although the expression decreased in NT2RA (+) 5 weeks to NT2RA(+) 5 weeks, Inh (+), which represent the late stage of differentiation induction, this pattern was again abundantly expressed in NT2 Neuron (Table 15).

[0428] The expression of the pattern of deletion of the exon shown by 095.sub.--02 (SEQ ID NO:170) was abundant in undifferentiated NT2 cells NT2RA (-) and NT2RA (+) 24 hr, which represents the initial stage in which retinoic acid was added to induce differentiation. In NT2RA (+) 5 weeks to NT2RA (+)5 weeks, Inh (+), which represent the late stage of differentiation, and NT2 Neuron, the expression level decreased (Table 15).

[0429] These results demonstrated that by comparing the expression of the selective exon regions 095.sub.--[1].sub.--1-N1 (SEQ ID NO:153) and 095.sub.--[2].sub.--1-N1 (SEQ ID NO:160) of newly acquired cDNAs shown by the detection regions 095.sub.--01 (SEQ ID NO:166) and 095.sub.--02 (SEQ ID NO:170), it is possible to use the exon regions as differentiation markers for detecting nerve cell differentiation or regeneration stages, particularly initial stages of differentiation into nerve cells.

[0430] Furthermore, it was demonstrated that the selective exon region 095.sub.--[1].sub.--1-N1 (SEQ ID NO:153) of a newly acquired cDNA shown by the detection region 095.sub.--01 (sequence No. 095-17), as a brain-specific marker, can be used as one of differentiation markers for detecting cells in nerve cell differentiation or regeneration stages, particularly those after nerve differentiation or nerve regeneration. It also seems possible to develop a new drug by means of a compound, antibody, siRNA or the like that targets a region that exhibits specificity.

[0431] The following regions also seem to be useful as differentiation markers for detecting nerve cell differentiation or regeneration.

Upstream sequence 095.sub.--[1].sub.--1-N3 (SEQ ID NO:180), which comprises the 304th to 326th bases undergoing priming by Primer095.sub.--01R (SEQ ID NO:165) in D-3NB692002462.1 of the cDNA pattern [1]. Upstream sequence 095.sub.--[2].sub.--1-N3 (SEQ ID NO:181), which comprises the 444th to 466th bases undergoing priming by Primer095.sub.--02R (SEQ ID NO:169) in D-BRCAN2027778.1 of the cDNA pattern [2]. Region 095.sub.--01 (SEQ ID NO:166) amplified by Primer095.sub.--01F (SEQ ID NO:164) and Primer095.sub.--01R (SEQ ID NO:165) in the cDNA pattern [1] Region 095.sub.--02 (SEQ ID NO:170) amplified by Primer095.sub.--02F (SEQ ID NO:168) and Primer095.sub.--02R (SEQ ID NO:169) in the cDNA pattern [2]

Example 12

Cluster chr12-1875 (Data Set: 017)

(1) Cluster Analysis

1) Cluster Characteristics

[0432] Analysis was performed on 10 sequences of full-length cDNAs genome-mapped to the cluster chr12-1875 (Human genome UCSC hg18 (NCBI Build34) chromosome 12, 7,840,000 bp to 7,960,000 bp) [D-NT2RI3001005.1, D-NT2RI3005261.1, AF481879.1, AL110298.1, AL832448.1, BC060766.1, C-TESTI1000257, C-TESTI4028880, ENST00000340749, NM.sub.--153449.2]. They were classified according to expression pattern difference into 4 kinds, which mainly included the following 2 kinds.

[1] D-NT2RI3001005.1, D-NT2RI3005261.1

[2] AF481879.1, C-TESTI4028880 (AK126026.1), NM.sub.--153449.2

[0433] [1] is a cDNA which was newly acquired and subjected to full-length cDNA sequence analysis by us, and had a different ORF region because of the expression thereof from a chromosome region upstream of the known [2], and also because of the presence of the translation initiation point on a new exon lacking identity to [2].

[0434] It was found that the ORF regions present in the 2 kinds of cDNA patterns [1] and [2] cause expression starting at different initiation points, from the same chromosome region, resulting in alterations of the amino acid sequences to produce diverse proteins and mRNAs.

2) Characteristics of D-NT2RI3001005.1 ([1]), which was Newly Acquired and Subjected to Full-Length cDNA Sequence Analysis by Us 017.sub.--[1].sub.--1-N0 (SEQ ID NO:182): The entire nucleic acid sequence region of D-NT2RI3001005.1 017.sub.--[1].sub.--1-NA0 (SEQ ID NO:183): Both the entire nucleic acid sequence region and amino acid sequence of D-NT2RI3001005.1 017.sub.--[1].sub.--1-A0 (SEQ ID NO:184): The entire amino acid sequence region of D-NT2RI3001005.1

[0435] The sequence at the 1st to 153rd bases of D-NT2RI3001005.1 (SEQ ID NO:185) is an exon that is not present in NM.sub.--153449.2, which is registered with an existing public DB, and serves for control, lacking homology to NM.sub.--153449.2. Because the translation initiation point is present on this exon, the amino acids on the N-terminal side changed by 44 residues (SEQ ID NO:186).

017.sub.--[1].sub.--1-N1 (SEQ ID NO:185): A 153-base insert nucleic acid sequence region of D-NT2RI3001005.1 017.sub.--[1].sub.--1-A1 (SEQ ID NO:186): A 44-residue insert amino acid sequence region of D-NT2RI3001005.1 017.sub.--[1].sub.--1-N2 (SEQ ID NO:187): An ORF nucleic acid sequence region in the 153-base insert region of D-NT2RI3001005.1 017.sub.--[1].sub.--1-A2 (identical to SEQ ID NO:186): An ORF amino acid sequence region in the 153-base insert region of D-NT2RI3001005.1 3) Characteristics of D-NT2RI3005261.1 ([1]), which was Newly Acquired and Subjected to Full-Length cDNA Sequence Analysis by Us 017.sub.--[1].sub.--2-N0 (SEQ ID NO:188): The entire nucleic acid sequence region of D-NT2RI3005261.1 017.sub.--[1].sub.--2-NA0 (SEQ ID NO:189): Both the entire nucleic acid sequence region and amino acid sequence of D-NT2RI3005261.1 017.sub.--[1].sub.--2-A0 (SEQ ID NO:190): The entire amino acid sequence region of D-NT2RI3005261.1

[0436] The sequence at the 1st to 153rd bases of D-NT2RI3005261.1 (SEQ ID NO:191) is an exon that is not present in NM.sub.--153449.2, which is registered with an existing public DB, and serves for control, lacking homology to NM.sub.--153449.2. Because the translation initiation point is present on this exon, the amino acids on the N-terminal side changed by 44 residues (SEQ ID NO:192).

017.sub.--[1].sub.--2-N1 (SEQ ID NO:191): A 153-base insert nucleic acid sequence region of D-NT2RI3005261.1 017.sub.--[1].sub.--2-A1 (SEQ ID NO:192): A 44-residue insert amino acid sequence region of D-NT2RI3005261.1 017.sub.--[1].sub.--2-N2 (SEQ ID NO:193): An ORF nucleic acid sequence region in the 153-base insert region of D-NT2RI3005261.1 017.sub.--[1].sub.--2-A2 (identical to SEQ ID NO:192): An ORF amino acid sequence region in the 153-base insert region of D-NT2RI3005261.1 4) Characteristics of C-TESTI4028880 (AK126026.1) ([2]), which was Acquired and Subjected to Full-Length cDNA Sequence Analysis by Us, and is Already Registered with a Public DB 017.sub.--[2].sub.--1-N0 (SEQ ID NO:194): The entire nucleic acid sequence region of C-TESTI4028880 017.sub.--[2].sub.--1-NA0 (SEQ ID NO:195): Both the entire nucleic acid sequence region and amino acid sequence of C-TESTI4028880 017.sub.--[2].sub.--1-A0 (SEQ ID NO:196): The entire amino acid sequence region of C-TESTI4028880

5) Expression Specificity Analysis and Design of Primers for Real-Time PCR

[0437] To clearly distinguish between the characteristic regions shown above, and examine the respective expression levels thereof, the following regions were used as detection regions. It seemed possible to compare the expression levels of the individual characteristic regions by comparing the expression levels of the detection regions.

017.sub.--01--A specific region present on the N-terminal side of the cDNA pattern [1]: a translation initiation region of the cDNA pattern [1], which was newly subjected to full-length cDNA sequence analysis by us, being a novel region not registered with an existing public DB .fwdarw.Fragment 017.sub.--01 (SEQ. ID NO:199) amplified by Primer017.sub.--01F (SEQ ID NO:197) and Primer017.sub.--01R (SEQ ID NO:198) 017.sub.--03--A common region shared by all of [1] to [2]: a region common to all patterns, serving for control to compare the overall expression levels of the cDNA pattern [1], which was newly subjected to full-length cDNA sequence analysis by us, and the cDNA pattern [2], which is registered with an existing public DB .fwdarw.Fragment 017.sub.--03 (SEQ ID NO:202) amplified by Primer017.sub.--03F (SEQ ID NO:200) and Primer017.sub.--03R (SEQ ID NO:201)

[0438] By mapping the 5'-terminal sequences of about 1.44 million sequences acquired using the oligocap method onto the human genome sequence, and comparatively analyzing them, the regions specific for the 2 kinds of cDNA patterns [1] to [2] shown above, respectively, were found to be expressed at the following frequencies.

[0439] In the cDNA pattern [1], which was newly acquired and analyzed by us, fourteen 5'-terminal sequences were present, the derivations thereof being NT2 cells treated with retinoic acid (RA) to induce differentiation for 5 weeks, and thereafter treated with a growth inhibitor for 2 weeks (NT2RI) for 13 sequences (analytical parameter 32,662), and NT2 cells treated with retinoic acid (RA) to induce differentiation (NT2RP) for 1 sequence (analytical parameter 39,242).

[0440] In the cDNA pattern [2], which is registered with an existing public DB, eighty-six 5'-terminal sequences were present, the derivations thereof being Testis for 85 sequences (analytical parameter 90,188), and NT2 cells treated with RA to induce differentiation for 5 weeks, and thereafter treated with a growth inhibitor for 2 weeks (NT2RI) for 1 sequence (analytical parameter 32,662).

[0441] From this result, it was found that the transcription initiation point of [1] was expressed specifically in NT2 cells after differentiation. From the transcription initiation point of [2], the expression in Testis was very abundant. Hence, it was thought that the mechanism of transcription in this chromosome region might be different only it the situation of nerve cell differentiation of NT2 cells after differentiation, with a different transcription initiation point being used.

(2) Analysis of Expression Specificity by Real-Time PCR

[0442] To determine what are the states in which the transcription initiation point used for the expression changes, details of expressions level were analyzed by real-time PCR. The results are shown in Table 16 and Table 17.

TABLE-US-00016 TABLE 16 RQ Score Log.sub.10RQ Score 017_01 017_03 017_01 017_03 01 NT2RA(-) 81.1 4.7 1.91 0.67 02 NT2RA(+) 24 hr 29.4 1.8 1.47 0.25 03 NT2RA(+) 48 hr 34.8 1.6 1.54 0.21 04 NT2RA(+) 1 week 177.5 2.6 2.25 0.41 05 NT2RA(+) 5 weeks 39.2 0.8 1.59 -0.07 06 NT2RA(+) 5 weeks, Inh(+) 1250.2 7.0 3.10 0.85 07 NT2 Neuron 319.1 0.6 2.50 -0.19 08 Brain, Fetal 1.2 1.5 0.07 0.18 09 Brain, whole 0.6 2.3 -0.25 0.35 10 ALZ Visual Cortex 0.6 0.6 -0.23 -0.21 Occipital 11 Mix, viscus tissues 1.0 1.0 0.0 0.0 12 Mix, blood cells and 1.4 2.1 0.15 0.32 related tissues 13 Mix, tumor tissues 0.6 0.3 -0.24 -0.60 14 Mix, normal tissues 32.4 1.1 1.51 0.06 15 Brain, whole PolyA(+) RNA 0.1 0.5 -0.88 -0.27 16 Brain, hippocampus 0.7 0.5 -0.15 -0.31

TABLE-US-00017 TABLE 17 RQ Score Log.sub.10RQ Score 017_01 017_03 017_01 017_03 01 NT2RA(-) 30.9 5.2 1.49 0.72 02 NT2RA(+) 24 hr 11.3 1.7 1.05 0.22 03 NT2RA(+) 48 hr 15.5 1.6 1.19 0.22 04 NT2RA(+) 1 week 77.1 2.9 1.89 0.46 05 NT2RA(+) 5 weeks 17.5 1.0 1.24 -0.02 06 NT2RA(+) 5 weeks, Inh(+) 497.7 7.6 2.70 0.88 07 NT2 Neuron 145.3 0.6 2.16 -0.20 08 Brain, Fetal 1.0 1.8 -0.02 0.24 09 Brain, whole 0.3 2.6 -0.57 0.41 10 ALZ Visual Cortex 0.3 0.7 -0.46 -0.14 Occipital 11 Mix, viscus tissues 1.0 1.0 0.0 0.0 12 Mix, blood cells and 0.9 2.7 -0.02 0.43 related tissues 13 Mix, tumor tissues 1.7 0.3 0.24 -0.57 14 Mix, normal tissues 19.8 1.2 1.30 0.07 15 Brain, whole PolyA(+) RNA 0.2 0.7 -0.79 -0.16 16 Brain, hippocampus 0.5 0.7 -0.29 -0.16 17 Colon 0.8 0.1 -0.12 -0.92 18 Colon Tumor Undet. 0.0 Undet. -1.65 19 Kidney 0.7 0.3 -0.15 -0.50 20 Kidney Tumor 0.0 0.2 -1.60 -0.61 21 Liver 2.2 0.1 0.34 -0.94 22 Liver Tumor 14.8 0.1 1.17 -0.94 23 Lung 0.1 2.0 -0.91 0.30 24 Lung Tumor 0.3 0.6 -0.60 -0.25 25 Ovary 93.4 2.0 1.97 0.29 26 Ovary Tumor 6.7 0.2 0.83 -0.70 27 Stomach 1.1 0.7 0.04 -0.17 28 Stomach Tumor Undet. 0.1 Undet. -1.25 29 Uterus 2.5 1.6 0.40 0.21 30 Uterus Tumor 0.6 0.3 -0.21 -0.53 31 Tongue 33.7 0.2 1.53 -0.65 32 Tumor Tongue 15.6 0.1 1.19 -0.91

[0443] Expression levels were compared using the 32 samples shown in Example 3, including Brain, hippocampus, Brain, whole, Brain, Fetal, Alzheimer patient cerebral cortex (ALZ Visual Cortex Occipital), NT2 cells at 7 different differentiation stages, 8 kinds of normal tissues, and 8 kinds of tumor tissues and the like. The comparison was made using the mixed sample of normal visceral tissues shown in Example 3 (Mix, viscus tissues) as an experimental control.

[0444] The transcription initiation point shown by 017.sub.--01 (SEQ ID NO:199) is used selectively in NT2 cells. Hence, in NT2 cells at all stages, whether undifferentiated or differentiated, the ratio of transcription from the upstream transcription initiation point was considerably high (Table 16 and Table 17).

[0445] These results demonstrated that by detecting the expression of the 5'-terminal regions (regions close to the transcription initiation point) 017.sub.--[1].sub.--1-N1 (SEQ ID NO:185) and 017.sub.--[1].sub.--2-N1 (SEQ ID NO:191) of a newly acquired cDNA region shown by the detection region 017.sub.--01 (SEQ ID NO:199), it is possible to use the 5'-terminal regions as nerve cell markers. It also seems possible to develop a new drug by means of a compound, antibody, siRNA or the like that targets a region that exhibits specificity.

[0446] The following regions also seem to be useful as nerve cell markers.

Upstream sequence 017.sub.--[1].sub.--1-N3 (SEQ ID NO:203), which comprises the 143rd to 159th bases undergoing priming by Primer017.sub.--01R (SEQ ID NO:198) in D-NT2RI3001005.1 of the cDNA pattern [1]. Upstream sequence 017.sub.--[1].sub.--2-N3 (SEQ ID NO:204), which comprises the 143rd to 159th bases undergoing priming by Primer017.sub.--01R (SEQ ID NO:198) in D-NT2RI3005261.1 of the cDNA pattern [1]. Region 017.sub.--01 (SEQ ID NO:199) amplified by Primer017.sub.--01F (SEQ ID NO:197) and Primer017.sub.--01R (SEQ ID NO:198) in the cDNA pattern [1].

Example 13

Cluster chr3-1507 (Data Set: 023)

(1) Cluster Analysis

1) Cluster Characteristics

[0447] An analysis was performed on 15 sequences of full-length cDNAs subjected to genome mapping onto the cluster chr3-1507 (Human genome UCSC hg18 (NCBI Build34) chromosome 3, 73,500,000 bp to 73,800,000 bp) [D-OCBBF2010718.1, D-OCBBF3004194.1, D-NT2RP8000826.1, D-NT2RP7007268.1, D-BRAWH3008172.1, D-BRAWH3011965.1, AB029018.1, AL049958.1, AL157498.1, BC014432.1, C-HEMBA1005489, ENST00000263666, ENST00000308537, ENST00000319719, NM.sub.--015009.1]. They were classified according to expression pattern difference into 8 kinds, which mainly included the following 5 kinds.

[1] D-OCBBF2010718.1, D-OCBBF3004194.1

[2] D-NT2RP8000826.1, D-NT2RP7007268.1

[3] D-BRAWH3008172.1

[4] D-BRAWH3011965.1

[5] AB029018.1, ENST00000263666, NM.sub.--015009.1

[0448] [1], [2], [3], and [4] are cDNAs which were newly acquired and subjected to full-length cDNA sequence analysis by us, and had a different ORF from [5], which had been registered with an existing public DB.

[0449] [1], [2], [3], and [4] had a different ORF region because of the expression thereof from a chromosome region located downstream of the known [5], and also because of the presence of a translation initiation point different from [5].

[0450] It was found that the ORF regions present in the 5 kinds of cDNA patterns [1] to [5] cause expression starting at different transcription initiation points, from the same chromosome region, resulting in alterations of the amino acid sequences to produce diverse proteins and mRNAs.

2) Characteristics of D-OCBBF2010718.1 ([1]), which was Newly Acquired and Subjected to Full-Length cDNA Sequence Analysis by Us 023.sub.--[1].sub.--1-N0 (SEQ ID NO:205): The entire nucleic acid sequence region of D-OCBBF2010718.1 023.sub.--[1].sub.--1-NA0 (SEQ ID NO:206): Both the entire nucleic acid sequence region and amino acid sequence of D-OCBBF2010718.1 023.sub.--[1].sub.--1-A0 (SEQ ID NO:207): The entire amino acid sequence region of D-OCBBF2010718.1

[0451] The 1st to 212th bases of D-OCBBF2010718.1 (SEQ ID NO:208) is an exon that is not present in NM.sub.--015009.1, which is registered with an existing public DB, and serves for control, lacking homology to NM.sub.--015009.1. Because the translation initiation point is present on this exon, the amino acids on the N-terminal side changed by 23 residues (SEQ ID NO:209).

023.sub.--[1].sub.--1-N1 (SEQ ID NO:208): A 212-base insert nucleic acid sequence region of D-OCBBF2010718.1 023.sub.--[1].sub.--1-A1 (SEQ ID NO:209): A 23-residue insert amino acid sequence region of D-OCBBF2010718.1 023.sub.--[1].sub.--1-N2 (SEQ ID NO:210): An ORF nucleic acid sequence region in the 212-base insert region of D-OCBBF2010718.1 023.sub.--[1].sub.--1-A2 (identical to SEQ ID NO:209): An ORF amino acid sequence region in the 212-base insert region of D-OCBBF2010718.1 3) Characteristics of D-OCBBF3004194.1 ([1]), which was Newly Acquired and Subjected to Full-Length cDNA Sequence Analysis by Us 023.sub.--[1].sub.--2-N0 (SEQ ID NO:211): The entire nucleic acid sequence region of D-OCBBF3004194.1 023.sub.--[1].sub.--2-NA0 (SEQ ID NO:212): Both the entire nucleic acid sequence region and amino acid sequence of D-OCBBF3004194.1 023.sub.--[1].sub.--2-A0 (SEQ ID NO:213): The entire amino acid sequence region of D-OCBBF3004194.1

[0452] The sequence at the 1st to 197th bases of D-OCBBF3004194.1 (SEQ ID NO:214) is an exon that is not present in NM.sub.--015009.1, which is registered with an existing public DB, and serves for control, lacking homology to NM.sub.--015009.1. Because the translation initiation point is present on this exon, the amino acids on the N-terminal side changed by 23 residues (SEQ ID NO:215).

023.sub.--[1].sub.--2-N1 (SEQ ID NO:214): A 197-base insert nucleic acid sequence region of D-OCBBF3004194.1 023.sub.--[1].sub.--2-A1 (SEQ ID NO:215): A 23-residue insert amino acid sequence region of D-OCBBF3004194.1 023.sub.--[1].sub.--2-N2 (SEQ ID NO:216): An ORF nucleic acid sequence region in the 197-base insert region of D-OCBBF3004194.1 023.sub.--[1].sub.--2-A2 (identical to SEQ ID NO:215): An ORF amino acid sequence region in the 197-base insert region of D-OCBBF3004194.1 4) Characteristics of D-NT2RP8000826.1 ([2]), which was Newly Acquired and Subjected to Full-Length cDNA Sequence Analysis by Us 023.sub.--[2].sub.--1-N0 (SEQ ID NO:217): The entire nucleic acid sequence region of D-NT2RP8000826.1 023.sub.--[2].sub.--1-NA0 (SEQ ID NO:218): Both the entire nucleic acid sequence region and amino acid sequence of D-NT2RP8000826.1 023.sub.--[2].sub.--1-A0 (SEQ ID NO:219): The entire amino acid sequence region of D-NT2RP8000826.1

[0453] The sequence at the 1st to 178th bases of D-NT2RP8000826.1 (SEQ ID NO:220) is an exon that is not present in NM.sub.--015009.1, which is registered with an existing public DB, and serves for control, lacking homology to NM.sub.--015009.1. Because the translation initiation point is present on this exon, the amino acids on the N-terminal side changed by 28 residues (SEQ ID NO:221).

023.sub.--[2].sub.--1-N1 (SEQ ID NO:220): A 178-base insert nucleic acid sequence region of D-NT2RP8000826.1 023.sub.--[2].sub.--1-A1 (SEQ ID NO:221): A 28-residue insert amino acid sequence region of D-NT2RP8000826.1 023.sub.--[2].sub.--1-N2 (SEQ ID NO:222): An ORF nucleic acid sequence region in the 178-base insert region of D-NT2RP8000826.1 023.sub.--[2].sub.--1-A2 (identical to SEQ ID NO:221): An ORF amino acid sequence region in the 178-base insert region of D-NT2RP8000826.1 5) Characteristics of D-NT2RP7007268.1 ([2]), which was Newly Acquired and Subjected to Full-Length cDNA Sequence Analysis by Us 023.sub.--[2].sub.--2-N0 (SEQ ID NO:223): The entire nucleic acid sequence region of D-NT2RP7007268.1 023.sub.--[2].sub.--2-NA0 (SEQ ID NO:224): Both the entire nucleic acid sequence region and amino acid sequence of D-NT2RP7007268.1 023.sub.--[2].sub.--2-A0 (SEQ ID NO:225): The entire amino acid sequence region of D-NT2RP7007268.1

[0454] The sequence at the 1st to 178th bases of D-NT2RP7007268.1 (SEQ ID NO:226) is an exon that is not present in NM.sub.--015009.1, which is registered with an existing public DB, and serve for control, lacking homology to NM.sub.--015009.1. Because the translation initiation point is present on this exon, the amino acids on the N-terminal side changed by 28 residues (SEQ ID NO:227).

023.sub.--[2].sub.--2-N1 (SEQ ID NO:226): A 178-base insert nucleic acid sequence region of D-NT2RP7007268.1 023.sub.--[2].sub.--2-A1 (SEQ ID NO:227): A 28-residue insert amino acid sequence region of D-NT2RP7007268.1 023.sub.--[2].sub.--2-N2 (SEQ ID NO:228): An ORF nucleic acid sequence region in the 178-base insert region of D-NT2RP7007268.1 023.sub.--[2].sub.--2-A2 (identical to SEQ ID NO:227): An ORF amino acid sequence region in the 178-base insert region of D-NT2RP7007268.1 6) Characteristics of D-BRAWH3008172.1 ([3]), which was Newly Acquired and Subjected to Full-Length cDNA Sequence Analysis by Us 023.sub.--[3].sub.--1-N0 (SEQ ID NO:229): The entire nucleic acid sequence region of D-BRAWH3008172.1 023.sub.--[3].sub.--1-NA0 (SEQ ID NO:230): Both the entire nucleic acid sequence region and amino acid sequence of D-BRAWH3008172.1 023.sub.--[3].sub.--1-A0 (SEQ ID NO:231): The entire amino acid sequence region of D-BRAWH3008172.1

[0455] The sequence at the 1st to 169th bases of D-BRAWH3008172.1 (SEQ ID NO:232) is an exon that is not present in NM.sub.--015009.1, which is registered with an existing public DB, and serves for control, lacking homology to NM.sub.--015009.1. With this change, the translation initiation point of D-BRAWH3008172.1 shifts toward the 3' side relative to NM.sub.--015009.1, and the 281st base of D-BRAWH3008172.1 becomes the translation initiation point. For this reason, the amino acid sequence shortened by 343 residues compared with NM.sub.--015009.1.

023.sub.--[3].sub.--1-N1 (SEQ ID NO:232): A 169-base insert nucleic acid sequence region of D-BRAWH3008172.1 023.sub.--[3].sub.--1-N2 (SEQ ID NO:233): A 280-base 5'UTR region of an ORF whose translation initiation point is the 281st base of D-BRAWH3008172.1 7) Characteristics of D-BRAWH3011965.1 ([4]), which was Newly Acquired and Subjected to Full-Length cDNA Sequence Analysis by Us 023.sub.--[4].sub.--1-N0 (SEQ ID NO:234): The entire nucleic acid sequence region of D-BRAWH3011965.1 023.sub.--[4].sub.--1-NA0 (SEQ ID NO:235): Both the entire nucleic acid sequence region and amino acid sequence of D-BRAWH3011965.1 023.sub.--[4].sub.--1-A0 (SEQ ID NO:236): The entire amino acid sequence region of D-BRAWH3011965.1

[0456] The sequence at the 1st to 311th bases of D-BRAWH3011965.1 (SEQ ID NO:237) is an exon that is not present in NM.sub.--015009.1, which is registered in an existing public DB and serves as a control, lacking homology to NM.sub.--015009.1. Because the translation initiation point is present on this exon, the amino acids on the N-terminal side changed by 4 residues (SEQ ID NO:238).

023.sub.--[4].sub.--1-N1 (SEQ ID NO:237): A 311-base insert nucleic acid sequence region of D-BRAWH3011965.1 023.sub.--[4].sub.--1-A1 (SEQ ID NO:238): A 4-residue insert amino acid sequence region of D-BRAWH3011965.1 023.sub.--[4].sub.--1-N2 (SEQ ID NO:239): An ORF nucleic acid sequence region in the 311-base insert region of D-BRAWH3011965.1 023.sub.--[4].sub.--1-A2 (identical to SEQ ID NO:238): An ORF amino acid sequence region in the 311-base insert region of D-BRAWH3011965.1

8) Expression Specificity Analysis and Design of Primers for Real-Time PCR

[0457] To clearly distinguish between the characteristic regions shown above, and examine the respective expression levels thereof, the following regions were used as detection regions. It seemed possible to compare the expression levels of the individual characteristic regions by comparing the expression levels of the detection regions.

023.sub.--01--A specific region present on the N-terminal side of the cDNA pattern [1]: a translation initiation region of the cDNA pattern [1], which was newly subjected to full-length cDNA sequence analysis by us, being a novel region not registered with an existing public DB .fwdarw.Fragment 023.sub.--01 (SEQ ID NO:242) amplified by Primer023.sub.--01F (SEQ ID NO:240) and Primer023.sub.--01R (SEQ ID NO:241) 023.sub.--02--A specific region present on the N-terminal side of the cDNA pattern [2]: a translation initiation region of the cDNA pattern [2], which was newly subjected to full-length cDNA sequence analysis by us, being a novel region not registered with an existing public DB .fwdarw.Fragment 023.sub.--02 (SEQ ID NO:245) amplified by Primer023.sub.--02F (SEQ ID NO:243) and Primer023.sub.--02R (SEQ ID NO:244) 023.sub.--03--A specific region present on the N-terminal side of the cDNA pattern [3]: a translation initiation region of the cDNA pattern [3], which was newly subjected to full-length cDNA sequence analysis by us, being a novel region not registered with an existing public DB .fwdarw.Fragment 023.sub.--03 (SEQ ID NO:248) amplified by Primer023.sub.--03F (SEQ ID NO:246) and Primer023.sub.--03R (SEQ ID NO:247) 023.sub.--04--A specific region present on the N-terminal side of the cDNA pattern [4]: a translation initiation region of the cDNA pattern [4], which was newly subjected to full-length cDNA sequence analysis by us, being a novel region not registered with an existing public DB .fwdarw.Fragment 023.sub.--04 (SEQ ID NO:251) amplified by Primer023.sub.--04F (SEQ ID NO:249) and Primer023.sub.--04R (SEQ ID NO:250) 023.sub.--05--A specific region of the cDNA pattern [5], which is registered with an existing public DB, that can be distinguished from all of [1], [2], [3], and [4], serving as a control for comparing [1], [2], [3], and [4] .fwdarw.Fragment 023.sub.--05 (SEQ ID NO:254) amplified by Primer023.sub.--05F (SEQ ID NO:252) and Primer023.sub.--05R (SEQ ID NO:253) 023.sub.--06--A common region shared by all of [1] to [5]: a region common to all patterns, serving for control to compare the overall expression levels of the cDNA patterns [1], [2], [3], and [4], which were newly subjected to full-length cDNA sequence analysis by us, and the cDNA pattern [5] registered with an existing public DB .fwdarw.Fragment 023.sub.--06 (SEQ ID NO:257) amplified by Primer023.sub.--06F (SEQ ID NO:255) and Primer023.sub.--06R (SEQ ID NO:256)

[0458] By mapping the 5'-terminal sequences of about 1.44 million sequences acquired using the oligocap method onto the human genome sequence, and comparatively analyzing them, the regions specific for the 2 kinds of cDNA patterns [1] to [2] shown above, respectively, were found to be expressed at the following frequencies.

[0459] In the cDNA pattern [1], which was newly acquired and analyzed by us, thirty-two 5'-terminal sequences were present, the derivations thereof being NT2 cells treated with retinoic acid (RA) to induce differentiation (NT2RP) for 21 sequences (analytical parameter 39,242), Brain, Fetal for 8 sequences (analytical parameter 103,138), NT2 cells treated with retinoic acid (RA) to induce differentiation for 5 weeks, and thereafter treated with a growth inhibitor for 2 weeks (NT2RI) for 1 sequence (analytical parameter 32,662), Brain, hippocampus for 1 sequence (analytical parameter 57,918), and Brain, amygdala for 1 sequence (analytical parameter 58,640).

[0460] In the cDNA pattern [2], which was newly acquired and analyzed by us, twenty 5'-terminal sequences were present, the derivation thereof being NT2 cells treated with retinoic acid (RA) to induce differentiation (NT2RP) for the 20 sequences (analytical parameter 39,242).

[0461] In the cDNA pattern [3], which was newly acquired and analyzed by us, sixteen 5'-terminal sequences were present, the derivations thereof being Brain, whole for 8 sequences (analytical parameter 59,069), Brain, amygdala for 5 sequences (analytical parameter 58,640), Kidney, Tumor for 1 sequence (analytical parameter 15,970), Brain, thalamus for 1 sequence (analytical parameter 53,267), and Testis for 1 sequence (analytical parameter 90,188).

[0462] In the cDNA pattern [4], which was newly acquired and analyzed by us, five 5'-terminal sequences were present, the derivations thereof being Brain, whole for 3 sequences (analytical parameter 59,069), Brain, hippocampus for 1 sequence (analytical parameter 57,918), and Brain, thalamus for 1 sequence (analytical parameter 53,267).

[0463] In the cDNA pattern [5], which is registered with an existing public DB, two 5'-terminal sequences were present, the derivations thereof being Stomach, Tumor for 1 sequence (analytical parameter 2,757), and Prostate for 1 sequence (analytical parameter 16,671).

[0464] From this result, it was found that the transcription initiation point of [1] was abundantly expressed in differentiated NT2 cells and the fetal brain. It was found that the transcription initiation point of [2] was abundantly expressed in differentiated NT2 cells. It was found that the transcription initiation points of [3] and [4] were abundantly expressed in the brain. The known sequence [5] was expressed in gastric cancer and the prostate. Hence, it was thought that the mechanism of transcription in this chromosome region might differ among various organs and cell conditions, with different transcription initiation points being used.

(2) Analysis of Expression Specificity by Real-Time PCR

[0465] To determine what are the states in which the transcription initiation point used for expression changes, details of expression levels were analyzed by real-time PCR. The results are shown in Tables 18-1 and 18-2 and Tables 19-1 and 19-2.

TABLE-US-00018 TABLES 18-1, 18-2 RQ Score Log.sub.10RQ Score 023_01 023_02 023_03 023_04 023_05 023_06 023_01 023_02 023_03 023_04 023_05 023_06 01 NT2RA(-) 0.0 0.0 Undet. 0.0 0.0 0.0 -2.32 -2.82 Undet. -2.07 -1.63 -1.54 02 NT2RA(+) 24 hr 0.1 0.1 0.0 0.2 0.0 0.0 -1.11 -1.22 -3.03 -0.71 -1.62 -1.40 03 NT2RA(+) 48 hr 1.3 0.4 0.0 0.8 0.1 0.2 0.12 -0.38 -2.12 -0.11 -1.13 -0.81 04 NT2RA(+) 1 week 19.1 1.8 0.1 5.6 0.3 1.8 1.28 0.25 -1.24 0.75 -0.55 0.25 05 NT2RA(+) 5 weeks 39.7 1.2 0.0 0.2 0.4 1.9 1.60 0.08 -1.58 -0.63 -0.43 0.27 06 NT2RA(+) 5 weeks, Inh(+) 2.0 0.0 0.0 0.1 0.1 0.3 0.30 -1.64 -1.65 -1.29 -0.84 -0.55 07 NT2 Neuron 2.9 0.7 0.0 1.4 0.0 0.2 0.46 -0.17 -1.95 0.16 -1.77 -0.66 08 Brain, Fetal 53.3 3.5 34.6 11.3 1.7 3.5 1.73 0.54 1.54 1.05 0.23 0.54 09 Brain, whole 0.8 1.0 58.9 2.7 0.4 1.0 -0.12 -0.01 1.77 0.42 -0.46 0.01 10 ALZ Visual Cortex 0.5 0.6 27.6 1.2 0.2 0.6 -0.26 -0.24 1.44 0.07 -0.71 -0.20 Occipital 11 Mix, viscus tissues 1.0 1.0 1.0 1.0 1.0 1.0 0.0 0.0 0.0 0.0 0.0 0.0 12 Mix, blood cells and 1.0 0.3 0.3 0.6 0.3 0.4 -0.02 -0.54 -0.51 -0.20 -0.46 -0.40 related tissues 13 Mix, tumor tissues 1.0 0.1 0.0 0.0 0.1 0.3 -0.01 -1.22 -1.36 -1.41 -0.87 -0.50 14 Mix, normal tissues 2.1 2.9 3.0 4.2 2.3 2.4 0.33 0.47 0.48 0.63 0.36 0.37 15 Brain, whole PolyA(+) RNA 0.2 0.1 19.0 0.8 0.1 0.4 -0.71 -0.99 1.28 -0.12 -1.01 -0.38 16 Brain, hippocampus 0.3 0.0 9.7 0.4 0.1 0.3 -0.55 -1.39 0.98 -0.40 -0.96 -0.48

TABLE-US-00019 TABLES 19-1, 19-2 RQ Score Log.sub.10RQ Score 023_01 023_02 023_03 023_04 023_05 023_06 023_01 023_02 023_03 023_04 023_05 023_06 01 NT2RA(-) 0.0 0.0 Undet. 0.0 0.0 0.0 -2.22 -2.59 Undet. -1.70 -1.59 -1.53 02 NT2RA(+) 24 hr 0.1 0.1 0.0 0.5 0.0 0.0 -1.02 -0.91 -2.71 -0.31 -1.54 -1.34 03 NT2RA(+) 48 hr 1.7 0.6 0.0 1.2 0.1 0.2 0.23 -0.20 -2.18 0.08 -0.96 -0.65 04 NT2RA(+) 1 week 25.8 3.2 0.1 10.6 0.4 2.6 1.41 0.51 -0.96 1.03 -0.39 0.42 05 NT2RA(+) 5 weeks 48.9 1.8 0.0 0.2 0.5 3.0 1.69 0.26 -1.37 -0.62 -0.29 0.48 06 NT2RA(+) 5 weeks, Inh(+) 2.8 0.0 0.0 0.1 0.2 0.5 0.45 -1.36 -1.66 -1.02 -0.66 -0.32 07 NT2 Neuron 3.1 0.9 0.0 2.3 0.0 0.3 0.49 -0.02 -1.45 0.35 -1.85 -0.58 08 Brain, Fetal 67.9 6.0 54.2 25.8 1.9 4.3 1.83 0.78 1.73 1.41 0.29 0.63 09 Brain, whole 0.8 1.5 90.6 5.4 0.4 1.0 -0.10 0.17 1.96 0.74 -0.44 0.02 10 ALZ Visual Cortex 0.6 0.9 40.4 2.5 0.2 0.6 -0.21 -0.07 1.61 0.40 -0.65 -0.26 Occipital 11 Mix, viscus tissues 1.0 1.0 1.0 1.0 1.0 1.0 0.0 0.0 0.0 0.0 0.0 0.0 12 Mix, blood cells and 1.2 0.4 0.7 1.2 0.3 0.4 0.06 -0.35 -0.16 0.07 -0.50 -0.42 related tissues 13 Mix, tumor tissues 1.0 0.1 0.0 0.0 0.1 0.3 -0.02 -1.29 -1.53 -1.41 -0.92 -0.58 14 Mix, normal tissues 2.2 4.0 4.3 6.6 2.2 2.3 0.34 0.60 0.63 0.82 0.35 0.37 15 Brain, whole PolyA(+) RNA 0.3 0.2 31.4 1.3 0.1 0.5 -0.54 -0.75 1.50 0.12 -0.88 -0.28 16 Brain, hippocampus 0.4 0.1 16.0 0.7 0.1 0.5 -0.40 -1.13 1.20 -0.15 -0.86 -0.30 17 Brain, cerebellum 0.1 0.0 0.1 0.0 0.1 0.2 -0.86 -1.81 -0.86 -1.80 -1.09 -0.79 18 Brain, amygdala 1.4 0.1 11.9 0.5 0.1 0.5 0.14 -1.11 1.08 -0.33 -1.02 -0.31 19 Brain, caudate nucleus 0.1 0.0 1.3 0.1 0.1 0.2 -1.15 -1.56 0.12 -0.84 -1.13 -0.70 20 Brain, corpus callosum 0.1 0.0 1.1 0.2 0.1 0.2 -1.14 -1.61 0.05 -0.64 -1.05 -0.64 21 Brain, substantia nigra 0.1 0.1 2.4 0.3 0.1 0.2 -0.83 -1.13 0.39 -0.56 -1.04 -0.73 22 Brain, thalamus 0.2 0.0 4.4 0.2 0.0 0.2 -0.78 -1.57 0.65 -0.77 -1.41 -0.82 23 Brain, subthalamic nucleus 0.0 0.0 0.3 0.0 0.1 0.2 -1.70 -2.49 -0.55 -1.82 -1.24 -0.77

[0466] Expression levels were compared using the 23 samples shown in Example 3, including Brain, hippocampus, Brain, whole, Brain, Fetal, Alzheimer patient cerebral cortex (ALZ Visual Cortex Occipital), NT2 cells at 7 different differentiation stages, and 7 kinds of brain tissues. The comparison was made using the mixed sample of normal visceral tissues shown in Example 3 (Mix, viscus tissues) as an experimental control.

[0467] The transcription initiation points shown by 023.sub.--01 (SEQ ID NO:242), 023.sub.--02 (SEQ ID NO:245), and 023.sub.--04 (SEQ ID NO:251) were abundantly expressed in NT2 cells after differentiation, particularly in NT2RA (+) 1 week, which represents an advanced stage of differentiation, whereas 023.sub.--01 (SEQ ID NO:242) was most abundantly expressed in NT2RA (+) 5 weeks (Tables 18-1 and 18-2 and Tables 19-1 and 19-2). In the brain tissues, the expression from the transcription initiation points shown by 023.sub.--01 (SEQ ID NO:242), 023.sub.--03 (SEQ ID NO:248), and 023.sub.--04 (SEQ ID NO:251) was abundant, with the expression in Brain, Fetal being particularly abundant (Tables 18-1 and 18-2 and Tables 19-1 and 19-2).

[0468] These results demonstrated that by comparing the expression of transcription initiation point regions 023.sub.--[1].sub.--1-N1 (SEQ ID NO:208), 023.sub.--[1].sub.--2-N1 (SEQ ID NO:214), 023.sub.--[2].sub.--1-N1 (SEQ ID NO:220), 023.sub.--[2].sub.--2-N1 (SEQ ID NO:226), 023.sub.--[3].sub.--1-N1 (SEQ ID NO:232), and 023.sub.--[4].sub.--1-N1 (SEQ ID NO:237) of newly acquired cDNAs shown by the detection regions 023.sub.--01 (SEQ ID NO:242), 023.sub.--02 (SEQ ID NO:245), 023.sub.--03 (SEQ ID NO:248), and 023.sub.--04 (SEQ ID NO:251), it is possible to use these regions as differentiation markers for detecting nerve cell differentiation or regeneration stages, or as brain-specific markers. It also seems possible to develop a new drug by means of a compound, antibody, siRNA or the like that targets a region that exhibits specificity.

[0469] The following regions also seem to be useful as differentiation markers for detecting stages of nerve cell differentiation or regeneration and brain-specific markers.

Upstream sequence 023.sub.--[1].sub.--1-N3 (SEQ ID NO:258), which comprises the 191st to 219th bases undergoing priming by Primer023.sub.--01R (SEQ ID NO:241) in D-OCBBF2010718.1 of the cDNA pattern [1]. Upstream sequence 023.sub.--[1].sub.--2-N3 (SEQ ID NO:259), which comprises the 181st to 204th bases undergoing priming by Primer023.sub.--01R (SEQ ID NO:241) in D-OCBBF3004194.1 of the cDNA pattern [1]. Upstream sequence 023.sub.--[2].sub.--1-N3 (SEQ ID NO:260), which comprises the 158th to 179th bases undergoing priming by Primer023.sub.--02R (SEQ ID NO:244) in D-NT2RP8000826.1 of the cDNA pattern [2]. Upstream sequence 023.sub.--[2].sub.--2-N3 (SEQ ID NO:261), which comprises the 161st to 180th bases undergoing priming by Primer023.sub.--02R (SEQ ID NO:244) in D-NT2RP7007268.1 of the cDNA pattern [2]. Upstream sequence 023.sub.--[3].sub.--1-N3 (SEQ ID NO:262), which comprises the 293rd to 316th bases undergoing priming by Primer023.sub.--03R (SEQ ID NO:247) in D-BRAWH3008172.1 of the cDNA pattern [3]. Upstream sequence 023.sub.--[4].sub.--1-N3 (SEQ ID NO:263), which comprises the 65th to 84th bases undergoing priming by Primer023.sub.--04R (SEQ ID NO:250) in D-BRAWH3011965.1 of the cDNA pattern [4]. Region 023.sub.--01 (SEQ ID NO:242) amplified by Primer023.sub.--01F (SEQ ID NO:240) and Primer023.sub.--01R (SEQ ID NO:241) in the cDNA pattern [1]. Region 023.sub.--02 (SEQ ID NO:245) amplified by Primer023.sub.--02F (SEQ ID NO:243) and Primer023.sub.--02R (SEQ ID NO:244) in the cDNA pattern [2]. Region 023.sub.--03 (SEQ ID NO:248) amplified by Primer023.sub.--03F (SEQ ID NO:246) and Primer023.sub.--03R (SEQ ID NO:247) in the cDNA pattern [3]. Region 023.sub.--04 (SEQ ID NO:251) amplified by Primer023.sub.--04F (SEQ ID NO:249) and Primer023.sub.--04R (SEQ ID NO:250) in the cDNA pattern [4].

Example 14

OFR Information on Full-Length cDNA Sequences and Results of Homology Analysis and Results of Analysis of Motif and the Like

[0470] To determine the functions of 19 sequences of full-length cDNAs that were newly acquired and subjected to full-length cDNA sequence analysis by us, ORF prediction and annotation analysis were performed. Results of the annotation analysis can be updated when the database or analytical software for comparison is upgraded. Thereby, it is sometimes possible to newly add an annotation to sequences with no annotation given under the same conditions.

1) Prediction of ORFs of cDNAs undergoing Full-Length cDNA Sequence Analysis

[0471] Using ORF prediction/evaluation systems such as ATGpr (A. Salamov et al. (1998) Bioinformatics 14: 384-390) and TRins (K. Kimura et al. (2003) Genome Informatics 14: 456-457), ORFs were predicted from full-length cDNA sequences. The ORF region information predicted from the full-length cDNA sequences is shown below.

[0472] The ORF regions were denoted in compliance with the rules of "DDBJ/EMBL/GenBank Feature Table Definition" (http://www.ncbi.nlm.nih.gov/collab/FT/index.html). The ORF start position is the first character of the methionine-encoding base "ATG", and the stop position represents the third character of the stop codon. These are indicated by a partition "..". However, for the ORFs that do not have a stop codon, the stop position is indicated with the use of ">" in compliance with the denotation rules.

TABLE-US-00020 Name of cDNA sequence ORF region D-UTERU2026184.1 191 . . . 2119 D-BRACE3000012.1 465 . . . 2558 D-NT2RP8004156.1 131 . . . 1387 D-NT2RI3005525.1 45 . . . 1292 D-NT2RP8004592.1 620 . . . 1183 D-NT2RI2014164.1 162 . . . 1397 D-BRAMY2029564.1 143 . . . 1657 D-BRHIP2003515.1 84 . . . 707 D-BRACE2044661.1 297 . . . 878 D-3NB692002462.1 343 . . . 951 D-BRCAN2027778.1 52 . . . 1086 D-NT2RI3001005.1 22 . . . 1629 D-NT2RI3005261.1 22 . . . 1629 D-OCBBF2010718.1 144 . . . 2495 D-OCBBF3004194.1 129 . . . 2480 D-NT2RP8000826.1 95 . . . 2461 D-NT2RP7007268.1 95 . . . 2461 D-BRAWH3008172.1 281 . . . 2452 D-BRAWH3011965.1 300 . . . >1574

2) Results of Homology Analysis Using BLASTP (SwissProt)

[0473] Homology analysis was performed on the 19 ORF sequences shown in Example 14-1), using BLASTP (blastall 2.2.6; ftp://ftp.ncbi.nih.gov/blast/), for SwissProt of the Aug. 22, 2006 version (ftp://us.expasy.org/databases/swiss-prot/). Based on the results of the homology analysis, the sequences showing the highest homology with an E-value of 1E-10 or less are shown below. In the following cases, however, the applicable candidate is not selected, but the next candidate is shown.

Having a definition beginning with "ALU SUBFAMILY" Having a definition beginning with "Alu subfamily" Having a definition beginning with "!!!! ALU SUBFAMILY" Having a definition beginning with "B-CELL GROWTH FACTOR PRECURSOR" Having a definition including "NRK2" Having a definition beginning with "PROLINE-RICH" Having a definition beginning with "GLYCINE-RICH" Having a definition beginning with "EXTENSIN PRECURSOR" Having a definition beginning with "COLLAGEN" Having a definition beginning with "100 KD" Having a definition beginning with "RETROVIRUS-RELATED POL POLYPROTEIN" Having a definition beginning with "CUTICLE COLLAGEN" Having a definition beginning with "HYPOTHETICAL" Having a definition beginning with "Hypothetical" Having a definition beginning with "SALIVARY PROLINE-RICH PROTEIN" Having a definition beginning with "IMMEDIATE-EARLY PROTEIN" Having the accession No "P49646"

[0474] Individual data are shown with the name of cDNA sequence, ORF region, hit data accession number, hit data definition, hit data keyword, E-value, consensus length (amino acid length), and identity, separated by "//" in this order.

D-UTERU2026184.1// 191..2119// Q8TF45// Zinc finger protein 418// DNA-binding; Metal-binding; Nuclear protein; Repeat; Repressor; Transcription; Transcription regulation; Zinc; Zinc-finger.// 0// 601// 100 D-BRACE3000012.1// 465..2558// Q8TF45// Zinc finger protein 418// DNA-binding; Metal-binding; Nuclear protein; Repeat; Repressor; Transcription; Transcription regulation; Zinc; Zinc-finger.// 0// 674// 99 D-NT2RP8004156.1// 131..1387// P31749// RAC-alpha serine/threonine-protein kinase (EC2.7.11.1) (RAC-PK-alpha) (Protein kinase B) (PKB)(C-AKT)// 3D-structure; Apoptosis; ATP-binding; Carbohydrate metabolism; Glucose metabolism; Glycogen biosynthesis; Glycogen metabolism; Kinase; Nuclear protein; Nucleotide-binding; Phosphorylation; Serine/threonine-protein kinase; Sugar transport; Transferase; Translation regulation; Transport.// 0// 418// 100 D-NT2RI3005525.1// 45..1292// Q7Z698// Sprouty-related, EVH1-domain-containingprotein 2 (Spred-2)// Membrane; Phosphorylation.// 0// 409// 99 D-NT2RI2014164.1// 162..1397// P27338// Amine oxidase [flavin-containing] B (EC1.4.3.4) (Monoamine oxidase type B) (MAO-B)// 3D-structure; Acetylation; Direct protein sequencing; FAD; Flavoprotein; Membrane; Mitochondrion; Oxidoreductase; Transmembrane.// 0// 367// 93 D-BRAMY2029564.1// 143..1657// P27338// Amine oxidase [flavin-containing] B (EC1.4.3.4) (Monoamine oxidase type B) (MAO-B)// 3D-structure; Acetylation; Direct protein sequencing; FAD; Flavoprotein; Membrane; Mitochondrion; Oxidoreductase; Transmembrane.// 0// 504// 100 D-BRHIP2003515.1// 84.707// P55327// Tumor protein D52 (N8 protein)// Coiled coil.// 7E-93// 184// 88 D-BRACE2044661.1// 297.878// P54709// Sodium/potassium-transporting ATPase subunitbeta-3 (Sodium/potassium-dependent ATPase beta-3subunit) (ATPB-3) (CD298 antigen)// Glycoprotein; Ion transport; Membrane; Potassium; Potassium transport; Signal-anchor; Sodium; Sodium transport; Sodium/potassium transport; Transmembrane; Transport.// 1E-90// 158// 97 D-3NB692002462.1// 343..951// Q03426// Mevalonate kinase (EC 2.7.1.36) (MK)// ATP-binding; Cataract; Cholesterol biosynthesis; Disease mutation; Kinase; Lipid synthesis; Nucleotide-binding; Peroxisome; Polymorphism; Steroid biosynthesis; Sterol biosynthesis; Transferase.// 1E-112//202//100 D-BRCAN2027778.1// 52.1086// Q03426// Mevalonate kinase (EC 2.7.1.36) (MK)// ATP-binding; Cataract; Cholesterol biosynthesis; Disease mutation; Kinase; Lipid synthesis; Nucleotide-binding; Peroxisome; Polymorphism; Steroid biosynthesis; Sterol biosynthesis; Transferase.// 0// 343// 86 D-NT2RI3001005.1// 22.1629// Q8TDB8// Solute carrier family 2, facilitated glucosetransporter member 14 (Glucose transporter type 14)// Alternative splicing; Developmental protein; Differentiation; Glycoprotein; Membrane; Spermatogenesis; Sugar transport; Transmembrane; Transport.// 0// 490// 99 D-NT2RI3005261.1// 22..1629// Q8TDB8// Solute carrier family 2, facilitated glucosetransporter member 14 (Glucose transporter type 14)// Alternative splicing; Developmental protein; Differentiation; Glycoprotein; Membrane; Spermatogenesis; Sugar transport; Transmembrane; Transport.// 0// 491// 100 D-OCBBF2010718.1// 144..2495// Q9UPQ7// PDZ domain-containing RING finger protein 3(Ligand of Numb-protein X 3) (Semaphorin cytoplasmicdomain-associated protein 3) (SEMACAP3 protein)// 3D-structure; Alternative splicing; Coiled coil; Metal-binding; Polymorphism; Repeat; Zinc; Zinc-finger.// 0// 758// 99 D-OCBBF3004194.1// 129.2480// Q9UPQ7// PDZ domain-containing RING finger protein 3(Ligand of Numb-protein X 3) (Semaphorin cytoplasmicdomain-associated protein 3) (SEMACAP3 protein)// 3D-structure; Alternative splicing; Coiled coil; Metal-binding; Polymorphism; Repeat; Zinc; Zinc-finger.// 0// 760// 99 D-NT2RP8000826.1// 95.2461// Q9UPQ7// PDZ domain-containing RING finger protein 3(Ligand of Numb-protein X 3) (Semaphorin cytoplasmicdomain-associated protein 3) (SEMACAP3 protein)// 3D-structure; Alternative splicing; Coiled coil; Metal-binding; Polymorphism; Repeat; Zinc; Zinc-finger.// 0// 759// 99 D-NT2RP7007268.1// 95..2461// Q9UPQ7// PDZ domain-containing RING finger protein 3(Ligand of Numb-protein X 3) (Semaphorin cytoplasmicdomain-associated protein 3) (SEMACAP3 protein)// 3D-structure; Alternative splicing; Coiled coil; Metal-binding; Polymorphism; Repeat; Zinc; Zinc-finger.// 0// 759// 99 D-BRAWH3008172.1// 281..2452// Q9UPQ7// PDZ domain-containing RING finger protein 3(Ligand of Numb-protein X 3) (Semaphorin cytoplasmicdomain-associated protein 3) (SEMACAP3 protein)// 3D-structure; Alternative splicing; Coiled coil; Metal-binding; Polymorphism; Repeat; Zinc; Zinc-finger.// 0// 722// 99 D-BRAWH3011965.1// 300..>1574// Q9UPQ7// PDZ domain-containing RING finger protein 3(Ligand of Numb-protein X 3) (Semaphorin cytoplasmicdomain-associated protein 3) (SEMACAP3 protein)// 3D-structure; Alternative splicing; Coiled coil; Metal-binding; Polymorphism; Repeat; Zinc; Zinc-finger.// 0// 421// 99

3) Results of Homology Analysis Using BLASTP (RefSeq)

[0475] Homology analysis was performed on the 19 ORF sequences shown in Example 14-1), using BLASTP (blastall 2.2.6; ftp://ftp.ncbi.nih.gov/blast/), for RefSeq of the Jul. 15, 2006 version (human, mouse, rat; ftp://ftp.ncbi.nih.gov/refseq/). Based on the results of the homology analysis, the sequences showing the highest homology with an E-value of 1E-10 or less are shown below. In the following cases, however, the applicable candidate is not selected, but the next candidate is shown.

Having a definition beginning with "hypothetical protein FLJ" Having a definition beginning with "KIAA" Having a definition beginning with "hypothetical protein DKFZ" Having a definition beginning with "DKFZ" Having a definition beginning with "RIKEN cDNA" Having a definition beginning with "hypothetical protein MGC" Having a definition of "hypothetical protein" Having a definition beginning with "hypothetical protein PP" Having the definition as "neuronal thread protein" Having a definition beginning with "clone FLB" Having a definition beginning with "hypothetical protein PRO" Having the definition as "PRO0483 protein" Having a definition including "MNC" Having a definition including "MOST-1" Having a definition beginning with "similar to" Having a definition including "TPR gene on Y" Having a definition beginning with "HSPC" Having a definition beginning with "CGI-"

[0476] Individual data are shown with the name of cDNA sequence, ORF region, hit data accession number, hit data definition, E-value, consensus length (amino acid length), and identity separated by "//" in this order.

D-UTERU2026184.1// 191..2119// NP.sub.--597717.1// zinc finger protein 418 [Homo sapiens]// 0// 601// 100 D-BRACE3000012.1// 465..2558// NP.sub.--597717.1// zinc finger protein 418 [Homo sapiens]// 0// 674// 99 D-NT2RP8004156.1// 131..1387// NP.sub.--005154.2// v-akt murine thymoma viral oncogene homolog 1 [Homo sapiens]// 0// 418// 100 D-NT2RI3005525.1// 45..1292// NP.sub.--861449.1// sprouty-related protein with EVH-1 domain 2 [Homo sapiens]// 0// 408// 99 D-NT2RP8004592.1// 620..1183// NP.sub.--003921.2// src family associated phosphoprotein 2 [Homo sapiens]// 1E-110// 187// 100 D-NT2RI2014164.1// 162..1397// NP.sub.--000889.3// amine oxidase (flavin-containing) [Homo sapiens]// 0// 367// 93 D-BRAMY2029564.1// 143..1657// NP.sub.--000889.3// amine oxidase (flavin-containing) [Homo sapiens]// 0// 504// 100 D-BRHIP2003515.1// 84..707// NP.sub.--001020424.1// tumor protein D52 isoform 2 [Homo sapiens]// 1E-110// 207// 100 D-BRACE2044661.1// 297..878// NP.sub.--001670.1// Na+/K+-ATPase beta 3 subunit [Homo sapiens]// 5E-91// 158// 97 D-3NB692002462.1// 343..951// NP.sub.--000422.1// mevalonate kinase [Homo sapiens]// 1E-112// 202// 100 D-BRCAN2027778.1// 52..1086// NP.sub.--000422.1// mevalonate kinase [Homo sapiens]// 0// 343// 86 D-NT2RI3001005.1// 22..1629// NP.sub.--703150.1// glucose transporter 14 [Homo sapiens]// 0// 490// 99 D-NT2RI3005261.1// 22..1629// NP.sub.--703150.1// glucose transporter 14 [Homo sapiens]// 0// 491// 100 D-OCBBF2010718.1// 144..2495// NP.sub.--055824.1// PDZ domain containing RING finger 3 [Homo sapiens]// 0// 758// 99 D-OCBBF3004194.1// 129..2480// NP.sub.--055824.1// PDZ domain containing RING finger 3 [Homo sapiens]// 0// 760// 99 D-NT2RP8000826.1// 95..2461// NP.sub.--055824.1// PDZ domain containing RING finger 3 [Homo sapiens]// 0// 759// 99 D-NT2RP7007268.1// 95..2461// NP.sub.--055824.1// PDZ domain containing RING finger 3 [Homo sapiens]// 0// 759// 99 D-BRAWH3008172.1// 281..2452// NP.sub.--055824.1// PDZ domain containing RING finger 3 [Homo sapiens]// 0// 722// 99 D-BRAWH3011965.1// 300..>1574// NP.sub.--055824.1// PDZ domain containing RING finger 3 [Homo sapiens]// 0// 421// 99

4) Results of Motif Homology Analysis Using Pfam

[0477] Motif homology analysis was performed on the 19 ORF sequences shown in Example 14-1), using Pfam (ftp://ftp.sanger.ac.uk/pub/databases/Pfam/). The analytical program used was hmmpfam v2.3.2, and the analysis was performed for the November 2005 version of Pfam19.0. Based on the results of the homology analysis, the sequences showing the highest homology with an E-value of 1E-10 or less are shown below.

[0478] Individual data are shown with the name of cDNA sequence and ORF region, followed by hit data accession number, hit data name, hit data description, E-value, and InterPro ID, separated by " " in this order, presented repeatedly using as many "//" partitions as the hit data.

D-BRACE3000012.1// 465..2558// PF01352.15 KRAB KRAB box 2.1e-20 IPR001909 D-NT2RP8004156.1// 131..1387// PF00069.14 Pkinase Protein kinase domain 1.6e-113 IPR000719//PF07714.5 Pkinase_Tyr Protein tyrosine kinase 1.3e-18 // PF00433.12 Pkinase_C Protein kinase C terminal domain 1.4e-11 IPR000961 D-NT2RI3005525.1// 45..1292// PF05210.2 Sprouty Sprouty protein (Spry) 2.7e-11 IPR007875 D-NT2RI2014164.1// 162..1397// PF01593.12 Amino_oxidase Flavin containing amine oxidoreductase 9.2e-57 IPR002937 D-BRAMY2029564.1// 143..1657// PF01593.12 Amino_oxidase Flavin containing amine oxidoreductase 5.8e-103 IPR002937 D-BRHIP2003515.1// 84..707// PF04201.4 TPD52 Tumour protein D52 family 1.5e-119 IPR007327 D-BRACE2044661.1// 297..878// PF00287.7 Na_K-ATPase Sodium/potassium ATPase beta chain 3.1e-32 IPR000402 D-NT2RI3001005.1// 22..1629// PF00083.13 Sugar_tr Sugar (and other) transporter 6.3e-200 IPR005828// PF07690.5 MFS.sub.--1 Major Facilitator Superfamily 1.1e-14 IPR011701 D-NT2RI3005261.1// 22..1629// PF00083.13 Sugar_tr Sugar (and other) transporter 5.5e-200 IPR005828// PF07690.5 MFS.sub.--1 Major Facilitator Superfamily 1.1e-14 PRO11701 D-OCBBF2010718.1// 144..2495// PF00595.12 PDZ PDZ domain (Also known as DHR or GLGF) 2e-14 IPR001478 D-OCBBF3004194.1// 129..2480// PF00595.12 PDZ PDZ domain (Also known as DHR or GLGF) 7.1e-16 IPR001478 D-NT2RP8000826.1// 95..2461// PF00595.12 PDZ PDZ domain (Also known as DHR or GLGF) 7.1e-16 IPR001478 D-NT2RP7007268.1// 95..2461// PF00595.12 PDZ PDZ domain (Also known as DHR or GLGF) 7.1e-16 IPR001478 D-BRAWH3008172.1// 281..2452// PF00595.12 PDZ PDZ domain (Also known as DHR or GLGF) 7.1e-16 IPR001478 D-BRAWH3011965.1// 300..>1574// PF00595.12 PDZ PDZ domain (Also known as DHR or GLGF) 7.1e-16 IPR001478

5) Transmembrane Domain Prediction Analysis Using SOSUI

[0479] Transmembrane domain prediction analysis was performed on the 19 ORF sequences shown in Example 14-1), using SOSUI (http://bp.nuap.nagoya-u.ac.jp/sosui/). For the analysis, SOSUI version 1.5 was used. The sequences that permitted prediction of the transmembrane domain in the SOSUI analysis are shown below.

[0480] Individual data are shown with the name of cDNA sequence, ORF region, and number of transmembrane domain separated by "//".

D-NT2RI3005525.1// 45..1292// 1

D-BRACE2044661.1// 297..878// 2

D-NT2RI3001005.1// 22..1629// 11

D-NT2RI3005261.1// 22..1629// 11

6) N-Terminal Secretion Signal Sequence Prediction Analysis Using PSORT

[0481] N-terminal secretion signal sequence prediction was performed on the 19 ORF sequences shown in Example 14-1), using PSORT (http://psort.nibb.ac.jp/). PSORT II was used for the analysis.

[0482] In the PSORT analysis, no sequences permitted prediction of the N-terminal secretion signal sequence.

7) N-Terminal Secretion Signal Sequence Prediction Analysis Using SignalP ver. 3.0

[0483] N-terminal secretion signal sequence prediction was performed on the 19 ORF sequences shown in Example 14-1), using SignalP (http://www.cbs.dtu.dk/services/SignalP/). SignalP version 3.0 was used for the analysis. Sequences that permitted prediction of the N-terminal secretion signal sequence in the SignalP analysis are shown below.

[0484] Individual data are shown with the name of cDNA sequence and ORF region separated by "//".

D-BRACE2044661.1// 297..878

Summary of Examples 1 to 14

[0485] Although there have been remarkable advances in the analysis of human chromosome sequences thanks to the progress in human genome research, this does not mean that all the human genetic functions have been clarified. We analyzed human genes with a focus on the diversity thereof, and showed that the diversity is largely associated with gene functional changes.

[0486] By comparing human genome sequence information and data on human cDNAs, which are products of transcription therefrom, it was found that a plurality of mRNAs are transcribed from certain regions of chromosome. They occur in two cases: a case wherein there are different ORF regions predicted to encode and produce different proteins, and another case wherein there are different 5'UTR regions or 3'UTR regions, which are noncoding regions, and the same protein is produced. With an emphasis on cDNAs predicted to encode proteins different from those of known cDNAs that have already been analyzed, in particular, we performed search and sequence analysis of such cDNAs. Hence, it was found that the cause of the diversity resides mainly in transcription initiation point selectivity and exon selectivity. Regarding transcription initiation point selectivity, a change of the transcription factor used in a certain chromosome region produced a different position for transcription initiation, resulting in the cDNA diversity. As for exon selectivity, an increase or decrease in the exon used, despite transcription from the same chromosome region, at the time of transcription and splicing, resulted in the cDNA diversity.

[0487] How the genetic diversity is associated with gene functions was analyzed on the basis of our own information on the expression frequencies of mRNAs by the 5'-terminal sequences of about 1.50 million human cDNAs (5'-onepass sequences). Hence, a large number of cases were found wherein gene functions seemed to be significantly influenced by diversity features, including variation of transcription initiation region selective in a certain organ, and deletion of exon in a certain condition. We discovered genes whose diversity varies depending on the brain tissue portion and nerve cell differentiation stage, and conducted extensive analyses.

[0488] Regarding the analytical method, the expression levels were compared using real-time PCR (polymerase chain reaction). For example, assuming an exon predicted to be inserted selectively only after differentiation into nerve cells, a primer that specifically detects the exon region (01) is designed, a primer that specifically detects the pattern in which the exon is not inserted (02) is designed, and a primer that detects a region having both patterns in common (03) is designed. With the use of these 3 kinds of primers, the amounts amplified at the various stages of nerve cell differentiation are compared. The specific region detection results for 01 and 02 are compared with the amount amplified for the shared region 03 as the control at various stages of nerve cell differentiation, whereby it is possible to know how the exon selectivity was changed by nerve cell differentiation. Hence, the correlation between exon selectivity and tissue specific expression can be assessed.

[0489] By this method, we discovered many genes whose diversity is associated with tissue-specific expression. Being specific for the tissue in which the gene is expressed suggests that the diversity may significantly influence the function of the gene. Hence, by using a specific region with diversity as a gene marker, it seems possible to elucidate the function of a particular portion of the brain, and to detect nerve cell differentiation or regeneration stages in detail. Furthermore, for example, by proceeding to develop a pharmaceutical with a protein having a specific region expressed only at a certain stage of nerve cell differentiation or regeneration as the target, it seems possible to develop a pharmaceutical that is more effective with lower prevalence of adverse reactions.

<Explanation of Nerve Cell Differentiation>

[0490] The mRNAs related to nerve cell differentiation (mRNAs that induce differentiation of nervous system cells and exhibit an expressional change) are thought to be useful as therapeutic/diagnostic markers for nerve disease. By searching for an mRNA that exhibits an expressional change during the process of inducing differentiation of cultured human cells NT2 into nerve cells (retinoic acid (RA) stimulation or RA stimulation followed by treatment with growth inhibitor), such an mRNA can be discovered. These mRNAs are also thought to be associated with nerve regeneration.

<Explanation of Various Portions of the Brain>

1) Hippocampus

[0491] Among the brain tissues, the hippocampus is a very important portion that controls memory, having the function of fixing memory by determining whether or not the information obtained is necessary, and allowing other brain portions to store the memory. Clinical findings show that if the hippocampus is disordered, or, in the worst case, if the hippocampus is lacked, one is only able to remember new things for a short time. Some patients with dementia are thought to have an abnormality in the hippocampus. When comparing the whole brain tissue and the hippocampus, the mRNAs exhibiting expressional variation are mRNAs involved in memory or associated with dementia, and are thought to be useful in elucidating the mechanism for memory and as therapeutic/diagnostic markers.

2) Caudate Nucleus

[0492] The hippocampal system is a portion that is important to memory associated with spatial cognition. Spatial cognition is also said to be memory of remembering places. By contrast, the caudate nucleus is said to be a portion that is important to memory acquired through habits (habitual memory).

3) Amygdala

[0493] The amygdala is the emotional center of the brain. The information that has passed the amygdala causes emotional reactions, for example, panic and fear reactions. If a strong fear is produced upon affect assessment of a stimulus by the amygdala, the amygdala transmits warning signals to various portions of the brain. As a result, reactions such as palm sweating, palpitation, blood pressure elevation, and rapid secretion of adrenaline occur. The amygdala can also be said to be a tissue that controls a kind of instinct of defense in which warning signals are transmitted to the body to make the body in a warning state. When comparing the whole brain tissue and the amygdala, the mRNAs exhibiting expressional variation are mRNAs involved in emotional reactions, and are thought to be useful in elucidating the molecular mechanisms for emotional reactions, fear reactions, panic and the like.

4) Cerebellum

[0494] The cerebellum is the center of equilibrium, muscle movement, and motor learning. This region is thought to be involved in motor regulation; as the cerebellum acts, one can make smooth motions involuntarily. There is also increasing evidence for the involvement of the cerebellum not only in physical movement, but also in the habituation of higher movements such as reading and writing. When comparing the brain tissue as a whole and the cerebellum, the mRNAs exhibiting expressional variation are mRNAs involved in equilibrium and motor functions, and are thought to be useful in elucidating the molecular mechanisms for the motor functions under the control of the brain.

5) Thalamus

[0495] The thalamus is a portion where nerve cells that are highly associated with the cerebrum gather, transferring sensory information from the spinal cord and the like to the relevant portions of the cerebrum, and regulating the motor commands of the cerebrum. For example, in visual sensation, images are separated into size, shape, and color, and in auditory sensation, sounds are separated into volume and comfortability and sent to the sensory area of cerebral cortex. When comparing the whole brain tissue and the thalamus, the mRNAs exhibiting expressional variation are mRNAs involved in signal transduction from sensory organs, and are thought to be useful in elucidating the molecular mechanism for the signal transduction under the control of the brain.

6) Substantia Nigra

[0496] The substantia nigra is a nerve nucleus that occupies a portion of the midbrain. The substantia nigra is roughly divided into two portions: pars compacta and pars reticulata (and lateral portion), both of which are central constituents of the basal ganglion. The basal ganglion, along with the cerebellum, is known as a higher center responsible for important roles in the onset and control of voluntary movement. The basal ganglion roughly consists of the four nerve nuclei, i.e., striate body, pallidum, substantia nigra, and subthalamic nucleus, the striate body being divided into the caudate nucleus and the putamen, the pallidum into the lateral segment and the medial segment, and the substantia nigra into the pars compacta and the pars reticulata. When these nerve nuclei are re-classified from the viewpoint of the signal transduction modes of "input", "output", and "mutual communication", the striate body corresponds to the input portion of the basal ganglion, and the pallidal medial segment and the substantia nigra pars reticulata correspond to the output portion thereof. Connecting the input portion and the output portion indirectly, the pallidal lateral segment and the subthalamic nucleus are thought to be an interface of the basal ganglion; modifying the nervous activity of the striate body by dopamine, the substantia nigra pars compacta is thought to be a modifying portion of the basal ganglion.

[0497] An illness of the cerebro-nervous system characterized by an insufficient production of a neurotransmitter produced in the substantia nigra in the brain, known as dopamine, resulting in motor disorders such as hand tremor and stiffening of muscles making physical movement dull, is said to be Parkinson's disease. Brain nerve cells usually decrease little by little with aging; in Parkinson's disease, nerve cells of the substantia nigra decrease remarkably at higher rates than usual.

[0498] When comparing the whole brain tissue and the substantia nigra, the mRNAs exhibiting expressional variation are thought to be mRNAs involved in the above events.

7) Alzheimer Patient's Cerebral Cortex

[0499] Alzheimer's disease is an illness of the cerebro-nervous system characterized by loss of memory, that hampers daily activities and necessitates nursing care in advanced cases, eventually leading to atrophy of the brain. Although the causes of the onset thereof are said to be associated with environmental factors such as stress, as well as vascular factors such as hypertension and cholesterolemia, they have not been investigated in full. Therefore, when comparing normal brain tissue and Alzheimer pathologic tissues, the mRNAs exhibiting expressional variation are mRNAs associated with Alzheimer's disease, and are thought to be useful in elucidating the mechanisms for the onset of pathologic conditions, and as therapeutic/diagnostic markers.

[0500] This application is based on a patent application No. 2007-066430 filed in Japan (filing date: Mar. 15, 2007), the contents of which are incorporated in full herein by this reference.

[Sequence Listing]

Sequence CWU 1

1

266130RNAArtificial Sequencesynthesized oligo-cap linker used for oligo-cap method 1agcaucgagu cggccuuguu ggccuacugg 30242DNAArtificial Sequencesynthesized oligo(dT) primer used for oligo-cap method 2gcggctgaag acggcctatg tggccttttt tttttttttt tt 42321DNAArtificial Sequenceforward primer used for preparing cDNA library 3agcatcgagt cggccttgtt g 21421DNAArtificial Sequencereverse primer used for preparing cDNA library 4gcggctgaag acggcctatg t 21522DNAArtificial Sequenceforward primer used for detecting a polynucleotide encoding GAPDH 5ccaggtggtc tcctctgact tc 22619DNAArtificial Sequencereverse primer used for detecting a polynucleotide encoding GAPDH 6gtggtcgttg agggcaatg 19725DNAArtificial Sequenceprobe used for detecting a polynucleotide encoding GAPDH 7acagcgacac ccactcctcc acctt 2582499DNAHomo sapiens 8ggtagcgacc attttggtta atgttgggtg tgtttctgcg gtttgtgagg tgagaggcgc 60tggagctatg ggtccgaacc gcggtgtctg aacccagaag gtgaagagtc cttcttgctg 120cacagaggca gatcttaggc cccgtaacgg cgcccgccgc tcccggcagt gctttccccg 180cgtactcggg atggcggcgg ccgcgctgag gctcccggct caggttgttg gtgtggatca 240gaagatgagg aggcaccttc taagaagagc atttctatac aaagagtgtc tcaggtcagc 300actcctgggg caggtgtctc ccaagaaggc tcactcttgt gaaatgtgtg gcgcgatctt 360gggagacatt ttgcacttgg cagatcatca ggggacacat cacaagcaga aactgcacag 420gtgtgaggca tgggggaata aattgtatga tagttcaaac cgtccgcacc agaatcagta 480ccttggagag aaaccctata gaagcagtgt tgaggaagca ttgtttgtga agaggtgtaa 540gttccatgtg tcagaggagt catctatctt cattcagagt ggaaaggact ttttgcccag 600ctcaggatta ctgctgcagg aggccactca cactggggag aagtcaaaca gcaaacctga 660gtgtgagtct ccctttcagt ggggagatac tcattacagc tgtggagaat gcatgaaaca 720ttctagcacc aaacacgtat ttgttcaaca gcagagactt ccctctagag aggaatgtta 780ttgctgggaa tgtgggaaat cctttagcaa atatgatagc gtcagtaatc atcagagagt 840tcacactggg aaaagacctt atgaatgtgg agaatgtggg aaatctttta gtcataaggg 900cagccttgtt cagcatcagc gagttcacac tgggaaaaga ccttatgaat gtggagaatg 960tgggaaatct tttagtcata agggcagcct tgttcagcat cagcgagttc atactggaga 1020aagaccttat gagtgtggag aatgtgggaa atcttttagt caaaatggta ctctcattaa 1080acatcaacga gttcacactg gagaaagacc ttatgagtgt gaagaatgtg ggaaatgttt 1140tactcagaag ggcaatctca ttcaacatca acgaggtcac actagtgaaa gaccttatga 1200gtgtgaagaa tgtggaaaat gttttagtca aaagggcacc ctaactgaac atcatcgagt 1260tcacactaga gaacgacctt atgagtgtgg agaatgtggg aaatctttta gtcgaaaggg 1320acaccttagg aaccatcagc gaggtcacac tggagaaaga ccttacgagt gtggagaatg 1380tgggaaatct tttagtcgaa agggcaacct cattcagcat cagcgaagcc acactggaga 1440aaggccttat gagtgtagag agtgtaggaa attatttagg ggcaagtccc acctcattga 1500acaccagaga gttcacactg gagaaaggcc atatgaatgt aatgaatgtg ggaaatcatt 1560tcaagacagc tctgggtttc gtgttcatca gagagttcac actggagaaa aaccgtttga 1620gtgtagtgaa tgtgggaagt catttcctca aagctgttcc ctccttcgac atcggagagt 1680tcatactgga gaaaggcctt atgaatgtgg agaatgtgga aagtcatttc atcagagctc 1740ttccctcctt cgacatcaga aaacccacac tgcagaaaga ccttatgagt gcagagaatg 1800tgggaaattc ttctccagtc tccttgaaca caggagagtt cacactggag aaaggcctta 1860tgagtgcagg gaatgtggaa aaacatttac tcgaaggtct gcgcatttta aacatcagag 1920acttcatact cgaggaaagc cttacgagtg cagcgaatgt gggaaatcct ttgctgaaac 1980cttcagtctt actgaacaca ggagagtaca cactggagaa aggccttatg agtgcagtga 2040atgtggaaaa tcatttcatc gaagctcttc tctccttcga catcagagag ttcacacaga 2100aagaagtcct tacaagtgaa aagaaatttg ggaaattctt tagctaaacc tctgtgcatc 2160ttcttgatca gagggttctt actggatcag gaccttatga gtgtgacaaa cgtgggatat 2220tctttatgca gaagtcttgt tttattacat acagaagagc tcccactgca gaagggcctc 2280ttgagtgcga tgaatgtgag aaagccttct gccttctgtc attggataac agattgttct 2340cataaggaaa acactgtaca cgtacaggaa atattatttc ttgtaaaaca taacactgga 2400ggagatgcct tatgacggag ccatctgcct aaattgacat accttcagca tctgcataaa 2460ctcaattatg ttggagctgt gtggcatttt tcaccctgc 249992499DNAHomo sapiensCDS(191)..(2119) 9ggtagcgacc attttggtta atgttgggtg tgtttctgcg gtttgtgagg tgagaggcgc 60tggagctatg ggtccgaacc gcggtgtctg aacccagaag gtgaagagtc cttcttgctg 120cacagaggca gatcttaggc cccgtaacgg cgcccgccgc tcccggcagt gctttccccg 180cgtactcggg atg gcg gcg gcc gcg ctg agg ctc ccg gct cag gtt gtt 229 Met Ala Ala Ala Ala Leu Arg Leu Pro Ala Gln Val Val 1 5 10ggt gtg gat cag aag atg agg agg cac ctt cta aga aga gca ttt cta 277Gly Val Asp Gln Lys Met Arg Arg His Leu Leu Arg Arg Ala Phe Leu 15 20 25tac aaa gag tgt ctc agg tca gca ctc ctg ggg cag gtg tct ccc aag 325Tyr Lys Glu Cys Leu Arg Ser Ala Leu Leu Gly Gln Val Ser Pro Lys30 35 40 45aag gct cac tct tgt gaa atg tgt ggc gcg atc ttg gga gac att ttg 373Lys Ala His Ser Cys Glu Met Cys Gly Ala Ile Leu Gly Asp Ile Leu 50 55 60cac ttg gca gat cat cag ggg aca cat cac aag cag aaa ctg cac agg 421His Leu Ala Asp His Gln Gly Thr His His Lys Gln Lys Leu His Arg 65 70 75tgt gag gca tgg ggg aat aaa ttg tat gat agt tca aac cgt ccg cac 469Cys Glu Ala Trp Gly Asn Lys Leu Tyr Asp Ser Ser Asn Arg Pro His 80 85 90cag aat cag tac ctt gga gag aaa ccc tat aga agc agt gtt gag gaa 517Gln Asn Gln Tyr Leu Gly Glu Lys Pro Tyr Arg Ser Ser Val Glu Glu 95 100 105gca ttg ttt gtg aag agg tgt aag ttc cat gtg tca gag gag tca tct 565Ala Leu Phe Val Lys Arg Cys Lys Phe His Val Ser Glu Glu Ser Ser110 115 120 125atc ttc att cag agt gga aag gac ttt ttg ccc agc tca gga tta ctg 613Ile Phe Ile Gln Ser Gly Lys Asp Phe Leu Pro Ser Ser Gly Leu Leu 130 135 140ctg cag gag gcc act cac act ggg gag aag tca aac agc aaa cct gag 661Leu Gln Glu Ala Thr His Thr Gly Glu Lys Ser Asn Ser Lys Pro Glu 145 150 155tgt gag tct ccc ttt cag tgg gga gat act cat tac agc tgt gga gaa 709Cys Glu Ser Pro Phe Gln Trp Gly Asp Thr His Tyr Ser Cys Gly Glu 160 165 170tgc atg aaa cat tct agc acc aaa cac gta ttt gtt caa cag cag aga 757Cys Met Lys His Ser Ser Thr Lys His Val Phe Val Gln Gln Gln Arg 175 180 185ctt ccc tct aga gag gaa tgt tat tgc tgg gaa tgt ggg aaa tcc ttt 805Leu Pro Ser Arg Glu Glu Cys Tyr Cys Trp Glu Cys Gly Lys Ser Phe190 195 200 205agc aaa tat gat agc gtc agt aat cat cag aga gtt cac act ggg aaa 853Ser Lys Tyr Asp Ser Val Ser Asn His Gln Arg Val His Thr Gly Lys 210 215 220aga cct tat gaa tgt gga gaa tgt ggg aaa tct ttt agt cat aag ggc 901Arg Pro Tyr Glu Cys Gly Glu Cys Gly Lys Ser Phe Ser His Lys Gly 225 230 235agc ctt gtt cag cat cag cga gtt cac act ggg aaa aga cct tat gaa 949Ser Leu Val Gln His Gln Arg Val His Thr Gly Lys Arg Pro Tyr Glu 240 245 250tgt gga gaa tgt ggg aaa tct ttt agt cat aag ggc agc ctt gtt cag 997Cys Gly Glu Cys Gly Lys Ser Phe Ser His Lys Gly Ser Leu Val Gln 255 260 265cat cag cga gtt cat act gga gaa aga cct tat gag tgt gga gaa tgt 1045His Gln Arg Val His Thr Gly Glu Arg Pro Tyr Glu Cys Gly Glu Cys270 275 280 285ggg aaa tct ttt agt caa aat ggt act ctc att aaa cat caa cga gtt 1093Gly Lys Ser Phe Ser Gln Asn Gly Thr Leu Ile Lys His Gln Arg Val 290 295 300cac act gga gaa aga cct tat gag tgt gaa gaa tgt ggg aaa tgt ttt 1141His Thr Gly Glu Arg Pro Tyr Glu Cys Glu Glu Cys Gly Lys Cys Phe 305 310 315act cag aag ggc aat ctc att caa cat caa cga ggt cac act agt gaa 1189Thr Gln Lys Gly Asn Leu Ile Gln His Gln Arg Gly His Thr Ser Glu 320 325 330aga cct tat gag tgt gaa gaa tgt gga aaa tgt ttt agt caa aag ggc 1237Arg Pro Tyr Glu Cys Glu Glu Cys Gly Lys Cys Phe Ser Gln Lys Gly 335 340 345acc cta act gaa cat cat cga gtt cac act aga gaa cga cct tat gag 1285Thr Leu Thr Glu His His Arg Val His Thr Arg Glu Arg Pro Tyr Glu350 355 360 365tgt gga gaa tgt ggg aaa tct ttt agt cga aag gga cac ctt agg aac 1333Cys Gly Glu Cys Gly Lys Ser Phe Ser Arg Lys Gly His Leu Arg Asn 370 375 380cat cag cga ggt cac act gga gaa aga cct tac gag tgt gga gaa tgt 1381His Gln Arg Gly His Thr Gly Glu Arg Pro Tyr Glu Cys Gly Glu Cys 385 390 395ggg aaa tct ttt agt cga aag ggc aac ctc att cag cat cag cga agc 1429Gly Lys Ser Phe Ser Arg Lys Gly Asn Leu Ile Gln His Gln Arg Ser 400 405 410cac act gga gaa agg cct tat gag tgt aga gag tgt agg aaa tta ttt 1477His Thr Gly Glu Arg Pro Tyr Glu Cys Arg Glu Cys Arg Lys Leu Phe 415 420 425agg ggc aag tcc cac ctc att gaa cac cag aga gtt cac act gga gaa 1525Arg Gly Lys Ser His Leu Ile Glu His Gln Arg Val His Thr Gly Glu430 435 440 445agg cca tat gaa tgt aat gaa tgt ggg aaa tca ttt caa gac agc tct 1573Arg Pro Tyr Glu Cys Asn Glu Cys Gly Lys Ser Phe Gln Asp Ser Ser 450 455 460ggg ttt cgt gtt cat cag aga gtt cac act gga gaa aaa ccg ttt gag 1621Gly Phe Arg Val His Gln Arg Val His Thr Gly Glu Lys Pro Phe Glu 465 470 475tgt agt gaa tgt ggg aag tca ttt cct caa agc tgt tcc ctc ctt cga 1669Cys Ser Glu Cys Gly Lys Ser Phe Pro Gln Ser Cys Ser Leu Leu Arg 480 485 490cat cgg aga gtt cat act gga gaa agg cct tat gaa tgt gga gaa tgt 1717His Arg Arg Val His Thr Gly Glu Arg Pro Tyr Glu Cys Gly Glu Cys 495 500 505gga aag tca ttt cat cag agc tct tcc ctc ctt cga cat cag aaa acc 1765Gly Lys Ser Phe His Gln Ser Ser Ser Leu Leu Arg His Gln Lys Thr510 515 520 525cac act gca gaa aga cct tat gag tgc aga gaa tgt ggg aaa ttc ttc 1813His Thr Ala Glu Arg Pro Tyr Glu Cys Arg Glu Cys Gly Lys Phe Phe 530 535 540tcc agt ctc ctt gaa cac agg aga gtt cac act gga gaa agg cct tat 1861Ser Ser Leu Leu Glu His Arg Arg Val His Thr Gly Glu Arg Pro Tyr 545 550 555gag tgc agg gaa tgt gga aaa aca ttt act cga agg tct gcg cat ttt 1909Glu Cys Arg Glu Cys Gly Lys Thr Phe Thr Arg Arg Ser Ala His Phe 560 565 570aaa cat cag aga ctt cat act cga gga aag cct tac gag tgc agc gaa 1957Lys His Gln Arg Leu His Thr Arg Gly Lys Pro Tyr Glu Cys Ser Glu 575 580 585tgt ggg aaa tcc ttt gct gaa acc ttc agt ctt act gaa cac agg aga 2005Cys Gly Lys Ser Phe Ala Glu Thr Phe Ser Leu Thr Glu His Arg Arg590 595 600 605gta cac act gga gaa agg cct tat gag tgc agt gaa tgt gga aaa tca 2053Val His Thr Gly Glu Arg Pro Tyr Glu Cys Ser Glu Cys Gly Lys Ser 610 615 620ttt cat cga agc tct tct ctc ctt cga cat cag aga gtt cac aca gaa 2101Phe His Arg Ser Ser Ser Leu Leu Arg His Gln Arg Val His Thr Glu 625 630 635aga agt cct tac aag tga aaagaaattt gggaaattct ttagctaaac 2149Arg Ser Pro Tyr Lys 640ctctgtgcat cttcttgatc agagggttct tactggatca ggaccttatg agtgtgacaa 2209acgtgggata ttctttatgc agaagtcttg ttttattaca tacagaagag ctcccactgc 2269agaagggcct cttgagtgcg atgaatgtga gaaagccttc tgccttctgt cattggataa 2329cagattgttc tcataaggaa aacactgtac acgtacagga aatattattt cttgtaaaac 2389ataacactgg aggagatgcc ttatgacgga gccatctgcc taaattgaca taccttcagc 2449atctgcataa actcaattat gttggagctg tgtggcattt ttcaccctgc 249910642PRTHomo sapiens 10Met Ala Ala Ala Ala Leu Arg Leu Pro Ala Gln Val Val Gly Val Asp1 5 10 15Gln Lys Met Arg Arg His Leu Leu Arg Arg Ala Phe Leu Tyr Lys Glu 20 25 30Cys Leu Arg Ser Ala Leu Leu Gly Gln Val Ser Pro Lys Lys Ala His 35 40 45Ser Cys Glu Met Cys Gly Ala Ile Leu Gly Asp Ile Leu His Leu Ala 50 55 60Asp His Gln Gly Thr His His Lys Gln Lys Leu His Arg Cys Glu Ala65 70 75 80Trp Gly Asn Lys Leu Tyr Asp Ser Ser Asn Arg Pro His Gln Asn Gln 85 90 95Tyr Leu Gly Glu Lys Pro Tyr Arg Ser Ser Val Glu Glu Ala Leu Phe 100 105 110Val Lys Arg Cys Lys Phe His Val Ser Glu Glu Ser Ser Ile Phe Ile 115 120 125Gln Ser Gly Lys Asp Phe Leu Pro Ser Ser Gly Leu Leu Leu Gln Glu 130 135 140Ala Thr His Thr Gly Glu Lys Ser Asn Ser Lys Pro Glu Cys Glu Ser145 150 155 160Pro Phe Gln Trp Gly Asp Thr His Tyr Ser Cys Gly Glu Cys Met Lys 165 170 175His Ser Ser Thr Lys His Val Phe Val Gln Gln Gln Arg Leu Pro Ser 180 185 190Arg Glu Glu Cys Tyr Cys Trp Glu Cys Gly Lys Ser Phe Ser Lys Tyr 195 200 205Asp Ser Val Ser Asn His Gln Arg Val His Thr Gly Lys Arg Pro Tyr 210 215 220Glu Cys Gly Glu Cys Gly Lys Ser Phe Ser His Lys Gly Ser Leu Val225 230 235 240Gln His Gln Arg Val His Thr Gly Lys Arg Pro Tyr Glu Cys Gly Glu 245 250 255Cys Gly Lys Ser Phe Ser His Lys Gly Ser Leu Val Gln His Gln Arg 260 265 270Val His Thr Gly Glu Arg Pro Tyr Glu Cys Gly Glu Cys Gly Lys Ser 275 280 285Phe Ser Gln Asn Gly Thr Leu Ile Lys His Gln Arg Val His Thr Gly 290 295 300Glu Arg Pro Tyr Glu Cys Glu Glu Cys Gly Lys Cys Phe Thr Gln Lys305 310 315 320Gly Asn Leu Ile Gln His Gln Arg Gly His Thr Ser Glu Arg Pro Tyr 325 330 335Glu Cys Glu Glu Cys Gly Lys Cys Phe Ser Gln Lys Gly Thr Leu Thr 340 345 350Glu His His Arg Val His Thr Arg Glu Arg Pro Tyr Glu Cys Gly Glu 355 360 365Cys Gly Lys Ser Phe Ser Arg Lys Gly His Leu Arg Asn His Gln Arg 370 375 380Gly His Thr Gly Glu Arg Pro Tyr Glu Cys Gly Glu Cys Gly Lys Ser385 390 395 400Phe Ser Arg Lys Gly Asn Leu Ile Gln His Gln Arg Ser His Thr Gly 405 410 415Glu Arg Pro Tyr Glu Cys Arg Glu Cys Arg Lys Leu Phe Arg Gly Lys 420 425 430Ser His Leu Ile Glu His Gln Arg Val His Thr Gly Glu Arg Pro Tyr 435 440 445Glu Cys Asn Glu Cys Gly Lys Ser Phe Gln Asp Ser Ser Gly Phe Arg 450 455 460Val His Gln Arg Val His Thr Gly Glu Lys Pro Phe Glu Cys Ser Glu465 470 475 480Cys Gly Lys Ser Phe Pro Gln Ser Cys Ser Leu Leu Arg His Arg Arg 485 490 495Val His Thr Gly Glu Arg Pro Tyr Glu Cys Gly Glu Cys Gly Lys Ser 500 505 510Phe His Gln Ser Ser Ser Leu Leu Arg His Gln Lys Thr His Thr Ala 515 520 525Glu Arg Pro Tyr Glu Cys Arg Glu Cys Gly Lys Phe Phe Ser Ser Leu 530 535 540Leu Glu His Arg Arg Val His Thr Gly Glu Arg Pro Tyr Glu Cys Arg545 550 555 560Glu Cys Gly Lys Thr Phe Thr Arg Arg Ser Ala His Phe Lys His Gln 565 570 575Arg Leu His Thr Arg Gly Lys Pro Tyr Glu Cys Ser Glu Cys Gly Lys 580 585 590Ser Phe Ala Glu Thr Phe Ser Leu Thr Glu His Arg Arg Val His Thr 595 600 605Gly Glu Arg Pro Tyr Glu Cys Ser Glu Cys Gly Lys Ser Phe His Arg 610 615 620Ser Ser Ser Leu Leu Arg His Gln Arg Val His Thr Glu Arg Ser Pro625 630 635 640Tyr Lys1142DNAHomo sapiens 11gcgctgaggc tcccggctca ggttgttggt gtggatcaga ag 421243PRTHomo sapiens 12Met Ala Ala Ala Ala Leu Arg Leu Pro Ala Gln Val Val Gly Val Asp1 5 10 15Gln Lys Met Arg Arg His Leu Leu Arg Arg Ala Phe Leu Tyr Lys Glu 20 25 30Cys Leu Arg Ser Ala Leu Leu Gly Gln Val Ser 35 4013213DNAHomo sapiens 13gcatcatctg gctgcaaaga agagaacaca ctgtgtttga gggaggagga aggaggatca 60gagtttaaac tcctgccata atgcagggca ctgtggcatt tgaagatgtg gctgtgaact 120tttcccagga ggagtggagt ctccttagtg aggttcagag atgcctttac catgacgtga 180tgctggagaa ctgggtactt atatcctccc tgg 213142DNAHomo sapiens 14gt 21577PRTHomo sapiens 15Met Gln Gly Thr Val Ala Phe Glu Asp Val Ala Val Asn Phe Ser Gln1

5 10 15Glu Glu Trp Ser Leu Leu Ser Glu Val Gln Arg Cys Leu Tyr His Asp 20 25 30Val Met Leu Glu Asn Trp Val Leu Ile Ser Ser Leu Gly Cys Trp Cys 35 40 45Gly Ser Glu Asp Glu Glu Ala Pro Ser Lys Lys Ser Ile Ser Ile Gln 50 55 60Arg Val Ser Gln Val Ser Thr Pro Gly Ala Gly Val Ser65 70 75163551DNAHomo sapiens 16ctctggtagc gaccattttg gttaatgttg ggtgtgtttc tgcggtttgt gaggtgagag 60gcgctggagc tatgggtccg aaccgcggtg tctgaaccca gaaggtgaag agtccttctt 120gctgcacaga ggcagatcct aggccccgta acggcgcccg ccgctcccgg cagtgctttc 180cccgcgtact cgggatggcg gcggccgcgc tgaggctccc ggctcaggca tcatctggct 240gcaaagaaga gaacacactg tgtttgaggg aggaggaagg aggatcagag tttaaactcc 300tgccataatg cagaccattc tccataacaa gggagtggga tttaaatgac tctaggtgga 360cagaggaaag tcgtcattgt agaggggtca ccaagtgcaa cacagaagtg gaatagggcc 420atggatatct aaagccccac ctggttctgt ggggatgtgg agccatgcaa gaaggccttc 480atagaatgac ttgggactgc attgctattg ggagccatga gaattctgtg cagggcactg 540tggcatttga agatgtggct gtgaactttt cccaggagga gtggagtctc cttagtgagg 600ttcagagatg cctttaccat gacgtgatgc tggagaactg ggtacttata tcctccctgg 660gttgttggtg tggatcagaa gatgaggagg caccttctaa gaagagcatt tctatacaaa 720gagtgtctca ggtcagcact cctggggcag gtgtgtctcc caagaaggct cactcttgtg 780aaatgtgtgg cgcgatcttg ggagacattt tgcacttggc agatcatcag gggacacatc 840acaagcagaa actgcacagg tgtgaggcat gggggaataa attgtatgat agttcaaacc 900gtccgcacca gaatcagtac cttggagaga aaccctatag aagcagtgtt gaggaagcat 960tgtttgtgaa gaggtgtaag ttccatgtgt cagaggagtc atctatcttc attcagagtg 1020gaaaggactt tttgcccagc tcaggattac tgctgcagga ggccactcac actggggaga 1080agtcaaacag caaacctgag tgtgagtctc cctttcagtg gggagatact cgttacagct 1140gtggagaatg catgaaacat tctagcacca aacacgtatt tgttcaacag cagagacttc 1200cctctagaga ggaatgttat tgctgggaat gtgggaaatc ctttagcaaa tatgatagcg 1260tcagtaatca tcagagagtt cacactggga aaagacctta tgaatgtgga gaatgtggga 1320aatcttttag tcataagggc agccttgttc agcatcagcg agttcacact gggaaaagac 1380cttatgaatg tggagaatgt gggaaatctt ttagtcataa gggcagcctt gttcagcatc 1440agcgagttca tactggagaa agaccttatg agtgtggaga atgtgggaaa tcttttagtc 1500aaaatggtac tctcattaaa catcaacgag ttcacactgg agaaagacct tatgagtgtg 1560aagaatgtgg gaaatgtttt actcagaagg gcaatctcat tcaacatcaa cgaggtcaca 1620ctagtgaaag accttatgag tgtgaagaat gtggaaaatg ttttagtcaa aagggcaccc 1680taactgaaca tcatcgagtt cacactagag aacgacctta tgagtgtgga gaatgtggga 1740aatcttttag tcgaaaggga caccttagga accatcagcg gggtcacact ggagaaagac 1800cttacgagtg tggagaatgt gggaaatctt ttagtcgaaa gggcaacctc attcagcatc 1860agcgaagcca cactggagaa aggccttatg agtgtagaga gtgtaggaaa ttatttaggg 1920gcaagtccca cctcattgaa caccagagag ttcacactgg agaaaggcca tatgaatgta 1980atgaatgtgg gaaatcattt caagacagct ctgggtttcg tgttcatcag agagttcaca 2040ctggagaaaa accgtttgag tgtagtgaat gtgggaagtc atttcctcaa agctgttccc 2100tccttcgaca tcggagagtt catactggag aaaggcctta tgaatgtgga gaatgtggaa 2160agtcatttca tcagagctct tccctccttc gacatcagaa aactcacact gcagaaagac 2220cttatgagtg cagagaatgt gggaaattct tctccagtct ccttgaacac aggagagttc 2280acactggaga aaggccttat gaatgcaggg aatgtggaaa aacatttact cgaaggtctg 2340cgcattttaa acatcagaga cttcatactc gaggaaagcc ttacgagtgc agcgaatgtg 2400ggaaatcctt tgctgaaacc ttcagtctta ctgaacacag gagagtacac actggagaaa 2460ggccttatga gtgcagtgaa tgtggaaaat catttcatcg aagctcttct ctccttcgac 2520atcagagagt tcacacagaa agaagtcctt acaagtgaaa agaaatttgg gaaattcttt 2580agctaaacct ctgtgcatct tcttgatcag agggttctta ctggatcagg accttatgag 2640tgtgacaaac gtgggatatt ctttatgcag aagtcttgtt ttattacata cagaagagct 2700cccactgcag aagggcctct tgagtgcgat gaatgtgaga aagccttctg ccttctgtca 2760ttggataaca gattgttctc ataaggaaaa cactgtacac gtacaggaaa tattatttct 2820tgtaaaacat aacactggag gagatgcctt atgacggagc catctgccta aattggcata 2880ccttcagcat ctgcataaac tcaattatgt tggagctgtg tggcattttt caccctgccg 2940ggttcccttg ccagacatga tgtcggttat ctggcaaaag ccattttatg tcggccacga 3000ggcaggtgtt cactgtgcat cattcattca ccccatgatg ttctggaagt aaaccttggt 3060tgtctttcgt tggccagagg aattgaatgt ccgtctgtct gcccaggctg gagtggcacg 3120atctcagctc actgcagcct ccacctccag ggttcaagcg attctcctgc ctcagcctcc 3180cggggatctc aagggcattt ccctttgccc acctcgcctt ttcatatttg gtaaactgta 3240tgcatttgcc tccagcccaa gattataaat atgaactgat tatgatctgc atgttctctc 3300tttgggttca agcatttcct tacagaagag ccaccgtgga agtcatgggt aaatatgtgt 3360tgaattggta actccctctt ggagaatttc ttgtgaatta cacagcaata ggggaactca 3420tttaactgga gacataatct caatttgtaa agtgtggccc attttctaac atttttattt 3480tgcataccct cccctctctt ctcgattgat gaaactaaca aagaggttaa taaaagccca 3540tctcgtcatg t 3551173551DNAHomo sapiensCDS(465)..(2558) 17ctctggtagc gaccattttg gttaatgttg ggtgtgtttc tgcggtttgt gaggtgagag 60gcgctggagc tatgggtccg aaccgcggtg tctgaaccca gaaggtgaag agtccttctt 120gctgcacaga ggcagatcct aggccccgta acggcgcccg ccgctcccgg cagtgctttc 180cccgcgtact cgggatggcg gcggccgcgc tgaggctccc ggctcaggca tcatctggct 240gcaaagaaga gaacacactg tgtttgaggg aggaggaagg aggatcagag tttaaactcc 300tgccataatg cagaccattc tccataacaa gggagtggga tttaaatgac tctaggtgga 360cagaggaaag tcgtcattgt agaggggtca ccaagtgcaa cacagaagtg gaatagggcc 420atggatatct aaagccccac ctggttctgt ggggatgtgg agcc atg caa gaa ggc 476 Met Gln Glu Gly 1ctt cat aga atg act tgg gac tgc att gct att ggg agc cat gag aat 524Leu His Arg Met Thr Trp Asp Cys Ile Ala Ile Gly Ser His Glu Asn5 10 15 20tct gtg cag ggc act gtg gca ttt gaa gat gtg gct gtg aac ttt tcc 572Ser Val Gln Gly Thr Val Ala Phe Glu Asp Val Ala Val Asn Phe Ser 25 30 35cag gag gag tgg agt ctc ctt agt gag gtt cag aga tgc ctt tac cat 620Gln Glu Glu Trp Ser Leu Leu Ser Glu Val Gln Arg Cys Leu Tyr His 40 45 50gac gtg atg ctg gag aac tgg gta ctt ata tcc tcc ctg ggt tgt tgg 668Asp Val Met Leu Glu Asn Trp Val Leu Ile Ser Ser Leu Gly Cys Trp 55 60 65tgt gga tca gaa gat gag gag gca cct tct aag aag agc att tct ata 716Cys Gly Ser Glu Asp Glu Glu Ala Pro Ser Lys Lys Ser Ile Ser Ile 70 75 80caa aga gtg tct cag gtc agc act cct ggg gca ggt gtg tct ccc aag 764Gln Arg Val Ser Gln Val Ser Thr Pro Gly Ala Gly Val Ser Pro Lys85 90 95 100aag gct cac tct tgt gaa atg tgt ggc gcg atc ttg gga gac att ttg 812Lys Ala His Ser Cys Glu Met Cys Gly Ala Ile Leu Gly Asp Ile Leu 105 110 115cac ttg gca gat cat cag ggg aca cat cac aag cag aaa ctg cac agg 860His Leu Ala Asp His Gln Gly Thr His His Lys Gln Lys Leu His Arg 120 125 130tgt gag gca tgg ggg aat aaa ttg tat gat agt tca aac cgt ccg cac 908Cys Glu Ala Trp Gly Asn Lys Leu Tyr Asp Ser Ser Asn Arg Pro His 135 140 145cag aat cag tac ctt gga gag aaa ccc tat aga agc agt gtt gag gaa 956Gln Asn Gln Tyr Leu Gly Glu Lys Pro Tyr Arg Ser Ser Val Glu Glu 150 155 160gca ttg ttt gtg aag agg tgt aag ttc cat gtg tca gag gag tca tct 1004Ala Leu Phe Val Lys Arg Cys Lys Phe His Val Ser Glu Glu Ser Ser165 170 175 180atc ttc att cag agt gga aag gac ttt ttg ccc agc tca gga tta ctg 1052Ile Phe Ile Gln Ser Gly Lys Asp Phe Leu Pro Ser Ser Gly Leu Leu 185 190 195ctg cag gag gcc act cac act ggg gag aag tca aac agc aaa cct gag 1100Leu Gln Glu Ala Thr His Thr Gly Glu Lys Ser Asn Ser Lys Pro Glu 200 205 210tgt gag tct ccc ttt cag tgg gga gat act cgt tac agc tgt gga gaa 1148Cys Glu Ser Pro Phe Gln Trp Gly Asp Thr Arg Tyr Ser Cys Gly Glu 215 220 225tgc atg aaa cat tct agc acc aaa cac gta ttt gtt caa cag cag aga 1196Cys Met Lys His Ser Ser Thr Lys His Val Phe Val Gln Gln Gln Arg 230 235 240ctt ccc tct aga gag gaa tgt tat tgc tgg gaa tgt ggg aaa tcc ttt 1244Leu Pro Ser Arg Glu Glu Cys Tyr Cys Trp Glu Cys Gly Lys Ser Phe245 250 255 260agc aaa tat gat agc gtc agt aat cat cag aga gtt cac act ggg aaa 1292Ser Lys Tyr Asp Ser Val Ser Asn His Gln Arg Val His Thr Gly Lys 265 270 275aga cct tat gaa tgt gga gaa tgt ggg aaa tct ttt agt cat aag ggc 1340Arg Pro Tyr Glu Cys Gly Glu Cys Gly Lys Ser Phe Ser His Lys Gly 280 285 290agc ctt gtt cag cat cag cga gtt cac act ggg aaa aga cct tat gaa 1388Ser Leu Val Gln His Gln Arg Val His Thr Gly Lys Arg Pro Tyr Glu 295 300 305tgt gga gaa tgt ggg aaa tct ttt agt cat aag ggc agc ctt gtt cag 1436Cys Gly Glu Cys Gly Lys Ser Phe Ser His Lys Gly Ser Leu Val Gln 310 315 320cat cag cga gtt cat act gga gaa aga cct tat gag tgt gga gaa tgt 1484His Gln Arg Val His Thr Gly Glu Arg Pro Tyr Glu Cys Gly Glu Cys325 330 335 340ggg aaa tct ttt agt caa aat ggt act ctc att aaa cat caa cga gtt 1532Gly Lys Ser Phe Ser Gln Asn Gly Thr Leu Ile Lys His Gln Arg Val 345 350 355cac act gga gaa aga cct tat gag tgt gaa gaa tgt ggg aaa tgt ttt 1580His Thr Gly Glu Arg Pro Tyr Glu Cys Glu Glu Cys Gly Lys Cys Phe 360 365 370act cag aag ggc aat ctc att caa cat caa cga ggt cac act agt gaa 1628Thr Gln Lys Gly Asn Leu Ile Gln His Gln Arg Gly His Thr Ser Glu 375 380 385aga cct tat gag tgt gaa gaa tgt gga aaa tgt ttt agt caa aag ggc 1676Arg Pro Tyr Glu Cys Glu Glu Cys Gly Lys Cys Phe Ser Gln Lys Gly 390 395 400acc cta act gaa cat cat cga gtt cac act aga gaa cga cct tat gag 1724Thr Leu Thr Glu His His Arg Val His Thr Arg Glu Arg Pro Tyr Glu405 410 415 420tgt gga gaa tgt ggg aaa tct ttt agt cga aag gga cac ctt agg aac 1772Cys Gly Glu Cys Gly Lys Ser Phe Ser Arg Lys Gly His Leu Arg Asn 425 430 435cat cag cgg ggt cac act gga gaa aga cct tac gag tgt gga gaa tgt 1820His Gln Arg Gly His Thr Gly Glu Arg Pro Tyr Glu Cys Gly Glu Cys 440 445 450ggg aaa tct ttt agt cga aag ggc aac ctc att cag cat cag cga agc 1868Gly Lys Ser Phe Ser Arg Lys Gly Asn Leu Ile Gln His Gln Arg Ser 455 460 465cac act gga gaa agg cct tat gag tgt aga gag tgt agg aaa tta ttt 1916His Thr Gly Glu Arg Pro Tyr Glu Cys Arg Glu Cys Arg Lys Leu Phe 470 475 480agg ggc aag tcc cac ctc att gaa cac cag aga gtt cac act gga gaa 1964Arg Gly Lys Ser His Leu Ile Glu His Gln Arg Val His Thr Gly Glu485 490 495 500agg cca tat gaa tgt aat gaa tgt ggg aaa tca ttt caa gac agc tct 2012Arg Pro Tyr Glu Cys Asn Glu Cys Gly Lys Ser Phe Gln Asp Ser Ser 505 510 515ggg ttt cgt gtt cat cag aga gtt cac act gga gaa aaa ccg ttt gag 2060Gly Phe Arg Val His Gln Arg Val His Thr Gly Glu Lys Pro Phe Glu 520 525 530tgt agt gaa tgt ggg aag tca ttt cct caa agc tgt tcc ctc ctt cga 2108Cys Ser Glu Cys Gly Lys Ser Phe Pro Gln Ser Cys Ser Leu Leu Arg 535 540 545cat cgg aga gtt cat act gga gaa agg cct tat gaa tgt gga gaa tgt 2156His Arg Arg Val His Thr Gly Glu Arg Pro Tyr Glu Cys Gly Glu Cys 550 555 560gga aag tca ttt cat cag agc tct tcc ctc ctt cga cat cag aaa act 2204Gly Lys Ser Phe His Gln Ser Ser Ser Leu Leu Arg His Gln Lys Thr565 570 575 580cac act gca gaa aga cct tat gag tgc aga gaa tgt ggg aaa ttc ttc 2252His Thr Ala Glu Arg Pro Tyr Glu Cys Arg Glu Cys Gly Lys Phe Phe 585 590 595tcc agt ctc ctt gaa cac agg aga gtt cac act gga gaa agg cct tat 2300Ser Ser Leu Leu Glu His Arg Arg Val His Thr Gly Glu Arg Pro Tyr 600 605 610gaa tgc agg gaa tgt gga aaa aca ttt act cga agg tct gcg cat ttt 2348Glu Cys Arg Glu Cys Gly Lys Thr Phe Thr Arg Arg Ser Ala His Phe 615 620 625aaa cat cag aga ctt cat act cga gga aag cct tac gag tgc agc gaa 2396Lys His Gln Arg Leu His Thr Arg Gly Lys Pro Tyr Glu Cys Ser Glu 630 635 640tgt ggg aaa tcc ttt gct gaa acc ttc agt ctt act gaa cac agg aga 2444Cys Gly Lys Ser Phe Ala Glu Thr Phe Ser Leu Thr Glu His Arg Arg645 650 655 660gta cac act gga gaa agg cct tat gag tgc agt gaa tgt gga aaa tca 2492Val His Thr Gly Glu Arg Pro Tyr Glu Cys Ser Glu Cys Gly Lys Ser 665 670 675ttt cat cga agc tct tct ctc ctt cga cat cag aga gtt cac aca gaa 2540Phe His Arg Ser Ser Ser Leu Leu Arg His Gln Arg Val His Thr Glu 680 685 690aga agt cct tac aag tga aaagaaattt gggaaattct ttagctaaac 2588Arg Ser Pro Tyr Lys 695ctctgtgcat cttcttgatc agagggttct tactggatca ggaccttatg agtgtgacaa 2648acgtgggata ttctttatgc agaagtcttg ttttattaca tacagaagag ctcccactgc 2708agaagggcct cttgagtgcg atgaatgtga gaaagccttc tgccttctgt cattggataa 2768cagattgttc tcataaggaa aacactgtac acgtacagga aatattattt cttgtaaaac 2828ataacactgg aggagatgcc ttatgacgga gccatctgcc taaattggca taccttcagc 2888atctgcataa actcaattat gttggagctg tgtggcattt ttcaccctgc cgggttccct 2948tgccagacat gatgtcggtt atctggcaaa agccatttta tgtcggccac gaggcaggtg 3008ttcactgtgc atcattcatt caccccatga tgttctggaa gtaaaccttg gttgtctttc 3068gttggccaga ggaattgaat gtccgtctgt ctgcccaggc tggagtggca cgatctcagc 3128tcactgcagc ctccacctcc agggttcaag cgattctcct gcctcagcct cccggggatc 3188tcaagggcat ttccctttgc ccacctcgcc ttttcatatt tggtaaactg tatgcatttg 3248cctccagccc aagattataa atatgaactg attatgatct gcatgttctc tctttgggtt 3308caagcatttc cttacagaag agccaccgtg gaagtcatgg gtaaatatgt gttgaattgg 3368taactccctc ttggagaatt tcttgtgaat tacacagcaa taggggaact catttaactg 3428gagacataat ctcaatttgt aaagtgtggc ccattttcta acatttttat tttgcatacc 3488ctcccctctc ttctcgattg atgaaactaa caaagaggtt aataaaagcc catctcgtca 3548tgt 355118697PRTHomo sapiens 18Met Gln Glu Gly Leu His Arg Met Thr Trp Asp Cys Ile Ala Ile Gly1 5 10 15Ser His Glu Asn Ser Val Gln Gly Thr Val Ala Phe Glu Asp Val Ala 20 25 30Val Asn Phe Ser Gln Glu Glu Trp Ser Leu Leu Ser Glu Val Gln Arg 35 40 45Cys Leu Tyr His Asp Val Met Leu Glu Asn Trp Val Leu Ile Ser Ser 50 55 60Leu Gly Cys Trp Cys Gly Ser Glu Asp Glu Glu Ala Pro Ser Lys Lys65 70 75 80Ser Ile Ser Ile Gln Arg Val Ser Gln Val Ser Thr Pro Gly Ala Gly 85 90 95Val Ser Pro Lys Lys Ala His Ser Cys Glu Met Cys Gly Ala Ile Leu 100 105 110Gly Asp Ile Leu His Leu Ala Asp His Gln Gly Thr His His Lys Gln 115 120 125Lys Leu His Arg Cys Glu Ala Trp Gly Asn Lys Leu Tyr Asp Ser Ser 130 135 140Asn Arg Pro His Gln Asn Gln Tyr Leu Gly Glu Lys Pro Tyr Arg Ser145 150 155 160Ser Val Glu Glu Ala Leu Phe Val Lys Arg Cys Lys Phe His Val Ser 165 170 175Glu Glu Ser Ser Ile Phe Ile Gln Ser Gly Lys Asp Phe Leu Pro Ser 180 185 190Ser Gly Leu Leu Leu Gln Glu Ala Thr His Thr Gly Glu Lys Ser Asn 195 200 205Ser Lys Pro Glu Cys Glu Ser Pro Phe Gln Trp Gly Asp Thr Arg Tyr 210 215 220Ser Cys Gly Glu Cys Met Lys His Ser Ser Thr Lys His Val Phe Val225 230 235 240Gln Gln Gln Arg Leu Pro Ser Arg Glu Glu Cys Tyr Cys Trp Glu Cys 245 250 255Gly Lys Ser Phe Ser Lys Tyr Asp Ser Val Ser Asn His Gln Arg Val 260 265 270His Thr Gly Lys Arg Pro Tyr Glu Cys Gly Glu Cys Gly Lys Ser Phe 275 280 285Ser His Lys Gly Ser Leu Val Gln His Gln Arg Val His Thr Gly Lys 290 295 300Arg Pro Tyr Glu Cys Gly Glu Cys Gly Lys Ser Phe Ser His Lys Gly305 310 315 320Ser Leu Val Gln His Gln Arg Val His Thr Gly Glu Arg Pro Tyr Glu 325 330 335Cys Gly Glu Cys Gly Lys Ser Phe Ser Gln Asn Gly Thr Leu Ile Lys 340 345 350His Gln Arg Val His Thr Gly Glu Arg Pro Tyr Glu Cys Glu Glu Cys 355 360 365Gly Lys Cys Phe Thr Gln Lys Gly Asn Leu Ile Gln His Gln Arg Gly 370 375 380His Thr Ser Glu Arg Pro Tyr Glu Cys Glu Glu Cys Gly Lys Cys Phe385 390 395 400Ser Gln Lys Gly Thr Leu Thr Glu His His Arg Val His Thr Arg Glu 405 410 415Arg Pro Tyr Glu Cys Gly Glu Cys Gly Lys Ser Phe Ser Arg Lys Gly 420

425 430His Leu Arg Asn His Gln Arg Gly His Thr Gly Glu Arg Pro Tyr Glu 435 440 445Cys Gly Glu Cys Gly Lys Ser Phe Ser Arg Lys Gly Asn Leu Ile Gln 450 455 460His Gln Arg Ser His Thr Gly Glu Arg Pro Tyr Glu Cys Arg Glu Cys465 470 475 480Arg Lys Leu Phe Arg Gly Lys Ser His Leu Ile Glu His Gln Arg Val 485 490 495His Thr Gly Glu Arg Pro Tyr Glu Cys Asn Glu Cys Gly Lys Ser Phe 500 505 510Gln Asp Ser Ser Gly Phe Arg Val His Gln Arg Val His Thr Gly Glu 515 520 525Lys Pro Phe Glu Cys Ser Glu Cys Gly Lys Ser Phe Pro Gln Ser Cys 530 535 540Ser Leu Leu Arg His Arg Arg Val His Thr Gly Glu Arg Pro Tyr Glu545 550 555 560Cys Gly Glu Cys Gly Lys Ser Phe His Gln Ser Ser Ser Leu Leu Arg 565 570 575His Gln Lys Thr His Thr Ala Glu Arg Pro Tyr Glu Cys Arg Glu Cys 580 585 590Gly Lys Phe Phe Ser Ser Leu Leu Glu His Arg Arg Val His Thr Gly 595 600 605Glu Arg Pro Tyr Glu Cys Arg Glu Cys Gly Lys Thr Phe Thr Arg Arg 610 615 620Ser Ala His Phe Lys His Gln Arg Leu His Thr Arg Gly Lys Pro Tyr625 630 635 640Glu Cys Ser Glu Cys Gly Lys Ser Phe Ala Glu Thr Phe Ser Leu Thr 645 650 655Glu His Arg Arg Val His Thr Gly Glu Arg Pro Tyr Glu Cys Ser Glu 660 665 670Cys Gly Lys Ser Phe His Arg Ser Ser Ser Leu Leu Arg His Gln Arg 675 680 685Val His Thr Glu Arg Ser Pro Tyr Lys 690 69519220DNAHomo sapiens 19accattctcc ataacaaggg agtgggattt aaatgactct aggtggacag aggaaagtcg 60tcattgtaga ggggtcacca agtgcaacac agaagtggaa tagggccatg gatatctaaa 120gccccacctg gttctgtggg gatgtggagc catgcaagaa ggccttcata gaatgacttg 180ggactgcatt gctattggga gccatgagaa ttctgtgcag 2202023PRTHomo sapiens 20Met Gln Glu Gly Leu His Arg Met Thr Trp Asp Cys Ile Ala Ile Gly1 5 10 15Ser His Glu Asn Ser Val Gln 202169DNAHomo sapiens 21atgcaagaag gccttcatag aatgacttgg gactgcattg ctattgggag ccatgagaat 60tctgtgcag 692223PRTHomo sapiens 22Met Gln Glu Gly Leu His Arg Met Thr Trp Asp Cys Ile Ala Ile Gly1 5 10 15Ser His Glu Asn Ser Val Gln 202318DNAArtificial Sequenceforward primer used for specifically detecting the variant of the present invention 23gtgctttccc cgcgtact 182422DNAArtificial Sequencereverse primer used for specifically detecting the variant of the present invention 24aggagtgctg acctgagaca ct 2225138DNAArtificial Sequencepartial polynucleotide specific to the variant of the present invention, which is obtained by PCR using forward primer (SEQ ID NO23) and reverse primer (SEQ ID NO24) 25gtgctttccc cgcgtactcg ggatggcggc ggccgcgctg aggctcccgg ctcaggttgt 60tggtgtggat cagaagatga ggaggcacct tctaagaaga gcatttctat acaaagagtg 120tctcaggtca gcactcct 1382622DNAArtificial Sequenceprobe used for detecting the variants of the present invention 26ctcccggctc aggttgttgg tg 222723DNAArtificial Sequenceforward primer used for specifically detecting the variant of the present invention 27gatatctaaa gccccacctg gtt 232821DNAArtificial Sequencereverse primer used for specifically detecting the variant of the present invention 28acagtgccct gcacagaatt c 2129118DNAArtificial Sequencepartial polynucleotide specific to the variant of the present invention, which is obtained by PCR using forward primer (SEQ ID NO27) and reverse primer (SEQ ID NO28) 29gatatctaaa gccccacctg gttctgtggg gatgtggagc catgcaagaa ggccttcata 60gaatgacttg ggactgcatt gctattggga gccatgagaa ttctgtgcag ggcactgt 1183028DNAArtificial Sequenceprobe used for detecting the variants of the present invention 30atgacttggg actgcattgc tattggga 283122DNAArtificial Sequenceforward primer used for specifically detecting the known variant of the gene of the present invention 31ggaggaagga ggatcagagt tt 223220DNAArtificial Sequencereverse primer used for specifically detecting the known variant of the gene of the present invention 32agccacatct tcaaatgcca 203369DNAArtificial Sequencepartial polynucleotide specific to the known variant of the gene of the present invention, which is obtained by PCR using forward primer (SEQ ID NO31) and reverse primer (SEQ ID NO32) 33ggaggaagga ggatcagagt ttaaactcct gccataatgc agggcactgt ggcatttgaa 60gatgtggct 693425DNAArtificial Sequenceprobe used for specifically detecting the known variant of the gene of the present invention 34aactcctgcc ataatgcagg gcact 253524DNAArtificial Sequenceforward primer used for detecting the variants of the gene of the present invention 35ttccatgtgt cagaggagtc atct 243624DNAArtificial Sequencereverse primer used for detecting the variants of the gene of the present invention 36cactcaggtt tgctgtttga cttc 2437122DNAArtificial Sequencepartial polynucleotide common to the variants of the gene of the present invention, which is obtained by PCR using forward primer (SEQ ID NO35) and reverse primer (SEQ ID NO36) 37ttccatgtgt cagaggagtc atctatcttc attcagagtg gaaaggactt tttgcccagc 60tcaggattac tgctgcagga ggccactcac actggggaga agtcaaacag caaacctgag 120tg 1223827DNAArtificial Sequenceprobe used for detecting the variants of the gene of the present invention 38ttttgcccag ctcaggatta ctgctgc 2739306DNAHomo sapiens 39ggtagcgacc attttggtta atgttgggtg tgtttctgcg gtttgtgagg tgagaggcgc 60tggagctatg ggtccgaacc gcggtgtctg aacccagaag gtgaagagtc cttcttgctg 120cacagaggca gatcttaggc cccgtaacgg cgcccgccgc tcccggcagt gctttccccg 180cgtactcggg atggcggcgg ccgcgctgag gctcccggct caggttgttg gtgtggatca 240gaagatgagg aggcaccttc taagaagagc atttctatac aaagagtgtc tcaggtcagc 300actcct 30640541DNAHomo sapiens 40ctctggtagc gaccattttg gttaatgttg ggtgtgtttc tgcggtttgt gaggtgagag 60gcgctggagc tatgggtccg aaccgcggtg tctgaaccca gaaggtgaag agtccttctt 120gctgcacaga ggcagatcct aggccccgta acggcgcccg ccgctcccgg cagtgctttc 180cccgcgtact cgggatggcg gcggccgcgc tgaggctccc ggctcaggca tcatctggct 240gcaaagaaga gaacacactg tgtttgaggg aggaggaagg aggatcagag tttaaactcc 300tgccataatg cagaccattc tccataacaa gggagtggga tttaaatgac tctaggtgga 360cagaggaaag tcgtcattgt agaggggtca ccaagtgcaa cacagaagtg gaatagggcc 420atggatatct aaagccccac ctggttctgt ggggatgtgg agccatgcaa gaaggccttc 480atagaatgac ttgggactgc attgctattg ggagccatga gaattctgtg cagggcactg 540t 541411573DNAHomo sapiens 41aaaaaaaaac tgccacggag ccccagcggc tacacactcc actcactcac acctctcagg 60gccccgcacg ttcccacagc cctcagggtg tacgtgctgt aggtgtggct gtgccccaga 120gtgccagctg atgaagacgg agcggccccg gcccaacacc ttcatcatcc gctgcctgca 180gtggaccact gtcatcgaac gcaccttcca tgtggagact cctgaggagc gggaggagtg 240gacaaccgcc atccagactg tggctgacgg cctcaagaag caggaggagg aggagatgga 300cttccggtcg ggctcaccca gtgacaactc aggggctgaa gagatggagg tgtccctggc 360caagcccaag caccgcgtga ccatgaacga gtttgagtac ctgaagctgc tgggcaaggg 420cactttcggc aaggtgatcc tggtgaagga gaaggccaca ggccgctact acgccatgaa 480gatcctcaag aaggaagtca tcgtggccaa ggacgaggtg gcccacacac tcaccgagaa 540ccgcgtcctg cagaactcca ggcacccctt cctcacagcc ctgaagtact ctttccagac 600ccacgaccgc ctctgctttg tcatggagta cgccaacggg ggcgagctgt tcttccacct 660gtcccgggaa cgtgtgttct ccgaggaccg ggcccgcttc tatggcgctg agattgtgtc 720agccctggac tacctgcact cggagaagaa cgtggtgtac cgggacctca agctggagaa 780cctcatgctg gacaaggacg ggcacattaa gatcacagac ttcgggctgt gcaaggaggg 840gatcaaggac ggtgccacca tgaagacctt ttgcggcaca cctgagtacc tggcccccga 900ggtgctggag gacaatgact acggccgtgc agtggactgg tgggggctgg gcgtggtcat 960gtacgagatg atgtgcggtc gcctgccctt ctacaaccag gaccatgaga agctttttga 1020gctcatcctc atggaggaga tccgcttccc gcgcacgctt ggtcccgagg ccaagtcctt 1080gctttcaggg ctgctcaaga aggaccccaa gcagaggctt ggcgggggct ccgaggacgc 1140caaggagatc atgcagcatc gcttctttgc cggtatcgtg tggcagcacg tgtacgagaa 1200gaagctcagc ccacccttca agccccaggt cacgtcggag actgacacca ggtattttga 1260tgaggagttc acggcccaga tgatcaccat cacaccacct gaccaagatg acagcatgga 1320gtgtgtggac agcgagcgca ggccccactt cccccagttc tcctactcgg ccagcggcac 1380ggcctgaggc ggcggtggac tgcgctggac gatagcttgg agggatggag aggcggcctc 1440gtgccatgat ctgtatttaa tggtttttat ttctcgggtg catttgagag aagccacgct 1500gtcctctcga gcccagatgg aaagacgttt ttgtgctgtg ggcagcaccc tcccccgcag 1560cggggtaggg aag 1573421573DNAHomo sapiensCDS(131)..(1387) 42aaaaaaaaac tgccacggag ccccagcggc tacacactcc actcactcac acctctcagg 60gccccgcacg ttcccacagc cctcagggtg tacgtgctgt aggtgtggct gtgccccaga 120gtgccagctg atg aag acg gag cgg ccc cgg ccc aac acc ttc atc atc 169 Met Lys Thr Glu Arg Pro Arg Pro Asn Thr Phe Ile Ile 1 5 10cgc tgc ctg cag tgg acc act gtc atc gaa cgc acc ttc cat gtg gag 217Arg Cys Leu Gln Trp Thr Thr Val Ile Glu Arg Thr Phe His Val Glu 15 20 25act cct gag gag cgg gag gag tgg aca acc gcc atc cag act gtg gct 265Thr Pro Glu Glu Arg Glu Glu Trp Thr Thr Ala Ile Gln Thr Val Ala30 35 40 45gac ggc ctc aag aag cag gag gag gag gag atg gac ttc cgg tcg ggc 313Asp Gly Leu Lys Lys Gln Glu Glu Glu Glu Met Asp Phe Arg Ser Gly 50 55 60tca ccc agt gac aac tca ggg gct gaa gag atg gag gtg tcc ctg gcc 361Ser Pro Ser Asp Asn Ser Gly Ala Glu Glu Met Glu Val Ser Leu Ala 65 70 75aag ccc aag cac cgc gtg acc atg aac gag ttt gag tac ctg aag ctg 409Lys Pro Lys His Arg Val Thr Met Asn Glu Phe Glu Tyr Leu Lys Leu 80 85 90ctg ggc aag ggc act ttc ggc aag gtg atc ctg gtg aag gag aag gcc 457Leu Gly Lys Gly Thr Phe Gly Lys Val Ile Leu Val Lys Glu Lys Ala 95 100 105aca ggc cgc tac tac gcc atg aag atc ctc aag aag gaa gtc atc gtg 505Thr Gly Arg Tyr Tyr Ala Met Lys Ile Leu Lys Lys Glu Val Ile Val110 115 120 125gcc aag gac gag gtg gcc cac aca ctc acc gag aac cgc gtc ctg cag 553Ala Lys Asp Glu Val Ala His Thr Leu Thr Glu Asn Arg Val Leu Gln 130 135 140aac tcc agg cac ccc ttc ctc aca gcc ctg aag tac tct ttc cag acc 601Asn Ser Arg His Pro Phe Leu Thr Ala Leu Lys Tyr Ser Phe Gln Thr 145 150 155cac gac cgc ctc tgc ttt gtc atg gag tac gcc aac ggg ggc gag ctg 649His Asp Arg Leu Cys Phe Val Met Glu Tyr Ala Asn Gly Gly Glu Leu 160 165 170ttc ttc cac ctg tcc cgg gaa cgt gtg ttc tcc gag gac cgg gcc cgc 697Phe Phe His Leu Ser Arg Glu Arg Val Phe Ser Glu Asp Arg Ala Arg 175 180 185ttc tat ggc gct gag att gtg tca gcc ctg gac tac ctg cac tcg gag 745Phe Tyr Gly Ala Glu Ile Val Ser Ala Leu Asp Tyr Leu His Ser Glu190 195 200 205aag aac gtg gtg tac cgg gac ctc aag ctg gag aac ctc atg ctg gac 793Lys Asn Val Val Tyr Arg Asp Leu Lys Leu Glu Asn Leu Met Leu Asp 210 215 220aag gac ggg cac att aag atc aca gac ttc ggg ctg tgc aag gag ggg 841Lys Asp Gly His Ile Lys Ile Thr Asp Phe Gly Leu Cys Lys Glu Gly 225 230 235atc aag gac ggt gcc acc atg aag acc ttt tgc ggc aca cct gag tac 889Ile Lys Asp Gly Ala Thr Met Lys Thr Phe Cys Gly Thr Pro Glu Tyr 240 245 250ctg gcc ccc gag gtg ctg gag gac aat gac tac ggc cgt gca gtg gac 937Leu Ala Pro Glu Val Leu Glu Asp Asn Asp Tyr Gly Arg Ala Val Asp 255 260 265tgg tgg ggg ctg ggc gtg gtc atg tac gag atg atg tgc ggt cgc ctg 985Trp Trp Gly Leu Gly Val Val Met Tyr Glu Met Met Cys Gly Arg Leu270 275 280 285ccc ttc tac aac cag gac cat gag aag ctt ttt gag ctc atc ctc atg 1033Pro Phe Tyr Asn Gln Asp His Glu Lys Leu Phe Glu Leu Ile Leu Met 290 295 300gag gag atc cgc ttc ccg cgc acg ctt ggt ccc gag gcc aag tcc ttg 1081Glu Glu Ile Arg Phe Pro Arg Thr Leu Gly Pro Glu Ala Lys Ser Leu 305 310 315ctt tca ggg ctg ctc aag aag gac ccc aag cag agg ctt ggc ggg ggc 1129Leu Ser Gly Leu Leu Lys Lys Asp Pro Lys Gln Arg Leu Gly Gly Gly 320 325 330tcc gag gac gcc aag gag atc atg cag cat cgc ttc ttt gcc ggt atc 1177Ser Glu Asp Ala Lys Glu Ile Met Gln His Arg Phe Phe Ala Gly Ile 335 340 345gtg tgg cag cac gtg tac gag aag aag ctc agc cca ccc ttc aag ccc 1225Val Trp Gln His Val Tyr Glu Lys Lys Leu Ser Pro Pro Phe Lys Pro350 355 360 365cag gtc acg tcg gag act gac acc agg tat ttt gat gag gag ttc acg 1273Gln Val Thr Ser Glu Thr Asp Thr Arg Tyr Phe Asp Glu Glu Phe Thr 370 375 380gcc cag atg atc acc atc aca cca cct gac caa gat gac agc atg gag 1321Ala Gln Met Ile Thr Ile Thr Pro Pro Asp Gln Asp Asp Ser Met Glu 385 390 395tgt gtg gac agc gag cgc agg ccc cac ttc ccc cag ttc tcc tac tcg 1369Cys Val Asp Ser Glu Arg Arg Pro His Phe Pro Gln Phe Ser Tyr Ser 400 405 410gcc agc ggc acg gcc tga ggcggcggtg gactgcgctg gacgatagct 1417Ala Ser Gly Thr Ala 415tggagggatg gagaggcggc ctcgtgccat gatctgtatt taatggtttt tatttctcgg 1477gtgcatttga gagaagccac gctgtcctct cgagcccaga tggaaagacg tttttgtgct 1537gtgggcagca ccctcccccg cagcggggta gggaag 157343418PRTHomo sapiens 43Met Lys Thr Glu Arg Pro Arg Pro Asn Thr Phe Ile Ile Arg Cys Leu1 5 10 15Gln Trp Thr Thr Val Ile Glu Arg Thr Phe His Val Glu Thr Pro Glu 20 25 30Glu Arg Glu Glu Trp Thr Thr Ala Ile Gln Thr Val Ala Asp Gly Leu 35 40 45Lys Lys Gln Glu Glu Glu Glu Met Asp Phe Arg Ser Gly Ser Pro Ser 50 55 60Asp Asn Ser Gly Ala Glu Glu Met Glu Val Ser Leu Ala Lys Pro Lys65 70 75 80His Arg Val Thr Met Asn Glu Phe Glu Tyr Leu Lys Leu Leu Gly Lys 85 90 95Gly Thr Phe Gly Lys Val Ile Leu Val Lys Glu Lys Ala Thr Gly Arg 100 105 110Tyr Tyr Ala Met Lys Ile Leu Lys Lys Glu Val Ile Val Ala Lys Asp 115 120 125Glu Val Ala His Thr Leu Thr Glu Asn Arg Val Leu Gln Asn Ser Arg 130 135 140His Pro Phe Leu Thr Ala Leu Lys Tyr Ser Phe Gln Thr His Asp Arg145 150 155 160Leu Cys Phe Val Met Glu Tyr Ala Asn Gly Gly Glu Leu Phe Phe His 165 170 175Leu Ser Arg Glu Arg Val Phe Ser Glu Asp Arg Ala Arg Phe Tyr Gly 180 185 190Ala Glu Ile Val Ser Ala Leu Asp Tyr Leu His Ser Glu Lys Asn Val 195 200 205Val Tyr Arg Asp Leu Lys Leu Glu Asn Leu Met Leu Asp Lys Asp Gly 210 215 220His Ile Lys Ile Thr Asp Phe Gly Leu Cys Lys Glu Gly Ile Lys Asp225 230 235 240Gly Ala Thr Met Lys Thr Phe Cys Gly Thr Pro Glu Tyr Leu Ala Pro 245 250 255Glu Val Leu Glu Asp Asn Asp Tyr Gly Arg Ala Val Asp Trp Trp Gly 260 265 270Leu Gly Val Val Met Tyr Glu Met Met Cys Gly Arg Leu Pro Phe Tyr 275 280 285Asn Gln Asp His Glu Lys Leu Phe Glu Leu Ile Leu Met Glu Glu Ile 290 295 300Arg Phe Pro Arg Thr Leu Gly Pro Glu Ala Lys Ser Leu Leu Ser Gly305 310 315 320Leu Leu Lys Lys Asp Pro Lys Gln Arg Leu Gly Gly Gly Ser Glu Asp 325 330 335Ala Lys Glu Ile Met Gln His Arg Phe Phe Ala Gly Ile Val Trp Gln 340 345 350His Val Tyr Glu Lys Lys Leu Ser Pro Pro Phe Lys Pro Gln Val Thr 355 360 365Ser

Glu Thr Asp Thr Arg Tyr Phe Asp Glu Glu Phe Thr Ala Gln Met 370 375 380Ile Thr Ile Thr Pro Pro Asp Gln Asp Asp Ser Met Glu Cys Val Asp385 390 395 400Ser Glu Arg Arg Pro His Phe Pro Gln Phe Ser Tyr Ser Ala Ser Gly 405 410 415Thr Ala44119DNAHomo sapiens 44aaaaaaaaac tgccacggag ccccagcggc tacacactcc actcactcac acctctcagg 60gccccgcacg ttcccacagc cctcagggtg tacgtgctgt aggtgtggct gtgccccag 11945130DNAHomo sapiens 45aaaaaaaaac tgccacggag ccccagcggc tacacactcc actcactcac acctctcagg 60gccccgcacg ttcccacagc cctcagggtg tacgtgctgt aggtgtggct gtgccccaga 120gtgccagctg 1304622DNAArtificial Sequenceforward primer used for specifically detecting the variant of the present invention 46tcagggtgta cgtgctgtag gt 224719DNAArtificial Sequencereverse primer used for specifically detecting the variant of the present invention 47acatggaagg tgcgttcga 1948131DNAArtificial Sequencepartial polynucleotide specific to the variant of the present invention, which is obtained by PCR using forward primer (SEQ ID NO46) and reverse primer (SEQ ID NO47) 48tcagggtgta cgtgctgtag gtgtggctgt gccccagagt gccagctgat gaagacggag 60cggccccggc ccaacacctt catcatccgc tgcctgcagt ggaccactgt catcgaacgc 120accttccatg t 1314918DNAArtificial Sequenceforward primer used for specifically detecting the known variant of the gene of the present invention 49ggttggctgc acaaacga 185021DNAArtificial Sequencereverse primer used for specifically detecting the known variant of the gene of the present invention 50gagcctcacg ttggtccaca t 2151124DNAArtificial Sequencepartial polynucleotide specific to the known variant of the gene of the present invention, which is obtained by PCR using forward primer (SEQ ID NO49) and reverse primer (SEQ ID NO50) 51ggttggctgc acaaacgagg ggagtacatc aagacctggc ggccacgcta cttcctcctc 60aagaatgatg gcaccttcat tggctacaag gagcggccgc aggatgtgga ccaacgtgag 120gctc 1245218DNAArtificial Sequenceforward primer used for detecting the variants of the gene of the present invention 52ccgcgtcctg cagaactc 185318DNAArtificial Sequencereverse primer used for detecting the variants of the gene of the present invention 53cccgttggcg tactccat 1854100DNAArtificial Sequencepartial polynucleotide common to the variants of the gene of the present invention, which is obtained by PCR using forward primer (SEQ ID NO52) and reverse primer (SEQ ID NO53) 54ccgcgtcctg cagaactcca ggcacccctt cctcacagcc ctgaagtact ctttccagac 60ccacgaccgc ctctgctttg tcatggagta cgccaacggg 10055213DNAHomo sapiens 55aaaaaaaaac tgccacggag ccccagcggc tacacactcc actcactcac acctctcagg 60gccccgcacg ttcccacagc cctcagggtg tacgtgctgt aggtgtggct gtgccccaga 120gtgccagctg atgaagacgg agcggccccg gcccaacacc ttcatcatcc gctgcctgca 180gtggaccact gtcatcgaac gcaccttcca tgt 213561800DNAHomo sapiens 56aataatgcta atacccagca ctcgggctcc acaatgtaga ggaaatggca tcgcctggca 60gtgacagcta tattgtgcgt gtcaaggctg tggttatgac cagagatgac tccagcgggg 120gatggttccc acgggaagga ggcgggatca gtcgcgtcgg ggtctgtaag gtcatgcacc 180ccgaaggcaa tggacgaagc ggctttctca tccatggtga acgacagaaa gacaaactgg 240tggtattgga atgctatgta agaaaggact tggtctacac caaagccaat ccaacgtttc 300atcactggaa ggtcgataat aggaagtttg gacttacttt ccaaagccct gctgatgccc 360gagcctttga caggggagta aggaaagcaa tcgaagacct tatagaaggt tcaacaacgt 420catcttccac catccataat gaagctgagc ttggcgatga tgacgttttt acaacagcta 480cagacagttc ttctaattcc tctcagaaga gagagcaacc tactcggaca atctcctctc 540ccacatcctg tgagcaccgg aggatttata ccctgggcca cctccacgac tcatacccca 600cagaccacta tcacctcgat cagccgatgc caaggcccta ccgccaggtg agcttcccgg 660acgacgacga ggagatcgtg cgcatcaacc cccgggagaa gatctggatg acggggtacg 720aggattaccg gcacgcaccc gtcaggggca agtacccgga cccctcggag gacgcggact 780cctcctacgt gcgcttcgcc aagggcgagg tccccaagca tgactacaac tacccctacg 840tggactcctc agactttggc ctaggcgagg accccaaagg ccgcgggggc agcgtgatca 900agacgcagcc ctcccggggc aagtcgcggc ggcggaagga ggacggagag cgctcgcggt 960gcgtgtactg cagggacatg ttcaaccacg aggagaaccg ccggggccac tgccaggacg 1020cgcccgactc cgtgagaact tgcatccgcc gggtgagctg catgtggtgc gcggacagca 1080tgctctatca ctgtatgtcg gaccccgagg gagactatac agacccttgc tcgtgcgata 1140ctagcgacga gaagttttgc ctccggtgga tggctcttat tgccttgtct ttcctggccc 1200cctgtatgtg ctgttacctg ccccttcggg cctgctacca ctgcggagtg atgtgcaggt 1260gctgtggcgg gaagcacaaa gcggccgcgt gactcagttt ccctcccttc tccctccatc 1320cgcagccaca ggggaactcg tctcttacat actctcatct tctcccccgc tcccttccac 1380tccaaggagc gaggagggca agcggcctcc cagctccctg gtacctcgag gcaccattcc 1440agccagggac gctgccgggt agactctcca ctccccctgc cgcccacact gcagcagcca 1500catccataca cacacgctcg cacagtgttc tgaggaagga accttcgcca cagactcctg 1560tactattaac aatctgtaac caagctaact gtctcatcca tgtgttgatt tcctgtttcc 1620tcctcccccg cctcttccag ttcaaaggag tctgcaattg gaactgctga ttttcggtgg 1680gttttgtagt tgatttttcc aagagcgtcg aagactctct ttctcttggt tcaccttgcc 1740tgtcgctagc aagcatctgg ttcagcggaa atgggatgtg agaatgatga aacccgacag 1800571800DNAHomo sapiensCDS(45)..(1292) 57aataatgcta atacccagca ctcgggctcc acaatgtaga ggaa atg gca tcg cct 56 Met Ala Ser Pro 1ggc agt gac agc tat att gtg cgt gtc aag gct gtg gtt atg acc aga 104Gly Ser Asp Ser Tyr Ile Val Arg Val Lys Ala Val Val Met Thr Arg5 10 15 20gat gac tcc agc ggg gga tgg ttc cca cgg gaa gga ggc ggg atc agt 152Asp Asp Ser Ser Gly Gly Trp Phe Pro Arg Glu Gly Gly Gly Ile Ser 25 30 35cgc gtc ggg gtc tgt aag gtc atg cac ccc gaa ggc aat gga cga agc 200Arg Val Gly Val Cys Lys Val Met His Pro Glu Gly Asn Gly Arg Ser 40 45 50ggc ttt ctc atc cat ggt gaa cga cag aaa gac aaa ctg gtg gta ttg 248Gly Phe Leu Ile His Gly Glu Arg Gln Lys Asp Lys Leu Val Val Leu 55 60 65gaa tgc tat gta aga aag gac ttg gtc tac acc aaa gcc aat cca acg 296Glu Cys Tyr Val Arg Lys Asp Leu Val Tyr Thr Lys Ala Asn Pro Thr 70 75 80ttt cat cac tgg aag gtc gat aat agg aag ttt gga ctt act ttc caa 344Phe His His Trp Lys Val Asp Asn Arg Lys Phe Gly Leu Thr Phe Gln85 90 95 100agc cct gct gat gcc cga gcc ttt gac agg gga gta agg aaa gca atc 392Ser Pro Ala Asp Ala Arg Ala Phe Asp Arg Gly Val Arg Lys Ala Ile 105 110 115gaa gac ctt ata gaa ggt tca aca acg tca tct tcc acc atc cat aat 440Glu Asp Leu Ile Glu Gly Ser Thr Thr Ser Ser Ser Thr Ile His Asn 120 125 130gaa gct gag ctt ggc gat gat gac gtt ttt aca aca gct aca gac agt 488Glu Ala Glu Leu Gly Asp Asp Asp Val Phe Thr Thr Ala Thr Asp Ser 135 140 145tct tct aat tcc tct cag aag aga gag caa cct act cgg aca atc tcc 536Ser Ser Asn Ser Ser Gln Lys Arg Glu Gln Pro Thr Arg Thr Ile Ser 150 155 160tct ccc aca tcc tgt gag cac cgg agg att tat acc ctg ggc cac ctc 584Ser Pro Thr Ser Cys Glu His Arg Arg Ile Tyr Thr Leu Gly His Leu165 170 175 180cac gac tca tac ccc aca gac cac tat cac ctc gat cag ccg atg cca 632His Asp Ser Tyr Pro Thr Asp His Tyr His Leu Asp Gln Pro Met Pro 185 190 195agg ccc tac cgc cag gtg agc ttc ccg gac gac gac gag gag atc gtg 680Arg Pro Tyr Arg Gln Val Ser Phe Pro Asp Asp Asp Glu Glu Ile Val 200 205 210cgc atc aac ccc cgg gag aag atc tgg atg acg ggg tac gag gat tac 728Arg Ile Asn Pro Arg Glu Lys Ile Trp Met Thr Gly Tyr Glu Asp Tyr 215 220 225cgg cac gca ccc gtc agg ggc aag tac ccg gac ccc tcg gag gac gcg 776Arg His Ala Pro Val Arg Gly Lys Tyr Pro Asp Pro Ser Glu Asp Ala 230 235 240gac tcc tcc tac gtg cgc ttc gcc aag ggc gag gtc ccc aag cat gac 824Asp Ser Ser Tyr Val Arg Phe Ala Lys Gly Glu Val Pro Lys His Asp245 250 255 260tac aac tac ccc tac gtg gac tcc tca gac ttt ggc cta ggc gag gac 872Tyr Asn Tyr Pro Tyr Val Asp Ser Ser Asp Phe Gly Leu Gly Glu Asp 265 270 275ccc aaa ggc cgc ggg ggc agc gtg atc aag acg cag ccc tcc cgg ggc 920Pro Lys Gly Arg Gly Gly Ser Val Ile Lys Thr Gln Pro Ser Arg Gly 280 285 290aag tcg cgg cgg cgg aag gag gac gga gag cgc tcg cgg tgc gtg tac 968Lys Ser Arg Arg Arg Lys Glu Asp Gly Glu Arg Ser Arg Cys Val Tyr 295 300 305tgc agg gac atg ttc aac cac gag gag aac cgc cgg ggc cac tgc cag 1016Cys Arg Asp Met Phe Asn His Glu Glu Asn Arg Arg Gly His Cys Gln 310 315 320gac gcg ccc gac tcc gtg aga act tgc atc cgc cgg gtg agc tgc atg 1064Asp Ala Pro Asp Ser Val Arg Thr Cys Ile Arg Arg Val Ser Cys Met325 330 335 340tgg tgc gcg gac agc atg ctc tat cac tgt atg tcg gac ccc gag gga 1112Trp Cys Ala Asp Ser Met Leu Tyr His Cys Met Ser Asp Pro Glu Gly 345 350 355gac tat aca gac cct tgc tcg tgc gat act agc gac gag aag ttt tgc 1160Asp Tyr Thr Asp Pro Cys Ser Cys Asp Thr Ser Asp Glu Lys Phe Cys 360 365 370ctc cgg tgg atg gct ctt att gcc ttg tct ttc ctg gcc ccc tgt atg 1208Leu Arg Trp Met Ala Leu Ile Ala Leu Ser Phe Leu Ala Pro Cys Met 375 380 385tgc tgt tac ctg ccc ctt cgg gcc tgc tac cac tgc gga gtg atg tgc 1256Cys Cys Tyr Leu Pro Leu Arg Ala Cys Tyr His Cys Gly Val Met Cys 390 395 400agg tgc tgt ggc ggg aag cac aaa gcg gcc gcg tga ctcagtttcc 1302Arg Cys Cys Gly Gly Lys His Lys Ala Ala Ala405 410 415ctcccttctc cctccatccg cagccacagg ggaactcgtc tcttacatac tctcatcttc 1362tcccccgctc ccttccactc caaggagcga ggagggcaag cggcctccca gctccctggt 1422acctcgaggc accattccag ccagggacgc tgccgggtag actctccact ccccctgccg 1482cccacactgc agcagccaca tccatacaca cacgctcgca cagtgttctg aggaaggaac 1542cttcgccaca gactcctgta ctattaacaa tctgtaacca agctaactgt ctcatccatg 1602tgttgatttc ctgtttcctc ctcccccgcc tcttccagtt caaaggagtc tgcaattgga 1662actgctgatt ttcggtgggt tttgtagttg atttttccaa gagcgtcgaa gactctcttt 1722ctcttggttc accttgcctg tcgctagcaa gcatctggtt cagcggaaat gggatgtgag 1782aatgatgaaa cccgacag 180058415PRTHomo sapiens 58Met Ala Ser Pro Gly Ser Asp Ser Tyr Ile Val Arg Val Lys Ala Val1 5 10 15Val Met Thr Arg Asp Asp Ser Ser Gly Gly Trp Phe Pro Arg Glu Gly 20 25 30Gly Gly Ile Ser Arg Val Gly Val Cys Lys Val Met His Pro Glu Gly 35 40 45Asn Gly Arg Ser Gly Phe Leu Ile His Gly Glu Arg Gln Lys Asp Lys 50 55 60Leu Val Val Leu Glu Cys Tyr Val Arg Lys Asp Leu Val Tyr Thr Lys65 70 75 80Ala Asn Pro Thr Phe His His Trp Lys Val Asp Asn Arg Lys Phe Gly 85 90 95Leu Thr Phe Gln Ser Pro Ala Asp Ala Arg Ala Phe Asp Arg Gly Val 100 105 110Arg Lys Ala Ile Glu Asp Leu Ile Glu Gly Ser Thr Thr Ser Ser Ser 115 120 125Thr Ile His Asn Glu Ala Glu Leu Gly Asp Asp Asp Val Phe Thr Thr 130 135 140Ala Thr Asp Ser Ser Ser Asn Ser Ser Gln Lys Arg Glu Gln Pro Thr145 150 155 160Arg Thr Ile Ser Ser Pro Thr Ser Cys Glu His Arg Arg Ile Tyr Thr 165 170 175Leu Gly His Leu His Asp Ser Tyr Pro Thr Asp His Tyr His Leu Asp 180 185 190Gln Pro Met Pro Arg Pro Tyr Arg Gln Val Ser Phe Pro Asp Asp Asp 195 200 205Glu Glu Ile Val Arg Ile Asn Pro Arg Glu Lys Ile Trp Met Thr Gly 210 215 220Tyr Glu Asp Tyr Arg His Ala Pro Val Arg Gly Lys Tyr Pro Asp Pro225 230 235 240Ser Glu Asp Ala Asp Ser Ser Tyr Val Arg Phe Ala Lys Gly Glu Val 245 250 255Pro Lys His Asp Tyr Asn Tyr Pro Tyr Val Asp Ser Ser Asp Phe Gly 260 265 270Leu Gly Glu Asp Pro Lys Gly Arg Gly Gly Ser Val Ile Lys Thr Gln 275 280 285Pro Ser Arg Gly Lys Ser Arg Arg Arg Lys Glu Asp Gly Glu Arg Ser 290 295 300Arg Cys Val Tyr Cys Arg Asp Met Phe Asn His Glu Glu Asn Arg Arg305 310 315 320Gly His Cys Gln Asp Ala Pro Asp Ser Val Arg Thr Cys Ile Arg Arg 325 330 335Val Ser Cys Met Trp Cys Ala Asp Ser Met Leu Tyr His Cys Met Ser 340 345 350Asp Pro Glu Gly Asp Tyr Thr Asp Pro Cys Ser Cys Asp Thr Ser Asp 355 360 365Glu Lys Phe Cys Leu Arg Trp Met Ala Leu Ile Ala Leu Ser Phe Leu 370 375 380Ala Pro Cys Met Cys Cys Tyr Leu Pro Leu Arg Ala Cys Tyr His Cys385 390 395 400Gly Val Met Cys Arg Cys Cys Gly Gly Lys His Lys Ala Ala Ala 405 410 4155961DNAHomo sapiens 59aataatgcta atacccagca ctcgggctcc acaatgtaga ggaaatggca tcgcctggca 60g 61606PRTHomo sapiens 60Met Ala Ser Pro Gly Ser1 56117DNAHomo sapiens 61atggcatcgc ctggcag 176222DNAArtificial Sequenceforward primer used for specifically detecting the variant of the present invention 62ggctccacaa tgtagaggaa at 226322DNAArtificial Sequencereverse primer used for specifically detecting the variant of the present invention 63ggtcataacc acagccttga ca 226477DNAArtificial Sequencepartial polynucleotide specific to the variant of the present invention, which is obtained by PCR using forward primer (SEQ ID NO62) and reverse primer (SEQ ID NO63) 64ggctccacaa tgtagaggaa atggcatcgc ctggcagtga cagctatatt gtgcgtgtca 60aggctgtggt tatgacc 776523DNAArtificial Sequenceforward primer used for specifically detecting the known variant of the gene of the present invention 65acaggcgtct aggtaacaag aaa 236620DNAArtificial Sequencereverse primer used for specifically detecting the known variant of the gene of the present invention 66cagccttgac acgcacaata 206778DNAArtificial Sequencepartial polynucleotide specific to the known variant of the gene of the present invention, which is obtained by PCR using forward primer (SEQ ID NO65) and reverse primer (SEQ ID NO66) 67acaggcgtct aggtaacaag aaaatgaccg aagaaacaca cccagacgat gacagctata 60ttgtgcgtgt caaggctg 786823DNAArtificial Sequenceforward primer used for detecting the variants of the gene of the present invention 68gaccccgagg gagactatac aga 236919DNAArtificial Sequencereverse primer used for detecting the variants of the gene of the present invention 69atccaccgga ggcaaaact 197071DNAArtificial Sequencepartial polynucleotide common to the variants of the gene of the present invention, which is obtained by PCR using forward primer (SEQ ID NO68) and reverse primer (SEQ ID NO69) 70gaccccgagg gagactatac agacccttgc tcgtgcgata ctagcgacga gaagttttgc 60ctccggtgga t 7171101DNAHomo sapiens 71aataatgcta atacccagca ctcgggctcc acaatgtaga ggaaatggca tcgcctggca 60gtgacagcta tattgtgcgt gtcaaggctg tggttatgac c 101721420DNAHomo sapiens 72agagctggag catctcgcag ccctacaact tctttgcggc gcttggtctg tctttgcgtt 60gcgagatcag gatcggtgtg ttgctcagac accggccgaa cgccaccggc acgtctagga 120gtctcctgtg cctggaagag gctggtttca ttcgattcta aaagaaacgg atgttgaaac 180atttgtagca gatatactga aaggagaaaa tttatccaag aaagcaaagg aaaagagaga 240atcccttatt aagaagataa aagatgtaaa gtctatctat cttcaggaat ttcaagacaa 300aggtgatgca gaagatgggg aagaatatga tgaccctttt gctgggcctc cagacactat 360ttcattagcc tcagaacgat atgataaaga cgatgaagcc ccctctgatg gagcccagtt 420tcctccaatt gcagcacaag accttccttt tgttctaaag gctggctacc ttgaaaaacg 480cagaaaagat cacagctttc tgggatttga atggcagaaa cggtggtgtg ctctcagtaa 540aacggtattc tattattatg gaagtgataa

agacaaacaa cagaaaggtg aatttgcaat 600agatggctac agtgtcagaa tgaataacac tctaagaaag gatggaaaga aagattgctg 660ttttgaaatc tctgctcctg ataaacgtat atatcagttt acagcagctt ctcccaaaga 720tgctgaagaa tgggtacagc agctgaaatt tgtattgcaa gatatggaat ctgatattat 780tcctgaggat tatgatgaga gaggagaatt atatgatgat gttgatcatc ctctaccaat 840aagcaatcca ctaacaagca gtcaaccaat agatgatgaa atttatgaag aacttccaga 900agaagaagag gacagtgctc cagtgaaagt ggaagaacaa aggaagatga gtcaggatag 960tgtccatcac acctcagggg ataagagcac tgattatgct aatttttacc agggattgtg 1020ggattgtact ggagcttttt ctgatgagtt gtcatttaag cgtggtgatg tgatttacat 1080tcttagcaag gaatacaata gatatggctg gtgggtagga gaaatgaagg gagccattgg 1140cttggtgcct aaagcctaca taatggagat gtatgatatt tgagagtcct ggaaaaggaa 1200aattcttctg cttgtctgca aatgctctgg atttagaagc gtcatgaaag cacgagtgac 1260agctcctaac ctctccttgt tttattaaac attacttatc tttgactgtt attttatgca 1320gtcgctcatt aaaatattcc tctgatgtga aattaaatga aggatattaa tgtaaattag 1380atgcaaccag ttaagttata cctgttgcta ttttgcaaag 1420731420DNAHomo sapiensCDS(620)..(1183) 73agagctggag catctcgcag ccctacaact tctttgcggc gcttggtctg tctttgcgtt 60gcgagatcag gatcggtgtg ttgctcagac accggccgaa cgccaccggc acgtctagga 120gtctcctgtg cctggaagag gctggtttca ttcgattcta aaagaaacgg atgttgaaac 180atttgtagca gatatactga aaggagaaaa tttatccaag aaagcaaagg aaaagagaga 240atcccttatt aagaagataa aagatgtaaa gtctatctat cttcaggaat ttcaagacaa 300aggtgatgca gaagatgggg aagaatatga tgaccctttt gctgggcctc cagacactat 360ttcattagcc tcagaacgat atgataaaga cgatgaagcc ccctctgatg gagcccagtt 420tcctccaatt gcagcacaag accttccttt tgttctaaag gctggctacc ttgaaaaacg 480cagaaaagat cacagctttc tgggatttga atggcagaaa cggtggtgtg ctctcagtaa 540aacggtattc tattattatg gaagtgataa agacaaacaa cagaaaggtg aatttgcaat 600agatggctac agtgtcaga atg aat aac act cta aga aag gat gga aag aaa 652 Met Asn Asn Thr Leu Arg Lys Asp Gly Lys Lys 1 5 10gat tgc tgt ttt gaa atc tct gct cct gat aaa cgt ata tat cag ttt 700Asp Cys Cys Phe Glu Ile Ser Ala Pro Asp Lys Arg Ile Tyr Gln Phe 15 20 25aca gca gct tct ccc aaa gat gct gaa gaa tgg gta cag cag ctg aaa 748Thr Ala Ala Ser Pro Lys Asp Ala Glu Glu Trp Val Gln Gln Leu Lys 30 35 40ttt gta ttg caa gat atg gaa tct gat att att cct gag gat tat gat 796Phe Val Leu Gln Asp Met Glu Ser Asp Ile Ile Pro Glu Asp Tyr Asp 45 50 55gag aga gga gaa tta tat gat gat gtt gat cat cct cta cca ata agc 844Glu Arg Gly Glu Leu Tyr Asp Asp Val Asp His Pro Leu Pro Ile Ser60 65 70 75aat cca cta aca agc agt caa cca ata gat gat gaa att tat gaa gaa 892Asn Pro Leu Thr Ser Ser Gln Pro Ile Asp Asp Glu Ile Tyr Glu Glu 80 85 90ctt cca gaa gaa gaa gag gac agt gct cca gtg aaa gtg gaa gaa caa 940Leu Pro Glu Glu Glu Glu Asp Ser Ala Pro Val Lys Val Glu Glu Gln 95 100 105agg aag atg agt cag gat agt gtc cat cac acc tca ggg gat aag agc 988Arg Lys Met Ser Gln Asp Ser Val His His Thr Ser Gly Asp Lys Ser 110 115 120act gat tat gct aat ttt tac cag gga ttg tgg gat tgt act gga gct 1036Thr Asp Tyr Ala Asn Phe Tyr Gln Gly Leu Trp Asp Cys Thr Gly Ala 125 130 135ttt tct gat gag ttg tca ttt aag cgt ggt gat gtg att tac att ctt 1084Phe Ser Asp Glu Leu Ser Phe Lys Arg Gly Asp Val Ile Tyr Ile Leu140 145 150 155agc aag gaa tac aat aga tat ggc tgg tgg gta gga gaa atg aag gga 1132Ser Lys Glu Tyr Asn Arg Tyr Gly Trp Trp Val Gly Glu Met Lys Gly 160 165 170gcc att ggc ttg gtg cct aaa gcc tac ata atg gag atg tat gat att 1180Ala Ile Gly Leu Val Pro Lys Ala Tyr Ile Met Glu Met Tyr Asp Ile 175 180 185tga gagtcctgga aaaggaaaat tcttctgctt gtctgcaaat gctctggatt 1233tagaagcgtc atgaaagcac gagtgacagc tcctaacctc tccttgtttt attaaacatt 1293 acttatcttt gactgttatt ttatgcagtc gctcattaaa atattcctct gatgtgaaat 1353taaatgaagg atattaatgt aaattagatg caaccagtta agttatacct gttgctattt 1413 tgcaaag 142074187PRTHomo sapiens 74Met Asn Asn Thr Leu Arg Lys Asp Gly Lys Lys Asp Cys Cys Phe Glu1 5 10 15Ile Ser Ala Pro Asp Lys Arg Ile Tyr Gln Phe Thr Ala Ala Ser Pro 20 25 30Lys Asp Ala Glu Glu Trp Val Gln Gln Leu Lys Phe Val Leu Gln Asp 35 40 45Met Glu Ser Asp Ile Ile Pro Glu Asp Tyr Asp Glu Arg Gly Glu Leu 50 55 60Tyr Asp Asp Val Asp His Pro Leu Pro Ile Ser Asn Pro Leu Thr Ser65 70 75 80Ser Gln Pro Ile Asp Asp Glu Ile Tyr Glu Glu Leu Pro Glu Glu Glu 85 90 95Glu Asp Ser Ala Pro Val Lys Val Glu Glu Gln Arg Lys Met Ser Gln 100 105 110Asp Ser Val His His Thr Ser Gly Asp Lys Ser Thr Asp Tyr Ala Asn 115 120 125Phe Tyr Gln Gly Leu Trp Asp Cys Thr Gly Ala Phe Ser Asp Glu Leu 130 135 140Ser Phe Lys Arg Gly Asp Val Ile Tyr Ile Leu Ser Lys Glu Tyr Asn145 150 155 160Arg Tyr Gly Trp Trp Val Gly Glu Met Lys Gly Ala Ile Gly Leu Val 165 170 175Pro Lys Ala Tyr Ile Met Glu Met Tyr Asp Ile 180 18575169DNAHomo sapiens 75agagctggag catctcgcag ccctacaact tctttgcggc gcttggtctg tctttgcgtt 60gcgagatcag gatcggtgtg ttgctcagac accggccgaa cgccaccggc acgtctagga 120gtctcctgtg cctggaagag gctggtttca ttcgattcta aaagaaacg 16976619DNAHomo sapiens 76agagctggag catctcgcag ccctacaact tctttgcggc gcttggtctg tctttgcgtt 60gcgagatcag gatcggtgtg ttgctcagac accggccgaa cgccaccggc acgtctagga 120gtctcctgtg cctggaagag gctggtttca ttcgattcta aaagaaacgg atgttgaaac 180atttgtagca gatatactga aaggagaaaa tttatccaag aaagcaaagg aaaagagaga 240atcccttatt aagaagataa aagatgtaaa gtctatctat cttcaggaat ttcaagacaa 300aggtgatgca gaagatgggg aagaatatga tgaccctttt gctgggcctc cagacactat 360ttcattagcc tcagaacgat atgataaaga cgatgaagcc ccctctgatg gagcccagtt 420tcctccaatt gcagcacaag accttccttt tgttctaaag gctggctacc ttgaaaaacg 480cagaaaagat cacagctttc tgggatttga atggcagaaa cggtggtgtg ctctcagtaa 540aacggtattc tattattatg gaagtgataa agacaaacaa cagaaaggtg aatttgcaat 600agatggctac agtgtcaga 6197720DNAArtificial Sequenceforward primer used for specifically detecting the variant of the present invention 77tctcgcagcc ctacaacttc 207820DNAArtificial Sequencereverse primer used for specifically detecting the variant of the present invention 78gcaacacacc gatcctgatc 207972DNAArtificial Sequencepartial polynucleotide specific to the variant of the present invention, which is obtained by PCR using forward primer (SEQ ID NO77) and reverse primer (SEQ ID NO78) 79tctcgcagcc ctacaacttc tttgcggcgc ttggtctgtc tttgcgttgc gagatcagga 60tcggtgtgtt gc 728018DNAArtificial Sequenceforward primer used for specifically detecting the known variant of the gene of the present invention 80cctgcgctcc ctaacatg 188122DNAArtificial Sequencereverse primer used for specifically detecting the known variant of the gene of the present invention 81tgccaacagg ttcctaattt cc 228281DNAArtificial Sequencepartial polynucleotide specific to the known variant of the gene of the present invention, which is obtained by PCR using forward primer (SEQ ID NO80) and reverse primer (SEQ ID NO81) 82cctgcgctcc ctaacatgcc caaccccagc agcacctcct ctccctaccc cctccctgag 60gaaattagga acctgttggc a 818324DNAArtificial Sequenceforward primer used for detecting the variants of the gene of the present invention 83catcctctac caataagcaa tcca 248427DNAArtificial Sequencereverse primer used for detecting the variants of the gene of the present invention 84ctatcctgac tcatcttcct ttgttct 2785134DNAArtificial Sequencepartial polynucleotide common to the variants of the gene of the present invention, which is obtained by PCR using forward primer (SEQ ID NO83) and reverse primer (SEQ ID NO84) 85catcctctac caataagcaa tccactaaca agcagtcaac caatagatga tgaaatttat 60gaagaacttc cagaagaaga agaggacagt gctccagtga aagtggaaga acaaaggaag 120atgagtcagg atag 1348684DNAHomo sapiens 86agagctggag catctcgcag ccctacaact tctttgcggc gcttggtctg tctttgcgtt 60gcgagatcag gatcggtgtg ttgc 84871660DNAHomo sapiens 87aagccagttt gaacttcagg aggaggaact atttggccaa aacacctcct aatatgtagt 60tcagtagaga tgccctggag aatgctgcac tgttccacat gcctgcaaat attcctctaa 120tgatacagtc ccaaggcttt ggacattgat agactccagg tatggcagca gccaaacttc 180tgcatgactc tggactgaat gtggttgttc tggaagcccg ggaccgtgtg ggaggcagga 240cttacactct taggaaccaa aaggttaaat atgtggacct tggaggatcc tatgttggac 300caacccagaa tcgtatcttg agattagcca aggagctagg attggagacc tacaaagtga 360atgaggttga gcgtctgatc caccatgtaa agggcaaatc ataccccttc agggggccat 420tcccacctgt atggaatcca attacctact tagatcataa caacttttgg aggacaatgg 480atgacatggg gcgagagatt ccgagtgatg ccccatggaa ggctcccctt gcagaagagt 540gggacaacat gacaatgaag gagctactgg acaagctctg ctggactgaa tctgcaaagc 600agcttgccac tctctttgtg aacctgtgtg tcactgcaga gacccatgag gtctctgctc 660tctggttcct gtggtatgtg aagcagtgtg gaggcacaac aagaatcatc tcgacaacaa 720atggaggaca ggagaggaaa tttgtgggcg gatctggtca agtgagtgag cggataatgg 780acctccttgg agaccgagtg aagctggaga ggcctgtgat ctacattgac cagacaagag 840aaaatgtcct tgtggagacc ctaaaccatg agatgtatga ggctaaatat gtgattagtg 900ctattcctcc tactctgggc atgaagattc acttcaatcc ccctctgcca atgatgagaa 960accagatgat cactcgtgtg cctttgggtt cagtcatcaa gtgtatagtt tattataaag 1020agcctttctg gaggaaaaag gattactgtg gaaccatgat tattgatgga gaagaagctc 1080cagttgccta cacgttggat gataccaaac ctgaaggcaa ctatgctgcc ataatgggat 1140ttatcctggc ccacaaagcc agaaaactgg cacgtcttac caaagaggaa aggttgaaga 1200aactttgtga actctatgcc aaggttctgg gttccctaga agctctggag ggttctacgc 1260cagccagtgg acaggattta ctttgcaggc accgagactg ccacacactg gagcggctac 1320atggaggggg ctgtagaggc cggggagaga gcagcccgag agatcctgca tgccatgggg 1380aagattccag aggatgaaat ctggcagtca gaaccagagt ctgtggatgt ccctgcacag 1440cccatcacca ccaccttttt ggagagacat ttgccctccg tgccaggcct gctcaggctg 1500attggattga ccaccatctt ttcagcaacg gctcttggct tcctggccca caaaaggggg 1560ctacttgtga gagtctaaag agagagggtg tctgtaatca cactctcttc ttactgtatt 1620tgggatatga gtttggggaa agagttgcag taaagttcca 1660881660DNAHomo sapiensCDS(162)..(1397) 88aagccagttt gaacttcagg aggaggaact atttggccaa aacacctcct aatatgtagt 60tcagtagaga tgccctggag aatgctgcac tgttccacat gcctgcaaat attcctctaa 120tgatacagtc ccaaggcttt ggacattgat agactccagg t atg gca gca gcc aaa 176 Met Ala Ala Ala Lys 1 5ctt ctg cat gac tct gga ctg aat gtg gtt gtt ctg gaa gcc cgg gac 224Leu Leu His Asp Ser Gly Leu Asn Val Val Val Leu Glu Ala Arg Asp 10 15 20cgt gtg gga ggc agg act tac act ctt agg aac caa aag gtt aaa tat 272Arg Val Gly Gly Arg Thr Tyr Thr Leu Arg Asn Gln Lys Val Lys Tyr 25 30 35gtg gac ctt gga gga tcc tat gtt gga cca acc cag aat cgt atc ttg 320Val Asp Leu Gly Gly Ser Tyr Val Gly Pro Thr Gln Asn Arg Ile Leu 40 45 50aga tta gcc aag gag cta gga ttg gag acc tac aaa gtg aat gag gtt 368Arg Leu Ala Lys Glu Leu Gly Leu Glu Thr Tyr Lys Val Asn Glu Val 55 60 65gag cgt ctg atc cac cat gta aag ggc aaa tca tac ccc ttc agg ggg 416Glu Arg Leu Ile His His Val Lys Gly Lys Ser Tyr Pro Phe Arg Gly70 75 80 85cca ttc cca cct gta tgg aat cca att acc tac tta gat cat aac aac 464Pro Phe Pro Pro Val Trp Asn Pro Ile Thr Tyr Leu Asp His Asn Asn 90 95 100ttt tgg agg aca atg gat gac atg ggg cga gag att ccg agt gat gcc 512Phe Trp Arg Thr Met Asp Asp Met Gly Arg Glu Ile Pro Ser Asp Ala 105 110 115cca tgg aag gct ccc ctt gca gaa gag tgg gac aac atg aca atg aag 560Pro Trp Lys Ala Pro Leu Ala Glu Glu Trp Asp Asn Met Thr Met Lys 120 125 130gag cta ctg gac aag ctc tgc tgg act gaa tct gca aag cag ctt gcc 608Glu Leu Leu Asp Lys Leu Cys Trp Thr Glu Ser Ala Lys Gln Leu Ala 135 140 145act ctc ttt gtg aac ctg tgt gtc act gca gag acc cat gag gtc tct 656Thr Leu Phe Val Asn Leu Cys Val Thr Ala Glu Thr His Glu Val Ser150 155 160 165gct ctc tgg ttc ctg tgg tat gtg aag cag tgt gga ggc aca aca aga 704Ala Leu Trp Phe Leu Trp Tyr Val Lys Gln Cys Gly Gly Thr Thr Arg 170 175 180atc atc tcg aca aca aat gga gga cag gag agg aaa ttt gtg ggc gga 752Ile Ile Ser Thr Thr Asn Gly Gly Gln Glu Arg Lys Phe Val Gly Gly 185 190 195tct ggt caa gtg agt gag cgg ata atg gac ctc ctt gga gac cga gtg 800Ser Gly Gln Val Ser Glu Arg Ile Met Asp Leu Leu Gly Asp Arg Val 200 205 210aag ctg gag agg cct gtg atc tac att gac cag aca aga gaa aat gtc 848Lys Leu Glu Arg Pro Val Ile Tyr Ile Asp Gln Thr Arg Glu Asn Val 215 220 225ctt gtg gag acc cta aac cat gag atg tat gag gct aaa tat gtg att 896Leu Val Glu Thr Leu Asn His Glu Met Tyr Glu Ala Lys Tyr Val Ile230 235 240 245agt gct att cct cct act ctg ggc atg aag att cac ttc aat ccc cct 944Ser Ala Ile Pro Pro Thr Leu Gly Met Lys Ile His Phe Asn Pro Pro 250 255 260ctg cca atg atg aga aac cag atg atc act cgt gtg cct ttg ggt tca 992Leu Pro Met Met Arg Asn Gln Met Ile Thr Arg Val Pro Leu Gly Ser 265 270 275gtc atc aag tgt ata gtt tat tat aaa gag cct ttc tgg agg aaa aag 1040Val Ile Lys Cys Ile Val Tyr Tyr Lys Glu Pro Phe Trp Arg Lys Lys 280 285 290gat tac tgt gga acc atg att att gat gga gaa gaa gct cca gtt gcc 1088Asp Tyr Cys Gly Thr Met Ile Ile Asp Gly Glu Glu Ala Pro Val Ala 295 300 305tac acg ttg gat gat acc aaa cct gaa ggc aac tat gct gcc ata atg 1136Tyr Thr Leu Asp Asp Thr Lys Pro Glu Gly Asn Tyr Ala Ala Ile Met310 315 320 325gga ttt atc ctg gcc cac aaa gcc aga aaa ctg gca cgt ctt acc aaa 1184Gly Phe Ile Leu Ala His Lys Ala Arg Lys Leu Ala Arg Leu Thr Lys 330 335 340gag gaa agg ttg aag aaa ctt tgt gaa ctc tat gcc aag gtt ctg ggt 1232Glu Glu Arg Leu Lys Lys Leu Cys Glu Leu Tyr Ala Lys Val Leu Gly 345 350 355tcc cta gaa gct ctg gag ggt tct acg cca gcc agt gga cag gat tta 1280Ser Leu Glu Ala Leu Glu Gly Ser Thr Pro Ala Ser Gly Gln Asp Leu 360 365 370ctt tgc agg cac cga gac tgc cac aca ctg gag cgg cta cat gga ggg 1328Leu Cys Arg His Arg Asp Cys His Thr Leu Glu Arg Leu His Gly Gly 375 380 385ggc tgt aga ggc cgg gga gag agc agc ccg aga gat cct gca tgc cat 1376Gly Cys Arg Gly Arg Gly Glu Ser Ser Pro Arg Asp Pro Ala Cys His390 395 400 405ggg gaa gat tcc aga gga tga aatctggcag tcagaaccag agtctgtgga 1427Gly Glu Asp Ser Arg Gly 410tgtccctgca cagcccatca ccaccacctt tttggagaga catttgccct ccgtgccagg 1487cctgctcagg ctgattggat tgaccaccat cttttcagca acggctcttg gcttcctggc 1547ccacaaaagg gggctacttg tgagagtcta aagagagagg gtgtctgtaa tcacactctc 1607ttcttactgt atttgggata tgagtttggg gaaagagttg cagtaaagtt cca 166089411PRTHomo sapiens 89Met Ala Ala Ala Lys Leu Leu His Asp Ser Gly Leu Asn Val Val Val1 5 10 15Leu Glu Ala Arg Asp Arg Val Gly Gly Arg Thr Tyr Thr Leu Arg Asn 20 25 30Gln Lys Val Lys Tyr Val Asp Leu Gly Gly Ser Tyr Val Gly Pro Thr 35 40 45Gln Asn Arg Ile Leu Arg Leu Ala Lys Glu Leu Gly Leu Glu Thr Tyr 50 55 60Lys Val Asn Glu Val Glu Arg Leu Ile His His Val Lys Gly Lys Ser65 70 75 80Tyr Pro Phe Arg Gly Pro Phe Pro Pro Val Trp Asn Pro Ile Thr Tyr 85 90 95Leu Asp His Asn Asn Phe Trp Arg Thr Met Asp Asp Met Gly Arg Glu 100 105 110Ile Pro Ser Asp Ala Pro Trp Lys Ala Pro Leu Ala Glu Glu Trp Asp 115 120 125Asn Met Thr Met Lys Glu Leu Leu Asp Lys Leu Cys Trp Thr Glu Ser 130 135 140Ala Lys Gln Leu Ala

Thr Leu Phe Val Asn Leu Cys Val Thr Ala Glu145 150 155 160Thr His Glu Val Ser Ala Leu Trp Phe Leu Trp Tyr Val Lys Gln Cys 165 170 175Gly Gly Thr Thr Arg Ile Ile Ser Thr Thr Asn Gly Gly Gln Glu Arg 180 185 190Lys Phe Val Gly Gly Ser Gly Gln Val Ser Glu Arg Ile Met Asp Leu 195 200 205Leu Gly Asp Arg Val Lys Leu Glu Arg Pro Val Ile Tyr Ile Asp Gln 210 215 220Thr Arg Glu Asn Val Leu Val Glu Thr Leu Asn His Glu Met Tyr Glu225 230 235 240Ala Lys Tyr Val Ile Ser Ala Ile Pro Pro Thr Leu Gly Met Lys Ile 245 250 255His Phe Asn Pro Pro Leu Pro Met Met Arg Asn Gln Met Ile Thr Arg 260 265 270Val Pro Leu Gly Ser Val Ile Lys Cys Ile Val Tyr Tyr Lys Glu Pro 275 280 285Phe Trp Arg Lys Lys Asp Tyr Cys Gly Thr Met Ile Ile Asp Gly Glu 290 295 300Glu Ala Pro Val Ala Tyr Thr Leu Asp Asp Thr Lys Pro Glu Gly Asn305 310 315 320Tyr Ala Ala Ile Met Gly Phe Ile Leu Ala His Lys Ala Arg Lys Leu 325 330 335Ala Arg Leu Thr Lys Glu Glu Arg Leu Lys Lys Leu Cys Glu Leu Tyr 340 345 350Ala Lys Val Leu Gly Ser Leu Glu Ala Leu Glu Gly Ser Thr Pro Ala 355 360 365Ser Gly Gln Asp Leu Leu Cys Arg His Arg Asp Cys His Thr Leu Glu 370 375 380Arg Leu His Gly Gly Gly Cys Arg Gly Arg Gly Glu Ser Ser Pro Arg385 390 395 400Asp Pro Ala Cys His Gly Glu Asp Ser Arg Gly 405 41090156DNAHomo sapiens 90aagccagttt gaacttcagg aggaggaact atttggccaa aacacctcct aatatgtagt 60tcagtagaga tgccctggag aatgctgcac tgttccacat gcctgcaaat attcctctaa 120tgatacagtc ccaaggcttt ggacattgat agactc 15691161DNAHomo sapiens 91aagccagttt gaacttcagg aggaggaact atttggccaa aacacctcct aatatgtagt 60tcagtagaga tgccctggag aatgctgcac tgttccacat gcctgcaaat attcctctaa 120tgatacagtc ccaaggcttt ggacattgat agactccagg t 1619242DNAHomo sapiens 92ggttccctag aagctctgga gggttctacg ccagccagtg ga 429348PRTHomo sapiens 93Gly Ser Thr Pro Ala Ser Gly Gln Asp Leu Leu Cys Arg His Arg Asp1 5 10 15Cys His Thr Leu Glu Arg Leu His Gly Gly Gly Cys Arg Gly Arg Gly 20 25 30Glu Ser Ser Pro Arg Asp Pro Ala Cys His Gly Glu Asp Ser Arg Gly 35 40 459414PRTHomo sapiens 94Gly Ser Leu Glu Ala Leu Glu Gly Ser Thr Pro Ala Ser Gly1 5 109598DNAHomo sapiens 95ccagtgcatt atgaagaaaa gaactggtgt gaggagcagt actctggggg ctgctacaca 60acttatttcc cccctgggat cctgactcaa tatggaag 989633PRTHomo sapiens 96Pro Val His Tyr Glu Glu Lys Asn Trp Cys Glu Glu Gln Tyr Ser Gly1 5 10 15Gly Cys Tyr Thr Thr Tyr Phe Pro Pro Gly Ile Leu Thr Gln Tyr Gly 20 25 30Arg971864DNAHomo sapiens 97aaagaaaacg gagcagcggg caccagggag gcctggaacg gggcgagcgc catgagcaac 60aaatgcgacg tggtcgtggt ggggggcggc atctcagaaa catcagactc tgggcctcta 120ttgacaaacc gtatccactt gtatggcagc agccaaactt ctgcatgact ctggactgaa 180tgtggttgtt ctggaagccc gggaccgtgt gggaggcagg acttacactc ttaggaacca 240aaaggttaaa tatgtggacc ttggaggatc ctatgttgga ccaacccaga atcgtatctt 300gagattagcc aaggagctag gattggagac ctacaaagtg aatgaggttg agcgtctgat 360ccaccatgta aagggcaaat catacccctt cagggggcca ttcccacctg tatggaatcc 420aattacctac ttagatcata acaacttttg gaggacaatg gatgacatgg ggcgagagat 480tccgagtgat gccccatgga aggctcccct tgcagaagag tgggacaaca tgacaatgaa 540ggagctactg gacaagctct gctggactga atctgcaaag cagcttgcca ctctctttgt 600gaacctgtgt gtcactgcag agacccatga ggtctctgct ctctggttcc tgtggtatgt 660gaagcagtgt ggaggcacaa caagaatcat ctcgacaaca aatggaggac aggagaggaa 720atttgtgggc ggatctggtc aagtgagtga gcggataatg gacctccttg gagaccgagt 780gaagctggag aggcctgtga tctacattga ccagacaaga gaaaatgtcc ttgtggagac 840cctaaaccat gagatgtatg aggctaaata tgtgattagt gctattcctc ctactctggg 900catgaagatt cacttcaatc cccctctgcc aatgatgaga aaccagatga tcactcgtgt 960gcctttgggt tcagtcatca agtgtatagt ttattataaa gagcctttct ggaggaaaaa 1020ggattactgt ggaaccatga ttattgatgg agaagaagct ccagttgcct acacgttgga 1080tgacaccaaa cctgaaggca actatgctgc cataatggga tttatcctgg cccacaaagc 1140cagaaaactg gcacgtctta ccaaagagga aaggttgaag aaactttgtg aactctatgc 1200caaggttctg ggttccctag aagctctgga gccagtgcat tatgaagaaa agaactggtg 1260tgaggagcag tactctgggg gctgctacac aacttatttc ccccctggga tcctgactca 1320atatggaagg gttctacgcc agccagtgga caggatttac tttgcaggca ccgagactgc 1380cacacactgg agcggctaca tggagggggc tgtagaggcc ggggagagag cagcccgaga 1440gatcctgcat gccatgggga agattccaga ggatgaaatc tggcagtcag aaccagagtc 1500tgtggatgtc cctgcacagc ccatcaccac cacctttttg gagagacatt tgccctccgt 1560gccaggcctg ctcaggctga ttggattgac caccatcttt tcagcaacgg ctcttggctt 1620cctggcccac aaaagggggc tacttgtgag agtctaaaga gagagggtgt ctgtaatcac 1680actctcttct tactgtattt gggatatgag tttggggaaa gagttgcagt aaagttccat 1740gaagacaaat agtgtggagt gaggcgggga gcatgaagat aaatccaact ctgactgtaa 1800aatacatggt atctctttct ccgttgtggc ccctgcttag tgtcccttac ctggcttagc 1860gttc 1864981864DNAHomo sapiensCDS(143)..(1657) 98aaagaaaacg gagcagcggg caccagggag gcctggaacg gggcgagcgc catgagcaac 60aaatgcgacg tggtcgtggt ggggggcggc atctcagaaa catcagactc tgggcctcta 120ttgacaaacc gtatccactt gt atg gca gca gcc aaa ctt ctg cat gac tct 172 Met Ala Ala Ala Lys Leu Leu His Asp Ser 1 5 10gga ctg aat gtg gtt gtt ctg gaa gcc cgg gac cgt gtg gga ggc agg 220Gly Leu Asn Val Val Val Leu Glu Ala Arg Asp Arg Val Gly Gly Arg 15 20 25act tac act ctt agg aac caa aag gtt aaa tat gtg gac ctt gga gga 268Thr Tyr Thr Leu Arg Asn Gln Lys Val Lys Tyr Val Asp Leu Gly Gly 30 35 40tcc tat gtt gga cca acc cag aat cgt atc ttg aga tta gcc aag gag 316Ser Tyr Val Gly Pro Thr Gln Asn Arg Ile Leu Arg Leu Ala Lys Glu 45 50 55cta gga ttg gag acc tac aaa gtg aat gag gtt gag cgt ctg atc cac 364Leu Gly Leu Glu Thr Tyr Lys Val Asn Glu Val Glu Arg Leu Ile His 60 65 70cat gta aag ggc aaa tca tac ccc ttc agg ggg cca ttc cca cct gta 412His Val Lys Gly Lys Ser Tyr Pro Phe Arg Gly Pro Phe Pro Pro Val75 80 85 90tgg aat cca att acc tac tta gat cat aac aac ttt tgg agg aca atg 460Trp Asn Pro Ile Thr Tyr Leu Asp His Asn Asn Phe Trp Arg Thr Met 95 100 105gat gac atg ggg cga gag att ccg agt gat gcc cca tgg aag gct ccc 508Asp Asp Met Gly Arg Glu Ile Pro Ser Asp Ala Pro Trp Lys Ala Pro 110 115 120ctt gca gaa gag tgg gac aac atg aca atg aag gag cta ctg gac aag 556Leu Ala Glu Glu Trp Asp Asn Met Thr Met Lys Glu Leu Leu Asp Lys 125 130 135ctc tgc tgg act gaa tct gca aag cag ctt gcc act ctc ttt gtg aac 604Leu Cys Trp Thr Glu Ser Ala Lys Gln Leu Ala Thr Leu Phe Val Asn 140 145 150ctg tgt gtc act gca gag acc cat gag gtc tct gct ctc tgg ttc ctg 652Leu Cys Val Thr Ala Glu Thr His Glu Val Ser Ala Leu Trp Phe Leu155 160 165 170tgg tat gtg aag cag tgt gga ggc aca aca aga atc atc tcg aca aca 700Trp Tyr Val Lys Gln Cys Gly Gly Thr Thr Arg Ile Ile Ser Thr Thr 175 180 185aat gga gga cag gag agg aaa ttt gtg ggc gga tct ggt caa gtg agt 748Asn Gly Gly Gln Glu Arg Lys Phe Val Gly Gly Ser Gly Gln Val Ser 190 195 200gag cgg ata atg gac ctc ctt gga gac cga gtg aag ctg gag agg cct 796Glu Arg Ile Met Asp Leu Leu Gly Asp Arg Val Lys Leu Glu Arg Pro 205 210 215gtg atc tac att gac cag aca aga gaa aat gtc ctt gtg gag acc cta 844Val Ile Tyr Ile Asp Gln Thr Arg Glu Asn Val Leu Val Glu Thr Leu 220 225 230aac cat gag atg tat gag gct aaa tat gtg att agt gct att cct cct 892Asn His Glu Met Tyr Glu Ala Lys Tyr Val Ile Ser Ala Ile Pro Pro235 240 245 250act ctg ggc atg aag att cac ttc aat ccc cct ctg cca atg atg aga 940Thr Leu Gly Met Lys Ile His Phe Asn Pro Pro Leu Pro Met Met Arg 255 260 265aac cag atg atc act cgt gtg cct ttg ggt tca gtc atc aag tgt ata 988Asn Gln Met Ile Thr Arg Val Pro Leu Gly Ser Val Ile Lys Cys Ile 270 275 280gtt tat tat aaa gag cct ttc tgg agg aaa aag gat tac tgt gga acc 1036Val Tyr Tyr Lys Glu Pro Phe Trp Arg Lys Lys Asp Tyr Cys Gly Thr 285 290 295atg att att gat gga gaa gaa gct cca gtt gcc tac acg ttg gat gac 1084Met Ile Ile Asp Gly Glu Glu Ala Pro Val Ala Tyr Thr Leu Asp Asp 300 305 310acc aaa cct gaa ggc aac tat gct gcc ata atg gga ttt atc ctg gcc 1132Thr Lys Pro Glu Gly Asn Tyr Ala Ala Ile Met Gly Phe Ile Leu Ala315 320 325 330cac aaa gcc aga aaa ctg gca cgt ctt acc aaa gag gaa agg ttg aag 1180His Lys Ala Arg Lys Leu Ala Arg Leu Thr Lys Glu Glu Arg Leu Lys 335 340 345aaa ctt tgt gaa ctc tat gcc aag gtt ctg ggt tcc cta gaa gct ctg 1228Lys Leu Cys Glu Leu Tyr Ala Lys Val Leu Gly Ser Leu Glu Ala Leu 350 355 360gag cca gtg cat tat gaa gaa aag aac tgg tgt gag gag cag tac tct 1276Glu Pro Val His Tyr Glu Glu Lys Asn Trp Cys Glu Glu Gln Tyr Ser 365 370 375ggg ggc tgc tac aca act tat ttc ccc cct ggg atc ctg act caa tat 1324Gly Gly Cys Tyr Thr Thr Tyr Phe Pro Pro Gly Ile Leu Thr Gln Tyr 380 385 390gga agg gtt cta cgc cag cca gtg gac agg att tac ttt gca ggc acc 1372Gly Arg Val Leu Arg Gln Pro Val Asp Arg Ile Tyr Phe Ala Gly Thr395 400 405 410gag act gcc aca cac tgg agc ggc tac atg gag ggg gct gta gag gcc 1420Glu Thr Ala Thr His Trp Ser Gly Tyr Met Glu Gly Ala Val Glu Ala 415 420 425ggg gag aga gca gcc cga gag atc ctg cat gcc atg ggg aag att cca 1468Gly Glu Arg Ala Ala Arg Glu Ile Leu His Ala Met Gly Lys Ile Pro 430 435 440gag gat gaa atc tgg cag tca gaa cca gag tct gtg gat gtc cct gca 1516Glu Asp Glu Ile Trp Gln Ser Glu Pro Glu Ser Val Asp Val Pro Ala 445 450 455cag ccc atc acc acc acc ttt ttg gag aga cat ttg ccc tcc gtg cca 1564Gln Pro Ile Thr Thr Thr Phe Leu Glu Arg His Leu Pro Ser Val Pro 460 465 470ggc ctg ctc agg ctg att gga ttg acc acc atc ttt tca gca acg gct 1612Gly Leu Leu Arg Leu Ile Gly Leu Thr Thr Ile Phe Ser Ala Thr Ala475 480 485 490ctt ggc ttc ctg gcc cac aaa agg ggg cta ctt gtg aga gtc taa 1657Leu Gly Phe Leu Ala His Lys Arg Gly Leu Leu Val Arg Val 495 500agagagaggg tgtctgtaat cacactctct tcttactgta tttgggatat gagtttgggg 1717aaagagttgc agtaaagttc catgaagaca aatagtgtgg agtgaggcgg ggagcatgaa 1777gataaatcca actctgactg taaaatacat ggtatctctt tctccgttgt ggcccctgct 1837tagtgtccct tacctggctt agcgttc 186499504PRTHomo sapiens 99Met Ala Ala Ala Lys Leu Leu His Asp Ser Gly Leu Asn Val Val Val1 5 10 15Leu Glu Ala Arg Asp Arg Val Gly Gly Arg Thr Tyr Thr Leu Arg Asn 20 25 30Gln Lys Val Lys Tyr Val Asp Leu Gly Gly Ser Tyr Val Gly Pro Thr 35 40 45Gln Asn Arg Ile Leu Arg Leu Ala Lys Glu Leu Gly Leu Glu Thr Tyr 50 55 60Lys Val Asn Glu Val Glu Arg Leu Ile His His Val Lys Gly Lys Ser65 70 75 80Tyr Pro Phe Arg Gly Pro Phe Pro Pro Val Trp Asn Pro Ile Thr Tyr 85 90 95Leu Asp His Asn Asn Phe Trp Arg Thr Met Asp Asp Met Gly Arg Glu 100 105 110Ile Pro Ser Asp Ala Pro Trp Lys Ala Pro Leu Ala Glu Glu Trp Asp 115 120 125Asn Met Thr Met Lys Glu Leu Leu Asp Lys Leu Cys Trp Thr Glu Ser 130 135 140Ala Lys Gln Leu Ala Thr Leu Phe Val Asn Leu Cys Val Thr Ala Glu145 150 155 160Thr His Glu Val Ser Ala Leu Trp Phe Leu Trp Tyr Val Lys Gln Cys 165 170 175Gly Gly Thr Thr Arg Ile Ile Ser Thr Thr Asn Gly Gly Gln Glu Arg 180 185 190Lys Phe Val Gly Gly Ser Gly Gln Val Ser Glu Arg Ile Met Asp Leu 195 200 205Leu Gly Asp Arg Val Lys Leu Glu Arg Pro Val Ile Tyr Ile Asp Gln 210 215 220Thr Arg Glu Asn Val Leu Val Glu Thr Leu Asn His Glu Met Tyr Glu225 230 235 240Ala Lys Tyr Val Ile Ser Ala Ile Pro Pro Thr Leu Gly Met Lys Ile 245 250 255His Phe Asn Pro Pro Leu Pro Met Met Arg Asn Gln Met Ile Thr Arg 260 265 270Val Pro Leu Gly Ser Val Ile Lys Cys Ile Val Tyr Tyr Lys Glu Pro 275 280 285Phe Trp Arg Lys Lys Asp Tyr Cys Gly Thr Met Ile Ile Asp Gly Glu 290 295 300Glu Ala Pro Val Ala Tyr Thr Leu Asp Asp Thr Lys Pro Glu Gly Asn305 310 315 320Tyr Ala Ala Ile Met Gly Phe Ile Leu Ala His Lys Ala Arg Lys Leu 325 330 335Ala Arg Leu Thr Lys Glu Glu Arg Leu Lys Lys Leu Cys Glu Leu Tyr 340 345 350Ala Lys Val Leu Gly Ser Leu Glu Ala Leu Glu Pro Val His Tyr Glu 355 360 365Glu Lys Asn Trp Cys Glu Glu Gln Tyr Ser Gly Gly Cys Tyr Thr Thr 370 375 380Tyr Phe Pro Pro Gly Ile Leu Thr Gln Tyr Gly Arg Val Leu Arg Gln385 390 395 400Pro Val Asp Arg Ile Tyr Phe Ala Gly Thr Glu Thr Ala Thr His Trp 405 410 415Ser Gly Tyr Met Glu Gly Ala Val Glu Ala Gly Glu Arg Ala Ala Arg 420 425 430Glu Ile Leu His Ala Met Gly Lys Ile Pro Glu Asp Glu Ile Trp Gln 435 440 445Ser Glu Pro Glu Ser Val Asp Val Pro Ala Gln Pro Ile Thr Thr Thr 450 455 460Phe Leu Glu Arg His Leu Pro Ser Val Pro Gly Leu Leu Arg Leu Ile465 470 475 480Gly Leu Thr Thr Ile Phe Ser Ala Thr Ala Leu Gly Phe Leu Ala His 485 490 495Lys Arg Gly Leu Leu Val Arg Val 50010043DNAHomo sapiens 100aaacatcaga ctctgggcct ctattgacaa accgtatcca ctt 43101142DNAHomo sapiens 101aaagaaaacg gagcagcggg caccagggag gcctggaacg gggcgagcgc catgagcaac 60aaatgcgacg tggtcgtggt ggggggcggc atctcagaaa catcagactc tgggcctcta 120ttgacaaacc gtatccactt gt 14210223DNAArtificial Sequenceforward primer used for specifically detecting the variant of the present invention 102tggccaaaac acctcctaat atg 2310326DNAArtificial Sequencereverse primer used for specifically detecting the variant of the present invention 103atacctggag tctatcaatg tccaaa 26104130DNAArtificial Sequencepartial polynucleotide specific to the variant of the present invention, which is obtained by PCR using forward primer (SEQ ID NO102) and reverse primer (SEQ ID NO103) 104tggccaaaac acctcctaat atgtagttca gtagagatgc cctggagaat gctgcactgt 60tccacatgcc tgcaaatatt cctctaatga tacagtccca aggctttgga cattgataga 120ctccaggtat 13010524DNAArtificial Sequenceforward primer used for specifically detecting the variant of the present invention 105ggcatctcag aaacatcaga ctct 2410622DNAArtificial Sequencereverse primer used for specifically detecting the variant of the present invention 106gcttccagaa caaccacatt ca 22107111DNAArtificial Sequencepartial polynucleotide specific to the variant of the present invention, which is obtained by PCR using forward primer (SEQ ID NO105) and reverse primer (SEQ ID NO106) 107ggcatctcag aaacatcaga ctctgggcct ctattgacaa accgtatcca cttgtatggc 60agcagccaaa cttctgcatg actctggact gaatgtggtt gttctggaag c 11110818DNAArtificial Sequenceforward primer used for specifically detecting the known variant of the gene of the present invention

108gagcgccatg agcaacaa 1810922DNAArtificial Sequencereverse primer used for specifically detecting the known variant of the gene of the present invention 109tccagagtca tgcagaagtt tg 2211088DNAArtificial Sequencepartial polynucleotide specific to the known variant of the gene of the present invention, which is obtained by PCR using forward primer (SEQ ID NO108) and reverse primer (SEQ ID NO109) 110gagcgccatg agcaacaaat gcgacgtggt cgtggtgggg ggcggcatct caggtatggc 60agcagccaaa cttctgcatg actctgga 8811123DNAArtificial Sequenceforward primer used for detecting the variants of the gene of the present invention 111gctctctggt tcctgtggta tgt 2311220DNAArtificial Sequencereverse primer used for detecting the variants of the gene of the present invention 112gatccgccca caaatttcct 2011398DNAArtificial Sequencepartial polynucleotide common to the variants of the gene of the present invention, which is obtained by PCR using forward primer (SEQ ID NO111) and reverse primer (SEQ ID NO112) 113gctctctggt tcctgtggta tgtgaagcag tgtggaggca caacaagaat catctcgaca 60acaaatggag gacaggagag gaaatttgtg ggcggatc 98114162DNAHomo sapiens 114aagccagttt gaacttcagg aggaggaact atttggccaa aacacctcct aatatgtagt 60tcagtagaga tgccctggag aatgctgcac tgttccacat gcctgcaaat attcctctaa 120tgatacagtc ccaaggcttt ggacattgat agactccagg ta 162115198DNAHomo sapiens 115aaagaaaacg gagcagcggg caccagggag gcctggaacg gggcgagcgc catgagcaac 60aaatgcgacg tggtcgtggt ggggggcggc atctcagaaa catcagactc tgggcctcta 120ttgacaaacc gtatccactt gtatggcagc agccaaactt ctgcatgact ctggactgaa 180tgtggttgtt ctggaagc 1981161422DNAHomo sapiens 116ctgcgcggcg cggcgggcga tccgagccgg gacgggctgc aggcgggggt gctgcagagg 60acacgaggcg gcgggctgga gacatggacc gcggcgagca aggtctgctg agaacagacc 120cagtccctga ggaaggagaa gatgttgctg ccacgatcag tgccacagag accctctcgg 180aagaggagca ggaagagcta agaagagaac ttgcaaaggt agaagaagaa atccagactc 240tgtctcaagt gttagcagca aaagagaagc atctagcaga gatcaagcgg aaacttggaa 300tcaattctct acaggaacta aaacagaaca ttgccaaagg gtggcaagac gtgacagcaa 360catctgctta caagaagaca tctgaaacct tatcccaggc tggacagaag gcctcagctg 420ctttttcgtc tgttggctca gtcatcacca aaaagctgga agatgtaaaa ttgcaagcct 480tttcacattc ctttagtata cgttccattc agcattcaat tagcatgcct gctatgagaa 540actccccaac ttttaaatca tttgaagaaa aggtcgaaaa cttaaagtct aaagtagggg 600gaaccaagcc tgctggtggt gattttggag aagtcttgaa ttcggctgca aatgctagtg 660ccaccaccac ggagcctctt ccagaaaaga cacaggagag cctgtgagat tcctaccttt 720gttctgctac ccactgccag atgctgcaag cgaggtccaa gcacatcttg tcaacatgca 780ttgccatgaa tttctaccag atgtgctttt atttagcttt acatattcct ttgaccaaat 840agtttgtggg ttaaacaaaa tgaaaatatc ttcacctcta ttcttgggaa acacccttta 900gtgtacattt atgttccttt atttaggaaa caccattata aaaacactta tagtaaatgg 960ggacattcac tataatgatc taagaagcta cagattgtca tagttgtttt cctgctttac 1020aaaattgctc cagatctgga atgccagttt gacctttgtc ttctataata tttccttttt 1080ttcccctctt tgaatctctg tatatttgat tcttaactaa aattgttctc ttaaatattc 1140tgaatcctgg taattaaaag tttgggtgta ttttctttac ctccaaggaa agaactacta 1200gctacaaaaa atattttgga ataagcattg ttttggtata aggtacatat tttggttgaa 1260gacaccagac tgaagtaaac agctgtgcat ccaatttatt atagttttgt aagtaacaat 1320atgtaatcaa acttctaggt gacttgagag tggaacctcc tatatcatta tttagcaccg 1380tttgtgacag taaccatttc agtgtattgt ttattatacc ac 14221171422DNAHomo sapiensCDS(84)..(707) 117ctgcgcggcg cggcgggcga tccgagccgg gacgggctgc aggcgggggt gctgcagagg 60acacgaggcg gcgggctgga gac atg gac cgc ggc gag caa ggt ctg ctg aga 113 Met Asp Arg Gly Glu Gln Gly Leu Leu Arg 1 5 10aca gac cca gtc cct gag gaa gga gaa gat gtt gct gcc acg atc agt 161Thr Asp Pro Val Pro Glu Glu Gly Glu Asp Val Ala Ala Thr Ile Ser 15 20 25gcc aca gag acc ctc tcg gaa gag gag cag gaa gag cta aga aga gaa 209Ala Thr Glu Thr Leu Ser Glu Glu Glu Gln Glu Glu Leu Arg Arg Glu 30 35 40ctt gca aag gta gaa gaa gaa atc cag act ctg tct caa gtg tta gca 257Leu Ala Lys Val Glu Glu Glu Ile Gln Thr Leu Ser Gln Val Leu Ala 45 50 55gca aaa gag aag cat cta gca gag atc aag cgg aaa ctt gga atc aat 305Ala Lys Glu Lys His Leu Ala Glu Ile Lys Arg Lys Leu Gly Ile Asn 60 65 70tct cta cag gaa cta aaa cag aac att gcc aaa ggg tgg caa gac gtg 353Ser Leu Gln Glu Leu Lys Gln Asn Ile Ala Lys Gly Trp Gln Asp Val75 80 85 90aca gca aca tct gct tac aag aag aca tct gaa acc tta tcc cag gct 401Thr Ala Thr Ser Ala Tyr Lys Lys Thr Ser Glu Thr Leu Ser Gln Ala 95 100 105gga cag aag gcc tca gct gct ttt tcg tct gtt ggc tca gtc atc acc 449Gly Gln Lys Ala Ser Ala Ala Phe Ser Ser Val Gly Ser Val Ile Thr 110 115 120aaa aag ctg gaa gat gta aaa ttg caa gcc ttt tca cat tcc ttt agt 497Lys Lys Leu Glu Asp Val Lys Leu Gln Ala Phe Ser His Ser Phe Ser 125 130 135ata cgt tcc att cag cat tca att agc atg cct gct atg aga aac tcc 545Ile Arg Ser Ile Gln His Ser Ile Ser Met Pro Ala Met Arg Asn Ser 140 145 150cca act ttt aaa tca ttt gaa gaa aag gtc gaa aac tta aag tct aaa 593Pro Thr Phe Lys Ser Phe Glu Glu Lys Val Glu Asn Leu Lys Ser Lys155 160 165 170gta ggg gga acc aag cct gct ggt ggt gat ttt gga gaa gtc ttg aat 641Val Gly Gly Thr Lys Pro Ala Gly Gly Asp Phe Gly Glu Val Leu Asn 175 180 185tcg gct gca aat gct agt gcc acc acc acg gag cct ctt cca gaa aag 689Ser Ala Ala Asn Ala Ser Ala Thr Thr Thr Glu Pro Leu Pro Glu Lys 190 195 200aca cag gag agc ctg tga gattcctacc tttgttctgc tacccactgc 737Thr Gln Glu Ser Leu 205cagatgctgc aagcgaggtc caagcacatc ttgtcaacat gcattgccat gaatttctac 797cagatgtgct tttatttagc tttacatatt cctttgacca aatagtttgt gggttaaaca 857aaatgaaaat atcttcacct ctattcttgg gaaacaccct ttagtgtaca tttatgttcc 917tttatttagg aaacaccatt ataaaaacac ttatagtaaa tggggacatt cactataatg 977atctaagaag ctacagattg tcatagttgt tttcctgctt tacaaaattg ctccagatct 1037ggaatgccag tttgaccttt gtcttctata atatttcctt tttttcccct ctttgaatct 1097ctgtatattt gattcttaac taaaattgtt ctcttaaata ttctgaatcc tggtaattaa 1157aagtttgggt gtattttctt tacctccaag gaaagaacta ctagctacaa aaaatatttt 1217ggaataagca ttgttttggt ataaggtaca tattttggtt gaagacacca gactgaagta 1277aacagctgtg catccaattt attatagttt tgtaagtaac aatatgtaat caaacttcta 1337ggtgacttga gagtggaacc tcctatatca ttatttagca ccgtttgtga cagtaaccat 1397ttcagtgtat tgtttattat accac 1422118207PRTHomo sapiens 118Met Asp Arg Gly Glu Gln Gly Leu Leu Arg Thr Asp Pro Val Pro Glu1 5 10 15Glu Gly Glu Asp Val Ala Ala Thr Ile Ser Ala Thr Glu Thr Leu Ser 20 25 30Glu Glu Glu Gln Glu Glu Leu Arg Arg Glu Leu Ala Lys Val Glu Glu 35 40 45Glu Ile Gln Thr Leu Ser Gln Val Leu Ala Ala Lys Glu Lys His Leu 50 55 60Ala Glu Ile Lys Arg Lys Leu Gly Ile Asn Ser Leu Gln Glu Leu Lys65 70 75 80Gln Asn Ile Ala Lys Gly Trp Gln Asp Val Thr Ala Thr Ser Ala Tyr 85 90 95Lys Lys Thr Ser Glu Thr Leu Ser Gln Ala Gly Gln Lys Ala Ser Ala 100 105 110Ala Phe Ser Ser Val Gly Ser Val Ile Thr Lys Lys Leu Glu Asp Val 115 120 125Lys Leu Gln Ala Phe Ser His Ser Phe Ser Ile Arg Ser Ile Gln His 130 135 140Ser Ile Ser Met Pro Ala Met Arg Asn Ser Pro Thr Phe Lys Ser Phe145 150 155 160Glu Glu Lys Val Glu Asn Leu Lys Ser Lys Val Gly Gly Thr Lys Pro 165 170 175Ala Gly Gly Asp Phe Gly Glu Val Leu Asn Ser Ala Ala Asn Ala Ser 180 185 190Ala Thr Thr Thr Glu Pro Leu Pro Glu Lys Thr Gln Glu Ser Leu 195 200 20511969DNAHomo sapiens 119ttgcaagcct tttcacattc ctttagtata cgttccattc agcattcaat tagcatgcct 60gctatgaga 6912023PRTHomo sapiens 120Leu Gln Ala Phe Ser His Ser Phe Ser Ile Arg Ser Ile Gln His Ser1 5 10 15Ile Ser Met Pro Ala Met Arg 2012121DNAArtificial Sequenceforward primer used for specifically detecting the variant of the present invention 121tcagctgctt tttcgtctgt t 2112220DNAArtificial Sequencereverse primer used for specifically detecting the variant of the present invention 122ggaatgtgaa aaggcttgca 2012378DNAArtificial Sequencepartial polynucleotide specific to the variant of the present invention, which is obtained by PCR using forward primer (SEQ ID NO121) and reverse primer (SEQ ID NO122) 123tcagctgctt tttcgtctgt tggctcagtc atcaccaaaa agctggaaga tgtaaaattg 60caagcctttt cacattcc 7812420DNAArtificial Sequenceforward primer used for specifically detecting the known variant of the gene of the present invention 124cctcagctgc tttttcgtct 2012525DNAArtificial Sequencereverse primer used for specifically detecting the known variant of the gene of the present invention 125aaaagttggg gagtttttta catct 2512674DNAArtificial Sequencepartial polynucleotide specific to the known variant of the gene of the present invention, which is obtained by PCR using forward primer (SEQ ID NO124) and reverse primer (SEQ ID NO125) 126cctcagctgc tttttcgtct gttggctcag tcatcaccaa aaagctggaa gatgtaaaaa 60actccccaac tttt 7412725DNAArtificial Sequenceforward primer used for detecting the variants of the gene of the present invention 127gacagcaaca tctgcttaca agaag 2512822DNAArtificial Sequencereverse primer used for detecting the variants of the gene of the present invention 128atgactgagc caacagacga aa 2212993DNAArtificial Sequencepartial polynucleotide common to the variants of the gene of the present invention, which is obtained by PCR using forward primer (SEQ ID NO127) and reverse primer (SEQ ID NO128) 129gacagcaaca tctgcttaca agaagacatc tgaaacctta tcccaggctg gacagaaggc 60ctcagctgct ttttcgtctg ttggctcagt cat 93130491DNAHomo sapiens 130ctgcgcggcg cggcgggcga tccgagccgg gacgggctgc aggcgggggt gctgcagagg 60acacgaggcg gcgggctgga gacatggacc gcggcgagca aggtctgctg agaacagacc 120cagtccctga ggaaggagaa gatgttgctg ccacgatcag tgccacagag accctctcgg 180aagaggagca ggaagagcta agaagagaac ttgcaaaggt agaagaagaa atccagactc 240tgtctcaagt gttagcagca aaagagaagc atctagcaga gatcaagcgg aaacttggaa 300tcaattctct acaggaacta aaacagaaca ttgccaaagg gtggcaagac gtgacagcaa 360catctgctta caagaagaca tctgaaacct tatcccaggc tggacagaag gcctcagctg 420ctttttcgtc tgttggctca gtcatcacca aaaagctgga agatgtaaaa ttgcaagcct 480tttcacattc c 4911311926DNAHomo sapiens 131aaaagtcggc tcgagtactc cccgtaacga ggaggtgttc tcggccgtcc cacccttcac 60tgccgtctcc gggctgcgcc gccggagccg ggacgcgcct ccgcagccct cgccgcctcc 120atccccgcgg ccgcagctcc tctcgccgtc cgcgcgcaca ccatgacgaa gaacgagaag 180aagtccctca accagagcct ggccgagtgg aagctcttca tctacaaccc gaccaccgga 240gaattcctgg ggcgcaccgc caagagctgg ggatcgcagt atgtggtagg gaagtgatgc 300tgtctgaagg tgacatcctg ttctcctctc ttctgtcctc tccatcctta ttttggccac 360ctggtttgat cttgctcttc tacctagttt tttatgggtt cctggctgca ctcttctcat 420tcacgatgtg ggttatgctt cagactctca acgatgaggt tccaaaatac cgtgaccaga 480ttcctagccc aggactcatg gtttttccaa aaccagtgac cgcattggaa tatacattca 540gtaggtctga tccaacttcg tatgcagggt acattgaaga ccttaagaag tttctaaaac 600catatacttt agaagaacag aagaacctca cagtctgtcc tgatggagca ctttttgaac 660agaagggtcc agtttatgtt gcatgtcagt ttcctatttc attacttcaa gcatgcagtg 720gtatgaatga tcctgatttt ggctattctc aaggaaaccc ttgtattctt gtgaaaatga 780acagaataat tggattaaag cctgaaggag tgccaaggat agattgtgtt tcaaagacaa 840ataatgtaaa agatggaatg aagatatacc aaatgtagca gtttatcctc ataatggaat 900gatagactta aaatatttcc catattatgg gaaaaaactg catgttgggt atctacagcc 960attggttgct gttcaggtca gctttgctcc taacaacact gggaaagaag taacagttga 1020gtgcaagatt gatggatcag ccaacctaaa aagtcaggat gatcgtgaca agtttttggg 1080acgagttatg ttcaaaatca cagcacgtgc atagtatgag taggatatct ccacagagta 1140aatgttgtgt tgtctgtctt cattttgtaa cagctggacc ttccattcta gaattatgag 1200accaccttgg agaaaggtgt gtggtacatg acattgggtt acatcataac gtgcttccag 1260atcatagtgt tcagtgtcct ctgaagtaac tgcctgttgc ctctgctgcc ctttgaacca 1320gtgtacagtc gccagatagg gaccggtgaa cacctgattc caaacatgta ggatgggggt 1380cttgtcctct ttttatgtgg tttaattgcc aagtgtctaa agcttaatat gccgtgctat 1440gtaaatattt tatggatata acaactgtca tattttgatg tcaacagagt tttagggata 1500aaatggtacc cggccaacat caagtgactt tatagctgca agaaatgtgg tatgtggaga 1560agttctgtat gtgaggaagg aaaaaaagaa aataaaagtg tgtttgaaaa atattatctt 1620gggttctttg taaaatttat tttttacatg ctgaattagc ctcgatcttt ttgattaaga 1680gcacaaactt ttttttgtaa aacatgtaaa aaaaaaactg ggattaattt ttagtgttgg 1740aactgcctct tattttaggc tgtagataaa atagcatttt taggttagcc agtgtgacta 1800tgcacctaat tttttatgag attaaattca taagacttaa tttgtacaat agtttgtgaa 1860atatcttgtt actgctttta tttagcagac tgtggactgt aataaagtat ataaattgtg 1920aaatat 19261321926DNAHomo sapiensCDS(297)..(878) 132aaaagtcggc tcgagtactc cccgtaacga ggaggtgttc tcggccgtcc cacccttcac 60tgccgtctcc gggctgcgcc gccggagccg ggacgcgcct ccgcagccct cgccgcctcc 120atccccgcgg ccgcagctcc tctcgccgtc cgcgcgcaca ccatgacgaa gaacgagaag 180aagtccctca accagagcct ggccgagtgg aagctcttca tctacaaccc gaccaccgga 240gaattcctgg ggcgcaccgc caagagctgg ggatcgcagt atgtggtagg gaagtg atg 299 Met 1ctg tct gaa ggt gac atc ctg ttc tcc tct ctt ctg tcc tct cca tcc 347Leu Ser Glu Gly Asp Ile Leu Phe Ser Ser Leu Leu Ser Ser Pro Ser 5 10 15tta ttt tgg cca cct ggt ttg atc ttg ctc ttc tac cta gtt ttt tat 395Leu Phe Trp Pro Pro Gly Leu Ile Leu Leu Phe Tyr Leu Val Phe Tyr 20 25 30ggg ttc ctg gct gca ctc ttc tca ttc acg atg tgg gtt atg ctt cag 443Gly Phe Leu Ala Ala Leu Phe Ser Phe Thr Met Trp Val Met Leu Gln 35 40 45act ctc aac gat gag gtt cca aaa tac cgt gac cag att cct agc cca 491Thr Leu Asn Asp Glu Val Pro Lys Tyr Arg Asp Gln Ile Pro Ser Pro50 55 60 65gga ctc atg gtt ttt cca aaa cca gtg acc gca ttg gaa tat aca ttc 539Gly Leu Met Val Phe Pro Lys Pro Val Thr Ala Leu Glu Tyr Thr Phe 70 75 80agt agg tct gat cca act tcg tat gca ggg tac att gaa gac ctt aag 587Ser Arg Ser Asp Pro Thr Ser Tyr Ala Gly Tyr Ile Glu Asp Leu Lys 85 90 95aag ttt cta aaa cca tat act tta gaa gaa cag aag aac ctc aca gtc 635Lys Phe Leu Lys Pro Tyr Thr Leu Glu Glu Gln Lys Asn Leu Thr Val 100 105 110tgt cct gat gga gca ctt ttt gaa cag aag ggt cca gtt tat gtt gca 683Cys Pro Asp Gly Ala Leu Phe Glu Gln Lys Gly Pro Val Tyr Val Ala 115 120 125tgt cag ttt cct att tca tta ctt caa gca tgc agt ggt atg aat gat 731Cys Gln Phe Pro Ile Ser Leu Leu Gln Ala Cys Ser Gly Met Asn Asp130 135 140 145cct gat ttt ggc tat tct caa gga aac cct tgt att ctt gtg aaa atg 779Pro Asp Phe Gly Tyr Ser Gln Gly Asn Pro Cys Ile Leu Val Lys Met 150 155 160aac aga ata att gga tta aag cct gaa gga gtg cca agg ata gat tgt 827Asn Arg Ile Ile Gly Leu Lys Pro Glu Gly Val Pro Arg Ile Asp Cys 165 170 175gtt tca aag aca aat aat gta aaa gat gga atg aag ata tac caa atg 875Val Ser Lys Thr Asn Asn Val Lys Asp Gly Met Lys Ile Tyr Gln Met 180 185 190tag cagtttatcc tcataatgga atgatagact taaaatattt cccatattat 928gggaaaaaac tgcatgttgg gtatctacag ccattggttg ctgttcaggt cagctttgct 988cctaacaaca ctgggaaaga agtaacagtt gagtgcaaga ttgatggatc agccaaccta 1048aaaagtcagg atgatcgtga caagtttttg ggacgagtta tgttcaaaat cacagcacgt 1108gcatagtatg agtaggatat ctccacagag taaatgttgt gttgtctgtc ttcattttgt 1168aacagctgga ccttccattc tagaattatg agaccacctt ggagaaaggt gtgtggtaca 1228tgacattggg ttacatcata acgtgcttcc agatcatagt gttcagtgtc ctctgaagta 1288actgcctgtt gcctctgctg ccctttgaac cagtgtacag tcgccagata gggaccggtg 1348aacacctgat tccaaacatg taggatgggg gtcttgtcct

ctttttatgt ggtttaattg 1408ccaagtgtct aaagcttaat atgccgtgct atgtaaatat tttatggata taacaactgt 1468catattttga tgtcaacaga gttttaggga taaaatggta cccggccaac atcaagtgac 1528tttatagctg caagaaatgt ggtatgtgga gaagttctgt atgtgaggaa ggaaaaaaag 1588aaaataaaag tgtgtttgaa aaatattatc ttgggttctt tgtaaaattt attttttaca 1648tgctgaatta gcctcgatct ttttgattaa gagcacaaac ttttttttgt aaaacatgta 1708aaaaaaaaac tgggattaat ttttagtgtt ggaactgcct cttattttag gctgtagata 1768aaatagcatt tttaggttag ccagtgtgac tatgcaccta attttttatg agattaaatt 1828cataagactt aatttgtaca atagtttgtg aaatatcttg ttactgcttt tatttagcag 1888actgtggact gtaataaagt atataaattg tgaaatat 1926133193PRTHomo sapiens 133Met Leu Ser Glu Gly Asp Ile Leu Phe Ser Ser Leu Leu Ser Ser Pro1 5 10 15Ser Leu Phe Trp Pro Pro Gly Leu Ile Leu Leu Phe Tyr Leu Val Phe 20 25 30Tyr Gly Phe Leu Ala Ala Leu Phe Ser Phe Thr Met Trp Val Met Leu 35 40 45Gln Thr Leu Asn Asp Glu Val Pro Lys Tyr Arg Asp Gln Ile Pro Ser 50 55 60Pro Gly Leu Met Val Phe Pro Lys Pro Val Thr Ala Leu Glu Tyr Thr65 70 75 80Phe Ser Arg Ser Asp Pro Thr Ser Tyr Ala Gly Tyr Ile Glu Asp Leu 85 90 95Lys Lys Phe Leu Lys Pro Tyr Thr Leu Glu Glu Gln Lys Asn Leu Thr 100 105 110Val Cys Pro Asp Gly Ala Leu Phe Glu Gln Lys Gly Pro Val Tyr Val 115 120 125Ala Cys Gln Phe Pro Ile Ser Leu Leu Gln Ala Cys Ser Gly Met Asn 130 135 140Asp Pro Asp Phe Gly Tyr Ser Gln Gly Asn Pro Cys Ile Leu Val Lys145 150 155 160Met Asn Arg Ile Ile Gly Leu Lys Pro Glu Gly Val Pro Arg Ile Asp 165 170 175Cys Val Ser Lys Thr Asn Asn Val Lys Asp Gly Met Lys Ile Tyr Gln 180 185 190Met13492DNAHomo sapiens 134gatcgcagta tgtggtaggg aagtgatgct gtctgaaggt gacatcctgt tctcctctct 60tctgtcctct ccatccttat tttggccacc tg 9213523PRTHomo sapiens 135Met Leu Ser Glu Gly Asp Ile Leu Phe Ser Ser Leu Leu Ser Ser Pro1 5 10 15Ser Leu Phe Trp Pro Pro Gly 2013667DNAHomo sapiens 136atgctgtctg aaggtgacat cctgttctcc tctcttctgt cctctccatc cttattttgg 60ccacctg 6713720DNAHomo sapiens 137acaaataatg taaaagatgg 2013813PRTHomo sapiens 138Thr Asn Asn Val Lys Asp Gly Met Lys Ile Tyr Gln Met1 5 101397PRTHomo sapiens 139Thr Asn Asn Val Lys Asp Gly1 514022DNAArtificial Sequenceforward primer used for specifically detecting the variant of the present invention 140cgcagtatgt ggtagggaag tg 2214128DNAArtificial Sequencereverse primer used for specifically detecting the variant of the present invention 141aaactaggta gaagagcaag atcaaacc 28142116DNAArtificial Sequencepartial polynucleotide specific to the variant of the present invention, which is obtained by PCR using forward primer (SEQ ID NO141) and reverse primer (SEQ ID NO142) 142cgcagtatgt ggtagggaag tgatgctgtc tgaaggtgac atcctgttct cctctcttct 60gtcctctcca tccttatttt ggccacctgg tttgatcttg ctcttctacc tagttt 11614322DNAArtificial Sequenceforward primer used for specifically detecting the known variant of the gene of the present invention 143ccgagtggaa gctcttcatc ta 2214421DNAArtificial Sequencereverse primer used for specifically detecting the known variant of the gene of the present invention 144aagatcaaac cccagctctt g 2114579DNAArtificial Sequencepartial polynucleotide specific to the known variant of the gene of the present invention, which is obtained by PCR using forward primer (SEQ ID NO143) and reverse primer (SEQ ID NO144) 145ccgagtggaa gctcttcatc tacaacccga ccaccggaga attcctgggg cgcaccgcca 60agagctgggg tttgatctt 7914623DNAArtificial Sequenceforward primer used for detecting the variants of the gene of the present invention 146tcgtatgcag ggtacattga aga 2314723DNAArtificial Sequencereverse primer used for detecting the variants of the gene of the present invention 147tgcaacataa actggaccct tct 23148126DNAArtificial Sequencepartial polynucleotide common to the variants of the gene of the present invention, which is obtained by PCR using forward primer (SEQ ID NO146) and reverse primer (SEQ ID NO147) 148tcgtatgcag ggtacattga agaccttaag aagtttctaa aaccatatac tttagaagaa 60cagaagaacc tcacagtctg tcctgatgga gcactttttg aacagaaggg tccagtttat 120gttgca 126149390DNAHomo sapiens 149aaaagtcggc tcgagtactc cccgtaacga ggaggtgttc tcggccgtcc cacccttcac 60tgccgtctcc gggctgcgcc gccggagccg ggacgcgcct ccgcagccct cgccgcctcc 120atccccgcgg ccgcagctcc tctcgccgtc cgcgcgcaca ccatgacgaa gaacgagaag 180aagtccctca accagagcct ggccgagtgg aagctcttca tctacaaccc gaccaccgga 240gaattcctgg ggcgcaccgc caagagctgg ggatcgcagt atgtggtagg gaagtgatgc 300tgtctgaagg tgacatcctg ttctcctctc ttctgtcctc tccatcctta ttttggccac 360ctggtttgat cttgctcttc tacctagttt 3901501120DNAHomo sapiens 150tattcggcgc ggaggggcgg cggccgggga ggcggcggcg gcggcaggat tcccaggagc 60catgttgtca gaagtcctac tggtgtctgc tccggggaaa gtcatccttc atggagaaca 120tgccgtggta catggcaagg tagcactggc tgtatccttg aacttgagaa cattcctccg 180gcttcaaccc cacagcaatg ggaaagtgga cctcagctta cccaacattg gtatcaagcg 240ggcctgggat gtggccaggc ttcagtcact ggacacaagc tttctgggtg gaccaaggag 300gatttggagc taattaacaa gtgggccttc caaggggaga gaatgattca cgggaacccc 360tccggagtgg acaatgctgt cagcacctgg ggaggagccc tccgatacca tcaagggaag 420atttcatcct taaagaggtc gccagctctc cagatcctgc tgaccaacac caaagtccct 480cgcaatacca gggcccttgt ggctggcgtc agaaacaggc tgctcaagtt cccagagatc 540gtggcccccc tcctgacctc aatagatgcc atctccctgg agtgtgagcg cgtgctggga 600gagatggggg aagccccagc cccggagcag tacctcgtgc tggaagagct cattgacatg 660aaccagcacc atctgaatgc cctcggcgtg ggccacgcct ctctggacca gctctgccag 720gtgaccaggg cccgcggact tcacagcaag ctgactggcg caggcggtgg tggctgtggc 780atcacactcc tcaagccagg gctggagcag ccagaagtgg aggccacgaa gcaggccctg 840accagctgtg gctttgactg cttggaaacc agcatcggtg cccccggcgt ctccatccac 900tcagccacct ccctggacag ccgagtccag caagccctgg atggcctctg agaggagccc 960acgacactgc agccccaccc agatgcccct ttctggatta ttctgggggc tgcagttcga 1020ctctgtgctg gccagcgagc gcccagctcc tgacactgct ggagaggccc cagccgcttg 1080gcgatgccag ccaagctctg cagtcccagc ggtgggacct 11201511120DNAHomo sapiensCDS(343)..(951) 151tattcggcgc ggaggggcgg cggccgggga ggcggcggcg gcggcaggat tcccaggagc 60catgttgtca gaagtcctac tggtgtctgc tccggggaaa gtcatccttc atggagaaca 120tgccgtggta catggcaagg tagcactggc tgtatccttg aacttgagaa cattcctccg 180gcttcaaccc cacagcaatg ggaaagtgga cctcagctta cccaacattg gtatcaagcg 240ggcctgggat gtggccaggc ttcagtcact ggacacaagc tttctgggtg gaccaaggag 300gatttggagc taattaacaa gtgggccttc caaggggaga ga atg att cac ggg 354 Met Ile His Gly 1aac ccc tcc gga gtg gac aat gct gtc agc acc tgg gga gga gcc ctc 402Asn Pro Ser Gly Val Asp Asn Ala Val Ser Thr Trp Gly Gly Ala Leu5 10 15 20cga tac cat caa ggg aag att tca tcc tta aag agg tcg cca gct ctc 450Arg Tyr His Gln Gly Lys Ile Ser Ser Leu Lys Arg Ser Pro Ala Leu 25 30 35cag atc ctg ctg acc aac acc aaa gtc cct cgc aat acc agg gcc ctt 498Gln Ile Leu Leu Thr Asn Thr Lys Val Pro Arg Asn Thr Arg Ala Leu 40 45 50gtg gct ggc gtc aga aac agg ctg ctc aag ttc cca gag atc gtg gcc 546Val Ala Gly Val Arg Asn Arg Leu Leu Lys Phe Pro Glu Ile Val Ala 55 60 65ccc ctc ctg acc tca ata gat gcc atc tcc ctg gag tgt gag cgc gtg 594Pro Leu Leu Thr Ser Ile Asp Ala Ile Ser Leu Glu Cys Glu Arg Val 70 75 80ctg gga gag atg ggg gaa gcc cca gcc ccg gag cag tac ctc gtg ctg 642Leu Gly Glu Met Gly Glu Ala Pro Ala Pro Glu Gln Tyr Leu Val Leu85 90 95 100gaa gag ctc att gac atg aac cag cac cat ctg aat gcc ctc ggc gtg 690Glu Glu Leu Ile Asp Met Asn Gln His His Leu Asn Ala Leu Gly Val 105 110 115ggc cac gcc tct ctg gac cag ctc tgc cag gtg acc agg gcc cgc gga 738Gly His Ala Ser Leu Asp Gln Leu Cys Gln Val Thr Arg Ala Arg Gly 120 125 130ctt cac agc aag ctg act ggc gca ggc ggt ggt ggc tgt ggc atc aca 786Leu His Ser Lys Leu Thr Gly Ala Gly Gly Gly Gly Cys Gly Ile Thr 135 140 145ctc ctc aag cca ggg ctg gag cag cca gaa gtg gag gcc acg aag cag 834Leu Leu Lys Pro Gly Leu Glu Gln Pro Glu Val Glu Ala Thr Lys Gln 150 155 160gcc ctg acc agc tgt ggc ttt gac tgc ttg gaa acc agc atc ggt gcc 882Ala Leu Thr Ser Cys Gly Phe Asp Cys Leu Glu Thr Ser Ile Gly Ala165 170 175 180ccc ggc gtc tcc atc cac tca gcc acc tcc ctg gac agc cga gtc cag 930Pro Gly Val Ser Ile His Ser Ala Thr Ser Leu Asp Ser Arg Val Gln 185 190 195caa gcc ctg gat ggc ctc tga gaggagccca cgacactgca gccccaccca 981Gln Ala Leu Asp Gly Leu 200gatgcccctt tctggattat tctgggggct gcagttcgac tctgtgctgg ccagcgagcg 1041cccagctcct gacactgctg gagaggcccc agccgcttgg cgatgccagc caagctctgc 1101agtcccagcg gtgggacct 1120152202PRTHomo sapiens 152Met Ile His Gly Asn Pro Ser Gly Val Asp Asn Ala Val Ser Thr Trp1 5 10 15Gly Gly Ala Leu Arg Tyr His Gln Gly Lys Ile Ser Ser Leu Lys Arg 20 25 30Ser Pro Ala Leu Gln Ile Leu Leu Thr Asn Thr Lys Val Pro Arg Asn 35 40 45Thr Arg Ala Leu Val Ala Gly Val Arg Asn Arg Leu Leu Lys Phe Pro 50 55 60Glu Ile Val Ala Pro Leu Leu Thr Ser Ile Asp Ala Ile Ser Leu Glu65 70 75 80Cys Glu Arg Val Leu Gly Glu Met Gly Glu Ala Pro Ala Pro Glu Gln 85 90 95Tyr Leu Val Leu Glu Glu Leu Ile Asp Met Asn Gln His His Leu Asn 100 105 110Ala Leu Gly Val Gly His Ala Ser Leu Asp Gln Leu Cys Gln Val Thr 115 120 125Arg Ala Arg Gly Leu His Ser Lys Leu Thr Gly Ala Gly Gly Gly Gly 130 135 140Cys Gly Ile Thr Leu Leu Lys Pro Gly Leu Glu Gln Pro Glu Val Glu145 150 155 160Ala Thr Lys Gln Ala Leu Thr Ser Cys Gly Phe Asp Cys Leu Glu Thr 165 170 175Ser Ile Gly Ala Pro Gly Val Ser Ile His Ser Ala Thr Ser Leu Asp 180 185 190Ser Arg Val Gln Gln Ala Leu Asp Gly Leu 195 20015342DNAHomo sapiens 153cactggacac aagctttctg ggtggaccaa ggaggatttg ga 42154342DNAHomo sapiens 154tattcggcgc ggaggggcgg cggccgggga ggcggcggcg gcggcaggat tcccaggagc 60catgttgtca gaagtcctac tggtgtctgc tccggggaaa gtcatccttc atggagaaca 120tgccgtggta catggcaagg tagcactggc tgtatccttg aacttgagaa cattcctccg 180gcttcaaccc cacagcaatg ggaaagtgga cctcagctta cccaacattg gtatcaagcg 240ggcctgggat gtggccaggc ttcagtcact ggacacaagc tttctgggtg gaccaaggag 300gatttggagc taattaacaa gtgggccttc caaggggaga ga 342155301DNAHomo sapiens 155agcaaggtga tgtcacaaca cccacctcag agcaagtgga gaagctaaag gaggttgcag 60gcttgcctga cgactgtgct gtcaccgagc gcctggctgt gctggccttt ctttacttat 120acctgtccat ctgccggaag cagagggccc tgccgagcct ggatatcgta gtgtggtcgg 180agctgccccc cggggcgggc ttgggctcca gcgccgccta ctcggtgtgt ctggcagcag 240ccctcctgac tgtgtgcgag gagatcccaa acccgctgaa ggacggggat tgcgtcaaca 300g 301156101PRTHomo sapiens 156Glu Gln Gly Asp Val Thr Thr Pro Thr Ser Glu Gln Val Glu Lys Leu1 5 10 15Lys Glu Val Ala Gly Leu Pro Asp Asp Cys Ala Val Thr Glu Arg Leu 20 25 30Ala Val Leu Ala Phe Leu Tyr Leu Tyr Leu Ser Ile Cys Arg Lys Gln 35 40 45Arg Ala Leu Pro Ser Leu Asp Ile Val Val Trp Ser Glu Leu Pro Pro 50 55 60Gly Ala Gly Leu Gly Ser Ser Ala Ala Tyr Ser Val Cys Leu Ala Ala65 70 75 80Ala Leu Leu Thr Val Cys Glu Glu Ile Pro Asn Pro Leu Lys Asp Gly 85 90 95Asp Cys Val Asn Arg 1001571130DNAHomo sapiens 157ggaggggcgg cggccgggga ggcggcggcg gcggcaggat tcccaggagc catgttgtca 60gaagtcctac tggtgtctgc tccggggaaa gtcatccttc atggagaaca tgccgtggta 120catggcaagg tagcactggc tgtatccttg aacttgagaa cattcctccg gcttcaaccc 180cacagcaatg ggaaagtgga cctcagctta cccaacattg gtatcaagcg ggcctgggat 240gtggccaggc ttcagtcact ggacacaagc tttctggagc aaggtgatgt cacaacaccc 300acctcagagc aagtggagaa gctaaaggag gttgcaggct tgcctgacga ctgtgctgtc 360accgagcgcc tggctgtgct ggcctttctt tacttatacc tgtccatctg ccggaagcag 420aggtggacca aggaggattt ggagctaatt aacaagtggg ccttccaagg ggagagaatg 480attcacggga acccctccgg agtggacaat gctgtcagca cctggggagg agccctccga 540taccatcaag ggaagatttc atccttaaag aggtcgccag ctctcctgat cctgctgacc 600aacaccaaag tccctcgcaa taccagggcc cttgtggctg gcgtcagaaa caggctgctc 660aagttcccag agatcgtggc ccccctcctg acctcaatag atgccatctc cctggagtgt 720gagcgcgtgc tgggagagat gggggaagcc ccagccccgg agcagtacct cgtgctggaa 780gagctcatcg acatgaacca gcaccatctg aatgccctcg gcgtgggcca cgcctctctg 840gaccagctct gccaggtgac cagggcccgc ggacttcaca gcaagctgac tggcgcaggc 900ggtggtggct gtggcatcac actcctcaag ccagggctgg agcagccaga agtggaggcc 960acgaagcagg ccctgaccag ctgtggcttt gactgcttgg aaaccagcat cggtgccccc 1020ggcgtctcca tccactcagc cacctccctg gacagccgag tccagcaagc cctggatggc 1080ctctgagagg agcccacgac actgcagccc cacccagatg cccctttctg 11301581130DNAHomo sapiensCDS(52)..(1086) 158ggaggggcgg cggccgggga ggcggcggcg gcggcaggat tcccaggagc c atg ttg 57 Met Leu 1tca gaa gtc cta ctg gtg tct gct ccg ggg aaa gtc atc ctt cat gga 105Ser Glu Val Leu Leu Val Ser Ala Pro Gly Lys Val Ile Leu His Gly 5 10 15gaa cat gcc gtg gta cat ggc aag gta gca ctg gct gta tcc ttg aac 153Glu His Ala Val Val His Gly Lys Val Ala Leu Ala Val Ser Leu Asn 20 25 30ttg aga aca ttc ctc cgg ctt caa ccc cac agc aat ggg aaa gtg gac 201Leu Arg Thr Phe Leu Arg Leu Gln Pro His Ser Asn Gly Lys Val Asp35 40 45 50ctc agc tta ccc aac att ggt atc aag cgg gcc tgg gat gtg gcc agg 249Leu Ser Leu Pro Asn Ile Gly Ile Lys Arg Ala Trp Asp Val Ala Arg 55 60 65ctt cag tca ctg gac aca agc ttt ctg gag caa ggt gat gtc aca aca 297Leu Gln Ser Leu Asp Thr Ser Phe Leu Glu Gln Gly Asp Val Thr Thr 70 75 80ccc acc tca gag caa gtg gag aag cta aag gag gtt gca ggc ttg cct 345Pro Thr Ser Glu Gln Val Glu Lys Leu Lys Glu Val Ala Gly Leu Pro 85 90 95gac gac tgt gct gtc acc gag cgc ctg gct gtg ctg gcc ttt ctt tac 393Asp Asp Cys Ala Val Thr Glu Arg Leu Ala Val Leu Ala Phe Leu Tyr 100 105 110tta tac ctg tcc atc tgc cgg aag cag agg tgg acc aag gag gat ttg 441Leu Tyr Leu Ser Ile Cys Arg Lys Gln Arg Trp Thr Lys Glu Asp Leu115 120 125 130gag cta att aac aag tgg gcc ttc caa ggg gag aga atg att cac ggg 489Glu Leu Ile Asn Lys Trp Ala Phe Gln Gly Glu Arg Met Ile His Gly 135 140 145aac ccc tcc gga gtg gac aat gct gtc agc acc tgg gga gga gcc ctc 537Asn Pro Ser Gly Val Asp Asn Ala Val Ser Thr Trp Gly Gly Ala Leu 150 155 160cga tac cat caa ggg aag att tca tcc tta aag agg tcg cca gct ctc 585Arg Tyr His Gln Gly Lys Ile Ser Ser Leu Lys Arg Ser Pro Ala Leu 165 170 175ctg atc ctg ctg acc aac acc aaa gtc cct cgc aat acc agg gcc ctt 633Leu Ile Leu Leu Thr Asn Thr Lys Val Pro Arg Asn Thr Arg Ala Leu 180 185 190gtg gct ggc gtc aga aac agg ctg ctc aag ttc cca gag atc gtg gcc 681Val Ala Gly Val Arg Asn Arg Leu Leu Lys Phe Pro Glu Ile Val Ala195 200 205 210ccc ctc ctg acc tca ata gat gcc atc tcc ctg gag tgt gag cgc gtg 729Pro Leu Leu Thr Ser Ile Asp Ala Ile Ser Leu Glu Cys Glu Arg Val 215 220 225ctg gga gag atg ggg gaa gcc cca gcc ccg gag cag tac ctc gtg ctg 777Leu Gly Glu Met Gly

Glu Ala Pro Ala Pro Glu Gln Tyr Leu Val Leu 230 235 240gaa gag ctc atc gac atg aac cag cac cat ctg aat gcc ctc ggc gtg 825Glu Glu Leu Ile Asp Met Asn Gln His His Leu Asn Ala Leu Gly Val 245 250 255ggc cac gcc tct ctg gac cag ctc tgc cag gtg acc agg gcc cgc gga 873Gly His Ala Ser Leu Asp Gln Leu Cys Gln Val Thr Arg Ala Arg Gly 260 265 270ctt cac agc aag ctg act ggc gca ggc ggt ggt ggc tgt ggc atc aca 921Leu His Ser Lys Leu Thr Gly Ala Gly Gly Gly Gly Cys Gly Ile Thr275 280 285 290ctc ctc aag cca ggg ctg gag cag cca gaa gtg gag gcc acg aag cag 969Leu Leu Lys Pro Gly Leu Glu Gln Pro Glu Val Glu Ala Thr Lys Gln 295 300 305gcc ctg acc agc tgt ggc ttt gac tgc ttg gaa acc agc atc ggt gcc 1017Ala Leu Thr Ser Cys Gly Phe Asp Cys Leu Glu Thr Ser Ile Gly Ala 310 315 320ccc ggc gtc tcc atc cac tca gcc acc tcc ctg gac agc cga gtc cag 1065Pro Gly Val Ser Ile His Ser Ala Thr Ser Leu Asp Ser Arg Val Gln 325 330 335caa gcc ctg gat ggc ctc tga gaggagccca cgacactgca gccccaccca 1116Gln Ala Leu Asp Gly Leu 340gatgcccctt tctg 1130159344PRTHomo sapiens 159Met Leu Ser Glu Val Leu Leu Val Ser Ala Pro Gly Lys Val Ile Leu1 5 10 15His Gly Glu His Ala Val Val His Gly Lys Val Ala Leu Ala Val Ser 20 25 30Leu Asn Leu Arg Thr Phe Leu Arg Leu Gln Pro His Ser Asn Gly Lys 35 40 45Val Asp Leu Ser Leu Pro Asn Ile Gly Ile Lys Arg Ala Trp Asp Val 50 55 60Ala Arg Leu Gln Ser Leu Asp Thr Ser Phe Leu Glu Gln Gly Asp Val65 70 75 80Thr Thr Pro Thr Ser Glu Gln Val Glu Lys Leu Lys Glu Val Ala Gly 85 90 95Leu Pro Asp Asp Cys Ala Val Thr Glu Arg Leu Ala Val Leu Ala Phe 100 105 110Leu Tyr Leu Tyr Leu Ser Ile Cys Arg Lys Gln Arg Trp Thr Lys Glu 115 120 125Asp Leu Glu Leu Ile Asn Lys Trp Ala Phe Gln Gly Glu Arg Met Ile 130 135 140His Gly Asn Pro Ser Gly Val Asp Asn Ala Val Ser Thr Trp Gly Gly145 150 155 160Ala Leu Arg Tyr His Gln Gly Lys Ile Ser Ser Leu Lys Arg Ser Pro 165 170 175Ala Leu Leu Ile Leu Leu Thr Asn Thr Lys Val Pro Arg Asn Thr Arg 180 185 190Ala Leu Val Ala Gly Val Arg Asn Arg Leu Leu Lys Phe Pro Glu Ile 195 200 205Val Ala Pro Leu Leu Thr Ser Ile Asp Ala Ile Ser Leu Glu Cys Glu 210 215 220Arg Val Leu Gly Glu Met Gly Glu Ala Pro Ala Pro Glu Gln Tyr Leu225 230 235 240Val Leu Glu Glu Leu Ile Asp Met Asn Gln His His Leu Asn Ala Leu 245 250 255Gly Val Gly His Ala Ser Leu Asp Gln Leu Cys Gln Val Thr Arg Ala 260 265 270Arg Gly Leu His Ser Lys Leu Thr Gly Ala Gly Gly Gly Gly Cys Gly 275 280 285Ile Thr Leu Leu Lys Pro Gly Leu Glu Gln Pro Glu Val Glu Ala Thr 290 295 300Lys Gln Ala Leu Thr Ser Cys Gly Phe Asp Cys Leu Glu Thr Ser Ile305 310 315 320Gly Ala Pro Gly Val Ser Ile His Ser Ala Thr Ser Leu Asp Ser Arg 325 330 335Val Gln Gln Ala Leu Asp Gly Leu 34016042DNAHomo sapiens 160gtccatctgc cggaagcaga ggtggaccaa ggaggatttg ga 4216115PRTHomo sapiens 161Leu Ser Ile Cys Arg Lys Gln Arg Trp Thr Lys Glu Asp Leu Glu1 5 10 15162156DNAHomo sapiens 162ggccctgccg agcctggata tcgtagtgtg gtcggagctg ccccccgggg cgggcttggg 60ctccagcgcc gcctactcgg tgtgtctggc agcagccctc ctgactgtgt gcgaggagat 120cccaaacccg ctgaaggacg gggattgcgt caacag 15616353PRTHomo sapiens 163Arg Ala Leu Pro Ser Leu Asp Ile Val Val Trp Ser Glu Leu Pro Pro1 5 10 15Gly Ala Gly Leu Gly Ser Ser Ala Ala Tyr Ser Val Cys Leu Ala Ala 20 25 30Ala Leu Leu Thr Val Cys Glu Glu Ile Pro Asn Pro Leu Lys Asp Gly 35 40 45Asp Cys Val Asn Arg 5016419DNAArtificial Sequenceforward primer used for specifically detecting the variant of the present invention 164tggccaggct tcagtcact 1916523DNAArtificial Sequencereverse primer used for specifically detecting the variant of the present invention 165gcccacttgt taattagctc caa 2316675DNAArtificial Sequencepartial polynucleotide specific to the variant of the present invention, which is obtained by PCR using forward primer (SEQ ID NO164) and reverse primer (SEQ ID NO165) 166tggccaggct tcagtcactg gacacaagct ttctgggtgg accaaggagg atttggagct 60aattaacaag tgggc 7516729DNAArtificial Sequenceprobe used for detecting the variants of the present invention 167acacaagctt tctgggtgga ccaaggagg 2916819DNAArtificial Sequenceforward primer used for specifically detecting the variant of the present invention 168aggcttgcct gacgactgt 1916923DNAArtificial Sequencereverse primer used for specifically detecting the variant of the present invention 169ggaaggccca cttgttaatt agc 23170131DNAArtificial Sequencepartial polynucleotide specific to the variant of the present invention, which is obtained by PCR using forward primer (SEQ ID NO168) and reverse primer (SEQ ID NO169) 170aggcttgcct gacgactgtg ctgtcaccga gcgcctggct gtgctggcct ttctttactt 60atacctgtcc atctgccgga agcagaggtg gaccaaggag gatttggagc taattaacaa 120gtgggccttc c 13117127DNAArtificial Sequenceprobe used for detecting the variants of the present invention 171agcagaggtg gaccaaggag gatttgg 2717219DNAArtificial Sequenceforward primer used for specifically detecting the known variant of the gene of the present invention 172gatcccaaac ccgctgaag 1917322DNAArtificial Sequencereverse primer used for specifically detecting the known variant of the gene of the present invention 173gaatcattct ctccccttgg aa 22174101DNAArtificial Sequencepartial polynucleotide specific to the known variant of the gene of the present invention, which is obtained by PCR using forward primer (SEQ ID NO172) and reverse primer (SEQ ID NO173) 174gatcccaaac ccgctgaagg acggggattg cgtcaacagg tggaccaagg aggatttgga 60gctaattaac aagtgggcct tccaagggga gagaatgatt c 10117527DNAArtificial Sequenceprobe used for specifically detecting the known variant of the gene of the present invention 175cgtcaacagg tggaccaagg aggattt 2717619DNAArtificial Sequenceforward primer used for detecting the variants of the gene of the present invention 176cgccagctct ccagatcct 1917721DNAArtificial Sequencereverse primer used for detecting the variants of the gene of the present invention 177gggaacttga gcagcctgtt t 2117894DNAArtificial Sequencepartial polynucleotide common to the variants of the gene of the present invention, which is obtained by PCR using forward primer (SEQ ID NO176) and reverse primer (SEQ ID NO177) 178cgccagctct ccagatcctg ctgaccaaca ccaaagtccc tcgcaatacc agggcccttg 60tggctggcgt cagaaacagg ctgctcaagt tccc 9417925DNAArtificial Sequenceprobe used for detecting the variants of the gene of the present invention 179ctgaccaaca ccaaagtccc tcgca 25180326DNAHomo sapiens 180tattcggcgc ggaggggcgg cggccgggga ggcggcggcg gcggcaggat tcccaggagc 60catgttgtca gaagtcctac tggtgtctgc tccggggaaa gtcatccttc atggagaaca 120tgccgtggta catggcaagg tagcactggc tgtatccttg aacttgagaa cattcctccg 180gcttcaaccc cacagcaatg ggaaagtgga cctcagctta cccaacattg gtatcaagcg 240ggcctgggat gtggccaggc ttcagtcact ggacacaagc tttctgggtg gaccaaggag 300gatttggagc taattaacaa gtgggc 326181466DNAHomo sapiens 181ggaggggcgg cggccgggga ggcggcggcg gcggcaggat tcccaggagc catgttgtca 60gaagtcctac tggtgtctgc tccggggaaa gtcatccttc atggagaaca tgccgtggta 120catggcaagg tagcactggc tgtatccttg aacttgagaa cattcctccg gcttcaaccc 180cacagcaatg ggaaagtgga cctcagctta cccaacattg gtatcaagcg ggcctgggat 240gtggccaggc ttcagtcact ggacacaagc tttctggagc aaggtgatgt cacaacaccc 300acctcagagc aagtggagaa gctaaaggag gttgcaggct tgcctgacga ctgtgctgtc 360accgagcgcc tggctgtgct ggcctttctt tacttatacc tgtccatctg ccggaagcag 420aggtggacca aggaggattt ggagctaatt aacaagtggg ccttcc 4661821872DNAHomo sapiens 182acaacctgac ttttgtccac aatgcaaaga ctccaactgt tgagagtaga agtactgctg 60ggtgtgaaac aaggagatga aatgcggcat ttcttttttt cttctcagac ctccactctg 120gagaagagtc agaatggggg cgtcggggag gaggtcaccc cagctctgat ctttgccatc 180acagttgcta caatcggctc tttccagttt ggctacaaca ctggggtcat caatgctcct 240gagacgatca taaaggaatt tatcaataaa actttgacgg acaaggcaaa tgcccctccc 300tctgaggtgc tgctcacgaa tctctggtcc ttgtctgtgg ccatattttc cgtcgggggt 360atgatcggct ccttttccgt cggactcttt gttaaccgct ttggcaggcg caattcaatg 420ctgattgtca acctgttggc tgccactggt ggctgcctta tgggactgtg taaaatagct 480gagtcagttg aaatgctgat cctgggccgc ttggttattg gcctcttctg cggactctgc 540acaggttttg tgcccatgta cattggagag atctcgccta ctgccctgag gggtgccttt 600ggcactctca accagctggg catagttatt ggaattctgg tggcccagat ctttggtctg 660gaactcatcc ttgggtctaa agagctatgg ccggtgctat taggctttac catccttcca 720gctatcctgc aaagtgcagc ccttccatgt tgccctgaaa gtcccagatt tttgctcatt 780aacagaaaaa aagaggagaa tgctacgcgg atcctccagc ggttgtgggg cacccaggat 840gtatcccaag acatccagga gatgaaagat gagagtgcaa ggatgtcaca agaaaagcaa 900gtcaccgtgc tggagctctt tagagtgtcc agctaccgac agcccatcat catttccatt 960gtgctccagc tctctcagca gctctctggg atcaatgctg tgttctatta ctcaacagga 1020atcttcaagg atgcaggtgt tcaacagccc atctatgcca ccatcagcgc gggtgtggtt 1080aatactatct tcactttact ttctctattt ctggtggaaa gggcaggaag aaggactctg 1140catatgatag gccttggagg gatggctttt tgttccacgc tcatgactgt ttctttgtta 1200ttaaagaatc actataatgg gatgagcttt gtctgtattg gggctatctt ggtctttgtg 1260gcctgttttg aaattggacc aggccccatt ccctggttta ttgtggccga actcttcagc 1320cagggccccc gcccagctgc gatggcagtg gccggctgct ccaactggac ctccaacttc 1380ctagtcggat tgctcttccc ctctgctgct tactatttag gagcctacgt ttttattatc 1440ttcaccggct tcctcattac cttcctggcc tttaccttct tcaaagtccc tgagacccgt 1500ggcaggactt ttgaggatat cacacgggcc tttgaagggc aggcacacgg tgcagataga 1560tctggaaagg acggcgtcat ggggatgaac agcatcgagc ctgctaagga gaccaccacc 1620aatgtctaag tcgtgcctcc ttccacctcc ctcccggcat gggaaggcca cctctccctc 1680aacaagggag agacctcatc aggatgaacc caggacgctt ctgaatgctg ctacttgatt 1740tctttctcat cccacgcact ccatgagcac cccaaggctg cagtttgttg gatcttcaat 1800ggctttttaa attttatttc ctggacatcc tcttctgctt aggagagacc gagtgaacct 1860accttcattt ca 18721831872DNAHomo sapiensCDS(22)..(1629) 183acaacctgac ttttgtccac a atg caa aga ctc caa ctg ttg aga gta gaa 51 Met Gln Arg Leu Gln Leu Leu Arg Val Glu 1 5 10gta ctg ctg ggt gtg aaa caa gga gat gaa atg cgg cat ttc ttt ttt 99Val Leu Leu Gly Val Lys Gln Gly Asp Glu Met Arg His Phe Phe Phe 15 20 25tct tct cag acc tcc act ctg gag aag agt cag aat ggg ggc gtc ggg 147Ser Ser Gln Thr Ser Thr Leu Glu Lys Ser Gln Asn Gly Gly Val Gly 30 35 40gag gag gtc acc cca gct ctg atc ttt gcc atc aca gtt gct aca atc 195Glu Glu Val Thr Pro Ala Leu Ile Phe Ala Ile Thr Val Ala Thr Ile 45 50 55ggc tct ttc cag ttt ggc tac aac act ggg gtc atc aat gct cct gag 243Gly Ser Phe Gln Phe Gly Tyr Asn Thr Gly Val Ile Asn Ala Pro Glu 60 65 70acg atc ata aag gaa ttt atc aat aaa act ttg acg gac aag gca aat 291Thr Ile Ile Lys Glu Phe Ile Asn Lys Thr Leu Thr Asp Lys Ala Asn75 80 85 90gcc cct ccc tct gag gtg ctg ctc acg aat ctc tgg tcc ttg tct gtg 339Ala Pro Pro Ser Glu Val Leu Leu Thr Asn Leu Trp Ser Leu Ser Val 95 100 105gcc ata ttt tcc gtc ggg ggt atg atc ggc tcc ttt tcc gtc gga ctc 387Ala Ile Phe Ser Val Gly Gly Met Ile Gly Ser Phe Ser Val Gly Leu 110 115 120ttt gtt aac cgc ttt ggc agg cgc aat tca atg ctg att gtc aac ctg 435Phe Val Asn Arg Phe Gly Arg Arg Asn Ser Met Leu Ile Val Asn Leu 125 130 135ttg gct gcc act ggt ggc tgc ctt atg gga ctg tgt aaa ata gct gag 483Leu Ala Ala Thr Gly Gly Cys Leu Met Gly Leu Cys Lys Ile Ala Glu 140 145 150tca gtt gaa atg ctg atc ctg ggc cgc ttg gtt att ggc ctc ttc tgc 531Ser Val Glu Met Leu Ile Leu Gly Arg Leu Val Ile Gly Leu Phe Cys155 160 165 170gga ctc tgc aca ggt ttt gtg ccc atg tac att gga gag atc tcg cct 579Gly Leu Cys Thr Gly Phe Val Pro Met Tyr Ile Gly Glu Ile Ser Pro 175 180 185act gcc ctg agg ggt gcc ttt ggc act ctc aac cag ctg ggc ata gtt 627Thr Ala Leu Arg Gly Ala Phe Gly Thr Leu Asn Gln Leu Gly Ile Val 190 195 200att gga att ctg gtg gcc cag atc ttt ggt ctg gaa ctc atc ctt ggg 675Ile Gly Ile Leu Val Ala Gln Ile Phe Gly Leu Glu Leu Ile Leu Gly 205 210 215tct aaa gag cta tgg ccg gtg cta tta ggc ttt acc atc ctt cca gct 723Ser Lys Glu Leu Trp Pro Val Leu Leu Gly Phe Thr Ile Leu Pro Ala 220 225 230atc ctg caa agt gca gcc ctt cca tgt tgc cct gaa agt ccc aga ttt 771Ile Leu Gln Ser Ala Ala Leu Pro Cys Cys Pro Glu Ser Pro Arg Phe235 240 245 250ttg ctc att aac aga aaa aaa gag gag aat gct acg cgg atc ctc cag 819Leu Leu Ile Asn Arg Lys Lys Glu Glu Asn Ala Thr Arg Ile Leu Gln 255 260 265cgg ttg tgg ggc acc cag gat gta tcc caa gac atc cag gag atg aaa 867Arg Leu Trp Gly Thr Gln Asp Val Ser Gln Asp Ile Gln Glu Met Lys 270 275 280gat gag agt gca agg atg tca caa gaa aag caa gtc acc gtg ctg gag 915Asp Glu Ser Ala Arg Met Ser Gln Glu Lys Gln Val Thr Val Leu Glu 285 290 295ctc ttt aga gtg tcc agc tac cga cag ccc atc atc att tcc att gtg 963Leu Phe Arg Val Ser Ser Tyr Arg Gln Pro Ile Ile Ile Ser Ile Val 300 305 310ctc cag ctc tct cag cag ctc tct ggg atc aat gct gtg ttc tat tac 1011Leu Gln Leu Ser Gln Gln Leu Ser Gly Ile Asn Ala Val Phe Tyr Tyr315 320 325 330tca aca gga atc ttc aag gat gca ggt gtt caa cag ccc atc tat gcc 1059Ser Thr Gly Ile Phe Lys Asp Ala Gly Val Gln Gln Pro Ile Tyr Ala 335 340 345acc atc agc gcg ggt gtg gtt aat act atc ttc act tta ctt tct cta 1107Thr Ile Ser Ala Gly Val Val Asn Thr Ile Phe Thr Leu Leu Ser Leu 350 355 360ttt ctg gtg gaa agg gca gga aga agg act ctg cat atg ata ggc ctt 1155Phe Leu Val Glu Arg Ala Gly Arg Arg Thr Leu His Met Ile Gly Leu 365 370 375gga ggg atg gct ttt tgt tcc acg ctc atg act gtt tct ttg tta tta 1203Gly Gly Met Ala Phe Cys Ser Thr Leu Met Thr Val Ser Leu Leu Leu 380 385 390aag aat cac tat aat ggg atg agc ttt gtc tgt att ggg gct atc ttg 1251Lys Asn His Tyr Asn Gly Met Ser Phe Val Cys Ile Gly Ala Ile Leu395 400 405 410gtc ttt gtg gcc tgt ttt gaa att gga cca ggc ccc att ccc tgg ttt 1299Val Phe Val Ala Cys Phe Glu Ile Gly Pro Gly Pro Ile Pro Trp Phe 415 420 425att gtg gcc gaa ctc ttc agc cag ggc ccc cgc cca gct gcg atg gca 1347Ile Val Ala Glu Leu Phe Ser Gln Gly Pro Arg Pro Ala Ala Met Ala 430 435 440gtg gcc ggc tgc tcc aac tgg acc tcc aac ttc cta gtc gga ttg ctc 1395Val Ala Gly Cys Ser Asn Trp Thr Ser Asn Phe Leu Val Gly Leu Leu 445 450 455ttc ccc tct gct gct tac tat tta gga gcc tac gtt ttt att atc ttc 1443Phe Pro Ser Ala Ala Tyr Tyr Leu Gly Ala Tyr Val Phe Ile Ile Phe 460 465 470acc ggc ttc ctc att acc ttc ctg gcc ttt acc ttc ttc aaa gtc cct 1491Thr Gly Phe Leu Ile Thr Phe Leu Ala Phe Thr Phe Phe Lys Val Pro475 480 485 490gag acc cgt ggc agg act ttt gag gat atc aca cgg gcc ttt gaa ggg 1539Glu Thr Arg Gly Arg Thr Phe Glu Asp Ile

Thr Arg Ala Phe Glu Gly 495 500 505cag gca cac ggt gca gat aga tct gga aag gac ggc gtc atg ggg atg 1587Gln Ala His Gly Ala Asp Arg Ser Gly Lys Asp Gly Val Met Gly Met 510 515 520aac agc atc gag cct gct aag gag acc acc acc aat gtc taa 1629Asn Ser Ile Glu Pro Ala Lys Glu Thr Thr Thr Asn Val 525 530 535gtcgtgcctc cttccacctc cctcccggca tgggaaggcc acctctccct caacaaggga 1689gagacctcat caggatgaac ccaggacgct tctgaatgct gctacttgat ttctttctca 1749tcccacgcac tccatgagca ccccaaggct gcagtttgtt ggatcttcaa tggcttttta 1809aattttattt cctggacatc ctcttctgct taggagagac cgagtgaacc taccttcatt 1869tca 1872184535PRTHomo sapiens 184Met Gln Arg Leu Gln Leu Leu Arg Val Glu Val Leu Leu Gly Val Lys1 5 10 15Gln Gly Asp Glu Met Arg His Phe Phe Phe Ser Ser Gln Thr Ser Thr 20 25 30Leu Glu Lys Ser Gln Asn Gly Gly Val Gly Glu Glu Val Thr Pro Ala 35 40 45Leu Ile Phe Ala Ile Thr Val Ala Thr Ile Gly Ser Phe Gln Phe Gly 50 55 60Tyr Asn Thr Gly Val Ile Asn Ala Pro Glu Thr Ile Ile Lys Glu Phe65 70 75 80Ile Asn Lys Thr Leu Thr Asp Lys Ala Asn Ala Pro Pro Ser Glu Val 85 90 95Leu Leu Thr Asn Leu Trp Ser Leu Ser Val Ala Ile Phe Ser Val Gly 100 105 110Gly Met Ile Gly Ser Phe Ser Val Gly Leu Phe Val Asn Arg Phe Gly 115 120 125Arg Arg Asn Ser Met Leu Ile Val Asn Leu Leu Ala Ala Thr Gly Gly 130 135 140Cys Leu Met Gly Leu Cys Lys Ile Ala Glu Ser Val Glu Met Leu Ile145 150 155 160Leu Gly Arg Leu Val Ile Gly Leu Phe Cys Gly Leu Cys Thr Gly Phe 165 170 175Val Pro Met Tyr Ile Gly Glu Ile Ser Pro Thr Ala Leu Arg Gly Ala 180 185 190Phe Gly Thr Leu Asn Gln Leu Gly Ile Val Ile Gly Ile Leu Val Ala 195 200 205Gln Ile Phe Gly Leu Glu Leu Ile Leu Gly Ser Lys Glu Leu Trp Pro 210 215 220Val Leu Leu Gly Phe Thr Ile Leu Pro Ala Ile Leu Gln Ser Ala Ala225 230 235 240Leu Pro Cys Cys Pro Glu Ser Pro Arg Phe Leu Leu Ile Asn Arg Lys 245 250 255Lys Glu Glu Asn Ala Thr Arg Ile Leu Gln Arg Leu Trp Gly Thr Gln 260 265 270Asp Val Ser Gln Asp Ile Gln Glu Met Lys Asp Glu Ser Ala Arg Met 275 280 285Ser Gln Glu Lys Gln Val Thr Val Leu Glu Leu Phe Arg Val Ser Ser 290 295 300Tyr Arg Gln Pro Ile Ile Ile Ser Ile Val Leu Gln Leu Ser Gln Gln305 310 315 320Leu Ser Gly Ile Asn Ala Val Phe Tyr Tyr Ser Thr Gly Ile Phe Lys 325 330 335Asp Ala Gly Val Gln Gln Pro Ile Tyr Ala Thr Ile Ser Ala Gly Val 340 345 350Val Asn Thr Ile Phe Thr Leu Leu Ser Leu Phe Leu Val Glu Arg Ala 355 360 365Gly Arg Arg Thr Leu His Met Ile Gly Leu Gly Gly Met Ala Phe Cys 370 375 380Ser Thr Leu Met Thr Val Ser Leu Leu Leu Lys Asn His Tyr Asn Gly385 390 395 400Met Ser Phe Val Cys Ile Gly Ala Ile Leu Val Phe Val Ala Cys Phe 405 410 415Glu Ile Gly Pro Gly Pro Ile Pro Trp Phe Ile Val Ala Glu Leu Phe 420 425 430Ser Gln Gly Pro Arg Pro Ala Ala Met Ala Val Ala Gly Cys Ser Asn 435 440 445Trp Thr Ser Asn Phe Leu Val Gly Leu Leu Phe Pro Ser Ala Ala Tyr 450 455 460Tyr Leu Gly Ala Tyr Val Phe Ile Ile Phe Thr Gly Phe Leu Ile Thr465 470 475 480Phe Leu Ala Phe Thr Phe Phe Lys Val Pro Glu Thr Arg Gly Arg Thr 485 490 495Phe Glu Asp Ile Thr Arg Ala Phe Glu Gly Gln Ala His Gly Ala Asp 500 505 510Arg Ser Gly Lys Asp Gly Val Met Gly Met Asn Ser Ile Glu Pro Ala 515 520 525Lys Glu Thr Thr Thr Asn Val 530 535185153DNAHomo sapiens 185acaacctgac ttttgtccac aatgcaaaga ctccaactgt tgagagtaga agtactgctg 60ggtgtgaaac aaggagatga aatgcggcat ttcttttttt cttctcagac ctccactctg 120gagaagagtc agaatggggg cgtcggggag gag 15318644PRTHomo sapiens 186Met Gln Arg Leu Gln Leu Leu Arg Val Glu Val Leu Leu Gly Val Lys1 5 10 15Gln Gly Asp Glu Met Arg His Phe Phe Phe Ser Ser Gln Thr Ser Thr 20 25 30Leu Glu Lys Ser Gln Asn Gly Gly Val Gly Glu Glu 35 40187132DNAHomo sapiens 187atgcaaagac tccaactgtt gagagtagaa gtactgctgg gtgtgaaaca aggagatgaa 60atgcggcatt tctttttttc ttctcagacc tccactctgg agaagagtca gaatgggggc 120gtcggggagg ag 1321881872DNAHomo sapiens 188acaacctgac ttttgttcac aatgcaaaga ctccaactgt tgagagtaga agtactgctg 60ggtgtgaaac aaggagatga aatgcggcat ttcttttttt cttctcagac ctccactctg 120gagaagagtc agaatggggg cgtcggggag gaggtcaccc cagctctgat ctttgccatc 180acagttgcta caatcggctc tttccagttt ggctacaaca ctggggtcat caatgctcct 240gagacgatca taaaggaatt tatcaataaa actttgacgg acaaggcaaa tgcccctccc 300tctgaggtgc tgctcacgaa tctctggtcc ttgtctgtgg ccatattttc cgtcgggggt 360atgatcggct ccttttccgt cggactcttt gttaaccgct ttggcaggcg caattcaatg 420ctgattgtca acctgttggc tgccactggt ggctgcctta tgggactgtg taaaatagct 480gagtcagttg aaatgctgat cctgggccgc ttggttattg gcctcttctg cggactctgc 540acaggttttg tgcccatgta cattggagag atctcgccta ctgccctgag gggtgccttt 600ggcactctca accagctggg catagttatt ggaattctgg tggcccagat ctttggtctg 660gaactcatcc ttgggtctga agagctatgg ccggtgctat taggctttac catccttcca 720gctatcctgc aaagtgcagc ccttccatgt tgccctgaaa gtcccagatt tttgctcatt 780aacagaaaaa aagaggagaa tgctacgcgg atcctccagc ggttgtgggg cacccaggat 840gtatcccaag acatccagga gatgaaagat gagagtgcaa ggatgtcaca agaaaagcaa 900gtcaccgtgc tggagctctt tagagtgtcc agctaccgac agcccatcat catttccatt 960gtgctccagc tctctcagca gctctctggg atcaatgctg tgttctatta ctcaacagga 1020atcttcaagg atgcaggtgt tcaacagccc atctatgcca ccatcagcgc gggtgtggtt 1080aatactatct tcactttact ttctctattt ctggtggaaa gggcaggaag aaggactctg 1140catatgatag gccttggagg gatggctttt tgttccacgc tcatgactgt ttctttgtta 1200ttaaagaatc actataatgg gatgagcttt gtctgtattg gggctatctt ggtctttgtg 1260gcctgttttg aaattggacc aggccccatt ccctggttta ttgtggccga actcttcagc 1320cagggccccc gcccagctgc gatggcagtg gccggctgct ccaactggac ctccaacttc 1380ctagtcggat tgctcttccc ctctgctgct tactatttag gagcctacgt ttttattatc 1440ttcaccggct tcctcattac cttcctggcc tttaccttct tcaaagtccc tgagacccgt 1500ggcaggactt ttgaggatat cacacgggcc tttgaagggc aggcacacgg tgcagataga 1560tctggaaagg acggcgtcat ggggatgaac agcatcgagc ctgctaagga gaccaccacc 1620aatgtctaag tcgtgcctcc ttccacctcc ctcccggcat gggaaagcca cctctccctc 1680aacaagggag agacctcatc aggatgaacc caggacgctt ctgaatgctg ctacttgatt 1740tctttctcat cccacgcact ccatgagcac cccaaggctg cagtttgttg gatcttcaat 1800ggctttttaa attttatttc ctggacatcc tcttctgctt aggagagacc gagtgaacct 1860accttcattt ca 18721891872DNAHomo sapiensCDS(22)..(1629) 189acaacctgac ttttgttcac a atg caa aga ctc caa ctg ttg aga gta gaa 51 Met Gln Arg Leu Gln Leu Leu Arg Val Glu 1 5 10gta ctg ctg ggt gtg aaa caa gga gat gaa atg cgg cat ttc ttt ttt 99Val Leu Leu Gly Val Lys Gln Gly Asp Glu Met Arg His Phe Phe Phe 15 20 25tct tct cag acc tcc act ctg gag aag agt cag aat ggg ggc gtc ggg 147Ser Ser Gln Thr Ser Thr Leu Glu Lys Ser Gln Asn Gly Gly Val Gly 30 35 40gag gag gtc acc cca gct ctg atc ttt gcc atc aca gtt gct aca atc 195Glu Glu Val Thr Pro Ala Leu Ile Phe Ala Ile Thr Val Ala Thr Ile 45 50 55ggc tct ttc cag ttt ggc tac aac act ggg gtc atc aat gct cct gag 243Gly Ser Phe Gln Phe Gly Tyr Asn Thr Gly Val Ile Asn Ala Pro Glu 60 65 70acg atc ata aag gaa ttt atc aat aaa act ttg acg gac aag gca aat 291Thr Ile Ile Lys Glu Phe Ile Asn Lys Thr Leu Thr Asp Lys Ala Asn75 80 85 90gcc cct ccc tct gag gtg ctg ctc acg aat ctc tgg tcc ttg tct gtg 339Ala Pro Pro Ser Glu Val Leu Leu Thr Asn Leu Trp Ser Leu Ser Val 95 100 105gcc ata ttt tcc gtc ggg ggt atg atc ggc tcc ttt tcc gtc gga ctc 387Ala Ile Phe Ser Val Gly Gly Met Ile Gly Ser Phe Ser Val Gly Leu 110 115 120ttt gtt aac cgc ttt ggc agg cgc aat tca atg ctg att gtc aac ctg 435Phe Val Asn Arg Phe Gly Arg Arg Asn Ser Met Leu Ile Val Asn Leu 125 130 135ttg gct gcc act ggt ggc tgc ctt atg gga ctg tgt aaa ata gct gag 483Leu Ala Ala Thr Gly Gly Cys Leu Met Gly Leu Cys Lys Ile Ala Glu 140 145 150tca gtt gaa atg ctg atc ctg ggc cgc ttg gtt att ggc ctc ttc tgc 531Ser Val Glu Met Leu Ile Leu Gly Arg Leu Val Ile Gly Leu Phe Cys155 160 165 170gga ctc tgc aca ggt ttt gtg ccc atg tac att gga gag atc tcg cct 579Gly Leu Cys Thr Gly Phe Val Pro Met Tyr Ile Gly Glu Ile Ser Pro 175 180 185act gcc ctg agg ggt gcc ttt ggc act ctc aac cag ctg ggc ata gtt 627Thr Ala Leu Arg Gly Ala Phe Gly Thr Leu Asn Gln Leu Gly Ile Val 190 195 200att gga att ctg gtg gcc cag atc ttt ggt ctg gaa ctc atc ctt ggg 675Ile Gly Ile Leu Val Ala Gln Ile Phe Gly Leu Glu Leu Ile Leu Gly 205 210 215tct gaa gag cta tgg ccg gtg cta tta ggc ttt acc atc ctt cca gct 723Ser Glu Glu Leu Trp Pro Val Leu Leu Gly Phe Thr Ile Leu Pro Ala 220 225 230atc ctg caa agt gca gcc ctt cca tgt tgc cct gaa agt ccc aga ttt 771Ile Leu Gln Ser Ala Ala Leu Pro Cys Cys Pro Glu Ser Pro Arg Phe235 240 245 250ttg ctc att aac aga aaa aaa gag gag aat gct acg cgg atc ctc cag 819Leu Leu Ile Asn Arg Lys Lys Glu Glu Asn Ala Thr Arg Ile Leu Gln 255 260 265cgg ttg tgg ggc acc cag gat gta tcc caa gac atc cag gag atg aaa 867Arg Leu Trp Gly Thr Gln Asp Val Ser Gln Asp Ile Gln Glu Met Lys 270 275 280gat gag agt gca agg atg tca caa gaa aag caa gtc acc gtg ctg gag 915Asp Glu Ser Ala Arg Met Ser Gln Glu Lys Gln Val Thr Val Leu Glu 285 290 295ctc ttt aga gtg tcc agc tac cga cag ccc atc atc att tcc att gtg 963Leu Phe Arg Val Ser Ser Tyr Arg Gln Pro Ile Ile Ile Ser Ile Val 300 305 310ctc cag ctc tct cag cag ctc tct ggg atc aat gct gtg ttc tat tac 1011Leu Gln Leu Ser Gln Gln Leu Ser Gly Ile Asn Ala Val Phe Tyr Tyr315 320 325 330tca aca gga atc ttc aag gat gca ggt gtt caa cag ccc atc tat gcc 1059Ser Thr Gly Ile Phe Lys Asp Ala Gly Val Gln Gln Pro Ile Tyr Ala 335 340 345acc atc agc gcg ggt gtg gtt aat act atc ttc act tta ctt tct cta 1107Thr Ile Ser Ala Gly Val Val Asn Thr Ile Phe Thr Leu Leu Ser Leu 350 355 360ttt ctg gtg gaa agg gca gga aga agg act ctg cat atg ata ggc ctt 1155Phe Leu Val Glu Arg Ala Gly Arg Arg Thr Leu His Met Ile Gly Leu 365 370 375gga ggg atg gct ttt tgt tcc acg ctc atg act gtt tct ttg tta tta 1203Gly Gly Met Ala Phe Cys Ser Thr Leu Met Thr Val Ser Leu Leu Leu 380 385 390aag aat cac tat aat ggg atg agc ttt gtc tgt att ggg gct atc ttg 1251Lys Asn His Tyr Asn Gly Met Ser Phe Val Cys Ile Gly Ala Ile Leu395 400 405 410gtc ttt gtg gcc tgt ttt gaa att gga cca ggc ccc att ccc tgg ttt 1299Val Phe Val Ala Cys Phe Glu Ile Gly Pro Gly Pro Ile Pro Trp Phe 415 420 425att gtg gcc gaa ctc ttc agc cag ggc ccc cgc cca gct gcg atg gca 1347Ile Val Ala Glu Leu Phe Ser Gln Gly Pro Arg Pro Ala Ala Met Ala 430 435 440gtg gcc ggc tgc tcc aac tgg acc tcc aac ttc cta gtc gga ttg ctc 1395Val Ala Gly Cys Ser Asn Trp Thr Ser Asn Phe Leu Val Gly Leu Leu 445 450 455ttc ccc tct gct gct tac tat tta gga gcc tac gtt ttt att atc ttc 1443Phe Pro Ser Ala Ala Tyr Tyr Leu Gly Ala Tyr Val Phe Ile Ile Phe 460 465 470acc ggc ttc ctc att acc ttc ctg gcc ttt acc ttc ttc aaa gtc cct 1491Thr Gly Phe Leu Ile Thr Phe Leu Ala Phe Thr Phe Phe Lys Val Pro475 480 485 490gag acc cgt ggc agg act ttt gag gat atc aca cgg gcc ttt gaa ggg 1539Glu Thr Arg Gly Arg Thr Phe Glu Asp Ile Thr Arg Ala Phe Glu Gly 495 500 505cag gca cac ggt gca gat aga tct gga aag gac ggc gtc atg ggg atg 1587Gln Ala His Gly Ala Asp Arg Ser Gly Lys Asp Gly Val Met Gly Met 510 515 520aac agc atc gag cct gct aag gag acc acc acc aat gtc taa 1629Asn Ser Ile Glu Pro Ala Lys Glu Thr Thr Thr Asn Val 525 530 535gtcgtgcctc cttccacctc cctcccggca tgggaaagcc acctctccct caacaaggga 1689gagacctcat caggatgaac ccaggacgct tctgaatgct gctacttgat ttctttctca 1749tcccacgcac tccatgagca ccccaaggct gcagtttgtt ggatcttcaa tggcttttta 1809aattttattt cctggacatc ctcttctgct taggagagac cgagtgaacc taccttcatt 1869tca 1872190535PRTHomo sapiens 190Met Gln Arg Leu Gln Leu Leu Arg Val Glu Val Leu Leu Gly Val Lys1 5 10 15Gln Gly Asp Glu Met Arg His Phe Phe Phe Ser Ser Gln Thr Ser Thr 20 25 30Leu Glu Lys Ser Gln Asn Gly Gly Val Gly Glu Glu Val Thr Pro Ala 35 40 45Leu Ile Phe Ala Ile Thr Val Ala Thr Ile Gly Ser Phe Gln Phe Gly 50 55 60Tyr Asn Thr Gly Val Ile Asn Ala Pro Glu Thr Ile Ile Lys Glu Phe65 70 75 80Ile Asn Lys Thr Leu Thr Asp Lys Ala Asn Ala Pro Pro Ser Glu Val 85 90 95Leu Leu Thr Asn Leu Trp Ser Leu Ser Val Ala Ile Phe Ser Val Gly 100 105 110Gly Met Ile Gly Ser Phe Ser Val Gly Leu Phe Val Asn Arg Phe Gly 115 120 125Arg Arg Asn Ser Met Leu Ile Val Asn Leu Leu Ala Ala Thr Gly Gly 130 135 140Cys Leu Met Gly Leu Cys Lys Ile Ala Glu Ser Val Glu Met Leu Ile145 150 155 160Leu Gly Arg Leu Val Ile Gly Leu Phe Cys Gly Leu Cys Thr Gly Phe 165 170 175Val Pro Met Tyr Ile Gly Glu Ile Ser Pro Thr Ala Leu Arg Gly Ala 180 185 190Phe Gly Thr Leu Asn Gln Leu Gly Ile Val Ile Gly Ile Leu Val Ala 195 200 205Gln Ile Phe Gly Leu Glu Leu Ile Leu Gly Ser Glu Glu Leu Trp Pro 210 215 220Val Leu Leu Gly Phe Thr Ile Leu Pro Ala Ile Leu Gln Ser Ala Ala225 230 235 240Leu Pro Cys Cys Pro Glu Ser Pro Arg Phe Leu Leu Ile Asn Arg Lys 245 250 255Lys Glu Glu Asn Ala Thr Arg Ile Leu Gln Arg Leu Trp Gly Thr Gln 260 265 270Asp Val Ser Gln Asp Ile Gln Glu Met Lys Asp Glu Ser Ala Arg Met 275 280 285Ser Gln Glu Lys Gln Val Thr Val Leu Glu Leu Phe Arg Val Ser Ser 290 295 300Tyr Arg Gln Pro Ile Ile Ile Ser Ile Val Leu Gln Leu Ser Gln Gln305 310 315 320Leu Ser Gly Ile Asn Ala Val Phe Tyr Tyr Ser Thr Gly Ile Phe Lys 325 330 335Asp Ala Gly Val Gln Gln Pro Ile Tyr Ala Thr Ile Ser Ala Gly Val 340 345 350Val Asn Thr Ile Phe Thr Leu Leu Ser Leu Phe Leu Val Glu Arg Ala 355 360 365Gly Arg Arg Thr Leu His Met Ile Gly Leu Gly Gly Met Ala Phe Cys 370 375 380Ser Thr Leu Met Thr Val Ser Leu Leu Leu Lys Asn His Tyr Asn Gly385 390 395 400Met Ser Phe Val Cys Ile Gly Ala Ile Leu Val Phe Val Ala Cys Phe 405 410 415Glu Ile Gly Pro Gly Pro Ile Pro Trp Phe Ile Val Ala Glu Leu Phe 420 425 430Ser Gln Gly Pro Arg Pro Ala Ala Met Ala Val Ala Gly Cys Ser Asn 435 440 445Trp Thr Ser Asn Phe Leu Val Gly Leu Leu Phe Pro Ser Ala Ala Tyr 450 455 460Tyr Leu Gly Ala Tyr Val Phe Ile

Ile Phe Thr Gly Phe Leu Ile Thr465 470 475 480Phe Leu Ala Phe Thr Phe Phe Lys Val Pro Glu Thr Arg Gly Arg Thr 485 490 495Phe Glu Asp Ile Thr Arg Ala Phe Glu Gly Gln Ala His Gly Ala Asp 500 505 510Arg Ser Gly Lys Asp Gly Val Met Gly Met Asn Ser Ile Glu Pro Ala 515 520 525Lys Glu Thr Thr Thr Asn Val 530 535191153DNAHomo sapiens 191acaacctgac ttttgttcac aatgcaaaga ctccaactgt tgagagtaga agtactgctg 60ggtgtgaaac aaggagatga aatgcggcat ttcttttttt cttctcagac ctccactctg 120gagaagagtc agaatggggg cgtcggggag gag 15319244PRTHomo sapiens 192Met Gln Arg Leu Gln Leu Leu Arg Val Glu Val Leu Leu Gly Val Lys1 5 10 15Gln Gly Asp Glu Met Arg His Phe Phe Phe Ser Ser Gln Thr Ser Thr 20 25 30Leu Glu Lys Ser Gln Asn Gly Gly Val Gly Glu Glu 35 40193132DNAHomo sapiens 193atgcaaagac tccaactgtt gagagtagaa gtactgctgg gtgtgaaaca aggagatgaa 60atgcggcatt tctttttttc ttctcagacc tccactctgg agaagagtca gaatgggggc 120gtcggggagg ag 1321944115DNAHomo sapiens 194agtcgcgggg tctgggagga gacctgaatg aaatgaggga gccttgggag catgatccag 60gcggagggaa ctggattcgg gaggaggaac tgccttggcc ttgaaagata cctaccagga 120gttcaagtgc tgtgcgggtg catcagcttt gtagatttgt gcaagatgaa aattggaatt 180gtcttaggaa attatggatc attcatttat tcagtgctgg attcattcag tgatttatgt 240ctgaagtgtg acagaagggg agtaaggcca agtgtccttg ccctctattg gagattctgc 300ctcccctggg acagatggct tcttgagcac actcccacga tgggtggctg ctctggtaca 360tctcatccac ttcttcatct gtgaagctgt cacccatggt ggtgagcagc tcctggaggt 420ggtcctcatg agtgaaacct gaggattcct cgttgaagca ggtattcatc catggggttc 480ttccgcggtg aagccagctt gtcgtgctgt cccccttgtc aatgaagcca tcatggttct 540ggtcaatcat gttgaaagcc tccttaaact cctggatgtg gaactggtca aacatcacga 600agacattgga tgtggccccc tgtgctgtgg tcgcttcttg gtcttggctt tggtccactt 660gctgaacatt ttggcttcag gaagcagtac cttgaagaga aattggagag ggagtcaatt 720cctaggatag cagagagatg gacaacagac agaatagatg gagtttcaca atggtggcca 780tgtgtctgga attggtgggt tcttggtctc actgacttca agaatgaagc cgcacaccct 840cgcagtcacc ccagctctga tctttgccat cacagttgct acaatcggct ctttccagtt 900tggctacaac actggggtca tcaatgctcc tgagacgatc ataaaggaat ttatcaataa 960aactttgacg gacaaggcaa atgcccctcc ctctgaggtg ctgctcacga atctctggtc 1020cttgtctgtg gccatatttt ccgtcggggg tatgatcggc tccttttccg tcggactctt 1080tgttaaccgc tttggcaggc gcaattcaat gctgattgtc aacctgttgg ctgccactgg 1140tggctgcctt atgggactgt gtaaaatagc tgagtcagtt gaaatgctga tcctgggccg 1200cttggttatt ggcctcttct gcggactctg cacaggtttt gtgcccatgt acattggaga 1260gatctcgcct actgccctga ggggtgcctt tggcactctc aaccagctgg gcatagttat 1320tggaattctg gtggcccaga tctttggtct ggaactcatc cttgggtctg aagagctatg 1380gccggtgcta ttaggcttta ccatccttcc agctatcctg caaagtgcag cccttccatg 1440ttgccctgaa agtcccagat ttttgctcat taacagaaaa aaagaggaga atgctacgcg 1500gatcctccag cggttgtggg gcacccagga tgtatcccaa gacatccagg agatgaaaga 1560tgagagtgca aggatgtcac aagaaaagca agtcaccgtg ctggagctct ttagagtgtc 1620cagctaccga cagcccatca tcatttccat tgtgctccag ctctctcagc agctctctgg 1680gatcaatgct gtgttctatt actcaacagg aatcttcaag gatgcaggtg ttcaacagcc 1740catctatgcc accatcagcg cgggtgtggt taatactatc ttcactttac tttctctatt 1800tctggtggaa agggcaggaa gaaggactct gcatatgata ggccttggag ggatggcttt 1860ttgttccacg ctcatgactg tttctttgtt attaaagaat cactataatg ggatgagctt 1920tgtctgtatt ggggctatct tggtctttgt ggcctgtttt gaaattggac caggccccat 1980tccctggttt attgtggccg aactcttcag ccagggcccc cgcccagctg cgatggcagt 2040ggccggctgc tccaactgga cctccaactt cctagtcgga ttgctcttcc cctctgctgc 2100ttactattta ggagcctacg tttttattat cttcaccggc ttcctcatta ccttcttggc 2160ctttaccttc ttcaaagtcc ctgagacccg tggcaggact tttgaggata tcacacgggc 2220ctttgaaggg caggcacacg gtgcagatag atctggaaag gacggcgtca tggggatgaa 2280cagcatcgag cctgctaagg agaccaccac caatgtctaa gtcgtgcctc cttccacctc 2340cctcccggca tgggaaagcc acctctccct caacaaggga gagactttat caggatgaac 2400ccaggacgct tctgaatgct gctacttgat ttctttctca tcccacgcac tccatgagca 2460ccccaaggct gcagtttgtt ggatcttcaa tggcttttta aattttattt cctggacatc 2520ctcttctgct taggagagac cgagtgaacc taccttcatt tcaggaggga ttggccgctt 2580ggcacatgac aactttgcca gcttttcctc ccttgggttc tgatattgcc gcactagggg 2640atataggaga ggaaaagtaa ggtgcagttg ccccaacctc agacttacca ggaagcagat 2700acatgtgagt gtggaaggca gagggggttt atgtaagagc accttcctca cttccataca 2760gctctacgcg gcaaattaac ttgagtttta tttatcttat cctctggttt aattacataa 2820atatttattt tttaagtgta attttgccaa ataataacaa cagaaggaaa ttgagattag 2880agggaggtgt ttaaagagag gttatagagt aaaagatttg atgctggaga ggttaaggtg 2940caataagaat tcagggagaa atgttgttca ttattggagg gtaaatgatg tggtgcctga 3000ggtctgtaca ttacctctta acaatttctg tccttcagat gaaaactctt tgatttctca 3060gaaaagttgt atgcctattt aataaagcta ctcatttcct ttggaacttt atctttaaga 3120taatagttta catgtagtag tacttgaaat ctaggattat taactaatat gggcattgta 3180gttaatggcg gttgatgggt tctaattttg gatggagtcc agggaagaga aagtgatttc 3240tagaaagcct gttcccctca ctggacgaaa taactccttg tagtagtctc attacttttg 3300aagtaatccc gccacctatc tagtgggaga gccatccaaa tgagaaacct aaaataattg 3360gttcttggta gagattcatt atttctccac tttgttcttt aggagatttt aggtgttgat 3420tttctgtttt attttaactc atacctttaa aggaattccc caaagaatgt ttatagcaaa 3480cttggaattt gtaacctcag ctctgggaga ggattttttt ctgagcgatt attatctaaa 3540gtgtgttgtt gctttaggct cacggcacgc ttgcgtatgt ctgttaccat gtcactgtgg 3600tcctatgccg aatgccctca ggggacttga atctttccaa taaaccaggt ttagacagta 3660tgagtcaatg tgcagtgcag cccacacttg agaggatgaa tgtatgtgca ctgtcacttt 3720gctctgggtg gaagtatgtt attgttgact tattttctct gtgtttgttc ctacagcccc 3780tttttcatat gttgctcagt ctccctttcc cttcttggtg cttacacatc tcagaccctt 3840tagccaaacc cttgccagtg acagtatttt ggttctcagt tctcactgtt ccctctgctc 3900ctggagcctt tgaataaaaa tgcacgtagc tatggagtgg ggtttagctg gaaaggtggc 3960cttccaactt cacgtcaact tctggctcct cagtttggca gtaaggcagg gaagttgttt 4020tcctatttct cactgagaag attgtgaata tttccatatg gattttccat tattgtttgt 4080ttgattcttt gttttaaaat aaaaattctg aatgt 41151954115DNAHomo sapiensCDS(758)..(2320) 195agtcgcgggg tctgggagga gacctgaatg aaatgaggga gccttgggag catgatccag 60gcggagggaa ctggattcgg gaggaggaac tgccttggcc ttgaaagata cctaccagga 120gttcaagtgc tgtgcgggtg catcagcttt gtagatttgt gcaagatgaa aattggaatt 180gtcttaggaa attatggatc attcatttat tcagtgctgg attcattcag tgatttatgt 240ctgaagtgtg acagaagggg agtaaggcca agtgtccttg ccctctattg gagattctgc 300ctcccctggg acagatggct tcttgagcac actcccacga tgggtggctg ctctggtaca 360tctcatccac ttcttcatct gtgaagctgt cacccatggt ggtgagcagc tcctggaggt 420ggtcctcatg agtgaaacct gaggattcct cgttgaagca ggtattcatc catggggttc 480ttccgcggtg aagccagctt gtcgtgctgt cccccttgtc aatgaagcca tcatggttct 540ggtcaatcat gttgaaagcc tccttaaact cctggatgtg gaactggtca aacatcacga 600agacattgga tgtggccccc tgtgctgtgg tcgcttcttg gtcttggctt tggtccactt 660gctgaacatt ttggcttcag gaagcagtac cttgaagaga aattggagag ggagtcaatt 720cctaggatag cagagagatg gacaacagac agaatag atg gag ttt cac aat ggt 775 Met Glu Phe His Asn Gly 1 5ggc cat gtg tct gga att ggt ggg ttc ttg gtc tca ctg act tca aga 823Gly His Val Ser Gly Ile Gly Gly Phe Leu Val Ser Leu Thr Ser Arg 10 15 20atg aag ccg cac acc ctc gca gtc acc cca gct ctg atc ttt gcc atc 871Met Lys Pro His Thr Leu Ala Val Thr Pro Ala Leu Ile Phe Ala Ile 25 30 35aca gtt gct aca atc ggc tct ttc cag ttt ggc tac aac act ggg gtc 919Thr Val Ala Thr Ile Gly Ser Phe Gln Phe Gly Tyr Asn Thr Gly Val 40 45 50atc aat gct cct gag acg atc ata aag gaa ttt atc aat aaa act ttg 967Ile Asn Ala Pro Glu Thr Ile Ile Lys Glu Phe Ile Asn Lys Thr Leu55 60 65 70acg gac aag gca aat gcc cct ccc tct gag gtg ctg ctc acg aat ctc 1015Thr Asp Lys Ala Asn Ala Pro Pro Ser Glu Val Leu Leu Thr Asn Leu 75 80 85tgg tcc ttg tct gtg gcc ata ttt tcc gtc ggg ggt atg atc ggc tcc 1063Trp Ser Leu Ser Val Ala Ile Phe Ser Val Gly Gly Met Ile Gly Ser 90 95 100ttt tcc gtc gga ctc ttt gtt aac cgc ttt ggc agg cgc aat tca atg 1111Phe Ser Val Gly Leu Phe Val Asn Arg Phe Gly Arg Arg Asn Ser Met 105 110 115ctg att gtc aac ctg ttg gct gcc act ggt ggc tgc ctt atg gga ctg 1159Leu Ile Val Asn Leu Leu Ala Ala Thr Gly Gly Cys Leu Met Gly Leu 120 125 130tgt aaa ata gct gag tca gtt gaa atg ctg atc ctg ggc cgc ttg gtt 1207Cys Lys Ile Ala Glu Ser Val Glu Met Leu Ile Leu Gly Arg Leu Val135 140 145 150att ggc ctc ttc tgc gga ctc tgc aca ggt ttt gtg ccc atg tac att 1255Ile Gly Leu Phe Cys Gly Leu Cys Thr Gly Phe Val Pro Met Tyr Ile 155 160 165gga gag atc tcg cct act gcc ctg agg ggt gcc ttt ggc act ctc aac 1303Gly Glu Ile Ser Pro Thr Ala Leu Arg Gly Ala Phe Gly Thr Leu Asn 170 175 180cag ctg ggc ata gtt att gga att ctg gtg gcc cag atc ttt ggt ctg 1351Gln Leu Gly Ile Val Ile Gly Ile Leu Val Ala Gln Ile Phe Gly Leu 185 190 195gaa ctc atc ctt ggg tct gaa gag cta tgg ccg gtg cta tta ggc ttt 1399Glu Leu Ile Leu Gly Ser Glu Glu Leu Trp Pro Val Leu Leu Gly Phe 200 205 210acc atc ctt cca gct atc ctg caa agt gca gcc ctt cca tgt tgc cct 1447Thr Ile Leu Pro Ala Ile Leu Gln Ser Ala Ala Leu Pro Cys Cys Pro215 220 225 230gaa agt ccc aga ttt ttg ctc att aac aga aaa aaa gag gag aat gct 1495Glu Ser Pro Arg Phe Leu Leu Ile Asn Arg Lys Lys Glu Glu Asn Ala 235 240 245acg cgg atc ctc cag cgg ttg tgg ggc acc cag gat gta tcc caa gac 1543Thr Arg Ile Leu Gln Arg Leu Trp Gly Thr Gln Asp Val Ser Gln Asp 250 255 260atc cag gag atg aaa gat gag agt gca agg atg tca caa gaa aag caa 1591Ile Gln Glu Met Lys Asp Glu Ser Ala Arg Met Ser Gln Glu Lys Gln 265 270 275gtc acc gtg ctg gag ctc ttt aga gtg tcc agc tac cga cag ccc atc 1639Val Thr Val Leu Glu Leu Phe Arg Val Ser Ser Tyr Arg Gln Pro Ile 280 285 290atc att tcc att gtg ctc cag ctc tct cag cag ctc tct ggg atc aat 1687Ile Ile Ser Ile Val Leu Gln Leu Ser Gln Gln Leu Ser Gly Ile Asn295 300 305 310gct gtg ttc tat tac tca aca gga atc ttc aag gat gca ggt gtt caa 1735Ala Val Phe Tyr Tyr Ser Thr Gly Ile Phe Lys Asp Ala Gly Val Gln 315 320 325cag ccc atc tat gcc acc atc agc gcg ggt gtg gtt aat act atc ttc 1783Gln Pro Ile Tyr Ala Thr Ile Ser Ala Gly Val Val Asn Thr Ile Phe 330 335 340act tta ctt tct cta ttt ctg gtg gaa agg gca gga aga agg act ctg 1831Thr Leu Leu Ser Leu Phe Leu Val Glu Arg Ala Gly Arg Arg Thr Leu 345 350 355cat atg ata ggc ctt gga ggg atg gct ttt tgt tcc acg ctc atg act 1879His Met Ile Gly Leu Gly Gly Met Ala Phe Cys Ser Thr Leu Met Thr 360 365 370gtt tct ttg tta tta aag aat cac tat aat ggg atg agc ttt gtc tgt 1927Val Ser Leu Leu Leu Lys Asn His Tyr Asn Gly Met Ser Phe Val Cys375 380 385 390att ggg gct atc ttg gtc ttt gtg gcc tgt ttt gaa att gga cca ggc 1975Ile Gly Ala Ile Leu Val Phe Val Ala Cys Phe Glu Ile Gly Pro Gly 395 400 405ccc att ccc tgg ttt att gtg gcc gaa ctc ttc agc cag ggc ccc cgc 2023Pro Ile Pro Trp Phe Ile Val Ala Glu Leu Phe Ser Gln Gly Pro Arg 410 415 420cca gct gcg atg gca gtg gcc ggc tgc tcc aac tgg acc tcc aac ttc 2071Pro Ala Ala Met Ala Val Ala Gly Cys Ser Asn Trp Thr Ser Asn Phe 425 430 435cta gtc gga ttg ctc ttc ccc tct gct gct tac tat tta gga gcc tac 2119Leu Val Gly Leu Leu Phe Pro Ser Ala Ala Tyr Tyr Leu Gly Ala Tyr 440 445 450gtt ttt att atc ttc acc ggc ttc ctc att acc ttc ttg gcc ttt acc 2167Val Phe Ile Ile Phe Thr Gly Phe Leu Ile Thr Phe Leu Ala Phe Thr455 460 465 470ttc ttc aaa gtc cct gag acc cgt ggc agg act ttt gag gat atc aca 2215Phe Phe Lys Val Pro Glu Thr Arg Gly Arg Thr Phe Glu Asp Ile Thr 475 480 485cgg gcc ttt gaa ggg cag gca cac ggt gca gat aga tct gga aag gac 2263Arg Ala Phe Glu Gly Gln Ala His Gly Ala Asp Arg Ser Gly Lys Asp 490 495 500ggc gtc atg ggg atg aac agc atc gag cct gct aag gag acc acc acc 2311Gly Val Met Gly Met Asn Ser Ile Glu Pro Ala Lys Glu Thr Thr Thr 505 510 515aat gtc taa gtcgtgcctc cttccacctc cctcccggca tgggaaagcc 2360Asn Val 520acctctccct caacaaggga gagactttat caggatgaac ccaggacgct tctgaatgct 2420gctacttgat ttctttctca tcccacgcac tccatgagca ccccaaggct gcagtttgtt 2480ggatcttcaa tggcttttta aattttattt cctggacatc ctcttctgct taggagagac 2540cgagtgaacc taccttcatt tcaggaggga ttggccgctt ggcacatgac aactttgcca 2600gcttttcctc ccttgggttc tgatattgcc gcactagggg atataggaga ggaaaagtaa 2660ggtgcagttg ccccaacctc agacttacca ggaagcagat acatgtgagt gtggaaggca 2720gagggggttt atgtaagagc accttcctca cttccataca gctctacgcg gcaaattaac 2780ttgagtttta tttatcttat cctctggttt aattacataa atatttattt tttaagtgta 2840attttgccaa ataataacaa cagaaggaaa ttgagattag agggaggtgt ttaaagagag 2900gttatagagt aaaagatttg atgctggaga ggttaaggtg caataagaat tcagggagaa 2960atgttgttca ttattggagg gtaaatgatg tggtgcctga ggtctgtaca ttacctctta 3020acaatttctg tccttcagat gaaaactctt tgatttctca gaaaagttgt atgcctattt 3080aataaagcta ctcatttcct ttggaacttt atctttaaga taatagttta catgtagtag 3140tacttgaaat ctaggattat taactaatat gggcattgta gttaatggcg gttgatgggt 3200tctaattttg gatggagtcc agggaagaga aagtgatttc tagaaagcct gttcccctca 3260ctggacgaaa taactccttg tagtagtctc attacttttg aagtaatccc gccacctatc 3320tagtgggaga gccatccaaa tgagaaacct aaaataattg gttcttggta gagattcatt 3380atttctccac tttgttcttt aggagatttt aggtgttgat tttctgtttt attttaactc 3440atacctttaa aggaattccc caaagaatgt ttatagcaaa cttggaattt gtaacctcag 3500ctctgggaga ggattttttt ctgagcgatt attatctaaa gtgtgttgtt gctttaggct 3560cacggcacgc ttgcgtatgt ctgttaccat gtcactgtgg tcctatgccg aatgccctca 3620ggggacttga atctttccaa taaaccaggt ttagacagta tgagtcaatg tgcagtgcag 3680cccacacttg agaggatgaa tgtatgtgca ctgtcacttt gctctgggtg gaagtatgtt 3740attgttgact tattttctct gtgtttgttc ctacagcccc tttttcatat gttgctcagt 3800ctccctttcc cttcttggtg cttacacatc tcagaccctt tagccaaacc cttgccagtg 3860acagtatttt ggttctcagt tctcactgtt ccctctgctc ctggagcctt tgaataaaaa 3920tgcacgtagc tatggagtgg ggtttagctg gaaaggtggc cttccaactt cacgtcaact 3980tctggctcct cagtttggca gtaaggcagg gaagttgttt tcctatttct cactgagaag 4040attgtgaata tttccatatg gattttccat tattgtttgt ttgattcttt gttttaaaat 4100aaaaattctg aatgt 4115196520PRTHomo sapiens 196Met Glu Phe His Asn Gly Gly His Val Ser Gly Ile Gly Gly Phe Leu1 5 10 15Val Ser Leu Thr Ser Arg Met Lys Pro His Thr Leu Ala Val Thr Pro 20 25 30Ala Leu Ile Phe Ala Ile Thr Val Ala Thr Ile Gly Ser Phe Gln Phe 35 40 45Gly Tyr Asn Thr Gly Val Ile Asn Ala Pro Glu Thr Ile Ile Lys Glu 50 55 60Phe Ile Asn Lys Thr Leu Thr Asp Lys Ala Asn Ala Pro Pro Ser Glu65 70 75 80Val Leu Leu Thr Asn Leu Trp Ser Leu Ser Val Ala Ile Phe Ser Val 85 90 95Gly Gly Met Ile Gly Ser Phe Ser Val Gly Leu Phe Val Asn Arg Phe 100 105 110Gly Arg Arg Asn Ser Met Leu Ile Val Asn Leu Leu Ala Ala Thr Gly 115 120 125Gly Cys Leu Met Gly Leu Cys Lys Ile Ala Glu Ser Val Glu Met Leu 130 135 140Ile Leu Gly Arg Leu Val Ile Gly Leu Phe Cys Gly Leu Cys Thr Gly145 150 155 160Phe Val Pro Met Tyr Ile Gly Glu Ile Ser Pro Thr Ala Leu Arg Gly 165 170 175Ala Phe Gly Thr Leu Asn Gln Leu Gly Ile Val Ile Gly Ile Leu Val 180 185 190Ala Gln Ile Phe Gly Leu Glu Leu Ile Leu Gly Ser Glu Glu Leu Trp 195 200 205Pro Val Leu Leu Gly Phe Thr Ile Leu Pro Ala Ile Leu Gln Ser Ala 210 215 220Ala Leu Pro Cys Cys Pro Glu Ser Pro Arg Phe Leu Leu Ile Asn Arg225 230 235 240Lys Lys Glu Glu Asn Ala Thr Arg Ile Leu Gln Arg Leu Trp Gly Thr 245 250 255Gln Asp Val Ser Gln Asp Ile Gln Glu Met Lys Asp Glu Ser Ala Arg 260 265 270Met Ser Gln Glu Lys Gln Val Thr Val Leu Glu Leu Phe Arg Val Ser 275 280 285Ser Tyr Arg Gln Pro Ile Ile Ile Ser Ile Val Leu Gln Leu Ser Gln 290 295 300Gln Leu Ser Gly Ile Asn Ala Val Phe Tyr Tyr Ser Thr Gly Ile Phe305 310 315 320Lys Asp Ala Gly Val Gln Gln Pro Ile Tyr Ala Thr Ile Ser Ala Gly

325 330 335Val Val Asn Thr Ile Phe Thr Leu Leu Ser Leu Phe Leu Val Glu Arg 340 345 350Ala Gly Arg Arg Thr Leu His Met Ile Gly Leu Gly Gly Met Ala Phe 355 360 365Cys Ser Thr Leu Met Thr Val Ser Leu Leu Leu Lys Asn His Tyr Asn 370 375 380Gly Met Ser Phe Val Cys Ile Gly Ala Ile Leu Val Phe Val Ala Cys385 390 395 400Phe Glu Ile Gly Pro Gly Pro Ile Pro Trp Phe Ile Val Ala Glu Leu 405 410 415Phe Ser Gln Gly Pro Arg Pro Ala Ala Met Ala Val Ala Gly Cys Ser 420 425 430Asn Trp Thr Ser Asn Phe Leu Val Gly Leu Leu Phe Pro Ser Ala Ala 435 440 445Tyr Tyr Leu Gly Ala Tyr Val Phe Ile Ile Phe Thr Gly Phe Leu Ile 450 455 460Thr Phe Leu Ala Phe Thr Phe Phe Lys Val Pro Glu Thr Arg Gly Arg465 470 475 480Thr Phe Glu Asp Ile Thr Arg Ala Phe Glu Gly Gln Ala His Gly Ala 485 490 495Asp Arg Ser Gly Lys Asp Gly Val Met Gly Met Asn Ser Ile Glu Pro 500 505 510Ala Lys Glu Thr Thr Thr Asn Val 515 52019723DNAArtificial Sequenceforward primer used for specifically detecting the variant of the present invention 197ggcatttctt tttttcttct cag 2319817DNAArtificial Sequencereverse primer used for specifically detecting the variant of the present invention 198ggtgacctcc tccccga 1719974DNAArtificial Sequencepartial polynucleotide specific to the variant of the present invention, which is obtained by PCR using forward primer (SEQ ID NO197) and reverse primer (SEQ ID NO198) 199ggcatttctt tttttcttct cagacctcca ctctggagaa gagtcagaat gggggcgtcg 60gggaggaggt cacc 7420022DNAArtificial Sequenceforward primer used for detecting the variants of the gene of the present invention 200tttattgtgg ccgaactctt ca 2220122DNAArtificial Sequencereverse primer used for detecting the variants of the gene of the present invention 201gcaatccgac taggaagttg ga 2220297DNAArtificial Sequencepartial polynucleotide common to the variants of the gene of the present invention, which is obtained by PCR using forward primer (SEQ ID NO200) and reverse primer (SEQ ID NO201) 202tttattgtgg ccgaactctt cagccagggc ccccgcccag ctgcgatggc agtggccggc 60tgctccaact ggacctccaa cttcctagtc ggattgc 97203159DNAHomo sapiens 203acaacctgac ttttgtccac aatgcaaaga ctccaactgt tgagagtaga agtactgctg 60ggtgtgaaac aaggagatga aatgcggcat ttcttttttt cttctcagac ctccactctg 120gagaagagtc agaatggggg cgtcggggag gaggtcacc 159204159DNAHomo sapiens 204acaacctgac ttttgttcac aatgcaaaga ctccaactgt tgagagtaga agtactgctg 60ggtgtgaaac aaggagatga aatgcggcat ttcttttttt cttctcagac ctccactctg 120gagaagagtc agaatggggg cgtcggggag gaggtcacc 1592052724DNAHomo sapiens 205gagagcagcg ccaatgtgaa gcgttgcagt cgcttgactc acctgaggct ctccaaggat 60accttcaatg cctgcactgt aagggagctg cttttcccgg gtgctggcga gaacggaagc 120cttcctttga cgtttttcta aacatgggat gcagtctgtg cagcctgcag aagcaagagg 180agcagtacaa attactctat gaagtttgtc aggtcaacgg cagagactta tccagagcaa 240ctcatgacca ggctgtggaa gctttcaaga cagccaagga gcccatagtg gtgcaggtgt 300tgagaagaac accaaggacc aaaatgttca cgcctccatc agagtctcag ctggtggaca 360cgggaaccca aaccgacatc acctttgaac atatcatggc cctcactaag atgtcctctc 420ccagcccacc cgtgctggat ccctatctct tgccagagga gcatccctca gcccatgaat 480actacgatcc aaatgactac attggagaca tccatcagga gatggacagg gaggagctgg 540agctggagga agtggacctc tacagaatga acagccagga caagctgggc ctcactgtgt 600gctaccggac ggacgatgaa gacgacattg ggatttatat cagtgagatt gaccctaaca 660gcattgcagc caaggatggg cgcatccgag aaggagactg cattatccag attaatggga 720tagaggtgca gaaccgtgaa gaggctgtgg ctcttctaac cagtgaagaa aataaaaact 780tttcattgct gattgcaagg cctgaactcc agctggatga gggctggatg gatgatgaca 840ggaacgactt tctggatgac ctgcacatgg acatgctgga ggagcagcac caccaggcca 900tgcaattcac agctagcgtg ctgcagcaga agaagcacga cgaagacggt gggaccacag 960atacagccac catcttgtcc aaccagcacg agaaggacag cggtgtgggg cggaccgacg 1020agagcacccg taatgacgag agctcggagc aagagaacaa tggcgacgac gccaccgcat 1080cctccaaccc gctggcgggg cagaggaagc tcacctgcag ccaggacacc ttgggcagcg 1140gcgacctgcc cttcagcaac gagtctttca tttcggccga ctgcacggac gccgactacc 1200tggggatccc ggtggacgag tgcgagcgct tccgcgagct cctggagctc aagtgccagg 1260tgaagagcgc caccccttac ggcctgtact accctagcgg ccccctggac gccggcaaga 1320gtgaccctga gagcgtggac aaggagctgg agctgctgaa cgaagagctg cgcagcatcg 1380agctggagtg cctgagcatc gtgcgcgccc acaagatgca gcagctcaag gagcagtacc 1440gcgagtcctg gatgctgcac aacagcggct tccgcaacta caacaccagc atcgacgtgc 1500gcagacgcga gctctcagat atcaccgagc tcccggagaa atccgacaag gacagctcga 1560gcgcctacaa cacaggcgag agctgccgca gcaccccgct caccctggag atctcccccg 1620acaactcctt gaggagagcg gcggagggca tcagctgccc gagcagcgaa ggggctgtgg 1680ggaccacgga agcctacggg ccagcctcca agaatctgct ctccatcacg gaagatcccg 1740aagtgggcac ccctacctat agcccgtccc tgaaggagct ggaccccaac cagcccctgg 1800aaagcaaaga gcggagagcc agcgacggga gccggagccc cacgcccagc cagaagctgg 1860gcagcgccta cctgccctcc tatcaccact ccccatacaa gcacgcgcac atcccggcgc 1920acgcccagca ctaccagagc tacatgcagc tgatccagca gaagtcggcc gtggagtacg 1980cgcaaagcca gatgagcctg gtgagcatgt gcaaggacct gagctctccc accccgtcgg 2040agccgcgcat ggagtggaag gtgaagatcc gcagcgacgg gacgcgctac atcaccaaga 2100ggcccgtgcg ggaccgcctg ctgcgggagc gcgccctgaa gatccgggaa gagcgcagcg 2160gcatgaccac cgacgacgac gcggtgagcg agatgaagat ggggcgctac tggagcaagg 2220aggagaggaa gcagcacttg gtgaaggcca aggagcagcg gcggcggcgc gagttcatga 2280tgcagagcag gttggattgt ctcaaggagc agcaagcagc cgatgacagg aaggagatga 2340acattctcga actgagccac aaaaagatga tgaagaagag gaataagaaa atcttcgata 2400actggatgac gatccaagaa ctcttaaccc acggcacaaa atccccggac ggcactagag 2460tatacaattc cttcctatcg gtgactactg tataattttc acttctgcat tatgtacata 2520aaggagacca ctaccactgg ggtagaaatt cctgcctcgt tcaatgcggc aagtttttgt 2580atataagata agtacggtct tcatgtttat agtccaaatt tgcaaaccct acaactctgg 2640gtgtcatagg tctattttaa gggaagagag agaaaaacac ccttactatc ttggaaggca 2700atattaacaa acagagcttt tttc 27242062724DNAHomo sapiensCDS(144)..(2495) 206gagagcagcg ccaatgtgaa gcgttgcagt cgcttgactc acctgaggct ctccaaggat 60accttcaatg cctgcactgt aagggagctg cttttcccgg gtgctggcga gaacggaagc 120cttcctttga cgtttttcta aac atg gga tgc agt ctg tgc agc ctg cag aag 173 Met Gly Cys Ser Leu Cys Ser Leu Gln Lys 1 5 10caa gag gag cag tac aaa tta ctc tat gaa gtt tgt cag gtc aac ggc 221Gln Glu Glu Gln Tyr Lys Leu Leu Tyr Glu Val Cys Gln Val Asn Gly 15 20 25aga gac tta tcc aga gca act cat gac cag gct gtg gaa gct ttc aag 269Arg Asp Leu Ser Arg Ala Thr His Asp Gln Ala Val Glu Ala Phe Lys 30 35 40aca gcc aag gag ccc ata gtg gtg cag gtg ttg aga aga aca cca agg 317Thr Ala Lys Glu Pro Ile Val Val Gln Val Leu Arg Arg Thr Pro Arg 45 50 55acc aaa atg ttc acg cct cca tca gag tct cag ctg gtg gac acg gga 365Thr Lys Met Phe Thr Pro Pro Ser Glu Ser Gln Leu Val Asp Thr Gly 60 65 70acc caa acc gac atc acc ttt gaa cat atc atg gcc ctc act aag atg 413Thr Gln Thr Asp Ile Thr Phe Glu His Ile Met Ala Leu Thr Lys Met75 80 85 90tcc tct ccc agc cca ccc gtg ctg gat ccc tat ctc ttg cca gag gag 461Ser Ser Pro Ser Pro Pro Val Leu Asp Pro Tyr Leu Leu Pro Glu Glu 95 100 105cat ccc tca gcc cat gaa tac tac gat cca aat gac tac att gga gac 509His Pro Ser Ala His Glu Tyr Tyr Asp Pro Asn Asp Tyr Ile Gly Asp 110 115 120atc cat cag gag atg gac agg gag gag ctg gag ctg gag gaa gtg gac 557Ile His Gln Glu Met Asp Arg Glu Glu Leu Glu Leu Glu Glu Val Asp 125 130 135ctc tac aga atg aac agc cag gac aag ctg ggc ctc act gtg tgc tac 605Leu Tyr Arg Met Asn Ser Gln Asp Lys Leu Gly Leu Thr Val Cys Tyr 140 145 150cgg acg gac gat gaa gac gac att ggg att tat atc agt gag att gac 653Arg Thr Asp Asp Glu Asp Asp Ile Gly Ile Tyr Ile Ser Glu Ile Asp155 160 165 170cct aac agc att gca gcc aag gat ggg cgc atc cga gaa gga gac tgc 701Pro Asn Ser Ile Ala Ala Lys Asp Gly Arg Ile Arg Glu Gly Asp Cys 175 180 185att atc cag att aat ggg ata gag gtg cag aac cgt gaa gag gct gtg 749Ile Ile Gln Ile Asn Gly Ile Glu Val Gln Asn Arg Glu Glu Ala Val 190 195 200gct ctt cta acc agt gaa gaa aat aaa aac ttt tca ttg ctg att gca 797Ala Leu Leu Thr Ser Glu Glu Asn Lys Asn Phe Ser Leu Leu Ile Ala 205 210 215agg cct gaa ctc cag ctg gat gag ggc tgg atg gat gat gac agg aac 845Arg Pro Glu Leu Gln Leu Asp Glu Gly Trp Met Asp Asp Asp Arg Asn 220 225 230gac ttt ctg gat gac ctg cac atg gac atg ctg gag gag cag cac cac 893Asp Phe Leu Asp Asp Leu His Met Asp Met Leu Glu Glu Gln His His235 240 245 250cag gcc atg caa ttc aca gct agc gtg ctg cag cag aag aag cac gac 941Gln Ala Met Gln Phe Thr Ala Ser Val Leu Gln Gln Lys Lys His Asp 255 260 265gaa gac ggt ggg acc aca gat aca gcc acc atc ttg tcc aac cag cac 989Glu Asp Gly Gly Thr Thr Asp Thr Ala Thr Ile Leu Ser Asn Gln His 270 275 280gag aag gac agc ggt gtg ggg cgg acc gac gag agc acc cgt aat gac 1037Glu Lys Asp Ser Gly Val Gly Arg Thr Asp Glu Ser Thr Arg Asn Asp 285 290 295gag agc tcg gag caa gag aac aat ggc gac gac gcc acc gca tcc tcc 1085Glu Ser Ser Glu Gln Glu Asn Asn Gly Asp Asp Ala Thr Ala Ser Ser 300 305 310aac ccg ctg gcg ggg cag agg aag ctc acc tgc agc cag gac acc ttg 1133Asn Pro Leu Ala Gly Gln Arg Lys Leu Thr Cys Ser Gln Asp Thr Leu315 320 325 330ggc agc ggc gac ctg ccc ttc agc aac gag tct ttc att tcg gcc gac 1181Gly Ser Gly Asp Leu Pro Phe Ser Asn Glu Ser Phe Ile Ser Ala Asp 335 340 345tgc acg gac gcc gac tac ctg ggg atc ccg gtg gac gag tgc gag cgc 1229Cys Thr Asp Ala Asp Tyr Leu Gly Ile Pro Val Asp Glu Cys Glu Arg 350 355 360ttc cgc gag ctc ctg gag ctc aag tgc cag gtg aag agc gcc acc cct 1277Phe Arg Glu Leu Leu Glu Leu Lys Cys Gln Val Lys Ser Ala Thr Pro 365 370 375tac ggc ctg tac tac cct agc ggc ccc ctg gac gcc ggc aag agt gac 1325Tyr Gly Leu Tyr Tyr Pro Ser Gly Pro Leu Asp Ala Gly Lys Ser Asp 380 385 390cct gag agc gtg gac aag gag ctg gag ctg ctg aac gaa gag ctg cgc 1373Pro Glu Ser Val Asp Lys Glu Leu Glu Leu Leu Asn Glu Glu Leu Arg395 400 405 410agc atc gag ctg gag tgc ctg agc atc gtg cgc gcc cac aag atg cag 1421Ser Ile Glu Leu Glu Cys Leu Ser Ile Val Arg Ala His Lys Met Gln 415 420 425cag ctc aag gag cag tac cgc gag tcc tgg atg ctg cac aac agc ggc 1469Gln Leu Lys Glu Gln Tyr Arg Glu Ser Trp Met Leu His Asn Ser Gly 430 435 440ttc cgc aac tac aac acc agc atc gac gtg cgc aga cgc gag ctc tca 1517Phe Arg Asn Tyr Asn Thr Ser Ile Asp Val Arg Arg Arg Glu Leu Ser 445 450 455gat atc acc gag ctc ccg gag aaa tcc gac aag gac agc tcg agc gcc 1565Asp Ile Thr Glu Leu Pro Glu Lys Ser Asp Lys Asp Ser Ser Ser Ala 460 465 470tac aac aca ggc gag agc tgc cgc agc acc ccg ctc acc ctg gag atc 1613Tyr Asn Thr Gly Glu Ser Cys Arg Ser Thr Pro Leu Thr Leu Glu Ile475 480 485 490tcc ccc gac aac tcc ttg agg aga gcg gcg gag ggc atc agc tgc ccg 1661Ser Pro Asp Asn Ser Leu Arg Arg Ala Ala Glu Gly Ile Ser Cys Pro 495 500 505agc agc gaa ggg gct gtg ggg acc acg gaa gcc tac ggg cca gcc tcc 1709Ser Ser Glu Gly Ala Val Gly Thr Thr Glu Ala Tyr Gly Pro Ala Ser 510 515 520aag aat ctg ctc tcc atc acg gaa gat ccc gaa gtg ggc acc cct acc 1757Lys Asn Leu Leu Ser Ile Thr Glu Asp Pro Glu Val Gly Thr Pro Thr 525 530 535tat agc ccg tcc ctg aag gag ctg gac ccc aac cag ccc ctg gaa agc 1805Tyr Ser Pro Ser Leu Lys Glu Leu Asp Pro Asn Gln Pro Leu Glu Ser 540 545 550aaa gag cgg aga gcc agc gac ggg agc cgg agc ccc acg ccc agc cag 1853Lys Glu Arg Arg Ala Ser Asp Gly Ser Arg Ser Pro Thr Pro Ser Gln555 560 565 570aag ctg ggc agc gcc tac ctg ccc tcc tat cac cac tcc cca tac aag 1901Lys Leu Gly Ser Ala Tyr Leu Pro Ser Tyr His His Ser Pro Tyr Lys 575 580 585cac gcg cac atc ccg gcg cac gcc cag cac tac cag agc tac atg cag 1949His Ala His Ile Pro Ala His Ala Gln His Tyr Gln Ser Tyr Met Gln 590 595 600ctg atc cag cag aag tcg gcc gtg gag tac gcg caa agc cag atg agc 1997Leu Ile Gln Gln Lys Ser Ala Val Glu Tyr Ala Gln Ser Gln Met Ser 605 610 615ctg gtg agc atg tgc aag gac ctg agc tct ccc acc ccg tcg gag ccg 2045Leu Val Ser Met Cys Lys Asp Leu Ser Ser Pro Thr Pro Ser Glu Pro 620 625 630cgc atg gag tgg aag gtg aag atc cgc agc gac ggg acg cgc tac atc 2093Arg Met Glu Trp Lys Val Lys Ile Arg Ser Asp Gly Thr Arg Tyr Ile635 640 645 650acc aag agg ccc gtg cgg gac cgc ctg ctg cgg gag cgc gcc ctg aag 2141Thr Lys Arg Pro Val Arg Asp Arg Leu Leu Arg Glu Arg Ala Leu Lys 655 660 665atc cgg gaa gag cgc agc ggc atg acc acc gac gac gac gcg gtg agc 2189Ile Arg Glu Glu Arg Ser Gly Met Thr Thr Asp Asp Asp Ala Val Ser 670 675 680gag atg aag atg ggg cgc tac tgg agc aag gag gag agg aag cag cac 2237Glu Met Lys Met Gly Arg Tyr Trp Ser Lys Glu Glu Arg Lys Gln His 685 690 695ttg gtg aag gcc aag gag cag cgg cgg cgg cgc gag ttc atg atg cag 2285Leu Val Lys Ala Lys Glu Gln Arg Arg Arg Arg Glu Phe Met Met Gln 700 705 710agc agg ttg gat tgt ctc aag gag cag caa gca gcc gat gac agg aag 2333Ser Arg Leu Asp Cys Leu Lys Glu Gln Gln Ala Ala Asp Asp Arg Lys715 720 725 730gag atg aac att ctc gaa ctg agc cac aaa aag atg atg aag aag agg 2381Glu Met Asn Ile Leu Glu Leu Ser His Lys Lys Met Met Lys Lys Arg 735 740 745aat aag aaa atc ttc gat aac tgg atg acg atc caa gaa ctc tta acc 2429Asn Lys Lys Ile Phe Asp Asn Trp Met Thr Ile Gln Glu Leu Leu Thr 750 755 760cac ggc aca aaa tcc ccg gac ggc act aga gta tac aat tcc ttc cta 2477His Gly Thr Lys Ser Pro Asp Gly Thr Arg Val Tyr Asn Ser Phe Leu 765 770 775tcg gtg act act gta taa ttttcacttc tgcattatgt acataaagga 2525Ser Val Thr Thr Val 780gaccactacc actggggtag aaattcctgc ctcgttcaat gcggcaagtt tttgtatata 2585agataagtac ggtcttcatg tttatagtcc aaatttgcaa accctacaac tctgggtgtc 2645ataggtctat tttaagggaa gagagagaaa aacaccctta ctatcttgga aggcaatatt 2705aacaaacaga gcttttttc 2724207783PRTHomo sapiens 207Met Gly Cys Ser Leu Cys Ser Leu Gln Lys Gln Glu Glu Gln Tyr Lys1 5 10 15Leu Leu Tyr Glu Val Cys Gln Val Asn Gly Arg Asp Leu Ser Arg Ala 20 25 30Thr His Asp Gln Ala Val Glu Ala Phe Lys Thr Ala Lys Glu Pro Ile 35 40 45Val Val Gln Val Leu Arg Arg Thr Pro Arg Thr Lys Met Phe Thr Pro 50 55 60Pro Ser Glu Ser Gln Leu Val Asp Thr Gly Thr Gln Thr Asp Ile Thr65 70 75 80Phe Glu His Ile Met Ala Leu Thr Lys Met Ser Ser Pro Ser Pro Pro 85 90 95Val Leu Asp Pro Tyr Leu Leu Pro Glu Glu His Pro Ser Ala His Glu 100 105 110Tyr Tyr Asp Pro Asn Asp Tyr Ile Gly Asp Ile His Gln Glu Met Asp 115 120 125Arg Glu Glu Leu Glu Leu Glu Glu Val Asp Leu Tyr Arg Met Asn Ser 130 135 140Gln Asp Lys Leu Gly Leu Thr Val Cys Tyr Arg Thr Asp Asp Glu Asp145 150 155 160Asp Ile Gly Ile Tyr Ile Ser Glu Ile Asp Pro Asn Ser Ile Ala Ala 165 170 175Lys Asp Gly Arg Ile Arg Glu Gly Asp Cys Ile Ile Gln Ile Asn Gly 180 185 190Ile Glu Val Gln Asn Arg Glu Glu Ala Val Ala Leu Leu Thr Ser Glu 195

200 205Glu Asn Lys Asn Phe Ser Leu Leu Ile Ala Arg Pro Glu Leu Gln Leu 210 215 220Asp Glu Gly Trp Met Asp Asp Asp Arg Asn Asp Phe Leu Asp Asp Leu225 230 235 240His Met Asp Met Leu Glu Glu Gln His His Gln Ala Met Gln Phe Thr 245 250 255Ala Ser Val Leu Gln Gln Lys Lys His Asp Glu Asp Gly Gly Thr Thr 260 265 270Asp Thr Ala Thr Ile Leu Ser Asn Gln His Glu Lys Asp Ser Gly Val 275 280 285Gly Arg Thr Asp Glu Ser Thr Arg Asn Asp Glu Ser Ser Glu Gln Glu 290 295 300Asn Asn Gly Asp Asp Ala Thr Ala Ser Ser Asn Pro Leu Ala Gly Gln305 310 315 320Arg Lys Leu Thr Cys Ser Gln Asp Thr Leu Gly Ser Gly Asp Leu Pro 325 330 335Phe Ser Asn Glu Ser Phe Ile Ser Ala Asp Cys Thr Asp Ala Asp Tyr 340 345 350Leu Gly Ile Pro Val Asp Glu Cys Glu Arg Phe Arg Glu Leu Leu Glu 355 360 365Leu Lys Cys Gln Val Lys Ser Ala Thr Pro Tyr Gly Leu Tyr Tyr Pro 370 375 380Ser Gly Pro Leu Asp Ala Gly Lys Ser Asp Pro Glu Ser Val Asp Lys385 390 395 400Glu Leu Glu Leu Leu Asn Glu Glu Leu Arg Ser Ile Glu Leu Glu Cys 405 410 415Leu Ser Ile Val Arg Ala His Lys Met Gln Gln Leu Lys Glu Gln Tyr 420 425 430Arg Glu Ser Trp Met Leu His Asn Ser Gly Phe Arg Asn Tyr Asn Thr 435 440 445Ser Ile Asp Val Arg Arg Arg Glu Leu Ser Asp Ile Thr Glu Leu Pro 450 455 460Glu Lys Ser Asp Lys Asp Ser Ser Ser Ala Tyr Asn Thr Gly Glu Ser465 470 475 480Cys Arg Ser Thr Pro Leu Thr Leu Glu Ile Ser Pro Asp Asn Ser Leu 485 490 495Arg Arg Ala Ala Glu Gly Ile Ser Cys Pro Ser Ser Glu Gly Ala Val 500 505 510Gly Thr Thr Glu Ala Tyr Gly Pro Ala Ser Lys Asn Leu Leu Ser Ile 515 520 525Thr Glu Asp Pro Glu Val Gly Thr Pro Thr Tyr Ser Pro Ser Leu Lys 530 535 540Glu Leu Asp Pro Asn Gln Pro Leu Glu Ser Lys Glu Arg Arg Ala Ser545 550 555 560Asp Gly Ser Arg Ser Pro Thr Pro Ser Gln Lys Leu Gly Ser Ala Tyr 565 570 575Leu Pro Ser Tyr His His Ser Pro Tyr Lys His Ala His Ile Pro Ala 580 585 590His Ala Gln His Tyr Gln Ser Tyr Met Gln Leu Ile Gln Gln Lys Ser 595 600 605Ala Val Glu Tyr Ala Gln Ser Gln Met Ser Leu Val Ser Met Cys Lys 610 615 620Asp Leu Ser Ser Pro Thr Pro Ser Glu Pro Arg Met Glu Trp Lys Val625 630 635 640Lys Ile Arg Ser Asp Gly Thr Arg Tyr Ile Thr Lys Arg Pro Val Arg 645 650 655Asp Arg Leu Leu Arg Glu Arg Ala Leu Lys Ile Arg Glu Glu Arg Ser 660 665 670Gly Met Thr Thr Asp Asp Asp Ala Val Ser Glu Met Lys Met Gly Arg 675 680 685Tyr Trp Ser Lys Glu Glu Arg Lys Gln His Leu Val Lys Ala Lys Glu 690 695 700Gln Arg Arg Arg Arg Glu Phe Met Met Gln Ser Arg Leu Asp Cys Leu705 710 715 720Lys Glu Gln Gln Ala Ala Asp Asp Arg Lys Glu Met Asn Ile Leu Glu 725 730 735Leu Ser His Lys Lys Met Met Lys Lys Arg Asn Lys Lys Ile Phe Asp 740 745 750Asn Trp Met Thr Ile Gln Glu Leu Leu Thr His Gly Thr Lys Ser Pro 755 760 765Asp Gly Thr Arg Val Tyr Asn Ser Phe Leu Ser Val Thr Thr Val 770 775 780208212DNAHomo sapiens 208gagagcagcg ccaatgtgaa gcgttgcagt cgcttgactc acctgaggct ctccaaggat 60accttcaatg cctgcactgt aagggagctg cttttcccgg gtgctggcga gaacggaagc 120cttcctttga cgtttttcta aacatgggat gcagtctgtg cagcctgcag aagcaagagg 180agcagtacaa attactctat gaagtttgtc ag 21220923PRTHomo sapiens 209Met Gly Cys Ser Leu Cys Ser Leu Gln Lys Gln Glu Glu Gln Tyr Lys1 5 10 15Leu Leu Tyr Glu Val Cys Gln 2021069DNAHomo sapiens 210atgggatgca gtctgtgcag cctgcagaag caagaggagc agtacaaatt actctatgaa 60gtttgtcag 692112570DNAHomo sapiens 211gtgaagcgtt gcagtcgctt gactcacctg aggctctcca aggatacctt caatgcctgc 60actgtaaggg agctgctttt cccgggtgct ggcgagaacg gaagccttcc tttgacgttt 120ttctaaacat gggatgcagt ctgtgcagcc tgcagaagca agaggagcag tacaaattac 180tctatgaagt ttgtcaggtc aacggcagag acttatccag agcaactcat gaccaggctg 240tggaagcttt caagacagcc aaggagccca tagtggtgca ggtgttgaga agaacaccaa 300ggaccaaaat gttcacgcct ccatcagagt ctcagctggt ggacacggga acccaaaccg 360acatcacctt tgaacatatc atggccctca ctaagatgtc ctctcccagc ccacccgtac 420tggatcccta tctcttgcca gaggagcatc cctcagccca tgaatactac gatccaaatg 480actacattgg agacatccat caggagatgg acagggagga gctggagctg gaggaagtgg 540acctctacag aatgaacagc caggacaagc tgggcctcac tgtgtgctac cggacggacg 600atgaagacga cattgggatt tatatcagtg agattgaccc taacagcatt gcagccaagg 660atgggcgcat ccgagaagga gaccgcatta tccagattaa tgggatagag gtgcagaacc 720gtgaagaggc tgtggctctt ctaaccagtg aagaaaataa aaacttttca ttgctgattg 780caaggcctga actccagctg gatgagggct ggatggatga tgacaggaac gactttctgg 840atgacctgca catggacatg ctggaggagc agcaccacca ggccatgcaa ttcacagcta 900gcgtgctgca gcagaagaag cacgacgaag acggtgggac cacagataca gccaccatct 960tgtccaacca gcacgagaag gacagtggtg tggggcggac cgacgagagc acccgtaatg 1020acgagagctc ggagcaagag aacaatggcg acgacgccac cgcatcctcc aacccgctgg 1080cggggcagag gaagctcacc tgcagccagg acaccttggg cagcggcgac ctgcccttca 1140gcaacgagtc tttcatttcg gccgactgca cggacgccga ctacctgggg atcccggtgg 1200acgagtgcga gcgcttccgc gagctcctgg agctcaagtg ccaggtgaag agcgccaccc 1260cttacggcct gtactaccct agcggccccc tggacgccgg caagagtgac cctgagagcg 1320tggacaagga gctggagctg ctgaacgaag agctgcgcag catcgagctg gagtgcctga 1380gcatcgtgcg cgcccacaag atgcagcagc tcaaggagca gtaccgcgag tcctggatgc 1440tgcacaacag cggcttccgc aactacaaca ccagcatcga cgtgcgcaga cacgagctct 1500cagatatcac cgagctcccg gagaaatccg acaaggacag ctcgagcgcc tacaacacag 1560gcgagagctg ccgcagcacc ccgctcaccc tggagatctc ccccgacaac tccttgagga 1620gagcggcgga gggcatcagc tgcccgagca gcgaaggggc tgtggggacc acggaagcct 1680acgggccagc ctccaagaat ctgctctcca tcacggaaga tcccgaagtg ggcaccccta 1740cctatagccc gtccctgaag gagctggacc ccaaccagcc cctggaaagc aaagagcgga 1800gagccagcga cgggagccgg agccccacgc ccagccagaa gctgggcagc gcctacctgc 1860cctcctatca ccactcccca tacaagcacg cgcacatccc ggcgcacgcc cagcactacc 1920agagctacat gcagctgatc cagcagaagt cggccgtgga gtacgcgcaa agccagatga 1980gcctggtgag catgtgcaag gacctgagct ctcccacccc gtcggagccg cgcatggagt 2040ggaaggtgaa gatccgcagc gacgggacgc gctacatcac caagaggccc gtgcgggacc 2100gcctgctgcg ggagcgcgcc ctgaagatcc gggaagagcg cagcggcatg accaccgacg 2160acgacgcggt gagcgagatg aagatggggc gctactggag caaggaggag aggaagcagc 2220acctggtgaa ggccaaggag cagcggcggc ggcgcgagtt catgatgcag agcaggttgg 2280attgtctcaa ggagcagcaa gcagccgatg acaggaagga gatgaacatt ctcgaactga 2340gccacaaaaa gatgatgaag aagaggaata agaaaatctt cgataactgg atgacgatcc 2400aagaactctt aacccacggc acaaaatccc cggacggcac tagagtatac aattccttcc 2460tatcggtgac tactgtataa ttttcacttc tgcattatgt acataaagga gaccactacc 2520actggggtag aaattcctgc ctcgttcaat gcggcaagtt tttgtatata 25702122570DNAHomo sapiensCDS(129)..(2480) 212gtgaagcgtt gcagtcgctt gactcacctg aggctctcca aggatacctt caatgcctgc 60actgtaaggg agctgctttt cccgggtgct ggcgagaacg gaagccttcc tttgacgttt 120ttctaaac atg gga tgc agt ctg tgc agc ctg cag aag caa gag gag cag 170 Met Gly Cys Ser Leu Cys Ser Leu Gln Lys Gln Glu Glu Gln 1 5 10tac aaa tta ctc tat gaa gtt tgt cag gtc aac ggc aga gac tta tcc 218Tyr Lys Leu Leu Tyr Glu Val Cys Gln Val Asn Gly Arg Asp Leu Ser15 20 25 30aga gca act cat gac cag gct gtg gaa gct ttc aag aca gcc aag gag 266Arg Ala Thr His Asp Gln Ala Val Glu Ala Phe Lys Thr Ala Lys Glu 35 40 45ccc ata gtg gtg cag gtg ttg aga aga aca cca agg acc aaa atg ttc 314Pro Ile Val Val Gln Val Leu Arg Arg Thr Pro Arg Thr Lys Met Phe 50 55 60acg cct cca tca gag tct cag ctg gtg gac acg gga acc caa acc gac 362Thr Pro Pro Ser Glu Ser Gln Leu Val Asp Thr Gly Thr Gln Thr Asp 65 70 75atc acc ttt gaa cat atc atg gcc ctc act aag atg tcc tct ccc agc 410Ile Thr Phe Glu His Ile Met Ala Leu Thr Lys Met Ser Ser Pro Ser 80 85 90cca ccc gta ctg gat ccc tat ctc ttg cca gag gag cat ccc tca gcc 458Pro Pro Val Leu Asp Pro Tyr Leu Leu Pro Glu Glu His Pro Ser Ala95 100 105 110cat gaa tac tac gat cca aat gac tac att gga gac atc cat cag gag 506His Glu Tyr Tyr Asp Pro Asn Asp Tyr Ile Gly Asp Ile His Gln Glu 115 120 125atg gac agg gag gag ctg gag ctg gag gaa gtg gac ctc tac aga atg 554Met Asp Arg Glu Glu Leu Glu Leu Glu Glu Val Asp Leu Tyr Arg Met 130 135 140aac agc cag gac aag ctg ggc ctc act gtg tgc tac cgg acg gac gat 602Asn Ser Gln Asp Lys Leu Gly Leu Thr Val Cys Tyr Arg Thr Asp Asp 145 150 155gaa gac gac att ggg att tat atc agt gag att gac cct aac agc att 650Glu Asp Asp Ile Gly Ile Tyr Ile Ser Glu Ile Asp Pro Asn Ser Ile 160 165 170gca gcc aag gat ggg cgc atc cga gaa gga gac cgc att atc cag att 698Ala Ala Lys Asp Gly Arg Ile Arg Glu Gly Asp Arg Ile Ile Gln Ile175 180 185 190aat ggg ata gag gtg cag aac cgt gaa gag gct gtg gct ctt cta acc 746Asn Gly Ile Glu Val Gln Asn Arg Glu Glu Ala Val Ala Leu Leu Thr 195 200 205agt gaa gaa aat aaa aac ttt tca ttg ctg att gca agg cct gaa ctc 794Ser Glu Glu Asn Lys Asn Phe Ser Leu Leu Ile Ala Arg Pro Glu Leu 210 215 220cag ctg gat gag ggc tgg atg gat gat gac agg aac gac ttt ctg gat 842Gln Leu Asp Glu Gly Trp Met Asp Asp Asp Arg Asn Asp Phe Leu Asp 225 230 235gac ctg cac atg gac atg ctg gag gag cag cac cac cag gcc atg caa 890Asp Leu His Met Asp Met Leu Glu Glu Gln His His Gln Ala Met Gln 240 245 250ttc aca gct agc gtg ctg cag cag aag aag cac gac gaa gac ggt ggg 938Phe Thr Ala Ser Val Leu Gln Gln Lys Lys His Asp Glu Asp Gly Gly255 260 265 270acc aca gat aca gcc acc atc ttg tcc aac cag cac gag aag gac agt 986Thr Thr Asp Thr Ala Thr Ile Leu Ser Asn Gln His Glu Lys Asp Ser 275 280 285ggt gtg ggg cgg acc gac gag agc acc cgt aat gac gag agc tcg gag 1034Gly Val Gly Arg Thr Asp Glu Ser Thr Arg Asn Asp Glu Ser Ser Glu 290 295 300caa gag aac aat ggc gac gac gcc acc gca tcc tcc aac ccg ctg gcg 1082Gln Glu Asn Asn Gly Asp Asp Ala Thr Ala Ser Ser Asn Pro Leu Ala 305 310 315ggg cag agg aag ctc acc tgc agc cag gac acc ttg ggc agc ggc gac 1130Gly Gln Arg Lys Leu Thr Cys Ser Gln Asp Thr Leu Gly Ser Gly Asp 320 325 330ctg ccc ttc agc aac gag tct ttc att tcg gcc gac tgc acg gac gcc 1178Leu Pro Phe Ser Asn Glu Ser Phe Ile Ser Ala Asp Cys Thr Asp Ala335 340 345 350gac tac ctg ggg atc ccg gtg gac gag tgc gag cgc ttc cgc gag ctc 1226Asp Tyr Leu Gly Ile Pro Val Asp Glu Cys Glu Arg Phe Arg Glu Leu 355 360 365ctg gag ctc aag tgc cag gtg aag agc gcc acc cct tac ggc ctg tac 1274Leu Glu Leu Lys Cys Gln Val Lys Ser Ala Thr Pro Tyr Gly Leu Tyr 370 375 380tac cct agc ggc ccc ctg gac gcc ggc aag agt gac cct gag agc gtg 1322Tyr Pro Ser Gly Pro Leu Asp Ala Gly Lys Ser Asp Pro Glu Ser Val 385 390 395gac aag gag ctg gag ctg ctg aac gaa gag ctg cgc agc atc gag ctg 1370Asp Lys Glu Leu Glu Leu Leu Asn Glu Glu Leu Arg Ser Ile Glu Leu 400 405 410gag tgc ctg agc atc gtg cgc gcc cac aag atg cag cag ctc aag gag 1418Glu Cys Leu Ser Ile Val Arg Ala His Lys Met Gln Gln Leu Lys Glu415 420 425 430cag tac cgc gag tcc tgg atg ctg cac aac agc ggc ttc cgc aac tac 1466Gln Tyr Arg Glu Ser Trp Met Leu His Asn Ser Gly Phe Arg Asn Tyr 435 440 445aac acc agc atc gac gtg cgc aga cac gag ctc tca gat atc acc gag 1514Asn Thr Ser Ile Asp Val Arg Arg His Glu Leu Ser Asp Ile Thr Glu 450 455 460ctc ccg gag aaa tcc gac aag gac agc tcg agc gcc tac aac aca ggc 1562Leu Pro Glu Lys Ser Asp Lys Asp Ser Ser Ser Ala Tyr Asn Thr Gly 465 470 475gag agc tgc cgc agc acc ccg ctc acc ctg gag atc tcc ccc gac aac 1610Glu Ser Cys Arg Ser Thr Pro Leu Thr Leu Glu Ile Ser Pro Asp Asn 480 485 490tcc ttg agg aga gcg gcg gag ggc atc agc tgc ccg agc agc gaa ggg 1658Ser Leu Arg Arg Ala Ala Glu Gly Ile Ser Cys Pro Ser Ser Glu Gly495 500 505 510gct gtg ggg acc acg gaa gcc tac ggg cca gcc tcc aag aat ctg ctc 1706Ala Val Gly Thr Thr Glu Ala Tyr Gly Pro Ala Ser Lys Asn Leu Leu 515 520 525tcc atc acg gaa gat ccc gaa gtg ggc acc cct acc tat agc ccg tcc 1754Ser Ile Thr Glu Asp Pro Glu Val Gly Thr Pro Thr Tyr Ser Pro Ser 530 535 540ctg aag gag ctg gac ccc aac cag ccc ctg gaa agc aaa gag cgg aga 1802Leu Lys Glu Leu Asp Pro Asn Gln Pro Leu Glu Ser Lys Glu Arg Arg 545 550 555gcc agc gac ggg agc cgg agc ccc acg ccc agc cag aag ctg ggc agc 1850Ala Ser Asp Gly Ser Arg Ser Pro Thr Pro Ser Gln Lys Leu Gly Ser 560 565 570gcc tac ctg ccc tcc tat cac cac tcc cca tac aag cac gcg cac atc 1898Ala Tyr Leu Pro Ser Tyr His His Ser Pro Tyr Lys His Ala His Ile575 580 585 590ccg gcg cac gcc cag cac tac cag agc tac atg cag ctg atc cag cag 1946Pro Ala His Ala Gln His Tyr Gln Ser Tyr Met Gln Leu Ile Gln Gln 595 600 605aag tcg gcc gtg gag tac gcg caa agc cag atg agc ctg gtg agc atg 1994Lys Ser Ala Val Glu Tyr Ala Gln Ser Gln Met Ser Leu Val Ser Met 610 615 620tgc aag gac ctg agc tct ccc acc ccg tcg gag ccg cgc atg gag tgg 2042Cys Lys Asp Leu Ser Ser Pro Thr Pro Ser Glu Pro Arg Met Glu Trp 625 630 635aag gtg aag atc cgc agc gac ggg acg cgc tac atc acc aag agg ccc 2090Lys Val Lys Ile Arg Ser Asp Gly Thr Arg Tyr Ile Thr Lys Arg Pro 640 645 650gtg cgg gac cgc ctg ctg cgg gag cgc gcc ctg aag atc cgg gaa gag 2138Val Arg Asp Arg Leu Leu Arg Glu Arg Ala Leu Lys Ile Arg Glu Glu655 660 665 670cgc agc ggc atg acc acc gac gac gac gcg gtg agc gag atg aag atg 2186Arg Ser Gly Met Thr Thr Asp Asp Asp Ala Val Ser Glu Met Lys Met 675 680 685ggg cgc tac tgg agc aag gag gag agg aag cag cac ctg gtg aag gcc 2234Gly Arg Tyr Trp Ser Lys Glu Glu Arg Lys Gln His Leu Val Lys Ala 690 695 700aag gag cag cgg cgg cgg cgc gag ttc atg atg cag agc agg ttg gat 2282Lys Glu Gln Arg Arg Arg Arg Glu Phe Met Met Gln Ser Arg Leu Asp 705 710 715tgt ctc aag gag cag caa gca gcc gat gac agg aag gag atg aac att 2330Cys Leu Lys Glu Gln Gln Ala Ala Asp Asp Arg Lys Glu Met Asn Ile 720 725 730ctc gaa ctg agc cac aaa aag atg atg aag aag agg aat aag aaa atc 2378Leu Glu Leu Ser His Lys Lys Met Met Lys Lys Arg Asn Lys Lys Ile735 740 745 750ttc gat aac tgg atg acg atc caa gaa ctc tta acc cac ggc aca aaa 2426Phe Asp Asn Trp Met Thr Ile Gln Glu Leu Leu Thr His Gly Thr Lys 755 760 765tcc ccg gac ggc act aga gta tac aat tcc ttc cta tcg gtg act act 2474Ser Pro Asp Gly Thr Arg Val Tyr Asn Ser Phe Leu Ser Val Thr Thr 770 775 780gta taa ttttcacttc tgcattatgt acataaagga gaccactacc actggggtag 2530Valaaattcctgc ctcgttcaat gcggcaagtt tttgtatata 2570213783PRTHomo sapiens 213Met Gly Cys Ser Leu Cys Ser Leu Gln Lys Gln Glu Glu Gln Tyr Lys1 5 10 15Leu Leu Tyr Glu Val Cys Gln Val Asn Gly Arg Asp Leu Ser Arg Ala 20 25 30Thr His Asp Gln Ala Val Glu Ala Phe Lys Thr Ala Lys Glu Pro Ile 35 40 45Val Val Gln Val Leu Arg Arg Thr Pro Arg Thr Lys Met Phe Thr Pro 50 55 60Pro Ser Glu Ser Gln Leu

Val Asp Thr Gly Thr Gln Thr Asp Ile Thr65 70 75 80Phe Glu His Ile Met Ala Leu Thr Lys Met Ser Ser Pro Ser Pro Pro 85 90 95Val Leu Asp Pro Tyr Leu Leu Pro Glu Glu His Pro Ser Ala His Glu 100 105 110Tyr Tyr Asp Pro Asn Asp Tyr Ile Gly Asp Ile His Gln Glu Met Asp 115 120 125Arg Glu Glu Leu Glu Leu Glu Glu Val Asp Leu Tyr Arg Met Asn Ser 130 135 140Gln Asp Lys Leu Gly Leu Thr Val Cys Tyr Arg Thr Asp Asp Glu Asp145 150 155 160Asp Ile Gly Ile Tyr Ile Ser Glu Ile Asp Pro Asn Ser Ile Ala Ala 165 170 175Lys Asp Gly Arg Ile Arg Glu Gly Asp Arg Ile Ile Gln Ile Asn Gly 180 185 190Ile Glu Val Gln Asn Arg Glu Glu Ala Val Ala Leu Leu Thr Ser Glu 195 200 205Glu Asn Lys Asn Phe Ser Leu Leu Ile Ala Arg Pro Glu Leu Gln Leu 210 215 220Asp Glu Gly Trp Met Asp Asp Asp Arg Asn Asp Phe Leu Asp Asp Leu225 230 235 240His Met Asp Met Leu Glu Glu Gln His His Gln Ala Met Gln Phe Thr 245 250 255Ala Ser Val Leu Gln Gln Lys Lys His Asp Glu Asp Gly Gly Thr Thr 260 265 270Asp Thr Ala Thr Ile Leu Ser Asn Gln His Glu Lys Asp Ser Gly Val 275 280 285Gly Arg Thr Asp Glu Ser Thr Arg Asn Asp Glu Ser Ser Glu Gln Glu 290 295 300Asn Asn Gly Asp Asp Ala Thr Ala Ser Ser Asn Pro Leu Ala Gly Gln305 310 315 320Arg Lys Leu Thr Cys Ser Gln Asp Thr Leu Gly Ser Gly Asp Leu Pro 325 330 335Phe Ser Asn Glu Ser Phe Ile Ser Ala Asp Cys Thr Asp Ala Asp Tyr 340 345 350Leu Gly Ile Pro Val Asp Glu Cys Glu Arg Phe Arg Glu Leu Leu Glu 355 360 365Leu Lys Cys Gln Val Lys Ser Ala Thr Pro Tyr Gly Leu Tyr Tyr Pro 370 375 380Ser Gly Pro Leu Asp Ala Gly Lys Ser Asp Pro Glu Ser Val Asp Lys385 390 395 400Glu Leu Glu Leu Leu Asn Glu Glu Leu Arg Ser Ile Glu Leu Glu Cys 405 410 415Leu Ser Ile Val Arg Ala His Lys Met Gln Gln Leu Lys Glu Gln Tyr 420 425 430Arg Glu Ser Trp Met Leu His Asn Ser Gly Phe Arg Asn Tyr Asn Thr 435 440 445Ser Ile Asp Val Arg Arg His Glu Leu Ser Asp Ile Thr Glu Leu Pro 450 455 460Glu Lys Ser Asp Lys Asp Ser Ser Ser Ala Tyr Asn Thr Gly Glu Ser465 470 475 480Cys Arg Ser Thr Pro Leu Thr Leu Glu Ile Ser Pro Asp Asn Ser Leu 485 490 495Arg Arg Ala Ala Glu Gly Ile Ser Cys Pro Ser Ser Glu Gly Ala Val 500 505 510Gly Thr Thr Glu Ala Tyr Gly Pro Ala Ser Lys Asn Leu Leu Ser Ile 515 520 525Thr Glu Asp Pro Glu Val Gly Thr Pro Thr Tyr Ser Pro Ser Leu Lys 530 535 540Glu Leu Asp Pro Asn Gln Pro Leu Glu Ser Lys Glu Arg Arg Ala Ser545 550 555 560Asp Gly Ser Arg Ser Pro Thr Pro Ser Gln Lys Leu Gly Ser Ala Tyr 565 570 575Leu Pro Ser Tyr His His Ser Pro Tyr Lys His Ala His Ile Pro Ala 580 585 590His Ala Gln His Tyr Gln Ser Tyr Met Gln Leu Ile Gln Gln Lys Ser 595 600 605Ala Val Glu Tyr Ala Gln Ser Gln Met Ser Leu Val Ser Met Cys Lys 610 615 620Asp Leu Ser Ser Pro Thr Pro Ser Glu Pro Arg Met Glu Trp Lys Val625 630 635 640Lys Ile Arg Ser Asp Gly Thr Arg Tyr Ile Thr Lys Arg Pro Val Arg 645 650 655Asp Arg Leu Leu Arg Glu Arg Ala Leu Lys Ile Arg Glu Glu Arg Ser 660 665 670Gly Met Thr Thr Asp Asp Asp Ala Val Ser Glu Met Lys Met Gly Arg 675 680 685Tyr Trp Ser Lys Glu Glu Arg Lys Gln His Leu Val Lys Ala Lys Glu 690 695 700Gln Arg Arg Arg Arg Glu Phe Met Met Gln Ser Arg Leu Asp Cys Leu705 710 715 720Lys Glu Gln Gln Ala Ala Asp Asp Arg Lys Glu Met Asn Ile Leu Glu 725 730 735Leu Ser His Lys Lys Met Met Lys Lys Arg Asn Lys Lys Ile Phe Asp 740 745 750Asn Trp Met Thr Ile Gln Glu Leu Leu Thr His Gly Thr Lys Ser Pro 755 760 765Asp Gly Thr Arg Val Tyr Asn Ser Phe Leu Ser Val Thr Thr Val 770 775 780214197DNAHomo sapiens 214gtgaagcgtt gcagtcgctt gactcacctg aggctctcca aggatacctt caatgcctgc 60actgtaaggg agctgctttt cccgggtgct ggcgagaacg gaagccttcc tttgacgttt 120ttctaaacat gggatgcagt ctgtgcagcc tgcagaagca agaggagcag tacaaattac 180tctatgaagt ttgtcag 19721523PRTHomo sapiens 215Met Gly Cys Ser Leu Cys Ser Leu Gln Lys Gln Glu Glu Gln Tyr Lys1 5 10 15Leu Leu Tyr Glu Val Cys Gln 2021669DNAHomo sapiens 216atgggatgca gtctgtgcag cctgcagaag caagaggagc agtacaaatt actctatgaa 60gtttgtcag 692172698DNAHomo sapiens 217agcatttgct cagtactctc aacataaacc aagcttagat cgcaatgaaa ctgcggacct 60gcaggctgcc ttgcttcact gagtcaattt ttaaatgaag atggaacaca agaggatgat 120cttattgttc agtaaattta ctctaataca taaaccccat ggtggaagat ggtatttggt 180caacggcaga gacttatcca gagcaactca tgaccaggct gtggaagctt tcaagacagc 240caaggagccc atagtggtgc aggtgttgag aagaacacca aggaccaaaa tgttcacgcc 300tccatcagag tctcagctgg tggacacggg aacccaaacc gacatcacct ttgaacatat 360catggccctc actaagatgt cctctcccag cccacccgta ctggatccct atctcttgcc 420agaggagcat ccctcagccc atgaatacta cgatccaaat gactacattg gagacatcca 480tcaggagatg gacagggagg agctggagct ggaggaagtg gacctctaca gaatgaacag 540ccaggacaag ctgggcctca ctgtgtgcta ccggacggac gatgaagacg acattgggat 600ttatatcagt gagattgacc ctaacagcat tgcagccaag gatgggcgca tccgagaagg 660agaccgcatt atccagatta atgggataga ggtgcagaac cgtgaagagg ctgtggctct 720tctaaccagt gaagaaaata aaaacttttc attgctgatt gcaaggcctg aactccagct 780ggatgagggc tggatggatg atgacaggaa cgactttctg gatgacctgc acatggacat 840gctggaggag cagcaccacc aggccatgca attcacagct agcgtgctgc agcagaagaa 900gcacgacgaa gacggtggga ccacagatac agccaccatc ttgtccaacc agcacgagaa 960ggacagcggt gtggggcgga ccgacgagag cacccgtaat gacgagagct cggagcaaga 1020gaacaatggc gacgacgcca ccgcatcctc caacccgctg gcggggcaga ggaagctcac 1080ctgcagccag gacaccttgg gcagcggcga cctgcccttc agcaacgagt ctttcatttc 1140ggccgactgc acggacgccg actacctggg gatcccggtg gacgagtgcg agcgcttccg 1200cgagctcctg gagctcaagt gccaggtgaa gagcgccacc ccttacggcc tgtactaccc 1260tagcggcccc ctggacgccg gcaagagtga ccctgagagc gtggacaagg agctggagct 1320gctgaacgaa gagctgcgca gcatcgagct ggagtgcctg agcatcgtgc gcgcccacaa 1380gatgcagcag ctcaaggagc agtaccgcga gtcctggatg ctgcacaaca gcggcttccg 1440caactacaac accagcatcg acgtgcgcag acacgagctc tcggatatca ccgagctccc 1500ggagaaatcc gacaaggaca gctcgagcgc ctacaacaca ggcgagagct gccgcagcac 1560cccgctcacc ctggagatct cccccgacaa ctccttgagg agagcggtgg agggcatcag 1620ctgcccgagc agcgaagggg ctgtggggac cacggaagcc tacgggccag cctccaagaa 1680tctgctctcc atcacggaag atcccgaagt gggcacccct acctatagcc cgtccctgaa 1740ggagctggac cccaaccagc ccctggaaag caaagagcgg agagccagcg acgggagccg 1800gagccccacg cccagccaga agctgggcag cgcctacctg ccctcctatc accactcccc 1860atacaagcac gcgcacatcc cggcgcacgc ccagcactac cagagctaca tgcagctgat 1920ccagcagaag tcggccgtgg agtacgcgca aagccagatg agcctggtga gcatgtgcaa 1980ggacctgagc tctcccaccc cgtcggagcc gcgcatggag tggaaggtga agatccgcag 2040cgacgggacg cgctacatca ccaagaggcc cgtgcgggac cgcctgctgc gggagcgcgc 2100cctgaagatc cgggaagagc gcagcggcat gaccaccgac gacgacgcgg tgagcgagat 2160gaagatgggg cgctactgga gcaaggagga gaggaagcag cacctggtga aggccaagga 2220gcagcggcgg cggcgcgagt tcatgatgca gagcaggttg gattgtctca aggagcagca 2280agcagccgat gacaggaagg agatgaacat tctcgaactg agccacaaaa agatgatgaa 2340gaagaggaat aagaaaatct tcgataactg gatgacgatc caagaactct taacccacgg 2400cacaaaatcc ccggacggca ctagagtata caattccttc ctatcggtga ctactgtata 2460attttcactt ctgcattatg tacataaagg agaccactac cactggggta gaaattcctg 2520cctcgttcaa tgcggcaagt ttttgtatat aagataagta cggtcttcat gtttatagtc 2580caaatttgca aaccctacaa ctctgggtgt cataggtcta ttttaaggga agagagagaa 2640aaacaccctt actatcttgg aaggcaatat taacaaacag agcttttttc aaatagca 26982182698DNAHomo sapiensCDS(95)..(2461) 218agcatttgct cagtactctc aacataaacc aagcttagat cgcaatgaaa ctgcggacct 60gcaggctgcc ttgcttcact gagtcaattt ttaa atg aag atg gaa cac aag agg 115 Met Lys Met Glu His Lys Arg 1 5atg atc tta ttg ttc agt aaa ttt act cta ata cat aaa ccc cat ggt 163Met Ile Leu Leu Phe Ser Lys Phe Thr Leu Ile His Lys Pro His Gly 10 15 20gga aga tgg tat ttg gtc aac ggc aga gac tta tcc aga gca act cat 211Gly Arg Trp Tyr Leu Val Asn Gly Arg Asp Leu Ser Arg Ala Thr His 25 30 35gac cag gct gtg gaa gct ttc aag aca gcc aag gag ccc ata gtg gtg 259Asp Gln Ala Val Glu Ala Phe Lys Thr Ala Lys Glu Pro Ile Val Val40 45 50 55cag gtg ttg aga aga aca cca agg acc aaa atg ttc acg cct cca tca 307Gln Val Leu Arg Arg Thr Pro Arg Thr Lys Met Phe Thr Pro Pro Ser 60 65 70gag tct cag ctg gtg gac acg gga acc caa acc gac atc acc ttt gaa 355Glu Ser Gln Leu Val Asp Thr Gly Thr Gln Thr Asp Ile Thr Phe Glu 75 80 85cat atc atg gcc ctc act aag atg tcc tct ccc agc cca ccc gta ctg 403His Ile Met Ala Leu Thr Lys Met Ser Ser Pro Ser Pro Pro Val Leu 90 95 100gat ccc tat ctc ttg cca gag gag cat ccc tca gcc cat gaa tac tac 451Asp Pro Tyr Leu Leu Pro Glu Glu His Pro Ser Ala His Glu Tyr Tyr 105 110 115gat cca aat gac tac att gga gac atc cat cag gag atg gac agg gag 499Asp Pro Asn Asp Tyr Ile Gly Asp Ile His Gln Glu Met Asp Arg Glu120 125 130 135gag ctg gag ctg gag gaa gtg gac ctc tac aga atg aac agc cag gac 547Glu Leu Glu Leu Glu Glu Val Asp Leu Tyr Arg Met Asn Ser Gln Asp 140 145 150aag ctg ggc ctc act gtg tgc tac cgg acg gac gat gaa gac gac att 595Lys Leu Gly Leu Thr Val Cys Tyr Arg Thr Asp Asp Glu Asp Asp Ile 155 160 165ggg att tat atc agt gag att gac cct aac agc att gca gcc aag gat 643Gly Ile Tyr Ile Ser Glu Ile Asp Pro Asn Ser Ile Ala Ala Lys Asp 170 175 180ggg cgc atc cga gaa gga gac cgc att atc cag att aat ggg ata gag 691Gly Arg Ile Arg Glu Gly Asp Arg Ile Ile Gln Ile Asn Gly Ile Glu 185 190 195gtg cag aac cgt gaa gag gct gtg gct ctt cta acc agt gaa gaa aat 739Val Gln Asn Arg Glu Glu Ala Val Ala Leu Leu Thr Ser Glu Glu Asn200 205 210 215aaa aac ttt tca ttg ctg att gca agg cct gaa ctc cag ctg gat gag 787Lys Asn Phe Ser Leu Leu Ile Ala Arg Pro Glu Leu Gln Leu Asp Glu 220 225 230ggc tgg atg gat gat gac agg aac gac ttt ctg gat gac ctg cac atg 835Gly Trp Met Asp Asp Asp Arg Asn Asp Phe Leu Asp Asp Leu His Met 235 240 245gac atg ctg gag gag cag cac cac cag gcc atg caa ttc aca gct agc 883Asp Met Leu Glu Glu Gln His His Gln Ala Met Gln Phe Thr Ala Ser 250 255 260gtg ctg cag cag aag aag cac gac gaa gac ggt ggg acc aca gat aca 931Val Leu Gln Gln Lys Lys His Asp Glu Asp Gly Gly Thr Thr Asp Thr 265 270 275gcc acc atc ttg tcc aac cag cac gag aag gac agc ggt gtg ggg cgg 979Ala Thr Ile Leu Ser Asn Gln His Glu Lys Asp Ser Gly Val Gly Arg280 285 290 295acc gac gag agc acc cgt aat gac gag agc tcg gag caa gag aac aat 1027Thr Asp Glu Ser Thr Arg Asn Asp Glu Ser Ser Glu Gln Glu Asn Asn 300 305 310ggc gac gac gcc acc gca tcc tcc aac ccg ctg gcg ggg cag agg aag 1075Gly Asp Asp Ala Thr Ala Ser Ser Asn Pro Leu Ala Gly Gln Arg Lys 315 320 325ctc acc tgc agc cag gac acc ttg ggc agc ggc gac ctg ccc ttc agc 1123Leu Thr Cys Ser Gln Asp Thr Leu Gly Ser Gly Asp Leu Pro Phe Ser 330 335 340aac gag tct ttc att tcg gcc gac tgc acg gac gcc gac tac ctg ggg 1171Asn Glu Ser Phe Ile Ser Ala Asp Cys Thr Asp Ala Asp Tyr Leu Gly 345 350 355atc ccg gtg gac gag tgc gag cgc ttc cgc gag ctc ctg gag ctc aag 1219Ile Pro Val Asp Glu Cys Glu Arg Phe Arg Glu Leu Leu Glu Leu Lys360 365 370 375tgc cag gtg aag agc gcc acc cct tac ggc ctg tac tac cct agc ggc 1267Cys Gln Val Lys Ser Ala Thr Pro Tyr Gly Leu Tyr Tyr Pro Ser Gly 380 385 390ccc ctg gac gcc ggc aag agt gac cct gag agc gtg gac aag gag ctg 1315Pro Leu Asp Ala Gly Lys Ser Asp Pro Glu Ser Val Asp Lys Glu Leu 395 400 405gag ctg ctg aac gaa gag ctg cgc agc atc gag ctg gag tgc ctg agc 1363Glu Leu Leu Asn Glu Glu Leu Arg Ser Ile Glu Leu Glu Cys Leu Ser 410 415 420atc gtg cgc gcc cac aag atg cag cag ctc aag gag cag tac cgc gag 1411Ile Val Arg Ala His Lys Met Gln Gln Leu Lys Glu Gln Tyr Arg Glu 425 430 435tcc tgg atg ctg cac aac agc ggc ttc cgc aac tac aac acc agc atc 1459Ser Trp Met Leu His Asn Ser Gly Phe Arg Asn Tyr Asn Thr Ser Ile440 445 450 455gac gtg cgc aga cac gag ctc tcg gat atc acc gag ctc ccg gag aaa 1507Asp Val Arg Arg His Glu Leu Ser Asp Ile Thr Glu Leu Pro Glu Lys 460 465 470tcc gac aag gac agc tcg agc gcc tac aac aca ggc gag agc tgc cgc 1555Ser Asp Lys Asp Ser Ser Ser Ala Tyr Asn Thr Gly Glu Ser Cys Arg 475 480 485agc acc ccg ctc acc ctg gag atc tcc ccc gac aac tcc ttg agg aga 1603Ser Thr Pro Leu Thr Leu Glu Ile Ser Pro Asp Asn Ser Leu Arg Arg 490 495 500gcg gtg gag ggc atc agc tgc ccg agc agc gaa ggg gct gtg ggg acc 1651Ala Val Glu Gly Ile Ser Cys Pro Ser Ser Glu Gly Ala Val Gly Thr 505 510 515acg gaa gcc tac ggg cca gcc tcc aag aat ctg ctc tcc atc acg gaa 1699Thr Glu Ala Tyr Gly Pro Ala Ser Lys Asn Leu Leu Ser Ile Thr Glu520 525 530 535gat ccc gaa gtg ggc acc cct acc tat agc ccg tcc ctg aag gag ctg 1747Asp Pro Glu Val Gly Thr Pro Thr Tyr Ser Pro Ser Leu Lys Glu Leu 540 545 550gac ccc aac cag ccc ctg gaa agc aaa gag cgg aga gcc agc gac ggg 1795Asp Pro Asn Gln Pro Leu Glu Ser Lys Glu Arg Arg Ala Ser Asp Gly 555 560 565agc cgg agc ccc acg ccc agc cag aag ctg ggc agc gcc tac ctg ccc 1843Ser Arg Ser Pro Thr Pro Ser Gln Lys Leu Gly Ser Ala Tyr Leu Pro 570 575 580tcc tat cac cac tcc cca tac aag cac gcg cac atc ccg gcg cac gcc 1891Ser Tyr His His Ser Pro Tyr Lys His Ala His Ile Pro Ala His Ala 585 590 595cag cac tac cag agc tac atg cag ctg atc cag cag aag tcg gcc gtg 1939Gln His Tyr Gln Ser Tyr Met Gln Leu Ile Gln Gln Lys Ser Ala Val600 605 610 615gag tac gcg caa agc cag atg agc ctg gtg agc atg tgc aag gac ctg 1987Glu Tyr Ala Gln Ser Gln Met Ser Leu Val Ser Met Cys Lys Asp Leu 620 625 630agc tct ccc acc ccg tcg gag ccg cgc atg gag tgg aag gtg aag atc 2035Ser Ser Pro Thr Pro Ser Glu Pro Arg Met Glu Trp Lys Val Lys Ile 635 640 645cgc agc gac ggg acg cgc tac atc acc aag agg ccc gtg cgg gac cgc 2083Arg Ser Asp Gly Thr Arg Tyr Ile Thr Lys Arg Pro Val Arg Asp Arg 650 655 660ctg ctg cgg gag cgc gcc ctg aag atc cgg gaa gag cgc agc ggc atg 2131Leu Leu Arg Glu Arg Ala Leu Lys Ile Arg Glu Glu Arg Ser Gly Met 665 670 675acc acc gac gac gac gcg gtg agc gag atg aag atg ggg cgc tac tgg 2179Thr Thr Asp Asp Asp Ala Val Ser Glu Met Lys Met Gly Arg Tyr Trp680 685 690 695agc aag gag gag agg aag cag cac ctg gtg aag gcc aag gag cag cgg 2227Ser Lys Glu Glu Arg Lys Gln His Leu Val Lys Ala Lys Glu Gln Arg 700 705 710cgg cgg cgc gag ttc atg atg cag agc agg ttg gat tgt ctc aag gag 2275Arg Arg Arg Glu Phe Met Met Gln Ser Arg Leu Asp Cys Leu Lys Glu 715 720 725cag caa gca gcc gat gac agg aag gag atg aac att ctc gaa ctg agc 2323Gln Gln Ala Ala Asp Asp Arg Lys Glu Met Asn Ile Leu Glu Leu Ser 730 735

740cac aaa aag atg atg aag aag agg aat aag aaa atc ttc gat aac tgg 2371His Lys Lys Met Met Lys Lys Arg Asn Lys Lys Ile Phe Asp Asn Trp 745 750 755atg acg atc caa gaa ctc tta acc cac ggc aca aaa tcc ccg gac ggc 2419Met Thr Ile Gln Glu Leu Leu Thr His Gly Thr Lys Ser Pro Asp Gly760 765 770 775act aga gta tac aat tcc ttc cta tcg gtg act act gta taa 2461Thr Arg Val Tyr Asn Ser Phe Leu Ser Val Thr Thr Val 780 785ttttcacttc tgcattatgt acataaagga gaccactacc actggggtag aaattcctgc 2521ctcgttcaat gcggcaagtt tttgtatata agataagtac ggtcttcatg tttatagtcc 2581aaatttgcaa accctacaac tctgggtgtc ataggtctat tttaagggaa gagagagaaa 2641aacaccctta ctatcttgga aggcaatatt aacaaacaga gcttttttca aatagca 2698219788PRTHomo sapiens 219Met Lys Met Glu His Lys Arg Met Ile Leu Leu Phe Ser Lys Phe Thr1 5 10 15Leu Ile His Lys Pro His Gly Gly Arg Trp Tyr Leu Val Asn Gly Arg 20 25 30Asp Leu Ser Arg Ala Thr His Asp Gln Ala Val Glu Ala Phe Lys Thr 35 40 45Ala Lys Glu Pro Ile Val Val Gln Val Leu Arg Arg Thr Pro Arg Thr 50 55 60Lys Met Phe Thr Pro Pro Ser Glu Ser Gln Leu Val Asp Thr Gly Thr65 70 75 80Gln Thr Asp Ile Thr Phe Glu His Ile Met Ala Leu Thr Lys Met Ser 85 90 95Ser Pro Ser Pro Pro Val Leu Asp Pro Tyr Leu Leu Pro Glu Glu His 100 105 110Pro Ser Ala His Glu Tyr Tyr Asp Pro Asn Asp Tyr Ile Gly Asp Ile 115 120 125His Gln Glu Met Asp Arg Glu Glu Leu Glu Leu Glu Glu Val Asp Leu 130 135 140Tyr Arg Met Asn Ser Gln Asp Lys Leu Gly Leu Thr Val Cys Tyr Arg145 150 155 160Thr Asp Asp Glu Asp Asp Ile Gly Ile Tyr Ile Ser Glu Ile Asp Pro 165 170 175Asn Ser Ile Ala Ala Lys Asp Gly Arg Ile Arg Glu Gly Asp Arg Ile 180 185 190Ile Gln Ile Asn Gly Ile Glu Val Gln Asn Arg Glu Glu Ala Val Ala 195 200 205Leu Leu Thr Ser Glu Glu Asn Lys Asn Phe Ser Leu Leu Ile Ala Arg 210 215 220Pro Glu Leu Gln Leu Asp Glu Gly Trp Met Asp Asp Asp Arg Asn Asp225 230 235 240Phe Leu Asp Asp Leu His Met Asp Met Leu Glu Glu Gln His His Gln 245 250 255Ala Met Gln Phe Thr Ala Ser Val Leu Gln Gln Lys Lys His Asp Glu 260 265 270Asp Gly Gly Thr Thr Asp Thr Ala Thr Ile Leu Ser Asn Gln His Glu 275 280 285Lys Asp Ser Gly Val Gly Arg Thr Asp Glu Ser Thr Arg Asn Asp Glu 290 295 300Ser Ser Glu Gln Glu Asn Asn Gly Asp Asp Ala Thr Ala Ser Ser Asn305 310 315 320Pro Leu Ala Gly Gln Arg Lys Leu Thr Cys Ser Gln Asp Thr Leu Gly 325 330 335Ser Gly Asp Leu Pro Phe Ser Asn Glu Ser Phe Ile Ser Ala Asp Cys 340 345 350Thr Asp Ala Asp Tyr Leu Gly Ile Pro Val Asp Glu Cys Glu Arg Phe 355 360 365Arg Glu Leu Leu Glu Leu Lys Cys Gln Val Lys Ser Ala Thr Pro Tyr 370 375 380Gly Leu Tyr Tyr Pro Ser Gly Pro Leu Asp Ala Gly Lys Ser Asp Pro385 390 395 400Glu Ser Val Asp Lys Glu Leu Glu Leu Leu Asn Glu Glu Leu Arg Ser 405 410 415Ile Glu Leu Glu Cys Leu Ser Ile Val Arg Ala His Lys Met Gln Gln 420 425 430Leu Lys Glu Gln Tyr Arg Glu Ser Trp Met Leu His Asn Ser Gly Phe 435 440 445Arg Asn Tyr Asn Thr Ser Ile Asp Val Arg Arg His Glu Leu Ser Asp 450 455 460Ile Thr Glu Leu Pro Glu Lys Ser Asp Lys Asp Ser Ser Ser Ala Tyr465 470 475 480Asn Thr Gly Glu Ser Cys Arg Ser Thr Pro Leu Thr Leu Glu Ile Ser 485 490 495Pro Asp Asn Ser Leu Arg Arg Ala Val Glu Gly Ile Ser Cys Pro Ser 500 505 510Ser Glu Gly Ala Val Gly Thr Thr Glu Ala Tyr Gly Pro Ala Ser Lys 515 520 525Asn Leu Leu Ser Ile Thr Glu Asp Pro Glu Val Gly Thr Pro Thr Tyr 530 535 540Ser Pro Ser Leu Lys Glu Leu Asp Pro Asn Gln Pro Leu Glu Ser Lys545 550 555 560Glu Arg Arg Ala Ser Asp Gly Ser Arg Ser Pro Thr Pro Ser Gln Lys 565 570 575Leu Gly Ser Ala Tyr Leu Pro Ser Tyr His His Ser Pro Tyr Lys His 580 585 590Ala His Ile Pro Ala His Ala Gln His Tyr Gln Ser Tyr Met Gln Leu 595 600 605Ile Gln Gln Lys Ser Ala Val Glu Tyr Ala Gln Ser Gln Met Ser Leu 610 615 620Val Ser Met Cys Lys Asp Leu Ser Ser Pro Thr Pro Ser Glu Pro Arg625 630 635 640Met Glu Trp Lys Val Lys Ile Arg Ser Asp Gly Thr Arg Tyr Ile Thr 645 650 655Lys Arg Pro Val Arg Asp Arg Leu Leu Arg Glu Arg Ala Leu Lys Ile 660 665 670Arg Glu Glu Arg Ser Gly Met Thr Thr Asp Asp Asp Ala Val Ser Glu 675 680 685Met Lys Met Gly Arg Tyr Trp Ser Lys Glu Glu Arg Lys Gln His Leu 690 695 700Val Lys Ala Lys Glu Gln Arg Arg Arg Arg Glu Phe Met Met Gln Ser705 710 715 720Arg Leu Asp Cys Leu Lys Glu Gln Gln Ala Ala Asp Asp Arg Lys Glu 725 730 735Met Asn Ile Leu Glu Leu Ser His Lys Lys Met Met Lys Lys Arg Asn 740 745 750Lys Lys Ile Phe Asp Asn Trp Met Thr Ile Gln Glu Leu Leu Thr His 755 760 765Gly Thr Lys Ser Pro Asp Gly Thr Arg Val Tyr Asn Ser Phe Leu Ser 770 775 780Val Thr Thr Val785220178DNAHomo sapiens 220agcatttgct cagtactctc aacataaacc aagcttagat cgcaatgaaa ctgcggacct 60gcaggctgcc ttgcttcact gagtcaattt ttaaatgaag atggaacaca agaggatgat 120cttattgttc agtaaattta ctctaataca taaaccccat ggtggaagat ggtatttg 17822128PRTHomo sapiens 221Met Lys Met Glu His Lys Arg Met Ile Leu Leu Phe Ser Lys Phe Thr1 5 10 15Leu Ile His Lys Pro His Gly Gly Arg Trp Tyr Leu 20 2522284DNAHomo sapiens 222atgaagatgg aacacaagag gatgatctta ttgttcagta aatttactct aatacataaa 60ccccatggtg gaagatggta tttg 842232927DNAHomo sapiens 223agcatttgct cagtactctc aacataaacc aagcttagat cgcaatgaaa ctgcggacct 60gcaggctgcc ttgcttcact gagtcaattt ttaaatgaag atggaacaca agaggatgat 120cttattgttc agtaaattta ctctaataca taaaccccat ggtggaagat ggtatttggt 180caacggcaga gacttatcca gagcaactca tgaccaggct gtggaagctt tcaagacagc 240caaggagccc atagtggtgc aggtgttgag aagaacacca aggaccaaaa tgttcacgcc 300tccatcagag tctcagctgg tggacacggg aacccaaacc gacatcacct ttgaacatat 360catggccctc actaagatgt cctctcccag cccacccgta ctggatccct atctcttgcc 420agaggagcat ccctcagccc atgaatacta cgatccaaat gactacattg gagacatcca 480tcaggagatg gacagggagg agctggagct ggaggaagtg gacctctaca gaatgaacag 540ccaggacaag ctgggcctca ctgtgtgcta ccggacggac gatgaagacg acattgggat 600ttacatcagt gagattgacc ctaacagcat tgcagccaag gatgggcgca tccgagaagg 660agaccgcatt atccagatta atgggataga ggtgcagaac cgtgaagagg ctgtggctct 720tctaaccagt gaagaaaata aaaacttttc attgctgatt gcaaggcctg aactccagct 780ggatgagggc tggatggatg atgacaggaa cgactttctg gatgacctgc acatggacat 840gctggaggag cagcaccacc aggccatgca attcacagct agcgtgctgc agcagaagaa 900gcacgacgaa gacggtggga ccacagatac agccaccatc ttgtccaacc agcacgagaa 960ggacagcggt gtggggcgga ccgacgagag cacccgtaac gacgagagct cggagcaaga 1020gaacaatggc gacgacgcca ccgcatcctc caacccgctg gcggggcaga ggaagctcac 1080ctgcagccag gacaccttgg gcagcggcga cctgcccttc agcaacgagt ctttcatttc 1140ggccgactgc acggacgccg actacctggg gatcacggtg gacgagtgcg agcgcttccg 1200cgagctcctg gagctcaagt gccaggtgaa gagcgccacc ccttacggcc tgtactaccc 1260tagcggcccc ctggacgccg gcaagagtga ccctgagagc gtggacaagg agctggagct 1320gctgaacgaa gagctgcgca gcatcgagct ggagtgcctg agcatcgtgc gcgcccacaa 1380gatgcagcag ctcaaggagc agtaccgcga gtcctggatg ctgcacaaca gcggcttccg 1440caactacaac accagcatcg acgtgcgcag acacgagctc tcagatatca ccgagctccc 1500ggagaaatcc gacaaggaca gctcgagcgc ctacaacaca ggcgagagct gccgcagcac 1560cccgctcacc ctggagatct cccccgacaa ctccttgagg agagcggcgg agggcatcag 1620ctgcccgagc agcgaagggg ctgtggggac cacggaagcc tacgggccag cctccaagaa 1680tctgctctcc atcacggaag atcccgaagt gggcacccct acctatagcc cgtccctgaa 1740ggagctggac cccaaccagc ccctggaaag caaagagcgg agagccagcg acgggagccg 1800gagccccacg cccagccaga agctgggcag cgcctacctg ccctcctatc accactcccc 1860atacaagcac gcgcacatcc cggcgcacgc ccagcactac cagagctaca tgcagctgat 1920ccagcagaag tcggccgtgg agtacgcgca aagccagatg agcctggtga gcatgtgcaa 1980ggacctgagc tctcccaccc cgtcggagcc gcgcatggag tggaaggtga agatccgcag 2040cgacgggacg cgctacatca ccaagaggcc cgtgcgggac cgcctgctgc gggagcgcgc 2100cctgaagatc cgggaagagc gcagcggcat gaccaccgac gacgacgcgg tgagcgagat 2160gaagatgggg cgctactgga gcaaggagga gaggaagcag cacctggtga aggccaagga 2220gcagcggcgg cggcgcgagt tcatgatgca gagcaggttg gattgtctca aggagcagca 2280agcagccgat gacaggaagg agatgaacat tctcgaactg agccacaaaa agatgatgaa 2340gaagaggaat aagaaaatct tcgataactg gatgacgatc caagaactct taacccacgg 2400cacaaaatcc ccggacggca ctagagtata caattccttc ctatcggtga ctactgtata 2460attttcactt ctgcattatg tacataaagg agaccactac cactggggta gaaattcctg 2520cctcgttcaa tgcggcaagt ttttgtatat aagataagta cggtcttcat gtttacagtc 2580caaatttgca aaccctacaa ctctgggtgt cataggtcta ttttaaggga agagagagaa 2640aaacaccctt actatcttgg aaggcaatat taacaaacag atcttttttc aaatagcaat 2700tgtacttttc tacctgtacc cttttacata aagtgtttaa atttcagaaa gatcttttat 2760taagcatact ttcacagaat aacttgttta aactatattc atataaaaaa gttaaacacg 2820ctttttttcc tgcctaaaac acaaatacaa ctgccagtat gtatttttaa tggaacccta 2880ttttataatg gtacgttact gaatgtgttt catatgcgtg accgtta 29272242927DNAHomo sapiensCDS(95)..(2461) 224agcatttgct cagtactctc aacataaacc aagcttagat cgcaatgaaa ctgcggacct 60gcaggctgcc ttgcttcact gagtcaattt ttaa atg aag atg gaa cac aag agg 115 Met Lys Met Glu His Lys Arg 1 5atg atc tta ttg ttc agt aaa ttt act cta ata cat aaa ccc cat ggt 163Met Ile Leu Leu Phe Ser Lys Phe Thr Leu Ile His Lys Pro His Gly 10 15 20gga aga tgg tat ttg gtc aac ggc aga gac tta tcc aga gca act cat 211Gly Arg Trp Tyr Leu Val Asn Gly Arg Asp Leu Ser Arg Ala Thr His 25 30 35gac cag gct gtg gaa gct ttc aag aca gcc aag gag ccc ata gtg gtg 259Asp Gln Ala Val Glu Ala Phe Lys Thr Ala Lys Glu Pro Ile Val Val40 45 50 55cag gtg ttg aga aga aca cca agg acc aaa atg ttc acg cct cca tca 307Gln Val Leu Arg Arg Thr Pro Arg Thr Lys Met Phe Thr Pro Pro Ser 60 65 70gag tct cag ctg gtg gac acg gga acc caa acc gac atc acc ttt gaa 355Glu Ser Gln Leu Val Asp Thr Gly Thr Gln Thr Asp Ile Thr Phe Glu 75 80 85cat atc atg gcc ctc act aag atg tcc tct ccc agc cca ccc gta ctg 403His Ile Met Ala Leu Thr Lys Met Ser Ser Pro Ser Pro Pro Val Leu 90 95 100gat ccc tat ctc ttg cca gag gag cat ccc tca gcc cat gaa tac tac 451Asp Pro Tyr Leu Leu Pro Glu Glu His Pro Ser Ala His Glu Tyr Tyr 105 110 115gat cca aat gac tac att gga gac atc cat cag gag atg gac agg gag 499Asp Pro Asn Asp Tyr Ile Gly Asp Ile His Gln Glu Met Asp Arg Glu120 125 130 135gag ctg gag ctg gag gaa gtg gac ctc tac aga atg aac agc cag gac 547Glu Leu Glu Leu Glu Glu Val Asp Leu Tyr Arg Met Asn Ser Gln Asp 140 145 150aag ctg ggc ctc act gtg tgc tac cgg acg gac gat gaa gac gac att 595Lys Leu Gly Leu Thr Val Cys Tyr Arg Thr Asp Asp Glu Asp Asp Ile 155 160 165ggg att tac atc agt gag att gac cct aac agc att gca gcc aag gat 643Gly Ile Tyr Ile Ser Glu Ile Asp Pro Asn Ser Ile Ala Ala Lys Asp 170 175 180ggg cgc atc cga gaa gga gac cgc att atc cag att aat ggg ata gag 691Gly Arg Ile Arg Glu Gly Asp Arg Ile Ile Gln Ile Asn Gly Ile Glu 185 190 195gtg cag aac cgt gaa gag gct gtg gct ctt cta acc agt gaa gaa aat 739Val Gln Asn Arg Glu Glu Ala Val Ala Leu Leu Thr Ser Glu Glu Asn200 205 210 215aaa aac ttt tca ttg ctg att gca agg cct gaa ctc cag ctg gat gag 787Lys Asn Phe Ser Leu Leu Ile Ala Arg Pro Glu Leu Gln Leu Asp Glu 220 225 230ggc tgg atg gat gat gac agg aac gac ttt ctg gat gac ctg cac atg 835Gly Trp Met Asp Asp Asp Arg Asn Asp Phe Leu Asp Asp Leu His Met 235 240 245gac atg ctg gag gag cag cac cac cag gcc atg caa ttc aca gct agc 883Asp Met Leu Glu Glu Gln His His Gln Ala Met Gln Phe Thr Ala Ser 250 255 260gtg ctg cag cag aag aag cac gac gaa gac ggt ggg acc aca gat aca 931Val Leu Gln Gln Lys Lys His Asp Glu Asp Gly Gly Thr Thr Asp Thr 265 270 275gcc acc atc ttg tcc aac cag cac gag aag gac agc ggt gtg ggg cgg 979Ala Thr Ile Leu Ser Asn Gln His Glu Lys Asp Ser Gly Val Gly Arg280 285 290 295acc gac gag agc acc cgt aac gac gag agc tcg gag caa gag aac aat 1027Thr Asp Glu Ser Thr Arg Asn Asp Glu Ser Ser Glu Gln Glu Asn Asn 300 305 310ggc gac gac gcc acc gca tcc tcc aac ccg ctg gcg ggg cag agg aag 1075Gly Asp Asp Ala Thr Ala Ser Ser Asn Pro Leu Ala Gly Gln Arg Lys 315 320 325ctc acc tgc agc cag gac acc ttg ggc agc ggc gac ctg ccc ttc agc 1123Leu Thr Cys Ser Gln Asp Thr Leu Gly Ser Gly Asp Leu Pro Phe Ser 330 335 340aac gag tct ttc att tcg gcc gac tgc acg gac gcc gac tac ctg ggg 1171Asn Glu Ser Phe Ile Ser Ala Asp Cys Thr Asp Ala Asp Tyr Leu Gly 345 350 355atc acg gtg gac gag tgc gag cgc ttc cgc gag ctc ctg gag ctc aag 1219Ile Thr Val Asp Glu Cys Glu Arg Phe Arg Glu Leu Leu Glu Leu Lys360 365 370 375tgc cag gtg aag agc gcc acc cct tac ggc ctg tac tac cct agc ggc 1267Cys Gln Val Lys Ser Ala Thr Pro Tyr Gly Leu Tyr Tyr Pro Ser Gly 380 385 390ccc ctg gac gcc ggc aag agt gac cct gag agc gtg gac aag gag ctg 1315Pro Leu Asp Ala Gly Lys Ser Asp Pro Glu Ser Val Asp Lys Glu Leu 395 400 405gag ctg ctg aac gaa gag ctg cgc agc atc gag ctg gag tgc ctg agc 1363Glu Leu Leu Asn Glu Glu Leu Arg Ser Ile Glu Leu Glu Cys Leu Ser 410 415 420atc gtg cgc gcc cac aag atg cag cag ctc aag gag cag tac cgc gag 1411Ile Val Arg Ala His Lys Met Gln Gln Leu Lys Glu Gln Tyr Arg Glu 425 430 435tcc tgg atg ctg cac aac agc ggc ttc cgc aac tac aac acc agc atc 1459Ser Trp Met Leu His Asn Ser Gly Phe Arg Asn Tyr Asn Thr Ser Ile440 445 450 455gac gtg cgc aga cac gag ctc tca gat atc acc gag ctc ccg gag aaa 1507Asp Val Arg Arg His Glu Leu Ser Asp Ile Thr Glu Leu Pro Glu Lys 460 465 470tcc gac aag gac agc tcg agc gcc tac aac aca ggc gag agc tgc cgc 1555Ser Asp Lys Asp Ser Ser Ser Ala Tyr Asn Thr Gly Glu Ser Cys Arg 475 480 485agc acc ccg ctc acc ctg gag atc tcc ccc gac aac tcc ttg agg aga 1603Ser Thr Pro Leu Thr Leu Glu Ile Ser Pro Asp Asn Ser Leu Arg Arg 490 495 500gcg gcg gag ggc atc agc tgc ccg agc agc gaa ggg gct gtg ggg acc 1651Ala Ala Glu Gly Ile Ser Cys Pro Ser Ser Glu Gly Ala Val Gly Thr 505 510 515acg gaa gcc tac ggg cca gcc tcc aag aat ctg ctc tcc atc acg gaa 1699Thr Glu Ala Tyr Gly Pro Ala Ser Lys Asn Leu Leu Ser Ile Thr Glu520 525 530 535gat ccc gaa gtg ggc acc cct acc tat agc ccg tcc ctg aag gag ctg 1747Asp Pro Glu Val Gly Thr Pro Thr Tyr Ser Pro Ser Leu Lys Glu Leu 540 545 550gac ccc aac cag ccc ctg gaa agc aaa gag cgg aga gcc agc gac ggg 1795Asp Pro Asn Gln Pro Leu Glu Ser Lys Glu Arg Arg Ala Ser Asp Gly 555 560 565agc cgg agc ccc acg ccc agc cag aag ctg ggc agc gcc tac ctg ccc 1843Ser Arg Ser Pro Thr Pro Ser Gln Lys Leu Gly Ser Ala Tyr Leu Pro 570 575 580tcc tat cac cac tcc cca tac aag cac gcg cac atc ccg gcg cac gcc 1891Ser Tyr His His Ser Pro Tyr Lys His Ala His Ile Pro Ala His Ala 585 590 595cag cac tac cag agc tac atg cag ctg atc cag

cag aag tcg gcc gtg 1939Gln His Tyr Gln Ser Tyr Met Gln Leu Ile Gln Gln Lys Ser Ala Val600 605 610 615gag tac gcg caa agc cag atg agc ctg gtg agc atg tgc aag gac ctg 1987Glu Tyr Ala Gln Ser Gln Met Ser Leu Val Ser Met Cys Lys Asp Leu 620 625 630agc tct ccc acc ccg tcg gag ccg cgc atg gag tgg aag gtg aag atc 2035Ser Ser Pro Thr Pro Ser Glu Pro Arg Met Glu Trp Lys Val Lys Ile 635 640 645cgc agc gac ggg acg cgc tac atc acc aag agg ccc gtg cgg gac cgc 2083Arg Ser Asp Gly Thr Arg Tyr Ile Thr Lys Arg Pro Val Arg Asp Arg 650 655 660ctg ctg cgg gag cgc gcc ctg aag atc cgg gaa gag cgc agc ggc atg 2131Leu Leu Arg Glu Arg Ala Leu Lys Ile Arg Glu Glu Arg Ser Gly Met 665 670 675acc acc gac gac gac gcg gtg agc gag atg aag atg ggg cgc tac tgg 2179Thr Thr Asp Asp Asp Ala Val Ser Glu Met Lys Met Gly Arg Tyr Trp680 685 690 695agc aag gag gag agg aag cag cac ctg gtg aag gcc aag gag cag cgg 2227Ser Lys Glu Glu Arg Lys Gln His Leu Val Lys Ala Lys Glu Gln Arg 700 705 710cgg cgg cgc gag ttc atg atg cag agc agg ttg gat tgt ctc aag gag 2275Arg Arg Arg Glu Phe Met Met Gln Ser Arg Leu Asp Cys Leu Lys Glu 715 720 725cag caa gca gcc gat gac agg aag gag atg aac att ctc gaa ctg agc 2323Gln Gln Ala Ala Asp Asp Arg Lys Glu Met Asn Ile Leu Glu Leu Ser 730 735 740cac aaa aag atg atg aag aag agg aat aag aaa atc ttc gat aac tgg 2371His Lys Lys Met Met Lys Lys Arg Asn Lys Lys Ile Phe Asp Asn Trp 745 750 755atg acg atc caa gaa ctc tta acc cac ggc aca aaa tcc ccg gac ggc 2419Met Thr Ile Gln Glu Leu Leu Thr His Gly Thr Lys Ser Pro Asp Gly760 765 770 775act aga gta tac aat tcc ttc cta tcg gtg act act gta taa 2461Thr Arg Val Tyr Asn Ser Phe Leu Ser Val Thr Thr Val 780 785ttttcacttc tgcattatgt acataaagga gaccactacc actggggtag aaattcctgc 2521ctcgttcaat gcggcaagtt tttgtatata agataagtac ggtcttcatg tttacagtcc 2581aaatttgcaa accctacaac tctgggtgtc ataggtctat tttaagggaa gagagagaaa 2641aacaccctta ctatcttgga aggcaatatt aacaaacaga tcttttttca aatagcaatt 2701gtacttttct acctgtaccc ttttacataa agtgtttaaa tttcagaaag atcttttatt 2761aagcatactt tcacagaata acttgtttaa actatattca tataaaaaag ttaaacacgc 2821tttttttcct gcctaaaaca caaatacaac tgccagtatg tatttttaat ggaaccctat 2881tttataatgg tacgttactg aatgtgtttc atatgcgtga ccgtta 2927225788PRTHomo sapiens 225Met Lys Met Glu His Lys Arg Met Ile Leu Leu Phe Ser Lys Phe Thr1 5 10 15Leu Ile His Lys Pro His Gly Gly Arg Trp Tyr Leu Val Asn Gly Arg 20 25 30Asp Leu Ser Arg Ala Thr His Asp Gln Ala Val Glu Ala Phe Lys Thr 35 40 45Ala Lys Glu Pro Ile Val Val Gln Val Leu Arg Arg Thr Pro Arg Thr 50 55 60Lys Met Phe Thr Pro Pro Ser Glu Ser Gln Leu Val Asp Thr Gly Thr65 70 75 80Gln Thr Asp Ile Thr Phe Glu His Ile Met Ala Leu Thr Lys Met Ser 85 90 95Ser Pro Ser Pro Pro Val Leu Asp Pro Tyr Leu Leu Pro Glu Glu His 100 105 110Pro Ser Ala His Glu Tyr Tyr Asp Pro Asn Asp Tyr Ile Gly Asp Ile 115 120 125His Gln Glu Met Asp Arg Glu Glu Leu Glu Leu Glu Glu Val Asp Leu 130 135 140Tyr Arg Met Asn Ser Gln Asp Lys Leu Gly Leu Thr Val Cys Tyr Arg145 150 155 160Thr Asp Asp Glu Asp Asp Ile Gly Ile Tyr Ile Ser Glu Ile Asp Pro 165 170 175Asn Ser Ile Ala Ala Lys Asp Gly Arg Ile Arg Glu Gly Asp Arg Ile 180 185 190Ile Gln Ile Asn Gly Ile Glu Val Gln Asn Arg Glu Glu Ala Val Ala 195 200 205Leu Leu Thr Ser Glu Glu Asn Lys Asn Phe Ser Leu Leu Ile Ala Arg 210 215 220Pro Glu Leu Gln Leu Asp Glu Gly Trp Met Asp Asp Asp Arg Asn Asp225 230 235 240Phe Leu Asp Asp Leu His Met Asp Met Leu Glu Glu Gln His His Gln 245 250 255Ala Met Gln Phe Thr Ala Ser Val Leu Gln Gln Lys Lys His Asp Glu 260 265 270Asp Gly Gly Thr Thr Asp Thr Ala Thr Ile Leu Ser Asn Gln His Glu 275 280 285Lys Asp Ser Gly Val Gly Arg Thr Asp Glu Ser Thr Arg Asn Asp Glu 290 295 300Ser Ser Glu Gln Glu Asn Asn Gly Asp Asp Ala Thr Ala Ser Ser Asn305 310 315 320Pro Leu Ala Gly Gln Arg Lys Leu Thr Cys Ser Gln Asp Thr Leu Gly 325 330 335Ser Gly Asp Leu Pro Phe Ser Asn Glu Ser Phe Ile Ser Ala Asp Cys 340 345 350Thr Asp Ala Asp Tyr Leu Gly Ile Thr Val Asp Glu Cys Glu Arg Phe 355 360 365Arg Glu Leu Leu Glu Leu Lys Cys Gln Val Lys Ser Ala Thr Pro Tyr 370 375 380Gly Leu Tyr Tyr Pro Ser Gly Pro Leu Asp Ala Gly Lys Ser Asp Pro385 390 395 400Glu Ser Val Asp Lys Glu Leu Glu Leu Leu Asn Glu Glu Leu Arg Ser 405 410 415Ile Glu Leu Glu Cys Leu Ser Ile Val Arg Ala His Lys Met Gln Gln 420 425 430Leu Lys Glu Gln Tyr Arg Glu Ser Trp Met Leu His Asn Ser Gly Phe 435 440 445Arg Asn Tyr Asn Thr Ser Ile Asp Val Arg Arg His Glu Leu Ser Asp 450 455 460Ile Thr Glu Leu Pro Glu Lys Ser Asp Lys Asp Ser Ser Ser Ala Tyr465 470 475 480Asn Thr Gly Glu Ser Cys Arg Ser Thr Pro Leu Thr Leu Glu Ile Ser 485 490 495Pro Asp Asn Ser Leu Arg Arg Ala Ala Glu Gly Ile Ser Cys Pro Ser 500 505 510Ser Glu Gly Ala Val Gly Thr Thr Glu Ala Tyr Gly Pro Ala Ser Lys 515 520 525Asn Leu Leu Ser Ile Thr Glu Asp Pro Glu Val Gly Thr Pro Thr Tyr 530 535 540Ser Pro Ser Leu Lys Glu Leu Asp Pro Asn Gln Pro Leu Glu Ser Lys545 550 555 560Glu Arg Arg Ala Ser Asp Gly Ser Arg Ser Pro Thr Pro Ser Gln Lys 565 570 575Leu Gly Ser Ala Tyr Leu Pro Ser Tyr His His Ser Pro Tyr Lys His 580 585 590Ala His Ile Pro Ala His Ala Gln His Tyr Gln Ser Tyr Met Gln Leu 595 600 605Ile Gln Gln Lys Ser Ala Val Glu Tyr Ala Gln Ser Gln Met Ser Leu 610 615 620Val Ser Met Cys Lys Asp Leu Ser Ser Pro Thr Pro Ser Glu Pro Arg625 630 635 640Met Glu Trp Lys Val Lys Ile Arg Ser Asp Gly Thr Arg Tyr Ile Thr 645 650 655Lys Arg Pro Val Arg Asp Arg Leu Leu Arg Glu Arg Ala Leu Lys Ile 660 665 670Arg Glu Glu Arg Ser Gly Met Thr Thr Asp Asp Asp Ala Val Ser Glu 675 680 685Met Lys Met Gly Arg Tyr Trp Ser Lys Glu Glu Arg Lys Gln His Leu 690 695 700Val Lys Ala Lys Glu Gln Arg Arg Arg Arg Glu Phe Met Met Gln Ser705 710 715 720Arg Leu Asp Cys Leu Lys Glu Gln Gln Ala Ala Asp Asp Arg Lys Glu 725 730 735Met Asn Ile Leu Glu Leu Ser His Lys Lys Met Met Lys Lys Arg Asn 740 745 750Lys Lys Ile Phe Asp Asn Trp Met Thr Ile Gln Glu Leu Leu Thr His 755 760 765Gly Thr Lys Ser Pro Asp Gly Thr Arg Val Tyr Asn Ser Phe Leu Ser 770 775 780Val Thr Thr Val785226178DNAHomo sapiens 226agcatttgct cagtactctc aacataaacc aagcttagat cgcaatgaaa ctgcggacct 60gcaggctgcc ttgcttcact gagtcaattt ttaaatgaag atggaacaca agaggatgat 120cttattgttc agtaaattta ctctaataca taaaccccat ggtggaagat ggtatttg 17822728PRTHomo sapiens 227Met Lys Met Glu His Lys Arg Met Ile Leu Leu Phe Ser Lys Phe Thr1 5 10 15Leu Ile His Lys Pro His Gly Gly Arg Trp Tyr Leu 20 2522884DNAHomo sapiens 228atgaagatgg aacacaagag gatgatctta ttgttcagta aatttactct aatacataaa 60ccccatggtg gaagatggta tttg 842292738DNAHomo sapiens 229atatataata gccttgcaag tgaacggata agcaaagtca gttgctgaga aataacactt 60ccgagctgac gcatccaaat gtttaggata cacctgagat gtctctctta ctgaatagaa 120acaacatccc tggaagctct agacttctga ggcaccccag ctttccctgg tcaacggcag 180agacttatcc agagcaactc atgaccaggc tgtggaagct ttcaagacag ccaaggagcc 240catagtggtg caggtgttga gaagaacacc aaggaccaaa atgttcacgc ctccatcaga 300gtctcagctg gtggacacgg gaacccaaac cgacatcacc tttgaacata tcatggccct 360cactaagatg tcctctccca gcccacccgt actggatccc tatctcttgc cagaggagca 420tccctcagcc catgaatact acgatccaaa tgactacatt ggagacatcc atcaggagat 480ggacagggag gagctggagc tggaggaagt ggacctctac agaatgaaca gccaggacaa 540gctgggcctc actgtgtgct accggacgga cgatgaagac gacattggga tttatatcag 600tgagattgac cctaacagca ttgcagccaa ggatgggcgc atccgagaag gagaccgcat 660tatccagatt aatgggatag aggtgcagaa ccgtgaagag gctgtggctc ttctaaccag 720tgaagaaaat aaaaactttt cattgctgat tgcaaggcct gaactccagc tggatgaggg 780ctggatggat gatgacagga acgactttct ggatgacctg cacatggaca tgctggagga 840gcagcaccac caggccatgc aattcacagc tagcgtgctg cagcagaaga agcacgacga 900agacggtggg accacagata cagccaccat cttgtccaac cagcacgaga aggacagcgg 960tgtggggcgg accgacgaga gcacccgtaa tgacgagagc tcggagcaag agaacaatgg 1020cgacgacgcc accgcatcct ccaacccgct ggcggggcag aggaagctca cctgcagcca 1080ggacaccttg ggcagcggcg acctgccctt cagcaacgag tctttcattt cggccgactg 1140cacggacgcc gactacctgg ggatcccggt ggacgagtgc gagcgcttcc gcgagctcct 1200ggagctcaag tgccaggtga agagcgccac cccttacggc ctgtactacc ctagcggccc 1260cctggacgcc ggcaagagtg accctgagag cgtggacaag gagctggagc tgctgaacga 1320agagctgcgc agcatcgagc tggagtgcct gagcatcgtg cgcgcccaca agatgcagca 1380gctcaaggag cagtaccgcg agtcctggat gctgcacaac agcggcttcc gcaactacaa 1440caccagcatc gacgtgcgca gacacgagct ctcggatatc accgagctcc cggagaaatc 1500cgacaaggac agctcgagcg cctacaacac aggcgagagc tgccgcagca ccccgctcac 1560cctggagatc tcccccgaca actccttgag gagagcggtg gagggcatca gctgcccgag 1620cagcgaaggg gctgtgggga ccacggaagc ctacgggcca gcctccaaga atctgctctc 1680catcacggaa gatcccgaag tgggcacccc tacctatagc ccgtccctga aggagctgga 1740ccccaaccag cccctggaaa gcaaagagcg gagagccagc gacgggagcc ggagccccac 1800gcccagccag aagctgggca gcgcctacct gccctcctat caccactccc catacaagca 1860cgcgcacatc ccggcgcacg cccagcacta ccagagctac atgcagctga tccagcagaa 1920gtcggccgtg gagtacgcgc aaagccagat gagcctggtg agcatgtgca aggacctgag 1980ctctcccacc ccgtcggagc cgcgcatgga gtggaaggtg aagatccgca gcgacgggac 2040gcgctacatc accaagaggc ccgtgcggga ccgcctgctg cgggagcgcg ccctgaagat 2100ccgggaagag cgcagcggca tgaccaccga cgacgacgcg gtgagcgaga tgaagatggg 2160gcgctactgg agcaaggagg agaggaagca gcacctggtg aaggccaagg agcagcggcg 2220gcggcgcgag ttcatgatgc agagcaggtt ggattgtctc aaggagcagc aagcagccga 2280tgacaggaag gagatgaaca ttctcgaact gagccacaaa aagatgatga agaagaggaa 2340taagaaaatc ttcgataact ggatgacgat ccaagaactc ttaacccacg gcacaaaatc 2400cccggacggc actagagtat acaattcctt cctatcggtg actactgtat aattttcact 2460tctgcattat gtacataaag gagaccacta ccactggggt agaaattcct gcctcgttca 2520atgcggcaag tttttgtata taagataagt acggtcttca tgtttacagt ccaaatttgc 2580aaaccccaca actctgggtg tcgtaggtct attttaaggg aagagagaga aaaacaccct 2640tactatcttg gaaggcaata ttaacaaaca gagctttttt caaatagcaa ttgtactttt 2700ctacctgtac ccttttacat aaagtgttta aatttcag 27382302738DNAHomo sapiensCDS(281)..(2452) 230atatataata gccttgcaag tgaacggata agcaaagtca gttgctgaga aataacactt 60ccgagctgac gcatccaaat gtttaggata cacctgagat gtctctctta ctgaatagaa 120acaacatccc tggaagctct agacttctga ggcaccccag ctttccctgg tcaacggcag 180agacttatcc agagcaactc atgaccaggc tgtggaagct ttcaagacag ccaaggagcc 240catagtggtg caggtgttga gaagaacacc aaggaccaaa atg ttc acg cct cca 295 Met Phe Thr Pro Pro 1 5tca gag tct cag ctg gtg gac acg gga acc caa acc gac atc acc ttt 343Ser Glu Ser Gln Leu Val Asp Thr Gly Thr Gln Thr Asp Ile Thr Phe 10 15 20gaa cat atc atg gcc ctc act aag atg tcc tct ccc agc cca ccc gta 391Glu His Ile Met Ala Leu Thr Lys Met Ser Ser Pro Ser Pro Pro Val 25 30 35ctg gat ccc tat ctc ttg cca gag gag cat ccc tca gcc cat gaa tac 439Leu Asp Pro Tyr Leu Leu Pro Glu Glu His Pro Ser Ala His Glu Tyr 40 45 50tac gat cca aat gac tac att gga gac atc cat cag gag atg gac agg 487Tyr Asp Pro Asn Asp Tyr Ile Gly Asp Ile His Gln Glu Met Asp Arg 55 60 65gag gag ctg gag ctg gag gaa gtg gac ctc tac aga atg aac agc cag 535Glu Glu Leu Glu Leu Glu Glu Val Asp Leu Tyr Arg Met Asn Ser Gln70 75 80 85gac aag ctg ggc ctc act gtg tgc tac cgg acg gac gat gaa gac gac 583Asp Lys Leu Gly Leu Thr Val Cys Tyr Arg Thr Asp Asp Glu Asp Asp 90 95 100att ggg att tat atc agt gag att gac cct aac agc att gca gcc aag 631Ile Gly Ile Tyr Ile Ser Glu Ile Asp Pro Asn Ser Ile Ala Ala Lys 105 110 115gat ggg cgc atc cga gaa gga gac cgc att atc cag att aat ggg ata 679Asp Gly Arg Ile Arg Glu Gly Asp Arg Ile Ile Gln Ile Asn Gly Ile 120 125 130gag gtg cag aac cgt gaa gag gct gtg gct ctt cta acc agt gaa gaa 727Glu Val Gln Asn Arg Glu Glu Ala Val Ala Leu Leu Thr Ser Glu Glu 135 140 145aat aaa aac ttt tca ttg ctg att gca agg cct gaa ctc cag ctg gat 775Asn Lys Asn Phe Ser Leu Leu Ile Ala Arg Pro Glu Leu Gln Leu Asp150 155 160 165gag ggc tgg atg gat gat gac agg aac gac ttt ctg gat gac ctg cac 823Glu Gly Trp Met Asp Asp Asp Arg Asn Asp Phe Leu Asp Asp Leu His 170 175 180atg gac atg ctg gag gag cag cac cac cag gcc atg caa ttc aca gct 871Met Asp Met Leu Glu Glu Gln His His Gln Ala Met Gln Phe Thr Ala 185 190 195agc gtg ctg cag cag aag aag cac gac gaa gac ggt ggg acc aca gat 919Ser Val Leu Gln Gln Lys Lys His Asp Glu Asp Gly Gly Thr Thr Asp 200 205 210aca gcc acc atc ttg tcc aac cag cac gag aag gac agc ggt gtg ggg 967Thr Ala Thr Ile Leu Ser Asn Gln His Glu Lys Asp Ser Gly Val Gly 215 220 225cgg acc gac gag agc acc cgt aat gac gag agc tcg gag caa gag aac 1015Arg Thr Asp Glu Ser Thr Arg Asn Asp Glu Ser Ser Glu Gln Glu Asn230 235 240 245aat ggc gac gac gcc acc gca tcc tcc aac ccg ctg gcg ggg cag agg 1063Asn Gly Asp Asp Ala Thr Ala Ser Ser Asn Pro Leu Ala Gly Gln Arg 250 255 260aag ctc acc tgc agc cag gac acc ttg ggc agc ggc gac ctg ccc ttc 1111Lys Leu Thr Cys Ser Gln Asp Thr Leu Gly Ser Gly Asp Leu Pro Phe 265 270 275agc aac gag tct ttc att tcg gcc gac tgc acg gac gcc gac tac ctg 1159Ser Asn Glu Ser Phe Ile Ser Ala Asp Cys Thr Asp Ala Asp Tyr Leu 280 285 290ggg atc ccg gtg gac gag tgc gag cgc ttc cgc gag ctc ctg gag ctc 1207Gly Ile Pro Val Asp Glu Cys Glu Arg Phe Arg Glu Leu Leu Glu Leu 295 300 305aag tgc cag gtg aag agc gcc acc cct tac ggc ctg tac tac cct agc 1255Lys Cys Gln Val Lys Ser Ala Thr Pro Tyr Gly Leu Tyr Tyr Pro Ser310 315 320 325ggc ccc ctg gac gcc ggc aag agt gac cct gag agc gtg gac aag gag 1303Gly Pro Leu Asp Ala Gly Lys Ser Asp Pro Glu Ser Val Asp Lys Glu 330 335 340ctg gag ctg ctg aac gaa gag ctg cgc agc atc gag ctg gag tgc ctg 1351Leu Glu Leu Leu Asn Glu Glu Leu Arg Ser Ile Glu Leu Glu Cys Leu 345 350 355agc atc gtg cgc gcc cac aag atg cag cag ctc aag gag cag tac cgc 1399Ser Ile Val Arg Ala His Lys Met Gln Gln Leu Lys Glu Gln Tyr Arg 360 365 370gag tcc tgg atg ctg cac aac agc ggc ttc cgc aac tac aac acc agc 1447Glu Ser Trp Met Leu His Asn Ser Gly Phe Arg Asn Tyr Asn Thr Ser 375 380 385atc gac gtg cgc aga cac gag ctc tcg gat atc acc gag ctc ccg gag 1495Ile Asp Val Arg Arg His Glu Leu Ser Asp Ile Thr Glu Leu Pro Glu390 395 400 405aaa tcc gac aag gac agc tcg agc gcc tac aac aca ggc gag agc tgc 1543Lys Ser Asp Lys Asp Ser Ser Ser Ala Tyr Asn Thr Gly Glu Ser Cys 410 415 420cgc agc acc ccg ctc acc ctg gag atc tcc ccc gac aac tcc ttg agg 1591Arg Ser Thr Pro Leu Thr Leu Glu Ile Ser Pro Asp Asn Ser Leu Arg 425

430 435aga gcg gtg gag ggc atc agc tgc ccg agc agc gaa ggg gct gtg ggg 1639Arg Ala Val Glu Gly Ile Ser Cys Pro Ser Ser Glu Gly Ala Val Gly 440 445 450acc acg gaa gcc tac ggg cca gcc tcc aag aat ctg ctc tcc atc acg 1687Thr Thr Glu Ala Tyr Gly Pro Ala Ser Lys Asn Leu Leu Ser Ile Thr 455 460 465gaa gat ccc gaa gtg ggc acc cct acc tat agc ccg tcc ctg aag gag 1735Glu Asp Pro Glu Val Gly Thr Pro Thr Tyr Ser Pro Ser Leu Lys Glu470 475 480 485ctg gac ccc aac cag ccc ctg gaa agc aaa gag cgg aga gcc agc gac 1783Leu Asp Pro Asn Gln Pro Leu Glu Ser Lys Glu Arg Arg Ala Ser Asp 490 495 500ggg agc cgg agc ccc acg ccc agc cag aag ctg ggc agc gcc tac ctg 1831Gly Ser Arg Ser Pro Thr Pro Ser Gln Lys Leu Gly Ser Ala Tyr Leu 505 510 515ccc tcc tat cac cac tcc cca tac aag cac gcg cac atc ccg gcg cac 1879Pro Ser Tyr His His Ser Pro Tyr Lys His Ala His Ile Pro Ala His 520 525 530gcc cag cac tac cag agc tac atg cag ctg atc cag cag aag tcg gcc 1927Ala Gln His Tyr Gln Ser Tyr Met Gln Leu Ile Gln Gln Lys Ser Ala 535 540 545gtg gag tac gcg caa agc cag atg agc ctg gtg agc atg tgc aag gac 1975Val Glu Tyr Ala Gln Ser Gln Met Ser Leu Val Ser Met Cys Lys Asp550 555 560 565ctg agc tct ccc acc ccg tcg gag ccg cgc atg gag tgg aag gtg aag 2023Leu Ser Ser Pro Thr Pro Ser Glu Pro Arg Met Glu Trp Lys Val Lys 570 575 580atc cgc agc gac ggg acg cgc tac atc acc aag agg ccc gtg cgg gac 2071Ile Arg Ser Asp Gly Thr Arg Tyr Ile Thr Lys Arg Pro Val Arg Asp 585 590 595cgc ctg ctg cgg gag cgc gcc ctg aag atc cgg gaa gag cgc agc ggc 2119Arg Leu Leu Arg Glu Arg Ala Leu Lys Ile Arg Glu Glu Arg Ser Gly 600 605 610atg acc acc gac gac gac gcg gtg agc gag atg aag atg ggg cgc tac 2167Met Thr Thr Asp Asp Asp Ala Val Ser Glu Met Lys Met Gly Arg Tyr 615 620 625tgg agc aag gag gag agg aag cag cac ctg gtg aag gcc aag gag cag 2215Trp Ser Lys Glu Glu Arg Lys Gln His Leu Val Lys Ala Lys Glu Gln630 635 640 645cgg cgg cgg cgc gag ttc atg atg cag agc agg ttg gat tgt ctc aag 2263Arg Arg Arg Arg Glu Phe Met Met Gln Ser Arg Leu Asp Cys Leu Lys 650 655 660gag cag caa gca gcc gat gac agg aag gag atg aac att ctc gaa ctg 2311Glu Gln Gln Ala Ala Asp Asp Arg Lys Glu Met Asn Ile Leu Glu Leu 665 670 675agc cac aaa aag atg atg aag aag agg aat aag aaa atc ttc gat aac 2359Ser His Lys Lys Met Met Lys Lys Arg Asn Lys Lys Ile Phe Asp Asn 680 685 690tgg atg acg atc caa gaa ctc tta acc cac ggc aca aaa tcc ccg gac 2407Trp Met Thr Ile Gln Glu Leu Leu Thr His Gly Thr Lys Ser Pro Asp 695 700 705ggc act aga gta tac aat tcc ttc cta tcg gtg act act gta taa 2452Gly Thr Arg Val Tyr Asn Ser Phe Leu Ser Val Thr Thr Val710 715 720ttttcacttc tgcattatgt acataaagga gaccactacc actggggtag aaattcctgc 2512ctcgttcaat gcggcaagtt tttgtatata agataagtac ggtcttcatg tttacagtcc 2572aaatttgcaa accccacaac tctgggtgtc gtaggtctat tttaagggaa gagagagaaa 2632aacaccctta ctatcttgga aggcaatatt aacaaacaga gcttttttca aatagcaatt 2692gtacttttct acctgtaccc ttttacataa agtgtttaaa tttcag 2738231723PRTHomo sapiens 231Met Phe Thr Pro Pro Ser Glu Ser Gln Leu Val Asp Thr Gly Thr Gln1 5 10 15Thr Asp Ile Thr Phe Glu His Ile Met Ala Leu Thr Lys Met Ser Ser 20 25 30Pro Ser Pro Pro Val Leu Asp Pro Tyr Leu Leu Pro Glu Glu His Pro 35 40 45Ser Ala His Glu Tyr Tyr Asp Pro Asn Asp Tyr Ile Gly Asp Ile His 50 55 60Gln Glu Met Asp Arg Glu Glu Leu Glu Leu Glu Glu Val Asp Leu Tyr65 70 75 80Arg Met Asn Ser Gln Asp Lys Leu Gly Leu Thr Val Cys Tyr Arg Thr 85 90 95Asp Asp Glu Asp Asp Ile Gly Ile Tyr Ile Ser Glu Ile Asp Pro Asn 100 105 110Ser Ile Ala Ala Lys Asp Gly Arg Ile Arg Glu Gly Asp Arg Ile Ile 115 120 125Gln Ile Asn Gly Ile Glu Val Gln Asn Arg Glu Glu Ala Val Ala Leu 130 135 140Leu Thr Ser Glu Glu Asn Lys Asn Phe Ser Leu Leu Ile Ala Arg Pro145 150 155 160Glu Leu Gln Leu Asp Glu Gly Trp Met Asp Asp Asp Arg Asn Asp Phe 165 170 175Leu Asp Asp Leu His Met Asp Met Leu Glu Glu Gln His His Gln Ala 180 185 190Met Gln Phe Thr Ala Ser Val Leu Gln Gln Lys Lys His Asp Glu Asp 195 200 205Gly Gly Thr Thr Asp Thr Ala Thr Ile Leu Ser Asn Gln His Glu Lys 210 215 220Asp Ser Gly Val Gly Arg Thr Asp Glu Ser Thr Arg Asn Asp Glu Ser225 230 235 240Ser Glu Gln Glu Asn Asn Gly Asp Asp Ala Thr Ala Ser Ser Asn Pro 245 250 255Leu Ala Gly Gln Arg Lys Leu Thr Cys Ser Gln Asp Thr Leu Gly Ser 260 265 270Gly Asp Leu Pro Phe Ser Asn Glu Ser Phe Ile Ser Ala Asp Cys Thr 275 280 285Asp Ala Asp Tyr Leu Gly Ile Pro Val Asp Glu Cys Glu Arg Phe Arg 290 295 300Glu Leu Leu Glu Leu Lys Cys Gln Val Lys Ser Ala Thr Pro Tyr Gly305 310 315 320Leu Tyr Tyr Pro Ser Gly Pro Leu Asp Ala Gly Lys Ser Asp Pro Glu 325 330 335Ser Val Asp Lys Glu Leu Glu Leu Leu Asn Glu Glu Leu Arg Ser Ile 340 345 350Glu Leu Glu Cys Leu Ser Ile Val Arg Ala His Lys Met Gln Gln Leu 355 360 365Lys Glu Gln Tyr Arg Glu Ser Trp Met Leu His Asn Ser Gly Phe Arg 370 375 380Asn Tyr Asn Thr Ser Ile Asp Val Arg Arg His Glu Leu Ser Asp Ile385 390 395 400Thr Glu Leu Pro Glu Lys Ser Asp Lys Asp Ser Ser Ser Ala Tyr Asn 405 410 415Thr Gly Glu Ser Cys Arg Ser Thr Pro Leu Thr Leu Glu Ile Ser Pro 420 425 430Asp Asn Ser Leu Arg Arg Ala Val Glu Gly Ile Ser Cys Pro Ser Ser 435 440 445Glu Gly Ala Val Gly Thr Thr Glu Ala Tyr Gly Pro Ala Ser Lys Asn 450 455 460Leu Leu Ser Ile Thr Glu Asp Pro Glu Val Gly Thr Pro Thr Tyr Ser465 470 475 480Pro Ser Leu Lys Glu Leu Asp Pro Asn Gln Pro Leu Glu Ser Lys Glu 485 490 495Arg Arg Ala Ser Asp Gly Ser Arg Ser Pro Thr Pro Ser Gln Lys Leu 500 505 510Gly Ser Ala Tyr Leu Pro Ser Tyr His His Ser Pro Tyr Lys His Ala 515 520 525His Ile Pro Ala His Ala Gln His Tyr Gln Ser Tyr Met Gln Leu Ile 530 535 540Gln Gln Lys Ser Ala Val Glu Tyr Ala Gln Ser Gln Met Ser Leu Val545 550 555 560Ser Met Cys Lys Asp Leu Ser Ser Pro Thr Pro Ser Glu Pro Arg Met 565 570 575Glu Trp Lys Val Lys Ile Arg Ser Asp Gly Thr Arg Tyr Ile Thr Lys 580 585 590Arg Pro Val Arg Asp Arg Leu Leu Arg Glu Arg Ala Leu Lys Ile Arg 595 600 605Glu Glu Arg Ser Gly Met Thr Thr Asp Asp Asp Ala Val Ser Glu Met 610 615 620Lys Met Gly Arg Tyr Trp Ser Lys Glu Glu Arg Lys Gln His Leu Val625 630 635 640Lys Ala Lys Glu Gln Arg Arg Arg Arg Glu Phe Met Met Gln Ser Arg 645 650 655Leu Asp Cys Leu Lys Glu Gln Gln Ala Ala Asp Asp Arg Lys Glu Met 660 665 670Asn Ile Leu Glu Leu Ser His Lys Lys Met Met Lys Lys Arg Asn Lys 675 680 685Lys Ile Phe Asp Asn Trp Met Thr Ile Gln Glu Leu Leu Thr His Gly 690 695 700Thr Lys Ser Pro Asp Gly Thr Arg Val Tyr Asn Ser Phe Leu Ser Val705 710 715 720Thr Thr Val232169DNAHomo sapiens 232atatataata gccttgcaag tgaacggata agcaaagtca gttgctgaga aataacactt 60ccgagctgac gcatccaaat gtttaggata cacctgagat gtctctctta ctgaatagaa 120acaacatccc tggaagctct agacttctga ggcaccccag ctttccctg 169233280DNAHomo sapiens 233atatataata gccttgcaag tgaacggata agcaaagtca gttgctgaga aataacactt 60ccgagctgac gcatccaaat gtttaggata cacctgagat gtctctctta ctgaatagaa 120acaacatccc tggaagctct agacttctga ggcaccccag ctttccctgg tcaacggcag 180agacttatcc agagcaactc atgaccaggc tgtggaagct ttcaagacag ccaaggagcc 240catagtggtg caggtgttga gaagaacacc aaggaccaaa 2802341574DNAHomo sapiens 234gtagacgcct cggggcttaa gagaacgtga catggaagag acaggaaata tgtactcttg 60tacaggtatg cggcacacat gtggccccct ccctcccagc ctcctcacta aagattatta 120acctgcttat tctcctggtt cacctgaggc tgttttgaga gaatctcagt atctcagtct 180tcatttcata cgacattctt cgggctcctc ttgttcatct ttgcagactg aagtatttat 240gaaggacctt cagaaatctg attcctcctg atgaatcaaa gagtaaagaa gaatcaaaca 300tgattaacca ggtcaacggc agagacttat ccagagcaac tcatgaccag gctgtggaag 360ctttcaagac agccaaggag cccatagtgg tgcaggtgtt gagaagaaca ccaaggacca 420aaatgttcac gcctccatca gagtctcagc tggtggacac gggaacccaa accgacatca 480cctttgaaca tatcatggcc ctcactaaga tgtcctctcc cagcccaccc gtactggatc 540cctatctctt gccagaggag catccctcag cccatgaata ctacgatcca aatgactaca 600ttggagacat ccatcaggag atggacaggg aggagctgga gctggaggaa gtggacctct 660acagaatgaa cagccaggac aagctgggcc tcactgtgtg ctaccggacg gacgatgaag 720acgacattgg gatttatatc agtgagattg accctaacag cattgcagcc aaggatgggc 780gcatccgaga aggagaccgc attatccaga ttaatgggat agaggtgcag aaccgtgaag 840aggctgtggc tcttctaacc agtgaagaaa ataaaaactt ttcattgctg attgcaaggc 900ctgaactcca gctggatgag ggctggatgg atgatgacag gaacgacttt ctggatgacc 960tgcacatgga catgctggag gagcagcacc accaggccat gcaattcaca gctagcgtgc 1020tgcagcagaa gaagcacgac gaagacggtg ggaccacaga tacagccacc atcttgtcca 1080accagcacga gaaggacagc ggtgtggggc ggaccgacga gagcacccgt aatgacgaga 1140gctcggagca agagagcaat ggcgacgacg ccaccgcatc ctccaacccg ctggcggggc 1200agaggaagct cacctgcagc caggacacct tgggcagcgg cgacctgccc ttcagcaacg 1260agtctttcat ttcggccgac tgcacggacg ccgactacct ggggatcccg gtggacgagt 1320gcgagcgctt ccgcgagctc ctggagctca agtgccaggt gaagagcgcc accccttacg 1380gcctgtacta ccctagcggc cccctggacg ccggcaagag tgaccctgag agcgtggaca 1440aggagctgga gctgctgaac gaagagctgc gcagcatcga gctggagtgc ctgagcatcg 1500tgcgcgccca caagatgcag cagctcaagg agcagtaccg cgagtcctgg atgctgcaca 1560acagcggctt ccgc 15742351574DNAHomo sapiensCDS(300)..(1574) 235gtagacgcct cggggcttaa gagaacgtga catggaagag acaggaaata tgtactcttg 60tacaggtatg cggcacacat gtggccccct ccctcccagc ctcctcacta aagattatta 120acctgcttat tctcctggtt cacctgaggc tgttttgaga gaatctcagt atctcagtct 180tcatttcata cgacattctt cgggctcctc ttgttcatct ttgcagactg aagtatttat 240gaaggacctt cagaaatctg attcctcctg atgaatcaaa gagtaaagaa gaatcaaac 299atg att aac cag gtc aac ggc aga gac tta tcc aga gca act cat gac 347Met Ile Asn Gln Val Asn Gly Arg Asp Leu Ser Arg Ala Thr His Asp1 5 10 15cag gct gtg gaa gct ttc aag aca gcc aag gag ccc ata gtg gtg cag 395Gln Ala Val Glu Ala Phe Lys Thr Ala Lys Glu Pro Ile Val Val Gln 20 25 30gtg ttg aga aga aca cca agg acc aaa atg ttc acg cct cca tca gag 443Val Leu Arg Arg Thr Pro Arg Thr Lys Met Phe Thr Pro Pro Ser Glu 35 40 45tct cag ctg gtg gac acg gga acc caa acc gac atc acc ttt gaa cat 491Ser Gln Leu Val Asp Thr Gly Thr Gln Thr Asp Ile Thr Phe Glu His 50 55 60atc atg gcc ctc act aag atg tcc tct ccc agc cca ccc gta ctg gat 539Ile Met Ala Leu Thr Lys Met Ser Ser Pro Ser Pro Pro Val Leu Asp65 70 75 80ccc tat ctc ttg cca gag gag cat ccc tca gcc cat gaa tac tac gat 587Pro Tyr Leu Leu Pro Glu Glu His Pro Ser Ala His Glu Tyr Tyr Asp 85 90 95cca aat gac tac att gga gac atc cat cag gag atg gac agg gag gag 635Pro Asn Asp Tyr Ile Gly Asp Ile His Gln Glu Met Asp Arg Glu Glu 100 105 110ctg gag ctg gag gaa gtg gac ctc tac aga atg aac agc cag gac aag 683Leu Glu Leu Glu Glu Val Asp Leu Tyr Arg Met Asn Ser Gln Asp Lys 115 120 125ctg ggc ctc act gtg tgc tac cgg acg gac gat gaa gac gac att ggg 731Leu Gly Leu Thr Val Cys Tyr Arg Thr Asp Asp Glu Asp Asp Ile Gly 130 135 140att tat atc agt gag att gac cct aac agc att gca gcc aag gat ggg 779Ile Tyr Ile Ser Glu Ile Asp Pro Asn Ser Ile Ala Ala Lys Asp Gly145 150 155 160cgc atc cga gaa gga gac cgc att atc cag att aat ggg ata gag gtg 827Arg Ile Arg Glu Gly Asp Arg Ile Ile Gln Ile Asn Gly Ile Glu Val 165 170 175cag aac cgt gaa gag gct gtg gct ctt cta acc agt gaa gaa aat aaa 875Gln Asn Arg Glu Glu Ala Val Ala Leu Leu Thr Ser Glu Glu Asn Lys 180 185 190aac ttt tca ttg ctg att gca agg cct gaa ctc cag ctg gat gag ggc 923Asn Phe Ser Leu Leu Ile Ala Arg Pro Glu Leu Gln Leu Asp Glu Gly 195 200 205tgg atg gat gat gac agg aac gac ttt ctg gat gac ctg cac atg gac 971Trp Met Asp Asp Asp Arg Asn Asp Phe Leu Asp Asp Leu His Met Asp 210 215 220atg ctg gag gag cag cac cac cag gcc atg caa ttc aca gct agc gtg 1019Met Leu Glu Glu Gln His His Gln Ala Met Gln Phe Thr Ala Ser Val225 230 235 240ctg cag cag aag aag cac gac gaa gac ggt ggg acc aca gat aca gcc 1067Leu Gln Gln Lys Lys His Asp Glu Asp Gly Gly Thr Thr Asp Thr Ala 245 250 255acc atc ttg tcc aac cag cac gag aag gac agc ggt gtg ggg cgg acc 1115Thr Ile Leu Ser Asn Gln His Glu Lys Asp Ser Gly Val Gly Arg Thr 260 265 270gac gag agc acc cgt aat gac gag agc tcg gag caa gag agc aat ggc 1163Asp Glu Ser Thr Arg Asn Asp Glu Ser Ser Glu Gln Glu Ser Asn Gly 275 280 285gac gac gcc acc gca tcc tcc aac ccg ctg gcg ggg cag agg aag ctc 1211Asp Asp Ala Thr Ala Ser Ser Asn Pro Leu Ala Gly Gln Arg Lys Leu 290 295 300acc tgc agc cag gac acc ttg ggc agc ggc gac ctg ccc ttc agc aac 1259Thr Cys Ser Gln Asp Thr Leu Gly Ser Gly Asp Leu Pro Phe Ser Asn305 310 315 320gag tct ttc att tcg gcc gac tgc acg gac gcc gac tac ctg ggg atc 1307Glu Ser Phe Ile Ser Ala Asp Cys Thr Asp Ala Asp Tyr Leu Gly Ile 325 330 335ccg gtg gac gag tgc gag cgc ttc cgc gag ctc ctg gag ctc aag tgc 1355Pro Val Asp Glu Cys Glu Arg Phe Arg Glu Leu Leu Glu Leu Lys Cys 340 345 350cag gtg aag agc gcc acc cct tac ggc ctg tac tac cct agc ggc ccc 1403Gln Val Lys Ser Ala Thr Pro Tyr Gly Leu Tyr Tyr Pro Ser Gly Pro 355 360 365ctg gac gcc ggc aag agt gac cct gag agc gtg gac aag gag ctg gag 1451Leu Asp Ala Gly Lys Ser Asp Pro Glu Ser Val Asp Lys Glu Leu Glu 370 375 380ctg ctg aac gaa gag ctg cgc agc atc gag ctg gag tgc ctg agc atc 1499Leu Leu Asn Glu Glu Leu Arg Ser Ile Glu Leu Glu Cys Leu Ser Ile385 390 395 400gtg cgc gcc cac aag atg cag cag ctc aag gag cag tac cgc gag tcc 1547Val Arg Ala His Lys Met Gln Gln Leu Lys Glu Gln Tyr Arg Glu Ser 405 410 415tgg atg ctg cac aac agc ggc ttc cgc 1574Trp Met Leu His Asn Ser Gly Phe Arg 420 425236425PRTHomo sapiens 236Met Ile Asn Gln Val Asn Gly Arg Asp Leu Ser Arg Ala Thr His Asp1 5 10 15Gln Ala Val Glu Ala Phe Lys Thr Ala Lys Glu Pro Ile Val Val Gln 20 25 30Val Leu Arg Arg Thr Pro Arg Thr Lys Met Phe Thr Pro Pro Ser Glu 35 40 45Ser Gln Leu Val Asp Thr Gly Thr Gln Thr Asp Ile Thr Phe Glu His 50 55 60Ile Met Ala Leu Thr Lys Met Ser Ser Pro Ser Pro Pro Val Leu Asp65 70 75 80Pro Tyr Leu Leu Pro Glu Glu His Pro Ser Ala His Glu Tyr Tyr Asp 85 90 95Pro Asn Asp Tyr Ile Gly Asp Ile His Gln Glu Met Asp Arg Glu Glu 100 105 110Leu Glu Leu Glu Glu Val Asp Leu Tyr Arg Met Asn Ser Gln Asp Lys 115 120 125Leu Gly Leu Thr Val Cys Tyr Arg Thr Asp Asp Glu Asp Asp

Ile Gly 130 135 140Ile Tyr Ile Ser Glu Ile Asp Pro Asn Ser Ile Ala Ala Lys Asp Gly145 150 155 160Arg Ile Arg Glu Gly Asp Arg Ile Ile Gln Ile Asn Gly Ile Glu Val 165 170 175Gln Asn Arg Glu Glu Ala Val Ala Leu Leu Thr Ser Glu Glu Asn Lys 180 185 190Asn Phe Ser Leu Leu Ile Ala Arg Pro Glu Leu Gln Leu Asp Glu Gly 195 200 205Trp Met Asp Asp Asp Arg Asn Asp Phe Leu Asp Asp Leu His Met Asp 210 215 220Met Leu Glu Glu Gln His His Gln Ala Met Gln Phe Thr Ala Ser Val225 230 235 240Leu Gln Gln Lys Lys His Asp Glu Asp Gly Gly Thr Thr Asp Thr Ala 245 250 255Thr Ile Leu Ser Asn Gln His Glu Lys Asp Ser Gly Val Gly Arg Thr 260 265 270Asp Glu Ser Thr Arg Asn Asp Glu Ser Ser Glu Gln Glu Ser Asn Gly 275 280 285Asp Asp Ala Thr Ala Ser Ser Asn Pro Leu Ala Gly Gln Arg Lys Leu 290 295 300Thr Cys Ser Gln Asp Thr Leu Gly Ser Gly Asp Leu Pro Phe Ser Asn305 310 315 320Glu Ser Phe Ile Ser Ala Asp Cys Thr Asp Ala Asp Tyr Leu Gly Ile 325 330 335Pro Val Asp Glu Cys Glu Arg Phe Arg Glu Leu Leu Glu Leu Lys Cys 340 345 350Gln Val Lys Ser Ala Thr Pro Tyr Gly Leu Tyr Tyr Pro Ser Gly Pro 355 360 365Leu Asp Ala Gly Lys Ser Asp Pro Glu Ser Val Asp Lys Glu Leu Glu 370 375 380Leu Leu Asn Glu Glu Leu Arg Ser Ile Glu Leu Glu Cys Leu Ser Ile385 390 395 400Val Arg Ala His Lys Met Gln Gln Leu Lys Glu Gln Tyr Arg Glu Ser 405 410 415Trp Met Leu His Asn Ser Gly Phe Arg 420 425237311DNAHomo sapiens 237gtagacgcct cggggcttaa gagaacgtga catggaagag acaggaaata tgtactcttg 60tacaggtatg cggcacacat gtggccccct ccctcccagc ctcctcacta aagattatta 120acctgcttat tctcctggtt cacctgaggc tgttttgaga gaatctcagt atctcagtct 180tcatttcata cgacattctt cgggctcctc ttgttcatct ttgcagactg aagtatttat 240gaaggacctt cagaaatctg attcctcctg atgaatcaaa gagtaaagaa gaatcaaaca 300tgattaacca g 3112384PRTHomo sapiens 238Met Ile Asn Gln123912DNAHomo sapiens 239atgattaacc ag 1224021DNAArtificial Sequenceforward primer used for specifically detecting the variant of the present invention 240gccttccttt gacgtttttc t 2124124DNAArtificial Sequencereverse primer used for specifically detecting the variant of the present invention 241cgttgacctg acaaacttca taga 24242101DNAArtificial Sequencepartial polynucleotide specific to the variant of the present invention, which is obtained by PCR using forward primer (SEQ ID NO240) and reverse primer (SEQ ID NO241) 242gccttccttt gacgtttttc taaacatggg atgcagtctg tgcagcctgc agaagcaaga 60ggagcagtac aaattactct atgaagtttg tcaggtcaac g 10124322DNAArtificial Sequenceforward primer used for specifically detecting the variant of the present invention 243gcttagatcg caatgaaact gc 2224422DNAArtificial Sequencereverse primer used for specifically detecting the variant of the present invention 244ccaaatacca tcttccacca tg 22245147DNAArtificial Sequencepartial polynucleotide specific to the variant of the present invention, which is obtained by PCR using forward primer (SEQ ID NO243) and reverse primer (SEQ ID NO244) 245gcttagatcg caatgaaact gcggacctgc aggctgcctt gcttcactga gtcaattttt 60aaatgaagat ggaacacaag aggatgatct tattgttcag taaatttact ctaatacata 120aaccccatgg tggaagatgg tatttgg 14724625DNAArtificial Sequenceforward primer used for specifically detecting the variant of the present invention 246ccaaatgttt aggatacacc tgaga 2524720DNAArtificial Sequencereverse primer used for specifically detecting the variant of the present invention 247ctgccgttga ccagggaaag 20248106DNAArtificial Sequencepartial polynucleotide specific to the variant of the present invention, which is obtained by PCR using forward primer (SEQ ID NO246) and reverse primer (SEQ ID NO247) 248ccaaatgttt aggatacacc tgagatgtct ctcttactga atagaaacaa catccctgga 60agctctagac ttctgaggca ccccagcttt ccctggtcaa cggcag 10624927DNAArtificial Sequenceforward primer used for specifically detecting the variant of the present invention 249cagtatctca gtcttcattt catacga 2725024DNAArtificial Sequencereverse primer used for specifically detecting the variant of the present invention 250ttgacctggt taatcatgtt tgat 24251150DNAArtificial Sequencepartial polynucleotide specific to the variant of the present invention, which is obtained by PCR using forward primer (SEQ ID NO249) and reverse primer (SEQ ID NO250) 251cagtatctca gtcttcattt catacgacat tcttcgggct cctcttgttc atctttgcag 60actgaagtat ttatgaagga ccttcagaaa tctgattcct cctgatgaat caaagagtaa 120agaagaatca aacatgatta accaggtcaa 15025219DNAArtificial Sequenceforward primer used for specifically detecting the known variant of the gene of the present invention 252gcaagggcga agaaaccaa 1925322DNAArtificial Sequencereverse primer used for specifically detecting the known variant of the gene of the present invention 253gatgatccat cgtggttatc ca 22254112DNAArtificial Sequencepartial polynucleotide specific to the known variant of the gene of the present invention, which is obtained by PCR using forward primer (SEQ ID NO252) and reverse primer (SEQ ID NO253) 254gcaagggcga agaaaccaaa agtctgactc ttgtcctgca tcgggactcc ggctccctgg 60gattcaatat tattggtggc cggccgagtg tggataacca cgatggatca tc 11225520DNAArtificial Sequenceforward primer used for detecting the variants of the gene of the present invention 255caccactccc catacaagca 2025620DNAArtificial Sequencereverse primer used for detecting the variants of the gene of the present invention 256catctggctt tgcgcgtact 20257111DNAArtificial Sequencepartial polynucleotide common to the variants of the gene of the present invention, which is obtained by PCR using forward primer (SEQ ID NO255) and reverse primer (SEQ ID NO256) 257caccactccc catacaagca cgcgcacatc ccggcgcacg cccagcacta ccagagctac 60atgcagctga tccagcagaa gtcggccgtg gagtacgcgc aaagccagat g 111258219DNAHomo sapiens 258gagagcagcg ccaatgtgaa gcgttgcagt cgcttgactc acctgaggct ctccaaggat 60accttcaatg cctgcactgt aagggagctg cttttcccgg gtgctggcga gaacggaagc 120cttcctttga cgtttttcta aacatgggat gcagtctgtg cagcctgcag aagcaagagg 180agcagtacaa attactctat gaagtttgtc aggtcaacg 219259204DNAHomo sapiens 259gtgaagcgtt gcagtcgctt gactcacctg aggctctcca aggatacctt caatgcctgc 60actgtaaggg agctgctttt cccgggtgct ggcgagaacg gaagccttcc tttgacgttt 120ttctaaacat gggatgcagt ctgtgcagcc tgcagaagca agaggagcag tacaaattac 180tctatgaagt ttgtcaggtc aacg 204260179DNAHomo sapiens 260agcatttgct cagtactctc aacataaacc aagcttagat cgcaatgaaa ctgcggacct 60gcaggctgcc ttgcttcact gagtcaattt ttaaatgaag atggaacaca agaggatgat 120cttattgttc agtaaattta ctctaataca taaaccccat ggtggaagat ggtatttgg 179261179DNAHomo sapiens 261agcatttgct cagtactctc aacataaacc aagcttagat cgcaatgaaa ctgcggacct 60gcaggctgcc ttgcttcact gagtcaattt ttaaatgaag atggaacaca agaggatgat 120cttattgttc agtaaattta ctctaataca taaaccccat ggtggaagat ggtatttgg 179262180DNAHomo sapiens 262atatataata gccttgcaag tgaacggata agcaaagtca gttgctgaga aataacactt 60ccgagctgac gcatccaaat gtttaggata cacctgagat gtctctctta ctgaatagaa 120acaacatccc tggaagctct agacttctga ggcaccccag ctttccctgg tcaacggcag 180263316DNAHomo sapiens 263gtagacgcct cggggcttaa gagaacgtga catggaagag acaggaaata tgtactcttg 60tacaggtatg cggcacacat gtggccccct ccctcccagc ctcctcacta aagattatta 120acctgcttat tctcctggtt cacctgaggc tgttttgaga gaatctcagt atctcagtct 180tcatttcata cgacattctt cgggctcctc ttgttcatct ttgcagactg aagtatttat 240gaaggacctt cagaaatctg attcctcctg atgaatcaaa gagtaaagaa gaatcaaaca 300tgattaacca ggtcaa 31626462PRTHomo sapiens 264Met Ser Asp Val Ala Ile Val Lys Glu Gly Trp Leu His Lys Arg Gly1 5 10 15Glu Tyr Ile Lys Thr Trp Arg Pro Arg Tyr Phe Leu Leu Lys Asn Asp 20 25 30Gly Thr Phe Ile Gly Tyr Lys Glu Arg Pro Gln Asp Val Asp Gln Arg 35 40 45Glu Ala Pro Leu Asn Asn Phe Ser Val Ala Gln Cys Gln Leu 50 55 60265172PRTHomo sapiens 265Met Pro Asn Pro Ser Ser Thr Ser Ser Pro Tyr Pro Leu Pro Glu Glu1 5 10 15Ile Arg Asn Leu Leu Ala Asp Val Glu Thr Phe Val Ala Asp Ile Leu 20 25 30Lys Gly Glu Asn Leu Ser Lys Lys Ala Lys Glu Lys Arg Glu Ser Leu 35 40 45Ile Lys Lys Ile Lys Asp Val Lys Ser Ile Tyr Leu Gln Glu Phe Gln 50 55 60Asp Lys Gly Asp Ala Glu Asp Gly Glu Glu Tyr Asp Asp Pro Phe Ala65 70 75 80Gly Pro Pro Asp Thr Ile Ser Leu Ala Ser Glu Arg Tyr Asp Lys Asp 85 90 95Asp Glu Ala Pro Ser Asp Gly Ala Gln Phe Pro Pro Ile Ala Ala Gln 100 105 110Asp Leu Pro Phe Val Leu Lys Ala Gly Tyr Leu Glu Lys Arg Arg Lys 115 120 125Asp His Ser Phe Leu Gly Phe Glu Trp Gln Lys Arg Trp Cys Ala Leu 130 135 140Ser Lys Thr Val Phe Tyr Tyr Tyr Gly Ser Asp Lys Asp Lys Gln Gln145 150 155 160Lys Gly Glu Phe Ala Ile Asp Gly Tyr Ser Val Arg 165 17026616PRTHomo sapiens 266Met Ser Asn Lys Cys Asp Val Val Val Val Gly Gly Gly Ile Ser Gly1 5 10 15

* * * * *

Biomarker Specific To Brain/nerve Or Specific To Neuronal Differentiation

WAKAMATSU; Ai ; et al.

References